BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 001232
         (1118 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|359473774|ref|XP_002266931.2| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            3-like [Vitis vinifera]
          Length = 1238

 Score = 1059 bits (2738), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 635/1174 (54%), Positives = 773/1174 (65%), Gaps = 117/1174 (9%)

Query: 1    MG-KDVEEGEISDTASVEEISEEDFKIKQEEVVKVVKETKPIKVGGGEAAARVWTMRDLY 59
            MG +DVEEGEISD+ASVEEISEEDF  KQE  V+V++E KP      +A  RVWTMRDL 
Sbjct: 10   MGIEDVEEGEISDSASVEEISEEDFN-KQE--VRVLREAKP------KADTRVWTMRDLQ 60

Query: 60   N--KYPAICRGYGPGLHNLAWAQAVQNKPLNEIFVMEAEQDDVSKRSSPASSVASVNSGA 117
            +  KY   C GY P L+NLAWAQAVQNKPLN+IFVM+ E+   S  SS  S   S ++  
Sbjct: 61   DLYKYHQACSGYTPRLYNLAWAQAVQNKPLNDIFVMDDEESKRSSSSSNTSRDDSSSA-- 118

Query: 118  AAGKDDKKVVEKVVIDDSGDEIEKEEGELEEGEIE------LDLESESNEK-------VS 164
                   K V KV+IDDSGDE++ +  ++ E E        +DL+SE + K       V+
Sbjct: 119  -------KEVAKVIIDDSGDEMDVKMDDVSEKEEGELEEGEIDLDSEPDVKDEGGVLDVN 171

Query: 165  E---QVKEEMKLINVESIREALESV--LRGDISFEGVCSKLEFTLESLREL-----VNEN 214
            E    +KE   +  V+SI+E LESV  +  + SF GVCS+L+ TL SL+++     V E+
Sbjct: 172  EPEIDLKERELVERVKSIQEDLESVTVIEAEKSFSGVCSRLQNTLGSLQKVFGEKVVGES 231

Query: 215  NVPTKDALIQLAFSAVQSVHSVFCSMNHVLKEQNKEILSRLLSLIKSHEPPLFSSNQIKE 274
            +VPTKDAL Q   +A+++++ VFCSMN   KE NK++ SRLLS ++  + P+FS   IKE
Sbjct: 232  SVPTKDALAQQLINAIRALNHVFCSMNSNQKELNKDVFSRLLSCVECGDSPIFSIQHIKE 291

Query: 275  MEAMLSSLVT-------RANDKEKDMLAMHGVNGKDSNIVTENAVNDLNFKEKVPL---P 324
            +E M+S L T        A+DK  D+    G+N    +   E++       +K+ L    
Sbjct: 292  VEVMMSFLDTPAAQSSAEASDKVNDVQVTDGMNRNILDSSVESSGRAFASAKKLSLDSIS 351

Query: 325  VDSLMQNKPLEASKPGPPGYRSRGVLLPLLDPHKVHDVDSLPSPTRETTPSVPVQRALVV 384
            V+S  QN P +A KPG    R R +  PLLD HK HD DSLPSPT +     PV ++ +V
Sbjct: 352  VESYNQNNP-DALKPGLSSSRGRFIFGPLLDLHKDHDEDSLPSPTGKAPQCFPVNKSELV 410

Query: 385  GDGMVKSWAAAAKLSHNAEVHKTPHYETDALRAFSSYQQKFGRNSFFMNSELPSPTPSEE 444
                       AK++H  +      YETDAL+A S+YQQKFG  SF    +LPSPTPSEE
Sbjct: 411  ----------TAKVAHETQDSIMHPYETDALKAVSTYQQKFGLTSFLPIDKLPSPTPSEE 460

Query: 445  SGDGDGDTGGEISSATAVDQPKPVNMPTLGQQPVSSQPMDISQPMDISSVQALTTANNSA 504
            SGD  GD  GE+SS++ +  P   N P LG   VSS P      MD S VQ  T   N++
Sbjct: 461  SGDTYGDISGEVSSSSTISAPITANAPALGHPIVSSAPQ-----MDSSIVQGPTVGRNTS 515

Query: 505  PASSGYNPVVKPNPVVKAPIKSRDPRLRFASSNA--LNLNHQPAPILHNAPKVEPVGRVM 562
              SSG  P +  + V  A  KSRDPRLR ASS+A  L+LN +P P + N+PKV+P+G ++
Sbjct: 516  LVSSG--PHLDSSVVASA--KSRDPRLRLASSDAGSLDLNERPLPAVSNSPKVDPLGEIV 571

Query: 563  SSRKQKTVEEPVLDGPALKRQRNGFENSGVVRDEKNIYGSGGWLEDTDMFEPQIMNRNLL 622
            SSRKQK+ EEP+LDGP  KRQRNG  +   VRD + +  SGGWLED++   PQ+MNRN L
Sbjct: 572  SSRKQKSAEEPLLDGPVTKRQRNGLTSPATVRDAQTVVASGGWLEDSNTVIPQMMNRNQL 631

Query: 623  VDSAESNSRKLDNGAT-SPITSGTPNVVVSGNEPAPATTPSTTVSLPALLKDIAVNPTML 681
            +++  ++ +KL++  T + I    P V V+GNE  P    STT SL +LLKDIAVNP + 
Sbjct: 632  IENTGTDPKKLESKVTVTGIGCDKPYVTVNGNEHLPVVATSTTASLQSLLKDIAVNPAVW 691

Query: 682  LNILKMGQQQKLAADAQQKSNDSSMNTMHPPIPSSI----PPVSVTCSIPSGILSKP--- 734
            +NI    +QQK        S D + NT+ PP  +SI    PP SV    PS +  KP   
Sbjct: 692  MNIFNKVEQQK--------SGDPAKNTVLPPTSNSILGVVPPASVAPLKPSALGQKPAGA 743

Query: 735  --------MDELGKVRMKPRDPRRVLHGNALQRSGSLGPE-FKTDGPSAPCTQGSKENLN 785
                    MDE GKVRMKPRDPRR+LH N+ QRSGS G E FKT               N
Sbjct: 744  LQVPQTGPMDESGKVRMKPRDPRRILHANSFQRSGSSGSEQFKT---------------N 788

Query: 786  FQKQLGAPEAKPVLSQSVLQPDITQQFTKNLKHIADFMSVSQPLTSEPMVSQNSPIQPGQ 845
             QKQ    E K V S SV  PDI+QQFTKNLK+IAD MS SQ  +  P   Q    Q  Q
Sbjct: 789  AQKQEDQTETKSVPSHSVNPPDISQQFTKNLKNIADLMSASQASSMTPTFPQILSSQSVQ 848

Query: 846  IKSG-ADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKAAIQKE 904
            + +   D+KA V++  D+ T  GS PE+       ++ WGDVEHLF+GYDDQQKAAIQ+E
Sbjct: 849  VNTDRMDVKATVSDSGDQLTANGSKPESAAGPPQSKNTWGDVEHLFDGYDDQQKAAIQRE 908

Query: 905  RTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLF 964
            R RR+EEQKKMFSARKLCLVLDLDHTLLNSAKF EVDPVHDEILRKKEEQDREK  RHLF
Sbjct: 909  RARRIEEQKKMFSARKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKSQRHLF 968

Query: 965  RFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISR 1024
            RFPHMGMWTKLRPGIW FLE+ASKL+E+HLYTMGNKLYATEMAKVLDPKGVLFAGRVIS+
Sbjct: 969  RFPHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISK 1028

Query: 1025 GDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRR 1084
            GDDGD  DGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRR
Sbjct: 1029 GDDGDVLDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRR 1088

Query: 1085 QFGLLGPSLLEIDHDERSEDGTLASSLGVRQQLH 1118
            QFGL GPSLLEIDHDER EDGTLASSL V +++H
Sbjct: 1089 QFGLPGPSLLEIDHDERPEDGTLASSLAVIERIH 1122


>gi|224053553|ref|XP_002297869.1| predicted protein [Populus trichocarpa]
 gi|222845127|gb|EEE82674.1| predicted protein [Populus trichocarpa]
          Length = 1117

 Score = 1006 bits (2601), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 605/1088 (55%), Positives = 722/1088 (66%), Gaps = 130/1088 (11%)

Query: 72   GLHNLAWAQAVQNKPLNEIFVMEAEQDDVSKRSSPASSVASVNSGAAAGKDDKKVVEKVV 131
            GL+NLAWA+AVQNKPLNE+                                       VV
Sbjct: 3    GLYNLAWARAVQNKPLNEL--------------------------------------TVV 24

Query: 132  IDDSGDE------IEKEEGELEEGEIELDLESE-----SNEKVSEQVKEEMKLINVESIR 180
            IDDSGDE      I+ E+ E E  E E+DL+SE     S   VS  V+  +K     SIR
Sbjct: 25   IDDSGDEMDVVKVIDIEKEEGELEEGEIDLDSEPVVVQSEGMVSVDVENRVK-----SIR 79

Query: 181  EALE--SVLRGDISFEGVCSKLEFTLESLRELV--NENNVPTKDALIQLAFSAVQSVHSV 236
            + LE  SV+  + SFE VC KL   LESL+ELV  N+N+ P+KD L+QL F A++ V+SV
Sbjct: 80   KDLESVSVIETEKSFEAVCLKLHKVLESLKELVGGNDNSFPSKDGLVQLLFMAIRVVNSV 139

Query: 237  FCSMNHVLKEQNKEILSRLLSLIKSHEPPLFSSNQIKEMEAMLSSLVTRANDKEKDMLAM 296
            FCSMN  LKEQNK + SR  SL+ SH PP FS  Q KE+           N+   D LA 
Sbjct: 140  FCSMNKKLKEQNKGVFSRFFSLLNSHYPPFFSPGQNKEV----------LNENHNDSLA- 188

Query: 297  HGVNGKDSNIVTENAVNDL-NFKEKVPLPVDSLMQNKP---LEASK-PGPPGYRSRGVLL 351
                        + A  DL    EK+P   ++ +QNKP   +EA K PG P ++SRGVLL
Sbjct: 189  ------------KTAGYDLTTMSEKLP-AAETFVQNKPNKSIEAPKPPGVPSFKSRGVLL 235

Query: 352  PLLDPHKVHDVDSLPSPTRETTPSVPVQRALVVGDGMVKSWAAAAKLSHNAEVHKTPHYE 411
            PLLD  K HD DSLPSPT+ETTP  PVQR L +GDGMV S     K++  AE  +   YE
Sbjct: 236  PLLDLKKYHDEDSLPSPTQETTP-FPVQRLLAIGDGMVSSGLPVPKVTPVAEEPRMHPYE 294

Query: 412  TDALRAFSSYQQKFGRNSFFMNSELPSPTPSEESGDGDGDTGGEISSATAVDQPKPVNMP 471
            TDAL+A SSYQQKF RNSFF N ELPSPTPSEESG+GDGDT GE+SS++ V   + VN P
Sbjct: 295  TDALKAVSSYQQKFNRNSFFTN-ELPSPTPSEESGNGDGDTAGEVSSSSTVVNYRTVNPP 353

Query: 472  TLGQQ---PVSSQPMDISQPMDISSVQALTTANNSAPASSGYNPVVKPNPVVKAPIKSRD 528
               Q+   P            D S+++ +    NSAP SSG      P+  +KA  KSRD
Sbjct: 354  VSDQKNAPPSPPPLPPPPPHPDSSNIRGVVPTRNSAPVSSG------PSSTIKASAKSRD 407

Query: 529  PRLRFASSNALNLNH--QPAPILHNAPKVEPVGRVMSSRKQKTVEEPVLDGPALKRQRNG 586
            PRLR+ + +A  L+H  +  P+++N P+VEP G ++ S+K K +EE VLD P+LKRQRN 
Sbjct: 408  PRLRYVNIDACALDHNQRALPMVNNLPRVEPAGAIVGSKKHK-IEEDVLDDPSLKRQRNS 466

Query: 587  FENSGVVRDEKNIYGSGGWLEDTDMFEPQIMNRNLLVDSAESNSRKLDNGATSPITSGTP 646
            F+N G VRD +++ G+GGWLEDTDM EPQ +N+N   +++  N       A SP   G  
Sbjct: 467  FDNYGAVRDIESMTGTGGWLEDTDMAEPQTVNKNQWAENSNVNG---SGNAQSPFM-GIS 522

Query: 647  NVVVSGNEPAPATTPSTTVSLPALLKDIAVNPTMLLNILKMGQQQKLAADAQQKSNDSSM 706
            N  ++G+E A  T+ +TT SLP LLKDIAVNPTML+NILKMGQQQ+LA D QQ  +D + 
Sbjct: 523  N--ITGSEQAQVTSTATT-SLPDLLKDIAVNPTMLINILKMGQQQRLALDGQQTLSDPAK 579

Query: 707  NTMHPPIPSS----IPPVSVTCSIPSGILSKPM-----------DELGKVRMKPRDPRRV 751
            +T HPPI ++    IP V+V  S PSGI  +P            DE GK+RMKPRDPRR 
Sbjct: 580  STSHPPISNTVLGAIPTVNVASSQPSGIFPRPAGTPVPSQIATSDESGKIRMKPRDPRRF 639

Query: 752  LHGNALQRSGSLGPEFKTDGPSAPCTQGSKENLNFQKQLGAPEAKPVLSQSVLQPDITQQ 811
            LH N+LQR+GS+G E        P TQG+K++ N QKQ G  E KP +      PDI+  
Sbjct: 640  LHNNSLQRAGSMGSEQFKTTTLTPTTQGTKDDQNVQKQEGLAELKPTVP-----PDISFP 694

Query: 812  FTKNLKHIADFMSVSQPLTSEPMVSQNSPIQPGQIKS-GADMKAVVTNHDDKQTGTGSGP 870
            FTK+L++IAD +SVSQ  T+ P +SQN   QP Q KS   D K  ++  D K TG  S P
Sbjct: 695  FTKSLENIADILSVSQASTTPPFISQNVASQPMQTKSERVDGKTGISISDQK-TGPASSP 753

Query: 871  EAGPVGAHPQSAWGDVEHLFEGYDDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHT 930
            E     +H Q+ W DVEHLFEGYDDQQKAAIQ+ER RRLEEQKKMF+ARKLCLVLDLDHT
Sbjct: 754  EVVAASSHSQNTWKDVEHLFEGYDDQQKAAIQRERARRLEEQKKMFAARKLCLVLDLDHT 813

Query: 931  LLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLF 990
            LLNSAK      +HDEILRKKEEQDREKP+RH+FR PHMGMWTKLRPGIW FLE+ASKLF
Sbjct: 814  LLNSAKAILSSSLHDEILRKKEEQDREKPYRHIFRIPHMGMWTKLRPGIWNFLEKASKLF 873

Query: 991  EMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGME 1050
            E+HLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGME
Sbjct: 874  ELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGME 933

Query: 1051 SAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLASS 1110
            S VVIIDDSVRVWPHNKLNLIVVERY YFPCSRRQFGL GPSLLEIDHDER EDGTLA S
Sbjct: 934  SGVVIIDDSVRVWPHNKLNLIVVERYIYFPCSRRQFGLPGPSLLEIDHDERPEDGTLACS 993

Query: 1111 LGVRQQLH 1118
              V +++H
Sbjct: 994  FAVIEKIH 1001


>gi|296088169|emb|CBI35661.3| unnamed protein product [Vitis vinifera]
          Length = 1184

 Score =  969 bits (2506), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 595/1166 (51%), Positives = 720/1166 (61%), Gaps = 195/1166 (16%)

Query: 1    MG-KDVEEGEISDTASVEEISEEDFKIKQEEVVKVVKETKPIKVGGGEAAARVWTMRDLY 59
            MG +DVEEGEISD+ASVEEISEEDF  KQE  V+V++E KP      +A  RVWTMRDL 
Sbjct: 50   MGIEDVEEGEISDSASVEEISEEDFN-KQE--VRVLREAKP------KADTRVWTMRDLQ 100

Query: 60   N--KYPAICRGYGPGLHNLAWAQAVQNKPLNEIFV--------MEAEQDDVSKRSSPASS 109
            +  KY   C GY P L+NLAWAQAVQNKPLN+IFV        M+ + DDVS++      
Sbjct: 101  DLYKYHQACSGYTPRLYNLAWAQAVQNKPLNDIFVIIDDSGDEMDVKMDDVSEKEEGELE 160

Query: 110  VASVNSGAAAGKDDKKVVEKVVIDDSGDEIEKEEGELEEGEIELDLESESNEKVSEQVKE 169
               ++                   DS  +++ E G L+  E E+DL            KE
Sbjct: 161  EGEIDL------------------DSEPDVKDEGGVLDVNEPEIDL------------KE 190

Query: 170  EMKLINVESIREALESV--LRGDISFEGVCSKLEFTLESLREL-----VNENNVPTKDAL 222
               +  V+SI+E LESV  +  + SF GVCS+L+ TL SL+++     V E++VPTKDAL
Sbjct: 191  RELVERVKSIQEDLESVTVIEAEKSFSGVCSRLQNTLGSLQKVFGEKVVGESSVPTKDAL 250

Query: 223  IQLAFSAVQSVHSVFCSMNHVLKEQNKEILSRLLSLIKSHEPPLFSSNQIKEMEAMLSSL 282
             Q   +A+++++ VFCSMN   KE NK++ SRLLS ++  + P+FS   IKE+E M+S L
Sbjct: 251  AQQLINAIRALNHVFCSMNSNQKELNKDVFSRLLSCVECGDSPIFSIQHIKEVEVMMSFL 310

Query: 283  VT-------RANDKEKDMLAMHGVNGKDSNIVTENAVNDLNFKEKVPLPVDSLMQNKPLE 335
             T        A+DK  D+    G+N    NI+  +              V+S    +   
Sbjct: 311  DTPAAQSSAEASDKVNDVQVTDGMN---RNILDSS--------------VES--SGRAFA 351

Query: 336  ASKPGPPGYRSRGVLLPLLDPHKVHDVDSLPSPTRETTPSVPVQRALVVGDGMVKSWAAA 395
            ++K     +R R +  PLLD HK HD DSLPSPT +     PV ++ +V           
Sbjct: 352  SAK----KFRGRFIFGPLLDLHKDHDEDSLPSPTGKAPQCFPVNKSELV----------T 397

Query: 396  AKLSHNAEVHKTPHYETDALRAFSSYQQKFGRNSFFMNSELPSPTPSEESGDGDGDTGGE 455
            AK++H  +      YETDAL+A S+YQQKFG  SF    +LPSPTPSEESGD  GD  GE
Sbjct: 398  AKVAHETQDSIMHPYETDALKAVSTYQQKFGLTSFLPIDKLPSPTPSEESGDTYGDISGE 457

Query: 456  ISSATAVDQPKPVNMPTLGQQPVSSQPMDISQPMDISSVQALTTANNSAPASSGYNPVVK 515
            +SS++ +  P   N P LG   VSS P      MDI  VQ L    N+   +S +N +++
Sbjct: 458  VSSSSTISAPITANAPALGHPIVSSAPQ-----MDI--VQGLVVPRNTGAVNSRFNSILR 510

Query: 516  PNPVVKAPIKSRDPRLRFASSNA--LNLNHQPAPILHNAPKVEPVGRVMSSRKQKTVEEP 573
                  A  KSRDPRLR ASS+A  L+LN +P P + N+PKV+P+G ++SSRKQK+ EEP
Sbjct: 511  ------ASAKSRDPRLRLASSDAGSLDLNERPLPAVSNSPKVDPLGEIVSSRKQKSAEEP 564

Query: 574  VLDGPALKRQRNGFENSGVVRDEKNIYGSGGWLEDTDMFEPQIMNRNLLVDSAESNSRKL 633
            +LDGP  KRQRNG                                         S + KL
Sbjct: 565  LLDGPVTKRQRNGLT---------------------------------------SPATKL 585

Query: 634  DNGAT-SPITSGTPNVVVSGNEPAPATTPSTTVSLPALLKDIAVNPTMLLNILKMGQQQK 692
            ++  T + I    P V V+GNE  P    STT SL +LLKDIAVNP + +NI    +QQK
Sbjct: 586  ESKVTVTGIGCDKPYVTVNGNEHLPVVATSTTASLQSLLKDIAVNPAVWMNIFNKVEQQK 645

Query: 693  LAADAQQKSNDSSMNTMHPPIPSSI----PPVSVTCSIPSGILSKP-------------- 734
                    S D + NT+ PP  +SI    PP SV    PS +  KP              
Sbjct: 646  --------SGDPAKNTVLPPTSNSILGVVPPASVAPLKPSALGQKPAGALQVPQTGPMNP 697

Query: 735  MDELGKVRMKPRDPRRVLHGNALQRSGSLGPE-FKTDGPSAPCTQGSKENLNFQKQLGAP 793
             DE GKVRMKPRDPRR+LH N+ QRSGS G E FKT               N QKQ    
Sbjct: 698  QDESGKVRMKPRDPRRILHANSFQRSGSSGSEQFKT---------------NAQKQEDQT 742

Query: 794  EAKPVLSQSVLQPDITQQFTKNLKHIADFMSVSQPLTSEPMVSQNSPIQPGQIKSG-ADM 852
            E K V S SV  PDI+QQFTKNLK+IAD MS SQ  +  P   Q    Q  Q+ +   D+
Sbjct: 743  ETKSVPSHSVNPPDISQQFTKNLKNIADLMSASQASSMTPTFPQILSSQSVQVNTDRMDV 802

Query: 853  KAVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKAAIQKERTRRLEEQ 912
            KA V++  D+ T  GS PE+       ++ WGDVEHLF+GYDDQQKAAIQ+ER RR+EEQ
Sbjct: 803  KATVSDSGDQLTANGSKPESAAGPPQSKNTWGDVEHLFDGYDDQQKAAIQRERARRIEEQ 862

Query: 913  KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMW 972
            KKMFSARKLCLVLDLDHTLLNSAKF EVDPVHDEILRKKEEQDREK  RHLFRFPHMGMW
Sbjct: 863  KKMFSARKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKSQRHLFRFPHMGMW 922

Query: 973  TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFD 1032
            TKLRPGIW FLE+ASKL+E+HLYTMGNKLYATEMAKVLDPKGVLFAGRVIS+GDDGD  D
Sbjct: 923  TKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISKGDDGDVLD 982

Query: 1033 GDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPS 1092
            GDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGL GPS
Sbjct: 983  GDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLPGPS 1042

Query: 1093 LLEIDHDERSEDGTLASSLGVRQQLH 1118
            LLEIDHDER EDGTLASSL V +++H
Sbjct: 1043 LLEIDHDERPEDGTLASSLAVIERIH 1068


>gi|356523718|ref|XP_003530482.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            3-like [Glycine max]
          Length = 1244

 Score =  947 bits (2447), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 574/1173 (48%), Positives = 730/1173 (62%), Gaps = 100/1173 (8%)

Query: 1    MGK---DVEEGEISDTASVEEISEEDFKIKQEEVVKVVKETKPIKVGGGEAAARVWTMRD 57
            MGK   DVEEGEISDTASVEEIS EDF  KQ+    V       K  G +A  RVW + D
Sbjct: 1    MGKEAEDVEEGEISDTASVEEISAEDFN-KQD----VKLLNNNNKPNGSDA--RVWAVHD 53

Query: 58   LYNKYPAICRGYGPGLHNLAWAQAVQNKPLNEIFVMEAEQD---DVSKRSSPASSVASVN 114
            LY+KYP ICRGY  GL+NLAWAQAVQNKPLN+IFVME + D   + ++ SS   +  +VN
Sbjct: 54   LYSKYPTICRGYASGLYNLAWAQAVQNKPLNDIFVMEVDSDANANSNRNSSHRLASVAVN 113

Query: 115  SGAAAGKDDKKVVEKVVIDDSGDEIEKEEGELEEGEIELDLESESNEKVSEQVKEEMKLI 174
                   D  K   ++   +   + E E GE E   + +  +SE  + V   V +  +L 
Sbjct: 114  PKDVVVVDVDKEEGELEEGEIDADAEPE-GEAESVVVAVS-DSEKLDDVKMDVSDSEQL- 170

Query: 175  NVESIREALESVLRGDI--SFEGVCSKLEFTLESLRELVNENNVPTKDALIQLAFSAVQS 232
                 R  LE V   ++  SF   CSKL+ TL    E+++      KD L++L+F+A + 
Sbjct: 171  ---GARGVLEGVTVANVVESFAQTCSKLQNTLP---EVLSRPAGSEKDDLVRLSFNATEV 224

Query: 233  VHSVFCSMNHVLKEQNKEILSRLLSLIK-SHEPPLFSSNQIKEMEAMLSSL-------VT 284
            V+SVFCSM+   KEQNK+ + RLLS +K   +  LFS   +KE++ M++++        +
Sbjct: 225  VYSVFCSMDSSEKEQNKDSILRLLSFVKDQQQAQLFSPEHVKEIQGMMTAIDSVGALVNS 284

Query: 285  RANDKEKDMLAMHGVNGKDSNIVTENAVNDLNFKEKVPLPVDSLMQ-NKPL--------E 335
             A  KEK++        ++S +  E  ++++  +E   +    L+  +KPL        +
Sbjct: 285  EAIGKEKELQTTEIKTQENSAV--EVQIHEIKTQENQAVEAAELISYSKPLHRDITGTSQ 342

Query: 336  ASKPGPPGYRSRGVLLPLLDPHKVHDVDSLPSPTRETTPSVPVQRALVVGDGMVKSWAAA 395
            A K G    + RGVLLPLLD HK HD DSLPSPTRE     PV + L VG+ MV+S +A+
Sbjct: 343  ALKFGQNSIKGRGVLLPLLDLHKDHDADSLPSPTREAPSCFPVNKLLSVGESMVRSGSAS 402

Query: 396  AKLSHNAEVHKTPHYETDALRAFSSYQQKFGRNSFFMNSELPSPTPSEESGDGDGDTGGE 455
            AK+  ++E  K   YETDAL+A S+YQQKFGR+S F N + PSPTPS +  D   DT  E
Sbjct: 403  AKMELDSEGSKFHLYETDALKAVSTYQQKFGRSSLFTNDKFPSPTPSGDCEDEVVDTNEE 462

Query: 456  ISSATAVDQPKPVNMPTLGQQPVSSQPMDISQPMDISSVQALTTANNSAPASSGYNPVVK 515
            +SSA+  D         L Q PVS+  MD S      S +   T   S P          
Sbjct: 463  VSSASTGDFLTSTKPTLLDQPPVSATSMDRSSMHGFISSRVDATGPGSFP---------- 512

Query: 516  PNPVVKAPIKSRDPRLRFASSNALNLNHQPAPILHNAPKVEPVGRVMSSRKQKTVEEPVL 575
                VK+  K+RDPRLRF +S+A  +++  + +++N  KVE  G  +S RKQK  EEP L
Sbjct: 513  ----VKSSAKNRDPRLRFINSDASAVDNL-STLINNMSKVEYSGTTIS-RKQKAAEEPSL 566

Query: 576  DGPALKRQRNGFENSGVVRDEKNI----YGSGGWLEDTDMFEPQIMNRNLLVDSAESNSR 631
            D    KR ++  EN+     E N+     GSGGWLE+      Q++ RN L+D     ++
Sbjct: 567  DVTVSKRLKSSLENT-----EHNMSEVRTGSGGWLEENTGPGAQLIERNHLMDKFGPEAK 621

Query: 632  KLDNGATSPIT-SGTPNVVVSGNEPAPATTPSTTVSLPALLKDIAVNPTMLLNILKMGQQ 690
            K  N  +S  T S   N     NE AP T  +   SLPALLK+ +VNP ML+NIL++ + 
Sbjct: 622  KTLNTVSSSCTGSDNFNATSIRNEQAPITASNVLASLPALLKEASVNPIMLVNILRLAEA 681

Query: 691  QKLAADAQQKSNDSSMNTMHP----PIPSSIPPVSVTCSIPSGILSKPM----------- 735
            QK +AD+      +++  +HP    P   +    S+  S+ +G+L   +           
Sbjct: 682  QKKSADS------AAIMLLHPTSSNPAMGTDSTASIGSSMATGLLQSSVGMLPVSSQSTS 735

Query: 736  ------DELGKVRMKPRDPRRVLH-GNALQRSGSLGPE-FKTDGPSAPCTQGSKENLNFQ 787
                  D+ GK+RMKPRDPRR+LH  N +Q+SG LG E FK         Q + +N+N  
Sbjct: 736  TAQTLQDDSGKIRMKPRDPRRILHTNNTIQKSGDLGNEQFKAIVSPVSNNQRTGDNVNAP 795

Query: 788  KQLGAPEAKPVLSQSVLQPDITQQFTKNLKHIADFMSVSQPLTSEPMVSQNSPIQPGQIK 847
            K  G  + K V +QS  QPDI +QFT+NLK+IAD MSVSQ  ++   VSQN       + 
Sbjct: 796  KLEGRVDNKLVPTQSSAQPDIARQFTRNLKNIADIMSVSQESSTHTPVSQNFSSASVPLT 855

Query: 848  SG-ADMKAVVTNHDDKQTGTGSGPE-AGPVGAHPQSAWGDVEHLFEGYDDQQKAAIQKER 905
            S   + K+VV++  + Q    S  E A  V +  QS WGDVEHLFEGYD+QQKAAIQ+ER
Sbjct: 856  SDRGEQKSVVSSSQNLQADMASAHETAASVTSRSQSTWGDVEHLFEGYDEQQKAAIQRER 915

Query: 906  TRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFR 965
             RR+EEQ KMF+ARKLCLVLDLDHTLLNSAKF EVDP+HDEILRKKEEQDREKPHRHLFR
Sbjct: 916  ARRIEEQNKMFAARKLCLVLDLDHTLLNSAKFVEVDPLHDEILRKKEEQDREKPHRHLFR 975

Query: 966  FPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG 1025
            FPHMGMWTKLRPGIW FLE+ASKL+E+HLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG
Sbjct: 976  FPHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG 1035

Query: 1026 DDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQ 1085
            DD D  DG+ERVPKSKDLEGVLGMES+VVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQ
Sbjct: 1036 DDTDSVDGEERVPKSKDLEGVLGMESSVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQ 1095

Query: 1086 FGLLGPSLLEIDHDERSEDGTLASSLGVRQQLH 1118
            FGL GPSLLEIDHDER E GTLASSL V +++H
Sbjct: 1096 FGLPGPSLLEIDHDERPEAGTLASSLAVIEKIH 1128


>gi|449487451|ref|XP_004157633.1| PREDICTED: LOW QUALITY PROTEIN: RNA polymerase II C-terminal domain
            phosphatase-like 3-like [Cucumis sativus]
          Length = 1249

 Score =  946 bits (2445), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 598/1185 (50%), Positives = 741/1185 (62%), Gaps = 130/1185 (10%)

Query: 3    KDVEEGEISDTASVEEISEEDF-KIKQEEVVKVVKETKPIKVGGGEAAARVWTMRDLYNK 61
            +DVEEGEISDTASVEEISEEDF K+      KVV  +K           RVWTM DLY  
Sbjct: 10   EDVEEGEISDTASVEEISEEDFNKLDSSASPKVVVPSK-----DSNRETRVWTMSDLYKN 64

Query: 62   YPAICRGYGPGLHNLAWAQAVQNKPLNEIFVMEAEQDDVSKRSSPASSVASVNSGAAAGK 121
            YPA+  GY  GL+NLAWAQAVQNKPLN+IFVMEA+ D+ SK SS      + + G+   K
Sbjct: 65   YPAMRHGYASGLYNLAWAQAVQNKPLNDIFVMEADLDEKSKHSSSTPFGNAKDDGSNTTK 124

Query: 122  DDKKVVEKVVIDDSGDEIEKEEGELEEGEIE-----LDLESESNEKVSE----------- 165
            ++    ++VVIDDSGDE+  +    E+ E E     +D+++E  E+V++           
Sbjct: 125  EE----DRVVIDDSGDEMNCDNANGEKEEGELEEGEIDMDTEFVEEVADSKAMLSDSRDM 180

Query: 166  ---------QVKEEMKLINVESIREALESVL--RGDISFEGVCSKLEFTLESLRELVNEN 214
                     + KE  +L+    I++ L+ V       SF+ VCS++  ++E+  EL+   
Sbjct: 181  DINGQEFDLETKELDELLKF--IQKTLDGVTIDAAQKSFQEVCSQIHSSIETFVELLQGK 238

Query: 215  NVPTKDALIQLAFSAVQSVHSVFCSMNHVLKEQNKEILSRLLSLIKSHEPPLFSSNQIKE 274
             VP KDALIQ  ++A++ ++SVFCSMN   KE++KE LSRLLS +K+ +PPLFS  QIK 
Sbjct: 239  VVPRKDALIQRLYAALRLINSVFCSMNLSEKEEHKEHLSRLLSYVKNCDPPLFSPEQIKS 298

Query: 275  MEAMLSSLVTRANDKEKDMLAMHGVNGKDSNIVTENAVNDLNF----------------- 317
            +E  + S      D    + +M G + K+  I   N V D++F                 
Sbjct: 299  VEVKMPS-----TDSLDHLPSMRG-SAKEVEIHIPNGVKDMDFYSAYTSTSSQLTPSNKL 352

Query: 318  -KEKVPLPVDSLMQNKPL-EASKPGPPGYRSRGVLLPLLDPHKVHDVDSLPSPTRETTPS 375
              + +P  V        L E  + G    + RG LLPLLD HK HD DSLPSPTRE    
Sbjct: 353  ASDSIPFGVKGKNNLNILSEGLQSGVSSIKGRGPLLPLLDLHKDHDADSLPSPTREAPTI 412

Query: 376  VPVQRALVVGDGMVKSWAAAAKLSHNAEVHKTPHYETDALRAFSSYQQKFGRNSFFMNSE 435
              VQ          KS  A  K++   +  ++  YETDAL+A S+YQQKFGR+SF M   
Sbjct: 413  FSVQ----------KSGNAPTKMAFPVDGSRSHPYETDALKAVSTYQQKFGRSSFSMADR 462

Query: 436  LPSPTPSEESGDGDGDTGGEISSATAVDQPKPVNMPTLGQQPVSSQPMD--ISQPMDISS 493
            LPSPTPSEE  DG GD GGE+SS++ +   K  N+   GQ+  S+  +   +   MD SS
Sbjct: 463  LPSPTPSEEH-DGGGDIGGEVSSSSIIRSLKSSNVSKPGQKSNSASNVSTGLFPNMDSSS 521

Query: 494  VQALTTANNSAPASSGYNPVVKPNPVVKAPIKSRDPRLRFASSNA--LNLNHQPAPILHN 551
             + L +  N AP SS  NP VKP        KSRDPRLR  +S+A  ++LN +    + +
Sbjct: 522  TRVLISPLNVAPPSSVSNPTVKP------LAKSRDPRLRIVNSDASGMDLNPRTMASVQS 575

Query: 552  APKVEPVGRVMSSRKQKTVEEPVLDGPALKRQRNGFENSGVV-RDEKNIYGSGGWLEDTD 610
            +  +E     +  RKQK   EP  DGP +KR R G +N  V   D + + GSGGWLEDT 
Sbjct: 576  SSILESAA-TLHLRKQKMDGEPNTDGPEVKRLRIGSQNLAVAASDVRAVSGSGGWLEDTM 634

Query: 611  MFEPQIMNRNLLVDSAESNSRKLDNGATSPITSGTPNVVVSGNEPAPATTPSTTVSLPAL 670
               P++ NRN + + AE+N+ +          S   N   SGNE  P    S   SLP+L
Sbjct: 635  PAGPRLFNRNQM-EIAEANATE---------KSNVTNNSGSGNECTPTVNNSNDASLPSL 684

Query: 671  LKDIAVNPTMLLNILKMGQQQKLAADAQQKSNDSSMNTMHP----PIPSSIPPVSVTCSI 726
            LKDI VNPTMLLN+LKM QQQ+LAA+ + KS++   N + P    P   S P ++   + 
Sbjct: 685  LKDIVVNPTMLLNLLKMSQQQQLAAELKLKSSEPEKNAICPTSLNPCQGSSPLINAPVAT 744

Query: 727  PSGILSKP------------MDELGKVRMKPRDPRRVLHGNALQRSGSLG-PEFKTDGPS 773
             SGIL +              D+LGKVRMKPRDPRRVLHGN+LQ+ GSLG  + K   P+
Sbjct: 745  -SGILQQSAGTPSASPVVGRQDDLGKVRMKPRDPRRVLHGNSLQKVGSLGNDQLKGVVPT 803

Query: 774  APCTQGSKENLNFQKQLGAPEAKPVLSQSVLQPDITQQFTKNLKHIADFMSVSQPLTSEP 833
            A  T+GS++  N  KQ G  ++K   SQ++L PDI +QFT NLK+IAD MSV  P TS P
Sbjct: 804  ASNTEGSRDIPNGHKQEGQGDSKLASSQTIL-PDIGRQFTNNLKNIADIMSVPSPPTSSP 862

Query: 834  MVSQNSPIQPGQIKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGY 893
              S           S  D K V T          S           Q AWGD+EHLF+ Y
Sbjct: 863  NSSSKP-----VGSSSMDSKPVTTAFQAVDMAASS---------RSQGAWGDLEHLFDSY 908

Query: 894  DDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEE 953
            DD+QKAAIQ+ER RR+EEQKKMF+ARKLCLVLDLDHTLLNSAKF EVDPVHDEILRKKEE
Sbjct: 909  DDKQKAAIQRERARRIEEQKKMFAARKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEE 968

Query: 954  QDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
            QDREK  RHLFRFPHMGMWTKLRPG+W FLE+AS+L+E+HLYTMGNKLYATEMAKVLDPK
Sbjct: 969  QDREKAQRHLFRFPHMGMWTKLRPGVWNFLEKASELYELHLYTMGNKLYATEMAKVLDPK 1028

Query: 1014 GVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVV 1073
            GVLFAGRVISRGDDGDP DGD+RVPKSKDLEGVLGMES VVIIDDS+RVWPHNK+NLIVV
Sbjct: 1029 GVLFAGRVISRGDDGDPLDGDDRVPKSKDLEGVLGMESGVVIIDDSIRVWPHNKMNLIVV 1088

Query: 1074 ERYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVRQQLH 1118
            ERYTYFPCSRRQFGLLGPSLLEIDHDER EDGTLASSLGV Q++H
Sbjct: 1089 ERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASSLGVIQRIH 1133


>gi|449445782|ref|XP_004140651.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            3-like [Cucumis sativus]
          Length = 1249

 Score =  946 bits (2445), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 598/1185 (50%), Positives = 741/1185 (62%), Gaps = 130/1185 (10%)

Query: 3    KDVEEGEISDTASVEEISEEDF-KIKQEEVVKVVKETKPIKVGGGEAAARVWTMRDLYNK 61
            +DVEEGEISDTASVEEISEEDF K+      KVV  +K           RVWTM DLY  
Sbjct: 10   EDVEEGEISDTASVEEISEEDFNKLDSSASPKVVVPSK-----DSNRETRVWTMSDLYKN 64

Query: 62   YPAICRGYGPGLHNLAWAQAVQNKPLNEIFVMEAEQDDVSKRSSPASSVASVNSGAAAGK 121
            YPA+  GY  GL+NLAWAQAVQNKPLN+IFVMEA+ D+ SK SS      + + G+   K
Sbjct: 65   YPAMRHGYASGLYNLAWAQAVQNKPLNDIFVMEADLDEKSKHSSSTPFGNAKDDGSNTTK 124

Query: 122  DDKKVVEKVVIDDSGDEIEKEEGELEEGEIE-----LDLESESNEKVSE----------- 165
            ++    ++VVIDDSGDE+  +    E+ E E     +D+++E  E+V++           
Sbjct: 125  EE----DRVVIDDSGDEMNCDNANGEKEEGELEEGEIDMDTEFVEEVADSKAMLSDSRDM 180

Query: 166  ---------QVKEEMKLINVESIREALESVL--RGDISFEGVCSKLEFTLESLRELVNEN 214
                     + KE  +L+    I++ L+ V       SF+ VCS++  ++E+  EL+   
Sbjct: 181  DINGQEFDLETKELDELLKF--IQKTLDGVTIDAAQKSFQEVCSQIHSSIETFVELLQGK 238

Query: 215  NVPTKDALIQLAFSAVQSVHSVFCSMNHVLKEQNKEILSRLLSLIKSHEPPLFSSNQIKE 274
             VP KDALIQ  ++A++ ++SVFCSMN   KE++KE LSRLLS +K+ +PPLFS  QIK 
Sbjct: 239  VVPRKDALIQRLYAALRLINSVFCSMNLSEKEEHKEHLSRLLSYVKNCDPPLFSPEQIKS 298

Query: 275  MEAMLSSLVTRANDKEKDMLAMHGVNGKDSNIVTENAVNDLNF----------------- 317
            +E  + S      D    + +M G + K+  I   N V D++F                 
Sbjct: 299  VEVKMPS-----TDSLDHLPSMRG-SAKEVEIHIPNGVKDMDFYSAYTSTSSQLTPSNKL 352

Query: 318  -KEKVPLPVDSLMQNKPL-EASKPGPPGYRSRGVLLPLLDPHKVHDVDSLPSPTRETTPS 375
              + +P  V        L E  + G    + RG LLPLLD HK HD DSLPSPTRE    
Sbjct: 353  ASDSIPFGVKGKNNLNILSEGLQSGVSSIKGRGPLLPLLDLHKDHDADSLPSPTREAPTI 412

Query: 376  VPVQRALVVGDGMVKSWAAAAKLSHNAEVHKTPHYETDALRAFSSYQQKFGRNSFFMNSE 435
              VQ          KS  A  K++   +  ++  YETDAL+A S+YQQKFGR+SF M   
Sbjct: 413  FSVQ----------KSGNAPTKMAFPVDGSRSHPYETDALKAVSTYQQKFGRSSFSMADR 462

Query: 436  LPSPTPSEESGDGDGDTGGEISSATAVDQPKPVNMPTLGQQPVSSQPMD--ISQPMDISS 493
            LPSPTPSEE  DG GD GGE+SS++ +   K  N+   GQ+  S+  +   +   MD SS
Sbjct: 463  LPSPTPSEEH-DGGGDIGGEVSSSSIIRSLKSSNVSKPGQKSNSASNVSTGLFPNMDSSS 521

Query: 494  VQALTTANNSAPASSGYNPVVKPNPVVKAPIKSRDPRLRFASSNA--LNLNHQPAPILHN 551
             + L +  N AP SS  NP VKP        KSRDPRLR  +S+A  ++LN +    + +
Sbjct: 522  TRVLISPLNVAPPSSVSNPTVKP------LAKSRDPRLRIVNSDASGMDLNPRTMASVQS 575

Query: 552  APKVEPVGRVMSSRKQKTVEEPVLDGPALKRQRNGFENSGVV-RDEKNIYGSGGWLEDTD 610
            +  +E     +  RKQK   EP  DGP +KR R G +N  V   D + + GSGGWLEDT 
Sbjct: 576  SSILESAA-TLHLRKQKMDGEPNTDGPEVKRLRIGSQNLAVAASDVRAVSGSGGWLEDTM 634

Query: 611  MFEPQIMNRNLLVDSAESNSRKLDNGATSPITSGTPNVVVSGNEPAPATTPSTTVSLPAL 670
               P++ NRN + + AE+N+ +          S   N   SGNE  P    S   SLP+L
Sbjct: 635  PAGPRLFNRNQM-EIAEANATE---------KSNVTNNSGSGNECTPTVNNSNDASLPSL 684

Query: 671  LKDIAVNPTMLLNILKMGQQQKLAADAQQKSNDSSMNTMHP----PIPSSIPPVSVTCSI 726
            LKDI VNPTMLLN+LKM QQQ+LAA+ + KS++   N + P    P   S P ++   + 
Sbjct: 685  LKDIVVNPTMLLNLLKMSQQQQLAAELKLKSSEPEKNAICPTSLNPCQGSSPLINAPVAT 744

Query: 727  PSGILSKP------------MDELGKVRMKPRDPRRVLHGNALQRSGSLG-PEFKTDGPS 773
             SGIL +              D+LGKVRMKPRDPRRVLHGN+LQ+ GSLG  + K   P+
Sbjct: 745  -SGILQQSAGTPSASPVVGRQDDLGKVRMKPRDPRRVLHGNSLQKVGSLGNDQLKGVVPT 803

Query: 774  APCTQGSKENLNFQKQLGAPEAKPVLSQSVLQPDITQQFTKNLKHIADFMSVSQPLTSEP 833
            A  T+GS++  N  KQ G  ++K   SQ++L PDI +QFT NLK+IAD MSV  P TS P
Sbjct: 804  ASNTEGSRDIPNGHKQEGQGDSKLASSQTIL-PDIGRQFTNNLKNIADIMSVPSPPTSSP 862

Query: 834  MVSQNSPIQPGQIKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGY 893
              S           S  D K V T          S           Q AWGD+EHLF+ Y
Sbjct: 863  NSSSKP-----VGSSSMDSKPVTTAFQAVDMAASS---------RSQGAWGDLEHLFDSY 908

Query: 894  DDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEE 953
            DD+QKAAIQ+ER RR+EEQKKMF+ARKLCLVLDLDHTLLNSAKF EVDPVHDEILRKKEE
Sbjct: 909  DDKQKAAIQRERARRIEEQKKMFAARKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEE 968

Query: 954  QDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
            QDREK  RHLFRFPHMGMWTKLRPG+W FLE+AS+L+E+HLYTMGNKLYATEMAKVLDPK
Sbjct: 969  QDREKAQRHLFRFPHMGMWTKLRPGVWNFLEKASELYELHLYTMGNKLYATEMAKVLDPK 1028

Query: 1014 GVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVV 1073
            GVLFAGRVISRGDDGDP DGD+RVPKSKDLEGVLGMES VVIIDDS+RVWPHNK+NLIVV
Sbjct: 1029 GVLFAGRVISRGDDGDPLDGDDRVPKSKDLEGVLGMESGVVIIDDSIRVWPHNKMNLIVV 1088

Query: 1074 ERYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVRQQLH 1118
            ERYTYFPCSRRQFGLLGPSLLEIDHDER EDGTLASSLGV Q++H
Sbjct: 1089 ERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASSLGVIQRIH 1133


>gi|356567192|ref|XP_003551805.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            3-like [Glycine max]
          Length = 1221

 Score =  919 bits (2376), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 573/1165 (49%), Positives = 719/1165 (61%), Gaps = 107/1165 (9%)

Query: 1    MGK---DVEEGEISDTASVEEISEEDFKIKQEEVVKVVKETKPIKVGGGEAAARVWTMRD 57
            MGK   DVEEGEISDTASVEEIS EDF  KQ+  VKV+      K  G +A  RVW + D
Sbjct: 1    MGKEVEDVEEGEISDTASVEEISAEDFN-KQD--VKVLNNNN--KPNGSDA--RVWAVHD 53

Query: 58   LYNKYPAICRGYGPGLHNLAWAQAVQNKPLNEIFVMEAEQDDVSKRSSPA----SSVASV 113
            LY+KYP ICRGY  GL+NLAWAQAVQNKPLN+IFVME + D  +  +S      +SVA V
Sbjct: 54   LYSKYPTICRGYASGLYNLAWAQAVQNKPLNDIFVMEVDSDANANSNSNNSNRLASVA-V 112

Query: 114  NSGAAAGKDDKKVVEKVVIDDSGDEIEKEEGELEEG-EIELDLESESNEKVSEQVKEEMK 172
            N       D  K   ++   +   + E E GE E    + +  +SE  + V   V    +
Sbjct: 113  NPKDVVVVDVDKEEGELEEGEIDADAEPE-GEAESVVAVPVVSDSEKLDDVKRDVSNSEQ 171

Query: 173  LINVESIREALESVLRGDI--SFEGVCSKLEFTLESLRELVNENNVPTKDALIQLAFSAV 230
            L     +R  LE V   ++  SF   CSKL+    +L E+++      +D L++L+F+A 
Sbjct: 172  L----GVRGVLEGVTVANVAESFAQTCSKLQ---NALPEVLSRPADSERDDLVRLSFNAT 224

Query: 231  QSVHSVFCSMNHVLKEQNKEILSRLLSLIK-SHEPPLFSSNQIKEMEAMLSSLVTRANDK 289
            + V+SVFCSM+ + KEQNK+ + RLLS +K   +  LFS   IKE++ M++++     D 
Sbjct: 225  EVVYSVFCSMDSLKKEQNKDSILRLLSFVKDQQQAQLFSPEHIKEIQGMMTAI-----DY 279

Query: 290  EKDMLAMHGVNGKDSNIVTENAVNDLNFKEKVPLPVDSLMQ-NKPLE--------ASKPG 340
               ++    + GK+  + T    +++  +E   +    L+  NKPL         A K G
Sbjct: 280  FGALVNSEAI-GKEKELQTTVQTHEIKTQENQAVEAAELISYNKPLHSDIIGASHALKFG 338

Query: 341  PPGYRSRGVLLPLLDPHKVHDVDSLPSPTRETTPSVPVQRALVVGDGMVKSWAAAAKLSH 400
                + RGVLLPLLD HK HD DSLPSPTRE     PV + L    G         K+  
Sbjct: 339  QNSIKGRGVLLPLLDLHKDHDADSLPSPTREAPSCFPVNKLLSPESG---------KMEL 389

Query: 401  NAEVHKTPHYETDALRAFSSYQQKFGRNSFFMNSELPSPTPSEESGDGDGDTGGEISSAT 460
            ++E  K   YETDAL+A S+YQQKFGR+S F N + PSPTPS +  D   DT  E+SSA+
Sbjct: 390  DSEGSKFHLYETDALKAVSTYQQKFGRSSLFTNDKFPSPTPSGDCEDEIVDTNEEVSSAS 449

Query: 461  AVDQPKPVNMPTLGQQPVSSQPMDISQPMDISSVQALTTANNSAPASSGYNPVVKPNPVV 520
              D         L   PVS+   D S      S +       S P              V
Sbjct: 450  TGDFLTSTKPTLLDLPPVSATSTDRSSLHGFISSRVDAAGPGSLP--------------V 495

Query: 521  KAPIKSRDPRLRFASSNALNLNHQPAPILHNAPKVEPVGRVMSSRKQKTVEEPVLDGPAL 580
            K+  K+RDPRLRF +S+A  +++ P+ ++HN PKVE  G  +S RKQK  EEP LD    
Sbjct: 496  KSSAKNRDPRLRFVNSDASAVDN-PSTLIHNMPKVEYAGTTIS-RKQKAAEEPSLDVTVS 553

Query: 581  KRQRNGFENSGVVRDEKNI----YGSGGWLEDTDMFEPQIMNRNLLVDSAESNSRKLDNG 636
            KRQ++  EN+     E N+     G GGWLE+      Q + RN L+D      +K  N 
Sbjct: 554  KRQKSPLENT-----EHNMSEVRTGIGGWLEEHTGPGAQFIERNHLMDKFGPEPQKTLNT 608

Query: 637  ATSPIT-SGTPNVVVSGNEPAPATTPSTTVSLPALLKDIAVNPTMLLNILKMGQQQKLAA 695
             +S  T S   N     NE AP T+ +   SLPALLK  AVNPTML+N+L++       A
Sbjct: 609  VSSSCTGSDNFNATSIRNEQAPITSSNVLASLPALLKGAAVNPTMLVNLLRI-------A 661

Query: 696  DAQQKSNDSSMNTM-HPPIPSSI----PPVSVTCSIPSGILSKPM------------DEL 738
            +AQ+KS DS+ N + HP   +S        S+  S+ +G+L   +            D+ 
Sbjct: 662  EAQKKSADSATNMLLHPTSSNSAMGTDSTASIGSSMATGLLQSSVGMLPVSSQQTLQDDS 721

Query: 739  GKVRMKPRDPRRVLH-GNALQRSGSLGPE-FKTDGPSAPCTQGSKENLNFQKQLGAPEAK 796
            GK+RMKPRDPRR+LH  N +Q+SG+LG E FK         QG+ +N+N QK  G  ++K
Sbjct: 722  GKIRMKPRDPRRILHTNNTIQKSGNLGNEQFKAIVSPVSNNQGTGDNVNAQKLEGRVDSK 781

Query: 797  PVLSQSVLQPDITQQFTKNLKHIADFMSVSQPLTSEPMVSQ--NSPIQPGQIKSGADMKA 854
             V +Q   QPDI +QF +NLK+IAD MSVSQ  ++   V+Q  +S   P     G + K+
Sbjct: 782  LVPTQPSAQPDIARQFARNLKNIADIMSVSQESSTHTPVAQIFSSASVPLTSDRG-EQKS 840

Query: 855  VVTNHDDKQTGTGSGPEAGPVG-AHPQSAWGDVEHLFEGYDDQQKAAIQKERTRRLEEQK 913
            VV+N  + + G  S  E    G    Q+ WGDVEHLFEGYD+QQKAAIQ+ER RR+EEQ 
Sbjct: 841  VVSNSQNLEAGMVSAHETAASGTCRSQNTWGDVEHLFEGYDEQQKAAIQRERARRIEEQN 900

Query: 914  KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWT 973
            KMF+ARKLCLVLDLDHTLLNSAKF EVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWT
Sbjct: 901  KMFAARKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWT 960

Query: 974  KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDG 1033
            KLRPGIW FLE+ASKL+E+HLYTMGNKLYATEMAKVLDPKG+LFAGRVISRGDD D  DG
Sbjct: 961  KLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGLLFAGRVISRGDDTDSVDG 1020

Query: 1034 DERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSL 1093
            +ER PKSKDLEGVLGMES+VVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGL GPSL
Sbjct: 1021 EERAPKSKDLEGVLGMESSVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLPGPSL 1080

Query: 1094 LEIDHDERSEDGTLASSLGVRQQLH 1118
            LEIDHDER E GTLASSL V +++H
Sbjct: 1081 LEIDHDERPEAGTLASSLAVIEKIH 1105


>gi|255543174|ref|XP_002512650.1| RNA polymerase II ctd phosphatase, putative [Ricinus communis]
 gi|223548611|gb|EEF50102.1| RNA polymerase II ctd phosphatase, putative [Ricinus communis]
          Length = 1195

 Score =  908 bits (2346), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 506/866 (58%), Positives = 609/866 (70%), Gaps = 70/866 (8%)

Query: 288  DKEKDMLAMHGVNGKDSNIVTENAVNDLNFKEKVPLPVDSLMQNKP---LEASKPGPPGY 344
            +KEK+ L    VN KD+++  +++ +D++   K  LP DS + NK    +E  K G   +
Sbjct: 249  EKEKEPLISTVVNKKDNDVNGKSSGHDMSAVNK--LPTDSFVNNKANLSIEGPKTGVSSF 306

Query: 345  RSRGVLLPLLDPHKVHDVDSLPSPTRETTPSVPVQRALVVGDGMVKSWAAAAKLSHNAEV 404
            +SR  LLPLLD HK HD DSLPSPTRE+   +P  R L     MV         + N+ +
Sbjct: 307  KSRAALLPLLDLHKDHDADSLPSPTRESALPLPAYRVLT--PKMVLD-------TGNSRM 357

Query: 405  HKTPHYETDALRAFSSYQQKFGRNSFFMNSELPSPTPSEESGDGDGDTGGEISSATAVDQ 464
            H    YETDAL+A SSYQQKF ++SF +   LPSPTPSEESG+GDGDTGGE+SS+ +V  
Sbjct: 358  HP---YETDALKAVSSYQQKFSKSSFALTDRLPSPTPSEESGNGDGDTGGEVSSSLSVSS 414

Query: 465  PKPVNMPTLGQQPVSSQPMDISQP-MDISSVQALTTANNSAPASSGYNPVVKPNPVVKAP 523
             +P N  T GQ   S     IS P MD SS+  + +  ++  ASS       P+  VKA 
Sbjct: 415  FRPANPLTSGQSNAS-----ISLPRMDGSSLPGVISIKSAVRASSA------PSLTVKAS 463

Query: 524  IKSRDPRLRFASS--NALNLNHQPAPILHNAPKVEPVGRVMSSRKQKTVEEPVLDGPALK 581
             KSRDPRLRF +S  NAL+ NH+  P++ N  KVEP+G  M+ ++QK V++P+ DG +LK
Sbjct: 464  AKSRDPRLRFVNSDSNALDQNHRAVPVV-NTLKVEPIGGTMNKKRQKIVDDPIPDGHSLK 522

Query: 582  RQRNGFENSGVVRDEKNIYGSGGWLEDTDMFEPQIMNRNLLVDSAESNSRKLDNGATSPI 641
            RQ+N  ENSGVVRD K + GSGGWLEDTDM  PQ MN+N LVD+AES+ R+ D G     
Sbjct: 523  RQKNALENSGVVRDVKTMVGSGGWLEDTDMVGPQTMNKNQLVDNAESDPRRKDGGGVCTS 582

Query: 642  TSGTPNVVVSGNEPAPATTPS------------TTVSLPALLKDIAVNPTMLLNILKMGQ 689
            +S   +V +SG E  P T  S            +T ++P LLK+IAVNPTML+NILKMGQ
Sbjct: 583  SSCISSVNISGTEQIPVTGTSVPIGGELVPVKGSTAAIPDLLKNIAVNPTMLINILKMGQ 642

Query: 690  QQKLAADAQQKSNDSSMNTMHPPIPSS-IPPVSVTCSIPSGILSKPM------------D 736
            QQ+LA +AQQK  D + +T +P   +S +  V V  +  SGIL +P             D
Sbjct: 643  QQRLALEAQQKPVDPAKSTTYPLNSNSMLGTVPVVGAAHSGILPRPAGTVQVSPQLGTAD 702

Query: 737  ELGKVRMKPRDPRRVLHGNALQRSGSLGPE-FKTDGPSAPCTQGSKENLNFQKQLGAPEA 795
            +LGK+RMKPRDPRRVLH NALQR+GS+G E  KT+  S P  Q +K+N N QKQ G  E 
Sbjct: 703  DLGKIRMKPRDPRRVLHNNALQRNGSMGSEHLKTNLTSIPINQETKDNQNLQKQEGQVEK 762

Query: 796  KPVLSQSVLQPDITQQFTKNLKHIADFMSVSQPLTSEPMVSQNSPIQPGQIKSGADMKAV 855
            KPV  QS+  PDI+  FTKNLK+IAD +SVS   TS+P+V QN   QP        M+  
Sbjct: 763  KPVPLQSLALPDISMPFTKNLKNIADIVSVSHASTSQPLVPQNPASQP--------MRTT 814

Query: 856  VTNHDDKQTGTGSGPEAGPVGA---HPQSAWGDVEHLFEGYDDQQKAAIQKERTRRLEEQ 912
            +++  D+  G GS P A    A     Q+AWGDVEHLFEGY+DQQKAAIQ+ER RR+EEQ
Sbjct: 815  ISS-SDQFLGIGSAPGAAAAAAAGPRTQNAWGDVEHLFEGYNDQQKAAIQRERARRIEEQ 873

Query: 913  KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMW 972
            KK+FSARKLCLVLDLDHTLLNSAKF EVDPVHDEILRKKEEQDREK HRHLFRFPHMGMW
Sbjct: 874  KKLFSARKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKAHRHLFRFPHMGMW 933

Query: 973  TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFD 1032
            TKLRPGIW FLE+ASKL+E+HLYTMGNKLYATEMAKVLDP GVLF GRVISRGDDG+PFD
Sbjct: 934  TKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPTGVLFNGRVISRGDDGEPFD 993

Query: 1033 GDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPS 1092
            GDER+PKSKDLEGVLGMES VVI+DDSVRVWPHNKLNLIVVERY YFPCSRRQFGL GPS
Sbjct: 994  GDERIPKSKDLEGVLGMESGVVIMDDSVRVWPHNKLNLIVVERYIYFPCSRRQFGLPGPS 1053

Query: 1093 LLEIDHDERSEDGTLASSLGVRQQLH 1118
            LLEIDHDER EDGTLA SL V +++H
Sbjct: 1054 LLEIDHDERPEDGTLACSLAVIERIH 1079



 Score = 88.2 bits (217), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 95/231 (41%), Positives = 122/231 (52%), Gaps = 43/231 (18%)

Query: 4   DVEEGEISDTASVEEISEEDFKIKQEEVVK----VVKETKPIKVGGGEAAARVWTMRDLY 59
           DVEEGEISDTAS+EEISEEDF  +   VVK      + TK  + G G    RVWT+ DLY
Sbjct: 14  DVEEGEISDTASIEEISEEDFNKQDVVVVKPPSSNNETTKQKEQGNGN--GRVWTISDLY 71

Query: 60  NKYPAICRGYGPGLHNLAWAQAVQ------NKPLNEIFVMEAEQDDVSKRSSPASSVASV 113
            +Y  +  G+  GL+NLAWAQAVQ      NKPLNE+F    E+ D S + S  SS A+ 
Sbjct: 72  -RYQMVG-GHVSGLYNLAWAQAVQSKPGKSNKPLNELFADVVEELDESSKRSSPSSSAAS 129

Query: 114 NSGAAAGKD--DKKVVEKVVIDDSGDEIEKEEGELEEGEI-----------ELDLESESN 160
            +      D   KKVVEKVVIDD+GDE+  +    +  ++           E+DL+ E  
Sbjct: 130 VNSNNKDGDEEKKKVVEKVVIDDNGDEMMDDNNRNKIVDVVEKEEGELEEGEIDLDMEPG 189

Query: 161 EKVSEQVKEEMKLINVE-------------SIREALESVLRGDISFEGVCS 198
           EK +      M +  +E             SIR+ALESV    I F   C+
Sbjct: 190 EKANNGDVLNMNIDGLEVESGEKGFEKKMNSIRDALESVT---IEFVLACT 237


>gi|357502711|ref|XP_003621644.1| RNA polymerase II C-terminal domain phosphatase-like protein
            [Medicago truncatula]
 gi|355496659|gb|AES77862.1| RNA polymerase II C-terminal domain phosphatase-like protein
            [Medicago truncatula]
          Length = 1213

 Score =  852 bits (2200), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 540/1173 (46%), Positives = 706/1173 (60%), Gaps = 150/1173 (12%)

Query: 1    MGK---DVEEGEISDTASVEEISEEDFKIKQEEV------VKVVKETKPIKVGGGEAA-- 49
            MGK   DVEEGEISD+AS+EEI+EEDFK K ++V      VK  K    +K GGG     
Sbjct: 18   MGKEVEDVEEGEISDSASLEEITEEDFK-KGDDVKVNNSDVKTDKSDNKVKTGGGGGGGG 76

Query: 50   -ARVWTMRDLYNKYPAICRGYGPGLHNLAWAQAVQNKPLNEIFVMEAEQDDVSKRSSPAS 108
             +RVW ++DLY+KYP ICRGY  GL+NLAWAQAVQNKPLN+IFVME +++  +  ++  +
Sbjct: 77   DSRVWAVQDLYSKYPTICRGYASGLYNLAWAQAVQNKPLNDIFVMELDKNANANSNNSGN 136

Query: 109  SVASVNSGAAAGKDDKKVVEKVVIDDSGDEIEKEEGELEEGEIELDLESESNEKVSEQVK 168
                +N  +          EK        E  + +G+ ++  + +  E+ SN +V   V+
Sbjct: 137  KDGELNKSSKEIVVVDDDDEKEE---GELEEGEIDGDADDDCVIVGSENFSNSEVL-GVR 192

Query: 169  EEMKLINVESIREALESVLRGDISFEGVCSKLEFTLESLRELVNENNVPTKDALIQLAFS 228
              ++ + V S+ E          SF   C +++ TL+S  ++ +  +   KD L++L F+
Sbjct: 193  GVLEGVTVASVAE----------SFAETCRRIQGTLQS--KVFSGFDSAEKDDLVRLLFN 240

Query: 229  AVQSVHSVFCSMNHVLKEQNKEILSRLLSLIKSHEPPLFSSNQIKEMEAMLSSLVTRAND 288
            AV+ V+SVFC M+++ KE+NK+ +SRLLS +K+    LF+   +K++   +  ++T  + 
Sbjct: 241  AVEVVYSVFCCMDNLQKEENKDNISRLLSFLKNQH--LFTMEHMKKVIFNIQVMITVIDS 298

Query: 289  KEKDMLAMHGVNGKDSNIVTENAVNDLNFKEKVP-LPVDSLMQNKPL---------EASK 338
                + A+    G +  +  E  V  LN  E++P L  D  + +  L         EA +
Sbjct: 299  ----VFAL----GNNEVVGKEEKVEALNTTEQIPGLKADEYISSSQLVHDNSTYASEALQ 350

Query: 339  PGPPGYRSRGVLLPLLDPHKVHDVDSLPSPTRETTPSVPVQRALV-VGDGMVK------S 391
             G      RG++LPL D HK HD+DSLPSPTRE     PV +    +GDG+ +       
Sbjct: 351  YGQSNVVGRGLMLPLFDLHKDHDLDSLPSPTREAPSCFPVNKLFSDLGDGIDRFGLPPAV 410

Query: 392  WAAAAKLSHNAEVHKTPHYETDALRAFSSYQQKFGRNSFFMNSELPSPTPSEESGDGDG- 450
               A K+  + +  K   YETDAL+A S+YQQKF R+S+F + + PSPTPS   GD +G 
Sbjct: 411  CTEAEKMELDGKDSKLHIYETDALKAVSTYQQKFSRSSYFTDDKFPSPTPS---GDCEGE 467

Query: 451  --DTGGEISSATAVDQPKPVNMPTLGQQPVSSQPMDISQPMDISSVQALTTANNSAPASS 508
              DT  E+SSA+          P L Q PVSS  +D      +   +   T + S PA  
Sbjct: 468  AVDTNDEVSSASIASSLTSFKPPPLDQIPVSSTSLDRPNMHGLVDSRIDATGSGSYPA-- 525

Query: 509  GYNPVVKPNPVVKAPIKSRDPRLRFASSNALNLNHQPAPILHNAPKVEPVGRVMSSRKQK 568
                        K+  KSRDPRLRF + +A  L+   +   H+ P+VE  GRV+S RKQK
Sbjct: 526  ------------KSSAKSRDPRLRFINPDASTLDLNQSLGTHSMPRVEYGGRVIS-RKQK 572

Query: 569  TVEEPVLDGPALKRQRNGFENS-GVVRDEKNIYGSGGWLEDTDMFEPQIMNRNLLVDSAE 627
            TVEEP LD  A KR R   ENS    R+E+ + G GGW E+  +   Q+  RN L+   E
Sbjct: 573  TVEEPSLDATAPKRLRRSLENSEHNTREERAMAGKGGWFEENTVAGSQLAERNHLMQKGE 632

Query: 628  SNSRKLDNGATSPITSGTPNVVVS--GNEPAPATTPSTTVSLPA-LLKDIAVNPTMLLNI 684
            +  ++        I++ + N+ VS  GNE A  T+ S T SLP  LL ++AVNP ML+++
Sbjct: 633  TELKR-------TISTSSSNLTVSNNGNELASVTSSSATASLPTYLLNNVAVNPAMLIHM 685

Query: 685  LKMGQQQKLAADAQQKSNDSSMNT-----MHPP------------IPSSIPPVSVTCSIP 727
            +   Q  +  A+AQ+K  DS+  T       P             +P+S P  S+T ++P
Sbjct: 686  ILEHQHNE--AEAQKKPVDSARGTDATVNTGPAMTAGLTQSSVGILPASSPATSMTQTLP 743

Query: 728  SGILSKPMDELGKVRMKPRDPRRVLHGNALQRSGSLGPEFKTDGPSAPCTQGSKENLNFQ 787
                    ++ GK+RMKPRDPRR LHG++                              Q
Sbjct: 744  --------EDSGKIRMKPRDPRRFLHGSS----------------------------TLQ 767

Query: 788  KQLGAPEAKPVLSQSVLQPDITQQFTKNLKHIADFMSVSQPLTSEPMVSQN-SPIQPGQI 846
            K     E K    QS+ QPDIT+QFTKNLK+IAD MSV Q  +S P  +QN S      +
Sbjct: 768  KFDVRVETKLAPIQSIAQPDITRQFTKNLKNIADIMSVPQETSSNPPATQNVSSASVPFM 827

Query: 847  KSGADMKAVVTNHDDKQTGTGSGPE-AGPVGAHPQSAWGDVEHLFEGYDDQQKAAIQKER 905
               ++ K+ V N  + + G GS PE   P  + PQ+ W DVEHLFE YD +QKAAIQ+ER
Sbjct: 828  SDRSEQKSGVPNSQNLKDGVGSAPETCAPGSSRPQNTWADVEHLFEAYDVKQKAAIQRER 887

Query: 906  TRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFR 965
            +RRLEEQKKMF+ARKLCLVLDLDHTLLNSAKF EVDPVHDE+LRKKE++DREKP RHLFR
Sbjct: 888  SRRLEEQKKMFAARKLCLVLDLDHTLLNSAKFVEVDPVHDEMLRKKEQEDREKPQRHLFR 947

Query: 966  FPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG 1025
            FPHMGMWTKLRPG+W FLE+A KLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG
Sbjct: 948  FPHMGMWTKLRPGVWNFLEKAGKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG 1007

Query: 1026 DDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQ 1085
            DD +  D      KSKDLEGVLGMES+VVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQ
Sbjct: 1008 DDAETAD-----TKSKDLEGVLGMESSVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQ 1062

Query: 1086 FGLLGPSLLEIDHDERSEDGTLASSLGVRQQLH 1118
            FGL GPSLLEIDHDER E GTLASSLGV +++H
Sbjct: 1063 FGLPGPSLLEIDHDERPESGTLASSLGVIERIH 1095


>gi|56547717|gb|AAV92930.1| putative transcription regulator CPL1 [Solanum lycopersicum]
          Length = 1227

 Score =  819 bits (2115), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 539/1186 (45%), Positives = 691/1186 (58%), Gaps = 152/1186 (12%)

Query: 3    KDVEEGEISDTASVEEISEEDFKIKQEEVVKVVKETKPIKVGGGE-------AAARVWTM 55
            +D EEGEISD+ASVEEISE+ F  +Q+        T  IK+   E        A RVWTM
Sbjct: 8    EDAEEGEISDSASVEEISEDAFN-RQDP--PTTSTTSKIKIASNENQNQNSTTATRVWTM 64

Query: 56   RDLYNKYPAICRGYGPGLHNLAWAQAVQNKPLNEIFVMEAEQDDVSKRSSPASSVASVNS 115
            RD+Y KYP I R Y  GL+NLAWAQAVQNKPL+E+FVM +   D S + +   S   ++ 
Sbjct: 65   RDVY-KYP-ISRDYARGLYNLAWAQAVQNKPLDELFVMTS---DNSNQCANGESKVIIDV 119

Query: 116  GAAAGKDDKKVVEKVVIDDSGDEIEKEEGELEEGEIELDLESESNEKVSEQVKEEMKLIN 175
                   ++  +E+  ID            L+  ++ ++   E+N               
Sbjct: 120  DVDDDAKEEGELEEGEID------------LDSADLVVNFGKEAN--------------- 152

Query: 176  VESIREALESVLRGDI--SFEGVCSKLEFTLESLRELVNENNVPTKDALIQLAFSAVQSV 233
               IRE L+SV   +   SF  VCSKL+ +L +L EL    +    D LIQL  +A++++
Sbjct: 153  --FIREQLQSVTLDETHKSFSMVCSKLQTSLLALGELALSQD--KNDILIQLFMTALRTI 208

Query: 234  HSVFCSMNHVLKEQNKEILSRLLSLIKSHEPPLFSSNQIKEMEAML----SSLV---TRA 286
            +SVF SMN   K+QN +ILSRLL   K+  P L SS Q+KE++A++     SLV   T+ 
Sbjct: 209  NSVFYSMNDHQKQQNTDILSRLLFNAKTQLPALLSSEQLKELDALILSINHSLVSSNTQD 268

Query: 287  NDKEKDMLAMHGVNGKDSNIVTENAVNDLNFKEKVPLPVDSLM------QNKPLEASKPG 340
            ND    +  +  ++ KDS+  +ENA  D     K  L   S+       Q+   E+ KPG
Sbjct: 269  NDTVNGINVVQLLDMKDSHKSSENANQDFTSVNKYDLGDVSIKSSGLKEQSVSSESVKPG 328

Query: 341  PPGYRSRGVLLPLLDPHKVHDVDSLPSPTRETTPSVPVQRALVVGDGMVK---SWAAAAK 397
                +++G+  PLLD HK HD D+LPSPTR+  P  P  +      GMVK       A+ 
Sbjct: 329  LDNSKAKGLSFPLLDLHKDHDEDTLPSPTRQIGPQFPATQT----HGMVKLDLPIFPASL 384

Query: 398  LSHNAEVHKTPHYETDALRAFSSYQQKFGRNSFFMNSELPSPTPSEESGDGDGDTGGEIS 457
               N+ +H    YETDAL+A SSYQQKFGR+S F++  LPSPTPSEE   G GDTGGE++
Sbjct: 385  DKGNSLLHP---YETDALKAVSSYQQKFGRSSLFVSENLPSPTPSEEDDSGKGDTGGEVT 441

Query: 458  SATAVDQPKPVNMPTLGQQPVSSQPMDISQPMDISSVQALTTANNSAPASSGYNPVVKPN 517
            S   V     +N  ++GQ  +SS P       +I   Q L T   + P S        PN
Sbjct: 442  SFDVVHNASHLNESSMGQPILSSVPQ-----TNILDGQGLGTTRTADPLS------FLPN 490

Query: 518  PVVKAPI-KSRDPRLRFASSNALNLNHQPAPILHNAPKVEPVGRVMSSRKQKTVEEPVLD 576
            P +++   KSRDPRLR A+S+ +  N    PI     K+E    ++ S+KQKTV+    D
Sbjct: 491  PSLRSSTAKSRDPRLRLATSDTVAQN-TILPIPDIDLKLEASLEMIVSKKQKTVDLSAFD 549

Query: 577  GPALKRQRNGFENSGVVRDEKNIYGSGGWLEDTDMFEPQIMNRNLLVDSAESNSRKLDNG 636
             P  KRQR+   +S +V D +   G+GGWLED    E  I + N    +++++ RKL+  
Sbjct: 550  APLPKRQRSEQTDSIIVSDVRPSIGNGGWLEDRGTAELPITSSNCATYNSDNDIRKLEQ- 608

Query: 637  ATSPITSGTPNVVVSGNEPAPATTPSTTVSLPALLKDIAVNPTMLLNILKMGQQQKLAAD 696
             T+ I +  P+V+V+  E  P T  ST+ +L +LLKDIA+NP++ +NI+K  QQ+   A 
Sbjct: 609  VTATIAT-IPSVIVNAAENFPVTGISTSTTLHSLLKDIAINPSIWMNIIKTEQQKSADAS 667

Query: 697  AQQKSNDSSMNTMHPPIPSSI---PPVSVTCSIPSGILSKPM------------------ 735
                +  SS  ++   +PS++   P  S       GIL  P                   
Sbjct: 668  RTNTAQASSSKSILGAVPSTVAVAPRSSAIGQRSVGILQTPTHTASAASSIYNLLMNDFI 727

Query: 736  ----------------------DELGKVRMKPRDPRRVLHGNALQRSGSLGPEFKTDGPS 773
                                  DE+  VRMKPRDPRRVLH  A+ + GS+G +    G +
Sbjct: 728  YSVIFTASIAQFPFYFFLTFSRDEVAIVRMKPRDPRRVLHSTAVLKGGSVGLDQCKTGVA 787

Query: 774  APCTQGSKENLNFQKQLGAPEAKPVLSQSVLQPDITQQFTKNLKHIADFMSVSQPLTSEP 833
               T  +  NL+FQ Q    + K  ++ S   PDI  QFTKNLK+IAD M    P TS  
Sbjct: 788  G--THATISNLSFQSQEDQLDRKSAVTLSTTPPDIACQFTKNLKNIAD-MISVSPSTSPS 844

Query: 834  MVSQNSPIQPGQIKSGADMKAVVTNHDDKQTGTGSGPEAGPVGA-HPQSAWGDVEHLFEG 892
            + SQ   +     +S +++K  V+   +     G   E G  G+  PQ +WGDVEHLFEG
Sbjct: 845  VASQTQTLCIQAYQSRSEVKGAVSEPSEWVNDAGLASEKGSPGSLQPQISWGDVEHLFEG 904

Query: 893  YDDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKE 952
            Y DQQ+A IQ+ERTRRLEEQKKMFS                   F E+DPVH+EILRKKE
Sbjct: 905  YSDQQRADIQRERTRRLEEQKKMFS-------------------FVEIDPVHEEILRKKE 945

Query: 953  EQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDP 1012
            EQDREKP+RHLFRFPHMGMWTKLRPGIW FLE+AS LFE+HLYTMGNKLYATEMAK+LDP
Sbjct: 946  EQDREKPYRHLFRFPHMGMWTKLRPGIWNFLEKASNLFELHLYTMGNKLYATEMAKLLDP 1005

Query: 1013 KGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIV 1072
            KG LFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIV
Sbjct: 1006 KGDLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIV 1065

Query: 1073 VERYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVRQQLH 1118
            VERY YFPCSRRQFGL GPSLLEIDHDER EDGTLAS LGV Q++H
Sbjct: 1066 VERYIYFPCSRRQFGLPGPSLLEIDHDERPEDGTLASCLGVIQRIH 1111


>gi|297826809|ref|XP_002881287.1| hypothetical protein ARALYDRAFT_482300 [Arabidopsis lyrata subsp.
            lyrata]
 gi|297327126|gb|EFH57546.1| hypothetical protein ARALYDRAFT_482300 [Arabidopsis lyrata subsp.
            lyrata]
          Length = 1248

 Score =  797 bits (2058), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 531/1173 (45%), Positives = 687/1173 (58%), Gaps = 124/1173 (10%)

Query: 4    DVEEGEISDTASVEEISEEDFKIKQEEVVKV--------VKETKPIKVGGGEAAARVWTM 55
            DVEEGEI D+ + E       ++KQ+                      GG    +RVWTM
Sbjct: 26   DVEEGEIPDSGNTE------IEVKQKTTTTADVGGDVDVGGRGGGGGGGGSNGNSRVWTM 79

Query: 56   RDLYNKYPAICRGYGPGLHNLAWAQAVQNKPLNEIFVMEAEQDDVSKRSSPASSVASVNS 115
             DL  KYP        GL N AW+QAVQNK LNE  VM+ E  +  K     S       
Sbjct: 80   EDLLTKYPGYRLYATSGLSNFAWSQAVQNKSLNEGLVMDYEPRESDKIVIEDSGDEKEEG 139

Query: 116  GAAAGKDDKKVVEKVVIDDSGDEIEKEEGELEEGEIELDLESESNEKVSEQVKEEMKLIN 175
                G+ D  +VE    D+    ++KE     E  + +  +   ++++ +++  E K   
Sbjct: 140  ELEEGEID--LVENASDDNLVASVDKET----ESVVLISADKVEDDRIQKEIDLEKK--- 190

Query: 176  VESIREALES--VLRGDISFEGVCSKLEFTLESLRELVNENN-VPTKDALIQLAFSAVQS 232
            V+ IR  LES  ++     FEGVCS++   LESLRELV++N+  P +D L+QL+F+++Q+
Sbjct: 191  VKLIRGVLESTSLVEAQTGFEGVCSRILGALESLRELVSDNDDFPKRDTLVQLSFASLQT 250

Query: 233  VHSVFCSMNHVLKEQNKEILSRLLSLIKSHEPPLFSSNQIKEMEAMLSSLVTRANDKEKD 292
            ++SVFCS+N+V KE+NKE +SRLL+L+  H     SSNQ  E+EAM         D  + 
Sbjct: 251  INSVFCSLNNVSKERNKETMSRLLTLVNDHFSRFLSSNQKNEIEAM-------NQDLSRS 303

Query: 293  MLAMHGVNGKDSNIVTENAVNDLNFKEKVPLPVDSLMQNK-PLEASKPGPPGYRSRGVLL 351
             +A++      +   +E  VN +      P   DS +  K   E +  G    RSR  +L
Sbjct: 304  AIAVY------TGTSSEENVNRMT----QPSNGDSFLAKKLSSEGTHRGASYVRSRLPML 353

Query: 352  PLLDPHKVHDVDSLPSPTRETTPSVPVQRALVVGDGMVKSWAAAAKLSHNAEVHKTPHYE 411
            PLLD HK HD DSLPSPTRETTPS+PV         MVK      + S   E  K   YE
Sbjct: 354  PLLDLHKDHDADSLPSPTRETTPSLPVNGR----HTMVKPGFPVGRESQTTEGAKVYPYE 409

Query: 412  TDALRAFSSYQQKFGRNSFFMNSELPSPTPSEESGDGDGDTGGEISSATAVDQPKPVNMP 471
            +DAL+A S+Y QKFG NS F   +LPSPTPS E  DG+GD  GE+SS+       P  + 
Sbjct: 410  SDALKAVSTYHQKFGLNSVFKTDDLPSPTPSGEPNDGNGDIDGEVSSSVVKSS-NPGTLL 468

Query: 472  TLGQQ-PVSSQPMDISQPMD--ISSV---QALTTANNSAPASSGYNPVVKPNPVVKAPIK 525
              GQ  P+ S     S P+   +SS      L+    SAP  S    V   +  VK   K
Sbjct: 469  MYGQDVPLPSNFNSRSMPVANAVSSTVPPHHLSIHTISAPTGSTQT-VFASDQTVKPSAK 527

Query: 526  SRDPRLRFASSNALN--LNHQPAPILHNAPKVEPVGRVMSSRKQKTVEEPVLDGPALKRQ 583
            SRDPRLR A  +  N  +N   +    N  KVE    +++ RKQK  +E  +DGPA KRQ
Sbjct: 528  SRDPRLRLAKPDTANVTINSYSSGDARNLFKVELSADLVNPRKQKAADELFIDGPAWKRQ 587

Query: 584  RNGFENSGVVRDEKNIYGSGGWLEDTDMFE-PQIMNRNLLVDSAESNSRKLDNGATSPIT 642
            ++         D     G GGWLEDT+    P++          ES  R ++NG TS  T
Sbjct: 588  KSD-------TDAPKAAGIGGWLEDTESSGLPKL----------ESKPRLIENGVTSMTT 630

Query: 643  SGTPNVVVSGNEPAPATTPSTTVSLPALLKDIAVNPTMLLNILKMGQQQKLAADAQQKSN 702
            S  P   VS ++  P T  +   SL +LL+DIAVNPTMLLN+LKMG++ K+   A QK  
Sbjct: 631  SVMPTSAVSVSQKVP-TASTDAASLQSLLQDIAVNPTMLLNLLKMGERHKVPEKALQKPM 689

Query: 703  DSSMNTMHPPIPSSIPPVSVTCSIPS-----------GIL-----SKPMDELGKVRMKPR 746
            D        P  S +P VS    IP+           G+L     + P DE G +RMKPR
Sbjct: 690  DPR-RAAQLPGSSVLPGVSAPLHIPASNALATNSSKRGVLQDSSQNAPTDESGSIRMKPR 748

Query: 747  DPRRVLHGNALQRS-GSLGPEFKTDGPSAPCT---QGSKENLNFQKQL---------GAP 793
            DPRR+LHG  LQR+  S+  + K +  S   T   +G  E+L    QL         G  
Sbjct: 749  DPRRILHGGTLQRTDSSMEKQSKVNDSSTLGTLTMKGKTEDLETPSQLLPRQNISQNGTS 808

Query: 794  EAK---PVLSQSVLQPDITQQFTKNLKHIADFMSVSQPLTSEPMVSQNSPIQPGQIKSGA 850
            + K    +LS+    PD + QFTK++K+IAD + VSQ   + P  + +      Q+K+  
Sbjct: 809  KMKISGELLSEKT--PDFSTQFTKSVKNIADMVVVSQQAGNPPASTHSI-----QLKTER 861

Query: 851  DMKAVVTNHDDKQ-----TGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKAAIQKER 905
            D+K   +N +D++     +       AGP  +   ++WGDVEHLFEGYDD Q+ AIQ+ER
Sbjct: 862  DVKQNPSNPNDQEEDVSVSAASVTATAGPTRS--MNSWGDVEHLFEGYDDTQRVAIQRER 919

Query: 906  TRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFR 965
             RRLEEQKKMF+++KL LVLD+DHTLLNSAKF+EV+  H+EILRKKEEQDREKP+RHLFR
Sbjct: 920  VRRLEEQKKMFASQKLSLVLDIDHTLLNSAKFNEVEFRHEEILRKKEEQDREKPYRHLFR 979

Query: 966  FPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG 1025
            FPHMGMWTKLRPGIW FLE+ASKL+E+HLYTMGNKLYATEMAK+LDPKG+LF GRVIS+G
Sbjct: 980  FPHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKLLDPKGILFNGRVISKG 1039

Query: 1026 DDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQ 1085
            DDGDP DGDERVPKSKDLEGV+GMES+VVIIDDSVRVWP+NK+NLI VERY YFP SRRQ
Sbjct: 1040 DDGDPLDGDERVPKSKDLEGVMGMESSVVIIDDSVRVWPYNKMNLIAVERYLYFPRSRRQ 1099

Query: 1086 FGLLGPSLLEIDHDERSEDGTLASSLGVRQQLH 1118
            FGLLGPSLLE+D DE  E+GTLASSL V +++H
Sbjct: 1100 FGLLGPSLLELDRDEVPEEGTLASSLAVIEKIH 1132


>gi|30685744|ref|NP_180912.2| RNA polymerase II C-terminal domain phosphatase-like 3 [Arabidopsis
            thaliana]
 gi|238055326|sp|Q8LL04.2|CPL3_ARATH RecName: Full=RNA polymerase II C-terminal domain phosphatase-like 3;
            Short=FCP-like 3; AltName: Full=Carboxyl-terminal
            phosphatase-like 3; Short=AtCPL3; Short=CTD
            phosphatase-like 3
 gi|330253756|gb|AEC08850.1| RNA polymerase II C-terminal domain phosphatase-like 3 [Arabidopsis
            thaliana]
          Length = 1241

 Score =  789 bits (2038), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 524/1135 (46%), Positives = 668/1135 (58%), Gaps = 156/1135 (13%)

Query: 52   VWTMRDLYNKYPAICRGYGPGLHNLAWAQAVQNKPLNEIFVMEAEQDDVSKRSSPASSVA 111
            VWTM +L ++YPA       GL NLAWA+AVQNKP NE  VM+ E         P  S  
Sbjct: 79   VWTMEELISQYPAYRPYANSGLSNLAWARAVQNKPFNEGLVMDYE---------PRES-- 127

Query: 112  SVNSGAAAGKDDKKVVEKVVIDDSGDE----------IEKEEGELEEGEIELDLESE--- 158
                            +K+VI+DS DE          I+  +   ++  +E D ES    
Sbjct: 128  ----------------DKIVIEDSDDEKEEGELEEGEIDLVDNASDDNLVEKDTESVVLI 171

Query: 159  SNEKVSEQ--VKEEMKLINVESIREALES--VLRGDISFEGVCSKLEFTLESLRELVNEN 214
            S +KV +   +KE      V+ IR  LES  ++     FEGVCS++   LESLRELV++N
Sbjct: 172  SADKVEDDRILKERDLEKKVKLIRGVLESTSLVEAQTGFEGVCSRILGALESLRELVSDN 231

Query: 215  N-VPTKDALIQLAFSAVQSVHSVFCSMNHVLKEQNKEILSRLLSLIKSHEPPLFSSNQIK 273
            +  P +D L+QL+F+++Q+++ VFCSMN++ KE+NKE +SRLL+L+  H     S NQ  
Sbjct: 232  DDFPKRDTLVQLSFASLQTINYVFCSMNNISKERNKETMSRLLTLVNDHFSQFLSFNQKN 291

Query: 274  EMEAMLSSLVTRANDKEKDMLAMHGVNGKDSNIVTENAVNDLNFKEKVPLPVDSLMQNK- 332
            E+E M         D  +  +A+      + N      VN +      P   DS +  K 
Sbjct: 292  EIETM-------NQDLSRSAIAVFAGTSSEEN------VNQMT----QPSNGDSFLAKKL 334

Query: 333  PLEASKPGPPGYRSRGVLLPLLDPHKVHDVDSLPSPTRETTPSVPVQRALVVGDGMVKSW 392
              E++  G    RSR  +LPLLD HK HD DSLPSPTRETTPS+PV         MV+  
Sbjct: 335  TSESTHRGAAYLRSRLPMLPLLDLHKDHDADSLPSPTRETTPSLPVNGR----HTMVRPG 390

Query: 393  AAAAKLSHNAEVHKTPHYETDALRAFSSYQQKFGRNSFFMNSELPSPTPSEESGDGDGDT 452
                + S   E  K   YE+DA +A S+YQQKFG NS F   +LPSPTPS E  DG+GD 
Sbjct: 391  FPVGRESQTTEGAKVYSYESDARKAVSTYQQKFGLNSVFKTDDLPSPTPSGEPNDGNGDV 450

Query: 453  GGEISSATAV------------DQPKPVNMPTLGQQPVSSQPMDISQPMDISSVQALTTA 500
            GGE+SS+               D P P N  +    PV++       P  +S + A+   
Sbjct: 451  GGEVSSSVVKSSNPGSHLIYGQDVPLPSNFNSRSM-PVANSVSSTVPPHHLS-IHAI--- 505

Query: 501  NNSAPASSGYNPVVKPNPVVKAPIKSRDPRLRFASSNALNLN--HQPAPILHNAPKVEPV 558
              SAP +S        +  VK   KSRDPRLR A  +A N+      +    N  KVE  
Sbjct: 506  --SAPTAS--------DQTVKPSAKSRDPRLRLAKPDAANVTIYSYSSGDARNLSKVELS 555

Query: 559  GRVMSSRKQKTVEEPVLDGPALKRQRNGFENSGVVRDEKNIYGSGGWLEDTDMFEPQIMN 618
              +++ RKQK  +E ++DGPA KRQ++         D     G+GGWLEDT+       +
Sbjct: 556  ADLVNPRKQKAADEFLIDGPAWKRQKSD-------TDAPKAAGTGGWLEDTE-------S 601

Query: 619  RNLLVDSAESNSRKLDNGATSPITSGTPNVVVSGNEPAPATTPSTTVSLPALLKDIAVNP 678
              LL    ES  R ++NG TS  +S  P   VS ++    T  + T SL +LLKDIAVNP
Sbjct: 602  SGLL--KLESKPRLIENGVTSMTSSVMPTSAVSVSQKV-RTASTDTASLQSLLKDIAVNP 658

Query: 679  TMLLNILKMGQQQKLAADAQQKSNDSSMNTMHPPIPSSIPPVSVTCSIP----------- 727
            TMLLN+LKMG++QK+   A QK  D        P  S  P VS   SIP           
Sbjct: 659  TMLLNLLKMGERQKVPEKAIQKPMDPR-RAAQLPGSSVQPGVSTPLSIPASNALAANSLN 717

Query: 728  SGIL-----SKPMDELGKVRMKPRDPRRVLHGNALQRS-GSLGPEFKTDGPSAPCT---Q 778
            SG+L     + P  E G +RMKPRDPRR+LHG+ LQR+  S+  + K + PS   T   +
Sbjct: 718  SGVLQDSSQNAPAAESGSIRMKPRDPRRILHGSTLQRTDSSMEKQTKVNDPSTLGTLTMK 777

Query: 779  GSKENLNFQKQLGAPE-------AKPVLSQSVLQ---PDITQQFTKNLKHIADFMSVSQP 828
            G  E+L    QL   +       +K  +S  +L    PD + QFTKNLK IAD + VSQ 
Sbjct: 778  GKAEDLETPPQLDPRQNISQNGTSKMKISGELLSGKTPDFSTQFTKNLKSIADMVVVSQQ 837

Query: 829  LTSEPMVSQNSPIQPGQIKSGADMKAVVTN-----HDDKQTGTGSGPEAGPVGAHPQSAW 883
            L + P     + +   Q+K+  D+K   +N      D   +       AGP  +   ++W
Sbjct: 838  LGNPP-----ASMHSVQLKTERDVKHNPSNPNAQDEDVSVSAASVTAAAGPTRS--MNSW 890

Query: 884  GDVEHLFEGYDDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPV 943
            GDVEHLFEGYDD Q+ AIQ+ER RRLEEQ KMF+++KL LVLD+DHTLLNSAKF+EV+  
Sbjct: 891  GDVEHLFEGYDDIQRVAIQRERVRRLEEQNKMFASQKLSLVLDIDHTLLNSAKFNEVESR 950

Query: 944  HDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYA 1003
            H+EILRKKEEQDREKP+RHLFRF HMGMWTKLRPGIW FLE+ASKL+E+HLYTMGNKLYA
Sbjct: 951  HEEILRKKEEQDREKPYRHLFRFLHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYA 1010

Query: 1004 TEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVW 1063
            TEMAK+LDPKGVLF GRVIS+GDDGDP DGDERVPKSKDLEGV+GMES+VVIIDDSVRVW
Sbjct: 1011 TEMAKLLDPKGVLFNGRVISKGDDGDPLDGDERVPKSKDLEGVMGMESSVVIIDDSVRVW 1070

Query: 1064 PHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVRQQLH 1118
            P +K+NLI VERY YFPCSRRQFGLLGPSLLE+D DE  E+GTLASSL V +++H
Sbjct: 1071 PQHKMNLIAVERYLYFPCSRRQFGLLGPSLLELDRDEVPEEGTLASSLAVIEKIH 1125


>gi|22212705|gb|AAM94371.1|AF486633_1 CTD phosphatase-like 3 [Arabidopsis thaliana]
          Length = 1241

 Score =  788 bits (2034), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 523/1135 (46%), Positives = 667/1135 (58%), Gaps = 156/1135 (13%)

Query: 52   VWTMRDLYNKYPAICRGYGPGLHNLAWAQAVQNKPLNEIFVMEAEQDDVSKRSSPASSVA 111
            VWTM +L ++YPA       GL NLAWA+AVQNKP NE  VM+ E         P  S  
Sbjct: 79   VWTMEELISQYPAYRPYANSGLSNLAWARAVQNKPFNEGLVMDYE---------PRES-- 127

Query: 112  SVNSGAAAGKDDKKVVEKVVIDDSGDE----------IEKEEGELEEGEIELDLESE--- 158
                            +K+VI+DS DE          I+  +   ++  +E D ES    
Sbjct: 128  ----------------DKIVIEDSDDEKEEGELEEGEIDLVDNASDDNLVEKDTESVVLI 171

Query: 159  SNEKVSEQ--VKEEMKLINVESIREALES--VLRGDISFEGVCSKLEFTLESLRELVNEN 214
            S +KV +   +KE      V+ IR  LES  ++     FEGVCS++   LESLRELV++N
Sbjct: 172  SADKVEDDRILKERDLEKKVKLIRGVLESTSLVEAQTGFEGVCSRILGALESLRELVSDN 231

Query: 215  N-VPTKDALIQLAFSAVQSVHSVFCSMNHVLKEQNKEILSRLLSLIKSHEPPLFSSNQIK 273
            +  P +D L+QL+F+++Q+++ VFCSMN++ KE+NKE +SRLL+L+  H     S NQ  
Sbjct: 232  DDFPKRDTLVQLSFASLQTINYVFCSMNNISKERNKETMSRLLTLVNDHFSQFLSFNQKN 291

Query: 274  EMEAMLSSLVTRANDKEKDMLAMHGVNGKDSNIVTENAVNDLNFKEKVPLPVDSLMQNK- 332
            E+E M         D  +  +A+      + N      VN +      P   DS +  K 
Sbjct: 292  EIETM-------NQDLSRSAIAVFAGTSSEEN------VNQMT----QPSNGDSFLAKKL 334

Query: 333  PLEASKPGPPGYRSRGVLLPLLDPHKVHDVDSLPSPTRETTPSVPVQRALVVGDGMVKSW 392
              E++  G    RSR  +LPLLD HK HD DSLPSPTRETTPS+PV         MV+  
Sbjct: 335  TSESTHRGAAYLRSRLPMLPLLDLHKDHDADSLPSPTRETTPSLPVNGR----HTMVRPG 390

Query: 393  AAAAKLSHNAEVHKTPHYETDALRAFSSYQQKFGRNSFFMNSELPSPTPSEESGDGDGDT 452
                + S   E  K   YE+DA +A S+YQQKFG NS F   +LPSPTPS E  DG+GD 
Sbjct: 391  FPVGRESQTTEGAKVYSYESDARKAVSTYQQKFGLNSVFKTDDLPSPTPSGEPNDGNGDV 450

Query: 453  GGEISSATAV------------DQPKPVNMPTLGQQPVSSQPMDISQPMDISSVQALTTA 500
            GGE+SS+               D P P N  +    PV++       P  +S + A+   
Sbjct: 451  GGEVSSSVVKSSNPGSHLIYGQDVPLPSNFNSRSM-PVANSVSSTVPPHHLS-IHAI--- 505

Query: 501  NNSAPASSGYNPVVKPNPVVKAPIKSRDPRLRFASSNALNLN--HQPAPILHNAPKVEPV 558
              SAP +S        +  VK   KSRDPRLR A  +A N+      +    N  KVE  
Sbjct: 506  --SAPTAS--------DQTVKPSAKSRDPRLRLAKPDAANVTIYSYSSGDARNLSKVELS 555

Query: 559  GRVMSSRKQKTVEEPVLDGPALKRQRNGFENSGVVRDEKNIYGSGGWLEDTDMFEPQIMN 618
              +++ RKQK  +E ++DGPA KRQ++         D     G+GGWLEDT+       +
Sbjct: 556  ADLVNPRKQKAADEFLIDGPAWKRQKSD-------TDAPKAAGTGGWLEDTE-------S 601

Query: 619  RNLLVDSAESNSRKLDNGATSPITSGTPNVVVSGNEPAPATTPSTTVSLPALLKDIAVNP 678
              LL    ES  R ++NG TS  +S  P   VS ++    T  + T SL +LLKDIAVNP
Sbjct: 602  SGLL--KLESKPRLIENGVTSMTSSVMPTSAVSVSQKV-RTASTDTASLQSLLKDIAVNP 658

Query: 679  TMLLNILKMGQQQKLAADAQQKSNDSSMNTMHPPIPSSIPPVSVTCSIP----------- 727
            TMLLN+LKMG++QK+   A QK  D        P  S  P VS   SIP           
Sbjct: 659  TMLLNLLKMGERQKVPEKAIQKPMDPR-RAAQLPGSSVQPGVSTPLSIPASNALAANSLN 717

Query: 728  SGIL-----SKPMDELGKVRMKPRDPRRVLHGNALQRS-GSLGPEFKTDGPSAPCT---Q 778
            SG+L     + P  E G +RMKPRDPRR+LHG+ LQR+  S+  + K + PS   T   +
Sbjct: 718  SGVLQDSSQNAPAAESGSIRMKPRDPRRILHGSTLQRTDSSMEKQTKVNDPSTLGTLTMK 777

Query: 779  GSKENLNFQKQLGAPE-------AKPVLSQSVLQ---PDITQQFTKNLKHIADFMSVSQP 828
            G  E+L    QL   +       +K  +S  +L    PD + QFTKNLK IAD + VSQ 
Sbjct: 778  GKAEDLETPPQLDPRQNISQNGTSKMKISGELLSGKTPDFSTQFTKNLKSIADMVVVSQQ 837

Query: 829  LTSEPMVSQNSPIQPGQIKSGADMKAVVTN-----HDDKQTGTGSGPEAGPVGAHPQSAW 883
            L + P     + +   Q+K+  D+K   +N      D   +       AGP  +   ++W
Sbjct: 838  LGNPP-----ASMHSVQLKTERDVKHNPSNPNAQDEDVSVSAASVTAAAGPTRS--MNSW 890

Query: 884  GDVEHLFEGYDDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPV 943
            GDVEHLFEGYDD Q+ AIQ+ER RRLEEQ KMF+++KL LVLD+DHTLLNSAKF+EV+  
Sbjct: 891  GDVEHLFEGYDDIQRVAIQRERVRRLEEQNKMFASQKLSLVLDIDHTLLNSAKFNEVESR 950

Query: 944  HDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYA 1003
            H+EILRKKEEQDREKP+RHLFRF HMGMWTKLRPGIW FLE+ASKL+E+HLYTMGNKLY 
Sbjct: 951  HEEILRKKEEQDREKPYRHLFRFLHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYV 1010

Query: 1004 TEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVW 1063
            TEMAK+LDPKGVLF GRVIS+GDDGDP DGDERVPKSKDLEGV+GMES+VVIIDDSVRVW
Sbjct: 1011 TEMAKLLDPKGVLFNGRVISKGDDGDPLDGDERVPKSKDLEGVMGMESSVVIIDDSVRVW 1070

Query: 1064 PHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVRQQLH 1118
            P +K+NLI VERY YFPCSRRQFGLLGPSLLE+D DE  E+GTLASSL V +++H
Sbjct: 1071 PQHKMNLIAVERYLYFPCSRRQFGLLGPSLLELDRDEVPEEGTLASSLAVIEKIH 1125


>gi|2459436|gb|AAB80671.1| unknown protein [Arabidopsis thaliana]
          Length = 1066

 Score =  724 bits (1870), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 493/1093 (45%), Positives = 632/1093 (57%), Gaps = 156/1093 (14%)

Query: 52   VWTMRDLYNKYPAICRGYGPGLHNLAWAQAVQNKPLNEIFVMEAEQDDVSKRSSPASSVA 111
            VWTM +L ++YPA       GL NLAWA+AVQNKP NE  VM+ E         P  S  
Sbjct: 56   VWTMEELISQYPAYRPYANSGLSNLAWARAVQNKPFNEGLVMDYE---------PRES-- 104

Query: 112  SVNSGAAAGKDDKKVVEKVVIDDSGDE----------IEKEEGELEEGEIELDLESE--- 158
                            +K+VI+DS DE          I+  +   ++  +E D ES    
Sbjct: 105  ----------------DKIVIEDSDDEKEEGELEEGEIDLVDNASDDNLVEKDTESVVLI 148

Query: 159  SNEKVSEQ--VKEEMKLINVESIREALES--VLRGDISFEGVCSKLEFTLESLRELVNEN 214
            S +KV +   +KE      V+ IR  LES  ++     FEGVCS++   LESLRELV++N
Sbjct: 149  SADKVEDDRILKERDLEKKVKLIRGVLESTSLVEAQTGFEGVCSRILGALESLRELVSDN 208

Query: 215  N-VPTKDALIQLAFSAVQSVHSVFCSMNHVLKEQNKEILSRLLSLIKSHEPPLFSSNQIK 273
            +  P +D L+QL+F+++Q+++ VFCSMN++ KE+NKE +SRLL+L+  H     S NQ  
Sbjct: 209  DDFPKRDTLVQLSFASLQTINYVFCSMNNISKERNKETMSRLLTLVNDHFSQFLSFNQKN 268

Query: 274  EMEAMLSSLVTRANDKEKDMLAMHGVNGKDSNIVTENAVNDLNFKEKVPLPVDSLMQNK- 332
            E+E M         D  +  +A+      + N      VN +      P   DS +  K 
Sbjct: 269  EIETM-------NQDLSRSAIAVFAGTSSEEN------VNQMT----QPSNGDSFLAKKL 311

Query: 333  PLEASKPGPPGYRSRGVLLPLLDPHKVHDVDSLPSPTRETTPSVPVQRALVVGDGMVKSW 392
              E++  G    RSR  +LPLLD HK HD DSLPSPTRETTPS+PV         MV+  
Sbjct: 312  TSESTHRGAAYLRSRLPMLPLLDLHKDHDADSLPSPTRETTPSLPVNGRHT----MVRPG 367

Query: 393  AAAAKLSHNAEVHKTPHYETDALRAFSSYQQKFGRNSFFMNSELPSPTPSEESGDGDGDT 452
                + S   E  K   YE+DA +A S+YQQKFG NS F   +LPSPTPS E  DG+GD 
Sbjct: 368  FPVGRESQTTEGAKVYSYESDARKAVSTYQQKFGLNSVFKTDDLPSPTPSGEPNDGNGDV 427

Query: 453  GGEISSATAV------------DQPKPVNMPTLGQQPVSSQPMDISQPMDISSVQALTTA 500
            GGE+SS+               D P P N  +    PV++       P  +S + A+   
Sbjct: 428  GGEVSSSVVKSSNPGSHLIYGQDVPLPSNFNSRSM-PVANSVSSTVPPHHLS-IHAI--- 482

Query: 501  NNSAPASSGYNPVVKPNPVVKAPIKSRDPRLRFASSNALNLN--HQPAPILHNAPKVEPV 558
              SAP +S        +  VK   KSRDPRLR A  +A N+      +    N  KVE  
Sbjct: 483  --SAPTAS--------DQTVKPSAKSRDPRLRLAKPDAANVTIYSYSSGDARNLSKVELS 532

Query: 559  GRVMSSRKQKTVEEPVLDGPALKRQRNGFENSGVVRDEKNIYGSGGWLEDTDMFEPQIMN 618
              +++ RKQK  +E ++DGPA KRQ++         D     G+GGWLEDT+       +
Sbjct: 533  ADLVNPRKQKAADEFLIDGPAWKRQKSD-------TDAPKAAGTGGWLEDTE-------S 578

Query: 619  RNLLVDSAESNSRKLDNGATSPITSGTPNVVVSGNEPAPATTPSTTVSLPALLKDIAVNP 678
              LL    ES  R ++NG TS  +S  P   VS ++    T  + T SL +LLKDIAVNP
Sbjct: 579  SGLL--KLESKPRLIENGVTSMTSSVMPTSAVSVSQKV-RTASTDTASLQSLLKDIAVNP 635

Query: 679  TMLLNILKMGQQQKLAADAQQKSNDSSMNTMHPPIPSSIPPVSVTCSIP----------- 727
            TMLLN+LKMG++QK+   A QK  D        P  S  P VS   SIP           
Sbjct: 636  TMLLNLLKMGERQKVPEKAIQKPMDPR-RAAQLPGSSVQPGVSTPLSIPASNALAANSLN 694

Query: 728  SGIL-----SKPMDELGKVRMKPRDPRRVLHGNALQRS-GSLGPEFKTDGPSAPCT---Q 778
            SG+L     + P  E G +RMKPRDPRR+LHG+ LQR+  S+  + K + PS   T   +
Sbjct: 695  SGVLQDSSQNAPAAESGSIRMKPRDPRRILHGSTLQRTDSSMEKQTKVNDPSTLGTLTMK 754

Query: 779  GSKENLNFQKQLGAPE-------AKPVLSQSVLQ---PDITQQFTKNLKHIADFMSVSQP 828
            G  E+L    QL   +       +K  +S  +L    PD + QFTKNLK IAD + VSQ 
Sbjct: 755  GKAEDLETPPQLDPRQNISQNGTSKMKISGELLSGKTPDFSTQFTKNLKSIADMVVVSQQ 814

Query: 829  LTSEPMVSQNSPIQPGQIKSGADMKAVVTN-----HDDKQTGTGSGPEAGPVGAHPQSAW 883
            L + P     + +   Q+K+  D+K   +N      D   +       AGP  +   ++W
Sbjct: 815  LGNPP-----ASMHSVQLKTERDVKHNPSNPNAQDEDVSVSAASVTAAAGPTRS--MNSW 867

Query: 884  GDVEHLFEGYDDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPV 943
            GDVEHLFEGYDD Q+ AIQ+ER RRLEEQ KMF+++KL LVLD+DHTLLNSAKF+EV+  
Sbjct: 868  GDVEHLFEGYDDIQRVAIQRERVRRLEEQNKMFASQKLSLVLDIDHTLLNSAKFNEVESR 927

Query: 944  HDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYA 1003
            H+EILRKKEEQDREKP+RHLFRF HMGMWTKLRPGIW FLE+ASKL+E+HLYTMGNKLYA
Sbjct: 928  HEEILRKKEEQDREKPYRHLFRFLHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYA 987

Query: 1004 TEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVW 1063
            TEMAK+LDPKGVLF GRVIS+GDDGDP DGDERVPKSKDLEGV+GMES+VVIIDDSVRVW
Sbjct: 988  TEMAKLLDPKGVLFNGRVISKGDDGDPLDGDERVPKSKDLEGVMGMESSVVIIDDSVRVW 1047

Query: 1064 PHNKLNLIVVERY 1076
            P +K+NLI VERY
Sbjct: 1048 PQHKMNLIAVERY 1060


>gi|77551160|gb|ABA93957.1| NLI interacting factor-like phosphatase family protein, expressed
            [Oryza sativa Japonica Group]
          Length = 1272

 Score =  675 bits (1742), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 473/1173 (40%), Positives = 644/1173 (54%), Gaps = 142/1173 (12%)

Query: 16   VEEISEEDFKIKQEEVVKVVKETKPIKVGGGEAAARVWTMRDLYNKYPAICRGYGPGLHN 75
            +EEIS +DFK +                   +  +RVW     YN    I R Y P  H+
Sbjct: 57   LEEISADDFKKESSAAGGAAAAAA------AQQRSRVWMG---YN----IPRSYAPAFHS 103

Query: 76   LAWAQAVQNKPLNEIFVMEAEQDDV---------SKRSSPASSVASVNSGAAAGKDDKKV 126
             AWAQAVQNKPL       A++D+V          K         +V +   +       
Sbjct: 104  FAWAQAVQNKPLVPRAADAADEDEVEHVVDTSDEEKEEGEIEEGEAVQTTTTSSSSPPCA 163

Query: 127  VEKVVIDDSGDEIEKEEGELEEGEIELDLESESNEKVSEQVKEEMKLINVESIREALE-- 184
                 ID   D  EK E  +              E+V    +       V SI E LE  
Sbjct: 164  QPPETIDLDSDAPEKSESMVAMYGGGAAPAGAEEEEVDFDQR-------VGSILEELEMV 216

Query: 185  SVLRGDISFEGVCSKLEFTLESLRELVNENN--VPTKDALIQLAFSAVQSVHSVFCSMNH 242
            S+   + SFEG C++L    E+L+ L  E+   +P  DAL+Q AF  + ++ +V  S + 
Sbjct: 217  SIEEAEKSFEGACTRLRTCFENLKPLFPESGSPMPMLDALVQQAFVGIDTITTVANSYDM 276

Query: 243  VLKEQNKEILSRLLSLIKSHEPPLFSSNQIKEMEAMLSSLVTRANDKEKDMLAMHGVNGK 302
              +EQ K +L +LL  IK+    + + +Q  E+++ +  LV    +  KD       NG 
Sbjct: 277  PKREQTKNMLLKLLFHIKNRYSDMLTPDQRDELDSRVRQLVF---EDGKD-----NANGP 328

Query: 303  DSNIVTENAVNDLNFKEKVPLPVDSLMQNKPLEASKPGPPGYRSRGVLLPLLDPHKVHDV 362
            ++      A +     E+  LP +S   N   +   P     ++R ++ PLLD H  +D 
Sbjct: 329  NATSTNAAAPSGQVLSER--LPFESGAGNSFSKVEIPA----KNR-MVSPLLDLHADYDE 381

Query: 363  DSLPSPTRETTPSVPVQRALVVGDGMVKSWAAAAKLSHNAEVHKTPHYET--DALRAFSS 420
            +SLPSPTR++ P   V +   +G G +        +    E  K   Y++  DAL+A   
Sbjct: 382  NSLPSPTRDSKPPFDVPKP--IGYGALPMAPDRPSVLERVEPAKNSSYQSFNDALKAVCY 439

Query: 421  YQQKFGRNSFFMNSELPSPTPS---EESGDGDGDTGGEISSATAVDQPKPVNMPTLGQQP 477
            YQQK G+ S F + +LPSPTPS   ++SGD  GD  GE+SS +A ++   + +P + Q P
Sbjct: 440  YQQKHGQKSNFASDDLPSPTPSGDGDKSGDKGGDVFGEVSSFSASNK---IALPIVNQMP 496

Query: 478  VSSQPMDISQPMDISSVQALTTANNSAPASSGY-----NPVVKPNPVVKAPIKSRDPRLR 532
                    S+P  +SS      +++ A    GY     N V   N ++KA  KSRDPRL+
Sbjct: 497  --------SRPSTVSS-----NSDSFAGGPPGYAKQIENSVSGSNHLLKATAKSRDPRLK 543

Query: 533  F--------ASSNALNLNHQPAPILHNAPKVEPVGRVMSSRKQKTVEEPVLDGPALKRQR 584
            F        A +N      +P P   +  +    G  ++SRK K V+EP++D  ALKR R
Sbjct: 544  FLNRDTGGVADANRRVNFAEPNP---SKDRTMGGGVSINSRKNKAVDEPMVDENALKRSR 600

Query: 585  NGFENSGVVRDEKNI--YGSGGWLED--------TDMFEPQIMNRNLLVDSAESNSRKLD 634
                  GV+ + +++   G GGW +D        +D F+P   N+N  + +  + +  + 
Sbjct: 601  ------GVIGNLRDMQPTGRGGWAKDGGNISSYSSDGFQP---NQNTRLGNNTTGNHNIR 651

Query: 635  NGAT--------SPITSGTPNVVVS-GNEPAPATTPSTTVSLPALLKDIAVNPTMLLNIL 685
              +T        +  +  +P +V +     AP T+ +  VSLPA+LKDIAVNPTML+  +
Sbjct: 652  TDSTLASNLNNTTNNSGTSPGIVQAPQTNSAPQTSSAPAVSLPAMLKDIAVNPTMLMQWI 711

Query: 686  KMGQQQKLAADAQQKSNDSSMNT------MHPPIPSSIPPVSVTCSIPS-----GILSKP 734
            +M QQ+  A++ QQK   S   T      M  P+  + P  +   ++PS      + S P
Sbjct: 712  QMEQQKMSASEPQQKVTASVGMTSNVTPGMVLPL-GNAPKTTEVAAVPSVRPQVPMQSAP 770

Query: 735  M---DELGKVRMKPRDPRRVLHGNALQRSGSLGP----EFKTDGPSAPCTQGSKENLNFQ 787
            M   ++ G +RMKPRDPRR+LH N +Q++ ++ P    + K++G + P +Q SK++L  Q
Sbjct: 771  MHSQNDTGVIRMKPRDPRRILHSNIVQKNDTVPPVGVEQAKSNGTAPPDSQSSKDHLLNQ 830

Query: 788  KQLGAPEAKPVLSQSVLQPDI-TQQFTKNLKHIADFMSVSQPLTSEPMVSQNSPIQPGQI 846
             Q      K    Q++  P +      + +   A+ +S SQ   +  M    +  Q    
Sbjct: 831  DQ------KAEQLQAIALPSLPVTSSARPVTMNANPVSNSQLAATALMPPHGNTKQTSSS 884

Query: 847  KSGADMK-AVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKAAIQKER 905
             + AD + A   N  +    T +GP   P    P S +GDV+HL +GYDDQQKA IQKER
Sbjct: 885  VNKADPRLAAGQNESNDDAATSTGPVTAPDAVPPASPYGDVDHLLDGYDDQQKALIQKER 944

Query: 906  TRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFR 965
             RR++EQ KMF+ARKLCLVLDLDHTLLNSAKF EVD +H EILRKKEEQDRE+  RHLF 
Sbjct: 945  ARRIKEQHKMFAARKLCLVLDLDHTLLNSAKFIEVDHIHGEILRKKEEQDRERAERHLFC 1004

Query: 966  FPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG 1025
            F HMGMWTKLRPGIW FLE+ASKL+E+HLYTMGNK+YATEMAKVLDP G LFAGRVISRG
Sbjct: 1005 FNHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKVYATEMAKVLDPTGTLFAGRVISRG 1064

Query: 1026 DDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQ 1085
            DDGDPFD DERVPKSKDL+GVLGMESAVVIIDDSVRVWPHNK NLIVVERYTYFPCSRRQ
Sbjct: 1065 DDGDPFDSDERVPKSKDLDGVLGMESAVVIIDDSVRVWPHNKHNLIVVERYTYFPCSRRQ 1124

Query: 1086 FGLLGPSLLEIDHDERSEDGTLASSLGVRQQLH 1118
            FGL GPSLLEID DER EDGTLASSL V +++H
Sbjct: 1125 FGLPGPSLLEIDRDERPEDGTLASSLAVIERIH 1157


>gi|222616055|gb|EEE52187.1| hypothetical protein OsJ_34058 [Oryza sativa Japonica Group]
          Length = 1267

 Score =  650 bits (1678), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 425/986 (43%), Positives = 579/986 (58%), Gaps = 111/986 (11%)

Query: 192  SFEGVCSKLEFTLESLRELVNENN--VPTKDALIQLAFSAVQSVHSVFCSMNHVLKEQNK 249
            SFEG C++L    E+L+ L  E+   +P  DAL+Q AF  + ++ +V  S +   +EQ K
Sbjct: 219  SFEGACTRLRTCFENLKPLFPESGSPMPMLDALVQQAFVGIDTITTVANSYDMPKREQTK 278

Query: 250  EILSRLLSLIKSHEPPLFSSNQIKEMEAMLSSLVTRANDKEKDMLAMHGVNGKDSNIVTE 309
             +L +LL  IK+    + + +Q  E+++ +  LV    +  KD       NG ++     
Sbjct: 279  NMLLKLLFHIKNRYSDMLTPDQRDELDSRVRQLVF---EDGKD-----NANGPNATSTNA 330

Query: 310  NAVNDLNFKEKVPLPVDSLMQNKPLEASKPGPPGYRSRGVLLPLLDPHKVHDVDSLPSPT 369
             A +     E+  LP +S   N   +   P     ++R ++ PLLD H  +D +SLPSPT
Sbjct: 331  AAPSGQVLSER--LPFESGAGNSFSKVEIPA----KNR-MVSPLLDLHADYDENSLPSPT 383

Query: 370  RETTPSVPVQRALVVGDGMVKSWAAAAKLSHNAEVHKTPHYET--DALRAFSSYQQKFGR 427
            R++ P   V +   +G G +        +    E  K   Y++  DAL+A   YQQK G+
Sbjct: 384  RDSKPPFDVPKP--IGYGALPMAPDRPSVLERVEPAKNSSYQSFNDALKAVCYYQQKHGQ 441

Query: 428  NSFFMNSELPSPTPS---EESGDGDGDTGGEISSATAVDQPKPVNMPTLGQQPVSSQPMD 484
             S F + +LPSPTPS   ++SGD  GD  GE+SS +A ++   + +P + Q P       
Sbjct: 442  KSNFASDDLPSPTPSGDGDKSGDKGGDVFGEVSSFSASNK---IALPIVNQMP------- 491

Query: 485  ISQPMDISSVQALTTANNSAPASSGY-----NPVVKPNPVVKAPIKSRDPRLRF------ 533
             S+P  +SS      +++ A    GY     N V   N ++KA  KSRDPRL+F      
Sbjct: 492  -SRPSTVSS-----NSDSFAGGPPGYAKQIENSVSGSNHLLKATAKSRDPRLKFLNRDTG 545

Query: 534  --ASSNALNLNHQPAPILHNAPKVEPVGRVMSSRKQKTVEEPVLDGPALKRQRNGFENSG 591
              A +N      +P P   +  +    G  ++SRK K V+EP++D  ALKR R      G
Sbjct: 546  GVADANRRVNFAEPNP---SKDRTMGGGVSINSRKNKAVDEPMVDENALKRSR------G 596

Query: 592  VVRDEKNI--YGSGGWLED--------TDMFEPQIMNRNLLVDSAESNSRKLDNGAT--- 638
            V+ + +++   G GGW +D        +D F+P   N+N  + +  + +  +   +T   
Sbjct: 597  VIGNLRDMQPTGRGGWAKDGGNISSYSSDGFQP---NQNTRLGNNTTGNHNIRTDSTLAS 653

Query: 639  -----SPITSGTPNVVVS-GNEPAPATTPSTTVSLPALLKDIAVNPTMLLNILKMGQQQK 692
                 +  +  +P +V +     AP T+ +  VSLPA+LKDIAVNPTML+  ++M QQ+ 
Sbjct: 654  NLNNTTNNSGTSPGIVQAPQTNSAPQTSSAPAVSLPAMLKDIAVNPTMLMQWIQMEQQKM 713

Query: 693  LAADAQQKSNDSSMNT------MHPPIPSSIPPVSVTCSIPS-----GILSKPM---DEL 738
             A++ QQK   S   T      M  P+  + P  +   ++PS      + S PM   ++ 
Sbjct: 714  SASEPQQKVTASVGMTSNVTPGMVLPL-GNAPKTTEVAAVPSVRPQVPMQSAPMHSQNDT 772

Query: 739  GKVRMKPRDPRRVLHGNALQRSGSLGP----EFKTDGPSAPCTQGSKENLNFQKQLGAPE 794
            G +RMKPRDPRR+LH N +Q++ ++ P    + K++G + P +Q SK++L  Q Q     
Sbjct: 773  GVIRMKPRDPRRILHSNIVQKNDTVPPVGVEQAKSNGTAPPDSQSSKDHLLNQDQ----- 827

Query: 795  AKPVLSQSVLQPDI-TQQFTKNLKHIADFMSVSQPLTSEPMVSQNSPIQPGQIKSGADMK 853
             K    Q++  P +      + +   A+ +S SQ   +  M    +  Q     + AD +
Sbjct: 828  -KAEQLQAIALPSLPVTSSARPVTMNANPVSNSQLAATALMPPHGNTKQTSSSVNKADPR 886

Query: 854  -AVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKAAIQKERTRRLEEQ 912
             A   N  +    T +GP   P    P S +GDV+HL +GYDDQQKA IQKER RR++EQ
Sbjct: 887  LAAGQNESNDDAATSTGPVTAPDAVPPASPYGDVDHLLDGYDDQQKALIQKERARRIKEQ 946

Query: 913  KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMW 972
             KMF+ARKLCLVLDLDHTLLNSAKF EVD +H EILRKKEEQDRE+  RHLF F HMGMW
Sbjct: 947  HKMFAARKLCLVLDLDHTLLNSAKFIEVDHIHGEILRKKEEQDRERAERHLFCFNHMGMW 1006

Query: 973  TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFD 1032
            TKLRPGIW FLE+ASKL+E+HLYTMGNK+YATEMAKVLDP G LFAGRVISRGDDGDPFD
Sbjct: 1007 TKLRPGIWNFLEKASKLYELHLYTMGNKVYATEMAKVLDPTGTLFAGRVISRGDDGDPFD 1066

Query: 1033 GDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPS 1092
             DERVPKSKDL+GVLGMESAVVIIDDSVRVWPHNK NLIVVERYTYFPCSRRQFGL GPS
Sbjct: 1067 SDERVPKSKDLDGVLGMESAVVIIDDSVRVWPHNKHNLIVVERYTYFPCSRRQFGLPGPS 1126

Query: 1093 LLEIDHDERSEDGTLASSLGVRQQLH 1118
            LLEID DER EDGTLASSL V +++H
Sbjct: 1127 LLEIDRDERPEDGTLASSLAVIERIH 1152


>gi|242068555|ref|XP_002449554.1| hypothetical protein SORBIDRAFT_05g019010 [Sorghum bicolor]
 gi|241935397|gb|EES08542.1| hypothetical protein SORBIDRAFT_05g019010 [Sorghum bicolor]
          Length = 1197

 Score =  645 bits (1664), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 452/1148 (39%), Positives = 597/1148 (52%), Gaps = 141/1148 (12%)

Query: 14   ASVEEISEEDFKIKQEEVVKVVKETKPIKVGGGEAAARVWTMRDLYNKYPAI---CRGYG 70
             S+EEIS +DF+      +                 +R W         PA+    R +G
Sbjct: 33   GSIEEISADDFRKDSSSALGGPAAAA-----AAGQRSRSWV------GPPAVGYMARNFG 81

Query: 71   PGLHNLAWAQAVQNKPLNEIFVMEAEQD------DVSKRSSPASSVASVNSGAAAGKDDK 124
               ++ AW+QAV+NKPL       +++D      D S        +    +  A     +
Sbjct: 82   HAFNSFAWSQAVRNKPLGLQPPPASDEDEVEHAVDASDGEKEEGEIEEGEAVEAEASPAR 141

Query: 125  KVVEKVVIDDSGDEIEKEEG--------ELEEGEIELDLESESNEKVSEQVKEEMKLINV 176
               E + +D   D +EK E           EE E+ LD    S       + EE++++++
Sbjct: 142  AQPETIDLDADADALEKSESLAGAVPASAAEEEEVNLDQRVGS-------ILEELEMVSI 194

Query: 177  ESIREALESVLRGDISFEGVCSKLEFTLESLRELVNE--NNVPTK--DALIQLAFSAVQS 232
            E            + SFEG C +L    E+L+ L  E  N  P    + L+Q AF  + +
Sbjct: 195  E----------EAEKSFEGACGRLHTCFENLKPLFQELENGSPMAILEPLMQQAFIGIDT 244

Query: 233  VHSVFCSMNHVLKEQNKEILSRLLSLIKSHEPPLFSSNQIKEMEAMLSSLVTRANDKEKD 292
            + +V  S N    EQNK  L + L  IK+    + +  Q  E+++ +  LV    D   D
Sbjct: 245  LTTVAISYNLPRSEQNKTTLLKSLFHIKNRYSDMLTPEQRDELDSRVRKLVFGEKDNVSD 304

Query: 293  MLAMHGVNGKDSNIVTENAVNDLNFKEKVP----LPVDSLMQNKPLEASKPGPPGYRSRG 348
                 G N          A+N L    +V     LP +S   N      +   P  R   
Sbjct: 305  PSTSSGTN----------AINVLAPSGQVSSSGGLPFESGAANPFSSLPRLEVPAKR--- 351

Query: 349  VLLPLLDPHKVHDVDSLPSPTRETTPSVPVQRALVVGDGMVKSWAAAAKLSHNAEVHKTP 408
             + PLLD H  +D +SLPSPTR+  P  PV +   +G G               E  K  
Sbjct: 352  -ISPLLDLHADYDENSLPSPTRDNAPPFPVPKP--IGFGAFPMVPEKLSFPERVEPAKNS 408

Query: 409  HYET--DALRAFSSYQQKFGRNSFFMNSELPSPTPSEESGDGDGDTGGEI-SSATAVDQP 465
             Y +  D L+A SSYQQK+G+ S F + +LPSPTPS + G    D GG+I S  ++   P
Sbjct: 409  LYPSLNDPLKAVSSYQQKYGQKSVFPSDDLPSPTPSGDEGKS-ADKGGDIFSEVSSFPVP 467

Query: 466  KPVNMPTLGQQPVSSQPMDISQPMDISSVQALTTANNSAPASSGYNPVVKPNPVVKAPIK 525
            K + +P+  Q P S       QP  +SS      +     A     PV  PN  +KA  K
Sbjct: 468  KSIALPSTSQMPAS-------QPSTVSSSGISYASGPPGFAKQIEQPVAGPNHAIKAASK 520

Query: 526  SRDPRLRFA---SSNALNLNHQPAPILHNAPKVEPVGRVMSSRKQKTVEEPVLDGPALKR 582
            SRDPRLRF    S+ A ++N +              G  + +RK K +++P +D   LKR
Sbjct: 521  SRDPRLRFLNRDSAGATDVNRRAN--FSELKDGNLGGASVGNRKHKAIDDPQVDENVLKR 578

Query: 583  QRNGFENSGVVRDEKNIYGSGGWLEDTDMFEPQIMNRNLLVDSAESNSRKLDNGATSPIT 642
             R G  N    RD                 +P      L+   A +NS  ++     P  
Sbjct: 579  FRGGTANP---RD----------------LQPTGNPNQLMNIRAPTNSSGINMKTLQPPQ 619

Query: 643  SGTPNVVVSGNEPAPATTPSTTVSLPALLKDIAVNPTMLLNILKMGQQQKLAADAQQ-KS 701
            +  P+V  +   P P+           LLKDIAVNPT+L+++++M  Q+K A++ Q   S
Sbjct: 620  TTAPHVSAAPAVPVPSM----------LLKDIAVNPTLLMHLIQMEHQKKSASETQGGMS 669

Query: 702  NDSSMNTMHPPI--PSSIPPVSVTCSIPSGILSKP--------MDELGKVRMKPRDPRRV 751
            +  S N +   +  P + P  +    +PS     P         ++ G +RMKPRDPRR+
Sbjct: 670  SGMSNNGIAGMVFTPGNAPKTTEAAQVPSVRPQVPAQTPSLNSQNDGGILRMKPRDPRRI 729

Query: 752  LHGNALQRSGSLGPE-FKTDGPSAPCTQGSKENLNFQKQLGAPEAKPVLSQSVLQPDITQ 810
            LH N  Q+S ++  E  KT+G + P +QG+K+      Q  +  ++P L  SV +P    
Sbjct: 730  LHNNVAQKSDAMVLEQVKTNGITQPDSQGTKD------QTSSMPSQPTLPSSVARP---- 779

Query: 811  QFTKNLKHIADFMSVSQPLTSEPMVSQNSPIQPGQIKSGADMKAVVTNHDDKQTGTGSGP 870
             FT N KH+ D +S SQ   +  M      +  G I       AV  N  +    T    
Sbjct: 780  -FT-NTKHV-DPVSNSQLAATAIMAPTQQAL--GSINKVDPRLAVEQNGQNADATTTDAS 834

Query: 871  EAGPVGAHPQSAWGDVEHLFEGYDDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHT 930
                    P S WG+++HL +GYDD+QKA IQKER RR+ EQ KMFSARKLCLVLDLDHT
Sbjct: 835  ATELEATQPVSPWGNLDHLLDGYDDKQKALIQKERARRITEQHKMFSARKLCLVLDLDHT 894

Query: 931  LLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLF 990
            LLNSAKF EV+P+H+E+LRKKEEQDR  P RHL+RF HM MWTKLRPGIW FLE+AS LF
Sbjct: 895  LLNSAKFIEVEPIHEEMLRKKEEQDRTLPERHLYRFHHMNMWTKLRPGIWNFLEKASNLF 954

Query: 991  EMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGME 1050
            E+HLYTMGNKLYATEMAKVLDP G LFAGRVISRGDDGDPFD DERVPKSKDL+GVLGME
Sbjct: 955  ELHLYTMGNKLYATEMAKVLDPTGTLFAGRVISRGDDGDPFDSDERVPKSKDLDGVLGME 1014

Query: 1051 SAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLASS 1110
            SAVVIIDDSVRVWPHN+ NLIVVERYTYFPCSRRQFGL GPSLLEID DER EDGTLASS
Sbjct: 1015 SAVVIIDDSVRVWPHNRHNLIVVERYTYFPCSRRQFGLPGPSLLEIDRDERPEDGTLASS 1074

Query: 1111 LGVRQQLH 1118
            L V +++H
Sbjct: 1075 LAVIERIH 1082


>gi|218185830|gb|EEC68257.1| hypothetical protein OsI_36281 [Oryza sativa Indica Group]
          Length = 1255

 Score =  634 bits (1635), Expect = e-178,   Method: Compositional matrix adjust.
 Identities = 454/1172 (38%), Positives = 625/1172 (53%), Gaps = 157/1172 (13%)

Query: 16   VEEISEEDFKIKQEEVVKVVKETKPIKVGGGEAAARVWTMRDLYNKYPAICRGYGPGLHN 75
            +EEIS +DFK   +E                +  +RVW     YN    I R Y P  H+
Sbjct: 57   LEEISADDFK---KESSAAGGVAAAAAAAAAQQRSRVWMG---YN----IPRSYAPAFHS 106

Query: 76   LAWAQAVQNKPLNEIFVMEAEQDDV---------SKRSSPASSVASVNSGAAAGKDDKKV 126
             AWAQAVQNKPL       A++D+V          K         +V +   +       
Sbjct: 107  FAWAQAVQNKPLVPRAADAADEDEVEHVVDTSDEEKEEGEIEEGEAVQTTTTSSSSPPCA 166

Query: 127  VEKVVIDDSGDEIEKEEGELEEGEIELDLESESNEKVS-----EQVKEEMKLINVESI-R 180
                 ID   D  EK E  +              E+V        + EE++++++E   +
Sbjct: 167  QPPETIDLDSDAPEKSESMVAMDGGGAAPAGAEEEEVDFDQRVGSILEELEMVSIEEAEK 226

Query: 181  EALESVLRGDI----------------------SFEGVCSKLEFTLESLRELVNENN--V 216
              L  +L G +                      SFEG C++L    E+L+ L  E+   +
Sbjct: 227  YGLMILLYGKVHVLDVFWCMIQLLRDPILIFCRSFEGACTRLRTCFENLKPLFPESGSPM 286

Query: 217  PTKDALIQLAFSAVQSVHSVFCSMNHVLKEQNKEILSRLLSLIKSHEPPLFSSNQIKEME 276
            P  DAL+Q AF  + ++ +V  S +   +EQ K +L +LL  IK+    + + +Q  E++
Sbjct: 287  PMLDALVQQAFVGIDTITTVANSYDMPKREQTKNMLLKLLFHIKNRYSDMLTPDQRDELD 346

Query: 277  AMLSSLVTRANDKEKDMLAMHGVNGKDSNIVTENAVNDLNFKEKVPLPVDSLMQNKPLEA 336
            + +  LV    +  KD       NG ++      A +     E+  LP +S   N   + 
Sbjct: 347  SRVRQLVF---EDGKD-----NANGPNATSTNAAAPSGQVLSER--LPFESGAGNSFSKV 396

Query: 337  SKPGPPGYRSRGVLLPLLDPHKVHDVDSLPSPTRETTPSVPVQRALVVGDGMVKSWAAAA 396
              P     ++R ++ PLLD H  +D +SLPSPTR++ P   V +   +G G +       
Sbjct: 397  EIPA----KNR-MVSPLLDLHADYDENSLPSPTRDSAPPFDVPKP--IGYGALPMAPDRP 449

Query: 397  KLSHNAEVHKTPHYET--DALRAFSSYQQKFGRNSFFMNSELPSPTPS---EESGDGDGD 451
             +    E  K   Y++  DAL+A   YQQK G+ S F + +LPSPTPS   ++SGD  GD
Sbjct: 450  SVLERVEPAKNSSYQSFNDALKAVCYYQQKHGQKSNFASDDLPSPTPSGDGDKSGDKGGD 509

Query: 452  TGGEISSATAVDQPKPVNMPTLGQQPVSSQPMDISQPMDISSVQALTTANNSAPASSGY- 510
              GE+SS +A ++   + +P + Q P        S+P  +SS      +++ A    GY 
Sbjct: 510  VFGEVSSFSASNK---IVLPIVNQMP--------SRPSTVSS-----NSDSFAGGPPGYA 553

Query: 511  ----NPVVKPNPVVKAPIKSRDPRLRFASSNALNLNHQPAPILHNAPKVEPVGRVMSSRK 566
                N V   N ++KA  KSRDPRL+F + +                     G V  + +
Sbjct: 554  KQIENSVSGSNHLLKATAKSRDPRLKFLNRD--------------------TGGVADANR 593

Query: 567  QKTVEEPVLDGPALKRQRNGFENSGVVRDEKNIYGSGGWLEDTDMFEPQIMNRNLLVDSA 626
            +    EP           N  +    V + ++I  +    ++T +      N N+  DS 
Sbjct: 594  RVNFAEP-----------NPLKIGPWVVEYQSIAPN----QNTRLGNNTTGNHNIRTDST 638

Query: 627  ESNSRKLDNGATSPITSGTPNVVVS-GNEPAPATTPSTTVSLPALLKDIAVNPTMLLNIL 685
             +++       T+  +  +P +V +     AP T+ +  VSLPA+LKDIAVNPTML+  +
Sbjct: 639  LASNLN----NTTNNSGTSPGIVQAPQTNSAPQTSSAPAVSLPAMLKDIAVNPTMLMQWI 694

Query: 686  KMGQQQKLAADAQQKSNDSSMNT------MHPPIPSSIPPVSVTCSIPS-----GILSKP 734
            +M   +  A++ QQK   S   T      M  P+  + P  +   ++PS      + S P
Sbjct: 695  RMEHHKMSASEPQQKVTASVGMTSNVTPGMVLPL-GNAPKTTEVAAVPSVRPQVPMQSAP 753

Query: 735  M---DELGKVRMKPRDPRRVLHGNALQRSGSLGP----EFKTDGPSAPCTQGSKENLNFQ 787
            M   ++ G +RMKPRDPRR+LH N +Q++ ++ P    + K++G + P +Q SK++L  Q
Sbjct: 754  MHSQNDTGVIRMKPRDPRRILHSNIVQKNDTVPPVGVEQAKSNGTAPPDSQSSKDHLLNQ 813

Query: 788  KQLGAPEAKPVLSQSVLQPDITQQFTKNLKHIADFMSVSQPLTSEPMVSQNSPIQPGQIK 847
             Q  A + + +   S+      +  T N    A+ +S SQ   +  M    +  Q     
Sbjct: 814  DQ-KAEQLQAIALPSLPVTSSARPVTMN----ANPVSNSQLAATALMPPHGNTKQTSSSV 868

Query: 848  SGADMK-AVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKAAIQKERT 906
            + AD + A   N  +    T +GP   P    P S +GDV+HL +GYDDQQKA IQKER 
Sbjct: 869  NKADPRLAAGQNESNDDAATSTGPVTAPDAVPPASPYGDVDHLLDGYDDQQKALIQKERA 928

Query: 907  RRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRF 966
            RR++EQ KMF+ARKLCLVLDLDHTLLNSAKF EVD +H EILRKKEEQDRE+  RHLF F
Sbjct: 929  RRIKEQHKMFAARKLCLVLDLDHTLLNSAKFIEVDHIHGEILRKKEEQDRERAERHLFCF 988

Query: 967  PHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD 1026
             HMGMWTKLRPGIW FLE+ASKL+E+HLYTMGNK+YATEMAKVLDP G LFAGRVISRGD
Sbjct: 989  NHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKVYATEMAKVLDPTGTLFAGRVISRGD 1048

Query: 1027 DGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQF 1086
            DGDPFD DERVPKSKDL+GVLGMESAVVIIDDSVRVWPHNK NLIVVERYTYFPCSRRQF
Sbjct: 1049 DGDPFDSDERVPKSKDLDGVLGMESAVVIIDDSVRVWPHNKHNLIVVERYTYFPCSRRQF 1108

Query: 1087 GLLGPSLLEIDHDERSEDGTLASSLGVRQQLH 1118
            GL GPSLLEID DER EDGTLASSL V +++H
Sbjct: 1109 GLPGPSLLEIDRDERPEDGTLASSLTVIERIH 1140


>gi|413920930|gb|AFW60862.1| hypothetical protein ZEAMMB73_799152, partial [Zea mays]
          Length = 1234

 Score =  633 bits (1632), Expect = e-178,   Method: Compositional matrix adjust.
 Identities = 457/1181 (38%), Positives = 617/1181 (52%), Gaps = 154/1181 (13%)

Query: 8    GEISD---TASVEEISEEDFKIKQEEVVKVVKETKPIKVGGGEAAARVWTMRDLYNKYPA 64
            GE SD   + S+EEI+ +DFK       K                +R W         PA
Sbjct: 25   GEGSDRDSSGSIEEITADDFK-------KDSSSALGGAAAAAGPRSRSWVAP------PA 71

Query: 65   I---CRGYGPGLHNLAWAQAVQNKPLN-----------EIFVMEAEQDDVSKRSSPASSV 110
            +    R +    ++ AW+QAV+NKPL            E  V  ++ +          +V
Sbjct: 72   VGYMARNFRYAFNSFAWSQAVRNKPLGLQPPAPDDDEVEHAVDVSDGEKEEGEIEEGEAV 131

Query: 111  ASVNSGAAA-------GKDDKKVVEKVVIDDSGDEIEKEEGELEEGEIELDLESESNEKV 163
             ++ S A A         D  +  E V ID S   +       EE E+ LD    S    
Sbjct: 132  EALASPAPAQPETIDLDSDAPEKSESVAIDGSASVVPVPAA--EEEEVNLDQRVGS---- 185

Query: 164  SEQVKEEMKLINVESIREALESVLRGDI-------SFEGVCSKLEFTLESLRELVN--EN 214
               + EE++++++E   + +       +       SFEG C++L    E+L+ L    EN
Sbjct: 186  ---ILEELEMVSIEEAEKYMGICFMFFLEQRLCFRSFEGACARLHTCFENLKPLFQELEN 242

Query: 215  NVPTK--DALIQLAFSAVQSVHSVFCSMNHVLKEQNKEILSRLLSLIKSHEPPLFSSNQI 272
              P    + L+Q AF  + ++ +V    N   +EQNK  L +LL  IK+    + +  Q 
Sbjct: 243  GSPMAILEPLMQQAFIGIDTLTTVANLYNLPRREQNKTTLLKLLFHIKNRYSDMLTPEQR 302

Query: 273  KEMEAMLSSLVTRANDKEKDMLAMHGVNGKDSNIVTENAVNDLNFKEKVPLPVDSLMQNK 332
            +EM++ +  LV    D   D     G +  + +  +    N         LP +S   N 
Sbjct: 303  EEMDSRVRKLVFGEKDNVSDPSTSCGTSAINVSAPSGQVSNTGG------LPFESGAANL 356

Query: 333  PLEASKPGPPGYRSRGVLLPLLDPHKVHDVDSLPSPTRETTPSVPVQRALVVGDGMVKSW 392
                 +   P  R+     PLL+ H  +D +SLPSPTR+  P  P  +   +G G     
Sbjct: 357  FSSLPRLEVPAKRNS----PLLNLHADYDENSLPSPTRDNAPPFPALKP--IGFGAFPMV 410

Query: 393  AAAAKLSHNAEVHKTPHYE--TDALRAFSSYQQKFGRNSFFMNSELPSPTPSEESGDGDG 450
                      E  K   Y    D L+A SSYQQK+G+ S + + +LPSPTPS + G    
Sbjct: 411  PEKLSFLDRVEPTKNSLYPPLNDPLKAVSSYQQKYGQKSVYPSDDLPSPTPSGDEGK-PA 469

Query: 451  DTGGEI-SSATAVDQPKPVNMPTLGQQPVSSQP---------------MDISQPMDISSV 494
            D GG+I S  ++   PK + +P+  Q P S                  M  SQP+ +SS 
Sbjct: 470  DKGGDIFSDVSSFPVPKSIVLPSTSQMPASQPSTVSSSSISYASSTSQMAASQPITVSSS 529

Query: 495  QALTTANNSAPASSGYNPVVKPNPVVKAPIKSRDPRLRFA---SSNALNLNHQPAPILHN 551
                 +     A         PN  +KA  KSRDPRLRF    S+ A ++N +      +
Sbjct: 530  GISYASGPPGFAKQIEQSTAGPNHAIKAASKSRDPRLRFLNRDSAGATDVNWRAN---FS 586

Query: 552  APKVEPVGRV-MSSRKQKTVEEPVLDGPALKRQRNGFENSGVVRDEKNIYGSGGWLEDTD 610
              K   +G V + +RKQK V++P +D  ALKR R G  N      ++++  +G       
Sbjct: 587  ELKDGNLGGVSVGNRKQKAVDDPQVDDNALKRFRGGIAN------QRDMQPTGN------ 634

Query: 611  MFEPQIMNRNLLVDSAESNSRKLDNGATSPITSGTPNVVVSGNEPAPATTPSTTVSLPAL 670
                Q+MN      S+  N + L      P  +  P+V  +   P P          P L
Sbjct: 635  --PNQLMNIRAPTHSSSINMKTL-----QPPQTTAPHVSAAPAVPLP----------PML 677

Query: 671  LKDIAVNPTMLLNILKMGQQQKLAADAQQKSNDSSMNTMHPPI---PSSIPPVSVTCSIP 727
            LKDIAVNP +L+++++M  Q+K A+++Q   +    N     +   P + P ++    +P
Sbjct: 678  LKDIAVNPALLMHLIQMEHQKKSASESQGGMSSGMTNNGIAGMVFTPGNAPKITEAAQVP 737

Query: 728  S-----GILSKPM---DELGKVRMKPRDPRRVLHGNALQRSGSLGPE-FKTDGPSAPCTQ 778
            S      + + P+   ++ G VRMKPRDPRR+LH N  Q+S ++  E  K +G + P +Q
Sbjct: 738  SVRPQVPVQTPPLNSQNDGGIVRMKPRDPRRILHNNIAQKSDAMSLEQVKNNGTTQPDSQ 797

Query: 779  GSKENLNFQKQLGAPEAKPVLSQSVLQPDITQQFTKNLKHIADFMSVSQPLTSEPMVSQN 838
            G+K+           +  PV SQ  L   I + F+ + KH+ D +S SQ   +  M    
Sbjct: 798  GTKD-----------QTTPVPSQPALPSSIARPFS-SAKHV-DPVSNSQLAATAIM---- 840

Query: 839  SPIQPGQIKSGADMKAVVTNHDDKQTGTGSGPEAGPVGA-HPQSAWGDVEHLFEGYDDQQ 897
            +P Q     +  D +  V  +      T +G  A  + A  P S WGDV+HL +GYDDQQ
Sbjct: 841  APTQALSSVNKVDPRLAVEQNGQNADATTNGASATTLEATQPVSPWGDVDHLLDGYDDQQ 900

Query: 898  KAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDRE 957
            KA IQKER RR+ EQ KMFSARKLCLVLDLDHTLLNSAKF EV+P+H+E+LRKKEEQDR 
Sbjct: 901  KALIQKERARRITEQHKMFSARKLCLVLDLDHTLLNSAKFIEVEPIHEEMLRKKEEQDRT 960

Query: 958  KPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLF 1017
             P RHL+RF HM MWTKLRPGIW FL++AS LFE+HLYTMGNKLYATEMAKVLDP G LF
Sbjct: 961  LPERHLYRFHHMNMWTKLRPGIWNFLQKASNLFELHLYTMGNKLYATEMAKVLDPTGTLF 1020

Query: 1018 AGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYT 1077
            AGRVISRGDDGDPFD DERVPKSKDL+GVLGMESAVVIIDDSVRVWPHN+ NLIVVERYT
Sbjct: 1021 AGRVISRGDDGDPFDSDERVPKSKDLDGVLGMESAVVIIDDSVRVWPHNRHNLIVVERYT 1080

Query: 1078 YFPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVRQQLH 1118
            YFPCSRRQFGL GPSLLEID DER EDGTLASSL V +++H
Sbjct: 1081 YFPCSRRQFGLPGPSLLEIDRDERPEDGTLASSLAVIERIH 1121


>gi|357156660|ref|XP_003577532.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            3-like [Brachypodium distachyon]
          Length = 1259

 Score =  625 bits (1613), Expect = e-176,   Method: Compositional matrix adjust.
 Identities = 458/1183 (38%), Positives = 619/1183 (52%), Gaps = 150/1183 (12%)

Query: 8    GEISD---TASVEEISEEDFKIKQEEVVKVVKETKPIKVGGGEAAARVWTMRDLYNKYPA 64
            GE SD   + S+EEI+  DF+ +                      +RVW           
Sbjct: 40   GETSDEDSSESLEEITAADFQKESSGGAAAGTAAS-----AAAQRSRVWMGY-------T 87

Query: 65   ICRGYGPGLHNLAWAQAVQNKPLNEIFVMEAEQDDVSKRSSPASSVASVNSGAAAGKDDK 124
            + R Y P  H+ AWAQAVQNKPL  +    A++D+V      +               D 
Sbjct: 88   MSRSYAPAFHSFAWAQAVQNKPL--VPPPAADEDEVEHIVDTSDEEKEEGEIEEGEAVDT 145

Query: 125  KV----VEKVVIDDSGDEIEKEEGELEEGE----IELDLESESNEKVSEQVKEEMKLINV 176
                   +   ID   D  EK E    EG     + ++ E + +++V   + EE++++++
Sbjct: 146  SFPSPHAQPETIDLDSDVPEKSESMAVEGSNTAAVAVEEEVDFDQRVGS-ILEELEMVSI 204

Query: 177  ESIREALESVLRGDISFEGVCSKLEFTLESLRELVNENN--VPTKDALIQLAFSAVQSVH 234
            E            + SFEG C +L    E+L+ L  E+   +P  DAL+Q  F  + ++ 
Sbjct: 205  E----------EAEKSFEGACERLRTCFENLKPLFLESGSPMPMLDALVQQGFVGIDTIT 254

Query: 235  SVFCSMNHVLKEQNKEILSRLLSLIKSHEPPLFSSNQIKEMEAMLSSLVTRANDKEKDML 294
            +V  S     + QNKE+L +LL  +++    + + +Q  E+++ +  L     ++  D  
Sbjct: 255  TVANSYAMPKRVQNKEMLLKLLFHLRNRYSDMLTPDQRVELDSRVRQLAFVDGEENTD-- 312

Query: 295  AMHGVNGKDSNIVTENAVNDLNFKEKVP---LPVDSLMQNKPLEASKPGPPGYRSRGVLL 351
                  G +++  T N+ N +    +VP   LP +S   N    +S P         ++ 
Sbjct: 313  ------GPNASCST-NSTNVVVPTGQVPSERLPFESGATNPFSGSSLPWLETQTKNRMVS 365

Query: 352  PLLDPHKVHDVDSLPSPTRETTPSVPVQRALVVGDGMVKSWAAAAKLSHNAEVHKTPHYE 411
            PLLD H  HD +SLPSPTR+  P   V + +  G            L+  AE  K   Y 
Sbjct: 366  PLLDLHADHDENSLPSPTRDNAPQFSVPKPIGFG---AFPMGPDRSLTERAEPSKKNLYP 422

Query: 412  T--DALRAFSSYQQKFGRNSFFMNSELPSPTPS---EESGDGDGDTGGEISSATAVDQPK 466
            +  D+L   SSY+QK+ + S F N +LPSPTPS   ++S D DGD  GEISS ++ ++  
Sbjct: 423  SVNDSLDV-SSYKQKYSQKSNFANDDLPSPTPSGDGDKSEDKDGDMFGEISSFSSSNK-- 479

Query: 467  PVNMPTLGQQPVSSQPMDISQPMDISSVQALTTANNSAPASSGY-----NPVVKPNPVVK 521
               +P++ Q P        S+P  +SS      +N S     GY       V  PN  +K
Sbjct: 480  -TALPSVSQIPA-------SRPSTVSS------SNGSFSGPPGYAKKIEQSVSGPNLALK 525

Query: 522  APIKSRDPRLRFASSNALNLNHQPAPILHNAPKVEPVGRVMSSR-KQKTVEEPVLDGPAL 580
               KSRDPRLR+       LN  P          EP   +  +  K K V +P++D   +
Sbjct: 526  PSAKSRDPRLRY-------LNRDPGDANRCMNFAEPNASLGGTLGKHKAVGQPLMDENMV 578

Query: 581  KRQRNGFENSGVV-----RDEKNI--YGSGGWL--EDTDMFEPQIMNRNLLVDSA-ESNS 630
            KR R    N   +     RD  NI  Y S      ++T +      N NL  DS   SN 
Sbjct: 579  KRARGSIGNPRDLQVPPGRDGSNISFYPSDRVQSNQNTRLDTKTTGNPNLRADSQLLSNV 638

Query: 631  RKLDNG---ATSPITSGTPNVVVSGNEPAPATTPSTTVSLPALLKDIAVNPTMLLNILKM 687
              + N    +T  + +G P+ V       P T+ + +VSLPA+LKDIAVNPT+L++ ++M
Sbjct: 639  SSITNSSVTSTKTLNAGQPDSV-------PQTSAAPSVSLPAVLKDIAVNPTVLMHWIQM 691

Query: 688  GQQQKLAADAQQKS-----------NDSSMNTMHPP-------IPSSIPPVSVTCSIPSG 729
             QQ++ A++ QQ             N+ +   + PP         + IP +   C   + 
Sbjct: 692  EQQKRSASEPQQTVNTLGGISSGMINNDTAGMVIPPGSALKTADAAQIPSIRPQCPTQTA 751

Query: 730  ILSKPMDELGKVRMKPRDPRRVLHGNALQRSGSLGPE-FKTDGPSAPCTQGSKENL---- 784
             +    D  G +RMKPRDPRR+LH N   ++ +   E  +++G   P +Q SK+N+    
Sbjct: 752  PVISQTDA-GVIRMKPRDPRRILHNNTSPKNDTTNSEQARSNGIVLPVSQDSKDNMINRE 810

Query: 785  --NFQKQLGAPEAKPVLSQSVLQPDITQQFTKNLKHIADFMSVSQPLTSEPMVSQNSPIQ 842
                Q Q GA  ++PV   ++ +P            + D +S SQ   S  M  Q +   
Sbjct: 811  QQAEQLQTGALPSQPVSLSNIARPSTMS------ASMVDPVSNSQLAASSLMAPQQT--- 861

Query: 843  PGQIKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKAAIQ 902
             G I       A   N  +    T + P      A P + WGD++ L  GYDDQQKA IQ
Sbjct: 862  SGSINRADPRLAPGQNDPNADAATNASPATTLGAAPPANQWGDLDDLLSGYDDQQKALIQ 921

Query: 903  KERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRH 962
            KER RR+ EQ+KMFSARKLCLVLDLDHTLLNSAKF EVDP+H+EILRKKEEQDRE+P RH
Sbjct: 922  KERARRIMEQQKMFSARKLCLVLDLDHTLLNSAKFLEVDPIHEEILRKKEEQDRERPERH 981

Query: 963  LFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVI 1022
            LFR  HM MWTKLRPGIW FLE+ASKL+E+HLYTMGNKLYATEMAKVLDP G LF GRVI
Sbjct: 982  LFRLHHMSMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPTGALFEGRVI 1041

Query: 1023 SRGDDGDP-------FDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVER 1075
            SRG DG         FD D+RVPKSKDL+GVLGMESAVVIIDDSVRVWPHNK N+IVVER
Sbjct: 1042 SRGGDGTSRGGDGDSFDSDDRVPKSKDLDGVLGMESAVVIIDDSVRVWPHNKNNMIVVER 1101

Query: 1076 YTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVRQQLH 1118
            YTYFPCSRRQFGL GPSLLEID DER EDGTLASSL V  ++H
Sbjct: 1102 YTYFPCSRRQFGLPGPSLLEIDRDERPEDGTLASSLAVIGRIH 1144


>gi|326532556|dbj|BAK05207.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 891

 Score =  548 bits (1413), Expect = e-153,   Method: Compositional matrix adjust.
 Identities = 360/808 (44%), Positives = 468/808 (57%), Gaps = 76/808 (9%)

Query: 352  PLLDPHKVHDVDSLPSPTRETTPSVPVQRALVVGDGMVKSWAAAAKLS-HNAEVHKTPHY 410
            PLLD H  +D  SLPSPTR++ P  PV + +  G   V   A     S    E+ K   Y
Sbjct: 4    PLLDLHADYDESSLPSPTRDSAPPFPVPKPIGFG---VFPMAPDRYFSVERVELSKKVLY 60

Query: 411  ET--DALRAFSSYQQKFGRNSFFMNSELPSPTPSEESGDGDGDTGGEISSATAVDQPKPV 468
             +  DAL+  SSY+QK+G+ S F + +LPSPTPS++ GD   D  G I           V
Sbjct: 61   PSVNDALKDVSSYRQKYGQTSTFASYDLPSPTPSDD-GDKSEDKDGGIF----------V 109

Query: 469  NMPTLGQQPVSSQPMDISQPMDISSVQALTTANNSAPASSGY-----NPVVKPNPVVKAP 523
             +P+      S+ P     P    SV  +++ ++ A    GY       V  P+  +K  
Sbjct: 110  EVPSFSDSNKSAPPSGNLLPASRPSV-VISSNDSFAGGPPGYAKQIEQSVSGPSHALKPS 168

Query: 524  IKSRDPRLRFA---SSNALNLNHQPAPILHNAPKVEPVGRVMS--SRKQKTVEEPVLDGP 578
             KSRDPRLRF    S    + N        NA +   +  V+S  SRK K   +P++D  
Sbjct: 169  AKSRDPRLRFLNRDSGGTADANRHVNLAEPNASRDGTLWGVVSDNSRKHKATGQPLIDES 228

Query: 579  ALKRQRNGFENSGVVRDEKNIYGSGGWLEDTDMFEPQIMNRNLLVDSAESNSRKLDNGAT 638
             LKR R   E +G  RD +   G  G    +   +    N++  +++  + ++ + N ++
Sbjct: 229  VLKRAR---ECAGNPRDMQVPPGRDGSNISSYSGDRVQSNQHTWLETKTAGNQLISNVSS 285

Query: 639  SPITSGTPNVVVSGNEPAPATTPSTTVSLPALLKDIAVNPTMLLNILKMGQQQKLAADAQ 698
             P ++G   +  S     P T+ +  VSLPA+LKDIAVNPT+L++ ++M  Q++  ++ Q
Sbjct: 286  IPDSTGA--LHASQPNSFPQTSAAPIVSLPAVLKDIAVNPTVLMHWIQMEHQKRSPSEPQ 343

Query: 699  QKS--------NDSSMNTMHPP-----------IPSSIPPVSVTCSIPSGILSKPMDELG 739
              S        N+ +   +  P           IPS  P  + T S+ S       ++ G
Sbjct: 344  PASGIISSGMINNVTAGMVISPGNALKTAEVAHIPSYRPQATSTASVNS------QNDPG 397

Query: 740  KVRMKPRDPRRVLHGNALQRSGSLGP-EFKTDGPSAPCTQGSKENLNFQKQLGAPEAKPV 798
             +RMK RDPRRVLH N   ++ +    + K++G + P  Q SK+NL  ++Q+       V
Sbjct: 398  VIRMKSRDPRRVLHNNTSHKNDTPNSDQAKSNGIALPANQDSKDNLINREQVAEQLQTIV 457

Query: 799  L-SQSVLQPDITQQFTKNLKHIADFMSVSQPLTSEPMVSQNSPIQPGQIKSGADMKAVVT 857
            L SQ V    I +Q T +   + D +S SQ   S  +  Q + +   +    AD +    
Sbjct: 458  LPSQPVSSSSIARQSTMSASKV-DSVSNSQLAASSLIAPQETLVSINR----ADPRVATG 512

Query: 858  NHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKAAIQKERTRRLEEQKKMFS 917
             +D       +     P    P + WGD++ L  GYDDQQKA IQKER RR+ EQ  MFS
Sbjct: 513  QNDSNDAAPATTLGTRP----PANQWGDLDDLLNGYDDQQKALIQKERARRIMEQHTMFS 568

Query: 918  ARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRP 977
            +RKLCLVLDLDHTLLNSAKF EVDP+H+EIL KKEEQDRE+  RHLFRF HM MWTKLRP
Sbjct: 569  SRKLCLVLDLDHTLLNSAKFIEVDPIHEEILWKKEEQDRERSERHLFRFHHMQMWTKLRP 628

Query: 978  GIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV-------ISRGDDGDP 1030
            GIW FLE+ASKL+E+HLYTMGNKLYATEMAKVLDP G LFAGRV       ISRG DGD 
Sbjct: 629  GIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPSGTLFAGRVISRGGDGISRGGDGDT 688

Query: 1031 FDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLG 1090
            FD D+RVPKSKDL+GVLGMESAVVIIDDSVRVWPHNK N+IVVERYTYFPCSRRQFGL G
Sbjct: 689  FDSDDRVPKSKDLDGVLGMESAVVIIDDSVRVWPHNKNNMIVVERYTYFPCSRRQFGLPG 748

Query: 1091 PSLLEIDHDERSEDGTLASSLGVRQQLH 1118
            PSLLEID DER EDGTLASSL V  ++H
Sbjct: 749  PSLLEIDRDERPEDGTLASSLAVIGRIH 776


>gi|115485681|ref|NP_001067984.1| Os11g0521900 [Oryza sativa Japonica Group]
 gi|113645206|dbj|BAF28347.1| Os11g0521900 [Oryza sativa Japonica Group]
          Length = 664

 Score =  517 bits (1332), Expect = e-143,   Method: Compositional matrix adjust.
 Identities = 292/554 (52%), Positives = 367/554 (66%), Gaps = 45/554 (8%)

Query: 601  GSGGWLED--------TDMFEPQIMNRNLLVDSAESNSRKLDNGAT--------SPITSG 644
            G GGW +D        +D F+P   N+N  + +  + +  +   +T        +  +  
Sbjct: 5    GRGGWAKDGGNISSYSSDGFQP---NQNTRLGNNTTGNHNIRTDSTLASNLNNTTNNSGT 61

Query: 645  TPNVVVS-GNEPAPATTPSTTVSLPALLKDIAVNPTMLLNILKMGQQQKLAADAQQKSND 703
            +P +V +     AP T+ +  VSLPA+LKDIAVNPTML+  ++M QQ+  A++ QQK   
Sbjct: 62   SPGIVQAPQTNSAPQTSSAPAVSLPAMLKDIAVNPTMLMQWIQMEQQKMSASEPQQKVTA 121

Query: 704  SSMNT------MHPPIPSSIPPVSVTCSIPS-----GILSKPM---DELGKVRMKPRDPR 749
            S   T      M  P+  + P  +   ++PS      + S PM   ++ G +RMKPRDPR
Sbjct: 122  SVGMTSNVTPGMVLPL-GNAPKTTEVAAVPSVRPQVPMQSAPMHSQNDTGVIRMKPRDPR 180

Query: 750  RVLHGNALQRSGSLGP----EFKTDGPSAPCTQGSKENLNFQKQLGAPEAKPVLSQSVLQ 805
            R+LH N +Q++ ++ P    + K++G + P +Q SK++L  Q Q  A + + +   S+  
Sbjct: 181  RILHSNIVQKNDTVPPVGVEQAKSNGTAPPDSQSSKDHLLNQDQ-KAEQLQAIALPSLPV 239

Query: 806  PDITQQFTKNLKHIADFMSVSQPLTSEPMVSQNSPIQPGQIKSGADMK-AVVTNHDDKQT 864
                +  T N    A+ +S SQ   +  M    +  Q     + AD + A   N  +   
Sbjct: 240  TSSARPVTMN----ANPVSNSQLAATALMPPHGNTKQTSSSVNKADPRLAAGQNESNDDA 295

Query: 865  GTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKAAIQKERTRRLEEQKKMFSARKLCLV 924
             T +GP   P    P S +GDV+HL +GYDDQQKA IQKER RR++EQ KMF+ARKLCLV
Sbjct: 296  ATSTGPVTAPDAVPPASPYGDVDHLLDGYDDQQKALIQKERARRIKEQHKMFAARKLCLV 355

Query: 925  LDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLE 984
            LDLDHTLLNSAKF EVD +H EILRKKEEQDRE+  RHLF F HMGMWTKLRPGIW FLE
Sbjct: 356  LDLDHTLLNSAKFIEVDHIHGEILRKKEEQDRERAERHLFCFNHMGMWTKLRPGIWNFLE 415

Query: 985  RASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLE 1044
            +ASKL+E+HLYTMGNK+YATEMAKVLDP G LFAGRVISRGDDGDPFD DERVPKSKDL+
Sbjct: 416  KASKLYELHLYTMGNKVYATEMAKVLDPTGTLFAGRVISRGDDGDPFDSDERVPKSKDLD 475

Query: 1045 GVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERSED 1104
            GVLGMESAVVIIDDSVRVWPHNK NLIVVERYTYFPCSRRQFGL GPSLLEID DER ED
Sbjct: 476  GVLGMESAVVIIDDSVRVWPHNKHNLIVVERYTYFPCSRRQFGLPGPSLLEIDRDERPED 535

Query: 1105 GTLASSLGVRQQLH 1118
            GTLASSL V +++H
Sbjct: 536  GTLASSLAVIERIH 549


>gi|357478637|ref|XP_003609604.1| RNA polymerase II C-terminal domain phosphatase-like protein
            [Medicago truncatula]
 gi|355510659|gb|AES91801.1| RNA polymerase II C-terminal domain phosphatase-like protein
            [Medicago truncatula]
          Length = 1064

 Score =  405 bits (1041), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 192/254 (75%), Positives = 209/254 (82%), Gaps = 7/254 (2%)

Query: 867  GSGPEAGPVG-AHPQSAWG-DVEHLFEGYDDQQKAAIQKERTRRLEEQKKMFSARKLCLV 924
            GS  E    G   P + W  +VEHL EGYD QQKA IQ+ER RRLEEQ KMF+ARKLCLV
Sbjct: 691  GSAHETCASGSCQPHNTWAANVEHLLEGYDAQQKAVIQRERARRLEEQNKMFAARKLCLV 750

Query: 925  LDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLE 984
            LD+DHTLLNSAKF EVDP HD+ILRKKE+Q+R KP RHLFR PHMGMWTKLRPG+W FLE
Sbjct: 751  LDIDHTLLNSAKFVEVDPEHDKILRKKEKQERGKPRRHLFRLPHMGMWTKLRPGVWNFLE 810

Query: 985  RASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLE 1044
            +ASKLFEMHLYTMGNKLYATEMAKVLDP GVLFAGRVISRGDD +  D      K KDLE
Sbjct: 811  KASKLFEMHLYTMGNKLYATEMAKVLDPNGVLFAGRVISRGDDPETVD-----IKCKDLE 865

Query: 1045 GVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERSED 1104
            GVLG+ES+VVIIDDS RVWPHN+LNLI VERY YF CSRRQFGL GPSL EIDHDER   
Sbjct: 866  GVLGLESSVVIIDDSPRVWPHNQLNLITVERYIYFLCSRRQFGLSGPSLFEIDHDERPGA 925

Query: 1105 GTLASSLGVRQQLH 1118
            GTLASSLGV +++H
Sbjct: 926  GTLASSLGVIERIH 939



 Score =  245 bits (625), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 255/803 (31%), Positives = 381/803 (47%), Gaps = 174/803 (21%)

Query: 3   KDVEEGEISDTASVEEISEEDFKIKQEEVVKVVKETKP-------IKVGGGEAAARVWTM 55
           +DVEEGEISDT+SV+ I E+D   K + VVKV  + K        IK  G    +RV   
Sbjct: 6   EDVEEGEISDTSSVKVIIEKDLN-KVDHVVKVDSDVKSNNNNIDKIKTCGN---SRVL-- 59

Query: 56  RDLYNKYPAICRGYGPGLHNLAWAQAVQNKPLNEIFVMEAEQDDVSKRSSPASSVASVNS 115
            DL N Y +     G GL+NLAWAQAVQNKPLN+IF ME ++D         +S  + N+
Sbjct: 60  -DLQNFYSSCYYASGGGLYNLAWAQAVQNKPLNDIFAMEIDKD----TDVNVTSNTNSNN 114

Query: 116 GAAAGKDDKKVVEKVVIDDSGDEIEKEEGELEE-GEIELDLESESNEKVSEQ----VKEE 170
                K  K+V+     D    E+E+ E ++++     +    +S E VSE     V++ 
Sbjct: 115 NDDLNKPLKEVIFVDDDDKEEGELEEGEIDVDDDTNCAIVGGGDSFENVSESDVIGVRDV 174

Query: 171 MKLINVESIREALESVLRGDISFEGVCSKLEFTLESLRELVNENNVPTKDALIQLAFSAV 230
           +K I+V ++ E          SF   C++++  L+S  ++ +      KD L+ L F+AV
Sbjct: 175 LKCISVANVSE----------SFAETCTRIQSALQS--KVFSGIAGSEKDDLVCLLFNAV 222

Query: 231 QSVHSVFCSMNHVLKEQNKEILSRLLSLIKSHEPPLFSSNQIKEMEAMLSSLVTRANDKE 290
           + V+SV  SM++  KE+NK+ + RLLS +K+    LF+   +KE+ AM++ +    N   
Sbjct: 223 EVVYSVLFSMDNFQKEENKDNILRLLSFLKNERAHLFTPEHMKEIHAMITLVDVSGN--- 279

Query: 291 KDMLAMHGVNGKDSNIVTENAVNDLNFKEKVPLPVDSLMQNKPLEASKPGPPGYRSRGVL 350
                                 ++ N +EK    +++L + + +        G R     
Sbjct: 280 ----------------------SEANSEEK---KLEALDETRKI-------LGLR----- 302

Query: 351 LPLLDPHKVHDVDSLPSPTRETTPSVPVQRALVVGDGM-------VKSWAAAAKLSHNAE 403
                    HD+D+LPS T+E    VPV +   VGDG        VK+   A K+  + +
Sbjct: 303 ---------HDLDNLPSLTQE----VPVNKLFSVGDGTDRFGLPPVKT--EAEKMELDGK 347

Query: 404 VHKTPHYETDALRAFSSYQQKFGRNSFFMNSELPSPTPSEESGDGDGDTGGEISSATAVD 463
            +K   +ETDAL+A S+ QQKF R+SFF + E PSPTPS +   G  DT  E+SSA+   
Sbjct: 348 DYKLHIHETDALKAASTCQQKFSRSSFFTDDEFPSPTPSGDCEGGAVDTNDEVSSASIAS 407

Query: 464 QPKPVNMPTLGQQPVSSQPMDISQPMDISSVQALTTANNSAPASSGYNPVVKPNPVVKAP 523
                  P L Q  VSS  ++ S    + + +   +   S PA              K  
Sbjct: 408 SLTSSKPPPLDQMLVSSTYINRSNMHGLINSRIDASGAGSYPA--------------KTS 453

Query: 524 IKSRDPRLRFASSNALNLNHQPAPILHNAPKVEPVGRVMSSRKQKTVEEPVLDGPALKRQ 583
           +KSRDPRLRF      N++ Q +   +  PKVE    V+ SRK+KTVEE  LD  A KR 
Sbjct: 454 VKSRDPRLRF------NISDQ-SSTKNIMPKVEYAEGVI-SRKRKTVEESSLDATAPKRL 505

Query: 584 RNGFENS-GVVRDEKNIYGSGGWLEDTDMFEPQIMNRNLLVDSAESNSRKLDNGATSPIT 642
               ENS    R+E+ +   GGWL                           +N   S +T
Sbjct: 506 TRSLENSQHNSREEQTMDAKGGWLA--------------------------ENTVASNLT 539

Query: 643 SGTPNVVVSGNEPAPATTPSTTVSLPALLKDIAVNPTMLLNILKMGQQQKLAA------- 695
           + +     +GNE AP  +      L AL    +VN TMLLN L +   Q+LA        
Sbjct: 540 TTS-----NGNEQAPVISSCAATPLLALFNSESVNSTMLLNKL-LDIHQRLAEVKRPINF 593

Query: 696 ---------DAQQKSNDSSMNTMHPPIPSSIPPVSVT---CSIPSGILSKPMD-ELGKVR 742
                        +  +S++NT  P + S +P  S+     S P+  +++ +  +  K+ 
Sbjct: 594 ATSALHLTNSNSARGTNSTVNT-SPTMTSGVPQNSIGMLPTSSPTTSMAQTLQVDSEKIC 652

Query: 743 MKPRDPRRVLHGNA-LQRSGSLG 764
           +KPRDPRR LH ++ +Q+SGSLG
Sbjct: 653 LKPRDPRRSLHASSTVQKSGSLG 675


>gi|302761896|ref|XP_002964370.1| hypothetical protein SELMODRAFT_405568 [Selaginella moellendorffii]
 gi|300168099|gb|EFJ34703.1| hypothetical protein SELMODRAFT_405568 [Selaginella moellendorffii]
          Length = 766

 Score =  337 bits (865), Expect = 2e-89,   Method: Compositional matrix adjust.
 Identities = 158/239 (66%), Positives = 187/239 (78%), Gaps = 5/239 (2%)

Query: 885  DVEHLFEGYDDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVH 944
            +++      D+ ++ A  KER RR++EQ KM S +KLCLVLDLDHTLLNSAKF E++   
Sbjct: 412  ELQEFLIDLDEAERIAFIKERQRRMDEQDKMLSEKKLCLVLDLDHTLLNSAKFMEIEQEW 471

Query: 945  DEILRKKEEQDREK-----PHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGN 999
            D  LR  E  +R K       R L+RFP+M MWTKLRPGIW FL RAS+L+E+HLYTMGN
Sbjct: 472  DRFLRATETIERNKDAKEGTRRELYRFPYMSMWTKLRPGIWRFLARASQLYELHLYTMGN 531

Query: 1000 KLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDS 1059
            K YATEMAK+LDP GVLFAGRVIS+GDDGD   GDE+ P+SKDL+GVLGMESAV+IIDDS
Sbjct: 532  KAYATEMAKLLDPTGVLFAGRVISKGDDGDALYGDEKTPRSKDLDGVLGMESAVLIIDDS 591

Query: 1060 VRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVRQQLH 1118
             RVWPH+K NLIVVERY YFPCSR+QFGL GPSLLE+ HDER  DG LAS LGV +++H
Sbjct: 592  ARVWPHHKDNLIVVERYMYFPCSRKQFGLPGPSLLEVGHDEREADGMLASILGVVERVH 650



 Score = 45.1 bits (105), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 41/176 (23%), Positives = 75/176 (42%), Gaps = 14/176 (7%)

Query: 158 ESNEKVSEQVKEEMKLINVESIREALESVLRGDISFEGVCSKLEFTLESLRELVNENNVP 217
           +S  K    ++  +KL+N  S+  A +S           C++L   L+ L EL   +   
Sbjct: 77  DSESKHDNSIQTVVKLVNNVSVGNACKSP-------NDACAQLHDALQILEELDQPSKKS 129

Query: 218 TKDALIQ-LAFSAVQSVHSVFCSMNHVLKEQNKEILSRLLSLIKSHEPPLFSSNQIKEME 276
               L+Q L    +  +  V+C  N +  EQ   +L  L  L K +   LF++ Q++E++
Sbjct: 130 PNCELVQSLVLKILDKIRIVYCVYNAIGDEQKPGVLQSLAKLAKFYTDKLFTAKQVEELK 189

Query: 277 AMLSS----LVTRANDKEKDMLAMHGVNGKDSNIVTENAVNDLNFKEKVPLPVDSL 328
           A+  +    L T ++ KE   +   G  G  +   T +++           P+D L
Sbjct: 190 ALYEAVNPKLETSSDGKEHAYIPWDG--GTSAEHPTSSSLTSYTHYMSASFPMDDL 243


>gi|302768485|ref|XP_002967662.1| hypothetical protein SELMODRAFT_440109 [Selaginella moellendorffii]
 gi|300164400|gb|EFJ31009.1| hypothetical protein SELMODRAFT_440109 [Selaginella moellendorffii]
          Length = 762

 Score =  337 bits (865), Expect = 2e-89,   Method: Compositional matrix adjust.
 Identities = 158/239 (66%), Positives = 187/239 (78%), Gaps = 5/239 (2%)

Query: 885  DVEHLFEGYDDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVH 944
            +++      D+ ++ A  KER RR++EQ KM S +KLCLVLDLDHTLLNSAKF E++   
Sbjct: 408  ELQEFLIDLDEAERIAFIKERQRRMDEQDKMLSEKKLCLVLDLDHTLLNSAKFMEIEQEW 467

Query: 945  DEILRKKEEQDREK-----PHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGN 999
            D  LR  E  +R K       R L+RFP+M MWTKLRPGIW FL RAS+L+E+HLYTMGN
Sbjct: 468  DRFLRATETIERNKDAKEGTRRELYRFPYMSMWTKLRPGIWRFLARASQLYELHLYTMGN 527

Query: 1000 KLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDS 1059
            K YATEMAK+LDP GVLFAGRVIS+GDDGD   GDE+ P+SKDL+GVLGMESAV+IIDDS
Sbjct: 528  KAYATEMAKLLDPTGVLFAGRVISKGDDGDALYGDEKTPRSKDLDGVLGMESAVLIIDDS 587

Query: 1060 VRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVRQQLH 1118
             RVWPH+K NLIVVERY YFPCSR+QFGL GPSLLE+ HDER  DG LAS LGV +++H
Sbjct: 588  ARVWPHHKDNLIVVERYMYFPCSRKQFGLPGPSLLEVGHDEREADGMLASILGVVERVH 646



 Score = 45.1 bits (105), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 43/176 (24%), Positives = 76/176 (43%), Gaps = 18/176 (10%)

Query: 158 ESNEKVSEQVKEEMKLINVESIREALESVLRGDISFEGVCSKLEFTLESLRELVNENNVP 217
           +S  K    ++  +KL+N  S+  A +S           C++L   L+ L EL    + P
Sbjct: 77  DSESKHDNSIQTVVKLVNNVSVGNACKSP-------NDACAQLHDALQILEEL----DQP 125

Query: 218 TKDALIQ-LAFSAVQSVHSVFCSMNHVLKEQNKEILSRLLSLIKSHEPPLFSSNQIKEM- 275
           +   L+Q L    +  +  V+C  N +  EQ   +L  L  L K +   LF++ Q++E+ 
Sbjct: 126 SNCELVQSLVLKILDKIRIVYCVYNAIGDEQKPGVLQSLAKLAKFYTDKLFTAKQVEELK 185

Query: 276 ---EAMLSSLVTRANDKEKDMLAMHGVNGKDSNIVTENAVNDLNFKEKVPLPVDSL 328
              EA+   L T ++ KE   +   G  G      T +++           P+D+L
Sbjct: 186 GLYEAVNPKLETSSDGKEHAYIPWDG--GTSGEHPTSSSLTSYTHYMSASFPMDAL 239


>gi|168018017|ref|XP_001761543.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162687227|gb|EDQ73611.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 1984

 Score =  333 bits (854), Expect = 3e-88,   Method: Compositional matrix adjust.
 Identities = 191/424 (45%), Positives = 251/424 (59%), Gaps = 66/424 (15%)

Query: 736  DELGKVRMKPRDPRRVLHGNALQRSGSLGPEFKTDGPSAPCTQGSKENLNFQKQLGAPEA 795
            +++G+ RM+PRDPRR+L  NA++ +       + +  + P    +   +  Q    +PE 
Sbjct: 1303 EDVGRHRMRPRDPRRILLENAVETA-------QVNPMNVPVNDAAGSEMTLQYTNRSPEV 1355

Query: 796  KPVLSQSVLQPDITQQFTKNL----------------KHIADFMSVSQPLTSEPMVSQNS 839
              V       P++  Q + N                    +D   VS  L++E   ++  
Sbjct: 1356 ADV------SPNLNNQPSTNTLGNPPNQRDPRLNPYESSQSDSAVVSIQLSTEHKTTEWK 1409

Query: 840  PIQPGQIKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQ-------------SAWG-- 884
             +     +S  D      ++ ++ +   SG   G  G H               S WG  
Sbjct: 1410 TLDERIKESERD------SNRNQGSEVSSGESTGDKGDHLHPWDPLLRKPRFGPSHWGGN 1463

Query: 885  ----DVEHLFEGYDDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEV 940
                D E L E  D++Q+ AIQ ER RRL+EQ +MF A KLCLVLDLDHTLLNSAKF E+
Sbjct: 1464 DMHRDFEQLLEDLDEKQRIAIQNERKRRLQEQDRMFIAGKLCLVLDLDHTLLNSAKFSEI 1523

Query: 941  DPVHDEILRKKEEQDREKP------HRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHL 994
            +P  +  LR+ E  +R +        + L+RFPHM MWTKLRPGIW FL +AS+L+E+H+
Sbjct: 1524 EPEWEARLRQAENMERSRALKDPSMKQELYRFPHMSMWTKLRPGIWKFLAKASELYELHV 1583

Query: 995  YTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVV 1054
            YTMGNK YATEMAK+LDP G LFAGRVIS+GD+ D  D      KSKDL+GVLGMESAVV
Sbjct: 1584 YTMGNKAYATEMAKLLDPTGTLFAGRVISKGDEVDGSD------KSKDLDGVLGMESAVV 1637

Query: 1055 IIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVR 1114
            IIDDS RVWPH++ NLIVVERY YFP SRRQFGLLGPSLLE+ HDER+ DG L+S+ GV 
Sbjct: 1638 IIDDSSRVWPHHRENLIVVERYMYFPSSRRQFGLLGPSLLEVGHDERAADGMLSSASGVI 1697

Query: 1115 QQLH 1118
             ++H
Sbjct: 1698 DRIH 1701


>gi|168040198|ref|XP_001772582.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162676137|gb|EDQ62624.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 1881

 Score =  331 bits (849), Expect = 1e-87,   Method: Compositional matrix adjust.
 Identities = 198/445 (44%), Positives = 253/445 (56%), Gaps = 67/445 (15%)

Query: 719  PVSVTCSIPSGILSKPMDELGKVRMKPRDPRRVLHGNALQRSGSLGPEFKTDGPSAPCTQ 778
            P  +  +   G   +   ++GK RM+PRDPRR L  +A           + +  S P  +
Sbjct: 1325 PFGINSTKSEGQAGEEEQDIGKHRMRPRDPRRALLDSA-------ADIVQVNQRSPPIIE 1377

Query: 779  GSKENLNFQKQLGAPEAKPVLSQSVLQPDITQQFTKNLKHIAD--------------FMS 824
             +      Q + G        S  V QP I     +N  ++ D               ++
Sbjct: 1378 AADSGTTLQIETGTSLPTNTSSDLVKQPSINS--LENPLNLRDPRLSSNNSTQSNNATLA 1435

Query: 825  VSQPLTS--------EPMV-----------SQNSPIQPGQIKSGADMKAVVTNHDDKQTG 865
              QP T         EP+V           S+N  I   ++ SG  +   V +H      
Sbjct: 1436 PEQPSTEQKNMTVEEEPVVDERNNARERESSRNQGIDAREVFSGESILDEV-DHLHPWDP 1494

Query: 866  TGSGPEAGPVGAHPQSAWG------DVEHLFEGYDDQQKAAIQKERTRRLEEQKKMFSAR 919
                P  GP      S WG      D E L E  D+ Q+  IQ ER RR++EQ +MFSA 
Sbjct: 1495 VLRKPRFGP------SHWGGSNLHRDFEQLLEDLDEDQRITIQNERKRRIQEQDRMFSAG 1548

Query: 920  KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR------HLFRFPHMGMWT 973
            KLCLVLDLDHTLLNSAKF E++P  +  LR+ E  +R +  +       L+RFPHM MWT
Sbjct: 1549 KLCLVLDLDHTLLNSAKFSEIEPEFEARLRQAENMERSRSTKDPNMKQELYRFPHMSMWT 1608

Query: 974  KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDG 1033
            KLRPGIW FL +AS+L+E+H+YTMGNK YATEMAK+LDP G+LF+GRVIS+GD+ D  D 
Sbjct: 1609 KLRPGIWKFLAKASELYELHVYTMGNKAYATEMAKLLDPTGILFSGRVISKGDEVDGSD- 1667

Query: 1034 DERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSL 1093
                 KSKDL+GVLGMESAVVIIDDS RVWPH++ NLIVVERY YFP SRRQFGLLGPSL
Sbjct: 1668 -----KSKDLDGVLGMESAVVIIDDSSRVWPHHRENLIVVERYMYFPSSRRQFGLLGPSL 1722

Query: 1094 LEIDHDERSEDGTLASSLGVRQQLH 1118
            LE+ HDER+ DG L+S+ GV  ++H
Sbjct: 1723 LEVGHDERAVDGMLSSASGVIDRIH 1747


>gi|224075473|ref|XP_002304648.1| predicted protein [Populus trichocarpa]
 gi|222842080|gb|EEE79627.1| predicted protein [Populus trichocarpa]
          Length = 238

 Score =  207 bits (526), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 108/122 (88%), Positives = 113/122 (92%)

Query: 997  MGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVII 1056
            MGNKLYATEMAKVLDPKGVLFAGRV+SRGDDGD  DGDERVPKSKDLEGVLGMES VVII
Sbjct: 1    MGNKLYATEMAKVLDPKGVLFAGRVVSRGDDGDLLDGDERVPKSKDLEGVLGMESGVVII 60

Query: 1057 DDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVRQQ 1116
            DDS+RVWPHNKLNLIVVERY YFPCSRRQFGL GPSLLEIDHDER EDGTLA SL V ++
Sbjct: 61   DDSLRVWPHNKLNLIVVERYIYFPCSRRQFGLPGPSLLEIDHDERPEDGTLACSLAVIER 120

Query: 1117 LH 1118
            +H
Sbjct: 121  IH 122


>gi|384247094|gb|EIE20582.1| hypothetical protein COCSUDRAFT_57726 [Coccomyxa subellipsoidea
            C-169]
          Length = 1018

 Score =  186 bits (472), Expect = 7e-44,   Method: Compositional matrix adjust.
 Identities = 94/207 (45%), Positives = 130/207 (62%), Gaps = 8/207 (3%)

Query: 915  MFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKP--HRHLFRFPHMGMW 972
            +   R+LCLVLDLDHTL+NSAKF EV+P H ++L ++ +++   P   + L R   + MW
Sbjct: 693  LLRQRRLCLVLDLDHTLVNSAKFSEVEPEHLKLLERQLQREAALPAEEKRLHRLDRIAMW 752

Query: 973  TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFD 1032
            T LRPG+   L   + LF++ + T  ++ YA  MA++LDP G LF  R+IS+GDDG    
Sbjct: 753  TALRPGLRQMLAAVAPLFQLWIQTNASRAYALAMAELLDPTGELFGQRIISKGDDGSAL- 811

Query: 1033 GDERVPKSKDL-EGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGP 1091
                +  SK L +G+   E+  +I+DDS  VW H+  NL+ VERYTYFP SRRQ  L GP
Sbjct: 812  ----INHSKRLMQGLEECEAVCIIVDDSDDVWRHHAHNLLHVERYTYFPSSRRQLNLRGP 867

Query: 1092 SLLEIDHDERSEDGTLASSLGVRQQLH 1118
            S LE   DE  + G LA +LGV  ++H
Sbjct: 868  SFLEAHKDECDKTGILAVTLGVLLRVH 894


>gi|168059994|ref|XP_001781984.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162666557|gb|EDQ53208.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 563

 Score =  182 bits (462), Expect = 9e-43,   Method: Compositional matrix adjust.
 Identities = 98/211 (46%), Positives = 133/211 (63%), Gaps = 9/211 (4%)

Query: 908  RLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP 967
            R  E +++   +KL LV+DLDHT+LNSA+F EV P  + I        +      L +  
Sbjct: 173  RNAELRRVTGKQKLLLVVDLDHTMLNSARFSEV-PAEERIYLTWTAGQQHGRVSSLHQLT 231

Query: 968  HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD 1027
             +GMWTKLRP    FLE ASKL+EM++YTMG K+YA  MA++LDP G LF GR+IS+ D 
Sbjct: 232  KLGMWTKLRPFAHKFLEEASKLYEMYVYTMGEKIYAQAMAELLDPTGQLFGGRIISQTDS 291

Query: 1028 GDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFG 1087
                        +KDL+ VLG ESAVVI+DD+  VWP+++ NLI++ERY +F  S  QF 
Sbjct: 292  TK--------RHTKDLDVVLGAESAVVILDDTEAVWPNHRSNLILMERYHFFTSSCHQFR 343

Query: 1088 LLGPSLLEIDHDERSEDGTLASSLGVRQQLH 1118
            +  PSL ++  DE   DGTLA++L   Q +H
Sbjct: 344  VRAPSLAQMHRDECEIDGTLATTLKTLQAIH 374


>gi|384251210|gb|EIE24688.1| carboxyl-terminal phosphatase-like 4 [Coccomyxa subellipsoidea C-169]
          Length = 439

 Score =  181 bits (459), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 99/219 (45%), Positives = 136/219 (62%), Gaps = 10/219 (4%)

Query: 899  AAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREK 958
            +A + ER R+ +  K+  S RKL LVLDLDHTLLNS +F E     +++   +  +  ++
Sbjct: 59   SASEAERVRQ-QSLKRALSNRKLLLVLDLDHTLLNSTRFDEAVGFEEQLAAIQRARPEDQ 117

Query: 959  PHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA 1018
            P   L+   HM +WTKLRP +  FLE+A ++ EMH+YT GN  YA EMA++LDP    FA
Sbjct: 118  P-VSLYHLEHMRLWTKLRPYVREFLEKAHEVSEMHIYTHGNAEYAIEMARLLDPTKRFFA 176

Query: 1019 GRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTY 1078
             R+IS+GD          V   KDL+ VLG E+AVVI+DD+  VWP ++ NL+ VERY +
Sbjct: 177  ERIISQGDST--------VKHVKDLDVVLGAETAVVILDDTAGVWPSHQQNLLQVERYVF 228

Query: 1079 FPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVRQQL 1117
            FP   R+F L   SLLE+  DE  + G LAS+L V  + 
Sbjct: 229  FPACARRFQLNVQSLLELGRDEDEQHGMLASALRVHSRF 267


>gi|302764346|ref|XP_002965594.1| hypothetical protein SELMODRAFT_167775 [Selaginella moellendorffii]
 gi|300166408|gb|EFJ33014.1| hypothetical protein SELMODRAFT_167775 [Selaginella moellendorffii]
          Length = 411

 Score =  174 bits (442), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 103/232 (44%), Positives = 134/232 (57%), Gaps = 32/232 (13%)

Query: 908  RLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRH----- 962
            R +E +++   RKL LVLDLDHTLLNSA++ EV P     L   E      P        
Sbjct: 44   REDELRQVLGKRKLFLVLDLDHTLLNSARWMEVFPDETAYL---EHTYMNVPEDKIPALS 100

Query: 963  ----------------LFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEM 1006
                            L R   M +WTKLRP    FLE ASKLFEM++YTMG ++YA  M
Sbjct: 101  NGAPAVAGVIQPGGGGLHRIHGMQLWTKLRPFAHKFLEEASKLFEMYVYTMGERMYAVTM 160

Query: 1007 AKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHN 1066
            A +LDP G  F GRVIS+ D            ++KDL+ VLG +SAV+I+DD+  VWP +
Sbjct: 161  AHLLDPTGKFFKGRVISQRDST--------CRQTKDLDIVLGADSAVLILDDTEAVWPKH 212

Query: 1067 KLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVRQQLH 1118
            + NLIV+ERY +F  S RQFGL  PSL + + DE  ++G LA+ L V Q++H
Sbjct: 213  RANLIVMERYHFFQSSCRQFGLENPSLTKAERDESKDEGALANVLKVLQRIH 264


>gi|326510557|dbj|BAJ87495.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 384

 Score =  173 bits (439), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 95/211 (45%), Positives = 128/211 (60%), Gaps = 9/211 (4%)

Query: 908  RLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP 967
            R  + K +   RKL L+LDLDHTL+NS K H++    +  L  +    ++ P+  LF   
Sbjct: 152  RGSDLKNLLRERKLILILDLDHTLINSTKLHDISAAENN-LGIQAAASKDDPNGSLFTLE 210

Query: 968  HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD 1027
             M M TKLRP +  FL+ AS +FEM++YTMG+K YA E+AK+LDP+ V F  +VIS  D 
Sbjct: 211  GMQMLTKLRPFVRKFLKEASNMFEMYIYTMGDKAYAIEIAKLLDPRNVYFNSKVISNSD- 269

Query: 1028 GDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFG 1087
                   +R  K  D+  VLG ES  VI+DD+  VW  +K NLI++ERY YF  S RQFG
Sbjct: 270  -----CTQRHQKGLDM--VLGAESVAVILDDTEYVWQKHKENLILMERYHYFASSCRQFG 322

Query: 1088 LLGPSLLEIDHDERSEDGTLASSLGVRQQLH 1118
                SL E+  DER  DG LA+ L V +++H
Sbjct: 323  FSVKSLSELMQDERGSDGALATILDVLKRIH 353


>gi|307111295|gb|EFN59530.1| hypothetical protein CHLNCDRAFT_138191 [Chlorella variabilis]
          Length = 1156

 Score =  173 bits (438), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 90/204 (44%), Positives = 127/204 (62%), Gaps = 9/204 (4%)

Query: 920  KLCLVLDLDHTLLNSAKFHEVDP-VHDEILRKKEEQDREKP--HRHLFRFPHMGMWTKLR 976
            KLCLVLDLDHTLLNSA F EV P +HD +  +   +    P   R LFR   + MWTKLR
Sbjct: 368  KLCLVLDLDHTLLNSATFAEVGPTLHDSLKARAASEAATLPEDQRLLFRIDGIKMWTKLR 427

Query: 977  PGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDER 1036
            PG+  FL+RA++ +++ ++T GN+ YA  + ++LD  G +F  R+I++G +      D+ 
Sbjct: 428  PGVHKFLQRAARYYQLWIHTNGNRAYADSVVRLLDRGGAIFGDRIIAQGAE----RVDQM 483

Query: 1037 VP--KSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLL 1094
            VP    + ++G+   ES  VI+DDS  VW  ++ NL+ VERY YFP SR   GL GPSLL
Sbjct: 484  VPDQAKRLMQGLDERESITVIVDDSHSVWSQHRHNLVAVERYIYFPSSRASLGLKGPSLL 543

Query: 1095 EIDHDERSEDGTLASSLGVRQQLH 1118
            + + DE  E G L  +L V  ++H
Sbjct: 544  DANRDECPEQGMLMVALSVLVRVH 567


>gi|307106534|gb|EFN54779.1| hypothetical protein CHLNCDRAFT_134722 [Chlorella variabilis]
          Length = 513

 Score =  172 bits (436), Expect = 9e-40,   Method: Compositional matrix adjust.
 Identities = 101/224 (45%), Positives = 130/224 (58%), Gaps = 30/224 (13%)

Query: 914  KMFSARKLCLVLDLDHTLLNSAKFHEVDPVH---------DEILRKK-EEQDREKPHRHL 963
            ++ + RKL L+LDLDHTLLNS +F EV P           ++ LR + E Q +  P   L
Sbjct: 111  RLLAHRKLLLILDLDHTLLNSTRFTEVPPQGAVTEQREGGEQALRAQLEAQPKGAPM--L 168

Query: 964  FRFPHMGMWTKLRPGIWTFLERASKL------FEMHLYTMGNKLYATEMAKVLDPKGVLF 1017
            +  PHM MWTKLRPG+  FLE A         FE+ +YTMG++ YA EMAK+LDP G LF
Sbjct: 169  YCLPHMRMWTKLRPGVREFLEAAKDRQVGQVGFELAVYTMGDRDYAGEMAKLLDPAGSLF 228

Query: 1018 AGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYT 1077
             GR+IS GD    +         KDL+ VLG E  V+I+DD+  VWP ++ NL+ +ERY 
Sbjct: 229  HGRIISSGDSTQRY--------VKDLDVVLGRERCVLILDDTEGVWPRHRDNLVQIERYL 280

Query: 1078 YFPCSRRQFGLLGPSLLEIDHDERSEDGTLASSL----GVRQQL 1117
            YFP    +FG    SLLE   DE    G LA+ L    GV+QQ 
Sbjct: 281  YFPADAARFGFRSQSLLERAVDEEGGGGALATCLRVMSGVQQQF 324


>gi|326518250|dbj|BAK07377.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 488

 Score =  172 bits (435), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 94/212 (44%), Positives = 128/212 (60%), Gaps = 9/212 (4%)

Query: 908  RLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEI-LRKKEEQDREKPHRHLFRF 966
            R  E K +   RKL L+LDLDHTL+NS + H++     ++ ++    ++ + P R LF  
Sbjct: 150  RESEVKNLLRERKLVLILDLDHTLINSTRLHDISAAEMDLGIQTAASKNADDPERSLFTL 209

Query: 967  PHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD 1026
              M M TKLRP +  FLE AS +F+M++YTMG+K YA E+AK+LDP  V F  +VIS   
Sbjct: 210  QGMHMLTKLRPFVRKFLEEASNMFDMYIYTMGDKAYAIEIAKLLDPGNVYFDSKVISNS- 268

Query: 1027 DGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQF 1086
                 D  +R  K  D+  VLG +   VIIDD+  VW  +K NLI++ERY YF  S RQF
Sbjct: 269  -----DCTQRHQKGLDV--VLGDDKVAVIIDDTEHVWQKHKENLILMERYHYFAASCRQF 321

Query: 1087 GLLGPSLLEIDHDERSEDGTLASSLGVRQQLH 1118
            G    SL E+  DER  DG LA+ L V +++H
Sbjct: 322  GFSDQSLSELMQDERESDGALATILDVLKRIH 353


>gi|303276827|ref|XP_003057707.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226460364|gb|EEH57658.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 692

 Score =  171 bits (434), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 96/210 (45%), Positives = 128/210 (60%), Gaps = 10/210 (4%)

Query: 915  MFSARKLCLVLDLDHTLLNSAKFHEVDP--VHDEILR-KKEEQDREKPHRHLFRFPHMGM 971
            +   R+L LVLDLDHTLLNS  F   D   +   +L  ++ E  ++   R L R  H+G+
Sbjct: 299  LLDRRRLTLVLDLDHTLLNSESFESKDGGRLQRGLLEIERLESTKDSNDRTLHRLNHIGL 358

Query: 972  WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPF 1031
            WTKLRPG+ TFL +AS +FE+H+ TMG++ YA  + ++LDP   +  G VI  G     F
Sbjct: 359  WTKLRPGVQTFLHKASAMFEIHISTMGSQPYADSIRRLLDPCRNVIKGSVIGLGG----F 414

Query: 1032 D--GDERVPKSKDLEGVL-GMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGL 1088
            D  G  + P  K LEGVL G E A VI+DD+  VW     NLIV ERY YFP + + FG+
Sbjct: 415  DEFGAFKSPPQKKLEGVLAGTEPAAVILDDTAEVWTGYSENLIVCERYMYFPSACKNFGV 474

Query: 1089 LGPSLLEIDHDERSEDGTLASSLGVRQQLH 1118
            +GPSLLE   DE  + GTLA+ L V  ++H
Sbjct: 475  VGPSLLERGVDESEKSGTLATVLEVLTRVH 504


>gi|357163276|ref|XP_003579679.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            4-like [Brachypodium distachyon]
          Length = 493

 Score =  170 bits (430), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 91/206 (44%), Positives = 124/206 (60%), Gaps = 9/206 (4%)

Query: 913  KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMW 972
            K +   RKL L+LDLDHTL+NS K H++     + L  +     + P + LF    M M 
Sbjct: 156  KSLLRERKLVLILDLDHTLINSTKLHDISAAERD-LGIQTFASEDAPEKSLFTLEAMQML 214

Query: 973  TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFD 1032
            TKLRP +  FL+ AS +FEM++YTMG+K YA E+AK+LDP  V F  +VIS        D
Sbjct: 215  TKLRPFVCKFLKEASNMFEMYIYTMGDKAYAIEIAKLLDPGNVYFGSKVISNS------D 268

Query: 1033 GDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPS 1092
              +R    K L+ VLG E+  +I+DD+  VW  +K NLI++ERY YF  S RQFG    +
Sbjct: 269  CTQR--HQKGLDVVLGAENVAIILDDTEYVWQKHKENLILMERYHYFASSCRQFGFSVKA 326

Query: 1093 LLEIDHDERSEDGTLASSLGVRQQLH 1118
            L E   DER  DG LA++L V +++H
Sbjct: 327  LSESMQDERESDGALATTLDVLKRIH 352


>gi|449447765|ref|XP_004141638.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            4-like [Cucumis sativus]
          Length = 452

 Score =  169 bits (428), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 97/211 (45%), Positives = 130/211 (61%), Gaps = 9/211 (4%)

Query: 908  RLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP 967
            R +E K++   +KL LVLDLDHTLLNS +   +  V +E LR + +   +     LF   
Sbjct: 125  RNKEMKELLQRKKLILVLDLDHTLLNSTELRYL-TVEEEYLRSQTDSLDDVTKGSLFLLN 183

Query: 968  HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD 1027
             +   TKLRP + +FL+ ASKLFEM++YTMG + YA EMAK+LDPK   F+ +VISR DD
Sbjct: 184  SVHTMTKLRPFVHSFLKEASKLFEMYIYTMGERRYAFEMAKLLDPKKEYFSSKVISR-DD 242

Query: 1028 GDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFG 1087
            G            K L+ VLG ESAV+I+DD+   W  +K NLI++ERY +F  S RQFG
Sbjct: 243  GTQ-------KHQKGLDVVLGKESAVLILDDTENAWTKHKENLILMERYHFFASSCRQFG 295

Query: 1088 LLGPSLLEIDHDERSEDGTLASSLGVRQQLH 1118
                SL E+ +DE   DG L + L V +Q+H
Sbjct: 296  FNCKSLSELKNDESETDGALTTILKVLKQVH 326


>gi|255570505|ref|XP_002526210.1| RNA polymerase II ctd phosphatase, putative [Ricinus communis]
 gi|223534449|gb|EEF36151.1| RNA polymerase II ctd phosphatase, putative [Ricinus communis]
          Length = 478

 Score =  169 bits (428), Expect = 9e-39,   Method: Compositional matrix adjust.
 Identities = 97/211 (45%), Positives = 129/211 (61%), Gaps = 9/211 (4%)

Query: 908  RLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP 967
            R  + K +   RKL LVLDLDHTLLNS +   +    +E L+ + +  ++  +  LF   
Sbjct: 154  RNTDMKNLLRHRKLYLVLDLDHTLLNSTQLMHL-TAEEEYLKSQIDSMQDVSNGSLFMVD 212

Query: 968  HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD 1027
             M M TKLRP I TFL+ AS++FEM++YTMG++ YA EMAK LDP    F  RVISR   
Sbjct: 213  FMHMMTKLRPFIRTFLKEASQMFEMYIYTMGDRAYALEMAKFLDPGREYFNARVISRD-- 270

Query: 1028 GDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFG 1087
                DG +R  K  D+  VLG ESAV+I+DD+   W  +K NLI++ERY +F  S RQFG
Sbjct: 271  ----DGTQRHQKGLDI--VLGQESAVLILDDTENAWTKHKDNLILMERYHFFASSCRQFG 324

Query: 1088 LLGPSLLEIDHDERSEDGTLASSLGVRQQLH 1118
                SL ++  DE   DG LAS L V +++H
Sbjct: 325  FECKSLSQLKSDENESDGALASVLKVLRRIH 355


>gi|115463681|ref|NP_001055440.1| Os05g0390500 [Oryza sativa Japonica Group]
 gi|57863785|gb|AAS86390.2| unknown protein [Oryza sativa Japonica Group]
 gi|113578991|dbj|BAF17354.1| Os05g0390500 [Oryza sativa Japonica Group]
 gi|215695102|dbj|BAG90293.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222631469|gb|EEE63601.1| hypothetical protein OsJ_18418 [Oryza sativa Japonica Group]
          Length = 536

 Score =  169 bits (427), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 94/206 (45%), Positives = 124/206 (60%), Gaps = 9/206 (4%)

Query: 913  KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMW 972
            K +   RKL L+LDLDHTL+NS K  ++    +E+  +   ++   P R LF    M M 
Sbjct: 163  KNLLRERKLVLILDLDHTLINSTKLFDLSAAENELGIQSAAKEV-VPDRSLFTLETMQML 221

Query: 973  TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFD 1032
            TKLRP +  FL+ AS +FEM++YTMG+K YA E+AK+LDP  V F  +VIS        D
Sbjct: 222  TKLRPFVRRFLKEASDMFEMYIYTMGDKAYAIEIAKLLDPDNVYFGSKVISNS------D 275

Query: 1033 GDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPS 1092
              +R  K  D+  VLG ES  VI+DD+  VW  +K NLI++ERY YF  S RQFG    S
Sbjct: 276  CTQRHQKGLDV--VLGDESVAVILDDTEYVWQKHKENLILMERYHYFASSCRQFGFGARS 333

Query: 1093 LLEIDHDERSEDGTLASSLGVRQQLH 1118
            L E   DER  DG LA+ L V +++H
Sbjct: 334  LSETMQDERENDGALATILDVLERIH 359


>gi|242093742|ref|XP_002437361.1| hypothetical protein SORBIDRAFT_10g025580 [Sorghum bicolor]
 gi|241915584|gb|EER88728.1| hypothetical protein SORBIDRAFT_10g025580 [Sorghum bicolor]
          Length = 558

 Score =  168 bits (426), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 90/206 (43%), Positives = 124/206 (60%), Gaps = 9/206 (4%)

Query: 913  KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMW 972
            K +   RKL L+LDLDHTL+NS K  ++     + L  +    ++ P+R +F    M M 
Sbjct: 157  KNLLRERKLVLILDLDHTLINSTKLQDISSAEKD-LGIQTAASKDDPNRSIFSLDSMQML 215

Query: 973  TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFD 1032
            TKLRP +  FL+ AS +FEM++YTMG+K YA E+AK+LDP  + F  +VIS        D
Sbjct: 216  TKLRPFVREFLKEASNMFEMYIYTMGDKAYAIEIAKLLDPSNIYFPSKVISNS------D 269

Query: 1033 GDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPS 1092
              +R  K  D+  +LG ES  VI+DD+  VW  +K NLI++ERY +F  S RQFG    S
Sbjct: 270  CTQRHQKGLDV--ILGAESVAVILDDTEYVWQKHKENLILMERYHFFASSCRQFGFGVRS 327

Query: 1093 LLEIDHDERSEDGTLASSLGVRQQLH 1118
            L E   DER  DG LA+ L V +++H
Sbjct: 328  LSESMQDERESDGALATVLDVLKRIH 353


>gi|356498756|ref|XP_003518215.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            4-like [Glycine max]
          Length = 428

 Score =  168 bits (426), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 98/220 (44%), Positives = 130/220 (59%), Gaps = 11/220 (5%)

Query: 901  IQKERTRRLE--EQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREK 958
            +  E   RL   + K +   +KL LVLDLDHTLLNS     +      +L + +   R+ 
Sbjct: 104  LHDEEISRLRNTDMKSLLCRKKLYLVLDLDHTLLNSTHLAHLTSEESHLLNQTDSL-RDV 162

Query: 959  PHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA 1018
                LF+  HM M TKLRP +  FL+ AS++FEM++YTMG++ YA EMAK+LDP+G  F 
Sbjct: 163  SKGSLFKLEHMNMMTKLRPFVRPFLKEASEMFEMYIYTMGDRPYALEMAKLLDPQGEYFN 222

Query: 1019 GRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTY 1078
             +VISR DDG            K L+ VLG ESAV+I+DD+   W  +K NLI++ERY +
Sbjct: 223  AKVISR-DDGTQ-------KHQKGLDVVLGQESAVLILDDTEHAWMKHKDNLILMERYHF 274

Query: 1079 FPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVRQQLH 1118
            F  S RQFG    SL E+  DE   DG LA  L V +Q+H
Sbjct: 275  FGSSCRQFGFNCKSLAELKSDENETDGALAKILKVLKQVH 314


>gi|218196729|gb|EEC79156.1| hypothetical protein OsI_19829 [Oryza sativa Indica Group]
          Length = 574

 Score =  168 bits (425), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 95/206 (46%), Positives = 122/206 (59%), Gaps = 9/206 (4%)

Query: 913  KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMW 972
            K +   RKL L+LDLDHTL+NS K  ++    +E L  +       P R LF    M M 
Sbjct: 189  KNLLRERKLVLILDLDHTLINSTKLFDLSAAENE-LGIQSAAKEVVPDRSLFTLETMQML 247

Query: 973  TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFD 1032
            TKLRP +  FL+ AS +FEM++YTMG+K YA E+AK+LDP  V F  +VIS        D
Sbjct: 248  TKLRPFVRRFLKEASDMFEMYIYTMGDKAYAIEIAKLLDPDNVYFGSKVISNS------D 301

Query: 1033 GDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPS 1092
              +R  K  D+  VLG ES  VI+DD+  VW  +K NLI++ERY YF  S RQFG    S
Sbjct: 302  CTQRHQKGLDV--VLGDESVAVILDDTEYVWQKHKENLILMERYHYFASSCRQFGFGARS 359

Query: 1093 LLEIDHDERSEDGTLASSLGVRQQLH 1118
            L E   DER  DG LA+ L V +++H
Sbjct: 360  LSETMQDERENDGALATILDVLERIH 385


>gi|357129281|ref|XP_003566293.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            4-like [Brachypodium distachyon]
          Length = 492

 Score =  168 bits (425), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 93/206 (45%), Positives = 126/206 (61%), Gaps = 9/206 (4%)

Query: 913  KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMW 972
            KK+   RKL L+LDLDHTL+NS + H++     + L  +    ++ P R LF    M M 
Sbjct: 157  KKLLRERKLVLILDLDHTLINSTRLHDISAAEMD-LGIQTAALKDDPDRSLFTLERMHML 215

Query: 973  TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFD 1032
            TKLRP +  FL+ AS +FEM++YTMG+K Y+ E+AK+LDP  V F  +VIS        D
Sbjct: 216  TKLRPFVRRFLKEASNMFEMYIYTMGDKAYSIEVAKLLDPGNVYFGSKVISNS------D 269

Query: 1033 GDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPS 1092
              +R  K  D+  VLG ES  VI+DD+  VW  +K NLI++ERY YF  S RQFG    S
Sbjct: 270  CTQRHQKGLDV--VLGAESIAVILDDTEDVWQKHKENLILMERYHYFASSCRQFGFSVRS 327

Query: 1093 LLEIDHDERSEDGTLASSLGVRQQLH 1118
            L E+  DER  DG L++ L V +++H
Sbjct: 328  LSELMVDERESDGALSTILDVLKRIH 353


>gi|9758369|dbj|BAB08870.1| unnamed protein product [Arabidopsis thaliana]
          Length = 1065

 Score =  166 bits (421), Expect = 6e-38,   Method: Compositional matrix adjust.
 Identities = 98/212 (46%), Positives = 124/212 (58%), Gaps = 30/212 (14%)

Query: 919  RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRH------------LFRF 966
            RKL LVLDLDHTLLN+    ++ P          E++  K H H            LF  
Sbjct: 746  RKLYLVLDLDHTLLNTTILRDLKP----------EEEYLKSHTHSLQDGCNVSGGSLFLL 795

Query: 967  PHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD 1026
              M M TKLRP + +FL+ AS++F M++YTMG++ YA +MAK+LDPKG  F  RVISR D
Sbjct: 796  EFMQMMTKLRPFVHSFLKEASEMFVMYIYTMGDRNYARQMAKLLDPKGEYFGDRVISR-D 854

Query: 1027 DGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQF 1086
            DG        V   K L+ VLG ESAV+I+DD+   WP +K NLIV+ERY +F  S RQF
Sbjct: 855  DGT-------VRHEKSLDVVLGQESAVLILDDTENAWPKHKDNLIVIERYHFFSSSCRQF 907

Query: 1087 GLLGPSLLEIDHDERSEDGTLASSLGVRQQLH 1118
                 SL E+  DE   DG LA+ L V +Q H
Sbjct: 908  DHRYKSLSELKSDESEPDGALATVLKVLKQAH 939


>gi|168012675|ref|XP_001759027.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162689726|gb|EDQ76096.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 389

 Score =  166 bits (421), Expect = 6e-38,   Method: Compositional matrix adjust.
 Identities = 91/208 (43%), Positives = 132/208 (63%), Gaps = 13/208 (6%)

Query: 911  EQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMG 970
            E +++   +KL LV+DLDHT+LNSA+F +V PV    +  + +      H+       +G
Sbjct: 3    ELRRVNKTKKLLLVVDLDHTVLNSARFADV-PVGMTWIAGELQAGGSSLHQ----MTKLG 57

Query: 971  MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1030
            +WTKLRP    FL+ ASKL+EM++YTMG + YA +MAK+LDP   LFA R+IS+ D    
Sbjct: 58   LWTKLRPFAHEFLQEASKLYEMYIYTMGERKYAKKMAKLLDPTRQLFADRIISQNDSTKR 117

Query: 1031 FDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLG 1090
            +        +KDL+ VLG +SAVVI+DD+  VWP +K NLI++ERY +F  S  QFG+  
Sbjct: 118  Y--------TKDLDVVLGADSAVVILDDTEAVWPSHKSNLILMERYHFFSSSCSQFGVNS 169

Query: 1091 PSLLEIDHDERSEDGTLASSLGVRQQLH 1118
             SL ++  DE   +GTLA++L   + +H
Sbjct: 170  ASLAQLYRDESETEGTLATTLKTLRAIH 197


>gi|226497696|ref|NP_001152445.1| CPL3 [Zea mays]
 gi|195656359|gb|ACG47647.1| CPL3 [Zea mays]
          Length = 531

 Score =  166 bits (420), Expect = 7e-38,   Method: Compositional matrix adjust.
 Identities = 90/206 (43%), Positives = 124/206 (60%), Gaps = 9/206 (4%)

Query: 913  KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMW 972
            K +   RKL L+LDLDHTL+NS K  ++     + L  +    ++ P+R +F    M M 
Sbjct: 153  KNLLRERKLVLILDLDHTLINSTKLQDISSAEKD-LGIQSAASKDDPNRSIFALDLMPML 211

Query: 973  TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFD 1032
            TKLRP +  FL+ AS +FEM++YTMG+K YA E+AK+LDP  + F  +VIS        D
Sbjct: 212  TKLRPFVREFLKEASNMFEMYIYTMGDKAYAIEIAKLLDPSNIYFPSKVISNS------D 265

Query: 1033 GDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPS 1092
              +R    K L+ +LG ES  VI+DD+  VW  +K NLI++ERY +F  S RQFG    S
Sbjct: 266  CTQR--HQKGLDVILGAESVAVILDDTEYVWQKHKENLILMERYHFFASSCRQFGFGVRS 323

Query: 1093 LLEIDHDERSEDGTLASSLGVRQQLH 1118
            L E   DER  DG LA+ L V +++H
Sbjct: 324  LSESLQDERESDGALATVLDVLKRIH 349


>gi|413945235|gb|AFW77884.1| CPL3 [Zea mays]
          Length = 533

 Score =  166 bits (419), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 90/206 (43%), Positives = 124/206 (60%), Gaps = 9/206 (4%)

Query: 913  KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMW 972
            K +   RKL L+LDLDHTL+NS K  ++     + L  +    ++ P+R +F    M M 
Sbjct: 155  KNLLRERKLVLILDLDHTLINSTKLQDISSAEKD-LGIQSAASKDDPNRSIFALDLMPML 213

Query: 973  TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFD 1032
            TKLRP +  FL+ AS +FEM++YTMG+K YA E+AK+LDP  + F  +VIS        D
Sbjct: 214  TKLRPFVREFLKEASNMFEMYIYTMGDKAYAIEIAKLLDPSNIYFPSKVISNS------D 267

Query: 1033 GDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPS 1092
              +R    K L+ +LG ES  VI+DD+  VW  +K NLI++ERY +F  S RQFG    S
Sbjct: 268  CTQR--HQKGLDVILGAESVAVILDDTEYVWQKHKENLILMERYHFFASSCRQFGFGVRS 325

Query: 1093 LLEIDHDERSEDGTLASSLGVRQQLH 1118
            L E   DER  DG LA+ L V +++H
Sbjct: 326  LSESLQDERESDGALATVLDVLKRIH 351


>gi|356564913|ref|XP_003550691.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            4-like [Glycine max]
          Length = 442

 Score =  166 bits (419), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 97/220 (44%), Positives = 129/220 (58%), Gaps = 11/220 (5%)

Query: 901  IQKERTRRLE--EQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREK 958
            +  E   RL   + K +   +KL LVLDLDHTLLNS    ++      +L + +      
Sbjct: 118  LHDEEISRLRNTDMKSLLGRKKLYLVLDLDHTLLNSTHLAQLTSEELHLLNQTDSLTNVS 177

Query: 959  PHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA 1018
                LF+  HM M TKLRP +  FL+ AS++FEM++YTMG++ YA EMAK+LDP+G  F 
Sbjct: 178  KGS-LFKLEHMNMMTKLRPFVRPFLKEASEMFEMYIYTMGDRPYALEMAKLLDPQGEYFN 236

Query: 1019 GRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTY 1078
             +VISR DDG            K L+ VLG ESAV+I+DD+   W  +K NLI++ERY +
Sbjct: 237  AKVISR-DDGTQ-------KHQKGLDVVLGQESAVIILDDTEHAWMKHKDNLILMERYHF 288

Query: 1079 FPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVRQQLH 1118
            F  S RQFG    SL E+  DE   DG LA  L V +Q+H
Sbjct: 289  FGSSCRQFGFNCKSLAELKSDEDETDGALAKILKVLKQVH 328


>gi|357501219|ref|XP_003620898.1| RNA polymerase II C-terminal domain phosphatase-like protein
            [Medicago truncatula]
 gi|355495913|gb|AES77116.1| RNA polymerase II C-terminal domain phosphatase-like protein
            [Medicago truncatula]
          Length = 720

 Score =  165 bits (418), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 97/217 (44%), Positives = 135/217 (62%), Gaps = 12/217 (5%)

Query: 903  KERTR-RLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR 961
            KE +R R  + K + + RKLCLVLDLDHTLLN+   H + P  +E+  K      E   +
Sbjct: 198  KEISRVRSRDVKNLLNRRKLCLVLDLDHTLLNTTSLHRLSP--EEMHLKTHTDSLEDISK 255

Query: 962  -HLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGR 1020
              LF   H+ + TKLRP + TFL+ AS++FEM++YTMG++ Y+ EMA++LDP+G  F  +
Sbjct: 256  GSLFMLEHVQVMTKLRPFVRTFLKEASEMFEMYIYTMGDRQYSLEMARLLDPQGEYFKDK 315

Query: 1021 VISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFP 1080
            VISR DDG            KDL+ VLG E+++VI+DD   VWP  + NLI++ERY +F 
Sbjct: 316  VISR-DDGTQ-------KNVKDLDLVLGTENSIVILDDKEEVWPKYRDNLILMERYHFFN 367

Query: 1081 CSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVRQQL 1117
             S + FGL   SL  ++ DE   DG LA  L V +Q+
Sbjct: 368  SSCQDFGLQCKSLAALNIDENEIDGALAKILEVLRQI 404


>gi|145334837|ref|NP_001078764.1| RNA polymerase II C-terminal domain phosphatase-like 4 [Arabidopsis
            thaliana]
 gi|122154038|sp|Q00IB6.1|CPL4_ARATH RecName: Full=RNA polymerase II C-terminal domain phosphatase-like 4;
            Short=FCP-like 4; AltName: Full=Carboxyl-terminal
            phosphatase-like 4; Short=AtCPL4; Short=CTD
            phosphatase-like 4
 gi|95115186|gb|ABF55959.1| carboxyl-terminal phosphatase-like 4 [Arabidopsis thaliana]
 gi|332009601|gb|AED96984.1| RNA polymerase II C-terminal domain phosphatase-like 4 [Arabidopsis
            thaliana]
          Length = 440

 Score =  164 bits (415), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 98/212 (46%), Positives = 124/212 (58%), Gaps = 30/212 (14%)

Query: 919  RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRH------------LFRF 966
            RKL LVLDLDHTLLN+    ++ P          E++  K H H            LF  
Sbjct: 121  RKLYLVLDLDHTLLNTTILRDLKP----------EEEYLKSHTHSLQDGCNVSGGSLFLL 170

Query: 967  PHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD 1026
              M M TKLRP + +FL+ AS++F M++YTMG++ YA +MAK+LDPKG  F  RVISR D
Sbjct: 171  EFMQMMTKLRPFVHSFLKEASEMFVMYIYTMGDRNYARQMAKLLDPKGEYFGDRVISR-D 229

Query: 1027 DGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQF 1086
            DG        V   K L+ VLG ESAV+I+DD+   WP +K NLIV+ERY +F  S RQF
Sbjct: 230  DGT-------VRHEKSLDVVLGQESAVLILDDTENAWPKHKDNLIVIERYHFFSSSCRQF 282

Query: 1087 GLLGPSLLEIDHDERSEDGTLASSLGVRQQLH 1118
                 SL E+  DE   DG LA+ L V +Q H
Sbjct: 283  DHRYKSLSELKSDESEPDGALATVLKVLKQAH 314


>gi|242087817|ref|XP_002439741.1| hypothetical protein SORBIDRAFT_09g019310 [Sorghum bicolor]
 gi|241945026|gb|EES18171.1| hypothetical protein SORBIDRAFT_09g019310 [Sorghum bicolor]
          Length = 547

 Score =  164 bits (415), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 87/211 (41%), Positives = 124/211 (58%), Gaps = 9/211 (4%)

Query: 908  RLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP 967
            R  + K +   RKL L+LDLDHTL+NS K   +     + L  +    ++ P+R +F   
Sbjct: 154  RCADLKNLLRERKLVLILDLDHTLINSTKLQNISSAEKD-LGIQTAASKDDPNRSIFALE 212

Query: 968  HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD 1027
             M + TKLRP +  FL+ AS +FEM++YTMG+K YA E+AK+LDP  + F  +VIS    
Sbjct: 213  SMQLLTKLRPFVREFLKEASNMFEMYIYTMGDKAYAIEIAKLLDPSNIYFPLKVIS---- 268

Query: 1028 GDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFG 1087
                + D      K L+ +LG  S  VI+DD+  VW  +K NLI++ERY +F  S R+FG
Sbjct: 269  ----NSDCTKRHQKGLDVILGAASVAVILDDTEFVWKKHKENLILMERYHFFASSCREFG 324

Query: 1088 LLGPSLLEIDHDERSEDGTLASSLGVRQQLH 1118
                SL E+  DER  DG LA+ L V +++H
Sbjct: 325  FAVRSLSELMQDERESDGALATVLDVLKRIH 355


>gi|449532013|ref|XP_004172979.1| PREDICTED: LOW QUALITY PROTEIN: RNA polymerase II C-terminal domain
            phosphatase-like 4-like, partial [Cucumis sativus]
          Length = 340

 Score =  164 bits (414), Expect = 3e-37,   Method: Composition-based stats.
 Identities = 97/211 (45%), Positives = 129/211 (61%), Gaps = 9/211 (4%)

Query: 908  RLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP 967
            R +E K++    KL LVLDLDHTLLNS +   +  V +E LR + +   +     LF   
Sbjct: 13   RNKEMKELLQRXKLILVLDLDHTLLNSTELRYLT-VEEEYLRSQTDSLDDVTKGSLFLLN 71

Query: 968  HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD 1027
             +   TKLRP + +FL+ ASKLFEM++YTMG + YA EMAK+LDPK   F+ +VISR DD
Sbjct: 72   SVHTMTKLRPFVHSFLKEASKLFEMYIYTMGERRYAFEMAKLLDPKKEYFSSKVISR-DD 130

Query: 1028 GDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFG 1087
            G            K L+ VLG ESAV+I+DD+   W  +K NLI++ERY +F  S RQFG
Sbjct: 131  GTQ-------KHQKGLDVVLGKESAVLILDDTENAWTKHKENLILMERYHFFASSCRQFG 183

Query: 1088 LLGPSLLEIDHDERSEDGTLASSLGVRQQLH 1118
                SL E+ +DE   DG L + L V +Q+H
Sbjct: 184  FNCKSLSELKNDESETDGALTTILKVLKQVH 214


>gi|357450477|ref|XP_003595515.1| RNA polymerase II C-terminal domain phosphatase-like protein
            [Medicago truncatula]
 gi|355484563|gb|AES65766.1| RNA polymerase II C-terminal domain phosphatase-like protein
            [Medicago truncatula]
          Length = 382

 Score =  162 bits (409), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 93/206 (45%), Positives = 127/206 (61%), Gaps = 11/206 (5%)

Query: 913  KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR-HLFRFPHMGM 971
            + + + RKLCLVLDLDHTLLN+   H + P  +E+  K      E   R  LF   H   
Sbjct: 70   RNLLNRRKLCLVLDLDHTLLNTTSLHRLSP--EEMHLKTCTDSLEDIARGRLFVLEHRQR 127

Query: 972  WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPF 1031
              KLRP + TFL+ ASK+FEM++YTMG++ Y+ EMA++LDP+G  F  +VISR DDG   
Sbjct: 128  MAKLRPFVRTFLKEASKMFEMYIYTMGDRRYSLEMARLLDPQGKFFKDKVISR-DDGTEM 186

Query: 1032 DGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGP 1091
                   K KDL  VLG ES+++I+DD+ +VW  +K NLI++ERY +F  S ++F L   
Sbjct: 187  -------KEKDLNLVLGTESSILILDDNKKVWRMHKDNLILMERYHFFNSSCQEFDLNCK 239

Query: 1092 SLLEIDHDERSEDGTLASSLGVRQQL 1117
            SL E+  DE   DG LA  L V + +
Sbjct: 240  SLAELHIDENETDGALARILKVLRHI 265


>gi|297793317|ref|XP_002864543.1| hypothetical protein ARALYDRAFT_332090 [Arabidopsis lyrata subsp.
            lyrata]
 gi|297310378|gb|EFH40802.1| hypothetical protein ARALYDRAFT_332090 [Arabidopsis lyrata subsp.
            lyrata]
          Length = 1006

 Score =  160 bits (404), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 94/211 (44%), Positives = 121/211 (57%), Gaps = 36/211 (17%)

Query: 919  RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRH---------------- 962
            RKL LVLDLDHTLLNS    ++ P          E++  K H H                
Sbjct: 691  RKLYLVLDLDHTLLNSTVLRDLKP----------EEEYLKSHTHSLQEPFDFLLISDVSG 740

Query: 963  --LFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGR 1020
              LF    M M TKLRP + +FL+ AS++F M++YTMG++ YA +MAK+LDP+G  F  R
Sbjct: 741  GSLFMLEFMHMMTKLRPFVHSFLKEASEMFVMYIYTMGDRAYARQMAKLLDPRGEYFGDR 800

Query: 1021 VISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFP 1080
            +ISR DDG        V   K L+ VLG ESAV+I+DD+   WP++K NLIV+ERY +F 
Sbjct: 801  IISR-DDGT-------VRHQKSLDVVLGQESAVLILDDTENAWPNHKDNLIVIERYHFFA 852

Query: 1081 CSRRQFGLLGPSLLEIDHDERSEDGTLASSL 1111
             S RQF     SL E+  DE   DG LA+ L
Sbjct: 853  SSCRQFDHKYKSLSELKSDESEPDGALATVL 883


>gi|308802003|ref|XP_003078315.1| CTD phosphatase-like protein 3 (ISS) [Ostreococcus tauri]
 gi|116056766|emb|CAL53055.1| CTD phosphatase-like protein 3 (ISS) [Ostreococcus tauri]
          Length = 480

 Score =  158 bits (400), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 88/212 (41%), Positives = 131/212 (61%), Gaps = 17/212 (8%)

Query: 904  ERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD------RE 957
            E+ +R EE++++    KL L+LDLDHTLLNSA+F E+     ++L +   Q+      RE
Sbjct: 142  EKAKR-EEKERVLKDGKLTLILDLDHTLLNSAQFKELTQEQHDLLHQCIAQEANGLAERE 200

Query: 958  KPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLF 1017
            +P  +  R  HMG +TKLRP ++ FLE  S++ + ++YTMG+K YA EM K++DP+G +F
Sbjct: 201  RPMLYCLR--HMGFFTKLRPHVFEFLEEVSQICQPYVYTMGDKAYAKEMVKLIDPEGKIF 258

Query: 1018 AGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYT 1077
             GRVIS        + D      KDL+ VLG E++ VI+DD+ RVWP N  NLI ++RY 
Sbjct: 259  HGRVIS--------NNDSTSSHVKDLDIVLGGETSAVIVDDTERVWPANHGNLIRLDRYH 310

Query: 1078 YFPCSRRQFGLLGPSLLEIDHDERSEDGTLAS 1109
            +FP S   F   G S++E    +  E G++ +
Sbjct: 311  FFPSSAASFQQKGQSVMERSMVDEGELGSMGA 342


>gi|125541461|gb|EAY87856.1| hypothetical protein OsI_09278 [Oryza sativa Indica Group]
          Length = 420

 Score =  157 bits (397), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 90/209 (43%), Positives = 119/209 (56%), Gaps = 15/209 (7%)

Query: 915  MFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTK 974
            +  ARKL LV+DLDHTL+NS +F  +    DE      E+  +   R LFR     M TK
Sbjct: 101  LLRARKLILVVDLDHTLINSTRFAHLSD--DEKANGFTERTGDDRSRGLFRMGLFRMITK 158

Query: 975  LRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGD 1034
            LRP +  FL  AS +FEMH+YT+GN+ YAT +AK+LDP G  F  R+IS G+        
Sbjct: 159  LRPFVHEFLREASAMFEMHVYTLGNRNYATAVAKLLDPDGAYFGERIISSGESSQ----- 213

Query: 1035 ERVPKSKDLEGVLGM-----ESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLL 1089
               P  K L  V G       +AVVI+DD+  VW   + NLI +ERY YF  SR +FG+ 
Sbjct: 214  ---PDRKSLGDVFGWAPEMERAAVVILDDTAEVWKGYRDNLIEMERYLYFASSRGKFGIA 270

Query: 1090 GPSLLEIDHDERSEDGTLASSLGVRQQLH 1118
              SL E + DE   +G LA +L V +++H
Sbjct: 271  ARSLAERNRDESEREGALAVALRVLRRVH 299


>gi|145344421|ref|XP_001416731.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144576957|gb|ABO95024.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 248

 Score =  156 bits (395), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 89/212 (41%), Positives = 133/212 (62%), Gaps = 17/212 (8%)

Query: 904  ERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEV-----DPVHDEILRKKEE-QDRE 957
            E+ +R EE+ ++    KL L+LDLDHTLLNS +F E+     D +H+ I R+ E  ++ +
Sbjct: 14   EKAKR-EEKARVLQNGKLTLILDLDHTLLNSTQFKELTQEQHDLLHECIAREAEGLKEGQ 72

Query: 958  KPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLF 1017
            +P  +  R  HMG +TKLRP ++ FLE  SK+ + ++YTMG+K YA EM K++DP+G +F
Sbjct: 73   RPMLYCLR--HMGFFTKLRPHVFEFLESVSKICQPYVYTMGDKPYAREMVKLIDPEGTIF 130

Query: 1018 AGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYT 1077
             GRVIS        + D      KDL+ VLG E++ +I+DD+ RVWP N+ NLI ++RY 
Sbjct: 131  HGRVIS--------NNDSTSSHVKDLDIVLGGEASAIIVDDTERVWPQNQGNLIRLDRYH 182

Query: 1078 YFPCSRRQFGLLGPSLLEIDHDERSEDGTLAS 1109
            +FP S   F   G S++E    +  E G++ S
Sbjct: 183  FFPGSASSFQQKGQSVMESSMVDEGELGSVGS 214


>gi|145346053|ref|XP_001417510.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144577737|gb|ABO95803.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 643

 Score =  156 bits (395), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 89/217 (41%), Positives = 129/217 (59%), Gaps = 15/217 (6%)

Query: 913  KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILR----------KKEEQDREKPHRH 962
            +++ ++RKL LVLDLDHTLLNS    ++  +    LR          K+ E   +   R 
Sbjct: 302  ERLIASRKLALVLDLDHTLLNSVLVPDL-RMDSNWLRNAMRLLDADVKRAEDANDPLKRS 360

Query: 963  LFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVI 1022
            +F   H  + TKLRPG+  FLERAS+LFE+H+ TMG++ YA +M ++LDP+     G V 
Sbjct: 361  VFHLQHFDLLTKLRPGVRRFLERASRLFEIHINTMGSQAYADQMVELLDPEKRWIHGTV- 419

Query: 1023 SRGDDGDPFDGDERVPKSKDLEGVL-GMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPC 1081
             RG  G+   G    P  K L+G L  +  A +I DD+  VW  ++ NL+  ERY +FP 
Sbjct: 420  -RGL-GEMEGGKLWAPAEKTLDGALEHLADACLIFDDTASVWESHRRNLVTCERYLFFPQ 477

Query: 1082 SRRQFGLLGPSLLEIDHDERSEDGTLASSLGVRQQLH 1118
            +RRQFGL G SLLEI  DE  ++G L++++ V + +H
Sbjct: 478  ARRQFGLSGMSLLEIGQDESEDEGMLSTAMKVFESVH 514


>gi|224142399|ref|XP_002324546.1| predicted protein [Populus trichocarpa]
 gi|222865980|gb|EEF03111.1| predicted protein [Populus trichocarpa]
          Length = 312

 Score =  156 bits (395), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 91/208 (43%), Positives = 126/208 (60%), Gaps = 9/208 (4%)

Query: 911  EQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMG 970
            + K +   +KL L+LDLDHTLLNS +   +  + +E L  + +  ++     LF    M 
Sbjct: 3    DMKNLLRHKKLYLILDLDHTLLNSTQLMHM-TLDEEYLNGQTDSLQDVSKGSLFMLSSMQ 61

Query: 971  MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1030
            M TKLRP + TFL+ AS++FEM++YTMG++ YA EMAK+LDP    F  +VISR      
Sbjct: 62   MMTKLRPFVRTFLKEASQMFEMYIYTMGDRAYALEMAKLLDPGREYFNAKVISRD----- 116

Query: 1031 FDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLG 1090
             DG +R  K  D+  VLG ESAV+I+DD+   W  +K NLI++ERY +F  S  QFG   
Sbjct: 117  -DGTQRHQKGLDV--VLGQESAVLILDDTENAWMKHKDNLILMERYHFFASSCHQFGFNC 173

Query: 1091 PSLLEIDHDERSEDGTLASSLGVRQQLH 1118
             SL E   DE   +G LAS L V +++H
Sbjct: 174  KSLSEQKTDESESEGALASILKVLRKIH 201


>gi|47497024|dbj|BAD19077.1| phosphatase-like [Oryza sativa Japonica Group]
 gi|47497233|dbj|BAD19278.1| phosphatase-like [Oryza sativa Japonica Group]
 gi|125584004|gb|EAZ24935.1| hypothetical protein OsJ_08715 [Oryza sativa Japonica Group]
          Length = 420

 Score =  155 bits (393), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 90/209 (43%), Positives = 119/209 (56%), Gaps = 15/209 (7%)

Query: 915  MFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTK 974
            +  ARKL LV+DLDHTL+NS +F  +    DE      E+  +   R LFR     M TK
Sbjct: 101  LLRARKLILVVDLDHTLINSTRFAHLSD--DEKANGFTERTGDDRSRGLFRMGLFRMITK 158

Query: 975  LRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGD 1034
            LRP +  FL  AS +FEMH+YT+GN+ YAT +AK+LDP G  F  R+IS G+        
Sbjct: 159  LRPFVHEFLREASAMFEMHVYTLGNRNYATAVAKLLDPDGAYFGERIISSGESSQ----- 213

Query: 1035 ERVPKSKDLEGVLGM-----ESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLL 1089
               P  K L  V G       +AVVI+DD+  VW   + NLI +ERY YF  SR +FG+ 
Sbjct: 214  ---PDRKSLGDVFGWAPEMERAAVVILDDTAEVWKGYRDNLIEMERYLYFASSRGKFGIA 270

Query: 1090 GPSLLEIDHDERSEDGTLASSLGVRQQLH 1118
              SL E + DE   +G LA +L V +++H
Sbjct: 271  VRSLAERNRDESEREGALAVALRVLRRVH 299


>gi|308802952|ref|XP_003078789.1| putative transcription regulator CPL1 (ISS) [Ostreococcus tauri]
 gi|116057242|emb|CAL51669.1| putative transcription regulator CPL1 (ISS) [Ostreococcus tauri]
          Length = 457

 Score =  153 bits (386), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 88/219 (40%), Positives = 124/219 (56%), Gaps = 14/219 (6%)

Query: 911  EQKKMFSARKLCLVLDLDHTLLNSAKF----HEVDPVHDEILR-----KKEEQDREKPHR 961
            E  ++  ARKL LVLDLDHTLLNS        E + + + +        + E+  +   R
Sbjct: 121  ELSRLIKARKLALVLDLDHTLLNSVLVPSLRTEANSLQNAMRLLDHDVARAERTGDPLQR 180

Query: 962  HLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
              F  PH  ++TKLRPG+ +FLERASKLFE+H+ TMG++ YA +M  +LDP      G V
Sbjct: 181  SCFHLPHFDLFTKLRPGVRSFLERASKLFEIHISTMGSQAYADQMVALLDPAKKWINGTV 240

Query: 1022 ISRGDDGDPFDGDERVPKSKDLE--GVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
               G+     +G    P+ K L+  G+  +    VI DD+  VW  N  +L   ERY +F
Sbjct: 241  KGLGEME---NGRLIAPRYKSLDDCGLGELTDVSVIFDDTTDVWAQNLKSLFTCERYLFF 297

Query: 1080 PCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVRQQLH 1118
            P +RRQFGLLG SLLE+  DE   +G L +++ V + +H
Sbjct: 298  PQARRQFGLLGSSLLEVGQDESESEGMLMTAINVFESVH 336


>gi|242063380|ref|XP_002452979.1| hypothetical protein SORBIDRAFT_04g035920 [Sorghum bicolor]
 gi|241932810|gb|EES05955.1| hypothetical protein SORBIDRAFT_04g035920 [Sorghum bicolor]
          Length = 518

 Score =  149 bits (375), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 92/215 (42%), Positives = 122/215 (56%), Gaps = 15/215 (6%)

Query: 908  RLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRF- 966
            R+ + + +  ARKL L+LDLDHTLLNS    ++ P   E         +  P   LFR  
Sbjct: 195  RVSDLETLLRARKLTLILDLDHTLLNSTGLDDLSPA--EQANGLTRHTKGDPTAGLFRLG 252

Query: 967  -PHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG 1025
                 M TKLRP    FLE+AS +FEM +YT+G++ YA  + K+LDP G  F GRV+S  
Sbjct: 253  RARFRMLTKLRPFARGFLEQASAMFEMSVYTLGDRGYARAVVKLLDPDGAYFGGRVVS-- 310

Query: 1026 DDGDPFDGDERVPK-SKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERYTYFPCSR 1083
                    DE   +  K L+ V G E+A VVI+DDS  VWP ++ NLIV++RY YF  S 
Sbjct: 311  -------SDESTRRDRKSLDVVPGAEAAAVVILDDSSHVWPEHQENLIVMDRYLYFADSC 363

Query: 1084 RQFGLLGPSLLEIDHDERSEDGTLASSLGVRQQLH 1118
            R +G    SL E+  DER  DG LA +L V  ++H
Sbjct: 364  RTYGCGVSSLAELRRDEREHDGALAVALQVLTRVH 398


>gi|242063378|ref|XP_002452978.1| hypothetical protein SORBIDRAFT_04g035900 [Sorghum bicolor]
 gi|241932809|gb|EES05954.1| hypothetical protein SORBIDRAFT_04g035900 [Sorghum bicolor]
          Length = 464

 Score =  147 bits (372), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 87/204 (42%), Positives = 124/204 (60%), Gaps = 14/204 (6%)

Query: 919  RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPH---MGMWTKL 975
            RKL LVLDLDHTLLNS +  ++  +        + +D  + H  LFR  +   + M TKL
Sbjct: 155  RKLILVLDLDHTLLNSTRLQDLSALEQRNGFTPDTED--ELHMELFRLEYSDNVRMLTKL 212

Query: 976  RPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDE 1035
            RP +  FL++AS  FEMH+YT+G + YA  +  +LDP GV F GRV+SR +        +
Sbjct: 213  RPFVRGFLDQASSRFEMHVYTLGRQDYAKAVIDLLDPDGVYFRGRVVSRKE------STQ 266

Query: 1036 RVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLL 1094
            R  KS D+  + G + +AVVI+DD+   WP ++ NLI+++RY YF C+ R+F    PS+ 
Sbjct: 267  RDVKSLDV--IPGADPAAVVILDDTDSAWPGHQDNLILMDRYHYFACTCRKFRYNIPSMA 324

Query: 1095 EIDHDERSEDGTLASSLGVRQQLH 1118
            E   DER  DG+LA  LGV  ++H
Sbjct: 325  EQARDEREHDGSLAVVLGVLNRIH 348


>gi|302769312|ref|XP_002968075.1| hypothetical protein SELMODRAFT_67516 [Selaginella moellendorffii]
 gi|300163719|gb|EFJ30329.1| hypothetical protein SELMODRAFT_67516 [Selaginella moellendorffii]
          Length = 141

 Score =  146 bits (368), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 74/145 (51%), Positives = 98/145 (67%), Gaps = 8/145 (5%)

Query: 974  KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDG 1033
            KLRP    FLE ASKLFEM++YTMG ++YA  MA +LDP G  F GRVIS+ D       
Sbjct: 1    KLRPFAHKFLEEASKLFEMYVYTMGERMYAVTMAHLLDPTGKFFKGRVISQRDST----- 55

Query: 1034 DERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSL 1093
                 ++KDL+ VLG +SAV+I+DD+  VWP ++ NLIV+ERY +F  S RQFGL  PSL
Sbjct: 56   ---CRQTKDLDIVLGADSAVLILDDTEAVWPKHRANLIVMERYHFFQSSCRQFGLENPSL 112

Query: 1094 LEIDHDERSEDGTLASSLGVRQQLH 1118
             + + DE  ++G LA+ L V Q++H
Sbjct: 113  TKAERDESKDEGALANVLKVLQRIH 137


>gi|359494894|ref|XP_003634864.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            4-like [Vitis vinifera]
          Length = 278

 Score =  144 bits (364), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 78/156 (50%), Positives = 100/156 (64%), Gaps = 8/156 (5%)

Query: 963  LFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVI 1022
            LF    M M TKLRP + TFL+ ASK+FEM++YTMG + YA EMAK+LDP+ V F+ RVI
Sbjct: 9    LFMLNTMHMLTKLRPYVHTFLKEASKMFEMYIYTMGERSYALEMAKLLDPERVYFSSRVI 68

Query: 1023 SRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCS 1082
            S+ D              K L+ VLG ESAV+I+DD+  VW  +K NLI++ERY +F  S
Sbjct: 69   SQADCTQ--------RHQKGLDVVLGQESAVLILDDTESVWQKHKDNLILMERYHFFASS 120

Query: 1083 RRQFGLLGPSLLEIDHDERSEDGTLASSLGVRQQLH 1118
             RQFG    SL E+  DE   DG LA+ L V Q++H
Sbjct: 121  CRQFGFNCKSLSELKSDESEPDGALATVLKVLQRIH 156


>gi|359497210|ref|XP_003635453.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            4-like [Vitis vinifera]
          Length = 278

 Score =  144 bits (364), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 78/156 (50%), Positives = 100/156 (64%), Gaps = 8/156 (5%)

Query: 963  LFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVI 1022
            LF    M M TKLRP + TFL+ ASK+FEM++YTMG + YA EMAK+LDP+ V F+ RVI
Sbjct: 9    LFMLNTMHMLTKLRPYVHTFLKEASKMFEMYIYTMGERSYALEMAKLLDPERVYFSSRVI 68

Query: 1023 SRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCS 1082
            S+ D              K L+ VLG ESAV+I+DD+  VW  +K NLI++ERY +F  S
Sbjct: 69   SQADCTQ--------RHQKGLDVVLGQESAVLILDDTESVWQKHKDNLILMERYHFFASS 120

Query: 1083 RRQFGLLGPSLLEIDHDERSEDGTLASSLGVRQQLH 1118
             RQFG    SL E+  DE   DG LA+ L V Q++H
Sbjct: 121  CRQFGFNCKSLSELKSDESEPDGALATVLKVLQRIH 156


>gi|424513770|emb|CCO66392.1| predicted protein [Bathycoccus prasinos]
          Length = 546

 Score =  144 bits (362), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 90/233 (38%), Positives = 125/233 (53%), Gaps = 37/233 (15%)

Query: 876  GAHPQSAWGDVEHLFEGYDDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSA 935
            GA+ ++A   V +L EG     K      R  + EE+    +  KL LVLDLDHTLLNS 
Sbjct: 147  GANAETA---VRYLHEGLTVSDKLL----REAKNEERMATLNQGKLFLVLDLDHTLLNSC 199

Query: 936  KFHEVDPVHDEIL-RKKEEQDREKPHRH---------------------LFRFPHMGMWT 973
            +F E++    E L RK E+++ E   R                      L+   H   +T
Sbjct: 200  RFDELNDEERESLDRKVEKREEEDELRSKLLGLVGGGDAGGGRRPRFPDLYCLSHFSTYT 259

Query: 974  KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDG 1033
            KLRP ++ FLE+ASK+  MH+YTMG+K YA EMA ++DP+G  F GR+I         + 
Sbjct: 260  KLRPYVFEFLEQASKICRMHVYTMGDKNYAHEMASLIDPEGKYFHGRIIG--------NS 311

Query: 1034 DERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQF 1086
            D    K+KDL+ VLG +   +I+DD+ RVWP +  NLI V+RY +F  S   F
Sbjct: 312  DSTCSKTKDLDIVLGGDDCTMIVDDTSRVWPRHARNLIRVDRYHFFRKSATSF 364


>gi|296088193|emb|CBI35709.3| unnamed protein product [Vitis vinifera]
          Length = 638

 Score =  144 bits (362), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 77/150 (51%), Positives = 100/150 (66%), Gaps = 8/150 (5%)

Query: 969  MGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 1028
            M M TKLRP + TFL+ ASK+FEM++YTMG + YA EMAK+LDP+ V F+ RVIS+    
Sbjct: 1    MHMLTKLRPYVHTFLKEASKMFEMYIYTMGERSYALEMAKLLDPERVYFSSRVISQA--- 57

Query: 1029 DPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGL 1088
               D  +R    K L+ VLG ESAV+I+DD+  VW  +K NLI++ERY +F  S RQFG 
Sbjct: 58   ---DCTQR--HQKGLDVVLGQESAVLILDDTESVWQKHKDNLILMERYHFFASSCRQFGF 112

Query: 1089 LGPSLLEIDHDERSEDGTLASSLGVRQQLH 1118
               SL E+  DE   DG LA+ L V Q++H
Sbjct: 113  NCKSLSELKSDESEPDGALATVLKVLQRIH 142


>gi|147774299|emb|CAN76945.1| hypothetical protein VITISV_002430 [Vitis vinifera]
          Length = 641

 Score =  143 bits (360), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 77/150 (51%), Positives = 100/150 (66%), Gaps = 8/150 (5%)

Query: 969  MGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 1028
            M M TKLRP + TFL+ ASK+FEM++YTMG + YA EMAK+LDP+ V F+ RVIS+    
Sbjct: 1    MHMLTKLRPYVHTFLKEASKMFEMYIYTMGERSYALEMAKLLDPERVYFSSRVISQA--- 57

Query: 1029 DPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGL 1088
               D  +R    K L+ VLG ESAV+I+DD+  VW  +K NLI++ERY +F  S RQFG 
Sbjct: 58   ---DCTQR--HQKGLDVVLGQESAVLILDDTESVWQKHKDNLILMERYHFFASSCRQFGF 112

Query: 1089 LGPSLLEIDHDERSEDGTLASSLGVRQQLH 1118
               SL E+  DE   DG LA+ L V Q++H
Sbjct: 113  NCKSLSELKSDESEPDGALATVLKVLQRIH 142


>gi|296090640|emb|CBI41034.3| unnamed protein product [Vitis vinifera]
          Length = 264

 Score =  143 bits (360), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 76/150 (50%), Positives = 98/150 (65%), Gaps = 8/150 (5%)

Query: 969  MGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 1028
            M M TKLRP + TFL+ ASK+FEM++YTMG + YA EMAK+LDP+ V F+ RVIS+ D  
Sbjct: 1    MHMLTKLRPYVHTFLKEASKMFEMYIYTMGERSYALEMAKLLDPERVYFSSRVISQADCT 60

Query: 1029 DPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGL 1088
                        K L+ VLG ESAV+I+DD+  VW  +K NLI++ERY +F  S RQFG 
Sbjct: 61   Q--------RHQKGLDVVLGQESAVLILDDTESVWQKHKDNLILMERYHFFASSCRQFGF 112

Query: 1089 LGPSLLEIDHDERSEDGTLASSLGVRQQLH 1118
               SL E+  DE   DG LA+ L V Q++H
Sbjct: 113  NCKSLSELKSDESEPDGALATVLKVLQRIH 142


>gi|302793512|ref|XP_002978521.1| hypothetical protein SELMODRAFT_418187 [Selaginella moellendorffii]
 gi|300153870|gb|EFJ20507.1| hypothetical protein SELMODRAFT_418187 [Selaginella moellendorffii]
          Length = 346

 Score =  142 bits (358), Expect = 1e-30,   Method: Composition-based stats.
 Identities = 91/217 (41%), Positives = 121/217 (55%), Gaps = 15/217 (6%)

Query: 908  RLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRH--LFR 965
            R E  +K+   +KL LVLDLDHTLLNSA F +VD      L K  +   + P R   L +
Sbjct: 25   RKEYTQKVLQQQKLILVLDLDHTLLNSASFSKVDEEERLYLEKIYDWQEKAPKRRKLLHK 84

Query: 966  FPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG 1025
               + +WTK+RP  + FLE ASK F++H+YT G ++YA  MAK+LDP G LF G + SR 
Sbjct: 85   VESLQVWTKIRPFAFKFLEEASKFFDLHIYTNGREIYAETMAKLLDPTGSLFKGHIFSR- 143

Query: 1026 DDGDPFDGDERVPKS-KDLEGVLGMESAVVIIDDSVRVWP--HNKLNLIVVERYTYFPCS 1082
                    D    K+ KDL+ V G ES  +I+DDS  VWP  H+K  + V +RY +F  S
Sbjct: 144  --------DHNCMKAMKDLDTVPGDESITLIVDDSDCVWPKKHHKNLIPVYDRYLFFRSS 195

Query: 1083 RRQFGLL-GPSLLEIDHDERSEDGTLASSLGVRQQLH 1118
               FGL    SL     DE +   TLA  L   +++H
Sbjct: 196  TGLFGLRESSSLTSKKKDEVATKATLAKLLEGLKRIH 232


>gi|255080370|ref|XP_002503765.1| predicted protein [Micromonas sp. RCC299]
 gi|226519032|gb|ACO65023.1| predicted protein [Micromonas sp. RCC299]
          Length = 574

 Score =  142 bits (357), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 97/267 (36%), Positives = 128/267 (47%), Gaps = 71/267 (26%)

Query: 904  ERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEV----------------------- 940
            E+ +R E+++ + S R L LVLDLDHTLLNSA+F E+                       
Sbjct: 100  EKAKREEKRRILLSGR-LVLVLDLDHTLLNSARFSELSQEEHYAMHRIIAAADCEANGGS 158

Query: 941  ---------------DPVHDEILRKKEE-----------------QDREKPH-------R 961
                           DPV  E   +KE+                 + R+ P        R
Sbjct: 159  KEEVQQAAAAIQPVEDPVAAESAEEKEDGADVDGEKGEEEAGGKERARDGPFPGTDPPLR 218

Query: 962  HLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
            HL    HM M+TKLRP    FL  AS+L  M++YTMG++ YA EMAK+LDP G LF GRV
Sbjct: 219  HLNCLRHMAMFTKLRPHAHAFLRAASQLCTMYIYTMGDRNYAREMAKLLDPTGELFNGRV 278

Query: 1022 ISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPC 1081
            I  GD    +         KDL+ VLG E  V+I DD+ RVWP N  NLI ++RY +F  
Sbjct: 279  IGSGDSTSQY--------KKDLDIVLGAEPTVLITDDTDRVWPKNLANLIRIDRYHFFKQ 330

Query: 1082 SRRQFGLLGPSLLEIDHDERSEDGTLA 1108
            S   F   G S++E    +  ++G  A
Sbjct: 331  SAAGFRQPGRSVMERQWRDEGDNGDRA 357


>gi|302774062|ref|XP_002970448.1| hypothetical protein SELMODRAFT_411029 [Selaginella moellendorffii]
 gi|300161964|gb|EFJ28578.1| hypothetical protein SELMODRAFT_411029 [Selaginella moellendorffii]
          Length = 346

 Score =  142 bits (357), Expect = 2e-30,   Method: Composition-based stats.
 Identities = 90/217 (41%), Positives = 121/217 (55%), Gaps = 15/217 (6%)

Query: 908  RLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRH--LFR 965
            R E  +K+   +KL LVLDLDHTLLNSA F +VD      L K  +   + P R   L +
Sbjct: 25   RKEYTQKVLQQQKLILVLDLDHTLLNSASFSKVDEEERLYLEKIYDWQEKAPKRRKLLHK 84

Query: 966  FPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG 1025
               + +WTK+RP  + FLE ASK F++H+YT G ++YA  MAK+LDP G LF G + SR 
Sbjct: 85   VESLQVWTKIRPFAFKFLEEASKFFDLHIYTNGREIYAETMAKLLDPTGSLFKGHIFSR- 143

Query: 1026 DDGDPFDGDERVPKS-KDLEGVLGMESAVVIIDDSVRVWP--HNKLNLIVVERYTYFPCS 1082
                    D    K+ KDL+ V G ES  +I+DDS  VWP  H+K  + V ++Y +F  S
Sbjct: 144  --------DHNCMKAMKDLDTVPGDESITLIVDDSDYVWPKKHHKNLIPVYDQYRFFRSS 195

Query: 1083 RRQFGLL-GPSLLEIDHDERSEDGTLASSLGVRQQLH 1118
               FGL    SL     DE +   TLA  L   +++H
Sbjct: 196  TGLFGLRESSSLTSKKKDEVATKATLAKLLEGLKRIH 232


>gi|226498568|ref|NP_001149751.1| CPL3 [Zea mays]
 gi|195631558|gb|ACG36674.1| CPL3 [Zea mays]
          Length = 493

 Score =  139 bits (350), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 85/207 (41%), Positives = 115/207 (55%), Gaps = 13/207 (6%)

Query: 915  MFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGM--- 971
            +   RKL LVLDLD TL+NSA+    D    E          +KPH  LFR  +      
Sbjct: 156  LMRERKLILVLDLDSTLVNSARL--CDFSAQEKRNGFTRYTGDKPHMDLFRLKYSNKARK 213

Query: 972  WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPF 1031
             TKLRP +  FLE+AS +FEMH+YT+  + YA  +  +LDP GV F GRV+SR       
Sbjct: 214  LTKLRPFVRGFLEQASSMFEMHVYTLAKRAYAKAVIDLLDPNGVYFGGRVVSRK------ 267

Query: 1032 DGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGP 1091
            D   R  KS D+  + G +   V+I D   VWP ++ NLI+++RY YF  + R+F    P
Sbjct: 268  DSTRRDMKSLDV--IPGADPVAVVILDDTDVWPAHQDNLILMDRYHYFASTCRKFRYDIP 325

Query: 1092 SLLEIDHDERSEDGTLASSLGVRQQLH 1118
            SL E   DER +D +LA  L V +++H
Sbjct: 326  SLAEQGRDEREQDNSLAVVLNVLRRIH 352


>gi|413924219|gb|AFW64151.1| hypothetical protein ZEAMMB73_480827 [Zea mays]
          Length = 490

 Score =  138 bits (348), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 85/207 (41%), Positives = 115/207 (55%), Gaps = 13/207 (6%)

Query: 915  MFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGM--- 971
            +   RKL LVLDLD TL+NSA+    D    E          +KPH  LFR  +      
Sbjct: 153  LMRERKLILVLDLDSTLVNSARL--CDFSAQEKRNGFTRYTGDKPHMDLFRLKYSNKARK 210

Query: 972  WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPF 1031
             TKLRP +  FLE+AS +FEMH+YT+  + YA  +  +LDP GV F GRV+SR       
Sbjct: 211  LTKLRPFVRGFLEQASSMFEMHVYTLAKRAYAKAVIDLLDPNGVYFGGRVVSRK------ 264

Query: 1032 DGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGP 1091
            D   R  KS D+  + G +   V+I D   VWP ++ NLI+++RY YF  + R+F    P
Sbjct: 265  DSTRRDMKSLDV--IPGADPVAVVILDDTDVWPAHQDNLILMDRYHYFASTCRKFRYDIP 322

Query: 1092 SLLEIDHDERSEDGTLASSLGVRQQLH 1118
            SL E   DER +D +LA  L V +++H
Sbjct: 323  SLAEQGRDEREQDNSLAVVLNVLRRIH 349


>gi|218196728|gb|EEC79155.1| hypothetical protein OsI_19828 [Oryza sativa Indica Group]
          Length = 430

 Score =  138 bits (347), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 74/150 (49%), Positives = 94/150 (62%), Gaps = 8/150 (5%)

Query: 969  MGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 1028
            M M TKLRP +  FL+ AS +FEM++YTMG+K YA E+AK+LDP  V F  +VIS  D  
Sbjct: 1    MQMLTKLRPFVRRFLKEASDMFEMYIYTMGDKAYAIEIAKLLDPDNVYFGSKVISNSD-- 58

Query: 1029 DPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGL 1088
                  +R  K  D+  VLG ES  VI+DD+  VW  +K NLI++ERY YF  S RQFG 
Sbjct: 59   ----CTQRHQKGLDV--VLGDESVAVILDDTEYVWQKHKENLILMERYHYFASSCRQFGF 112

Query: 1089 LGPSLLEIDHDERSEDGTLASSLGVRQQLH 1118
               SL E   DER  DG LA+ L V +++H
Sbjct: 113  GARSLSETMQDERENDGALATILDVLERIH 142


>gi|302816075|ref|XP_002989717.1| hypothetical protein SELMODRAFT_23521 [Selaginella moellendorffii]
 gi|302824047|ref|XP_002993670.1| hypothetical protein SELMODRAFT_23523 [Selaginella moellendorffii]
 gi|300138493|gb|EFJ05259.1| hypothetical protein SELMODRAFT_23523 [Selaginella moellendorffii]
 gi|300142494|gb|EFJ09194.1| hypothetical protein SELMODRAFT_23521 [Selaginella moellendorffii]
          Length = 312

 Score =  137 bits (346), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 89/203 (43%), Positives = 119/203 (58%), Gaps = 9/203 (4%)

Query: 919  RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLF-RFPHMGMWTKLRP 977
            RKL LVLDLDHTL+NSA F EV       L     +D  K    L  +   + +WTK+RP
Sbjct: 4    RKLMLVLDLDHTLVNSASFDEVCAEEKPFLESMYARDPPKGRSKLLHKLDDLQLWTKIRP 63

Query: 978  GIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERV 1037
                FL +ASKLF++++YTMG ++YA  M K+LDP GVLF G ++SR +D D  D  +R 
Sbjct: 64   FALEFLAQASKLFDLYVYTMGTRIYAEAMLKLLDPTGVLFKG-LVSR-NDNDLTDHRDR- 120

Query: 1038 PKSKDLEGVLGMESAVVIIDDSVRVWPHNKL-NLIVVERYTYFPCSRRQFGL-LGPSLLE 1095
               KDL+ VLG ES+V+I+DD    WP  +  NLI ++RY +F  S + FG     SL  
Sbjct: 121  ---KDLDTVLGQESSVLIVDDLPEAWPEEQHKNLIQIDRYHFFSSSCKSFGFDESSSLAR 177

Query: 1096 IDHDERSEDGTLASSLGVRQQLH 1118
               DE    G+LAS L   + +H
Sbjct: 178  RGIDESHSGGSLASLLQGLETIH 200


>gi|226498676|ref|NP_001145873.1| hypothetical protein [Zea mays]
 gi|219884795|gb|ACL52772.1| unknown [Zea mays]
 gi|413939308|gb|AFW73859.1| hypothetical protein ZEAMMB73_968817 [Zea mays]
          Length = 425

 Score =  133 bits (334), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 81/217 (37%), Positives = 118/217 (54%), Gaps = 25/217 (11%)

Query: 911  EQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRH------LF 964
            ++  +   RKL L+LDLDHTLLNS   +++ PV        E+     P+        LF
Sbjct: 202  DRATLMRERKLILILDLDHTLLNSTSLYDLSPV--------EQAKGFTPYTFGDTSIDLF 253

Query: 965  R--FPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVI 1022
            R    ++ M  KL      FL++A+ LFEMH+YT+G + YA    ++LDP G+ F GR++
Sbjct: 254  RVDIDNLSMLVKLGAFARGFLKQANALFEMHVYTLGIRAYARAAVRLLDPNGIYFGGRIV 313

Query: 1023 SRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERYTYFPC 1081
            SR +             +K L+ + G + A VVI+DD+  VWP    NLI+++RY YF  
Sbjct: 314  SRNE--------STKENTKSLDVIQGADPAMVVILDDTDGVWPGYPDNLILMDRYRYFAS 365

Query: 1082 SRRQFGLLGPSLLEIDHDERSEDGTLASSLGVRQQLH 1118
            + R F    PSL E   +ER  DG+LA  LG  Q++H
Sbjct: 366  TCRTFDYDIPSLAEQGLEEREHDGSLAVVLGALQRIH 402


>gi|224142401|ref|XP_002324547.1| predicted protein [Populus trichocarpa]
 gi|222865981|gb|EEF03112.1| predicted protein [Populus trichocarpa]
          Length = 266

 Score =  132 bits (332), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 73/150 (48%), Positives = 96/150 (64%), Gaps = 8/150 (5%)

Query: 969  MGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 1028
            M M TKLRP + TFL+ AS++FEM++YTMG++ YA EMAK+LDP    F  +VISR    
Sbjct: 5    MQMMTKLRPFVRTFLKEASQMFEMYIYTMGDRAYALEMAKLLDPGREYFNAKVISRD--- 61

Query: 1029 DPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGL 1088
               DG +R  K  D+  VLG ESAV+I+DD+   W  +K NLI++ERY +F  S  QFG 
Sbjct: 62   ---DGTQRHQKGLDV--VLGQESAVLILDDTENAWMKHKDNLILMERYHFFASSCHQFGF 116

Query: 1089 LGPSLLEIDHDERSEDGTLASSLGVRQQLH 1118
               SL E   DE   +G LAS L V +++H
Sbjct: 117  NCKSLSEQKTDESESEGALASILKVLRKIH 146


>gi|15224433|ref|NP_178570.1| haloacid dehalogenase-like hydrolase domain-containing protein
            [Arabidopsis thaliana]
 gi|4585924|gb|AAD25584.1| hypothetical protein [Arabidopsis thaliana]
 gi|330250795|gb|AEC05889.1| haloacid dehalogenase-like hydrolase domain-containing protein
            [Arabidopsis thaliana]
          Length = 277

 Score =  128 bits (322), Expect = 2e-26,   Method: Composition-based stats.
 Identities = 80/243 (32%), Positives = 131/243 (53%), Gaps = 25/243 (10%)

Query: 880  QSAWGDVEHLFEGYDDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHE 939
            +S +   +++F+G     +A      T+ L  +    + +KL LVLDLDHTLL+S     
Sbjct: 29   KSQFRKFDYIFKGLQLSNEAV---ALTKSLTTKHSCLNEKKLHLVLDLDHTLLHSKLVSN 85

Query: 940  VDPVHDEILRKKEEQDREKPHRHLFRFPHMG----MWTKLRPGIWTFLERASKLFEMHLY 995
            +      ++++   + RE     L++F  +G       KLRP +  FL+ A+++F M +Y
Sbjct: 86   LSQAERYLIQEASSRTRED----LWKFRPIGHPIDRLIKLRPFVRDFLKEANEMFTMFVY 141

Query: 996  TMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVI 1055
            TMG+++YA  + +++DPK + F  RVI++ +           P+ K L  VL  E  VVI
Sbjct: 142  TMGSRIYAKAILEMIDPKKLYFGNRVITKDES----------PRMKTLNLVLAEERGVVI 191

Query: 1056 IDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVRQ 1115
            +DD+  +WPH+K NLI + +Y YF    R+ GL   S  E   DE   DG LA+ L + +
Sbjct: 192  VDDTRDIWPHHKNNLIQIRKYKYF----RRSGLDSNSYSEKKTDEGENDGGLANVLKLLR 247

Query: 1116 QLH 1118
            ++H
Sbjct: 248  EVH 250


>gi|15239576|ref|NP_200232.1| haloacid dehalogenase-like hydrolase domain-containing protein
            [Arabidopsis thaliana]
 gi|9759494|dbj|BAB10744.1| unnamed protein product [Arabidopsis thaliana]
 gi|332009084|gb|AED96467.1| haloacid dehalogenase-like hydrolase domain-containing protein
            [Arabidopsis thaliana]
          Length = 306

 Score =  125 bits (315), Expect = 1e-25,   Method: Composition-based stats.
 Identities = 80/225 (35%), Positives = 118/225 (52%), Gaps = 13/225 (5%)

Query: 894  DDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEE 953
            D  Q + I    T+R+  Q   F+ +KL LVLDLDHTLL++     +      ++   EE
Sbjct: 62   DGLQLSDIAVTVTKRVTTQITCFNDKKLHLVLDLDHTLLHTVMISNLTKEETYLI---EE 118

Query: 954  QDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
            +D  +  R L          KLRP +  FL+ A+K+F M++YTMG++ YA  +  ++DP+
Sbjct: 119  EDSREDLRRLNGGYSSEFLIKLRPFVHEFLKEANKMFSMYVYTMGDRDYAMNVLNLIDPE 178

Query: 1014 GVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVV 1073
             V F  RVI+R +           P  K L+ VL  E  VVI+DD+  VWP +K NL+ +
Sbjct: 179  KVYFGDRVITRNES----------PYIKTLDLVLADECGVVIVDDTPHVWPDHKRNLLEI 228

Query: 1074 ERYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVRQQLH 1118
             +Y YF    R       S  E   DE   DG+LA+ L V +Q++
Sbjct: 229  TKYNYFSDKTRHDVKYTKSYAEEKRDESRNDGSLANVLKVIKQVY 273


>gi|297850432|ref|XP_002893097.1| hypothetical protein ARALYDRAFT_472260 [Arabidopsis lyrata subsp.
            lyrata]
 gi|297338939|gb|EFH69356.1| hypothetical protein ARALYDRAFT_472260 [Arabidopsis lyrata subsp.
            lyrata]
          Length = 281

 Score =  121 bits (303), Expect = 3e-24,   Method: Composition-based stats.
 Identities = 78/219 (35%), Positives = 114/219 (52%), Gaps = 32/219 (14%)

Query: 906  TRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQ------DREKP 959
            T+ L  Q    + RKL +VLDLDHTLL+S     +      +LR+ + +      DRE  
Sbjct: 62   TKSLTTQLACLNERKLHVVLDLDHTLLHSVMVSRLSEGEKYLLRESDLREDLWTLDRE-- 119

Query: 960  HRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAG 1019
                       M  KLRP +  FL  A++ F M++YTMGN+ YA  + K++DPK V F  
Sbjct: 120  -----------MLIKLRPFVHEFLNEANEFFSMYVYTMGNRDYAQAVLKLIDPKKVYFGD 168

Query: 1020 RVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            RVI+R + G           SK L+ VL  E  VVI+DD+  VWP ++ NL+ + +Y+YF
Sbjct: 169  RVITRDESG----------FSKTLDLVLADECGVVIVDDTRHVWPDHERNLLQITKYSYF 218

Query: 1080 PCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVRQQLH 1118
                ++      S  E   DE    G+LA+ L V +++H
Sbjct: 219  RDYNQED---SKSYAEEKRDESRSQGSLANVLKVLKKIH 254


>gi|297834870|ref|XP_002885317.1| NLI interacting factor family protein [Arabidopsis lyrata subsp.
            lyrata]
 gi|297331157|gb|EFH61576.1| NLI interacting factor family protein [Arabidopsis lyrata subsp.
            lyrata]
          Length = 592

 Score =  120 bits (302), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 77/221 (34%), Positives = 119/221 (53%), Gaps = 25/221 (11%)

Query: 906  TRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFR 965
            T+RL  +    + +KL LVLDLDHTLL+S +   +      ++ +     RE     L++
Sbjct: 74   TKRLTTKFSCLNMKKLHLVLDLDHTLLHSVRVQFLSEAEKYLIEEAGSTTRED----LWK 129

Query: 966  FPHMG--------MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLF 1017
                G          TKLRP +  FL+ A+KLF M++YT G + YA  + K++DPK + F
Sbjct: 130  MKVKGDPIPITIEYLTKLRPFLREFLKEANKLFTMYVYTKGTRRYAKAILKLIDPKKLYF 189

Query: 1018 AGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYT 1077
              RVI+R +           P +K L+ VL  E  VVI+DD+  +WP++K NL+V+ +Y 
Sbjct: 190  GHRVITRNES----------PHTKTLDLVLADERGVVIVDDTRNIWPNHKSNLVVIGKYK 239

Query: 1078 YFPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVRQQLH 1118
            YF   R +  +L P   E   DE   +G LA+ L + +++H
Sbjct: 240  YF---RFEGRVLKPHSEEKTTDESENNGGLANVLKLLKEVH 277



 Score =  105 bits (263), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 71/204 (34%), Positives = 109/204 (53%), Gaps = 22/204 (10%)

Query: 919  RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMG----MWTK 974
            +KL LVLDLDHTLL++     +      +L +     RE     L++   +G      TK
Sbjct: 389  KKLHLVLDLDHTLLHTVMVPSLSQAEKYLLEEAGSATRED----LWKIKAIGDPMEFLTK 444

Query: 975  LRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGD 1034
            LRP +  FL+ A+++F M++YT G++ YA ++ +++DPK + F  RVI++ +        
Sbjct: 445  LRPFVREFLKEANQMFTMYVYTKGSRGYAKQVLELIDPKKLYFEDRVITKNES------- 497

Query: 1035 ERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLL 1094
               P  K L+ VL  E  VVI+DD   VWP +K NL+ + +YTYF    R  G       
Sbjct: 498  ---PHMKTLDLVLAEERGVVIVDDMRTVWPDHKSNLVDISKYTYF----RLKGQESMPYS 550

Query: 1095 EIDHDERSEDGTLASSLGVRQQLH 1118
            E   DE   DG LA+ L + +++H
Sbjct: 551  EEMTDESESDGGLANVLKLLKEVH 574


>gi|297792855|ref|XP_002864312.1| NLI interacting factor family protein [Arabidopsis lyrata subsp.
            lyrata]
 gi|297310147|gb|EFH40571.1| NLI interacting factor family protein [Arabidopsis lyrata subsp.
            lyrata]
          Length = 305

 Score =  120 bits (301), Expect = 5e-24,   Method: Composition-based stats.
 Identities = 76/216 (35%), Positives = 117/216 (54%), Gaps = 19/216 (8%)

Query: 906  TRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFR 965
            T+R+  Q   F+ +KL LVLDLDHTLL++     +    +E     E   RE     L++
Sbjct: 76   TKRVTTQITCFNDKKLHLVLDLDHTLLHTVMVSNLS--KEETYLIGEADSRED----LWK 129

Query: 966  FP---HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVI 1022
            F          KLRP +  FL+ A+++F M++YTMG++ YA  + K++DP+ + F  RVI
Sbjct: 130  FNGGYSSEFLIKLRPYVHEFLKEANEMFSMYVYTMGDRDYANNVLKLIDPEKIYFGHRVI 189

Query: 1023 SRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCS 1082
            +R +           P  K L+ VL  E  VVI+DD+ +VWP +K NL+ + +Y YF   
Sbjct: 190  TRNES----------PYIKTLDLVLADECGVVIVDDTPQVWPDDKRNLLEITKYNYFSDK 239

Query: 1083 RRQFGLLGPSLLEIDHDERSEDGTLASSLGVRQQLH 1118
             R+      S  E   DE   DG+LA+ L V ++++
Sbjct: 240  TRRDVKYSKSYAEEKRDEGRNDGSLANVLKVIKEIY 275


>gi|242066826|ref|XP_002454702.1| hypothetical protein SORBIDRAFT_04g035880 [Sorghum bicolor]
 gi|241934533|gb|EES07678.1| hypothetical protein SORBIDRAFT_04g035880 [Sorghum bicolor]
          Length = 462

 Score =  119 bits (298), Expect = 9e-24,   Method: Compositional matrix adjust.
 Identities = 84/211 (39%), Positives = 121/211 (57%), Gaps = 22/211 (10%)

Query: 915  MFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP--HMGMW 972
            +F  RKL LVLDLD TLLNSA+  +   V +E      +   +K    +FR    ++GM 
Sbjct: 124  LFRERKLILVLDLDRTLLNSARL-DAFSVGEEWFGFTPDTG-DKVDMDIFRLDSDNLGML 181

Query: 973  TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFD 1032
            TKLRP +     R S +FEMHLYT+GN +YA     +LDP GV F GRV+SR D+     
Sbjct: 182  TKLRPFV-----RGS-MFEMHLYTLGNLVYAKAAIHLLDPNGVYFGGRVVSRDDESTQ-- 233

Query: 1033 GDERVPKSKDLEGVLGME--SAVVI--IDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGL 1088
                   +K L+ + G +  +AV++  +DD+   WP ++ NLI+  RY YF  + R+   
Sbjct: 234  -----GGTKSLDVIPGADPVAAVILDALDDTDVAWPEHQDNLILTNRYRYFASTCRKSRH 288

Query: 1089 LGPSLLEIDHDERSED-GTLASSLGVRQQLH 1118
              PSL E+  DE+ E  G+LA +LGV +++H
Sbjct: 289  DIPSLAELRRDEKGEHGGSLAVALGVLKRVH 319


>gi|15229069|ref|NP_188382.1| haloacid dehalogenase-like hydrolase domain-containing protein
            [Arabidopsis thaliana]
 gi|9294142|dbj|BAB02044.1| unnamed protein product [Arabidopsis thaliana]
 gi|332642446|gb|AEE75967.1| haloacid dehalogenase-like hydrolase domain-containing protein
            [Arabidopsis thaliana]
          Length = 296

 Score =  119 bits (297), Expect = 1e-23,   Method: Composition-based stats.
 Identities = 79/236 (33%), Positives = 122/236 (51%), Gaps = 29/236 (12%)

Query: 887  EHLFEGYDDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDE 946
            ++L +G     +AA     T+R   Q    + +KL LVLDLDHTLL+S +   +      
Sbjct: 55   DYLVQGLQLSHEAA---AFTKRFTTQFYCLNEKKLNLVLDLDHTLLHSIRVSLLSETEKC 111

Query: 947  ILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEM 1006
            ++ +     RE     L++       TKLRP +  FL+ A++LF M++YTMG ++YA  +
Sbjct: 112  LIEEACSTTRED----LWKLDS-DYLTKLRPFVHEFLKEANELFTMYVYTMGTRVYAESL 166

Query: 1007 AKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHN 1066
             K++DPK + F  RVI+R +           P  K L+ VL  E  VVI+DD+  VW H+
Sbjct: 167  LKLIDPKRIYFGDRVITRDES----------PYVKTLDLVLAEERGVVIVDDTSDVWTHH 216

Query: 1067 KLNLIVVERYTYFPCSRRQFGLLGP----SLLEIDHDERSEDGTLASSLGVRQQLH 1118
            K NL+ +  Y +F  +       GP    S  E   DE   +G LA+ L + +++H
Sbjct: 217  KSNLVEINEYHFFRVN-------GPEESNSYTEEKRDESKNNGGLANVLKLLKEVH 265


>gi|297834668|ref|XP_002885216.1| NLI interacting factor family protein [Arabidopsis lyrata subsp.
            lyrata]
 gi|297331056|gb|EFH61475.1| NLI interacting factor family protein [Arabidopsis lyrata subsp.
            lyrata]
          Length = 296

 Score =  117 bits (294), Expect = 3e-23,   Method: Composition-based stats.
 Identities = 78/236 (33%), Positives = 122/236 (51%), Gaps = 29/236 (12%)

Query: 887  EHLFEGYDDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDE 946
            ++L +G     +AA     T+R   +    + +KL LVLDLDHTLL+S +   +      
Sbjct: 55   DYLVQGLQLSHEAA---AFTKRFTTEFYCLNEKKLHLVLDLDHTLLHSIRVSILSETERY 111

Query: 947  ILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEM 1006
            ++ +     RE     L++   +   TKLRP +  FL+ A+++F M++YTMG ++YA  +
Sbjct: 112  LIEEACSTTRED----LWKL-DIDYLTKLRPFVHEFLKEANEMFTMYVYTMGTRVYAESL 166

Query: 1007 AKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHN 1066
             K++DPK + F  RVI+R +           P  K L+ VL  E  VVI+DD+  VW H+
Sbjct: 167  LKLIDPKRIYFGDRVITRDES----------PYVKTLDLVLADERGVVIVDDTRDVWTHH 216

Query: 1067 KLNLIVVERYTYFPCSRRQFGLLGP----SLLEIDHDERSEDGTLASSLGVRQQLH 1118
            K NL+ +  Y YF  +       GP    S  E   DE    G LA+ L + +++H
Sbjct: 217  KSNLVEINEYHYFRVN-------GPEESKSYTEEKRDESKNSGGLANVLKLLKEVH 265


>gi|297835808|ref|XP_002885786.1| hypothetical protein ARALYDRAFT_899317 [Arabidopsis lyrata subsp.
            lyrata]
 gi|297331626|gb|EFH62045.1| hypothetical protein ARALYDRAFT_899317 [Arabidopsis lyrata subsp.
            lyrata]
          Length = 285

 Score =  117 bits (293), Expect = 4e-23,   Method: Composition-based stats.
 Identities = 78/240 (32%), Positives = 121/240 (50%), Gaps = 19/240 (7%)

Query: 880  QSAWGDVEHLFEGYDDQQKA-AIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFH 938
            +S W   +++F G     +A A+ K RT          + +KL LVLDLDHTLL+  K  
Sbjct: 29   KSQWRAFDYIFNGLQLSHEAVALTKSRT----TNNSCLNEKKLHLVLDLDHTLLHMKKVP 84

Query: 939  EVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMG 998
             +      ++++     RE   +       +    KLRP +  FL+ A+++F M++YT G
Sbjct: 85   CLSRAEMYLIQEACSVTREDIWKIRLLGDPIDRLIKLRPFVRDFLKEANEMFTMYVYTKG 144

Query: 999  NKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDD 1058
             + YA  + +++DP  + F  RVI++ +           P  K L+ VL  E  VVI+DD
Sbjct: 145  TRKYAKAVLELIDPNRLYFGDRVITKDES----------PHQKTLDLVLAEERGVVIVDD 194

Query: 1059 SVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVRQQLH 1118
               +WPH+K NLI + +Y YF  S    G    S  E   DE  +DG LA+ L + +Q+H
Sbjct: 195  RRDIWPHHKSNLIEISKYKYFRVS----GQGSNSYSEKKTDESEKDGGLANVLKLLKQVH 250


>gi|297846748|ref|XP_002891255.1| NLI interacting factor family protein [Arabidopsis lyrata subsp.
            lyrata]
 gi|297337097|gb|EFH67514.1| NLI interacting factor family protein [Arabidopsis lyrata subsp.
            lyrata]
          Length = 210

 Score =  117 bits (292), Expect = 4e-23,   Method: Composition-based stats.
 Identities = 71/206 (34%), Positives = 110/206 (53%), Gaps = 29/206 (14%)

Query: 919  RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQ------DREKPHRHLFRFPHMGMW 972
            +KL LVLDLDHTL+++    ++      +L + + +      +++ P+  +         
Sbjct: 3    KKLHLVLDLDHTLIHTVLVSDLSEREKYLLEEADSRQDLWRCNKDSPYEFII-------- 54

Query: 973  TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFD 1032
             KLRP +  FL  A+KLF MH+YTMGN  YA ++ K++DP  V F  RVI+R        
Sbjct: 55   -KLRPFVHEFLLEANKLFTMHVYTMGNSCYAQDVLKLIDPDKVYFGNRVITR-------- 105

Query: 1033 GDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPS 1092
              E  P +K L+ ++     VVI+DD++ VWPH+K NL+ + +Y YF    R  G    S
Sbjct: 106  --EASPCNKTLDLLVADTRRVVIVDDTISVWPHHKRNLLQITKYIYF----RVDGTKWDS 159

Query: 1093 LLEIDHDERSEDGTLASSLGVRQQLH 1118
              E   DE  + G+LA+ L   + +H
Sbjct: 160  YAEEKKDESRKSGSLANVLKFLEDVH 185


>gi|367047187|ref|XP_003653973.1| hypothetical protein THITE_2116513 [Thielavia terrestris NRRL 8126]
 gi|347001236|gb|AEO67637.1| hypothetical protein THITE_2116513 [Thielavia terrestris NRRL 8126]
          Length = 909

 Score =  116 bits (291), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 74/205 (36%), Positives = 112/205 (54%), Gaps = 25/205 (12%)

Query: 904  ERTRRLEEQKKMFSARKLCLVLDLDHTLLNSA--------KFHEVDPVHDEILRKKEEQD 955
            +RT R E Q+++  +RKL LV+DLD T++ +         +    +P H+ +   K  Q 
Sbjct: 147  QRTER-ELQRRLLQSRKLSLVVDLDQTIIQACIDPTVGEWQRDPTNPNHESVKEVKSFQL 205

Query: 956  REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
             + P     R  +   + K+RPG+  FL+R S+L+EMH+YTMG + YA  +A+V+DP+  
Sbjct: 206  DDGPSDLARRCSY---YIKMRPGLEEFLKRISELYEMHVYTMGTRAYAQNVARVVDPQRK 262

Query: 1016 LFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMES-AVVIIDDSVRVWPHNKLNLIVVE 1074
            LF  RVISR ++G+ F        +K L  +  + +  VVIIDD   VWP N+ NLI V 
Sbjct: 263  LFGNRVISRDENGNMF--------AKSLGRLFPVSTNMVVIIDDRSDVWPRNRPNLIKVS 314

Query: 1075 RYTYFPCSRRQFGLLGPSLLEIDHD 1099
             Y +F    +  G +  S L   HD
Sbjct: 315  PYEFF----KGIGDINSSFLPKRHD 335


>gi|15217916|ref|NP_173457.1| haloacid dehalogenase-like hydrolase [Arabidopsis thaliana]
 gi|9558594|gb|AAF88157.1|AC026234_8 Contains similarity to a FCP1 serine phosphatase from Xenopus laevis
            gi|6689545 [Arabidopsis thaliana]
 gi|332191840|gb|AEE29961.1| haloacid dehalogenase-like hydrolase [Arabidopsis thaliana]
          Length = 342

 Score =  116 bits (290), Expect = 9e-23,   Method: Composition-based stats.
 Identities = 79/221 (35%), Positives = 113/221 (51%), Gaps = 35/221 (15%)

Query: 906  TRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQ------DREKP 959
            T+ L  Q    + RKL LVLDLDHTLL+S     +      +L + + +      DRE  
Sbjct: 62   TKSLTTQLACLNERKLHLVLDLDHTLLHSIMISRLSEGEKYLLGESDFREDLWTLDRE-- 119

Query: 960  HRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAG 1019
                       M  KLRP +  FL+ A+++F M++YTMGN+ YA  + K +DPK V F  
Sbjct: 120  -----------MLIKLRPFVHEFLKEANEIFSMYVYTMGNRDYAQAVLKWIDPKKVYFGD 168

Query: 1020 RVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            RVI+R + G           SK L+ VL  E  VVI+DD+  VWP ++ NL+ + +Y+YF
Sbjct: 169  RVITRDESG----------FSKTLDLVLADECGVVIVDDTRHVWPDHERNLLQITKYSYF 218

Query: 1080 PCSRRQFG--LLGPSLLEIDHDERSEDGTLASSLGVRQQLH 1118
                R +       S  E   DE    G+LA+ L V + +H
Sbjct: 219  ----RDYSHDKESKSYAEEKRDESRNQGSLANVLKVLKDVH 255


>gi|171680434|ref|XP_001905162.1| hypothetical protein [Podospora anserina S mat+]
 gi|170939844|emb|CAP65069.1| unnamed protein product [Podospora anserina S mat+]
          Length = 835

 Score =  115 bits (288), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 66/179 (36%), Positives = 104/179 (58%), Gaps = 20/179 (11%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEV--------DPVHDEILRKKEEQDREKPHR 961
            E QK++  +RKL LV+DLD T++ +     V        +P +D +   K  Q  + PH 
Sbjct: 154  ELQKRLLESRKLSLVVDLDQTVIQACIDPTVGEWMKDPTNPNYDSVKNVKTFQLDDGPHA 213

Query: 962  HLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
             + +  +   + K+RPG+  FL+R S ++E+H+YTMG + YA  +A+V+DP+  LF  RV
Sbjct: 214  VVRKCWY---YIKMRPGLEGFLKRISTMYELHVYTMGTRAYAQNVARVIDPEKKLFGNRV 270

Query: 1022 ISRGDDGDPFDGDERVPKSKDLEGVLGMES-AVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            ISR ++G+ +        SK L+ +  + +  VVIIDD   VWPHN+ NL+ V  Y +F
Sbjct: 271  ISRDENGNMY--------SKSLQRLFPVSTNMVVIIDDRSDVWPHNRPNLVKVTPYEFF 321


>gi|303280109|ref|XP_003059347.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226459183|gb|EEH56479.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 136

 Score =  115 bits (287), Expect = 2e-22,   Method: Composition-based stats.
 Identities = 60/122 (49%), Positives = 80/122 (65%), Gaps = 8/122 (6%)

Query: 974  KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDG 1033
            KLRP    FL  AS + ++++YTMG+K YA EMAK+LDP G LF GRVI+        + 
Sbjct: 1    KLRPRAREFLRAASAMCQLYVYTMGDKNYAREMAKILDPTGELFNGRVIA--------NS 52

Query: 1034 DERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSL 1093
            D    ++KDL+ VLG E +V+I+DD+ RVWPHN  NLI ++RY +FP S   F   G S+
Sbjct: 53   DSTCSRTKDLDIVLGAEGSVLIVDDTDRVWPHNLANLIRIDRYHFFPQSAAGFRQPGRSV 112

Query: 1094 LE 1095
            LE
Sbjct: 113  LE 114


>gi|255081919|ref|XP_002508178.1| predicted protein [Micromonas sp. RCC299]
 gi|226523454|gb|ACO69436.1| predicted protein [Micromonas sp. RCC299]
          Length = 318

 Score =  114 bits (285), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 66/161 (40%), Positives = 99/161 (61%), Gaps = 5/161 (3%)

Query: 961  RHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGR 1020
            R L     + +WTKLRPG+  FL + + +FE+H+ TMG + YA EM +++DP      G 
Sbjct: 25   RTLHFVERLQIWTKLRPGVKKFLRQVASMFEVHVITMGTQSYADEMRQLIDPGRQHIKGS 84

Query: 1021 VISRGDDGDPFDGDERVPKSKDLEGVL-GMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            VI  G   D F G+ +    K L+G L G++S  V++DD V VWP ++ NLI ++RY YF
Sbjct: 85   VIGLG-QMDEF-GELQPADKKRLDGELSGLDSIAVVLDDHVGVWPDHEENLIEIDRYLYF 142

Query: 1080 PCSRRQFGLL--GPSLLEIDHDERSEDGTLASSLGVRQQLH 1118
            P + +QFG+   G SLLE   DE ++  TLA++  V +++H
Sbjct: 143  PSALKQFGVWRNGASLLEKKVDEIADRSTLAAAFEVLRRVH 183


>gi|15218405|ref|NP_175026.1| NLI interacting factor (NIF) family protein [Arabidopsis thaliana]
 gi|91805923|gb|ABE65690.1| NLI interacting factor family protein [Arabidopsis thaliana]
 gi|332193852|gb|AEE31973.1| NLI interacting factor (NIF) family protein [Arabidopsis thaliana]
          Length = 255

 Score =  113 bits (282), Expect = 7e-22,   Method: Composition-based stats.
 Identities = 75/209 (35%), Positives = 111/209 (53%), Gaps = 23/209 (11%)

Query: 914  KMFSA---RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKE-EQDREKPHRHLFRFPHM 969
            ++FS    +KL LVLDLDHTLL+S    ++      +L + +  QD  + +   + F   
Sbjct: 43   QLFSVTKKKKLHLVLDLDHTLLHSVLVSDLSKREKYLLEETDSRQDLWRRNVDGYEF--- 99

Query: 970  GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
                KLRP +  FL  A+KLF MH+YTMG+  YA ++ K++DP  V F  RVI+R     
Sbjct: 100  --IIKLRPFLHEFLLEANKLFTMHVYTMGSSSYAKQVLKLIDPDKVYFGKRVITR----- 152

Query: 1030 PFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLL 1089
                 E  P +K L+ +   +  VVI+DD+V VWP +K NL+ + +Y YF       G  
Sbjct: 153  -----EASPFNKSLDLLAADKRRVVIVDDTVHVWPFHKRNLLQITKYIYFKVD----GTK 203

Query: 1090 GPSLLEIDHDERSEDGTLASSLGVRQQLH 1118
              S  E   DE   +G+LA+ L   + +H
Sbjct: 204  WDSYAEAKKDESQSNGSLANVLKFLEVVH 232


>gi|116830952|gb|ABK28432.1| unknown [Arabidopsis thaliana]
          Length = 256

 Score =  113 bits (282), Expect = 7e-22,   Method: Composition-based stats.
 Identities = 75/209 (35%), Positives = 111/209 (53%), Gaps = 23/209 (11%)

Query: 914  KMFSA---RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKE-EQDREKPHRHLFRFPHM 969
            ++FS    +KL LVLDLDHTLL+S    ++      +L + +  QD  + +   + F   
Sbjct: 43   QLFSVTKKKKLHLVLDLDHTLLHSVLVSDLSKREKYLLEETDSRQDLWRRNVDGYEF--- 99

Query: 970  GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
                KLRP +  FL  A+KLF MH+YTMG+  YA ++ K++DP  V F  RVI+R     
Sbjct: 100  --IIKLRPFLHEFLLEANKLFTMHVYTMGSSSYAKQVLKLIDPDKVYFGKRVITR----- 152

Query: 1030 PFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLL 1089
                 E  P +K L+ +   +  VVI+DD+V VWP +K NL+ + +Y YF       G  
Sbjct: 153  -----EASPFNKSLDLLAADKRRVVIVDDTVHVWPFHKRNLLQITKYIYFKVD----GTK 203

Query: 1090 GPSLLEIDHDERSEDGTLASSLGVRQQLH 1118
              S  E   DE   +G+LA+ L   + +H
Sbjct: 204  WDSYAEAKKDESQSNGSLANVLKFLEVVH 232


>gi|340931931|gb|EGS19464.1| hypothetical protein CTHT_0049250 [Chaetomium thermophilum var.
            thermophilum DSM 1495]
          Length = 871

 Score =  112 bits (281), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 69/196 (35%), Positives = 109/196 (55%), Gaps = 22/196 (11%)

Query: 895  DQQKAAIQKERTRRLEE--QKKMFSARKLCLVLDLDHTLLNSA--------KFHEVDPVH 944
            DQ    +  E  +R+E+  Q+++  +RKL LV+DLD T++ +         +    +P H
Sbjct: 135  DQTNLRVGAEHAQRVEQELQRRLLQSRKLSLVVDLDQTIIQACIDPTVGEWQRDPTNPNH 194

Query: 945  DEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYAT 1004
            D +   K  Q  + P   L R      + K+RPG+  FL+R S+++E+H+YTMG + YA 
Sbjct: 195  DAVKDVKSFQLDDGPS-ALAR--KCWYYIKMRPGLEGFLKRISEMYELHVYTMGTRAYAQ 251

Query: 1005 EMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMES-AVVIIDDSVRVW 1063
             +A+V+DP   LF  RVISR ++G+ +        +K L+ +  + +  VVIIDD   VW
Sbjct: 252  NVARVVDPDRKLFGNRVISRDENGNIY--------TKSLQRLFPVSTNMVVIIDDRSDVW 303

Query: 1064 PHNKLNLIVVERYTYF 1079
            P N+ NLI V  Y +F
Sbjct: 304  PRNRPNLIKVSPYEFF 319


>gi|255540901|ref|XP_002511515.1| RNA polymerase II ctd phosphatase, putative [Ricinus communis]
 gi|223550630|gb|EEF52117.1| RNA polymerase II ctd phosphatase, putative [Ricinus communis]
          Length = 405

 Score =  112 bits (281), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 69/172 (40%), Positives = 95/172 (55%), Gaps = 12/172 (6%)

Query: 908  RLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP 967
            R  E   + S +KL LVLDLD TLL+S     VD   +E   K +    +   + + R  
Sbjct: 86   RDAETDFVLSKKKLFLVLDLDQTLLHST----VDLTPEENYLKNQMDSLQDIFKLITREG 141

Query: 968  HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD 1027
                + KLRP +  FL+ AS +F+M++YT  NK YA +M  +LDP  + F  R+I+R   
Sbjct: 142  FSPSYAKLRPFVRNFLQEASTMFKMYVYTNANKSYARKMVNLLDPDNIYFKSRLITR--- 198

Query: 1028 GDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
                  D  V   K+L+ V+G E AVVI+DD   VWP +K NLI V+RY YF
Sbjct: 199  -----EDSTVSCQKNLDVVMGQERAVVILDDRTDVWPMHKDNLIQVQRYKYF 245


>gi|15218404|ref|NP_175025.1| NLI interacting factor (NIF) family protein [Arabidopsis thaliana]
 gi|117958727|gb|ABK59679.1| At1g43600 [Arabidopsis thaliana]
 gi|332193851|gb|AEE31972.1| NLI interacting factor (NIF) family protein [Arabidopsis thaliana]
          Length = 221

 Score =  111 bits (277), Expect = 2e-21,   Method: Composition-based stats.
 Identities = 72/199 (36%), Positives = 105/199 (52%), Gaps = 20/199 (10%)

Query: 921  LCLVLDLDHTLLNSAKFHEVDPVHDEILRKKE-EQDREKPHRHLFRFPHMGMWTKLRPGI 979
            L LVLDLDHTLL+S    ++      +L + +  QD  + +   + F       KLRP +
Sbjct: 19   LHLVLDLDHTLLHSVLVSDLSKREKYLLEETDSRQDLWRRNVDGYEF-----IIKLRPFL 73

Query: 980  WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPK 1039
              FL  A+KLF MH+YTMG+  YA ++ K++DP  V F  RVI+R          E  P 
Sbjct: 74   HEFLLEANKLFTMHVYTMGSSSYAKQVLKLIDPDKVYFGKRVITR----------EASPF 123

Query: 1040 SKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHD 1099
            +K L+ +   +  VVI+DD+V VWP +K NL+ + +Y YF       G    S  E   D
Sbjct: 124  NKSLDLLAADKRRVVIVDDTVHVWPFHKRNLLQITKYVYFKVD----GTKWDSYAEAKKD 179

Query: 1100 ERSEDGTLASSLGVRQQLH 1118
            E   +G+LA+ L   + +H
Sbjct: 180  ESQSNGSLANVLKFLEDVH 198


>gi|302838991|ref|XP_002951053.1| hypothetical protein VOLCADRAFT_91454 [Volvox carteri f. nagariensis]
 gi|300263748|gb|EFJ47947.1| hypothetical protein VOLCADRAFT_91454 [Volvox carteri f. nagariensis]
          Length = 699

 Score =  110 bits (276), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 71/197 (36%), Positives = 102/197 (51%), Gaps = 38/197 (19%)

Query: 926  DLDHTLLNSAKFHEVDPVHD----EILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWT 981
            DLDHTLLNS    EV P       E+LR++EE +   P R L R     +WTKLRPG++ 
Sbjct: 377  DLDHTLLNSVHTSEVGPDTATQLAEVLRREEEANL-GPRRLLHRLAENKLWTKLRPGVFE 435

Query: 982  FLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSK 1041
            FLE     +EMH+YTMG+K YA E+ K+LDP G LF+  VI++               +K
Sbjct: 436  FLEGLRDDYEMHIYTMGDKTYAAEVRKLLDPTGKLFSS-VIAK--------DHSTTATAK 486

Query: 1042 DLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDER 1101
            DL+ +L  +   +++DD+  VWP ++ NL+                         D DE 
Sbjct: 487  DLDVLLSADELALVLDDTEAVWPGHRRNLLQ------------------------DSDES 522

Query: 1102 SEDGTLASSLGVRQQLH 1118
            + DG LA+ + V + +H
Sbjct: 523  ATDGALAAHMRVLRAVH 539


>gi|380472901|emb|CCF46552.1| FCP1-like phosphatase, partial [Colletotrichum higginsianum]
          Length = 740

 Score =  110 bits (275), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 68/186 (36%), Positives = 101/186 (54%), Gaps = 32/186 (17%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILR-------------KKEEQDR 956
            E QK++   RKL LV+DLD T++++     ++P   E +              KK + + 
Sbjct: 152  ETQKRLLRQRKLSLVVDLDQTIIHAC----IEPTVGEWMEDPSNPNYEAVKDVKKFQLND 207

Query: 957  EKPHRHLFRFPHMGMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKG 1014
            E P   +      G W   K+RPG+  FLER ++L+E+H+YTMG + YA  +AK++DP+ 
Sbjct: 208  EGPRGMVTS----GCWYYIKMRPGLAEFLERVAELYELHVYTMGTRAYALNIAKIVDPQQ 263

Query: 1015 VLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMES-AVVIIDDSVRVWPHNKLNLIVV 1073
             LF  RVISR ++G           SK L+ +  + +  VVIIDD   VWP N+ NLI V
Sbjct: 264  KLFGNRVISRDENGSMI--------SKSLQRLFPVNTNMVVIIDDRADVWPSNRPNLIKV 315

Query: 1074 ERYTYF 1079
              Y +F
Sbjct: 316  VPYDFF 321


>gi|320591286|gb|EFX03725.1| RNA polymerase 2 ctd phosphatase [Grosmannia clavigera kw1407]
          Length = 923

 Score =  110 bits (274), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 66/185 (35%), Positives = 103/185 (55%), Gaps = 32/185 (17%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPH----RHLFR 965
            E Q+++ S RKL LV+DLD T++++     +DP   E      +QD   P+    + + R
Sbjct: 159  ELQQRLLSQRKLSLVVDLDQTIIHAC----IDPTIGEW-----QQDPSNPNYEALKDVRR 209

Query: 966  FP--------HMGMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
            F           G W   K+RP +  FLE+ S ++E+H+YTMG + YAT +A+++DP   
Sbjct: 210  FQLEEGFQGLARGCWYYIKMRPHLTEFLEKISTMYELHVYTMGTRTYATNIAQIVDPNQK 269

Query: 1016 LFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAV-VIIDDSVRVWPHNKLNLIVVE 1074
            LF  RVISR ++G+          +K L+ +  + + + VIIDD   VWP+N+ NLI V 
Sbjct: 270  LFGNRVISRDENGNII--------AKSLQRLFPVSTNMAVIIDDRADVWPYNRHNLIKVN 321

Query: 1075 RYTYF 1079
             Y +F
Sbjct: 322  PYDFF 326


>gi|367032510|ref|XP_003665538.1| hypothetical protein MYCTH_2309412 [Myceliophthora thermophila ATCC
            42464]
 gi|347012809|gb|AEO60293.1| hypothetical protein MYCTH_2309412 [Myceliophthora thermophila ATCC
            42464]
          Length = 913

 Score =  110 bits (274), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 67/196 (34%), Positives = 108/196 (55%), Gaps = 33/196 (16%)

Query: 899  AAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREK 958
            +A+Q +RT + E Q+++  +RKL LV+DLD T++ +     +DP   E      ++D   
Sbjct: 142  SAVQAQRTEQ-ELQRRLLKSRKLSLVVDLDQTIIQAC----IDPTVGEW-----QKDPTN 191

Query: 959  PHRHL------FRFP--------HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYAT 1004
            P+  L      F+              + K+RPG+  FL+R ++++E+H+YTMG + YA 
Sbjct: 192  PNHELAKEVKSFQLDDGPTDLARRCWYYIKMRPGLQDFLKRIAEMYELHVYTMGTRAYAQ 251

Query: 1005 EMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVV-IIDDSVRVW 1063
             +A+V+DP   LF  RVISR ++G+ F        +K L  +  + + +V IIDD   VW
Sbjct: 252  NVARVVDPDKKLFGNRVISRDENGNIF--------AKSLHRLFPVSTHMVAIIDDRSDVW 303

Query: 1064 PHNKLNLIVVERYTYF 1079
            P N+ NLI V  Y +F
Sbjct: 304  PRNRPNLIKVSPYEFF 319


>gi|334185470|ref|NP_188594.3| haloacid dehalogenase-like hydrolase domain-containing protein
            [Arabidopsis thaliana]
 gi|332642744|gb|AEE76265.1| haloacid dehalogenase-like hydrolase domain-containing protein
            [Arabidopsis thaliana]
          Length = 302

 Score =  109 bits (273), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 74/220 (33%), Positives = 116/220 (52%), Gaps = 24/220 (10%)

Query: 906  TRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFR 965
            T+RL  +    + +KL LVLDLD TL++S +   +      ++ +     RE   +   R
Sbjct: 74   TKRLITKFSCLNMKKLHLVLDLDLTLIHSVRVPCLSEAEKYLIEEAGSTTREDLWKMKVR 133

Query: 966  -------FPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA 1018
                     H+    KLRP +  FL+ A+++F M++YT G + YA  + K++DPK + F 
Sbjct: 134  GDPISITIEHL---VKLRPFLCEFLKEANEMFTMYVYTKGTRPYAEAILKLIDPKKLYFG 190

Query: 1019 GRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTY 1078
             RVI+R +           P +K L+ VL  E  VVI+DD+ + WP+NK NL+++ RY Y
Sbjct: 191  HRVITRNES----------PHTKTLDMVLADERGVVIVDDTRKAWPNNKSNLVLIGRYNY 240

Query: 1079 FPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVRQQLH 1118
            F   R Q  +L P   E   DE   +G LA+ L + + +H
Sbjct: 241  F---RSQSRVLKPHSEE-KTDESENNGGLANVLKLLKGIH 276


>gi|46126951|ref|XP_388029.1| hypothetical protein FG07853.1 [Gibberella zeae PH-1]
          Length = 765

 Score =  109 bits (273), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 67/179 (37%), Positives = 97/179 (54%), Gaps = 19/179 (10%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSA--------KFHEVDPVHDEILRKKEEQDREKPHR 961
            E QK++   RKL LV+DLD T++++         +    +P HD +   K  Q  +   R
Sbjct: 148  ENQKRLLRQRKLSLVVDLDQTIIHACIEPTIGEWQRDPSNPNHDAVKDVKSFQLNDDGPR 207

Query: 962  HLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
             +        + KLRPG+  FLE  SK++E+H+YTMG + YA  +AK++DP   LF  RV
Sbjct: 208  GVT--SGCTYYIKLRPGLMEFLEEVSKMYELHVYTMGTRAYALNIAKIVDPDKKLFGNRV 265

Query: 1022 ISRGDDGDPFDGDERVPKSKDLEGVLGMES-AVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            ISR ++G           SK L+ +  + +  VVIIDD   VWP N+ NLI V  Y +F
Sbjct: 266  ISRDENGS--------ITSKSLQRLFPVSTDMVVIIDDRADVWPMNRPNLIKVVPYDFF 316


>gi|408390401|gb|EKJ69801.1| hypothetical protein FPSE_10001 [Fusarium pseudograminearum CS3096]
          Length = 765

 Score =  109 bits (273), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 67/179 (37%), Positives = 97/179 (54%), Gaps = 19/179 (10%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSA--------KFHEVDPVHDEILRKKEEQDREKPHR 961
            E QK++   RKL LV+DLD T++++         +    +P HD +   K  Q  +   R
Sbjct: 148  ENQKRLLRQRKLSLVVDLDQTIIHACIEPTIGEWQRDPSNPNHDAVKDVKSFQLNDDGPR 207

Query: 962  HLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
             +        + KLRPG+  FLE  SK++E+H+YTMG + YA  +AK++DP   LF  RV
Sbjct: 208  GVT--SGCTYYIKLRPGLMEFLEEVSKMYELHVYTMGTRAYALNIAKIVDPDKKLFGNRV 265

Query: 1022 ISRGDDGDPFDGDERVPKSKDLEGVLGMES-AVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            ISR ++G           SK L+ +  + +  VVIIDD   VWP N+ NLI V  Y +F
Sbjct: 266  ISRDENGS--------ITSKSLQRLFPVSTDMVVIIDDRADVWPMNRPNLIKVVPYDFF 316


>gi|225194907|gb|ACN81954.1| C-terminal domain phosphatase-like 5 [Arabidopsis thaliana]
          Length = 601

 Score =  109 bits (272), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 74/220 (33%), Positives = 116/220 (52%), Gaps = 24/220 (10%)

Query: 906  TRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFR 965
            T+RL  +    + +KL LVLDLD TL++S +   +      ++ +     RE   +   R
Sbjct: 74   TKRLITKFSCLNMKKLHLVLDLDLTLIHSVRVPCLSEAEKYLIEEAGSTTREDLWKMKVR 133

Query: 966  -------FPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA 1018
                     H+    KLRP +  FL+ A+++F M++YT G + YA  + K++DPK + F 
Sbjct: 134  GDPISITIEHL---VKLRPFLCEFLKEANEMFTMYVYTKGTRPYAEAILKLIDPKKLYFG 190

Query: 1019 GRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTY 1078
             RVI+R +           P +K L+ VL  E  VVI+DD+ + WP+NK NL+++ RY Y
Sbjct: 191  HRVITRNES----------PHTKTLDMVLADERGVVIVDDTRKAWPNNKSNLVLIGRYNY 240

Query: 1079 FPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVRQQLH 1118
            F   R Q  +L P   E   DE   +G LA+ L + + +H
Sbjct: 241  F---RSQSRVLKPHSEE-KTDESENNGGLANVLKLLKGIH 276



 Score =  105 bits (262), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 74/237 (31%), Positives = 125/237 (52%), Gaps = 27/237 (11%)

Query: 887  EHLFEGYDDQQKA-AIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHD 945
            +++F+G     +A A+ K  T +L       + +KL LVLDLDHTLL++     +     
Sbjct: 355  DYIFDGLQLSHEAVALTKCFTTKLS----CLNEKKLHLVLDLDHTLLHTVMVPSLSQAEK 410

Query: 946  EILRKKEEQDREKPHRHLFRFPHMG----MWTKLRPGIWTFLERASKLFEMHLYTMGNKL 1001
             ++ +     R+     L++   +G      TKLRP +  FL+ A++ F M++YT G+++
Sbjct: 411  YLIEEAGSATRDD----LWKIKAVGDPMEFLTKLRPFLRDFLKEANEFFTMYVYTKGSRV 466

Query: 1002 YATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVR 1061
            YA ++ +++DPK + F  RVI++ +           P  K L+ VL  E  VVI+DD+  
Sbjct: 467  YAKQVLELIDPKKLYFGDRVITKTES----------PHMKTLDFVLAEERGVVIVDDTRN 516

Query: 1062 VWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVRQQLH 1118
            VWP +K NL+ + +Y+YF    R  G       E   DE   +G LA+ L + +++H
Sbjct: 517  VWPDHKSNLVDISKYSYF----RLKGQDSMPYSEEKTDESESEGGLANVLKLLKEVH 569


>gi|310791724|gb|EFQ27251.1| FCP1-like phosphatase [Glomerella graminicola M1.001]
          Length = 860

 Score =  109 bits (272), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 67/186 (36%), Positives = 101/186 (54%), Gaps = 32/186 (17%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILR-------------KKEEQDR 956
            E QK++   RKL LV+DLD T++++     ++P   E +              KK + + 
Sbjct: 152  ETQKRLLRQRKLSLVVDLDQTIIHAC----IEPTVGEWMEDPSNPNYQAVKDVKKFQLND 207

Query: 957  EKPHRHLFRFPHMGMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKG 1014
            E P   +      G W   K+RPG+  FLE+ ++L+E+H+YTMG + YA  +AK++DP  
Sbjct: 208  EGPRGMVTS----GCWYYIKMRPGLAEFLEKVAELYELHVYTMGTRAYALNIAKIVDPHQ 263

Query: 1015 VLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMES-AVVIIDDSVRVWPHNKLNLIVV 1073
             LF  RVISR ++G           SK L+ +  + +  VVIIDD   VWP+N+ NLI V
Sbjct: 264  KLFGNRVISRDENGSMI--------SKSLQRLFPVNTNMVVIIDDRADVWPNNRPNLIKV 315

Query: 1074 ERYTYF 1079
              Y +F
Sbjct: 316  VPYDFF 321


>gi|346326901|gb|EGX96497.1| RNA Polymerase II CTD phosphatase Fcp1, putative [Cordyceps militaris
            CM01]
          Length = 780

 Score =  109 bits (272), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 70/196 (35%), Positives = 103/196 (52%), Gaps = 21/196 (10%)

Query: 895  DQQKAAIQKERTRRLEE--QKKMFSARKLCLVLDLDHTLLNSAKFHEV--------DPVH 944
            DQ    + +   +R+E   QK++   RKL LV+DLD T++++     V        +P H
Sbjct: 131  DQTGLLVSENVAQRVEHDTQKRLLRQRKLSLVVDLDQTIIHACIEPTVGEWQRDPSNPNH 190

Query: 945  DEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYAT 1004
              +   +  Q ++   R L        + KLRPG+  FLE  SK++E+H+YTMG + YA 
Sbjct: 191  SAVKDVRSFQLKDDGPRGLA--SGCTYYIKLRPGLRDFLEEVSKMYELHVYTMGTRAYAL 248

Query: 1005 EMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMES-AVVIIDDSVRVW 1063
             +AK++DP   LF  RVISR ++G           +K L  +  + +  VVIIDD   VW
Sbjct: 249  NIAKIVDPDRKLFGNRVISRDENGS--------ITAKSLARLFPVSTDMVVIIDDRADVW 300

Query: 1064 PHNKLNLIVVERYTYF 1079
            P NK NLI V  Y +F
Sbjct: 301  PMNKANLIKVAAYDFF 316


>gi|402080254|gb|EJT75399.1| RNA polymerase II subunit A domain phosphatase [Gaeumannomyces
            graminis var. tritici R3-111a-1]
          Length = 850

 Score =  108 bits (271), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 69/194 (35%), Positives = 104/194 (53%), Gaps = 27/194 (13%)

Query: 897  QKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSA--------KFHEVDPVHDEIL 948
            +KAA + E     E QK++   RKL LV+DLD T++++         +    +P H+ + 
Sbjct: 150  KKAATKTE----FELQKRLLDQRKLILVVDLDQTIIHACIEPTIGDWQRDPTNPNHEAVK 205

Query: 949  RKKEEQDREKPHRHLFRFPHMGMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEM 1006
              K  Q  +   R L      G W   K+RPG+  FLE+ + ++E+H+YTMG + YA  +
Sbjct: 206  DVKSFQLNDDGPRGLAS----GCWYYIKMRPGLVDFLEKIATMYELHVYTMGTRAYAMNI 261

Query: 1007 AKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMES-AVVIIDDSVRVWPH 1065
            AK++DP   LF  RVISR ++G           +K L+ +  + +  VVIIDD   VWP 
Sbjct: 262  AKIVDPDQKLFGNRVISRDENGS--------MTAKSLQRLFPVSTRMVVIIDDRADVWPR 313

Query: 1066 NKLNLIVVERYTYF 1079
            N+ NLI V  Y +F
Sbjct: 314  NRPNLIKVVPYDFF 327


>gi|9294425|dbj|BAB02545.1| unnamed protein product [Arabidopsis thaliana]
          Length = 314

 Score =  108 bits (271), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 74/220 (33%), Positives = 116/220 (52%), Gaps = 24/220 (10%)

Query: 906  TRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFR 965
            T+RL  +    + +KL LVLDLD TL++S +   +      ++ +     RE   +   R
Sbjct: 74   TKRLITKFSCLNMKKLHLVLDLDLTLIHSVRVPCLSEAEKYLIEEAGSTTREDLWKMKVR 133

Query: 966  -------FPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA 1018
                     H+    KLRP +  FL+ A+++F M++YT G + YA  + K++DPK + F 
Sbjct: 134  GDPISITIEHL---VKLRPFLCEFLKEANEMFTMYVYTKGTRPYAEAILKLIDPKKLYFG 190

Query: 1019 GRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTY 1078
             RVI+R +           P +K L+ VL  E  VVI+DD+ + WP+NK NL+++ RY Y
Sbjct: 191  HRVITRNES----------PHTKTLDMVLADERGVVIVDDTRKAWPNNKSNLVLIGRYNY 240

Query: 1079 FPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVRQQLH 1118
            F   R Q  +L P   E   DE   +G LA+ L + + +H
Sbjct: 241  F---RSQSRVLKPHSEE-KTDESENNGGLANVLKLLKGIH 276


>gi|302404507|ref|XP_003000091.1| RNA polymerase II subunit A C-terminal domain phosphatase
            [Verticillium albo-atrum VaMs.102]
 gi|261361273|gb|EEY23701.1| RNA polymerase II subunit A C-terminal domain phosphatase
            [Verticillium albo-atrum VaMs.102]
          Length = 755

 Score =  108 bits (271), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 69/198 (34%), Positives = 108/198 (54%), Gaps = 25/198 (12%)

Query: 895  DQQKAAIQKERTRRLEE--QKKMFSARKLCLVLDLDHTLLNSAKFHEV--------DPVH 944
            DQ    +  +   R E   Q+++   RKL LV+DLD T++++     V        +P +
Sbjct: 135  DQTGLMVSNDMAARAEHDAQRRLLRQRKLSLVVDLDQTIIHACIEPTVGEWMNDPENPNY 194

Query: 945  DEILRKKEEQDREKPHRHLFRFPHMGMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLY 1002
            D +   ++ Q  ++  R + +    G W   K+RPG+  FLER ++L+E+H+YTMG + Y
Sbjct: 195  DAVKDVQKFQLNDEGPRGVTQ----GCWYYIKMRPGLREFLERVAELYELHVYTMGTRAY 250

Query: 1003 ATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMES-AVVIIDDSVR 1061
            A  +AK++DP+  LF  RVISR ++G           SK L+ +  + +  VVIIDD   
Sbjct: 251  ALNIAKIVDPQQKLFGNRVISRDENGS--------ITSKSLQRLFPVSTNMVVIIDDRAD 302

Query: 1062 VWPHNKLNLIVVERYTYF 1079
            VWP N+ NLI V  Y +F
Sbjct: 303  VWPRNRPNLIKVVPYDFF 320


>gi|346975758|gb|EGY19210.1| RNA polymerase II subunit A C-terminal domain phosphatase
            [Verticillium dahliae VdLs.17]
          Length = 818

 Score =  107 bits (268), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 70/198 (35%), Positives = 106/198 (53%), Gaps = 25/198 (12%)

Query: 895  DQQKAAIQKERTRRLEE--QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKE 952
            DQ    +  +   R E   Q+++   RKL LV+DLD T++++     ++P   E +   E
Sbjct: 135  DQTGLMVSNDMAARAEHDAQRRLLRQRKLSLVVDLDQTIIHAC----IEPTVGEWMNDPE 190

Query: 953  E------QDREKPHRHLF--RFPHMGMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLY 1002
                   +D EK   +    R    G W   K+RPG+  FLE+ ++L+E+H+YTMG + Y
Sbjct: 191  NPNYDAVKDVEKFQLNDEGPRGVTQGCWYYIKMRPGLREFLEKVAELYELHVYTMGTRAY 250

Query: 1003 ATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMES-AVVIIDDSVR 1061
            A  +AK++DP+  LF  RVISR ++G           SK L+ +  + +  VVIIDD   
Sbjct: 251  ALNIAKIVDPQQKLFGNRVISRDENGS--------ITSKSLQRLFPVSTNMVVIIDDRAD 302

Query: 1062 VWPHNKLNLIVVERYTYF 1079
            VWP N+ NLI V  Y +F
Sbjct: 303  VWPRNRPNLIKVVPYDFF 320


>gi|429854785|gb|ELA29772.1| RNA polymerase ii ctd phosphatase [Colletotrichum gloeosporioides
            Nara gc5]
          Length = 829

 Score =  107 bits (268), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 69/194 (35%), Positives = 107/194 (55%), Gaps = 26/194 (13%)

Query: 912  QKKMFSARKLCLVLDLDHTLLNSA------KFHE--VDPVHDEILRKKEEQDREKPHRHL 963
            QK++   RKL LV+DLD T++++       ++ E   +P ++ +   K+ Q  ++  R +
Sbjct: 154  QKRLLRQRKLSLVVDLDQTIIHACIEPTVGEWMEDPTNPNYNAVKDVKKFQLNDEGPRGV 213

Query: 964  FRFPHMGMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
                  G W   K+RPG+  FLE+ S+L+E+H+YTMG + YA  +A+++DP   LF  RV
Sbjct: 214  VT---SGCWYYIKMRPGLKEFLEKISELYELHVYTMGTRAYAMNIAQIVDPDRKLFGNRV 270

Query: 1022 ISRGDDGDPFDGDERVPKSKDLEGVLGMES-AVVIIDDSVRVWPHNKLNLIVVERYTYFP 1080
            ISR ++G           SK L+ +  + +  VVIIDD   VWP N+ NLI V  Y +F 
Sbjct: 271  ISRDENGSMI--------SKSLQRLFPVNTNMVVIIDDRADVWPRNRPNLIKVVPYDFF- 321

Query: 1081 CSRRQFGLLGPSLL 1094
               R  G +  S L
Sbjct: 322  ---RGIGDINSSFL 332


>gi|116179414|ref|XP_001219556.1| hypothetical protein CHGG_00335 [Chaetomium globosum CBS 148.51]
 gi|88184632|gb|EAQ92100.1| hypothetical protein CHGG_00335 [Chaetomium globosum CBS 148.51]
          Length = 828

 Score =  107 bits (266), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 62/190 (32%), Positives = 107/190 (56%), Gaps = 22/190 (11%)

Query: 895  DQQKAAIQKERTRRLEE--QKKMFSARKLCLVLDLDHTLLNSA--------KFHEVDPVH 944
            DQ    +     +R E+  Q+++  +RKL LV+DLD T++ +         +    +P H
Sbjct: 135  DQTNLTVSASHAQRTEQELQRRLLVSRKLSLVVDLDQTIIQACIDPTVGDWQKDPTNPNH 194

Query: 945  DEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYAT 1004
            + +   K  Q  + P +   +  +   + K+RPG+ +FL+R ++++E+H+YTMG + YA 
Sbjct: 195  ESVKSVKSFQLDDGPTQAANQCSY---YIKMRPGLESFLKRIAQMYELHVYTMGTRAYAQ 251

Query: 1005 EMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVV-IIDDSVRVW 1063
             +A+V+DP   LF  RVISR ++G  +        +KDL+ +  + + +V IIDD   VW
Sbjct: 252  NVARVVDPDKKLFGNRVISRDENGSIY--------AKDLQRLFPISTHMVAIIDDRSDVW 303

Query: 1064 PHNKLNLIVV 1073
            P+N+ NLI V
Sbjct: 304  PNNRANLIKV 313


>gi|322706326|gb|EFY97907.1| RNA Polymerase II CTD phosphatase Fcp1 [Metarhizium anisopliae ARSEF
            23]
          Length = 807

 Score =  106 bits (264), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 64/177 (36%), Positives = 96/177 (54%), Gaps = 19/177 (10%)

Query: 912  QKKMFSARKLCLVLDLDHTLLNSA--------KFHEVDPVHDEILRKKEEQDREKPHRHL 963
            QK++   RKL LV+DLD T++++         +  E +P H+ +   K  Q  +   R L
Sbjct: 150  QKRLLRQRKLSLVVDLDQTIIHACIEPTIGEWQKDESNPNHEAVKDVKSFQLNDDGPRGL 209

Query: 964  FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 1023
                    + KLRPG+  FLE  + ++E+H+YTMG + YA  +A+++DP   LF  RVIS
Sbjct: 210  A--SGCTYYIKLRPGLQEFLEEIATMYELHVYTMGTRAYALNIARIVDPDRKLFGNRVIS 267

Query: 1024 RGDDGDPFDGDERVPKSKDLEGVLGMES-AVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            R ++G           SK L+ +  + +  VVIIDD   VWP N+ NLI V  Y +F
Sbjct: 268  RDENGS--------ITSKSLQRLFPVSTNMVVIIDDRADVWPRNRPNLIKVVPYDFF 316


>gi|336466789|gb|EGO54953.1| hypothetical protein NEUTE1DRAFT_84976 [Neurospora tetrasperma FGSC
            2508]
 gi|350288620|gb|EGZ69856.1| hypothetical protein NEUTE2DRAFT_160171 [Neurospora tetrasperma FGSC
            2509]
          Length = 867

 Score =  106 bits (264), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 70/223 (31%), Positives = 116/223 (52%), Gaps = 42/223 (18%)

Query: 895  DQQKAAIQKERTRRLEE--QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKE 952
            DQ    + + + ++ E   Q+++   RKL LV+DLD T++++     +DP   E      
Sbjct: 135  DQTHLTVSETQAQKTENALQRRLLQHRKLSLVVDLDQTIIHAC----IDPTVGEW----- 185

Query: 953  EQDREKPH----RHLFRFP----------HMGMWTKLRPGIWTFLERASKLFEMHLYTMG 998
            ++D   P+    R++  F           +   + K+RPG+  FL++ S ++E+H+YTMG
Sbjct: 186  QKDPSNPNYPSVRNVKSFQLDDGPRGVANNCWYYIKMRPGLEDFLKKISTMYELHVYTMG 245

Query: 999  NKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIID 1057
             + YA  +A+++DP   LF  RVISR ++G+ +        +K L+ +  + +  VVIID
Sbjct: 246  TRAYAQNVARIVDPDKKLFGNRVISRDENGNMY--------AKSLQRLFPVSTKMVVIID 297

Query: 1058 DSVRVWPHNKLNLIVVERYTYFPCSR--------RQFGLLGPS 1092
            D   VWP N+ NLI V  Y +F            +Q GLL PS
Sbjct: 298  DRADVWPRNRPNLIKVSPYDFFKGIGDINSGFLPKQQGLLTPS 340


>gi|302889251|ref|XP_003043511.1| predicted protein [Nectria haematococca mpVI 77-13-4]
 gi|256724428|gb|EEU37798.1| predicted protein [Nectria haematococca mpVI 77-13-4]
          Length = 765

 Score =  106 bits (264), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 67/179 (37%), Positives = 97/179 (54%), Gaps = 20/179 (11%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSA--------KFHEVDPVHDEILRKKEEQDREKPHR 961
            E QK++   RKL LV+DLD T++++         +    +P H  +   K  Q  + P R
Sbjct: 148  ESQKRLLRQRKLTLVVDLDQTIIHACIEPTIGEWQRDPTNPNHQAVKDVKSFQLDDGP-R 206

Query: 962  HLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
             L        + KLRPG+  FLE  SK++E+H+YTMG + YA  +A+++DP   LF  RV
Sbjct: 207  GLA--SGCTYYIKLRPGLAEFLEEISKMYELHVYTMGTRAYALNIARIVDPDKKLFGNRV 264

Query: 1022 ISRGDDGDPFDGDERVPKSKDLEGVLGMES-AVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            ISR ++G           SK L+ +  + +  VVIIDD   VWP N+ NLI V  Y +F
Sbjct: 265  ISRDENGS--------ITSKSLQRLFPVSTDMVVIIDDRADVWPLNRPNLIKVVPYDFF 315


>gi|164429292|ref|XP_958446.2| hypothetical protein NCU11408 [Neurospora crassa OR74A]
 gi|157073422|gb|EAA29210.2| conserved hypothetical protein [Neurospora crassa OR74A]
          Length = 868

 Score =  106 bits (264), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 70/223 (31%), Positives = 116/223 (52%), Gaps = 42/223 (18%)

Query: 895  DQQKAAIQKERTRRLEE--QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKE 952
            DQ    + + + ++ E   Q+++   RKL LV+DLD T++++     +DP   E      
Sbjct: 135  DQTHLTVSETQAQKTENALQRRLLQHRKLSLVVDLDQTIIHAC----IDPTVGEW----- 185

Query: 953  EQDREKPH----RHLFRFP----------HMGMWTKLRPGIWTFLERASKLFEMHLYTMG 998
            ++D   P+    R++  F           +   + K+RPG+  FL++ S ++E+H+YTMG
Sbjct: 186  QKDPSNPNYPSVRNVKSFQLDDGPRGVANNCWYYIKMRPGLEDFLKKISTMYELHVYTMG 245

Query: 999  NKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIID 1057
             + YA  +A+++DP   LF  RVISR ++G+ +        +K L+ +  + +  VVIID
Sbjct: 246  TRAYAQNVARIVDPDKKLFGNRVISRDENGNMY--------AKSLQRLFPVSTKMVVIID 297

Query: 1058 DSVRVWPHNKLNLIVVERYTYFPCSR--------RQFGLLGPS 1092
            D   VWP N+ NLI V  Y +F            +Q GLL PS
Sbjct: 298  DRADVWPRNRPNLIKVSPYDFFKGIGDINSGFLPKQQGLLTPS 340


>gi|336259270|ref|XP_003344437.1| hypothetical protein SMAC_08633 [Sordaria macrospora k-hell]
 gi|380087533|emb|CCC05319.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 878

 Score =  105 bits (263), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 64/202 (31%), Positives = 110/202 (54%), Gaps = 34/202 (16%)

Query: 895  DQQKAAIQKERTRRLEE--QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKE 952
            DQ    + + + ++ E   Q+++   RKL LV+DLD T++++     +DP   E      
Sbjct: 135  DQTHLTVSETQAQKTENALQRRLLQHRKLSLVVDLDQTIIHAC----IDPTVGEW----- 185

Query: 953  EQDREKPH----RHLFRFP----------HMGMWTKLRPGIWTFLERASKLFEMHLYTMG 998
            ++D   P+    R++  F           +   + K+RPG+  FL++ S ++E+H+YTMG
Sbjct: 186  QKDPSNPNYPSVRNVKSFQLDDGPRGVANNCWYYIKMRPGLEDFLKKISTMYELHVYTMG 245

Query: 999  NKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIID 1057
             + YA  +A+++DP+  LF  RVISR ++G+ +        +K L+ +  + +  VVIID
Sbjct: 246  TRAYAQNVARIVDPEKKLFGNRVISRDENGNMY--------AKSLQRLFPVSTKMVVIID 297

Query: 1058 DSVRVWPHNKLNLIVVERYTYF 1079
            D   VWP N+ NLI V  Y +F
Sbjct: 298  DRADVWPRNRPNLIKVSPYDFF 319


>gi|342878347|gb|EGU79693.1| hypothetical protein FOXB_09806 [Fusarium oxysporum Fo5176]
          Length = 769

 Score =  105 bits (263), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 64/179 (35%), Positives = 98/179 (54%), Gaps = 19/179 (10%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSA--------KFHEVDPVHDEILRKKEEQDREKPHR 961
            E QK++   RKL LV+DLD T++++         K    +P ++ +   ++ Q  +   R
Sbjct: 148  ENQKRLLRQRKLSLVVDLDQTIIHACIEPTIGEWKNDPTNPNYEAVKDVRDFQLNDDGPR 207

Query: 962  HLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
             L        + KLRPG+  FL+  SK++E+H+YTMG + YA  +AK++DP   LF  RV
Sbjct: 208  GLT--SGCTYYIKLRPGLMEFLDEVSKMYELHVYTMGTRAYALNIAKIVDPDQKLFGNRV 265

Query: 1022 ISRGDDGDPFDGDERVPKSKDLEGVLGMES-AVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            ISR ++G           +K L+ +  + +  VVIIDD   VWP N+ NLI V  Y +F
Sbjct: 266  ISRDENGS--------ITAKSLQRLFPVSTDMVVIIDDRADVWPMNRPNLIKVVPYDFF 316


>gi|358390781|gb|EHK40186.1| hypothetical protein TRIATDRAFT_89336 [Trichoderma atroviride IMI
            206040]
          Length = 768

 Score =  105 bits (262), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 64/179 (35%), Positives = 97/179 (54%), Gaps = 19/179 (10%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSA--------KFHEVDPVHDEILRKKEEQDREKPHR 961
            E QK++   RKL LV+DLD T++++         +  + +P H+ +   K  Q  +   R
Sbjct: 148  ESQKRLLRQRKLSLVVDLDQTIIHACIEPTVGEWQRDKANPNHEAVKDVKSFQLNDDGPR 207

Query: 962  HLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
             L        + KLRPG+  FLE  S ++E+H+YTMG + YA  +A+++DP   LF  RV
Sbjct: 208  GLA--SGCTYYIKLRPGLHEFLETVSTMYELHVYTMGTRAYALNIARIVDPDKKLFGNRV 265

Query: 1022 ISRGDDGDPFDGDERVPKSKDLEGVLGMES-AVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            ISR ++G           +K L+ +  + +  VVIIDD   VWP N+ NLI V  Y +F
Sbjct: 266  ISRDENGS--------ITAKSLQRLFPVSTDMVVIIDDRSDVWPMNRPNLIKVVPYDFF 316


>gi|281206665|gb|EFA80851.1| putative tfiif-interacting component of the c-terminal domain
            phosphatase [Polysphondylium pallidum PN500]
          Length = 881

 Score =  105 bits (262), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 68/184 (36%), Positives = 97/184 (52%), Gaps = 21/184 (11%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSA---KFHEVDPVHDEILRKKEEQDREKPHRHLFRF 966
            E  K++   RKL LVLD+DHT++++     F EV P    I       D EK +      
Sbjct: 266  ENAKRLIKQRKLSLVLDIDHTIIHAIMEPHFMEV-PYWRNI-------DCEKENIRSITL 317

Query: 967  PHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD 1026
             +M  + KLRP ++ FLE  +K FE+H+YTMG + YA E+AK++D K  LF  R++SR D
Sbjct: 318  GNMKYYIKLRPFLYKFLEDVNKKFELHIYTMGTRNYALEIAKLIDEKQELFKERILSRDD 377

Query: 1027 DGD-PFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQ 1085
              D  F   +R+    D        S V+I+DD   VW  +K NL+ +  Y YF   +  
Sbjct: 378  TTDMSFKTLQRLFPCDD--------SMVLIVDDRSDVWKRSK-NLVQISPYLYFVGCKDM 428

Query: 1086 FGLL 1089
              LL
Sbjct: 429  VNLL 432


>gi|378731871|gb|EHY58330.1| protein phosphatase [Exophiala dermatitidis NIH/UT8656]
          Length = 856

 Score =  105 bits (262), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 81/238 (34%), Positives = 120/238 (50%), Gaps = 36/238 (15%)

Query: 854  AVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKAAIQKERTRRLEEQ- 912
             + TN     T   +G E       P     D  HL           I KE   R++E+ 
Sbjct: 102  GMCTNCGKDMTTVQAGSETTDADRAPIRMTHDTPHL----------TISKEEAARIDEEA 151

Query: 913  -KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEE--QDREKPHRHLFRF--- 966
             +++ S+RKL LV+DLD T++++A    VDP   E  + K+    D  K  R  F+    
Sbjct: 152  KRRLLSSRKLSLVVDLDQTIIHAA----VDPTIAEWQKDKDNPNYDAVKDVRS-FQLIDD 206

Query: 967  -PHM-GMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVI 1022
             P M G W   KLRPG+  FLE  S+L+EMH+YTMG + YA ++A ++DP+   F  R++
Sbjct: 207  GPGMRGCWYYIKLRPGLTEFLEHISQLYEMHIYTMGTRQYAQQIAAIVDPERKFFGDRIL 266

Query: 1023 SRGDDGDPFDGDERVPKSKDLEGVLGMES-AVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            SR + G           +K+LE +  +++  VVIIDD   VW  +  NLI V  + +F
Sbjct: 267  SRDESGSMV--------AKNLERLFPVDTKMVVIIDDRGDVWKWSA-NLIRVRPFDFF 315


>gi|157823025|ref|NP_001099601.1| RNA polymerase II subunit A C-terminal domain phosphatase [Rattus
            norvegicus]
 gi|149015915|gb|EDL75222.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
            phosphatase, subunit 1 (predicted), isoform CRA_a [Rattus
            norvegicus]
          Length = 969

 Score =  105 bits (261), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 66/181 (36%), Positives = 101/181 (55%), Gaps = 21/181 (11%)

Query: 901  IQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPH 960
            +Q E+  R E+Q+++   RKL L++DLD TL+++ + H     +  I            H
Sbjct: 160  MQAEKLGR-EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCPQMSNKGIF-----------H 207

Query: 961  RHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGR 1020
              L R   M + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R
Sbjct: 208  FQLGRGEPM-LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHR 266

Query: 1021 VISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            ++SR +  DPF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 267  ILSRDECIDPFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 319

Query: 1080 P 1080
            P
Sbjct: 320  P 320


>gi|186510238|ref|NP_001118664.1| haloacid dehalogenase-like hydrolase domain-containing protein
            [Arabidopsis thaliana]
 gi|9294424|dbj|BAB02544.1| unnamed protein product [Arabidopsis thaliana]
 gi|332642743|gb|AEE76264.1| haloacid dehalogenase-like hydrolase domain-containing protein
            [Arabidopsis thaliana]
          Length = 307

 Score =  105 bits (261), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 74/237 (31%), Positives = 125/237 (52%), Gaps = 27/237 (11%)

Query: 887  EHLFEGYDDQQKA-AIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHD 945
            +++F+G     +A A+ K  T +L       + +KL LVLDLDHTLL++     +     
Sbjct: 61   DYIFDGLQLSHEAVALTKCFTTKL----SCLNEKKLHLVLDLDHTLLHTVMVPSLSQAEK 116

Query: 946  EILRKKEEQDREKPHRHLFRFPHMG----MWTKLRPGIWTFLERASKLFEMHLYTMGNKL 1001
             ++ +     R+     L++   +G      TKLRP +  FL+ A++ F M++YT G+++
Sbjct: 117  YLIEEAGSATRDD----LWKIKAVGDPMEFLTKLRPFLRDFLKEANEFFTMYVYTKGSRV 172

Query: 1002 YATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVR 1061
            YA ++ +++DPK + F  RVI++ +           P  K L+ VL  E  VVI+DD+  
Sbjct: 173  YAKQVLELIDPKKLYFGDRVITKTES----------PHMKTLDFVLAEERGVVIVDDTRN 222

Query: 1062 VWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVRQQLH 1118
            VWP +K NL+ + +Y+YF    R  G       E   DE   +G LA+ L + +++H
Sbjct: 223  VWPDHKSNLVDISKYSYF----RLKGQDSMPYSEEKTDESESEGGLANVLKLLKEVH 275


>gi|148677457|gb|EDL09404.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
            phosphatase, subunit 1, isoform CRA_a [Mus musculus]
          Length = 956

 Score =  105 bits (261), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 66/181 (36%), Positives = 101/181 (55%), Gaps = 21/181 (11%)

Query: 901  IQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPH 960
            +Q E+  R E+Q+++   RKL L++DLD TL+++ + H     +  I            H
Sbjct: 160  MQAEKLGR-EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCPQMSNKGIF-----------H 207

Query: 961  RHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGR 1020
              L R   M + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R
Sbjct: 208  FQLGRGEPM-LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHR 266

Query: 1021 VISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            ++SR +  DPF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 267  ILSRDECIDPFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 319

Query: 1080 P 1080
            P
Sbjct: 320  P 320


>gi|351695852|gb|EHA98770.1| hypothetical protein GW7_03722 [Heterocephalus glaber]
          Length = 963

 Score =  105 bits (261), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 63/172 (36%), Positives = 97/172 (56%), Gaps = 20/172 (11%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
            E+Q+++   RKL L++DLD TL+++ + H     +  I            H  L R   M
Sbjct: 170  EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCPQMSNKGIF-----------HFQLGRGEPM 218

Query: 970  GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
             + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 219  -LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 277

Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYFP 1080
            PF       K+ +L+ +    +S V IIDD   VW     NLI V++Y YFP
Sbjct: 278  PFS------KTGNLKNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYFP 322


>gi|348555132|ref|XP_003463378.1| PREDICTED: RNA polymerase II subunit A C-terminal domain phosphatase
            [Cavia porcellus]
          Length = 970

 Score =  105 bits (261), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 66/180 (36%), Positives = 100/180 (55%), Gaps = 21/180 (11%)

Query: 902  QKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR 961
            Q E+  R E+Q+++   RKL L++DLD TL+++ + H     +  I            H 
Sbjct: 165  QAEKLGR-EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCPQMSNKGIF-----------HF 212

Query: 962  HLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
             L R   M + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R+
Sbjct: 213  QLGRGEPM-LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRI 271

Query: 1022 ISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYFP 1080
            +SR +  DPF       K+ +L  +    +S V IIDD   VW     NLI V++Y YFP
Sbjct: 272  LSRDECIDPFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYFP 324


>gi|74140094|dbj|BAE33777.1| unnamed protein product [Mus musculus]
          Length = 960

 Score =  104 bits (260), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 66/180 (36%), Positives = 100/180 (55%), Gaps = 21/180 (11%)

Query: 902  QKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR 961
            Q E+  R E+Q+++   RKL L++DLD TL+++ + H     +  I            H 
Sbjct: 165  QAEKLGR-EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCPQMSNKGIF-----------HF 212

Query: 962  HLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
             L R   M + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R+
Sbjct: 213  QLGRGEPM-LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRI 271

Query: 1022 ISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYFP 1080
            +SR +  DPF       K+ +L  +    +S V IIDD   VW     NLI V++Y YFP
Sbjct: 272  LSRDECIDPFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYFP 324


>gi|34328280|ref|NP_080571.2| RNA polymerase II subunit A C-terminal domain phosphatase [Mus
            musculus]
 gi|46395722|sp|Q7TSG2.1|CTDP1_MOUSE RecName: Full=RNA polymerase II subunit A C-terminal domain
            phosphatase; AltName: Full=TFIIF-associating CTD
            phosphatase
 gi|31419683|gb|AAH53435.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
            phosphatase, subunit 1 [Mus musculus]
          Length = 960

 Score =  104 bits (260), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 66/180 (36%), Positives = 100/180 (55%), Gaps = 21/180 (11%)

Query: 902  QKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR 961
            Q E+  R E+Q+++   RKL L++DLD TL+++ + H     +  I            H 
Sbjct: 165  QAEKLGR-EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCPQMSNKGIF-----------HF 212

Query: 962  HLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
             L R   M + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R+
Sbjct: 213  QLGRGEPM-LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRI 271

Query: 1022 ISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYFP 1080
            +SR +  DPF       K+ +L  +    +S V IIDD   VW     NLI V++Y YFP
Sbjct: 272  LSRDECIDPFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYFP 324


>gi|400603434|gb|EJP71032.1| FCP1-like phosphatase [Beauveria bassiana ARSEF 2860]
          Length = 774

 Score =  104 bits (259), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 66/177 (37%), Positives = 94/177 (53%), Gaps = 19/177 (10%)

Query: 912  QKKMFSARKLCLVLDLDHTLLNSAKFHEV--------DPVHDEILRKKEEQDREKPHRHL 963
            QK++   RKL LV+DLD T++++     V        +P H  +   +  Q  +   R L
Sbjct: 150  QKRLLRHRKLSLVVDLDQTIIHACIEPTVGEWQRDPSNPNHSAVKDVRSFQLNDDGPRGL 209

Query: 964  FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 1023
                    + KLRPG+  FLE  SK++E+H+YTMG + YA  +AK++DP   LF  RVIS
Sbjct: 210  A--SGCTYYIKLRPGLSEFLEEISKMYELHVYTMGTRAYALNIAKIVDPDRKLFGNRVIS 267

Query: 1024 RGDDGDPFDGDERVPKSKDLEGVLGMES-AVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            R ++G           SK L  +  + +  VVIIDD   VWP N+ NLI V  Y +F
Sbjct: 268  RDENGS--------ITSKSLARLFPVSTDMVVIIDDRADVWPMNRPNLIKVVPYDFF 316


>gi|9294260|dbj|BAB02162.1| unnamed protein product [Arabidopsis thaliana]
          Length = 288

 Score =  104 bits (259), Expect = 3e-19,   Method: Composition-based stats.
 Identities = 63/213 (29%), Positives = 103/213 (48%), Gaps = 16/213 (7%)

Query: 906  TRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFR 965
            T+ L     ++  +KL LVLDLDHTL++S K   +      ++++++   R+   ++  R
Sbjct: 62   TKHLTTLVSVYGRKKLHLVLDLDHTLIHSMKTSNLSKAEKYLIKEEKSGSRKDLRKYNNR 121

Query: 966  FPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG 1025
                    K RP +  FL+ A+KLF M  YT G   Y   + +++DP  + F  R+I+R 
Sbjct: 122  L------VKFRPFVEEFLKEANKLFTMTAYTKGGSTYGQAVVRMIDPNKIYFGDRIITRK 175

Query: 1026 DDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQ 1085
            +           P  K L+ VL  E  +VI+D++  VWPH+K NL+ +  Y YF    + 
Sbjct: 176  ES----------PDLKTLDLVLADERGIVIVDNTPNVWPHHKRNLLEITSYFYFKNDGKN 225

Query: 1086 FGLLGPSLLEIDHDERSEDGTLASSLGVRQQLH 1118
                  S  E   DE      L + L   +++H
Sbjct: 226  MMRSRLSYAERKSDESRTKRALVNLLKFLKEVH 258


>gi|148677459|gb|EDL09406.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
            phosphatase, subunit 1, isoform CRA_c [Mus musculus]
          Length = 1000

 Score =  104 bits (259), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 66/180 (36%), Positives = 100/180 (55%), Gaps = 21/180 (11%)

Query: 902  QKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR 961
            Q E+  R E+Q+++   RKL L++DLD TL+++ + H     +  I            H 
Sbjct: 205  QAEKLGR-EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCPQMSNKGIF-----------HF 252

Query: 962  HLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
             L R   M + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R+
Sbjct: 253  QLGRGEPM-LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRI 311

Query: 1022 ISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYFP 1080
            +SR +  DPF       K+ +L  +    +S V IIDD   VW     NLI V++Y YFP
Sbjct: 312  LSRDECIDPFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYFP 364


>gi|429963056|gb|ELA42600.1| FCP1-like phosphatase, phosphatase domain-containing protein
            [Vittaforma corneae ATCC 50505]
          Length = 445

 Score =  104 bits (259), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 64/210 (30%), Positives = 106/210 (50%), Gaps = 43/210 (20%)

Query: 869  GPEAGPVGAHPQSAWGDVEHLFEGYDDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLD 928
            G +  PV  H        + +F+  ++ +K  +QK R ++L E+KKM       L+LDLD
Sbjct: 45   GTDLVPVLHHT-------DRVFQTSEEARK--LQKIRNKQLNEEKKMI------LILDLD 89

Query: 929  HTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASK 988
             T+L++  + ++D                      F       + KLRP +  FLE+ SK
Sbjct: 90   QTILHTTLW-KIDC------------------DFTFSISSTMFYVKLRPHLNRFLEKISK 130

Query: 989  LFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLG 1048
            +FE+H+YTMG + Y TE+ K +DP G+ F  R++SR ++ +           K +E +  
Sbjct: 131  MFEIHIYTMGTREYVTEICKAIDPNGIYFGDRIVSRNENFNEL--------KKSIERITC 182

Query: 1049 MESAVVIIDDSVRVWPHNKLNLIVVERYTY 1078
            +   VVIIDD   VW ++K NL+++  + Y
Sbjct: 183  ISRNVVIIDDRADVWNYSK-NLVLIRPFWY 211


>gi|344242866|gb|EGV98969.1| hypothetical protein I79_008270 [Cricetulus griseus]
          Length = 848

 Score =  104 bits (259), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 65/185 (35%), Positives = 103/185 (55%), Gaps = 31/185 (16%)

Query: 902  QKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR 961
            Q E+  R E+Q+++   RKL L++DLD TL+++                 E+Q  +  ++
Sbjct: 46   QAEKLGR-EDQQRLHRNRKLVLMVDLDQTLIHTT----------------EQQCPQMSNK 88

Query: 962  HLFRFPHMG-----MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 1016
             +F F  +G     + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  L
Sbjct: 89   GIFHF-QLGRGEPMLHTRLRPHCRDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKL 147

Query: 1017 FAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVER 1075
            F+ R++SR +  DPF       K+ +L  +    +S V IIDD   VW     NLI V++
Sbjct: 148  FSHRILSRDECIDPFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKK 200

Query: 1076 YTYFP 1080
            Y YFP
Sbjct: 201  YVYFP 205


>gi|115396432|ref|XP_001213855.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
 gi|114193424|gb|EAU35124.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
          Length = 820

 Score =  103 bits (258), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 75/202 (37%), Positives = 111/202 (54%), Gaps = 33/202 (16%)

Query: 893  YDDQQKAAIQKERTRRLEEQKK-MFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKK 951
            +D+      +KE TR  E+ K+ + S RKL LV+DLD T++++     VDP   E +   
Sbjct: 131  HDNTALTVSEKEATRVEEDAKRRLLSNRKLSLVVDLDQTIIHAT----VDPTVGEWM--- 183

Query: 952  EEQDREKP-HRHL-----FRF----PHM-GMW--TKLRPGIWTFLERASKLFEMHLYTMG 998
              +D+E P H+ L     F+     P M G W   KLRPG+ TFLE  ++LFE+H+YTMG
Sbjct: 184  --EDKENPNHQALSDVRAFQLVDDGPGMRGCWYYVKLRPGLETFLENVAELFELHIYTMG 241

Query: 999  NKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIID 1057
             + YA  +A ++DP   LF  R++SR + G           +K+L  +  +++  VVIID
Sbjct: 242  TRAYAQHIASIIDPDRKLFGDRILSRDESGS--------LTAKNLHRLFPVDTKMVVIID 293

Query: 1058 DSVRVWPHNKLNLIVVERYTYF 1079
            D   VW  +  NLI V  Y +F
Sbjct: 294  DRGDVWRWSP-NLIKVSPYDFF 314


>gi|50838820|ref|NP_001002873.1| RNA polymerase II subunit A C-terminal domain phosphatase [Danio
            rerio]
 gi|49618915|gb|AAT68042.1| RNA polymerase II CTD phosphatase [Danio rerio]
          Length = 947

 Score =  103 bits (258), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 63/171 (36%), Positives = 96/171 (56%), Gaps = 20/171 (11%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
            E+Q+++   RKL L++DLD TL+++ + H     +  I            H  L R   M
Sbjct: 159  EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQRMSNKGIF-----------HFQLGRGEPM 207

Query: 970  GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
             + T+LRP    FLE+ +KLFE+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 208  -LHTRLRPHCKDFLEKIAKLFELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 266

Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            PF       K+ +L+ +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 267  PFS------KTGNLKNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYIYF 310


>gi|334325963|ref|XP_001374906.2| PREDICTED: RNA polymerase II subunit A C-terminal domain
            phosphatase-like [Monodelphis domestica]
          Length = 1208

 Score =  103 bits (257), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 62/171 (36%), Positives = 96/171 (56%), Gaps = 20/171 (11%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
            E+Q+++   RKL L++DLD TL+++ + H     +  I            H  L R   M
Sbjct: 395  EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 443

Query: 970  GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
             + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 444  -LHTRLRPHCKEFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 502

Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            PF       K+ +L  +    +S V IIDD   VW +   NLI V++Y YF
Sbjct: 503  PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKYAP-NLITVKKYVYF 546


>gi|354479392|ref|XP_003501894.1| PREDICTED: RNA polymerase II subunit A C-terminal domain phosphatase
            [Cricetulus griseus]
          Length = 978

 Score =  103 bits (257), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 62/177 (35%), Positives = 99/177 (55%), Gaps = 30/177 (16%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
            E+Q+++   RKL L++DLD TL+++                 E+Q  +  ++ +F F  +
Sbjct: 183  EDQQRLHRNRKLVLMVDLDQTLIHTT----------------EQQCPQMSNKGIFHF-QL 225

Query: 970  G-----MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISR 1024
            G     + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R++SR
Sbjct: 226  GRGEPMLHTRLRPHCRDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSR 285

Query: 1025 GDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYFP 1080
             +  DPF       K+ +L  +    +S V IIDD   VW     NLI V++Y YFP
Sbjct: 286  DECIDPFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYFP 335


>gi|297808347|ref|XP_002872057.1| NLI interacting factor family protein [Arabidopsis lyrata subsp.
            lyrata]
 gi|297317894|gb|EFH48316.1| NLI interacting factor family protein [Arabidopsis lyrata subsp.
            lyrata]
          Length = 302

 Score =  103 bits (257), Expect = 6e-19,   Method: Composition-based stats.
 Identities = 74/232 (31%), Positives = 115/232 (49%), Gaps = 21/232 (9%)

Query: 887  EHLFEGYDDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDE 946
            ++L++G     +A +    T+R+  Q      +KL LVLDLDHTL+++ K  ++     E
Sbjct: 56   DYLYKGMHMSHEALV---FTKRVISQTSWLEDKKLHLVLDLDHTLVHTIKASQL--YESE 110

Query: 947  ILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEM 1006
                +E   R+   R    FP   +  KLRP +  FL+  +++F M++YT G   YA  +
Sbjct: 111  KCLTEEVGSRKDLWRFNSGFPDESL-IKLRPFVHQFLKECNEMFSMYVYTKGGCDYAQVV 169

Query: 1007 AKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHN 1066
             +++DP+ + F  RVI+R +           P  K L+ VL  E  VVI+DD   VWPH+
Sbjct: 170  LELIDPEKIYFGNRVITRRES----------PDLKTLDLVLADERGVVIVDDKCSVWPHD 219

Query: 1067 KLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVRQQLH 1118
            K NL+ + +Y YF      F     S  +   DE  E G L   L   + +H
Sbjct: 220  KKNLLQIAKYKYFGDQSCSF-----SECKNKRDESEEKGPLDIVLRFLKDVH 266


>gi|406865754|gb|EKD18795.1| FCP1-like phosphatase [Marssonina brunnea f. sp. 'multigermtubi'
            MB_m1]
          Length = 863

 Score =  103 bits (257), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 65/181 (35%), Positives = 98/181 (54%), Gaps = 23/181 (12%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDRE--KPHRHLF--- 964
            E Q+++   RKL LV+DLD T++++     ++P   E  R K   + E  K  +      
Sbjct: 155  ELQRRLLKHRKLSLVVDLDQTIIHAC----IEPTVGEWQRDKNSPNYEAVKDVKSFQLND 210

Query: 965  ---RFPHMGMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAG 1019
               R    G W   K+RPG+  FL   S+L+E+H+YTMG + YA  +AK++DP   LF  
Sbjct: 211  DGPRGLASGCWYYIKMRPGLAEFLAHISELYELHVYTMGTRAYAINIAKIVDPDKKLFGD 270

Query: 1020 RVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERYTY 1078
            R+ISR ++G+          +K L  +  +++  VVIIDD   VWP N+ NLI V  Y +
Sbjct: 271  RIISRDENGN--------VTAKSLARLFPVDTKMVVIIDDRADVWPQNRPNLIKVVPYDF 322

Query: 1079 F 1079
            F
Sbjct: 323  F 323


>gi|358383388|gb|EHK21054.1| hypothetical protein TRIVIDRAFT_90991 [Trichoderma virens Gv29-8]
          Length = 758

 Score =  103 bits (257), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 68/196 (34%), Positives = 101/196 (51%), Gaps = 21/196 (10%)

Query: 895  DQQKAAIQKERTRRLEE--QKKMFSARKLCLVLDLDHTLLNSA--------KFHEVDPVH 944
            DQ    + K   +R E   QK++   RKL LV+DLD T++++         +    +P H
Sbjct: 131  DQTGLMVSKNVAKRAEHDTQKRLLRQRKLSLVVDLDQTIIHACIEPTIGEWQRDPTNPNH 190

Query: 945  DEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYAT 1004
            + +   K  Q  +   R L        + KLRPG+  FLE  S  +E+H+YTMG + YA 
Sbjct: 191  EAVKDVKSFQLNDDGPRGLA--SGCTYYIKLRPGLQEFLEAVSTKYELHVYTMGTRAYAL 248

Query: 1005 EMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMES-AVVIIDDSVRVW 1063
             +A+++DP   LF  RVISR ++G           +K L+ +  + +  VVIIDD   VW
Sbjct: 249  NIARIVDPDRKLFGNRVISRDENGS--------ITAKSLQRLFPVSTDMVVIIDDRADVW 300

Query: 1064 PHNKLNLIVVERYTYF 1079
            P N+ NLI V  Y +F
Sbjct: 301  PMNRPNLIKVVPYDFF 316


>gi|395830784|ref|XP_003788497.1| PREDICTED: RNA polymerase II subunit A C-terminal domain phosphatase
            [Otolemur garnettii]
          Length = 1290

 Score =  103 bits (256), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 63/172 (36%), Positives = 96/172 (55%), Gaps = 20/172 (11%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
            E+Q+++   RKL L++DLD TL+++ + H     +  I            H  L R   M
Sbjct: 172  EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCPQMSNKGIF-----------HFQLGRGEPM 220

Query: 970  GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
             + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 221  -LHTRLRPHCRDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 279

Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYFP 1080
            PF       K+ +L  +    +S V IIDD   VW     NLI V++Y YFP
Sbjct: 280  PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYFP 324


>gi|417412899|gb|JAA52807.1| Putative rna polymerase ii subunit a c-terminal domain phosphatase,
            partial [Desmodus rotundus]
          Length = 845

 Score =  103 bits (256), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 62/171 (36%), Positives = 96/171 (56%), Gaps = 20/171 (11%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
            E+Q+++   RKL L++DLD TL+++ + H     +  IL           H  L R   M
Sbjct: 66   EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIL-----------HFQLGRGEPM 114

Query: 970  GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
             + T+LRP    FLE+ ++L+E+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 115  -LHTRLRPHCRQFLEKVARLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 173

Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            PF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 174  PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 217


>gi|296222911|ref|XP_002757404.1| PREDICTED: RNA polymerase II subunit A C-terminal domain phosphatase
            [Callithrix jacchus]
          Length = 1053

 Score =  103 bits (256), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
            E+Q+++   RKL L++DLD TL+++ + H     +  I            H  L R   M
Sbjct: 172  EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 220

Query: 970  GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
             + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 221  -LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 279

Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            PF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 280  PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 323


>gi|410911388|ref|XP_003969172.1| PREDICTED: RNA polymerase II subunit A C-terminal domain
            phosphatase-like [Takifugu rubripes]
          Length = 905

 Score =  103 bits (256), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 65/182 (35%), Positives = 102/182 (56%), Gaps = 21/182 (11%)

Query: 899  AAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREK 958
            +A Q E+  R E+Q+++   +KL L++DLD TL+++ + H     +  IL          
Sbjct: 156  SAEQAEQLGR-EDQQRLHRNKKLVLMVDLDQTLIHTTEQHCQRMSNKGIL---------- 204

Query: 959  PHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA 1018
             H  L R   M + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+
Sbjct: 205  -HFQLGRGEPM-LHTRLRPHCKEFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFS 262

Query: 1019 GRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYT 1077
             R++SR +  DPF       K+ +L  +    +S V IIDD   VW     NL+ V++Y 
Sbjct: 263  HRILSRDECIDPFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLVTVKKYV 315

Query: 1078 YF 1079
            YF
Sbjct: 316  YF 317


>gi|326916917|ref|XP_003204751.1| PREDICTED: RNA polymerase II subunit A C-terminal domain
            phosphatase-like [Meleagris gallopavo]
          Length = 1003

 Score =  102 bits (255), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
            E+Q+++   RKL L++DLD TL+++ + H     +  I            H  L R   M
Sbjct: 183  EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 231

Query: 970  GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
             + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 232  -LHTRLRPHCKEFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 290

Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            PF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 291  PFS------KTGNLRDLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 334


>gi|320164786|gb|EFW41685.1| conserved hypothetical protein [Capsaspora owczarzaki ATCC 30864]
          Length = 877

 Score =  102 bits (255), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 60/171 (35%), Positives = 96/171 (56%), Gaps = 12/171 (7%)

Query: 911  EQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMG 970
            E++ +  ++KL L++DLD TL+++    +V P   + LR   E  +E  +  L   PH+ 
Sbjct: 222  EKESLLQSKKLVLIVDLDQTLIHAVVSSQV-PWIGQFLRDNVELQKEIFNFSLPNHPHL- 279

Query: 971  MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1030
             + KLRPG   FL +A+KLFE+H++TMG+++YA+ +A VLDP G LF  R++SR +    
Sbjct: 280  YYIKLRPGAREFLAQATKLFELHIFTMGSRMYASRVAAVLDPDGALFGSRIMSRDESKSA 339

Query: 1031 -FDGDERVPKSKDLEGVL-GMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
             F       K   L  +     + V ++DD + VW     N+I +  Y YF
Sbjct: 340  NF-------KHTQLSQLFPSGHNMVAVLDDRIDVWARLG-NVIQISPYEYF 382


>gi|194214772|ref|XP_001496059.2| PREDICTED: RNA polymerase II subunit A C-terminal domain phosphatase
            [Equus caballus]
          Length = 868

 Score =  102 bits (255), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
            E+Q+++   RKL L++DLD TL+++ + H     +  I            H  L R   M
Sbjct: 80   EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 128

Query: 970  GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
             + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 129  -LHTRLRPHCKEFLEKTAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 187

Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            PF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 188  PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 231


>gi|344269798|ref|XP_003406734.1| PREDICTED: LOW QUALITY PROTEIN: RNA polymerase II subunit A
            C-terminal domain phosphatase-like [Loxodonta africana]
          Length = 972

 Score =  102 bits (255), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
            E+Q+++   RKL L++DLD TL+++ + H     +  I            H  L R   M
Sbjct: 178  EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 226

Query: 970  GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
             + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 227  -LHTRLRPHCKEFLEKVAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 285

Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            PF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 286  PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 329


>gi|5326898|gb|AAD42088.1| RNA polymerase II CTD phosphatase [Homo sapiens]
          Length = 961

 Score =  102 bits (255), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
            E+Q+++   RKL L++DLD TL+++ + H     +  I            H  L R   M
Sbjct: 172  EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 220

Query: 970  GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
             + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 221  -LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 279

Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            PF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 280  PFS------KTGNLRNLFPCGDSMVCIIDDRKDVWKFAP-NLITVKKYVYF 323


>gi|291414979|ref|XP_002723734.1| PREDICTED: CTD (carboxy-terminal domain, RNA polymerase II,
            polypeptide A) phosphatase, subunit 1-like [Oryctolagus
            cuniculus]
          Length = 940

 Score =  102 bits (255), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 65/179 (36%), Positives = 100/179 (55%), Gaps = 21/179 (11%)

Query: 902  QKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR 961
            Q E+  R E+Q+++   RKL L++DLD TL+++ + H     +  IL           H 
Sbjct: 146  QAEKIAR-EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCPQMSNKGIL-----------HF 193

Query: 962  HLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
             L R   M + T+LRP    FLE+ ++L+E+H++T G++LYA  +A  LDP+  LF+ R+
Sbjct: 194  QLGRGEPM-LHTRLRPHCKDFLEKIARLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRI 252

Query: 1022 ISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            +SR +  DPF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 253  LSRDECIDPFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 304


>gi|67188445|ref|NP_004706.3| RNA polymerase II subunit A C-terminal domain phosphatase isoform 1
            [Homo sapiens]
 gi|327478586|sp|Q9Y5B0.3|CTDP1_HUMAN RecName: Full=RNA polymerase II subunit A C-terminal domain
            phosphatase; AltName: Full=TFIIF-associating CTD
            phosphatase
 gi|119587032|gb|EAW66628.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
            phosphatase, subunit 1, isoform CRA_a [Homo sapiens]
          Length = 961

 Score =  102 bits (254), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
            E+Q+++   RKL L++DLD TL+++ + H     +  I            H  L R   M
Sbjct: 172  EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 220

Query: 970  GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
             + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 221  -LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 279

Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            PF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 280  PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 323


>gi|363730338|ref|XP_418905.3| PREDICTED: RNA polymerase II subunit A C-terminal domain phosphatase
            [Gallus gallus]
          Length = 958

 Score =  102 bits (254), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
            E+Q+++   RKL L++DLD TL+++ + H     +  I            H  L R   M
Sbjct: 137  EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 185

Query: 970  GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
             + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 186  -LHTRLRPHCKEFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 244

Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            PF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 245  PFS------KTGNLRDLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 288


>gi|15226925|ref|NP_178335.1| Haloacid dehalogenase-like hydrolase-like protein [Arabidopsis
            thaliana]
 gi|3894162|gb|AAC78512.1| hypothetical protein [Arabidopsis thaliana]
 gi|330250469|gb|AEC05563.1| Haloacid dehalogenase-like hydrolase-like protein [Arabidopsis
            thaliana]
          Length = 302

 Score =  102 bits (254), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 71/213 (33%), Positives = 108/213 (50%), Gaps = 18/213 (8%)

Query: 906  TRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFR 965
            T+RL  Q      +KL LVLDLDHTL+++ K  ++      I   +E + R+   R    
Sbjct: 72   TKRLISQTSWLEDKKLHLVLDLDHTLVHTIKVSQLSESEKYI--TEEVESRKDLRRFNTG 129

Query: 966  FPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG 1025
            FP   +  KLR  +  FL+  +++F +++YT G   YA  + +++DP  + F  RVI+R 
Sbjct: 130  FPEESL-IKLRSFVHQFLKECNEMFSLYVYTKGGYDYAQLVLEMIDPDKIYFGNRVITRR 188

Query: 1026 DDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQ 1085
            +           P  K L+ VL  E  +V++DD   VWPH+K NL+ + RY YF     Q
Sbjct: 189  ES----------PGFKTLDLVLADERGIVVVDDKSSVWPHDKKNLLQIARYKYFG---DQ 235

Query: 1086 FGLLGPSLLEIDHDERSEDGTLASSLGVRQQLH 1118
              LL     +I  DE  E G L ++L     +H
Sbjct: 236  SCLLSECKKKI--DESDEKGPLNTALRFLMDVH 266


>gi|109122558|ref|XP_001088601.1| PREDICTED: RNA polymerase II subunit A C-terminal domain phosphatase
            isoform 2 [Macaca mulatta]
          Length = 964

 Score =  102 bits (254), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
            E+Q+++   RKL L++DLD TL+++ + H     +  I            H  L R   M
Sbjct: 172  EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 220

Query: 970  GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
             + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 221  -LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 279

Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            PF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 280  PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 323


>gi|297702856|ref|XP_002828379.1| PREDICTED: LOW QUALITY PROTEIN: RNA polymerase II subunit A
            C-terminal domain phosphatase [Pongo abelii]
          Length = 962

 Score =  102 bits (254), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
            E+Q+++   RKL L++DLD TL+++ + H     +  I            H  L R   M
Sbjct: 172  EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 220

Query: 970  GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
             + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 221  -LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 279

Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            PF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 280  PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 323


>gi|3769521|gb|AAC64549.1| serine phosphatase FCP1a [Homo sapiens]
          Length = 842

 Score =  102 bits (254), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
            E+Q+++   RKL L++DLD TL+++ + H     +  I            H  L R   M
Sbjct: 53   EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 101

Query: 970  GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
             + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 102  -LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 160

Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            PF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 161  PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 204


>gi|402903417|ref|XP_003914562.1| PREDICTED: RNA polymerase II subunit A C-terminal domain phosphatase
            isoform 1 [Papio anubis]
          Length = 965

 Score =  102 bits (254), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
            E+Q+++   RKL L++DLD TL+++ + H     +  I            H  L R   M
Sbjct: 172  EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 220

Query: 970  GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
             + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 221  -LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 279

Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            PF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 280  PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 323


>gi|402903419|ref|XP_003914563.1| PREDICTED: RNA polymerase II subunit A C-terminal domain phosphatase
            isoform 2 [Papio anubis]
          Length = 871

 Score =  102 bits (254), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
            E+Q+++   RKL L++DLD TL+++ + H     +  I            H  L R   M
Sbjct: 172  EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 220

Query: 970  GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
             + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 221  -LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 279

Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            PF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 280  PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 323


>gi|332850750|ref|XP_001144243.2| PREDICTED: RNA polymerase II subunit A C-terminal domain phosphatase
            isoform 2 [Pan troglodytes]
          Length = 1026

 Score =  102 bits (254), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
            E+Q+++   RKL L++DLD TL+++ + H     +  I            H  L R   M
Sbjct: 172  EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 220

Query: 970  GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
             + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 221  -LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 279

Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            PF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 280  PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 323


>gi|119587034|gb|EAW66630.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
            phosphatase, subunit 1, isoform CRA_c [Homo sapiens]
          Length = 948

 Score =  102 bits (254), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
            E+Q+++   RKL L++DLD TL+++ + H     +  I            H  L R   M
Sbjct: 172  EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 220

Query: 970  GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
             + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 221  -LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 279

Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            PF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 280  PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 323


>gi|321267522|ref|NP_001189433.1| RNA polymerase II subunit A C-terminal domain phosphatase isoform 3
            [Homo sapiens]
          Length = 842

 Score =  102 bits (254), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
            E+Q+++   RKL L++DLD TL+++ + H     +  I            H  L R   M
Sbjct: 53   EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 101

Query: 970  GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
             + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 102  -LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 160

Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            PF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 161  PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 204


>gi|397467065|ref|XP_003805250.1| PREDICTED: RNA polymerase II subunit A C-terminal domain phosphatase
            [Pan paniscus]
          Length = 842

 Score =  102 bits (254), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
            E+Q+++   RKL L++DLD TL+++ + H     +  I            H  L R   M
Sbjct: 53   EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 101

Query: 970  GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
             + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 102  -LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 160

Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            PF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 161  PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 204


>gi|410294550|gb|JAA25875.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
            phosphatase, subunit 1 [Pan troglodytes]
          Length = 961

 Score =  102 bits (254), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
            E+Q+++   RKL L++DLD TL+++ + H     +  I            H  L R   M
Sbjct: 172  EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 220

Query: 970  GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
             + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 221  -LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 279

Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            PF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 280  PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 323


>gi|426386293|ref|XP_004059621.1| PREDICTED: RNA polymerase II subunit A C-terminal domain phosphatase
            isoform 1 [Gorilla gorilla gorilla]
 gi|426386295|ref|XP_004059622.1| PREDICTED: RNA polymerase II subunit A C-terminal domain phosphatase
            isoform 2 [Gorilla gorilla gorilla]
          Length = 842

 Score =  102 bits (254), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
            E+Q+++   RKL L++DLD TL+++ + H     +  I            H  L R   M
Sbjct: 53   EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 101

Query: 970  GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
             + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 102  -LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 160

Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            PF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 161  PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 204


>gi|402903421|ref|XP_003914564.1| PREDICTED: RNA polymerase II subunit A C-terminal domain phosphatase
            isoform 3 [Papio anubis]
          Length = 846

 Score =  102 bits (254), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
            E+Q+++   RKL L++DLD TL+++ + H     +  I            H  L R   M
Sbjct: 53   EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 101

Query: 970  GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
             + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 102  -LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 160

Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            PF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 161  PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 204


>gi|410215194|gb|JAA04816.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
            phosphatase, subunit 1 [Pan troglodytes]
 gi|410254644|gb|JAA15289.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
            phosphatase, subunit 1 [Pan troglodytes]
 gi|410331971|gb|JAA34932.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
            phosphatase, subunit 1 [Pan troglodytes]
          Length = 961

 Score =  102 bits (254), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
            E+Q+++   RKL L++DLD TL+++ + H     +  I            H  L R   M
Sbjct: 172  EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 220

Query: 970  GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
             + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 221  -LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 279

Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            PF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 280  PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 323


>gi|119587036|gb|EAW66632.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
            phosphatase, subunit 1, isoform CRA_e [Homo sapiens]
          Length = 748

 Score =  102 bits (253), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
            E+Q+++   RKL L++DLD TL+++ + H     +  I            H  L R   M
Sbjct: 53   EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 101

Query: 970  GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
             + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 102  -LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 160

Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            PF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 161  PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 204


>gi|395511850|ref|XP_003760164.1| PREDICTED: RNA polymerase II subunit A C-terminal domain phosphatase
            [Sarcophilus harrisii]
          Length = 1267

 Score =  102 bits (253), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
            E+Q+++   RKL L++DLD TL+++ + H     +  I            H  L R   M
Sbjct: 453  EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 501

Query: 970  GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
             + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 502  -LHTRLRPHCKEFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 560

Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            PF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 561  PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 604


>gi|431907029|gb|ELK11148.1| RNA polymerase II subunit A C-terminal domain phosphatase [Pteropus
            alecto]
          Length = 918

 Score =  102 bits (253), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 62/171 (36%), Positives = 96/171 (56%), Gaps = 20/171 (11%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
            E+Q+++   RKL L++DLD TL+++ + H     +  IL           H  L R   M
Sbjct: 145  EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQRMSNKGIL-----------HFQLGRGEPM 193

Query: 970  GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
             + T+LRP    FLE+ ++L+E+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 194  -LHTRLRPHCREFLEKVARLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 252

Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            PF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 253  PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 296


>gi|39645774|gb|AAH63447.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
            phosphatase, subunit 1 [Homo sapiens]
          Length = 867

 Score =  102 bits (253), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
            E+Q+++   RKL L++DLD TL+++ + H     +  I            H  L R   M
Sbjct: 172  EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 220

Query: 970  GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
             + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 221  -LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 279

Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            PF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 280  PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 323


>gi|67188550|ref|NP_430255.2| RNA polymerase II subunit A C-terminal domain phosphatase isoform 2
            [Homo sapiens]
 gi|119587035|gb|EAW66631.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
            phosphatase, subunit 1, isoform CRA_d [Homo sapiens]
          Length = 867

 Score =  102 bits (253), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
            E+Q+++   RKL L++DLD TL+++ + H     +  I            H  L R   M
Sbjct: 172  EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 220

Query: 970  GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
             + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 221  -LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 279

Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            PF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 280  PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 323


>gi|440638319|gb|ELR08238.1| hypothetical protein GMDG_03040 [Geomyces destructans 20631-21]
          Length = 1765

 Score =  102 bits (253), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 67/198 (33%), Positives = 105/198 (53%), Gaps = 25/198 (12%)

Query: 895  DQQKAAIQKERTRRLEEQ--KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKE 952
            DQ   ++ ++   R EEQ  +++   RKL LV+DLD T++++     ++P   E  R   
Sbjct: 134  DQTLLSVSQDEASRAEEQLQRRLLKNRKLSLVVDLDQTIIHAC----IEPTIGEWQRDPT 189

Query: 953  EQDRE--------KPHRHLFRFPHMGMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLY 1002
              + E        + H    R    G W   K+RPG+  FL   ++ +E+H+YTMG + Y
Sbjct: 190  SPNYEAVKDVKSFQLHDDGPRGLASGCWYYIKMRPGLAHFLTTIAEKYELHVYTMGTRAY 249

Query: 1003 ATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMES-AVVIIDDSVR 1061
            A E+AK++DP+  LF  R+ISR ++G           +K L  +  +++  VVIIDD   
Sbjct: 250  AQEIAKIVDPEHKLFGDRIISRDENGS--------LTAKTLSRLFPVDTKMVVIIDDRAD 301

Query: 1062 VWPHNKLNLIVVERYTYF 1079
            VWP N+ NLI V  Y +F
Sbjct: 302  VWPRNRSNLIKVVPYDFF 319


>gi|355702027|gb|EHH29380.1| RNA polymerase II subunit A C-terminal domain phosphatase, partial
            [Macaca mulatta]
          Length = 861

 Score =  102 bits (253), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
            E+Q+++   RKL L++DLD TL+++ + H     +  I            H  L R   M
Sbjct: 67   EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 115

Query: 970  GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
             + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 116  -LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 174

Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            PF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 175  PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 218


>gi|355755122|gb|EHH58989.1| RNA polymerase II subunit A C-terminal domain phosphatase, partial
            [Macaca fascicularis]
          Length = 861

 Score =  102 bits (253), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
            E+Q+++   RKL L++DLD TL+++ + H     +  I            H  L R   M
Sbjct: 67   EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 115

Query: 970  GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
             + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 116  -LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 174

Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            PF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 175  PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 218


>gi|47217775|emb|CAG05997.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 979

 Score =  101 bits (252), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 65/182 (35%), Positives = 101/182 (55%), Gaps = 21/182 (11%)

Query: 899  AAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREK 958
            +A Q E+  R E+Q+++   +KL L++DLD TL+++ + H     +  I           
Sbjct: 156  SAEQAEQLGR-EDQQRLHRNKKLVLMVDLDQTLIHTTEQHCHRMSNKGIF---------- 204

Query: 959  PHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA 1018
             H  L R   M + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+
Sbjct: 205  -HFQLGRGEPM-LHTRLRPHCKEFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFS 262

Query: 1019 GRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYT 1077
             R++SR +  DPF       K+ +L  +    +S V IIDD   VW     NLI V++Y 
Sbjct: 263  HRILSRDECIDPFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYV 315

Query: 1078 YF 1079
            YF
Sbjct: 316  YF 317


>gi|441603466|ref|XP_004087808.1| PREDICTED: LOW QUALITY PROTEIN: RNA polymerase II subunit A
            C-terminal domain phosphatase [Nomascus leucogenys]
          Length = 1236

 Score =  101 bits (252), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
            E+Q+++   RKL L++DLD TL+++ + H     +  I            H  L R   M
Sbjct: 172  EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 220

Query: 970  GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
             + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 221  -LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 279

Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            PF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 280  PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 323


>gi|449493392|ref|XP_002190004.2| PREDICTED: RNA polymerase II subunit A C-terminal domain phosphatase
            [Taeniopygia guttata]
          Length = 871

 Score =  101 bits (252), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
            E+Q+++   RKL L++DLD TL+++ + H     +  I            H  L R   M
Sbjct: 53   EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 101

Query: 970  GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
             + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 102  -LHTRLRPHCKEFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 160

Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            PF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 161  PF------SKTGNLRDLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 204


>gi|389637610|ref|XP_003716438.1| RNA polymerase II subunit A domain phosphatase [Magnaporthe oryzae
            70-15]
 gi|351642257|gb|EHA50119.1| RNA polymerase II subunit A domain phosphatase [Magnaporthe oryzae
            70-15]
 gi|440471327|gb|ELQ40350.1| RNA polymerase II subunit A C-terminal domain phosphatase
            [Magnaporthe oryzae Y34]
 gi|440487323|gb|ELQ67117.1| RNA polymerase II subunit A C-terminal domain phosphatase
            [Magnaporthe oryzae P131]
          Length = 866

 Score =  101 bits (252), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 63/177 (35%), Positives = 92/177 (51%), Gaps = 16/177 (9%)

Query: 909  LEEQKKMFSARKLCLVLDLDHTLLNSAKFHEV-----DPVHDEILRKKEEQDREKPHRHL 963
            LE QK++ + RKL LV+DLD T++ +A    +     DP +      KE +  E P    
Sbjct: 159  LEMQKRLVAQRKLVLVVDLDQTVIQTACEPTIGEWQKDPSNPNYEALKEVRSFELPSEDG 218

Query: 964  FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 1023
             R  +   + K RPG   FL + S LFEMH+YTM  + YA  + +++DPK  LF  RVIS
Sbjct: 219  PRRNYT-YYVKCRPGTHEFLNKVSNLFEMHVYTMATRAYAEHILRIIDPKKNLFGNRVIS 277

Query: 1024 RGDDGDPFDGDERV-PKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            R ++       +R+ P S  +         V +IDD   VWP N+ N+I V  Y ++
Sbjct: 278  RNENKGIEKTLQRIFPTSTKM---------VAVIDDRTDVWPQNRSNVIKVVPYNFY 325


>gi|340518072|gb|EGR48314.1| predicted protein [Trichoderma reesei QM6a]
          Length = 594

 Score =  101 bits (252), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 63/177 (35%), Positives = 95/177 (53%), Gaps = 19/177 (10%)

Query: 912  QKKMFSARKLCLVLDLDHTLLNSA--------KFHEVDPVHDEILRKKEEQDREKPHRHL 963
            QK++   RKL LV+DLD T++++         +    +P H+ +   K  Q  +   R L
Sbjct: 150  QKRLLRQRKLSLVVDLDQTIIHACIEPTIGEWQRDPTNPNHEAVKDVKSFQLNDDGPRGL 209

Query: 964  FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 1023
                    + KLRPG+  FLE  S  +E+H+YTMG + YA  +A+++DP   LF  RVIS
Sbjct: 210  A--SGCTYYIKLRPGLKEFLEAVSTKYELHVYTMGTRAYALNIARIVDPDKKLFGNRVIS 267

Query: 1024 RGDDGDPFDGDERVPKSKDLEGVLGMES-AVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            R ++G           +K L+ +  + +  VVIIDD   VWP+N+ NLI V  Y +F
Sbjct: 268  RDENGS--------ITAKSLQRLFPVSTDMVVIIDDRADVWPNNRPNLIKVAPYDFF 316


>gi|89269074|emb|CAJ81904.1| ctd (carboxy terminal domain rna polymerase 2 polypeptide a)
            phosphatase subunit 1 [Xenopus (Silurana) tropicalis]
          Length = 567

 Score =  101 bits (251), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 62/171 (36%), Positives = 93/171 (54%), Gaps = 20/171 (11%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
            E+Q ++   RKL L++DLD TL+++ + H        I            H  L R   M
Sbjct: 167  EDQLRLHRNRKLVLMVDLDQTLIHTTEQHCQHMSRKGIF-----------HFQLGRGEPM 215

Query: 970  GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
             + T+LRP    FLE+ +KLFE+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 216  -LHTRLRPHCKEFLEKIAKLFELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 274

Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            P+       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 275  PYS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 318


>gi|403268140|ref|XP_003926140.1| PREDICTED: RNA polymerase II subunit A C-terminal domain phosphatase
            [Saimiri boliviensis boliviensis]
          Length = 937

 Score =  101 bits (251), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
            E+Q+++   RKL L++DLD TL+++ + H     +  I            H  L R   M
Sbjct: 147  EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 195

Query: 970  GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
             + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 196  -LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 254

Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            PF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 255  PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 298


>gi|119491655|ref|XP_001263322.1| RNA Polymerase II CTD phosphatase Fcp1, putative [Neosartorya
            fischeri NRRL 181]
 gi|119411482|gb|EAW21425.1| RNA Polymerase II CTD phosphatase Fcp1, putative [Neosartorya
            fischeri NRRL 181]
          Length = 824

 Score =  101 bits (251), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 69/197 (35%), Positives = 109/197 (55%), Gaps = 23/197 (11%)

Query: 893  YDDQQKAAIQKERTRRLEEQKK-MFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKK 951
            +D+      +KE TR  E+ K+ + + RKL LV+DLD T++++     VDP   E +  K
Sbjct: 131  HDNTALTVSEKEATRVEEDAKRRLLANRKLSLVVDLDQTIIHAT----VDPTVGEWMEDK 186

Query: 952  EEQDRE-----KPHRHLFRFPHM-GMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYA 1003
            +  + E     +  + +   P M G W   KLRPG+ +FL+  S+LFE+H+YTMG + YA
Sbjct: 187  DNPNHEALSDVRAFQLVDEGPGMRGCWYYVKLRPGLESFLQNVSELFELHIYTMGTRAYA 246

Query: 1004 TEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRV 1062
              +A ++DP   LF  R++SR + G           +K+L+ +  +++  VVIIDD   V
Sbjct: 247  QHIAGIIDPDRKLFGDRILSRDESGS--------LTAKNLQRLFPVDTKMVVIIDDRGDV 298

Query: 1063 WPHNKLNLIVVERYTYF 1079
            W  +  NLI V  Y +F
Sbjct: 299  WRWSP-NLIKVSPYDFF 314


>gi|297741470|emb|CBI32601.3| unnamed protein product [Vitis vinifera]
          Length = 147

 Score =  101 bits (251), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 56/122 (45%), Positives = 72/122 (59%), Gaps = 8/122 (6%)

Query: 997  MGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVII 1056
            MG + YA EM KVLDP+ V F+  VIS+ D              K L+ VLG +SAV+I+
Sbjct: 1    MGEQFYALEMVKVLDPRTVYFSSSVISQADSTQR--------HQKGLDVVLGPKSAVLIL 52

Query: 1057 DDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVRQQ 1116
            DD+ R W ++K NLI++ERY +F  S  QFG    SL E+  DE   DG LA+ L V QQ
Sbjct: 53   DDTERAWKNHKDNLILMERYHFFASSCHQFGFHCKSLSELKSDESEPDGALATILKVLQQ 112

Query: 1117 LH 1118
             H
Sbjct: 113  TH 114


>gi|62858037|ref|NP_001017022.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
            phosphatase, subunit 1 [Xenopus (Silurana) tropicalis]
          Length = 570

 Score =  101 bits (251), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 62/171 (36%), Positives = 93/171 (54%), Gaps = 20/171 (11%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
            E+Q ++   RKL L++DLD TL+++ + H        I            H  L R   M
Sbjct: 170  EDQLRLHRNRKLVLMVDLDQTLIHTTEQHCQHMSRKGIF-----------HFQLGRGEPM 218

Query: 970  GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
             + T+LRP    FLE+ +KLFE+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 219  -LHTRLRPHCKEFLEKIAKLFELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 277

Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            P+       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 278  PYS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 321


>gi|444518074|gb|ELV11938.1| RNA polymerase II subunit A C-terminal domain phosphatase [Tupaia
            chinensis]
          Length = 876

 Score =  101 bits (251), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
            E+Q+++   RKL L++DLD TL+++ + H     +  I            H  L R   M
Sbjct: 17   EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCAQMSNRGIF-----------HFQLGRGEPM 65

Query: 970  GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
             + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 66   -LHTRLRPHCKDFLEKVAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 124

Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            PF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 125  PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 168


>gi|348512639|ref|XP_003443850.1| PREDICTED: RNA polymerase II subunit A C-terminal domain
            phosphatase-like [Oreochromis niloticus]
          Length = 998

 Score =  100 bits (250), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 61/171 (35%), Positives = 95/171 (55%), Gaps = 20/171 (11%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
            E+Q+++   +KL L++DLD TL+++ + H     +  I            H  L R   M
Sbjct: 166  EDQQRLHRNKKLVLMVDLDQTLIHTTEQHCQRMSNKGIF-----------HFQLGRGEPM 214

Query: 970  GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
             + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 215  -LHTRLRPHCKEFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 273

Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            PF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 274  PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYIYF 317


>gi|358418617|ref|XP_003583993.1| PREDICTED: RNA polymerase II subunit A C-terminal domain
            phosphatase-like [Bos taurus]
          Length = 864

 Score =  100 bits (250), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 64/179 (35%), Positives = 99/179 (55%), Gaps = 21/179 (11%)

Query: 902  QKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR 961
            Q E+  R E+Q+++   RKL L++DLD TL+++ + H     +  I            H 
Sbjct: 162  QAEKLGR-EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HF 209

Query: 962  HLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
             L R   M + T+LRP    FLE+ ++L+E+H++T G++LYA  +A  LDP+  LF+ R+
Sbjct: 210  QLGRGEPM-LHTRLRPHCKEFLEKVARLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRI 268

Query: 1022 ISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            +SR +  DPF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 269  LSRDECIDPFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 320


>gi|345324709|ref|XP_001509122.2| PREDICTED: RNA polymerase II subunit A C-terminal domain phosphatase
            [Ornithorhynchus anatinus]
          Length = 1168

 Score =  100 bits (250), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 64/182 (35%), Positives = 101/182 (55%), Gaps = 22/182 (12%)

Query: 901  IQKERTRRL--EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREK 958
            +  E+ ++L  E+Q+++   RKL L++DLD TL+++ + H     +  I           
Sbjct: 167  VSSEQAKQLGREDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF---------- 216

Query: 959  PHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA 1018
             H  L R   M + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+
Sbjct: 217  -HFQLGRGEPM-LHTRLRPHCKEFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFS 274

Query: 1019 GRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYT 1077
             R++SR +  DPF       K+ +L  +    +S V IIDD   VW     NLI V++Y 
Sbjct: 275  HRILSRDECIDPFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYV 327

Query: 1078 YF 1079
            YF
Sbjct: 328  YF 329


>gi|73945347|ref|XP_533365.2| PREDICTED: RNA polymerase II subunit A C-terminal domain phosphatase
            isoform 1 [Canis lupus familiaris]
          Length = 933

 Score =  100 bits (249), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 60/171 (35%), Positives = 95/171 (55%), Gaps = 20/171 (11%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
            E+Q+++   RKL L++DLD TL+++ + H     +  I            H  L R   M
Sbjct: 169  EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 217

Query: 970  GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
             + T++RP    FLE+ ++L+E+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 218  -LHTRVRPHCREFLEKIARLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 276

Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            PF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 277  PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 320


>gi|432884093|ref|XP_004074439.1| PREDICTED: RNA polymerase II subunit A C-terminal domain
            phosphatase-like [Oryzias latipes]
          Length = 1129

 Score =  100 bits (249), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
            E+Q+++   RKL L++DLD TL+++ + H     +  I            H  L R   M
Sbjct: 162  EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 210

Query: 970  GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
             + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 211  -LHTRLRPHCKEFLEKTAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 269

Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            PF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 270  PFS------KTGNLRYLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 313


>gi|359079164|ref|XP_003587804.1| PREDICTED: RNA polymerase II subunit A C-terminal domain
            phosphatase-like [Bos taurus]
          Length = 994

 Score =  100 bits (249), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 64/179 (35%), Positives = 99/179 (55%), Gaps = 21/179 (11%)

Query: 902  QKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR 961
            Q E+  R E+Q+++   RKL L++DLD TL+++ + H     +  I            H 
Sbjct: 162  QAEKLGR-EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HF 209

Query: 962  HLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
             L R   M + T+LRP    FLE+ ++L+E+H++T G++LYA  +A  LDP+  LF+ R+
Sbjct: 210  QLGRGEPM-LHTRLRPHCKEFLEKVARLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRI 268

Query: 1022 ISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            +SR +  DPF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 269  LSRDECIDPFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 320


>gi|350634686|gb|EHA23048.1| hypothetical protein ASPNIDRAFT_197473 [Aspergillus niger ATCC 1015]
          Length = 824

 Score =  100 bits (248), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 68/201 (33%), Positives = 108/201 (53%), Gaps = 34/201 (16%)

Query: 895  DQQKAAIQKERTRRLEE--QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKE 952
            D     + ++   R+EE  ++++ + RKL LV+DLD T++++     VDP   E +    
Sbjct: 132  DNTTLTVSEQEATRVEEDAKRRLLANRKLSLVVDLDQTIIHAT----VDPTVGEWM---- 183

Query: 953  EQDREKPHRHL------FRF----PHM-GMW--TKLRPGIWTFLERASKLFEMHLYTMGN 999
             QD+E P+         F+     P M G W   KLRPG+ +FL+  S+++E+H+YTMG 
Sbjct: 184  -QDKENPNYQALSDVRAFQLVDDGPGMRGCWYYVKLRPGLESFLQNVSEMYELHIYTMGT 242

Query: 1000 KLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDD 1058
            + YA  +A ++DP   LF  R++SR + G           +K+L  +  +++  VVIIDD
Sbjct: 243  RSYAQHIASIIDPDRKLFGDRILSRDESGSLV--------AKNLHRLFPVDTKMVVIIDD 294

Query: 1059 SVRVWPHNKLNLIVVERYTYF 1079
               VW  N  NLI V  Y +F
Sbjct: 295  RGDVWRWNP-NLIKVSPYDFF 314


>gi|30962890|gb|AAH52576.1| CTDP1 protein, partial [Homo sapiens]
          Length = 874

 Score =  100 bits (248), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 61/171 (35%), Positives = 95/171 (55%), Gaps = 20/171 (11%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
            E+Q+++   RKL L++DLD TL+++ + H     +  I            H  L R   M
Sbjct: 85   EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 133

Query: 970  GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
             + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 134  -LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 192

Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            P      + K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 193  P------ISKTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 236


>gi|358372260|dbj|GAA88864.1| RNA Polymerase II CTD phosphatase Fcp1 [Aspergillus kawachii IFO
            4308]
          Length = 825

 Score =  100 bits (248), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 68/201 (33%), Positives = 108/201 (53%), Gaps = 34/201 (16%)

Query: 895  DQQKAAIQKERTRRLEE--QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKE 952
            D     + ++   R+EE  ++++ + RKL LV+DLD T++++     VDP   E +    
Sbjct: 132  DNTTLTVSEQEATRVEEDAKRRLLANRKLSLVVDLDQTIIHAT----VDPTVGEWM---- 183

Query: 953  EQDREKPHRHL------FRF----PHM-GMW--TKLRPGIWTFLERASKLFEMHLYTMGN 999
             QD+E P+         F+     P M G W   KLRPG+ +FL+  S+++E+H+YTMG 
Sbjct: 184  -QDKENPNYQALSDVRAFQLVDDGPGMRGCWYYVKLRPGLESFLQNVSEMYELHIYTMGT 242

Query: 1000 KLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDD 1058
            + YA  +A ++DP   LF  R++SR + G           +K+L  +  +++  VVIIDD
Sbjct: 243  RSYAQHIASIIDPDRKLFGDRILSRDESGSLV--------AKNLHRLFPVDTKMVVIIDD 294

Query: 1059 SVRVWPHNKLNLIVVERYTYF 1079
               VW  N  NLI V  Y +F
Sbjct: 295  RGDVWRWNP-NLIKVSPYDFF 314


>gi|432105445|gb|ELK31660.1| RNA polymerase II subunit A C-terminal domain phosphatase [Myotis
            davidii]
          Length = 823

 Score =  100 bits (248), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 59/176 (33%), Positives = 98/176 (55%), Gaps = 30/176 (17%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
            E+Q+++   RKL L++DLD TL+++                 E+Q ++  ++ +  F  +
Sbjct: 53   EDQQRLHRNRKLVLMVDLDQTLIHTT----------------EQQCQQMSNKGILHF-QL 95

Query: 970  G-----MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISR 1024
            G     + T+LRP    FLE+ ++L+E+H++T G++LYA  +A  LDP+  LF+ R++SR
Sbjct: 96   GRGEPMLHTRLRPHCREFLEKVARLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSR 155

Query: 1025 GDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
             +  DPF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 156  DECIDPFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 204


>gi|327270066|ref|XP_003219812.1| PREDICTED: RNA polymerase II subunit A C-terminal domain
            phosphatase-like [Anolis carolinensis]
          Length = 965

 Score =  100 bits (248), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 61/171 (35%), Positives = 96/171 (56%), Gaps = 20/171 (11%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
            E+Q+++   RKL L++DLD TL+++ + H            ++  +R   H  L R   M
Sbjct: 155  EDQERLRRNRKLVLMVDLDQTLIHTTEQH-----------CQQMSNRGIFHYQLGRGEPM 203

Query: 970  GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
             + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LD +  LF+ R++SR +  D
Sbjct: 204  -LHTRLRPHCKEFLEKIAKLYELHVFTFGSRLYAHTIAAFLDSEKKLFSHRILSRDECID 262

Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            PF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 263  PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 306


>gi|426253911|ref|XP_004020634.1| PREDICTED: LOW QUALITY PROTEIN: RNA polymerase II subunit A
            C-terminal domain phosphatase, partial [Ovis aries]
          Length = 820

 Score = 99.8 bits (247), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 64/179 (35%), Positives = 99/179 (55%), Gaps = 21/179 (11%)

Query: 902  QKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR 961
            Q E+  R E+Q+++   RKL L++DLD TL+++ + H     +  I            H 
Sbjct: 74   QAEKLGR-EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HF 121

Query: 962  HLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
             L R   M + T+LRP    FLE+ ++L+E+H++T G++LYA  +A  LDP+  LF+ R+
Sbjct: 122  QLGRGEPM-LHTRLRPHCKEFLEKVARLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRI 180

Query: 1022 ISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            +SR +  DPF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 181  LSRDECIDPFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 232


>gi|345568228|gb|EGX51125.1| hypothetical protein AOL_s00054g501 [Arthrobotrys oligospora ATCC
            24927]
          Length = 854

 Score = 99.8 bits (247), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 72/204 (35%), Positives = 103/204 (50%), Gaps = 36/204 (17%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFR---- 965
            E +K++ SA+KL LV+DLD T++ +     VDP   E        D   P+ H  +    
Sbjct: 149  EAKKRLLSAKKLSLVVDLDQTIIQAT----VDPTVGEW-----RDDPSNPNYHAVKDVEA 199

Query: 966  FPHM-------GMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 1016
            F  +       G W   KLRPG+  FL   SK++E H+YTMG + YA  +AK++DP+G +
Sbjct: 200  FQLLDEGAGGRGCWYYVKLRPGLKRFLSNISKIYECHIYTMGTRAYAMSIAKIVDPEGSI 259

Query: 1017 FAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMES-AVVIIDDSVRVWPHNKLNLIVVER 1075
            F  R++SR + G           SK LE +  +++  VVIIDD   VW  +  NLI V  
Sbjct: 260  FGERILSRDESGS--------LTSKSLERLFPVDTKMVVIIDDRGDVWKWSD-NLIKVTP 310

Query: 1076 YTYFPCSRRQFGLLGPSLLEIDHD 1099
            Y +F       G +  S L   HD
Sbjct: 311  YDFFVG----IGDINSSFLPKRHD 330


>gi|159483481|ref|XP_001699789.1| hypothetical protein CHLREDRAFT_141879 [Chlamydomonas reinhardtii]
 gi|158281731|gb|EDP07485.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 375

 Score = 99.8 bits (247), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 58/152 (38%), Positives = 87/152 (57%), Gaps = 14/152 (9%)

Query: 926  DLDHTLLNSAKFHEVD----PVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWT 981
            DLDHTLLNS   +EV     P   E+ R+++E +   P R L       +WTKLRPG++ 
Sbjct: 133  DLDHTLLNSVHMNEVGEDVAPRLAELQRREQEANL-GPRRLLHCLADKKLWTKLRPGVFE 191

Query: 982  FLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSK 1041
            FLE     +EMH+YTMG+K YA E+ ++LDP G LF+  VI++               +K
Sbjct: 192  FLEGLRDAYEMHIYTMGDKTYAAEVRRLLDPTGRLFSS-VIAK--------DHSTTATAK 242

Query: 1042 DLEGVLGMESAVVIIDDSVRVWPHNKLNLIVV 1073
             L+ +L  +   +++DD+  VWP ++ NL+ V
Sbjct: 243  HLDVLLSADELALVLDDTEVVWPGHRRNLLQV 274


>gi|147770504|emb|CAN75676.1| hypothetical protein VITISV_003260 [Vitis vinifera]
          Length = 205

 Score = 99.8 bits (247), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 55/122 (45%), Positives = 71/122 (58%), Gaps = 8/122 (6%)

Query: 997  MGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVII 1056
            MG + YA EM KVLDP+ V F+  VIS+ D              K L+ VLG +S V+I+
Sbjct: 1    MGEQFYALEMVKVLDPRTVYFSSSVISQADSTQR--------HQKGLDVVLGPKSXVLIL 52

Query: 1057 DDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVRQQ 1116
            DD+ R W ++K NLI++ERY +F  S  QFG    SL E+  DE   DG LA+ L V QQ
Sbjct: 53   DDTERAWKNHKDNLILMERYHFFASSCHQFGFHCKSLSELKSDESEPDGALATILKVLQQ 112

Query: 1117 LH 1118
             H
Sbjct: 113  TH 114


>gi|355681363|gb|AER96784.1| CTD phosphatase, subunit 1 [Mustela putorius furo]
          Length = 819

 Score = 99.8 bits (247), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 60/171 (35%), Positives = 95/171 (55%), Gaps = 20/171 (11%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
            E+Q+++   RKL L++DLD TL+++ + H     +  I            H  L R   M
Sbjct: 53   EDQERLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 101

Query: 970  GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
             + T++RP    FLE+ ++L+E+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 102  -LHTRVRPHCREFLEKIARLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 160

Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            PF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 161  PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 204


>gi|291001899|ref|XP_002683516.1| TFIIF CTD phosphatase Fcp1 [Naegleria gruberi]
 gi|284097145|gb|EFC50772.1| TFIIF CTD phosphatase Fcp1 [Naegleria gruberi]
          Length = 592

 Score = 99.4 bits (246), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 64/193 (33%), Positives = 103/193 (53%), Gaps = 15/193 (7%)

Query: 900  AIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDRE-K 958
            A +K   R    Q+++   +KL LVLDLDHTLL++    E    H ++    +  +   +
Sbjct: 170  AYEKGLERGKANQQRLIEKKKLSLVLDLDHTLLHTINDFEYRREHHKVTYFNDIYNNSPE 229

Query: 959  PHRHLFRFPHMGMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 1016
              +H+ +F   G +   K RP + +FL+R S++FE+H++T G + YA ++ K+LD    L
Sbjct: 230  LQKHIHKFFMRGSYHFVKFRPRLESFLKRCSEIFELHVFTHGERAYADQIGKMLDSSKSL 289

Query: 1017 FAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVER 1075
            FA R++SR +  D          +K L  V    + +V++IDD   VW  N  N+I +  
Sbjct: 290  FADRILSRDECPD--------INTKTLSQVFPYSDKSVLVIDDKTDVWKDNVDNVIQIAP 341

Query: 1076 YTYFPCSRRQFGL 1088
            Y YF   RR FG+
Sbjct: 342  YDYF---RRIFGV 351


>gi|83767703|dbj|BAE57842.1| unnamed protein product [Aspergillus oryzae RIB40]
          Length = 820

 Score = 99.4 bits (246), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 70/201 (34%), Positives = 110/201 (54%), Gaps = 34/201 (16%)

Query: 895  DQQKAAIQKERTRRLEE--QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKE 952
            D     + ++   R+EE  ++++ S RKL LV+DLD T++++     VDP   E +    
Sbjct: 132  DNTALTVSEKEAARVEEDAKRRLLSNRKLSLVVDLDQTIIHAT----VDPTVGEWM---- 183

Query: 953  EQDREKP-HRHL-----FRF----PHM-GMW--TKLRPGIWTFLERASKLFEMHLYTMGN 999
             +D++ P H+ L     F+     P M G W   KLRPG+ +FL+  S+LFE+H+YTMG 
Sbjct: 184  -EDKDNPNHQALSDVRAFQLVDDGPGMRGCWYYVKLRPGLESFLQNVSELFELHIYTMGT 242

Query: 1000 KLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDD 1058
            + YA  +A ++DP   LF  R++SR + G           +K+L  +  +++  VVIIDD
Sbjct: 243  RAYAQHIASIIDPDRKLFGDRILSRDESGS--------LTAKNLHRLFPVDTKMVVIIDD 294

Query: 1059 SVRVWPHNKLNLIVVERYTYF 1079
               VW  +  NLI V  Y +F
Sbjct: 295  RGDVWRWSP-NLIKVSPYDFF 314


>gi|391867600|gb|EIT76846.1| TFIIF-interacting CTD phosphatase [Aspergillus oryzae 3.042]
          Length = 820

 Score = 99.4 bits (246), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 70/201 (34%), Positives = 110/201 (54%), Gaps = 34/201 (16%)

Query: 895  DQQKAAIQKERTRRLEE--QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKE 952
            D     + ++   R+EE  ++++ S RKL LV+DLD T++++     VDP   E +    
Sbjct: 132  DNTALTVSEKEAARVEEDAKRRLLSNRKLSLVVDLDQTIIHAT----VDPTVGEWM---- 183

Query: 953  EQDREKP-HRHL-----FRF----PHM-GMW--TKLRPGIWTFLERASKLFEMHLYTMGN 999
             +D++ P H+ L     F+     P M G W   KLRPG+ +FL+  S+LFE+H+YTMG 
Sbjct: 184  -EDKDNPNHQALSDVRAFQLVDDGPGMRGCWYYVKLRPGLESFLQNVSELFELHIYTMGT 242

Query: 1000 KLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDD 1058
            + YA  +A ++DP   LF  R++SR + G           +K+L  +  +++  VVIIDD
Sbjct: 243  RAYAQHIASIIDPDRKLFGDRILSRDESGS--------LTAKNLHRLFPVDTKMVVIIDD 294

Query: 1059 SVRVWPHNKLNLIVVERYTYF 1079
               VW  +  NLI V  Y +F
Sbjct: 295  RGDVWRWSP-NLIKVSPYDFF 314


>gi|238486788|ref|XP_002374632.1| RNA Polymerase II CTD phosphatase Fcp1, putative [Aspergillus flavus
            NRRL3357]
 gi|220699511|gb|EED55850.1| RNA Polymerase II CTD phosphatase Fcp1, putative [Aspergillus flavus
            NRRL3357]
          Length = 698

 Score = 99.0 bits (245), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 70/201 (34%), Positives = 110/201 (54%), Gaps = 34/201 (16%)

Query: 895  DQQKAAIQKERTRRLEE--QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKE 952
            D     + ++   R+EE  ++++ S RKL LV+DLD T++++     VDP   E +    
Sbjct: 10   DNTALTVSEKEAARVEEDAKRRLLSNRKLSLVVDLDQTIIHAT----VDPTVGEWM---- 61

Query: 953  EQDREKP-HRHL-----FRF----PHM-GMW--TKLRPGIWTFLERASKLFEMHLYTMGN 999
             +D++ P H+ L     F+     P M G W   KLRPG+ +FL+  S+LFE+H+YTMG 
Sbjct: 62   -EDKDNPNHQALSDVRAFQLVDDGPGMRGCWYYVKLRPGLESFLQNVSELFELHIYTMGT 120

Query: 1000 KLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDD 1058
            + YA  +A ++DP   LF  R++SR + G           +K+L  +  +++  VVIIDD
Sbjct: 121  RAYAQHIASIIDPDRKLFGDRILSRDESGS--------LTAKNLHRLFPVDTKMVVIIDD 172

Query: 1059 SVRVWPHNKLNLIVVERYTYF 1079
               VW  +  NLI V  Y +F
Sbjct: 173  RGDVWRWSP-NLIKVSPYDFF 192


>gi|156050785|ref|XP_001591354.1| hypothetical protein SS1G_07980 [Sclerotinia sclerotiorum 1980]
 gi|154692380|gb|EDN92118.1| hypothetical protein SS1G_07980 [Sclerotinia sclerotiorum 1980 UF-70]
          Length = 806

 Score = 99.0 bits (245), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 66/198 (33%), Positives = 102/198 (51%), Gaps = 25/198 (12%)

Query: 895  DQQKAAIQKERTRRLEE--QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKE 952
            DQ    +  +   + EE  Q+++   RKL LV+DLD T++++     ++P   E  R   
Sbjct: 134  DQTHLTVSHDEASKAEEELQRRLLKNRKLSLVVDLDQTIIHAC----IEPTVGEWQRDVN 189

Query: 953  EQDRE--KPHRHLF------RFPHMGMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLY 1002
              + E  K  R         R    G W   K+RPG+  FL + S+++E+H+YTMG + Y
Sbjct: 190  SPNYEAVKDVRSFQLNDDGPRGLASGCWYYIKMRPGLAEFLTKISEMYELHVYTMGTRAY 249

Query: 1003 ATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVV-IIDDSVR 1061
            A  +AK++DP   LF  R+ISR ++G+          +K L  +    + +V IIDD   
Sbjct: 250  ALSIAKIVDPGKKLFGDRIISRDENGN--------VTAKSLARLFPQSTHMVAIIDDRAD 301

Query: 1062 VWPHNKLNLIVVERYTYF 1079
            VWP N+ NLI V  Y +F
Sbjct: 302  VWPMNRPNLIKVVPYDFF 319


>gi|242781762|ref|XP_002479866.1| RNA Polymerase II CTD phosphatase Fcp1, putative [Talaromyces
            stipitatus ATCC 10500]
 gi|218720013|gb|EED19432.1| RNA Polymerase II CTD phosphatase Fcp1, putative [Talaromyces
            stipitatus ATCC 10500]
          Length = 822

 Score = 98.6 bits (244), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 69/197 (35%), Positives = 107/197 (54%), Gaps = 23/197 (11%)

Query: 893  YDDQQKAAIQKERTRRLEEQKK-MFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKK 951
            +D+      Q+E TR  E+ K+ + ++++L LV+DLD T++++     VDP   E    K
Sbjct: 131  HDNTALTVSQREATRVEEDAKRRLLASKRLSLVVDLDQTIIHAT----VDPTVGEWKEDK 186

Query: 952  EEQDREK-PHRHLFRF----PHM-GMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYA 1003
               + E       F+     P M G W   KLRPG+ +FL+  SKL+E+H+YTMG + YA
Sbjct: 187  NNPNHEAVKDVRAFQLTDDGPGMRGCWYYIKLRPGLESFLQNISKLYELHIYTMGTRAYA 246

Query: 1004 TEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRV 1062
              +A ++DP   LF  R++SR + G           +K+L+ +  +++  VVIIDD   V
Sbjct: 247  QNIANIIDPDRKLFGDRILSRDESGS--------LTAKNLQRLFPVDTKMVVIIDDRGDV 298

Query: 1063 WPHNKLNLIVVERYTYF 1079
            W  N  NLI V  Y +F
Sbjct: 299  WKWNP-NLIKVSPYDFF 314


>gi|159127495|gb|EDP52610.1| RNA Polymerase II CTD phosphatase Fcp1, putative [Aspergillus
            fumigatus A1163]
          Length = 827

 Score = 98.6 bits (244), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 68/197 (34%), Positives = 109/197 (55%), Gaps = 23/197 (11%)

Query: 893  YDDQQKAAIQKERTRRLEEQKK-MFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKK 951
            +D+      +KE TR  E+ K+ + + RKL LV+DLD T++++     VDP   E +  K
Sbjct: 131  HDNTALTVSEKEATRVEEDAKRRLLANRKLSLVVDLDQTIIHAT----VDPTVGEWMEDK 186

Query: 952  EEQDRE-----KPHRHLFRFPHM-GMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYA 1003
            +  + +     +  + +   P M G W   KLRPG+ +FL+  S+LFE+H+YTMG + YA
Sbjct: 187  DNPNHDALSDVRAFQLVDDGPGMRGCWYYVKLRPGLESFLQNVSELFELHIYTMGTRAYA 246

Query: 1004 TEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMES-AVVIIDDSVRV 1062
              +A ++DP   LF  R++SR + G           +K+L+ +  +++  VVIIDD   V
Sbjct: 247  QHIAGIIDPDRKLFGDRILSRDESGS--------LTAKNLQRLFPVDTKMVVIIDDRGDV 298

Query: 1063 WPHNKLNLIVVERYTYF 1079
            W  +  NLI V  Y +F
Sbjct: 299  WRWSP-NLIKVSPYDFF 314


>gi|70999518|ref|XP_754478.1| RNA Polymerase II CTD phosphatase Fcp1 [Aspergillus fumigatus Af293]
 gi|66852115|gb|EAL92440.1| RNA Polymerase II CTD phosphatase Fcp1, putative [Aspergillus
            fumigatus Af293]
          Length = 827

 Score = 98.6 bits (244), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 68/197 (34%), Positives = 109/197 (55%), Gaps = 23/197 (11%)

Query: 893  YDDQQKAAIQKERTRRLEEQKK-MFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKK 951
            +D+      +KE TR  E+ K+ + + RKL LV+DLD T++++     VDP   E +  K
Sbjct: 131  HDNTALTVSEKEATRVEEDAKRRLLANRKLSLVVDLDQTIIHAT----VDPTVGEWMEDK 186

Query: 952  EEQDRE-----KPHRHLFRFPHM-GMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYA 1003
            +  + +     +  + +   P M G W   KLRPG+ +FL+  S+LFE+H+YTMG + YA
Sbjct: 187  DNPNHDALGDVRAFQLVDDGPGMRGCWYYVKLRPGLESFLQNVSELFELHIYTMGTRAYA 246

Query: 1004 TEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMES-AVVIIDDSVRV 1062
              +A ++DP   LF  R++SR + G           +K+L+ +  +++  VVIIDD   V
Sbjct: 247  QHIAGIIDPDRKLFGDRILSRDESGS--------LTAKNLQRLFPVDTKMVVIIDDRGDV 298

Query: 1063 WPHNKLNLIVVERYTYF 1079
            W  +  NLI V  Y +F
Sbjct: 299  WRWSP-NLIKVSPYDFF 314


>gi|410977919|ref|XP_003995346.1| PREDICTED: LOW QUALITY PROTEIN: RNA polymerase II subunit A
            C-terminal domain phosphatase [Felis catus]
          Length = 960

 Score = 98.6 bits (244), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 59/171 (34%), Positives = 95/171 (55%), Gaps = 20/171 (11%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
            ++Q+++   RKL L++DLD TL+++ + H     +  I            H  L R   M
Sbjct: 191  QDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 239

Query: 970  GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
             + T++RP    FLE+ ++L+E+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 240  -LHTRVRPHCREFLEKIARLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 298

Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            PF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 299  PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 342


>gi|347836062|emb|CCD50634.1| similar to FCP1-like phosphatase [Botryotinia fuckeliana]
          Length = 832

 Score = 98.2 bits (243), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 66/198 (33%), Positives = 102/198 (51%), Gaps = 25/198 (12%)

Query: 895  DQQKAAIQKERTRRLEE--QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKE 952
            DQ    +  +   + EE  Q+++   RKL LV+DLD T++++     ++P   E  R   
Sbjct: 134  DQTHLTVSLDEASKAEEELQRRLLKNRKLSLVVDLDQTIIHAC----IEPTVGEWQRDVN 189

Query: 953  EQDRE--KPHRHLF------RFPHMGMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLY 1002
              + E  K  R         R    G W   K+RPG+  FL + S+++E+H+YTMG + Y
Sbjct: 190  SPNYEAVKDVRSFQLNDDGPRGLASGCWYYIKMRPGLAEFLAKVSEMYELHVYTMGTRAY 249

Query: 1003 ATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVV-IIDDSVR 1061
            A  +AK++DP   LF  R+ISR ++G+          +K L  +    + +V IIDD   
Sbjct: 250  ALNIAKIVDPGKKLFGDRIISRDENGN--------VTAKSLARLFPQSTHMVAIIDDRAD 301

Query: 1062 VWPHNKLNLIVVERYTYF 1079
            VWP N+ NLI V  Y +F
Sbjct: 302  VWPMNRPNLIKVVPYDFF 319


>gi|15237769|ref|NP_197738.1| haloacid dehalogenase-like hydrolase domain-containing protein
            [Arabidopsis thaliana]
 gi|9759085|dbj|BAB09563.1| unnamed protein product [Arabidopsis thaliana]
 gi|332005790|gb|AED93173.1| haloacid dehalogenase-like hydrolase domain-containing protein
            [Arabidopsis thaliana]
          Length = 302

 Score = 97.8 bits (242), Expect = 3e-17,   Method: Composition-based stats.
 Identities = 69/213 (32%), Positives = 105/213 (49%), Gaps = 18/213 (8%)

Query: 906  TRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFR 965
            T+ L  Q      +KL LVLDLD TL+++ K   +      I+  +E + R+   R    
Sbjct: 72   TKGLISQTSWLEDKKLHLVLDLDQTLIHTIKTSLLYESEKYII--EEVESRKDIKRFNTG 129

Query: 966  FPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG 1025
            FP   +  KLRP +  FL+  +++F M++YT G   YA  + +++DP    F  RVI+R 
Sbjct: 130  FPEESL-IKLRPFVHQFLKECNEMFSMYVYTKGGYDYARLVLEMIDPDKFYFGNRVITRR 188

Query: 1026 DDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQ 1085
            +           P  K L+ VL  E  +VI+DD+  VWPH+K NL+ + RY YF      
Sbjct: 189  ES----------PGFKTLDLVLADERGIVIVDDTSSVWPHDKKNLLQIARYKYFGDKSCL 238

Query: 1086 FGLLGPSLLEIDHDERSEDGTLASSLGVRQQLH 1118
            F      +     DE  E G L ++L   + +H
Sbjct: 239  FSEDKKKI-----DESDEKGPLNTALRFLKDVH 266


>gi|19115680|ref|NP_594768.1| CTD phosphatase Fcp1 [Schizosaccharomyces pombe 972h-]
 gi|26393804|sp|Q9P376.1|FCP1_SCHPO RecName: Full=RNA polymerase II subunit A C-terminal domain
            phosphatase; AltName: Full=CTD phosphatase fcp1
 gi|9588462|emb|CAC00553.1| CTD phosphatase Fcp1 [Schizosaccharomyces pombe]
          Length = 723

 Score = 97.8 bits (242), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 72/216 (33%), Positives = 108/216 (50%), Gaps = 44/216 (20%)

Query: 890  FEGYDDQQKAAIQK-----------ERTRRLEEQ--KKMFSARKLCLVLDLDHTLLNSAK 936
            + GY D  +A I             E   RLE +  K++   ++L L++DLD T++++  
Sbjct: 121  YMGYSDMARANISMTHNTGDLTVSLEEASRLESENVKRLRQEKRLSLIVDLDQTIIHAT- 179

Query: 937  FHEVDP-----------VHDEILRKKEEQD-REKPHRHLFRFPHMGMWTKLRPGIWTFLE 984
               VDP           V+ ++LR     + +E P  +   +     + K RPG+  FL+
Sbjct: 180  ---VDPTVGEWMSDPGNVNYDVLRDVRSFNLQEGPSGYTSCY-----YIKFRPGLAQFLQ 231

Query: 985  RASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLE 1044
            + S+L+E+H+YTMG K YA E+AK++DP G LF  RV+SR D G            K L 
Sbjct: 232  KISELYELHIYTMGTKAYAKEVAKIIDPTGKLFQDRVLSRDDSGS--------LAQKSLR 283

Query: 1045 GVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
             +   + S VV+IDD   VW  N  NLI V  Y +F
Sbjct: 284  RLFPCDTSMVVVIDDRGDVWDWNP-NLIKVVPYEFF 318


>gi|317144011|ref|XP_001819844.2| RNA polymerase II subunit A C-terminal domain phosphatase
            [Aspergillus oryzae RIB40]
          Length = 799

 Score = 97.4 bits (241), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 62/188 (32%), Positives = 102/188 (54%), Gaps = 29/188 (15%)

Query: 895  DQQKAAIQKERTRRLEE--QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKE 952
            D     + ++   R+EE  ++++ S RKL LV+DLD T++++     VDP   E +    
Sbjct: 132  DNTALTVSEKEAARVEEDAKRRLLSNRKLSLVVDLDQTIIHAT----VDPTVGEWM---- 183

Query: 953  EQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDP 1012
             +D++ P+            + LRPG+ +FL+  S+LFE+H+YTMG + YA  +A ++DP
Sbjct: 184  -EDKDNPNHQAL--------SDLRPGLESFLQNVSELFELHIYTMGTRAYAQHIASIIDP 234

Query: 1013 KGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLI 1071
               LF  R++SR + G           +K+L  +  +++  VVIIDD   VW  +  NLI
Sbjct: 235  DRKLFGDRILSRDESGS--------LTAKNLHRLFPVDTKMVVIIDDRGDVWRWSP-NLI 285

Query: 1072 VVERYTYF 1079
             V  Y +F
Sbjct: 286  KVSPYDFF 293


>gi|148236185|ref|NP_001090168.1| CTD phosphatase [Xenopus laevis]
 gi|13487713|gb|AAK27686.1| CTD phosphatase [Xenopus laevis]
          Length = 980

 Score = 97.1 bits (240), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 60/171 (35%), Positives = 93/171 (54%), Gaps = 20/171 (11%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
            E+Q ++   +KL L++DLD TL+++ + H        I            H  L R   M
Sbjct: 165  EDQFRLHRNKKLVLMVDLDQTLIHTTEQHCQHMSRKGIF-----------HFQLGRGEPM 213

Query: 970  GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
             + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 214  -LHTRLRPHCKEFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 272

Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            P+       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 273  PYS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 316


>gi|212526776|ref|XP_002143545.1| RNA Polymerase II CTD phosphatase Fcp1, putative [Talaromyces
            marneffei ATCC 18224]
 gi|210072943|gb|EEA27030.1| RNA Polymerase II CTD phosphatase Fcp1, putative [Talaromyces
            marneffei ATCC 18224]
          Length = 829

 Score = 97.1 bits (240), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 68/202 (33%), Positives = 110/202 (54%), Gaps = 33/202 (16%)

Query: 893  YDDQQKAAIQKERTRRLEEQKK-MFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKK 951
            +D+      Q+E TR  E+ K+ + ++++L LV+DLD T++++     VDP   E     
Sbjct: 131  HDNTALTVSQREATRVEEDAKRRLLASKRLSLVVDLDQTIIHAT----VDPTVGEW---- 182

Query: 952  EEQDREKPHR------HLFRF----PHM-GMW--TKLRPGIWTFLERASKLFEMHLYTMG 998
             ++D+  P+         F+     P M G W   KLRPG+ +FL+  S+L+E+H+YTMG
Sbjct: 183  -KEDKNNPNHDAVKDVRAFQLTDDGPGMRGCWYYIKLRPGLESFLQNISELYELHIYTMG 241

Query: 999  NKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMES-AVVIID 1057
             + YA  +A ++DP   LF  R++SR + G           +K+L+ +  +++  VVIID
Sbjct: 242  TRAYAQHIANIIDPDRKLFGDRILSRDESGS--------LTAKNLQRLFPVDTKMVVIID 293

Query: 1058 DSVRVWPHNKLNLIVVERYTYF 1079
            D   VW  N  NLI V  Y +F
Sbjct: 294  DRGDVWKWNP-NLIKVSPYDFF 314


>gi|449675210|ref|XP_002161785.2| PREDICTED: RNA polymerase II subunit A C-terminal domain
            phosphatase-like [Hydra magnipapillata]
          Length = 718

 Score = 97.1 bits (240), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 67/183 (36%), Positives = 96/183 (52%), Gaps = 30/183 (16%)

Query: 902  QKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR 961
            + E+  + +EQ+ +  ARKL LV+DLD TL+++     V+P               K  +
Sbjct: 137  EAEKLAKYDEQQ-LLRARKLVLVVDLDMTLIHTT----VEPT-------------PKNTK 178

Query: 962  HLFRFPHMG----MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLF 1017
             +F F   G      TKLRPG   FLE  SK +E+H++TMG++LYA  +AK LDP G  F
Sbjct: 179  DVFSFKLPGHQYEYHTKLRPGARKFLESISKFYELHIFTMGSRLYAHTVAKCLDPDGKFF 238

Query: 1018 AGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERY 1076
            A R+ SR +  + F       K  DL+ +    +  V IIDD   VW +   NLI V+ Y
Sbjct: 239  AHRIRSRDEFINSF------SKFHDLKALFPCGDHMVCIIDDREDVWNYAP-NLITVKPY 291

Query: 1077 TYF 1079
             +F
Sbjct: 292  KFF 294


>gi|148227040|ref|NP_001081726.1| FCP1 serine phosphatase [Xenopus laevis]
 gi|62185667|gb|AAH92306.1| Fcp1 protein [Xenopus laevis]
          Length = 979

 Score = 97.1 bits (240), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 60/171 (35%), Positives = 93/171 (54%), Gaps = 20/171 (11%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
            E+Q ++   +KL L++DLD TL+++ + H        I            H  L R   M
Sbjct: 165  EDQFRLHRNQKLVLMVDLDQTLIHTTEQHCQHMSRKGIF-----------HFQLGRGEPM 213

Query: 970  GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
             + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 214  -LHTRLRPHCKEFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 272

Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            P+       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 273  PYS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 316


>gi|121705758|ref|XP_001271142.1| RNA Polymerase II CTD phosphatase Fcp1, putative [Aspergillus
            clavatus NRRL 1]
 gi|119399288|gb|EAW09716.1| RNA Polymerase II CTD phosphatase Fcp1, putative [Aspergillus
            clavatus NRRL 1]
          Length = 826

 Score = 97.1 bits (240), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 66/197 (33%), Positives = 109/197 (55%), Gaps = 23/197 (11%)

Query: 893  YDDQQKAAIQKERTRRLEEQKK-MFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKK 951
            +D+      ++E TR  E+ K+ + + +KL LV+DLD T++++     VDP   E +  K
Sbjct: 131  HDNTSLTVSEREATRVEEDAKRRLLANKKLSLVVDLDQTIIHAT----VDPTVREWMEDK 186

Query: 952  EEQDRE-----KPHRHLFRFPHM-GMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYA 1003
            +  + E     +  + +   P M G W   KLRPG+ +FL+  ++LFE+H+YTMG + YA
Sbjct: 187  DNPNHEALSDVRAFQLVDDGPGMRGCWYYVKLRPGLESFLQNVAELFELHIYTMGTRAYA 246

Query: 1004 TEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRV 1062
              +A ++DP   LF  R++SR + G           +K+L+ +  +++  VVIIDD   V
Sbjct: 247  QHIAAIIDPDRKLFGDRILSRDESGS--------LTAKNLQRLFPVDTKMVVIIDDRGDV 298

Query: 1063 WPHNKLNLIVVERYTYF 1079
            W  +  NLI V  Y +F
Sbjct: 299  WRWSP-NLIKVSPYDFF 314


>gi|328872613|gb|EGG20980.1| putative tfiif-interacting component of the c-terminal domain
            phosphatase [Dictyostelium fasciculatum]
          Length = 757

 Score = 96.7 bits (239), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 69/201 (34%), Positives = 109/201 (54%), Gaps = 23/201 (11%)

Query: 896  QQKAAIQKERTRRLEEQ--KKMFSARKLCLVLDLDHTLLNS---AKFHEVDPVHDEILRK 950
            Q    +  +  +++EE+  K++   +KL LVLDLDHT++++     F EV P    I   
Sbjct: 180  QPHITVSHKMAQQIEEKNAKRLLDNKKLSLVLDLDHTIIHAIMEQHFMEV-PYWRTI--- 235

Query: 951  KEEQDREKPHRH-LFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKV 1009
                DR+K + H +    +   + KLRP ++ FL   ++LFE+H+YTMG + YA ++A +
Sbjct: 236  ----DRKKSNIHEIILNGNQRYFIKLRPHLYEFLREVNRLFELHIYTMGTRNYAQKIASL 291

Query: 1010 LDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKL 1068
            +DPK  +F  RV+SR  D  P D +      K L+ +    +S V+I+DD   VW  +K 
Sbjct: 292  VDPKQRVFKERVLSR--DDTPNDMNH-----KTLKRLFPCDDSMVLIVDDRSDVWKKSK- 343

Query: 1069 NLIVVERYTYFPCSRRQFGLL 1089
            NLI +  Y YF   +    LL
Sbjct: 344  NLIQIVPYLYFVGCKDMVNLL 364


>gi|428183780|gb|EKX52637.1| hypothetical protein GUITHDRAFT_101798 [Guillardia theta CCMP2712]
          Length = 749

 Score = 96.7 bits (239), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 65/181 (35%), Positives = 99/181 (54%), Gaps = 33/181 (18%)

Query: 915  MFSARKLCLVLDLDHTLLNSAKFHEVDP---VHDEILRKKEEQDREKPHRHLFRFPHMGM 971
            MF A    LVLDLDHTLL     H   P   + + I++   EQ ++    H+ +      
Sbjct: 113  MFGA----LVLDLDHTLL-----HTTLPRTEMEEMIMQTLHEQCKDV---HVLQVSAARY 160

Query: 972  WTKLRPGIWTFLERASKLFEMHLYT--MGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
            +TKLRPGI  FL   S+LFE+++YT  MG++ YA  +A +LD  G +F GR+ISR D  D
Sbjct: 161  YTKLRPGIRNFLSEMSRLFELYIYTAGMGSQQYAEAVAHMLDESGRMFRGRIISRDDYTD 220

Query: 1030 PFDGDERVPKSKDLEGVLGME---SAVVIIDDSVRVWPH-------NKLNLIVVERYTYF 1079
                     + K L+ V  ++   + V+I+DD+   W H       ++ NLI V++Y+++
Sbjct: 221  V------SLEHKKLDKVFPIDEHRALVIILDDNAETWDHQYSDGRNSQENLIQVDKYSFW 274

Query: 1080 P 1080
            P
Sbjct: 275  P 275


>gi|148236996|ref|NP_001087852.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
            phosphatase, subunit 1 [Xenopus laevis]
 gi|51950264|gb|AAH82378.1| MGC81710 protein [Xenopus laevis]
          Length = 977

 Score = 96.3 bits (238), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 59/171 (34%), Positives = 93/171 (54%), Gaps = 20/171 (11%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
            E+Q ++   +K+ L++DLD TL+++ + H        I            H  L R   M
Sbjct: 163  EDQLRLHRNKKVVLMVDLDQTLIHTTEQHCQHMSRKGIF-----------HFQLGRGEPM 211

Query: 970  GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
             + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 212  -LHTRLRPHCKEFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 270

Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            P+       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 271  PYS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 314


>gi|303317134|ref|XP_003068569.1| NLI interacting factor-like phosphatase family protein [Coccidioides
            posadasii C735 delta SOWgp]
 gi|240108250|gb|EER26424.1| NLI interacting factor-like phosphatase family protein [Coccidioides
            posadasii C735 delta SOWgp]
 gi|320038484|gb|EFW20419.1| RNA Polymerase II CTD phosphatase Fcp1 [Coccidioides posadasii str.
            Silveira]
          Length = 868

 Score = 96.3 bits (238), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 66/196 (33%), Positives = 104/196 (53%), Gaps = 24/196 (12%)

Query: 895  DQQKAAIQKERTRRLEE--QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKE 952
            D     + K+   R+EE  ++++ S+RKL LV+DLD T++++     VDP   E    K 
Sbjct: 132  DNASLTVSKDEATRVEEDAKRRLLSSRKLSLVVDLDQTIIHAT----VDPTVAEWQEDKT 187

Query: 953  EQDRE-----KPHRHLFRFPHM-GMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYAT 1004
              + E     +  + +   P M G W   KLRPG+  FL   S L+E+H+YTMG + YA 
Sbjct: 188  NPNHEAVKDVRAFQLVDDGPGMRGCWYYIKLRPGLEDFLRSISSLYELHIYTMGTRAYAQ 247

Query: 1005 EMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVW 1063
             +A ++DP   +F  R++SR + G           +K+L+ +  +++  VVIIDD   VW
Sbjct: 248  NIANIVDPDRKIFGDRILSRDESGS--------LTAKNLQRLFPVDTKMVVIIDDRGDVW 299

Query: 1064 PHNKLNLIVVERYTYF 1079
              +  NLI V  Y +F
Sbjct: 300  NWSD-NLIRVHPYDFF 314


>gi|119187277|ref|XP_001244245.1| hypothetical protein CIMG_03686 [Coccidioides immitis RS]
          Length = 839

 Score = 95.9 bits (237), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 67/201 (33%), Positives = 106/201 (52%), Gaps = 34/201 (16%)

Query: 895  DQQKAAIQKERTRRLEE--QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKE 952
            D     + K+   R+EE  ++++ S+RKL LV+DLD T++++     VDP   E      
Sbjct: 103  DNASLTVSKDEATRVEEDAKRRLLSSRKLSLVVDLDQTIIHAT----VDPTVAEW----- 153

Query: 953  EQDREKPHR------HLFRF----PHM-GMW--TKLRPGIWTFLERASKLFEMHLYTMGN 999
            ++D+  P+         F+     P M G W   KLRPG+  FL   S L+E+H+YTMG 
Sbjct: 154  QEDKTNPNHEAVKDVRAFQLVDDGPGMRGCWYYIKLRPGLEDFLRSISSLYELHIYTMGT 213

Query: 1000 KLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDD 1058
            + YA  +A ++DP   +F  R++SR + G           +K+L+ +  +++  VVIIDD
Sbjct: 214  RAYAQNIANIVDPDRKIFGDRILSRDESGS--------LTAKNLQRLFPVDTKMVVIIDD 265

Query: 1059 SVRVWPHNKLNLIVVERYTYF 1079
               VW  +  NLI V  Y +F
Sbjct: 266  RGDVWNWSD-NLIRVHPYDFF 285


>gi|392870961|gb|EAS32809.2| FCP1-like phosphatase, phosphatase domain-containing protein
            [Coccidioides immitis RS]
          Length = 868

 Score = 95.9 bits (237), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 66/196 (33%), Positives = 104/196 (53%), Gaps = 24/196 (12%)

Query: 895  DQQKAAIQKERTRRLEE--QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKE 952
            D     + K+   R+EE  ++++ S+RKL LV+DLD T++++     VDP   E    K 
Sbjct: 132  DNASLTVSKDEATRVEEDAKRRLLSSRKLSLVVDLDQTIIHAT----VDPTVAEWQEDKT 187

Query: 953  EQDRE-----KPHRHLFRFPHM-GMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYAT 1004
              + E     +  + +   P M G W   KLRPG+  FL   S L+E+H+YTMG + YA 
Sbjct: 188  NPNHEAVKDVRAFQLVDDGPGMRGCWYYIKLRPGLEDFLRSISSLYELHIYTMGTRAYAQ 247

Query: 1005 EMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVW 1063
             +A ++DP   +F  R++SR + G           +K+L+ +  +++  VVIIDD   VW
Sbjct: 248  NIANIVDPDRKIFGDRILSRDESGS--------LTAKNLQRLFPVDTKMVVIIDDRGDVW 299

Query: 1064 PHNKLNLIVVERYTYF 1079
              +  NLI V  Y +F
Sbjct: 300  NWSD-NLIRVHPYDFF 314


>gi|213403530|ref|XP_002172537.1| RNA polymerase II subunit A C-terminal domain phosphatase
            [Schizosaccharomyces japonicus yFS275]
 gi|212000584|gb|EEB06244.1| RNA polymerase II subunit A C-terminal domain phosphatase
            [Schizosaccharomyces japonicus yFS275]
          Length = 723

 Score = 95.9 bits (237), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 69/207 (33%), Positives = 103/207 (49%), Gaps = 30/207 (14%)

Query: 890  FEGYDDQQKAAIQKER-------TRRLEEQK--KMFSARKLCLVLDLDHTLLNSAKFHEV 940
            + G+ D  +A I            RRLE +   ++   ++L L++DLD T++++     V
Sbjct: 121  YMGFSDLSRATINMTHGSGGLTEARRLETETAIRLQKQKRLSLIVDLDQTIIHAT----V 176

Query: 941  DPVHDEILRKKEEQDRE---KPHRHLFRFPHMGM----WTKLRPGIWTFLERASKLFEMH 993
            DP   E ++     + +     H    R    G     + K RPG+  FL   SKL+E+H
Sbjct: 177  DPTVGEWMKDPNNVNYKVLRDVHYFYLREGTSGYTSCYYIKPRPGLQEFLHNVSKLYELH 236

Query: 994  LYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SA 1052
            +YTMG K YATE+AKV+DP G LF  RV+SR D G+           K +  +   + S 
Sbjct: 237  IYTMGTKAYATEVAKVIDPDGELFQDRVLSRDDSGN--------LTQKSIRRLFPCDTSM 288

Query: 1053 VVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            VV+IDD   VW  +  NLI V  + +F
Sbjct: 289  VVVIDDRGDVWNWSS-NLIKVYPFEFF 314


>gi|255712225|ref|XP_002552395.1| KLTH0C03894p [Lachancea thermotolerans]
 gi|238933774|emb|CAR21957.1| KLTH0C03894p [Lachancea thermotolerans CBS 6340]
          Length = 745

 Score = 95.9 bits (237), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 71/203 (34%), Positives = 102/203 (50%), Gaps = 36/203 (17%)

Query: 895  DQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSA---KFHEV-----DPVHDE 946
            ++Q A I+K   + L E KK      L LV+DLD T+++       HE      +P +D 
Sbjct: 146  ERQAATIEKTAQKHLREHKK------LVLVVDLDQTVIHCGVDPTIHEWANDPSNPNYDA 199

Query: 947  ILRKKEEQDREKPHRHLFRFPHMG---------MWTKLRPGIWTFLERASKLFEMHLYTM 997
            +   K     E P    F   +MG          + KLRPG+  F ++ +  FE+H+YTM
Sbjct: 200  LKNVKTFSLDEDPILPPF---YMGPRPPPRKCQYYVKLRPGLQEFFDKIAPHFELHIYTM 256

Query: 998  GNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVII 1056
              + YA E+AK++DPKG LF  R++SR ++G            K LE +  M +S VVII
Sbjct: 257  ATRAYALEIAKIIDPKGELFGDRILSRDENGS--------LTHKSLERLFPMDQSMVVII 308

Query: 1057 DDSVRVWPHNKLNLIVVERYTYF 1079
            DD   VW   + NLI V  Y +F
Sbjct: 309  DDRGDVWSWCE-NLIKVVPYNFF 330


>gi|50306333|ref|XP_453140.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
 gi|49642274|emb|CAH00236.1| KLLA0D01595p [Kluyveromyces lactis]
          Length = 719

 Score = 95.5 bits (236), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 74/204 (36%), Positives = 103/204 (50%), Gaps = 38/204 (18%)

Query: 895  DQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRK---- 950
            +QQ   +++    RL E+KK      L LV+DLD T+++      VDP   E +R     
Sbjct: 138  EQQAETLERSSLTRLREEKK------LVLVVDLDQTVIHCG----VDPTIGEWMRDPKNP 187

Query: 951  --KEEQD------REKPHRHLFRF----PHMGMW--TKLRPGIWTFLERASKLFEMHLYT 996
              K  QD       ++P    F F    P    W   KLRPG+  F E  S  FEMH+YT
Sbjct: 188  NYKALQDVKSFTLEDEPIIPSFYFGPKPPARKSWYYVKLRPGLKEFFEAVSPHFEMHIYT 247

Query: 997  MGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVI 1055
            M  + YA E+AK++DP G LF  R++SR ++G           +K LE +  M +S VV+
Sbjct: 248  MATRSYAHEIAKIIDPTGELFGDRILSRDENGS--------LTTKSLERLFPMDQSMVVV 299

Query: 1056 IDDSVRVWPHNKLNLIVVERYTYF 1079
            IDD   VW   + NLI V  Y++F
Sbjct: 300  IDDRGDVWNWFE-NLIKVVPYSFF 322


>gi|330930047|ref|XP_003302870.1| hypothetical protein PTT_14854 [Pyrenophora teres f. teres 0-1]
 gi|311321498|gb|EFQ89046.1| hypothetical protein PTT_14854 [Pyrenophora teres f. teres 0-1]
          Length = 803

 Score = 95.1 bits (235), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 56/178 (31%), Positives = 96/178 (53%), Gaps = 21/178 (11%)

Query: 912  QKKMFSARKLCLVLDLDHTLLNSAKFHEV--------DPVHDEILRKKEEQDREKPHRHL 963
            +K++ SARKL L++DLD T++++     +        +P HD +   K+ Q  +    ++
Sbjct: 152  KKRLLSARKLTLIVDLDQTVIHTTCERTIAEWQADPENPNHDAV---KDVQGFQLADDNV 208

Query: 964  FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 1023
                    + K+RPG+  F +R SKL+EMH+YTM  + YA  +AK++DP+   F  R++S
Sbjct: 209  SNVAANWYYVKMRPGLKDFFDRVSKLYEMHVYTMATRAYAQAVAKIIDPERKYFGDRILS 268

Query: 1024 RGDDGDPFDGDERVPKSKDLEGVLGMESAV-VIIDDSVRVWPHNKLNLIVVERYTYFP 1080
            R ++           K K L  +    +A+ VIIDD   VW ++  +L+ V  + +FP
Sbjct: 269  RDEN--------YTDKLKSLTRLFYQNTAMCVIIDDRADVWQYSP-HLVRVPVFNFFP 317


>gi|440804367|gb|ELR25244.1| FCP1like phosphatase, phosphatase subfamily protein [Acanthamoeba
            castellanii str. Neff]
          Length = 930

 Score = 95.1 bits (235), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 64/185 (34%), Positives = 102/185 (55%), Gaps = 27/185 (14%)

Query: 913  KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDE------------ILRKKEEQDREKPH 960
            +++ +A+KL LVLDLD TL+++ +  EV+ +                L        + P 
Sbjct: 142  ERLTAAKKLSLVLDLDQTLVHATQDAEVETLFGTDAAEAKGGSITCALPNPPAGPEDVPA 201

Query: 961  RHLFRF-----PHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
             HL+RF     PH   + KLRP +  FL     LFE+H+YTMG++ YA ++A+++DP+  
Sbjct: 202  AHLYRFTLEGNPHK-FYLKLRPHLEEFLMGVKDLFELHIYTMGSRSYARKVAQIIDPEQK 260

Query: 1016 LFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVE 1074
            LF   ++SR + G+       V   K+L+ +  + +S V+IIDD V VW  +K NLI +E
Sbjct: 261  LFRENIVSRDECGN-------VMNLKNLQRIFPVDDSMVMIIDDRVDVWGTSK-NLIKIE 312

Query: 1075 RYTYF 1079
             Y +F
Sbjct: 313  PYYFF 317


>gi|189211133|ref|XP_001941897.1| RNA polymerase II subunit A C-terminal domain phosphatase
            [Pyrenophora tritici-repentis Pt-1C-BFP]
 gi|187977990|gb|EDU44616.1| RNA polymerase II subunit A C-terminal domain phosphatase
            [Pyrenophora tritici-repentis Pt-1C-BFP]
          Length = 774

 Score = 95.1 bits (235), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 56/178 (31%), Positives = 96/178 (53%), Gaps = 21/178 (11%)

Query: 912  QKKMFSARKLCLVLDLDHTLLNSAKFHEV--------DPVHDEILRKKEEQDREKPHRHL 963
            +K++ SARKL L++DLD T++++     +        +P HD +   K+ Q  +    ++
Sbjct: 152  KKRLLSARKLTLIVDLDQTVIHTTCERTIAEWQADPENPNHDAV---KDVQGFQLADDNV 208

Query: 964  FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 1023
                    + K+RPG+  F +R SKL+EMH+YTM  + YA  +AK++DP+   F  R++S
Sbjct: 209  SNVAANWYYVKMRPGLKDFFDRVSKLYEMHVYTMATRAYAQAVAKIIDPERKYFGDRILS 268

Query: 1024 RGDDGDPFDGDERVPKSKDLEGVLGMESAV-VIIDDSVRVWPHNKLNLIVVERYTYFP 1080
            R ++           K K L  +    +A+ VIIDD   VW ++  +L+ V  + +FP
Sbjct: 269  RDEN--------YTDKLKSLTRLFYQNTAMCVIIDDRADVWQYSP-HLVRVPVFNFFP 317


>gi|330799899|ref|XP_003287978.1| hypothetical protein DICPUDRAFT_55168 [Dictyostelium purpureum]
 gi|325082002|gb|EGC35499.1| hypothetical protein DICPUDRAFT_55168 [Dictyostelium purpureum]
          Length = 730

 Score = 94.7 bits (234), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 62/168 (36%), Positives = 91/168 (54%), Gaps = 14/168 (8%)

Query: 913  KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMW 972
            K++   RKL LVLDLDHTL+++     ++   +   R +++ D    H      P M   
Sbjct: 129  KRLIKERKLSLVLDLDHTLIHAVTEQGLNSSPNWKNRNRKDYD---IHNITVNGP-MTYC 184

Query: 973  TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD-DGDPF 1031
             K RP +  FLE  +K FE+H+YTMG + YA E+AK++DP   LF  R++SR D +G  F
Sbjct: 185  IKKRPHLNDFLENVNKNFELHIYTMGTRNYANEIAKLIDPDQTLFKERILSRDDGNGINF 244

Query: 1032 DGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
               +R+    D        S V+I+DD   VW  +K NLI +  Y +F
Sbjct: 245  KTLQRLFPCDD--------SMVLIVDDRSDVWKKSK-NLIQISPYVFF 283


>gi|428672202|gb|EKX73116.1| conserved hypothetical protein [Babesia equi]
          Length = 739

 Score = 94.4 bits (233), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 67/200 (33%), Positives = 99/200 (49%), Gaps = 37/200 (18%)

Query: 914  KMFSARKLCLVLDLDHTLLNSAK--------FHEVDPVHDE--ILRKKEEQDREKPHRHL 963
            K+   RKLCLVLDLD+TLL+++           E+D +  +  I +  +  D E   +  
Sbjct: 240  KVLQKRKLCLVLDLDNTLLHASSQKLPSDVYVDEIDFLSKDADIFKDVQYNDDEGTLKLR 299

Query: 964  FRFPHMGMWT---------------KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAK 1008
             +F    + T               KLRPG++ FL+  S  FE++L+TMG K +A+   K
Sbjct: 300  KKFESSIIQTMVYNESETMCCKSYFKLRPGVFKFLKEMSAKFELYLFTMGTKQHASSSLK 359

Query: 1009 VLDPKGVLFAGRVISRGDDGDPFDGDERV-PKSKDLEGVLGMESAVVIIDDSVRVWPHNK 1067
            +LDPK + F  R+  R D        +R+ PK K+L         V+I+DD+  VW  N 
Sbjct: 360  ILDPKRIYFGNRIFCRNDSRSSMKSLDRIFPKHKNL---------VLIVDDTEHVWTCN- 409

Query: 1068 LNLIVVERYTYFP-CSRRQF 1086
            L LI +  Y +FP  S  QF
Sbjct: 410  LGLIKIHPYFFFPDLSYLQF 429


>gi|384501479|gb|EIE91970.1| hypothetical protein RO3G_16681 [Rhizopus delemar RA 99-880]
          Length = 494

 Score = 94.4 bits (233), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 55/174 (31%), Positives = 93/174 (53%), Gaps = 25/174 (14%)

Query: 909  LEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPH 968
            +E  K++   +KL L+LDLD T+++++    +    +  +R+             F  P 
Sbjct: 20   VENAKRLLDQKKLSLILDLDQTIVHASCDQRISQWQNPDIRQ-------------FNLPR 66

Query: 969  --MGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD 1026
              +  + KLRPG+  FL+   +L+E+H+YTMG K YA  +AK +DP+G LF  R++SR +
Sbjct: 67   SPLVYYIKLRPGLIEFLKEIEELYELHIYTMGTKDYAKAVAKEIDPEGCLFKERILSRDE 126

Query: 1027 DGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
             G            K L+ +   + S VV++DD   VW ++  NL+ ++ Y YF
Sbjct: 127  SG--------CLTQKKLQRIFPCDTSMVVVLDDRSDVWSYSP-NLVRIKPYEYF 171


>gi|156837042|ref|XP_001642557.1| hypothetical protein Kpol_1068p9 [Vanderwaltozyma polyspora DSM
            70294]
 gi|156113100|gb|EDO14699.1| hypothetical protein Kpol_1068p9 [Vanderwaltozyma polyspora DSM
            70294]
          Length = 745

 Score = 94.0 bits (232), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 66/192 (34%), Positives = 93/192 (48%), Gaps = 42/192 (21%)

Query: 912  QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFR------ 965
            +K++   +KL LV+DLD T+++      VDP   E      + D   P+    R      
Sbjct: 156  KKRLIREKKLILVVDLDQTVIHCG----VDPTIAEW-----KNDPTNPNFETLRDVKSFV 206

Query: 966  ------FPHMGM-----------WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAK 1008
                   P M M           + K+RPG+  F E  SKL+EMH+YTM  + YA E+AK
Sbjct: 207  LEEEPILPPMYMGPKPPTHKCWYYVKIRPGLKEFFEEVSKLYEMHIYTMATRSYAQEIAK 266

Query: 1009 VLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNK 1067
            ++DP G LFA R++SR ++G            K LE +    +S VV+IDD   VW    
Sbjct: 267  IIDPDGTLFADRILSRNENGS--------LTHKSLERLFPTDQSMVVVIDDRGDVWNWCP 318

Query: 1068 LNLIVVERYTYF 1079
             NLI V  Y +F
Sbjct: 319  -NLIKVTPYNFF 329


>gi|407929624|gb|EKG22436.1| BRCT domain-containing protein [Macrophomina phaseolina MS6]
          Length = 861

 Score = 94.0 bits (232), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 63/184 (34%), Positives = 97/184 (52%), Gaps = 32/184 (17%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFR---- 965
            E ++++ S +KL LV+DLD T++++     VDP   E      ++D E P+    +    
Sbjct: 150  EAKRRLLSNKKLSLVVDLDQTIIHAT----VDPTVAEW-----QKDPENPNYEAVKDVQS 200

Query: 966  FPHM-------GMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 1016
            F  +       G W   KLRPG+  FLE  SK++E+H+YTMG + YA  +AK++DP   +
Sbjct: 201  FQLLDNGPGGRGCWYYIKLRPGLREFLENISKVYELHIYTMGTRAYAQNIAKIVDPNRKI 260

Query: 1017 FAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVER 1075
            F  R++SR + G            K L  +  +++  VVIIDD   VW  +  NLI V  
Sbjct: 261  FGDRILSRDESGS--------LTVKTLHRIFPVDTKMVVIIDDRGDVWSWSN-NLIKVTP 311

Query: 1076 YTYF 1079
            Y +F
Sbjct: 312  YDFF 315


>gi|430812451|emb|CCJ30145.1| unnamed protein product [Pneumocystis jirovecii]
          Length = 741

 Score = 94.0 bits (232), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 68/211 (32%), Positives = 105/211 (49%), Gaps = 34/211 (16%)

Query: 890  FEGYDDQQKAAIQ-----------KERTRRLEEQ--KKMFSARKLCLVLDLDHTLLNSAK 936
            F G+ D  +A IQ           KE   RLE +  +++    KL L++DLD T+L++  
Sbjct: 151  FTGFLDSTRATIQMSHDATKLTVSKEEATRLERETMERLLKEMKLSLIVDLDQTILHAT- 209

Query: 937  FHEVDPVHDEILRK---KEEQDREKPHRHLFRFPHMGM----WTKLRPGIWTFLERASKL 989
               VDP+  E L     K     +   +   +  + G+    + K+RPG+  FLE  SKL
Sbjct: 210  ---VDPIVGEWLSNPSSKHYLAVQDVQKFCLKENNSGIGNWYYVKMRPGLEQFLENISKL 266

Query: 990  FEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM 1049
            +EMH+YTMG + YA  +A ++D     F  R++SR + G            K+++ +  +
Sbjct: 267  YEMHIYTMGTRAYAASIAHLIDKDKKYFGDRILSRDESGS--------TTRKNIQRLFPV 318

Query: 1050 E-SAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            + S VVIIDD   VW  +  NLI V  Y +F
Sbjct: 319  DTSMVVIIDDRADVWQWSP-NLIKVTPYEFF 348


>gi|398396164|ref|XP_003851540.1| hypothetical protein MYCGRDRAFT_44229 [Zymoseptoria tritici IPO323]
 gi|339471420|gb|EGP86516.1| hypothetical protein MYCGRDRAFT_44229 [Zymoseptoria tritici IPO323]
          Length = 822

 Score = 94.0 bits (232), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 62/194 (31%), Positives = 100/194 (51%), Gaps = 20/194 (10%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP-- 967
            E +K++  +RKL LV+DLD T++ +     V+P   E  +        K  + + +F   
Sbjct: 155  ERRKRLLDSRKLSLVVDLDQTIIQA----NVEPTIGE-WKNDPTNPNWKALQDVCQFQLA 209

Query: 968  ---HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISR 1024
                   + KLRPG+  FL   S+L+E+H+YTMG + YA  +AK++DP   +F  R++SR
Sbjct: 210  DDGRTWYYVKLRPGLKDFLRDMSELYELHIYTMGTRAYADNIAKIVDPDRKVFGDRILSR 269

Query: 1025 GDDGDPFDGDERVPKSKDLEGVLGMES-AVVIIDDSVRVWPHNKLNLIVVERYTYFPCSR 1083
             ++G            K+L+ +   ++  VVIIDD   VW H   NLI V  + +FP   
Sbjct: 270  DENGS--------MTVKNLKRLFHADTRMVVIIDDRADVW-HWTPNLIKVNAFEFFPGVG 320

Query: 1084 RQFGLLGPSLLEID 1097
               G+  P   E++
Sbjct: 321  DINGMFLPKRQELE 334


>gi|342320998|gb|EGU12936.1| RNA Polymerase II CTD phosphatase Fcp1, putative [Rhodotorula
            glutinis ATCC 204091]
          Length = 817

 Score = 93.6 bits (231), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 74/252 (29%), Positives = 114/252 (45%), Gaps = 78/252 (30%)

Query: 890  FEGYDDQQKAAIQK-----------ERTRRLEE--QKKMFSARKLCLVLDLDHTLLNSAK 936
            + G+ D  +A I             E   RLE+    ++  A+KL L++DLD T++++  
Sbjct: 115  YTGFSDTSRATISMAHDIGGLTVSLEEAHRLEKATTARLLDAKKLSLIVDLDQTIVHAT- 173

Query: 937  FHEVDPVHDEIL--------------RKKEEQD-----REKPHRHL-FRFPHM------- 969
               VDP   E L              ++ + QD     R KP ++   R   +       
Sbjct: 174  ---VDPTVGEWLQDPKNPNYKALEGVKRFKLQDESPATRNKPKKYRKIRIKQVDPSKGEE 230

Query: 970  ----------------GMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLD 1011
                            G W   K+RPG+  FL+R ++++EMH+YTMG + YA+E+ KV+D
Sbjct: 231  ADDESSEDEEEDEDDGGCWYYIKMRPGLPDFLKRVAEMYEMHVYTMGTRAYASEVCKVID 290

Query: 1012 PKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMES-AVVIIDDSVRVW---PHNK 1067
            P G LF GR++SR + G            K L+ +   ++  VVIIDD   VW   PH  
Sbjct: 291  PDGGLFGGRILSRDESGS--------MTRKSLQRLFPCDTNMVVIIDDRADVWDGSPH-- 340

Query: 1068 LNLIVVERYTYF 1079
              L+ V  Y +F
Sbjct: 341  --LVKVIPYEFF 350


>gi|449299873|gb|EMC95886.1| hypothetical protein BAUCODRAFT_71386 [Baudoinia compniacensis UAMH
            10762]
          Length = 790

 Score = 93.6 bits (231), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 60/181 (33%), Positives = 99/181 (54%), Gaps = 26/181 (14%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSA--------KFHEVDPVHDEI--LRKKEEQDREKP 959
            E ++++  +RKL LV+DLD T++++         +  E +P H  +  +RK +  D    
Sbjct: 150  EAKRRLIKSRKLSLVVDLDQTIIHATVDPTVAEWQADETNPNHAAVKGVRKFQLVDDGPG 209

Query: 960  HRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAG 1019
             R  + +       KLRPG+  FL+  S+ +E+H+YTM  + YA E+AK++DP   LFA 
Sbjct: 210  GRGTWYY------IKLRPGLSDFLQLVSQYYELHIYTMATRAYAEEIAKLVDPGRKLFAN 263

Query: 1020 RVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERYTY 1078
            R++SR ++G           SK L+ +  +++  VVIIDD   VW  +  NL+ V  Y +
Sbjct: 264  RILSRDENGS--------MNSKSLKRLFPVDTKMVVIIDDRGDVWSWSP-NLVKVSAYDF 314

Query: 1079 F 1079
            F
Sbjct: 315  F 315


>gi|451853161|gb|EMD66455.1| hypothetical protein COCSADRAFT_112846 [Cochliobolus sativus ND90Pr]
          Length = 803

 Score = 93.2 bits (230), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 54/178 (30%), Positives = 96/178 (53%), Gaps = 21/178 (11%)

Query: 912  QKKMFSARKLCLVLDLDHTLLNSAKFHEV--------DPVHDEILRKKEEQDREKPHRHL 963
            ++++ +ARKL L++DLD T++++     +        +P HD +   K+ Q  +    ++
Sbjct: 152  KRRLLNARKLTLIVDLDQTVIHTTCERTIAEWQADPENPNHDAV---KDVQGFQLADDNV 208

Query: 964  FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 1023
                    + K+RPG+  F +R SKL+EMH+YTM  + YA  +AK++DP+   F  R++S
Sbjct: 209  SNVAANWYYVKMRPGLKDFFDRVSKLYEMHVYTMATRAYAQAVAKIIDPERKYFGDRILS 268

Query: 1024 RGDDGDPFDGDERVPKSKDLEGVLGMESAV-VIIDDSVRVWPHNKLNLIVVERYTYFP 1080
            R ++           K K L  +    +A+ VIIDD   VW ++  +L+ V  + +FP
Sbjct: 269  RDEN--------YTDKLKSLTRLFYQNTAMCVIIDDRADVWQYSP-HLVRVPVFNFFP 317


>gi|402220046|gb|EJU00119.1| hypothetical protein DACRYDRAFT_81791 [Dacryopinax sp. DJM-731 SS1]
          Length = 855

 Score = 93.2 bits (230), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 68/219 (31%), Positives = 101/219 (46%), Gaps = 46/219 (21%)

Query: 895  DQQKAAIQKERTRRLEE--QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEIL---- 948
            D     +  E   RLE   Q ++  +RKL LV+DLD T++ +     VDP   E +    
Sbjct: 136  DAASITVSPEVAARLEHESQIRLLGSRKLSLVVDLDQTIIQAT----VDPTVGEWIDQGR 191

Query: 949  --RKKEEQDREKPH----RHLFRF---------------------PHMGMWTKLRPGIWT 981
               +  E  R+ P+    R + RF                          + K RPG+  
Sbjct: 192  AWEEGREGARKNPNWEALRDVGRFRLSEERKVVNGRGGKVIRSKREDTAYYIKPRPGLHA 251

Query: 982  FLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD-PFDGDERVPKS 1040
            FL R S+L+EMH+YTMG + YA+++ +++DP G LF  RV+SR + G   F    R+   
Sbjct: 252  FLSRLSELYEMHVYTMGTRSYASQVVRLIDPLGNLFGSRVLSRDESGSLTFKNLTRLFPC 311

Query: 1041 KDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
                      S+ VIIDD   VW  ++ NL+ V  Y +F
Sbjct: 312  NT--------SSAVIIDDRADVWDLSRANLVKVVPYDFF 342


>gi|255936731|ref|XP_002559392.1| Pc13g09690 [Penicillium chrysogenum Wisconsin 54-1255]
 gi|211584012|emb|CAP92038.1| Pc13g09690 [Penicillium chrysogenum Wisconsin 54-1255]
          Length = 792

 Score = 93.2 bits (230), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 64/188 (34%), Positives = 103/188 (54%), Gaps = 25/188 (13%)

Query: 903  KERTRRLEEQKK-MFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR 961
            +E TR  E+ K+ + ++R+L LV+DLD T++++     VDP   E    K+  + E   R
Sbjct: 114  REATRVEEDAKRRLLASRRLTLVVDLDQTIIHAT----VDPTVGEWREDKQNPNHEAV-R 168

Query: 962  HLFRF------PHM-GMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDP 1012
             + +F      P M G W   KLRPG+  FL+  ++++E+H+YTMG + YA  +  ++DP
Sbjct: 169  DVRQFQLIDDGPGMRGCWYYIKLRPGLEEFLQNVAEIYELHIYTMGTRAYAQHIVDIIDP 228

Query: 1013 KGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLI 1071
               LF  R++SR + G            KDL+ +  +++  VVIIDD   +W  +  NLI
Sbjct: 229  TRKLFGDRILSRDESGS--------LTVKDLQRLFPVDTKMVVIIDDRGDIWRWSP-NLI 279

Query: 1072 VVERYTYF 1079
             V  Y +F
Sbjct: 280  KVSPYDFF 287


>gi|452004576|gb|EMD97032.1| hypothetical protein COCHEDRAFT_1163398 [Cochliobolus heterostrophus
            C5]
          Length = 803

 Score = 92.8 bits (229), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 54/178 (30%), Positives = 96/178 (53%), Gaps = 21/178 (11%)

Query: 912  QKKMFSARKLCLVLDLDHTLLNSAKFHEV--------DPVHDEILRKKEEQDREKPHRHL 963
            ++++ +ARKL L++DLD T++++     +        +P HD +   K+ Q  +    ++
Sbjct: 152  KRRLLNARKLTLIVDLDQTVIHTTCERTIAEWQADPENPNHDAV---KDVQGFQLADDNV 208

Query: 964  FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 1023
                    + K+RPG+  F +R SKL+EMH+YTM  + YA  +AK++DP+   F  R++S
Sbjct: 209  SNVAANWYYVKMRPGLKDFFDRVSKLYEMHVYTMATRAYAQAVAKIIDPERKYFGDRILS 268

Query: 1024 RGDDGDPFDGDERVPKSKDLEGVLGMESAV-VIIDDSVRVWPHNKLNLIVVERYTYFP 1080
            R ++           K K L  +    +A+ VIIDD   VW ++  +L+ V  + +FP
Sbjct: 269  RDEN--------YTDKLKSLTRLFYQNTAMCVIIDDRADVWQYSP-HLVRVPVFNFFP 317


>gi|258563858|ref|XP_002582674.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
 gi|237908181|gb|EEP82582.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
          Length = 897

 Score = 92.4 bits (228), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 64/196 (32%), Positives = 103/196 (52%), Gaps = 24/196 (12%)

Query: 895  DQQKAAIQKERTRRLEE--QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKE 952
            D     + K+   R+EE  ++++ ++RKL LV+DLD T++++     VDP   E    K 
Sbjct: 157  DNASLTVSKDEATRVEEDAKRRLLASRKLSLVVDLDQTIIHAT----VDPTVAEWREDKT 212

Query: 953  EQDRE-----KPHRHLFRFPHM-GMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYAT 1004
              + E     +  + +   P M G W   KLRPG+  FL+  S L+E+H+YTM  + YA 
Sbjct: 213  NPNHEAVKNVRSFQLIDDGPGMRGCWYYIKLRPGLEEFLKNISSLYELHIYTMATRAYAQ 272

Query: 1005 EMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVW 1063
             +A ++DP   +F  R++SR + G           +K+L  +  +++  VVIIDD   VW
Sbjct: 273  NIANIVDPDRKIFGDRILSRDESGS--------LTAKNLHRLFPVDTKMVVIIDDRGDVW 324

Query: 1064 PHNKLNLIVVERYTYF 1079
              +  NLI V  Y +F
Sbjct: 325  KWSD-NLIRVFPYDFF 339


>gi|425767354|gb|EKV05928.1| RNA Polymerase II CTD phosphatase Fcp1, putative [Penicillium
            digitatum PHI26]
 gi|425779797|gb|EKV17828.1| RNA Polymerase II CTD phosphatase Fcp1, putative [Penicillium
            digitatum Pd1]
          Length = 817

 Score = 92.4 bits (228), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 62/188 (32%), Positives = 103/188 (54%), Gaps = 23/188 (12%)

Query: 902  QKERTRRLEEQKK-MFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDRE--- 957
            ++E TR  E+ K+ + ++R+L LV+DLD T++++     VDP   E    K+  + E   
Sbjct: 140  EREATRVEEDAKRRLLASRRLTLVVDLDQTIIHAT----VDPTVGEWREDKQNPNHEAVK 195

Query: 958  --KPHRHLFRFPHM-GMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDP 1012
              +  + +   P M G W   KLRPG+  FL+  ++++E+H+YTMG + YA  +  ++DP
Sbjct: 196  DVRQFQLIDDGPGMRGCWYYIKLRPGLEEFLQNVAEIYELHIYTMGTRAYAQHIVDIIDP 255

Query: 1013 KGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMES-AVVIIDDSVRVWPHNKLNLI 1071
               LF  R++SR + G            KDL+ +  +++  VVIIDD   +W  +  NLI
Sbjct: 256  TRKLFGDRILSRDESGS--------LTVKDLQRLFPVDTKMVVIIDDRGDIWRWSP-NLI 306

Query: 1072 VVERYTYF 1079
             V  Y +F
Sbjct: 307  KVSPYDFF 314


>gi|296419837|ref|XP_002839498.1| hypothetical protein [Tuber melanosporum Mel28]
 gi|295635659|emb|CAZ83689.1| unnamed protein product [Tuber melanosporum]
          Length = 896

 Score = 92.4 bits (228), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 61/196 (31%), Positives = 100/196 (51%), Gaps = 24/196 (12%)

Query: 895  DQQKAAIQKERTRRLEEQ--KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDE------ 946
            D     + ++   RLEE+  +++  ++KL LV+DLD T++++     VDP   +      
Sbjct: 132  DSMGLTVSQDEATRLEEETKRRLLKSKKLSLVVDLDQTIIHAT----VDPTVGDWKNDPF 187

Query: 947  ILRKKEEQDREKPHRHLFRFPHMGMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYAT 1004
             +  +  +D +            G W   K+RPG+  FLE  S+L+E+H+YTMG + YA 
Sbjct: 188  CINHESVKDVQAFKLDEDIIGGRGTWYYVKMRPGLKEFLEHISQLYELHIYTMGTRAYAM 247

Query: 1005 EMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMES-AVVIIDDSVRVW 1063
             + K++DP G +F  RV+SR + G            K L  +  +++  VVIIDD   VW
Sbjct: 248  SVKKIVDPDGRIFGERVLSRDESGS--------MTQKSLHRIFPVDTKMVVIIDDRGDVW 299

Query: 1064 PHNKLNLIVVERYTYF 1079
              +  NL+ V  Y +F
Sbjct: 300  KWSD-NLVKVRPYDFF 314


>gi|453084575|gb|EMF12619.1| hypothetical protein SEPMUDRAFT_149240 [Mycosphaerella populorum
            SO2202]
          Length = 848

 Score = 92.4 bits (228), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 63/192 (32%), Positives = 99/192 (51%), Gaps = 32/192 (16%)

Query: 902  QKERTRRLEE-QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPH 960
            Q E T+  EE ++++  +R+L LV+DLD T++++     VDP   E      + D   P+
Sbjct: 142  QDEATKTDEEGKRRLLDSRRLSLVVDLDQTIIHAC----VDPSIGEW-----QNDPSNPN 192

Query: 961  RHLFRFPH----------MGMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAK 1008
                R             +  W   K RPG+ +FL+  S+L+EMH+YTMG + YA  +AK
Sbjct: 193  YDALRDVQAFQLRDDNKPVATWYYIKQRPGLQSFLKGLSELYEMHIYTMGTRTYAEGVAK 252

Query: 1009 VLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNK 1067
            ++DP G +F  R+++R + G          K K L+ +   +S  VVIIDD   VW    
Sbjct: 253  IIDPDGRVFGDRIVTRTESGS--------DKEKSLKRLFPTDSKMVVIIDDRADVWRWIS 304

Query: 1068 LNLIVVERYTYF 1079
             NL+ V  + +F
Sbjct: 305  -NLVKVNVFEFF 315


>gi|67524889|ref|XP_660506.1| hypothetical protein AN2902.2 [Aspergillus nidulans FGSC A4]
 gi|40744297|gb|EAA63473.1| hypothetical protein AN2902.2 [Aspergillus nidulans FGSC A4]
 gi|259486161|tpe|CBF83781.1| TPA: CTD phosphatase-related (Eurofung) [Aspergillus nidulans FGSC
            A4]
          Length = 829

 Score = 92.4 bits (228), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 66/201 (32%), Positives = 105/201 (52%), Gaps = 34/201 (16%)

Query: 895  DQQKAAIQKERTRRLEE--QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKE 952
            D     + +    R+EE  ++++ + RKL LV+DLD T++++A    VDP   E +    
Sbjct: 132  DNTALTVSEREAIRVEEDAKRRLLANRKLSLVVDLDQTIIHAA----VDPTIGEWM---- 183

Query: 953  EQDREKPHR------HLFRF----PHM-GMW--TKLRPGIWTFLERASKLFEMHLYTMGN 999
              D++ P+         F+     P M G W   KLRPG+  FLE  ++++E+H+YTMG 
Sbjct: 184  -ADKDNPNHAAVSDVRAFQLVDDGPGMRGCWYYVKLRPGLEEFLENVAEMYELHIYTMGT 242

Query: 1000 KLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDD 1058
            + YA  +A ++DP   LF  R++SR + G            K+L  +  +++  VVIIDD
Sbjct: 243  RSYAQAIANIIDPDRKLFGDRILSRDESGS--------LSVKNLHRIFPVDTKMVVIIDD 294

Query: 1059 SVRVWPHNKLNLIVVERYTYF 1079
               VW  +  NLI V  Y +F
Sbjct: 295  RGDVWRWSP-NLIKVIPYDFF 314


>gi|215794710|pdb|3EF1|A Chain A, The Structure Of Fcp1, An Essential Rna Polymerase Ii Ctd
            Phosphatase
          Length = 442

 Score = 92.0 bits (227), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 66/195 (33%), Positives = 101/195 (51%), Gaps = 33/195 (16%)

Query: 900  AIQKERTRRLEEQ--KKMFSARKLCLVLDLDHTLLNSAKFHEVDP-----------VHDE 946
             +  E   RLE +  K++   ++L L++ LD T++++     VDP           V+ +
Sbjct: 4    TVSLEEASRLESENVKRLRQEKRLSLIVXLDQTIIHAT----VDPTVGEWMSDPGNVNYD 59

Query: 947  ILRKKEEQD-REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATE 1005
            +LR     + +E P  +   +     + K RPG+  FL++ S+L+E+H+YTMG K YA E
Sbjct: 60   VLRDVRSFNLQEGPSGYTSCY-----YIKFRPGLAQFLQKISELYELHIYTMGTKAYAKE 114

Query: 1006 MAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWP 1064
            +AK++DP G LF  RV+SR D G            K L  +   + S VV+IDD   VW 
Sbjct: 115  VAKIIDPTGKLFQDRVLSRDDSGS--------LAQKSLRRLFPCDTSMVVVIDDRGDVWD 166

Query: 1065 HNKLNLIVVERYTYF 1079
             N  NLI V  Y +F
Sbjct: 167  WNP-NLIKVVPYEFF 180


>gi|215794709|pdb|3EF0|A Chain A, The Structure Of Fcp1, An Essential Rna Polymerase Ii Ctd
            Phosphatase
          Length = 372

 Score = 92.0 bits (227), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 66/192 (34%), Positives = 101/192 (52%), Gaps = 37/192 (19%)

Query: 901  IQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDP-----------VHDEILR 949
            ++ E  +RL ++K+      L L++DLD T++++     VDP           V+ ++LR
Sbjct: 5    LESENVKRLRQEKR------LSLIVDLDQTIIHAT----VDPTVGEWMSDPGNVNYDVLR 54

Query: 950  KKEEQD-REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAK 1008
                 + +E P  +   +     + K RPG+  FL++ S+L+E+H+YTMG K YA E+AK
Sbjct: 55   DVRSFNLQEGPSGYTSCY-----YIKFRPGLAQFLQKISELYELHIYTMGTKAYAKEVAK 109

Query: 1009 VLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNK 1067
            ++DP G LF  RV+SR D G            K L  +   + S VV+IDD   VW  N 
Sbjct: 110  IIDPTGKLFQDRVLSRDDSGS--------LAQKSLRRLFPCDTSMVVVIDDRGDVWDWNP 161

Query: 1068 LNLIVVERYTYF 1079
             NLI V  Y +F
Sbjct: 162  -NLIKVVPYEFF 172


>gi|66824241|ref|XP_645475.1| hypothetical protein DDB_G0271690 [Dictyostelium discoideum AX4]
 gi|60473594|gb|EAL71535.1| hypothetical protein DDB_G0271690 [Dictyostelium discoideum AX4]
          Length = 782

 Score = 92.0 bits (227), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 56/168 (33%), Positives = 92/168 (54%), Gaps = 14/168 (8%)

Query: 913  KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMW 972
            K++   +KL LVLDLDHT++++      +   +    + +++++   H      P M   
Sbjct: 128  KRLLMEKKLSLVLDLDHTVIHAVTEQGFNSSPE---WRNKDKNKNGIHTITVNGP-MNYC 183

Query: 973  TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD-DGDPF 1031
             K RP +  FL   +K++E+H+YTMG + YA E+AK++DP+  +F  R++SR D +G  F
Sbjct: 184  IKKRPHLVKFLTEVNKIYELHIYTMGTRNYANEIAKLIDPESSIFKERILSRDDGNGINF 243

Query: 1032 DGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
               +R+    D        S V+I+DD   VW  +K NLI +  Y YF
Sbjct: 244  KSLQRLFPCDD--------SMVLIVDDRSDVWKKSK-NLIQISPYVYF 282


>gi|393240595|gb|EJD48120.1| hypothetical protein AURDEDRAFT_85955 [Auricularia delicata TFB-10046
            SS5]
          Length = 796

 Score = 91.7 bits (226), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 74/250 (29%), Positives = 110/250 (44%), Gaps = 73/250 (29%)

Query: 890  FEGYDDQQKAAIQK-----------ERTRRLEEQ--KKMFSARKLCLVLDLDHTLLNSAK 936
            + GY D  +A IQ            E  RR+E +  +++   RKL L++DLD T++++  
Sbjct: 123  YTGYSDSARANIQMTHLAGGPTVSLEEARRIEHETAERLLKNRKLSLIVDLDQTIVHAT- 181

Query: 937  FHEVDPVHDEIL---RKKEEQDREKPH--------------------RHLFRF------P 967
               VDP   E +   +  EE    KP                     R + RF      P
Sbjct: 182  ---VDPTVGEWIAQGQAWEEYQARKPSESTTPEPDAPPEPNANWEALRDVRRFTLAHDGP 238

Query: 968  HMG-----------------MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVL 1010
            H+                   + K RPG+  FLE  S+ +EMH+YTMG + YA ++   +
Sbjct: 239  HLNHKHPWKGKEKEDEHGCLYYIKPRPGLQAFLEAISQKYEMHVYTMGTRAYAEKVCAAI 298

Query: 1011 DPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLN 1069
            DP G +F  R++SR + G           +K LE +   + S VVIIDD   VW  +  N
Sbjct: 299  DPDGRMFGRRILSRDESGS--------LTAKSLERLFPCDTSMVVIIDDRSDVWDRSP-N 349

Query: 1070 LIVVERYTYF 1079
            L+ V RY +F
Sbjct: 350  LVEVVRYDFF 359


>gi|358057984|dbj|GAA96229.1| hypothetical protein E5Q_02893 [Mixia osmundae IAM 14324]
          Length = 760

 Score = 91.7 bits (226), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 63/204 (30%), Positives = 101/204 (49%), Gaps = 43/204 (21%)

Query: 895  DQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQ 954
            +Q+ A ++   T RL +      A+KL L++DLD T++ +     VDP   + +R     
Sbjct: 183  EQEAARLEDASTTRLRK------AKKLSLIVDLDQTIIQAT----VDPTVGDWMR----- 227

Query: 955  DREKPHRH------LFRFPHM----------GMWT--KLRPGIWTFLERASKLFEMHLYT 996
            D   P+        +F+              G W   KLRPG+  FL + + L+EMH+YT
Sbjct: 228  DGTNPNHSALKDVCVFKLGTQEDKEVVADVDGCWYYLKLRPGLQAFLRKMADLYEMHVYT 287

Query: 997  MGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAV-VI 1055
            MG + YA  + +++DP G  F+ R++SR + G            K LE +   ++++ VI
Sbjct: 288  MGTRSYAMAVCRIIDPDGTYFSTRILSRDESGS--------LTRKSLERLFPCDTSMAVI 339

Query: 1056 IDDSVRVWPHNKLNLIVVERYTYF 1079
            IDD   VW H   NL+ VE + +F
Sbjct: 340  IDDRSDVW-HWSPNLVKVEPFEFF 362


>gi|256073745|ref|XP_002573189.1| rna polymerase II ctd phosphatase [Schistosoma mansoni]
 gi|360045501|emb|CCD83049.1| putative rna polymerase II ctd phosphatase [Schistosoma mansoni]
          Length = 1345

 Score = 91.7 bits (226), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 58/176 (32%), Positives = 99/176 (56%), Gaps = 27/176 (15%)

Query: 909  LEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPH 968
            L++Q+ + +ARKL L++DLD T++++      DP           Q  +  + H +R P 
Sbjct: 139  LQDQQSLLAARKLVLLVDLDQTIIHTTN----DP-----------QAFKYKNVHRYRLPG 183

Query: 969  --MGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD 1026
              +   T+LRP +   L+  S+ ++MH+ T GN++YA ++A ++DPK   F+ R++SR +
Sbjct: 184  SPLVYHTRLRPHLEKVLDCLSQYYQMHICTFGNRVYAHQLASMIDPKRRYFSQRILSRDE 243

Query: 1027 DGDPFDGDERVPKSKDLEGVL--GMESAVVIIDDSVRVWPHNKLNLIVVERYTYFP 1080
              +P      V KS +L+ +   G+ + V IIDD   VW  +  NLI V+ Y +FP
Sbjct: 244  CFNP------VTKSANLKALFPRGL-NLVCIIDDRGEVWDWSS-NLIHVKPYRFFP 291


>gi|365991295|ref|XP_003672476.1| hypothetical protein NDAI_0K00420 [Naumovozyma dairenensis CBS 421]
 gi|343771252|emb|CCD27233.1| hypothetical protein NDAI_0K00420 [Naumovozyma dairenensis CBS 421]
          Length = 778

 Score = 91.3 bits (225), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 65/187 (34%), Positives = 95/187 (50%), Gaps = 32/187 (17%)

Query: 912  QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRK------------KEEQDREKP 959
            +K++   +KL LV+DLD T+++      VDP   E  R             KE    E+P
Sbjct: 203  KKRLRDDKKLILVVDLDQTVIHCG----VDPTIGEWKRDPTNPNFETLKDVKEFALEEEP 258

Query: 960  HRHLFRF----PHMGMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
               L       P    W   K+RPG+  F ++ + LFEMH+YTM  + YA+E+AK++DP 
Sbjct: 259  ILPLMYMGPKPPARKCWYYVKVRPGLKDFFQKVAPLFEMHIYTMATRAYASEIAKIIDPT 318

Query: 1014 GVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIV 1072
            G LF  R++SR ++G           +K LE +    +S V+IIDD   VW  +  NLI 
Sbjct: 319  GDLFGNRILSRDENGS--------LTTKSLERLFPTDQSMVIIIDDRGDVWNWSP-NLIK 369

Query: 1073 VERYTYF 1079
            V  Y +F
Sbjct: 370  VIPYNFF 376


>gi|325179818|emb|CCA14221.1| conserved hypothetical protein [Albugo laibachii Nc14]
          Length = 694

 Score = 91.3 bits (225), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 58/177 (32%), Positives = 94/177 (53%), Gaps = 27/177 (15%)

Query: 913  KKMFSARKLCLVLDLDHTLLNS---AKFHEVDPV-HDEILRKKEEQDREKPHRHLFRFP- 967
            ++   A+KL LVLDLDHTLL++   A   E  P   DEI              H F+ P 
Sbjct: 147  ERQLIAKKLSLVLDLDHTLLHAVYVADLLEQRPTASDEI--------------HYFKIPG 192

Query: 968  --HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG 1025
               M    KLRPG+  FL+   + +++ +YT G ++YA  +A+++DP   LF  R+++R 
Sbjct: 193  VMTMEYVVKLRPGLHQFLKSLREQYDLFIYTHGTRIYAEAIAEIIDPDDTLFRHRIVART 252

Query: 1026 DDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCS 1082
            D  D  D      KS  L      +S ++I+DD + VW  N+ N+++++ + +F C+
Sbjct: 253  DTPD-IDH-----KSLKLLFPSCDDSMILILDDRLDVWKENEGNVLLIKPFHFFNCT 303


>gi|363752479|ref|XP_003646456.1| hypothetical protein Ecym_4610 [Eremothecium cymbalariae DBVPG#7215]
 gi|356890091|gb|AET39639.1| hypothetical protein Ecym_4610 [Eremothecium cymbalariae DBVPG#7215]
          Length = 751

 Score = 90.9 bits (224), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 68/187 (36%), Positives = 95/187 (50%), Gaps = 32/187 (17%)

Query: 912  QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDRE--KPHRH--LFRFP 967
            QK++  ARKL LV+DLD T+++      VDP   E  +  +  + E  K  R   L   P
Sbjct: 159  QKQLREARKLVLVVDLDQTVIHCG----VDPTIGEWSKDPDNPNYESLKDVRSFSLHEEP 214

Query: 968  -----HMG---------MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
                 +MG          + KLRPG+  F    +  FE+H+YTM  + YA E+AK++DP 
Sbjct: 215  VLPPFYMGPKPPTRKCWYYVKLRPGLQDFFSNIAPHFELHIYTMATRTYALEIAKIIDPD 274

Query: 1014 GVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIV 1072
            G LF  R++SR ++G            K LE +  M +S VVIIDD   VW   + NLI 
Sbjct: 275  GTLFGDRILSRDENGS--------LTQKSLERLFPMDQSMVVIIDDRGDVWNWCE-NLIK 325

Query: 1073 VERYTYF 1079
            V  Y +F
Sbjct: 326  VVPYDFF 332


>gi|328713585|ref|XP_001947680.2| PREDICTED: RNA polymerase II subunit A C-terminal domain
            phosphatase-like [Acyrthosiphon pisum]
          Length = 736

 Score = 90.9 bits (224), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 57/170 (33%), Positives = 94/170 (55%), Gaps = 20/170 (11%)

Query: 911  EQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMG 970
            ++K++   +KL L++DLD TL+++      D + + I        ++  H  L+      
Sbjct: 134  DEKRLLGDKKLVLLVDLDQTLIHTTN----DNIPNNI--------KDIHHFQLYGPNSPW 181

Query: 971  MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1030
              T+LRPG + FL   S+L+E+H+ T G + YA  +  +LDPKG LF+ RV+SR +  +P
Sbjct: 182  YHTRLRPGTYNFLSSISELYELHICTFGARNYAHTITHILDPKGKLFSHRVLSRDECFNP 241

Query: 1031 FDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
                    K+ +L+G+    ++ V IIDD   VW +  LNLI V+ Y +F
Sbjct: 242  ------NSKTGNLKGLFPCGDNMVCIIDDREDVWDY-ALNLIHVKPYHFF 284


>gi|269860082|ref|XP_002649764.1| carboxy-terminal domain (CTD) phosphatase [Enterocytozoon bieneusi
            H348]
 gi|220066823|gb|EED44294.1| carboxy-terminal domain (CTD) phosphatase [Enterocytozoon bieneusi
            H348]
          Length = 409

 Score = 90.9 bits (224), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 47/155 (30%), Positives = 77/155 (49%), Gaps = 27/155 (17%)

Query: 909  LEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPH 968
            L +  +++  +KL L LDLD TL+++                      +KP    F+  +
Sbjct: 93   LHKFYELYHNKKLILFLDLDQTLIHATL-------------------SKKPCNFSFKLHN 133

Query: 969  MGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 1028
            +  + K RPG+  FL + S+ FE H+YTMG + YA  + K+LDP  + F  R+++R ++ 
Sbjct: 134  IEFFIKKRPGLDKFLSKLSRFFEFHVYTMGTREYANYICKILDPNKIFFGDRIVTRTENN 193

Query: 1029 DPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVW 1063
              F         K LE +    + V+I+DD V VW
Sbjct: 194  KMF--------KKYLERITNFSNNVIILDDRVDVW 220


>gi|452981165|gb|EME80925.1| hypothetical protein MYCFIDRAFT_115122, partial [Pseudocercospora
            fijiensis CIRAD86]
          Length = 770

 Score = 90.9 bits (224), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 57/181 (31%), Positives = 98/181 (54%), Gaps = 29/181 (16%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLF----R 965
            E ++++  +R+L LV+DLD T+++++    V+P   E      + D   P+        +
Sbjct: 151  EGKRRLLQSRRLSLVVDLDQTIIHAS----VEPTIAEW-----QNDPSNPNYEALQDVQK 201

Query: 966  F------PHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAG 1019
            F      P+   + K RPG+  FL   S+++EMH+YTMG + YA  +AK++DP+  +F  
Sbjct: 202  FQLDDDKPNTWYYIKPRPGLKQFLSTLSEIYEMHIYTMGTRAYAESVAKIIDPEKKIFGD 261

Query: 1020 RVISRGDDGDPFDGDERVPKSKDLEGVLGMES-AVVIIDDSVRVWPHNKLNLIVVERYTY 1078
            R++SR + G           +K+L+ +  +++  VVIIDD   VW H   NLI V  + +
Sbjct: 262  RILSRNESGS--------MTAKNLKRLFPVDTRMVVIIDDRADVW-HWTSNLIKVNVFEF 312

Query: 1079 F 1079
            F
Sbjct: 313  F 313


>gi|358253094|dbj|GAA51983.1| RNA polymerase II subunit A C-terminal domain phosphatase [Clonorchis
            sinensis]
          Length = 1535

 Score = 90.5 bits (223), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 62/191 (32%), Positives = 100/191 (52%), Gaps = 31/191 (16%)

Query: 909  LEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPH 968
            L++++ + +ARKL L++DLD T+L++      DP   +  R K         R+      
Sbjct: 175  LQDEQSLLAARKLVLLVDLDETVLHTTN----DP---QAYRYKNVS------RYCLPGSP 221

Query: 969  MGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 1028
            +   T  RP +   L+R SK ++MH+ T GN++YA ++A ++DPK   F+ R++SR +  
Sbjct: 222  LVYHTSFRPHLKAVLDRLSKYYQMHICTFGNRMYAHQLAGMIDPKRRYFSHRILSRDECF 281

Query: 1029 DPFDGDERVPKSKDLEGVL--GMESAVVIIDDSVRVW---PHNKLNLIVVERYTYF--PC 1081
            +P      V KS +L+ +   G+ + V IIDD   VW   PH    LI V+ Y +F   C
Sbjct: 282  NP------VTKSANLKALFPRGL-NLVCIIDDRGEVWEWSPH----LIQVKPYRFFQDAC 330

Query: 1082 SRRQFGLLGPS 1092
              + F    PS
Sbjct: 331  DTKHFAWSSPS 341


>gi|452840538|gb|EME42476.1| hypothetical protein DOTSEDRAFT_73343 [Dothistroma septosporum NZE10]
          Length = 855

 Score = 90.1 bits (222), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 58/184 (31%), Positives = 98/184 (53%), Gaps = 32/184 (17%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPH----RHLFR 965
            E ++++  AR+L LV+DLD T++++     V+P   E      + D   P+    + + +
Sbjct: 152  EAKRRLLEARRLSLVVDLDQTVIHAC----VEPTIGEW-----QSDPTNPNHEAVKDVCK 202

Query: 966  F---------PHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 1016
            F         P    + KLRPG+  FL   S+ +EMH+YTMG + YA  +AK++DP   +
Sbjct: 203  FQLADDAPGRPGTWYYIKLRPGLKEFLTTMSQYYEMHIYTMGTRAYAENIAKIIDPDRSV 262

Query: 1017 FAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVER 1075
            F  R++SR + G          ++K+L+ +  +++  VVIIDD   VW     NLI V+ 
Sbjct: 263  FGDRILSRDESGS--------MQAKNLKRLFPVDTKMVVIIDDRADVWSWIS-NLIKVKV 313

Query: 1076 YTYF 1079
            + +F
Sbjct: 314  FEFF 317


>gi|6689545|emb|CAB65510.1| FCP1 serine phosphatase [Xenopus laevis]
          Length = 867

 Score = 90.1 bits (222), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 58/171 (33%), Positives = 91/171 (53%), Gaps = 20/171 (11%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
            E+Q ++   +KL L++DLD TL+++ + H        I            H  L R   M
Sbjct: 53   EDQFRLHRNQKLVLMVDLDQTLIHTTEQHCQHMSRKGIF-----------HFQLGRGEPM 101

Query: 970  GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
             + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 102  -LHTRLRPHCKEFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 160

Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            P+       K+ +L  +    +S V IIDD   VW     NLI V++   F
Sbjct: 161  PYS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKMCIF 204


>gi|198438317|ref|XP_002131972.1| PREDICTED: similar to MGC81710 protein [Ciona intestinalis]
          Length = 895

 Score = 89.7 bits (221), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 57/176 (32%), Positives = 96/176 (54%), Gaps = 25/176 (14%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP-H 968
            +++ ++    KL L++DLD TL+++ +      +        EE+D        F F  H
Sbjct: 134  QDKSRLHKLNKLVLLVDLDQTLIHTTQNQAFAAMC------SEEKD-------FFTFQLH 180

Query: 969  MG---MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG 1025
                 ++TKLRP    FL+  SK +E+ + T G++LYA ++A+ +DPK   FA R++SR 
Sbjct: 181  KNEPTLYTKLRPYCREFLQEISKCYELQVVTFGSRLYAHKIAEFIDPKKKFFANRILSRD 240

Query: 1026 DDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYFP 1080
            +  +P      + KS +L  +    +S V IIDD   VW  +  NL++V++Y+YFP
Sbjct: 241  ECINP------MKKSGNLRHLFPCGDSMVCIIDDRDDVWS-SAPNLVMVKKYSYFP 289


>gi|367004465|ref|XP_003686965.1| hypothetical protein TPHA_0I00240 [Tetrapisispora phaffii CBS 4417]
 gi|357525268|emb|CCE64531.1| hypothetical protein TPHA_0I00240 [Tetrapisispora phaffii CBS 4417]
          Length = 732

 Score = 89.7 bits (221), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 62/192 (32%), Positives = 93/192 (48%), Gaps = 42/192 (21%)

Query: 912  QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFR------ 965
            + ++  ++KL LV+DLD T+++      VDP   E      + D   P+    R      
Sbjct: 159  KSRLIRSKKLILVVDLDQTVIHCG----VDPTISEW-----KNDPSNPNYETLRNVKSFV 209

Query: 966  ------FPHMGM-----------WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAK 1008
                   P M M           + K+RPG+  F E+ + ++EMH+YTM  + YA E+AK
Sbjct: 210  LEEEAILPPMYMGPKPPVHKCSYYVKVRPGLKEFFEKVAPIYEMHIYTMATRAYAEEIAK 269

Query: 1009 VLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNK 1067
            ++DP G LF  R++SR ++G            K LE +    +S VVIIDD   VW  + 
Sbjct: 270  IIDPDGSLFGNRILSRDENGS--------LTHKSLERLFPTDQSMVVIIDDRGDVWNWSP 321

Query: 1068 LNLIVVERYTYF 1079
             NLI V  Y +F
Sbjct: 322  -NLIKVTPYNFF 332


>gi|353236741|emb|CCA68729.1| related to FCP1-TFIIF interacting component of CTD phosphatase
            [Piriformospora indica DSM 11827]
          Length = 782

 Score = 89.7 bits (221), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 68/221 (30%), Positives = 103/221 (46%), Gaps = 43/221 (19%)

Query: 886  VEHLFEGYDDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHD 945
            ++H  EG    Q+ A + ER    E   ++   RKL L++DLD T+L++      DP   
Sbjct: 126  MDHGAEGPLLSQEVAAKIER----ENTDRLLKNRKLSLIVDLDQTILHAT----FDPTVG 177

Query: 946  EILRKKEEQDREKPHRHL-------------------FRFP----HMG----MWTKLRPG 978
            E ++ K+  ++ +                        F+ P    HMG     + K RPG
Sbjct: 178  EWIKAKDAFEKRRSTTPPDHDPPPESVNWPALEDVISFQLPSDHGHMGHSERYYVKPRPG 237

Query: 979  IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVP 1038
            +  F+   S+L+EMH+YTMG + YA  +   LDP G  F  RV+SR +      G +RV 
Sbjct: 238  LQRFMNNLSELYEMHVYTMGVRSYANAICAALDPSGAWFGSRVLSRNE-----SGSDRVK 292

Query: 1039 KSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
              K L      +S VV+IDD   VW  +  NL+ V  + +F
Sbjct: 293  NLKRL--FPSDQSMVVVIDDRADVWNWSP-NLVRVIPFEFF 330


>gi|410076480|ref|XP_003955822.1| hypothetical protein KAFR_0B03910 [Kazachstania africana CBS 2517]
 gi|372462405|emb|CCF56687.1| hypothetical protein KAFR_0B03910 [Kazachstania africana CBS 2517]
          Length = 724

 Score = 89.7 bits (221), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 61/186 (32%), Positives = 98/186 (52%), Gaps = 30/186 (16%)

Query: 912  QKKMFSARKLCLVLDLDHTLLNSA--------KFHEVDPVHDEILRKKEEQDREKPHRHL 963
            +K++ + +KL LV+DLD T+++          K    +P +D +   +     E+P   +
Sbjct: 161  KKRLRNEKKLVLVVDLDQTVIHCGVDPTIGEWKSDPNNPNYDTLKDVQMFALEEEP---V 217

Query: 964  FRFPHMG---------MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKG 1014
              F +MG          + K+RPG+  F ++ + LFEMH+YTM  + YA E+ K++DP G
Sbjct: 218  LPFMYMGPKPTPRKCWYYVKVRPGLKEFFKKVAPLFEMHIYTMATRAYALEITKIIDPTG 277

Query: 1015 VLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVV 1073
             LF  R++SR ++G           SK LE +    +S V+IIDD   VW  +  NLI V
Sbjct: 278  ELFGNRILSRDENGS--------LTSKSLERLFPTDQSMVIIIDDRGDVWNWSP-NLIKV 328

Query: 1074 ERYTYF 1079
              Y++F
Sbjct: 329  VPYSFF 334


>gi|403222664|dbj|BAM40795.1| uncharacterized protein TOT_030000057 [Theileria orientalis strain
            Shintoku]
          Length = 656

 Score = 89.7 bits (221), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 59/178 (33%), Positives = 87/178 (48%), Gaps = 27/178 (15%)

Query: 911  EQKKMFSARKLCLVLDLDHTLLNSAK--------FHEVDPVHDEILRKKEEQDREKPHRH 962
            E  K    RKLCLVLDLD+TL+++             ++     +L+     + E  + +
Sbjct: 188  EITKYLEDRKLCLVLDLDNTLVHATSQSPPADIDVETIEISSSSVLKTIVYNETETSYCN 247

Query: 963  LFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVI 1022
             F         KLRPGI+ F    SK +++ L+TMG + +A    ++LDP+GV F  RV 
Sbjct: 248  SF--------FKLRPGIFKFFRSVSKRYKLFLFTMGTRQHAQSALRILDPQGVYFGNRVF 299

Query: 1023 SRGDDGDPFDGDERV-PKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
             R D        +R+ P  K+L         V+++DDS  VW  +KL LI V  Y YF
Sbjct: 300  CRNDSRSCMKSLDRLFPNHKNL---------VLVMDDSEYVWT-SKLALIKVHPYYYF 347


>gi|195353179|ref|XP_002043083.1| GM11819 [Drosophila sechellia]
 gi|194127171|gb|EDW49214.1| GM11819 [Drosophila sechellia]
          Length = 874

 Score = 89.4 bits (220), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 59/172 (34%), Positives = 97/172 (56%), Gaps = 22/172 (12%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
            ++ +++ + RKL L++DLD T++++      D V D I        +   H  L+  PH 
Sbjct: 191  DDTRRLLADRKLVLLVDLDQTVIHTTN----DTVPDNI--------KGIYHFQLYG-PHS 237

Query: 970  GMW-TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 1028
              + T+LRPG   FLER S+L+E+H+ T G + YA  +A++LDP+G  F+ R++SR    
Sbjct: 238  PWYHTRLRPGTAEFLERMSQLYELHICTFGARNYAHMIAQLLDPEGKFFSHRILSR---D 294

Query: 1029 DPFDGDERVPKSKDLEGVL-GMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            + F+      K+ +L+ +    +S V IIDD   VW +   NLI V+ Y +F
Sbjct: 295  ECFNA---TSKTDNLKALFPNGDSMVCIIDDREDVW-NMASNLIQVKPYHFF 342


>gi|195586452|ref|XP_002082988.1| GD24941 [Drosophila simulans]
 gi|194194997|gb|EDX08573.1| GD24941 [Drosophila simulans]
          Length = 877

 Score = 89.4 bits (220), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 59/172 (34%), Positives = 97/172 (56%), Gaps = 22/172 (12%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
            ++ +++ + RKL L++DLD T++++      D V D I        +   H  L+  PH 
Sbjct: 194  DDTRRLLADRKLVLLVDLDQTVIHTTN----DTVPDNI--------KGIYHFQLYG-PHS 240

Query: 970  GMW-TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 1028
              + T+LRPG   FLER S+L+E+H+ T G + YA  +A++LDP+G  F+ R++SR    
Sbjct: 241  PWYHTRLRPGTAEFLERMSQLYELHICTFGARNYAHMIAQLLDPEGKFFSHRILSR---D 297

Query: 1029 DPFDGDERVPKSKDLEGVL-GMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            + F+      K+ +L+ +    +S V IIDD   VW +   NLI V+ Y +F
Sbjct: 298  ECFNA---TSKTDNLKALFPNGDSMVCIIDDREDVW-NMASNLIQVKPYHFF 345


>gi|320581076|gb|EFW95298.1| RNA Pol II CTD phosphatase component, putative [Ogataea
            parapolymorpha DL-1]
          Length = 743

 Score = 89.4 bits (220), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 78/249 (31%), Positives = 117/249 (46%), Gaps = 54/249 (21%)

Query: 885  DVEHL-FEGYDDQQKAAIQ--------KERTRRLEE-----QKKMFSARKLCLVLDLDHT 930
            +VE L + G++D+ +A I         K  T+  E       +++    KL LV+DLD T
Sbjct: 122  NVEDLDYTGFNDKDRAPISMSHGTTNLKVSTKEAENIERSSTQRLLKEEKLSLVVDLDQT 181

Query: 931  LLNSAKFHEVDPVHDEILRKK----------------EEQDREKPHRHLFRFPHMGMW-- 972
            ++++     VDP   E +                   EE+    P+    + P    W  
Sbjct: 182  VIHAT----VDPTVGEWMSDPTNPNYESIKDVRSFCLEEEPILPPNYKGPKPPSHKRWYY 237

Query: 973  TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFD 1032
             KLRPG+  FLE+ SKL+E+H+YTM  + YA  +AK++DP G+ F  R++SR + G    
Sbjct: 238  VKLRPGLQEFLEKVSKLYELHIYTMATRSYAKSIAKIIDPDGIYFGDRILSRDESGS--- 294

Query: 1033 GDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYTYF--------PCSR 1083
                    K L+ +  ++ S VV+IDD   VW  +  NLI V  Y +F            
Sbjct: 295  -----LTQKTLKRLFPVDTSMVVVIDDRGDVWNWSP-NLIKVVPYDFFVGIGDINSSFLP 348

Query: 1084 RQFGLLGPS 1092
            RQ  LLGPS
Sbjct: 349  RQSTLLGPS 357


>gi|24762673|ref|NP_611934.1| Fcp1 [Drosophila melanogaster]
 gi|7291810|gb|AAF47230.1| Fcp1 [Drosophila melanogaster]
          Length = 880

 Score = 89.4 bits (220), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 59/172 (34%), Positives = 97/172 (56%), Gaps = 22/172 (12%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
            ++ +++ + RKL L++DLD T++++      D V D I        +   H  L+  PH 
Sbjct: 197  DDTRRLLADRKLVLLVDLDQTVIHTTN----DTVPDNI--------KGIYHFQLYG-PHS 243

Query: 970  GMW-TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 1028
              + T+LRPG   FLER S+L+E+H+ T G + YA  +A++LDP+G  F+ R++SR    
Sbjct: 244  PWYHTRLRPGTAEFLERMSQLYELHICTFGARNYAHMIAQLLDPEGKFFSHRILSR---D 300

Query: 1029 DPFDGDERVPKSKDLEGVL-GMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            + F+      K+ +L+ +    +S V IIDD   VW +   NLI V+ Y +F
Sbjct: 301  ECFNA---TSKTDNLKALFPNGDSMVCIIDDREDVW-NMASNLIQVKPYHFF 348


>gi|194886507|ref|XP_001976627.1| GG19916 [Drosophila erecta]
 gi|190659814|gb|EDV57027.1| GG19916 [Drosophila erecta]
          Length = 876

 Score = 89.4 bits (220), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 59/172 (34%), Positives = 97/172 (56%), Gaps = 22/172 (12%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
            ++ +++ + RKL L++DLD T++++      D V D I        +   H  L+  PH 
Sbjct: 193  DDTRRLLADRKLVLLVDLDQTVIHTTN----DTVPDNI--------KGIYHFQLYG-PHS 239

Query: 970  GMW-TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 1028
              + T+LRPG   FLER S+L+E+H+ T G + YA  +A++LDP+G  F+ R++SR    
Sbjct: 240  PWYHTRLRPGTAEFLERMSQLYELHICTFGARNYAHMIAQLLDPEGKFFSHRILSR---D 296

Query: 1029 DPFDGDERVPKSKDLEGVL-GMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            + F+      K+ +L+ +    +S V IIDD   VW +   NLI V+ Y +F
Sbjct: 297  ECFNA---TSKTDNLKALFPNGDSMVCIIDDREDVW-NMASNLIQVKPYHFF 344


>gi|195489702|ref|XP_002092848.1| GE11441 [Drosophila yakuba]
 gi|194178949|gb|EDW92560.1| GE11441 [Drosophila yakuba]
          Length = 879

 Score = 89.4 bits (220), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 59/172 (34%), Positives = 97/172 (56%), Gaps = 22/172 (12%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
            ++ +++ + RKL L++DLD T++++      D V D I        +   H  L+  PH 
Sbjct: 196  DDTRRLLADRKLVLLVDLDQTVIHTTN----DTVPDNI--------KGIYHFQLYG-PHS 242

Query: 970  GMW-TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 1028
              + T+LRPG   FLER S+L+E+H+ T G + YA  +A++LDP+G  F+ R++SR    
Sbjct: 243  PWYHTRLRPGTAEFLERMSQLYELHICTFGARNYAHMIAQLLDPEGKFFSHRILSR---D 299

Query: 1029 DPFDGDERVPKSKDLEGVL-GMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            + F+      K+ +L+ +    +S V IIDD   VW +   NLI V+ Y +F
Sbjct: 300  ECFNA---TSKTDNLKALFPNGDSMVCIIDDREDVW-NMASNLIQVKPYHFF 347


>gi|21483550|gb|AAM52750.1| SD01014p [Drosophila melanogaster]
          Length = 896

 Score = 89.0 bits (219), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 59/172 (34%), Positives = 97/172 (56%), Gaps = 22/172 (12%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
            ++ +++ + RKL L++DLD T++++      D V D I        +   H  L+  PH 
Sbjct: 213  DDTRRLLADRKLVLLVDLDQTVIHTTN----DTVPDNI--------KGIYHFQLYG-PHS 259

Query: 970  GMW-TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 1028
              + T+LRPG   FLER S+L+E+H+ T G + YA  +A++LDP+G  F+ R++SR    
Sbjct: 260  PWYHTRLRPGTAEFLERMSQLYELHICTFGARNYAHMIAQLLDPEGKFFSHRILSR---D 316

Query: 1029 DPFDGDERVPKSKDLEGVL-GMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            + F+      K+ +L+ +    +S V IIDD   VW +   NLI V+ Y +F
Sbjct: 317  ECFNA---TSKTDNLKALFPNGDSMVCIIDDREDVW-NMASNLIQVKPYHFF 364


>gi|396499223|ref|XP_003845421.1| similar to RNA polymerase II subunit A C-terminal domain phosphatase
            [Leptosphaeria maculans JN3]
 gi|312222002|emb|CBY01942.1| similar to RNA polymerase II subunit A C-terminal domain phosphatase
            [Leptosphaeria maculans JN3]
          Length = 887

 Score = 88.6 bits (218), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 51/177 (28%), Positives = 91/177 (51%), Gaps = 20/177 (11%)

Query: 912  QKKMFSARKLCLVLDLDHTLLNSAKFHEV--------DPVHDEILRKKEEQDREKPHRHL 963
            +K++  A+KL L++DLD T++++     +        +P H  +   K+ +  +    ++
Sbjct: 235  KKRLLGAKKLTLIVDLDQTVIHTTCERTIAEWQADPENPNHGAV---KDVEGFQLADDNV 291

Query: 964  FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 1023
                    + K RPG+  F +R SKL+EMH+YTM  + YA  + K++DP    F  R++S
Sbjct: 292  SNVAANWYYVKKRPGLEDFFKRMSKLYEMHVYTMATRAYAQAVCKIIDPDRRYFGDRILS 351

Query: 1024 RGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFP 1080
            R ++           K+K L  +    + VVIIDD   VW ++  +L+ V  + +FP
Sbjct: 352  RDEN--------YTDKTKSLSRLFQNTTMVVIIDDRADVWQYSP-HLVRVPVFNFFP 399


>gi|195029035|ref|XP_001987380.1| GH21892 [Drosophila grimshawi]
 gi|193903380|gb|EDW02247.1| GH21892 [Drosophila grimshawi]
          Length = 889

 Score = 88.6 bits (218), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 58/173 (33%), Positives = 92/173 (53%), Gaps = 24/173 (13%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
            ++ +++ + RKL L++DLD T++++      D V D I          K   H   +   
Sbjct: 174  DDTRRLLADRKLVLLVDLDQTVIHTTN----DTVPDNI----------KGIYHFQLYGPQ 219

Query: 970  GMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD 1027
              W  T+LRPG   FLER S+L+E+H+ T G + YA  +A++LDP G  F+ R++SR   
Sbjct: 220  SPWYHTRLRPGTAEFLERMSQLYELHICTFGARNYAHMIAQLLDPDGKFFSHRILSR--- 276

Query: 1028 GDPFDGDERVPKSKDLEGVL-GMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
             + F+      K+ +L+ +    +S V IIDD   VW     NLI V+ Y +F
Sbjct: 277  DECFNA---TSKTDNLKALFPNGDSMVCIIDDREDVWSMAS-NLIQVKPYHFF 325


>gi|125541462|gb|EAY87857.1| hypothetical protein OsI_09279 [Oryza sativa Indica Group]
          Length = 390

 Score = 88.6 bits (218), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 76/214 (35%), Positives = 106/214 (49%), Gaps = 35/214 (16%)

Query: 915  MFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTK 974
            +  ARKL LV+DLDHTL+NS   +++     E +    E      H              
Sbjct: 89   LLRARKLILVVDLDHTLVNSTADYDISGT--EYVNGLAELLVLGVHHQA---------QA 137

Query: 975  LRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGD 1034
            +RP +    ER  +  +  +YT+G++ YA  +AK+LDP+GV F  R+ISR  D  P    
Sbjct: 138  VRPWLPARSERHVR--DARVYTLGDRDYAAAVAKLLDPEGVYFGERIISR--DESP---- 189

Query: 1035 ERVPKSKDLEGVLG-------MESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFG 1087
               P  K L+ V G         +AVVI+DD+  VW  N  NLI +ERY YF  S R FG
Sbjct: 190  --QPDRKSLDVVFGSAPASAAERAAVVILDDTAEVWEGNSDNLIEMERYHYFASSCRDFG 247

Query: 1088 LLGPSLLEIDH--DERSEDGT-LASSLGVRQQLH 1118
                S  E  H   ER  D +  A++L V +++H
Sbjct: 248  ----SPWECTHSLSERGVDESERAAALRVLRRVH 277


>gi|21914376|gb|AAM81360.1|AF522873_3 RNA polymerase II C-terminal domain phosphatase component
            [Leptosphaeria maculans]
          Length = 804

 Score = 88.2 bits (217), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 51/177 (28%), Positives = 91/177 (51%), Gaps = 20/177 (11%)

Query: 912  QKKMFSARKLCLVLDLDHTLLNSAKFHEV--------DPVHDEILRKKEEQDREKPHRHL 963
            +K++  A+KL L++DLD T++++     +        +P H  +   K+ +  +    ++
Sbjct: 152  KKRLLGAKKLTLIVDLDQTVIHTTCERTIAEWQADPENPNHGAV---KDVEGFQLADDNV 208

Query: 964  FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 1023
                    + K RPG+  F +R SKL+EMH+YTM  + YA  + K++DP    F  R++S
Sbjct: 209  SNVAANWYYVKKRPGLEDFFKRMSKLYEMHVYTMATRAYAQAVCKIIDPDRRYFGDRILS 268

Query: 1024 RGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFP 1080
            R ++           K+K L  +    + VVIIDD   VW ++  +L+ V  + +FP
Sbjct: 269  RDEN--------YTDKTKSLSRLFQNTTMVVIIDDRADVWQYSP-HLVRVPVFNFFP 316


>gi|164658688|ref|XP_001730469.1| hypothetical protein MGL_2265 [Malassezia globosa CBS 7966]
 gi|159104365|gb|EDP43255.1| hypothetical protein MGL_2265 [Malassezia globosa CBS 7966]
          Length = 364

 Score = 88.2 bits (217), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 63/212 (29%), Positives = 96/212 (45%), Gaps = 47/212 (22%)

Query: 901  IQKERTRRL--EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPV-----HD-------- 945
            + +E   R+  E+ + +   RKL L++DLD T+++      VDP      HD        
Sbjct: 22   VSREEAIRMDSEDTRHLIEQRKLALIVDLDQTIIHVT----VDPTVKEWAHDPKNPNWCM 77

Query: 946  --------------EILRKKEEQDREKPHRHLFRFPHMGMW--TKLRPGIWTFLERASKL 989
                           +  + E  D+             G W   KLRPG+  FL+  S +
Sbjct: 78   LKDVVAFQLGSDGKTVSHQPERMDQHDVKSFATDGDENGCWYYVKLRPGLQAFLQSVSPM 137

Query: 990  FEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDG--DERVPKSKDLEGVL 1047
            +EMH+YTMG + YA  + +++DP G LF  R++SR ++G+          P S D+    
Sbjct: 138  YEMHVYTMGTRSYADCICRIVDPDGHLFGARILSRDENGNEVQKSLSRLFPISTDM---- 193

Query: 1048 GMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
                 VV+IDD   VW  +  NLI VE Y +F
Sbjct: 194  -----VVVIDDRADVWSWSP-NLIKVEPYEFF 219


>gi|367009794|ref|XP_003679398.1| hypothetical protein TDEL_0B00580 [Torulaspora delbrueckii]
 gi|359747056|emb|CCE90187.1| hypothetical protein TDEL_0B00580 [Torulaspora delbrueckii]
          Length = 713

 Score = 88.2 bits (217), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 61/187 (32%), Positives = 92/187 (49%), Gaps = 32/187 (17%)

Query: 912  QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRK---------KEEQDREKPHRH 962
            + ++  ++KL LV+DLD T+++      VDP   E  R          K+ Q        
Sbjct: 152  KTRLRESKKLVLVVDLDQTVIHCG----VDPTIGEWKRDSSNPNYEALKDVQSFALDEEP 207

Query: 963  LFRFPHMG---------MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
            +    +MG          + K+RPG+  F ++ + LFEMH+YTM  + YA E+AK++DP 
Sbjct: 208  ILPLLYMGPKPPVRKCWYYVKVRPGLKEFFDKVAPLFEMHIYTMATRAYALEIAKIIDPD 267

Query: 1014 GVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIV 1072
            G LF  R++SR ++G            K LE +    +S VV+IDD   VW     NLI 
Sbjct: 268  GSLFGDRILSRDENGS--------ITQKSLERLFPTDQSMVVVIDDRGDVWNWCP-NLIK 318

Query: 1073 VERYTYF 1079
            V  Y +F
Sbjct: 319  VVPYNFF 325


>gi|302306421|ref|NP_982820.2| ABL127Wp [Ashbya gossypii ATCC 10895]
 gi|299788508|gb|AAS50644.2| ABL127Wp [Ashbya gossypii ATCC 10895]
 gi|374106022|gb|AEY94932.1| FABL127Wp [Ashbya gossypii FDAG1]
          Length = 728

 Score = 88.2 bits (217), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 64/187 (34%), Positives = 92/187 (49%), Gaps = 32/187 (17%)

Query: 912  QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRK---------KEEQDREKPHRH 962
            Q K+  ARKL LV+DLD T+++      VDP   E  +          K+ Q        
Sbjct: 157  QLKLREARKLVLVVDLDQTVIHCG----VDPTIGEWSKDPNNPNYEALKDVQSFSLDEEP 212

Query: 963  LFRFPHMG---------MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
            +    +MG          + KLRPG+  F  + +  FE+H+YTM  + YA E+AK++DP 
Sbjct: 213  VLPPFYMGPKPPTRKCWYYVKLRPGLKEFFAKIAPHFELHIYTMATRAYALEIAKIIDPD 272

Query: 1014 GVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIV 1072
            G LF  R++SR ++G            K LE +  M +S VV+IDD   VW   + NLI 
Sbjct: 273  GKLFGDRILSRDENGS--------LTQKSLERLFPMDQSMVVVIDDRGDVWNWCE-NLIK 323

Query: 1073 VERYTYF 1079
            V  Y +F
Sbjct: 324  VVPYDFF 330


>gi|336374248|gb|EGO02585.1| hypothetical protein SERLA73DRAFT_102556 [Serpula lacrymans var.
            lacrymans S7.3]
          Length = 811

 Score = 87.8 bits (216), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 65/217 (29%), Positives = 100/217 (46%), Gaps = 40/217 (18%)

Query: 890  FEGYDDQQKAAIQKER-----TRRLEEQKK--------MFSARKLCLVLDLDHTLLNSAK 936
            + G+ D  +A+IQ        T  LEE +K        + ++RKL L++DLD T++++  
Sbjct: 117  YTGFSDASRASIQMTHSAFGPTVSLEEAQKIEKETADHLLNSRKLSLIVDLDQTIVHAT- 175

Query: 937  FHEVDPV-------------HDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFL 983
               VDP              + E L+   +    K  +          + K RPG   FL
Sbjct: 176  ---VDPTVATDSESDDECNPNWEALKDVRKFQLVKGKQKFIENEGCMYYIKPRPGWQHFL 232

Query: 984  ERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDL 1043
               +  +EMH+YTMG + YA E+   +DP G +F GR++SR + G            K L
Sbjct: 233  HSIANKYEMHVYTMGTRAYAEEVCAAIDPDGTIFGGRILSRDESGS--------LTQKSL 284

Query: 1044 EGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            + +   + S VVIIDD   VW  +  NL+ V  Y +F
Sbjct: 285  QRLFPCDTSMVVIIDDRADVWEWSP-NLVKVIPYDFF 320


>gi|366991271|ref|XP_003675401.1| hypothetical protein NCAS_0C00420 [Naumovozyma castellii CBS 4309]
 gi|342301266|emb|CCC69032.1| hypothetical protein NCAS_0C00420 [Naumovozyma castellii CBS 4309]
          Length = 725

 Score = 87.8 bits (216), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 62/195 (31%), Positives = 97/195 (49%), Gaps = 42/195 (21%)

Query: 909  LEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFR--- 965
            L  + ++   +KL LV+DLD T+++      VDP   E      + D + P+    +   
Sbjct: 157  LNVRTRLRKEKKLVLVVDLDQTVIHCG----VDPTIGEW-----KNDPKNPNFETLKDVK 207

Query: 966  ---------FPHMGM-----------WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATE 1005
                      P + M           + K+RPG+  FLE+ + LFEMH+YTM  + YA+E
Sbjct: 208  QFSLEEEPILPTLYMGPKPPLRKCWYYVKVRPGLKEFLEKIAPLFEMHIYTMATRAYASE 267

Query: 1006 MAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWP 1064
            +AK++DP G LF  R++SR ++G           +K LE +    +S V++IDD   VW 
Sbjct: 268  IAKIIDPNGDLFGDRILSRDENGS--------MTTKSLERLFPTDQSMVIVIDDRGDVWN 319

Query: 1065 HNKLNLIVVERYTYF 1079
             +  NLI V  Y +F
Sbjct: 320  WSP-NLIKVVPYNFF 333


>gi|317027693|ref|XP_001399857.2| RNA polymerase II subunit A C-terminal domain phosphatase
            [Aspergillus niger CBS 513.88]
          Length = 800

 Score = 87.8 bits (216), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 58/188 (30%), Positives = 97/188 (51%), Gaps = 32/188 (17%)

Query: 895  DQQKAAIQKERTRRLEE--QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKE 952
            D     + ++   R+EE  ++++ + RKL LV+DLD T++++     VDP   E +  KE
Sbjct: 132  DNTTLTVSEQEATRVEEDAKRRLLANRKLSLVVDLDQTIIHAT----VDPTVGEWMEDKE 187

Query: 953  EQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDP 1012
              + +   R          W +      +FL+  S+++E+H+YTMG + YA  +A ++DP
Sbjct: 188  NPNYQASER----------WLE------SFLQNVSEMYELHIYTMGTRSYAQHIASIIDP 231

Query: 1013 KGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLI 1071
               LF  R++SR + G           +K+L  +  +++  VVIIDD   VW  N  NLI
Sbjct: 232  DRKLFGDRILSRDESGSLV--------AKNLHRLFPVDTKMVVIIDDRGDVWRWNP-NLI 282

Query: 1072 VVERYTYF 1079
             V  Y +F
Sbjct: 283  KVSPYDFF 290


>gi|291234950|ref|XP_002737409.1| PREDICTED: RNA polymerase II ctd phosphatase, putative-like
            [Saccoglossus kowalevskii]
          Length = 896

 Score = 87.8 bits (216), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 54/172 (31%), Positives = 96/172 (55%), Gaps = 21/172 (12%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
            E+++++  +RKL  ++DLD T++++     +D V + +        ++  H  L+  P  
Sbjct: 169  EDEQRLLKSRKLVCIVDLDQTIIHTT----MDNVPENL--------KDVYHFQLWSGPQY 216

Query: 970  GMW-TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 1028
              + T++RP    FLE+ SKL+E+H++T G +LYA  +A  +DP   LF+ R++SR    
Sbjct: 217  PWFHTRIRPKCKEFLEKISKLYELHIFTFGARLYAHMIAGFIDPDKKLFSHRIVSR---D 273

Query: 1029 DPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            + FD      K+ +L+ +    ++ V IIDD   VW +   N+I V+ Y YF
Sbjct: 274  ECFDAS---SKTANLQAIFPCGDNMVCIIDDREDVW-NFAPNMIHVKPYHYF 321


>gi|388580688|gb|EIM21001.1| FCP1-like phosphatase [Wallemia sebi CBS 633.66]
          Length = 510

 Score = 86.7 bits (213), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 61/204 (29%), Positives = 98/204 (48%), Gaps = 43/204 (21%)

Query: 893  YDDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKE 952
            YD+ Q+     + T        +  + KL L++DLD T++++     VDP  +E+L    
Sbjct: 46   YDEAQRIGKTSKHT--------LLKSSKLALIVDLDQTIIHAT----VDPTVNELL---- 89

Query: 953  EQDREKPHR------HLFRFPHMGM----------WTKLRPGIWTFLERASKLFEMHLYT 996
             QD    ++      H F+    G+          + K RPG+  FL+  +KLFEMH+YT
Sbjct: 90   -QDPTLVYKGALNDVHKFKLGDFGLVNHHEFGSWYFVKFRPGLMEFLDNMNKLFEMHVYT 148

Query: 997  MGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVI 1055
            MG + YA  + +++DP G  F  R++SR + G            K L+ +   + S  VI
Sbjct: 149  MGTRSYALAICQLIDPSGKYFGERILSRDESGS--------FTQKSLQRLFPTDTSMCVI 200

Query: 1056 IDDSVRVWPHNKLNLIVVERYTYF 1079
            IDD   VW  +  NL+ V  + +F
Sbjct: 201  IDDRADVWG-DSPNLVKVIPFEFF 223


>gi|195121496|ref|XP_002005256.1| GI20391 [Drosophila mojavensis]
 gi|193910324|gb|EDW09191.1| GI20391 [Drosophila mojavensis]
          Length = 880

 Score = 86.3 bits (212), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 57/173 (32%), Positives = 93/173 (53%), Gaps = 24/173 (13%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
            ++ +++ + RKL L++DLD T++++      D V D I          K   H   +   
Sbjct: 178  DDTRRLLTDRKLVLLVDLDQTVIHTTN----DTVPDNI----------KGIYHFQLYGPQ 223

Query: 970  GMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD 1027
              W  T+LRPG   FLE+ S+L+E+H+ T G + YA  +A++LDP G  F+ R++SR   
Sbjct: 224  SPWYHTRLRPGTAEFLEKMSELYELHICTFGARNYAHMIAQLLDPDGKFFSHRILSR--- 280

Query: 1028 GDPFDGDERVPKSKDLEGVL-GMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
             + F+      K+ +L+ +    +S V IIDD   VW +   NLI V+ Y +F
Sbjct: 281  DECFNA---TSKTDNLKALFPNGDSMVCIIDDREDVW-NMASNLIQVKPYHFF 329


>gi|167520468|ref|XP_001744573.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163776904|gb|EDQ90522.1| predicted protein [Monosiga brevicollis MX1]
          Length = 858

 Score = 86.3 bits (212), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 57/171 (33%), Positives = 91/171 (53%), Gaps = 18/171 (10%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
            E   K+  ARKL L+LDLD TL++S     +D +    LR   E   +  H   F     
Sbjct: 56   ENANKLLEARKLILILDLDKTLIHST----IDSIASHWLR---EGVYDIFH---FDLGKH 105

Query: 970  GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
              +TK+RPG+  FLE     +EMH+YTMG + YA  + +++DP    F+ R++++ D+  
Sbjct: 106  TYYTKVRPGLHAFLEDLYPYYEMHIYTMGRRNYAERILRIIDPSNRFFSTRILTQ-DESF 164

Query: 1030 PFDGDERVPKSKDLEGVL-GMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
              +      K+K+L+ +L G +S  VI+DD   VW   + N++    Y +F
Sbjct: 165  SIEN-----KAKNLDALLPGGDSMAVILDDLPAVWDF-QTNVVPALPYEFF 209


>gi|254586061|ref|XP_002498598.1| ZYRO0G14168p [Zygosaccharomyces rouxii]
 gi|238941492|emb|CAR29665.1| ZYRO0G14168p [Zygosaccharomyces rouxii]
          Length = 764

 Score = 86.3 bits (212), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 60/187 (32%), Positives = 92/187 (49%), Gaps = 32/187 (17%)

Query: 912  QKKMFSARKLCLVLDLDHTLLNSAKFHEVDP----------------VHDEILRKKEEQD 955
            ++++  ++KL LV+DLD T+++      VDP                + D  +   EE+ 
Sbjct: 156  KQRLRQSKKLVLVVDLDQTVIHCG----VDPTIGEWKKDPSNPNYETLKDVQMFSLEEEP 211

Query: 956  REKPHRHLFRFPHMGMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
               P     R P    W   K+RPG+  F  + + L+EMH+YTM  + YA E+AK++DP 
Sbjct: 212  IVPPMYMGPRLPERKCWYFVKVRPGLREFFAQLAPLYEMHIYTMATRTYALEIAKIIDPD 271

Query: 1014 GVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIV 1072
            G LF  R++SR ++G            K LE +    +S V++IDD   VW     NLI 
Sbjct: 272  GSLFGDRILSRDENGS--------LTQKSLERLFPTDQSMVIVIDDRGDVWNWCP-NLIK 322

Query: 1073 VERYTYF 1079
            V  Y +F
Sbjct: 323  VVPYNFF 329


>gi|8778093|gb|AAF79202.1| CTD phosphatase-like protein [Emericella nidulans]
          Length = 409

 Score = 86.3 bits (212), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 61/188 (32%), Positives = 99/188 (52%), Gaps = 34/188 (18%)

Query: 908  RLEE--QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR---- 961
            R+EE  ++++ + RKL LV+DLD T++++A    VDP   E +      D++ P+     
Sbjct: 42   RVEEDAKRRLLANRKLSLVVDLDQTIIHAA----VDPTIGEWM-----ADKDNPNHAPVS 92

Query: 962  --HLFRFPHMG-------MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDP 1012
                F+    G       +  KLRPG+  FL+  + ++E+H+YTMG + YA  +A ++DP
Sbjct: 93   DVRAFQLVDDGPGMRGLLVLCKLRPGLEEFLKNVADMYELHIYTMGTRSYAQAIANIIDP 152

Query: 1013 KGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMES-AVVIIDDSVRVWPHNKLNLI 1071
               LF  R++SR + G            K+L  +  +++  VVIIDD   VW  +  NLI
Sbjct: 153  DRKLFGDRILSRDESGS--------LSVKNLHRIFPVDTKMVVIIDDRGDVWRWSP-NLI 203

Query: 1072 VVERYTYF 1079
             V  Y +F
Sbjct: 204  KVIPYDFF 211


>gi|198460927|ref|XP_001361849.2| GA11510 [Drosophila pseudoobscura pseudoobscura]
 gi|198137180|gb|EAL26428.2| GA11510 [Drosophila pseudoobscura pseudoobscura]
          Length = 873

 Score = 86.3 bits (212), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 57/173 (32%), Positives = 93/173 (53%), Gaps = 24/173 (13%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
            ++ +++ + RKL L++DLD T++++      D V + I          K   H   +   
Sbjct: 180  DDTRRLLADRKLVLLVDLDQTVIHTTN----DTVPENI----------KGIYHFQLYGPQ 225

Query: 970  GMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD 1027
              W  T+LRPG   FLER S+L+E+H+ T G + YA  +A++LDP G  F+ R++SR   
Sbjct: 226  SPWYHTRLRPGTAEFLERMSQLYELHICTFGARNYAHMIAQLLDPDGKFFSHRILSR--- 282

Query: 1028 GDPFDGDERVPKSKDLEGVL-GMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
             + F+      K+ +L+ +    +S V IIDD   VW +   NLI V+ Y +F
Sbjct: 283  DECFNA---TSKTDNLKALFPNGDSMVCIIDDREDVW-NMASNLIQVKPYHFF 331


>gi|448097224|ref|XP_004198617.1| Piso0_001998 [Millerozyma farinosa CBS 7064]
 gi|359380039|emb|CCE82280.1| Piso0_001998 [Millerozyma farinosa CBS 7064]
          Length = 830

 Score = 85.9 bits (211), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 67/203 (33%), Positives = 95/203 (46%), Gaps = 48/203 (23%)

Query: 901  IQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPH 960
            I+   T RL E+KK      L LV+DLD T++++     VDP   E      + D   P+
Sbjct: 152  IETNTTDRLVEEKK------LILVVDLDQTVIHAT----VDPTVGEW-----QSDPSNPN 196

Query: 961  RHLFR---------------------FPHMGMW--TKLRPGIWTFLERASKLFEMHLYTM 997
                +                      P    W   K+RPG+  FLE+ SKL+EMH+YTM
Sbjct: 197  YKAVKDVKSFCLEEESIAPLGWEGPKLPATKCWYYVKVRPGLEQFLEQISKLYEMHIYTM 256

Query: 998  GNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVII 1056
              + YA E+AK++DP G  F  R++SR + G            K+L+ +  + +S V II
Sbjct: 257  ATRNYALEIAKIIDPNGKYFGDRILSRDESGS--------LTHKNLKRLFPVDQSMVAII 308

Query: 1057 DDSVRVWPHNKLNLIVVERYTYF 1079
            DD   VW     NLI V  Y +F
Sbjct: 309  DDRGDVWQWEN-NLIKVVPYDFF 330


>gi|340377687|ref|XP_003387360.1| PREDICTED: hypothetical protein LOC100639785 [Amphimedon
            queenslandica]
          Length = 913

 Score = 85.9 bits (211), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 64/187 (34%), Positives = 93/187 (49%), Gaps = 26/187 (13%)

Query: 898  KAAIQKERTRRLEEQKK--MFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 955
            +  + K+  +RL    K  +   RKL L++DLD TL           +H  I R  E   
Sbjct: 125  QVKVNKKEAQRLGNLDKECLLKNRKLALIIDLDQTL-----------IHTSIDRNIE--- 170

Query: 956  REKPHRHLFRFP-HMGMW-TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
            R  P  H F  P H  ++  +LRP +  FL   S+ +E+H+ TMG + YA  + K+LD +
Sbjct: 171  RGLPDVHSFTLPGHSCVYHCRLRPYVREFLNHISQYYELHVATMGTRDYADAITKILDQE 230

Query: 1014 GVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIV 1072
              LF+ RVISR +  DP        K+  L+ V    +  V I+DD   VW H + NLI 
Sbjct: 231  KKLFSHRVISRNELLDPHS------KAVRLKSVFPCGDEMVAIMDDRGDVWGH-RPNLIH 283

Query: 1073 VERYTYF 1079
            V+ Y +F
Sbjct: 284  VKAYVFF 290


>gi|195440020|ref|XP_002067857.1| GK12500 [Drosophila willistoni]
 gi|194163942|gb|EDW78843.1| GK12500 [Drosophila willistoni]
          Length = 657

 Score = 85.5 bits (210), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 56/172 (32%), Positives = 92/172 (53%), Gaps = 22/172 (12%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
            ++ +++ + RKL L++DLD T++++      DPV + I          K   H   +   
Sbjct: 181  DDTRRLLNDRKLVLLVDLDQTIIHTTN----DPVPENI----------KGIHHFQLYGSQ 226

Query: 970  GMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD 1027
              W  T LRPG   FLER S+++E+H+ T G + YA  +A+++DP+G LF+ R++SR   
Sbjct: 227  SPWYHTCLRPGTTQFLERMSQMYELHICTFGARKYAHMIAQLIDPEGKLFSHRILSR--- 283

Query: 1028 GDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
             + F+   ++   K L      +  V IIDD   VW +   NLI V+ Y +F
Sbjct: 284  DECFNATSKMDNLKAL--FPNGDKMVCIIDDREDVW-NMATNLIQVKPYHFF 332


>gi|226288832|gb|EEH44344.1| RNA polymerase II C-terminal domain phosphatase component
            [Paracoccidioides brasiliensis Pb18]
          Length = 920

 Score = 85.5 bits (210), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 65/201 (32%), Positives = 98/201 (48%), Gaps = 34/201 (16%)

Query: 895  DQQKAAIQKERTRRLEEQKKMFSARKLCL--VLDLDHTLLNSAKFHEVDPVHDEILRKKE 952
            D     + K    R+EE  K        L  V+DLD T++++     VDP   E      
Sbjct: 103  DNSSLTVSKSEATRVEEDAKRRLLSSRRLSLVVDLDQTIIHAT----VDPTVAEW----- 153

Query: 953  EQDREKPHRHLFR----------FPHM-GMW--TKLRPGIWTFLERASKLFEMHLYTMGN 999
            +QDR+ P+    +           P M G W   KLRPG+  FL+  S L+E+H+YTMG 
Sbjct: 154  QQDRDNPNHEAVKDVRAFQLVDDGPGMKGCWYYIKLRPGLQEFLQEISALYELHIYTMGT 213

Query: 1000 KLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDD 1058
            + YA  +A ++DP   +F  R++SR + G           +K+L+ +  +++  VVIIDD
Sbjct: 214  RAYAQNIAAIVDPDRKIFGDRILSRDESGS--------LTAKNLQRLFPVDTKMVVIIDD 265

Query: 1059 SVRVWPHNKLNLIVVERYTYF 1079
               VW  +  NLI V  Y +F
Sbjct: 266  RGDVWKWSD-NLIKVSPYDFF 285


>gi|295671060|ref|XP_002796077.1| conserved hypothetical protein [Paracoccidioides sp. 'lutzii' Pb01]
 gi|226284210|gb|EEH39776.1| conserved hypothetical protein [Paracoccidioides sp. 'lutzii' Pb01]
          Length = 829

 Score = 85.5 bits (210), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 66/201 (32%), Positives = 99/201 (49%), Gaps = 34/201 (16%)

Query: 895  DQQKAAIQKERTRRLEEQKKMFSARKLCL--VLDLDHTLLNSAKFHEVDPVHDEILRKKE 952
            D     + K    R+EE  K        L  V+DLD T++++     VDP   E      
Sbjct: 228  DNSSLTVSKSEATRVEEDAKRRLLSSRRLSLVVDLDQTIIHAT----VDPTVAEW----- 278

Query: 953  EQDREKPHR------HLFRF----PHM-GMW--TKLRPGIWTFLERASKLFEMHLYTMGN 999
            +QDR+ P+         F+     P M G W   KLRPG+  FL+  S L+E+H+YTMG 
Sbjct: 279  QQDRDNPNHEAVKDVRAFQLVDDGPGMKGCWYYIKLRPGLQEFLQEISALYELHIYTMGT 338

Query: 1000 KLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMES-AVVIIDD 1058
            + YA  +A ++DP   +F  R++SR + G           +K+L+ +  +++  VVIIDD
Sbjct: 339  RAYAQNIATIVDPDRKIFGDRILSRDESGS--------LTAKNLQRLFPVDTKMVVIIDD 390

Query: 1059 SVRVWPHNKLNLIVVERYTYF 1079
               VW  +  NLI V  Y +F
Sbjct: 391  RGDVWKWSD-NLIKVSPYDFF 410


>gi|332029822|gb|EGI69691.1| RNA polymerase II subunit A C-terminal domain phosphatase [Acromyrmex
            echinatior]
          Length = 749

 Score = 85.1 bits (209), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 55/177 (31%), Positives = 92/177 (51%), Gaps = 32/177 (18%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPH-RHLFRFPH 968
            E+++++ + RKL L++DLD T++++                    D   P+ + +F F  
Sbjct: 145  EDEQRLLTDRKLVLLVDLDQTIVHTT------------------NDNIPPNLKDVFHFQL 186

Query: 969  MGM---W--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 1023
             G+   W  T+LRP    FL   S+L+E+H+ T G ++YA  +A +LD  GVLF+ R++S
Sbjct: 187  YGLNSPWYHTRLRPNTRHFLSEMSRLYELHICTFGARIYAHTVASLLDKDGVLFSHRILS 246

Query: 1024 RGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            R +  DP        K+ +L+ +    +  V IIDD   VW     NL+ V+ Y +F
Sbjct: 247  RDECFDP------ASKTANLKALFPCGDDLVCIIDDREDVW-QGCGNLVQVKPYHFF 296


>gi|195429765|ref|XP_002062928.1| GK19439 [Drosophila willistoni]
 gi|194159013|gb|EDW73914.1| GK19439 [Drosophila willistoni]
          Length = 827

 Score = 85.1 bits (209), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 57/173 (32%), Positives = 93/173 (53%), Gaps = 24/173 (13%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
            ++ +++ + RKL L++DLD T++++      D V D I          K   H   +   
Sbjct: 180  DDTRRLLADRKLVLLVDLDQTVIHTTN----DVVPDNI----------KGIYHFQLYGPQ 225

Query: 970  GMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD 1027
              W  T+LRPG   FL+R S L+E+H+ T G + YA  +A++LDP+G  F+ R++SR   
Sbjct: 226  SPWYHTRLRPGTADFLDRMSHLYELHICTFGARNYAHMIAQLLDPEGKFFSHRILSR--- 282

Query: 1028 GDPFDGDERVPKSKDLEGVL-GMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
             + F+      K+ +L+ +    +S V IIDD   VW +   NLI V+ Y +F
Sbjct: 283  DECFNA---TSKTDNLKALFPNGDSMVCIIDDREDVW-NMASNLIQVKPYHFF 331


>gi|448111257|ref|XP_004201796.1| Piso0_001998 [Millerozyma farinosa CBS 7064]
 gi|359464785|emb|CCE88490.1| Piso0_001998 [Millerozyma farinosa CBS 7064]
          Length = 830

 Score = 84.7 bits (208), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 61/190 (32%), Positives = 90/190 (47%), Gaps = 42/190 (22%)

Query: 914  KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFR-------- 965
            ++   +KL LV+DLD T++++     VDP   E      + D   P+    +        
Sbjct: 159  RLVDEKKLILVVDLDQTVIHAT----VDPTVGEW-----QSDPSNPNYKAVKDVKSFCLE 209

Query: 966  -------------FPHMGMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVL 1010
                          P    W   K+RPG+  FLE+ SKL+EMH+YTM  + YA E+AK++
Sbjct: 210  EESIAPLGWEGPKLPATKCWYYVKVRPGLEEFLEQISKLYEMHIYTMATRNYALEIAKII 269

Query: 1011 DPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLN 1069
            DP G  F  R++SR + G            K+L+ +  + +S V IIDD   VW     N
Sbjct: 270  DPDGKYFGDRILSRDESGS--------LTHKNLKRLFPVDQSMVAIIDDRGDVWQWEN-N 320

Query: 1070 LIVVERYTYF 1079
            LI V  Y +F
Sbjct: 321  LIKVVPYDFF 330


>gi|406602036|emb|CCH46356.1| RNA polymerase II subunit A C-terminal domain phosphatase
            [Wickerhamomyces ciferrii]
          Length = 720

 Score = 84.7 bits (208), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 70/221 (31%), Positives = 105/221 (47%), Gaps = 42/221 (19%)

Query: 901  IQKERTRRLEE--QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKK------- 951
            I K   +++E+   K +    KL LV+DLD T++++     VDP   E +  +       
Sbjct: 153  ISKSEAQKVEQLMTKNLIKENKLILVVDLDQTVIHAT----VDPTIGEWMNDQSNPNFPS 208

Query: 952  ---------EEQDREKPHRHLFRFPHMGMW--TKLRPGIWTFLERASKLFEMHLYTMGNK 1000
                     EE+    P     R P    W   K+RPG+  FL+R +K++E+H+YTMG K
Sbjct: 209  LKDVQYFSLEEEPILPPGYQGPRPPTHKRWYYVKMRPGLEDFLKRIAKIYELHIYTMGTK 268

Query: 1001 LYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 1059
             YA  +AK++DP G  F  R++SR + G            K LE +   + S VVIIDD 
Sbjct: 269  EYARSIAKIIDPDGEYFGERILSRDESGS--------LTQKSLERLFPTDTSMVVIIDDR 320

Query: 1060 VRVWPHNKLNLIVVERYTYF--------PCSRRQFGLLGPS 1092
              VW  +  +LI V  + +F            +Q  LLGP+
Sbjct: 321  GDVWNWSD-HLIKVVPFDFFVGIGDINSNFLPKQKSLLGPT 360


>gi|403217618|emb|CCK72111.1| hypothetical protein KNAG_0J00280 [Kazachstania naganishii CBS 8797]
          Length = 742

 Score = 84.7 bits (208), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 62/188 (32%), Positives = 90/188 (47%), Gaps = 43/188 (22%)

Query: 917  SARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFR----------- 965
            +A+KL LV+DLD T+++      VDP   E  R     D   P+    R           
Sbjct: 176  AAQKLVLVVDLDQTVVHCG----VDPTIGEWKR-----DPRNPNYEALRDVQSFALEEEP 226

Query: 966  ---FPHMG----------MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDP 1012
               F ++G           + K+RPG+  F +R + LFEMH+YTM  + YA E+AK++DP
Sbjct: 227  ILPFLYVGGKRPAPRKCWYYVKVRPGLKQFFKRLAPLFEMHIYTMATRAYALEIAKIIDP 286

Query: 1013 KGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLI 1071
               LF  R++SR ++G            K LE +    +S V +IDD   VW +   NLI
Sbjct: 287  DKSLFGDRILSRDENGS--------LTHKSLERLFPTDQSMVTVIDDRGDVW-NWCANLI 337

Query: 1072 VVERYTYF 1079
             V  Y +F
Sbjct: 338  KVVPYNFF 345


>gi|169600911|ref|XP_001793878.1| hypothetical protein SNOG_03310 [Phaeosphaeria nodorum SN15]
 gi|160705543|gb|EAT90041.2| hypothetical protein SNOG_03310 [Phaeosphaeria nodorum SN15]
          Length = 810

 Score = 84.7 bits (208), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 54/180 (30%), Positives = 95/180 (52%), Gaps = 17/180 (9%)

Query: 908  RLEE--QKKMFSARKLCLVLDLDHTLLNSAKFHEV-----DPVHDEILRKKEEQDREKPH 960
            R EE  +K++ +++KL L++DLD T++++     V     DP +      K+ +  +   
Sbjct: 146  RAEEDTKKRLLNSKKLTLIVDLDQTVIHTTCERTVAEWQADPENPNYEAVKDVKGFQLAD 205

Query: 961  RHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGR 1020
             +L        + K+RPG+  F ++ SKL+EMH+YTM  + YA  + K++DP    F  R
Sbjct: 206  DNLSNVAANWYYVKMRPGLKEFFDKMSKLYEMHVYTMATRAYAQAIMKIIDPDRKYFGDR 265

Query: 1021 VISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            ++SR ++           K K+L  +    +A VVIIDD   VW ++  +L+ V  + +F
Sbjct: 266  ILSRDEN--------YTDKLKNLTRLFYQNTAMVVIIDDRADVWQYSP-HLVRVPVFNFF 316


>gi|297819964|ref|XP_002877865.1| hypothetical protein ARALYDRAFT_906617 [Arabidopsis lyrata subsp.
            lyrata]
 gi|297323703|gb|EFH54124.1| hypothetical protein ARALYDRAFT_906617 [Arabidopsis lyrata subsp.
            lyrata]
          Length = 345

 Score = 84.3 bits (207), Expect = 3e-13,   Method: Composition-based stats.
 Identities = 62/198 (31%), Positives = 96/198 (48%), Gaps = 23/198 (11%)

Query: 904  ERTRRLEEQKKMFSARKLCLVLDLDHTLLN---SAKFHEVDPVHDEILRKKEEQDREKPH 960
            E T + ++       RKL LVL L+HTL++    +K  E+D  H  +L + +   R+   
Sbjct: 81   EATTKKQKLGIALGKRKLHLVLSLEHTLIDLISVSKLSEIDRYH--LLEEADSGSRDD-- 136

Query: 961  RHLFRFPHMGMWT-----KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
              LFR  +   ++     K RP +  FL  A K+F MH+YT      A ++ K+LDP  +
Sbjct: 137  --LFRLANESFYSSDALVKFRPFVREFLREAEKIFTMHVYTNYGPGLAKKVVKLLDPHMI 194

Query: 1016 LFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVER 1075
             F  R+I+  D     +GD      K LE VL     V+I+D   R+W     N+I + +
Sbjct: 195  YFGNRIITSKDS----NGD-----LKSLELVLAEPRGVLIVDYDHRLWKSPGHNVIFMSK 245

Query: 1076 YTYFPCSRRQFGLLGPSL 1093
            Y YF     + G+L  +L
Sbjct: 246  YVYFKEISNEDGVLAKTL 263


>gi|301118528|ref|XP_002906992.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262108341|gb|EEY66393.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 735

 Score = 84.0 bits (206), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 57/210 (27%), Positives = 93/210 (44%), Gaps = 52/210 (24%)

Query: 913  KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMW 972
            ++   A+KL LVLDLDHTLL++ +   VD V  EI                         
Sbjct: 265  RRQLGAKKLSLVLDLDHTLLHAVR---VDDVVSEI------------------------- 296

Query: 973  TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFD 1032
                        + + L+++ +YT G +LYA ++  ++DP    F  R+++R D  D   
Sbjct: 297  ------------KQTVLYDLFIYTHGTRLYAEKIVNIIDPDETYFKNRIVARTDTPDMLH 344

Query: 1033 GDERV--PKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLG 1090
               ++  P   D        S ++++DD + VW  N+ N+ ++E Y YF C+       G
Sbjct: 345  KSLKLLFPSCDD--------SMILVLDDRIDVWKENEGNVFLIEPYHYFKCTSEINNASG 396

Query: 1091 PSL--LEIDHDERSEDGTLASSLGVRQQLH 1118
              +  +E    E SED  LA S  V + +H
Sbjct: 397  RGVAGMEDSEAEASEDSHLAQSTTVLRHVH 426


>gi|302497759|ref|XP_003010879.1| hypothetical protein ARB_02918 [Arthroderma benhamiae CBS 112371]
 gi|291174424|gb|EFE30239.1| hypothetical protein ARB_02918 [Arthroderma benhamiae CBS 112371]
          Length = 1048

 Score = 84.0 bits (206), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 58/170 (34%), Positives = 91/170 (53%), Gaps = 32/170 (18%)

Query: 924  VLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR------HLFRF----PHM-GMW 972
            V+DLD T++++     VDP   E      +QD++ P+         F+     P M G W
Sbjct: 347  VVDLDQTIIHAT----VDPTVAEW-----QQDKDNPNHDAVKDVRCFQLVDDGPGMRGCW 397

Query: 973  --TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1030
               KLRPG+  FL+  S L+E+H+YTMG + YA  +A ++DP   +F  R++SR + G  
Sbjct: 398  YYIKLRPGLEEFLKVISTLYELHIYTMGTRAYAQNVANIVDPDKKIFGDRILSRDESGS- 456

Query: 1031 FDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERYTYF 1079
                     +K+L+ +  +++  VVIIDD   VW  ++ NLI V  Y +F
Sbjct: 457  -------LTAKNLQRLFPVDTKMVVIIDDRGDVWKWSE-NLIKVSPYDFF 498


>gi|405966173|gb|EKC31485.1| RNA polymerase II subunit A C-terminal domain phosphatase
            [Crassostrea gigas]
          Length = 837

 Score = 84.0 bits (206), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 56/173 (32%), Positives = 91/173 (52%), Gaps = 25/173 (14%)

Query: 911  EQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMG 970
            ++ ++   RKL L++DLD TL+++     + P   ++              + F+  H  
Sbjct: 138  DEDRLLRTRKLVLLVDLDQTLIHTTN-DNIPPNLKDV--------------YHFQLSHGN 182

Query: 971  M--W--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD 1026
            M  W  T++RP    FLE  SKL+E+H+ T G+++YA  +AK LDP G  F+ R++SR  
Sbjct: 183  MMPWYHTRIRPRTEKFLENVSKLYELHICTFGSRMYAHIIAKFLDPDGKYFSHRILSR-- 240

Query: 1027 DGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
              + F+ + ++   K L      +S V IIDD   VW  +  NLI V+ Y +F
Sbjct: 241  -DECFNQNSKMANLKAL--FPCGDSMVCIIDDREDVWNFSP-NLIHVKPYRFF 289


>gi|326475449|gb|EGD99458.1| RNA Polymerase II CTD phosphatase [Trichophyton tonsurans CBS 112818]
          Length = 866

 Score = 84.0 bits (206), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 58/170 (34%), Positives = 91/170 (53%), Gaps = 32/170 (18%)

Query: 924  VLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR------HLFRF----PHM-GMW 972
            V+DLD T++++     VDP   E      +QD++ P+         F+     P M G W
Sbjct: 163  VVDLDQTIIHAT----VDPTVAEW-----QQDKDNPNHDAVKDVRCFQLVDDGPGMRGCW 213

Query: 973  --TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1030
               KLRPG+  FL+  S L+E+H+YTMG + YA  +A ++DP   +F  R++SR + G  
Sbjct: 214  YYIKLRPGLEEFLKVISTLYELHIYTMGTRAYAQNVANIVDPDKKIFGDRILSRDESGS- 272

Query: 1031 FDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERYTYF 1079
                     +K+L+ +  +++  VVIIDD   VW  ++ NLI V  Y +F
Sbjct: 273  -------LTAKNLQRLFPVDTKMVVIIDDRGDVWKWSE-NLIKVSPYDFF 314


>gi|321262398|ref|XP_003195918.1| carboxy-terminal domain (CTD) phosphatase; Fcp1p [Cryptococcus gattii
            WM276]
 gi|317462392|gb|ADV24131.1| Carboxy-terminal domain (CTD) phosphatase, putative; Fcp1p
            [Cryptococcus gattii WM276]
          Length = 952

 Score = 83.6 bits (205), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 48/113 (42%), Positives = 68/113 (60%), Gaps = 12/113 (10%)

Query: 970  GMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD 1027
            G W  TK RPG+  FL+  S+L+EMH+YTMG + YA  + KV+DP G +F GR++SR + 
Sbjct: 304  GRWYFTKPRPGLQKFLDEMSQLYEMHVYTMGTRTYADAIVKVIDPDGKIFGGRILSRDES 363

Query: 1028 GDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            G  F        SK+L+ +   + S VV+IDD   VW  +  NL+ V  Y +F
Sbjct: 364  GS-F-------SSKNLKRLFPTDTSMVVVIDDRSDVW-GDCPNLVKVVPYDFF 407


>gi|50552035|ref|XP_503492.1| YALI0E03278p [Yarrowia lipolytica]
 gi|49649361|emb|CAG79071.1| YALI0E03278p [Yarrowia lipolytica CLIB122]
          Length = 750

 Score = 83.6 bits (205), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 64/193 (33%), Positives = 96/193 (49%), Gaps = 34/193 (17%)

Query: 907  RRLEE--QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILR--KKEEQDREKPHRH 962
            +RLEE   K++   RKL LV+DLD T+++      VDP   E  +       D  K  R 
Sbjct: 155  QRLEEGSTKQLLKQRKLILVVDLDQTVIHVT----VDPTVGEWKKDPSNPNYDAVKDVR- 209

Query: 963  LFRFPHMGM---------------WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMA 1007
            +F    M M               + KLRP +  FLE  S+ +E+H+YTM  + YA  +A
Sbjct: 210  VFSLEEMTMVSYDGGKPVPQLCYYYVKLRPHLKEFLEVVSEKYELHIYTMATRAYAKAIA 269

Query: 1008 KVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVV-IIDDSVRVWPHN 1066
            +++DP G  F  R++SR + G            K L+ +  +++++V IIDD   VW  +
Sbjct: 270  EIIDPDGRYFGDRILSRDESGS--------LTQKSLQRLFPVDTSMVAIIDDRGDVWKWS 321

Query: 1067 KLNLIVVERYTYF 1079
            K NLI V  Y +F
Sbjct: 322  K-NLIRVVPYDFF 333


>gi|326477486|gb|EGE01496.1| RNA Polymerase II CTD phosphatase Fcp1 [Trichophyton equinum CBS
            127.97]
          Length = 866

 Score = 83.6 bits (205), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 58/170 (34%), Positives = 91/170 (53%), Gaps = 32/170 (18%)

Query: 924  VLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR------HLFRF----PHM-GMW 972
            V+DLD T++++     VDP   E      +QD++ P+         F+     P M G W
Sbjct: 163  VVDLDQTIIHAT----VDPTVAEW-----QQDKDNPNHDAVKDVRCFQLVDDGPGMRGCW 213

Query: 973  --TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1030
               KLRPG+  FL+  S L+E+H+YTMG + YA  +A ++DP   +F  R++SR + G  
Sbjct: 214  YYIKLRPGLEEFLKVISTLYELHIYTMGTRAYAQNVANIVDPDKKIFGDRILSRDESGS- 272

Query: 1031 FDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERYTYF 1079
                     +K+L+ +  +++  VVIIDD   VW  ++ NLI V  Y +F
Sbjct: 273  -------LTAKNLQRLFPVDTKMVVIIDDRGDVWKWSE-NLIKVSPYDFF 314


>gi|327296037|ref|XP_003232713.1| RNA Polymerase II CTD phosphatase [Trichophyton rubrum CBS 118892]
 gi|326465024|gb|EGD90477.1| RNA Polymerase II CTD phosphatase [Trichophyton rubrum CBS 118892]
          Length = 836

 Score = 83.6 bits (205), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 58/170 (34%), Positives = 91/170 (53%), Gaps = 32/170 (18%)

Query: 924  VLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR------HLFRF----PHM-GMW 972
            V+DLD T++++     VDP   E      +QD++ P+         F+     P M G W
Sbjct: 134  VVDLDQTIIHAT----VDPTVAEW-----QQDKDNPNHDAVKDVRCFQLVDDGPGMRGCW 184

Query: 973  --TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1030
               KLRPG+  FL+  S L+E+H+YTMG + YA  +A ++DP   +F  R++SR + G  
Sbjct: 185  YYIKLRPGLEEFLKVISTLYELHIYTMGTRAYAQNVANIVDPDKKIFGDRILSRDESGS- 243

Query: 1031 FDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERYTYF 1079
                     +K+L+ +  +++  VVIIDD   VW  ++ NLI V  Y +F
Sbjct: 244  -------LTAKNLQRLFPVDTKMVVIIDDRGDVWKWSE-NLIKVSPYDFF 285


>gi|260949511|ref|XP_002619052.1| hypothetical protein CLUG_00211 [Clavispora lusitaniae ATCC 42720]
 gi|238846624|gb|EEQ36088.1| hypothetical protein CLUG_00211 [Clavispora lusitaniae ATCC 42720]
          Length = 776

 Score = 83.6 bits (205), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 68/202 (33%), Positives = 94/202 (46%), Gaps = 38/202 (18%)

Query: 897  QKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDP-------------- 942
            +   I++  T RL   KK      L LV+DLD T++++     VDP              
Sbjct: 145  EATKIEQSSTERLAADKK------LILVVDLDQTVIHAT----VDPTVGEWQRDPQNPNY 194

Query: 943  --VHDEILRKKEEQDREKPHRHLFRFPHMGMW--TKLRPGIWTFLERASKLFEMHLYTMG 998
              V D  L   EE+    P     R P    W   KLRPG+  FL   SKL+E+H+YTM 
Sbjct: 195  PFVKDVQLFSLEEEPIVPPGWVGPRPPPTKCWYYVKLRPGLKEFLAEVSKLYELHIYTMA 254

Query: 999  NKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIID 1057
             + YA  +A ++DP G  F  R++SR + G            K+L  +  + +S VVIID
Sbjct: 255  TRNYALAIASIIDPDGKYFGDRILSRDESGS--------LTHKNLRRLFPVDQSMVVIID 306

Query: 1058 DSVRVWPHNKLNLIVVERYTYF 1079
            D   VW   + NLI V  Y +F
Sbjct: 307  DRGDVW-QWEANLIKVVPYDFF 327


>gi|443696103|gb|ELT96883.1| hypothetical protein CAPTEDRAFT_23527, partial [Capitella teleta]
          Length = 562

 Score = 83.6 bits (205), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 56/174 (32%), Positives = 96/174 (55%), Gaps = 27/174 (15%)

Query: 911  EQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMG 970
            +++++   +KL L++DLD TL+++         +D++    ++        H F+  H  
Sbjct: 132  DEQRLIRDKKLVLLVDLDQTLIHTT--------NDKVPANLKDV-------HHFQLHHGR 176

Query: 971  --MW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD 1026
              +W  TK RPG   FLER SKL+E+H+ T G ++YA  +AK+LDP G  F+ R++SR +
Sbjct: 177  NLLWYHTKFRPGTEKFLERISKLYELHICTFGVRMYAHTIAKLLDPDGKYFSHRILSRDE 236

Query: 1027 DGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
              +P        K+ +L+ +    +S V IIDD   VW  +  +L+ V+ Y +F
Sbjct: 237  CFNP------TSKTGNLKALFPCGDSMVCIIDDREDVWRFSP-SLVHVKPYLFF 283


>gi|302657133|ref|XP_003020296.1| hypothetical protein TRV_05607 [Trichophyton verrucosum HKI 0517]
 gi|291184115|gb|EFE39678.1| hypothetical protein TRV_05607 [Trichophyton verrucosum HKI 0517]
          Length = 865

 Score = 83.6 bits (205), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 58/170 (34%), Positives = 91/170 (53%), Gaps = 32/170 (18%)

Query: 924  VLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR------HLFRF----PHM-GMW 972
            V+DLD T++++     VDP   E      +QD++ P+         F+     P M G W
Sbjct: 163  VVDLDQTIIHAT----VDPTVAEW-----QQDKDNPNHDAVKDVRCFQLVDDGPGMRGCW 213

Query: 973  --TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1030
               KLRPG+  FL+  S L+E+H+YTMG + YA  +A ++DP   +F  R++SR + G  
Sbjct: 214  YYIKLRPGLEEFLKVISTLYELHIYTMGTRAYAQNVANIVDPDKKIFGDRILSRDESGS- 272

Query: 1031 FDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERYTYF 1079
                     +K+L+ +  +++  VVIIDD   VW  ++ NLI V  Y +F
Sbjct: 273  -------LTAKNLQRLFPVDTKMVVIIDDRGDVWKWSE-NLIKVSPYDFF 314


>gi|150866706|ref|XP_001386384.2| hypothetical protein PICST_63097 [Scheffersomyces stipitis CBS 6054]
 gi|149387962|gb|ABN68355.2| predicted protein [Scheffersomyces stipitis CBS 6054]
          Length = 790

 Score = 83.6 bits (205), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 66/202 (32%), Positives = 95/202 (47%), Gaps = 38/202 (18%)

Query: 897  QKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDP-------------- 942
            + A I++  T RL E+KK      L LV+DLD T++++     VDP              
Sbjct: 148  EAAKIEQSTTDRLNEEKK------LILVVDLDQTVIHAT----VDPTVGEWQSDPSNPNY 197

Query: 943  --VHDEILRKKEEQDREKPHRHLFRFPHMGMW--TKLRPGIWTFLERASKLFEMHLYTMG 998
              + D      EE+    P     R      W   K+RPG+  FLE    L+EMH+YTM 
Sbjct: 198  PAIKDVKTFCLEEEAIVPPGWTGPRLAPTKCWYYVKVRPGLSDFLEEIVNLYEMHIYTMA 257

Query: 999  NKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIID 1057
             + YA  +AK++DP G  F  R++SR + G            K+L+ +  + +S VVIID
Sbjct: 258  TRNYALAIAKIIDPTGKYFGDRILSRDESGS--------LTHKNLKRLFPVDQSMVVIID 309

Query: 1058 DSVRVWPHNKLNLIVVERYTYF 1079
            D   +W     NLI V  Y +F
Sbjct: 310  DRGDIWQWES-NLIKVVPYDFF 330


>gi|341882050|gb|EGT37985.1| hypothetical protein CAEBREN_32558 [Caenorhabditis brenneri]
          Length = 673

 Score = 83.2 bits (204), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 55/179 (30%), Positives = 95/179 (53%), Gaps = 31/179 (17%)

Query: 911  EQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP--H 968
            ++  + S RKL L++DLD T+++++              K   +D EK H+ + R+   H
Sbjct: 134  DETNLVSNRKLVLLVDLDQTIIHTSD-------------KPMSEDSEK-HKDITRYGLNH 179

Query: 969  MGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 1028
                TKLRP    FL + + ++EMH+ T G + YA ++A++LDP+  LF  R++SR    
Sbjct: 180  RKYITKLRPHTTEFLNKMATMYEMHIVTYGQRQYAHKIAQILDPEARLFGQRILSR---D 236

Query: 1029 DPFDGDERVPKSKDLEGVLGMESA--------VVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            + F       K+++L+ ++  + A        VVIIDD   VW ++   LI ++ Y +F
Sbjct: 237  ELFSAQH---KTRNLKVIILFQKALFPCGDNLVVIIDDRADVWMYSDA-LIQIKPYRFF 291


>gi|315051428|ref|XP_003175088.1| RNA polymerase II subunit A domain phosphatase [Arthroderma gypseum
            CBS 118893]
 gi|311340403|gb|EFQ99605.1| RNA polymerase II subunit A domain phosphatase [Arthroderma gypseum
            CBS 118893]
          Length = 867

 Score = 83.2 bits (204), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 58/170 (34%), Positives = 91/170 (53%), Gaps = 32/170 (18%)

Query: 924  VLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR------HLFRF----PHM-GMW 972
            V+DLD T++++     VDP   E      +QD++ P+         F+     P M G W
Sbjct: 163  VVDLDQTIIHAT----VDPTVGEW-----QQDKDNPNHDAVKDVRCFQLVDDGPGMRGCW 213

Query: 973  --TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1030
               KLRPG+  FL+  S L+E+H+YTMG + YA  +A ++DP   +F  R++SR + G  
Sbjct: 214  YYIKLRPGLEEFLKVISTLYELHIYTMGTRAYAQNVANIVDPDRKIFGDRILSRDESGS- 272

Query: 1031 FDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERYTYF 1079
                     +K+L+ +  +++  VVIIDD   VW  ++ NLI V  Y +F
Sbjct: 273  -------LTAKNLQRLFPVDTKMVVIIDDRGDVWKWSE-NLIKVTPYDFF 314


>gi|391345370|ref|XP_003746962.1| PREDICTED: RNA polymerase II subunit A C-terminal domain
            phosphatase-like [Metaseiulus occidentalis]
          Length = 475

 Score = 83.2 bits (204), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 56/174 (32%), Positives = 91/174 (52%), Gaps = 26/174 (14%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
            +++ ++ + +KL L++DLD TL+++      +PV+D+I              H FR P  
Sbjct: 21   DDELRLLTQKKLVLLVDLDQTLIHTTS----EPVYDKI-----------KGVHHFRLPSS 65

Query: 970  G-MW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD 1026
               W  T++RPG   FL + S+LFE+H+ T G + YA  +A +LDP    F  R++SR +
Sbjct: 66   NNAWYHTRIRPGTEDFLRKISQLFELHIVTFGARPYANHIASLLDPGKKYFQYRILSRDE 125

Query: 1027 DGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
              +P        K+ +L+ +    +  V IIDD   VW     NLI V+ Y +F
Sbjct: 126  CFNP------QSKTANLKSLFPCGDQMVCIIDDREDVWNFAS-NLIAVKPYVFF 172


>gi|348665920|gb|EGZ05748.1| hypothetical protein PHYSODRAFT_566275 [Phytophthora sojae]
          Length = 684

 Score = 83.2 bits (204), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 52/170 (30%), Positives = 81/170 (47%), Gaps = 42/170 (24%)

Query: 913  KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMW 972
            ++   A+KL LVLDLDHTLL++ +   VD V  EI                   P  GM 
Sbjct: 266  RRQLGAKKLSLVLDLDHTLLHAVR---VDDVVGEI-------------------PKSGML 303

Query: 973  TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFD 1032
                          S L+++ +YT G +LYA ++ K++DP    F  R+++R D  D   
Sbjct: 304  --------------SALYDLFIYTHGTRLYAEQIVKIIDPDESYFKNRIVARTDTPDMLH 349

Query: 1033 GDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCS 1082
                  KS  L      +S ++++DD + VW  N+ N+ ++E Y YF C+
Sbjct: 350  ------KSLKLLFPSCDDSMILVLDDRIDVWKENEGNVFLIEPYHYFKCT 393


>gi|194757423|ref|XP_001960964.1| GF11242 [Drosophila ananassae]
 gi|190622262|gb|EDV37786.1| GF11242 [Drosophila ananassae]
          Length = 854

 Score = 83.2 bits (204), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 56/173 (32%), Positives = 92/173 (53%), Gaps = 24/173 (13%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
            ++ +++ + RKL L++DLD T++++      D V + I          K   H   +   
Sbjct: 191  DDTRRLLADRKLVLLVDLDQTVIHTTN----DTVPENI----------KGIYHFQLYGPQ 236

Query: 970  GMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD 1027
              W  T+LRPG   FLE  S+L+E+H+ T G + YA  +A++LDP G  F+ R++SR   
Sbjct: 237  SPWYHTRLRPGTAEFLESMSQLYELHICTFGARNYAHMIAQLLDPDGKFFSHRILSR--- 293

Query: 1028 GDPFDGDERVPKSKDLEGVL-GMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
             + F+      K+ +L+ +    +S V IIDD   VW +   NLI V+ Y +F
Sbjct: 294  DECFNA---TSKTDNLKALFPNGDSMVCIIDDREDVW-NMASNLIQVKPYHFF 342


>gi|344233336|gb|EGV65209.1| hypothetical protein CANTEDRAFT_104476 [Candida tenuis ATCC 10573]
 gi|344233337|gb|EGV65210.1| hypothetical protein CANTEDRAFT_104476 [Candida tenuis ATCC 10573]
          Length = 788

 Score = 83.2 bits (204), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 65/203 (32%), Positives = 98/203 (48%), Gaps = 38/203 (18%)

Query: 896  QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDP------------- 942
            ++ A I++  T RL +Q      RKL LV+DLD T++++     VDP             
Sbjct: 147  EEAAKIEQNSTTRLTQQ------RKLILVVDLDQTVIHAT----VDPTVGEWQSDPSNPN 196

Query: 943  ---VHDEILRKKEEQDREKPHRHLFRFPHMGMW--TKLRPGIWTFLERASKLFEMHLYTM 997
               V D      EE+    P+    +      W   KLRPG+  FL   ++++EMH+YTM
Sbjct: 197  YRAVKDVQSFCLEEEPITPPNWSGPKLSPTKCWYYVKLRPGLEEFLREMAEIYEMHIYTM 256

Query: 998  GNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVII 1056
              + YA  +AK++DP+G  F  R++SR + G            K+L+ +  + +S V II
Sbjct: 257  ATRNYALAIAKIIDPEGEYFGDRILSRDESGS--------LTHKNLKRLFPVDQSMVAII 308

Query: 1057 DDSVRVWPHNKLNLIVVERYTYF 1079
            DD   VW     NLI V  Y +F
Sbjct: 309  DDRGDVWQWED-NLIKVVPYDFF 330


>gi|357601986|gb|EHJ63229.1| putative RNA polymerase II subunit A C-terminal domain phosphatase
            [Danaus plexippus]
          Length = 683

 Score = 83.2 bits (204), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 59/175 (33%), Positives = 92/175 (52%), Gaps = 26/175 (14%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLF-RFP- 967
            E+  ++   RKL L++DLD TL+++     + P   ++L             H F R P 
Sbjct: 134  EDADRLLKDRKLVLLVDLDQTLVHTTN-DNIPPNIKDVL-------------HFFLRGPG 179

Query: 968  HMGMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG 1025
            + G W  T+LRP    FLE A+K +E+H+ T G + YA  + ++LDP+   F+ R++SR 
Sbjct: 180  NQGRWCHTRLRPKTHEFLESAAKNYELHVCTFGARQYAHAITELLDPQKKFFSHRILSR- 238

Query: 1026 DDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
               + FD      KS +L+ +    ++ V IIDD   VW H   NLI V  Y++F
Sbjct: 239  --DECFDAR---TKSANLKALFPCGDNMVCIIDDREDVWRHAS-NLIQVRPYSFF 287


>gi|190346120|gb|EDK38128.2| hypothetical protein PGUG_02226 [Meyerozyma guilliermondii ATCC 6260]
          Length = 732

 Score = 83.2 bits (204), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 62/191 (32%), Positives = 92/191 (48%), Gaps = 42/191 (21%)

Query: 913  KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFR------- 965
            +++ S RKL LV+DLD T++++     VDP   E      + D   P+    +       
Sbjct: 110  ERLTSERKLILVVDLDQTVIHAT----VDPTVGEW-----QSDPSNPNYRAVKDVRSFCL 160

Query: 966  -----------FPHM-----GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKV 1009
                        P M       + K+RPG+  FL+R S+L+EMH+YTM  + YA  +A +
Sbjct: 161  EEDPIAPPGWSGPKMTPTKCWYYVKVRPGLEDFLKRVSQLYEMHVYTMATRNYALAIAHI 220

Query: 1010 LDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKL 1068
            +DP G  F  R++SR + G            K+L  +  + +S VVIIDD   VW   K 
Sbjct: 221  IDPDGRYFGDRILSRDESGS--------LTHKNLRRLFPVDQSMVVIIDDRGDVWQWEK- 271

Query: 1069 NLIVVERYTYF 1079
            NLI V  Y +F
Sbjct: 272  NLIKVVPYEFF 282


>gi|325087549|gb|EGC40859.1| RNA polymerase II C-terminal domain phosphatase component
            [Ajellomyces capsulatus H88]
          Length = 885

 Score = 83.2 bits (204), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 59/170 (34%), Positives = 88/170 (51%), Gaps = 32/170 (18%)

Query: 924  VLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR------HLFRF----PHM-GMW 972
            V+DLD T++++     VDP   E      +QD++ P+         F+     P M G W
Sbjct: 134  VVDLDQTIIHAT----VDPTVAEW-----QQDKDNPNHEAVKDVRAFQLVDDGPGMKGCW 184

Query: 973  --TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1030
               KLRPG+  FL   S LFE+H+YTMG + YA  +A ++DP   +F  R++SR + G  
Sbjct: 185  YYIKLRPGLEEFLRNISTLFELHIYTMGTRAYAQHIASIVDPDRKIFGDRILSRDESGS- 243

Query: 1031 FDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERYTYF 1079
                     +K+L+ +  +++  VVIIDD   VW     NLI V  Y +F
Sbjct: 244  -------LTAKNLQRLFPVDTKMVVIIDDRGDVWKWTD-NLIKVVPYDFF 285


>gi|300176006|emb|CBK22223.2| unnamed protein product [Blastocystis hominis]
          Length = 680

 Score = 82.8 bits (203), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 58/163 (35%), Positives = 85/163 (52%), Gaps = 18/163 (11%)

Query: 909  LEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPH 968
            L  QK++ +AR+L LV DLD+TL+  +     DP      R         P+ H  +F  
Sbjct: 9    LVNQKRLLAARRLGLVFDLDNTLMEQSD----DP------RCSVAPSFGIPNIHFIQFKR 58

Query: 969  MGMWTK----LRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISR 1024
                +K    LRP + + L   SK +E+ +YT G + YA  + + +DPK  LF  RVI+R
Sbjct: 59   NNQLSKHTIILRPEVQSILTELSKYYELSIYTNGVRTYAQAIIESIDPKHQLFGSRVIAR 118

Query: 1025 GDDGDPFDG---DERVPKSKDLEGVL-GMESAVVIIDDSVRVW 1063
             D  D  +    +  +P SKD+  VL G+E   V++DDSV VW
Sbjct: 119  DDVPDNSETNFFNNFLPASKDISFVLPGLERLGVVVDDSVEVW 161


>gi|327358124|gb|EGE86981.1| RNA Polymerase II CTD phosphatase Fcp1 [Ajellomyces dermatitidis ATCC
            18188]
          Length = 839

 Score = 82.8 bits (203), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 59/170 (34%), Positives = 88/170 (51%), Gaps = 32/170 (18%)

Query: 924  VLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR------HLFRF----PHM-GMW 972
            V+DLD T++++     VDP   E      +QD++ P+         F+     P M G W
Sbjct: 62   VVDLDQTIIHAT----VDPTVAEW-----QQDKDNPNHEAVKDVRAFQLVDDGPGMRGCW 112

Query: 973  --TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1030
               KLRPG+  FL   S LFE+H+YTMG + YA  +A ++DP   +F  R++SR + G  
Sbjct: 113  YYIKLRPGLEEFLREISTLFELHIYTMGTRAYAQHIANIVDPDRKIFGDRILSRDESGS- 171

Query: 1031 FDGDERVPKSKDLEGVLGMES-AVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
                     +K+L+ +  +++  VVIIDD   VW     NLI V  Y +F
Sbjct: 172  -------LTAKNLQRLFPVDTKMVVIIDDRGDVWKWTD-NLIKVLPYDFF 213


>gi|444319376|ref|XP_004180345.1| hypothetical protein TBLA_0D03260 [Tetrapisispora blattae CBS 6284]
 gi|387513387|emb|CCH60826.1| hypothetical protein TBLA_0D03260 [Tetrapisispora blattae CBS 6284]
          Length = 768

 Score = 82.8 bits (203), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 60/185 (32%), Positives = 89/185 (48%), Gaps = 42/185 (22%)

Query: 919  RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFR------------F 966
            +KL LV+DLD T+++      VDP   E      + D + P+    +             
Sbjct: 181  KKLILVVDLDQTVIHCG----VDPTIGEW-----KNDPKNPNYETLKDVRSFSLDEEPIL 231

Query: 967  P--HMG---------MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
            P  +MG          + K+RPG+  F  + + L+EMH+YTM  + YA E+AK++DP G 
Sbjct: 232  PPSYMGPRPPVRKCWYYVKVRPGLKEFFAKIAPLYEMHIYTMATRAYALEIAKIIDPDGS 291

Query: 1016 LFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVE 1074
            LF  R++SR ++G            K LE +    +S V+IIDD   VW     NLI V 
Sbjct: 292  LFGSRILSRDENGS--------LTQKSLERLFPTDQSMVIIIDDRGDVWNWCN-NLIKVI 342

Query: 1075 RYTYF 1079
             Y +F
Sbjct: 343  PYNFF 347


>gi|225556539|gb|EEH04827.1| RNA polymerase II C-terminal domain phosphatase component
            [Ajellomyces capsulatus G186AR]
          Length = 871

 Score = 82.4 bits (202), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 59/170 (34%), Positives = 88/170 (51%), Gaps = 32/170 (18%)

Query: 924  VLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR------HLFRF----PHM-GMW 972
            V+DLD T++++     VDP   E      +QD++ P+         F+     P M G W
Sbjct: 134  VVDLDQTIIHAT----VDPTVAEW-----QQDKDNPNHEAVKDVRAFQLVDDGPGMKGCW 184

Query: 973  --TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1030
               KLRPG+  FL   S LFE+H+YTMG + YA  +A ++DP   +F  R++SR + G  
Sbjct: 185  YYIKLRPGLEEFLRNISTLFELHIYTMGTRAYAQHIASIVDPDRKIFGDRILSRDESGS- 243

Query: 1031 FDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERYTYF 1079
                     +K+L+ +  +++  VVIIDD   VW     NLI V  Y +F
Sbjct: 244  -------LTAKNLQRLFPVDTKMVVIIDDRGDVWKWTD-NLIKVVPYDFF 285


>gi|296810642|ref|XP_002845659.1| RNA polymerase II subunit A C-terminal domain phosphatase
            [Arthroderma otae CBS 113480]
 gi|238843047|gb|EEQ32709.1| RNA polymerase II subunit A C-terminal domain phosphatase
            [Arthroderma otae CBS 113480]
          Length = 832

 Score = 82.4 bits (202), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 58/170 (34%), Positives = 90/170 (52%), Gaps = 32/170 (18%)

Query: 924  VLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR------HLFRF----PHM-GMW 972
            V+DLD T++++     VDP   E      +QD++ P+         F+     P M G W
Sbjct: 134  VVDLDQTIIHAT----VDPTVAEW-----QQDKDNPNHDAVKDVRCFQLVDDGPGMRGCW 184

Query: 973  --TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1030
               KLRPG+  FL+  S L+E+H+YTMG + YA  +A ++DP   +F  R++SR + G  
Sbjct: 185  YYIKLRPGLEEFLKVVSSLYELHIYTMGTRAYAQNVANIVDPDRKIFGDRILSRDESGS- 243

Query: 1031 FDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERYTYF 1079
                     +K+L  +  +++  VVIIDD   VW  ++ NLI V  Y +F
Sbjct: 244  -------LTAKNLHRLFPVDTKMVVIIDDRGDVWKWSE-NLIKVTPYDFF 285


>gi|452820283|gb|EME27327.1| phosphoprotein phosphatase [Galdieria sulphuraria]
          Length = 734

 Score = 82.4 bits (202), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 85/282 (30%), Positives = 128/282 (45%), Gaps = 52/282 (18%)

Query: 834  MVSQNSPIQPGQIKSGADMKAVVTNHDDKQTGTGS--GPEAGPVGAHP----------QS 881
            +V+QN  IQP Q+    D    V +H  +  G  +  G E     A P          ++
Sbjct: 132  VVNQNHVIQPQQMVLSVD----VCDHKVQFNGNCALCGIEMDIYSASPILESPREFSTRT 187

Query: 882  AW--GDVEHLFEGYDDQQKAA-------IQKERTRRLEEQKKMFSARKLCLVLDLDHTLL 932
             W   +  HL   Y   Q          ++ E TRRL  +KK      L LVLDLD+TL+
Sbjct: 188  DWTLSNKHHLNPAYTHPQLRVSRNELELVEGENTRRLLRRKK------LSLVLDLDNTLI 241

Query: 933  NS---AKF-HEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGM-----WTKLRPGIWTFL 983
            ++   + F  E      EI ++  E+  E     +     + +       KLRP +  FL
Sbjct: 242  HATLVSHFPQEWYQYKQEIYQQATEKALECSAPLMEDIHELDLDGSISLVKLRPNVRRFL 301

Query: 984  ERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDL 1043
            E+  + +E+H+YTMG++ YA  +A +LDP G LF  R++SR       D  E +   K L
Sbjct: 302  EKIHQRYELHIYTMGSRSYADAIATLLDPSGNLFQRRIVSRD------DFVEGMMNRKSL 355

Query: 1044 EGVLGM-ESAVVIIDDSVRVW-PHNKL----NLIVVERYTYF 1079
              +    +S V+I+DD   VW  HN+     NLI  + Y +F
Sbjct: 356  RRIFPCDDSMVIIVDDREDVWMDHNQGEMVPNLIRAKPYLFF 397


>gi|384488044|gb|EIE80224.1| hypothetical protein RO3G_04929 [Rhizopus delemar RA 99-880]
          Length = 433

 Score = 82.4 bits (202), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 47/133 (35%), Positives = 77/133 (57%), Gaps = 17/133 (12%)

Query: 900  AIQKERTRRLEEQ--KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDRE 957
             + +    RLE++  K++  +RKL L+LDLD T+++++     DP    I   K E+ R+
Sbjct: 9    TVSRSEAERLEKENAKRLLESRKLSLILDLDQTIVHAS----CDP---RISHWKNEEIRQ 61

Query: 958  KPHRHLFRFPH--MGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
                  F  P      + KLRPG+  FL+    L+++H+YTMG K YA  +A+ +DP+G 
Sbjct: 62   ------FTLPKSPTMYYIKLRPGLREFLKEIENLYDLHIYTMGTKDYAKAVAREMDPEGS 115

Query: 1016 LFAGRVISRGDDG 1028
            LF  R++SR ++G
Sbjct: 116  LFKERILSRDENG 128


>gi|154284394|ref|XP_001542992.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
 gi|150406633|gb|EDN02174.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
          Length = 654

 Score = 82.4 bits (202), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 59/170 (34%), Positives = 88/170 (51%), Gaps = 32/170 (18%)

Query: 924  VLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR------HLFRF----PHM-GMW 972
            V+DLD T++++     VDP   E      +QD++ P+         F+     P M G W
Sbjct: 89   VVDLDQTIIHAT----VDPTVAEW-----QQDKDNPNHEAVKDVRAFQLVDDGPGMKGCW 139

Query: 973  --TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1030
               KLRPG+  FL   S LFE+H+YTMG + YA  +A ++DP   +F  R++SR + G  
Sbjct: 140  YYIKLRPGLEEFLRNISTLFELHIYTMGTRAYAQHIASIVDPDRKIFGDRILSRDESGS- 198

Query: 1031 FDGDERVPKSKDLEGVLGMES-AVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
                     +K+L+ +  +++  VVIIDD   VW     NLI V  Y +F
Sbjct: 199  -------LTAKNLQRLFPVDTKMVVIIDDRGDVWKWTD-NLIKVVPYDFF 240


>gi|242093894|ref|XP_002437437.1| hypothetical protein SORBIDRAFT_10g027050 [Sorghum bicolor]
 gi|241915660|gb|EER88804.1| hypothetical protein SORBIDRAFT_10g027050 [Sorghum bicolor]
          Length = 271

 Score = 82.4 bits (202), Expect = 1e-12,   Method: Composition-based stats.
 Identities = 67/203 (33%), Positives = 90/203 (44%), Gaps = 54/203 (26%)

Query: 919  RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPH---MGMWTKL 975
            RKL LVLDLDHTLLNS + H+ D    E          ++ H  LFR  +   + M TKL
Sbjct: 8    RKLILVLDLDHTLLNSTRLHQ-DLSALEQRNGFTPDTEDELHMELFRLEYSDNVRMLTKL 66

Query: 976  RPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDE 1035
            RP +  FLE+AS               +T     +DP  V                    
Sbjct: 67   RPFVRGFLEQAS------------SRASTSSRAPIDPAAV-------------------- 94

Query: 1036 RVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLE 1095
                              VI+DD+   WP ++ NLI+++RY YF C+ R+F    PS+ E
Sbjct: 95   ------------------VILDDTDSAWPGHQDNLILMDRYHYFACTCRKFRYNIPSMAE 136

Query: 1096 IDHDERSEDGTLASSLGVRQQLH 1118
               DER  DG+LA  LGV  ++H
Sbjct: 137  QARDEREHDGSLAVVLGVLNRIH 159


>gi|254568460|ref|XP_002491340.1| hypothetical protein [Komagataella pastoris GS115]
 gi|238031137|emb|CAY69060.1| hypothetical protein PAS_chr2-1_0845 [Komagataella pastoris GS115]
 gi|328352145|emb|CCA38544.1| hypothetical protein PP7435_Chr2-0862 [Komagataella pastoris CBS
            7435]
          Length = 733

 Score = 82.4 bits (202), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 67/210 (31%), Positives = 98/210 (46%), Gaps = 46/210 (21%)

Query: 913  KKMFSARKLCLVLDLDHTLL---------------NSAKFHEVDPVHDEILRKK----EE 953
            K++   +KL LV+DLD T++               N+A +  V  V    L+++    E 
Sbjct: 165  KRLLKEKKLSLVVDLDQTVIHATVDPTVGEWMKDPNNANYPAVKDVRSFSLKEEVILPEN 224

Query: 954  QDREKPHRHLFRFPHMGMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLD 1011
               +KP       P    W   KLRP +  FLE  S+ +E+H+YTM  + YA E+AK++D
Sbjct: 225  YVGQKP-------PATVCWYYVKLRPHLREFLEHVSERYELHIYTMATRQYAKEIAKIID 277

Query: 1012 PKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNL 1070
            P    F  R++SR + G            K L+ +  ++ S VV+IDD   VW  +  NL
Sbjct: 278  PDEKYFGDRILSRDESGS--------LTQKSLQRLFPVDTSMVVVIDDRGDVWNWSS-NL 328

Query: 1071 IVVERYTYF--------PCSRRQFGLLGPS 1092
            I V  Y +F            RQ  LLGPS
Sbjct: 329  IKVVPYDFFVGIGDINSSFLPRQHALLGPS 358


>gi|239606973|gb|EEQ83960.1| RNA Polymerase II CTD phosphatase Fcp1 [Ajellomyces dermatitidis
            ER-3]
          Length = 901

 Score = 82.0 bits (201), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 59/170 (34%), Positives = 88/170 (51%), Gaps = 32/170 (18%)

Query: 924  VLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR------HLFRF----PHM-GMW 972
            V+DLD T++++     VDP   E      +QD++ P+         F+     P M G W
Sbjct: 134  VVDLDQTIIHAT----VDPTVAEW-----QQDKDNPNHEAVKDVRAFQLVDDGPGMRGCW 184

Query: 973  --TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1030
               KLRPG+  FL   S LFE+H+YTMG + YA  +A ++DP   +F  R++SR + G  
Sbjct: 185  YYIKLRPGLEEFLREISTLFELHIYTMGTRAYAQHIANIVDPDRKIFGDRILSRDESGS- 243

Query: 1031 FDGDERVPKSKDLEGVLGMES-AVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
                     +K+L+ +  +++  VVIIDD   VW     NLI V  Y +F
Sbjct: 244  -------LTAKNLQRLFPVDTKMVVIIDDRGDVWKWTD-NLIKVLPYDFF 285


>gi|294658166|ref|XP_460501.2| DEHA2F03102p [Debaryomyces hansenii CBS767]
 gi|202952923|emb|CAG88814.2| DEHA2F03102p [Debaryomyces hansenii CBS767]
          Length = 795

 Score = 82.0 bits (201), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 67/198 (33%), Positives = 92/198 (46%), Gaps = 38/198 (19%)

Query: 901  IQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKK--------- 951
            I+   T RL  +KK      L LV+DLD T++++     VDP   E              
Sbjct: 152  IEHNTTDRLSREKK------LILVVDLDQTVIHAT----VDPTVGEWQSDPSNPNYPAVK 201

Query: 952  -------EEQDREKPHRHLFRFPHMGMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLY 1002
                   EE     P     + P    W   KLRPG+  FL  AS L+EMH+YTM  + Y
Sbjct: 202  NVRSFCLEEDPIAPPGWTGPKLPPSKCWYYVKLRPGLEEFLRSASDLYEMHIYTMATRNY 261

Query: 1003 ATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVR 1061
            A  +AK++DP+G  F  R++SR + G            K+L+ +  + +S VVIIDD   
Sbjct: 262  ALAIAKIIDPEGEYFGDRILSRDESGS--------LTHKNLKRLFPVDQSMVVIIDDRGD 313

Query: 1062 VWPHNKLNLIVVERYTYF 1079
            VW     NLI V  Y +F
Sbjct: 314  VWQWEN-NLIKVVPYDFF 330


>gi|58271496|ref|XP_572904.1| protein phosphatase [Cryptococcus neoformans var. neoformans JEC21]
 gi|134115316|ref|XP_773956.1| hypothetical protein CNBH4080 [Cryptococcus neoformans var.
            neoformans B-3501A]
 gi|50256584|gb|EAL19309.1| hypothetical protein CNBH4080 [Cryptococcus neoformans var.
            neoformans B-3501A]
 gi|57229163|gb|AAW45597.1| protein phosphatase, putative [Cryptococcus neoformans var.
            neoformans JEC21]
          Length = 955

 Score = 82.0 bits (201), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 47/113 (41%), Positives = 67/113 (59%), Gaps = 12/113 (10%)

Query: 970  GMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD 1027
            G W  TK RPG+  FL+   +L+EMH+YTMG + YA  + KV+DP G +F GR++SR + 
Sbjct: 303  GRWYFTKPRPGLQRFLDEMCQLYEMHVYTMGTRTYADAIVKVIDPDGKIFGGRILSRDES 362

Query: 1028 GDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            G  F        SK+L+ +   + S VV+IDD   VW  +  NL+ V  Y +F
Sbjct: 363  GS-F-------SSKNLKRLFPTDTSMVVVIDDRSDVW-GDCPNLVKVVPYDFF 406


>gi|261194090|ref|XP_002623450.1| RNA Polymerase II CTD phosphatase Fcp1 [Ajellomyces dermatitidis
            SLH14081]
 gi|239588464|gb|EEQ71107.1| RNA Polymerase II CTD phosphatase Fcp1 [Ajellomyces dermatitidis
            SLH14081]
          Length = 901

 Score = 82.0 bits (201), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 59/170 (34%), Positives = 88/170 (51%), Gaps = 32/170 (18%)

Query: 924  VLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR------HLFRF----PHM-GMW 972
            V+DLD T++++     VDP   E      +QD++ P+         F+     P M G W
Sbjct: 134  VVDLDQTIIHAT----VDPTVAEW-----QQDKDNPNHEAVKDVRAFQLVDDGPGMRGCW 184

Query: 973  --TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1030
               KLRPG+  FL   S LFE+H+YTMG + YA  +A ++DP   +F  R++SR + G  
Sbjct: 185  YYIKLRPGLEEFLREISTLFELHIYTMGTRAYAQHIANIVDPDRKIFGDRILSRDESGS- 243

Query: 1031 FDGDERVPKSKDLEGVLGMES-AVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
                     +K+L+ +  +++  VVIIDD   VW     NLI V  Y +F
Sbjct: 244  -------LTAKNLQRLFPVDTKMVVIIDDRGDVWKWTD-NLIKVLPYDFF 285


>gi|146421209|ref|XP_001486555.1| hypothetical protein PGUG_02226 [Meyerozyma guilliermondii ATCC 6260]
          Length = 732

 Score = 81.6 bits (200), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 61/184 (33%), Positives = 95/184 (51%), Gaps = 28/184 (15%)

Query: 913  KKMFSARKLCLVLDLDHTLLN-----SAKFHEVDPVHDEILRKKE------EQDREKPHR 961
            +++ S RKL LV+DLD T+++     +    ++DP++      K+      E+D   P  
Sbjct: 110  ERLTSERKLILVVDLDQTVIHATVDPTVGEWQLDPLNPNYRAVKDVRSFCLEEDPIAPPG 169

Query: 962  HLFRFPHM-----GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 1016
              +  P M       + K+RPG+  FL+R S+L+EMH+YTM  + YA  +A ++DP G  
Sbjct: 170  --WSGPKMTPTKCWYYVKVRPGLEDFLKRVSQLYEMHVYTMATRNYALAIAHIIDPDGRY 227

Query: 1017 FAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMES-AVVIIDDSVRVWPHNKLNLIVVER 1075
            F  R++SR + G            K+L  +  ++   VVIIDD   VW   K NLI V  
Sbjct: 228  FGDRILSRDESGS--------LTHKNLRRLFPVDQLMVVIIDDRGDVWQWEK-NLIKVVP 278

Query: 1076 YTYF 1079
            Y +F
Sbjct: 279  YEFF 282


>gi|392578708|gb|EIW71836.1| hypothetical protein TREMEDRAFT_67978 [Tremella mesenterica DSM 1558]
          Length = 944

 Score = 81.3 bits (199), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 44/109 (40%), Positives = 65/109 (59%), Gaps = 10/109 (9%)

Query: 972  WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPF 1031
            +TK RPG+  FLE  +KL+EMH+YTMG + YA  +  ++DP+G  F GR++SR D     
Sbjct: 354  FTKPRPGLAKFLEEMNKLYEMHVYTMGTRTYAEAIVGIVDPEGKYFGGRILSRDDS---- 409

Query: 1032 DGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
                R   +K+L+ +   + S VV+IDD   VW  +  NL+ V  Y +F
Sbjct: 410  ----RNFTTKNLKRLFPTDTSMVVVIDDRADVW-GDCPNLVKVRPYDFF 453


>gi|299470348|emb|CBN78397.1| Similar to RNA Polymerase II CTD phosphatase Fcp1, putative
            [Ectocarpus siliculosus]
          Length = 985

 Score = 80.9 bits (198), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 64/209 (30%), Positives = 95/209 (45%), Gaps = 30/209 (14%)

Query: 884  GDVEHLFEGYDDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHE---- 939
            GD   +            +  R   + +  ++ +++KL LVLDLD+TLL+ +   +    
Sbjct: 223  GDTHQVLMKGGKMMSVTAEGRRMMHMNKSGRLLNSKKLSLVLDLDNTLLHCSDHPDAGRV 282

Query: 940  VDPVHDEILRKKEEQDREKPHRHLFRFPHMG--MWTKLRPGIWTFLERASKLFEMHLYTM 997
            V P  D I              H  R P+     + KLRPG+  FL +A+ +FEM +YT 
Sbjct: 283  VVPGVDGI--------------HALRLPNQQREYYIKLRPGLRRFLAQAATMFEMTIYTA 328

Query: 998  GNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVL--GMESAVVI 1055
            G   YA  +A VLDP   LF GR  S     D          +K LE +   G++ A +I
Sbjct: 329  GTSQYADAVASVLDPDRSLFQGRHFSTCYTPDLGR------NTKSLERIFPNGLDMA-LI 381

Query: 1056 IDDSVRVWPHNKL-NLIVVERYTYFPCSR 1083
            +DD   VW   +  NL++V  Y +F   R
Sbjct: 382  VDDRDDVWRGEQAKNLLLVRPYKFFVGQR 410


>gi|268566337|ref|XP_002639695.1| C. briggsae CBR-FCP-1 protein [Caenorhabditis briggsae]
          Length = 723

 Score = 80.9 bits (198), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 55/181 (30%), Positives = 96/181 (53%), Gaps = 27/181 (14%)

Query: 911  EQKKMFSARKLCLVLDLDHTLLNSA-KFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
            ++  + + RKL L++DLD T+++++ K   VD         ++ ++R KP  +   F H 
Sbjct: 133  DETNLITTRKLVLLVDLDQTIIHTSDKPMSVDA--------EKRRNRVKPQDNNLNFQHK 184

Query: 970  GMW----------TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAG 1019
             +           TKLRP    FL + S ++EMH+ T G + YA  +A++LDP   LF  
Sbjct: 185  DITKYNLHSRVYTTKLRPHTTEFLNKMSAMYEMHIVTYGQRQYAHRIAQILDPDARLFGQ 244

Query: 1020 RVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTY 1078
            R++SR    + F       K+++L+ +    ++ VVIIDD   VW +++  LI ++ Y +
Sbjct: 245  RILSR---DELFSAQH---KTRNLKALFPCGDNLVVIIDDRADVWQYSE-ALIQIKPYRF 297

Query: 1079 F 1079
            F
Sbjct: 298  F 298


>gi|383859141|ref|XP_003705055.1| PREDICTED: RNA polymerase II subunit A C-terminal domain
            phosphatase-like isoform 2 [Megachile rotundata]
          Length = 759

 Score = 80.9 bits (198), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 51/171 (29%), Positives = 88/171 (51%), Gaps = 20/171 (11%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
            E+++++ + RKL L++DLD T++++     + P   ++            H  L+     
Sbjct: 142  EDEQRLLNDRKLALLVDLDQTIVHTTN-DNIPPNMKDVY-----------HYQLYGPNSP 189

Query: 970  GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
               T+LRP    FL   S+L+E+H+ T G + YA  +A +LD  G+LF+ R++SR +  D
Sbjct: 190  WYHTRLRPNTRHFLSEMSRLYELHICTFGARNYAHTVASLLDKDGILFSNRILSRDECFD 249

Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            P        K+ +L+ +    +  V IIDD   VW     NL+ V+ Y +F
Sbjct: 250  P------ASKTANLKALFPCGDDLVCIIDDREDVW-QGCGNLVQVKPYHFF 293


>gi|383859139|ref|XP_003705054.1| PREDICTED: RNA polymerase II subunit A C-terminal domain
            phosphatase-like isoform 1 [Megachile rotundata]
          Length = 760

 Score = 80.9 bits (198), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 51/171 (29%), Positives = 88/171 (51%), Gaps = 20/171 (11%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
            E+++++ + RKL L++DLD T++++     + P   ++            H  L+     
Sbjct: 142  EDEQRLLNDRKLALLVDLDQTIVHTTN-DNIPPNMKDVY-----------HYQLYGPNSP 189

Query: 970  GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
               T+LRP    FL   S+L+E+H+ T G + YA  +A +LD  G+LF+ R++SR +  D
Sbjct: 190  WYHTRLRPNTRHFLSEMSRLYELHICTFGARNYAHTVASLLDKDGILFSNRILSRDECFD 249

Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            P        K+ +L+ +    +  V IIDD   VW     NL+ V+ Y +F
Sbjct: 250  P------ASKTANLKALFPCGDDLVCIIDDREDVW-QGCGNLVQVKPYHFF 293


>gi|365758888|gb|EHN00710.1| Fcp1p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
          Length = 677

 Score = 80.9 bits (198), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 60/191 (31%), Positives = 88/191 (46%), Gaps = 48/191 (25%)

Query: 919  RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFR------------F 966
            +KL LV+DLD T+++      VDP   E      + D   P+    R             
Sbjct: 123  KKLILVVDLDQTIIHCG----VDPTIAEW-----KNDPNNPNFETLRDVKSFTLDEELVL 173

Query: 967  PHMGM-----------------WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKV 1009
            P M M                 + K+RPG+  F ++ + LFEMH+YTM  + YA ++AK+
Sbjct: 174  PLMYMNEDGSVLKPPPVRKCWYYVKVRPGLKEFFDKVAPLFEMHIYTMATRAYAIQIAKI 233

Query: 1010 LDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKL 1068
            +DP G LF  R++SR ++G           +K L  +    +S VV+IDD   VW     
Sbjct: 234  VDPTGELFGDRILSRDENGS--------LTTKSLAKLFPTDQSMVVVIDDRGDVWNWCP- 284

Query: 1069 NLIVVERYTYF 1079
            NLI V  Y +F
Sbjct: 285  NLIKVVPYNFF 295


>gi|378756636|gb|EHY66660.1| hypothetical protein NERG_00300 [Nematocida sp. 1 ERTm2]
          Length = 507

 Score = 80.9 bits (198), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 45/110 (40%), Positives = 62/110 (56%), Gaps = 10/110 (9%)

Query: 971  MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1030
             + KLR  +  FL+ A K  EMH+YTMGNK YAT + K+LDP G LF  R+I+R D+   
Sbjct: 203  YYVKLRDRLEWFLKEAEKYCEMHIYTMGNKAYATAIVKILDPTGKLFGSRIITRDDNFGC 262

Query: 1031 FDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            FD        KD++ +    S  V+I+DD   VW     NL  ++ Y +F
Sbjct: 263  FD--------KDIKRLFPTNSKHVIILDDRPDVWGFVD-NLYPIKPYYFF 303


>gi|350579777|ref|XP_003122350.3| PREDICTED: RNA polymerase II subunit A C-terminal domain
            phosphatase-like, partial [Sus scrofa]
          Length = 284

 Score = 80.9 bits (198), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 46/131 (35%), Positives = 78/131 (59%), Gaps = 13/131 (9%)

Query: 902  QKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR 961
            Q E+  R E+Q+++   RKL L++DLD TL+++ + H            ++  ++   H 
Sbjct: 162  QAEKLGR-EDQQRLHRNRKLVLMVDLDQTLIHTTEQH-----------CQQMSNKGIFHF 209

Query: 962  HLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
             L R   M + T+LRP    FLE+ ++L+E+H++T G++LYA  +A  LDP+  LF+ R+
Sbjct: 210  QLGRGEPM-LHTRLRPHCKEFLEKIAQLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRI 268

Query: 1022 ISRGDDGDPFD 1032
            +SR +  DPF 
Sbjct: 269  LSRDECIDPFS 279


>gi|390333352|ref|XP_791406.3| PREDICTED: RNA polymerase II subunit A C-terminal domain
            phosphatase-like [Strongylocentrotus purpuratus]
          Length = 673

 Score = 80.9 bits (198), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 54/170 (31%), Positives = 91/170 (53%), Gaps = 19/170 (11%)

Query: 911  EQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMG 970
            ++  +   RKL L++DLD TL+++     +D V  ++      Q R+ P   +F + H  
Sbjct: 22   DEDSLIKHRKLVLLVDLDQTLIHTT----LDEVPADMPGVHHFQLRKGP---MFPWYH-- 72

Query: 971  MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1030
              T++R     FL+  S+ +++H++TMG +LYA  +A+++DP+G  F+ R++SR +  DP
Sbjct: 73   --TRIRDNYQQFLDLISQFYQLHIFTMGVRLYAHTVAEIIDPEGKFFSHRILSRDECVDP 130

Query: 1031 FDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
                    K  +L  +    +  V IIDD   VW +   NLI V  Y YF
Sbjct: 131  HS------KKANLRSIFPRGDKMVCIIDDRDDVW-NFAPNLIQVPPYRYF 173


>gi|255732778|ref|XP_002551312.1| hypothetical protein CTRG_05610 [Candida tropicalis MYA-3404]
 gi|240131053|gb|EER30614.1| hypothetical protein CTRG_05610 [Candida tropicalis MYA-3404]
          Length = 818

 Score = 80.9 bits (198), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 65/207 (31%), Positives = 97/207 (46%), Gaps = 48/207 (23%)

Query: 897  QKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDR 956
            + A I+   T RL ++KK      L LV+DLD T++++     VDP   E      + D 
Sbjct: 148  EAAKIEHSTTDRLIDEKK------LILVVDLDQTVIHAT----VDPTVGEW-----QSDP 192

Query: 957  EKPHRHLFR------------------FPHMG-----MWTKLRPGIWTFLERASKLFEMH 993
              P+    +                   P +       + KLRPG+  FLER S+ +EMH
Sbjct: 193  SNPNYRAVKDVRSFCLEEQPIVPPGWTGPKLAPTKCTYYVKLRPGLSEFLERMSEKYEMH 252

Query: 994  LYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESA 1052
            +YTM  + YA  +AK++DP+G  F  R++SR + G            K+L+ +  + +S 
Sbjct: 253  IYTMATRNYALAIAKIIDPEGKYFGDRILSRDESGS--------LTHKNLKRLFPVDQSM 304

Query: 1053 VVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            V IIDD   VW     NLI V  Y +F
Sbjct: 305  VAIIDDRGDVWQWES-NLIKVVPYDFF 330


>gi|339254478|ref|XP_003372462.1| conserved hypothetical protein [Trichinella spiralis]
 gi|316967111|gb|EFV51594.1| conserved hypothetical protein [Trichinella spiralis]
          Length = 683

 Score = 80.5 bits (197), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 56/174 (32%), Positives = 93/174 (53%), Gaps = 26/174 (14%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHE----VDPVHDEILRKKEEQDREKPHRHLFR 965
            E+ K + S +KL L++DLD TL+++++  +    +D  H ++      +    P  H   
Sbjct: 221  EDSKNLLSQKKLALLVDLDLTLIHTSETSDDSDALDVYHYQM------EGPNSPWYH--- 271

Query: 966  FPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG 1025
                   T+LRP    FL++ ++ FE+H+ T GN+ YA ++ K+LDP  VLF  R++SR 
Sbjct: 272  -------TRLRPYARYFLKKINEYFELHIITHGNRKYAEKVVKMLDPNNVLFGDRILSRD 324

Query: 1026 DDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            +  DP   + + P  K L    G +  V IIDD   VW + + N++ V  Y +F
Sbjct: 325  ECFDP---NMKAPNLKAL--FPGGDDLVCIIDDREDVWNYAE-NVVRVRPYRFF 372


>gi|50294127|ref|XP_449475.1| hypothetical protein [Candida glabrata CBS 138]
 gi|49528789|emb|CAG62451.1| unnamed protein product [Candida glabrata]
          Length = 758

 Score = 80.5 bits (197), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 62/197 (31%), Positives = 96/197 (48%), Gaps = 36/197 (18%)

Query: 901  IQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSA--------KFHEVDPVHDEILRKKE 952
            + K+ T RL+ +KK      L LV+DLD T+++          K    +P ++ +   K 
Sbjct: 154  LDKQITTRLKNEKK------LVLVVDLDQTVIHCGVDPTIGEWKADPSNPNYETLKDVKC 207

Query: 953  EQDREKPHRHLFRFPHMG---------MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYA 1003
                E+P   +    +MG          + K+RPG+  F E+ + L+EMH+YTM  + YA
Sbjct: 208  FSLEEEP---ILPLIYMGPKPPVRTCWYYVKIRPGLKEFFEKIAPLYEMHIYTMATRAYA 264

Query: 1004 TEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRV 1062
             E+AK++DP   LF  R++SR ++G            K L  +    +S VV+IDD   V
Sbjct: 265  LEIAKIIDPDKSLFGDRILSRDENGS--------LTQKSLTRLFPTDQSMVVVIDDRGDV 316

Query: 1063 WPHNKLNLIVVERYTYF 1079
            W     NLI V  Y +F
Sbjct: 317  WNWCP-NLIKVVPYNFF 332


>gi|196002231|ref|XP_002110983.1| hypothetical protein TRIADDRAFT_54465 [Trichoplax adhaerens]
 gi|190586934|gb|EDV26987.1| hypothetical protein TRIADDRAFT_54465 [Trichoplax adhaerens]
          Length = 766

 Score = 80.5 bits (197), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 57/174 (32%), Positives = 93/174 (53%), Gaps = 19/174 (10%)

Query: 911  EQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMG 970
            +  ++ S++KL L++DLD TL+     H      D  L    E+ +     H+F  P   
Sbjct: 218  DMNRLLSSKKLVLIVDLDLTLI-----HTRMASPDIKLSNLTEEKQIYYTCHMF--PGYN 270

Query: 971  MW----TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD 1026
            ++    TKLRP +  FL+ AS LFE+H+ TMG++ YA ++  +LDP G LF  R++SR  
Sbjct: 271  VYHQYLTKLRPHVEEFLKVASTLFELHVVTMGSRSYAQDIVGILDPTGSLFYNRILSRD- 329

Query: 1027 DGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
                 +   ++ KS +L  +  + ++ V IIDD   +W  +  + I V  Y+YF
Sbjct: 330  -----ELKSQLLKSTNLNQLFPLGDNLVCIIDDRPEMWAFHP-SCIPVPPYSYF 377


>gi|344301528|gb|EGW31840.1| hypothetical protein SPAPADRAFT_140004 [Spathaspora passalidarum NRRL
            Y-27907]
          Length = 770

 Score = 80.5 bits (197), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 65/202 (32%), Positives = 95/202 (47%), Gaps = 38/202 (18%)

Query: 897  QKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDP-------------- 942
            +   I++  T RL E+KK      L LV+DLD T++++     VDP              
Sbjct: 155  EATKIEQSTTDRLTEEKK------LILVVDLDQTVIHAT----VDPTVGEWQSDPSNPNY 204

Query: 943  --VHDEILRKKEEQDREKPHRHLFRFPHMGMW--TKLRPGIWTFLERASKLFEMHLYTMG 998
              V D      EE     P+    +      W   K+RPG+  FLE+ S  +EMH+YTM 
Sbjct: 205  PAVKDVKSFCLEEDPITPPNWTGPKLAPTKCWYYVKVRPGLAEFLEQVSNKYEMHIYTMA 264

Query: 999  NKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIID 1057
             + YA  +A ++DP+G  F  R++SR + G            K+L+ +  + +S VVIID
Sbjct: 265  TRNYALAIANIIDPEGKYFGDRILSRDESGS--------LTHKNLKRLFPVDQSMVVIID 316

Query: 1058 DSVRVWPHNKLNLIVVERYTYF 1079
            D   VW     NLI V  Y +F
Sbjct: 317  DRGDVWQWES-NLIKVVPYDFF 337


>gi|399215912|emb|CCF72600.1| unnamed protein product [Babesia microti strain RI]
          Length = 545

 Score = 80.5 bits (197), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 51/171 (29%), Positives = 83/171 (48%), Gaps = 22/171 (12%)

Query: 895  DQQKAAIQKERTRRLEEQK--KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKE 952
            ++   +I     R++EE     +   R LCLVLDLD+TL           +H + L K E
Sbjct: 145  NEASMSISATFVRQMEESNLHSLLIKRLLCLVLDLDNTL-----------IHAKTLDKNE 193

Query: 953  EQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDP 1012
              D     + ++      ++ +LRPG+  FL+  SK ++++L+TMG   +AT    +LDP
Sbjct: 194  VLDSNDDFKAIYFGGRCNLY-RLRPGVSEFLDAMSKYYQLYLFTMGTSEHATAALSLLDP 252

Query: 1013 KGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVW 1063
            +G LF+ R+ SR D       + R   S+      G+   V ++DD    W
Sbjct: 253  QGKLFSNRIFSRSD-----SQNSRKTLSRIFPNYQGI---VCVVDDCEHAW 295


>gi|307212079|gb|EFN87962.1| RNA polymerase II subunit A C-terminal domain phosphatase
            [Harpegnathos saltator]
          Length = 734

 Score = 80.5 bits (197), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 52/173 (30%), Positives = 86/173 (49%), Gaps = 26/173 (15%)

Query: 911  EQKKMFSARKLCLVLDLDHTLLNSAKFH---EVDPVHDEILRKKEEQDREKPHRHLFRFP 967
            +++++   RKL L++DLD T++++   H    +  VH               H  L+   
Sbjct: 144  DEQRLLKDRKLVLLVDLDQTIVHTTNDHIPPNLKDVH---------------HFQLYGPN 188

Query: 968  HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD 1027
                 T+LRP    FL   S L+E+H+ + G ++YA  +A +LD  GVLF+ R++SR + 
Sbjct: 189  SPWYHTRLRPNTRHFLSEMSHLYELHICSFGARIYAHTIASLLDKDGVLFSHRILSRDEC 248

Query: 1028 GDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
             DP        K+ +L+ +    +  V IIDD   VW     NL+ V+ Y +F
Sbjct: 249  FDP------ASKTANLKALFPCGDDLVCIIDDREDVW-QGCGNLVQVKPYHFF 294


>gi|405122085|gb|AFR96852.1| hypothetical protein CNAG_04120 [Cryptococcus neoformans var. grubii
            H99]
          Length = 921

 Score = 80.1 bits (196), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 48/115 (41%), Positives = 67/115 (58%), Gaps = 17/115 (14%)

Query: 970  GMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD 1027
            G W  TK RPG+  FL+  S+L+EMH+YTMG + YA  + KV+DP G +F GR++SR + 
Sbjct: 283  GRWYFTKPRPGLQKFLDEMSQLYEMHVYTMGTRTYADAIVKVIDPDGKIFGGRILSRDES 342

Query: 1028 GDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYTYFPC 1081
            G  F        SK+L+ +   + S VV+IDD   VW  +  NL+ V      PC
Sbjct: 343  GS-F-------SSKNLKRLFPTDTSMVVVIDDRSDVW-GDCPNLVKV-----VPC 383



 Score = 40.8 bits (94), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 24/71 (33%), Positives = 38/71 (53%), Gaps = 2/71 (2%)

Query: 890 FEGYDDQQKAAIQKERTRRLEE--QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEI 947
           FE   D     + K   +RLE   +  + S R+L L++DLD T++++     V    DEI
Sbjct: 135 FEIAHDAMGVTVSKNEAQRLENLTRDALLSTRRLSLIVDLDQTIIHTTVDPTVAEWMDEI 194

Query: 948 LRKKEEQDREK 958
            R++ E D+EK
Sbjct: 195 HREESEDDQEK 205


>gi|159476674|ref|XP_001696436.1| cleavage and polyadenylation factor 6-related protein [Chlamydomonas
            reinhardtii]
 gi|158282661|gb|EDP08413.1| cleavage and polyadenylation factor 6-related protein [Chlamydomonas
            reinhardtii]
          Length = 2174

 Score = 80.1 bits (196), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 64/213 (30%), Positives = 98/213 (46%), Gaps = 25/213 (11%)

Query: 920  KLCLVLDLDHTLLNSAKFHEVD-PVHDEILRKKEEQDREKP--HRHLFRFPHMG--MWTK 974
            +L LV+DLD  L +S    ++D P    ++R+   +    P   R LFR P  G  +W K
Sbjct: 809  RLVLVVDLDGVLADSCWDAQLDGPTAAALVRRAAVEAAALPEDRRELFRLPLEGGALWLK 868

Query: 975  LRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGD 1034
            LRPG   FL RA++ +E+   T   + YA  + ++LDP   LF  RV++ G         
Sbjct: 869  LRPGARAFLARAAERYELWARTRQGRPYADAVVELLDPHQQLFGSRVVAAG--------- 919

Query: 1035 ERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKL--NLIVVERYTYF---PC----SRRQ 1085
              V   + L  +        ++D     W    L  +L+ +  Y YF   PC    +   
Sbjct: 920  --VLAKRLLAALECRAPIAAVLDTPDAAWMGESLSGSLLALPPYAYFAVRPCAPGGAVAA 977

Query: 1086 FGLLGPSLLEIDHDERSEDGTLASSLGVRQQLH 1118
             G+    +LE+D DE +E G LA  L + + LH
Sbjct: 978  SGMASRCMLEVDRDEDAERGALAVGLPLLEALH 1010


>gi|91087589|ref|XP_971974.1| PREDICTED: similar to RNA polymerase II subunit A C-terminal domain
            phosphatase [Tribolium castaneum]
 gi|270010700|gb|EFA07148.1| hypothetical protein TcasGA2_TC010139 [Tribolium castaneum]
          Length = 760

 Score = 80.1 bits (196), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 53/172 (30%), Positives = 81/172 (47%), Gaps = 30/172 (17%)

Query: 914  KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMG--- 970
            ++   RKL L++DLD TL+++   H    + D                 ++RF   G   
Sbjct: 136  RLIRDRKLVLLVDLDQTLIHTTNDHIQPNIKD-----------------IYRFQLYGPNS 178

Query: 971  --MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 1028
               +T+LRPG   FL      +E+H+ T G + YA  +A VLD     F+ R++SR +  
Sbjct: 179  PWYFTRLRPGTHQFLNNIYPFYELHICTFGARNYAHMIAAVLDRDQKFFSNRILSRDECF 238

Query: 1029 DPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            DP        K  +L+ +    ++ V IIDD   VW  N  NLI V+ Y +F
Sbjct: 239  DP------TSKKANLKALFPCGDNMVCIIDDREDVWS-NAANLIHVKPYHFF 283


>gi|195170374|ref|XP_002025988.1| GL10108 [Drosophila persimilis]
 gi|194110852|gb|EDW32895.1| GL10108 [Drosophila persimilis]
          Length = 757

 Score = 80.1 bits (196), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 56/163 (34%), Positives = 86/163 (52%), Gaps = 24/163 (14%)

Query: 920  KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMW--TKLRP 977
            KL L++DLD T++++      D V + I          K   H   +     W  T+LRP
Sbjct: 88   KLVLLVDLDQTVIHTTN----DTVPENI----------KGIYHFQLYGPQSPWYHTRLRP 133

Query: 978  GIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERV 1037
            G   FLER S+L+E+H+ T G + YA  +A++LDP G  F+ R++SR    + F+     
Sbjct: 134  GTAEFLERMSQLYELHICTFGARNYAHMIAQLLDPDGKFFSHRILSR---DECFNA---T 187

Query: 1038 PKSKDLEGVL-GMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
             K+ +L+ +    +S V IIDD   VW +   NLI V+ Y +F
Sbjct: 188  SKTDNLKALFPNGDSMVCIIDDREDVW-NMASNLIQVKPYHFF 229


>gi|388853856|emb|CCF52577.1| related to FCP1-TFIIF interacting component of CTD phosphatase
            [Ustilago hordei]
          Length = 471

 Score = 80.1 bits (196), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 59/198 (29%), Positives = 92/198 (46%), Gaps = 41/198 (20%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR-HLFRFPH 968
            E    + S RKL LV+DLD T++++A    VDP   E +  +   + E       FR   
Sbjct: 20   ETTTHLLSQRKLALVVDLDQTIIHTA----VDPTVGEWMEDESNPNYEALKSVAKFRLGI 75

Query: 969  MG--------------------------MWTKLRPGIWTFLERASKLFEMHLYTMGNKLY 1002
             G                           + KLRPG+   L++ S+ +++H+YTMG + Y
Sbjct: 76   GGEEIKDDDDPPAPKDSAAALKASRACWYYVKLRPGVPEILKKLSEKYQLHVYTMGTRSY 135

Query: 1003 ATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVR 1061
            A  + K++DP   +F  R++SR ++G            K L+ +  M+ S VVIIDD   
Sbjct: 136  ANLVCKLIDPDASIFGNRIVSRNENGSLV--------RKSLDKLFPMDHSMVVIIDDRED 187

Query: 1062 VWPHNKLNLIVVERYTYF 1079
            VW  +  NL+ V  Y +F
Sbjct: 188  VWSKSP-NLLQVVPYEFF 204


>gi|323307594|gb|EGA60861.1| Fcp1p [Saccharomyces cerevisiae FostersO]
          Length = 732

 Score = 79.7 bits (195), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 76/257 (29%), Positives = 106/257 (41%), Gaps = 61/257 (23%)

Query: 857  TNHDDKQTG--TGSGPE--AGPVGAHPQSAWGDVEHLFEGYDDQQKAAIQKERTRRLEEQ 912
             NHD    G  T  G E  A      P    GDV+      +  +     KE  RR    
Sbjct: 116  CNHDIVYGGLCTQCGKEVSADAFDGVPLDVVGDVDLQISETEAIRTGKALKEHLRR---- 171

Query: 913  KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFR------- 965
                  +KL LV+DLD T+++      VDP   E      + D   P+    R       
Sbjct: 172  -----DKKLILVVDLDQTIIHCG----VDPTIAEW-----KNDPNNPNFETLRDVKSFTL 217

Query: 966  -----FPHMGM-----------------WTKLRPGIWTFLERASKLFEMHLYTMGNKLYA 1003
                  P M M                 + K+RPG+  F  + + LFEMH+YTM  + YA
Sbjct: 218  DEELVLPLMYMNDDGSMLRPPPVRKCWYYVKVRPGLKEFFAKVAPLFEMHIYTMATRAYA 277

Query: 1004 TEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRV 1062
             ++AK++DP G LF  R++SR ++G           +K L  +    +S VV+IDD   V
Sbjct: 278  LQIAKIVDPTGELFGDRILSRDENGS--------LTTKSLAKLFPTDQSMVVVIDDRGDV 329

Query: 1063 WPHNKLNLIVVERYTYF 1079
            W     NLI V  Y +F
Sbjct: 330  WNWCP-NLIKVVPYNFF 345


>gi|349580569|dbj|GAA25729.1| K7_Fcp1p [Saccharomyces cerevisiae Kyokai no. 7]
          Length = 732

 Score = 79.7 bits (195), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 76/257 (29%), Positives = 106/257 (41%), Gaps = 61/257 (23%)

Query: 857  TNHDDKQTG--TGSGPE--AGPVGAHPQSAWGDVEHLFEGYDDQQKAAIQKERTRRLEEQ 912
             NHD    G  T  G E  A      P    GDV+      +  +     KE  RR    
Sbjct: 116  CNHDIVYGGLCTQCGKEVSADAFDGVPLDVVGDVDLQISETEAIRTGKALKEHLRR---- 171

Query: 913  KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFR------- 965
                  +KL LV+DLD T+++      VDP   E      + D   P+    R       
Sbjct: 172  -----DKKLILVVDLDQTIIHCG----VDPTIAEW-----KNDPNNPNFETLRDVKSFTL 217

Query: 966  -----FPHMGM-----------------WTKLRPGIWTFLERASKLFEMHLYTMGNKLYA 1003
                  P M M                 + K+RPG+  F  + + LFEMH+YTM  + YA
Sbjct: 218  DEELVLPLMYMNDDGSMLRPPPVRKCWYYVKVRPGLKEFFAKVAPLFEMHIYTMATRAYA 277

Query: 1004 TEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRV 1062
             ++AK++DP G LF  R++SR ++G           +K L  +    +S VV+IDD   V
Sbjct: 278  LQIAKIVDPTGELFGDRILSRDENGS--------LTTKSLTKLFPTDQSMVVVIDDRGDV 329

Query: 1063 WPHNKLNLIVVERYTYF 1079
            W     NLI V  Y +F
Sbjct: 330  WNWCP-NLIKVVPYNFF 345


>gi|190408503|gb|EDV11768.1| TFIIF interacting component of CTD phosphatase [Saccharomyces
            cerevisiae RM11-1a]
          Length = 732

 Score = 79.7 bits (195), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 76/257 (29%), Positives = 106/257 (41%), Gaps = 61/257 (23%)

Query: 857  TNHDDKQTG--TGSGPE--AGPVGAHPQSAWGDVEHLFEGYDDQQKAAIQKERTRRLEEQ 912
             NHD    G  T  G E  A      P    GDV+      +  +     KE  RR    
Sbjct: 116  CNHDIVYGGLCTQCGKEVSADAFDGVPLDVVGDVDLQISETEAIRTGKALKEHLRR---- 171

Query: 913  KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFR------- 965
                  +KL LV+DLD T+++      VDP   E      + D   P+    R       
Sbjct: 172  -----DKKLILVVDLDQTIIHCG----VDPTIAEW-----KNDPNNPNFETLRDVKSFTL 217

Query: 966  -----FPHMGM-----------------WTKLRPGIWTFLERASKLFEMHLYTMGNKLYA 1003
                  P M M                 + K+RPG+  F  + + LFEMH+YTM  + YA
Sbjct: 218  DEELVLPLMYMNDDGSMLRPPPVRKCWYYVKVRPGLKEFFAKVAPLFEMHIYTMATRAYA 277

Query: 1004 TEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRV 1062
             ++AK++DP G LF  R++SR ++G           +K L  +    +S VV+IDD   V
Sbjct: 278  LQIAKIVDPTGELFGDRILSRDENGS--------LTTKSLAKLFPTDQSMVVVIDDRGDV 329

Query: 1063 WPHNKLNLIVVERYTYF 1079
            W     NLI V  Y +F
Sbjct: 330  WNWCP-NLIKVVPYNFF 345


>gi|6323933|ref|NP_014004.1| Fcp1p [Saccharomyces cerevisiae S288c]
 gi|2497216|sp|Q03254.1|FCP1_YEAST RecName: Full=RNA polymerase II subunit A C-terminal domain
            phosphatase; AltName: Full=CTD phosphatase FCP1
 gi|825543|emb|CAA89775.1| unknown [Saccharomyces cerevisiae]
 gi|151945985|gb|EDN64217.1| protein phosphatase [Saccharomyces cerevisiae YJM789]
 gi|256270710|gb|EEU05873.1| Fcp1p [Saccharomyces cerevisiae JAY291]
 gi|259148865|emb|CAY82110.1| Fcp1p [Saccharomyces cerevisiae EC1118]
 gi|285814283|tpg|DAA10178.1| TPA: Fcp1p [Saccharomyces cerevisiae S288c]
 gi|323346974|gb|EGA81251.1| Fcp1p [Saccharomyces cerevisiae Lalvin QA23]
 gi|323353207|gb|EGA85507.1| Fcp1p [Saccharomyces cerevisiae VL3]
 gi|392297449|gb|EIW08549.1| Fcp1p [Saccharomyces cerevisiae CEN.PK113-7D]
          Length = 732

 Score = 79.7 bits (195), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 76/257 (29%), Positives = 106/257 (41%), Gaps = 61/257 (23%)

Query: 857  TNHDDKQTG--TGSGPE--AGPVGAHPQSAWGDVEHLFEGYDDQQKAAIQKERTRRLEEQ 912
             NHD    G  T  G E  A      P    GDV+      +  +     KE  RR    
Sbjct: 116  CNHDIVYGGLCTQCGKEVSADAFDGVPLDVVGDVDLQISETEAIRTGKALKEHLRR---- 171

Query: 913  KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFR------- 965
                  +KL LV+DLD T+++      VDP   E      + D   P+    R       
Sbjct: 172  -----DKKLILVVDLDQTIIHCG----VDPTIAEW-----KNDPNNPNFETLRDVKSFTL 217

Query: 966  -----FPHMGM-----------------WTKLRPGIWTFLERASKLFEMHLYTMGNKLYA 1003
                  P M M                 + K+RPG+  F  + + LFEMH+YTM  + YA
Sbjct: 218  DEELVLPLMYMNDDGSMLRPPPVRKCWYYVKVRPGLKEFFAKVAPLFEMHIYTMATRAYA 277

Query: 1004 TEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRV 1062
             ++AK++DP G LF  R++SR ++G           +K L  +    +S VV+IDD   V
Sbjct: 278  LQIAKIVDPTGELFGDRILSRDENGS--------LTTKSLAKLFPTDQSMVVVIDDRGDV 329

Query: 1063 WPHNKLNLIVVERYTYF 1079
            W     NLI V  Y +F
Sbjct: 330  WNWCP-NLIKVVPYNFF 345


>gi|323332189|gb|EGA73600.1| Fcp1p [Saccharomyces cerevisiae AWRI796]
          Length = 646

 Score = 79.7 bits (195), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 76/257 (29%), Positives = 106/257 (41%), Gaps = 61/257 (23%)

Query: 857  TNHDDKQTG--TGSGPE--AGPVGAHPQSAWGDVEHLFEGYDDQQKAAIQKERTRRLEEQ 912
             NHD    G  T  G E  A      P    GDV+      +  +     KE  RR    
Sbjct: 116  CNHDIVYGGLCTQCGKEVSADAFDGVPLDVVGDVDLQISETEAIRTGKALKEHLRR---- 171

Query: 913  KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFR------- 965
                  +KL LV+DLD T+++      VDP   E      + D   P+    R       
Sbjct: 172  -----DKKLILVVDLDQTIIHCG----VDPTIAEW-----KNDPNNPNFETLRDVKSFTL 217

Query: 966  -----FPHMGM-----------------WTKLRPGIWTFLERASKLFEMHLYTMGNKLYA 1003
                  P M M                 + K+RPG+  F  + + LFEMH+YTM  + YA
Sbjct: 218  DEELVLPLMYMNDDGSMLRPPPVRKCWYYVKVRPGLKEFFAKVAPLFEMHIYTMATRAYA 277

Query: 1004 TEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRV 1062
             ++AK++DP G LF  R++SR ++G           +K L  +    +S VV+IDD   V
Sbjct: 278  LQIAKIVDPTGELFGDRILSRDENGS--------LTTKSLAKLFPTDQSMVVVIDDRGDV 329

Query: 1063 WPHNKLNLIVVERYTYF 1079
            W     NLI V  Y +F
Sbjct: 330  WNWCP-NLIKVVPYNFF 345


>gi|134056779|emb|CAK37687.1| unnamed protein product [Aspergillus niger]
          Length = 788

 Score = 79.3 bits (194), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 44/113 (38%), Positives = 66/113 (58%), Gaps = 12/113 (10%)

Query: 970  GMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD 1027
            G W   KLRPG+ +FL+  S+++E+H+YTMG + YA  +A ++DP   LF  R++SR + 
Sbjct: 175  GCWYYVKLRPGLESFLQNVSEMYELHIYTMGTRSYAQHIASIIDPDRKLFGDRILSRDES 234

Query: 1028 GDPFDGDERVPKSKDLEGVLGMES-AVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            G           +K+L  +  +++  VVIIDD   VW  N  NLI V  Y +F
Sbjct: 235  GSLV--------AKNLHRLFPVDTKMVVIIDDRGDVWRWNP-NLIKVSPYDFF 278


>gi|71004098|ref|XP_756715.1| hypothetical protein UM00568.1 [Ustilago maydis 521]
 gi|46095984|gb|EAK81217.1| hypothetical protein UM00568.1 [Ustilago maydis 521]
          Length = 779

 Score = 79.3 bits (194), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 61/209 (29%), Positives = 96/209 (45%), Gaps = 43/209 (20%)

Query: 901  IQKERTRRL--EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRK-------- 950
            +  E  +RL  E    + S RKL L++DLD T++++     VDP   E +R         
Sbjct: 46   VSAEEAQRLDSETTSHLLSQRKLALIVDLDQTVIHAT----VDPTVGEWMRDESNPNYEA 101

Query: 951  ----------------KEEQDREKPH---RHLFRFPHMGMWTKLRPGIWTFLERASKLFE 991
                            K+E+D  +P      L        + K RPG+   L+  S+ +E
Sbjct: 102  LQSVGKFRLGIDGEEIKDEEDGSEPKDPAAALKASRACWYYVKPRPGVPQVLKHLSEKYE 161

Query: 992  MHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGME- 1050
            +H+YTMG + YA  + K++DP   +F  R++SR ++G            K L  +  ++ 
Sbjct: 162  LHVYTMGTRSYANCVCKLIDPDASIFGNRILSRDENGSLV--------RKSLSRLFPVDH 213

Query: 1051 SAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            S VVIIDD   VW  +  NL+ V  Y +F
Sbjct: 214  SMVVIIDDREDVWSRSP-NLLPVLPYEFF 241


>gi|241953831|ref|XP_002419637.1| RNA polymerase II subunit a c-terminal domain phosphatase, putative
            [Candida dubliniensis CD36]
 gi|223642977|emb|CAX43233.1| RNA polymerase II subunit a c-terminal domain phosphatase, putative
            [Candida dubliniensis CD36]
          Length = 771

 Score = 79.3 bits (194), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 66/207 (31%), Positives = 97/207 (46%), Gaps = 48/207 (23%)

Query: 897  QKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDR 956
            + A I+   T RL ++      RKL LV+DLD T++++     VDP   E      + D 
Sbjct: 148  EAAKIEHNTTDRLIDE------RKLILVVDLDQTVIHAT----VDPTVGEW-----QSDP 192

Query: 957  EKPHR------HLFRFPHMGM----WT-------------KLRPGIWTFLERASKLFEMH 993
              P+         F      +    WT             KLRPG+  FLE+ ++ +EMH
Sbjct: 193  ANPNYAAVKDVKTFCLEEEAIVPPGWTGPKLAPTKCTYYVKLRPGLSEFLEKMAEKYEMH 252

Query: 994  LYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESA 1052
            +YTM  + YA  +AK++DP G  F  R++SR + G            K+L+ +  + +S 
Sbjct: 253  IYTMATRNYALSIAKIIDPDGKYFGDRILSRDESGS--------LTHKNLKRLFPVDQSM 304

Query: 1053 VVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            VVIIDD   VW     NLI V  Y +F
Sbjct: 305  VVIIDDRGDVWQWES-NLIKVVPYDFF 330


>gi|328772741|gb|EGF82779.1| hypothetical protein BATDEDRAFT_22917 [Batrachochytrium dendrobatidis
            JAM81]
          Length = 868

 Score = 79.3 bits (194), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 46/136 (33%), Positives = 69/136 (50%), Gaps = 16/136 (11%)

Query: 895  DQQKAAIQKERTRRLEEQK--KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKE 952
            D     +  +   RLE++   ++   RKL LVLDLD T++++     VDP   E +    
Sbjct: 141  DASGITVSHKEAFRLEKETADRLLDERKLSLVLDLDQTVIHAT----VDPTVGEWM---- 192

Query: 953  EQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDP 1012
              D   P+     FP + +W    PG   FL   +  +EMH+YTMG + YA  ++K+LDP
Sbjct: 193  -ADPNNPN-----FPALTVWATHEPGTREFLRELNAKYEMHIYTMGTRNYAKAVSKILDP 246

Query: 1013 KGVLFAGRVISRGDDG 1028
                F  R++SR D G
Sbjct: 247  DKRYFKDRILSRDDSG 262


>gi|443896478|dbj|GAC73822.1| TFIIF-interacting CTD phosphatases [Pseudozyma antarctica T-34]
          Length = 751

 Score = 79.3 bits (194), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 61/209 (29%), Positives = 96/209 (45%), Gaps = 43/209 (20%)

Query: 901  IQKERTRRL--EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRK-------- 950
            +  E  +RL  E    + S RKL L++DLD T++++     VDP   E +R         
Sbjct: 46   VSAEEAQRLDSESTSHLLSQRKLALIVDLDQTVIHAT----VDPTVGEWMRDDTNPNYDA 101

Query: 951  ----------------KEEQDREKPH---RHLFRFPHMGMWTKLRPGIWTFLERASKLFE 991
                            K++ D   P      L        + K RPG+ T L++ S+ ++
Sbjct: 102  LKSVGKFRLGIDGEEIKDDDDPTAPKDAAAALRASRACWYYVKPRPGVPTILKQLSQKYQ 161

Query: 992  MHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGME- 1050
            +H+YTMG + YA  + K++DP   +F  R++SR ++G            K L  +  ++ 
Sbjct: 162  LHVYTMGTRSYANCVCKLIDPDASIFGNRILSRDENGSLV--------RKSLSRLFPVDH 213

Query: 1051 SAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            S VVIIDD   VW  N  NL+ V  Y +F
Sbjct: 214  SMVVIIDDREDVW-SNSPNLLPVLPYEFF 241


>gi|149241937|ref|XP_001526384.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
            YB-4239]
 gi|146450507|gb|EDK44763.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
            YB-4239]
          Length = 883

 Score = 79.3 bits (194), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 67/227 (29%), Positives = 105/227 (46%), Gaps = 45/227 (19%)

Query: 885  DVEHLFEGYDDQQKAAI-----------QKERTRRLEEQ--KKMFSARKLCLVLDLDHTL 931
            D E  + GYD +++A+I             +   ++E     ++   RKL LV+DLD T+
Sbjct: 117  DDEKDYSGYDYEERASIAMSHDNTELRISYDEAAKIEHNTTDRLNQERKLILVVDLDQTV 176

Query: 932  LNSAKFHEVDP----------------VHDEILRKKEEQDREKPHRHLFRFPHMGMW--T 973
            +++     VDP                V D      EE     P  +  +      W   
Sbjct: 177  IHAT----VDPTVGEWQLDPENPNYPAVKDVRTFCLEEDPVAPPGWNGPKLAPTKCWYYV 232

Query: 974  KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDG 1033
            K+RPG+  FL++  + +EMH+YTM  + YA  +AK++DP+G  F  R++SR + G     
Sbjct: 233  KVRPGLAEFLKKMDEKYEMHIYTMATRNYALSIAKIIDPEGKYFGDRILSRDESGS---- 288

Query: 1034 DERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
                   K+L+ +  + +S VVIIDD   VW     NLI V  Y +F
Sbjct: 289  ----LTHKNLKRLFPVDQSMVVIIDDRGDVWQWEN-NLIKVVPYDFF 330


>gi|68472089|ref|XP_719840.1| potential RNA Pol II CTD phosphatase component [Candida albicans
            SC5314]
 gi|68472324|ref|XP_719723.1| potential RNA Pol II CTD phosphatase component [Candida albicans
            SC5314]
 gi|46441553|gb|EAL00849.1| potential RNA Pol II CTD phosphatase component [Candida albicans
            SC5314]
 gi|46441679|gb|EAL00974.1| potential RNA Pol II CTD phosphatase component [Candida albicans
            SC5314]
          Length = 768

 Score = 79.3 bits (194), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 66/207 (31%), Positives = 97/207 (46%), Gaps = 48/207 (23%)

Query: 897  QKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDR 956
            + A I+   T RL ++      RKL LV+DLD T++++     VDP   E      + D 
Sbjct: 148  EAAKIEHNTTDRLIDE------RKLILVVDLDQTVIHAT----VDPTVGEW-----QSDP 192

Query: 957  EKPHR------HLFRFPHMGM----WT-------------KLRPGIWTFLERASKLFEMH 993
              P+         F      +    WT             KLRPG+  FLE+ ++ +EMH
Sbjct: 193  ANPNYAAVKDVKTFCLEEEAIVPPGWTGPKLAPTKCTYYVKLRPGLSEFLEKMAEKYEMH 252

Query: 994  LYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESA 1052
            +YTM  + YA  +AK++DP G  F  R++SR + G            K+L+ +  + +S 
Sbjct: 253  IYTMATRNYALSIAKIIDPDGKYFGDRILSRDESGS--------LTHKNLKRLFPVDQSM 304

Query: 1053 VVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            VVIIDD   VW     NLI V  Y +F
Sbjct: 305  VVIIDDRGDVWQWES-NLIKVVPYDFF 330


>gi|300701489|ref|XP_002994977.1| hypothetical protein NCER_102325 [Nosema ceranae BRL01]
 gi|239603396|gb|EEQ81306.1| hypothetical protein NCER_102325 [Nosema ceranae BRL01]
          Length = 200

 Score = 79.3 bits (194), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 56/177 (31%), Positives = 87/177 (49%), Gaps = 32/177 (18%)

Query: 903  KERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRH 962
            K++  RL + KK      L LVLDLD T+L           H  I ++  E         
Sbjct: 48   KKKLERLHKNKK------LVLVLDLDQTIL-----------HTTITKEYMEGYSN----- 85

Query: 963  LFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVI 1022
             F    +    K RP +   LE   K +E+H+YTMGNK+YA ++ K++DP       R++
Sbjct: 86   -FIINDISYCVKFRPYLNYMLECLYKKYEIHVYTMGNKVYANKIVKLIDPTRKYIGNRIL 144

Query: 1023 SRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            +R ++G  F         KDL  +  + S VVI+DD   +W ++  NLI+V+ Y ++
Sbjct: 145  TRDENGIGF--------KKDLNRLFSIHSNVVILDDRDDIWDYSD-NLILVKPYFFW 192


>gi|322785368|gb|EFZ12041.1| hypothetical protein SINV_00693 [Solenopsis invicta]
          Length = 759

 Score = 79.0 bits (193), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 54/177 (30%), Positives = 89/177 (50%), Gaps = 32/177 (18%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPH-RHLFRFPH 968
            E+++++   RKL L++DLD T++++                    D   P+ + +F F  
Sbjct: 149  EDEQRLLRDRKLVLLVDLDQTIVHTT------------------NDNIPPNLKDVFHFQL 190

Query: 969  MGM---W--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 1023
             G    W  T+LRP    FL + S L+E+H+ T G ++YA  +A +LD   VLF+ R++S
Sbjct: 191  YGPNSPWYHTRLRPNTRRFLSKMSSLYELHICTFGARIYAHTVASLLDKDKVLFSHRILS 250

Query: 1024 RGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            R +  DP        K+ +L+ +    +  V IIDD   VW     NL+ V+ Y +F
Sbjct: 251  RDECFDP------ASKTANLKALFPCGDDLVCIIDDREDVW-QGCGNLVQVKPYHFF 300


>gi|307168754|gb|EFN61749.1| RNA polymerase II subunit A C-terminal domain phosphatase [Camponotus
            floridanus]
          Length = 721

 Score = 79.0 bits (193), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 53/177 (29%), Positives = 88/177 (49%), Gaps = 32/177 (18%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPH-RHLFRFPH 968
            E+++++   RKL L++DLD T++++                    D   P+ + +F F  
Sbjct: 146  EDEQRLLKDRKLVLLVDLDQTIVHTT------------------NDNIPPNLKDVFHFQL 187

Query: 969  MGM---W--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 1023
             G    W  T+ RP    FL   S L+E+H+ T G ++YA  +A +LD  G+LF+ R++S
Sbjct: 188  YGPNSPWYHTRFRPNTRHFLSEMSHLYELHICTFGARIYAHTVASLLDKDGILFSHRILS 247

Query: 1024 RGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            R +  DP        K+ +L+ +    +  V IIDD   VW     NL+ V+ Y +F
Sbjct: 248  RDECFDP------ASKTANLKALFPCGDDLVCIIDDREDVW-QGCGNLVQVKPYHFF 297


>gi|350413080|ref|XP_003489872.1| PREDICTED: RNA polymerase II subunit A C-terminal domain
            phosphatase-like [Bombus impatiens]
          Length = 751

 Score = 79.0 bits (193), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 52/171 (30%), Positives = 88/171 (51%), Gaps = 20/171 (11%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
            E+++++ + RKL L++DLD T++++      D +   I        ++  H  L+     
Sbjct: 143  EDEQRLLNDRKLALLVDLDQTIVHTTN----DNIPSNI--------KDVYHYQLYGPNSP 190

Query: 970  GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
               T+LRP    FL   S+L+E+H+ T G + YA  +A +LD  G LF+ R++SR +  D
Sbjct: 191  WYHTRLRPNTKHFLSEMSRLYELHICTFGARNYAHTVAALLDKDGTLFSHRILSRDECFD 250

Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            P        K+ +L+ +    +  V IIDD   VW     NL+ V+ Y +F
Sbjct: 251  P------ASKTANLKALFPCGDDLVCIIDDREDVW-QGCGNLVQVKPYHFF 294


>gi|328792425|ref|XP_623605.2| PREDICTED: RNA polymerase II subunit A C-terminal domain
            phosphatase-like [Apis mellifera]
          Length = 745

 Score = 79.0 bits (193), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 52/171 (30%), Positives = 87/171 (50%), Gaps = 20/171 (11%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
            E+++++ + RKL L++DLD T++++     V P   ++            H  L+     
Sbjct: 143  EDEQRLLNDRKLALLVDLDQTIVHTTN-DNVPPNMKDVY-----------HYQLYGPNSP 190

Query: 970  GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
               T+LRP    FL   S+L+E+H+ T G + YA  +A +LD  G LF+ R++SR +  D
Sbjct: 191  WYHTRLRPNTRHFLSEMSRLYELHICTFGARNYAHTVAALLDKDGTLFSHRILSRDECFD 250

Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            P        K+ +L+ +    +  V IIDD   VW     NL+ V+ Y +F
Sbjct: 251  P------ASKTANLKALFPCGDDLVCIIDDREDVW-QGCGNLVQVKPYHFF 294


>gi|340709144|ref|XP_003393173.1| PREDICTED: LOW QUALITY PROTEIN: RNA polymerase II subunit A
            C-terminal domain phosphatase-like [Bombus terrestris]
          Length = 751

 Score = 79.0 bits (193), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 52/171 (30%), Positives = 88/171 (51%), Gaps = 20/171 (11%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
            E+++++ + RKL L++DLD T++++      D +   I        ++  H  L+     
Sbjct: 143  EDEQRLLNDRKLALLVDLDQTIVHTTN----DNIPSNI--------KDVYHYQLYGPNSP 190

Query: 970  GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
               T+LRP    FL   S+L+E+H+ T G + YA  +A +LD  G LF+ R++SR +  D
Sbjct: 191  WYHTRLRPNTKHFLSEMSRLYELHICTFGARNYAHTVAALLDKDGTLFSHRILSRDECFD 250

Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            P        K+ +L+ +    +  V IIDD   VW     NL+ V+ Y +F
Sbjct: 251  P------ASKTANLKALFPCGDDLVCIIDDREDVW-QGCGNLVQVKPYHFF 294


>gi|308500103|ref|XP_003112237.1| CRE-FCP-1 protein [Caenorhabditis remanei]
 gi|308268718|gb|EFP12671.1| CRE-FCP-1 protein [Caenorhabditis remanei]
          Length = 664

 Score = 79.0 bits (193), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 52/172 (30%), Positives = 94/172 (54%), Gaps = 24/172 (13%)

Query: 911  EQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP-HM 969
            ++  + + RKL L++DLD T+++++              K    D EK H+ + ++  H 
Sbjct: 134  DETNLITTRKLVLLVDLDQTIIHTSD-------------KPMSADAEK-HKDITKYNLHS 179

Query: 970  GMWT-KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 1028
             ++T KLRP    FL + + ++EMH+ T G + YA  +A++LDP   LF  R++SR    
Sbjct: 180  RVYTTKLRPHTTEFLNKMAAMYEMHIVTYGQRQYAHRIAQILDPDARLFGQRILSR---D 236

Query: 1029 DPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            + F       K+++L+ +    ++ VVIIDD   VW +++  LI ++ Y +F
Sbjct: 237  ELFSAQH---KTRNLKALFPCGDNLVVIIDDRADVWQYSEA-LIQIKPYRFF 284


>gi|238881126|gb|EEQ44764.1| conserved hypothetical protein [Candida albicans WO-1]
          Length = 525

 Score = 79.0 bits (193), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 66/205 (32%), Positives = 98/205 (47%), Gaps = 44/205 (21%)

Query: 897  QKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDP-------------- 942
            + A I+   T RL ++      RKL LV+DLD T++++     VDP              
Sbjct: 63   EAAKIEHNTTDRLIDE------RKLILVVDLDQTVIHAT----VDPTVGEWQSDPANPNY 112

Query: 943  --VHDEILRKKEEQDREKPHRHLFRFPHMG-----MWTKLRPGIWTFLERASKLFEMHLY 995
              V D      EE+    P    +  P +       + KLRPG+  FLE+ ++ +EMH+Y
Sbjct: 113  AAVKDVKTFCLEEEAIVPPG---WTGPKLAPTKCTYYVKLRPGLSEFLEKMAEKYEMHIY 169

Query: 996  TMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVV 1054
            TM  + YA  +AK++DP G  F  R++SR + G            K+L+ +  + +S VV
Sbjct: 170  TMATRNYALSIAKIIDPDGKYFGDRILSRDESGS--------LTHKNLKRLFPVDQSMVV 221

Query: 1055 IIDDSVRVWPHNKLNLIVVERYTYF 1079
            IIDD   VW     NLI V  Y +F
Sbjct: 222  IIDDRGDVWQWES-NLIKVVPYDFF 245


>gi|380022133|ref|XP_003694908.1| PREDICTED: LOW QUALITY PROTEIN: RNA polymerase II subunit A
            C-terminal domain phosphatase-like [Apis florea]
          Length = 749

 Score = 79.0 bits (193), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 52/171 (30%), Positives = 87/171 (50%), Gaps = 20/171 (11%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
            E+++++ + RKL L++DLD T++++     V P   ++            H  L+     
Sbjct: 143  EDEQRLLNDRKLALLVDLDQTIVHTTN-DNVPPNMKDVY-----------HYQLYGPNSP 190

Query: 970  GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
               T+LRP    FL   S+L+E+H+ T G + YA  +A +LD  G LF+ R++SR +  D
Sbjct: 191  WYHTRLRPNTRHFLSEMSRLYELHICTFGARNYAHTVAALLDKDGTLFSHRILSRDECFD 250

Query: 1030 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            P        K+ +L+ +    +  V IIDD   VW     NL+ V+ Y +F
Sbjct: 251  P------ASKTANLKALFPCGDDLVCIIDDREDVW-QGCGNLVQVKPYHFF 294


>gi|336387157|gb|EGO28302.1| hypothetical protein SERLADRAFT_354339 [Serpula lacrymans var.
            lacrymans S7.9]
          Length = 874

 Score = 78.6 bits (192), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 69/263 (26%), Positives = 107/263 (40%), Gaps = 82/263 (31%)

Query: 890  FEGYDDQQKAAIQKER-----TRRLEEQKK--------MFSARKLCLVLDLDHTLLNSA- 935
            + G+ D  +A+IQ        T  LEE +K        + ++RKL L++DLD T++++  
Sbjct: 117  YTGFSDASRASIQMTHSAFGPTVSLEEAQKIEKETADHLLNSRKLSLIVDLDQTIVHATV 176

Query: 936  --------------------------KFHEVDPVHDEILRKKEEQDREKPH----RHLFR 965
                                      +  E + V DE+    E  D   P+    + + +
Sbjct: 177  DPTVGEWIAEGEAWEGKRAMKMKPPQRSKEDEDVSDEVATDSESDDECNPNWEALKDVRK 236

Query: 966  F----PHMGM------------------------WTKLRPGIWTFLERASKLFEMHLYTM 997
            F       GM                        + K RPG   FL   +  +EMH+YTM
Sbjct: 237  FQLGPESFGMPSSPRASRKVKGKQKFIENEGCMYYIKPRPGWQHFLHSIANKYEMHVYTM 296

Query: 998  GNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVII 1056
            G + YA E+   +DP G +F GR++SR + G            K L+ +   + S VVII
Sbjct: 297  GTRAYAEEVCAAIDPDGTIFGGRILSRDESGS--------LTQKSLQRLFPCDTSMVVII 348

Query: 1057 DDSVRVWPHNKLNLIVVERYTYF 1079
            DD   VW  +  NL+ V  Y +F
Sbjct: 349  DDRADVWEWSP-NLVKVIPYDFF 370


>gi|328859642|gb|EGG08750.1| hypothetical protein MELLADRAFT_115868 [Melampsora larici-populina
            98AG31]
          Length = 736

 Score = 78.2 bits (191), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 54/176 (30%), Positives = 83/176 (47%), Gaps = 44/176 (25%)

Query: 907  RRLEEQ--KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLF 964
            RRLE +   ++    KL L++DLD T++++     VDP                      
Sbjct: 257  RRLESETRSRLLKDTKLSLIVDLDQTIVHAT----VDPT--------------------- 291

Query: 965  RFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISR 1024
                +G W    PG+  FL   ++ +EMH+YTMG + YA  + +++DP   LF  RV+SR
Sbjct: 292  ----VGEWI---PGLSEFLRTLAEKYEMHVYTMGTRAYADAVCRIIDPTSELFGSRVLSR 344

Query: 1025 GDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
             + G            K L  +  ++ S VVIIDD   VW ++  NL+ V  Y +F
Sbjct: 345  DESGS--------MTQKSLTRLFPVDTSMVVIIDDRGDVWEYSP-NLVSVVPYNFF 391


>gi|308464266|ref|XP_003094401.1| hypothetical protein CRE_07009 [Caenorhabditis remanei]
 gi|308247823|gb|EFO91775.1| hypothetical protein CRE_07009 [Caenorhabditis remanei]
          Length = 754

 Score = 78.2 bits (191), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 51/170 (30%), Positives = 92/170 (54%), Gaps = 21/170 (12%)

Query: 911  EQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMG 970
            ++  + + RKL L++DLD T+++++         D+++    E+ ++    +L    H  
Sbjct: 230  DETNLITTRKLVLLVDLDQTIIHTS---------DKLMSADAEKHKDITKYNL----HSR 276

Query: 971  MWT-KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
            ++T KLRP    FL + S ++EMH+ T G + YA  +AK+LDP   LF  R++SR +   
Sbjct: 277  VYTTKLRPHTTEFLNKMSAMYEMHIVTFGERKYALRIAKILDPDARLFGQRILSRNE--- 333

Query: 1030 PFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
                 +   ++K L      ++ VVIIDD   VW +++  LI ++ Y +F
Sbjct: 334  -LSSAQHKTENKAL--FPCGDNLVVIIDDRADVWQYSEA-LIQIKPYRFF 379


>gi|323508124|emb|CBQ67995.1| related to FCP1-TFIIF interacting component of CTD phosphatase
            [Sporisorium reilianum SRZ2]
          Length = 773

 Score = 77.4 bits (189), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 59/209 (28%), Positives = 95/209 (45%), Gaps = 43/209 (20%)

Query: 901  IQKERTRRL--EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRK-------- 950
            +  E  +RL  E    + S RKL L++DLD T++++     VDP   E +R         
Sbjct: 46   VSAEEAQRLDSESTSHLLSQRKLALIVDLDQTVIHAT----VDPTVGEWMRDESNPNYDA 101

Query: 951  ----------------KEEQDREKPH---RHLFRFPHMGMWTKLRPGIWTFLERASKLFE 991
                            K++ D   P      L        + K RPG+   L++ S+ ++
Sbjct: 102  LQSVGKFRLGIDGEEIKDDDDESAPRDSAAALRASRACWYYVKPRPGVPKVLKQLSEKYQ 161

Query: 992  MHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGME- 1050
            +H+YTMG + YA  + K++DP   +F  R++SR ++G            K L  +  ++ 
Sbjct: 162  LHVYTMGTRSYANCVCKLIDPDASIFGNRILSRDENGSLV--------RKSLSRLFPVDH 213

Query: 1051 SAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            S VVIIDD   VW  +  NL+ V  Y +F
Sbjct: 214  SMVVIIDDREDVWSRSP-NLLPVLPYEFF 241


>gi|207342073|gb|EDZ69950.1| YMR277Wp-like protein [Saccharomyces cerevisiae AWRI1631]
          Length = 544

 Score = 77.4 bits (189), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 42/109 (38%), Positives = 62/109 (56%), Gaps = 10/109 (9%)

Query: 972  WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPF 1031
            + K+RPG+  F  + + LFEMH+YTM  + YA ++AK++DP G LF  R++SR ++G   
Sbjct: 58   YVKVRPGLKEFFAKVAPLFEMHIYTMATRAYALQIAKIVDPTGELFGDRILSRDENGS-- 115

Query: 1032 DGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
                    +K L  +    +S VV+IDD   VW     NLI V  Y +F
Sbjct: 116  ------LTTKSLAKLFPTDQSMVVVIDDRGDVWNWCP-NLIKVVPYNFF 157


>gi|354545519|emb|CCE42247.1| hypothetical protein CPAR2_807960 [Candida parapsilosis]
          Length = 786

 Score = 77.4 bits (189), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 65/202 (32%), Positives = 93/202 (46%), Gaps = 38/202 (18%)

Query: 897  QKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDP-------------- 942
            + A I+   T RL ++KK      L LV+DLD T++++     VDP              
Sbjct: 148  EAAKIEHSTTDRLNDEKK------LILVVDLDQTVIHAT----VDPTVGEWQSDPSNPNY 197

Query: 943  --VHDEILRKKEEQDREKPHRHLFRFPHMGMW--TKLRPGIWTFLERASKLFEMHLYTMG 998
              V D      EE     P     +      W   K+RPG+  FLE+    +EMH+YTM 
Sbjct: 198  PAVKDVKTFCLEEDPIVPPGWTGPKLAPTKCWYYVKVRPGLSEFLEKMDTKYEMHIYTMA 257

Query: 999  NKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIID 1057
             + YA  +AK++DP G  F  R++SR + G            K+L+ +  + +S VVIID
Sbjct: 258  TRNYALAIAKIIDPDGKYFGDRILSRDESGS--------LTHKNLKRLFPVDQSMVVIID 309

Query: 1058 DSVRVWPHNKLNLIVVERYTYF 1079
            D   VW     NLI V  Y +F
Sbjct: 310  DRGDVWQWEN-NLIKVVPYDFF 330


>gi|242015474|ref|XP_002428378.1| RNA polymerase II ctd phosphatase, putative [Pediculus humanus
            corporis]
 gi|212512990|gb|EEB15640.1| RNA polymerase II ctd phosphatase, putative [Pediculus humanus
            corporis]
          Length = 781

 Score = 77.0 bits (188), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 52/172 (30%), Positives = 87/172 (50%), Gaps = 24/172 (13%)

Query: 911  EQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMG 970
            ++ ++ + RKL L++DLD TL+++     + P   ++              H   +  M 
Sbjct: 133  DENRLLNDRKLVLLVDLDQTLIHTTN-DNIPPNLKDVY-------------HFRLYGQMS 178

Query: 971  MW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 1028
             W  T++RP    FLE  SK +E+H+ T G + YA  +A  LDP G  F+ R++SR    
Sbjct: 179  PWYHTRIRPRTHKFLEEISKYYELHICTFGARNYAHMIAMFLDPDGKYFSHRILSR---D 235

Query: 1029 DPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            + F+ +    K+ +L+ +    ++ V IIDD   VW     NLI V+ Y +F
Sbjct: 236  ECFNAN---SKTANLKALFPCGDNMVCIIDDREDVWNF-AANLIHVKPYHFF 283


>gi|448520991|ref|XP_003868400.1| Fcp1 protein [Candida orthopsilosis Co 90-125]
 gi|380352740|emb|CCG25496.1| Fcp1 protein [Candida orthopsilosis]
          Length = 788

 Score = 76.3 bits (186), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 64/202 (31%), Positives = 93/202 (46%), Gaps = 38/202 (18%)

Query: 897  QKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDP-------------- 942
            + A I+   T RL E++K      L LV+DLD T++++     VDP              
Sbjct: 148  EAAKIEHSTTDRLNEEEK------LILVVDLDQTVIHAT----VDPTVGEWQSDPSNPNY 197

Query: 943  --VHDEILRKKEEQDREKPHRHLFRFPHMGMW--TKLRPGIWTFLERASKLFEMHLYTMG 998
              V D      EE     P     +      W   K+RPG+  FL++    +EMH+YTM 
Sbjct: 198  PAVKDVKTFCLEEDPIVPPGWTGPKLAPTKCWYYVKVRPGLSEFLQKMDTKYEMHIYTMA 257

Query: 999  NKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIID 1057
             + YA  +AK++DP G  F  R++SR + G            K+L+ +  + +S VVIID
Sbjct: 258  TRNYALAIAKIIDPDGKYFGDRILSRDESGS--------LTHKNLKRLFPVDQSMVVIID 309

Query: 1058 DSVRVWPHNKLNLIVVERYTYF 1079
            D   VW     NLI V  Y +F
Sbjct: 310  DRGDVWQWEN-NLIKVVPYDFF 330


>gi|313234471|emb|CBY24671.1| unnamed protein product [Oikopleura dioica]
          Length = 614

 Score = 76.3 bits (186), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 52/181 (28%), Positives = 90/181 (49%), Gaps = 31/181 (17%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
            EE +++   RKL L++DLD T++++ +        + I  +   QD              
Sbjct: 62   EEIQRLHDNRKLVLLVDLDQTVIHTTQNRPKKLTKNTISFQLTRQD-------------P 108

Query: 970  GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKV----------LDPKGVLFAG 1019
             +WT+LRP    F+   S+ +E+H+ T G++ YA ++A++          LD     F+ 
Sbjct: 109  WLWTRLRPFCAKFIHEMSEKYELHIVTFGSRQYAHKIAEILEDQTRRQLNLDSNKSFFSH 168

Query: 1020 RVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTY 1078
            R++SR +  DPF       KS +LE +    +S   IIDD   VW ++  N I+V++Y +
Sbjct: 169  RILSRDECVDPF------HKSGNLEHLFPCGDSMCAIIDDRGDVWRYSP-NCILVKKYHF 221

Query: 1079 F 1079
            F
Sbjct: 222  F 222


>gi|170084539|ref|XP_001873493.1| predicted protein [Laccaria bicolor S238N-H82]
 gi|164651045|gb|EDR15285.1| predicted protein [Laccaria bicolor S238N-H82]
          Length = 845

 Score = 75.9 bits (185), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 44/109 (40%), Positives = 60/109 (55%), Gaps = 10/109 (9%)

Query: 972  WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPF 1031
            + K RPG   FL+ AS  +EMH+YTMG + YA ++   +DP G LF GRV+SR + G   
Sbjct: 261  YIKPRPGWKEFLQEASTKYEMHVYTMGTRAYAEQVCAAIDPDGKLFGGRVLSRDESGS-- 318

Query: 1032 DGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
                     K L+ +   + S VVIIDD   VW  +  NL+ V  Y +F
Sbjct: 319  ------LTQKSLQRLFPCDTSMVVIIDDRADVWEWSP-NLLKVVPYDFF 360


>gi|123401628|ref|XP_001301902.1| NLI interacting factor-like phosphatase family protein [Trichomonas
            vaginalis G3]
 gi|121883137|gb|EAX88972.1| NLI interacting factor-like phosphatase family protein [Trichomonas
            vaginalis G3]
          Length = 461

 Score = 75.9 bits (185), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 51/194 (26%), Positives = 98/194 (50%), Gaps = 22/194 (11%)

Query: 900  AIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFH---EVDPVHDEILRKKEEQDR 956
            + ++ + + LEE++++  A+KL LV+DLD TL+++ +     EVD +        ++ D 
Sbjct: 45   SFEEAKRKNLEEEQRLIDAKKLSLVIDLDKTLIDTTEVRNRAEVDAI--------KKLDP 96

Query: 957  EKPHRHLFRFP-HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
                   F F  +  +  + RP +  FL   +  F+M +YT+ +  YA  +   +DP+  
Sbjct: 97   AATEDDFFEFNMNQNLLIRYRPHVRQFLASIAPYFDMQIYTLASPAYAHAILSKIDPEDK 156

Query: 1016 LFAGRVISRGDDG-----DPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVW--PHNK 1067
            LF  R+ SR  +      +       +   K+++ +    +  V+++DDS  VW   +NK
Sbjct: 157  LFKNRIFSRTAEDFAMIKEAMRNQTDIVNKKNIKKIFPYSDKLVLVLDDSPEVWFCDNNK 216

Query: 1068 L--NLIVVERYTYF 1079
            L   L+ ++RY+YF
Sbjct: 217  LFKGLVQIKRYSYF 230


>gi|156087501|ref|XP_001611157.1| protein phosphatase family protein [Babesia bovis]
 gi|154798411|gb|EDO07589.1| protein phosphatase family protein [Babesia bovis]
          Length = 806

 Score = 75.9 bits (185), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 39/109 (35%), Positives = 63/109 (57%), Gaps = 11/109 (10%)

Query: 972  WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPF 1031
            + KLRPG++ FL R+++L+E++L+TMG + +A    K+LDP G  F  RV SR +  + F
Sbjct: 339  YYKLRPGVYDFLRRSAELYELYLFTMGTRAHANAALKILDPDGKYFGARVFSRSETNNCF 398

Query: 1032 DGDERV-PKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
                R+ PK ++          ++I+DDS  +W  +   LI V  Y +F
Sbjct: 399  KSLCRIFPKYRN---------HLLILDDSENIWL-DAPGLIKVYPYYFF 437


>gi|299756470|ref|XP_002912206.1| RNA polymerase II subunit A domain phosphatase [Coprinopsis cinerea
            okayama7#130]
 gi|298411691|gb|EFI28712.1| RNA polymerase II subunit A domain phosphatase [Coprinopsis cinerea
            okayama7#130]
          Length = 801

 Score = 75.9 bits (185), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 44/110 (40%), Positives = 60/110 (54%), Gaps = 10/110 (9%)

Query: 971  MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1030
             + K RPG   FLE A+K +EMH+YTMG + YA E+   +DP G LF  R++SR + G  
Sbjct: 269  YYIKPRPGWKEFLENAAKKYEMHVYTMGTRAYAQEVCAAIDPDGKLFGSRLLSRDESGS- 327

Query: 1031 FDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
                      K L+ +   + S VVIIDD   VW  +  NL+ V  Y +F
Sbjct: 328  -------LTQKSLQRLFPCDTSMVVIIDDRADVWEWSP-NLLKVIPYDFF 369


>gi|324504080|gb|ADY41763.1| RNA polymerase II subunit A C-terminal domain phosphatase [Ascaris
            suum]
          Length = 490

 Score = 75.5 bits (184), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 50/175 (28%), Positives = 88/175 (50%), Gaps = 32/175 (18%)

Query: 911  EQKKMFSARKLCLVLDLDHTLLNSAKF-----HEVDPVHDEILRKKEEQDREKPHRHLFR 965
            +Q+ +  +R+L L++DLD TL+++          VD VH                   ++
Sbjct: 50   DQQLVLESRRLVLLVDLDQTLIHTTNHAFDMKDSVDVVH-------------------YK 90

Query: 966  FPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG 1025
                  +TK+RP   TFL R S+L+EMH+ + G + YA ++A++LDP    F  R++SR 
Sbjct: 91   LRGADFYTKIRPYTHTFLRRMSELYEMHIISYGERQYAHKIAEILDPDKRYFGHRILSR- 149

Query: 1026 DDGDPFDGDERVPKSKDLEGVL-GMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
               + F     + K+ +++ +    +  + IIDD   VW ++   LI V+ Y +F
Sbjct: 150  --DELFSA---MYKTGNMKALFPCGDQLIAIIDDRPDVWQYSDA-LIQVKPYRFF 198


>gi|392597598|gb|EIW86920.1| hypothetical protein CONPUDRAFT_95946 [Coniophora puteana RWD-64-598
            SS2]
          Length = 830

 Score = 75.1 bits (183), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 64/259 (24%), Positives = 104/259 (40%), Gaps = 74/259 (28%)

Query: 886  VEHLFEGYDDQQKAAIQK-----------ERTRRLEEQ--KKMFSARKLCLVLDLDHTL- 931
             +H + G+ +  +A+IQ            E  +R+E +  + +  +RKL L++DLD T+ 
Sbjct: 113  TDHDYTGFSNASRASIQMTHSAFGPTVSLEEAQRIERETAEHLLKSRKLSLIVDLDQTIV 172

Query: 932  -----------LNSAKFHEVDPVHDEILRKKEE--------------------------- 953
                       +N  K  E   +  +  R + +                           
Sbjct: 173  HATVDPTVGEWINEGKQWEQKHIQKQKARDERKDGSDSDGTASSDEDDCNPNWDALKDVK 232

Query: 954  ------------QDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKL 1001
                        Q +++  + L        + K RPG   F +  SK +EMH+YTMG + 
Sbjct: 233  SFRLGPESFVMPQSQKRGKQKLIENDGCLYYVKPRPGWKEFFQELSKKYEMHVYTMGTRA 292

Query: 1002 YATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSV 1060
            YA E+   +DP   +F GR++SR + G            K L+ +   + S VVIIDD  
Sbjct: 293  YAEEVCAAIDPDSKIFGGRILSRDESGS--------LTQKSLQRLFPCDTSMVVIIDDRA 344

Query: 1061 RVWPHNKLNLIVVERYTYF 1079
             VW  +  NLI V  Y +F
Sbjct: 345  DVWEWSP-NLIKVIPYDFF 362


>gi|156381374|ref|XP_001632240.1| predicted protein [Nematostella vectensis]
 gi|156219293|gb|EDO40177.1| predicted protein [Nematostella vectensis]
          Length = 122

 Score = 75.1 bits (183), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 44/112 (39%), Positives = 64/112 (57%), Gaps = 10/112 (8%)

Query: 971  MW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 1028
            MW  TK RP    FL++ +K +E+H++TMG ++YA  +A++LDP   LF  R+ SR D  
Sbjct: 1    MWYHTKFRPWAHKFLQKIAKFYELHIFTMGTRMYAHTIARMLDPDLSLFGYRIRSRDDCF 60

Query: 1029 DPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            + F       K  DL  +    +S V IIDD   VW +N  +LI V+ Y +F
Sbjct: 61   NAFS------KFNDLRSLFPCGDSMVCIIDDRADVW-NNAPSLIKVKPYQFF 105


>gi|115533721|ref|NP_492423.2| Protein FCP-1 [Caenorhabditis elegans]
 gi|82658167|emb|CAC70088.2| Protein FCP-1 [Caenorhabditis elegans]
          Length = 659

 Score = 74.7 bits (182), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 52/172 (30%), Positives = 92/172 (53%), Gaps = 24/172 (13%)

Query: 911  EQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP-HM 969
            ++  + + RKL L++DLD T+++++              K    D E  H+ + ++  H 
Sbjct: 134  DENNLITNRKLVLLVDLDQTIIHTSD-------------KPMTVDTEN-HKDITKYNLHS 179

Query: 970  GMWT-KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 1028
             ++T KLRP    FL + S ++EMH+ T G + YA  +A++LDP   LF  R++SR    
Sbjct: 180  RVYTTKLRPHTTEFLNKMSNMYEMHIVTYGQRQYAHRIAQILDPDARLFEQRILSR---D 236

Query: 1029 DPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            + F       K+ +L+ +    ++ VVIIDD   VW +++  LI ++ Y +F
Sbjct: 237  ELFSAQH---KTNNLKALFPCGDNLVVIIDDRSDVWMYSEA-LIQIKPYRFF 284


>gi|324508774|gb|ADY43701.1| RNA polymerase II subunit A C-terminal domain phosphatase [Ascaris
            suum]
          Length = 576

 Score = 74.7 bits (182), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 50/175 (28%), Positives = 88/175 (50%), Gaps = 32/175 (18%)

Query: 911  EQKKMFSARKLCLVLDLDHTLLNSAKF-----HEVDPVHDEILRKKEEQDREKPHRHLFR 965
            +Q+ +  +R+L L++DLD TL+++          VD VH                   ++
Sbjct: 136  DQQLVLESRRLVLLVDLDQTLIHTTNHAFDMKDSVDVVH-------------------YK 176

Query: 966  FPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG 1025
                  +TK+RP   TFL R S+L+EMH+ + G + YA ++A++LDP    F  R++SR 
Sbjct: 177  LRGADFYTKIRPYTHTFLRRMSELYEMHIISYGERQYAHKIAEILDPDKRYFGHRILSR- 235

Query: 1026 DDGDPFDGDERVPKSKDLEGVL-GMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
               + F     + K+ +++ +    +  + IIDD   VW ++   LI V+ Y +F
Sbjct: 236  --DELFSA---MYKTGNMKALFPCGDQLIAIIDDRPDVWQYSD-ALIQVKPYRFF 284


>gi|170578206|ref|XP_001894313.1| NLI interacting factor-like phosphatase family protein [Brugia
            malayi]
 gi|158599134|gb|EDP36825.1| NLI interacting factor-like phosphatase family protein [Brugia
            malayi]
          Length = 576

 Score = 74.3 bits (181), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 48/166 (28%), Positives = 86/166 (51%), Gaps = 22/166 (13%)

Query: 915  MFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTK 974
            +  A KL L++DLD TL+++   H  +           E D +  H   ++      +TK
Sbjct: 141  LLKAHKLVLLVDLDQTLIHTTN-HTFNL----------ENDTDVLH---YKLKGTDFYTK 186

Query: 975  LRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGD 1034
            +RP    FL R + L+EMH+ + G + YA  +A+ LDP+ + F  R++SR    + F   
Sbjct: 187  IRPHAHEFLRRMASLYEMHIISYGERQYAHRIAEFLDPEKIYFGHRILSR---DELFSA- 242

Query: 1035 ERVPKSKDLEGVL-GMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
              + K+++++ +    +  +V+IDD   VW ++   LI V+ Y +F
Sbjct: 243  --MYKTRNMQALFPCGDHMIVMIDDRPDVWQYSD-ALIQVKPYRFF 285


>gi|390604450|gb|EIN13841.1| hypothetical protein PUNSTDRAFT_95201 [Punctularia strigosozonata
            HHB-11173 SS5]
          Length = 1229

 Score = 74.3 bits (181), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 43/110 (39%), Positives = 60/110 (54%), Gaps = 10/110 (9%)

Query: 971  MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1030
             + K RPG   FL   S+ +EMH+YTMG + YA E+ K +DP+G +F  R++SR + G  
Sbjct: 617  YYIKPRPGWHEFLHTLSEKYEMHVYTMGTRAYAEEVCKAIDPEGQIFGNRILSRDESGS- 675

Query: 1031 FDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
                      K L+ +   + S VVIIDD   VW  +  NLI V  Y +F
Sbjct: 676  -------LTQKSLQRLFPCDTSMVVIIDDRADVWEWSP-NLIKVIPYDFF 717


>gi|389751366|gb|EIM92439.1| hypothetical protein STEHIDRAFT_136328 [Stereum hirsutum FP-91666
            SS1]
          Length = 1075

 Score = 74.3 bits (181), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 42/110 (38%), Positives = 58/110 (52%), Gaps = 10/110 (9%)

Query: 971  MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1030
             + K RPG   FL   ++ +EMH+YTMG + YA E+   +DP G  F GR++SR + G  
Sbjct: 306  YYVKPRPGTREFLSSVAEKYEMHVYTMGTRAYAEEVCAAIDPDGKFFGGRILSRDESGS- 364

Query: 1031 FDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
                      K L  +  ++ S VVIIDD   VW  +  NLI V  Y +F
Sbjct: 365  -------MTQKSLRRLFPVDTSMVVIIDDRADVWEWSP-NLIKVIPYDFF 406


>gi|118784887|ref|XP_314000.3| AGAP005119-PA [Anopheles gambiae str. PEST]
 gi|116128258|gb|EAA09414.3| AGAP005119-PA [Anopheles gambiae str. PEST]
          Length = 822

 Score = 73.9 bits (180), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 52/170 (30%), Positives = 87/170 (51%), Gaps = 20/170 (11%)

Query: 911  EQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMG 970
            + +++ S RKL L++DLD TL+++      D V + +        ++  H  L+      
Sbjct: 137  DTERLLSDRKLVLLVDLDQTLIHTTN----DNVPNNL--------KDVYHFQLYGPNSPW 184

Query: 971  MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1030
              T+LRPG   FL +    +E+H+ T G + YA  +A+ LD  G  F+ R++SR    + 
Sbjct: 185  YHTRLRPGALEFLAKMHPYYELHICTFGARNYAHMIAQFLDKDGNFFSHRILSR---DEC 241

Query: 1031 FDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            F+      K+ +L+ +    +S V IIDD   VW +   NLI V+ Y +F
Sbjct: 242  FNA---TSKTDNLKALFPCGDSMVCIIDDREDVW-NMASNLIQVKPYHFF 287


>gi|170036997|ref|XP_001846347.1| RNA polymerase II subunit A C-terminal domain phosphatase [Culex
            quinquefasciatus]
 gi|167879975|gb|EDS43358.1| RNA polymerase II subunit A C-terminal domain phosphatase [Culex
            quinquefasciatus]
          Length = 764

 Score = 73.9 bits (180), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 52/170 (30%), Positives = 87/170 (51%), Gaps = 20/170 (11%)

Query: 911  EQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMG 970
            + +++   RKL L++DLD TL+++      D V + +        ++  H  L+      
Sbjct: 135  DTERLLRDRKLVLLVDLDQTLIHTTN----DNVPNNL--------KDVYHFQLYGPNSPW 182

Query: 971  MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1030
              T+LRPG   FL +    +E+H+ T G + YA  +A+ LD KG  F+ R++SR    + 
Sbjct: 183  YHTRLRPGALQFLAKMDPFYELHICTFGARNYAHMIAQFLDEKGRYFSHRILSR---DEC 239

Query: 1031 FDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            F+      K+ +L+ +    +S V IIDD   VW +   NLI V+ Y +F
Sbjct: 240  FNA---TSKTDNLKALFPCGDSMVCIIDDREDVW-NMAANLIQVKPYHFF 285


>gi|297792863|ref|XP_002864316.1| hypothetical protein ARALYDRAFT_918545 [Arabidopsis lyrata subsp.
            lyrata]
 gi|297310151|gb|EFH40575.1| hypothetical protein ARALYDRAFT_918545 [Arabidopsis lyrata subsp.
            lyrata]
          Length = 142

 Score = 73.9 bits (180), Expect = 5e-10,   Method: Composition-based stats.
 Identities = 43/122 (35%), Positives = 66/122 (54%), Gaps = 10/122 (8%)

Query: 997  MGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVII 1056
            MG++ YA  + K++DP+ V F  RVI+R +           P  K L+ VL  E  VVI+
Sbjct: 1    MGDRDYAKNVLKLIDPEKVYFGDRVITRNES----------PYIKTLDLVLADECGVVIV 50

Query: 1057 DDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVRQQ 1116
            DD+ +VWP +K NL+ + +Y YF    R+      S  E   DE   DG+L + L V ++
Sbjct: 51   DDTAQVWPDHKRNLLEITKYNYFSDKTRRDVKYSKSYAEEKRDEGRNDGSLGNVLKVIKE 110

Query: 1117 LH 1118
            ++
Sbjct: 111  VY 112


>gi|449551315|gb|EMD42279.1| hypothetical protein CERSUDRAFT_148004 [Ceriporiopsis subvermispora
            B]
          Length = 875

 Score = 73.2 bits (178), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 41/110 (37%), Positives = 59/110 (53%), Gaps = 10/110 (9%)

Query: 971  MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1030
             + K RPG   FL+  +  +EMH+YTMG + YA E+   +DP G +F GR++SR + G  
Sbjct: 263  YYIKPRPGWQDFLQDMATKYEMHVYTMGTRAYAEEVCATIDPDGKIFGGRLLSRDESGS- 321

Query: 1031 FDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
                      K L+ +    +S VVIIDD   VW  +  NL+ V  Y +F
Sbjct: 322  -------LTQKSLQRLFPCDQSMVVIIDDRADVWEWSP-NLVKVIPYDFF 363


>gi|221486680|gb|EEE24941.1| RNA polymerase II phosphatase, putative [Toxoplasma gondii GT1]
          Length = 1234

 Score = 73.2 bits (178), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 39/107 (36%), Positives = 57/107 (53%), Gaps = 11/107 (10%)

Query: 974  KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDG 1033
            KLRPG   FL R S+ FE+++YTMG  L+A    ++LDPK   F  RV SR D  +    
Sbjct: 632  KLRPGCLDFLRRVSQTFELYMYTMGTALHAATALRILDPKRRFFGRRVFSRQDAVNGLKA 691

Query: 1034 DERV-PKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
             ER+ P  + +         V+++DD   +W ++    I V+ Y YF
Sbjct: 692  IERIFPHDQKM---------VLVVDDLECMWSYSPC-CIKVQGYHYF 728


>gi|312373985|gb|EFR21645.1| hypothetical protein AND_16677 [Anopheles darlingi]
          Length = 857

 Score = 73.2 bits (178), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 51/168 (30%), Positives = 86/168 (51%), Gaps = 20/168 (11%)

Query: 913  KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMW 972
            +++ + RKL L++DLD TL+++      D V + +        ++  H  L+        
Sbjct: 150  ERLLNDRKLVLLVDLDQTLIHTTN----DNVPNNL--------KDVYHFQLYGPNSPWYH 197

Query: 973  TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFD 1032
            T+LRPG   FL +    +E+H+ T G + YA  +A+ LD  G  F+ R++SR    + F+
Sbjct: 198  TRLRPGALEFLAKMHPYYELHICTFGARNYAHMIAQFLDKDGRFFSHRILSR---DECFN 254

Query: 1033 GDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
                  K+ +L+ +    +S V IIDD   VW +   NLI V+ Y +F
Sbjct: 255  A---TSKTDNLKALFPCGDSMVCIIDDREDVW-NMASNLIQVKPYHFF 298


>gi|221508436|gb|EEE34023.1| RNA polymerase II phosphatase, putative [Toxoplasma gondii VEG]
          Length = 1228

 Score = 73.2 bits (178), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 39/107 (36%), Positives = 57/107 (53%), Gaps = 11/107 (10%)

Query: 974  KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDG 1033
            KLRPG   FL R S+ FE+++YTMG  L+A    ++LDPK   F  RV SR D  +    
Sbjct: 626  KLRPGCLDFLRRVSQTFELYMYTMGTALHAATALRILDPKRRFFGRRVFSRQDAVNGLKA 685

Query: 1034 DERV-PKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
             ER+ P  + +         V+++DD   +W ++    I V+ Y YF
Sbjct: 686  IERIFPHDQKM---------VLVVDDLECMWSYSPC-CIKVQGYHYF 722


>gi|401827003|ref|XP_003887594.1| TFIIF-interacting CTD phosphatase [Encephalitozoon hellem ATCC 50504]
 gi|392998600|gb|AFM98613.1| TFIIF-interacting CTD phosphatase [Encephalitozoon hellem ATCC 50504]
          Length = 408

 Score = 73.2 bits (178), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 58/195 (29%), Positives = 92/195 (47%), Gaps = 53/195 (27%)

Query: 896  QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 955
            ++  +I KE+ + LE Q K      L LVLDLD T+L++  +   D              
Sbjct: 42   KEAVSIYKEKVKALEMQMK------LILVLDLDQTVLHTT-YGTSDC------------- 81

Query: 956  REKPHRHLFRFPHMGMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
                 + + +F   G     KLRP +   L R SKL+E+H+YTMG + YA  +  ++DP 
Sbjct: 82   -----KGIVKFTMDGCKYSVKLRPHLNRMLRRVSKLYEIHVYTMGTRPYAERIIGIIDPA 136

Query: 1014 GVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA---------VVIIDDSVRVWP 1064
            G  F  R+I+R ++                +GVL    +         +VI+DD   VW 
Sbjct: 137  GKYFHDRIITRDEN----------------QGVLVKRLSRLFPYNHKNIVILDDRADVWD 180

Query: 1065 HNKLNLIVVERYTYF 1079
            +N+ NL++V+ + YF
Sbjct: 181  YNE-NLVLVKPFWYF 194


>gi|303389951|ref|XP_003073207.1| Fcp1-like phosphatase [Encephalitozoon intestinalis ATCC 50506]
 gi|303302352|gb|ADM11847.1| Fcp1-like phosphatase [Encephalitozoon intestinalis ATCC 50506]
          Length = 407

 Score = 72.8 bits (177), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 56/195 (28%), Positives = 91/195 (46%), Gaps = 53/195 (27%)

Query: 896  QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 955
            ++  +I KE+ + LE Q K+       LVLDLD T+L++   +    +H  +        
Sbjct: 42   KEAVSIYKEKMKTLETQMKLI------LVLDLDQTILHTT--YGESRIHGTV-------- 85

Query: 956  REKPHRHLFRFPHMG--MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
                     RF   G     KLRP +   L + S+L+E+H+YTMG + YA  +  ++DP 
Sbjct: 86   ---------RFIMDGSKYCVKLRPNLDHMLRKISRLYEIHVYTMGTRAYAERIVGIVDPS 136

Query: 1014 GVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA---------VVIIDDSVRVWP 1064
            G  F  R+I+R ++                EGVL    +         +VI+DD   VW 
Sbjct: 137  GKYFQDRIITRDEN----------------EGVLVKRLSRLFPHNHKNIVILDDRPDVWD 180

Query: 1065 HNKLNLIVVERYTYF 1079
            +++ NL++V  + YF
Sbjct: 181  YSE-NLLLVRPFWYF 194


>gi|393218252|gb|EJD03740.1| hypothetical protein FOMMEDRAFT_105888 [Fomitiporia mediterranea
            MF3/22]
          Length = 921

 Score = 72.8 bits (177), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 42/110 (38%), Positives = 57/110 (51%), Gaps = 10/110 (9%)

Query: 971  MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1030
             + K RPG   FL   +  +EMH+YTMG + YA ++   +DP G LF GR++SR + G  
Sbjct: 272  YYVKPRPGWKEFLSSVASRYEMHVYTMGTRAYAEKVCAAIDPDGRLFGGRILSRDESGS- 330

Query: 1031 FDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
                      K L  +   + S VVIIDD   VW  +  NLI V  Y +F
Sbjct: 331  -------LTQKSLRRLFPCDTSMVVIIDDRADVWEWSP-NLIKVIPYDFF 372


>gi|237834315|ref|XP_002366455.1| NLI interacting factor-like phosphatase domain-containing protein
            [Toxoplasma gondii ME49]
 gi|211964119|gb|EEA99314.1| NLI interacting factor-like phosphatase domain-containing protein
            [Toxoplasma gondii ME49]
          Length = 1225

 Score = 72.8 bits (177), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 39/107 (36%), Positives = 57/107 (53%), Gaps = 11/107 (10%)

Query: 974  KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDG 1033
            KLRPG   FL R S+ FE+++YTMG  L+A    ++LDPK   F  RV SR D  +    
Sbjct: 623  KLRPGCLDFLRRVSQTFELYMYTMGTALHAATALRILDPKRRFFGRRVFSRQDAVNGLKA 682

Query: 1034 DERV-PKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
             ER+ P  + +         V+++DD   +W ++    I V+ Y YF
Sbjct: 683  IERIFPHDQKM---------VLVVDDLECMWRYSPC-CIKVQGYHYF 719


>gi|294868642|ref|XP_002765622.1| hypothetical protein Pmar_PMAR013688 [Perkinsus marinus ATCC 50983]
 gi|239865701|gb|EEQ98339.1| hypothetical protein Pmar_PMAR013688 [Perkinsus marinus ATCC 50983]
          Length = 956

 Score = 72.8 bits (177), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 57/186 (30%), Positives = 84/186 (45%), Gaps = 22/186 (11%)

Query: 913  KKMFSARKLCLVLDLDHTLLNSAK------FHEVDPVHDEILRKKEEQDREKPHRHLFRF 966
            + + S+++L  VLD+DHT+L+         F +V   +    R     D EK ++     
Sbjct: 488  RTLASSKRLVAVLDIDHTILHVTNKRIDLLFPDVTCYNLAPNRDTGRLDEEKVYQFFIGT 547

Query: 967  ---PHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAG--RV 1021
                    + KLRPG +TFLE    L+E++LYT G + YA  + K LDP    F    R+
Sbjct: 548  SPTTTACCYLKLRPGFYTFLEEILPLYELYLYTHGTREYAIRLLKALDPSARYFGSPPRL 607

Query: 1022 ISRGDDGDPFDGDERVPKSKDLEGVL-GMESAVVIIDDSVRVW--PHNKLNLIVVERYTY 1078
            I+R          +     K L  +        VI+DD   VW    N+ +LI V  Y +
Sbjct: 608  IAR--------PTQSALTCKTLSRIFPSNHRLAVIVDDRDDVWEAKDNEHSLIKVTPYVF 659

Query: 1079 FPCSRR 1084
            FP S R
Sbjct: 660  FPDSER 665


>gi|297830092|ref|XP_002882928.1| hypothetical protein ARALYDRAFT_897808 [Arabidopsis lyrata subsp.
            lyrata]
 gi|297328768|gb|EFH59187.1| hypothetical protein ARALYDRAFT_897808 [Arabidopsis lyrata subsp.
            lyrata]
          Length = 295

 Score = 72.8 bits (177), Expect = 1e-09,   Method: Composition-based stats.
 Identities = 67/236 (28%), Positives = 101/236 (42%), Gaps = 47/236 (19%)

Query: 913  KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRH----LFRFPH 968
            K     +KL LVL+L  T  +S  F          L  KE+  + K +        R   
Sbjct: 53   KNSLEKKKLHLVLNLYGTFFDSQAF--------PCLSNKEKYLKGKVNSRNDLWQTRIRG 104

Query: 969  MGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 1028
              +  KLRP +  FL  A+KLF +H+ T+    YA  + K+LDP  + F  R+IS     
Sbjct: 105  HDVLIKLRPFVHEFLREANKLFILHVTTLCIPEYADFVLKLLDPHQLYFGNRIISL---- 160

Query: 1029 DPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVW-PHNKLNLIVVERYTYFPCSRRQFG 1087
                  + V   K L+ VL  E  V+I+DD   VW P N+ NL+ +  Y+YF  ++++  
Sbjct: 161  -----SKHVIWEKTLDQVLVGEREVIILDDRYDVWSPENRSNLLQITTYSYFKATKKRNS 215

Query: 1088 LLG-------------------------PSLLEIDHDERSEDGTLASSLGVRQQLH 1118
            + G                          S  E   DE  +DG LA++L    ++H
Sbjct: 216  IDGGMFQNLFKYFLKIFSRDDDNLLSDSNSYSEERKDESVDDGALANALRFLFKIH 271


>gi|401409326|ref|XP_003884111.1| hypothetical protein NCLIV_045130 [Neospora caninum Liverpool]
 gi|325118529|emb|CBZ54080.1| hypothetical protein NCLIV_045130 [Neospora caninum Liverpool]
          Length = 1185

 Score = 72.4 bits (176), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 39/107 (36%), Positives = 56/107 (52%), Gaps = 11/107 (10%)

Query: 974  KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDG 1033
            KLRPG   FL R S+ FE+++YTMG  L+A    ++LDP    F  RV SR D  +    
Sbjct: 649  KLRPGCLDFLRRVSQTFELYMYTMGTALHAATALRILDPGRRFFGRRVFSRQDAVNGLKA 708

Query: 1034 DERV-PKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
             ER+ P  + +         V+++DD   +W +N    I V+ Y YF
Sbjct: 709  IERIFPHDRKM---------VLVVDDLDCMWSYNPC-CIKVQGYHYF 745


>gi|393909596|gb|EFO27947.2| hypothetical protein LOAG_00540 [Loa loa]
          Length = 506

 Score = 72.4 bits (176), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 48/170 (28%), Positives = 90/170 (52%), Gaps = 22/170 (12%)

Query: 911  EQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMG 970
            +++ +  A KL L++DLD TL+++   H            K ++D +  H   ++     
Sbjct: 67   DRELLLKAHKLVLLVDLDQTLIHTTN-HTF----------KVDKDTDVLH---YKLKGTD 112

Query: 971  MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1030
             +TK+RP    FL R ++L+EMH+ + G + YA  +A+ LDP  + F  R++SR    + 
Sbjct: 113  FYTKIRPYAREFLRRMAELYEMHIISYGERQYAHRIAEFLDPDKIYFGHRILSR---DEL 169

Query: 1031 FDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            F     + K+++++ +    +  +V+IDD   VW ++   LI V+ Y +F
Sbjct: 170  FCA---MYKTRNMQALFPCGDHMIVMIDDRPDVWQYSD-ALIQVKPYRFF 215


>gi|440493707|gb|ELQ76143.1| TFIIF-interacting CTD phosphatase, including NLI-interacting factor
            [Trachipleistophora hominis]
          Length = 466

 Score = 72.4 bits (176), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 42/128 (32%), Positives = 63/128 (49%), Gaps = 12/128 (9%)

Query: 950  KKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKV 1009
            ++   D   P    +      M T LRP +  FL  ASKLF MH+YTMG   Y  ++  V
Sbjct: 161  QRTSTDNSFPSSFTYTLSSTTMHTTLRPHLHQFLTEASKLFHMHIYTMGTAEYVHQITNV 220

Query: 1010 LDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKL 1068
            +D  G+ F  R+++R D+           + K LE + G +   VVI+DD   VW +   
Sbjct: 221  IDKDGMFFGDRIVTRDDE----------MQVKRLERLFGDKVDMVVIVDDRGDVWEYCG- 269

Query: 1069 NLIVVERY 1076
            NL++V  +
Sbjct: 270  NLVMVRPF 277


>gi|312066139|ref|XP_003136128.1| hypothetical protein LOAG_00540 [Loa loa]
          Length = 577

 Score = 72.4 bits (176), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 50/182 (27%), Positives = 95/182 (52%), Gaps = 24/182 (13%)

Query: 901  IQKERTRRL--EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREK 958
            +  E  R++   +++ +  A KL L++DLD TL+++   H            K ++D + 
Sbjct: 126  VSDELARKIGSRDRELLLKAHKLVLLVDLDQTLIHTTN-HTF----------KVDKDTDV 174

Query: 959  PHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA 1018
             H   ++      +TK+RP    FL R ++L+EMH+ + G + YA  +A+ LDP  + F 
Sbjct: 175  LH---YKLKGTDFYTKIRPYAREFLRRMAELYEMHIISYGERQYAHRIAEFLDPDKIYFG 231

Query: 1019 GRVISRGDDGDPFDGDERVPKSKDLEGVL-GMESAVVIIDDSVRVWPHNKLNLIVVERYT 1077
             R++SR    + F     + K+++++ +    +  +V+IDD   VW ++   LI V+ Y 
Sbjct: 232  HRILSR---DELFCA---MYKTRNMQALFPCGDHMIVMIDDRPDVWQYSD-ALIQVKPYR 284

Query: 1078 YF 1079
            +F
Sbjct: 285  FF 286


>gi|402467220|gb|EJW02558.1| FCP1-like phosphatase, phosphatase domain-containing protein
            [Edhazardia aedis USNM 41457]
          Length = 905

 Score = 72.4 bits (176), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 49/166 (29%), Positives = 77/166 (46%), Gaps = 28/166 (16%)

Query: 934  SAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKL---F 990
            S KF + D V D        ++  K   + F       +  LRP    FLE+   L   +
Sbjct: 209  SDKFTKNDIVTD--------KNENKTKIYTFMLNKHKYYIALRP----FLEKLLSLDEKY 256

Query: 991  EMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGME 1050
            EMH+YTMGN  YA ++ K++DP G +F  R+I+R ++             K L+      
Sbjct: 257  EMHIYTMGNNQYAQKVKKIIDPTGTIFGNRIITRDENNQEL--------FKSLDRFSTNH 308

Query: 1051 SAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEI 1096
              +V+IDD + VW    +N++ V  + +F    R   +  PS+L I
Sbjct: 309  DNIVVIDDRIDVWNF-SVNVVGVRPFWFF----RDGDINDPSVLRI 349


>gi|392570766|gb|EIW63938.1| hypothetical protein TRAVEDRAFT_111329 [Trametes versicolor FP-101664
            SS1]
          Length = 900

 Score = 72.4 bits (176), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 41/110 (37%), Positives = 59/110 (53%), Gaps = 10/110 (9%)

Query: 971  MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1030
             + K RPG+  FLE  +  +EMH+YTMG + YA E+   +DP G +F  R++SR + G  
Sbjct: 262  YYIKPRPGLPEFLETMATKYEMHVYTMGTRAYAEEVCAAIDPGGKIFGNRILSRDESGS- 320

Query: 1031 FDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
                      K L+ +    +S VVIIDD   VW  +  NL+ V  Y +F
Sbjct: 321  -------LTQKSLQRLFPCDQSMVVIIDDRADVWEWSP-NLVKVIPYDFF 362


>gi|294935258|ref|XP_002781353.1| hypothetical protein Pmar_PMAR020737 [Perkinsus marinus ATCC 50983]
 gi|239891934|gb|EER13148.1| hypothetical protein Pmar_PMAR020737 [Perkinsus marinus ATCC 50983]
          Length = 979

 Score = 72.0 bits (175), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 56/186 (30%), Positives = 84/186 (45%), Gaps = 22/186 (11%)

Query: 913  KKMFSARKLCLVLDLDHTLLNSAK------FHEVDPVHDEILRKKEEQDREKPHRHLFRF 966
            + + ++++L  VLD+DHT+L+         F +V   +    R     D EK ++     
Sbjct: 511  RTLAASKRLVAVLDIDHTILHVTNKRIDLLFPDVTCYNLAPNRDTGRLDEEKVYQFFIGT 570

Query: 967  ---PHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAG--RV 1021
                    + KLRPG +TFLE    L+E++LYT G + YA  + K LDP    F    R+
Sbjct: 571  SPTTTACCYLKLRPGFYTFLEEILPLYELYLYTHGTREYAIRLLKALDPSARYFGSPPRL 630

Query: 1022 ISRGDDGDPFDGDERVPKSKDLEGVL-GMESAVVIIDDSVRVW--PHNKLNLIVVERYTY 1078
            I+R          +     K L  +        VI+DD   VW    N+ +LI V  Y +
Sbjct: 631  IAR--------PTQSALTCKTLSRIFPSNHRLAVIVDDRDDVWEAKDNEHSLIKVTPYVF 682

Query: 1079 FPCSRR 1084
            FP S R
Sbjct: 683  FPDSER 688


>gi|395334832|gb|EJF67208.1| hypothetical protein DICSQDRAFT_142769 [Dichomitus squalens LYAD-421
            SS1]
          Length = 953

 Score = 72.0 bits (175), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 40/109 (36%), Positives = 59/109 (54%), Gaps = 10/109 (9%)

Query: 972  WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPF 1031
            + K RPG+  FL+  +  +EMH+YTMG + YA E+   +DP G +F  R++SR + G   
Sbjct: 287  YIKPRPGLLDFLQTMATKYEMHVYTMGTRAYAEEVCAAIDPGGKIFGNRILSRDESGS-- 344

Query: 1032 DGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
                     K L+ +    +S VVIIDD   VW  +  NL+ V  Y +F
Sbjct: 345  ------LTQKSLQRLFPCDQSMVVIIDDRADVWEWSP-NLVKVIPYDFF 386


>gi|388858248|emb|CCF48177.1| related to FCP1-TFIIF interacting component of CTD phosphatase
            [Ustilago hordei]
          Length = 774

 Score = 72.0 bits (175), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 57/209 (27%), Positives = 95/209 (45%), Gaps = 43/209 (20%)

Query: 901  IQKERTRRL--EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREK 958
            +  E  +RL  E    + S RKL L++DLD T++++     VDP   E ++ +   + E 
Sbjct: 47   VSAEEAQRLDSETTSHLLSQRKLALIVDLDQTVIHAT----VDPTVGEWMKDESNPNYEA 102

Query: 959  PHR-HLFRFPHMG--------------------------MWTKLRPGIWTFLERASKLFE 991
                  FR    G                           + K RPG+   +++ S+ ++
Sbjct: 103  LKSVGKFRLGIDGEEIKDDDDDSAPKDSAAALKASRACWYYVKPRPGVPEIVKKLSEKYQ 162

Query: 992  MHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGME- 1050
            +H+YTMG + YA  + K++DP   +F  R++SR ++G            K L  +  ++ 
Sbjct: 163  LHVYTMGTRSYANCVCKLIDPDASIFGNRILSRDENGSLV--------RKSLNRLFPVDH 214

Query: 1051 SAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            S VVIIDD   VW  +  NL+ V  Y +F
Sbjct: 215  SMVVIIDDREDVWSRSP-NLLPVVPYEFF 242


>gi|123490666|ref|XP_001325656.1| NLI interacting factor-like phosphatase family protein [Trichomonas
            vaginalis G3]
 gi|121908559|gb|EAY13433.1| NLI interacting factor-like phosphatase family protein [Trichomonas
            vaginalis G3]
          Length = 474

 Score = 72.0 bits (175), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 53/195 (27%), Positives = 95/195 (48%), Gaps = 23/195 (11%)

Query: 900  AIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKP 959
            + ++ R R L+E++++  A+KL LV+DLD TL+++ +  +    H E+    E   +  P
Sbjct: 45   SFEEARNRNLQEEQRLIDAKKLSLVIDLDKTLIDTTEVRD----HSEV----EAIKKLDP 96

Query: 960  HR---HLFRFP-HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
            H      F F  +  +  + RP +  FL   +  F++ +YT+    YA  +   +DP   
Sbjct: 97   HATEDDFFEFNMNQNLLIRYRPHVREFLASIAPYFDLQIYTLALPSYAHAILSKIDPDDK 156

Query: 1016 LFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-------ESAVVIIDDSVRVW--PHN 1066
            LF  R+ SR  +      +E +    D+     +       +  V+++DDS  VW    N
Sbjct: 157  LFKNRIFSRTAEDFAMLREEAMRNRTDIVHKKNIKKLFPYSDKLVLVLDDSPEVWYCDDN 216

Query: 1067 KL--NLIVVERYTYF 1079
            KL   L+ ++RY+YF
Sbjct: 217  KLFKGLVQIKRYSYF 231


>gi|19074511|ref|NP_586017.1| similarity to HYPOTHETICAL TRANSMEMBRANE PROTEINS YHG4_yeast
            [Encephalitozoon cuniculi GB-M1]
 gi|51701436|sp|Q8SV03.1|FCP1_ENCCU RecName: Full=RNA polymerase II subunit A C-terminal domain
            phosphatase; AltName: Full=CTD phosphatase FCP1
 gi|19069153|emb|CAD25621.1| similarity to HYPOTHETICAL TRANSMEMBRANE PROTEINS YHG4_yeast
            [Encephalitozoon cuniculi GB-M1]
 gi|449329538|gb|AGE95809.1| hypothetical protein ECU07_0890 [Encephalitozoon cuniculi]
          Length = 411

 Score = 72.0 bits (175), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 54/193 (27%), Positives = 87/193 (45%), Gaps = 49/193 (25%)

Query: 896  QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 955
            ++  AI KE+   LE Q K+       LVLDLD T+L++   +    +   +        
Sbjct: 42   EEAVAIHKEKMEALEMQMKLI------LVLDLDQTVLHTT--YGTSSLEGTVK------- 86

Query: 956  REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
                    F         KLRP +   L R SKL+E+H+YTMG + YA  + +++DP G 
Sbjct: 87   --------FVIDRCRYCVKLRPNLDYMLRRISKLYEIHVYTMGTRAYAERIVEIIDPSGK 138

Query: 1016 LFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA---------VVIIDDSVRVWPHN 1066
             F  R+I+R ++                +GVL    +         +VI+DD   VW + 
Sbjct: 139  YFDDRIITRDEN----------------QGVLVKRLSRLFPHDHRNIVILDDRPDVWDYC 182

Query: 1067 KLNLIVVERYTYF 1079
            + NL+++  + YF
Sbjct: 183  E-NLVLIRPFWYF 194


>gi|409051930|gb|EKM61406.1| hypothetical protein PHACADRAFT_204575 [Phanerochaete carnosa
            HHB-10118-sp]
          Length = 863

 Score = 72.0 bits (175), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 40/110 (36%), Positives = 60/110 (54%), Gaps = 10/110 (9%)

Query: 971  MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1030
             + K RPG   FLE  ++ +EMH+YTMG + YA E+   +DP G +F GR++SR + G  
Sbjct: 257  YYIKPRPGWNEFLEDMAEKYEMHVYTMGTRAYAEEVCAAIDPDGKIFGGRLLSRDESGS- 315

Query: 1031 FDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
                      K L+ +    +S VV+IDD   VW  +  NL+ V  + +F
Sbjct: 316  -------LTQKSLQRLFPCDQSMVVVIDDRADVWEWSP-NLVKVIPFEFF 357


>gi|393225696|gb|EJD33619.1| HAD-like protein, partial [Auricularia delicata TFB-10046 SS5]
          Length = 155

 Score = 71.2 bits (173), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 52/171 (30%), Positives = 86/171 (50%), Gaps = 25/171 (14%)

Query: 911  EQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQ-DREKPHRHLFRFPHM 969
            E++++   RKL LV+DLD+T+           VH  ++R  +E+  R + H H       
Sbjct: 3    ERERLLGCRKLSLVVDLDNTI-----------VHTIVVRTDDERMARMQDHNH----GST 47

Query: 970  GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD-DG 1028
                  RPG+  FL+  S+ +E  +YTMG + YA ++   +D    +F GR+ SR + +G
Sbjct: 48   TFTGSCRPGLRAFLQTISEKYEPTVYTMGTRGYAEKVCAAVDGDERVFGGRIFSRDENEG 107

Query: 1029 DPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            +      R+    D       +S   IIDDS +VW  +K N++ V+ Y +F
Sbjct: 108  NSTKSLSRLFPPCD-------KSMTAIIDDSRKVW-EDKKNIVSVQPYVFF 150


>gi|326437795|gb|EGD83365.1| hypothetical protein PTSG_03974 [Salpingoeca sp. ATCC 50818]
          Length = 864

 Score = 71.2 bits (173), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 36/113 (31%), Positives = 61/113 (53%), Gaps = 14/113 (12%)

Query: 971  MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD---- 1026
             +TK+RPG+  FLE    ++E+H+YTMG + YA E+  ++DP    F+ R++++ +    
Sbjct: 31   YYTKIRPGVKEFLEAVKDMYELHVYTMGTRAYAKEICNIIDPGAHYFSTRILTQDESARI 90

Query: 1027 DGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            D    + +   P+  D+         VVI+DD+  +W   + NLI    Y YF
Sbjct: 91   DTKSINLNHLFPRGDDM---------VVILDDTAAMWDF-RPNLIPAAPYDYF 133


>gi|157109625|ref|XP_001650754.1| RNA polymerase ii ctd phosphatase [Aedes aegypti]
 gi|108868428|gb|EAT32653.1| AAEL015142-PA, partial [Aedes aegypti]
          Length = 569

 Score = 70.9 bits (172), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 51/170 (30%), Positives = 86/170 (50%), Gaps = 20/170 (11%)

Query: 911  EQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMG 970
            + +++   +KL L++DLD TL+++      D V + +        ++  H  L+      
Sbjct: 136  DTERLLRDKKLVLLVDLDQTLIHTTN----DNVPNNL--------KDVYHFQLYGSNSPW 183

Query: 971  MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1030
              T+LRPG   FL +    +E+H+ T G + YA  +A+ LD  G LF+ R++SR    + 
Sbjct: 184  YHTRLRPGALEFLAKMHPYYELHICTFGARNYAHMIAQFLDRDGKLFSHRILSR---DEC 240

Query: 1031 FDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            F+      K+ +L  +    +S V IIDD   VW +   NLI V+ Y +F
Sbjct: 241  FNA---TSKTDNLRALFPCGDSMVCIIDDREDVW-NMAANLIQVKPYHFF 286


>gi|321460734|gb|EFX71774.1| hypothetical protein DAPPUDRAFT_308742 [Daphnia pulex]
          Length = 798

 Score = 70.9 bits (172), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 55/175 (31%), Positives = 85/175 (48%), Gaps = 30/175 (17%)

Query: 911  EQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMG 970
            +++++   RKL L++DLD TL+++         +DEI    E+         +F F   G
Sbjct: 148  DEERLLKDRKLVLLVDLDQTLIHTT--------NDEIPANIED---------VFHFQLHG 190

Query: 971  -----MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG 1025
                   T+LRP     L   S L+E+H+ T G++ YA  +A  LD KG  F+ R++SR 
Sbjct: 191  PNSPWYHTRLRPFTKELLCSMSSLYELHICTFGSRTYAHMIANFLDEKGRYFSHRILSR- 249

Query: 1026 DDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
               + F       K+ +L+ +    +  VVIIDD   VW     NLI V  Y +F
Sbjct: 250  --DECFSAH---SKTANLKALFPCGDQMVVIIDDREDVWNFAP-NLIHVRPYHFF 298


>gi|255540897|ref|XP_002511513.1| conserved hypothetical protein [Ricinus communis]
 gi|223550628|gb|EEF52115.1| conserved hypothetical protein [Ricinus communis]
          Length = 161

 Score = 70.9 bits (172), Expect = 4e-09,   Method: Composition-based stats.
 Identities = 46/130 (35%), Positives = 70/130 (53%), Gaps = 10/130 (7%)

Query: 989  LFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLG 1048
            +FEM++YT  +++ A +M   LDP    F  R+I R       +G   V   K+ + VLG
Sbjct: 1    MFEMYVYTSSSQVNARKMMSFLDPANRYFNSRLIVR-------EGST-VMALKNPDVVLG 52

Query: 1049 MESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLA 1108
             E AVVI+DD    WP +K N+I VE+Y YF  ++   G    SL E   DE +    +A
Sbjct: 53   HERAVVILDDRKSAWPMHKANVINVEKYNYFASNQSDPGSKSKSLAERKKDEHTR--VMA 110

Query: 1109 SSLGVRQQLH 1118
            + L + +++H
Sbjct: 111  AYLRILRKIH 120


>gi|402584910|gb|EJW78851.1| hypothetical protein WUBG_10241, partial [Wuchereria bancrofti]
          Length = 278

 Score = 70.5 bits (171), Expect = 5e-09,   Method: Composition-based stats.
 Identities = 48/172 (27%), Positives = 89/172 (51%), Gaps = 29/172 (16%)

Query: 901  IQKERTRRL--EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREK 958
            +  E  R++   +++ +  ARKL L++DLD TL+++   H            K E+D + 
Sbjct: 125  VSDELARKIGSRDRELLLKARKLVLLVDLDQTLIHTTN-HTF----------KLEKDTDV 173

Query: 959  PHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA 1018
             H   ++      +TK+RP    FL R + L+EMH+ + G + YA  +A+ LDP+ + F 
Sbjct: 174  LH---YKLKGTDFYTKIRPHAREFLRRMAGLYEMHIISYGERQYAHRIAEFLDPEKIYFG 230

Query: 1019 GRVISRGDDGDPFDGDE---RVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHN 1066
             R++SR         DE    + K+++++ +    +  +V+IDD   VW ++
Sbjct: 231  HRILSR---------DELFCAMYKTRNMQALFPCGDHMIVMIDDRPDVWQYS 273


>gi|401886990|gb|EJT50998.1| protein phosphatase [Trichosporon asahii var. asahii CBS 2479]
          Length = 922

 Score = 70.1 bits (170), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 50/150 (33%), Positives = 70/150 (46%), Gaps = 33/150 (22%)

Query: 931  LLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLF 990
            L + AKF   D V   + R+ + +            P    +TK RPG+  FLE  SKL+
Sbjct: 279  LKDVAKFQLADDVPPGVSRRHQPE------------PVRWYYTKPRPGLNKFLEDMSKLY 326

Query: 991  EMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM- 1049
            EMH+YTMG + YA  + K++DP+G  FA                     +K L  +    
Sbjct: 327  EMHVYTMGTRSYADAICKIVDPEGKYFAM-------------------SAKSLVRLFPHD 367

Query: 1050 ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            +S VVIIDD   VW  +  NL+ V  Y +F
Sbjct: 368  QSMVVIIDDRSDVW-GDSPNLVKVVPYDFF 396


>gi|406695220|gb|EKC98531.1| protein phosphatase [Trichosporon asahii var. asahii CBS 8904]
          Length = 917

 Score = 70.1 bits (170), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 50/150 (33%), Positives = 70/150 (46%), Gaps = 33/150 (22%)

Query: 931  LLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLF 990
            L + AKF   D V   + R+ + +            P    +TK RPG+  FLE  SKL+
Sbjct: 279  LKDVAKFQLADDVPPGVSRRHQPE------------PVRWYYTKPRPGLNKFLEDMSKLY 326

Query: 991  EMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM- 1049
            EMH+YTMG + YA  + K++DP+G  FA                     +K L  +    
Sbjct: 327  EMHVYTMGTRSYADAICKIVDPEGKYFAM-------------------SAKSLVRLFPHD 367

Query: 1050 ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            +S VVIIDD   VW  +  NL+ V  Y +F
Sbjct: 368  QSMVVIIDDRSDVW-GDSPNLVKVVPYDFF 396


>gi|396081720|gb|AFN83335.1| Fcp1-like phosphatase [Encephalitozoon romaleae SJ-2008]
          Length = 408

 Score = 69.7 bits (169), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 53/195 (27%), Positives = 90/195 (46%), Gaps = 53/195 (27%)

Query: 896  QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 955
            ++  +I KE+ + LE Q K+       LVLDLD T+L++A                    
Sbjct: 42   KEAVSIYKEKIKTLEMQMKLI------LVLDLDQTVLHTAY------------------- 76

Query: 956  REKPHRHLFRFPHMGMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
                 + + RF   G     KLRP +   L + S+L+E+H+YTMG + YA  + +++DP 
Sbjct: 77   GASSEKGIVRFTMDGCKYSVKLRPNLKRMLRKVSRLYEIHVYTMGTRPYAERIVRIIDPT 136

Query: 1014 GVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA---------VVIIDDSVRVWP 1064
               F  R+I+R ++                +GVL    +         +VI+DD   VW 
Sbjct: 137  RKYFHDRIITRDEN----------------QGVLVKRLSRLFPYNHKNIVILDDRADVWD 180

Query: 1065 HNKLNLIVVERYTYF 1079
            + + NL++++ + YF
Sbjct: 181  YCE-NLVLIKPFWYF 194


>gi|302834483|ref|XP_002948804.1| hypothetical protein VOLCADRAFT_89056 [Volvox carteri f. nagariensis]
 gi|300265995|gb|EFJ50184.1| hypothetical protein VOLCADRAFT_89056 [Volvox carteri f. nagariensis]
          Length = 2442

 Score = 69.7 bits (169), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 65/237 (27%), Positives = 98/237 (41%), Gaps = 51/237 (21%)

Query: 915  MFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKP--HRHLFRFP----- 967
            + S  KL LV+DLD  L +S    ++DP     L +    +   P   R LFR P     
Sbjct: 2044 LLSRGKLVLVVDLDGVLADSCWDDQLDPAAAAALSRHAAAEAGLPEDRRELFRLPLDAGA 2103

Query: 968  -------HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGR 1020
                     G+W KLRPG   FL RA + FE+  ++   + YA  + ++LDP   LF  R
Sbjct: 2104 TAGAASGGSGLWLKLRPGARAFLARAHERFELWAHSRQGRPYADAVVELLDPSLALFGSR 2163

Query: 1021 VISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVW--PHNKLNLIVVERYTY 1078
            V+++G+         R+  + D    +       I+D     W        L+ +  Y+Y
Sbjct: 2164 VVAQGELA------RRLLTALDARAPI-----TAILDTPSAAWMGEQLAPGLLPLPPYSY 2212

Query: 1079 F----PCSRRQF--------------------GLLGPSLLEIDHDERSEDGTLASSL 1111
            F     C+                        G+ G SLLE++ DE  E G LA++L
Sbjct: 2213 FSYRPACTADAAAGVGGGGAAPGARTLAPSASGMAGRSLLEVNRDECPERGVLAAAL 2269


>gi|323453463|gb|EGB09334.1| putative formate/nitrite transporter [Aureococcus anophagefferens]
          Length = 1144

 Score = 69.3 bits (168), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 61/195 (31%), Positives = 95/195 (48%), Gaps = 40/195 (20%)

Query: 897  QKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDR 956
            + + +  +R+R+LE +      R+L LVLDLDHTLL  +     DP    +      + R
Sbjct: 331  EASVLAAQRSRQLEGK------RQLQLVLDLDHTLLECS----TDPRAAALAAAPGSRVR 380

Query: 957  E------KPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVL 1010
                   +PH           W +LRP +  F    + L+E+ +YT G++ YA  +   L
Sbjct: 381  ALGAVAGRPH-----------WVRLRPRLEEFFAAVAPLYELAIYTHGSRQYAEAVRAAL 429

Query: 1011 DPK--GVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVL-GMESAVVIIDDSVRVWPHNK 1067
            + +  G+ F GRV+SR  D  P   D R  KS  LE +  G  +  +I+DD + VW   +
Sbjct: 430  EAEVPGLSFGGRVVSR--DCCP---DLRGEKS--LERLFPGGAARALILDDRLDVWTRGE 482

Query: 1068 ---LNLIVVERYTYF 1079
                 ++VV+ YTYF
Sbjct: 483  DQTPRVLVVQPYTYF 497


>gi|302698337|ref|XP_003038847.1| hypothetical protein SCHCODRAFT_255670 [Schizophyllum commune H4-8]
 gi|300112544|gb|EFJ03945.1| hypothetical protein SCHCODRAFT_255670 [Schizophyllum commune H4-8]
          Length = 1207

 Score = 69.3 bits (168), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 45/124 (36%), Positives = 59/124 (47%), Gaps = 13/124 (10%)

Query: 961  RHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGR 1020
            RH+        + K RPG   F+   S  +EMH+YTMG + YA  +  VLDP G LF  R
Sbjct: 608  RHIALDEGCVYYIKPRPGWQEFMNNMSAKYEMHVYTMGTRAYAMAVCNVLDPDGRLFGER 667

Query: 1021 VISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHN----KLNLIVVER 1075
            ++SR + G            K L+ +    +S VVIIDD   VW         NLI V  
Sbjct: 668  ILSRDESGS--------LTQKSLDRLFPTDQSMVVIIDDRADVWSGGLQFWSPNLIKVVP 719

Query: 1076 YTYF 1079
            Y +F
Sbjct: 720  YDFF 723


>gi|429964988|gb|ELA46985.1| FCP1-like phosphatase, phosphatase domain-containing protein, partial
            [Vavraia culicis 'floridensis']
          Length = 231

 Score = 68.2 bits (165), Expect = 3e-08,   Method: Composition-based stats.
 Identities = 54/187 (28%), Positives = 86/187 (45%), Gaps = 29/187 (15%)

Query: 912  QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDE-------------ILRKKEEQDREK 958
            + ++   +K+ LV+DLD T+L+S +  +   V D              I  K+  Q R +
Sbjct: 53   RDELIQKKKMILVVDLDQTILHSIEV-KGGRVGDNGSRNRNGECGGRGITNKQLLQARPR 111

Query: 959  ---PHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
               P    +      M T LRP + TFL   +++F MH+YTMG   Y  ++  V+D    
Sbjct: 112  QPLPSSFTYTLASTTMKTTLRPHLHTFLTELNEMFHMHIYTMGTSEYVHQITNVIDRDRS 171

Query: 1016 LFAGRVISRGDDGDPFDGDERVPKSKDLEGVLG-MESAVVIIDDSVRVWPHNKLNLIVVE 1074
            LF  R+++R D+             K LE + G  E  VV+IDD   VW +   NL+++ 
Sbjct: 172  LFGDRIVTRDDE----------VLVKRLERLFGDREDMVVVIDDRGDVWEYCG-NLVMIR 220

Query: 1075 RYTYFPC 1081
             +    C
Sbjct: 221  PFFGVDC 227


>gi|409083591|gb|EKM83948.1| hypothetical protein AGABI1DRAFT_124274 [Agaricus bisporus var.
            burnettii JB137-S8]
          Length = 853

 Score = 68.2 bits (165), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 40/109 (36%), Positives = 57/109 (52%), Gaps = 10/109 (9%)

Query: 972  WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPF 1031
            + K RPG   FL   +  ++MH+YTMG + YA E+   +DP G +F  R++SR + G   
Sbjct: 269  YIKPRPGWKEFLMDMATKYDMHVYTMGTRAYAEEVCAAIDPDGSVFKSRILSRDESGS-- 326

Query: 1032 DGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
                     K L+ +   + S VVIIDD   VW  +  NLI V  Y +F
Sbjct: 327  ------LTQKSLQRLFPCDTSMVVIIDDRADVWEWSP-NLIKVIPYDFF 368


>gi|449018404|dbj|BAM81806.1| similar to TFIIF interacting component of CTD phosphatase Fcp1p
            [Cyanidioschyzon merolae strain 10D]
          Length = 1640

 Score = 68.2 bits (165), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 43/118 (36%), Positives = 64/118 (54%), Gaps = 15/118 (12%)

Query: 971  MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1030
             + KLRPG+  FL   +  FE+H+YTMG++ YA  +A ++D    LF GR+ SR D    
Sbjct: 521  YYIKLRPGLHEFLRTIADRFELHIYTMGSRPYADTVASIIDSDERLFQGRITSRDD---- 576

Query: 1031 FDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWP------HNKL--NLIVVERYTYF 1079
            F+ D R+   K+L+ V    +S V+++DD   VW       H +   NLI    Y +F
Sbjct: 577  FE-DGRL-NQKNLKHVFPCDDSMVLVVDDREDVWVAQDQSLHGRHFPNLIRARPYYFF 632


>gi|432861327|ref|XP_004069613.1| PREDICTED: CTD small phosphatase-like protein 2-A-like [Oryzias
            latipes]
          Length = 473

 Score = 67.8 bits (164), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 66/245 (26%), Positives = 114/245 (46%), Gaps = 28/245 (11%)

Query: 835  VSQNSPIQPGQIKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYD 894
            ++ ++ I+ G+I +  DM  +           GS P + P  A P++++ D   +F+ Y 
Sbjct: 211  ITSDTSIEEGEIITETDMPPLTA---PGCMSVGSYPHSIP-SAPPETSYEDEWEVFDPYY 266

Query: 895  DQQKA--AIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKE 952
              +      +++ TR+     K  S  +  LVLDLD TL++ +                E
Sbjct: 267  FIKHVPPLTEEQLTRKPALPLKTRSTPEFSLVLDLDETLVHCSL--------------NE 312

Query: 953  EQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDP 1012
             +D       LF+     ++ +LRP    FLER S+L+E+ L+T   K+YA ++  +LDP
Sbjct: 313  LEDAALTFPVLFQDVIYQVYVRLRPFFREFLERMSQLYEIILFTASKKVYADKLLNILDP 372

Query: 1013 KGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLI 1071
            K  L   R+    +      G+      KDL  +LG + S  VIID+S + + +   N I
Sbjct: 373  KKQLVRHRLFR--EHCVCVQGN----YIKDL-NILGRDLSKTVIIDNSPQAFAYQLSNGI 425

Query: 1072 VVERY 1076
             +E +
Sbjct: 426  PIESW 430


>gi|125584005|gb|EAZ24936.1| hypothetical protein OsJ_08716 [Oryza sativa Japonica Group]
          Length = 364

 Score = 67.0 bits (162), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 53/131 (40%), Positives = 68/131 (51%), Gaps = 22/131 (16%)

Query: 998  GNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLG-------ME 1050
            G + YA  +AK+LDP GV F  R+ISR  D  P       P  K L+ V G         
Sbjct: 99   GTEDYAAAVAKLLDPDGVYFGERIISR--DESP------QPDRKSLDVVFGSAPASAAER 150

Query: 1051 SAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDH--DERSEDGT-L 1107
            +AVVI+DD+  VW  N  NLI +ERY YF  S R FG    S  E  H   ER  D +  
Sbjct: 151  AAVVILDDTAEVWEGNSDNLIEMERYHYFASSCRDFG----SPWECTHSLSERGVDESER 206

Query: 1108 ASSLGVRQQLH 1118
            A++L V +++H
Sbjct: 207  AAALRVLRRVH 217


>gi|221055253|ref|XP_002258765.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
            knowlesi strain H]
 gi|193808835|emb|CAQ39537.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
            knowlesi strain H]
          Length = 1474

 Score = 67.0 bits (162), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 43/117 (36%), Positives = 58/117 (49%), Gaps = 14/117 (11%)

Query: 968  HMGMWT---KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISR 1024
            H G +T   KLRPG+  FL++ +K +E++LYTMG   +A     +LDP    F  RV SR
Sbjct: 548  HKGSYTIYYKLRPGVIQFLQKMNKKYEIYLYTMGTLEHAKSCLLLLDPLKKFFGNRVFSR 607

Query: 1025 GDDGDPFDGDERVPKSKDLEGVL-GMESAVVIIDDSVRVWPHNKLNLIVVERYTYFP 1080
             D          V   K L  +L    S  + IDDS  +W  +  + I V  Y YFP
Sbjct: 608  KD---------SVNGLKHLNRILPTYRSVSLCIDDSDYMWKESS-SCIKVHGYNYFP 654


>gi|156096809|ref|XP_001614438.1| hypothetical protein [Plasmodium vivax Sal-1]
 gi|148803312|gb|EDL44711.1| hypothetical protein, conserved [Plasmodium vivax]
          Length = 1467

 Score = 66.6 bits (161), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 43/117 (36%), Positives = 58/117 (49%), Gaps = 14/117 (11%)

Query: 968  HMGMWT---KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISR 1024
            H G +T   KLRPG+  FL++ +K +E++LYTMG   +A     +LDP    F  RV SR
Sbjct: 532  HKGSYTIYYKLRPGVIQFLQKMNKKYEIYLYTMGTLEHAKSCLLLLDPLKNFFGNRVFSR 591

Query: 1025 GDDGDPFDGDERVPKSKDLEGVL-GMESAVVIIDDSVRVWPHNKLNLIVVERYTYFP 1080
             D          V   K L  +L    S  + IDDS  +W  +  + I V  Y YFP
Sbjct: 592  KD---------SVNGLKHLNRILPTYRSVSLCIDDSDYMWKESS-SCIKVHGYNYFP 638


>gi|124802229|ref|XP_001347409.1| protein phosphatase, putative [Plasmodium falciparum 3D7]
 gi|23494988|gb|AAN35322.1| protein phosphatase, putative [Plasmodium falciparum 3D7]
          Length = 1438

 Score = 66.6 bits (161), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 40/111 (36%), Positives = 54/111 (48%), Gaps = 11/111 (9%)

Query: 971  MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1030
            ++ KLRPG+  FL   S+ +E++LYTMG   +A     +LDP    F  RV SR D    
Sbjct: 572  IYYKLRPGVIEFLRTMSEKYEIYLYTMGTLEHAKSCLFLLDPLRKFFGNRVFSRKD---- 627

Query: 1031 FDGDERVPKSKDLEGVL-GMESAVVIIDDSVRVWPHNKLNLIVVERYTYFP 1080
                  +   K L  +L    S  + IDDS  +W  N  + I V  Y YFP
Sbjct: 628  -----CLNSLKHLNKILPTYRSVSICIDDSDYIWKENS-SCIKVHGYNYFP 672


>gi|330796177|ref|XP_003286145.1| hypothetical protein DICPUDRAFT_87022 [Dictyostelium purpureum]
 gi|325083890|gb|EGC37331.1| hypothetical protein DICPUDRAFT_87022 [Dictyostelium purpureum]
          Length = 793

 Score = 66.2 bits (160), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 50/182 (27%), Positives = 84/182 (46%), Gaps = 35/182 (19%)

Query: 917  SARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMG--MWTK 974
            S  K+ L++D+DHTL++S K    DP  +    K +         H   FP      + K
Sbjct: 413  STPKMHLIVDIDHTLIHSTK----DPNGESYFLKDKT-------VHKISFPETNETFYVK 461

Query: 975  LRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISR---------- 1024
             RP    FL   S+ F +++Y+   K Y   +A +LDP   +F+ +VIS+          
Sbjct: 462  ERPNAIEFLRTLSQQFYIYVYSFHPKYYVERVASILDPHSNIFS-KVISKEIIESIENIK 520

Query: 1025 -----GDDGDPF--DGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYT 1077
                  +   PF    ++ VPK    E +    + ++I+DD   VW + + NLI+++ + 
Sbjct: 521  ICRENNNSQKPFIVFNEQNVPKIFKFESI----NQLIILDDREDVWRNFQDNLILLDTFK 576

Query: 1078 YF 1079
            YF
Sbjct: 577  YF 578


>gi|422292668|gb|EKU19970.1| rna polymerase ii ctd phosphatase, partial [Nannochloropsis gaditana
            CCMP526]
          Length = 419

 Score = 66.2 bits (160), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 53/167 (31%), Positives = 82/167 (49%), Gaps = 18/167 (10%)

Query: 932  LNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGM----WTKLRPGIWTFLERAS 987
            LN     ++  +H  I  + E  D +K   H F   + G     W  LRP + TFL +A 
Sbjct: 207  LNLILDIDLTLLHATIDPRAERLDHQKLEVHAFDIFNQGRILRHWCCLRPHLRTFLSQAH 266

Query: 988  KLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVL 1047
             L+ + +YT G + YA ++A++LDP   LF  R++SR D  D       +   K L+ + 
Sbjct: 267  ALYVLTIYTHGRRDYAHQVARLLDPDRTLFEDRIVSRDDCPD-------LHGQKSLQRLF 319

Query: 1048 --GMESAVVIIDDSVRVW----PHNKLNLIVVERYTYFPCSRRQFGL 1088
              G+E A +I+DDS +VW      + L ++  + YT F    R  GL
Sbjct: 320  PGGIEMA-LILDDSPQVWQGEQSRHLLPVLPFKFYTEFEEVNRVAGL 365


>gi|391332118|ref|XP_003740485.1| PREDICTED: RNA polymerase II subunit A C-terminal domain
            phosphatase-like [Metaseiulus occidentalis]
          Length = 646

 Score = 66.2 bits (160), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 41/122 (33%), Positives = 62/122 (50%), Gaps = 11/122 (9%)

Query: 962  HLFRFP-HMGMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA 1018
            H FR P     W  T++RPG   FL + S+LFE+H+ T G + YA  +  +LDP    F 
Sbjct: 168  HHFRLPGSSNAWYHTRIRPGTEDFLRKISQLFELHIVTFGARPYANHIVSLLDPGKKYFQ 227

Query: 1019 GRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYT 1077
             R+++R +   P        K+ +L+ +    +  V IIDD   VW     NL+ V+ Y 
Sbjct: 228  YRILTRDECFHP------QSKTANLKSLFPCGDQMVCIIDDREDVWNFAS-NLVAVKPYV 280

Query: 1078 YF 1079
            +F
Sbjct: 281  FF 282


>gi|348509633|ref|XP_003442352.1| PREDICTED: CTD small phosphatase-like protein 2-A-like [Oreochromis
            niloticus]
          Length = 476

 Score = 66.2 bits (160), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 89/359 (24%), Positives = 151/359 (42%), Gaps = 59/359 (16%)

Query: 735  MDELGKVRMKPRDPRRVLHGNALQRSGSLGPEFKTDGPSAPCTQGSK------ENLNFQK 788
            ++E   V +    PR  L G          P F    P+   + GS       E     K
Sbjct: 117  LEETTAVEVTTSPPRTTLLGTIF------SPVFNFFSPAKNASSGSDSPDQALEAEEIVK 170

Query: 789  QLGAPEAKPVLSQSVLQPDITQQFTKNLKHIADFMS-VSQ-PLTSEPMVSQNSPI-QPGQ 845
            QL        + Q+V  P  T   T+ L    ++ S VSQ P    P + + SP  + G+
Sbjct: 171  QLD-------IEQAVETPTSTATSTQELCVTTNYYSSVSQLPPLRPPHILEASPTTEEGE 223

Query: 846  IKSGADMKAVVTNHDDKQTGTGSGPEAG-----PVGAHPQSAWGDVEHLFEGYDDQQKA- 899
            + + AD+  +        T  G+ P+       P    P++++ +   +F+ Y   +   
Sbjct: 224  LHTDADLPPL--------TAPGTSPDMAHVDTLPATVPPEASYEEDWEVFDPYFFIKHVP 275

Query: 900  -AIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREK 958
               +++ TR+     K  S  +  LVLDLD TL++ +                E +D   
Sbjct: 276  PLTEEQLTRKPALPLKTRSTPEFSLVLDLDETLVHCSL--------------NELEDAAL 321

Query: 959  PHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA 1018
                LF+     ++ +LRP    FLER S+++E+ L+T   K+YA ++  +LDPK  L  
Sbjct: 322  TFPVLFQDVIYQVYVRLRPFFREFLERMSQIYEIILFTASKKVYADKLLNILDPKKQLVR 381

Query: 1019 GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
             R+    +      G+      KDL  +LG + S  +IID+S + + +   N I +E +
Sbjct: 382  HRLFR--EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 433


>gi|387196292|gb|AFJ68751.1| rna polymerase ii ctd phosphatase, partial [Nannochloropsis gaditana
            CCMP526]
          Length = 414

 Score = 66.2 bits (160), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 53/167 (31%), Positives = 82/167 (49%), Gaps = 18/167 (10%)

Query: 932  LNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGM----WTKLRPGIWTFLERAS 987
            LN     ++  +H  I  + E  D +K   H F   + G     W  LRP + TFL +A 
Sbjct: 202  LNLILDIDLTLLHATIDPRAERLDHQKLEVHAFDIFNQGRILRHWCCLRPHLRTFLSQAH 261

Query: 988  KLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVL 1047
             L+ + +YT G + YA ++A++LDP   LF  R++SR D  D       +   K L+ + 
Sbjct: 262  ALYVLTIYTHGRRDYAHQVARLLDPDRTLFEDRIVSRDDCPD-------LHGQKSLQRLF 314

Query: 1048 --GMESAVVIIDDSVRVW----PHNKLNLIVVERYTYFPCSRRQFGL 1088
              G+E A +I+DDS +VW      + L ++  + YT F    R  GL
Sbjct: 315  PGGIEMA-LILDDSPQVWQGEQSRHLLPVLPFKFYTEFEEVNRVAGL 360


>gi|297830090|ref|XP_002882927.1| hypothetical protein ARALYDRAFT_897807 [Arabidopsis lyrata subsp.
            lyrata]
 gi|297328767|gb|EFH59186.1| hypothetical protein ARALYDRAFT_897807 [Arabidopsis lyrata subsp.
            lyrata]
          Length = 287

 Score = 65.1 bits (157), Expect = 2e-07,   Method: Composition-based stats.
 Identities = 51/175 (29%), Positives = 71/175 (40%), Gaps = 44/175 (25%)

Query: 916  FSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKL 975
            +  RKL LV+DL H LL+S                                   G+  KL
Sbjct: 96   YGQRKLHLVVDLQHVLLDSN----------------------------------GVLVKL 121

Query: 976  RPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDE 1035
            RP    FL  A++LF ++ YT  +   A    K+LDP  + F  R I+           E
Sbjct: 122  RPFAREFLREANELFTIYAYTKSDPKQARSFIKLLDPLKIFFPSRFITIA---------E 172

Query: 1036 RVPKSKDLEGVLGMESAVVIIDDSVRVWPH-NKLNLIVVERYTYFPCSRRQFGLL 1089
               K K LE VL  E  VVI+D     W   ++ NL++++ Y YF     Q G +
Sbjct: 173  EKRKKKSLEFVLAEERGVVILDCKSETWEKDDERNLLLIKSYDYFKGMEYQQGFI 227


>gi|71026568|ref|XP_762950.1| hypothetical protein [Theileria parva strain Muguga]
 gi|68349902|gb|EAN30667.1| hypothetical protein TP03_0826 [Theileria parva]
          Length = 823

 Score = 65.1 bits (157), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 57/217 (26%), Positives = 89/217 (41%), Gaps = 59/217 (27%)

Query: 920  KLCLVLDLDHTLL------------------NSAKF------HEVDPVHDEILRKKEEQD 955
            KLCLVLDLD+TLL                  ++A +      +  D V  E+ +K E   
Sbjct: 301  KLCLVLDLDNTLLHATSQPPPPDIAIPILNYDTADYLNQYVQYGTDSVSLELQQKLENSV 360

Query: 956  REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
             +    +     +   + KLRPGI+ F  + S  F + L+TMG K +A    +V+DP+G+
Sbjct: 361  IKTIVYNETETSYCVSYFKLRPGIFQFFHKISDKFRLFLFTMGTKQHAASALQVIDPQGI 420

Query: 1016 LFAGRVISRGDDGD--------------------------------PFDGDERVPKSKDL 1043
             F  R+ SR +                                       D R    K L
Sbjct: 421  YFGNRIFSRYNTNSHNSTNSINSVNSVNSVNSVNSMNSVSNVVGVKKLRNDLRYCM-KSL 479

Query: 1044 EGVL-GMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            + +    ++ V+++DD+  VW +N L LI V  Y +F
Sbjct: 480  DRIFPNYKNLVLVMDDTEHVWTNN-LGLIKVHPYYFF 515


>gi|68525545|ref|XP_723632.1| NLI interacting factor [Plasmodium yoelii yoelii 17XNL]
 gi|23477988|gb|EAA15197.1| NLI interacting factor, putative [Plasmodium yoelii yoelii]
          Length = 1251

 Score = 65.1 bits (157), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 36/111 (32%), Positives = 55/111 (49%), Gaps = 11/111 (9%)

Query: 971  MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1030
            ++ KLRPG+  FL++ ++ +E++LYTMG   +A     +LDP    F  R+ SR D  + 
Sbjct: 428  IYYKLRPGVIEFLQKMNQKYEIYLYTMGTIEHAKSCLFLLDPLKKFFGNRIFSRKDCTNG 487

Query: 1031 FDGDERVPKSKDLEGVL-GMESAVVIIDDSVRVWPHNKLNLIVVERYTYFP 1080
                      K L  +L    S  + +DDS  +W     + I V  Y YFP
Sbjct: 488  M---------KHLNRILPTYRSISICVDDSEYIWKETN-SCIKVHAYNYFP 528


>gi|47224149|emb|CAG13069.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 159

 Score = 65.1 bits (157), Expect = 2e-07,   Method: Composition-based stats.
 Identities = 51/170 (30%), Positives = 79/170 (46%), Gaps = 25/170 (14%)

Query: 912  QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMG- 970
            Q K+  +RKL L++DLD+TL+++                 E   +  P +++F+    G 
Sbjct: 11   QDKLHQSRKLVLMVDLDNTLIHTT----------------EIPCQLSPKKNVFKMKLEGS 54

Query: 971  --MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 1028
               + +LRP    FLE+ S+LFE++++T   + YA  +A  LDP    FA R+ISR +  
Sbjct: 55   PTYYVRLRPYYKEFLEKISELFELNIFTFACQSYAKTVAGFLDPDNTFFAQRIISRDNCF 114

Query: 1029 DPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTY 1078
             P      V            ES   +IDD   VW      L+ V+ Y Y
Sbjct: 115  YPATKMANVRFFSPCG-----ESMTCMIDDREDVWNFAP-GLVAVKPYMY 158


>gi|297819962|ref|XP_002877864.1| hypothetical protein ARALYDRAFT_906616 [Arabidopsis lyrata subsp.
            lyrata]
 gi|297323702|gb|EFH54123.1| hypothetical protein ARALYDRAFT_906616 [Arabidopsis lyrata subsp.
            lyrata]
          Length = 284

 Score = 64.7 bits (156), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 62/224 (27%), Positives = 100/224 (44%), Gaps = 32/224 (14%)

Query: 914  KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMG-MW 972
            K    ++L LVL L  TL +S    ++    + +    E + R    R    FP+ G + 
Sbjct: 54   KSLKEKRLTLVLGLHGTLYDSRLVSQLSDGENYL--TGEVKSRFDLRRSKKFFPNQGEVL 111

Query: 973  TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFD 1032
             KLRP +  FL  A+KLF+M ++ + +     E+   LDP G  F  R+I+  D      
Sbjct: 112  FKLRPFVHEFLREANKLFQMTVFELCSPEQGEEVISFLDPHGTYFEKRIITNRD------ 165

Query: 1033 GDERVPKSKDLEGVLGMESAVVIIDDS-VRVWPHNKLNLIVVERYTYFPCSRRQFGL--- 1088
                  + K+L+ VL  E  +VI+DD  V  WP +  NL+ +  Y +F  +     +   
Sbjct: 166  -----SEMKNLDLVLADERGIVILDDKHVYWWPDDTTNLLQIAPYHFFKRNNNNTWITKL 220

Query: 1089 --LGPSLLEID------------HDERSEDGTLASSLGVRQQLH 1118
                   L ID             DE +EDG L ++L + +++H
Sbjct: 221  VNFFKKTLSIDDESDPKSYAEERRDEDAEDGGLENALELLKEVH 264


>gi|218197280|gb|EEC79707.1| hypothetical protein OsI_21008 [Oryza sativa Indica Group]
          Length = 485

 Score = 64.3 bits (155), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 51/171 (29%), Positives = 83/171 (48%), Gaps = 20/171 (11%)

Query: 919  RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPG 978
            + + LVLDLD TL++S+    VD         ++  D   P  H  +     ++ K RP 
Sbjct: 308  KNITLVLDLDETLIHSSA---VD---------RDGADFSFPMYHGLK--EHTVYVKKRPH 353

Query: 979  IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVP 1038
            + TFL++ S++F++ ++T     YA  +  +LDPK + F  R     D   P DG     
Sbjct: 354  VDTFLQKVSEMFKVVIFTASLSSYANRLLDMLDPKNIFFTKRYFR--DSCLPVDGSYL-- 409

Query: 1039 KSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLL 1089
              KDL  ++   + VVIID+S  V+   + N I +E +T  P  +    L+
Sbjct: 410  --KDLTVIVADLAKVVIIDNSPEVFRLQEENGIPIESWTSDPADKSLVELI 458


>gi|297740632|emb|CBI30814.3| unnamed protein product [Vitis vinifera]
          Length = 479

 Score = 64.3 bits (155), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 47/149 (31%), Positives = 73/149 (48%), Gaps = 28/149 (18%)

Query: 919  RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPG 978
            + + LVLDLD TL++S             L   ++ D   P    F      ++ K RP 
Sbjct: 305  KSITLVLDLDETLVHST------------LEHCDDADFTFPV--FFNMKDHTVYVKQRPY 350

Query: 979  IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG---DDGDPFDGDE 1035
            + TFLER +++FE+ ++T    +YA ++  +LDP G  F+ R         DG       
Sbjct: 351  LHTFLERVAEMFEIVVFTASQSIYAEQLLDILDPDGKFFSHRAYRESCIFSDG------- 403

Query: 1036 RVPKSKDLEGVLGMESA-VVIIDDSVRVW 1063
                +KDL  VLG++ A V IID+S +V+
Sbjct: 404  --SYTKDL-TVLGIDLAKVAIIDNSPQVF 429


>gi|222632581|gb|EEE64713.1| hypothetical protein OsJ_19569 [Oryza sativa Japonica Group]
          Length = 485

 Score = 64.3 bits (155), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 51/171 (29%), Positives = 83/171 (48%), Gaps = 20/171 (11%)

Query: 919  RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPG 978
            + + LVLDLD TL++S+    VD         ++  D   P  H  +     ++ K RP 
Sbjct: 308  KNITLVLDLDETLIHSSA---VD---------RDGADFSFPMYHGLK--EHTVYVKKRPH 353

Query: 979  IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVP 1038
            + TFL++ S++F++ ++T     YA  +  +LDPK + F  R     D   P DG     
Sbjct: 354  VDTFLQKVSEMFKVVIFTASLSSYANRLLDMLDPKNIFFTKRYFR--DSCLPVDGSYL-- 409

Query: 1039 KSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLL 1089
              KDL  ++   + VVIID+S  V+   + N I +E +T  P  +    L+
Sbjct: 410  --KDLTVIVADLAKVVIIDNSPEVFRLQEENGIPIESWTSDPADKSLVELI 458


>gi|225463384|ref|XP_002271705.1| PREDICTED: uncharacterized protein LOC100258847 [Vitis vinifera]
          Length = 484

 Score = 64.3 bits (155), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 47/149 (31%), Positives = 73/149 (48%), Gaps = 28/149 (18%)

Query: 919  RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPG 978
            + + LVLDLD TL++S             L   ++ D   P    F      ++ K RP 
Sbjct: 310  KSITLVLDLDETLVHST------------LEHCDDADFTFPV--FFNMKDHTVYVKQRPY 355

Query: 979  IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG---DDGDPFDGDE 1035
            + TFLER +++FE+ ++T    +YA ++  +LDP G  F+ R         DG       
Sbjct: 356  LHTFLERVAEMFEIVVFTASQSIYAEQLLDILDPDGKFFSHRAYRESCIFSDG------- 408

Query: 1036 RVPKSKDLEGVLGMESA-VVIIDDSVRVW 1063
                +KDL  VLG++ A V IID+S +V+
Sbjct: 409  --SYTKDL-TVLGIDLAKVAIIDNSPQVF 434


>gi|326499061|dbj|BAK06021.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 415

 Score = 63.9 bits (154), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 56/181 (30%), Positives = 83/181 (45%), Gaps = 28/181 (15%)

Query: 913  KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMW 972
            K+  S  +  LVLDLD TL++S             L   E+ D   P R  F      ++
Sbjct: 215  KQTRSCPRTTLVLDLDETLVHST------------LEPCEDSDFTFPVR--FNLRDHTIY 260

Query: 973  TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD---DGD 1029
             + RP +  FLER + +FE+ ++T    +YA ++  VLDPK  LF  RV        +G+
Sbjct: 261  VRCRPYLKDFLERVASMFEIIIFTASQSIYAEQLLNVLDPKRRLFRHRVYRESCVYVEGN 320

Query: 1030 PFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGL 1088
                       KDL  VLG + S VVI+D+S + +     N I +E +   P  +    L
Sbjct: 321  YL---------KDL-SVLGRDLSRVVIVDNSPQAFGFQLDNGIPIESWFDDPNDKELLAL 370

Query: 1089 L 1089
            L
Sbjct: 371  L 371


>gi|356530555|ref|XP_003533846.1| PREDICTED: uncharacterized protein LOC100786602 [Glycine max]
          Length = 470

 Score = 63.5 bits (153), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 54/189 (28%), Positives = 87/189 (46%), Gaps = 38/189 (20%)

Query: 879  PQSAWGDVEHLFEGYDDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFH 938
            PQS   ++  L E   + Q   I K+  RR          + + LVLDLD TL++S    
Sbjct: 266  PQSFIKNLPELSEIEVNGQPTLIPKQSPRR----------KSITLVLDLDETLVHST--- 312

Query: 939  EVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMG 998
             ++P            D +      F      ++ K RP +  FLER S++FE+ ++T  
Sbjct: 313  -LEPC----------DDADFTFTVFFNLKEYTVYVKQRPYLHAFLERVSEMFEVVIFTAS 361

Query: 999  NKLYATEMAKVLDPKGVLFAGRVISRG---DDGDPFDGDERVPKSKDLEGVLGMESA-VV 1054
              +YA ++  +LDP G   + R+        DG+          +KDL  +LG++ A V 
Sbjct: 362  QSIYAKQLLDILDPDGRFISRRMYRESCLFSDGN---------YTKDL-TILGVDLAKVA 411

Query: 1055 IIDDSVRVW 1063
            IID+S +V+
Sbjct: 412  IIDNSPQVF 420


>gi|68074755|ref|XP_679294.1| hypothetical protein [Plasmodium berghei strain ANKA]
 gi|56500009|emb|CAH99961.1| conserved hypothetical protein [Plasmodium berghei]
          Length = 983

 Score = 63.5 bits (153), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 36/111 (32%), Positives = 55/111 (49%), Gaps = 11/111 (9%)

Query: 971  MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1030
            ++ KLRPG+  FL++ ++ +E++LYTMG   +A     +LDP    F  R+ SR D  + 
Sbjct: 233  IYYKLRPGVIEFLQKMNQKYEIYLYTMGTIEHAKSCLFLLDPLKKFFGNRIFSRKDCTNG 292

Query: 1031 FDGDERVPKSKDLEGVL-GMESAVVIIDDSVRVWPHNKLNLIVVERYTYFP 1080
                      K L  +L    S  + +DDS  +W     + I V  Y YFP
Sbjct: 293  M---------KHLNRILPTYRSISICVDDSEYIWKEAN-SCIKVHAYNYFP 333


>gi|301115156|ref|XP_002905307.1| nuclear LIM factor interactor-interacting protein hyphal form,
            putative [Phytophthora infestans T30-4]
 gi|262110096|gb|EEY68148.1| nuclear LIM factor interactor-interacting protein hyphal form,
            putative [Phytophthora infestans T30-4]
          Length = 422

 Score = 63.5 bits (153), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 55/164 (33%), Positives = 81/164 (49%), Gaps = 28/164 (17%)

Query: 917  SARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHL---FRFPHMGMWT 973
            +A K+CLVLDLD TL++ +    VD V             + PH      F      +  
Sbjct: 236  NAPKICLVLDLDETLVHCS----VDEV-------------KNPHMQFPVTFNGVEYIVNV 278

Query: 974  KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDG 1033
            K RP +  FL+R SKLFE+ ++T  +K+YA ++  +LDP   L   R+  R D  D F  
Sbjct: 279  KKRPHMEYFLKRVSKLFEIVVFTASHKVYAEKLTNMLDPHRNLIKYRLY-RDDCLDVFGN 337

Query: 1034 DERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
                   KDL  VLG + S VV++D+S   + +   N I +E +
Sbjct: 338  -----YLKDL-NVLGRDLSKVVLVDNSPHAFGYQVNNGIPIETW 375


>gi|70952066|ref|XP_745226.1| hypothetical protein [Plasmodium chabaudi chabaudi]
 gi|56525483|emb|CAH77992.1| conserved hypothetical protein [Plasmodium chabaudi chabaudi]
          Length = 1224

 Score = 63.5 bits (153), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 36/111 (32%), Positives = 55/111 (49%), Gaps = 11/111 (9%)

Query: 971  MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1030
            ++ KLRPG+  FL++ ++ +E++LYTMG   +A     +LDP    F  R+ SR D  + 
Sbjct: 429  IYYKLRPGVIEFLQKMNQKYEIYLYTMGTIEHAKSCLFLLDPLKKFFGNRIFSRKDCTNG 488

Query: 1031 FDGDERVPKSKDLEGVL-GMESAVVIIDDSVRVWPHNKLNLIVVERYTYFP 1080
                      K L  +L    S  + +DDS  +W     + I V  Y YFP
Sbjct: 489  M---------KHLNRILPTYRSISICVDDSEYIWKEAN-SCIKVHAYNYFP 529


>gi|15239800|ref|NP_196747.1| SCP1-like small phosphatase 5 [Arabidopsis thaliana]
 gi|30683828|ref|NP_850809.1| SCP1-like small phosphatase 5 [Arabidopsis thaliana]
 gi|42573341|ref|NP_974767.1| SCP1-like small phosphatase 5 [Arabidopsis thaliana]
 gi|145334381|ref|NP_001078572.1| SCP1-like small phosphatase 5 [Arabidopsis thaliana]
 gi|7573353|emb|CAB87659.1| putative protein [Arabidopsis thaliana]
 gi|21553575|gb|AAM62668.1| unknown [Arabidopsis thaliana]
 gi|56550687|gb|AAV97797.1| At5g11860 [Arabidopsis thaliana]
 gi|332004345|gb|AED91728.1| SCP1-like small phosphatase 5 [Arabidopsis thaliana]
 gi|332004346|gb|AED91729.1| SCP1-like small phosphatase 5 [Arabidopsis thaliana]
 gi|332004347|gb|AED91730.1| SCP1-like small phosphatase 5 [Arabidopsis thaliana]
 gi|332004348|gb|AED91731.1| SCP1-like small phosphatase 5 [Arabidopsis thaliana]
          Length = 305

 Score = 63.2 bits (152), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 60/185 (32%), Positives = 87/185 (47%), Gaps = 36/185 (19%)

Query: 913  KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP----- 967
            K+  S   + LVLDLD TL++S             L    E D        F FP     
Sbjct: 104  KQTRSCPPISLVLDLDETLVHST------------LEPCGEVD--------FTFPVNFNE 143

Query: 968  --HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG 1025
              HM ++ + RP +  F+ER S+LFE+ ++T    +YA ++  VLDPK  LF  RV    
Sbjct: 144  EEHM-VYVRCRPHLKEFMERVSRLFEIIIFTASQSIYAEQLLNVLDPKRKLFRHRVYR-- 200

Query: 1026 DDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRR 1084
            D    FDG+      KDL  VLG + S V+I+D+S + +     N + +E +   P  + 
Sbjct: 201  DSCVFFDGN----YLKDL-SVLGRDLSRVIIVDNSPQAFGFQVENGVPIESWFNDPSDKE 255

Query: 1085 QFGLL 1089
               LL
Sbjct: 256  LLHLL 260


>gi|356556521|ref|XP_003546573.1| PREDICTED: uncharacterized protein LOC100799803 [Glycine max]
          Length = 471

 Score = 63.2 bits (152), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 45/149 (30%), Positives = 72/149 (48%), Gaps = 28/149 (18%)

Query: 919  RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPG 978
            + + LVLDLD TL++S   H                D +      F      ++ K RP 
Sbjct: 297  KSITLVLDLDETLVHSTLEHC--------------DDADFTFTVFFNLKEYIVYVKQRPY 342

Query: 979  IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG---DDGDPFDGDE 1035
            + TFLER S++FE+ ++T    +YA ++  +LDP G   + R+        DG+      
Sbjct: 343  LHTFLERVSEMFEVVIFTASQSIYAKQLLDILDPDGRFISRRMYRESCLFSDGN------ 396

Query: 1036 RVPKSKDLEGVLGMESA-VVIIDDSVRVW 1063
                +KDL  +LG++ A V IID+S +V+
Sbjct: 397  ---YTKDL-TILGVDLAKVAIIDNSPQVF 421


>gi|147839779|emb|CAN65912.1| hypothetical protein VITISV_035567 [Vitis vinifera]
          Length = 482

 Score = 63.2 bits (152), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 47/149 (31%), Positives = 73/149 (48%), Gaps = 28/149 (18%)

Query: 919  RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPG 978
            + + LVLDLD TL++S             L   ++ D   P    F      ++ K RP 
Sbjct: 308  KSITLVLDLDETLVHST------------LEHCDDADFTFPV--FFNMKDHTVYVKQRPY 353

Query: 979  IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG---DDGDPFDGDE 1035
            + TFLER +++FE+ ++T    +YA ++  +LDP G  F+ R         DG       
Sbjct: 354  LHTFLERVAEMFEIVVFTASQSIYAEQLLDILDPDGKFFSHRAYRESCIFSDG------- 406

Query: 1036 RVPKSKDLEGVLGMESA-VVIIDDSVRVW 1063
                +KDL  VLG++ A V IID+S +V+
Sbjct: 407  --SYTKDL-TVLGIDLAKVAIIDNSPQVF 432


>gi|255557435|ref|XP_002519748.1| conserved hypothetical protein [Ricinus communis]
 gi|223541165|gb|EEF42721.1| conserved hypothetical protein [Ricinus communis]
          Length = 474

 Score = 63.2 bits (152), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 48/155 (30%), Positives = 74/155 (47%), Gaps = 28/155 (18%)

Query: 913  KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMW 972
            K+    + + LVLDLD TL++S   H                D +      F      ++
Sbjct: 294  KESLMKKSVTLVLDLDETLVHSTLEHC--------------DDADFTFTVFFNLKEHTVY 339

Query: 973  TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG---DDGD 1029
             K RP + TFLER ++LFE+ ++T    +YA ++  +LDP+  L + RV        DG 
Sbjct: 340  VKRRPHLHTFLERVAELFEVVIFTASQSIYAAQLLDILDPEKKLISRRVYRESCIFTDG- 398

Query: 1030 PFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVW 1063
                      +KDL  VLG++ A V IID+S +V+
Sbjct: 399  --------SYTKDL-TVLGVDLAKVAIIDNSPQVF 424


>gi|297811303|ref|XP_002873535.1| NLI interacting factor family protein [Arabidopsis lyrata subsp.
            lyrata]
 gi|297319372|gb|EFH49794.1| NLI interacting factor family protein [Arabidopsis lyrata subsp.
            lyrata]
          Length = 305

 Score = 63.2 bits (152), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 60/185 (32%), Positives = 87/185 (47%), Gaps = 36/185 (19%)

Query: 913  KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP----- 967
            K+  S   + LVLDLD TL++S             L    E D        F FP     
Sbjct: 104  KQTRSCPPISLVLDLDETLVHST------------LEPCGEVD--------FTFPVNFNE 143

Query: 968  --HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG 1025
              HM ++ + RP +  F+ER S+LFE+ ++T    +YA ++  VLDPK  LF  RV    
Sbjct: 144  EEHM-VYVRCRPHLKEFMERVSRLFEIIIFTASQSIYAEQLLNVLDPKRKLFRHRVYR-- 200

Query: 1026 DDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRR 1084
            D    FDG+      KDL  VLG + S V+I+D+S + +     N + +E +   P  + 
Sbjct: 201  DSCVFFDGN----YLKDL-SVLGRDLSRVIIVDNSPQAFGFQVENGVPIESWFNDPSDKE 255

Query: 1085 QFGLL 1089
               LL
Sbjct: 256  LLHLL 260


>gi|351710351|gb|EHB13270.1| CTD small phosphatase-like protein 2 [Heterocephalus glaber]
          Length = 466

 Score = 62.8 bits (151), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 62/234 (26%), Positives = 104/234 (44%), Gaps = 24/234 (10%)

Query: 846  IKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQK 903
            I +G +      N D         PE+G   AH ++ + +   +F+ Y   +      ++
Sbjct: 211  INNGLEEAEETVNRDIPHLTAPVTPESGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEE 270

Query: 904  ERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHL 963
            +  R+     K  S  +  LVLDLD TL           VH  +    E +D       L
Sbjct: 271  QLNRKPALPLKTRSTPEFSLVLDLDETL-----------VHCSL---NELEDAALTFPVL 316

Query: 964  FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 1023
            F+     ++ +LRP    FLER S+++E+ L+T   K+YA ++  +LDPK  L   R+  
Sbjct: 317  FQDVIYQVYVRLRPFFREFLERMSQMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR 376

Query: 1024 RGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
              +      G+      KDL  +LG + S  +IID+S + + +   N I +E +
Sbjct: 377  --EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 423


>gi|147798518|emb|CAN65472.1| hypothetical protein VITISV_037605 [Vitis vinifera]
          Length = 506

 Score = 62.4 bits (150), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 61/210 (29%), Positives = 95/210 (45%), Gaps = 34/210 (16%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
            EE K+    +++ LVLDLD TL++S             L   +  D   P    F     
Sbjct: 327  EESKR----KRITLVLDLDETLVHST------------LEPCDHADFTFPV--FFNMKEH 368

Query: 970  GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG---D 1026
             ++ + RP +  FLER +++FE+ ++T    +YA ++  +LDP   LF+GR         
Sbjct: 369  TIYVRQRPFLQMFLERVAEMFEIIVFTASQSIYAEQLLDILDPDRKLFSGRAYRESCIFS 428

Query: 1027 DGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQ 1085
            DG           +KDL  VLG++ A V IID+S +V+     N I ++ +   P  R  
Sbjct: 429  DGS---------YTKDL-TVLGIDLAKVAIIDNSPQVFRLQVDNGIPIKSWFDDPSDRAL 478

Query: 1086 FGLLGPSLLEIDHDERSEDGTLASSLGVRQ 1115
              LL    LE   D       +A   GV++
Sbjct: 479  ISLL--PFLETLVDADDVRPIIAKRFGVKE 506


>gi|397787628|gb|AFO66533.1| putative NLI interacting factor family protein [Brassica napus]
          Length = 477

 Score = 62.4 bits (150), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 51/140 (36%), Positives = 73/140 (52%), Gaps = 26/140 (18%)

Query: 923  LVLDLDHTLLNSA--KFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIW 980
            LVLDLD TL++S+     EVD        ++E               HM ++ + RP + 
Sbjct: 116  LVLDLDETLVHSSLEPCGEVDFTFTVHFNEEE---------------HM-VYVRCRPHLK 159

Query: 981  TFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKS 1040
             F+ER S+LFE+ ++T    +YA ++  VLDPK  LF  RV    D    FDG+      
Sbjct: 160  EFMERVSRLFEVIIFTASQSIYAEQLLNVLDPKRKLFRHRVYR--DSCVFFDGN----YL 213

Query: 1041 KDLEGVLGME-SAVVIIDDS 1059
            KDL  VLG + S V+I+D+S
Sbjct: 214  KDL-SVLGRDLSRVIIVDNS 232


>gi|359487040|ref|XP_002265614.2| PREDICTED: uncharacterized protein LOC100267967 [Vitis vinifera]
          Length = 522

 Score = 62.4 bits (150), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 61/210 (29%), Positives = 95/210 (45%), Gaps = 34/210 (16%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
            EE K+    +++ LVLDLD TL++S             L   +  D   P    F     
Sbjct: 343  EESKR----KRITLVLDLDETLVHST------------LEPCDHADFTFPV--FFNMKEH 384

Query: 970  GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG---D 1026
             ++ + RP +  FLER +++FE+ ++T    +YA ++  +LDP   LF+GR         
Sbjct: 385  TIYVRQRPFLQMFLERVAEMFEIIVFTASQSIYAEQLLDILDPDRKLFSGRAYRESCIFS 444

Query: 1027 DGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQ 1085
            DG           +KDL  VLG++ A V IID+S +V+     N I ++ +   P  R  
Sbjct: 445  DGS---------YTKDL-TVLGIDLAKVAIIDNSPQVFRLQVDNGIPIKSWFDDPSDRAL 494

Query: 1086 FGLLGPSLLEIDHDERSEDGTLASSLGVRQ 1115
              LL    LE   D       +A   GV++
Sbjct: 495  ISLL--PFLETLVDADDVRPIIAKRFGVKE 522


>gi|397787605|gb|AFO66511.1| putative small phosphatase-like protein 2-B [Brassica napus]
          Length = 262

 Score = 62.4 bits (150), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 49/138 (35%), Positives = 74/138 (53%), Gaps = 22/138 (15%)

Query: 923  LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTF 982
            LVLDLD TL++S+    ++P  +         + E+         HM ++ + RP +  F
Sbjct: 71   LVLDLDETLVHSS----LEPCGEVDFTFTVHFNEEE---------HM-VYVRCRPHLKEF 116

Query: 983  LERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKD 1042
            +ER S+LFE+ ++T    +YA ++  VLDPK  LF  RV    D    FDG+      KD
Sbjct: 117  MERVSRLFEVIIFTASQSIYAEQLLNVLDPKRKLFRHRVYR--DSCVFFDGN----YLKD 170

Query: 1043 LEGVLGME-SAVVIIDDS 1059
            L  VLG + S V+I+D+S
Sbjct: 171  L-SVLGRDLSRVIIVDNS 187


>gi|66805733|ref|XP_636588.1| hypothetical protein DDB_G0288707 [Dictyostelium discoideum AX4]
 gi|60464974|gb|EAL63085.1| hypothetical protein DDB_G0288707 [Dictyostelium discoideum AX4]
          Length = 985

 Score = 62.4 bits (150), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 44/174 (25%), Positives = 81/174 (46%), Gaps = 28/174 (16%)

Query: 920  KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 979
            K+ L++D+DHTLL+S K    DP  +    K    ++           +   + K RP  
Sbjct: 574  KMYLIVDIDHTLLHSTK----DPNAESYYLKDNSINK-----FTITETNETFYVKQRPNA 624

Query: 980  WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISR------------GDD 1027
              FL   S  F+++LY+   K Y  ++A +LDP   +F  +VI++               
Sbjct: 625  IEFLSSLSSQFKIYLYSFHPKYYVEQLALILDPNRSIFT-KVITKEVIEPVEPLPPINSI 683

Query: 1028 GDPFDGDERVPKSKDLEGVLGMESA--VVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            G P+     V  +++   +   E+   ++I+DD   VW + + NLI+++ + +F
Sbjct: 684  GKPY----IVFNNQNFSKIFNFEAINQMIILDDREDVWRNFQDNLILLDTFKFF 733


>gi|296090552|emb|CBI40902.3| unnamed protein product [Vitis vinifera]
          Length = 570

 Score = 62.0 bits (149), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 61/210 (29%), Positives = 95/210 (45%), Gaps = 34/210 (16%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
            EE K+    +++ LVLDLD TL++S             L   +  D   P    F     
Sbjct: 391  EESKR----KRITLVLDLDETLVHST------------LEPCDHADFTFPV--FFNMKEH 432

Query: 970  GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG---D 1026
             ++ + RP +  FLER +++FE+ ++T    +YA ++  +LDP   LF+GR         
Sbjct: 433  TIYVRQRPFLQMFLERVAEMFEIIVFTASQSIYAEQLLDILDPDRKLFSGRAYRESCIFS 492

Query: 1027 DGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQ 1085
            DG           +KDL  VLG++ A V IID+S +V+     N I ++ +   P  R  
Sbjct: 493  DGS---------YTKDL-TVLGIDLAKVAIIDNSPQVFRLQVDNGIPIKSWFDDPSDRAL 542

Query: 1086 FGLLGPSLLEIDHDERSEDGTLASSLGVRQ 1115
              LL    LE   D       +A   GV++
Sbjct: 543  ISLL--PFLETLVDADDVRPIIAKRFGVKE 570


>gi|327288817|ref|XP_003229121.1| PREDICTED: CTD small phosphatase-like protein 2-like [Anolis
            carolinensis]
          Length = 466

 Score = 61.6 bits (148), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 67/265 (25%), Positives = 118/265 (44%), Gaps = 30/265 (11%)

Query: 821  DFMSVSQPLTSEPMVSQNSPI--QPGQIKS----GADMKAVVTNHDDKQTGTGSGPEAGP 874
            D   V + +TS    +  +P   Q  Q++S    G +    VT+ D         P++G 
Sbjct: 180  DMEQVDEIITSTAASANGTPYASQVAQVRSTINNGLEEAEDVTDRDLPPLTAPVSPDSGY 239

Query: 875  VGAHPQSAWGDVEHLFEGYDDQQKA--AIQKERTRRLEEQKKMFSARKLCLVLDLDHTLL 932
              AH ++ + +   +F+ Y   +      +++  R+     K  S  +  LVLDLD TL+
Sbjct: 240  SSAHAEATYEEDWEVFDPYYFIKHVPPLTEEQLNRKPALPLKTRSTPEFSLVLDLDETLV 299

Query: 933  NSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEM 992
            + +                E +D       LF+     ++ +LRP    FLER S+++E+
Sbjct: 300  HCSL--------------NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERMSQIYEI 345

Query: 993  HLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGME-S 1051
             L+T   K+YA ++  +LDPK  L   R+    +      G+      KDL  +LG + S
Sbjct: 346  ILFTASKKVYADKLLNILDPKKQLVRHRLFR--EHCVCVQGN----YIKDL-NILGRDLS 398

Query: 1052 AVVIIDDSVRVWPHNKLNLIVVERY 1076
              +IID+S + + +   N I +E +
Sbjct: 399  KTIIIDNSPQAFAYQLSNGIPIESW 423


>gi|281204241|gb|EFA78437.1| hypothetical protein PPL_09089 [Polysphondylium pallidum PN500]
          Length = 1252

 Score = 61.6 bits (148), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 35/115 (30%), Positives = 60/115 (52%), Gaps = 8/115 (6%)

Query: 972  WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV-ISRGDDGDP 1030
            + K+RP   TFL+    LF + L+++ +K Y  +M +++DP   LF   + I    D  P
Sbjct: 934  YVKIRPYTITFLKTLYPLFNITLFSLNHKSYVNKMVEIIDPSKTLFKNIITIESFGDNIP 993

Query: 1031 FDGDERVPKS----KDLEGVLGMES--AVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
                 R P S     +   +  ++S  ++V+IDD   +W   + NLI+VER+ +F
Sbjct: 994  KQQTNR-PYSLFTPSNFSSIFKIDSSESIVVIDDREDIWRQFRDNLIMVERFIHF 1047


>gi|62078827|ref|NP_001014070.1| CTD small phosphatase-like protein 2 [Rattus norvegicus]
 gi|81883796|sp|Q5XIK8.1|CTSL2_RAT RecName: Full=CTD small phosphatase-like protein 2; Short=CTDSP-like
            2
 gi|53734232|gb|AAH83672.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A) small
            phosphatase like 2 [Rattus norvegicus]
 gi|149023119|gb|EDL80013.1| similar to hypothetical protein HSPC129 [Rattus norvegicus]
          Length = 465

 Score = 61.6 bits (148), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 58/211 (27%), Positives = 98/211 (46%), Gaps = 24/211 (11%)

Query: 869  GPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQKERTRRLEEQKKMFSARKLCLVLD 926
             PE+G   AH ++ + +   +F+ Y   +      +++  R+     K  S  +  LVLD
Sbjct: 233  APESGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEEQLNRKPALPLKTRSTPEFSLVLD 292

Query: 927  LDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERA 986
            LD TL           VH  +    E +D       LF+     ++ +LRP    FLER 
Sbjct: 293  LDETL-----------VHCSL---NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERM 338

Query: 987  SKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGV 1046
            S+++E+ L+T   K+YA ++  +LDPK  L   R+    +      G+      KDL  +
Sbjct: 339  SQMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR--EHCVCVQGN----YIKDL-NI 391

Query: 1047 LGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
            LG + S  +IID+S + + +   N I +E +
Sbjct: 392  LGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 422


>gi|123900520|sp|Q3KQB6.1|CTL2B_XENLA RecName: Full=CTD small phosphatase-like protein 2-B;
            Short=CTDSP-like 2-B
 gi|76779483|gb|AAI06291.1| Ctdspl2b protein [Xenopus laevis]
          Length = 466

 Score = 61.6 bits (148), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 59/211 (27%), Positives = 99/211 (46%), Gaps = 24/211 (11%)

Query: 869  GPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQKERTRRLEEQKKMFSARKLCLVLD 926
             PE+G   AH ++A+ +   +F+ Y   +      +++  R+     K  S  +  LVLD
Sbjct: 234  SPESGYSSAHAEAAYEEDWEVFDPYFFIKHVPPLTEEQLNRKPALPLKTRSTPEFSLVLD 293

Query: 927  LDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERA 986
            LD TL           VH  +    E +D       LF+     ++ +LRP    FLER 
Sbjct: 294  LDETL-----------VHCSL---NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERM 339

Query: 987  SKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGV 1046
            S+++E+ L+T   K+YA ++  +LDPK  L   R+    +      G+      KDL  +
Sbjct: 340  SQIYEIILFTASKKVYADKLLNILDPKKRLVRHRLFR--EHCVCVQGN----YIKDL-NI 392

Query: 1047 LGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
            LG + S  +IID+S + + +   N I +E +
Sbjct: 393  LGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 423


>gi|56605878|ref|NP_001008438.1| CTD small phosphatase-like protein 2 [Xenopus (Silurana) tropicalis]
 gi|82181540|sp|Q66KM5.1|CTSL2_XENTR RecName: Full=CTD small phosphatase-like protein 2; Short=CTDSP-like
            2
 gi|51512946|gb|AAH80328.1| MGC79498 protein [Xenopus (Silurana) tropicalis]
          Length = 466

 Score = 61.6 bits (148), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 59/211 (27%), Positives = 99/211 (46%), Gaps = 24/211 (11%)

Query: 869  GPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQKERTRRLEEQKKMFSARKLCLVLD 926
             PE+G   AH ++A+ +   +F+ Y   +      +++  R+     K  S  +  LVLD
Sbjct: 234  SPESGYSSAHAEAAYEEDWEVFDPYYFIKHVPPLTEEQLNRKPALPLKTRSTPEFSLVLD 293

Query: 927  LDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERA 986
            LD TL           VH  +    E +D       LF+     ++ +LRP    FLER 
Sbjct: 294  LDETL-----------VHCSL---NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERM 339

Query: 987  SKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGV 1046
            S+++E+ L+T   K+YA ++  +LDPK  L   R+    +      G+      KDL  +
Sbjct: 340  SQIYEIILFTASKKVYADKLLNILDPKKRLVRHRLFR--EHCVCVQGN----YIKDL-NI 392

Query: 1047 LGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
            LG + S  +IID+S + + +   N I +E +
Sbjct: 393  LGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 423


>gi|351699228|gb|EHB02147.1| CTD small phosphatase-like protein 2 [Heterocephalus glaber]
          Length = 465

 Score = 61.6 bits (148), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 58/234 (24%), Positives = 102/234 (43%), Gaps = 24/234 (10%)

Query: 846  IKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQK 903
            I +G +      N D         PE+     H +  + +   +F+ Y+  +      ++
Sbjct: 210  INNGLEEAEATVNRDIPHLTAPVTPESDYSSIHAEVTYEEDWEIFDPYNFIKHVPPLTEQ 269

Query: 904  ERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHL 963
            +  R+     K  +  K  LVLDLD TL++ +     D  H   +              L
Sbjct: 270  QLNRKPALPLKTRAKTKFSLVLDLDETLVHCSLNELEDAAHTFPV--------------L 315

Query: 964  FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 1023
            F+     ++ +LRP    FLER SK++E+ ++T   K+YA ++  +LDPK  L   R+  
Sbjct: 316  FQGVIYQVYVRLRPFFREFLERMSKMYEIIVFTAAKKVYAEKLLNILDPKKQLVRHRLFQ 375

Query: 1024 RGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
              +      G+      KDL  +LG + S  +IID+S + + +   N I +E +
Sbjct: 376  --EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 422


>gi|147907092|ref|NP_001089935.1| CTD small phosphatase-like protein 2-B [Xenopus laevis]
 gi|83405117|gb|AAI10767.1| Ctdspl2b protein [Xenopus laevis]
          Length = 466

 Score = 61.6 bits (148), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 59/211 (27%), Positives = 99/211 (46%), Gaps = 24/211 (11%)

Query: 869  GPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQKERTRRLEEQKKMFSARKLCLVLD 926
             PE+G   AH ++A+ +   +F+ Y   +      +++  R+     K  S  +  LVLD
Sbjct: 234  SPESGYSSAHAEAAYEEDWEVFDPYFFIKHVPPLTEEQLNRKPALPLKTRSTPEFSLVLD 293

Query: 927  LDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERA 986
            LD TL           VH  +    E +D       LF+     ++ +LRP    FLER 
Sbjct: 294  LDETL-----------VHCSL---NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERM 339

Query: 987  SKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGV 1046
            S+++E+ L+T   K+YA ++  +LDPK  L   R+    +      G+      KDL  +
Sbjct: 340  SQIYEIILFTASKKVYADKLLNILDPKKRLVRHRLFR--EHCVCVQGN----YIKDL-NI 392

Query: 1047 LGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
            LG + S  +IID+S + + +   N I +E +
Sbjct: 393  LGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 423


>gi|357156635|ref|XP_003577523.1| PREDICTED: CTD small phosphatase-like protein 2-like isoform 1
            [Brachypodium distachyon]
          Length = 411

 Score = 61.6 bits (148), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 55/181 (30%), Positives = 82/181 (45%), Gaps = 28/181 (15%)

Query: 913  KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMW 972
            K+  S  +  LVLDLD TL++S             L   E+ D   P    F      ++
Sbjct: 219  KQTRSCPRTTLVLDLDETLVHST------------LEPCEDSDFTFPVH--FNLREHTIY 264

Query: 973  TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD---DGD 1029
             + RP +  FLER + +FE+ ++T    +YA ++  VLDPK  LF  RV        +G+
Sbjct: 265  VRCRPYLKEFLERVASMFEIIIFTASQSIYAEQLLNVLDPKRKLFRHRVYRESCVYVEGN 324

Query: 1030 PFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGL 1088
                       KDL  VLG + A VVI+D+S + +     N I +E +   P  +    L
Sbjct: 325  YL---------KDL-SVLGRDLARVVIVDNSPQAFGFQLENGIPIESWFDDPNDKELLAL 374

Query: 1089 L 1089
            L
Sbjct: 375  L 375


>gi|229892336|ref|NP_001080602.1| CTD small phosphatase-like protein 2-A [Xenopus laevis]
 gi|82176945|sp|Q801R4.1|CTL2A_XENLA RecName: Full=CTD small phosphatase-like protein 2-A;
            Short=CTDSP-like 2-A
 gi|28838482|gb|AAH47962.1| Ctdspl2a protein [Xenopus laevis]
 gi|120538080|gb|AAI29525.1| Ctdspl2a protein [Xenopus laevis]
          Length = 466

 Score = 61.6 bits (148), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 59/211 (27%), Positives = 99/211 (46%), Gaps = 24/211 (11%)

Query: 869  GPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQKERTRRLEEQKKMFSARKLCLVLD 926
             PE+G   AH ++A+ +   +F+ Y   +      +++  R+     K  S  +  LVLD
Sbjct: 234  SPESGYSSAHAEAAYEEDWEVFDPYFFIKHVPPLTEEQLNRKPALPLKTRSTPEFSLVLD 293

Query: 927  LDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERA 986
            LD TL           VH  +    E +D       LF+     ++ +LRP    FLER 
Sbjct: 294  LDETL-----------VHCSL---NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERM 339

Query: 987  SKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGV 1046
            S+++E+ L+T   K+YA ++  +LDPK  L   R+    +      G+      KDL  +
Sbjct: 340  SQIYEIILFTASKKVYADKLLNILDPKKRLVRHRLFR--EHCVCVQGN----YIKDL-NI 392

Query: 1047 LGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
            LG + S  +IID+S + + +   N I +E +
Sbjct: 393  LGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 423


>gi|117606236|ref|NP_001071012.1| CTD small phosphatase-like protein 2-A [Danio rerio]
 gi|123884286|sp|Q08BB5.1|CTL2A_DANRE RecName: Full=CTD small phosphatase-like protein 2-A;
            Short=CTDSP-like 2-A
 gi|115528634|gb|AAI24795.1| Zgc:154017 [Danio rerio]
          Length = 469

 Score = 61.2 bits (147), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 68/254 (26%), Positives = 111/254 (43%), Gaps = 36/254 (14%)

Query: 829  LTSEPMVSQNSPIQPGQIKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEH 888
            L   P+   ++ ++ G+I + ADM  +        T  GS      V   P  A G  E 
Sbjct: 203  LNPRPLPHIDTTVEEGEIVTEADMPPL--------TAVGSNSNYPDVPPSP-PAEGTYEE 253

Query: 889  LFEGYD-----DQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPV 943
             +E +D            +++ TR+     K  S  +  LVLDLD TL++ +        
Sbjct: 254  DWEVFDPYFFIKHVPPLTEEQLTRKPALPLKTRSTPEFSLVLDLDETLVHCSL------- 306

Query: 944  HDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYA 1003
                    E +D       LF+     ++ +LRP    FLER S+++E+ L+T   K+YA
Sbjct: 307  -------NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERMSQIYEIILFTASKKVYA 359

Query: 1004 TEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRV 1062
             ++  +LDPK  L   R+    +      G+      KDL  +LG + S  VIID+S + 
Sbjct: 360  DKLLNILDPKKQLVRHRLFR--EHCVCVQGN----YIKDL-NILGRDLSKTVIIDNSPQA 412

Query: 1063 WPHNKLNLIVVERY 1076
            + +   N I +E +
Sbjct: 413  FAYQLSNGIPIESW 426


>gi|209879341|ref|XP_002141111.1| NLI interacting factor-like phosphatase family protein
            [Cryptosporidium muris RN66]
 gi|209556717|gb|EEA06762.1| NLI interacting factor-like phosphatase family protein
            [Cryptosporidium muris RN66]
          Length = 590

 Score = 61.2 bits (147), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 53/176 (30%), Positives = 85/176 (48%), Gaps = 22/176 (12%)

Query: 913  KKMFSARKLCLVLDLDHTLL---NSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP-- 967
            K   S  KL  +LDLD+TLL   NS K      + D I    E      P  + F  P  
Sbjct: 161  KDHLSQNKLVAILDLDNTLLHAYNSTKVGCNINLEDFIGANGE------PEMYKFVLPQD 214

Query: 968  -HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD 1026
             +   + KLRPG+  FL   +  + M + T   + YA  +  VLDPK   F  R+++R  
Sbjct: 215  MNTPYYLKLRPGVREFLNTIAPYYIMGICTNATREYADVIRAVLDPKRDKFGDRIVAR-- 272

Query: 1027 DGDPFDGDERVPKSKDLEGV-LGMES-AVVIIDDSVRVWPHN-KLNLIVVERYTYF 1079
              +  DG +     KD + + +G+++ A+V++DD   VW  + ++ ++  + Y YF
Sbjct: 273  --ENVDGRD---TQKDFKKICIGIDTRAIVLLDDRSDVWDSSLEIQVVKAQTYEYF 323


>gi|30851260|gb|AAH52660.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A) small
            phosphatase like 2 [Mus musculus]
          Length = 465

 Score = 61.2 bits (147), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 58/210 (27%), Positives = 98/210 (46%), Gaps = 24/210 (11%)

Query: 870  PEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQKERTRRLEEQKKMFSARKLCLVLDL 927
            PE+G   AH ++ + +   +F+ Y   +      +++  R+     K  S  +  LVLDL
Sbjct: 234  PESGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEEQLNRKPALPLKTRSTPEFSLVLDL 293

Query: 928  DHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERAS 987
            D TL           VH  +    E +D       LF+     ++ +LRP    FLER S
Sbjct: 294  DETL-----------VHCSL---NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERMS 339

Query: 988  KLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVL 1047
            +++E+ L+T   K+YA ++  +LDPK  L   R+    +      G+      KDL  +L
Sbjct: 340  QMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR--EHCVCVQGN----YIKDL-NIL 392

Query: 1048 GME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
            G + S  +IID+S + + +   N I +E +
Sbjct: 393  GRDLSKTIIIDNSPQAFAYQLSNGIPIESW 422


>gi|348512761|ref|XP_003443911.1| PREDICTED: CTD small phosphatase-like protein 2-A-like isoform 2
            [Oreochromis niloticus]
          Length = 471

 Score = 61.2 bits (147), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 68/266 (25%), Positives = 117/266 (43%), Gaps = 50/266 (18%)

Query: 828  PLTSEPMVSQ------NSPIQPGQIKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQS 881
            P  + PM+ Q      ++ I+ G+I +  DM  +            + P   P G++P S
Sbjct: 196  PCHTAPMLPQRLHITSDTTIEEGEIVTETDMPPL------------TAPGCMPDGSYPHS 243

Query: 882  --------AWGDVEHLFEGYDDQQKA--AIQKERTRRLEEQKKMFSARKLCLVLDLDHTL 931
                    ++ +   +F+ Y   +      +++ TR+     K  S  +  LVLDLD TL
Sbjct: 244  LPSVPAEPSYDEDWEVFDPYFFIKHVPPLTEEQLTRKPALPLKTRSTPEFSLVLDLDETL 303

Query: 932  LNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFE 991
            ++ +                E +D       LF+     ++ +LRP    FLER S+L+E
Sbjct: 304  VHCSL--------------NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERMSQLYE 349

Query: 992  MHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGME- 1050
            + L+T   K+YA ++  +LDPK  L   R+    +      G+      KDL  +LG + 
Sbjct: 350  IILFTASKKVYADKLLNILDPKKQLVRHRLFR--EHCVCVQGN----YIKDL-NILGRDL 402

Query: 1051 SAVVIIDDSVRVWPHNKLNLIVVERY 1076
            S  VIID+S + + +   N I +E +
Sbjct: 403  SKTVIIDNSPQAFAYQLSNGIPIESW 428


>gi|26343511|dbj|BAC35412.1| unnamed protein product [Mus musculus]
          Length = 464

 Score = 61.2 bits (147), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 58/210 (27%), Positives = 98/210 (46%), Gaps = 24/210 (11%)

Query: 870  PEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQKERTRRLEEQKKMFSARKLCLVLDL 927
            PE+G   AH ++ + +   +F+ Y   +      +++  R+     K  S  +  LVLDL
Sbjct: 233  PESGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEEQLNRKPALPLKTRSTPEFSLVLDL 292

Query: 928  DHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERAS 987
            D TL           VH  +    E +D       LF+     ++ +LRP    FLER S
Sbjct: 293  DETL-----------VHCSL---NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERMS 338

Query: 988  KLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVL 1047
            +++E+ L+T   K+YA ++  +LDPK  L   R+    +      G+      KDL  +L
Sbjct: 339  QMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR--EHCVCVQGN----YIKDL-NIL 391

Query: 1048 GME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
            G + S  +IID+S + + +   N I +E +
Sbjct: 392  GRDLSKTIIIDNSPQAFAYQLSNGIPIESW 421


>gi|426233772|ref|XP_004010888.1| PREDICTED: CTD small phosphatase-like protein 2 [Ovis aries]
          Length = 466

 Score = 61.2 bits (147), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 62/234 (26%), Positives = 104/234 (44%), Gaps = 24/234 (10%)

Query: 846  IKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQK 903
            I +G +      N D         PE+G   AH ++ + +   +F+ Y   +      ++
Sbjct: 211  INNGLEEAEETVNRDIPPLTAPVTPESGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEE 270

Query: 904  ERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHL 963
            +  R+     K  S  +  LVLDLD TL           VH  +    E +D       L
Sbjct: 271  QLNRKPALPLKTRSTPEFSLVLDLDETL-----------VHCSL---NELEDAALTFPVL 316

Query: 964  FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 1023
            F+     ++ +LRP    FLER S+++E+ L+T   K+YA ++  +LDPK  L   R+  
Sbjct: 317  FQDVIYQVYVRLRPFFREFLERMSQMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR 376

Query: 1024 RGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
              +      G+      KDL  +LG + S  +IID+S + + +   N I +E +
Sbjct: 377  --EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 423


>gi|126281910|ref|XP_001363358.1| PREDICTED: CTD (carboxy-terminal domain, RNA polymerase II,
            polypeptide A) small phosphatase like 2 [Monodelphis
            domestica]
          Length = 466

 Score = 61.2 bits (147), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 60/234 (25%), Positives = 105/234 (44%), Gaps = 24/234 (10%)

Query: 846  IKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQK 903
            I +G +     TN D         P++G   AH ++ + +   +F+ Y   +      ++
Sbjct: 211  INNGLEEVEEATNRDIPPLTAPVSPDSGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEE 270

Query: 904  ERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHL 963
            +  R+     K  S  +  LVLDLD TL++ +                E +D       L
Sbjct: 271  QLNRKPALPLKTRSTPEFSLVLDLDETLVHCSL--------------NELEDAALTFPVL 316

Query: 964  FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 1023
            F+     ++ +LRP    FLER S+++E+ L+T   K+YA ++  +LDPK  L   R+  
Sbjct: 317  FQDVIYQVYVRLRPFFREFLERMSQIYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR 376

Query: 1024 RGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
              +      G+      KDL  +LG + S  +IID+S + + +   N I +E +
Sbjct: 377  --EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 423


>gi|354471693|ref|XP_003498075.1| PREDICTED: CTD small phosphatase-like protein 2 [Cricetulus griseus]
          Length = 465

 Score = 61.2 bits (147), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 58/210 (27%), Positives = 98/210 (46%), Gaps = 24/210 (11%)

Query: 870  PEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQKERTRRLEEQKKMFSARKLCLVLDL 927
            PE+G   AH ++ + +   +F+ Y   +      +++  R+     K  S  +  LVLDL
Sbjct: 234  PESGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEEQLNRKPALPLKTRSTPEFSLVLDL 293

Query: 928  DHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERAS 987
            D TL           VH  +    E +D       LF+     ++ +LRP    FLER S
Sbjct: 294  DETL-----------VHCSL---NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERMS 339

Query: 988  KLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVL 1047
            +++E+ L+T   K+YA ++  +LDPK  L   R+    +      G+      KDL  +L
Sbjct: 340  QMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR--EHCVCVQGN----YIKDL-NIL 392

Query: 1048 GME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
            G + S  +IID+S + + +   N I +E +
Sbjct: 393  GRDLSKTIIIDNSPQAFAYQLSNGIPIESW 422


>gi|148696132|gb|EDL28079.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A) small
            phosphatase like 2, isoform CRA_a [Mus musculus]
          Length = 465

 Score = 61.2 bits (147), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 58/210 (27%), Positives = 98/210 (46%), Gaps = 24/210 (11%)

Query: 870  PEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQKERTRRLEEQKKMFSARKLCLVLDL 927
            PE+G   AH ++ + +   +F+ Y   +      +++  R+     K  S  +  LVLDL
Sbjct: 234  PESGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEEQLNRKPALPLKTRSTPEFSLVLDL 293

Query: 928  DHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERAS 987
            D TL           VH  +    E +D       LF+     ++ +LRP    FLER S
Sbjct: 294  DETL-----------VHCSL---NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERMS 339

Query: 988  KLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVL 1047
            +++E+ L+T   K+YA ++  +LDPK  L   R+    +      G+      KDL  +L
Sbjct: 340  QMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR--EHCVCVQGN----YIKDL-NIL 392

Query: 1048 GME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
            G + S  +IID+S + + +   N I +E +
Sbjct: 393  GRDLSKTIIIDNSPQAFAYQLSNGIPIESW 422


>gi|74190363|dbj|BAE37265.1| unnamed protein product [Mus musculus]
          Length = 465

 Score = 61.2 bits (147), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 58/210 (27%), Positives = 98/210 (46%), Gaps = 24/210 (11%)

Query: 870  PEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQKERTRRLEEQKKMFSARKLCLVLDL 927
            PE+G   AH ++ + +   +F+ Y   +      +++  R+     K  S  +  LVLDL
Sbjct: 234  PESGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEEQLNRKPALPLKTRSTPEFSLVLDL 293

Query: 928  DHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERAS 987
            D TL           VH  +    E +D       LF+     ++ +LRP    FLER S
Sbjct: 294  DETL-----------VHCSL---NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERMS 339

Query: 988  KLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVL 1047
            +++E+ L+T   K+YA ++  +LDPK  L   R+    +      G+      KDL  +L
Sbjct: 340  QMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR--EHCVCVQGN----YIKDL-NIL 392

Query: 1048 GME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
            G + S  +IID+S + + +   N I +E +
Sbjct: 393  GRDLSKTIIIDNSPQAFAYQLSNGIPIESW 422


>gi|47059059|ref|NP_997615.1| CTD small phosphatase-like protein 2 [Mus musculus]
 gi|81873659|sp|Q8BG15.1|CTSL2_MOUSE RecName: Full=CTD small phosphatase-like protein 2; Short=CTDSP-like
            2
 gi|26326063|dbj|BAC26775.1| unnamed protein product [Mus musculus]
 gi|26329037|dbj|BAC28257.1| unnamed protein product [Mus musculus]
 gi|26340192|dbj|BAC33759.1| unnamed protein product [Mus musculus]
 gi|26349835|dbj|BAC38557.1| unnamed protein product [Mus musculus]
 gi|148696133|gb|EDL28080.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A) small
            phosphatase like 2, isoform CRA_b [Mus musculus]
          Length = 465

 Score = 61.2 bits (147), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 58/210 (27%), Positives = 98/210 (46%), Gaps = 24/210 (11%)

Query: 870  PEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQKERTRRLEEQKKMFSARKLCLVLDL 927
            PE+G   AH ++ + +   +F+ Y   +      +++  R+     K  S  +  LVLDL
Sbjct: 234  PESGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEEQLNRKPALPLKTRSTPEFSLVLDL 293

Query: 928  DHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERAS 987
            D TL           VH  +    E +D       LF+     ++ +LRP    FLER S
Sbjct: 294  DETL-----------VHCSL---NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERMS 339

Query: 988  KLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVL 1047
            +++E+ L+T   K+YA ++  +LDPK  L   R+    +      G+      KDL  +L
Sbjct: 340  QMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR--EHCVCVQGN----YIKDL-NIL 392

Query: 1048 GME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
            G + S  +IID+S + + +   N I +E +
Sbjct: 393  GRDLSKTIIIDNSPQAFAYQLSNGIPIESW 422


>gi|348512759|ref|XP_003443910.1| PREDICTED: CTD small phosphatase-like protein 2-A-like isoform 1
            [Oreochromis niloticus]
          Length = 474

 Score = 61.2 bits (147), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 68/266 (25%), Positives = 117/266 (43%), Gaps = 50/266 (18%)

Query: 828  PLTSEPMVSQ------NSPIQPGQIKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQS 881
            P  + PM+ Q      ++ I+ G+I +  DM  +            + P   P G++P S
Sbjct: 199  PCHTAPMLPQRLHITSDTTIEEGEIVTETDMPPL------------TAPGCMPDGSYPHS 246

Query: 882  --------AWGDVEHLFEGYDDQQKA--AIQKERTRRLEEQKKMFSARKLCLVLDLDHTL 931
                    ++ +   +F+ Y   +      +++ TR+     K  S  +  LVLDLD TL
Sbjct: 247  LPSVPAEPSYDEDWEVFDPYFFIKHVPPLTEEQLTRKPALPLKTRSTPEFSLVLDLDETL 306

Query: 932  LNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFE 991
            ++ +                E +D       LF+     ++ +LRP    FLER S+L+E
Sbjct: 307  VHCSL--------------NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERMSQLYE 352

Query: 992  MHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGME- 1050
            + L+T   K+YA ++  +LDPK  L   R+    +      G+      KDL  +LG + 
Sbjct: 353  IILFTASKKVYADKLLNILDPKKQLVRHRLFR--EHCVCVQGN----YIKDL-NILGRDL 405

Query: 1051 SAVVIIDDSVRVWPHNKLNLIVVERY 1076
            S  VIID+S + + +   N I +E +
Sbjct: 406  SKTVIIDNSPQAFAYQLSNGIPIESW 431


>gi|156083399|ref|XP_001609183.1| hypothetical protein [Babesia bovis T2Bo]
 gi|154796434|gb|EDO05615.1| hypothetical protein BBOV_IV000150 [Babesia bovis]
          Length = 692

 Score = 61.2 bits (147), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 38/123 (30%), Positives = 57/123 (46%), Gaps = 22/123 (17%)

Query: 966  FPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG 1025
            F ++  + KLRPG+  FL+  S  +EM +YT   K YA  +  +LDP   LF  R+++R 
Sbjct: 307  FANIRYYMKLRPGLRGFLQVLSLYYEMSIYTNATKEYADVVVSILDPDRSLFMDRIVART 366

Query: 1026 DDGDPFDGDERVPKSKDLEGVLGM------ESAVVIIDDSVRVW---PHNKLNLIVVERY 1076
              G+           +DL+              VV  DD   VW   PHN+  ++  E Y
Sbjct: 367  SAGE-----------RDLQKTAARLYPNLDPRFVVAFDDRADVWADVPHNQ--VVKAEHY 413

Query: 1077 TYF 1079
             +F
Sbjct: 414  DFF 416


>gi|71031738|ref|XP_765511.1| hypothetical protein [Theileria parva strain Muguga]
 gi|68352467|gb|EAN33228.1| hypothetical protein TP02_0943 [Theileria parva]
          Length = 769

 Score = 60.8 bits (146), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 38/119 (31%), Positives = 56/119 (47%), Gaps = 14/119 (11%)

Query: 966  FPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG 1025
            FP++  + KLRP I  FL+  S  +EM +YT   K YA  +  +LDP   LF  R+++R 
Sbjct: 339  FPNVNYYMKLRPCIREFLQILSLYYEMSIYTNATKEYADVVISILDPDRSLFMDRIVARN 398

Query: 1026 --DDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVW---PHNKLNLIVVERYTYF 1079
              D+ D      R+    D   +L         DD   VW   PH +  ++  E Y +F
Sbjct: 399  SVDEKDLLKSASRLYPDLDTRFILAF-------DDRRDVWSDIPHKQ--VVRAEHYDFF 448


>gi|357156637|ref|XP_003577524.1| PREDICTED: CTD small phosphatase-like protein 2-like isoform 2
            [Brachypodium distachyon]
          Length = 443

 Score = 60.8 bits (146), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 55/181 (30%), Positives = 82/181 (45%), Gaps = 28/181 (15%)

Query: 913  KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMW 972
            K+  S  +  LVLDLD TL++S             L   E+ D   P    F      ++
Sbjct: 251  KQTRSCPRTTLVLDLDETLVHST------------LEPCEDSDFTFPVH--FNLREHTIY 296

Query: 973  TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD---DGD 1029
             + RP +  FLER + +FE+ ++T    +YA ++  VLDPK  LF  RV        +G+
Sbjct: 297  VRCRPYLKEFLERVASMFEIIIFTASQSIYAEQLLNVLDPKRKLFRHRVYRESCVYVEGN 356

Query: 1030 PFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGL 1088
                       KDL  VLG + A VVI+D+S + +     N I +E +   P  +    L
Sbjct: 357  YL---------KDL-SVLGRDLARVVIVDNSPQAFGFQLENGIPIESWFDDPNDKELLAL 406

Query: 1089 L 1089
            L
Sbjct: 407  L 407


>gi|254728754|gb|ACT79552.1| CTD phosphatase-like protein [Oryza glaberrima]
          Length = 462

 Score = 60.8 bits (146), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 50/155 (32%), Positives = 75/155 (48%), Gaps = 22/155 (14%)

Query: 923  LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTF 982
            LVLDLD TL++S             L   E+ D   P    F F    ++ + RP +  F
Sbjct: 278  LVLDLDETLVHST------------LEPCEDADFAFPV--YFNFREHTIYVRCRPYLKEF 323

Query: 983  LERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKD 1042
            LER + LFE  ++T    +YA ++  VLDPK  LF  RV    D     +G+      KD
Sbjct: 324  LERVANLFETIIFTASQSIYAEQLLNVLDPKRKLFRHRVYR--DSCVYVEGNYL----KD 377

Query: 1043 LEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
            L  VLG + + ++I+D+S + +     N I +E +
Sbjct: 378  L-TVLGRDLTRIMIVDNSPQAFGFQLDNGIPIESW 411


>gi|254728746|gb|ACT79548.1| CTD phosphatase-like protein [Oryza sativa Japonica Group]
 gi|254728748|gb|ACT79549.1| CTD phosphatase-like protein [Oryza sativa Indica Group]
 gi|254728750|gb|ACT79550.1| CTD phosphatase-like protein [Oryza rufipogon]
 gi|254728752|gb|ACT79551.1| CTD phosphatase-like protein [Oryza nivara]
          Length = 462

 Score = 60.8 bits (146), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 50/155 (32%), Positives = 75/155 (48%), Gaps = 22/155 (14%)

Query: 923  LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTF 982
            LVLDLD TL++S             L   E+ D   P    F F    ++ + RP +  F
Sbjct: 278  LVLDLDETLVHST------------LEPCEDADFAFPV--YFNFREHTIYVRCRPYLKEF 323

Query: 983  LERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKD 1042
            LER + LFE  ++T    +YA ++  VLDPK  LF  RV    D     +G+      KD
Sbjct: 324  LERVANLFETIIFTASQSIYAEQLLNVLDPKRKLFRHRVYR--DSCVYVEGNYL----KD 377

Query: 1043 LEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
            L  VLG + + ++I+D+S + +     N I +E +
Sbjct: 378  L-TVLGRDLTRIMIVDNSPQAFGFQLDNGIPIESW 411


>gi|26390099|dbj|BAC25842.1| unnamed protein product [Mus musculus]
          Length = 351

 Score = 60.8 bits (146), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 56/210 (26%), Positives = 98/210 (46%), Gaps = 24/210 (11%)

Query: 870  PEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQKERTRRLEEQKKMFSARKLCLVLDL 927
            PE+G   AH ++ + +   +F+ Y   +      +++  R+     K  S  +  LVLDL
Sbjct: 120  PESGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEEQLNRKPALPLKTRSTPEFSLVLDL 179

Query: 928  DHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERAS 987
            D TL++ +                E +D       LF+     ++ +LRP    FLER S
Sbjct: 180  DETLVHCSL--------------NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERMS 225

Query: 988  KLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVL 1047
            +++E+ L+T   K+YA ++  +LDPK  L   R+    +      G+      KDL  +L
Sbjct: 226  QMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR--EHCVCVQGN----YIKDL-NIL 278

Query: 1048 GME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
            G + S  +IID+S + + +   N I +E +
Sbjct: 279  GRDLSKTIIIDNSPQAFAYQLSNGIPIESW 308


>gi|118369793|ref|XP_001018099.1| NLI interacting factor-like phosphatase family protein [Tetrahymena
            thermophila]
 gi|89299866|gb|EAR97854.1| NLI interacting factor-like phosphatase family protein [Tetrahymena
            thermophila SB210]
          Length = 874

 Score = 60.8 bits (146), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 57/207 (27%), Positives = 98/207 (47%), Gaps = 35/207 (16%)

Query: 889  LFEGYDDQQKAAI----QKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVH 944
            ++ G D + K+ +      E +++L  Q+ + S +KL LVLDLD+T+L     H V  + 
Sbjct: 247  VYAGLDQKDKSVLIGKEYAEYSKKLAHQQ-LHSNQKLILVLDLDNTIL-----HAVPAIK 300

Query: 945  DEILRKKE--EQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLY 1002
            + +    +  +QD  K     F   +     K RP +  FL+     +E++++TM    Y
Sbjct: 301  NALFDNADGIQQDSFKE----FHNRYSKYVIKFRPYMKEFLQTVLPHYEIYIFTMAMLDY 356

Query: 1003 ATEMAK--------VLDPKGVLF-AGRVISRGDDGDPFDGDERVPKSKDLEGVL-GMESA 1052
            A  +          +LD   + F   R+ISR    + F  +     +KDL+ +L   E  
Sbjct: 357  AKCVCDYLKQTYKDILDDYPMTFNYDRIISR----EQFSSN-----NKDLQQILPNSEKI 407

Query: 1053 VVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            ++I+DD   VW  NK+NL+    Y Y+
Sbjct: 408  MLILDDRDDVWAKNKMNLVTTLPYIYW 434


>gi|114108339|gb|AAI23380.1| Ctdspl2a protein [Xenopus laevis]
          Length = 536

 Score = 60.5 bits (145), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 59/210 (28%), Positives = 99/210 (47%), Gaps = 24/210 (11%)

Query: 870  PEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQKERTRRLEEQKKMFSARKLCLVLDL 927
            PE+G   AH ++A+ +   +F+ Y   +      +++  R+     K  S  +  LVLDL
Sbjct: 305  PESGYSSAHAEAAYEEDWEVFDPYFFIKHVPPLTEEQLNRKPALPLKTRSTPEFSLVLDL 364

Query: 928  DHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERAS 987
            D TL           VH  +    E +D       LF+     ++ +LRP    FLER S
Sbjct: 365  DETL-----------VHCSL---NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERMS 410

Query: 988  KLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVL 1047
            +++E+ L+T   K+YA ++  +LDPK  L   R+    +      G+      KDL  +L
Sbjct: 411  QIYEIILFTASKKVYADKLLNILDPKKRLVRHRLFR--EHCVCVQGN----YIKDL-NIL 463

Query: 1048 GME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
            G + S  +IID+S + + +   N I +E +
Sbjct: 464  GRDLSKTIIIDNSPQAFAYQLSNGIPIESW 493


>gi|242072230|ref|XP_002446051.1| hypothetical protein SORBIDRAFT_06g001010 [Sorghum bicolor]
 gi|241937234|gb|EES10379.1| hypothetical protein SORBIDRAFT_06g001010 [Sorghum bicolor]
          Length = 447

 Score = 60.5 bits (145), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 36/104 (34%), Positives = 49/104 (47%), Gaps = 24/104 (23%)

Query: 923  LVLDLDHTLLNSAKFHEVD-----PVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRP 977
            LVLDLD TL++S   H  D     PVH                   F F    ++ + RP
Sbjct: 263  LVLDLDETLVHSTLEHCEDADFTFPVH-------------------FNFQEHTIYVRCRP 303

Query: 978  GIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
             +  FLER + +FE  ++T    +YA ++  VLDPK  LF  RV
Sbjct: 304  YLKEFLERVASMFETIIFTASQSIYAEQLLNVLDPKRKLFRHRV 347


>gi|221057654|ref|XP_002261335.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
            knowlesi strain H]
 gi|194247340|emb|CAQ40740.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
            knowlesi strain H]
          Length = 1389

 Score = 60.5 bits (145), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 38/133 (28%), Positives = 65/133 (48%), Gaps = 16/133 (12%)

Query: 958  KPHRHLFRFPHMGM--WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
            +P  + F  P+     + K RP +  FL+  S  +E+ +YT   + YA  +  +LDP   
Sbjct: 948  EPELYKFFLPYYNFFYYLKFRPYVRQFLQILSLYYELSIYTNATREYADVVIAILDPDRT 1007

Query: 1016 LFAGRVISRGDDGDPFDGDERVPKSKDLEGVL-GMESAVVI-IDDSVRVW---PHNKLNL 1070
            LFA R+++R +  D         ++K+   +   ++S  VI  DD   VW   PH+  N+
Sbjct: 1008 LFADRIVARCNSADR-------EENKNFSKIYPNVDSKYVIAFDDRKDVWTDIPHS--NI 1058

Query: 1071 IVVERYTYFPCSR 1083
            +  E Y +F  S+
Sbjct: 1059 LKAEHYNFFELSK 1071


>gi|297830094|ref|XP_002882929.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297328769|gb|EFH59188.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 270

 Score = 60.5 bits (145), Expect = 5e-06,   Method: Composition-based stats.
 Identities = 49/189 (25%), Positives = 78/189 (41%), Gaps = 28/189 (14%)

Query: 930  TLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKL 989
            TL++S K   +      ++++++   R+   ++  R        K RP +  FL+ A+KL
Sbjct: 80   TLIHSMKTLNLSNAEKYLIKEEKSGSRKDLRKYNDRL------VKFRPFVEEFLKEANKL 133

Query: 990  FEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM 1049
            F M  YT G   YA  + ++LDP  + F  R+I+R +           P  K L+ VL  
Sbjct: 134  FTMTAYTRGGSTYAKAVVRMLDPNKIYFGDRIITRKES----------PDLKTLDLVLAD 183

Query: 1050 ESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLAS 1109
            E  +VI             NL+ +  Y YF    R       S  E   DE      L  
Sbjct: 184  ERGIVI------------RNLLEITSYFYFKNDHRNIMRSRLSYAERKTDESRTKRALVK 231

Query: 1110 SLGVRQQLH 1118
             L   +++H
Sbjct: 232  LLKFLKEVH 240


>gi|348685327|gb|EGZ25142.1| hypothetical protein PHYSODRAFT_311755 [Phytophthora sojae]
          Length = 257

 Score = 60.5 bits (145), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 54/164 (32%), Positives = 80/164 (48%), Gaps = 28/164 (17%)

Query: 917  SARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHL---FRFPHMGMWT 973
            +A K+CLVLDLD TL++ +    VD V             + PH      F      +  
Sbjct: 72   NAPKICLVLDLDETLVHCS----VDEV-------------KNPHMQFPVTFNGVEYTVNV 114

Query: 974  KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDG 1033
            K RP +  FL+R SKLFE+ ++T  +K+YA ++  +LDP       R + R D  D F  
Sbjct: 115  KKRPHLEYFLKRVSKLFEIVVFTASHKVYAEKLMNMLDPNRNFIKYR-LYREDCLDVFGN 173

Query: 1034 DERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
                   KDL  VLG + S VV++D+S   + +   N I +E +
Sbjct: 174  -----YLKDL-NVLGRDLSKVVLVDNSPHAFGYQVNNGIPIETW 211


>gi|410912504|ref|XP_003969729.1| PREDICTED: CTD small phosphatase-like protein 2-like [Takifugu
            rubripes]
          Length = 474

 Score = 60.5 bits (145), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 89/358 (24%), Positives = 147/358 (41%), Gaps = 59/358 (16%)

Query: 735  MDELGKVRMKPRDPRRVLHGNALQRSGSLGPEFKTDGPSAPCTQGSK------ENLNFQK 788
            ++E   V +    PR  L G          P F    P+   + GS       E     K
Sbjct: 117  LEETTAVEVPSSAPRTTLLGTIF------SPVFNFFSPAKNASSGSDSPDQAMEAEEIVK 170

Query: 789  QLGAPEAKPVLSQSVLQPDITQQFTKNLKHIADFMSVSQPL--TSEPMVSQNSPIQPGQI 846
            QL        + Q+V  P  T     +L   ++F S   PL     P V + SP+   + 
Sbjct: 171  QLD-------MEQAVEMPTSTAMSPHDLCVASNFHSSVSPLPPLRPPHVPEASPLAVEE- 222

Query: 847  KSGADMKAVVTNHDDKQTGTGSGP-----EAGPVGAHPQSAWGDVEHLFEGYDDQQKA-- 899
            +  AD+  +        T  GS P     EA      P++++ +   +F+ Y   +    
Sbjct: 223  ELDADLPPL--------TAPGSSPDMTYVEAPLAAVPPEASYEEDWEVFDPYFFIKHVPP 274

Query: 900  AIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKP 959
              +++ TR+     K  S  +  LVLDLD TL++ +                E +D    
Sbjct: 275  LTEEQLTRKPALPLKTRSTPEFSLVLDLDETLVHCSL--------------NELEDAALT 320

Query: 960  HRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAG 1019
               LF+     ++ +LRP    FLER S+++E+ L+T   K+YA ++  +LDPK  L   
Sbjct: 321  FPVLFQDVIYQVYVRLRPFFREFLERMSQIYEIILFTASKKVYADKLLNILDPKKQLVRH 380

Query: 1020 RVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
            R+    +      G+      KDL  +LG + S  +IID+S + + +   N I +E +
Sbjct: 381  RLFR--EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 431


>gi|125557643|gb|EAZ03179.1| hypothetical protein OsI_25332 [Oryza sativa Indica Group]
 gi|125599502|gb|EAZ39078.1| hypothetical protein OsJ_23510 [Oryza sativa Japonica Group]
          Length = 461

 Score = 60.5 bits (145), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 50/155 (32%), Positives = 75/155 (48%), Gaps = 22/155 (14%)

Query: 923  LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTF 982
            LVLDLD TL++S             L   E+ D   P    F F    ++ + RP +  F
Sbjct: 278  LVLDLDETLVHST------------LEPCEDADFAFPV--YFNFREHTIYVRCRPYLKEF 323

Query: 983  LERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKD 1042
            LER + LFE  ++T    +YA ++  VLDPK  LF  RV    D     +G+      KD
Sbjct: 324  LERVANLFETIIFTASQSIYAEQLLNVLDPKRKLFRHRVYR--DSCVYVEGNYL----KD 377

Query: 1043 LEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
            L  VLG + + ++I+D+S + +     N I +E +
Sbjct: 378  L-TVLGRDLTRIMIVDNSPQAFGFQLDNGIPIESW 411


>gi|330864811|ref|NP_001178334.1| CTD small phosphatase-like protein 2 [Bos taurus]
 gi|296482877|tpg|DAA24992.1| TPA: CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
            small phosphatase like 2 [Bos taurus]
 gi|440911957|gb|ELR61572.1| CTD small phosphatase-like protein 2 [Bos grunniens mutus]
          Length = 466

 Score = 60.1 bits (144), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 61/234 (26%), Positives = 104/234 (44%), Gaps = 24/234 (10%)

Query: 846  IKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQK 903
            I +G +      N D         P++G   AH ++ + +   +F+ Y   +      ++
Sbjct: 211  INNGLEEAEETVNRDIPPLTAPVTPDSGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEE 270

Query: 904  ERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHL 963
            +  R+     K  S  +  LVLDLD TL           VH  +    E +D       L
Sbjct: 271  QLNRKPALPLKTRSTPEFSLVLDLDETL-----------VHCSL---NELEDAALTFPVL 316

Query: 964  FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 1023
            F+     ++ +LRP    FLER S+++E+ L+T   K+YA ++  +LDPK  L   R+  
Sbjct: 317  FQDVIYQVYVRLRPFFREFLERMSQMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR 376

Query: 1024 RGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
              +      G+      KDL  +LG + S  +IID+S + + +   N I +E +
Sbjct: 377  --EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 423


>gi|410961377|ref|XP_003987259.1| PREDICTED: CTD small phosphatase-like protein 2 isoform 1 [Felis
            catus]
 gi|410961379|ref|XP_003987260.1| PREDICTED: CTD small phosphatase-like protein 2 isoform 2 [Felis
            catus]
          Length = 466

 Score = 60.1 bits (144), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 61/234 (26%), Positives = 104/234 (44%), Gaps = 24/234 (10%)

Query: 846  IKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQK 903
            I +G +      N D         P++G   AH ++ + +   +F+ Y   +      ++
Sbjct: 211  INNGLEEAEETVNRDIPPLTAPVTPDSGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEE 270

Query: 904  ERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHL 963
            +  R+     K  S  +  LVLDLD TL           VH  +    E +D       L
Sbjct: 271  QLNRKPALPLKTRSTPEFSLVLDLDETL-----------VHCSL---NELEDAALTFPVL 316

Query: 964  FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 1023
            F+     ++ +LRP    FLER S+++E+ L+T   K+YA ++  +LDPK  L   R+  
Sbjct: 317  FQDVIYQVYVRLRPFFREFLERMSQMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR 376

Query: 1024 RGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
              +      G+      KDL  +LG + S  +IID+S + + +   N I +E +
Sbjct: 377  --EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 423


>gi|294905859|ref|XP_002777694.1| hypothetical protein Pmar_PMAR016441 [Perkinsus marinus ATCC 50983]
 gi|239885585|gb|EER09510.1| hypothetical protein Pmar_PMAR016441 [Perkinsus marinus ATCC 50983]
          Length = 523

 Score = 60.1 bits (144), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 54/211 (25%), Positives = 87/211 (41%), Gaps = 55/211 (26%)

Query: 920  KLCLVLDLDHTLLNSAKFHE--------VDPVHDEILRKKEEQDREKPHRHLFRFPHMGM 971
            +L +VLDLD T++NS +  +        V P+  EI R +E      P  +L     + +
Sbjct: 63   RLDVVLDLDRTMVNSFEIRKAGRSESENVTPILQEIYRDEEGL----PELYLCVISDVKV 118

Query: 972  WTKLRPGIWTFLER--ASKLFE--MHLYTMGNKLYATEMAKVLDPKGVLFAGRVISR--- 1024
             TK+RP    F+    AS  +   + +YT G++ Y   + ++LDP G L  GR++SR   
Sbjct: 119  LTKIRPHARAFIRELVASTDYGVVISIYTKGSRRYMEVVKQMLDPSGELIKGRLVSRDDE 178

Query: 1025 --------------------------GDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDD 1058
                                      G DG   +GD+   +S+       M    V++DD
Sbjct: 179  PSNMTPVEKDPDLIINASIESGAQVDGSDGRLCNGDKETKESE-------MRRWFVVLDD 231

Query: 1059 SVRVWP---HNKLNLIVVERYTYFPCSRRQF 1086
            S   WP       N++    Y +   + RQ 
Sbjct: 232  SPEAWPEELREAGNVVTANMYDFAEVNHRQL 262


>gi|149692003|ref|XP_001502897.1| PREDICTED: CTD (carboxy-terminal domain, RNA polymerase II,
            polypeptide A) small phosphatase like 2 isoform 2 [Equus
            caballus]
 gi|149692005|ref|XP_001502892.1| PREDICTED: CTD (carboxy-terminal domain, RNA polymerase II,
            polypeptide A) small phosphatase like 2 isoform 1 [Equus
            caballus]
          Length = 466

 Score = 60.1 bits (144), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 61/234 (26%), Positives = 104/234 (44%), Gaps = 24/234 (10%)

Query: 846  IKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQK 903
            I +G +      N D         P++G   AH ++ + +   +F+ Y   +      ++
Sbjct: 211  INNGLEEAEETVNRDIPPLTAPVTPDSGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEE 270

Query: 904  ERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHL 963
            +  R+     K  S  +  LVLDLD TL           VH  +    E +D       L
Sbjct: 271  QLNRKPALPLKTRSTPEFSLVLDLDETL-----------VHCSL---NELEDAALTFPVL 316

Query: 964  FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 1023
            F+     ++ +LRP    FLER S+++E+ L+T   K+YA ++  +LDPK  L   R+  
Sbjct: 317  FQDVIYQVYVRLRPFFREFLERMSQMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR 376

Query: 1024 RGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
              +      G+      KDL  +LG + S  +IID+S + + +   N I +E +
Sbjct: 377  --EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 423


>gi|47220514|emb|CAG05540.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 473

 Score = 60.1 bits (144), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 67/249 (26%), Positives = 110/249 (44%), Gaps = 36/249 (14%)

Query: 835  VSQNSPIQPGQIKSGADMKAVVTNHDDKQTGTGSGPEAG-PVGAHPQSAWGDVEHLFEGY 893
            ++ +S I+ G+I +  DM  +        T  G  P++G P    P  A    E  +E +
Sbjct: 211  INADSSIEEGEIVAETDMPPL--------TAPGCMPDSGYPHTLPPAPAETSYEEDWEVF 262

Query: 894  D-----DQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEIL 948
            D            +++ TR+     K  S  +  LVLDLD TL           VH  + 
Sbjct: 263  DPYFFIKHVPPLTEEQLTRKPALPLKTRSTPEFSLVLDLDETL-----------VHCSL- 310

Query: 949  RKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAK 1008
               E +D       LF+     ++ +LRP    FLER S+ +E+ L+T   K+YA ++  
Sbjct: 311  --NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERMSQKYEIILFTASKKVYADKLLN 368

Query: 1009 VLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNK 1067
            +LDP+  L   R+    +      G+      KDL  +LG + S  +IID+S + + +  
Sbjct: 369  ILDPRKQLVRHRLFR--EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQL 421

Query: 1068 LNLIVVERY 1076
             N I +E +
Sbjct: 422  SNGIPIESW 430


>gi|357451355|ref|XP_003595954.1| RNA polymerase II subunit A C-terminal domain phosphatase [Medicago
            truncatula]
 gi|355485002|gb|AES66205.1| RNA polymerase II subunit A C-terminal domain phosphatase [Medicago
            truncatula]
          Length = 239

 Score = 59.7 bits (143), Expect = 8e-06,   Method: Composition-based stats.
 Identities = 28/59 (47%), Positives = 40/59 (67%)

Query: 969  MGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD 1027
            M    KLRP + TFL+ AS++FEM++YTMG + Y+ EMAK+LDP+   F  +V  +  D
Sbjct: 65   MQRMNKLRPFVRTFLKEASEVFEMYIYTMGIRQYSLEMAKLLDPQVEYFKDKVWQKHKD 123


>gi|417401418|gb|JAA47595.1| Putative ctd carboxy-terminal domain rna polymer [Desmodus rotundus]
          Length = 466

 Score = 59.7 bits (143), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 61/234 (26%), Positives = 104/234 (44%), Gaps = 24/234 (10%)

Query: 846  IKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQK 903
            I +G +      N D         P++G   AH ++ + +   +F+ Y   +      ++
Sbjct: 211  INNGLEEAEETVNRDIPPLTAPVTPDSGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEE 270

Query: 904  ERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHL 963
            +  R+     K  S  +  LVLDLD TL           VH  +    E +D       L
Sbjct: 271  QLNRKPALPLKTRSTPEFSLVLDLDETL-----------VHCSL---NELEDAALTFPVL 316

Query: 964  FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 1023
            F+     ++ +LRP    FLER S+++E+ L+T   K+YA ++  +LDPK  L   R+  
Sbjct: 317  FQDVIYQVYVRLRPFFREFLERMSQMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR 376

Query: 1024 RGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
              +      G+      KDL  +LG + S  +IID+S + + +   N I +E +
Sbjct: 377  --EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 423


>gi|332235387|ref|XP_003266885.1| PREDICTED: CTD small phosphatase-like protein 2 isoform 1 [Nomascus
            leucogenys]
 gi|332235389|ref|XP_003266886.1| PREDICTED: CTD small phosphatase-like protein 2 isoform 2 [Nomascus
            leucogenys]
          Length = 466

 Score = 59.7 bits (143), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 61/234 (26%), Positives = 104/234 (44%), Gaps = 24/234 (10%)

Query: 846  IKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQK 903
            I +G +      N D         P++G   AH ++ + +   +F+ Y   +      ++
Sbjct: 211  INNGLEEAEETVNRDIPPLTAPVTPDSGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEE 270

Query: 904  ERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHL 963
            +  R+     K  S  +  LVLDLD TL           VH  +    E +D       L
Sbjct: 271  QLNRKPALPLKTRSTPEFSLVLDLDETL-----------VHCSL---NELEDAALTFPVL 316

Query: 964  FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 1023
            F+     ++ +LRP    FLER S+++E+ L+T   K+YA ++  +LDPK  L   R+  
Sbjct: 317  FQDVIYQVYVRLRPFFREFLERMSQMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR 376

Query: 1024 RGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
              +      G+      KDL  +LG + S  +IID+S + + +   N I +E +
Sbjct: 377  --EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 423


>gi|397480304|ref|XP_003811426.1| PREDICTED: CTD small phosphatase-like protein 2 isoform 1 [Pan
            paniscus]
 gi|397480306|ref|XP_003811427.1| PREDICTED: CTD small phosphatase-like protein 2 isoform 2 [Pan
            paniscus]
          Length = 466

 Score = 59.7 bits (143), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 61/234 (26%), Positives = 104/234 (44%), Gaps = 24/234 (10%)

Query: 846  IKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQK 903
            I +G +      N D         P++G   AH ++ + +   +F+ Y   +      ++
Sbjct: 211  INNGLEEAEETVNRDIPPLTAPVTPDSGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEE 270

Query: 904  ERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHL 963
            +  R+     K  S  +  LVLDLD TL           VH  +    E +D       L
Sbjct: 271  QLNRKPALPLKTRSTPEFSLVLDLDETL-----------VHCSL---NELEDAALTFPVL 316

Query: 964  FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 1023
            F+     ++ +LRP    FLER S+++E+ L+T   K+YA ++  +LDPK  L   R+  
Sbjct: 317  FQDVIYQVYVRLRPFFREFLERMSQMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR 376

Query: 1024 RGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
              +      G+      KDL  +LG + S  +IID+S + + +   N I +E +
Sbjct: 377  --EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 423


>gi|224116454|ref|XP_002317305.1| predicted protein [Populus trichocarpa]
 gi|222860370|gb|EEE97917.1| predicted protein [Populus trichocarpa]
          Length = 377

 Score = 59.7 bits (143), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 45/149 (30%), Positives = 71/149 (47%), Gaps = 28/149 (18%)

Query: 919  RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPG 978
            + + LVLDLD TL++S   H                D +      F      ++ K RP 
Sbjct: 203  KSITLVLDLDETLVHSTLEHC--------------DDADFTFTVFFNMKEHTVYVKQRPH 248

Query: 979  IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG---DDGDPFDGDE 1035
            + TFLER +++FE+ ++T    +YA ++  +LDP   L + R+        DG       
Sbjct: 249  VHTFLERVAEMFEVVIFTASQSIYAAQLLDMLDPDRKLISRRIYRESCIFSDG------- 301

Query: 1036 RVPKSKDLEGVLGMESA-VVIIDDSVRVW 1063
                +KDL  VLG++ A V IID+S +V+
Sbjct: 302  --SYTKDL-TVLGVDLAKVAIIDNSPQVF 327


>gi|403274413|ref|XP_003928971.1| PREDICTED: CTD small phosphatase-like protein 2 isoform 1 [Saimiri
            boliviensis boliviensis]
 gi|403274415|ref|XP_003928972.1| PREDICTED: CTD small phosphatase-like protein 2 isoform 2 [Saimiri
            boliviensis boliviensis]
          Length = 466

 Score = 59.7 bits (143), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 61/236 (25%), Positives = 104/236 (44%), Gaps = 24/236 (10%)

Query: 844  GQIKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AI 901
              I +G +      N D         P++G   AH ++ + +   +F+ Y   +      
Sbjct: 209  ASINNGLEEAEETVNRDIPPLTAPVTPDSGYSSAHAEATYEEDWEVFDPYYFIKHVPPLT 268

Query: 902  QKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR 961
            +++  R+     K  S  +  LVLDLD TL           VH  +    E +D      
Sbjct: 269  EEQLNRKPALPLKTRSTPEFSLVLDLDETL-----------VHCSL---NELEDAALTFP 314

Query: 962  HLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
             LF+     ++ +LRP    FLER S+++E+ L+T   K+YA ++  +LDPK  L   R+
Sbjct: 315  VLFQDVIYQVYVRLRPFFREFLERMSQMYEIILFTASKKVYADKLLNILDPKKQLVRHRL 374

Query: 1022 ISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
                +      G+      KDL  +LG + S  +IID+S + + +   N I +E +
Sbjct: 375  FR--EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 423


>gi|388453109|ref|NP_001253738.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A) small
            phosphatase like 2 [Macaca mulatta]
 gi|114656732|ref|XP_001161756.1| PREDICTED: CTD (carboxy-terminal domain, RNA polymerase II,
            polypeptide A) small phosphatase like 2 isoform 3 [Pan
            troglodytes]
 gi|114656734|ref|XP_001161793.1| PREDICTED: CTD (carboxy-terminal domain, RNA polymerase II,
            polypeptide A) small phosphatase like 2 isoform 4 [Pan
            troglodytes]
 gi|297696523|ref|XP_002825440.1| PREDICTED: CTD (carboxy-terminal domain, RNA polymerase II,
            polypeptide A) small phosphatase like 2 isoform 1 [Pongo
            abelii]
 gi|395746659|ref|XP_003778487.1| PREDICTED: CTD (carboxy-terminal domain, RNA polymerase II,
            polypeptide A) small phosphatase like 2 isoform 2 [Pongo
            abelii]
 gi|380813572|gb|AFE78660.1| CTD small phosphatase-like protein 2 [Macaca mulatta]
 gi|383419005|gb|AFH32716.1| CTD small phosphatase-like protein 2 [Macaca mulatta]
 gi|384947558|gb|AFI37384.1| CTD small phosphatase-like protein 2 [Macaca mulatta]
 gi|410206686|gb|JAA00562.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A) small
            phosphatase like 2 [Pan troglodytes]
 gi|410253512|gb|JAA14723.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A) small
            phosphatase like 2 [Pan troglodytes]
 gi|410302524|gb|JAA29862.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A) small
            phosphatase like 2 [Pan troglodytes]
 gi|410341327|gb|JAA39610.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A) small
            phosphatase like 2 [Pan troglodytes]
          Length = 466

 Score = 59.7 bits (143), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 61/234 (26%), Positives = 104/234 (44%), Gaps = 24/234 (10%)

Query: 846  IKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQK 903
            I +G +      N D         P++G   AH ++ + +   +F+ Y   +      ++
Sbjct: 211  INNGLEEAEETVNRDIPPLTAPVTPDSGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEE 270

Query: 904  ERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHL 963
            +  R+     K  S  +  LVLDLD TL           VH  +    E +D       L
Sbjct: 271  QLNRKPALPLKTRSTPEFSLVLDLDETL-----------VHCSL---NELEDAALTFPVL 316

Query: 964  FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 1023
            F+     ++ +LRP    FLER S+++E+ L+T   K+YA ++  +LDPK  L   R+  
Sbjct: 317  FQDVIYQVYVRLRPFFREFLERMSQMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR 376

Query: 1024 RGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
              +      G+      KDL  +LG + S  +IID+S + + +   N I +E +
Sbjct: 377  --EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 423


>gi|57108473|ref|XP_544655.1| PREDICTED: CTD (carboxy-terminal domain, RNA polymerase II,
            polypeptide A) small phosphatase like 2 isoform 1 [Canis
            lupus familiaris]
 gi|73999941|ref|XP_860654.1| PREDICTED: CTD (carboxy-terminal domain, RNA polymerase II,
            polypeptide A) small phosphatase like 2 isoform 4 [Canis
            lupus familiaris]
          Length = 466

 Score = 59.7 bits (143), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 61/234 (26%), Positives = 104/234 (44%), Gaps = 24/234 (10%)

Query: 846  IKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQK 903
            I +G +      N D         P++G   AH ++ + +   +F+ Y   +      ++
Sbjct: 211  INNGLEEAEETVNRDIPPLTAPVTPDSGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEE 270

Query: 904  ERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHL 963
            +  R+     K  S  +  LVLDLD TL           VH  +    E +D       L
Sbjct: 271  QLNRKPALPLKTRSTPEFSLVLDLDETL-----------VHCSL---NELEDAALTFPVL 316

Query: 964  FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 1023
            F+     ++ +LRP    FLER S+++E+ L+T   K+YA ++  +LDPK  L   R+  
Sbjct: 317  FQDVIYQVYVRLRPFFREFLERMSQMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR 376

Query: 1024 RGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
              +      G+      KDL  +LG + S  +IID+S + + +   N I +E +
Sbjct: 377  --EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 423


>gi|402874166|ref|XP_003900915.1| PREDICTED: CTD small phosphatase-like protein 2 isoform 1 [Papio
            anubis]
 gi|402874168|ref|XP_003900916.1| PREDICTED: CTD small phosphatase-like protein 2 isoform 2 [Papio
            anubis]
          Length = 466

 Score = 59.7 bits (143), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 61/234 (26%), Positives = 104/234 (44%), Gaps = 24/234 (10%)

Query: 846  IKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQK 903
            I +G +      N D         P++G   AH ++ + +   +F+ Y   +      ++
Sbjct: 211  INNGLEEAEETVNRDIPPLTAPVTPDSGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEE 270

Query: 904  ERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHL 963
            +  R+     K  S  +  LVLDLD TL           VH  +    E +D       L
Sbjct: 271  QLNRKPALPLKTRSTPEFSLVLDLDETL-----------VHCSL---NELEDAALTFPVL 316

Query: 964  FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 1023
            F+     ++ +LRP    FLER S+++E+ L+T   K+YA ++  +LDPK  L   R+  
Sbjct: 317  FQDVIYQVYVRLRPFFREFLERMSQMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR 376

Query: 1024 RGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
              +      G+      KDL  +LG + S  +IID+S + + +   N I +E +
Sbjct: 377  --EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 423


>gi|100815975|ref|NP_057480.2| CTD small phosphatase-like protein 2 [Homo sapiens]
 gi|187471086|sp|Q05D32.2|CTSL2_HUMAN RecName: Full=CTD small phosphatase-like protein 2; Short=CTDSP-like
            2
 gi|23273027|gb|AAH35744.1| CTDSPL2 protein [Homo sapiens]
 gi|71835542|gb|AAZ42188.1| unknown [Homo sapiens]
 gi|119597671|gb|EAW77265.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A) small
            phosphatase like 2, isoform CRA_a [Homo sapiens]
 gi|119597672|gb|EAW77266.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A) small
            phosphatase like 2, isoform CRA_a [Homo sapiens]
 gi|123994825|gb|ABM85014.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A) small
            phosphatase like 2 [synthetic construct]
 gi|157928777|gb|ABW03674.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A) small
            phosphatase like 2 [synthetic construct]
 gi|158255896|dbj|BAF83919.1| unnamed protein product [Homo sapiens]
 gi|168278020|dbj|BAG10988.1| CTD small phosphatase like 2 [synthetic construct]
          Length = 466

 Score = 59.3 bits (142), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 57/210 (27%), Positives = 98/210 (46%), Gaps = 24/210 (11%)

Query: 870  PEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQKERTRRLEEQKKMFSARKLCLVLDL 927
            P++G   AH ++ + +   +F+ Y   +      +++  R+     K  S  +  LVLDL
Sbjct: 235  PDSGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEEQLNRKPALPLKTRSTPEFSLVLDL 294

Query: 928  DHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERAS 987
            D TL           VH  +    E +D       LF+     ++ +LRP    FLER S
Sbjct: 295  DETL-----------VHCSL---NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERMS 340

Query: 988  KLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVL 1047
            +++E+ L+T   K+YA ++  +LDPK  L   R+    +      G+      KDL  +L
Sbjct: 341  QMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR--EHCVCVQGN----YIKDL-NIL 393

Query: 1048 GME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
            G + S  +IID+S + + +   N I +E +
Sbjct: 394  GRDLSKTIIIDNSPQAFAYQLSNGIPIESW 423


>gi|296213856|ref|XP_002753450.1| PREDICTED: CTD small phosphatase-like protein 2 isoform 3 [Callithrix
            jacchus]
          Length = 466

 Score = 59.3 bits (142), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 61/236 (25%), Positives = 104/236 (44%), Gaps = 24/236 (10%)

Query: 844  GQIKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AI 901
              I +G +      N D         P++G   AH ++ + +   +F+ Y   +      
Sbjct: 209  ASINNGLEEAEETVNRDIPPLTAPVTPDSGYSSAHAEATYEEDWEVFDPYYFIKHVPPLT 268

Query: 902  QKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR 961
            +++  R+     K  S  +  LVLDLD TL           VH  +    E +D      
Sbjct: 269  EEQLNRKPALPLKTRSTPEFSLVLDLDETL-----------VHCSL---NELEDAALTFP 314

Query: 962  HLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
             LF+     ++ +LRP    FLER S+++E+ L+T   K+YA ++  +LDPK  L   R+
Sbjct: 315  VLFQDVIYQVYVRLRPFFREFLERMSQMYEIILFTASKKVYADKLLNILDPKKQLVRHRL 374

Query: 1022 ISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
                +      G+      KDL  +LG + S  +IID+S + + +   N I +E +
Sbjct: 375  FR--EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 423


>gi|209156270|gb|ACI34367.1| Carboxy-terminal domain RNA polymerase II polypeptide A small
            phosphatase 2 [Salmo salar]
          Length = 367

 Score = 59.3 bits (142), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 48/156 (30%), Positives = 80/156 (51%), Gaps = 22/156 (14%)

Query: 923  LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTF 982
            LVLDLD TL+ S+    ++ +          +D E   R  F+     ++  LRP +  F
Sbjct: 191  LVLDLDETLMYSS----LNVI----------EDAEYTFRTCFQDNPYKVYVILRPYVKEF 236

Query: 983  LERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKD 1042
            LE  +K FEM +YT   K YA ++  +LDPK  LF  R+  +  D     G       KD
Sbjct: 237  LEAMTKHFEMFVYTSAKKEYAEKILDILDPKRRLFRHRLYQQ--DCACVLGH----YVKD 290

Query: 1043 LEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERYT 1077
            L GVL  + A  V++D++   +P++ +N++ ++ ++
Sbjct: 291  L-GVLERDLAKTVVLDNAPHTYPYHLMNVLPIKSWS 325


>gi|356566193|ref|XP_003551319.1| PREDICTED: CTD small phosphatase-like protein 2-like [Glycine max]
          Length = 403

 Score = 59.3 bits (142), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 56/174 (32%), Positives = 80/174 (45%), Gaps = 28/174 (16%)

Query: 907  RRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRF 966
            RRL   K+  S     LVLDLD TL++S             L   E+ D   P    F  
Sbjct: 198  RRLLLPKQTRSCPSTTLVLDLDETLVHST------------LEHCEDVDFTFPVN--FNS 243

Query: 967  PHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD 1026
                ++ + RP +  FLER S LFE+ ++T    +YA ++  VLDPK  +F  RV     
Sbjct: 244  EEHIVYVRCRPHLKDFLERVSGLFEIIIFTASQSIYAEQLLNVLDPKRKIFRHRVYRESC 303

Query: 1027 ---DGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERY 1076
               +G+           KDL  VLG + A V+IID+S + +     N I +E +
Sbjct: 304  VYVEGNYL---------KDLT-VLGRDLAHVIIIDNSPQAFGFQVDNGIPIESW 347


>gi|34596232|gb|AAQ76796.1| hypothetical protein [Homo sapiens]
          Length = 466

 Score = 59.3 bits (142), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 57/210 (27%), Positives = 98/210 (46%), Gaps = 24/210 (11%)

Query: 870  PEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQKERTRRLEEQKKMFSARKLCLVLDL 927
            P++G   AH ++ + +   +F+ Y   +      +++  R+     K  S  +  LVLDL
Sbjct: 235  PDSGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEEQLNRKPALPLKTRSTPEFSLVLDL 294

Query: 928  DHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERAS 987
            D TL           VH  +    E +D       LF+     ++ +LRP    FLER S
Sbjct: 295  DETL-----------VHCSL---NELEDAALTFPVLFQDVVYQVYVRLRPFFREFLERMS 340

Query: 988  KLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVL 1047
            +++E+ L+T   K+YA ++  +LDPK  L   R+    +      G+      KDL  +L
Sbjct: 341  QMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR--EHCVCVQGN----YIKDL-NIL 393

Query: 1048 GME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
            G + S  +IID+S + + +   N I +E +
Sbjct: 394  GRDLSKTIIIDNSPQAFAYQLSNGIPIESW 423


>gi|187471087|sp|Q5F3Z7.2|CTSL2_CHICK RecName: Full=CTD small phosphatase-like protein 2; Short=CTDSP-like
            2
          Length = 466

 Score = 59.3 bits (142), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 57/211 (27%), Positives = 98/211 (46%), Gaps = 24/211 (11%)

Query: 869  GPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQKERTRRLEEQKKMFSARKLCLVLD 926
             P++G   AH ++ + +   +F+ Y   +      +++  R+     K  S  +  LVLD
Sbjct: 234  SPDSGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEEQLNRKPALPLKTRSTPEFSLVLD 293

Query: 927  LDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERA 986
            LD TL           VH  +    E +D       LF+     ++ +LRP    FLER 
Sbjct: 294  LDETL-----------VHCSL---NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERM 339

Query: 987  SKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGV 1046
            S+++E+ L+T   K+YA ++  +LDPK  L   R+    +      G+      KDL  +
Sbjct: 340  SQIYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR--EHCVCVQGN----YIKDL-NI 392

Query: 1047 LGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
            LG + S  +IID+S + + +   N I +E +
Sbjct: 393  LGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 423


>gi|61098234|ref|NP_001012790.1| CTD small phosphatase-like protein 2 [Gallus gallus]
 gi|60098613|emb|CAH65137.1| hypothetical protein RCJMB04_4a24 [Gallus gallus]
          Length = 468

 Score = 59.3 bits (142), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 57/211 (27%), Positives = 98/211 (46%), Gaps = 24/211 (11%)

Query: 869  GPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQKERTRRLEEQKKMFSARKLCLVLD 926
             P++G   AH ++ + +   +F+ Y   +      +++  R+     K  S  +  LVLD
Sbjct: 236  SPDSGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEEQLNRKPALPLKTRSTPEFSLVLD 295

Query: 927  LDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERA 986
            LD TL           VH  +    E +D       LF+     ++ +LRP    FLER 
Sbjct: 296  LDETL-----------VHCSL---NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERM 341

Query: 987  SKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGV 1046
            S+++E+ L+T   K+YA ++  +LDPK  L   R+    +      G+      KDL  +
Sbjct: 342  SQIYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR--EHCVCVQGN----YIKDL-NI 394

Query: 1047 LGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
            LG + S  +IID+S + + +   N I +E +
Sbjct: 395  LGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 425


>gi|344297040|ref|XP_003420208.1| PREDICTED: CTD small phosphatase-like protein 2 [Loxodonta africana]
          Length = 466

 Score = 59.3 bits (142), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 61/234 (26%), Positives = 104/234 (44%), Gaps = 24/234 (10%)

Query: 846  IKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQK 903
            I +G +      N D         P++G   AH ++ + +   +F+ Y   +      ++
Sbjct: 211  INNGLEEAEGTVNRDIPPLTAPVTPDSGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEE 270

Query: 904  ERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHL 963
            +  R+     K  S  +  LVLDLD TL           VH  +    E +D       L
Sbjct: 271  QLNRKPALPLKTRSTPEFSLVLDLDETL-----------VHCSL---NELEDAALTFPVL 316

Query: 964  FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 1023
            F+     ++ +LRP    FLER S+++E+ L+T   K+YA ++  +LDPK  L   R+  
Sbjct: 317  FQDVIYQVYVRLRPFFREFLERMSQMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR 376

Query: 1024 RGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
              +      G+      KDL  +LG + S  +IID+S + + +   N I +E +
Sbjct: 377  --EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 423


>gi|6841480|gb|AAF29093.1|AF161478_1 HSPC129 [Homo sapiens]
          Length = 466

 Score = 59.3 bits (142), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 57/210 (27%), Positives = 98/210 (46%), Gaps = 24/210 (11%)

Query: 870  PEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQKERTRRLEEQKKMFSARKLCLVLDL 927
            P++G   AH ++ + +   +F+ Y   +      +++  R+     K  S  +  LVLDL
Sbjct: 235  PDSGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEEQLNRKPALPLKTRSTPEFSLVLDL 294

Query: 928  DHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERAS 987
            D TL           VH  +    E +D       LF+     ++ +LRP    FLER S
Sbjct: 295  DETL-----------VHCSL---NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERMS 340

Query: 988  KLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVL 1047
            +++E+ L+T   K+YA ++  +LDPK  L   R+    +      G+      KDL  +L
Sbjct: 341  QMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR--EHCVCVQGN----YIKDL-NIL 393

Query: 1048 GME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
            G + S  +IID+S + + +   N I +E +
Sbjct: 394  GRDLSKTIIIDNSPQAFAYQLSNGIPIESW 423


>gi|6841354|gb|AAF29030.1|AF161543_1 HSPC058 [Homo sapiens]
          Length = 352

 Score = 59.3 bits (142), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 58/234 (24%), Positives = 104/234 (44%), Gaps = 24/234 (10%)

Query: 846  IKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQK 903
            + +G +      N D         P++G   AH ++ + +   +F+ Y   +      ++
Sbjct: 97   LNNGLEEAEETVNRDIPTLTAPVTPDSGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEE 156

Query: 904  ERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHL 963
            +  R+     K  S  +  LVLDLD TL++ +                E +D       L
Sbjct: 157  QLNRKPALPLKTRSTPEFSLVLDLDETLVHCSL--------------NELEDAALTFPVL 202

Query: 964  FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 1023
            F+     ++ +LRP    FLER S+++E+ L+T   K+YA ++  +LDPK  L   R+  
Sbjct: 203  FQDVIYQVYVRLRPFFREFLERMSQMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR 262

Query: 1024 RGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
              +      G+      KDL  +LG + S  +IID+S + + +   N I +E +
Sbjct: 263  --EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 309


>gi|326926934|ref|XP_003209651.1| PREDICTED: CTD small phosphatase-like protein 2-like [Meleagris
            gallopavo]
          Length = 468

 Score = 59.3 bits (142), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 57/211 (27%), Positives = 98/211 (46%), Gaps = 24/211 (11%)

Query: 869  GPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQKERTRRLEEQKKMFSARKLCLVLD 926
             P++G   AH ++ + +   +F+ Y   +      +++  R+     K  S  +  LVLD
Sbjct: 236  SPDSGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEEQLNRKPALPLKTRSTPEFSLVLD 295

Query: 927  LDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERA 986
            LD TL           VH  +    E +D       LF+     ++ +LRP    FLER 
Sbjct: 296  LDETL-----------VHCSL---NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERM 341

Query: 987  SKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGV 1046
            S+++E+ L+T   K+YA ++  +LDPK  L   R+    +      G+      KDL  +
Sbjct: 342  SQIYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR--EHCVCVQGN----YIKDL-NI 394

Query: 1047 LGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
            LG + S  +IID+S + + +   N I +E +
Sbjct: 395  LGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 425


>gi|410908573|ref|XP_003967765.1| PREDICTED: CTD small phosphatase-like protein 2-A-like [Takifugu
            rubripes]
          Length = 474

 Score = 58.9 bits (141), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 64/249 (25%), Positives = 108/249 (43%), Gaps = 36/249 (14%)

Query: 835  VSQNSPIQPGQIKSGADMKAVVTNHDDKQTGTGSGPEAG-PVGAHPQSAWGDVEHLFEGY 893
            +S +S ++ G+I +  DM  +        T  G  P+ G P    P  A    E  +E +
Sbjct: 212  ISSDSTVEEGEIVTETDMPPL--------TAPGCMPDGGYPHMLPPAPAETSYEEDWEVF 263

Query: 894  D-----DQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEIL 948
            D            +++ TR+     K  S  +  LVLDLD TL++ +             
Sbjct: 264  DPYFFIKHVPPLTEEQLTRKPALPLKTRSTPEFSLVLDLDETLVHCSL------------ 311

Query: 949  RKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAK 1008
               E +D       LF+     ++ +LRP    FLER  + +E+ L+T   K+YA ++  
Sbjct: 312  --NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERMCQKYEIILFTASKKVYADKLLN 369

Query: 1009 VLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNK 1067
            +LDP+  L   R+    +      G+      KDL  +LG + S  +IID+S + + +  
Sbjct: 370  ILDPRKQLVRHRLFR--EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQL 422

Query: 1068 LNLIVVERY 1076
             N I +E +
Sbjct: 423  SNGIPIESW 431


>gi|449270631|gb|EMC81290.1| CTD small phosphatase-like protein 2 [Columba livia]
          Length = 468

 Score = 58.9 bits (141), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 57/211 (27%), Positives = 98/211 (46%), Gaps = 24/211 (11%)

Query: 869  GPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQKERTRRLEEQKKMFSARKLCLVLD 926
             P++G   AH ++ + +   +F+ Y   +      +++  R+     K  S  +  LVLD
Sbjct: 236  SPDSGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEEQLNRKPALPLKTRSTPEFSLVLD 295

Query: 927  LDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERA 986
            LD TL           VH  +    E +D       LF+     ++ +LRP    FLER 
Sbjct: 296  LDETL-----------VHCSL---NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERM 341

Query: 987  SKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGV 1046
            S+++E+ L+T   K+YA ++  +LDPK  L   R+    +      G+      KDL  +
Sbjct: 342  SQIYEIILFTASKKVYADKLLNILDPKKKLVRHRLFR--EHCVCVQGN----YIKDL-NI 394

Query: 1047 LGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
            LG + S  +IID+S + + +   N I +E +
Sbjct: 395  LGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 425


>gi|413917759|gb|AFW57691.1| hypothetical protein ZEAMMB73_437679 [Zea mays]
          Length = 451

 Score = 58.9 bits (141), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 37/114 (32%), Positives = 53/114 (46%), Gaps = 24/114 (21%)

Query: 913  KKMFSARKLCLVLDLDHTLLNSAKFHEVD-----PVHDEILRKKEEQDREKPHRHLFRFP 967
            K+  S   + LVLDLD TL++S   H  D     PVH                   F F 
Sbjct: 255  KQTRSCPTMTLVLDLDETLVHSTLEHCEDADFTFPVH-------------------FNFR 295

Query: 968  HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
               ++ + RP +  FL+R + +FE  ++T    +YA ++  VLDPK  LF  RV
Sbjct: 296  EHTIYVRCRPYLKEFLDRVASVFETIIFTASQSIYAEQLLNVLDPKRKLFRHRV 349


>gi|413917758|gb|AFW57690.1| hypothetical protein ZEAMMB73_437679 [Zea mays]
          Length = 449

 Score = 58.9 bits (141), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 37/114 (32%), Positives = 53/114 (46%), Gaps = 24/114 (21%)

Query: 913  KKMFSARKLCLVLDLDHTLLNSAKFHEVD-----PVHDEILRKKEEQDREKPHRHLFRFP 967
            K+  S   + LVLDLD TL++S   H  D     PVH                   F F 
Sbjct: 255  KQTRSCPTMTLVLDLDETLVHSTLEHCEDADFTFPVH-------------------FNFR 295

Query: 968  HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
               ++ + RP +  FL+R + +FE  ++T    +YA ++  VLDPK  LF  RV
Sbjct: 296  EHTIYVRCRPYLKEFLDRVASVFETIIFTASQSIYAEQLLNVLDPKRKLFRHRV 349


>gi|291403116|ref|XP_002717973.1| PREDICTED: CTD (carboxy-terminal domain, RNA polymerase II,
            polypeptide A) small phosphatase like 2 [Oryctolagus
            cuniculus]
          Length = 286

 Score = 58.9 bits (141), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 56/210 (26%), Positives = 98/210 (46%), Gaps = 24/210 (11%)

Query: 870  PEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQKERTRRLEEQKKMFSARKLCLVLDL 927
            PE+G   AH ++ + +   +F+ Y   +      +++  R+     K  S  +  LVLDL
Sbjct: 55   PESGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEEQLNRKPALPLKTRSTPEFSLVLDL 114

Query: 928  DHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERAS 987
            D TL++ +                E +D       LF+     ++ +LRP    FLER S
Sbjct: 115  DETLVHCSL--------------NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERMS 160

Query: 988  KLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVL 1047
            +++E+ L+T   K+YA ++  +LDPK  L   R+    +      G+      KDL  +L
Sbjct: 161  QMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR--EHCVCVQGN----YIKDL-NIL 213

Query: 1048 GME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
            G + S  +IID+S + + +   N I +E +
Sbjct: 214  GRDLSKTIIIDNSPQAFAYQLSNGIPIESW 243


>gi|426201370|gb|EKV51293.1| hypothetical protein AGABI2DRAFT_114027 [Agaricus bisporus var.
            bisporus H97]
          Length = 814

 Score = 58.9 bits (141), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 24/60 (40%), Positives = 37/60 (61%)

Query: 972  WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPF 1031
            + K RPG   FL   +  ++MH+YTMG + YA E+   +DP G +F  R++SR + G+ F
Sbjct: 269  YIKPRPGWKEFLMDMATKYDMHVYTMGTRAYAEEVCAAIDPDGSVFKSRILSRDESGNDF 328


>gi|224062995|ref|XP_002187586.1| PREDICTED: CTD small phosphatase-like protein 2 [Taeniopygia guttata]
          Length = 467

 Score = 58.9 bits (141), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 57/210 (27%), Positives = 98/210 (46%), Gaps = 24/210 (11%)

Query: 870  PEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQKERTRRLEEQKKMFSARKLCLVLDL 927
            P++G   AH ++ + +   +F+ Y   +      +++  R+     K  S  +  LVLDL
Sbjct: 236  PDSGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEEQLNRKPALPLKTRSTPEFSLVLDL 295

Query: 928  DHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERAS 987
            D TL           VH  +    E +D       LF+     ++ +LRP    FLER S
Sbjct: 296  DETL-----------VHCSL---NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERMS 341

Query: 988  KLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVL 1047
            +++E+ L+T   K+YA ++  +LDPK  L   R+    +      G+      KDL  +L
Sbjct: 342  QIYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR--EHCVCVQGN----YIKDL-NIL 394

Query: 1048 GME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
            G + S  +IID+S + + +   N I +E +
Sbjct: 395  GRDLSKTIIIDNSPQAFAYQLSNGIPIESW 424


>gi|357450577|ref|XP_003595565.1| CTD small phosphatase-like protein [Medicago truncatula]
 gi|355484613|gb|AES65816.1| CTD small phosphatase-like protein [Medicago truncatula]
          Length = 460

 Score = 58.9 bits (141), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 44/149 (29%), Positives = 70/149 (46%), Gaps = 28/149 (18%)

Query: 919  RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPG 978
            + + LVLDLD TL++S   H                D +      F      ++ K RP 
Sbjct: 286  KSVTLVLDLDETLVHSTLEHC--------------DDADFTFNIFFNMKDYIVYVKQRPF 331

Query: 979  IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG---DDGDPFDGDE 1035
            +  FLER S +FE+ ++T    +YA ++  +LDP     + R+        DG+      
Sbjct: 332  LHKFLERVSDMFEVVIFTASQSIYANQLLDILDPDEKFISRRLYRESCMFSDGN------ 385

Query: 1036 RVPKSKDLEGVLGMESA-VVIIDDSVRVW 1063
                +KDL  +LG++ A VVIID+S +V+
Sbjct: 386  ---YTKDL-TILGIDLAKVVIIDNSPQVF 410


>gi|357450579|ref|XP_003595566.1| CTD small phosphatase-like protein [Medicago truncatula]
 gi|355484614|gb|AES65817.1| CTD small phosphatase-like protein [Medicago truncatula]
          Length = 469

 Score = 58.5 bits (140), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 44/145 (30%), Positives = 68/145 (46%), Gaps = 28/145 (19%)

Query: 923  LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTF 982
            LVLDLD TL++S   H                D +      F      ++ K RP +  F
Sbjct: 299  LVLDLDETLVHSTLEHC--------------DDADFTFNIFFNMKDYIVYVKQRPFLHKF 344

Query: 983  LERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG---DDGDPFDGDERVPK 1039
            LER S +FE+ ++T    +YA ++  +LDP     + R+        DG+          
Sbjct: 345  LERVSDMFEVVIFTASQSIYANQLLDILDPDEKFISRRLYRESCMFSDGN---------Y 395

Query: 1040 SKDLEGVLGMESA-VVIIDDSVRVW 1063
            +KDL  +LG++ A VVIID+S +V+
Sbjct: 396  TKDL-TILGIDLAKVVIIDNSPQVF 419


>gi|428672173|gb|EKX73087.1| conserved hypothetical protein [Babesia equi]
          Length = 937

 Score = 58.5 bits (140), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 39/119 (32%), Positives = 57/119 (47%), Gaps = 14/119 (11%)

Query: 966  FPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG 1025
            FP++  + KLRP I  FL+  S  +EM +YT   K YA  +  +LDP   LF  R+++R 
Sbjct: 504  FPNITYYMKLRPCIREFLQVLSLYYEMSIYTNATKEYADVVISILDPDRTLFMDRIVARN 563

Query: 1026 --DDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVW---PHNKLNLIVVERYTYF 1079
              D+ D      R+    DL         V+  DD   VW   PH +  ++  E Y +F
Sbjct: 564  SVDEKDLLKSAARL--YPDLN-----RRFVLAFDDRKDVWADIPHRQ--VVRAEHYDFF 613


>gi|403222586|dbj|BAM40718.1| CTD-like phosphatase [Theileria orientalis strain Shintoku]
          Length = 763

 Score = 58.5 bits (140), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 38/119 (31%), Positives = 56/119 (47%), Gaps = 14/119 (11%)

Query: 966  FPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG 1025
            FP++  + KLRP I  FL+  S  +EM +YT   K YA  +  +LDP   LF  R+++R 
Sbjct: 336  FPNITYYMKLRPCIREFLQILSLYYEMSIYTNATKEYADVVISILDPDRSLFMDRIVARN 395

Query: 1026 --DDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVW---PHNKLNLIVVERYTYF 1079
              D+ D      R+    D   +L         DD   VW   PH +  ++  E Y +F
Sbjct: 396  SVDEKDLLKSASRLYPDLDPRFILA-------FDDRRDVWSDIPHKQ--VVRAEHYDFF 445


>gi|350578733|ref|XP_003480441.1| PREDICTED: CTD small phosphatase-like protein 2-like [Sus scrofa]
          Length = 355

 Score = 58.5 bits (140), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 55/210 (26%), Positives = 98/210 (46%), Gaps = 24/210 (11%)

Query: 870  PEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQKERTRRLEEQKKMFSARKLCLVLDL 927
            P++G   AH ++ + +   +F+ Y   +      +++  R+     K  S  +  LVLDL
Sbjct: 124  PDSGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEEQLNRKPALPLKTRSTPEFSLVLDL 183

Query: 928  DHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERAS 987
            D TL++ +                E +D       LF+     ++ +LRP    FLER S
Sbjct: 184  DETLVHCSL--------------NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERMS 229

Query: 988  KLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVL 1047
            +++E+ L+T   K+YA ++  +LDPK  L   R+    +      G+      KDL  +L
Sbjct: 230  QMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR--EHCVCVQGN----YIKDL-NIL 282

Query: 1048 GME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
            G + S  +IID+S + + +   N I +E +
Sbjct: 283  GRDLSKTIIIDNSPQAFAYQLSNGIPIESW 312


>gi|281338163|gb|EFB13747.1| hypothetical protein PANDA_001000 [Ailuropoda melanoleuca]
          Length = 445

 Score = 58.5 bits (140), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 56/210 (26%), Positives = 97/210 (46%), Gaps = 24/210 (11%)

Query: 870  PEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQKERTRRLEEQKKMFSARKLCLVLDL 927
            P++G    H ++ + +   +F+ Y   +      +++  R+     K  S  +  LVLDL
Sbjct: 235  PDSGYSSTHAEATYEEDWEVFDPYYFIKHVPPLTEEQLNRKPALPLKTRSTPEFSLVLDL 294

Query: 928  DHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERAS 987
            D TL           VH  +    E +D       LF+     ++ +LRP    FLER S
Sbjct: 295  DETL-----------VHCSL---NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERMS 340

Query: 988  KLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVL 1047
            +++E+ L+T   K+YA ++  +LDPK  L   R+    +      G+      KDL  +L
Sbjct: 341  QMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR--EHCVCVQGN----YIKDL-NIL 393

Query: 1048 GME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
            G + S  +IID+S + + +   N I +E +
Sbjct: 394  GRDLSKTIIIDNSPQAFAYQLSNGIPIESW 423


>gi|209882178|ref|XP_002142526.1| NLI interacting factor-like phosphatase family protein
            [Cryptosporidium muris RN66]
 gi|209558132|gb|EEA08177.1| NLI interacting factor-like phosphatase family protein
            [Cryptosporidium muris RN66]
          Length = 710

 Score = 58.5 bits (140), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 42/155 (27%), Positives = 74/155 (47%), Gaps = 20/155 (12%)

Query: 938  HEVDPVHDEILRKKEEQDREKPHRHLFRFPH-------MGMWT----KLRPGIWTFLERA 986
            H +D   D I   + E+ R+   + +F  P+       +  W+    KLRPG+   L R 
Sbjct: 251  HYIDE-EDNIFGLEAEKYRQLIEKLIFCIPYPNSSNNGIDNWSQGFYKLRPGVLNMLRRL 309

Query: 987  SKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGV 1046
               FE+++YTMG +L+A    +++DP+   F  + +   ++G       +   SK L  +
Sbjct: 310  KDKFELYMYTMGTELHAYSALRIIDPEFRFFHPKRLFYRNNG------FKDCNSKSLSTL 363

Query: 1047 LGME-SAVVIIDDSVRVWPHNKLNLIVVERYTYFP 1080
               +   +++IDD  + W  N  +LI V  Y +FP
Sbjct: 364  FPYDHRTLIVIDDIEQAWS-NSNSLIKVYPYNFFP 397


>gi|50949928|emb|CAH10508.1| hypothetical protein [Homo sapiens]
          Length = 394

 Score = 58.5 bits (140), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 57/210 (27%), Positives = 98/210 (46%), Gaps = 24/210 (11%)

Query: 870  PEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQKERTRRLEEQKKMFSARKLCLVLDL 927
            P++G   AH ++ + +   +F+ Y   +      +++  R+     K  S  +  LVLDL
Sbjct: 163  PDSGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEEQLNRKPALPLKTRSTPEFSLVLDL 222

Query: 928  DHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERAS 987
            D TL           VH  +    E +D       LF+     ++ +LRP    FLER S
Sbjct: 223  DETL-----------VHCSL---NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERMS 268

Query: 988  KLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVL 1047
            +++E+ L+T   K+YA ++  +LDPK  L   R+    +      G+      KDL  +L
Sbjct: 269  QMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR--EHCVCVQGN----YIKDL-NIL 321

Query: 1048 GME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
            G + S  +IID+S + + +   N I +E +
Sbjct: 322  GRDLSKTIIIDNSPQAFAYQLSNGIPIESW 351


>gi|334186662|ref|NP_001190760.1| SCP1-like small phosphatase 4b [Arabidopsis thaliana]
 gi|332658603|gb|AEE84003.1| SCP1-like small phosphatase 4b [Arabidopsis thaliana]
          Length = 442

 Score = 58.5 bits (140), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 45/146 (30%), Positives = 73/146 (50%), Gaps = 22/146 (15%)

Query: 919  RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPG 978
            + + LVLDLD TL++S           E+ R     D +   R  F      ++ K RP 
Sbjct: 267  KAVTLVLDLDETLVHSTL---------EVCR-----DTDFSFRVTFNMQENTVYVKQRPY 312

Query: 979  IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVP 1038
            ++ FLER  +LF + ++T  + +YA+++  +LDP G   + R     D     DG     
Sbjct: 313  LYRFLERVVELFHVVIFTASHSIYASQLLDILDPDGKFVSQRFYR--DSCILSDG----I 366

Query: 1039 KSKDLEGVLGMESA-VVIIDDSVRVW 1063
             +KDL  VLG++ A V I+D+  +V+
Sbjct: 367  YTKDL-TVLGLDLAKVAIVDNCPQVY 391


>gi|403332687|gb|EJY65381.1| hypothetical protein OXYTRI_14465 [Oxytricha trifallax]
          Length = 927

 Score = 58.5 bits (140), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 45/168 (26%), Positives = 84/168 (50%), Gaps = 21/168 (12%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
            ++Q+K+++     L+LD+D TL+   +     P + +I++            H      +
Sbjct: 468  KQQQKLYT-----LILDMDETLIYCRQ--NPYPGYQDIIQATSS-------AHNTYSCQV 513

Query: 970  GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
             ++T  RP +  FLE+ S++FE+ ++T   K YA  +   +DP+   F+ R+    D   
Sbjct: 514  QIFTSYRPNLRKFLEQVSQIFEVVIFTASEKSYADLILDKIDPRNEFFSKRLYR--DSCL 571

Query: 1030 PFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
            P  G + V   KDL  +LG + S  +I+D+S+  + +N  N I +  Y
Sbjct: 572  PTPGGQYV---KDL-TILGRDLSRTIIVDNSIMAFAYNISNGIPIPSY 615


>gi|238480828|ref|NP_001031661.2| SCP1-like small phosphatase 4b [Arabidopsis thaliana]
 gi|240255993|ref|NP_193548.7| SCP1-like small phosphatase 4b [Arabidopsis thaliana]
 gi|332658601|gb|AEE84001.1| SCP1-like small phosphatase 4b [Arabidopsis thaliana]
 gi|332658602|gb|AEE84002.1| SCP1-like small phosphatase 4b [Arabidopsis thaliana]
          Length = 446

 Score = 58.5 bits (140), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 45/146 (30%), Positives = 73/146 (50%), Gaps = 22/146 (15%)

Query: 919  RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPG 978
            + + LVLDLD TL++S           E+ R     D +   R  F      ++ K RP 
Sbjct: 267  KAVTLVLDLDETLVHSTL---------EVCR-----DTDFSFRVTFNMQENTVYVKQRPY 312

Query: 979  IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVP 1038
            ++ FLER  +LF + ++T  + +YA+++  +LDP G   + R     D     DG     
Sbjct: 313  LYRFLERVVELFHVVIFTASHSIYASQLLDILDPDGKFVSQRFYR--DSCILSDG----I 366

Query: 1039 KSKDLEGVLGMESA-VVIIDDSVRVW 1063
             +KDL  VLG++ A V I+D+  +V+
Sbjct: 367  YTKDL-TVLGLDLAKVAIVDNCPQVY 391


>gi|356540144|ref|XP_003538550.1| PREDICTED: CTD small phosphatase-like protein 2-like [Glycine max]
          Length = 462

 Score = 58.5 bits (140), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 58/181 (32%), Positives = 83/181 (45%), Gaps = 42/181 (23%)

Query: 907  RRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRF 966
            RRL   K+  S     LVLDLD TL++S             L   E+ D        F F
Sbjct: 259  RRLLLPKQTRSCPSTTLVLDLDETLVHST------------LEPCEDVD--------FTF 298

Query: 967  P-------HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAG 1019
            P       H+ ++ + RP +  FLER S LFE+ ++T    +YA ++  VLDPK  +F  
Sbjct: 299  PVNFNSEEHI-VYVRCRPHLKDFLERVSGLFEIIIFTASQSIYAEQLLNVLDPKRKIFRH 357

Query: 1020 RVISRGD---DGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVER 1075
            RV        +G+           KDL  VLG + A V+IID+S + +     N I +E 
Sbjct: 358  RVYRESCVYVEGNYL---------KDL-TVLGRDLAHVMIIDNSPQAFGFQVDNGIPIES 407

Query: 1076 Y 1076
            +
Sbjct: 408  W 408


>gi|224091747|ref|XP_002309339.1| predicted protein [Populus trichocarpa]
 gi|222855315|gb|EEE92862.1| predicted protein [Populus trichocarpa]
          Length = 204

 Score = 58.5 bits (140), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 29/58 (50%), Positives = 39/58 (67%), Gaps = 1/58 (1%)

Query: 971  MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 1028
            M  K RP    FL+ AS++F +++YT+G+  YA EMAK+LDP G  F  +V SR DDG
Sbjct: 1    MMIKSRPFARMFLKEASQMFGLYMYTLGDPAYALEMAKLLDPGGEFFNAKVTSR-DDG 57


>gi|297794619|ref|XP_002865194.1| NLI interacting factor family protein [Arabidopsis lyrata subsp.
            lyrata]
 gi|297311029|gb|EFH41453.1| NLI interacting factor family protein [Arabidopsis lyrata subsp.
            lyrata]
          Length = 447

 Score = 58.2 bits (139), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 45/146 (30%), Positives = 71/146 (48%), Gaps = 22/146 (15%)

Query: 919  RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPG 978
            + + LVLDLD TL++S             L      D     R  F      ++ K RP 
Sbjct: 278  KSVTLVLDLDETLVHST------------LESCNVADFS--FRVFFNMQENTVYVKQRPH 323

Query: 979  IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVP 1038
            ++ FLER  +LF + ++T  + +YA+++  +LDP+G   + R     D     DG     
Sbjct: 324  LYRFLERVGELFHVVIFTASHNIYASQLLDILDPEGKFISQRFYR--DSCILLDG----I 377

Query: 1039 KSKDLEGVLGMESA-VVIIDDSVRVW 1063
             +KDL  VLG++ A V IID+  +V+
Sbjct: 378  YTKDL-TVLGLDLAKVAIIDNCPQVY 402


>gi|301754747|ref|XP_002913218.1| PREDICTED: CTD small phosphatase-like protein 2-like [Ailuropoda
            melanoleuca]
          Length = 466

 Score = 58.2 bits (139), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 60/234 (25%), Positives = 103/234 (44%), Gaps = 24/234 (10%)

Query: 846  IKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQK 903
            I +G +      N D         P++G    H ++ + +   +F+ Y   +      ++
Sbjct: 211  INNGLEEAEETVNRDIPPLTAPVTPDSGYSSTHAEATYEEDWEVFDPYYFIKHVPPLTEE 270

Query: 904  ERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHL 963
            +  R+     K  S  +  LVLDLD TL           VH  +    E +D       L
Sbjct: 271  QLNRKPALPLKTRSTPEFSLVLDLDETL-----------VHCSL---NELEDAALTFPVL 316

Query: 964  FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 1023
            F+     ++ +LRP    FLER S+++E+ L+T   K+YA ++  +LDPK  L   R+  
Sbjct: 317  FQDVIYQVYVRLRPFFREFLERMSQMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR 376

Query: 1024 RGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
              +      G+      KDL  +LG + S  +IID+S + + +   N I +E +
Sbjct: 377  --EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 423


>gi|395837830|ref|XP_003791832.1| PREDICTED: CTD small phosphatase-like protein 2 isoform 1 [Otolemur
            garnettii]
 gi|395837832|ref|XP_003791833.1| PREDICTED: CTD small phosphatase-like protein 2 isoform 2 [Otolemur
            garnettii]
          Length = 466

 Score = 58.2 bits (139), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 60/234 (25%), Positives = 103/234 (44%), Gaps = 24/234 (10%)

Query: 846  IKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQK 903
            I +G +      N D         P++G    H ++ + +   +F+ Y   +      ++
Sbjct: 211  INNGLEEAEETVNRDIPPLTAPVTPDSGYSSTHAEATYEEDWEVFDPYYFIKHVPPLTEE 270

Query: 904  ERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHL 963
            +  R+     K  S  +  LVLDLD TL           VH  +    E +D       L
Sbjct: 271  QLNRKPALPLKTRSTPEFSLVLDLDETL-----------VHCSL---NELEDAALTFPVL 316

Query: 964  FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 1023
            F+     ++ +LRP    FLER S+++E+ L+T   K+YA ++  +LDPK  L   R+  
Sbjct: 317  FQDVIYQVYVRLRPFFREFLERMSQMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR 376

Query: 1024 RGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
              +      G+      KDL  +LG + S  +IID+S + + +   N I +E +
Sbjct: 377  --EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 423


>gi|255547724|ref|XP_002514919.1| conserved hypothetical protein [Ricinus communis]
 gi|223545970|gb|EEF47473.1| conserved hypothetical protein [Ricinus communis]
          Length = 455

 Score = 58.2 bits (139), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 50/155 (32%), Positives = 76/155 (49%), Gaps = 22/155 (14%)

Query: 923  LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTF 982
            LVLDLD TL++S     ++P  D         D   P    F      ++ + RP +  F
Sbjct: 265  LVLDLDETLVHST----LEPCGD--------ADFTFPVN--FNLQEHTVYVRCRPFLKDF 310

Query: 983  LERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKD 1042
            +ER S LFE+ ++T    +YA ++  VLDPK  +F  RV    +     +G+      KD
Sbjct: 311  MERVSSLFEIIIFTASQSIYAEQLLNVLDPKRKVFRHRVFR--ESCVYVEGN----YLKD 364

Query: 1043 LEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERY 1076
            L  VLG + A V+IID+S + +     N I +E +
Sbjct: 365  L-SVLGRDLARVIIIDNSPQAFGFQVDNGIPIESW 398


>gi|355681384|gb|AER96789.1| CTD small phosphatase like 2 [Mustela putorius furo]
          Length = 465

 Score = 58.2 bits (139), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 60/234 (25%), Positives = 103/234 (44%), Gaps = 24/234 (10%)

Query: 846  IKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQK 903
            I +G +      N D         P++G    H ++ + +   +F+ Y   +      ++
Sbjct: 211  INNGLEEAEETVNRDIPPLTAPVTPDSGYSSTHAEATYEEDWEVFDPYYFIKHVPPLTEE 270

Query: 904  ERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHL 963
            +  R+     K  S  +  LVLDLD TL           VH  +    E +D       L
Sbjct: 271  QLNRKPALPLKTRSTPEFSLVLDLDETL-----------VHCSL---NELEDAALTFPVL 316

Query: 964  FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 1023
            F+     ++ +LRP    FLER S+++E+ L+T   K+YA ++  +LDPK  L   R+  
Sbjct: 317  FQDVIYQVYVRLRPFFREFLERMSQMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR 376

Query: 1024 RGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
              +      G+      KDL  +LG + S  +IID+S + + +   N I +E +
Sbjct: 377  --EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 423


>gi|294915786|ref|XP_002778342.1| hypothetical protein Pmar_PMAR015889 [Perkinsus marinus ATCC 50983]
 gi|239886620|gb|EER10137.1| hypothetical protein Pmar_PMAR015889 [Perkinsus marinus ATCC 50983]
          Length = 278

 Score = 57.8 bits (138), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 53/204 (25%), Positives = 84/204 (41%), Gaps = 41/204 (20%)

Query: 920  KLCLVLDLDHTLLNSAKFHE--------VDPVHDEILRKKEEQDREKPHRHLFRFPHMGM 971
            +L +VLDLD T++NS +  +        V P+  EI R ++      P  +L     + +
Sbjct: 45   RLDVVLDLDRTMVNSFEIRKAGRSESENVTPILQEIYRDEQGL----PELYLCVISDVKV 100

Query: 972  WTKLRPGIWTFLERASKLFE----MHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD 1027
             TK+RP    F+       E    + +YT G++ Y   + ++LDP G L  GR++SR D+
Sbjct: 101  LTKIRPHARAFIRELVASTEYGVVISIYTKGSRRYMEVVKQMLDPSGELIKGRLVSRDDE 160

Query: 1028 ---------------------GDPFDG-DERVPKSKDLEGVLGMESAVVIIDDSVRVWP- 1064
                                 G  FDG D R+           M    V++DDS   WP 
Sbjct: 161  PSNMTPVEKDPDLIINASVESGAQFDGSDGRLCNGDKETKESEMRRWFVVLDDSPEAWPE 220

Query: 1065 --HNKLNLIVVERYTYFPCSRRQF 1086
                  N++    Y +   + RQ 
Sbjct: 221  ELREAGNVVTANMYDFAEVNHRQL 244


>gi|82541597|ref|XP_725029.1| NLI interacting factor [Plasmodium yoelii yoelii 17XNL]
 gi|23479881|gb|EAA16594.1| NLI interacting factor, putative [Plasmodium yoelii yoelii]
          Length = 1177

 Score = 57.8 bits (138), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 39/136 (28%), Positives = 66/136 (48%), Gaps = 16/136 (11%)

Query: 955  DREKPHRHLFRFPHMGM--WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDP 1012
            + ++P  + F  P+     + K RP +  FLE  S  +E+ +YT   + YA  +  +LDP
Sbjct: 759  ENDEPELYKFFLPYYNFFYYLKFRPYVRQFLEILSLYYELSIYTNATREYADVVIAILDP 818

Query: 1013 KGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVL-GMESAVVI-IDDSVRVW---PHNK 1067
               +FA R+++R       D DE    +K  E +   ++   VI  DD   VW   PH+ 
Sbjct: 819  DRTIFADRIVAR---CSSVDRDE----NKHFEKIYPNVDPKYVIAFDDRKDVWFDIPHS- 870

Query: 1068 LNLIVVERYTYFPCSR 1083
             +++  E Y +F  S+
Sbjct: 871  -HILRAEHYNFFELSK 885


>gi|412985958|emb|CCO17158.1| RNA Polymerase II CTD phosphatase Fcp1 [Bathycoccus prasinos]
          Length = 490

 Score = 57.8 bits (138), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 55/190 (28%), Positives = 91/190 (47%), Gaps = 27/190 (14%)

Query: 920  KLCLVLDLDHTLLNS---AKFHEVDP-----------VHDEILRKKEEQDREKPHRHLFR 965
            KL LVLDLD TLL+S    KF   +P           +  +  +K E +    P +  F 
Sbjct: 101  KLPLVLDLDSTLLHSVEKTKFLFPNPGESNTSEEEMKIIKQAQKKIESRLESSPDK--FF 158

Query: 966  FPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMA-KVLDPKGVLFAGRVISR 1024
            + +   +TK+RP    FL   S+++E+++ T G++ YA  +A +VLDP G  F  R ++R
Sbjct: 159  YVNDQYFTKIRPQARRFLSELSEMYELYIVTAGSQAYAEAIANQVLDPLGKYF-NRDVNR 217

Query: 1025 GDDGDPFDG------DERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTY 1078
                  ++       D R     D   + G ES  ++++D   +W   +  ++ V+ Y Y
Sbjct: 218  IKGMKQWNSEVNQWVDVRTKIVND--ALEGAESVTIVVEDKPEMW-DGECAVMQVKPYYY 274

Query: 1079 FPCSRRQFGL 1088
            FP S  +  L
Sbjct: 275  FPESLEELKL 284


>gi|84994102|ref|XP_951773.1| CTD-like phosphatase [Theileria annulata strain Ankara]
 gi|65301934|emb|CAI74041.1| CTD-like phosphatase, putative [Theileria annulata]
          Length = 767

 Score = 57.8 bits (138), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 37/119 (31%), Positives = 55/119 (46%), Gaps = 14/119 (11%)

Query: 966  FPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG 1025
            F ++  + KLRP I  FL+  S  +EM +YT   K YA  +  +LDP   LF  R+++R 
Sbjct: 338  FANVNYYMKLRPCIREFLQILSLYYEMSIYTNATKEYADVVISILDPDRSLFMDRIVARN 397

Query: 1026 --DDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVW---PHNKLNLIVVERYTYF 1079
              D+ D      R+    D   +L         DD   VW   PH +  ++  E Y +F
Sbjct: 398  SVDEKDLLKSASRLYPDLDTRFILAF-------DDRRDVWSDIPHKQ--VVRAEHYDFF 447


>gi|359473746|ref|XP_002271611.2| PREDICTED: CTD small phosphatase-like protein 2-like [Vitis vinifera]
 gi|297738449|emb|CBI27650.3| unnamed protein product [Vitis vinifera]
          Length = 503

 Score = 57.8 bits (138), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 51/165 (30%), Positives = 76/165 (46%), Gaps = 42/165 (25%)

Query: 923  LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP-------HMGMWTKL 975
            LVLDLD TL++S     ++P  D                  F FP       HM ++ + 
Sbjct: 313  LVLDLDETLVHST----LEPCDDAD----------------FTFPVNFNLKEHM-VYVRC 351

Query: 976  RPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD---DGDPFD 1032
            RP +  F+ER + LFE+ ++T    +YA ++  VLDPK   F  RV        +G+   
Sbjct: 352  RPHLKDFMERVASLFEIIIFTASQSIYAEQLLNVLDPKRRFFRHRVYRESCVFVEGNYL- 410

Query: 1033 GDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERY 1076
                    KDL  VLG + A V+IID+S + +     N I +E +
Sbjct: 411  --------KDL-SVLGRDLAHVIIIDNSPQAFGFQVDNGIPIESW 446


>gi|225716618|gb|ACO14155.1| RNA polymerase II subunit A C-terminal domain phosphatase [Esox
            lucius]
          Length = 266

 Score = 57.8 bits (138), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 31/103 (30%), Positives = 58/103 (56%), Gaps = 22/103 (21%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
            E+Q+++   +KL L++DLD TL+++                 E+  +   ++ +F F  +
Sbjct: 167  EDQQRLHRNKKLVLMVDLDQTLIHTT----------------EQHCQRMSNKGIFHF-QL 209

Query: 970  G-----MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMA 1007
            G     + T+LRP    FLE+ +KL+E+H++T G++LYA  +A
Sbjct: 210  GRGEPMLHTRLRPHCKEFLEKIAKLYELHVFTFGSRLYAHTIA 252


>gi|125527169|gb|EAY75283.1| hypothetical protein OsI_03170 [Oryza sativa Indica Group]
          Length = 507

 Score = 57.4 bits (137), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 48/167 (28%), Positives = 81/167 (48%), Gaps = 36/167 (21%)

Query: 918  ARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP-------HMG 970
            ARK+ LVLDLD TL++S                      E+   + F FP       HM 
Sbjct: 325  ARKVTLVLDLDETLVHSTT--------------------EQCDDYDFTFPVFFDLKEHM- 363

Query: 971  MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1030
            ++ + RP +  FL++ +++FE+ ++T    +YA ++  +LDP+  LF+ R          
Sbjct: 364  VYVRKRPHLHMFLQKMAEMFEVVIFTASQSVYADQLLDILDPEKKLFSRRYFRESCVF-- 421

Query: 1031 FDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERY 1076
                     +KDL  V+G++ A VVIID++ +V+     N I +E +
Sbjct: 422  ----TNTSYTKDL-TVVGVDLAKVVIIDNTPQVFQLQVNNGIPIESW 463


>gi|297597322|ref|NP_001043795.2| Os01g0665300 [Oryza sativa Japonica Group]
 gi|55773815|dbj|BAD72353.1| Chain A, Three-Dimensional Structure Of A Rna-Polymerase Ii Binding
            Protein With Associated Ligand-like [Oryza sativa
            Japonica Group]
 gi|125571492|gb|EAZ13007.1| hypothetical protein OsJ_02926 [Oryza sativa Japonica Group]
 gi|255673527|dbj|BAF05709.2| Os01g0665300 [Oryza sativa Japonica Group]
          Length = 439

 Score = 57.4 bits (137), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 48/167 (28%), Positives = 81/167 (48%), Gaps = 36/167 (21%)

Query: 918  ARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP-------HMG 970
            ARK+ LVLDLD TL++S                      E+   + F FP       HM 
Sbjct: 257  ARKVTLVLDLDETLVHSTT--------------------EQCDDYDFTFPVFFDMKEHM- 295

Query: 971  MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1030
            ++ + RP +  FL++ +++FE+ ++T    +YA ++  +LDP+  LF+ R          
Sbjct: 296  VYVRKRPHLHMFLQKMAEMFEVVIFTASQSVYADQLLDILDPEKKLFSRRYFRESCVF-- 353

Query: 1031 FDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERY 1076
                     +KDL  V+G++ A VVIID++ +V+     N I +E +
Sbjct: 354  ----TNTSYTKDL-TVVGVDLAKVVIIDNTPQVFQLQVNNGIPIESW 395


>gi|55740289|gb|AAV63947.1| putative nuclear LIM interactor-interacting protein [Phytophthora
            sojae]
          Length = 261

 Score = 57.4 bits (137), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 53/153 (34%), Positives = 75/153 (49%), Gaps = 32/153 (20%)

Query: 917  SARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHL---FRFPHMGMWT 973
            +A K+CLVLDLD TL++ +    VD V             + PH      F      +  
Sbjct: 75   NAPKICLVLDLDETLVHCS----VDEV-------------KNPHMQFPVTFNGVEYTVNV 117

Query: 974  KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDG 1033
            K RP +  FL+R SKLFE+ ++T  +K+YA ++  +LDP       R + R D  D F  
Sbjct: 118  KKRPHLEYFLKRVSKLFEIVVFTASHKVYAEKLMNMLDPNRNFIKYR-LYREDCLDVFGN 176

Query: 1034 DERVPKSKDLEGVLGME-SAVVIIDDSVRVWPH 1065
                   KDL  VLG + S VV++D+S    PH
Sbjct: 177  -----YLKDL-NVLGRDLSKVVLVDNS----PH 199


>gi|357463015|ref|XP_003601789.1| CTD small phosphatase-like protein [Medicago truncatula]
 gi|355490837|gb|AES72040.1| CTD small phosphatase-like protein [Medicago truncatula]
          Length = 885

 Score = 57.4 bits (137), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 54/171 (31%), Positives = 86/171 (50%), Gaps = 22/171 (12%)

Query: 907  RRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRF 966
            RR+   K+  S   + LVLDLD TL++S+    ++P  D         + E+   H+   
Sbjct: 265  RRMLLPKQTRSCPPITLVLDLDETLVHSS----LEPCEDVDFTFTVNFNSEE---HI--- 314

Query: 967  PHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD 1026
                ++ + RP +  FLER S LFE+ ++T    +YA ++  VLDPK  +F  RV    +
Sbjct: 315  ----VYVRCRPHLKEFLERVSGLFEIIIFTASQSIYAEQLLNVLDPKRKIFRHRVFR--E 368

Query: 1027 DGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERY 1076
                 +G+      KDL  VLG + A V+IID+S + +     N I +E +
Sbjct: 369  SCVYVEGNYL----KDL-TVLGRDLAHVMIIDNSPQAFGFQVDNGIPIESW 414



 Score = 57.0 bits (136), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 56/181 (30%), Positives = 83/181 (45%), Gaps = 36/181 (19%)

Query: 890  FEGY------DDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPV 943
            F GY      DD  KA  + E   R        S   + LVLDLD TL++S+        
Sbjct: 665  FVGYLALIPIDDISKAFKKGESWTR--------SCPPITLVLDLDETLVHSSL------- 709

Query: 944  HDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYA 1003
                   K  +D +      F+     ++ + RP +  FLER S LFE+ ++T    +YA
Sbjct: 710  -------KPSEDVDFTFTVNFKSEEYIVYVRCRPHLKEFLERVSGLFEIIIFTASQSIYA 762

Query: 1004 TEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRV 1062
             ++  +LDPK  +F  RV    +     +G+      KDL  VLG + A V+IID+S R 
Sbjct: 763  EQLLNLLDPKRKIFRHRVFR--ESCVKVEGNYL----KDL-TVLGCDLAHVMIIDNSRRA 815

Query: 1063 W 1063
            +
Sbjct: 816  F 816


>gi|355692677|gb|EHH27280.1| CTD small phosphatase-like protein 2 [Macaca mulatta]
          Length = 466

 Score = 57.0 bits (136), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 60/234 (25%), Positives = 102/234 (43%), Gaps = 24/234 (10%)

Query: 846  IKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQK 903
            I +G +      N D         P++G   AH ++ + +   +F+ Y   +      ++
Sbjct: 211  INNGLEEAEETVNRDIPPLTAPVTPDSGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEE 270

Query: 904  ERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHL 963
            +  R+     K  S  +  LVLDLD TL           VH  +    E +D       L
Sbjct: 271  QLNRKPALPLKTRSTPEFSLVLDLDETL-----------VHCSL---NELEDAALTFPVL 316

Query: 964  FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 1023
            F+     ++ +LRP    FLER S+++E+ L+T   K+YA ++  +LDPK  L       
Sbjct: 317  FQDVIYQVYVRLRPFFREFLERMSQMYEIILFTASKKVYADKLLNILDPKKQLVRHHFFC 376

Query: 1024 RGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
              +      G+      KDL  +LG + S  +IID+S + + +   N I +E +
Sbjct: 377  --EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 423


>gi|7022613|dbj|BAA91664.1| unnamed protein product [Homo sapiens]
          Length = 286

 Score = 57.0 bits (136), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 58/234 (24%), Positives = 104/234 (44%), Gaps = 24/234 (10%)

Query: 846  IKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQK 903
            + +G +      N D         P++G   AH ++ + +   +F+ Y   +      ++
Sbjct: 31   LNNGLEEAEETVNRDIPPLTAPVTPDSGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEE 90

Query: 904  ERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHL 963
            +  R+     K  S  +  LVLDLD TL++ +                E +D       L
Sbjct: 91   QLNRKPALPLKTRSTPEFSLVLDLDETLVHCSL--------------NELEDAALTFPVL 136

Query: 964  FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 1023
            F+     ++ +LRP    FLER S+++E+ L+T   K+YA ++  +LDPK  L   R+  
Sbjct: 137  FQDVVYQVYVRLRPFFREFLERMSQMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR 196

Query: 1024 RGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
              +      G+      KDL  +LG + S  +IID+S + + +   N I +E +
Sbjct: 197  --EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 243


>gi|395503570|ref|XP_003756137.1| PREDICTED: CTD small phosphatase-like protein 2 [Sarcophilus
            harrisii]
          Length = 395

 Score = 57.0 bits (136), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 36/118 (30%), Positives = 64/118 (54%), Gaps = 8/118 (6%)

Query: 960  HRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAG 1019
            H++L ++    ++ +LRP    FLER S+++E+ L+T   K+YA ++  +LDPK  L   
Sbjct: 242  HKNLKKYIDSNVYVRLRPFFREFLERMSQIYEIILFTASKKVYADKLLNILDPKKQLVRH 301

Query: 1020 RVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
            R+    +      G+      KDL  +LG + S  +IID+S + + +   N I +E +
Sbjct: 302  RLFR--EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 352


>gi|340380578|ref|XP_003388799.1| PREDICTED: hypothetical protein LOC100637093 [Amphimedon
            queenslandica]
          Length = 532

 Score = 57.0 bits (136), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 53/176 (30%), Positives = 83/176 (47%), Gaps = 37/176 (21%)

Query: 905  RTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD---REKPHR 961
            RTRR  E          CLVLDLD TL++ +            L K E  +   + +   
Sbjct: 348  RTRRTPE---------FCLVLDLDETLVHCS------------LSKLELANFTFKVEYSN 386

Query: 962  HLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
             LF      ++ +LRP    FLER SK FE+ L+T   K+YA ++  ++DP   L   R+
Sbjct: 387  QLF-----DVYVRLRPYFHEFLERVSKQFEVILFTASTKVYADKLLDLIDPSRRLVKHRL 441

Query: 1022 ISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERY 1076
                D     DG+      K+L G+LG + A  +I+D+S + + +   N + +E +
Sbjct: 442  FR--DHCVCVDGN----FIKEL-GILGRDLAKTIIVDNSPQAFGYQLSNGVPIESW 490


>gi|358335312|dbj|GAA53844.1| CTD small phosphatase-like protein 2 [Clonorchis sinensis]
          Length = 498

 Score = 57.0 bits (136), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 51/189 (26%), Positives = 91/189 (48%), Gaps = 31/189 (16%)

Query: 913  KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGM- 971
            K+  SA + CLVLDLD TL++ +    + P+ D               + +F+    G+ 
Sbjct: 299  KRTRSAPEFCLVLDLDETLVHCS----LTPLPDA--------------QFIFQVVFQGVV 340

Query: 972  ---WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 1028
               + ++RP ++ FL R S+ FE+ L+T   K+YA  +  ++DPK      R+    +  
Sbjct: 341  YMVYVRIRPHLYEFLSRVSERFEVVLFTASTKVYADRLVNLIDPKKKWIKHRLFR--EHC 398

Query: 1029 DPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFG 1087
               +G+      KDL  VLG +    VI+D+S + + +   N + +E + +   + R+  
Sbjct: 399  VCVNGN----YVKDLR-VLGRDLRKTVIVDNSPQAFGYQLDNGVPIESW-FVDSNDRELL 452

Query: 1088 LLGPSLLEI 1096
             L P L E+
Sbjct: 453  NLLPFLFEV 461


>gi|294877772|ref|XP_002768119.1| hypothetical protein Pmar_PMAR002906 [Perkinsus marinus ATCC 50983]
 gi|239870316|gb|EER00837.1| hypothetical protein Pmar_PMAR002906 [Perkinsus marinus ATCC 50983]
          Length = 161

 Score = 57.0 bits (136), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 45/153 (29%), Positives = 73/153 (47%), Gaps = 23/153 (15%)

Query: 931  LLNSAKFHEVDPVHDE----ILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFL-ER 985
            ++NS +  + DP   E    IL++  + +   P  +L     + + TK+RP    F+ E 
Sbjct: 1    MVNSYEIGKADPSQSESVTPILQEVYKDEEGLPELYLCVISDVKVLTKIRPHARAFIREL 60

Query: 986  ASKL---FEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKD 1042
             SK      + +YT G++ Y   + K+LDP G L  GR++SR D+          P  KD
Sbjct: 61   VSKTGCGVVLSIYTKGSRRYMEVIKKMLDPSGELIKGRLVSREDEPSNM-----TPLEKD 115

Query: 1043 LEGVLGMESAV----------VIIDDSVRVWPH 1065
             + ++  +SAV          V++DDS  VWP 
Sbjct: 116  PDFIINADSAVGTEELRRRWFVVLDDSPEVWPE 148


>gi|432851772|ref|XP_004067077.1| PREDICTED: CTD small phosphatase-like protein 2-A-like [Oryzias
            latipes]
          Length = 474

 Score = 56.6 bits (135), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 88/359 (24%), Positives = 149/359 (41%), Gaps = 61/359 (16%)

Query: 735  MDELGKVRMKPRDPRRVLHGNALQRSGSLGPEFKTDGPSAPCTQGSK------ENLNFQK 788
            ++E   V +    PR  L G          P F    P+   + GS       E     K
Sbjct: 117  LEEASAVEVTASPPRTTLLGTIF------SPVFNFFSPAKNASSGSDSPDQALEAEEIVK 170

Query: 789  QLGAPEAKPVLSQSVLQPDI--TQQFTKNLKHIADFMSVSQPLTSEPMVSQNSP-IQPGQ 845
            QL   E   + S +V Q D+  T  F  ++ H+        P    P + + SP I   +
Sbjct: 171  QLDMEEVVEMPSSTVTQ-DVCATTHFYSSVSHL--------PSLRPPHMLEASPTIDEAE 221

Query: 846  IKSGADMKAVVTNHDDKQTGTGSGPEAG-----PVGAHPQSAWGDVEHLFEGYDDQQKA- 899
            +++ AD+  +        T  G+ PE       P    P+ ++ +   +F+ Y   +   
Sbjct: 222  LEADADLPPL--------TAPGASPEMTYVDVPPPAVPPEVSYEEDWEVFDPYFFIKHVP 273

Query: 900  -AIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREK 958
               +++ TR+     K  S  +  LVLDLD TL++ +                E +D   
Sbjct: 274  PLTEEQLTRKPALPLKTRSTPEFSLVLDLDETLVHCSL--------------NELEDAAL 319

Query: 959  PHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA 1018
                LF+     ++ +LRP    FLER S+++E+ L+T   K+YA ++  +LDPK  L  
Sbjct: 320  TFPVLFQDVIYQVYVRLRPFFREFLERMSQIYEIILFTASKKVYADKLLNILDPKKQLVR 379

Query: 1019 GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
             R+    +      G+      KDL  +LG + S  +IID+S + + +   N I +E +
Sbjct: 380  HRLFR--EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 431


>gi|348523113|ref|XP_003449068.1| PREDICTED: CTD small phosphatase-like protein 2-A-like [Oreochromis
            niloticus]
          Length = 378

 Score = 56.6 bits (135), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 43/156 (27%), Positives = 77/156 (49%), Gaps = 22/156 (14%)

Query: 923  LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTF 982
            LV+DL+ TL+    F  ++ +          +D E      F+     ++  LRP +  F
Sbjct: 202  LVVDLEETLM----FSSLNVI----------EDAEYTFHAAFQDHQYKVYMVLRPHVKEF 247

Query: 983  LERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKD 1042
            L+  +K++E+ +YT   K YA ++  +LDP+  LF  R+    DD     G       KD
Sbjct: 248  LQAMAKIYELFVYTCAKKEYAEKILDILDPQRKLFRHRLYQ--DDCACVLGH----YIKD 301

Query: 1043 LEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYT 1077
            L  +LG +    V++D++   +P+N LN I ++ ++
Sbjct: 302  L-SILGRDLKKTVVLDNAPHTYPYNLLNTIPIKSWS 336


>gi|148233948|ref|NP_001082795.1| CTD small phosphatase-like protein 2-B [Danio rerio]
 gi|187471000|sp|A4QNX6.1|CTL2B_DANRE RecName: Full=CTD small phosphatase-like protein 2-B;
            Short=CTDSP-like 2-B
 gi|141796856|gb|AAI39561.1| Zgc:162265 protein [Danio rerio]
          Length = 460

 Score = 56.6 bits (135), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 61/220 (27%), Positives = 99/220 (45%), Gaps = 28/220 (12%)

Query: 864  TGTGSGPEAGPVGAHPQ-SAWGDVEHLFEGYD-----DQQKAAIQKERTRRLEEQKKMFS 917
            T  GS    G V A     A G  E  +E +D            +++ TR+     K  S
Sbjct: 219  TAPGSPATGGYVDASITVPAEGSYEEEWEVFDPYFFIKHVPPLTEEQLTRKPALPLKTRS 278

Query: 918  ARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRP 977
              +  LVLDLD TL++ +  +E+D             D       LF+     ++ +LRP
Sbjct: 279  TPEFSLVLDLDETLVHCS-LNELD-------------DAALTFPVLFQDVIYQVYVRLRP 324

Query: 978  GIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERV 1037
                FLER S+++E+ L+T   K+YA ++  +LDP+  L   R+    +      G+   
Sbjct: 325  FFREFLERMSQIYEIILFTASKKVYADKLLNILDPRKQLVRHRLFR--EHCVCVQGN--- 379

Query: 1038 PKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
               KDL  +LG + S  +IID+S + + +   N I +E +
Sbjct: 380  -YIKDL-NILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 417


>gi|68068525|ref|XP_676173.1| hypothetical protein [Plasmodium berghei strain ANKA]
 gi|56495746|emb|CAI00611.1| conserved hypothetical protein [Plasmodium berghei]
          Length = 953

 Score = 56.6 bits (135), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 39/136 (28%), Positives = 66/136 (48%), Gaps = 16/136 (11%)

Query: 955  DREKPHRHLFRFPHMGM--WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDP 1012
            + ++P  + F  P+     + K RP +  FLE  S  +E+ +YT   + YA  +  +LDP
Sbjct: 598  ENDEPELYKFFLPYYNFFYYLKFRPYVRQFLEILSLYYELSIYTNATREYADVVIAILDP 657

Query: 1013 KGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVL-GMESAVVI-IDDSVRVW---PHNK 1067
               +FA R+++R       D DE    +K  E +   ++   VI  DD   VW   PH+ 
Sbjct: 658  DRTIFADRIVAR---CSSVDRDE----NKHFEKIYPNVDPKYVIAFDDRKDVWFDIPHS- 709

Query: 1068 LNLIVVERYTYFPCSR 1083
             +++  E Y +F  S+
Sbjct: 710  -HILRAEHYNFFELSK 724


>gi|186529839|ref|NP_001119383.1| SCP1-like small phosphatase 4 [Arabidopsis thaliana]
 gi|332007998|gb|AED95381.1| SCP1-like small phosphatase 4 [Arabidopsis thaliana]
          Length = 456

 Score = 56.6 bits (135), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 44/146 (30%), Positives = 70/146 (47%), Gaps = 22/146 (15%)

Query: 919  RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPG 978
            + + LVLDLD TL++S             L      D     R  F      ++ + RP 
Sbjct: 282  KSVTLVLDLDETLVHST------------LESCNVADFS--FRVFFNMQENTVYVRQRPH 327

Query: 979  IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVP 1038
            ++ FLER  +LF + ++T  + +YA+++  +LDP G   + R     D     DG     
Sbjct: 328  LYRFLERVGELFHVVIFTASHSIYASQLLDILDPDGKFISQRFYR--DSCILLDG----I 381

Query: 1039 KSKDLEGVLGMESA-VVIIDDSVRVW 1063
             +KDL  VLG++ A V IID+  +V+
Sbjct: 382  YTKDL-TVLGLDLAKVAIIDNCPQVY 406


>gi|326671582|ref|XP_700009.2| PREDICTED: CTD small phosphatase-like protein 2-like [Danio rerio]
          Length = 358

 Score = 56.6 bits (135), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 47/155 (30%), Positives = 75/155 (48%), Gaps = 22/155 (14%)

Query: 923  LVLDLDHTLLNSAKFHEVDPVHD-EILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWT 981
            LVLDLD TL+    F  ++ + D E       QD    H++        ++  LRP +  
Sbjct: 182  LVLDLDETLV----FSSLNVIPDAEYTFNTRFQD----HKY-------KVYVILRPHVRE 226

Query: 982  FLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSK 1041
            FL+  +K FEM +YT   K YA ++  +LDP   LF  R+    DD     G       K
Sbjct: 227  FLQAMTKHFEMFVYTSAKKEYAEKIVDILDPNKKLFRHRLYQ--DDCACVLGH----YIK 280

Query: 1042 DLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERY 1076
            DL  +    S  VI+D++   +P++ +N+I ++ +
Sbjct: 281  DLTILERDLSKTVILDNAPHTFPYHLMNMIPIKSW 315


>gi|22327621|ref|NP_199453.2| SCP1-like small phosphatase 4 [Arabidopsis thaliana]
 gi|18377616|gb|AAL66958.1| unknown protein [Arabidopsis thaliana]
 gi|20465765|gb|AAM20371.1| unknown protein [Arabidopsis thaliana]
 gi|332007997|gb|AED95380.1| SCP1-like small phosphatase 4 [Arabidopsis thaliana]
          Length = 453

 Score = 56.2 bits (134), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 44/146 (30%), Positives = 70/146 (47%), Gaps = 22/146 (15%)

Query: 919  RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPG 978
            + + LVLDLD TL++S             L      D     R  F      ++ + RP 
Sbjct: 279  KSVTLVLDLDETLVHST------------LESCNVADFS--FRVFFNMQENTVYVRQRPH 324

Query: 979  IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVP 1038
            ++ FLER  +LF + ++T  + +YA+++  +LDP G   + R     D     DG     
Sbjct: 325  LYRFLERVGELFHVVIFTASHSIYASQLLDILDPDGKFISQRFYR--DSCILLDG----I 378

Query: 1039 KSKDLEGVLGMESA-VVIIDDSVRVW 1063
             +KDL  VLG++ A V IID+  +V+
Sbjct: 379  YTKDL-TVLGLDLAKVAIIDNCPQVY 403


>gi|26449836|dbj|BAC42041.1| unknown protein [Arabidopsis thaliana]
          Length = 453

 Score = 56.2 bits (134), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 44/146 (30%), Positives = 70/146 (47%), Gaps = 22/146 (15%)

Query: 919  RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPG 978
            + + LVLDLD TL++S             L      D     R  F      ++ + RP 
Sbjct: 279  KSVTLVLDLDETLVHST------------LESCNVADFS--FRVFFNMQENTVYVRQRPH 324

Query: 979  IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVP 1038
            ++ FLER  +LF + ++T  + +YA+++  +LDP G   + R     D     DG     
Sbjct: 325  LYRFLERVGELFHVVIFTASHSIYASQLLDILDPDGKFISQRFYR--DSCILLDG----I 378

Query: 1039 KSKDLEGVLGMESA-VVIIDDSVRVW 1063
             +KDL  VLG++ A V IID+  +V+
Sbjct: 379  YTKDL-TVLGLDLAKVAIIDNCPQVY 403


>gi|293331055|ref|NP_001170732.1| uncharacterized protein LOC100384823 [Zea mays]
 gi|238007228|gb|ACR34649.1| unknown [Zea mays]
          Length = 254

 Score = 56.2 bits (134), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 37/114 (32%), Positives = 53/114 (46%), Gaps = 24/114 (21%)

Query: 913  KKMFSARKLCLVLDLDHTLLNSAKFHEVD-----PVHDEILRKKEEQDREKPHRHLFRFP 967
            K+  S   + LVLDLD TL++S   H  D     PVH                   F F 
Sbjct: 60   KQTRSCPTMTLVLDLDETLVHSTLEHCEDADFTFPVH-------------------FNFR 100

Query: 968  HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
               ++ + RP +  FL+R + +FE  ++T    +YA ++  VLDPK  LF  RV
Sbjct: 101  EHTIYVRCRPYLKEFLDRVASVFETIIFTASQSIYAEQLLNVLDPKRKLFRHRV 154


>gi|55740281|gb|AAV63942.1| putative nuclear LIM factor interactor-interacting protein hyphal
            form [Phytophthora infestans]
          Length = 211

 Score = 56.2 bits (134), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 57/177 (32%), Positives = 86/177 (48%), Gaps = 28/177 (15%)

Query: 904  ERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHL 963
            E  R +   ++  +A K+CLVLDLD TL++ +    VD V             + PH   
Sbjct: 12   EGKRPISLPERSHNAPKICLVLDLDETLVHCS----VDEV-------------KNPHMQF 54

Query: 964  ---FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGR 1020
               F      +  K RP +  FL+R SKLFE+ ++T  +K+YA ++  +LDP   L   R
Sbjct: 55   PVTFNGVEYIVNVKKRPHMEYFLKRVSKLFEIVVFTASHKVYAEKLTNMLDPHRNLIKYR 114

Query: 1021 VISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
             + R D  D F         KDL  VLG + S VV++D+S   + +   N I +E +
Sbjct: 115  -LYRDDCLDVFGN-----YLKDLN-VLGRDLSKVVLVDNSPHAFGYQVNNGIPIETW 164


>gi|389584495|dbj|GAB67227.1| hypothetical protein PCYB_112480 [Plasmodium cynomolgi strain B]
          Length = 1447

 Score = 56.2 bits (134), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 36/133 (27%), Positives = 64/133 (48%), Gaps = 16/133 (12%)

Query: 958  KPHRHLFRFPHMGM--WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
            +P  + F  P+     + K RP +  FL+  S  +E+ +YT   + YA  +  +LDP   
Sbjct: 1018 EPELYKFFLPYYNFFYYLKFRPYVRQFLQILSLYYELSIYTNATREYADVVIAILDPDRT 1077

Query: 1016 LFAGRVISRGDDGDPFDGDERVPKSKDLEGVL-GMESAVVI-IDDSVRVW---PHNKLNL 1070
            LFA R+++R    D         ++K+   +   ++S  +I  DD   VW   PH+  ++
Sbjct: 1078 LFADRIVARCSSADR-------EENKNFSKIYPNVDSKYIIAFDDRKDVWTDIPHS--HI 1128

Query: 1071 IVVERYTYFPCSR 1083
            +  E Y +F  S+
Sbjct: 1129 LKAEHYNFFELSK 1141


>gi|387015310|gb|AFJ49774.1| CTD small phosphatase [Crotalus adamanteus]
          Length = 466

 Score = 55.8 bits (133), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 54/211 (25%), Positives = 97/211 (45%), Gaps = 24/211 (11%)

Query: 869  GPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQKERTRRLEEQKKMFSARKLCLVLD 926
             P++G   AH ++ + +   +F+ Y   +      +++  R+     K  S  +  LVLD
Sbjct: 234  SPDSGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEEQLNRKPALPLKTRSTPEFSLVLD 293

Query: 927  LDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERA 986
            LD TL++ +                E +D       LF+     ++ +LRP    FLE  
Sbjct: 294  LDETLVHCSL--------------NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLECM 339

Query: 987  SKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGV 1046
            S+++E+ L+T   K+YA ++  +LDPK  L   R+    +      G+      KDL  +
Sbjct: 340  SQIYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR--EHCVCVQGN----YIKDL-NI 392

Query: 1047 LGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
            LG + S  +IID+S + + +   N I +E +
Sbjct: 393  LGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 423


>gi|115456605|ref|NP_001051903.1| Os03g0850100 [Oryza sativa Japonica Group]
 gi|27573336|gb|AAO20054.1| putative NLI interacting factor [Oryza sativa Japonica Group]
 gi|28269415|gb|AAO37958.1| putative NLI-interacting factor [Oryza sativa Japonica Group]
 gi|108712119|gb|ABF99914.1| NLI interacting factor, putative, expressed [Oryza sativa Japonica
            Group]
 gi|113550374|dbj|BAF13817.1| Os03g0850100 [Oryza sativa Japonica Group]
 gi|125588650|gb|EAZ29314.1| hypothetical protein OsJ_13375 [Oryza sativa Japonica Group]
          Length = 444

 Score = 55.8 bits (133), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 49/168 (29%), Positives = 77/168 (45%), Gaps = 28/168 (16%)

Query: 913  KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMW 972
            K+  S  +  LVLDLD TL++S             L   E+ D   P    F      ++
Sbjct: 252  KQTRSCPRTTLVLDLDETLVHST------------LEPCEDSDFTFPVH--FNLREHTIY 297

Query: 973  TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD---DGD 1029
             + RP +  FLE  + +FE+ ++T    +YA ++  +LDPK  LF  RV        +G+
Sbjct: 298  VRCRPYLKEFLETVASMFEIIIFTASQSIYAEQLLNILDPKRRLFRHRVYRESCLFVEGN 357

Query: 1030 PFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERY 1076
                       KDL  VLG + A VVI+D+S + +     N + +E +
Sbjct: 358  YL---------KDL-SVLGRDLARVVIVDNSPQAFGFQLDNGVPIESW 395


>gi|218194116|gb|EEC76543.1| hypothetical protein OsI_14336 [Oryza sativa Indica Group]
          Length = 444

 Score = 55.5 bits (132), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 49/168 (29%), Positives = 77/168 (45%), Gaps = 28/168 (16%)

Query: 913  KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMW 972
            K+  S  +  LVLDLD TL++S             L   E+ D   P    F      ++
Sbjct: 252  KQTRSCPRTTLVLDLDETLVHST------------LEPCEDSDFTFPVH--FNLREHTIY 297

Query: 973  TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD---DGD 1029
             + RP +  FLE  + +FE+ ++T    +YA ++  +LDPK  LF  RV        +G+
Sbjct: 298  VRCRPYLKEFLETVASMFEIIIFTASQSIYAEQLLNILDPKRRLFRHRVYRESCLFVEGN 357

Query: 1030 PFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERY 1076
                       KDL  VLG + A VVI+D+S + +     N + +E +
Sbjct: 358  YL---------KDL-SVLGRDLARVVIVDNSPQAFGFQLDNGVPIESW 395


>gi|299472381|emb|CBN77569.1| putative nuclear LIM interactor-interacting protein [Ectocarpus
            siliculosus]
          Length = 602

 Score = 55.5 bits (132), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 49/169 (28%), Positives = 80/169 (47%), Gaps = 22/169 (13%)

Query: 909  LEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPH 968
            L E++K    ++L LVLDLD TL++      V+P H           R + H   F    
Sbjct: 351  LPEKRKTRHGKELTLVLDLDETLVHCTVDPIVNPDH-----------RFEVH---FNGEE 396

Query: 969  MGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 1028
              ++ + RP +  FLE  S+LFE+ ++T   ++YA  +  ++DP+      R+    D  
Sbjct: 397  FQVYVRKRPHLDAFLEAVSELFEVVVFTASQQVYAERLLNMIDPQKKFVKYRLYR--DAC 454

Query: 1029 DPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
               +G+      KDL  VLG + S V I+D+S   +     N I +E +
Sbjct: 455  MALEGNYL----KDLN-VLGRDLSKVAIVDNSPYAYGFQIDNGIPIESW 498


>gi|67624539|ref|XP_668552.1| NLI interacting factor [Cryptosporidium hominis TU502]
 gi|54659751|gb|EAL38315.1| NLI interacting factor [Cryptosporidium hominis]
          Length = 595

 Score = 55.5 bits (132), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 51/176 (28%), Positives = 82/176 (46%), Gaps = 22/176 (12%)

Query: 913  KKMFSARKLCLVLDLDHTLL---NSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP-- 967
            K   +  KL  +LDLD+TLL   NS K      + D I    +      P  + F  P  
Sbjct: 166  KDYLAQNKLVAILDLDNTLLHAYNSTKIGCNINLEDFISSSGD------PEMYKFVLPQD 219

Query: 968  -HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD 1026
             +   + KLRPG+  FL   +  + M + T   + YA  +  VLDP+   F  R+++R  
Sbjct: 220  LNTPYYLKLRPGVREFLNTIAPYYIMGICTNATREYADVIRAVLDPQRDKFGDRIVAR-- 277

Query: 1027 DGDPFDGDERVPKSKDLEGV-LGMES-AVVIIDDSVRVWPHNKLNLIV-VERYTYF 1079
              +  DG +     KD   + + +E+ A+V++DD   VW  +  + +V  + Y YF
Sbjct: 278  --ESVDGRD---TQKDFRKICVDVETRAIVLLDDRSDVWDSSLESQVVKAQTYEYF 328


>gi|156101293|ref|XP_001616340.1| hypothetical protein [Plasmodium vivax Sal-1]
 gi|148805214|gb|EDL46613.1| hypothetical protein, conserved [Plasmodium vivax]
          Length = 1544

 Score = 55.5 bits (132), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 37/133 (27%), Positives = 64/133 (48%), Gaps = 16/133 (12%)

Query: 958  KPHRHLFRFPHMGM--WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
            +P  + F  P+     + K RP +  FL+  S  +E+ +YT   + YA  +  +LDP   
Sbjct: 1108 EPELYKFFLPYYNFFYYLKFRPYVRQFLQILSLYYELSIYTNATREYADVVIAILDPDRT 1167

Query: 1016 LFAGRVISRGDDGDPFDGDERVPKSKDLEGVL-GMESAVVI-IDDSVRVW---PHNKLNL 1070
            LFA R+++R    D         ++K+   +   ++S  VI  DD   VW   PH+  ++
Sbjct: 1168 LFADRIVARCSSAD-------REENKNFSKIYPNVDSKYVIAFDDRKDVWTDIPHS--HI 1218

Query: 1071 IVVERYTYFPCSR 1083
            +  E Y +F  S+
Sbjct: 1219 LKAEHYNFFELSK 1231


>gi|66363226|ref|XP_628579.1| RNA pol II carboxy terminal domain phosphatase of the HAD superfamily
            with a BRCT domain at the C-terminus [Cryptosporidium
            parvum Iowa II]
 gi|46229587|gb|EAK90405.1| RNA pol II carboxy terminal domain phosphatase of the HAD superfamily
            with a BRCT domain at the C-terminus [Cryptosporidium
            parvum Iowa II]
 gi|323509333|dbj|BAJ77559.1| cgd7_4250 [Cryptosporidium parvum]
 gi|323509917|dbj|BAJ77851.1| cgd7_4250 [Cryptosporidium parvum]
          Length = 595

 Score = 55.5 bits (132), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 51/176 (28%), Positives = 82/176 (46%), Gaps = 22/176 (12%)

Query: 913  KKMFSARKLCLVLDLDHTLL---NSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP-- 967
            K   +  KL  +LDLD+TLL   NS K      + D I    +      P  + F  P  
Sbjct: 166  KDYLAQNKLVAILDLDNTLLHAYNSTKIGCNINLEDFISSSGD------PEMYKFVLPQD 219

Query: 968  -HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD 1026
             +   + KLRPG+  FL   +  + M + T   + YA  +  VLDP+   F  R+++R  
Sbjct: 220  LNTPYYLKLRPGVREFLNTIAPYYIMGICTNATREYADVIRAVLDPQRDKFGDRIVAR-- 277

Query: 1027 DGDPFDGDERVPKSKDLEGV-LGMES-AVVIIDDSVRVWPHNKLNLIV-VERYTYF 1079
              +  DG +     KD   + + +E+ A+V++DD   VW  +  + +V  + Y YF
Sbjct: 278  --ESVDGRD---TQKDFRKICVDVETRAIVLLDDRSDVWDSSLESQVVKAQTYEYF 328


>gi|47230493|emb|CAF99686.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 2418

 Score = 55.1 bits (131), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 52/176 (29%), Positives = 84/176 (47%), Gaps = 22/176 (12%)

Query: 902  QKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR 961
            +++ TR+     K  S  +  LVLDLD TL           VH  +    E +D      
Sbjct: 305  EEQLTRKPALPLKTRSTPEFSLVLDLDETL-----------VHCSL---NELEDAALTFP 350

Query: 962  HLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
             LF+     ++ +LRP    FLER S+++E+ L+T   K+YA ++  +LDPK  L   R+
Sbjct: 351  VLFQDVIYQVYVRLRPFFREFLERMSQIYEIILFTASKKVYADKLLNILDPKKQLVRHRL 410

Query: 1022 ISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
                +      G+      KDL  +LG + S  +IID+S + + +   N I +E +
Sbjct: 411  FR--EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 459


>gi|353230275|emb|CCD76446.1| nuclear lim interactor-interacting factor-related [Schistosoma
            mansoni]
          Length = 429

 Score = 55.1 bits (131), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 45/166 (27%), Positives = 81/166 (48%), Gaps = 30/166 (18%)

Query: 913  KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGM- 971
            KK  S+ + CLVLDLD TL++ +    ++P+ D               + +F+    G+ 
Sbjct: 285  KKTRSSPEFCLVLDLDETLVHCS----LNPLLDA--------------QFIFQVVFQGVV 326

Query: 972  ---WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 1028
               + ++RP ++ FL   S+ FE+ L+T   K+YA  +  ++DPK      R+    +  
Sbjct: 327  YMVYVRIRPHLYEFLTNVSEHFEVVLFTASTKVYADRLVNLIDPKKKWIKHRLFR--EHC 384

Query: 1029 DPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVV 1073
               +G+      KDL  VLG +    VIID+S + + +    L+++
Sbjct: 385  VCVNGN----YVKDLR-VLGRDLRKTVIIDNSPQAFGYQVFGLLLL 425


>gi|302806561|ref|XP_002985030.1| hypothetical protein SELMODRAFT_4374 [Selaginella moellendorffii]
 gi|300147240|gb|EFJ13905.1| hypothetical protein SELMODRAFT_4374 [Selaginella moellendorffii]
          Length = 177

 Score = 55.1 bits (131), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 42/138 (30%), Positives = 69/138 (50%), Gaps = 21/138 (15%)

Query: 923  LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTF 982
            L+LDLD TL+ +++   +    D ++    E   E+P           +W   RPG+  F
Sbjct: 14   LILDLDGTLIATSRQASLHACFDFVVEFDSE---EQP-----------VWVSKRPGLEDF 59

Query: 983  LERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKD 1042
            L +AS+++E+ ++++G K Y  +M + +DP G LF    ++R    D   G   +   KD
Sbjct: 60   LRQASEIYEVVVFSLGRKSYVEKMREAIDPSG-LFVATWLAR----DSCSGSSEIKDYKD 114

Query: 1043 LEG-VLGME-SAVVIIDD 1058
            L    LG E   VV +DD
Sbjct: 115  LNSPKLGRELRKVVWVDD 132


>gi|294875260|ref|XP_002767242.1| hypothetical protein Pmar_PMAR022745 [Perkinsus marinus ATCC 50983]
 gi|239868797|gb|EEQ99959.1| hypothetical protein Pmar_PMAR022745 [Perkinsus marinus ATCC 50983]
          Length = 215

 Score = 54.7 bits (130), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 45/153 (29%), Positives = 73/153 (47%), Gaps = 23/153 (15%)

Query: 931  LLNSAKFHEVDPVHDE----ILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFL-ER 985
            ++NS +  + DP   E    IL++  + +   P  +L     + + TK+RP    F+ E 
Sbjct: 1    MVNSYEIGKADPSQSESVTPILQEVYKDEEGLPELYLCVISDVKVLTKIRPHARAFIREL 60

Query: 986  ASKL---FEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKD 1042
             SK      + +YT G++ Y   + K+LDP G L  GR++SR D+          P  KD
Sbjct: 61   VSKTGCGVVLSIYTKGSRRYMEVIKKMLDPSGELIKGRLVSREDEPSNM-----TPLEKD 115

Query: 1043 LEGVLGMESAV----------VIIDDSVRVWPH 1065
             + ++  +SAV          V++DDS  VWP 
Sbjct: 116  PDFIINADSAVGTEELRRRWFVVLDDSPEVWPE 148


>gi|125526935|gb|EAY75049.1| hypothetical protein OsI_02945 [Oryza sativa Indica Group]
          Length = 577

 Score = 54.7 bits (130), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 44/153 (28%), Positives = 78/153 (50%), Gaps = 34/153 (22%)

Query: 918  ARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRP 977
            ++++ LVLDLD TL++S   H  D V D  L+              F   +  ++ + RP
Sbjct: 399  SKQITLVLDLDETLVHSTLDH-CDNV-DFTLQV------------FFNMKNHTVYVRQRP 444

Query: 978  GIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGR------VISRGDDGDPF 1031
             +  FLE+ +++FE+ ++T   ++YA ++   LDP G L + R      + S G      
Sbjct: 445  HLKMFLEKVAQMFELVIFTASQRIYAEQLIDRLDPDGRLISHRIYRESCIFSEG------ 498

Query: 1032 DGDERVPKSKDLEGVLGMESA-VVIIDDSVRVW 1063
                    +KDL  +LG++ A VVI+D++ +V+
Sbjct: 499  ------CYTKDLT-ILGVDLAKVVIVDNTPQVF 524


>gi|149490347|ref|XP_001511004.1| PREDICTED: CTD small phosphatase-like protein 2-like [Ornithorhynchus
            anatinus]
          Length = 374

 Score = 54.7 bits (130), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 42/149 (28%), Positives = 71/149 (47%), Gaps = 16/149 (10%)

Query: 870  PEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQKERTRRLEEQKKMFSARKLCLVLDL 927
            P++G   AH ++ + +   +F+ Y   +      +++  R+     K  S  +  LVLDL
Sbjct: 238  PDSGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEEQLNRKPALPLKTRSTPEFSLVLDL 297

Query: 928  DHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERAS 987
            D TL           VH  +    E +D       LF+     ++ +LRP    FLER S
Sbjct: 298  DETL-----------VHCSL---NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERMS 343

Query: 988  KLFEMHLYTMGNKLYATEMAKVLDPKGVL 1016
            +++E+ L+T   K+YA ++  +LDPK  L
Sbjct: 344  QIYEIILFTASKKVYADKLLNILDPKKQL 372


>gi|356510404|ref|XP_003523928.1| PREDICTED: uncharacterized protein LOC100810756 [Glycine max]
          Length = 469

 Score = 54.3 bits (129), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 47/146 (32%), Positives = 73/146 (50%), Gaps = 22/146 (15%)

Query: 919  RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPG 978
            +K+ LVLDLD TL++S+   + D   D     K   DRE           + ++ + RP 
Sbjct: 295  KKVTLVLDLDETLIHSS-MGQCDGAAD--FTFKMITDRE-----------LTVYVRKRPF 340

Query: 979  IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVP 1038
            +  FL + S++FE+ ++T   ++YA  +  VLDP    F+ RV            D R  
Sbjct: 341  LQEFLVKVSEMFEIIIFTASKRMYAETLLDVLDPDKKFFSRRVYRESCTW----KDRRCV 396

Query: 1039 KSKDLEGVLGMESA-VVIIDDSVRVW 1063
              KDL  VLG++ A V IID++  V+
Sbjct: 397  --KDL-TVLGIDLAKVCIIDNTPEVF 419


>gi|387594493|gb|EIJ89517.1| hypothetical protein NEQG_00287 [Nematocida parisii ERTm3]
 gi|387596665|gb|EIJ94286.1| hypothetical protein NEPG_00953 [Nematocida parisii ERTm1]
          Length = 310

 Score = 54.3 bits (129), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 35/84 (41%), Positives = 44/84 (52%), Gaps = 10/84 (11%)

Query: 997  MGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVI 1055
            MGNK YA  +A +LDP G LF  R+ISR D+   FD        KD++ +    S  VVI
Sbjct: 1    MGNKSYACSIAGLLDPTGKLFGSRIISRDDNFGCFD--------KDIKRLFPTNSKHVVI 52

Query: 1056 IDDSVRVWPHNKLNLIVVERYTYF 1079
            +DD   VW     NL  +  Y YF
Sbjct: 53   LDDRPDVWGFVD-NLYPIRPYYYF 75


>gi|357130565|ref|XP_003566918.1| PREDICTED: uncharacterized protein LOC100830008 [Brachypodium
            distachyon]
          Length = 510

 Score = 54.3 bits (129), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 44/156 (28%), Positives = 74/156 (47%), Gaps = 28/156 (17%)

Query: 912  QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGM 971
            QK     + + LVLDLD TL++S   H    + D  ++              F      +
Sbjct: 326  QKSPVRTKHVTLVLDLDETLVHSTLDHC--DIADFTIQV------------FFNMKDHTV 371

Query: 972  WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG---DDG 1028
            + + RP +  FLE+ +++FE+ ++T   K+YA ++   LDP G L + R+        DG
Sbjct: 372  YVRQRPHLKMFLEKVAQMFELVIFTASQKIYAEQIIDRLDPDGKLISQRIYRESCIFSDG 431

Query: 1029 DPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVW 1063
                       +KDL  +LG+  A V IID++ +V+
Sbjct: 432  S---------YTKDL-TILGVHLAKVAIIDNTPQVF 457


>gi|256083671|ref|XP_002578064.1| nuclear lim interactor-interacting factor-related [Schistosoma
            mansoni]
          Length = 441

 Score = 53.9 bits (128), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 47/165 (28%), Positives = 80/165 (48%), Gaps = 32/165 (19%)

Query: 913  KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGM- 971
            KK  S+ + CLVLDLD TL++ +    ++P+ D               + +F+    G+ 
Sbjct: 285  KKTRSSPEFCLVLDLDETLVHCS----LNPLLDA--------------QFIFQVVFQGVV 326

Query: 972  ---WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 1028
               + ++RP ++ FL   S+ FE+ L+T   K+YA  +  ++DPK      R+    +  
Sbjct: 327  YMVYVRIRPHLYEFLTNVSEHFEVVLFTASTKVYADRLVNLIDPKKKWIKHRLFR--EHC 384

Query: 1029 DPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRV--WPHNKLNL 1070
               +G+      KDL  VLG +    VIID+S +   + HN+  L
Sbjct: 385  VCVNGN----YVKDLR-VLGRDLRKTVIIDNSPQAFGYQHNERKL 424


>gi|294898997|ref|XP_002776453.1| NLI interacting factor, putative [Perkinsus marinus ATCC 50983]
 gi|294900793|ref|XP_002777118.1| NLI interacting factor, putative [Perkinsus marinus ATCC 50983]
 gi|239883444|gb|EER08269.1| NLI interacting factor, putative [Perkinsus marinus ATCC 50983]
 gi|239884575|gb|EER08934.1| NLI interacting factor, putative [Perkinsus marinus ATCC 50983]
          Length = 370

 Score = 53.9 bits (128), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 35/115 (30%), Positives = 55/115 (47%), Gaps = 11/115 (9%)

Query: 967  PHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDP--KGVLFAGRVISR 1024
            PH   + KLRPG+  FLE    ++E +++T   ++Y   + + LDP  KG      V SR
Sbjct: 29   PH---FVKLRPGVHQFLEALQPMYEFYIHTKATRVYLEYVMEALDPHKKGFFRNDNVFSR 85

Query: 1025 GDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTY 1078
             DD      +     +KD+  V       V+I+DD  ++W   + N+I    Y Y
Sbjct: 86   CDDMKHGSNE-----NKDIRAVCSRPREEVIILDDKDKIWLDFQPNVIKCPPYKY 135


>gi|225681687|gb|EEH19971.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb03]
          Length = 869

 Score = 53.9 bits (128), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 57/201 (28%), Positives = 84/201 (41%), Gaps = 60/201 (29%)

Query: 895  DQQKAAIQKERTRRLEEQKKMFSARKLCL--VLDLDHTLLNSAKFHEVDPVHDEILRKKE 952
            D     + K    R+EE  K        L  V+DLD T++++     VDP   E      
Sbjct: 132  DNSSLTVSKSEATRVEEDAKRRLLSSRRLSLVVDLDQTIIHAT----VDPTVAEW----- 182

Query: 953  EQDREKPHRHLFR----------FPHM-GMW--TKLRPGIWTFLERASKLFEMHLYTMGN 999
            +QDR+ P+    +           P M G W   KLRPG+  FL+  S L+E+H+YTMG 
Sbjct: 183  QQDRDNPNHEAVKDVRAFQLVDDGPGMKGCWYYIKLRPGLQEFLQEISALYELHIYTMGT 242

Query: 1000 KLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDD 1058
            +                 AG +                  +K+L+ +  +++  VVIIDD
Sbjct: 243  R-----------------AGSLT-----------------AKNLQRLFPVDTKMVVIIDD 268

Query: 1059 SVRVWPHNKLNLIVVERYTYF 1079
               VW  +  NLI V  Y +F
Sbjct: 269  RGDVWKWSD-NLIKVSPYDFF 288


>gi|326513088|dbj|BAK06784.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 445

 Score = 53.9 bits (128), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 45/164 (27%), Positives = 79/164 (48%), Gaps = 30/164 (18%)

Query: 918  ARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRP 977
            ARK+ LVLDLD TL++S             L   ++ D   P    F      ++ + RP
Sbjct: 272  ARKVTLVLDLDETLVHST------------LEHCDDADFSFPVS--FGLKEHVVYVRKRP 317

Query: 978  GIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG----DDGDPFDG 1033
             +  FL++ +++F++ ++T    +YA ++   LDP+  LF+ R         + G     
Sbjct: 318  HLHMFLQKMAEMFDVVIFTASQSVYADQLLDRLDPENTLFSKRFFRESCVFTESG----- 372

Query: 1034 DERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERY 1076
                  +KDL  V+G++ A V IID++ +V+     N I +E +
Sbjct: 373  -----YTKDLT-VIGVDLAKVAIIDNTPQVFQLQVNNGIPIESW 410


>gi|37538060|gb|AAQ92971.1| CTD-phosphatase-like protein [Hordeum vulgare subsp. vulgare]
 gi|37538062|gb|AAQ92972.1| CTD-phosphatase-like protein [Hordeum vulgare subsp. vulgare]
          Length = 445

 Score = 53.9 bits (128), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 45/164 (27%), Positives = 79/164 (48%), Gaps = 30/164 (18%)

Query: 918  ARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRP 977
            ARK+ LVLDLD TL++S             L   ++ D   P    F      ++ + RP
Sbjct: 272  ARKVTLVLDLDETLVHST------------LEHCDDADFSFPVS--FGLKEHVVYVRKRP 317

Query: 978  GIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG----DDGDPFDG 1033
             +  FL++ +++F++ ++T    +YA ++   LDP+  LF+ R         + G     
Sbjct: 318  HLHMFLQKMAEMFDVVIFTASQSVYADQLLDRLDPENTLFSKRFFRESCVFTESG----- 372

Query: 1034 DERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERY 1076
                  +KDL  V+G++ A V IID++ +V+     N I +E +
Sbjct: 373  -----YTKDLT-VIGVDLAKVAIIDNTPQVFQLQVNNGIPIESW 410


>gi|384246936|gb|EIE20424.1| hypothetical protein COCSUDRAFT_67358 [Coccomyxa subellipsoidea
            C-169]
          Length = 676

 Score = 53.9 bits (128), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 36/102 (35%), Positives = 49/102 (48%), Gaps = 17/102 (16%)

Query: 923  LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKL--RPGIW 980
            LVLDLDHTL+ S  F+   P  D               R +F     G  T    RP + 
Sbjct: 108  LVLDLDHTLIRSTLFNPHKPAKDS--------------REVF-VTGDGARTAFERRPHLT 152

Query: 981  TFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVI 1022
             FLE  S LFE+ ++T G++ YA  +  +LDP+  LF  R+ 
Sbjct: 153  HFLESVSTLFEIVVFTAGSQSYAGPLLDILDPERRLFEHRLF 194


>gi|242009525|ref|XP_002425534.1| conserved hypothetical protein [Pediculus humanus corporis]
 gi|212509409|gb|EEB12796.1| conserved hypothetical protein [Pediculus humanus corporis]
          Length = 834

 Score = 53.9 bits (128), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 48/155 (30%), Positives = 74/155 (47%), Gaps = 22/155 (14%)

Query: 923  LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTF 982
            LVLDLD TL           VH  +   +E QD       LF+     ++ + RP    F
Sbjct: 670  LVLDLDETL-----------VHCSL---QELQDASFTFPVLFQDCAYTVFVRTRPYFREF 715

Query: 983  LERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKD 1042
            LER S LFE+ L+T   ++YA ++  +LDPK      R+    +     +G+      KD
Sbjct: 716  LERVSSLFEVILFTASKRVYADKLMNLLDPKKRWIKYRLFR--EHCVCVNGN----YIKD 769

Query: 1043 LEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
            L  +LG + S  +IID+S + + +   N I +E +
Sbjct: 770  L-TILGRDLSKTIIIDNSPQAFGYQLENGIPIESW 803


>gi|298705179|emb|CBJ28610.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 482

 Score = 53.5 bits (127), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 41/119 (34%), Positives = 61/119 (51%), Gaps = 15/119 (12%)

Query: 898  KAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDRE 957
            K  + K+R   L +Q   F+ RK  L+LDLD TL++S+ F  V P  D I+    +    
Sbjct: 281  KGGLGKKRRSLLPKQLPEFAGRK-QLILDLDETLVHSS-FKPV-PGADFIMDIMVDGTFY 337

Query: 958  KPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 1016
            K            ++   RPG+  FLER +KL+E+ ++T     YA  +  VLDPKG +
Sbjct: 338  K------------VFVLKRPGVDAFLERVAKLYEVIIFTASLPQYANPLLDVLDPKGTI 384


>gi|325180168|emb|CCA14570.1| nuclear LIM factor interactorinteracting protein hyphal form putative
            [Albugo laibachii Nc14]
          Length = 418

 Score = 53.5 bits (127), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 48/161 (29%), Positives = 78/161 (48%), Gaps = 28/161 (17%)

Query: 920  KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHL---FRFPHMGMWTKLR 976
            K+CLVLDLD TL           VH  +      ++ E P+      F   +  +   LR
Sbjct: 235  KICLVLDLDETL-----------VHCSV------EEIENPNFQFDVFFNGTNYNVNVSLR 277

Query: 977  PGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDER 1036
            P +  FL+R +K FE+ ++T   ++YA ++  +LDP   L   R+    +D    DG+  
Sbjct: 278  PHMHHFLKRVTKQFELVVFTASQRVYAEKLLNLLDPNRDLIKYRLYR--EDCLEVDGN-- 333

Query: 1037 VPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERY 1076
                KDL  VLG + A V+++D+S   + +   N I +E +
Sbjct: 334  --FLKDL-NVLGRDLARVILVDNSPHAFGYQVNNGIPIESW 371


>gi|449678335|ref|XP_002165480.2| PREDICTED: CTD small phosphatase-like protein 2-like [Hydra
            magnipapillata]
          Length = 421

 Score = 53.5 bits (127), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 32/103 (31%), Positives = 53/103 (51%), Gaps = 14/103 (13%)

Query: 920  KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 979
            ++ LVLDLD TL++ +            L K E  +       +F      ++ KLRP +
Sbjct: 243  QMTLVLDLDETLVHCS------------LSKLEAYN--MTFNVVFDNVTYQLFVKLRPHL 288

Query: 980  WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVI 1022
              FLER SKL+E+ L+T   ++YA ++  ++DP+   F  R+ 
Sbjct: 289  LEFLERVSKLYEVILFTASRRVYADKLLNIIDPRRQFFRHRLF 331


>gi|224116766|ref|XP_002331872.1| predicted protein [Populus trichocarpa]
 gi|222875390|gb|EEF12521.1| predicted protein [Populus trichocarpa]
          Length = 502

 Score = 53.5 bits (127), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 49/173 (28%), Positives = 75/173 (43%), Gaps = 35/173 (20%)

Query: 903  KERTRRLEEQKKMFSARKLCLVLDLDHTL--------LNSAKFHEVDPVHDEILRKKEEQ 954
            KE  RR          + + LVLDLD           L  A       VH  +   +   
Sbjct: 303  KESCRR----------KSVTLVLDLDELCPMYNTKVELQMAFLFSETLVHSTL---EHCD 349

Query: 955  DREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKG 1014
            D +      F      ++ K RP + TFLER +++FE+ ++T    +YA ++  +LDP  
Sbjct: 350  DADFTFTVFFNMKEHIVYVKQRPHLHTFLERVAEMFEVVIFTASQSIYAAQLLDILDPDR 409

Query: 1015 VLFAGRVISRG---DDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVW 1063
             L + R+        DG           +KDL  VLG++ A V IID+S +V+
Sbjct: 410  KLISQRLYRESCIFSDG---------SYTKDL-TVLGVDLAKVAIIDNSPQVF 452


>gi|224072608|ref|XP_002303804.1| predicted protein [Populus trichocarpa]
 gi|222841236|gb|EEE78783.1| predicted protein [Populus trichocarpa]
          Length = 244

 Score = 53.5 bits (127), Expect = 7e-04,   Method: Composition-based stats.
 Identities = 53/168 (31%), Positives = 80/168 (47%), Gaps = 28/168 (16%)

Query: 913  KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMW 972
            K+  S     LVLDLD TL++SA    ++P +D         D   P    F      ++
Sbjct: 44   KQTRSCPPTTLVLDLDETLVHSA----LEPCND--------ADFTFPVN--FNLQEHTVF 89

Query: 973  TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG---DDGD 1029
             + RP +  F+ER S LFE+ ++T    +YA ++  VLDPK  +F  RV        +G+
Sbjct: 90   VRCRPYLRDFMERVSSLFEIIIFTASQSIYAEQLLNVLDPKRRIFRHRVFRESCVFVEGN 149

Query: 1030 PFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERY 1076
                       KDL  VLG + A V+IID+S + +     N I +E +
Sbjct: 150  YL---------KDLS-VLGRDLARVIIIDNSPQAFGFQVDNGIPIESW 187


>gi|167384602|ref|XP_001737021.1| RNA polymerase II ctd phosphatase [Entamoeba dispar SAW760]
 gi|165900378|gb|EDR26711.1| RNA polymerase II ctd phosphatase, putative [Entamoeba dispar SAW760]
          Length = 429

 Score = 53.5 bits (127), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 64/226 (28%), Positives = 96/226 (42%), Gaps = 37/226 (16%)

Query: 880  QSAWGDVEHLFEGYDDQQKAAIQKERTRRL-EEQKKMFSAR-----KLCLVLDLDHTLLN 933
            Q+   D   L E  DD  + +     T+   EEQK+  S R     KL L+LDLD T++ 
Sbjct: 15   QNYCVDCYQLIEDVDDYIRTSGGYGITKSYAEEQKRSVSERLLKEKKLSLILDLDGTIVF 74

Query: 934  SAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMG--MWTKLRPGIWTFLERASKLFE 991
            +     V P+ +E      E+         F  P     +  K R GI TF+E+ SKL++
Sbjct: 75   TNPELCV-PLENE------EEPITPEQGFYFEIPEQNAKVLIKFRDGIVTFMEKVSKLYD 127

Query: 992  MHLYTMGNKLYATEMAKVLDP-------KGVLFAGRVISR---GDDGDPFDG-------D 1034
            +H+ T+G K YA  +   ++         G L      S     D+ D  DG       +
Sbjct: 128  IHVVTLGQKEYAFAIVNAINKLRDTPFITGDLVTAEDCSSVIVCDEKDTNDGLIDREETN 187

Query: 1035 ERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFP 1080
            ER    + +   +G E   VI+DD + VW +      VV+   Y P
Sbjct: 188  ERRSVKRSI-PTMGKEEMQVIVDDRIDVWDNKN----VVQICEYVP 228


>gi|125571265|gb|EAZ12780.1| hypothetical protein OsJ_02697 [Oryza sativa Japonica Group]
          Length = 576

 Score = 53.1 bits (126), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 43/153 (28%), Positives = 78/153 (50%), Gaps = 34/153 (22%)

Query: 918  ARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRP 977
            ++++ LVLDLD TL++S   H  D V D  L+              F   +  ++ + RP
Sbjct: 398  SKQITLVLDLDETLVHSTLDH-CDNV-DFTLQV------------FFNMKNHTVYVRQRP 443

Query: 978  GIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGR------VISRGDDGDPF 1031
             +  FLE+ +++F++ ++T   ++YA ++   LDP G L + R      + S G      
Sbjct: 444  HLKMFLEKVAQMFDLVIFTASQRIYAEQLIDRLDPDGRLISHRIYRESCIFSEG------ 497

Query: 1032 DGDERVPKSKDLEGVLGMESA-VVIIDDSVRVW 1063
                    +KDL  +LG++ A VVI+D++ +V+
Sbjct: 498  ------CYTKDLT-ILGVDLAKVVIVDNTPQVF 523


>gi|300122627|emb|CBK23195.2| unnamed protein product [Blastocystis hominis]
          Length = 598

 Score = 53.1 bits (126), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 38/129 (29%), Positives = 60/129 (46%), Gaps = 16/129 (12%)

Query: 914  KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDRE-------KPHRH---L 963
            K    +KL L++DLD TL+++    E   +    L    E + E       K   H   L
Sbjct: 146  KFLGGKKLILIIDLDMTLVHAIHEEESIGLFLNWLHGASESNEEDEWKKTLKDQVHSIEL 205

Query: 964  FRFPHMG------MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLF 1017
            F     G      +  K+RPG+   L+  +  +EM +YT G   YA ++ +++DP   LF
Sbjct: 206  FYVDDNGSARMSKLLIKIRPGVRAMLQMLANSYEMIVYTQGENQYAEKVMQIVDPDNTLF 265

Query: 1018 AGRVISRGD 1026
              R I+RG+
Sbjct: 266  KKRFIARGE 274


>gi|237832707|ref|XP_002365651.1| NLI interacting factor-like phosphatase domain-containing protein
            [Toxoplasma gondii ME49]
 gi|211963315|gb|EEA98510.1| NLI interacting factor-like phosphatase domain-containing protein
            [Toxoplasma gondii ME49]
          Length = 1139

 Score = 53.1 bits (126), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 37/139 (26%), Positives = 62/139 (44%), Gaps = 6/139 (4%)

Query: 958  KPHRHLFRFP--HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
            +P  + F  P      + KLRP + TFL++    +EM +YT   + YA  +  +LD    
Sbjct: 664  EPELYRFELPCNRKTYYMKLRPHLRTFLKKLEPFYEMSVYTNATQEYADIVIAILDGNRQ 723

Query: 1016 LFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIV-VE 1074
            LF  R+++R D G   +  E     +  EG+   +  +V  DD   +W    L  +V  +
Sbjct: 724  LFQDRIVAR-DSGFRGEASENKAVRRLYEGM--DKRCIVAFDDRQNIWTDLPLTHVVKAQ 780

Query: 1075 RYTYFPCSRRQFGLLGPSL 1093
             Y +F   + +     P L
Sbjct: 781  HYDFFDSHKTELNAYYPPL 799


>gi|221488107|gb|EEE26321.1| RNA polymerase II phosphatase, putative [Toxoplasma gondii GT1]
 gi|221508626|gb|EEE34195.1| RNA polymerase II phosphatase, putative [Toxoplasma gondii VEG]
          Length = 1139

 Score = 53.1 bits (126), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 37/139 (26%), Positives = 62/139 (44%), Gaps = 6/139 (4%)

Query: 958  KPHRHLFRFP--HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
            +P  + F  P      + KLRP + TFL++    +EM +YT   + YA  +  +LD    
Sbjct: 664  EPELYRFELPCNRKTYYMKLRPHLRTFLKKLEPFYEMSVYTNATQEYADIVIAILDGNRQ 723

Query: 1016 LFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIV-VE 1074
            LF  R+++R D G   +  E     +  EG+   +  +V  DD   +W    L  +V  +
Sbjct: 724  LFQDRIVAR-DSGFRGEASENKAVRRLYEGM--DKRCIVAFDDRQNIWTDLPLTHVVKAQ 780

Query: 1075 RYTYFPCSRRQFGLLGPSL 1093
             Y +F   + +     P L
Sbjct: 781  HYDFFDSHKTELNAYYPPL 799


>gi|123454430|ref|XP_001314970.1| NLI interacting factor-like phosphatase family protein [Trichomonas
            vaginalis G3]
 gi|121897632|gb|EAY02747.1| NLI interacting factor-like phosphatase family protein [Trichomonas
            vaginalis G3]
          Length = 218

 Score = 52.8 bits (125), Expect = 0.001,   Method: Composition-based stats.
 Identities = 42/158 (26%), Positives = 74/158 (46%), Gaps = 25/158 (15%)

Query: 923  LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTF 982
            LVLDLD TL++++ F    P H ++                 +F     +  LRP +  F
Sbjct: 44   LVLDLDETLVHTSTF----PPHSDV--------------EALKFDDTNEYVFLRPNVKKF 85

Query: 983  LERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKD 1042
            LER S+LFE+ ++T G ++YA  +     P+      ++     D   F G++     KD
Sbjct: 86   LERVSELFEVFIFTAGTQIYAERILDSFCPQ----IDQMHRFYRDSCKFSGNK---CKKD 138

Query: 1043 LEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFP 1080
            L       + VV++DD+ ++  +   N I ++R++  P
Sbjct: 139  LNKFGRPLTKVVMVDDNYQMRSYYPQNTIYIDRWSGTP 176


>gi|148909957|gb|ABR18063.1| unknown [Picea sitchensis]
          Length = 517

 Score = 52.8 bits (125), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 32/106 (30%), Positives = 49/106 (46%), Gaps = 24/106 (22%)

Query: 921  LCLVLDLDHTLLNSAKFHEVD-----PVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKL 975
            + LVLDLD TL++S   H  D     PVH                   F      ++ + 
Sbjct: 320  ITLVLDLDETLVHSTLEHCDDADFTFPVH-------------------FNLKEHTVYVRC 360

Query: 976  RPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
            RP +  F++R + +FE+ ++T    +YA ++  VLDPK  L   RV
Sbjct: 361  RPHLQLFMDRVADMFEIIVFTASQSVYAEQLLNVLDPKRKLIRHRV 406


>gi|440290285|gb|ELP83711.1| RNA polymerase II ctd phosphatase, putative [Entamoeba invadens IP1]
          Length = 434

 Score = 52.8 bits (125), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 37/134 (27%), Positives = 62/134 (46%), Gaps = 23/134 (17%)

Query: 959  PHR-HLFRFPHMG--MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLD--PK 1013
            P R   F+ P     ++   R GI  F+ +   L+E+H+ T+G K YA  + + L+  P 
Sbjct: 94   PERGFFFQIPEYSKKVFVYFRDGIVAFMTKLITLYEIHVVTLGQKDYAMAIVEALNKLPG 153

Query: 1014 GVLFAGRVISR--------GDDGDPFDGDERVPKSKDLEGV--------LGMESAVVIID 1057
            G    G+++           DDGD F  D  + +++D E          +G E   +++D
Sbjct: 154  GPFINGKIVCSEDCISEILKDDGD-FQNDGLIERNEDTERRAVKRTVPGMGSEEVQIVVD 212

Query: 1058 DSVRVWP-HNKLNL 1070
            D + VW  HN L +
Sbjct: 213  DRIDVWDNHNVLQI 226


>gi|401408967|ref|XP_003883932.1| hypothetical protein NCLIV_036820 [Neospora caninum Liverpool]
 gi|325118349|emb|CBZ53900.1| hypothetical protein NCLIV_036820 [Neospora caninum Liverpool]
          Length = 1149

 Score = 52.8 bits (125), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 37/139 (26%), Positives = 62/139 (44%), Gaps = 6/139 (4%)

Query: 958  KPHRHLFRFP--HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
            +P  + F  P      + KLRP + TFL++    +EM +YT   + YA  +  +LD    
Sbjct: 674  EPELYRFELPCNRKTYYMKLRPYLRTFLKKLEPFYEMSVYTNATQEYADIVIAILDDNRQ 733

Query: 1016 LFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIV-VE 1074
            LF  R+++R D G   +  E     +  EG+   +  +V  DD   +W    L  +V  +
Sbjct: 734  LFQDRIVAR-DSGFRGEASENKAVRRLYEGM--DKRCIVAFDDRQNIWTDLPLTHVVKAQ 790

Query: 1075 RYTYFPCSRRQFGLLGPSL 1093
             Y +F   + +     P L
Sbjct: 791  HYDFFDSHKAELNAYYPPL 809


>gi|403338554|gb|EJY68521.1| Dullard-like phosphatase domain containing protein [Oxytricha
            trifallax]
          Length = 615

 Score = 52.8 bits (125), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 42/159 (26%), Positives = 73/159 (45%), Gaps = 24/159 (15%)

Query: 919  RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPG 978
            ++L +VLDLD+TL+     H V+ V         +Q+      +++ +         RP 
Sbjct: 437  KRLIVVLDLDNTLI-----HSVNSVPTS-----SDQNYFAIRDNIYVYK--------RPH 478

Query: 979  IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVP 1038
            +  FL   +K  +++++T   K YA ++  V+DPK     G+   R D       DER  
Sbjct: 479  MEYFLAEIAKFADIYIFTASMKDYADQIMDVIDPKKTF--GKCFYRTD----CKKDERRQ 532

Query: 1039 KSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYT 1077
              KDL  V    + +++IDD+      N LN   ++ +T
Sbjct: 533  IYKDLSTVSDDLTQLIMIDDNEINCTKNPLNTFKIKHWT 571


>gi|328874143|gb|EGG22509.1| hypothetical protein DFA_04637 [Dictyostelium fasciculatum]
          Length = 397

 Score = 52.4 bits (124), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 43/169 (25%), Positives = 76/169 (44%), Gaps = 23/169 (13%)

Query: 920  KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 979
            K+ L++++DH L +S K  E +    E + K    +                + K RP  
Sbjct: 57   KMNLIINIDHILFHSTKNPESNETQGESVIKCVVDESN------------TYYVKFRPYA 104

Query: 980  WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPK 1039
             TFL+    LF + L+++ +K Y  ++ ++LD    +F  ++ISR   G+     ++V K
Sbjct: 105  ATFLQSLQPLFNLILFSLYSKSYVFKLIELLDLNNNIFK-QIISRESFGESL-PKQQVGK 162

Query: 1040 SKDLEGV---------LGMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
               L            +    ++ I+DD   +W   + NLI  ER+TYF
Sbjct: 163  PYALWNTPSHFTKIFKISAHESLAILDDREDIWRQFRDNLISPERFTYF 211


>gi|432854554|ref|XP_004067958.1| PREDICTED: CTD small phosphatase-like protein 2-like [Oryzias
            latipes]
          Length = 381

 Score = 52.4 bits (124), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 30/107 (28%), Positives = 59/107 (55%), Gaps = 8/107 (7%)

Query: 972  WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPF 1031
            +  LRP +  FL+  +K++E+ +YT   K YA ++ ++ DP+  LF  R+    DD    
Sbjct: 240  YMILRPHVREFLQAMAKIYELFVYTCAKKEYAEKILEIFDPQKKLFRHRLYQ--DDCACV 297

Query: 1032 DGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYT 1077
             G       KDL  +LG + +  V++D++   +P++ +N I ++ ++
Sbjct: 298  LGH----YIKDL-SILGRDLTKTVVLDNAPHTYPYHLMNTIPIKSWS 339


>gi|302775067|ref|XP_002970950.1| hypothetical protein SELMODRAFT_4536 [Selaginella moellendorffii]
 gi|300161661|gb|EFJ28276.1| hypothetical protein SELMODRAFT_4536 [Selaginella moellendorffii]
          Length = 177

 Score = 52.4 bits (124), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 41/138 (29%), Positives = 69/138 (50%), Gaps = 21/138 (15%)

Query: 923  LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTF 982
            L+LDLD TL+ +++  ++    D ++    E   E+P           +W   RPG+  F
Sbjct: 14   LILDLDGTLIATSRQAKLHACFDFVVEFDSE---EQP-----------VWVSKRPGLDDF 59

Query: 983  LERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKD 1042
            L +AS+++E+ ++++G K Y  +M + +DP G  F    ++R    D   G   +   KD
Sbjct: 60   LRQASEIYEVVVFSLGRKSYVEKMREAIDPSGS-FVATWLAR----DSCSGSSEIKDYKD 114

Query: 1043 LEG-VLGME-SAVVIIDD 1058
            L    LG E   VV +DD
Sbjct: 115  LNSPKLGRELRKVVWVDD 132


>gi|410921774|ref|XP_003974358.1| PREDICTED: CTD small phosphatase-like protein 2-B-like [Takifugu
            rubripes]
          Length = 381

 Score = 52.4 bits (124), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 36/124 (29%), Positives = 66/124 (53%), Gaps = 9/124 (7%)

Query: 971  MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1030
            ++ KLRP +  FL+  +K +E+ +YT   + YA ++  +LDP+  +F  R+    +D   
Sbjct: 239  VYMKLRPHVKEFLQSVAKNYELFVYTCAKREYAEKILNILDPQRKVFRHRLYQ--EDCIC 296

Query: 1031 FDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLL 1089
              G       KDL  +LG + +  V++D+    +P++ LN I ++ +T  P   R+   L
Sbjct: 297  VLGH----YIKDL-SILGRDLTKTVVLDNMPHTYPYHLLNTIPIKSWTGEP-EDRELQKL 350

Query: 1090 GPSL 1093
             P+L
Sbjct: 351  VPTL 354


>gi|302812229|ref|XP_002987802.1| hypothetical protein SELMODRAFT_126751 [Selaginella moellendorffii]
 gi|302817447|ref|XP_002990399.1| hypothetical protein SELMODRAFT_131611 [Selaginella moellendorffii]
 gi|300141784|gb|EFJ08492.1| hypothetical protein SELMODRAFT_131611 [Selaginella moellendorffii]
 gi|300144421|gb|EFJ11105.1| hypothetical protein SELMODRAFT_126751 [Selaginella moellendorffii]
          Length = 253

 Score = 52.4 bits (124), Expect = 0.001,   Method: Composition-based stats.
 Identities = 47/160 (29%), Positives = 75/160 (46%), Gaps = 32/160 (20%)

Query: 923  LVLDLDHTLLNSAKFHEVD-----PVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRP 977
            LVLDLD TL++S   H  D     PV+                   F +    ++ + RP
Sbjct: 59   LVLDLDETLVHSTLEHCADADFSFPVY-------------------FNYQEHTVYVRRRP 99

Query: 978  GIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERV 1037
             +  FLE+ ++LFE+ ++T    +YA ++  +LDPK  L   R+    D     DG+   
Sbjct: 100  HLQVFLEKVAQLFEIIIFTASQSVYAEQLLNILDPKRKLIRHRIFR--DSCVYVDGNYL- 156

Query: 1038 PKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
               KDL  +LG + S V I+D+S + +     N I +E +
Sbjct: 157  ---KDLS-ILGRDLSKVAIVDNSPQAFGFQVDNGIPIESW 192


>gi|55740279|gb|AAV63941.1| putative nuclear LIM factor interactor-interacting protein hyphal
            form [Phytophthora infestans]
          Length = 237

 Score = 52.4 bits (124), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 33/103 (32%), Positives = 56/103 (54%), Gaps = 2/103 (1%)

Query: 920  KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 979
            ++ LVLD+D  L++S   +EV+    E  R ++ ++       +       +  K RPG+
Sbjct: 40   RIALVLDMDECLVHSKFQNEVEYRQSEY-RPEQLEEYSDSFEIVMDDGERAIVNK-RPGL 97

Query: 980  WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVI 1022
              FLE A+K ++++++T G + Y   +   LDPKG LFAGR  
Sbjct: 98   DRFLEEAAKHYDVYVFTAGLEAYGKPILDALDPKGNLFAGRFF 140


>gi|156549638|ref|XP_001604265.1| PREDICTED: RNA polymerase II subunit A C-terminal domain
            phosphatase-like, partial [Nasonia vitripennis]
          Length = 512

 Score = 52.4 bits (124), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 32/91 (35%), Positives = 50/91 (54%), Gaps = 10/91 (10%)

Query: 991  EMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVL--G 1048
            E+H+ T G + YA  +A +LD  G LF+ R++SR +  DP        K+ +L+ +   G
Sbjct: 1    ELHICTFGARQYAHRVAAILDNDGKLFSHRILSRDECFDP------QSKTANLKALFPCG 54

Query: 1049 MESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1079
            ++  V IIDD   VW     NL+ V+ Y +F
Sbjct: 55   VD-MVCIIDDRDDVW-QGCANLVQVKPYHFF 83


>gi|414881093|tpg|DAA58224.1| TPA: hypothetical protein ZEAMMB73_373456 [Zea mays]
 gi|414881094|tpg|DAA58225.1| TPA: hypothetical protein ZEAMMB73_373456 [Zea mays]
          Length = 442

 Score = 52.4 bits (124), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 45/164 (27%), Positives = 74/164 (45%), Gaps = 30/164 (18%)

Query: 918  ARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRP 977
             R + LVLDLD TL++S                K   D +      +      ++ K RP
Sbjct: 264  TRNVTLVLDLDETLVHSTM--------------KHCDDADFTFSMFYDMKEHVVYVKKRP 309

Query: 978  GIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG----DDGDPFDG 1033
             +  FL+R  ++FE+ ++T    +YA ++  +LDP+  LF+ R         D G     
Sbjct: 310  HVHMFLQRMVEMFEVVIFTASQSVYADQLLDMLDPEKKLFSKRFFRESCLITDSG----- 364

Query: 1034 DERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERY 1076
                   KDL  V+G++ A V IID++ +V+     N I +E +
Sbjct: 365  -----YRKDL-TVVGVDLAKVAIIDNTPQVFELQVNNGIPIESW 402


>gi|301118476|ref|XP_002906966.1| CTD small phosphatase-like protein, putative [Phytophthora infestans
            T30-4]
 gi|301126789|ref|XP_002909873.1| CTD small phosphatase-like protein, putative [Phytophthora infestans
            T30-4]
 gi|262101427|gb|EEY59479.1| CTD small phosphatase-like protein, putative [Phytophthora infestans
            T30-4]
 gi|262108315|gb|EEY66367.1| CTD small phosphatase-like protein, putative [Phytophthora infestans
            T30-4]
          Length = 237

 Score = 52.4 bits (124), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 33/103 (32%), Positives = 56/103 (54%), Gaps = 2/103 (1%)

Query: 920  KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 979
            ++ LVLD+D  L++S   +EV+    E  R ++ ++       +       +  K RPG+
Sbjct: 40   RIALVLDMDECLVHSKFQNEVEYRQSEY-RPEQLEEYSDSFEIVMDDGERAIVNK-RPGL 97

Query: 980  WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVI 1022
              FLE A+K ++++++T G + Y   +   LDPKG LFAGR  
Sbjct: 98   DRFLEEAAKHYDVYVFTAGLEAYGKPILDALDPKGNLFAGRFF 140


>gi|399215917|emb|CCF72605.1| unnamed protein product [Babesia microti strain RI]
          Length = 664

 Score = 52.0 bits (123), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 39/127 (30%), Positives = 58/127 (45%), Gaps = 13/127 (10%)

Query: 959  PHRHLFRFPH---MGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
            P  + F  P    +  + KLRP +  FL   S  +EM +YT   + YA  +  +LDP   
Sbjct: 212  PELYTFTLPSYADVSYYLKLRPRLREFLHILSFYYEMSIYTNATREYADVVIAILDPDRS 271

Query: 1016 LFAGRVISRGDDGDPFDGDER--VPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIV- 1072
            LF  R+I+RG       G++R     ++ L   L  +  VV  DD   VW     N ++ 
Sbjct: 272  LFMDRIIARG------GGNDRGLTKSARRLYPKLS-QRFVVSFDDRRDVWTDIDPNQVLK 324

Query: 1073 VERYTYF 1079
               Y+YF
Sbjct: 325  AHHYSYF 331


>gi|449505979|ref|XP_004162620.1| PREDICTED: uncharacterized protein LOC101226452 [Cucumis sativus]
          Length = 470

 Score = 52.0 bits (123), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 44/152 (28%), Positives = 73/152 (48%), Gaps = 42/152 (27%)

Query: 923  LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP-------HMGMWTKL 975
            LVLDLD TL++S             L  +++ D        FRF        H+ ++ K 
Sbjct: 300  LVLDLDETLVHST------------LEPQDDAD--------FRFTVCLNMKEHI-VYVKR 338

Query: 976  RPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG---DDGDPFD 1032
            RP +  FL+R +++FE+ ++T    +YA ++   LDP   + + R+        DG    
Sbjct: 339  RPYLQIFLDRVAEMFEVAIFTASQSIYAEQVLNKLDPDNCIISRRLYRESCIFSDG---- 394

Query: 1033 GDERVPKSKDLEGVLGMESA-VVIIDDSVRVW 1063
                   +KDL  VLG++ A VVI+D+  +V+
Sbjct: 395  -----CYTKDL-TVLGIDLAKVVIVDNYPQVF 420


>gi|300121382|emb|CBK21762.2| unnamed protein product [Blastocystis hominis]
          Length = 399

 Score = 51.6 bits (122), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 65/251 (25%), Positives = 107/251 (42%), Gaps = 37/251 (14%)

Query: 824  SVSQPLTSEPMVSQNS-PIQPGQIKSGADMKA--VVTNHDDKQTG------TGSGPEAGP 874
            S S  +   PMV+ +  P  P  I  G D ++  +V++H  ++          S PE   
Sbjct: 118  SASTIIRQYPMVTLSCFPSSPKVIIRGKDKQSSDLVSSHHKEEIEPASPNLITSAPETQQ 177

Query: 875  VGAHPQSAWGDVEHL--FEGYDDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLL 932
              A P+S +  V  +     YD    AA+Q+    R+    K  +A K  LVLDLD TL+
Sbjct: 178  -EAEPESTFNPVIIIKNLPPYD-SLPAALQE----RVLLPPKSPTAPKYTLVLDLDETLV 231

Query: 933  NSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEM 992
            + +   E DP  D     + E  R              ++  +RP ++  L+R +  +E+
Sbjct: 232  HCSM--ERDPSADLAFSIRHEGQR------------FTIYANVRPFLFYLLKRVAPYYEI 277

Query: 993  HLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA 1052
             +YT   K YA  +  +LD +  L   R+    +     DG+      KDL  +    S 
Sbjct: 278  VIYTASQKCYADRLLDILDSEQHLITHRLY--REHCLNIDGN----YIKDLNALNRDLSK 331

Query: 1053 VVIIDDSVRVW 1063
             VI+D+ +  +
Sbjct: 332  TVIVDNYISCF 342


>gi|449433585|ref|XP_004134578.1| PREDICTED: uncharacterized protein LOC101215257 [Cucumis sativus]
          Length = 484

 Score = 51.6 bits (122), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 44/152 (28%), Positives = 73/152 (48%), Gaps = 42/152 (27%)

Query: 923  LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP-------HMGMWTKL 975
            LVLDLD TL++S             L  +++ D        FRF        H+ ++ K 
Sbjct: 314  LVLDLDETLVHST------------LEPQDDAD--------FRFTVCLNMKEHI-VYVKR 352

Query: 976  RPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG---DDGDPFD 1032
            RP +  FL+R +++FE+ ++T    +YA ++   LDP   + + R+        DG    
Sbjct: 353  RPYLQIFLDRVAEMFEVAIFTASQSIYAEQVLNKLDPDNCIISRRLYRESCIFSDG---- 408

Query: 1033 GDERVPKSKDLEGVLGMESA-VVIIDDSVRVW 1063
                   +KDL  VLG++ A VVI+D+  +V+
Sbjct: 409  -----CYTKDL-TVLGIDLAKVVIVDNYPQVF 434


>gi|449450582|ref|XP_004143041.1| PREDICTED: uncharacterized protein LOC101204959 [Cucumis sativus]
          Length = 1024

 Score = 51.6 bits (122), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 47/158 (29%), Positives = 74/158 (46%), Gaps = 28/158 (17%)

Query: 923  LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTF 982
            LVLDLD TL++S     ++P  D         D   P    F      ++ + RP +  F
Sbjct: 244  LVLDLDETLVHST----LEPCVD--------ADFTFPVN--FNLQEHTVYVRCRPYLRDF 289

Query: 983  LERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD---DGDPFDGDERVPK 1039
            +E  ++ FE+ ++T    +YA ++  VLDPK  +F  RV        DG+          
Sbjct: 290  MEAVARHFEIIIFTASQSIYAEQLLNVLDPKRKIFRHRVFRESCVFVDGNYL-------- 341

Query: 1040 SKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERY 1076
             KDL  VLG + A V+I+D+S + +     N I +E +
Sbjct: 342  -KDL-SVLGRDLARVIIVDNSPQAFGFQVDNGIPIESW 377


>gi|215695024|dbj|BAG90215.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 269

 Score = 51.6 bits (122), Expect = 0.002,   Method: Composition-based stats.
 Identities = 46/153 (30%), Positives = 77/153 (50%), Gaps = 36/153 (23%)

Query: 918  ARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP-------HMG 970
            ARK+ LVLDLD TL++S                      E+   + F FP       HM 
Sbjct: 143  ARKVTLVLDLDETLVHSTT--------------------EQCDDYDFTFPVFFDMKEHM- 181

Query: 971  MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1030
            ++ + RP +  FL++ +++FE+ ++T    +YA ++  +LDP+  LF+ R      +   
Sbjct: 182  VYVRKRPHLHMFLQKMAEMFEVVIFTASQSVYADQLLDILDPEKKLFSRRYFR---ESCV 238

Query: 1031 FDGDERVPKSKDLEGVLGMESA-VVIIDDSVRV 1062
            F        +KDL  V+G++ A VVIID++ +V
Sbjct: 239  F---TNTSYTKDLT-VVGVDLAKVVIIDNTPQV 267


>gi|449494439|ref|XP_004159546.1| PREDICTED: CTD small phosphatase-like protein 2-like [Cucumis
            sativus]
          Length = 446

 Score = 51.6 bits (122), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 47/158 (29%), Positives = 74/158 (46%), Gaps = 28/158 (17%)

Query: 923  LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTF 982
            LVLDLD TL++S     ++P  D         D   P    F      ++ + RP +  F
Sbjct: 251  LVLDLDETLVHST----LEPCVD--------ADFTFPVN--FNLQEHTVYVRCRPYLRDF 296

Query: 983  LERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD---DGDPFDGDERVPK 1039
            +E  ++ FE+ ++T    +YA ++  VLDPK  +F  RV        DG+          
Sbjct: 297  MEAVARHFEIIIFTASQSIYAEQLLNVLDPKRKIFRHRVFRESCVFVDGNYL-------- 348

Query: 1040 SKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERY 1076
             KDL  VLG + A V+I+D+S + +     N I +E +
Sbjct: 349  -KDL-SVLGRDLARVIIVDNSPQAFGFQVDNGIPIESW 384


>gi|367002193|ref|XP_003685831.1| hypothetical protein TPHA_0E03070 [Tetrapisispora phaffii CBS 4417]
 gi|357524130|emb|CCE63397.1| hypothetical protein TPHA_0E03070 [Tetrapisispora phaffii CBS 4417]
          Length = 494

 Score = 51.6 bits (122), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 32/105 (30%), Positives = 53/105 (50%), Gaps = 14/105 (13%)

Query: 918  ARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRP 977
             RK CL+LDLD TL++S+ F  VD   D ++      D +  H ++ +          RP
Sbjct: 321  GRKKCLILDLDETLVHSS-FKYVDSA-DFVIPVT--IDNQTHHVYVIK----------RP 366

Query: 978  GIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVI 1022
            G+  FL+R S+L+E+ ++T     Y   +  +LDP   +   R+ 
Sbjct: 367  GVDEFLKRVSELYEVVVFTASVSRYGDPLLNILDPANTIIHHRLF 411


>gi|55740293|gb|AAV63948.1| putative nuclear LIM interactor-interacting protein [Phytophthora
            sojae]
 gi|348665891|gb|EGZ05719.1| hypothetical protein PHYSODRAFT_551168 [Phytophthora sojae]
          Length = 237

 Score = 51.6 bits (122), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 33/103 (32%), Positives = 56/103 (54%), Gaps = 2/103 (1%)

Query: 920  KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 979
            ++ LVLD+D  L++S   +EV+    E  R ++ ++       +       +  K RPG+
Sbjct: 40   RIALVLDMDECLVHSKFQNEVEYRQSEY-RPEQLEEYGDSFEIVMDDGERAVVNK-RPGL 97

Query: 980  WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVI 1022
              FLE A+K ++++++T G + Y   +   LDPKG LFAGR  
Sbjct: 98   DRFLEEAAKHYDVYVFTAGLEAYGKPILDALDPKGNLFAGRFF 140


>gi|226506682|ref|NP_001149415.1| CTD-phosphatase-like protein [Zea mays]
 gi|195627078|gb|ACG35369.1| CTD-phosphatase-like protein [Zea mays]
 gi|414881341|tpg|DAA58472.1| TPA: CTD-phosphatase-like protein [Zea mays]
          Length = 460

 Score = 51.2 bits (121), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 40/150 (26%), Positives = 72/150 (48%), Gaps = 28/150 (18%)

Query: 918  ARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRP 977
             + + LVLDLD TL++S             L + +  D        F   +  ++ K RP
Sbjct: 284  TKHVTLVLDLDETLVHST------------LDQCDSADFTL--EVFFNMKNHTVYVKKRP 329

Query: 978  GIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG---DDGDPFDGD 1034
             +  FLE+ +++FE+ ++T   ++YA ++   LDP G   + R+        DG      
Sbjct: 330  YLKVFLEKVAQMFELVIFTASQRIYAEQLIDKLDPDGKYISRRIYRESCIFSDG------ 383

Query: 1035 ERVPKSKDLEGVLGMESA-VVIIDDSVRVW 1063
                 +KDL  +LG++ A V I+D++ +V+
Sbjct: 384  ---CYTKDL-TILGIDLAKVAIVDNTPQVF 409


>gi|356515353|ref|XP_003526365.1| PREDICTED: uncharacterized protein LOC100813300 [Glycine max]
          Length = 467

 Score = 51.2 bits (121), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 44/146 (30%), Positives = 72/146 (49%), Gaps = 23/146 (15%)

Query: 919  RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPG 978
            +K+ L LDLD TL++S+   + D         K   DRE+            ++ + RP 
Sbjct: 294  KKVTLALDLDETLIHSS-MEQCDGAD---FTFKMITDRERT-----------VYVRKRPF 338

Query: 979  IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVP 1038
            +  FL + S++FE+ ++T   ++YA  +  VLDP    F+ RV       +     +R  
Sbjct: 339  LQEFLAKVSEMFEIIIFTASKRMYAETLLDVLDPDKKFFSRRVCR-----ESCTWKDRCC 393

Query: 1039 KSKDLEGVLGMESA-VVIIDDSVRVW 1063
              KDL  VLG++ A V IID++  V+
Sbjct: 394  -VKDL-TVLGIDLAKVCIIDNTPEVF 417


>gi|302847022|ref|XP_002955046.1| hypothetical protein VOLCADRAFT_121370 [Volvox carteri f.
            nagariensis]
 gi|300259574|gb|EFJ43800.1| hypothetical protein VOLCADRAFT_121370 [Volvox carteri f.
            nagariensis]
          Length = 1180

 Score = 50.8 bits (120), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 49/159 (30%), Positives = 74/159 (46%), Gaps = 26/159 (16%)

Query: 919  RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPG 978
            +++ LVLDLD TL+ S       PV  +    +E            RF    +W  LRPG
Sbjct: 564  QRMTLVLDLDGTLIASEDEPHA-PVPFDYCVDEE------------RF----VW--LRPG 604

Query: 979  IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVP 1038
            +  FL+     FE+ L+T   + +AT   + +DP GV+F  R+       D     +  P
Sbjct: 605  LRRFLDSVRPHFEVVLFTAAGESWATSALQRIDPDGVIFDSRLYR-----DHTVSHDDWP 659

Query: 1039 KSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERY 1076
              KDL   LG + A VVI+DD+  ++ +   N + V  Y
Sbjct: 660  WVKDLS-RLGRDLARVVIVDDNPLMFMYQPDNALHVAAY 697


>gi|302817700|ref|XP_002990525.1| hypothetical protein SELMODRAFT_131775 [Selaginella moellendorffii]
 gi|302817706|ref|XP_002990528.1| hypothetical protein SELMODRAFT_131706 [Selaginella moellendorffii]
 gi|300141693|gb|EFJ08402.1| hypothetical protein SELMODRAFT_131775 [Selaginella moellendorffii]
 gi|300141696|gb|EFJ08405.1| hypothetical protein SELMODRAFT_131706 [Selaginella moellendorffii]
          Length = 213

 Score = 50.8 bits (120), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 37/121 (30%), Positives = 64/121 (52%), Gaps = 19/121 (15%)

Query: 923  LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTF 982
            L+LDLD TL+ +++   +    D ++   E   +E+P           +W   RPG+  F
Sbjct: 20   LILDLDGTLIATSRQAGLHAKLDFVV---EFDPQEQP-----------VWVCKRPGLDDF 65

Query: 983  LERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKD 1042
            L +AS+LFE+ ++++G + Y  +M + +DP G L A   +SR    D   G + + + KD
Sbjct: 66   LSKASQLFEVVVFSLGKRAYVEKMREKIDPSGSLVAF-WLSR----DSCSGSDAIKEYKD 120

Query: 1043 L 1043
            L
Sbjct: 121  L 121


>gi|426378923|ref|XP_004056157.1| PREDICTED: CTD small phosphatase-like protein 2 [Gorilla gorilla
            gorilla]
          Length = 398

 Score = 50.4 bits (119), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 33/104 (31%), Positives = 56/104 (53%), Gaps = 8/104 (7%)

Query: 963  LFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVI 1022
            LF+     ++ +LRP    FLER S+++E+ L+T   K+YA ++  +LDPK  L   R+ 
Sbjct: 280  LFQDVIYQVYVRLRPFFREFLERMSQMYEIILFTASKKVYADKLLNILDPKKQLVRHRLF 339

Query: 1023 SRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPH 1065
               +      G+      KDL  +LG + S  +IID+S + + +
Sbjct: 340  R--EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAY 376


>gi|224035555|gb|ACN36853.1| unknown [Zea mays]
 gi|414881338|tpg|DAA58469.1| TPA: hypothetical protein ZEAMMB73_648049 [Zea mays]
 gi|414881339|tpg|DAA58470.1| TPA: hypothetical protein ZEAMMB73_648049 [Zea mays]
 gi|414881340|tpg|DAA58471.1| TPA: hypothetical protein ZEAMMB73_648049 [Zea mays]
          Length = 397

 Score = 50.4 bits (119), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 40/150 (26%), Positives = 72/150 (48%), Gaps = 28/150 (18%)

Query: 918  ARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRP 977
             + + LVLDLD TL++S             L + +  D        F   +  ++ K RP
Sbjct: 221  TKHVTLVLDLDETLVHST------------LDQCDSADFTL--EVFFNMKNHTVYVKKRP 266

Query: 978  GIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG---DDGDPFDGD 1034
             +  FLE+ +++FE+ ++T   ++YA ++   LDP G   + R+        DG      
Sbjct: 267  YLKVFLEKVAQMFELVIFTASQRIYAEQLIDKLDPDGKYISRRIYRESCIFSDG------ 320

Query: 1035 ERVPKSKDLEGVLGMESA-VVIIDDSVRVW 1063
                 +KDL  +LG++ A V I+D++ +V+
Sbjct: 321  ---CYTKDL-TILGIDLAKVAIVDNTPQVF 346


>gi|157125124|ref|XP_001660632.1| hypothetical protein AaeL_AAEL010078 [Aedes aegypti]
 gi|108873763|gb|EAT37988.1| AAEL010078-PA [Aedes aegypti]
          Length = 678

 Score = 50.4 bits (119), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 47/164 (28%), Positives = 79/164 (48%), Gaps = 22/164 (13%)

Query: 914  KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWT 973
            K  S+ +  LVLDLD TL           VH  +   +E  D       LF+     ++ 
Sbjct: 493  KTRSSPEFSLVLDLDETL-----------VHCSL---QELSDASFKFPVLFQECKYTVFV 538

Query: 974  KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDG 1033
            + RP    FLE+ S++FE+ L+T   ++YA ++  +LDP+  L   R+    +     +G
Sbjct: 539  RTRPFFREFLEKVSQIFEVILFTASKRVYADKLLNLLDPERRLIKYRLFR--EHCVLVNG 596

Query: 1034 DERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
            +      KDL  +LG + S  +IID+S + + +   N I +E +
Sbjct: 597  N----YIKDLT-ILGRDLSKTIIIDNSPQAFGYQLENGIPIESW 635


>gi|224057698|ref|XP_002299297.1| predicted protein [Populus trichocarpa]
 gi|222846555|gb|EEE84102.1| predicted protein [Populus trichocarpa]
          Length = 256

 Score = 50.4 bits (119), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 30/91 (32%), Positives = 45/91 (49%), Gaps = 14/91 (15%)

Query: 923  LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTF 982
            LVLDLD TL++S             L   ++ D   P    F      ++ + RP +  F
Sbjct: 81   LVLDLDETLVHST------------LEPCDDADFTFPVN--FNLQQHTVFVRCRPYLRDF 126

Query: 983  LERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
            +ER S LFE+ ++T    +YA ++  VLDPK
Sbjct: 127  MERVSSLFEIIIFTASQSIYAEQLLNVLDPK 157


>gi|308485158|ref|XP_003104778.1| CRE-SCPL-3 protein [Caenorhabditis remanei]
 gi|308257476|gb|EFP01429.1| CRE-SCPL-3 protein [Caenorhabditis remanei]
          Length = 292

 Score = 50.4 bits (119), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 31/112 (27%), Positives = 56/112 (50%), Gaps = 26/112 (23%)

Query: 917  SARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMG------ 970
            S+ +  LVLDLD TL++ +    + P+ +  +                 FP M       
Sbjct: 61   SSAEYTLVLDLDETLVHCS----LTPLDNATMI----------------FPVMFQDITYQ 100

Query: 971  MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVI 1022
            ++ +LRP + TFL R SK+FE+ ++T   K+YA ++  ++DP+  +   R+ 
Sbjct: 101  VYVRLRPHLRTFLRRMSKIFEIIIFTASKKVYANKLCDIIDPQKTMIRHRLF 152


>gi|118390259|ref|XP_001028120.1| NLI interacting factor-like phosphatase family protein [Tetrahymena
            thermophila]
 gi|89309890|gb|EAS07878.1| NLI interacting factor-like phosphatase family protein [Tetrahymena
            thermophila SB210]
          Length = 623

 Score = 50.4 bits (119), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 40/153 (26%), Positives = 69/153 (45%), Gaps = 22/153 (14%)

Query: 919  RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPG 978
            +K  L+LDLD TL++  +   +D   D IL      D +   + +     +     +RP 
Sbjct: 432  KKKTLILDLDETLIHCNE--SLDNSSDFIL------DIQADSKEV-----VQAGINVRPF 478

Query: 979  IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD--DGDPFDGDER 1036
               FLE  S L+E+ ++T    +YA E+   LDP+      R+           +  D R
Sbjct: 479  AKQFLEEMSHLYEIVIFTASRSVYANEVINKLDPQNKFIFKRLFRENCIYKNRIYIKDLR 538

Query: 1037 VPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLN 1069
            + K++D++        +VI+D+    + HN LN
Sbjct: 539  IFKNRDIKN-------LVIVDNCCLSFCHNILN 564


>gi|293332237|ref|NP_001167877.1| uncharacterized protein LOC100381584 [Zea mays]
 gi|223944585|gb|ACN26376.1| unknown [Zea mays]
 gi|413950698|gb|AFW83347.1| hypothetical protein ZEAMMB73_634755 [Zea mays]
          Length = 419

 Score = 50.4 bits (119), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 38/149 (25%), Positives = 71/149 (47%), Gaps = 28/149 (18%)

Query: 919  RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPG 978
            + + LVLDLD TL++S   H                + +      F   +  ++ + RP 
Sbjct: 242  KHVTLVLDLDETLVHSTLDHC--------------DNADFTLEVFFNMKNHTVYVRKRPY 287

Query: 979  IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG---DDGDPFDGDE 1035
            +  FLE+ +++FE+ ++T   ++YA ++   LDP G   + R+        DG       
Sbjct: 288  LKMFLEKVAQMFEVVIFTASQRVYAEQLIDKLDPDGKYISRRIYRESCVFSDG------- 340

Query: 1036 RVPKSKDLEGVLGMESA-VVIIDDSVRVW 1063
                +KDL  +LG++ A V I+D++ +V+
Sbjct: 341  --CYTKDL-TILGIDLAKVAIVDNTPQVF 366


>gi|390356060|ref|XP_003728694.1| PREDICTED: CTD small phosphatase-like protein 2-like isoform 1
            [Strongylocentrotus purpuratus]
          Length = 514

 Score = 50.4 bits (119), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 51/176 (28%), Positives = 83/176 (47%), Gaps = 24/176 (13%)

Query: 902  QKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR 961
            QK RT  L  + +   + K  LVLDLD TL++ +            L + E      P  
Sbjct: 319  QKNRTPVLPLKTR--RSPKYSLVLDLDETLVHCS------------LAEMENCTMSFPV- 363

Query: 962  HLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
              F+     ++ + RP    FLER SK+FE+ L+T   ++YA ++  +LDP+  L   R+
Sbjct: 364  -YFQDNEYQVYVRTRPFFRDFLERMSKIFEIILFTASKRVYADKLLNLLDPEKKLVRHRL 422

Query: 1022 ISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
                +      G+      KDL  +LG + +  VIID+S + + +   N I +E +
Sbjct: 423  FR--EHCICVQGN----YIKDL-NILGRDLTKTVIIDNSPQAFGYQLENGIPIESW 471


>gi|118354395|ref|XP_001010460.1| NLI interacting factor-like phosphatase family protein [Tetrahymena
            thermophila]
 gi|89292227|gb|EAR90215.1| NLI interacting factor-like phosphatase family protein [Tetrahymena
            thermophila SB210]
          Length = 540

 Score = 50.1 bits (118), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 45/178 (25%), Positives = 81/178 (45%), Gaps = 31/178 (17%)

Query: 918  ARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREK--PHRHLFRFPHMG----- 970
            ++KL L+LDLD TL+N+        V +E   K EE  R K   ++++    +       
Sbjct: 201  SQKLNLILDLDETLVNTV------WVTNENQSKLEEIYRYKMPSNKNVITIQYSNNEGAQ 254

Query: 971  --MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 1028
                T LRP +  F+    K F + +Y+ G K Y  ++   +DP+  LF    I +    
Sbjct: 255  KEFITILRPHLKEFVTEMKKYFNILVYSHGRKDYVLKLLDKIDPRRDLFNRNNIFKN--- 311

Query: 1029 DPFDGDERVPKSKDLEGVL------GMESAV---VIIDDSVRVWPHNKL-NLIVVERY 1076
               +G   +   KD++ ++       +E A+   +IIDD   +W      N++ ++R+
Sbjct: 312  ---EGQVNIKTQKDIKNIIECDSPSALEKALKSSIIIDDIFEIWLEETFPNVVPIKRF 366


>gi|260789874|ref|XP_002589969.1| hypothetical protein BRAFLDRAFT_224775 [Branchiostoma floridae]
 gi|229275156|gb|EEN45980.1| hypothetical protein BRAFLDRAFT_224775 [Branchiostoma floridae]
          Length = 232

 Score = 50.1 bits (118), Expect = 0.007,   Method: Composition-based stats.
 Identities = 38/120 (31%), Positives = 60/120 (50%), Gaps = 14/120 (11%)

Query: 902  QKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR 961
            ++ R R+     K  S  +  LVLDLD TL++ +            L + E+ +   P  
Sbjct: 35   EEMRQRQPALPLKTRSTPEFSLVLDLDETLVHCS------------LNELEDANLTFPV- 81

Query: 962  HLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
             LF+     ++ + RP    FLER SKL+E+ L+T   K+YA ++  +LDPK  L   R+
Sbjct: 82   -LFQDVTYQVYVRTRPYYREFLERMSKLYEIILFTASKKVYADKLMNILDPKKELVRHRL 140


>gi|390356058|ref|XP_788296.3| PREDICTED: CTD small phosphatase-like protein 2-like isoform 2
            [Strongylocentrotus purpuratus]
          Length = 485

 Score = 50.1 bits (118), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 51/176 (28%), Positives = 83/176 (47%), Gaps = 24/176 (13%)

Query: 902  QKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR 961
            QK RT  L  + +   + K  LVLDLD TL++ +            L + E      P  
Sbjct: 290  QKNRTPVLPLKTR--RSPKYSLVLDLDETLVHCS------------LAEMENCTMSFPV- 334

Query: 962  HLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
              F+     ++ + RP    FLER SK+FE+ L+T   ++YA ++  +LDP+  L   R+
Sbjct: 335  -YFQDNEYQVYVRTRPFFRDFLERMSKIFEIILFTASKRVYADKLLNLLDPEKKLVRHRL 393

Query: 1022 ISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
                +      G+      KDL  +LG + +  VIID+S + + +   N I +E +
Sbjct: 394  FR--EHCICVQGN----YIKDL-NILGRDLTKTVIIDNSPQAFGYQLENGIPIESW 442


>gi|85001578|ref|XP_955502.1| ctd-like phosphatase [Theileria annulata strain Ankara]
 gi|65303648|emb|CAI76026.1| ctd-like phosphatase, putative [Theileria annulata]
          Length = 832

 Score = 50.1 bits (118), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 19/53 (35%), Positives = 32/53 (60%)

Query: 972  WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISR 1024
            + KLRPGI+ F  +    F + L+T G K +A    +++DP+ + F+ R+ SR
Sbjct: 301  YFKLRPGIFNFFHQIRDKFTLFLFTTGTKQHAESALQIIDPQLIYFSNRIFSR 353


>gi|357135834|ref|XP_003569513.1| PREDICTED: uncharacterized protein LOC100822852 [Brachypodium
            distachyon]
          Length = 447

 Score = 50.1 bits (118), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 40/150 (26%), Positives = 71/150 (47%), Gaps = 30/150 (20%)

Query: 919  RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPG 978
            +K+ LVLDLD TL++S   H                D +      F      ++ + RP 
Sbjct: 270  KKVTLVLDLDETLVHSTMEHC--------------SDADFTFPVFFDMKEHVVYVRKRPH 315

Query: 979  IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG----DDGDPFDGD 1034
            +  FL++ +++F++ ++T    +YA ++   LDP+  LF  R         + G      
Sbjct: 316  LHIFLQKMAEMFDVVIFTASQSVYADQLLDRLDPEKTLFCKRFFRESCVFTESG------ 369

Query: 1035 ERVPKSKDLEGVLGMESA-VVIIDDSVRVW 1063
                 +KDL  V+G++ A VVIID++ +V+
Sbjct: 370  ----YTKDLT-VVGVDLAKVVIIDNTPQVF 394


>gi|413950699|gb|AFW83348.1| hypothetical protein ZEAMMB73_634755 [Zea mays]
          Length = 400

 Score = 50.1 bits (118), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 38/149 (25%), Positives = 71/149 (47%), Gaps = 28/149 (18%)

Query: 919  RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPG 978
            + + LVLDLD TL++S   H                + +      F   +  ++ + RP 
Sbjct: 223  KHVTLVLDLDETLVHSTLDHC--------------DNADFTLEVFFNMKNHTVYVRKRPY 268

Query: 979  IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG---DDGDPFDGDE 1035
            +  FLE+ +++FE+ ++T   ++YA ++   LDP G   + R+        DG       
Sbjct: 269  LKMFLEKVAQMFEVVIFTASQRVYAEQLIDKLDPDGKYISRRIYRESCVFSDG------- 321

Query: 1036 RVPKSKDLEGVLGMESA-VVIIDDSVRVW 1063
                +KDL  +LG++ A V I+D++ +V+
Sbjct: 322  --CYTKDL-TILGIDLAKVAIVDNTPQVF 347


>gi|431896052|gb|ELK05470.1| CTD small phosphatase-like protein 2 [Pteropus alecto]
          Length = 282

 Score = 50.1 bits (118), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 34/107 (31%), Positives = 58/107 (54%), Gaps = 8/107 (7%)

Query: 971  MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1030
            ++ +LRP    FLER S+++E+ L+T   K+YA ++  +LDPK  L   R+    +    
Sbjct: 140  VYVRLRPFFREFLERMSQMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR--EHCVC 197

Query: 1031 FDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
              G+      KDL  +LG + S  +IID+S + + +   N I +E +
Sbjct: 198  VQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 239


>gi|195996503|ref|XP_002108120.1| hypothetical protein TRIADDRAFT_18774 [Trichoplax adhaerens]
 gi|190588896|gb|EDV28918.1| hypothetical protein TRIADDRAFT_18774, partial [Trichoplax adhaerens]
          Length = 208

 Score = 49.7 bits (117), Expect = 0.008,   Method: Composition-based stats.
 Identities = 48/159 (30%), Positives = 76/159 (47%), Gaps = 24/159 (15%)

Query: 920  KLCLVLDLDHTLLN-SAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPG 978
            +  LV+DLD TL++ S    E   +H  I  K    D               ++ +LRP 
Sbjct: 30   EFTLVIDLDETLVHCSLSLLEDANLHFPIYFKNNNYD---------------VYVRLRPY 74

Query: 979  IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVP 1038
               FLER SK++E+ L+T   K+YA ++  ++DP   L   R+     +   F     V 
Sbjct: 75   YREFLERVSKIYEVILFTASKKVYANKLMDIIDPGRKLVKHRLFR---EHCVFVHGNYV- 130

Query: 1039 KSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
              KDL G+LG + S  VI+D+S + + +   N I +E +
Sbjct: 131  --KDL-GILGRDLSKTVIVDNSPQAFGYQLSNGIPIESW 166


>gi|297843870|ref|XP_002889816.1| hypothetical protein ARALYDRAFT_888325 [Arabidopsis lyrata subsp.
            lyrata]
 gi|297335658|gb|EFH66075.1| hypothetical protein ARALYDRAFT_888325 [Arabidopsis lyrata subsp.
            lyrata]
          Length = 100

 Score = 49.7 bits (117), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 42/115 (36%), Positives = 56/115 (48%), Gaps = 29/115 (25%)

Query: 1008 KVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNK 1067
            K+LDPKG  F+ R+ISR       DG  R  KS D   V+G E AV+ +D+S  VW    
Sbjct: 7    KLLDPKGKYFSDRIISRD------DGTVRHKKSLD---VMGNEEAVLFVDESKIVWQ--- 54

Query: 1068 LNLIVVERY-TYFPCSRRQF---GLLGPSLLEIDHDERSEDGTLASSLGVRQQLH 1118
                  ++Y  +F  S +QF     L P       DE   DG L++ L V +Q H
Sbjct: 55   ------KKYGEFFASSCKQFKEDSKLLP-------DESESDGALSTVLNVLKQTH 96


>gi|302814947|ref|XP_002989156.1| hypothetical protein SELMODRAFT_129286 [Selaginella moellendorffii]
 gi|300143056|gb|EFJ09750.1| hypothetical protein SELMODRAFT_129286 [Selaginella moellendorffii]
          Length = 245

 Score = 49.7 bits (117), Expect = 0.010,   Method: Composition-based stats.
 Identities = 46/157 (29%), Positives = 73/157 (46%), Gaps = 22/157 (14%)

Query: 921  LCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIW 980
            + LVLDLD TL++S   H  +      L                 F    ++ + RP + 
Sbjct: 45   VALVLDLDETLVHSTTDHCGNADFSFSLHAN--------------FQRQTVYVRRRPHLQ 90

Query: 981  TFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKS 1040
             F+ER ++LFE+ ++T     YA ++  +LDPK  +F  R+    D     DG+      
Sbjct: 91   MFMERVAQLFEIIVFTASQSTYAEKLLNILDPKRKVFRHRIFR--DSCVLVDGNYL---- 144

Query: 1041 KDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
            KDL  VLG + S  VI+D+S + +     N I +E +
Sbjct: 145  KDLS-VLGRDLSKTVIVDNSPQAFGFQVDNGIPIESW 180


>gi|156404147|ref|XP_001640269.1| predicted protein [Nematostella vectensis]
 gi|156227402|gb|EDO48206.1| predicted protein [Nematostella vectensis]
          Length = 289

 Score = 49.7 bits (117), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 51/173 (29%), Positives = 83/173 (47%), Gaps = 31/173 (17%)

Query: 905  RTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLF 964
            RTRR  E    FS     LVLDLD TL++ +            L K E+     P    +
Sbjct: 97   RTRRTPE----FS-----LVLDLDETLVHCS------------LNKLEDATLSFPVS--Y 133

Query: 965  RFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISR 1024
            +     ++ + RP +  FLER SK+FE+ L+T   ++YA ++  +LDP+   F  R+   
Sbjct: 134  QDITYQVFVRTRPHLKYFLERVSKVFEVILFTASKRVYADKLLNILDPEKKYFRHRLFR- 192

Query: 1025 GDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
             +      G+      KDL  +LG + S  +I+D+S + + +   N I +E +
Sbjct: 193  -EHCVCVQGN----YIKDL-SILGRDLSKTMIVDNSPQAFAYQIFNGIPIESW 239


>gi|302824588|ref|XP_002993936.1| hypothetical protein SELMODRAFT_137904 [Selaginella moellendorffii]
 gi|300138208|gb|EFJ04983.1| hypothetical protein SELMODRAFT_137904 [Selaginella moellendorffii]
          Length = 159

 Score = 49.7 bits (117), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 56/194 (28%), Positives = 87/194 (44%), Gaps = 38/194 (19%)

Query: 923  LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTF 982
            LVLDLD TL++ A        HDE          E   R + +          RPG+  F
Sbjct: 2    LVLDLDQTLVSVAD-------HDE----------ETLLRCVTK----------RPGLDRF 34

Query: 983  LERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKD 1042
            L+  S+++E+ +++     Y  ++   LDP G +F+   +  G D D   G +RV   + 
Sbjct: 35   LKDMSQVYEIVIFSASGASYVKKIVSSLDPTGEIFSA--VFTGSDTDWLSG-QRVKDLRK 91

Query: 1043 LEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERS 1102
            L         +V IDD+  ++P+N  N I V  +   P +    G L P LLE+   + S
Sbjct: 92   LN-----RDKIVWIDDNASLYPYNPKNGIQVPPFHGDP-NDSILGALTPLLLEVALGQIS 145

Query: 1103 EDGTLASSLGVRQQ 1116
             +   AS L VR +
Sbjct: 146  VEN--ASELFVRAR 157


>gi|302811311|ref|XP_002987345.1| hypothetical protein SELMODRAFT_125729 [Selaginella moellendorffii]
 gi|300144980|gb|EFJ11660.1| hypothetical protein SELMODRAFT_125729 [Selaginella moellendorffii]
          Length = 240

 Score = 49.3 bits (116), Expect = 0.011,   Method: Composition-based stats.
 Identities = 46/157 (29%), Positives = 73/157 (46%), Gaps = 22/157 (14%)

Query: 921  LCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIW 980
            + LVLDLD TL++S   H  +      L                 F    ++ + RP + 
Sbjct: 45   VALVLDLDETLVHSTTDHCGNADFSFSLHAN--------------FQRQTVYVRRRPHLQ 90

Query: 981  TFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKS 1040
             F+ER ++LFE+ ++T     YA ++  +LDPK  +F  R+    D     DG+      
Sbjct: 91   MFMERVAQLFEIIVFTASQSTYAEKLLNILDPKRKVFRHRIFR--DSCVLVDGNYL---- 144

Query: 1041 KDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
            KDL  VLG + S  VI+D+S + +     N I +E +
Sbjct: 145  KDLS-VLGRDLSKTVIVDNSPQAFGFQVDNGIPIESW 180


>gi|403416935|emb|CCM03635.1| predicted protein [Fibroporia radiculosa]
          Length = 580

 Score = 49.3 bits (116), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 30/84 (35%), Positives = 43/84 (51%), Gaps = 10/84 (11%)

Query: 997  MGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVI 1055
            MG + YA E+   +DP+G  F GR++SR + G            K L+ +    +S VVI
Sbjct: 1    MGTRAYAEEVCAAIDPEGKFFGGRLLSRDESGS--------LTQKSLQRLFPTDQSMVVI 52

Query: 1056 IDDSVRVWPHNKLNLIVVERYTYF 1079
            IDD   VW  +  NL+ V  Y +F
Sbjct: 53   IDDRADVWEWSP-NLVKVIPYDFF 75


>gi|290992214|ref|XP_002678729.1| predicted protein [Naegleria gruberi]
 gi|284092343|gb|EFC45985.1| predicted protein [Naegleria gruberi]
          Length = 181

 Score = 49.3 bits (116), Expect = 0.012,   Method: Composition-based stats.
 Identities = 53/188 (28%), Positives = 88/188 (46%), Gaps = 31/188 (16%)

Query: 920  KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMG------MWT 973
            K  +VLDLD TL+ S  F++   V+D  +                 FPHMG      ++ 
Sbjct: 9    KKTIVLDLDETLIKS--FYQEPEVYDFSID--------------IEFPHMGNLIQQHVYI 52

Query: 974  KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDG 1033
            K RPG+  FL+  ++ FE+ ++T    +YA  + K +DP   LF+  V+ R       +G
Sbjct: 53   KKRPGLENFLQTLAEKFELIMFTAALPVYADAILKHIDPSAELFS-HVLYRHH----CNG 107

Query: 1034 DERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPS 1092
                P  KDL  +LG      +++DD V  +   K N ++++ +      R    ++ P 
Sbjct: 108  SGMFP-GKDLR-ILGRNLDHTLLVDDGVMNFLQPK-NGLLIKSFKGEEGDRILADIIAPF 164

Query: 1093 LLEIDHDE 1100
            LL++D  E
Sbjct: 165  LLQLDDPE 172


>gi|357487783|ref|XP_003614179.1| CTD small phosphatase-like protein [Medicago truncatula]
 gi|355515514|gb|AES97137.1| CTD small phosphatase-like protein [Medicago truncatula]
          Length = 306

 Score = 49.3 bits (116), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 53/184 (28%), Positives = 85/184 (46%), Gaps = 24/184 (13%)

Query: 923  LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTF 982
            LVL LD TL           VH  +++ KE+ D        F      ++ + RP +  F
Sbjct: 127  LVLGLDGTL-----------VHSTLVKPKEDHDL--TFTVSFNSVKEDVYVRYRPHLKEF 173

Query: 983  LERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKD 1042
            L+  S +FE+ ++T G ++YA ++   LDP   +F  R+          + DE+    KD
Sbjct: 174  LDEVSGIFEIIVFTAGQRIYADKLLNKLDPSRKIFRHRLFRES----CVNVDEKY--VKD 227

Query: 1043 LEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLL--GPSLLEIDHD 1099
            L  +LG + A V +ID S   +     N I +E +   P   +   L+    SL+E+D D
Sbjct: 228  L-SILGRDLARVTMIDSSPHSFGFQVENGIPIETWFADPSDNKLLSLIPFLESLVEVD-D 285

Query: 1100 ERSE 1103
             R+E
Sbjct: 286  VRTE 289


>gi|223943303|gb|ACN25735.1| unknown [Zea mays]
          Length = 342

 Score = 49.3 bits (116), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 38/149 (25%), Positives = 71/149 (47%), Gaps = 28/149 (18%)

Query: 919  RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPG 978
            + + LVLDLD TL++S   H                + +      F   +  ++ + RP 
Sbjct: 165  KHVTLVLDLDETLVHSTLDHC--------------DNADFTLEVFFNMKNHTVYVRKRPY 210

Query: 979  IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG---DDGDPFDGDE 1035
            +  FLE+ +++FE+ ++T   ++YA ++   LDP G   + R+        DG       
Sbjct: 211  LKMFLEKVAQMFEVVIFTASQRVYAEQLIDKLDPDGKYISRRIYRESCVFSDG------- 263

Query: 1036 RVPKSKDLEGVLGMESA-VVIIDDSVRVW 1063
                +KDL  +LG++ A V I+D++ +V+
Sbjct: 264  --CYTKDL-TILGIDLAKVAIVDNTPQVF 289


>gi|391328122|ref|XP_003738541.1| PREDICTED: CTD small phosphatase-like protein 2-like [Metaseiulus
            occidentalis]
          Length = 236

 Score = 49.3 bits (116), Expect = 0.013,   Method: Composition-based stats.
 Identities = 37/108 (34%), Positives = 53/108 (49%), Gaps = 14/108 (12%)

Query: 914  KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWT 973
            K  SA +  LVLDLD TL           VH  ++   E +        LF+     ++ 
Sbjct: 51   KTRSAPEFSLVLDLDETL-----------VHCSLM---ELEGATFTFPVLFQGIEYKVYV 96

Query: 974  KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
            + RP    FLER SK+FE+ L+T   K+YA ++  +LDPK  L   R+
Sbjct: 97   RTRPFFREFLERVSKMFEVILFTASKKVYADKLLDLLDPKRHLIRYRL 144


>gi|313226803|emb|CBY21948.1| unnamed protein product [Oikopleura dioica]
          Length = 444

 Score = 49.3 bits (116), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 50/184 (27%), Positives = 85/184 (46%), Gaps = 33/184 (17%)

Query: 896  QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 955
            + +AAI   +TR+  E           LVLDLD TL           VH  +    E Q 
Sbjct: 237  KNRAAILPCKTRKTPE---------YTLVLDLDETL-----------VHCSLC---ELQM 273

Query: 956  REKPHRHLFRFPHMG--MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
            R+       RF ++   ++ K RP +  FLER  + FE+ ++T   K+YA ++  ++DP 
Sbjct: 274  RDYEFTFPIRFQNVDYDVYVKTRPYLRDFLERMCEHFEIIIFTASKKVYADKLISIIDPN 333

Query: 1014 GVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIV 1072
              L   R+    +      G+      KDL  +LG + +  +I+D+S + + ++  N I 
Sbjct: 334  KKLVRHRLFR--EHCMLVQGN----YIKDL-TILGRDLTKTIIVDNSPQAFSYHMDNGIP 386

Query: 1073 VERY 1076
            +E +
Sbjct: 387  IESW 390


>gi|340508046|gb|EGR33849.1| NLI interacting factor-like phosphatase family protein, putative
            [Ichthyophthirius multifiliis]
          Length = 280

 Score = 48.9 bits (115), Expect = 0.014,   Method: Composition-based stats.
 Identities = 32/118 (27%), Positives = 64/118 (54%), Gaps = 19/118 (16%)

Query: 901  IQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSA--KFHEVDPVHDEILRKKEEQDREK 958
            ++K+R ++L +QK+    +K  L+LDLD TL++S+  + +E D   + +++    Q    
Sbjct: 11   VKKQRIKQLGQQKQSCIGKK-TLILDLDETLVHSSFQQINEYDFQFEIVVKNIPYQ---- 65

Query: 959  PHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 1016
                        ++ K RPGI  FL++ S+ +E+ +YT     YA ++  ++D + V+
Sbjct: 66   ------------IYVKKRPGIHIFLQKLSEKYEIVIYTASISEYANQVCNIIDQQDVI 111


>gi|403353558|gb|EJY76317.1| NLI interacting factor-like phosphatase family protein [Oxytricha
            trifallax]
          Length = 1037

 Score = 48.9 bits (115), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 42/151 (27%), Positives = 72/151 (47%), Gaps = 31/151 (20%)

Query: 918  ARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPH----MGMWT 973
            ++K  L+ D+D TL+     H VD +      + E+ D   P      FP     +    
Sbjct: 654  SKKKTLIFDMDETLI-----HCVDDI------ESEDPDVIIP----IDFPDEDEIVNAGI 698

Query: 974  KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDG 1033
             +RP ++  LE A+KLF++ ++T  +K YA  +   LDP+   F  R+  R +     +G
Sbjct: 699  NIRPYLYECLEEANKLFQVIVFTASHKAYADAILDYLDPENKYFQYRLY-RDNCVQTREG 757

Query: 1034 ----DERVPKSKDLEGVLGMESAVVIIDDSV 1060
                D R+  ++DL+        ++IID+SV
Sbjct: 758  YYVKDLRIINNRDLKD-------LIIIDNSV 781


>gi|345479753|ref|XP_001603378.2| PREDICTED: hypothetical protein LOC100119644 [Nasonia vitripennis]
          Length = 563

 Score = 48.9 bits (115), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 50/170 (29%), Positives = 77/170 (45%), Gaps = 34/170 (20%)

Query: 914  KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP------ 967
            K  S+ +  LVLDLD TL           VH  +   +E  D        FRFP      
Sbjct: 378  KTRSSPEFSLVLDLDETL-----------VHCSL---QELSDAS------FRFPVVFQNI 417

Query: 968  HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD 1027
               ++ + RP    FLE  S L+E+ L+T   ++YA ++  +LDP   L   R+    + 
Sbjct: 418  TYTVFVRTRPFFREFLEHVSSLYEVILFTASKRVYANKLMNLLDPTRKLIKYRLFR--EH 475

Query: 1028 GDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
                +G+      KDL  +LG + S  VIID+S + + +   N I +E +
Sbjct: 476  CVCVNGN----YIKDL-SILGRDLSKTVIIDNSPQAFGYQLENGIPIESW 520


>gi|291239709|ref|XP_002739764.1| PREDICTED: CTD small phosphatase-like protein 2-like [Saccoglossus
            kowalevskii]
          Length = 526

 Score = 48.9 bits (115), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 49/176 (27%), Positives = 84/176 (47%), Gaps = 22/176 (12%)

Query: 902  QKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR 961
            + E+ RR     K  S+ +  LVLDLD TL++ +  +E+D             D      
Sbjct: 329  EAEKNRRPVLPLKTRSSPEYSLVLDLDETLVHCS-LNELD-------------DANLTFP 374

Query: 962  HLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
             +F+     ++ + RP    FLE  S+ FE+ L+T   K+YA ++  +LDP+      R+
Sbjct: 375  VVFQDITYQVFVRTRPYFKEFLEAVSQQFEVILFTASKKVYADKLFNLLDPQKKYVKYRL 434

Query: 1022 ISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
                +      G+      KDL G+LG + S V+I+D+S + + +   N I +E +
Sbjct: 435  FR--EHCVCVQGN----YIKDL-GILGRDLSRVIIVDNSPQAFGYQLSNGIPIESW 483


>gi|145502170|ref|XP_001437064.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124404211|emb|CAK69667.1| unnamed protein product [Paramecium tetraurelia]
          Length = 334

 Score = 48.9 bits (115), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 48/174 (27%), Positives = 79/174 (45%), Gaps = 28/174 (16%)

Query: 923  LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTF 982
            LV+DLD TL++S+   E   V+D I+       + K            ++  +RPG   F
Sbjct: 48   LVIDLDETLVHSS--FEPMKVNDLIVEVTMNDQKYK------------IYVNIRPGAHDF 93

Query: 983  LERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKS-- 1040
            +E ASK FE+ ++T     YA  +   LDP G++          D   F  +  V K   
Sbjct: 94   IEEASKYFELIIFTASISEYANSVIDFLDPHGLV----------DLRLFRENCTVYKDIL 143

Query: 1041 -KDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSL 1093
             KDL  +     +V++ID+SV  +    +N + +  Y +   + ++  LL P L
Sbjct: 144  VKDLSLLKRKLDSVILIDNSVNSFMFQPMNAVHILNY-FEDKTDQELTLLIPFL 196


>gi|225711928|gb|ACO11810.1| Probable C-terminal domain small phosphatase [Lepeophtheirus
            salmonis]
          Length = 265

 Score = 48.5 bits (114), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 47/161 (29%), Positives = 78/161 (48%), Gaps = 22/161 (13%)

Query: 917  SARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLR 976
            S+ +  LVLDLD TL++ +   E+D             D       +F+     ++ + R
Sbjct: 82   SSPRFSLVLDLDETLVHCS-LQELD-------------DASLSFPVVFQDTTYRVFVRTR 127

Query: 977  PGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDER 1036
            P I  FLER SK FE+ L+T   K+YA ++  +LDP+      R+    +     +G+  
Sbjct: 128  PRIREFLERVSKNFEVTLFTASKKVYADKLLNLLDPERKWIKYRLFR--EHCVCVNGNY- 184

Query: 1037 VPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
                KDL  +LG + S  +IID+S + + +   N I +E +
Sbjct: 185  ---IKDL-NILGRDLSKTIIIDNSPQAFGYQLENGIPIESW 221


>gi|70921595|ref|XP_734099.1| hypothetical protein [Plasmodium chabaudi chabaudi]
 gi|56506520|emb|CAH86297.1| hypothetical protein PC301933.00.0 [Plasmodium chabaudi chabaudi]
          Length = 212

 Score = 48.5 bits (114), Expect = 0.021,   Method: Composition-based stats.
 Identities = 35/136 (25%), Positives = 62/136 (45%), Gaps = 14/136 (10%)

Query: 953  EQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDP 1012
            E D  + ++    + +   + K RP +  FLE  S  +E+ +YT   + YA  +  +LDP
Sbjct: 5    ENDELELYKFFLPYYNFFYYLKFRPYVRQFLEILSLYYELSIYTNATREYADVVIAILDP 64

Query: 1013 KGVLFAGRVISRGD--DGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVW---PHNK 1067
               +FA R+++R    D D     E++  + D          V+  DD   VW   P + 
Sbjct: 65   DRTIFADRIVARCSSVDRDENKHFEKIYPNVD-------PKYVIAFDDRKDVWYDIPDS- 116

Query: 1068 LNLIVVERYTYFPCSR 1083
             +++  E Y +F  S+
Sbjct: 117  -HILRAEHYNFFELSK 131


>gi|124513824|ref|XP_001350268.1| protein phosphatase, putative [Plasmodium falciparum 3D7]
 gi|23615685|emb|CAD52677.1| protein phosphatase, putative [Plasmodium falciparum 3D7]
          Length = 1288

 Score = 48.5 bits (114), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 34/125 (27%), Positives = 55/125 (44%), Gaps = 12/125 (9%)

Query: 964  FRFPHMGM--WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
            F  P      + K RP +  FL+  S  +E+ +YT   + YA  +  +LDP   +F+ R+
Sbjct: 888  FYLPQYNFFYYLKFRPYVRQFLQILSLYYELAIYTNATREYADVVIAILDPDRTIFSDRI 947

Query: 1022 ISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVW---PHNKLNLIVVERYTY 1078
            ++R    D    DE    S+    V      V+  DD   VW   P +   ++  E Y +
Sbjct: 948  VARCSSTDR---DENKYFSRIYPNV--DPKYVIAFDDRKDVWIDIPQSH--ILKAEHYNF 1000

Query: 1079 FPCSR 1083
            F  S+
Sbjct: 1001 FELSK 1005


>gi|195125303|ref|XP_002007120.1| GI12759 [Drosophila mojavensis]
 gi|193918729|gb|EDW17596.1| GI12759 [Drosophila mojavensis]
          Length = 245

 Score = 48.5 bits (114), Expect = 0.023,   Method: Composition-based stats.
 Identities = 32/119 (26%), Positives = 58/119 (48%), Gaps = 14/119 (11%)

Query: 912  QKKMFSARKLCLVLDLDHTLLNSAKFH------EVDPVHDEILRKKEEQDREKPHRHLFR 965
            +K++   R+  LVLD+D TL++S   +      E  P  +   + K +     P+ + F 
Sbjct: 19   KKRLLMVRRKTLVLDMDETLISSVILYRVKSLLEAGPEDNRRYKAKSKIVHSTPYDYSFY 78

Query: 966  FP--HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVI 1022
             P     ++   RP +  FL+R SK + + ++T  ++ YA+++   LD      AGR I
Sbjct: 79   IPMSEASVYVYKRPYVDLFLDRVSKWYNLVVFTAASEAYASQVLDFLD------AGRNI 131


>gi|412985397|emb|CCO18843.1| predicted protein [Bathycoccus prasinos]
          Length = 601

 Score = 48.1 bits (113), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 52/177 (29%), Positives = 80/177 (45%), Gaps = 33/177 (18%)

Query: 920  KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP------HMGMWT 973
            K  LVLDLD TL           VH  +     E++   P    F FP         +  
Sbjct: 405  KNTLVLDLDETL-----------VHSNL-----EEEEGTPD---FTFPVQFNNETHAVNV 445

Query: 974  KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDG 1033
            ++RP +  F++R SK FE+ ++T   K+YA ++   LDP+ V F+ R+    D     +G
Sbjct: 446  RIRPHLEEFMKRVSKKFEVVIFTASQKVYADKLLDHLDPEHVYFSHRLFR--DSCVLVEG 503

Query: 1034 DERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLL 1089
            +      KDL  VLG + S  +IID+S + +     N + +E +   P       LL
Sbjct: 504  N----YLKDL-SVLGRDLSRTLIIDNSPQAFGFQVENGVPIESWYDDPTDDHLLRLL 555


>gi|325180325|emb|CCA14728.1| putative nuclear LIM factor interactorinteracting protein hyphal form
            [Albugo laibachii Nc14]
          Length = 228

 Score = 48.1 bits (113), Expect = 0.025,   Method: Composition-based stats.
 Identities = 32/104 (30%), Positives = 52/104 (50%), Gaps = 13/104 (12%)

Query: 919  RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKE--EQDREKPHRHLFRFPHMGMWTKLR 976
            R+L LVLD+D  L++S   H  D ++      K+  E    +  R +            R
Sbjct: 31   RRLALVLDMDECLIHSIFQH--DNIYQRYPSYKDSFEISTSEGERAI---------VNKR 79

Query: 977  PGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGR 1020
            PG+  FL  A+K F+++++T G ++Y   +   LDPKG +F  R
Sbjct: 80   PGLDAFLREAAKSFDLYVFTAGLRVYGEPILDALDPKGTIFKDR 123


>gi|302833726|ref|XP_002948426.1| hypothetical protein VOLCADRAFT_58281 [Volvox carteri f. nagariensis]
 gi|300266113|gb|EFJ50301.1| hypothetical protein VOLCADRAFT_58281 [Volvox carteri f. nagariensis]
          Length = 215

 Score = 48.1 bits (113), Expect = 0.026,   Method: Composition-based stats.
 Identities = 46/160 (28%), Positives = 77/160 (48%), Gaps = 22/160 (13%)

Query: 918  ARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRP 977
            AR+  LVLDLD TL++S+            L   +  D   P   +F      ++ + RP
Sbjct: 33   ARRKTLVLDLDETLVHSS------------LEAVDRSDFSFPV--IFNGTEHQVYVRQRP 78

Query: 978  GIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERV 1037
             +  F+ R + LFE+ ++T   ++YA ++  +LDP+  L   R+    D     DG+   
Sbjct: 79   YLREFMVRVAALFEVVVFTASQRIYAEKLLDILDPQQQLVRHRIYR--DSCVVVDGNYL- 135

Query: 1038 PKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERY 1076
               KDL  +LG + A  VI+D+S + +     N I +E +
Sbjct: 136  ---KDLS-ILGRDLATTVIVDNSPQAFGFQVDNGIPIESW 171


>gi|405966502|gb|EKC31780.1| CTD small phosphatase-like protein 2 [Crassostrea gigas]
          Length = 402

 Score = 48.1 bits (113), Expect = 0.026,   Method: Compositional matrix adjust.
 Identities = 53/192 (27%), Positives = 90/192 (46%), Gaps = 25/192 (13%)

Query: 914  KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWT 973
            K  S+ +  LVLDLD TL++ +                E +D       LF      ++ 
Sbjct: 218  KTRSSPEFSLVLDLDETLVHCSL--------------TELEDAAFTFPVLFEDVTYKVFV 263

Query: 974  KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDG 1033
            + RP    FLE  S++FE+ L+T   K+YA ++  +LDP+  L   R+    +     +G
Sbjct: 264  RTRPHFREFLETVSEMFEVILFTASKKVYADKLVNILDPQKQLIKHRLFR--EHCVCING 321

Query: 1034 DERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPS 1092
            +      KDL  +LG + S  +I+D+S + + +   N I +E + +   + R+   L P 
Sbjct: 322  N----YIKDL-TILGRDLSRTIIVDNSPQAFGYQLDNGIPIESW-FVDKNDRELLNLVPF 375

Query: 1093 LLEIDHDERSED 1104
            L  + H  R+ED
Sbjct: 376  LQSLVH--RNED 385


>gi|268566879|ref|XP_002639837.1| C. briggsae CBR-SCPL-3 protein [Caenorhabditis briggsae]
          Length = 294

 Score = 48.1 bits (113), Expect = 0.027,   Method: Compositional matrix adjust.
 Identities = 18/52 (34%), Positives = 33/52 (63%)

Query: 971  MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVI 1022
            ++ ++RP + TFL R SK+FE+ ++T   K YA ++  +LDP+  +   R+ 
Sbjct: 101  VYVRIRPFLRTFLTRMSKVFEIIVFTASKKCYANKLCDILDPQKTIIKHRLF 152


>gi|308801351|ref|XP_003077989.1| double-stranded RNA-binding domain (ISS) [Ostreococcus tauri]
 gi|116056440|emb|CAL52729.1| double-stranded RNA-binding domain (ISS) [Ostreococcus tauri]
          Length = 793

 Score = 48.1 bits (113), Expect = 0.027,   Method: Compositional matrix adjust.
 Identities = 38/123 (30%), Positives = 52/123 (42%), Gaps = 28/123 (22%)

Query: 967  PHMGMWTKLRPGIWTFL-------ERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAG 1019
            P   M   +RPG W  L       +R SK  E  + TM N  YA EM ++LDP G +F  
Sbjct: 334  PATAMLVHIRPG-WGELRSYLSGSDRGSKRAETFVCTMANIDYAREMCRLLDPHGTVF-- 390

Query: 1020 RVISRGDDGDPFDGDERVPKSK--------DLEGVLGMESAVVIIDDSVRVW-PHNKLNL 1070
                     DP   D+R+   K        D  G+       VI+DD   VW P  + ++
Sbjct: 391  ---------DPAQLDKRIKSVKPDELKSLSDTCGLHFPSELAVIVDDRTAVWEPSAQSHI 441

Query: 1071 IVV 1073
            + V
Sbjct: 442  LAV 444


>gi|290986065|ref|XP_002675745.1| NLI interacting factor domain-containing protein [Naegleria gruberi]
 gi|284089343|gb|EFC43001.1| NLI interacting factor domain-containing protein [Naegleria gruberi]
          Length = 510

 Score = 48.1 bits (113), Expect = 0.029,   Method: Compositional matrix adjust.
 Identities = 57/231 (24%), Positives = 96/231 (41%), Gaps = 39/231 (16%)

Query: 884  GDVEHLF--EGYDDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHE-- 939
            GD+ H F  E   ++++A  Q  +T  L  Q+     +K  LVLDLD TL++S   H   
Sbjct: 271  GDLIHQFYEEVKTNKKRAHKQAPQTALLPPQRPHVQGKK-TLVLDLDETLVHSVFVHTDQ 329

Query: 940  ---VDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYT 996
               V P+         E D      ++ +          RPG+  +L    + +E+ ++T
Sbjct: 330  ADFVIPI---------EMDGRTYSCYVLK----------RPGVDEYLRELGQYYEIIIFT 370

Query: 997  MGNKLYATEMAKVLDPKGVLFAGRVISRGDD--GDPFDGDERVPKSKDLEGVLGMESAVV 1054
                LYA  +  +LD  GV+  GR+        GD +         KDL  +       +
Sbjct: 371  ASLSLYANPLLDILDKHGVI-EGRLFREHCTKVGDTY--------IKDLSRLGRDLDQTI 421

Query: 1055 IIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERSEDG 1105
            I+D+S   +     N +    +   P + R+ GLL   L  ++ ++   DG
Sbjct: 422  IVDNSPSCYAMQPQNALACTTWYDDP-NDRELGLLADCLKRLEREKAVYDG 471


>gi|242053713|ref|XP_002456002.1| hypothetical protein SORBIDRAFT_03g028730 [Sorghum bicolor]
 gi|241927977|gb|EES01122.1| hypothetical protein SORBIDRAFT_03g028730 [Sorghum bicolor]
          Length = 400

 Score = 48.1 bits (113), Expect = 0.029,   Method: Compositional matrix adjust.
 Identities = 26/104 (25%), Positives = 50/104 (48%), Gaps = 14/104 (13%)

Query: 918  ARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRP 977
             + + LVLDLD TL++S   H                + +      F   +  ++ + RP
Sbjct: 222  TKHVTLVLDLDETLVHSTLDHC--------------DNADFTLEVFFNMKNHTVYVRKRP 267

Query: 978  GIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
             +  FLE+ +++FE+ ++T   ++YA ++   LDP G   + R+
Sbjct: 268  YLKMFLEKVAQMFEVVIFTASQRIYAEQLIDKLDPDGKYISRRI 311


>gi|332020757|gb|EGI61161.1| CTD small phosphatase-like protein 2 [Acromyrmex echinatior]
          Length = 593

 Score = 48.1 bits (113), Expect = 0.030,   Method: Compositional matrix adjust.
 Identities = 50/170 (29%), Positives = 77/170 (45%), Gaps = 34/170 (20%)

Query: 914  KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP------ 967
            K  S+ +  LVLDLD TL           VH  +   +E  D        FRFP      
Sbjct: 408  KTRSSPEFSLVLDLDETL-----------VHCSL---QELSDAA------FRFPVVFQDV 447

Query: 968  HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD 1027
               ++ + RP    FLE  S L+E+ L+T   ++YA ++  +LDP   L   R+    + 
Sbjct: 448  TYTVFVRTRPYFREFLEHVSSLYEVILFTASKRVYANKLMNLLDPTRKLIKYRLFR--EH 505

Query: 1028 GDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
                +G+      KDL  +LG + S  VIID+S + + +   N I +E +
Sbjct: 506  CVCVNGN----YIKDL-SILGRDLSKTVIIDNSPQAFGYQLENGIPIESW 550


>gi|71649764|ref|XP_813595.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
 gi|70878493|gb|EAN91744.1| hypothetical protein, conserved [Trypanosoma cruzi]
          Length = 446

 Score = 48.1 bits (113), Expect = 0.031,   Method: Compositional matrix adjust.
 Identities = 47/173 (27%), Positives = 79/173 (45%), Gaps = 24/173 (13%)

Query: 901  IQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDP-VHDEILRKKEEQDREKP 959
            I K     L  Q + +  +K  L+LDLD TL++S+    + P  HD IL  K E +    
Sbjct: 239  IAKNHASLLPLQMRQYHGKK-TLILDLDETLVHSSL--TLQPKQHDLILSMKTEPEVTT- 294

Query: 960  HRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAG 1019
                       ++   RP +  F++  + LFE+ ++T    +Y   +   +DP+G+L + 
Sbjct: 295  -----------IYVAYRPFLHEFIQAVAGLFEVVIFTASVSMYCNPVMDAVDPEGILGSL 343

Query: 1020 RVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLI 1071
            R+    +     +G       KDL  +LG E S V I+D+S   +   + N I
Sbjct: 344  RLYR--EHCSILNG----AYVKDL-SLLGRELSQVAIVDNSPVTYLFQQRNAI 389


>gi|428182825|gb|EKX51685.1| hypothetical protein GUITHDRAFT_65993, partial [Guillardia theta
            CCMP2712]
          Length = 179

 Score = 47.8 bits (112), Expect = 0.031,   Method: Composition-based stats.
 Identities = 32/99 (32%), Positives = 50/99 (50%), Gaps = 16/99 (16%)

Query: 924  VLDLDHTLLN-SAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTF 982
            VLDLD TL++ S +F E   +  ++  K  EQD               +W K+RP    F
Sbjct: 1    VLDLDETLVHASLEFMEQSHLQFDVTFK--EQDYH-------------VWVKIRPHCLEF 45

Query: 983  LERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
            LER ++ FE+ ++T    +YA ++  ++DP   L   RV
Sbjct: 46   LERLAEKFEIIVFTASQSIYADKLLNLIDPDSRLIKHRV 84


>gi|322779051|gb|EFZ09448.1| hypothetical protein SINV_03717 [Solenopsis invicta]
          Length = 568

 Score = 47.8 bits (112), Expect = 0.033,   Method: Compositional matrix adjust.
 Identities = 50/170 (29%), Positives = 77/170 (45%), Gaps = 34/170 (20%)

Query: 914  KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP------ 967
            K  S+ +  LVLDLD TL           VH  +   +E  D        FRFP      
Sbjct: 383  KTRSSPEFSLVLDLDETL-----------VHCSL---QELSDAA------FRFPVVFQDV 422

Query: 968  HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD 1027
               ++ + RP    FLE  S L+E+ L+T   ++YA ++  +LDP   L   R+    + 
Sbjct: 423  TYTVFVRTRPYFREFLEHVSSLYEVILFTASKRVYANKLMNLLDPTRKLIKYRLFR--EH 480

Query: 1028 GDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
                +G+      KDL  +LG + S  VIID+S + + +   N I +E +
Sbjct: 481  CVCVNGN----YIKDL-SILGRDLSKTVIIDNSPQAFGYQLENGIPIESW 525


>gi|307165882|gb|EFN60237.1| CTD small phosphatase-like protein 2 [Camponotus floridanus]
          Length = 568

 Score = 47.8 bits (112), Expect = 0.033,   Method: Compositional matrix adjust.
 Identities = 50/170 (29%), Positives = 77/170 (45%), Gaps = 34/170 (20%)

Query: 914  KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP------ 967
            K  S+ +  LVLDLD TL           VH  +   +E  D        FRFP      
Sbjct: 383  KTRSSPEFSLVLDLDETL-----------VHCSL---QELSDAA------FRFPVVFQDV 422

Query: 968  HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD 1027
               ++ + RP    FLE  S L+E+ L+T   ++YA ++  +LDP   L   R+    + 
Sbjct: 423  TYTVFVRTRPYFREFLEHVSSLYEVILFTASKRVYANKLMNLLDPTRKLIKYRLFR--EH 480

Query: 1028 GDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
                +G+      KDL  +LG + S  VIID+S + + +   N I +E +
Sbjct: 481  CVCVNGN----YIKDL-SILGRDLSKTVIIDNSPQAFGYQLENGIPIESW 525


>gi|307194093|gb|EFN76554.1| CTD small phosphatase-like protein 2 [Harpegnathos saltator]
          Length = 546

 Score = 47.4 bits (111), Expect = 0.042,   Method: Compositional matrix adjust.
 Identities = 50/170 (29%), Positives = 77/170 (45%), Gaps = 34/170 (20%)

Query: 914  KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP------ 967
            K  S+ +  LVLDLD TL           VH  +   +E  D        FRFP      
Sbjct: 361  KTRSSPEFSLVLDLDETL-----------VHCSL---QELSDAA------FRFPVVFQDV 400

Query: 968  HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD 1027
               ++ + RP    FLE  S L+E+ L+T   ++YA ++  +LDP   L   R+    + 
Sbjct: 401  TYTVFVRTRPYFREFLEHVSSLYEVILFTASKRVYANKLMNLLDPTRKLIKYRLFR--EH 458

Query: 1028 GDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
                +G+      KDL  +LG + S  VIID+S + + +   N I +E +
Sbjct: 459  CVCVNGN----YIKDL-SILGRDLSKTVIIDNSPQAFGYQLENGIPIESW 503


>gi|170050634|ref|XP_001861399.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167872200|gb|EDS35583.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 627

 Score = 47.4 bits (111), Expect = 0.042,   Method: Compositional matrix adjust.
 Identities = 29/100 (29%), Positives = 50/100 (50%), Gaps = 14/100 (14%)

Query: 914  KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWT 973
            K  S+ +  LVLDLD TL++ +               +E  D       LF+     ++ 
Sbjct: 482  KTRSSPEFSLVLDLDETLVHCSL--------------QELSDASFKFPVLFQECQYTVFV 527

Query: 974  KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
            + RP    FLE+ S++FE+ L+T   ++YA ++  +LDP+
Sbjct: 528  RTRPFFREFLEKVSQIFEVILFTASKRVYADKLLNLLDPE 567


>gi|302824592|ref|XP_002993938.1| hypothetical protein SELMODRAFT_137927 [Selaginella moellendorffii]
 gi|300138210|gb|EFJ04985.1| hypothetical protein SELMODRAFT_137927 [Selaginella moellendorffii]
          Length = 163

 Score = 47.4 bits (111), Expect = 0.043,   Method: Compositional matrix adjust.
 Identities = 42/141 (29%), Positives = 69/141 (48%), Gaps = 11/141 (7%)

Query: 976  RPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDE 1035
            RPG+  FL+  S+++E+ +++     Y  ++   LDP G +F+   +  G D D   G +
Sbjct: 32   RPGLDRFLKDMSQVYEIVVFSASGASYVKKIVSSLDPTGEIFSA--VFTGSDTDWLSG-Q 88

Query: 1036 RVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLE 1095
            RV   + L         +V IDD+  ++P+N  N I V  +   P +    G L P LLE
Sbjct: 89   RVKDLRKLN-----RDKIVWIDDNASLYPYNPKNGIQVPPFHGDP-NDSILGALTPLLLE 142

Query: 1096 IDHDERSEDGTLASSLGVRQQ 1116
            +   + S +   AS L VR +
Sbjct: 143  VALGQISVEN--ASELFVRAR 161


>gi|224002358|ref|XP_002290851.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220974273|gb|EED92603.1| predicted protein, partial [Thalassiosira pseudonana CCMP1335]
          Length = 196

 Score = 47.4 bits (111), Expect = 0.043,   Method: Compositional matrix adjust.
 Identities = 44/157 (28%), Positives = 78/157 (49%), Gaps = 22/157 (14%)

Query: 921  LCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIW 980
            +CLVLDLD TL++      V+PV D  +    E +  +   H+          + RP + 
Sbjct: 19   ICLVLDLDETLVHCT----VEPVSDADMIFPVEFNGMEYTVHV----------RCRPFLT 64

Query: 981  TFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKS 1040
             FLE+ S+ FE+ ++T   ++YA ++  ++DP+G     R+    D   P +G+      
Sbjct: 65   EFLEKVSEDFEVVVFTASQQVYADKLLDMIDPEGKFIKHRMFR--DSCLPVEGN----FL 118

Query: 1041 KDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
            KDL  +LG +    V++D+S   + +   N I +E +
Sbjct: 119  KDL-TILGRDLRRAVLVDNSPHAFGYQVDNGIPIESW 154


>gi|389583329|dbj|GAB66064.1| hypothetical protein PCYB_082250 [Plasmodium cynomolgi strain B]
          Length = 821

 Score = 47.4 bits (111), Expect = 0.045,   Method: Compositional matrix adjust.
 Identities = 33/95 (34%), Positives = 44/95 (46%), Gaps = 11/95 (11%)

Query: 987  SKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGV 1046
            +K +E++LYTMG   +A     +LDP    F  RV SR D          V   K L  +
Sbjct: 2    NKKYEIYLYTMGTLEHAKSCLLLLDPLKKFFGNRVFSRKD---------SVNGLKHLNRI 52

Query: 1047 L-GMESAVVIIDDSVRVWPHNKLNLIVVERYTYFP 1080
            L    S  + IDDS  +W  +  + I V  Y YFP
Sbjct: 53   LPTYRSVSLCIDDSDYMWKESS-SCIKVHGYNYFP 86


>gi|407846470|gb|EKG02580.1| hypothetical protein TCSYLVIO_006391 [Trypanosoma cruzi]
          Length = 447

 Score = 47.4 bits (111), Expect = 0.046,   Method: Compositional matrix adjust.
 Identities = 47/173 (27%), Positives = 79/173 (45%), Gaps = 24/173 (13%)

Query: 901  IQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDP-VHDEILRKKEEQDREKP 959
            I K     L  Q + +  +K  L+LDLD TL++S+    + P  HD IL  K E +    
Sbjct: 240  IAKNHASLLPLQMRQYHGKK-TLILDLDETLVHSSL--TLQPKQHDLILSMKTEPEVTT- 295

Query: 960  HRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAG 1019
                       ++   RP +  F++  + LFE+ ++T    +Y   +   +DP+G+L + 
Sbjct: 296  -----------IYVAYRPFLHEFIQAVAGLFEVVIFTASVSMYCNPVMDAVDPEGILGSL 344

Query: 1020 RVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLI 1071
            R+    +     +G       KDL  +LG E S V I+D+S   +   + N I
Sbjct: 345  RLYR--EHCSILNG----AYVKDL-SLLGRELSQVAIVDNSPVTYLFQQRNAI 390


>gi|401624712|gb|EJS42762.1| psr2p [Saccharomyces arboricola H-6]
          Length = 391

 Score = 47.4 bits (111), Expect = 0.048,   Method: Compositional matrix adjust.
 Identities = 33/105 (31%), Positives = 49/105 (46%), Gaps = 19/105 (18%)

Query: 914  KMFSARKLCLVLDLDHTLLNSA--KFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGM 971
            + F  RK CLVLDLD TL++S+    H  D      +   E  D+              +
Sbjct: 216  QAFQQRK-CLVLDLDETLVHSSFKYMHTAD-----FVLPVEIDDQVH-----------NV 258

Query: 972  WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 1016
            +   RPG+  FL R S+L+E+ ++T     YA  +   LDP G +
Sbjct: 259  YVIKRPGVDEFLHRVSQLYEVVVFTASVSRYANPLLDTLDPNGTI 303


>gi|395517551|ref|XP_003762939.1| PREDICTED: CTD small phosphatase-like protein-like [Sarcophilus
            harrisii]
          Length = 461

 Score = 47.4 bits (111), Expect = 0.049,   Method: Compositional matrix adjust.
 Identities = 52/189 (27%), Positives = 80/189 (42%), Gaps = 33/189 (17%)

Query: 894  DDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEE 953
            D +Q   I     + L  + K+    K C+V+DLD TL++S+      P+ +       E
Sbjct: 186  DQRQVIPIPSPSAKYLLPELKLSDYGKKCMVIDLDETLVHSS----FKPISNADFIVPVE 241

Query: 954  QDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
             D      ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  
Sbjct: 242  IDGTVHQVYVLKRPHVD----------EFLQRMGQLFECVLFTASLAKYADPVADLLDRW 291

Query: 1014 GVLFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNK 1067
            GV  A       V  RG+              KDL   LG E S V+IID+S   +  + 
Sbjct: 292  GVFRARLFRESCVFHRGN------------YVKDLSQ-LGRELSKVIIIDNSPASYIFHP 338

Query: 1068 LNLIVVERY 1076
             N + V+ +
Sbjct: 339  ENAVPVQSW 347


>gi|225718796|gb|ACO15244.1| Probable C-terminal domain small phosphatase [Caligus clemensi]
          Length = 314

 Score = 47.4 bits (111), Expect = 0.050,   Method: Compositional matrix adjust.
 Identities = 33/106 (31%), Positives = 52/106 (49%), Gaps = 14/106 (13%)

Query: 917  SARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLR 976
            S+ +  LVLDLD TL++ +   E+D             D       +F+     ++ + R
Sbjct: 131  SSPRFSLVLDLDETLVHCS-LQELD-------------DASLSFPVVFQDTTYRVFVRTR 176

Query: 977  PGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVI 1022
            P I  FLER SK FE+ L+T   K+YA ++  +LDP+      R+ 
Sbjct: 177  PRIREFLERVSKNFEVTLFTASKKVYADKLLNLLDPERKWIKYRLF 222


>gi|149392655|gb|ABR26130.1| ctd-phosphatase-like protein [Oryza sativa Indica Group]
          Length = 187

 Score = 47.4 bits (111), Expect = 0.053,   Method: Composition-based stats.
 Identities = 43/154 (27%), Positives = 78/154 (50%), Gaps = 28/154 (18%)

Query: 914  KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWT 973
            K   ++++ LVLDLD TL++S   H  D V D  L+              F   +  ++ 
Sbjct: 5    KSARSKQITLVLDLDETLVHSTLDH-CDNV-DFTLQV------------FFNMKNHTVYV 50

Query: 974  KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG---DDGDP 1030
            + RP +  FLE+ +++FE+ ++T   ++YA ++   LDP   L + R+        +G  
Sbjct: 51   RQRPHLKMFLEKVAQMFELVIFTASQRIYAEQLIDRLDPDERLISHRIYRESCIFSEG-- 108

Query: 1031 FDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVW 1063
                     +KDL  +LG++ A VVI+D++ +V+
Sbjct: 109  -------CYTKDLT-ILGVDLAKVVIVDNTPQVF 134


>gi|297799336|ref|XP_002867552.1| hypothetical protein ARALYDRAFT_913891 [Arabidopsis lyrata subsp.
            lyrata]
 gi|297313388|gb|EFH43811.1| hypothetical protein ARALYDRAFT_913891 [Arabidopsis lyrata subsp.
            lyrata]
          Length = 113

 Score = 47.0 bits (110), Expect = 0.057,   Method: Composition-based stats.
 Identities = 24/69 (34%), Positives = 38/69 (55%), Gaps = 1/69 (1%)

Query: 1050 ESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLAS 1109
            E  V+I+DD+V +WPH+K NL+ + +Y YF  +         S  E+  DE   +G+LA+
Sbjct: 21   ELRVIIVDDTVDIWPHDKRNLLQITKYIYFSVA-VSIDKRWRSYAEVKRDESLSNGSLAN 79

Query: 1110 SLGVRQQLH 1118
             L     +H
Sbjct: 80   VLKFLVYVH 88


>gi|157873633|ref|XP_001685322.1| conserved hypothetical protein [Leishmania major strain Friedlin]
 gi|68128394|emb|CAJ08450.1| conserved hypothetical protein [Leishmania major strain Friedlin]
          Length = 231

 Score = 47.0 bits (110), Expect = 0.057,   Method: Composition-based stats.
 Identities = 34/113 (30%), Positives = 58/113 (51%), Gaps = 7/113 (6%)

Query: 967  PHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD 1026
            P +  +   RPG+  FL + ++ ++M ++T G  LYA  + + L P  ++   R  +R  
Sbjct: 60   PSVTYYAFRRPGLNEFLYQCAEHYDMRIFTAGEDLYARTLLRWLLPDNLIDESRWYTR-- 117

Query: 1027 DGDPFDGDERVPKSKD-LEGVLGMESAVVIIDDSV--RVWPHNKLNLIVVERY 1076
            D    DG  R+ K+   L+G    E A +I+DDS    V+PH   N + + R+
Sbjct: 118  DACVGDGYGRLIKNLSMLDGFQFEERATLILDDSAPDNVYPHQ--NALAIPRF 168


>gi|66803905|ref|XP_635771.1| CTD small phosphatase-like protein 2 [Dictyostelium discoideum AX4]
 gi|74851880|sp|Q54GB2.1|CTSL2_DICDI RecName: Full=CTD small phosphatase-like protein 2; Short=CTDSP-like
            2
 gi|60464148|gb|EAL62309.1| CTD small phosphatase-like protein 2 [Dictyostelium discoideum AX4]
          Length = 567

 Score = 47.0 bits (110), Expect = 0.059,   Method: Compositional matrix adjust.
 Identities = 44/150 (29%), Positives = 70/150 (46%), Gaps = 29/150 (19%)

Query: 914  KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHL---FRFPHMG 970
            K  S+ K+ LVLDLD TL++ +     +P+             E+PH      F      
Sbjct: 384  KEHSSPKISLVLDLDETLVHCS----TEPL-------------EQPHLTFPVFFNNTEYQ 426

Query: 971  MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1030
            ++ K RP    FL + S +FE+ ++T   ++YA ++  ++DP   +   +     D    
Sbjct: 427  VFAKKRPFFEEFLHKVSDIFEVIIFTASQEVYANKLLNMIDPNNKI---KYRLYRDSCVY 483

Query: 1031 FDGDERVPKSKDLEGVLGME-SAVVIIDDS 1059
             DG+      KDL  VLG +   VVIID+S
Sbjct: 484  VDGNYL----KDL-SVLGRDLKQVVIIDNS 508


>gi|299470416|emb|CBN80177.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 613

 Score = 47.0 bits (110), Expect = 0.059,   Method: Compositional matrix adjust.
 Identities = 18/51 (35%), Positives = 32/51 (62%)

Query: 971  MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
             + +LRPG+  FLE+ + ++E+ ++T   + YA  +  +LDP G +FA R 
Sbjct: 446  YYVQLRPGLARFLEKVAAIYELVVWTASGRSYADAIIDLLDPAGDIFAERF 496


>gi|328874828|gb|EGG23193.1| CTD small phosphatase-like protein 2 [Dictyostelium fasciculatum]
          Length = 692

 Score = 47.0 bits (110), Expect = 0.061,   Method: Compositional matrix adjust.
 Identities = 44/151 (29%), Positives = 71/151 (47%), Gaps = 31/151 (20%)

Query: 914  KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWT 973
            K     K+ LVLDLD TL++ +     DP+          +D +      F      ++ 
Sbjct: 508  KTLDTPKISLVLDLDETLVHCS----TDPI----------EDPDLTFLVTFNAIEYKVYA 553

Query: 974  KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDP----KGVLFAGRVISRGDDGD 1029
            K RP    FL +AS+LFE+ ++T   ++YA ++  ++DP    K  LF    +       
Sbjct: 554  KKRPFFEEFLVKASELFEVIIFTASQEVYANKLLNMIDPNNHVKYRLFRDSCVY------ 607

Query: 1030 PFDGDERVPKSKDLEGVLGME-SAVVIIDDS 1059
              +G+      KDL  +LG + S VVI+D+S
Sbjct: 608  -VEGNYL----KDL-SILGRDLSQVVIVDNS 632


>gi|413917756|gb|AFW57688.1| hypothetical protein ZEAMMB73_437679 [Zea mays]
          Length = 186

 Score = 47.0 bits (110), Expect = 0.063,   Method: Composition-based stats.
 Identities = 21/58 (36%), Positives = 33/58 (56%)

Query: 964  FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
            F F    ++ + RP +  FL+R + +FE  ++T    +YA ++  VLDPK  LF  RV
Sbjct: 33   FNFREHTIYVRCRPYLKEFLDRVASVFETIIFTASQSIYAEQLLNVLDPKRKLFRHRV 90


>gi|224031885|gb|ACN35018.1| unknown [Zea mays]
          Length = 190

 Score = 47.0 bits (110), Expect = 0.063,   Method: Composition-based stats.
 Identities = 21/58 (36%), Positives = 33/58 (56%)

Query: 964  FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
            F F    ++ + RP +  FL+R + +FE  ++T    +YA ++  VLDPK  LF  RV
Sbjct: 33   FNFREHTIYVRCRPYLKEFLDRVASVFETIIFTASQSIYAEQLLNVLDPKRKLFRHRV 90


>gi|351697455|gb|EHB00374.1| CTD small phosphatase-like protein [Heterocephalus glaber]
          Length = 356

 Score = 47.0 bits (110), Expect = 0.064,   Method: Compositional matrix adjust.
 Identities = 49/187 (26%), Positives = 80/187 (42%), Gaps = 33/187 (17%)

Query: 896  QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 955
            ++   +QK   + L  +  +    K C+V+DLD TL++S+      P+ +       E D
Sbjct: 162  EENGGLQKPPAKYLLPEVTVLDYGKKCVVIDLDETLVHSS----FKPISNADFIVPVEID 217

Query: 956  REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
                  ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  GV
Sbjct: 218  GTIHQVYVLKRPHVD----------EFLQRMGQLFECVLFTASLAKYADPVADLLDRWGV 267

Query: 1016 LFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLN 1069
              A       V  RG+              KDL   LG E S V+I+D+S   +  +  N
Sbjct: 268  FRARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIVDNSPASYIFHPEN 314

Query: 1070 LIVVERY 1076
             + V+ +
Sbjct: 315  AVPVQSW 321


>gi|407407114|gb|EKF31076.1| hypothetical protein MOQ_005093 [Trypanosoma cruzi marinkellei]
          Length = 463

 Score = 47.0 bits (110), Expect = 0.066,   Method: Compositional matrix adjust.
 Identities = 46/173 (26%), Positives = 79/173 (45%), Gaps = 24/173 (13%)

Query: 901  IQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDP-VHDEILRKKEEQDREKP 959
            I K     L  Q + +  +K  L+LDLD TL++S+    + P  HD +L  K E +    
Sbjct: 256  IAKNHASLLPLQMRQYHGKK-TLILDLDETLVHSSL--TLQPKQHDLVLSMKTEPEITT- 311

Query: 960  HRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAG 1019
                       ++   RP +  F++  + LFE+ ++T    +Y   +   +DP+G+L + 
Sbjct: 312  -----------IYVAYRPFLHEFIQAVAGLFEVVIFTASVSMYCNPVMDAVDPEGILGSL 360

Query: 1020 RVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLI 1071
            R+    +     +G       KDL  +LG E S V I+D+S   +   + N I
Sbjct: 361  RLYR--EHCSILNG----AYVKDL-SLLGRELSQVAIVDNSPVTYLFQQRNAI 406


>gi|159473212|ref|XP_001694733.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158276545|gb|EDP02317.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 215

 Score = 47.0 bits (110), Expect = 0.067,   Method: Composition-based stats.
 Identities = 50/166 (30%), Positives = 75/166 (45%), Gaps = 34/166 (20%)

Query: 918  ARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP--HMGM---- 971
            AR+  LVLDLD TL++S+                     E   R  F FP    GM    
Sbjct: 32   ARRKTLVLDLDETLVHSS--------------------LEAVDRSDFNFPVTFNGMDHTV 71

Query: 972  WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPF 1031
            + + RP +  F+ R + LFE+ ++T   ++YA  +  +LDP   L   R+    D     
Sbjct: 72   YVRQRPHLHDFMARVAALFEVVVFTASQRIYAERLLDILDPGQALVRHRIYR--DSCVVV 129

Query: 1032 DGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERY 1076
            DG+      KDL  VLG + A  VI+D+S + +     N I +E +
Sbjct: 130  DGNYL----KDLS-VLGRDLAHTVIVDNSPQAFGFQVDNGIPIESW 170


>gi|219126682|ref|XP_002183580.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217404817|gb|EEC44762.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 224

 Score = 47.0 bits (110), Expect = 0.068,   Method: Composition-based stats.
 Identities = 34/108 (31%), Positives = 51/108 (47%), Gaps = 14/108 (12%)

Query: 914  KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWT 973
            K   A  + LVLDLD TL++      V+PV +  L    +        H+          
Sbjct: 37   KTAGAPPITLVLDLDETLVHCT----VEPVENADLTFPVDFHNVTYQVHV---------- 82

Query: 974  KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
            +LRP ++TFL R    +E+ L+T   K+YA E+   +DP G  F  R+
Sbjct: 83   RLRPHLFTFLSRIEGQYEIVLFTASQKVYANELLNRIDPDGKYFHHRL 130


>gi|67463585|ref|XP_648443.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
 gi|56464600|gb|EAL43056.1| hypothetical protein EHI_121510 [Entamoeba histolytica HM-1:IMSS]
 gi|449705880|gb|EMD45836.1| RNA polymerase II ctd phosphatase, putative [Entamoeba histolytica
            KU27]
          Length = 428

 Score = 47.0 bits (110), Expect = 0.069,   Method: Compositional matrix adjust.
 Identities = 37/140 (26%), Positives = 66/140 (47%), Gaps = 24/140 (17%)

Query: 964  FRFPHMG--MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDP-KGVLF-AG 1019
            F  P     ++ + R GI TF+E+ SKL+++H+ T+G K YA  +   ++  + + F  G
Sbjct: 98   FEIPEQNAKVFIRFRDGIVTFMEKVSKLYDIHVVTLGQKEYAFAIVNAINKLRNIPFITG 157

Query: 1020 RVISR--------GDDGDPFDG-------DERVPKSKDLEGVLGMESAVVIIDDSVRVWP 1064
             +++          D+ D  DG       +ER    + +   +G E   VI+DD + VW 
Sbjct: 158  DLVTAEDCSSVIVCDEKDTNDGLIDREETNERRSVKRSI-PTMGKEEMQVIVDDRIDVWD 216

Query: 1065 HNKLNLIVVERYTYFPCSRR 1084
            +      VV+   Y P + +
Sbjct: 217  NKN----VVQICEYVPSTNQ 232


>gi|357129571|ref|XP_003566435.1| PREDICTED: uncharacterized protein C2F7.02c-like [Brachypodium
            distachyon]
          Length = 302

 Score = 46.6 bits (109), Expect = 0.077,   Method: Composition-based stats.
 Identities = 48/146 (32%), Positives = 69/146 (47%), Gaps = 30/146 (20%)

Query: 920  KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRF-PHMG-----MWT 973
            K  L LDLD TL++S    + DPV               P R+ F   P +G      + 
Sbjct: 125  KKTLFLDLDETLIHS----QTDPV---------------PARYDFTVRPVIGGQAITFYV 165

Query: 974  KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDG 1033
              RPG+  FL  A++ FE+ ++T G + YA+ +   LDP G + A R + RG   D  DG
Sbjct: 166  TKRPGVDEFLRAAAEAFEVVVFTAGLEQYASLVLDRLDPDGAVIAHR-LYRGACRD--DG 222

Query: 1034 DERVPKSKDLEGVLGMESAVVIIDDS 1059
            D R+   KDL          +I+DD+
Sbjct: 223  DGRL--VKDLAATGRALDCAIIVDDN 246


>gi|145508145|ref|XP_001440022.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124407228|emb|CAK72625.1| unnamed protein product [Paramecium tetraurelia]
          Length = 506

 Score = 46.6 bits (109), Expect = 0.079,   Method: Compositional matrix adjust.
 Identities = 37/145 (25%), Positives = 62/145 (42%), Gaps = 25/145 (17%)

Query: 923  LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTF 982
            LV DLD TL++    +     H  ++             H+   P   +   +RP     
Sbjct: 313  LVFDLDETLIHCNDINNNSTDHTTVI-------------HIPNEPETEIRFNIRPHCQQM 359

Query: 983  LERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG----DDGDPFDGDERVP 1038
            L+  S+ +E+ L+T   K YA ++ + +DPKG LF+ R         ++G     D RV 
Sbjct: 360  LKALSQYYELILFTASYKEYADKILEYIDPKGNLFSYRFYRESCLELEEG-LLVKDLRVI 418

Query: 1039 KSKDLEGVLGMESAVVIIDDSVRVW 1063
            + + LE        + IID+S   +
Sbjct: 419  EGRKLEN-------MAIIDNSAYCY 436


>gi|349603764|gb|AEP99509.1| CTD small phosphatase-like protein 2-like protein, partial [Equus
            caballus]
          Length = 159

 Score = 46.6 bits (109), Expect = 0.080,   Method: Composition-based stats.
 Identities = 35/113 (30%), Positives = 58/113 (51%), Gaps = 20/113 (17%)

Query: 971  MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGR------VISR 1024
            ++ +LRP    FLER S+++E+ L+T   K+YA ++  +LDPK  L   R      V  +
Sbjct: 17   VYVRLRPFFREFLERMSQMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFREHCVCVQ 76

Query: 1025 GDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
            G+              KDL  +LG + S  +IID+S + + +   N I +E +
Sbjct: 77   GN------------YIKDLN-ILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 116


>gi|32564286|ref|NP_871854.1| Protein SCPL-3, isoform b [Caenorhabditis elegans]
 gi|351059572|emb|CCD67162.1| Protein SCPL-3, isoform b [Caenorhabditis elegans]
          Length = 312

 Score = 46.6 bits (109), Expect = 0.083,   Method: Compositional matrix adjust.
 Identities = 17/43 (39%), Positives = 30/43 (69%)

Query: 971  MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
            ++ +LRP + TFL R +K FE+ ++T   K+YA ++  +LDP+
Sbjct: 101  VYVRLRPHLRTFLSRMAKTFEIIIFTASKKVYANKLCDILDPR 143


>gi|70945368|ref|XP_742511.1| hypothetical protein [Plasmodium chabaudi chabaudi]
 gi|56521536|emb|CAH80727.1| conserved hypothetical protein [Plasmodium chabaudi chabaudi]
          Length = 359

 Score = 46.6 bits (109), Expect = 0.085,   Method: Composition-based stats.
 Identities = 33/117 (28%), Positives = 55/117 (47%), Gaps = 14/117 (11%)

Query: 972  WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD--DGD 1029
            + K RP +  FLE  S  +E+ +YT   + YA  +  +LDP   +FA R+++R    D D
Sbjct: 3    YLKFRPYVRQFLEILSLYYELSIYTNATREYADVVIAILDPDRTIFADRIVARCSSVDRD 62

Query: 1030 PFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVW---PHNKLNLIVVERYTYFPCSR 1083
                 E++  + D          V+  DD   VW   P +  +++  E Y +F  S+
Sbjct: 63   ENKHFEKIYPNVD-------PKYVIAFDDRKDVWYDIPDS--HILRAEHYNFFELSK 110


>gi|443696004|gb|ELT96785.1| hypothetical protein CAPTEDRAFT_124156, partial [Capitella teleta]
          Length = 209

 Score = 46.6 bits (109), Expect = 0.086,   Method: Composition-based stats.
 Identities = 32/108 (29%), Positives = 52/108 (48%), Gaps = 14/108 (12%)

Query: 914  KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWT 973
            K  S  +  LVLDLD TL++ +                E +D       LF+     ++ 
Sbjct: 24   KTRSTPEFSLVLDLDETLVHCSL--------------NELEDAAFSFPVLFQDVTYQVFV 69

Query: 974  KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
            + RP    FLER +K+FE+ ++T   K+YA ++  +LDP+  L   R+
Sbjct: 70   RTRPRFREFLERVAKIFEVTVFTASKKVYANKLLNLLDPEKKLIRHRL 117


>gi|240273650|gb|EER37170.1| RNA Polymerase II CTD phosphatase Fcp1 [Ajellomyces capsulatus
           H143]
          Length = 592

 Score = 46.6 bits (109), Expect = 0.087,   Method: Compositional matrix adjust.
 Identities = 33/88 (37%), Positives = 46/88 (52%), Gaps = 22/88 (25%)

Query: 924 VLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR------HLFRF----PHM-GMW 972
           V+DLD T++++     VDP   E      +QD++ P+         F+     P M G W
Sbjct: 276 VVDLDQTIIHAT----VDPTVAEW-----QQDKDNPNHEAVKDVRAFQLVDDGPGMKGCW 326

Query: 973 --TKLRPGIWTFLERASKLFEMHLYTMG 998
              KLRPG+  FL   S LFE+H+YTMG
Sbjct: 327 YYIKLRPGLEEFLRNISTLFELHIYTMG 354


>gi|350421965|ref|XP_003493014.1| PREDICTED: hypothetical protein LOC100746789 isoform 1 [Bombus
            impatiens]
          Length = 558

 Score = 46.6 bits (109), Expect = 0.088,   Method: Compositional matrix adjust.
 Identities = 50/170 (29%), Positives = 77/170 (45%), Gaps = 34/170 (20%)

Query: 914  KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP------ 967
            K  S+ +  LVLDLD TL           VH  +   +E  D        FRFP      
Sbjct: 373  KTRSSPEFSLVLDLDETL-----------VHCSL---QELSDAA------FRFPVVFQDV 412

Query: 968  HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD 1027
               ++ + RP    FLE  S L+E+ L+T   ++YA ++  +LDP   L   R+    + 
Sbjct: 413  TYTVFVRTRPYFREFLEHVSSLYEVILFTASKRVYANKLMNLLDPTRKLIKYRLFR--EH 470

Query: 1028 GDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
                +G+      KDL  +LG + S  VIID+S + + +   N I +E +
Sbjct: 471  CVCVNGN----YIKDL-SILGRDLSKTVIIDNSPQAFGYQLENGIPIESW 515


>gi|350421968|ref|XP_003493015.1| PREDICTED: hypothetical protein LOC100746789 isoform 2 [Bombus
            impatiens]
          Length = 457

 Score = 46.6 bits (109), Expect = 0.088,   Method: Compositional matrix adjust.
 Identities = 48/170 (28%), Positives = 77/170 (45%), Gaps = 34/170 (20%)

Query: 914  KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP------ 967
            K  S+ +  LVLDLD TL++ +               +E  D        FRFP      
Sbjct: 272  KTRSSPEFSLVLDLDETLVHCSL--------------QELSDAA------FRFPVVFQDV 311

Query: 968  HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD 1027
               ++ + RP    FLE  S L+E+ L+T   ++YA ++  +LDP   L   R+    + 
Sbjct: 312  TYTVFVRTRPYFREFLEHVSSLYEVILFTASKRVYANKLMNLLDPTRKLIKYRLFR--EH 369

Query: 1028 GDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
                +G+      KDL  +LG + S  VIID+S + + +   N I +E +
Sbjct: 370  CVCVNGN----YIKDL-SILGRDLSKTVIIDNSPQAFGYQLENGIPIESW 414


>gi|17509983|ref|NP_491348.1| Protein SCPL-3, isoform a [Caenorhabditis elegans]
 gi|75023288|sp|Q9N4V4.1|SCPL3_CAEEL RecName: Full=CTD small phosphatase-like protein 3; Short=CTDSP-like
            3
 gi|351059571|emb|CCD67161.1| Protein SCPL-3, isoform a [Caenorhabditis elegans]
          Length = 287

 Score = 46.2 bits (108), Expect = 0.097,   Method: Compositional matrix adjust.
 Identities = 17/43 (39%), Positives = 30/43 (69%)

Query: 971  MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
            ++ +LRP + TFL R +K FE+ ++T   K+YA ++  +LDP+
Sbjct: 101  VYVRLRPHLRTFLSRMAKTFEIIIFTASKKVYANKLCDILDPR 143


>gi|440293350|gb|ELP86476.1| carboxy-terminal domain RNA polymerase II polypeptide A small
            phosphatase, putative [Entamoeba invadens IP1]
          Length = 213

 Score = 46.2 bits (108), Expect = 0.098,   Method: Composition-based stats.
 Identities = 39/160 (24%), Positives = 70/160 (43%), Gaps = 25/160 (15%)

Query: 920  KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 979
            +L ++ DLD TL+++      D  H     K   Q++E               T +RPG 
Sbjct: 46   RLTVIFDLDETLIHTHSLLPEDSKHSRETCKVVVQNKEYT-------------TSIRPGA 92

Query: 980  WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS---RGDDGDPFDGDER 1036
              FL + SK  E+ L+T   ++YA ++   ++  G +F  ++     +   G  +    +
Sbjct: 93   IQFLRQLSKTCEVVLFTASKQVYADQIIDYMEKDGKIFEHKLYQQSCKNKFGRVYKDATK 152

Query: 1037 VPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERY 1076
            +   +D++        VVI DD   VW   +  L+V +RY
Sbjct: 153  L--GRDIKN-------VVIFDDCELVWTMTQDKLVVCKRY 183


>gi|330794863|ref|XP_003285496.1| hypothetical protein DICPUDRAFT_91512 [Dictyostelium purpureum]
 gi|325084587|gb|EGC38012.1| hypothetical protein DICPUDRAFT_91512 [Dictyostelium purpureum]
          Length = 558

 Score = 46.2 bits (108), Expect = 0.099,   Method: Compositional matrix adjust.
 Identities = 43/148 (29%), Positives = 68/148 (45%), Gaps = 37/148 (25%)

Query: 920  KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHL---FRFPHMGMWTKLR 976
            K+ LVLDLD TL++ +     +P++             +PH      F      ++ K R
Sbjct: 381  KISLVLDLDETLVHCS----TEPLN-------------QPHLIFPVFFNNTEYQVFAKKR 423

Query: 977  PGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDP----KGVLFAGRVISRGDDGDPFD 1032
            P    FL + S +FE+ ++T   ++YA ++  ++DP    K  LF    +    DG+   
Sbjct: 424  PFFEEFLHKVSTIFEVIIFTASQEVYANKLLNIIDPCKKIKHRLFRDSCVYV--DGNYL- 480

Query: 1033 GDERVPKSKDLEGVLGME-SAVVIIDDS 1059
                    KDL  VLG +   VVIID+S
Sbjct: 481  --------KDL-SVLGRDLKQVVIIDNS 499


>gi|154342859|ref|XP_001567375.1| conserved hypothetical protein [Leishmania braziliensis
            MHOM/BR/75/M2904]
 gi|134064707|emb|CAM42811.1| conserved hypothetical protein [Leishmania braziliensis
            MHOM/BR/75/M2904]
          Length = 228

 Score = 46.2 bits (108), Expect = 0.099,   Method: Composition-based stats.
 Identities = 37/134 (27%), Positives = 65/134 (48%), Gaps = 18/134 (13%)

Query: 957  EKP------HRHLFRF-----PHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATE 1005
            EKP      + + F+F     P +  +   RPG+  FL + ++ ++M ++T G  LYA  
Sbjct: 39   EKPTFFIDTNENFFQFTLEDDPSVTYYAFRRPGLSEFLHQCAEHYDMRIFTAGEDLYART 98

Query: 1006 MAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDL-EGVLGMESAVVIIDDSV--RV 1062
            + + L P  ++   R  +R  D    DG  R+ K+  + +G    E   +I+DDS    V
Sbjct: 99   LLRWLLPDNLIDESRWYTR--DACVGDGYGRLIKNLSMIDGFQFEERTALILDDSAPDNV 156

Query: 1063 WPHNKLNLIVVERY 1076
            +PH   N + + R+
Sbjct: 157  YPHQ--NALAIPRF 168


>gi|444322726|ref|XP_004182004.1| hypothetical protein TBLA_0H01990 [Tetrapisispora blattae CBS 6284]
 gi|387515050|emb|CCH62485.1| hypothetical protein TBLA_0H01990 [Tetrapisispora blattae CBS 6284]
          Length = 688

 Score = 46.2 bits (108), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 36/107 (33%), Positives = 50/107 (46%), Gaps = 29/107 (27%)

Query: 912  QKKMFSARKLCLVLDLDHTL-------LNSAKFHEVDPVHDEILRKKEEQDREKPHRHLF 964
            Q ++FS +K CL+LDLD TL       L SA F  V PV  +          E+ H    
Sbjct: 511  QNQIFSGKK-CLILDLDETLVHSSFKYLTSADF--VIPVDID----------EQIH---- 553

Query: 965  RFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLD 1011
                  ++   RPG+  FLE  SK+FE+ ++T     Y   +  VLD
Sbjct: 554  -----NVYVIKRPGVDQFLETVSKIFEVVVFTASVSRYGDPLLDVLD 595


>gi|91086797|ref|XP_973406.1| PREDICTED: similar to CTD (carboxy-terminal domain, RNA polymerase
            II, polypeptide A) small phosphatase like 2 [Tribolium
            castaneum]
 gi|270009707|gb|EFA06155.1| hypothetical protein TcasGA2_TC009000 [Tribolium castaneum]
          Length = 451

 Score = 46.2 bits (108), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 46/168 (27%), Positives = 79/168 (47%), Gaps = 36/168 (21%)

Query: 917  SARKLCLVLDLDHTL-------LNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
            S+ +  LVLDLD TL       L+ A FH   PV                   LF+    
Sbjct: 269  SSPEFSLVLDLDETLVHCSLQELSDASFHF--PV-------------------LFQDCSY 307

Query: 970  GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 1029
             ++ + RP    F+E+ S++FE+ L+T   ++YA ++  +LDP+      R+    +   
Sbjct: 308  TVYVRTRPYFREFMEKVSQMFEVILFTASKRVYADKLLNLLDPERKWIKYRLFR--EHCV 365

Query: 1030 PFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
              +G+      KDL  +LG + S  +IID+S + + ++  N I +E +
Sbjct: 366  CVNGN----YIKDL-SILGRDLSKTIIIDNSPQAFGYHLNNGIPIESW 408


>gi|66361684|ref|XP_627365.1| RNA pol II carboxy terminal domain phosphatase of the HAD superfamily
            with a BRCT domain at the C-terminus [Cryptosporidium
            parvum Iowa II]
 gi|46228744|gb|EAK89614.1| RNA pol II carboxy terminal domain phosphatase of the HAD superfamily
            with a BRCT domain at the C-terminus [Cryptosporidium
            parvum Iowa II]
          Length = 762

 Score = 46.2 bits (108), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 33/112 (29%), Positives = 56/112 (50%), Gaps = 11/112 (9%)

Query: 972  WTKLRPGIWTFLERASK-LFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1030
            + KLRPG+   L   SK  +E+++YTMG + +A    ++LDP+   F  + I   ++G  
Sbjct: 348  YYKLRPGVINMLRTLSKDKYEIYMYTMGTEYHAYTSLRILDPELRFFHSKRIFYRNNG-- 405

Query: 1031 FDGDERVPKSKDLEGVLGMES-AVVIIDDSVRVWPHNKLN-LIVVERYTYFP 1080
                 +    K L  +   +   +VI+DD  + W    +N L+ V  Y +FP
Sbjct: 406  ----FKETSIKSLNTLFPYDHRTLVILDDIEQAWT--DINSLLKVYPYNFFP 451


>gi|51013613|gb|AAT93100.1| YLR019W [Saccharomyces cerevisiae]
          Length = 397

 Score = 46.2 bits (108), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 30/100 (30%), Positives = 47/100 (47%), Gaps = 18/100 (18%)

Query: 919  RKLCLVLDLDHTLLNSA--KFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLR 976
            +K CL+LDLD TL++S+    H  D      +   E  D+              ++   R
Sbjct: 226  QKKCLILDLDETLVHSSFKYMHSAD-----FVLPVEIDDQVH-----------NVYVIKR 269

Query: 977  PGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 1016
            PG+  FL R S+L+E+ ++T     YA  +   LDP G +
Sbjct: 270  PGVDEFLNRVSQLYEVVVFTASVSRYANPLLDTLDPNGTI 309


>gi|190406060|gb|EDV09327.1| hypothetical protein SCRG_05007 [Saccharomyces cerevisiae RM11-1a]
          Length = 397

 Score = 46.2 bits (108), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 30/100 (30%), Positives = 47/100 (47%), Gaps = 18/100 (18%)

Query: 919  RKLCLVLDLDHTLLNSA--KFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLR 976
            +K CL+LDLD TL++S+    H  D      +   E  D+              ++   R
Sbjct: 226  QKKCLILDLDETLVHSSFKYMHSAD-----FVLPVEIDDQVH-----------NVYVIKR 269

Query: 977  PGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 1016
            PG+  FL R S+L+E+ ++T     YA  +   LDP G +
Sbjct: 270  PGVDEFLNRVSQLYEVVVFTASVSRYANPLLDTLDPNGTI 309


>gi|6323047|ref|NP_013119.1| Psr2p [Saccharomyces cerevisiae S288c]
 gi|55583862|sp|Q07949.1|PSR2_YEAST RecName: Full=Probable phosphatase PSR2; AltName: Full=Plasma
            membrane sodium response protein 2
 gi|1360322|emb|CAA97541.1| unnamed protein product [Saccharomyces cerevisiae]
 gi|151941187|gb|EDN59565.1| protein phosphatase [Saccharomyces cerevisiae YJM789]
 gi|207343198|gb|EDZ70734.1| YLR019Wp-like protein [Saccharomyces cerevisiae AWRI1631]
 gi|256269170|gb|EEU04502.1| Psr2p [Saccharomyces cerevisiae JAY291]
 gi|285813441|tpg|DAA09337.1| TPA: Psr2p [Saccharomyces cerevisiae S288c]
 gi|323332484|gb|EGA73892.1| Psr2p [Saccharomyces cerevisiae AWRI796]
 gi|323353905|gb|EGA85758.1| Psr2p [Saccharomyces cerevisiae VL3]
          Length = 397

 Score = 46.2 bits (108), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 30/100 (30%), Positives = 47/100 (47%), Gaps = 18/100 (18%)

Query: 919  RKLCLVLDLDHTLLNSA--KFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLR 976
            +K CL+LDLD TL++S+    H  D      +   E  D+              ++   R
Sbjct: 226  QKKCLILDLDETLVHSSFKYMHSAD-----FVLPVEIDDQVH-----------NVYVIKR 269

Query: 977  PGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 1016
            PG+  FL R S+L+E+ ++T     YA  +   LDP G +
Sbjct: 270  PGVDEFLNRVSQLYEVVVFTASVSRYANPLLDTLDPNGTI 309


>gi|323336572|gb|EGA77838.1| Psr2p [Saccharomyces cerevisiae Vin13]
          Length = 398

 Score = 46.2 bits (108), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 30/100 (30%), Positives = 47/100 (47%), Gaps = 18/100 (18%)

Query: 919  RKLCLVLDLDHTLLNSA--KFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLR 976
            +K CL+LDLD TL++S+    H  D      +   E  D+              ++   R
Sbjct: 227  QKKCLILDLDETLVHSSFKYMHSAD-----FVLPVEIDDQVH-----------NVYVIKR 270

Query: 977  PGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 1016
            PG+  FL R S+L+E+ ++T     YA  +   LDP G +
Sbjct: 271  PGVDEFLNRVSQLYEVVVFTASVSRYANPLLDTLDPNGTI 310


>gi|401411253|ref|XP_003885074.1| hypothetical protein NCLIV_054710 [Neospora caninum Liverpool]
 gi|325119493|emb|CBZ55046.1| hypothetical protein NCLIV_054710 [Neospora caninum Liverpool]
          Length = 630

 Score = 46.2 bits (108), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 34/98 (34%), Positives = 51/98 (52%), Gaps = 16/98 (16%)

Query: 920  KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR-HLFRFPHMGMWTKLRPG 978
            +  LVLDLD TL++S+ F  V      I  + E     KPH+ H+ +          RPG
Sbjct: 470  RTTLVLDLDETLVHSS-FRPVSVAAFVITVEVEG----KPHKIHVCK----------RPG 514

Query: 979  IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 1016
            +  FLE  S L+E+ ++T   + YA  +  +LDPKG+ 
Sbjct: 515  VDRFLEVVSSLYEVVIFTASLQTYADPLIDLLDPKGLC 552


>gi|170587764|ref|XP_001898644.1| NLI interacting factor-like phosphatase family protein [Brugia
            malayi]
 gi|158593914|gb|EDP32508.1| NLI interacting factor-like phosphatase family protein [Brugia
            malayi]
          Length = 314

 Score = 46.2 bits (108), Expect = 0.11,   Method: Composition-based stats.
 Identities = 54/207 (26%), Positives = 93/207 (44%), Gaps = 33/207 (15%)

Query: 917  SARKLCLVLDLDHTLLNSAKFHEVD-----PVHDEILRKKEEQDREKPHRHLFRFPHMGM 971
            S  +  LVLDLD TL++ +     D     PVH                   F+     +
Sbjct: 126  STPEFSLVLDLDETLVHCSLTELPDASLTFPVH-------------------FQENTYQV 166

Query: 972  WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPF 1031
            + ++RP +  FLER S+ FE+ L+T   ++YA ++  +LDP   L   R+     +   F
Sbjct: 167  YVRVRPHLQEFLERLSRSFEIILFTASKRVYADKLLNLLDPGKRLIRHRLFR---EHCVF 223

Query: 1032 DGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLG 1090
                 +   KDL  +LG + S  +IID+S++ + +   N I +E + +F    ++   L 
Sbjct: 224  VYGNYI---KDLT-ILGRDLSKTIIIDNSLQSFAYQIDNGIPIESW-FFQQDDQELLKLI 278

Query: 1091 PSLLEIDHDERSEDGTLASSLGVRQQL 1117
            P L +I + +      L +   +R  L
Sbjct: 279  PFLEQITNQKNDVRHILRARYRIRDLL 305


>gi|349579745|dbj|GAA24906.1| K7_Psr2p [Saccharomyces cerevisiae Kyokai no. 7]
          Length = 397

 Score = 46.2 bits (108), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 30/100 (30%), Positives = 47/100 (47%), Gaps = 18/100 (18%)

Query: 919  RKLCLVLDLDHTLLNSA--KFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLR 976
            +K CL+LDLD TL++S+    H  D      +   E  D+              ++   R
Sbjct: 226  QKKCLILDLDETLVHSSFKYMHSAD-----FVLPVEIDDQVH-----------NVYVIKR 269

Query: 977  PGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 1016
            PG+  FL R S+L+E+ ++T     YA  +   LDP G +
Sbjct: 270  PGVDEFLNRVSQLYEVVVFTASVSRYANPLLDTLDPNGTI 309


>gi|328767798|gb|EGF77846.1| hypothetical protein BATDEDRAFT_13622 [Batrachochytrium dendrobatidis
            JAM81]
          Length = 192

 Score = 46.2 bits (108), Expect = 0.12,   Method: Composition-based stats.
 Identities = 33/101 (32%), Positives = 51/101 (50%), Gaps = 16/101 (15%)

Query: 913  KKMFSARKLCLVLDLDHTLLNSAKFHEVDPV-HDEILRKKEEQDREKPHRHLFRFPHMGM 971
            KK  S+  + LVLDLD TL++ +      P+ H +I    E           F      +
Sbjct: 23   KKTRSSPPITLVLDLDETLVHCS----TSPLDHCDITFPVE-----------FNNITYTV 67

Query: 972  WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDP 1012
              +LRP   TFLER S++FE+ ++T   K+YA  +  ++DP
Sbjct: 68   SGRLRPHYKTFLERCSEIFEVVVFTASQKIYADRLLNIIDP 108


>gi|323308065|gb|EGA61318.1| Psr2p [Saccharomyces cerevisiae FostersO]
          Length = 319

 Score = 46.2 bits (108), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 33/103 (32%), Positives = 50/103 (48%), Gaps = 24/103 (23%)

Query: 919  RKLCLVLDLDHTLLNSA--KFHEVD---PVHDEILRKKEEQDREKPHRHLFRFPHMGMWT 973
            +K CL+LDLD TL++S+    H  D   PV  EI         ++ H          ++ 
Sbjct: 226  QKKCLILDLDETLVHSSFKYMHSADFVLPV--EI--------DDQVH---------NVYV 266

Query: 974  KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 1016
              RPG+  FL R S+L+E+ ++T     YA  +   LDP G +
Sbjct: 267  IKRPGVDEFLNRVSQLYEVVVFTASVSRYANPLLDTLDPNGTI 309


>gi|392297996|gb|EIW09095.1| Psr2p [Saccharomyces cerevisiae CEN.PK113-7D]
          Length = 397

 Score = 45.8 bits (107), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 30/100 (30%), Positives = 47/100 (47%), Gaps = 18/100 (18%)

Query: 919  RKLCLVLDLDHTLLNSA--KFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLR 976
            +K CL+LDLD TL++S+    H  D      +   E  D+              ++   R
Sbjct: 226  QKKCLILDLDETLVHSSFKYMHSAD-----FVLPVEIDDQVH-----------NVYVIKR 269

Query: 977  PGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 1016
            PG+  FL R S+L+E+ ++T     YA  +   LDP G +
Sbjct: 270  PGVDEFLNRVSQLYEVVVFTASVSRYANPLLDTLDPNGTI 309


>gi|259148008|emb|CAY81257.1| Psr2p [Saccharomyces cerevisiae EC1118]
          Length = 397

 Score = 45.8 bits (107), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 30/100 (30%), Positives = 47/100 (47%), Gaps = 18/100 (18%)

Query: 919  RKLCLVLDLDHTLLNSA--KFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLR 976
            +K CL+LDLD TL++S+    H  D      +   E  D+              ++   R
Sbjct: 226  QKKCLILDLDETLVHSSFKYMHSAD-----FVLPVEIDDQVH-----------NVYVIKR 269

Query: 977  PGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 1016
            PG+  FL R S+L+E+ ++T     YA  +   LDP G +
Sbjct: 270  PGVDEFLNRVSQLYEVVVFTASVSRYANPLLDTLDPNGTI 309


>gi|407043726|gb|EKE42114.1| NLI interacting factor family phosphatase domain containing protein
            [Entamoeba nuttalli P19]
          Length = 428

 Score = 45.8 bits (107), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 36/128 (28%), Positives = 62/128 (48%), Gaps = 22/128 (17%)

Query: 974  KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDP-KGVLF-AGRVISR------- 1024
            + R GI TF+E+ SKL+++H+ T+G K YA  +   ++  + V F  G +++        
Sbjct: 110  RFRDGIVTFMEKVSKLYDIHVVTLGQKEYAFAIVNAINKLRDVPFITGDLVTAEDCSSVI 169

Query: 1025 -GDDGDPFDG-------DERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERY 1076
              D+ D  DG       +ER    + +   +G E   VI+DD + VW +      VV+  
Sbjct: 170  VCDEKDTNDGLIDREETNERRSVKRSI-PTMGKEEMQVIVDDRIDVWDNKN----VVQIC 224

Query: 1077 TYFPCSRR 1084
             Y P + +
Sbjct: 225  EYVPSTNQ 232


>gi|158293726|ref|XP_315066.4| AGAP004967-PA [Anopheles gambiae str. PEST]
 gi|157016584|gb|EAA10342.4| AGAP004967-PA [Anopheles gambiae str. PEST]
          Length = 226

 Score = 45.8 bits (107), Expect = 0.13,   Method: Composition-based stats.
 Identities = 48/164 (29%), Positives = 78/164 (47%), Gaps = 22/164 (13%)

Query: 914  KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWT 973
            K  S+ +  LVLDLD TL           VH  ++   E  D       LF+     ++ 
Sbjct: 41   KTRSSPEFSLVLDLDETL-----------VHCSLM---ELSDASFKFPVLFQECKYTVFV 86

Query: 974  KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDG 1033
            + RP    FLER S++FE+ L+T   ++YA ++  +LDP   L   R+    +     +G
Sbjct: 87   RTRPYFREFLERVSQMFEVILFTASKRVYADKLLNLLDPDRRLIKYRLFR--EHCVLVNG 144

Query: 1034 DERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
            +      KDL  +LG + S  +IID+S + + +   N I +E +
Sbjct: 145  N----YIKDLT-ILGRDLSKTIIIDNSPQAFGYQLENGIPIESW 183


>gi|365985822|ref|XP_003669743.1| hypothetical protein NDAI_0D01860 [Naumovozyma dairenensis CBS 421]
 gi|343768512|emb|CCD24500.1| hypothetical protein NDAI_0D01860 [Naumovozyma dairenensis CBS 421]
          Length = 514

 Score = 45.8 bits (107), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 33/111 (29%), Positives = 47/111 (42%), Gaps = 28/111 (25%)

Query: 919  RKLCLVLDLDHTLLNS-------AKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGM 971
             K CLVLDLD TL++S       A F+    + D+I                       +
Sbjct: 342  HKKCLVLDLDETLVHSSFKYLPNADFNLPVNIDDQI---------------------HNV 380

Query: 972  WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVI 1022
            +   RPG+  FLE+  KLFE+ ++T     Y   +   LDPKG     R+ 
Sbjct: 381  YVIKRPGVDEFLEKVGKLFEVVIFTASVSRYGDPLLDRLDPKGKSIHHRLF 431


>gi|145523063|ref|XP_001447370.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124414881|emb|CAK79973.1| unnamed protein product [Paramecium tetraurelia]
          Length = 336

 Score = 45.8 bits (107), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 41/154 (26%), Positives = 73/154 (47%), Gaps = 21/154 (13%)

Query: 923  LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTF 982
            LV+DLD TL++S+   E   ++D I+    +  + K            ++  +RPG   F
Sbjct: 48   LVIDLDETLVHSS--FEPMKINDLIVEVTMKDQKYK------------IYVNIRPGAQEF 93

Query: 983  LERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKD 1042
            ++  SKLFE+ ++T     YA  +   +DP G L   R+    ++   ++G       KD
Sbjct: 94   IKETSKLFELIIFTASISEYANSVIDFIDPHG-LVDLRLFR--ENCTVYNG----VLVKD 146

Query: 1043 LEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERY 1076
            L  +     +V++ID+SV  +    +N I +  Y
Sbjct: 147  LSLLKRNLDSVILIDNSVNSFMFQPMNAIHILNY 180


>gi|221487382|gb|EEE25614.1| protein phosphatase, putative [Toxoplasma gondii GT1]
          Length = 621

 Score = 45.8 bits (107), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 35/98 (35%), Positives = 53/98 (54%), Gaps = 16/98 (16%)

Query: 920  KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR-HLFRFPHMGMWTKLRPG 978
            +  LVLDLD TL++S+ F  V PV    +  + E    KPH  H+ +          RPG
Sbjct: 443  RTTLVLDLDETLVHSS-FRPV-PVSAFAITVEVEG---KPHTIHVCK----------RPG 487

Query: 979  IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 1016
            +  FLE  S+L+E+ ++T   + YA  +  +LDPKG+ 
Sbjct: 488  VDRFLEVVSRLYEVVIFTASLQTYADPLIDLLDPKGLC 525


>gi|308811648|ref|XP_003083132.1| TFIIF-interacting CTD phosphatase, including NLI-interacting factor
            (involved in RNA polymerase II regulation) (ISS)
            [Ostreococcus tauri]
 gi|116055010|emb|CAL57087.1| TFIIF-interacting CTD phosphatase, including NLI-interacting factor
            (involved in RNA polymerase II regulation) (ISS)
            [Ostreococcus tauri]
          Length = 485

 Score = 45.8 bits (107), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 47/156 (30%), Positives = 72/156 (46%), Gaps = 21/156 (13%)

Query: 922  CLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWT 981
             LVLDLD TL           VH  +     + D   P   +F      +  + RP + T
Sbjct: 288  TLVLDLDETL-----------VHSNLENTGGKSDFSFPV--VFNGEIHQVNVRTRPHLQT 334

Query: 982  FLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSK 1041
            F+E  SK +E+ ++T   ++YA ++  +LDPK    A RV    D     +G+      K
Sbjct: 335  FMETVSKKYEIVVFTASQQIYADKLLDLLDPKREWIAHRVFR--DSCVQIEGN----YMK 388

Query: 1042 DLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
            DL  VLG + S  +IID+S + +     N I +E +
Sbjct: 389  DLR-VLGRDLSKTIIIDNSPQAFGLQVENGIPIESW 423


>gi|393909936|gb|EFO24836.2| SCP small domain phosphatase [Loa loa]
          Length = 321

 Score = 45.8 bits (107), Expect = 0.15,   Method: Composition-based stats.
 Identities = 54/207 (26%), Positives = 93/207 (44%), Gaps = 33/207 (15%)

Query: 917  SARKLCLVLDLDHTLLNSAKFHEVD-----PVHDEILRKKEEQDREKPHRHLFRFPHMGM 971
            S  +  LVLDLD TL++ +     D     PVH                   F+     +
Sbjct: 133  STPEFSLVLDLDETLVHCSLTELPDASLTFPVH-------------------FQENTYQV 173

Query: 972  WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPF 1031
            + ++RP +  FLER S+ FE+ L+T   ++YA ++  +LDP   L   R+     +   F
Sbjct: 174  YVRVRPHLQEFLERLSRSFEIILFTASKRIYADKLLNLLDPGKRLIRHRLFR---EHCVF 230

Query: 1032 DGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLG 1090
                 +   KDL  +LG + S  +IID+S++ + +   N I +E + +F    ++   L 
Sbjct: 231  VYGNYI---KDLT-ILGRDLSKTIIIDNSLQSFAYQIDNGIPIESW-FFQQDDQELLKLI 285

Query: 1091 PSLLEIDHDERSEDGTLASSLGVRQQL 1117
            P L +I + +      L +   +R  L
Sbjct: 286  PFLEQITNQKNDVRHILRARYRIRDLL 312


>gi|339237973|ref|XP_003380541.1| nuclear envelope morphology protein 1 [Trichinella spiralis]
 gi|316976534|gb|EFV59811.1| nuclear envelope morphology protein 1 [Trichinella spiralis]
          Length = 281

 Score = 45.8 bits (107), Expect = 0.15,   Method: Composition-based stats.
 Identities = 51/172 (29%), Positives = 81/172 (47%), Gaps = 33/172 (19%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
            E+ KK+++A     VLDLD TL++S              R K + D   P   +   P  
Sbjct: 97   EKSKKLYTA-----VLDLDQTLVHS--------------RSKRKGD---PRYKIVNIPQA 134

Query: 970  G--MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATE-MAKVLDPKGVLFAGRVISRGD 1026
                +T +RP    FLE  S+ +E+ L+T G   YA   + +++DP+   F+        
Sbjct: 135  TRRFYTAVRPCCAEFLESISEFYEVILFTAGTPRYAAAVIDQLVDPEHKYFSN--FYYRP 192

Query: 1027 DGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYT 1077
            D  P D +      KDL  +LG + S  VI+DD++  +  +  N I+VE +T
Sbjct: 193  DCAPVDHE----FVKDLS-ILGRDLSKTVIMDDNMMSFCCHIDNGILVEPWT 239


>gi|237830029|ref|XP_002364312.1| NLI interacting factor-like phosphatase domain-containing protein
            [Toxoplasma gondii ME49]
 gi|211961976|gb|EEA97171.1| NLI interacting factor-like phosphatase domain-containing protein
            [Toxoplasma gondii ME49]
 gi|221507180|gb|EEE32784.1| dullard protein, putative [Toxoplasma gondii VEG]
          Length = 621

 Score = 45.4 bits (106), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 35/98 (35%), Positives = 53/98 (54%), Gaps = 16/98 (16%)

Query: 920  KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR-HLFRFPHMGMWTKLRPG 978
            +  LVLDLD TL++S+ F  V PV    +  + E    KPH  H+ +          RPG
Sbjct: 443  RTTLVLDLDETLVHSS-FRPV-PVSAFAITVEVEG---KPHTIHVCK----------RPG 487

Query: 979  IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 1016
            +  FLE  S+L+E+ ++T   + YA  +  +LDPKG+ 
Sbjct: 488  VDRFLEVVSRLYEVVIFTASLQTYADPLIDLLDPKGLC 525


>gi|401426731|ref|XP_003877849.1| conserved hypothetical protein [Leishmania mexicana
            MHOM/GT/2001/U1103]
 gi|322494096|emb|CBZ29393.1| conserved hypothetical protein [Leishmania mexicana
            MHOM/GT/2001/U1103]
          Length = 231

 Score = 45.4 bits (106), Expect = 0.18,   Method: Composition-based stats.
 Identities = 33/113 (29%), Positives = 57/113 (50%), Gaps = 7/113 (6%)

Query: 967  PHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD 1026
            P +  +   RPG+  FL + ++ ++M ++T G  LYA  + + L P  ++   R  +R  
Sbjct: 60   PSVTYYAFRRPGLNEFLYQCAEHYDMRIFTAGEDLYARTLLRWLLPANLIDESRWYTR-- 117

Query: 1027 DGDPFDGDERVPKSKD-LEGVLGMESAVVIIDDSV--RVWPHNKLNLIVVERY 1076
            D    DG  R+ K+   L+G    E   +I+DDS    V+PH   N + + R+
Sbjct: 118  DACVGDGYGRLIKNLSMLDGFQFEERTALILDDSAPDNVYPHQ--NALAIPRF 168


>gi|312072812|ref|XP_003139236.1| SCP small domain phosphatase [Loa loa]
          Length = 321

 Score = 45.4 bits (106), Expect = 0.18,   Method: Composition-based stats.
 Identities = 54/207 (26%), Positives = 93/207 (44%), Gaps = 33/207 (15%)

Query: 917  SARKLCLVLDLDHTLLNSAKFHEVD-----PVHDEILRKKEEQDREKPHRHLFRFPHMGM 971
            S  +  LVLDLD TL++ +     D     PVH                   F+     +
Sbjct: 133  STPEFSLVLDLDETLVHCSLTELPDASLTFPVH-------------------FQENTYQV 173

Query: 972  WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPF 1031
            + ++RP +  FLER S+ FE+ L+T   ++YA ++  +LDP   L   R+     +   F
Sbjct: 174  YVRVRPHLQEFLERLSRSFEIILFTASKRIYADKLLNLLDPGKRLIRHRLFR---EHCVF 230

Query: 1032 DGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLG 1090
                 +   KDL  +LG + S  +IID+S++ + +   N I +E + +F    ++   L 
Sbjct: 231  VYGNYI---KDLT-ILGRDLSKTIIIDNSLQSFAYQIDNGIPIESW-FFQQDDQELLKLI 285

Query: 1091 PSLLEIDHDERSEDGTLASSLGVRQQL 1117
            P L +I + +      L +   +R  L
Sbjct: 286  PFLEQITNQKNDVRHILRARYRIRDLL 312


>gi|341894763|gb|EGT50698.1| hypothetical protein CAEBREN_25349 [Caenorhabditis brenneri]
          Length = 250

 Score = 45.4 bits (106), Expect = 0.19,   Method: Composition-based stats.
 Identities = 30/108 (27%), Positives = 56/108 (51%), Gaps = 14/108 (12%)

Query: 914  KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWT 973
            K  ++ +  LVLDLD TL++ +    + P+ +  +              +F+     ++ 
Sbjct: 22   KTRASAEYTLVLDLDETLVHCS----LTPLDNATM----------IFPVVFQNITYQVYV 67

Query: 974  KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
            +LRP + TFL R +K FE+ ++T   K+YA ++  +LDP+  L   R+
Sbjct: 68   RLRPHLRTFLNRMAKTFEIIIFTASKKVYANKLCDILDPRKNLIRHRL 115


>gi|146096062|ref|XP_001467692.1| conserved hypothetical protein [Leishmania infantum JPCM5]
 gi|398020532|ref|XP_003863429.1| hypothetical protein, conserved [Leishmania donovani]
 gi|134072058|emb|CAM70757.1| conserved hypothetical protein [Leishmania infantum JPCM5]
 gi|322501662|emb|CBZ36743.1| hypothetical protein, conserved [Leishmania donovani]
          Length = 231

 Score = 45.4 bits (106), Expect = 0.19,   Method: Composition-based stats.
 Identities = 33/113 (29%), Positives = 57/113 (50%), Gaps = 7/113 (6%)

Query: 967  PHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD 1026
            P +  +   RPG+  FL + ++ ++M ++T G  LYA  + + L P  ++   R  +R  
Sbjct: 60   PSVTYYAFRRPGLNEFLYQCAEHYDMRIFTAGEDLYARTLLRWLLPDNLIDESRWYTR-- 117

Query: 1027 DGDPFDGDERVPKSKD-LEGVLGMESAVVIIDDSV--RVWPHNKLNLIVVERY 1076
            D    DG  R+ K+   L+G    E   +I+DDS    V+PH   N + + R+
Sbjct: 118  DACVGDGYGRLIKNLSMLDGFQFEERTALILDDSAPDNVYPHQ--NALAIPRF 168


>gi|254578106|ref|XP_002495039.1| ZYRO0B01958p [Zygosaccharomyces rouxii]
 gi|238937929|emb|CAR26106.1| ZYRO0B01958p [Zygosaccharomyces rouxii]
          Length = 336

 Score = 45.4 bits (106), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 49/174 (28%), Positives = 76/174 (43%), Gaps = 20/174 (11%)

Query: 907  RRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRF 966
            ++L  Q  +F+ RK  LV+DLD TL++SA             R     +  + H    RF
Sbjct: 140  KKLVPQSILFAERKKRLVVDLDETLIHSAT------------RSVSHSNSAQGHMVEVRF 187

Query: 967  PHMGM----WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVI 1022
            P   +    +   RP    FL + SK +++ ++T   K YA  +   L+     F G+  
Sbjct: 188  PPSSISTLYYVHKRPHCDLFLSKVSKWYDLIIFTASMKEYADPVIDWLESS---FTGKFC 244

Query: 1023 SRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERY 1076
             R    +     E V   KDL  V  +   VV+ID+S   +  N+ N I VE +
Sbjct: 245  KRLYRHNCV-VREGVGYIKDLSVVTEVLDEVVLIDNSPTSYARNEDNAIQVEGW 297


>gi|403368592|gb|EJY84135.1| Putative tfiif-interacting component of the c-terminal domain
            phosphatase [Oxytricha trifallax]
          Length = 525

 Score = 45.1 bits (105), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 63/243 (25%), Positives = 91/243 (37%), Gaps = 86/243 (35%)

Query: 914  KMFSARKLCLVLDLDHTLLNSAKFHE-------VDPVHDEILRKKEEQDREKPHRHLFRF 966
            ++   RKL LVLDLD+TLL++    E        DP    ++          P + ++  
Sbjct: 3    QLLQDRKLVLVLDLDNTLLHTKSIEEREFQTKSRDPTFINLI---------DPLKSIYEI 53

Query: 967  PHM--GMWTKLRPGIWTFLERA--SKLFEMHLYTMGNK----------------LYATEM 1006
                 G  TKLRP ++ FL++    + FE++ YT G K                L+  E 
Sbjct: 54   KLFRGGFHTKLRPFLFEFLKKVFDERKFEIYFYTAGTKDYGMLIIDIFKMEITRLFGKEY 113

Query: 1007 AKVLDPKGVLFAGRVISRGDD---------------------------------GDPFDG 1033
            AK +  +  L   ++ISR D                                  GD    
Sbjct: 114  AKQISEE--LSHRKLISRCDKERFANKNSSNEIDIDSMQQQLYQQIENQQMGQGGDATLN 171

Query: 1034 DERVPKS-KDLEGVLGMESAVVIIDDSVRVWPH-------NKL-----NLIVVERYTYFP 1080
             +   KS   L G  G ES  +IIDD   VW         NKL     NLI++  Y Y+ 
Sbjct: 172  MKHFIKSLSSLAG--GDESIFIIIDDRSDVWTEEVKDQNGNKLRRVSDNLILIPEYFYWE 229

Query: 1081 CSR 1083
             S+
Sbjct: 230  TSQ 232


>gi|321470826|gb|EFX81801.1| hypothetical protein DAPPUDRAFT_49973 [Daphnia pulex]
          Length = 237

 Score = 45.1 bits (105), Expect = 0.24,   Method: Composition-based stats.
 Identities = 30/100 (30%), Positives = 49/100 (49%), Gaps = 14/100 (14%)

Query: 914  KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWT 973
            K  S+    LVLDLD TL++ +               +E +D        F+     ++ 
Sbjct: 52   KTRSSPTFSLVLDLDETLVHCSL--------------EELEDAAFSFPVFFQDTTYQVFV 97

Query: 974  KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
            + RP    FLER S++FE+ L+T   K+YA ++  +LDP+
Sbjct: 98   RTRPHFREFLERVSQIFEVILFTASKKVYADKLLNLLDPQ 137


>gi|145516326|ref|XP_001444057.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124411457|emb|CAK76660.1| unnamed protein product [Paramecium tetraurelia]
          Length = 411

 Score = 45.1 bits (105), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 24/81 (29%), Positives = 46/81 (56%), Gaps = 5/81 (6%)

Query: 971  MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1030
            ++  +RP    FL++ S L+ +++YT  +  YA  + K LDPKG   +G ++SR +  + 
Sbjct: 266  IYLNVRPFCQWFLQQMSLLYTIYVYTASSSAYANTIVKYLDPKGQWISG-ILSRQNCLET 324

Query: 1031 FDG----DERVPKSKDLEGVL 1047
             +G    D R+  +K ++ +L
Sbjct: 325  KNGFYIKDLRIIANKQIKNML 345


>gi|341876625|gb|EGT32560.1| hypothetical protein CAEBREN_01530 [Caenorhabditis brenneri]
          Length = 286

 Score = 45.1 bits (105), Expect = 0.25,   Method: Composition-based stats.
 Identities = 30/108 (27%), Positives = 56/108 (51%), Gaps = 14/108 (12%)

Query: 914  KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWT 973
            K  ++ +  LVLDLD TL++ +    + P+ +  +              +F+     ++ 
Sbjct: 58   KTRASAEYTLVLDLDETLVHCS----LTPLDNATM----------IFPVVFQNITYQVYV 103

Query: 974  KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
            +LRP + TFL R +K FE+ ++T   K+YA ++  +LDP+  L   R+
Sbjct: 104  RLRPHLRTFLNRMAKTFEIIIFTASKKVYANKLCDILDPRKNLIRHRL 151


>gi|399218895|emb|CCF75782.1| unnamed protein product [Babesia microti strain RI]
          Length = 460

 Score = 45.1 bits (105), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 34/102 (33%), Positives = 49/102 (48%), Gaps = 8/102 (7%)

Query: 920  KLCLVLDLDHTLLNSAKFHEVDPVH-DEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPG 978
            KL +VLD+D TL++    H    VH D  +   E    E P   +    H  M+  LRPG
Sbjct: 199  KLLVVLDMDETLIH---MHTNPHVHYDYTINLLEHSKSEDP-MFISCNIHPTMYISLRPG 254

Query: 979  IWTFLERAS---KLFEMHLYTMGNKLYATEMAKVLDPKGVLF 1017
            +  FL   S     +E+ L+T G +LYA  + + LDP   + 
Sbjct: 255  VKEFLRYLSVNSDFYEVALFTAGTQLYADAVLEGLDPNCTII 296


>gi|126644240|ref|XP_001388239.1| hypothetical protein [Cryptosporidium parvum Iowa II]
 gi|126117312|gb|EAZ51412.1| hypothetical protein cgd2_3810 [Cryptosporidium parvum Iowa II]
          Length = 475

 Score = 45.1 bits (105), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 40/151 (26%), Positives = 67/151 (44%), Gaps = 18/151 (11%)

Query: 863  QTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKAAIQKE-RTRRLEEQKKMFSARKL 921
            + G       G  G   +S  GD E +   YD+  ++ ++   +   LE Q++ +  RK 
Sbjct: 285  EAGIYDKDNLGRNGFSLRSITGDRESII--YDEDYESYLESTIKEPFLEPQRREYIGRK- 341

Query: 922  CLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWT 981
             LVLDLD TL++S+      P+ +       E D +          +  ++   RPG+  
Sbjct: 342  TLVLDLDETLIHSS----FQPIRNASFTINIEIDGD----------YYDVYVLKRPGVDK 387

Query: 982  FLERASKLFEMHLYTMGNKLYATEMAKVLDP 1012
            FL   S +FE+ ++T     YA  +   LDP
Sbjct: 388  FLNIVSAIFEVVIFTASLSKYANPLLDRLDP 418


>gi|219109563|ref|XP_002176536.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217411071|gb|EEC50999.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 809

 Score = 45.1 bits (105), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 35/126 (27%), Positives = 58/126 (46%), Gaps = 26/126 (20%)

Query: 901  IQKERTRRLEEQ--KKMFSARKLCLVLDLDHTLL---NSAKFHEVDPVHDEI-------L 948
            + +   +R+ +Q  +++   +KL LVLDLDHTL+   N  +  +     D++       L
Sbjct: 236  VSRAEGQRMAQQDAERLQKRKKLSLVLDLDHTLVHATNDTRAQQFCKSRDDVRTLILPML 295

Query: 949  RKKEEQDREKPHRHLFRFPHMGMWT----KLRPGIWTFLERASKLFEMHLYTMGNKLYAT 1004
            R   E           R P    WT    K+RP +  FL  A   +E+ +YT G + YA 
Sbjct: 296  RPNGEP----------RQPQHPEWTQHFVKMRPHVEVFLNEAQDQYEIGVYTAGTRDYAE 345

Query: 1005 EMAKVL 1010
            ++  +L
Sbjct: 346  QICILL 351


>gi|159483225|ref|XP_001699661.1| hypothetical protein CHLREDRAFT_111940 [Chlamydomonas reinhardtii]
 gi|158281603|gb|EDP07357.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 202

 Score = 45.1 bits (105), Expect = 0.26,   Method: Composition-based stats.
 Identities = 37/124 (29%), Positives = 55/124 (44%), Gaps = 28/124 (22%)

Query: 912  QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPH-MG 970
            Q++   A KLC++LDLD TL++S                     R  P    +   H +G
Sbjct: 2    QQRPEHAGKLCVLLDLDGTLVSSYT------------------PRRAPRLPSYVRTHVVG 43

Query: 971  MWTKL---------RPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
            M +KL         RPG+  FLE  +   E+ ++T G + YA  +   +DP   LFA R+
Sbjct: 44   MGSKLNPAGVFVVERPGLTEFLEELATFAEVIIFTAGLEDYAKPIIDAIDPSNRLFAHRI 103

Query: 1022 ISRG 1025
               G
Sbjct: 104  YREG 107


>gi|340508012|gb|EGR33824.1| NLI interacting factor-like phosphatase family protein, putative
            [Ichthyophthirius multifiliis]
          Length = 222

 Score = 44.7 bits (104), Expect = 0.27,   Method: Composition-based stats.
 Identities = 35/126 (27%), Positives = 67/126 (53%), Gaps = 17/126 (13%)

Query: 923  LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTF 982
            L+LDLD TL++S   +E +P  D ++  +EE + +K  +  FR         +RP    F
Sbjct: 61   LLLDLDETLIHSCGLNE-NP--DAVIMAQEEYNSQKQFQIAFR---------IRPYCIEF 108

Query: 983  LERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDG----DERVP 1038
            L++ SK ++++++T  +  YA  +   LD +   +  +V++R +  +  +G    D R+ 
Sbjct: 109  LQQVSKYWDIYVFTASSASYANAIVNYLDSQQE-YIHQVLTRQNCMETKNGFFIKDLRII 167

Query: 1039 KSKDLE 1044
            K  DL+
Sbjct: 168  KDIDLQ 173


>gi|344253634|gb|EGW09738.1| Carboxy-terminal domain RNA polymerase II polypeptide A small
            phosphatase 1 [Cricetulus griseus]
          Length = 354

 Score = 44.7 bits (104), Expect = 0.28,   Method: Compositional matrix adjust.
 Identities = 36/127 (28%), Positives = 59/127 (46%), Gaps = 15/127 (11%)

Query: 896  QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 955
            ++  AI K   + L  + K   + K+C+V+DLD TL++S+      PV++       E D
Sbjct: 159  EENGAIPKTPVQYLLPEAKAQDSDKICVVIDLDETLVHSS----FKPVNNADFIIPVEID 214

Query: 956  REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
                  ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  G 
Sbjct: 215  GVIHQVYVLKRPHVD----------EFLQRMGELFECVLFTASLAKYADPVADLLDKWGA 264

Query: 1016 LFAGRVI 1022
             F  R+ 
Sbjct: 265  -FRARLF 270


>gi|157870945|ref|XP_001684022.1| nuclear lim interactor-interacting factor-like protein [Leishmania
            major strain Friedlin]
 gi|68127090|emb|CAJ04515.1| nuclear lim interactor-interacting factor-like protein [Leishmania
            major strain Friedlin]
          Length = 290

 Score = 44.7 bits (104), Expect = 0.28,   Method: Compositional matrix adjust.
 Identities = 42/160 (26%), Positives = 70/160 (43%), Gaps = 18/160 (11%)

Query: 917  SARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLR 976
            S  K+ LVLD+D TL++S      D V+D++L    E                 +  K R
Sbjct: 109  SVPKVTLVLDVDETLVHSTFQPSSDVVYDKVLLVPSEGK------------TYTVSVKYR 156

Query: 977  PGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDER 1036
            P +  FL   S+ FE+ ++T   + Y  ++   +DP G+L   R+     +   F     
Sbjct: 157  PYLEDFLRFVSRRFEVVIFTASMRAYCDKLMDEIDPHGILGNLRLFR---EHCTFSERSY 213

Query: 1037 VPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERY 1076
            V   KDL  +      VVI+D+S   +   + N I ++ +
Sbjct: 214  V---KDLHRLGRDLRRVVILDNSPAAYSFQQRNAIPIKTW 250


>gi|281210104|gb|EFA84272.1| CTD small phosphatase-like protein 2 [Polysphondylium pallidum PN500]
          Length = 539

 Score = 44.7 bits (104), Expect = 0.29,   Method: Compositional matrix adjust.
 Identities = 43/148 (29%), Positives = 71/148 (47%), Gaps = 37/148 (25%)

Query: 920  KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLF-RFPHM--GMWTKLR 976
            K+ LVLDLD TL++ +     +P+             ++P    F  F ++   ++ K R
Sbjct: 362  KISLVLDLDETLVHCS----TEPI-------------DEPDLTFFVTFNNVEYKVFAKKR 404

Query: 977  PGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDP----KGVLFAGRVISRGDDGDPFD 1032
            P    FL +AS LFE+ ++T   ++YA ++  ++DP    K  L+    +         D
Sbjct: 405  PFFEDFLSKASSLFELIIFTASQEVYANKLLNMIDPNKHIKYRLYRDSCVC-------VD 457

Query: 1033 GDERVPKSKDLEGVLGME-SAVVIIDDS 1059
            G       KDL  +LG + S VVI+D+S
Sbjct: 458  G----TYLKDL-SILGRDLSQVVIVDNS 480


>gi|123404051|ref|XP_001302356.1| NLI interacting factor-like phosphatase family protein [Trichomonas
            vaginalis G3]
 gi|121883637|gb|EAX89426.1| NLI interacting factor-like phosphatase family protein [Trichomonas
            vaginalis G3]
          Length = 205

 Score = 44.7 bits (104), Expect = 0.30,   Method: Composition-based stats.
 Identities = 27/102 (26%), Positives = 51/102 (50%), Gaps = 18/102 (17%)

Query: 912  QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGM 971
            ++ + +  +  LVLDLD TL++++ F    P H ++                 +F     
Sbjct: 21   RRTIVTDSRKALVLDLDETLIHTSTF----PPHSDV--------------ESLKFDDSPD 62

Query: 972  WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
            +  LRP +  FL++ S+LFE+ ++T G + YA  +  +L P+
Sbjct: 63   YVFLRPNVRIFLDKVSELFEVFIFTAGTQNYAERILDLLCPQ 104


>gi|194866038|ref|XP_001971726.1| GG14268 [Drosophila erecta]
 gi|190653509|gb|EDV50752.1| GG14268 [Drosophila erecta]
          Length = 260

 Score = 44.7 bits (104), Expect = 0.31,   Method: Composition-based stats.
 Identities = 34/120 (28%), Positives = 60/120 (50%), Gaps = 19/120 (15%)

Query: 903  KERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKP--- 959
            ++R   + +++     RK  LVLD+D T++ S             L+K  ++ R KP   
Sbjct: 54   EDRLSPVSKRRLSLVGRK-TLVLDMDETMITSW------------LKKSGKKPRNKPRVA 100

Query: 960  HRHLFRFP--HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDP-KGVL 1016
            H   F  P     ++   RP +  FL+R SK +++ ++T G +LYA+ +   LD  +G+L
Sbjct: 101  HDFKFYLPAYEATIYVYKRPYLDHFLDRVSKWYDLTVFTAGAELYASPILDFLDRGRGIL 160


>gi|327260340|ref|XP_003214992.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
            small phosphatase 1-like [Anolis carolinensis]
          Length = 345

 Score = 44.7 bits (104), Expect = 0.31,   Method: Compositional matrix adjust.
 Identities = 34/121 (28%), Positives = 55/121 (45%), Gaps = 14/121 (11%)

Query: 896  QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 955
            ++  ++ K   R L  + K   A K+C+V+DLD TL++S+      PV++       E D
Sbjct: 150  EENGSVTKATVRYLLPEIKPQDANKICVVIDLDETLVHSS----FKPVNNADFIIPVEID 205

Query: 956  REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
                  ++ + PH+            FL R  +LFE  L+T     YA  +A +LD  G 
Sbjct: 206  GVMHQVYVLKRPHVD----------EFLRRMGELFECVLFTASLAKYADPVADLLDKWGA 255

Query: 1016 L 1016
             
Sbjct: 256  F 256


>gi|302806320|ref|XP_002984910.1| hypothetical protein SELMODRAFT_121210 [Selaginella moellendorffii]
 gi|300147496|gb|EFJ14160.1| hypothetical protein SELMODRAFT_121210 [Selaginella moellendorffii]
          Length = 198

 Score = 44.7 bits (104), Expect = 0.32,   Method: Compositional matrix adjust.
 Identities = 20/46 (43%), Positives = 29/46 (63%)

Query: 976  RPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
            RPG+ TFL   S+++E+ ++T   KLYA  +   LDP G LF  R+
Sbjct: 65   RPGVDTFLNEMSQIYEIVVFTRAVKLYADRILDRLDPAGNLFTHRL 110


>gi|159485684|ref|XP_001700874.1| hypothetical protein CHLREDRAFT_142839 [Chlamydomonas reinhardtii]
 gi|158281373|gb|EDP07128.1| predicted protein, partial [Chlamydomonas reinhardtii]
          Length = 418

 Score = 44.7 bits (104), Expect = 0.34,   Method: Compositional matrix adjust.
 Identities = 35/107 (32%), Positives = 54/107 (50%), Gaps = 9/107 (8%)

Query: 971  MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1030
            +W  LRPG+  FLE    +FE+ L+T   + +AT   + +DP G +F  R+       D 
Sbjct: 32   VW--LRPGLRRFLESVRPMFEVVLFTAAGESWATCAMQRIDPDGRIFDTRLYR-----DH 84

Query: 1031 FDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERY 1076
                +  P  KDL   LG + A VVI+DD+  ++ +   N + V  Y
Sbjct: 85   TVSHDDWPWVKDLSR-LGRDLARVVIVDDNPLMFMYQPDNALHVAPY 130


>gi|335298853|ref|XP_003132160.2| PREDICTED: CTD small phosphatase-like protein-like [Sus scrofa]
          Length = 265

 Score = 44.3 bits (103), Expect = 0.36,   Method: Compositional matrix adjust.
 Identities = 47/170 (27%), Positives = 73/170 (42%), Gaps = 33/170 (19%)

Query: 896  QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 955
            ++   +QK   + L  +  +    K C+V+DLD TL++S+      P+ +       E D
Sbjct: 71   EENGGLQKPPAKYLLPEVTVLDYGKKCVVIDLDETLVHSS----FKPISNADFIVPVEID 126

Query: 956  REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
                  ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  GV
Sbjct: 127  GTIHQVYVLKRPHVD----------EFLQRMGQLFECVLFTASLAKYADPVADLLDRWGV 176

Query: 1016 LFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 1059
              A       V  RG+              KDL   LG E S V+I+D+S
Sbjct: 177  FRARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIVDNS 213


>gi|340504501|gb|EGR30938.1| NLI interacting factor-like phosphatase family protein, putative
            [Ichthyophthirius multifiliis]
          Length = 230

 Score = 44.3 bits (103), Expect = 0.39,   Method: Composition-based stats.
 Identities = 40/157 (25%), Positives = 76/157 (48%), Gaps = 25/157 (15%)

Query: 923  LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTF 982
            L LDLD TL++S   +E +P  D IL+  E  +           P   +  ++RP    F
Sbjct: 49   LFLDLDETLIHSCSLNE-NP--DVILKVGEINE-----------PQFHIGFRIRPYCMDF 94

Query: 983  LERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDG----DERVP 1038
            L+   + ++++++T  +  Y+  +   LDP+     G +++R +  +  +G    D R+ 
Sbjct: 95   LKALVEYWDIYIFTASSSTYSNAIINYLDPERKYING-ILNRSNCMETKNGFFIKDLRIA 153

Query: 1039 KSKDLEGVLGME----SAVVIIDDSVRV--WPHNKLN 1069
            K KDL  ++ ++    S    ID+ + +  W HNK +
Sbjct: 154  KGKDLRKIILVDNLSHSFGFQIDNGIPILEWHHNKYD 190


>gi|384247176|gb|EIE20663.1| hypothetical protein COCSUDRAFT_30404 [Coccomyxa subellipsoidea
            C-169]
          Length = 243

 Score = 44.3 bits (103), Expect = 0.41,   Method: Composition-based stats.
 Identities = 51/173 (29%), Positives = 81/173 (46%), Gaps = 22/173 (12%)

Query: 905  RTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLF 964
            R RR    ++    ++  LVLDLD TL++S             L   +E D   P    F
Sbjct: 43   RWRRSLLPRQTRQCKRKTLVLDLDETLVHST------------LDGCDEPDFSFPVA--F 88

Query: 965  RFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISR 1024
                  +  + RP +  FL+R ++LFE+ ++T   K+YA ++  +LDP   L   RV   
Sbjct: 89   NGREHRVHVRRRPHLQHFLQRCAELFEVVVFTASQKVYAEQLLNILDPTRTLIRHRVFR- 147

Query: 1025 GDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERY 1076
             D     +G+      KDL  VLG + A  VI+D+S + + +   N I +E +
Sbjct: 148  -DSCVFVEGNYL----KDLS-VLGRDLAHTVIVDNSPQAFGYQLPNGIPIESW 194


>gi|395816723|ref|XP_003781843.1| PREDICTED: CTD small phosphatase-like protein isoform 1 [Otolemur
            garnettii]
          Length = 265

 Score = 44.3 bits (103), Expect = 0.41,   Method: Compositional matrix adjust.
 Identities = 47/170 (27%), Positives = 73/170 (42%), Gaps = 33/170 (19%)

Query: 896  QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 955
            ++   +QK   + L  +  +    K C+V+DLD TL++S+      P+ +       E D
Sbjct: 71   EENGGLQKPPAKYLLPEVTVLDYGKKCVVIDLDETLVHSS----FKPISNADFIVPVEID 126

Query: 956  REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
                  ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  GV
Sbjct: 127  GTIHQVYVLKRPHVD----------EFLQRMGQLFECVLFTASLAKYADPVADLLDRWGV 176

Query: 1016 LFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 1059
              A       V  RG+              KDL   LG E S V+I+D+S
Sbjct: 177  FRARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIVDNS 213


>gi|403266874|ref|XP_003925585.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
            small phosphatase 1 isoform 1 [Saimiri boliviensis
            boliviensis]
          Length = 262

 Score = 44.3 bits (103), Expect = 0.43,   Method: Compositional matrix adjust.
 Identities = 35/123 (28%), Positives = 57/123 (46%), Gaps = 14/123 (11%)

Query: 896  QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 955
            ++  AI K   + L  + K   + K+C+V+DLD TL++S+      PV++       E D
Sbjct: 67   EENGAIPKTPVQYLLPEAKAQDSDKICVVIDLDETLVHSS----FKPVNNADFIIPVEID 122

Query: 956  REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
                  ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  G 
Sbjct: 123  GVVHQVYVLKRPHVD----------EFLQRMGELFECVLFTASLAKYADPVADLLDKWGA 172

Query: 1016 LFA 1018
              A
Sbjct: 173  FRA 175


>gi|390464816|ref|XP_003733289.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
            small phosphatase 1 isoform 2 [Callithrix jacchus]
          Length = 260

 Score = 44.3 bits (103), Expect = 0.44,   Method: Compositional matrix adjust.
 Identities = 35/123 (28%), Positives = 57/123 (46%), Gaps = 14/123 (11%)

Query: 896  QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 955
            ++  AI K   + L  + K   + K+C+V+DLD TL++S+      PV++       E D
Sbjct: 65   EENGAIPKTPVQYLLPEAKAQDSDKICVVIDLDETLVHSS----FKPVNNADFIIPVEID 120

Query: 956  REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
                  ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  G 
Sbjct: 121  GVVHQVYVLKRPHVD----------EFLQRMGELFECVLFTASLAKYADPVADLLDKWGA 170

Query: 1016 LFA 1018
              A
Sbjct: 171  FRA 173


>gi|340501300|gb|EGR28100.1| NLI interacting factor-like phosphatase family protein, putative
            [Ichthyophthirius multifiliis]
          Length = 306

 Score = 43.9 bits (102), Expect = 0.46,   Method: Composition-based stats.
 Identities = 26/90 (28%), Positives = 47/90 (52%), Gaps = 12/90 (13%)

Query: 923  LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTF 982
            L LDLD TL++S + +E     +  ++ K  +D      +L +F       ++RP    F
Sbjct: 111  LFLDLDETLIHSCRINE-----NYNVQIKAFEDNNSQQEYLIQF-------RIRPYCMEF 158

Query: 983  LERASKLFEMHLYTMGNKLYATEMAKVLDP 1012
            L++ SK ++++L+T  +  YA  +   LDP
Sbjct: 159  LQKISKYWDIYLFTASSTTYANAIVNYLDP 188


>gi|402889397|ref|XP_003908003.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
            small phosphatase 1 isoform 2 [Papio anubis]
          Length = 260

 Score = 43.9 bits (102), Expect = 0.48,   Method: Compositional matrix adjust.
 Identities = 35/123 (28%), Positives = 57/123 (46%), Gaps = 14/123 (11%)

Query: 896  QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 955
            ++  AI K   + L  + K   + K+C+V+DLD TL++S+      PV++       E D
Sbjct: 65   EENGAIPKTPVQYLLPEAKAQDSDKICVVIDLDETLVHSS----FKPVNNADFIIPVEID 120

Query: 956  REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
                  ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  G 
Sbjct: 121  GVVHQVYVLKRPHVD----------EFLQRMGELFECVLFTASLAKYADPVADLLDKWGA 170

Query: 1016 LFA 1018
              A
Sbjct: 171  FRA 173


>gi|297597243|ref|NP_001043640.2| Os01g0629400 [Oryza sativa Japonica Group]
 gi|255673485|dbj|BAF05554.2| Os01g0629400, partial [Oryza sativa Japonica Group]
          Length = 177

 Score = 43.9 bits (102), Expect = 0.49,   Method: Composition-based stats.
 Identities = 28/105 (26%), Positives = 57/105 (54%), Gaps = 14/105 (13%)

Query: 963  LFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVI 1022
             F   +  ++ + RP +  FLE+ +++F++ ++T   ++YA ++   LDP G L + R+ 
Sbjct: 30   FFNMKNHTVYVRQRPHLKMFLEKVAQMFDLVIFTASQRIYAEQLIDRLDPDGRLISHRIY 89

Query: 1023 SRG---DDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVW 1063
                   +G           +KDL  +LG++ A VVI+D++ +V+
Sbjct: 90   RESCIFSEG---------CYTKDLT-ILGVDLAKVVIVDNTPQVF 124


>gi|359323950|ref|XP_003640241.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
            small phosphatase 1-like [Canis lupus familiaris]
          Length = 260

 Score = 43.9 bits (102), Expect = 0.49,   Method: Compositional matrix adjust.
 Identities = 35/123 (28%), Positives = 57/123 (46%), Gaps = 14/123 (11%)

Query: 896  QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 955
            ++  A+ K   + L  + K   A K+C+V+DLD TL++S+      PV++       E D
Sbjct: 65   EENGAVPKTPVQYLLPEAKAQDADKICVVIDLDETLVHSS----FKPVNNADFIIPVEID 120

Query: 956  REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
                  ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  G 
Sbjct: 121  GVVHQVYVLKRPHVD----------EFLQRMGELFECVLFTASLAKYADPVADLLDKWGA 170

Query: 1016 LFA 1018
              A
Sbjct: 171  FRA 173


>gi|145538816|ref|XP_001455108.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124422896|emb|CAK87711.1| unnamed protein product [Paramecium tetraurelia]
          Length = 282

 Score = 43.9 bits (102), Expect = 0.49,   Method: Composition-based stats.
 Identities = 40/132 (30%), Positives = 62/132 (46%), Gaps = 23/132 (17%)

Query: 889  LFEGYDDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEIL 948
            LF+   +  K   QK       +  K +S +K+ LVLDLD TL           VH E  
Sbjct: 6    LFKNCYEHFKGKFQKNYIS--AQTPKQYSQKKV-LVLDLDETL-----------VHCEF- 50

Query: 949  RKKEEQDREKPHRHLFRFPHMG----MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYAT 1004
              KE ++ +  H  L    H G    ++ K RP +  FL+ ASK +E+ ++T G + Y  
Sbjct: 51   --KENENFQ--HEVLLEVIHKGQLYTVYLKARPYLNQFLQEASKDYEIFIFTAGYEAYCQ 106

Query: 1005 EMAKVLDPKGVL 1016
            E+   +D K ++
Sbjct: 107  EVLSFIDKKKII 118


>gi|384250655|gb|EIE24134.1| hypothetical protein COCSUDRAFT_41430 [Coccomyxa subellipsoidea
            C-169]
          Length = 1029

 Score = 43.9 bits (102), Expect = 0.50,   Method: Compositional matrix adjust.
 Identities = 59/197 (29%), Positives = 85/197 (43%), Gaps = 45/197 (22%)

Query: 920  KLCLVLDLDHTLL-----NSAKFH------EVDP-----VHDEILRKKEEQDREKPHRHL 963
            +L LVLDLD TLL     N  + H      E+D      V  ++  K+E+  +E+ +  L
Sbjct: 120  RLPLVLDLDETLLEAFTANQLRKHIKDLSAEIDGGNWSNVEKKLQLKREKAFKEEDYNLL 179

Query: 964  FRFPHMGMWTKLRPGI---WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA-- 1018
             +F      T L   I   W   ER    FE+++ T  ++ YA E  + LDP  +L    
Sbjct: 180  VQFIQTNSVT-LNGQIHKAWPGRER----FEVYVCTTADRSYALEAWRHLDPSALLIPYA 234

Query: 1019 ---GRVISRGDDGDPFDGDERVPKSKDLEGVLGM------------ESAV---VIIDDSV 1060
                R  +   D D  D D  V   KDL  V+G+             SA+   VIIDD  
Sbjct: 235  DRRKRFHNVHQDKDSKDKDGNVKPVKDLAHVMGLLGHPWSAPCTPPNSAMPLAVIIDDQP 294

Query: 1061 RVW-PHNKLNLIVVERY 1076
             VW   ++  L  VE++
Sbjct: 295  AVWTAESQGQLYQVEKF 311


>gi|332308973|ref|NP_001193807.1| carboxy-terminal domain RNA polymerase II polypeptide A small
            phosphatase 1 isoform 3 [Homo sapiens]
 gi|397495664|ref|XP_003818667.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
            small phosphatase 1 isoform 2 [Pan paniscus]
 gi|410036206|ref|XP_003950023.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
            small phosphatase 1 [Pan troglodytes]
 gi|426338591|ref|XP_004033259.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
            small phosphatase 1 isoform 2 [Gorilla gorilla gorilla]
          Length = 260

 Score = 43.9 bits (102), Expect = 0.51,   Method: Compositional matrix adjust.
 Identities = 35/123 (28%), Positives = 57/123 (46%), Gaps = 14/123 (11%)

Query: 896  QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 955
            ++  AI K   + L  + K   + K+C+V+DLD TL++S+      PV++       E D
Sbjct: 65   EENGAIPKTPVQYLLPEAKAQDSDKICVVIDLDETLVHSS----FKPVNNADFIIPVEID 120

Query: 956  REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
                  ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  G 
Sbjct: 121  GVVHQVYVLKRPHVD----------EFLQRMGELFECVLFTASLAKYADPVADLLDKWGA 170

Query: 1016 LFA 1018
              A
Sbjct: 171  FRA 173


>gi|255087422|ref|XP_002505634.1| predicted protein [Micromonas sp. RCC299]
 gi|226520904|gb|ACO66892.1| predicted protein [Micromonas sp. RCC299]
          Length = 548

 Score = 43.9 bits (102), Expect = 0.52,   Method: Compositional matrix adjust.
 Identities = 33/109 (30%), Positives = 50/109 (45%), Gaps = 27/109 (24%)

Query: 920  KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP-------HMGMW 972
            K  LVLDLD TL           VH  + +  EE D        F FP       H+ + 
Sbjct: 354  KNTLVLDLDETL-----------VHSNLEQTIEEAD--------FSFPVTFNGQQHI-VN 393

Query: 973  TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
             + RP +  F+E A++ FE+ ++T   ++YA  +   +DP  VL   R+
Sbjct: 394  VRRRPYLTEFMEFAARHFEVVVFTASQRVYAERLLNKIDPNQVLIKHRL 442


>gi|209882797|ref|XP_002142834.1| NLI interacting factor-like phosphatase family protein
            [Cryptosporidium muris RN66]
 gi|209558440|gb|EEA08485.1| NLI interacting factor-like phosphatase family protein
            [Cryptosporidium muris RN66]
          Length = 536

 Score = 43.9 bits (102), Expect = 0.52,   Method: Compositional matrix adjust.
 Identities = 34/107 (31%), Positives = 52/107 (48%), Gaps = 15/107 (14%)

Query: 909  LEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPH 968
            LE QK  +  RK  LVLDLD TL++S+      P+         E + E  + ++ +   
Sbjct: 332  LEPQKPEYFGRKT-LVLDLDETLVHSS----FQPIRAASFVISVEIEYEMYNVYVLK--- 383

Query: 969  MGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
                   RPG+  FLE  S L+E+ ++T     YA  +   LDP+G+
Sbjct: 384  -------RPGVDKFLEVVSSLYEVVIFTASLSKYANPLLDKLDPRGL 423


>gi|363736290|ref|XP_003641697.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
            small phosphatase 1-like [Gallus gallus]
          Length = 275

 Score = 43.9 bits (102), Expect = 0.54,   Method: Compositional matrix adjust.
 Identities = 36/127 (28%), Positives = 58/127 (45%), Gaps = 15/127 (11%)

Query: 896  QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 955
            ++   + K   + L  + K   A KLC+V+DLD TL++S+      PV++       E D
Sbjct: 80   EENGTVPKAAVKHLLPEIKPQDASKLCVVIDLDETLVHSS----FKPVNNADFIIPVEID 135

Query: 956  REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
                  ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  G 
Sbjct: 136  GIMHQVYVLKRPHVD----------EFLQRMGELFECVLFTASLAKYADPVADLLDKWGA 185

Query: 1016 LFAGRVI 1022
             F  R+ 
Sbjct: 186  -FRARLF 191


>gi|145544070|ref|XP_001457720.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124425538|emb|CAK90323.1| unnamed protein product [Paramecium tetraurelia]
          Length = 659

 Score = 43.9 bits (102), Expect = 0.56,   Method: Compositional matrix adjust.
 Identities = 63/249 (25%), Positives = 92/249 (36%), Gaps = 44/249 (17%)

Query: 860  DDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKAAIQKERTRRLEEQKKMFSAR 919
            D  QT      E        +    +VE L   Y D     I KE    L+  KK    R
Sbjct: 193  DSSQTCNHLKIENNYCLICNEKVIRNVESLDLNYSDDISKKISKEIV--LDILKK----R 246

Query: 920  KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMG--------- 970
            KL +VLDLD T+L++ K       + E   K+ +  +         F  +G         
Sbjct: 247  KLIMVLDLDQTILHAIKVSTTFNKY-EFCEKQNKMIQADSEAQFNGFQQLGFNIKEHLLD 305

Query: 971  --------MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA---- 1018
                       KLRP    F      LF++ +YT  +K YA  +   +  +   F     
Sbjct: 306  MTCDQQSKFIIKLRPYFEQFFLTLIPLFDIFIYTKASKSYADFILSFITHRLNEFIPEHK 365

Query: 1019 -----GRVISRGDDGDPFDGDERVPKSKDLEGVL--GMES-AVVIIDDSVRVWPHNKLNL 1070
                  RV+SR D             SK L  +   G+ +  +VI+DD+  +W   K NL
Sbjct: 366  PFFPPQRVLSREDTI--------CSNSKSLNRLFYPGIATNLLVILDDNAGMWNQFKENL 417

Query: 1071 IVVERYTYF 1079
            I  + + YF
Sbjct: 418  IHTKPFVYF 426


>gi|146089360|ref|XP_001470364.1| nuclear lim interactor-interacting factor-like protein [Leishmania
            infantum JPCM5]
 gi|134070397|emb|CAM68734.1| nuclear lim interactor-interacting factor-like protein [Leishmania
            infantum JPCM5]
          Length = 290

 Score = 43.9 bits (102), Expect = 0.56,   Method: Compositional matrix adjust.
 Identities = 42/160 (26%), Positives = 70/160 (43%), Gaps = 18/160 (11%)

Query: 917  SARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLR 976
            S  K+ LVLD+D TL++S      D V+D++L    E                 +  K R
Sbjct: 109  SVPKVTLVLDVDETLVHSTFQPSSDVVYDKVLLVASEGK------------TYTVSVKYR 156

Query: 977  PGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDER 1036
            P +  FL   S+ FE+ ++T   + Y  ++   +DP G+L   R+     +   F     
Sbjct: 157  PYLEDFLRFVSRRFEVVIFTASMRAYCDKLMDEIDPHGILGNLRLFR---EHCTFCERSY 213

Query: 1037 VPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERY 1076
            V   KDL  +      VVI+D+S   +   + N I ++ +
Sbjct: 214  V---KDLHRLGRDLRRVVILDNSPAAYSFQQRNAIPIKTW 250


>gi|340376943|ref|XP_003386990.1| PREDICTED: hypothetical protein LOC100641299 [Amphimedon
            queenslandica]
          Length = 1244

 Score = 43.9 bits (102), Expect = 0.58,   Method: Compositional matrix adjust.
 Identities = 43/166 (25%), Positives = 73/166 (43%), Gaps = 16/166 (9%)

Query: 912  QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGM 971
            + ++FS R+  +VLDLD TL++S         HD  L    E   +   R       +  
Sbjct: 123  ETRLFSVRRKIMVLDLDETLIHSH--------HDNTLLPATEMLPDFYVRVYIENHPVKF 174

Query: 972  WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDP-KGVLFAGRVISRGDDGDP 1030
            +   RP +  FL   S+ +++ ++T   + Y  E+A  LD  KG+L   R   R D    
Sbjct: 175  YVYKRPHVDYFLSVVSQWYDLVIFTASMQKYGMEVANHLDQNKGIL--PRRYFRQDCTMD 232

Query: 1031 FDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERY 1076
             +G      +K+L  +    S + I+D+S   +  N  N I +  +
Sbjct: 233  MNG-----YTKNLSMISEDLSNIFILDNSPSAYRGNPDNAIPITSW 273


>gi|324518550|gb|ADY47137.1| CTD small phosphatase-like protein 2 [Ascaris suum]
          Length = 248

 Score = 43.9 bits (102), Expect = 0.59,   Method: Composition-based stats.
 Identities = 54/207 (26%), Positives = 93/207 (44%), Gaps = 33/207 (15%)

Query: 917  SARKLCLVLDLDHTLLNSAKFHEVD-----PVHDEILRKKEEQDREKPHRHLFRFPHMGM 971
            S  +  LVLDLD TL++ +     D     PVH                   F+     +
Sbjct: 60   STPEFALVLDLDETLVHCSLTELPDASLTFPVH-------------------FQDNTYQV 100

Query: 972  WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPF 1031
            + ++RP +  FLER S+ FE+ L+T   ++YA ++  +LDP   L   R+     +   F
Sbjct: 101  YVRVRPHLHEFLERLSQSFEIILFTASKRVYADKLLNLLDPGKRLIRHRLFR---EHCVF 157

Query: 1032 DGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLG 1090
                 +   KDL  +LG + S  +IID+S++ + +   N I +E + +F    ++   L 
Sbjct: 158  VYGNYI---KDLT-ILGRDLSKTIIIDNSLQSFAYQIDNGIPIESW-FFEQDDQELLKLI 212

Query: 1091 PSLLEIDHDERSEDGTLASSLGVRQQL 1117
            P L  I + +      L +   +R+ L
Sbjct: 213  PFLENITNQKSDVRTILRARYRIRELL 239


>gi|45184666|ref|NP_982384.1| AAL158Wp [Ashbya gossypii ATCC 10895]
 gi|44980012|gb|AAS50208.1| AAL158Wp [Ashbya gossypii ATCC 10895]
          Length = 478

 Score = 43.5 bits (101), Expect = 0.59,   Method: Compositional matrix adjust.
 Identities = 33/105 (31%), Positives = 50/105 (47%), Gaps = 25/105 (23%)

Query: 912  QKKMFSARKLCLVLDLDHTLLNSA--KFHEVD---PVHDEILRKKEEQDREKPHRHLFRF 966
            Q+  F  RK CLVLDLD TL++S+    H  D   PV         E D +  + ++ + 
Sbjct: 301  QRPEFRGRK-CLVLDLDETLVHSSFKYLHTADFVIPV---------EIDNQVHNVYVIK- 349

Query: 967  PHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLD 1011
                     RPG+  FL+R  +LFE+ ++T     Y   +  +LD
Sbjct: 350  ---------RPGVDEFLKRVGELFEVVVFTASVSRYGDPLLDILD 385


>gi|2289786|dbj|BAA21667.1| HYA22 [Homo sapiens]
          Length = 340

 Score = 43.5 bits (101), Expect = 0.61,   Method: Compositional matrix adjust.
 Identities = 47/170 (27%), Positives = 73/170 (42%), Gaps = 33/170 (19%)

Query: 896  QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 955
            ++   +QK   + L  +  +    K C+V+DLD TL++S+      P+ +       E D
Sbjct: 146  EENGGLQKPPAKYLLPEVTVLDYGKKCVVIDLDETLVHSS----FKPISNADFIVPVEID 201

Query: 956  REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
                  ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  GV
Sbjct: 202  GTIHQVYVLKRPHVD----------EFLQRMGQLFECVLFTASLAKYADPVADLLDRWGV 251

Query: 1016 LFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 1059
              A       V  RG+              KDL   LG E S V+I+D+S
Sbjct: 252  FRARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIVDNS 288


>gi|55740285|gb|AAV63944.1| nuclear LIM factor interactor-interacting protein [Phytophthora
            sojae]
 gi|348684603|gb|EGZ24418.1| hypothetical protein PHYSODRAFT_296510 [Phytophthora sojae]
          Length = 336

 Score = 43.5 bits (101), Expect = 0.63,   Method: Compositional matrix adjust.
 Identities = 31/97 (31%), Positives = 45/97 (46%), Gaps = 14/97 (14%)

Query: 920  KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 979
            K+CLVLDLD TL++S+      P  +       E D    H          ++   RPG 
Sbjct: 165  KMCLVLDLDETLVHSS----FRPTPNPDFVIPVEIDGTIHH----------VFVAKRPGA 210

Query: 980  WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 1016
              FL   +K +E+ +YT     YA  +   LDP+GV+
Sbjct: 211  EEFLVEMAKYYEIVIYTASLSKYADPLLDQLDPEGVI 247


>gi|55740291|gb|AAV63946.1| nuclear LIM factor interactor-interacting protein [Phytophthora
            sojae]
 gi|348684596|gb|EGZ24411.1| hypothetical protein PHYSODRAFT_325530 [Phytophthora sojae]
          Length = 336

 Score = 43.5 bits (101), Expect = 0.63,   Method: Compositional matrix adjust.
 Identities = 31/97 (31%), Positives = 45/97 (46%), Gaps = 14/97 (14%)

Query: 920  KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 979
            K+CLVLDLD TL++S+      P  +       E D    H          ++   RPG 
Sbjct: 165  KMCLVLDLDETLVHSS----FRPTPNPDFVIPVEIDGTIHH----------VFVAKRPGA 210

Query: 980  WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 1016
              FL   +K +E+ +YT     YA  +   LDP+GV+
Sbjct: 211  EEFLVEMAKYYEIVIYTASLSKYADPLLDQLDPEGVI 247


>gi|401841683|gb|EJT44034.1| PSR2-like protein [Saccharomyces kudriavzevii IFO 1802]
          Length = 392

 Score = 43.5 bits (101), Expect = 0.65,   Method: Compositional matrix adjust.
 Identities = 28/99 (28%), Positives = 49/99 (49%), Gaps = 16/99 (16%)

Query: 919  RKLCLVLDLDHTLLNSA-KFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRP 977
            +K CL+LDLD TL++S+ K+ +      + +   E  D+              ++   RP
Sbjct: 221  QKKCLILDLDETLVHSSFKYMQTA----DFVLPVEIDDQVH-----------NVYVIKRP 265

Query: 978  GIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 1016
            G+  FL R S+++E+ ++T     YA  +   LDP G +
Sbjct: 266  GVDEFLHRVSQVYEVVVFTASVSRYANPLLDTLDPNGTI 304


>gi|15220552|ref|NP_174271.1| haloacid dehalogenase-like hydrolase [Arabidopsis thaliana]
 gi|124301096|gb|ABN04800.1| At1g29780 [Arabidopsis thaliana]
 gi|332193007|gb|AEE31128.1| haloacid dehalogenase-like hydrolase [Arabidopsis thaliana]
          Length = 221

 Score = 43.5 bits (101), Expect = 0.65,   Method: Composition-based stats.
 Identities = 31/109 (28%), Positives = 57/109 (52%), Gaps = 13/109 (11%)

Query: 908  RLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP 967
            +LE+    ++  K  ++LDLD TL++ A  H     HD ++  K E++            
Sbjct: 37   KLEDPLTGYTNMKRTIILDLDETLVH-ATTHLPGVKHDFMVMVKMEREI----------- 84

Query: 968  HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 1016
             M ++   RPG+  FLER  + +++ ++T G + YA+++   LD  GV+
Sbjct: 85   -MPIFVVKRPGVTEFLERLGENYKVVVFTAGLEEYASQVLDKLDKNGVI 132


>gi|422294377|gb|EKU21677.1| nli interacting factor family protein [Nannochloropsis gaditana
            CCMP526]
          Length = 378

 Score = 43.5 bits (101), Expect = 0.67,   Method: Compositional matrix adjust.
 Identities = 40/150 (26%), Positives = 60/150 (40%), Gaps = 47/150 (31%)

Query: 909  LEEQKKMFSA----RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHL- 963
            ++ Q + FS      KL +VLD+D  LL+S +F E           ++E+ RE  H+ L 
Sbjct: 109  MQGQGRRFSVYRKREKLTVVLDMDECLLHS-RFEE---------DMRDERGRELAHQLLP 158

Query: 964  ------FRFPHMG----------------MW----------TKLRPGIWTFLERASKLFE 991
                  F + H                   W            LRPG+  FL+R S  + 
Sbjct: 159  NGDSESFHYQHQADVGEALGDVRHRSVDYFWLELEEGERVRVNLRPGVEAFLQRLSDEYN 218

Query: 992  MHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
            + ++T   + YA  +   LDP G L  GR 
Sbjct: 219  VFVFTAATETYARPVLDRLDPTGSLLDGRF 248


>gi|374105582|gb|AEY94493.1| FAAL158Wp [Ashbya gossypii FDAG1]
          Length = 478

 Score = 43.5 bits (101), Expect = 0.67,   Method: Compositional matrix adjust.
 Identities = 33/105 (31%), Positives = 50/105 (47%), Gaps = 25/105 (23%)

Query: 912  QKKMFSARKLCLVLDLDHTLLNSA--KFHEVD---PVHDEILRKKEEQDREKPHRHLFRF 966
            Q+  F  RK CLVLDLD TL++S+    H  D   PV         E D +  + ++ + 
Sbjct: 301  QRPEFRGRK-CLVLDLDETLVHSSFKYLHTADFVIPV---------EIDNQVHNVYVIK- 349

Query: 967  PHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLD 1011
                     RPG+  FL+R  +LFE+ ++T     Y   +  +LD
Sbjct: 350  ---------RPGVDEFLKRVGELFEVVVFTASVSRYGDPLLDILD 385


>gi|119591022|gb|EAW70616.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A) small
            phosphatase 1, isoform CRA_b [Homo sapiens]
 gi|119591023|gb|EAW70617.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A) small
            phosphatase 1, isoform CRA_b [Homo sapiens]
          Length = 255

 Score = 43.5 bits (101), Expect = 0.67,   Method: Compositional matrix adjust.
 Identities = 36/127 (28%), Positives = 59/127 (46%), Gaps = 15/127 (11%)

Query: 896  QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 955
            ++  AI K   + L  + K   + K+C+V+DLD TL++S+      PV++       E D
Sbjct: 60   EENGAIPKTPVQYLLPEAKAQDSDKICVVIDLDETLVHSS----FKPVNNADFIIPVEID 115

Query: 956  REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
                  ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  G 
Sbjct: 116  GVVHQVYVLKRPHVD----------EFLQRMGELFECVLFTASLAKYADPVADLLDKWGA 165

Query: 1016 LFAGRVI 1022
             F  R+ 
Sbjct: 166  -FRARLF 171


>gi|410224860|gb|JAA09649.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A) small
            phosphatase 1 [Pan troglodytes]
          Length = 260

 Score = 43.5 bits (101), Expect = 0.68,   Method: Compositional matrix adjust.
 Identities = 35/123 (28%), Positives = 57/123 (46%), Gaps = 14/123 (11%)

Query: 896  QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 955
            ++  AI K   + L  + K   + K+C+V+DLD TL++S+      PV++       E D
Sbjct: 65   EENGAIPKTPVQYLLPEAKAQDSDKICVVIDLDETLVHSS----FKPVNNADFIIPVEID 120

Query: 956  REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
                  ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  G 
Sbjct: 121  GVVHQVYVLKRPHVD----------EFLQRMGELFECVLFTASLAKYADPVADLLDKWGA 170

Query: 1016 LFA 1018
              A
Sbjct: 171  FRA 173


>gi|398016831|ref|XP_003861603.1| nuclear lim interactor-interacting factor-like protein [Leishmania
            donovani]
 gi|322499830|emb|CBZ34903.1| nuclear lim interactor-interacting factor-like protein [Leishmania
            donovani]
          Length = 290

 Score = 43.5 bits (101), Expect = 0.70,   Method: Compositional matrix adjust.
 Identities = 42/160 (26%), Positives = 70/160 (43%), Gaps = 18/160 (11%)

Query: 917  SARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLR 976
            S  K+ LVLD+D TL++S      D V+D++L    E                 +  K R
Sbjct: 109  SVPKVTLVLDVDETLVHSTFQPSSDVVYDKVLLVASEGK------------TYTVSVKYR 156

Query: 977  PGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDER 1036
            P +  FL   S+ FE+ ++T   + Y  ++   +DP G+L   R+     +   F     
Sbjct: 157  PYLEDFLRFVSRRFEVVIFTASMRAYCDKLMDEIDPHGILGNLRLFR---EHCTFCERSY 213

Query: 1037 VPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERY 1076
            V   KDL  +      VVI+D+S   +   + N I ++ +
Sbjct: 214  V---KDLHRLGRDLRRVVILDNSPAAYSFQQRNAIPIKTW 250


>gi|32813443|ref|NP_872580.1| carboxy-terminal domain RNA polymerase II polypeptide A small
            phosphatase 1 isoform 2 [Homo sapiens]
 gi|31074175|gb|AAP34397.1| small CTD phosphatase 1 [Homo sapiens]
 gi|410351181|gb|JAA42194.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A) small
            phosphatase 1 [Pan troglodytes]
          Length = 260

 Score = 43.5 bits (101), Expect = 0.71,   Method: Compositional matrix adjust.
 Identities = 35/123 (28%), Positives = 57/123 (46%), Gaps = 14/123 (11%)

Query: 896  QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 955
            ++  AI K   + L  + K   + K+C+V+DLD TL++S+      PV++       E D
Sbjct: 65   EENGAIPKTPVQYLLPEAKAQDSDKICVVIDLDETLVHSS----FKPVNNADFIIPVEID 120

Query: 956  REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
                  ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  G 
Sbjct: 121  GVVHQVYVLKRPHVD----------EFLQRMGELFECVLFTASLAKYADPVADLLDKWGA 170

Query: 1016 LFA 1018
              A
Sbjct: 171  FRA 173


>gi|302819617|ref|XP_002991478.1| hypothetical protein SELMODRAFT_133639 [Selaginella moellendorffii]
 gi|300140680|gb|EFJ07400.1| hypothetical protein SELMODRAFT_133639 [Selaginella moellendorffii]
          Length = 219

 Score = 43.5 bits (101), Expect = 0.72,   Method: Composition-based stats.
 Identities = 32/102 (31%), Positives = 47/102 (46%), Gaps = 17/102 (16%)

Query: 920  KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 979
            K  LVLD+D TL+++         H  I   K    +  P +              RPG+
Sbjct: 2    KPTLVLDMDETLIHA---------HKAIASLKLFSGKTLPLKRYL--------VAKRPGV 44

Query: 980  WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
             TFL+  SK++E+ ++T   K YA  +   LDP G LF  R+
Sbjct: 45   NTFLDEMSKIYEIVVFTRVVKPYADRILDRLDPVGNLFTHRL 86


>gi|145534239|ref|XP_001452864.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124420563|emb|CAK85467.1| unnamed protein product [Paramecium tetraurelia]
          Length = 414

 Score = 43.5 bits (101), Expect = 0.73,   Method: Compositional matrix adjust.
 Identities = 34/135 (25%), Positives = 64/135 (47%), Gaps = 16/135 (11%)

Query: 923  LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTF 982
            + LDLD TL+++    E   V    L+++ E   E           M +   +RP    F
Sbjct: 231  IFLDLDETLIHACHARETPSVK---LKQQNEDGSETDS--------MQVGINVRPYTGYF 279

Query: 983  LERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDG----DERVP 1038
            L+  ++ + +++YT  ++ YA  +   LDP     +G ++SR +  +  +G    D R+ 
Sbjct: 280  LQELAQYYTIYIYTASSQQYAQTIVNYLDPLKQYISG-ILSRSNCMETKNGFFIKDLRII 338

Query: 1039 KSKDLEGVLGMESAV 1053
            K  DL+  L +++ V
Sbjct: 339  KDLDLDRTLIVDNLV 353


>gi|395823467|ref|XP_003785008.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
            small phosphatase 1 [Otolemur garnettii]
          Length = 260

 Score = 43.5 bits (101), Expect = 0.74,   Method: Compositional matrix adjust.
 Identities = 35/123 (28%), Positives = 57/123 (46%), Gaps = 14/123 (11%)

Query: 896  QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 955
            ++  AI K   + L  + K   + K+C+V+DLD TL++S+      PV++       E D
Sbjct: 65   EENGAIPKTPVQYLLPEAKAQDSDKICVVIDLDETLVHSS----FKPVNNADFIIPVEID 120

Query: 956  REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
                  ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  G 
Sbjct: 121  GVVHQVYVLKRPHVD----------EFLQRMGELFECVLFTASLAKYADPVADLLDKWGA 170

Query: 1016 LFA 1018
              A
Sbjct: 171  FRA 173


>gi|345788882|ref|XP_851254.2| PREDICTED: CTD (carboxy-terminal domain, RNA polymerase II,
            polypeptide A) small phosphatase-like [Canis lupus
            familiaris]
          Length = 328

 Score = 43.5 bits (101), Expect = 0.74,   Method: Compositional matrix adjust.
 Identities = 48/172 (27%), Positives = 72/172 (41%), Gaps = 33/172 (19%)

Query: 894  DDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEE 953
            D +Q   I     + L  +  +    K C+V+DLD TL++S+      P+ +       E
Sbjct: 132  DQRQVIPIPSPPAKYLLPEVTVLDYGKKCVVIDLDETLVHSS----FKPISNADFIVPVE 187

Query: 954  QDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
             D      ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  
Sbjct: 188  IDGTIHQVYVLKRPHVD----------EFLQRMGQLFECVLFTASLAKYADPVADLLDRW 237

Query: 1014 GVLFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 1059
            GV  A       V  RG+              KDL   LG E S V+I+D+S
Sbjct: 238  GVFRARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIVDNS 276


>gi|365759524|gb|EHN01307.1| Psr2p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
          Length = 338

 Score = 43.5 bits (101), Expect = 0.75,   Method: Compositional matrix adjust.
 Identities = 28/99 (28%), Positives = 48/99 (48%), Gaps = 16/99 (16%)

Query: 919  RKLCLVLDLDHTLLNSA-KFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRP 977
             K CL+LDLD TL++S+ K+ +      + +   E  D+              ++   RP
Sbjct: 167  HKKCLILDLDETLVHSSFKYMQTA----DFVLPVEIDDQVH-----------NVYVIKRP 211

Query: 978  GIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 1016
            G+  FL R S+++E+ ++T     YA  +   LDP G +
Sbjct: 212  GVDEFLHRVSQVYEVVVFTASVSRYANPLLDTLDPNGTI 250


>gi|145483633|ref|XP_001427839.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124394922|emb|CAK60441.1| unnamed protein product [Paramecium tetraurelia]
          Length = 308

 Score = 43.5 bits (101), Expect = 0.75,   Method: Composition-based stats.
 Identities = 34/96 (35%), Positives = 54/96 (56%), Gaps = 15/96 (15%)

Query: 918  ARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRP 977
            ARKLC VLDLD TL++S    + D  +D +L    +         LF+     ++  +RP
Sbjct: 54   ARKLC-VLDLDETLVHSQ--FKGDNGYDFLLDIIVQS-------QLFK-----VFVTVRP 98

Query: 978  GIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
            G+ TFLE+ S+ F++ L+T   K YA  +  ++DP+
Sbjct: 99   GVETFLEQLSEHFDIVLWTASLKEYADPVIDIIDPQ 134


>gi|242089885|ref|XP_002440775.1| hypothetical protein SORBIDRAFT_09g006400 [Sorghum bicolor]
 gi|241946060|gb|EES19205.1| hypothetical protein SORBIDRAFT_09g006400 [Sorghum bicolor]
          Length = 319

 Score = 43.5 bits (101), Expect = 0.75,   Method: Composition-based stats.
 Identities = 46/143 (32%), Positives = 67/143 (46%), Gaps = 30/143 (20%)

Query: 923  LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRF-PHMG-----MWTKLR 976
            L LDLD TL++S    + DP                P R  F   P +G      +   R
Sbjct: 145  LFLDLDETLIHS----QTDP---------------PPSRFDFTVRPVIGGHAVTFYVVKR 185

Query: 977  PGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDER 1036
            PG+  FL  A+++F++ ++T G + YA+ +   LDP G +FA R + RG   D   GD R
Sbjct: 186  PGVEAFLRAAAEIFDVVVFTAGLQEYASLVLDRLDPDGEVFAHR-LYRGACRDA--GDGR 242

Query: 1037 VPKSKDLEGVLGMESAVVIIDDS 1059
            +   KDL          VI+DD+
Sbjct: 243  L--VKDLAATGRALDRAVIVDDN 263


>gi|403336757|gb|EJY67573.1| NLI interacting factor-like phosphatase family protein [Oxytricha
            trifallax]
          Length = 515

 Score = 43.1 bits (100), Expect = 0.78,   Method: Compositional matrix adjust.
 Identities = 24/82 (29%), Positives = 43/82 (52%), Gaps = 3/82 (3%)

Query: 975  LRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISR---GDDGDPF 1031
            +RP     LE  +K FE+ ++T  +K YA  +   +DP G L   R+          + +
Sbjct: 249  IRPYTQECLEFVNKYFEVVVFTASHKFYADVILDYIDPTGTLIQHRLYREHCIKTQDNVY 308

Query: 1032 DGDERVPKSKDLEGVLGMESAV 1053
              D RV K++DL+ ++ +++AV
Sbjct: 309  IKDLRVFKNRDLKDLIIVDNAV 330


>gi|354547373|emb|CCE44108.1| hypothetical protein CPAR2_503330 [Candida parapsilosis]
          Length = 373

 Score = 43.1 bits (100), Expect = 0.81,   Method: Compositional matrix adjust.
 Identities = 38/115 (33%), Positives = 51/115 (44%), Gaps = 33/115 (28%)

Query: 917  SARKLCLVLDLDHTL-------LNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
            +A K CLVLDLD TL       L SA F  V PV         E D +  H ++ +    
Sbjct: 200  TANKKCLVLDLDETLVHSSFKYLRSADF--VIPV---------EIDGQVHHVYVIK---- 244

Query: 970  GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISR 1024
                  RPG+  FLER  KL+E+ ++T     Y   +   LD     F+  V+ R
Sbjct: 245  ------RPGVDEFLERVGKLYEVVVFTASVSKYGDPLLNKLD-----FSQSVLHR 288


>gi|403333806|gb|EJY66027.1| hypothetical protein OXYTRI_13811 [Oxytricha trifallax]
          Length = 509

 Score = 43.1 bits (100), Expect = 0.84,   Method: Compositional matrix adjust.
 Identities = 39/168 (23%), Positives = 76/168 (45%), Gaps = 17/168 (10%)

Query: 920  KLCLVLDLDHTLLNSAKFHEVDPVH--DEILRKKEEQDREKP----HRHLFRFPHMGMWT 973
            K  LVLD+D TL++ +    ++P +   E++     QD  KP       ++    + ++ 
Sbjct: 297  KKTLVLDMDETLIHCS----LEPFYGYQEVIHVM--QDTYKPISPDSDLIYSQKSLQIYV 350

Query: 974  KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV-----ISRGDDG 1028
              RP +  FLE+ S  +E+ ++T  +K YA  +   +DP    F+ R+     +    + 
Sbjct: 351  AYRPYLIHFLEKVSSQYEVVVFTASDKSYADVILDKIDPYHKYFSYRLYRDSCLQVNINA 410

Query: 1029 DPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERY 1076
                  +     KDL  +    S  +I+D+S++ + +   N I +  Y
Sbjct: 411  KNSSSQQTTLFVKDLSALGRDLSQTIIVDNSIQAFGYQLSNGIPIPSY 458


>gi|301617231|ref|XP_002938048.1| PREDICTED: CTD small phosphatase-like protein-like [Xenopus
            (Silurana) tropicalis]
          Length = 276

 Score = 43.1 bits (100), Expect = 0.85,   Method: Compositional matrix adjust.
 Identities = 49/172 (28%), Positives = 73/172 (42%), Gaps = 33/172 (19%)

Query: 894  DDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEE 953
            D  Q   I     + L  + K+    K C+V+DLD TL++S+      P+++       E
Sbjct: 80   DQTQALTIPSPPAKYLLPELKVSDYGKKCVVIDLDETLVHSS----FKPINNADFIVPVE 135

Query: 954  QDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
             D      ++ + PH+            FL++  +LFE  L+T     YA  +A +LD  
Sbjct: 136  IDGTIHQVYVLKRPHVD----------EFLQKMGELFECVLFTASLAKYADPVADLLDRW 185

Query: 1014 GVLFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 1059
            GV  A       V  RG+              KDL   LG E S V+IID+S
Sbjct: 186  GVFNARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIIDNS 224


>gi|146185627|ref|XP_001032201.2| NLI interacting factor-like phosphatase family protein [Tetrahymena
            thermophila]
 gi|146142847|gb|EAR84538.2| NLI interacting factor-like phosphatase family protein [Tetrahymena
            thermophila SB210]
          Length = 446

 Score = 43.1 bits (100), Expect = 0.86,   Method: Compositional matrix adjust.
 Identities = 26/104 (25%), Positives = 58/104 (55%), Gaps = 14/104 (13%)

Query: 945  DEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYAT 1004
            D I++ K + D +  ++       +G+  ++RP    FL++ ++ ++++++T  +  YA+
Sbjct: 282  DHIIKAKADNDDKVGYQ-------IGL--RVRPYCLEFLQKLAQYWDIYIFTASSPTYAS 332

Query: 1005 EMAKVLDPKGVLFAGRVISRGDDGDPFDG----DERVPKSKDLE 1044
             + K LDP+G    G +++R +  +  +G    D R+ K KDL+
Sbjct: 333  AIVKFLDPEGKYING-ILNRSNCMETKNGFFIKDLRIVKGKDLK 375


>gi|159112651|ref|XP_001706554.1| Nuclear LIM interactor-interacting factor 1 [Giardia lamblia ATCC
            50803]
 gi|157434651|gb|EDO78880.1| Nuclear LIM interactor-interacting factor 1 [Giardia lamblia ATCC
            50803]
          Length = 432

 Score = 43.1 bits (100), Expect = 0.86,   Method: Compositional matrix adjust.
 Identities = 29/97 (29%), Positives = 49/97 (50%), Gaps = 8/97 (8%)

Query: 920  KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 979
            K  LVLDLD TL++S+ F++VD     I    E+   +    H        ++   RP +
Sbjct: 258  KKLLVLDLDETLVHSS-FNKVDNADMIIPLSIEDPVSKATISH-------QVYVYKRPYV 309

Query: 980  WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 1016
              FLE  +K +E+ ++T   ++Y   + + LDP G+ 
Sbjct: 310  DEFLETMAKYYELAIFTASLRVYCDAVMEKLDPNGLC 346


>gi|391344643|ref|XP_003746605.1| PREDICTED: phosphatase PSR1-like [Metaseiulus occidentalis]
          Length = 239

 Score = 43.1 bits (100), Expect = 0.87,   Method: Composition-based stats.
 Identities = 31/96 (32%), Positives = 48/96 (50%), Gaps = 12/96 (12%)

Query: 918  ARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRP 977
            A+K+ LVLDLD TL++    + + P +D     K  Q +            M ++  +RP
Sbjct: 45   AKKILLVLDLDETLIHGT--YCMPPKYDFRFELKLPQSKRV----------MNVYVLVRP 92

Query: 978  GIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
             +  FLE A K FE+  YT    +YA ++   +DPK
Sbjct: 93   YLQDFLEFAHKWFEVMAYTASLPIYADKILDEIDPK 128


>gi|410258922|gb|JAA17427.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A) small
            phosphatase 1 [Pan troglodytes]
 gi|410290720|gb|JAA23960.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A) small
            phosphatase 1 [Pan troglodytes]
          Length = 260

 Score = 43.1 bits (100), Expect = 0.88,   Method: Compositional matrix adjust.
 Identities = 35/123 (28%), Positives = 57/123 (46%), Gaps = 14/123 (11%)

Query: 896  QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 955
            ++  AI K   + L  + K   + K+C+V+DLD TL++S+      PV++       E D
Sbjct: 65   EENGAIPKTPVQYLLPEAKAQDSDKICVVIDLDETLVHSS----FKPVNNADFIIPVEID 120

Query: 956  REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
                  ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  G 
Sbjct: 121  GVVHQVYVLKRPHVD----------EFLQRMGELFECVLFTASLAKYADPVADLLDKWGA 170

Query: 1016 LFA 1018
              A
Sbjct: 171  FRA 173


>gi|449275333|gb|EMC84205.1| Carboxy-terminal domain RNA polymerase II polypeptide A small
            phosphatase 1, partial [Columba livia]
          Length = 230

 Score = 43.1 bits (100), Expect = 0.89,   Method: Compositional matrix adjust.
 Identities = 37/127 (29%), Positives = 58/127 (45%), Gaps = 15/127 (11%)

Query: 896  QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 955
            ++  A+ K   R L  + K   A  LC+V+DLD TL++S+      PV++       E D
Sbjct: 35   EENGALPKAAVRHLLPEIKPQDASNLCVVIDLDETLVHSS----FKPVNNADFIIPVEID 90

Query: 956  REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
                  ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  G 
Sbjct: 91   GIMHQVYVLKRPHVD----------EFLQRMGELFECVLFTASLAKYADPVADLLDKWGA 140

Query: 1016 LFAGRVI 1022
             F  R+ 
Sbjct: 141  -FRARLF 146


>gi|396479189|ref|XP_003840695.1| similar to phosphoprotein phosphatase [Leptosphaeria maculans JN3]
 gi|312217268|emb|CBX97216.1| similar to phosphoprotein phosphatase [Leptosphaeria maculans JN3]
          Length = 511

 Score = 43.1 bits (100), Expect = 0.91,   Method: Compositional matrix adjust.
 Identities = 50/189 (26%), Positives = 81/189 (42%), Gaps = 45/189 (23%)

Query: 845  QIKSGADMKAV-VTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKAAIQK 903
            QI    +MK V ++ HD  Q G     +A P   HP+     V+        +++ A+Q 
Sbjct: 263  QIDDDIEMKDVPLSTHDVHQEGEDQSTDAQP--DHPK-----VDLPPPPPLVERQHAVQS 315

Query: 904  ERT---RRLEEQKKM-------FSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEE 953
            + T      E QK +       F  +K CLVLDLD TL++S+          +IL     
Sbjct: 316  QVTDASEASEPQKYLLGPIAPRFKGKK-CLVLDLDETLVHSSF---------KIL----- 360

Query: 954  QDREKPHRHLFRFP------HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMA 1007
                  H+  F  P      +  ++   RPG+  F++R  +L+E+ ++T     Y   + 
Sbjct: 361  ------HQADFTIPVEIEGQYHNVYVIKRPGVDQFMKRVGELYEVVVFTASVSKYGDPLL 414

Query: 1008 KVLDPKGVL 1016
              LD  GV+
Sbjct: 415  DQLDIHGVV 423


>gi|387539470|gb|AFJ70362.1| CTD small phosphatase-like protein isoform 1 [Macaca mulatta]
          Length = 276

 Score = 43.1 bits (100), Expect = 0.97,   Method: Compositional matrix adjust.
 Identities = 48/172 (27%), Positives = 72/172 (41%), Gaps = 33/172 (19%)

Query: 894  DDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEE 953
            D +Q   I     + L  +  +    K C+V+DLD TL++S+      P+ +       E
Sbjct: 80   DQRQVIPIPSPPAKYLLPEVTVLDYGKKCVVIDLDETLVHSS----FKPISNADFIVPVE 135

Query: 954  QDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
             D      ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  
Sbjct: 136  IDGTIHQVYVLKRPHVD----------EFLQRMGQLFECVLFTASLAKYADPVADLLDRW 185

Query: 1014 GVLFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 1059
            GV  A       V  RG+              KDL   LG E S V+I+D+S
Sbjct: 186  GVFRARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIVDNS 224


>gi|410969412|ref|XP_003991189.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
            small phosphatase 1 [Felis catus]
          Length = 259

 Score = 42.7 bits (99), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 34/123 (27%), Positives = 56/123 (45%), Gaps = 14/123 (11%)

Query: 896  QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 955
            ++  A+ K   + L  + K     K+C+V+DLD TL++S+      PV++       E D
Sbjct: 64   EENGAVPKTPVQYLLPEAKAQDVDKICVVIDLDETLVHSS----FKPVNNADFIIPVEID 119

Query: 956  REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
                  ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  G 
Sbjct: 120  GVVHQVYVLKRPHVD----------EFLQRMGELFECVLFTASLAKYADPVADLLDKWGA 169

Query: 1016 LFA 1018
              A
Sbjct: 170  FRA 172


>gi|324525869|gb|ADY48608.1| Serine/threonine-protein phosphatase dullard, partial [Ascaris suum]
          Length = 257

 Score = 42.7 bits (99), Expect = 1.1,   Method: Composition-based stats.
 Identities = 46/161 (28%), Positives = 72/161 (44%), Gaps = 31/161 (19%)

Query: 919  RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD-------REKPHRHLFRFPHMGM 971
            R+  LVLDLD TL++S         HD I+R   +         R    RH  RF     
Sbjct: 73   RRKILVLDLDETLIHSH--------HDGIIRPMVKPGTPPDFILRVNIDRHPVRF----- 119

Query: 972  WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDP-KGVLFAGRVISRGDDGDP 1030
            +   RP +  FL   S+ F++ ++T   ++Y + +A  LD  KG+L   R   R      
Sbjct: 120  FVHCRPHVDYFLSMVSQWFDLVVFTASMEIYGSSVADKLDNGKGIL--QRRYFRQHCTMD 177

Query: 1031 FDGDERVPKSKDLEGVLGMESAVVIIDDS---VRVWPHNKL 1068
            + G      +KDL  +    S++ I+D+S    R +P N +
Sbjct: 178  YGG-----YTKDLSAIHADLSSIFILDNSPGAYRKFPQNAI 213


>gi|355746816|gb|EHH51430.1| hypothetical protein EGM_10796, partial [Macaca fascicularis]
          Length = 270

 Score = 42.7 bits (99), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 48/172 (27%), Positives = 72/172 (41%), Gaps = 33/172 (19%)

Query: 894  DDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEE 953
            D +Q   I     + L  +  +    K C+V+DLD TL++S+      P+ +       E
Sbjct: 74   DQRQVIPIPSPPAKYLLPEVTVLDYGKKCVVIDLDETLVHSS----FKPISNADFIVPVE 129

Query: 954  QDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
             D      ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  
Sbjct: 130  IDGTIHQVYVLKRPHVD----------EFLQRMGQLFECVLFTASLAKYADPVADLLDRW 179

Query: 1014 GVLFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 1059
            GV  A       V  RG+              KDL   LG E S V+I+D+S
Sbjct: 180  GVFRARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIVDNS 218


>gi|344238378|gb|EGV94481.1| CTD small phosphatase-like protein [Cricetulus griseus]
          Length = 239

 Score = 42.7 bits (99), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 47/170 (27%), Positives = 74/170 (43%), Gaps = 33/170 (19%)

Query: 896  QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 955
            ++ + +QK   + L  +  +    K C+V+DLD TL++S+      P+ +       E D
Sbjct: 45   EENSGLQKPPAKSLLPEVTVLDYGKKCVVIDLDETLVHSS----FKPISNADFIVPVEID 100

Query: 956  REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
                  ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  GV
Sbjct: 101  GTIHQVYVLKRPHVD----------EFLQRMGQLFECVLFTASLAKYADPVADLLDRWGV 150

Query: 1016 LFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 1059
              A       V  RG+              KDL   LG E S V+I+D+S
Sbjct: 151  FRARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIVDNS 187


>gi|300122949|emb|CBK23956.2| Tim50 [Blastocystis hominis]
          Length = 348

 Score = 42.7 bits (99), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 49/171 (28%), Positives = 74/171 (43%), Gaps = 35/171 (20%)

Query: 909  LEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPH 968
            L EQ      RK  LVLDLD TL++S  F   D     ++  + E D             
Sbjct: 163  LGEQSSANQGRKT-LVLDLDETLVHST-FQPTDDC-SYVIPVEIEGDL------------ 207

Query: 969  MGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV----LFAGRVISR 1024
              ++  LRPG   FL R S+++E+ +YT    +YA  +   +DP  +    LF    +  
Sbjct: 208  YNVYVYLRPGTTEFLRRMSEIYEVVVYTASLPVYADPLLDKIDPNNLISARLFRDHCVQS 267

Query: 1025 GDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDD---SVRVWPHNKLNLI 1071
            G               KDL G+LG    +VV+ID+   S +  P+N +  +
Sbjct: 268  GG-----------ILVKDL-GLLGRSLDSVVMIDNSAVSFQFQPNNGIECV 306


>gi|361067247|gb|AEW07935.1| Pinus taeda anonymous locus 0_14860_01 genomic sequence
 gi|383149610|gb|AFG56720.1| Pinus taeda anonymous locus 0_14860_01 genomic sequence
 gi|383149612|gb|AFG56721.1| Pinus taeda anonymous locus 0_14860_01 genomic sequence
 gi|383149614|gb|AFG56722.1| Pinus taeda anonymous locus 0_14860_01 genomic sequence
 gi|383149616|gb|AFG56723.1| Pinus taeda anonymous locus 0_14860_01 genomic sequence
 gi|383149618|gb|AFG56724.1| Pinus taeda anonymous locus 0_14860_01 genomic sequence
 gi|383149620|gb|AFG56725.1| Pinus taeda anonymous locus 0_14860_01 genomic sequence
 gi|383149622|gb|AFG56726.1| Pinus taeda anonymous locus 0_14860_01 genomic sequence
 gi|383149624|gb|AFG56727.1| Pinus taeda anonymous locus 0_14860_01 genomic sequence
 gi|383149628|gb|AFG56729.1| Pinus taeda anonymous locus 0_14860_01 genomic sequence
 gi|383149630|gb|AFG56730.1| Pinus taeda anonymous locus 0_14860_01 genomic sequence
 gi|383149632|gb|AFG56731.1| Pinus taeda anonymous locus 0_14860_01 genomic sequence
 gi|383149634|gb|AFG56732.1| Pinus taeda anonymous locus 0_14860_01 genomic sequence
 gi|383149636|gb|AFG56733.1| Pinus taeda anonymous locus 0_14860_01 genomic sequence
 gi|383149638|gb|AFG56734.1| Pinus taeda anonymous locus 0_14860_01 genomic sequence
          Length = 140

 Score = 42.7 bits (99), Expect = 1.2,   Method: Composition-based stats.
 Identities = 34/117 (29%), Positives = 56/117 (47%), Gaps = 14/117 (11%)

Query: 964  FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 1023
            F      ++ + RP +  F++R + +FE+ ++T    +YA ++  VLDPK  L   RV  
Sbjct: 18   FNLKEHTVYVRCRPHLQLFMDRVADMFEIIVFTASQSVYAEQLLNVLDPKRKLIRHRVYR 77

Query: 1024 RG---DDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERY 1076
                  +G+           KDL  VLG + A V IID+S + +     N I +E +
Sbjct: 78   ESCVFVEGNYL---------KDLT-VLGRDLAQVAIIDNSPQAFGFQVDNGIPIESW 124


>gi|300794122|ref|NP_001179369.1| carboxy-terminal domain RNA polymerase II polypeptide A small
            phosphatase 1 [Bos taurus]
 gi|296490317|tpg|DAA32430.1| TPA: CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
            small phosphatase 1-like [Bos taurus]
          Length = 260

 Score = 42.7 bits (99), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 35/127 (27%), Positives = 59/127 (46%), Gaps = 15/127 (11%)

Query: 896  QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 955
            ++  A+ K   + L  + K   + K+C+V+DLD TL++S+      PV++       E D
Sbjct: 65   EENGAVPKTPVQYLLPEAKAQDSDKICVVIDLDETLVHSS----FKPVNNADFIIPVEID 120

Query: 956  REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
                  ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  G 
Sbjct: 121  GVVHQVYVLKRPHVD----------EFLQRMGELFECVLFTASLAKYADPVADLLDKWGA 170

Query: 1016 LFAGRVI 1022
             F  R+ 
Sbjct: 171  -FRARLF 176


>gi|380815184|gb|AFE79466.1| carboxy-terminal domain RNA polymerase II polypeptide A small
            phosphatase 1 isoform 2 [Macaca mulatta]
 gi|383420375|gb|AFH33401.1| carboxy-terminal domain RNA polymerase II polypeptide A small
            phosphatase 1 isoform 2 [Macaca mulatta]
 gi|384948522|gb|AFI37866.1| carboxy-terminal domain RNA polymerase II polypeptide A small
            phosphatase 1 isoform 2 [Macaca mulatta]
          Length = 260

 Score = 42.7 bits (99), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 35/123 (28%), Positives = 56/123 (45%), Gaps = 14/123 (11%)

Query: 896  QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 955
            ++  AI K   + L    K   + K+C+V+DLD TL++S+      PV++       E D
Sbjct: 65   EENGAIPKTPVQYLLPAAKAQDSDKICVVIDLDETLVHSS----FKPVNNADFIIPVEID 120

Query: 956  REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
                  ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  G 
Sbjct: 121  GVVHQVYVLKRPHVD----------EFLQRMGELFECVLFTASLAKYADPVADLLDKWGA 170

Query: 1016 LFA 1018
              A
Sbjct: 171  FRA 173


>gi|291392229|ref|XP_002712521.1| PREDICTED: CTD (carboxy-terminal domain, RNA polymerase II,
            polypeptide A) small phosphatase 1 [Oryctolagus
            cuniculus]
          Length = 260

 Score = 42.7 bits (99), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 35/123 (28%), Positives = 56/123 (45%), Gaps = 14/123 (11%)

Query: 896  QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 955
            ++  AI K   + L  + K   + K+C+V+DLD TL++S+      PV +       E D
Sbjct: 65   EENGAIPKTPVQYLLPEAKAQDSDKICVVIDLDETLVHSS----FKPVSNADFIIPVEID 120

Query: 956  REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
                  ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  G 
Sbjct: 121  GVVHQVYVLKRPHVD----------EFLQRMGELFECVLFTASLAKYADPVADLLDKWGA 170

Query: 1016 LFA 1018
              A
Sbjct: 171  FRA 173


>gi|402860629|ref|XP_003894728.1| PREDICTED: CTD small phosphatase-like protein [Papio anubis]
          Length = 276

 Score = 42.7 bits (99), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 48/172 (27%), Positives = 72/172 (41%), Gaps = 33/172 (19%)

Query: 894  DDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEE 953
            D +Q   I     + L  +  +    K C+V+DLD TL++S+      P+ +       E
Sbjct: 80   DQRQVIPIPSPPAKYLLPEVTVLDYGKKCVVIDLDETLVHSS----FKPISNADFIVPVE 135

Query: 954  QDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
             D      ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  
Sbjct: 136  IDGTIHQVYVLKRPHVD----------EFLQRMGQLFECVLFTASLAKYADPVADLLDRW 185

Query: 1014 GVLFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 1059
            GV  A       V  RG+              KDL   LG E S V+I+D+S
Sbjct: 186  GVFRARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIVDNS 224


>gi|335298851|ref|XP_003358411.1| PREDICTED: CTD small phosphatase-like protein-like [Sus scrofa]
          Length = 276

 Score = 42.7 bits (99), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 48/172 (27%), Positives = 72/172 (41%), Gaps = 33/172 (19%)

Query: 894  DDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEE 953
            D +Q   I     + L  +  +    K C+V+DLD TL++S+      P+ +       E
Sbjct: 80   DQRQVIPIPSPPAKYLLPEVTVLDYGKKCVVIDLDETLVHSS----FKPISNADFIVPVE 135

Query: 954  QDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
             D      ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  
Sbjct: 136  IDGTIHQVYVLKRPHVD----------EFLQRMGQLFECVLFTASLAKYADPVADLLDRW 185

Query: 1014 GVLFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 1059
            GV  A       V  RG+              KDL   LG E S V+I+D+S
Sbjct: 186  GVFRARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIVDNS 224


>gi|302794308|ref|XP_002978918.1| hypothetical protein SELMODRAFT_418692 [Selaginella moellendorffii]
 gi|300153236|gb|EFJ19875.1| hypothetical protein SELMODRAFT_418692 [Selaginella moellendorffii]
          Length = 218

 Score = 42.7 bits (99), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 20/46 (43%), Positives = 29/46 (63%)

Query: 976  RPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
            RPG+  FL+  SK++E+ ++T   K YA  +   LDP G LFA R+
Sbjct: 81   RPGVDIFLDEMSKIYEIVVFTRAVKPYADRILDRLDPAGNLFAHRL 126


>gi|156848006|ref|XP_001646886.1| hypothetical protein Kpol_2002p100 [Vanderwaltozyma polyspora DSM
            70294]
 gi|156117567|gb|EDO19028.1| hypothetical protein Kpol_2002p100 [Vanderwaltozyma polyspora DSM
            70294]
          Length = 477

 Score = 42.4 bits (98), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 29/103 (28%), Positives = 51/103 (49%), Gaps = 15/103 (14%)

Query: 909  LEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPH 968
            L ++  +F  +K CLVLDLD TL++S+ F  +D     +    ++Q  +           
Sbjct: 297  LAKKDPVFKNKK-CLVLDLDETLVHSS-FKYIDTADFVLPVTIDDQTHQ----------- 343

Query: 969  MGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLD 1011
              ++   RPG+  FL+R  K+FE+ ++T     Y   +  +LD
Sbjct: 344  --VYVIKRPGVDEFLKRVGKIFEVVVFTASVSRYGDPLLDILD 384


>gi|56549681|ref|NP_005799.2| CTD small phosphatase-like protein isoform 2 [Homo sapiens]
 gi|31074181|gb|AAP34400.1| small CTD phosphatase 3 [Homo sapiens]
 gi|34392245|emb|CAE11804.1| RB serine phosphatase [Homo sapiens]
 gi|34596234|gb|AAQ76797.1| HYA22 [Homo sapiens]
 gi|187252491|gb|AAI66643.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A) small
            phosphatase-like [synthetic construct]
 gi|410228142|gb|JAA11290.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A) small
            phosphatase-like [Pan troglodytes]
 gi|410291072|gb|JAA24136.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A) small
            phosphatase-like [Pan troglodytes]
 gi|410334183|gb|JAA36038.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A) small
            phosphatase-like [Pan troglodytes]
          Length = 265

 Score = 42.4 bits (98), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 47/170 (27%), Positives = 73/170 (42%), Gaps = 33/170 (19%)

Query: 896  QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 955
            ++   +QK   + L  +  +    K C+V+DLD TL++S+      P+ +       E D
Sbjct: 71   EENGGLQKPPAKYLLPEVTVLDYGKKCVVIDLDETLVHSS----FKPISNADFIVPVEID 126

Query: 956  REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
                  ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  GV
Sbjct: 127  GTIHQVYVLKRPHVD----------EFLQRMGQLFECVLFTASLAKYADPVADLLDRWGV 176

Query: 1016 LFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 1059
              A       V  RG+              KDL   LG E S V+I+D+S
Sbjct: 177  FRARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIVDNS 213


>gi|383149626|gb|AFG56728.1| Pinus taeda anonymous locus 0_14860_01 genomic sequence
          Length = 140

 Score = 42.4 bits (98), Expect = 1.3,   Method: Composition-based stats.
 Identities = 18/58 (31%), Positives = 32/58 (55%)

Query: 964  FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
            F      ++ + RP +  F++R + +FE+ ++T    +YA ++  VLDPK  L   RV
Sbjct: 18   FNLKEHTVYVRCRPHLQLFMDRVADMFEIIVFTASQSVYAEQLLNVLDPKRKLIRHRV 75


>gi|350406069|ref|XP_003487644.1| PREDICTED: CTD nuclear envelope phosphatase 1 homolog [Bombus
            impatiens]
          Length = 243

 Score = 42.4 bits (98), Expect = 1.3,   Method: Composition-based stats.
 Identities = 44/156 (28%), Positives = 70/156 (44%), Gaps = 21/156 (13%)

Query: 919  RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD---REKPHRHLFRFPHMGMWTKL 975
            ++  LVLDLD TL++S      D V    +R     D   + K  RH  RF     +   
Sbjct: 59   KRKVLVLDLDETLIHSHH----DGVARPTVRFGTPPDFILKVKIDRHPVRF-----FVHK 109

Query: 976  RPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDE 1035
            RP +  FL+  S+ +E+ ++T   ++Y   +A+ LD        R I R          E
Sbjct: 110  RPHVDFFLDIVSQWYELVVFTASMEIYGAAVAEKLD------NNRGILRRRYYRQHCTPE 163

Query: 1036 RVPKSKDLEGVLGMESAVVIIDDS---VRVWPHNKL 1068
                +KDL  +    ++V I+D+S    R +PHN +
Sbjct: 164  MGSYTKDLSAICSDLASVFILDNSPGAYRAYPHNAI 199


>gi|328779252|ref|XP_391964.4| PREDICTED: CTD nuclear envelope phosphatase 1 homolog [Apis
            mellifera]
          Length = 233

 Score = 42.4 bits (98), Expect = 1.3,   Method: Composition-based stats.
 Identities = 44/156 (28%), Positives = 70/156 (44%), Gaps = 21/156 (13%)

Query: 919  RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD---REKPHRHLFRFPHMGMWTKL 975
            ++  LVLDLD TL++S      D V    +R     D   + K  RH  RF     +   
Sbjct: 56   KRKVLVLDLDETLIHSHH----DGVARPTVRFGTPPDFILKVKIDRHPVRF-----FVHK 106

Query: 976  RPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDE 1035
            RP +  FL+  S+ +E+ ++T   ++Y   +A+ LD        R I R          E
Sbjct: 107  RPHVDFFLDIVSQWYELVVFTASMEIYGAAVAEKLD------NNRGILRRRYYRQHCTPE 160

Query: 1036 RVPKSKDLEGVLGMESAVVIIDDS---VRVWPHNKL 1068
                +KDL  +    ++V I+D+S    R +PHN +
Sbjct: 161  MGSYTKDLSAICSDLASVFILDNSPGAYRAYPHNAI 196


>gi|302808549|ref|XP_002985969.1| hypothetical protein SELMODRAFT_122967 [Selaginella moellendorffii]
 gi|300146476|gb|EFJ13146.1| hypothetical protein SELMODRAFT_122967 [Selaginella moellendorffii]
          Length = 198

 Score = 42.4 bits (98), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 46/155 (29%), Positives = 69/155 (44%), Gaps = 22/155 (14%)

Query: 923  LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTF 982
            LVLD+D TL+++         H  I   K    +  P   L R+         RPG+ TF
Sbjct: 29   LVLDMDETLIHA---------HKAIASLKLFSGKTLP---LQRY-----LVAKRPGVDTF 71

Query: 983  LERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFD-GDERVPKSK 1041
            L   S+++E+ ++T   K YA  +   LDP G LF  R+    D   P + G  +V   K
Sbjct: 72   LNEMSEIYEIVVFTRAVKPYADRILDRLDPAGNLFTHRLYR--DSCSPKEVGGRKV--VK 127

Query: 1042 DLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERY 1076
            DL  +       VI+DD +  +     N IV+  +
Sbjct: 128  DLSRLGRDLRHTVIVDDKLESFCLQPSNGIVIRAF 162


>gi|335298855|ref|XP_003358412.1| PREDICTED: CTD small phosphatase-like protein-like [Sus scrofa]
          Length = 256

 Score = 42.4 bits (98), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 48/172 (27%), Positives = 72/172 (41%), Gaps = 33/172 (19%)

Query: 894  DDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEE 953
            D +Q   I     + L  +  +    K C+V+DLD TL++S+      P+ +       E
Sbjct: 60   DQRQVIPIPSPPAKYLLPEVTVLDYGKKCVVIDLDETLVHSS----FKPISNADFIVPVE 115

Query: 954  QDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
             D      ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  
Sbjct: 116  IDGTIHQVYVLKRPHVD----------EFLQRMGQLFECVLFTASLAKYADPVADLLDRW 165

Query: 1014 GVLFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 1059
            GV  A       V  RG+              KDL   LG E S V+I+D+S
Sbjct: 166  GVFRARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIVDNS 204


>gi|395816725|ref|XP_003781844.1| PREDICTED: CTD small phosphatase-like protein isoform 2 [Otolemur
            garnettii]
          Length = 276

 Score = 42.4 bits (98), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 48/172 (27%), Positives = 72/172 (41%), Gaps = 33/172 (19%)

Query: 894  DDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEE 953
            D +Q   I     + L  +  +    K C+V+DLD TL++S+      P+ +       E
Sbjct: 80   DQRQVIPIPSPPAKYLLPEVTVLDYGKKCVVIDLDETLVHSS----FKPISNADFIVPVE 135

Query: 954  QDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
             D      ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  
Sbjct: 136  IDGTIHQVYVLKRPHVD----------EFLQRMGQLFECVLFTASLAKYADPVADLLDRW 185

Query: 1014 GVLFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 1059
            GV  A       V  RG+              KDL   LG E S V+I+D+S
Sbjct: 186  GVFRARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIVDNS 224


>gi|302766621|ref|XP_002966731.1| hypothetical protein SELMODRAFT_85337 [Selaginella moellendorffii]
 gi|300166151|gb|EFJ32758.1| hypothetical protein SELMODRAFT_85337 [Selaginella moellendorffii]
          Length = 131

 Score = 42.4 bits (98), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 31/101 (30%), Positives = 50/101 (49%), Gaps = 8/101 (7%)

Query: 976  RPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDE 1035
            RP +  FLER +KLF++  +T   +  A  +  VLDP    F+ R+          D  +
Sbjct: 5    RPHLSKFLERMAKLFDVVAFTSRAQRRAETILDVLDPAKEFFSRRLY--------LDSCK 56

Query: 1036 RVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERY 1076
            +  K KDL  +    + V+I+DD+      N  NL++V R+
Sbjct: 57   KGGKVKDLAVLERPLNRVIIVDDTSSKCVLNPDNLVLVSRF 97


>gi|268565001|ref|XP_002639300.1| C. briggsae CBR-SCPL-1 protein [Caenorhabditis briggsae]
          Length = 484

 Score = 42.4 bits (98), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 33/105 (31%), Positives = 50/105 (47%), Gaps = 15/105 (14%)

Query: 918  ARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRP 977
            ++K CLV+DLD TL++S+      PV +       E D  +   ++ +          RP
Sbjct: 329  SKKKCLVIDLDETLVHSS----FKPVKNPDFVIPVEIDGVEHQVYVLK----------RP 374

Query: 978  GIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVI 1022
             +  FL R  + FE  L+T     YA  +A +LD K V F GR+ 
Sbjct: 375  YVDEFLARVGEHFECILFTASLAKYADPVADLLDKKKV-FRGRLF 418


>gi|296475139|tpg|DAA17254.1| TPA: small CTD phosphatase 3-like [Bos taurus]
          Length = 276

 Score = 42.4 bits (98), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 48/172 (27%), Positives = 72/172 (41%), Gaps = 33/172 (19%)

Query: 894  DDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEE 953
            D +Q   I     + L  +  +    K C+V+DLD TL++S+      P+ +       E
Sbjct: 80   DQRQIIPIPSPPAKYLLPEVTVLDYGKKCVVIDLDETLVHSS----FKPISNADFIVPVE 135

Query: 954  QDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
             D      ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  
Sbjct: 136  IDGTIHQVYVLKRPHVD----------EFLQRMGQLFECVLFTASLAKYADPVADLLDRW 185

Query: 1014 GVLFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 1059
            GV  A       V  RG+              KDL   LG E S V+I+D+S
Sbjct: 186  GVFRARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIVDNS 224


>gi|397610231|gb|EJK60724.1| hypothetical protein THAOC_18872 [Thalassiosira oceanica]
          Length = 231

 Score = 42.4 bits (98), Expect = 1.5,   Method: Composition-based stats.
 Identities = 47/166 (28%), Positives = 70/166 (42%), Gaps = 20/166 (12%)

Query: 921  LCLVLDLDHTLLNSAK-----FHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKL 975
            L +  DLDHT+L S        + VD      LR  ++ D + P    F        T L
Sbjct: 31   LDIFFDLDHTILCSISPLPISDNGVDNGVGRTLRWFDQIDDDFP----FEGNSPNTRTFL 86

Query: 976  RPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDE 1035
            RP     +   S +  +H+YT   K Y   + KV+DP+  LF G+V+ R D  D      
Sbjct: 87   RPLSTATIYFCSVIGRVHVYTAAQKSYTDNILKVIDPRRTLF-GQVLHRDDHPDI----- 140

Query: 1036 RVPKSKDL----EGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYT 1077
             V   KDL     G +G+  +V+  D      P    N +++  +T
Sbjct: 141  -VRNGKDLLFAGGGEVGLRRSVLFDDKFSNFVPQQYRNGVLIRPFT 185


>gi|302850140|ref|XP_002956598.1| hypothetical protein VOLCADRAFT_36131 [Volvox carteri f. nagariensis]
 gi|300258125|gb|EFJ42365.1| hypothetical protein VOLCADRAFT_36131 [Volvox carteri f. nagariensis]
          Length = 119

 Score = 42.4 bits (98), Expect = 1.5,   Method: Composition-based stats.
 Identities = 21/56 (37%), Positives = 31/56 (55%)

Query: 970  GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG 1025
            G++   RPG+  FLE  +   E+ ++T G + YA  +   LDP G LFA R+   G
Sbjct: 5    GVFVVERPGLQEFLEELASFAEVVIFTAGLEDYAKPIIDALDPSGKLFAHRIYREG 60


>gi|330688428|ref|NP_001180010.2| CTD small phosphatase-like protein [Bos taurus]
          Length = 276

 Score = 42.4 bits (98), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 48/172 (27%), Positives = 72/172 (41%), Gaps = 33/172 (19%)

Query: 894  DDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEE 953
            D +Q   I     + L  +  +    K C+V+DLD TL++S+      P+ +       E
Sbjct: 80   DQRQIIPIPSPPAKYLLPEVTVLDYGKKCVVIDLDETLVHSS----FKPISNADFIVPVE 135

Query: 954  QDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
             D      ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  
Sbjct: 136  IDGTIHQVYVLKRPHVD----------EFLQRMGQLFECVLFTASLAKYADPVADLLDRW 185

Query: 1014 GVLFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 1059
            GV  A       V  RG+              KDL   LG E S V+I+D+S
Sbjct: 186  GVFRARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIVDNS 224


>gi|431919455|gb|ELK17974.1| CTD small phosphatase-like protein [Pteropus alecto]
          Length = 309

 Score = 42.4 bits (98), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 48/172 (27%), Positives = 72/172 (41%), Gaps = 33/172 (19%)

Query: 894  DDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEE 953
            D +Q   I     + L  +  +    K C+V+DLD TL++S+      P+ +       E
Sbjct: 113  DQRQVIPIPSPPAKYLLPEVTVLDYGKKCVVIDLDETLVHSS----FKPISNADFIVPVE 168

Query: 954  QDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
             D      ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  
Sbjct: 169  IDGTIHQVYVLKRPHVD----------EFLQRMGQLFECVLFTASLAKYADPVADLLDRW 218

Query: 1014 GVLFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 1059
            GV  A       V  RG+              KDL   LG E S V+I+D+S
Sbjct: 219  GVFRARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIVDNS 257


>gi|297286147|ref|XP_001086442.2| PREDICTED: CTD small phosphatase-like protein-like [Macaca mulatta]
          Length = 260

 Score = 42.4 bits (98), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 47/170 (27%), Positives = 73/170 (42%), Gaps = 33/170 (19%)

Query: 896  QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 955
            ++   +QK   + L  +  +    K C+V+DLD TL++S+      P+ +       E D
Sbjct: 66   EENGGLQKPPAKYLLPEVTVLDYGKKCVVIDLDETLVHSS----FKPISNADFIVPVEID 121

Query: 956  REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
                  ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  GV
Sbjct: 122  GTIHQVYVLKRPHVD----------EFLQRMGQLFECVLFTASLAKYADPVADLLDRWGV 171

Query: 1016 LFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 1059
              A       V  RG+              KDL   LG E S V+I+D+S
Sbjct: 172  FRARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIVDNS 208


>gi|410971731|ref|XP_003992318.1| PREDICTED: CTD small phosphatase-like protein [Felis catus]
          Length = 307

 Score = 42.4 bits (98), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 48/172 (27%), Positives = 72/172 (41%), Gaps = 33/172 (19%)

Query: 894  DDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEE 953
            D +Q   I     + L  +  +    K C+V+DLD TL++S+      P+ +       E
Sbjct: 111  DQRQVIPIPSPPAKYLLPEATVLDYGKKCVVIDLDETLVHSS----FKPISNADFIVPVE 166

Query: 954  QDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
             D      ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  
Sbjct: 167  IDGTIHQVYVLKRPHVD----------EFLQRMGQLFECVLFTASLAKYADPVADLLDRW 216

Query: 1014 GVLFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 1059
            GV  A       V  RG+              KDL   LG E S V+I+D+S
Sbjct: 217  GVFRARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIVDNS 255


>gi|403333986|gb|EJY66132.1| Serine/threonine-protein phosphatase dullard [Oxytricha trifallax]
          Length = 913

 Score = 42.4 bits (98), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 53/188 (28%), Positives = 84/188 (44%), Gaps = 21/188 (11%)

Query: 912  QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGM 971
            Q+K ++  K  LVLDLD TL++S+      P  D +L  + E +               +
Sbjct: 113  QQKPWATGKKTLVLDLDETLVHSSFKPPAKP--DIVLPVEIEGNV------------CNV 158

Query: 972  WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPF 1031
            +  +RPG   FL+R +K +E+ +YT     YA  +  +LD K        + R +    F
Sbjct: 159  FVLIRPGTEFFLQRLAKCYEIVIYTASLSKYADPLIDILDNKTQKIIDYRLFR-EHCTFF 217

Query: 1032 DGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGL--L 1089
             G     K   L G L  +S  +IID+S   +  ++ N + +  +   P  R  F L  L
Sbjct: 218  QG--VFIKDMSLPGRLLQDS--IIIDNSPTSYAFHQENALPILSWYDDPKDRCLFELIPL 273

Query: 1090 GPSLLEID 1097
              SL E+D
Sbjct: 274  LESLAEVD 281


>gi|338714770|ref|XP_001489080.3| PREDICTED: CTD small phosphatase-like protein-like [Equus caballus]
          Length = 268

 Score = 42.4 bits (98), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 48/172 (27%), Positives = 72/172 (41%), Gaps = 33/172 (19%)

Query: 894  DDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEE 953
            D +Q   I     + L  +  +    K C+V+DLD TL++S+      P+ +       E
Sbjct: 72   DQRQVIPIPSPPAKYLLPEVTVLDYGKKCVVIDLDETLVHSS----FKPISNADFIVPVE 127

Query: 954  QDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
             D      ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  
Sbjct: 128  IDGTIHQVYVLKRPHVD----------EFLQRMGQLFECVLFTASLAKYADPVADLLDRW 177

Query: 1014 GVLFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 1059
            GV  A       V  RG+              KDL   LG E S V+I+D+S
Sbjct: 178  GVFRARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIVDNS 216


>gi|413948975|gb|AFW81624.1| hypothetical protein ZEAMMB73_313891 [Zea mays]
          Length = 322

 Score = 42.4 bits (98), Expect = 1.7,   Method: Composition-based stats.
 Identities = 46/143 (32%), Positives = 61/143 (42%), Gaps = 30/143 (20%)

Query: 923  LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRF-PHMG-----MWTKLR 976
            L LDLD TL++S                   Q    P R  F   P +G      +   R
Sbjct: 148  LFLDLDETLIHS-------------------QTEPPPSRFDFTVRPVIGGHAVTFYVVKR 188

Query: 977  PGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDER 1036
            PG+  FL  A+  FE+ ++T G + YA+ +   LDP G +FA R + RG   D  DG   
Sbjct: 189  PGVEAFLRAAADAFEVVVFTAGLQEYASLVLDRLDPDGEVFAHR-LYRGACRDAGDGT-- 245

Query: 1037 VPKSKDLEGVLGMESAVVIIDDS 1059
                KDL          VIIDD+
Sbjct: 246  --LVKDLAATGRALDRAVIIDDN 266


>gi|355750837|gb|EHH55164.1| hypothetical protein EGM_04316, partial [Macaca fascicularis]
          Length = 237

 Score = 42.0 bits (97), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 36/127 (28%), Positives = 58/127 (45%), Gaps = 15/127 (11%)

Query: 896  QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 955
            ++  AI K   + L    K   + K+C+V+DLD TL++S+      PV++       E D
Sbjct: 42   EENGAIPKTPVQYLLPAAKAQDSDKICVVIDLDETLVHSS----FKPVNNADFIIPVEID 97

Query: 956  REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
                  ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  G 
Sbjct: 98   GVVHQVYVLKRPHVD----------EFLQRMGELFECVLFTASLAKYADPVADLLDKWGA 147

Query: 1016 LFAGRVI 1022
             F  R+ 
Sbjct: 148  -FRARLF 153


>gi|302792499|ref|XP_002978015.1| hypothetical protein SELMODRAFT_108059 [Selaginella moellendorffii]
 gi|300154036|gb|EFJ20672.1| hypothetical protein SELMODRAFT_108059 [Selaginella moellendorffii]
          Length = 131

 Score = 42.0 bits (97), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 31/101 (30%), Positives = 50/101 (49%), Gaps = 8/101 (7%)

Query: 976  RPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDE 1035
            RP +  FLER +KLF++  +T   +  A  +  VLDP    F+ R+          D  +
Sbjct: 5    RPHLGKFLERMAKLFDVVAFTSRAQRRAETILDVLDPAKEFFSRRLY--------LDSCK 56

Query: 1036 RVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERY 1076
            +  K KDL  +    + V+I+DD+      N  NL++V R+
Sbjct: 57   KGGKVKDLAVLERPLNRVIIVDDTSSKCVLNPDNLVLVSRF 97


>gi|148232046|ref|NP_001084286.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A) small
            phosphatase-like [Xenopus laevis]
 gi|32396218|gb|AAP43959.1| NIF [Xenopus laevis]
 gi|114107822|gb|AAI23152.1| NIF protein [Xenopus laevis]
          Length = 276

 Score = 42.0 bits (97), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 49/172 (28%), Positives = 74/172 (43%), Gaps = 33/172 (19%)

Query: 894  DDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEE 953
            D  Q   I    T+ L  + K+    K C+V+DLD TL++S+      P+++       E
Sbjct: 80   DQTQALTIPSPPTKYLLPELKVSEYGKKCVVIDLDETLVHSS----FKPINNADFIVPVE 135

Query: 954  QDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
             D      ++ + PH+            FL++  ++FE  L+T     YA  +A +LD  
Sbjct: 136  IDGTIHQVYVLKRPHVD----------EFLQKMGEMFECVLFTASLAKYADPVADLLDRW 185

Query: 1014 GVLFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 1059
            GV  A       V  RG+              KDL   LG E S V+IID+S
Sbjct: 186  GVFNARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIIDNS 224


>gi|145527362|ref|XP_001449481.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124417069|emb|CAK82084.1| unnamed protein product [Paramecium tetraurelia]
          Length = 249

 Score = 42.0 bits (97), Expect = 1.8,   Method: Composition-based stats.
 Identities = 53/184 (28%), Positives = 76/184 (41%), Gaps = 32/184 (17%)

Query: 893  YDDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKE 952
            +DD+ K  I  ++T +           +  LVLDLD TL++S    E     DE +  K 
Sbjct: 55   FDDECKDKITAKKTEK-----------EFTLVLDLDETLIHSDM--ERTSFLDEEILVKI 101

Query: 953  EQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDP 1012
                EK             + K+RP    FL+  S  FE+ ++T   K YA ++   LDP
Sbjct: 102  GNTIEK------------YYVKIRPFARDFLKALSNYFELVIFTAAIKEYADKVIDYLDP 149

Query: 1013 KGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIV 1072
             G  F  R   R D     DG       KDL  V        IID+S+     N  N I+
Sbjct: 150  SG--FIKRRFYR-DSCTKKDG----VFYKDLTKVNSNLDKTFIIDNSLSGMSLNPQNGIL 202

Query: 1073 VERY 1076
            ++ +
Sbjct: 203  IKSW 206


>gi|302808547|ref|XP_002985968.1| hypothetical protein SELMODRAFT_5469 [Selaginella moellendorffii]
 gi|300146475|gb|EFJ13145.1| hypothetical protein SELMODRAFT_5469 [Selaginella moellendorffii]
          Length = 134

 Score = 42.0 bits (97), Expect = 1.8,   Method: Composition-based stats.
 Identities = 46/156 (29%), Positives = 68/156 (43%), Gaps = 24/156 (15%)

Query: 923  LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKP-HRHLFRFPHMGMWTKLRPGIWT 981
            LVLD+D TL+++         H  I   K    +  P  R+L            RPG+ T
Sbjct: 1    LVLDMDETLIHA---------HKAIASLKLFSGKTLPLQRYL---------VAKRPGVDT 42

Query: 982  FLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFD-GDERVPKS 1040
            FL   S+++E+ ++T   K YA  +   LDP G LF  R+    D   P + G  +V   
Sbjct: 43   FLNEMSEIYEIVVFTRAVKPYADRILDRLDPAGNLFTHRLYR--DSCSPKEVGGRKV--V 98

Query: 1041 KDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERY 1076
            KDL  +       VI+DD    +     N IV+  +
Sbjct: 99   KDLSRLGRDLRHTVIVDDKPESFCLQPSNGIVIRAF 134


>gi|380011615|ref|XP_003689895.1| PREDICTED: LOW QUALITY PROTEIN: CTD nuclear envelope phosphatase 1
            homolog [Apis florea]
          Length = 233

 Score = 42.0 bits (97), Expect = 1.8,   Method: Composition-based stats.
 Identities = 44/152 (28%), Positives = 68/152 (44%), Gaps = 21/152 (13%)

Query: 923  LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD---REKPHRHLFRFPHMGMWTKLRPGI 979
            LVLDLD TL++S      D V    +R     D   + K  RH  RF     +   RP +
Sbjct: 60   LVLDLDETLIHSHH----DGVARPTVRFGTPPDFILKVKIDRHPVRF-----FVHKRPHV 110

Query: 980  WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPK 1039
              FL+  S+ +E+ ++T   ++Y   +A+ LD        R I R          E    
Sbjct: 111  DFFLDIVSQWYELVVFTASMEIYGAAVAEKLD------NNRGILRRRYYRQHCTPEMGSY 164

Query: 1040 SKDLEGVLGMESAVVIIDDS---VRVWPHNKL 1068
            +KDL  +    ++V I+D+S    R +PHN +
Sbjct: 165  TKDLSAICSDLASVFILDNSPGAYRAYPHNAI 196


>gi|195587438|ref|XP_002083469.1| GD13336 [Drosophila simulans]
 gi|194195478|gb|EDX09054.1| GD13336 [Drosophila simulans]
          Length = 253

 Score = 42.0 bits (97), Expect = 2.1,   Method: Composition-based stats.
 Identities = 37/142 (26%), Positives = 70/142 (49%), Gaps = 22/142 (15%)

Query: 883  WGDVEHLFEGYDDQQKAAIQKE-RTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVD 941
            W  +E ++  + +      Q E R   + + +    ARK  LVLD+D+T++ S       
Sbjct: 33   WFFLERVYRDFMEYTPIIYQSEDRLSPVSKSRLSLVARK-TLVLDMDNTMITSW------ 85

Query: 942  PVHDEILRKKEEQDREKPH-RHLFRFPHMG-----MWTKLRPGIWTFLERASKLFEMHLY 995
                    K+ ++   KP   H F+F ++G     ++   RP +  FL+R SK +++ ++
Sbjct: 86   ------FIKRGKKPENKPRIAHDFKF-YLGAYGATIYVYKRPYLDHFLDRVSKWYDLTVF 138

Query: 996  TMGNKLYATEMAKVLDP-KGVL 1016
            T G ++YA+ +   LD  +G+L
Sbjct: 139  TSGAEIYASPILDFLDRGRGIL 160


>gi|92399505|gb|ABE76501.1| SCP3-like protein [Mustela putorius furo]
          Length = 276

 Score = 42.0 bits (97), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 48/172 (27%), Positives = 72/172 (41%), Gaps = 33/172 (19%)

Query: 894  DDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEE 953
            D +Q   I     + L  +  +    K C+V+DLD TL++S+      P+ +       E
Sbjct: 80   DQRQVIPIPSPPAKYLLPEVTVLDYGKKCVVIDLDETLVHSS----FKPISNADFIVPVE 135

Query: 954  QDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
             D      ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  
Sbjct: 136  IDGTIHQVYVLKRPHVD----------EFLQRMGQLFECVLFTASLAKYADPVADLLDRW 185

Query: 1014 GVLFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 1059
            GV  A       V  RG+              KDL   LG E S V+I+D+S
Sbjct: 186  GVFRARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIVDNS 224


>gi|355559784|gb|EHH16512.1| hypothetical protein EGK_11800, partial [Macaca mulatta]
          Length = 250

 Score = 42.0 bits (97), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 48/172 (27%), Positives = 72/172 (41%), Gaps = 33/172 (19%)

Query: 894  DDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEE 953
            D +Q   I     + L  +  +    K C+V+DLD TL++S+      P+ +       E
Sbjct: 54   DQRQVIPIPSPPAKYLLPEVTVLDYGKKCVVIDLDETLVHSS----FKPISNADFIVPVE 109

Query: 954  QDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
             D      ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  
Sbjct: 110  IDGTIHQVYVLKRPHVD----------EFLQRMGQLFECVLFTASLAKYADPVADLLDRW 159

Query: 1014 GVLFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 1059
            GV  A       V  RG+              KDL   LG E S V+I+D+S
Sbjct: 160  GVFRARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIVDNS 198


>gi|145549970|ref|XP_001460664.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124428494|emb|CAK93267.1| unnamed protein product [Paramecium tetraurelia]
          Length = 346

 Score = 42.0 bits (97), Expect = 2.2,   Method: Composition-based stats.
 Identities = 31/101 (30%), Positives = 48/101 (47%), Gaps = 21/101 (20%)

Query: 919  RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPG 978
            +K  + +DLD TL           VH E L         KP+R  + F ++   T +RP 
Sbjct: 161  KKYSIAIDLDETL-----------VHSEEL---------KPNRR-YDFQNLQFGTFIRPY 199

Query: 979  IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAG 1019
               FL+  +K   + ++T  N  YAT + ++LDP+  LF G
Sbjct: 200  CLQFLQLLNKHANLFVFTSSNIKYATTIMQILDPQKDLFQG 240


>gi|67588036|ref|XP_665317.1| hypothetical protein [Cryptosporidium hominis TU502]
 gi|54655944|gb|EAL35087.1| hypothetical protein Chro.80553 [Cryptosporidium hominis]
          Length = 364

 Score = 42.0 bits (97), Expect = 2.2,   Method: Composition-based stats.
 Identities = 32/111 (28%), Positives = 55/111 (49%), Gaps = 9/111 (8%)

Query: 972  WTKLRPGIWTFLERASK-LFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 1030
            + KLRPG+   L   SK  +E+++YTMG + +A    ++LDP+   F  + I   ++G  
Sbjct: 181  YYKLRPGVINMLRTLSKDKYEIYMYTMGTEYHAYTSLRILDPELRFFHSKRIFYRNNG-- 238

Query: 1031 FDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYTYFP 1080
                 +    K L  +   +   +VI+DD  + W     +L+ V  Y +FP
Sbjct: 239  ----FKETSIKSLNTLFPYDHRTLVILDDIEQAWTDIN-SLLKVYPYNFFP 284


>gi|145533244|ref|XP_001452372.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124420060|emb|CAK84975.1| unnamed protein product [Paramecium tetraurelia]
          Length = 250

 Score = 42.0 bits (97), Expect = 2.2,   Method: Composition-based stats.
 Identities = 31/98 (31%), Positives = 50/98 (51%), Gaps = 14/98 (14%)

Query: 919  RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPG 978
            ++  LVLDLD TL++S    E   + DE +  K  ++ EK             + K+RP 
Sbjct: 71   KEFTLVLDLDETLIHSDL--ERTSILDEEIIVKIGENIEK------------YYIKVRPY 116

Query: 979  IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 1016
               FL+  S+LF++ ++T   K YA ++   LDP G +
Sbjct: 117  AREFLQSLSQLFDLVIFTAALKEYADKVIDFLDPCGFI 154


>gi|322779024|gb|EFZ09423.1| hypothetical protein SINV_01392 [Solenopsis invicta]
          Length = 216

 Score = 42.0 bits (97), Expect = 2.2,   Method: Composition-based stats.
 Identities = 49/169 (28%), Positives = 73/169 (43%), Gaps = 22/169 (13%)

Query: 906  TRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD---REKPHRH 962
            +R L E       RK+ LVLDLD TL++S      D V    +R     D   +    RH
Sbjct: 20   SRYLNENVPGIVKRKV-LVLDLDETLIHSHH----DGVARPTVRPGTPPDFVLKVTIDRH 74

Query: 963  LFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVI 1022
              RF     +   RP +  FL+  S+ +E+ ++T   ++Y   +A  LD        R I
Sbjct: 75   PVRF-----FVHKRPHVDFFLDIVSQWYELVVFTASMEIYGAAVADKLD------NNRGI 123

Query: 1023 SRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDS---VRVWPHNKL 1068
             R          E    +KDL  +    S+V I+D+S    R +PHN +
Sbjct: 124  LRRRYYRQHCTPEMGSYTKDLSAICSDLSSVFILDNSPGAYRAYPHNAI 172


>gi|390476487|ref|XP_002759760.2| PREDICTED: CTD small phosphatase-like protein-like [Callithrix
            jacchus]
          Length = 451

 Score = 42.0 bits (97), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 48/172 (27%), Positives = 72/172 (41%), Gaps = 33/172 (19%)

Query: 894  DDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEE 953
            D +Q   I     + L  +  +    K C+V+DLD TL++S+      P+ +       E
Sbjct: 255  DQRQVIPIPSPPAKYLLPEVTVLDYGKKCVVIDLDETLVHSS----FKPISNADFIVPVE 310

Query: 954  QDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
             D      ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  
Sbjct: 311  IDGTIHQVYVLKRPHVD----------EFLQRMGQLFECVLFTASLAKYADPVADLLDRW 360

Query: 1014 GVLFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 1059
            GV  A       V  RG+              KDL   LG E S V+I+D+S
Sbjct: 361  GVFRARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIVDNS 399


>gi|440900046|gb|ELR51261.1| CTD small phosphatase-like protein, partial [Bos grunniens mutus]
          Length = 250

 Score = 41.6 bits (96), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 48/172 (27%), Positives = 72/172 (41%), Gaps = 33/172 (19%)

Query: 894  DDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEE 953
            D +Q   I     + L  +  +    K C+V+DLD TL++S+      P+ +       E
Sbjct: 54   DQRQIIPIPSPPAKYLLPEVTVLDYGKKCVVIDLDETLVHSS----FKPISNADFIVPVE 109

Query: 954  QDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
             D      ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  
Sbjct: 110  IDGTIHQVYVLKRPHVD----------EFLQRMGQLFECVLFTASLAKYADPVADLLDRW 159

Query: 1014 GVLFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 1059
            GV  A       V  RG+              KDL   LG E S V+I+D+S
Sbjct: 160  GVFRARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIVDNS 198


>gi|426221551|ref|XP_004004972.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
            small phosphatase 1 [Ovis aries]
          Length = 260

 Score = 41.6 bits (96), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 35/127 (27%), Positives = 58/127 (45%), Gaps = 15/127 (11%)

Query: 896  QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 955
            ++  A+ K   + L  + K     K+C+V+DLD TL++S+      PV++       E D
Sbjct: 65   EENGAVPKAPVQYLLPEAKAQDLDKICVVIDLDETLVHSS----FKPVNNADFIIPVEID 120

Query: 956  REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
                  ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  G 
Sbjct: 121  GVVHQVYVLKRPHVD----------EFLQRMGELFECVLFTASLAKYADPVADLLDKWGA 170

Query: 1016 LFAGRVI 1022
             F  R+ 
Sbjct: 171  -FRARLF 176


>gi|148677300|gb|EDL09247.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A) small
            phosphatase-like [Mus musculus]
          Length = 288

 Score = 41.6 bits (96), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 44/146 (30%), Positives = 64/146 (43%), Gaps = 33/146 (22%)

Query: 920  KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 979
            K C+V+DLD TL++S+      P+ +       E D      ++ + PH+          
Sbjct: 118  KKCVVIDLDETLVHSS----FKPISNADFIVPVEIDGTIHQVYVLKRPHVD--------- 164

Query: 980  WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA-----GRVISRGDDGDPFDGD 1034
              FL+R  +LFE  L+T     YA  +A +LD  GV  A       V  RG+        
Sbjct: 165  -EFLQRMGQLFECVLFTASLAKYADPVADLLDRWGVFRARLFRESCVFHRGN-------- 215

Query: 1035 ERVPKSKDLEGVLGME-SAVVIIDDS 1059
                  KDL   LG E S V+I+D+S
Sbjct: 216  ----YVKDLSR-LGRELSKVIIVDNS 236


>gi|118371686|ref|XP_001019041.1| NLI interacting factor-like phosphatase family protein [Tetrahymena
            thermophila]
 gi|89300808|gb|EAR98796.1| NLI interacting factor-like phosphatase family protein [Tetrahymena
            thermophila SB210]
          Length = 379

 Score = 41.6 bits (96), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 36/138 (26%), Positives = 68/138 (49%), Gaps = 23/138 (16%)

Query: 923  LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTF 982
            LVLDLD TL++    +E     D  +   + Q+++K +           +   R  +  F
Sbjct: 212  LVLDLDETLIHC---NEKSLNDDSSIITVQFQNQQKNY-----------YLHQRGYLQEF 257

Query: 983  LERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKD 1042
            LE+ +  F +++YT   + YA E+ K++DP+ V+   +V  R      +DG + +   K 
Sbjct: 258  LEQCALNFNIYIYTASTRDYAEEVVKIIDPRSVI--KKVYDRS--SCFYDGKQYLKSLK- 312

Query: 1043 LEGVLGME-SAVVIIDDS 1059
                LG++ +  V+ID++
Sbjct: 313  ---TLGLDLNRTVMIDNN 327


>gi|380799053|gb|AFE71402.1| CTD small phosphatase-like protein isoform 1, partial [Macaca
            mulatta]
          Length = 249

 Score = 41.6 bits (96), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 48/172 (27%), Positives = 72/172 (41%), Gaps = 33/172 (19%)

Query: 894  DDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEE 953
            D +Q   I     + L  +  +    K C+V+DLD TL++S+      P+ +       E
Sbjct: 53   DQRQVIPIPSPPAKYLLPEVTVLDYGKKCVVIDLDETLVHSS----FKPISNADFIVPVE 108

Query: 954  QDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
             D      ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  
Sbjct: 109  IDGTIHQVYVLKRPHVD----------EFLQRMGQLFECVLFTASLAKYADPVADLLDRW 158

Query: 1014 GVLFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 1059
            GV  A       V  RG+              KDL   LG E S V+I+D+S
Sbjct: 159  GVFRARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIVDNS 197


>gi|145356819|ref|XP_001422622.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144582865|gb|ABP00939.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 190

 Score = 41.6 bits (96), Expect = 2.4,   Method: Composition-based stats.
 Identities = 46/155 (29%), Positives = 70/155 (45%), Gaps = 21/155 (13%)

Query: 923  LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTF 982
            LVLDLD TL           VH  +    E  D   P   +F      +  + RP + TF
Sbjct: 17   LVLDLDETL-----------VHSNLENTVERCDFSFPV--VFNGDMHRVNVRKRPHLSTF 63

Query: 983  LERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKD 1042
            +E  SK +E+ ++T   ++YA ++  +LDP       R+    D     DG+      KD
Sbjct: 64   MELVSKQYEIVVFTASQQIYADKLLDILDPSQKWIKHRIFR--DSCVQIDGN----FMKD 117

Query: 1043 LEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 1076
            L  VLG + S  +IID+S + +     N I +E +
Sbjct: 118  LR-VLGRDLSRTIIIDNSPQAFGLQVENGIPIESW 151


>gi|354467729|ref|XP_003496321.1| PREDICTED: CTD small phosphatase-like protein-like [Cricetulus
            griseus]
          Length = 342

 Score = 41.6 bits (96), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 44/146 (30%), Positives = 64/146 (43%), Gaps = 33/146 (22%)

Query: 920  KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 979
            K C+V+DLD TL++S+      P+ +       E D      ++ + PH+          
Sbjct: 172  KKCVVIDLDETLVHSS----FKPISNADFIVPVEIDGTIHQVYVLKRPHVD--------- 218

Query: 980  WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA-----GRVISRGDDGDPFDGD 1034
              FL+R  +LFE  L+T     YA  +A +LD  GV  A       V  RG+        
Sbjct: 219  -EFLQRMGQLFECVLFTASLAKYADPVADLLDRWGVFRARLFRESCVFHRGN-------- 269

Query: 1035 ERVPKSKDLEGVLGME-SAVVIIDDS 1059
                  KDL   LG E S V+I+D+S
Sbjct: 270  ----YVKDLSR-LGRELSKVIIVDNS 290


>gi|291234069|ref|XP_002736972.1| PREDICTED: CTD (carboxy-terminal domain, RNA polymerase II,
            polypeptide A) small phosphatase 1-like [Saccoglossus
            kowalevskii]
          Length = 251

 Score = 41.6 bits (96), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 45/142 (31%), Positives = 66/142 (46%), Gaps = 23/142 (16%)

Query: 919  RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPG 978
             KLC+V+DLD TL++S+      PV +       E D      ++ +          RP 
Sbjct: 61   HKLCIVIDLDETLVHSS----FKPVSNADFVVPVEIDGTVHQVYVLK----------RPF 106

Query: 979  IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVP 1038
            +  FL++  +LFE  L+T     YA  +A +LD  GV F  R+     D   F    R  
Sbjct: 107  VDEFLQKMGELFECVLFTASLSKYADPVADLLDKWGV-FRARLFR---DSCVF---HRGN 159

Query: 1039 KSKDLEGVLGME-SAVVIIDDS 1059
              KDL G LG +   +VI+D+S
Sbjct: 160  YVKDL-GRLGRDLKKIVIVDNS 180


>gi|301623726|ref|XP_002941162.1| PREDICTED: CTD small phosphatase-like protein 2-A-like [Xenopus
            (Silurana) tropicalis]
          Length = 353

 Score = 41.6 bits (96), Expect = 2.5,   Method: Composition-based stats.
 Identities = 36/133 (27%), Positives = 59/133 (44%), Gaps = 30/133 (22%)

Query: 896  QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNS-------AKFHEVDPVHDEIL 948
            QQ    Q++R + +  + +  SA +  LVLDLD  L++S       A F  + P  D   
Sbjct: 153  QQTPCDQRQRGKDIPFKTR--SAPESTLVLDLDEILVDSSLLPLTGADFTFLIPFQDTYY 210

Query: 949  RKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAK 1008
            +                     ++ KLRP    FLE   K++E+ ++T   K YA ++  
Sbjct: 211  K---------------------VYVKLRPHAMEFLETLCKVYEIFVFTTAKKEYAEKILD 249

Query: 1009 VLDPKGVLFAGRV 1021
            +LDP+  L   R+
Sbjct: 250  LLDPQKKLIRHRL 262


>gi|344288137|ref|XP_003415807.1| PREDICTED: CTD small phosphatase-like protein-like [Loxodonta
            africana]
          Length = 281

 Score = 41.6 bits (96), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 48/172 (27%), Positives = 72/172 (41%), Gaps = 33/172 (19%)

Query: 894  DDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEE 953
            D +Q   I     + L  +  +    K C+V+DLD TL++S+      P+ +       E
Sbjct: 85   DQRQVIPIPSPPAKYLLPEVTVLDYGKKCVVIDLDETLVHSS----FKPISNADFIVPVE 140

Query: 954  QDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
             D      ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  
Sbjct: 141  IDGTIHQVYVLKRPHVD----------EFLQRMGQLFECVLFTASLAKYADPVADLLDRW 190

Query: 1014 GVLFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 1059
            GV  A       V  RG+              KDL   LG E S V+I+D+S
Sbjct: 191  GVFRARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIVDNS 229


>gi|171460950|ref|NP_598471.3| CTD small phosphatase-like protein [Mus musculus]
 gi|408360295|sp|P58465.3|CTDSL_MOUSE RecName: Full=CTD small phosphatase-like protein; Short=CTDSP-like;
            AltName: Full=Carboxy-terminal domain RNA polymerase II
            polypeptide A small phosphatase 3; AltName: Full=NIF-like
            protein; AltName: Full=Nuclear LIM interactor-interacting
            factor 1; Short=NLI-interacting factor 1; AltName:
            Full=Small C-terminal domain phosphatase 3; Short=SCP3;
            Short=Small CTD phosphatase 3
 gi|62948141|gb|AAH94289.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A) small
            phosphatase-like [Mus musculus]
          Length = 276

 Score = 41.6 bits (96), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 44/146 (30%), Positives = 64/146 (43%), Gaps = 33/146 (22%)

Query: 920  KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 979
            K C+V+DLD TL++S+      P+ +       E D      ++ + PH+          
Sbjct: 106  KKCVVIDLDETLVHSS----FKPISNADFIVPVEIDGTIHQVYVLKRPHVD--------- 152

Query: 980  WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA-----GRVISRGDDGDPFDGD 1034
              FL+R  +LFE  L+T     YA  +A +LD  GV  A       V  RG+        
Sbjct: 153  -EFLQRMGQLFECVLFTASLAKYADPVADLLDRWGVFRARLFRESCVFHRGN-------- 203

Query: 1035 ERVPKSKDLEGVLGME-SAVVIIDDS 1059
                  KDL   LG E S V+I+D+S
Sbjct: 204  ----YVKDLSR-LGRELSKVIIVDNS 224


>gi|32812783|emb|CAC69078.2| Mya22 protein [Mus musculus]
          Length = 276

 Score = 41.6 bits (96), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 44/146 (30%), Positives = 64/146 (43%), Gaps = 33/146 (22%)

Query: 920  KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 979
            K C+V+DLD TL++S+      P+ +       E D      ++ + PH+          
Sbjct: 106  KKCVVIDLDETLVHSS----FKPISNADFIVPVEIDGTIHQVYVLKRPHVD--------- 152

Query: 980  WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA-----GRVISRGDDGDPFDGD 1034
              FL+R  +LFE  L+T     YA  +A +LD  GV  A       V  RG+        
Sbjct: 153  -EFLQRMGQLFECVLFTASLAKYADPVADLLDRWGVFRARLFRESCVFHRGN-------- 203

Query: 1035 ERVPKSKDLEGVLGME-SAVVIIDDS 1059
                  KDL   LG E S V+I+D+S
Sbjct: 204  ----YVKDLSR-LGRELSKVIIVDNS 224


>gi|410897359|ref|XP_003962166.1| PREDICTED: uncharacterized protein LOC101077160 [Takifugu rubripes]
          Length = 934

 Score = 41.6 bits (96), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 31/105 (29%), Positives = 50/105 (47%), Gaps = 15/105 (14%)

Query: 918  ARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRP 977
            A+K+C+V+DLD TL++S+      PV D       E +      ++ + PH+        
Sbjct: 761  AKKICVVIDLDETLVHSS----FTPVSDADFIIPVEIEGTVHQVYVLKRPHVD------- 809

Query: 978  GIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVI 1022
                FL+R  +LFE  L+T     YA  ++ +LD  G  F  R+ 
Sbjct: 810  ---EFLKRMGELFECVLFTASLSKYADPVSDMLDTWGA-FRNRLF 850


>gi|426339960|ref|XP_004033903.1| PREDICTED: CTD small phosphatase-like protein [Gorilla gorilla
            gorilla]
          Length = 369

 Score = 41.2 bits (95), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 44/146 (30%), Positives = 64/146 (43%), Gaps = 33/146 (22%)

Query: 920  KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 979
            K C+V+DLD TL++S+      P+ +       E D      ++ + PH+          
Sbjct: 168  KKCVVIDLDETLVHSS----FKPISNADFIVPVEIDGTIHQVYVLKRPHVD--------- 214

Query: 980  WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA-----GRVISRGDDGDPFDGD 1034
              FL+R  +LFE  L+T     YA  +A +LD  GV  A       V  RG+        
Sbjct: 215  -EFLQRMGQLFECVLFTASLAKYADPVADLLDRWGVFRARLFRESCVFHRGN-------- 265

Query: 1035 ERVPKSKDLEGVLGME-SAVVIIDDS 1059
                  KDL   LG E S V+I+D+S
Sbjct: 266  ----YVKDLSR-LGRELSKVIIVDNS 286


>gi|194752999|ref|XP_001958806.1| GF12569 [Drosophila ananassae]
 gi|190620104|gb|EDV35628.1| GF12569 [Drosophila ananassae]
          Length = 282

 Score = 41.2 bits (95), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 36/141 (25%), Positives = 63/141 (44%), Gaps = 7/141 (4%)

Query: 875  VGAHPQSAWGDVEHLFEGYDDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNS 934
            +G++ +S +  +    E Y       I KE     E Q+++    +  LVLDLD TL++S
Sbjct: 50   LGSYVRSFFSLIATKVESYLRPVTEPIYKEVPLSPESQRRLRQVGRKTLVLDLDETLVHS 109

Query: 935  AKFHEVDPVHDEILRKKEEQDREKPHRHLFR----FPHMGMWTKLRPGIWTFLERASKLF 990
                  DP  +E++         KP   L         +      RP +  FL+ ASK +
Sbjct: 110  CY---SDPETNELVGCSLVPQTAKPDYELSVTLEGLDPIAFQVYKRPHVDVFLKFASKWY 166

Query: 991  EMHLYTMGNKLYATEMAKVLD 1011
            ++ ++T   ++YA ++   LD
Sbjct: 167  DLVIFTASLEVYAAQVVDRLD 187


>gi|375097582|ref|ZP_09743847.1| hypothetical protein SacmaDRAFT_4978 [Saccharomonospora marina
           XMU15]
 gi|374658315|gb|EHR53148.1| hypothetical protein SacmaDRAFT_4978 [Saccharomonospora marina
           XMU15]
          Length = 527

 Score = 41.2 bits (95), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 22/71 (30%), Positives = 44/71 (61%), Gaps = 4/71 (5%)

Query: 150 EIELDLESESNEKVSEQVKEEMKLIN--VESIREALESVLRGDISFEGVCSKLEFTLESL 207
           E ELD+E+++ E+V ++ +EE++ +   V ++RE+L+++  G++ +E V    + T   +
Sbjct: 96  EFELDVEAQTRERVEDETQEELQALRSEVTALRESLQALFGGEVLWERVALTAQST--RM 153

Query: 208 RELVNENNVPT 218
           R L  E  V T
Sbjct: 154 RSLAEEPRVVT 164


>gi|348575309|ref|XP_003473432.1| PREDICTED: CTD small phosphatase-like protein-like [Cavia porcellus]
          Length = 294

 Score = 41.2 bits (95), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 48/172 (27%), Positives = 72/172 (41%), Gaps = 33/172 (19%)

Query: 894  DDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEE 953
            D +Q   I     + L  +  +    K C+V+DLD TL++S+      P+ +       E
Sbjct: 98   DQRQVIPIPSPPAKYLLPEVTVLDYGKKCVVIDLDETLVHSS----FKPISNADFIVPVE 153

Query: 954  QDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
             D      ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  
Sbjct: 154  IDGTIHQVYVLKRPHVD----------EFLQRMGQLFECVLFTASLAKYADPVADLLDRW 203

Query: 1014 GVLFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 1059
            GV  A       V  RG+              KDL   LG E S V+I+D+S
Sbjct: 204  GVFRARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIVDNS 242


>gi|154339115|ref|XP_001562249.1| nuclear lim interactor-interacting factor-like protein [Leishmania
            braziliensis MHOM/BR/75/M2904]
 gi|134062832|emb|CAM39277.1| nuclear lim interactor-interacting factor-like protein [Leishmania
            braziliensis MHOM/BR/75/M2904]
          Length = 290

 Score = 41.2 bits (95), Expect = 3.1,   Method: Composition-based stats.
 Identities = 32/104 (30%), Positives = 51/104 (49%), Gaps = 20/104 (19%)

Query: 917  SARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMG-MWT-- 973
            S  K+ LVLD+D TL++S      D V+D++L                  P  G M+T  
Sbjct: 109  SVPKVTLVLDVDETLVHSTFQPSSDVVYDKVLH----------------VPSDGRMYTVS 152

Query: 974  -KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 1016
             K RP +  FL   S+ FE+ ++T   + Y  ++   +DP+G+L
Sbjct: 153  VKYRPYLEDFLRFISRRFEVVVFTASMRAYCDKLMDEIDPQGIL 196


>gi|401423666|ref|XP_003876319.1| nuclear lim interactor-interacting factor-like protein [Leishmania
            mexicana MHOM/GT/2001/U1103]
 gi|322492561|emb|CBZ27838.1| nuclear lim interactor-interacting factor-like protein [Leishmania
            mexicana MHOM/GT/2001/U1103]
          Length = 290

 Score = 41.2 bits (95), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 28/100 (28%), Positives = 47/100 (47%), Gaps = 12/100 (12%)

Query: 917  SARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLR 976
            S  K+ LVLD+D TL++S      D V+D++L    E                 +  K R
Sbjct: 109  SVPKVTLVLDVDETLVHSTFQPSSDVVYDKVLLVPSEGK------------TYTVSVKYR 156

Query: 977  PGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 1016
            P +  FL   S+ FE+ ++T   + Y  ++   +D +G+L
Sbjct: 157  PYLEDFLRFVSRRFEIVVFTASMRAYCDKLMDEIDTQGIL 196


>gi|432100877|gb|ELK29230.1| CTD small phosphatase-like protein [Myotis davidii]
          Length = 280

 Score = 41.2 bits (95), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 48/172 (27%), Positives = 72/172 (41%), Gaps = 33/172 (19%)

Query: 894  DDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEE 953
            D +Q   I     + L  +  +    K C+V+DLD TL++S+      P+ +       E
Sbjct: 84   DQRQVIPIPSPPAKYLLPEVTVLDYGKKCVVIDLDETLVHSS----FKPISNADFIVPVE 139

Query: 954  QDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
             D      ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  
Sbjct: 140  IDGTIHQVYVLKRPHVD----------EFLQRMGQLFECVLFTASLAKYADPVADLLDRW 189

Query: 1014 GVLFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 1059
            GV  A       V  RG+              KDL   LG E S V+I+D+S
Sbjct: 190  GVFRARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIVDNS 228


>gi|6572954|gb|AAF17482.1|AF189774_1 NLI-interacting factor isoform T2 [Gallus gallus]
          Length = 293

 Score = 41.2 bits (95), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 44/146 (30%), Positives = 64/146 (43%), Gaps = 33/146 (22%)

Query: 920  KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 979
            K C+V+DLD TL++S+      P+ +       E D      ++ + PH+          
Sbjct: 94   KKCVVIDLDETLVHSS----FKPISNADFIVPVEIDGTIHQVYVLKRPHVD--------- 140

Query: 980  WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA-----GRVISRGDDGDPFDGD 1034
              FL+R  +LFE  L+T     YA  +A +LD  GV  A       V  RG+        
Sbjct: 141  -EFLQRMGELFECVLFTASLAKYADPVADLLDRWGVFRARLFRESCVFHRGN-------- 191

Query: 1035 ERVPKSKDLEGVLGME-SAVVIIDDS 1059
                  KDL   LG E S V+I+D+S
Sbjct: 192  ----YVKDLSR-LGRELSKVIIVDNS 212


>gi|417409172|gb|JAA51106.1| Putative ctd carboxy-terminal domain rna polymer, partial [Desmodus
            rotundus]
          Length = 265

 Score = 41.2 bits (95), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 44/146 (30%), Positives = 64/146 (43%), Gaps = 33/146 (22%)

Query: 920  KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 979
            K C+V+DLD TL++S+      P+ +       E D      ++ + PH+          
Sbjct: 95   KKCVVIDLDETLVHSS----FKPISNADFIVPVEIDGTTHQVYVLKRPHVD--------- 141

Query: 980  WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA-----GRVISRGDDGDPFDGD 1034
              FL+R  +LFE  L+T     YA  +A +LD  GV  A       V  RG+        
Sbjct: 142  -EFLQRMGQLFECVLFTASLAKYADPVADLLDRWGVFRARLFRESCVFHRGN-------- 192

Query: 1035 ERVPKSKDLEGVLGME-SAVVIIDDS 1059
                  KDL   LG E S V+I+D+S
Sbjct: 193  ----YVKDLSR-LGRELSKVIIVDNS 213


>gi|432849192|ref|XP_004066577.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
            small phosphatase 1-like [Oryzias latipes]
          Length = 262

 Score = 41.2 bits (95), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 32/121 (26%), Positives = 55/121 (45%), Gaps = 14/121 (11%)

Query: 896  QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 955
            ++   + K + + L    K   A K+C+V+DLD TL++S+      PV++       E D
Sbjct: 67   EENGTLSKAQVKPLLPPVKSKDAGKICVVIDLDETLVHSS----FKPVNNADFIIPVEID 122

Query: 956  REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
                  ++ + PH+            FL+R  +LFE  L+T     YA  ++ +LD  G 
Sbjct: 123  GTVHQVYVLKRPHVD----------EFLKRMGELFECVLFTASLAKYADPVSDLLDKWGA 172

Query: 1016 L 1016
             
Sbjct: 173  F 173


>gi|452823685|gb|EME30693.1| putative CTD small phosphatase [Galdieria sulphuraria]
          Length = 397

 Score = 41.2 bits (95), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 42/156 (26%), Positives = 67/156 (42%), Gaps = 21/156 (13%)

Query: 920  KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 979
            K  LVLDLD TL++S  F       D +L  + E              ++ ++ K+RP +
Sbjct: 203  KKTLVLDLDETLVHSG-FEGSRETSDFVLSMQVEN------------TNLQLFVKMRPYL 249

Query: 980  WTFLERASKLFEMHLYTMGNKLYATEMAKVL-DPKGVLF---AGRVISRGDDGDPFDGDE 1035
              FL+  +K FE+ ++T     YA  +  ++ D  GV       R+     + DP    E
Sbjct: 250  KEFLQEVTKHFEIVIFTASMVTYADPVIDLMFDATGVAHIPETHRLFRESCEYDP----E 305

Query: 1036 RVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLI 1071
                 KDL  +      V+I+D+S   +  N  N I
Sbjct: 306  TCSFHKDLMALGRDIKKVIIVDNSPTAYTKNPYNAI 341


>gi|302806322|ref|XP_002984911.1| hypothetical protein SELMODRAFT_121282 [Selaginella moellendorffii]
 gi|300147497|gb|EFJ14161.1| hypothetical protein SELMODRAFT_121282 [Selaginella moellendorffii]
          Length = 198

 Score = 41.2 bits (95), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 19/46 (41%), Positives = 28/46 (60%)

Query: 976  RPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
            RPG+ TFL   S+++E+ ++T   K YA  +   LDP G LF  R+
Sbjct: 65   RPGVDTFLNEMSQIYEIVVFTRAVKPYADRILDRLDPAGNLFTHRL 110


>gi|145498624|ref|XP_001435299.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124402430|emb|CAK67902.1| unnamed protein product [Paramecium tetraurelia]
          Length = 417

 Score = 41.2 bits (95), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 31/111 (27%), Positives = 54/111 (48%), Gaps = 16/111 (14%)

Query: 956  REKPHRHLFRFPHMGMWTKL----RPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLD 1011
            RE P   +  F   G   K+    RP    FL++ ++L+ +++YT  +  YA  +   LD
Sbjct: 253  RENPQVTVTAFGEYGEEAKIHFNIRPFCTWFLQQMNQLYTIYVYTASSSAYANAIVNYLD 312

Query: 1012 PKGVLFAGRVISRGDDGDPFDG----DERVPKSKDLEGVLGMESAVVIIDD 1058
            PK     G ++SRG+  +  +G    D R+  +K L+        +VI+D+
Sbjct: 313  PKRQWIMG-ILSRGNCMETKNGFFIKDLRIIGNKQLKD-------MVIVDN 355


>gi|354502403|ref|XP_003513276.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
            small phosphatase 1-like [Cricetulus griseus]
          Length = 342

 Score = 41.2 bits (95), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 30/99 (30%), Positives = 47/99 (47%), Gaps = 14/99 (14%)

Query: 920  KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 979
            K+C+V+DLD TL++S+      PV++       E D      ++ + PH+          
Sbjct: 171  KICVVIDLDETLVHSS----FKPVNNADFIIPVEIDGVIHQVYVLKRPHVD--------- 217

Query: 980  WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA 1018
              FL+R  +LFE  L+T     YA  +A +LD  G   A
Sbjct: 218  -EFLQRMGELFECVLFTASLAKYADPVADLLDKWGAFRA 255


>gi|345321149|ref|XP_001521318.2| PREDICTED: CTD small phosphatase-like protein-like, partial
            [Ornithorhynchus anatinus]
          Length = 295

 Score = 41.2 bits (95), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 44/146 (30%), Positives = 64/146 (43%), Gaps = 33/146 (22%)

Query: 920  KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 979
            K C+V+DLD TL++S+      P+ +       E D      ++ + PH+          
Sbjct: 126  KKCVVIDLDETLVHSS----FKPISNADFIVPVEIDGTVHQVYVLKRPHVD--------- 172

Query: 980  WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA-----GRVISRGDDGDPFDGD 1034
              FL+R  +LFE  L+T     YA  +A +LD  GV  A       V  RG+        
Sbjct: 173  -EFLQRMGQLFECVLFTASLAKYADPVADLLDRWGVFRARLFRESCVFHRGN-------- 223

Query: 1035 ERVPKSKDLEGVLGME-SAVVIIDDS 1059
                  KDL   LG E S V+I+D+S
Sbjct: 224  ----YVKDLSR-LGRELSKVIIVDNS 244


>gi|327274307|ref|XP_003221919.1| PREDICTED: CTD small phosphatase-like protein-like [Anolis
            carolinensis]
          Length = 340

 Score = 41.2 bits (95), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 44/146 (30%), Positives = 64/146 (43%), Gaps = 33/146 (22%)

Query: 920  KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 979
            K C+V+DLD TL++S+      P+ +       E D      ++ + PH+          
Sbjct: 170  KKCVVIDLDETLVHSS----FKPISNADFIVPVEIDGTIHQVYVLKRPHVD--------- 216

Query: 980  WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA-----GRVISRGDDGDPFDGD 1034
              FL+R  +LFE  L+T     YA  +A +LD  GV  A       V  RG+        
Sbjct: 217  -EFLQRMGELFECVLFTASLAKYADPVADLLDRWGVFRARLFRESCVFHRGN-------- 267

Query: 1035 ERVPKSKDLEGVLGME-SAVVIIDDS 1059
                  KDL   LG E S V+I+D+S
Sbjct: 268  ----YVKDLSR-LGRELSKVIIVDNS 288


>gi|9972366|gb|AAG10616.1|AC008030_16 Hypothetical protein [Arabidopsis thaliana]
          Length = 247

 Score = 41.2 bits (95), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 45/163 (27%), Positives = 81/163 (49%), Gaps = 24/163 (14%)

Query: 908  RLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP 967
            +LE+    ++  K  ++LDLD TL++ A  H     HD ++  K E++            
Sbjct: 63   KLEDPLTGYTNMKRTIILDLDETLVH-ATTHLPGVKHDFMVMVKMEREI----------- 110

Query: 968  HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD 1027
             M ++   RPG+  FLER  + +++ ++T G + YA+++   LD  GV+ + R+    D 
Sbjct: 111  -MPIFVVKRPGVTEFLERLGENYKVVVFTAGLEEYASQVLDKLDKNGVI-SQRLYR--DS 166

Query: 1028 GDPFDGDERVPKSKDLEGVLG--MESAVVIIDD--SVRVWPHN 1066
                +G       KDL  V+G  + SA+++ D+  S  + P N
Sbjct: 167  CTEVNG----KYVKDLSLVVGKDLRSALIVDDNPSSYSLQPEN 205


>gi|433602192|ref|YP_007034561.1| putative membrane protein [Saccharothrix espanaensis DSM 44229]
 gi|407880045|emb|CCH27688.1| putative membrane protein [Saccharothrix espanaensis DSM 44229]
          Length = 609

 Score = 41.2 bits (95), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 22/69 (31%), Positives = 45/69 (65%), Gaps = 4/69 (5%)

Query: 150 EIELDLESESNEKVSEQVKEEMKLINVE--SIREALESVLRGDISFEGVCSKLEFTLESL 207
           E EL++E+E+ ++V E+ K++++ +  E  ++RE LE++L G++  E V  + E T   +
Sbjct: 99  EYELEIEAETRKRVQEEAKDDLEALRGELRALRENLEALLGGEVLVERVALRAESTR--M 156

Query: 208 RELVNENNV 216
           R L +++ V
Sbjct: 157 RALSDQSRV 165


>gi|302806328|ref|XP_002984914.1| hypothetical protein SELMODRAFT_121036 [Selaginella moellendorffii]
 gi|302806330|ref|XP_002984915.1| hypothetical protein SELMODRAFT_121271 [Selaginella moellendorffii]
 gi|300147500|gb|EFJ14164.1| hypothetical protein SELMODRAFT_121036 [Selaginella moellendorffii]
 gi|300147501|gb|EFJ14165.1| hypothetical protein SELMODRAFT_121271 [Selaginella moellendorffii]
          Length = 198

 Score = 41.2 bits (95), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 19/46 (41%), Positives = 28/46 (60%)

Query: 976  RPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
            RPG+ TFL   S+++E+ ++T   K YA  +   LDP G LF  R+
Sbjct: 65   RPGVDTFLNEMSQIYEIVVFTRAVKPYADRILDRLDPAGNLFTHRL 110


>gi|302808545|ref|XP_002985967.1| hypothetical protein SELMODRAFT_123223 [Selaginella moellendorffii]
 gi|300146474|gb|EFJ13144.1| hypothetical protein SELMODRAFT_123223 [Selaginella moellendorffii]
          Length = 198

 Score = 41.2 bits (95), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 19/46 (41%), Positives = 28/46 (60%)

Query: 976  RPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
            RPG+ TFL   S+++E+ ++T   K YA  +   LDP G LF  R+
Sbjct: 65   RPGVDTFLNEMSEIYEIVVFTRAVKPYADRILDRLDPAGNLFTHRL 110


>gi|326921454|ref|XP_003206974.1| PREDICTED: CTD small phosphatase-like protein-like [Meleagris
            gallopavo]
          Length = 264

 Score = 41.2 bits (95), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 44/146 (30%), Positives = 64/146 (43%), Gaps = 33/146 (22%)

Query: 920  KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 979
            K C+V+DLD TL++S+      P+ +       E D      ++ + PH+          
Sbjct: 94   KKCVVIDLDETLVHSS----FKPISNADFIVPVEIDGTIHQVYVLKRPHVD--------- 140

Query: 980  WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA-----GRVISRGDDGDPFDGD 1034
              FL+R  +LFE  L+T     YA  +A +LD  GV  A       V  RG+        
Sbjct: 141  -EFLQRMGELFECVLFTASLAKYADPVADLLDRWGVFRARLFRESCVFHRGN-------- 191

Query: 1035 ERVPKSKDLEGVLGME-SAVVIIDDS 1059
                  KDL   LG E S V+I+D+S
Sbjct: 192  ----YVKDLSR-LGRELSKVIIVDNS 212


>gi|145494426|ref|XP_001433207.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124400324|emb|CAK65810.1| unnamed protein product [Paramecium tetraurelia]
          Length = 223

 Score = 40.8 bits (94), Expect = 3.8,   Method: Composition-based stats.
 Identities = 43/168 (25%), Positives = 77/168 (45%), Gaps = 34/168 (20%)

Query: 903  KERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRH 962
            K R  RL+E     + ++  LVLDLD TL++S    +   +   I      QD ++P   
Sbjct: 27   KSRFVRLKESN---NRKQKILVLDLDETLIHSCTHRDFPHITITI------QDNDEPIDI 77

Query: 963  LFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVI 1022
             F          +RP    F++  S  + ++L+T  +++YA  +   LDPK   +   ++
Sbjct: 78   AF---------NVRPYCKEFIKEMSNYYTIYLFTASSEMYARAIVNHLDPKR-QYITDIL 127

Query: 1023 SRGDDGDPFDG----DERVPKSKDLEGVLGMESAVVIIDDSVRVWPHN 1066
             R +  +  +G    D R+  ++DL+        +VIID+     PH+
Sbjct: 128  CRNNCFETKNGFFIKDLRIITNRDLKD-------IVIIDN----LPHS 164


>gi|403278958|ref|XP_003931046.1| PREDICTED: CTD small phosphatase-like protein [Saimiri boliviensis
            boliviensis]
          Length = 390

 Score = 40.8 bits (94), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 44/146 (30%), Positives = 64/146 (43%), Gaps = 33/146 (22%)

Query: 920  KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 979
            K C+V+DLD TL++S+      P+ +       E D      ++ + PH+          
Sbjct: 124  KKCVVIDLDETLVHSS----FKPISNADFIVPVEIDGTIHQVYVLKRPHVD--------- 170

Query: 980  WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA-----GRVISRGDDGDPFDGD 1034
              FL+R  +LFE  L+T     YA  +A +LD  GV  A       V  RG+        
Sbjct: 171  -EFLQRMGQLFECVLFTASLAKYADPVADLLDRWGVFRARLFRESCVFHRGN-------- 221

Query: 1035 ERVPKSKDLEGVLGME-SAVVIIDDS 1059
                  KDL   LG E S V+I+D+S
Sbjct: 222  ----YVKDLSR-LGRELSKVIIVDNS 242


>gi|224044591|ref|XP_002196499.1| PREDICTED: uncharacterized protein LOC100232268 isoform 2
            [Taeniopygia guttata]
 gi|6572958|gb|AAF17484.1|AF189776_1 NLI-interacting factor isoform R5 [Gallus gallus]
          Length = 264

 Score = 40.8 bits (94), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 44/146 (30%), Positives = 64/146 (43%), Gaps = 33/146 (22%)

Query: 920  KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 979
            K C+V+DLD TL++S+      P+ +       E D      ++ + PH+          
Sbjct: 94   KKCVVIDLDETLVHSS----FKPISNADFIVPVEIDGTIHQVYVLKRPHVD--------- 140

Query: 980  WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA-----GRVISRGDDGDPFDGD 1034
              FL+R  +LFE  L+T     YA  +A +LD  GV  A       V  RG+        
Sbjct: 141  -EFLQRMGELFECVLFTASLAKYADPVADLLDRWGVFRARLFRESCVFHRGN-------- 191

Query: 1035 ERVPKSKDLEGVLGME-SAVVIIDDS 1059
                  KDL   LG E S V+I+D+S
Sbjct: 192  ----YVKDLSR-LGRELSKVIIVDNS 212


>gi|145545436|ref|XP_001458402.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124426222|emb|CAK91005.1| unnamed protein product [Paramecium tetraurelia]
          Length = 423

 Score = 40.8 bits (94), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 31/111 (27%), Positives = 54/111 (48%), Gaps = 16/111 (14%)

Query: 956  REKPHRHLFRFPHMGMWTKL----RPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLD 1011
            RE P   +  F   G   K+    RP    FL++ ++LF ++++T  +  YA  +   LD
Sbjct: 259  RENPQVTVTVFGDYGEEAKIHFNIRPFCTWFLQQMNQLFTIYVFTASSSAYANAIVNYLD 318

Query: 1012 PKGVLFAGRVISRGDDGDPFDG----DERVPKSKDLEGVLGMESAVVIIDD 1058
            PK     G ++SRG+  +  +G    D R+  +K L+        +VI+D+
Sbjct: 319  PKKQWIMG-ILSRGNCMETKNGFFIKDLRIVGNKQLKD-------MVIVDN 361


>gi|332216348|ref|XP_003257311.1| PREDICTED: CTD small phosphatase-like protein [Nomascus leucogenys]
          Length = 341

 Score = 40.8 bits (94), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 44/146 (30%), Positives = 64/146 (43%), Gaps = 33/146 (22%)

Query: 920  KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 979
            K C+V+DLD TL++S+      P+ +       E D      ++ + PH+          
Sbjct: 140  KKCVVIDLDETLVHSS----FKPISNADFIVPVEIDGTIHQVYVLKRPHVD--------- 186

Query: 980  WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA-----GRVISRGDDGDPFDGD 1034
              FL+R  +LFE  L+T     YA  +A +LD  GV  A       V  RG+        
Sbjct: 187  -EFLQRMGQLFECVLFTASLAKYADPVADLLDRWGVFRARLFRESCVFHRGN-------- 237

Query: 1035 ERVPKSKDLEGVLGME-SAVVIIDDS 1059
                  KDL   LG E S V+I+D+S
Sbjct: 238  ----YVKDLSR-LGRELSKVIIVDNS 258


>gi|417397992|gb|JAA46029.1| Putative carboxy-terminal domain rna polymerase ii polypeptide a
            small phosphatase 1 isoform 2 [Desmodus rotundus]
          Length = 260

 Score = 40.8 bits (94), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 34/123 (27%), Positives = 57/123 (46%), Gaps = 14/123 (11%)

Query: 896  QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 955
            ++  A+ K   + L  + K   + K+C+V+DLD TL++S+      PV++       E D
Sbjct: 65   EENGAVPKTPVQYLLPEAKPQDSDKICVVIDLDETLVHSS----FKPVNNADFIIPVEID 120

Query: 956  REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
                  ++ +          RP +  FL+R  +LFE  L+T     YA  +A +LD  G 
Sbjct: 121  GVVHQVYVLK----------RPYVDEFLQRMGELFECVLFTASLAKYADPVADLLDKWGA 170

Query: 1016 LFA 1018
              A
Sbjct: 171  FRA 173


>gi|145497555|ref|XP_001434766.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124401894|emb|CAK67369.1| unnamed protein product [Paramecium tetraurelia]
          Length = 249

 Score = 40.8 bits (94), Expect = 4.4,   Method: Composition-based stats.
 Identities = 48/160 (30%), Positives = 68/160 (42%), Gaps = 21/160 (13%)

Query: 917  SARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLR 976
            + ++  LVLDLD TL+ S    E     DE +  K     EK             + K+R
Sbjct: 68   TEKEFTLVLDLDETLIRSEM--ERTSFLDEEIIVKIGNTIEK------------YYVKIR 113

Query: 977  PGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDER 1036
            P    FL+  SK FE+ ++T   K YA ++   LDP G  F  R   R D     DG   
Sbjct: 114  PFARDFLKALSKYFELVIFTAALKEYADKVIDYLDPSG--FIKRRFYR-DSCTKKDG--- 167

Query: 1037 VPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERY 1076
                KDL  V        IID+S+     N  N ++++ +
Sbjct: 168  -VFYKDLTKVNSNLEKTFIIDNSLSGMSLNPQNGLLIKSW 206


>gi|348552620|ref|XP_003462125.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
            small phosphatase 1-like [Cavia porcellus]
          Length = 261

 Score = 40.8 bits (94), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 35/124 (28%), Positives = 58/124 (46%), Gaps = 15/124 (12%)

Query: 896  QQKAAIQKER-TRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQ 954
            ++  AI K+   + L  + K   + K+C+V+DLD TL++S+      PV++       E 
Sbjct: 65   EENGAIPKQTPVQYLLPEAKAQDSDKICVVIDLDETLVHSS----FKPVNNADFIIPVEI 120

Query: 955  DREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKG 1014
            D      ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  G
Sbjct: 121  DGVIHQVYVLKRPHVD----------EFLQRMGELFECVLFTASLAKYADPVADLLDKWG 170

Query: 1015 VLFA 1018
               A
Sbjct: 171  AFRA 174


>gi|71026803|ref|XP_763045.1| nuclear LIM interactor-interacting factor 1 [Theileria parva strain
            Muguga]
 gi|68349998|gb|EAN30762.1| nuclear LIM interactor-interacting factor 1, putative [Theileria
            parva]
          Length = 254

 Score = 40.8 bits (94), Expect = 4.4,   Method: Composition-based stats.
 Identities = 46/163 (28%), Positives = 74/163 (45%), Gaps = 22/163 (13%)

Query: 914  KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWT 973
            K+ + RK  LVLDLD TL++S+     +P ++    +  +   E+            ++ 
Sbjct: 64   KVCTVRKKMLVLDLDETLIHSS----FEPSNNSFPMQLMQNGVERT-----------IYI 108

Query: 974  KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDG 1033
              RP +  FL   S  +E+ ++T G K YA  +   +DP GV    R + R D    ++G
Sbjct: 109  GKRPYLSEFLSVVSNFYEIVIFTAGLKSYADPVIDFIDPDGV--CKRRLFR-DSCKYWNG 165

Query: 1034 DERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERY 1076
                   KDLE +      VV ID+S   +  N  N I +E +
Sbjct: 166  ----YYIKDLEILNKPLKDVVTIDNSPCCYCLNPENAIPIETW 204


>gi|301757683|ref|XP_002914696.1| PREDICTED: CTD small phosphatase-like protein-like [Ailuropoda
            melanoleuca]
          Length = 283

 Score = 40.8 bits (94), Expect = 4.5,   Method: Compositional matrix adjust.
 Identities = 44/146 (30%), Positives = 64/146 (43%), Gaps = 33/146 (22%)

Query: 920  KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 979
            K C+V+DLD TL++S+      P+ +       E D      ++ + PH+          
Sbjct: 113  KKCVVIDLDETLVHSS----FKPISNADFIVPVEIDGTIHQVYVLKRPHVD--------- 159

Query: 980  WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA-----GRVISRGDDGDPFDGD 1034
              FL+R  +LFE  L+T     YA  +A +LD  GV  A       V  RG+        
Sbjct: 160  -EFLQRMGQLFECVLFTASLAKYADPVADLLDRWGVFRARLFRESCVFHRGN-------- 210

Query: 1035 ERVPKSKDLEGVLGME-SAVVIIDDS 1059
                  KDL   LG E S V+I+D+S
Sbjct: 211  ----YVKDLSR-LGRELSKVIIVDNS 231


>gi|403346652|gb|EJY72729.1| NLI interacting factor-like phosphatase family protein [Oxytricha
            trifallax]
          Length = 368

 Score = 40.8 bits (94), Expect = 4.5,   Method: Composition-based stats.
 Identities = 18/43 (41%), Positives = 28/43 (65%)

Query: 968  HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVL 1010
             M M+  LRPGI+TFL+  S+ FE+ L+  GN+ Y   + K++
Sbjct: 179  QMKMFAYLRPGIYTFLDTLSEHFEIVLFNNGNQEYTENLVKLI 221


>gi|410906319|ref|XP_003966639.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
            small phosphatase 1-like [Takifugu rubripes]
          Length = 262

 Score = 40.8 bits (94), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 31/121 (25%), Positives = 55/121 (45%), Gaps = 14/121 (11%)

Query: 896  QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 955
            ++   + K + + L    K   + K+C+V+DLD TL++S+      PV++       E D
Sbjct: 67   EENGTVSKVQVKPLLPPAKSKDSGKICVVIDLDETLVHSS----FKPVNNADFIIPVEID 122

Query: 956  REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
                  ++ + PH+            FL+R  +LFE  L+T     YA  ++ +LD  G 
Sbjct: 123  GTVHQVYVLKRPHVD----------EFLKRMGELFECVLFTASLAKYADPVSDLLDKWGA 172

Query: 1016 L 1016
             
Sbjct: 173  F 173


>gi|351699531|gb|EHB02450.1| Carboxy-terminal domain RNA polymerase II polypeptide A small
            phosphatase 1 [Heterocephalus glaber]
          Length = 261

 Score = 40.8 bits (94), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 35/124 (28%), Positives = 58/124 (46%), Gaps = 15/124 (12%)

Query: 896  QQKAAIQKER-TRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQ 954
            ++  AI K+   + L  + K   + K+C+V+DLD TL++S+      PV++       E 
Sbjct: 65   EENGAIPKQSPVQYLLPEAKAQDSDKICVVIDLDETLVHSS----FKPVNNADFIIPVEI 120

Query: 955  DREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKG 1014
            D      ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  G
Sbjct: 121  DGVVHQVYVLKRPHVD----------EFLQRMGELFECVLFTASLAKYADPVADLLDKWG 170

Query: 1015 VLFA 1018
               A
Sbjct: 171  AFRA 174


>gi|397512000|ref|XP_003826348.1| PREDICTED: CTD small phosphatase-like protein [Pan paniscus]
          Length = 326

 Score = 40.8 bits (94), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 44/146 (30%), Positives = 64/146 (43%), Gaps = 33/146 (22%)

Query: 920  KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 979
            K C+V+DLD TL++S+      P+ +       E D      ++ + PH+          
Sbjct: 125  KKCVVIDLDETLVHSS----FKPISNADFIVPVEIDGTIHQVYVLKRPHVD--------- 171

Query: 980  WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA-----GRVISRGDDGDPFDGD 1034
              FL+R  +LFE  L+T     YA  +A +LD  GV  A       V  RG+        
Sbjct: 172  -EFLQRMGQLFECVLFTASLAKYADPVADLLDRWGVFRARLFRESCVFHRGN-------- 222

Query: 1035 ERVPKSKDLEGVLGME-SAVVIIDDS 1059
                  KDL   LG E S V+I+D+S
Sbjct: 223  ----YVKDLSR-LGRELSKVIIVDNS 243


>gi|56549683|ref|NP_001008393.1| CTD small phosphatase-like protein isoform 1 [Homo sapiens]
 gi|114586014|ref|XP_001170981.1| PREDICTED: CTD (carboxy-terminal domain, RNA polymerase II,
            polypeptide A) small phosphatase-like isoform 2 [Pan
            troglodytes]
 gi|51704233|sp|O15194.2|CTDSL_HUMAN RecName: Full=CTD small phosphatase-like protein; Short=CTDSP-like;
            AltName: Full=Carboxy-terminal domain RNA polymerase II
            polypeptide A small phosphatase 3; AltName: Full=NIF-like
            protein; AltName: Full=Nuclear LIM interactor-interacting
            factor 1; Short=NLI-interacting factor 1; AltName:
            Full=Protein YA22; Short=hYA22; AltName: Full=RBSP3;
            AltName: Full=Small C-terminal domain phosphatase 3;
            Short=SCP3; Short=Small CTD phosphatase 3
 gi|34392247|emb|CAE11805.1| RB serine phosphatase [Homo sapiens]
 gi|410228144|gb|JAA11291.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A) small
            phosphatase-like [Pan troglodytes]
 gi|410291074|gb|JAA24137.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A) small
            phosphatase-like [Pan troglodytes]
 gi|410334185|gb|JAA36039.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A) small
            phosphatase-like [Pan troglodytes]
          Length = 276

 Score = 40.8 bits (94), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 44/146 (30%), Positives = 64/146 (43%), Gaps = 33/146 (22%)

Query: 920  KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 979
            K C+V+DLD TL++S+      P+ +       E D      ++ + PH+          
Sbjct: 106  KKCVVIDLDETLVHSS----FKPISNADFIVPVEIDGTIHQVYVLKRPHVD--------- 152

Query: 980  WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA-----GRVISRGDDGDPFDGD 1034
              FL+R  +LFE  L+T     YA  +A +LD  GV  A       V  RG+        
Sbjct: 153  -EFLQRMGQLFECVLFTASLAKYADPVADLLDRWGVFRARLFRESCVFHRGN-------- 203

Query: 1035 ERVPKSKDLEGVLGME-SAVVIIDDS 1059
                  KDL   LG E S V+I+D+S
Sbjct: 204  ----YVKDLSR-LGRELSKVIIVDNS 224


>gi|290990355|ref|XP_002677802.1| nuclear lim interactor-interacting protein [Naegleria gruberi]
 gi|284091411|gb|EFC45058.1| nuclear lim interactor-interacting protein [Naegleria gruberi]
          Length = 332

 Score = 40.8 bits (94), Expect = 4.8,   Method: Composition-based stats.
 Identities = 29/100 (29%), Positives = 48/100 (48%), Gaps = 14/100 (14%)

Query: 914  KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWT 973
            K  S   + LVLDLD TL++ +     +P+ D           +     LF      ++ 
Sbjct: 145  KELSQPDITLVLDLDETLVHCS----TEPIPDP----------DFTFTVLFHGVEYTVYV 190

Query: 974  KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
            + RP    FLE  SK+FE+ ++T    +YA ++  +LDP+
Sbjct: 191  RKRPYFVEFLEAVSKIFEVVVFTASQSVYADKLLSILDPE 230


>gi|10864009|ref|NP_067021.1| carboxy-terminal domain RNA polymerase II polypeptide A small
            phosphatase 1 isoform 1 [Homo sapiens]
 gi|397495662|ref|XP_003818666.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
            small phosphatase 1 isoform 1 [Pan paniscus]
 gi|402889395|ref|XP_003908002.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
            small phosphatase 1 isoform 1 [Papio anubis]
 gi|426338589|ref|XP_004033258.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
            small phosphatase 1 isoform 1 [Gorilla gorilla gorilla]
 gi|17865510|sp|Q9GZU7.1|CTDS1_HUMAN RecName: Full=Carboxy-terminal domain RNA polymerase II polypeptide A
            small phosphatase 1; AltName: Full=Nuclear LIM
            interactor-interacting factor 3; Short=NLI-IF;
            Short=NLI-interacting factor 3; AltName: Full=Small
            C-terminal domain phosphatase 1; Short=SCP1; Short=Small
            CTD phosphatase 1
 gi|10257407|gb|AAG15402.1|AF229162_1 nuclear LIM interactor-interacting factor [Homo sapiens]
 gi|10257410|gb|AAG15404.1| nuclear LIM interactor-interacting factor [Homo sapiens]
 gi|15278033|gb|AAH12977.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A) small
            phosphatase 1 [Homo sapiens]
 gi|119591021|gb|EAW70615.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A) small
            phosphatase 1, isoform CRA_a [Homo sapiens]
 gi|119591024|gb|EAW70618.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A) small
            phosphatase 1, isoform CRA_a [Homo sapiens]
 gi|167773945|gb|ABZ92407.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A) small
            phosphatase 1 [synthetic construct]
 gi|208966090|dbj|BAG73059.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A) small
            phosphatase 1 [synthetic construct]
          Length = 261

 Score = 40.8 bits (94), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 35/124 (28%), Positives = 58/124 (46%), Gaps = 15/124 (12%)

Query: 896  QQKAAIQKER-TRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQ 954
            ++  AI K+   + L  + K   + K+C+V+DLD TL++S+      PV++       E 
Sbjct: 65   EENGAIPKQTPVQYLLPEAKAQDSDKICVVIDLDETLVHSS----FKPVNNADFIIPVEI 120

Query: 955  DREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKG 1014
            D      ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  G
Sbjct: 121  DGVVHQVYVLKRPHVD----------EFLQRMGELFECVLFTASLAKYADPVADLLDKWG 170

Query: 1015 VLFA 1018
               A
Sbjct: 171  AFRA 174


>gi|296205578|ref|XP_002749828.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
            small phosphatase 1 isoform 1 [Callithrix jacchus]
          Length = 261

 Score = 40.4 bits (93), Expect = 5.0,   Method: Compositional matrix adjust.
 Identities = 35/124 (28%), Positives = 58/124 (46%), Gaps = 15/124 (12%)

Query: 896  QQKAAIQKER-TRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQ 954
            ++  AI K+   + L  + K   + K+C+V+DLD TL++S+      PV++       E 
Sbjct: 65   EENGAIPKQTPVQYLLPEAKAQDSDKICVVIDLDETLVHSS----FKPVNNADFIIPVEI 120

Query: 955  DREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKG 1014
            D      ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  G
Sbjct: 121  DGVVHQVYVLKRPHVD----------EFLQRMGELFECVLFTASLAKYADPVADLLDKWG 170

Query: 1015 VLFA 1018
               A
Sbjct: 171  AFRA 174


>gi|431917984|gb|ELK17213.1| Carboxy-terminal domain RNA polymerase II polypeptide A small
            phosphatase 1 [Pteropus alecto]
          Length = 261

 Score = 40.4 bits (93), Expect = 5.4,   Method: Compositional matrix adjust.
 Identities = 30/99 (30%), Positives = 47/99 (47%), Gaps = 14/99 (14%)

Query: 920  KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 979
            K+C+V+DLD TL++S+      PV++       E D      ++ + PH+          
Sbjct: 90   KICVVIDLDETLVHSS----FKPVNNADFIIPVEIDGVVHQVYVLKRPHVD--------- 136

Query: 980  WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA 1018
              FL+R  +LFE  L+T     YA  +A +LD  G   A
Sbjct: 137  -EFLQRMGELFECVLFTASLAKYADPVADLLDKWGAFRA 174


>gi|302806318|ref|XP_002984909.1| hypothetical protein SELMODRAFT_423987 [Selaginella moellendorffii]
 gi|300147495|gb|EFJ14159.1| hypothetical protein SELMODRAFT_423987 [Selaginella moellendorffii]
          Length = 214

 Score = 40.4 bits (93), Expect = 5.4,   Method: Compositional matrix adjust.
 Identities = 19/46 (41%), Positives = 28/46 (60%)

Query: 976  RPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
            RPG+ TFL   S+++E+ ++T   K YA  +   LDP G LF  R+
Sbjct: 81   RPGVDTFLNEMSQIYEIVVFTRAVKPYADRILDRLDPAGNLFTHRL 126


>gi|145524639|ref|XP_001448147.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124415680|emb|CAK80750.1| unnamed protein product [Paramecium tetraurelia]
          Length = 411

 Score = 40.4 bits (93), Expect = 5.4,   Method: Compositional matrix adjust.
 Identities = 25/84 (29%), Positives = 47/84 (55%), Gaps = 5/84 (5%)

Query: 968  HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD 1027
            H  ++  +RP    FL++ S L+ +++YT  +  YA+ + + LDPKG   +G ++SR + 
Sbjct: 263  HARIYLNIRPFCQWFLQQMSLLYTIYVYTASSSAYASAIVRYLDPKGQWISG-ILSRQNC 321

Query: 1028 GDPFDG----DERVPKSKDLEGVL 1047
             +   G    D RV  +K ++ +L
Sbjct: 322  LETKQGFYIKDLRVISNKQIKNML 345


>gi|395734008|ref|XP_002813985.2| PREDICTED: LOW QUALITY PROTEIN: CTD small phosphatase-like protein
            [Pongo abelii]
          Length = 336

 Score = 40.4 bits (93), Expect = 5.4,   Method: Compositional matrix adjust.
 Identities = 44/146 (30%), Positives = 64/146 (43%), Gaps = 33/146 (22%)

Query: 920  KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 979
            K C+V+DLD TL++S+      P+ +       E D      ++ + PH+          
Sbjct: 135  KKCVVIDLDETLVHSS----FKPISNADFIVPVEIDGTIHQVYVLKRPHVD--------- 181

Query: 980  WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA-----GRVISRGDDGDPFDGD 1034
              FL+R  +LFE  L+T     YA  +A +LD  GV  A       V  RG+        
Sbjct: 182  -EFLQRMGQLFECVLFTASLAKYADPVADLLDRWGVFRARLFRESCVFHRGN-------- 232

Query: 1035 ERVPKSKDLEGVLGME-SAVVIIDDS 1059
                  KDL   LG E S V+I+D+S
Sbjct: 233  ----YVKDLSR-LGRELSKVIIVDNS 253


>gi|359494479|ref|XP_002266587.2| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
           4-like isoform 2 [Vitis vinifera]
          Length = 193

 Score = 40.4 bits (93), Expect = 5.5,   Method: Compositional matrix adjust.
 Identities = 23/52 (44%), Positives = 31/52 (59%), Gaps = 3/52 (5%)

Query: 913 KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLF 964
           K +   +KL LVLDLDHTLLNS +  ++ P   E L  K + D  + +R LF
Sbjct: 140 KNLLRHKKLYLVLDLDHTLLNSTRLLDITP---EELYLKNQTDPLQVYRFLF 188


>gi|440911023|gb|ELR60752.1| Carboxy-terminal domain RNA polymerase II polypeptide A small
            phosphatase 1 [Bos grunniens mutus]
          Length = 261

 Score = 40.4 bits (93), Expect = 5.6,   Method: Compositional matrix adjust.
 Identities = 31/103 (30%), Positives = 49/103 (47%), Gaps = 15/103 (14%)

Query: 920  KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 979
            K+C+V+DLD TL++S+      PV++       E D      ++ + PH+          
Sbjct: 90   KICVVIDLDETLVHSS----FKPVNNADFIIPVEIDGVVHQVYVLKRPHVD--------- 136

Query: 980  WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVI 1022
              FL+R  +LFE  L+T     YA  +A +LD  G  F  R+ 
Sbjct: 137  -EFLQRMGELFECVLFTASLAKYADPVADLLDKWGA-FRARLF 177


>gi|392578955|gb|EIW72082.1| hypothetical protein TREMEDRAFT_41494 [Tremella mesenterica DSM 1558]
          Length = 193

 Score = 40.4 bits (93), Expect = 5.7,   Method: Composition-based stats.
 Identities = 30/96 (31%), Positives = 47/96 (48%), Gaps = 14/96 (14%)

Query: 922  CLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWT 981
            CLVLDLD TLL+S+   ++ P  D I+  + E                 ++   RPG+  
Sbjct: 27   CLVLDLDETLLHSS--FKMLPSADYIVPVEIEGQVHN------------VYVIKRPGVDR 72

Query: 982  FLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLF 1017
            FL    K++E+ ++T     YA  +  +LDP GV+ 
Sbjct: 73   FLYEMGKIYEVVVFTASLSKYADPVLDMLDPNGVVL 108


>gi|302806324|ref|XP_002984912.1| hypothetical protein SELMODRAFT_48489 [Selaginella moellendorffii]
 gi|300147498|gb|EFJ14162.1| hypothetical protein SELMODRAFT_48489 [Selaginella moellendorffii]
          Length = 171

 Score = 40.4 bits (93), Expect = 5.7,   Method: Compositional matrix adjust.
 Identities = 19/46 (41%), Positives = 28/46 (60%)

Query: 976  RPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
            RPG+ TFL   S+++E+ ++T   K YA  +   LDP G LF  R+
Sbjct: 40   RPGVDTFLNEMSQIYEIVVFTRAVKPYADRILDRLDPVGNLFTHRL 85


>gi|255562534|ref|XP_002522273.1| Carboxy-terminal domain RNA polymerase II polypeptide A small
            phosphatase, putative [Ricinus communis]
 gi|223538526|gb|EEF40131.1| Carboxy-terminal domain RNA polymerase II polypeptide A small
            phosphatase, putative [Ricinus communis]
          Length = 300

 Score = 40.4 bits (93), Expect = 5.7,   Method: Composition-based stats.
 Identities = 40/148 (27%), Positives = 69/148 (46%), Gaps = 26/148 (17%)

Query: 879  PQSAWGDVEHLFEGYDD---QQKAAIQKERTRRL----EEQKKMFSARKLCLVLDLDHTL 931
            P S +   + L  G  D   +Q+  ++K+   R+    E+   + S  K  + LDLD TL
Sbjct: 70   PNSRYKGYKILKNGVKDRKREQEPDVEKDGICRVLFFNEKLPPLISPNKRTVFLDLDETL 129

Query: 932  LNSAKFHEVDP---VHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASK 988
            ++S    + DP   V D ++R   + +             M  +   RPG+  FLE  + 
Sbjct: 130  VHS----KADPPPHVFDFVVRPNIDGE------------FMNFYVLKRPGVDEFLEALAA 173

Query: 989  LFEMHLYTMGNKLYATEMAKVLDPKGVL 1016
             +E+ ++T G K YA+ +   LD KG++
Sbjct: 174  KYEVVVFTAGLKAYASLVLDRLDEKGLI 201


>gi|308321688|gb|ADO27995.1| carboxy-terminal domain RNA polymerase II polypeptide A small
            phosphatase 1 [Ictalurus furcatus]
          Length = 264

 Score = 40.4 bits (93), Expect = 5.9,   Method: Compositional matrix adjust.
 Identities = 33/121 (27%), Positives = 54/121 (44%), Gaps = 14/121 (11%)

Query: 896  QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 955
            ++   I K   + L  Q K     K+C+V+DLD TL++S+      PV++       E D
Sbjct: 69   EENGTISKVPAKPLLPQIKSKDVGKICVVIDLDETLVHSS----FKPVNNADFIIPVEID 124

Query: 956  REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
                  ++ + PH+            FL+R  +LFE  L+T     YA  ++ +LD  G 
Sbjct: 125  GAVHQVYVLKRPHVD----------EFLKRMGELFECVLFTASLAKYADPVSDLLDKWGA 174

Query: 1016 L 1016
             
Sbjct: 175  F 175


>gi|302806336|ref|XP_002984918.1| hypothetical protein SELMODRAFT_424005 [Selaginella moellendorffii]
 gi|300147504|gb|EFJ14168.1| hypothetical protein SELMODRAFT_424005 [Selaginella moellendorffii]
          Length = 199

 Score = 40.4 bits (93), Expect = 6.0,   Method: Compositional matrix adjust.
 Identities = 19/46 (41%), Positives = 28/46 (60%)

Query: 976  RPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 1021
            RPG+ TFL   S+++E+ ++T   K YA  +   LDP G LF  R+
Sbjct: 66   RPGVDTFLNEMSQIYEIVVFTRAVKPYADRILDRLDPAGNLFTHRL 111


>gi|114583310|ref|XP_001156881.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
            small phosphatase 1 isoform 2 [Pan troglodytes]
          Length = 261

 Score = 40.4 bits (93), Expect = 6.0,   Method: Compositional matrix adjust.
 Identities = 30/99 (30%), Positives = 47/99 (47%), Gaps = 14/99 (14%)

Query: 920  KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 979
            K+C+V+DLD TL++S+      PV++       E D      ++ + PH+          
Sbjct: 90   KICVVIDLDETLVHSS----FKPVNNADFIIPVEIDGVVHQVYVLKRPHVD--------- 136

Query: 980  WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA 1018
              FL+R  +LFE  L+T     YA  +A +LD  G   A
Sbjct: 137  -EFLQRMGELFECVLFTASLAKYADPVADLLDKWGAFRA 174


>gi|403266876|ref|XP_003925586.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
            small phosphatase 1 isoform 2 [Saimiri boliviensis
            boliviensis]
          Length = 248

 Score = 40.4 bits (93), Expect = 6.2,   Method: Compositional matrix adjust.
 Identities = 36/128 (28%), Positives = 60/128 (46%), Gaps = 16/128 (12%)

Query: 896  QQKAAIQKER-TRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQ 954
            ++  AI K+   + L  + K   + K+C+V+DLD TL++S+      PV++       E 
Sbjct: 52   EENGAIPKQTPVQYLLPEAKAQDSDKICVVIDLDETLVHSS----FKPVNNADFIIPVEI 107

Query: 955  DREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKG 1014
            D      ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  G
Sbjct: 108  DGVVHQVYVLKRPHVD----------EFLQRMGELFECVLFTASLAKYADPVADLLDKWG 157

Query: 1015 VLFAGRVI 1022
              F  R+ 
Sbjct: 158  A-FRARLF 164


>gi|384484378|gb|EIE76558.1| hypothetical protein RO3G_01262 [Rhizopus delemar RA 99-880]
          Length = 348

 Score = 40.4 bits (93), Expect = 6.2,   Method: Compositional matrix adjust.
 Identities = 50/175 (28%), Positives = 75/175 (42%), Gaps = 39/175 (22%)

Query: 919  RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPG 978
            RKL LVLDLD TL        V  V +E  R   E D  K    +      G    L   
Sbjct: 113  RKLPLVLDLDDTL--------VRLVGNENGRFVSESDIPKCKDRVAVLKD-GKRVVLTER 163

Query: 979  IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDP-----KGVLFAGR-----VISRGDDG 1028
            +  FLE A +L+++ + ++G++ Y   +  VLDP     KG+L++ R     + S  D G
Sbjct: 164  VREFLEWAQQLYDISICSLGDQNYVDSVIDVLDPTRSWVKGILYSARAEHDYIRSSPDPG 223

Query: 1029 DPFDGDERVPKSKDLEGVLGM-----------ESAVVIIDDSVRVWPHNKLNLIV 1072
             P          KDL+ +               S  +I+DD  R+WP  + + I+
Sbjct: 224  RP---------PKDLQALYSFCALRDQTLGSGFSLPLILDDETRMWPAEQHDNII 269


>gi|426249781|ref|XP_004018627.1| PREDICTED: CTD small phosphatase-like protein [Ovis aries]
          Length = 255

 Score = 40.4 bits (93), Expect = 6.3,   Method: Compositional matrix adjust.
 Identities = 44/146 (30%), Positives = 64/146 (43%), Gaps = 33/146 (22%)

Query: 920  KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 979
            K C+V+DLD TL++S+      P+ +       E D      ++ + PH+          
Sbjct: 85   KKCVVIDLDETLVHSS----FKPISNADFIVPVEIDGTIHQVYVLKRPHVD--------- 131

Query: 980  WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA-----GRVISRGDDGDPFDGD 1034
              FL+R  +LFE  L+T     YA  +A +LD  GV  A       V  RG+        
Sbjct: 132  -EFLQRMGQLFECVLFTASLAKYADPVADLLDRWGVFRARLFRESCVFHRGN-------- 182

Query: 1035 ERVPKSKDLEGVLGME-SAVVIIDDS 1059
                  KDL   LG E S V+I+D+S
Sbjct: 183  ----YVKDLSR-LGRELSKVIIVDNS 203


>gi|23346509|ref|NP_694728.1| carboxy-terminal domain RNA polymerase II polypeptide A small
            phosphatase 1 [Mus musculus]
 gi|17865506|sp|P58466.1|CTDS1_MOUSE RecName: Full=Carboxy-terminal domain RNA polymerase II polypeptide A
            small phosphatase 1; AltName: Full=Golli-interacting
            protein; Short=GIP; AltName: Full=Nuclear LIM
            interactor-interacting factor 3; Short=NLI-interacting
            factor 3; AltName: Full=Small C-terminal domain
            phosphatase 1; Short=SCP1; Short=Small CTD phosphatase 1
 gi|15145799|gb|AAK83555.1| golli-interacting protein [Mus musculus]
 gi|40796195|gb|AAH65158.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A) small
            phosphatase 1 [Mus musculus]
 gi|51258970|gb|AAH79638.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A) small
            phosphatase 1 [Mus musculus]
 gi|57169202|gb|AAH49184.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A) small
            phosphatase 1 [Mus musculus]
 gi|74191312|dbj|BAE39480.1| unnamed protein product [Mus musculus]
 gi|148667908|gb|EDL00325.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A) small
            phosphatase 1, isoform CRA_a [Mus musculus]
          Length = 261

 Score = 40.0 bits (92), Expect = 6.7,   Method: Compositional matrix adjust.
 Identities = 30/99 (30%), Positives = 47/99 (47%), Gaps = 14/99 (14%)

Query: 920  KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 979
            K+C+V+DLD TL++S+      PV++       E D      ++ + PH+          
Sbjct: 90   KICVVIDLDETLVHSS----FKPVNNADFIIPVEIDGVVHQVYVLKRPHVD--------- 136

Query: 980  WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA 1018
              FL+R  +LFE  L+T     YA  +A +LD  G   A
Sbjct: 137  -EFLQRMGELFECVLFTASLAKYADPVADLLDKWGAFRA 174


>gi|189303571|ref|NP_001121551.1| carboxy-terminal domain RNA polymerase II polypeptide A small
            phosphatase 1 [Rattus norvegicus]
 gi|149016108|gb|EDL75354.1| rCG23761 [Rattus norvegicus]
 gi|171846749|gb|AAI61976.1| Ctdsp1 protein [Rattus norvegicus]
          Length = 261

 Score = 40.0 bits (92), Expect = 6.8,   Method: Compositional matrix adjust.
 Identities = 30/99 (30%), Positives = 47/99 (47%), Gaps = 14/99 (14%)

Query: 920  KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 979
            K+C+V+DLD TL++S+      PV++       E D      ++ + PH+          
Sbjct: 90   KICVVIDLDETLVHSS----FKPVNNADFIIPVEIDGVVHQVYVLKRPHVD--------- 136

Query: 980  WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA 1018
              FL+R  +LFE  L+T     YA  +A +LD  G   A
Sbjct: 137  -EFLQRMGELFECVLFTASLAKYADPVADLLDKWGAFRA 174


>gi|344268533|ref|XP_003406112.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
            small phosphatase 1-like [Loxodonta africana]
          Length = 261

 Score = 40.0 bits (92), Expect = 6.8,   Method: Compositional matrix adjust.
 Identities = 30/99 (30%), Positives = 47/99 (47%), Gaps = 14/99 (14%)

Query: 920  KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 979
            K+C+V+DLD TL++S+      PV++       E D      ++ + PH+          
Sbjct: 90   KICVVIDLDETLVHSS----FKPVNNADFIIPVEIDGVVHQVYVLKRPHVD--------- 136

Query: 980  WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA 1018
              FL+R  +LFE  L+T     YA  +A +LD  G   A
Sbjct: 137  -EFLQRMGELFECVLFTASLAKYADPVADLLDKWGAFRA 174


>gi|301608836|ref|XP_002933982.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
            small phosphatase 1 [Xenopus (Silurana) tropicalis]
          Length = 260

 Score = 40.0 bits (92), Expect = 6.9,   Method: Compositional matrix adjust.
 Identities = 33/127 (25%), Positives = 58/127 (45%), Gaps = 15/127 (11%)

Query: 896  QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 955
            ++  ++ K   + L  + K   A K+C+V+DLD TL++S+      PV++       E +
Sbjct: 65   EENGSVPKSSVKYLLPEVKAQDAGKICVVIDLDETLVHSS----FKPVNNADFIIPVEIE 120

Query: 956  REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
                  ++ + PH+            FL R  ++FE  L+T     YA  +A +LD  G 
Sbjct: 121  GTVHQVYVLKRPHVD----------EFLRRMGEMFECVLFTASLAKYADPVADLLDKWGA 170

Query: 1016 LFAGRVI 1022
             F  R+ 
Sbjct: 171  -FRSRLF 176


>gi|145542289|ref|XP_001456832.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124424645|emb|CAK89435.1| unnamed protein product [Paramecium tetraurelia]
          Length = 423

 Score = 40.0 bits (92), Expect = 6.9,   Method: Compositional matrix adjust.
 Identities = 25/88 (28%), Positives = 47/88 (53%), Gaps = 12/88 (13%)

Query: 975  LRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDG- 1033
            +RP    FL++ S+L+ ++++T  +  YA  +   LDPK     G ++SRG+  +  +G 
Sbjct: 282  IRPFCAWFLQQMSQLYTIYVFTASSSAYANAIVNYLDPKRQWILG-ILSRGNCMETKNGF 340

Query: 1034 ---DERVPKSKDLEGVLGMESAVVIIDD 1058
               D R+  +K L+        +VI+D+
Sbjct: 341  FIKDLRIVGNKQLKD-------MVIVDN 361


>gi|63101171|gb|AAH95870.1| Zgc:113169 [Danio rerio]
          Length = 230

 Score = 40.0 bits (92), Expect = 7.2,   Method: Compositional matrix adjust.
 Identities = 35/127 (27%), Positives = 57/127 (44%), Gaps = 15/127 (11%)

Query: 896  QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 955
            ++   I K   + L  Q K     K+C+V+DLD TL++S+      PV++       E D
Sbjct: 70   EENGTISKVPAKPLLPQIKSKDVGKICVVIDLDETLVHSS----FKPVNNADFIIPVEID 125

Query: 956  REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
                  ++ + PH+            FL+R  +LFE  L+T     YA  ++ +LD  G 
Sbjct: 126  GTVHQVYVLKRPHVD----------EFLKRMGELFECVLFTASLAKYADPVSDLLDKWGA 175

Query: 1016 LFAGRVI 1022
             F  R+ 
Sbjct: 176  -FRSRLF 181


>gi|355681366|gb|AER96785.1| CTD small phosphatase 1 [Mustela putorius furo]
          Length = 260

 Score = 40.0 bits (92), Expect = 7.3,   Method: Compositional matrix adjust.
 Identities = 30/99 (30%), Positives = 47/99 (47%), Gaps = 14/99 (14%)

Query: 920  KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 979
            K+C+V+DLD TL++S+      PV++       E D      ++ + PH+          
Sbjct: 90   KICVVIDLDETLVHSS----FKPVNNADFIIPVEIDGVVHQVYVLKRPHVD--------- 136

Query: 980  WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA 1018
              FL+R  +LFE  L+T     YA  +A +LD  G   A
Sbjct: 137  -EFLQRMGELFECVLFTASLAKYADPVADLLDKWGAFRA 174


>gi|126337836|ref|XP_001365381.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
            small phosphatase 1-like [Monodelphis domestica]
          Length = 346

 Score = 40.0 bits (92), Expect = 7.3,   Method: Compositional matrix adjust.
 Identities = 32/119 (26%), Positives = 53/119 (44%), Gaps = 14/119 (11%)

Query: 896  QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 955
            ++   + K   + L  + K     K+C+V+DLD TL++S+      PV +       E D
Sbjct: 151  EENGTVPKAPVKYLLPEAKAQDLGKICVVIDLDETLVHSS----FKPVSNADFIIPVEID 206

Query: 956  REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKG 1014
                  ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  G
Sbjct: 207  GMVHQVYVLKRPHVD----------EFLQRMGELFECVLFTASLAKYADPVADLLDKWG 255


>gi|348518153|ref|XP_003446596.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
            small phosphatase 1-like [Oreochromis niloticus]
          Length = 262

 Score = 40.0 bits (92), Expect = 7.4,   Method: Compositional matrix adjust.
 Identities = 31/121 (25%), Positives = 55/121 (45%), Gaps = 14/121 (11%)

Query: 896  QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 955
            ++   + K + + L    K   + K+C+V+DLD TL++S+      PV++       E D
Sbjct: 67   EENGTVSKIQAKPLLPPVKSKDSGKICVVIDLDETLVHSS----FKPVNNADFIIPVEID 122

Query: 956  REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
                  ++ + PH+            FL+R  +LFE  L+T     YA  ++ +LD  G 
Sbjct: 123  GTIHQVYVLKRPHVD----------EFLKRMGELFECVLFTASLAKYADPVSDLLDKWGA 172

Query: 1016 L 1016
             
Sbjct: 173  F 173


>gi|340723842|ref|XP_003400297.1| PREDICTED: CTD nuclear envelope phosphatase 1 homolog [Bombus
            terrestris]
          Length = 286

 Score = 40.0 bits (92), Expect = 7.5,   Method: Compositional matrix adjust.
 Identities = 44/152 (28%), Positives = 68/152 (44%), Gaps = 21/152 (13%)

Query: 923  LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD---REKPHRHLFRFPHMGMWTKLRPGI 979
            LVLDLD TL++S      D V    +R     D   + K  RH  RF     +   RP +
Sbjct: 106  LVLDLDETLIHSHH----DGVARPTVRFGTPPDFILKVKIDRHPVRF-----FVHKRPHV 156

Query: 980  WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPK 1039
              FL+  S+ +E+ ++T   ++Y   +A+ LD        R I R          E    
Sbjct: 157  DFFLDIVSQWYELVVFTASMEIYGAAVAEKLD------NNRGILRRRYYRQHCTPEMGSY 210

Query: 1040 SKDLEGVLGMESAVVIIDDS---VRVWPHNKL 1068
            +KDL  +    ++V I+D+S    R +PHN +
Sbjct: 211  TKDLSAICSDLASVFILDNSPGAYRAYPHNAI 242


>gi|281340231|gb|EFB15815.1| hypothetical protein PANDA_001554 [Ailuropoda melanoleuca]
          Length = 243

 Score = 40.0 bits (92), Expect = 7.5,   Method: Compositional matrix adjust.
 Identities = 31/103 (30%), Positives = 49/103 (47%), Gaps = 15/103 (14%)

Query: 920  KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 979
            K+C+V+DLD TL++S+      PV++       E D      ++ + PH+          
Sbjct: 72   KICVVIDLDETLVHSS----FKPVNNADFIIPVEIDGVVHQVYVLKRPHVD--------- 118

Query: 980  WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVI 1022
              FL+R  +LFE  L+T     YA  +A +LD  G  F  R+ 
Sbjct: 119  -EFLQRMGELFECVLFTASLAKYADPVADLLDKWGA-FRARLF 159


>gi|281353948|gb|EFB29532.1| hypothetical protein PANDA_002599 [Ailuropoda melanoleuca]
          Length = 250

 Score = 40.0 bits (92), Expect = 8.1,   Method: Compositional matrix adjust.
 Identities = 44/146 (30%), Positives = 64/146 (43%), Gaps = 33/146 (22%)

Query: 920  KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 979
            K C+V+DLD TL++S+      P+ +       E D      ++ + PH+          
Sbjct: 80   KKCVVIDLDETLVHSS----FKPISNADFIVPVEIDGTIHQVYVLKRPHVD--------- 126

Query: 980  WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA-----GRVISRGDDGDPFDGD 1034
              FL+R  +LFE  L+T     YA  +A +LD  GV  A       V  RG+        
Sbjct: 127  -EFLQRMGQLFECVLFTASLAKYADPVADLLDRWGVFRARLFRESCVFHRGN-------- 177

Query: 1035 ERVPKSKDLEGVLGME-SAVVIIDDS 1059
                  KDL   LG E S V+I+D+S
Sbjct: 178  ----YVKDLSR-LGRELSKVIIVDNS 198


>gi|395527645|ref|XP_003765953.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
            small phosphatase 1 isoform 1 [Sarcophilus harrisii]
          Length = 257

 Score = 40.0 bits (92), Expect = 8.3,   Method: Compositional matrix adjust.
 Identities = 29/95 (30%), Positives = 46/95 (48%), Gaps = 14/95 (14%)

Query: 920  KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 979
            K+C+V+DLD TL++S+      PV++       E D      ++ + PH+          
Sbjct: 86   KICVVIDLDETLVHSS----FKPVNNADFIIPVEIDGMVHQVYVLKRPHVD--------- 132

Query: 980  WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKG 1014
              FL+R  +LFE  L+T     YA  +A +LD  G
Sbjct: 133  -EFLQRMGELFECVLFTASLAKYADPVADLLDKWG 166


>gi|224000223|ref|XP_002289784.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220974992|gb|EED93321.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 179

 Score = 40.0 bits (92), Expect = 8.3,   Method: Composition-based stats.
 Identities = 32/97 (32%), Positives = 49/97 (50%), Gaps = 14/97 (14%)

Query: 920  KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 979
            K CLVLDLD TL++S+ F  V P  D ++  + E         +  F    ++   RPG+
Sbjct: 12   KKCLVLDLDETLVHSS-FRAV-PGADFVIPVQIED--------VVHF----VYVAKRPGV 57

Query: 980  WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 1016
              FL   +K +E+ +YT     YA  +  +LDP  V+
Sbjct: 58   DEFLTEMAKHYEIVVYTASLNKYADPLLDLLDPNRVI 94


>gi|399215866|emb|CCF72554.1| unnamed protein product [Babesia microti strain RI]
          Length = 248

 Score = 39.7 bits (91), Expect = 8.6,   Method: Composition-based stats.
 Identities = 31/123 (25%), Positives = 56/123 (45%), Gaps = 15/123 (12%)

Query: 895  DQQKAAIQKERTRRLEEQKKMFSARK-LCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEE 953
            +  +   Q +  + L  +K + S +K   LVLDLD TL++S    + +      ++   E
Sbjct: 30   EANRPTFQTQLKKFLTSEKPVTSGKKKFTLVLDLDETLIHSEFVTDGNHSFSTTIKNDTE 89

Query: 954  QDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 1013
                             ++   RP    FLE+ +KLFE+ ++T G++ YA  +  +LD  
Sbjct: 90   NQT--------------IYVYKRPYADEFLEQVAKLFEVVIFTAGSEPYAKAVIDILDKN 135

Query: 1014 GVL 1016
             V+
Sbjct: 136  KVV 138


>gi|355565181|gb|EHH21670.1| hypothetical protein EGK_04793 [Macaca mulatta]
          Length = 270

 Score = 39.7 bits (91), Expect = 8.6,   Method: Compositional matrix adjust.
 Identities = 31/103 (30%), Positives = 49/103 (47%), Gaps = 15/103 (14%)

Query: 920  KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 979
            K+C+V+DLD TL++S+      PV++       E D      ++ + PH+          
Sbjct: 99   KICVVIDLDETLVHSS----FKPVNNADFIIPVEIDGVVHQVYVLKRPHVD--------- 145

Query: 980  WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVI 1022
              FL+R  +LFE  L+T     YA  +A +LD  G  F  R+ 
Sbjct: 146  -EFLQRMGELFECVLFTASLAKYADPVADLLDKWGA-FRARLF 186


>gi|145529824|ref|XP_001450695.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124418317|emb|CAK83298.1| unnamed protein product [Paramecium tetraurelia]
          Length = 308

 Score = 39.7 bits (91), Expect = 8.6,   Method: Composition-based stats.
 Identities = 27/95 (28%), Positives = 48/95 (50%), Gaps = 15/95 (15%)

Query: 918  ARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRP 977
            +RK+C VLDLD TL++S              + K E D +     + +     ++  +RP
Sbjct: 54   SRKVC-VLDLDETLVHS--------------QFKAENDHDFSLDIIVQSQLFKVYVTVRP 98

Query: 978  GIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDP 1012
            G+  F++  S+ FE+ ++T   K YA  +  ++DP
Sbjct: 99   GVENFIDTLSEYFEVIMWTASLKEYADPVMDIIDP 133


>gi|318037543|ref|NP_001188083.1| carboxy-terminal domain RNA polymerase II polypeptide a small
            phohatase 1 [Ictalurus punctatus]
 gi|308323757|gb|ADO29014.1| carboxy-terminal domain RNA polymerase II polypeptide a small
            phohatase 1 [Ictalurus punctatus]
          Length = 264

 Score = 39.7 bits (91), Expect = 8.8,   Method: Compositional matrix adjust.
 Identities = 33/121 (27%), Positives = 54/121 (44%), Gaps = 14/121 (11%)

Query: 896  QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 955
            ++   I K   + L  Q K     K+C+V+DLD TL++S+      PV++       E D
Sbjct: 69   EENGTISKVPAKPLLPQIKSKDVGKICVVIDLDETLVHSS----FKPVNNADFIIPVEID 124

Query: 956  REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
                  ++ + PH+            FL+R  +LFE  L+T     YA  ++ +LD  G 
Sbjct: 125  GAVHQVYVLKRPHVD----------EFLKRMGELFERVLFTASLAKYADPVSDLLDKWGA 174

Query: 1016 L 1016
             
Sbjct: 175  F 175


>gi|301755758|ref|XP_002913748.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
            small phosphatase 1-like, partial [Ailuropoda
            melanoleuca]
          Length = 252

 Score = 39.7 bits (91), Expect = 8.8,   Method: Compositional matrix adjust.
 Identities = 30/99 (30%), Positives = 47/99 (47%), Gaps = 14/99 (14%)

Query: 920  KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 979
            K+C+V+DLD TL++S+      PV++       E D      ++ + PH+          
Sbjct: 81   KICVVIDLDETLVHSS----FKPVNNADFIIPVEIDGVVHQVYVLKRPHVD--------- 127

Query: 980  WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA 1018
              FL+R  +LFE  L+T     YA  +A +LD  G   A
Sbjct: 128  -EFLQRMGELFECVLFTASLAKYADPVADLLDKWGAFRA 165


>gi|393247111|gb|EJD54619.1| NLI interacting factor [Auricularia delicata TFB-10046 SS5]
          Length = 182

 Score = 39.7 bits (91), Expect = 9.1,   Method: Compositional matrix adjust.
 Identities = 30/95 (31%), Positives = 48/95 (50%), Gaps = 15/95 (15%)

Query: 917  SARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLR 976
            + RK CLVLDLD TL++S+   ++ P  D I+    E                 ++   R
Sbjct: 12   TGRK-CLVLDLDETLVHSS--FKMIPQADYIIPVLIEHQLHN------------VYVVKR 56

Query: 977  PGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLD 1011
            PG+ TFLE+  +L+E+ ++T    +YA  +   LD
Sbjct: 57   PGVDTFLEKMGELYEVVVFTASLSMYADPVLDKLD 91


>gi|47221014|emb|CAF98243.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 258

 Score = 39.7 bits (91), Expect = 9.2,   Method: Compositional matrix adjust.
 Identities = 31/121 (25%), Positives = 55/121 (45%), Gaps = 14/121 (11%)

Query: 896  QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 955
            ++   + K + + L    K   + K+C+V+DLD TL++S+      PV++       E D
Sbjct: 67   EENGTVSKVQVKPLLPPVKSKDSGKICVVIDLDETLVHSS----FKPVNNADFIIPVEID 122

Query: 956  REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 1015
                  ++ + PH+            FL+R  +LFE  L+T     YA  ++ +LD  G 
Sbjct: 123  GTVHQVYVLKRPHVD----------EFLKRMGELFECVLFTASLAKYADPVSDLLDKWGA 172

Query: 1016 L 1016
             
Sbjct: 173  F 173


>gi|302806326|ref|XP_002984913.1| hypothetical protein SELMODRAFT_5868 [Selaginella moellendorffii]
 gi|300147499|gb|EFJ14163.1| hypothetical protein SELMODRAFT_5868 [Selaginella moellendorffii]
          Length = 173

 Score = 39.7 bits (91), Expect = 9.5,   Method: Composition-based stats.
 Identities = 31/101 (30%), Positives = 46/101 (45%), Gaps = 3/101 (2%)

Query: 976  RPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDE 1035
            RPG+ TFL   S+++E+ ++T   K YA  +   LDP G LF  R+           G +
Sbjct: 41   RPGVDTFLNEMSQIYEIVVFTRAVKPYADRILDRLDPAGNLFTHRLYRDSCSPKEVGGRK 100

Query: 1036 RVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERY 1076
             V   KDL  +       VI+DD    +     N IV+  +
Sbjct: 101  VV---KDLSRLGRDLRHTVIVDDKPESFCLQPSNGIVIRAF 138


>gi|229367296|gb|ACQ58628.1| Carboxy-terminal domain RNA polymerase II polypeptide A small
            phosphatase 1 [Anoplopoma fimbria]
          Length = 262

 Score = 39.7 bits (91), Expect = 9.6,   Method: Compositional matrix adjust.
 Identities = 31/105 (29%), Positives = 50/105 (47%), Gaps = 15/105 (14%)

Query: 918  ARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRP 977
            A K+C+V+DLD TL++S+      PV++       E D      ++ + PH+        
Sbjct: 89   AGKICVVIDLDETLVHSS----FKPVNNADFIIPVEIDGTVHQVYVLKRPHVD------- 137

Query: 978  GIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVI 1022
                FL+R  +LFE  L+T     YA  ++ +LD  G  F  R+ 
Sbjct: 138  ---EFLKRMGELFECVLFTASLSKYADPVSDLLDKWGA-FRSRLF 178


>gi|297794689|ref|XP_002865229.1| NLI interacting factor family protein [Arabidopsis lyrata subsp.
            lyrata]
 gi|297311064|gb|EFH41488.1| NLI interacting factor family protein [Arabidopsis lyrata subsp.
            lyrata]
          Length = 272

 Score = 39.7 bits (91), Expect = 9.8,   Method: Composition-based stats.
 Identities = 44/153 (28%), Positives = 70/153 (45%), Gaps = 25/153 (16%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
            E   K F   K  +VLDLD TL++S+      P +D ++  K +               +
Sbjct: 87   ERSDKSFDETKKTIVLDLDETLVHSSMEKPEVP-YDFVVNPKIDGQI------------L 133

Query: 970  GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG---D 1026
              +   RPG+  FL++  + +++ ++T G + YA+ +   LDP+      RVISR    D
Sbjct: 134  TFFVIKRPGVDEFLKKIGEKYQIVVFTAGLREYASLVLDKLDPE-----RRVISRSFYRD 188

Query: 1027 DGDPFDGDERVPKSKDLEGVLGMESAVVIIDDS 1059
                 DG       KDL  V+     VVI+DD+
Sbjct: 189  ACSEIDGR----LVKDLGFVMRDLRRVVIVDDN 217


>gi|15242476|ref|NP_199382.1| haloacid dehalogenase-like hydrolase domain-containing protein
            [Arabidopsis thaliana]
 gi|9758673|dbj|BAB09212.1| unnamed protein product [Arabidopsis thaliana]
 gi|332007902|gb|AED95285.1| haloacid dehalogenase-like hydrolase domain-containing protein
            [Arabidopsis thaliana]
          Length = 272

 Score = 39.7 bits (91), Expect = 9.9,   Method: Composition-based stats.
 Identities = 44/153 (28%), Positives = 70/153 (45%), Gaps = 25/153 (16%)

Query: 910  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 969
            E   K F   K  +VLDLD TL++S+      P +D ++  K +               +
Sbjct: 87   ERSGKSFDETKKTIVLDLDETLVHSSMEKPEVP-YDFVVNPKIDGQI------------L 133

Query: 970  GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG---D 1026
              +   RPG+  FL++  + +++ ++T G + YA+ +   LDP+      RVISR    D
Sbjct: 134  TFFVIKRPGVDEFLKKIGEKYQIVVFTAGLREYASLVLDKLDPE-----RRVISRSFYRD 188

Query: 1027 DGDPFDGDERVPKSKDLEGVLGMESAVVIIDDS 1059
                 DG       KDL  V+     VVI+DD+
Sbjct: 189  ACSEIDGR----LVKDLGFVMRDLRRVVIVDDN 217


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.311    0.130    0.368 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 18,348,977,898
Number of Sequences: 23463169
Number of extensions: 849874707
Number of successful extensions: 2496284
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 706
Number of HSP's successfully gapped in prelim test: 1656
Number of HSP's that attempted gapping in prelim test: 2484964
Number of HSP's gapped (non-prelim): 10117
length of query: 1118
length of database: 8,064,228,071
effective HSP length: 154
effective length of query: 964
effective length of database: 8,745,867,341
effective search space: 8431016116724
effective search space used: 8431016116724
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 83 (36.6 bits)