BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 001926
         (995 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255543174|ref|XP_002512650.1| RNA polymerase II ctd phosphatase, putative [Ricinus communis]
 gi|223548611|gb|EEF50102.1| RNA polymerase II ctd phosphatase, putative [Ricinus communis]
          Length = 1195

 Score = 1140 bits (2949), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 614/982 (62%), Positives = 721/982 (73%), Gaps = 70/982 (7%)

Query: 49   DKEKDMLAMHGVNGKDSNIVTENAVNDLNFKEKVPLPVDSLMQNKP---LEASKPGPPGY 105
            +KEK+ L    VN KD+++  +++ +D++   K  LP DS + NK    +E  K G   +
Sbjct: 249  EKEKEPLISTVVNKKDNDVNGKSSGHDMSAVNK--LPTDSFVNNKANLSIEGPKTGVSSF 306

Query: 106  RSRGVLLPLLDPHKVHDVDSLPSPTRETTPSVPVQRALVVGDGMVKSWAAAAKLSHNAEV 165
            +SR  LLPLLD HK HD DSLPSPTRE+   +P  R L     MV         + N+ +
Sbjct: 307  KSRAALLPLLDLHKDHDADSLPSPTRESALPLPAYRVLT--PKMVLD-------TGNSRM 357

Query: 166  HKTPHYETDALRAFSSYQQKFGRNSFFMNSELPSPTPSEESGDGDGDTGGEISSATAVDQ 225
            H    YETDAL+A SSYQQKF ++SF +   LPSPTPSEESG+GDGDTGGE+SS+ +V  
Sbjct: 358  HP---YETDALKAVSSYQQKFSKSSFALTDRLPSPTPSEESGNGDGDTGGEVSSSLSVSS 414

Query: 226  PKPVNMPTLGQQPVSSQPMDISQP-MDISSVQALTTANNSAPASSGYNPVVKPNPVVKAP 284
             +P N  T GQ   S     IS P MD SS+  + +  ++  ASS       P+  VKA 
Sbjct: 415  FRPANPLTSGQSNAS-----ISLPRMDGSSLPGVISIKSAVRASSA------PSLTVKAS 463

Query: 285  IKSRDPRLRF--ASSNALNLNHQPAPILHNAPKVEPVGRVMSSRKQKTVEEPVLDGPALK 342
             KSRDPRLRF  + SNAL+ NH+  P++ N  KVEP+G  M+ ++QK V++P+ DG +LK
Sbjct: 464  AKSRDPRLRFVNSDSNALDQNHRAVPVV-NTLKVEPIGGTMNKKRQKIVDDPIPDGHSLK 522

Query: 343  RQRNGFENSGVVRDEKNIYGSGGWLEDTDMFEPQIMNRNLLVDSAESNSRKLDNGATSPI 402
            RQ+N  ENSGVVRD K + GSGGWLEDTDM  PQ MN+N LVD+AES+ R+ D G     
Sbjct: 523  RQKNALENSGVVRDVKTMVGSGGWLEDTDMVGPQTMNKNQLVDNAESDPRRKDGGGVCTS 582

Query: 403  TSGTPNVVVSGNEPAPATTPS------------TTVSLPALLKDIAVNPTMLLNILKMGQ 450
            +S   +V +SG E  P T  S            +T ++P LLK+IAVNPTML+NILKMGQ
Sbjct: 583  SSCISSVNISGTEQIPVTGTSVPIGGELVPVKGSTAAIPDLLKNIAVNPTMLINILKMGQ 642

Query: 451  QQKLAADAQQKSNDSSMNTMHPPIPSS-IPPVSVTCSIPSGILSKPM------------D 497
            QQ+LA +AQQK  D + +T +P   +S +  V V  +  SGIL +P             D
Sbjct: 643  QQRLALEAQQKPVDPAKSTTYPLNSNSMLGTVPVVGAAHSGILPRPAGTVQVSPQLGTAD 702

Query: 498  ELGKVRMKPRDPRRVLHGNALQRSGSLGPE-FKTDGPSAPCTQGSKENLNFQKQLGAPEA 556
            +LGK+RMKPRDPRRVLH NALQR+GS+G E  KT+  S P  Q +K+N N QKQ G  E 
Sbjct: 703  DLGKIRMKPRDPRRVLHNNALQRNGSMGSEHLKTNLTSIPINQETKDNQNLQKQEGQVEK 762

Query: 557  KPVLSQSVLQPDITQQFTKNLKHIADFMSVSQPLTSEPMVSQNSPIQPGQIKSGADMKAV 616
            KPV  QS+  PDI+  FTKNLK+IAD +SVS   TS+P+V QN   QP        M+  
Sbjct: 763  KPVPLQSLALPDISMPFTKNLKNIADIVSVSHASTSQPLVPQNPASQP--------MRTT 814

Query: 617  VTNHDDKQTGTGSGPEAGPVGA---HPQSAWGDVEHLFEGYDDQQKAAIQKERTRRLEEQ 673
            +++ D +  G GS P A    A     Q+AWGDVEHLFEGY+DQQKAAIQ+ER RR+EEQ
Sbjct: 815  ISSSD-QFLGIGSAPGAAAAAAAGPRTQNAWGDVEHLFEGYNDQQKAAIQRERARRIEEQ 873

Query: 674  KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMW 733
            KK+FSARKLCLVLDLDHTLLNSAKF EVDPVHDEILRKKEEQDREK HRHLFRFPHMGMW
Sbjct: 874  KKLFSARKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKAHRHLFRFPHMGMW 933

Query: 734  TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFD 793
            TKLRPGIW FLE+ASKL+E+HLYTMGNKLYATEMAKVLDP GVLF GRVISRGDDG+PFD
Sbjct: 934  TKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPTGVLFNGRVISRGDDGEPFD 993

Query: 794  GDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPS 853
            GDER+PKSKDLEGVLGMES VVI+DDSVRVWPHNKLNLIVVERY YFPCSRRQFGL GPS
Sbjct: 994  GDERIPKSKDLEGVLGMESGVVIMDDSVRVWPHNKLNLIVVERYIYFPCSRRQFGLPGPS 1053

Query: 854  LLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQRKILAGCRIVFS 913
            LLEIDHDER EDGTLA SL VIER+H+ FF+H SLD+ DVRNILA+EQRKILAGCRIVFS
Sbjct: 1054 LLEIDHDERPEDGTLACSLAVIERIHQNFFTHPSLDEADVRNILASEQRKILAGCRIVFS 1113

Query: 914  RVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHP 973
            RVFPVGEANPHLHPLWQTAEQFGAVCT  ID+QVTHVVANSLGTDKVNWALSTGRFVV+P
Sbjct: 1114 RVFPVGEANPHLHPLWQTAEQFGAVCTNQIDEQVTHVVANSLGTDKVNWALSTGRFVVYP 1173

Query: 974  GWVEASALLYRRANEQDFAIKP 995
            GWVEASALLYRRANEQDFAIKP
Sbjct: 1174 GWVEASALLYRRANEQDFAIKP 1195


>gi|224053553|ref|XP_002297869.1| predicted protein [Populus trichocarpa]
 gi|222845127|gb|EEE82674.1| predicted protein [Populus trichocarpa]
          Length = 1117

 Score = 1138 bits (2943), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 631/1021 (61%), Positives = 736/1021 (72%), Gaps = 72/1021 (7%)

Query: 1    MNHVLKEQNKEILSRLLSLIKSHEPPLFSSNQIKEMEAMLSSLVTRANDKEKDMLAMHGV 60
            MN  LKEQNK + SR  SL+ SH PP FS  Q KE+           N+   D LA    
Sbjct: 143  MNKKLKEQNKGVFSRFFSLLNSHYPPFFSPGQNKEV----------LNENHNDSLA---- 188

Query: 61   NGKDSNIVTENAVNDLN-FKEKVPLPVDSLMQNKP---LEASKP-GPPGYRSRGVLLPLL 115
                     + A  DL    EK+P   ++ +QNKP   +EA KP G P ++SRGVLLPLL
Sbjct: 189  ---------KTAGYDLTTMSEKLP-AAETFVQNKPNKSIEAPKPPGVPSFKSRGVLLPLL 238

Query: 116  DPHKVHDVDSLPSPTRETTPSVPVQRALVVGDGMVKSWAAAAKLSHNAEVHKTPHYETDA 175
            D  K HD DSLPSPT+ETTP  PVQR L +GDGMV S     K++  AE  +   YETDA
Sbjct: 239  DLKKYHDEDSLPSPTQETTP-FPVQRLLAIGDGMVSSGLPVPKVTPVAEEPRMHPYETDA 297

Query: 176  LRAFSSYQQKFGRNSFFMNSELPSPTPSEESGDGDGDTGGEISSATAVDQPKPVNMPTLG 235
            L+A SSYQQKF RNSFF N ELPSPTPSEESG+GDGDT GE+SS++ V   + VN P   
Sbjct: 298  LKAVSSYQQKFNRNSFFTN-ELPSPTPSEESGNGDGDTAGEVSSSSTVVNYRTVNPPVSD 356

Query: 236  QQPVSSQPMDISQPM---DISSVQALTTANNSAPASSGYNPVVKPNPVVKAPIKSRDPRL 292
            Q+     P  +  P    D S+++ +    NSAP SSG      P+  +KA  KSRDPRL
Sbjct: 357  QKNAPPSPPPLPPPPPHPDSSNIRGVVPTRNSAPVSSG------PSSTIKASAKSRDPRL 410

Query: 293  RFASSNALNLNH--QPAPILHNAPKVEPVGRVMSSRKQKTVEEPVLDGPALKRQRNGFEN 350
            R+ + +A  L+H  +  P+++N P+VEP G ++ S+K K +EE VLD P+LKRQRN F+N
Sbjct: 411  RYVNIDACALDHNQRALPMVNNLPRVEPAGAIVGSKKHK-IEEDVLDDPSLKRQRNSFDN 469

Query: 351  SGVVRDEKNIYGSGGWLEDTDMFEPQIMNRNLLVDSAESNSRKLDNGATSPITSGTPNVV 410
             G VRD +++ G+GGWLEDTDM EPQ +N+N   +++  N       A SP   G  N+ 
Sbjct: 470  YGAVRDIESMTGTGGWLEDTDMAEPQTVNKNQWAENSNVNG---SGNAQSPFM-GISNI- 524

Query: 411  VSGNEPAPATTPSTTVSLPALLKDIAVNPTMLLNILKMGQQQKLAADAQQKSNDSSMNTM 470
             +G+E A  T+ +TT SLP LLKDIAVNPTML+NILKMGQQQ+LA D QQ  +D + +T 
Sbjct: 525  -TGSEQAQVTSTATT-SLPDLLKDIAVNPTMLINILKMGQQQRLALDGQQTLSDPAKSTS 582

Query: 471  HPPIPSS----IPPVSVTCSIPSGILSKPM-----------DELGKVRMKPRDPRRVLHG 515
            HPPI ++    IP V+V  S PSGI  +P            DE GK+RMKPRDPRR LH 
Sbjct: 583  HPPISNTVLGAIPTVNVASSQPSGIFPRPAGTPVPSQIATSDESGKIRMKPRDPRRFLHN 642

Query: 516  NALQRSGSLGPEFKTDGPSAPCTQGSKENLNFQKQLGAPEAKPVLSQSVLQPDITQQFTK 575
            N+LQR+GS+G E        P TQG+K++ N QKQ G  E KP +      PDI+  FTK
Sbjct: 643  NSLQRAGSMGSEQFKTTTLTPTTQGTKDDQNVQKQEGLAELKPTVP-----PDISFPFTK 697

Query: 576  NLKHIADFMSVSQPLTSEPMVSQNSPIQPGQIKSG-ADMKAVVTNHDDKQTGTGSGPEAG 634
            +L++IAD +SVSQ  T+ P +SQN   QP Q KS   D K  ++  D K TG  S PE  
Sbjct: 698  SLENIADILSVSQASTTPPFISQNVASQPMQTKSERVDGKTGISISDQK-TGPASSPEVV 756

Query: 635  PVGAHPQSAWGDVEHLFEGYDDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLN 694
               +H Q+ W DVEHLFEGYDDQQKAAIQ+ER RRLEEQKKMF+ARKLCLVLDLDHTLLN
Sbjct: 757  AASSHSQNTWKDVEHLFEGYDDQQKAAIQRERARRLEEQKKMFAARKLCLVLDLDHTLLN 816

Query: 695  SAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMH 754
            SAK      +HDEILRKKEEQDREKP+RH+FR PHMGMWTKLRPGIW FLE+ASKLFE+H
Sbjct: 817  SAKAILSSSLHDEILRKKEEQDREKPYRHIFRIPHMGMWTKLRPGIWNFLEKASKLFELH 876

Query: 755  LYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAV 814
            LYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMES V
Sbjct: 877  LYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESGV 936

Query: 815  VIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGV 874
            VIIDDSVRVWPHNKLNLIVVERY YFPCSRRQFGL GPSLLEIDHDER EDGTLA S  V
Sbjct: 937  VIIDDSVRVWPHNKLNLIVVERYIYFPCSRRQFGLPGPSLLEIDHDERPEDGTLACSFAV 996

Query: 875  IERLHKIFFSHQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQ 934
            IE++H+ FF+H+SLD+ DVRNILA+EQRKIL GCRI+FSRVFPVGE NPHLHPLWQ AEQ
Sbjct: 997  IEKIHQNFFTHRSLDEADVRNILASEQRKILGGCRILFSRVFPVGEVNPHLHPLWQMAEQ 1056

Query: 935  FGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAIK 994
            FGAVCT  ID+QVTHVVANSLGTDKVNWALSTGR VVHPGWVEASALLYRRANEQDF+IK
Sbjct: 1057 FGAVCTNQIDEQVTHVVANSLGTDKVNWALSTGRIVVHPGWVEASALLYRRANEQDFSIK 1116

Query: 995  P 995
            P
Sbjct: 1117 P 1117


>gi|359473774|ref|XP_002266931.2| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            3-like [Vitis vinifera]
          Length = 1238

 Score = 1124 bits (2906), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 617/1025 (60%), Positives = 718/1025 (70%), Gaps = 73/1025 (7%)

Query: 1    MNHVLKEQNKEILSRLLSLIKSHEPPLFSSNQIKEMEAMLSSLVT-------RANDKEKD 53
            MN   KE NK++ SRLLS ++  + P+FS   IKE+E M+S L T        A+DK  D
Sbjct: 257  MNSNQKELNKDVFSRLLSCVECGDSPIFSIQHIKEVEVMMSFLDTPAAQSSAEASDKVND 316

Query: 54   MLAMHGVNGKDSNIVTENAVNDLNFKEKVPL---PVDSLMQNKPLEASKPGPPGYRSRGV 110
            +    G+N    +   E++       +K+ L    V+S  QN P +A KPG    R R +
Sbjct: 317  VQVTDGMNRNILDSSVESSGRAFASAKKLSLDSISVESYNQNNP-DALKPGLSSSRGRFI 375

Query: 111  LLPLLDPHKVHDVDSLPSPTRETTPSVPVQRALVVGDGMVKSWAAAAKLSHNAEVHKTPH 170
              PLLD HK HD DSLPSPT +     PV ++ +V           AK++H  +      
Sbjct: 376  FGPLLDLHKDHDEDSLPSPTGKAPQCFPVNKSELV----------TAKVAHETQDSIMHP 425

Query: 171  YETDALRAFSSYQQKFGRNSFFMNSELPSPTPSEESGDGDGDTGGEISSATAVDQPKPVN 230
            YETDAL+A S+YQQKFG  SF    +LPSPTPSEESGD  GD  GE+SS++ +  P   N
Sbjct: 426  YETDALKAVSTYQQKFGLTSFLPIDKLPSPTPSEESGDTYGDISGEVSSSSTISAPITAN 485

Query: 231  MPTLGQQPVSSQPMDISQPMDISSVQALTTANNSAPASSGYNPVVKPNPVVKAPIKSRDP 290
             P LG   VSS P      MD S VQ  T   N++  SSG  P +  + V  A  KSRDP
Sbjct: 486  APALGHPIVSSAPQ-----MDSSIVQGPTVGRNTSLVSSG--PHLDSSVVASA--KSRDP 536

Query: 291  RLRFASSNA--LNLNHQPAPILHNAPKVEPVGRVMSSRKQKTVEEPVLDGPALKRQRNGF 348
            RLR ASS+A  L+LN +P P + N+PKV+P+G ++SSRKQK+ EEP+LDGP  KRQRNG 
Sbjct: 537  RLRLASSDAGSLDLNERPLPAVSNSPKVDPLGEIVSSRKQKSAEEPLLDGPVTKRQRNGL 596

Query: 349  ENSGVVRDEKNIYGSGGWLEDTDMFEPQIMNRNLLVDSAESNSRKLDNGAT-SPITSGTP 407
             +   VRD + +  SGGWLED++   PQ+MNRN L+++  ++ +KL++  T + I    P
Sbjct: 597  TSPATVRDAQTVVASGGWLEDSNTVIPQMMNRNQLIENTGTDPKKLESKVTVTGIGCDKP 656

Query: 408  NVVVSGNEPAPATTPSTTVSLPALLKDIAVNPTMLLNILKMGQQQKLAADAQQKSNDSSM 467
             V V+GNE  P    STT SL +LLKDIAVNP + +NI    +QQK        S D + 
Sbjct: 657  YVTVNGNEHLPVVATSTTASLQSLLKDIAVNPAVWMNIFNKVEQQK--------SGDPAK 708

Query: 468  NTMHPPIPSSI----PPVSVTCSIPSGILSKP-----------MDELGKVRMKPRDPRRV 512
            NT+ PP  +SI    PP SV    PS +  KP           MDE GKVRMKPRDPRR+
Sbjct: 709  NTVLPPTSNSILGVVPPASVAPLKPSALGQKPAGALQVPQTGPMDESGKVRMKPRDPRRI 768

Query: 513  LHGNALQRSGSLGPE-FKTDGPSAPCTQGSKENLNFQKQLGAPEAKPVLSQSVLQPDITQ 571
            LH N+ QRSGS G E FKT               N QKQ    E K V S SV  PDI+Q
Sbjct: 769  LHANSFQRSGSSGSEQFKT---------------NAQKQEDQTETKSVPSHSVNPPDISQ 813

Query: 572  QFTKNLKHIADFMSVSQPLTSEPMVSQNSPIQPGQIKSG-ADMKAVVTNHDDKQTGTGSG 630
            QFTKNLK+IAD MS SQ  +  P   Q    Q  Q+ +   D+KA V++  D+ T  GS 
Sbjct: 814  QFTKNLKNIADLMSASQASSMTPTFPQILSSQSVQVNTDRMDVKATVSDSGDQLTANGSK 873

Query: 631  PEAGPVGAHPQSAWGDVEHLFEGYDDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDH 690
            PE+       ++ WGDVEHLF+GYDDQQKAAIQ+ER RR+EEQKKMFSARKLCLVLDLDH
Sbjct: 874  PESAAGPPQSKNTWGDVEHLFDGYDDQQKAAIQRERARRIEEQKKMFSARKLCLVLDLDH 933

Query: 691  TLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKL 750
            TLLNSAKF EVDPVHDEILRKKEEQDREK  RHLFRFPHMGMWTKLRPGIW FLE+ASKL
Sbjct: 934  TLLNSAKFVEVDPVHDEILRKKEEQDREKSQRHLFRFPHMGMWTKLRPGIWNFLEKASKL 993

Query: 751  FEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM 810
            +E+HLYTMGNKLYATEMAKVLDPKGVLFAGRVIS+GDDGD  DGDERVPKSKDLEGVLGM
Sbjct: 994  YELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISKGDDGDVLDGDERVPKSKDLEGVLGM 1053

Query: 811  ESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLAS 870
            ESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGL GPSLLEIDHDER EDGTLAS
Sbjct: 1054 ESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLPGPSLLEIDHDERPEDGTLAS 1113

Query: 871  SLGVIERLHKIFFSHQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQ 930
            SL VIER+H+ FFS+++LD+VDVRNILA+EQRKILAGCRIVFSRVFPVGEANPHLHPLWQ
Sbjct: 1114 SLAVIERIHQSFFSNRALDEVDVRNILASEQRKILAGCRIVFSRVFPVGEANPHLHPLWQ 1173

Query: 931  TAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQD 990
            TAE FGAVCT  ID+QVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQD
Sbjct: 1174 TAESFGAVCTNQIDEQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQD 1233

Query: 991  FAIKP 995
            FAIKP
Sbjct: 1234 FAIKP 1238


>gi|296088169|emb|CBI35661.3| unnamed protein product [Vitis vinifera]
          Length = 1184

 Score = 1043 bits (2697), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 590/1025 (57%), Positives = 682/1025 (66%), Gaps = 138/1025 (13%)

Query: 1    MNHVLKEQNKEILSRLLSLIKSHEPPLFSSNQIKEMEAMLSSLVT-------RANDKEKD 53
            MN   KE NK++ SRLLS ++  + P+FS   IKE+E M+S L T        A+DK  D
Sbjct: 268  MNSNQKELNKDVFSRLLSCVECGDSPIFSIQHIKEVEVMMSFLDTPAAQSSAEASDKVND 327

Query: 54   MLAMHGVNGKDSNIVTENAVNDLNFKEKVPLPVDSLMQNKPLEASKPGPPGYRSRGVLLP 113
            +    G+N    NI+  +              V+S    +   ++K     +R R +  P
Sbjct: 328  VQVTDGMN---RNILDSS--------------VES--SGRAFASAK----KFRGRFIFGP 364

Query: 114  LLDPHKVHDVDSLPSPTRETTPSVPVQRALVVGDGMVKSWAAAAKLSHNAEVHKTPHYET 173
            LLD HK HD DSLPSPT +     PV ++ +V           AK++H  +      YET
Sbjct: 365  LLDLHKDHDEDSLPSPTGKAPQCFPVNKSELV----------TAKVAHETQDSIMHPYET 414

Query: 174  DALRAFSSYQQKFGRNSFFMNSELPSPTPSEESGDGDGDTGGEISSATAVDQPKPVNMPT 233
            DAL+A S+YQQKFG  SF    +LPSPTPSEESGD  GD  GE+SS++ +  P   N P 
Sbjct: 415  DALKAVSTYQQKFGLTSFLPIDKLPSPTPSEESGDTYGDISGEVSSSSTISAPITANAPA 474

Query: 234  LGQQPVSSQPMDISQPMDISSVQALTTANNSAPASSGYNPVVKPNPVVKAPIKSRDPRLR 293
            LG   VSS P      MDI  VQ L    N+   +S +N +++      A  KSRDPRLR
Sbjct: 475  LGHPIVSSAPQ-----MDI--VQGLVVPRNTGAVNSRFNSILR------ASAKSRDPRLR 521

Query: 294  FASSNA--LNLNHQPAPILHNAPKVEPVGRVMSSRKQKTVEEPVLDGPALKRQRNGFENS 351
             ASS+A  L+LN +P P + N+PKV+P+G ++SSRKQK+ EEP+LDGP  KRQRNG    
Sbjct: 522  LASSDAGSLDLNERPLPAVSNSPKVDPLGEIVSSRKQKSAEEPLLDGPVTKRQRNGLT-- 579

Query: 352  GVVRDEKNIYGSGGWLEDTDMFEPQIMNRNLLVDSAESNSRKLDNGAT-SPITSGTPNVV 410
                                                 S + KL++  T + I    P V 
Sbjct: 580  -------------------------------------SPATKLESKVTVTGIGCDKPYVT 602

Query: 411  VSGNEPAPATTPSTTVSLPALLKDIAVNPTMLLNILKMGQQQKLAADAQQKSNDSSMNTM 470
            V+GNE  P    STT SL +LLKDIAVNP + +NI    +QQK        S D + NT+
Sbjct: 603  VNGNEHLPVVATSTTASLQSLLKDIAVNPAVWMNIFNKVEQQK--------SGDPAKNTV 654

Query: 471  HPPIPSSI----PPVSVTCSIPSGILSKP--------------MDELGKVRMKPRDPRRV 512
             PP  +SI    PP SV    PS +  KP               DE GKVRMKPRDPRR+
Sbjct: 655  LPPTSNSILGVVPPASVAPLKPSALGQKPAGALQVPQTGPMNPQDESGKVRMKPRDPRRI 714

Query: 513  LHGNALQRSGSLGPE-FKTDGPSAPCTQGSKENLNFQKQLGAPEAKPVLSQSVLQPDITQ 571
            LH N+ QRSGS G E FKT               N QKQ    E K V S SV  PDI+Q
Sbjct: 715  LHANSFQRSGSSGSEQFKT---------------NAQKQEDQTETKSVPSHSVNPPDISQ 759

Query: 572  QFTKNLKHIADFMSVSQPLTSEPMVSQNSPIQPGQIKSG-ADMKAVVTNHDDKQTGTGSG 630
            QFTKNLK+IAD MS SQ  +  P   Q    Q  Q+ +   D+KA V++  D+ T  GS 
Sbjct: 760  QFTKNLKNIADLMSASQASSMTPTFPQILSSQSVQVNTDRMDVKATVSDSGDQLTANGSK 819

Query: 631  PEAGPVGAHPQSAWGDVEHLFEGYDDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDH 690
            PE+       ++ WGDVEHLF+GYDDQQKAAIQ+ER RR+EEQKKMFSARKLCLVLDLDH
Sbjct: 820  PESAAGPPQSKNTWGDVEHLFDGYDDQQKAAIQRERARRIEEQKKMFSARKLCLVLDLDH 879

Query: 691  TLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKL 750
            TLLNSAKF EVDPVHDEILRKKEEQDREK  RHLFRFPHMGMWTKLRPGIW FLE+ASKL
Sbjct: 880  TLLNSAKFVEVDPVHDEILRKKEEQDREKSQRHLFRFPHMGMWTKLRPGIWNFLEKASKL 939

Query: 751  FEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM 810
            +E+HLYTMGNKLYATEMAKVLDPKGVLFAGRVIS+GDDGD  DGDERVPKSKDLEGVLGM
Sbjct: 940  YELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISKGDDGDVLDGDERVPKSKDLEGVLGM 999

Query: 811  ESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLAS 870
            ESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGL GPSLLEIDHDER EDGTLAS
Sbjct: 1000 ESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLPGPSLLEIDHDERPEDGTLAS 1059

Query: 871  SLGVIERLHKIFFSHQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQ 930
            SL VIER+H+ FFS+++LD+VDVRNILA+EQRKILAGCRIVFSRVFPVGEANPHLHPLWQ
Sbjct: 1060 SLAVIERIHQSFFSNRALDEVDVRNILASEQRKILAGCRIVFSRVFPVGEANPHLHPLWQ 1119

Query: 931  TAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQD 990
            TAE FGAVCT  ID+QVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQD
Sbjct: 1120 TAESFGAVCTNQIDEQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQD 1179

Query: 991  FAIKP 995
            FAIKP
Sbjct: 1180 FAIKP 1184


>gi|356523718|ref|XP_003530482.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            3-like [Glycine max]
          Length = 1244

 Score = 1041 bits (2692), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 574/1042 (55%), Positives = 702/1042 (67%), Gaps = 76/1042 (7%)

Query: 1    MNHVLKEQNKEILSRLLSLIK-SHEPPLFSSNQIKEMEAMLSSL-------VTRANDKEK 52
            M+   KEQNK+ + RLLS +K   +  LFS   +KE++ M++++        + A  KEK
Sbjct: 232  MDSSEKEQNKDSILRLLSFVKDQQQAQLFSPEHVKEIQGMMTAIDSVGALVNSEAIGKEK 291

Query: 53   DMLAMHGVNGKDSNIVTENAVNDLNFKEKVPLPVDSLMQ-NKPL--------EASKPGPP 103
            ++        ++S +  E  ++++  +E   +    L+  +KPL        +A K G  
Sbjct: 292  ELQTTEIKTQENSAV--EVQIHEIKTQENQAVEAAELISYSKPLHRDITGTSQALKFGQN 349

Query: 104  GYRSRGVLLPLLDPHKVHDVDSLPSPTRETTPSVPVQRALVVGDGMVKSWAAAAKLSHNA 163
              + RGVLLPLLD HK HD DSLPSPTRE     PV + L VG+ MV+S +A+AK+  ++
Sbjct: 350  SIKGRGVLLPLLDLHKDHDADSLPSPTREAPSCFPVNKLLSVGESMVRSGSASAKMELDS 409

Query: 164  EVHKTPHYETDALRAFSSYQQKFGRNSFFMNSELPSPTPSEESGDGDGDTGGEISSATAV 223
            E  K   YETDAL+A S+YQQKFGR+S F N + PSPTPS +  D   DT  E+SSA+  
Sbjct: 410  EGSKFHLYETDALKAVSTYQQKFGRSSLFTNDKFPSPTPSGDCEDEVVDTNEEVSSASTG 469

Query: 224  DQPKPVNMPTLGQQPVSSQPMDISQPMDISSVQALTTANNSAPASSGYNPVVKPNPVVKA 283
            D         L Q PVS+  MD S      S +   T   S P              VK+
Sbjct: 470  DFLTSTKPTLLDQPPVSATSMDRSSMHGFISSRVDATGPGSFP--------------VKS 515

Query: 284  PIKSRDPRLRFASSNALNLNHQPAPILHNAPKVEPVGRVMSSRKQKTVEEPVLDGPALKR 343
              K+RDPRLRF +S+A  +++  + +++N  KVE  G  +S RKQK  EEP LD    KR
Sbjct: 516  SAKNRDPRLRFINSDASAVDNL-STLINNMSKVEYSGTTIS-RKQKAAEEPSLDVTVSKR 573

Query: 344  QRNGFENSGVVRDEKNI----YGSGGWLEDTDMFEPQIMNRNLLVDSAESNSRKLDNGAT 399
             ++  EN+     E N+     GSGGWLE+      Q++ RN L+D     ++K  N  +
Sbjct: 574  LKSSLENT-----EHNMSEVRTGSGGWLEENTGPGAQLIERNHLMDKFGPEAKKTLNTVS 628

Query: 400  SPIT-SGTPNVVVSGNEPAPATTPSTTVSLPALLKDIAVNPTMLLNILKMGQQQKLAADA 458
            S  T S   N     NE AP T  +   SLPALLK+ +VNP ML+NIL++ + QK +AD+
Sbjct: 629  SSCTGSDNFNATSIRNEQAPITASNVLASLPALLKEASVNPIMLVNILRLAEAQKKSADS 688

Query: 459  QQKSNDSSMNTMHP----PIPSSIPPVSVTCSIPSGILSKPM-----------------D 497
                  +++  +HP    P   +    S+  S+ +G+L   +                 D
Sbjct: 689  ------AAIMLLHPTSSNPAMGTDSTASIGSSMATGLLQSSVGMLPVSSQSTSTAQTLQD 742

Query: 498  ELGKVRMKPRDPRRVLH-GNALQRSGSLGPE-FKTDGPSAPCTQGSKENLNFQKQLGAPE 555
            + GK+RMKPRDPRR+LH  N +Q+SG LG E FK         Q + +N+N  K  G  +
Sbjct: 743  DSGKIRMKPRDPRRILHTNNTIQKSGDLGNEQFKAIVSPVSNNQRTGDNVNAPKLEGRVD 802

Query: 556  AKPVLSQSVLQPDITQQFTKNLKHIADFMSVSQPLTSEPMVSQNSPIQPGQIKSG-ADMK 614
             K V +QS  QPDI +QFT+NLK+IAD MSVSQ  ++   VSQN       + S   + K
Sbjct: 803  NKLVPTQSSAQPDIARQFTRNLKNIADIMSVSQESSTHTPVSQNFSSASVPLTSDRGEQK 862

Query: 615  AVVTNHDDKQTGTGSGPE-AGPVGAHPQSAWGDVEHLFEGYDDQQKAAIQKERTRRLEEQ 673
            +VV++  + Q    S  E A  V +  QS WGDVEHLFEGYD+QQKAAIQ+ER RR+EEQ
Sbjct: 863  SVVSSSQNLQADMASAHETAASVTSRSQSTWGDVEHLFEGYDEQQKAAIQRERARRIEEQ 922

Query: 674  KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMW 733
             KMF+ARKLCLVLDLDHTLLNSAKF EVDP+HDEILRKKEEQDREKPHRHLFRFPHMGMW
Sbjct: 923  NKMFAARKLCLVLDLDHTLLNSAKFVEVDPLHDEILRKKEEQDREKPHRHLFRFPHMGMW 982

Query: 734  TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFD 793
            TKLRPGIW FLE+ASKL+E+HLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD D  D
Sbjct: 983  TKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDTDSVD 1042

Query: 794  GDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPS 853
            G+ERVPKSKDLEGVLGMES+VVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGL GPS
Sbjct: 1043 GEERVPKSKDLEGVLGMESSVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLPGPS 1102

Query: 854  LLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQRKILAGCRIVFS 913
            LLEIDHDER E GTLASSL VIE++H+IFF+ QSL++VDVRNILA+EQRKILAGCRIVFS
Sbjct: 1103 LLEIDHDERPEAGTLASSLAVIEKIHQIFFASQSLEEVDVRNILASEQRKILAGCRIVFS 1162

Query: 914  RVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHP 973
            RVFPVGEANPHLHPLWQTAEQFGAVCT  ID+QVTHVVANS GTDKVNWAL+ GRFVVHP
Sbjct: 1163 RVFPVGEANPHLHPLWQTAEQFGAVCTNQIDEQVTHVVANSPGTDKVNWALNNGRFVVHP 1222

Query: 974  GWVEASALLYRRANEQDFAIKP 995
            GWVEASALLYRRANEQDFAIKP
Sbjct: 1223 GWVEASALLYRRANEQDFAIKP 1244


>gi|356567192|ref|XP_003551805.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            3-like [Glycine max]
          Length = 1221

 Score = 1014 bits (2623), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 570/1037 (54%), Positives = 692/1037 (66%), Gaps = 91/1037 (8%)

Query: 1    MNHVLKEQNKEILSRLLSLIK-SHEPPLFSSNQIKEMEAMLSSLVTRANDKEKDMLAMHG 59
            M+ + KEQNK+ + RLLS +K   +  LFS   IKE++ M++++     D    ++    
Sbjct: 234  MDSLKKEQNKDSILRLLSFVKDQQQAQLFSPEHIKEIQGMMTAI-----DYFGALVNSEA 288

Query: 60   VNGKDSNIVTENAVNDLNFKEKVPLPVDSLMQ-NKPLE--------ASKPGPPGYRSRGV 110
            + GK+  + T    +++  +E   +    L+  NKPL         A K G    + RGV
Sbjct: 289  I-GKEKELQTTVQTHEIKTQENQAVEAAELISYNKPLHSDIIGASHALKFGQNSIKGRGV 347

Query: 111  LLPLLDPHKVHDVDSLPSPTRETTPSVPVQRALVVGDGMVKSWAAAAKLSHNAEVHKTPH 170
            LLPLLD HK HD DSLPSPTRE     PV + L    G         K+  ++E  K   
Sbjct: 348  LLPLLDLHKDHDADSLPSPTREAPSCFPVNKLLSPESG---------KMELDSEGSKFHL 398

Query: 171  YETDALRAFSSYQQKFGRNSFFMNSELPSPTPSEESGDGDGDTGGEISSATAVD-----Q 225
            YETDAL+A S+YQQKFGR+S F N + PSPTPS +  D   DT  E+SSA+  D     +
Sbjct: 399  YETDALKAVSTYQQKFGRSSLFTNDKFPSPTPSGDCEDEIVDTNEEVSSASTGDFLTSTK 458

Query: 226  PKPVNMPTLGQQPVSSQPMDISQPMDISSVQALTTANNSAPASSGYNPVVKPNPVVKAPI 285
            P  +++P     PVS+   D S      S +       S P              VK+  
Sbjct: 459  PTLLDLP-----PVSATSTDRSSLHGFISSRVDAAGPGSLP--------------VKSSA 499

Query: 286  KSRDPRLRFASSNALNLNHQPAPILHNAPKVEPVGRVMSSRKQKTVEEPVLDGPALKRQR 345
            K+RDPRLRF +S+A  +++ P+ ++HN PKVE  G  +S RKQK  EEP LD    KRQ+
Sbjct: 500  KNRDPRLRFVNSDASAVDN-PSTLIHNMPKVEYAGTTIS-RKQKAAEEPSLDVTVSKRQK 557

Query: 346  NGFENSGVVRDEKNI----YGSGGWLEDTDMFEPQIMNRNLLVDSAESNSRKLDNGATSP 401
            +  EN+     E N+     G GGWLE+      Q + RN L+D      +K  N  +S 
Sbjct: 558  SPLENT-----EHNMSEVRTGIGGWLEEHTGPGAQFIERNHLMDKFGPEPQKTLNTVSSS 612

Query: 402  IT-SGTPNVVVSGNEPAPATTPSTTVSLPALLKDIAVNPTMLLNILKMGQQQKLAADAQQ 460
             T S   N     NE AP T+ +   SLPALLK  AVNPTML+N+L++       A+AQ+
Sbjct: 613  CTGSDNFNATSIRNEQAPITSSNVLASLPALLKGAAVNPTMLVNLLRI-------AEAQK 665

Query: 461  KSNDSSMNTM-HPPIPSSI----PPVSVTCSIPSGILSKPM------------DELGKVR 503
            KS DS+ N + HP   +S        S+  S+ +G+L   +            D+ GK+R
Sbjct: 666  KSADSATNMLLHPTSSNSAMGTDSTASIGSSMATGLLQSSVGMLPVSSQQTLQDDSGKIR 725

Query: 504  MKPRDPRRVLH-GNALQRSGSLGPE-FKTDGPSAPCTQGSKENLNFQKQLGAPEAKPVLS 561
            MKPRDPRR+LH  N +Q+SG+LG E FK         QG+ +N+N QK  G  ++K V +
Sbjct: 726  MKPRDPRRILHTNNTIQKSGNLGNEQFKAIVSPVSNNQGTGDNVNAQKLEGRVDSKLVPT 785

Query: 562  QSVLQPDITQQFTKNLKHIADFMSVSQPLTSEPMVSQ--NSPIQPGQIKSGADMKAVVTN 619
            Q   QPDI +QF +NLK+IAD MSVSQ  ++   V+Q  +S   P     G + K+VV+N
Sbjct: 786  QPSAQPDIARQFARNLKNIADIMSVSQESSTHTPVAQIFSSASVPLTSDRG-EQKSVVSN 844

Query: 620  HDDKQTGTGSGPEAGPVG-AHPQSAWGDVEHLFEGYDDQQKAAIQKERTRRLEEQKKMFS 678
              + + G  S  E    G    Q+ WGDVEHLFEGYD+QQKAAIQ+ER RR+EEQ KMF+
Sbjct: 845  SQNLEAGMVSAHETAASGTCRSQNTWGDVEHLFEGYDEQQKAAIQRERARRIEEQNKMFA 904

Query: 679  ARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRP 738
            ARKLCLVLDLDHTLLNSAKF EVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRP
Sbjct: 905  ARKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRP 964

Query: 739  GIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERV 798
            GIW FLE+ASKL+E+HLYTMGNKLYATEMAKVLDPKG+LFAGRVISRGDD D  DG+ER 
Sbjct: 965  GIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGLLFAGRVISRGDDTDSVDGEERA 1024

Query: 799  PKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEID 858
            PKSKDLEGVLGMES+VVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGL GPSLLEID
Sbjct: 1025 PKSKDLEGVLGMESSVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLPGPSLLEID 1084

Query: 859  HDERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPV 918
            HDER E GTLASSL VIE++H+IFF+ +SL++VDVRNILA+EQRKILAGCRIVFSRVFPV
Sbjct: 1085 HDERPEAGTLASSLAVIEKIHQIFFASRSLEEVDVRNILASEQRKILAGCRIVFSRVFPV 1144

Query: 919  GEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEA 978
            GEANPHLHPLWQTAEQFGA CT  ID+QVTHVVANS GTDKVNWAL+ GRFVVHPGWVEA
Sbjct: 1145 GEANPHLHPLWQTAEQFGAFCTNQIDEQVTHVVANSPGTDKVNWALNNGRFVVHPGWVEA 1204

Query: 979  SALLYRRANEQDFAIKP 995
            SALLYRRANEQDFAIKP
Sbjct: 1205 SALLYRRANEQDFAIKP 1221


>gi|449487451|ref|XP_004157633.1| PREDICTED: LOW QUALITY PROTEIN: RNA polymerase II C-terminal domain
            phosphatase-like 3-like [Cucumis sativus]
          Length = 1249

 Score = 1004 bits (2595), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 589/1036 (56%), Positives = 693/1036 (66%), Gaps = 91/1036 (8%)

Query: 1    MNHVLKEQNKEILSRLLSLIKSHEPPLFSSNQIKEMEAMLSSLVTRANDKEKDMLAMHGV 60
            MN   KE++KE LSRLLS +K+ +PPLFS  QIK +E  + S      D    + +M G 
Sbjct: 264  MNLSEKEEHKEHLSRLLSYVKNCDPPLFSPEQIKSVEVKMPS-----TDSLDHLPSMRG- 317

Query: 61   NGKDSNIVTENAVNDLNF------------------KEKVPLPVDSLMQNKPL-EASKPG 101
            + K+  I   N V D++F                   + +P  V        L E  + G
Sbjct: 318  SAKEVEIHIPNGVKDMDFYSAYTSTSSQLTPSNKLASDSIPFGVKGKNNLNILSEGLQSG 377

Query: 102  PPGYRSRGVLLPLLDPHKVHDVDSLPSPTRETTPSVPVQRALVVGDGMVKSWAAAAKLSH 161
                + RG LLPLLD HK HD DSLPSPTRE      VQ          KS  A  K++ 
Sbjct: 378  VSSIKGRGPLLPLLDLHKDHDADSLPSPTREAPTIFSVQ----------KSGNAPTKMAF 427

Query: 162  NAEVHKTPHYETDALRAFSSYQQKFGRNSFFMNSELPSPTPSEESGDGDGDTGGEISSAT 221
              +  ++  YETDAL+A S+YQQKFGR+SF M   LPSPTPSEE  DG GD GGE+SS++
Sbjct: 428  PVDGSRSHPYETDALKAVSTYQQKFGRSSFSMADRLPSPTPSEEH-DGGGDIGGEVSSSS 486

Query: 222  AVDQPKPVNMPTLGQQPVSSQPMD--ISQPMDISSVQALTTANNSAPASSGYNPVVKPNP 279
             +   K  N+   GQ+  S+  +   +   MD SS + L +  N AP SS  NP VKP  
Sbjct: 487  IIRSLKSSNVSKPGQKSNSASNVSTGLFPNMDSSSTRVLISPLNVAPPSSVSNPTVKP-- 544

Query: 280  VVKAPIKSRDPRLRFASSNA--LNLNHQPAPILHNAPKVEPVGRVMSSRKQKTVEEPVLD 337
                  KSRDPRLR  +S+A  ++LN +    + ++  +E     +  RKQK   EP  D
Sbjct: 545  ----LAKSRDPRLRIVNSDASGMDLNPRTMASVQSSSILESAA-TLHLRKQKMDGEPNTD 599

Query: 338  GPALKRQRNGFENSGVV-RDEKNIYGSGGWLEDTDMFEPQIMNRNLLVDSAESNSRKLDN 396
            GP +KR R G +N  V   D + + GSGGWLEDT    P++ NRN + + AE+N+ +   
Sbjct: 600  GPEVKRLRIGSQNLAVAASDVRAVSGSGGWLEDTMPAGPRLFNRNQM-EIAEANATE--- 655

Query: 397  GATSPITSGTPNVVVSGNEPAPATTPSTTVSLPALLKDIAVNPTMLLNILKMGQQQKLAA 456
                   S   N   SGNE  P    S   SLP+LLKDI VNPTMLLN+LKM QQQ+LAA
Sbjct: 656  ------KSNVTNNSGSGNECTPTVNNSNDASLPSLLKDIVVNPTMLLNLLKMSQQQQLAA 709

Query: 457  DAQQKSNDSSMNTMHP----PIPSSIPPVSVTCSIPSGILSKP------------MDELG 500
            + + KS++   N + P    P   S P ++   +  SGIL +              D+LG
Sbjct: 710  ELKLKSSEPEKNAICPTSLNPCQGSSPLINAPVAT-SGILQQSAGTPSASPVVGRQDDLG 768

Query: 501  KVRMKPRDPRRVLHGNALQRSGSLG-PEFKTDGPSAPCTQGSKENLNFQKQLGAPEAKPV 559
            KVRMKPRDPRRVLHGN+LQ+ GSLG  + K   P+A  T+GS++  N  KQ G  ++K  
Sbjct: 769  KVRMKPRDPRRVLHGNSLQKVGSLGNDQLKGVVPTASNTEGSRDIPNGHKQEGQGDSKLA 828

Query: 560  LSQSVLQPDITQQFTKNLKHIADFMSVSQPLTSEPMVSQNSPIQPGQIKSGADMKAVVTN 619
             SQ++L PDI +QFT NLK+IAD MSV  P TS P  S           S  D K V T 
Sbjct: 829  SSQTIL-PDIGRQFTNNLKNIADIMSVPSPPTSSPNSSSKP-----VGSSSMDSKPVTTA 882

Query: 620  HDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKAAIQKERTRRLEEQKKMFSA 679
                     S           Q AWGD+EHLF+ YDD+QKAAIQ+ER RR+EEQKKMF+A
Sbjct: 883  FQAVDMAASS---------RSQGAWGDLEHLFDSYDDKQKAAIQRERARRIEEQKKMFAA 933

Query: 680  RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPG 739
            RKLCLVLDLDHTLLNSAKF EVDPVHDEILRKKEEQDREK  RHLFRFPHMGMWTKLRPG
Sbjct: 934  RKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKAQRHLFRFPHMGMWTKLRPG 993

Query: 740  IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVP 799
            +W FLE+AS+L+E+HLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP DGD+RVP
Sbjct: 994  VWNFLEKASELYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPLDGDDRVP 1053

Query: 800  KSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDH 859
            KSKDLEGVLGMES VVIIDDS+RVWPHNK+NLIVVERYTYFPCSRRQFGLLGPSLLEIDH
Sbjct: 1054 KSKDLEGVLGMESGVVIIDDSIRVWPHNKMNLIVVERYTYFPCSRRQFGLLGPSLLEIDH 1113

Query: 860  DERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVG 919
            DER EDGTLASSLGVI+R+H+ FFS+  LD VDVR IL+AEQ+KILAGCRIVFSRVFPVG
Sbjct: 1114 DERPEDGTLASSLGVIQRIHQXFFSNPELDQVDVRTILSAEQQKILAGCRIVFSRVFPVG 1173

Query: 920  EANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEAS 979
            EANPHLHPLWQTAEQFGA CT  ID+QVTHVVANSLGTDKVNWALSTGRFVVHPGWVEAS
Sbjct: 1174 EANPHLHPLWQTAEQFGAQCTNQIDEQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEAS 1233

Query: 980  ALLYRRANEQDFAIKP 995
            ALLYRRA EQDFAIKP
Sbjct: 1234 ALLYRRATEQDFAIKP 1249


>gi|449445782|ref|XP_004140651.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            3-like [Cucumis sativus]
          Length = 1249

 Score = 1003 bits (2594), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 589/1036 (56%), Positives = 693/1036 (66%), Gaps = 91/1036 (8%)

Query: 1    MNHVLKEQNKEILSRLLSLIKSHEPPLFSSNQIKEMEAMLSSLVTRANDKEKDMLAMHGV 60
            MN   KE++KE LSRLLS +K+ +PPLFS  QIK +E  + S      D    + +M G 
Sbjct: 264  MNLSEKEEHKEHLSRLLSYVKNCDPPLFSPEQIKSVEVKMPS-----TDSLDHLPSMRG- 317

Query: 61   NGKDSNIVTENAVNDLNF------------------KEKVPLPVDSLMQNKPL-EASKPG 101
            + K+  I   N V D++F                   + +P  V        L E  + G
Sbjct: 318  SAKEVEIHIPNGVKDMDFYSAYTSTSSQLTPSNKLASDSIPFGVKGKNNLNILSEGLQSG 377

Query: 102  PPGYRSRGVLLPLLDPHKVHDVDSLPSPTRETTPSVPVQRALVVGDGMVKSWAAAAKLSH 161
                + RG LLPLLD HK HD DSLPSPTRE      VQ          KS  A  K++ 
Sbjct: 378  VSSIKGRGPLLPLLDLHKDHDADSLPSPTREAPTIFSVQ----------KSGNAPTKMAF 427

Query: 162  NAEVHKTPHYETDALRAFSSYQQKFGRNSFFMNSELPSPTPSEESGDGDGDTGGEISSAT 221
              +  ++  YETDAL+A S+YQQKFGR+SF M   LPSPTPSEE  DG GD GGE+SS++
Sbjct: 428  PVDGSRSHPYETDALKAVSTYQQKFGRSSFSMADRLPSPTPSEEH-DGGGDIGGEVSSSS 486

Query: 222  AVDQPKPVNMPTLGQQPVSSQPMD--ISQPMDISSVQALTTANNSAPASSGYNPVVKPNP 279
             +   K  N+   GQ+  S+  +   +   MD SS + L +  N AP SS  NP VKP  
Sbjct: 487  IIRSLKSSNVSKPGQKSNSASNVSTGLFPNMDSSSTRVLISPLNVAPPSSVSNPTVKP-- 544

Query: 280  VVKAPIKSRDPRLRFASSNA--LNLNHQPAPILHNAPKVEPVGRVMSSRKQKTVEEPVLD 337
                  KSRDPRLR  +S+A  ++LN +    + ++  +E     +  RKQK   EP  D
Sbjct: 545  ----LAKSRDPRLRIVNSDASGMDLNPRTMASVQSSSILESAA-TLHLRKQKMDGEPNTD 599

Query: 338  GPALKRQRNGFENSGVV-RDEKNIYGSGGWLEDTDMFEPQIMNRNLLVDSAESNSRKLDN 396
            GP +KR R G +N  V   D + + GSGGWLEDT    P++ NRN + + AE+N+ +  N
Sbjct: 600  GPEVKRLRIGSQNLAVAASDVRAVSGSGGWLEDTMPAGPRLFNRNQM-EIAEANATEKSN 658

Query: 397  GATSPITSGTPNVVVSGNEPAPATTPSTTVSLPALLKDIAVNPTMLLNILKMGQQQKLAA 456
                       N   SGNE  P    S   SLP+LLKDI VNPTMLLN+LKM QQQ+LAA
Sbjct: 659  ---------VTNNSGSGNECTPTVNNSNDASLPSLLKDIVVNPTMLLNLLKMSQQQQLAA 709

Query: 457  DAQQKSNDSSMNTMHP----PIPSSIPPVSVTCSIPSGILSKP------------MDELG 500
            + + KS++   N + P    P   S P ++   +  SGIL +              D+LG
Sbjct: 710  ELKLKSSEPEKNAICPTSLNPCQGSSPLINAPVAT-SGILQQSAGTPSASPVVGRQDDLG 768

Query: 501  KVRMKPRDPRRVLHGNALQRSGSLG-PEFKTDGPSAPCTQGSKENLNFQKQLGAPEAKPV 559
            KVRMKPRDPRRVLHGN+LQ+ GSLG  + K   P+A  T+GS++  N  KQ G  ++K  
Sbjct: 769  KVRMKPRDPRRVLHGNSLQKVGSLGNDQLKGVVPTASNTEGSRDIPNGHKQEGQGDSKLA 828

Query: 560  LSQSVLQPDITQQFTKNLKHIADFMSVSQPLTSEPMVSQNSPIQPGQIKSGADMKAVVTN 619
             SQ++L PDI +QFT NLK+IAD MSV  P TS P  S           S  D K V T 
Sbjct: 829  SSQTIL-PDIGRQFTNNLKNIADIMSVPSPPTSSPNSSSKP-----VGSSSMDSKPVTTA 882

Query: 620  HDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKAAIQKERTRRLEEQKKMFSA 679
                     S           Q AWGD+EHLF+ YDD+QKAAIQ+ER RR+EEQKKMF+A
Sbjct: 883  FQAVDMAASS---------RSQGAWGDLEHLFDSYDDKQKAAIQRERARRIEEQKKMFAA 933

Query: 680  RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPG 739
            RKLCLVLDLDHTLLNSAKF EVDPVHDEILRKKEEQDREK  RHLFRFPHMGMWTKLRPG
Sbjct: 934  RKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKAQRHLFRFPHMGMWTKLRPG 993

Query: 740  IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVP 799
            +W FLE+AS+L+E+HLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP DGD+RVP
Sbjct: 994  VWNFLEKASELYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPLDGDDRVP 1053

Query: 800  KSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDH 859
            KSKDLEGVLGMES VVIIDDS+RVWPHNK+NLIVVERYTYFPCSRRQFGLLGPSLLEIDH
Sbjct: 1054 KSKDLEGVLGMESGVVIIDDSIRVWPHNKMNLIVVERYTYFPCSRRQFGLLGPSLLEIDH 1113

Query: 860  DERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVG 919
            DER EDGTLASSLGVI+R+H+ FFS+  LD VDVR IL+AEQ+KILAGCRIVFSRVFPVG
Sbjct: 1114 DERPEDGTLASSLGVIQRIHQSFFSNPELDQVDVRTILSAEQQKILAGCRIVFSRVFPVG 1173

Query: 920  EANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEAS 979
            EANPHLHPLWQTAEQFGA CT  ID+QVTHVVANSLGTDKVNWALSTGRFVVHPGWVEAS
Sbjct: 1174 EANPHLHPLWQTAEQFGAQCTNQIDEQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEAS 1233

Query: 980  ALLYRRANEQDFAIKP 995
            ALLYRRA EQDFAIKP
Sbjct: 1234 ALLYRRATEQDFAIKP 1249


>gi|357502711|ref|XP_003621644.1| RNA polymerase II C-terminal domain phosphatase-like protein
            [Medicago truncatula]
 gi|355496659|gb|AES77862.1| RNA polymerase II C-terminal domain phosphatase-like protein
            [Medicago truncatula]
          Length = 1213

 Score =  941 bits (2432), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 542/1037 (52%), Positives = 669/1037 (64%), Gaps = 121/1037 (11%)

Query: 1    MNHVLKEQNKEILSRLLSLIKSHEPPLFSSNQIKEMEAMLSSLVTRANDKEKDMLAMHGV 60
            M+++ KE+NK+ +SRLLS +K+    LF+   +K++   +  ++T  +     + A+   
Sbjct: 252  MDNLQKEENKDNISRLLSFLKNQH--LFTMEHMKKVIFNIQVMITVIDS----VFAL--- 302

Query: 61   NGKDSNIVTENAVNDLNFKEKVP-LPVDSLMQNKPL---------EASKPGPPGYRSRGV 110
             G +  +  E  V  LN  E++P L  D  + +  L         EA + G      RG+
Sbjct: 303  -GNNEVVGKEEKVEALNTTEQIPGLKADEYISSSQLVHDNSTYASEALQYGQSNVVGRGL 361

Query: 111  LLPLLDPHKVHDVDSLPSPTRETTPSVPVQRALV-VGDGMVK------SWAAAAKLSHNA 163
            +LPL D HK HD+DSLPSPTRE     PV +    +GDG+ +          A K+  + 
Sbjct: 362  MLPLFDLHKDHDLDSLPSPTREAPSCFPVNKLFSDLGDGIDRFGLPPAVCTEAEKMELDG 421

Query: 164  EVHKTPHYETDALRAFSSYQQKFGRNSFFMNSELPSPTPSEESGDGDG---DTGGEISSA 220
            +  K   YETDAL+A S+YQQKF R+S+F + + PSPTPS   GD +G   DT  E+SSA
Sbjct: 422  KDSKLHIYETDALKAVSTYQQKFSRSSYFTDDKFPSPTPS---GDCEGEAVDTNDEVSSA 478

Query: 221  TAVDQPKPVNMPTLGQQPVSSQPMDISQPMDISSVQALTTANNSAPASSGYNPVVKPNPV 280
            +          P L Q PVSS  +D      +   +   T + S PA             
Sbjct: 479  SIASSLTSFKPPPLDQIPVSSTSLDRPNMHGLVDSRIDATGSGSYPA------------- 525

Query: 281  VKAPIKSRDPRLRFASSNALNLNHQPAPILHNAPKVEPVGRVMSSRKQKTVEEPVLDGPA 340
             K+  KSRDPRLRF + +A  L+   +   H+ P+VE  GRV+S RKQKTVEEP LD  A
Sbjct: 526  -KSSAKSRDPRLRFINPDASTLDLNQSLGTHSMPRVEYGGRVIS-RKQKTVEEPSLDATA 583

Query: 341  LKRQRNGFENSG-VVRDEKNIYGSGGWLEDTDMFEPQIMNRNLLVDSAESNSRKLDNGAT 399
             KR R   ENS    R+E+ + G GGW E+  +   Q+  RN L+   E+  ++      
Sbjct: 584  PKRLRRSLENSEHNTREERAMAGKGGWFEENTVAGSQLAERNHLMQKGETELKR------ 637

Query: 400  SPITSGTPNVVVS--GNEPAPATTPSTTVSLPA-LLKDIAVNPTMLLNILKMGQQQKLAA 456
              I++ + N+ VS  GNE A  T+ S T SLP  LL ++AVNP ML++++   Q  +  A
Sbjct: 638  -TISTSSSNLTVSNNGNELASVTSSSATASLPTYLLNNVAVNPAMLIHMILEHQHNE--A 694

Query: 457  DAQQKSNDSSMNT-----MHPP------------IPSSIPPVSVTCSIPSGILSKPMDEL 499
            +AQ+K  DS+  T       P             +P+S P  S+T ++P        ++ 
Sbjct: 695  EAQKKPVDSARGTDATVNTGPAMTAGLTQSSVGILPASSPATSMTQTLP--------EDS 746

Query: 500  GKVRMKPRDPRRVLHGNALQRSGSLGPEFKTDGPSAPCTQGSKENLNFQKQLGAPEAKPV 559
            GK+RMKPRDPRR LHG++                              QK     E K  
Sbjct: 747  GKIRMKPRDPRRFLHGSS----------------------------TLQKFDVRVETKLA 778

Query: 560  LSQSVLQPDITQQFTKNLKHIADFMSVSQPLTSEPMVSQN-SPIQPGQIKSGADMKAVVT 618
              QS+ QPDIT+QFTKNLK+IAD MSV Q  +S P  +QN S      +   ++ K+ V 
Sbjct: 779  PIQSIAQPDITRQFTKNLKNIADIMSVPQETSSNPPATQNVSSASVPFMSDRSEQKSGVP 838

Query: 619  NHDDKQTGTGSGPE-AGPVGAHPQSAWGDVEHLFEGYDDQQKAAIQKERTRRLEEQKKMF 677
            N  + + G GS PE   P  + PQ+ W DVEHLFE YD +QKAAIQ+ER+RRLEEQKKMF
Sbjct: 839  NSQNLKDGVGSAPETCAPGSSRPQNTWADVEHLFEAYDVKQKAAIQRERSRRLEEQKKMF 898

Query: 678  SARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLR 737
            +ARKLCLVLDLDHTLLNSAKF EVDPVHDE+LRKKE++DREKP RHLFRFPHMGMWTKLR
Sbjct: 899  AARKLCLVLDLDHTLLNSAKFVEVDPVHDEMLRKKEQEDREKPQRHLFRFPHMGMWTKLR 958

Query: 738  PGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDER 797
            PG+W FLE+A KLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD +  D    
Sbjct: 959  PGVWNFLEKAGKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDAETAD---- 1014

Query: 798  VPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEI 857
              KSKDLEGVLGMES+VVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGL GPSLLEI
Sbjct: 1015 -TKSKDLEGVLGMESSVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLPGPSLLEI 1073

Query: 858  DHDERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQRKILAGCRIVFSRVFP 917
            DHDER E GTLASSLGVIER+H+ FF+ QSL++VDVRNILA+EQRKIL GCRIVFSR+FP
Sbjct: 1074 DHDERPESGTLASSLGVIERIHQNFFASQSLEEVDVRNILASEQRKILDGCRIVFSRMFP 1133

Query: 918  VGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVE 977
            VG+ANPHLHPLWQTAEQFGA CT  IDDQVTHVVA+S GTDKVNWA++ G+FVVHPGWVE
Sbjct: 1134 VGDANPHLHPLWQTAEQFGASCTNQIDDQVTHVVAHSPGTDKVNWAIANGKFVVHPGWVE 1193

Query: 978  ASALLYRRANEQDFAIK 994
            ASALLYRRANEQDFAIK
Sbjct: 1194 ASALLYRRANEQDFAIK 1210


>gi|56547717|gb|AAV92930.1| putative transcription regulator CPL1 [Solanum lycopersicum]
          Length = 1227

 Score =  929 bits (2401), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 549/1055 (52%), Positives = 671/1055 (63%), Gaps = 104/1055 (9%)

Query: 1    MNHVLKEQNKEILSRLLSLIKSHEPPLFSSNQIKEMEAML----SSLV---TRANDKEKD 53
            MN   K+QN +ILSRLL   K+  P L SS Q+KE++A++     SLV   T+ ND    
Sbjct: 215  MNDHQKQQNTDILSRLLFNAKTQLPALLSSEQLKELDALILSINHSLVSSNTQDNDTVNG 274

Query: 54   MLAMHGVNGKDSNIVTENAVNDLNFKEKVPLPVDSLM------QNKPLEASKPGPPGYRS 107
            +  +  ++ KDS+  +ENA  D     K  L   S+       Q+   E+ KPG    ++
Sbjct: 275  INVVQLLDMKDSHKSSENANQDFTSVNKYDLGDVSIKSSGLKEQSVSSESVKPGLDNSKA 334

Query: 108  RGVLLPLLDPHKVHDVDSLPSPTRETTPSVPVQRALVVGDGMVKS---WAAAAKLSHNAE 164
            +G+  PLLD HK HD D+LPSPTR+  P  P  +      GMVK       A+    N+ 
Sbjct: 335  KGLSFPLLDLHKDHDEDTLPSPTRQIGPQFPATQT----HGMVKLDLPIFPASLDKGNSL 390

Query: 165  VHKTPHYETDALRAFSSYQQKFGRNSFFMNSELPSPTPSEESGDGDGDTGGEISSATAVD 224
            +H    YETDAL+A SSYQQKFGR+S F++  LPSPTPSEE   G GDTGGE++S   V 
Sbjct: 391  LHP---YETDALKAVSSYQQKFGRSSLFVSENLPSPTPSEEDDSGKGDTGGEVTSFDVVH 447

Query: 225  QPKPVNMPTLGQQPVSSQPMDISQPMDISSVQALTTANNSAPASSGYNPVVKPNPVVKAP 284
                +N  ++GQ  +SS P       +I   Q L T   + P S        PNP +++ 
Sbjct: 448  NASHLNESSMGQPILSSVPQ-----TNILDGQGLGTTRTADPLS------FLPNPSLRSS 496

Query: 285  I-KSRDPRLRFASSNALNLNHQPAPILHNAPKVEPVGRVMSSRKQKTVEEPVLDGPALKR 343
              KSRDPRLR A+S+ +  N    PI     K+E    ++ S+KQKTV+    D P  KR
Sbjct: 497  TAKSRDPRLRLATSDTVAQN-TILPIPDIDLKLEASLEMIVSKKQKTVDLSAFDAPLPKR 555

Query: 344  QRNGFENSGVVRDEKNIYGSGGWLEDTDMFEPQIMNRNLLVDSAESNSRKLDNGATSPIT 403
            QR+   +S +V D +   G+GGWLED    E  I + N    +++++ RKL+   T+ I 
Sbjct: 556  QRSEQTDSIIVSDVRPSIGNGGWLEDRGTAELPITSSNCATYNSDNDIRKLEQ-VTATIA 614

Query: 404  SGTPNVVVSGNEPAPATTPSTTVSLPALLKDIAVNPTMLLNILKMGQQQKLAADAQQKSN 463
            +  P+V+V+  E  P T  ST+ +L +LLKDIA+NP++ +NI+K  QQ+   A     + 
Sbjct: 615  T-IPSVIVNAAENFPVTGISTSTTLHSLLKDIAINPSIWMNIIKTEQQKSADASRTNTAQ 673

Query: 464  DSSMNTMHPPIPSSI---PPVSVTCSIPSGILSKPM------------------------ 496
             SS  ++   +PS++   P  S       GIL  P                         
Sbjct: 674  ASSSKSILGAVPSTVAVAPRSSAIGQRSVGILQTPTHTASAASSIYNLLMNDFIYSVIFT 733

Query: 497  ----------------DELGKVRMKPRDPRRVLHGNALQRSGSLGPEFKTDGPSAPCTQG 540
                            DE+  VRMKPRDPRRVLH  A+ + GS+G +    G +   T  
Sbjct: 734  ASIAQFPFYFFLTFSRDEVAIVRMKPRDPRRVLHSTAVLKGGSVGLDQCKTGVAG--THA 791

Query: 541  SKENLNFQKQLGAPEAKPVLSQSVLQPDITQQFTKNLKHIADFMSVSQPLTSEPMVSQNS 600
            +  NL+FQ Q    + K  ++ S   PDI  QFTKNLK+IAD M    P TS  + SQ  
Sbjct: 792  TISNLSFQSQEDQLDRKSAVTLSTTPPDIACQFTKNLKNIAD-MISVSPSTSPSVASQTQ 850

Query: 601  PIQPGQIKSGADMKAVVTNHDDKQTGTGSGPEAGPVGA-HPQSAWGDVEHLFEGYDDQQK 659
             +     +S +++K  V+   +     G   E G  G+  PQ +WGDVEHLFEGY DQQ+
Sbjct: 851  TLCIQAYQSRSEVKGAVSEPSEWVNDAGLASEKGSPGSLQPQISWGDVEHLFEGYSDQQR 910

Query: 660  AAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREK 719
            A IQ+ERTRRLEEQKKMFS                   F E+DPVH+EILRKKEEQDREK
Sbjct: 911  ADIQRERTRRLEEQKKMFS-------------------FVEIDPVHEEILRKKEEQDREK 951

Query: 720  PHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA 779
            P+RHLFRFPHMGMWTKLRPGIW FLE+AS LFE+HLYTMGNKLYATEMAK+LDPKG LFA
Sbjct: 952  PYRHLFRFPHMGMWTKLRPGIWNFLEKASNLFELHLYTMGNKLYATEMAKLLDPKGDLFA 1011

Query: 780  GRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTY 839
            GRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERY Y
Sbjct: 1012 GRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYIY 1071

Query: 840  FPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAA 899
            FPCSRRQFGL GPSLLEIDHDER EDGTLAS LGVI+R+H+ FF+H+S+D+ DVRNILA 
Sbjct: 1072 FPCSRRQFGLPGPSLLEIDHDERPEDGTLASCLGVIQRIHQNFFTHRSIDEADVRNILAT 1131

Query: 900  EQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDK 959
            EQ+KILAGCRIVFSRVFPVGEA+PHLHPLWQTAEQFGAVCT  IDDQVTHVVANSLGTDK
Sbjct: 1132 EQKKILAGCRIVFSRVFPVGEASPHLHPLWQTAEQFGAVCTSQIDDQVTHVVANSLGTDK 1191

Query: 960  VNWALSTGRFVVHPGWVEASALLYRRANEQDFAIK 994
            VNWALSTGR VVHPGWVEASALLYRRANE DFAIK
Sbjct: 1192 VNWALSTGRSVVHPGWVEASALLYRRANEHDFAIK 1226


>gi|297826809|ref|XP_002881287.1| hypothetical protein ARALYDRAFT_482300 [Arabidopsis lyrata subsp.
            lyrata]
 gi|297327126|gb|EFH57546.1| hypothetical protein ARALYDRAFT_482300 [Arabidopsis lyrata subsp.
            lyrata]
          Length = 1248

 Score =  904 bits (2335), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 542/1040 (52%), Positives = 663/1040 (63%), Gaps = 94/1040 (9%)

Query: 1    MNHVLKEQNKEILSRLLSLIKSHEPPLFSSNQIKEMEAMLSSLVTRANDKEKDMLAMHGV 60
            +N+V KE+NKE +SRLL+L+  H     SSNQ  E+EAM         D  +  +A++  
Sbjct: 258  LNNVSKERNKETMSRLLTLVNDHFSRFLSSNQKNEIEAM-------NQDLSRSAIAVY-- 308

Query: 61   NGKDSNIVTENAVNDLNFKEKVPLPVDSLMQNK-PLEASKPGPPGYRSRGVLLPLLDPHK 119
                +   +E  VN +      P   DS +  K   E +  G    RSR  +LPLLD HK
Sbjct: 309  ----TGTSSEENVNRMT----QPSNGDSFLAKKLSSEGTHRGASYVRSRLPMLPLLDLHK 360

Query: 120  VHDVDSLPSPTRETTPSVPVQRALVVGDGMVKSWAAAAKLSHNAEVHKTPHYETDALRAF 179
             HD DSLPSPTRETTPS+PV         MVK      + S   E  K   YE+DAL+A 
Sbjct: 361  DHDADSLPSPTRETTPSLPVNGR----HTMVKPGFPVGRESQTTEGAKVYPYESDALKAV 416

Query: 180  SSYQQKFGRNSFFMNSELPSPTPSEESGDGDGDTGGEISSATAVDQPKPVNMPTLGQQ-P 238
            S+Y QKFG NS F   +LPSPTPS E  DG+GD  GE+SS+       P  +   GQ  P
Sbjct: 417  STYHQKFGLNSVFKTDDLPSPTPSGEPNDGNGDIDGEVSSSVVKSS-NPGTLLMYGQDVP 475

Query: 239  VSSQPMDISQPMD--ISSV---QALTTANNSAPASSGYNPVVKPNPVVKAPIKSRDPRLR 293
            + S     S P+   +SS      L+    SAP  S    V   +  VK   KSRDPRLR
Sbjct: 476  LPSNFNSRSMPVANAVSSTVPPHHLSIHTISAPTGS-TQTVFASDQTVKPSAKSRDPRLR 534

Query: 294  FASSNALN--LNHQPAPILHNAPKVEPVGRVMSSRKQKTVEEPVLDGPALKRQRNGFENS 351
             A  +  N  +N   +    N  KVE    +++ RKQK  +E  +DGPA KRQ++     
Sbjct: 535  LAKPDTANVTINSYSSGDARNLFKVELSADLVNPRKQKAADELFIDGPAWKRQKSD---- 590

Query: 352  GVVRDEKNIYGSGGWLEDTDMFE-PQIMNRNLLVDSAESNSRKLDNGATSPITSGTPNVV 410
                D     G GGWLEDT+    P++          ES  R ++NG TS  TS  P   
Sbjct: 591  ---TDAPKAAGIGGWLEDTESSGLPKL----------ESKPRLIENGVTSMTTSVMPTSA 637

Query: 411  VSGNEPAPATTPSTTVSLPALLKDIAVNPTMLLNILKMGQQQKLAADAQQKSNDSSMNTM 470
            VS ++  P T  +   SL +LL+DIAVNPTMLLN+LKMG++ K+   A QK  D      
Sbjct: 638  VSVSQKVP-TASTDAASLQSLLQDIAVNPTMLLNLLKMGERHKVPEKALQKPMDPR-RAA 695

Query: 471  HPPIPSSIPPVSVTCSIPS-----------GIL-----SKPMDELGKVRMKPRDPRRVLH 514
              P  S +P VS    IP+           G+L     + P DE G +RMKPRDPRR+LH
Sbjct: 696  QLPGSSVLPGVSAPLHIPASNALATNSSKRGVLQDSSQNAPTDESGSIRMKPRDPRRILH 755

Query: 515  GNALQRS-GSLGPEFKTDGPSAPCT---QGSKENLNFQKQL---------GAPEAK---P 558
            G  LQR+  S+  + K +  S   T   +G  E+L    QL         G  + K    
Sbjct: 756  GGTLQRTDSSMEKQSKVNDSSTLGTLTMKGKTEDLETPSQLLPRQNISQNGTSKMKISGE 815

Query: 559  VLSQSVLQPDITQQFTKNLKHIADFMSVSQPLTSEPMVSQNSPIQPGQIKSGADMKAVVT 618
            +LS+    PD + QFTK++K+IAD + VSQ   + P  + +      Q+K+  D+K   +
Sbjct: 816  LLSEKT--PDFSTQFTKSVKNIADMVVVSQQAGNPPASTHSI-----QLKTERDVKQNPS 868

Query: 619  NHDDKQTGTGSGPEAGPVGAHP---QSAWGDVEHLFEGYDDQQKAAIQKERTRRLEEQKK 675
            N +D++        +    A P    ++WGDVEHLFEGYDD Q+ AIQ+ER RRLEEQKK
Sbjct: 869  NPNDQEEDVSVSAASVTATAGPTRSMNSWGDVEHLFEGYDDTQRVAIQRERVRRLEEQKK 928

Query: 676  MFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTK 735
            MF+++KL LVLD+DHTLLNSAKF+EV+  H+EILRKKEEQDREKP+RHLFRFPHMGMWTK
Sbjct: 929  MFASQKLSLVLDIDHTLLNSAKFNEVEFRHEEILRKKEEQDREKPYRHLFRFPHMGMWTK 988

Query: 736  LRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGD 795
            LRPGIW FLE+ASKL+E+HLYTMGNKLYATEMAK+LDPKG+LF GRVIS+GDDGDP DGD
Sbjct: 989  LRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKLLDPKGILFNGRVISKGDDGDPLDGD 1048

Query: 796  ERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLL 855
            ERVPKSKDLEGV+GMES+VVIIDDSVRVWP+NK+NLI VERY YFP SRRQFGLLGPSLL
Sbjct: 1049 ERVPKSKDLEGVMGMESSVVIIDDSVRVWPYNKMNLIAVERYLYFPRSRRQFGLLGPSLL 1108

Query: 856  EIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQRKILAGCRIVFSRV 915
            E+D DE  E+GTLASSL VIE++HK FFSH SLD+VDVRNILA+EQRKILAGCRIVFSR+
Sbjct: 1109 ELDRDEVPEEGTLASSLAVIEKIHKNFFSHTSLDEVDVRNILASEQRKILAGCRIVFSRI 1168

Query: 916  FPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGW 975
             PVGEA PHLHPLWQTAEQFGAVCT  +D+ VTHVV NSLGTDKVNWAL+ GRFVVHPGW
Sbjct: 1169 IPVGEAKPHLHPLWQTAEQFGAVCTTQVDEHVTHVVTNSLGTDKVNWALTRGRFVVHPGW 1228

Query: 976  VEASALLYRRANEQDFAIKP 995
            VEASA LY+RANE  +AI P
Sbjct: 1229 VEASAFLYQRANENLYAINP 1248


>gi|30685744|ref|NP_180912.2| RNA polymerase II C-terminal domain phosphatase-like 3 [Arabidopsis
            thaliana]
 gi|238055326|sp|Q8LL04.2|CPL3_ARATH RecName: Full=RNA polymerase II C-terminal domain phosphatase-like 3;
            Short=FCP-like 3; AltName: Full=Carboxyl-terminal
            phosphatase-like 3; Short=AtCPL3; Short=CTD
            phosphatase-like 3
 gi|330253756|gb|AEC08850.1| RNA polymerase II C-terminal domain phosphatase-like 3 [Arabidopsis
            thaliana]
          Length = 1241

 Score =  900 bits (2325), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 543/1045 (51%), Positives = 662/1045 (63%), Gaps = 111/1045 (10%)

Query: 1    MNHVLKEQNKEILSRLLSLIKSHEPPLFSSNQIKEMEAMLSSLVTRANDKEKDMLAMHGV 60
            MN++ KE+NKE +SRLL+L+  H     S NQ  E+E M         D  +  +A+   
Sbjct: 258  MNNISKERNKETMSRLLTLVNDHFSQFLSFNQKNEIETM-------NQDLSRSAIAVFAG 310

Query: 61   NGKDSNIVTENAVNDLNFKEKVPLPVDSLMQNK-PLEASKPGPPGYRSRGVLLPLLDPHK 119
               + N      VN +      P   DS +  K   E++  G    RSR  +LPLLD HK
Sbjct: 311  TSSEEN------VNQMT----QPSNGDSFLAKKLTSESTHRGAAYLRSRLPMLPLLDLHK 360

Query: 120  VHDVDSLPSPTRETTPSVPVQRALVVGDGMVKSWAAAAKLSHNAEVHKTPHYETDALRAF 179
             HD DSLPSPTRETTPS+PV         MV+      + S   E  K   YE+DA +A 
Sbjct: 361  DHDADSLPSPTRETTPSLPVNGR----HTMVRPGFPVGRESQTTEGAKVYSYESDARKAV 416

Query: 180  SSYQQKFGRNSFFMNSELPSPTPSEESGDGDGDTGGEISSATAV------------DQPK 227
            S+YQQKFG NS F   +LPSPTPS E  DG+GD GGE+SS+               D P 
Sbjct: 417  STYQQKFGLNSVFKTDDLPSPTPSGEPNDGNGDVGGEVSSSVVKSSNPGSHLIYGQDVPL 476

Query: 228  PVNMPTLGQQPVSSQPMDISQPMDISSVQALTTANNSAPASSGYNPVVKPNPVVKAPIKS 287
            P N  +    PV++       P  +S + A+     SAP +S        +  VK   KS
Sbjct: 477  PSNFNSRSM-PVANSVSSTVPPHHLS-IHAI-----SAPTAS--------DQTVKPSAKS 521

Query: 288  RDPRLRFASSNALNLN--HQPAPILHNAPKVEPVGRVMSSRKQKTVEEPVLDGPALKRQR 345
            RDPRLR A  +A N+      +    N  KVE    +++ RKQK  +E ++DGPA KRQ+
Sbjct: 522  RDPRLRLAKPDAANVTIYSYSSGDARNLSKVELSADLVNPRKQKAADEFLIDGPAWKRQK 581

Query: 346  NGFENSGVVRDEKNIYGSGGWLEDTDMFEPQIMNRNLLVDSAESNSRKLDNGATSPITSG 405
            +         D     G+GGWLEDT+       +  LL    ES  R ++NG TS  +S 
Sbjct: 582  SD-------TDAPKAAGTGGWLEDTE-------SSGLL--KLESKPRLIENGVTSMTSSV 625

Query: 406  TPNVVVSGNEPAPATTPSTTVSLPALLKDIAVNPTMLLNILKMGQQQKLAADAQQKSNDS 465
             P   VS ++    T  + T SL +LLKDIAVNPTMLLN+LKMG++QK+   A QK  D 
Sbjct: 626  MPTSAVSVSQKV-RTASTDTASLQSLLKDIAVNPTMLLNLLKMGERQKVPEKAIQKPMDP 684

Query: 466  SMNTMHPPIPSSIPPVSVTCSIP-----------SGIL-----SKPMDELGKVRMKPRDP 509
                   P  S  P VS   SIP           SG+L     + P  E G +RMKPRDP
Sbjct: 685  R-RAAQLPGSSVQPGVSTPLSIPASNALAANSLNSGVLQDSSQNAPAAESGSIRMKPRDP 743

Query: 510  RRVLHGNALQRS-GSLGPEFKTDGPSAPCT---QGSKENLNFQKQLGAPE-------AKP 558
            RR+LHG+ LQR+  S+  + K + PS   T   +G  E+L    QL   +       +K 
Sbjct: 744  RRILHGSTLQRTDSSMEKQTKVNDPSTLGTLTMKGKAEDLETPPQLDPRQNISQNGTSKM 803

Query: 559  VLSQSVLQ---PDITQQFTKNLKHIADFMSVSQPLTSEPMVSQNSPIQPGQIKSGADMKA 615
             +S  +L    PD + QFTKNLK IAD + VSQ L + P     + +   Q+K+  D+K 
Sbjct: 804  KISGELLSGKTPDFSTQFTKNLKSIADMVVVSQQLGNPP-----ASMHSVQLKTERDVKH 858

Query: 616  VVTN-----HDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKAAIQKERTRRL 670
              +N      D   +       AGP  +   ++WGDVEHLFEGYDD Q+ AIQ+ER RRL
Sbjct: 859  NPSNPNAQDEDVSVSAASVTAAAGPTRS--MNSWGDVEHLFEGYDDIQRVAIQRERVRRL 916

Query: 671  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 730
            EEQ KMF+++KL LVLD+DHTLLNSAKF+EV+  H+EILRKKEEQDREKP+RHLFRF HM
Sbjct: 917  EEQNKMFASQKLSLVLDIDHTLLNSAKFNEVESRHEEILRKKEEQDREKPYRHLFRFLHM 976

Query: 731  GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 790
            GMWTKLRPGIW FLE+ASKL+E+HLYTMGNKLYATEMAK+LDPKGVLF GRVIS+GDDGD
Sbjct: 977  GMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKLLDPKGVLFNGRVISKGDDGD 1036

Query: 791  PFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLL 850
            P DGDERVPKSKDLEGV+GMES+VVIIDDSVRVWP +K+NLI VERY YFPCSRRQFGLL
Sbjct: 1037 PLDGDERVPKSKDLEGVMGMESSVVIIDDSVRVWPQHKMNLIAVERYLYFPCSRRQFGLL 1096

Query: 851  GPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQRKILAGCRI 910
            GPSLLE+D DE  E+GTLASSL VIE++H+ FFSH SLD+VDVRNILA+EQRKILAGCRI
Sbjct: 1097 GPSLLELDRDEVPEEGTLASSLAVIEKIHQNFFSHTSLDEVDVRNILASEQRKILAGCRI 1156

Query: 911  VFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFV 970
            VFSR+ PVGEA PHLHPLWQTAEQFGAVCT  +D+ VTHVV NSLGTDKVNWAL+ GRFV
Sbjct: 1157 VFSRIIPVGEAKPHLHPLWQTAEQFGAVCTTQVDEHVTHVVTNSLGTDKVNWALTRGRFV 1216

Query: 971  VHPGWVEASALLYRRANEQDFAIKP 995
            VHPGWVEASA LY+RANE  +AI P
Sbjct: 1217 VHPGWVEASAFLYQRANENLYAINP 1241


>gi|22212705|gb|AAM94371.1|AF486633_1 CTD phosphatase-like 3 [Arabidopsis thaliana]
          Length = 1241

 Score =  898 bits (2321), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 542/1045 (51%), Positives = 661/1045 (63%), Gaps = 111/1045 (10%)

Query: 1    MNHVLKEQNKEILSRLLSLIKSHEPPLFSSNQIKEMEAMLSSLVTRANDKEKDMLAMHGV 60
            MN++ KE+NKE +SRLL+L+  H     S NQ  E+E M         D  +  +A+   
Sbjct: 258  MNNISKERNKETMSRLLTLVNDHFSQFLSFNQKNEIETM-------NQDLSRSAIAVFAG 310

Query: 61   NGKDSNIVTENAVNDLNFKEKVPLPVDSLMQNK-PLEASKPGPPGYRSRGVLLPLLDPHK 119
               + N      VN +      P   DS +  K   E++  G    RSR  +LPLLD HK
Sbjct: 311  TSSEEN------VNQMT----QPSNGDSFLAKKLTSESTHRGAAYLRSRLPMLPLLDLHK 360

Query: 120  VHDVDSLPSPTRETTPSVPVQRALVVGDGMVKSWAAAAKLSHNAEVHKTPHYETDALRAF 179
             HD DSLPSPTRETTPS+PV         MV+      + S   E  K   YE+DA +A 
Sbjct: 361  DHDADSLPSPTRETTPSLPVNGR----HTMVRPGFPVGRESQTTEGAKVYSYESDARKAV 416

Query: 180  SSYQQKFGRNSFFMNSELPSPTPSEESGDGDGDTGGEISSATAV------------DQPK 227
            S+YQQKFG NS F   +LPSPTPS E  DG+GD GGE+SS+               D P 
Sbjct: 417  STYQQKFGLNSVFKTDDLPSPTPSGEPNDGNGDVGGEVSSSVVKSSNPGSHLIYGQDVPL 476

Query: 228  PVNMPTLGQQPVSSQPMDISQPMDISSVQALTTANNSAPASSGYNPVVKPNPVVKAPIKS 287
            P N  +    PV++       P  +S + A+     SAP +S        +  VK   KS
Sbjct: 477  PSNFNSRSM-PVANSVSSTVPPHHLS-IHAI-----SAPTAS--------DQTVKPSAKS 521

Query: 288  RDPRLRFASSNALNLN--HQPAPILHNAPKVEPVGRVMSSRKQKTVEEPVLDGPALKRQR 345
            RDPRLR A  +A N+      +    N  KVE    +++ RKQK  +E ++DGPA KRQ+
Sbjct: 522  RDPRLRLAKPDAANVTIYSYSSGDARNLSKVELSADLVNPRKQKAADEFLIDGPAWKRQK 581

Query: 346  NGFENSGVVRDEKNIYGSGGWLEDTDMFEPQIMNRNLLVDSAESNSRKLDNGATSPITSG 405
            +         D     G+GGWLEDT+       +  LL    ES  R ++NG TS  +S 
Sbjct: 582  SD-------TDAPKAAGTGGWLEDTE-------SSGLL--KLESKPRLIENGVTSMTSSV 625

Query: 406  TPNVVVSGNEPAPATTPSTTVSLPALLKDIAVNPTMLLNILKMGQQQKLAADAQQKSNDS 465
             P   VS ++    T  + T SL +LLKDIAVNPTMLLN+LKMG++QK+   A QK  D 
Sbjct: 626  MPTSAVSVSQKV-RTASTDTASLQSLLKDIAVNPTMLLNLLKMGERQKVPEKAIQKPMDP 684

Query: 466  SMNTMHPPIPSSIPPVSVTCSIP-----------SGIL-----SKPMDELGKVRMKPRDP 509
                   P  S  P VS   SIP           SG+L     + P  E G +RMKPRDP
Sbjct: 685  R-RAAQLPGSSVQPGVSTPLSIPASNALAANSLNSGVLQDSSQNAPAAESGSIRMKPRDP 743

Query: 510  RRVLHGNALQRS-GSLGPEFKTDGPSAPCT---QGSKENLNFQKQLGAPE-------AKP 558
            RR+LHG+ LQR+  S+  + K + PS   T   +G  E+L    QL   +       +K 
Sbjct: 744  RRILHGSTLQRTDSSMEKQTKVNDPSTLGTLTMKGKAEDLETPPQLDPRQNISQNGTSKM 803

Query: 559  VLSQSVLQ---PDITQQFTKNLKHIADFMSVSQPLTSEPMVSQNSPIQPGQIKSGADMKA 615
             +S  +L    PD + QFTKNLK IAD + VSQ L + P     + +   Q+K+  D+K 
Sbjct: 804  KISGELLSGKTPDFSTQFTKNLKSIADMVVVSQQLGNPP-----ASMHSVQLKTERDVKH 858

Query: 616  VVTN-----HDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKAAIQKERTRRL 670
              +N      D   +       AGP  +   ++WGDVEHLFEGYDD Q+ AIQ+ER RRL
Sbjct: 859  NPSNPNAQDEDVSVSAASVTAAAGPTRS--MNSWGDVEHLFEGYDDIQRVAIQRERVRRL 916

Query: 671  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 730
            EEQ KMF+++KL LVLD+DHTLLNSAKF+EV+  H+EILRKKEEQDREKP+RHLFRF HM
Sbjct: 917  EEQNKMFASQKLSLVLDIDHTLLNSAKFNEVESRHEEILRKKEEQDREKPYRHLFRFLHM 976

Query: 731  GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 790
            GMWTKLRPGIW FLE+ASKL+E+HLYTMGNKLY TEMAK+LDPKGVLF GRVIS+GDDGD
Sbjct: 977  GMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYVTEMAKLLDPKGVLFNGRVISKGDDGD 1036

Query: 791  PFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLL 850
            P DGDERVPKSKDLEGV+GMES+VVIIDDSVRVWP +K+NLI VERY YFPCSRRQFGLL
Sbjct: 1037 PLDGDERVPKSKDLEGVMGMESSVVIIDDSVRVWPQHKMNLIAVERYLYFPCSRRQFGLL 1096

Query: 851  GPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQRKILAGCRI 910
            GPSLLE+D DE  E+GTLASSL VIE++H+ FFSH SLD+VDVRNILA+EQRKILAGCRI
Sbjct: 1097 GPSLLELDRDEVPEEGTLASSLAVIEKIHQNFFSHTSLDEVDVRNILASEQRKILAGCRI 1156

Query: 911  VFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFV 970
            VFSR+ PVGEA PHLHPLWQTAEQFGAVCT  +D+ VTHVV NSLGTDKVNWAL+ GRFV
Sbjct: 1157 VFSRIIPVGEAKPHLHPLWQTAEQFGAVCTTQVDEHVTHVVTNSLGTDKVNWALTRGRFV 1216

Query: 971  VHPGWVEASALLYRRANEQDFAIKP 995
            VHPGWVEASA LY+RANE  +AI P
Sbjct: 1217 VHPGWVEASAFLYQRANENLYAINP 1241


>gi|77551160|gb|ABA93957.1| NLI interacting factor-like phosphatase family protein, expressed
            [Oryza sativa Japonica Group]
          Length = 1272

 Score =  838 bits (2165), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 506/1046 (48%), Positives = 657/1046 (62%), Gaps = 109/1046 (10%)

Query: 6    KEQNKEILSRLLSLIKSHEPPLFSSNQIKEMEAMLSSLVTRANDKEKDMLAMHGVNGKDS 65
            +EQ K +L +LL  IK+    + + +Q  E+++ +  LV    +  KD       NG ++
Sbjct: 279  REQTKNMLLKLLFHIKNRYSDMLTPDQRDELDSRVRQLVF---EDGKD-----NANGPNA 330

Query: 66   NIVTENAVNDLNFKEKVPLPVDSLMQNKPLEASKPGPPGYRSRGVLLPLLDPHKVHDVDS 125
                  A +     E+  LP +S   N   +   P     ++R ++ PLLD H  +D +S
Sbjct: 331  TSTNAAAPSGQVLSER--LPFESGAGNSFSKVEIPA----KNR-MVSPLLDLHADYDENS 383

Query: 126  LPSPTRETTPSVPVQRALVVGDGMVKSWAAAAKLSHNAEVHKTPHYET--DALRAFSSYQ 183
            LPSPTR++ P   V +   +G G +        +    E  K   Y++  DAL+A   YQ
Sbjct: 384  LPSPTRDSKPPFDVPKP--IGYGALPMAPDRPSVLERVEPAKNSSYQSFNDALKAVCYYQ 441

Query: 184  QKFGRNSFFMNSELPSPTPS---EESGDGDGDTGGEISSATAVDQPKPVNMPTLGQQPVS 240
            QK G+ S F + +LPSPTPS   ++SGD  GD  GE+SS +A ++   + +P + Q P  
Sbjct: 442  QKHGQKSNFASDDLPSPTPSGDGDKSGDKGGDVFGEVSSFSASNK---IALPIVNQMP-- 496

Query: 241  SQPMDISQPMDISSVQALTTANNSAPASSGY-----NPVVKPNPVVKAPIKSRDPRLRF- 294
                  S+P  +SS      +++ A    GY     N V   N ++KA  KSRDPRL+F 
Sbjct: 497  ------SRPSTVSS-----NSDSFAGGPPGYAKQIENSVSGSNHLLKATAKSRDPRLKFL 545

Query: 295  -------ASSNALNLNHQPAPILHNAPKVEPVGRVMSSRKQKTVEEPVLDGPALKRQRNG 347
                   A +N      +P P   +  +    G  ++SRK K V+EP++D  ALKR R  
Sbjct: 546  NRDTGGVADANRRVNFAEPNP---SKDRTMGGGVSINSRKNKAVDEPMVDENALKRSR-- 600

Query: 348  FENSGVVRDEKNI--YGSGGWLED--------TDMFEPQIMNRNLLVDSAESNSRKLDNG 397
                GV+ + +++   G GGW +D        +D F+P   N+N  + +  + +  +   
Sbjct: 601  ----GVIGNLRDMQPTGRGGWAKDGGNISSYSSDGFQP---NQNTRLGNNTTGNHNIRTD 653

Query: 398  AT--------SPITSGTPNVVVS-GNEPAPATTPSTTVSLPALLKDIAVNPTMLLNILKM 448
            +T        +  +  +P +V +     AP T+ +  VSLPA+LKDIAVNPTML+  ++M
Sbjct: 654  STLASNLNNTTNNSGTSPGIVQAPQTNSAPQTSSAPAVSLPAMLKDIAVNPTMLMQWIQM 713

Query: 449  GQQQKLAADAQQKSNDSSMNT------MHPPIPSSIPPVSVTCSIPS-----GILSKPM- 496
             QQ+  A++ QQK   S   T      M  P+  + P  +   ++PS      + S PM 
Sbjct: 714  EQQKMSASEPQQKVTASVGMTSNVTPGMVLPL-GNAPKTTEVAAVPSVRPQVPMQSAPMH 772

Query: 497  --DELGKVRMKPRDPRRVLHGNALQRSGSLGP----EFKTDGPSAPCTQGSKENLNFQKQ 550
              ++ G +RMKPRDPRR+LH N +Q++ ++ P    + K++G + P +Q SK++L  Q Q
Sbjct: 773  SQNDTGVIRMKPRDPRRILHSNIVQKNDTVPPVGVEQAKSNGTAPPDSQSSKDHLLNQDQ 832

Query: 551  LGAPEAKPVLSQSVLQPDI-TQQFTKNLKHIADFMSVSQPLTSEPMVSQNSPIQPGQIKS 609
                  K    Q++  P +      + +   A+ +S SQ   +  M    +  Q     +
Sbjct: 833  ------KAEQLQAIALPSLPVTSSARPVTMNANPVSNSQLAATALMPPHGNTKQTSSSVN 886

Query: 610  GADMK-AVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKAAIQKERTR 668
             AD + A   N  +    T +GP   P    P S +GDV+HL +GYDDQQKA IQKER R
Sbjct: 887  KADPRLAAGQNESNDDAATSTGPVTAPDAVPPASPYGDVDHLLDGYDDQQKALIQKERAR 946

Query: 669  RLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP 728
            R++EQ KMF+ARKLCLVLDLDHTLLNSAKF EVD +H EILRKKEEQDRE+  RHLF F 
Sbjct: 947  RIKEQHKMFAARKLCLVLDLDHTLLNSAKFIEVDHIHGEILRKKEEQDRERAERHLFCFN 1006

Query: 729  HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD 788
            HMGMWTKLRPGIW FLE+ASKL+E+HLYTMGNK+YATEMAKVLDP G LFAGRVISRGDD
Sbjct: 1007 HMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKVYATEMAKVLDPTGTLFAGRVISRGDD 1066

Query: 789  GDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFG 848
            GDPFD DERVPKSKDL+GVLGMESAVVIIDDSVRVWPHNK NLIVVERYTYFPCSRRQFG
Sbjct: 1067 GDPFDSDERVPKSKDLDGVLGMESAVVIIDDSVRVWPHNKHNLIVVERYTYFPCSRRQFG 1126

Query: 849  LLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQRKILAGC 908
            L GPSLLEID DER EDGTLASSL VIER+HK FFSH +L+D DVR+ILA+EQ++IL GC
Sbjct: 1127 LPGPSLLEIDRDERPEDGTLASSLAVIERIHKNFFSHPNLNDADVRSILASEQQRILGGC 1186

Query: 909  RIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGR 968
            RIVFSR+FPVGEANPH+HPLWQTAEQFGAVCT  IDD+VTHVVANSLGTDKVNWALSTGR
Sbjct: 1187 RIVFSRIFPVGEANPHMHPLWQTAEQFGAVCTNQIDDRVTHVVANSLGTDKVNWALSTGR 1246

Query: 969  FVVHPGWVEASALLYRRANEQDFAIK 994
            FVVHPGWVEASALLYRRA+E DFA+K
Sbjct: 1247 FVVHPGWVEASALLYRRASELDFAVK 1272


>gi|222616055|gb|EEE52187.1| hypothetical protein OsJ_34058 [Oryza sativa Japonica Group]
          Length = 1267

 Score =  837 bits (2163), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 506/1046 (48%), Positives = 657/1046 (62%), Gaps = 109/1046 (10%)

Query: 6    KEQNKEILSRLLSLIKSHEPPLFSSNQIKEMEAMLSSLVTRANDKEKDMLAMHGVNGKDS 65
            +EQ K +L +LL  IK+    + + +Q  E+++ +  LV    +  KD       NG ++
Sbjct: 274  REQTKNMLLKLLFHIKNRYSDMLTPDQRDELDSRVRQLVF---EDGKD-----NANGPNA 325

Query: 66   NIVTENAVNDLNFKEKVPLPVDSLMQNKPLEASKPGPPGYRSRGVLLPLLDPHKVHDVDS 125
                  A +     E+  LP +S   N   +   P     ++R ++ PLLD H  +D +S
Sbjct: 326  TSTNAAAPSGQVLSER--LPFESGAGNSFSKVEIPA----KNR-MVSPLLDLHADYDENS 378

Query: 126  LPSPTRETTPSVPVQRALVVGDGMVKSWAAAAKLSHNAEVHKTPHYET--DALRAFSSYQ 183
            LPSPTR++ P   V +   +G G +        +    E  K   Y++  DAL+A   YQ
Sbjct: 379  LPSPTRDSKPPFDVPKP--IGYGALPMAPDRPSVLERVEPAKNSSYQSFNDALKAVCYYQ 436

Query: 184  QKFGRNSFFMNSELPSPTPS---EESGDGDGDTGGEISSATAVDQPKPVNMPTLGQQPVS 240
            QK G+ S F + +LPSPTPS   ++SGD  GD  GE+SS +A ++   + +P + Q P  
Sbjct: 437  QKHGQKSNFASDDLPSPTPSGDGDKSGDKGGDVFGEVSSFSASNK---IALPIVNQMP-- 491

Query: 241  SQPMDISQPMDISSVQALTTANNSAPASSGY-----NPVVKPNPVVKAPIKSRDPRLRF- 294
                  S+P  +SS      +++ A    GY     N V   N ++KA  KSRDPRL+F 
Sbjct: 492  ------SRPSTVSS-----NSDSFAGGPPGYAKQIENSVSGSNHLLKATAKSRDPRLKFL 540

Query: 295  -------ASSNALNLNHQPAPILHNAPKVEPVGRVMSSRKQKTVEEPVLDGPALKRQRNG 347
                   A +N      +P P   +  +    G  ++SRK K V+EP++D  ALKR R  
Sbjct: 541  NRDTGGVADANRRVNFAEPNP---SKDRTMGGGVSINSRKNKAVDEPMVDENALKRSR-- 595

Query: 348  FENSGVVRDEKNI--YGSGGWLED--------TDMFEPQIMNRNLLVDSAESNSRKLDNG 397
                GV+ + +++   G GGW +D        +D F+P   N+N  + +  + +  +   
Sbjct: 596  ----GVIGNLRDMQPTGRGGWAKDGGNISSYSSDGFQP---NQNTRLGNNTTGNHNIRTD 648

Query: 398  AT--------SPITSGTPNVVVS-GNEPAPATTPSTTVSLPALLKDIAVNPTMLLNILKM 448
            +T        +  +  +P +V +     AP T+ +  VSLPA+LKDIAVNPTML+  ++M
Sbjct: 649  STLASNLNNTTNNSGTSPGIVQAPQTNSAPQTSSAPAVSLPAMLKDIAVNPTMLMQWIQM 708

Query: 449  GQQQKLAADAQQKSNDSSMNT------MHPPIPSSIPPVSVTCSIPS-----GILSKPM- 496
             QQ+  A++ QQK   S   T      M  P+  + P  +   ++PS      + S PM 
Sbjct: 709  EQQKMSASEPQQKVTASVGMTSNVTPGMVLPL-GNAPKTTEVAAVPSVRPQVPMQSAPMH 767

Query: 497  --DELGKVRMKPRDPRRVLHGNALQRSGSLGP----EFKTDGPSAPCTQGSKENLNFQKQ 550
              ++ G +RMKPRDPRR+LH N +Q++ ++ P    + K++G + P +Q SK++L  Q Q
Sbjct: 768  SQNDTGVIRMKPRDPRRILHSNIVQKNDTVPPVGVEQAKSNGTAPPDSQSSKDHLLNQDQ 827

Query: 551  LGAPEAKPVLSQSVLQPDI-TQQFTKNLKHIADFMSVSQPLTSEPMVSQNSPIQPGQIKS 609
                  K    Q++  P +      + +   A+ +S SQ   +  M    +  Q     +
Sbjct: 828  ------KAEQLQAIALPSLPVTSSARPVTMNANPVSNSQLAATALMPPHGNTKQTSSSVN 881

Query: 610  GADMK-AVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKAAIQKERTR 668
             AD + A   N  +    T +GP   P    P S +GDV+HL +GYDDQQKA IQKER R
Sbjct: 882  KADPRLAAGQNESNDDAATSTGPVTAPDAVPPASPYGDVDHLLDGYDDQQKALIQKERAR 941

Query: 669  RLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP 728
            R++EQ KMF+ARKLCLVLDLDHTLLNSAKF EVD +H EILRKKEEQDRE+  RHLF F 
Sbjct: 942  RIKEQHKMFAARKLCLVLDLDHTLLNSAKFIEVDHIHGEILRKKEEQDRERAERHLFCFN 1001

Query: 729  HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD 788
            HMGMWTKLRPGIW FLE+ASKL+E+HLYTMGNK+YATEMAKVLDP G LFAGRVISRGDD
Sbjct: 1002 HMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKVYATEMAKVLDPTGTLFAGRVISRGDD 1061

Query: 789  GDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFG 848
            GDPFD DERVPKSKDL+GVLGMESAVVIIDDSVRVWPHNK NLIVVERYTYFPCSRRQFG
Sbjct: 1062 GDPFDSDERVPKSKDLDGVLGMESAVVIIDDSVRVWPHNKHNLIVVERYTYFPCSRRQFG 1121

Query: 849  LLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQRKILAGC 908
            L GPSLLEID DER EDGTLASSL VIER+HK FFSH +L+D DVR+ILA+EQ++IL GC
Sbjct: 1122 LPGPSLLEIDRDERPEDGTLASSLAVIERIHKNFFSHPNLNDADVRSILASEQQRILGGC 1181

Query: 909  RIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGR 968
            RIVFSR+FPVGEANPH+HPLWQTAEQFGAVCT  IDD+VTHVVANSLGTDKVNWALSTGR
Sbjct: 1182 RIVFSRIFPVGEANPHMHPLWQTAEQFGAVCTNQIDDRVTHVVANSLGTDKVNWALSTGR 1241

Query: 969  FVVHPGWVEASALLYRRANEQDFAIK 994
            FVVHPGWVEASALLYRRA+E DFA+K
Sbjct: 1242 FVVHPGWVEASALLYRRASELDFAVK 1267


>gi|242068555|ref|XP_002449554.1| hypothetical protein SORBIDRAFT_05g019010 [Sorghum bicolor]
 gi|241935397|gb|EES08542.1| hypothetical protein SORBIDRAFT_05g019010 [Sorghum bicolor]
          Length = 1197

 Score =  810 bits (2093), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 489/1010 (48%), Positives = 606/1010 (60%), Gaps = 92/1010 (9%)

Query: 7    EQNKEILSRLLSLIKSHEPPLFSSNQIKEMEAMLSSLVTRANDKEKDMLAMHGVNGKDSN 66
            EQNK  L + L  IK+    + +  Q  E+++ +  LV    D   D     G N     
Sbjct: 258  EQNKTTLLKSLFHIKNRYSDMLTPEQRDELDSRVRKLVFGEKDNVSDPSTSSGTN----- 312

Query: 67   IVTENAVNDLNFKEKVP----LPVDSLMQNKPLEASKPGPPGYRSRGVLLPLLDPHKVHD 122
                 A+N L    +V     LP +S   N      +   P  R    + PLLD H  +D
Sbjct: 313  -----AINVLAPSGQVSSSGGLPFESGAANPFSSLPRLEVPAKR----ISPLLDLHADYD 363

Query: 123  VDSLPSPTRETTPSVPVQRALVVGDGMVKSWAAAAKLSHNAEVHKTPHYET--DALRAFS 180
             +SLPSPTR+  P  PV +   +G G               E  K   Y +  D L+A S
Sbjct: 364  ENSLPSPTRDNAPPFPVPKP--IGFGAFPMVPEKLSFPERVEPAKNSLYPSLNDPLKAVS 421

Query: 181  SYQQKFGRNSFFMNSELPSPTPSEESGDGDGDTGGEI-SSATAVDQPKPVNMPTLGQQPV 239
            SYQQK+G+ S F + +LPSPTPS + G    D GG+I S  ++   PK + +P+  Q P 
Sbjct: 422  SYQQKYGQKSVFPSDDLPSPTPSGDEGKS-ADKGGDIFSEVSSFPVPKSIALPSTSQMPA 480

Query: 240  SSQPMDISQPMDISSVQALTTANNSAPASSGYNPVVKPNPVVKAPIKSRDPRLRFA---S 296
            S       QP  +SS      +     A     PV  PN  +KA  KSRDPRLRF    S
Sbjct: 481  S-------QPSTVSSSGISYASGPPGFAKQIEQPVAGPNHAIKAASKSRDPRLRFLNRDS 533

Query: 297  SNALNLNHQPAPILHNAPKVEPVGRVMSSRKQKTVEEPVLDGPALKRQRNGFENSGVVRD 356
            + A ++N +              G  + +RK K +++P +D   LKR R G  N    RD
Sbjct: 534  AGATDVNRRAN--FSELKDGNLGGASVGNRKHKAIDDPQVDENVLKRFRGGTANP---RD 588

Query: 357  EKNIYGSGGWLEDTDMFEPQIMNRNLLVDSAESNSRKLDNGATSPITSGTPNVVVSGNEP 416
                             +P      L+   A +NS  ++     P  +  P+V  +   P
Sbjct: 589  ----------------LQPTGNPNQLMNIRAPTNSSGINMKTLQPPQTTAPHVSAAPAVP 632

Query: 417  APATTPSTTVSLPALLKDIAVNPTMLLNILKMGQQQKLAADAQQK-SNDSSMNTMHPPI- 474
             P+           LLKDIAVNPT+L+++++M  Q+K A++ Q   S+  S N +   + 
Sbjct: 633  VPSM----------LLKDIAVNPTLLMHLIQMEHQKKSASETQGGMSSGMSNNGIAGMVF 682

Query: 475  -PSSIPPVSVTCSIPSGILSKP--------MDELGKVRMKPRDPRRVLHGNALQRSGSLG 525
             P + P  +    +PS     P         ++ G +RMKPRDPRR+LH N  Q+S ++ 
Sbjct: 683  TPGNAPKTTEAAQVPSVRPQVPAQTPSLNSQNDGGILRMKPRDPRRILHNNVAQKSDAMV 742

Query: 526  PE-FKTDGPSAPCTQGSKENLNFQKQLGAPEAKPVLSQSVLQPDITQQFTKNLKHIADFM 584
             E  KT+G + P +QG+K+      Q  +  ++P L  SV +P     FT N KH+ D +
Sbjct: 743  LEQVKTNGITQPDSQGTKD------QTSSMPSQPTLPSSVARP-----FT-NTKHV-DPV 789

Query: 585  SVSQPLTSEPMVSQNSPIQPGQIKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQSAW 644
            S SQ   +  M      +  G I       AV  N  +    T            P S W
Sbjct: 790  SNSQLAATAIMAPTQQAL--GSINKVDPRLAVEQNGQNADATTTDASATELEATQPVSPW 847

Query: 645  GDVEHLFEGYDDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPV 704
            G+++HL +GYDD+QKA IQKER RR+ EQ KMFSARKLCLVLDLDHTLLNSAKF EV+P+
Sbjct: 848  GNLDHLLDGYDDKQKALIQKERARRITEQHKMFSARKLCLVLDLDHTLLNSAKFIEVEPI 907

Query: 705  HDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYA 764
            H+E+LRKKEEQDR  P RHL+RF HM MWTKLRPGIW FLE+AS LFE+HLYTMGNKLYA
Sbjct: 908  HEEMLRKKEEQDRTLPERHLYRFHHMNMWTKLRPGIWNFLEKASNLFELHLYTMGNKLYA 967

Query: 765  TEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVW 824
            TEMAKVLDP G LFAGRVISRGDDGDPFD DERVPKSKDL+GVLGMESAVVIIDDSVRVW
Sbjct: 968  TEMAKVLDPTGTLFAGRVISRGDDGDPFDSDERVPKSKDLDGVLGMESAVVIIDDSVRVW 1027

Query: 825  PHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFS 884
            PHN+ NLIVVERYTYFPCSRRQFGL GPSLLEID DER EDGTLASSL VIER+H  FFS
Sbjct: 1028 PHNRHNLIVVERYTYFPCSRRQFGLPGPSLLEIDRDERPEDGTLASSLAVIERIHHNFFS 1087

Query: 885  HQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHID 944
            H +L++ DVR+ILA+EQR+ILAGCRIVFSRVFPVG+A+PHLHPLWQTAEQFGAVCT  +D
Sbjct: 1088 HPNLNEADVRSILASEQRRILAGCRIVFSRVFPVGDASPHLHPLWQTAEQFGAVCTNLVD 1147

Query: 945  DQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAIK 994
            D+VTHVVANS GTDKVNWALS G+FVVHPGWVEASALLYRRANE DFA+K
Sbjct: 1148 DRVTHVVANSPGTDKVNWALSKGKFVVHPGWVEASALLYRRANEHDFAVK 1197


>gi|413920930|gb|AFW60862.1| hypothetical protein ZEAMMB73_799152, partial [Zea mays]
          Length = 1234

 Score =  803 bits (2074), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 486/1022 (47%), Positives = 616/1022 (60%), Gaps = 97/1022 (9%)

Query: 6    KEQNKEILSRLLSLIKSHEPPLFSSNQIKEMEAMLSSLVTRANDKEKDMLAMHGVNGKDS 65
            +EQNK  L +LL  IK+    + +  Q +EM++ +  LV    D   D     G +  + 
Sbjct: 275  REQNKTTLLKLLFHIKNRYSDMLTPEQREEMDSRVRKLVFGEKDNVSDPSTSCGTSAINV 334

Query: 66   NIVTENAVNDLNFKEKVPLPVDSLMQNKPLEASKPGPPGYRSRGVLLPLLDPHKVHDVDS 125
            +  +    N         LP +S   N      +   P  R+     PLL+ H  +D +S
Sbjct: 335  SAPSGQVSNTGG------LPFESGAANLFSSLPRLEVPAKRNS----PLLNLHADYDENS 384

Query: 126  LPSPTRETTPSVPVQRALVVGDGMVKSWAAAAKLSHNAEVHKTPHYE--TDALRAFSSYQ 183
            LPSPTR+  P  P  +   +G G               E  K   Y    D L+A SSYQ
Sbjct: 385  LPSPTRDNAPPFPALKP--IGFGAFPMVPEKLSFLDRVEPTKNSLYPPLNDPLKAVSSYQ 442

Query: 184  QKFGRNSFFMNSELPSPTPSEESGDGDGDTGGEI-SSATAVDQPKPVNMPTLGQQPVSSQ 242
            QK+G+ S + + +LPSPTPS + G    D GG+I S  ++   PK + +P+  Q P S  
Sbjct: 443  QKYGQKSVYPSDDLPSPTPSGDEGK-PADKGGDIFSDVSSFPVPKSIVLPSTSQMPASQP 501

Query: 243  P---------------MDISQPMDISSVQALTTANNSAPASSGYNPVVKPNPVVKAPIKS 287
                            M  SQP+ +SS      +     A         PN  +KA  KS
Sbjct: 502  STVSSSSISYASSTSQMAASQPITVSSSGISYASGPPGFAKQIEQSTAGPNHAIKAASKS 561

Query: 288  RDPRLRFA---SSNALNLNHQPAPILHNAPKVEPVGRV-MSSRKQKTVEEPVLDGPALKR 343
            RDPRLRF    S+ A ++N +      +  K   +G V + +RKQK V++P +D  ALKR
Sbjct: 562  RDPRLRFLNRDSAGATDVNWRAN---FSELKDGNLGGVSVGNRKQKAVDDPQVDDNALKR 618

Query: 344  QRNGFENSGVVRDEKNIYGSGGWLEDTDMFEPQIMNRNLLVDSAESNSRKLDNGATSPIT 403
             R G  N      ++++  +G           Q+MN      S+  N + L      P  
Sbjct: 619  FRGGIAN------QRDMQPTGN--------PNQLMNIRAPTHSSSINMKTL-----QPPQ 659

Query: 404  SGTPNVVVSGNEPAPATTPSTTVSLPALLKDIAVNPTMLLNILKMGQQQKLAADAQQKSN 463
            +  P+V  +   P P          P LLKDIAVNP +L+++++M  Q+K A+++Q   +
Sbjct: 660  TTAPHVSAAPAVPLP----------PMLLKDIAVNPALLMHLIQMEHQKKSASESQGGMS 709

Query: 464  DSSMNTMHPPI---PSSIPPVSVTCSIPS-----GILSKPMD---ELGKVRMKPRDPRRV 512
                N     +   P + P ++    +PS      + + P++   + G VRMKPRDPRR+
Sbjct: 710  SGMTNNGIAGMVFTPGNAPKITEAAQVPSVRPQVPVQTPPLNSQNDGGIVRMKPRDPRRI 769

Query: 513  LHGNALQRSGSLGPE-FKTDGPSAPCTQGSKENLNFQKQLGAPEAKPVLSQSVLQPDITQ 571
            LH N  Q+S ++  E  K +G + P +QG+K+           +  PV SQ  L   I +
Sbjct: 770  LHNNIAQKSDAMSLEQVKNNGTTQPDSQGTKD-----------QTTPVPSQPALPSSIAR 818

Query: 572  QFTKNLKHIADFMSVSQPLTSEPMVSQNSPIQPGQIKSGADMKAVVTNHDDKQTGTGSGP 631
             F+ + KH+ D +S SQ   +  M    +P Q     +  D +  V  +      T +G 
Sbjct: 819  PFS-SAKHV-DPVSNSQLAATAIM----APTQALSSVNKVDPRLAVEQNGQNADATTNGA 872

Query: 632  EAGPVGA-HPQSAWGDVEHLFEGYDDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDH 690
             A  + A  P S WGDV+HL +GYDDQQKA IQKER RR+ EQ KMFSARKLCLVLDLDH
Sbjct: 873  SATTLEATQPVSPWGDVDHLLDGYDDQQKALIQKERARRITEQHKMFSARKLCLVLDLDH 932

Query: 691  TLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKL 750
            TLLNSAKF EV+P+H+E+LRKKEEQDR  P RHL+RF HM MWTKLRPGIW FL++AS L
Sbjct: 933  TLLNSAKFIEVEPIHEEMLRKKEEQDRTLPERHLYRFHHMNMWTKLRPGIWNFLQKASNL 992

Query: 751  FEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM 810
            FE+HLYTMGNKLYATEMAKVLDP G LFAGRVISRGDDGDPFD DERVPKSKDL+GVLGM
Sbjct: 993  FELHLYTMGNKLYATEMAKVLDPTGTLFAGRVISRGDDGDPFDSDERVPKSKDLDGVLGM 1052

Query: 811  ESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLAS 870
            ESAVVIIDDSVRVWPHN+ NLIVVERYTYFPCSRRQFGL GPSLLEID DER EDGTLAS
Sbjct: 1053 ESAVVIIDDSVRVWPHNRHNLIVVERYTYFPCSRRQFGLPGPSLLEIDRDERPEDGTLAS 1112

Query: 871  SLGVIERLHKIFFSHQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQ 930
            SL VIER+H  FFSH +L++ DVR+ILA+EQR+IL GCRIVFSRVFPVG+A+PHLHPLWQ
Sbjct: 1113 SLAVIERIHHNFFSHPNLNEADVRSILASEQRRILTGCRIVFSRVFPVGDASPHLHPLWQ 1172

Query: 931  TAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQD 990
            TAEQFGAVCT  +DD+VTH+VANS GTDKVNWALS G+FVVHPGWVEASALLYRRANE D
Sbjct: 1173 TAEQFGAVCTNLVDDRVTHIVANSPGTDKVNWALSKGKFVVHPGWVEASALLYRRANEHD 1232

Query: 991  FA 992
            FA
Sbjct: 1233 FA 1234


>gi|218185830|gb|EEC68257.1| hypothetical protein OsI_36281 [Oryza sativa Indica Group]
          Length = 1255

 Score =  796 bits (2055), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 489/1023 (47%), Positives = 628/1023 (61%), Gaps = 116/1023 (11%)

Query: 6    KEQNKEILSRLLSLIKSHEPPLFSSNQIKEMEAMLSSLVTRANDKEKDMLAMHGVNGKDS 65
            +EQ K +L +LL  IK+    + + +Q  E+++ +  LV    +  KD       NG ++
Sbjct: 315  REQTKNMLLKLLFHIKNRYSDMLTPDQRDELDSRVRQLVF---EDGKD-----NANGPNA 366

Query: 66   NIVTENAVNDLNFKEKVPLPVDSLMQNKPLEASKPGPPGYRSRGVLLPLLDPHKVHDVDS 125
                  A +     E+  LP +S   N   +   P     ++R ++ PLLD H  +D +S
Sbjct: 367  TSTNAAAPSGQVLSER--LPFESGAGNSFSKVEIPA----KNR-MVSPLLDLHADYDENS 419

Query: 126  LPSPTRETTPSVPVQRALVVGDGMVKSWAAAAKLSHNAEVHKTPHYET--DALRAFSSYQ 183
            LPSPTR++ P   V +   +G G +        +    E  K   Y++  DAL+A   YQ
Sbjct: 420  LPSPTRDSAPPFDVPKP--IGYGALPMAPDRPSVLERVEPAKNSSYQSFNDALKAVCYYQ 477

Query: 184  QKFGRNSFFMNSELPSPTPS---EESGDGDGDTGGEISSATAVDQPKPVNMPTLGQQPVS 240
            QK G+ S F + +LPSPTPS   ++SGD  GD  GE+SS +A ++   + +P + Q P  
Sbjct: 478  QKHGQKSNFASDDLPSPTPSGDGDKSGDKGGDVFGEVSSFSASNK---IVLPIVNQMP-- 532

Query: 241  SQPMDISQPMDISSVQALTTANNSAPASSGY-----NPVVKPNPVVKAPIKSRDPRLRFA 295
                  S+P  +SS      +++ A    GY     N V   N ++KA  KSRDPRL+F 
Sbjct: 533  ------SRPSTVSS-----NSDSFAGGPPGYAKQIENSVSGSNHLLKATAKSRDPRLKFL 581

Query: 296  SSNALNLNHQPAPILHNAPKVEPVGRVMSSRKQKTVEE--PVLDGPALKRQRNGFENSGV 353
            + +                     G V  + ++    E  P+  GP +            
Sbjct: 582  NRD--------------------TGGVADANRRVNFAEPNPLKIGPWVVE---------- 611

Query: 354  VRDEKNIYGSGGWLEDTDMFEPQIMNRNLLVDS--AESNSRKLDNGATSPITSGTPNVVV 411
                   Y S    ++T +      N N+  DS  A + +   +N  TSP     P    
Sbjct: 612  -------YQSIAPNQNTRLGNNTTGNHNIRTDSTLASNLNNTTNNSGTSPGIVQAPQ--- 661

Query: 412  SGNEPAPATTPSTTVSLPALLKDIAVNPTMLLNILKMGQQQKLAADAQQKSNDSSMNT-- 469
            + + P  ++ P+  VSLPA+LKDIAVNPTML+  ++M   +  A++ QQK   S   T  
Sbjct: 662  TNSAPQTSSAPA--VSLPAMLKDIAVNPTMLMQWIRMEHHKMSASEPQQKVTASVGMTSN 719

Query: 470  ----MHPPIPSSIPPVSVTCSIPS-----GILSKPM---DELGKVRMKPRDPRRVLHGNA 517
                M  P+  + P  +   ++PS      + S PM   ++ G +RMKPRDPRR+LH N 
Sbjct: 720  VTPGMVLPL-GNAPKTTEVAAVPSVRPQVPMQSAPMHSQNDTGVIRMKPRDPRRILHSNI 778

Query: 518  LQRSGSLGP----EFKTDGPSAPCTQGSKENLNFQKQLGAPEAKPVLSQSVLQPDI-TQQ 572
            +Q++ ++ P    + K++G + P +Q SK++L  Q Q      K    Q++  P +    
Sbjct: 779  VQKNDTVPPVGVEQAKSNGTAPPDSQSSKDHLLNQDQ------KAEQLQAIALPSLPVTS 832

Query: 573  FTKNLKHIADFMSVSQPLTSEPMVSQNSPIQPGQIKSGADMK-AVVTNHDDKQTGTGSGP 631
              + +   A+ +S SQ   +  M    +  Q     + AD + A   N  +    T +GP
Sbjct: 833  SARPVTMNANPVSNSQLAATALMPPHGNTKQTSSSVNKADPRLAAGQNESNDDAATSTGP 892

Query: 632  EAGPVGAHPQSAWGDVEHLFEGYDDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHT 691
               P    P S +GDV+HL +GYDDQQKA IQKER RR++EQ KMF+ARKLCLVLDLDHT
Sbjct: 893  VTAPDAVPPASPYGDVDHLLDGYDDQQKALIQKERARRIKEQHKMFAARKLCLVLDLDHT 952

Query: 692  LLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLF 751
            LLNSAKF EVD +H EILRKKEEQDRE+  RHLF F HMGMWTKLRPGIW FLE+ASKL+
Sbjct: 953  LLNSAKFIEVDHIHGEILRKKEEQDRERAERHLFCFNHMGMWTKLRPGIWNFLEKASKLY 1012

Query: 752  EMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGME 811
            E+HLYTMGNK+YATEMAKVLDP G LFAGRVISRGDDGDPFD DERVPKSKDL+GVLGME
Sbjct: 1013 ELHLYTMGNKVYATEMAKVLDPTGTLFAGRVISRGDDGDPFDSDERVPKSKDLDGVLGME 1072

Query: 812  SAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLASS 871
            SAVVIIDDSVRVWPHNK NLIVVERYTYFPCSRRQFGL GPSLLEID DER EDGTLASS
Sbjct: 1073 SAVVIIDDSVRVWPHNKHNLIVVERYTYFPCSRRQFGLPGPSLLEIDRDERPEDGTLASS 1132

Query: 872  LGVIERLHKIFFSHQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQT 931
            L VIER+HK FFSH +L+D DVR+ILA+EQ++IL GCRIVFSR+FPVGEANPH+HPLWQT
Sbjct: 1133 LTVIERIHKNFFSHPNLNDADVRSILASEQQRILGGCRIVFSRIFPVGEANPHMHPLWQT 1192

Query: 932  AEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDF 991
            AEQFGAVCT  IDD+VTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRA+E DF
Sbjct: 1193 AEQFGAVCTNQIDDRVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRASELDF 1252

Query: 992  AIK 994
            A+K
Sbjct: 1253 AVK 1255


>gi|357156660|ref|XP_003577532.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            3-like [Brachypodium distachyon]
          Length = 1259

 Score =  786 bits (2030), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 493/1046 (47%), Positives = 625/1046 (59%), Gaps = 112/1046 (10%)

Query: 8    QNKEILSRLLSLIKSHEPPLFSSNQIKEMEAMLSSLVTRANDKEKDMLAMHGVNGKDSNI 67
            QNKE+L +LL  +++    + + +Q  E+++ +  L     ++  D        G +++ 
Sbjct: 267  QNKEMLLKLLFHLRNRYSDMLTPDQRVELDSRVRQLAFVDGEENTD--------GPNASC 318

Query: 68   VTENAVNDLNFKEKVP---LPVDSLMQNKPLEASKPGPPGYRSRGVLLPLLDPHKVHDVD 124
             T N+ N +    +VP   LP +S   N    +S P         ++ PLLD H  HD +
Sbjct: 319  ST-NSTNVVVPTGQVPSERLPFESGATNPFSGSSLPWLETQTKNRMVSPLLDLHADHDEN 377

Query: 125  SLPSPTRETTPSVPVQRALVVGDGMVKSWAAAAKLSHNAEVHKTPHYET--DALRAFSSY 182
            SLPSPTR+  P   V + +  G            L+  AE  K   Y +  D+L   SSY
Sbjct: 378  SLPSPTRDNAPQFSVPKPIGFG---AFPMGPDRSLTERAEPSKKNLYPSVNDSLD-VSSY 433

Query: 183  QQKFGRNSFFMNSELPSPTPS---EESGDGDGDTGGEISSATAVDQPKPVNMPTLGQQPV 239
            +QK+ + S F N +LPSPTPS   ++S D DGD  GEISS ++ ++     +P++ Q P 
Sbjct: 434  KQKYSQKSNFANDDLPSPTPSGDGDKSEDKDGDMFGEISSFSSSNK---TALPSVSQIPA 490

Query: 240  SSQPMDISQPMDISSVQALTTANNSAPASSGY-----NPVVKPNPVVKAPIKSRDPRLRF 294
            S       +P  +SS      +N S     GY       V  PN  +K   KSRDPRLR+
Sbjct: 491  S-------RPSTVSS------SNGSFSGPPGYAKKIEQSVSGPNLALKPSAKSRDPRLRY 537

Query: 295  ASSNALNLNHQPAPILHNAPKVEPVGRVMSSR-KQKTVEEPVLDGPALKRQRNGFENSGV 353
                   LN  P          EP   +  +  K K V +P++D   +KR R    N   
Sbjct: 538  -------LNRDPGDANRCMNFAEPNASLGGTLGKHKAVGQPLMDENMVKRARGSIGNPRD 590

Query: 354  V-----RDEKNI--YGSGGWL--EDTDMFEPQIMNRNLLVDSAE-SNSRKLDNGA---TS 400
            +     RD  NI  Y S      ++T +      N NL  DS   SN   + N +   T 
Sbjct: 591  LQVPPGRDGSNISFYPSDRVQSNQNTRLDTKTTGNPNLRADSQLLSNVSSITNSSVTSTK 650

Query: 401  PITSGTPNVVVSGNEPAPATTPSTTVSLPALLKDIAVNPTMLLNILKMGQQQKLAADAQQ 460
             + +G P+ V       P T+ + +VSLPA+LKDIAVNPT+L++ ++M QQ++ A++ QQ
Sbjct: 651  TLNAGQPDSV-------PQTSAAPSVSLPAVLKDIAVNPTVLMHWIQMEQQKRSASEPQQ 703

Query: 461  KSN-----------DSSMNTMHPP-------IPSSIPPVSVTCSIPSGILSKPMDELGKV 502
              N           + +   + PP         + IP +   C   +  +    D  G +
Sbjct: 704  TVNTLGGISSGMINNDTAGMVIPPGSALKTADAAQIPSIRPQCPTQTAPVISQTDA-GVI 762

Query: 503  RMKPRDPRRVLHGNALQRSGSLGPE-FKTDGPSAPCTQGSKENL------NFQKQLGAPE 555
            RMKPRDPRR+LH N   ++ +   E  +++G   P +Q SK+N+        Q Q GA  
Sbjct: 763  RMKPRDPRRILHNNTSPKNDTTNSEQARSNGIVLPVSQDSKDNMINREQQAEQLQTGALP 822

Query: 556  AKPVLSQSVLQPDITQQFTKNLKHIADFMSVSQPLTSEPMVSQNSPIQPGQIKSGADMKA 615
            ++PV   ++ +P            + D +S SQ   S  M  Q +    G I       A
Sbjct: 823  SQPVSLSNIARPSTMS------ASMVDPVSNSQLAASSLMAPQQTS---GSINRADPRLA 873

Query: 616  VVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKAAIQKERTRRLEEQKK 675
               N  +    T + P      A P + WGD++ L  GYDDQQKA IQKER RR+ EQ+K
Sbjct: 874  PGQNDPNADAATNASPATTLGAAPPANQWGDLDDLLSGYDDQQKALIQKERARRIMEQQK 933

Query: 676  MFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTK 735
            MFSARKLCLVLDLDHTLLNSAKF EVDP+H+EILRKKEEQDRE+P RHLFR  HM MWTK
Sbjct: 934  MFSARKLCLVLDLDHTLLNSAKFLEVDPIHEEILRKKEEQDRERPERHLFRLHHMSMWTK 993

Query: 736  LRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP---- 791
            LRPGIW FLE+ASKL+E+HLYTMGNKLYATEMAKVLDP G LF GRVISRG DG      
Sbjct: 994  LRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPTGALFEGRVISRGGDGTSRGGD 1053

Query: 792  ---FDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFG 848
               FD D+RVPKSKDL+GVLGMESAVVIIDDSVRVWPHNK N+IVVERYTYFPCSRRQFG
Sbjct: 1054 GDSFDSDDRVPKSKDLDGVLGMESAVVIIDDSVRVWPHNKNNMIVVERYTYFPCSRRQFG 1113

Query: 849  LLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQRKILAGC 908
            L GPSLLEID DER EDGTLASSL VI R+H+ FFSH +L+D DVR+ILA+EQR+ILAGC
Sbjct: 1114 LPGPSLLEIDRDERPEDGTLASSLAVIGRIHQNFFSHPNLNDADVRSILASEQRRILAGC 1173

Query: 909  RIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGR 968
            RIVFSR+FPVGEANPHLHPLWQ+AEQFGAVCT  IDD+VTHVVANSLGTDKVNWAL TGR
Sbjct: 1174 RIVFSRIFPVGEANPHLHPLWQSAEQFGAVCTNQIDDRVTHVVANSLGTDKVNWALQTGR 1233

Query: 969  FVVHPGWVEASALLYRRANEQDFAIK 994
            +VVHPGWVEASALLYRRA+E DFA+K
Sbjct: 1234 YVVHPGWVEASALLYRRASEHDFAVK 1259


>gi|326532556|dbj|BAK05207.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 891

 Score =  767 bits (1980), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 460/923 (49%), Positives = 576/923 (62%), Gaps = 76/923 (8%)

Query: 113 PLLDPHKVHDVDSLPSPTRETTPSVPVQRALVVGDGMVKSWAAAAKLS-HNAEVHKTPHY 171
           PLLD H  +D  SLPSPTR++ P  PV + +  G   V   A     S    E+ K   Y
Sbjct: 4   PLLDLHADYDESSLPSPTRDSAPPFPVPKPIGFG---VFPMAPDRYFSVERVELSKKVLY 60

Query: 172 ET--DALRAFSSYQQKFGRNSFFMNSELPSPTPSEESGDGDGDTGGEISSATAVDQPKPV 229
            +  DAL+  SSY+QK+G+ S F + +LPSPTPS++ GD   D  G I           V
Sbjct: 61  PSVNDALKDVSSYRQKYGQTSTFASYDLPSPTPSDD-GDKSEDKDGGIF----------V 109

Query: 230 NMPTLGQQPVSSQPMDISQPMDISSVQALTTANNSAPASSGY-----NPVVKPNPVVKAP 284
            +P+      S+ P     P    SV  +++ ++ A    GY       V  P+  +K  
Sbjct: 110 EVPSFSDSNKSAPPSGNLLPASRPSV-VISSNDSFAGGPPGYAKQIEQSVSGPSHALKPS 168

Query: 285 IKSRDPRLRFA---SSNALNLNHQPAPILHNAPKVEPVGRVMS--SRKQKTVEEPVLDGP 339
            KSRDPRLRF    S    + N        NA +   +  V+S  SRK K   +P++D  
Sbjct: 169 AKSRDPRLRFLNRDSGGTADANRHVNLAEPNASRDGTLWGVVSDNSRKHKATGQPLIDES 228

Query: 340 ALKRQRNGFENSGVVRDEKNIYGSGGWLEDTDMFEPQIMNRNLLVDSAESNSRKLDNGAT 399
            LKR R   E +G  RD +   G  G    +   +    N++  +++  + ++ + N ++
Sbjct: 229 VLKRAR---ECAGNPRDMQVPPGRDGSNISSYSGDRVQSNQHTWLETKTAGNQLISNVSS 285

Query: 400 SPITSGTPNVVVSGNEPAPATTPSTTVSLPALLKDIAVNPTMLLNILKMGQQQKLAADAQ 459
            P ++G   +  S     P T+ +  VSLPA+LKDIAVNPT+L++ ++M  Q++  ++ Q
Sbjct: 286 IPDSTGA--LHASQPNSFPQTSAAPIVSLPAVLKDIAVNPTVLMHWIQMEHQKRSPSEPQ 343

Query: 460 QKS--------NDSSMNTMHPP-----------IPSSIPPVSVTCSIPSGILSKPMDELG 500
             S        N+ +   +  P           IPS  P  + T S+ S       ++ G
Sbjct: 344 PASGIISSGMINNVTAGMVISPGNALKTAEVAHIPSYRPQATSTASVNS------QNDPG 397

Query: 501 KVRMKPRDPRRVLHGNALQRSGSLGP-EFKTDGPSAPCTQGSKENLNFQKQLGAPEAKPV 559
            +RMK RDPRRVLH N   ++ +    + K++G + P  Q SK+NL  ++Q+       V
Sbjct: 398 VIRMKSRDPRRVLHNNTSHKNDTPNSDQAKSNGIALPANQDSKDNLINREQVAEQLQTIV 457

Query: 560 L-SQSVLQPDITQQFTKNLKHIADFMSVSQPLTSEPMVSQNSPIQPGQIKSGADMKAVVT 618
           L SQ V    I +Q T +   + D +S SQ   S  +  Q + +   +    AD +    
Sbjct: 458 LPSQPVSSSSIARQSTMSASKV-DSVSNSQLAASSLIAPQETLVSINR----ADPRVATG 512

Query: 619 NHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKAAIQKERTRRLEEQKKMFS 678
            +D       +     P    P + WGD++ L  GYDDQQKA IQKER RR+ EQ  MFS
Sbjct: 513 QNDSNDAAPATTLGTRP----PANQWGDLDDLLNGYDDQQKALIQKERARRIMEQHTMFS 568

Query: 679 ARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRP 738
           +RKLCLVLDLDHTLLNSAKF EVDP+H+EIL KKEEQDRE+  RHLFRF HM MWTKLRP
Sbjct: 569 SRKLCLVLDLDHTLLNSAKFIEVDPIHEEILWKKEEQDRERSERHLFRFHHMQMWTKLRP 628

Query: 739 GIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG-------DP 791
           GIW FLE+ASKL+E+HLYTMGNKLYATEMAKVLDP G LFAGRVISRG DG       D 
Sbjct: 629 GIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPSGTLFAGRVISRGGDGISRGGDGDT 688

Query: 792 FDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLG 851
           FD D+RVPKSKDL+GVLGMESAVVIIDDSVRVWPHNK N+IVVERYTYFPCSRRQFGL G
Sbjct: 689 FDSDDRVPKSKDLDGVLGMESAVVIIDDSVRVWPHNKNNMIVVERYTYFPCSRRQFGLPG 748

Query: 852 PSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQRKILAGCRIV 911
           PSLLEID DER EDGTLASSL VI R+H+ FFSH +L+D DVR+ILA+EQR+ILAGCRIV
Sbjct: 749 PSLLEIDRDERPEDGTLASSLAVIGRIHQNFFSHPNLNDADVRSILASEQRRILAGCRIV 808

Query: 912 FSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVV 971
           FSR+FPVGEANP LHPLWQTAEQFGAVCT  IDD+VTHVVANSLGTDKVNWAL TGRFVV
Sbjct: 809 FSRIFPVGEANPQLHPLWQTAEQFGAVCTNQIDDRVTHVVANSLGTDKVNWALQTGRFVV 868

Query: 972 HPGWVEASALLYRRANEQDFAIK 994
           HPGWVEASALLYRRANE DFA+K
Sbjct: 869 HPGWVEASALLYRRANEHDFAVK 891


>gi|115485681|ref|NP_001067984.1| Os11g0521900 [Oryza sativa Japonica Group]
 gi|113645206|dbj|BAF28347.1| Os11g0521900 [Oryza sativa Japonica Group]
          Length = 664

 Score =  735 bits (1898), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 392/669 (58%), Positives = 476/669 (71%), Gaps = 45/669 (6%)

Query: 362 GSGGWLED--------TDMFEPQIMNRNLLVDSAESNSRKLDNGAT--------SPITSG 405
           G GGW +D        +D F+P   N+N  + +  + +  +   +T        +  +  
Sbjct: 5   GRGGWAKDGGNISSYSSDGFQP---NQNTRLGNNTTGNHNIRTDSTLASNLNNTTNNSGT 61

Query: 406 TPNVVVS-GNEPAPATTPSTTVSLPALLKDIAVNPTMLLNILKMGQQQKLAADAQQKSND 464
           +P +V +     AP T+ +  VSLPA+LKDIAVNPTML+  ++M QQ+  A++ QQK   
Sbjct: 62  SPGIVQAPQTNSAPQTSSAPAVSLPAMLKDIAVNPTMLMQWIQMEQQKMSASEPQQKVTA 121

Query: 465 SSMNT------MHPPIPSSIPPVSVTCSIPSG-----ILSKPM---DELGKVRMKPRDPR 510
           S   T      M  P+  + P  +   ++PS      + S PM   ++ G +RMKPRDPR
Sbjct: 122 SVGMTSNVTPGMVLPL-GNAPKTTEVAAVPSVRPQVPMQSAPMHSQNDTGVIRMKPRDPR 180

Query: 511 RVLHGNALQRSGSLGP----EFKTDGPSAPCTQGSKENLNFQKQLGAPEAKPVLSQSVLQ 566
           R+LH N +Q++ ++ P    + K++G + P +Q SK++L  Q Q  A + + +   S+  
Sbjct: 181 RILHSNIVQKNDTVPPVGVEQAKSNGTAPPDSQSSKDHLLNQDQ-KAEQLQAIALPSLPV 239

Query: 567 PDITQQFTKNLKHIADFMSVSQPLTSEPMVSQNSPIQPGQIKSGADMK-AVVTNHDDKQT 625
               +  T N    A+ +S SQ   +  M    +  Q     + AD + A   N  +   
Sbjct: 240 TSSARPVTMN----ANPVSNSQLAATALMPPHGNTKQTSSSVNKADPRLAAGQNESNDDA 295

Query: 626 GTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKAAIQKERTRRLEEQKKMFSARKLCLV 685
            T +GP   P    P S +GDV+HL +GYDDQQKA IQKER RR++EQ KMF+ARKLCLV
Sbjct: 296 ATSTGPVTAPDAVPPASPYGDVDHLLDGYDDQQKALIQKERARRIKEQHKMFAARKLCLV 355

Query: 686 LDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLE 745
           LDLDHTLLNSAKF EVD +H EILRKKEEQDRE+  RHLF F HMGMWTKLRPGIW FLE
Sbjct: 356 LDLDHTLLNSAKFIEVDHIHGEILRKKEEQDRERAERHLFCFNHMGMWTKLRPGIWNFLE 415

Query: 746 RASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLE 805
           +ASKL+E+HLYTMGNK+YATEMAKVLDP G LFAGRVISRGDDGDPFD DERVPKSKDL+
Sbjct: 416 KASKLYELHLYTMGNKVYATEMAKVLDPTGTLFAGRVISRGDDGDPFDSDERVPKSKDLD 475

Query: 806 GVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERSED 865
           GVLGMESAVVIIDDSVRVWPHNK NLIVVERYTYFPCSRRQFGL GPSLLEID DER ED
Sbjct: 476 GVLGMESAVVIIDDSVRVWPHNKHNLIVVERYTYFPCSRRQFGLPGPSLLEIDRDERPED 535

Query: 866 GTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHL 925
           GTLASSL VIER+HK FFSH +L+D DVR+ILA+EQ++IL GCRIVFSR+FPVGEANPH+
Sbjct: 536 GTLASSLAVIERIHKNFFSHPNLNDADVRSILASEQQRILGGCRIVFSRIFPVGEANPHM 595

Query: 926 HPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRR 985
           HPLWQTAEQFGAVCT  IDD+VTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRR
Sbjct: 596 HPLWQTAEQFGAVCTNQIDDRVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRR 655

Query: 986 ANEQDFAIK 994
           A+E DFA+K
Sbjct: 656 ASELDFAVK 664


>gi|2459436|gb|AAB80671.1| unknown protein [Arabidopsis thaliana]
          Length = 1066

 Score =  618 bits (1594), Expect = e-174,   Method: Compositional matrix adjust.
 Identities = 416/887 (46%), Positives = 522/887 (58%), Gaps = 111/887 (12%)

Query: 1    MNHVLKEQNKEILSRLLSLIKSHEPPLFSSNQIKEMEAMLSSLVTRANDKEKDMLAMHGV 60
            MN++ KE+NKE +SRLL+L+  H     S NQ  E+E M         D  +  +A+   
Sbjct: 235  MNNISKERNKETMSRLLTLVNDHFSQFLSFNQKNEIETM-------NQDLSRSAIAVFAG 287

Query: 61   NGKDSNIVTENAVNDLNFKEKVPLPVDSLMQNK-PLEASKPGPPGYRSRGVLLPLLDPHK 119
               + N      VN +      P   DS +  K   E++  G    RSR  +LPLLD HK
Sbjct: 288  TSSEEN------VNQMT----QPSNGDSFLAKKLTSESTHRGAAYLRSRLPMLPLLDLHK 337

Query: 120  VHDVDSLPSPTRETTPSVPVQRALVVGDGMVKSWAAAAKLSHNAEVHKTPHYETDALRAF 179
             HD DSLPSPTRETTPS+PV         MV+      + S   E  K   YE+DA +A 
Sbjct: 338  DHDADSLPSPTRETTPSLPVNGRHT----MVRPGFPVGRESQTTEGAKVYSYESDARKAV 393

Query: 180  SSYQQKFGRNSFFMNSELPSPTPSEESGDGDGDTGGEISSATAV------------DQPK 227
            S+YQQKFG NS F   +LPSPTPS E  DG+GD GGE+SS+               D P 
Sbjct: 394  STYQQKFGLNSVFKTDDLPSPTPSGEPNDGNGDVGGEVSSSVVKSSNPGSHLIYGQDVPL 453

Query: 228  PVNMPTLGQQPVSSQPMDISQPMDISSVQALTTANNSAPASSGYNPVVKPNPVVKAPIKS 287
            P N  +    PV++       P  +S + A+     SAP +S        +  VK   KS
Sbjct: 454  PSNFNSRSM-PVANSVSSTVPPHHLS-IHAI-----SAPTAS--------DQTVKPSAKS 498

Query: 288  RDPRLRFASSNALNLN--HQPAPILHNAPKVEPVGRVMSSRKQKTVEEPVLDGPALKRQR 345
            RDPRLR A  +A N+      +    N  KVE    +++ RKQK  +E ++DGPA KRQ+
Sbjct: 499  RDPRLRLAKPDAANVTIYSYSSGDARNLSKVELSADLVNPRKQKAADEFLIDGPAWKRQK 558

Query: 346  NGFENSGVVRDEKNIYGSGGWLEDTDMFEPQIMNRNLLVDSAESNSRKLDNGATSPITSG 405
            +         D     G+GGWLEDT+       +  LL    ES  R ++NG TS  +S 
Sbjct: 559  SD-------TDAPKAAGTGGWLEDTE-------SSGLL--KLESKPRLIENGVTSMTSSV 602

Query: 406  TPNVVVSGNEPAPATTPSTTVSLPALLKDIAVNPTMLLNILKMGQQQKLAADAQQKSNDS 465
             P   VS ++    T  + T SL +LLKDIAVNPTMLLN+LKMG++QK+   A QK  D 
Sbjct: 603  MPTSAVSVSQKV-RTASTDTASLQSLLKDIAVNPTMLLNLLKMGERQKVPEKAIQKPMDP 661

Query: 466  SMNTMHPPIPSSIPPVSVTCSIP-----------SGIL-----SKPMDELGKVRMKPRDP 509
                   P  S  P VS   SIP           SG+L     + P  E G +RMKPRDP
Sbjct: 662  R-RAAQLPGSSVQPGVSTPLSIPASNALAANSLNSGVLQDSSQNAPAAESGSIRMKPRDP 720

Query: 510  RRVLHGNALQRS-GSLGPEFKTDGPSAPCT---QGSKENLNFQKQLGAPE-------AKP 558
            RR+LHG+ LQR+  S+  + K + PS   T   +G  E+L    QL   +       +K 
Sbjct: 721  RRILHGSTLQRTDSSMEKQTKVNDPSTLGTLTMKGKAEDLETPPQLDPRQNISQNGTSKM 780

Query: 559  VLSQSVLQ---PDITQQFTKNLKHIADFMSVSQPLTSEPMVSQNSPIQPGQIKSGADMKA 615
             +S  +L    PD + QFTKNLK IAD + VSQ L + P     + +   Q+K+  D+K 
Sbjct: 781  KISGELLSGKTPDFSTQFTKNLKSIADMVVVSQQLGNPP-----ASMHSVQLKTERDVKH 835

Query: 616  VVTN-----HDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKAAIQKERTRRL 670
              +N      D   +       AGP  +   ++WGDVEHLFEGYDD Q+ AIQ+ER RRL
Sbjct: 836  NPSNPNAQDEDVSVSAASVTAAAGPTRS--MNSWGDVEHLFEGYDDIQRVAIQRERVRRL 893

Query: 671  EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 730
            EEQ KMF+++KL LVLD+DHTLLNSAKF+EV+  H+EILRKKEEQDREKP+RHLFRF HM
Sbjct: 894  EEQNKMFASQKLSLVLDIDHTLLNSAKFNEVESRHEEILRKKEEQDREKPYRHLFRFLHM 953

Query: 731  GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 790
            GMWTKLRPGIW FLE+ASKL+E+HLYTMGNKLYATEMAK+LDPKGVLF GRVIS+GDDGD
Sbjct: 954  GMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKLLDPKGVLFNGRVISKGDDGD 1013

Query: 791  PFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERY 837
            P DGDERVPKSKDLEGV+GMES+VVIIDDSVRVWP +K+NLI VERY
Sbjct: 1014 PLDGDERVPKSKDLEGVMGMESSVVIIDDSVRVWPQHKMNLIAVERY 1060


>gi|357478637|ref|XP_003609604.1| RNA polymerase II C-terminal domain phosphatase-like protein
            [Medicago truncatula]
 gi|355510659|gb|AES91801.1| RNA polymerase II C-terminal domain phosphatase-like protein
            [Medicago truncatula]
          Length = 1064

 Score =  602 bits (1553), Expect = e-169,   Method: Compositional matrix adjust.
 Identities = 282/369 (76%), Positives = 310/369 (84%), Gaps = 7/369 (1%)

Query: 628  GSGPEAGPVGA-HPQSAWG-DVEHLFEGYDDQQKAAIQKERTRRLEEQKKMFSARKLCLV 685
            GS  E    G+  P + W  +VEHL EGYD QQKA IQ+ER RRLEEQ KMF+ARKLCLV
Sbjct: 691  GSAHETCASGSCQPHNTWAANVEHLLEGYDAQQKAVIQRERARRLEEQNKMFAARKLCLV 750

Query: 686  LDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLE 745
            LD+DHTLLNSAKF EVDP HD+ILRKKE+Q+R KP RHLFR PHMGMWTKLRPG+W FLE
Sbjct: 751  LDIDHTLLNSAKFVEVDPEHDKILRKKEKQERGKPRRHLFRLPHMGMWTKLRPGVWNFLE 810

Query: 746  RASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLE 805
            +ASKLFEMHLYTMGNKLYATEMAKVLDP GVLFAGRVISRGDD +  D      K KDLE
Sbjct: 811  KASKLFEMHLYTMGNKLYATEMAKVLDPNGVLFAGRVISRGDDPETVD-----IKCKDLE 865

Query: 806  GVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERSED 865
            GVLG+ES+VVIIDDS RVWPHN+LNLI VERY YF CSRRQFGL GPSL EIDHDER   
Sbjct: 866  GVLGLESSVVIIDDSPRVWPHNQLNLITVERYIYFLCSRRQFGLSGPSLFEIDHDERPGA 925

Query: 866  GTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHL 925
            GTLASSLGVIER+H+ FF+ QSL+++DVRNILA+EQRKIL GCRIVFS VFPVGE NPHL
Sbjct: 926  GTLASSLGVIERIHQNFFASQSLEEMDVRNILASEQRKILGGCRIVFSGVFPVGETNPHL 985

Query: 926  HPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRR 985
            HPLW+TAEQFGA CT  +D QVTHVVA S GTDKVNW +S G+FVV+P WVEAS LLYRR
Sbjct: 986  HPLWRTAEQFGASCTNKVDPQVTHVVAQSPGTDKVNWGISNGKFVVYPNWVEASTLLYRR 1045

Query: 986  ANEQDFAIK 994
             NEQDFA+K
Sbjct: 1046 MNEQDFAVK 1054



 Score =  165 bits (417), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 168/554 (30%), Positives = 251/554 (45%), Gaps = 139/554 (25%)

Query: 1   MNHVLKEQNKEILSRLLSLIKSHEPPLFSSNQIKEMEAMLSSLVTRANDKEKDMLAMHGV 60
           M++  KE+NK+ + RLLS +K+    LF+   +KE+ AM++ +    N            
Sbjct: 232 MDNFQKEENKDNILRLLSFLKNERAHLFTPEHMKEIHAMITLVDVSGN------------ 279

Query: 61  NGKDSNIVTENAVNDLNFKEKVPLPVDSLMQNKPLEASKPGPPGYRSRGVLLPLLDPHKV 120
                        ++ N +EK    +++L + + +        G R              
Sbjct: 280 -------------SEANSEEK---KLEALDETRKI-------LGLR-------------- 302

Query: 121 HDVDSLPSPTRETTPSVPVQRALVVGDGM-------VKSWAAAAKLSHNAEVHKTPHYET 173
           HD+D+LPS T+E    VPV +   VGDG        VK+   A K+  + + +K   +ET
Sbjct: 303 HDLDNLPSLTQE----VPVNKLFSVGDGTDRFGLPPVKT--EAEKMELDGKDYKLHIHET 356

Query: 174 DALRAFSSYQQKFGRNSFFMNSELPSPTPSEESGDGDGDTGGEISSATAVDQPKPVNMPT 233
           DAL+A S+ QQKF R+SFF + E PSPTPS +   G  DT  E+SSA+          P 
Sbjct: 357 DALKAASTCQQKFSRSSFFTDDEFPSPTPSGDCEGGAVDTNDEVSSASIASSLTSSKPPP 416

Query: 234 LGQQPVSSQPMDISQPMDISSVQALTTANNSAPASSGYNPVVKPNPVVKAPIKSRDPRLR 293
           L Q  VSS  ++ S    + + +   +   S PA              K  +KSRDPRLR
Sbjct: 417 LDQMLVSSTYINRSNMHGLINSRIDASGAGSYPA--------------KTSVKSRDPRLR 462

Query: 294 FASSNALNLNHQPAPILHNAPKVEPVGRVMSSRKQKTVEEPVLDGPALKRQRNGFENSG- 352
           F      N++ Q +   +  PKVE    V+ SRK+KTVEE  LD  A KR     ENS  
Sbjct: 463 F------NISDQ-SSTKNIMPKVEYAEGVI-SRKRKTVEESSLDATAPKRLTRSLENSQH 514

Query: 353 VVRDEKNIYGSGGWLEDTDMFEPQIMNRNLLVDSAESNSRKLDNGATSPITSGTPNVVVS 412
             R+E+ +   GGWL                           +N   S +T+ +     +
Sbjct: 515 NSREEQTMDAKGGWLA--------------------------ENTVASNLTTTS-----N 543

Query: 413 GNEPAPATTPSTTVSLPALLKDIAVNPTMLLNILKMGQQQKLAA---------------- 456
           GNE AP  +      L AL    +VN TMLLN L +   Q+LA                 
Sbjct: 544 GNEQAPVISSCAATPLLALFNSESVNSTMLLNKL-LDIHQRLAEVKRPINFATSALHLTN 602

Query: 457 DAQQKSNDSSMNTMHPPIPSSIPPVSVT---CSIPSGILSKPMD-ELGKVRMKPRDPRRV 512
               +  +S++NT  P + S +P  S+     S P+  +++ +  +  K+ +KPRDPRR 
Sbjct: 603 SNSARGTNSTVNT-SPTMTSGVPQNSIGMLPTSSPTTSMAQTLQVDSEKICLKPRDPRRS 661

Query: 513 LHGNA-LQRSGSLG 525
           LH ++ +Q+SGSLG
Sbjct: 662 LHASSTVQKSGSLG 675


>gi|302761896|ref|XP_002964370.1| hypothetical protein SELMODRAFT_405568 [Selaginella moellendorffii]
 gi|300168099|gb|EFJ34703.1| hypothetical protein SELMODRAFT_405568 [Selaginella moellendorffii]
          Length = 766

 Score =  505 bits (1300), Expect = e-140,   Method: Compositional matrix adjust.
 Identities = 232/355 (65%), Positives = 279/355 (78%), Gaps = 5/355 (1%)

Query: 646 DVEHLFEGYDDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVH 705
           +++      D+ ++ A  KER RR++EQ KM S +KLCLVLDLDHTLLNSAKF E++   
Sbjct: 412 ELQEFLIDLDEAERIAFIKERQRRMDEQDKMLSEKKLCLVLDLDHTLLNSAKFMEIEQEW 471

Query: 706 DEILRKKEEQDREK-----PHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGN 760
           D  LR  E  +R K       R L+RFP+M MWTKLRPGIW FL RAS+L+E+HLYTMGN
Sbjct: 472 DRFLRATETIERNKDAKEGTRRELYRFPYMSMWTKLRPGIWRFLARASQLYELHLYTMGN 531

Query: 761 KLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDS 820
           K YATEMAK+LDP GVLFAGRVIS+GDDGD   GDE+ P+SKDL+GVLGMESAV+IIDDS
Sbjct: 532 KAYATEMAKLLDPTGVLFAGRVISKGDDGDALYGDEKTPRSKDLDGVLGMESAVLIIDDS 591

Query: 821 VRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHK 880
            RVWPH+K NLIVVERY YFPCSR+QFGL GPSLLE+ HDER  DG LAS LGV+ER+H+
Sbjct: 592 ARVWPHHKDNLIVVERYMYFPCSRKQFGLPGPSLLEVGHDEREADGMLASILGVVERVHE 651

Query: 881 IFFSHQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCT 940
            F+S     +VD+R +L+  QR+IL GC+I+FSRVFPV E  P LHPLW+ AEQFGAVCT
Sbjct: 652 EFYSRPLPKEVDIREVLSVVQRRILGGCKIIFSRVFPVEETQPQLHPLWRMAEQFGAVCT 711

Query: 941 KHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAIKP 995
             +++ VTHVVA S+GTDK NWAL+TGRF+V P WVEAS +LYRRANE+DF + P
Sbjct: 712 TRMEEDVTHVVAISMGTDKSNWALATGRFLVRPAWVEASTVLYRRANERDFPVPP 766


>gi|302768485|ref|XP_002967662.1| hypothetical protein SELMODRAFT_440109 [Selaginella moellendorffii]
 gi|300164400|gb|EFJ31009.1| hypothetical protein SELMODRAFT_440109 [Selaginella moellendorffii]
          Length = 762

 Score =  504 bits (1299), Expect = e-140,   Method: Compositional matrix adjust.
 Identities = 232/355 (65%), Positives = 279/355 (78%), Gaps = 5/355 (1%)

Query: 646 DVEHLFEGYDDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVH 705
           +++      D+ ++ A  KER RR++EQ KM S +KLCLVLDLDHTLLNSAKF E++   
Sbjct: 408 ELQEFLIDLDEAERIAFIKERQRRMDEQDKMLSEKKLCLVLDLDHTLLNSAKFMEIEQEW 467

Query: 706 DEILRKKEEQDREK-----PHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGN 760
           D  LR  E  +R K       R L+RFP+M MWTKLRPGIW FL RAS+L+E+HLYTMGN
Sbjct: 468 DRFLRATETIERNKDAKEGTRRELYRFPYMSMWTKLRPGIWRFLARASQLYELHLYTMGN 527

Query: 761 KLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDS 820
           K YATEMAK+LDP GVLFAGRVIS+GDDGD   GDE+ P+SKDL+GVLGMESAV+IIDDS
Sbjct: 528 KAYATEMAKLLDPTGVLFAGRVISKGDDGDALYGDEKTPRSKDLDGVLGMESAVLIIDDS 587

Query: 821 VRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHK 880
            RVWPH+K NLIVVERY YFPCSR+QFGL GPSLLE+ HDER  DG LAS LGV+ER+H+
Sbjct: 588 ARVWPHHKDNLIVVERYMYFPCSRKQFGLPGPSLLEVGHDEREADGMLASILGVVERVHE 647

Query: 881 IFFSHQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCT 940
            F+S     +VD+R +L+  QR+IL GC+I+FSRVFPV E  P LHPLW+ AEQFGAVCT
Sbjct: 648 EFYSRPLPKEVDIREVLSVVQRRILGGCKIIFSRVFPVEETQPQLHPLWRMAEQFGAVCT 707

Query: 941 KHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAIKP 995
             +++ VTHVVA S+GTDK NWAL+TGRF+V P WVEAS +LYRRANE+DF + P
Sbjct: 708 TRMEEDVTHVVAISMGTDKSNWALATGRFLVRPAWVEASTVLYRRANERDFPVPP 762


>gi|168018017|ref|XP_001761543.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162687227|gb|EDQ73611.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 1984

 Score =  494 bits (1272), Expect = e-137,   Method: Compositional matrix adjust.
 Identities = 263/521 (50%), Positives = 335/521 (64%), Gaps = 66/521 (12%)

Query: 497  DELGKVRMKPRDPRRVLHGNALQRSGSLGPEFKTDGPSAPCTQGSKENLNFQKQLGAPEA 556
            +++G+ RM+PRDPRR+L  NA++ +       + +  + P    +   +  Q    +PE 
Sbjct: 1303 EDVGRHRMRPRDPRRILLENAVETA-------QVNPMNVPVNDAAGSEMTLQYTNRSPEV 1355

Query: 557  KPVLSQSVLQPDITQQFTKNL----------------KHIADFMSVSQPLTSEPMVSQNS 600
              V       P++  Q + N                    +D   VS  L++E   ++  
Sbjct: 1356 ADV------SPNLNNQPSTNTLGNPPNQRDPRLNPYESSQSDSAVVSIQLSTEHKTTEWK 1409

Query: 601  PIQPGQIKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQ-------------SAWG-- 645
             +     +S  D      ++ ++ +   SG   G  G H               S WG  
Sbjct: 1410 TLDERIKESERD------SNRNQGSEVSSGESTGDKGDHLHPWDPLLRKPRFGPSHWGGN 1463

Query: 646  ----DVEHLFEGYDDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEV 701
                D E L E  D++Q+ AIQ ER RRL+EQ +MF A KLCLVLDLDHTLLNSAKF E+
Sbjct: 1464 DMHRDFEQLLEDLDEKQRIAIQNERKRRLQEQDRMFIAGKLCLVLDLDHTLLNSAKFSEI 1523

Query: 702  DPVHDEILRKKEEQDREKP------HRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHL 755
            +P  +  LR+ E  +R +        + L+RFPHM MWTKLRPGIW FL +AS+L+E+H+
Sbjct: 1524 EPEWEARLRQAENMERSRALKDPSMKQELYRFPHMSMWTKLRPGIWKFLAKASELYELHV 1583

Query: 756  YTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVV 815
            YTMGNK YATEMAK+LDP G LFAGRVIS+GD+ D  D      KSKDL+GVLGMESAVV
Sbjct: 1584 YTMGNKAYATEMAKLLDPTGTLFAGRVISKGDEVDGSD------KSKDLDGVLGMESAVV 1637

Query: 816  IIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVI 875
            IIDDS RVWPH++ NLIVVERY YFP SRRQFGLLGPSLLE+ HDER+ DG L+S+ GVI
Sbjct: 1638 IIDDSSRVWPHHRENLIVVERYMYFPSSRRQFGLLGPSLLEVGHDERAADGMLSSASGVI 1697

Query: 876  ERLHKIFFSHQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQF 935
            +R+HK FFS++ L +VDVR ILAAEQR++L GCR++FSR+FPVGEANPHLHPLW+ AEQF
Sbjct: 1698 DRIHKNFFSNKRLREVDVRAILAAEQRRVLDGCRVLFSRIFPVGEANPHLHPLWRLAEQF 1757

Query: 936  GAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWV 976
            GA C  HI+D+VTHVVA SLGTDKVNWA +TGR VV P W 
Sbjct: 1758 GASCCLHINDKVTHVVAISLGTDKVNWAAATGRPVVRPAWC 1798


>gi|168040198|ref|XP_001772582.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162676137|gb|EDQ62624.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 1881

 Score =  488 bits (1257), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 268/541 (49%), Positives = 337/541 (62%), Gaps = 67/541 (12%)

Query: 480  PVSVTCSIPSGILSKPMDELGKVRMKPRDPRRVLHGNALQRSGSLGPEFKTDGPSAPCTQ 539
            P  +  +   G   +   ++GK RM+PRDPRR L  +A           + +  S P  +
Sbjct: 1325 PFGINSTKSEGQAGEEEQDIGKHRMRPRDPRRALLDSA-------ADIVQVNQRSPPIIE 1377

Query: 540  GSKENLNFQKQLGAPEAKPVLSQSVLQPDITQQFTKNLKHIAD--------------FMS 585
             +      Q + G        S  V QP I     +N  ++ D               ++
Sbjct: 1378 AADSGTTLQIETGTSLPTNTSSDLVKQPSINS--LENPLNLRDPRLSSNNSTQSNNATLA 1435

Query: 586  VSQPLTS--------EPMV-----------SQNSPIQPGQIKSGADMKAVVTNHDDKQTG 626
              QP T         EP+V           S+N  I   ++ SG  +   V +H      
Sbjct: 1436 PEQPSTEQKNMTVEEEPVVDERNNARERESSRNQGIDAREVFSGESILDEV-DHLHPWDP 1494

Query: 627  TGSGPEAGPVGAHPQSAWG------DVEHLFEGYDDQQKAAIQKERTRRLEEQKKMFSAR 680
                P  GP      S WG      D E L E  D+ Q+  IQ ER RR++EQ +MFSA 
Sbjct: 1495 VLRKPRFGP------SHWGGSNLHRDFEQLLEDLDEDQRITIQNERKRRIQEQDRMFSAG 1548

Query: 681  KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR------HLFRFPHMGMWT 734
            KLCLVLDLDHTLLNSAKF E++P  +  LR+ E  +R +  +       L+RFPHM MWT
Sbjct: 1549 KLCLVLDLDHTLLNSAKFSEIEPEFEARLRQAENMERSRSTKDPNMKQELYRFPHMSMWT 1608

Query: 735  KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDG 794
            KLRPGIW FL +AS+L+E+H+YTMGNK YATEMAK+LDP G+LF+GRVIS+GD+ D  D 
Sbjct: 1609 KLRPGIWKFLAKASELYELHVYTMGNKAYATEMAKLLDPTGILFSGRVISKGDEVDGSD- 1667

Query: 795  DERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSL 854
                 KSKDL+GVLGMESAVVIIDDS RVWPH++ NLIVVERY YFP SRRQFGLLGPSL
Sbjct: 1668 -----KSKDLDGVLGMESAVVIIDDSSRVWPHHRENLIVVERYMYFPSSRRQFGLLGPSL 1722

Query: 855  LEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQRKILAGCRIVFSR 914
            LE+ HDER+ DG L+S+ GVI+R+H+ FFS++ L +VDVR ILAAEQR++L GCR++FSR
Sbjct: 1723 LEVGHDERAVDGMLSSASGVIDRIHRNFFSNKKLREVDVRAILAAEQRRVLDGCRVLFSR 1782

Query: 915  VFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPG 974
            +FPVGEANPHLHPLW+ AEQFGA C  +I+D+VTHVVA SLGTDKVNWA +TGR VV P 
Sbjct: 1783 IFPVGEANPHLHPLWRLAEQFGASCCLYINDKVTHVVAISLGTDKVNWATATGRPVVRPT 1842

Query: 975  W 975
            W
Sbjct: 1843 W 1843


>gi|224075473|ref|XP_002304648.1| predicted protein [Populus trichocarpa]
 gi|222842080|gb|EEE79627.1| predicted protein [Populus trichocarpa]
          Length = 238

 Score =  435 bits (1119), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 215/238 (90%), Positives = 224/238 (94%)

Query: 758 MGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVII 817
           MGNKLYATEMAKVLDPKGVLFAGRV+SRGDDGD  DGDERVPKSKDLEGVLGMES VVII
Sbjct: 1   MGNKLYATEMAKVLDPKGVLFAGRVVSRGDDGDLLDGDERVPKSKDLEGVLGMESGVVII 60

Query: 818 DDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIER 877
           DDS+RVWPHNKLNLIVVERY YFPCSRRQFGL GPSLLEIDHDER EDGTLA SL VIER
Sbjct: 61  DDSLRVWPHNKLNLIVVERYIYFPCSRRQFGLPGPSLLEIDHDERPEDGTLACSLAVIER 120

Query: 878 LHKIFFSHQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGA 937
           +H+ FF+H SLD+ DVRNILA+EQRKILAGCRIVFSRVFPVGE NPHLHPLWQ+AEQFGA
Sbjct: 121 IHQNFFTHHSLDEADVRNILASEQRKILAGCRIVFSRVFPVGEVNPHLHPLWQSAEQFGA 180

Query: 938 VCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAIKP 995
           VCT  ID+QVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAIKP
Sbjct: 181 VCTNQIDEQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAIKP 238


>gi|168059994|ref|XP_001781984.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162666557|gb|EDQ53208.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 563

 Score =  286 bits (732), Expect = 4e-74,   Method: Compositional matrix adjust.
 Identities = 151/334 (45%), Positives = 208/334 (62%), Gaps = 18/334 (5%)

Query: 669 RLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP 728
           R  E +++   +KL LV+DLDHT+LNSA+F EV P  + I        +      L +  
Sbjct: 173 RNAELRRVTGKQKLLLVVDLDHTMLNSARFSEV-PAEERIYLTWTAGQQHGRVSSLHQLT 231

Query: 729 HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD 788
            +GMWTKLRP    FLE ASKL+EM++YTMG K+YA  MA++LDP G LF GR+IS+ D 
Sbjct: 232 KLGMWTKLRPFAHKFLEEASKLYEMYVYTMGEKIYAQAMAELLDPTGQLFGGRIISQTDS 291

Query: 789 GDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFG 848
                       +KDL+ VLG ESAVVI+DD+  VWP+++ NLI++ERY +F  S  QF 
Sbjct: 292 TKR--------HTKDLDVVLGAESAVVILDDTEAVWPNHRSNLILMERYHFFTSSCHQFR 343

Query: 849 LLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFS-HQS--------LDDVDVRNILAA 899
           +  PSL ++  DE   DGTLA++L  ++ +H  FF+ H+         L+  DVR+++ +
Sbjct: 344 VRAPSLAQMHRDECEIDGTLATTLKTLQAIHHEFFNGHKGKSMKRRPPLELPDVRDVIRS 403

Query: 900 EQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDK 959
            + K+L+GC IVFSR+FP G  NP  HP WQ A + GA C+   D   THVVA   GTDK
Sbjct: 404 IRGKLLSGCHIVFSRIFPTGLQNPEFHPFWQLAVELGARCSTVCDHTTTHVVALDRGTDK 463

Query: 960 VNWALSTGRFVVHPGWVEASALLYRRANEQDFAI 993
             WA   G  +VHP WVEA++ L++R  E+DF +
Sbjct: 464 ARWAKQHGISLVHPRWVEAASYLWKRPREKDFPV 497


>gi|384247094|gb|EIE20582.1| hypothetical protein COCSUDRAFT_57726 [Coccomyxa subellipsoidea
            C-169]
          Length = 1018

 Score =  285 bits (729), Expect = 8e-74,   Method: Compositional matrix adjust.
 Identities = 146/334 (43%), Positives = 205/334 (61%), Gaps = 22/334 (6%)

Query: 676  MFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKP--HRHLFRFPHMGMW 733
            +   R+LCLVLDLDHTL+NSAKF EV+P H ++L ++ +++   P   + L R   + MW
Sbjct: 693  LLRQRRLCLVLDLDHTLVNSAKFSEVEPEHLKLLERQLQREAALPAEEKRLHRLDRIAMW 752

Query: 734  TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFD 793
            T LRPG+   L   + LF++ + T  ++ YA  MA++LDP G LF  R+IS+GDDG    
Sbjct: 753  TALRPGLRQMLAAVAPLFQLWIQTNASRAYALAMAELLDPTGELFGQRIISKGDDGSAL- 811

Query: 794  GDERVPKSKDL-EGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGP 852
                +  SK L +G+   E+  +I+DDS  VW H+  NL+ VERYTYFP SRRQ  L GP
Sbjct: 812  ----INHSKRLMQGLEECEAVCIIVDDSDDVWRHHAHNLLHVERYTYFPSSRRQLNLRGP 867

Query: 853  SLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLD-----------DVDVRNILAAEQ 901
            S LE   DE  + G LA +LGV+ R+H   F+  +LD             DVR++L   +
Sbjct: 868  SFLEAHKDECDKTGILAVTLGVLLRVHIAVFA--ALDAPPTAGIREEHHWDVRHVLGLLR 925

Query: 902  RKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVN 961
            +++L G R++FS+VFP+G+A P     W+ AE +GA CT  +D+ VTHVVA S GT K  
Sbjct: 926  KQVLLGVRVLFSKVFPLGQA-PSEQLYWKQAEAYGASCTSQLDEHVTHVVALSRGTHKAQ 984

Query: 962  WALSTGRFVVHPGWVEASALLYRRANEQDFAIKP 995
            WAL  G+ VV P W+E S  L++RA E+ +   P
Sbjct: 985  WALQAGKHVVSPAWLECSCTLWQRAKERAYPAPP 1018


>gi|255570505|ref|XP_002526210.1| RNA polymerase II ctd phosphatase, putative [Ricinus communis]
 gi|223534449|gb|EEF36151.1| RNA polymerase II ctd phosphatase, putative [Ricinus communis]
          Length = 478

 Score =  280 bits (716), Expect = 3e-72,   Method: Compositional matrix adjust.
 Identities = 148/327 (45%), Positives = 205/327 (62%), Gaps = 12/327 (3%)

Query: 669 RLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP 728
           R  + K +   RKL LVLDLDHTLLNS +   +    +E L+ + +  ++  +  LF   
Sbjct: 154 RNTDMKNLLRHRKLYLVLDLDHTLLNSTQLMHL-TAEEEYLKSQIDSMQDVSNGSLFMVD 212

Query: 729 HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD 788
            M M TKLRP I TFL+ AS++FEM++YTMG++ YA EMAK LDP    F  RVISR D 
Sbjct: 213 FMHMMTKLRPFIRTFLKEASQMFEMYIYTMGDRAYALEMAKFLDPGREYFNARVISRDD- 271

Query: 789 GDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFG 848
                G +R  K  D+  VLG ESAV+I+DD+   W  +K NLI++ERY +F  S RQFG
Sbjct: 272 -----GTQRHQKGLDI--VLGQESAVLILDDTENAWTKHKDNLILMERYHFFASSCRQFG 324

Query: 849 LLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFS--HQSLDDVDVRNILAAEQRKILA 906
               SL ++  DE   DG LAS L V+ R+H IFF     ++D  DVR +L+  ++ +L 
Sbjct: 325 FECKSLSQLKSDENESDGALASVLKVLRRIHHIFFDELEDAIDGRDVRQVLSTVRKDVLK 384

Query: 907 GCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALST 966
           GC+IVFSRVFP  +     H LW+ AEQ GA C++ +D  VTHVV+   GT+K  WAL  
Sbjct: 385 GCKIVFSRVFPT-QFQADNHHLWKMAEQLGATCSREVDPSVTHVVSAEAGTEKSRWALKN 443

Query: 967 GRFVVHPGWVEASALLYRRANEQDFAI 993
            +F+VHP W+EA+  +++R  E++F++
Sbjct: 444 DKFLVHPRWIEATNYMWQRQPEENFSV 470


>gi|115463681|ref|NP_001055440.1| Os05g0390500 [Oryza sativa Japonica Group]
 gi|57863785|gb|AAS86390.2| unknown protein [Oryza sativa Japonica Group]
 gi|113578991|dbj|BAF17354.1| Os05g0390500 [Oryza sativa Japonica Group]
 gi|215695102|dbj|BAG90293.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222631469|gb|EEE63601.1| hypothetical protein OsJ_18418 [Oryza sativa Japonica Group]
          Length = 536

 Score =  278 bits (712), Expect = 8e-72,   Method: Compositional matrix adjust.
 Identities = 145/323 (44%), Positives = 199/323 (61%), Gaps = 13/323 (4%)

Query: 674 KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMW 733
           K +   RKL L+LDLDHTL+NS K  ++    +E+  +   ++   P R LF    M M 
Sbjct: 163 KNLLRERKLVLILDLDHTLINSTKLFDLSAAENELGIQSAAKEV-VPDRSLFTLETMQML 221

Query: 734 TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFD 793
           TKLRP +  FL+ AS +FEM++YTMG+K YA E+AK+LDP  V F  +VIS  D      
Sbjct: 222 TKLRPFVRRFLKEASDMFEMYIYTMGDKAYAIEIAKLLDPDNVYFGSKVISNSD------ 275

Query: 794 GDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPS 853
             +R  K  D+  VLG ES  VI+DD+  VW  +K NLI++ERY YF  S RQFG    S
Sbjct: 276 CTQRHQKGLDV--VLGDESVAVILDDTEYVWQKHKENLILMERYHYFASSCRQFGFGARS 333

Query: 854 LLEIDHDERSEDGTLASSLGVIERLHKIFF---SHQSLDDVDVRNILAAEQRKILAGCRI 910
           L E   DER  DG LA+ L V+ER+H IFF     + L   DVR ++   ++++L GC++
Sbjct: 334 LSETMQDERENDGALATILDVLERIHTIFFDPDDQKPLSSRDVRQVIKRVRQEVLQGCKL 393

Query: 911 VFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFV 970
           VF+RVFP+ +       +W+ AEQ GAVC   +D  VTHVVA  LGT+K  WA+S  +F+
Sbjct: 394 VFTRVFPLHQRQQD-QMIWKMAEQLGAVCCTDVDSTVTHVVALDLGTEKARWAVSNKKFL 452

Query: 971 VHPGWVEASALLYRRANEQDFAI 993
           VHP W+EA+   ++R  E+DF +
Sbjct: 453 VHPRWIEAANFRWQRQQEEDFPV 475


>gi|242093742|ref|XP_002437361.1| hypothetical protein SORBIDRAFT_10g025580 [Sorghum bicolor]
 gi|241915584|gb|EER88728.1| hypothetical protein SORBIDRAFT_10g025580 [Sorghum bicolor]
          Length = 558

 Score =  278 bits (711), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 142/323 (43%), Positives = 195/323 (60%), Gaps = 13/323 (4%)

Query: 674 KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMW 733
           K +   RKL L+LDLDHTL+NS K  ++     + L  +    ++ P+R +F    M M 
Sbjct: 157 KNLLRERKLVLILDLDHTLINSTKLQDISSAEKD-LGIQTAASKDDPNRSIFSLDSMQML 215

Query: 734 TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFD 793
           TKLRP +  FL+ AS +FEM++YTMG+K YA E+AK+LDP  + F  +VIS  D      
Sbjct: 216 TKLRPFVREFLKEASNMFEMYIYTMGDKAYAIEIAKLLDPSNIYFPSKVISNSD------ 269

Query: 794 GDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPS 853
             +R  K  D+  +LG ES  VI+DD+  VW  +K NLI++ERY +F  S RQFG    S
Sbjct: 270 CTQRHQKGLDV--ILGAESVAVILDDTEYVWQKHKENLILMERYHFFASSCRQFGFGVRS 327

Query: 854 LLEIDHDERSEDGTLASSLGVIERLHKIFFS---HQSLDDVDVRNILAAEQRKILAGCRI 910
           L E   DER  DG LA+ L V++R+H IFF       L   DVR ++ A +++IL GC+I
Sbjct: 328 LSESMQDERESDGALATVLDVLKRIHSIFFDLAVETDLSSQDVRQVIKAVRKEILQGCKI 387

Query: 911 VFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFV 970
           VFSRVFP     P    LW+ AE  GAVC+  +D  VTHVV   LGT+K  W ++  +F+
Sbjct: 388 VFSRVFP-NNTRPQEQMLWKMAEHLGAVCSTDVDSSVTHVVTVDLGTEKARWGVANKKFL 446

Query: 971 VHPGWVEASALLYRRANEQDFAI 993
           VHP W+EA+   + R  E+DF +
Sbjct: 447 VHPRWIEAANFRWHRQPEEDFPV 469


>gi|218196729|gb|EEC79156.1| hypothetical protein OsI_19829 [Oryza sativa Indica Group]
          Length = 574

 Score =  278 bits (710), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 147/321 (45%), Positives = 196/321 (61%), Gaps = 13/321 (4%)

Query: 674 KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMW 733
           K +   RKL L+LDLDHTL+NS K  ++    +E L  +       P R LF    M M 
Sbjct: 189 KNLLRERKLVLILDLDHTLINSTKLFDLSAAENE-LGIQSAAKEVVPDRSLFTLETMQML 247

Query: 734 TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFD 793
           TKLRP +  FL+ AS +FEM++YTMG+K YA E+AK+LDP  V F  +VIS  D      
Sbjct: 248 TKLRPFVRRFLKEASDMFEMYIYTMGDKAYAIEIAKLLDPDNVYFGSKVISNSD------ 301

Query: 794 GDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPS 853
             +R  K  D+  VLG ES  VI+DD+  VW  +K NLI++ERY YF  S RQFG    S
Sbjct: 302 CTQRHQKGLDV--VLGDESVAVILDDTEYVWQKHKENLILMERYHYFASSCRQFGFGARS 359

Query: 854 LLEIDHDERSEDGTLASSLGVIERLHKIFF---SHQSLDDVDVRNILAAEQRKILAGCRI 910
           L E   DER  DG LA+ L V+ER+H IFF     + L   DVR ++   ++++L GC++
Sbjct: 360 LSETMQDERENDGALATILDVLERIHTIFFDPDDQKPLSSRDVRQVIKRVRQEVLQGCKL 419

Query: 911 VFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFV 970
           VF+RVFP+ +       LW+ AEQ GAVC   +D  VTHVVA  LGT+K  WA+S  +F+
Sbjct: 420 VFTRVFPLHQRQQD-QMLWKMAEQLGAVCCTDVDSTVTHVVALDLGTEKARWAVSNKKFL 478

Query: 971 VHPGWVEASALLYRRANEQDF 991
           VHP W+EA+   ++R  E+DF
Sbjct: 479 VHPRWIEAANFRWQRQQEEDF 499


>gi|449447765|ref|XP_004141638.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
           4-like [Cucumis sativus]
          Length = 452

 Score =  276 bits (707), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 146/328 (44%), Positives = 203/328 (61%), Gaps = 12/328 (3%)

Query: 669 RLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP 728
           R +E K++   +KL LVLDLDHTLLNS +   +  V +E LR + +   +     LF   
Sbjct: 125 RNKEMKELLQRKKLILVLDLDHTLLNSTELRYL-TVEEEYLRSQTDSLDDVTKGSLFLLN 183

Query: 729 HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD 788
            +   TKLRP + +FL+ ASKLFEM++YTMG + YA EMAK+LDPK   F+ +VISR DD
Sbjct: 184 SVHTMTKLRPFVHSFLKEASKLFEMYIYTMGERRYAFEMAKLLDPKKEYFSSKVISR-DD 242

Query: 789 GDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFG 848
           G            K L+ VLG ESAV+I+DD+   W  +K NLI++ERY +F  S RQFG
Sbjct: 243 GTQ-------KHQKGLDVVLGKESAVLILDDTENAWTKHKENLILMERYHFFASSCRQFG 295

Query: 849 LLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLD--DVDVRNILAAEQRKILA 906
               SL E+ +DE   DG L + L V++++H +FF+  S D  D DVR +L   + ++L 
Sbjct: 296 FNCKSLSELKNDESETDGALTTILKVLKQVHHMFFNEVSGDLVDRDVRQVLKTVRAEVLE 355

Query: 907 GCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALST 966
           GC++VFSRVFP  +     H LW+  EQ G  C+  +D  VTHVVA   GT+K  WAL  
Sbjct: 356 GCKVVFSRVFPT-KFQAENHQLWKMVEQLGGTCSTELDQSVTHVVATDAGTEKSRWALKE 414

Query: 967 GRFVVHPGWVEASALLYRRANEQDFAIK 994
            +F+VHP W+EAS   ++R  E++F ++
Sbjct: 415 KKFLVHPRWIEASNYFWKRQMEENFTVE 442


>gi|302764346|ref|XP_002965594.1| hypothetical protein SELMODRAFT_167775 [Selaginella moellendorffii]
 gi|300166408|gb|EFJ33014.1| hypothetical protein SELMODRAFT_167775 [Selaginella moellendorffii]
          Length = 411

 Score =  276 bits (705), Expect = 6e-71,   Method: Compositional matrix adjust.
 Identities = 156/349 (44%), Positives = 204/349 (58%), Gaps = 34/349 (9%)

Query: 669 RLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRH----- 723
           R +E +++   RKL LVLDLDHTLLNSA++ EV P     L   E      P        
Sbjct: 44  REDELRQVLGKRKLFLVLDLDHTLLNSARWMEVFPDETAYL---EHTYMNVPEDKIPALS 100

Query: 724 ----------------LFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEM 767
                           L R   M +WTKLRP    FLE ASKLFEM++YTMG ++YA  M
Sbjct: 101 NGAPAVAGVIQPGGGGLHRIHGMQLWTKLRPFAHKFLEEASKLFEMYVYTMGERMYAVTM 160

Query: 768 AKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHN 827
           A +LDP G  F GRVIS+ D            ++KDL+ VLG +SAV+I+DD+  VWP +
Sbjct: 161 AHLLDPTGKFFKGRVISQRDST--------CRQTKDLDIVLGADSAVLILDDTEAVWPKH 212

Query: 828 KLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFF--SH 885
           + NLIV+ERY +F  S RQFGL  PSL + + DE  ++G LA+ L V++R+H  FF  S 
Sbjct: 213 RANLIVMERYHFFQSSCRQFGLENPSLTKAERDESKDEGALANVLKVLQRIHSDFFMESD 272

Query: 886 QSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDD 945
            S    DVR+I +  + +IL+GC++VFSR+FP     P L PLW+     GA C    DD
Sbjct: 273 DSRYTCDVRDITSVVRSEILSGCKLVFSRIFPTDCLEPELTPLWRLCVDLGAECVLAHDD 332

Query: 946 QVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAIK 994
            VTHVVA    TDK  WA    +F+VHP WVEA+  L+RR NE +F ++
Sbjct: 333 SVTHVVALDRFTDKAKWAKEHRKFLVHPAWVEAAHSLWRRPNELEFPVR 381


>gi|357163276|ref|XP_003579679.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
           4-like [Brachypodium distachyon]
          Length = 493

 Score =  274 bits (700), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 139/323 (43%), Positives = 195/323 (60%), Gaps = 13/323 (4%)

Query: 674 KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMW 733
           K +   RKL L+LDLDHTL+NS K H++     + L  +     + P + LF    M M 
Sbjct: 156 KSLLRERKLVLILDLDHTLINSTKLHDISAAERD-LGIQTFASEDAPEKSLFTLEAMQML 214

Query: 734 TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFD 793
           TKLRP +  FL+ AS +FEM++YTMG+K YA E+AK+LDP  V F  +VIS  D      
Sbjct: 215 TKLRPFVCKFLKEASNMFEMYIYTMGDKAYAIEIAKLLDPGNVYFGSKVISNSD------ 268

Query: 794 GDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPS 853
             +R    K L+ VLG E+  +I+DD+  VW  +K NLI++ERY YF  S RQFG    +
Sbjct: 269 CTQR--HQKGLDVVLGAENVAIILDDTEYVWQKHKENLILMERYHYFASSCRQFGFSVKA 326

Query: 854 LLEIDHDERSEDGTLASSLGVIERLHKIFFS---HQSLDDVDVRNILAAEQRKILAGCRI 910
           L E   DER  DG LA++L V++R+H +FF      +L   DVR ++   ++++L GC++
Sbjct: 327 LSESMQDERESDGALATTLDVLKRIHTLFFDSAVETALSSRDVRQVIKKVRQEVLQGCKV 386

Query: 911 VFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFV 970
           VFSRVFP   + P    +W+ AEQ GA+C   +D  VTHVVA   GT+K  WA+   + +
Sbjct: 387 VFSRVFP-SSSRPQDQIIWKMAEQLGAICCADMDSTVTHVVAVDSGTEKARWAVGNNKIL 445

Query: 971 VHPGWVEASALLYRRANEQDFAI 993
           VHP W+EAS   + R  E+DF +
Sbjct: 446 VHPRWIEASNFRWHRQQEEDFPV 468


>gi|9758369|dbj|BAB08870.1| unnamed protein product [Arabidopsis thaliana]
          Length = 1065

 Score =  273 bits (699), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 147/329 (44%), Positives = 203/329 (61%), Gaps = 33/329 (10%)

Query: 680  RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRH------------LFRF 727
            RKL LVLDLDHTLLN+    ++ P          E++  K H H            LF  
Sbjct: 746  RKLYLVLDLDHTLLNTTILRDLKP----------EEEYLKSHTHSLQDGCNVSGGSLFLL 795

Query: 728  PHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD 787
              M M TKLRP + +FL+ AS++F M++YTMG++ YA +MAK+LDPKG  F  RVISR D
Sbjct: 796  EFMQMMTKLRPFVHSFLKEASEMFVMYIYTMGDRNYARQMAKLLDPKGEYFGDRVISR-D 854

Query: 788  DGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQF 847
            DG        V   K L+ VLG ESAV+I+DD+   WP +K NLIV+ERY +F  S RQF
Sbjct: 855  DGT-------VRHEKSLDVVLGQESAVLILDDTENAWPKHKDNLIVIERYHFFSSSCRQF 907

Query: 848  GLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSH--QSLDDVDVRNILAAEQRKIL 905
                 SL E+  DE   DG LA+ L V+++ H +FF +  + + + DVR +L   +++IL
Sbjct: 908  DHRYKSLSELKSDESEPDGALATVLKVLKQAHALFFENVDEGISNRDVRLMLKQVRKEIL 967

Query: 906  AGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALS 965
             GC+IVFSRVFP  +A P  HPLW+ AE+ GA C   +D  VTHVVA  +GT+K  WA+ 
Sbjct: 968  KGCKIVFSRVFPT-KAKPEDHPLWKMAEELGATCATEVDASVTHVVAMDVGTEKARWAVR 1026

Query: 966  TGRFVVHPGWVEASALLYRRANEQDFAIK 994
              ++VVH GW++A+  L+ +  E++F ++
Sbjct: 1027 EKKYVVHRGWIDAANYLWMKQPEENFGLE 1055


>gi|449532013|ref|XP_004172979.1| PREDICTED: LOW QUALITY PROTEIN: RNA polymerase II C-terminal domain
           phosphatase-like 4-like, partial [Cucumis sativus]
          Length = 340

 Score =  273 bits (699), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 146/328 (44%), Positives = 202/328 (61%), Gaps = 12/328 (3%)

Query: 669 RLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP 728
           R +E K++    KL LVLDLDHTLLNS +   +  V +E LR + +   +     LF   
Sbjct: 13  RNKEMKELLQRXKLILVLDLDHTLLNSTELRYL-TVEEEYLRSQTDSLDDVTKGSLFLLN 71

Query: 729 HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD 788
            +   TKLRP + +FL+ ASKLFEM++YTMG + YA EMAK+LDPK   F+ +VISR DD
Sbjct: 72  SVHTMTKLRPFVHSFLKEASKLFEMYIYTMGERRYAFEMAKLLDPKKEYFSSKVISR-DD 130

Query: 789 GDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFG 848
           G            K L+ VLG ESAV+I+DD+   W  +K NLI++ERY +F  S RQFG
Sbjct: 131 GTQ-------KHQKGLDVVLGKESAVLILDDTENAWTKHKENLILMERYHFFASSCRQFG 183

Query: 849 LLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLD--DVDVRNILAAEQRKILA 906
               SL E+ +DE   DG L + L V++++H +FF+  S D  D DVR +L   + ++L 
Sbjct: 184 FNCKSLSELKNDESETDGALTTILKVLKQVHHMFFNEVSGDLVDRDVRQVLKTVRAEVLE 243

Query: 907 GCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALST 966
           GC++VFSRVFP  +     H LW+  EQ G  C+  +D  VTHVVA   GT+K  WAL  
Sbjct: 244 GCKVVFSRVFPT-KFQAENHQLWKMVEQLGGTCSTELDQSVTHVVATDAGTEKSRWALKE 302

Query: 967 GRFVVHPGWVEASALLYRRANEQDFAIK 994
            +F+VHP W+EAS   ++R  E++F ++
Sbjct: 303 KKFLVHPRWIEASNYFWKRQMEENFTVE 330


>gi|413945235|gb|AFW77884.1| CPL3 [Zea mays]
          Length = 533

 Score =  271 bits (694), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 140/323 (43%), Positives = 195/323 (60%), Gaps = 13/323 (4%)

Query: 674 KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMW 733
           K +   RKL L+LDLDHTL+NS K  ++     + L  +    ++ P+R +F    M M 
Sbjct: 155 KNLLRERKLVLILDLDHTLINSTKLQDISSAEKD-LGIQSAASKDDPNRSIFALDLMPML 213

Query: 734 TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFD 793
           TKLRP +  FL+ AS +FEM++YTMG+K YA E+AK+LDP  + F  +VIS  D      
Sbjct: 214 TKLRPFVREFLKEASNMFEMYIYTMGDKAYAIEIAKLLDPSNIYFPSKVISNSD------ 267

Query: 794 GDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPS 853
             +R  K  D+  +LG ES  VI+DD+  VW  +K NLI++ERY +F  S RQFG    S
Sbjct: 268 CTQRHQKGLDV--ILGAESVAVILDDTEYVWQKHKENLILMERYHFFASSCRQFGFGVRS 325

Query: 854 LLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDV---DVRNILAAEQRKILAGCRI 910
           L E   DER  DG LA+ L V++R+H  FF   +  D+   D+R ++   +++IL GC+I
Sbjct: 326 LSESLQDERESDGALATVLDVLKRIHATFFDMAAETDLSSRDIRQVIKTLRKEILQGCKI 385

Query: 911 VFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFV 970
           VFSRVFP     P    +W+ AE  GAVC K +D  VTHVV   LGT+K  W L+  +F+
Sbjct: 386 VFSRVFP-NNTRPQEQMVWKMAEYLGAVCVKDVDPSVTHVVTVDLGTEKARWGLNNKKFL 444

Query: 971 VHPGWVEASALLYRRANEQDFAI 993
           VHP W+EA+   + R  E+DF +
Sbjct: 445 VHPRWIEAANFRWHRQPEEDFPV 467


>gi|145334837|ref|NP_001078764.1| RNA polymerase II C-terminal domain phosphatase-like 4 [Arabidopsis
           thaliana]
 gi|122154038|sp|Q00IB6.1|CPL4_ARATH RecName: Full=RNA polymerase II C-terminal domain phosphatase-like
           4; Short=FCP-like 4; AltName: Full=Carboxyl-terminal
           phosphatase-like 4; Short=AtCPL4; Short=CTD
           phosphatase-like 4
 gi|95115186|gb|ABF55959.1| carboxyl-terminal phosphatase-like 4 [Arabidopsis thaliana]
 gi|332009601|gb|AED96984.1| RNA polymerase II C-terminal domain phosphatase-like 4 [Arabidopsis
           thaliana]
          Length = 440

 Score =  271 bits (694), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 147/329 (44%), Positives = 203/329 (61%), Gaps = 33/329 (10%)

Query: 680 RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRH------------LFRF 727
           RKL LVLDLDHTLLN+    ++ P          E++  K H H            LF  
Sbjct: 121 RKLYLVLDLDHTLLNTTILRDLKP----------EEEYLKSHTHSLQDGCNVSGGSLFLL 170

Query: 728 PHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD 787
             M M TKLRP + +FL+ AS++F M++YTMG++ YA +MAK+LDPKG  F  RVISR D
Sbjct: 171 EFMQMMTKLRPFVHSFLKEASEMFVMYIYTMGDRNYARQMAKLLDPKGEYFGDRVISR-D 229

Query: 788 DGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQF 847
           DG        V   K L+ VLG ESAV+I+DD+   WP +K NLIV+ERY +F  S RQF
Sbjct: 230 DGT-------VRHEKSLDVVLGQESAVLILDDTENAWPKHKDNLIVIERYHFFSSSCRQF 282

Query: 848 GLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSH--QSLDDVDVRNILAAEQRKIL 905
                SL E+  DE   DG LA+ L V+++ H +FF +  + + + DVR +L   +++IL
Sbjct: 283 DHRYKSLSELKSDESEPDGALATVLKVLKQAHALFFENVDEGISNRDVRLMLKQVRKEIL 342

Query: 906 AGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALS 965
            GC+IVFSRVFP  +A P  HPLW+ AE+ GA C   +D  VTHVVA  +GT+K  WA+ 
Sbjct: 343 KGCKIVFSRVFPT-KAKPEDHPLWKMAEELGATCATEVDASVTHVVAMDVGTEKARWAVR 401

Query: 966 TGRFVVHPGWVEASALLYRRANEQDFAIK 994
             ++VVH GW++A+  L+ +  E++F ++
Sbjct: 402 EKKYVVHRGWIDAANYLWMKQPEENFGLE 430


>gi|326518250|dbj|BAK07377.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 488

 Score =  271 bits (693), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 142/329 (43%), Positives = 197/329 (59%), Gaps = 13/329 (3%)

Query: 669 RLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEI-LRKKEEQDREKPHRHLFRF 727
           R  E K +   RKL L+LDLDHTL+NS + H++     ++ ++    ++ + P R LF  
Sbjct: 150 RESEVKNLLRERKLVLILDLDHTLINSTRLHDISAAEMDLGIQTAASKNADDPERSLFTL 209

Query: 728 PHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD 787
             M M TKLRP +  FLE AS +F+M++YTMG+K YA E+AK+LDP  V F  +VIS  D
Sbjct: 210 QGMHMLTKLRPFVRKFLEEASNMFDMYIYTMGDKAYAIEIAKLLDPGNVYFDSKVISNSD 269

Query: 788 DGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQF 847
                   +R  K  D+  VLG +   VIIDD+  VW  +K NLI++ERY YF  S RQF
Sbjct: 270 ------CTQRHQKGLDV--VLGDDKVAVIIDDTEHVWQKHKENLILMERYHYFAASCRQF 321

Query: 848 GLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFS---HQSLDDVDVRNILAAEQRKI 904
           G    SL E+  DER  DG LA+ L V++R+H IFF      +L   DVR ++   ++++
Sbjct: 322 GFSDQSLSELMQDERESDGALATILDVLKRIHTIFFDSGVETALSSRDVRQVIKRVRQEV 381

Query: 905 LAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWAL 964
           L GC++VFSRVFP  +       +W+ AEQ GAVC   +D  VTHVVA   GT+K  WA 
Sbjct: 382 LQGCKLVFSRVFP-SDCRSQDQIMWKMAEQLGAVCCSEVDPSVTHVVAVHAGTEKARWAA 440

Query: 965 STGRFVVHPGWVEASALLYRRANEQDFAI 993
              +F++HP W+EA    + R  E+DF +
Sbjct: 441 GNKKFLLHPRWIEACNYRWHRQPEEDFPV 469


>gi|242087817|ref|XP_002439741.1| hypothetical protein SORBIDRAFT_09g019310 [Sorghum bicolor]
 gi|241945026|gb|EES18171.1| hypothetical protein SORBIDRAFT_09g019310 [Sorghum bicolor]
          Length = 547

 Score =  271 bits (693), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 139/329 (42%), Positives = 196/329 (59%), Gaps = 14/329 (4%)

Query: 669 RLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP 728
           R  + K +   RKL L+LDLDHTL+NS K   +     + L  +    ++ P+R +F   
Sbjct: 154 RCADLKNLLRERKLVLILDLDHTLINSTKLQNISSAEKD-LGIQTAASKDDPNRSIFALE 212

Query: 729 HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD 788
            M + TKLRP +  FL+ AS +FEM++YTMG+K YA E+AK+LDP  + F  +VIS  D 
Sbjct: 213 SMQLLTKLRPFVREFLKEASNMFEMYIYTMGDKAYAIEIAKLLDPSNIYFPLKVISNSDC 272

Query: 789 GDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFG 848
                        K L+ +LG  S  VI+DD+  VW  +K NLI++ERY +F  S R+FG
Sbjct: 273 TKR--------HQKGLDVILGAASVAVILDDTEFVWKKHKENLILMERYHFFASSCREFG 324

Query: 849 LLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQ-SLDDV---DVRNILAAEQRKI 904
               SL E+  DER  DG LA+ L V++R+H IFF      DD+   DVR ++ A +++I
Sbjct: 325 FAVRSLSELMQDERESDGALATVLDVLKRIHAIFFDMAVETDDLSSRDVRQVIKAVRKEI 384

Query: 905 LAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWAL 964
           L GC+IVFSRVFP     P    +W+ AE  GAVC+  +D  VTHVV   LGT+K  W +
Sbjct: 385 LQGCKIVFSRVFP-NNTRPQKQMVWKMAEYLGAVCSTDVDSSVTHVVTVDLGTEKARWGV 443

Query: 965 STGRFVVHPGWVEASALLYRRANEQDFAI 993
           +  +F+VHP W+EA+   + R  E+DF +
Sbjct: 444 ANKKFLVHPRWIEAANFRWHRQPEEDFPV 472


>gi|226497696|ref|NP_001152445.1| CPL3 [Zea mays]
 gi|195656359|gb|ACG47647.1| CPL3 [Zea mays]
          Length = 531

 Score =  271 bits (693), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 140/323 (43%), Positives = 195/323 (60%), Gaps = 13/323 (4%)

Query: 674 KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMW 733
           K +   RKL L+LDLDHTL+NS K  ++     + L  +    ++ P+R +F    M M 
Sbjct: 153 KNLLRERKLVLILDLDHTLINSTKLQDISSAEKD-LGIQSAASKDDPNRSIFALDLMPML 211

Query: 734 TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFD 793
           TKLRP +  FL+ AS +FEM++YTMG+K YA E+AK+LDP  + F  +VIS  D      
Sbjct: 212 TKLRPFVREFLKEASNMFEMYIYTMGDKAYAIEIAKLLDPSNIYFPSKVISNSD------ 265

Query: 794 GDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPS 853
             +R  K  D+  +LG ES  VI+DD+  VW  +K NLI++ERY +F  S RQFG    S
Sbjct: 266 CTQRHQKGLDV--ILGAESVAVILDDTEYVWQKHKENLILMERYHFFASSCRQFGFGVRS 323

Query: 854 LLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDV---DVRNILAAEQRKILAGCRI 910
           L E   DER  DG LA+ L V++R+H  FF   +  D+   D+R ++   +++IL GC+I
Sbjct: 324 LSESLQDERESDGALATVLDVLKRIHATFFDMAAETDLSSRDIRQVIKTLRKEILQGCKI 383

Query: 911 VFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFV 970
           VFSRVFP     P    +W+ AE  GAVC K +D  VTHVV   LGT+K  W L+  +F+
Sbjct: 384 VFSRVFP-NNTRPQEQMVWKMAEYLGAVCVKDVDPSVTHVVTVDLGTEKSRWGLNNKKFL 442

Query: 971 VHPGWVEASALLYRRANEQDFAI 993
           VHP W+EA+   + R  E+DF +
Sbjct: 443 VHPRWIEAANFRWHRQPEEDFPV 465


>gi|307106534|gb|EFN54779.1| hypothetical protein CHLNCDRAFT_134722 [Chlorella variabilis]
          Length = 513

 Score =  270 bits (691), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 151/348 (43%), Positives = 199/348 (57%), Gaps = 28/348 (8%)

Query: 663 QKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVH---------DEILRKK- 712
           + ER R+     ++ + RKL L+LDLDHTLLNS +F EV P           ++ LR + 
Sbjct: 100 EAERVRQ-GTADRLLAHRKLLLILDLDHTLLNSTRFTEVPPQGAVTEQREGGEQALRAQL 158

Query: 713 EEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKL------FEMHLYTMGNKLYATE 766
           E Q +  P   L+  PHM MWTKLRPG+  FLE A         FE+ +YTMG++ YA E
Sbjct: 159 EAQPKGAPM--LYCLPHMRMWTKLRPGVREFLEAAKDRQVGQVGFELAVYTMGDRDYAGE 216

Query: 767 MAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPH 826
           MAK+LDP G LF GR+IS GD    +         KDL+ VLG E  V+I+DD+  VWP 
Sbjct: 217 MAKLLDPAGSLFHGRIISSGDSTQRY--------VKDLDVVLGRERCVLILDDTEGVWPR 268

Query: 827 NKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQ 886
           ++ NL+ +ERY YFP    +FG    SLLE   DE    G LA+ L V+  + + FF   
Sbjct: 269 HRDNLVQIERYLYFPADAARFGFRSQSLLERAVDEEGGGGALATCLRVMSGVQQQFFEQG 328

Query: 887 SLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQ 946
                DVR +L A +R +LA CR++FSRV P+  A+P  HPLWQ A + GA C +     
Sbjct: 329 DPGAADVRPLLGAARRAVLAECRLLFSRVMPLDCADPSAHPLWQLALKLGAECVRETGQG 388

Query: 947 VTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAIK 994
           VTHVVA    TDK  WA   G+ VV P W+   A  ++RA+E  F +K
Sbjct: 389 VTHVVATDT-TDKTRWACGQGKHVVSPSWLWCCAYTWQRADEAGFPVK 435


>gi|357129281|ref|XP_003566293.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
           4-like [Brachypodium distachyon]
          Length = 492

 Score =  268 bits (684), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 140/320 (43%), Positives = 197/320 (61%), Gaps = 12/320 (3%)

Query: 674 KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMW 733
           KK+   RKL L+LDLDHTL+NS + H++     + L  +    ++ P R LF    M M 
Sbjct: 157 KKLLRERKLVLILDLDHTLINSTRLHDISAAEMD-LGIQTAALKDDPDRSLFTLERMHML 215

Query: 734 TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFD 793
           TKLRP +  FL+ AS +FEM++YTMG+K Y+ E+AK+LDP  V F  +VIS  D      
Sbjct: 216 TKLRPFVRRFLKEASNMFEMYIYTMGDKAYSIEVAKLLDPGNVYFGSKVISNSD------ 269

Query: 794 GDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPS 853
             +R  K  D+  VLG ES  VI+DD+  VW  +K NLI++ERY YF  S RQFG    S
Sbjct: 270 CTQRHQKGLDV--VLGAESIAVILDDTEDVWQKHKENLILMERYHYFASSCRQFGFSVRS 327

Query: 854 LLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAE--QRKILAGCRIV 911
           L E+  DER  DG L++ L V++R+H IFF       +  R ++  +  ++++L GC++V
Sbjct: 328 LSELMVDERESDGALSTILDVLKRIHTIFFDSGVETALSSRTLMVIKRVRQEVLQGCKLV 387

Query: 912 FSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVV 971
           FSRVFP   + P    +W+ AE+ GA C  H+D  VTHVVA  +GT+K  WA+   +F++
Sbjct: 388 FSRVFP-SNSCPQDQIIWKMAEKLGASCCAHVDSTVTHVVAVDVGTEKARWAVENKKFLL 446

Query: 972 HPGWVEASALLYRRANEQDF 991
           HP W+EAS   +RR  E+DF
Sbjct: 447 HPRWIEASNYRWRRQPEEDF 466


>gi|307111295|gb|EFN59530.1| hypothetical protein CHLNCDRAFT_138191 [Chlorella variabilis]
          Length = 1156

 Score =  267 bits (682), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 139/335 (41%), Positives = 202/335 (60%), Gaps = 22/335 (6%)

Query: 674 KKMFSARKLCLVLDLDHTLLNSAKFHEVDP-VHDEILRKKEEQDREKP--HRHLFRFPHM 730
           +++    KLCLVLDLDHTLLNSA F EV P +HD +  +   +    P   R LFR   +
Sbjct: 361 EELLGRGKLCLVLDLDHTLLNSATFAEVGPTLHDSLKARAASEAATLPEDQRLLFRIDGI 420

Query: 731 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 790
            MWTKLRPG+  FL+RA++ +++ ++T GN+ YA  + ++LD  G +F  R+I++G +  
Sbjct: 421 KMWTKLRPGVHKFLQRAARYYQLWIHTNGNRAYADSVVRLLDRGGAIFGDRIIAQGAE-- 478

Query: 791 PFDGDERVP--KSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFG 848
               D+ VP    + ++G+   ES  VI+DDS  VW  ++ NL+ VERY YFP SR   G
Sbjct: 479 --RVDQMVPDQAKRLMQGLDERESITVIVDDSHSVWSQHRHNLVAVERYIYFPSSRASLG 536

Query: 849 LLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQS------------LDDVDVRNI 896
           L GPSLL+ + DE  E G L  +L V+ R+H       +              + D R  
Sbjct: 537 LKGPSLLDANRDECPEQGMLMVALSVLVRVHGAVMRALAAPPTVLPGGEVVFQNWDARQA 596

Query: 897 LAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLG 956
           LA E++K+LAG  +VF+RV P+ E  P  HPLW+ A+ FGA C+  +D   THV+A + G
Sbjct: 597 LAQERQKVLAGVHLVFTRVIPL-EMEPESHPLWRLAQSFGARCSGSLDASTTHVIAGASG 655

Query: 957 TDKVNWALSTGRFVVHPGWVEASALLYRRANEQDF 991
           T+KV  A S G++VV P W+E S +L++RA+E+ F
Sbjct: 656 TEKVLSARSMGKWVVTPAWLECSCILWKRAHEERF 690


>gi|356498756|ref|XP_003518215.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
           4-like [Glycine max]
          Length = 428

 Score =  264 bits (674), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 148/336 (44%), Positives = 203/336 (60%), Gaps = 17/336 (5%)

Query: 662 IQKERTRRLE--EQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREK 719
           +  E   RL   + K +   +KL LVLDLDHTLLNS     +      +L + +   R+ 
Sbjct: 104 LHDEEISRLRNTDMKSLLCRKKLYLVLDLDHTLLNSTHLAHLTSEESHLLNQTDSL-RDV 162

Query: 720 PHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA 779
               LF+  HM M TKLRP +  FL+ AS++FEM++YTMG++ YA EMAK+LDP+G  F 
Sbjct: 163 SKGSLFKLEHMNMMTKLRPFVRPFLKEASEMFEMYIYTMGDRPYALEMAKLLDPQGEYFN 222

Query: 780 GRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTY 839
            +VISR DDG            K L+ VLG ESAV+I+DD+   W  +K NLI++ERY +
Sbjct: 223 AKVISR-DDGTQ-------KHQKGLDVVLGQESAVLILDDTEHAWMKHKDNLILMERYHF 274

Query: 840 FPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQS-LDDVDVRNILA 898
           F  S RQFG    SL E+  DE   DG LA  L V++++H +FF  Q   DD DVR +L+
Sbjct: 275 FGSSCRQFGFNCKSLAELKSDENETDGALAKILKVLKQVHCMFFDKQEDFDDRDVRQMLS 334

Query: 899 AEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTD 958
             +R++L+GC I+FSR+  V  A P L  +   AEQ GA C   ID  VTHVVA   GT+
Sbjct: 335 LVRREVLSGCVIIFSRI--VHGAIPSLRKM---AEQMGATCLTEIDPSVTHVVATDAGTE 389

Query: 959 KVNWALSTGRFVVHPGWVEASALLYRRANEQDFAIK 994
           K  WA+   +FVVHP W+EA+   +++  E++F +K
Sbjct: 390 KCRWAVKEKKFVVHPLWIEAANYFWQKQPEENFILK 425


>gi|356564913|ref|XP_003550691.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
           4-like [Glycine max]
          Length = 442

 Score =  264 bits (674), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 147/336 (43%), Positives = 204/336 (60%), Gaps = 17/336 (5%)

Query: 662 IQKERTRRLE--EQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREK 719
           +  E   RL   + K +   +KL LVLDLDHTLLNS    ++      +L + +      
Sbjct: 118 LHDEEISRLRNTDMKSLLGRKKLYLVLDLDHTLLNSTHLAQLTSEELHLLNQTDSLTNVS 177

Query: 720 PHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA 779
               LF+  HM M TKLRP +  FL+ AS++FEM++YTMG++ YA EMAK+LDP+G  F 
Sbjct: 178 KGS-LFKLEHMNMMTKLRPFVRPFLKEASEMFEMYIYTMGDRPYALEMAKLLDPQGEYFN 236

Query: 780 GRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTY 839
            +VISR DDG            K L+ VLG ESAV+I+DD+   W  +K NLI++ERY +
Sbjct: 237 AKVISR-DDGTQ-------KHQKGLDVVLGQESAVIILDDTEHAWMKHKDNLILMERYHF 288

Query: 840 FPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQS-LDDVDVRNILA 898
           F  S RQFG    SL E+  DE   DG LA  L V++++H +FF  Q   DD DVR +L+
Sbjct: 289 FGSSCRQFGFNCKSLAELKSDEDETDGALAKILKVLKQVHCMFFDKQEDFDDQDVRQVLS 348

Query: 899 AEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTD 958
           + +R++L+GC I+FSR+  V  A P L  +   AEQ GA C   ID  VTHVVA   GT+
Sbjct: 349 SVRREVLSGCVIIFSRI--VHGAIPSLRKM---AEQMGATCLTEIDPSVTHVVATDAGTE 403

Query: 959 KVNWALSTGRFVVHPGWVEASALLYRRANEQDFAIK 994
           K  WA+   +FVVHP W+EA+   +++  E++F++K
Sbjct: 404 KCRWAVKEKKFVVHPLWIEAANYFWQKQPEENFSLK 439


>gi|384251210|gb|EIE24688.1| carboxyl-terminal phosphatase-like 4 [Coccomyxa subellipsoidea
           C-169]
          Length = 439

 Score =  263 bits (671), Expect = 4e-67,   Method: Compositional matrix adjust.
 Identities = 150/358 (41%), Positives = 208/358 (58%), Gaps = 38/358 (10%)

Query: 660 AAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREK 719
           +A + ER R+ +  K+  S RKL LVLDLDHTLLNS +F E     +++   +  +  ++
Sbjct: 59  SASEAERVRQ-QSLKRALSNRKLLLVLDLDHTLLNSTRFDEAVGFEEQLAAIQRARPEDQ 117

Query: 720 PHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA 779
           P   L+   HM +WTKLRP +  FLE+A ++ EMH+YT GN  YA EMA++LDP    FA
Sbjct: 118 P-VSLYHLEHMRLWTKLRPYVREFLEKAHEVSEMHIYTHGNAEYAIEMARLLDPTKRFFA 176

Query: 780 GRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTY 839
            R+IS+GD          V   KDL+ VLG E+AVVI+DD+  VWP ++ NL+ VERY +
Sbjct: 177 ERIISQGDST--------VKHVKDLDVVLGAETAVVILDDTAGVWPSHQQNLLQVERYVF 228

Query: 840 FPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFF-SHQSLDDVDVRNILA 898
           FP   R+F L   SLLE+  DE  + G LAS+L    R+H  FF +       DVR  L 
Sbjct: 229 FPACARRFQLNVQSLLELGRDEDEQHGMLASAL----RVHSRFFGASAGGGQQDVRQHLQ 284

Query: 899 AEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAE-----------------------QF 935
           A + ++L+GCRI FSR+ P G+  P  HP WQ A+                       Q 
Sbjct: 285 ALRLQVLSGCRISFSRIIPRGDRFPETHPHWQMAQQHKPIKFAVSNGGSRESQSIEWVQL 344

Query: 936 GAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAI 993
           GAV T  +++  THVVA +  TDKV+WA++  R +V P W+ ASA L+R+ +E  F +
Sbjct: 345 GAVVTLGVEEDTTHVVAAAKDTDKVHWAVANERHIVSPDWLTASACLWRKMDEDRFPV 402


>gi|297793317|ref|XP_002864543.1| hypothetical protein ARALYDRAFT_332090 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297310378|gb|EFH40802.1| hypothetical protein ARALYDRAFT_332090 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 1006

 Score =  260 bits (664), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 140/333 (42%), Positives = 199/333 (59%), Gaps = 46/333 (13%)

Query: 680 RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRH---------------- 723
           RKL LVLDLDHTLLNS    ++ P          E++  K H H                
Sbjct: 691 RKLYLVLDLDHTLLNSTVLRDLKP----------EEEYLKSHTHSLQEPFDFLLISDVSG 740

Query: 724 --LFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGR 781
             LF    M M TKLRP + +FL+ AS++F M++YTMG++ YA +MAK+LDP+G  F  R
Sbjct: 741 GSLFMLEFMHMMTKLRPFVHSFLKEASEMFVMYIYTMGDRAYARQMAKLLDPRGEYFGDR 800

Query: 782 VISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFP 841
           +ISR DDG        V   K L+ VLG ESAV+I+DD+   WP++K NLIV+ERY +F 
Sbjct: 801 IISR-DDGT-------VRHQKSLDVVLGQESAVLILDDTENAWPNHKDNLIVIERYHFFA 852

Query: 842 CSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQ 901
            S RQF     SL E+  DE   DG LA+ L  ++         + + + DVR++L   +
Sbjct: 853 SSCRQFDHKYKSLSELKSDESEPDGALATVLKNVD---------EDISNRDVRSMLKQVR 903

Query: 902 RKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVN 961
           +++L GC++VFSRVFP  +A P  HPLW+ AE+ GA C   +D  VTHVVA  +GT+K  
Sbjct: 904 KEVLKGCKVVFSRVFPT-KAKPEDHPLWKMAEELGATCATEVDASVTHVVAMDVGTEKAR 962

Query: 962 WALSTGRFVVHPGWVEASALLYRRANEQDFAIK 994
           WA+   ++VVH GW++A+  L+++  E+ F+++
Sbjct: 963 WAVREKKYVVHRGWIDAANYLWKKQPEEKFSLE 995


>gi|224142399|ref|XP_002324546.1| predicted protein [Populus trichocarpa]
 gi|222865980|gb|EEF03111.1| predicted protein [Populus trichocarpa]
          Length = 312

 Score =  254 bits (650), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 137/322 (42%), Positives = 197/322 (61%), Gaps = 12/322 (3%)

Query: 672 EQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMG 731
           + K +   +KL L+LDLDHTLLNS +   +  + +E L  + +  ++     LF    M 
Sbjct: 3   DMKNLLRHKKLYLILDLDHTLLNSTQLMHM-TLDEEYLNGQTDSLQDVSKGSLFMLSSMQ 61

Query: 732 MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 791
           M TKLRP + TFL+ AS++FEM++YTMG++ YA EMAK+LDP    F  +VISR D    
Sbjct: 62  MMTKLRPFVRTFLKEASQMFEMYIYTMGDRAYALEMAKLLDPGREYFNAKVISRDD---- 117

Query: 792 FDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLG 851
             G +R  K  D+  VLG ESAV+I+DD+   W  +K NLI++ERY +F  S  QFG   
Sbjct: 118 --GTQRHQKGLDV--VLGQESAVLILDDTENAWMKHKDNLILMERYHFFASSCHQFGFNC 173

Query: 852 PSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQRKILAGCRIV 911
            SL E   DE   +G LAS L V+ ++H+IFF   +L       +L   ++ +L GC+IV
Sbjct: 174 KSLSEQKTDESESEGALASILKVLRKIHQIFFEDHTLS--LALQVLKTVRKDVLKGCKIV 231

Query: 912 FSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVV 971
           FSRVFP  ++    H LW+ AEQ GA C+  +D  VTHVV+   GT+K +WA    +F+V
Sbjct: 232 FSRVFPT-QSQADNHHLWRMAEQLGATCSTELDPSVTHVVSKDSGTEKSHWASKHNKFLV 290

Query: 972 HPGWVEASALLYRRANEQDFAI 993
            PGW+EA+   ++R  E++F++
Sbjct: 291 QPGWIEATNYFWQRQPEENFSV 312


>gi|145346053|ref|XP_001417510.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144577737|gb|ABO95803.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 643

 Score =  253 bits (647), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 147/347 (42%), Positives = 202/347 (58%), Gaps = 32/347 (9%)

Query: 674 KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILR----------KKEEQDREKPHRH 723
           +++ ++RKL LVLDLDHTLLNS    ++  +    LR          K+ E   +   R 
Sbjct: 302 ERLIASRKLALVLDLDHTLLNSVLVPDL-RMDSNWLRNAMRLLDADVKRAEDANDPLKRS 360

Query: 724 LFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVI 783
           +F   H  + TKLRPG+  FLERAS+LFE+H+ TMG++ YA +M ++LDP+     G V 
Sbjct: 361 VFHLQHFDLLTKLRPGVRRFLERASRLFEIHINTMGSQAYADQMVELLDPEKRWIHGTV- 419

Query: 784 SRGDDGDPFDGDERVPKSKDLEGVLG-MESAVVIIDDSVRVWPHNKLNLIVVERYTYFPC 842
            RG  G+   G    P  K L+G L  +  A +I DD+  VW  ++ NL+  ERY +FP 
Sbjct: 420 -RGL-GEMEGGKLWAPAEKTLDGALEHLADACLIFDDTASVWESHRRNLVTCERYLFFPQ 477

Query: 843 SRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFS--------------HQSL 888
           +RRQFGL G SLLEI  DE  ++G L++++ V E +H  +F+               Q  
Sbjct: 478 ARRQFGLSGMSLLEIGQDESEDEGMLSTAMKVFESVHSAYFAGGYDKNVKHKVRALKQHA 537

Query: 889 DDVD-VRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQV 947
            DV  V+ IL A+++K+LA  RIVFSRVFP+ +A+P  HPLW  AE FGA C + + D  
Sbjct: 538 SDVRAVQEILCAQRKKVLADVRIVFSRVFPI-DADPTTHPLWILAEDFGATCGRTLCDDT 596

Query: 948 THVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAI 993
           THVV  +  TDKV  A + G    V P W+E S LL+RRANE  F I
Sbjct: 597 THVVGTASSTDKVKAAKARGNVHAVTPHWLECSMLLWRRANEATFRI 643


>gi|218196728|gb|EEC79155.1| hypothetical protein OsI_19828 [Oryza sativa Indica Group]
          Length = 430

 Score =  251 bits (642), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 127/267 (47%), Positives = 170/267 (63%), Gaps = 12/267 (4%)

Query: 730 MGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 789
           M M TKLRP +  FL+ AS +FEM++YTMG+K YA E+AK+LDP  V F  +VIS  D  
Sbjct: 1   MQMLTKLRPFVRRFLKEASDMFEMYIYTMGDKAYAIEIAKLLDPDNVYFGSKVISNSD-- 58

Query: 790 DPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGL 849
                 +R  K  D+  VLG ES  VI+DD+  VW  +K NLI++ERY YF  S RQFG 
Sbjct: 59  ----CTQRHQKGLDV--VLGDESVAVILDDTEYVWQKHKENLILMERYHYFASSCRQFGF 112

Query: 850 LGPSLLEIDHDERSEDGTLASSLGVIERLHKIFF---SHQSLDDVDVRNILAAEQRKILA 906
              SL E   DER  DG LA+ L V+ER+H IFF     + L   DVR ++   ++++L 
Sbjct: 113 GARSLSETMQDERENDGALATILDVLERIHTIFFDPDDQKPLSSRDVRQVIKRVRQEVLQ 172

Query: 907 GCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALST 966
           GC++VF+RVFP+ +  P    LW+ AEQ GAVC   +D  VTHVVA  LGT+K  WA+S 
Sbjct: 173 GCKLVFTRVFPLHQ-RPQDQMLWKMAEQLGAVCCTDVDSTVTHVVALDLGTEKARWAISN 231

Query: 967 GRFVVHPGWVEASALLYRRANEQDFAI 993
            +F+VHP W+EA+   ++R  E+DF +
Sbjct: 232 KKFLVHPRWIEAANFRWQRQQEEDFPV 258


>gi|359494894|ref|XP_003634864.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
           4-like [Vitis vinifera]
          Length = 278

 Score =  248 bits (632), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 127/273 (46%), Positives = 173/273 (63%), Gaps = 12/273 (4%)

Query: 724 LFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVI 783
           LF    M M TKLRP + TFL+ ASK+FEM++YTMG + YA EMAK+LDP+ V F+ RVI
Sbjct: 9   LFMLNTMHMLTKLRPYVHTFLKEASKMFEMYIYTMGERSYALEMAKLLDPERVYFSSRVI 68

Query: 784 SRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCS 843
           S+ D              K L+ VLG ESAV+I+DD+  VW  +K NLI++ERY +F  S
Sbjct: 69  SQADCTQR--------HQKGLDVVLGQESAVLILDDTESVWQKHKDNLILMERYHFFASS 120

Query: 844 RRQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDV---DVRNILAAE 900
            RQFG    SL E+  DE   DG LA+ L V++R+H +FF  +  DD    DVR ++   
Sbjct: 121 CRQFGFNCKSLSELKSDESEPDGALATVLKVLQRIHSMFFDPELGDDFSGRDVRQVVKRV 180

Query: 901 QRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKV 960
           ++++L GC+IVFSRVFP        H LW+ AEQ GA C   +D  VTHVV+   GT+K 
Sbjct: 181 RKEVLKGCKIVFSRVFPT-RFQAENHHLWRMAEQLGATCATELDPSVTHVVSTDAGTEKS 239

Query: 961 NWALSTGRFVVHPGWVEASALLYRRANEQDFAI 993
            WAL   +F+VHPGW+EA+   +++  E++F +
Sbjct: 240 RWALQEKKFLVHPGWIEAANYFWQKQPEENFPV 272


>gi|359497210|ref|XP_003635453.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
           4-like [Vitis vinifera]
          Length = 278

 Score =  247 bits (630), Expect = 3e-62,   Method: Compositional matrix adjust.
 Identities = 127/273 (46%), Positives = 172/273 (63%), Gaps = 12/273 (4%)

Query: 724 LFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVI 783
           LF    M M TKLRP + TFL+ ASK+FEM++YTMG + YA EMAK+LDP+ V F+ RVI
Sbjct: 9   LFMLNTMHMLTKLRPYVHTFLKEASKMFEMYIYTMGERSYALEMAKLLDPERVYFSSRVI 68

Query: 784 SRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCS 843
           S+ D              K L+ VLG ESAV+I+DD+  VW  +K NLI++ERY +F  S
Sbjct: 69  SQADCTQR--------HQKGLDVVLGQESAVLILDDTESVWQKHKDNLILMERYHFFASS 120

Query: 844 RRQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDV---DVRNILAAE 900
            RQFG    SL E+  DE   DG LA+ L V++R+H +FF  +  DD    DVR ++   
Sbjct: 121 CRQFGFNCKSLSELKSDESEPDGALATVLKVLQRIHSMFFDPELGDDFSGRDVRQVVKRV 180

Query: 901 QRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKV 960
           ++ +L GC+IVFSRVFP        H LW+ AEQ GA C   +D  VTHVV+   GT+K 
Sbjct: 181 RKDVLKGCKIVFSRVFPT-RFQAENHHLWRMAEQLGATCATELDPSVTHVVSTDAGTEKS 239

Query: 961 NWALSTGRFVVHPGWVEASALLYRRANEQDFAI 993
            WAL   +F+VHPGW+EA+   +++  E++F +
Sbjct: 240 RWALQEKKFLVHPGWIEAANYFWQKQPEENFPV 272


>gi|296088193|emb|CBI35709.3| unnamed protein product [Vitis vinifera]
          Length = 638

 Score =  246 bits (629), Expect = 4e-62,   Method: Compositional matrix adjust.
 Identities = 126/267 (47%), Positives = 172/267 (64%), Gaps = 12/267 (4%)

Query: 730 MGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 789
           M M TKLRP + TFL+ ASK+FEM++YTMG + YA EMAK+LDP+ V F+ RVIS+ D  
Sbjct: 1   MHMLTKLRPYVHTFLKEASKMFEMYIYTMGERSYALEMAKLLDPERVYFSSRVISQAD-- 58

Query: 790 DPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGL 849
                 +R    K L+ VLG ESAV+I+DD+  VW  +K NLI++ERY +F  S RQFG 
Sbjct: 59  ----CTQR--HQKGLDVVLGQESAVLILDDTESVWQKHKDNLILMERYHFFASSCRQFGF 112

Query: 850 LGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDV---DVRNILAAEQRKILA 906
              SL E+  DE   DG LA+ L V++R+H +FF  +  DD    DVR ++   ++ +L 
Sbjct: 113 NCKSLSELKSDESEPDGALATVLKVLQRIHSMFFDPELGDDFSGRDVRQVVKRVRKDVLK 172

Query: 907 GCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALST 966
           GC+IVFSRVFP        H LW+ AEQ GA C   +D  VTHVV+   GT+K  WAL  
Sbjct: 173 GCKIVFSRVFPT-RFQAENHHLWRMAEQLGATCATELDPSVTHVVSTDAGTEKSRWALQE 231

Query: 967 GRFVVHPGWVEASALLYRRANEQDFAI 993
            +F+VHPGW+EA+   +++  E++F +
Sbjct: 232 KKFLVHPGWIEAANYFWQKQPEENFPV 258


>gi|296090640|emb|CBI41034.3| unnamed protein product [Vitis vinifera]
          Length = 264

 Score =  246 bits (627), Expect = 6e-62,   Method: Compositional matrix adjust.
 Identities = 125/267 (46%), Positives = 171/267 (64%), Gaps = 12/267 (4%)

Query: 730 MGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 789
           M M TKLRP + TFL+ ASK+FEM++YTMG + YA EMAK+LDP+ V F+ RVIS+ D  
Sbjct: 1   MHMLTKLRPYVHTFLKEASKMFEMYIYTMGERSYALEMAKLLDPERVYFSSRVISQADCT 60

Query: 790 DPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGL 849
                       K L+ VLG ESAV+I+DD+  VW  +K NLI++ERY +F  S RQFG 
Sbjct: 61  QR--------HQKGLDVVLGQESAVLILDDTESVWQKHKDNLILMERYHFFASSCRQFGF 112

Query: 850 LGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDV---DVRNILAAEQRKILA 906
              SL E+  DE   DG LA+ L V++R+H +FF  +  DD    DVR ++   ++++L 
Sbjct: 113 NCKSLSELKSDESEPDGALATVLKVLQRIHSMFFDPELGDDFSGRDVRQVVKRVRKEVLK 172

Query: 907 GCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALST 966
           GC+IVFSRVFP        H LW+ AEQ GA C   +D  VTHVV+   GT+K  WAL  
Sbjct: 173 GCKIVFSRVFPT-RFQAENHHLWRMAEQLGATCATELDPSVTHVVSTDAGTEKSRWALQE 231

Query: 967 GRFVVHPGWVEASALLYRRANEQDFAI 993
            +F+VHPGW+EA+   +++  E++F +
Sbjct: 232 KKFLVHPGWIEAANYFWQKQPEENFPV 258


>gi|224142401|ref|XP_002324547.1| predicted protein [Populus trichocarpa]
 gi|222865981|gb|EEF03112.1| predicted protein [Populus trichocarpa]
          Length = 266

 Score =  243 bits (620), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 124/266 (46%), Positives = 173/266 (65%), Gaps = 11/266 (4%)

Query: 730 MGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 789
           M M TKLRP + TFL+ AS++FEM++YTMG++ YA EMAK+LDP    F  +VISR D  
Sbjct: 5   MQMMTKLRPFVRTFLKEASQMFEMYIYTMGDRAYALEMAKLLDPGREYFNAKVISRDD-- 62

Query: 790 DPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGL 849
               G +R  K  D+  VLG ESAV+I+DD+   W  +K NLI++ERY +F  S  QFG 
Sbjct: 63  ----GTQRHQKGLDV--VLGQESAVLILDDTENAWMKHKDNLILMERYHFFASSCHQFGF 116

Query: 850 LGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSH--QSLDDVDVRNILAAEQRKILAG 907
              SL E   DE   +G LAS L V+ ++H+IFF    +++D  DVR +L   ++ +L G
Sbjct: 117 NCKSLSEQKTDESESEGALASILKVLRKIHQIFFEELEENMDGRDVRQVLKTVRKDVLKG 176

Query: 908 CRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTG 967
           C+IVFSRVFP  ++    H LW+ AEQ GA C+  +D  VTHVV+   GT+K +WAL   
Sbjct: 177 CKIVFSRVFPT-QSQADNHHLWRMAEQLGATCSTELDPSVTHVVSKDSGTEKSHWALKHN 235

Query: 968 RFVVHPGWVEASALLYRRANEQDFAI 993
           +F+V PGW+EA+   ++R  E++FA+
Sbjct: 236 KFLVQPGWIEAANYFWQRQPEENFAV 261


>gi|242063380|ref|XP_002452979.1| hypothetical protein SORBIDRAFT_04g035920 [Sorghum bicolor]
 gi|241932810|gb|EES05955.1| hypothetical protein SORBIDRAFT_04g035920 [Sorghum bicolor]
          Length = 518

 Score =  243 bits (620), Expect = 4e-61,   Method: Compositional matrix adjust.
 Identities = 144/335 (42%), Positives = 193/335 (57%), Gaps = 27/335 (8%)

Query: 669 RLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRF- 727
           R+ + + +  ARKL L+LDLDHTLLNS    ++ P   E         +  P   LFR  
Sbjct: 195 RVSDLETLLRARKLTLILDLDHTLLNSTGLDDLSPA--EQANGLTRHTKGDPTAGLFRLG 252

Query: 728 -PHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG 786
                M TKLRP    FLE+AS +FEM +YT+G++ YA  + K+LDP G  F GRV+S  
Sbjct: 253 RARFRMLTKLRPFARGFLEQASAMFEMSVYTLGDRGYARAVVKLLDPDGAYFGGRVVS-- 310

Query: 787 DDGDPFDGDERVPKS-KDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERYTYFPCSR 844
                   DE   +  K L+ V G E+A VVI+DDS  VWP ++ NLIV++RY YF  S 
Sbjct: 311 -------SDESTRRDRKSLDVVPGAEAAAVVILDDSSHVWPEHQENLIVMDRYLYFADSC 363

Query: 845 RQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDV------DVRNILA 898
           R +G    SL E+  DER  DG LA +L V+ R+H+ FF     D V      DVR ++ 
Sbjct: 364 RTYGCGVSSLAELRRDEREHDGALAVALQVLTRVHQGFF-----DSVLGGRFSDVREVIR 418

Query: 899 AEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTD 958
           A + ++L GC + FSRV P+ E     HP+W+ AEQ GAVCT   D  VTHVVA   GTD
Sbjct: 419 AVRSEVLRGCTVAFSRVIPL-EGVAGDHPMWKLAEQLGAVCTADADATVTHVVALDPGTD 477

Query: 959 KVNWALSTGRFVVHPGWVEASALLYRRANEQDFAI 993
           K  WA    +F+V+P W+ A+++ + R  EQ+F I
Sbjct: 478 KARWARDNCKFLVNPKWIMAASIRWCRPCEQEFPI 512


>gi|242063378|ref|XP_002452978.1| hypothetical protein SORBIDRAFT_04g035900 [Sorghum bicolor]
 gi|241932809|gb|EES05954.1| hypothetical protein SORBIDRAFT_04g035900 [Sorghum bicolor]
          Length = 464

 Score =  236 bits (603), Expect = 4e-59,   Method: Compositional matrix adjust.
 Identities = 132/318 (41%), Positives = 190/318 (59%), Gaps = 17/318 (5%)

Query: 680 RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPH---MGMWTKL 736
           RKL LVLDLDHTLLNS +  ++  +        + +D  + H  LFR  +   + M TKL
Sbjct: 155 RKLILVLDLDHTLLNSTRLQDLSALEQRNGFTPDTED--ELHMELFRLEYSDNVRMLTKL 212

Query: 737 RPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDE 796
           RP +  FL++AS  FEMH+YT+G + YA  +  +LDP GV F GRV+SR +        +
Sbjct: 213 RPFVRGFLDQASSRFEMHVYTLGRQDYAKAVIDLLDPDGVYFRGRVVSRKE------STQ 266

Query: 797 RVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLL 855
           R  KS D+  + G + +AVVI+DD+   WP ++ NLI+++RY YF C+ R+F    PS+ 
Sbjct: 267 RDVKSLDV--IPGADPAAVVILDDTDSAWPGHQDNLILMDRYHYFACTCRKFRYNIPSMA 324

Query: 856 EIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQRKILAGCRIVFSRV 915
           E   DER  DG+LA  LGV+ R+H+ FF     D  DVR ++A  +R++L  C +VFS +
Sbjct: 325 EQARDEREHDGSLAVVLGVLNRIHQAFFDD---DRADVREVIAEVRRQVLPVCTVVFSYL 381

Query: 916 FPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGW 975
               E  P    +W  AE+ GA C K +D+ VTHVVA   GT K  WA   G+F+V+P W
Sbjct: 382 EEYMEDFPEDTLMWTLAERLGAACQKDVDETVTHVVAEDPGTQKAQWAREHGKFLVNPEW 441

Query: 976 VEASALLYRRANEQDFAI 993
           ++A    + R +E+DF +
Sbjct: 442 IKAVNFRWCRVDERDFPV 459


>gi|168012675|ref|XP_001759027.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162689726|gb|EDQ76096.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 389

 Score =  235 bits (600), Expect = 9e-59,   Method: Compositional matrix adjust.
 Identities = 128/324 (39%), Positives = 192/324 (59%), Gaps = 24/324 (7%)

Query: 672 EQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMG 731
           E +++   +KL LV+DLDHT+LNSA+F +V PV    +  + +      H+       +G
Sbjct: 3   ELRRVNKTKKLLLVVDLDHTVLNSARFADV-PVGMTWIAGELQAGGSSLHQ----MTKLG 57

Query: 732 MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 791
           +WTKLRP    FL+ ASKL+EM++YTMG + YA +MAK+LDP   LFA R+IS+ D    
Sbjct: 58  LWTKLRPFAHEFLQEASKLYEMYIYTMGERKYAKKMAKLLDPTRQLFADRIISQNDSTKR 117

Query: 792 FDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLG 851
           +        +KDL+ VLG +SAVVI+DD+  VWP +K NLI++ERY +F  S  QFG+  
Sbjct: 118 Y--------TKDLDVVLGADSAVVILDDTEAVWPSHKSNLILMERYHFFSSSCSQFGVNS 169

Query: 852 PSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQS--LDDVDVRNILAAEQRKILAGCR 909
            SL ++  DE   +GTLA++L  +  +H  +F+ +      + +  ++ + + K+LAGC 
Sbjct: 170 ASLAQLYRDESETEGTLATTLKTLRAIHHEYFNGKVYFFKQLSLFFVIRSLRAKLLAGCN 229

Query: 910 IVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRF 969
           +V           P +HP WQ   + GA C+   D   THVVA   GTD+  WA     F
Sbjct: 230 VVL---------GPEIHPFWQLPAELGARCSTFCDHTTTHVVALDPGTDQALWAKEHDVF 280

Query: 970 VVHPGWVEASALLYRRANEQDFAI 993
           +VHP WV+A++ L+ R  E+D+ +
Sbjct: 281 LVHPRWVDATSYLWSRPPEEDYPV 304


>gi|125541461|gb|EAY87856.1| hypothetical protein OsI_09278 [Oryza sativa Indica Group]
          Length = 420

 Score =  234 bits (598), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 134/327 (40%), Positives = 184/327 (56%), Gaps = 18/327 (5%)

Query: 676 MFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTK 735
           +  ARKL LV+DLDHTL+NS +F  +    DE      E+  +   R LFR     M TK
Sbjct: 101 LLRARKLILVVDLDHTLINSTRFAHLSD--DEKANGFTERTGDDRSRGLFRMGLFRMITK 158

Query: 736 LRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGD 795
           LRP +  FL  AS +FEMH+YT+GN+ YAT +AK+LDP G  F  R+IS G+        
Sbjct: 159 LRPFVHEFLREASAMFEMHVYTLGNRNYATAVAKLLDPDGAYFGERIISSGESSQ----- 213

Query: 796 ERVPKSKDLEGVLG----ME-SAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLL 850
              P  K L  V G    ME +AVVI+DD+  VW   + NLI +ERY YF  SR +FG+ 
Sbjct: 214 ---PDRKSLGDVFGWAPEMERAAVVILDDTAEVWKGYRDNLIEMERYLYFASSRGKFGIA 270

Query: 851 GPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQ--SLDDVDVRNILAAEQRKILAGC 908
             SL E + DE   +G LA +L V+ R+H  FFS    S    DVR ++   +R++L GC
Sbjct: 271 ARSLAERNRDESEREGALAVALRVLRRVHGEFFSGSVCSGSFADVREVIRQARREVLRGC 330

Query: 909 RIVFSRVFPVGEAN-PHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTG 967
            + F+ V P G+      HP+W+ AEQ GA C   + + VTHVVA    T K  WA + G
Sbjct: 331 TVAFTGVIPSGDGGRASDHPVWRKAEQLGATCADDVGEGVTHVVAGKPVTGKALWAQTHG 390

Query: 968 RFVVHPGWVEASALLYRRANEQDFAIK 994
           +F+V   W+ A+   + +  E+ + +K
Sbjct: 391 KFLVDTEWINAAHFRWSKPEERMYPVK 417


>gi|47497024|dbj|BAD19077.1| phosphatase-like [Oryza sativa Japonica Group]
 gi|47497233|dbj|BAD19278.1| phosphatase-like [Oryza sativa Japonica Group]
 gi|125584004|gb|EAZ24935.1| hypothetical protein OsJ_08715 [Oryza sativa Japonica Group]
          Length = 420

 Score =  230 bits (587), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 133/327 (40%), Positives = 183/327 (55%), Gaps = 18/327 (5%)

Query: 676 MFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTK 735
           +  ARKL LV+DLDHTL+NS +F  +    DE      E+  +   R LFR     M TK
Sbjct: 101 LLRARKLILVVDLDHTLINSTRFAHLSD--DEKANGFTERTGDDRSRGLFRMGLFRMITK 158

Query: 736 LRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGD 795
           LRP +  FL  AS +FEMH+YT+GN+ YAT +AK+LDP G  F  R+IS G+        
Sbjct: 159 LRPFVHEFLREASAMFEMHVYTLGNRNYATAVAKLLDPDGAYFGERIISSGESSQ----- 213

Query: 796 ERVPKSKDLEGVLG----ME-SAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLL 850
              P  K L  V G    ME +AVVI+DD+  VW   + NLI +ERY YF  SR +FG+ 
Sbjct: 214 ---PDRKSLGDVFGWAPEMERAAVVILDDTAEVWKGYRDNLIEMERYLYFASSRGKFGIA 270

Query: 851 GPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQ--SLDDVDVRNILAAEQRKILAGC 908
             SL E + DE   +G LA +L V+ R+H  FFS    S    DVR ++   +R++L GC
Sbjct: 271 VRSLAERNRDESEREGALAVALRVLRRVHGEFFSGSVCSGSFADVREVIRQARREVLRGC 330

Query: 909 RIVFSRVFPVGEAN-PHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTG 967
            + F+ V P G+      HP+W+ AEQ GA C   + + VTH VA    T K  WA + G
Sbjct: 331 TVAFTGVIPSGDGGRASDHPVWRRAEQLGATCADDVGEGVTHFVAGKPVTRKALWAQTHG 390

Query: 968 RFVVHPGWVEASALLYRRANEQDFAIK 994
           +F+V   W+ A+   + +  E+ + +K
Sbjct: 391 KFLVDTEWINAAHFRWSKPEERMYPVK 417


>gi|308802952|ref|XP_003078789.1| putative transcription regulator CPL1 (ISS) [Ostreococcus tauri]
 gi|116057242|emb|CAL51669.1| putative transcription regulator CPL1 (ISS) [Ostreococcus tauri]
          Length = 457

 Score =  230 bits (586), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 136/340 (40%), Positives = 189/340 (55%), Gaps = 24/340 (7%)

Query: 672 EQKKMFSARKLCLVLDLDHTLLNSAKF----HEVDPVHDEILR-----KKEEQDREKPHR 722
           E  ++  ARKL LVLDLDHTLLNS        E + + + +        + E+  +   R
Sbjct: 121 ELSRLIKARKLALVLDLDHTLLNSVLVPSLRTEANSLQNAMRLLDHDVARAERTGDPLQR 180

Query: 723 HLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 782
             F  PH  ++TKLRPG+ +FLERASKLFE+H+ TMG++ YA +M  +LDP      G V
Sbjct: 181 SCFHLPHFDLFTKLRPGVRSFLERASKLFEIHISTMGSQAYADQMVALLDPAKKWINGTV 240

Query: 783 ISRGDDGDPFDGDERVPKSKDLE--GVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
              G+  +   G    P+ K L+  G+  +    VI DD+  VW  N  +L   ERY +F
Sbjct: 241 KGLGEMEN---GRLIAPRYKSLDDCGLGELTDVSVIFDDTTDVWAQNLKSLFTCERYLFF 297

Query: 841 PCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDVD------VR 894
           P +RRQFGLLG SLLE+  DE   +G L +++ V E +H  +F  +  D +       ++
Sbjct: 298 PQARRQFGLLGSSLLEVGQDESESEGMLMTAINVFESVHAEYFKRR--DALKGKKSPCMQ 355

Query: 895 NILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANS 954
           +IL   ++ +L+G  +VFSRVFP+    P   PLW  AE FGA C+  I    THVV  S
Sbjct: 356 DILEERRKVVLSGVHVVFSRVFPL-HVKPEEQPLWILAENFGANCSSEITSHTTHVVGTS 414

Query: 955 LGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAI 993
             T KV  AL  G    V P W+E S L +RRA+E++F I
Sbjct: 415 KATAKVREALKRGGIHAVTPHWLECSMLFWRRASEKNFTI 454


>gi|308802003|ref|XP_003078315.1| CTD phosphatase-like protein 3 (ISS) [Ostreococcus tauri]
 gi|116056766|emb|CAL53055.1| CTD phosphatase-like protein 3 (ISS) [Ostreococcus tauri]
          Length = 480

 Score =  229 bits (585), Expect = 4e-57,   Method: Compositional matrix adjust.
 Identities = 139/349 (39%), Positives = 200/349 (57%), Gaps = 30/349 (8%)

Query: 665 ERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD------RE 718
           E+ +R EE++++    KL L+LDLDHTLLNSA+F E+     ++L +   Q+      RE
Sbjct: 142 EKAKR-EEKERVLKDGKLTLILDLDHTLLNSAQFKELTQEQHDLLHQCIAQEANGLAERE 200

Query: 719 KPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLF 778
           +P  +  R  HMG +TKLRP ++ FLE  S++ + ++YTMG+K YA EM K++DP+G +F
Sbjct: 201 RPMLYCLR--HMGFFTKLRPHVFEFLEEVSQICQPYVYTMGDKAYAKEMVKLIDPEGKIF 258

Query: 779 AGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYT 838
            GRVIS        + D      KDL+ VLG E++ VI+DD+ RVWP N  NLI ++RY 
Sbjct: 259 HGRVIS--------NNDSTSSHVKDLDIVLGGETSAVIVDDTERVWPANHGNLIRLDRYH 310

Query: 839 YFPCSRRQFGLLGPSLLEIDHDERSEDGTLASS------LGVIERLHKIFFSHQSLDDVD 892
           +FP S   F   G S++E    +  E G++ +       L VI+  H+ +F H S+++ D
Sbjct: 311 FFPSSAASFQQKGQSVMERSMVDEGELGSMGARAVLLDVLAVIQSAHRSYFKHASIEEPD 370

Query: 893 VRNILAAEQRKI--LAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHID---DQV 947
           VR +L    R    L+G + V S V P+ + NP  HPL   A   GA     I+   D V
Sbjct: 371 VRTLLVKPDRIDLPLSGVKFVMSGVTPLSDRNPERHPLRLLASTLGAEFVSSIERDGDSV 430

Query: 948 THVVANSLGTDKVNWALSTGR--FVVHPGWVEASALLYRRANEQDFAIK 994
           THV+A S GTDKV WA  TG    +V P W+ A A    R +E  + I+
Sbjct: 431 THVIARSSGTDKVKWAKKTGGRVLIVEPSWLVACAQANTRVSESLYPIE 479


>gi|357450477|ref|XP_003595515.1| RNA polymerase II C-terminal domain phosphatase-like protein
           [Medicago truncatula]
 gi|355484563|gb|AES65766.1| RNA polymerase II C-terminal domain phosphatase-like protein
           [Medicago truncatula]
          Length = 382

 Score =  229 bits (583), Expect = 7e-57,   Method: Compositional matrix adjust.
 Identities = 135/326 (41%), Positives = 193/326 (59%), Gaps = 19/326 (5%)

Query: 674 KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR-HLFRFPHMGM 732
           + + + RKLCLVLDLDHTLLN+   H + P  +E+  K      E   R  LF   H   
Sbjct: 70  RNLLNRRKLCLVLDLDHTLLNTTSLHRLSP--EEMHLKTCTDSLEDIARGRLFVLEHRQR 127

Query: 733 WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPF 792
             KLRP + TFL+ ASK+FEM++YTMG++ Y+ EMA++LDP+G  F  +VISR DDG   
Sbjct: 128 MAKLRPFVRTFLKEASKMFEMYIYTMGDRRYSLEMARLLDPQGKFFKDKVISR-DDGTEM 186

Query: 793 DGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGP 852
                  K KDL  VLG ES+++I+DD+ +VW  +K NLI++ERY +F  S ++F L   
Sbjct: 187 -------KEKDLNLVLGTESSILILDDNKKVWRMHKDNLILMERYHFFNSSCQEFDLNCK 239

Query: 853 SLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLD--DVDVRNILAAEQRKILAGCRI 910
           SL E+  DE   DG LA  L V+  ++  FF     D  D DVR +L++ + ++L+GC I
Sbjct: 240 SLAELHIDENETDGALARILKVLRHINSKFFDELQGDLVDRDVRQVLSSLRGEVLSGCII 299

Query: 911 VFSRVFPVGEANPH-LHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRF 969
           VFS  F     N H L  L + AE+ GA C   +   VTH VAN L T++  WA    +F
Sbjct: 300 VFSCAF-----NGHDLRKLRRIAERLGATCLTELGPTVTHAVANELVTEESMWAEKENKF 354

Query: 970 VVHPGWVEASALLYRRANEQDFAIKP 995
           +V+  W+EAS    ++  E+++ ++ 
Sbjct: 355 LVNRRWLEASNFFLQKQPEENYIVRK 380


>gi|357501219|ref|XP_003620898.1| RNA polymerase II C-terminal domain phosphatase-like protein
           [Medicago truncatula]
 gi|355495913|gb|AES77116.1| RNA polymerase II C-terminal domain phosphatase-like protein
           [Medicago truncatula]
          Length = 720

 Score =  227 bits (579), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 135/332 (40%), Positives = 196/332 (59%), Gaps = 19/332 (5%)

Query: 664 KERTR-RLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR 722
           KE +R R  + K + + RKLCLVLDLDHTLLN+   H + P  +E+  K      E   +
Sbjct: 198 KEISRVRSRDVKNLLNRRKLCLVLDLDHTLLNTTSLHRLSP--EEMHLKTHTDSLEDISK 255

Query: 723 -HLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGR 781
             LF   H+ + TKLRP + TFL+ AS++FEM++YTMG++ Y+ EMA++LDP+G  F  +
Sbjct: 256 GSLFMLEHVQVMTKLRPFVRTFLKEASEMFEMYIYTMGDRQYSLEMARLLDPQGEYFKDK 315

Query: 782 VISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFP 841
           VISR DDG            KDL+ VLG E+++VI+DD   VWP  + NLI++ERY +F 
Sbjct: 316 VISR-DDGTQ-------KNVKDLDLVLGTENSIVILDDKEEVWPKYRDNLILMERYHFFN 367

Query: 842 CSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLD--DVDVRNILAA 899
            S + FGL   SL  ++ DE   DG LA  L V+ +++  FF     D  D DVR +L++
Sbjct: 368 SSCQDFGLQCKSLAALNIDENEIDGALAKILEVLRQINYKFFDELQGDLVDRDVRQVLSS 427

Query: 900 EQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDK 959
            + ++L GC IVFS  F     +  L  L + AE+ GA C K +D  VTHV+     T +
Sbjct: 428 FRGEVLRGCVIVFSLNF-----HGDLRILRRIAERLGATCLKKLDPTVTHVIGTDFVTKE 482

Query: 960 VNWALSTGRFVVHPGWVEASALLYRRANEQDF 991
             WA+   +F+V   W+EA+    ++  E++F
Sbjct: 483 SRWAVQEKKFLVSRRWLEAANFFLQKQPEENF 514


>gi|413924219|gb|AFW64151.1| hypothetical protein ZEAMMB73_480827 [Zea mays]
          Length = 490

 Score =  227 bits (578), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 131/321 (40%), Positives = 179/321 (55%), Gaps = 16/321 (4%)

Query: 676 MFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGM--- 732
           +   RKL LVLDLD TL+NSA+    D    E          +KPH  LFR  +      
Sbjct: 153 LMRERKLILVLDLDSTLVNSARL--CDFSAQEKRNGFTRYTGDKPHMDLFRLKYSNKARK 210

Query: 733 WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPF 792
            TKLRP +  FLE+AS +FEMH+YT+  + YA  +  +LDP GV F GRV+SR D     
Sbjct: 211 LTKLRPFVRGFLEQASSMFEMHVYTLAKRAYAKAVIDLLDPNGVYFGGRVVSRKD----- 265

Query: 793 DGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGP 852
               R  KS D+  + G +   V+I D   VWP ++ NLI+++RY YF  + R+F    P
Sbjct: 266 -STRRDMKSLDV--IPGADPVAVVILDDTDVWPAHQDNLILMDRYHYFASTCRKFRYDIP 322

Query: 853 SLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQRKILAGCRIVF 912
           SL E   DER +D +LA  L V+ R+H+ FF     D  DVR ++   +R++L  C I F
Sbjct: 323 SLAEQGRDEREQDNSLAVVLNVLRRIHQDFFDG---DQADVREVIREVRRQVLPECTIAF 379

Query: 913 SRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVH 972
           S +    E  P    +W  AE+ GAVC K +D+ VTHVVA   GT K  WA   G+F+V+
Sbjct: 380 SYLDDCMEDFPENTLMWTLAERLGAVCRKDVDETVTHVVAEDPGTQKAQWARDHGKFLVN 439

Query: 973 PGWVEASALLYRRANEQDFAI 993
           P W++AS   + R +EQ F +
Sbjct: 440 PEWIKASGFRWCRVDEQGFPV 460


>gi|226498568|ref|NP_001149751.1| CPL3 [Zea mays]
 gi|195631558|gb|ACG36674.1| CPL3 [Zea mays]
          Length = 493

 Score =  226 bits (577), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 130/321 (40%), Positives = 179/321 (55%), Gaps = 16/321 (4%)

Query: 676 MFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGM--- 732
           +   RKL LVLDLD TL+NSA+    D    E          +KPH  LFR  +      
Sbjct: 156 LMRERKLILVLDLDSTLVNSARL--CDFSAQEKRNGFTRYTGDKPHMDLFRLKYSNKARK 213

Query: 733 WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPF 792
            TKLRP +  FLE+AS +FEMH+YT+  + YA  +  +LDP GV F GRV+SR D     
Sbjct: 214 LTKLRPFVRGFLEQASSMFEMHVYTLAKRAYAKAVIDLLDPNGVYFGGRVVSRKD----- 268

Query: 793 DGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGP 852
               R  KS D+  + G +   V+I D   VWP ++ NLI+++RY YF  + R+F    P
Sbjct: 269 -STRRDMKSLDV--IPGADPVAVVILDDTDVWPAHQDNLILMDRYHYFASTCRKFRYDIP 325

Query: 853 SLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQRKILAGCRIVF 912
           SL E   DER +D +LA  L V+ R+H+ FF     D  DVR ++   +R++L  C + F
Sbjct: 326 SLAEQGRDEREQDNSLAVVLNVLRRIHQDFFDG---DQADVREVIREVRRQVLPECTVAF 382

Query: 913 SRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVH 972
           S +    E  P    +W  AE+ GAVC K +D+ VTHVVA   GT K  WA   G+F+V+
Sbjct: 383 SYLDDCMEDFPENTLMWTLAERLGAVCRKDVDETVTHVVAEDPGTQKAQWARDHGKFLVN 442

Query: 973 PGWVEASALLYRRANEQDFAI 993
           P W++AS   + R +EQ F +
Sbjct: 443 PEWIKASGFRWCRVDEQGFPV 463


>gi|147774299|emb|CAN76945.1| hypothetical protein VITISV_002430 [Vitis vinifera]
          Length = 641

 Score =  222 bits (566), Expect = 7e-55,   Method: Compositional matrix adjust.
 Identities = 118/245 (48%), Positives = 156/245 (63%), Gaps = 12/245 (4%)

Query: 730 MGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 789
           M M TKLRP + TFL+ ASK+FEM++YTMG + YA EMAK+LDP+ V F+ RVIS+ D  
Sbjct: 1   MHMLTKLRPYVHTFLKEASKMFEMYIYTMGERSYALEMAKLLDPERVYFSSRVISQAD-- 58

Query: 790 DPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGL 849
                 +R    K L+ VLG ESAV+I+DD+  VW  +K NLI++ERY +F  S RQFG 
Sbjct: 59  ----CTQR--HQKGLDVVLGQESAVLILDDTESVWQKHKDNLILMERYHFFASSCRQFGF 112

Query: 850 LGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDV---DVRNILAAEQRKILA 906
              SL E+  DE   DG LA+ L V++R+H +FF  +  DD    DVR ++   ++ +L 
Sbjct: 113 NCKSLSELKSDESEPDGALATVLKVLQRIHSMFFDPELGDDFSGRDVRQVVKRVRKDVLK 172

Query: 907 GCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALST 966
           GC+IVFSRVFP        H LW+ AEQ GA C   +D  VTHVV+   GT+K  WAL  
Sbjct: 173 GCKIVFSRVFPT-RFQAENHHLWRMAEQLGATCATELDPSVTHVVSTDAGTEKSRWALQE 231

Query: 967 GRFVV 971
            +F+V
Sbjct: 232 KKFLV 236


>gi|302816075|ref|XP_002989717.1| hypothetical protein SELMODRAFT_23521 [Selaginella moellendorffii]
 gi|302824047|ref|XP_002993670.1| hypothetical protein SELMODRAFT_23523 [Selaginella moellendorffii]
 gi|300138493|gb|EFJ05259.1| hypothetical protein SELMODRAFT_23523 [Selaginella moellendorffii]
 gi|300142494|gb|EFJ09194.1| hypothetical protein SELMODRAFT_23521 [Selaginella moellendorffii]
          Length = 312

 Score =  211 bits (536), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 129/319 (40%), Positives = 181/319 (56%), Gaps = 14/319 (4%)

Query: 680 RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLF-RFPHMGMWTKLRP 738
           RKL LVLDLDHTL+NSA F EV       L     +D  K    L  +   + +WTK+RP
Sbjct: 4   RKLMLVLDLDHTLVNSASFDEVCAEEKPFLESMYARDPPKGRSKLLHKLDDLQLWTKIRP 63

Query: 739 GIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERV 798
               FL +ASKLF++++YTMG ++YA  M K+LDP GVLF G ++SR +D D  D  +R 
Sbjct: 64  FALEFLAQASKLFDLYVYTMGTRIYAEAMLKLLDPTGVLFKG-LVSR-NDNDLTDHRDR- 120

Query: 799 PKSKDLEGVLGMESAVVIIDDSVRVWPHNK-LNLIVVERYTYFPCSRRQFGL-LGPSLLE 856
              KDL+ VLG ES+V+I+DD    WP  +  NLI ++RY +F  S + FG     SL  
Sbjct: 121 ---KDLDTVLGQESSVLIVDDLPEAWPEEQHKNLIQIDRYHFFSSSCKSFGFDESSSLAR 177

Query: 857 IDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDV-DVRNILAAEQRKILAGCRIVFSRV 915
              DE    G+LAS L  +E +H+ FF +     + DVR+ ++  +  IL GC++ FS V
Sbjct: 178 RGIDESHSGGSLASLLQGLETIHRDFFQYGEFSFLEDVRDTVSELRSHILEGCKLAFSSV 237

Query: 916 FPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGW 975
            P+   +     LW   E  GA C   IDD VTHVVA    + +  WA+  G+ +V+P W
Sbjct: 238 VPIDCED----SLWILCEGLGAECVLEIDDSVTHVVAMDPESARARWAVENGKHLVNPSW 293

Query: 976 VEASALLYRRANEQDFAIK 994
           + A+A    R  E +F ++
Sbjct: 294 MRAAAFRLGRPRESEFQVR 312


>gi|242066826|ref|XP_002454702.1| hypothetical protein SORBIDRAFT_04g035880 [Sorghum bicolor]
 gi|241934533|gb|EES07678.1| hypothetical protein SORBIDRAFT_04g035880 [Sorghum bicolor]
          Length = 462

 Score =  204 bits (518), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 128/325 (39%), Positives = 188/325 (57%), Gaps = 24/325 (7%)

Query: 676 MFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP--HMGMW 733
           +F  RKL LVLDLD TLLNSA+  +   V +E      +   +K    +FR    ++GM 
Sbjct: 124 LFRERKLILVLDLDRTLLNSARL-DAFSVGEEWFGFTPDTG-DKVDMDIFRLDSDNLGML 181

Query: 734 TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFD 793
           TKLRP +     R S +FEMHLYT+GN +YA     +LDP GV F GRV+SR D+     
Sbjct: 182 TKLRPFV-----RGS-MFEMHLYTLGNLVYAKAAIHLLDPNGVYFGGRVVSRDDESTQ-- 233

Query: 794 GDERVPKSKDLEGVLGME--SAVVI--IDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGL 849
                  +K L+ + G +  +AV++  +DD+   WP ++ NLI+  RY YF  + R+   
Sbjct: 234 -----GGTKSLDVIPGADPVAAVILDALDDTDVAWPEHQDNLILTNRYRYFASTCRKSRH 288

Query: 850 LGPSLLEIDHDERSE-DGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQRKILAGC 908
             PSL E+  DE+ E  G+LA +LGV++R+H  FF  +     DVR ++A  + ++L GC
Sbjct: 289 DIPSLAELRRDEKGEHGGSLAVALGVLKRVHDAFFDGRP--HADVREVIAELRGQVLRGC 346

Query: 909 RIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGR 968
            + FS +    E +P    LW  AE+ GAVC K +D+ VTHVVA   GT K  WA   G+
Sbjct: 347 TVAFSYLEQRMEDSPDDTRLWTLAERLGAVCRKDVDETVTHVVAEDPGTQKAQWAREHGK 406

Query: 969 FVVHPGWVEASALLYRRANEQDFAI 993
           F+V+P W++A++  + R + Q+F +
Sbjct: 407 FLVNPEWIKAASFRWCRQDPQEFPV 431


>gi|255081919|ref|XP_002508178.1| predicted protein [Micromonas sp. RCC299]
 gi|226523454|gb|ACO69436.1| predicted protein [Micromonas sp. RCC299]
          Length = 318

 Score =  202 bits (515), Expect = 6e-49,   Method: Compositional matrix adjust.
 Identities = 118/290 (40%), Positives = 169/290 (58%), Gaps = 23/290 (7%)

Query: 722 RHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGR 781
           R L     + +WTKLRPG+  FL + + +FE+H+ TMG + YA EM +++DP      G 
Sbjct: 25  RTLHFVERLQIWTKLRPGVKKFLRQVASMFEVHVITMGTQSYADEMRQLIDPGRQHIKGS 84

Query: 782 VISRGDDGDPFDGDERVPKSKDLEGVL-GMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           VI  G   D F G+ +    K L+G L G++S  V++DD V VWP ++ NLI ++RY YF
Sbjct: 85  VIGLGQ-MDEF-GELQPADKKRLDGELSGLDSIAVVLDDHVGVWPDHEENLIEIDRYLYF 142

Query: 841 PCSRRQFGLL--GPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQS----------- 887
           P + +QFG+   G SLLE   DE ++  TLA++  V+ R+H+ FF+ ++           
Sbjct: 143 PSALKQFGVWRNGASLLEKKVDEIADRSTLAAAFEVLRRVHQDFFAERAGHLALANKKAK 202

Query: 888 ------LDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTK 941
                 L  V V  ILA E++K+LAG  +VFS VFP+ +A PH   +W+ AEQFGA C  
Sbjct: 203 DAAMAELARVTVPEILALEKKKVLAGTGLVFSGVFPL-DAPPHEQKMWRLAEQFGARCET 261

Query: 942 HIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDF 991
                 +HVVA + GT K  WA   GR VV P W+  SA L+ +A+E+ F
Sbjct: 262 QPGPNTSHVVAKTWGTGKCQWAKENGRHVVSPDWLFCSAFLWSKADERAF 311


>gi|424513770|emb|CCO66392.1| predicted protein [Bathycoccus prasinos]
          Length = 546

 Score =  187 bits (475), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 137/413 (33%), Positives = 199/413 (48%), Gaps = 72/413 (17%)

Query: 637 GAHPQSAWGDVEHLFEGYDDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSA 696
           GA+ ++A   V +L EG     K      R  + EE+    +  KL LVLDLDHTLLNS 
Sbjct: 147 GANAETA---VRYLHEGLTVSDKLL----REAKNEERMATLNQGKLFLVLDLDHTLLNSC 199

Query: 697 KFHEVDPVHDEIL-RKKEEQDREKPHRH---------------------LFRFPHMGMWT 734
           +F E++    E L RK E+++ E   R                      L+   H   +T
Sbjct: 200 RFDELNDEERESLDRKVEKREEEDELRSKLLGLVGGGDAGGGRRPRFPDLYCLSHFSTYT 259

Query: 735 KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDG 794
           KLRP ++ FLE+ASK+  MH+YTMG+K YA EMA ++DP+G  F GR+I         + 
Sbjct: 260 KLRPYVFEFLEQASKICRMHVYTMGDKNYAHEMASLIDPEGKYFHGRIIG--------NS 311

Query: 795 DERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFG-LLGPS 853
           D    K+KDL+ VLG +   +I+DD+ RVWP +  NLI V+RY +F  S   F  +   S
Sbjct: 312 DSTCSKTKDLDIVLGGDDCTMIVDDTSRVWPRHARNLIRVDRYHFFRKSATSFREMEKSS 371

Query: 854 LLEIDHDERSEDG---------TLASSLGVIERLHKIF-FS----------HQSLDDVDV 893
           ++E   DE   +           L   L V+   H++  FS          H+ +  V  
Sbjct: 372 VMERGLDEGEAEEEGAPAKHREVLKDVLAVLTVAHRMMAFSDGSGHREKKDHKDVRQVFE 431

Query: 894 RNILAAEQRK--------------ILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVC 939
           RN +A+E                 +L GC I+ S + P  +  P  HPL   A   GA  
Sbjct: 432 RNGMASEGENGGNVESVDPASSSLLLKGCVILPSGITPSNDERPDRHPLLLVAVGLGATI 491

Query: 940 TKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFA 992
              ++D VTHV+A +  T+KV W    G F+V+  W+   A+   + +E +FA
Sbjct: 492 ATAMNDNVTHVLARADNTEKVKWGRKRGLFIVNGNWLRECAMQNAKLDEHNFA 544


>gi|302793512|ref|XP_002978521.1| hypothetical protein SELMODRAFT_418187 [Selaginella moellendorffii]
 gi|300153870|gb|EFJ20507.1| hypothetical protein SELMODRAFT_418187 [Selaginella moellendorffii]
          Length = 346

 Score =  186 bits (472), Expect = 5e-44,   Method: Compositional matrix adjust.
 Identities = 128/331 (38%), Positives = 174/331 (52%), Gaps = 23/331 (6%)

Query: 669 RLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRH--LFR 726
           R E  +K+   +KL LVLDLDHTLLNSA F +VD      L K  +   + P R   L +
Sbjct: 25  RKEYTQKVLQQQKLILVLDLDHTLLNSASFSKVDEEERLYLEKIYDWQEKAPKRRKLLHK 84

Query: 727 FPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG 786
              + +WTK+RP  + FLE ASK F++H+YT G ++YA  MAK+LDP G LF G + SR 
Sbjct: 85  VESLQVWTKIRPFAFKFLEEASKFFDLHIYTNGREIYAETMAKLLDPTGSLFKGHIFSR- 143

Query: 787 DDGDPFDGDERVPKS-KDLEGVLGMESAVVIIDDSVRVWP--HNKLNLI-VVERYTYFPC 842
                   D    K+ KDL+ V G ES  +I+DDS  VWP  H+K NLI V +RY +F  
Sbjct: 144 --------DHNCMKAMKDLDTVPGDESITLIVDDSDCVWPKKHHK-NLIPVYDRYLFFRS 194

Query: 843 SRRQFGLL-GPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQ 901
           S   FGL    SL     DE +   TLA  L  ++R+H  FF        DVR  +   +
Sbjct: 195 STGLFGLRESSSLTSKKKDEVATKATLAKLLEGLKRIHSEFFQESGCFAGDVRQTMREVK 254

Query: 902 RKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGT-DKV 960
              L+GC+IV   +    +A   L  LW + ++ GA C   IDD VTHVV  S      +
Sbjct: 255 GHALSGCKIV---ICAKSQAAHEL--LWDSCQELGAECVVDIDDTVTHVVVASKQQPQGL 309

Query: 961 NWALSTGRFVVHPGWVEASALLYRRANEQDF 991
             +   G+++V P W+  +     R +E  F
Sbjct: 310 ELSAQAGKYLVWPSWIHTAHYRCCRPDEAAF 340


>gi|255080370|ref|XP_002503765.1| predicted protein [Micromonas sp. RCC299]
 gi|226519032|gb|ACO65023.1| predicted protein [Micromonas sp. RCC299]
          Length = 574

 Score =  185 bits (470), Expect = 8e-44,   Method: Compositional matrix adjust.
 Identities = 145/448 (32%), Positives = 191/448 (42%), Gaps = 128/448 (28%)

Query: 665 ERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEV----------------------- 701
           E+ +R E+++ + S R L LVLDLDHTLLNSA+F E+                       
Sbjct: 100 EKAKREEKRRILLSGR-LVLVLDLDHTLLNSARFSELSQEEHYAMHRIIAAADCEANGGS 158

Query: 702 ---------------DPVHDEILRKKEE-----------------QDREKPH-------R 722
                          DPV  E   +KE+                 + R+ P        R
Sbjct: 159 KEEVQQAAAAIQPVEDPVAAESAEEKEDGADVDGEKGEEEAGGKERARDGPFPGTDPPLR 218

Query: 723 HLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 782
           HL    HM M+TKLRP    FL  AS+L  M++YTMG++ YA EMAK+LDP G LF GRV
Sbjct: 219 HLNCLRHMAMFTKLRPHAHAFLRAASQLCTMYIYTMGDRNYAREMAKLLDPTGELFNGRV 278

Query: 783 ISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPC 842
           I  GD    +         KDL+ VLG E  V+I DD+ RVWP N  NLI ++RY +F  
Sbjct: 279 IGSGDSTSQY--------KKDLDIVLGAEPTVLITDDTDRVWPKNLANLIRIDRYHFFKQ 330

Query: 843 SRRQFGLLGPSLLEIDHDERSEDG---TLASSLGVIERLHKIFF---------------- 883
           S   F   G S++E    +  ++G    L   L VI   H+ FF                
Sbjct: 331 SAAGFRQPGRSVMERQWRDEGDNGDRAQLRDVLAVIAAAHRRFFEGTAAANTADDAIAEM 390

Query: 884 ----------------------------SHQSLDDVDVRNILAAEQRKILAGCRIVFSRV 915
                                       +   L+  DVR +L   +   LA  R+VFSRV
Sbjct: 391 DASMLRSASATEGGDIRQTNHKPVSDEEASTKLESRDVRRLLTVPEDGPLANVRVVFSRV 450

Query: 916 FPVGEANPHLHPLWQTAEQFGAVCTKHIDD--QVTHVVANS-------LGTDKVNWALST 966
               E  P  HPLW  A   GA     +DD    THVVA++         TDKV WA  T
Sbjct: 451 VAQSEPRPERHPLWLLATALGAEVLNAVDDGKGATHVVAHAGTGDGDGRRTDKVKWAAKT 510

Query: 967 -GRFVVHPGWVEASALLYRRANEQDFAI 993
            G   V   W+   A  + R +E  +++
Sbjct: 511 PGASAVSADWLAKCADEWARVDESRYSL 538


>gi|326510557|dbj|BAJ87495.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 384

 Score =  184 bits (466), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 99/215 (46%), Positives = 132/215 (61%), Gaps = 9/215 (4%)

Query: 669 RLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP 728
           R  + K +   RKL L+LDLDHTL+NS K H++    +  L  +    ++ P+  LF   
Sbjct: 152 RGSDLKNLLRERKLILILDLDHTLINSTKLHDISAAENN-LGIQAAASKDDPNGSLFTLE 210

Query: 729 HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD 788
            M M TKLRP +  FL+ AS +FEM++YTMG+K YA E+AK+LDP+ V F  +VIS  D 
Sbjct: 211 GMQMLTKLRPFVRKFLKEASNMFEMYIYTMGDKAYAIEIAKLLDPRNVYFNSKVISNSD- 269

Query: 789 GDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFG 848
                  +R  K  D+  VLG ES  VI+DD+  VW  +K NLI++ERY YF  S RQFG
Sbjct: 270 -----CTQRHQKGLDM--VLGAESVAVILDDTEYVWQKHKENLILMERYHYFASSCRQFG 322

Query: 849 LLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFF 883
               SL E+  DER  DG LA+ L V++R+H IFF
Sbjct: 323 FSVKSLSELMQDERGSDGALATILDVLKRIHTIFF 357


>gi|302774062|ref|XP_002970448.1| hypothetical protein SELMODRAFT_411029 [Selaginella moellendorffii]
 gi|300161964|gb|EFJ28578.1| hypothetical protein SELMODRAFT_411029 [Selaginella moellendorffii]
          Length = 346

 Score =  183 bits (465), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 127/331 (38%), Positives = 173/331 (52%), Gaps = 23/331 (6%)

Query: 669 RLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRH--LFR 726
           R E  +K+   +KL LVLDLDHTLLNSA F +VD      L K  +   + P R   L +
Sbjct: 25  RKEYTQKVLQQQKLILVLDLDHTLLNSASFSKVDEEERLYLEKIYDWQEKAPKRRKLLHK 84

Query: 727 FPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG 786
              + +WTK+RP  + FLE ASK F++H+YT G ++YA  MAK+LDP G LF G + SR 
Sbjct: 85  VESLQVWTKIRPFAFKFLEEASKFFDLHIYTNGREIYAETMAKLLDPTGSLFKGHIFSR- 143

Query: 787 DDGDPFDGDERVPKS-KDLEGVLGMESAVVIIDDSVRVWP--HNKLNLI-VVERYTYFPC 842
                   D    K+ KDL+ V G ES  +I+DDS  VWP  H+K NLI V ++Y +F  
Sbjct: 144 --------DHNCMKAMKDLDTVPGDESITLIVDDSDYVWPKKHHK-NLIPVYDQYRFFRS 194

Query: 843 SRRQFGLL-GPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQ 901
           S   FGL    SL     DE +   TLA  L  ++R+H  FF        DVR  +   +
Sbjct: 195 STGLFGLRESSSLTSKKKDEVATKATLAKLLEGLKRIHSEFFQEYGCFAGDVRQTMREVK 254

Query: 902 RKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGT-DKV 960
              L+GC+IV   +    +A   L  LW + +  GA C   IDD VTHVV  S      +
Sbjct: 255 GHALSGCKIV---ICAKTQAAHEL--LWDSCQALGAECVVDIDDTVTHVVVASKQQPQGL 309

Query: 961 NWALSTGRFVVHPGWVEASALLYRRANEQDF 991
             +   G+++V P W+  +     R +E  F
Sbjct: 310 ELSAQAGKYLVWPSWIHTAHYRCCRPDEAAF 340


>gi|303276827|ref|XP_003057707.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226460364|gb|EEH57658.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 692

 Score =  182 bits (463), Expect = 6e-43,   Method: Compositional matrix adjust.
 Identities = 103/232 (44%), Positives = 140/232 (60%), Gaps = 15/232 (6%)

Query: 676 MFSARKLCLVLDLDHTLLNSAKFHEVDP--VHDEILR-KKEEQDREKPHRHLFRFPHMGM 732
           +   R+L LVLDLDHTLLNS  F   D   +   +L  ++ E  ++   R L R  H+G+
Sbjct: 299 LLDRRRLTLVLDLDHTLLNSESFESKDGGRLQRGLLEIERLESTKDSNDRTLHRLNHIGL 358

Query: 733 WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPF 792
           WTKLRPG+ TFL +AS +FE+H+ TMG++ YA  + ++LDP   +  G VI  G     F
Sbjct: 359 WTKLRPGVQTFLHKASAMFEIHISTMGSQPYADSIRRLLDPCRNVIKGSVIGLGG----F 414

Query: 793 D--GDERVPKSKDLEGVL-GMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGL 849
           D  G  + P  K LEGVL G E A VI+DD+  VW     NLIV ERY YFP + + FG+
Sbjct: 415 DEFGAFKSPPQKKLEGVLAGTEPAAVILDDTAEVWTGYSENLIVCERYMYFPSACKNFGV 474

Query: 850 LGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQ 901
           +GPSLLE   DE  + GTLA+ L V+ R+H  FF+ ++      R +LA +Q
Sbjct: 475 VGPSLLERGVDESEKSGTLATVLEVLTRVHSEFFARRA-----ARQVLAEKQ 521



 Score = 84.3 bits (207), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 46/109 (42%), Positives = 60/109 (55%), Gaps = 5/109 (4%)

Query: 891 VDVRNILAAEQRKILAGCRIVFSRVFPVGEAN--PHLHPLWQTAEQFGAVCTKHIDDQVT 948
           V V  +LA E++ +L G  IVFS VF   +    P  HPLW+ AE+ GA          T
Sbjct: 581 VTVPKLLADERKNVLRGVEIVFSGVFDHNDKTLTPREHPLWRLAERLGARVVSEPGTSTT 640

Query: 949 HVVANSLGTD---KVNWALSTGRFVVHPGWVEASALLYRRANEQDFAIK 994
           HVVA  +GTD   K  W L   + VV P WVEASA+++ R +E  F +K
Sbjct: 641 HVVAKCVGTDLTHKAMWGLQYLKHVVSPAWVEASAMMWTRQDEGRFPVK 689


>gi|242093894|ref|XP_002437437.1| hypothetical protein SORBIDRAFT_10g027050 [Sorghum bicolor]
 gi|241915660|gb|EER88804.1| hypothetical protein SORBIDRAFT_10g027050 [Sorghum bicolor]
          Length = 271

 Score =  167 bits (422), Expect = 3e-38,   Method: Composition-based stats.
 Identities = 112/317 (35%), Positives = 156/317 (49%), Gaps = 61/317 (19%)

Query: 680 RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPH---MGMWTKL 736
           RKL LVLDLDHTLLNS + H+ D    E          ++ H  LFR  +   + M TKL
Sbjct: 8   RKLILVLDLDHTLLNSTRLHQ-DLSALEQRNGFTPDTEDELHMELFRLEYSDNVRMLTKL 66

Query: 737 RPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDE 796
           RP +  FLE+AS               +T     +DP  V                    
Sbjct: 67  RPFVRGFLEQAS------------SRASTSSRAPIDPAAV-------------------- 94

Query: 797 RVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLE 856
                             VI+DD+   WP ++ NLI+++RY YF C+ R+F    PS+ E
Sbjct: 95  ------------------VILDDTDSAWPGHQDNLILMDRYHYFACTCRKFRYNIPSMAE 136

Query: 857 IDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQRKILAGCRIVFSRVF 916
              DER  DG+LA  LGV+ R+H+ FF     D  DVR ++A  +R++L  C +VFS + 
Sbjct: 137 QARDEREHDGSLAVVLGVLNRIHQAFFDD---DRADVREVIAEVRRQVLPVCTVVFSYL- 192

Query: 917 PVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWV 976
              E  P    +W  AE+ GA C K +D+ VTHVVA   GT K  WA   G+F+V+P W+
Sbjct: 193 ---EDFPEDTLMWTLAERLGAACQKDVDETVTHVVAEDPGTQKAQWAREHGKFLVNPEWI 249

Query: 977 EASALLYRRANEQDFAI 993
           +A    + R +E+DF +
Sbjct: 250 KAVNFRWCRVDERDFPV 266


>gi|145344421|ref|XP_001416731.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144576957|gb|ABO95024.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 248

 Score =  166 bits (419), Expect = 8e-38,   Method: Compositional matrix adjust.
 Identities = 102/248 (41%), Positives = 151/248 (60%), Gaps = 28/248 (11%)

Query: 665 ERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEV-----DPVHDEILRKKEE-QDRE 718
           E+ +R EE+ ++    KL L+LDLDHTLLNS +F E+     D +H+ I R+ E  ++ +
Sbjct: 14  EKAKR-EEKARVLQNGKLTLILDLDHTLLNSTQFKELTQEQHDLLHECIAREAEGLKEGQ 72

Query: 719 KPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLF 778
           +P  +  R  HMG +TKLRP ++ FLE  SK+ + ++YTMG+K YA EM K++DP+G +F
Sbjct: 73  RPMLYCLR--HMGFFTKLRPHVFEFLESVSKICQPYVYTMGDKPYAREMVKLIDPEGTIF 130

Query: 779 AGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYT 838
            GRVIS        + D      KDL+ VLG E++ +I+DD+ RVWP N+ NLI ++RY 
Sbjct: 131 HGRVIS--------NNDSTSSHVKDLDIVLGGEASAIIVDDTERVWPQNQGNLIRLDRYH 182

Query: 839 YFPCSRRQFGLLGPSLLEIDHDERSEDGTLASS------LGVIERLHKIFFSHQSLDDV- 891
           +FP S   F   G S++E    +  E G++ S       L VIE +H+ FF  ++ DD  
Sbjct: 183 FFPGSASSFQQKGQSVMESSMVDEGELGSVGSRAVLLDVLAVIESVHRSFF--KNTDDGE 240

Query: 892 --DVRNIL 897
             DVR +L
Sbjct: 241 EPDVRKLL 248


>gi|302838991|ref|XP_002951053.1| hypothetical protein VOLCADRAFT_91454 [Volvox carteri f.
           nagariensis]
 gi|300263748|gb|EFJ47947.1| hypothetical protein VOLCADRAFT_91454 [Volvox carteri f.
           nagariensis]
          Length = 699

 Score =  165 bits (417), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 116/321 (36%), Positives = 160/321 (49%), Gaps = 67/321 (20%)

Query: 687 DLDHTLLNSAKFHEVDPVHD----EILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWT 742
           DLDHTLLNS    EV P       E+LR++EE +   P R L R     +WTKLRPG++ 
Sbjct: 377 DLDHTLLNSVHTSEVGPDTATQLAEVLRREEEANL-GPRRLLHRLAENKLWTKLRPGVFE 435

Query: 743 FLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSK 802
           FLE     +EMH+YTMG+K YA E+ K+LDP G LF+  VI++               +K
Sbjct: 436 FLEGLRDDYEMHIYTMGDKTYAAEVRKLLDPTGKLFSS-VIAK--------DHSTTATAK 486

Query: 803 DLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDER 862
           DL+ +L  +   +++DD+  VWP ++ NL+                         D DE 
Sbjct: 487 DLDVLLSADELALVLDDTEAVWPGHRRNLLQ------------------------DSDES 522

Query: 863 SEDGTLASSLGVIERLHKIFFSHQS-----LDDVDVRNILAAEQRKIL----AGCRIVFS 913
           + DG LA+ + V+  +H  FFS        L+  DVR+IL+ ++R+IL     GC I FS
Sbjct: 523 ATDGALAAHMRVLRAVHTRFFSADDPSLPPLERRDVRDILSEQRREILQLMPQGCCITFS 582

Query: 914 RVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQV-THVVANSLGTDKVNWALSTGRFVVH 972
           R +P  + NP   PLWQ A   GA C    D  V THVVA + GT+K             
Sbjct: 583 RCWPQ-DRNPLREPLWQLAMSLGANCLTTYDPGVTTHVVAAAGGTEK------------- 628

Query: 973 PGWVEASALLYRRANEQDFAI 993
                  A  + R  E+DFA+
Sbjct: 629 -----GCAHRWTRLPEEDFAV 644


>gi|325179818|emb|CCA14221.1| conserved hypothetical protein [Albugo laibachii Nc14]
          Length = 694

 Score =  154 bits (388), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 119/349 (34%), Positives = 173/349 (49%), Gaps = 51/349 (14%)

Query: 674 KKMFSARKLCLVLDLDHTLLNS---AKFHEVDPV-HDEILRKKEEQDREKPHRHLFRFP- 728
           ++   A+KL LVLDLDHTLL++   A   E  P   DEI              H F+ P 
Sbjct: 147 ERQLIAKKLSLVLDLDHTLLHAVYVADLLEQRPTASDEI--------------HYFKIPG 192

Query: 729 --HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG 786
              M    KLRPG+  FL+   + +++ +YT G ++YA  +A+++DP   LF  R+++R 
Sbjct: 193 VMTMEYVVKLRPGLHQFLKSLREQYDLFIYTHGTRIYAEAIAEIIDPDDTLFRHRIVART 252

Query: 787 DDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQ 846
           D  D  D      KS  L      +S ++I+DD + VW  N+ N+++++ + +F C+   
Sbjct: 253 DTPD-IDH-----KSLKLLFPSCDDSMILILDDRLDVWKENEGNVLLIKPFHFFNCTAEI 306

Query: 847 FG----LLGPSLLEIDHDE-----RSEDGTLASSLGVIERLHKIFF----------SHQS 887
                  + PS    + D         D      L +++R+H+ F+             S
Sbjct: 307 NNAPGETISPSASSQNQDSDPVEPTKMDTDFEYILKILQRVHQAFYQCTKAGRTAEEQMS 366

Query: 888 LDDVDVRNILAAEQRKILAGCRIVFSRVFPVGE-ANPHLHPLWQTAEQFGAVCTKHIDD- 945
               DV+ ILA EQRKIL GC IVFS VFPV +   P  H LW+ A   GAV +  IDD 
Sbjct: 367 GRGNDVKQILAQEQRKILQGCFIVFSGVFPVSDPRGPKSHSLWRLAADMGAVPSLVIDDF 426

Query: 946 QVTHVVAN--SLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDF 991
            VTH+V +   LGT K   A  T    VV P W+  SA ++ RA E D+
Sbjct: 427 PVTHLVIHPMRLGTQKHVKARETPHIHVVTPDWLVRSARIWHRAPESDY 475


>gi|341882050|gb|EGT37985.1| hypothetical protein CAEBREN_32558 [Caenorhabditis brenneri]
          Length = 673

 Score =  153 bits (387), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 105/346 (30%), Positives = 175/346 (50%), Gaps = 53/346 (15%)

Query: 672 EQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP--H 729
           ++  + S RKL L++DLD T+++++              K   +D EK H+ + R+   H
Sbjct: 134 DETNLVSNRKLVLLVDLDQTIIHTSD-------------KPMSEDSEK-HKDITRYGLNH 179

Query: 730 MGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 789
               TKLRP    FL + + ++EMH+ T G + YA ++A++LDP+  LF  R++SR    
Sbjct: 180 RKYITKLRPHTTEFLNKMATMYEMHIVTYGQRQYAHKIAQILDPEARLFGQRILSR---D 236

Query: 790 DPFDGDERVPKSKDLEGVLGMESA--------VVIIDDSVRVWPHNKLNLIVVERYTYFP 841
           + F       K+++L+ ++  + A        VVIIDD   VW ++   LI ++ Y +F 
Sbjct: 237 ELFSAQH---KTRNLKVIILFQKALFPCGDNLVVIIDDRADVWMYSDA-LIQIKPYRFF- 291

Query: 842 CSRRQFGLLG---------PSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDD-- 890
              ++ G +          P  +E   D+  ED  L     V+  +H  ++    L    
Sbjct: 292 ---KEVGDINAPQNSKEQMPVQIE---DDAHEDKVLEEIERVLTNIHDKYYEKYDLKGSD 345

Query: 891 ---VDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQV 947
              +DV+ ++  E+RK+L GC IVFS + P GE       +++  +QFGA     + DQV
Sbjct: 346 QCLLDVKEVIKEERRKVLDGCVIVFSGIVPTGEKLERT-DIYRLCQQFGATILPEVTDQV 404

Query: 948 THVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAI 993
           TH+V    GT K++ AL  G+FVV   WV A    + +A+E+ F +
Sbjct: 405 THIVGARYGTQKIHQALRLGKFVVTVQWVYACVEKWMKADEKLFEL 450


>gi|125541462|gb|EAY87857.1| hypothetical protein OsI_09279 [Oryza sativa Indica Group]
          Length = 390

 Score =  152 bits (383), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 113/330 (34%), Positives = 164/330 (49%), Gaps = 40/330 (12%)

Query: 676 MFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTK 735
           +  ARKL LV+DLDHTL+NS   +++     E +    E      H              
Sbjct: 89  LLRARKLILVVDLDHTLVNSTADYDISGT--EYVNGLAELLVLGVHHQA---------QA 137

Query: 736 LRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGD 795
           +RP  W        + +  +YT+G++ YA  +AK+LDP+GV F  R+ISR  D  P    
Sbjct: 138 VRP--WLPARSERHVRDARVYTLGDRDYAAAVAKLLDPEGVYFGERIISR--DESP---- 189

Query: 796 ERVPKSKDLEGVLG-------MESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFG 848
              P  K L+ V G         +AVVI+DD+  VW  N  NLI +ERY YF  S R FG
Sbjct: 190 --QPDRKSLDVVFGSAPASAAERAAVVILDDTAEVWEGNSDNLIEMERYHYFASSCRDFG 247

Query: 849 LLGPSLLEIDHD--ERSEDGT-LASSLGVIERLHKIFFSHQSLDDV-DVRNILAAEQRKI 904
               S  E  H   ER  D +  A++L V+ R+H  FF+      V DVR ++   +R++
Sbjct: 248 ----SPWECTHSLSERGVDESERAAALRVLRRVHAGFFAGGGGSFVADVREVIRRTRREV 303

Query: 905 LAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWAL 964
           L GC + F+R      A+   H +W+  EQ GA C   +   VTHVVA +  T K  WA 
Sbjct: 304 LRGCTVAFTRAI----ASDDHHSVWRRTEQLGATCADDVGPAVTHVVATNPTTFKAVWAQ 359

Query: 965 STGRFVVHPGWVEASALLYRRANEQDFAIK 994
             G+F+V+P W+  +   + +  E+ F ++
Sbjct: 360 VFGKFLVNPEWINTAHFRWSKPKEEHFPVR 389


>gi|302769312|ref|XP_002968075.1| hypothetical protein SELMODRAFT_67516 [Selaginella moellendorffii]
 gi|300163719|gb|EFJ30329.1| hypothetical protein SELMODRAFT_67516 [Selaginella moellendorffii]
          Length = 141

 Score =  151 bits (381), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 76/149 (51%), Positives = 101/149 (67%), Gaps = 8/149 (5%)

Query: 735 KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDG 794
           KLRP    FLE ASKLFEM++YTMG ++YA  MA +LDP G  F GRVIS+ D       
Sbjct: 1   KLRPFAHKFLEEASKLFEMYVYTMGERMYAVTMAHLLDPTGKFFKGRVISQRDST----- 55

Query: 795 DERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSL 854
                ++KDL+ VLG +SAV+I+DD+  VWP ++ NLIV+ERY +F  S RQFGL  PSL
Sbjct: 56  ---CRQTKDLDIVLGADSAVLILDDTEAVWPKHRANLIVMERYHFFQSSCRQFGLENPSL 112

Query: 855 LEIDHDERSEDGTLASSLGVIERLHKIFF 883
            + + DE  ++G LA+ L V++R+H  FF
Sbjct: 113 TKAERDESKDEGALANVLKVLQRIHSDFF 141


>gi|255540901|ref|XP_002511515.1| RNA polymerase II ctd phosphatase, putative [Ricinus communis]
 gi|223550630|gb|EEF52117.1| RNA polymerase II ctd phosphatase, putative [Ricinus communis]
          Length = 405

 Score =  151 bits (381), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 109/331 (32%), Positives = 163/331 (49%), Gaps = 33/331 (9%)

Query: 669 RLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP 728
           R  E   + S +KL LVLDLD TLL+S     VD   +E   K +    +   + + R  
Sbjct: 86  RDAETDFVLSKKKLFLVLDLDQTLLHST----VDLTPEENYLKNQMDSLQDIFKLITREG 141

Query: 729 HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD 788
               + KLRP +  FL+ AS +F+M++YT  NK YA +M  +LDP  + F  R+I+R   
Sbjct: 142 FSPSYAKLRPFVRNFLQEASTMFKMYVYTNANKSYARKMVNLLDPDNIYFKSRLITR--- 198

Query: 789 GDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFG 848
                 D  V   K+L+ V+G E AVVI+DD   VWP +K NLI V+RY YF  +     
Sbjct: 199 -----EDSTVSCQKNLDVVMGQERAVVILDDRTDVWPMHKDNLIQVQRYKYFASTANWSN 253

Query: 849 LLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFF----SHQSLDDVDVRNILAAEQRKI 904
               S  + + DE ++   +A+ L +++++H  FF       +L   DVR+++   Q  I
Sbjct: 254 --SKSFAQREVDESTD--IMATYLEILKKIHSQFFDPKLDEDNLASRDVRDVMRTVQAGI 309

Query: 905 LAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANS---------L 955
           L GC+++  +      A   L  L + AE+ GA+C   +D  VTHVV            L
Sbjct: 310 LQGCKLILRKNLT---AKYKLDNLSKMAEKLGAICVSELDPTVTHVVTLEAKPEDDQLQL 366

Query: 956 GTDKVNWALSTGRF-VVHPGWVEASALLYRR 985
              K       G + +V P W+  S  L+ R
Sbjct: 367 ENKKKKKNKQKGTYHLVFPEWIRDSYKLWHR 397


>gi|268566337|ref|XP_002639695.1| C. briggsae CBR-FCP-1 protein [Caenorhabditis briggsae]
          Length = 723

 Score =  147 bits (370), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 104/348 (29%), Positives = 174/348 (50%), Gaps = 49/348 (14%)

Query: 672 EQKKMFSARKLCLVLDLDHTLLNSA-KFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 730
           ++  + + RKL L++DLD T+++++ K   VD         ++ ++R KP  +   F H 
Sbjct: 133 DETNLITTRKLVLLVDLDQTIIHTSDKPMSVDA--------EKRRNRVKPQDNNLNFQHK 184

Query: 731 GMW----------TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAG 780
            +           TKLRP    FL + S ++EMH+ T G + YA  +A++LDP   LF  
Sbjct: 185 DITKYNLHSRVYTTKLRPHTTEFLNKMSAMYEMHIVTYGQRQYAHRIAQILDPDARLFGQ 244

Query: 781 RVISRGDDGDPFDGDERVPKSKDLEGVL-GMESAVVIIDDSVRVWPHNKLNLIVVERYTY 839
           R++SR    + F       K+++L+ +    ++ VVIIDD   VW +++  LI ++ Y +
Sbjct: 245 RILSR---DELFSAQH---KTRNLKALFPCGDNLVVIIDDRADVWQYSE-ALIQIKPYRF 297

Query: 840 FPCSRRQFGLLG---------PSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDD 890
           F    ++ G +          P  +E   D+  ED  L     V+  +H  ++    L D
Sbjct: 298 F----KEVGDINAPKNSKEQMPVQIE---DDAHEDRVLEEIERVLTNIHDKYYEKHDLRD 350

Query: 891 -----VDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDD 945
                +DV+ ++  E+ K+L GC IVFS + P GE       +++   QFGA     + D
Sbjct: 351 AEQPLLDVKEVIKDERHKVLDGCVIVFSGIVPTGEKLERT-DIYRLCMQFGATIVPEVTD 409

Query: 946 QVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAI 993
           +VTHVV    GT K++ A   G+FVV   WV A    + +A+E+ F +
Sbjct: 410 EVTHVVGARYGTQKIHQAHRLGKFVVTVQWVYACVEKWMKADEKQFEL 457


>gi|291001899|ref|XP_002683516.1| TFIIF CTD phosphatase Fcp1 [Naegleria gruberi]
 gi|284097145|gb|EFC50772.1| TFIIF CTD phosphatase Fcp1 [Naegleria gruberi]
          Length = 592

 Score =  146 bits (368), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 113/366 (30%), Positives = 178/366 (48%), Gaps = 49/366 (13%)

Query: 661 AIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDRE-K 719
           A +K   R    Q+++   +KL LVLDLDHTLL++    E    H ++    +  +   +
Sbjct: 170 AYEKGLERGKANQQRLIEKKKLSLVLDLDHTLLHTINDFEYRREHHKVTYFNDIYNNSPE 229

Query: 720 PHRHLFRFPHMGMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 777
             +H+ +F   G +   K RP + +FL+R S++FE+H++T G + YA ++ K+LD    L
Sbjct: 230 LQKHIHKFFMRGSYHFVKFRPRLESFLKRCSEIFELHVFTHGERAYADQIGKMLDSSKSL 289

Query: 778 FAGRVISRGDDGDPFDGDERVPKSKDLEGVL-GMESAVVIIDDSVRVWPHNKLNLIVVER 836
           FA R++SR +  D          +K L  V    + +V++IDD   VW  N  N+I +  
Sbjct: 290 FADRILSRDECPD--------INTKTLSQVFPYSDKSVLVIDDKTDVWKDNVDNVIQIAP 341

Query: 837 YTYFPCSRRQFGLLGPSLLEI------DHDERSE------------DGTLASS------- 871
           Y YF   RR FG+     L++      D D +              + T+          
Sbjct: 342 YDYF---RRIFGV----QLDVNNAPGNDADRKKNNSQQTPRTFKVIENTIDDYNDDDQLD 394

Query: 872 --LGVIERLHKIFF-SHQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPL 928
               ++E++H  +F S Q L + DV+ IL  +++ IL G  IVFS V P+ +  P  H  
Sbjct: 395 IIYNLLEKVHLDYFNSPQDLSERDVKAILKEKKKDILKGAHIVFSGVIPLKQ-QPETHID 453

Query: 929 WQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALST-GRFVVHPGWVEASALLYRRAN 987
           W+ A   GA C   I   +TH+VA   GT+KV  A +  G  VV   W+ A     +R N
Sbjct: 454 WKIATDLGAKCYTDITPNMTHLVARQKGTEKVKKAENMPGVHVVDLSWLHACLKYLKRQN 513

Query: 988 EQDFAI 993
           E D+ +
Sbjct: 514 EFDYFL 519


>gi|159476674|ref|XP_001696436.1| cleavage and polyadenylation factor 6-related protein [Chlamydomonas
            reinhardtii]
 gi|158282661|gb|EDP08413.1| cleavage and polyadenylation factor 6-related protein [Chlamydomonas
            reinhardtii]
          Length = 2174

 Score =  143 bits (361), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 113/365 (30%), Positives = 162/365 (44%), Gaps = 65/365 (17%)

Query: 681  KLCLVLDLDHTLLNSAKFHEVD-PVHDEILRKKEEQDREKP--HRHLFRFPHMG--MWTK 735
            +L LV+DLD  L +S    ++D P    ++R+   +    P   R LFR P  G  +W K
Sbjct: 809  RLVLVVDLDGVLADSCWDAQLDGPTAAALVRRAAVEAAALPEDRRELFRLPLEGGALWLK 868

Query: 736  LRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGD 795
            LRPG   FL RA++ +E+   T   + YA  + ++LDP   LF  RV++ G         
Sbjct: 869  LRPGARAFLARAAERYELWARTRQGRPYADAVVELLDPHQQLFGSRVVAAG--------- 919

Query: 796  ERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLN--LIVVERYTYF---PCS----RRQ 846
              V   + L  +        ++D     W    L+  L+ +  Y YF   PC+       
Sbjct: 920  --VLAKRLLAALECRAPIAAVLDTPDAAWMGESLSGSLLALPPYAYFAVRPCAPGGAVAA 977

Query: 847  FGLLGPSLLEIDHDERSEDGTLASSLGVIERLHK-------------------------- 880
             G+    +LE+D DE +E G LA  L ++E LH                           
Sbjct: 978  SGMASRCMLEVDRDEDAERGALAVGLPLLEALHGRVLHAYASGGAGASANGGTGSGSAAA 1037

Query: 881  ----IFFSHQ----------SLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLH 926
                +  S Q          +L+  DVR +L   + +IL G  I FSRV+  G A    H
Sbjct: 1038 PPSPVHGSQQPPPAAQSAGPALEAWDVRRVLRELRERILTGTHITFSRVYSGGTAAGPQH 1097

Query: 927  PLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRA 986
            PLW+ AE  GA  +    D  THVV+ S  T K  WA   GRFVV+P W+E S   +R+A
Sbjct: 1098 PLWRLAEACGATVSAACTDSTTHVVSLSGATAKALWAQQHGRFVVYPSWLECSCYTWRKA 1157

Query: 987  NEQDF 991
            +E  F
Sbjct: 1158 DESLF 1162


>gi|226498676|ref|NP_001145873.1| hypothetical protein [Zea mays]
 gi|219884795|gb|ACL52772.1| unknown [Zea mays]
 gi|413939308|gb|AFW73859.1| hypothetical protein ZEAMMB73_968817 [Zea mays]
          Length = 425

 Score =  143 bits (360), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 88/244 (36%), Positives = 134/244 (54%), Gaps = 30/244 (12%)

Query: 672 EQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRH------LF 725
           ++  +   RKL L+LDLDHTLLNS   +++ PV        E+     P+        LF
Sbjct: 202 DRATLMRERKLILILDLDHTLLNSTSLYDLSPV--------EQAKGFTPYTFGDTSIDLF 253

Query: 726 R--FPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVI 783
           R    ++ M  KL      FL++A+ LFEMH+YT+G + YA    ++LDP G+ F GR++
Sbjct: 254 RVDIDNLSMLVKLGAFARGFLKQANALFEMHVYTLGIRAYARAAVRLLDPNGIYFGGRIV 313

Query: 784 SRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERYTYFPC 842
           SR +             +K L+ + G + A VVI+DD+  VWP    NLI+++RY YF  
Sbjct: 314 SRNE--------STKENTKSLDVIQGADPAMVVILDDTDGVWPGYPDNLILMDRYRYFAS 365

Query: 843 SRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFF-SHQSLDDVDVRNILAAEQ 901
           + R F    PSL E   +ER  DG+LA  LG ++R+H+ FF  H++    DVR ++A  +
Sbjct: 366 TCRTFDYDIPSLAEQGLEEREHDGSLAVVLGALQRIHQGFFDGHRA----DVREVIAKVR 421

Query: 902 RKIL 905
            ++ 
Sbjct: 422 SQVC 425


>gi|302834483|ref|XP_002948804.1| hypothetical protein VOLCADRAFT_89056 [Volvox carteri f. nagariensis]
 gi|300265995|gb|EFJ50184.1| hypothetical protein VOLCADRAFT_89056 [Volvox carteri f. nagariensis]
          Length = 2442

 Score =  142 bits (357), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 117/410 (28%), Positives = 168/410 (40%), Gaps = 103/410 (25%)

Query: 676  MFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKP--HRHLFRFP----- 728
            + S  KL LV+DLD  L +S    ++DP     L +    +   P   R LFR P     
Sbjct: 2044 LLSRGKLVLVVDLDGVLADSCWDDQLDPAAAAALSRHAAAEAGLPEDRRELFRLPLDAGA 2103

Query: 729  -------HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGR 781
                     G+W KLRPG   FL RA + FE+  ++   + YA  + ++LDP   LF  R
Sbjct: 2104 TAGAASGGSGLWLKLRPGARAFLARAHERFELWAHSRQGRPYADAVVELLDPSLALFGSR 2163

Query: 782  VISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVW--PHNKLNLIVVERYTY 839
            V+++G+         R+  + D    +       I+D     W        L+ +  Y+Y
Sbjct: 2164 VVAQGELA------RRLLTALDARAPI-----TAILDTPSAAWMGEQLAPGLLPLPPYSY 2212

Query: 840  F----PCSRRQF--------------------GLLGPSLLEIDHDERSEDGTLASSLGVI 875
            F     C+                        G+ G SLLE++ DE  E G LA++L ++
Sbjct: 2213 FSYRPACTADAAAGVGGGGAAPGARTLAPSASGMAGRSLLEVNRDECPERGVLAAALPLL 2272

Query: 876  ERLHKIFFSH-------------------------------------------QSLDDVD 892
              L+    S                                              L+  D
Sbjct: 2273 YVLYTRVMSTYGAAATTTASSSAGSTAAAAAAAAIAKPPTDGPPPPPPPAPPLAGLEPWD 2332

Query: 893  VRNILAAEQRKILAGCRIVFSRVFPVGEANPHL---------HPLWQTAEQFGAVCTKHI 943
            VR +L  +++++LAG  I FSRVF  G               HPLW+ AE FGA      
Sbjct: 2333 VRRVLREQRQRVLAGFHITFSRVFTSGGGGGDGGGGGAAGPQHPLWRLAEAFGATVGASC 2392

Query: 944  DDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAI 993
             D  THVVA S GT+K  WA   GRFVV+P W+EAS  L+R+ +E  F +
Sbjct: 2393 SDATTHVVALSAGTEKAIWAAQHGRFVVYPSWLEASCFLWRKVDESLFLL 2442


>gi|15224433|ref|NP_178570.1| haloacid dehalogenase-like hydrolase domain-containing protein
           [Arabidopsis thaliana]
 gi|4585924|gb|AAD25584.1| hypothetical protein [Arabidopsis thaliana]
 gi|330250795|gb|AEC05889.1| haloacid dehalogenase-like hydrolase domain-containing protein
           [Arabidopsis thaliana]
          Length = 277

 Score =  139 bits (350), Expect = 8e-30,   Method: Composition-based stats.
 Identities = 88/269 (32%), Positives = 144/269 (53%), Gaps = 28/269 (10%)

Query: 641 QSAWGDVEHLFEGYDDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHE 700
           +S +   +++F+G     +A      T+ L  +    + +KL LVLDLDHTLL+S     
Sbjct: 29  KSQFRKFDYIFKGLQLSNEAV---ALTKSLTTKHSCLNEKKLHLVLDLDHTLLHSKLVSN 85

Query: 701 VDPVHDEILRKKEEQDREKPHRHLFRFPHMG----MWTKLRPGIWTFLERASKLFEMHLY 756
           +      ++++   + RE     L++F  +G       KLRP +  FL+ A+++F M +Y
Sbjct: 86  LSQAERYLIQEASSRTRED----LWKFRPIGHPIDRLIKLRPFVRDFLKEANEMFTMFVY 141

Query: 757 TMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVI 816
           TMG+++YA  + +++DPK + F  RVI++ +           P+ K L  VL  E  VVI
Sbjct: 142 TMGSRIYAKAILEMIDPKKLYFGNRVITKDES----------PRMKTLNLVLAEERGVVI 191

Query: 817 IDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIE 876
           +DD+  +WPH+K NLI + +Y YF    R+ GL   S  E   DE   DG LA+ L ++ 
Sbjct: 192 VDDTRDIWPHHKNNLIQIRKYKYF----RRSGLDSNSYSEKKTDEGENDGGLANVLKLLR 247

Query: 877 RLHKIFF---SHQSLDDVDVRNILAAEQR 902
            +H+ FF     + L+ +DVR++L    R
Sbjct: 248 EVHRRFFIVEVEEVLESMDVRSLLKEGYR 276


>gi|159483481|ref|XP_001699789.1| hypothetical protein CHLREDRAFT_141879 [Chlamydomonas reinhardtii]
 gi|158281731|gb|EDP07485.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 375

 Score =  138 bits (347), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 98/285 (34%), Positives = 140/285 (49%), Gaps = 61/285 (21%)

Query: 687 DLDHTLLNSAKFHEVD----PVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWT 742
           DLDHTLLNS   +EV     P   E+ R+++E +   P R L       +WTKLRPG++ 
Sbjct: 133 DLDHTLLNSVHMNEVGEDVAPRLAELQRREQEANL-GPRRLLHCLADKKLWTKLRPGVFE 191

Query: 743 FLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSK 802
           FLE     +EMH+YTMG+K YA E+ ++LDP G LF+  VI++               +K
Sbjct: 192 FLEGLRDAYEMHIYTMGDKTYAAEVRRLLDPTGRLFSS-VIAK--------DHSTTATAK 242

Query: 803 DLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDER 862
            L+ +L  +   +++DD+  VWP ++ NL+ V                            
Sbjct: 243 HLDVLLSADELALVLDDTEVVWPGHRRNLLQV---------------------------- 274

Query: 863 SEDGTLASSLGVIERLHKIFFSHQS-----LDDVDVRNILAAEQRKILAGCRIVFSRVFP 917
                  S++G+       FFS +      LD+ DVR++LA ++ +IL G  I FSR + 
Sbjct: 275 ------MSAVGL------QFFSQEDSRLPPLDERDVRDVLAEQKARILTGVHITFSRCW- 321

Query: 918 VGEANPHLHPLWQTAEQFGAVCTKHIDDQV-THVVANSLGTDKVN 961
             + +P   PLWQ AE  GA C    D  V THVVA   GT KV 
Sbjct: 322 AQDKDPRKEPLWQLAEGLGATCLPAYDPAVTTHVVAAGGGTAKVR 366


>gi|328872613|gb|EGG20980.1| putative tfiif-interacting component of the c-terminal domain
           phosphatase [Dictyostelium fasciculatum]
          Length = 757

 Score =  136 bits (343), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 114/391 (29%), Positives = 182/391 (46%), Gaps = 81/391 (20%)

Query: 668 RRLEEQ--KKMFSARKLCLVLDLDHTLLNS---AKFHEVDPVHDEILRKKEEQDREKPHR 722
           +++EE+  K++   +KL LVLDLDHT++++     F EV P    I       DR+K + 
Sbjct: 191 QQIEEKNAKRLLDNKKLSLVLDLDHTIIHAIMEQHFMEV-PYWRTI-------DRKKSNI 242

Query: 723 H-LFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGR 781
           H +    +   + KLRP ++ FL   ++LFE+H+YTMG + YA ++A ++DPK  +F  R
Sbjct: 243 HEIILNGNQRYFIKLRPHLYEFLREVNRLFELHIYTMGTRNYAQKIASLVDPKQRVFKER 302

Query: 782 VISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFP 841
           V+SR  D  P D + +  K          +S V+I+DD   VW  +K NLI +  Y YF 
Sbjct: 303 VLSR--DDTPNDMNHKTLK----RLFPCDDSMVLIVDDRSDVWKKSK-NLIQIVPYLYFV 355

Query: 842 CSRRQFGLL-------GPSLL------------------------------EIDHDERSE 864
             +    LL          L+                              ++D  E  +
Sbjct: 356 GCKDMVNLLPTDKQSPHQQLMNDEPAAATAAAVNTPTPPSPGSPNVSSPKQKVDQQEEDD 415

Query: 865 DGTLASS-------------LGVIERLHKIFFSHQSLDDV-DVRNILAAEQRKILAGCRI 910
           + ++ +              L  +  +H  F+   S +    V NI+   ++KIL    I
Sbjct: 416 ENSIYNENDESEGDHHLRVILSKLTEIHTEFYKQVSNNQKPHVTNIVDDIKKKILKDVYI 475

Query: 911 VFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVA-------NSLGTDKVNWA 963
           V S ++P+  + P   PL   AE+ GA     I  + THV+A          GT KVN A
Sbjct: 476 VLSGIYPINSSQP--QPLRILAEEMGATVQNEITPKTTHVMAARVRRKRKRKGTSKVNQA 533

Query: 964 LSTGRFVVHPGWVEASALLYRRANEQDFAIK 994
           +S G  VV+  W+E++A+ + + +E DF IK
Sbjct: 534 ISMGLHVVNSSWLESTAMHWTKLDENDFPIK 564


>gi|15239576|ref|NP_200232.1| haloacid dehalogenase-like hydrolase domain-containing protein
           [Arabidopsis thaliana]
 gi|9759494|dbj|BAB10744.1| unnamed protein product [Arabidopsis thaliana]
 gi|332009084|gb|AED96467.1| haloacid dehalogenase-like hydrolase domain-containing protein
           [Arabidopsis thaliana]
          Length = 306

 Score =  135 bits (341), Expect = 9e-29,   Method: Composition-based stats.
 Identities = 85/239 (35%), Positives = 127/239 (53%), Gaps = 13/239 (5%)

Query: 655 DDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEE 714
           D  Q + I    T+R+  Q   F+ +KL LVLDLDHTLL++     +      ++   EE
Sbjct: 62  DGLQLSDIAVTVTKRVTTQITCFNDKKLHLVLDLDHTLLHTVMISNLTKEETYLI---EE 118

Query: 715 QDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 774
           +D  +  R L          KLRP +  FL+ A+K+F M++YTMG++ YA  +  ++DP+
Sbjct: 119 EDSREDLRRLNGGYSSEFLIKLRPFVHEFLKEANKMFSMYVYTMGDRDYAMNVLNLIDPE 178

Query: 775 GVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVV 834
            V F  RVI+R +           P  K L+ VL  E  VVI+DD+  VWP +K NL+ +
Sbjct: 179 KVYFGDRVITRNES----------PYIKTLDLVLADECGVVIVDDTPHVWPDHKRNLLEI 228

Query: 835 ERYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDVDV 893
            +Y YF    R       S  E   DE   DG+LA+ L VI+++++ FFS     D+D+
Sbjct: 229 TKYNYFSDKTRHDVKYTKSYAEEKRDESRNDGSLANVLKVIKQVYEGFFSGGVEKDLDI 287


>gi|115533721|ref|NP_492423.2| Protein FCP-1 [Caenorhabditis elegans]
 gi|82658167|emb|CAC70088.2| Protein FCP-1 [Caenorhabditis elegans]
          Length = 659

 Score =  135 bits (339), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 102/340 (30%), Positives = 167/340 (49%), Gaps = 48/340 (14%)

Query: 672 EQKKMFSARKLCLVLDLDHTLLNSA-KFHEVDPVHDEILRKKEEQDREKPHRHLFRFP-H 729
           ++  + + RKL L++DLD T+++++ K   VD                + H+ + ++  H
Sbjct: 134 DENNLITNRKLVLLVDLDQTIIHTSDKPMTVDT---------------ENHKDITKYNLH 178

Query: 730 MGMWT-KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD 788
             ++T KLRP    FL + S ++EMH+ T G + YA  +A++LDP   LF  R++SR   
Sbjct: 179 SRVYTTKLRPHTTEFLNKMSNMYEMHIVTYGQRQYAHRIAQILDPDARLFEQRILSR--- 235

Query: 789 GDPFDGDERVPKSKDLEGVL-GMESAVVIIDDSVRVWPHNKLNLIVVERYTYF------- 840
            + F       K+ +L+ +    ++ VVIIDD   VW +++  LI ++ Y +F       
Sbjct: 236 DELFSAQH---KTNNLKALFPCGDNLVVIIDDRSDVWMYSEA-LIQIKPYRFFKEVGDIN 291

Query: 841 --PCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDD-----VDV 893
               S+ Q     P  +E   D+  ED  L     V+  +H  ++    L       +DV
Sbjct: 292 APKNSKEQM----PVQIE---DDAHEDKVLEEIERVLTNIHDKYYEKHDLRGSEEVLLDV 344

Query: 894 RNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVAN 953
           + ++  E+ K+L GC IVFS + P+GE       +++   QFGAV    + D VTHVV  
Sbjct: 345 KEVIKEERHKVLDGCVIVFSGIVPMGEKLERT-DIYRLCTQFGAVIVPDVTDDVTHVVGA 403

Query: 954 SLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAI 993
             GT KV  A    +FVV   WV A    + +A+E  F +
Sbjct: 404 RYGTQKVYQANRLNKFVVTVQWVYACVEKWLKADENLFQL 443


>gi|297850432|ref|XP_002893097.1| hypothetical protein ARALYDRAFT_472260 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297338939|gb|EFH69356.1| hypothetical protein ARALYDRAFT_472260 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 281

 Score =  134 bits (337), Expect = 3e-28,   Method: Composition-based stats.
 Identities = 88/246 (35%), Positives = 129/246 (52%), Gaps = 34/246 (13%)

Query: 667 TRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQ------DREKP 720
           T+ L  Q    + RKL +VLDLDHTLL+S     +      +LR+ + +      DRE  
Sbjct: 62  TKSLTTQLACLNERKLHVVLDLDHTLLHSVMVSRLSEGEKYLLRESDLREDLWTLDRE-- 119

Query: 721 HRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAG 780
                      M  KLRP +  FL  A++ F M++YTMGN+ YA  + K++DPK V F  
Sbjct: 120 -----------MLIKLRPFVHEFLNEANEFFSMYVYTMGNRDYAQAVLKLIDPKKVYFGD 168

Query: 781 RVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           RVI+R + G           SK L+ VL  E  VVI+DD+  VWP ++ NL+ + +Y+YF
Sbjct: 169 RVITRDESG----------FSKTLDLVLADECGVVIVDDTRHVWPDHERNLLQITKYSYF 218

Query: 841 PCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSH--QSLDDVDVRNILA 898
               ++      S  E   DE    G+LA+ L V++++H+ FFS   + LD  DVR +L 
Sbjct: 219 RDYNQED---SKSYAEEKRDESRSQGSLANVLKVLKKIHQEFFSGGIEKLDSKDVRLLLQ 275

Query: 899 AEQRKI 904
            +   I
Sbjct: 276 EQHIAI 281


>gi|452820283|gb|EME27327.1| phosphoprotein phosphatase [Galdieria sulphuraria]
          Length = 734

 Score =  134 bits (336), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 140/498 (28%), Positives = 215/498 (43%), Gaps = 116/498 (23%)

Query: 595 MVSQNSPIQPGQIKSGADMKAVVTNHDDKQTGTGS--GPEAGPVGAHP----------QS 642
           +V+QN  IQP Q+    D    V +H  +  G  +  G E     A P          ++
Sbjct: 132 VVNQNHVIQPQQMVLSVD----VCDHKVQFNGNCALCGIEMDIYSASPILESPREFSTRT 187

Query: 643 AW--GDVEHLFEGYDDQQKAA-------IQKERTRRLEEQKKMFSARKLCLVLDLDHTLL 693
            W   +  HL   Y   Q          ++ E TRRL  +KK      L LVLDLD+TL+
Sbjct: 188 DWTLSNKHHLNPAYTHPQLRVSRNELELVEGENTRRLLRRKK------LSLVLDLDNTLI 241

Query: 694 NSAKF----HEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGM-----WTKLRPGIWTFL 744
           ++        E      EI ++  E+  E     +     + +       KLRP +  FL
Sbjct: 242 HATLVSHFPQEWYQYKQEIYQQATEKALECSAPLMEDIHELDLDGSISLVKLRPNVRRFL 301

Query: 745 ERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDL 804
           E+  + +E+H+YTMG++ YA  +A +LDP G LF  R++SR       D  E +   K L
Sbjct: 302 EKIHQRYELHIYTMGSRSYADAIATLLDPSGNLFQRRIVSRD------DFVEGMMNRKSL 355

Query: 805 EGVLGM-ESAVVIIDDSVRVW-PHNKL----NLIVVERYTYF------------------ 840
             +    +S V+I+DD   VW  HN+     NLI  + Y +F                  
Sbjct: 356 RRIFPCDDSMVIIVDDREDVWMDHNQGEMVPNLIRAKPYLFFVQDVHENMNNHLVWDSTT 415

Query: 841 ----PCS---RRQFGLLGPSLLE-IDHDERSEDGTL--------------ASSLGVIERL 878
               P S   +  F  +   +L  ++  E  E G                 + LG +E+L
Sbjct: 416 TSIHPSSESHKESFANISTCMLTCLNWKENLESGCYFPYLPWVQKTVESDENYLGRLEQL 475

Query: 879 ----HKIFF---------------SHQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVG 919
               H+ FF               +H  L   DV+ ILA  + ++L  C + F+ +F + 
Sbjct: 476 LIQIHESFFKEMEQCHNVSQTDDSNHVKLPLRDVKLILAEMRHRVLRNCYLSFTGIFRLE 535

Query: 920 EANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANS---LGTDKVNWALSTGR-FVVHPGW 975
           E+ P +  +W+ AE+FGA+C K +  Q TH++ +S   L T K  +AL  G  F+V   W
Sbjct: 536 ES-PEVSTVWRLAEEFGAICNKQVTSQTTHLIVDSQRGLHTGKTKYALRRGDIFLVTLEW 594

Query: 976 VEASALLYRRANEQDFAI 993
           +E S   Y RA+E  FA+
Sbjct: 595 LETSMQYYIRASELQFAL 612


>gi|324504080|gb|ADY41763.1| RNA polymerase II subunit A C-terminal domain phosphatase [Ascaris
           suum]
          Length = 490

 Score =  132 bits (332), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 96/343 (27%), Positives = 171/343 (49%), Gaps = 49/343 (14%)

Query: 672 EQKKMFSARKLCLVLDLDHTLLNSAKF-----HEVDPVHDEILRKKEEQDREKPHRHLFR 726
           +Q+ +  +R+L L++DLD TL+++          VD VH                   ++
Sbjct: 50  DQQLVLESRRLVLLVDLDQTLIHTTNHAFDMKDSVDVVH-------------------YK 90

Query: 727 FPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG 786
                 +TK+RP   TFL R S+L+EMH+ + G + YA ++A++LDP    F  R++SR 
Sbjct: 91  LRGADFYTKIRPYTHTFLRRMSELYEMHIISYGERQYAHKIAEILDPDKRYFGHRILSR- 149

Query: 787 DDGDPFDGDERVPKSKDLEGVL-GMESAVVIIDDSVRVWPHNKLNLIVVERYTYFP---- 841
              + F     + K+ +++ +    +  + IIDD   VW ++   LI V+ Y +F     
Sbjct: 150 --DELFSA---MYKTGNMKALFPCGDQLIAIIDDRPDVWQYSDA-LIQVKPYRFFKETGD 203

Query: 842 ------CSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSH--QSLDD--V 891
                 C+ +Q  L+   + +++  E   D TL     V+ R+H  F++H  +++ D   
Sbjct: 204 INAPTICNAQQQSLVQERIAQVNV-EGDGDETLEFVATVLTRVHTTFYAHYDKNMSDRVQ 262

Query: 892 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVV 951
           DV+ ++A  + ++L GC IV S + P+G  +      +    QFGA  T+ + D  THV+
Sbjct: 263 DVKLVIAYMRSQVLRGCSIVLSGIVPIG-MDVRNTEAFHLCIQFGATVTESVTDTTTHVI 321

Query: 952 ANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAI 993
           A   GT KV+ A       +V+P W+ A    + +A+E++F +
Sbjct: 322 AARWGTTKVHDARRRANIAIVNPRWLYACVERWEKADEKEFEL 364


>gi|297792855|ref|XP_002864312.1| NLI interacting factor family protein [Arabidopsis lyrata subsp.
           lyrata]
 gi|297310147|gb|EFH40571.1| NLI interacting factor family protein [Arabidopsis lyrata subsp.
           lyrata]
          Length = 305

 Score =  132 bits (332), Expect = 1e-27,   Method: Composition-based stats.
 Identities = 86/244 (35%), Positives = 130/244 (53%), Gaps = 24/244 (9%)

Query: 667 TRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFR 726
           T+R+  Q   F+ +KL LVLDLDHTLL++     +    +E     E   RE     L++
Sbjct: 76  TKRVTTQITCFNDKKLHLVLDLDHTLLHTVMVSNLS--KEETYLIGEADSRED----LWK 129

Query: 727 FP---HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVI 783
           F          KLRP +  FL+ A+++F M++YTMG++ YA  + K++DP+ + F  RVI
Sbjct: 130 FNGGYSSEFLIKLRPYVHEFLKEANEMFSMYVYTMGDRDYANNVLKLIDPEKIYFGHRVI 189

Query: 784 SRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCS 843
           +R +           P  K L+ VL  E  VVI+DD+ +VWP +K NL+ + +Y YF   
Sbjct: 190 TRNES----------PYIKTLDLVLADECGVVIVDDTPQVWPDDKRNLLEITKYNYFSDK 239

Query: 844 RRQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFS-----HQSLDDVDVRNILA 898
            R+      S  E   DE   DG+LA+ L VI+ +++ FFS        +D  DVR +L 
Sbjct: 240 TRRDVKYSKSYAEEKRDEGRNDGSLANVLKVIKEIYEGFFSGGVEKELDIDSKDVRLLLH 299

Query: 899 AEQR 902
             +R
Sbjct: 300 NARR 303


>gi|324508774|gb|ADY43701.1| RNA polymerase II subunit A C-terminal domain phosphatase [Ascaris
           suum]
          Length = 576

 Score =  132 bits (331), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 96/343 (27%), Positives = 171/343 (49%), Gaps = 49/343 (14%)

Query: 672 EQKKMFSARKLCLVLDLDHTLLNSAKF-----HEVDPVHDEILRKKEEQDREKPHRHLFR 726
           +Q+ +  +R+L L++DLD TL+++          VD VH                   ++
Sbjct: 136 DQQLVLESRRLVLLVDLDQTLIHTTNHAFDMKDSVDVVH-------------------YK 176

Query: 727 FPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG 786
                 +TK+RP   TFL R S+L+EMH+ + G + YA ++A++LDP    F  R++SR 
Sbjct: 177 LRGADFYTKIRPYTHTFLRRMSELYEMHIISYGERQYAHKIAEILDPDKRYFGHRILSR- 235

Query: 787 DDGDPFDGDERVPKSKDLEGVL-GMESAVVIIDDSVRVWPHNKLNLIVVERYTYFP---- 841
              + F     + K+ +++ +    +  + IIDD   VW ++   LI V+ Y +F     
Sbjct: 236 --DELFSA---MYKTGNMKALFPCGDQLIAIIDDRPDVWQYSDA-LIQVKPYRFFKETGD 289

Query: 842 ------CSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSH--QSLDD--V 891
                 C+ +Q  L+   + +++  E   D TL     V+ R+H  F++H  +++ D   
Sbjct: 290 INAPTICNAQQQSLVQERIAQVNV-EGDGDETLEFVATVLTRVHTTFYAHYDKNMSDRVQ 348

Query: 892 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVV 951
           DV+ ++A  + ++L GC IV S + P+G  +      +    QFGA  T+ + D  THV+
Sbjct: 349 DVKLVIAYMRSQVLRGCSIVLSGIVPIG-MDVRNTEAFHLCIQFGATVTESVTDTTTHVI 407

Query: 952 ANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAI 993
           A   GT KV+ A       +V+P W+ A    + +A+E++F +
Sbjct: 408 AARWGTTKVHDARRRANIAIVNPRWLYACVERWEKADEKEFEL 450


>gi|15229069|ref|NP_188382.1| haloacid dehalogenase-like hydrolase domain-containing protein
           [Arabidopsis thaliana]
 gi|9294142|dbj|BAB02044.1| unnamed protein product [Arabidopsis thaliana]
 gi|332642446|gb|AEE75967.1| haloacid dehalogenase-like hydrolase domain-containing protein
           [Arabidopsis thaliana]
          Length = 296

 Score =  127 bits (318), Expect = 5e-26,   Method: Composition-based stats.
 Identities = 88/263 (33%), Positives = 135/263 (51%), Gaps = 30/263 (11%)

Query: 648 EHLFEGYDDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDE 707
           ++L +G     +AA     T+R   Q    + +KL LVLDLDHTLL+S +   +      
Sbjct: 55  DYLVQGLQLSHEAA---AFTKRFTTQFYCLNEKKLNLVLDLDHTLLHSIRVSLLSETEKC 111

Query: 708 ILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEM 767
           ++ +     RE     L++       TKLRP +  FL+ A++LF M++YTMG ++YA  +
Sbjct: 112 LIEEACSTTRED----LWKLDS-DYLTKLRPFVHEFLKEANELFTMYVYTMGTRVYAESL 166

Query: 768 AKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHN 827
            K++DPK + F  RVI+R +           P  K L+ VL  E  VVI+DD+  VW H+
Sbjct: 167 LKLIDPKRIYFGDRVITRDES----------PYVKTLDLVLAEERGVVIVDDTSDVWTHH 216

Query: 828 KLNLIVVERYTYFPCSRRQFGLLGP----SLLEIDHDERSEDGTLASSLGVIERLHKIFF 883
           K NL+ +  Y +F  +       GP    S  E   DE   +G LA+ L +++ +H  FF
Sbjct: 217 KSNLVEINEYHFFRVN-------GPEESNSYTEEKRDESKNNGGLANVLKLLKEVHYGFF 269

Query: 884 S-HQSLDDVDVRNILAAEQRKIL 905
              + L+  DVR +L     K+L
Sbjct: 270 RVKEELESQDVRFLLQEIDFKLL 292


>gi|297846748|ref|XP_002891255.1| NLI interacting factor family protein [Arabidopsis lyrata subsp.
           lyrata]
 gi|297337097|gb|EFH67514.1| NLI interacting factor family protein [Arabidopsis lyrata subsp.
           lyrata]
          Length = 210

 Score =  126 bits (317), Expect = 5e-26,   Method: Composition-based stats.
 Identities = 78/221 (35%), Positives = 119/221 (53%), Gaps = 31/221 (14%)

Query: 680 RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQ------DREKPHRHLFRFPHMGMW 733
           +KL LVLDLDHTL+++    ++      +L + + +      +++ P+  +         
Sbjct: 3   KKLHLVLDLDHTLIHTVLVSDLSEREKYLLEEADSRQDLWRCNKDSPYEFII-------- 54

Query: 734 TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFD 793
            KLRP +  FL  A+KLF MH+YTMGN  YA ++ K++DP  V F  RVI+R        
Sbjct: 55  -KLRPFVHEFLLEANKLFTMHVYTMGNSCYAQDVLKLIDPDKVYFGNRVITR-------- 105

Query: 794 GDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPS 853
             E  P +K L+ ++     VVI+DD++ VWPH+K NL+ + +Y YF    R  G    S
Sbjct: 106 --EASPCNKTLDLLVADTRRVVIVDDTISVWPHHKRNLLQITKYIYF----RVDGTKWDS 159

Query: 854 LLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVR 894
             E   DE  + G+LA+ L  +E +HK F   + LD  D+R
Sbjct: 160 YAEEKKDESRKSGSLANVLKFLEDVHKRF--EEDLDSKDLR 198


>gi|297835808|ref|XP_002885786.1| hypothetical protein ARALYDRAFT_899317 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297331626|gb|EFH62045.1| hypothetical protein ARALYDRAFT_899317 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 285

 Score =  126 bits (317), Expect = 6e-26,   Method: Composition-based stats.
 Identities = 84/260 (32%), Positives = 134/260 (51%), Gaps = 21/260 (8%)

Query: 641 QSAWGDVEHLFEGYDDQQKA-AIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFH 699
           +S W   +++F G     +A A+ K RT          + +KL LVLDLDHTLL+  K  
Sbjct: 29  KSQWRAFDYIFNGLQLSHEAVALTKSRT----TNNSCLNEKKLHLVLDLDHTLLHMKKVP 84

Query: 700 EVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMG 759
            +      ++++     RE   +       +    KLRP +  FL+ A+++F M++YT G
Sbjct: 85  CLSRAEMYLIQEACSVTREDIWKIRLLGDPIDRLIKLRPFVRDFLKEANEMFTMYVYTKG 144

Query: 760 NKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDD 819
            + YA  + +++DP  + F  RVI++ +           P  K L+ VL  E  VVI+DD
Sbjct: 145 TRKYAKAVLELIDPNRLYFGDRVITKDES----------PHQKTLDLVLAEERGVVIVDD 194

Query: 820 SVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLH 879
              +WPH+K NLI + +Y YF  S    G    S  E   DE  +DG LA+ L +++++H
Sbjct: 195 RRDIWPHHKSNLIEISKYKYFRVS----GQGSNSYSEKKTDESEKDGGLANVLKLLKQVH 250

Query: 880 KIFF--SHQSLDDVDVRNIL 897
             FF    + L+ +DVR++L
Sbjct: 251 CRFFMVEEEKLESMDVRSLL 270


>gi|297834870|ref|XP_002885317.1| NLI interacting factor family protein [Arabidopsis lyrata subsp.
           lyrata]
 gi|297331157|gb|EFH61576.1| NLI interacting factor family protein [Arabidopsis lyrata subsp.
           lyrata]
          Length = 592

 Score =  126 bits (316), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 81/239 (33%), Positives = 131/239 (54%), Gaps = 27/239 (11%)

Query: 667 TRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFR 726
           T+RL  +    + +KL LVLDLDHTLL+S +   +      ++ +     RE     L++
Sbjct: 74  TKRLTTKFSCLNMKKLHLVLDLDHTLLHSVRVQFLSEAEKYLIEEAGSTTRED----LWK 129

Query: 727 FPHMG--------MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLF 778
               G          TKLRP +  FL+ A+KLF M++YT G + YA  + K++DPK + F
Sbjct: 130 MKVKGDPIPITIEYLTKLRPFLREFLKEANKLFTMYVYTKGTRRYAKAILKLIDPKKLYF 189

Query: 779 AGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYT 838
             RVI+R +           P +K L+ VL  E  VVI+DD+  +WP++K NL+V+ +Y 
Sbjct: 190 GHRVITRNES----------PHTKTLDLVLADERGVVIVDDTRNIWPNHKSNLVVIGKYK 239

Query: 839 YFPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNIL 897
           YF   R +  +L P   E   DE   +G LA+ L +++ +H+ FF  +  ++V+ +++L
Sbjct: 240 YF---RFEGRVLKPHSEEKTTDESENNGGLANVLKLLKEVHRKFFRVE--EEVESQDVL 293



 Score =  110 bits (276), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 77/227 (33%), Positives = 117/227 (51%), Gaps = 28/227 (12%)

Query: 680 RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMG----MWTK 735
           +KL LVLDLDHTLL++     +      +L +     RE     L++   +G      TK
Sbjct: 389 KKLHLVLDLDHTLLHTVMVPSLSQAEKYLLEEAGSATRED----LWKIKAIGDPMEFLTK 444

Query: 736 LRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGD 795
           LRP +  FL+ A+++F M++YT G++ YA ++ +++DPK + F  RVI++ +        
Sbjct: 445 LRPFVREFLKEANQMFTMYVYTKGSRGYAKQVLELIDPKKLYFEDRVITKNES------- 497

Query: 796 ERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLL 855
              P  K L+ VL  E  VVI+DD   VWP +K NL+ + +YTYF    R  G       
Sbjct: 498 ---PHMKTLDLVLAEERGVVIVDDMRTVWPDHKSNLVDISKYTYF----RLKGQESMPYS 550

Query: 856 EIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQR 902
           E   DE   DG LA+ L +++ +H  FF       V+ R + A   R
Sbjct: 551 EEMTDESESDGGLANVLKLLKEVHSRFFR------VEGRQVAATRNR 591


>gi|384501479|gb|EIE91970.1| hypothetical protein RO3G_16681 [Rhizopus delemar RA 99-880]
          Length = 494

 Score =  126 bits (316), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 89/330 (26%), Positives = 160/330 (48%), Gaps = 50/330 (15%)

Query: 670 LEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPH 729
           +E  K++   +KL L+LDLD T+++++    +    +  +R+             F  P 
Sbjct: 20  VENAKRLLDQKKLSLILDLDQTIVHASCDQRISQWQNPDIRQ-------------FNLPR 66

Query: 730 --MGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD 787
             +  + KLRPG+  FL+   +L+E+H+YTMG K YA  +AK +DP+G LF  R++SR +
Sbjct: 67  SPLVYYIKLRPGLIEFLKEIEELYELHIYTMGTKDYAKAVAKEIDPEGCLFKERILSRDE 126

Query: 788 DGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQ 846
            G            K L+ +   + S VV++DD   VW ++  NL+ ++ Y YF      
Sbjct: 127 SG--------CLTQKKLQRIFPCDTSMVVVLDDRSDVWSYSP-NLVRIKPYEYF------ 171

Query: 847 FGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQRKILA 906
              +G             D    +   +++++H+ F+ ++   + DV  I+   +R++L 
Sbjct: 172 ---IGTG-----------DIHSPTKNKILKKIHQEFYKNKK--EGDVTKIIPNMKRQVLH 215

Query: 907 GCRIVFS-RVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALS 965
            C I F+  V P    +P L  +WQ    FGA+C+  +  + TH++A      K   A  
Sbjct: 216 HCIISFAPDVIPSNLKDPTLSWIWQMGTSFGALCSNDLTGKTTHLIAVRWDA-KAKAAKD 274

Query: 966 TGRF-VVHPGWVEASALLYRRANEQDFAIK 994
            G   +V P W+  S   +   +E+ +A++
Sbjct: 275 YGHSKIVTPAWLLDSTARWAIQDEEAYALE 304


>gi|297834668|ref|XP_002885216.1| NLI interacting factor family protein [Arabidopsis lyrata subsp.
           lyrata]
 gi|297331056|gb|EFH61475.1| NLI interacting factor family protein [Arabidopsis lyrata subsp.
           lyrata]
          Length = 296

 Score =  125 bits (314), Expect = 1e-25,   Method: Composition-based stats.
 Identities = 87/263 (33%), Positives = 135/263 (51%), Gaps = 30/263 (11%)

Query: 648 EHLFEGYDDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDE 707
           ++L +G     +AA     T+R   +    + +KL LVLDLDHTLL+S +   +      
Sbjct: 55  DYLVQGLQLSHEAA---AFTKRFTTEFYCLNEKKLHLVLDLDHTLLHSIRVSILSETERY 111

Query: 708 ILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEM 767
           ++ +     RE     L++   +   TKLRP +  FL+ A+++F M++YTMG ++YA  +
Sbjct: 112 LIEEACSTTRED----LWKL-DIDYLTKLRPFVHEFLKEANEMFTMYVYTMGTRVYAESL 166

Query: 768 AKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHN 827
            K++DPK + F  RVI+R +           P  K L+ VL  E  VVI+DD+  VW H+
Sbjct: 167 LKLIDPKRIYFGDRVITRDES----------PYVKTLDLVLADERGVVIVDDTRDVWTHH 216

Query: 828 KLNLIVVERYTYFPCSRRQFGLLGP----SLLEIDHDERSEDGTLASSLGVIERLHKIFF 883
           K NL+ +  Y YF  +       GP    S  E   DE    G LA+ L +++ +H  FF
Sbjct: 217 KSNLVEINEYHYFRVN-------GPEESKSYTEEKRDESKNSGGLANVLKLLKEVHYGFF 269

Query: 884 S-HQSLDDVDVRNILAAEQRKIL 905
              + L+  DVR +L     K+L
Sbjct: 270 RVKEELESQDVRFMLQEIDFKLL 292


>gi|308500103|ref|XP_003112237.1| CRE-FCP-1 protein [Caenorhabditis remanei]
 gi|308268718|gb|EFP12671.1| CRE-FCP-1 protein [Caenorhabditis remanei]
          Length = 664

 Score =  124 bits (312), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 100/339 (29%), Positives = 171/339 (50%), Gaps = 46/339 (13%)

Query: 672 EQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP-HM 730
           ++  + + RKL L++DLD T+++++              K    D EK H+ + ++  H 
Sbjct: 134 DETNLITTRKLVLLVDLDQTIIHTSD-------------KPMSADAEK-HKDITKYNLHS 179

Query: 731 GMWT-KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 789
            ++T KLRP    FL + + ++EMH+ T G + YA  +A++LDP   LF  R++SR    
Sbjct: 180 RVYTTKLRPHTTEFLNKMAAMYEMHIVTYGQRQYAHRIAQILDPDARLFGQRILSR---D 236

Query: 790 DPFDGDERVPKSKDLEGVL-GMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFG 848
           + F       K+++L+ +    ++ VVIIDD   VW +++  LI ++ Y +F    ++ G
Sbjct: 237 ELFSAQH---KTRNLKALFPCGDNLVVIIDDRADVWQYSEA-LIQIKPYRFF----KEVG 288

Query: 849 LLG---------PSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDD-----VDVR 894
            +          P  +E   D+  ED  L     V+  +H  ++    L D     +DV+
Sbjct: 289 DINAPKDSKEQMPVQIE---DDAHEDRVLEEIERVLTNIHDKYYEKHDLKDGDQALLDVK 345

Query: 895 NILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANS 954
            ++  E+ K+L GC IVFS + P GE       +++   QFGA     + ++VTHVV   
Sbjct: 346 EVIKEERHKVLDGCVIVFSGIVPTGEKLERT-DIYRLCVQFGATIVPEVVEEVTHVVGAR 404

Query: 955 LGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAI 993
            GT KV+ A   G+ VV   WV A    + +A+E+ F +
Sbjct: 405 YGTTKVHQAHRLGKNVVTVQWVYACVEKWMKADEKQFEL 443


>gi|224091747|ref|XP_002309339.1| predicted protein [Populus trichocarpa]
 gi|222855315|gb|EEE92862.1| predicted protein [Populus trichocarpa]
          Length = 204

 Score =  123 bits (308), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 81/266 (30%), Positives = 116/266 (43%), Gaps = 78/266 (29%)

Query: 732 MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 791
           M  K RP    FL+ AS++F +++YT+G+  YA EMAK+LDP G  F  +V SR D    
Sbjct: 1   MMIKSRPFARMFLKEASQMFGLYMYTLGDPAYALEMAKLLDPGGEFFNAKVTSRDD---- 56

Query: 792 FDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLG 851
             G +R  K  D+                                               
Sbjct: 57  --GTQRHQKGHDV----------------------------------------------- 67

Query: 852 PSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDV-------DVRNILAAEQRKI 904
                +  DE    G LAS L  + ++H IFF    L ++       DVR +L   +R +
Sbjct: 68  -----LKSDESESGGALASVLKALRKVHHIFFEGTLLQELEENPDGRDVRKVLKTVRRDV 122

Query: 905 LAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWAL 964
           L GC+IVFSRVFP  +     H LW+  EQ GA C+               GT+K   AL
Sbjct: 123 LKGCKIVFSRVFPT-QFQADNHHLWRMVEQLGATCS------------TEAGTEKSRRAL 169

Query: 965 STGRFVVHPGWVEASALLYRRANEQD 990
              +F+VHPGW+EA+   +++  E++
Sbjct: 170 KHNKFLVHPGWIEATNYFWQKQPEEN 195


>gi|170578206|ref|XP_001894313.1| NLI interacting factor-like phosphatase family protein [Brugia
           malayi]
 gi|158599134|gb|EDP36825.1| NLI interacting factor-like phosphatase family protein [Brugia
           malayi]
          Length = 576

 Score =  123 bits (308), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 92/332 (27%), Positives = 166/332 (50%), Gaps = 37/332 (11%)

Query: 676 MFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTK 735
           +  A KL L++DLD TL+++   H  +           E D +  H   ++      +TK
Sbjct: 141 LLKAHKLVLLVDLDQTLIHTTN-HTFNL----------ENDTDVLH---YKLKGTDFYTK 186

Query: 736 LRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGD 795
           +RP    FL R + L+EMH+ + G + YA  +A+ LDP+ + F  R++SR    + F   
Sbjct: 187 IRPHAHEFLRRMASLYEMHIISYGERQYAHRIAEFLDPEKIYFGHRILSR---DELFSA- 242

Query: 796 ERVPKSKDLEGVL-GMESAVVIIDDSVRVWPHNKLNLIVVERYTYF--------PCSRRQ 846
             + K+++++ +    +  +V+IDD   VW ++   LI V+ Y +F        P + + 
Sbjct: 243 --MYKTRNMQALFPCGDHMIVMIDDRPDVWQYSD-ALIQVKPYRFFKEIGDINAPRNEKG 299

Query: 847 FGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDV----DVRNILAAEQR 902
             +L  S  E D  E  +D TL     V+ ++H  F+       V    D++ I++  ++
Sbjct: 300 EPILSGSYAEQDM-ESEDDETLEYIALVLTKVHSAFYELFDSAKVNRFPDLKGIISYLRK 358

Query: 903 KILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNW 962
           ++L  C IV S + PVG  +     +++   QFGAV T ++++  THV+A   GT KV+ 
Sbjct: 359 QVLRECSIVLSGIVPVG-VDIKKTEVYRLCVQFGAVVTDNVNNSTTHVIAARWGTTKVHE 417

Query: 963 ALSTGRF-VVHPGWVEASALLYRRANEQDFAI 993
           A       +V+P W+      + +A+E++F +
Sbjct: 418 AEKLSNIHIVNPKWLFTCVERWEKADEKEFEL 449


>gi|308464266|ref|XP_003094401.1| hypothetical protein CRE_07009 [Caenorhabditis remanei]
 gi|308247823|gb|EFO91775.1| hypothetical protein CRE_07009 [Caenorhabditis remanei]
          Length = 754

 Score =  122 bits (306), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 98/338 (28%), Positives = 169/338 (50%), Gaps = 45/338 (13%)

Query: 672 EQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP-HM 730
           ++  + + RKL L++DLD T+++++         D+++    E+     H+ + ++  H 
Sbjct: 230 DETNLITTRKLVLLVDLDQTIIHTS---------DKLMSADAEK-----HKDITKYNLHS 275

Query: 731 GMWT-KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 789
            ++T KLRP    FL + S ++EMH+ T G + YA  +AK+LDP   LF  R++SR +  
Sbjct: 276 RVYTTKLRPHTTEFLNKMSAMYEMHIVTFGERKYALRIAKILDPDARLFGQRILSRNE-- 333

Query: 790 DPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGL 849
                 +   ++K L      ++ VVIIDD   VW +++  LI ++ Y +F    ++ G 
Sbjct: 334 --LSSAQHKTENKAL--FPCGDNLVVIIDDRADVWQYSEA-LIQIKPYRFF----KEVGD 384

Query: 850 LG---------PSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDD-----VDVRN 895
           +          P  +E   D+  ED  L     V+  +H  ++    L D     +DV+ 
Sbjct: 385 INAPKHSKEQMPVQIE---DDAHEDRVLEEIERVLTNIHNKYYEKHDLKDADQALLDVKE 441

Query: 896 ILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSL 955
           ++  E+ K+L GC IVFS +   GE       ++    QFGA     + ++VTHVV    
Sbjct: 442 VIKEERHKVLDGCVIVFSGIVSAGEKLERTE-IYHLCLQFGATIVPEVVEEVTHVVGARY 500

Query: 956 GTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAI 993
           GT+KV+ A   G+ VV   WV A    + +A+E+ F +
Sbjct: 501 GTEKVHQAHRLGKNVVTVQWVYACVEKWMKADEKQFEL 538


>gi|15218405|ref|NP_175026.1| NLI interacting factor (NIF) family protein [Arabidopsis thaliana]
 gi|91805923|gb|ABE65690.1| NLI interacting factor family protein [Arabidopsis thaliana]
 gi|332193852|gb|AEE31973.1| NLI interacting factor (NIF) family protein [Arabidopsis thaliana]
          Length = 255

 Score =  122 bits (305), Expect = 1e-24,   Method: Composition-based stats.
 Identities = 83/232 (35%), Positives = 123/232 (53%), Gaps = 25/232 (10%)

Query: 675 KMFSA---RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKE-EQDREKPHRHLFRFPHM 730
           ++FS    +KL LVLDLDHTLL+S    ++      +L + +  QD  + +   + F   
Sbjct: 43  QLFSVTKKKKLHLVLDLDHTLLHSVLVSDLSKREKYLLEETDSRQDLWRRNVDGYEF--- 99

Query: 731 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 790
               KLRP +  FL  A+KLF MH+YTMG+  YA ++ K++DP  V F  RVI+R     
Sbjct: 100 --IIKLRPFLHEFLLEANKLFTMHVYTMGSSSYAKQVLKLIDPDKVYFGKRVITR----- 152

Query: 791 PFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLL 850
                E  P +K L+ +   +  VVI+DD+V VWP +K NL+ + +Y YF       G  
Sbjct: 153 -----EASPFNKSLDLLAADKRRVVIVDDTVHVWPFHKRNLLQITKYIYFKVD----GTK 203

Query: 851 GPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQR 902
             S  E   DE   +G+LA+ L  +E +HK F   + L   D+R ++   QR
Sbjct: 204 WDSYAEAKKDESQSNGSLANVLKFLEVVHKRF--EEDLGFKDLRLLIPCRQR 253


>gi|116830952|gb|ABK28432.1| unknown [Arabidopsis thaliana]
          Length = 256

 Score =  122 bits (305), Expect = 1e-24,   Method: Composition-based stats.
 Identities = 83/232 (35%), Positives = 123/232 (53%), Gaps = 25/232 (10%)

Query: 675 KMFSA---RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKE-EQDREKPHRHLFRFPHM 730
           ++FS    +KL LVLDLDHTLL+S    ++      +L + +  QD  + +   + F   
Sbjct: 43  QLFSVTKKKKLHLVLDLDHTLLHSVLVSDLSKREKYLLEETDSRQDLWRRNVDGYEF--- 99

Query: 731 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 790
               KLRP +  FL  A+KLF MH+YTMG+  YA ++ K++DP  V F  RVI+R     
Sbjct: 100 --IIKLRPFLHEFLLEANKLFTMHVYTMGSSSYAKQVLKLIDPDKVYFGKRVITR----- 152

Query: 791 PFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLL 850
                E  P +K L+ +   +  VVI+DD+V VWP +K NL+ + +Y YF       G  
Sbjct: 153 -----EASPFNKSLDLLAADKRRVVIVDDTVHVWPFHKRNLLQITKYIYFKVD----GTK 203

Query: 851 GPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQR 902
             S  E   DE   +G+LA+ L  +E +HK F   + L   D+R ++   QR
Sbjct: 204 WDSYAEAKKDESQSNGSLANVLKFLEVVHKRF--EEDLGFKDLRLLIPCRQR 253


>gi|334185470|ref|NP_188594.3| haloacid dehalogenase-like hydrolase domain-containing protein
           [Arabidopsis thaliana]
 gi|332642744|gb|AEE76265.1| haloacid dehalogenase-like hydrolase domain-containing protein
           [Arabidopsis thaliana]
          Length = 302

 Score =  122 bits (305), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 82/242 (33%), Positives = 131/242 (54%), Gaps = 25/242 (10%)

Query: 667 TRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFR 726
           T+RL  +    + +KL LVLDLD TL++S +   +      ++ +     RE   +   R
Sbjct: 74  TKRLITKFSCLNMKKLHLVLDLDLTLIHSVRVPCLSEAEKYLIEEAGSTTREDLWKMKVR 133

Query: 727 -------FPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA 779
                    H+    KLRP +  FL+ A+++F M++YT G + YA  + K++DPK + F 
Sbjct: 134 GDPISITIEHL---VKLRPFLCEFLKEANEMFTMYVYTKGTRPYAEAILKLIDPKKLYFG 190

Query: 780 GRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTY 839
            RVI+R +           P +K L+ VL  E  VVI+DD+ + WP+NK NL+++ RY Y
Sbjct: 191 HRVITRNES----------PHTKTLDMVLADERGVVIVDDTRKAWPNNKSNLVLIGRYNY 240

Query: 840 FPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFS-HQSLDDVDVRNILA 898
           F   R Q  +L P   E   DE   +G LA+ L +++ +H  FF   + ++  DVR+ILA
Sbjct: 241 F---RSQSRVLKPHSEE-KTDESENNGGLANVLKLLKGIHHKFFKVEEEVESQDVRDILA 296

Query: 899 AE 900
           ++
Sbjct: 297 SK 298


>gi|15217916|ref|NP_173457.1| haloacid dehalogenase-like hydrolase [Arabidopsis thaliana]
 gi|9558594|gb|AAF88157.1|AC026234_8 Contains similarity to a FCP1 serine phosphatase from Xenopus
           laevis gi|6689545 [Arabidopsis thaliana]
 gi|332191840|gb|AEE29961.1| haloacid dehalogenase-like hydrolase [Arabidopsis thaliana]
          Length = 342

 Score =  120 bits (302), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 87/241 (36%), Positives = 125/241 (51%), Gaps = 37/241 (15%)

Query: 667 TRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQ------DREKP 720
           T+ L  Q    + RKL LVLDLDHTLL+S     +      +L + + +      DRE  
Sbjct: 62  TKSLTTQLACLNERKLHLVLDLDHTLLHSIMISRLSEGEKYLLGESDFREDLWTLDRE-- 119

Query: 721 HRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAG 780
                      M  KLRP +  FL+ A+++F M++YTMGN+ YA  + K +DPK V F  
Sbjct: 120 -----------MLIKLRPFVHEFLKEANEIFSMYVYTMGNRDYAQAVLKWIDPKKVYFGD 168

Query: 781 RVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           RVI+R + G           SK L+ VL  E  VVI+DD+  VWP ++ NL+ + +Y+YF
Sbjct: 169 RVITRDESG----------FSKTLDLVLADECGVVIVDDTRHVWPDHERNLLQITKYSYF 218

Query: 841 PCSRRQFG--LLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSH--QSLDDVDVRNI 896
               R +       S  E   DE    G+LA+ L V++ +H+ FF    + LD  DVR +
Sbjct: 219 ----RDYSHDKESKSYAEEKRDESRNQGSLANVLKVLKDVHQEFFRGGIEELDSKDVRLL 274

Query: 897 L 897
           L
Sbjct: 275 L 275


>gi|9294425|dbj|BAB02545.1| unnamed protein product [Arabidopsis thaliana]
          Length = 314

 Score =  120 bits (301), Expect = 3e-24,   Method: Composition-based stats.
 Identities = 79/236 (33%), Positives = 127/236 (53%), Gaps = 19/236 (8%)

Query: 667 TRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFR 726
           T+RL  +    + +KL LVLDLD TL++S +   +      ++ +     RE   +   R
Sbjct: 74  TKRLITKFSCLNMKKLHLVLDLDLTLIHSVRVPCLSEAEKYLIEEAGSTTREDLWKMKVR 133

Query: 727 FPHMGM----WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 782
              + +      KLRP +  FL+ A+++F M++YT G + YA  + K++DPK + F  RV
Sbjct: 134 GDPISITIEHLVKLRPFLCEFLKEANEMFTMYVYTKGTRPYAEAILKLIDPKKLYFGHRV 193

Query: 783 ISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPC 842
           I+R +           P +K L+ VL  E  VVI+DD+ + WP+NK NL+++ RY YF  
Sbjct: 194 ITRNES----------PHTKTLDMVLADERGVVIVDDTRKAWPNNKSNLVLIGRYNYF-- 241

Query: 843 SRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFS-HQSLDDVDVRNIL 897
            R Q  +L P   E   DE   +G LA+ L +++ +H  FF   + ++  DVR +L
Sbjct: 242 -RSQSRVLKPHSEE-KTDESENNGGLANVLKLLKGIHHKFFKVEEEVESQDVRLLL 295


>gi|297808347|ref|XP_002872057.1| NLI interacting factor family protein [Arabidopsis lyrata subsp.
           lyrata]
 gi|297317894|gb|EFH48316.1| NLI interacting factor family protein [Arabidopsis lyrata subsp.
           lyrata]
          Length = 302

 Score =  119 bits (297), Expect = 1e-23,   Method: Composition-based stats.
 Identities = 84/253 (33%), Positives = 128/253 (50%), Gaps = 24/253 (9%)

Query: 648 EHLFEGYDDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDE 707
           ++L++G     +A +    T+R+  Q      +KL LVLDLDHTL+++ K  ++     E
Sbjct: 56  DYLYKGMHMSHEALV---FTKRVISQTSWLEDKKLHLVLDLDHTLVHTIKASQL--YESE 110

Query: 708 ILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEM 767
               +E   R+   R    FP   +  KLRP +  FL+  +++F M++YT G   YA  +
Sbjct: 111 KCLTEEVGSRKDLWRFNSGFPDESL-IKLRPFVHQFLKECNEMFSMYVYTKGGCDYAQVV 169

Query: 768 AKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHN 827
            +++DP+ + F  RVI+R +           P  K L+ VL  E  VVI+DD   VWPH+
Sbjct: 170 LELIDPEKIYFGNRVITRRES----------PDLKTLDLVLADERGVVIVDDKCSVWPHD 219

Query: 828 KLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFF---S 884
           K NL+ + +Y YF      F     S  +   DE  E G L   L  ++ +H  FF   S
Sbjct: 220 KKNLLQIAKYKYFGDQSCSF-----SECKNKRDESEEKGPLDIVLRFLKDVHNEFFCDWS 274

Query: 885 HQSLDDVDVRNIL 897
            + LD VDVR +L
Sbjct: 275 RKDLDSVDVRPLL 287


>gi|15226925|ref|NP_178335.1| Haloacid dehalogenase-like hydrolase-like protein [Arabidopsis
           thaliana]
 gi|3894162|gb|AAC78512.1| hypothetical protein [Arabidopsis thaliana]
 gi|330250469|gb|AEC05563.1| Haloacid dehalogenase-like hydrolase-like protein [Arabidopsis
           thaliana]
          Length = 302

 Score =  118 bits (296), Expect = 2e-23,   Method: Composition-based stats.
 Identities = 81/234 (34%), Positives = 122/234 (52%), Gaps = 21/234 (8%)

Query: 667 TRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFR 726
           T+RL  Q      +KL LVLDLDHTL+++ K  ++      I   +E + R+   R    
Sbjct: 72  TKRLISQTSWLEDKKLHLVLDLDHTLVHTIKVSQLSESEKYI--TEEVESRKDLRRFNTG 129

Query: 727 FPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG 786
           FP   +  KLR  +  FL+  +++F +++YT G   YA  + +++DP  + F  RVI+R 
Sbjct: 130 FPEESL-IKLRSFVHQFLKECNEMFSLYVYTKGGYDYAQLVLEMIDPDKIYFGNRVITRR 188

Query: 787 DDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQ 846
           +           P  K L+ VL  E  +V++DD   VWPH+K NL+ + RY YF     Q
Sbjct: 189 ES----------PGFKTLDLVLADERGIVVVDDKSSVWPHDKKNLLQIARYKYFG---DQ 235

Query: 847 FGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFF---SHQSLDDVDVRNIL 897
             LL     +I  DE  E G L ++L  +  +H+ FF   S + LD VDVR +L
Sbjct: 236 SCLLSECKKKI--DESDEKGPLNTALRFLMDVHEEFFCDWSRKDLDSVDVRPLL 287


>gi|312066139|ref|XP_003136128.1| hypothetical protein LOAG_00540 [Loa loa]
          Length = 577

 Score =  117 bits (294), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 96/353 (27%), Positives = 173/353 (49%), Gaps = 49/353 (13%)

Query: 662 IQKERTRRL--EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREK 719
           +  E  R++   +++ +  A KL L++DLD TL+++   H            K ++D + 
Sbjct: 126 VSDELARKIGSRDRELLLKAHKLVLLVDLDQTLIHTTN-HTF----------KVDKDTDV 174

Query: 720 PHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA 779
            H   ++      +TK+RP    FL R ++L+EMH+ + G + YA  +A+ LDP  + F 
Sbjct: 175 LH---YKLKGTDFYTKIRPYAREFLRRMAELYEMHIISYGERQYAHRIAEFLDPDKIYFG 231

Query: 780 GRVISRGDDGDPFDGDE---RVPKSKDLEGVL-GMESAVVIIDDSVRVWPHNKLNLIVVE 835
            R++SR         DE    + K+++++ +    +  +V+IDD   VW ++   LI V+
Sbjct: 232 HRILSR---------DELFCAMYKTRNMQALFPCGDHMIVMIDDRPDVWQYSD-ALIQVK 281

Query: 836 RYTYFP------CSRRQFG--LLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQS 887
            Y +F         R + G  +L  S  E D  E  +D TL     V+ ++H  F+  + 
Sbjct: 282 PYRFFKEIGDINAPRYEKGEPILSGSYAEQDM-ESEDDETLEYVAVVLTKIHNAFY--EL 338

Query: 888 LDDV------DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTK 941
            D        D++ I++  ++++L  C IV S + PVG  +      ++   QFGAV   
Sbjct: 339 FDGAKINRFPDLKGIISYLRKQVLRDCSIVLSGIVPVG-VDIKKTEAYRLCMQFGAVVMD 397

Query: 942 HIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAI 993
           ++ D  THV+A   GT KV+ A       +V+P W+      + +A+E++F +
Sbjct: 398 NVSDSTTHVIAARWGTTKVHEAHKLPNIHIVNPKWLFTCVERWEKADEKEFEL 450


>gi|15218404|ref|NP_175025.1| NLI interacting factor (NIF) family protein [Arabidopsis thaliana]
 gi|117958727|gb|ABK59679.1| At1g43600 [Arabidopsis thaliana]
 gi|332193851|gb|AEE31972.1| NLI interacting factor (NIF) family protein [Arabidopsis thaliana]
          Length = 221

 Score =  117 bits (294), Expect = 3e-23,   Method: Composition-based stats.
 Identities = 79/221 (35%), Positives = 116/221 (52%), Gaps = 22/221 (9%)

Query: 682 LCLVLDLDHTLLNSAKFHEVDPVHDEILRKKE-EQDREKPHRHLFRFPHMGMWTKLRPGI 740
           L LVLDLDHTLL+S    ++      +L + +  QD  + +   + F       KLRP +
Sbjct: 19  LHLVLDLDHTLLHSVLVSDLSKREKYLLEETDSRQDLWRRNVDGYEF-----IIKLRPFL 73

Query: 741 WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPK 800
             FL  A+KLF MH+YTMG+  YA ++ K++DP  V F  RVI+R          E  P 
Sbjct: 74  HEFLLEANKLFTMHVYTMGSSSYAKQVLKLIDPDKVYFGKRVITR----------EASPF 123

Query: 801 SKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHD 860
           +K L+ +   +  VVI+DD+V VWP +K NL+ + +Y YF       G    S  E   D
Sbjct: 124 NKSLDLLAADKRRVVIVDDTVHVWPFHKRNLLQITKYVYFKVD----GTKWDSYAEAKKD 179

Query: 861 ERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQ 901
           E   +G+LA+ L  +E +HK F   + L   D+R ++   Q
Sbjct: 180 ESQSNGSLANVLKFLEDVHKRF--EEDLGFKDLRLLIPCRQ 218


>gi|393909596|gb|EFO27947.2| hypothetical protein LOAG_00540 [Loa loa]
          Length = 506

 Score =  117 bits (293), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 93/338 (27%), Positives = 168/338 (49%), Gaps = 41/338 (12%)

Query: 672 EQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMG 731
           +++ +  A KL L++DLD TL+++   H            K ++D +  H   ++     
Sbjct: 67  DRELLLKAHKLVLLVDLDQTLIHTTN-HTF----------KVDKDTDVLH---YKLKGTD 112

Query: 732 MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 791
            +TK+RP    FL R ++L+EMH+ + G + YA  +A+ LDP  + F  R++SR    + 
Sbjct: 113 FYTKIRPYAREFLRRMAELYEMHIISYGERQYAHRIAEFLDPDKIYFGHRILSR---DEL 169

Query: 792 FDGDERVPKSKDLEGVL-GMESAVVIIDDSVRVWPHNKLNLIVVERYTYFP------CSR 844
           F     + K+++++ +    +  +V+IDD   VW ++   LI V+ Y +F         R
Sbjct: 170 FCA---MYKTRNMQALFPCGDHMIVMIDDRPDVWQYSD-ALIQVKPYRFFKEIGDINAPR 225

Query: 845 RQFG--LLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDV------DVRNI 896
            + G  +L  S  E D  E  +D TL     V+ ++H  F+  +  D        D++ I
Sbjct: 226 YEKGEPILSGSYAEQDM-ESEDDETLEYVAVVLTKIHNAFY--ELFDGAKINRFPDLKGI 282

Query: 897 LAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLG 956
           ++  ++++L  C IV S + PVG  +      ++   QFGAV   ++ D  THV+A   G
Sbjct: 283 ISYLRKQVLRDCSIVLSGIVPVG-VDIKKTEAYRLCMQFGAVVMDNVSDSTTHVIAARWG 341

Query: 957 TDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAI 993
           T KV+ A       +V+P W+      + +A+E++F +
Sbjct: 342 TTKVHEAHKLPNIHIVNPKWLFTCVERWEKADEKEFEL 379


>gi|367047187|ref|XP_003653973.1| hypothetical protein THITE_2116513 [Thielavia terrestris NRRL 8126]
 gi|347001236|gb|AEO67637.1| hypothetical protein THITE_2116513 [Thielavia terrestris NRRL 8126]
          Length = 909

 Score =  116 bits (290), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 79/226 (34%), Positives = 122/226 (53%), Gaps = 28/226 (12%)

Query: 665 ERTRRLEEQKKMFSARKLCLVLDLDHTLLNSA--------KFHEVDPVHDEILRKKEEQD 716
           +RT R E Q+++  +RKL LV+DLD T++ +         +    +P H+ +   K  Q 
Sbjct: 147 QRTER-ELQRRLLQSRKLSLVVDLDQTIIQACIDPTVGEWQRDPTNPNHESVKEVKSFQL 205

Query: 717 REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 776
            + P     R  +   + K+RPG+  FL+R S+L+EMH+YTMG + YA  +A+V+DP+  
Sbjct: 206 DDGPSDLARRCSY---YIKMRPGLEEFLKRISELYEMHVYTMGTRAYAQNVARVVDPQRK 262

Query: 777 LFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMES-AVVIIDDSVRVWPHNKLNLIVVE 835
           LF  RVISR ++G+ F        +K L  +  + +  VVIIDD   VWP N+ NLI V 
Sbjct: 263 LFGNRVISRDENGNMF--------AKSLGRLFPVSTNMVVIIDDRSDVWPRNRPNLIKVS 314

Query: 836 RYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKI 881
            Y +F    +  G +  S L   HD  + D   A++ GV  +  K+
Sbjct: 315 PYEFF----KGIGDINSSFLPKRHDLLTSD---AATNGVQNKGRKM 353


>gi|388580688|gb|EIM21001.1| FCP1-like phosphatase [Wallemia sebi CBS 633.66]
          Length = 510

 Score =  115 bits (289), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 103/397 (25%), Positives = 177/397 (44%), Gaps = 84/397 (21%)

Query: 654 YDDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKE 713
           YD+ Q+     + T        +  + KL L++DLD T++++     VDP  +E+L    
Sbjct: 46  YDEAQRIGKTSKHT--------LLKSSKLALIVDLDQTIIHAT----VDPTVNELL---- 89

Query: 714 EQDREKPHR------HLFRFPHMGM----------WTKLRPGIWTFLERASKLFEMHLYT 757
            QD    ++      H F+    G+          + K RPG+  FL+  +KLFEMH+YT
Sbjct: 90  -QDPTLVYKGALNDVHKFKLGDFGLVNHHEFGSWYFVKFRPGLMEFLDNMNKLFEMHVYT 148

Query: 758 MGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVI 816
           MG + YA  + +++DP G  F  R++SR + G            K L+ +   + S  VI
Sbjct: 149 MGTRSYALAICQLIDPSGKYFGERILSRDESGS--------FTQKSLQRLFPTDTSMCVI 200

Query: 817 IDDSVRVWPHNKLNLIVVERYTYFPC--------SRRQFGLLGPSLLEIDHDERSEDG-- 866
           IDD   VW  +  NL+ V  + +F           R++         + D +E+ +D   
Sbjct: 201 IDDRADVWG-DSPNLVKVIPFEFFVGIGDINALNKRKKLRSNQNGDGDGDQEEQEQDSNE 259

Query: 867 ------------------------TLASSL----GVIERLHKIFFSHQSL-DDVDVRNIL 897
                                   T+   L     ++ ++H  F++ ++  DD D + I+
Sbjct: 260 SLSKQADQRPLAKRQKEHQQILSETIDEELPRLSKILTQIHSNFYNFKNAGDDPDTKEII 319

Query: 898 AAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGT 957
              +RK+L G ++VFS V P+G     +  ++  A +FGA    + +++VTHVVA   GT
Sbjct: 320 PTLKRKVLHGLKLVFSSVIPLG-MPLEISGIYNLASKFGATIDHNYNEKVTHVVAAKKGT 378

Query: 958 DKVNWA-LSTGRFVVHPGWVEASALLYRRANEQDFAI 993
            KV  A       VV   W+  S   + +  E+++ +
Sbjct: 379 AKVEDAKKGDSAHVVWSEWLLDSCAKWEKMPEENYYL 415


>gi|215794709|pdb|3EF0|A Chain A, The Structure Of Fcp1, An Essential Rna Polymerase Ii Ctd
           Phosphatase
          Length = 372

 Score =  115 bits (288), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 113/397 (28%), Positives = 181/397 (45%), Gaps = 101/397 (25%)

Query: 669 RLEEQ--KKMFSARKLCLVLDLDHTLLNSAKFHEVDP-----------VHDEILRKKEEQ 715
           RLE +  K++   ++L L++DLD T++++     VDP           V+ ++LR     
Sbjct: 4   RLESENVKRLRQEKRLSLIVDLDQTIIHAT----VDPTVGEWMSDPGNVNYDVLRDVRSF 59

Query: 716 D-REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 774
           + +E P  +   +     + K RPG+  FL++ S+L+E+H+YTMG K YA E+AK++DP 
Sbjct: 60  NLQEGPSGYTSCY-----YIKFRPGLAQFLQKISELYELHIYTMGTKAYAKEVAKIIDPT 114

Query: 775 GVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIV 833
           G LF  RV+SR D G            K L  +   + S VV+IDD   VW  N  NLI 
Sbjct: 115 GKLFQDRVLSRDDSGS--------LAQKSLRRLFPCDTSMVVVIDDRGDVWDWNP-NLIK 165

Query: 834 VERYTYF------------------------------------PCSRRQFGLL------G 851
           V  Y +F                                    P +++Q  LL       
Sbjct: 166 VVPYEFFVGIGDINSNFLSGNREALEEQNKERVTALELQKSERPLAKQQNALLEDEGKPT 225

Query: 852 PS---LLEIDHD-ERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRN----------IL 897
           PS   L   DH+ ER E         V++ +H +++  +  +D+  R+          I+
Sbjct: 226 PSHTLLHNRDHELERLEK--------VLKDIHAVYYEEE--NDISSRSGNHKHANVGLII 275

Query: 898 AAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGT 957
              ++K+L GCR++FS V P+G  +     + + A  FGA          TH++A  + T
Sbjct: 276 PKMKQKVLKGCRLLFSGVIPLG-VDVLSSDIAKWAMSFGAEVVLDFSVPPTHLIAAKIRT 334

Query: 958 DKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAI 993
           +KV  A+S G   VV   W+  S   ++R  E D+ +
Sbjct: 335 EKVKKAVSMGNIKVVKLNWLTESLSQWKRLPESDYLL 371


>gi|303280109|ref|XP_003059347.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226459183|gb|EEH56479.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 136

 Score =  115 bits (287), Expect = 1e-22,   Method: Composition-based stats.
 Identities = 60/122 (49%), Positives = 80/122 (65%), Gaps = 8/122 (6%)

Query: 735 KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDG 794
           KLRP    FL  AS + ++++YTMG+K YA EMAK+LDP G LF GRVI+        + 
Sbjct: 1   KLRPRAREFLRAASAMCQLYVYTMGDKNYAREMAKILDPTGELFNGRVIA--------NS 52

Query: 795 DERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSL 854
           D    ++KDL+ VLG E +V+I+DD+ RVWPHN  NLI ++RY +FP S   F   G S+
Sbjct: 53  DSTCSRTKDLDIVLGAEGSVLIVDDTDRVWPHNLANLIRIDRYHFFPQSAAGFRQPGRSV 112

Query: 855 LE 856
           LE
Sbjct: 113 LE 114


>gi|225194907|gb|ACN81954.1| C-terminal domain phosphatase-like 5 [Arabidopsis thaliana]
          Length = 601

 Score =  115 bits (287), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 81/256 (31%), Positives = 137/256 (53%), Gaps = 28/256 (10%)

Query: 648 EHLFEGYDDQQKA-AIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHD 706
           +++F+G     +A A+ K  T +L       + +KL LVLDLDHTLL++     +     
Sbjct: 355 DYIFDGLQLSHEAVALTKCFTTKL----SCLNEKKLHLVLDLDHTLLHTVMVPSLSQAEK 410

Query: 707 EILRKKEEQDREKPHRHLFRFPHMG----MWTKLRPGIWTFLERASKLFEMHLYTMGNKL 762
            ++ +     R+     L++   +G      TKLRP +  FL+ A++ F M++YT G+++
Sbjct: 411 YLIEEAGSATRDD----LWKIKAVGDPMEFLTKLRPFLRDFLKEANEFFTMYVYTKGSRV 466

Query: 763 YATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVR 822
           YA ++ +++DPK + F  RVI++ +           P  K L+ VL  E  VVI+DD+  
Sbjct: 467 YAKQVLELIDPKKLYFGDRVITKTES----------PHMKTLDFVLAEERGVVIVDDTRN 516

Query: 823 VWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIF 882
           VWP +K NL+ + +Y+YF    R  G       E   DE   +G LA+ L +++ +H+ F
Sbjct: 517 VWPDHKSNLVDISKYSYF----RLKGQDSMPYSEEKTDESESEGGLANVLKLLKEVHQRF 572

Query: 883 FS-HQSLDDVDVRNIL 897
           F   + L+  DVR++L
Sbjct: 573 FRVEEELESKDVRSLL 588



 Score =  115 bits (287), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 79/236 (33%), Positives = 125/236 (52%), Gaps = 25/236 (10%)

Query: 667 TRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFR 726
           T+RL  +    + +KL LVLDLD TL++S +   +      ++ +     RE   +   R
Sbjct: 74  TKRLITKFSCLNMKKLHLVLDLDLTLIHSVRVPCLSEAEKYLIEEAGSTTREDLWKMKVR 133

Query: 727 -------FPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA 779
                    H+    KLRP +  FL+ A+++F M++YT G + YA  + K++DPK + F 
Sbjct: 134 GDPISITIEHL---VKLRPFLCEFLKEANEMFTMYVYTKGTRPYAEAILKLIDPKKLYFG 190

Query: 780 GRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTY 839
            RVI+R +           P +K L+ VL  E  VVI+DD+ + WP+NK NL+++ RY Y
Sbjct: 191 HRVITRNES----------PHTKTLDMVLADERGVVIVDDTRKAWPNNKSNLVLIGRYNY 240

Query: 840 FPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFS-HQSLDDVDVR 894
           F   R Q  +L P   E   DE   +G LA+ L +++ +H  FF   + ++  DVR
Sbjct: 241 F---RSQSRVLKPHSEE-KTDESENNGGLANVLKLLKGIHHKFFKVEEEVESQDVR 292


>gi|186510238|ref|NP_001118664.1| haloacid dehalogenase-like hydrolase domain-containing protein
           [Arabidopsis thaliana]
 gi|9294424|dbj|BAB02544.1| unnamed protein product [Arabidopsis thaliana]
 gi|332642743|gb|AEE76264.1| haloacid dehalogenase-like hydrolase domain-containing protein
           [Arabidopsis thaliana]
          Length = 307

 Score =  114 bits (286), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 81/256 (31%), Positives = 137/256 (53%), Gaps = 28/256 (10%)

Query: 648 EHLFEGYDDQQKA-AIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHD 706
           +++F+G     +A A+ K  T +L       + +KL LVLDLDHTLL++     +     
Sbjct: 61  DYIFDGLQLSHEAVALTKCFTTKL----SCLNEKKLHLVLDLDHTLLHTVMVPSLSQAEK 116

Query: 707 EILRKKEEQDREKPHRHLFRFPHMG----MWTKLRPGIWTFLERASKLFEMHLYTMGNKL 762
            ++ +     R+     L++   +G      TKLRP +  FL+ A++ F M++YT G+++
Sbjct: 117 YLIEEAGSATRDD----LWKIKAVGDPMEFLTKLRPFLRDFLKEANEFFTMYVYTKGSRV 172

Query: 763 YATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVR 822
           YA ++ +++DPK + F  RVI++ +           P  K L+ VL  E  VVI+DD+  
Sbjct: 173 YAKQVLELIDPKKLYFGDRVITKTES----------PHMKTLDFVLAEERGVVIVDDTRN 222

Query: 823 VWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIF 882
           VWP +K NL+ + +Y+YF    R  G       E   DE   +G LA+ L +++ +H+ F
Sbjct: 223 VWPDHKSNLVDISKYSYF----RLKGQDSMPYSEEKTDESESEGGLANVLKLLKEVHQRF 278

Query: 883 FS-HQSLDDVDVRNIL 897
           F   + L+  DVR++L
Sbjct: 279 FRVEEELESKDVRSLL 294


>gi|171680434|ref|XP_001905162.1| hypothetical protein [Podospora anserina S mat+]
 gi|170939844|emb|CAP65069.1| unnamed protein product [Podospora anserina S mat+]
          Length = 835

 Score =  114 bits (285), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 66/179 (36%), Positives = 104/179 (58%), Gaps = 20/179 (11%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEV--------DPVHDEILRKKEEQDREKPHR 722
           E QK++  +RKL LV+DLD T++ +     V        +P +D +   K  Q  + PH 
Sbjct: 154 ELQKRLLESRKLSLVVDLDQTVIQACIDPTVGEWMKDPTNPNYDSVKNVKTFQLDDGPHA 213

Query: 723 HLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 782
            + +  +   + K+RPG+  FL+R S ++E+H+YTMG + YA  +A+V+DP+  LF  RV
Sbjct: 214 VVRKCWY---YIKMRPGLEGFLKRISTMYELHVYTMGTRAYAQNVARVIDPEKKLFGNRV 270

Query: 783 ISRGDDGDPFDGDERVPKSKDLEGVLGMES-AVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           ISR ++G+ +        SK L+ +  + +  VVIIDD   VWPHN+ NL+ V  Y +F
Sbjct: 271 ISRDENGNMY--------SKSLQRLFPVSTNMVVIIDDRSDVWPHNRPNLVKVTPYEFF 321


>gi|15237769|ref|NP_197738.1| haloacid dehalogenase-like hydrolase domain-containing protein
           [Arabidopsis thaliana]
 gi|9759085|dbj|BAB09563.1| unnamed protein product [Arabidopsis thaliana]
 gi|332005790|gb|AED93173.1| haloacid dehalogenase-like hydrolase domain-containing protein
           [Arabidopsis thaliana]
          Length = 302

 Score =  113 bits (283), Expect = 5e-22,   Method: Composition-based stats.
 Identities = 79/234 (33%), Positives = 119/234 (50%), Gaps = 21/234 (8%)

Query: 667 TRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFR 726
           T+ L  Q      +KL LVLDLD TL+++ K   +      I+  +E + R+   R    
Sbjct: 72  TKGLISQTSWLEDKKLHLVLDLDQTLIHTIKTSLLYESEKYII--EEVESRKDIKRFNTG 129

Query: 727 FPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG 786
           FP   +  KLRP +  FL+  +++F M++YT G   YA  + +++DP    F  RVI+R 
Sbjct: 130 FPEESL-IKLRPFVHQFLKECNEMFSMYVYTKGGYDYARLVLEMIDPDKFYFGNRVITRR 188

Query: 787 DDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQ 846
           +           P  K L+ VL  E  +VI+DD+  VWPH+K NL+ + RY YF      
Sbjct: 189 ES----------PGFKTLDLVLADERGIVIVDDTSSVWPHDKKNLLQIARYKYFGDKSCL 238

Query: 847 FGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFF---SHQSLDDVDVRNIL 897
           F      +     DE  E G L ++L  ++ +H+ FF   S + LD VDVR +L
Sbjct: 239 FSEDKKKI-----DESDEKGPLNTALRFLKDVHEEFFYDWSKKDLDSVDVRPLL 287


>gi|9294260|dbj|BAB02162.1| unnamed protein product [Arabidopsis thaliana]
          Length = 288

 Score =  112 bits (281), Expect = 9e-22,   Method: Composition-based stats.
 Identities = 70/234 (29%), Positives = 114/234 (48%), Gaps = 19/234 (8%)

Query: 667 TRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFR 726
           T+ L     ++  +KL LVLDLDHTL++S K   +      ++++++   R+   ++  R
Sbjct: 62  TKHLTTLVSVYGRKKLHLVLDLDHTLIHSMKTSNLSKAEKYLIKEEKSGSRKDLRKYNNR 121

Query: 727 FPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG 786
                   K RP +  FL+ A+KLF M  YT G   Y   + +++DP  + F  R+I+R 
Sbjct: 122 L------VKFRPFVEEFLKEANKLFTMTAYTKGGSTYGQAVVRMIDPNKIYFGDRIITRK 175

Query: 787 DDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQ 846
           +           P  K L+ VL  E  +VI+D++  VWPH+K NL+ +  Y YF    + 
Sbjct: 176 ES----------PDLKTLDLVLADERGIVIVDNTPNVWPHHKRNLLEITSYFYFKNDGKN 225

Query: 847 FGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFS---HQSLDDVDVRNIL 897
                 S  E   DE      L + L  ++ +H  FF+    + LD  DVR ++
Sbjct: 226 MMRSRLSYAERKSDESRTKRALVNLLKFLKEVHNGFFTCGLEEELDFKDVRYLI 279


>gi|340931931|gb|EGS19464.1| hypothetical protein CTHT_0049250 [Chaetomium thermophilum var.
           thermophilum DSM 1495]
          Length = 871

 Score =  111 bits (278), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 69/196 (35%), Positives = 109/196 (55%), Gaps = 22/196 (11%)

Query: 656 DQQKAAIQKERTRRLEE--QKKMFSARKLCLVLDLDHTLLNSA--------KFHEVDPVH 705
           DQ    +  E  +R+E+  Q+++  +RKL LV+DLD T++ +         +    +P H
Sbjct: 135 DQTNLRVGAEHAQRVEQELQRRLLQSRKLSLVVDLDQTIIQACIDPTVGEWQRDPTNPNH 194

Query: 706 DEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYAT 765
           D +   K  Q  + P   L R      + K+RPG+  FL+R S+++E+H+YTMG + YA 
Sbjct: 195 DAVKDVKSFQLDDGPS-ALAR--KCWYYIKMRPGLEGFLKRISEMYELHVYTMGTRAYAQ 251

Query: 766 EMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMES-AVVIIDDSVRVW 824
            +A+V+DP   LF  RVISR ++G+ +        +K L+ +  + +  VVIIDD   VW
Sbjct: 252 NVARVVDPDRKLFGNRVISRDENGNIY--------TKSLQRLFPVSTNMVVIIDDRSDVW 303

Query: 825 PHNKLNLIVVERYTYF 840
           P N+ NLI V  Y +F
Sbjct: 304 PRNRPNLIKVSPYEFF 319


>gi|380472901|emb|CCF46552.1| FCP1-like phosphatase, partial [Colletotrichum higginsianum]
          Length = 740

 Score =  109 bits (272), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 68/186 (36%), Positives = 101/186 (54%), Gaps = 32/186 (17%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILR-------------KKEEQDR 717
           E QK++   RKL LV+DLD T++++     ++P   E +              KK + + 
Sbjct: 152 ETQKRLLRQRKLSLVVDLDQTIIHAC----IEPTVGEWMEDPSNPNYEAVKDVKKFQLND 207

Query: 718 EKPHRHLFRFPHMGMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKG 775
           E P   +      G W   K+RPG+  FLER ++L+E+H+YTMG + YA  +AK++DP+ 
Sbjct: 208 EGPRGMVTS----GCWYYIKMRPGLAEFLERVAELYELHVYTMGTRAYALNIAKIVDPQQ 263

Query: 776 VLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMES-AVVIIDDSVRVWPHNKLNLIVV 834
            LF  RVISR ++G           SK L+ +  + +  VVIIDD   VWP N+ NLI V
Sbjct: 264 KLFGNRVISRDENGSMI--------SKSLQRLFPVNTNMVVIIDDRADVWPSNRPNLIKV 315

Query: 835 ERYTYF 840
             Y +F
Sbjct: 316 VPYDFF 321


>gi|320591286|gb|EFX03725.1| RNA polymerase 2 ctd phosphatase [Grosmannia clavigera kw1407]
          Length = 923

 Score =  108 bits (271), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 66/185 (35%), Positives = 103/185 (55%), Gaps = 32/185 (17%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPH----RHLFR 726
           E Q+++ S RKL LV+DLD T++++     +DP   E      +QD   P+    + + R
Sbjct: 159 ELQQRLLSQRKLSLVVDLDQTIIHAC----IDPTIGEW-----QQDPSNPNYEALKDVRR 209

Query: 727 FP--------HMGMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 776
           F           G W   K+RP +  FLE+ S ++E+H+YTMG + YAT +A+++DP   
Sbjct: 210 FQLEEGFQGLARGCWYYIKMRPHLTEFLEKISTMYELHVYTMGTRTYATNIAQIVDPNQK 269

Query: 777 LFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAV-VIIDDSVRVWPHNKLNLIVVE 835
           LF  RVISR ++G+          +K L+ +  + + + VIIDD   VWP+N+ NLI V 
Sbjct: 270 LFGNRVISRDENGNII--------AKSLQRLFPVSTNMAVIIDDRADVWPYNRHNLIKVN 321

Query: 836 RYTYF 840
            Y +F
Sbjct: 322 PYDFF 326


>gi|408390401|gb|EKJ69801.1| hypothetical protein FPSE_10001 [Fusarium pseudograminearum CS3096]
          Length = 765

 Score =  108 bits (270), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 67/179 (37%), Positives = 97/179 (54%), Gaps = 19/179 (10%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSA--------KFHEVDPVHDEILRKKEEQDREKPHR 722
           E QK++   RKL LV+DLD T++++         +    +P HD +   K  Q  +   R
Sbjct: 148 ENQKRLLRQRKLSLVVDLDQTIIHACIEPTIGEWQRDPSNPNHDAVKDVKSFQLNDDGPR 207

Query: 723 HLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 782
            +        + KLRPG+  FLE  SK++E+H+YTMG + YA  +AK++DP   LF  RV
Sbjct: 208 GVT--SGCTYYIKLRPGLMEFLEEVSKMYELHVYTMGTRAYALNIAKIVDPDKKLFGNRV 265

Query: 783 ISRGDDGDPFDGDERVPKSKDLEGVLGMES-AVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           ISR ++G           SK L+ +  + +  VVIIDD   VWP N+ NLI V  Y +F
Sbjct: 266 ISRDENGS--------ITSKSLQRLFPVSTDMVVIIDDRADVWPMNRPNLIKVVPYDFF 316


>gi|367032510|ref|XP_003665538.1| hypothetical protein MYCTH_2309412 [Myceliophthora thermophila ATCC
           42464]
 gi|347012809|gb|AEO60293.1| hypothetical protein MYCTH_2309412 [Myceliophthora thermophila ATCC
           42464]
          Length = 913

 Score =  108 bits (270), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 67/196 (34%), Positives = 107/196 (54%), Gaps = 33/196 (16%)

Query: 660 AAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREK 719
           +A+Q +RT + E Q+++  +RKL LV+DLD T++ +     +DP   E      ++D   
Sbjct: 142 SAVQAQRTEQ-ELQRRLLKSRKLSLVVDLDQTIIQAC----IDPTVGEW-----QKDPTN 191

Query: 720 PHRHL------FRFP--------HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYAT 765
           P+  L      F+              + K+RPG+  FL+R ++++E+H+YTMG + YA 
Sbjct: 192 PNHELAKEVKSFQLDDGPTDLARRCWYYIKMRPGLQDFLKRIAEMYELHVYTMGTRAYAQ 251

Query: 766 EMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMES-AVVIIDDSVRVW 824
            +A+V+DP   LF  RVISR ++G+ F        +K L  +  + +  V IIDD   VW
Sbjct: 252 NVARVVDPDKKLFGNRVISRDENGNIF--------AKSLHRLFPVSTHMVAIIDDRSDVW 303

Query: 825 PHNKLNLIVVERYTYF 840
           P N+ NLI V  Y +F
Sbjct: 304 PRNRPNLIKVSPYEFF 319


>gi|46126951|ref|XP_388029.1| hypothetical protein FG07853.1 [Gibberella zeae PH-1]
          Length = 765

 Score =  108 bits (270), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 67/179 (37%), Positives = 97/179 (54%), Gaps = 19/179 (10%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSA--------KFHEVDPVHDEILRKKEEQDREKPHR 722
           E QK++   RKL LV+DLD T++++         +    +P HD +   K  Q  +   R
Sbjct: 148 ENQKRLLRQRKLSLVVDLDQTIIHACIEPTIGEWQRDPSNPNHDAVKDVKSFQLNDDGPR 207

Query: 723 HLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 782
            +        + KLRPG+  FLE  SK++E+H+YTMG + YA  +AK++DP   LF  RV
Sbjct: 208 GVT--SGCTYYIKLRPGLMEFLEEVSKMYELHVYTMGTRAYALNIAKIVDPDKKLFGNRV 265

Query: 783 ISRGDDGDPFDGDERVPKSKDLEGVLGMES-AVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           ISR ++G           SK L+ +  + +  VVIIDD   VWP N+ NLI V  Y +F
Sbjct: 266 ISRDENGS--------ITSKSLQRLFPVSTDMVVIIDDRADVWPMNRPNLIKVVPYDFF 316


>gi|310791724|gb|EFQ27251.1| FCP1-like phosphatase [Glomerella graminicola M1.001]
          Length = 860

 Score =  108 bits (269), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 67/186 (36%), Positives = 101/186 (54%), Gaps = 32/186 (17%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILR-------------KKEEQDR 717
           E QK++   RKL LV+DLD T++++     ++P   E +              KK + + 
Sbjct: 152 ETQKRLLRQRKLSLVVDLDQTIIHAC----IEPTVGEWMEDPSNPNYQAVKDVKKFQLND 207

Query: 718 EKPHRHLFRFPHMGMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKG 775
           E P   +      G W   K+RPG+  FLE+ ++L+E+H+YTMG + YA  +AK++DP  
Sbjct: 208 EGPRGMVTS----GCWYYIKMRPGLAEFLEKVAELYELHVYTMGTRAYALNIAKIVDPHQ 263

Query: 776 VLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMES-AVVIIDDSVRVWPHNKLNLIVV 834
            LF  RVISR ++G           SK L+ +  + +  VVIIDD   VWP+N+ NLI V
Sbjct: 264 KLFGNRVISRDENGSMI--------SKSLQRLFPVNTNMVVIIDDRADVWPNNRPNLIKV 315

Query: 835 ERYTYF 840
             Y +F
Sbjct: 316 VPYDFF 321


>gi|402080254|gb|EJT75399.1| RNA polymerase II subunit A domain phosphatase [Gaeumannomyces
           graminis var. tritici R3-111a-1]
          Length = 850

 Score =  107 bits (268), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 69/194 (35%), Positives = 104/194 (53%), Gaps = 27/194 (13%)

Query: 658 QKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSA--------KFHEVDPVHDEIL 709
           +KAA + E     E QK++   RKL LV+DLD T++++         +    +P H+ + 
Sbjct: 150 KKAATKTE----FELQKRLLDQRKLILVVDLDQTIIHACIEPTIGDWQRDPTNPNHEAVK 205

Query: 710 RKKEEQDREKPHRHLFRFPHMGMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEM 767
             K  Q  +   R L      G W   K+RPG+  FLE+ + ++E+H+YTMG + YA  +
Sbjct: 206 DVKSFQLNDDGPRGLAS----GCWYYIKMRPGLVDFLEKIATMYELHVYTMGTRAYAMNI 261

Query: 768 AKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMES-AVVIIDDSVRVWPH 826
           AK++DP   LF  RVISR ++G           +K L+ +  + +  VVIIDD   VWP 
Sbjct: 262 AKIVDPDQKLFGNRVISRDENGS--------MTAKSLQRLFPVSTRMVVIIDDRADVWPR 313

Query: 827 NKLNLIVVERYTYF 840
           N+ NLI V  Y +F
Sbjct: 314 NRPNLIKVVPYDFF 327



 Score = 43.1 bits (100), Expect = 0.81,   Method: Compositional matrix adjust.
 Identities = 43/146 (29%), Positives = 68/146 (46%), Gaps = 17/146 (11%)

Query: 864 EDGTLASSLGVIERLHKIFF---------SHQSLDDV----DVRNILAAEQRKILAGCRI 910
           +D  LA     + +LH+ F+         + +S DD     DV N+L   + K+L G RI
Sbjct: 453 DDSELAYLEQHLTQLHEAFYRAHDAEVASAKKSADDQTNPPDVGNVLDGLKAKVLRGTRI 512

Query: 911 VFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTH-VVANSLG-TDKVNWALST-G 967
           V S + PVG  +     +   A  FGA     +  +VTH VV+NS   T K+  A    G
Sbjct: 513 VLSGLVPVG-VDARQSEIGLQARSFGATIRGKVSKRVTHLVVSNSRARTQKMVEAERIPG 571

Query: 968 RFVVHPGWVEASALLYRRANEQDFAI 993
             +V+  W+  S   ++  +E  +A+
Sbjct: 572 IKMVNQDWLINSLSQWKHLDETPYAV 597


>gi|346326901|gb|EGX96497.1| RNA Polymerase II CTD phosphatase Fcp1, putative [Cordyceps
           militaris CM01]
          Length = 780

 Score =  107 bits (268), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 70/196 (35%), Positives = 103/196 (52%), Gaps = 21/196 (10%)

Query: 656 DQQKAAIQKERTRRLEE--QKKMFSARKLCLVLDLDHTLLNSAKFHEV--------DPVH 705
           DQ    + +   +R+E   QK++   RKL LV+DLD T++++     V        +P H
Sbjct: 131 DQTGLLVSENVAQRVEHDTQKRLLRQRKLSLVVDLDQTIIHACIEPTVGEWQRDPSNPNH 190

Query: 706 DEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYAT 765
             +   +  Q ++   R L        + KLRPG+  FLE  SK++E+H+YTMG + YA 
Sbjct: 191 SAVKDVRSFQLKDDGPRGLA--SGCTYYIKLRPGLRDFLEEVSKMYELHVYTMGTRAYAL 248

Query: 766 EMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMES-AVVIIDDSVRVW 824
            +AK++DP   LF  RVISR ++G           +K L  +  + +  VVIIDD   VW
Sbjct: 249 NIAKIVDPDRKLFGNRVISRDENGS--------ITAKSLARLFPVSTDMVVIIDDRADVW 300

Query: 825 PHNKLNLIVVERYTYF 840
           P NK NLI V  Y +F
Sbjct: 301 PMNKANLIKVAAYDFF 316


>gi|302404507|ref|XP_003000091.1| RNA polymerase II subunit A C-terminal domain phosphatase
           [Verticillium albo-atrum VaMs.102]
 gi|261361273|gb|EEY23701.1| RNA polymerase II subunit A C-terminal domain phosphatase
           [Verticillium albo-atrum VaMs.102]
          Length = 755

 Score =  107 bits (267), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 69/198 (34%), Positives = 108/198 (54%), Gaps = 25/198 (12%)

Query: 656 DQQKAAIQKERTRRLEE--QKKMFSARKLCLVLDLDHTLLNSAKFHEV--------DPVH 705
           DQ    +  +   R E   Q+++   RKL LV+DLD T++++     V        +P +
Sbjct: 135 DQTGLMVSNDMAARAEHDAQRRLLRQRKLSLVVDLDQTIIHACIEPTVGEWMNDPENPNY 194

Query: 706 DEILRKKEEQDREKPHRHLFRFPHMGMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLY 763
           D +   ++ Q  ++  R + +    G W   K+RPG+  FLER ++L+E+H+YTMG + Y
Sbjct: 195 DAVKDVQKFQLNDEGPRGVTQ----GCWYYIKMRPGLREFLERVAELYELHVYTMGTRAY 250

Query: 764 ATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMES-AVVIIDDSVR 822
           A  +AK++DP+  LF  RVISR ++G           SK L+ +  + +  VVIIDD   
Sbjct: 251 ALNIAKIVDPQQKLFGNRVISRDENGS--------ITSKSLQRLFPVSTNMVVIIDDRAD 302

Query: 823 VWPHNKLNLIVVERYTYF 840
           VWP N+ NLI V  Y +F
Sbjct: 303 VWPRNRPNLIKVVPYDFF 320


>gi|346975758|gb|EGY19210.1| RNA polymerase II subunit A C-terminal domain phosphatase
           [Verticillium dahliae VdLs.17]
          Length = 818

 Score =  107 bits (266), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 66/179 (36%), Positives = 100/179 (55%), Gaps = 23/179 (12%)

Query: 673 QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEE------QDREKPHRHLF- 725
           Q+++   RKL LV+DLD T++++     ++P   E +   E       +D EK   +   
Sbjct: 154 QRRLLRQRKLSLVVDLDQTIIHAC----IEPTVGEWMNDPENPNYDAVKDVEKFQLNDEG 209

Query: 726 -RFPHMGMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 782
            R    G W   K+RPG+  FLE+ ++L+E+H+YTMG + YA  +AK++DP+  LF  RV
Sbjct: 210 PRGVTQGCWYYIKMRPGLREFLEKVAELYELHVYTMGTRAYALNIAKIVDPQQKLFGNRV 269

Query: 783 ISRGDDGDPFDGDERVPKSKDLEGVLGMES-AVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           ISR ++G           SK L+ +  + +  VVIIDD   VWP N+ NLI V  Y +F
Sbjct: 270 ISRDENGS--------ITSKSLQRLFPVSTNMVVIIDDRADVWPRNRPNLIKVVPYDFF 320


>gi|429854785|gb|ELA29772.1| RNA polymerase ii ctd phosphatase [Colletotrichum gloeosporioides
           Nara gc5]
          Length = 829

 Score =  106 bits (265), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 69/194 (35%), Positives = 107/194 (55%), Gaps = 26/194 (13%)

Query: 673 QKKMFSARKLCLVLDLDHTLLNSA------KFHE--VDPVHDEILRKKEEQDREKPHRHL 724
           QK++   RKL LV+DLD T++++       ++ E   +P ++ +   K+ Q  ++  R +
Sbjct: 154 QKRLLRQRKLSLVVDLDQTIIHACIEPTVGEWMEDPTNPNYNAVKDVKKFQLNDEGPRGV 213

Query: 725 FRFPHMGMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 782
                 G W   K+RPG+  FLE+ S+L+E+H+YTMG + YA  +A+++DP   LF  RV
Sbjct: 214 VT---SGCWYYIKMRPGLKEFLEKISELYELHVYTMGTRAYAMNIAQIVDPDRKLFGNRV 270

Query: 783 ISRGDDGDPFDGDERVPKSKDLEGVLGMES-AVVIIDDSVRVWPHNKLNLIVVERYTYFP 841
           ISR ++G           SK L+ +  + +  VVIIDD   VWP N+ NLI V  Y +F 
Sbjct: 271 ISRDENGSMI--------SKSLQRLFPVNTNMVVIIDDRADVWPRNRPNLIKVVPYDFF- 321

Query: 842 CSRRQFGLLGPSLL 855
              R  G +  S L
Sbjct: 322 ---RGIGDINSSFL 332


>gi|116179414|ref|XP_001219556.1| hypothetical protein CHGG_00335 [Chaetomium globosum CBS 148.51]
 gi|88184632|gb|EAQ92100.1| hypothetical protein CHGG_00335 [Chaetomium globosum CBS 148.51]
          Length = 828

 Score =  106 bits (265), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 62/190 (32%), Positives = 106/190 (55%), Gaps = 22/190 (11%)

Query: 656 DQQKAAIQKERTRRLEE--QKKMFSARKLCLVLDLDHTLLNSA--------KFHEVDPVH 705
           DQ    +     +R E+  Q+++  +RKL LV+DLD T++ +         +    +P H
Sbjct: 135 DQTNLTVSASHAQRTEQELQRRLLVSRKLSLVVDLDQTIIQACIDPTVGDWQKDPTNPNH 194

Query: 706 DEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYAT 765
           + +   K  Q  + P +   +  +   + K+RPG+ +FL+R ++++E+H+YTMG + YA 
Sbjct: 195 ESVKSVKSFQLDDGPTQAANQCSY---YIKMRPGLESFLKRIAQMYELHVYTMGTRAYAQ 251

Query: 766 EMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMES-AVVIIDDSVRVW 824
            +A+V+DP   LF  RVISR ++G  +        +KDL+ +  + +  V IIDD   VW
Sbjct: 252 NVARVVDPDKKLFGNRVISRDENGSIY--------AKDLQRLFPISTHMVAIIDDRSDVW 303

Query: 825 PHNKLNLIVV 834
           P+N+ NLI V
Sbjct: 304 PNNRANLIKV 313


>gi|322706326|gb|EFY97907.1| RNA Polymerase II CTD phosphatase Fcp1 [Metarhizium anisopliae
           ARSEF 23]
          Length = 807

 Score =  105 bits (262), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 64/177 (36%), Positives = 96/177 (54%), Gaps = 19/177 (10%)

Query: 673 QKKMFSARKLCLVLDLDHTLLNSA--------KFHEVDPVHDEILRKKEEQDREKPHRHL 724
           QK++   RKL LV+DLD T++++         +  E +P H+ +   K  Q  +   R L
Sbjct: 150 QKRLLRQRKLSLVVDLDQTIIHACIEPTIGEWQKDESNPNHEAVKDVKSFQLNDDGPRGL 209

Query: 725 FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 784
                   + KLRPG+  FLE  + ++E+H+YTMG + YA  +A+++DP   LF  RVIS
Sbjct: 210 A--SGCTYYIKLRPGLQEFLEEIATMYELHVYTMGTRAYALNIARIVDPDRKLFGNRVIS 267

Query: 785 RGDDGDPFDGDERVPKSKDLEGVLGMES-AVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           R ++G           SK L+ +  + +  VVIIDD   VWP N+ NLI V  Y +F
Sbjct: 268 RDENGS--------ITSKSLQRLFPVSTNMVVIIDDRADVWPRNRPNLIKVVPYDFF 316


>gi|336466789|gb|EGO54953.1| hypothetical protein NEUTE1DRAFT_84976 [Neurospora tetrasperma FGSC
           2508]
 gi|350288620|gb|EGZ69856.1| hypothetical protein NEUTE2DRAFT_160171 [Neurospora tetrasperma
           FGSC 2509]
          Length = 867

 Score =  105 bits (261), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 70/223 (31%), Positives = 116/223 (52%), Gaps = 42/223 (18%)

Query: 656 DQQKAAIQKERTRRLEE--QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKE 713
           DQ    + + + ++ E   Q+++   RKL LV+DLD T++++     +DP   E      
Sbjct: 135 DQTHLTVSETQAQKTENALQRRLLQHRKLSLVVDLDQTIIHAC----IDPTVGEW----- 185

Query: 714 EQDREKPH----RHLFRFP----------HMGMWTKLRPGIWTFLERASKLFEMHLYTMG 759
           ++D   P+    R++  F           +   + K+RPG+  FL++ S ++E+H+YTMG
Sbjct: 186 QKDPSNPNYPSVRNVKSFQLDDGPRGVANNCWYYIKMRPGLEDFLKKISTMYELHVYTMG 245

Query: 760 NKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIID 818
            + YA  +A+++DP   LF  RVISR ++G+ +        +K L+ +  + +  VVIID
Sbjct: 246 TRAYAQNVARIVDPDKKLFGNRVISRDENGNMY--------AKSLQRLFPVSTKMVVIID 297

Query: 819 DSVRVWPHNKLNLIVVERYTYFPCSR--------RQFGLLGPS 853
           D   VWP N+ NLI V  Y +F            +Q GLL PS
Sbjct: 298 DRADVWPRNRPNLIKVSPYDFFKGIGDINSGFLPKQQGLLTPS 340


>gi|342878347|gb|EGU79693.1| hypothetical protein FOXB_09806 [Fusarium oxysporum Fo5176]
          Length = 769

 Score =  105 bits (261), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 64/179 (35%), Positives = 98/179 (54%), Gaps = 19/179 (10%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSA--------KFHEVDPVHDEILRKKEEQDREKPHR 722
           E QK++   RKL LV+DLD T++++         K    +P ++ +   ++ Q  +   R
Sbjct: 148 ENQKRLLRQRKLSLVVDLDQTIIHACIEPTIGEWKNDPTNPNYEAVKDVRDFQLNDDGPR 207

Query: 723 HLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 782
            L        + KLRPG+  FL+  SK++E+H+YTMG + YA  +AK++DP   LF  RV
Sbjct: 208 GLT--SGCTYYIKLRPGLMEFLDEVSKMYELHVYTMGTRAYALNIAKIVDPDQKLFGNRV 265

Query: 783 ISRGDDGDPFDGDERVPKSKDLEGVLGMES-AVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           ISR ++G           +K L+ +  + +  VVIIDD   VWP N+ NLI V  Y +F
Sbjct: 266 ISRDENGS--------ITAKSLQRLFPVSTDMVVIIDDRADVWPMNRPNLIKVVPYDFF 316


>gi|302889251|ref|XP_003043511.1| predicted protein [Nectria haematococca mpVI 77-13-4]
 gi|256724428|gb|EEU37798.1| predicted protein [Nectria haematococca mpVI 77-13-4]
          Length = 765

 Score =  105 bits (261), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 67/179 (37%), Positives = 97/179 (54%), Gaps = 20/179 (11%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSA--------KFHEVDPVHDEILRKKEEQDREKPHR 722
           E QK++   RKL LV+DLD T++++         +    +P H  +   K  Q  + P R
Sbjct: 148 ESQKRLLRQRKLTLVVDLDQTIIHACIEPTIGEWQRDPTNPNHQAVKDVKSFQLDDGP-R 206

Query: 723 HLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 782
            L        + KLRPG+  FLE  SK++E+H+YTMG + YA  +A+++DP   LF  RV
Sbjct: 207 GLA--SGCTYYIKLRPGLAEFLEEISKMYELHVYTMGTRAYALNIARIVDPDKKLFGNRV 264

Query: 783 ISRGDDGDPFDGDERVPKSKDLEGVLGMES-AVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           ISR ++G           SK L+ +  + +  VVIIDD   VWP N+ NLI V  Y +F
Sbjct: 265 ISRDENGS--------ITSKSLQRLFPVSTDMVVIIDDRADVWPLNRPNLIKVVPYDFF 315


>gi|164429292|ref|XP_958446.2| hypothetical protein NCU11408 [Neurospora crassa OR74A]
 gi|157073422|gb|EAA29210.2| conserved hypothetical protein [Neurospora crassa OR74A]
          Length = 868

 Score =  104 bits (260), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 70/223 (31%), Positives = 116/223 (52%), Gaps = 42/223 (18%)

Query: 656 DQQKAAIQKERTRRLEE--QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKE 713
           DQ    + + + ++ E   Q+++   RKL LV+DLD T++++     +DP   E      
Sbjct: 135 DQTHLTVSETQAQKTENALQRRLLQHRKLSLVVDLDQTIIHAC----IDPTVGEW----- 185

Query: 714 EQDREKPH----RHLFRFP----------HMGMWTKLRPGIWTFLERASKLFEMHLYTMG 759
           ++D   P+    R++  F           +   + K+RPG+  FL++ S ++E+H+YTMG
Sbjct: 186 QKDPSNPNYPSVRNVKSFQLDDGPRGVANNCWYYIKMRPGLEDFLKKISTMYELHVYTMG 245

Query: 760 NKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIID 818
            + YA  +A+++DP   LF  RVISR ++G+ +        +K L+ +  + +  VVIID
Sbjct: 246 TRAYAQNVARIVDPDKKLFGNRVISRDENGNMY--------AKSLQRLFPVSTKMVVIID 297

Query: 819 DSVRVWPHNKLNLIVVERYTYFPCSR--------RQFGLLGPS 853
           D   VWP N+ NLI V  Y +F            +Q GLL PS
Sbjct: 298 DRADVWPRNRPNLIKVSPYDFFKGIGDINSGFLPKQQGLLTPS 340


>gi|336259270|ref|XP_003344437.1| hypothetical protein SMAC_08633 [Sordaria macrospora k-hell]
 gi|380087533|emb|CCC05319.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 878

 Score =  104 bits (260), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 64/202 (31%), Positives = 110/202 (54%), Gaps = 34/202 (16%)

Query: 656 DQQKAAIQKERTRRLEE--QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKE 713
           DQ    + + + ++ E   Q+++   RKL LV+DLD T++++     +DP   E      
Sbjct: 135 DQTHLTVSETQAQKTENALQRRLLQHRKLSLVVDLDQTIIHAC----IDPTVGEW----- 185

Query: 714 EQDREKPH----RHLFRFP----------HMGMWTKLRPGIWTFLERASKLFEMHLYTMG 759
           ++D   P+    R++  F           +   + K+RPG+  FL++ S ++E+H+YTMG
Sbjct: 186 QKDPSNPNYPSVRNVKSFQLDDGPRGVANNCWYYIKMRPGLEDFLKKISTMYELHVYTMG 245

Query: 760 NKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIID 818
            + YA  +A+++DP+  LF  RVISR ++G+ +        +K L+ +  + +  VVIID
Sbjct: 246 TRAYAQNVARIVDPEKKLFGNRVISRDENGNMY--------AKSLQRLFPVSTKMVVIID 297

Query: 819 DSVRVWPHNKLNLIVVERYTYF 840
           D   VWP N+ NLI V  Y +F
Sbjct: 298 DRADVWPRNRPNLIKVSPYDFF 319


>gi|281206665|gb|EFA80851.1| putative tfiif-interacting component of the c-terminal domain
           phosphatase [Polysphondylium pallidum PN500]
          Length = 881

 Score =  104 bits (260), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 68/184 (36%), Positives = 97/184 (52%), Gaps = 21/184 (11%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSA---KFHEVDPVHDEILRKKEEQDREKPHRHLFRF 727
           E  K++   RKL LVLD+DHT++++     F EV P    I       D EK +      
Sbjct: 266 ENAKRLIKQRKLSLVLDIDHTIIHAIMEPHFMEV-PYWRNI-------DCEKENIRSITL 317

Query: 728 PHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD 787
            +M  + KLRP ++ FLE  +K FE+H+YTMG + YA E+AK++D K  LF  R++SR D
Sbjct: 318 GNMKYYIKLRPFLYKFLEDVNKKFELHIYTMGTRNYALEIAKLIDEKQELFKERILSRDD 377

Query: 788 DGD-PFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQ 846
             D  F   +R+    D        S V+I+DD   VW  +K NL+ +  Y YF   +  
Sbjct: 378 TTDMSFKTLQRLFPCDD--------SMVLIVDDRSDVWKRSK-NLVQISPYLYFVGCKDM 428

Query: 847 FGLL 850
             LL
Sbjct: 429 VNLL 432



 Score = 79.0 bits (193), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 44/131 (33%), Positives = 74/131 (56%), Gaps = 2/131 (1%)

Query: 865 DGTLASSLGVIERLHKIFFS-HQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANP 923
           D  L +    +  +H+I++  +   + + V ++L+  ++KIL G  +VFS V+P+ +   
Sbjct: 548 DKHLLTVWNQLSEVHRIYYEEYDKGNKLHVADVLSIVKKKILNGVNLVFSGVYPL-QLPA 606

Query: 924 HLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLY 983
           H  PL   AE+ GA     I +  THVVA   GT KV+ A+S G  +V+  W+E SA  +
Sbjct: 607 HRQPLRLLAEELGATVQNDITNTTTHVVAARKGTSKVHKAISKGLKIVNQNWIEQSAFQW 666

Query: 984 RRANEQDFAIK 994
           +R +E DF ++
Sbjct: 667 QRLDEADFPVQ 677


>gi|358390781|gb|EHK40186.1| hypothetical protein TRIATDRAFT_89336 [Trichoderma atroviride IMI
           206040]
          Length = 768

 Score =  104 bits (259), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 64/179 (35%), Positives = 97/179 (54%), Gaps = 19/179 (10%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSA--------KFHEVDPVHDEILRKKEEQDREKPHR 722
           E QK++   RKL LV+DLD T++++         +  + +P H+ +   K  Q  +   R
Sbjct: 148 ESQKRLLRQRKLSLVVDLDQTIIHACIEPTVGEWQRDKANPNHEAVKDVKSFQLNDDGPR 207

Query: 723 HLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 782
            L        + KLRPG+  FLE  S ++E+H+YTMG + YA  +A+++DP   LF  RV
Sbjct: 208 GLA--SGCTYYIKLRPGLHEFLETVSTMYELHVYTMGTRAYALNIARIVDPDKKLFGNRV 265

Query: 783 ISRGDDGDPFDGDERVPKSKDLEGVLGMES-AVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           ISR ++G           +K L+ +  + +  VVIIDD   VWP N+ NLI V  Y +F
Sbjct: 266 ISRDENGS--------ITAKSLQRLFPVSTDMVVIIDDRSDVWPMNRPNLIKVVPYDFF 316


>gi|378731871|gb|EHY58330.1| protein phosphatase [Exophiala dermatitidis NIH/UT8656]
          Length = 856

 Score =  104 bits (259), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 72/191 (37%), Positives = 109/191 (57%), Gaps = 26/191 (13%)

Query: 662 IQKERTRRLEEQ--KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEE--QDR 717
           I KE   R++E+  +++ S+RKL LV+DLD T++++A    VDP   E  + K+    D 
Sbjct: 139 ISKEEAARIDEEAKRRLLSSRKLSLVVDLDQTIIHAA----VDPTIAEWQKDKDNPNYDA 194

Query: 718 EKPHRHLFRF----PHM-GMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKV 770
            K  R  F+     P M G W   KLRPG+  FLE  S+L+EMH+YTMG + YA ++A +
Sbjct: 195 VKDVRS-FQLIDDGPGMRGCWYYIKLRPGLTEFLEHISQLYEMHIYTMGTRQYAQQIAAI 253

Query: 771 LDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKL 829
           +DP+   F  R++SR + G           +K+LE +  +++  VVIIDD   VW  +  
Sbjct: 254 VDPERKFFGDRILSRDESGSMV--------AKNLERLFPVDTKMVVIIDDRGDVWKWSA- 304

Query: 830 NLIVVERYTYF 840
           NLI V  + +F
Sbjct: 305 NLIRVRPFDFF 315



 Score = 60.1 bits (144), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 35/103 (33%), Positives = 56/103 (54%), Gaps = 1/103 (0%)

Query: 892 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVV 951
           D++ ++ A + K+L+G  IVFS V P+G  +     +   A+ FGA  T  +   VTHVV
Sbjct: 527 DIKKVMPAMKMKVLSGVTIVFSGVLPLG-TDIQSADISTWAKTFGATITDKVGRGVTHVV 585

Query: 952 ANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAIK 994
           A   GT KV  A+  G  VV   W+  S   +R+ +E+ + ++
Sbjct: 586 AARPGTAKVKQAVKRGIKVVGTAWLIESMQQWRKLDEKPYLLE 628


>gi|297741470|emb|CBI32601.3| unnamed protein product [Vitis vinifera]
          Length = 147

 Score =  103 bits (258), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 60/143 (41%), Positives = 82/143 (57%), Gaps = 11/143 (7%)

Query: 758 MGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVII 817
           MG + YA EM KVLDP+ V F+  VIS+ D              K L+ VLG +SAV+I+
Sbjct: 1   MGEQFYALEMVKVLDPRTVYFSSSVISQADSTQR--------HQKGLDVVLGPKSAVLIL 52

Query: 818 DDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIER 877
           DD+ R W ++K NLI++ERY +F  S  QFG    SL E+  DE   DG LA+ L V+++
Sbjct: 53  DDTERAWKNHKDNLILMERYHFFASSCHQFGFHCKSLSELKSDESEPDGALATILKVLQQ 112

Query: 878 LHKIFFSHQSLDDV---DVRNIL 897
            H   F  +  D+    DVR +L
Sbjct: 113 THSTLFDPELSDNFSGRDVRQVL 135


>gi|148677457|gb|EDL09404.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
           phosphatase, subunit 1, isoform CRA_a [Mus musculus]
          Length = 956

 Score =  103 bits (258), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 66/181 (36%), Positives = 101/181 (55%), Gaps = 21/181 (11%)

Query: 662 IQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPH 721
           +Q E+  R E+Q+++   RKL L++DLD TL+++ + H     +  I            H
Sbjct: 160 MQAEKLGR-EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCPQMSNKGIF-----------H 207

Query: 722 RHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGR 781
             L R   M + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R
Sbjct: 208 FQLGRGEPM-LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHR 266

Query: 782 VISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           ++SR +  DPF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 267 ILSRDECIDPFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 319

Query: 841 P 841
           P
Sbjct: 320 P 320



 Score = 45.8 bits (107), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 36/138 (26%), Positives = 64/138 (46%), Gaps = 23/138 (16%)

Query: 874 VIERLHKIFFS------HQSLDDV-DVRNILAAEQRKILAGCRIVFSRV----FPVGEAN 922
           ++ R+H  +++      ++ L++  D+R I+   + K+LA   ++FS +    FPV +  
Sbjct: 582 ILVRVHTDYYTKYDRYLNKELEEAPDIRKIVPELKSKVLADVAVIFSGLHPTNFPVEKTR 641

Query: 923 PHLHPLWQTAEQFGA------VCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGW 975
            H H     A   GA      V +    D+ TH++A   GT+KV  A       VV P W
Sbjct: 642 EHYH-----ATALGAKVLTQLVLSPDAPDRATHLIAARAGTEKVRQAQECKHLHVVSPDW 696

Query: 976 VEASALLYRRANEQDFAI 993
           + +    + +  EQ F +
Sbjct: 697 LWSCLERWDKVEEQLFPL 714


>gi|351695852|gb|EHA98770.1| hypothetical protein GW7_03722 [Heterocephalus glaber]
          Length = 963

 Score =  103 bits (257), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 65/183 (35%), Positives = 102/183 (55%), Gaps = 22/183 (12%)

Query: 662 IQKERTRRL--EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREK 719
           +  E+  +L  E+Q+++   RKL L++DLD TL+++ + H     +  I           
Sbjct: 159 VSSEQAEKLGKEDQQRLHRNRKLVLMVDLDQTLIHTTEQHCPQMSNKGIF---------- 208

Query: 720 PHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA 779
            H  L R   M + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+
Sbjct: 209 -HFQLGRGEPM-LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFS 266

Query: 780 GRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYT 838
            R++SR +  DPF       K+ +L+ +    +S V IIDD   VW     NLI V++Y 
Sbjct: 267 HRILSRDECIDPFS------KTGNLKNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYV 319

Query: 839 YFP 841
           YFP
Sbjct: 320 YFP 322



 Score = 50.4 bits (119), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 36/133 (27%), Positives = 65/133 (48%), Gaps = 13/133 (9%)

Query: 874 VIERLHKIFFS------HQSLDDV-DVRNILAAEQRKILAGCRIVFSRV----FPVGEAN 922
           ++ R+H  +++      ++ L++  D+R I+   + K+LAG  ++FS +    FPV +  
Sbjct: 594 ILVRVHTDYYTKYDRYLNKELEEAPDIRKIVPELKSKVLAGVTVIFSGLHPTNFPVEKTR 653

Query: 923 PHLHPLWQTAEQFGA-VCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASA 980
            H H     A+     V +    D+ THV+A   GT+KV+ A       VV P W+ +  
Sbjct: 654 EHYHATALGAKVLTQLVLSPDAPDRATHVIAARAGTEKVHQAQECRHLHVVSPDWLWSCL 713

Query: 981 LLYRRANEQDFAI 993
             + +  EQ F +
Sbjct: 714 ERWDKVEEQLFPL 726


>gi|157823025|ref|NP_001099601.1| RNA polymerase II subunit A C-terminal domain phosphatase [Rattus
           norvegicus]
 gi|149015915|gb|EDL75222.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
           phosphatase, subunit 1 (predicted), isoform CRA_a
           [Rattus norvegicus]
          Length = 969

 Score =  103 bits (257), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 66/181 (36%), Positives = 101/181 (55%), Gaps = 21/181 (11%)

Query: 662 IQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPH 721
           +Q E+  R E+Q+++   RKL L++DLD TL+++ + H     +  I            H
Sbjct: 160 MQAEKLGR-EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCPQMSNKGIF-----------H 207

Query: 722 RHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGR 781
             L R   M + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R
Sbjct: 208 FQLGRGEPM-LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHR 266

Query: 782 VISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           ++SR +  DPF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 267 ILSRDECIDPFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 319

Query: 841 P 841
           P
Sbjct: 320 P 320



 Score = 45.4 bits (106), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 36/138 (26%), Positives = 64/138 (46%), Gaps = 23/138 (16%)

Query: 874 VIERLHKIFFS------HQSLDDV-DVRNILAAEQRKILAGCRIVFSRV----FPVGEAN 922
           ++ R+H  +++      ++ L++  D+R I+   + K+LA   ++FS +    FPV +  
Sbjct: 595 ILVRVHTDYYTKYDRYLNKELEEAPDIRKIVPELKSKVLADVAVIFSGLHPTNFPVEKTR 654

Query: 923 PHLHPLWQTAEQFGA------VCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGW 975
            H H     A   GA      V +    D+ TH++A   GT+KV  A       VV P W
Sbjct: 655 EHYH-----ATALGAKVLTQLVLSPDAPDRATHLIAARAGTEKVRQAQECKHLHVVSPEW 709

Query: 976 VEASALLYRRANEQDFAI 993
           + +    + +  EQ F +
Sbjct: 710 LWSCLERWDKVEEQLFPL 727


>gi|348555132|ref|XP_003463378.1| PREDICTED: RNA polymerase II subunit A C-terminal domain
           phosphatase [Cavia porcellus]
          Length = 970

 Score =  103 bits (257), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 66/180 (36%), Positives = 100/180 (55%), Gaps = 21/180 (11%)

Query: 663 QKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR 722
           Q E+  R E+Q+++   RKL L++DLD TL+++ + H     +  I            H 
Sbjct: 165 QAEKLGR-EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCPQMSNKGIF-----------HF 212

Query: 723 HLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 782
            L R   M + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R+
Sbjct: 213 QLGRGEPM-LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRI 271

Query: 783 ISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYFP 841
           +SR +  DPF       K+ +L  +    +S V IIDD   VW     NLI V++Y YFP
Sbjct: 272 LSRDECIDPFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYFP 324



 Score = 45.4 bits (106), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 36/138 (26%), Positives = 64/138 (46%), Gaps = 23/138 (16%)

Query: 874 VIERLHKIFFS------HQSLDDV-DVRNILAAEQRKILAGCRIVFSRV----FPVGEAN 922
           ++ R+H  +++      ++ L++  D+R I+   + K+LA   ++FS +    FPV +  
Sbjct: 597 ILVRVHTDYYTKYDRYLNKELEEAPDIRKIVPELKSKVLADVTLIFSGLHPTNFPVEKTR 656

Query: 923 PHLHPLWQTAEQFGA------VCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGW 975
            H H     A   GA      V +    D+ TH++A   GT+KV  A       VV P W
Sbjct: 657 EHYH-----ATALGAKVLTQLVLSPDAPDRATHLIAARAGTEKVRQAQECKHLHVVSPDW 711

Query: 976 VEASALLYRRANEQDFAI 993
           + +    + +  EQ F +
Sbjct: 712 LWSCLERWDKVEEQLFPL 729


>gi|400603434|gb|EJP71032.1| FCP1-like phosphatase [Beauveria bassiana ARSEF 2860]
          Length = 774

 Score =  103 bits (256), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 66/177 (37%), Positives = 94/177 (53%), Gaps = 19/177 (10%)

Query: 673 QKKMFSARKLCLVLDLDHTLLNSAKFHEV--------DPVHDEILRKKEEQDREKPHRHL 724
           QK++   RKL LV+DLD T++++     V        +P H  +   +  Q  +   R L
Sbjct: 150 QKRLLRHRKLSLVVDLDQTIIHACIEPTVGEWQRDPSNPNHSAVKDVRSFQLNDDGPRGL 209

Query: 725 FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 784
                   + KLRPG+  FLE  SK++E+H+YTMG + YA  +AK++DP   LF  RVIS
Sbjct: 210 A--SGCTYYIKLRPGLSEFLEEISKMYELHVYTMGTRAYALNIAKIVDPDRKLFGNRVIS 267

Query: 785 RGDDGDPFDGDERVPKSKDLEGVLGMES-AVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           R ++G           SK L  +  + +  VVIIDD   VWP N+ NLI V  Y +F
Sbjct: 268 RDENGS--------ITSKSLARLFPVSTDMVVIIDDRADVWPMNRPNLIKVVPYDFF 316


>gi|429963056|gb|ELA42600.1| FCP1-like phosphatase, phosphatase domain-containing protein
           [Vittaforma corneae ATCC 50505]
          Length = 445

 Score =  103 bits (256), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 70/256 (27%), Positives = 118/256 (46%), Gaps = 52/256 (20%)

Query: 584 MSVSQPLTSEPMVSQNSPIQPGQIKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQSA 643
           M +   L  E ++  N+     +I S   +                G +  PV  H    
Sbjct: 8   MKIKDCLNDETLIEANNCTHSLRIDSLCAICG---------AEILKGTDLVPVLHHT--- 55

Query: 644 WGDVEHLFEGYDDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDP 703
               + +F+  ++ +K  +QK R ++L E+KKM       L+LDLD T+L++  + ++D 
Sbjct: 56  ----DRVFQTSEEARK--LQKIRNKQLNEEKKMI------LILDLDQTILHTTLW-KIDC 102

Query: 704 VHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLY 763
                                F       + KLRP +  FLE+ SK+FE+H+YTMG + Y
Sbjct: 103 ------------------DFTFSISSTMFYVKLRPHLNRFLEKISKMFEIHIYTMGTREY 144

Query: 764 ATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRV 823
            TE+ K +DP G+ F  R++SR ++ +           K +E +  +   VVIIDD   V
Sbjct: 145 VTEICKAIDPNGIYFGDRIVSRNENFNEL--------KKSIERITCISRNVVIIDDRADV 196

Query: 824 WPHNKLNLIVVERYTY 839
           W ++K NL+++  + Y
Sbjct: 197 WNYSK-NLVLIRPFWY 211


>gi|74140094|dbj|BAE33777.1| unnamed protein product [Mus musculus]
          Length = 960

 Score =  103 bits (256), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 66/180 (36%), Positives = 100/180 (55%), Gaps = 21/180 (11%)

Query: 663 QKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR 722
           Q E+  R E+Q+++   RKL L++DLD TL+++ + H     +  I            H 
Sbjct: 165 QAEKLGR-EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCPQMSNKGIF-----------HF 212

Query: 723 HLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 782
            L R   M + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R+
Sbjct: 213 QLGRGEPM-LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRI 271

Query: 783 ISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYFP 841
           +SR +  DPF       K+ +L  +    +S V IIDD   VW     NLI V++Y YFP
Sbjct: 272 LSRDECIDPFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYFP 324



 Score = 45.8 bits (107), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 36/138 (26%), Positives = 64/138 (46%), Gaps = 23/138 (16%)

Query: 874 VIERLHKIFFS------HQSLDDV-DVRNILAAEQRKILAGCRIVFSRV----FPVGEAN 922
           ++ R+H  +++      ++ L++  D+R I+   + K+LA   ++FS +    FPV +  
Sbjct: 586 ILVRVHTDYYTKYDRYLNKELEEAPDIRKIVPELKSKVLADVAVIFSGLHPTNFPVEKTR 645

Query: 923 PHLHPLWQTAEQFGA------VCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGW 975
            H H     A   GA      V +    D+ TH++A   GT+KV  A       VV P W
Sbjct: 646 EHYH-----ATALGAKVLTQLVLSPDAPDRATHLIAARAGTEKVRQAQECKHLHVVSPDW 700

Query: 976 VEASALLYRRANEQDFAI 993
           + +    + +  EQ F +
Sbjct: 701 LWSCLERWDKVEEQLFPL 718


>gi|148677459|gb|EDL09406.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
           phosphatase, subunit 1, isoform CRA_c [Mus musculus]
          Length = 1000

 Score =  103 bits (256), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 66/180 (36%), Positives = 100/180 (55%), Gaps = 21/180 (11%)

Query: 663 QKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR 722
           Q E+  R E+Q+++   RKL L++DLD TL+++ + H     +  I            H 
Sbjct: 205 QAEKLGR-EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCPQMSNKGIF-----------HF 252

Query: 723 HLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 782
            L R   M + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R+
Sbjct: 253 QLGRGEPM-LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRI 311

Query: 783 ISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYFP 841
           +SR +  DPF       K+ +L  +    +S V IIDD   VW     NLI V++Y YFP
Sbjct: 312 LSRDECIDPFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYFP 364



 Score = 45.4 bits (106), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 36/138 (26%), Positives = 64/138 (46%), Gaps = 23/138 (16%)

Query: 874 VIERLHKIFFS------HQSLDDV-DVRNILAAEQRKILAGCRIVFSRV----FPVGEAN 922
           ++ R+H  +++      ++ L++  D+R I+   + K+LA   ++FS +    FPV +  
Sbjct: 626 ILVRVHTDYYTKYDRYLNKELEEAPDIRKIVPELKSKVLADVAVIFSGLHPTNFPVEKTR 685

Query: 923 PHLHPLWQTAEQFGA------VCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGW 975
            H H     A   GA      V +    D+ TH++A   GT+KV  A       VV P W
Sbjct: 686 EHYH-----ATALGAKVLTQLVLSPDAPDRATHLIAARAGTEKVRQAQECKHLHVVSPDW 740

Query: 976 VEASALLYRRANEQDFAI 993
           + +    + +  EQ F +
Sbjct: 741 LWSCLERWDKVEEQLFPL 758


>gi|34328280|ref|NP_080571.2| RNA polymerase II subunit A C-terminal domain phosphatase [Mus
           musculus]
 gi|46395722|sp|Q7TSG2.1|CTDP1_MOUSE RecName: Full=RNA polymerase II subunit A C-terminal domain
           phosphatase; AltName: Full=TFIIF-associating CTD
           phosphatase
 gi|31419683|gb|AAH53435.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
           phosphatase, subunit 1 [Mus musculus]
          Length = 960

 Score =  103 bits (256), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 66/180 (36%), Positives = 100/180 (55%), Gaps = 21/180 (11%)

Query: 663 QKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR 722
           Q E+  R E+Q+++   RKL L++DLD TL+++ + H     +  I            H 
Sbjct: 165 QAEKLGR-EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCPQMSNKGIF-----------HF 212

Query: 723 HLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 782
            L R   M + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R+
Sbjct: 213 QLGRGEPM-LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRI 271

Query: 783 ISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYFP 841
           +SR +  DPF       K+ +L  +    +S V IIDD   VW     NLI V++Y YFP
Sbjct: 272 LSRDECIDPFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYFP 324



 Score = 45.8 bits (107), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 36/138 (26%), Positives = 64/138 (46%), Gaps = 23/138 (16%)

Query: 874 VIERLHKIFFS------HQSLDDV-DVRNILAAEQRKILAGCRIVFSRV----FPVGEAN 922
           ++ R+H  +++      ++ L++  D+R I+   + K+LA   ++FS +    FPV +  
Sbjct: 586 ILVRVHTDYYTKYDRYLNKELEEAPDIRKIVPELKSKVLADVAVIFSGLHPTNFPVEKTR 645

Query: 923 PHLHPLWQTAEQFGA------VCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGW 975
            H H     A   GA      V +    D+ TH++A   GT+KV  A       VV P W
Sbjct: 646 EHYH-----ATALGAKVLTQLVLSPDAPDRATHLIAARAGTEKVRQAQECKHLHVVSPDW 700

Query: 976 VEASALLYRRANEQDFAI 993
           + +    + +  EQ F +
Sbjct: 701 LWSCLERWDKVEEQLFPL 718


>gi|339254478|ref|XP_003372462.1| conserved hypothetical protein [Trichinella spiralis]
 gi|316967111|gb|EFV51594.1| conserved hypothetical protein [Trichinella spiralis]
          Length = 683

 Score =  103 bits (256), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 92/367 (25%), Positives = 167/367 (45%), Gaps = 64/367 (17%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 730
           E+ K + S +KL L++DLD TL+++++               ++ D    + +    P+ 
Sbjct: 221 EDSKNLLSQKKLALLVDLDLTLIHTSE-------------TSDDSDALDVYHYQMEGPNS 267

Query: 731 GMW-TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 789
             + T+LRP    FL++ ++ FE+H+ T GN+ YA ++ K+LDP  VLF  R++SR    
Sbjct: 268 PWYHTRLRPYARYFLKKINEYFELHIITHGNRKYAEKVVKMLDPNNVLFGDRILSR---D 324

Query: 790 DPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYF-------PC 842
           + FD + + P  K L    G +  V IIDD   VW + + N++ V  Y +F         
Sbjct: 325 ECFDPNMKAPNLKAL--FPGGDDLVCIIDDREDVWNYAE-NVVRVRPYRFFKHTDDFNAA 381

Query: 843 SRRQFGLLGPSLL--EIDHDERSE---------------------DGTLASSLGVIERLH 879
           +  + G +  + L   +  DE                        D  L     ++ R+H
Sbjct: 382 TLAELGNIIETELAKNVAGDEEDAGKDEVKLWLEKADEVPTDNDPDNYLVYLFFLLRRIH 441

Query: 880 KIFFSHQSL-----DDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQ 934
           + F++ + L       + ++ ++ A +  +    R VF+ + P  +A       +  ++Q
Sbjct: 442 ETFYNVRKLTGNPKKTLSLKTVMNALRENVFKNLRFVFTGLVPADQAITD-SIFYYRSKQ 500

Query: 935 FGAVCTKHI-----DDQV--THVVANSLGTDKVNWALSTGR-FVVHPGWVEASALLYRRA 986
           FGAV  K +     D+ +  TH++A  L T+KV  A  +G   +V P W       + R 
Sbjct: 501 FGAVVQKEVVVHDGDETLPTTHLIAGKLDTEKVARARKSGSVIIVSPAWFWTCVERWCRV 560

Query: 987 NEQDFAI 993
            E+D+ +
Sbjct: 561 PEKDYQL 567


>gi|115396432|ref|XP_001213855.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
 gi|114193424|gb|EAU35124.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
          Length = 820

 Score =  102 bits (255), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 75/202 (37%), Positives = 111/202 (54%), Gaps = 33/202 (16%)

Query: 654 YDDQQKAAIQKERTRRLEEQKK-MFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKK 712
           +D+      +KE TR  E+ K+ + S RKL LV+DLD T++++     VDP   E +   
Sbjct: 131 HDNTALTVSEKEATRVEEDAKRRLLSNRKLSLVVDLDQTIIHAT----VDPTVGEWM--- 183

Query: 713 EEQDREKP-HRHL-----FRF----PHM-GMW--TKLRPGIWTFLERASKLFEMHLYTMG 759
             +D+E P H+ L     F+     P M G W   KLRPG+ TFLE  ++LFE+H+YTMG
Sbjct: 184 --EDKENPNHQALSDVRAFQLVDDGPGMRGCWYYVKLRPGLETFLENVAELFELHIYTMG 241

Query: 760 NKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIID 818
            + YA  +A ++DP   LF  R++SR + G           +K+L  +  +++  VVIID
Sbjct: 242 TRAYAQHIASIIDPDRKLFGDRILSRDESGS--------LTAKNLHRLFPVDTKMVVIID 293

Query: 819 DSVRVWPHNKLNLIVVERYTYF 840
           D   VW  +  NLI V  Y +F
Sbjct: 294 DRGDVWRWSP-NLIKVSPYDFF 314



 Score = 48.1 bits (113), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 29/73 (39%), Positives = 43/73 (58%), Gaps = 3/73 (4%)

Query: 892 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLH-PLWQTAEQFGAVCTKHIDDQVTHV 950
           DV++I+   +RK+L G  +VFS V P+G    +    LW  A+ FGAV ++ I+ + TH+
Sbjct: 500 DVKDIMPYIKRKVLGGVVLVFSGVLPLGTDTQNADISLW--AKSFGAVISQKINMKTTHL 557

Query: 951 VANSLGTDKVNWA 963
           VA    T KV  A
Sbjct: 558 VAGRNRTAKVREA 570


>gi|344242866|gb|EGV98969.1| hypothetical protein I79_008270 [Cricetulus griseus]
          Length = 848

 Score =  102 bits (254), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 65/185 (35%), Positives = 103/185 (55%), Gaps = 31/185 (16%)

Query: 663 QKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR 722
           Q E+  R E+Q+++   RKL L++DLD TL+++                 E+Q  +  ++
Sbjct: 46  QAEKLGR-EDQQRLHRNRKLVLMVDLDQTLIHTT----------------EQQCPQMSNK 88

Query: 723 HLFRFPHMG-----MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 777
            +F F  +G     + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  L
Sbjct: 89  GIFHF-QLGRGEPMLHTRLRPHCRDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKL 147

Query: 778 FAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVER 836
           F+ R++SR +  DPF       K+ +L  +    +S V IIDD   VW     NLI V++
Sbjct: 148 FSHRILSRDECIDPFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKK 200

Query: 837 YTYFP 841
           Y YFP
Sbjct: 201 YVYFP 205



 Score = 44.7 bits (104), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 36/138 (26%), Positives = 62/138 (44%), Gaps = 23/138 (16%)

Query: 874 VIERLHKIFFSH------QSLDDV-DVRNILAAEQRKILAGCRIVFSRV----FPVGEAN 922
           ++ R+H  +++       + L++  D+R I+   + K+LA   ++FS +    FPV +  
Sbjct: 474 ILVRVHTDYYTKYDRYLKKELEEAPDIRKIVPELKSKVLADVAVIFSGLHPTNFPVEKTR 533

Query: 923 PHLHPLWQTAEQFGA------VCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGW 975
            H H     A   GA      V      D+ TH++A   GT+KV  A       VV P W
Sbjct: 534 EHYH-----ATALGAKVLTQLVLNPDAPDRATHLIAARAGTEKVRQAQECRHLHVVSPDW 588

Query: 976 VEASALLYRRANEQDFAI 993
           + +    + +  EQ F +
Sbjct: 589 LWSCLERWDKVEEQLFPL 606


>gi|334325963|ref|XP_001374906.2| PREDICTED: RNA polymerase II subunit A C-terminal domain
           phosphatase-like [Monodelphis domestica]
          Length = 1208

 Score =  102 bits (254), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 62/171 (36%), Positives = 96/171 (56%), Gaps = 20/171 (11%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 730
           E+Q+++   RKL L++DLD TL+++ + H     +  I            H  L R   M
Sbjct: 395 EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 443

Query: 731 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 790
            + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 444 -LHTRLRPHCKEFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 502

Query: 791 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           PF       K+ +L  +    +S V IIDD   VW +   NLI V++Y YF
Sbjct: 503 PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKYAP-NLITVKKYVYF 546



 Score = 48.1 bits (113), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 36/114 (31%), Positives = 54/114 (47%), Gaps = 16/114 (14%)

Query: 892 DVRNILAAEQRKILAGCRIVFSRV----FPVGEANPHLHPLWQTAEQFGAVCTKHI---- 943
           D+R I+   + K+LA   I+FS V    FP+     H H     A   GA   K++    
Sbjct: 857 DIRKIVPELKSKVLADVTIIFSGVCPTNFPIERTREHYH-----ATALGAKINKNLILSA 911

Query: 944 DD--QVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAIK 994
           DD  + TH++A   GT+KV  A       VV+P W+ +    + +  EQ F +K
Sbjct: 912 DDPNKATHLIAARAGTEKVRQAQECKHLHVVNPDWLWSCLERWDKVEEQLFPLK 965


>gi|391345370|ref|XP_003746962.1| PREDICTED: RNA polymerase II subunit A C-terminal domain
           phosphatase-like [Metaseiulus occidentalis]
          Length = 475

 Score =  102 bits (254), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 104/398 (26%), Positives = 175/398 (43%), Gaps = 91/398 (22%)

Query: 662 IQKERTRRL--EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREK 719
           + +E +R +  +++ ++ + +KL L++DLD TL+++      +PV+D+I           
Sbjct: 10  VSEELSRSIGRDDELRLLTQKKLVLLVDLDQTLIHTTS----EPVYDKI----------- 54

Query: 720 PHRHLFRFPHMG-MW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 776
              H FR P     W  T++RPG   FL + S+LFE+H+ T G + YA  +A +LDP   
Sbjct: 55  KGVHHFRLPSSNNAWYHTRIRPGTEDFLRKISQLFELHIVTFGARPYANHIASLLDPGKK 114

Query: 777 LFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVE 835
            F  R++SR +  +P        K+ +L+ +    +  V IIDD   VW     NLI V+
Sbjct: 115 YFQYRILSRDECFNP------QSKTANLKSLFPCGDQMVCIIDDREDVWNFAS-NLIAVK 167

Query: 836 RYTYFPCSRRQFGLLGPSLLEIDHDE--RSEDGTLASSL-------------------GV 874
            Y +F   R    +  P+ L  D      SE GT ++ L                   G+
Sbjct: 168 PYVFF---RGAGDINAPAGLLADCHALPASEGGTCSTVLSHRNPEALRADREVVACLQGL 224

Query: 875 IER---------------------------LHKIFFS------HQSLDDV----DVRNIL 897
           IE                            +H+ +F       H+   +     D++ ++
Sbjct: 225 IEHTCGATDGFIDYEDTDDYLLHLEDSLRTVHRAYFELYEQMKHEKHGEASSIPDLKTVI 284

Query: 898 AAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGT 957
              ++K+L    IVF+  FP+ +  P    ++  A   GA   K +  +VTH+VA   GT
Sbjct: 285 PYVRQKVLKDVVIVFTGCFPINQ-RPESAKIFLVAVSLGAKVQKELSKEVTHLVAARPGT 343

Query: 958 DKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAIK 994
            KV  A       VV P W+ + A  + +++E  F +K
Sbjct: 344 AKVQQARKFRSIKVVSPDWLWSCAERWEKSSEALFPLK 381


>gi|354479392|ref|XP_003501894.1| PREDICTED: RNA polymerase II subunit A C-terminal domain
           phosphatase [Cricetulus griseus]
          Length = 978

 Score =  102 bits (254), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 62/177 (35%), Positives = 99/177 (55%), Gaps = 30/177 (16%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 730
           E+Q+++   RKL L++DLD TL+++                 E+Q  +  ++ +F F  +
Sbjct: 183 EDQQRLHRNRKLVLMVDLDQTLIHTT----------------EQQCPQMSNKGIFHF-QL 225

Query: 731 G-----MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISR 785
           G     + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R++SR
Sbjct: 226 GRGEPMLHTRLRPHCRDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSR 285

Query: 786 GDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYFP 841
            +  DPF       K+ +L  +    +S V IIDD   VW     NLI V++Y YFP
Sbjct: 286 DECIDPFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYFP 335



 Score = 44.3 bits (103), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 36/138 (26%), Positives = 62/138 (44%), Gaps = 23/138 (16%)

Query: 874 VIERLHKIFFSH------QSLDDV-DVRNILAAEQRKILAGCRIVFSRV----FPVGEAN 922
           ++ R+H  +++       + L++  D+R I+   + K+LA   ++FS +    FPV +  
Sbjct: 604 ILVRVHTDYYTKYDRYLKKELEEAPDIRKIVPELKSKVLADVAVIFSGLHPTNFPVEKTR 663

Query: 923 PHLHPLWQTAEQFGA------VCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGW 975
            H H     A   GA      V      D+ TH++A   GT+KV  A       VV P W
Sbjct: 664 EHYH-----ATALGAKVLTQLVLNPDAPDRATHLIAARAGTEKVRQAQECRHLHVVSPDW 718

Query: 976 VEASALLYRRANEQDFAI 993
           + +    + +  EQ F +
Sbjct: 719 LWSCLERWDKVEEQLFPL 736


>gi|395830784|ref|XP_003788497.1| PREDICTED: RNA polymerase II subunit A C-terminal domain
           phosphatase [Otolemur garnettii]
          Length = 1290

 Score =  102 bits (254), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 63/172 (36%), Positives = 96/172 (55%), Gaps = 20/172 (11%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 730
           E+Q+++   RKL L++DLD TL+++ + H     +  I            H  L R   M
Sbjct: 172 EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCPQMSNKGIF-----------HFQLGRGEPM 220

Query: 731 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 790
            + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 221 -LHTRLRPHCRDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 279

Query: 791 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYFP 841
           PF       K+ +L  +    +S V IIDD   VW     NLI V++Y YFP
Sbjct: 280 PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYFP 324



 Score = 42.4 bits (98), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 34/114 (29%), Positives = 52/114 (45%), Gaps = 16/114 (14%)

Query: 892 DVRNILAAEQRKILAGCRIVFSRV----FPVGEANPHLHPLWQTAEQFGAVCTKHI---- 943
           D+R I+   + K+LA   I+FS +    FP+ +   H H     A   GA     +    
Sbjct: 603 DIRKIVPELKSKVLADVAIIFSGLHPTNFPIEKTREHYH-----ATALGAKILTQLVLNP 657

Query: 944 DD--QVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAIK 994
           DD  + TH++A   GT+KV  A       VV P W+ +    + +  EQ F +K
Sbjct: 658 DDPGRATHLIAARAGTEKVLQAQGCEHLHVVSPDWLWSCLERWDKVEEQLFPLK 711


>gi|417412899|gb|JAA52807.1| Putative rna polymerase ii subunit a c-terminal domain phosphatase,
           partial [Desmodus rotundus]
          Length = 845

 Score =  102 bits (253), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 62/171 (36%), Positives = 96/171 (56%), Gaps = 20/171 (11%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 730
           E+Q+++   RKL L++DLD TL+++ + H     +  IL           H  L R   M
Sbjct: 66  EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIL-----------HFQLGRGEPM 114

Query: 731 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 790
            + T+LRP    FLE+ ++L+E+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 115 -LHTRLRPHCRQFLEKVARLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 173

Query: 791 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           PF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 174 PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 217



 Score = 50.8 bits (120), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 37/138 (26%), Positives = 65/138 (47%), Gaps = 23/138 (16%)

Query: 874 VIERLHKIFFSH-------QSLDDVDVRNILAAEQRKILAGCRIVFSRV----FPVGEAN 922
           ++ R+H  +++        ++ +  D+R I+   + ++LA   I+FS +    FPV +  
Sbjct: 474 ILARVHSDYYARYDRYLRGETPEAPDIRKIVPELKSRVLADVAIIFSGLHPTNFPVEKTR 533

Query: 923 PHLHPLWQTAEQFGA------VCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGW 975
            H H     A   GA      V      D+ TH++A   GT+KV  A + GR  VV+P W
Sbjct: 534 EHYH-----ATALGARILTQLVLDPDAPDRATHLIAARAGTEKVRQAQACGRLHVVNPDW 588

Query: 976 VEASALLYRRANEQDFAI 993
           + +    + +  EQ F +
Sbjct: 589 LWSCLERWDKVEEQLFPL 606


>gi|296222911|ref|XP_002757404.1| PREDICTED: RNA polymerase II subunit A C-terminal domain
           phosphatase [Callithrix jacchus]
          Length = 1053

 Score =  102 bits (253), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 730
           E+Q+++   RKL L++DLD TL+++ + H     +  I            H  L R   M
Sbjct: 172 EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 220

Query: 731 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 790
            + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 221 -LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 279

Query: 791 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           PF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 280 PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 323



 Score = 48.9 bits (115), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 36/139 (25%), Positives = 67/139 (48%), Gaps = 23/139 (16%)

Query: 874 VIERLHKIFFS------HQSLDDV-DVRNILAAEQRKILAGCRIVFSRV----FPVGEAN 922
           ++ R+H  +++      ++ +++  D+R I+   + K+LA   I+FS +    FP+ +  
Sbjct: 596 ILVRVHTDYYAKYDRFLNKEIEEAPDIRKIVPELKSKVLADVAIIFSGLHPTNFPIEKTR 655

Query: 923 PHLHPLWQTAEQFGA------VCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGW 975
            H H     A   GA      V +    D+ TH++A   GT+KV  A   G   VV+P W
Sbjct: 656 EHYH-----ATALGAKILTRLVLSPDAPDRATHLIAARAGTEKVLQAQECGHLHVVNPDW 710

Query: 976 VEASALLYRRANEQDFAIK 994
           + +    + +  EQ F ++
Sbjct: 711 LWSCLERWDKVEEQLFPLR 729


>gi|358383388|gb|EHK21054.1| hypothetical protein TRIVIDRAFT_90991 [Trichoderma virens Gv29-8]
          Length = 758

 Score =  102 bits (253), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 68/196 (34%), Positives = 101/196 (51%), Gaps = 21/196 (10%)

Query: 656 DQQKAAIQKERTRRLEE--QKKMFSARKLCLVLDLDHTLLNSA--------KFHEVDPVH 705
           DQ    + K   +R E   QK++   RKL LV+DLD T++++         +    +P H
Sbjct: 131 DQTGLMVSKNVAKRAEHDTQKRLLRQRKLSLVVDLDQTIIHACIEPTIGEWQRDPTNPNH 190

Query: 706 DEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYAT 765
           + +   K  Q  +   R L        + KLRPG+  FLE  S  +E+H+YTMG + YA 
Sbjct: 191 EAVKDVKSFQLNDDGPRGLA--SGCTYYIKLRPGLQEFLEAVSTKYELHVYTMGTRAYAL 248

Query: 766 EMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMES-AVVIIDDSVRVW 824
            +A+++DP   LF  RVISR ++G           +K L+ +  + +  VVIIDD   VW
Sbjct: 249 NIARIVDPDRKLFGNRVISRDENGS--------ITAKSLQRLFPVSTDMVVIIDDRADVW 300

Query: 825 PHNKLNLIVVERYTYF 840
           P N+ NLI V  Y +F
Sbjct: 301 PMNRPNLIKVVPYDFF 316


>gi|320164786|gb|EFW41685.1| conserved hypothetical protein [Capsaspora owczarzaki ATCC 30864]
          Length = 877

 Score =  102 bits (253), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 60/171 (35%), Positives = 96/171 (56%), Gaps = 12/171 (7%)

Query: 672 EQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMG 731
           E++ +  ++KL L++DLD TL+++    +V P   + LR   E  +E  +  L   PH+ 
Sbjct: 222 EKESLLQSKKLVLIVDLDQTLIHAVVSSQV-PWIGQFLRDNVELQKEIFNFSLPNHPHL- 279

Query: 732 MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 791
            + KLRPG   FL +A+KLFE+H++TMG+++YA+ +A VLDP G LF  R++SR +    
Sbjct: 280 YYIKLRPGAREFLAQATKLFELHIFTMGSRMYASRVAAVLDPDGALFGSRIMSRDESKSA 339

Query: 792 -FDGDERVPKSKDLEGVL-GMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
            F       K   L  +     + V ++DD + VW     N+I +  Y YF
Sbjct: 340 NF-------KHTQLSQLFPSGHNMVAVLDDRIDVWARLG-NVIQISPYEYF 382


>gi|406865754|gb|EKD18795.1| FCP1-like phosphatase [Marssonina brunnea f. sp. 'multigermtubi'
           MB_m1]
          Length = 863

 Score =  102 bits (253), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 65/181 (35%), Positives = 98/181 (54%), Gaps = 23/181 (12%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDRE--KPHRHLF--- 725
           E Q+++   RKL LV+DLD T++++     ++P   E  R K   + E  K  +      
Sbjct: 155 ELQRRLLKHRKLSLVVDLDQTIIHAC----IEPTVGEWQRDKNSPNYEAVKDVKSFQLND 210

Query: 726 ---RFPHMGMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAG 780
              R    G W   K+RPG+  FL   S+L+E+H+YTMG + YA  +AK++DP   LF  
Sbjct: 211 DGPRGLASGCWYYIKMRPGLAEFLAHISELYELHVYTMGTRAYAINIAKIVDPDKKLFGD 270

Query: 781 RVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERYTY 839
           R+ISR ++G+          +K L  +  +++  VVIIDD   VWP N+ NLI V  Y +
Sbjct: 271 RIISRDENGN--------VTAKSLARLFPVDTKMVVIIDDRADVWPQNRPNLIKVVPYDF 322

Query: 840 F 840
           F
Sbjct: 323 F 323


>gi|50838820|ref|NP_001002873.1| RNA polymerase II subunit A C-terminal domain phosphatase [Danio
           rerio]
 gi|49618915|gb|AAT68042.1| RNA polymerase II CTD phosphatase [Danio rerio]
          Length = 947

 Score =  102 bits (253), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 63/171 (36%), Positives = 96/171 (56%), Gaps = 20/171 (11%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 730
           E+Q+++   RKL L++DLD TL+++ + H     +  I            H  L R   M
Sbjct: 159 EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQRMSNKGIF-----------HFQLGRGEPM 207

Query: 731 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 790
            + T+LRP    FLE+ +KLFE+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 208 -LHTRLRPHCKDFLEKIAKLFELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 266

Query: 791 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           PF       K+ +L+ +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 267 PFS------KTGNLKNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYIYF 310



 Score = 52.0 bits (123), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 34/128 (26%), Positives = 59/128 (46%), Gaps = 18/128 (14%)

Query: 879 HKIFFSHQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPL-----WQTAE 933
           ++ F   +S +  D+R I+   + + LAG  ++FS ++P        +P+     W   +
Sbjct: 568 YEAFLKKESSESPDIRKIVPELKGRTLAGTTVIFSGLYPTN------YPMERTREWYHGK 621

Query: 934 QFGAVCTKHI------DDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRA 986
             GA   K +       ++ TH++A   GT+KV  A       VV+P W+ A    + R 
Sbjct: 622 ALGAKLCKTLVLNAKDPNRTTHLIAARAGTEKVRQAQGCKHLHVVNPDWLWACLERWERV 681

Query: 987 NEQDFAIK 994
            EQ F +K
Sbjct: 682 EEQLFPLK 689


>gi|410911388|ref|XP_003969172.1| PREDICTED: RNA polymerase II subunit A C-terminal domain
           phosphatase-like [Takifugu rubripes]
          Length = 905

 Score =  102 bits (253), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 65/182 (35%), Positives = 102/182 (56%), Gaps = 21/182 (11%)

Query: 660 AAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREK 719
           +A Q E+  R E+Q+++   +KL L++DLD TL+++ + H     +  IL          
Sbjct: 156 SAEQAEQLGR-EDQQRLHRNKKLVLMVDLDQTLIHTTEQHCQRMSNKGIL---------- 204

Query: 720 PHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA 779
            H  L R   M + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+
Sbjct: 205 -HFQLGRGEPM-LHTRLRPHCKEFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFS 262

Query: 780 GRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYT 838
            R++SR +  DPF       K+ +L  +    +S V IIDD   VW     NL+ V++Y 
Sbjct: 263 HRILSRDECIDPFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLVTVKKYV 315

Query: 839 YF 840
           YF
Sbjct: 316 YF 317



 Score = 52.8 bits (125), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 37/140 (26%), Positives = 66/140 (47%), Gaps = 25/140 (17%)

Query: 874 VIERLHKIFFSH-------QSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLH 926
           V+ER+H  +++        ++ +  D+R I+   + K L G  IVFS ++P        +
Sbjct: 571 VLERVHAEYYARYEAYLGKEAPESPDIRKIVPELKSKTLEGATIVFSGLYPTN------Y 624

Query: 927 PLWQTAEQF-----GAVCTKHI------DDQVTHVVANSLGTDKVNWALSTGRF-VVHPG 974
           P+ +T E +     GA  ++ +        Q TH++A   GT+KV  A       VV+P 
Sbjct: 625 PIEKTREYYHAKALGATISRSLILSTKGPGQTTHLIAARAGTEKVRQAQGCKHLRVVNPD 684

Query: 975 WVEASALLYRRANEQDFAIK 994
           W+ +    + R  EQ + +K
Sbjct: 685 WLWSCLERWERVEEQLYPLK 704


>gi|291414979|ref|XP_002723734.1| PREDICTED: CTD (carboxy-terminal domain, RNA polymerase II,
           polypeptide A) phosphatase, subunit 1-like [Oryctolagus
           cuniculus]
          Length = 940

 Score =  101 bits (252), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 65/179 (36%), Positives = 100/179 (55%), Gaps = 21/179 (11%)

Query: 663 QKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR 722
           Q E+  R E+Q+++   RKL L++DLD TL+++ + H     +  IL           H 
Sbjct: 146 QAEKIAR-EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCPQMSNKGIL-----------HF 193

Query: 723 HLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 782
            L R   M + T+LRP    FLE+ ++L+E+H++T G++LYA  +A  LDP+  LF+ R+
Sbjct: 194 QLGRGEPM-LHTRLRPHCKDFLEKIARLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRI 252

Query: 783 ISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           +SR +  DPF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 253 LSRDECIDPFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 304



 Score = 46.2 bits (108), Expect = 0.081,   Method: Compositional matrix adjust.
 Identities = 36/139 (25%), Positives = 65/139 (46%), Gaps = 23/139 (16%)

Query: 874 VIERLHKIFFS------HQSLDDV-DVRNILAAEQRKILAGCRIVFSRV----FPVGEAN 922
           ++ R+H  +++      ++ +++  D+R I+   + K+LAG  I FS +    FP+ +  
Sbjct: 565 ILARVHADYYTRYDRYLNREVEEAPDMRRIVPELKSKVLAGVVITFSGLHPANFPIEKTR 624

Query: 923 PHLHPLWQTAEQFGA------VCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGW 975
            H H     A   GA      V      D+ TH++A   GT+KV  A       VV+P W
Sbjct: 625 EHYH-----ATALGAKILTRLVLNPDAPDRATHLIAARAGTEKVRQAQECRHLHVVNPDW 679

Query: 976 VEASALLYRRANEQDFAIK 994
           + +    + +  EQ F ++
Sbjct: 680 LWSCLERWDKVEEQLFPLR 698


>gi|194214772|ref|XP_001496059.2| PREDICTED: RNA polymerase II subunit A C-terminal domain
           phosphatase [Equus caballus]
          Length = 868

 Score =  101 bits (252), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 730
           E+Q+++   RKL L++DLD TL+++ + H     +  I            H  L R   M
Sbjct: 80  EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 128

Query: 731 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 790
            + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 129 -LHTRLRPHCKEFLEKTAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 187

Query: 791 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           PF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 188 PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 231



 Score = 49.7 bits (117), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 37/124 (29%), Positives = 61/124 (49%), Gaps = 16/124 (12%)

Query: 882 FFSHQSLDDVDVRNILAAEQRKILAGCRIVFSRV----FPVGEANPHLHPLWQTAEQFGA 937
           F S +  +  D+R I+   + K+LA   I+FS +    FP+ +   H H     A   GA
Sbjct: 510 FLSGEIQEAPDIRKIVPELKSKVLADVAIIFSGLHPTNFPIEKTREHYH-----ATALGA 564

Query: 938 -VCTKHIDD-----QVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQD 990
            + T+ + D     + TH+VA   GT+KV  A   G+  VV+P W+ +    + +  EQ 
Sbjct: 565 KILTQLVLDPNDPNRATHLVAARAGTEKVRQAQECGQLHVVNPDWLWSCLERWDKVEEQL 624

Query: 991 FAIK 994
           F ++
Sbjct: 625 FPLR 628


>gi|344269798|ref|XP_003406734.1| PREDICTED: LOW QUALITY PROTEIN: RNA polymerase II subunit A
           C-terminal domain phosphatase-like [Loxodonta africana]
          Length = 972

 Score =  101 bits (252), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 730
           E+Q+++   RKL L++DLD TL+++ + H     +  I            H  L R   M
Sbjct: 178 EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 226

Query: 731 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 790
            + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 227 -LHTRLRPHCKEFLEKVAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 285

Query: 791 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           PF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 286 PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 329



 Score = 43.9 bits (102), Expect = 0.44,   Method: Compositional matrix adjust.
 Identities = 34/124 (27%), Positives = 59/124 (47%), Gaps = 16/124 (12%)

Query: 882 FFSHQSLDDVDVRNILAAEQRKILAGCRIVFSRV----FPVGEANPHLHPLWQTAEQFGA 937
           + + ++ +  D+R I+   + K+LA   I+FS +    FPV +   H H     A   GA
Sbjct: 608 YLNKETEEAPDIRKIVPELKSKVLADVAIIFSGLHPTNFPVEKTREHYH-----ATALGA 662

Query: 938 VCTKHI----DD--QVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQD 990
               ++    DD  + TH++A   GT+KV  A       VV+P W+ +    + +  EQ 
Sbjct: 663 RILTNLVLSPDDPNRATHLIAARAGTEKVRQAQECRHLHVVNPDWLWSCLERWDKVEEQL 722

Query: 991 FAIK 994
           F ++
Sbjct: 723 FPLR 726


>gi|402903419|ref|XP_003914563.1| PREDICTED: RNA polymerase II subunit A C-terminal domain
           phosphatase isoform 2 [Papio anubis]
          Length = 871

 Score =  101 bits (251), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 730
           E+Q+++   RKL L++DLD TL+++ + H     +  I            H  L R   M
Sbjct: 172 EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 220

Query: 731 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 790
            + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 221 -LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 279

Query: 791 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           PF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 280 PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 323



 Score = 47.8 bits (112), Expect = 0.030,   Method: Compositional matrix adjust.
 Identities = 35/124 (28%), Positives = 58/124 (46%), Gaps = 16/124 (12%)

Query: 882 FFSHQSLDDVDVRNILAAEQRKILAGCRIVFSRV----FPVGEANPHLHPLWQTAEQFGA 937
           F + +  +  D+R I+   + K+LA   I+FS +    FP+ +   H H     A   GA
Sbjct: 615 FLNKEIEEAPDIRKIVPELKSKVLADVAIIFSGLHPTNFPIEKTREHYH-----ATALGA 669

Query: 938 ------VCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQD 990
                 V +    D+ TH++A   GT+KV  A   G   VV+P W+ +    + +  EQ 
Sbjct: 670 KILTRLVLSPDAPDRATHLIAARAGTEKVLQAQECGHLHVVNPDWLWSCLERWDKVEEQL 729

Query: 991 FAIK 994
           F ++
Sbjct: 730 FPLR 733


>gi|326916917|ref|XP_003204751.1| PREDICTED: RNA polymerase II subunit A C-terminal domain
           phosphatase-like [Meleagris gallopavo]
          Length = 1003

 Score =  101 bits (251), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 730
           E+Q+++   RKL L++DLD TL+++ + H     +  I            H  L R   M
Sbjct: 183 EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 231

Query: 731 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 790
            + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 232 -LHTRLRPHCKEFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 290

Query: 791 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           PF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 291 PFS------KTGNLRDLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 334



 Score = 49.7 bits (117), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 36/139 (25%), Positives = 67/139 (48%), Gaps = 23/139 (16%)

Query: 874 VIERLHKIFFSH------QSLDDV-DVRNILAAEQRKILAGCRIVFSRV----FPVGEAN 922
           ++ R+H  +++       + ++++ D+R I+   + K+LA   I+FS +    FP+ +  
Sbjct: 622 ILVRVHTDYYTKYDKYLKKEIEEIPDIRKIVPELKSKVLADVTIIFSGLYPTNFPIEKTR 681

Query: 923 PHLHPLWQTAEQFGAVCTKHI------DDQVTHVVANSLGTDKVNWALSTGRF-VVHPGW 975
            H H     A   GA   K++       D+ TH++A   GT+KV  A       VV+P W
Sbjct: 682 EHYH-----ATALGAKIVKNLVLSADDPDKATHLIAARTGTEKVRQAQDCKDLHVVNPDW 736

Query: 976 VEASALLYRRANEQDFAIK 994
           + +    + +  EQ F +K
Sbjct: 737 LWSCLERWDKVEEQLFPLK 755


>gi|3769521|gb|AAC64549.1| serine phosphatase FCP1a [Homo sapiens]
          Length = 842

 Score =  101 bits (251), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 730
           E+Q+++   RKL L++DLD TL+++ + H     +  I            H  L R   M
Sbjct: 53  EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 101

Query: 731 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 790
            + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 102 -LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 160

Query: 791 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           PF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 161 PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 204



 Score = 45.4 bits (106), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 35/139 (25%), Positives = 66/139 (47%), Gaps = 23/139 (16%)

Query: 874 VIERLHKIFFS------HQSLDDV-DVRNILAAEQRKILAGCRIVFSRV----FPVGEAN 922
           ++ R+H  +++      ++ +++  D+R I+   + K+LA   I+FS +    FP+ +  
Sbjct: 477 ILVRVHTDYYAKYDRYLNKEIEEAPDIRKIVPELKSKVLADVAIIFSGLHPTNFPIEKTR 536

Query: 923 PHLHPLWQTAEQFGA------VCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGW 975
              H     A   GA      V +    D+ TH++A   GT+KV  A   G   VV+P W
Sbjct: 537 DDYH-----ATALGAKILTRLVLSPDAPDRATHLIAARAGTEKVLQAQECGHLHVVNPDW 591

Query: 976 VEASALLYRRANEQDFAIK 994
           + +    + +  EQ F ++
Sbjct: 592 LWSCLERWDKVEEQLFPLR 610


>gi|67188445|ref|NP_004706.3| RNA polymerase II subunit A C-terminal domain phosphatase isoform 1
           [Homo sapiens]
 gi|327478586|sp|Q9Y5B0.3|CTDP1_HUMAN RecName: Full=RNA polymerase II subunit A C-terminal domain
           phosphatase; AltName: Full=TFIIF-associating CTD
           phosphatase
 gi|119587032|gb|EAW66628.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
           phosphatase, subunit 1, isoform CRA_a [Homo sapiens]
          Length = 961

 Score =  101 bits (251), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 730
           E+Q+++   RKL L++DLD TL+++ + H     +  I            H  L R   M
Sbjct: 172 EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 220

Query: 731 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 790
            + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 221 -LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 279

Query: 791 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           PF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 280 PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 323



 Score = 48.9 bits (115), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 36/139 (25%), Positives = 67/139 (48%), Gaps = 23/139 (16%)

Query: 874 VIERLHKIFFS------HQSLDDV-DVRNILAAEQRKILAGCRIVFSRV----FPVGEAN 922
           ++ R+H  +++      ++ +++  D+R I+   + K+LA   I+FS +    FP+ +  
Sbjct: 596 ILVRVHTDYYAKYDRYLNKEIEEAPDIRKIVPELKSKVLADVAIIFSGLHPTNFPIEKTR 655

Query: 923 PHLHPLWQTAEQFGA------VCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGW 975
            H H     A   GA      V +    D+ TH++A   GT+KV  A   G   VV+P W
Sbjct: 656 EHYH-----ATALGAKILTRLVLSPDAPDRATHLIAARAGTEKVLQAQECGHLHVVNPDW 710

Query: 976 VEASALLYRRANEQDFAIK 994
           + +    + +  EQ F ++
Sbjct: 711 LWSCLERWDKVEEQLFPLR 729


>gi|402903417|ref|XP_003914562.1| PREDICTED: RNA polymerase II subunit A C-terminal domain
           phosphatase isoform 1 [Papio anubis]
          Length = 965

 Score =  101 bits (251), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 730
           E+Q+++   RKL L++DLD TL+++ + H     +  I            H  L R   M
Sbjct: 172 EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 220

Query: 731 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 790
            + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 221 -LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 279

Query: 791 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           PF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 280 PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 323



 Score = 48.5 bits (114), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 35/124 (28%), Positives = 58/124 (46%), Gaps = 16/124 (12%)

Query: 882 FFSHQSLDDVDVRNILAAEQRKILAGCRIVFSRV----FPVGEANPHLHPLWQTAEQFGA 937
           F + +  +  D+R I+   + K+LA   I+FS +    FP+ +   H H     A   GA
Sbjct: 615 FLNKEIEEAPDIRKIVPELKSKVLADVAIIFSGLHPTNFPIEKTREHYH-----ATALGA 669

Query: 938 ------VCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQD 990
                 V +    D+ TH++A   GT+KV  A   G   VV+P W+ +    + +  EQ 
Sbjct: 670 KILTRLVLSPDAPDRATHLIAARAGTEKVLQAQECGHLHVVNPDWLWSCLERWDKVEEQL 729

Query: 991 FAIK 994
           F ++
Sbjct: 730 FPLR 733


>gi|332850750|ref|XP_001144243.2| PREDICTED: RNA polymerase II subunit A C-terminal domain
           phosphatase isoform 2 [Pan troglodytes]
          Length = 1026

 Score =  101 bits (251), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 730
           E+Q+++   RKL L++DLD TL+++ + H     +  I            H  L R   M
Sbjct: 172 EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 220

Query: 731 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 790
            + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 221 -LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 279

Query: 791 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           PF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 280 PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 323



 Score = 48.1 bits (113), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 36/139 (25%), Positives = 67/139 (48%), Gaps = 23/139 (16%)

Query: 874 VIERLHKIFFS------HQSLDDV-DVRNILAAEQRKILAGCRIVFSRV----FPVGEAN 922
           ++ R+H  +++      ++ +++  D+R I+   + K+LA   I+FS +    FP+ +  
Sbjct: 596 ILVRVHTDYYAKYDRYLNKEIEEAPDIRKIVPELKSKVLADVAIIFSGLHPTNFPIEKTR 655

Query: 923 PHLHPLWQTAEQFGA------VCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGW 975
            H H     A   GA      V +    D+ TH++A   GT+KV  A   G   VV+P W
Sbjct: 656 EHYH-----ATALGAKILTRLVLSPDAPDRATHLIAARAGTEKVLQAQECGHLHVVNPDW 710

Query: 976 VEASALLYRRANEQDFAIK 994
           + +    + +  EQ F ++
Sbjct: 711 LWSCLERWDKVEEQLFPLR 729


>gi|5326898|gb|AAD42088.1| RNA polymerase II CTD phosphatase [Homo sapiens]
          Length = 961

 Score =  101 bits (251), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 730
           E+Q+++   RKL L++DLD TL+++ + H     +  I            H  L R   M
Sbjct: 172 EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 220

Query: 731 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 790
            + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 221 -LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 279

Query: 791 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           PF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 280 PFS------KTGNLRNLFPCGDSMVCIIDDRKDVWKFAP-NLITVKKYVYF 323



 Score = 48.5 bits (114), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 36/139 (25%), Positives = 67/139 (48%), Gaps = 23/139 (16%)

Query: 874 VIERLHKIFFS------HQSLDDV-DVRNILAAEQRKILAGCRIVFSRV----FPVGEAN 922
           ++ R+H  +++      ++ +++  D+R I+   + K+LA   I+FS +    FP+ +  
Sbjct: 596 ILVRVHTDYYAKYDRYLNKEIEEAPDIRKIVPELKSKVLADVAIIFSGLHPTNFPIEKTR 655

Query: 923 PHLHPLWQTAEQFGA------VCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGW 975
            H H     A   GA      V +    D+ TH++A   GT+KV  A   G   VV+P W
Sbjct: 656 EHYH-----ATALGAKILTRLVLSPDAPDRATHLIAARAGTEKVLQAQECGHLHVVNPDW 710

Query: 976 VEASALLYRRANEQDFAIK 994
           + +    + +  EQ F ++
Sbjct: 711 LWSCLERWDKVEEQLFPLR 729


>gi|297702856|ref|XP_002828379.1| PREDICTED: LOW QUALITY PROTEIN: RNA polymerase II subunit A
           C-terminal domain phosphatase [Pongo abelii]
          Length = 962

 Score =  101 bits (251), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 730
           E+Q+++   RKL L++DLD TL+++ + H     +  I            H  L R   M
Sbjct: 172 EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 220

Query: 731 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 790
            + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 221 -LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 279

Query: 791 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           PF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 280 PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 323



 Score = 48.5 bits (114), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 36/139 (25%), Positives = 67/139 (48%), Gaps = 23/139 (16%)

Query: 874 VIERLHKIFFS------HQSLDDV-DVRNILAAEQRKILAGCRIVFSRV----FPVGEAN 922
           ++ R+H  +++      ++ +++  D+R I+   + K+LA   I+FS +    FP+ +  
Sbjct: 597 ILVRVHTDYYAKYDRYLNKEIEEAPDIRKIVPELKSKVLADVAIIFSGLHPTNFPIEKTR 656

Query: 923 PHLHPLWQTAEQFGA------VCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGW 975
            H H     A   GA      V +    D+ TH++A   GT+KV  A   G   VV+P W
Sbjct: 657 EHYH-----ATALGAKILTRLVLSPDAPDRATHLIAARAGTEKVLQAQECGHLHVVNPDW 711

Query: 976 VEASALLYRRANEQDFAIK 994
           + +    + +  EQ F ++
Sbjct: 712 LWSCLERWDKVEEQLFPLR 730


>gi|321267522|ref|NP_001189433.1| RNA polymerase II subunit A C-terminal domain phosphatase isoform 3
           [Homo sapiens]
          Length = 842

 Score =  101 bits (251), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 730
           E+Q+++   RKL L++DLD TL+++ + H     +  I            H  L R   M
Sbjct: 53  EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 101

Query: 731 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 790
            + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 102 -LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 160

Query: 791 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           PF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 161 PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 204



 Score = 49.3 bits (116), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 36/139 (25%), Positives = 67/139 (48%), Gaps = 23/139 (16%)

Query: 874 VIERLHKIFFS------HQSLDDV-DVRNILAAEQRKILAGCRIVFSRV----FPVGEAN 922
           ++ R+H  +++      ++ +++  D+R I+   + K+LA   I+FS +    FP+ +  
Sbjct: 477 ILVRVHTDYYAKYDRYLNKEIEEAPDIRKIVPELKSKVLADVAIIFSGLHPTNFPIEKTR 536

Query: 923 PHLHPLWQTAEQFGA------VCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGW 975
            H H     A   GA      V +    D+ TH++A   GT+KV  A   G   VV+P W
Sbjct: 537 EHYH-----ATALGAKILTRLVLSPDAPDRATHLIAARAGTEKVLQAQECGHLHVVNPDW 591

Query: 976 VEASALLYRRANEQDFAIK 994
           + +    + +  EQ F ++
Sbjct: 592 LWSCLERWDKVEEQLFPLR 610


>gi|109122558|ref|XP_001088601.1| PREDICTED: RNA polymerase II subunit A C-terminal domain
           phosphatase isoform 2 [Macaca mulatta]
          Length = 964

 Score =  101 bits (251), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 730
           E+Q+++   RKL L++DLD TL+++ + H     +  I            H  L R   M
Sbjct: 172 EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 220

Query: 731 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 790
            + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 221 -LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 279

Query: 791 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           PF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 280 PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 323



 Score = 48.5 bits (114), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 35/124 (28%), Positives = 58/124 (46%), Gaps = 16/124 (12%)

Query: 882 FFSHQSLDDVDVRNILAAEQRKILAGCRIVFSRV----FPVGEANPHLHPLWQTAEQFGA 937
           F + +  +  D+R I+   + K+LA   I+FS +    FP+ +   H H     A   GA
Sbjct: 616 FLNKEIEEAPDIRKIVPELKSKVLADVAIIFSGLHPTNFPIEKTREHYH-----ATALGA 670

Query: 938 ------VCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQD 990
                 V +    D+ TH++A   GT+KV  A   G   VV+P W+ +    + +  EQ 
Sbjct: 671 KILTRLVLSPDAPDRATHLIAARAGTEKVLQAQECGHLHVVNPDWLWSCLERWDKVEEQL 730

Query: 991 FAIK 994
           F ++
Sbjct: 731 FPLR 734


>gi|397467065|ref|XP_003805250.1| PREDICTED: RNA polymerase II subunit A C-terminal domain
           phosphatase [Pan paniscus]
          Length = 842

 Score =  101 bits (251), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 730
           E+Q+++   RKL L++DLD TL+++ + H     +  I            H  L R   M
Sbjct: 53  EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 101

Query: 731 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 790
            + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 102 -LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 160

Query: 791 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           PF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 161 PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 204



 Score = 49.3 bits (116), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 36/139 (25%), Positives = 67/139 (48%), Gaps = 23/139 (16%)

Query: 874 VIERLHKIFFS------HQSLDDV-DVRNILAAEQRKILAGCRIVFSRV----FPVGEAN 922
           ++ R+H  +++      ++ +++  D+R I+   + K+LA   I+FS +    FP+ +  
Sbjct: 477 ILVRVHTDYYAKYDRYLNKEIEEAPDIRKIVPELKSKVLADVAIIFSGLHPTNFPIEKTR 536

Query: 923 PHLHPLWQTAEQFGA------VCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGW 975
            H H     A   GA      V +    D+ TH++A   GT+KV  A   G   VV+P W
Sbjct: 537 EHYH-----ATALGAKILTRLVLSPDAPDRATHLIAARAGTEKVLQAQECGHLHVVNPDW 591

Query: 976 VEASALLYRRANEQDFAIK 994
           + +    + +  EQ F ++
Sbjct: 592 LWSCLERWDKVEEQLFPLR 610


>gi|119587034|gb|EAW66630.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
           phosphatase, subunit 1, isoform CRA_c [Homo sapiens]
          Length = 948

 Score =  101 bits (251), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 730
           E+Q+++   RKL L++DLD TL+++ + H     +  I            H  L R   M
Sbjct: 172 EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 220

Query: 731 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 790
            + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 221 -LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 279

Query: 791 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           PF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 280 PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 323



 Score = 48.9 bits (115), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 36/139 (25%), Positives = 67/139 (48%), Gaps = 23/139 (16%)

Query: 874 VIERLHKIFFS------HQSLDDV-DVRNILAAEQRKILAGCRIVFSRV----FPVGEAN 922
           ++ R+H  +++      ++ +++  D+R I+   + K+LA   I+FS +    FP+ +  
Sbjct: 596 ILVRVHTDYYAKYDRYLNKEIEEAPDIRKIVPELKSKVLADVAIIFSGLHPTNFPIEKTR 655

Query: 923 PHLHPLWQTAEQFGA------VCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGW 975
            H H     A   GA      V +    D+ TH++A   GT+KV  A   G   VV+P W
Sbjct: 656 EHYH-----ATALGAKILTRLVLSPDAPDRATHLIAARAGTEKVLQAQECGHLHVVNPDW 710

Query: 976 VEASALLYRRANEQDFAIK 994
           + +    + +  EQ F ++
Sbjct: 711 LWSCLERWDKVEEQLFPLR 729


>gi|119587036|gb|EAW66632.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
           phosphatase, subunit 1, isoform CRA_e [Homo sapiens]
          Length = 748

 Score =  101 bits (251), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 730
           E+Q+++   RKL L++DLD TL+++ + H     +  I            H  L R   M
Sbjct: 53  EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 101

Query: 731 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 790
            + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 102 -LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 160

Query: 791 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           PF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 161 PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 204



 Score = 49.3 bits (116), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 36/139 (25%), Positives = 67/139 (48%), Gaps = 23/139 (16%)

Query: 874 VIERLHKIFFS------HQSLDDV-DVRNILAAEQRKILAGCRIVFSRV----FPVGEAN 922
           ++ R+H  +++      ++ +++  D+R I+   + K+LA   I+FS +    FP+ +  
Sbjct: 477 ILVRVHTDYYAKYDRYLNKEIEEAPDIRKIVPELKSKVLADVAIIFSGLHPTNFPIEKTR 536

Query: 923 PHLHPLWQTAEQFGA------VCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGW 975
            H H     A   GA      V +    D+ TH++A   GT+KV  A   G   VV+P W
Sbjct: 537 EHYH-----ATALGAKILTRLVLSPDAPDRATHLIAARAGTEKVLQAQECGHLHVVNPDW 591

Query: 976 VEASALLYRRANEQDFAIK 994
           + +    + +  EQ F ++
Sbjct: 592 LWSCLERWDKVEEQLFPLR 610


>gi|402903421|ref|XP_003914564.1| PREDICTED: RNA polymerase II subunit A C-terminal domain
           phosphatase isoform 3 [Papio anubis]
          Length = 846

 Score =  101 bits (251), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 730
           E+Q+++   RKL L++DLD TL+++ + H     +  I            H  L R   M
Sbjct: 53  EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 101

Query: 731 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 790
            + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 102 -LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 160

Query: 791 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           PF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 161 PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 204



 Score = 48.9 bits (115), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 35/124 (28%), Positives = 58/124 (46%), Gaps = 16/124 (12%)

Query: 882 FFSHQSLDDVDVRNILAAEQRKILAGCRIVFSRV----FPVGEANPHLHPLWQTAEQFGA 937
           F + +  +  D+R I+   + K+LA   I+FS +    FP+ +   H H     A   GA
Sbjct: 496 FLNKEIEEAPDIRKIVPELKSKVLADVAIIFSGLHPTNFPIEKTREHYH-----ATALGA 550

Query: 938 ------VCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQD 990
                 V +    D+ TH++A   GT+KV  A   G   VV+P W+ +    + +  EQ 
Sbjct: 551 KILTRLVLSPDAPDRATHLIAARAGTEKVLQAQECGHLHVVNPDWLWSCLERWDKVEEQL 610

Query: 991 FAIK 994
           F ++
Sbjct: 611 FPLR 614


>gi|123401628|ref|XP_001301902.1| NLI interacting factor-like phosphatase family protein [Trichomonas
           vaginalis G3]
 gi|121883137|gb|EAX88972.1| NLI interacting factor-like phosphatase family protein [Trichomonas
           vaginalis G3]
          Length = 461

 Score =  100 bits (250), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 86/352 (24%), Positives = 163/352 (46%), Gaps = 40/352 (11%)

Query: 661 AIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFH---EVDPVHDEILRKKEEQDR 717
           + ++ + + LEE++++  A+KL LV+DLD TL+++ +     EVD +        ++ D 
Sbjct: 45  SFEEAKRKNLEEEQRLIDAKKLSLVIDLDKTLIDTTEVRNRAEVDAI--------KKLDP 96

Query: 718 EKPHRHLFRFP-HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 776
                  F F  +  +  + RP +  FL   +  F+M +YT+ +  YA  +   +DP+  
Sbjct: 97  AATEDDFFEFNMNQNLLIRYRPHVRQFLASIAPYFDMQIYTLASPAYAHAILSKIDPEDK 156

Query: 777 LFAGRVISRGDDG-----DPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVW--PHNK 828
           LF  R+ SR  +      +       +   K+++ +    +  V+++DDS  VW   +NK
Sbjct: 157 LFKNRIFSRTAEDFAMIKEAMRNQTDIVNKKNIKKIFPYSDKLVLVLDDSPEVWFCDNNK 216

Query: 829 L--NLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQ 886
           L   L+ ++RY+YF    RQ    GP+     + +   D  L     V+  +H +F+ + 
Sbjct: 217 LFKGLVQIKRYSYFT---RQ----GPNSPPTVNPDYVNDDILIQMRSVLIDVHDMFYKNY 269

Query: 887 SLDDVDVRNILAAEQRK--ILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHID 944
             ++  V  I+   QRK  +  G    FS     G +        + AE+FGA+      
Sbjct: 270 DPEESHV--IMTLHQRKAQVFEGKTFYFS-----GLSEDDTMTFTRLAEEFGALVVDSFT 322

Query: 945 DQVTHVVANSLGT-DKVNWALST-GRFVVHPGWVEASALLYRRANEQDFAIK 994
              TH++    GT D++  A+   G ++++  W+    + Y R  E  ++I+
Sbjct: 323 PYTTHIIVGEGGTDDQIQKAMEYRGVYIIYLKWLFECFIQYARIEESLYSIQ 374


>gi|440638319|gb|ELR08238.1| hypothetical protein GMDG_03040 [Geomyces destructans 20631-21]
          Length = 1765

 Score =  100 bits (250), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 67/198 (33%), Positives = 105/198 (53%), Gaps = 25/198 (12%)

Query: 656 DQQKAAIQKERTRRLEEQ--KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKE 713
           DQ   ++ ++   R EEQ  +++   RKL LV+DLD T++++     ++P   E  R   
Sbjct: 134 DQTLLSVSQDEASRAEEQLQRRLLKNRKLSLVVDLDQTIIHAC----IEPTIGEWQRDPT 189

Query: 714 EQDRE--------KPHRHLFRFPHMGMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLY 763
             + E        + H    R    G W   K+RPG+  FL   ++ +E+H+YTMG + Y
Sbjct: 190 SPNYEAVKDVKSFQLHDDGPRGLASGCWYYIKMRPGLAHFLTTIAEKYELHVYTMGTRAY 249

Query: 764 ATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMES-AVVIIDDSVR 822
           A E+AK++DP+  LF  R+ISR ++G           +K L  +  +++  VVIIDD   
Sbjct: 250 AQEIAKIVDPEHKLFGDRIISRDENGS--------LTAKTLSRLFPVDTKMVVIIDDRAD 301

Query: 823 VWPHNKLNLIVVERYTYF 840
           VWP N+ NLI V  Y +F
Sbjct: 302 VWPRNRSNLIKVVPYDFF 319


>gi|410294550|gb|JAA25875.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
           phosphatase, subunit 1 [Pan troglodytes]
          Length = 961

 Score =  100 bits (250), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 730
           E+Q+++   RKL L++DLD TL+++ + H     +  I            H  L R   M
Sbjct: 172 EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 220

Query: 731 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 790
            + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 221 -LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 279

Query: 791 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           PF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 280 PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 323



 Score = 48.9 bits (115), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 36/139 (25%), Positives = 67/139 (48%), Gaps = 23/139 (16%)

Query: 874 VIERLHKIFFS------HQSLDDV-DVRNILAAEQRKILAGCRIVFSRV----FPVGEAN 922
           ++ R+H  +++      ++ +++  D+R I+   + K+LA   I+FS +    FP+ +  
Sbjct: 596 ILVRVHTDYYAKYDRYLNKEIEEAPDIRKIVPELKSKVLADVAIIFSGLHPTNFPIEKTR 655

Query: 923 PHLHPLWQTAEQFGA------VCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGW 975
            H H     A   GA      V +    D+ TH++A   GT+KV  A   G   VV+P W
Sbjct: 656 EHYH-----ATALGAKILTRLVLSPDAPDRATHLIAARAGTEKVLQAQECGHLHVVNPDW 710

Query: 976 VEASALLYRRANEQDFAIK 994
           + +    + +  EQ F ++
Sbjct: 711 LWSCLERWDKVEEQLFPLR 729


>gi|39645774|gb|AAH63447.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
           phosphatase, subunit 1 [Homo sapiens]
          Length = 867

 Score =  100 bits (250), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 730
           E+Q+++   RKL L++DLD TL+++ + H     +  I            H  L R   M
Sbjct: 172 EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 220

Query: 731 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 790
            + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 221 -LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 279

Query: 791 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           PF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 280 PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 323



 Score = 48.1 bits (113), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 36/139 (25%), Positives = 67/139 (48%), Gaps = 23/139 (16%)

Query: 874 VIERLHKIFFS------HQSLDDV-DVRNILAAEQRKILAGCRIVFSRV----FPVGEAN 922
           ++ R+H  +++      ++ +++  D+R I+   + K+LA   I+FS +    FP+ +  
Sbjct: 596 ILVRVHTDYYAKYDRYLNKEIEEAPDIRKIVPELKSKVLADVAIIFSGLHPTNFPIEKTR 655

Query: 923 PHLHPLWQTAEQFGA------VCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGW 975
            H H     A   GA      V +    D+ TH++A   GT+KV  A   G   VV+P W
Sbjct: 656 EHYH-----ATALGAKILTRLVLSPDAPDRATHLIAARAGTEKVLQAQECGHLHVVNPDW 710

Query: 976 VEASALLYRRANEQDFAIK 994
           + +    + +  EQ F ++
Sbjct: 711 LWSCLERWDKVEEQLFPLR 729


>gi|395511850|ref|XP_003760164.1| PREDICTED: RNA polymerase II subunit A C-terminal domain
           phosphatase [Sarcophilus harrisii]
          Length = 1267

 Score =  100 bits (250), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 730
           E+Q+++   RKL L++DLD TL+++ + H     +  I            H  L R   M
Sbjct: 453 EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 501

Query: 731 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 790
            + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 502 -LHTRLRPHCKEFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 560

Query: 791 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           PF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 561 PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 604



 Score = 47.0 bits (110), Expect = 0.060,   Method: Compositional matrix adjust.
 Identities = 35/114 (30%), Positives = 54/114 (47%), Gaps = 16/114 (14%)

Query: 892  DVRNILAAEQRKILAGCRIVFSRV----FPVGEANPHLHPLWQTAEQFGAVCTKHI---- 943
            D+R I+   + K+LA   I+FS V    +P+     H H     A   GA   K++    
Sbjct: 914  DIRKIVPELKSKVLADVTIIFSGVCPTNYPIERTREHYH-----ATALGAKINKNLILNA 968

Query: 944  DD--QVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAIK 994
            DD  + TH++A   GT+KV  A       VV+P W+ +    + +  EQ F +K
Sbjct: 969  DDPNKATHLIAARAGTEKVRQAQECKHLHVVNPDWLWSCLERWDKVEEQLFPLK 1022


>gi|147770504|emb|CAN75676.1| hypothetical protein VITISV_003260 [Vitis vinifera]
          Length = 205

 Score =  100 bits (250), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 56/139 (40%), Positives = 79/139 (56%), Gaps = 8/139 (5%)

Query: 758 MGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVII 817
           MG + YA EM KVLDP+ V F+  VIS+ D              K L+ VLG +S V+I+
Sbjct: 1   MGEQFYALEMVKVLDPRTVYFSSSVISQADSTQR--------HQKGLDVVLGPKSXVLIL 52

Query: 818 DDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIER 877
           DD+ R W ++K NLI++ERY +F  S  QFG    SL E+  DE   DG LA+ L V+++
Sbjct: 53  DDTERAWKNHKDNLILMERYHFFASSCHQFGFHCKSLSELKSDESEPDGALATILKVLQQ 112

Query: 878 LHKIFFSHQSLDDVDVRNI 896
            H   F  +  D+   R++
Sbjct: 113 THSTLFDPELSDNFSGRDV 131


>gi|67188550|ref|NP_430255.2| RNA polymerase II subunit A C-terminal domain phosphatase isoform 2
           [Homo sapiens]
 gi|119587035|gb|EAW66631.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
           phosphatase, subunit 1, isoform CRA_d [Homo sapiens]
          Length = 867

 Score =  100 bits (250), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 730
           E+Q+++   RKL L++DLD TL+++ + H     +  I            H  L R   M
Sbjct: 172 EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 220

Query: 731 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 790
            + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 221 -LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 279

Query: 791 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           PF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 280 PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 323



 Score = 48.1 bits (113), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 36/139 (25%), Positives = 67/139 (48%), Gaps = 23/139 (16%)

Query: 874 VIERLHKIFFS------HQSLDDV-DVRNILAAEQRKILAGCRIVFSRV----FPVGEAN 922
           ++ R+H  +++      ++ +++  D+R I+   + K+LA   I+FS +    FP+ +  
Sbjct: 596 ILVRVHTDYYAKYDRYLNKEIEEAPDIRKIVPELKSKVLADVAIIFSGLHPTNFPIEKTR 655

Query: 923 PHLHPLWQTAEQFGA------VCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGW 975
            H H     A   GA      V +    D+ TH++A   GT+KV  A   G   VV+P W
Sbjct: 656 EHYH-----ATALGAKILTRLVLSPDAPDRATHLIAARAGTEKVLQAQECGHLHVVNPDW 710

Query: 976 VEASALLYRRANEQDFAIK 994
           + +    + +  EQ F ++
Sbjct: 711 LWSCLERWDKVEEQLFPLR 729


>gi|426386293|ref|XP_004059621.1| PREDICTED: RNA polymerase II subunit A C-terminal domain
           phosphatase isoform 1 [Gorilla gorilla gorilla]
 gi|426386295|ref|XP_004059622.1| PREDICTED: RNA polymerase II subunit A C-terminal domain
           phosphatase isoform 2 [Gorilla gorilla gorilla]
          Length = 842

 Score =  100 bits (250), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 730
           E+Q+++   RKL L++DLD TL+++ + H     +  I            H  L R   M
Sbjct: 53  EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 101

Query: 731 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 790
            + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 102 -LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 160

Query: 791 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           PF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 161 PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 204



 Score = 48.9 bits (115), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 36/139 (25%), Positives = 67/139 (48%), Gaps = 23/139 (16%)

Query: 874 VIERLHKIFFS------HQSLDDV-DVRNILAAEQRKILAGCRIVFSRV----FPVGEAN 922
           ++ R+H  +++      ++ +++  D+R I+   + K+LA   I+FS +    FP+ +  
Sbjct: 477 ILVRVHTDYYAKYDRYLNKEIEEAPDIRKIVPELKSKVLADVAIIFSGLHPTNFPIEKTR 536

Query: 923 PHLHPLWQTAEQFGA------VCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGW 975
            H H     A   GA      V +    D+ TH++A   GT+KV  A   G   VV+P W
Sbjct: 537 EHYH-----ATALGAKILTRLVLSPDAPDRATHLIAARAGTEKVLQAQECGHLHVVNPDW 591

Query: 976 VEASALLYRRANEQDFAIK 994
           + +    + +  EQ F ++
Sbjct: 592 LWSCLERWDKVEEQLFPLR 610


>gi|431907029|gb|ELK11148.1| RNA polymerase II subunit A C-terminal domain phosphatase [Pteropus
           alecto]
          Length = 918

 Score =  100 bits (250), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 62/171 (36%), Positives = 96/171 (56%), Gaps = 20/171 (11%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 730
           E+Q+++   RKL L++DLD TL+++ + H     +  IL           H  L R   M
Sbjct: 145 EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQRMSNKGIL-----------HFQLGRGEPM 193

Query: 731 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 790
            + T+LRP    FLE+ ++L+E+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 194 -LHTRLRPHCREFLEKVARLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 252

Query: 791 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           PF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 253 PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 296



 Score = 51.2 bits (121), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 38/138 (27%), Positives = 64/138 (46%), Gaps = 23/138 (16%)

Query: 874 VIERLHKIFFSH-------QSLDDVDVRNILAAEQRKILAGCRIVFSRV----FPVGEAN 922
           ++ R+H  +++        ++ +  D+R I+   + ++LA   I+FS +    FPV +  
Sbjct: 553 ILARVHSDYYARYDRYLRGEAPEAPDIRKIVPELKSRVLADVAIIFSGLHPTNFPVEKTR 612

Query: 923 PHLHPLWQTAEQFGA------VCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGW 975
            H H     A   GA      V      D+ TH++A   GT+KV  A   GR  VV+P W
Sbjct: 613 EHYH-----ATALGAKILTQLVLDPDAPDRATHLIAARAGTEKVRQAQECGRLHVVNPDW 667

Query: 976 VEASALLYRRANEQDFAI 993
           + +    + R  EQ F +
Sbjct: 668 LWSCLERWDRVEEQLFPL 685


>gi|363730338|ref|XP_418905.3| PREDICTED: RNA polymerase II subunit A C-terminal domain
           phosphatase [Gallus gallus]
          Length = 958

 Score =  100 bits (250), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 730
           E+Q+++   RKL L++DLD TL+++ + H     +  I            H  L R   M
Sbjct: 137 EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 185

Query: 731 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 790
            + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 186 -LHTRLRPHCKEFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 244

Query: 791 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           PF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 245 PFS------KTGNLRDLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 288



 Score = 49.7 bits (117), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 36/139 (25%), Positives = 67/139 (48%), Gaps = 23/139 (16%)

Query: 874 VIERLHKIFFSH------QSLDDV-DVRNILAAEQRKILAGCRIVFSRV----FPVGEAN 922
           ++ R+H  +++       + ++++ D+R I+   + K+LA   I+FS +    FP+ +  
Sbjct: 576 ILVRVHTDYYTKYDKYLKKEIEEIPDIRKIVPELKSKVLADVTIIFSGLYPTNFPIEKTR 635

Query: 923 PHLHPLWQTAEQFGAVCTKHI------DDQVTHVVANSLGTDKVNWALSTGRF-VVHPGW 975
            H H     A   GA   K++       D+ TH++A   GT+KV  A       VV+P W
Sbjct: 636 EHYH-----ATALGAKIVKNLVLSADDPDKATHLIAARTGTEKVRQAQDCKDLHVVNPDW 690

Query: 976 VEASALLYRRANEQDFAIK 994
           + +    + +  EQ F +K
Sbjct: 691 LWSCLERWDKVEEQLFPLK 709


>gi|410215194|gb|JAA04816.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
           phosphatase, subunit 1 [Pan troglodytes]
 gi|410254644|gb|JAA15289.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
           phosphatase, subunit 1 [Pan troglodytes]
 gi|410331971|gb|JAA34932.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
           phosphatase, subunit 1 [Pan troglodytes]
          Length = 961

 Score =  100 bits (250), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 730
           E+Q+++   RKL L++DLD TL+++ + H     +  I            H  L R   M
Sbjct: 172 EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 220

Query: 731 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 790
            + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 221 -LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 279

Query: 791 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           PF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 280 PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 323



 Score = 48.9 bits (115), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 36/139 (25%), Positives = 67/139 (48%), Gaps = 23/139 (16%)

Query: 874 VIERLHKIFFS------HQSLDDV-DVRNILAAEQRKILAGCRIVFSRV----FPVGEAN 922
           ++ R+H  +++      ++ +++  D+R I+   + K+LA   I+FS +    FP+ +  
Sbjct: 596 ILVRVHTDYYAKYDRYLNKEIEEAPDIRKIVPELKSKVLADVAIIFSGLHPTNFPIEKTR 655

Query: 923 PHLHPLWQTAEQFGA------VCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGW 975
            H H     A   GA      V +    D+ TH++A   GT+KV  A   G   VV+P W
Sbjct: 656 EHYH-----ATALGAKILTRLVLSPDAPDRATHLIAARAGTEKVLQAQECGHLHVVNPDW 710

Query: 976 VEASALLYRRANEQDFAIK 994
           + +    + +  EQ F ++
Sbjct: 711 LWSCLERWDKVEEQLFPLR 729


>gi|389637610|ref|XP_003716438.1| RNA polymerase II subunit A domain phosphatase [Magnaporthe oryzae
           70-15]
 gi|351642257|gb|EHA50119.1| RNA polymerase II subunit A domain phosphatase [Magnaporthe oryzae
           70-15]
 gi|440471327|gb|ELQ40350.1| RNA polymerase II subunit A C-terminal domain phosphatase
           [Magnaporthe oryzae Y34]
 gi|440487323|gb|ELQ67117.1| RNA polymerase II subunit A C-terminal domain phosphatase
           [Magnaporthe oryzae P131]
          Length = 866

 Score =  100 bits (250), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 63/177 (35%), Positives = 92/177 (51%), Gaps = 16/177 (9%)

Query: 670 LEEQKKMFSARKLCLVLDLDHTLLNSAKFHEV-----DPVHDEILRKKEEQDREKPHRHL 724
           LE QK++ + RKL LV+DLD T++ +A    +     DP +      KE +  E P    
Sbjct: 159 LEMQKRLVAQRKLVLVVDLDQTVIQTACEPTIGEWQKDPSNPNYEALKEVRSFELPSEDG 218

Query: 725 FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 784
            R  +   + K RPG   FL + S LFEMH+YTM  + YA  + +++DPK  LF  RVIS
Sbjct: 219 PRRNYT-YYVKCRPGTHEFLNKVSNLFEMHVYTMATRAYAEHILRIIDPKKNLFGNRVIS 277

Query: 785 RGDDGDPFDGDERV-PKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           R ++       +R+ P S  +         V +IDD   VWP N+ N+I V  Y ++
Sbjct: 278 RNENKGIEKTLQRIFPTSTKM---------VAVIDDRTDVWPQNRSNVIKVVPYNFY 325


>gi|355702027|gb|EHH29380.1| RNA polymerase II subunit A C-terminal domain phosphatase, partial
           [Macaca mulatta]
          Length = 861

 Score =  100 bits (250), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 730
           E+Q+++   RKL L++DLD TL+++ + H     +  I            H  L R   M
Sbjct: 67  EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 115

Query: 731 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 790
            + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 116 -LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 174

Query: 791 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           PF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 175 PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 218



 Score = 48.5 bits (114), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 35/124 (28%), Positives = 58/124 (46%), Gaps = 16/124 (12%)

Query: 882 FFSHQSLDDVDVRNILAAEQRKILAGCRIVFSRV----FPVGEANPHLHPLWQTAEQFGA 937
           F + +  +  D+R I+   + K+LA   I+FS +    FP+ +   H H     A   GA
Sbjct: 511 FLNKEIEEAPDIRKIVPELKSKVLADVAIIFSGLHPTNFPIEKTREHYH-----ATALGA 565

Query: 938 ------VCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQD 990
                 V +    D+ TH++A   GT+KV  A   G   VV+P W+ +    + +  EQ 
Sbjct: 566 KILTRLVLSPDAPDRATHLIAARAGTEKVLQAQECGHLHVVNPDWLWSCLERWDKVEEQL 625

Query: 991 FAIK 994
           F ++
Sbjct: 626 FPLR 629


>gi|355755122|gb|EHH58989.1| RNA polymerase II subunit A C-terminal domain phosphatase, partial
           [Macaca fascicularis]
          Length = 861

 Score =  100 bits (250), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 730
           E+Q+++   RKL L++DLD TL+++ + H     +  I            H  L R   M
Sbjct: 67  EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 115

Query: 731 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 790
            + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 116 -LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 174

Query: 791 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           PF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 175 PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 218



 Score = 48.9 bits (115), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 35/124 (28%), Positives = 58/124 (46%), Gaps = 16/124 (12%)

Query: 882 FFSHQSLDDVDVRNILAAEQRKILAGCRIVFSRV----FPVGEANPHLHPLWQTAEQFGA 937
           F + +  +  D+R I+   + K+LA   I+FS +    FP+ +   H H     A   GA
Sbjct: 511 FLNKEIEEAPDIRKIVPELKSKVLADVAIIFSGLHPTNFPIEKTREHYH-----ATALGA 565

Query: 938 ------VCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQD 990
                 V +    D+ TH++A   GT+KV  A   G   VV+P W+ +    + +  EQ 
Sbjct: 566 KILTRLVLSPDAPDRATHLIAARAGTEKVLQAQECGHLHVVNPDWLWSCLERWDKVEEQL 625

Query: 991 FAIK 994
           F ++
Sbjct: 626 FPLR 629


>gi|441603466|ref|XP_004087808.1| PREDICTED: LOW QUALITY PROTEIN: RNA polymerase II subunit A
           C-terminal domain phosphatase [Nomascus leucogenys]
          Length = 1236

 Score =  100 bits (250), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 65/179 (36%), Positives = 99/179 (55%), Gaps = 21/179 (11%)

Query: 663 QKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR 722
           Q E+  R E+Q+++   RKL L++DLD TL+++ + H     +  I            H 
Sbjct: 165 QAEQLGR-EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HF 212

Query: 723 HLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 782
            L R   M + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R+
Sbjct: 213 QLGRGEPM-LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRI 271

Query: 783 ISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           +SR +  DPF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 272 LSRDECIDPFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 323



 Score = 48.1 bits (113), Expect = 0.026,   Method: Compositional matrix adjust.
 Identities = 35/124 (28%), Positives = 58/124 (46%), Gaps = 16/124 (12%)

Query: 882 FFSHQSLDDVDVRNILAAEQRKILAGCRIVFSRV----FPVGEANPHLHPLWQTAEQFGA 937
           + S +  +  D+R I+   + K+LA   I+FS +    FP+ +   H H     A   GA
Sbjct: 612 YLSKEIEEAPDIRKIVPELKSKVLADVAIIFSGLHPTNFPIEKTREHYH-----ATALGA 666

Query: 938 ------VCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQD 990
                 V +    D+ TH++A   GT+KV  A   G   VV+P W+ +    + +  EQ 
Sbjct: 667 KILTRLVLSPDAPDRATHLIAARAGTEKVLQAQECGHLHVVNPDWLWSCLERWDKVEEQL 726

Query: 991 FAIK 994
           F ++
Sbjct: 727 FPLR 730


>gi|340518072|gb|EGR48314.1| predicted protein [Trichoderma reesei QM6a]
          Length = 594

 Score =  100 bits (250), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 63/177 (35%), Positives = 95/177 (53%), Gaps = 19/177 (10%)

Query: 673 QKKMFSARKLCLVLDLDHTLLNSA--------KFHEVDPVHDEILRKKEEQDREKPHRHL 724
           QK++   RKL LV+DLD T++++         +    +P H+ +   K  Q  +   R L
Sbjct: 150 QKRLLRQRKLSLVVDLDQTIIHACIEPTIGEWQRDPTNPNHEAVKDVKSFQLNDDGPRGL 209

Query: 725 FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 784
                   + KLRPG+  FLE  S  +E+H+YTMG + YA  +A+++DP   LF  RVIS
Sbjct: 210 A--SGCTYYIKLRPGLKEFLEAVSTKYELHVYTMGTRAYALNIARIVDPDKKLFGNRVIS 267

Query: 785 RGDDGDPFDGDERVPKSKDLEGVLGMES-AVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           R ++G           +K L+ +  + +  VVIIDD   VWP+N+ NLI V  Y +F
Sbjct: 268 RDENGS--------ITAKSLQRLFPVSTDMVVIIDDRADVWPNNRPNLIKVAPYDFF 316


>gi|358418617|ref|XP_003583993.1| PREDICTED: RNA polymerase II subunit A C-terminal domain
           phosphatase-like [Bos taurus]
          Length = 864

 Score =  100 bits (249), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 64/179 (35%), Positives = 99/179 (55%), Gaps = 21/179 (11%)

Query: 663 QKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR 722
           Q E+  R E+Q+++   RKL L++DLD TL+++ + H     +  I            H 
Sbjct: 162 QAEKLGR-EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HF 209

Query: 723 HLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 782
            L R   M + T+LRP    FLE+ ++L+E+H++T G++LYA  +A  LDP+  LF+ R+
Sbjct: 210 QLGRGEPM-LHTRLRPHCKEFLEKVARLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRI 268

Query: 783 ISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           +SR +  DPF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 269 LSRDECIDPFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 320



 Score = 52.0 bits (123), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 39/124 (31%), Positives = 61/124 (49%), Gaps = 16/124 (12%)

Query: 882 FFSHQSLDDVDVRNILAAEQRKILAGCRIVFSRV----FPVGEANPHLHPLWQTAEQFGA 937
           F S +S +  D+R I+   + K+LA   I+FS +    FPV     H H     A   GA
Sbjct: 588 FLSGESPEAPDIRKIVPELRSKVLADVAIIFSGLHPTNFPVERTREHYH-----ARALGA 642

Query: 938 -VCTKHIDD-----QVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQD 990
            + T+ + D     + TH++A   GT+KV  A   G+  VV+P W+ +    + R  EQ 
Sbjct: 643 RILTQLVLDPDDPARPTHLIAARAGTEKVRQAQECGQLHVVNPDWLWSCLERWDRVEEQL 702

Query: 991 FAIK 994
           F ++
Sbjct: 703 FPLR 706


>gi|345324709|ref|XP_001509122.2| PREDICTED: RNA polymerase II subunit A C-terminal domain
           phosphatase [Ornithorhynchus anatinus]
          Length = 1168

 Score =  100 bits (249), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 64/182 (35%), Positives = 101/182 (55%), Gaps = 22/182 (12%)

Query: 662 IQKERTRRL--EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREK 719
           +  E+ ++L  E+Q+++   RKL L++DLD TL+++ + H     +  I           
Sbjct: 167 VSSEQAKQLGREDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF---------- 216

Query: 720 PHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA 779
            H  L R   M + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+
Sbjct: 217 -HFQLGRGEPM-LHTRLRPHCKEFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFS 274

Query: 780 GRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYT 838
            R++SR +  DPF       K+ +L  +    +S V IIDD   VW     NLI V++Y 
Sbjct: 275 HRILSRDECIDPFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYV 327

Query: 839 YF 840
           YF
Sbjct: 328 YF 329



 Score = 47.4 bits (111), Expect = 0.037,   Method: Compositional matrix adjust.
 Identities = 37/139 (26%), Positives = 65/139 (46%), Gaps = 23/139 (16%)

Query: 874 VIERLHKIFFS------HQSLDDV-DVRNILAAEQRKILAGCRIVFSRV----FPVGEAN 922
           ++ R+H  +++       Q + +V D+R I+   + K+LA   I+FS +    FP+    
Sbjct: 588 ILVRVHTDYYAKYDKYLRQEIQEVPDIRKIVPELKSKVLADVTIIFSGLYPTNFPMERTR 647

Query: 923 PHLHPLWQTAEQFGAVCTKHI------DDQVTHVVANSLGTDKVNWALSTGRF-VVHPGW 975
            H H     A   GA   K++       ++ TH++A   GT+KV  A       VV+P W
Sbjct: 648 EHYH-----ATALGAKINKNLILSADNPNRATHLIAARAGTEKVRQAQDCKHLHVVNPDW 702

Query: 976 VEASALLYRRANEQDFAIK 994
           + +    + +  EQ F +K
Sbjct: 703 LWSCLERWDKVEEQLFPLK 721


>gi|123490666|ref|XP_001325656.1| NLI interacting factor-like phosphatase family protein [Trichomonas
           vaginalis G3]
 gi|121908559|gb|EAY13433.1| NLI interacting factor-like phosphatase family protein [Trichomonas
           vaginalis G3]
          Length = 474

 Score =  100 bits (249), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 88/349 (25%), Positives = 155/349 (44%), Gaps = 37/349 (10%)

Query: 661 AIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKP 720
           + ++ R R L+E++++  A+KL LV+DLD TL+++ +  +    H E+    E   +  P
Sbjct: 45  SFEEARNRNLQEEQRLIDAKKLSLVIDLDKTLIDTTEVRD----HSEV----EAIKKLDP 96

Query: 721 HR---HLFRFP-HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 776
           H      F F  +  +  + RP +  FL   +  F++ +YT+    YA  +   +DP   
Sbjct: 97  HATEDDFFEFNMNQNLLIRYRPHVREFLASIAPYFDLQIYTLALPSYAHAILSKIDPDDK 156

Query: 777 LFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-------ESAVVIIDDSVRVW--PHN 827
           LF  R+ SR  +      +E +    D+     +       +  V+++DDS  VW    N
Sbjct: 157 LFKNRIFSRTAEDFAMLREEAMRNRTDIVHKKNIKKLFPYSDKLVLVLDDSPEVWYCDDN 216

Query: 828 KL--NLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSH 885
           KL   L+ ++RY+YF    RQ    GP+     + +  ED  L     V+  +H +F+ +
Sbjct: 217 KLFKGLVQIKRYSYFT---RQ----GPNFPPTVNPDYVEDDILIQMRSVLIEVHDLFYKN 269

Query: 886 QSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDD 945
              +   V   L   + ++  G    FS     G ++         AE+FGA+       
Sbjct: 270 YDPEQSHVIMTLHQRKAQVFEGKTFYFS-----GLSDADARSFTYLAEEFGALVVDSFTP 324

Query: 946 QVTHVVANSLGTD-KVNWALS-TGRFVVHPGWVEASALLYRRANEQDFA 992
             TH++    G D +V  AL   G +V++  W+    + Y R  E  +A
Sbjct: 325 YTTHIIVGEGGADEEVQKALKYNGVYVIYLKWLFECFIQYARLEESTYA 373


>gi|444518074|gb|ELV11938.1| RNA polymerase II subunit A C-terminal domain phosphatase [Tupaia
           chinensis]
          Length = 876

 Score =  100 bits (248), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 730
           E+Q+++   RKL L++DLD TL+++ + H     +  I            H  L R   M
Sbjct: 17  EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCAQMSNRGIF-----------HFQLGRGEPM 65

Query: 731 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 790
            + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 66  -LHTRLRPHCKDFLEKVAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 124

Query: 791 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           PF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 125 PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 168



 Score = 50.4 bits (119), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 36/114 (31%), Positives = 55/114 (48%), Gaps = 16/114 (14%)

Query: 892 DVRNILAAEQRKILAGCRIVFSRV----FPVGEANPHLHPLWQTAEQFGA------VCTK 941
           D+R I+   + K+LA   I+FS +    FPV +   H H     A   GA      V + 
Sbjct: 444 DIRRIVPELRSKVLADVAILFSGLHPTNFPVEKTREHYH-----ATALGAKILTQLVLSP 498

Query: 942 HIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAIK 994
              D+ TH++A   GT+KV  A  +G   VV PGW+ +    + +  EQ F ++
Sbjct: 499 DAPDRATHLIAARAGTEKVRQAQESGHVHVVSPGWLWSCLERWDKVEEQLFPLR 552


>gi|403268140|ref|XP_003926140.1| PREDICTED: RNA polymerase II subunit A C-terminal domain
           phosphatase [Saimiri boliviensis boliviensis]
          Length = 937

 Score =  100 bits (248), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 730
           E+Q+++   RKL L++DLD TL+++ + H     +  I            H  L R   M
Sbjct: 147 EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 195

Query: 731 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 790
            + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 196 -LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 254

Query: 791 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           PF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 255 PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 298



 Score = 48.9 bits (115), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 35/124 (28%), Positives = 58/124 (46%), Gaps = 16/124 (12%)

Query: 882 FFSHQSLDDVDVRNILAAEQRKILAGCRIVFSRV----FPVGEANPHLHPLWQTAEQFGA 937
           F + +  +  D+R I+   + K+LA   I+FS +    FP+ +   H H     A   GA
Sbjct: 587 FLNKEIEEAPDIRKIVPELKSKVLADVAIIFSGLHPTNFPIEKTREHYH-----ATALGA 641

Query: 938 ------VCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQD 990
                 V +    D+ TH++A   GT+KV  A   G   VV+P W+ +    + +  EQ 
Sbjct: 642 KILTRLVLSPDAPDRATHLIAARAGTEKVLQAQECGHLHVVNPDWLWSCLERWDKVEEQL 701

Query: 991 FAIK 994
           F ++
Sbjct: 702 FPLR 705


>gi|449493392|ref|XP_002190004.2| PREDICTED: RNA polymerase II subunit A C-terminal domain
           phosphatase [Taeniopygia guttata]
          Length = 871

 Score =  100 bits (248), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 730
           E+Q+++   RKL L++DLD TL+++ + H     +  I            H  L R   M
Sbjct: 53  EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 101

Query: 731 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 790
            + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 102 -LHTRLRPHCKEFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 160

Query: 791 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           PF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 161 PF------SKTGNLRDLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 204



 Score = 48.9 bits (115), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 36/139 (25%), Positives = 67/139 (48%), Gaps = 23/139 (16%)

Query: 874 VIERLHKIFFSH------QSLDDV-DVRNILAAEQRKILAGCRIVFSRV----FPVGEAN 922
           ++ R+H  +++       + ++++ D+R I+   + K+LA   I+FS +    FP+ +  
Sbjct: 492 ILVRVHTDYYTKYDKYLKKEIEEIPDIRKIVPELKSKVLADVTIIFSGLYPTNFPIEKTR 551

Query: 923 PHLHPLWQTAEQFGAVCTKHI------DDQVTHVVANSLGTDKVNWALSTGRF-VVHPGW 975
            H H     A   GA   K++       D+ TH++A   GT+KV  A       VV+P W
Sbjct: 552 EHYH-----ATALGAKIVKNLVLSADDPDKATHLIAARTGTEKVRQAQDCKDLHVVNPDW 606

Query: 976 VEASALLYRRANEQDFAIK 994
           + +    + +  EQ F +K
Sbjct: 607 LWSCLERWDKVEEQLFPLK 625


>gi|47217775|emb|CAG05997.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 979

 Score =  100 bits (248), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 65/182 (35%), Positives = 101/182 (55%), Gaps = 21/182 (11%)

Query: 660 AAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREK 719
           +A Q E+  R E+Q+++   +KL L++DLD TL+++ + H     +  I           
Sbjct: 156 SAEQAEQLGR-EDQQRLHRNKKLVLMVDLDQTLIHTTEQHCHRMSNKGIF---------- 204

Query: 720 PHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA 779
            H  L R   M + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+
Sbjct: 205 -HFQLGRGEPM-LHTRLRPHCKEFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFS 262

Query: 780 GRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYT 838
            R++SR +  DPF       K+ +L  +    +S V IIDD   VW     NLI V++Y 
Sbjct: 263 HRILSRDECIDPFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYV 315

Query: 839 YF 840
           YF
Sbjct: 316 YF 317



 Score = 49.7 bits (117), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 36/140 (25%), Positives = 67/140 (47%), Gaps = 25/140 (17%)

Query: 874 VIERLHKIFF-------SHQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLH 926
           V+ER+H  ++         ++ +  D+R I+   + K L G  IVFS ++P        +
Sbjct: 571 VLERVHAEYYVRYEAYQRKEASESPDIRKIVPELKSKTLEGATIVFSGLYPTN------Y 624

Query: 927 PLWQTAEQF-----GAVCTKHI------DDQVTHVVANSLGTDKVNWALSTGRF-VVHPG 974
           P+ +T E +     GA  ++++        + TH++A   GT+KV  A    +  VV+P 
Sbjct: 625 PIEKTREYYHAKALGAKISRNLVLSSKDPGRTTHLIAARAGTEKVRQAQGCKQLQVVNPD 684

Query: 975 WVEASALLYRRANEQDFAIK 994
           W+ +    + R  EQ + +K
Sbjct: 685 WLWSCLERWERVEEQLYPLK 704


>gi|119491655|ref|XP_001263322.1| RNA Polymerase II CTD phosphatase Fcp1, putative [Neosartorya
           fischeri NRRL 181]
 gi|119411482|gb|EAW21425.1| RNA Polymerase II CTD phosphatase Fcp1, putative [Neosartorya
           fischeri NRRL 181]
          Length = 824

 Score = 99.8 bits (247), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 69/197 (35%), Positives = 109/197 (55%), Gaps = 23/197 (11%)

Query: 654 YDDQQKAAIQKERTRRLEEQKK-MFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKK 712
           +D+      +KE TR  E+ K+ + + RKL LV+DLD T++++     VDP   E +  K
Sbjct: 131 HDNTALTVSEKEATRVEEDAKRRLLANRKLSLVVDLDQTIIHAT----VDPTVGEWMEDK 186

Query: 713 EEQDRE-----KPHRHLFRFPHM-GMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYA 764
           +  + E     +  + +   P M G W   KLRPG+ +FL+  S+LFE+H+YTMG + YA
Sbjct: 187 DNPNHEALSDVRAFQLVDEGPGMRGCWYYVKLRPGLESFLQNVSELFELHIYTMGTRAYA 246

Query: 765 TEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRV 823
             +A ++DP   LF  R++SR + G           +K+L+ +  +++  VVIIDD   V
Sbjct: 247 QHIAGIIDPDRKLFGDRILSRDESGS--------LTAKNLQRLFPVDTKMVVIIDDRGDV 298

Query: 824 WPHNKLNLIVVERYTYF 840
           W  +  NLI V  Y +F
Sbjct: 299 WRWSP-NLIKVSPYDFF 314



 Score = 47.0 bits (110), Expect = 0.051,   Method: Compositional matrix adjust.
 Identities = 32/104 (30%), Positives = 53/104 (50%), Gaps = 4/104 (3%)

Query: 892 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLH-PLWQTAEQFGAVCTKHIDDQVTHV 950
           DV++I+   + ++L G  +VFS V P+G    +    LW  A+ FGAV +  I+ + TH+
Sbjct: 500 DVKDIMPRIKHRVLGGVVLVFSGVLPLGTDTQNADISLW--AKSFGAVISTKINMRTTHL 557

Query: 951 VANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAI 993
           VA    T KV  A       +V   W+  S   ++R  E+ + +
Sbjct: 558 VAGRNRTAKVREATRYPNIKIVTTQWLLDSLTQWKRLEEEPYLL 601


>gi|359079164|ref|XP_003587804.1| PREDICTED: RNA polymerase II subunit A C-terminal domain
           phosphatase-like [Bos taurus]
          Length = 994

 Score = 99.8 bits (247), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 64/179 (35%), Positives = 99/179 (55%), Gaps = 21/179 (11%)

Query: 663 QKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR 722
           Q E+  R E+Q+++   RKL L++DLD TL+++ + H     +  I            H 
Sbjct: 162 QAEKLGR-EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HF 209

Query: 723 HLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 782
            L R   M + T+LRP    FLE+ ++L+E+H++T G++LYA  +A  LDP+  LF+ R+
Sbjct: 210 QLGRGEPM-LHTRLRPHCKEFLEKVARLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRI 268

Query: 783 ISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           +SR +  DPF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 269 LSRDECIDPFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 320



 Score = 52.0 bits (123), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 39/124 (31%), Positives = 61/124 (49%), Gaps = 16/124 (12%)

Query: 882 FFSHQSLDDVDVRNILAAEQRKILAGCRIVFSRV----FPVGEANPHLHPLWQTAEQFGA 937
           F S +S +  D+R I+   + K+LA   I+FS +    FPV     H H     A   GA
Sbjct: 588 FLSGESPEAPDIRKIVPELRSKVLADVAIIFSGLHPTNFPVERTREHYH-----ARALGA 642

Query: 938 -VCTKHIDD-----QVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQD 990
            + T+ + D     + TH++A   GT+KV  A   G+  VV+P W+ +    + R  EQ 
Sbjct: 643 RILTQLVLDPDDPARPTHLIAARAGTEKVRQAQECGQLHVVNPDWLWSCLERWDRVEEQL 702

Query: 991 FAIK 994
           F ++
Sbjct: 703 FPLR 706


>gi|432884093|ref|XP_004074439.1| PREDICTED: RNA polymerase II subunit A C-terminal domain
           phosphatase-like [Oryzias latipes]
          Length = 1129

 Score = 99.8 bits (247), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 20/171 (11%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 730
           E+Q+++   RKL L++DLD TL+++ + H     +  I            H  L R   M
Sbjct: 162 EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 210

Query: 731 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 790
            + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 211 -LHTRLRPHCKEFLEKTAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 269

Query: 791 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           PF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 270 PFS------KTGNLRYLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 313



 Score = 43.9 bits (102), Expect = 0.44,   Method: Compositional matrix adjust.
 Identities = 34/140 (24%), Positives = 65/140 (46%), Gaps = 25/140 (17%)

Query: 874 VIERLHKIFFSH-------QSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLH 926
           V+ER+H  +++        ++    D+R I+   + K L    IVFS ++P        +
Sbjct: 557 VLERIHAEYYARYEAYLRKETAQMPDIRKIVPELKSKTLEDATIVFSGLYPTN------Y 610

Query: 927 PLWQTAEQF-----GAVCTKHI------DDQVTHVVANSLGTDKVNWALSTGRF-VVHPG 974
           P+ +T E +     GA  ++++        + TH++A   GT+KV          VV+P 
Sbjct: 611 PMDRTREYYHAKALGAKVSRNLILNPQDPGRTTHLIAARAGTEKVRQGQGYKHLHVVNPD 670

Query: 975 WVEASALLYRRANEQDFAIK 994
           W+ +    + R +EQ + +K
Sbjct: 671 WLWSCLERWERVDEQLYPLK 690


>gi|62858037|ref|NP_001017022.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
           phosphatase, subunit 1 [Xenopus (Silurana) tropicalis]
          Length = 570

 Score = 99.4 bits (246), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 62/171 (36%), Positives = 93/171 (54%), Gaps = 20/171 (11%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 730
           E+Q ++   RKL L++DLD TL+++ + H        I            H  L R   M
Sbjct: 170 EDQLRLHRNRKLVLMVDLDQTLIHTTEQHCQHMSRKGIF-----------HFQLGRGEPM 218

Query: 731 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 790
            + T+LRP    FLE+ +KLFE+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 219 -LHTRLRPHCKEFLEKIAKLFELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 277

Query: 791 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           P+       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 278 PYS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 321


>gi|89269074|emb|CAJ81904.1| ctd (carboxy terminal domain rna polymerase 2 polypeptide a)
           phosphatase subunit 1 [Xenopus (Silurana) tropicalis]
          Length = 567

 Score = 99.4 bits (246), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 62/171 (36%), Positives = 93/171 (54%), Gaps = 20/171 (11%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 730
           E+Q ++   RKL L++DLD TL+++ + H        I            H  L R   M
Sbjct: 167 EDQLRLHRNRKLVLMVDLDQTLIHTTEQHCQHMSRKGIF-----------HFQLGRGEPM 215

Query: 731 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 790
            + T+LRP    FLE+ +KLFE+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 216 -LHTRLRPHCKEFLEKIAKLFELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 274

Query: 791 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           P+       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 275 PYS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 318


>gi|348512639|ref|XP_003443850.1| PREDICTED: RNA polymerase II subunit A C-terminal domain
           phosphatase-like [Oreochromis niloticus]
          Length = 998

 Score = 99.4 bits (246), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 61/171 (35%), Positives = 95/171 (55%), Gaps = 20/171 (11%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 730
           E+Q+++   +KL L++DLD TL+++ + H     +  I            H  L R   M
Sbjct: 166 EDQQRLHRNKKLVLMVDLDQTLIHTTEQHCQRMSNKGIF-----------HFQLGRGEPM 214

Query: 731 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 790
            + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 215 -LHTRLRPHCKEFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 273

Query: 791 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           PF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 274 PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYIYF 317



 Score = 50.8 bits (120), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 37/140 (26%), Positives = 67/140 (47%), Gaps = 25/140 (17%)

Query: 874 VIERLHKIFFSH-------QSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLH 926
           V+ER+H  +++        ++ +  D+R I+   + K L G  IVFS ++P        +
Sbjct: 591 VLERIHAEYYARYEGYLRKEASEMPDIRKIVPELKSKTLEGTMIVFSGLYPTN------Y 644

Query: 927 PLWQTAEQF-----GAVCTKHI------DDQVTHVVANSLGTDKVNWALSTGRF-VVHPG 974
           P+ +T E +     GA   K++       ++ TH++A   GT+KV  A       VV+P 
Sbjct: 645 PMERTREYYHAKALGAKIGKNLILSAQNPNRTTHLIAARAGTEKVRQAQGCKHLHVVNPD 704

Query: 975 WVEASALLYRRANEQDFAIK 994
           W+ +    + R  EQ + +K
Sbjct: 705 WLWSCLERWERVEEQLYPLK 724


>gi|426253911|ref|XP_004020634.1| PREDICTED: LOW QUALITY PROTEIN: RNA polymerase II subunit A
           C-terminal domain phosphatase, partial [Ovis aries]
          Length = 820

 Score = 99.4 bits (246), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 64/179 (35%), Positives = 99/179 (55%), Gaps = 21/179 (11%)

Query: 663 QKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR 722
           Q E+  R E+Q+++   RKL L++DLD TL+++ + H     +  I            H 
Sbjct: 74  QAEKLGR-EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HF 121

Query: 723 HLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 782
            L R   M + T+LRP    FLE+ ++L+E+H++T G++LYA  +A  LDP+  LF+ R+
Sbjct: 122 QLGRGEPM-LHTRLRPHCKEFLEKVARLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRI 180

Query: 783 ISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           +SR +  DPF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 181 LSRDECIDPFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 232


>gi|73945347|ref|XP_533365.2| PREDICTED: RNA polymerase II subunit A C-terminal domain
           phosphatase isoform 1 [Canis lupus familiaris]
          Length = 933

 Score = 99.4 bits (246), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 60/171 (35%), Positives = 95/171 (55%), Gaps = 20/171 (11%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 730
           E+Q+++   RKL L++DLD TL+++ + H     +  I            H  L R   M
Sbjct: 169 EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 217

Query: 731 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 790
            + T++RP    FLE+ ++L+E+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 218 -LHTRVRPHCREFLEKIARLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 276

Query: 791 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           PF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 277 PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 320



 Score = 50.1 bits (118), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 36/139 (25%), Positives = 64/139 (46%), Gaps = 23/139 (16%)

Query: 874 VIERLHKIFFSH-------QSLDDVDVRNILAAEQRKILAGCRIVFSRV----FPVGEAN 922
           ++ R+H  +++         S +  D+R I+   + ++LA   I+FS +    FP+ +  
Sbjct: 572 ILARVHSDYYAKYDRYLRGDSQEAPDIRKIVPELKSRVLADVAIIFSGLHPTNFPIEKTR 631

Query: 923 PHLHPLWQTAEQFGAVCTKHI------DDQVTHVVANSLGTDKVNWALSTGRF-VVHPGW 975
            H H     A   GA     +       D+ TH++A   GT+KV  A   G+  VV+P W
Sbjct: 632 EHYH-----ATALGAKILTQLVLDPDNPDRATHLIAARAGTEKVRQAQECGQLHVVNPDW 686

Query: 976 VEASALLYRRANEQDFAIK 994
           + +    + R  EQ F ++
Sbjct: 687 LWSCLERWDRVEEQLFPLR 705


>gi|432105445|gb|ELK31660.1| RNA polymerase II subunit A C-terminal domain phosphatase [Myotis
           davidii]
          Length = 823

 Score = 99.0 bits (245), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 61/171 (35%), Positives = 95/171 (55%), Gaps = 20/171 (11%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 730
           E+Q+++   RKL L++DLD TL+++ +       +  IL           H  L R   M
Sbjct: 53  EDQQRLHRNRKLVLMVDLDQTLIHTTEQQCQQMSNKGIL-----------HFQLGRGEPM 101

Query: 731 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 790
            + T+LRP    FLE+ ++L+E+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 102 -LHTRLRPHCREFLEKVARLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 160

Query: 791 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           PF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 161 PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 204



 Score = 50.1 bits (118), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 36/138 (26%), Positives = 65/138 (47%), Gaps = 23/138 (16%)

Query: 874 VIERLHKIFFSH-------QSLDDVDVRNILAAEQRKILAGCRIVFSRV----FPVGEAN 922
           ++ R+H  +++        ++ +  D+R I+   + ++LA   I+FS +    FPV +  
Sbjct: 453 ILARVHSDYYARYDRYLRGETQEAPDIRKIVPELKSRVLADVAIIFSGLYPTNFPVEKTR 512

Query: 923 PHLHPLWQTAEQFGA------VCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGW 975
            H H     A   GA      V      D+ TH++A   GT+KV  A + G+  VV+P W
Sbjct: 513 EHYH-----ATALGARILTQLVLDPDAPDRATHLIAAKAGTEKVRQAQACGQLHVVNPDW 567

Query: 976 VEASALLYRRANEQDFAI 993
           + +    + +  EQ F +
Sbjct: 568 LWSCLERWDKVEEQLFPL 585


>gi|345568228|gb|EGX51125.1| hypothetical protein AOL_s00054g501 [Arthrobotrys oligospora ATCC
           24927]
          Length = 854

 Score = 99.0 bits (245), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 67/184 (36%), Positives = 97/184 (52%), Gaps = 32/184 (17%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFR---- 726
           E +K++ SA+KL LV+DLD T++ +     VDP   E        D   P+ H  +    
Sbjct: 149 EAKKRLLSAKKLSLVVDLDQTIIQAT----VDPTVGEW-----RDDPSNPNYHAVKDVEA 199

Query: 727 FPHM-------GMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 777
           F  +       G W   KLRPG+  FL   SK++E H+YTMG + YA  +AK++DP+G +
Sbjct: 200 FQLLDEGAGGRGCWYYVKLRPGLKRFLSNISKIYECHIYTMGTRAYAMSIAKIVDPEGSI 259

Query: 778 FAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVER 836
           F  R++SR + G           SK LE +  +++  VVIIDD   VW  +  NLI V  
Sbjct: 260 FGERILSRDESGS--------LTSKSLERLFPVDTKMVVIIDDRGDVWKWSD-NLIKVTP 310

Query: 837 YTYF 840
           Y +F
Sbjct: 311 YDFF 314



 Score = 47.8 bits (112), Expect = 0.030,   Method: Compositional matrix adjust.
 Identities = 33/106 (31%), Positives = 51/106 (48%), Gaps = 7/106 (6%)

Query: 892 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVV 951
           D+++I+   +R++     IVFS V P+G  +     + Q    FGA+    I   VTHVV
Sbjct: 529 DIKHIMPRMKRRVFDNIVIVFSGVIPLG-VDVQSSDVAQWGRAFGAMVVDGISAAVTHVV 587

Query: 952 ANSLGTDKVNWALSTGRF----VVHPGWVEASALLYRRANEQDFAI 993
           A    T KV  A +  +F    +V P W   S  ++ + +E  F I
Sbjct: 588 AARTRTAKVRTAAT--KFPHIRIVTPQWFFQSITMWHKLDETPFLI 631


>gi|30962890|gb|AAH52576.1| CTDP1 protein, partial [Homo sapiens]
          Length = 874

 Score = 99.0 bits (245), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 61/171 (35%), Positives = 95/171 (55%), Gaps = 20/171 (11%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 730
           E+Q+++   RKL L++DLD TL+++ + H     +  I            H  L R   M
Sbjct: 85  EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 133

Query: 731 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 790
            + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 134 -LHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 192

Query: 791 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           P      + K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 193 P------ISKTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 236



 Score = 48.9 bits (115), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 36/139 (25%), Positives = 67/139 (48%), Gaps = 23/139 (16%)

Query: 874 VIERLHKIFFS------HQSLDDV-DVRNILAAEQRKILAGCRIVFSRV----FPVGEAN 922
           ++ R+H  +++      ++ +++  D+R I+   + K+LA   I+FS +    FP+ +  
Sbjct: 509 ILVRVHTDYYAKYDRYLNKEIEEAPDIRKIVPELKSKVLADVAIIFSGLHPTNFPIEKTR 568

Query: 923 PHLHPLWQTAEQFGA------VCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGW 975
            H H     A   GA      V +    D+ TH++A   GT+KV  A   G   VV+P W
Sbjct: 569 EHYH-----ATALGAKILTRLVLSPDAPDRATHLIAARAGTEKVLQAQECGHLHVVNPDW 623

Query: 976 VEASALLYRRANEQDFAIK 994
           + +    + +  EQ F ++
Sbjct: 624 LWSCLERWDKVEEQLFPLR 642


>gi|358372260|dbj|GAA88864.1| RNA Polymerase II CTD phosphatase Fcp1 [Aspergillus kawachii IFO
           4308]
          Length = 825

 Score = 99.0 bits (245), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 68/201 (33%), Positives = 108/201 (53%), Gaps = 34/201 (16%)

Query: 656 DQQKAAIQKERTRRLEE--QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKE 713
           D     + ++   R+EE  ++++ + RKL LV+DLD T++++     VDP   E +    
Sbjct: 132 DNTTLTVSEQEATRVEEDAKRRLLANRKLSLVVDLDQTIIHAT----VDPTVGEWM---- 183

Query: 714 EQDREKPHRHL------FRF----PHM-GMW--TKLRPGIWTFLERASKLFEMHLYTMGN 760
            QD+E P+         F+     P M G W   KLRPG+ +FL+  S+++E+H+YTMG 
Sbjct: 184 -QDKENPNYQALSDVRAFQLVDDGPGMRGCWYYVKLRPGLESFLQNVSEMYELHIYTMGT 242

Query: 761 KLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDD 819
           + YA  +A ++DP   LF  R++SR + G           +K+L  +  +++  VVIIDD
Sbjct: 243 RSYAQHIASIIDPDRKLFGDRILSRDESGSLV--------AKNLHRLFPVDTKMVVIIDD 294

Query: 820 SVRVWPHNKLNLIVVERYTYF 840
              VW  N  NLI V  Y +F
Sbjct: 295 RGDVWRWNP-NLIKVSPYDFF 314



 Score = 45.1 bits (105), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 32/104 (30%), Positives = 51/104 (49%), Gaps = 4/104 (3%)

Query: 892 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLH-PLWQTAEQFGAVCTKHIDDQVTHV 950
           DV+ I+   + +IL G  +VFS V P+G    +    LW  A+ FGAV    I+ + TH+
Sbjct: 501 DVKIIMPQMKHRILGGVTLVFSGVLPLGTDTQNADISLW--AKSFGAVIASKINMKTTHL 558

Query: 951 VANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAI 993
           VA    T KV  A    +  +V   W+      ++R  E+ + +
Sbjct: 559 VAGRNRTAKVREATRYPKVKIVTTQWLLDCLTQWKRLAEEPYLL 602


>gi|350634686|gb|EHA23048.1| hypothetical protein ASPNIDRAFT_197473 [Aspergillus niger ATCC
           1015]
          Length = 824

 Score = 99.0 bits (245), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 68/201 (33%), Positives = 108/201 (53%), Gaps = 34/201 (16%)

Query: 656 DQQKAAIQKERTRRLEE--QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKE 713
           D     + ++   R+EE  ++++ + RKL LV+DLD T++++     VDP   E +    
Sbjct: 132 DNTTLTVSEQEATRVEEDAKRRLLANRKLSLVVDLDQTIIHAT----VDPTVGEWM---- 183

Query: 714 EQDREKPHRHL------FRF----PHM-GMW--TKLRPGIWTFLERASKLFEMHLYTMGN 760
            QD+E P+         F+     P M G W   KLRPG+ +FL+  S+++E+H+YTMG 
Sbjct: 184 -QDKENPNYQALSDVRAFQLVDDGPGMRGCWYYVKLRPGLESFLQNVSEMYELHIYTMGT 242

Query: 761 KLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDD 819
           + YA  +A ++DP   LF  R++SR + G           +K+L  +  +++  VVIIDD
Sbjct: 243 RSYAQHIASIIDPDRKLFGDRILSRDESGSLV--------AKNLHRLFPVDTKMVVIIDD 294

Query: 820 SVRVWPHNKLNLIVVERYTYF 840
              VW  N  NLI V  Y +F
Sbjct: 295 RGDVWRWNP-NLIKVSPYDFF 314



 Score = 45.1 bits (105), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 32/104 (30%), Positives = 51/104 (49%), Gaps = 4/104 (3%)

Query: 892 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLH-PLWQTAEQFGAVCTKHIDDQVTHV 950
           DV+ I+   + +IL G  +VFS V P+G    +    LW  A+ FGAV    I+ + TH+
Sbjct: 502 DVKIIMPNMKHRILGGVTLVFSGVLPLGTDTQNADISLW--AKSFGAVIASKINMKTTHL 559

Query: 951 VANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAI 993
           VA    T KV  A    +  +V   W+      ++R  E+ + +
Sbjct: 560 VAGRNRTAKVREATRYPKVKIVTTQWLLDCLTQWKRLAEEPYLL 603


>gi|327270066|ref|XP_003219812.1| PREDICTED: RNA polymerase II subunit A C-terminal domain
           phosphatase-like [Anolis carolinensis]
          Length = 965

 Score = 98.6 bits (244), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 61/171 (35%), Positives = 96/171 (56%), Gaps = 20/171 (11%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 730
           E+Q+++   RKL L++DLD TL+++ + H            ++  +R   H  L R   M
Sbjct: 155 EDQERLRRNRKLVLMVDLDQTLIHTTEQH-----------CQQMSNRGIFHYQLGRGEPM 203

Query: 731 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 790
            + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LD +  LF+ R++SR +  D
Sbjct: 204 -LHTRLRPHCKEFLEKIAKLYELHVFTFGSRLYAHTIAAFLDSEKKLFSHRILSRDECID 262

Query: 791 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           PF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 263 PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 306



 Score = 49.3 bits (116), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 35/135 (25%), Positives = 65/135 (48%), Gaps = 15/135 (11%)

Query: 874 VIERLHKIFFS------HQSLDDV-DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLH 926
           ++ R+H  ++S       +  D++ D+R I+   +RK+LAG  I+FS ++P   A     
Sbjct: 589 ILVRVHADYYSKYDKYVRKETDEIPDIRKIVPELKRKVLAGVTILFSGLYPTNFAIERTR 648

Query: 927 PLWQTAEQFGAVCTKHI------DDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEAS 979
             +  A   GA   K +       ++ TH++A   GT+KV    +     +V+P W+ + 
Sbjct: 649 ENYH-ATALGAKIAKTLILDENDPNKTTHLIAARAGTEKVRQGQTCKSLHIVNPDWLWSC 707

Query: 980 ALLYRRANEQDFAIK 994
              + +  EQ F +K
Sbjct: 708 LERWDKVEEQLFPLK 722


>gi|355681363|gb|AER96784.1| CTD phosphatase, subunit 1 [Mustela putorius furo]
          Length = 819

 Score = 98.2 bits (243), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 60/171 (35%), Positives = 95/171 (55%), Gaps = 20/171 (11%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 730
           E+Q+++   RKL L++DLD TL+++ + H     +  I            H  L R   M
Sbjct: 53  EDQERLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 101

Query: 731 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 790
            + T++RP    FLE+ ++L+E+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 102 -LHTRVRPHCREFLEKIARLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 160

Query: 791 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           PF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 161 PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 204



 Score = 48.1 bits (113), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 36/139 (25%), Positives = 64/139 (46%), Gaps = 23/139 (16%)

Query: 874 VIERLHKIFFSH-------QSLDDVDVRNILAAEQRKILAGCRIVFSRV----FPVGEAN 922
           ++ R+H  +++         S +  D+R I+   + ++LA   I+FS +    FPV +  
Sbjct: 455 ILARVHSDYYARYDRHLLGDSPEAPDIRKIVPELKSRVLADVVIIFSGLHPTNFPVEKTR 514

Query: 923 PHLHPLWQTAEQFGAVCTKHID------DQVTHVVANSLGTDKVNWALSTGRF-VVHPGW 975
            H H     A   GA     +       ++ TH++A   GT+KV  A   G+  VV+P W
Sbjct: 515 EHYH-----ATALGAKVLTQLALDPGSPNRATHLIAARAGTEKVRQAQECGQLHVVNPDW 569

Query: 976 VEASALLYRRANEQDFAIK 994
           + +    + R  EQ F ++
Sbjct: 570 LWSCLERWDRVEEQLFPLR 588


>gi|156050785|ref|XP_001591354.1| hypothetical protein SS1G_07980 [Sclerotinia sclerotiorum 1980]
 gi|154692380|gb|EDN92118.1| hypothetical protein SS1G_07980 [Sclerotinia sclerotiorum 1980
           UF-70]
          Length = 806

 Score = 97.8 bits (242), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 66/198 (33%), Positives = 102/198 (51%), Gaps = 25/198 (12%)

Query: 656 DQQKAAIQKERTRRLEE--QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKE 713
           DQ    +  +   + EE  Q+++   RKL LV+DLD T++++     ++P   E  R   
Sbjct: 134 DQTHLTVSHDEASKAEEELQRRLLKNRKLSLVVDLDQTIIHAC----IEPTVGEWQRDVN 189

Query: 714 EQDRE--KPHRHLF------RFPHMGMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLY 763
             + E  K  R         R    G W   K+RPG+  FL + S+++E+H+YTMG + Y
Sbjct: 190 SPNYEAVKDVRSFQLNDDGPRGLASGCWYYIKMRPGLAEFLTKISEMYELHVYTMGTRAY 249

Query: 764 ATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVV-IIDDSVR 822
           A  +AK++DP   LF  R+ISR ++G+          +K L  +    + +V IIDD   
Sbjct: 250 ALSIAKIVDPGKKLFGDRIISRDENGN--------VTAKSLARLFPQSTHMVAIIDDRAD 301

Query: 823 VWPHNKLNLIVVERYTYF 840
           VWP N+ NLI V  Y +F
Sbjct: 302 VWPMNRPNLIKVVPYDFF 319


>gi|83767703|dbj|BAE57842.1| unnamed protein product [Aspergillus oryzae RIB40]
          Length = 820

 Score = 97.8 bits (242), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 70/201 (34%), Positives = 110/201 (54%), Gaps = 34/201 (16%)

Query: 656 DQQKAAIQKERTRRLEE--QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKE 713
           D     + ++   R+EE  ++++ S RKL LV+DLD T++++     VDP   E +    
Sbjct: 132 DNTALTVSEKEAARVEEDAKRRLLSNRKLSLVVDLDQTIIHAT----VDPTVGEWM---- 183

Query: 714 EQDREKP-HRHL-----FRF----PHM-GMW--TKLRPGIWTFLERASKLFEMHLYTMGN 760
            +D++ P H+ L     F+     P M G W   KLRPG+ +FL+  S+LFE+H+YTMG 
Sbjct: 184 -EDKDNPNHQALSDVRAFQLVDDGPGMRGCWYYVKLRPGLESFLQNVSELFELHIYTMGT 242

Query: 761 KLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDD 819
           + YA  +A ++DP   LF  R++SR + G           +K+L  +  +++  VVIIDD
Sbjct: 243 RAYAQHIASIIDPDRKLFGDRILSRDESGS--------LTAKNLHRLFPVDTKMVVIIDD 294

Query: 820 SVRVWPHNKLNLIVVERYTYF 840
              VW  +  NLI V  Y +F
Sbjct: 295 RGDVWRWSP-NLIKVSPYDFF 314



 Score = 47.4 bits (111), Expect = 0.037,   Method: Compositional matrix adjust.
 Identities = 32/104 (30%), Positives = 55/104 (52%), Gaps = 4/104 (3%)

Query: 892 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLH-PLWQTAEQFGAVCTKHIDDQVTHV 950
           D+++I+   +R+IL G  +VFS V P+G    +    LW  A+ FGAV ++ I+ + TH+
Sbjct: 500 DIKDIMPQIKRQILGGVILVFSGVLPLGTDTQNADISLW--AKSFGAVISQKINVKTTHL 557

Query: 951 VANSLGTDKVNWALS-TGRFVVHPGWVEASALLYRRANEQDFAI 993
           VA    T KV  A   T   +V   W+      ++  +E+ + +
Sbjct: 558 VAGRNRTAKVREATRYTNVKIVTTQWLLDCLTQWKWLDEEPYLL 601


>gi|391867600|gb|EIT76846.1| TFIIF-interacting CTD phosphatase [Aspergillus oryzae 3.042]
          Length = 820

 Score = 97.8 bits (242), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 70/201 (34%), Positives = 110/201 (54%), Gaps = 34/201 (16%)

Query: 656 DQQKAAIQKERTRRLEE--QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKE 713
           D     + ++   R+EE  ++++ S RKL LV+DLD T++++     VDP   E +    
Sbjct: 132 DNTALTVSEKEAARVEEDAKRRLLSNRKLSLVVDLDQTIIHAT----VDPTVGEWM---- 183

Query: 714 EQDREKP-HRHL-----FRF----PHM-GMW--TKLRPGIWTFLERASKLFEMHLYTMGN 760
            +D++ P H+ L     F+     P M G W   KLRPG+ +FL+  S+LFE+H+YTMG 
Sbjct: 184 -EDKDNPNHQALSDVRAFQLVDDGPGMRGCWYYVKLRPGLESFLQNVSELFELHIYTMGT 242

Query: 761 KLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDD 819
           + YA  +A ++DP   LF  R++SR + G           +K+L  +  +++  VVIIDD
Sbjct: 243 RAYAQHIASIIDPDRKLFGDRILSRDESGS--------LTAKNLHRLFPVDTKMVVIIDD 294

Query: 820 SVRVWPHNKLNLIVVERYTYF 840
              VW  +  NLI V  Y +F
Sbjct: 295 RGDVWRWSP-NLIKVSPYDFF 314



 Score = 47.4 bits (111), Expect = 0.038,   Method: Compositional matrix adjust.
 Identities = 32/104 (30%), Positives = 55/104 (52%), Gaps = 4/104 (3%)

Query: 892 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLH-PLWQTAEQFGAVCTKHIDDQVTHV 950
           D+++I+   +R+IL G  +VFS V P+G    +    LW  A+ FGAV ++ I+ + TH+
Sbjct: 500 DIKDIMPQIKRQILGGVILVFSGVLPLGTDTQNADISLW--AKSFGAVISQKINVKTTHL 557

Query: 951 VANSLGTDKVNWALS-TGRFVVHPGWVEASALLYRRANEQDFAI 993
           VA    T KV  A   T   +V   W+      ++  +E+ + +
Sbjct: 558 VAGRNRTAKVREATRYTNVKIVTTQWLLDCLTQWKWLDEEPYLL 601


>gi|238486788|ref|XP_002374632.1| RNA Polymerase II CTD phosphatase Fcp1, putative [Aspergillus
           flavus NRRL3357]
 gi|220699511|gb|EED55850.1| RNA Polymerase II CTD phosphatase Fcp1, putative [Aspergillus
           flavus NRRL3357]
          Length = 698

 Score = 97.4 bits (241), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 70/201 (34%), Positives = 110/201 (54%), Gaps = 34/201 (16%)

Query: 656 DQQKAAIQKERTRRLEE--QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKE 713
           D     + ++   R+EE  ++++ S RKL LV+DLD T++++     VDP   E +    
Sbjct: 10  DNTALTVSEKEAARVEEDAKRRLLSNRKLSLVVDLDQTIIHAT----VDPTVGEWM---- 61

Query: 714 EQDREKP-HRHL-----FRF----PHM-GMW--TKLRPGIWTFLERASKLFEMHLYTMGN 760
            +D++ P H+ L     F+     P M G W   KLRPG+ +FL+  S+LFE+H+YTMG 
Sbjct: 62  -EDKDNPNHQALSDVRAFQLVDDGPGMRGCWYYVKLRPGLESFLQNVSELFELHIYTMGT 120

Query: 761 KLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDD 819
           + YA  +A ++DP   LF  R++SR + G           +K+L  +  +++  VVIIDD
Sbjct: 121 RAYAQHIASIIDPDRKLFGDRILSRDESGS--------LTAKNLHRLFPVDTKMVVIIDD 172

Query: 820 SVRVWPHNKLNLIVVERYTYF 840
              VW  +  NLI V  Y +F
Sbjct: 173 RGDVWRWSP-NLIKVSPYDFF 192



 Score = 47.4 bits (111), Expect = 0.037,   Method: Compositional matrix adjust.
 Identities = 32/104 (30%), Positives = 55/104 (52%), Gaps = 4/104 (3%)

Query: 892 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLH-PLWQTAEQFGAVCTKHIDDQVTHV 950
           D+++I+   +R+IL G  +VFS V P+G    +    LW  A+ FGAV ++ I+ + TH+
Sbjct: 378 DIKDIMPQIKRQILGGVILVFSGVLPLGTDTQNADISLW--AKSFGAVISQKINVKTTHL 435

Query: 951 VANSLGTDKVNWALS-TGRFVVHPGWVEASALLYRRANEQDFAI 993
           VA    T KV  A   T   +V   W+      ++  +E+ + +
Sbjct: 436 VAGRNRTAKVREATRYTNVKIVTTQWLLDCLTQWKWLDEEPYLL 479


>gi|242781762|ref|XP_002479866.1| RNA Polymerase II CTD phosphatase Fcp1, putative [Talaromyces
           stipitatus ATCC 10500]
 gi|218720013|gb|EED19432.1| RNA Polymerase II CTD phosphatase Fcp1, putative [Talaromyces
           stipitatus ATCC 10500]
          Length = 822

 Score = 97.4 bits (241), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 69/197 (35%), Positives = 107/197 (54%), Gaps = 23/197 (11%)

Query: 654 YDDQQKAAIQKERTRRLEEQKK-MFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKK 712
           +D+      Q+E TR  E+ K+ + ++++L LV+DLD T++++     VDP   E    K
Sbjct: 131 HDNTALTVSQREATRVEEDAKRRLLASKRLSLVVDLDQTIIHAT----VDPTVGEWKEDK 186

Query: 713 EEQDREK-PHRHLFRF----PHM-GMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYA 764
              + E       F+     P M G W   KLRPG+ +FL+  SKL+E+H+YTMG + YA
Sbjct: 187 NNPNHEAVKDVRAFQLTDDGPGMRGCWYYIKLRPGLESFLQNISKLYELHIYTMGTRAYA 246

Query: 765 TEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRV 823
             +A ++DP   LF  R++SR + G           +K+L+ +  +++  VVIIDD   V
Sbjct: 247 QNIANIIDPDRKLFGDRILSRDESGS--------LTAKNLQRLFPVDTKMVVIIDDRGDV 298

Query: 824 WPHNKLNLIVVERYTYF 840
           W  N  NLI V  Y +F
Sbjct: 299 WKWNP-NLIKVSPYDFF 314



 Score = 42.7 bits (99), Expect = 0.90,   Method: Compositional matrix adjust.
 Identities = 33/105 (31%), Positives = 52/105 (49%), Gaps = 6/105 (5%)

Query: 892 DVRNILAAEQRKILAGCRIVFSRVFPVGE--ANPHLHPLWQTAEQFGAVCTKHIDDQVTH 949
           DV+ I+   ++K+L    +VFS V P+G    N  +  LW  A+ FGA  +  I  + TH
Sbjct: 504 DVKVIMPQIKQKVLGDVVLVFSGVLPLGTDLQNADI-SLW--AKSFGATISPRIGSKTTH 560

Query: 950 VVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAI 993
           +VA    T KV  A    R  +V   W+  S   ++  +E+ + I
Sbjct: 561 LVAGRNRTAKVREATRYPRIKIVTTQWLLDSLTQWKHMDEEPYLI 605


>gi|70999518|ref|XP_754478.1| RNA Polymerase II CTD phosphatase Fcp1 [Aspergillus fumigatus
           Af293]
 gi|66852115|gb|EAL92440.1| RNA Polymerase II CTD phosphatase Fcp1, putative [Aspergillus
           fumigatus Af293]
          Length = 827

 Score = 97.4 bits (241), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 68/197 (34%), Positives = 109/197 (55%), Gaps = 23/197 (11%)

Query: 654 YDDQQKAAIQKERTRRLEEQKK-MFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKK 712
           +D+      +KE TR  E+ K+ + + RKL LV+DLD T++++     VDP   E +  K
Sbjct: 131 HDNTALTVSEKEATRVEEDAKRRLLANRKLSLVVDLDQTIIHAT----VDPTVGEWMEDK 186

Query: 713 EEQDRE-----KPHRHLFRFPHM-GMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYA 764
           +  + +     +  + +   P M G W   KLRPG+ +FL+  S+LFE+H+YTMG + YA
Sbjct: 187 DNPNHDALGDVRAFQLVDDGPGMRGCWYYVKLRPGLESFLQNVSELFELHIYTMGTRAYA 246

Query: 765 TEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRV 823
             +A ++DP   LF  R++SR + G           +K+L+ +  +++  VVIIDD   V
Sbjct: 247 QHIAGIIDPDRKLFGDRILSRDESGS--------LTAKNLQRLFPVDTKMVVIIDDRGDV 298

Query: 824 WPHNKLNLIVVERYTYF 840
           W  +  NLI V  Y +F
Sbjct: 299 WRWSP-NLIKVSPYDFF 314



 Score = 46.6 bits (109), Expect = 0.062,   Method: Compositional matrix adjust.
 Identities = 32/104 (30%), Positives = 53/104 (50%), Gaps = 4/104 (3%)

Query: 892 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLH-PLWQTAEQFGAVCTKHIDDQVTHV 950
           DV++I+   + ++L G  +VFS V P+G    +    LW  A+ FGAV +  I+ + TH+
Sbjct: 500 DVKDIMPRIKHRVLGGVVLVFSGVLPLGTDTQNADISLW--AKSFGAVISTKINMRTTHL 557

Query: 951 VANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAI 993
           VA    T KV  A       +V   W+  S   ++R  E+ + +
Sbjct: 558 VAGRNRTAKVREATRYPNIKIVTTQWLLDSLTQWKRLEEEPYLL 601


>gi|410977919|ref|XP_003995346.1| PREDICTED: LOW QUALITY PROTEIN: RNA polymerase II subunit A
           C-terminal domain phosphatase [Felis catus]
          Length = 960

 Score = 97.4 bits (241), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 59/171 (34%), Positives = 95/171 (55%), Gaps = 20/171 (11%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 730
           ++Q+++   RKL L++DLD TL+++ + H     +  I            H  L R   M
Sbjct: 191 QDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HFQLGRGEPM 239

Query: 731 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 790
            + T++RP    FLE+ ++L+E+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 240 -LHTRVRPHCREFLEKIARLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 298

Query: 791 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           PF       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 299 PFS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 342



 Score = 51.2 bits (121), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 35/134 (26%), Positives = 64/134 (47%), Gaps = 13/134 (9%)

Query: 874 VIERLHKIFFSH-------QSLDDVDVRNILAAEQRKILAGCRIVFSRV----FPVGEAN 922
           ++ R+H  +++         S +  D+R I+   + ++LA   I+FS +    FP+ +  
Sbjct: 598 ILARVHSDYYAKYDRYLRGDSQEPPDIRKIVPELKSRVLADVAIIFSGLHPTNFPIEKTR 657

Query: 923 PHLHPLWQTAEQFGAVCTKHID-DQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASA 980
            H H     A+    +     D D+ TH++A   GT+KV  A   G+  VV+P W+ +  
Sbjct: 658 EHYHATALGAKILTQLVLDPDDPDRATHLIAARAGTEKVRQAQECGQLHVVNPDWLWSCL 717

Query: 981 LLYRRANEQDFAIK 994
             + R  EQ F ++
Sbjct: 718 ERWDRVEEQLFPLR 731


>gi|347836062|emb|CCD50634.1| similar to FCP1-like phosphatase [Botryotinia fuckeliana]
          Length = 832

 Score = 97.4 bits (241), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 66/198 (33%), Positives = 102/198 (51%), Gaps = 25/198 (12%)

Query: 656 DQQKAAIQKERTRRLEE--QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKE 713
           DQ    +  +   + EE  Q+++   RKL LV+DLD T++++     ++P   E  R   
Sbjct: 134 DQTHLTVSLDEASKAEEELQRRLLKNRKLSLVVDLDQTIIHAC----IEPTVGEWQRDVN 189

Query: 714 EQDRE--KPHRHLF------RFPHMGMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLY 763
             + E  K  R         R    G W   K+RPG+  FL + S+++E+H+YTMG + Y
Sbjct: 190 SPNYEAVKDVRSFQLNDDGPRGLASGCWYYIKMRPGLAEFLAKVSEMYELHVYTMGTRAY 249

Query: 764 ATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVV-IIDDSVR 822
           A  +AK++DP   LF  R+ISR ++G+          +K L  +    + +V IIDD   
Sbjct: 250 ALNIAKIVDPGKKLFGDRIISRDENGN--------VTAKSLARLFPQSTHMVAIIDDRAD 301

Query: 823 VWPHNKLNLIVVERYTYF 840
           VWP N+ NLI V  Y +F
Sbjct: 302 VWPMNRPNLIKVVPYDFF 319



 Score = 40.8 bits (94), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 28/95 (29%), Positives = 47/95 (49%), Gaps = 4/95 (4%)

Query: 903 KILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANS--LGTDKV 960
           ++L GC IV S + P+G  +     + Q  E FG    K +  +VTHVVA+S    T KV
Sbjct: 534 QVLDGCVIVMSGLVPLG-VDLMRSEIAQQIESFGGKIHKKVSKRVTHVVASSQKTRTQKV 592

Query: 961 NWALSTGRF-VVHPGWVEASALLYRRANEQDFAIK 994
             A       +V   W+  S   +++ +E ++ ++
Sbjct: 593 REAAKYPHIKIVTQQWLTQSMSKWKKEDESEYLVE 627


>gi|159127495|gb|EDP52610.1| RNA Polymerase II CTD phosphatase Fcp1, putative [Aspergillus
           fumigatus A1163]
          Length = 827

 Score = 97.4 bits (241), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 68/197 (34%), Positives = 109/197 (55%), Gaps = 23/197 (11%)

Query: 654 YDDQQKAAIQKERTRRLEEQKK-MFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKK 712
           +D+      +KE TR  E+ K+ + + RKL LV+DLD T++++     VDP   E +  K
Sbjct: 131 HDNTALTVSEKEATRVEEDAKRRLLANRKLSLVVDLDQTIIHAT----VDPTVGEWMEDK 186

Query: 713 EEQDRE-----KPHRHLFRFPHM-GMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYA 764
           +  + +     +  + +   P M G W   KLRPG+ +FL+  S+LFE+H+YTMG + YA
Sbjct: 187 DNPNHDALSDVRAFQLVDDGPGMRGCWYYVKLRPGLESFLQNVSELFELHIYTMGTRAYA 246

Query: 765 TEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRV 823
             +A ++DP   LF  R++SR + G           +K+L+ +  +++  VVIIDD   V
Sbjct: 247 QHIAGIIDPDRKLFGDRILSRDESGS--------LTAKNLQRLFPVDTKMVVIIDDRGDV 298

Query: 824 WPHNKLNLIVVERYTYF 840
           W  +  NLI V  Y +F
Sbjct: 299 WRWSP-NLIKVSPYDFF 314



 Score = 46.6 bits (109), Expect = 0.068,   Method: Compositional matrix adjust.
 Identities = 32/104 (30%), Positives = 53/104 (50%), Gaps = 4/104 (3%)

Query: 892 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLH-PLWQTAEQFGAVCTKHIDDQVTHV 950
           DV++I+   + ++L G  +VFS V P+G    +    LW  A+ FGAV +  I+ + TH+
Sbjct: 500 DVKDIMPRIKHRVLGGVVLVFSGVLPLGTDTQNADISLW--AKSFGAVISTKINMRTTHL 557

Query: 951 VANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAI 993
           VA    T KV  A       +V   W+  S   ++R  E+ + +
Sbjct: 558 VAGRNRTAKVREATRYPNIKIVTTQWLLDSLTQWKRLEEEPYLL 601


>gi|19115680|ref|NP_594768.1| CTD phosphatase Fcp1 [Schizosaccharomyces pombe 972h-]
 gi|26393804|sp|Q9P376.1|FCP1_SCHPO RecName: Full=RNA polymerase II subunit A C-terminal domain
           phosphatase; AltName: Full=CTD phosphatase fcp1
 gi|9588462|emb|CAC00553.1| CTD phosphatase Fcp1 [Schizosaccharomyces pombe]
          Length = 723

 Score = 96.7 bits (239), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 72/216 (33%), Positives = 108/216 (50%), Gaps = 44/216 (20%)

Query: 651 FEGYDDQQKAAIQK-----------ERTRRLEEQ--KKMFSARKLCLVLDLDHTLLNSAK 697
           + GY D  +A I             E   RLE +  K++   ++L L++DLD T++++  
Sbjct: 121 YMGYSDMARANISMTHNTGDLTVSLEEASRLESENVKRLRQEKRLSLIVDLDQTIIHAT- 179

Query: 698 FHEVDP-----------VHDEILRKKEEQD-REKPHRHLFRFPHMGMWTKLRPGIWTFLE 745
              VDP           V+ ++LR     + +E P  +   +     + K RPG+  FL+
Sbjct: 180 ---VDPTVGEWMSDPGNVNYDVLRDVRSFNLQEGPSGYTSCY-----YIKFRPGLAQFLQ 231

Query: 746 RASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLE 805
           + S+L+E+H+YTMG K YA E+AK++DP G LF  RV+SR D G            K L 
Sbjct: 232 KISELYELHIYTMGTKAYAKEVAKIIDPTGKLFQDRVLSRDDSGS--------LAQKSLR 283

Query: 806 GVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
            +   + S VV+IDD   VW  N  NLI V  Y +F
Sbjct: 284 RLFPCDTSMVVVIDDRGDVWDWNP-NLIKVVPYEFF 318



 Score = 53.5 bits (127), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 36/133 (27%), Positives = 66/133 (49%), Gaps = 14/133 (10%)

Query: 874 VIERLHKIFFSHQSLDDVDVRN----------ILAAEQRKILAGCRIVFSRVFPVGEANP 923
           V++ +H +++  +  +D+  R+          I+   ++K+L GCR++FS V P+G  + 
Sbjct: 452 VLKDIHAVYYEEE--NDISSRSGNHKHANVGLIIPKMKQKVLKGCRLLFSGVIPLG-VDV 508

Query: 924 HLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALL 982
               + + A  FGA          TH++A  + T+KV  A+S G   VV   W+  S   
Sbjct: 509 LSSDIAKWAMSFGAEVVLDFSVPPTHLIAAKIRTEKVKKAVSMGNIKVVKLNWLTESLSQ 568

Query: 983 YRRANEQDFAIKP 995
           ++R  E D+ + P
Sbjct: 569 WKRLPESDYLLYP 581


>gi|317144011|ref|XP_001819844.2| RNA polymerase II subunit A C-terminal domain phosphatase
           [Aspergillus oryzae RIB40]
          Length = 799

 Score = 96.3 bits (238), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 62/188 (32%), Positives = 102/188 (54%), Gaps = 29/188 (15%)

Query: 656 DQQKAAIQKERTRRLEE--QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKE 713
           D     + ++   R+EE  ++++ S RKL LV+DLD T++++     VDP   E +    
Sbjct: 132 DNTALTVSEKEAARVEEDAKRRLLSNRKLSLVVDLDQTIIHAT----VDPTVGEWM---- 183

Query: 714 EQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDP 773
            +D++ P+            + LRPG+ +FL+  S+LFE+H+YTMG + YA  +A ++DP
Sbjct: 184 -EDKDNPNHQAL--------SDLRPGLESFLQNVSELFELHIYTMGTRAYAQHIASIIDP 234

Query: 774 KGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLI 832
              LF  R++SR + G           +K+L  +  +++  VVIIDD   VW  +  NLI
Sbjct: 235 DRKLFGDRILSRDESGS--------LTAKNLHRLFPVDTKMVVIIDDRGDVWRWSP-NLI 285

Query: 833 VVERYTYF 840
            V  Y +F
Sbjct: 286 KVSPYDFF 293



 Score = 47.8 bits (112), Expect = 0.032,   Method: Compositional matrix adjust.
 Identities = 32/104 (30%), Positives = 55/104 (52%), Gaps = 4/104 (3%)

Query: 892 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLH-PLWQTAEQFGAVCTKHIDDQVTHV 950
           D+++I+   +R+IL G  +VFS V P+G    +    LW  A+ FGAV ++ I+ + TH+
Sbjct: 479 DIKDIMPQIKRQILGGVILVFSGVLPLGTDTQNADISLW--AKSFGAVISQKINVKTTHL 536

Query: 951 VANSLGTDKVNWALS-TGRFVVHPGWVEASALLYRRANEQDFAI 993
           VA    T KV  A   T   +V   W+      ++  +E+ + +
Sbjct: 537 VAGRNRTAKVREATRYTNVKIVTTQWLLDCLTQWKWLDEEPYLL 580


>gi|449675210|ref|XP_002161785.2| PREDICTED: RNA polymerase II subunit A C-terminal domain
           phosphatase-like [Hydra magnipapillata]
          Length = 718

 Score = 95.9 bits (237), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 67/183 (36%), Positives = 96/183 (52%), Gaps = 30/183 (16%)

Query: 663 QKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR 722
           + E+  + +EQ+ +  ARKL LV+DLD TL+++     V+P               K  +
Sbjct: 137 EAEKLAKYDEQQ-LLRARKLVLVVDLDMTLIHTT----VEPT-------------PKNTK 178

Query: 723 HLFRFPHMG----MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLF 778
            +F F   G      TKLRPG   FLE  SK +E+H++TMG++LYA  +AK LDP G  F
Sbjct: 179 DVFSFKLPGHQYEYHTKLRPGARKFLESISKFYELHIFTMGSRLYAHTVAKCLDPDGKFF 238

Query: 779 AGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERY 837
           A R+ SR +  + F       K  DL+ +    +  V IIDD   VW +   NLI V+ Y
Sbjct: 239 AHRIRSRDEFINSF------SKFHDLKALFPCGDHMVCIIDDREDVWNYAP-NLITVKPY 291

Query: 838 TYF 840
            +F
Sbjct: 292 KFF 294



 Score = 48.9 bits (115), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 46/156 (29%), Positives = 67/156 (42%), Gaps = 44/156 (28%)

Query: 874 VIERLHKIFFSHQSLDDV------------------------DVRNILAAEQRKILAGCR 909
           ++ER+H  F+S  +LD V                        D+R IL   +R+ L GC 
Sbjct: 395 ILERVHYSFYS--TLDAVKAKGGAASKDTEITKFCTFGTMNPDIRVILPELRRQTLKGCN 452

Query: 910 IVFSRVFPVGEANPHLHPL-----WQTAEQFGAVCTKHI----DD--QVTHVVANSLGTD 958
           IVF+ V P         PL     W+TA   GA  T  +    +D  + THVVA   GT 
Sbjct: 453 IVFTGVIPTN------CPLEKSKAWKTAVSLGARVTSEVVGKEEDGLRTTHVVAARHGTH 506

Query: 959 KVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAI 993
           K + A  +    +V+P W+  S+  +  A E  F +
Sbjct: 507 KAHKAYKSPDINLVNPNWLWCSSERWEWAEESIFPV 542


>gi|212526776|ref|XP_002143545.1| RNA Polymerase II CTD phosphatase Fcp1, putative [Talaromyces
           marneffei ATCC 18224]
 gi|210072943|gb|EEA27030.1| RNA Polymerase II CTD phosphatase Fcp1, putative [Talaromyces
           marneffei ATCC 18224]
          Length = 829

 Score = 95.9 bits (237), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 68/202 (33%), Positives = 110/202 (54%), Gaps = 33/202 (16%)

Query: 654 YDDQQKAAIQKERTRRLEEQKK-MFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKK 712
           +D+      Q+E TR  E+ K+ + ++++L LV+DLD T++++     VDP   E     
Sbjct: 131 HDNTALTVSQREATRVEEDAKRRLLASKRLSLVVDLDQTIIHAT----VDPTVGEW---- 182

Query: 713 EEQDREKPHR------HLFRF----PHM-GMW--TKLRPGIWTFLERASKLFEMHLYTMG 759
            ++D+  P+         F+     P M G W   KLRPG+ +FL+  S+L+E+H+YTMG
Sbjct: 183 -KEDKNNPNHDAVKDVRAFQLTDDGPGMRGCWYYIKLRPGLESFLQNISELYELHIYTMG 241

Query: 760 NKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIID 818
            + YA  +A ++DP   LF  R++SR + G           +K+L+ +  +++  VVIID
Sbjct: 242 TRAYAQHIANIIDPDRKLFGDRILSRDESGS--------LTAKNLQRLFPVDTKMVVIID 293

Query: 819 DSVRVWPHNKLNLIVVERYTYF 840
           D   VW  N  NLI V  Y +F
Sbjct: 294 DRGDVWKWNP-NLIKVSPYDFF 314



 Score = 47.8 bits (112), Expect = 0.031,   Method: Compositional matrix adjust.
 Identities = 35/105 (33%), Positives = 53/105 (50%), Gaps = 6/105 (5%)

Query: 892 DVRNILAAEQRKILAGCRIVFSRVFPVGE--ANPHLHPLWQTAEQFGAVCTKHIDDQVTH 949
           DV+ ++   + K+L G  +VFS V P+G    N  +  LW  A+ FGAV +  I  + TH
Sbjct: 504 DVKVVMPQIKAKVLGGVVLVFSGVLPLGTDLQNADI-SLW--AKSFGAVISSRIGSKTTH 560

Query: 950 VVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAI 993
           +VA    T KV  A    R  +V   W+  S   ++  NE+ + I
Sbjct: 561 LVAGRNRTAKVREATRYPRIKIVTTQWLLDSLTNWKHMNEEPYLI 605


>gi|148236185|ref|NP_001090168.1| CTD phosphatase [Xenopus laevis]
 gi|13487713|gb|AAK27686.1| CTD phosphatase [Xenopus laevis]
          Length = 980

 Score = 95.5 bits (236), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 60/171 (35%), Positives = 93/171 (54%), Gaps = 20/171 (11%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 730
           E+Q ++   +KL L++DLD TL+++ + H        I            H  L R   M
Sbjct: 165 EDQFRLHRNKKLVLMVDLDQTLIHTTEQHCQHMSRKGIF-----------HFQLGRGEPM 213

Query: 731 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 790
            + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 214 -LHTRLRPHCKEFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 272

Query: 791 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           P+       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 273 PYS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 316



 Score = 42.0 bits (97), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 36/139 (25%), Positives = 63/139 (45%), Gaps = 23/139 (16%)

Query: 874 VIERLHKIFFSH------QSLDDV-DVRNILAAEQRKILAGCRIVFSRV----FPVGEAN 922
           ++ R+H  +++       + +D V D+R I+   + K+L    I FS +    FP+    
Sbjct: 601 ILVRVHTDYYAKYDRYLKKEVDSVPDIRKIVPELKSKVLENVIISFSGLHPTNFPIERTR 660

Query: 923 PHLHPLWQTAEQFGAVCTKHI------DDQVTHVVANSLGTDKVNWALSTGRF-VVHPGW 975
              H     A   GA   K++       D+ TH++A   GT+KV  A +     VV+P W
Sbjct: 661 EFYH-----ARALGASIHKNLILKPDDPDRTTHLIAARAGTEKVRKAQNCKHLHVVNPEW 715

Query: 976 VEASALLYRRANEQDFAIK 994
           + +    + +  EQ F +K
Sbjct: 716 LWSCLERWEKVEEQLFPLK 734


>gi|148227040|ref|NP_001081726.1| FCP1 serine phosphatase [Xenopus laevis]
 gi|62185667|gb|AAH92306.1| Fcp1 protein [Xenopus laevis]
          Length = 979

 Score = 95.5 bits (236), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 60/171 (35%), Positives = 93/171 (54%), Gaps = 20/171 (11%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 730
           E+Q ++   +KL L++DLD TL+++ + H        I            H  L R   M
Sbjct: 165 EDQFRLHRNQKLVLMVDLDQTLIHTTEQHCQHMSRKGIF-----------HFQLGRGEPM 213

Query: 731 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 790
            + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+ R++SR +  D
Sbjct: 214 -LHTRLRPHCKEFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECID 272

Query: 791 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           P+       K+ +L  +    +S V IIDD   VW     NLI V++Y YF
Sbjct: 273 PYS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYVYF 316



 Score = 50.8 bits (120), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 39/139 (28%), Positives = 67/139 (48%), Gaps = 23/139 (16%)

Query: 874 VIERLHKIFFSH------QSLDDV-DVRNILAAEQRKILAGCRIVFSRV----FPVGEAN 922
           ++ R+H  +++       + +DDV D+RNI+   + K+L    I FS +    FP+    
Sbjct: 598 ILVRVHTDYYAKYDRYLKKEVDDVPDIRNIVPELKSKVLENVIITFSGLHPTNFPIERTR 657

Query: 923 PHLHPLWQTAEQFGAVCTKHI------DDQVTHVVANSLGTDKVNWALSTGRF-VVHPGW 975
              H     A   GA  +K++       D+VTH++A   GT+KV  A +     VV+P W
Sbjct: 658 EFYH-----ARALGANISKNLILKPDDSDRVTHLIAARSGTEKVRQAQNCKHLHVVNPEW 712

Query: 976 VEASALLYRRANEQDFAIK 994
           + +    + +  EQ F +K
Sbjct: 713 LWSCLERWEKVEEQLFPLK 731


>gi|121705758|ref|XP_001271142.1| RNA Polymerase II CTD phosphatase Fcp1, putative [Aspergillus
           clavatus NRRL 1]
 gi|119399288|gb|EAW09716.1| RNA Polymerase II CTD phosphatase Fcp1, putative [Aspergillus
           clavatus NRRL 1]
          Length = 826

 Score = 95.5 bits (236), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 66/197 (33%), Positives = 109/197 (55%), Gaps = 23/197 (11%)

Query: 654 YDDQQKAAIQKERTRRLEEQKK-MFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKK 712
           +D+      ++E TR  E+ K+ + + +KL LV+DLD T++++     VDP   E +  K
Sbjct: 131 HDNTSLTVSEREATRVEEDAKRRLLANKKLSLVVDLDQTIIHAT----VDPTVREWMEDK 186

Query: 713 EEQDRE-----KPHRHLFRFPHM-GMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYA 764
           +  + E     +  + +   P M G W   KLRPG+ +FL+  ++LFE+H+YTMG + YA
Sbjct: 187 DNPNHEALSDVRAFQLVDDGPGMRGCWYYVKLRPGLESFLQNVAELFELHIYTMGTRAYA 246

Query: 765 TEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRV 823
             +A ++DP   LF  R++SR + G           +K+L+ +  +++  VVIIDD   V
Sbjct: 247 QHIAAIIDPDRKLFGDRILSRDESGS--------LTAKNLQRLFPVDTKMVVIIDDRGDV 298

Query: 824 WPHNKLNLIVVERYTYF 840
           W  +  NLI V  Y +F
Sbjct: 299 WRWSP-NLIKVSPYDFF 314



 Score = 45.8 bits (107), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 30/104 (28%), Positives = 53/104 (50%), Gaps = 4/104 (3%)

Query: 892 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLH-PLWQTAEQFGAVCTKHIDDQVTHV 950
           DV++I+   + ++L G  +VFS V P+G    +    LW   + FGAV +  I+ + TH+
Sbjct: 499 DVKDIMPRIKHRVLGGVVLVFSGVLPLGTDTQNADISLW--TKSFGAVISTKINRKTTHL 556

Query: 951 VANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAI 993
           VA    T KV  A       +V   W+  +   ++R +E+ + +
Sbjct: 557 VAGRNRTAKVREATRYPNIKIVTTQWLLDTLTQWKRLDEEPYLL 600


>gi|428183780|gb|EKX52637.1| hypothetical protein GUITHDRAFT_101798 [Guillardia theta CCMP2712]
          Length = 749

 Score = 95.5 bits (236), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 65/181 (35%), Positives = 99/181 (54%), Gaps = 33/181 (18%)

Query: 676 MFSARKLCLVLDLDHTLLNSAKFHEVDP---VHDEILRKKEEQDREKPHRHLFRFPHMGM 732
           MF A    LVLDLDHTLL     H   P   + + I++   EQ ++    H+ +      
Sbjct: 113 MFGA----LVLDLDHTLL-----HTTLPRTEMEEMIMQTLHEQCKDV---HVLQVSAARY 160

Query: 733 WTKLRPGIWTFLERASKLFEMHLYT--MGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 790
           +TKLRPGI  FL   S+LFE+++YT  MG++ YA  +A +LD  G +F GR+ISR D  D
Sbjct: 161 YTKLRPGIRNFLSEMSRLFELYIYTAGMGSQQYAEAVAHMLDESGRMFRGRIISRDDYTD 220

Query: 791 PFDGDERVPKSKDLEGVLGME---SAVVIIDDSVRVWPH-------NKLNLIVVERYTYF 840
                    + K L+ V  ++   + V+I+DD+   W H       ++ NLI V++Y+++
Sbjct: 221 V------SLEHKKLDKVFPIDEHRALVIILDDNAETWDHQYSDGRNSQENLIQVDKYSFW 274

Query: 841 P 841
           P
Sbjct: 275 P 275



 Score = 45.8 bits (107), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 34/108 (31%), Positives = 53/108 (49%), Gaps = 15/108 (13%)

Query: 892 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQ-----FGAVCTKHIDDQ 946
           D+R +L + Q ++L+G +I FS V       PH  PL  + E       GA C + I   
Sbjct: 482 DIRELLRSAQSRVLSGLKIAFSGVI------PHSFPLLSSREGRLASLLGAQCCEEIAG- 534

Query: 947 VTH--VVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDF 991
           +TH  VV  +  T+KV  ++  G   +V P W+ A A  + + +E  F
Sbjct: 535 ITHLVVVIRTGLTEKVIESIRQGDVEIVSPEWLYACASRWEKVSEDKF 582


>gi|303317134|ref|XP_003068569.1| NLI interacting factor-like phosphatase family protein
           [Coccidioides posadasii C735 delta SOWgp]
 gi|240108250|gb|EER26424.1| NLI interacting factor-like phosphatase family protein
           [Coccidioides posadasii C735 delta SOWgp]
 gi|320038484|gb|EFW20419.1| RNA Polymerase II CTD phosphatase Fcp1 [Coccidioides posadasii str.
           Silveira]
          Length = 868

 Score = 95.1 bits (235), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 66/201 (32%), Positives = 105/201 (52%), Gaps = 34/201 (16%)

Query: 656 DQQKAAIQKERTRRLEE--QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKE 713
           D     + K+   R+EE  ++++ S+RKL LV+DLD T++++     VDP   E      
Sbjct: 132 DNASLTVSKDEATRVEEDAKRRLLSSRKLSLVVDLDQTIIHAT----VDPTVAEW----- 182

Query: 714 EQDREKPHRHLFR----------FPHM-GMW--TKLRPGIWTFLERASKLFEMHLYTMGN 760
           ++D+  P+    +           P M G W   KLRPG+  FL   S L+E+H+YTMG 
Sbjct: 183 QEDKTNPNHEAVKDVRAFQLVDDGPGMRGCWYYIKLRPGLEDFLRSISSLYELHIYTMGT 242

Query: 761 KLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDD 819
           + YA  +A ++DP   +F  R++SR + G           +K+L+ +  +++  VVIIDD
Sbjct: 243 RAYAQNIANIVDPDRKIFGDRILSRDESGS--------LTAKNLQRLFPVDTKMVVIIDD 294

Query: 820 SVRVWPHNKLNLIVVERYTYF 840
              VW  +  NLI V  Y +F
Sbjct: 295 RGDVWNWSD-NLIRVHPYDFF 314



 Score = 50.1 bits (118), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 32/104 (30%), Positives = 55/104 (52%), Gaps = 4/104 (3%)

Query: 892 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLH-PLWQTAEQFGAVCTKHIDDQVTHV 950
           D++ I+   +R+IL    +VFS V P+ +   +    LW  A+ FGA+ T+ ID + TH+
Sbjct: 510 DIKIIMPQIKRRILEAVVVVFSGVIPLDKDTQNAEISLW--AKSFGAIITQKIDSRTTHL 567

Query: 951 VANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAI 993
           VA    T KV  A    +  +V   W+  S   ++R +E+ + +
Sbjct: 568 VAGRNRTAKVREATRYPKIKIVTVQWLLDSLTQWKRLDEEPYLV 611


>gi|148236996|ref|NP_001087852.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
           phosphatase, subunit 1 [Xenopus laevis]
 gi|51950264|gb|AAH82378.1| MGC81710 protein [Xenopus laevis]
          Length = 977

 Score = 95.1 bits (235), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 61/182 (33%), Positives = 98/182 (53%), Gaps = 22/182 (12%)

Query: 662 IQKERTRRL--EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREK 719
           +  E+  +L  E+Q ++   +K+ L++DLD TL+++ + H        I           
Sbjct: 152 VSSEKAEQLGREDQLRLHRNKKVVLMVDLDQTLIHTTEQHCQHMSRKGIF---------- 201

Query: 720 PHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA 779
            H  L R   M + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+
Sbjct: 202 -HFQLGRGEPM-LHTRLRPHCKEFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFS 259

Query: 780 GRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYT 838
            R++SR +  DP+       K+ +L  +    +S V IIDD   VW     NLI V++Y 
Sbjct: 260 HRILSRDECIDPYS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKYV 312

Query: 839 YF 840
           YF
Sbjct: 313 YF 314



 Score = 42.0 bits (97), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 36/139 (25%), Positives = 63/139 (45%), Gaps = 23/139 (16%)

Query: 874 VIERLHKIFFSH------QSLDDV-DVRNILAAEQRKILAGCRIVFSRV----FPVGEAN 922
           ++ R+H  +++       + +D V D+R I+   + K+L    I FS +    FP+    
Sbjct: 597 ILVRVHTDYYAKYDRYLKKEVDSVPDIRKIVPELKSKVLENVIISFSGLHPTNFPIERTR 656

Query: 923 PHLHPLWQTAEQFGAVCTKHI------DDQVTHVVANSLGTDKVNWALSTGRF-VVHPGW 975
              H     A   GA   K++       D+ TH++A   GT+KV  A +     VV+P W
Sbjct: 657 EFYH-----ARALGASIHKNLILKPDDPDRTTHLIAARAGTEKVRKAQNCKHLHVVNPEW 711

Query: 976 VEASALLYRRANEQDFAIK 994
           + +    + +  EQ F +K
Sbjct: 712 LWSCLERWEKVEEQLFPLK 730


>gi|392870961|gb|EAS32809.2| FCP1-like phosphatase, phosphatase domain-containing protein
           [Coccidioides immitis RS]
          Length = 868

 Score = 95.1 bits (235), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 66/201 (32%), Positives = 105/201 (52%), Gaps = 34/201 (16%)

Query: 656 DQQKAAIQKERTRRLEE--QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKE 713
           D     + K+   R+EE  ++++ S+RKL LV+DLD T++++     VDP   E      
Sbjct: 132 DNASLTVSKDEATRVEEDAKRRLLSSRKLSLVVDLDQTIIHAT----VDPTVAEW----- 182

Query: 714 EQDREKPHRHLFR----------FPHM-GMW--TKLRPGIWTFLERASKLFEMHLYTMGN 760
           ++D+  P+    +           P M G W   KLRPG+  FL   S L+E+H+YTMG 
Sbjct: 183 QEDKTNPNHEAVKDVRAFQLVDDGPGMRGCWYYIKLRPGLEDFLRSISSLYELHIYTMGT 242

Query: 761 KLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDD 819
           + YA  +A ++DP   +F  R++SR + G           +K+L+ +  +++  VVIIDD
Sbjct: 243 RAYAQNIANIVDPDRKIFGDRILSRDESGS--------LTAKNLQRLFPVDTKMVVIIDD 294

Query: 820 SVRVWPHNKLNLIVVERYTYF 840
              VW  +  NLI V  Y +F
Sbjct: 295 RGDVWNWSD-NLIRVHPYDFF 314



 Score = 50.1 bits (118), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 32/104 (30%), Positives = 55/104 (52%), Gaps = 4/104 (3%)

Query: 892 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLH-PLWQTAEQFGAVCTKHIDDQVTHV 950
           D++ I+   +R+IL    +VFS V P+ +   +    LW  A+ FGA+ T+ ID + TH+
Sbjct: 510 DIKIIMPQIKRRILEAVVVVFSGVIPLDKDTQNAEISLW--AKSFGAIITQKIDSRTTHL 567

Query: 951 VANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAI 993
           VA    T KV  A    +  +V   W+  S   ++R +E+ + +
Sbjct: 568 VAGRNRTAKVREATRYPKIKIVTVQWLLDSLTQWKRLDEEPYLV 611


>gi|213403530|ref|XP_002172537.1| RNA polymerase II subunit A C-terminal domain phosphatase
           [Schizosaccharomyces japonicus yFS275]
 gi|212000584|gb|EEB06244.1| RNA polymerase II subunit A C-terminal domain phosphatase
           [Schizosaccharomyces japonicus yFS275]
          Length = 723

 Score = 94.7 bits (234), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 69/207 (33%), Positives = 103/207 (49%), Gaps = 30/207 (14%)

Query: 651 FEGYDDQQKAAIQKE-------RTRRLEEQK--KMFSARKLCLVLDLDHTLLNSAKFHEV 701
           + G+ D  +A I            RRLE +   ++   ++L L++DLD T++++     V
Sbjct: 121 YMGFSDLSRATINMTHGSGGLTEARRLETETAIRLQKQKRLSLIVDLDQTIIHAT----V 176

Query: 702 DPVHDEILRKKEEQDRE---KPHRHLFRFPHMGM----WTKLRPGIWTFLERASKLFEMH 754
           DP   E ++     + +     H    R    G     + K RPG+  FL   SKL+E+H
Sbjct: 177 DPTVGEWMKDPNNVNYKVLRDVHYFYLREGTSGYTSCYYIKPRPGLQEFLHNVSKLYELH 236

Query: 755 LYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SA 813
           +YTMG K YATE+AKV+DP G LF  RV+SR D G+           K +  +   + S 
Sbjct: 237 IYTMGTKAYATEVAKVIDPDGELFQDRVLSRDDSGN--------LTQKSIRRLFPCDTSM 288

Query: 814 VVIIDDSVRVWPHNKLNLIVVERYTYF 840
           VV+IDD   VW  +  NLI V  + +F
Sbjct: 289 VVVIDDRGDVWNWSS-NLIKVYPFEFF 314



 Score = 48.1 bits (113), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 32/105 (30%), Positives = 56/105 (53%), Gaps = 2/105 (1%)

Query: 892 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVV 951
           DV  I+   ++++L+ CRI+FS + P+G  +     + + A  FGA     + D  TH+V
Sbjct: 451 DVGLIIPEMKKRVLSHCRILFSGIIPLG-VDVITSDIAKWAMSFGAHVLLDLKDNPTHLV 509

Query: 952 ANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAIKP 995
           A+ + T+KV  AL      VV+  W+  S   ++   E++F + P
Sbjct: 510 ASKVQTEKVRRALEIQTVKVVNLDWLLHSMSQWKALPEENFLLYP 554


>gi|50306333|ref|XP_453140.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
 gi|49642274|emb|CAH00236.1| KLLA0D01595p [Kluyveromyces lactis]
          Length = 719

 Score = 94.7 bits (234), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 74/204 (36%), Positives = 103/204 (50%), Gaps = 38/204 (18%)

Query: 656 DQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRK---- 711
           +QQ   +++    RL E+KK      L LV+DLD T+++      VDP   E +R     
Sbjct: 138 EQQAETLERSSLTRLREEKK------LVLVVDLDQTVIHCG----VDPTIGEWMRDPKNP 187

Query: 712 --KEEQD------REKPHRHLFRF----PHMGMW--TKLRPGIWTFLERASKLFEMHLYT 757
             K  QD       ++P    F F    P    W   KLRPG+  F E  S  FEMH+YT
Sbjct: 188 NYKALQDVKSFTLEDEPIIPSFYFGPKPPARKSWYYVKLRPGLKEFFEAVSPHFEMHIYT 247

Query: 758 MGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVI 816
           M  + YA E+AK++DP G LF  R++SR ++G           +K LE +  M +S VV+
Sbjct: 248 MATRSYAHEIAKIIDPTGELFGDRILSRDENGS--------LTTKSLERLFPMDQSMVVV 299

Query: 817 IDDSVRVWPHNKLNLIVVERYTYF 840
           IDD   VW   + NLI V  Y++F
Sbjct: 300 IDDRGDVWNWFE-NLIKVVPYSFF 322



 Score = 48.1 bits (113), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 39/158 (24%), Positives = 70/158 (44%), Gaps = 12/158 (7%)

Query: 841 PCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQS---LDDVDVRNIL 897
           P +  Q  L   +LL  D DE      L     ++ R+HK +F+  +    +  D++ +L
Sbjct: 420 PLAELQKHLHNQTLLIDDDDE------LPHLSQILLRVHKEYFNQYTEAPKNPPDIKYLL 473

Query: 898 AAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGT 957
              + K+  GC  VFS + P+ E +     +    + FGA  +  I+ + TH++  +  T
Sbjct: 474 PQMKYKVFQGCHFVFSGLIPL-ETDVRKADIVLWTDMFGATTSSDINYKTTHIITTTSRT 532

Query: 958 DKVNWALSTG--RFVVHPGWVEASALLYRRANEQDFAI 993
            K   A S      +VHP W+    + + R  E  + +
Sbjct: 533 FKARLAKSFNPDIKIVHPDWLFECLVQWERVKESPYEL 570


>gi|440804367|gb|ELR25244.1| FCP1like phosphatase, phosphatase subfamily protein [Acanthamoeba
           castellanii str. Neff]
          Length = 930

 Score = 94.7 bits (234), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 66/200 (33%), Positives = 109/200 (54%), Gaps = 30/200 (15%)

Query: 659 KAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD-- 716
           +A  +++  R  E   ++ +A+KL LVLDLD TL+++ +  EV+ +      + +     
Sbjct: 130 RAVAERDAERTAE---RLTAAKKLSLVLDLDQTLVHATQDAEVETLFGTDAAEAKGGSIT 186

Query: 717 ----------REKPHRHLFRF-----PHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNK 761
                      + P  HL+RF     PH   + KLRP +  FL     LFE+H+YTMG++
Sbjct: 187 CALPNPPAGPEDVPAAHLYRFTLEGNPHK-FYLKLRPHLEEFLMGVKDLFELHIYTMGSR 245

Query: 762 LYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDS 820
            YA ++A+++DP+  LF   ++SR + G+       V   K+L+ +  + +S V+IIDD 
Sbjct: 246 SYARKVAQIIDPEQKLFRENIVSRDECGN-------VMNLKNLQRIFPVDDSMVMIIDDR 298

Query: 821 VRVWPHNKLNLIVVERYTYF 840
           V VW  +K NLI +E Y +F
Sbjct: 299 VDVWGTSK-NLIKIEPYYFF 317



 Score = 71.6 bits (174), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 36/101 (35%), Positives = 56/101 (55%), Gaps = 2/101 (1%)

Query: 892 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVV 951
           D++  L  ++R++L G  I FS +FP G + P   PLW+ +E+FGA C+     + TH+V
Sbjct: 600 DIKYCLHVQRRRVLEGVHICFSSIFPTG-SKPESTPLWRLSEEFGACCSNVFTPETTHLV 658

Query: 952 ANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDF 991
           A +  T+KV  A   G   +VH  W+  S   +RR  E  +
Sbjct: 659 ALNERTEKVKLAHERGGVHIVHLEWLLESIRTWRRPEEARY 699


>gi|119187277|ref|XP_001244245.1| hypothetical protein CIMG_03686 [Coccidioides immitis RS]
          Length = 839

 Score = 94.7 bits (234), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 66/201 (32%), Positives = 105/201 (52%), Gaps = 34/201 (16%)

Query: 656 DQQKAAIQKERTRRLEE--QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKE 713
           D     + K+   R+EE  ++++ S+RKL LV+DLD T++++     VDP   E      
Sbjct: 103 DNASLTVSKDEATRVEEDAKRRLLSSRKLSLVVDLDQTIIHAT----VDPTVAEW----- 153

Query: 714 EQDREKPHRHLFR----------FPHM-GMW--TKLRPGIWTFLERASKLFEMHLYTMGN 760
           ++D+  P+    +           P M G W   KLRPG+  FL   S L+E+H+YTMG 
Sbjct: 154 QEDKTNPNHEAVKDVRAFQLVDDGPGMRGCWYYIKLRPGLEDFLRSISSLYELHIYTMGT 213

Query: 761 KLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDD 819
           + YA  +A ++DP   +F  R++SR + G           +K+L+ +  +++  VVIIDD
Sbjct: 214 RAYAQNIANIVDPDRKIFGDRILSRDESGS--------LTAKNLQRLFPVDTKMVVIIDD 265

Query: 820 SVRVWPHNKLNLIVVERYTYF 840
              VW  +  NLI V  Y +F
Sbjct: 266 RGDVWNWSD-NLIRVHPYDFF 285



 Score = 50.4 bits (119), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 32/104 (30%), Positives = 55/104 (52%), Gaps = 4/104 (3%)

Query: 892 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLH-PLWQTAEQFGAVCTKHIDDQVTHV 950
           D++ I+   +R+IL    +VFS V P+ +   +    LW  A+ FGA+ T+ ID + TH+
Sbjct: 481 DIKIIMPQIKRRILEAVVVVFSGVIPLDKDTQNAEISLW--AKSFGAIITQKIDSRTTHL 538

Query: 951 VANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAI 993
           VA    T KV  A    +  +V   W+  S   ++R +E+ + +
Sbjct: 539 VAGRNRTAKVREATRYPKIKIVTVQWLLDSLTQWKRLDEEPYLV 582


>gi|255712225|ref|XP_002552395.1| KLTH0C03894p [Lachancea thermotolerans]
 gi|238933774|emb|CAR21957.1| KLTH0C03894p [Lachancea thermotolerans CBS 6340]
          Length = 745

 Score = 94.4 bits (233), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 71/203 (34%), Positives = 102/203 (50%), Gaps = 36/203 (17%)

Query: 656 DQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSA---KFHE-----VDPVHDE 707
           ++Q A I+K   + L E KK      L LV+DLD T+++       HE      +P +D 
Sbjct: 146 ERQAATIEKTAQKHLREHKK------LVLVVDLDQTVIHCGVDPTIHEWANDPSNPNYDA 199

Query: 708 ILRKKEEQDREKPHRHLFRFPHMG---------MWTKLRPGIWTFLERASKLFEMHLYTM 758
           +   K     E P    F   +MG          + KLRPG+  F ++ +  FE+H+YTM
Sbjct: 200 LKNVKTFSLDEDPILPPF---YMGPRPPPRKCQYYVKLRPGLQEFFDKIAPHFELHIYTM 256

Query: 759 GNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVII 817
             + YA E+AK++DPKG LF  R++SR ++G            K LE +  M +S VVII
Sbjct: 257 ATRAYALEIAKIIDPKGELFGDRILSRDENGS--------LTHKSLERLFPMDQSMVVII 308

Query: 818 DDSVRVWPHNKLNLIVVERYTYF 840
           DD   VW   + NLI V  Y +F
Sbjct: 309 DDRGDVWSWCE-NLIKVVPYNFF 330



 Score = 48.9 bits (115), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 26/104 (25%), Positives = 49/104 (47%), Gaps = 3/104 (2%)

Query: 892 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVV 951
           D++ ++   + ++   C  VFS + P+G  N     +      FGA  +  +D+  THV+
Sbjct: 473 DIKTLMPQMKSQVFKRCHFVFSGLIPLG-TNVRQADIVLWTSMFGATTSVDVDEATTHVI 531

Query: 952 ANSLGTDKVNWALST--GRFVVHPGWVEASALLYRRANEQDFAI 993
             + GT K   A +      V+HP W+    + + R NE+ + +
Sbjct: 532 TKTPGTFKARLAKTYNPNIKVIHPDWIFECLVKWVRVNEKPYEL 575


>gi|281204241|gb|EFA78437.1| hypothetical protein PPL_09089 [Polysphondylium pallidum PN500]
          Length = 1252

 Score = 94.4 bits (233), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 77/311 (24%), Positives = 131/311 (42%), Gaps = 59/311 (18%)

Query: 733  WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPF 792
            + K+RP   TFL+    LF + L+++ +K Y  +M +++DP   LF   +I+    GD  
Sbjct: 934  YVKIRPYTITFLKTLYPLFNITLFSLNHKSYVNKMVEIIDPSKTLFKN-IITIESFGD-- 990

Query: 793  DGDERVPKSK-----------DLEGVLGMES--AVVIIDDSVRVWPHNKLNLIVVERYTY 839
                 +PK +           +   +  ++S  ++V+IDD   +W   + NLI+VER+ +
Sbjct: 991  ----NIPKQQTNRPYSLFTPSNFSSIFKIDSSESIVVIDDREDIWRQFRDNLIMVERFIH 1046

Query: 840  F--------------PCS------RRQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLH 879
            F              P S      +  F  L PS+  ++   R         +  ++   
Sbjct: 1047 FSKDVEHSKITTYHYPFSIQDLSKQYHFPKLEPSIPYLEIFSRLLKIIQCEYMSRVQETQ 1106

Query: 880  KI---------------FFSHQSLDD--VDVRNILAAEQRKILAGCRIVFSRVFP--VGE 920
            +I                   Q++DD     R I+   ++ +L  C IVFS +FP  +  
Sbjct: 1107 QINNNNNNNSSSGKQQLMAGAQTIDDNLSQSRTIMKEIRQSVLMDCNIVFSGIFPKQIDS 1166

Query: 921  ANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASA 980
            A      + Q  E FGA     I D  TH++    GT KV  A+  G  VVH  W+  S 
Sbjct: 1167 AKLVTTRIVQLTESFGAKVQMDITDNTTHLIYIKEGTSKVLQAMKCGIKVVHFAWLRDSI 1226

Query: 981  LLYRRANEQDF 991
              + + +E ++
Sbjct: 1227 YHWEKMDESNY 1237


>gi|430812451|emb|CCJ30145.1| unnamed protein product [Pneumocystis jirovecii]
          Length = 741

 Score = 94.0 bits (232), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 68/211 (32%), Positives = 105/211 (49%), Gaps = 34/211 (16%)

Query: 651 FEGYDDQQKAAIQ-----------KERTRRLEEQ--KKMFSARKLCLVLDLDHTLLNSAK 697
           F G+ D  +A IQ           KE   RLE +  +++    KL L++DLD T+L++  
Sbjct: 151 FTGFLDSTRATIQMSHDATKLTVSKEEATRLERETMERLLKEMKLSLIVDLDQTILHAT- 209

Query: 698 FHEVDPVHDEILRK---KEEQDREKPHRHLFRFPHMGM----WTKLRPGIWTFLERASKL 750
              VDP+  E L     K     +   +   +  + G+    + K+RPG+  FLE  SKL
Sbjct: 210 ---VDPIVGEWLSNPSSKHYLAVQDVQKFCLKENNSGIGNWYYVKMRPGLEQFLENISKL 266

Query: 751 FEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM 810
           +EMH+YTMG + YA  +A ++D     F  R++SR + G            K+++ +  +
Sbjct: 267 YEMHIYTMGTRAYAASIAHLIDKDKKYFGDRILSRDESGS--------TTRKNIQRLFPV 318

Query: 811 E-SAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           + S VVIIDD   VW  +  NLI V  Y +F
Sbjct: 319 DTSMVVIIDDRADVWQWSP-NLIKVTPYEFF 348



 Score = 48.9 bits (115), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 31/107 (28%), Positives = 56/107 (52%), Gaps = 2/107 (1%)

Query: 888 LDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQV 947
           ++ +D++ I+   +  +L G  +VFS + P+G  +     + Q A  FGA  +K+I + V
Sbjct: 510 INKLDIKVIMPMLKSNVLKGTNLVFSGIVPMG-MDVLSSNIAQWAINFGAKVSKNISNDV 568

Query: 948 THVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAI 993
           TH++A  + T KV  AL      +V   W+  S   ++R  E D+ +
Sbjct: 569 THLIATKIRTTKVKKALQQKNIKIVSIDWLLHSISHWKRLPESDYQL 615


>gi|189211133|ref|XP_001941897.1| RNA polymerase II subunit A C-terminal domain phosphatase
           [Pyrenophora tritici-repentis Pt-1C-BFP]
 gi|187977990|gb|EDU44616.1| RNA polymerase II subunit A C-terminal domain phosphatase
           [Pyrenophora tritici-repentis Pt-1C-BFP]
          Length = 774

 Score = 94.0 bits (232), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 56/178 (31%), Positives = 96/178 (53%), Gaps = 21/178 (11%)

Query: 673 QKKMFSARKLCLVLDLDHTLLNSAKFHEV--------DPVHDEILRKKEEQDREKPHRHL 724
           +K++ SARKL L++DLD T++++     +        +P HD +   K+ Q  +    ++
Sbjct: 152 KKRLLSARKLTLIVDLDQTVIHTTCERTIAEWQADPENPNHDAV---KDVQGFQLADDNV 208

Query: 725 FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 784
                   + K+RPG+  F +R SKL+EMH+YTM  + YA  +AK++DP+   F  R++S
Sbjct: 209 SNVAANWYYVKMRPGLKDFFDRVSKLYEMHVYTMATRAYAQAVAKIIDPERKYFGDRILS 268

Query: 785 RGDDGDPFDGDERVPKSKDLEGVLGMESAV-VIIDDSVRVWPHNKLNLIVVERYTYFP 841
           R ++           K K L  +    +A+ VIIDD   VW ++  +L+ V  + +FP
Sbjct: 269 RDEN--------YTDKLKSLTRLFYQNTAMCVIIDDRADVWQYSP-HLVRVPVFNFFP 317


>gi|330930047|ref|XP_003302870.1| hypothetical protein PTT_14854 [Pyrenophora teres f. teres 0-1]
 gi|311321498|gb|EFQ89046.1| hypothetical protein PTT_14854 [Pyrenophora teres f. teres 0-1]
          Length = 803

 Score = 93.6 bits (231), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 56/178 (31%), Positives = 96/178 (53%), Gaps = 21/178 (11%)

Query: 673 QKKMFSARKLCLVLDLDHTLLNSAKFHEV--------DPVHDEILRKKEEQDREKPHRHL 724
           +K++ SARKL L++DLD T++++     +        +P HD +   K+ Q  +    ++
Sbjct: 152 KKRLLSARKLTLIVDLDQTVIHTTCERTIAEWQADPENPNHDAV---KDVQGFQLADDNV 208

Query: 725 FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 784
                   + K+RPG+  F +R SKL+EMH+YTM  + YA  +AK++DP+   F  R++S
Sbjct: 209 SNVAANWYYVKMRPGLKDFFDRVSKLYEMHVYTMATRAYAQAVAKIIDPERKYFGDRILS 268

Query: 785 RGDDGDPFDGDERVPKSKDLEGVLGMESAV-VIIDDSVRVWPHNKLNLIVVERYTYFP 841
           R ++           K K L  +    +A+ VIIDD   VW ++  +L+ V  + +FP
Sbjct: 269 RDEN--------YTDKLKSLTRLFYQNTAMCVIIDDRADVWQYSP-HLVRVPVFNFFP 317


>gi|428672202|gb|EKX73116.1| conserved hypothetical protein [Babesia equi]
          Length = 739

 Score = 93.6 bits (231), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 67/200 (33%), Positives = 99/200 (49%), Gaps = 37/200 (18%)

Query: 675 KMFSARKLCLVLDLDHTLLNSAK--------FHEVDPVHDE--ILRKKEEQDREKPHRHL 724
           K+   RKLCLVLDLD+TLL+++           E+D +  +  I +  +  D E   +  
Sbjct: 240 KVLQKRKLCLVLDLDNTLLHASSQKLPSDVYVDEIDFLSKDADIFKDVQYNDDEGTLKLR 299

Query: 725 FRFPHMGMWT---------------KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAK 769
            +F    + T               KLRPG++ FL+  S  FE++L+TMG K +A+   K
Sbjct: 300 KKFESSIIQTMVYNESETMCCKSYFKLRPGVFKFLKEMSAKFELYLFTMGTKQHASSSLK 359

Query: 770 VLDPKGVLFAGRVISRGDDGDPFDGDERV-PKSKDLEGVLGMESAVVIIDDSVRVWPHNK 828
           +LDPK + F  R+  R D        +R+ PK K+L         V+I+DD+  VW  N 
Sbjct: 360 ILDPKRIYFGNRIFCRNDSRSSMKSLDRIFPKHKNL---------VLIVDDTEHVWTCN- 409

Query: 829 LNLIVVERYTYFP-CSRRQF 847
           L LI +  Y +FP  S  QF
Sbjct: 410 LGLIKIHPYFFFPDLSYLQF 429


>gi|330799899|ref|XP_003287978.1| hypothetical protein DICPUDRAFT_55168 [Dictyostelium purpureum]
 gi|325082002|gb|EGC35499.1| hypothetical protein DICPUDRAFT_55168 [Dictyostelium purpureum]
          Length = 730

 Score = 93.6 bits (231), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 62/168 (36%), Positives = 91/168 (54%), Gaps = 14/168 (8%)

Query: 674 KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMW 733
           K++   RKL LVLDLDHTL+++     ++   +   R +++ D    H      P M   
Sbjct: 129 KRLIKERKLSLVLDLDHTLIHAVTEQGLNSSPNWKNRNRKDYD---IHNITVNGP-MTYC 184

Query: 734 TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD-DGDPF 792
            K RP +  FLE  +K FE+H+YTMG + YA E+AK++DP   LF  R++SR D +G  F
Sbjct: 185 IKKRPHLNDFLENVNKNFELHIYTMGTRNYANEIAKLIDPDQTLFKERILSRDDGNGINF 244

Query: 793 DGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
              +R+    D        S V+I+DD   VW  +K NLI +  Y +F
Sbjct: 245 KTLQRLFPCDD--------SMVLIVDDRSDVWKKSK-NLIQISPYVFF 283



 Score = 72.8 bits (177), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 44/131 (33%), Positives = 72/131 (54%), Gaps = 2/131 (1%)

Query: 865 DGTLASSLGVIERLHKIFFSHQSLDDV-DVRNILAAEQRKILAGCRIVFSRVFPVGEANP 923
           D  L   L  + ++H +++  +  ++   V  ++   +++IL    IVFS V+P+G    
Sbjct: 406 DCHLLIVLDKLTKIHDLYYKSKDKNEKPHVIQMVDIVKKEILKDQFIVFSGVYPLGTP-V 464

Query: 924 HLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLY 983
           +  PL   AE+FGA     I  + THV+A   GT KVN A+S G  VV P W+  S  ++
Sbjct: 465 NKQPLRYLAEEFGASVENDITSKTTHVIAQRKGTSKVNKAISKGLKVVSPQWLVESTRIW 524

Query: 984 RRANEQDFAIK 994
           +RA+E DF ++
Sbjct: 525 QRADENDFKLE 535


>gi|301118528|ref|XP_002906992.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262108341|gb|EEY66393.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 735

 Score = 93.6 bits (231), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 68/254 (26%), Positives = 112/254 (44%), Gaps = 63/254 (24%)

Query: 674 KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMW 733
           ++   A+KL LVLDLDHTLL++ +   VD V  EI                         
Sbjct: 265 RRQLGAKKLSLVLDLDHTLLHAVR---VDDVVSEI------------------------- 296

Query: 734 TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFD 793
                       + + L+++ +YT G +LYA ++  ++DP    F  R+++R D  D   
Sbjct: 297 ------------KQTVLYDLFIYTHGTRLYAEKIVNIIDPDETYFKNRIVARTDTPDMLH 344

Query: 794 GDERV--PKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLG 851
              ++  P   D        S ++++DD + VW  N+ N+ ++E Y YF C+       G
Sbjct: 345 KSLKLLFPSCDD--------SMILVLDDRIDVWKENEGNVFLIEPYHYFKCTSEINNASG 396

Query: 852 PSL--LEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQ-------- 901
             +  +E    E SED  LA S  V+  +H+ F++     +  +R   A EQ        
Sbjct: 397 RGVAGMEDSEAEASEDSHLAQSTTVLRHVHEAFYAGH---ETGMRGTTAEEQMGGHGRDV 453

Query: 902 RKILAGCRIVFSRV 915
           + IL+ C   +SRV
Sbjct: 454 KGILSACARSWSRV 467


>gi|156837042|ref|XP_001642557.1| hypothetical protein Kpol_1068p9 [Vanderwaltozyma polyspora DSM
           70294]
 gi|156113100|gb|EDO14699.1| hypothetical protein Kpol_1068p9 [Vanderwaltozyma polyspora DSM
           70294]
          Length = 745

 Score = 93.2 bits (230), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 66/192 (34%), Positives = 93/192 (48%), Gaps = 42/192 (21%)

Query: 673 QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFR------ 726
           +K++   +KL LV+DLD T+++      VDP   E      + D   P+    R      
Sbjct: 156 KKRLIREKKLILVVDLDQTVIHCG----VDPTIAEW-----KNDPTNPNFETLRDVKSFV 206

Query: 727 ------FPHMGM-----------WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAK 769
                  P M M           + K+RPG+  F E  SKL+EMH+YTM  + YA E+AK
Sbjct: 207 LEEEPILPPMYMGPKPPTHKCWYYVKIRPGLKEFFEEVSKLYEMHIYTMATRSYAQEIAK 266

Query: 770 VLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNK 828
           ++DP G LFA R++SR ++G            K LE +    +S VV+IDD   VW    
Sbjct: 267 IIDPDGTLFADRILSRNENGS--------LTHKSLERLFPTDQSMVVVIDDRGDVWNWCP 318

Query: 829 LNLIVVERYTYF 840
            NLI V  Y +F
Sbjct: 319 -NLIKVTPYNFF 329



 Score = 46.2 bits (108), Expect = 0.090,   Method: Compositional matrix adjust.
 Identities = 29/124 (23%), Positives = 58/124 (46%), Gaps = 4/124 (3%)

Query: 864 EDGTLASSLGVIERLHKIFFSHQSLD-DVDVRNILAAEQRKILAGCRIVFSRVFPVGEAN 922
           +D  L     +++ + + ++     D ++ ++ ++   ++K+L GC  VFS + P+G  N
Sbjct: 445 DDDELFYLKDILKNVQQKYYEELKTDKEIQIQTLMPKLKKKVLEGCNFVFSGLIPLG-TN 503

Query: 923 PHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGT--DKVNWALSTGRFVVHPGWVEASA 980
                +      FGA  +  I +  THV+  +  T   K+  A  +   V+HP WV    
Sbjct: 504 IQKADIVLWTNMFGAQSSADITEDTTHVITKTPHTYKAKIAKAFKSDIKVLHPDWVFECL 563

Query: 981 LLYR 984
           L ++
Sbjct: 564 LNWK 567


>gi|407929624|gb|EKG22436.1| BRCT domain-containing protein [Macrophomina phaseolina MS6]
          Length = 861

 Score = 92.8 bits (229), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 63/184 (34%), Positives = 97/184 (52%), Gaps = 32/184 (17%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFR---- 726
           E ++++ S +KL LV+DLD T++++     VDP   E      ++D E P+    +    
Sbjct: 150 EAKRRLLSNKKLSLVVDLDQTIIHAT----VDPTVAEW-----QKDPENPNYEAVKDVQS 200

Query: 727 FPHM-------GMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 777
           F  +       G W   KLRPG+  FLE  SK++E+H+YTMG + YA  +AK++DP   +
Sbjct: 201 FQLLDNGPGGRGCWYYIKLRPGLREFLENISKVYELHIYTMGTRAYAQNIAKIVDPNRKI 260

Query: 778 FAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVER 836
           F  R++SR + G            K L  +  +++  VVIIDD   VW  +  NLI V  
Sbjct: 261 FGDRILSRDESGS--------LTVKTLHRIFPVDTKMVVIIDDRGDVWSWSN-NLIKVTP 311

Query: 837 YTYF 840
           Y +F
Sbjct: 312 YDFF 315



 Score = 43.1 bits (100), Expect = 0.68,   Method: Compositional matrix adjust.
 Identities = 31/100 (31%), Positives = 52/100 (52%), Gaps = 5/100 (5%)

Query: 892 DVRNILAAEQRKILAGCRIVFSRVFPVG-EANPHLHPLWQTAEQFGAVCTKHIDDQVTHV 950
           D+  I+   +R +L G  IVFS V P+G       + +W  A+ FGA  ++ I+ + THV
Sbjct: 527 DIGLIMPEMKRSVLKGVSIVFSGVVPLGTNIQASEYAVW--AKSFGAKVSETINKKTTHV 584

Query: 951 V-ANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANE 988
           V A +  T KV  A    +  +V   W++   + ++R +E
Sbjct: 585 VAARNRRTQKVRQAARHPQIKIVTVDWLKECFVTWQRVSE 624


>gi|342320998|gb|EGU12936.1| RNA Polymerase II CTD phosphatase Fcp1, putative [Rhodotorula
           glutinis ATCC 204091]
          Length = 817

 Score = 92.8 bits (229), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 74/252 (29%), Positives = 114/252 (45%), Gaps = 78/252 (30%)

Query: 651 FEGYDDQQKAAIQK-----------ERTRRLEE--QKKMFSARKLCLVLDLDHTLLNSAK 697
           + G+ D  +A I             E   RLE+    ++  A+KL L++DLD T++++  
Sbjct: 115 YTGFSDTSRATISMAHDIGGLTVSLEEAHRLEKATTARLLDAKKLSLIVDLDQTIVHAT- 173

Query: 698 FHEVDPVHDEIL--------------RKKEEQD-----REKPHRHL-FRFPHM------- 730
              VDP   E L              ++ + QD     R KP ++   R   +       
Sbjct: 174 ---VDPTVGEWLQDPKNPNYKALEGVKRFKLQDESPATRNKPKKYRKIRIKQVDPSKGEE 230

Query: 731 ----------------GMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLD 772
                           G W   K+RPG+  FL+R ++++EMH+YTMG + YA+E+ KV+D
Sbjct: 231 ADDESSEDEEEDEDDGGCWYYIKMRPGLPDFLKRVAEMYEMHVYTMGTRAYASEVCKVID 290

Query: 773 PKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMES-AVVIIDDSVRVW---PHNK 828
           P G LF GR++SR + G            K L+ +   ++  VVIIDD   VW   PH  
Sbjct: 291 PDGGLFGGRILSRDESGS--------MTRKSLQRLFPCDTNMVVIIDDRADVWDGSPH-- 340

Query: 829 LNLIVVERYTYF 840
             L+ V  Y +F
Sbjct: 341 --LVKVIPYEFF 350



 Score = 61.2 bits (147), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 34/116 (29%), Positives = 56/116 (48%), Gaps = 2/116 (1%)

Query: 879 HKIFFSHQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAV 938
            + +  H +    DV +I+ A + + L    +VFS +  +G + P     W+ A  FGA 
Sbjct: 485 ERFYEQHDAKGAADVADIIPAMKAQTLRDTHLVFSGLVALG-SRPEDSEYWKLARTFGAR 543

Query: 939 CTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAI 993
           C+  +    TH+VAN  GT KV+ A    R  +V+P W+  S   + R  E  + +
Sbjct: 544 CSADLSSSTTHLVANGWGTAKVHSAQRNPRIKIVYPEWLLESVARWTRVPEDAYVL 599


>gi|398396164|ref|XP_003851540.1| hypothetical protein MYCGRDRAFT_44229 [Zymoseptoria tritici IPO323]
 gi|339471420|gb|EGP86516.1| hypothetical protein MYCGRDRAFT_44229 [Zymoseptoria tritici IPO323]
          Length = 822

 Score = 92.4 bits (228), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 68/213 (31%), Positives = 106/213 (49%), Gaps = 34/213 (15%)

Query: 656 DQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQ 715
           + + A I +ER +RL        +RKL LV+DLD T++ +     V+P   E      + 
Sbjct: 146 ENEAAKIDEERRKRL------LDSRKLSLVVDLDQTIIQA----NVEPTIGEW-----KN 190

Query: 716 DREKPHRHL------FRFPHMG---MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATE 766
           D   P+         F+    G    + KLRPG+  FL   S+L+E+H+YTMG + YA  
Sbjct: 191 DPTNPNWKALQDVCQFQLADDGRTWYYVKLRPGLKDFLRDMSELYELHIYTMGTRAYADN 250

Query: 767 MAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMES-AVVIIDDSVRVWP 825
           +AK++DP   +F  R++SR ++G            K+L+ +   ++  VVIIDD   VW 
Sbjct: 251 IAKIVDPDRKVFGDRILSRDENGS--------MTVKNLKRLFHADTRMVVIIDDRADVW- 301

Query: 826 HNKLNLIVVERYTYFPCSRRQFGLLGPSLLEID 858
           H   NLI V  + +FP      G+  P   E++
Sbjct: 302 HWTPNLIKVNAFEFFPGVGDINGMFLPKRQELE 334



 Score = 52.0 bits (123), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 38/116 (32%), Positives = 63/116 (54%), Gaps = 10/116 (8%)

Query: 886 QSLDDV----DVRNILAAEQRKILAGCRIVFSRVFPVGE-ANPHLHPLWQTAEQFGAVCT 940
           +S+D+V    D   I+ + + K+LAG  IVFS V P+G+  N +   +W  A  FGA  T
Sbjct: 501 RSIDEVHSIPDAAPIMQSMKAKVLAGVHIVFSGVIPLGQDLNTNDTAIW--ARSFGAKIT 558

Query: 941 KHIDDQVTHVVAN-SLGTDKVNWAL-STGRF-VVHPGWVEASALLYRRANEQDFAI 993
           ++I  + THV+A+    T KV  A   +GR  +V   W+ +  + + + +E  + I
Sbjct: 559 ENITKKTTHVIASPERRTAKVRQAAKKSGRIAIVSQNWLSSCFMQWEKVDESPYRI 614


>gi|296419837|ref|XP_002839498.1| hypothetical protein [Tuber melanosporum Mel28]
 gi|295635659|emb|CAZ83689.1| unnamed protein product [Tuber melanosporum]
          Length = 896

 Score = 92.4 bits (228), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 61/196 (31%), Positives = 100/196 (51%), Gaps = 24/196 (12%)

Query: 656 DQQKAAIQKERTRRLEEQ--KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDE------ 707
           D     + ++   RLEE+  +++  ++KL LV+DLD T++++     VDP   +      
Sbjct: 132 DSMGLTVSQDEATRLEEETKRRLLKSKKLSLVVDLDQTIIHAT----VDPTVGDWKNDPF 187

Query: 708 ILRKKEEQDREKPHRHLFRFPHMGMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYAT 765
            +  +  +D +            G W   K+RPG+  FLE  S+L+E+H+YTMG + YA 
Sbjct: 188 CINHESVKDVQAFKLDEDIIGGRGTWYYVKMRPGLKEFLEHISQLYELHIYTMGTRAYAM 247

Query: 766 EMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVW 824
            + K++DP G +F  RV+SR + G            K L  +  +++  VVIIDD   VW
Sbjct: 248 SVKKIVDPDGRIFGERVLSRDESGS--------MTQKSLHRIFPVDTKMVVIIDDRGDVW 299

Query: 825 PHNKLNLIVVERYTYF 840
             +  NL+ V  Y +F
Sbjct: 300 KWSD-NLVKVRPYDFF 314


>gi|449299873|gb|EMC95886.1| hypothetical protein BAUCODRAFT_71386 [Baudoinia compniacensis UAMH
           10762]
          Length = 790

 Score = 92.0 bits (227), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 61/180 (33%), Positives = 98/180 (54%), Gaps = 24/180 (13%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 730
           E ++++  +RKL LV+DLD T++++     VDP   E  +  E        + + +F  +
Sbjct: 150 EAKRRLIKSRKLSLVVDLDQTIIHAT----VDPTVAE-WQADETNPNHAAVKGVRKFQLV 204

Query: 731 -------GMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGR 781
                  G W   KLRPG+  FL+  S+ +E+H+YTM  + YA E+AK++DP   LFA R
Sbjct: 205 DDGPGGRGTWYYIKLRPGLSDFLQLVSQYYELHIYTMATRAYAEEIAKLVDPGRKLFANR 264

Query: 782 VISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERYTYF 840
           ++SR ++G           SK L+ +  +++  VVIIDD   VW  +  NL+ V  Y +F
Sbjct: 265 ILSRDENGS--------MNSKSLKRLFPVDTKMVVIIDDRGDVWSWSP-NLVKVSAYDFF 315



 Score = 49.3 bits (116), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 34/113 (30%), Positives = 58/113 (51%), Gaps = 8/113 (7%)

Query: 888 LDDV----DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHI 943
           LDD+    D   I+ + + ++L+G  +VFS V P+G  NP  H L   A  FGA     +
Sbjct: 524 LDDIENIPDAAAIMTSMKSRVLSGVHLVFSGVVPLG-VNPLNHDLAIWARSFGANVQVKV 582

Query: 944 DDQVTHVVAN-SLGTDKVNWALSTGR--FVVHPGWVEASALLYRRANEQDFAI 993
             + TH++++    T KV  AL  G    +V+  W+ A    +++ +E+ + I
Sbjct: 583 SRRTTHLISSPDRRTAKVRQALKKGSRIAIVNQNWLYACFSQWKKVDEEPYRI 635


>gi|402220046|gb|EJU00119.1| hypothetical protein DACRYDRAFT_81791 [Dacryopinax sp. DJM-731 SS1]
          Length = 855

 Score = 92.0 bits (227), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 64/202 (31%), Positives = 96/202 (47%), Gaps = 44/202 (21%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEIL------RKKEEQDREKPH--- 721
           E Q ++  +RKL LV+DLD T++ +     VDP   E +       +  E  R+ P+   
Sbjct: 153 ESQIRLLGSRKLSLVVDLDQTIIQAT----VDPTVGEWIDQGRAWEEGREGARKNPNWEA 208

Query: 722 -RHLFRF---------------------PHMGMWTKLRPGIWTFLERASKLFEMHLYTMG 759
            R + RF                          + K RPG+  FL R S+L+EMH+YTMG
Sbjct: 209 LRDVGRFRLSEERKVVNGRGGKVIRSKREDTAYYIKPRPGLHAFLSRLSELYEMHVYTMG 268

Query: 760 NKLYATEMAKVLDPKGVLFAGRVISRGDDGD-PFDGDERVPKSKDLEGVLGMESAVVIID 818
            + YA+++ +++DP G LF  RV+SR + G   F    R+             S+ VIID
Sbjct: 269 TRSYASQVVRLIDPLGNLFGSRVLSRDESGSLTFKNLTRLFPCNT--------SSAVIID 320

Query: 819 DSVRVWPHNKLNLIVVERYTYF 840
           D   VW  ++ NL+ V  Y +F
Sbjct: 321 DRADVWDLSRANLVKVVPYDFF 342



 Score = 77.4 bits (189), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 43/141 (30%), Positives = 74/141 (52%), Gaps = 11/141 (7%)

Query: 863 SEDGTLASSLGVIERLHKIFFSHQSLDDVD--------VRNILAAEQRKILAGCRIVFSR 914
           ++D  L+  L ++E +H+ +F    L   D        V  I+   + ++L+G  +VFS 
Sbjct: 510 NDDHELSRVLRILEEIHRRYFQQYDLYVADKNTRKEPSVMEIIPEMKAEVLSGVHLVFSS 569

Query: 915 VFPVGEANPHLHP-LWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHP 973
           + P+    PH +  LW+ A QFGA C   +  +VTHVVA   GT+KV   ++ G  +V+P
Sbjct: 570 LIPID--MPHQNTDLWRQALQFGAACYTRVAREVTHVVAAKRGTEKVRQGVARGCKIVNP 627

Query: 974 GWVEASALLYRRANEQDFAIK 994
            W   S   + R  E+++ ++
Sbjct: 628 YWFMDSVAAWERKKEEEYPLE 648


>gi|255936731|ref|XP_002559392.1| Pc13g09690 [Penicillium chrysogenum Wisconsin 54-1255]
 gi|211584012|emb|CAP92038.1| Pc13g09690 [Penicillium chrysogenum Wisconsin 54-1255]
          Length = 792

 Score = 92.0 bits (227), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 64/188 (34%), Positives = 103/188 (54%), Gaps = 25/188 (13%)

Query: 664 KERTRRLEEQKK-MFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR 722
           +E TR  E+ K+ + ++R+L LV+DLD T++++     VDP   E    K+  + E   R
Sbjct: 114 REATRVEEDAKRRLLASRRLTLVVDLDQTIIHAT----VDPTVGEWREDKQNPNHEAV-R 168

Query: 723 HLFRF------PHM-GMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDP 773
            + +F      P M G W   KLRPG+  FL+  ++++E+H+YTMG + YA  +  ++DP
Sbjct: 169 DVRQFQLIDDGPGMRGCWYYIKLRPGLEEFLQNVAEIYELHIYTMGTRAYAQHIVDIIDP 228

Query: 774 KGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLI 832
              LF  R++SR + G            KDL+ +  +++  VVIIDD   +W  +  NLI
Sbjct: 229 TRKLFGDRILSRDESGS--------LTVKDLQRLFPVDTKMVVIIDDRGDIWRWSP-NLI 279

Query: 833 VVERYTYF 840
            V  Y +F
Sbjct: 280 KVSPYDFF 287



 Score = 45.8 bits (107), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 40/135 (29%), Positives = 66/135 (48%), Gaps = 14/135 (10%)

Query: 865 DGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVG--EAN 922
           DG +A+  G  ERLH            D+++I+   +R+IL G  +VFS V P+G    N
Sbjct: 445 DGRVAALRG--ERLHSRDKDMDLKSVPDIKDIMPRIKRRILGGVVLVFSGVLPLGIDFQN 502

Query: 923 PHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRF----VVHPGWVEA 978
             +  LW  A+ FG   +  ++ + TH+VA    T KV  A    R+    +V   W+  
Sbjct: 503 ADI-SLW--AKSFGVTISSRVNARTTHLVAGRNRTAKVREAT---RYPNVKIVTTQWLVD 556

Query: 979 SALLYRRANEQDFAI 993
           S + +R  +E+ + +
Sbjct: 557 SLVQWRHVDEEPYLL 571


>gi|453084575|gb|EMF12619.1| hypothetical protein SEPMUDRAFT_149240 [Mycosphaerella populorum
           SO2202]
          Length = 848

 Score = 91.7 bits (226), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 63/192 (32%), Positives = 99/192 (51%), Gaps = 32/192 (16%)

Query: 663 QKERTRRLEE-QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPH 721
           Q E T+  EE ++++  +R+L LV+DLD T++++     VDP   E      + D   P+
Sbjct: 142 QDEATKTDEEGKRRLLDSRRLSLVVDLDQTIIHAC----VDPSIGEW-----QNDPSNPN 192

Query: 722 RHLFRFPH----------MGMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAK 769
               R             +  W   K RPG+ +FL+  S+L+EMH+YTMG + YA  +AK
Sbjct: 193 YDALRDVQAFQLRDDNKPVATWYYIKQRPGLQSFLKGLSELYEMHIYTMGTRTYAEGVAK 252

Query: 770 VLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNK 828
           ++DP G +F  R+++R + G          K K L+ +   +S  VVIIDD   VW    
Sbjct: 253 IIDPDGRVFGDRIVTRTESGS--------DKEKSLKRLFPTDSKMVVIIDDRADVWRWIS 304

Query: 829 LNLIVVERYTYF 840
            NL+ V  + +F
Sbjct: 305 -NLVKVNVFEFF 315



 Score = 52.4 bits (124), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 41/132 (31%), Positives = 63/132 (47%), Gaps = 10/132 (7%)

Query: 864 EDGTLASSLGVIERLHKIFFSHQSLDDV-----DVRNILAAEQRKILAGCRIVFSRVFPV 918
           E   + S  G +  L     + +S DD      D   ++   + ++LA C IVFS V P+
Sbjct: 506 EKQAMGSKGGRVSELRPGLSTKRSADDELQHIPDAAVVMDDLKCRVLADCHIVFSGVVPL 565

Query: 919 G-EANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVAN-SLGTDKVNWALSTGRF-VVHPGW 975
           G + N H   LW  A+ FGA  ++HI  + THV+A+    T KV  A    R  VV   W
Sbjct: 566 GVDPNNHDAALW--AKSFGASVSQHITKKTTHVIASPERKTAKVRQAAKRPRIAVVSQHW 623

Query: 976 VEASALLYRRAN 987
           + A    +++ +
Sbjct: 624 LHACFSQWKKVD 635


>gi|215794710|pdb|3EF1|A Chain A, The Structure Of Fcp1, An Essential Rna Polymerase Ii Ctd
           Phosphatase
          Length = 442

 Score = 91.7 bits (226), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 66/191 (34%), Positives = 100/191 (52%), Gaps = 33/191 (17%)

Query: 665 ERTRRLEEQ--KKMFSARKLCLVLDLDHTLLNSAKFHEVDP-----------VHDEILRK 711
           E   RLE +  K++   ++L L++ LD T++++     VDP           V+ ++LR 
Sbjct: 8   EEASRLESENVKRLRQEKRLSLIVXLDQTIIHAT----VDPTVGEWMSDPGNVNYDVLRD 63

Query: 712 KEEQD-REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKV 770
               + +E P  +   +     + K RPG+  FL++ S+L+E+H+YTMG K YA E+AK+
Sbjct: 64  VRSFNLQEGPSGYTSCY-----YIKFRPGLAQFLQKISELYELHIYTMGTKAYAKEVAKI 118

Query: 771 LDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKL 829
           +DP G LF  RV+SR D G            K L  +   + S VV+IDD   VW  N  
Sbjct: 119 IDPTGKLFQDRVLSRDDSGS--------LAQKSLRRLFPCDTSMVVVIDDRGDVWDWNP- 169

Query: 830 NLIVVERYTYF 840
           NLI V  Y +F
Sbjct: 170 NLIKVVPYEFF 180



 Score = 51.6 bits (122), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 35/131 (26%), Positives = 65/131 (49%), Gaps = 14/131 (10%)

Query: 874 VIERLHKIFFSHQSLDDVDVRN----------ILAAEQRKILAGCRIVFSRVFPVGEANP 923
           V++ +H +++  +  +D+  R+          I+   ++K+L GCR++FS V P+G  + 
Sbjct: 314 VLKDIHAVYYEEE--NDISSRSGNHKHANVGLIIPKMKQKVLKGCRLLFSGVIPLG-VDV 370

Query: 924 HLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALL 982
               + + A  FGA          TH++A  + T+KV  A+S G   VV   W+  S   
Sbjct: 371 LSSDIAKWAMSFGAEVVLDFSVPPTHLIAAKIRTEKVKKAVSMGNIKVVKLNWLTESLSQ 430

Query: 983 YRRANEQDFAI 993
           ++R  E D+ +
Sbjct: 431 WKRLPESDYLL 441


>gi|403161615|ref|XP_003321927.2| hypothetical protein PGTG_03464 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
 gi|375171855|gb|EFP77508.2| hypothetical protein PGTG_03464 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
          Length = 423

 Score = 91.7 bits (226), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 51/148 (34%), Positives = 80/148 (54%), Gaps = 9/148 (6%)

Query: 855 LEIDHDERSEDGTLASSLGVIERL-------HKIFFSHQSLDDVDVRNILAAEQRKILAG 907
           ++ DH  R E   L      + RL       H  F+S  S+++ D R +++A +  +L G
Sbjct: 48  VDFDHPLRPEQAVLVDRDNELSRLKLILGDIHSGFYSQDSIEEADARAVISAIKHDVLHG 107

Query: 908 CRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTG 967
             + FS ++P+ EA       W+ AEQFGA C  H+  +VTH++A  LGT KVN ALS  
Sbjct: 108 LHLAFSSLWPM-EAVSEQQYAWKLAEQFGARCYTHLSPKVTHLIAAKLGTSKVNLALSRP 166

Query: 968 RF-VVHPGWVEASALLYRRANEQDFAIK 994
              +V P W+  +ALL+ R +E  ++ +
Sbjct: 167 NVSIVRPKWLYDAALLWTRPSEAGYSWR 194


>gi|258563858|ref|XP_002582674.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
 gi|237908181|gb|EEP82582.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
          Length = 897

 Score = 91.7 bits (226), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 64/196 (32%), Positives = 103/196 (52%), Gaps = 24/196 (12%)

Query: 656 DQQKAAIQKERTRRLEE--QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKE 713
           D     + K+   R+EE  ++++ ++RKL LV+DLD T++++     VDP   E    K 
Sbjct: 157 DNASLTVSKDEATRVEEDAKRRLLASRKLSLVVDLDQTIIHAT----VDPTVAEWREDKT 212

Query: 714 EQDRE-----KPHRHLFRFPHM-GMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYAT 765
             + E     +  + +   P M G W   KLRPG+  FL+  S L+E+H+YTM  + YA 
Sbjct: 213 NPNHEAVKNVRSFQLIDDGPGMRGCWYYIKLRPGLEEFLKNISSLYELHIYTMATRAYAQ 272

Query: 766 EMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVW 824
            +A ++DP   +F  R++SR + G           +K+L  +  +++  VVIIDD   VW
Sbjct: 273 NIANIVDPDRKIFGDRILSRDESGS--------LTAKNLHRLFPVDTKMVVIIDDRGDVW 324

Query: 825 PHNKLNLIVVERYTYF 840
             +  NLI V  Y +F
Sbjct: 325 KWSD-NLIRVFPYDFF 339



 Score = 52.0 bits (123), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 34/105 (32%), Positives = 55/105 (52%), Gaps = 6/105 (5%)

Query: 892 DVRNILAAEQRKILAGCRIVFSRVFPVGE--ANPHLHPLWQTAEQFGAVCTKHIDDQVTH 949
           D++ I+   +R+IL    +VFS V P+G    N  +  LW  A+ FGA+ T  ID + TH
Sbjct: 535 DIKIIMPQIKRRILEAVVMVFSGVLPLGTDLQNADI-SLW--AKSFGAIITNRIDSRTTH 591

Query: 950 VVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAI 993
           +VA    T KV  A    +  +V   W+  S   ++R +E+ + +
Sbjct: 592 LVAGRNRTAKVREATRYSKIKIVTVQWLLDSLTQWKRLDEEQYLV 636


>gi|393240595|gb|EJD48120.1| hypothetical protein AURDEDRAFT_85955 [Auricularia delicata
           TFB-10046 SS5]
          Length = 796

 Score = 91.7 bits (226), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 74/250 (29%), Positives = 110/250 (44%), Gaps = 73/250 (29%)

Query: 651 FEGYDDQQKAAIQK-----------ERTRRLEEQ--KKMFSARKLCLVLDLDHTLLNSAK 697
           + GY D  +A IQ            E  RR+E +  +++   RKL L++DLD T++++  
Sbjct: 123 YTGYSDSARANIQMTHLAGGPTVSLEEARRIEHETAERLLKNRKLSLIVDLDQTIVHAT- 181

Query: 698 FHEVDPVHDEILRKK---EEQDREKPH--------------------RHLFRF------P 728
              VDP   E + +    EE    KP                     R + RF      P
Sbjct: 182 ---VDPTVGEWIAQGQAWEEYQARKPSESTTPEPDAPPEPNANWEALRDVRRFTLAHDGP 238

Query: 729 HMG-----------------MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVL 771
           H+                   + K RPG+  FLE  S+ +EMH+YTMG + YA ++   +
Sbjct: 239 HLNHKHPWKGKEKEDEHGCLYYIKPRPGLQAFLEAISQKYEMHVYTMGTRAYAEKVCAAI 298

Query: 772 DPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLN 830
           DP G +F  R++SR + G           +K LE +   + S VVIIDD   VW  +  N
Sbjct: 299 DPDGRMFGRRILSRDESGS--------LTAKSLERLFPCDTSMVVIIDDRSDVWDRSP-N 349

Query: 831 LIVVERYTYF 840
           L+ V RY +F
Sbjct: 350 LVEVVRYDFF 359



 Score = 57.8 bits (138), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 46/86 (53%), Gaps = 3/86 (3%)

Query: 892 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVV 951
           DVR ++ A + +  AG   +FS + P+ E  P   P+W+ A +FGA+C   +  ++THV+
Sbjct: 510 DVRKLIGAIKAQTFAGMHFLFSSLIPL-EDKPEESPIWKQAREFGAICHSEVSPRLTHVI 568

Query: 952 ANSLGTDKVNWALSTGRFVVHPGWVE 977
                T KV+ A   G    H  WV+
Sbjct: 569 TAKRSTAKVDAARRRGE--AHIVWVQ 592


>gi|425767354|gb|EKV05928.1| RNA Polymerase II CTD phosphatase Fcp1, putative [Penicillium
           digitatum PHI26]
 gi|425779797|gb|EKV17828.1| RNA Polymerase II CTD phosphatase Fcp1, putative [Penicillium
           digitatum Pd1]
          Length = 817

 Score = 91.7 bits (226), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 62/188 (32%), Positives = 103/188 (54%), Gaps = 23/188 (12%)

Query: 663 QKERTRRLEEQKK-MFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDRE--- 718
           ++E TR  E+ K+ + ++R+L LV+DLD T++++     VDP   E    K+  + E   
Sbjct: 140 EREATRVEEDAKRRLLASRRLTLVVDLDQTIIHAT----VDPTVGEWREDKQNPNHEAVK 195

Query: 719 --KPHRHLFRFPHM-GMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDP 773
             +  + +   P M G W   KLRPG+  FL+  ++++E+H+YTMG + YA  +  ++DP
Sbjct: 196 DVRQFQLIDDGPGMRGCWYYIKLRPGLEEFLQNVAEIYELHIYTMGTRAYAQHIVDIIDP 255

Query: 774 KGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLI 832
              LF  R++SR + G            KDL+ +  +++  VVIIDD   +W  +  NLI
Sbjct: 256 TRKLFGDRILSRDESGS--------LTVKDLQRLFPVDTKMVVIIDDRGDIWRWSP-NLI 306

Query: 833 VVERYTYF 840
            V  Y +F
Sbjct: 307 KVSPYDFF 314



 Score = 46.2 bits (108), Expect = 0.087,   Method: Compositional matrix adjust.
 Identities = 32/105 (30%), Positives = 54/105 (51%), Gaps = 6/105 (5%)

Query: 892 DVRNILAAEQRKILAGCRIVFSRVFPVG--EANPHLHPLWQTAEQFGAVCTKHIDDQVTH 949
           DV++I+   +R+IL G  +VFS V P+G    N  +  LW  A+ FG   +  I+ + TH
Sbjct: 495 DVKDIMPQIKRRILGGVVLVFSGVLPLGIDFQNADI-SLW--AKSFGVTISSRINTRTTH 551

Query: 950 VVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAI 993
           +VA    T KV  A       +V   W+  + + +R  +E+ + +
Sbjct: 552 LVAGRNRTAKVREATRYPNIKIVTTQWLVDALVQWRHVDEEPYLL 596


>gi|452004576|gb|EMD97032.1| hypothetical protein COCHEDRAFT_1163398 [Cochliobolus
           heterostrophus C5]
          Length = 803

 Score = 91.3 bits (225), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 54/178 (30%), Positives = 96/178 (53%), Gaps = 21/178 (11%)

Query: 673 QKKMFSARKLCLVLDLDHTLLNSAKFHEV--------DPVHDEILRKKEEQDREKPHRHL 724
           ++++ +ARKL L++DLD T++++     +        +P HD +   K+ Q  +    ++
Sbjct: 152 KRRLLNARKLTLIVDLDQTVIHTTCERTIAEWQADPENPNHDAV---KDVQGFQLADDNV 208

Query: 725 FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 784
                   + K+RPG+  F +R SKL+EMH+YTM  + YA  +AK++DP+   F  R++S
Sbjct: 209 SNVAANWYYVKMRPGLKDFFDRVSKLYEMHVYTMATRAYAQAVAKIIDPERKYFGDRILS 268

Query: 785 RGDDGDPFDGDERVPKSKDLEGVLGMESAV-VIIDDSVRVWPHNKLNLIVVERYTYFP 841
           R ++           K K L  +    +A+ VIIDD   VW ++  +L+ V  + +FP
Sbjct: 269 RDEN--------YTDKLKSLTRLFYQNTAMCVIIDDRADVWQYSP-HLVRVPVFNFFP 317



 Score = 42.7 bits (99), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 34/117 (29%), Positives = 59/117 (50%), Gaps = 13/117 (11%)

Query: 886 QSLDDV--DVRNILAAEQRKILAGCRIVFSRVFPVG-EANPHLHPLWQTAEQFGAVCTKH 942
           +++DDV  DV  I+   + ++L G  +VFS + P+G +       LW ++  FGA  T  
Sbjct: 503 RAVDDVIPDVAEIMPRIKSEVLDGAVVVFSGIIPLGVDVLSSDFALWISS--FGAEVTTS 560

Query: 943 IDDQVTHVVAN-SLGTDKVNWALSTGRF----VVHPGWVEASALLYRRANEQDFAIK 994
           ++ + THV+AN    T KV  A    R+    +V+P W+      +   +E  + I+
Sbjct: 561 VNRRTTHVIANPDRKTTKVKRA---ARYPHIKIVNPEWMFQCCTRWEHVDEMPYLIE 614


>gi|451853161|gb|EMD66455.1| hypothetical protein COCSADRAFT_112846 [Cochliobolus sativus
           ND90Pr]
          Length = 803

 Score = 91.3 bits (225), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 54/178 (30%), Positives = 96/178 (53%), Gaps = 21/178 (11%)

Query: 673 QKKMFSARKLCLVLDLDHTLLNSAKFHEV--------DPVHDEILRKKEEQDREKPHRHL 724
           ++++ +ARKL L++DLD T++++     +        +P HD +   K+ Q  +    ++
Sbjct: 152 KRRLLNARKLTLIVDLDQTVIHTTCERTIAEWQADPENPNHDAV---KDVQGFQLADDNV 208

Query: 725 FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 784
                   + K+RPG+  F +R SKL+EMH+YTM  + YA  +AK++DP+   F  R++S
Sbjct: 209 SNVAANWYYVKMRPGLKDFFDRVSKLYEMHVYTMATRAYAQAVAKIIDPERKYFGDRILS 268

Query: 785 RGDDGDPFDGDERVPKSKDLEGVLGMESAV-VIIDDSVRVWPHNKLNLIVVERYTYFP 841
           R ++           K K L  +    +A+ VIIDD   VW ++  +L+ V  + +FP
Sbjct: 269 RDEN--------YTDKLKSLTRLFYQNTAMCVIIDDRADVWQYSP-HLVRVPVFNFFP 317



 Score = 43.1 bits (100), Expect = 0.81,   Method: Compositional matrix adjust.
 Identities = 34/117 (29%), Positives = 59/117 (50%), Gaps = 13/117 (11%)

Query: 886 QSLDDV--DVRNILAAEQRKILAGCRIVFSRVFPVG-EANPHLHPLWQTAEQFGAVCTKH 942
           +++DDV  DV  I+   + ++L G  +VFS + P+G +       LW ++  FGA  T  
Sbjct: 503 RAMDDVIPDVAEIMPRIKSEVLDGAVVVFSGIIPLGVDVLSSDFALWISS--FGAEVTTS 560

Query: 943 IDDQVTHVVAN-SLGTDKVNWALSTGRF----VVHPGWVEASALLYRRANEQDFAIK 994
           ++ + THV+AN    T KV  A    R+    +V+P W+      +   +E  + I+
Sbjct: 561 VNRRTTHVIANPDRKTTKVKRA---ARYPHIKIVNPEWMFQCCTRWEHVDETPYLIE 614


>gi|67524889|ref|XP_660506.1| hypothetical protein AN2902.2 [Aspergillus nidulans FGSC A4]
 gi|40744297|gb|EAA63473.1| hypothetical protein AN2902.2 [Aspergillus nidulans FGSC A4]
 gi|259486161|tpe|CBF83781.1| TPA: CTD phosphatase-related (Eurofung) [Aspergillus nidulans FGSC
           A4]
          Length = 829

 Score = 91.3 bits (225), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 66/201 (32%), Positives = 105/201 (52%), Gaps = 34/201 (16%)

Query: 656 DQQKAAIQKERTRRLEE--QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKE 713
           D     + +    R+EE  ++++ + RKL LV+DLD T++++A    VDP   E +    
Sbjct: 132 DNTALTVSEREAIRVEEDAKRRLLANRKLSLVVDLDQTIIHAA----VDPTIGEWM---- 183

Query: 714 EQDREKPHR------HLFRF----PHM-GMW--TKLRPGIWTFLERASKLFEMHLYTMGN 760
             D++ P+         F+     P M G W   KLRPG+  FLE  ++++E+H+YTMG 
Sbjct: 184 -ADKDNPNHAAVSDVRAFQLVDDGPGMRGCWYYVKLRPGLEEFLENVAEMYELHIYTMGT 242

Query: 761 KLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDD 819
           + YA  +A ++DP   LF  R++SR + G            K+L  +  +++  VVIIDD
Sbjct: 243 RSYAQAIANIIDPDRKLFGDRILSRDESGS--------LSVKNLHRIFPVDTKMVVIIDD 294

Query: 820 SVRVWPHNKLNLIVVERYTYF 840
              VW  +  NLI V  Y +F
Sbjct: 295 RGDVWRWSP-NLIKVIPYDFF 314



 Score = 40.8 bits (94), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 27/103 (26%), Positives = 52/103 (50%), Gaps = 2/103 (1%)

Query: 892 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVV 951
           DV++IL A +R+I+    +V S ++ + E   ++  L Q A+ FG V    I  + TH++
Sbjct: 494 DVKDILPAIKRRIMGNVHVVHSGLWKLNEDIANIE-LAQHAKTFGVVFADRITHKTTHLL 552

Query: 952 ANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAI 993
           +    T K   A+   +  +V   W+  S L ++  +E  + +
Sbjct: 553 SAGKRTAKFQEAMQRPKIKIVRKEWLVDSLLQWKHLDEGPYLV 595


>gi|66824241|ref|XP_645475.1| hypothetical protein DDB_G0271690 [Dictyostelium discoideum AX4]
 gi|60473594|gb|EAL71535.1| hypothetical protein DDB_G0271690 [Dictyostelium discoideum AX4]
          Length = 782

 Score = 91.3 bits (225), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 56/168 (33%), Positives = 92/168 (54%), Gaps = 14/168 (8%)

Query: 674 KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMW 733
           K++   +KL LVLDLDHT++++      +   +    + +++++   H      P M   
Sbjct: 128 KRLLMEKKLSLVLDLDHTVIHAVTEQGFNSSPE---WRNKDKNKNGIHTITVNGP-MNYC 183

Query: 734 TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD-DGDPF 792
            K RP +  FL   +K++E+H+YTMG + YA E+AK++DP+  +F  R++SR D +G  F
Sbjct: 184 IKKRPHLVKFLTEVNKIYELHIYTMGTRNYANEIAKLIDPESSIFKERILSRDDGNGINF 243

Query: 793 DGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
              +R+    D        S V+I+DD   VW  +K NLI +  Y YF
Sbjct: 244 KSLQRLFPCDD--------SMVLIVDDRSDVWKKSK-NLIQISPYVYF 282



 Score = 77.8 bits (190), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 45/130 (34%), Positives = 74/130 (56%), Gaps = 2/130 (1%)

Query: 865 DGTLASSLGVIERLHKIFF-SHQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANP 923
           D  L   L  ++++H++++ S        V +++   ++ IL G  IVFS V+P+G    
Sbjct: 394 DCHLLIILDKLQKIHRLYYESKDKGGSPHVIDLVEIVKKDILKGTYIVFSGVYPLGTP-I 452

Query: 924 HLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLY 983
              PL   AE+FG+V    I+++ THV+A   GT KVN ALS G  V+ P W+  S  ++
Sbjct: 453 QKQPLRWLAEEFGSVVQNDINNETTHVIAQRKGTSKVNKALSKGLKVISPQWLIESTRIW 512

Query: 984 RRANEQDFAI 993
           + A+E DF++
Sbjct: 513 QMADEDDFSL 522


>gi|412985958|emb|CCO17158.1| RNA Polymerase II CTD phosphatase Fcp1 [Bathycoccus prasinos]
          Length = 490

 Score = 90.9 bits (224), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 95/368 (25%), Positives = 161/368 (43%), Gaps = 67/368 (18%)

Query: 681 KLCLVLDLDHTLLNSA---KF-----------HEVDPVHDEILRKKEEQDREKPHRHLFR 726
           KL LVLDLD TLL+S    KF            E   +  +  +K E +    P +  F 
Sbjct: 101 KLPLVLDLDSTLLHSVEKTKFLFPNPGESNTSEEEMKIIKQAQKKIESRLESSPDK--FF 158

Query: 727 FPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMA-KVLDPKGVLFAGRVISR 785
           + +   +TK+RP    FL   S+++E+++ T G++ YA  +A +VLDP G  F  R ++R
Sbjct: 159 YVNDQYFTKIRPQARRFLSELSEMYELYIVTAGSQAYAEAIANQVLDPLGKYF-NRDVNR 217

Query: 786 GDDGDPFDG------DERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTY 839
                 ++       D R     D   + G ES  ++++D   +W   +  ++ V+ Y Y
Sbjct: 218 IKGMKQWNSEVNQWVDVRTKIVND--ALEGAESVTIVVEDKPEMW-DGECAVMQVKPYYY 274

Query: 840 FPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSH------QSLDDVD- 892
           FP S  +  L     +  D  E+++   + + L  +  +H++ F        Q+    D 
Sbjct: 275 FPESLEELKLSHFYNM-TDESEKNDSYLVDNILPRLRNVHRMMFREAVPHFLQNGGSADG 333

Query: 893 -----VRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHI---- 943
                V  +L+ E++ IL  C I F+ VF + E  P    +W+ A+  GA   +      
Sbjct: 334 QYWPCVSELLSLERKSILRNCFICFTNVFKIHEV-PKDSQIWKEAKNLGATVQETFIDNA 392

Query: 944 -----------DDQVTHVVANSLG-----------TDKVNWALSTGRFVVHPGWVEASAL 981
                      DD+ THV+    G           T K+N A    R +V   ++  SA 
Sbjct: 393 DIAKRTESIVKDDRTTHVIVGEHGKRNKATGLVLPTSKINQARRNNRPLVDCSFIAESAY 452

Query: 982 LYRRANEQ 989
           L++ A+E+
Sbjct: 453 LWKPADEK 460


>gi|365991295|ref|XP_003672476.1| hypothetical protein NDAI_0K00420 [Naumovozyma dairenensis CBS 421]
 gi|343771252|emb|CCD27233.1| hypothetical protein NDAI_0K00420 [Naumovozyma dairenensis CBS 421]
          Length = 778

 Score = 90.5 bits (223), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 65/187 (34%), Positives = 95/187 (50%), Gaps = 32/187 (17%)

Query: 673 QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRK------------KEEQDREKP 720
           +K++   +KL LV+DLD T+++      VDP   E  R             KE    E+P
Sbjct: 203 KKRLRDDKKLILVVDLDQTVIHCG----VDPTIGEWKRDPTNPNFETLKDVKEFALEEEP 258

Query: 721 HRHLFRF----PHMGMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 774
              L       P    W   K+RPG+  F ++ + LFEMH+YTM  + YA+E+AK++DP 
Sbjct: 259 ILPLMYMGPKPPARKCWYYVKVRPGLKDFFQKVAPLFEMHIYTMATRAYASEIAKIIDPT 318

Query: 775 GVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIV 833
           G LF  R++SR ++G           +K LE +    +S V+IIDD   VW  +  NLI 
Sbjct: 319 GDLFGNRILSRDENGS--------LTTKSLERLFPTDQSMVIIIDDRGDVWNWSP-NLIK 369

Query: 834 VERYTYF 840
           V  Y +F
Sbjct: 370 VIPYNFF 376



 Score = 54.3 bits (129), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 37/147 (25%), Positives = 68/147 (46%), Gaps = 13/147 (8%)

Query: 859 HDER---SEDGTLASSLGVIERLHKIFF-----SHQSLDDV--DVRNILAAEQRKILAGC 908
           HD+R    +D  L      + ++H  ++     ++++ D +  D++ I+   ++ +L  C
Sbjct: 490 HDQRLLIDDDEELYHLKDTLSKIHSKYYEMLQKANENKDSIEPDIKYIMPQMKQTVLQDC 549

Query: 909 RIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALST-- 966
             VFS + P+G  N     +      FGA  T  ID+  THV+  + GT K   A S   
Sbjct: 550 HFVFSGLIPLG-TNIQKSDIVIWTNMFGARSTADIDENTTHVITKTPGTYKARLAKSYDP 608

Query: 967 GRFVVHPGWVEASALLYRRANEQDFAI 993
              +VHP WV    + ++   E+ + +
Sbjct: 609 DIKIVHPDWVFECLMRWKHVEEKPYEL 635


>gi|358057984|dbj|GAA96229.1| hypothetical protein E5Q_02893 [Mixia osmundae IAM 14324]
          Length = 760

 Score = 90.5 bits (223), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 63/204 (30%), Positives = 101/204 (49%), Gaps = 43/204 (21%)

Query: 656 DQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQ 715
           +Q+ A ++   T RL +      A+KL L++DLD T++ +     VDP   + +R     
Sbjct: 183 EQEAARLEDASTTRLRK------AKKLSLIVDLDQTIIQAT----VDPTVGDWMR----- 227

Query: 716 DREKPHRH------LFRFPHM----------GMWT--KLRPGIWTFLERASKLFEMHLYT 757
           D   P+        +F+              G W   KLRPG+  FL + + L+EMH+YT
Sbjct: 228 DGTNPNHSALKDVCVFKLGTQEDKEVVADVDGCWYYLKLRPGLQAFLRKMADLYEMHVYT 287

Query: 758 MGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAV-VI 816
           MG + YA  + +++DP G  F+ R++SR + G            K LE +   ++++ VI
Sbjct: 288 MGTRSYAMAVCRIIDPDGTYFSTRILSRDESGS--------LTRKSLERLFPCDTSMAVI 339

Query: 817 IDDSVRVWPHNKLNLIVVERYTYF 840
           IDD   VW H   NL+ VE + +F
Sbjct: 340 IDDRSDVW-HWSPNLVKVEPFEFF 362



 Score = 73.9 bits (180), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 40/116 (34%), Positives = 66/116 (56%), Gaps = 4/116 (3%)

Query: 877 RLHKIFFSHQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFG 936
           ++H  F++ QS  DV +  I+   +  +L GC+I FS + P+G  NP    +W+ A+ FG
Sbjct: 491 KVHTDFYAEQSSQDVKL--IIPNIKESVLRGCKIAFSSMIPLG-TNPEAADIWKLAKMFG 547

Query: 937 AVCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDF 991
           A C+  ++ + TH+VA + GT KV  A       VV   W+  SA L++R  E+++
Sbjct: 548 AYCSSDVNSKTTHLVARNPGTVKVQQAQKRQDLHVVWSNWLVRSAALWQRQPEREY 603


>gi|269860082|ref|XP_002649764.1| carboxy-terminal domain (CTD) phosphatase [Enterocytozoon bieneusi
           H348]
 gi|220066823|gb|EED44294.1| carboxy-terminal domain (CTD) phosphatase [Enterocytozoon bieneusi
           H348]
          Length = 409

 Score = 90.1 bits (222), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 47/155 (30%), Positives = 77/155 (49%), Gaps = 27/155 (17%)

Query: 670 LEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPH 729
           L +  +++  +KL L LDLD TL+++                      +KP    F+  +
Sbjct: 93  LHKFYELYHNKKLILFLDLDQTLIHATL-------------------SKKPCNFSFKLHN 133

Query: 730 MGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 789
           +  + K RPG+  FL + S+ FE H+YTMG + YA  + K+LDP  + F  R+++R ++ 
Sbjct: 134 IEFFIKKRPGLDKFLSKLSRFFEFHVYTMGTREYANYICKILDPNKIFFGDRIVTRTENN 193

Query: 790 DPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVW 824
             F         K LE +    + V+I+DD V VW
Sbjct: 194 KMF--------KKYLERITNFSNNVIILDDRVDVW 220


>gi|363752479|ref|XP_003646456.1| hypothetical protein Ecym_4610 [Eremothecium cymbalariae
           DBVPG#7215]
 gi|356890091|gb|AET39639.1| hypothetical protein Ecym_4610 [Eremothecium cymbalariae
           DBVPG#7215]
          Length = 751

 Score = 90.1 bits (222), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 68/187 (36%), Positives = 95/187 (50%), Gaps = 32/187 (17%)

Query: 673 QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDRE--KPHRH--LFRFP 728
           QK++  ARKL LV+DLD T+++      VDP   E  +  +  + E  K  R   L   P
Sbjct: 159 QKQLREARKLVLVVDLDQTVIHCG----VDPTIGEWSKDPDNPNYESLKDVRSFSLHEEP 214

Query: 729 -----HMG---------MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 774
                +MG          + KLRPG+  F    +  FE+H+YTM  + YA E+AK++DP 
Sbjct: 215 VLPPFYMGPKPPTRKCWYYVKLRPGLQDFFSNIAPHFELHIYTMATRTYALEIAKIIDPD 274

Query: 775 GVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIV 833
           G LF  R++SR ++G            K LE +  M +S VVIIDD   VW   + NLI 
Sbjct: 275 GTLFGDRILSRDENGS--------LTQKSLERLFPMDQSMVVIIDDRGDVWNWCE-NLIK 325

Query: 834 VERYTYF 840
           V  Y +F
Sbjct: 326 VVPYDFF 332



 Score = 51.6 bits (122), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 35/147 (23%), Positives = 67/147 (45%), Gaps = 13/147 (8%)

Query: 859 HDER---SEDGTLASSLGVIERLHKIFFSHQSLDDV-------DVRNILAAEQRKILAGC 908
           H+ER    +D  L     ++ R+H  ++     +         D+++++ + +  +  GC
Sbjct: 438 HNERLLIDDDDELPHLTDILLRVHSAYYDQLEKNKHGATELSPDIKHLIPSLKEIVFKGC 497

Query: 909 RIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGR 968
             VFS + P+   N     +      FGA  T +ID   TH++  + GT K   A S  R
Sbjct: 498 NFVFSGLIPL-HTNIDRADIVLWTNTFGAKTTINIDYNTTHLITKTPGTMKARLASSFNR 556

Query: 969 --FVVHPGWVEASALLYRRANEQDFAI 993
              +VHP W+    + ++R +E+ + +
Sbjct: 557 DIKIVHPDWIFECIVNWKRVDEKPYEL 583


>gi|256073745|ref|XP_002573189.1| rna polymerase II ctd phosphatase [Schistosoma mansoni]
 gi|360045501|emb|CCD83049.1| putative rna polymerase II ctd phosphatase [Schistosoma mansoni]
          Length = 1345

 Score = 90.1 bits (222), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 58/176 (32%), Positives = 99/176 (56%), Gaps = 27/176 (15%)

Query: 670 LEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPH 729
           L++Q+ + +ARKL L++DLD T++++      DP           Q  +  + H +R P 
Sbjct: 139 LQDQQSLLAARKLVLLVDLDQTIIHTTN----DP-----------QAFKYKNVHRYRLPG 183

Query: 730 --MGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD 787
             +   T+LRP +   L+  S+ ++MH+ T GN++YA ++A ++DPK   F+ R++SR +
Sbjct: 184 SPLVYHTRLRPHLEKVLDCLSQYYQMHICTFGNRVYAHQLASMIDPKRRYFSQRILSRDE 243

Query: 788 DGDPFDGDERVPKSKDLEGVL--GMESAVVIIDDSVRVWPHNKLNLIVVERYTYFP 841
             +P      V KS +L+ +   G+ + V IIDD   VW  +  NLI V+ Y +FP
Sbjct: 244 CFNP------VTKSANLKALFPRGL-NLVCIIDDRGEVWDWSS-NLIHVKPYRFFP 291


>gi|452981165|gb|EME80925.1| hypothetical protein MYCFIDRAFT_115122, partial [Pseudocercospora
           fijiensis CIRAD86]
          Length = 770

 Score = 89.7 bits (221), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 57/181 (31%), Positives = 98/181 (54%), Gaps = 29/181 (16%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLF----R 726
           E ++++  +R+L LV+DLD T+++++    V+P   E      + D   P+        +
Sbjct: 151 EGKRRLLQSRRLSLVVDLDQTIIHAS----VEPTIAEW-----QNDPSNPNYEALQDVQK 201

Query: 727 F------PHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAG 780
           F      P+   + K RPG+  FL   S+++EMH+YTMG + YA  +AK++DP+  +F  
Sbjct: 202 FQLDDDKPNTWYYIKPRPGLKQFLSTLSEIYEMHIYTMGTRAYAESVAKIIDPEKKIFGD 261

Query: 781 RVISRGDDGDPFDGDERVPKSKDLEGVLGMES-AVVIIDDSVRVWPHNKLNLIVVERYTY 839
           R++SR + G           +K+L+ +  +++  VVIIDD   VW H   NLI V  + +
Sbjct: 262 RILSRNESGS--------MTAKNLKRLFPVDTRMVVIIDDRADVW-HWTSNLIKVNVFEF 312

Query: 840 F 840
           F
Sbjct: 313 F 313



 Score = 49.3 bits (116), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 36/115 (31%), Positives = 61/115 (53%), Gaps = 8/115 (6%)

Query: 886 QSLDDV----DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTK 941
           +S+DD+    D   I+   + K+L+G  +VFS V P+G  + H H     A+ FGA  ++
Sbjct: 506 RSVDDLENIPDAAAIMDRMKAKVLSGVHLVFSGVVPLG-VDIHSHDTAVWAKSFGATVSE 564

Query: 942 HIDDQVTHVVAN-SLGTDKVNWAL-STGRF-VVHPGWVEASALLYRRANEQDFAI 993
           +I  + THV+A+    T KV  A   +GR  +V   W+ A    +++ +E  + I
Sbjct: 565 NITKKTTHVIASPERRTAKVRQAAKKSGRIAIVSQHWLHACFAQWKKVDENMYRI 619


>gi|328713585|ref|XP_001947680.2| PREDICTED: RNA polymerase II subunit A C-terminal domain
           phosphatase-like [Acyrthosiphon pisum]
          Length = 736

 Score = 89.4 bits (220), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 57/170 (33%), Positives = 94/170 (55%), Gaps = 20/170 (11%)

Query: 672 EQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMG 731
           ++K++   +KL L++DLD TL+++      D + + I        ++  H  L+      
Sbjct: 134 DEKRLLGDKKLVLLVDLDQTLIHTTN----DNIPNNI--------KDIHHFQLYGPNSPW 181

Query: 732 MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 791
             T+LRPG + FL   S+L+E+H+ T G + YA  +  +LDPKG LF+ RV+SR +  +P
Sbjct: 182 YHTRLRPGTYNFLSSISELYELHICTFGARNYAHTITHILDPKGKLFSHRVLSRDECFNP 241

Query: 792 FDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
                   K+ +L+G+    ++ V IIDD   VW +  LNLI V+ Y +F
Sbjct: 242 N------SKTGNLKGLFPCGDNMVCIIDDREDVWDY-ALNLIHVKPYHFF 284



 Score = 40.4 bits (93), Expect = 5.4,   Method: Compositional matrix adjust.
 Identities = 38/145 (26%), Positives = 65/145 (44%), Gaps = 19/145 (13%)

Query: 864 EDGTLASSLGVIERLHKIFFSH---------QSLDDVDVRNILAAEQRKILAGCRIVFSR 914
           ED  L     +++ LHK +F            +++  D++ I+   + KILAG ++VFS 
Sbjct: 373 EDDYLLYLEDILKNLHKEYFKEYDQNYELYGNNVEVPDMKRIIPTYRSKILAGKKLVFSG 432

Query: 915 V----FPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRF- 969
           +     P+ E+       ++ A   GA  T++I    TH+VA   GT K + A       
Sbjct: 433 LVPTPVPLTESRA-----YKVARLLGAEVTENIKPDSTHLVAVRQGTLKASAARKRPDIK 487

Query: 970 VVHPGWVEASALLYRRANEQDFAIK 994
            V P W+   A  +    E+ F ++
Sbjct: 488 TVTPEWLWLCAERWEHVEERLFPLR 512


>gi|403222664|dbj|BAM40795.1| uncharacterized protein TOT_030000057 [Theileria orientalis strain
           Shintoku]
          Length = 656

 Score = 89.0 bits (219), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 59/178 (33%), Positives = 87/178 (48%), Gaps = 27/178 (15%)

Query: 672 EQKKMFSARKLCLVLDLDHTLLNSAK--------FHEVDPVHDEILRKKEEQDREKPHRH 723
           E  K    RKLCLVLDLD+TL+++             ++     +L+     + E  + +
Sbjct: 188 EITKYLEDRKLCLVLDLDNTLVHATSQSPPADIDVETIEISSSSVLKTIVYNETETSYCN 247

Query: 724 LFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVI 783
            F         KLRPGI+ F    SK +++ L+TMG + +A    ++LDP+GV F  RV 
Sbjct: 248 SF--------FKLRPGIFKFFRSVSKRYKLFLFTMGTRQHAQSALRILDPQGVYFGNRVF 299

Query: 784 SRGDDGDPFDGDERV-PKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
            R D        +R+ P  K+L         V+++DDS  VW  +KL LI V  Y YF
Sbjct: 300 CRNDSRSCMKSLDRLFPNHKNL---------VLVMDDSEYVWT-SKLALIKVHPYYYF 347


>gi|452840538|gb|EME42476.1| hypothetical protein DOTSEDRAFT_73343 [Dothistroma septosporum
           NZE10]
          Length = 855

 Score = 89.0 bits (219), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 58/184 (31%), Positives = 98/184 (53%), Gaps = 32/184 (17%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPH----RHLFR 726
           E ++++  AR+L LV+DLD T++++     V+P   E      + D   P+    + + +
Sbjct: 152 EAKRRLLEARRLSLVVDLDQTVIHAC----VEPTIGEW-----QSDPTNPNHEAVKDVCK 202

Query: 727 F---------PHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 777
           F         P    + KLRPG+  FL   S+ +EMH+YTMG + YA  +AK++DP   +
Sbjct: 203 FQLADDAPGRPGTWYYIKLRPGLKEFLTTMSQYYEMHIYTMGTRAYAENIAKIIDPDRSV 262

Query: 778 FAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVER 836
           F  R++SR + G          ++K+L+ +  +++  VVIIDD   VW     NLI V+ 
Sbjct: 263 FGDRILSRDESGS--------MQAKNLKRLFPVDTKMVVIIDDRADVWSWIS-NLIKVKV 313

Query: 837 YTYF 840
           + +F
Sbjct: 314 FEFF 317



 Score = 51.2 bits (121), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 34/106 (32%), Positives = 55/106 (51%), Gaps = 6/106 (5%)

Query: 892 DVRNILAAEQRKILAGCRIVFSRVFPVG-EANPHLHPLWQTAEQFGAVCTKHIDDQVTHV 950
           D   I++  +RK+LAG  +VFS V P+G +   H   +W  A  FGA  +++I  + TH+
Sbjct: 535 DAAAIMSDMKRKVLAGVHLVFSGVVPLGVDIQSHDMAVW--ARSFGAKVSENIGKKTTHI 592

Query: 951 VAN-SLGTDKVNWALSTGR--FVVHPGWVEASALLYRRANEQDFAI 993
           +A+    T KV  A   G    +V  GW+      ++R +E  + I
Sbjct: 593 IASPERRTAKVRQAAKKGNRIAIVGQGWLFECFSQWKRLDENPYRI 638


>gi|358253094|dbj|GAA51983.1| RNA polymerase II subunit A C-terminal domain phosphatase
           [Clonorchis sinensis]
          Length = 1535

 Score = 89.0 bits (219), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 62/191 (32%), Positives = 100/191 (52%), Gaps = 31/191 (16%)

Query: 670 LEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPH 729
           L++++ + +ARKL L++DLD T+L++      DP   +  R K         R+      
Sbjct: 175 LQDEQSLLAARKLVLLVDLDETVLHTTN----DP---QAYRYKNVS------RYCLPGSP 221

Query: 730 MGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 789
           +   T  RP +   L+R SK ++MH+ T GN++YA ++A ++DPK   F+ R++SR +  
Sbjct: 222 LVYHTSFRPHLKAVLDRLSKYYQMHICTFGNRMYAHQLAGMIDPKRRYFSHRILSRDECF 281

Query: 790 DPFDGDERVPKSKDLEGVL--GMESAVVIIDDSVRVW---PHNKLNLIVVERYTYF--PC 842
           +P      V KS +L+ +   G+ + V IIDD   VW   PH    LI V+ Y +F   C
Sbjct: 282 NP------VTKSANLKALFPRGL-NLVCIIDDRGEVWEWSPH----LIQVKPYRFFQDAC 330

Query: 843 SRRQFGLLGPS 853
             + F    PS
Sbjct: 331 DTKHFAWSSPS 341


>gi|255540897|ref|XP_002511513.1| conserved hypothetical protein [Ricinus communis]
 gi|223550628|gb|EEF52115.1| conserved hypothetical protein [Ricinus communis]
          Length = 161

 Score = 89.0 bits (219), Expect = 1e-14,   Method: Composition-based stats.
 Identities = 59/170 (34%), Positives = 91/170 (53%), Gaps = 18/170 (10%)

Query: 750 LFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLG 809
           +FEM++YT  +++ A +M   LDP    F  R+I R       +G   V   K+ + VLG
Sbjct: 1   MFEMYVYTSSSQVNARKMMSFLDPANRYFNSRLIVR-------EGST-VMALKNPDVVLG 52

Query: 810 MESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLA 869
            E AVVI+DD    WP +K N+I VE+Y YF  ++   G    SL E   DE +    +A
Sbjct: 53  HERAVVILDDRKSAWPMHKANVINVEKYNYFASNQSDPGSKSKSLAERKKDEHTR--VMA 110

Query: 870 SSLGVIERLHKIFFSHQSLDDV-------DVRNILAAEQRKILAGCRIVF 912
           + L ++ ++H+ FF  + L+ +       DVR ++   + KIL GC+I F
Sbjct: 111 AYLRILRKIHRQFFDPK-LEAIVTAGAARDVREVMRMVRAKILKGCKICF 159


>gi|410076480|ref|XP_003955822.1| hypothetical protein KAFR_0B03910 [Kazachstania africana CBS 2517]
 gi|372462405|emb|CCF56687.1| hypothetical protein KAFR_0B03910 [Kazachstania africana CBS 2517]
          Length = 724

 Score = 88.6 bits (218), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 61/186 (32%), Positives = 98/186 (52%), Gaps = 30/186 (16%)

Query: 673 QKKMFSARKLCLVLDLDHTLLNSA--------KFHEVDPVHDEILRKKEEQDREKPHRHL 724
           +K++ + +KL LV+DLD T+++          K    +P +D +   +     E+P   +
Sbjct: 161 KKRLRNEKKLVLVVDLDQTVIHCGVDPTIGEWKSDPNNPNYDTLKDVQMFALEEEP---V 217

Query: 725 FRFPHMG---------MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKG 775
             F +MG          + K+RPG+  F ++ + LFEMH+YTM  + YA E+ K++DP G
Sbjct: 218 LPFMYMGPKPTPRKCWYYVKVRPGLKEFFKKVAPLFEMHIYTMATRAYALEITKIIDPTG 277

Query: 776 VLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVV 834
            LF  R++SR ++G           SK LE +    +S V+IIDD   VW  +  NLI V
Sbjct: 278 ELFGNRILSRDENGS--------LTSKSLERLFPTDQSMVIIIDDRGDVWNWSP-NLIKV 328

Query: 835 ERYTYF 840
             Y++F
Sbjct: 329 VPYSFF 334



 Score = 48.1 bits (113), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 35/145 (24%), Positives = 70/145 (48%), Gaps = 11/145 (7%)

Query: 859 HDER---SEDGTLASSLGVIERLHKIFFS---HQSLDD--VDVRNILAAEQRKILAGCRI 910
           H++R    +D  L      + ++H  ++S   ++SLD+   D++ I+   ++ +  GC  
Sbjct: 441 HNQRLLIDDDDELYFLKDTLAKVHNEYYSILENKSLDESGADIKLIMPKLKQSVFEGCHF 500

Query: 911 VFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWA--LSTGR 968
           VFS + P+         +  T+  FGA  +  I++  THV+  + GT K   A   ++  
Sbjct: 501 VFSGLIPLHTDIRRADIVVWTS-MFGAESSSEINEVTTHVITKTPGTFKARLAKSFNSDI 559

Query: 969 FVVHPGWVEASALLYRRANEQDFAI 993
            +VHP WV    + +   +E+ + +
Sbjct: 560 KIVHPDWVFECLIRWTHVDEKPYTL 584


>gi|367004465|ref|XP_003686965.1| hypothetical protein TPHA_0I00240 [Tetrapisispora phaffii CBS 4417]
 gi|357525268|emb|CCE64531.1| hypothetical protein TPHA_0I00240 [Tetrapisispora phaffii CBS 4417]
          Length = 732

 Score = 88.6 bits (218), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 62/192 (32%), Positives = 93/192 (48%), Gaps = 42/192 (21%)

Query: 673 QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFR------ 726
           + ++  ++KL LV+DLD T+++      VDP   E      + D   P+    R      
Sbjct: 159 KSRLIRSKKLILVVDLDQTVIHCG----VDPTISEW-----KNDPSNPNYETLRNVKSFV 209

Query: 727 ------FPHMGM-----------WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAK 769
                  P M M           + K+RPG+  F E+ + ++EMH+YTM  + YA E+AK
Sbjct: 210 LEEEAILPPMYMGPKPPVHKCSYYVKVRPGLKEFFEKVAPIYEMHIYTMATRAYAEEIAK 269

Query: 770 VLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNK 828
           ++DP G LF  R++SR ++G            K LE +    +S VVIIDD   VW  + 
Sbjct: 270 IIDPDGSLFGNRILSRDENGS--------LTHKSLERLFPTDQSMVVIIDDRGDVWNWSP 321

Query: 829 LNLIVVERYTYF 840
            NLI V  Y +F
Sbjct: 322 -NLIKVTPYNFF 332



 Score = 54.3 bits (129), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 38/141 (26%), Positives = 63/141 (44%), Gaps = 10/141 (7%)

Query: 862 RSEDGTLASSLGVIERLHKIFFSHQSL-----DDVDVRNILAAEQRKILAGCRIVFSRVF 916
           R +D  L+    ++  +   F+    L     DD  ++ ++   +R +   C  VFS + 
Sbjct: 448 RDDDDELSYLKEILATVQSEFYKALELKNGNSDDTSIQELMPKLKRSVFKDCNFVFSGLI 507

Query: 917 PVG-EANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTG--RFVVHP 973
           P+G +       LW   + FGA  T +I +  THV+  + GT K   A S      VVHP
Sbjct: 508 PLGTDIQKADIVLW--TKMFGASSTSNITENTTHVITKTPGTLKARIAKSFNPSIKVVHP 565

Query: 974 GWVEASALLYRRANEQDFAIK 994
            WV      ++  +E+ F +K
Sbjct: 566 DWVFECLSTWKHVDEKSFELK 586


>gi|198438317|ref|XP_002131972.1| PREDICTED: similar to MGC81710 protein [Ciona intestinalis]
          Length = 895

 Score = 88.6 bits (218), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 55/176 (31%), Positives = 94/176 (53%), Gaps = 25/176 (14%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP-H 729
           +++ ++    KL L++DLD TL+++ +      +  E              +  F F  H
Sbjct: 134 QDKSRLHKLNKLVLLVDLDQTLIHTTQNQAFAAMCSE-------------EKDFFTFQLH 180

Query: 730 MG---MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG 786
                ++TKLRP    FL+  SK +E+ + T G++LYA ++A+ +DPK   FA R++SR 
Sbjct: 181 KNEPTLYTKLRPYCREFLQEISKCYELQVVTFGSRLYAHKIAEFIDPKKKFFANRILSRD 240

Query: 787 DDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYFP 841
           +  +P      + KS +L  +    +S V IIDD   VW  +  NL++V++Y+YFP
Sbjct: 241 ECINP------MKKSGNLRHLFPCGDSMVCIIDDRDDVWS-SAPNLVMVKKYSYFP 289



 Score = 57.4 bits (137), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 32/104 (30%), Positives = 52/104 (50%), Gaps = 4/104 (3%)

Query: 892 DVRNILAAEQRKILAGCRIVFSRVFPVG-EANPHLHPLWQTAEQFGAVCTKHIDDQVTHV 950
           D+R ++   + K+L GC IV + + P   +A PH+H     A Q GA     +D+  TH+
Sbjct: 573 DLRGVVPFLRSKVLYGCCIVLTGIIPNNFKAAPHMHRAHIVARQLGAAINSTVDENTTHL 632

Query: 951 VANSLGTDKVNWALSTGRFVVH---PGWVEASALLYRRANEQDF 991
           +    GT K   AL  G+  +H     W+ A +  + R +E+ F
Sbjct: 633 IGAKKGTAKYQDALKMGKKKIHMVSIEWLWACSDRWERVSEKLF 676


>gi|6689545|emb|CAB65510.1| FCP1 serine phosphatase [Xenopus laevis]
          Length = 867

 Score = 88.2 bits (217), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 60/182 (32%), Positives = 96/182 (52%), Gaps = 22/182 (12%)

Query: 662 IQKERTRRL--EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREK 719
           +  E+  +L  E+Q ++   +KL L++DLD TL+++ + H        I           
Sbjct: 42  VSSEKAEQLGREDQFRLHRNQKLVLMVDLDQTLIHTTEQHCQHMSRKGIF---------- 91

Query: 720 PHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA 779
            H  L R   M + T+LRP    FLE+ +KL+E+H++T G++LYA  +A  LDP+  LF+
Sbjct: 92  -HFQLGRGEPM-LHTRLRPHCKEFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFS 149

Query: 780 GRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYT 838
            R++SR +  DP+       K+ +L  +    +S V IIDD   VW     NLI V++  
Sbjct: 150 HRILSRDECIDPYS------KTGNLRNLFPCGDSMVCIIDDREDVWKFAP-NLITVKKMC 202

Query: 839 YF 840
            F
Sbjct: 203 IF 204



 Score = 50.8 bits (120), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 39/139 (28%), Positives = 67/139 (48%), Gaps = 23/139 (16%)

Query: 874 VIERLHKIFFSH------QSLDDV-DVRNILAAEQRKILAGCRIVFSRV----FPVGEAN 922
           ++ R+H  +++       + +DDV D+RNI+   + K+L    I FS +    FP+    
Sbjct: 486 ILVRVHTDYYAKYDRILKKEVDDVPDIRNIVPELKSKVLENVIITFSGLHPTNFPIERTR 545

Query: 923 PHLHPLWQTAEQFGAVCTKHI------DDQVTHVVANSLGTDKVNWALSTGRF-VVHPGW 975
              H     A   GA  +K++       D+VTH++A   GT+KV  A +     VV+P W
Sbjct: 546 EFYH-----ARALGANISKNLILKPDDSDRVTHLIAARSGTEKVRQAQNCKHLHVVNPEW 600

Query: 976 VEASALLYRRANEQDFAIK 994
           + +    + +  EQ F +K
Sbjct: 601 LWSCLERWEKVEEQLFPLK 619


>gi|353236741|emb|CCA68729.1| related to FCP1-TFIIF interacting component of CTD phosphatase
           [Piriformospora indica DSM 11827]
          Length = 782

 Score = 88.2 bits (217), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 66/208 (31%), Positives = 97/208 (46%), Gaps = 45/208 (21%)

Query: 660 AAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREK 719
           A I++E T RL         RKL L++DLD T+L++      DP   E ++ K+  ++ +
Sbjct: 141 AKIERENTDRL------LKNRKLSLIVDLDQTILHAT----FDPTVGEWIKAKDAFEKRR 190

Query: 720 PHRHL-------------------FRFP----HMG----MWTKLRPGIWTFLERASKLFE 752
                                   F+ P    HMG     + K RPG+  F+   S+L+E
Sbjct: 191 STTPPDHDPPPESVNWPALEDVISFQLPSDHGHMGHSERYYVKPRPGLQRFMNNLSELYE 250

Query: 753 MHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMES 812
           MH+YTMG + YA  +   LDP G  F  RV+SR +      G +RV   K L      +S
Sbjct: 251 MHVYTMGVRSYANAICAALDPSGAWFGSRVLSRNE-----SGSDRVKNLKRL--FPSDQS 303

Query: 813 AVVIIDDSVRVWPHNKLNLIVVERYTYF 840
            VV+IDD   VW  +  NL+ V  + +F
Sbjct: 304 MVVVIDDRADVWNWSP-NLVRVIPFEFF 330



 Score = 74.3 bits (181), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 38/104 (36%), Positives = 59/104 (56%), Gaps = 2/104 (1%)

Query: 892 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVV 951
           DV  ++   + K LAG  +VFS + P+ +  P   P+W+ A +FGA C   I+ QVTH+V
Sbjct: 462 DVLGVIPGIKAKTLAGVHLVFSGILPL-DGRPERQPIWKAALEFGATCHVDINPQVTHLV 520

Query: 952 ANSLGTDKVNWALSTGR-FVVHPGWVEASALLYRRANEQDFAIK 994
            N LGT K + A + G  FVV+  W   S + + R  E ++ ++
Sbjct: 521 TNKLGTVKADKAFAQGNIFVVNIKWFNDSLIKWERQPEANYLME 564


>gi|195489702|ref|XP_002092848.1| GE11441 [Drosophila yakuba]
 gi|194178949|gb|EDW92560.1| GE11441 [Drosophila yakuba]
          Length = 879

 Score = 88.2 bits (217), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 67/213 (31%), Positives = 110/213 (51%), Gaps = 32/213 (15%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 730
           ++ +++ + RKL L++DLD T++++      D V D I        +   H  L+  PH 
Sbjct: 196 DDTRRLLADRKLVLLVDLDQTVIHTTN----DTVPDNI--------KGIYHFQLYG-PHS 242

Query: 731 GMW-TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 789
             + T+LRPG   FLER S+L+E+H+ T G + YA  +A++LDP+G  F+ R++SR    
Sbjct: 243 PWYHTRLRPGTAEFLERMSQLYELHICTFGARNYAHMIAQLLDPEGKFFSHRILSR---D 299

Query: 790 DPFDGDERVPKSKDLEGVL-GMESAVVIIDDSVRVWPHNKLNLIVVERYTYFP------- 841
           + F+      K+ +L+ +    +S V IIDD   VW +   NLI V+ Y +F        
Sbjct: 300 ECFNA---TSKTDNLKALFPNGDSMVCIIDDREDVW-NMASNLIQVKPYHFFQHTGDINA 355

Query: 842 ---CSRRQFGLLGPSLLEIDHDERSEDGTLASS 871
               S+ +    G    EI   +  +D T +SS
Sbjct: 356 PPGLSKHELDGEGVDFKEITEKQEDKDKTESSS 388



 Score = 40.8 bits (94), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 32/121 (26%), Positives = 59/121 (48%), Gaps = 4/121 (3%)

Query: 874 VIERLHKIFFS--HQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQT 931
           ++  +HK F+S   ++ +  D++ I+   + ++L G  +VFS + P  +        +  
Sbjct: 557 ILRNIHKRFYSIYDETTEIPDLKVIVPKIRCEVLRGKNLVFSGLVPT-QMKLEQSRAYFI 615

Query: 932 AEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQD 990
           A+  GA    +I  ++TH+VA + GT KVN A       VV+  W+   A  +    E+ 
Sbjct: 616 AKSLGAEVKPNIGKEITHLVAVNAGTYKVNAAKKEPAIKVVNANWLWTCAERWEHVEEKL 675

Query: 991 F 991
           F
Sbjct: 676 F 676


>gi|320581076|gb|EFW95298.1| RNA Pol II CTD phosphatase component, putative [Ogataea
           parapolymorpha DL-1]
          Length = 743

 Score = 87.8 bits (216), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 78/249 (31%), Positives = 117/249 (46%), Gaps = 54/249 (21%)

Query: 646 DVEHL-FEGYDDQQKAAIQ--------KERTRRLEE-----QKKMFSARKLCLVLDLDHT 691
           +VE L + G++D+ +A I         K  T+  E       +++    KL LV+DLD T
Sbjct: 122 NVEDLDYTGFNDKDRAPISMSHGTTNLKVSTKEAENIERSSTQRLLKEEKLSLVVDLDQT 181

Query: 692 LLNSAKFHEVDPVHDEILRK----------------KEEQDREKPHRHLFRFPHMGMW-- 733
           ++++     VDP   E +                   EE+    P+    + P    W  
Sbjct: 182 VIHAT----VDPTVGEWMSDPTNPNYESIKDVRSFCLEEEPILPPNYKGPKPPSHKRWYY 237

Query: 734 TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFD 793
            KLRPG+  FLE+ SKL+E+H+YTM  + YA  +AK++DP G+ F  R++SR + G    
Sbjct: 238 VKLRPGLQEFLEKVSKLYELHIYTMATRSYAKSIAKIIDPDGIYFGDRILSRDESGS--- 294

Query: 794 GDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYTYF--------PCSR 844
                   K L+ +  ++ S VV+IDD   VW  +  NLI V  Y +F            
Sbjct: 295 -----LTQKTLKRLFPVDTSMVVVIDDRGDVWNWSP-NLIKVVPYDFFVGIGDINSSFLP 348

Query: 845 RQFGLLGPS 853
           RQ  LLGPS
Sbjct: 349 RQSTLLGPS 357



 Score = 52.0 bits (123), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 41/146 (28%), Positives = 73/146 (50%), Gaps = 13/146 (8%)

Query: 858 DHDERSEDGTLASSLG-VIERLHKIFF-----SHQSLDDVDVRNILAAEQRKILAGCRIV 911
           +H+   +D     +LG  + R+H  F+     S ++L DV  ++I+ + ++ +  G   +
Sbjct: 496 EHNLLCDDDNELETLGQALVRIHNEFYFEVEKSSKTLPDV--KDIMTSMKQLVFQGYVFL 553

Query: 912 FSRVFPVGE-ANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALST--GR 968
            S V P+G         +W  A+ FGA    + +  VTHV+  + GT KV  A S     
Sbjct: 554 LSGVLPLGTPLETADIVIW--AKSFGASFVANYNRSVTHVICKNPGTFKVRLAKSIDPNV 611

Query: 969 FVVHPGWVEASALLYRRANEQDFAIK 994
            VV+P W+ +   L+ +  E+D+A+K
Sbjct: 612 HVVNPDWLFSCIALWEKVPEEDYAMK 637


>gi|396499223|ref|XP_003845421.1| similar to RNA polymerase II subunit A C-terminal domain
           phosphatase [Leptosphaeria maculans JN3]
 gi|312222002|emb|CBY01942.1| similar to RNA polymerase II subunit A C-terminal domain
           phosphatase [Leptosphaeria maculans JN3]
          Length = 887

 Score = 87.8 bits (216), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 56/188 (29%), Positives = 95/188 (50%), Gaps = 21/188 (11%)

Query: 663 QKERTRRLEEQKK-MFSARKLCLVLDLDHTLLNSAKFHEV--------DPVHDEILRKKE 713
           QKE     E+ KK +  A+KL L++DLD T++++     +        +P H  +   K+
Sbjct: 224 QKEANAAEEDAKKRLLGAKKLTLIVDLDQTVIHTTCERTIAEWQADPENPNHGAV---KD 280

Query: 714 EQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDP 773
            +  +    ++        + K RPG+  F +R SKL+EMH+YTM  + YA  + K++DP
Sbjct: 281 VEGFQLADDNVSNVAANWYYVKKRPGLEDFFKRMSKLYEMHVYTMATRAYAQAVCKIIDP 340

Query: 774 KGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIV 833
               F  R++SR ++           K+K L  +    + VVIIDD   VW ++  +L+ 
Sbjct: 341 DRRYFGDRILSRDEN--------YTDKTKSLSRLFQNTTMVVIIDDRADVWQYSP-HLVR 391

Query: 834 VERYTYFP 841
           V  + +FP
Sbjct: 392 VPVFNFFP 399


>gi|194886507|ref|XP_001976627.1| GG19916 [Drosophila erecta]
 gi|190659814|gb|EDV57027.1| GG19916 [Drosophila erecta]
          Length = 876

 Score = 87.4 bits (215), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 59/172 (34%), Positives = 97/172 (56%), Gaps = 22/172 (12%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 730
           ++ +++ + RKL L++DLD T++++      D V D I        +   H  L+  PH 
Sbjct: 193 DDTRRLLADRKLVLLVDLDQTVIHTTN----DTVPDNI--------KGIYHFQLYG-PHS 239

Query: 731 GMW-TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 789
             + T+LRPG   FLER S+L+E+H+ T G + YA  +A++LDP+G  F+ R++SR    
Sbjct: 240 PWYHTRLRPGTAEFLERMSQLYELHICTFGARNYAHMIAQLLDPEGKFFSHRILSR---D 296

Query: 790 DPFDGDERVPKSKDLEGVL-GMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           + F+      K+ +L+ +    +S V IIDD   VW +   NLI V+ Y +F
Sbjct: 297 ECFNA---TSKTDNLKALFPNGDSMVCIIDDREDVW-NMASNLIQVKPYHFF 344



 Score = 41.2 bits (95), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 32/121 (26%), Positives = 59/121 (48%), Gaps = 4/121 (3%)

Query: 874 VIERLHKIFFS--HQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQT 931
           ++  +HK F+S   ++ +  D++ I+   + ++L G  +VFS + P  +        +  
Sbjct: 554 ILRNIHKRFYSIYDETTEIPDLKVIVPKIRCEVLRGKNLVFSGLVPT-QMKLEQSRAYFI 612

Query: 932 AEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQD 990
           A+  GA    +I  ++TH+VA + GT KVN A       VV+  W+   A  +    E+ 
Sbjct: 613 AKSLGAEVKPNIGKEITHLVAVNAGTYKVNAAKKEPAIKVVNANWLWTCAERWEHVEEKL 672

Query: 991 F 991
           F
Sbjct: 673 F 673


>gi|164658688|ref|XP_001730469.1| hypothetical protein MGL_2265 [Malassezia globosa CBS 7966]
 gi|159104365|gb|EDP43255.1| hypothetical protein MGL_2265 [Malassezia globosa CBS 7966]
          Length = 364

 Score = 87.4 bits (215), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 63/212 (29%), Positives = 96/212 (45%), Gaps = 47/212 (22%)

Query: 662 IQKERTRRL--EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPV-----HD-------- 706
           + +E   R+  E+ + +   RKL L++DLD T+++      VDP      HD        
Sbjct: 22  VSREEAIRMDSEDTRHLIEQRKLALIVDLDQTIIHVT----VDPTVKEWAHDPKNPNWCM 77

Query: 707 --------------EILRKKEEQDREKPHRHLFRFPHMGMW--TKLRPGIWTFLERASKL 750
                          +  + E  D+             G W   KLRPG+  FL+  S +
Sbjct: 78  LKDVVAFQLGSDGKTVSHQPERMDQHDVKSFATDGDENGCWYYVKLRPGLQAFLQSVSPM 137

Query: 751 FEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDG--DERVPKSKDLEGVL 808
           +EMH+YTMG + YA  + +++DP G LF  R++SR ++G+          P S D+    
Sbjct: 138 YEMHVYTMGTRSYADCICRIVDPDGHLFGARILSRDENGNEVQKSLSRLFPISTDM---- 193

Query: 809 GMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
                VV+IDD   VW  +  NLI VE Y +F
Sbjct: 194 -----VVVIDDRADVWSWSP-NLIKVEPYEFF 219


>gi|24762673|ref|NP_611934.1| Fcp1 [Drosophila melanogaster]
 gi|7291810|gb|AAF47230.1| Fcp1 [Drosophila melanogaster]
          Length = 880

 Score = 87.4 bits (215), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 59/172 (34%), Positives = 97/172 (56%), Gaps = 22/172 (12%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 730
           ++ +++ + RKL L++DLD T++++      D V D I        +   H  L+  PH 
Sbjct: 197 DDTRRLLADRKLVLLVDLDQTVIHTTN----DTVPDNI--------KGIYHFQLYG-PHS 243

Query: 731 GMW-TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 789
             + T+LRPG   FLER S+L+E+H+ T G + YA  +A++LDP+G  F+ R++SR    
Sbjct: 244 PWYHTRLRPGTAEFLERMSQLYELHICTFGARNYAHMIAQLLDPEGKFFSHRILSR---D 300

Query: 790 DPFDGDERVPKSKDLEGVL-GMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           + F+      K+ +L+ +    +S V IIDD   VW +   NLI V+ Y +F
Sbjct: 301 ECFNA---TSKTDNLKALFPNGDSMVCIIDDREDVW-NMASNLIQVKPYHFF 348



 Score = 44.7 bits (104), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 33/121 (27%), Positives = 60/121 (49%), Gaps = 4/121 (3%)

Query: 874 VIERLHKIFFS--HQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQT 931
           ++  +HK F+S   ++ +  D++ I+   + ++L G  +VFS + P  +        +  
Sbjct: 558 ILRNIHKRFYSIYDETTEIPDLKVIVPKIRSEVLRGKNLVFSGLVPT-QMKLEQSRAYFI 616

Query: 932 AEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQD 990
           A+  GA    +ID ++TH+VA + GT KVN A       VV+  W+   A  +    E+ 
Sbjct: 617 AKSLGAEVKPNIDKEITHLVAVNAGTYKVNAAKKEPAIKVVNANWLWTCAERWEHVEEKL 676

Query: 991 F 991
           F
Sbjct: 677 F 677


>gi|195586452|ref|XP_002082988.1| GD24941 [Drosophila simulans]
 gi|194194997|gb|EDX08573.1| GD24941 [Drosophila simulans]
          Length = 877

 Score = 87.4 bits (215), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 59/172 (34%), Positives = 97/172 (56%), Gaps = 22/172 (12%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 730
           ++ +++ + RKL L++DLD T++++      D V D I        +   H  L+  PH 
Sbjct: 194 DDTRRLLADRKLVLLVDLDQTVIHTTN----DTVPDNI--------KGIYHFQLYG-PHS 240

Query: 731 GMW-TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 789
             + T+LRPG   FLER S+L+E+H+ T G + YA  +A++LDP+G  F+ R++SR    
Sbjct: 241 PWYHTRLRPGTAEFLERMSQLYELHICTFGARNYAHMIAQLLDPEGKFFSHRILSR---D 297

Query: 790 DPFDGDERVPKSKDLEGVL-GMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           + F+      K+ +L+ +    +S V IIDD   VW +   NLI V+ Y +F
Sbjct: 298 ECFNA---TSKTDNLKALFPNGDSMVCIIDDREDVW-NMASNLIQVKPYHFF 345



 Score = 41.6 bits (96), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 32/121 (26%), Positives = 59/121 (48%), Gaps = 4/121 (3%)

Query: 874 VIERLHKIFFS--HQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQT 931
           ++  +HK F+S   ++ +  D++ I+   + ++L G  +VFS + P  +        +  
Sbjct: 555 ILRNIHKRFYSIYDETTEIPDLKVIVPKIRSEVLRGKNLVFSGLVPT-QMKLEQSRAYFI 613

Query: 932 AEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQD 990
           A+  GA    +I  ++TH+VA + GT KVN A       VV+  W+   A  +    E+ 
Sbjct: 614 AKSLGAEVKPNIGKEITHLVAVNAGTYKVNAAKKEPAIKVVNANWLWTCAERWEHVEEKL 673

Query: 991 F 991
           F
Sbjct: 674 F 674


>gi|195353179|ref|XP_002043083.1| GM11819 [Drosophila sechellia]
 gi|194127171|gb|EDW49214.1| GM11819 [Drosophila sechellia]
          Length = 874

 Score = 87.4 bits (215), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 59/172 (34%), Positives = 97/172 (56%), Gaps = 22/172 (12%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 730
           ++ +++ + RKL L++DLD T++++      D V D I        +   H  L+  PH 
Sbjct: 191 DDTRRLLADRKLVLLVDLDQTVIHTTN----DTVPDNI--------KGIYHFQLYG-PHS 237

Query: 731 GMW-TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 789
             + T+LRPG   FLER S+L+E+H+ T G + YA  +A++LDP+G  F+ R++SR    
Sbjct: 238 PWYHTRLRPGTAEFLERMSQLYELHICTFGARNYAHMIAQLLDPEGKFFSHRILSR---D 294

Query: 790 DPFDGDERVPKSKDLEGVL-GMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           + F+      K+ +L+ +    +S V IIDD   VW +   NLI V+ Y +F
Sbjct: 295 ECFNA---TSKTDNLKALFPNGDSMVCIIDDREDVW-NMASNLIQVKPYHFF 342



 Score = 41.6 bits (96), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 32/121 (26%), Positives = 59/121 (48%), Gaps = 4/121 (3%)

Query: 874 VIERLHKIFFS--HQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQT 931
           ++  +HK F+S   ++ +  D++ I+   + ++L G  +VFS + P  +        +  
Sbjct: 552 ILRNIHKRFYSIYDETTEIPDLKVIVPKIRSEVLRGKNLVFSGLVPT-QMKLEQSRAYFI 610

Query: 932 AEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQD 990
           A+  GA    +I  ++TH+VA + GT KVN A       VV+  W+   A  +    E+ 
Sbjct: 611 AKSLGAEVKPNIGKEITHLVAVNAGTYKVNAAKKEPAIKVVNANWLWTCAERWEHVEEKL 670

Query: 991 F 991
           F
Sbjct: 671 F 671


>gi|21914376|gb|AAM81360.1|AF522873_3 RNA polymerase II C-terminal domain phosphatase component
           [Leptosphaeria maculans]
          Length = 804

 Score = 87.4 bits (215), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 56/188 (29%), Positives = 95/188 (50%), Gaps = 21/188 (11%)

Query: 663 QKERTRRLEEQKK-MFSARKLCLVLDLDHTLLNSAKFHEV--------DPVHDEILRKKE 713
           QKE     E+ KK +  A+KL L++DLD T++++     +        +P H  +   K+
Sbjct: 141 QKEANAAEEDAKKRLLGAKKLTLIVDLDQTVIHTTCERTIAEWQADPENPNHGAV---KD 197

Query: 714 EQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDP 773
            +  +    ++        + K RPG+  F +R SKL+EMH+YTM  + YA  + K++DP
Sbjct: 198 VEGFQLADDNVSNVAANWYYVKKRPGLEDFFKRMSKLYEMHVYTMATRAYAQAVCKIIDP 257

Query: 774 KGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIV 833
               F  R++SR ++           K+K L  +    + VVIIDD   VW ++  +L+ 
Sbjct: 258 DRRYFGDRILSRDEN--------YTDKTKSLSRLFQNTTMVVIIDDRADVWQYSP-HLVR 308

Query: 834 VERYTYFP 841
           V  + +FP
Sbjct: 309 VPVFNFFP 316


>gi|302306421|ref|NP_982820.2| ABL127Wp [Ashbya gossypii ATCC 10895]
 gi|299788508|gb|AAS50644.2| ABL127Wp [Ashbya gossypii ATCC 10895]
 gi|374106022|gb|AEY94932.1| FABL127Wp [Ashbya gossypii FDAG1]
          Length = 728

 Score = 87.4 bits (215), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 64/187 (34%), Positives = 92/187 (49%), Gaps = 32/187 (17%)

Query: 673 QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRK---------KEEQDREKPHRH 723
           Q K+  ARKL LV+DLD T+++      VDP   E  +          K+ Q        
Sbjct: 157 QLKLREARKLVLVVDLDQTVIHCG----VDPTIGEWSKDPNNPNYEALKDVQSFSLDEEP 212

Query: 724 LFRFPHMG---------MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 774
           +    +MG          + KLRPG+  F  + +  FE+H+YTM  + YA E+AK++DP 
Sbjct: 213 VLPPFYMGPKPPTRKCWYYVKLRPGLKEFFAKIAPHFELHIYTMATRAYALEIAKIIDPD 272

Query: 775 GVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIV 833
           G LF  R++SR ++G            K LE +  M +S VV+IDD   VW   + NLI 
Sbjct: 273 GKLFGDRILSRDENGS--------LTQKSLERLFPMDQSMVVVIDDRGDVWNWCE-NLIK 323

Query: 834 VERYTYF 840
           V  Y +F
Sbjct: 324 VVPYDFF 330



 Score = 47.4 bits (111), Expect = 0.045,   Method: Compositional matrix adjust.
 Identities = 27/104 (25%), Positives = 47/104 (45%), Gaps = 3/104 (2%)

Query: 892 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVV 951
           D++ ++   +  +  GCR VFS + P+   N     +      FGA  T ++D   TH++
Sbjct: 474 DIKILMPKLKETVFEGCRFVFSGLIPL-HTNIERADIVLWTNMFGASTTANLDYNTTHLI 532

Query: 952 ANSLGTDKVNWALST--GRFVVHPGWVEASALLYRRANEQDFAI 993
             + GT K   A S      +VHP W+    + + R  E  + +
Sbjct: 533 TRTPGTMKARLAKSFNPAIKIVHPDWIFECLVGWERVEESPYEL 576


>gi|336374248|gb|EGO02585.1| hypothetical protein SERLA73DRAFT_102556 [Serpula lacrymans var.
           lacrymans S7.3]
          Length = 811

 Score = 87.4 bits (215), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 65/214 (30%), Positives = 100/214 (46%), Gaps = 34/214 (15%)

Query: 651 FEGYDDQQKAAIQKER-----TRRLEEQKK--------MFSARKLCLVLDLDHTLL---- 693
           + G+ D  +A+IQ        T  LEE +K        + ++RKL L++DLD T++    
Sbjct: 117 YTGFSDASRASIQMTHSAFGPTVSLEEAQKIEKETADHLLNSRKLSLIVDLDQTIVHATV 176

Query: 694 ------NSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERA 747
                 +S    E +P + E L+   +    K  +          + K RPG   FL   
Sbjct: 177 DPTVATDSESDDECNP-NWEALKDVRKFQLVKGKQKFIENEGCMYYIKPRPGWQHFLHSI 235

Query: 748 SKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGV 807
           +  +EMH+YTMG + YA E+   +DP G +F GR++SR + G            K L+ +
Sbjct: 236 ANKYEMHVYTMGTRAYAEEVCAAIDPDGTIFGGRILSRDESGS--------LTQKSLQRL 287

Query: 808 LGME-SAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
              + S VVIIDD   VW  +  NL+ V  Y +F
Sbjct: 288 FPCDTSMVVIIDDRADVWEWSP-NLVKVIPYDFF 320



 Score = 61.6 bits (148), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 34/103 (33%), Positives = 53/103 (51%), Gaps = 2/103 (1%)

Query: 892 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVV 951
           DV  I+ + +++ L G  I+FS V P+ +  P    +W+ AE FGA C   +  ++THVV
Sbjct: 494 DVTKIIPSMRKETLDGIHILFSSVIPL-DTKPETTEIWKVAEMFGAQCCTELSSRITHVV 552

Query: 952 ANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAI 993
           A   GT KV+ A   G   +V   W   S  L+   +E  + +
Sbjct: 553 AAKHGTVKVDAARKRGGIKIVWLSWFTDSIALWSHQDETPYLL 595


>gi|367009794|ref|XP_003679398.1| hypothetical protein TDEL_0B00580 [Torulaspora delbrueckii]
 gi|359747056|emb|CCE90187.1| hypothetical protein TDEL_0B00580 [Torulaspora delbrueckii]
          Length = 713

 Score = 87.4 bits (215), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 61/185 (32%), Positives = 91/185 (49%), Gaps = 32/185 (17%)

Query: 675 KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRK---------KEEQDREKPHRHLF 725
           ++  ++KL LV+DLD T+++      VDP   E  R          K+ Q        + 
Sbjct: 154 RLRESKKLVLVVDLDQTVIHCG----VDPTIGEWKRDSSNPNYEALKDVQSFALDEEPIL 209

Query: 726 RFPHMG---------MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 776
              +MG          + K+RPG+  F ++ + LFEMH+YTM  + YA E+AK++DP G 
Sbjct: 210 PLLYMGPKPPVRKCWYYVKVRPGLKEFFDKVAPLFEMHIYTMATRAYALEIAKIIDPDGS 269

Query: 777 LFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVE 835
           LF  R++SR ++G            K LE +    +S VV+IDD   VW     NLI V 
Sbjct: 270 LFGDRILSRDENGS--------ITQKSLERLFPTDQSMVVVIDDRGDVWNWCP-NLIKVV 320

Query: 836 RYTYF 840
            Y +F
Sbjct: 321 PYNFF 325



 Score = 54.3 bits (129), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 38/144 (26%), Positives = 66/144 (45%), Gaps = 11/144 (7%)

Query: 859 HDER---SEDGTLASSLGVIERLHKIFFS---HQSLDDVDVRNILAAEQRKILAGCRIVF 912
           H++R    +D  L     ++ ++H  ++      S    D++ +L   + K+   C  VF
Sbjct: 432 HNQRLLVDDDDELNYLREILAKVHDEYYKTLGQDSEKKADIKTLLPLMKSKVFEDCHFVF 491

Query: 913 SRVFPVG-EANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDK--VNWALSTGRF 969
           S + P+G   N     LW     FGA  +  IDD  THV+  + GT K  +  A +    
Sbjct: 492 SGLIPLGTNINRADIVLW--TNMFGAQSSPEIDDSTTHVITRTPGTYKARIAKAFNENIK 549

Query: 970 VVHPGWVEASALLYRRANEQDFAI 993
           VVHP WV    + ++  +E+ + +
Sbjct: 550 VVHPDWVFECLVSWKHVDEKPYEL 573


>gi|21483550|gb|AAM52750.1| SD01014p [Drosophila melanogaster]
          Length = 896

 Score = 87.4 bits (215), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 59/172 (34%), Positives = 97/172 (56%), Gaps = 22/172 (12%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 730
           ++ +++ + RKL L++DLD T++++      D V D I        +   H  L+  PH 
Sbjct: 213 DDTRRLLADRKLVLLVDLDQTVIHTTN----DTVPDNI--------KGIYHFQLYG-PHS 259

Query: 731 GMW-TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 789
             + T+LRPG   FLER S+L+E+H+ T G + YA  +A++LDP+G  F+ R++SR    
Sbjct: 260 PWYHTRLRPGTAEFLERMSQLYELHICTFGARNYAHMIAQLLDPEGKFFSHRILSR---D 316

Query: 790 DPFDGDERVPKSKDLEGVL-GMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           + F+      K+ +L+ +    +S V IIDD   VW +   NLI V+ Y +F
Sbjct: 317 ECFNA---TSKTDNLKALFPNGDSMVCIIDDREDVW-NMASNLIQVKPYHFF 364



 Score = 44.3 bits (103), Expect = 0.31,   Method: Compositional matrix adjust.
 Identities = 33/121 (27%), Positives = 60/121 (49%), Gaps = 4/121 (3%)

Query: 874 VIERLHKIFFS--HQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQT 931
           ++  +HK F+S   ++ +  D++ I+   + ++L G  +VFS + P  +        +  
Sbjct: 574 ILRNIHKRFYSIYDETTEIPDLKVIVPKIRSEVLRGKNLVFSGLVPT-QMKLEQSRAYFI 632

Query: 932 AEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQD 990
           A+  GA    +ID ++TH+VA + GT KVN A       VV+  W+   A  +    E+ 
Sbjct: 633 AKSLGAEVKPNIDKEITHLVAVNAGTYKVNAAKKEPAIKVVNANWLWTCAERWEHVEEKL 692

Query: 991 F 991
           F
Sbjct: 693 F 693


>gi|328874143|gb|EGG22509.1| hypothetical protein DFA_04637 [Dictyostelium fasciculatum]
          Length = 397

 Score = 87.0 bits (214), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 87/349 (24%), Positives = 148/349 (42%), Gaps = 50/349 (14%)

Query: 681 KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 740
           K+ L++++DH L +S K  E +    E + K    +                + K RP  
Sbjct: 57  KMNLIINIDHILFHSTKNPESNETQGESVIKCVVDESN------------TYYVKFRPYA 104

Query: 741 WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPK 800
            TFL+    LF + L+++ +K Y  ++ ++LD    +F  ++ISR   G+     ++V K
Sbjct: 105 ATFLQSLQPLFNLILFSLYSKSYVFKLIELLDLNNNIFK-QIISRESFGESL-PKQQVGK 162

Query: 801 SKDLEGV---------LGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLG 851
              L            +    ++ I+DD   +W   + NLI  ER+TYF     + GL  
Sbjct: 163 PYALWNTPSHFTKIFKISAHESLAILDDREDIWRQFRDNLISPERFTYFTKEDDENGLSS 222

Query: 852 PSLLEIDHDERSEDGT---LASSLGVIERLHKIFFS----------------HQSLDDVD 892
                    E S+  T     +S+  +E   ++  S                 QS DD +
Sbjct: 223 TINYPFSIQELSKQFTFPRFNASIPYLEIFSRLLISIQNEYMSRLQETSLQQQQSADDEN 282

Query: 893 -----VRNILAAEQRKILAGCRIVFSRVFP--VGEANPHLHPLWQTAEQFGAVCTKHIDD 945
                 + ++   ++ +L  C IVFS +FP  +     H   + Q AE FGA   + I  
Sbjct: 283 NNFNQSKAVVKELRQCVLMDCNIVFSGIFPKQIDATKLHQTRIVQMAESFGAQVHQDITP 342

Query: 946 QVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAI 993
             TH++    GT KV  A+  G+  VVH  W+  S   + + NE +++I
Sbjct: 343 TTTHLIFIKEGTSKVIQAVKQGQVKVVHFSWLRDSLYNWEKMNESNYSI 391


>gi|366991271|ref|XP_003675401.1| hypothetical protein NCAS_0C00420 [Naumovozyma castellii CBS 4309]
 gi|342301266|emb|CCC69032.1| hypothetical protein NCAS_0C00420 [Naumovozyma castellii CBS 4309]
          Length = 725

 Score = 87.0 bits (214), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 62/195 (31%), Positives = 97/195 (49%), Gaps = 42/195 (21%)

Query: 670 LEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFR--- 726
           L  + ++   +KL LV+DLD T+++      VDP   E      + D + P+    +   
Sbjct: 157 LNVRTRLRKEKKLVLVVDLDQTVIHCG----VDPTIGEW-----KNDPKNPNFETLKDVK 207

Query: 727 ---------FPHMGM-----------WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATE 766
                     P + M           + K+RPG+  FLE+ + LFEMH+YTM  + YA+E
Sbjct: 208 QFSLEEEPILPTLYMGPKPPLRKCWYYVKVRPGLKEFLEKIAPLFEMHIYTMATRAYASE 267

Query: 767 MAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWP 825
           +AK++DP G LF  R++SR ++G           +K LE +    +S V++IDD   VW 
Sbjct: 268 IAKIIDPNGDLFGDRILSRDENGS--------MTTKSLERLFPTDQSMVIVIDDRGDVWN 319

Query: 826 HNKLNLIVVERYTYF 840
            +  NLI V  Y +F
Sbjct: 320 WSP-NLIKVVPYNFF 333



 Score = 58.5 bits (140), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 39/144 (27%), Positives = 69/144 (47%), Gaps = 11/144 (7%)

Query: 859 HDER---SEDGTLASSLGVIERLHKIFFSHQSLD---DVDVRNILAAEQRKILAGCRIVF 912
           HD+R    +D  L      + R+H  +++    D     D++ I+ A ++ +L  CR VF
Sbjct: 442 HDQRLLIDDDDELYHLKDTLVRIHDKYYNVLKKDGELKADIKLIMPAMKQNVLDNCRFVF 501

Query: 913 SRVFPVG-EANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTG--RF 969
           S + P+G +       +W     FGA+ T  ID++ THV+  + GT K   A S      
Sbjct: 502 SGLIPLGTDIQRADIVIWTNT--FGAISTSDIDEKTTHVITKTPGTYKARLAKSFNPKIK 559

Query: 970 VVHPGWVEASALLYRRANEQDFAI 993
           ++HP WV      ++  +E+ + +
Sbjct: 560 ILHPDWVFECLTSWKHVDEKPYEL 583


>gi|195029035|ref|XP_001987380.1| GH21892 [Drosophila grimshawi]
 gi|193903380|gb|EDW02247.1| GH21892 [Drosophila grimshawi]
          Length = 889

 Score = 86.7 bits (213), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 58/173 (33%), Positives = 92/173 (53%), Gaps = 24/173 (13%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 730
           ++ +++ + RKL L++DLD T++++      D V D I          K   H   +   
Sbjct: 174 DDTRRLLADRKLVLLVDLDQTVIHTTN----DTVPDNI----------KGIYHFQLYGPQ 219

Query: 731 GMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD 788
             W  T+LRPG   FLER S+L+E+H+ T G + YA  +A++LDP G  F+ R++SR   
Sbjct: 220 SPWYHTRLRPGTAEFLERMSQLYELHICTFGARNYAHMIAQLLDPDGKFFSHRILSR--- 276

Query: 789 GDPFDGDERVPKSKDLEGVL-GMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
            + F+      K+ +L+ +    +S V IIDD   VW     NLI V+ Y +F
Sbjct: 277 DECFNA---TSKTDNLKALFPNGDSMVCIIDDREDVWSMAS-NLIQVKPYHFF 325



 Score = 43.9 bits (102), Expect = 0.44,   Method: Compositional matrix adjust.
 Identities = 30/119 (25%), Positives = 60/119 (50%), Gaps = 4/119 (3%)

Query: 874 VIERLHKIFFS--HQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQT 931
           ++  +HK F++   ++ +  D++ ++   + ++L G  +VFS + P  +        +  
Sbjct: 566 ILRNIHKRFYAIYDETTEIPDLKIVVPKIRSEVLRGQNLVFSGLVPT-QMKMEQSRAYFI 624

Query: 932 AEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQ 989
           A+  GA    +I+  +TH+VA + GT KVN A       VV+  W+ A A  +   +E+
Sbjct: 625 AKSLGAEVKSNINKDITHLVAVNAGTYKVNAAKKEANIRVVNANWLWACAERWEHVDER 683


>gi|317027693|ref|XP_001399857.2| RNA polymerase II subunit A C-terminal domain phosphatase
           [Aspergillus niger CBS 513.88]
          Length = 800

 Score = 86.3 bits (212), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 58/188 (30%), Positives = 97/188 (51%), Gaps = 32/188 (17%)

Query: 656 DQQKAAIQKERTRRLEE--QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKE 713
           D     + ++   R+EE  ++++ + RKL LV+DLD T++++     VDP   E +  KE
Sbjct: 132 DNTTLTVSEQEATRVEEDAKRRLLANRKLSLVVDLDQTIIHAT----VDPTVGEWMEDKE 187

Query: 714 EQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDP 773
             + +   R          W +      +FL+  S+++E+H+YTMG + YA  +A ++DP
Sbjct: 188 NPNYQASER----------WLE------SFLQNVSEMYELHIYTMGTRSYAQHIASIIDP 231

Query: 774 KGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLI 832
              LF  R++SR + G           +K+L  +  +++  VVIIDD   VW  N  NLI
Sbjct: 232 DRKLFGDRILSRDESGSLV--------AKNLHRLFPVDTKMVVIIDDRGDVWRWNP-NLI 282

Query: 833 VVERYTYF 840
            V  Y +F
Sbjct: 283 KVSPYDFF 290



 Score = 44.7 bits (104), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 32/104 (30%), Positives = 50/104 (48%), Gaps = 4/104 (3%)

Query: 892 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLH-PLWQTAEQFGAVCTKHIDDQVTHV 950
           DV+ I+   + +IL G  +VFS V P+G    +    LW  A+ FGAV    I+   TH+
Sbjct: 478 DVKIIMPNMKHRILGGVTLVFSGVLPLGTDTQNADIXLW--AKSFGAVIASKINMXTTHL 535

Query: 951 VANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAI 993
           VA    T KV  A    +  +V   W+      ++R  E+ + +
Sbjct: 536 VAGRNRTXKVREATRYPKVKIVTTQWLLDCLTQWKRLAEEPYLL 579


>gi|291234950|ref|XP_002737409.1| PREDICTED: RNA polymerase II ctd phosphatase, putative-like
           [Saccoglossus kowalevskii]
          Length = 896

 Score = 85.9 bits (211), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 54/172 (31%), Positives = 95/172 (55%), Gaps = 21/172 (12%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 730
           E+++++  +RKL  ++DLD T++++     +D V + +        ++  H  L+  P  
Sbjct: 169 EDEQRLLKSRKLVCIVDLDQTIIHTT----MDNVPENL--------KDVYHFQLWSGPQY 216

Query: 731 GMW-TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 789
             + T++RP    FLE+ SKL+E+H++T G +LYA  +A  +DP   LF+ R++SR    
Sbjct: 217 PWFHTRIRPKCKEFLEKISKLYELHIFTFGARLYAHMIAGFIDPDKKLFSHRIVSR---D 273

Query: 790 DPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           + FD      K+ +L+ +    ++ V IIDD   VW     N+I V+ Y YF
Sbjct: 274 ECFDAS---SKTANLQAIFPCGDNMVCIIDDREDVWNFAP-NMIHVKPYHYF 321



 Score = 47.0 bits (110), Expect = 0.048,   Method: Compositional matrix adjust.
 Identities = 34/104 (32%), Positives = 48/104 (46%), Gaps = 12/104 (11%)

Query: 902 RKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTK----HIDDQ------VTHVV 951
           R++L G  I+FS VFP    +P     W+ A+  GA         + D+       THVV
Sbjct: 576 RQVLKGTNILFSGVFPTN-MSPEKSRAWKVAQTLGANVQSSFVPKLKDKTNAATATTHVV 634

Query: 952 ANSLGTDKVNWALST-GRFVVHPGWVEASALLYRRANEQDFAIK 994
           A   GT KV  A  T G  +V P W+      + R +E+ F +K
Sbjct: 635 AAKAGTVKVKQAQCTRGIHIVTPEWLWCCYDRWERVDERIFRLK 678


>gi|8778093|gb|AAF79202.1| CTD phosphatase-like protein [Emericella nidulans]
          Length = 409

 Score = 85.5 bits (210), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 62/201 (30%), Positives = 102/201 (50%), Gaps = 34/201 (16%)

Query: 656 DQQKAAIQKERTRRLEE--QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKE 713
           D     + +    R+EE  ++++ + RKL LV+DLD T++++A    VDP   E +    
Sbjct: 29  DNTALTVSEREAIRVEEDAKRRLLANRKLSLVVDLDQTIIHAA----VDPTIGEWM---- 80

Query: 714 EQDREKPHR------HLFRFPHMG-------MWTKLRPGIWTFLERASKLFEMHLYTMGN 760
             D++ P+         F+    G       +  KLRPG+  FL+  + ++E+H+YTMG 
Sbjct: 81  -ADKDNPNHAPVSDVRAFQLVDDGPGMRGLLVLCKLRPGLEEFLKNVADMYELHIYTMGT 139

Query: 761 KLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDD 819
           + YA  +A ++DP   LF  R++SR + G            K+L  +  +++  VVIIDD
Sbjct: 140 RSYAQAIANIIDPDRKLFGDRILSRDESGS--------LSVKNLHRIFPVDTKMVVIIDD 191

Query: 820 SVRVWPHNKLNLIVVERYTYF 840
              VW  +  NLI V  Y +F
Sbjct: 192 RGDVWRWSP-NLIKVIPYDFF 211


>gi|167520468|ref|XP_001744573.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163776904|gb|EDQ90522.1| predicted protein [Monosiga brevicollis MX1]
          Length = 858

 Score = 85.5 bits (210), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 57/171 (33%), Positives = 91/171 (53%), Gaps = 18/171 (10%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 730
           E   K+  ARKL L+LDLD TL++S     +D +    LR   E   +  H   F     
Sbjct: 56  ENANKLLEARKLILILDLDKTLIHST----IDSIASHWLR---EGVYDIFH---FDLGKH 105

Query: 731 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 790
             +TK+RPG+  FLE     +EMH+YTMG + YA  + +++DP    F+ R++++ D+  
Sbjct: 106 TYYTKVRPGLHAFLEDLYPYYEMHIYTMGRRNYAERILRIIDPSNRFFSTRILTQ-DESF 164

Query: 791 PFDGDERVPKSKDLEGVL-GMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
             +      K+K+L+ +L G +S  VI+DD   VW   + N++    Y +F
Sbjct: 165 SIEN-----KAKNLDALLPGGDSMAVILDDLPAVWDF-QTNVVPALPYEFF 209



 Score = 47.8 bits (112), Expect = 0.030,   Method: Compositional matrix adjust.
 Identities = 31/98 (31%), Positives = 49/98 (50%), Gaps = 9/98 (9%)

Query: 905 LAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHI-------DDQVTHVVANSLGT 957
           LAGC ++F+ + P G+ +   H  W+ A   GA     +         +VTHV+A   GT
Sbjct: 592 LAGCILLFTGIIPKGQ-DVATHRAWRQAVAMGARVVDEVKFASILMSRRVTHVIARKAGT 650

Query: 958 DKVNWALST-GRFVVHPGWVEASALLYRRANEQDFAIK 994
           +K+N AL   G F+V   W+E +     RA E  + ++
Sbjct: 651 EKLNQALDYPGVFLVSLRWLEDTFHQGARAKESKYPLE 688


>gi|295671060|ref|XP_002796077.1| conserved hypothetical protein [Paracoccidioides sp. 'lutzii' Pb01]
 gi|226284210|gb|EEH39776.1| conserved hypothetical protein [Paracoccidioides sp. 'lutzii' Pb01]
          Length = 829

 Score = 85.1 bits (209), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 65/201 (32%), Positives = 98/201 (48%), Gaps = 34/201 (16%)

Query: 656 DQQKAAIQKERTRRLEEQKKMFSARKLCL--VLDLDHTLLNSAKFHEVDPVHDEILRKKE 713
           D     + K    R+EE  K        L  V+DLD T++++     VDP   E      
Sbjct: 228 DNSSLTVSKSEATRVEEDAKRRLLSSRRLSLVVDLDQTIIHAT----VDPTVAEW----- 278

Query: 714 EQDREKPHRHLFR----------FPHM-GMW--TKLRPGIWTFLERASKLFEMHLYTMGN 760
           +QDR+ P+    +           P M G W   KLRPG+  FL+  S L+E+H+YTMG 
Sbjct: 279 QQDRDNPNHEAVKDVRAFQLVDDGPGMKGCWYYIKLRPGLQEFLQEISALYELHIYTMGT 338

Query: 761 KLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDD 819
           + YA  +A ++DP   +F  R++SR + G           +K+L+ +  +++  VVIIDD
Sbjct: 339 RAYAQNIATIVDPDRKIFGDRILSRDESGS--------LTAKNLQRLFPVDTKMVVIIDD 390

Query: 820 SVRVWPHNKLNLIVVERYTYF 840
              VW  +  NLI V  Y +F
Sbjct: 391 RGDVWKWSD-NLIKVSPYDFF 410


>gi|448097224|ref|XP_004198617.1| Piso0_001998 [Millerozyma farinosa CBS 7064]
 gi|359380039|emb|CCE82280.1| Piso0_001998 [Millerozyma farinosa CBS 7064]
          Length = 830

 Score = 85.1 bits (209), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 67/203 (33%), Positives = 95/203 (46%), Gaps = 48/203 (23%)

Query: 662 IQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPH 721
           I+   T RL E+KK      L LV+DLD T++++     VDP   E      + D   P+
Sbjct: 152 IETNTTDRLVEEKK------LILVVDLDQTVIHAT----VDPTVGEW-----QSDPSNPN 196

Query: 722 RHLFR---------------------FPHMGMW--TKLRPGIWTFLERASKLFEMHLYTM 758
               +                      P    W   K+RPG+  FLE+ SKL+EMH+YTM
Sbjct: 197 YKAVKDVKSFCLEEESIAPLGWEGPKLPATKCWYYVKVRPGLEQFLEQISKLYEMHIYTM 256

Query: 759 GNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVII 817
             + YA E+AK++DP G  F  R++SR + G            K+L+ +  + +S V II
Sbjct: 257 ATRNYALEIAKIIDPNGKYFGDRILSRDESGS--------LTHKNLKRLFPVDQSMVAII 308

Query: 818 DDSVRVWPHNKLNLIVVERYTYF 840
           DD   VW     NLI V  Y +F
Sbjct: 309 DDRGDVWQWEN-NLIKVVPYDFF 330



 Score = 42.0 bits (97), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 31/135 (22%), Positives = 61/135 (45%), Gaps = 17/135 (12%)

Query: 875 IERLHKIFFSHQSLDDV----DVRNILAAEQRKILAGCRIVFSRVFPVG-EANPHLHPLW 929
           +ER+H  +++   + +     D+  I+   +R  L G  I+FS + P+G   N     +W
Sbjct: 523 LERIHNAYYTDLKVAENGQRPDLTEIIPNLKRTCLEGITILFSGILPLGTNYNNADIVIW 582

Query: 930 QTAEQFGAVCTKHIDDQVTHVVANSLGTDK----VNWALSTGRFV------VHPGWVEAS 979
              +QFG      +  +VTH+VA    T        + +   R +      V+P W+   
Sbjct: 583 --CKQFGVKVVNEVVPEVTHIVARDPSTSNYKTGFTFKVRVAREILPSAKLVNPDWLFMC 640

Query: 980 ALLYRRANEQDFAIK 994
              +++ +E+D+ I+
Sbjct: 641 LSSWKKVDEEDYLIQ 655


>gi|340377687|ref|XP_003387360.1| PREDICTED: hypothetical protein LOC100639785 [Amphimedon
           queenslandica]
          Length = 913

 Score = 85.1 bits (209), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 63/187 (33%), Positives = 95/187 (50%), Gaps = 26/187 (13%)

Query: 659 KAAIQKERTRRLE--EQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 716
           +  + K+  +RL   +++ +   RKL L++DLD TL           +H  I R  E   
Sbjct: 125 QVKVNKKEAQRLGNLDKECLLKNRKLALIIDLDQTL-----------IHTSIDRNIE--- 170

Query: 717 REKPHRHLFRFP-HMGMW-TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 774
           R  P  H F  P H  ++  +LRP +  FL   S+ +E+H+ TMG + YA  + K+LD +
Sbjct: 171 RGLPDVHSFTLPGHSCVYHCRLRPYVREFLNHISQYYELHVATMGTRDYADAITKILDQE 230

Query: 775 GVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIV 833
             LF+ RVISR +  DP        K+  L+ V    +  V I+DD   VW H + NLI 
Sbjct: 231 KKLFSHRVISRNELLDPHS------KAVRLKSVFPCGDEMVAIMDDRGDVWGH-RPNLIH 283

Query: 834 VERYTYF 840
           V+ Y +F
Sbjct: 284 VKAYVFF 290



 Score = 51.6 bits (122), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 36/147 (24%), Positives = 64/147 (43%), Gaps = 28/147 (19%)

Query: 874 VIERLHKIFFSH-------------------QSLDDVDVRNILAAEQRKILAGCRIVFSR 914
           ++ER+H+ F+                     +SL   D++ I+ + ++ + +G R++F+ 
Sbjct: 604 LLERIHETFYKEYDEMKREEEENKGGRDTGSESLSSPDLKVIIPSLRKSVFSGLRLLFTG 663

Query: 915 VFPVGEANPHLHPLWQTAEQFGA------VCTKHIDDQ-VTHVVANSLGTDKVNWALST- 966
           + P    NP     W T   FG       V T   DD+  TH++     TDK   AL   
Sbjct: 664 IIPTN-MNPEKSREWNTTRAFGGEIHSTLVTTPAPDDEHTTHIIVGKKDTDKYKQALRIP 722

Query: 967 GRFVVHPGWVEASALLYRRANEQDFAI 993
           G  +V P W    A  ++R +E ++ +
Sbjct: 723 GIEIVSPEWFWDCAERWKRLDEDEYRL 749


>gi|254586061|ref|XP_002498598.1| ZYRO0G14168p [Zygosaccharomyces rouxii]
 gi|238941492|emb|CAR29665.1| ZYRO0G14168p [Zygosaccharomyces rouxii]
          Length = 764

 Score = 85.1 bits (209), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 60/187 (32%), Positives = 92/187 (49%), Gaps = 32/187 (17%)

Query: 673 QKKMFSARKLCLVLDLDHTLLNSAKFHEVDP----------------VHDEILRKKEEQD 716
           ++++  ++KL LV+DLD T+++      VDP                + D  +   EE+ 
Sbjct: 156 KQRLRQSKKLVLVVDLDQTVIHCG----VDPTIGEWKKDPSNPNYETLKDVQMFSLEEEP 211

Query: 717 REKPHRHLFRFPHMGMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 774
              P     R P    W   K+RPG+  F  + + L+EMH+YTM  + YA E+AK++DP 
Sbjct: 212 IVPPMYMGPRLPERKCWYFVKVRPGLREFFAQLAPLYEMHIYTMATRTYALEIAKIIDPD 271

Query: 775 GVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIV 833
           G LF  R++SR ++G            K LE +    +S V++IDD   VW     NLI 
Sbjct: 272 GSLFGDRILSRDENGS--------LTQKSLERLFPTDQSMVIVIDDRGDVWNWCP-NLIK 322

Query: 834 VERYTYF 840
           V  Y +F
Sbjct: 323 VVPYNFF 329



 Score = 53.5 bits (127), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 33/136 (24%), Positives = 61/136 (44%), Gaps = 3/136 (2%)

Query: 860 DERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVG 919
           D+  E   L  +L  I R +       + +  D+++++   + ++   C  VFS + P+G
Sbjct: 443 DDDDELYYLKDTLAKIHRKYYKLLDEGNENSADIQSLVPTMKGQVFKDCYFVFSGLIPLG 502

Query: 920 EANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWA--LSTGRFVVHPGWVE 977
             N     +      FGA+ +  ID+  THV+  + GT K   A   +    VVHP WV 
Sbjct: 503 -TNIRRADIVLWTGMFGAITSSEIDENTTHVITKTPGTYKARIAKGFNENVKVVHPDWVF 561

Query: 978 ASALLYRRANEQDFAI 993
              + +   +E+ + +
Sbjct: 562 ECLVTWTHVDEKPYEL 577


>gi|226288832|gb|EEH44344.1| RNA polymerase II C-terminal domain phosphatase component
           [Paracoccidioides brasiliensis Pb18]
          Length = 920

 Score = 84.7 bits (208), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 65/201 (32%), Positives = 98/201 (48%), Gaps = 34/201 (16%)

Query: 656 DQQKAAIQKERTRRLEEQKKMFSARKLCL--VLDLDHTLLNSAKFHEVDPVHDEILRKKE 713
           D     + K    R+EE  K        L  V+DLD T++++     VDP   E      
Sbjct: 103 DNSSLTVSKSEATRVEEDAKRRLLSSRRLSLVVDLDQTIIHAT----VDPTVAEW----- 153

Query: 714 EQDREKPHRHLFR----------FPHM-GMW--TKLRPGIWTFLERASKLFEMHLYTMGN 760
           +QDR+ P+    +           P M G W   KLRPG+  FL+  S L+E+H+YTMG 
Sbjct: 154 QQDRDNPNHEAVKDVRAFQLVDDGPGMKGCWYYIKLRPGLQEFLQEISALYELHIYTMGT 213

Query: 761 KLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDD 819
           + YA  +A ++DP   +F  R++SR + G           +K+L+ +  +++  VVIIDD
Sbjct: 214 RAYAQNIAAIVDPDRKIFGDRILSRDESGS--------LTAKNLQRLFPVDTKMVVIIDD 265

Query: 820 SVRVWPHNKLNLIVVERYTYF 840
              VW  +  NLI V  Y +F
Sbjct: 266 RGDVWKWSD-NLIKVSPYDFF 285


>gi|195121496|ref|XP_002005256.1| GI20391 [Drosophila mojavensis]
 gi|193910324|gb|EDW09191.1| GI20391 [Drosophila mojavensis]
          Length = 880

 Score = 84.7 bits (208), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 57/173 (32%), Positives = 93/173 (53%), Gaps = 24/173 (13%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 730
           ++ +++ + RKL L++DLD T++++      D V D I          K   H   +   
Sbjct: 178 DDTRRLLTDRKLVLLVDLDQTVIHTTN----DTVPDNI----------KGIYHFQLYGPQ 223

Query: 731 GMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD 788
             W  T+LRPG   FLE+ S+L+E+H+ T G + YA  +A++LDP G  F+ R++SR   
Sbjct: 224 SPWYHTRLRPGTAEFLEKMSELYELHICTFGARNYAHMIAQLLDPDGKFFSHRILSR--- 280

Query: 789 GDPFDGDERVPKSKDLEGVL-GMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
            + F+      K+ +L+ +    +S V IIDD   VW +   NLI V+ Y +F
Sbjct: 281 DECFNA---TSKTDNLKALFPNGDSMVCIIDDREDVW-NMASNLIQVKPYHFF 329



 Score = 42.0 bits (97), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 33/121 (27%), Positives = 60/121 (49%), Gaps = 4/121 (3%)

Query: 874 VIERLHKIFFS--HQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQT 931
           ++  +HK F++   ++ +  D++ I+   + ++L G  +VFS + P  +        +  
Sbjct: 556 ILRNIHKRFYAIYDETTEIPDLKIIVPKIRCEVLRGQVLVFSGLVPT-QMKLEQSRAYFI 614

Query: 932 AEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQD 990
           A+  GA    +I   VTH+VA + GT KVN A       VV+  W+ A A  +   +E+ 
Sbjct: 615 AKSLGAEVQSNISKDVTHLVAVNAGTYKVNAAKKEPNIKVVNANWLWACAERWEHVDERL 674

Query: 991 F 991
           F
Sbjct: 675 F 675


>gi|198460927|ref|XP_001361849.2| GA11510 [Drosophila pseudoobscura pseudoobscura]
 gi|198137180|gb|EAL26428.2| GA11510 [Drosophila pseudoobscura pseudoobscura]
          Length = 873

 Score = 84.7 bits (208), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 57/173 (32%), Positives = 93/173 (53%), Gaps = 24/173 (13%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 730
           ++ +++ + RKL L++DLD T++++      D V + I          K   H   +   
Sbjct: 180 DDTRRLLADRKLVLLVDLDQTVIHTTN----DTVPENI----------KGIYHFQLYGPQ 225

Query: 731 GMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD 788
             W  T+LRPG   FLER S+L+E+H+ T G + YA  +A++LDP G  F+ R++SR   
Sbjct: 226 SPWYHTRLRPGTAEFLERMSQLYELHICTFGARNYAHMIAQLLDPDGKFFSHRILSR--- 282

Query: 789 GDPFDGDERVPKSKDLEGVL-GMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
            + F+      K+ +L+ +    +S V IIDD   VW +   NLI V+ Y +F
Sbjct: 283 DECFNA---TSKTDNLKALFPNGDSMVCIIDDREDVW-NMASNLIQVKPYHFF 331



 Score = 39.7 bits (91), Expect = 8.5,   Method: Compositional matrix adjust.
 Identities = 36/147 (24%), Positives = 69/147 (46%), Gaps = 10/147 (6%)

Query: 854 LLEIDHDE------RSEDGTLASSLGVIERLHKIFFS--HQSLDDVDVRNILAAEQRKIL 905
           +++++HDE         D  L     ++  +HK F++   ++++  D++ I+   + ++L
Sbjct: 528 IVKVNHDEGKLIEIEDPDDYLLYLEVILRNIHKRFYAIYDETMEIPDLKIIVPKIRCEVL 587

Query: 906 AGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALS 965
            G  +VFS + P  +        +  A+  GA    +I  + TH+VA + GT KVN A  
Sbjct: 588 RGKNLVFSGLVPT-QMRLEQSRAYFIAKSLGAEVQPNISKESTHLVAVNAGTYKVNAAKK 646

Query: 966 TGRF-VVHPGWVEASALLYRRANEQDF 991
                VV+  W+   A  +    E+ F
Sbjct: 647 EVNIRVVNANWLWTCAERWEHVEEKLF 673


>gi|195440020|ref|XP_002067857.1| GK12500 [Drosophila willistoni]
 gi|194163942|gb|EDW78843.1| GK12500 [Drosophila willistoni]
          Length = 657

 Score = 84.3 bits (207), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 56/172 (32%), Positives = 92/172 (53%), Gaps = 22/172 (12%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 730
           ++ +++ + RKL L++DLD T++++      DPV + I          K   H   +   
Sbjct: 181 DDTRRLLNDRKLVLLVDLDQTIIHTTN----DPVPENI----------KGIHHFQLYGSQ 226

Query: 731 GMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD 788
             W  T LRPG   FLER S+++E+H+ T G + YA  +A+++DP+G LF+ R++SR   
Sbjct: 227 SPWYHTCLRPGTTQFLERMSQMYELHICTFGARKYAHMIAQLIDPEGKLFSHRILSR--- 283

Query: 789 GDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
            + F+   ++   K L      +  V IIDD   VW +   NLI V+ Y +F
Sbjct: 284 DECFNATSKMDNLKAL--FPNGDKMVCIIDDREDVW-NMATNLIQVKPYHFF 332


>gi|448111257|ref|XP_004201796.1| Piso0_001998 [Millerozyma farinosa CBS 7064]
 gi|359464785|emb|CCE88490.1| Piso0_001998 [Millerozyma farinosa CBS 7064]
          Length = 830

 Score = 84.3 bits (207), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 61/190 (32%), Positives = 90/190 (47%), Gaps = 42/190 (22%)

Query: 675 KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFR-------- 726
           ++   +KL LV+DLD T++++     VDP   E      + D   P+    +        
Sbjct: 159 RLVDEKKLILVVDLDQTVIHAT----VDPTVGEW-----QSDPSNPNYKAVKDVKSFCLE 209

Query: 727 -------------FPHMGMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVL 771
                         P    W   K+RPG+  FLE+ SKL+EMH+YTM  + YA E+AK++
Sbjct: 210 EESIAPLGWEGPKLPATKCWYYVKVRPGLEEFLEQISKLYEMHIYTMATRNYALEIAKII 269

Query: 772 DPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLN 830
           DP G  F  R++SR + G            K+L+ +  + +S V IIDD   VW     N
Sbjct: 270 DPDGKYFGDRILSRDESGS--------LTHKNLKRLFPVDQSMVAIIDDRGDVWQWEN-N 320

Query: 831 LIVVERYTYF 840
           LI V  Y +F
Sbjct: 321 LIKVVPYDFF 330



 Score = 43.9 bits (102), Expect = 0.40,   Method: Compositional matrix adjust.
 Identities = 33/134 (24%), Positives = 62/134 (46%), Gaps = 17/134 (12%)

Query: 875 IERLHKIFFSH----QSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVG-EANPHLHPLW 929
           +ER+H  +++     ++ +  D+  I+   +RK L G  I+FS + P+G   N     +W
Sbjct: 524 LERIHNAYYTDLKEARNGERPDLTEIIPNLKRKCLEGITILFSGILPLGTNYNNADIVIW 583

Query: 930 QTAEQFGAVCTKHIDDQVTHVVANSLGTDK----VNWALSTGRFV------VHPGWVEAS 979
              +QFG      +  +VTH+VA    T        + +   R V      V+P W+   
Sbjct: 584 --CKQFGVRVVNEVVPEVTHIVARDPSTSNYRTGFTFKVRVARKVLPSAKLVNPDWLFMC 641

Query: 980 ALLYRRANEQDFAI 993
              +++ +E+D+ I
Sbjct: 642 LSSWKKVDEEDYLI 655


>gi|297792863|ref|XP_002864316.1| hypothetical protein ARALYDRAFT_918545 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297310151|gb|EFH40575.1| hypothetical protein ARALYDRAFT_918545 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 142

 Score = 84.0 bits (206), Expect = 4e-13,   Method: Composition-based stats.
 Identities = 52/150 (34%), Positives = 78/150 (52%), Gaps = 15/150 (10%)

Query: 758 MGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVII 817
           MG++ YA  + K++DP+ V F  RVI+R +           P  K L+ VL  E  VVI+
Sbjct: 1   MGDRDYAKNVLKLIDPEKVYFGDRVITRNES----------PYIKTLDLVLADECGVVIV 50

Query: 818 DDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIER 877
           DD+ +VWP +K NL+ + +Y YF    R+      S  E   DE   DG+L + L VI+ 
Sbjct: 51  DDTAQVWPDHKRNLLEITKYNYFSDKTRRDVKYSKSYAEEKRDEGRNDGSLGNVLKVIKE 110

Query: 878 LHKIFFS-----HQSLDDVDVRNILAAEQR 902
           +++ FFS        +D  DVR +    +R
Sbjct: 111 VYERFFSGGVEKELDIDSKDVRLLFHDARR 140


>gi|297830092|ref|XP_002882928.1| hypothetical protein ARALYDRAFT_897808 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297328768|gb|EFH59187.1| hypothetical protein ARALYDRAFT_897808 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 295

 Score = 84.0 bits (206), Expect = 4e-13,   Method: Composition-based stats.
 Identities = 73/254 (28%), Positives = 114/254 (44%), Gaps = 48/254 (18%)

Query: 674 KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRH----LFRFPH 729
           K     +KL LVL+L  T  +S  F          L  KE+  + K +        R   
Sbjct: 53  KNSLEKKKLHLVLNLYGTFFDSQAF--------PCLSNKEKYLKGKVNSRNDLWQTRIRG 104

Query: 730 MGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 789
             +  KLRP +  FL  A+KLF +H+ T+    YA  + K+LDP  + F  R+IS     
Sbjct: 105 HDVLIKLRPFVHEFLREANKLFILHVTTLCIPEYADFVLKLLDPHQLYFGNRIISL---- 160

Query: 790 DPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVW-PHNKLNLIVVERYTYFPCSRRQFG 848
                 + V   K L+ VL  E  V+I+DD   VW P N+ NL+ +  Y+YF  ++++  
Sbjct: 161 -----SKHVIWEKTLDQVLVGEREVIILDDRYDVWSPENRSNLLQITTYSYFKATKKRNS 215

Query: 849 LLG-------------------------PSLLEIDHDERSEDGTLASSLGVIERLHKIFF 883
           + G                          S  E   DE  +DG LA++L  + ++H+ FF
Sbjct: 216 IDGGMFQNLFKYFLKIFSRDDDNLLSDSNSYSEERKDESVDDGALANALRFLFKIHQDFF 275

Query: 884 SHQ-SLDDVDVRNI 896
           +H  S +D+  R++
Sbjct: 276 NHHYSENDIYKRDV 289


>gi|169600911|ref|XP_001793878.1| hypothetical protein SNOG_03310 [Phaeosphaeria nodorum SN15]
 gi|160705543|gb|EAT90041.2| hypothetical protein SNOG_03310 [Phaeosphaeria nodorum SN15]
          Length = 810

 Score = 84.0 bits (206), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 54/180 (30%), Positives = 95/180 (52%), Gaps = 17/180 (9%)

Query: 669 RLEE--QKKMFSARKLCLVLDLDHTLLNSAKFHEV-----DPVHDEILRKKEEQDREKPH 721
           R EE  +K++ +++KL L++DLD T++++     V     DP +      K+ +  +   
Sbjct: 146 RAEEDTKKRLLNSKKLTLIVDLDQTVIHTTCERTVAEWQADPENPNYEAVKDVKGFQLAD 205

Query: 722 RHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGR 781
            +L        + K+RPG+  F ++ SKL+EMH+YTM  + YA  + K++DP    F  R
Sbjct: 206 DNLSNVAANWYYVKMRPGLKEFFDKMSKLYEMHVYTMATRAYAQAIMKIIDPDRKYFGDR 265

Query: 782 VISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERYTYF 840
           ++SR ++           K K+L  +    +A VVIIDD   VW ++  +L+ V  + +F
Sbjct: 266 ILSRDEN--------YTDKLKNLTRLFYQNTAMVVIIDDRADVWQYSP-HLVRVPVFNFF 316



 Score = 44.7 bits (104), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 32/114 (28%), Positives = 56/114 (49%), Gaps = 7/114 (6%)

Query: 886 QSLDDV--DVRNILAAEQRKILAGCRIVFSRVFPVG-EANPHLHPLWQTAEQFGAVCTKH 942
           ++L D+  DV  I+   + ++L G  +VFS + P+G +     + LW     FGA  T  
Sbjct: 507 RALSDIIPDVAEIMPRIKEEVLDGVVVVFSGIIPLGVDVQTSDYALW--IRSFGAEVTTS 564

Query: 943 IDDQVTHVVAN-SLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAIK 994
           ++ + THV+AN    T KV  A       +V+P W+      +   +E  ++I+
Sbjct: 565 VNRRTTHVIANPDRKTTKVKQAARYQHIKIVNPEWMFQCCTRWEHVDETPYSIE 618


>gi|403217618|emb|CCK72111.1| hypothetical protein KNAG_0J00280 [Kazachstania naganishii CBS
           8797]
          Length = 742

 Score = 83.6 bits (205), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 60/183 (32%), Positives = 90/183 (49%), Gaps = 33/183 (18%)

Query: 678 SARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRK---------KEEQDREKPHRHLFRFP 728
           +A+KL LV+DLD T+++      VDP   E  R          ++ Q        +  F 
Sbjct: 176 AAQKLVLVVDLDQTVVHCG----VDPTIGEWKRDPRNPNYEALRDVQSFALEEEPILPFL 231

Query: 729 HMG----------MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLF 778
           ++G           + K+RPG+  F +R + LFEMH+YTM  + YA E+AK++DP   LF
Sbjct: 232 YVGGKRPAPRKCWYYVKVRPGLKQFFKRLAPLFEMHIYTMATRAYALEIAKIIDPDKSLF 291

Query: 779 AGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERY 837
             R++SR ++G            K LE +    +S V +IDD   VW +   NLI V  Y
Sbjct: 292 GDRILSRDENGS--------LTHKSLERLFPTDQSMVTVIDDRGDVW-NWCANLIKVVPY 342

Query: 838 TYF 840
            +F
Sbjct: 343 NFF 345



 Score = 51.6 bits (122), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 28/114 (24%), Positives = 56/114 (49%), Gaps = 5/114 (4%)

Query: 878 LHKIFFS--HQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQF 935
           LHK ++S   ++  D ++++I+   + ++   C  VFS + P+   N     +     +F
Sbjct: 486 LHKRYYSTLKENPQDTNIQSIMTQLKTRVFQDCHFVFSGLIPLN-TNVQAADIVIWTNKF 544

Query: 936 GAVCTKHIDDQVTHVVANSLGTDKVNW--ALSTGRFVVHPGWVEASALLYRRAN 987
           GA  T  +D+  TH++  + GT+K     A +    +VHP W+    + + R +
Sbjct: 545 GATTTNDLDETTTHLITKTPGTNKARLAKAFNPDIKIVHPDWIFECLIKWERVD 598


>gi|332029822|gb|EGI69691.1| RNA polymerase II subunit A C-terminal domain phosphatase
           [Acromyrmex echinatior]
          Length = 749

 Score = 83.6 bits (205), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 55/177 (31%), Positives = 92/177 (51%), Gaps = 32/177 (18%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPH-RHLFRFPH 729
           E+++++ + RKL L++DLD T++++                    D   P+ + +F F  
Sbjct: 145 EDEQRLLTDRKLVLLVDLDQTIVHTT------------------NDNIPPNLKDVFHFQL 186

Query: 730 MGM---W--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 784
            G+   W  T+LRP    FL   S+L+E+H+ T G ++YA  +A +LD  GVLF+ R++S
Sbjct: 187 YGLNSPWYHTRLRPNTRHFLSEMSRLYELHICTFGARIYAHTVASLLDKDGVLFSHRILS 246

Query: 785 RGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           R +  DP        K+ +L+ +    +  V IIDD   VW     NL+ V+ Y +F
Sbjct: 247 RDECFDP------ASKTANLKALFPCGDDLVCIIDDREDVW-QGCGNLVQVKPYHFF 296



 Score = 46.6 bits (109), Expect = 0.067,   Method: Compositional matrix adjust.
 Identities = 33/123 (26%), Positives = 62/123 (50%), Gaps = 8/123 (6%)

Query: 874 VIERLHKIFFSHQSLDDVD----VRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLW 929
           ++ R+H  F++  +LD  +    +R+I+   + ++L G  + FS + P  +   H    +
Sbjct: 443 ILRRIHTEFYA--TLDQENTRKSLRDIIPRVRSQVLKGLCLTFSGLIPTHQ-KLHQSRAY 499

Query: 930 QTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWAL-STGRFVVHPGWVEASALLYRRANE 988
           + A  FGA  T+ + ++ TH+VA   GT K N A   T   +V+  W+   A  +   +E
Sbjct: 500 KVARAFGAEVTQDLTEKTTHLVAIRKGTAKANAARKDTNIKIVNSDWLWTCAERWEHVDE 559

Query: 989 QDF 991
           + F
Sbjct: 560 RLF 562


>gi|406602036|emb|CCH46356.1| RNA polymerase II subunit A C-terminal domain phosphatase
           [Wickerhamomyces ciferrii]
          Length = 720

 Score = 83.6 bits (205), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 70/221 (31%), Positives = 105/221 (47%), Gaps = 42/221 (19%)

Query: 662 IQKERTRRLEE--QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKK------- 712
           I K   +++E+   K +    KL LV+DLD T++++     VDP   E +  +       
Sbjct: 153 ISKSEAQKVEQLMTKNLIKENKLILVVDLDQTVIHAT----VDPTIGEWMNDQSNPNFPS 208

Query: 713 ---------EEQDREKPHRHLFRFPHMGMW--TKLRPGIWTFLERASKLFEMHLYTMGNK 761
                    EE+    P     R P    W   K+RPG+  FL+R +K++E+H+YTMG K
Sbjct: 209 LKDVQYFSLEEEPILPPGYQGPRPPTHKRWYYVKMRPGLEDFLKRIAKIYELHIYTMGTK 268

Query: 762 LYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 820
            YA  +AK++DP G  F  R++SR + G            K LE +   + S VVIIDD 
Sbjct: 269 EYARSIAKIIDPDGEYFGERILSRDESGS--------LTQKSLERLFPTDTSMVVIIDDR 320

Query: 821 VRVWPHNKLNLIVVERYTYFPCSR--------RQFGLLGPS 853
             VW  +  +LI V  + +F            +Q  LLGP+
Sbjct: 321 GDVWNWSD-HLIKVVPFDFFVGIGDINSNFLPKQKSLLGPT 360



 Score = 54.3 bits (129), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 43/144 (29%), Positives = 67/144 (46%), Gaps = 9/144 (6%)

Query: 857 IDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDV--DVRNILAAEQRKILAGCRIVFSR 914
           ID     +D  L      +  +H+ F+S    +    DV+ IL  ++ ++  G   VFS 
Sbjct: 462 IDRLLYDDDDELVGLEKALNEIHETFYSILKKNQTHPDVKEILPQKKHEVFGGFNFVFSG 521

Query: 915 VFPVGE--ANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALST--GRFV 970
           +FP+G    N H+  +W   + FGA  +  I  + THV+  S  T K   A +      V
Sbjct: 522 LFPIGGNINNEHI-VIW--VKSFGASVSTDITMETTHVITKSPHTYKARLAKTMIPNVKV 578

Query: 971 VHPGWVEASALLYRRANEQDFAIK 994
           VHP W+  S   +R A++  F IK
Sbjct: 579 VHPNWLFDSMSNWRPADDIPFEIK 602


>gi|195429765|ref|XP_002062928.1| GK19439 [Drosophila willistoni]
 gi|194159013|gb|EDW73914.1| GK19439 [Drosophila willistoni]
          Length = 827

 Score = 83.6 bits (205), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 57/173 (32%), Positives = 93/173 (53%), Gaps = 24/173 (13%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 730
           ++ +++ + RKL L++DLD T++++      D V D I          K   H   +   
Sbjct: 180 DDTRRLLADRKLVLLVDLDQTVIHTTN----DVVPDNI----------KGIYHFQLYGPQ 225

Query: 731 GMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD 788
             W  T+LRPG   FL+R S L+E+H+ T G + YA  +A++LDP+G  F+ R++SR   
Sbjct: 226 SPWYHTRLRPGTADFLDRMSHLYELHICTFGARNYAHMIAQLLDPEGKFFSHRILSR--- 282

Query: 789 GDPFDGDERVPKSKDLEGVL-GMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
            + F+      K+ +L+ +    +S V IIDD   VW +   NLI V+ Y +F
Sbjct: 283 DECFNA---TSKTDNLKALFPNGDSMVCIIDDREDVW-NMASNLIQVKPYHFF 331



 Score = 40.4 bits (93), Expect = 5.3,   Method: Compositional matrix adjust.
 Identities = 32/121 (26%), Positives = 59/121 (48%), Gaps = 4/121 (3%)

Query: 874 VIERLHKIFFS--HQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQT 931
           ++  +HK F++   ++ +  D++ I+   + ++L G  +VFS + P  +        +  
Sbjct: 505 ILRNIHKRFYAIYDETREIPDLKIIVPKIRCEVLRGQNLVFSGLVPT-QMQLEQSRAYFI 563

Query: 932 AEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQD 990
           A+  GA    +I   +TH+VA + GT KVN A       VV+  W+ A A  +    E+ 
Sbjct: 564 AKSLGAEVQSNIGKDITHLVAVNAGTYKVNAAKRDEHIKVVNANWLWACAERWEHVEEKL 623

Query: 991 F 991
           F
Sbjct: 624 F 624


>gi|50552035|ref|XP_503492.1| YALI0E03278p [Yarrowia lipolytica]
 gi|49649361|emb|CAG79071.1| YALI0E03278p [Yarrowia lipolytica CLIB122]
          Length = 750

 Score = 83.2 bits (204), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 64/193 (33%), Positives = 96/193 (49%), Gaps = 34/193 (17%)

Query: 668 RRLEE--QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILR--KKEEQDREKPHRH 723
           +RLEE   K++   RKL LV+DLD T+++      VDP   E  +       D  K  R 
Sbjct: 155 QRLEEGSTKQLLKQRKLILVVDLDQTVIHVT----VDPTVGEWKKDPSNPNYDAVKDVR- 209

Query: 724 LFRFPHMGM---------------WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMA 768
           +F    M M               + KLRP +  FLE  S+ +E+H+YTM  + YA  +A
Sbjct: 210 VFSLEEMTMVSYDGGKPVPQLCYYYVKLRPHLKEFLEVVSEKYELHIYTMATRAYAKAIA 269

Query: 769 KVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVV-IIDDSVRVWPHN 827
           +++DP G  F  R++SR + G            K L+ +  +++++V IIDD   VW  +
Sbjct: 270 EIIDPDGRYFGDRILSRDESGS--------LTQKSLQRLFPVDTSMVAIIDDRGDVWKWS 321

Query: 828 KLNLIVVERYTYF 840
           K NLI V  Y +F
Sbjct: 322 K-NLIRVVPYDFF 333



 Score = 51.2 bits (121), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 30/97 (30%), Positives = 48/97 (49%), Gaps = 4/97 (4%)

Query: 892 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLH-PLWQTAEQFGAVCTKHIDDQVTHV 950
           DV++IL++    +  GC  +FS + P+G    H    +W   ++FGA   + +  QVTHV
Sbjct: 539 DVQDILSSLTYPVFGGCVFLFSGIIPIGVNIQHADISMW--VKKFGATVVEDVSKQVTHV 596

Query: 951 VANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRA 986
           +A S  T KV  A    R   V+  W+      ++R 
Sbjct: 597 IAASGNTRKVRQASRYKRIKFVYISWIFDCISQWKRV 633


>gi|302497759|ref|XP_003010879.1| hypothetical protein ARB_02918 [Arthroderma benhamiae CBS 112371]
 gi|291174424|gb|EFE30239.1| hypothetical protein ARB_02918 [Arthroderma benhamiae CBS 112371]
          Length = 1048

 Score = 83.2 bits (204), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 58/170 (34%), Positives = 91/170 (53%), Gaps = 32/170 (18%)

Query: 685 VLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR------HLFRF----PHM-GMW 733
           V+DLD T++++     VDP   E      +QD++ P+         F+     P M G W
Sbjct: 347 VVDLDQTIIHAT----VDPTVAEW-----QQDKDNPNHDAVKDVRCFQLVDDGPGMRGCW 397

Query: 734 --TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 791
              KLRPG+  FL+  S L+E+H+YTMG + YA  +A ++DP   +F  R++SR + G  
Sbjct: 398 YYIKLRPGLEEFLKVISTLYELHIYTMGTRAYAQNVANIVDPDKKIFGDRILSRDESGS- 456

Query: 792 FDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERYTYF 840
                    +K+L+ +  +++  VVIIDD   VW  ++ NLI V  Y +F
Sbjct: 457 -------LTAKNLQRLFPVDTKMVVIIDDRGDVWKWSE-NLIKVSPYDFF 498



 Score = 40.8 bits (94), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 29/104 (27%), Positives = 50/104 (48%), Gaps = 4/104 (3%)

Query: 892 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLH-PLWQTAEQFGAVCTKHIDDQVTHV 950
           DV+ ++   +R  L    +VFS V P+G    +    LW  A+ FGA  T  I+ + TH+
Sbjct: 711 DVKVLMPEIKRVALGSVILVFSGVLPLGTDTQNADISLW--AKSFGATITSKINSRTTHL 768

Query: 951 VANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAI 993
           VA    T KV  A    +  +V   W+    + ++  +E+ + +
Sbjct: 769 VAGRNRTAKVREATRYPKIKIVTVQWLLDCMIQWKHLDEEPYLV 812


>gi|300176006|emb|CBK22223.2| unnamed protein product [Blastocystis hominis]
          Length = 680

 Score = 82.8 bits (203), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 80/314 (25%), Positives = 142/314 (45%), Gaps = 48/314 (15%)

Query: 670 LEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPH 729
           L  QK++ +AR+L LV DLD+TL+  +     DP      R         P+ H  +F  
Sbjct: 9   LVNQKRLLAARRLGLVFDLDNTLMEQSD----DP------RCSVAPSFGIPNIHFIQFKR 58

Query: 730 MGMWTK----LRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISR 785
               +K    LRP + + L   SK +E+ +YT G + YA  + + +DPK  LF  RVI+R
Sbjct: 59  NNQLSKHTIILRPEVQSILTELSKYYELSIYTNGVRTYAQAIIESIDPKHQLFGSRVIAR 118

Query: 786 GDDGDPFDG---DERVPKSKDLEGVL-GMESAVVIIDDSVRVWPHNKLNLIVVERYTYFP 841
            D  D  +    +  +P SKD+  VL G+E   V++DDSV VW    + ++ + ++ ++ 
Sbjct: 119 DDVPDNSETNFFNNFLPASKDISFVLPGLERLGVVVDDSVEVWKDRAI-VLHIPKFCFWR 177

Query: 842 CSRRQFGLLGP---SLLE-----IDHDERSEDGTLASSLGVIE----RLHKIFFSHQSLD 889
              + +   G    ++ E     I++D          S+ +++    ++H+ F++     
Sbjct: 178 SFLKCYETGGKKVETMFEAVGWIINND----------SMNIVKDTLVQIHQQFYARAQQR 227

Query: 890 D-----VDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHID 944
           D       V  ++   +  +     I F   F   + +P    L+   +Q G    +   
Sbjct: 228 DTGVPSTSVGEVIGQLRASLFRNTLIYFDETF--SKRDPKSQYLFSLVKQMGGKIAEAYT 285

Query: 945 DQVTHVVANSLGTD 958
             VTHVV+  + T+
Sbjct: 286 PDVTHVVSAGVETE 299


>gi|321262398|ref|XP_003195918.1| carboxy-terminal domain (CTD) phosphatase; Fcp1p [Cryptococcus
           gattii WM276]
 gi|317462392|gb|ADV24131.1| Carboxy-terminal domain (CTD) phosphatase, putative; Fcp1p
           [Cryptococcus gattii WM276]
          Length = 952

 Score = 82.8 bits (203), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 48/113 (42%), Positives = 68/113 (60%), Gaps = 12/113 (10%)

Query: 731 GMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD 788
           G W  TK RPG+  FL+  S+L+EMH+YTMG + YA  + KV+DP G +F GR++SR + 
Sbjct: 304 GRWYFTKPRPGLQKFLDEMSQLYEMHVYTMGTRTYADAIVKVIDPDGKIFGGRILSRDES 363

Query: 789 GDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           G  F        SK+L+ +   + S VV+IDD   VW  +  NL+ V  Y +F
Sbjct: 364 GS-F-------SSKNLKRLFPTDTSMVVVIDDRSDVW-GDCPNLVKVVPYDFF 407



 Score = 66.6 bits (161), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 41/124 (33%), Positives = 61/124 (49%), Gaps = 11/124 (8%)

Query: 877 RLHKIFFSHQSLD-------DVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLW 929
           R +K F + +  D         DV  I+   + ++L GC +VFS + P  EA+P    +W
Sbjct: 610 RFYKAFDALEGWDPKKALPMSCDVEFIIPEMKAEVLDGCSLVFSGMIPR-EADPSTTTIW 668

Query: 930 QTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALST--GRFVVHPGWVEASALLYRRAN 987
           QTAE FGA+ T  +  + TH+V   L T+K  W      G  VV   W   S  L+ R +
Sbjct: 669 QTAESFGALITPSLTSRTTHLVTALLNTEKT-WRAGKMEGVKVVWAQWFWDSVALWERQD 727

Query: 988 EQDF 991
           E+ +
Sbjct: 728 EEKY 731


>gi|150866706|ref|XP_001386384.2| hypothetical protein PICST_63097 [Scheffersomyces stipitis CBS
           6054]
 gi|149387962|gb|ABN68355.2| predicted protein [Scheffersomyces stipitis CBS 6054]
          Length = 790

 Score = 82.8 bits (203), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 66/202 (32%), Positives = 95/202 (47%), Gaps = 38/202 (18%)

Query: 658 QKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDP-------------- 703
           + A I++  T RL E+KK      L LV+DLD T++++     VDP              
Sbjct: 148 EAAKIEQSTTDRLNEEKK------LILVVDLDQTVIHAT----VDPTVGEWQSDPSNPNY 197

Query: 704 --VHDEILRKKEEQDREKPHRHLFRFPHMGMW--TKLRPGIWTFLERASKLFEMHLYTMG 759
             + D      EE+    P     R      W   K+RPG+  FLE    L+EMH+YTM 
Sbjct: 198 PAIKDVKTFCLEEEAIVPPGWTGPRLAPTKCWYYVKVRPGLSDFLEEIVNLYEMHIYTMA 257

Query: 760 NKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIID 818
            + YA  +AK++DP G  F  R++SR + G            K+L+ +  + +S VVIID
Sbjct: 258 TRNYALAIAKIIDPTGKYFGDRILSRDESGS--------LTHKNLKRLFPVDQSMVVIID 309

Query: 819 DSVRVWPHNKLNLIVVERYTYF 840
           D   +W     NLI V  Y +F
Sbjct: 310 DRGDIWQWES-NLIKVVPYDFF 330


>gi|326475449|gb|EGD99458.1| RNA Polymerase II CTD phosphatase [Trichophyton tonsurans CBS
           112818]
          Length = 866

 Score = 82.8 bits (203), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 58/170 (34%), Positives = 91/170 (53%), Gaps = 32/170 (18%)

Query: 685 VLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR------HLFRF----PHM-GMW 733
           V+DLD T++++     VDP   E      +QD++ P+         F+     P M G W
Sbjct: 163 VVDLDQTIIHAT----VDPTVAEW-----QQDKDNPNHDAVKDVRCFQLVDDGPGMRGCW 213

Query: 734 --TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 791
              KLRPG+  FL+  S L+E+H+YTMG + YA  +A ++DP   +F  R++SR + G  
Sbjct: 214 YYIKLRPGLEEFLKVISTLYELHIYTMGTRAYAQNVANIVDPDKKIFGDRILSRDESGS- 272

Query: 792 FDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERYTYF 840
                    +K+L+ +  +++  VVIIDD   VW  ++ NLI V  Y +F
Sbjct: 273 -------LTAKNLQRLFPVDTKMVVIIDDRGDVWKWSE-NLIKVSPYDFF 314



 Score = 45.4 bits (106), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 30/104 (28%), Positives = 51/104 (49%), Gaps = 4/104 (3%)

Query: 892 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLH-PLWQTAEQFGAVCTKHIDDQVTHV 950
           DV+N++   +R  L    +VFS V P+G    +    LW  A+ FGA  T  I+ + TH+
Sbjct: 528 DVKNLMPEIKRVALGSVILVFSGVLPLGTDTQNADISLW--AKSFGATITSKINSRTTHL 585

Query: 951 VANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAI 993
           VA    T KV  A    +  +V   W+    + ++  +E+ + +
Sbjct: 586 VAGRNRTAKVREATRYTKIKIVTVQWLLDCMIQWKHLDEEPYLV 629


>gi|326477486|gb|EGE01496.1| RNA Polymerase II CTD phosphatase Fcp1 [Trichophyton equinum CBS
           127.97]
          Length = 866

 Score = 82.8 bits (203), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 58/170 (34%), Positives = 91/170 (53%), Gaps = 32/170 (18%)

Query: 685 VLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR------HLFRF----PHM-GMW 733
           V+DLD T++++     VDP   E      +QD++ P+         F+     P M G W
Sbjct: 163 VVDLDQTIIHAT----VDPTVAEW-----QQDKDNPNHDAVKDVRCFQLVDDGPGMRGCW 213

Query: 734 --TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 791
              KLRPG+  FL+  S L+E+H+YTMG + YA  +A ++DP   +F  R++SR + G  
Sbjct: 214 YYIKLRPGLEEFLKVISTLYELHIYTMGTRAYAQNVANIVDPDKKIFGDRILSRDESGS- 272

Query: 792 FDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERYTYF 840
                    +K+L+ +  +++  VVIIDD   VW  ++ NLI V  Y +F
Sbjct: 273 -------LTAKNLQRLFPVDTKMVVIIDDRGDVWKWSE-NLIKVSPYDFF 314



 Score = 45.4 bits (106), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 30/104 (28%), Positives = 51/104 (49%), Gaps = 4/104 (3%)

Query: 892 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLH-PLWQTAEQFGAVCTKHIDDQVTHV 950
           DV+N++   +R  L    +VFS V P+G    +    LW  A+ FGA  T  I+ + TH+
Sbjct: 528 DVKNLMPEIKRVALGSVILVFSGVLPLGTDTQNADISLW--AKSFGATITSKINSRTTHL 585

Query: 951 VANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAI 993
           VA    T KV  A    +  +V   W+    + ++  +E+ + +
Sbjct: 586 VAGRNRTAKVREATRYTKIKIVTVQWLLDCMIQWKHLDEEPYLV 629


>gi|297819964|ref|XP_002877865.1| hypothetical protein ARALYDRAFT_906617 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297323703|gb|EFH54124.1| hypothetical protein ARALYDRAFT_906617 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 345

 Score = 82.8 bits (203), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 65/207 (31%), Positives = 100/207 (48%), Gaps = 41/207 (19%)

Query: 680 RKLCLVLDLDHTLLN---SAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWT-- 734
           RKL LVL L+HTL++    +K  E+D  H  +L + +   R+     LFR  +   ++  
Sbjct: 96  RKLHLVLSLEHTLIDLISVSKLSEIDRYH--LLEEADSGSRDD----LFRLANESFYSSD 149

Query: 735 ---KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 791
              K RP +  FL  A K+F MH+YT      A ++ K+LDP  + F  R+I+  D    
Sbjct: 150 ALVKFRPFVREFLREAEKIFTMHVYTNYGPGLAKKVVKLLDPHMIYFGNRIITSKDS--- 206

Query: 792 FDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLG 851
            +GD      K LE VL     V+I+D   R+W     N+I + +Y YF           
Sbjct: 207 -NGD-----LKSLELVLAEPRGVLIVDYDHRLWKSPGHNVIFMSKYVYFK---------- 250

Query: 852 PSLLEIDHDERSEDGTLASSLGVIERL 878
               EI     +EDG LA +L +++++
Sbjct: 251 ----EIS----NEDGVLAKTLNLLKKI 269


>gi|302657133|ref|XP_003020296.1| hypothetical protein TRV_05607 [Trichophyton verrucosum HKI 0517]
 gi|291184115|gb|EFE39678.1| hypothetical protein TRV_05607 [Trichophyton verrucosum HKI 0517]
          Length = 865

 Score = 82.4 bits (202), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 58/170 (34%), Positives = 91/170 (53%), Gaps = 32/170 (18%)

Query: 685 VLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR------HLFRF----PHM-GMW 733
           V+DLD T++++     VDP   E      +QD++ P+         F+     P M G W
Sbjct: 163 VVDLDQTIIHAT----VDPTVAEW-----QQDKDNPNHDAVKDVRCFQLVDDGPGMRGCW 213

Query: 734 --TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 791
              KLRPG+  FL+  S L+E+H+YTMG + YA  +A ++DP   +F  R++SR + G  
Sbjct: 214 YYIKLRPGLEEFLKVISTLYELHIYTMGTRAYAQNVANIVDPDKKIFGDRILSRDESGS- 272

Query: 792 FDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERYTYF 840
                    +K+L+ +  +++  VVIIDD   VW  ++ NLI V  Y +F
Sbjct: 273 -------LTAKNLQRLFPVDTKMVVIIDDRGDVWKWSE-NLIKVSPYDFF 314


>gi|327296037|ref|XP_003232713.1| RNA Polymerase II CTD phosphatase [Trichophyton rubrum CBS 118892]
 gi|326465024|gb|EGD90477.1| RNA Polymerase II CTD phosphatase [Trichophyton rubrum CBS 118892]
          Length = 836

 Score = 82.4 bits (202), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 58/170 (34%), Positives = 91/170 (53%), Gaps = 32/170 (18%)

Query: 685 VLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR------HLFRF----PHM-GMW 733
           V+DLD T++++     VDP   E      +QD++ P+         F+     P M G W
Sbjct: 134 VVDLDQTIIHAT----VDPTVAEW-----QQDKDNPNHDAVKDVRCFQLVDDGPGMRGCW 184

Query: 734 --TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 791
              KLRPG+  FL+  S L+E+H+YTMG + YA  +A ++DP   +F  R++SR + G  
Sbjct: 185 YYIKLRPGLEEFLKVISTLYELHIYTMGTRAYAQNVANIVDPDKKIFGDRILSRDESGS- 243

Query: 792 FDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERYTYF 840
                    +K+L+ +  +++  VVIIDD   VW  ++ NLI V  Y +F
Sbjct: 244 -------LTAKNLQRLFPVDTKMVVIIDDRGDVWKWSE-NLIKVSPYDFF 285



 Score = 41.2 bits (95), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 29/104 (27%), Positives = 50/104 (48%), Gaps = 4/104 (3%)

Query: 892 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLH-PLWQTAEQFGAVCTKHIDDQVTHV 950
           DV+ ++   +R  L    +VFS V P+G    +    LW  A+ FGA  T  I+ + TH+
Sbjct: 498 DVKVLMPEIKRVALGSVILVFSGVLPLGTDTQNADISLW--AKSFGATITSKINSRTTHL 555

Query: 951 VANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAI 993
           VA    T KV  A    +  +V   W+    + ++  +E+ + +
Sbjct: 556 VAGRNRTAKVREATRYPKIKIVTVQWLLDCMIQWKHLDEEPYLV 599


>gi|260949511|ref|XP_002619052.1| hypothetical protein CLUG_00211 [Clavispora lusitaniae ATCC 42720]
 gi|238846624|gb|EEQ36088.1| hypothetical protein CLUG_00211 [Clavispora lusitaniae ATCC 42720]
          Length = 776

 Score = 82.4 bits (202), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 68/202 (33%), Positives = 94/202 (46%), Gaps = 38/202 (18%)

Query: 658 QKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDP-------------- 703
           +   I++  T RL   KK      L LV+DLD T++++     VDP              
Sbjct: 145 EATKIEQSSTERLAADKK------LILVVDLDQTVIHAT----VDPTVGEWQRDPQNPNY 194

Query: 704 --VHDEILRKKEEQDREKPHRHLFRFPHMGMW--TKLRPGIWTFLERASKLFEMHLYTMG 759
             V D  L   EE+    P     R P    W   KLRPG+  FL   SKL+E+H+YTM 
Sbjct: 195 PFVKDVQLFSLEEEPIVPPGWVGPRPPPTKCWYYVKLRPGLKEFLAEVSKLYELHIYTMA 254

Query: 760 NKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIID 818
            + YA  +A ++DP G  F  R++SR + G            K+L  +  + +S VVIID
Sbjct: 255 TRNYALAIASIIDPDGKYFGDRILSRDESGS--------LTHKNLRRLFPVDQSMVVIID 306

Query: 819 DSVRVWPHNKLNLIVVERYTYF 840
           D   VW   + NLI V  Y +F
Sbjct: 307 DRGDVW-QWEANLIKVVPYDFF 327


>gi|348665920|gb|EGZ05748.1| hypothetical protein PHYSODRAFT_566275 [Phytophthora sojae]
          Length = 684

 Score = 82.4 bits (202), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 52/170 (30%), Positives = 81/170 (47%), Gaps = 42/170 (24%)

Query: 674 KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMW 733
           ++   A+KL LVLDLDHTLL++ +   VD V  EI                   P  GM 
Sbjct: 266 RRQLGAKKLSLVLDLDHTLLHAVR---VDDVVGEI-------------------PKSGML 303

Query: 734 TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFD 793
                         S L+++ +YT G +LYA ++ K++DP    F  R+++R D  D   
Sbjct: 304 --------------SALYDLFIYTHGTRLYAEQIVKIIDPDESYFKNRIVARTDTPDMLH 349

Query: 794 GDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCS 843
                 KS  L      +S ++++DD + VW  N+ N+ ++E Y YF C+
Sbjct: 350 ------KSLKLLFPSCDDSMILVLDDRIDVWKENEGNVFLIEPYHYFKCT 393


>gi|384488044|gb|EIE80224.1| hypothetical protein RO3G_04929 [Rhizopus delemar RA 99-880]
          Length = 433

 Score = 82.4 bits (202), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 47/133 (35%), Positives = 77/133 (57%), Gaps = 17/133 (12%)

Query: 661 AIQKERTRRLEEQ--KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDRE 718
            + +    RLE++  K++  +RKL L+LDLD T+++++     DP    I   K E+ R+
Sbjct: 9   TVSRSEAERLEKENAKRLLESRKLSLILDLDQTIVHAS----CDP---RISHWKNEEIRQ 61

Query: 719 KPHRHLFRFPH--MGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 776
                 F  P      + KLRPG+  FL+    L+++H+YTMG K YA  +A+ +DP+G 
Sbjct: 62  ------FTLPKSPTMYYIKLRPGLREFLKEIENLYDLHIYTMGTKDYAKAVAREMDPEGS 115

Query: 777 LFAGRVISRGDDG 789
           LF  R++SR ++G
Sbjct: 116 LFKERILSRDENG 128


>gi|344233336|gb|EGV65209.1| hypothetical protein CANTEDRAFT_104476 [Candida tenuis ATCC 10573]
 gi|344233337|gb|EGV65210.1| hypothetical protein CANTEDRAFT_104476 [Candida tenuis ATCC 10573]
          Length = 788

 Score = 82.4 bits (202), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 65/203 (32%), Positives = 98/203 (48%), Gaps = 38/203 (18%)

Query: 657 QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDP------------- 703
           ++ A I++  T RL +Q      RKL LV+DLD T++++     VDP             
Sbjct: 147 EEAAKIEQNSTTRLTQQ------RKLILVVDLDQTVIHAT----VDPTVGEWQSDPSNPN 196

Query: 704 ---VHDEILRKKEEQDREKPHRHLFRFPHMGMW--TKLRPGIWTFLERASKLFEMHLYTM 758
              V D      EE+    P+    +      W   KLRPG+  FL   ++++EMH+YTM
Sbjct: 197 YRAVKDVQSFCLEEEPITPPNWSGPKLSPTKCWYYVKLRPGLEEFLREMAEIYEMHIYTM 256

Query: 759 GNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVII 817
             + YA  +AK++DP+G  F  R++SR + G            K+L+ +  + +S V II
Sbjct: 257 ATRNYALAIAKIIDPEGEYFGDRILSRDESGS--------LTHKNLKRLFPVDQSMVAII 308

Query: 818 DDSVRVWPHNKLNLIVVERYTYF 840
           DD   VW     NLI V  Y +F
Sbjct: 309 DDRGDVWQWED-NLIKVVPYDFF 330


>gi|325087549|gb|EGC40859.1| RNA polymerase II C-terminal domain phosphatase component
           [Ajellomyces capsulatus H88]
          Length = 885

 Score = 82.0 bits (201), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 64/201 (31%), Positives = 96/201 (47%), Gaps = 34/201 (16%)

Query: 656 DQQKAAIQKERTRRLEEQKKMFSARKLCL--VLDLDHTLLNSAKFHEVDPVHDEILRKKE 713
           D     + +    R+EE  K        L  V+DLD T++++     VDP   E      
Sbjct: 103 DNSALTVSENEATRVEEDAKRRLLSSRRLSLVVDLDQTIIHAT----VDPTVAEW----- 153

Query: 714 EQDREKPHRHLFR----------FPHM-GMW--TKLRPGIWTFLERASKLFEMHLYTMGN 760
           +QD++ P+    +           P M G W   KLRPG+  FL   S LFE+H+YTMG 
Sbjct: 154 QQDKDNPNHEAVKDVRAFQLVDDGPGMKGCWYYIKLRPGLEEFLRNISTLFELHIYTMGT 213

Query: 761 KLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDD 819
           + YA  +A ++DP   +F  R++SR + G           +K+L+ +  +++  VVIIDD
Sbjct: 214 RAYAQHIASIVDPDRKIFGDRILSRDESGS--------LTAKNLQRLFPVDTKMVVIIDD 265

Query: 820 SVRVWPHNKLNLIVVERYTYF 840
              VW     NLI V  Y +F
Sbjct: 266 RGDVWKWTD-NLIKVVPYDFF 285



 Score = 44.7 bits (104), Expect = 0.29,   Method: Compositional matrix adjust.
 Identities = 30/104 (28%), Positives = 52/104 (50%), Gaps = 4/104 (3%)

Query: 892 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLH-PLWQTAEQFGAVCTKHIDDQVTHV 950
           D++ I+   +R +L    +VFS V P+G    +    LW  A+ FGA+ ++ I  + TH+
Sbjct: 482 DIKVIMPQIKRPVLQSVDLVFSGVLPLGTDTQNADISLW--AKSFGAIISQKISSKTTHL 539

Query: 951 VANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAI 993
           VA    T KV  A       +V   W+  S   ++R +E+ + +
Sbjct: 540 VAGRNRTAKVREATRYPNIKIVTVQWLLDSITQWKRLDEEPYLV 583


>gi|190346120|gb|EDK38128.2| hypothetical protein PGUG_02226 [Meyerozyma guilliermondii ATCC
           6260]
          Length = 732

 Score = 82.0 bits (201), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 62/191 (32%), Positives = 92/191 (48%), Gaps = 42/191 (21%)

Query: 674 KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFR------- 726
           +++ S RKL LV+DLD T++++     VDP   E      + D   P+    +       
Sbjct: 110 ERLTSERKLILVVDLDQTVIHAT----VDPTVGEW-----QSDPSNPNYRAVKDVRSFCL 160

Query: 727 -----------FPHM-----GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKV 770
                       P M       + K+RPG+  FL+R S+L+EMH+YTM  + YA  +A +
Sbjct: 161 EEDPIAPPGWSGPKMTPTKCWYYVKVRPGLEDFLKRVSQLYEMHVYTMATRNYALAIAHI 220

Query: 771 LDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKL 829
           +DP G  F  R++SR + G            K+L  +  + +S VVIIDD   VW   K 
Sbjct: 221 IDPDGRYFGDRILSRDESGS--------LTHKNLRRLFPVDQSMVVIIDDRGDVWQWEK- 271

Query: 830 NLIVVERYTYF 840
           NLI V  Y +F
Sbjct: 272 NLIKVVPYEFF 282



 Score = 42.0 bits (97), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 38/145 (26%), Positives = 66/145 (45%), Gaps = 15/145 (10%)

Query: 864 EDGTLASSLGVIERLHKIFF-------SHQSLDDVDVRNILAAEQRKILAGCRIVFSRVF 916
           +D  L S    + R+H  ++         +SL + D+  I+ + + K L G  ++FS + 
Sbjct: 462 DDNELESLNDALVRVHDEYYRIWDSYNEDKSLTNPDLTTIIPSLKSKCLEGIVVLFSGIL 521

Query: 917 PVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVAN--SLG---TDKVNWALST--GRF 969
             G ++P    +    +QFG      +  +VTHV+    S G   T KV  A  T     
Sbjct: 522 RWG-SDPQKADIVIWCQQFGVKVVNEVYPEVTHVICREPSAGGGLTFKVRVAKKTLPNVH 580

Query: 970 VVHPGWVEASALLYRRANEQDFAIK 994
           +V P W+ A    + R +E+ +AI+
Sbjct: 581 IVTPDWLFACMSKWTRVDEKPYAIE 605


>gi|315051428|ref|XP_003175088.1| RNA polymerase II subunit A domain phosphatase [Arthroderma gypseum
           CBS 118893]
 gi|311340403|gb|EFQ99605.1| RNA polymerase II subunit A domain phosphatase [Arthroderma gypseum
           CBS 118893]
          Length = 867

 Score = 82.0 bits (201), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 58/170 (34%), Positives = 91/170 (53%), Gaps = 32/170 (18%)

Query: 685 VLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR------HLFRF----PHM-GMW 733
           V+DLD T++++     VDP   E      +QD++ P+         F+     P M G W
Sbjct: 163 VVDLDQTIIHAT----VDPTVGEW-----QQDKDNPNHDAVKDVRCFQLVDDGPGMRGCW 213

Query: 734 --TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 791
              KLRPG+  FL+  S L+E+H+YTMG + YA  +A ++DP   +F  R++SR + G  
Sbjct: 214 YYIKLRPGLEEFLKVISTLYELHIYTMGTRAYAQNVANIVDPDRKIFGDRILSRDESGS- 272

Query: 792 FDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERYTYF 840
                    +K+L+ +  +++  VVIIDD   VW  ++ NLI V  Y +F
Sbjct: 273 -------LTAKNLQRLFPVDTKMVVIIDDRGDVWKWSE-NLIKVTPYDFF 314


>gi|443696103|gb|ELT96883.1| hypothetical protein CAPTEDRAFT_23527, partial [Capitella teleta]
          Length = 562

 Score = 82.0 bits (201), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 57/174 (32%), Positives = 93/174 (53%), Gaps = 27/174 (15%)

Query: 672 EQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMG 731
           +++++   +KL L++DLD TL+++      D V   +              H F+  H  
Sbjct: 132 DEQRLIRDKKLVLLVDLDQTLIHTTN----DKVPANL-----------KDVHHFQLHHGR 176

Query: 732 --MW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD 787
             +W  TK RPG   FLER SKL+E+H+ T G ++YA  +AK+LDP G  F+ R++SR +
Sbjct: 177 NLLWYHTKFRPGTEKFLERISKLYELHICTFGVRMYAHTIAKLLDPDGKYFSHRILSRDE 236

Query: 788 DGDPFDGDERVPKSKDLEGVL-GMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
             +P        K+ +L+ +    +S V IIDD   VW  +  +L+ V+ Y +F
Sbjct: 237 CFNP------TSKTGNLKALFPCGDSMVCIIDDREDVWRFSP-SLVHVKPYLFF 283



 Score = 47.4 bits (111), Expect = 0.042,   Method: Compositional matrix adjust.
 Identities = 39/152 (25%), Positives = 71/152 (46%), Gaps = 19/152 (12%)

Query: 858 DHDERSE----DGTLASSLGVIERLHKIFFS--HQSLDDV-------DVRNILAAEQRKI 904
           D++E  E    D  L     ++ R+H  +F   +Q ++         D++ +L   ++++
Sbjct: 412 DYEEMIEWEDSDNYLVYLEDILSRIHTAYFQFYNQMIEKKAEDKQLPDIKTVLPYVKKRV 471

Query: 905 LAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHI----DDQVTHVVANSLGTDKV 960
           L G  I+FS + P+ + N      +  A+  GA     +    +D+ THVVA   GT K+
Sbjct: 472 LRGVNILFSGMIPINK-NYEKSRAYIVAKSLGANIQTSLETEGEDRTTHVVAARDGTQKI 530

Query: 961 NWALST-GRFVVHPGWVEASALLYRRANEQDF 991
           N A    G  +V+  W+   A  + R +EQ +
Sbjct: 531 NDARKMKGVHIVNADWLWTCAYRWERVDEQLY 562


>gi|357601986|gb|EHJ63229.1| putative RNA polymerase II subunit A C-terminal domain phosphatase
           [Danaus plexippus]
          Length = 683

 Score = 81.6 bits (200), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 59/175 (33%), Positives = 92/175 (52%), Gaps = 26/175 (14%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLF-RFP- 728
           E+  ++   RKL L++DLD TL+++     + P   ++L             H F R P 
Sbjct: 134 EDADRLLKDRKLVLLVDLDQTLVHTTN-DNIPPNIKDVL-------------HFFLRGPG 179

Query: 729 HMGMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG 786
           + G W  T+LRP    FLE A+K +E+H+ T G + YA  + ++LDP+   F+ R++SR 
Sbjct: 180 NQGRWCHTRLRPKTHEFLESAAKNYELHVCTFGARQYAHAITELLDPQKKFFSHRILSR- 238

Query: 787 DDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
              + FD      KS +L+ +    ++ V IIDD   VW H   NLI V  Y++F
Sbjct: 239 --DECFDAR---TKSANLKALFPCGDNMVCIIDDREDVWRHAS-NLIQVRPYSFF 287



 Score = 51.2 bits (121), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 35/134 (26%), Positives = 65/134 (48%), Gaps = 14/134 (10%)

Query: 874 VIERLHKIFFS-HQSLDDV-------DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHL 925
           +++R+H  F+  +  +++        D++ I+   + ++LAG  +VFS + P  +     
Sbjct: 395 ILKRIHNHFYDIYDKMENSENEKSIPDLKYIIPEVKSQVLAGSSLVFSGLVPTHQ-RLET 453

Query: 926 HPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGR-----FVVHPGWVEASA 980
              +Q A+  GA  T+   D+ TH+VA   GT KVN +   G       VV P W+   A
Sbjct: 454 SRAYQVAKTLGAEVTQDFTDKTTHLVAMRAGTAKVNASKKLGEDKSKIHVVTPEWLWTCA 513

Query: 981 LLYRRANEQDFAIK 994
             + R  E+ + ++
Sbjct: 514 ERWERVEEKLYPLQ 527


>gi|327358124|gb|EGE86981.1| RNA Polymerase II CTD phosphatase Fcp1 [Ajellomyces dermatitidis
           ATCC 18188]
          Length = 839

 Score = 81.6 bits (200), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 64/201 (31%), Positives = 96/201 (47%), Gaps = 34/201 (16%)

Query: 656 DQQKAAIQKERTRRLEEQKKMFSARKLCL--VLDLDHTLLNSAKFHEVDPVHDEILRKKE 713
           D     + +    R+EE  K        L  V+DLD T++++     VDP   E      
Sbjct: 31  DNSALTVSENEATRVEEDAKRRLLSSRRLSLVVDLDQTIIHAT----VDPTVAEW----- 81

Query: 714 EQDREKPHRHLFR----------FPHM-GMW--TKLRPGIWTFLERASKLFEMHLYTMGN 760
           +QD++ P+    +           P M G W   KLRPG+  FL   S LFE+H+YTMG 
Sbjct: 82  QQDKDNPNHEAVKDVRAFQLVDDGPGMRGCWYYIKLRPGLEEFLREISTLFELHIYTMGT 141

Query: 761 KLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDD 819
           + YA  +A ++DP   +F  R++SR + G           +K+L+ +  +++  VVIIDD
Sbjct: 142 RAYAQHIANIVDPDRKIFGDRILSRDESGS--------LTAKNLQRLFPVDTKMVVIIDD 193

Query: 820 SVRVWPHNKLNLIVVERYTYF 840
              VW     NLI V  Y +F
Sbjct: 194 RGDVWKWTD-NLIKVLPYDFF 213



 Score = 41.6 bits (96), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 30/104 (28%), Positives = 50/104 (48%), Gaps = 4/104 (3%)

Query: 892 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLH-PLWQTAEQFGAVCTKHIDDQVTHV 950
           D++ I+   +R +L    +VFS V P+G    +    LW  A+ FGA  +  I  + TH+
Sbjct: 410 DIKVIMPQIKRPVLNAVVLVFSGVLPLGTDTQNADISLW--AKSFGAAISTKISSRTTHL 467

Query: 951 VANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAI 993
           VA    T KV  A       +V   W+  S   ++R +E+ + +
Sbjct: 468 VAGRNRTAKVREATRYPNIKIVTVQWLLDSITQWKRIDEEPYLV 511


>gi|225556539|gb|EEH04827.1| RNA polymerase II C-terminal domain phosphatase component
           [Ajellomyces capsulatus G186AR]
          Length = 871

 Score = 81.6 bits (200), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 64/201 (31%), Positives = 96/201 (47%), Gaps = 34/201 (16%)

Query: 656 DQQKAAIQKERTRRLEEQKKMFSARKLCL--VLDLDHTLLNSAKFHEVDPVHDEILRKKE 713
           D     + +    R+EE  K        L  V+DLD T++++     VDP   E      
Sbjct: 103 DNSALTVSENEATRVEEDAKRRLLSSRRLSLVVDLDQTIIHAT----VDPTVAEW----- 153

Query: 714 EQDREKPHRHLFR----------FPHM-GMW--TKLRPGIWTFLERASKLFEMHLYTMGN 760
           +QD++ P+    +           P M G W   KLRPG+  FL   S LFE+H+YTMG 
Sbjct: 154 QQDKDNPNHEAVKDVRAFQLVDDGPGMKGCWYYIKLRPGLEEFLRNISTLFELHIYTMGT 213

Query: 761 KLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDD 819
           + YA  +A ++DP   +F  R++SR + G           +K+L+ +  +++  VVIIDD
Sbjct: 214 RAYAQHIASIVDPDRKIFGDRILSRDESGS--------LTAKNLQRLFPVDTKMVVIIDD 265

Query: 820 SVRVWPHNKLNLIVVERYTYF 840
              VW     NLI V  Y +F
Sbjct: 266 RGDVWKWTD-NLIKVVPYDFF 285



 Score = 43.5 bits (101), Expect = 0.58,   Method: Compositional matrix adjust.
 Identities = 30/104 (28%), Positives = 51/104 (49%), Gaps = 4/104 (3%)

Query: 892 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLH-PLWQTAEQFGAVCTKHIDDQVTHV 950
           D++ I+   +R +L    +VFS V P+G    +    LW  A+ FGA+ +  I  + TH+
Sbjct: 482 DIKVIMPQIKRPVLQSVDLVFSGVLPLGTDTQNADISLW--AKSFGAIISHKISSKTTHL 539

Query: 951 VANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAI 993
           VA    T KV  A       +V   W+  S   ++R +E+ + +
Sbjct: 540 VAGRNRTAKVREATRYPNIKIVTVQWLLDSITQWKRLDEEPYLV 583


>gi|444319376|ref|XP_004180345.1| hypothetical protein TBLA_0D03260 [Tetrapisispora blattae CBS 6284]
 gi|387513387|emb|CCH60826.1| hypothetical protein TBLA_0D03260 [Tetrapisispora blattae CBS 6284]
          Length = 768

 Score = 81.6 bits (200), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 60/185 (32%), Positives = 89/185 (48%), Gaps = 42/185 (22%)

Query: 680 RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFR------------F 727
           +KL LV+DLD T+++      VDP   E      + D + P+    +             
Sbjct: 181 KKLILVVDLDQTVIHCG----VDPTIGEW-----KNDPKNPNYETLKDVRSFSLDEEPIL 231

Query: 728 P--HMG---------MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 776
           P  +MG          + K+RPG+  F  + + L+EMH+YTM  + YA E+AK++DP G 
Sbjct: 232 PPSYMGPRPPVRKCWYYVKVRPGLKEFFAKIAPLYEMHIYTMATRAYALEIAKIIDPDGS 291

Query: 777 LFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVE 835
           LF  R++SR ++G            K LE +    +S V+IIDD   VW     NLI V 
Sbjct: 292 LFGSRILSRDENGS--------LTQKSLERLFPTDQSMVIIIDDRGDVWNWCN-NLIKVI 342

Query: 836 RYTYF 840
            Y +F
Sbjct: 343 PYNFF 347



 Score = 48.1 bits (113), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 34/136 (25%), Positives = 60/136 (44%), Gaps = 8/136 (5%)

Query: 864 EDGTLASSLGVIERLHKIFFSHQSLD---DVDVRNILAAEQRKILAGCRIVFSRVFPVG- 919
           +D  L     ++  +H  F+     D    V ++ ++   +  + +G   VFS + P+G 
Sbjct: 485 DDDELFYLKDILTEVHDTFYEQLDEDKNESVSIQTLMPRLKFSVFSGYNFVFSGLIPLGT 544

Query: 920 EANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNW--ALSTGRFVVHPGWVE 977
           +       LW     FGA  T  ID+  THV+  + GT K     A +    VVHP W+ 
Sbjct: 545 DIRRADIVLW--TNMFGANSTADIDENTTHVITKTAGTYKARLAKAFNPEIKVVHPDWIF 602

Query: 978 ASALLYRRANEQDFAI 993
              + ++R  E+ + +
Sbjct: 603 ECLVSWKRMPEKPYEL 618


>gi|194757423|ref|XP_001960964.1| GF11242 [Drosophila ananassae]
 gi|190622262|gb|EDV37786.1| GF11242 [Drosophila ananassae]
          Length = 854

 Score = 81.6 bits (200), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 56/173 (32%), Positives = 92/173 (53%), Gaps = 24/173 (13%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 730
           ++ +++ + RKL L++DLD T++++      D V + I          K   H   +   
Sbjct: 191 DDTRRLLADRKLVLLVDLDQTVIHTTN----DTVPENI----------KGIYHFQLYGPQ 236

Query: 731 GMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD 788
             W  T+LRPG   FLE  S+L+E+H+ T G + YA  +A++LDP G  F+ R++SR   
Sbjct: 237 SPWYHTRLRPGTAEFLESMSQLYELHICTFGARNYAHMIAQLLDPDGKFFSHRILSR--- 293

Query: 789 GDPFDGDERVPKSKDLEGVL-GMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
            + F+      K+ +L+ +    +S V IIDD   VW +   NLI V+ Y +F
Sbjct: 294 DECFNA---TSKTDNLKALFPNGDSMVCIIDDREDVW-NMASNLIQVKPYHFF 342



 Score = 39.7 bits (91), Expect = 8.3,   Method: Compositional matrix adjust.
 Identities = 30/121 (24%), Positives = 59/121 (48%), Gaps = 4/121 (3%)

Query: 874 VIERLHKIFFS--HQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQT 931
           ++  +HK F++   ++++  D++ I+   + ++L    +VFS + P  +        +  
Sbjct: 534 ILRNIHKRFYAIYDETMEIPDLKIIVPKIRCEVLRNQNLVFSGLVPT-QMKLEQSRAYFI 592

Query: 932 AEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQD 990
           A+  GA    +I  + TH+VA + GT KVN A       VV+  W+ + A  +    E+ 
Sbjct: 593 AKSLGAEVQGNIGKETTHLVAVNAGTYKVNAAKKEANIKVVNANWLWSCAERWEHVEEKL 652

Query: 991 F 991
           F
Sbjct: 653 F 653


>gi|405966173|gb|EKC31485.1| RNA polymerase II subunit A C-terminal domain phosphatase
           [Crassostrea gigas]
          Length = 837

 Score = 81.6 bits (200), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 56/173 (32%), Positives = 92/173 (53%), Gaps = 25/173 (14%)

Query: 672 EQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMG 731
           ++ ++   RKL L++DLD TL+++   + + P   ++              + F+  H  
Sbjct: 138 DEDRLLRTRKLVLLVDLDQTLIHTTNDN-IPPNLKDV--------------YHFQLSHGN 182

Query: 732 M--W--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD 787
           M  W  T++RP    FLE  SKL+E+H+ T G+++YA  +AK LDP G  F+ R++SR  
Sbjct: 183 MMPWYHTRIRPRTEKFLENVSKLYELHICTFGSRMYAHIIAKFLDPDGKYFSHRILSR-- 240

Query: 788 DGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
             + F+ + ++   K L      +S V IIDD   VW  +  NLI V+ Y +F
Sbjct: 241 -DECFNQNSKMANLKAL--FPCGDSMVCIIDDREDVWNFSP-NLIHVKPYRFF 289


>gi|261194090|ref|XP_002623450.1| RNA Polymerase II CTD phosphatase Fcp1 [Ajellomyces dermatitidis
           SLH14081]
 gi|239588464|gb|EEQ71107.1| RNA Polymerase II CTD phosphatase Fcp1 [Ajellomyces dermatitidis
           SLH14081]
          Length = 901

 Score = 81.3 bits (199), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 64/201 (31%), Positives = 96/201 (47%), Gaps = 34/201 (16%)

Query: 656 DQQKAAIQKERTRRLEEQKKMFSARKLCL--VLDLDHTLLNSAKFHEVDPVHDEILRKKE 713
           D     + +    R+EE  K        L  V+DLD T++++     VDP   E      
Sbjct: 103 DNSALTVSENEATRVEEDAKRRLLSSRRLSLVVDLDQTIIHAT----VDPTVAEW----- 153

Query: 714 EQDREKPHRHLFR----------FPHM-GMW--TKLRPGIWTFLERASKLFEMHLYTMGN 760
           +QD++ P+    +           P M G W   KLRPG+  FL   S LFE+H+YTMG 
Sbjct: 154 QQDKDNPNHEAVKDVRAFQLVDDGPGMRGCWYYIKLRPGLEEFLREISTLFELHIYTMGT 213

Query: 761 KLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDD 819
           + YA  +A ++DP   +F  R++SR + G           +K+L+ +  +++  VVIIDD
Sbjct: 214 RAYAQHIANIVDPDRKIFGDRILSRDESGS--------LTAKNLQRLFPVDTKMVVIIDD 265

Query: 820 SVRVWPHNKLNLIVVERYTYF 840
              VW     NLI V  Y +F
Sbjct: 266 RGDVWKWTD-NLIKVLPYDFF 285



 Score = 42.0 bits (97), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 30/104 (28%), Positives = 50/104 (48%), Gaps = 4/104 (3%)

Query: 892 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLH-PLWQTAEQFGAVCTKHIDDQVTHV 950
           D++ I+   +R +L    +VFS V P+G    +    LW  A+ FGA  +  I  + TH+
Sbjct: 482 DIKVIMPQIKRPVLNAVVLVFSGVLPLGTDTQNADISLW--AKSFGAAISTKISSRTTHL 539

Query: 951 VANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAI 993
           VA    T KV  A       +V   W+  S   ++R +E+ + +
Sbjct: 540 VAGRNRTAKVREATRYPNIKIVTVQWLLDSITQWKRVDEEPYLV 583


>gi|239606973|gb|EEQ83960.1| RNA Polymerase II CTD phosphatase Fcp1 [Ajellomyces dermatitidis
           ER-3]
          Length = 901

 Score = 81.3 bits (199), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 64/201 (31%), Positives = 96/201 (47%), Gaps = 34/201 (16%)

Query: 656 DQQKAAIQKERTRRLEEQKKMFSARKLCL--VLDLDHTLLNSAKFHEVDPVHDEILRKKE 713
           D     + +    R+EE  K        L  V+DLD T++++     VDP   E      
Sbjct: 103 DNSALTVSENEATRVEEDAKRRLLSSRRLSLVVDLDQTIIHAT----VDPTVAEW----- 153

Query: 714 EQDREKPHRHLFR----------FPHM-GMW--TKLRPGIWTFLERASKLFEMHLYTMGN 760
           +QD++ P+    +           P M G W   KLRPG+  FL   S LFE+H+YTMG 
Sbjct: 154 QQDKDNPNHEAVKDVRAFQLVDDGPGMRGCWYYIKLRPGLEEFLREISTLFELHIYTMGT 213

Query: 761 KLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDD 819
           + YA  +A ++DP   +F  R++SR + G           +K+L+ +  +++  VVIIDD
Sbjct: 214 RAYAQHIANIVDPDRKIFGDRILSRDESGS--------LTAKNLQRLFPVDTKMVVIIDD 265

Query: 820 SVRVWPHNKLNLIVVERYTYF 840
              VW     NLI V  Y +F
Sbjct: 266 RGDVWKWTD-NLIKVLPYDFF 285



 Score = 41.6 bits (96), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 30/104 (28%), Positives = 50/104 (48%), Gaps = 4/104 (3%)

Query: 892 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLH-PLWQTAEQFGAVCTKHIDDQVTHV 950
           D++ I+   +R +L    +VFS V P+G    +    LW  A+ FGA  +  I  + TH+
Sbjct: 482 DIKVIMPQIKRPVLNAVVLVFSGVLPLGTDTQNADISLW--AKSFGAAISTKISSRTTHL 539

Query: 951 VANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAI 993
           VA    T KV  A       +V   W+  S   ++R +E+ + +
Sbjct: 540 VAGRNRTAKVREATRYPNIKIVTVQWLLDSITQWKRIDEEPYLV 583


>gi|154284394|ref|XP_001542992.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
 gi|150406633|gb|EDN02174.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
          Length = 654

 Score = 81.3 bits (199), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 58/170 (34%), Positives = 87/170 (51%), Gaps = 32/170 (18%)

Query: 685 VLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFR----------FPHM-GMW 733
           V+DLD T++++     VDP   E      +QD++ P+    +           P M G W
Sbjct: 89  VVDLDQTIIHAT----VDPTVAEW-----QQDKDNPNHEAVKDVRAFQLVDDGPGMKGCW 139

Query: 734 --TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 791
              KLRPG+  FL   S LFE+H+YTMG + YA  +A ++DP   +F  R++SR + G  
Sbjct: 140 YYIKLRPGLEEFLRNISTLFELHIYTMGTRAYAQHIASIVDPDRKIFGDRILSRDESGS- 198

Query: 792 FDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERYTYF 840
                    +K+L+ +  +++  VVIIDD   VW     NLI V  Y +F
Sbjct: 199 -------LTAKNLQRLFPVDTKMVVIIDDRGDVWKWTD-NLIKVVPYDFF 240



 Score = 43.5 bits (101), Expect = 0.55,   Method: Compositional matrix adjust.
 Identities = 30/104 (28%), Positives = 52/104 (50%), Gaps = 4/104 (3%)

Query: 892 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLH-PLWQTAEQFGAVCTKHIDDQVTHV 950
           D++ I+   +R +L    +VFS V P+G    +    LW  A+ FGA+ ++ I  + TH+
Sbjct: 437 DIKVIMPQIKRPVLQSVDLVFSGVLPLGTDTQNADISLW--AKSFGAIISQKISSKTTHL 494

Query: 951 VANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAI 993
           VA    T KV  A       +V   W+  S   ++R +E+ + +
Sbjct: 495 VAGRNRTAKVREATRYPNIKIVTVQWLLDSITQWKRLDEEPYLV 538


>gi|296810642|ref|XP_002845659.1| RNA polymerase II subunit A C-terminal domain phosphatase
           [Arthroderma otae CBS 113480]
 gi|238843047|gb|EEQ32709.1| RNA polymerase II subunit A C-terminal domain phosphatase
           [Arthroderma otae CBS 113480]
          Length = 832

 Score = 81.3 bits (199), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 58/170 (34%), Positives = 90/170 (52%), Gaps = 32/170 (18%)

Query: 685 VLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR------HLFRF----PHM-GMW 733
           V+DLD T++++     VDP   E      +QD++ P+         F+     P M G W
Sbjct: 134 VVDLDQTIIHAT----VDPTVAEW-----QQDKDNPNHDAVKDVRCFQLVDDGPGMRGCW 184

Query: 734 --TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 791
              KLRPG+  FL+  S L+E+H+YTMG + YA  +A ++DP   +F  R++SR + G  
Sbjct: 185 YYIKLRPGLEEFLKVVSSLYELHIYTMGTRAYAQNVANIVDPDRKIFGDRILSRDESGS- 243

Query: 792 FDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERYTYF 840
                    +K+L  +  +++  VVIIDD   VW  ++ NLI V  Y +F
Sbjct: 244 -------LTAKNLHRLFPVDTKMVVIIDDRGDVWKWSE-NLIKVTPYDFF 285



 Score = 42.0 bits (97), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 30/104 (28%), Positives = 50/104 (48%), Gaps = 4/104 (3%)

Query: 892 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLH-PLWQTAEQFGAVCTKHIDDQVTHV 950
           DV+ ++   +R  L    +VFS V P+G    +    LW  A+ FGA  T  I+ + TH+
Sbjct: 491 DVKVLMPEIKRAALESVVLVFSGVLPLGTDTQNADISLW--AKSFGATITSKINSRTTHL 548

Query: 951 VANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAI 993
           VA    T KV  A    +  +V   W+      ++R +E+ + +
Sbjct: 549 VAGRNRTAKVREATRYPKVKIVTVQWLVDCMTQWKRLDEEPYLV 592


>gi|58271496|ref|XP_572904.1| protein phosphatase [Cryptococcus neoformans var. neoformans JEC21]
 gi|134115316|ref|XP_773956.1| hypothetical protein CNBH4080 [Cryptococcus neoformans var.
           neoformans B-3501A]
 gi|50256584|gb|EAL19309.1| hypothetical protein CNBH4080 [Cryptococcus neoformans var.
           neoformans B-3501A]
 gi|57229163|gb|AAW45597.1| protein phosphatase, putative [Cryptococcus neoformans var.
           neoformans JEC21]
          Length = 955

 Score = 80.9 bits (198), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 47/113 (41%), Positives = 67/113 (59%), Gaps = 12/113 (10%)

Query: 731 GMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD 788
           G W  TK RPG+  FL+   +L+EMH+YTMG + YA  + KV+DP G +F GR++SR + 
Sbjct: 303 GRWYFTKPRPGLQRFLDEMCQLYEMHVYTMGTRTYADAIVKVIDPDGKIFGGRILSRDES 362

Query: 789 GDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           G  F        SK+L+ +   + S VV+IDD   VW  +  NL+ V  Y +F
Sbjct: 363 G-SF-------SSKNLKRLFPTDTSMVVVIDDRSDVW-GDCPNLVKVVPYDFF 406



 Score = 67.0 bits (162), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 42/124 (33%), Positives = 60/124 (48%), Gaps = 11/124 (8%)

Query: 877 RLHKIFFSHQSLD-------DVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLW 929
           R +K F +    D         DV  I+   + ++L GC +VFS + P  EANP    +W
Sbjct: 616 RFYKAFDAIDGWDPKKALPMSCDVEFIIPEMKAEVLDGCNLVFSGMIPR-EANPSTTAIW 674

Query: 930 QTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALST--GRFVVHPGWVEASALLYRRAN 987
           QTAE FGA+ T  +  + TH+V   L T+K  W      G  VV   W   S  L+ R +
Sbjct: 675 QTAESFGALITPSLTPRTTHLVTALLNTEKT-WRAGKMEGVKVVWAEWFWDSVALWERQD 733

Query: 988 EQDF 991
           E+ +
Sbjct: 734 EEKY 737


>gi|254568460|ref|XP_002491340.1| hypothetical protein [Komagataella pastoris GS115]
 gi|238031137|emb|CAY69060.1| hypothetical protein PAS_chr2-1_0845 [Komagataella pastoris GS115]
 gi|328352145|emb|CCA38544.1| hypothetical protein PP7435_Chr2-0862 [Komagataella pastoris CBS
           7435]
          Length = 733

 Score = 80.9 bits (198), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 67/210 (31%), Positives = 98/210 (46%), Gaps = 46/210 (21%)

Query: 674 KKMFSARKLCLVLDLDHTLL---------------NSAKFHEVDPVHDEILRKK----EE 714
           K++   +KL LV+DLD T++               N+A +  V  V    L+++    E 
Sbjct: 165 KRLLKEKKLSLVVDLDQTVIHATVDPTVGEWMKDPNNANYPAVKDVRSFSLKEEVILPEN 224

Query: 715 QDREKPHRHLFRFPHMGMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLD 772
              +KP       P    W   KLRP +  FLE  S+ +E+H+YTM  + YA E+AK++D
Sbjct: 225 YVGQKP-------PATVCWYYVKLRPHLREFLEHVSERYELHIYTMATRQYAKEIAKIID 277

Query: 773 PKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNL 831
           P    F  R++SR + G            K L+ +  ++ S VV+IDD   VW  +  NL
Sbjct: 278 PDEKYFGDRILSRDESGS--------LTQKSLQRLFPVDTSMVVVIDDRGDVWNWSS-NL 328

Query: 832 IVVERYTYF--------PCSRRQFGLLGPS 853
           I V  Y +F            RQ  LLGPS
Sbjct: 329 IKVVPYDFFVGIGDINSSFLPRQHALLGPS 358



 Score = 48.1 bits (113), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 42/175 (24%), Positives = 77/175 (44%), Gaps = 12/175 (6%)

Query: 830 NLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERS----EDGTLASSLGVIERLHKIFFSH 885
           NL + ++ +  P ++ Q  L      E DH+  +    +D  L +    + R+H  F+  
Sbjct: 447 NLEIEQQQSERPLAKLQENLEKIVQTESDHESANLLFDDDNELETLEQALIRIHNEFYFE 506

Query: 886 QSLDDV---DVRNILAAEQRKILAGCRIVFSRVFPVG-EANPHLHPLWQTAEQFGAVCTK 941
                V   DV++IL + ++ +      + S + P+G + N     +W  A  FGA    
Sbjct: 507 YDDSKVGNPDVKDILNSMKQLVFKEYTFLLSGILPLGTKLNSADIVIW--ARSFGATVVA 564

Query: 942 HIDDQVTHVVANSLGTDKVNWA--LSTGRFVVHPGWVEASALLYRRANEQDFAIK 994
             D  VTHV+  + GT KV  A  L     +V P W+      + + +E ++ ++
Sbjct: 565 DYDKSVTHVITRNTGTFKVKLAKTLDPNVKIVDPNWLFKCISFWDKVDEDEYLLE 619


>gi|294658166|ref|XP_460501.2| DEHA2F03102p [Debaryomyces hansenii CBS767]
 gi|202952923|emb|CAG88814.2| DEHA2F03102p [Debaryomyces hansenii CBS767]
          Length = 795

 Score = 80.9 bits (198), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 67/202 (33%), Positives = 93/202 (46%), Gaps = 38/202 (18%)

Query: 658 QKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRK------ 711
           +   I+   T RL  +KK      L LV+DLD T++++     VDP   E          
Sbjct: 148 EATKIEHNTTDRLSREKK------LILVVDLDQTVIHAT----VDPTVGEWQSDPSNPNY 197

Query: 712 ----------KEEQDREKPHRHLFRFPHMGMW--TKLRPGIWTFLERASKLFEMHLYTMG 759
                      EE     P     + P    W   KLRPG+  FL  AS L+EMH+YTM 
Sbjct: 198 PAVKNVRSFCLEEDPIAPPGWTGPKLPPSKCWYYVKLRPGLEEFLRSASDLYEMHIYTMA 257

Query: 760 NKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIID 818
            + YA  +AK++DP+G  F  R++SR + G            K+L+ +  + +S VVIID
Sbjct: 258 TRNYALAIAKIIDPEGEYFGDRILSRDESGS--------LTHKNLKRLFPVDQSMVVIID 309

Query: 819 DSVRVWPHNKLNLIVVERYTYF 840
           D   VW     NLI V  Y +F
Sbjct: 310 DRGDVWQWEN-NLIKVVPYDFF 330


>gi|146421209|ref|XP_001486555.1| hypothetical protein PGUG_02226 [Meyerozyma guilliermondii ATCC
           6260]
          Length = 732

 Score = 80.5 bits (197), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 61/184 (33%), Positives = 95/184 (51%), Gaps = 28/184 (15%)

Query: 674 KKMFSARKLCLVLDLDHTLLN-----SAKFHEVDPVHDEILRKKE------EQDREKPHR 722
           +++ S RKL LV+DLD T+++     +    ++DP++      K+      E+D   P  
Sbjct: 110 ERLTSERKLILVVDLDQTVIHATVDPTVGEWQLDPLNPNYRAVKDVRSFCLEEDPIAPPG 169

Query: 723 HLFRFPHM-----GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 777
             +  P M       + K+RPG+  FL+R S+L+EMH+YTM  + YA  +A ++DP G  
Sbjct: 170 --WSGPKMTPTKCWYYVKVRPGLEDFLKRVSQLYEMHVYTMATRNYALAIAHIIDPDGRY 227

Query: 778 FAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMES-AVVIIDDSVRVWPHNKLNLIVVER 836
           F  R++SR + G            K+L  +  ++   VVIIDD   VW   K NLI V  
Sbjct: 228 FGDRILSRDESGS--------LTHKNLRRLFPVDQLMVVIIDDRGDVWQWEK-NLIKVVP 278

Query: 837 YTYF 840
           Y +F
Sbjct: 279 YEFF 282



 Score = 41.6 bits (96), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 38/145 (26%), Positives = 66/145 (45%), Gaps = 15/145 (10%)

Query: 864 EDGTLASSLGVIERLHKIFF-------SHQSLDDVDVRNILAAEQRKILAGCRIVFSRVF 916
           +D  L S    + R+H  ++         +SL + D+  I+ + + K L G  ++FS + 
Sbjct: 462 DDNELESLNDALVRVHDEYYRIWDSYNEDKSLTNPDLTTIIPSLKSKCLEGIVVLFSGIL 521

Query: 917 PVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVAN--SLG---TDKVNWALST--GRF 969
             G ++P    +    +QFG      +  +VTHV+    S G   T KV  A  T     
Sbjct: 522 RWG-SDPQKADIVIWCQQFGVKVVNEVYPEVTHVICREPSAGGGLTFKVRVAKKTLPNVH 580

Query: 970 VVHPGWVEASALLYRRANEQDFAIK 994
           +V P W+ A    + R +E+ +AI+
Sbjct: 581 IVTPDWLFACMSKWTRVDEKPYAIE 605


>gi|392578708|gb|EIW71836.1| hypothetical protein TREMEDRAFT_67978 [Tremella mesenterica DSM
           1558]
          Length = 944

 Score = 80.5 bits (197), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 44/109 (40%), Positives = 65/109 (59%), Gaps = 10/109 (9%)

Query: 733 WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPF 792
           +TK RPG+  FLE  +KL+EMH+YTMG + YA  +  ++DP+G  F GR++SR D     
Sbjct: 354 FTKPRPGLAKFLEEMNKLYEMHVYTMGTRTYAEAIVGIVDPEGKYFGGRILSRDDS---- 409

Query: 793 DGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
               R   +K+L+ +   + S VV+IDD   VW  +  NL+ V  Y +F
Sbjct: 410 ----RNFTTKNLKRLFPTDTSMVVVIDDRADVW-GDCPNLVKVRPYDFF 453



 Score = 60.5 bits (145), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 43/155 (27%), Positives = 71/155 (45%), Gaps = 21/155 (13%)

Query: 849 LLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFS--HQSLDDVDV------RNILAAE 900
           LL P+  E+D  E+           ++ R+H+ F++   + L DVD+        I+   
Sbjct: 621 LLDPNDDELDRVEK-----------LLRRIHRKFYNAYDRRLSDVDIPLACDCELIIPEI 669

Query: 901 QRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKV 960
           + ++  GC  VFS +    +  P     WQ AE FGA C   +  + TH +  + GT+KV
Sbjct: 670 KAQVFDGCYFVFSGII-ARDVEPETTSHWQWAEMFGARCQPTLTRKTTHCITTNAGTEKV 728

Query: 961 NWALST-GRFVVHPGWVEASALLYRRANEQDFAIK 994
             A    G  +V   W   S  L+ R  E +F+++
Sbjct: 729 YQASKLPGCKIVWVQWFYQSLSLWERQPEDEFSVE 763


>gi|350579777|ref|XP_003122350.3| PREDICTED: RNA polymerase II subunit A C-terminal domain
           phosphatase-like, partial [Sus scrofa]
          Length = 284

 Score = 80.5 bits (197), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 47/131 (35%), Positives = 76/131 (58%), Gaps = 13/131 (9%)

Query: 663 QKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR 722
           Q E+  R E+Q+++   RKL L++DLD TL+++ + H     +  I            H 
Sbjct: 162 QAEKLGR-EDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIF-----------HF 209

Query: 723 HLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 782
            L R   M + T+LRP    FLE+ ++L+E+H++T G++LYA  +A  LDP+  LF+ R+
Sbjct: 210 QLGRGEPM-LHTRLRPHCKEFLEKIAQLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRI 268

Query: 783 ISRGDDGDPFD 793
           +SR +  DPF 
Sbjct: 269 LSRDECIDPFS 279


>gi|399215912|emb|CCF72600.1| unnamed protein product [Babesia microti strain RI]
          Length = 545

 Score = 80.5 bits (197), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 50/172 (29%), Positives = 82/172 (47%), Gaps = 24/172 (13%)

Query: 656 DQQKAAIQKERTRRLEEQK--KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKE 713
           ++   +I     R++EE     +   R LCLVLDLD+TL           +H + L K E
Sbjct: 145 NEASMSISATFVRQMEESNLHSLLIKRLLCLVLDLDNTL-----------IHAKTLDKNE 193

Query: 714 EQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDP 773
             D     + ++      ++ +LRPG+  FL+  SK ++++L+TMG   +AT    +LDP
Sbjct: 194 VLDSNDDFKAIYFGGRCNLY-RLRPGVSEFLDAMSKYYQLYLFTMGTSEHATAALSLLDP 252

Query: 774 KGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVL-GMESAVVIIDDSVRVW 824
           +G LF+ R+ SR D  +           K L  +    +  V ++DD    W
Sbjct: 253 QGKLFSNRIFSRSDSQNS---------RKTLSRIFPNYQGIVCVVDDCEHAW 295


>gi|299470348|emb|CBN78397.1| Similar to RNA Polymerase II CTD phosphatase Fcp1, putative
           [Ectocarpus siliculosus]
          Length = 985

 Score = 80.1 bits (196), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 64/213 (30%), Positives = 96/213 (45%), Gaps = 30/213 (14%)

Query: 641 QSAWGDVEHLFEGYDDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHE 700
           +   GD   +            +  R   + +  ++ +++KL LVLDLD+TLL+ +   +
Sbjct: 219 EEGGGDTHQVLMKGGKMMSVTAEGRRMMHMNKSGRLLNSKKLSLVLDLDNTLLHCSDHPD 278

Query: 701 ----VDPVHDEILRKKEEQDREKPHRHLFRFPHMG--MWTKLRPGIWTFLERASKLFEMH 754
               V P  D I              H  R P+     + KLRPG+  FL +A+ +FEM 
Sbjct: 279 AGRVVVPGVDGI--------------HALRLPNQQREYYIKLRPGLRRFLAQAATMFEMT 324

Query: 755 LYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVL--GMES 812
           +YT G   YA  +A VLDP   LF GR  S     D          +K LE +   G++ 
Sbjct: 325 IYTAGTSQYADAVASVLDPDRSLFQGRHFSTCYTPDLGR------NTKSLERIFPNGLDM 378

Query: 813 AVVIIDDSVRVWPHNKL-NLIVVERYTYFPCSR 844
           A +I+DD   VW   +  NL++V  Y +F   R
Sbjct: 379 A-LIVDDRDDVWRGEQAKNLLLVRPYKFFVGQR 410



 Score = 72.8 bits (177), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 48/135 (35%), Positives = 64/135 (47%), Gaps = 6/135 (4%)

Query: 865 DGTLASSLGVIERLHKIFFSHQSLDDVDVR---NILAAEQRKILAGCRIVFSRVFPVGEA 921
           D  L  +   +E +H  F++ ++ +    R     LA  + ++L G R+VFS V PV  A
Sbjct: 682 DPQLDCTFKTLEAVHGAFYAPENSNHGQPRAAAGFLAKVRLRVLTGVRMVFSGVIPVSGA 741

Query: 922 --NPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALST-GRFVVHPGWVEA 978
             +P  H LW  AE  GA   + I    THVVA  LGT K    L   G FVVH  W+  
Sbjct: 742 PADPRTHRLWMMAESHGATVERDIGRHTTHVVAVRLGTAKTKTGLRMPGVFVVHLDWLMN 801

Query: 979 SALLYRRANEQDFAI 993
           S    RR  E  F +
Sbjct: 802 SVWHCRRERETMFLL 816


>gi|365758888|gb|EHN00710.1| Fcp1p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
          Length = 677

 Score = 80.1 bits (196), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 70/246 (28%), Positives = 102/246 (41%), Gaps = 57/246 (23%)

Query: 625 TGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKAAIQKERTRRLEEQKKMFSARKLCL 684
           T  G    A      P    GD++      +  +     KE  RR          +KL L
Sbjct: 77  TQCGKEVSADAFDGVPLDVVGDMDLQISETEAIRSGEALKEHLRR---------DKKLIL 127

Query: 685 VLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFR------------FPHMGM 732
           V+DLD T+++      VDP   E      + D   P+    R             P M M
Sbjct: 128 VVDLDQTIIHCG----VDPTIAEW-----KNDPNNPNFETLRDVKSFTLDEELVLPLMYM 178

Query: 733 -----------------WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKG 775
                            + K+RPG+  F ++ + LFEMH+YTM  + YA ++AK++DP G
Sbjct: 179 NEDGSVLKPPPVRKCWYYVKVRPGLKEFFDKVAPLFEMHIYTMATRAYAIQIAKIVDPTG 238

Query: 776 VLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVV 834
            LF  R++SR ++G           +K L  +    +S VV+IDD   VW     NLI V
Sbjct: 239 ELFGDRILSRDENGS--------LTTKSLAKLFPTDQSMVVVIDDRGDVWNWCP-NLIKV 289

Query: 835 ERYTYF 840
             Y +F
Sbjct: 290 VPYNFF 295



 Score = 48.9 bits (115), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 32/147 (21%), Positives = 69/147 (46%), Gaps = 17/147 (11%)

Query: 859 HDER---SEDGTLASSLGVIERLHKIFF---SHQSLDDVDVRNILAAEQRKILAGCRIVF 912
           HD++    +D  L   +G +  +H  ++   S  +  + ++  I+ + ++K+   C  VF
Sbjct: 402 HDQKLLVDDDDELYYLMGTLSNIHNTYYRMISQSNEPEPNLMEIIPSLKQKVFQNCHFVF 461

Query: 913 SRVFPVG-EANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-- 969
           S + P+G +       +W +   FGA  T  ID   TH++  +  T K   A    +F  
Sbjct: 462 SGLIPLGTDIQRSDIVIWTST--FGATSTSDIDYLTTHLITKNPSTYKARLA---KKFNP 516

Query: 970 ---VVHPGWVEASALLYRRANEQDFAI 993
              ++HP W+    + +++ +E+ + +
Sbjct: 517 EIKIIHPDWIFECLVNWKKVDEKPYTL 543


>gi|378756636|gb|EHY66660.1| hypothetical protein NERG_00300 [Nematocida sp. 1 ERTm2]
          Length = 507

 Score = 79.7 bits (195), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 45/110 (40%), Positives = 62/110 (56%), Gaps = 10/110 (9%)

Query: 732 MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 791
            + KLR  +  FL+ A K  EMH+YTMGNK YAT + K+LDP G LF  R+I+R D+   
Sbjct: 203 YYVKLRDRLEWFLKEAEKYCEMHIYTMGNKAYATAIVKILDPTGKLFGSRIITRDDNFGC 262

Query: 792 FDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERYTYF 840
           FD        KD++ +    S  V+I+DD   VW     NL  ++ Y +F
Sbjct: 263 FD--------KDIKRLFPTNSKHVIILDDRPDVWGFVD-NLYPIKPYYFF 303


>gi|388853856|emb|CCF52577.1| related to FCP1-TFIIF interacting component of CTD phosphatase
           [Ustilago hordei]
          Length = 471

 Score = 79.7 bits (195), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 61/209 (29%), Positives = 97/209 (46%), Gaps = 43/209 (20%)

Query: 662 IQKERTRRL--EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREK 719
           +  +  +RL  E    + S RKL LV+DLD T++++A    VDP   E +  +   + E 
Sbjct: 9   VSAKEAQRLGPETTTHLLSQRKLALVVDLDQTIIHTA----VDPTVGEWMEDESNPNYEA 64

Query: 720 PHR-HLFRFPHMG--------------------------MWTKLRPGIWTFLERASKLFE 752
                 FR    G                           + KLRPG+   L++ S+ ++
Sbjct: 65  LKSVAKFRLGIGGEEIKDDDDPPAPKDSAAALKASRACWYYVKLRPGVPEILKKLSEKYQ 124

Query: 753 MHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGME- 811
           +H+YTMG + YA  + K++DP   +F  R++SR ++G            K L+ +  M+ 
Sbjct: 125 LHVYTMGTRSYANLVCKLIDPDASIFGNRIVSRNENGSLV--------RKSLDKLFPMDH 176

Query: 812 SAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           S VVIIDD   VW  +  NL+ V  Y +F
Sbjct: 177 SMVVIIDDREDVWSKSP-NLLQVVPYEFF 204



 Score = 43.5 bits (101), Expect = 0.55,   Method: Compositional matrix adjust.
 Identities = 31/101 (30%), Positives = 45/101 (44%), Gaps = 14/101 (13%)

Query: 893 VRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAV-CTKHIDDQVTHVV 951
           V + ++A + K+LAGC IVF          P    LW  A +FGA    K      THV+
Sbjct: 362 VVDFISARKTKVLAGCTIVF---------YPETSELWALAREFGATPAFKEEAGVTTHVI 412

Query: 952 A---NSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANE 988
           +   + +GT K       G   +V P W + S   + R +E
Sbjct: 413 SALQDDIGTVKATRLARAGMVELVWPSWFDISTSRWERQDE 453


>gi|50294127|ref|XP_449475.1| hypothetical protein [Candida glabrata CBS 138]
 gi|49528789|emb|CAG62451.1| unnamed protein product [Candida glabrata]
          Length = 758

 Score = 79.7 bits (195), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 62/197 (31%), Positives = 96/197 (48%), Gaps = 36/197 (18%)

Query: 662 IQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSA--------KFHEVDPVHDEILRKKE 713
           + K+ T RL+ +KK      L LV+DLD T+++          K    +P ++ +   K 
Sbjct: 154 LDKQITTRLKNEKK------LVLVVDLDQTVIHCGVDPTIGEWKADPSNPNYETLKDVKC 207

Query: 714 EQDREKPHRHLFRFPHMG---------MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYA 764
               E+P   +    +MG          + K+RPG+  F E+ + L+EMH+YTM  + YA
Sbjct: 208 FSLEEEP---ILPLIYMGPKPPVRTCWYYVKIRPGLKEFFEKIAPLYEMHIYTMATRAYA 264

Query: 765 TEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRV 823
            E+AK++DP   LF  R++SR ++G            K L  +    +S VV+IDD   V
Sbjct: 265 LEIAKIIDPDKSLFGDRILSRDENGS--------LTQKSLTRLFPTDQSMVVVIDDRGDV 316

Query: 824 WPHNKLNLIVVERYTYF 840
           W     NLI V  Y +F
Sbjct: 317 WNWCP-NLIKVVPYNFF 332



 Score = 54.7 bits (130), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 39/144 (27%), Positives = 65/144 (45%), Gaps = 10/144 (6%)

Query: 859 HDER---SEDGTLASSLGVIERLHKIFF----SHQSLDDVDVRNILAAEQRKILAGCRIV 911
           HD+R    +D  L      +  +HK F+         ++ D+  +L   ++K+   C  V
Sbjct: 439 HDQRLLIDDDDELFYLKDTLAYIHKEFYHLLEDASDQNEADITMLLPKLKKKVFKECHFV 498

Query: 912 FSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNW--ALSTGRF 969
           FS + P+G        +  T+  FGA  T  ID+  THVV  + GT K     A +    
Sbjct: 499 FSGLIPLGTDIKRADIVIWTS-LFGAESTSDIDEHTTHVVTKTPGTYKARLAKAFNPDIK 557

Query: 970 VVHPGWVEASALLYRRANEQDFAI 993
           VVHP W+      ++R +E+ + +
Sbjct: 558 VVHPDWIFECLASWKRIDEKPYLL 581


>gi|255732778|ref|XP_002551312.1| hypothetical protein CTRG_05610 [Candida tropicalis MYA-3404]
 gi|240131053|gb|EER30614.1| hypothetical protein CTRG_05610 [Candida tropicalis MYA-3404]
          Length = 818

 Score = 79.7 bits (195), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 65/207 (31%), Positives = 97/207 (46%), Gaps = 48/207 (23%)

Query: 658 QKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDR 717
           + A I+   T RL ++KK      L LV+DLD T++++     VDP   E      + D 
Sbjct: 148 EAAKIEHSTTDRLIDEKK------LILVVDLDQTVIHAT----VDPTVGEW-----QSDP 192

Query: 718 EKPHRHLFR------------------FPHMG-----MWTKLRPGIWTFLERASKLFEMH 754
             P+    +                   P +       + KLRPG+  FLER S+ +EMH
Sbjct: 193 SNPNYRAVKDVRSFCLEEQPIVPPGWTGPKLAPTKCTYYVKLRPGLSEFLERMSEKYEMH 252

Query: 755 LYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESA 813
           +YTM  + YA  +AK++DP+G  F  R++SR + G            K+L+ +  + +S 
Sbjct: 253 IYTMATRNYALAIAKIIDPEGKYFGDRILSRDESGS--------LTHKNLKRLFPVDQSM 304

Query: 814 VVIIDDSVRVWPHNKLNLIVVERYTYF 840
           V IIDD   VW     NLI V  Y +F
Sbjct: 305 VAIIDDRGDVWQWES-NLIKVVPYDFF 330



 Score = 41.2 bits (95), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 31/141 (21%), Positives = 63/141 (44%), Gaps = 10/141 (7%)

Query: 862 RSEDGTLASSLGVIERLHKIFFSHQSLDDV-DVRNILAAEQRKILAGCRIVFSRVFPVGE 920
           R +D  L +   V+  +H+ ++     ++  D+  I+   + K+L G  ++FS + P+G 
Sbjct: 500 RDDDNELVALDKVLVNIHEEYYKRYDKENKPDLTEIIPTMKSKVLDGITVLFSGIIPLG- 558

Query: 921 ANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTD-------KVNWALSTGRF-VVH 972
            N     +    +QFG      +   +THVV   +G D       +V   L      +V+
Sbjct: 559 INLDSADIVIWCKQFGVKVVNEVYPDITHVVCRDIGEDIGPTFKARVARKLYPDTVKIVN 618

Query: 973 PGWVEASALLYRRANEQDFAI 993
           P W+ A    +   +E+++ +
Sbjct: 619 PDWLFACLSNWTIVDEKEYLV 639


>gi|344301528|gb|EGW31840.1| hypothetical protein SPAPADRAFT_140004 [Spathaspora passalidarum
           NRRL Y-27907]
          Length = 770

 Score = 79.7 bits (195), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 65/202 (32%), Positives = 95/202 (47%), Gaps = 38/202 (18%)

Query: 658 QKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDP-------------- 703
           +   I++  T RL E+KK      L LV+DLD T++++     VDP              
Sbjct: 155 EATKIEQSTTDRLTEEKK------LILVVDLDQTVIHAT----VDPTVGEWQSDPSNPNY 204

Query: 704 --VHDEILRKKEEQDREKPHRHLFRFPHMGMW--TKLRPGIWTFLERASKLFEMHLYTMG 759
             V D      EE     P+    +      W   K+RPG+  FLE+ S  +EMH+YTM 
Sbjct: 205 PAVKDVKSFCLEEDPITPPNWTGPKLAPTKCWYYVKVRPGLAEFLEQVSNKYEMHIYTMA 264

Query: 760 NKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIID 818
            + YA  +A ++DP+G  F  R++SR + G            K+L+ +  + +S VVIID
Sbjct: 265 TRNYALAIANIIDPEGKYFGDRILSRDESGS--------LTHKNLKRLFPVDQSMVVIID 316

Query: 819 DSVRVWPHNKLNLIVVERYTYF 840
           D   VW     NLI V  Y +F
Sbjct: 317 DRGDVWQWES-NLIKVVPYDFF 337



 Score = 40.4 bits (93), Expect = 5.6,   Method: Compositional matrix adjust.
 Identities = 32/139 (23%), Positives = 61/139 (43%), Gaps = 11/139 (7%)

Query: 864 EDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVG-EAN 922
           +D  L S   V+  +H  ++   +    D+  I+ + + K L G  ++FS + P+G   +
Sbjct: 481 DDNELNSLNKVLANIHDAYYKQANSSKPDLTEIIPSLKSKCLEGITLLFSGIIPLGVPLD 540

Query: 923 PHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLG-----TDKVNWA---LSTGRFVVHPG 974
                +W  A+QFG      +  +VTHV+   +      T K   A         +V+P 
Sbjct: 541 SADIVIW--AKQFGVKVVNEVYPEVTHVICRDITPESGPTFKARAAHHLYPDTIKIVNPD 598

Query: 975 WVEASALLYRRANEQDFAI 993
           W+ A    + R +E ++ +
Sbjct: 599 WLFACLSTWSRVDETEYLL 617


>gi|405122085|gb|AFR96852.1| hypothetical protein CNAG_04120 [Cryptococcus neoformans var.
           grubii H99]
          Length = 921

 Score = 79.7 bits (195), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 48/115 (41%), Positives = 67/115 (58%), Gaps = 17/115 (14%)

Query: 731 GMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD 788
           G W  TK RPG+  FL+  S+L+EMH+YTMG + YA  + KV+DP G +F GR++SR + 
Sbjct: 283 GRWYFTKPRPGLQKFLDEMSQLYEMHVYTMGTRTYADAIVKVIDPDGKIFGGRILSRDES 342

Query: 789 GDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYTYFPC 842
           G  F        SK+L+ +   + S VV+IDD   VW  +  NL+ V      PC
Sbjct: 343 G-SF-------SSKNLKRLFPTDTSMVVVIDDRSDVW-GDCPNLVKV-----VPC 383



 Score = 68.6 bits (166), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 44/140 (31%), Positives = 67/140 (47%), Gaps = 15/140 (10%)

Query: 865 DGTLASSLGVIERLHKIFF-SHQSLD----------DVDVRNILAAEQRKILAGCRIVFS 913
           D  L     +++ +H  F+ +  +LD            DV  I+   + ++L GC +VFS
Sbjct: 564 DYELVRVANILQEIHSRFYKAFDALDGWNPKKALPMSCDVEFIIPEMKAEVLDGCSLVFS 623

Query: 914 RVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALST--GRFVV 971
            + P  E+NP    +WQTAE FGA+ T  +  + TH+V   L T+K  W      G  VV
Sbjct: 624 GMIPR-ESNPSTTTIWQTAESFGALITPSLTPRTTHLVTALLNTEKT-WRAGKMEGVKVV 681

Query: 972 HPGWVEASALLYRRANEQDF 991
              W   S  L+ R NE+ +
Sbjct: 682 WAEWFWDSVALWERQNEEKY 701



 Score = 40.4 bits (93), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 24/71 (33%), Positives = 38/71 (53%), Gaps = 2/71 (2%)

Query: 651 FEGYDDQQKAAIQKERTRRLEE--QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEI 708
           FE   D     + K   +RLE   +  + S R+L L++DLD T++++     V    DEI
Sbjct: 135 FEIAHDAMGVTVSKNEAQRLENLTRDALLSTRRLSLIVDLDQTIIHTTVDPTVAEWMDEI 194

Query: 709 LRKKEEQDREK 719
            R++ E D+EK
Sbjct: 195 HREESEDDQEK 205


>gi|383859139|ref|XP_003705054.1| PREDICTED: RNA polymerase II subunit A C-terminal domain
           phosphatase-like isoform 1 [Megachile rotundata]
          Length = 760

 Score = 79.3 bits (194), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 51/171 (29%), Positives = 88/171 (51%), Gaps = 20/171 (11%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 730
           E+++++ + RKL L++DLD T++++     + P   ++            H  L+     
Sbjct: 142 EDEQRLLNDRKLALLVDLDQTIVHTTN-DNIPPNMKDVY-----------HYQLYGPNSP 189

Query: 731 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 790
              T+LRP    FL   S+L+E+H+ T G + YA  +A +LD  G+LF+ R++SR +  D
Sbjct: 190 WYHTRLRPNTRHFLSEMSRLYELHICTFGARNYAHTVASLLDKDGILFSNRILSRDECFD 249

Query: 791 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           P        K+ +L+ +    +  V IIDD   VW     NL+ V+ Y +F
Sbjct: 250 P------ASKTANLKALFPCGDDLVCIIDDREDVW-QGCGNLVQVKPYHFF 293



 Score = 55.5 bits (132), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 41/156 (26%), Positives = 71/156 (45%), Gaps = 19/156 (12%)

Query: 856 EIDHDERSEDGTLASSLGVI---------------ERLHKIFFSHQSLDD--VDVRNILA 898
           EI  D +SED TL+    +I                R+H  F++ +  +     +R+I+ 
Sbjct: 410 EITQDTKSEDKTLSGENNIIDEDDDDYLLYLEDILRRIHAEFYATKEKESGRSSLRDIIP 469

Query: 899 AEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTD 958
             + ++L G  I FS + P  +   H    ++ A  FGA  ++ + D+ TH+VA   GT 
Sbjct: 470 RVRAQVLKGVHITFSGLIPTHQK-IHQSRAYKVARAFGAEVSQELTDKTTHLVAIRPGTA 528

Query: 959 KVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAI 993
           K N A       +V+P W+   A  +   +E+ F +
Sbjct: 529 KANAAKKNLNIKIVNPDWLWTCAERWEHVDERLFPL 564


>gi|383859141|ref|XP_003705055.1| PREDICTED: RNA polymerase II subunit A C-terminal domain
           phosphatase-like isoform 2 [Megachile rotundata]
          Length = 759

 Score = 79.3 bits (194), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 51/171 (29%), Positives = 88/171 (51%), Gaps = 20/171 (11%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 730
           E+++++ + RKL L++DLD T++++     + P   ++            H  L+     
Sbjct: 142 EDEQRLLNDRKLALLVDLDQTIVHTTN-DNIPPNMKDVY-----------HYQLYGPNSP 189

Query: 731 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 790
              T+LRP    FL   S+L+E+H+ T G + YA  +A +LD  G+LF+ R++SR +  D
Sbjct: 190 WYHTRLRPNTRHFLSEMSRLYELHICTFGARNYAHTVASLLDKDGILFSNRILSRDECFD 249

Query: 791 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           P        K+ +L+ +    +  V IIDD   VW     NL+ V+ Y +F
Sbjct: 250 P------ASKTANLKALFPCGDDLVCIIDDREDVW-QGCGNLVQVKPYHFF 293



 Score = 55.1 bits (131), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 41/156 (26%), Positives = 71/156 (45%), Gaps = 19/156 (12%)

Query: 856 EIDHDERSEDGTLASSLGVI---------------ERLHKIFFSHQSLDD--VDVRNILA 898
           EI  D +SED TL+    +I                R+H  F++ +  +     +R+I+ 
Sbjct: 410 EITQDTKSEDKTLSGENNIIDEDDDDYLLYLEDILRRIHAEFYATKEKESGRSSLRDIIP 469

Query: 899 AEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTD 958
             + ++L G  I FS + P  +   H    ++ A  FGA  ++ + D+ TH+VA   GT 
Sbjct: 470 RVRAQVLKGVHITFSGLIPTHQK-IHQSRAYKVARAFGAEVSQELTDKTTHLVAIRPGTA 528

Query: 959 KVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAI 993
           K N A       +V+P W+   A  +   +E+ F +
Sbjct: 529 KANAAKKNLNIKIVNPDWLWTCAERWEHVDERLFPL 564


>gi|300701489|ref|XP_002994977.1| hypothetical protein NCER_102325 [Nosema ceranae BRL01]
 gi|239603396|gb|EEQ81306.1| hypothetical protein NCER_102325 [Nosema ceranae BRL01]
          Length = 200

 Score = 79.3 bits (194), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 56/177 (31%), Positives = 87/177 (49%), Gaps = 32/177 (18%)

Query: 664 KERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRH 723
           K++  RL + KK      L LVLDLD T+L           H  I ++  E         
Sbjct: 48  KKKLERLHKNKK------LVLVLDLDQTIL-----------HTTITKEYMEGYSN----- 85

Query: 724 LFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVI 783
            F    +    K RP +   LE   K +E+H+YTMGNK+YA ++ K++DP       R++
Sbjct: 86  -FIINDISYCVKFRPYLNYMLECLYKKYEIHVYTMGNKVYANKIVKLIDPTRKYIGNRIL 144

Query: 784 SRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           +R ++G  F         KDL  +  + S VVI+DD   +W ++  NLI+V+ Y ++
Sbjct: 145 TRDENGIGF--------KKDLNRLFSIHSNVVILDDRDDIWDYSD-NLILVKPYFFW 192


>gi|323332189|gb|EGA73600.1| Fcp1p [Saccharomyces cerevisiae AWRI796]
          Length = 646

 Score = 79.0 bits (193), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 76/257 (29%), Positives = 106/257 (41%), Gaps = 61/257 (23%)

Query: 618 TNHDDKQTG--TGSGPE--AGPVGAHPQSAWGDVEHLFEGYDDQQKAAIQKERTRRLEEQ 673
            NHD    G  T  G E  A      P    GDV+      +  +     KE  RR    
Sbjct: 116 CNHDIVYGGLCTQCGKEVSADAFDGVPLDVVGDVDLQISETEAIRTGKALKEHLRR---- 171

Query: 674 KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFR------- 726
                 +KL LV+DLD T+++      VDP   E      + D   P+    R       
Sbjct: 172 -----DKKLILVVDLDQTIIHCG----VDPTIAEW-----KNDPNNPNFETLRDVKSFTL 217

Query: 727 -----FPHMGM-----------------WTKLRPGIWTFLERASKLFEMHLYTMGNKLYA 764
                 P M M                 + K+RPG+  F  + + LFEMH+YTM  + YA
Sbjct: 218 DEELVLPLMYMNDDGSMLRPPPVRKCWYYVKVRPGLKEFFAKVAPLFEMHIYTMATRAYA 277

Query: 765 TEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRV 823
            ++AK++DP G LF  R++SR ++G           +K L  +    +S VV+IDD   V
Sbjct: 278 LQIAKIVDPTGELFGDRILSRDENGS--------LTTKSLAKLFPTDQSMVVVIDDRGDV 329

Query: 824 WPHNKLNLIVVERYTYF 840
           W     NLI V  Y +F
Sbjct: 330 WNWCP-NLIKVVPYNFF 345



 Score = 51.2 bits (121), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 34/144 (23%), Positives = 69/144 (47%), Gaps = 11/144 (7%)

Query: 859 HDER---SEDGTLASSLGVIERLHKIFF---SHQSLDDVDVRNILAAEQRKILAGCRIVF 912
           HD++    +D  L   +G +  +HK ++   S Q+  + ++  I+ + ++K+   C  VF
Sbjct: 452 HDQKLLVDDDDELYYLMGTLSNIHKTYYDMLSQQNEPEPNLMEIIPSLKQKVFQNCYFVF 511

Query: 913 SRVFPVG-EANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWA--LSTGRF 969
           S + P+G +       +W +   FGA  T  ID   TH++  +  T K   A   +    
Sbjct: 512 SGLIPLGTDIQRSDIVIWTST--FGATSTPDIDYLTTHLITKNPSTYKARLAKKFNPQIK 569

Query: 970 VVHPGWVEASALLYRRANEQDFAI 993
           +VHP W+    + +++ +E+   +
Sbjct: 570 IVHPDWIFECLVNWKKVDEKTLHV 593


>gi|323307594|gb|EGA60861.1| Fcp1p [Saccharomyces cerevisiae FostersO]
          Length = 732

 Score = 79.0 bits (193), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 71/246 (28%), Positives = 101/246 (41%), Gaps = 57/246 (23%)

Query: 625 TGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKAAIQKERTRRLEEQKKMFSARKLCL 684
           T  G    A      P    GDV+      +  +     KE  RR          +KL L
Sbjct: 127 TQCGKEVSADAFDGVPLDVVGDVDLQISETEAIRTGKALKEHLRR---------DKKLIL 177

Query: 685 VLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFR------------FPHMGM 732
           V+DLD T+++      VDP   E      + D   P+    R             P M M
Sbjct: 178 VVDLDQTIIHCG----VDPTIAEW-----KNDPNNPNFETLRDVKSFTLDEELVLPLMYM 228

Query: 733 -----------------WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKG 775
                            + K+RPG+  F  + + LFEMH+YTM  + YA ++AK++DP G
Sbjct: 229 NDDGSMLRPPPVRKCWYYVKVRPGLKEFFAKVAPLFEMHIYTMATRAYALQIAKIVDPTG 288

Query: 776 VLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVV 834
            LF  R++SR ++G           +K L  +    +S VV+IDD   VW     NLI V
Sbjct: 289 ELFGDRILSRDENGS--------LTTKSLAKLFPTDQSMVVVIDDRGDVWNWCP-NLIKV 339

Query: 835 ERYTYF 840
             Y +F
Sbjct: 340 VPYNFF 345



 Score = 52.8 bits (125), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 34/144 (23%), Positives = 70/144 (48%), Gaps = 11/144 (7%)

Query: 859 HDER---SEDGTLASSLGVIERLHKIFF---SHQSLDDVDVRNILAAEQRKILAGCRIVF 912
           HD++    +D  L   +G +  +HK ++   S Q+  + ++  I+ + ++K+   C  VF
Sbjct: 452 HDQKLLVDDDDELYYLMGTLSNIHKTYYDMLSQQNEPEPNLMEIIPSLKQKVFQNCYFVF 511

Query: 913 SRVFPVG-EANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWA--LSTGRF 969
           S + P+G +       +W +   FGA  T  ID   TH++  +  T K   A   +    
Sbjct: 512 SGLIPLGTDIQRSDIVIWTST--FGATSTPDIDYLTTHLITKNPSTYKARLAKKFNPQIK 569

Query: 970 VVHPGWVEASALLYRRANEQDFAI 993
           +VHP W+    + +++ +E+ + +
Sbjct: 570 IVHPDWIFECLVNWKKVDEKPYTL 593


>gi|349580569|dbj|GAA25729.1| K7_Fcp1p [Saccharomyces cerevisiae Kyokai no. 7]
          Length = 732

 Score = 79.0 bits (193), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 71/246 (28%), Positives = 101/246 (41%), Gaps = 57/246 (23%)

Query: 625 TGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKAAIQKERTRRLEEQKKMFSARKLCL 684
           T  G    A      P    GDV+      +  +     KE  RR          +KL L
Sbjct: 127 TQCGKEVSADAFDGVPLDVVGDVDLQISETEAIRTGKALKEHLRR---------DKKLIL 177

Query: 685 VLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFR------------FPHMGM 732
           V+DLD T+++      VDP   E      + D   P+    R             P M M
Sbjct: 178 VVDLDQTIIHCG----VDPTIAEW-----KNDPNNPNFETLRDVKSFTLDEELVLPLMYM 228

Query: 733 -----------------WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKG 775
                            + K+RPG+  F  + + LFEMH+YTM  + YA ++AK++DP G
Sbjct: 229 NDDGSMLRPPPVRKCWYYVKVRPGLKEFFAKVAPLFEMHIYTMATRAYALQIAKIVDPTG 288

Query: 776 VLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVV 834
            LF  R++SR ++G           +K L  +    +S VV+IDD   VW     NLI V
Sbjct: 289 ELFGDRILSRDENGS--------LTTKSLTKLFPTDQSMVVVIDDRGDVWNWCP-NLIKV 339

Query: 835 ERYTYF 840
             Y +F
Sbjct: 340 VPYNFF 345



 Score = 52.8 bits (125), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 34/144 (23%), Positives = 70/144 (48%), Gaps = 11/144 (7%)

Query: 859 HDER---SEDGTLASSLGVIERLHKIFF---SHQSLDDVDVRNILAAEQRKILAGCRIVF 912
           HD++    +D  L   +G +  +HK ++   S Q+  + ++  I+ + ++K+   C  VF
Sbjct: 452 HDQKLLVDDDDELYYLMGTLSNIHKTYYDMLSQQNEPEPNLMEIIPSLKQKVFQNCYFVF 511

Query: 913 SRVFPVG-EANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWA--LSTGRF 969
           S + P+G +       +W +   FGA  T  ID   TH++  +  T K   A   +    
Sbjct: 512 SGLIPLGTDIQRSDIVIWTST--FGATSTPDIDYLTTHLITKNPSTYKARLAKKFNPQIK 569

Query: 970 VVHPGWVEASALLYRRANEQDFAI 993
           +VHP W+    + +++ +E+ + +
Sbjct: 570 IVHPDWIFECLVNWKKVDEKPYTL 593


>gi|71004098|ref|XP_756715.1| hypothetical protein UM00568.1 [Ustilago maydis 521]
 gi|46095984|gb|EAK81217.1| hypothetical protein UM00568.1 [Ustilago maydis 521]
          Length = 779

 Score = 79.0 bits (193), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 61/209 (29%), Positives = 96/209 (45%), Gaps = 43/209 (20%)

Query: 662 IQKERTRRL--EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRK-------- 711
           +  E  +RL  E    + S RKL L++DLD T++++     VDP   E +R         
Sbjct: 46  VSAEEAQRLDSETTSHLLSQRKLALIVDLDQTVIHAT----VDPTVGEWMRDESNPNYEA 101

Query: 712 ----------------KEEQDREKPH---RHLFRFPHMGMWTKLRPGIWTFLERASKLFE 752
                           K+E+D  +P      L        + K RPG+   L+  S+ +E
Sbjct: 102 LQSVGKFRLGIDGEEIKDEEDGSEPKDPAAALKASRACWYYVKPRPGVPQVLKHLSEKYE 161

Query: 753 MHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGME- 811
           +H+YTMG + YA  + K++DP   +F  R++SR ++G            K L  +  ++ 
Sbjct: 162 LHVYTMGTRSYANCVCKLIDPDASIFGNRILSRDENGSLV--------RKSLSRLFPVDH 213

Query: 812 SAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           S VVIIDD   VW  +  NL+ V  Y +F
Sbjct: 214 SMVVIIDDREDVWSRSP-NLLPVLPYEFF 241



 Score = 60.8 bits (146), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 40/104 (38%), Positives = 56/104 (53%), Gaps = 4/104 (3%)

Query: 893 VRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQV-THVV 951
           V +++  ++ ++L GC IVFS + P G  N     LW  A +FGA     I+  V THVV
Sbjct: 508 VVDLIGMKKAQVLKGCTIVFSSMIPFGH-NVEKSDLWAMAREFGATPASEIEVGVTTHVV 566

Query: 952 ANSLGTDKVNWALS-TGRF-VVHPGWVEASALLYRRANEQDFAI 993
           A   GT KV+ AL  +G+  VV P W   SA  + R +E  + I
Sbjct: 567 AARPGTAKVHQALRLSGQLEVVWPSWFHVSASRWSRQDEALYRI 610


>gi|328772741|gb|EGF82779.1| hypothetical protein BATDEDRAFT_22917 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 868

 Score = 79.0 bits (193), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 45/123 (36%), Positives = 66/123 (53%), Gaps = 16/123 (13%)

Query: 669 RLEEQK--KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFR 726
           RLE++   ++   RKL LVLDLD T++++     VDP   E +      D   P+     
Sbjct: 154 RLEKETADRLLDERKLSLVLDLDQTVIHAT----VDPTVGEWM-----ADPNNPN----- 199

Query: 727 FPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG 786
           FP + +W    PG   FL   +  +EMH+YTMG + YA  ++K+LDP    F  R++SR 
Sbjct: 200 FPALTVWATHEPGTREFLRELNAKYEMHIYTMGTRNYAKAVSKILDPDKRYFKDRILSRD 259

Query: 787 DDG 789
           D G
Sbjct: 260 DSG 262



 Score = 69.7 bits (169), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 39/115 (33%), Positives = 60/115 (52%), Gaps = 5/115 (4%)

Query: 878 LHKIFFSHQSLDDV---DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQ 934
           +H+ F+      D+   DVR I+   +R IL G  I+F+ + P+G   P  H  W  A  
Sbjct: 504 IHQTFYERSDAGDIKTSDVRVIVPEMKRSILEGVHILFTSIIPLG-LEPQKHEHWIAATS 562

Query: 935 FGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANE 988
           +GAVC   +D +VTHV+A   GT KVN A       ++   W+  +   ++RA+E
Sbjct: 563 YGAVCHVDLDPEVTHVIAGKTGTAKVNAARKRPNVAILKIDWLIDTIRTWQRADE 617


>gi|6323933|ref|NP_014004.1| Fcp1p [Saccharomyces cerevisiae S288c]
 gi|2497216|sp|Q03254.1|FCP1_YEAST RecName: Full=RNA polymerase II subunit A C-terminal domain
           phosphatase; AltName: Full=CTD phosphatase FCP1
 gi|825543|emb|CAA89775.1| unknown [Saccharomyces cerevisiae]
 gi|151945985|gb|EDN64217.1| protein phosphatase [Saccharomyces cerevisiae YJM789]
 gi|256270710|gb|EEU05873.1| Fcp1p [Saccharomyces cerevisiae JAY291]
 gi|259148865|emb|CAY82110.1| Fcp1p [Saccharomyces cerevisiae EC1118]
 gi|285814283|tpg|DAA10178.1| TPA: Fcp1p [Saccharomyces cerevisiae S288c]
 gi|323346974|gb|EGA81251.1| Fcp1p [Saccharomyces cerevisiae Lalvin QA23]
 gi|323353207|gb|EGA85507.1| Fcp1p [Saccharomyces cerevisiae VL3]
 gi|392297449|gb|EIW08549.1| Fcp1p [Saccharomyces cerevisiae CEN.PK113-7D]
          Length = 732

 Score = 79.0 bits (193), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 71/246 (28%), Positives = 101/246 (41%), Gaps = 57/246 (23%)

Query: 625 TGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKAAIQKERTRRLEEQKKMFSARKLCL 684
           T  G    A      P    GDV+      +  +     KE  RR          +KL L
Sbjct: 127 TQCGKEVSADAFDGVPLDVVGDVDLQISETEAIRTGKALKEHLRR---------DKKLIL 177

Query: 685 VLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFR------------FPHMGM 732
           V+DLD T+++      VDP   E      + D   P+    R             P M M
Sbjct: 178 VVDLDQTIIHCG----VDPTIAEW-----KNDPNNPNFETLRDVKSFTLDEELVLPLMYM 228

Query: 733 -----------------WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKG 775
                            + K+RPG+  F  + + LFEMH+YTM  + YA ++AK++DP G
Sbjct: 229 NDDGSMLRPPPVRKCWYYVKVRPGLKEFFAKVAPLFEMHIYTMATRAYALQIAKIVDPTG 288

Query: 776 VLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVV 834
            LF  R++SR ++G           +K L  +    +S VV+IDD   VW     NLI V
Sbjct: 289 ELFGDRILSRDENGS--------LTTKSLAKLFPTDQSMVVVIDDRGDVWNWCP-NLIKV 339

Query: 835 ERYTYF 840
             Y +F
Sbjct: 340 VPYNFF 345



 Score = 52.8 bits (125), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 34/144 (23%), Positives = 70/144 (48%), Gaps = 11/144 (7%)

Query: 859 HDER---SEDGTLASSLGVIERLHKIFF---SHQSLDDVDVRNILAAEQRKILAGCRIVF 912
           HD++    +D  L   +G +  +HK ++   S Q+  + ++  I+ + ++K+   C  VF
Sbjct: 452 HDQKLLVDDDDELYYLMGTLSNIHKTYYDMLSQQNEPEPNLMEIIPSLKQKVFQNCYFVF 511

Query: 913 SRVFPVG-EANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWA--LSTGRF 969
           S + P+G +       +W +   FGA  T  ID   TH++  +  T K   A   +    
Sbjct: 512 SGLIPLGTDIQRSDIVIWTST--FGATSTPDIDYLTTHLITKNPSTYKARLAKKFNPQIK 569

Query: 970 VVHPGWVEASALLYRRANEQDFAI 993
           +VHP W+    + +++ +E+ + +
Sbjct: 570 IVHPDWIFECLVNWKKVDEKPYTL 593


>gi|196002231|ref|XP_002110983.1| hypothetical protein TRIADDRAFT_54465 [Trichoplax adhaerens]
 gi|190586934|gb|EDV26987.1| hypothetical protein TRIADDRAFT_54465 [Trichoplax adhaerens]
          Length = 766

 Score = 79.0 bits (193), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 57/171 (33%), Positives = 92/171 (53%), Gaps = 19/171 (11%)

Query: 675 KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMW- 733
           ++ S++KL L++DLD TL+     H      D  L    E+ +     H+F  P   ++ 
Sbjct: 221 RLLSSKKLVLIVDLDLTLI-----HTRMASPDIKLSNLTEEKQIYYTCHMF--PGYNVYH 273

Query: 734 ---TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 790
              TKLRP +  FL+ AS LFE+H+ TMG++ YA ++  +LDP G LF  R++SR     
Sbjct: 274 QYLTKLRPHVEEFLKVASTLFELHVVTMGSRSYAQDIVGILDPTGSLFYNRILSRD---- 329

Query: 791 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
             +   ++ KS +L  +  + ++ V IIDD   +W  +  + I V  Y+YF
Sbjct: 330 --ELKSQLLKSTNLNQLFPLGDNLVCIIDDRPEMWAFHP-SCIPVPPYSYF 377



 Score = 54.3 bits (129), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 38/135 (28%), Positives = 62/135 (45%), Gaps = 16/135 (11%)

Query: 877 RLHKIFFSH-----QSLDDV-DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQ 930
           R+H  F+       + LD + D+++I+   +R +L   +IVFS + P G  +P     W 
Sbjct: 516 RIHNRFYQQYDQLKEQLDQLPDLKSIIPEIRRNVLKDVKIVFSAIIPSGHPSPEKTYEWI 575

Query: 931 TAEQFGAVCTKHI----DDQVTHVVANSLG-----TDKVNWALST-GRFVVHPGWVEASA 980
            AE  GA  T         + THVV   +      T KV+ A+ T G FVV   W+    
Sbjct: 576 LAESLGAKVTHKFHTSPSRKTTHVVTKRVAFQSGYTQKVHLAMKTAGVFVVDIDWLYKCN 635

Query: 981 LLYRRANEQDFAIKP 995
             +++  E+ + + P
Sbjct: 636 EFWKKIEEEPYLLAP 650


>gi|190408503|gb|EDV11768.1| TFIIF interacting component of CTD phosphatase [Saccharomyces
           cerevisiae RM11-1a]
          Length = 732

 Score = 79.0 bits (193), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 71/246 (28%), Positives = 101/246 (41%), Gaps = 57/246 (23%)

Query: 625 TGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKAAIQKERTRRLEEQKKMFSARKLCL 684
           T  G    A      P    GDV+      +  +     KE  RR          +KL L
Sbjct: 127 TQCGKEVSADAFDGVPLDVVGDVDLQISETEAIRTGKALKEHLRR---------DKKLIL 177

Query: 685 VLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFR------------FPHMGM 732
           V+DLD T+++      VDP   E      + D   P+    R             P M M
Sbjct: 178 VVDLDQTIIHCG----VDPTIAEW-----KNDPNNPNFETLRDVKSFTLDEELVLPLMYM 228

Query: 733 -----------------WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKG 775
                            + K+RPG+  F  + + LFEMH+YTM  + YA ++AK++DP G
Sbjct: 229 NDDGSMLRPPPVRKCWYYVKVRPGLKEFFAKVAPLFEMHIYTMATRAYALQIAKIVDPTG 288

Query: 776 VLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVV 834
            LF  R++SR ++G           +K L  +    +S VV+IDD   VW     NLI V
Sbjct: 289 ELFGDRILSRDENGS--------LTTKSLAKLFPTDQSMVVVIDDRGDVWNWCP-NLIKV 339

Query: 835 ERYTYF 840
             Y +F
Sbjct: 340 VPYNFF 345



 Score = 52.8 bits (125), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 34/144 (23%), Positives = 70/144 (48%), Gaps = 11/144 (7%)

Query: 859 HDER---SEDGTLASSLGVIERLHKIFF---SHQSLDDVDVRNILAAEQRKILAGCRIVF 912
           HD++    +D  L   +G +  +HK ++   S Q+  + ++  I+ + ++K+   C  VF
Sbjct: 452 HDQKLLVDDDDELYYLMGTLSNIHKTYYDMLSQQNEPEPNLMEIIPSLKQKVFQNCYFVF 511

Query: 913 SRVFPVG-EANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWA--LSTGRF 969
           S + P+G +       +W +   FGA  T  ID   TH++  +  T K   A   +    
Sbjct: 512 SGLIPLGTDIQRSDIVIWTST--FGATSTPDIDYLTTHLITKNPSTYKARLAKKFNPQIK 569

Query: 970 VVHPGWVEASALLYRRANEQDFAI 993
           +VHP W+    + +++ +E+ + +
Sbjct: 570 IVHPDWIFECLVNWKKVDEKPYTL 593


>gi|297819962|ref|XP_002877864.1| hypothetical protein ARALYDRAFT_906616 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297323702|gb|EFH54123.1| hypothetical protein ARALYDRAFT_906616 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 284

 Score = 78.6 bits (192), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 70/242 (28%), Positives = 110/242 (45%), Gaps = 32/242 (13%)

Query: 675 KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMG-MW 733
           K    ++L LVL L  TL +S    ++    + +    E + R    R    FP+ G + 
Sbjct: 54  KSLKEKRLTLVLGLHGTLYDSRLVSQLSDGENYL--TGEVKSRFDLRRSKKFFPNQGEVL 111

Query: 734 TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFD 793
            KLRP +  FL  A+KLF+M ++ + +     E+   LDP G  F  R+I+  D      
Sbjct: 112 FKLRPFVHEFLREANKLFQMTVFELCSPEQGEEVISFLDPHGTYFEKRIITNRD------ 165

Query: 794 GDERVPKSKDLEGVLGMESAVVIIDDS-VRVWPHNKLNLIVVERYTYFPCSRRQFGL--- 849
                 + K+L+ VL  E  +VI+DD  V  WP +  NL+ +  Y +F  +     +   
Sbjct: 166 -----SEMKNLDLVLADERGIVILDDKHVYWWPDDTTNLLQIAPYHFFKRNNNNTWITKL 220

Query: 850 --LGPSLLEID------------HDERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRN 895
                  L ID             DE +EDG L ++L +++ +HK FF  +  D  DVR 
Sbjct: 221 VNFFKKTLSIDDESDPKSYAEERRDEDAEDGGLENALELLKEVHKNFFDEEDEDSRDVRA 280

Query: 896 IL 897
           +L
Sbjct: 281 LL 282


>gi|195170374|ref|XP_002025988.1| GL10108 [Drosophila persimilis]
 gi|194110852|gb|EDW32895.1| GL10108 [Drosophila persimilis]
          Length = 757

 Score = 78.6 bits (192), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 56/163 (34%), Positives = 86/163 (52%), Gaps = 24/163 (14%)

Query: 681 KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMW--TKLRP 738
           KL L++DLD T++++      D V + I          K   H   +     W  T+LRP
Sbjct: 88  KLVLLVDLDQTVIHTTN----DTVPENI----------KGIYHFQLYGPQSPWYHTRLRP 133

Query: 739 GIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERV 798
           G   FLER S+L+E+H+ T G + YA  +A++LDP G  F+ R++SR    + F+     
Sbjct: 134 GTAEFLERMSQLYELHICTFGARNYAHMIAQLLDPDGKFFSHRILSR---DECFNA---T 187

Query: 799 PKSKDLEGVL-GMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
            K+ +L+ +    +S V IIDD   VW +   NLI V+ Y +F
Sbjct: 188 SKTDNLKALFPNGDSMVCIIDDREDVW-NMASNLIQVKPYHFF 229



 Score = 40.4 bits (93), Expect = 5.1,   Method: Compositional matrix adjust.
 Identities = 36/149 (24%), Positives = 70/149 (46%), Gaps = 10/149 (6%)

Query: 854 LLEIDHDE------RSEDGTLASSLGVIERLHKIFFS--HQSLDDVDVRNILAAEQRKIL 905
           +++++HDE         D  L     ++  +HK F++   ++++  D++ I+   + ++L
Sbjct: 412 IVKVNHDEGKLIEIEDPDDYLLYLEVILRNIHKRFYAIYDETMEIPDLKIIVPKIRCEVL 471

Query: 906 AGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALS 965
            G  +VFS + P  +        +  A+  GA    +I  + TH+VA + GT KVN A  
Sbjct: 472 RGKNLVFSGLVPT-QMRLEQSRAYFIAKSLGAEVQPNISKESTHLVAVNAGTYKVNAAKK 530

Query: 966 TGRF-VVHPGWVEASALLYRRANEQDFAI 993
                VV+  W+   A  +    E+ F +
Sbjct: 531 EVNIRVVNANWLWTCAERWEHVEEKLFPL 559


>gi|134056779|emb|CAK37687.1| unnamed protein product [Aspergillus niger]
          Length = 788

 Score = 78.6 bits (192), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 44/113 (38%), Positives = 66/113 (58%), Gaps = 12/113 (10%)

Query: 731 GMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD 788
           G W   KLRPG+ +FL+  S+++E+H+YTMG + YA  +A ++DP   LF  R++SR + 
Sbjct: 175 GCWYYVKLRPGLESFLQNVSEMYELHIYTMGTRSYAQHIASIIDPDRKLFGDRILSRDES 234

Query: 789 GDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERYTYF 840
           G           +K+L  +  +++  VVIIDD   VW  N  NLI V  Y +F
Sbjct: 235 GSLV--------AKNLHRLFPVDTKMVVIIDDRGDVWRWNP-NLIKVSPYDFF 278



 Score = 44.7 bits (104), Expect = 0.28,   Method: Compositional matrix adjust.
 Identities = 32/104 (30%), Positives = 50/104 (48%), Gaps = 4/104 (3%)

Query: 892 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLH-PLWQTAEQFGAVCTKHIDDQVTHV 950
           DV+ I+   + +IL G  +VFS V P+G    +    LW  A+ FGAV    I+   TH+
Sbjct: 466 DVKIIMPNMKHRILGGVTLVFSGVLPLGTDTQNADIXLW--AKSFGAVIASKINMXTTHL 523

Query: 951 VANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAI 993
           VA    T KV  A    +  +V   W+      ++R  E+ + +
Sbjct: 524 VAGRNRTXKVREATRYPKVKIVTTQWLLDCLTQWKRLAEEPYLL 567


>gi|390333352|ref|XP_791406.3| PREDICTED: RNA polymerase II subunit A C-terminal domain
           phosphatase-like [Strongylocentrotus purpuratus]
          Length = 673

 Score = 78.6 bits (192), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 54/170 (31%), Positives = 91/170 (53%), Gaps = 19/170 (11%)

Query: 672 EQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMG 731
           ++  +   RKL L++DLD TL+++     +D V  ++      Q R+ P   +F + H  
Sbjct: 22  DEDSLIKHRKLVLLVDLDQTLIHTT----LDEVPADMPGVHHFQLRKGP---MFPWYH-- 72

Query: 732 MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 791
             T++R     FL+  S+ +++H++TMG +LYA  +A+++DP+G  F+ R++SR +  DP
Sbjct: 73  --TRIRDNYQQFLDLISQFYQLHIFTMGVRLYAHTVAEIIDPEGKFFSHRILSRDECVDP 130

Query: 792 FDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
                   K  +L  +    +  V IIDD   VW +   NLI V  Y YF
Sbjct: 131 H------SKKANLRSIFPRGDKMVCIIDDRDDVW-NFAPNLIQVPPYRYF 173



 Score = 63.2 bits (152), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 46/149 (30%), Positives = 70/149 (46%), Gaps = 19/149 (12%)

Query: 861 ERSEDGTLASSLGVIERLHKIFFSH-----QSLDDVDVRNILAAEQRKILAGCRIVFSRV 915
           E  +D  L      + R+H+ FF       Q L   D+++I+   +  +L GC IVFS V
Sbjct: 325 EEDKDEYLLYLADTLARIHRTFFKQVEANPQEL--PDLKSIVPGMRMSVLKGCNIVFSSV 382

Query: 916 FPVGEANPHLHPLWQTAEQFGA-----VCTKHIDDQV-----THVVANSLGTDKVNWA-L 964
           FP     P    +W+ A   GA     + TK  ++Q      TH+VA  +GT KV+ A  
Sbjct: 383 FPTN-MPPEQSRVWKVALALGAKVSPQIVTKSKEEQAKGRASTHLVAAKVGTSKVHAARR 441

Query: 965 STGRFVVHPGWVEASALLYRRANEQDFAI 993
           S   F+V P W+      + R +E+ F +
Sbjct: 442 SKSIFIVEPDWLWCCWERWERVDERLFIL 470


>gi|68472089|ref|XP_719840.1| potential RNA Pol II CTD phosphatase component [Candida albicans
           SC5314]
 gi|68472324|ref|XP_719723.1| potential RNA Pol II CTD phosphatase component [Candida albicans
           SC5314]
 gi|46441553|gb|EAL00849.1| potential RNA Pol II CTD phosphatase component [Candida albicans
           SC5314]
 gi|46441679|gb|EAL00974.1| potential RNA Pol II CTD phosphatase component [Candida albicans
           SC5314]
          Length = 768

 Score = 78.6 bits (192), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 66/207 (31%), Positives = 95/207 (45%), Gaps = 48/207 (23%)

Query: 658 QKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDR 717
           + A I+   T RL         RKL LV+DLD T++++     VDP   E      + D 
Sbjct: 148 EAAKIEHNTTDRL------IDERKLILVVDLDQTVIHAT----VDPTVGEW-----QSDP 192

Query: 718 EKPHR------HLFRFPHMGM----WT-------------KLRPGIWTFLERASKLFEMH 754
             P+         F      +    WT             KLRPG+  FLE+ ++ +EMH
Sbjct: 193 ANPNYAAVKDVKTFCLEEEAIVPPGWTGPKLAPTKCTYYVKLRPGLSEFLEKMAEKYEMH 252

Query: 755 LYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SA 813
           +YTM  + YA  +AK++DP G  F  R++SR + G            K+L+ +  ++ S 
Sbjct: 253 IYTMATRNYALSIAKIIDPDGKYFGDRILSRDESGS--------LTHKNLKRLFPVDQSM 304

Query: 814 VVIIDDSVRVWPHNKLNLIVVERYTYF 840
           VVIIDD   VW     NLI V  Y +F
Sbjct: 305 VVIIDDRGDVWQWES-NLIKVVPYDFF 330



 Score = 42.7 bits (99), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 34/142 (23%), Positives = 61/142 (42%), Gaps = 9/142 (6%)

Query: 860 DERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVG 919
           D+ +E   L   LG I + +   F    ++  D+  I+ + + K L G  ++FS + P+G
Sbjct: 476 DDDNELAALDKVLGNIHQGYYNLFDKDKINKPDLTEIIPSMKSKTLEGITVLFSGIIPLG 535

Query: 920 EANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLG-------TDKVNWALSTGRF-VV 971
             N     +     QFG      +  +VTHVV   +          +V   L      +V
Sbjct: 536 -INLDSADIVIWCRQFGVKVVNEVYPEVTHVVCRDVSEGAGPTFKTRVARKLYPDTIKIV 594

Query: 972 HPGWVEASALLYRRANEQDFAI 993
           +P W+ A    + + +E+D+ I
Sbjct: 595 NPDWLFACLSNWTKVDEKDYLI 616


>gi|241953831|ref|XP_002419637.1| RNA polymerase II subunit a c-terminal domain phosphatase, putative
           [Candida dubliniensis CD36]
 gi|223642977|emb|CAX43233.1| RNA polymerase II subunit a c-terminal domain phosphatase, putative
           [Candida dubliniensis CD36]
          Length = 771

 Score = 78.6 bits (192), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 66/207 (31%), Positives = 95/207 (45%), Gaps = 48/207 (23%)

Query: 658 QKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDR 717
           + A I+   T RL         RKL LV+DLD T++++     VDP   E      + D 
Sbjct: 148 EAAKIEHNTTDRL------IDERKLILVVDLDQTVIHAT----VDPTVGEW-----QSDP 192

Query: 718 EKPHR------HLFRFPHMGM----WT-------------KLRPGIWTFLERASKLFEMH 754
             P+         F      +    WT             KLRPG+  FLE+ ++ +EMH
Sbjct: 193 ANPNYAAVKDVKTFCLEEEAIVPPGWTGPKLAPTKCTYYVKLRPGLSEFLEKMAEKYEMH 252

Query: 755 LYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESA 813
           +YTM  + YA  +AK++DP G  F  R++SR + G            K+L+ +  + +S 
Sbjct: 253 IYTMATRNYALSIAKIIDPDGKYFGDRILSRDESGS--------LTHKNLKRLFPVDQSM 304

Query: 814 VVIIDDSVRVWPHNKLNLIVVERYTYF 840
           VVIIDD   VW     NLI V  Y +F
Sbjct: 305 VVIIDDRGDVWQWES-NLIKVVPYDFF 330


>gi|391332118|ref|XP_003740485.1| PREDICTED: RNA polymerase II subunit A C-terminal domain
           phosphatase-like [Metaseiulus occidentalis]
          Length = 646

 Score = 78.6 bits (192), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 81/332 (24%), Positives = 134/332 (40%), Gaps = 68/332 (20%)

Query: 723 HLFRFP-HMGMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA 779
           H FR P     W  T++RPG   FL + S+LFE+H+ T G + YA  +  +LDP    F 
Sbjct: 168 HHFRLPGSSNAWYHTRIRPGTEDFLRKISQLFELHIVTFGARPYANHIVSLLDPGKKYFQ 227

Query: 780 GRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYT 838
            R+++R +   P        K+ +L+ +    +  V IIDD   VW     NL+ V+ Y 
Sbjct: 228 YRILTRDECFHP------QSKTANLKSLFPCGDQMVCIIDDREDVWNFAS-NLVAVKPYV 280

Query: 839 YF-------------------------PCS-----------RRQFGLLGPSLLEIDHDER 862
           +F                          CS           R    +L      I+H   
Sbjct: 281 FFRGAGDINAPAGLLADCHALPASEGGTCSSVLSHKNPEALRADREVLACLQSLIEHTCG 340

Query: 863 SEDGTLASSLG---------VIERLHKIFFS----------HQSLDDVDVRNILAAEQRK 903
           + DG +    G          +  +H+ +F            +++   D++ ++   ++K
Sbjct: 341 ATDGFIDYEDGDDYLFYLEETLRTIHRAYFELYEQMQLKKHSEAVSIPDLKTVIPYVRQK 400

Query: 904 ILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWA 963
           +L    IVF+  FP+ +       ++  A   GA   K +  +VTH+VA   GT KV  A
Sbjct: 401 VLKDVVIVFTGCFPINQRQESA-KIYLVAIALGAKVQKELSKEVTHLVAARPGTAKVQQA 459

Query: 964 LSTGRF-VVHPGWVEASALLYRRANEQDFAIK 994
                  VV   W+ + A  + ++ E  F +K
Sbjct: 460 RKFRSIKVVSAQWLWSCAERWEKSPETLFPLK 491


>gi|336387157|gb|EGO28302.1| hypothetical protein SERLADRAFT_354339 [Serpula lacrymans var.
           lacrymans S7.9]
          Length = 874

 Score = 78.6 bits (192), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 69/263 (26%), Positives = 107/263 (40%), Gaps = 82/263 (31%)

Query: 651 FEGYDDQQKAAIQKER-----TRRLEEQKK--------MFSARKLCLVLDLDHTLLNSA- 696
           + G+ D  +A+IQ        T  LEE +K        + ++RKL L++DLD T++++  
Sbjct: 117 YTGFSDASRASIQMTHSAFGPTVSLEEAQKIEKETADHLLNSRKLSLIVDLDQTIVHATV 176

Query: 697 --------------------------KFHEVDPVHDEILRKKEEQDREKPH----RHLFR 726
                                     +  E + V DE+    E  D   P+    + + +
Sbjct: 177 DPTVGEWIAEGEAWEGKRAMKMKPPQRSKEDEDVSDEVATDSESDDECNPNWEALKDVRK 236

Query: 727 F----PHMGM------------------------WTKLRPGIWTFLERASKLFEMHLYTM 758
           F       GM                        + K RPG   FL   +  +EMH+YTM
Sbjct: 237 FQLGPESFGMPSSPRASRKVKGKQKFIENEGCMYYIKPRPGWQHFLHSIANKYEMHVYTM 296

Query: 759 GNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVII 817
           G + YA E+   +DP G +F GR++SR + G            K L+ +   + S VVII
Sbjct: 297 GTRAYAEEVCAAIDPDGTIFGGRILSRDESGS--------LTQKSLQRLFPCDTSMVVII 348

Query: 818 DDSVRVWPHNKLNLIVVERYTYF 840
           DD   VW  +  NL+ V  Y +F
Sbjct: 349 DDRADVWEWSP-NLVKVIPYDFF 370



 Score = 62.0 bits (149), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 34/103 (33%), Positives = 53/103 (51%), Gaps = 2/103 (1%)

Query: 892 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVV 951
           DV  I+ + +++ L G  I+FS V P+ +  P    +W+ AE FGA C   +  ++THVV
Sbjct: 557 DVTKIIPSMRKETLDGIHILFSSVIPL-DTKPETTEIWKVAEMFGAQCCTELSSRITHVV 615

Query: 952 ANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAI 993
           A   GT KV+ A   G   +V   W   S  L+   +E  + +
Sbjct: 616 AAKHGTVKVDAARKRGGIKIVWLSWFTDSIALWSHQDETPYLL 658


>gi|307212079|gb|EFN87962.1| RNA polymerase II subunit A C-terminal domain phosphatase
           [Harpegnathos saltator]
          Length = 734

 Score = 78.6 bits (192), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 52/173 (30%), Positives = 86/173 (49%), Gaps = 26/173 (15%)

Query: 672 EQKKMFSARKLCLVLDLDHTLLNSAKFH---EVDPVHDEILRKKEEQDREKPHRHLFRFP 728
           +++++   RKL L++DLD T++++   H    +  VH               H  L+   
Sbjct: 144 DEQRLLKDRKLVLLVDLDQTIVHTTNDHIPPNLKDVH---------------HFQLYGPN 188

Query: 729 HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD 788
                T+LRP    FL   S L+E+H+ + G ++YA  +A +LD  GVLF+ R++SR + 
Sbjct: 189 SPWYHTRLRPNTRHFLSEMSHLYELHICSFGARIYAHTIASLLDKDGVLFSHRILSRDEC 248

Query: 789 GDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
            DP        K+ +L+ +    +  V IIDD   VW     NL+ V+ Y +F
Sbjct: 249 FDP------ASKTANLKALFPCGDDLVCIIDDREDVW-QGCGNLVQVKPYHFF 294



 Score = 48.9 bits (115), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 34/125 (27%), Positives = 63/125 (50%), Gaps = 8/125 (6%)

Query: 874 VIERLHKIFFSHQSLDDVD----VRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLW 929
           ++ R+H  F+   +LD  +    +R+I+   + ++L G  + FS + P  +   H    +
Sbjct: 472 ILRRIHTEFYC--TLDKGNGRRSLRDIIPRVRSQVLKGLYLTFSGLIPTHQ-KLHQSRAY 528

Query: 930 QTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWA-LSTGRFVVHPGWVEASALLYRRANE 988
           + A  FGA  T+ + ++ TH+VA   GT K N A   T   +V+P W+   A  +   +E
Sbjct: 529 KVARAFGAEVTQDLTEKTTHLVAIRKGTAKANAAKKDTNIKIVNPEWLWTCAERWEHVDE 588

Query: 989 QDFAI 993
           + F +
Sbjct: 589 RLFPV 593


>gi|443896478|dbj|GAC73822.1| TFIIF-interacting CTD phosphatases [Pseudozyma antarctica T-34]
          Length = 751

 Score = 78.2 bits (191), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 61/209 (29%), Positives = 97/209 (46%), Gaps = 43/209 (20%)

Query: 662 IQKERTRRL--EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRK-------- 711
           +  E  +RL  E    + S RKL L++DLD T++++     VDP   E +R         
Sbjct: 46  VSAEEAQRLDSESTSHLLSQRKLALIVDLDQTVIHAT----VDPTVGEWMRDDTNPNYDA 101

Query: 712 -------------KEEQDREKP------HRHLFRFPHMGMWTKLRPGIWTFLERASKLFE 752
                        +E +D + P         L        + K RPG+ T L++ S+ ++
Sbjct: 102 LKSVGKFRLGIDGEEIKDDDDPTAPKDAAAALRASRACWYYVKPRPGVPTILKQLSQKYQ 161

Query: 753 MHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGME- 811
           +H+YTMG + YA  + K++DP   +F  R++SR ++G            K L  +  ++ 
Sbjct: 162 LHVYTMGTRSYANCVCKLIDPDASIFGNRILSRDENGSLV--------RKSLSRLFPVDH 213

Query: 812 SAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           S VVIIDD   VW  N  NL+ V  Y +F
Sbjct: 214 SMVVIIDDREDVW-SNSPNLLPVLPYEFF 241



 Score = 60.5 bits (145), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 38/112 (33%), Positives = 59/112 (52%), Gaps = 12/112 (10%)

Query: 893 VRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQV-THVV 951
           V +++ A++ ++L GC IVFS + PVG  +     LW TA  FGA     I+  V THVV
Sbjct: 503 VVDLMGAKKAQVLKGCVIVFSSMIPVGH-DAAKSELWATARAFGATPAAEIEPGVTTHVV 561

Query: 952 ANSLGTDKVNWAL----------STGRFVVHPGWVEASALLYRRANEQDFAI 993
           +  +GT KV+ A+          S+   +V P W E S   + R +E+ + +
Sbjct: 562 SARMGTAKVHQAMKLVKQQRERGSSQLKLVWPSWFEVSTSRWARQDEELYRL 613


>gi|238881126|gb|EEQ44764.1| conserved hypothetical protein [Candida albicans WO-1]
          Length = 525

 Score = 78.2 bits (191), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 66/205 (32%), Positives = 98/205 (47%), Gaps = 44/205 (21%)

Query: 658 QKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDP-------------- 703
           + A I+   T RL ++      RKL LV+DLD T++++     VDP              
Sbjct: 63  EAAKIEHNTTDRLIDE------RKLILVVDLDQTVIHAT----VDPTVGEWQSDPANPNY 112

Query: 704 --VHDEILRKKEEQDREKPHRHLFRFPHMG-----MWTKLRPGIWTFLERASKLFEMHLY 756
             V D      EE+    P    +  P +       + KLRPG+  FLE+ ++ +EMH+Y
Sbjct: 113 AAVKDVKTFCLEEEAIVPPG---WTGPKLAPTKCTYYVKLRPGLSEFLEKMAEKYEMHIY 169

Query: 757 TMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVV 815
           TM  + YA  +AK++DP G  F  R++SR + G            K+L+ +  + +S VV
Sbjct: 170 TMATRNYALSIAKIIDPDGKYFGDRILSRDESGS--------LTHKNLKRLFPVDQSMVV 221

Query: 816 IIDDSVRVWPHNKLNLIVVERYTYF 840
           IIDD   VW     NLI V  Y +F
Sbjct: 222 IIDDRGDVWQWES-NLIKVVPYDFF 245


>gi|149241937|ref|XP_001526384.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
           YB-4239]
 gi|146450507|gb|EDK44763.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
           YB-4239]
          Length = 883

 Score = 78.2 bits (191), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 67/232 (28%), Positives = 109/232 (46%), Gaps = 55/232 (23%)

Query: 646 DVEHLFEGYDDQQKAAI-----------QKERTRRLEEQ--KKMFSARKLCLVLDLDHTL 692
           D E  + GYD +++A+I             +   ++E     ++   RKL LV+DLD T+
Sbjct: 117 DDEKDYSGYDYEERASIAMSHDNTELRISYDEAAKIEHNTTDRLNQERKLILVVDLDQTV 176

Query: 693 LNSAKFHEVDPVHDEILRKKEEQDREKPH----RHLFRF--------------PHMG--- 731
           +++     VDP   E      + D E P+    + +  F              P +    
Sbjct: 177 IHAT----VDPTVGEW-----QLDPENPNYPAVKDVRTFCLEEDPVAPPGWNGPKLAPTK 227

Query: 732 --MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 789
              + K+RPG+  FL++  + +EMH+YTM  + YA  +AK++DP+G  F  R++SR + G
Sbjct: 228 CWYYVKVRPGLAEFLKKMDEKYEMHIYTMATRNYALSIAKIIDPEGKYFGDRILSRDESG 287

Query: 790 DPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
                       K+L+ +  ++ S VVIIDD   VW     NLI V  Y +F
Sbjct: 288 S--------LTHKNLKRLFPVDQSMVVIIDDRGDVWQWEN-NLIKVVPYDFF 330


>gi|91087589|ref|XP_971974.1| PREDICTED: similar to RNA polymerase II subunit A C-terminal domain
           phosphatase [Tribolium castaneum]
 gi|270010700|gb|EFA07148.1| hypothetical protein TcasGA2_TC010139 [Tribolium castaneum]
          Length = 760

 Score = 78.2 bits (191), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 53/172 (30%), Positives = 81/172 (47%), Gaps = 30/172 (17%)

Query: 675 KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMG--- 731
           ++   RKL L++DLD TL+++   H    + D                 ++RF   G   
Sbjct: 136 RLIRDRKLVLLVDLDQTLIHTTNDHIQPNIKD-----------------IYRFQLYGPNS 178

Query: 732 --MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 789
              +T+LRPG   FL      +E+H+ T G + YA  +A VLD     F+ R++SR +  
Sbjct: 179 PWYFTRLRPGTHQFLNNIYPFYELHICTFGARNYAHMIAAVLDRDQKFFSNRILSRDECF 238

Query: 790 DPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           DP        K  +L+ +    ++ V IIDD   VW  N  NLI V+ Y +F
Sbjct: 239 DP------TSKKANLKALFPCGDNMVCIIDDREDVWS-NAANLIHVKPYHFF 283



 Score = 54.7 bits (130), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 35/134 (26%), Positives = 66/134 (49%), Gaps = 6/134 (4%)

Query: 865 DGTLASSLGVIERLHKIFFS----HQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGE 920
           D  L+    +++R+H+ F+      +S +  D++ ++   + ++L G ++VFS + P   
Sbjct: 444 DDYLSYLEDILKRIHQAFYDAYDKMESGEVPDLKKVIPGVRSQVLQGYKLVFSGLVPT-H 502

Query: 921 ANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEAS 979
                   +Q A+  GA  T+ ++D  TH+VA   GT KVN         +V P W+   
Sbjct: 503 IKLEQSKAYQIAKSLGAEVTQELEDDTTHLVAVRPGTAKVNAGRRKKNLKIVTPDWLWCC 562

Query: 980 ALLYRRANEQDFAI 993
           A  +   +E+ FA+
Sbjct: 563 AERWEHVDEKIFAL 576


>gi|328859642|gb|EGG08750.1| hypothetical protein MELLADRAFT_115868 [Melampsora larici-populina
           98AG31]
          Length = 736

 Score = 78.2 bits (191), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 54/176 (30%), Positives = 83/176 (47%), Gaps = 44/176 (25%)

Query: 668 RRLEEQ--KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLF 725
           RRLE +   ++    KL L++DLD T++++     VDP                      
Sbjct: 257 RRLESETRSRLLKDTKLSLIVDLDQTIVHAT----VDPT--------------------- 291

Query: 726 RFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISR 785
               +G W    PG+  FL   ++ +EMH+YTMG + YA  + +++DP   LF  RV+SR
Sbjct: 292 ----VGEWI---PGLSEFLRTLAEKYEMHVYTMGTRAYADAVCRIIDPTSELFGSRVLSR 344

Query: 786 GDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
            + G            K L  +  ++ S VVIIDD   VW ++  NL+ V  Y +F
Sbjct: 345 DESGS--------MTQKSLTRLFPVDTSMVVIIDDRGDVWEYSP-NLVSVVPYNFF 391



 Score = 50.8 bits (120), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 29/81 (35%), Positives = 45/81 (55%), Gaps = 6/81 (7%)

Query: 915 VFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHP 973
           V+P+ +A P     W+ AEQFGA C   +  +VTH+VA       +N AL+  +  VV P
Sbjct: 525 VWPM-DAIPDQQDAWKLAEQFGAQCYTRLTPRVTHLVA----AKAINIALTRRKVSVVKP 579

Query: 974 GWVEASALLYRRANEQDFAIK 994
            W+  +  L+R  NE+++  K
Sbjct: 580 KWLYDAVTLWRHPNEEEYLWK 600


>gi|294898997|ref|XP_002776453.1| NLI interacting factor, putative [Perkinsus marinus ATCC 50983]
 gi|294900793|ref|XP_002777118.1| NLI interacting factor, putative [Perkinsus marinus ATCC 50983]
 gi|239883444|gb|EER08269.1| NLI interacting factor, putative [Perkinsus marinus ATCC 50983]
 gi|239884575|gb|EER08934.1| NLI interacting factor, putative [Perkinsus marinus ATCC 50983]
          Length = 370

 Score = 77.8 bits (190), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 73/300 (24%), Positives = 126/300 (42%), Gaps = 52/300 (17%)

Query: 728 PHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDP--KGVLFAGRVISR 785
           PH   + KLRPG+  FLE    ++E +++T   ++Y   + + LDP  KG      V SR
Sbjct: 29  PH---FVKLRPGVHQFLEALQPMYEFYIHTKATRVYLEYVMEALDPHKKGFFRNDNVFSR 85

Query: 786 GDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSR 844
            DD      +     +KD+  V       V+I+DD  ++W   + N+I    Y Y     
Sbjct: 86  CDDMKHGSNE-----NKDIRAVCSRPREEVIILDDKDKIWLDFQPNVIKCPPYKYMDQKL 140

Query: 845 RQF------------------GLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQ 886
            Q                   G   P   E+D   ++ DG L + + V   +H  +   +
Sbjct: 141 LQVVRALKQTSDWIKEGGPESGYPKP---ELDDASKNFDGYLPAMVRVFTEIHHRYC--K 195

Query: 887 SLDD-------VDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVC 939
           +L D       VDV+N++   +++    CRI+ +  F   EA+       +  E +G   
Sbjct: 196 ALVDAAGDGFVVDVKNVIDDTRKQTFKNCRIMLTG-FNQNEASERA----EMIEMYGGTV 250

Query: 940 TKHIDDQVTHVVANSLGTDKVNWALSTGR------FVVHPGWVEASALLYRRANEQDFAI 993
             +++D+ TH+V    GT K +      +       +VHP W++     ++  +E  F I
Sbjct: 251 INNVEDEPTHLVCAKGGTAKCHLIAKAMKEDSLKCKIVHPCWLDHGFATWKHPSEFIFEI 310


>gi|401827003|ref|XP_003887594.1| TFIIF-interacting CTD phosphatase [Encephalitozoon hellem ATCC
           50504]
 gi|392998600|gb|AFM98613.1| TFIIF-interacting CTD phosphatase [Encephalitozoon hellem ATCC
           50504]
          Length = 408

 Score = 77.4 bits (189), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 101/404 (25%), Positives = 163/404 (40%), Gaps = 122/404 (30%)

Query: 657 QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 716
           ++  +I KE+ + LE Q K      L LVLDLD T+L++  +   D              
Sbjct: 42  KEAVSIYKEKVKALEMQMK------LILVLDLDQTVLHTT-YGTSDC------------- 81

Query: 717 REKPHRHLFRFPHMGMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 774
                + + +F   G     KLRP +   L R SKL+E+H+YTMG + YA  +  ++DP 
Sbjct: 82  -----KGIVKFTMDGCKYSVKLRPHLNRMLRRVSKLYEIHVYTMGTRPYAERIIGIIDPA 136

Query: 775 GVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA---------VVIIDDSVRVWP 825
           G  F  R+I+R ++                +GVL    +         +VI+DD   VW 
Sbjct: 137 GKYFHDRIITRDEN----------------QGVLVKRLSRLFPYNHKNIVILDDRADVWD 180

Query: 826 HNKLNLIVVERYTYF---------PCSRRQFGL--------------------------- 849
           +N+ NL++V+ + YF            RR  G+                           
Sbjct: 181 YNE-NLVLVKPFWYFNRVDINDPSKLERRAEGVSDGHGDLGEFVGKRKKVEEGEDARIIS 239

Query: 850 ------LGPSL----LEIDHD---------ERSEDGTLASSLGVIERLHKIFFSHQSLDD 890
                 LG S      E+D D         E S+D  L   +  ++++HK +FS +    
Sbjct: 240 RPDGTGLGGSSGKPETEVDSDRMGEKKVREEPSDDCELLRMVKFLKKVHKKYFSSKHR-- 297

Query: 891 VDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHV 950
            +V+ IL   +++I  G      R F VG  N     L +  E  G + ++ ++ +V  V
Sbjct: 298 -NVKKILRRIRKRIFDG-----DRFFIVGPTNRAW--LVKVIEMHGGMVSR-LESEVDFV 348

Query: 951 VANSLGTDKVN-WALSTGRFVVHPGWVEASALLYRRANEQDFAI 993
           +++  G D+V   A      VV P W+E       R     + I
Sbjct: 349 ISS--GLDEVRELAQKLECLVVSPKWIEDCVYSLERVGYGRYVI 390


>gi|322785368|gb|EFZ12041.1| hypothetical protein SINV_00693 [Solenopsis invicta]
          Length = 759

 Score = 77.4 bits (189), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 53/177 (29%), Positives = 88/177 (49%), Gaps = 32/177 (18%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPH-RHLFRFPH 729
           E+++++   RKL L++DLD T++++                    D   P+ + +F F  
Sbjct: 149 EDEQRLLRDRKLVLLVDLDQTIVHTT------------------NDNIPPNLKDVFHFQL 190

Query: 730 MG-----MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 784
            G       T+LRP    FL + S L+E+H+ T G ++YA  +A +LD   VLF+ R++S
Sbjct: 191 YGPNSPWYHTRLRPNTRRFLSKMSSLYELHICTFGARIYAHTVASLLDKDKVLFSHRILS 250

Query: 785 RGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           R +  DP        K+ +L+ +    +  V IIDD   VW     NL+ V+ Y +F
Sbjct: 251 RDECFDP------ASKTANLKALFPCGDDLVCIIDDREDVW-QGCGNLVQVKPYHFF 300



 Score = 55.8 bits (133), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 35/125 (28%), Positives = 65/125 (52%), Gaps = 8/125 (6%)

Query: 874 VIERLHKIFFSHQSLDDVD----VRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLW 929
           ++ER+H+ F+  ++LD  +    +R+I+   + ++L G  + FS + P  +   H    +
Sbjct: 450 ILERIHREFY--KTLDQENTRKSLRDIIPRVRSQVLKGVCLTFSGLIPTHQ-KLHQSRAY 506

Query: 930 QTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANE 988
           + A  FGA  T+ + ++ TH+VA   GT K N A   G+  +V+  W+   A  + R  E
Sbjct: 507 KVARAFGAEVTQELTEKTTHLVAIRKGTAKANAAKKHGKIKIVNSDWLWTCAERWERVEE 566

Query: 989 QDFAI 993
             F +
Sbjct: 567 DLFQL 571


>gi|307168754|gb|EFN61749.1| RNA polymerase II subunit A C-terminal domain phosphatase
           [Camponotus floridanus]
          Length = 721

 Score = 77.4 bits (189), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 52/177 (29%), Positives = 87/177 (49%), Gaps = 32/177 (18%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPH-RHLFRFPH 729
           E+++++   RKL L++DLD T++++                    D   P+ + +F F  
Sbjct: 146 EDEQRLLKDRKLVLLVDLDQTIVHTT------------------NDNIPPNLKDVFHFQL 187

Query: 730 MG-----MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 784
            G       T+ RP    FL   S L+E+H+ T G ++YA  +A +LD  G+LF+ R++S
Sbjct: 188 YGPNSPWYHTRFRPNTRHFLSEMSHLYELHICTFGARIYAHTVASLLDKDGILFSHRILS 247

Query: 785 RGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           R +  DP        K+ +L+ +    +  V IIDD   VW     NL+ V+ Y +F
Sbjct: 248 RDECFDP------ASKTANLKALFPCGDDLVCIIDDREDVW-QGCGNLVQVKPYHFF 297



 Score = 52.8 bits (125), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 33/121 (27%), Positives = 61/121 (50%), Gaps = 4/121 (3%)

Query: 874 VIERLHKIFFS--HQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQT 931
           ++ R+H  F++   Q  D   +R+I+   + ++L G  + FS + P  +   H   +++ 
Sbjct: 448 ILRRIHTEFYAILDQGNDRKSLRDIIPRVRSQVLKGLCLTFSGLIPTHQ-KLHQSRVYKV 506

Query: 932 AEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQD 990
           A  FGA  T+ + ++ TH+VA   GT K N A       +V+P W+   A  +   +E+ 
Sbjct: 507 ARAFGAEITQDLTEKTTHLVAIRKGTAKANAARKDANIKIVNPEWLWTCAERWEHVDERL 566

Query: 991 F 991
           F
Sbjct: 567 F 567


>gi|350413080|ref|XP_003489872.1| PREDICTED: RNA polymerase II subunit A C-terminal domain
           phosphatase-like [Bombus impatiens]
          Length = 751

 Score = 77.4 bits (189), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 52/171 (30%), Positives = 88/171 (51%), Gaps = 20/171 (11%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 730
           E+++++ + RKL L++DLD T++++      D +   I        ++  H  L+     
Sbjct: 143 EDEQRLLNDRKLALLVDLDQTIVHTTN----DNIPSNI--------KDVYHYQLYGPNSP 190

Query: 731 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 790
              T+LRP    FL   S+L+E+H+ T G + YA  +A +LD  G LF+ R++SR +  D
Sbjct: 191 WYHTRLRPNTKHFLSEMSRLYELHICTFGARNYAHTVAALLDKDGTLFSHRILSRDECFD 250

Query: 791 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           P        K+ +L+ +    +  V IIDD   VW     NL+ V+ Y +F
Sbjct: 251 P------ASKTANLKALFPCGDDLVCIIDDREDVW-QGCGNLVQVKPYHFF 294



 Score = 47.8 bits (112), Expect = 0.034,   Method: Compositional matrix adjust.
 Identities = 31/121 (25%), Positives = 58/121 (47%), Gaps = 4/121 (3%)

Query: 874 VIERLHKIFFS--HQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQT 931
           ++ R+H  F++   +      +R+I+   + ++L G  + FS + P  +   H    ++ 
Sbjct: 444 ILRRIHTEFYTTIEKEGGRKSLRDIIPRVRAQVLKGVYLTFSGLIPTHQK-LHQSRAYKV 502

Query: 932 AEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQD 990
           A  FGA   + + D+ TH+VA   GT K N A       +V+P W+   A  +   +E+ 
Sbjct: 503 ARAFGAEVAQDLSDKTTHLVAIRPGTAKANAAKKNRNIKIVNPDWLWTCAERWEHVDERL 562

Query: 991 F 991
           F
Sbjct: 563 F 563


>gi|328792425|ref|XP_623605.2| PREDICTED: RNA polymerase II subunit A C-terminal domain
           phosphatase-like [Apis mellifera]
          Length = 745

 Score = 77.4 bits (189), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 52/171 (30%), Positives = 87/171 (50%), Gaps = 20/171 (11%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 730
           E+++++ + RKL L++DLD T++++     V P   ++            H  L+     
Sbjct: 143 EDEQRLLNDRKLALLVDLDQTIVHTTN-DNVPPNMKDVY-----------HYQLYGPNSP 190

Query: 731 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 790
              T+LRP    FL   S+L+E+H+ T G + YA  +A +LD  G LF+ R++SR +  D
Sbjct: 191 WYHTRLRPNTRHFLSEMSRLYELHICTFGARNYAHTVAALLDKDGTLFSHRILSRDECFD 250

Query: 791 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           P        K+ +L+ +    +  V IIDD   VW     NL+ V+ Y +F
Sbjct: 251 P------ASKTANLKALFPCGDDLVCIIDDREDVW-QGCGNLVQVKPYHFF 294



 Score = 45.1 bits (105), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 30/123 (24%), Positives = 58/123 (47%), Gaps = 4/123 (3%)

Query: 874 VIERLHKIFFSHQSLDD--VDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQT 931
           ++ R+H  F++    +     +R+I+   + ++L G  + FS + P  +   H    ++ 
Sbjct: 438 ILRRIHTEFYATMEKEGGRKSLRDIIPRVRAQVLKGVHLTFSGLIPTHQK-LHQSRAYKV 496

Query: 932 AEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQD 990
           A  FGA   + +  + TH+VA   GT K N A       +V+P W+   A  +   +E+ 
Sbjct: 497 ARAFGAEVAQDLSKKTTHLVAIRPGTAKANTAKKNPNIKIVNPDWLWTCAERWEHVDERL 556

Query: 991 FAI 993
           F +
Sbjct: 557 FPL 559


>gi|340709144|ref|XP_003393173.1| PREDICTED: LOW QUALITY PROTEIN: RNA polymerase II subunit A
           C-terminal domain phosphatase-like [Bombus terrestris]
          Length = 751

 Score = 77.4 bits (189), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 52/171 (30%), Positives = 88/171 (51%), Gaps = 20/171 (11%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 730
           E+++++ + RKL L++DLD T++++      D +   I        ++  H  L+     
Sbjct: 143 EDEQRLLNDRKLALLVDLDQTIVHTTN----DNIPSNI--------KDVYHYQLYGPNSP 190

Query: 731 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 790
              T+LRP    FL   S+L+E+H+ T G + YA  +A +LD  G LF+ R++SR +  D
Sbjct: 191 WYHTRLRPNTKHFLSEMSRLYELHICTFGARNYAHTVAALLDKDGTLFSHRILSRDECFD 250

Query: 791 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           P        K+ +L+ +    +  V IIDD   VW     NL+ V+ Y +F
Sbjct: 251 P------ASKTANLKALFPCGDDLVCIIDDREDVW-QGCGNLVQVKPYHFF 294



 Score = 47.4 bits (111), Expect = 0.039,   Method: Compositional matrix adjust.
 Identities = 31/121 (25%), Positives = 58/121 (47%), Gaps = 4/121 (3%)

Query: 874 VIERLHKIFFS--HQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQT 931
           ++ R+H  F++   +      +R+I+   + ++L G  + FS + P  +   H    ++ 
Sbjct: 444 ILRRIHTEFYATIEKEGGRKSLRDIIPRVRAQVLKGVYLTFSGLIPTHQK-LHQSRAYKV 502

Query: 932 AEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQD 990
           A  FGA   + + D+ TH+VA   GT K N A       +V+P W+   A  +   +E+ 
Sbjct: 503 ARAFGAEVAQDLSDKTTHLVAIRPGTAKANAAKKNRNIKIVNPDWLWTCAERWEHVDERL 562

Query: 991 F 991
           F
Sbjct: 563 F 563


>gi|380022133|ref|XP_003694908.1| PREDICTED: LOW QUALITY PROTEIN: RNA polymerase II subunit A
           C-terminal domain phosphatase-like [Apis florea]
          Length = 749

 Score = 77.0 bits (188), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 52/171 (30%), Positives = 87/171 (50%), Gaps = 20/171 (11%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 730
           E+++++ + RKL L++DLD T++++     V P   ++            H  L+     
Sbjct: 143 EDEQRLLNDRKLALLVDLDQTIVHTTN-DNVPPNMKDVY-----------HYQLYGPNSP 190

Query: 731 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 790
              T+LRP    FL   S+L+E+H+ T G + YA  +A +LD  G LF+ R++SR +  D
Sbjct: 191 WYHTRLRPNTRHFLSEMSRLYELHICTFGARNYAHTVAALLDKDGTLFSHRILSRDECFD 250

Query: 791 PFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           P        K+ +L+ +    +  V IIDD   VW     NL+ V+ Y +F
Sbjct: 251 P------ASKTANLKALFPCGDDLVCIIDDREDVW-QGCGNLVQVKPYHFF 294



 Score = 47.4 bits (111), Expect = 0.044,   Method: Compositional matrix adjust.
 Identities = 30/122 (24%), Positives = 59/122 (48%), Gaps = 5/122 (4%)

Query: 874 VIERLHKIFFS---HQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQ 930
           ++ R+H  F++    +  +   +R+I+   + ++L G  + FS + P  +   H    ++
Sbjct: 440 ILRRIHTEFYATIEKEGGNRKSLRDIIPRVRAQVLKGVHLTFSGLIPTHQK-LHQSRAYK 498

Query: 931 TAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQ 989
            A  FGA   + + ++ TH+VA   GT K N A       +V+P W+   A  +   +E+
Sbjct: 499 VARAFGAEVAQDLSEKTTHLVAIRPGTAKANTAKKNSNIKIVNPDWLWTCAERWEHVDER 558

Query: 990 DF 991
            F
Sbjct: 559 LF 560


>gi|326437795|gb|EGD83365.1| hypothetical protein PTSG_03974 [Salpingoeca sp. ATCC 50818]
          Length = 864

 Score = 77.0 bits (188), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 49/130 (37%), Positives = 67/130 (51%), Gaps = 17/130 (13%)

Query: 865 DGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPH 924
           DG   S+  ++ RLH+ F  +                  IL G RIVF+ V P G+ + +
Sbjct: 617 DGVAPSTKAILPRLHRKFPHYPPW---------------ILEGVRIVFTGVIPRGQ-SAY 660

Query: 925 LHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLY 983
            HP W+ A   GAV    +D++VTHVVA   GTDKV  A   G   VV+  W+EA A  +
Sbjct: 661 THPAWRMAVNMGAVVVDQVDERVTHVVARVDGTDKVRQARKMGGVHVVYLKWLEACASQH 720

Query: 984 RRANEQDFAI 993
           RR +E  FA+
Sbjct: 721 RRVDEALFAV 730



 Score = 71.2 bits (173), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 36/112 (32%), Positives = 61/112 (54%), Gaps = 14/112 (12%)

Query: 733 WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD----D 788
           +TK+RPG+  FLE    ++E+H+YTMG + YA E+  ++DP    F+ R++++ +    D
Sbjct: 32  YTKIRPGVKEFLEAVKDMYELHVYTMGTRAYAKEICNIIDPGAHYFSTRILTQDESARID 91

Query: 789 GDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
               + +   P+  D+         VVI+DD+  +W   + NLI    Y YF
Sbjct: 92  TKSINLNHLFPRGDDM---------VVILDDTAAMWDF-RPNLIPAAPYDYF 133


>gi|323508124|emb|CBQ67995.1| related to FCP1-TFIIF interacting component of CTD phosphatase
           [Sporisorium reilianum SRZ2]
          Length = 773

 Score = 76.6 bits (187), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 59/209 (28%), Positives = 95/209 (45%), Gaps = 43/209 (20%)

Query: 662 IQKERTRRL--EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRK-------- 711
           +  E  +RL  E    + S RKL L++DLD T++++     VDP   E +R         
Sbjct: 46  VSAEEAQRLDSESTSHLLSQRKLALIVDLDQTVIHAT----VDPTVGEWMRDESNPNYDA 101

Query: 712 ----------------KEEQDREKPH---RHLFRFPHMGMWTKLRPGIWTFLERASKLFE 752
                           K++ D   P      L        + K RPG+   L++ S+ ++
Sbjct: 102 LQSVGKFRLGIDGEEIKDDDDESAPRDSAAALRASRACWYYVKPRPGVPKVLKQLSEKYQ 161

Query: 753 MHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGME- 811
           +H+YTMG + YA  + K++DP   +F  R++SR ++G            K L  +  ++ 
Sbjct: 162 LHVYTMGTRSYANCVCKLIDPDASIFGNRILSRDENGSLV--------RKSLSRLFPVDH 213

Query: 812 SAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           S VVIIDD   VW  +  NL+ V  Y +F
Sbjct: 214 SMVVIIDDREDVWSRSP-NLLPVLPYEFF 241



 Score = 66.2 bits (160), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 42/107 (39%), Positives = 57/107 (53%), Gaps = 7/107 (6%)

Query: 893 VRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQV-THVV 951
           V ++++A +  +L GC IVFS + P G  +P    LW  A +FGA  +  I+  V THVV
Sbjct: 505 VVDLISARKAHVLQGCTIVFSSMIPFGH-DPEKSDLWAMAREFGATPSSEIEAGVTTHVV 563

Query: 952 ANSLGTDKVNWAL-----STGRFVVHPGWVEASALLYRRANEQDFAI 993
           A   GT KV+ AL     S G  VV P W   SA  + R +E  + I
Sbjct: 564 AARPGTAKVHQALRLAQKSAGLEVVWPSWFHVSASRWARQDEGLYRI 610


>gi|207342073|gb|EDZ69950.1| YMR277Wp-like protein [Saccharomyces cerevisiae AWRI1631]
          Length = 544

 Score = 76.6 bits (187), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 42/109 (38%), Positives = 62/109 (56%), Gaps = 10/109 (9%)

Query: 733 WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPF 792
           + K+RPG+  F  + + LFEMH+YTM  + YA ++AK++DP G LF  R++SR ++G   
Sbjct: 58  YVKVRPGLKEFFAKVAPLFEMHIYTMATRAYALQIAKIVDPTGELFGDRILSRDENGS-- 115

Query: 793 DGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
                   +K L  +    +S VV+IDD   VW     NLI V  Y +F
Sbjct: 116 ------LTTKSLAKLFPTDQSMVVVIDDRGDVWNWCP-NLIKVVPYNFF 157



 Score = 53.9 bits (128), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 35/147 (23%), Positives = 71/147 (48%), Gaps = 17/147 (11%)

Query: 859 HDER---SEDGTLASSLGVIERLHKIFF---SHQSLDDVDVRNILAAEQRKILAGCRIVF 912
           HD++    +D  L   +G +  +HK ++   S Q+  + ++  I+ + ++K+   C  VF
Sbjct: 264 HDQKLLVDDDDELYYLMGTLSNIHKTYYDMLSQQNEPEPNLMEIIPSLKQKVFQNCYFVF 323

Query: 913 SRVFPVG-EANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-- 969
           S + P+G +       +W +   FGA  T  ID   TH++  +  T K   A    +F  
Sbjct: 324 SGLIPLGTDIQRSDIVIWTST--FGATSTPDIDYLTTHLITKNPSTYKARLA---KKFNP 378

Query: 970 ---VVHPGWVEASALLYRRANEQDFAI 993
              +VHP W+    + +++ +E+ + +
Sbjct: 379 QIKIVHPDWIFECLVNWKKVDEKPYTL 405


>gi|354545519|emb|CCE42247.1| hypothetical protein CPAR2_807960 [Candida parapsilosis]
          Length = 786

 Score = 76.3 bits (186), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 65/202 (32%), Positives = 93/202 (46%), Gaps = 38/202 (18%)

Query: 658 QKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDP-------------- 703
           + A I+   T RL ++KK      L LV+DLD T++++     VDP              
Sbjct: 148 EAAKIEHSTTDRLNDEKK------LILVVDLDQTVIHAT----VDPTVGEWQSDPSNPNY 197

Query: 704 --VHDEILRKKEEQDREKPHRHLFRFPHMGMW--TKLRPGIWTFLERASKLFEMHLYTMG 759
             V D      EE     P     +      W   K+RPG+  FLE+    +EMH+YTM 
Sbjct: 198 PAVKDVKTFCLEEDPIVPPGWTGPKLAPTKCWYYVKVRPGLSEFLEKMDTKYEMHIYTMA 257

Query: 760 NKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIID 818
            + YA  +AK++DP G  F  R++SR + G            K+L+ +  + +S VVIID
Sbjct: 258 TRNYALAIAKIIDPDGKYFGDRILSRDESGS--------LTHKNLKRLFPVDQSMVVIID 309

Query: 819 DSVRVWPHNKLNLIVVERYTYF 840
           D   VW     NLI V  Y +F
Sbjct: 310 DRGDVWQWEN-NLIKVVPYDFF 330



 Score = 42.4 bits (98), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 34/147 (23%), Positives = 66/147 (44%), Gaps = 20/147 (13%)

Query: 864 EDGTLASSLGVIERLHKIFFSHQSLDDV---------DVRNILAAEQRKILAGCRIVFSR 914
           +D  L S + V++ +H+ +++    DD          D+  I+   + + L G  I+FS 
Sbjct: 470 DDNELTSLMKVLDNVHEEYYN--LYDDTKAKLHSIKPDLTKIIPRMKSQCLQGVTILFSG 527

Query: 915 VFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTD-------KVNWALSTG 967
           + P+G  N     +    +QFG      +  +VTHVV   +  +       +V   L   
Sbjct: 528 IIPLG-VNIDSADIVIWCKQFGVKVVNEVYPEVTHVVCRDVSPEAGPTFKARVAHKLYPD 586

Query: 968 RF-VVHPGWVEASALLYRRANEQDFAI 993
              +V+P W+ +   L+ + +E+D+ I
Sbjct: 587 SIKIVNPDWLFSCLSLWTKVDEEDYLI 613


>gi|170084539|ref|XP_001873493.1| predicted protein [Laccaria bicolor S238N-H82]
 gi|164651045|gb|EDR15285.1| predicted protein [Laccaria bicolor S238N-H82]
          Length = 845

 Score = 75.9 bits (185), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 44/109 (40%), Positives = 60/109 (55%), Gaps = 10/109 (9%)

Query: 733 WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPF 792
           + K RPG   FL+ AS  +EMH+YTMG + YA ++   +DP G LF GRV+SR + G   
Sbjct: 261 YIKPRPGWKEFLQEASTKYEMHVYTMGTRAYAEQVCAAIDPDGKLFGGRVLSRDESGS-- 318

Query: 793 DGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
                    K L+ +   + S VVIIDD   VW  +  NL+ V  Y +F
Sbjct: 319 ------LTQKSLQRLFPCDTSMVVIIDDRADVWEWSP-NLLKVVPYDFF 360



 Score = 59.3 bits (142), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 32/103 (31%), Positives = 52/103 (50%), Gaps = 2/103 (1%)

Query: 892 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVV 951
           DV  I+   + ++L G  I+FS V P+ +  P    +W+ A  FGA C+  +   +THVV
Sbjct: 532 DVTRIIPRLRSEVLEGVHILFSSVIPL-DTKPETTEIWRMAHMFGARCSTELTSDITHVV 590

Query: 952 ANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAI 993
           A   GT KV+ A   G   +V   W       ++R +E+ + +
Sbjct: 591 AAKRGTVKVDMARKRGGIKIVWLAWFTDCIASWQRQDEKPYLL 633


>gi|392597598|gb|EIW86920.1| hypothetical protein CONPUDRAFT_95946 [Coniophora puteana
           RWD-64-598 SS2]
          Length = 830

 Score = 75.5 bits (184), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 64/259 (24%), Positives = 104/259 (40%), Gaps = 74/259 (28%)

Query: 647 VEHLFEGYDDQQKAAIQK-----------ERTRRLEEQ--KKMFSARKLCLVLDLDHTL- 692
            +H + G+ +  +A+IQ            E  +R+E +  + +  +RKL L++DLD T+ 
Sbjct: 113 TDHDYTGFSNASRASIQMTHSAFGPTVSLEEAQRIERETAEHLLKSRKLSLIVDLDQTIV 172

Query: 693 -----------LNSAKFHEVDPVHDEILRKKEE--------------------------- 714
                      +N  K  E   +  +  R + +                           
Sbjct: 173 HATVDPTVGEWINEGKQWEQKHIQKQKARDERKDGSDSDGTASSDEDDCNPNWDALKDVK 232

Query: 715 ------------QDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKL 762
                       Q +++  + L        + K RPG   F +  SK +EMH+YTMG + 
Sbjct: 233 SFRLGPESFVMPQSQKRGKQKLIENDGCLYYVKPRPGWKEFFQELSKKYEMHVYTMGTRA 292

Query: 763 YATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSV 821
           YA E+   +DP   +F GR++SR + G            K L+ +   + S VVIIDD  
Sbjct: 293 YAEEVCAAIDPDSKIFGGRILSRDESGS--------LTQKSLQRLFPCDTSMVVIIDDRA 344

Query: 822 RVWPHNKLNLIVVERYTYF 840
            VW  +  NLI V  Y +F
Sbjct: 345 DVWEWSP-NLIKVIPYDFF 362



 Score = 70.1 bits (170), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 46/151 (30%), Positives = 70/151 (46%), Gaps = 19/151 (12%)

Query: 862 RSEDGTLASSLGVIERLHKIFFS-----------------HQSLDDVDVRNILAAEQRKI 904
           +++D  L    G++E +H  FFS                  +S    DV +I+   + K+
Sbjct: 488 KNDDTELKRVGGLLEDIHARFFSLYDSRSTDVSSKKRRHGSKSSKSHDVTHIIPEIRAKV 547

Query: 905 LAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWAL 964
             G  I+FS V P+ +  P    +W+ A  FGA C   +   +THVVA  LGT KV+ A 
Sbjct: 548 FDGVHILFSSVIPL-DTPPETTEIWKVAHMFGAKCYTELSSSITHVVAARLGTVKVDAAR 606

Query: 965 STGRF-VVHPGWVEASALLYRRANEQDFAIK 994
             G   VV   W   S  L+RR +E  + ++
Sbjct: 607 RRGGIKVVWVAWFTDSVALWRRMDESHYYLE 637


>gi|448520991|ref|XP_003868400.1| Fcp1 protein [Candida orthopsilosis Co 90-125]
 gi|380352740|emb|CCG25496.1| Fcp1 protein [Candida orthopsilosis]
          Length = 788

 Score = 75.5 bits (184), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 64/202 (31%), Positives = 93/202 (46%), Gaps = 38/202 (18%)

Query: 658 QKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDP-------------- 703
           + A I+   T RL E++K      L LV+DLD T++++     VDP              
Sbjct: 148 EAAKIEHSTTDRLNEEEK------LILVVDLDQTVIHAT----VDPTVGEWQSDPSNPNY 197

Query: 704 --VHDEILRKKEEQDREKPHRHLFRFPHMGMW--TKLRPGIWTFLERASKLFEMHLYTMG 759
             V D      EE     P     +      W   K+RPG+  FL++    +EMH+YTM 
Sbjct: 198 PAVKDVKTFCLEEDPIVPPGWTGPKLAPTKCWYYVKVRPGLSEFLQKMDTKYEMHIYTMA 257

Query: 760 NKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIID 818
            + YA  +AK++DP G  F  R++SR + G            K+L+ +  + +S VVIID
Sbjct: 258 TRNYALAIAKIIDPDGKYFGDRILSRDESGS--------LTHKNLKRLFPVDQSMVVIID 309

Query: 819 DSVRVWPHNKLNLIVVERYTYF 840
           D   VW     NLI V  Y +F
Sbjct: 310 DRGDVWQWEN-NLIKVVPYDFF 330


>gi|299756470|ref|XP_002912206.1| RNA polymerase II subunit A domain phosphatase [Coprinopsis cinerea
           okayama7#130]
 gi|298411691|gb|EFI28712.1| RNA polymerase II subunit A domain phosphatase [Coprinopsis cinerea
           okayama7#130]
          Length = 801

 Score = 75.5 bits (184), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 44/109 (40%), Positives = 60/109 (55%), Gaps = 10/109 (9%)

Query: 733 WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPF 792
           + K RPG   FLE A+K +EMH+YTMG + YA E+   +DP G LF  R++SR + G   
Sbjct: 270 YIKPRPGWKEFLENAAKKYEMHVYTMGTRAYAQEVCAAIDPDGKLFGSRLLSRDESGS-- 327

Query: 793 DGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
                    K L+ +   + S VVIIDD   VW  +  NL+ V  Y +F
Sbjct: 328 ------LTQKSLQRLFPCDTSMVVIIDDRADVWEWSP-NLLKVIPYDFF 369


>gi|209879341|ref|XP_002141111.1| NLI interacting factor-like phosphatase family protein
           [Cryptosporidium muris RN66]
 gi|209556717|gb|EEA06762.1| NLI interacting factor-like phosphatase family protein
           [Cryptosporidium muris RN66]
          Length = 590

 Score = 75.5 bits (184), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 92/365 (25%), Positives = 148/365 (40%), Gaps = 62/365 (16%)

Query: 674 KKMFSARKLCLVLDLDHTLL---NSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP-- 728
           K   S  KL  +LDLD+TLL   NS K      + D I    E      P  + F  P  
Sbjct: 161 KDHLSQNKLVAILDLDNTLLHAYNSTKVGCNINLEDFIGANGE------PEMYKFVLPQD 214

Query: 729 -HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD 787
            +   + KLRPG+  FL   +  + M + T   + YA  +  VLDPK   F  R+++R  
Sbjct: 215 MNTPYYLKLRPGVREFLNTIAPYYIMGICTNATREYADVIRAVLDPKRDKFGDRIVAR-- 272

Query: 788 DGDPFDGDERVPKSKDLEGV-LGMES-AVVIIDDSVRVWPHN-KLNLIVVERYTYFPCSR 844
             +  DG +     KD + + +G+++ A+V++DD   VW  + ++ ++  + Y YF   R
Sbjct: 273 --ENVDGRD---TQKDFKKICIGIDTRAIVLLDDRSDVWDSSLEIQVVKAQTYEYFEHRR 327

Query: 845 RQF----------------------GLLGPSLLEIDHDE-----RSEDGTLASSLGVIER 877
                                     +L  +L  + +          D  L   + V + 
Sbjct: 328 DALRAHYPPLTSNSASMGAAGSAPGDILSAALSSLSNASGGNTIADYDRHLDYLIKVFKE 387

Query: 878 LHKIFFSHQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGE--ANPHLHPLWQTAE-- 933
           LH  FF   S +   V NIL   + +ILA C I F+      E      L   W  ++  
Sbjct: 388 LHVRFFC--SPETASVGNILQTMKSEILANCVICFTGFLKADEKPTTKSLPSTWGDSQAE 445

Query: 934 ------QFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRA 986
                 + GA   ++++   TH+V     T K + A +     +VH  W+ A    ++R 
Sbjct: 446 AEAASIRLGAKVEENLESMATHLVVQKTNTAKFHQAKNNPDIKIVHTLWLWACDGQWQRV 505

Query: 987 NEQDF 991
            E  F
Sbjct: 506 PESLF 510


>gi|242015474|ref|XP_002428378.1| RNA polymerase II ctd phosphatase, putative [Pediculus humanus
           corporis]
 gi|212512990|gb|EEB15640.1| RNA polymerase II ctd phosphatase, putative [Pediculus humanus
           corporis]
          Length = 781

 Score = 75.1 bits (183), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 52/172 (30%), Positives = 87/172 (50%), Gaps = 24/172 (13%)

Query: 672 EQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMG 731
           ++ ++ + RKL L++DLD TL+++     + P   ++              H   +  M 
Sbjct: 133 DENRLLNDRKLVLLVDLDQTLIHTTN-DNIPPNLKDVY-------------HFRLYGQMS 178

Query: 732 MW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 789
            W  T++RP    FLE  SK +E+H+ T G + YA  +A  LDP G  F+ R++SR    
Sbjct: 179 PWYHTRIRPRTHKFLEEISKYYELHICTFGARNYAHMIAMFLDPDGKYFSHRILSR---D 235

Query: 790 DPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           + F+ +    K+ +L+ +    ++ V IIDD   VW     NLI V+ Y +F
Sbjct: 236 ECFNAN---SKTANLKALFPCGDNMVCIIDDREDVWNF-AANLIHVKPYHFF 283



 Score = 47.4 bits (111), Expect = 0.037,   Method: Compositional matrix adjust.
 Identities = 32/129 (24%), Positives = 59/129 (45%), Gaps = 18/129 (13%)

Query: 874 VIERLHKIFFSHQSLDDV--------DVRNILAAEQRKILAGCRIVFSRV----FPVGEA 921
           +++ +HK ++                D++ ++   ++  L GC +VFS +     P+ E+
Sbjct: 482 ILKNIHKAYYEEYDFQKKINSNCVIPDLKTVIPNFKKNTLKGCHLVFSGLVPSHIPLQES 541

Query: 922 NPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALS-TGRFVVHPGWVEASA 980
             +L      A   GA+ +  I    TH+VA   GT KVN +    G F+V P W+   A
Sbjct: 542 RAYL-----VAISLGAIVSADISSNCTHLVAARPGTAKVNSSRRHKGIFIVTPLWLWHCA 596

Query: 981 LLYRRANEQ 989
             + + +E+
Sbjct: 597 ERWEKVDEK 605


>gi|313234471|emb|CBY24671.1| unnamed protein product [Oikopleura dioica]
          Length = 614

 Score = 75.1 bits (183), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 52/181 (28%), Positives = 90/181 (49%), Gaps = 31/181 (17%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 730
           EE +++   RKL L++DLD T++++ +        + I  +   QD              
Sbjct: 62  EEIQRLHDNRKLVLLVDLDQTVIHTTQNRPKKLTKNTISFQLTRQD-------------P 108

Query: 731 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKV----------LDPKGVLFAG 780
            +WT+LRP    F+   S+ +E+H+ T G++ YA ++A++          LD     F+ 
Sbjct: 109 WLWTRLRPFCAKFIHEMSEKYELHIVTFGSRQYAHKIAEILEDQTRRQLNLDSNKSFFSH 168

Query: 781 RVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTY 839
           R++SR +  DPF       KS +LE +    +S   IIDD   VW ++  N I+V++Y +
Sbjct: 169 RILSRDECVDPF------HKSGNLEHLFPCGDSMCAIIDDRGDVWRYSP-NCILVKKYHF 221

Query: 840 F 840
           F
Sbjct: 222 F 222



 Score = 65.1 bits (157), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 38/103 (36%), Positives = 52/103 (50%), Gaps = 2/103 (1%)

Query: 892 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVV 951
           DVRN+    ++ IL GC++VFS V P G      H   + A   GAV  + I    TH++
Sbjct: 368 DVRNVCVGIRKNILKGCQLVFSGVVPNG-CRMEEHRAVKNARAMGAVIHERIQKNTTHLI 426

Query: 952 ANSLGTDKVNWALSTGR-FVVHPGWVEASALLYRRANEQDFAI 993
               GT K N A      FVV+P W+  +   +RR +E DF I
Sbjct: 427 CARPGTAKHNEAKRKANVFVVNPAWLWVTYHEWRRQDESDFEI 469


>gi|402582824|gb|EJW76769.1| hypothetical protein WUBG_12323 [Wuchereria bancrofti]
          Length = 334

 Score = 75.1 bits (183), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 56/193 (29%), Positives = 98/193 (50%), Gaps = 16/193 (8%)

Query: 814 VVIIDDSVRVWPHNKLNLIVVERYTYFP------CSRRQFG--LLGPSLLEIDHDERSED 865
           +V+IDD   VW ++   LI V+ Y +F         R + G  +L  S  E D  E  +D
Sbjct: 18  IVMIDDRPDVWQYSDA-LIQVKPYRFFKEIGDINAPRYEKGEPILSSSYAEQDM-ESEDD 75

Query: 866 GTLASSLGVIERLHKIFFSHQSLDDV----DVRNILAAEQRKILAGCRIVFSRVFPVGEA 921
            TL     V+ ++H  F+       V    D++ I++  ++++L  C IV S + PVG  
Sbjct: 76  ETLEYVALVLTKVHSAFYELFDSAKVNRFPDLKGIISYLRKQVLRECSIVLSGIVPVG-V 134

Query: 922 NPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGR-FVVHPGWVEASA 980
           +     +++   QFGAV T ++++  THV+A   GT KV+ A      ++V+P W+    
Sbjct: 135 DIKKTEVYRLCVQFGAVVTDNVNNSTTHVIAARWGTTKVHEAEKLRNIYIVNPKWLFTCV 194

Query: 981 LLYRRANEQDFAI 993
             + +A+E++F +
Sbjct: 195 ERWEKADEKEFEL 207


>gi|156087501|ref|XP_001611157.1| protein phosphatase family protein [Babesia bovis]
 gi|154798411|gb|EDO07589.1| protein phosphatase family protein [Babesia bovis]
          Length = 806

 Score = 75.1 bits (183), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 39/109 (35%), Positives = 63/109 (57%), Gaps = 11/109 (10%)

Query: 733 WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPF 792
           + KLRPG++ FL R+++L+E++L+TMG + +A    K+LDP G  F  RV SR +  + F
Sbjct: 339 YYKLRPGVYDFLRRSAELYELYLFTMGTRAHANAALKILDPDGKYFGARVFSRSETNNCF 398

Query: 793 DGDERV-PKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
               R+ PK ++          ++I+DDS  +W  +   LI V  Y +F
Sbjct: 399 KSLCRIFPKYRN---------HLLILDDSENIWL-DAPGLIKVYPYYFF 437


>gi|156381374|ref|XP_001632240.1| predicted protein [Nematostella vectensis]
 gi|156219293|gb|EDO40177.1| predicted protein [Nematostella vectensis]
          Length = 122

 Score = 74.3 bits (181), Expect = 3e-10,   Method: Composition-based stats.
 Identities = 44/112 (39%), Positives = 64/112 (57%), Gaps = 10/112 (8%)

Query: 732 MW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 789
           MW  TK RP    FL++ +K +E+H++TMG ++YA  +A++LDP   LF  R+ SR D  
Sbjct: 1   MWYHTKFRPWAHKFLQKIAKFYELHIFTMGTRMYAHTIARMLDPDLSLFGYRIRSRDDCF 60

Query: 790 DPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           + F       K  DL  +    +S V IIDD   VW +N  +LI V+ Y +F
Sbjct: 61  NAFS------KFNDLRSLFPCGDSMVCIIDDRADVW-NNAPSLIKVKPYQFF 105


>gi|389751366|gb|EIM92439.1| hypothetical protein STEHIDRAFT_136328 [Stereum hirsutum FP-91666
           SS1]
          Length = 1075

 Score = 73.9 bits (180), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 42/109 (38%), Positives = 58/109 (53%), Gaps = 10/109 (9%)

Query: 733 WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPF 792
           + K RPG   FL   ++ +EMH+YTMG + YA E+   +DP G  F GR++SR + G   
Sbjct: 307 YVKPRPGTREFLSSVAEKYEMHVYTMGTRAYAEEVCAAIDPDGKFFGGRILSRDESGS-- 364

Query: 793 DGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
                    K L  +  ++ S VVIIDD   VW  +  NLI V  Y +F
Sbjct: 365 ------MTQKSLRRLFPVDTSMVVIIDDRADVWEWSP-NLIKVIPYDFF 406



 Score = 58.5 bits (140), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 33/101 (32%), Positives = 48/101 (47%), Gaps = 2/101 (1%)

Query: 892 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVV 951
           D   I+   +   L G  I+FS V P+ +  P    +W+ A  FGA C   +  ++THVV
Sbjct: 659 DATAIIPQLRFNTLFGVHILFSSVIPL-DTRPETTEVWRLAHAFGAKCYTELSSKITHVV 717

Query: 952 ANSLGTDKVNWALSTGR-FVVHPGWVEASALLYRRANEQDF 991
           A   GT KV+ A   G   +V P W   S   + R +E  +
Sbjct: 718 AAKRGTVKVDQARKRGNILIVWPAWFTDSIAKWERQDETPY 758


>gi|390604450|gb|EIN13841.1| hypothetical protein PUNSTDRAFT_95201 [Punctularia strigosozonata
           HHB-11173 SS5]
          Length = 1229

 Score = 73.6 bits (179), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 43/109 (39%), Positives = 60/109 (55%), Gaps = 10/109 (9%)

Query: 733 WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPF 792
           + K RPG   FL   S+ +EMH+YTMG + YA E+ K +DP+G +F  R++SR + G   
Sbjct: 618 YIKPRPGWHEFLHTLSEKYEMHVYTMGTRAYAEEVCKAIDPEGQIFGNRILSRDESGS-- 675

Query: 793 DGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
                    K L+ +   + S VVIIDD   VW  +  NLI V  Y +F
Sbjct: 676 ------LTQKSLQRLFPCDTSMVVIIDDRADVWEWSP-NLIKVIPYDFF 717



 Score = 63.9 bits (154), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 46/160 (28%), Positives = 76/160 (47%), Gaps = 28/160 (17%)

Query: 862  RSEDGTLASSLGVIERLHKIFF-----SHQSLDDV---------------DVRNILAAEQ 901
            R++D  L     ++E++H  F+     S +  D++               DV+ I+   +
Sbjct: 855  RNDDVELQRVRRLLEQVHSTFYADYDASKKKTDNLSVPKRKQRTQETLPYDVKLIIPRLR 914

Query: 902  RKILAGCRIVFSRVFPVGEAN----PHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLG- 956
              +  G  +VFS V P+   N    P    +W+TA +FGA C K +++ VTHVVA   G 
Sbjct: 915  SNVFDGVHLVFSGVIPLESNNMPIDPEATEIWRTAIRFGARCYKEMNEAVTHVVATPEGV 974

Query: 957  ---TDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFAI 993
               T K+N A + G  VV   W   S  L++R +E+ ++ 
Sbjct: 975  ERRTSKINTARARGIPVVSISWFHDSVALWQRLDERPYSF 1014


>gi|323453463|gb|EGB09334.1| putative formate/nitrite transporter [Aureococcus anophagefferens]
          Length = 1144

 Score = 73.2 bits (178), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 77/268 (28%), Positives = 120/268 (44%), Gaps = 53/268 (19%)

Query: 658 QKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDR 717
           + + +  +R+R+LE +      R+L LVLDLDHTLL  +     DP    +      + R
Sbjct: 331 EASVLAAQRSRQLEGK------RQLQLVLDLDHTLLECS----TDPRAAALAAAPGSRVR 380

Query: 718 E------KPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVL 771
                  +PH           W +LRP +  F    + L+E+ +YT G++ YA  +   L
Sbjct: 381 ALGAVAGRPH-----------WVRLRPRLEEFFAAVAPLYELAIYTHGSRQYAEAVRAAL 429

Query: 772 DPK--GVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVL-GMESAVVIIDDSVRVWPHNK 828
           + +  G+ F GRV+SR  D  P   D R  KS  LE +  G  +  +I+DD + VW   +
Sbjct: 430 EAEVPGLSFGGRVVSR--DCCP---DLRGEKS--LERLFPGGAARALILDDRLDVWTRGE 482

Query: 829 ---LNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSH 885
                ++VV+ YTYF          G +L +  H +   D  L+ S   +   H  F++ 
Sbjct: 483 DQTPRVLVVQPYTYF----------GKALADPAHAD--GDSQLSHSARALVAAHAAFYAG 530

Query: 886 QSLDDVDVRNILAAEQRKILAGCRIVFS 913
                 D    L A +  + AGC   FS
Sbjct: 531 GGASG-DAVACLDAARGAVFAGCVFSFS 557


>gi|403416935|emb|CCM03635.1| predicted protein [Fibroporia radiculosa]
          Length = 580

 Score = 72.8 bits (177), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 40/105 (38%), Positives = 57/105 (54%), Gaps = 2/105 (1%)

Query: 892 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVV 951
           DVR I+   +   LAG  IVFS V P+ +  P    +W+TA  FGA C   + ++VTHVV
Sbjct: 268 DVRIIIPRMRMDTLAGVHIVFSSVIPL-DTRPEATEIWRTAHAFGAKCYTELSNRVTHVV 326

Query: 952 ANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAIKP 995
           A   GT KV+ A  +G   +V   W   S  L++R +E  + + P
Sbjct: 327 AAKRGTQKVDAARRSGGIKIVWLSWFTDSVALWKRQDETPYLMDP 371



 Score = 48.9 bits (115), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 30/84 (35%), Positives = 43/84 (51%), Gaps = 10/84 (11%)

Query: 758 MGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVI 816
           MG + YA E+   +DP+G  F GR++SR + G            K L+ +    +S VVI
Sbjct: 1   MGTRAYAEEVCAAIDPEGKFFGGRLLSRDESGS--------LTQKSLQRLFPTDQSMVVI 52

Query: 817 IDDSVRVWPHNKLNLIVVERYTYF 840
           IDD   VW  +  NL+ V  Y +F
Sbjct: 53  IDDRADVWEWSP-NLVKVIPYDFF 75


>gi|449551315|gb|EMD42279.1| hypothetical protein CERSUDRAFT_148004 [Ceriporiopsis subvermispora
           B]
          Length = 875

 Score = 72.8 bits (177), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 41/109 (37%), Positives = 59/109 (54%), Gaps = 10/109 (9%)

Query: 733 WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPF 792
           + K RPG   FL+  +  +EMH+YTMG + YA E+   +DP G +F GR++SR + G   
Sbjct: 264 YIKPRPGWQDFLQDMATKYEMHVYTMGTRAYAEEVCATIDPDGKIFGGRLLSRDESGS-- 321

Query: 793 DGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
                    K L+ +    +S VVIIDD   VW  +  NL+ V  Y +F
Sbjct: 322 ------LTQKSLQRLFPCDQSMVVIIDDRADVWEWSP-NLVKVIPYDFF 363



 Score = 69.7 bits (169), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 38/105 (36%), Positives = 57/105 (54%), Gaps = 2/105 (1%)

Query: 892 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVV 951
           DVR I+ + + K L G  I+FS V P+ +  P +  +W+TA  FGA C   +  ++THVV
Sbjct: 555 DVRTIIPSIRMKALEGVHILFSSVIPL-DTRPEVTEVWRTAHAFGAQCHTELSSRITHVV 613

Query: 952 ANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAIKP 995
           A   GT KV+ A   G   +V   W   S  L++R +E  + + P
Sbjct: 614 AAKRGTVKVDAARKQGGIKIVWLSWFTDSIALWQRQDETPYLMDP 658


>gi|221486680|gb|EEE24941.1| RNA polymerase II phosphatase, putative [Toxoplasma gondii GT1]
          Length = 1234

 Score = 72.4 bits (176), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 66/232 (28%), Positives = 102/232 (43%), Gaps = 25/232 (10%)

Query: 621 DDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKAAIQKERTRRLEEQKKMF--- 677
           DD   G     EA PV    + A   V  +    D   K A + ++ RR +E++  +   
Sbjct: 510 DDAGLGASRDEEAKPVEREAREALQTVGDMDAERDVDGKEADRTKQDRRGKEEETEYPHY 569

Query: 678 --SARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRH------LFRFPH 729
              A+   +   L+ +++      E  PV     R+  E  R    RH      L    H
Sbjct: 570 REDAQGEAIRHTLEKSVIGVRVAVEASPVG--ASRRAREGWRRPASRHEAGALPLPNEQH 627

Query: 730 MGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 789
           +  + KLRPG   FL R S+ FE+++YTMG  L+A    ++LDPK   F  RV SR D  
Sbjct: 628 ITFF-KLRPGCLDFLRRVSQTFELYMYTMGTALHAATALRILDPKRRFFGRRVFSRQDAV 686

Query: 790 DPFDGDERV-PKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           +     ER+ P  + +         V+++DD   +W ++    I V+ Y YF
Sbjct: 687 NGLKAIERIFPHDQKM---------VLVVDDLECMWSYSPC-CIKVQGYHYF 728


>gi|294868642|ref|XP_002765622.1| hypothetical protein Pmar_PMAR013688 [Perkinsus marinus ATCC 50983]
 gi|239865701|gb|EEQ98339.1| hypothetical protein Pmar_PMAR013688 [Perkinsus marinus ATCC 50983]
          Length = 956

 Score = 72.4 bits (176), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 57/186 (30%), Positives = 84/186 (45%), Gaps = 22/186 (11%)

Query: 674 KKMFSARKLCLVLDLDHTLLNSAK------FHEVDPVHDEILRKKEEQDREKPHRHLFRF 727
           + + S+++L  VLD+DHT+L+         F +V   +    R     D EK ++     
Sbjct: 488 RTLASSKRLVAVLDIDHTILHVTNKRIDLLFPDVTCYNLAPNRDTGRLDEEKVYQFFIGT 547

Query: 728 P---HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAG--RV 782
                   + KLRPG +TFLE    L+E++LYT G + YA  + K LDP    F    R+
Sbjct: 548 SPTTTACCYLKLRPGFYTFLEEILPLYELYLYTHGTREYAIRLLKALDPSARYFGSPPRL 607

Query: 783 ISRGDDGDPFDGDERVPKSKDLEGVL-GMESAVVIIDDSVRVW--PHNKLNLIVVERYTY 839
           I+R          +     K L  +        VI+DD   VW    N+ +LI V  Y +
Sbjct: 608 IAR--------PTQSALTCKTLSRIFPSNHRLAVIVDDRDDVWEAKDNEHSLIKVTPYVF 659

Query: 840 FPCSRR 845
           FP S R
Sbjct: 660 FPDSER 665


>gi|393225696|gb|EJD33619.1| HAD-like protein, partial [Auricularia delicata TFB-10046 SS5]
          Length = 155

 Score = 72.4 bits (176), Expect = 1e-09,   Method: Composition-based stats.
 Identities = 52/171 (30%), Positives = 86/171 (50%), Gaps = 25/171 (14%)

Query: 672 EQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQ-DREKPHRHLFRFPHM 730
           E++++   RKL LV+DLD+T+           VH  ++R  +E+  R + H H       
Sbjct: 3   ERERLLGCRKLSLVVDLDNTI-----------VHTIVVRTDDERMARMQDHNH----GST 47

Query: 731 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD-DG 789
                 RPG+  FL+  S+ +E  +YTMG + YA ++   +D    +F GR+ SR + +G
Sbjct: 48  TFTGSCRPGLRAFLQTISEKYEPTVYTMGTRGYAEKVCAAVDGDERVFGGRIFSRDENEG 107

Query: 790 DPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           +      R+    D       +S   IIDDS +VW  +K N++ V+ Y +F
Sbjct: 108 NSTKSLSRLFPPCD-------KSMTAIIDDSRKVW-EDKKNIVSVQPYVFF 150


>gi|303389951|ref|XP_003073207.1| Fcp1-like phosphatase [Encephalitozoon intestinalis ATCC 50506]
 gi|303302352|gb|ADM11847.1| Fcp1-like phosphatase [Encephalitozoon intestinalis ATCC 50506]
          Length = 407

 Score = 72.4 bits (176), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 56/195 (28%), Positives = 91/195 (46%), Gaps = 53/195 (27%)

Query: 657 QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 716
           ++  +I KE+ + LE Q K+       LVLDLD T+L++   +    +H  +        
Sbjct: 42  KEAVSIYKEKMKTLETQMKLI------LVLDLDQTILHTT--YGESRIHGTV-------- 85

Query: 717 REKPHRHLFRFPHMG--MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 774
                    RF   G     KLRP +   L + S+L+E+H+YTMG + YA  +  ++DP 
Sbjct: 86  ---------RFIMDGSKYCVKLRPNLDHMLRKISRLYEIHVYTMGTRAYAERIVGIVDPS 136

Query: 775 GVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA---------VVIIDDSVRVWP 825
           G  F  R+I+R ++                EGVL    +         +VI+DD   VW 
Sbjct: 137 GKYFQDRIITRDEN----------------EGVLVKRLSRLFPHNHKNIVILDDRPDVWD 180

Query: 826 HNKLNLIVVERYTYF 840
           +++ NL++V  + YF
Sbjct: 181 YSE-NLLLVRPFWYF 194


>gi|170036997|ref|XP_001846347.1| RNA polymerase II subunit A C-terminal domain phosphatase [Culex
           quinquefasciatus]
 gi|167879975|gb|EDS43358.1| RNA polymerase II subunit A C-terminal domain phosphatase [Culex
           quinquefasciatus]
          Length = 764

 Score = 72.4 bits (176), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 52/170 (30%), Positives = 87/170 (51%), Gaps = 20/170 (11%)

Query: 672 EQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMG 731
           + +++   RKL L++DLD TL+++      D V + +        ++  H  L+      
Sbjct: 135 DTERLLRDRKLVLLVDLDQTLIHTTN----DNVPNNL--------KDVYHFQLYGPNSPW 182

Query: 732 MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 791
             T+LRPG   FL +    +E+H+ T G + YA  +A+ LD KG  F+ R++SR    + 
Sbjct: 183 YHTRLRPGALQFLAKMDPFYELHICTFGARNYAHMIAQFLDEKGRYFSHRILSR---DEC 239

Query: 792 FDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           F+      K+ +L+ +    +S V IIDD   VW +   NLI V+ Y +F
Sbjct: 240 FNA---TSKTDNLKALFPCGDSMVCIIDDREDVW-NMAANLIQVKPYHFF 285


>gi|440493707|gb|ELQ76143.1| TFIIF-interacting CTD phosphatase, including NLI-interacting factor
           [Trachipleistophora hominis]
          Length = 466

 Score = 72.4 bits (176), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 42/125 (33%), Positives = 62/125 (49%), Gaps = 12/125 (9%)

Query: 711 KKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKV 770
           ++   D   P    +      M T LRP +  FL  ASKLF MH+YTMG   Y  ++  V
Sbjct: 161 QRTSTDNSFPSSFTYTLSSTTMHTTLRPHLHQFLTEASKLFHMHIYTMGTAEYVHQITNV 220

Query: 771 LDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKL 829
           +D  G+ F  R+++R D+           + K LE + G +   VVI+DD   VW +   
Sbjct: 221 IDKDGMFFGDRIVTRDDE----------MQVKRLERLFGDKVDMVVIVDDRGDVWEYCG- 269

Query: 830 NLIVV 834
           NL++V
Sbjct: 270 NLVMV 274


>gi|393218252|gb|EJD03740.1| hypothetical protein FOMMEDRAFT_105888 [Fomitiporia mediterranea
           MF3/22]
          Length = 921

 Score = 72.0 bits (175), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 42/109 (38%), Positives = 57/109 (52%), Gaps = 10/109 (9%)

Query: 733 WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPF 792
           + K RPG   FL   +  +EMH+YTMG + YA ++   +DP G LF GR++SR + G   
Sbjct: 273 YVKPRPGWKEFLSSVASRYEMHVYTMGTRAYAEKVCAAIDPDGRLFGGRILSRDESGS-- 330

Query: 793 DGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
                    K L  +   + S VVIIDD   VW  +  NLI V  Y +F
Sbjct: 331 ------LTQKSLRRLFPCDTSMVVIIDDRADVWEWSP-NLIKVIPYDFF 372



 Score = 58.9 bits (141), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 35/104 (33%), Positives = 52/104 (50%), Gaps = 2/104 (1%)

Query: 892 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPH-LHPLWQTAEQFGAVCTKHIDDQVTHV 950
           DV+ I+   +++      I+FS V P      H    +W+ A  FGA C + +D +VTHV
Sbjct: 567 DVKFIIPNIRKETFKDVHILFSGVIPTNIRMDHEATEIWRMARAFGATCHRDLDKEVTHV 626

Query: 951 VANSLGTDKVNWALST-GRFVVHPGWVEASALLYRRANEQDFAI 993
           V +  GT KV  A S    FVV   W   S   +RR +E+ + +
Sbjct: 627 VTSKRGTQKVEKARSQPNIFVVWLQWFTDSVAQWRRQDERRYLL 670


>gi|118784887|ref|XP_314000.3| AGAP005119-PA [Anopheles gambiae str. PEST]
 gi|116128258|gb|EAA09414.3| AGAP005119-PA [Anopheles gambiae str. PEST]
          Length = 822

 Score = 72.0 bits (175), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 52/168 (30%), Positives = 86/168 (51%), Gaps = 20/168 (11%)

Query: 674 KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMW 733
           +++ S RKL L++DLD TL+++      D V + +        ++  H  L+        
Sbjct: 139 ERLLSDRKLVLLVDLDQTLIHTTN----DNVPNNL--------KDVYHFQLYGPNSPWYH 186

Query: 734 TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFD 793
           T+LRPG   FL +    +E+H+ T G + YA  +A+ LD  G  F+ R++SR    + F+
Sbjct: 187 TRLRPGALEFLAKMHPYYELHICTFGARNYAHMIAQFLDKDGNFFSHRILSR---DECFN 243

Query: 794 GDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
                 K+ +L+ +    +S V IIDD   VW +   NLI V+ Y +F
Sbjct: 244 A---TSKTDNLKALFPCGDSMVCIIDDREDVW-NMASNLIQVKPYHFF 287



 Score = 48.1 bits (113), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 29/124 (23%), Positives = 61/124 (49%), Gaps = 4/124 (3%)

Query: 874 VIERLHKIFFSH--QSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQT 931
           ++ ++H+ F+    +S    D++ ++   + ++L G ++ FS + P           +  
Sbjct: 496 ILLKIHETFYEEYDKSKTISDLKQLIPQVKARVLVGAKLCFSGLIP-NNVKLEQSKAYLI 554

Query: 932 AEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQD 990
           A   GA  T++++   TH+VA ++GT KVN A    +  +V P W+ + A  +    E  
Sbjct: 555 ARSLGAAVTQNLEPTTTHLVAVTIGTSKVNNARKNPKIKIVTPEWLWSCAERWEHVEELL 614

Query: 991 FAIK 994
           + +K
Sbjct: 615 YPLK 618


>gi|221508436|gb|EEE34023.1| RNA polymerase II phosphatase, putative [Toxoplasma gondii VEG]
          Length = 1228

 Score = 72.0 bits (175), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 39/107 (36%), Positives = 57/107 (53%), Gaps = 11/107 (10%)

Query: 735 KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDG 794
           KLRPG   FL R S+ FE+++YTMG  L+A    ++LDPK   F  RV SR D  +    
Sbjct: 626 KLRPGCLDFLRRVSQTFELYMYTMGTALHAATALRILDPKRRFFGRRVFSRQDAVNGLKA 685

Query: 795 DERV-PKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
            ER+ P  + +         V+++DD   +W ++    I V+ Y YF
Sbjct: 686 IERIFPHDQKM---------VLVVDDLECMWSYSPC-CIKVQGYHYF 722


>gi|294935258|ref|XP_002781353.1| hypothetical protein Pmar_PMAR020737 [Perkinsus marinus ATCC 50983]
 gi|239891934|gb|EER13148.1| hypothetical protein Pmar_PMAR020737 [Perkinsus marinus ATCC 50983]
          Length = 979

 Score = 71.6 bits (174), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 56/186 (30%), Positives = 84/186 (45%), Gaps = 22/186 (11%)

Query: 674 KKMFSARKLCLVLDLDHTLLNSAK------FHEVDPVHDEILRKKEEQDREKPHRHLFRF 727
           + + ++++L  VLD+DHT+L+         F +V   +    R     D EK ++     
Sbjct: 511 RTLAASKRLVAVLDIDHTILHVTNKRIDLLFPDVTCYNLAPNRDTGRLDEEKVYQFFIGT 570

Query: 728 P---HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAG--RV 782
                   + KLRPG +TFLE    L+E++LYT G + YA  + K LDP    F    R+
Sbjct: 571 SPTTTACCYLKLRPGFYTFLEEILPLYELYLYTHGTREYAIRLLKALDPSARYFGSPPRL 630

Query: 783 ISRGDDGDPFDGDERVPKSKDLEGVL-GMESAVVIIDDSVRVW--PHNKLNLIVVERYTY 839
           I+R          +     K L  +        VI+DD   VW    N+ +LI V  Y +
Sbjct: 631 IAR--------PTQSALTCKTLSRIFPSNHRLAVIVDDRDDVWEAKDNEHSLIKVTPYVF 682

Query: 840 FPCSRR 845
           FP S R
Sbjct: 683 FPDSER 688


>gi|392570766|gb|EIW63938.1| hypothetical protein TRAVEDRAFT_111329 [Trametes versicolor
           FP-101664 SS1]
          Length = 900

 Score = 71.6 bits (174), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 41/110 (37%), Positives = 59/110 (53%), Gaps = 10/110 (9%)

Query: 732 MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 791
            + K RPG+  FLE  +  +EMH+YTMG + YA E+   +DP G +F  R++SR + G  
Sbjct: 262 YYIKPRPGLPEFLETMATKYEMHVYTMGTRAYAEEVCAAIDPGGKIFGNRILSRDESGS- 320

Query: 792 FDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
                     K L+ +    +S VVIIDD   VW  +  NL+ V  Y +F
Sbjct: 321 -------LTQKSLQRLFPCDQSMVVIIDDRADVWEWSP-NLVKVIPYDFF 362



 Score = 71.6 bits (174), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 46/156 (29%), Positives = 73/156 (46%), Gaps = 23/156 (14%)

Query: 862 RSEDGTLASSLGVIERLHKIFFS----HQSLDD-----------------VDVRNILAAE 900
           +++D  L     ++E +H  F+S    ++S D+                  DVR I+   
Sbjct: 519 KNDDAELTRVKQILELVHDRFYSAWERYKSEDEGKHGKRKHGKRTEAQREYDVRTIIPRI 578

Query: 901 QRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKV 960
           + + L GC I+FS V P+ +  P    +W+TA  FGA C   +  ++THVVA   GT KV
Sbjct: 579 RTETLDGCHILFSSVIPL-DTRPEATEIWKTAHAFGAKCYTELSPRITHVVAAKRGTQKV 637

Query: 961 NWALSTGRF-VVHPGWVEASALLYRRANEQDFAIKP 995
           + A   G   +V   W   S  L+ R +E  + + P
Sbjct: 638 DAARRRGGIKIVWLAWFTDSVALWHRQDEGPYLLDP 673


>gi|237834315|ref|XP_002366455.1| NLI interacting factor-like phosphatase domain-containing protein
           [Toxoplasma gondii ME49]
 gi|211964119|gb|EEA99314.1| NLI interacting factor-like phosphatase domain-containing protein
           [Toxoplasma gondii ME49]
          Length = 1225

 Score = 71.6 bits (174), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 39/107 (36%), Positives = 57/107 (53%), Gaps = 11/107 (10%)

Query: 735 KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDG 794
           KLRPG   FL R S+ FE+++YTMG  L+A    ++LDPK   F  RV SR D  +    
Sbjct: 623 KLRPGCLDFLRRVSQTFELYMYTMGTALHAATALRILDPKRRFFGRRVFSRQDAVNGLKA 682

Query: 795 DERV-PKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
            ER+ P  + +         V+++DD   +W ++    I V+ Y YF
Sbjct: 683 IERIFPHDQKM---------VLVVDDLECMWRYSPC-CIKVQGYHYF 719


>gi|395334832|gb|EJF67208.1| hypothetical protein DICSQDRAFT_142769 [Dichomitus squalens
           LYAD-421 SS1]
          Length = 953

 Score = 71.6 bits (174), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 40/109 (36%), Positives = 59/109 (54%), Gaps = 10/109 (9%)

Query: 733 WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPF 792
           + K RPG+  FL+  +  +EMH+YTMG + YA E+   +DP G +F  R++SR + G   
Sbjct: 287 YIKPRPGLLDFLQTMATKYEMHVYTMGTRAYAEEVCAAIDPGGKIFGNRILSRDESGS-- 344

Query: 793 DGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
                    K L+ +    +S VVIIDD   VW  +  NL+ V  Y +F
Sbjct: 345 ------LTQKSLQRLFPCDQSMVVIIDDRADVWEWSP-NLVKVIPYDFF 386



 Score = 62.0 bits (149), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 36/112 (32%), Positives = 57/112 (50%), Gaps = 2/112 (1%)

Query: 885 HQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHID 944
           H++  + DVR I+   +   LAG  I+F+ V P+ +  P    +W+TA  FGA C   + 
Sbjct: 598 HEAPVEHDVRTIIPRIRMDTLAGVHILFTGVIPLNQ-RPETAEIWKTATAFGAQCHTDLG 656

Query: 945 DQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAIKP 995
             +THVV N   T KV+ A       +V   W+  S  L++R +E  + + P
Sbjct: 657 KHITHVVTNKDNTQKVDAARRYADVRIVWLNWLTDSLALWQRQDETPYLLHP 708


>gi|19074511|ref|NP_586017.1| similarity to HYPOTHETICAL TRANSMEMBRANE PROTEINS YHG4_yeast
           [Encephalitozoon cuniculi GB-M1]
 gi|51701436|sp|Q8SV03.1|FCP1_ENCCU RecName: Full=RNA polymerase II subunit A C-terminal domain
           phosphatase; AltName: Full=CTD phosphatase FCP1
 gi|19069153|emb|CAD25621.1| similarity to HYPOTHETICAL TRANSMEMBRANE PROTEINS YHG4_yeast
           [Encephalitozoon cuniculi GB-M1]
 gi|449329538|gb|AGE95809.1| hypothetical protein ECU07_0890 [Encephalitozoon cuniculi]
          Length = 411

 Score = 71.6 bits (174), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 54/193 (27%), Positives = 87/193 (45%), Gaps = 49/193 (25%)

Query: 657 QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 716
           ++  AI KE+   LE Q K+       LVLDLD T+L++   +    +   +        
Sbjct: 42  EEAVAIHKEKMEALEMQMKLI------LVLDLDQTVLHTT--YGTSSLEGTVK------- 86

Query: 717 REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 776
                   F         KLRP +   L R SKL+E+H+YTMG + YA  + +++DP G 
Sbjct: 87  --------FVIDRCRYCVKLRPNLDYMLRRISKLYEIHVYTMGTRAYAERIVEIIDPSGK 138

Query: 777 LFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA---------VVIIDDSVRVWPHN 827
            F  R+I+R ++                +GVL    +         +VI+DD   VW + 
Sbjct: 139 YFDDRIITRDEN----------------QGVLVKRLSRLFPHDHRNIVILDDRPDVWDYC 182

Query: 828 KLNLIVVERYTYF 840
           + NL+++  + YF
Sbjct: 183 E-NLVLIRPFWYF 194


>gi|409051930|gb|EKM61406.1| hypothetical protein PHACADRAFT_204575 [Phanerochaete carnosa
           HHB-10118-sp]
          Length = 863

 Score = 71.2 bits (173), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 40/110 (36%), Positives = 60/110 (54%), Gaps = 10/110 (9%)

Query: 732 MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 791
            + K RPG   FLE  ++ +EMH+YTMG + YA E+   +DP G +F GR++SR + G  
Sbjct: 257 YYIKPRPGWNEFLEDMAEKYEMHVYTMGTRAYAEEVCAAIDPDGKIFGGRLLSRDESGS- 315

Query: 792 FDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
                     K L+ +    +S VV+IDD   VW  +  NL+ V  + +F
Sbjct: 316 -------LTQKSLQRLFPCDQSMVVVIDDRADVWEWSP-NLVKVIPFEFF 357



 Score = 63.2 bits (152), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 36/105 (34%), Positives = 54/105 (51%), Gaps = 2/105 (1%)

Query: 892 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVV 951
           DVR I+   ++  LAGC +VFS V P+ +        W+ A  FGA C   ++ ++TH++
Sbjct: 551 DVRLIIPLMRQNALAGCHVVFSSVIPL-DTRAETSETWRIAVMFGAKCYTELNPRITHLI 609

Query: 952 ANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAIKP 995
           A   GT KV+ A   G   +V   W   S   +RR +E  + I P
Sbjct: 610 AAKRGTAKVDAARRQGGVKIVWVNWFLDSINQWRRQDETPYLIDP 654


>gi|388858248|emb|CCF48177.1| related to FCP1-TFIIF interacting component of CTD phosphatase
           [Ustilago hordei]
          Length = 774

 Score = 71.2 bits (173), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 57/209 (27%), Positives = 95/209 (45%), Gaps = 43/209 (20%)

Query: 662 IQKERTRRL--EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREK 719
           +  E  +RL  E    + S RKL L++DLD T++++     VDP   E ++ +   + E 
Sbjct: 47  VSAEEAQRLDSETTSHLLSQRKLALIVDLDQTVIHAT----VDPTVGEWMKDESNPNYEA 102

Query: 720 PHR-HLFRFPHMG--------------------------MWTKLRPGIWTFLERASKLFE 752
                 FR    G                           + K RPG+   +++ S+ ++
Sbjct: 103 LKSVGKFRLGIDGEEIKDDDDDSAPKDSAAALKASRACWYYVKPRPGVPEIVKKLSEKYQ 162

Query: 753 MHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGME- 811
           +H+YTMG + YA  + K++DP   +F  R++SR ++G            K L  +  ++ 
Sbjct: 163 LHVYTMGTRSYANCVCKLIDPDASIFGNRILSRDENGSLV--------RKSLNRLFPVDH 214

Query: 812 SAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           S VVIIDD   VW  +  NL+ V  Y +F
Sbjct: 215 SMVVIIDDREDVWSRSP-NLLPVVPYEFF 242



 Score = 64.3 bits (155), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 39/101 (38%), Positives = 54/101 (53%), Gaps = 6/101 (5%)

Query: 893 VRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQV-THVV 951
           V ++++A++ K+LAGC IVFS + P G  NP    LW  A +FGA     +++ V THVV
Sbjct: 497 VVDLISAQKTKVLAGCTIVFSSMIPTGH-NPETSDLWALAREFGATPAFEVEEGVTTHVV 555

Query: 952 ANSLGTDKVNWALSTGRF----VVHPGWVEASALLYRRANE 988
           A   GT KV  A    R     +V P W   S   + R +E
Sbjct: 556 AARQGTLKVVNATRLARAGKVELVWPSWFHVSTSRWERQDE 596


>gi|402584910|gb|EJW78851.1| hypothetical protein WUBG_10241, partial [Wuchereria bancrofti]
          Length = 278

 Score = 71.2 bits (173), Expect = 2e-09,   Method: Composition-based stats.
 Identities = 48/172 (27%), Positives = 89/172 (51%), Gaps = 29/172 (16%)

Query: 662 IQKERTRRL--EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREK 719
           +  E  R++   +++ +  ARKL L++DLD TL+++   H            K E+D + 
Sbjct: 125 VSDELARKIGSRDRELLLKARKLVLLVDLDQTLIHTTN-HTF----------KLEKDTDV 173

Query: 720 PHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA 779
            H   ++      +TK+RP    FL R + L+EMH+ + G + YA  +A+ LDP+ + F 
Sbjct: 174 LH---YKLKGTDFYTKIRPHAREFLRRMAGLYEMHIISYGERQYAHRIAEFLDPEKIYFG 230

Query: 780 GRVISRGDDGDPFDGDE---RVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHN 827
            R++SR         DE    + K+++++ +    +  +V+IDD   VW ++
Sbjct: 231 HRILSR---------DELFCAMYKTRNMQALFPCGDHMIVMIDDRPDVWQYS 273


>gi|401409326|ref|XP_003884111.1| hypothetical protein NCLIV_045130 [Neospora caninum Liverpool]
 gi|325118529|emb|CBZ54080.1| hypothetical protein NCLIV_045130 [Neospora caninum Liverpool]
          Length = 1185

 Score = 71.2 bits (173), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 39/107 (36%), Positives = 56/107 (52%), Gaps = 11/107 (10%)

Query: 735 KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDG 794
           KLRPG   FL R S+ FE+++YTMG  L+A    ++LDP    F  RV SR D  +    
Sbjct: 649 KLRPGCLDFLRRVSQTFELYMYTMGTALHAATALRILDPGRRFFGRRVFSRQDAVNGLKA 708

Query: 795 DERV-PKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
            ER+ P  + +         V+++DD   +W +N    I V+ Y YF
Sbjct: 709 IERIFPHDRKM---------VLVVDDLDCMWSYNPC-CIKVQGYHYF 745


>gi|402467220|gb|EJW02558.1| FCP1-like phosphatase, phosphatase domain-containing protein
           [Edhazardia aedis USNM 41457]
          Length = 905

 Score = 71.2 bits (173), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 49/166 (29%), Positives = 78/166 (46%), Gaps = 28/166 (16%)

Query: 695 SAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKL---F 751
           S KF + D V D        ++  K   + F       +  LRP    FLE+   L   +
Sbjct: 209 SDKFTKNDIVTD--------KNENKTKIYTFMLNKHKYYIALRP----FLEKLLSLDEKY 256

Query: 752 EMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGME 811
           EMH+YTMGN  YA ++ K++DP G +F  R+I+R ++             K L+      
Sbjct: 257 EMHIYTMGNNQYAQKVKKIIDPTGTIFGNRIITRDENNQEL--------FKSLDRFSTNH 308

Query: 812 SAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEI 857
             +V+IDD + VW  + +N++ V  + +F    R   +  PS+L I
Sbjct: 309 DNIVVIDDRIDVWNFS-VNVVGVRPFWFF----RDGDINDPSVLRI 349


>gi|312373985|gb|EFR21645.1| hypothetical protein AND_16677 [Anopheles darlingi]
          Length = 857

 Score = 70.9 bits (172), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 51/168 (30%), Positives = 86/168 (51%), Gaps = 20/168 (11%)

Query: 674 KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMW 733
           +++ + RKL L++DLD TL+++      D V + +        ++  H  L+        
Sbjct: 150 ERLLNDRKLVLLVDLDQTLIHTTN----DNVPNNL--------KDVYHFQLYGPNSPWYH 197

Query: 734 TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFD 793
           T+LRPG   FL +    +E+H+ T G + YA  +A+ LD  G  F+ R++SR    + F+
Sbjct: 198 TRLRPGALEFLAKMHPYYELHICTFGARNYAHMIAQFLDKDGRFFSHRILSR---DECFN 254

Query: 794 GDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
                 K+ +L+ +    +S V IIDD   VW +   NLI V+ Y +F
Sbjct: 255 A---TSKTDNLKALFPCGDSMVCIIDDREDVW-NMASNLIQVKPYHFF 298



 Score = 48.9 bits (115), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 31/124 (25%), Positives = 62/124 (50%), Gaps = 4/124 (3%)

Query: 874 VIERLHKIFFS-HQSLDDV-DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQT 931
           ++ ++H++F+  + S   + D++ ++   + K+L   R+ FS + P           +Q 
Sbjct: 538 ILLKIHEMFYKKYDSTKTISDLKQLIPKVKAKVLVNARLCFSGLIP-NNVKLEQSKAYQI 596

Query: 932 AEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQD 990
           A   GA+ T++++   TH+VA ++GT KV  A    +  +V P W+   A  +    E  
Sbjct: 597 ARSLGAIVTQNLEPTTTHLVAVTIGTSKVINARKNPKIKIVTPQWLWTCAERWEHVEELL 656

Query: 991 FAIK 994
           F +K
Sbjct: 657 FPLK 660


>gi|300122627|emb|CBK23195.2| unnamed protein product [Blastocystis hominis]
          Length = 598

 Score = 70.1 bits (170), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 82/359 (22%), Positives = 141/359 (39%), Gaps = 56/359 (15%)

Query: 675 KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR----------HL 724
           K    +KL L++DLD TL+++    E   +    L    E + E   +           L
Sbjct: 146 KFLGGKKLILIIDLDMTLVHAIHEEESIGLFLNWLHGASESNEEDEWKKTLKDQVHSIEL 205

Query: 725 FRFPHMG------MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLF 778
           F     G      +  K+RPG+   L+  +  +EM +YT G   YA ++ +++DP   LF
Sbjct: 206 FYVDDNGSARMSKLLIKIRPGVRAMLQMLANSYEMIVYTQGENQYAEKVMQIVDPDNTLF 265

Query: 779 AGRVISRGDDGDP----------------------FDGDERVPKSKDLEGVLGMESAVVI 816
             R I+RG+  +                       +D     P+S     +  M   ++I
Sbjct: 266 KKRFIARGETRNEPQKKLLSKIVDCWNQYVRKQNVYDPANPTPESLPELTLEEMCRRLLI 325

Query: 817 IDDSVRVW-PHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGV- 874
           +DD   VW  H +  +I+    +   C    F      L   +         L++  GV 
Sbjct: 326 LDDKDEVWGMHEESGMILNPTSSLIKCFPYVFFDTKSDLYNFE--------KLSAYEGVE 377

Query: 875 ---IERLHKIFFS-HQSL---DDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHP 927
              I RL +IF   HQ+    +  DVR  L   +  +  G  + F+ +  + E     + 
Sbjct: 378 QQYILRLSEIFRDIHQTFTLENAEDVRKTLRERKHSVFHGLHMAFTSIMEITE-KVETNV 436

Query: 928 LWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRA 986
           L+++   FG V    + ++   ++  +L T KVN A      VV   W+E     ++ A
Sbjct: 437 LYRSLVDFGGVYMSEVTEECDLLICRTLRTAKVNKAQLLRIPVVSVRWLEECVKFWKLA 495


>gi|302698337|ref|XP_003038847.1| hypothetical protein SCHCODRAFT_255670 [Schizophyllum commune H4-8]
 gi|300112544|gb|EFJ03945.1| hypothetical protein SCHCODRAFT_255670 [Schizophyllum commune H4-8]
          Length = 1207

 Score = 70.1 bits (170), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 38/105 (36%), Positives = 55/105 (52%), Gaps = 2/105 (1%)

Query: 892 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVV 951
           DV+ I+ + +  +  GC I FS V P+ +  P  H  W+ A  FGA C   +  +VTHVV
Sbjct: 880 DVKRIIPSIRASVFQGCHICFSSVIPL-DIQPESHECWRIANMFGARCHATLAPEVTHVV 938

Query: 952 ANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAIKP 995
           A   GT KV+ A   G   VV P W + S  +++R  E  + + P
Sbjct: 939 AGKQGTAKVDEARRRGNIKVVTPMWFKDSVNMWQRQPESRYLLDP 983



 Score = 69.3 bits (168), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 69/265 (26%), Positives = 102/265 (38%), Gaps = 87/265 (32%)

Query: 651 FEGYDDQQKAAIQ-----------KERTRRLEEQ--KKMFSARKLCLVLDLDHTLLNSAK 697
           + G  D  +A+IQ           +E   R+E +  + +  AR+L L++DLD T++++  
Sbjct: 471 YLGVSDTSRASIQMTHSKNGPTVSREVAERIEREANEALLKARRLKLIVDLDQTIVHAT- 529

Query: 698 FHEVDPVHDEILRKK------------------------EEQDREKPHRHL--------- 724
              VDP   E + +                         E+ D   P   L         
Sbjct: 530 ---VDPTVGEWIAEGQAWEAKRARLEEKAAARAAAEESGEDIDDSDPEEELEECNPNWDA 586

Query: 725 ------------------FRFPHMGM------WTKLRPGIWTFLERASKLFEMHLYTMGN 760
                              R  H+ +      + K RPG   F+   S  +EMH+YTMG 
Sbjct: 587 LKDVKKFRLGPELLAPTNMRGRHIALDEGCVYYIKPRPGWQEFMNNMSAKYEMHVYTMGT 646

Query: 761 KLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDD 819
           + YA  +  VLDP G LF  R++SR + G            K L+ +    +S VVIIDD
Sbjct: 647 RAYAMAVCNVLDPDGRLFGERILSRDESGS--------LTQKSLDRLFPTDQSMVVIIDD 698

Query: 820 SVRVWPHN----KLNLIVVERYTYF 840
              VW         NLI V  Y +F
Sbjct: 699 RADVWSGGLQFWSPNLIKVVPYDFF 723


>gi|125584005|gb|EAZ24936.1| hypothetical protein OsJ_08716 [Oryza sativa Japonica Group]
          Length = 364

 Score = 69.7 bits (169), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 54/131 (41%), Positives = 68/131 (51%), Gaps = 22/131 (16%)

Query: 759 GNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLG-------ME 811
           G + YA  +AK+LDP GV F  R+ISR  D  P       P  K L+ V G         
Sbjct: 99  GTEDYAAAVAKLLDPDGVYFGERIISR--DESP------QPDRKSLDVVFGSAPASAAER 150

Query: 812 SAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDH--DERSEDGT-L 868
           +AVVI+DD+  VW  N  NLI +ERY YF  S R FG    S  E  H   ER  D +  
Sbjct: 151 AAVVILDDTAEVWEGNSDNLIEMERYHYFASSCRDFG----SPWECTHSLSERGVDESER 206

Query: 869 ASSLGVIERLH 879
           A++L V+ R+H
Sbjct: 207 AAALRVLRRVH 217


>gi|406695220|gb|EKC98531.1| protein phosphatase [Trichosporon asahii var. asahii CBS 8904]
          Length = 917

 Score = 69.7 bits (169), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 36/102 (35%), Positives = 56/102 (54%), Gaps = 2/102 (1%)

Query: 891 VDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHV 950
            DV  ++   + ++L+GC IVF+ V  + +  P    +WQ AE FGA C   +D++VTH 
Sbjct: 591 CDVPLLIGEIKDQVLSGCVIVFTGVIAINQ-KPQDSEIWQQAEAFGAQCQVELDERVTHC 649

Query: 951 VANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDF 991
           V  S+GT+K+  A       VV   W++ S   +RR  E+ F
Sbjct: 650 VIGSIGTEKMRRASRMPHVQVVWLAWLQTSIAFWRREPEEPF 691



 Score = 68.9 bits (167), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 52/152 (34%), Positives = 70/152 (46%), Gaps = 33/152 (21%)

Query: 690 HTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASK 749
             L + AKF   D V   + R      R +P       P    +TK RPG+  FLE  SK
Sbjct: 277 EALKDVAKFQLADDVPPGVSR------RHQPE------PVRWYYTKPRPGLNKFLEDMSK 324

Query: 750 LFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLG 809
           L+EMH+YTMG + YA  + K++DP+G  FA                     +K L  +  
Sbjct: 325 LYEMHVYTMGTRSYADAICKIVDPEGKYFAM-------------------SAKSLVRLFP 365

Query: 810 M-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
             +S VVIIDD   VW  +  NL+ V  Y +F
Sbjct: 366 HDQSMVVIIDDRSDVW-GDSPNLVKVVPYDFF 396


>gi|157109625|ref|XP_001650754.1| RNA polymerase ii ctd phosphatase [Aedes aegypti]
 gi|108868428|gb|EAT32653.1| AAEL015142-PA, partial [Aedes aegypti]
          Length = 569

 Score = 69.3 bits (168), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 51/170 (30%), Positives = 86/170 (50%), Gaps = 20/170 (11%)

Query: 672 EQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMG 731
           + +++   +KL L++DLD TL+++      D V + +        ++  H  L+      
Sbjct: 136 DTERLLRDKKLVLLVDLDQTLIHTTN----DNVPNNL--------KDVYHFQLYGSNSPW 183

Query: 732 MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 791
             T+LRPG   FL +    +E+H+ T G + YA  +A+ LD  G LF+ R++SR    + 
Sbjct: 184 YHTRLRPGALEFLAKMHPYYELHICTFGARNYAHMIAQFLDRDGKLFSHRILSR---DEC 240

Query: 792 FDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           F+      K+ +L  +    +S V IIDD   VW +   NLI V+ Y +F
Sbjct: 241 FNA---TSKTDNLRALFPCGDSMVCIIDDREDVW-NMAANLIQVKPYHFF 286



 Score = 45.8 bits (107), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 30/124 (24%), Positives = 59/124 (47%), Gaps = 4/124 (3%)

Query: 874 VIERLHKIFFSH--QSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQT 931
           ++ ++H+ F+    ++    D++ ++   + ++L G  +VFS + P           +Q 
Sbjct: 429 ILLKIHQTFYEEYEKTKQISDLKRLIPQVKSQVLVGFNLVFSGLVP-NSMKLEESKAYQV 487

Query: 932 AEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQD 990
           A   GA  T+      TH+VA + GT KV+ A    +  +V P W+ A A  +    E+ 
Sbjct: 488 ARSLGATVTQDFTPDTTHLVAVTFGTSKVHNARKNPKIKMVTPEWLWACAERWEHVEERL 547

Query: 991 FAIK 994
           + +K
Sbjct: 548 YPLK 551


>gi|396081720|gb|AFN83335.1| Fcp1-like phosphatase [Encephalitozoon romaleae SJ-2008]
          Length = 408

 Score = 69.3 bits (168), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 53/195 (27%), Positives = 90/195 (46%), Gaps = 53/195 (27%)

Query: 657 QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 716
           ++  +I KE+ + LE Q K+       LVLDLD T+L++A                    
Sbjct: 42  KEAVSIYKEKIKTLEMQMKLI------LVLDLDQTVLHTAY------------------- 76

Query: 717 REKPHRHLFRFPHMGMW--TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 774
                + + RF   G     KLRP +   L + S+L+E+H+YTMG + YA  + +++DP 
Sbjct: 77  GASSEKGIVRFTMDGCKYSVKLRPNLKRMLRKVSRLYEIHVYTMGTRPYAERIVRIIDPT 136

Query: 775 GVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA---------VVIIDDSVRVWP 825
              F  R+I+R ++                +GVL    +         +VI+DD   VW 
Sbjct: 137 RKYFHDRIITRDEN----------------QGVLVKRLSRLFPYNHKNIVILDDRADVWD 180

Query: 826 HNKLNLIVVERYTYF 840
           + + NL++++ + YF
Sbjct: 181 YCE-NLVLIKPFWYF 194


>gi|401886990|gb|EJT50998.1| protein phosphatase [Trichosporon asahii var. asahii CBS 2479]
          Length = 922

 Score = 68.9 bits (167), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 52/152 (34%), Positives = 70/152 (46%), Gaps = 33/152 (21%)

Query: 690 HTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASK 749
             L + AKF   D V   + R      R +P       P    +TK RPG+  FLE  SK
Sbjct: 277 EALKDVAKFQLADDVPPGVSR------RHQPE------PVRWYYTKPRPGLNKFLEDMSK 324

Query: 750 LFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLG 809
           L+EMH+YTMG + YA  + K++DP+G  FA                     +K L  +  
Sbjct: 325 LYEMHVYTMGTRSYADAICKIVDPEGKYFAM-------------------SAKSLVRLFP 365

Query: 810 M-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
             +S VVIIDD   VW  +  NL+ V  Y +F
Sbjct: 366 HDQSMVVIIDDRSDVW-GDSPNLVKVVPYDFF 396



 Score = 68.9 bits (167), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 36/102 (35%), Positives = 56/102 (54%), Gaps = 2/102 (1%)

Query: 891 VDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHV 950
            DV  ++   + ++L+GC IVF+ V  + +  P    +WQ AE FGA C   +D++VTH 
Sbjct: 596 CDVPLLIGEIKDQMLSGCVIVFTGVIAINQ-KPQDSEIWQQAEAFGAQCQVELDERVTHC 654

Query: 951 VANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDF 991
           V  S+GT+K+  A       VV   W++ S   +RR  E+ F
Sbjct: 655 VIGSIGTEKMRRASRMPHVQVVWLAWLQTSIAFWRREPEEPF 696


>gi|321460734|gb|EFX71774.1| hypothetical protein DAPPUDRAFT_308742 [Daphnia pulex]
          Length = 798

 Score = 68.9 bits (167), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 55/175 (31%), Positives = 85/175 (48%), Gaps = 30/175 (17%)

Query: 672 EQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMG 731
           +++++   RKL L++DLD TL+++         +DEI    E+         +F F   G
Sbjct: 148 DEERLLKDRKLVLLVDLDQTLIHT--------TNDEIPANIED---------VFHFQLHG 190

Query: 732 -----MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG 786
                  T+LRP     L   S L+E+H+ T G++ YA  +A  LD KG  F+ R++SR 
Sbjct: 191 PNSPWYHTRLRPFTKELLCSMSSLYELHICTFGSRTYAHMIANFLDEKGRYFSHRILSR- 249

Query: 787 DDGDPFDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
              + F       K+ +L+ +    +  VVIIDD   VW     NLI V  Y +F
Sbjct: 250 --DECFSAH---SKTANLKALFPCGDQMVVIIDDREDVWNFAP-NLIHVRPYHFF 298


>gi|156083399|ref|XP_001609183.1| hypothetical protein [Babesia bovis T2Bo]
 gi|154796434|gb|EDO05615.1| hypothetical protein BBOV_IV000150 [Babesia bovis]
          Length = 692

 Score = 68.9 bits (167), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 68/255 (26%), Positives = 106/255 (41%), Gaps = 39/255 (15%)

Query: 727 FPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG 786
           F ++  + KLRPG+  FL+  S  +EM +YT   K YA  +  +LDP   LF  R+++R 
Sbjct: 307 FANIRYYMKLRPGLRGFLQVLSLYYEMSIYTNATKEYADVVVSILDPDRSLFMDRIVART 366

Query: 787 DDGDPFDGDERVPKSKDLEGVLGM------ESAVVIIDDSVRVW---PHNKLNLIVVERY 837
             G+           +DL+              VV  DD   VW   PHN+  ++  E Y
Sbjct: 367 SAGE-----------RDLQKTAARLYPNLDPRFVVAFDDRADVWADVPHNQ--VVKAEHY 413

Query: 838 TYFPCSRRQ----FGLLGPS---LLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDD 890
            +F     +    +G++  S    L ID D       L   + V   LHK FF+     +
Sbjct: 414 DFFDSHIAELSDLYGIVNSSTENTLYIDSDRH-----LDHMVKVFLELHKRFFNDPFKSN 468

Query: 891 V-----DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDD 945
           V     ++++ +  +   +L G R        V + +          E    V +K  D 
Sbjct: 469 VGTLVQEIQSNVLKDTGILLTGYRKNSKGQGQVLQTDCEQRQREIAVELGATVVSKLSDK 528

Query: 946 QVTHVVANSLGTDKV 960
           ++THVVA    TD V
Sbjct: 529 RLTHVVAGKNCTDNV 543


>gi|449018404|dbj|BAM81806.1| similar to TFIIF interacting component of CTD phosphatase Fcp1p
            [Cyanidioschyzon merolae strain 10D]
          Length = 1640

 Score = 68.9 bits (167), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 41/104 (39%), Positives = 59/104 (56%), Gaps = 11/104 (10%)

Query: 893  VRNILAAEQRKILAGCRIVFSRVFP--VGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHV 950
            +++ILA  +R +L GC + F+ VF    G A P  H LW+ A +FGAVC + +  QV+H+
Sbjct: 1389 IKDILAELRRSVLTGCELCFTGVFAKHAGMA-PEDHELWRLAVRFGAVCHREVLPQVSHL 1447

Query: 951  VAN-SLGTDKVNWALSTGR-----FVVHPGWVEASALLYRRANE 988
            +A+   G D +     T R     F+V P W+  SA   RRANE
Sbjct: 1448 IADPQRGRDTLK--TQTARAMNSVFIVKPTWLVCSAEDMRRANE 1489



 Score = 67.0 bits (162), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 43/118 (36%), Positives = 65/118 (55%), Gaps = 15/118 (12%)

Query: 732 MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 791
            + KLRPG+  FL   +  FE+H+YTMG++ YA  +A ++D    LF GR+ SR D    
Sbjct: 521 YYIKLRPGLHEFLRTIADRFELHIYTMGSRPYADTVASIIDSDERLFQGRITSRDD---- 576

Query: 792 FDGDERVPKSKDLEGVLGM-ESAVVIIDDSVRVWP------HNKL--NLIVVERYTYF 840
           F+ D R+ + K+L+ V    +S V+++DD   VW       H +   NLI    Y +F
Sbjct: 577 FE-DGRLNQ-KNLKHVFPCDDSMVLVVDDREDVWVAQDQSLHGRHFPNLIRARPYYFF 632


>gi|297830094|ref|XP_002882929.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297328769|gb|EFH59188.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 270

 Score = 68.9 bits (167), Expect = 1e-08,   Method: Composition-based stats.
 Identities = 56/210 (26%), Positives = 89/210 (42%), Gaps = 31/210 (14%)

Query: 691 TLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKL 750
           TL++S K   +      ++++++   R+   ++  R        K RP +  FL+ A+KL
Sbjct: 80  TLIHSMKTLNLSNAEKYLIKEEKSGSRKDLRKYNDRL------VKFRPFVEEFLKEANKL 133

Query: 751 FEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGM 810
           F M  YT G   YA  + ++LDP  + F  R+I+R +           P  K L+ VL  
Sbjct: 134 FTMTAYTRGGSTYAKAVVRMLDPNKIYFGDRIITRKES----------PDLKTLDLVLAD 183

Query: 811 ESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLAS 870
           E  +VI             NL+ +  Y YF    R       S  E   DE      L  
Sbjct: 184 ERGIVI------------RNLLEITSYFYFKNDHRNIMRSRLSYAERKTDESRTKRALVK 231

Query: 871 SLGVIERLHKIFFS---HQSLDDVDVRNIL 897
            L  ++ +H  FF+    + LD  DVR ++
Sbjct: 232 LLKFLKEVHNGFFTCGLEEQLDIKDVRYLI 261


>gi|357451355|ref|XP_003595954.1| RNA polymerase II subunit A C-terminal domain phosphatase [Medicago
           truncatula]
 gi|355485002|gb|AES66205.1| RNA polymerase II subunit A C-terminal domain phosphatase [Medicago
           truncatula]
          Length = 239

 Score = 68.6 bits (166), Expect = 2e-08,   Method: Composition-based stats.
 Identities = 65/231 (28%), Positives = 98/231 (42%), Gaps = 59/231 (25%)

Query: 730 MGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 789
           M    KLRP + TFL+ AS++FEM++YTMG + Y+ EMAK+LDP+   F  +V  +  D 
Sbjct: 65  MQRMNKLRPFVRTFLKEASEVFEMYIYTMGIRQYSLEMAKLLDPQVEYFKDKVWQKHKDN 124

Query: 790 -DPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFG 848
              F     +  ++ L  +   ESA     D ++ +       + + R  YF        
Sbjct: 125 LTTFS----IKVARKLTSISNSESAKT---DQLKFFD------VYLIRLRYFD------- 164

Query: 849 LLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQRKILAGC 908
                                    VI  L  I   H          +L++ + ++L+GC
Sbjct: 165 ------------------------NVISLLQSIHVFH----------VLSSLRGEVLSGC 190

Query: 909 RIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDK 959
            IVFS  F        L  L + AE+ GA     +   VTHVVAN+  T +
Sbjct: 191 VIVFSCAF----HGHDLRKLRKIAERLGATHLTELRPTVTHVVANAFVTKE 237


>gi|429964988|gb|ELA46985.1| FCP1-like phosphatase, phosphatase domain-containing protein,
           partial [Vavraia culicis 'floridensis']
          Length = 231

 Score = 68.2 bits (165), Expect = 2e-08,   Method: Composition-based stats.
 Identities = 54/187 (28%), Positives = 86/187 (45%), Gaps = 29/187 (15%)

Query: 673 QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDE-------------ILRKKEEQDREK 719
           + ++   +K+ LV+DLD T+L+S +  +   V D              I  K+  Q R +
Sbjct: 53  RDELIQKKKMILVVDLDQTILHSIEV-KGGRVGDNGSRNRNGECGGRGITNKQLLQARPR 111

Query: 720 ---PHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 776
              P    +      M T LRP + TFL   +++F MH+YTMG   Y  ++  V+D    
Sbjct: 112 QPLPSSFTYTLASTTMKTTLRPHLHTFLTELNEMFHMHIYTMGTSEYVHQITNVIDRDRS 171

Query: 777 LFAGRVISRGDDGDPFDGDERVPKSKDLEGVLG-MESAVVIIDDSVRVWPHNKLNLIVVE 835
           LF  R+++R D+             K LE + G  E  VV+IDD   VW +   NL+++ 
Sbjct: 172 LFGDRIVTRDDE----------VLVKRLERLFGDREDMVVVIDDRGDVWEYCG-NLVMIR 220

Query: 836 RYTYFPC 842
            +    C
Sbjct: 221 PFFGVDC 227


>gi|409083591|gb|EKM83948.1| hypothetical protein AGABI1DRAFT_124274 [Agaricus bisporus var.
           burnettii JB137-S8]
          Length = 853

 Score = 67.8 bits (164), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 40/109 (36%), Positives = 57/109 (52%), Gaps = 10/109 (9%)

Query: 733 WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPF 792
           + K RPG   FL   +  ++MH+YTMG + YA E+   +DP G +F  R++SR + G   
Sbjct: 269 YIKPRPGWKEFLMDMATKYDMHVYTMGTRAYAEEVCAAIDPDGSVFKSRILSRDESGS-- 326

Query: 793 DGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
                    K L+ +   + S VVIIDD   VW  +  NLI V  Y +F
Sbjct: 327 ------LTQKSLQRLFPCDTSMVVIIDDRADVWEWSP-NLIKVIPYDFF 368



 Score = 58.2 bits (139), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 33/101 (32%), Positives = 48/101 (47%), Gaps = 2/101 (1%)

Query: 892 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVV 951
           +V  I+   + ++L G  +VFS V P+ +  P     W+ A  FGA C   +   VTHV+
Sbjct: 540 NVTTIIPRLRSEVLGGLHLVFSGVIPL-DTPPETTEFWRLARMFGAKCHTDLTPDVTHVI 598

Query: 952 ANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDF 991
               GT KV  A   G   +V P W   S  L+ R +E  +
Sbjct: 599 TAKRGTKKVETARQRGGIKIVRPEWFNDSIALWHRQDETPY 639


>gi|432861327|ref|XP_004069613.1| PREDICTED: CTD small phosphatase-like protein 2-A-like [Oryzias
           latipes]
          Length = 473

 Score = 67.8 bits (164), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 66/245 (26%), Positives = 114/245 (46%), Gaps = 28/245 (11%)

Query: 596 VSQNSPIQPGQIKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYD 655
           ++ ++ I+ G+I +  DM  +           GS P + P  A P++++ D   +F+ Y 
Sbjct: 211 ITSDTSIEEGEIITETDMPPLTA---PGCMSVGSYPHSIP-SAPPETSYEDEWEVFDPYY 266

Query: 656 DQQKA--AIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKE 713
             +      +++ TR+     K  S  +  LVLDLD TL++ +                E
Sbjct: 267 FIKHVPPLTEEQLTRKPALPLKTRSTPEFSLVLDLDETLVHCSL--------------NE 312

Query: 714 EQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDP 773
            +D       LF+     ++ +LRP    FLER S+L+E+ L+T   K+YA ++  +LDP
Sbjct: 313 LEDAALTFPVLFQDVIYQVYVRLRPFFREFLERMSQLYEIILFTASKKVYADKLLNILDP 372

Query: 774 KGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLI 832
           K  L   R+    +      G+      KDL  +LG + S  VIID+S + + +   N I
Sbjct: 373 KKQLVRHRLFR--EHCVCVQGN----YIKDL-NILGRDLSKTVIIDNSPQAFAYQLSNGI 425

Query: 833 VVERY 837
            +E +
Sbjct: 426 PIESW 430


>gi|297830090|ref|XP_002882927.1| hypothetical protein ARALYDRAFT_897807 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297328767|gb|EFH59186.1| hypothetical protein ARALYDRAFT_897807 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 287

 Score = 66.6 bits (161), Expect = 6e-08,   Method: Composition-based stats.
 Identities = 60/225 (26%), Positives = 88/225 (39%), Gaps = 55/225 (24%)

Query: 677 FSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKL 736
           +  RKL LV+DL H LL+S                                   G+  KL
Sbjct: 96  YGQRKLHLVVDLQHVLLDSN----------------------------------GVLVKL 121

Query: 737 RPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDE 796
           RP    FL  A++LF ++ YT  +   A    K+LDP  + F  R I+           E
Sbjct: 122 RPFAREFLREANELFTIYAYTKSDPKQARSFIKLLDPLKIFFPSRFITIA---------E 172

Query: 797 RVPKSKDLEGVLGMESAVVIIDDSVRVWPH-NKLNLIVVERYTYFPCSRRQFGLLGPSLL 855
              K K LE VL  E  VVI+D     W   ++ NL++++ Y YF     Q G +   + 
Sbjct: 173 EKRKKKSLEFVLAEERGVVILDCKSETWEKDDERNLLLIKSYDYFKGMEYQQGFITKFIN 232

Query: 856 EIDHDER-----------SEDGTLASSLGVIERLHKIFFSHQSLD 889
             +                +DG L  +L  ++ +H+ FF  Q  D
Sbjct: 233 FFNKSSSEEKRNEKEEEDDDDGVLVDALNSLKTIHQRFFHGQCKD 277


>gi|221055253|ref|XP_002258765.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
           knowlesi strain H]
 gi|193808835|emb|CAQ39537.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
           knowlesi strain H]
          Length = 1474

 Score = 66.6 bits (161), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 43/117 (36%), Positives = 58/117 (49%), Gaps = 14/117 (11%)

Query: 729 HMGMWT---KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISR 785
           H G +T   KLRPG+  FL++ +K +E++LYTMG   +A     +LDP    F  RV SR
Sbjct: 548 HKGSYTIYYKLRPGVIQFLQKMNKKYEIYLYTMGTLEHAKSCLLLLDPLKKFFGNRVFSR 607

Query: 786 GDDGDPFDGDERVPKSKDLEGVL-GMESAVVIIDDSVRVWPHNKLNLIVVERYTYFP 841
            D          V   K L  +L    S  + IDDS  +W  +  + I V  Y YFP
Sbjct: 608 KD---------SVNGLKHLNRILPTYRSVSLCIDDSDYMWKESS-SCIKVHGYNYFP 654


>gi|348509633|ref|XP_003442352.1| PREDICTED: CTD small phosphatase-like protein 2-A-like [Oreochromis
           niloticus]
          Length = 476

 Score = 66.2 bits (160), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 89/359 (24%), Positives = 151/359 (42%), Gaps = 59/359 (16%)

Query: 496 MDELGKVRMKPRDPRRVLHGNALQRSGSLGPEFKTDGPSAPCTQGSK------ENLNFQK 549
           ++E   V +    PR  L G          P F    P+   + GS       E     K
Sbjct: 117 LEETTAVEVTTSPPRTTLLGTIF------SPVFNFFSPAKNASSGSDSPDQALEAEEIVK 170

Query: 550 QLGAPEAKPVLSQSVLQPDITQQFTKNLKHIADFMS-VSQ-PLTSEPMVSQNSPI-QPGQ 606
           QL        + Q+V  P  T   T+ L    ++ S VSQ P    P + + SP  + G+
Sbjct: 171 QLD-------IEQAVETPTSTATSTQELCVTTNYYSSVSQLPPLRPPHILEASPTTEEGE 223

Query: 607 IKSGADMKAVVTNHDDKQTGTGSGPEAG-----PVGAHPQSAWGDVEHLFEGYDDQQKA- 660
           + + AD+  +        T  G+ P+       P    P++++ +   +F+ Y   +   
Sbjct: 224 LHTDADLPPL--------TAPGTSPDMAHVDTLPATVPPEASYEEDWEVFDPYFFIKHVP 275

Query: 661 -AIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREK 719
              +++ TR+     K  S  +  LVLDLD TL++ +                E +D   
Sbjct: 276 PLTEEQLTRKPALPLKTRSTPEFSLVLDLDETLVHCSL--------------NELEDAAL 321

Query: 720 PHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA 779
               LF+     ++ +LRP    FLER S+++E+ L+T   K+YA ++  +LDPK  L  
Sbjct: 322 TFPVLFQDVIYQVYVRLRPFFREFLERMSQIYEIILFTASKKVYADKLLNILDPKKQLVR 381

Query: 780 GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 837
            R+    +      G+      KDL  +LG + S  +IID+S + + +   N I +E +
Sbjct: 382 HRLFR--EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 433


>gi|156096809|ref|XP_001614438.1| hypothetical protein [Plasmodium vivax Sal-1]
 gi|148803312|gb|EDL44711.1| hypothetical protein, conserved [Plasmodium vivax]
          Length = 1467

 Score = 66.2 bits (160), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 43/117 (36%), Positives = 58/117 (49%), Gaps = 14/117 (11%)

Query: 729 HMGMWT---KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISR 785
           H G +T   KLRPG+  FL++ +K +E++LYTMG   +A     +LDP    F  RV SR
Sbjct: 532 HKGSYTIYYKLRPGVIQFLQKMNKKYEIYLYTMGTLEHAKSCLLLLDPLKNFFGNRVFSR 591

Query: 786 GDDGDPFDGDERVPKSKDLEGVL-GMESAVVIIDDSVRVWPHNKLNLIVVERYTYFP 841
            D          V   K L  +L    S  + IDDS  +W  +  + I V  Y YFP
Sbjct: 592 KD---------SVNGLKHLNRILPTYRSVSLCIDDSDYMWKESS-SCIKVHGYNYFP 638


>gi|224004348|ref|XP_002295825.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|209585857|gb|ACI64542.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 1073

 Score = 66.2 bits (160), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 66/234 (28%), Positives = 107/234 (45%), Gaps = 39/234 (16%)

Query: 777 LFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLN------ 830
           LF  R++SR D GD     + + +     GV+       I+DD   VW + K N      
Sbjct: 771 LFGSRIVSRTDVGDLGTDVKSLKRVFPCGGVMA-----AIVDDREDVWANAKNNDTGRPG 825

Query: 831 -----LIVVERYTYFPCS--RRQFGLLGPSLLEID---------HDERSEDGTLASSLGV 874
                L++V+ Y + P S  R      G  L + D         H ++ +D  L  +  +
Sbjct: 826 EPPDNLLLVQPYHWKPFSGYRDVNNASGQDLSKSDDDKSSSEKEHADKEQDVQLLWTADI 885

Query: 875 IERLHKIFFSHQ-SLDDVD---VRNILAAEQRKILA---GCRIVFSRVFPVGEANPHLH- 926
           ++RLH+ ++S   SL+D D   V ++L + +++ L       IVFS V P+ +       
Sbjct: 886 LKRLHERYYSPSISLEDRDKLSVPSLLRSMRKETLLRFPQANIVFSGVIPIDKQKEERKV 945

Query: 927 --PLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALST--GRFVVHPGWV 976
             PL + AE+ GA     +   VTHVVA   G+DK++ A     G ++V P W+
Sbjct: 946 RVPLVRYAEELGARVLPDVTKGVTHVVARRDGSDKIHRARKEVPGCYIVQPSWL 999


>gi|124802229|ref|XP_001347409.1| protein phosphatase, putative [Plasmodium falciparum 3D7]
 gi|23494988|gb|AAN35322.1| protein phosphatase, putative [Plasmodium falciparum 3D7]
          Length = 1438

 Score = 65.9 bits (159), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 40/111 (36%), Positives = 54/111 (48%), Gaps = 11/111 (9%)

Query: 732 MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 791
           ++ KLRPG+  FL   S+ +E++LYTMG   +A     +LDP    F  RV SR D    
Sbjct: 572 IYYKLRPGVIEFLRTMSEKYEIYLYTMGTLEHAKSCLFLLDPLRKFFGNRVFSRKD---- 627

Query: 792 FDGDERVPKSKDLEGVL-GMESAVVIIDDSVRVWPHNKLNLIVVERYTYFP 841
                 +   K L  +L    S  + IDDS  +W  N  + I V  Y YFP
Sbjct: 628 -----CLNSLKHLNKILPTYRSVSICIDDSDYIWKENS-SCIKVHGYNYFP 672


>gi|422292668|gb|EKU19970.1| rna polymerase ii ctd phosphatase, partial [Nannochloropsis
           gaditana CCMP526]
          Length = 419

 Score = 65.5 bits (158), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 53/167 (31%), Positives = 82/167 (49%), Gaps = 18/167 (10%)

Query: 693 LNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGM----WTKLRPGIWTFLERAS 748
           LN     ++  +H  I  + E  D +K   H F   + G     W  LRP + TFL +A 
Sbjct: 207 LNLILDIDLTLLHATIDPRAERLDHQKLEVHAFDIFNQGRILRHWCCLRPHLRTFLSQAH 266

Query: 749 KLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVL 808
            L+ + +YT G + YA ++A++LDP   LF  R++SR D  D       +   K L+ + 
Sbjct: 267 ALYVLTIYTHGRRDYAHQVARLLDPDRTLFEDRIVSRDDCPD-------LHGQKSLQRLF 319

Query: 809 --GMESAVVIIDDSVRVWPHNK----LNLIVVERYTYFPCSRRQFGL 849
             G+E A +I+DDS +VW   +    L ++  + YT F    R  GL
Sbjct: 320 PGGIEMA-LILDDSPQVWQGEQSRHLLPVLPFKFYTEFEEVNRVAGL 365


>gi|387196292|gb|AFJ68751.1| rna polymerase ii ctd phosphatase, partial [Nannochloropsis
           gaditana CCMP526]
          Length = 414

 Score = 65.5 bits (158), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 53/167 (31%), Positives = 82/167 (49%), Gaps = 18/167 (10%)

Query: 693 LNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGM----WTKLRPGIWTFLERAS 748
           LN     ++  +H  I  + E  D +K   H F   + G     W  LRP + TFL +A 
Sbjct: 202 LNLILDIDLTLLHATIDPRAERLDHQKLEVHAFDIFNQGRILRHWCCLRPHLRTFLSQAH 261

Query: 749 KLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVL 808
            L+ + +YT G + YA ++A++LDP   LF  R++SR D  D       +   K L+ + 
Sbjct: 262 ALYVLTIYTHGRRDYAHQVARLLDPDRTLFEDRIVSRDDCPD-------LHGQKSLQRLF 314

Query: 809 --GMESAVVIIDDSVRVWPHNK----LNLIVVERYTYFPCSRRQFGL 849
             G+E A +I+DDS +VW   +    L ++  + YT F    R  GL
Sbjct: 315 PGGIEMA-LILDDSPQVWQGEQSRHLLPVLPFKFYTEFEEVNRVAGL 360


>gi|66363226|ref|XP_628579.1| RNA pol II carboxy terminal domain phosphatase of the HAD
           superfamily with a BRCT domain at the C-terminus
           [Cryptosporidium parvum Iowa II]
 gi|46229587|gb|EAK90405.1| RNA pol II carboxy terminal domain phosphatase of the HAD
           superfamily with a BRCT domain at the C-terminus
           [Cryptosporidium parvum Iowa II]
 gi|323509333|dbj|BAJ77559.1| cgd7_4250 [Cryptosporidium parvum]
 gi|323509917|dbj|BAJ77851.1| cgd7_4250 [Cryptosporidium parvum]
          Length = 595

 Score = 65.5 bits (158), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 90/366 (24%), Positives = 149/366 (40%), Gaps = 64/366 (17%)

Query: 674 KKMFSARKLCLVLDLDHTLL---NSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP-- 728
           K   +  KL  +LDLD+TLL   NS K      + D I    +      P  + F  P  
Sbjct: 166 KDYLAQNKLVAILDLDNTLLHAYNSTKIGCNINLEDFISSSGD------PEMYKFVLPQD 219

Query: 729 -HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD 787
            +   + KLRPG+  FL   +  + M + T   + YA  +  VLDP+   F  R+++R  
Sbjct: 220 LNTPYYLKLRPGVREFLNTIAPYYIMGICTNATREYADVIRAVLDPQRDKFGDRIVAR-- 277

Query: 788 DGDPFDGDERVPKSKDLEGV-LGMES-AVVIIDDSVRVWPHNKLNLIV-VERYTYFPCSR 844
             +  DG +     KD   + + +E+ A+V++DD   VW  +  + +V  + Y YF   +
Sbjct: 278 --ESVDGRD---TQKDFRKICVDVETRAIVLLDDRSDVWDSSLESQVVKAQTYEYFEQRK 332

Query: 845 RQFGLLGPSLL----EIDHDERSEDGTLASSLG-----------------------VIER 877
                  PSL      I  +  +    L+++L                        V + 
Sbjct: 333 DALKSHYPSLSSGANSISANSSAPGDILSAALSSLSNASGGNSIADYDRHLDYLIRVFKE 392

Query: 878 LHKIFFSHQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPL---W----- 929
           LH  FF  Q+ +   V +IL   + +IL  C + F+      E  P +  L   W     
Sbjct: 393 LHTRFF--QNPETACVGDILKKMRSEILENCIVCFTGFLKADE-KPIVKGLPSNWGDSQA 449

Query: 930 ---QTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRR 985
                A + GA   ++++   TH+V     T K + A +     +VH  W+ A    ++R
Sbjct: 450 DAESAAIRLGAGIEENLEPLATHLVVQKTNTAKFHQAKNNPNVKIVHTLWLWACDGQWQR 509

Query: 986 ANEQDF 991
            +E  F
Sbjct: 510 VSEDLF 515


>gi|71026568|ref|XP_762950.1| hypothetical protein [Theileria parva strain Muguga]
 gi|68349902|gb|EAN30667.1| hypothetical protein TP03_0826 [Theileria parva]
          Length = 823

 Score = 65.1 bits (157), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 57/217 (26%), Positives = 89/217 (41%), Gaps = 59/217 (27%)

Query: 681 KLCLVLDLDHTLL------------------NSAKF------HEVDPVHDEILRKKEEQD 716
           KLCLVLDLD+TLL                  ++A +      +  D V  E+ +K E   
Sbjct: 301 KLCLVLDLDNTLLHATSQPPPPDIAIPILNYDTADYLNQYVQYGTDSVSLELQQKLENSV 360

Query: 717 REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 776
            +    +     +   + KLRPGI+ F  + S  F + L+TMG K +A    +V+DP+G+
Sbjct: 361 IKTIVYNETETSYCVSYFKLRPGIFQFFHKISDKFRLFLFTMGTKQHAASALQVIDPQGI 420

Query: 777 LFAGRVISRGDDGD--------------------------------PFDGDERVPKSKDL 804
            F  R+ SR +                                       D R    K L
Sbjct: 421 YFGNRIFSRYNTNSHNSTNSINSVNSVNSVNSVNSMNSVSNVVGVKKLRNDLRYCM-KSL 479

Query: 805 EGVL-GMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           + +    ++ V+++DD+  VW +N L LI V  Y +F
Sbjct: 480 DRIFPNYKNLVLVMDDTEHVWTNN-LGLIKVHPYYFF 515


>gi|330796177|ref|XP_003286145.1| hypothetical protein DICPUDRAFT_87022 [Dictyostelium purpureum]
 gi|325083890|gb|EGC37331.1| hypothetical protein DICPUDRAFT_87022 [Dictyostelium purpureum]
          Length = 793

 Score = 65.1 bits (157), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 50/182 (27%), Positives = 84/182 (46%), Gaps = 35/182 (19%)

Query: 678 SARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMG--MWTK 735
           S  K+ L++D+DHTL++S K    DP  +    K +         H   FP      + K
Sbjct: 413 STPKMHLIVDIDHTLIHSTK----DPNGESYFLKDKT-------VHKISFPETNETFYVK 461

Query: 736 LRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISR---------- 785
            RP    FL   S+ F +++Y+   K Y   +A +LDP   +F+ +VIS+          
Sbjct: 462 ERPNAIEFLRTLSQQFYIYVYSFHPKYYVERVASILDPHSNIFS-KVISKEIIESIENIK 520

Query: 786 -----GDDGDPF--DGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYT 838
                 +   PF    ++ VPK    E +    + ++I+DD   VW + + NLI+++ + 
Sbjct: 521 ICRENNNSQKPFIVFNEQNVPKIFKFESI----NQLIILDDREDVWRNFQDNLILLDTFK 576

Query: 839 YF 840
           YF
Sbjct: 577 YF 578



 Score = 50.4 bits (119), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 31/105 (29%), Positives = 53/105 (50%), Gaps = 5/105 (4%)

Query: 893 VRNILAAEQRKILAGCRIVFSRVFPVGEANPH--LHP-LWQTAEQFGAVCTKHIDDQVTH 949
           +++++   +  +L  C IVFS +FP  + +P    H  + +  E FGA  ++ ID   TH
Sbjct: 680 IKSVIQEIRSSVLMDCNIVFSGIFP-KQIDPSKLCHTRVSKITESFGASISQEIDSNTTH 738

Query: 950 VVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAI 993
           V+    GT KV  AL      VVH  W+      + R +E ++++
Sbjct: 739 VIFIKEGTSKVLQALKNPNIKVVHFAWLRDCIHRWERIDELNYSV 783


>gi|427782099|gb|JAA56501.1| Putative rna polymerase ii ctd phosphatase [Rhipicephalus
           pulchellus]
          Length = 360

 Score = 65.1 bits (157), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 38/105 (36%), Positives = 56/105 (53%), Gaps = 2/105 (1%)

Query: 892 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVV 951
           D+++++   +RK+L G  IVFS V P+ +        WQ A+  GA  ++ +   VTH+V
Sbjct: 11  DLKHVVPYVRRKVLKGVHIVFSGVVPMNQPAEKSQ-AWQVAKSLGATVSRDLCPGVTHLV 69

Query: 952 ANSLGTDKVNWALST-GRFVVHPGWVEASALLYRRANEQDFAIKP 995
           A  LGT KVN A    G  VVHP W+   A  +    E  F ++P
Sbjct: 70  AARLGTAKVNKARRMPGVHVVHPSWLWCCAERWEHVAEALFPLRP 114


>gi|218197280|gb|EEC79707.1| hypothetical protein OsI_21008 [Oryza sativa Indica Group]
          Length = 485

 Score = 65.1 bits (157), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 51/171 (29%), Positives = 83/171 (48%), Gaps = 20/171 (11%)

Query: 680 RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPG 739
           + + LVLDLD TL++S+    VD         ++  D   P  H  +     ++ K RP 
Sbjct: 308 KNITLVLDLDETLIHSSA---VD---------RDGADFSFPMYHGLK--EHTVYVKKRPH 353

Query: 740 IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVP 799
           + TFL++ S++F++ ++T     YA  +  +LDPK + F  R     D   P DG     
Sbjct: 354 VDTFLQKVSEMFKVVIFTASLSSYANRLLDMLDPKNIFFTKRYFR--DSCLPVDGSYL-- 409

Query: 800 KSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLL 850
             KDL  ++   + VVIID+S  V+   + N I +E +T  P  +    L+
Sbjct: 410 --KDLTVIVADLAKVVIIDNSPEVFRLQEENGIPIESWTSDPADKSLVELI 458


>gi|222632581|gb|EEE64713.1| hypothetical protein OsJ_19569 [Oryza sativa Japonica Group]
          Length = 485

 Score = 65.1 bits (157), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 51/171 (29%), Positives = 83/171 (48%), Gaps = 20/171 (11%)

Query: 680 RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPG 739
           + + LVLDLD TL++S+    VD         ++  D   P  H  +     ++ K RP 
Sbjct: 308 KNITLVLDLDETLIHSSA---VD---------RDGADFSFPMYHGLK--EHTVYVKKRPH 353

Query: 740 IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVP 799
           + TFL++ S++F++ ++T     YA  +  +LDPK + F  R     D   P DG     
Sbjct: 354 VDTFLQKVSEMFKVVIFTASLSSYANRLLDMLDPKNIFFTKRYFR--DSCLPVDGSYL-- 409

Query: 800 KSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLL 850
             KDL  ++   + VVIID+S  V+   + N I +E +T  P  +    L+
Sbjct: 410 --KDLTVIVADLAKVVIIDNSPEVFRLQEENGIPIESWTSDPADKSLVELI 458


>gi|47224149|emb|CAG13069.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 159

 Score = 64.7 bits (156), Expect = 2e-07,   Method: Composition-based stats.
 Identities = 51/170 (30%), Positives = 79/170 (46%), Gaps = 25/170 (14%)

Query: 673 QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMG- 731
           Q K+  +RKL L++DLD+TL+++                 E   +  P +++F+    G 
Sbjct: 11  QDKLHQSRKLVLMVDLDNTLIHTT----------------EIPCQLSPKKNVFKMKLEGS 54

Query: 732 --MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 789
              + +LRP    FLE+ S+LFE++++T   + YA  +A  LDP    FA R+ISR +  
Sbjct: 55  PTYYVRLRPYYKEFLEKISELFELNIFTFACQSYAKTVAGFLDPDNTFFAQRIISRDNCF 114

Query: 790 DPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTY 839
            P      V            ES   +IDD   VW      L+ V+ Y Y
Sbjct: 115 YPATKMANVRFFSPCG-----ESMTCMIDDREDVWNFAP-GLVAVKPYMY 158


>gi|428672173|gb|EKX73087.1| conserved hypothetical protein [Babesia equi]
          Length = 937

 Score = 64.3 bits (155), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 77/323 (23%), Positives = 128/323 (39%), Gaps = 68/323 (21%)

Query: 727 FPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG 786
           FP++  + KLRP I  FL+  S  +EM +YT   K YA  +  +LDP   LF  R+++R 
Sbjct: 504 FPNITYYMKLRPCIREFLQVLSLYYEMSIYTNATKEYADVVISILDPDRTLFMDRIVARN 563

Query: 787 --DDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVW---PHNKLNLIVVERYTYFP 841
             D+ D      R+    DL         V+  DD   VW   PH +  ++  E Y +F 
Sbjct: 564 SVDEKDLLKSAARL--YPDLN-----RRFVLAFDDRKDVWADIPHRQ--VVRAEHYDFFE 614

Query: 842 CSRRQ----FGLLGPSLLEID----------------HDERSEDGTLASSLGVIE----- 876
               +    +G +       D                +  ++ DG   ++ G+ +     
Sbjct: 615 SYSMELANNYGFVAQDAFSPDVPGPQMENTSKSSNNLYISQNFDGNAENTEGITDDKDVV 674

Query: 877 ---------------RLHKIFFSHQSLDDVDVRNILAAEQRKILAGCRIVFS--RVFPVG 919
                           +HK FF+     + +V  IL   Q + L   R++ +  R    G
Sbjct: 675 RDYDRHLRHMITLFLEIHKRFFNDPF--NANVGTILENLQSETLKDTRVLLTGYRKNAKG 732

Query: 920 EANPHLHPLW-----QTAEQFGAVCTKHIDDQ-VTHVVANSLGTDKVNWAL-STGRFV-- 970
            +N  LH        +   + G++    I D  +TH+VA    TD +  +  ST   +  
Sbjct: 733 ISN-VLHSDCEQRQREVVSELGSIVLNKITDPGLTHIVAGKNCTDNIAKSRNSTYSHIHK 791

Query: 971 VHPGWVEASALLYRRANEQDFAI 993
           VH  W+ A    + R +E+ F +
Sbjct: 792 VHTLWLYACKATFSRVDEKVFDM 814


>gi|68525545|ref|XP_723632.1| NLI interacting factor [Plasmodium yoelii yoelii 17XNL]
 gi|23477988|gb|EAA15197.1| NLI interacting factor, putative [Plasmodium yoelii yoelii]
          Length = 1251

 Score = 64.3 bits (155), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 36/111 (32%), Positives = 55/111 (49%), Gaps = 11/111 (9%)

Query: 732 MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 791
           ++ KLRPG+  FL++ ++ +E++LYTMG   +A     +LDP    F  R+ SR D  + 
Sbjct: 428 IYYKLRPGVIEFLQKMNQKYEIYLYTMGTIEHAKSCLFLLDPLKKFFGNRIFSRKDCTNG 487

Query: 792 FDGDERVPKSKDLEGVL-GMESAVVIIDDSVRVWPHNKLNLIVVERYTYFP 841
                     K L  +L    S  + +DDS  +W     + I V  Y YFP
Sbjct: 488 M---------KHLNRILPTYRSISICVDDSEYIWKETN-SCIKVHAYNYFP 528


>gi|67624539|ref|XP_668552.1| NLI interacting factor [Cryptosporidium hominis TU502]
 gi|54659751|gb|EAL38315.1| NLI interacting factor [Cryptosporidium hominis]
          Length = 595

 Score = 63.9 bits (154), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 91/367 (24%), Positives = 149/367 (40%), Gaps = 66/367 (17%)

Query: 674 KKMFSARKLCLVLDLDHTLL---NSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP-- 728
           K   +  KL  +LDLD+TLL   NS K      + D I    +      P  + F  P  
Sbjct: 166 KDYLAQNKLVAILDLDNTLLHAYNSTKIGCNINLEDFISSSGD------PEMYKFVLPQD 219

Query: 729 -HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD 787
            +   + KLRPG+  FL   +  + M + T   + YA  +  VLDP+   F  R+++R  
Sbjct: 220 LNTPYYLKLRPGVREFLNTIAPYYIMGICTNATREYADVIRAVLDPQRDKFGDRIVAR-- 277

Query: 788 DGDPFDGDERVPKSKDLEGV-LGMES-AVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRR 845
             +  DG +     KD   + + +E+ A+V++DD   VW  + L   VV+  TY    +R
Sbjct: 278 --ESVDGRD---TQKDFRKICVDVETRAIVLLDDRSDVW-DSSLESQVVKAQTYEYFEQR 331

Query: 846 QFGL------LGPSLLEIDHDERSEDGTLASSLG-----------------------VIE 876
           +  L      L      I  +  +    L+++L                        V +
Sbjct: 332 KDALKSHYPPLSSGANSISANSSAPGDILSAALSSLSNASGGNSIADYDRHLDYLIRVFK 391

Query: 877 RLHKIFFSHQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPL---W---- 929
            LH  FF  Q+ +   V +IL   + +IL  C + F+      E  P +  L   W    
Sbjct: 392 ELHTRFF--QNPETACVGDILKKMRSEILENCIVCFTGFLKADE-KPIVKGLPSNWGDSQ 448

Query: 930 ----QTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYR 984
                 A + GA   ++++   TH+V     T K + A +     +VH  W+ A    ++
Sbjct: 449 ADAESAAIRLGAGIEENLEPLATHLVVQKTNTAKFHQAKNNPNVKIVHTLWLWACDGQWQ 508

Query: 985 RANEQDF 991
           R +E  F
Sbjct: 509 RVSEDLF 515


>gi|297740632|emb|CBI30814.3| unnamed protein product [Vitis vinifera]
          Length = 479

 Score = 63.9 bits (154), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 47/149 (31%), Positives = 73/149 (48%), Gaps = 28/149 (18%)

Query: 680 RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPG 739
           + + LVLDLD TL++S             L   ++ D   P    F      ++ K RP 
Sbjct: 305 KSITLVLDLDETLVHST------------LEHCDDADFTFPV--FFNMKDHTVYVKQRPY 350

Query: 740 IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG---DDGDPFDGDE 796
           + TFLER +++FE+ ++T    +YA ++  +LDP G  F+ R         DG       
Sbjct: 351 LHTFLERVAEMFEIVVFTASQSIYAEQLLDILDPDGKFFSHRAYRESCIFSDG------- 403

Query: 797 RVPKSKDLEGVLGMESA-VVIIDDSVRVW 824
               +KDL  VLG++ A V IID+S +V+
Sbjct: 404 --SYTKDL-TVLGIDLAKVAIIDNSPQVF 429


>gi|225463384|ref|XP_002271705.1| PREDICTED: uncharacterized protein LOC100258847 [Vitis vinifera]
          Length = 484

 Score = 63.9 bits (154), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 47/149 (31%), Positives = 73/149 (48%), Gaps = 28/149 (18%)

Query: 680 RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPG 739
           + + LVLDLD TL++S             L   ++ D   P    F      ++ K RP 
Sbjct: 310 KSITLVLDLDETLVHST------------LEHCDDADFTFPV--FFNMKDHTVYVKQRPY 355

Query: 740 IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG---DDGDPFDGDE 796
           + TFLER +++FE+ ++T    +YA ++  +LDP G  F+ R         DG       
Sbjct: 356 LHTFLERVAEMFEIVVFTASQSIYAEQLLDILDPDGKFFSHRAYRESCIFSDG------- 408

Query: 797 RVPKSKDLEGVLGMESA-VVIIDDSVRVW 824
               +KDL  VLG++ A V IID+S +V+
Sbjct: 409 --SYTKDL-TVLGIDLAKVAIIDNSPQVF 434


>gi|326499061|dbj|BAK06021.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 415

 Score = 63.5 bits (153), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 56/181 (30%), Positives = 83/181 (45%), Gaps = 28/181 (15%)

Query: 674 KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMW 733
           K+  S  +  LVLDLD TL++S             L   E+ D   P R  F      ++
Sbjct: 215 KQTRSCPRTTLVLDLDETLVHST------------LEPCEDSDFTFPVR--FNLRDHTIY 260

Query: 734 TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD---DGD 790
            + RP +  FLER + +FE+ ++T    +YA ++  VLDPK  LF  RV        +G+
Sbjct: 261 VRCRPYLKDFLERVASMFEIIIFTASQSIYAEQLLNVLDPKRRLFRHRVYRESCVYVEGN 320

Query: 791 PFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGL 849
                      KDL  VLG + S VVI+D+S + +     N I +E +   P  +    L
Sbjct: 321 YL---------KDL-SVLGRDLSRVVIVDNSPQAFGFQLDNGIPIESWFDDPNDKELLAL 370

Query: 850 L 850
           L
Sbjct: 371 L 371


>gi|440290285|gb|ELP83711.1| RNA polymerase II ctd phosphatase, putative [Entamoeba invadens
           IP1]
          Length = 434

 Score = 63.2 bits (152), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 65/248 (26%), Positives = 107/248 (43%), Gaps = 57/248 (22%)

Query: 720 PHR-HLFRFPHMG--MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLD--PK 774
           P R   F+ P     ++   R GI  F+ +   L+E+H+ T+G K YA  + + L+  P 
Sbjct: 94  PERGFFFQIPEYSKKVFVYFRDGIVAFMTKLITLYEIHVVTLGQKDYAMAIVEALNKLPG 153

Query: 775 GVLFAGRVISR--------GDDGDPFDGDERVPKSKDLEGV--------LGMESAVVIID 818
           G    G+++           DDGD F  D  + +++D E          +G E   +++D
Sbjct: 154 GPFINGKIVCSEDCISEILKDDGD-FQNDGLIERNEDTERRAVKRTVPGMGSEEVQIVVD 212

Query: 819 DSVRVWP-HNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIER 877
           D + VW  HN L +             ++F L  P      + E  +D  L     +++ 
Sbjct: 213 DRIDVWDNHNVLQI-------------QEFVL--P------NSENVKDNELERISEILKY 251

Query: 878 LHKIFFSHQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVG---EANPHLHPLW----Q 930
           +H  FF  +S    +VR IL+  +  IL G +I FS+  P G   + N     +W    +
Sbjct: 252 VHTEFFGGKS----NVRVILSEMREHILKGKKIYFSK--PKGGETKMNKMTEFIWNAIKR 305

Query: 931 TAEQFGAV 938
             E  GAV
Sbjct: 306 NCEMMGAV 313


>gi|255557435|ref|XP_002519748.1| conserved hypothetical protein [Ricinus communis]
 gi|223541165|gb|EEF42721.1| conserved hypothetical protein [Ricinus communis]
          Length = 474

 Score = 63.2 bits (152), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 48/155 (30%), Positives = 74/155 (47%), Gaps = 28/155 (18%)

Query: 674 KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMW 733
           K+    + + LVLDLD TL++S   H                D +      F      ++
Sbjct: 294 KESLMKKSVTLVLDLDETLVHSTLEHC--------------DDADFTFTVFFNLKEHTVY 339

Query: 734 TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG---DDGD 790
            K RP + TFLER ++LFE+ ++T    +YA ++  +LDP+  L + RV        DG 
Sbjct: 340 VKRRPHLHTFLERVAELFEVVIFTASQSIYAAQLLDILDPEKKLISRRVYRESCIFTDG- 398

Query: 791 PFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVW 824
                     +KDL  VLG++ A V IID+S +V+
Sbjct: 399 --------SYTKDL-TVLGVDLAKVAIIDNSPQVF 424


>gi|301115156|ref|XP_002905307.1| nuclear LIM factor interactor-interacting protein hyphal form,
           putative [Phytophthora infestans T30-4]
 gi|262110096|gb|EEY68148.1| nuclear LIM factor interactor-interacting protein hyphal form,
           putative [Phytophthora infestans T30-4]
          Length = 422

 Score = 63.2 bits (152), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 55/164 (33%), Positives = 81/164 (49%), Gaps = 28/164 (17%)

Query: 678 SARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHL---FRFPHMGMWT 734
           +A K+CLVLDLD TL++ +    VD V             + PH      F      +  
Sbjct: 236 NAPKICLVLDLDETLVHCS----VDEV-------------KNPHMQFPVTFNGVEYIVNV 278

Query: 735 KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDG 794
           K RP +  FL+R SKLFE+ ++T  +K+YA ++  +LDP   L   R+  R D  D F  
Sbjct: 279 KKRPHMEYFLKRVSKLFEIVVFTASHKVYAEKLTNMLDPHRNLIKYRLY-RDDCLDVFGN 337

Query: 795 DERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 837
                  KDL  VLG + S VV++D+S   + +   N I +E +
Sbjct: 338 -----YLKDL-NVLGRDLSKVVLVDNSPHAFGYQVNNGIPIETW 375


>gi|147839779|emb|CAN65912.1| hypothetical protein VITISV_035567 [Vitis vinifera]
          Length = 482

 Score = 63.2 bits (152), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 47/149 (31%), Positives = 73/149 (48%), Gaps = 28/149 (18%)

Query: 680 RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPG 739
           + + LVLDLD TL++S             L   ++ D   P    F      ++ K RP 
Sbjct: 308 KSITLVLDLDETLVHST------------LEHCDDADFTFPV--FFNMKDHTVYVKQRPY 353

Query: 740 IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG---DDGDPFDGDE 796
           + TFLER +++FE+ ++T    +YA ++  +LDP G  F+ R         DG       
Sbjct: 354 LHTFLERVAEMFEIVVFTASQSIYAEQLLDILDPDGKFFSHRAYRESCIFSDG------- 406

Query: 797 RVPKSKDLEGVLGMESA-VVIIDDSVRVW 824
               +KDL  VLG++ A V IID+S +V+
Sbjct: 407 --SYTKDL-TVLGIDLAKVAIIDNSPQVF 432


>gi|15239800|ref|NP_196747.1| SCP1-like small phosphatase 5 [Arabidopsis thaliana]
 gi|30683828|ref|NP_850809.1| SCP1-like small phosphatase 5 [Arabidopsis thaliana]
 gi|42573341|ref|NP_974767.1| SCP1-like small phosphatase 5 [Arabidopsis thaliana]
 gi|145334381|ref|NP_001078572.1| SCP1-like small phosphatase 5 [Arabidopsis thaliana]
 gi|7573353|emb|CAB87659.1| putative protein [Arabidopsis thaliana]
 gi|21553575|gb|AAM62668.1| unknown [Arabidopsis thaliana]
 gi|56550687|gb|AAV97797.1| At5g11860 [Arabidopsis thaliana]
 gi|332004345|gb|AED91728.1| SCP1-like small phosphatase 5 [Arabidopsis thaliana]
 gi|332004346|gb|AED91729.1| SCP1-like small phosphatase 5 [Arabidopsis thaliana]
 gi|332004347|gb|AED91730.1| SCP1-like small phosphatase 5 [Arabidopsis thaliana]
 gi|332004348|gb|AED91731.1| SCP1-like small phosphatase 5 [Arabidopsis thaliana]
          Length = 305

 Score = 63.2 bits (152), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 60/185 (32%), Positives = 87/185 (47%), Gaps = 36/185 (19%)

Query: 674 KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP----- 728
           K+  S   + LVLDLD TL++S             L    E D        F FP     
Sbjct: 104 KQTRSCPPISLVLDLDETLVHST------------LEPCGEVD--------FTFPVNFNE 143

Query: 729 --HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG 786
             HM ++ + RP +  F+ER S+LFE+ ++T    +YA ++  VLDPK  LF  RV    
Sbjct: 144 EEHM-VYVRCRPHLKEFMERVSRLFEIIIFTASQSIYAEQLLNVLDPKRKLFRHRVYR-- 200

Query: 787 DDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRR 845
           D    FDG+      KDL  VLG + S V+I+D+S + +     N + +E +   P  + 
Sbjct: 201 DSCVFFDGN----YLKDL-SVLGRDLSRVIIVDNSPQAFGFQVENGVPIESWFNDPSDKE 255

Query: 846 QFGLL 850
              LL
Sbjct: 256 LLHLL 260


>gi|356556521|ref|XP_003546573.1| PREDICTED: uncharacterized protein LOC100799803 [Glycine max]
          Length = 471

 Score = 62.8 bits (151), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 45/149 (30%), Positives = 72/149 (48%), Gaps = 28/149 (18%)

Query: 680 RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPG 739
           + + LVLDLD TL++S   H                D +      F      ++ K RP 
Sbjct: 297 KSITLVLDLDETLVHSTLEHC--------------DDADFTFTVFFNLKEYIVYVKQRPY 342

Query: 740 IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG---DDGDPFDGDE 796
           + TFLER S++FE+ ++T    +YA ++  +LDP G   + R+        DG+      
Sbjct: 343 LHTFLERVSEMFEVVIFTASQSIYAKQLLDILDPDGRFISRRMYRESCLFSDGN------ 396

Query: 797 RVPKSKDLEGVLGMESA-VVIIDDSVRVW 824
               +KDL  +LG++ A V IID+S +V+
Sbjct: 397 ---YTKDL-TILGVDLAKVAIIDNSPQVF 421


>gi|297811303|ref|XP_002873535.1| NLI interacting factor family protein [Arabidopsis lyrata subsp.
           lyrata]
 gi|297319372|gb|EFH49794.1| NLI interacting factor family protein [Arabidopsis lyrata subsp.
           lyrata]
          Length = 305

 Score = 62.8 bits (151), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 60/185 (32%), Positives = 87/185 (47%), Gaps = 36/185 (19%)

Query: 674 KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP----- 728
           K+  S   + LVLDLD TL++S             L    E D        F FP     
Sbjct: 104 KQTRSCPPISLVLDLDETLVHST------------LEPCGEVD--------FTFPVNFNE 143

Query: 729 --HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG 786
             HM ++ + RP +  F+ER S+LFE+ ++T    +YA ++  VLDPK  LF  RV    
Sbjct: 144 EEHM-VYVRCRPHLKEFMERVSRLFEIIIFTASQSIYAEQLLNVLDPKRKLFRHRVYR-- 200

Query: 787 DDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRR 845
           D    FDG+      KDL  VLG + S V+I+D+S + +     N + +E +   P  + 
Sbjct: 201 DSCVFFDGN----YLKDL-SVLGRDLSRVIIVDNSPQAFGFQVENGVPIESWFNDPSDKE 255

Query: 846 QFGLL 850
              LL
Sbjct: 256 LLHLL 260


>gi|68074755|ref|XP_679294.1| hypothetical protein [Plasmodium berghei strain ANKA]
 gi|56500009|emb|CAH99961.1| conserved hypothetical protein [Plasmodium berghei]
          Length = 983

 Score = 62.8 bits (151), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 36/111 (32%), Positives = 55/111 (49%), Gaps = 11/111 (9%)

Query: 732 MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 791
           ++ KLRPG+  FL++ ++ +E++LYTMG   +A     +LDP    F  R+ SR D  + 
Sbjct: 233 IYYKLRPGVIEFLQKMNQKYEIYLYTMGTIEHAKSCLFLLDPLKKFFGNRIFSRKDCTNG 292

Query: 792 FDGDERVPKSKDLEGVL-GMESAVVIIDDSVRVWPHNKLNLIVVERYTYFP 841
                     K L  +L    S  + +DDS  +W     + I V  Y YFP
Sbjct: 293 M---------KHLNRILPTYRSISICVDDSEYIWKEAN-SCIKVHAYNYFP 333


>gi|356530555|ref|XP_003533846.1| PREDICTED: uncharacterized protein LOC100786602 [Glycine max]
          Length = 470

 Score = 62.8 bits (151), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 54/189 (28%), Positives = 87/189 (46%), Gaps = 38/189 (20%)

Query: 640 PQSAWGDVEHLFEGYDDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFH 699
           PQS   ++  L E   + Q   I K+  RR          + + LVLDLD TL++S    
Sbjct: 266 PQSFIKNLPELSEIEVNGQPTLIPKQSPRR----------KSITLVLDLDETLVHST--- 312

Query: 700 EVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMG 759
            ++P            D +      F      ++ K RP +  FLER S++FE+ ++T  
Sbjct: 313 -LEPC----------DDADFTFTVFFNLKEYTVYVKQRPYLHAFLERVSEMFEVVIFTAS 361

Query: 760 NKLYATEMAKVLDPKGVLFAGRVISRG---DDGDPFDGDERVPKSKDLEGVLGMESA-VV 815
             +YA ++  +LDP G   + R+        DG+          +KDL  +LG++ A V 
Sbjct: 362 QSIYAKQLLDILDPDGRFISRRMYRESCLFSDGN---------YTKDL-TILGVDLAKVA 411

Query: 816 IIDDSVRVW 824
           IID+S +V+
Sbjct: 412 IIDNSPQVF 420


>gi|70952066|ref|XP_745226.1| hypothetical protein [Plasmodium chabaudi chabaudi]
 gi|56525483|emb|CAH77992.1| conserved hypothetical protein [Plasmodium chabaudi chabaudi]
          Length = 1224

 Score = 62.8 bits (151), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 36/111 (32%), Positives = 55/111 (49%), Gaps = 11/111 (9%)

Query: 732 MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 791
           ++ KLRPG+  FL++ ++ +E++LYTMG   +A     +LDP    F  R+ SR D  + 
Sbjct: 429 IYYKLRPGVIEFLQKMNQKYEIYLYTMGTIEHAKSCLFLLDPLKKFFGNRIFSRKDCTNG 488

Query: 792 FDGDERVPKSKDLEGVL-GMESAVVIIDDSVRVWPHNKLNLIVVERYTYFP 841
                     K L  +L    S  + +DDS  +W     + I V  Y YFP
Sbjct: 489 M---------KHLNRILPTYRSISICVDDSEYIWKEAN-SCIKVHAYNYFP 529


>gi|397787628|gb|AFO66533.1| putative NLI interacting factor family protein [Brassica napus]
          Length = 477

 Score = 62.4 bits (150), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 51/140 (36%), Positives = 73/140 (52%), Gaps = 26/140 (18%)

Query: 684 LVLDLDHTLLNSAK--FHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIW 741
           LVLDLD TL++S+     EVD        ++E               HM ++ + RP + 
Sbjct: 116 LVLDLDETLVHSSLEPCGEVDFTFTVHFNEEE---------------HM-VYVRCRPHLK 159

Query: 742 TFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKS 801
            F+ER S+LFE+ ++T    +YA ++  VLDPK  LF  RV    D    FDG+      
Sbjct: 160 EFMERVSRLFEVIIFTASQSIYAEQLLNVLDPKRKLFRHRVYR--DSCVFFDGN----YL 213

Query: 802 KDLEGVLGME-SAVVIIDDS 820
           KDL  VLG + S V+I+D+S
Sbjct: 214 KDL-SVLGRDLSRVIIVDNS 232


>gi|351710351|gb|EHB13270.1| CTD small phosphatase-like protein 2 [Heterocephalus glaber]
          Length = 466

 Score = 62.0 bits (149), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 62/234 (26%), Positives = 104/234 (44%), Gaps = 24/234 (10%)

Query: 607 IKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQK 664
           I +G +      N D         PE+G   AH ++ + +   +F+ Y   +      ++
Sbjct: 211 INNGLEEAEETVNRDIPHLTAPVTPESGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEE 270

Query: 665 ERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHL 724
           +  R+     K  S  +  LVLDLD TL           VH  +    E +D       L
Sbjct: 271 QLNRKPALPLKTRSTPEFSLVLDLDETL-----------VHCSL---NELEDAALTFPVL 316

Query: 725 FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 784
           F+     ++ +LRP    FLER S+++E+ L+T   K+YA ++  +LDPK  L   R+  
Sbjct: 317 FQDVIYQVYVRLRPFFREFLERMSQMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR 376

Query: 785 RGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 837
             +      G+      KDL  +LG + S  +IID+S + + +   N I +E +
Sbjct: 377 --EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 423


>gi|397787605|gb|AFO66511.1| putative small phosphatase-like protein 2-B [Brassica napus]
          Length = 262

 Score = 62.0 bits (149), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 51/140 (36%), Positives = 73/140 (52%), Gaps = 26/140 (18%)

Query: 684 LVLDLDHTLLNSAK--FHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIW 741
           LVLDLD TL++S+     EVD        ++E               HM ++ + RP + 
Sbjct: 71  LVLDLDETLVHSSLEPCGEVDFTFTVHFNEEE---------------HM-VYVRCRPHLK 114

Query: 742 TFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKS 801
            F+ER S+LFE+ ++T    +YA ++  VLDPK  LF  RV    D    FDG+      
Sbjct: 115 EFMERVSRLFEVIIFTASQSIYAEQLLNVLDPKRKLFRHRVYR--DSCVFFDGN----YL 168

Query: 802 KDLEGVLGME-SAVVIIDDS 820
           KDL  VLG + S V+I+D+S
Sbjct: 169 KDL-SVLGRDLSRVIIVDNS 187


>gi|147798518|emb|CAN65472.1| hypothetical protein VITISV_037605 [Vitis vinifera]
          Length = 506

 Score = 62.0 bits (149), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 55/184 (29%), Positives = 86/184 (46%), Gaps = 32/184 (17%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 730
           EE K+    +++ LVLDLD TL++S             L   +  D   P    F     
Sbjct: 327 EESKR----KRITLVLDLDETLVHST------------LEPCDHADFTFPV--FFNMKEH 368

Query: 731 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG---D 787
            ++ + RP +  FLER +++FE+ ++T    +YA ++  +LDP   LF+GR         
Sbjct: 369 TIYVRQRPFLQMFLERVAEMFEIIVFTASQSIYAEQLLDILDPDRKLFSGRAYRESCIFS 428

Query: 788 DGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQ 846
           DG           +KDL  VLG++ A V IID+S +V+     N I ++ +   P  R  
Sbjct: 429 DG---------SYTKDL-TVLGIDLAKVAIIDNSPQVFRLQVDNGIPIKSWFDDPSDRAL 478

Query: 847 FGLL 850
             LL
Sbjct: 479 ISLL 482


>gi|71031738|ref|XP_765511.1| hypothetical protein [Theileria parva strain Muguga]
 gi|68352467|gb|EAN33228.1| hypothetical protein TP02_0943 [Theileria parva]
          Length = 769

 Score = 61.6 bits (148), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 73/328 (22%), Positives = 128/328 (39%), Gaps = 72/328 (21%)

Query: 727 FPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG 786
           FP++  + KLRP I  FL+  S  +EM +YT   K YA  +  +LDP   LF  R+++R 
Sbjct: 339 FPNVNYYMKLRPCIREFLQILSLYYEMSIYTNATKEYADVVISILDPDRSLFMDRIVARN 398

Query: 787 --DDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVW---PHNKLNLIVVERYTYF- 840
             D+ D      R+    D   +L         DD   VW   PH +  ++  E Y +F 
Sbjct: 399 SVDEKDLLKSASRLYPDLDTRFILAF-------DDRRDVWSDIPHKQ--VVRAEHYDFFE 449

Query: 841 ---------------PCSRR-----QFGL------------LGPSLLEIDHDERSE---- 864
                          P +++      F L            +      I+ +  S+    
Sbjct: 450 SYISELNNNYSSSPTPSNKQTPESNSFNLTTNVSSSAPGKPVNDYTFNIEQESESDPDVT 509

Query: 865 ---------DGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQRKILAGCRIVFSRV 915
                    D  L   + +   +HK FF+    +DV   +IL + Q ++L G  ++ +  
Sbjct: 510 REKVSIVDYDRHLKYMVKIFLEIHKRFFADPFNNDVG--SILESIQSQVLKGIGVLLTGY 567

Query: 916 FPVGE-ANPHLHPLWQTAE-----QFGAVCTKHIDD-QVTHVVANSLGTDKVNWALSTGR 968
               + AN  LH   +  +     + GA     ++D ++TH++A    TD +  +     
Sbjct: 568 RKNTKVANSVLHADCELRQKENLLELGASLVLRLNDSKLTHIMAGKNCTDNITKSRDPQY 627

Query: 969 ---FVVHPGWVEASALLYRRANEQDFAI 993
                VH  W+ +    + + +E  F +
Sbjct: 628 NHIHKVHTLWLYSCRATFSKVDESLFKV 655


>gi|147907092|ref|NP_001089935.1| CTD small phosphatase-like protein 2-B [Xenopus laevis]
 gi|83405117|gb|AAI10767.1| Ctdspl2b protein [Xenopus laevis]
          Length = 466

 Score = 61.6 bits (148), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 59/211 (27%), Positives = 99/211 (46%), Gaps = 24/211 (11%)

Query: 630 GPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQKERTRRLEEQKKMFSARKLCLVLD 687
            PE+G   AH ++A+ +   +F+ Y   +      +++  R+     K  S  +  LVLD
Sbjct: 234 SPESGYSSAHAEAAYEEDWEVFDPYFFIKHVPPLTEEQLNRKPALPLKTRSTPEFSLVLD 293

Query: 688 LDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERA 747
           LD TL           VH  +    E +D       LF+     ++ +LRP    FLER 
Sbjct: 294 LDETL-----------VHCSL---NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERM 339

Query: 748 SKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGV 807
           S+++E+ L+T   K+YA ++  +LDPK  L   R+    +      G+      KDL  +
Sbjct: 340 SQIYEIILFTASKKVYADKLLNILDPKKRLVRHRLFR--EHCVCVQGN----YIKDL-NI 392

Query: 808 LGME-SAVVIIDDSVRVWPHNKLNLIVVERY 837
           LG + S  +IID+S + + +   N I +E +
Sbjct: 393 LGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 423


>gi|327288817|ref|XP_003229121.1| PREDICTED: CTD small phosphatase-like protein 2-like [Anolis
           carolinensis]
          Length = 466

 Score = 61.6 bits (148), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 67/265 (25%), Positives = 118/265 (44%), Gaps = 30/265 (11%)

Query: 582 DFMSVSQPLTSEPMVSQNSPI--QPGQIKS----GADMKAVVTNHDDKQTGTGSGPEAGP 635
           D   V + +TS    +  +P   Q  Q++S    G +    VT+ D         P++G 
Sbjct: 180 DMEQVDEIITSTAASANGTPYASQVAQVRSTINNGLEEAEDVTDRDLPPLTAPVSPDSGY 239

Query: 636 VGAHPQSAWGDVEHLFEGYDDQQKA--AIQKERTRRLEEQKKMFSARKLCLVLDLDHTLL 693
             AH ++ + +   +F+ Y   +      +++  R+     K  S  +  LVLDLD TL+
Sbjct: 240 SSAHAEATYEEDWEVFDPYYFIKHVPPLTEEQLNRKPALPLKTRSTPEFSLVLDLDETLV 299

Query: 694 NSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEM 753
           + +                E +D       LF+     ++ +LRP    FLER S+++E+
Sbjct: 300 HCSL--------------NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERMSQIYEI 345

Query: 754 HLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGME-S 812
            L+T   K+YA ++  +LDPK  L   R+    +      G+      KDL  +LG + S
Sbjct: 346 ILFTASKKVYADKLLNILDPKKQLVRHRLFR--EHCVCVQGN----YIKDL-NILGRDLS 398

Query: 813 AVVIIDDSVRVWPHNKLNLIVVERY 837
             +IID+S + + +   N I +E +
Sbjct: 399 KTIIIDNSPQAFAYQLSNGIPIESW 423


>gi|123900520|sp|Q3KQB6.1|CTL2B_XENLA RecName: Full=CTD small phosphatase-like protein 2-B;
           Short=CTDSP-like 2-B
 gi|76779483|gb|AAI06291.1| Ctdspl2b protein [Xenopus laevis]
          Length = 466

 Score = 61.6 bits (148), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 59/211 (27%), Positives = 99/211 (46%), Gaps = 24/211 (11%)

Query: 630 GPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQKERTRRLEEQKKMFSARKLCLVLD 687
            PE+G   AH ++A+ +   +F+ Y   +      +++  R+     K  S  +  LVLD
Sbjct: 234 SPESGYSSAHAEAAYEEDWEVFDPYFFIKHVPPLTEEQLNRKPALPLKTRSTPEFSLVLD 293

Query: 688 LDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERA 747
           LD TL           VH  +    E +D       LF+     ++ +LRP    FLER 
Sbjct: 294 LDETL-----------VHCSL---NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERM 339

Query: 748 SKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGV 807
           S+++E+ L+T   K+YA ++  +LDPK  L   R+    +      G+      KDL  +
Sbjct: 340 SQIYEIILFTASKKVYADKLLNILDPKKRLVRHRLFR--EHCVCVQGN----YIKDL-NI 392

Query: 808 LGME-SAVVIIDDSVRVWPHNKLNLIVVERY 837
           LG + S  +IID+S + + +   N I +E +
Sbjct: 393 LGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 423


>gi|56605878|ref|NP_001008438.1| CTD small phosphatase-like protein 2 [Xenopus (Silurana)
           tropicalis]
 gi|82181540|sp|Q66KM5.1|CTSL2_XENTR RecName: Full=CTD small phosphatase-like protein 2;
           Short=CTDSP-like 2
 gi|51512946|gb|AAH80328.1| MGC79498 protein [Xenopus (Silurana) tropicalis]
          Length = 466

 Score = 61.6 bits (148), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 59/211 (27%), Positives = 99/211 (46%), Gaps = 24/211 (11%)

Query: 630 GPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQKERTRRLEEQKKMFSARKLCLVLD 687
            PE+G   AH ++A+ +   +F+ Y   +      +++  R+     K  S  +  LVLD
Sbjct: 234 SPESGYSSAHAEAAYEEDWEVFDPYYFIKHVPPLTEEQLNRKPALPLKTRSTPEFSLVLD 293

Query: 688 LDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERA 747
           LD TL           VH  +    E +D       LF+     ++ +LRP    FLER 
Sbjct: 294 LDETL-----------VHCSL---NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERM 339

Query: 748 SKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGV 807
           S+++E+ L+T   K+YA ++  +LDPK  L   R+    +      G+      KDL  +
Sbjct: 340 SQIYEIILFTASKKVYADKLLNILDPKKRLVRHRLFR--EHCVCVQGN----YIKDL-NI 392

Query: 808 LGME-SAVVIIDDSVRVWPHNKLNLIVVERY 837
           LG + S  +IID+S + + +   N I +E +
Sbjct: 393 LGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 423


>gi|66805733|ref|XP_636588.1| hypothetical protein DDB_G0288707 [Dictyostelium discoideum AX4]
 gi|60464974|gb|EAL63085.1| hypothetical protein DDB_G0288707 [Dictyostelium discoideum AX4]
          Length = 985

 Score = 61.6 bits (148), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 44/174 (25%), Positives = 81/174 (46%), Gaps = 28/174 (16%)

Query: 681 KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 740
           K+ L++D+DHTLL+S K    DP  +    K         ++      +   + K RP  
Sbjct: 574 KMYLIVDIDHTLLHSTK----DPNAESYYLKDN-----SINKFTITETNETFYVKQRPNA 624

Query: 741 WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISR------------GDD 788
             FL   S  F+++LY+   K Y  ++A +LDP   +F  +VI++               
Sbjct: 625 IEFLSSLSSQFKIYLYSFHPKYYVEQLALILDPNRSIFT-KVITKEVIEPVEPLPPINSI 683

Query: 789 GDPFDGDERVPKSKDLEGVLGMESA--VVIIDDSVRVWPHNKLNLIVVERYTYF 840
           G P+     V  +++   +   E+   ++I+DD   VW + + NLI+++ + +F
Sbjct: 684 GKPY----IVFNNQNFSKIFNFEAINQMIILDDREDVWRNFQDNLILLDTFKFF 733



 Score = 55.5 bits (132), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 34/125 (27%), Positives = 60/125 (48%), Gaps = 5/125 (4%)

Query: 873 GVIERLHKIFFSHQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPH--LHP-LW 929
           G++  +   F S   ++ + V++I+   +  +L  C IVFS +FP  + +P    H  + 
Sbjct: 851 GLLPAIQGEFLSRGDINTIHVKSIIDEIRSSVLMDCNIVFSGIFP-KQIDPSKLCHTRVS 909

Query: 930 QTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANE 988
           +  E FGA  +  ID   TH++    GT K   AL      VVH  W+      + + +E
Sbjct: 910 KITESFGAKISLEIDSTTTHLIFIKEGTSKAQQALKNPNIKVVHFAWLRDCVHRWEKMDE 969

Query: 989 QDFAI 993
           Q++ +
Sbjct: 970 QNYTV 974


>gi|229892336|ref|NP_001080602.1| CTD small phosphatase-like protein 2-A [Xenopus laevis]
 gi|82176945|sp|Q801R4.1|CTL2A_XENLA RecName: Full=CTD small phosphatase-like protein 2-A;
           Short=CTDSP-like 2-A
 gi|28838482|gb|AAH47962.1| Ctdspl2a protein [Xenopus laevis]
 gi|120538080|gb|AAI29525.1| Ctdspl2a protein [Xenopus laevis]
          Length = 466

 Score = 61.6 bits (148), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 57/211 (27%), Positives = 99/211 (46%), Gaps = 24/211 (11%)

Query: 630 GPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQKERTRRLEEQKKMFSARKLCLVLD 687
            PE+G   AH ++A+ +   +F+ Y   +      +++  R+     K  S  +  LVLD
Sbjct: 234 SPESGYSSAHAEAAYEEDWEVFDPYFFIKHVPPLTEEQLNRKPALPLKTRSTPEFSLVLD 293

Query: 688 LDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERA 747
           LD TL++ +                E +D       LF+     ++ +LRP    FLER 
Sbjct: 294 LDETLVHCSL--------------NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERM 339

Query: 748 SKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGV 807
           S+++E+ L+T   K+YA ++  +LDPK  L   R+    +      G+      KDL  +
Sbjct: 340 SQIYEIILFTASKKVYADKLLNILDPKKRLVRHRLFR--EHCVCVQGN----YIKDL-NI 392

Query: 808 LGME-SAVVIIDDSVRVWPHNKLNLIVVERY 837
           LG + S  +IID+S + + +   N I +E +
Sbjct: 393 LGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 423


>gi|359487040|ref|XP_002265614.2| PREDICTED: uncharacterized protein LOC100267967 [Vitis vinifera]
          Length = 522

 Score = 61.6 bits (148), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 55/184 (29%), Positives = 86/184 (46%), Gaps = 32/184 (17%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 730
           EE K+    +++ LVLDLD TL++S             L   +  D   P    F     
Sbjct: 343 EESKR----KRITLVLDLDETLVHST------------LEPCDHADFTFPV--FFNMKEH 384

Query: 731 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG---D 787
            ++ + RP +  FLER +++FE+ ++T    +YA ++  +LDP   LF+GR         
Sbjct: 385 TIYVRQRPFLQMFLERVAEMFEIIVFTASQSIYAEQLLDILDPDRKLFSGRAYRESCIFS 444

Query: 788 DGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQ 846
           DG           +KDL  VLG++ A V IID+S +V+     N I ++ +   P  R  
Sbjct: 445 DG---------SYTKDL-TVLGIDLAKVAIIDNSPQVFRLQVDNGIPIKSWFDDPSDRAL 494

Query: 847 FGLL 850
             LL
Sbjct: 495 ISLL 498


>gi|357156635|ref|XP_003577523.1| PREDICTED: CTD small phosphatase-like protein 2-like isoform 1
           [Brachypodium distachyon]
          Length = 411

 Score = 61.6 bits (148), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 55/181 (30%), Positives = 82/181 (45%), Gaps = 28/181 (15%)

Query: 674 KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMW 733
           K+  S  +  LVLDLD TL++S             L   E+ D   P    F      ++
Sbjct: 219 KQTRSCPRTTLVLDLDETLVHST------------LEPCEDSDFTFPVH--FNLREHTIY 264

Query: 734 TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD---DGD 790
            + RP +  FLER + +FE+ ++T    +YA ++  VLDPK  LF  RV        +G+
Sbjct: 265 VRCRPYLKEFLERVASMFEIIIFTASQSIYAEQLLNVLDPKRKLFRHRVYRESCVYVEGN 324

Query: 791 PFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGL 849
                      KDL  VLG + A VVI+D+S + +     N I +E +   P  +    L
Sbjct: 325 YL---------KDL-SVLGRDLARVVIVDNSPQAFGFQLENGIPIESWFDDPNDKELLAL 374

Query: 850 L 850
           L
Sbjct: 375 L 375


>gi|296090552|emb|CBI40902.3| unnamed protein product [Vitis vinifera]
          Length = 570

 Score = 61.2 bits (147), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 55/184 (29%), Positives = 86/184 (46%), Gaps = 32/184 (17%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 730
           EE K+    +++ LVLDLD TL++S             L   +  D   P    F     
Sbjct: 391 EESKR----KRITLVLDLDETLVHST------------LEPCDHADFTFPV--FFNMKEH 432

Query: 731 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG---D 787
            ++ + RP +  FLER +++FE+ ++T    +YA ++  +LDP   LF+GR         
Sbjct: 433 TIYVRQRPFLQMFLERVAEMFEIIVFTASQSIYAEQLLDILDPDRKLFSGRAYRESCIFS 492

Query: 788 DGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQ 846
           DG           +KDL  VLG++ A V IID+S +V+     N I ++ +   P  R  
Sbjct: 493 DG---------SYTKDL-TVLGIDLAKVAIIDNSPQVFRLQVDNGIPIKSWFDDPSDRAL 542

Query: 847 FGLL 850
             LL
Sbjct: 543 ISLL 546


>gi|351699228|gb|EHB02147.1| CTD small phosphatase-like protein 2 [Heterocephalus glaber]
          Length = 465

 Score = 61.2 bits (147), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 58/234 (24%), Positives = 102/234 (43%), Gaps = 24/234 (10%)

Query: 607 IKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQK 664
           I +G +      N D         PE+     H +  + +   +F+ Y+  +      ++
Sbjct: 210 INNGLEEAEATVNRDIPHLTAPVTPESDYSSIHAEVTYEEDWEIFDPYNFIKHVPPLTEQ 269

Query: 665 ERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHL 724
           +  R+     K  +  K  LVLDLD TL++ +     D  H   +              L
Sbjct: 270 QLNRKPALPLKTRAKTKFSLVLDLDETLVHCSLNELEDAAHTFPV--------------L 315

Query: 725 FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 784
           F+     ++ +LRP    FLER SK++E+ ++T   K+YA ++  +LDPK  L   R+  
Sbjct: 316 FQGVIYQVYVRLRPFFREFLERMSKMYEIIVFTAAKKVYAEKLLNILDPKKQLVRHRLFQ 375

Query: 785 RGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 837
             +      G+      KDL  +LG + S  +IID+S + + +   N I +E +
Sbjct: 376 --EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 422


>gi|348512759|ref|XP_003443910.1| PREDICTED: CTD small phosphatase-like protein 2-A-like isoform 1
           [Oreochromis niloticus]
          Length = 474

 Score = 61.2 bits (147), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 64/253 (25%), Positives = 113/253 (44%), Gaps = 44/253 (17%)

Query: 596 VSQNSPIQPGQIKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQS--------AWGDV 647
           ++ ++ I+ G+I +  DM  +            + P   P G++P S        ++ + 
Sbjct: 212 ITSDTTIEEGEIVTETDMPPL------------TAPGCMPDGSYPHSLPSVPAEPSYDED 259

Query: 648 EHLFEGYDDQQKA--AIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVH 705
             +F+ Y   +      +++ TR+     K  S  +  LVLDLD TL++ +         
Sbjct: 260 WEVFDPYFFIKHVPPLTEEQLTRKPALPLKTRSTPEFSLVLDLDETLVHCSL-------- 311

Query: 706 DEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYAT 765
                  E +D       LF+     ++ +LRP    FLER S+L+E+ L+T   K+YA 
Sbjct: 312 ------NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERMSQLYEIILFTASKKVYAD 365

Query: 766 EMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVW 824
           ++  +LDPK  L   R+    +      G+      KDL  +LG + S  VIID+S + +
Sbjct: 366 KLLNILDPKKQLVRHRLFR--EHCVCVQGN----YIKDL-NILGRDLSKTVIIDNSPQAF 418

Query: 825 PHNKLNLIVVERY 837
            +   N I +E +
Sbjct: 419 AYQLSNGIPIESW 431


>gi|117606236|ref|NP_001071012.1| CTD small phosphatase-like protein 2-A [Danio rerio]
 gi|123884286|sp|Q08BB5.1|CTL2A_DANRE RecName: Full=CTD small phosphatase-like protein 2-A;
           Short=CTDSP-like 2-A
 gi|115528634|gb|AAI24795.1| Zgc:154017 [Danio rerio]
          Length = 469

 Score = 60.8 bits (146), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 68/254 (26%), Positives = 111/254 (43%), Gaps = 36/254 (14%)

Query: 590 LTSEPMVSQNSPIQPGQIKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEH 649
           L   P+   ++ ++ G+I + ADM  +        T  GS      V   P  A G  E 
Sbjct: 203 LNPRPLPHIDTTVEEGEIVTEADMPPL--------TAVGSNSNYPDVPPSP-PAEGTYEE 253

Query: 650 LFEGYD-----DQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPV 704
            +E +D            +++ TR+     K  S  +  LVLDLD TL++ +        
Sbjct: 254 DWEVFDPYFFIKHVPPLTEEQLTRKPALPLKTRSTPEFSLVLDLDETLVHCSL------- 306

Query: 705 HDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYA 764
                   E +D       LF+     ++ +LRP    FLER S+++E+ L+T   K+YA
Sbjct: 307 -------NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERMSQIYEIILFTASKKVYA 359

Query: 765 TEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRV 823
            ++  +LDPK  L   R+    +      G+      KDL  +LG + S  VIID+S + 
Sbjct: 360 DKLLNILDPKKQLVRHRLFR--EHCVCVQGN----YIKDL-NILGRDLSKTVIIDNSPQA 412

Query: 824 WPHNKLNLIVVERY 837
           + +   N I +E +
Sbjct: 413 FAYQLSNGIPIESW 426


>gi|348512761|ref|XP_003443911.1| PREDICTED: CTD small phosphatase-like protein 2-A-like isoform 2
           [Oreochromis niloticus]
          Length = 471

 Score = 60.8 bits (146), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 64/253 (25%), Positives = 113/253 (44%), Gaps = 44/253 (17%)

Query: 596 VSQNSPIQPGQIKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQS--------AWGDV 647
           ++ ++ I+ G+I +  DM  +            + P   P G++P S        ++ + 
Sbjct: 209 ITSDTTIEEGEIVTETDMPPL------------TAPGCMPDGSYPHSLPSVPAEPSYDED 256

Query: 648 EHLFEGYDDQQKA--AIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVH 705
             +F+ Y   +      +++ TR+     K  S  +  LVLDLD TL++ +         
Sbjct: 257 WEVFDPYFFIKHVPPLTEEQLTRKPALPLKTRSTPEFSLVLDLDETLVHCSL-------- 308

Query: 706 DEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYAT 765
                  E +D       LF+     ++ +LRP    FLER S+L+E+ L+T   K+YA 
Sbjct: 309 ------NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERMSQLYEIILFTASKKVYAD 362

Query: 766 EMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVW 824
           ++  +LDPK  L   R+    +      G+      KDL  +LG + S  VIID+S + +
Sbjct: 363 KLLNILDPKKQLVRHRLFR--EHCVCVQGN----YIKDL-NILGRDLSKTVIIDNSPQAF 415

Query: 825 PHNKLNLIVVERY 837
            +   N I +E +
Sbjct: 416 AYQLSNGIPIESW 428


>gi|357156637|ref|XP_003577524.1| PREDICTED: CTD small phosphatase-like protein 2-like isoform 2
           [Brachypodium distachyon]
          Length = 443

 Score = 60.8 bits (146), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 55/181 (30%), Positives = 82/181 (45%), Gaps = 28/181 (15%)

Query: 674 KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMW 733
           K+  S  +  LVLDLD TL++S             L   E+ D   P    F      ++
Sbjct: 251 KQTRSCPRTTLVLDLDETLVHST------------LEPCEDSDFTFPVH--FNLREHTIY 296

Query: 734 TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD---DGD 790
            + RP +  FLER + +FE+ ++T    +YA ++  VLDPK  LF  RV        +G+
Sbjct: 297 VRCRPYLKEFLERVASMFEIIIFTASQSIYAEQLLNVLDPKRKLFRHRVYRESCVYVEGN 356

Query: 791 PFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGL 849
                      KDL  VLG + A VVI+D+S + +     N I +E +   P  +    L
Sbjct: 357 YL---------KDL-SVLGRDLARVVIVDNSPQAFGFQLENGIPIESWFDDPNDKELLAL 406

Query: 850 L 850
           L
Sbjct: 407 L 407


>gi|114108339|gb|AAI23380.1| Ctdspl2a protein [Xenopus laevis]
          Length = 536

 Score = 60.8 bits (146), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 59/210 (28%), Positives = 99/210 (47%), Gaps = 24/210 (11%)

Query: 631 PEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQKERTRRLEEQKKMFSARKLCLVLDL 688
           PE+G   AH ++A+ +   +F+ Y   +      +++  R+     K  S  +  LVLDL
Sbjct: 305 PESGYSSAHAEAAYEEDWEVFDPYFFIKHVPPLTEEQLNRKPALPLKTRSTPEFSLVLDL 364

Query: 689 DHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERAS 748
           D TL           VH  +    E +D       LF+     ++ +LRP    FLER S
Sbjct: 365 DETL-----------VHCSL---NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERMS 410

Query: 749 KLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVL 808
           +++E+ L+T   K+YA ++  +LDPK  L   R+    +      G+      KDL  +L
Sbjct: 411 QIYEIILFTASKKVYADKLLNILDPKKRLVRHRLFR--EHCVCVQGN----YIKDL-NIL 463

Query: 809 GME-SAVVIIDDSVRVWPHNKLNLIVVERY 837
           G + S  +IID+S + + +   N I +E +
Sbjct: 464 GRDLSKTIIIDNSPQAFAYQLSNGIPIESW 493


>gi|62078827|ref|NP_001014070.1| CTD small phosphatase-like protein 2 [Rattus norvegicus]
 gi|81883796|sp|Q5XIK8.1|CTSL2_RAT RecName: Full=CTD small phosphatase-like protein 2;
           Short=CTDSP-like 2
 gi|53734232|gb|AAH83672.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
           small phosphatase like 2 [Rattus norvegicus]
 gi|149023119|gb|EDL80013.1| similar to hypothetical protein HSPC129 [Rattus norvegicus]
          Length = 465

 Score = 60.8 bits (146), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 58/211 (27%), Positives = 98/211 (46%), Gaps = 24/211 (11%)

Query: 630 GPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQKERTRRLEEQKKMFSARKLCLVLD 687
            PE+G   AH ++ + +   +F+ Y   +      +++  R+     K  S  +  LVLD
Sbjct: 233 APESGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEEQLNRKPALPLKTRSTPEFSLVLD 292

Query: 688 LDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERA 747
           LD TL           VH  +    E +D       LF+     ++ +LRP    FLER 
Sbjct: 293 LDETL-----------VHCSL---NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERM 338

Query: 748 SKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGV 807
           S+++E+ L+T   K+YA ++  +LDPK  L   R+    +      G+      KDL  +
Sbjct: 339 SQMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR--EHCVCVQGN----YIKDL-NI 391

Query: 808 LGME-SAVVIIDDSVRVWPHNKLNLIVVERY 837
           LG + S  +IID+S + + +   N I +E +
Sbjct: 392 LGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 422


>gi|426233772|ref|XP_004010888.1| PREDICTED: CTD small phosphatase-like protein 2 [Ovis aries]
          Length = 466

 Score = 60.8 bits (146), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 62/234 (26%), Positives = 104/234 (44%), Gaps = 24/234 (10%)

Query: 607 IKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQK 664
           I +G +      N D         PE+G   AH ++ + +   +F+ Y   +      ++
Sbjct: 211 INNGLEEAEETVNRDIPPLTAPVTPESGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEE 270

Query: 665 ERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHL 724
           +  R+     K  S  +  LVLDLD TL           VH  +    E +D       L
Sbjct: 271 QLNRKPALPLKTRSTPEFSLVLDLDETL-----------VHCSL---NELEDAALTFPVL 316

Query: 725 FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 784
           F+     ++ +LRP    FLER S+++E+ L+T   K+YA ++  +LDPK  L   R+  
Sbjct: 317 FQDVIYQVYVRLRPFFREFLERMSQMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR 376

Query: 785 RGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 837
             +      G+      KDL  +LG + S  +IID+S + + +   N I +E +
Sbjct: 377 --EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 423


>gi|26390099|dbj|BAC25842.1| unnamed protein product [Mus musculus]
          Length = 351

 Score = 60.8 bits (146), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 56/210 (26%), Positives = 98/210 (46%), Gaps = 24/210 (11%)

Query: 631 PEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQKERTRRLEEQKKMFSARKLCLVLDL 688
           PE+G   AH ++ + +   +F+ Y   +      +++  R+     K  S  +  LVLDL
Sbjct: 120 PESGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEEQLNRKPALPLKTRSTPEFSLVLDL 179

Query: 689 DHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERAS 748
           D TL++ +                E +D       LF+     ++ +LRP    FLER S
Sbjct: 180 DETLVHCSL--------------NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERMS 225

Query: 749 KLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVL 808
           +++E+ L+T   K+YA ++  +LDPK  L   R+    +      G+      KDL  +L
Sbjct: 226 QMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR--EHCVCVQGN----YIKDL-NIL 278

Query: 809 GME-SAVVIIDDSVRVWPHNKLNLIVVERY 837
           G + S  +IID+S + + +   N I +E +
Sbjct: 279 GRDLSKTIIIDNSPQAFAYQLSNGIPIESW 308


>gi|126281910|ref|XP_001363358.1| PREDICTED: CTD (carboxy-terminal domain, RNA polymerase II,
           polypeptide A) small phosphatase like 2 [Monodelphis
           domestica]
          Length = 466

 Score = 60.8 bits (146), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 60/234 (25%), Positives = 105/234 (44%), Gaps = 24/234 (10%)

Query: 607 IKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQK 664
           I +G +     TN D         P++G   AH ++ + +   +F+ Y   +      ++
Sbjct: 211 INNGLEEVEEATNRDIPPLTAPVSPDSGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEE 270

Query: 665 ERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHL 724
           +  R+     K  S  +  LVLDLD TL++ +                E +D       L
Sbjct: 271 QLNRKPALPLKTRSTPEFSLVLDLDETLVHCSL--------------NELEDAALTFPVL 316

Query: 725 FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 784
           F+     ++ +LRP    FLER S+++E+ L+T   K+YA ++  +LDPK  L   R+  
Sbjct: 317 FQDVIYQVYVRLRPFFREFLERMSQIYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR 376

Query: 785 RGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 837
             +      G+      KDL  +LG + S  +IID+S + + +   N I +E +
Sbjct: 377 --EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 423


>gi|26343511|dbj|BAC35412.1| unnamed protein product [Mus musculus]
          Length = 464

 Score = 60.8 bits (146), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 58/210 (27%), Positives = 98/210 (46%), Gaps = 24/210 (11%)

Query: 631 PEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQKERTRRLEEQKKMFSARKLCLVLDL 688
           PE+G   AH ++ + +   +F+ Y   +      +++  R+     K  S  +  LVLDL
Sbjct: 233 PESGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEEQLNRKPALPLKTRSTPEFSLVLDL 292

Query: 689 DHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERAS 748
           D TL           VH  +    E +D       LF+     ++ +LRP    FLER S
Sbjct: 293 DETL-----------VHCSL---NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERMS 338

Query: 749 KLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVL 808
           +++E+ L+T   K+YA ++  +LDPK  L   R+    +      G+      KDL  +L
Sbjct: 339 QMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR--EHCVCVQGN----YIKDL-NIL 391

Query: 809 GME-SAVVIIDDSVRVWPHNKLNLIVVERY 837
           G + S  +IID+S + + +   N I +E +
Sbjct: 392 GRDLSKTIIIDNSPQAFAYQLSNGIPIESW 421


>gi|148696132|gb|EDL28079.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
           small phosphatase like 2, isoform CRA_a [Mus musculus]
          Length = 465

 Score = 60.8 bits (146), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 58/210 (27%), Positives = 98/210 (46%), Gaps = 24/210 (11%)

Query: 631 PEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQKERTRRLEEQKKMFSARKLCLVLDL 688
           PE+G   AH ++ + +   +F+ Y   +      +++  R+     K  S  +  LVLDL
Sbjct: 234 PESGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEEQLNRKPALPLKTRSTPEFSLVLDL 293

Query: 689 DHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERAS 748
           D TL           VH  +    E +D       LF+     ++ +LRP    FLER S
Sbjct: 294 DETL-----------VHCSL---NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERMS 339

Query: 749 KLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVL 808
           +++E+ L+T   K+YA ++  +LDPK  L   R+    +      G+      KDL  +L
Sbjct: 340 QMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR--EHCVCVQGN----YIKDL-NIL 392

Query: 809 GME-SAVVIIDDSVRVWPHNKLNLIVVERY 837
           G + S  +IID+S + + +   N I +E +
Sbjct: 393 GRDLSKTIIIDNSPQAFAYQLSNGIPIESW 422


>gi|30851260|gb|AAH52660.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
           small phosphatase like 2 [Mus musculus]
          Length = 465

 Score = 60.5 bits (145), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 58/210 (27%), Positives = 98/210 (46%), Gaps = 24/210 (11%)

Query: 631 PEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQKERTRRLEEQKKMFSARKLCLVLDL 688
           PE+G   AH ++ + +   +F+ Y   +      +++  R+     K  S  +  LVLDL
Sbjct: 234 PESGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEEQLNRKPALPLKTRSTPEFSLVLDL 293

Query: 689 DHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERAS 748
           D TL           VH  +    E +D       LF+     ++ +LRP    FLER S
Sbjct: 294 DETL-----------VHCSL---NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERMS 339

Query: 749 KLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVL 808
           +++E+ L+T   K+YA ++  +LDPK  L   R+    +      G+      KDL  +L
Sbjct: 340 QMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR--EHCVCVQGN----YIKDL-NIL 392

Query: 809 GME-SAVVIIDDSVRVWPHNKLNLIVVERY 837
           G + S  +IID+S + + +   N I +E +
Sbjct: 393 GRDLSKTIIIDNSPQAFAYQLSNGIPIESW 422


>gi|74190363|dbj|BAE37265.1| unnamed protein product [Mus musculus]
          Length = 465

 Score = 60.5 bits (145), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 58/210 (27%), Positives = 98/210 (46%), Gaps = 24/210 (11%)

Query: 631 PEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQKERTRRLEEQKKMFSARKLCLVLDL 688
           PE+G   AH ++ + +   +F+ Y   +      +++  R+     K  S  +  LVLDL
Sbjct: 234 PESGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEEQLNRKPALPLKTRSTPEFSLVLDL 293

Query: 689 DHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERAS 748
           D TL           VH  +    E +D       LF+     ++ +LRP    FLER S
Sbjct: 294 DETL-----------VHCSL---NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERMS 339

Query: 749 KLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVL 808
           +++E+ L+T   K+YA ++  +LDPK  L   R+    +      G+      KDL  +L
Sbjct: 340 QMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR--EHCVCVQGN----YIKDL-NIL 392

Query: 809 GME-SAVVIIDDSVRVWPHNKLNLIVVERY 837
           G + S  +IID+S + + +   N I +E +
Sbjct: 393 GRDLSKTIIIDNSPQAFAYQLSNGIPIESW 422


>gi|47059059|ref|NP_997615.1| CTD small phosphatase-like protein 2 [Mus musculus]
 gi|81873659|sp|Q8BG15.1|CTSL2_MOUSE RecName: Full=CTD small phosphatase-like protein 2;
           Short=CTDSP-like 2
 gi|26326063|dbj|BAC26775.1| unnamed protein product [Mus musculus]
 gi|26329037|dbj|BAC28257.1| unnamed protein product [Mus musculus]
 gi|26340192|dbj|BAC33759.1| unnamed protein product [Mus musculus]
 gi|26349835|dbj|BAC38557.1| unnamed protein product [Mus musculus]
 gi|148696133|gb|EDL28080.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
           small phosphatase like 2, isoform CRA_b [Mus musculus]
          Length = 465

 Score = 60.5 bits (145), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 58/210 (27%), Positives = 98/210 (46%), Gaps = 24/210 (11%)

Query: 631 PEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQKERTRRLEEQKKMFSARKLCLVLDL 688
           PE+G   AH ++ + +   +F+ Y   +      +++  R+     K  S  +  LVLDL
Sbjct: 234 PESGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEEQLNRKPALPLKTRSTPEFSLVLDL 293

Query: 689 DHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERAS 748
           D TL           VH  +    E +D       LF+     ++ +LRP    FLER S
Sbjct: 294 DETL-----------VHCSL---NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERMS 339

Query: 749 KLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVL 808
           +++E+ L+T   K+YA ++  +LDPK  L   R+    +      G+      KDL  +L
Sbjct: 340 QMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR--EHCVCVQGN----YIKDL-NIL 392

Query: 809 GME-SAVVIIDDSVRVWPHNKLNLIVVERY 837
           G + S  +IID+S + + +   N I +E +
Sbjct: 393 GRDLSKTIIIDNSPQAFAYQLSNGIPIESW 422


>gi|354471693|ref|XP_003498075.1| PREDICTED: CTD small phosphatase-like protein 2 [Cricetulus
           griseus]
          Length = 465

 Score = 60.5 bits (145), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 58/210 (27%), Positives = 98/210 (46%), Gaps = 24/210 (11%)

Query: 631 PEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQKERTRRLEEQKKMFSARKLCLVLDL 688
           PE+G   AH ++ + +   +F+ Y   +      +++  R+     K  S  +  LVLDL
Sbjct: 234 PESGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEEQLNRKPALPLKTRSTPEFSLVLDL 293

Query: 689 DHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERAS 748
           D TL           VH  +    E +D       LF+     ++ +LRP    FLER S
Sbjct: 294 DETL-----------VHCSL---NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERMS 339

Query: 749 KLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVL 808
           +++E+ L+T   K+YA ++  +LDPK  L   R+    +      G+      KDL  +L
Sbjct: 340 QMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR--EHCVCVQGN----YIKDL-NIL 392

Query: 809 GME-SAVVIIDDSVRVWPHNKLNLIVVERY 837
           G + S  +IID+S + + +   N I +E +
Sbjct: 393 GRDLSKTIIIDNSPQAFAYQLSNGIPIESW 422


>gi|294905859|ref|XP_002777694.1| hypothetical protein Pmar_PMAR016441 [Perkinsus marinus ATCC 50983]
 gi|239885585|gb|EER09510.1| hypothetical protein Pmar_PMAR016441 [Perkinsus marinus ATCC 50983]
          Length = 523

 Score = 60.5 bits (145), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 54/211 (25%), Positives = 87/211 (41%), Gaps = 55/211 (26%)

Query: 681 KLCLVLDLDHTLLNSAKFHE--------VDPVHDEILRKKEEQDREKPHRHLFRFPHMGM 732
           +L +VLDLD T++NS +  +        V P+  EI R +E      P  +L     + +
Sbjct: 63  RLDVVLDLDRTMVNSFEIRKAGRSESENVTPILQEIYRDEEGL----PELYLCVISDVKV 118

Query: 733 WTKLRPGIWTFLER--ASKLFE--MHLYTMGNKLYATEMAKVLDPKGVLFAGRVISR--- 785
            TK+RP    F+    AS  +   + +YT G++ Y   + ++LDP G L  GR++SR   
Sbjct: 119 LTKIRPHARAFIRELVASTDYGVVISIYTKGSRRYMEVVKQMLDPSGELIKGRLVSRDDE 178

Query: 786 --------------------------GDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDD 819
                                     G DG   +GD+   +S+       M    V++DD
Sbjct: 179 PSNMTPVEKDPDLIINASIESGAQVDGSDGRLCNGDKETKESE-------MRRWFVVLDD 231

Query: 820 SVRVWPH---NKLNLIVVERYTYFPCSRRQF 847
           S   WP       N++    Y +   + RQ 
Sbjct: 232 SPEAWPEELREAGNVVTANMYDFAEVNHRQL 262


>gi|387594493|gb|EIJ89517.1| hypothetical protein NEQG_00287 [Nematocida parisii ERTm3]
 gi|387596665|gb|EIJ94286.1| hypothetical protein NEPG_00953 [Nematocida parisii ERTm1]
          Length = 310

 Score = 60.5 bits (145), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 71/272 (26%), Positives = 111/272 (40%), Gaps = 59/272 (21%)

Query: 758 MGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVI 816
           MGNK YA  +A +LDP G LF  R+ISR D+   FD        KD++ +    S  VVI
Sbjct: 1   MGNKSYACSIAGLLDPTGKLFGSRIISRDDNFGCFD--------KDIKRLFPTNSKHVVI 52

Query: 817 IDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLE-----IDHDERSE------- 864
           +DD   VW     NL  +  Y YF        +  P  L+     +  D R+        
Sbjct: 53  LDDRPDVWGFVD-NLYPIRPYYYFQTD----DINSPEALQGMKSALSEDVRNSPVGEVFR 107

Query: 865 ------------------DGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQRKILA 906
                             D  L   L  ++ +H  FFS Q     D  +IL  ++++I  
Sbjct: 108 NKNDLIELIDRECILTYFDNELEKVLSGLKEVHTEFFSTQQ----DTASIL-KKKKEIFK 162

Query: 907 GCRI-VFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALS 965
           GC   +FS    V E   +L+ L++     G   +    ++VTH++    G  +  W   
Sbjct: 163 GCTAEIFS---SVNEYTLYLNLLFK--HHGGQTFSHGSVNKVTHLITTGNGRIRSVWKNK 217

Query: 966 TGRFV----VHPGWVEASALLYRRANEQDFAI 993
               +    V   W+  S   ++R +E++F +
Sbjct: 218 KDPVIDPVCVQVEWMHESIYAFKRLDEKEFIL 249


>gi|242072230|ref|XP_002446051.1| hypothetical protein SORBIDRAFT_06g001010 [Sorghum bicolor]
 gi|241937234|gb|EES10379.1| hypothetical protein SORBIDRAFT_06g001010 [Sorghum bicolor]
          Length = 447

 Score = 60.5 bits (145), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 36/104 (34%), Positives = 49/104 (47%), Gaps = 24/104 (23%)

Query: 684 LVLDLDHTLLNSAKFHEVD-----PVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRP 738
           LVLDLD TL++S   H  D     PVH                   F F    ++ + RP
Sbjct: 263 LVLDLDETLVHSTLEHCEDADFTFPVH-------------------FNFQEHTIYVRCRP 303

Query: 739 GIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 782
            +  FLER + +FE  ++T    +YA ++  VLDPK  LF  RV
Sbjct: 304 YLKEFLERVASMFETIIFTASQSIYAEQLLNVLDPKRKLFRHRV 347


>gi|410912504|ref|XP_003969729.1| PREDICTED: CTD small phosphatase-like protein 2-like [Takifugu
           rubripes]
          Length = 474

 Score = 60.5 bits (145), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 89/358 (24%), Positives = 147/358 (41%), Gaps = 59/358 (16%)

Query: 496 MDELGKVRMKPRDPRRVLHGNALQRSGSLGPEFKTDGPSAPCTQGSK------ENLNFQK 549
           ++E   V +    PR  L G          P F    P+   + GS       E     K
Sbjct: 117 LEETTAVEVPSSAPRTTLLGTIF------SPVFNFFSPAKNASSGSDSPDQAMEAEEIVK 170

Query: 550 QLGAPEAKPVLSQSVLQPDITQQFTKNLKHIADFMSVSQPL--TSEPMVSQNSPIQPGQI 607
           QL        + Q+V  P  T     +L   ++F S   PL     P V + SP+   + 
Sbjct: 171 QLD-------MEQAVEMPTSTAMSPHDLCVASNFHSSVSPLPPLRPPHVPEASPLAVEE- 222

Query: 608 KSGADMKAVVTNHDDKQTGTGSGP-----EAGPVGAHPQSAWGDVEHLFEGYDDQQKA-- 660
           +  AD+  +        T  GS P     EA      P++++ +   +F+ Y   +    
Sbjct: 223 ELDADLPPL--------TAPGSSPDMTYVEAPLAAVPPEASYEEDWEVFDPYFFIKHVPP 274

Query: 661 AIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKP 720
             +++ TR+     K  S  +  LVLDLD TL++ +                E +D    
Sbjct: 275 LTEEQLTRKPALPLKTRSTPEFSLVLDLDETLVHCSL--------------NELEDAALT 320

Query: 721 HRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAG 780
              LF+     ++ +LRP    FLER S+++E+ L+T   K+YA ++  +LDPK  L   
Sbjct: 321 FPVLFQDVIYQVYVRLRPFFREFLERMSQIYEIILFTASKKVYADKLLNILDPKKQLVRH 380

Query: 781 RVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 837
           R+    +      G+      KDL  +LG + S  +IID+S + + +   N I +E +
Sbjct: 381 RLFR--EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 431


>gi|237832707|ref|XP_002365651.1| NLI interacting factor-like phosphatase domain-containing protein
           [Toxoplasma gondii ME49]
 gi|211963315|gb|EEA98510.1| NLI interacting factor-like phosphatase domain-containing protein
           [Toxoplasma gondii ME49]
          Length = 1139

 Score = 60.5 bits (145), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 78/335 (23%), Positives = 124/335 (37%), Gaps = 67/335 (20%)

Query: 719 KPHRHLFRFP--HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 776
           +P  + F  P      + KLRP + TFL++    +EM +YT   + YA  +  +LD    
Sbjct: 664 EPELYRFELPCNRKTYYMKLRPHLRTFLKKLEPFYEMSVYTNATQEYADIVIAILDGNRQ 723

Query: 777 LFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIV-VE 835
           LF  R+++R D G   +  E     +  EG+   +  +V  DD   +W    L  +V  +
Sbjct: 724 LFQDRIVAR-DSGFRGEASENKAVRRLYEGM--DKRCIVAFDDRQNIWTDLPLTHVVKAQ 780

Query: 836 RYTYF------------PCSRRQFGLLGPSLLE--------------------IDHDERS 863
            Y +F            P S    G+  P +L                     +  D +S
Sbjct: 781 HYDFFDSHKTELNAYYPPLSNGIEGMTAPDVLGNRQNEDEGVMDPVGCGSFPPVGQDTQS 840

Query: 864 E---------------------DGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQR 902
                                 D  L   L +   LH  FF  +   + ++  IL   Q+
Sbjct: 841 TNTPVQQHLSRAAADGKKPCDWDRHLECMLKLFLHLHAEFF--KDPVNANIGAILCNFQQ 898

Query: 903 KILAGCRIVFS---RVFPVGEANPHLHPLW-QTAEQFGAVCTKHIDDQ-VTHVVANSLGT 957
           K+L+G  I F+   + F  G A         + A++ GA   K  D++ VTHVVA    T
Sbjct: 899 KVLSGVGIFFTGFRKTFSPGAAVADCEERQAELAQRLGAKVYKRFDEEGVTHVVAGKNNT 958

Query: 958 DKVNWALSTGRFV-VHPGWVEASALLYRRANEQDF 991
           + +       +   VH  W+        R  E  F
Sbjct: 959 NNMLACKENTKLARVHTLWLYCCEAALARVPESAF 993


>gi|221488107|gb|EEE26321.1| RNA polymerase II phosphatase, putative [Toxoplasma gondii GT1]
 gi|221508626|gb|EEE34195.1| RNA polymerase II phosphatase, putative [Toxoplasma gondii VEG]
          Length = 1139

 Score = 60.1 bits (144), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 78/335 (23%), Positives = 124/335 (37%), Gaps = 67/335 (20%)

Query: 719 KPHRHLFRFP--HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 776
           +P  + F  P      + KLRP + TFL++    +EM +YT   + YA  +  +LD    
Sbjct: 664 EPELYRFELPCNRKTYYMKLRPHLRTFLKKLEPFYEMSVYTNATQEYADIVIAILDGNRQ 723

Query: 777 LFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIV-VE 835
           LF  R+++R D G   +  E     +  EG+   +  +V  DD   +W    L  +V  +
Sbjct: 724 LFQDRIVAR-DSGFRGEASENKAVRRLYEGM--DKRCIVAFDDRQNIWTDLPLTHVVKAQ 780

Query: 836 RYTYF------------PCSRRQFGLLGPSLLE--------------------IDHDERS 863
            Y +F            P S    G+  P +L                     +  D +S
Sbjct: 781 HYDFFDSHKTELNAYYPPLSNGIEGMTAPDVLGNRQNEDEGVMDPVGCGSFPPVGQDTQS 840

Query: 864 E---------------------DGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQR 902
                                 D  L   L +   LH  FF  +   + ++  IL   Q+
Sbjct: 841 TNTPVQQHLSRAAADGKKPCDWDRHLECMLKLFLHLHAEFF--KDPVNANIGAILCNFQQ 898

Query: 903 KILAGCRIVFS---RVFPVGEANPHLHPLW-QTAEQFGAVCTKHIDDQ-VTHVVANSLGT 957
           K+L+G  I F+   + F  G A         + A++ GA   K  D++ VTHVVA    T
Sbjct: 899 KVLSGVGIFFTGFRKTFSPGAAVADCEERQAELAQRLGAKVYKRFDEEGVTHVVAGKNNT 958

Query: 958 DKVNWALSTGRFV-VHPGWVEASALLYRRANEQDF 991
           + +       +   VH  W+        R  E  F
Sbjct: 959 NNMLACKENTKLARVHTLWLYCCEAALARVPESAF 993


>gi|47220514|emb|CAG05540.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 473

 Score = 60.1 bits (144), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 74/277 (26%), Positives = 123/277 (44%), Gaps = 40/277 (14%)

Query: 596 VSQNSPIQPGQIKSGADMKAVVTNHDDKQTGTGSGPEAG-PVGAHPQSAWGDVEHLFEGY 654
           ++ +S I+ G+I +  DM  +        T  G  P++G P    P  A    E  +E +
Sbjct: 211 INADSSIEEGEIVAETDMPPL--------TAPGCMPDSGYPHTLPPAPAETSYEEDWEVF 262

Query: 655 D-----DQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEIL 709
           D            +++ TR+     K  S  +  LVLDLD TL           VH  + 
Sbjct: 263 DPYFFIKHVPPLTEEQLTRKPALPLKTRSTPEFSLVLDLDETL-----------VHCSL- 310

Query: 710 RKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAK 769
              E +D       LF+     ++ +LRP    FLER S+ +E+ L+T   K+YA ++  
Sbjct: 311 --NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERMSQKYEIILFTASKKVYADKLLN 368

Query: 770 VLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNK 828
           +LDP+  L   R+    +      G+      KDL  +LG + S  +IID+S + + +  
Sbjct: 369 ILDPRKQLVRHRLFR--EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQL 421

Query: 829 LNLIVVERYTYFPCSRRQFGLLGP---SLLEIDHDER 862
            N I +E + +   +  +   L P   SL+E++ D R
Sbjct: 422 SNGIPIESW-FMDRNDNELEKLVPFLESLVELNEDVR 457


>gi|401408967|ref|XP_003883932.1| hypothetical protein NCLIV_036820 [Neospora caninum Liverpool]
 gi|325118349|emb|CBZ53900.1| hypothetical protein NCLIV_036820 [Neospora caninum Liverpool]
          Length = 1149

 Score = 60.1 bits (144), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 78/335 (23%), Positives = 127/335 (37%), Gaps = 67/335 (20%)

Query: 719  KPHRHLFRFP--HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 776
            +P  + F  P      + KLRP + TFL++    +EM +YT   + YA  +  +LD    
Sbjct: 674  EPELYRFELPCNRKTYYMKLRPYLRTFLKKLEPFYEMSVYTNATQEYADIVIAILDDNRQ 733

Query: 777  LFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIV-VE 835
            LF  R+++R D G   +  E     +  EG+   +  +V  DD   +W    L  +V  +
Sbjct: 734  LFQDRIVAR-DSGFRGEASENKAVRRLYEGM--DKRCIVAFDDRQNIWTDLPLTHVVKAQ 790

Query: 836  RYTYFPCSRRQF----------------------------GLLGP----SLLEIDHDERS 863
             Y +F   + +                             G++ P    SL +   D++S
Sbjct: 791  HYDFFDSHKAELNAYYPPLSNGIGGMATADIMGNRQNEDEGMMDPVGCSSLSQAGQDQQS 850

Query: 864  E---------------------DGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQR 902
                                  D  L   L +   LH  FF  +   + ++  IL   Q+
Sbjct: 851  TNTPVQQHLSRAAAEGKKPCDWDRHLECMLKLFLHLHTEFF--KDPINANIGAILCNFQQ 908

Query: 903  KILAGCRIVFS---RVFPVGEANPHLHPLW-QTAEQFGAVCTKHIDDQ-VTHVVANSLGT 957
            K+L+G  I F+   + F  G A         + A++ GA   K  D++ VTHVVA    T
Sbjct: 909  KVLSGVGIFFTGFRKTFSPGAAVADCEERQAELAQRLGAKVYKRYDEEGVTHVVAGKNNT 968

Query: 958  DK-VNWALSTGRFVVHPGWVEASALLYRRANEQDF 991
            +  +    +T    VH  W+        R  E  F
Sbjct: 969  NNMLACKENTNLARVHTLWLYCCEAALARVPESAF 1003


>gi|221057654|ref|XP_002261335.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
            knowlesi strain H]
 gi|194247340|emb|CAQ40740.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
            knowlesi strain H]
          Length = 1389

 Score = 60.1 bits (144), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 38/133 (28%), Positives = 65/133 (48%), Gaps = 16/133 (12%)

Query: 719  KPHRHLFRFPHMGM--WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 776
            +P  + F  P+     + K RP +  FL+  S  +E+ +YT   + YA  +  +LDP   
Sbjct: 948  EPELYKFFLPYYNFFYYLKFRPYVRQFLQILSLYYELSIYTNATREYADVVIAILDPDRT 1007

Query: 777  LFAGRVISRGDDGDPFDGDERVPKSKDLEGVL-GMESAVVI-IDDSVRVW---PHNKLNL 831
            LFA R+++R +  D         ++K+   +   ++S  VI  DD   VW   PH+  N+
Sbjct: 1008 LFADRIVARCNSADR-------EENKNFSKIYPNVDSKYVIAFDDRKDVWTDIPHS--NI 1058

Query: 832  IVVERYTYFPCSR 844
            +  E Y +F  S+
Sbjct: 1059 LKAEHYNFFELSK 1071


>gi|254728746|gb|ACT79548.1| CTD phosphatase-like protein [Oryza sativa Japonica Group]
 gi|254728748|gb|ACT79549.1| CTD phosphatase-like protein [Oryza sativa Indica Group]
 gi|254728750|gb|ACT79550.1| CTD phosphatase-like protein [Oryza rufipogon]
 gi|254728752|gb|ACT79551.1| CTD phosphatase-like protein [Oryza nivara]
          Length = 462

 Score = 60.1 bits (144), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 50/155 (32%), Positives = 75/155 (48%), Gaps = 22/155 (14%)

Query: 684 LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTF 743
           LVLDLD TL++S             L   E+ D   P    F F    ++ + RP +  F
Sbjct: 278 LVLDLDETLVHST------------LEPCEDADFAFPV--YFNFREHTIYVRCRPYLKEF 323

Query: 744 LERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKD 803
           LER + LFE  ++T    +YA ++  VLDPK  LF  RV    D     +G+      KD
Sbjct: 324 LERVANLFETIIFTASQSIYAEQLLNVLDPKRKLFRHRVYR--DSCVYVEGNYL----KD 377

Query: 804 LEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 837
           L  VLG + + ++I+D+S + +     N I +E +
Sbjct: 378 L-TVLGRDLTRIMIVDNSPQAFGFQLDNGIPIESW 411


>gi|167384602|ref|XP_001737021.1| RNA polymerase II ctd phosphatase [Entamoeba dispar SAW760]
 gi|165900378|gb|EDR26711.1| RNA polymerase II ctd phosphatase, putative [Entamoeba dispar
           SAW760]
          Length = 429

 Score = 60.1 bits (144), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 80/301 (26%), Positives = 126/301 (41%), Gaps = 62/301 (20%)

Query: 641 QSAWGDVEHLFEGYDDQQKAAIQKERTRRL-EEQKKMFSAR-----KLCLVLDLDHTLLN 694
           Q+   D   L E  DD  + +     T+   EEQK+  S R     KL L+LDLD T++ 
Sbjct: 15  QNYCVDCYQLIEDVDDYIRTSGGYGITKSYAEEQKRSVSERLLKEKKLSLILDLDGTIVF 74

Query: 695 SAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMG--MWTKLRPGIWTFLERASKLFE 752
           +     V P+ +E      E+         F  P     +  K R GI TF+E+ SKL++
Sbjct: 75  TNPELCV-PLENE------EEPITPEQGFYFEIPEQNAKVLIKFRDGIVTFMEKVSKLYD 127

Query: 753 MHLYTMGNKLYATEMAKVLDP-------KGVLFAGRVISR---GDDGDPFDG-------D 795
           +H+ T+G K YA  +   ++         G L      S     D+ D  DG       +
Sbjct: 128 IHVVTLGQKEYAFAIVNAINKLRDTPFITGDLVTAEDCSSVIVCDEKDTNDGLIDREETN 187

Query: 796 ERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLL 855
           ER    + +   +G E   VI+DD + VW +   N++ +  Y              PS  
Sbjct: 188 ERRSVKRSI-PTMGKEEMQVIVDDRIDVWDNK--NVVQICEYV-------------PSTN 231

Query: 856 EIDHDERSEDGTLASSLGVIERLHKIF--FSHQSLDDVDVRNILAAEQRKILAGCRIVFS 913
           ++D             L V E L  I+  F  + ++  DV+ IL + ++KIL    + F+
Sbjct: 232 QVD----------TELLRVTEVLQNIYNKFYDEHIE--DVKEILHSFRKKILENKNLYFN 279

Query: 914 R 914
           R
Sbjct: 280 R 280


>gi|348685327|gb|EGZ25142.1| hypothetical protein PHYSODRAFT_311755 [Phytophthora sojae]
          Length = 257

 Score = 60.1 bits (144), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 54/164 (32%), Positives = 80/164 (48%), Gaps = 28/164 (17%)

Query: 678 SARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHL---FRFPHMGMWT 734
           +A K+CLVLDLD TL++ +    VD V             + PH      F      +  
Sbjct: 72  NAPKICLVLDLDETLVHCS----VDEV-------------KNPHMQFPVTFNGVEYTVNV 114

Query: 735 KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDG 794
           K RP +  FL+R SKLFE+ ++T  +K+YA ++  +LDP       R + R D  D F  
Sbjct: 115 KKRPHLEYFLKRVSKLFEIVVFTASHKVYAEKLMNMLDPNRNFIKYR-LYREDCLDVFGN 173

Query: 795 DERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 837
                  KDL  VLG + S VV++D+S   + +   N I +E +
Sbjct: 174 -----YLKDL-NVLGRDLSKVVLVDNSPHAFGYQVNNGIPIETW 211


>gi|254728754|gb|ACT79552.1| CTD phosphatase-like protein [Oryza glaberrima]
          Length = 462

 Score = 60.1 bits (144), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 50/155 (32%), Positives = 75/155 (48%), Gaps = 22/155 (14%)

Query: 684 LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTF 743
           LVLDLD TL++S             L   E+ D   P    F F    ++ + RP +  F
Sbjct: 278 LVLDLDETLVHST------------LEPCEDADFAFPV--YFNFREHTIYVRCRPYLKEF 323

Query: 744 LERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKD 803
           LER + LFE  ++T    +YA ++  VLDPK  LF  RV    D     +G+      KD
Sbjct: 324 LERVANLFETIIFTASQSIYAEQLLNVLDPKRKLFRHRVYR--DSCVYVEGNYL----KD 377

Query: 804 LEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 837
           L  VLG + + ++I+D+S + +     N I +E +
Sbjct: 378 L-TVLGRDLTRIMIVDNSPQAFGFQLDNGIPIESW 411


>gi|399215917|emb|CCF72605.1| unnamed protein product [Babesia microti strain RI]
          Length = 664

 Score = 59.7 bits (143), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 79/295 (26%), Positives = 120/295 (40%), Gaps = 64/295 (21%)

Query: 720 PHRHLFRFPH---MGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 776
           P  + F  P    +  + KLRP +  FL   S  +EM +YT   + YA  +  +LDP   
Sbjct: 212 PELYTFTLPSYADVSYYLKLRPRLREFLHILSFYYEMSIYTNATREYADVVIAILDPDRS 271

Query: 777 LFAGRVISRGDDGDPFDGDER--VPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIV- 833
           LF  R+I+RG       G++R     ++ L   L  +  VV  DD   VW     N ++ 
Sbjct: 272 LFMDRIIARG------GGNDRGLTKSARRLYPKLS-QRFVVSFDDRRDVWTDIDPNQVLK 324

Query: 834 VERYTYF------------------PCSRRQFGLLGPSLLEIDHDERSEDGTL--ASSLG 873
              Y+YF                  P S  +  L G S +     + S+ G +  +SS+G
Sbjct: 325 AHHYSYFMENLPQNIGKFKTDIPAMPNSTSKTSLDGKSQINGLTLDLSQSGAINDSSSVG 384

Query: 874 ---------------------VIERLHKIFFSHQSLDDVDVRNILAAEQRKILAGCRIVF 912
                                V   LHK F+S+      +V  ILA  Q++ L+G  I F
Sbjct: 385 CNSSVNNPLQDFDMHLMYMTKVFLELHKRFYSNPL--KANVGTILAEMQKETLSGVGIFF 442

Query: 913 S------RVFPVGEANPHLHPLWQTAEQFGA-VCTKHIDDQVTHVVANSLGTDKV 960
           +      +    G  +  +    + A +FGA V  K +D + THVVA    TD +
Sbjct: 443 TGFRKNVKNAVSGWTDCEVRQK-EMALEFGAHVVDKLVDKKTTHVVAAKNCTDNL 496


>gi|224116454|ref|XP_002317305.1| predicted protein [Populus trichocarpa]
 gi|222860370|gb|EEE97917.1| predicted protein [Populus trichocarpa]
          Length = 377

 Score = 59.7 bits (143), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 45/149 (30%), Positives = 71/149 (47%), Gaps = 28/149 (18%)

Query: 680 RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPG 739
           + + LVLDLD TL++S   H                D +      F      ++ K RP 
Sbjct: 203 KSITLVLDLDETLVHSTLEHC--------------DDADFTFTVFFNMKEHTVYVKQRPH 248

Query: 740 IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG---DDGDPFDGDE 796
           + TFLER +++FE+ ++T    +YA ++  +LDP   L + R+        DG       
Sbjct: 249 VHTFLERVAEMFEVVIFTASQSIYAAQLLDMLDPDRKLISRRIYRESCIFSDG------- 301

Query: 797 RVPKSKDLEGVLGMESA-VVIIDDSVRVW 824
               +KDL  VLG++ A V IID+S +V+
Sbjct: 302 --SYTKDL-TVLGVDLAKVAIIDNSPQVF 327


>gi|330864811|ref|NP_001178334.1| CTD small phosphatase-like protein 2 [Bos taurus]
 gi|296482877|tpg|DAA24992.1| TPA: CTD (carboxy-terminal domain, RNA polymerase II, polypeptide
           A) small phosphatase like 2 [Bos taurus]
 gi|440911957|gb|ELR61572.1| CTD small phosphatase-like protein 2 [Bos grunniens mutus]
          Length = 466

 Score = 59.7 bits (143), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 61/234 (26%), Positives = 104/234 (44%), Gaps = 24/234 (10%)

Query: 607 IKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQK 664
           I +G +      N D         P++G   AH ++ + +   +F+ Y   +      ++
Sbjct: 211 INNGLEEAEETVNRDIPPLTAPVTPDSGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEE 270

Query: 665 ERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHL 724
           +  R+     K  S  +  LVLDLD TL           VH  +    E +D       L
Sbjct: 271 QLNRKPALPLKTRSTPEFSLVLDLDETL-----------VHCSL---NELEDAALTFPVL 316

Query: 725 FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 784
           F+     ++ +LRP    FLER S+++E+ L+T   K+YA ++  +LDPK  L   R+  
Sbjct: 317 FQDVIYQVYVRLRPFFREFLERMSQMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR 376

Query: 785 RGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 837
             +      G+      KDL  +LG + S  +IID+S + + +   N I +E +
Sbjct: 377 --EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 423


>gi|125557643|gb|EAZ03179.1| hypothetical protein OsI_25332 [Oryza sativa Indica Group]
 gi|125599502|gb|EAZ39078.1| hypothetical protein OsJ_23510 [Oryza sativa Japonica Group]
          Length = 461

 Score = 59.3 bits (142), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 50/155 (32%), Positives = 75/155 (48%), Gaps = 22/155 (14%)

Query: 684 LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTF 743
           LVLDLD TL++S             L   E+ D   P    F F    ++ + RP +  F
Sbjct: 278 LVLDLDETLVHST------------LEPCEDADFAFPV--YFNFREHTIYVRCRPYLKEF 323

Query: 744 LERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKD 803
           LER + LFE  ++T    +YA ++  VLDPK  LF  RV    D     +G+      KD
Sbjct: 324 LERVANLFETIIFTASQSIYAEQLLNVLDPKRKLFRHRVYR--DSCVYVEGNYL----KD 377

Query: 804 LEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 837
           L  VLG + + ++I+D+S + +     N I +E +
Sbjct: 378 L-TVLGRDLTRIMIVDNSPQAFGFQLDNGIPIESW 411


>gi|410961377|ref|XP_003987259.1| PREDICTED: CTD small phosphatase-like protein 2 isoform 1 [Felis
           catus]
 gi|410961379|ref|XP_003987260.1| PREDICTED: CTD small phosphatase-like protein 2 isoform 2 [Felis
           catus]
          Length = 466

 Score = 59.3 bits (142), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 61/234 (26%), Positives = 104/234 (44%), Gaps = 24/234 (10%)

Query: 607 IKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQK 664
           I +G +      N D         P++G   AH ++ + +   +F+ Y   +      ++
Sbjct: 211 INNGLEEAEETVNRDIPPLTAPVTPDSGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEE 270

Query: 665 ERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHL 724
           +  R+     K  S  +  LVLDLD TL           VH  +    E +D       L
Sbjct: 271 QLNRKPALPLKTRSTPEFSLVLDLDETL-----------VHCSL---NELEDAALTFPVL 316

Query: 725 FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 784
           F+     ++ +LRP    FLER S+++E+ L+T   K+YA ++  +LDPK  L   R+  
Sbjct: 317 FQDVIYQVYVRLRPFFREFLERMSQMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR 376

Query: 785 RGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 837
             +      G+      KDL  +LG + S  +IID+S + + +   N I +E +
Sbjct: 377 --EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 423


>gi|149692003|ref|XP_001502897.1| PREDICTED: CTD (carboxy-terminal domain, RNA polymerase II,
           polypeptide A) small phosphatase like 2 isoform 2 [Equus
           caballus]
 gi|149692005|ref|XP_001502892.1| PREDICTED: CTD (carboxy-terminal domain, RNA polymerase II,
           polypeptide A) small phosphatase like 2 isoform 1 [Equus
           caballus]
          Length = 466

 Score = 59.3 bits (142), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 61/234 (26%), Positives = 104/234 (44%), Gaps = 24/234 (10%)

Query: 607 IKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQK 664
           I +G +      N D         P++G   AH ++ + +   +F+ Y   +      ++
Sbjct: 211 INNGLEEAEETVNRDIPPLTAPVTPDSGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEE 270

Query: 665 ERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHL 724
           +  R+     K  S  +  LVLDLD TL           VH  +    E +D       L
Sbjct: 271 QLNRKPALPLKTRSTPEFSLVLDLDETL-----------VHCSL---NELEDAALTFPVL 316

Query: 725 FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 784
           F+     ++ +LRP    FLER S+++E+ L+T   K+YA ++  +LDPK  L   R+  
Sbjct: 317 FQDVIYQVYVRLRPFFREFLERMSQMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR 376

Query: 785 RGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 837
             +      G+      KDL  +LG + S  +IID+S + + +   N I +E +
Sbjct: 377 --EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 423


>gi|82541597|ref|XP_725029.1| NLI interacting factor [Plasmodium yoelii yoelii 17XNL]
 gi|23479881|gb|EAA16594.1| NLI interacting factor, putative [Plasmodium yoelii yoelii]
          Length = 1177

 Score = 59.3 bits (142), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 68/265 (25%), Positives = 112/265 (42%), Gaps = 37/265 (13%)

Query: 716  DREKPHRHLFRFPHMGM--WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDP 773
            + ++P  + F  P+     + K RP +  FLE  S  +E+ +YT   + YA  +  +LDP
Sbjct: 759  ENDEPELYKFFLPYYNFFYYLKFRPYVRQFLEILSLYYELSIYTNATREYADVVIAILDP 818

Query: 774  KGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVL-GMESAVVI-IDDSVRVW---PHNK 828
               +FA R+++R       D DE    +K  E +   ++   VI  DD   VW   PH+ 
Sbjct: 819  DRTIFADRIVAR---CSSVDRDE----NKHFEKIYPNVDPKYVIAFDDRKDVWFDIPHS- 870

Query: 829  LNLIVVERYTYFPCSRRQF--GLLGPSLLE---IDHDERSEDGTLASSLGVIERLHKIFF 883
             +++  E Y +F  S+         PS  +   +D D       L   + +  ++HK FF
Sbjct: 871  -HILRAEHYNFFELSKYDIISHFKEPSTCKKRFVDMDMH-----LHYMIKIFLKIHKQFF 924

Query: 884  SHQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAE-------QFG 936
             +    +VDV  I+       L+   + F+  F     N     L    E       + G
Sbjct: 925  ENPL--NVDVGKIIDNIMLSTLSNVGLYFTG-FRKNSKNLQNVILSDCEERQKEIALELG 981

Query: 937  AVCTKHIDD-QVTHVVANSLGTDKV 960
            A    + DD  VTH++A    TD +
Sbjct: 982  AKIYSNYDDPGVTHIIAAKNCTDNL 1006


>gi|209156270|gb|ACI34367.1| Carboxy-terminal domain RNA polymerase II polypeptide A small
           phosphatase 2 [Salmo salar]
          Length = 367

 Score = 59.3 bits (142), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 48/156 (30%), Positives = 80/156 (51%), Gaps = 22/156 (14%)

Query: 684 LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTF 743
           LVLDLD TL+ S+    ++ +          +D E   R  F+     ++  LRP +  F
Sbjct: 191 LVLDLDETLMYSS----LNVI----------EDAEYTFRTCFQDNPYKVYVILRPYVKEF 236

Query: 744 LERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKD 803
           LE  +K FEM +YT   K YA ++  +LDPK  LF  R+  +  D     G       KD
Sbjct: 237 LEAMTKHFEMFVYTSAKKEYAEKILDILDPKRRLFRHRLYQQ--DCACVLGH----YVKD 290

Query: 804 LEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERYT 838
           L GVL  + A  V++D++   +P++ +N++ ++ ++
Sbjct: 291 L-GVLERDLAKTVVLDNAPHTYPYHLMNVLPIKSWS 325


>gi|6841354|gb|AAF29030.1|AF161543_1 HSPC058 [Homo sapiens]
          Length = 352

 Score = 59.3 bits (142), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 58/234 (24%), Positives = 104/234 (44%), Gaps = 24/234 (10%)

Query: 607 IKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQK 664
           + +G +      N D         P++G   AH ++ + +   +F+ Y   +      ++
Sbjct: 97  LNNGLEEAEETVNRDIPTLTAPVTPDSGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEE 156

Query: 665 ERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHL 724
           +  R+     K  S  +  LVLDLD TL++ +                E +D       L
Sbjct: 157 QLNRKPALPLKTRSTPEFSLVLDLDETLVHCSL--------------NELEDAALTFPVL 202

Query: 725 FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 784
           F+     ++ +LRP    FLER S+++E+ L+T   K+YA ++  +LDPK  L   R+  
Sbjct: 203 FQDVIYQVYVRLRPFFREFLERMSQMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR 262

Query: 785 RGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 837
             +      G+      KDL  +LG + S  +IID+S + + +   N I +E +
Sbjct: 263 --EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 309


>gi|332235387|ref|XP_003266885.1| PREDICTED: CTD small phosphatase-like protein 2 isoform 1 [Nomascus
           leucogenys]
 gi|332235389|ref|XP_003266886.1| PREDICTED: CTD small phosphatase-like protein 2 isoform 2 [Nomascus
           leucogenys]
          Length = 466

 Score = 59.3 bits (142), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 61/234 (26%), Positives = 104/234 (44%), Gaps = 24/234 (10%)

Query: 607 IKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQK 664
           I +G +      N D         P++G   AH ++ + +   +F+ Y   +      ++
Sbjct: 211 INNGLEEAEETVNRDIPPLTAPVTPDSGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEE 270

Query: 665 ERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHL 724
           +  R+     K  S  +  LVLDLD TL           VH  +    E +D       L
Sbjct: 271 QLNRKPALPLKTRSTPEFSLVLDLDETL-----------VHCSL---NELEDAALTFPVL 316

Query: 725 FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 784
           F+     ++ +LRP    FLER S+++E+ L+T   K+YA ++  +LDPK  L   R+  
Sbjct: 317 FQDVIYQVYVRLRPFFREFLERMSQMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR 376

Query: 785 RGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 837
             +      G+      KDL  +LG + S  +IID+S + + +   N I +E +
Sbjct: 377 --EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 423


>gi|388453109|ref|NP_001253738.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
           small phosphatase like 2 [Macaca mulatta]
 gi|114656732|ref|XP_001161756.1| PREDICTED: CTD (carboxy-terminal domain, RNA polymerase II,
           polypeptide A) small phosphatase like 2 isoform 3 [Pan
           troglodytes]
 gi|114656734|ref|XP_001161793.1| PREDICTED: CTD (carboxy-terminal domain, RNA polymerase II,
           polypeptide A) small phosphatase like 2 isoform 4 [Pan
           troglodytes]
 gi|297696523|ref|XP_002825440.1| PREDICTED: CTD (carboxy-terminal domain, RNA polymerase II,
           polypeptide A) small phosphatase like 2 isoform 1 [Pongo
           abelii]
 gi|395746659|ref|XP_003778487.1| PREDICTED: CTD (carboxy-terminal domain, RNA polymerase II,
           polypeptide A) small phosphatase like 2 isoform 2 [Pongo
           abelii]
 gi|380813572|gb|AFE78660.1| CTD small phosphatase-like protein 2 [Macaca mulatta]
 gi|383419005|gb|AFH32716.1| CTD small phosphatase-like protein 2 [Macaca mulatta]
 gi|384947558|gb|AFI37384.1| CTD small phosphatase-like protein 2 [Macaca mulatta]
 gi|410206686|gb|JAA00562.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
           small phosphatase like 2 [Pan troglodytes]
 gi|410253512|gb|JAA14723.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
           small phosphatase like 2 [Pan troglodytes]
 gi|410302524|gb|JAA29862.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
           small phosphatase like 2 [Pan troglodytes]
 gi|410341327|gb|JAA39610.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
           small phosphatase like 2 [Pan troglodytes]
          Length = 466

 Score = 59.3 bits (142), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 61/234 (26%), Positives = 104/234 (44%), Gaps = 24/234 (10%)

Query: 607 IKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQK 664
           I +G +      N D         P++G   AH ++ + +   +F+ Y   +      ++
Sbjct: 211 INNGLEEAEETVNRDIPPLTAPVTPDSGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEE 270

Query: 665 ERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHL 724
           +  R+     K  S  +  LVLDLD TL           VH  +    E +D       L
Sbjct: 271 QLNRKPALPLKTRSTPEFSLVLDLDETL-----------VHCSL---NELEDAALTFPVL 316

Query: 725 FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 784
           F+     ++ +LRP    FLER S+++E+ L+T   K+YA ++  +LDPK  L   R+  
Sbjct: 317 FQDVIYQVYVRLRPFFREFLERMSQMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR 376

Query: 785 RGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 837
             +      G+      KDL  +LG + S  +IID+S + + +   N I +E +
Sbjct: 377 --EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 423


>gi|397480304|ref|XP_003811426.1| PREDICTED: CTD small phosphatase-like protein 2 isoform 1 [Pan
           paniscus]
 gi|397480306|ref|XP_003811427.1| PREDICTED: CTD small phosphatase-like protein 2 isoform 2 [Pan
           paniscus]
          Length = 466

 Score = 59.3 bits (142), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 61/234 (26%), Positives = 104/234 (44%), Gaps = 24/234 (10%)

Query: 607 IKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQK 664
           I +G +      N D         P++G   AH ++ + +   +F+ Y   +      ++
Sbjct: 211 INNGLEEAEETVNRDIPPLTAPVTPDSGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEE 270

Query: 665 ERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHL 724
           +  R+     K  S  +  LVLDLD TL           VH  +    E +D       L
Sbjct: 271 QLNRKPALPLKTRSTPEFSLVLDLDETL-----------VHCSL---NELEDAALTFPVL 316

Query: 725 FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 784
           F+     ++ +LRP    FLER S+++E+ L+T   K+YA ++  +LDPK  L   R+  
Sbjct: 317 FQDVIYQVYVRLRPFFREFLERMSQMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR 376

Query: 785 RGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 837
             +      G+      KDL  +LG + S  +IID+S + + +   N I +E +
Sbjct: 377 --EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 423


>gi|403274413|ref|XP_003928971.1| PREDICTED: CTD small phosphatase-like protein 2 isoform 1 [Saimiri
           boliviensis boliviensis]
 gi|403274415|ref|XP_003928972.1| PREDICTED: CTD small phosphatase-like protein 2 isoform 2 [Saimiri
           boliviensis boliviensis]
          Length = 466

 Score = 59.3 bits (142), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 61/236 (25%), Positives = 104/236 (44%), Gaps = 24/236 (10%)

Query: 605 GQIKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AI 662
             I +G +      N D         P++G   AH ++ + +   +F+ Y   +      
Sbjct: 209 ASINNGLEEAEETVNRDIPPLTAPVTPDSGYSSAHAEATYEEDWEVFDPYYFIKHVPPLT 268

Query: 663 QKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR 722
           +++  R+     K  S  +  LVLDLD TL           VH  +    E +D      
Sbjct: 269 EEQLNRKPALPLKTRSTPEFSLVLDLDETL-----------VHCSL---NELEDAALTFP 314

Query: 723 HLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 782
            LF+     ++ +LRP    FLER S+++E+ L+T   K+YA ++  +LDPK  L   R+
Sbjct: 315 VLFQDVIYQVYVRLRPFFREFLERMSQMYEIILFTASKKVYADKLLNILDPKKQLVRHRL 374

Query: 783 ISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 837
               +      G+      KDL  +LG + S  +IID+S + + +   N I +E +
Sbjct: 375 FR--EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 423


>gi|417401418|gb|JAA47595.1| Putative ctd carboxy-terminal domain rna polymer [Desmodus
           rotundus]
          Length = 466

 Score = 59.3 bits (142), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 61/234 (26%), Positives = 104/234 (44%), Gaps = 24/234 (10%)

Query: 607 IKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQK 664
           I +G +      N D         P++G   AH ++ + +   +F+ Y   +      ++
Sbjct: 211 INNGLEEAEETVNRDIPPLTAPVTPDSGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEE 270

Query: 665 ERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHL 724
           +  R+     K  S  +  LVLDLD TL           VH  +    E +D       L
Sbjct: 271 QLNRKPALPLKTRSTPEFSLVLDLDETL-----------VHCSL---NELEDAALTFPVL 316

Query: 725 FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 784
           F+     ++ +LRP    FLER S+++E+ L+T   K+YA ++  +LDPK  L   R+  
Sbjct: 317 FQDVIYQVYVRLRPFFREFLERMSQMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR 376

Query: 785 RGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 837
             +      G+      KDL  +LG + S  +IID+S + + +   N I +E +
Sbjct: 377 --EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 423


>gi|57108473|ref|XP_544655.1| PREDICTED: CTD (carboxy-terminal domain, RNA polymerase II,
           polypeptide A) small phosphatase like 2 isoform 1 [Canis
           lupus familiaris]
 gi|73999941|ref|XP_860654.1| PREDICTED: CTD (carboxy-terminal domain, RNA polymerase II,
           polypeptide A) small phosphatase like 2 isoform 4 [Canis
           lupus familiaris]
          Length = 466

 Score = 59.3 bits (142), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 61/234 (26%), Positives = 104/234 (44%), Gaps = 24/234 (10%)

Query: 607 IKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQK 664
           I +G +      N D         P++G   AH ++ + +   +F+ Y   +      ++
Sbjct: 211 INNGLEEAEETVNRDIPPLTAPVTPDSGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEE 270

Query: 665 ERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHL 724
           +  R+     K  S  +  LVLDLD TL           VH  +    E +D       L
Sbjct: 271 QLNRKPALPLKTRSTPEFSLVLDLDETL-----------VHCSL---NELEDAALTFPVL 316

Query: 725 FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 784
           F+     ++ +LRP    FLER S+++E+ L+T   K+YA ++  +LDPK  L   R+  
Sbjct: 317 FQDVIYQVYVRLRPFFREFLERMSQMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR 376

Query: 785 RGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 837
             +      G+      KDL  +LG + S  +IID+S + + +   N I +E +
Sbjct: 377 --EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 423


>gi|410908573|ref|XP_003967765.1| PREDICTED: CTD small phosphatase-like protein 2-A-like [Takifugu
           rubripes]
          Length = 474

 Score = 59.3 bits (142), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 71/277 (25%), Positives = 121/277 (43%), Gaps = 40/277 (14%)

Query: 596 VSQNSPIQPGQIKSGADMKAVVTNHDDKQTGTGSGPEAG-PVGAHPQSAWGDVEHLFEGY 654
           +S +S ++ G+I +  DM  +        T  G  P+ G P    P  A    E  +E +
Sbjct: 212 ISSDSTVEEGEIVTETDMPPL--------TAPGCMPDGGYPHMLPPAPAETSYEEDWEVF 263

Query: 655 D-----DQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEIL 709
           D            +++ TR+     K  S  +  LVLDLD TL++ +             
Sbjct: 264 DPYFFIKHVPPLTEEQLTRKPALPLKTRSTPEFSLVLDLDETLVHCSL------------ 311

Query: 710 RKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAK 769
              E +D       LF+     ++ +LRP    FLER  + +E+ L+T   K+YA ++  
Sbjct: 312 --NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERMCQKYEIILFTASKKVYADKLLN 369

Query: 770 VLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNK 828
           +LDP+  L   R+    +      G+      KDL  +LG + S  +IID+S + + +  
Sbjct: 370 ILDPRKQLVRHRLFR--EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQL 422

Query: 829 LNLIVVERYTYFPCSRRQFGLLGP---SLLEIDHDER 862
            N I +E + +   +  +   L P   SL+E++ D R
Sbjct: 423 SNGIPIESW-FMDRNDNELEKLVPFLESLVELNEDVR 458


>gi|356566193|ref|XP_003551319.1| PREDICTED: CTD small phosphatase-like protein 2-like [Glycine max]
          Length = 403

 Score = 58.9 bits (141), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 56/174 (32%), Positives = 80/174 (45%), Gaps = 28/174 (16%)

Query: 668 RRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRF 727
           RRL   K+  S     LVLDLD TL++S             L   E+ D   P    F  
Sbjct: 198 RRLLLPKQTRSCPSTTLVLDLDETLVHST------------LEHCEDVDFTFPVN--FNS 243

Query: 728 PHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD 787
               ++ + RP +  FLER S LFE+ ++T    +YA ++  VLDPK  +F  RV     
Sbjct: 244 EEHIVYVRCRPHLKDFLERVSGLFEIIIFTASQSIYAEQLLNVLDPKRKIFRHRVYRESC 303

Query: 788 ---DGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERY 837
              +G+           KDL  VLG + A V+IID+S + +     N I +E +
Sbjct: 304 VYVEGNYL---------KDLT-VLGRDLAHVIIIDNSPQAFGFQVDNGIPIESW 347


>gi|402874166|ref|XP_003900915.1| PREDICTED: CTD small phosphatase-like protein 2 isoform 1 [Papio
           anubis]
 gi|402874168|ref|XP_003900916.1| PREDICTED: CTD small phosphatase-like protein 2 isoform 2 [Papio
           anubis]
          Length = 466

 Score = 58.9 bits (141), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 61/234 (26%), Positives = 104/234 (44%), Gaps = 24/234 (10%)

Query: 607 IKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQK 664
           I +G +      N D         P++G   AH ++ + +   +F+ Y   +      ++
Sbjct: 211 INNGLEEAEETVNRDIPPLTAPVTPDSGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEE 270

Query: 665 ERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHL 724
           +  R+     K  S  +  LVLDLD TL           VH  +    E +D       L
Sbjct: 271 QLNRKPALPLKTRSTPEFSLVLDLDETL-----------VHCSL---NELEDAALTFPVL 316

Query: 725 FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 784
           F+     ++ +LRP    FLER S+++E+ L+T   K+YA ++  +LDPK  L   R+  
Sbjct: 317 FQDVIYQVYVRLRPFFREFLERMSQMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR 376

Query: 785 RGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 837
             +      G+      KDL  +LG + S  +IID+S + + +   N I +E +
Sbjct: 377 --EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 423


>gi|291403116|ref|XP_002717973.1| PREDICTED: CTD (carboxy-terminal domain, RNA polymerase II,
           polypeptide A) small phosphatase like 2 [Oryctolagus
           cuniculus]
          Length = 286

 Score = 58.9 bits (141), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 56/210 (26%), Positives = 98/210 (46%), Gaps = 24/210 (11%)

Query: 631 PEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQKERTRRLEEQKKMFSARKLCLVLDL 688
           PE+G   AH ++ + +   +F+ Y   +      +++  R+     K  S  +  LVLDL
Sbjct: 55  PESGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEEQLNRKPALPLKTRSTPEFSLVLDL 114

Query: 689 DHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERAS 748
           D TL++ +                E +D       LF+     ++ +LRP    FLER S
Sbjct: 115 DETLVHCSL--------------NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERMS 160

Query: 749 KLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVL 808
           +++E+ L+T   K+YA ++  +LDPK  L   R+    +      G+      KDL  +L
Sbjct: 161 QMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR--EHCVCVQGN----YIKDL-NIL 213

Query: 809 GME-SAVVIIDDSVRVWPHNKLNLIVVERY 837
           G + S  +IID+S + + +   N I +E +
Sbjct: 214 GRDLSKTIIIDNSPQAFAYQLSNGIPIESW 243


>gi|296213856|ref|XP_002753450.1| PREDICTED: CTD small phosphatase-like protein 2 isoform 3
           [Callithrix jacchus]
          Length = 466

 Score = 58.9 bits (141), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 61/236 (25%), Positives = 104/236 (44%), Gaps = 24/236 (10%)

Query: 605 GQIKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AI 662
             I +G +      N D         P++G   AH ++ + +   +F+ Y   +      
Sbjct: 209 ASINNGLEEAEETVNRDIPPLTAPVTPDSGYSSAHAEATYEEDWEVFDPYYFIKHVPPLT 268

Query: 663 QKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR 722
           +++  R+     K  S  +  LVLDLD TL           VH  +    E +D      
Sbjct: 269 EEQLNRKPALPLKTRSTPEFSLVLDLDETL-----------VHCSL---NELEDAALTFP 314

Query: 723 HLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 782
            LF+     ++ +LRP    FLER S+++E+ L+T   K+YA ++  +LDPK  L   R+
Sbjct: 315 VLFQDVIYQVYVRLRPFFREFLERMSQMYEIILFTASKKVYADKLLNILDPKKQLVRHRL 374

Query: 783 ISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 837
               +      G+      KDL  +LG + S  +IID+S + + +   N I +E +
Sbjct: 375 FR--EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 423


>gi|294915786|ref|XP_002778342.1| hypothetical protein Pmar_PMAR015889 [Perkinsus marinus ATCC 50983]
 gi|239886620|gb|EER10137.1| hypothetical protein Pmar_PMAR015889 [Perkinsus marinus ATCC 50983]
          Length = 278

 Score = 58.9 bits (141), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 53/204 (25%), Positives = 84/204 (41%), Gaps = 41/204 (20%)

Query: 681 KLCLVLDLDHTLLNSAKFHE--------VDPVHDEILRKKEEQDREKPHRHLFRFPHMGM 732
           +L +VLDLD T++NS +  +        V P+  EI R ++      P  +L     + +
Sbjct: 45  RLDVVLDLDRTMVNSFEIRKAGRSESENVTPILQEIYRDEQGL----PELYLCVISDVKV 100

Query: 733 WTKLRPGIWTFLERASKLFE----MHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD 788
            TK+RP    F+       E    + +YT G++ Y   + ++LDP G L  GR++SR D+
Sbjct: 101 LTKIRPHARAFIRELVASTEYGVVISIYTKGSRRYMEVVKQMLDPSGELIKGRLVSRDDE 160

Query: 789 ---------------------GDPFDG-DERVPKSKDLEGVLGMESAVVIIDDSVRVWP- 825
                                G  FDG D R+           M    V++DDS   WP 
Sbjct: 161 PSNMTPVEKDPDLIINASVESGAQFDGSDGRLCNGDKETKESEMRRWFVVLDDSPEAWPE 220

Query: 826 --HNKLNLIVVERYTYFPCSRRQF 847
                 N++    Y +   + RQ 
Sbjct: 221 ELREAGNVVTANMYDFAEVNHRQL 244


>gi|426201370|gb|EKV51293.1| hypothetical protein AGABI2DRAFT_114027 [Agaricus bisporus var.
           bisporus H97]
          Length = 814

 Score = 58.9 bits (141), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 24/60 (40%), Positives = 37/60 (61%)

Query: 733 WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPF 792
           + K RPG   FL   +  ++MH+YTMG + YA E+   +DP G +F  R++SR + G+ F
Sbjct: 269 YIKPRPGWKEFLMDMATKYDMHVYTMGTRAYAEEVCAAIDPDGSVFKSRILSRDESGNDF 328



 Score = 58.2 bits (139), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 33/102 (32%), Positives = 48/102 (47%), Gaps = 2/102 (1%)

Query: 891 VDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHV 950
            +V  I+   + ++L G  +VFS V P+ +  P     W+ A  FGA C   +   VTHV
Sbjct: 500 FNVTTIIPRLRSEVLGGLHLVFSGVIPL-DTPPETTEFWRLARMFGAKCHTDLTPDVTHV 558

Query: 951 VANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDF 991
           +    GT KV  A   G   +V P W   S  L+ R +E  +
Sbjct: 559 ITAKRGTKKVETARQRGGIKIVRPEWFNDSIALWHRQDETPY 600


>gi|118369793|ref|XP_001018099.1| NLI interacting factor-like phosphatase family protein [Tetrahymena
           thermophila]
 gi|89299866|gb|EAR97854.1| NLI interacting factor-like phosphatase family protein [Tetrahymena
           thermophila SB210]
          Length = 874

 Score = 58.9 bits (141), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 57/207 (27%), Positives = 98/207 (47%), Gaps = 35/207 (16%)

Query: 650 LFEGYDDQQKAAI----QKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVH 705
           ++ G D + K+ +      E +++L  Q+ + S +KL LVLDLD+T+L     H V  + 
Sbjct: 247 VYAGLDQKDKSVLIGKEYAEYSKKLAHQQ-LHSNQKLILVLDLDNTIL-----HAVPAIK 300

Query: 706 DEILRKKE--EQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLY 763
           + +    +  +QD  K     F   +     K RP +  FL+     +E++++TM    Y
Sbjct: 301 NALFDNADGIQQDSFKE----FHNRYSKYVIKFRPYMKEFLQTVLPHYEIYIFTMAMLDY 356

Query: 764 ATEMAK--------VLDPKGVLF-AGRVISRGDDGDPFDGDERVPKSKDLEGVL-GMESA 813
           A  +          +LD   + F   R+ISR    + F  +     +KDL+ +L   E  
Sbjct: 357 AKCVCDYLKQTYKDILDDYPMTFNYDRIISR----EQFSSN-----NKDLQQILPNSEKI 407

Query: 814 VVIIDDSVRVWPHNKLNLIVVERYTYF 840
           ++I+DD   VW  NK+NL+    Y Y+
Sbjct: 408 MLILDDRDDVWAKNKMNLVTTLPYIYW 434


>gi|61098234|ref|NP_001012790.1| CTD small phosphatase-like protein 2 [Gallus gallus]
 gi|60098613|emb|CAH65137.1| hypothetical protein RCJMB04_4a24 [Gallus gallus]
          Length = 468

 Score = 58.9 bits (141), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 55/211 (26%), Positives = 98/211 (46%), Gaps = 24/211 (11%)

Query: 630 GPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQKERTRRLEEQKKMFSARKLCLVLD 687
            P++G   AH ++ + +   +F+ Y   +      +++  R+     K  S  +  LVLD
Sbjct: 236 SPDSGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEEQLNRKPALPLKTRSTPEFSLVLD 295

Query: 688 LDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERA 747
           LD TL++ +                E +D       LF+     ++ +LRP    FLER 
Sbjct: 296 LDETLVHCSL--------------NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERM 341

Query: 748 SKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGV 807
           S+++E+ L+T   K+YA ++  +LDPK  L   R+    +      G+      KDL  +
Sbjct: 342 SQIYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR--EHCVCVQGN----YIKDL-NI 394

Query: 808 LGME-SAVVIIDDSVRVWPHNKLNLIVVERY 837
           LG + S  +IID+S + + +   N I +E +
Sbjct: 395 LGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 425


>gi|413917758|gb|AFW57690.1| hypothetical protein ZEAMMB73_437679 [Zea mays]
          Length = 449

 Score = 58.9 bits (141), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 37/114 (32%), Positives = 53/114 (46%), Gaps = 24/114 (21%)

Query: 674 KKMFSARKLCLVLDLDHTLLNSAKFHEVD-----PVHDEILRKKEEQDREKPHRHLFRFP 728
           K+  S   + LVLDLD TL++S   H  D     PVH                   F F 
Sbjct: 255 KQTRSCPTMTLVLDLDETLVHSTLEHCEDADFTFPVH-------------------FNFR 295

Query: 729 HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 782
              ++ + RP +  FL+R + +FE  ++T    +YA ++  VLDPK  LF  RV
Sbjct: 296 EHTIYVRCRPYLKEFLDRVASVFETIIFTASQSIYAEQLLNVLDPKRKLFRHRV 349


>gi|187471087|sp|Q5F3Z7.2|CTSL2_CHICK RecName: Full=CTD small phosphatase-like protein 2;
           Short=CTDSP-like 2
          Length = 466

 Score = 58.9 bits (141), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 55/211 (26%), Positives = 98/211 (46%), Gaps = 24/211 (11%)

Query: 630 GPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQKERTRRLEEQKKMFSARKLCLVLD 687
            P++G   AH ++ + +   +F+ Y   +      +++  R+     K  S  +  LVLD
Sbjct: 234 SPDSGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEEQLNRKPALPLKTRSTPEFSLVLD 293

Query: 688 LDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERA 747
           LD TL++ +                E +D       LF+     ++ +LRP    FLER 
Sbjct: 294 LDETLVHCSL--------------NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERM 339

Query: 748 SKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGV 807
           S+++E+ L+T   K+YA ++  +LDPK  L   R+    +      G+      KDL  +
Sbjct: 340 SQIYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR--EHCVCVQGN----YIKDL-NI 392

Query: 808 LGME-SAVVIIDDSVRVWPHNKLNLIVVERY 837
           LG + S  +IID+S + + +   N I +E +
Sbjct: 393 LGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 423


>gi|34596232|gb|AAQ76796.1| hypothetical protein [Homo sapiens]
          Length = 466

 Score = 58.5 bits (140), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 57/210 (27%), Positives = 98/210 (46%), Gaps = 24/210 (11%)

Query: 631 PEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQKERTRRLEEQKKMFSARKLCLVLDL 688
           P++G   AH ++ + +   +F+ Y   +      +++  R+     K  S  +  LVLDL
Sbjct: 235 PDSGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEEQLNRKPALPLKTRSTPEFSLVLDL 294

Query: 689 DHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERAS 748
           D TL           VH  +    E +D       LF+     ++ +LRP    FLER S
Sbjct: 295 DETL-----------VHCSL---NELEDAALTFPVLFQDVVYQVYVRLRPFFREFLERMS 340

Query: 749 KLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVL 808
           +++E+ L+T   K+YA ++  +LDPK  L   R+    +      G+      KDL  +L
Sbjct: 341 QMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR--EHCVCVQGN----YIKDL-NIL 393

Query: 809 GME-SAVVIIDDSVRVWPHNKLNLIVVERY 837
           G + S  +IID+S + + +   N I +E +
Sbjct: 394 GRDLSKTIIIDNSPQAFAYQLSNGIPIESW 423


>gi|100815975|ref|NP_057480.2| CTD small phosphatase-like protein 2 [Homo sapiens]
 gi|187471086|sp|Q05D32.2|CTSL2_HUMAN RecName: Full=CTD small phosphatase-like protein 2;
           Short=CTDSP-like 2
 gi|23273027|gb|AAH35744.1| CTDSPL2 protein [Homo sapiens]
 gi|71835542|gb|AAZ42188.1| unknown [Homo sapiens]
 gi|119597671|gb|EAW77265.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
           small phosphatase like 2, isoform CRA_a [Homo sapiens]
 gi|119597672|gb|EAW77266.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
           small phosphatase like 2, isoform CRA_a [Homo sapiens]
 gi|123994825|gb|ABM85014.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
           small phosphatase like 2 [synthetic construct]
 gi|157928777|gb|ABW03674.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
           small phosphatase like 2 [synthetic construct]
 gi|158255896|dbj|BAF83919.1| unnamed protein product [Homo sapiens]
 gi|168278020|dbj|BAG10988.1| CTD small phosphatase like 2 [synthetic construct]
          Length = 466

 Score = 58.5 bits (140), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 57/210 (27%), Positives = 98/210 (46%), Gaps = 24/210 (11%)

Query: 631 PEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQKERTRRLEEQKKMFSARKLCLVLDL 688
           P++G   AH ++ + +   +F+ Y   +      +++  R+     K  S  +  LVLDL
Sbjct: 235 PDSGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEEQLNRKPALPLKTRSTPEFSLVLDL 294

Query: 689 DHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERAS 748
           D TL           VH  +    E +D       LF+     ++ +LRP    FLER S
Sbjct: 295 DETL-----------VHCSL---NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERMS 340

Query: 749 KLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVL 808
           +++E+ L+T   K+YA ++  +LDPK  L   R+    +      G+      KDL  +L
Sbjct: 341 QMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR--EHCVCVQGN----YIKDL-NIL 393

Query: 809 GME-SAVVIIDDSVRVWPHNKLNLIVVERY 837
           G + S  +IID+S + + +   N I +E +
Sbjct: 394 GRDLSKTIIIDNSPQAFAYQLSNGIPIESW 423


>gi|224062995|ref|XP_002187586.1| PREDICTED: CTD small phosphatase-like protein 2 [Taeniopygia
           guttata]
          Length = 467

 Score = 58.5 bits (140), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 57/211 (27%), Positives = 98/211 (46%), Gaps = 24/211 (11%)

Query: 630 GPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQKERTRRLEEQKKMFSARKLCLVLD 687
            P++G   AH ++ + +   +F+ Y   +      +++  R+     K  S  +  LVLD
Sbjct: 235 SPDSGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEEQLNRKPALPLKTRSTPEFSLVLD 294

Query: 688 LDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERA 747
           LD TL           VH  +    E +D       LF+     ++ +LRP    FLER 
Sbjct: 295 LDETL-----------VHCSL---NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERM 340

Query: 748 SKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGV 807
           S+++E+ L+T   K+YA ++  +LDPK  L   R+    +      G+      KDL  +
Sbjct: 341 SQIYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR--EHCVCVQGN----YIKDL-NI 393

Query: 808 LGME-SAVVIIDDSVRVWPHNKLNLIVVERY 837
           LG + S  +IID+S + + +   N I +E +
Sbjct: 394 LGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 424


>gi|413917759|gb|AFW57691.1| hypothetical protein ZEAMMB73_437679 [Zea mays]
          Length = 451

 Score = 58.5 bits (140), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 37/114 (32%), Positives = 53/114 (46%), Gaps = 24/114 (21%)

Query: 674 KKMFSARKLCLVLDLDHTLLNSAKFHEVD-----PVHDEILRKKEEQDREKPHRHLFRFP 728
           K+  S   + LVLDLD TL++S   H  D     PVH                   F F 
Sbjct: 255 KQTRSCPTMTLVLDLDETLVHSTLEHCEDADFTFPVH-------------------FNFR 295

Query: 729 HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 782
              ++ + RP +  FL+R + +FE  ++T    +YA ++  VLDPK  LF  RV
Sbjct: 296 EHTIYVRCRPYLKEFLDRVASVFETIIFTASQSIYAEQLLNVLDPKRKLFRHRV 349


>gi|350578733|ref|XP_003480441.1| PREDICTED: CTD small phosphatase-like protein 2-like [Sus scrofa]
          Length = 355

 Score = 58.5 bits (140), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 55/210 (26%), Positives = 98/210 (46%), Gaps = 24/210 (11%)

Query: 631 PEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQKERTRRLEEQKKMFSARKLCLVLDL 688
           P++G   AH ++ + +   +F+ Y   +      +++  R+     K  S  +  LVLDL
Sbjct: 124 PDSGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEEQLNRKPALPLKTRSTPEFSLVLDL 183

Query: 689 DHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERAS 748
           D TL++ +                E +D       LF+     ++ +LRP    FLER S
Sbjct: 184 DETLVHCSL--------------NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERMS 229

Query: 749 KLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVL 808
           +++E+ L+T   K+YA ++  +LDPK  L   R+    +      G+      KDL  +L
Sbjct: 230 QMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR--EHCVCVQGN----YIKDL-NIL 282

Query: 809 GME-SAVVIIDDSVRVWPHNKLNLIVVERY 837
           G + S  +IID+S + + +   N I +E +
Sbjct: 283 GRDLSKTIIIDNSPQAFAYQLSNGIPIESW 312


>gi|238480828|ref|NP_001031661.2| SCP1-like small phosphatase 4b [Arabidopsis thaliana]
 gi|240255993|ref|NP_193548.7| SCP1-like small phosphatase 4b [Arabidopsis thaliana]
 gi|332658601|gb|AEE84001.1| SCP1-like small phosphatase 4b [Arabidopsis thaliana]
 gi|332658602|gb|AEE84002.1| SCP1-like small phosphatase 4b [Arabidopsis thaliana]
          Length = 446

 Score = 58.5 bits (140), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 45/146 (30%), Positives = 73/146 (50%), Gaps = 22/146 (15%)

Query: 680 RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPG 739
           + + LVLDLD TL++S           E+ R     D +   R  F      ++ K RP 
Sbjct: 267 KAVTLVLDLDETLVHSTL---------EVCR-----DTDFSFRVTFNMQENTVYVKQRPY 312

Query: 740 IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVP 799
           ++ FLER  +LF + ++T  + +YA+++  +LDP G   + R     D     DG     
Sbjct: 313 LYRFLERVVELFHVVIFTASHSIYASQLLDILDPDGKFVSQRFYR--DSCILSDG----I 366

Query: 800 KSKDLEGVLGMESA-VVIIDDSVRVW 824
            +KDL  VLG++ A V I+D+  +V+
Sbjct: 367 YTKDL-TVLGLDLAKVAIVDNCPQVY 391


>gi|403222586|dbj|BAM40718.1| CTD-like phosphatase [Theileria orientalis strain Shintoku]
          Length = 763

 Score = 58.5 bits (140), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 38/119 (31%), Positives = 56/119 (47%), Gaps = 14/119 (11%)

Query: 727 FPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG 786
           FP++  + KLRP I  FL+  S  +EM +YT   K YA  +  +LDP   LF  R+++R 
Sbjct: 336 FPNITYYMKLRPCIREFLQILSLYYEMSIYTNATKEYADVVISILDPDRSLFMDRIVARN 395

Query: 787 --DDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVW---PHNKLNLIVVERYTYF 840
             D+ D      R+    D   +L         DD   VW   PH +  ++  E Y +F
Sbjct: 396 SVDEKDLLKSASRLYPDLDPRFILA-------FDDRRDVWSDIPHKQ--VVRAEHYDFF 445


>gi|6841480|gb|AAF29093.1|AF161478_1 HSPC129 [Homo sapiens]
          Length = 466

 Score = 58.5 bits (140), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 57/210 (27%), Positives = 98/210 (46%), Gaps = 24/210 (11%)

Query: 631 PEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQKERTRRLEEQKKMFSARKLCLVLDL 688
           P++G   AH ++ + +   +F+ Y   +      +++  R+     K  S  +  LVLDL
Sbjct: 235 PDSGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEEQLNRKPALPLKTRSTPEFSLVLDL 294

Query: 689 DHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERAS 748
           D TL           VH  +    E +D       LF+     ++ +LRP    FLER S
Sbjct: 295 DETL-----------VHCSL---NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERMS 340

Query: 749 KLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVL 808
           +++E+ L+T   K+YA ++  +LDPK  L   R+    +      G+      KDL  +L
Sbjct: 341 QMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR--EHCVCVQGN----YIKDL-NIL 393

Query: 809 GME-SAVVIIDDSVRVWPHNKLNLIVVERY 837
           G + S  +IID+S + + +   N I +E +
Sbjct: 394 GRDLSKTIIIDNSPQAFAYQLSNGIPIESW 423


>gi|344297040|ref|XP_003420208.1| PREDICTED: CTD small phosphatase-like protein 2 [Loxodonta
           africana]
          Length = 466

 Score = 58.5 bits (140), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 61/234 (26%), Positives = 104/234 (44%), Gaps = 24/234 (10%)

Query: 607 IKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQK 664
           I +G +      N D         P++G   AH ++ + +   +F+ Y   +      ++
Sbjct: 211 INNGLEEAEGTVNRDIPPLTAPVTPDSGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEE 270

Query: 665 ERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHL 724
           +  R+     K  S  +  LVLDLD TL           VH  +    E +D       L
Sbjct: 271 QLNRKPALPLKTRSTPEFSLVLDLDETL-----------VHCSL---NELEDAALTFPVL 316

Query: 725 FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 784
           F+     ++ +LRP    FLER S+++E+ L+T   K+YA ++  +LDPK  L   R+  
Sbjct: 317 FQDVIYQVYVRLRPFFREFLERMSQMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR 376

Query: 785 RGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 837
             +      G+      KDL  +LG + S  +IID+S + + +   N I +E +
Sbjct: 377 --EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 423


>gi|326926934|ref|XP_003209651.1| PREDICTED: CTD small phosphatase-like protein 2-like [Meleagris
           gallopavo]
          Length = 468

 Score = 58.5 bits (140), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 57/211 (27%), Positives = 98/211 (46%), Gaps = 24/211 (11%)

Query: 630 GPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQKERTRRLEEQKKMFSARKLCLVLD 687
            P++G   AH ++ + +   +F+ Y   +      +++  R+     K  S  +  LVLD
Sbjct: 236 SPDSGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEEQLNRKPALPLKTRSTPEFSLVLD 295

Query: 688 LDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERA 747
           LD TL           VH  +    E +D       LF+     ++ +LRP    FLER 
Sbjct: 296 LDETL-----------VHCSL---NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERM 341

Query: 748 SKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGV 807
           S+++E+ L+T   K+YA ++  +LDPK  L   R+    +      G+      KDL  +
Sbjct: 342 SQIYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR--EHCVCVQGN----YIKDL-NI 394

Query: 808 LGME-SAVVIIDDSVRVWPHNKLNLIVVERY 837
           LG + S  +IID+S + + +   N I +E +
Sbjct: 395 LGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 425


>gi|357450577|ref|XP_003595565.1| CTD small phosphatase-like protein [Medicago truncatula]
 gi|355484613|gb|AES65816.1| CTD small phosphatase-like protein [Medicago truncatula]
          Length = 460

 Score = 58.5 bits (140), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 44/149 (29%), Positives = 70/149 (46%), Gaps = 28/149 (18%)

Query: 680 RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPG 739
           + + LVLDLD TL++S   H                D +      F      ++ K RP 
Sbjct: 286 KSVTLVLDLDETLVHSTLEHC--------------DDADFTFNIFFNMKDYIVYVKQRPF 331

Query: 740 IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG---DDGDPFDGDE 796
           +  FLER S +FE+ ++T    +YA ++  +LDP     + R+        DG+      
Sbjct: 332 LHKFLERVSDMFEVVIFTASQSIYANQLLDILDPDEKFISRRLYRESCMFSDGN------ 385

Query: 797 RVPKSKDLEGVLGMESA-VVIIDDSVRVW 824
               +KDL  +LG++ A VVIID+S +V+
Sbjct: 386 ---YTKDL-TILGIDLAKVVIIDNSPQVF 410


>gi|449270631|gb|EMC81290.1| CTD small phosphatase-like protein 2 [Columba livia]
          Length = 468

 Score = 58.5 bits (140), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 57/211 (27%), Positives = 98/211 (46%), Gaps = 24/211 (11%)

Query: 630 GPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQKERTRRLEEQKKMFSARKLCLVLD 687
            P++G   AH ++ + +   +F+ Y   +      +++  R+     K  S  +  LVLD
Sbjct: 236 SPDSGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEEQLNRKPALPLKTRSTPEFSLVLD 295

Query: 688 LDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERA 747
           LD TL           VH  +    E +D       LF+     ++ +LRP    FLER 
Sbjct: 296 LDETL-----------VHCSL---NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERM 341

Query: 748 SKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGV 807
           S+++E+ L+T   K+YA ++  +LDPK  L   R+    +      G+      KDL  +
Sbjct: 342 SQIYEIILFTASKKVYADKLLNILDPKKKLVRHRLFR--EHCVCVQGN----YIKDL-NI 394

Query: 808 LGME-SAVVIIDDSVRVWPHNKLNLIVVERY 837
           LG + S  +IID+S + + +   N I +E +
Sbjct: 395 LGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 425


>gi|50949928|emb|CAH10508.1| hypothetical protein [Homo sapiens]
          Length = 394

 Score = 58.2 bits (139), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 57/210 (27%), Positives = 98/210 (46%), Gaps = 24/210 (11%)

Query: 631 PEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQKERTRRLEEQKKMFSARKLCLVLDL 688
           P++G   AH ++ + +   +F+ Y   +      +++  R+     K  S  +  LVLDL
Sbjct: 163 PDSGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEEQLNRKPALPLKTRSTPEFSLVLDL 222

Query: 689 DHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERAS 748
           D TL           VH  +    E +D       LF+     ++ +LRP    FLER S
Sbjct: 223 DETL-----------VHCSL---NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERMS 268

Query: 749 KLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVL 808
           +++E+ L+T   K+YA ++  +LDPK  L   R+    +      G+      KDL  +L
Sbjct: 269 QMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR--EHCVCVQGN----YIKDL-NIL 321

Query: 809 GME-SAVVIIDDSVRVWPHNKLNLIVVERY 837
           G + S  +IID+S + + +   N I +E +
Sbjct: 322 GRDLSKTIIIDNSPQAFAYQLSNGIPIESW 351


>gi|357450579|ref|XP_003595566.1| CTD small phosphatase-like protein [Medicago truncatula]
 gi|355484614|gb|AES65817.1| CTD small phosphatase-like protein [Medicago truncatula]
          Length = 469

 Score = 58.2 bits (139), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 44/149 (29%), Positives = 70/149 (46%), Gaps = 28/149 (18%)

Query: 680 RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPG 739
           + + LVLDLD TL++S   H                D +      F      ++ K RP 
Sbjct: 295 KSVTLVLDLDETLVHSTLEHC--------------DDADFTFNIFFNMKDYIVYVKQRPF 340

Query: 740 IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG---DDGDPFDGDE 796
           +  FLER S +FE+ ++T    +YA ++  +LDP     + R+        DG+      
Sbjct: 341 LHKFLERVSDMFEVVIFTASQSIYANQLLDILDPDEKFISRRLYRESCMFSDGN------ 394

Query: 797 RVPKSKDLEGVLGMESA-VVIIDDSVRVW 824
               +KDL  +LG++ A VVIID+S +V+
Sbjct: 395 ---YTKDL-TILGIDLAKVVIIDNSPQVF 419


>gi|334186662|ref|NP_001190760.1| SCP1-like small phosphatase 4b [Arabidopsis thaliana]
 gi|332658603|gb|AEE84003.1| SCP1-like small phosphatase 4b [Arabidopsis thaliana]
          Length = 442

 Score = 58.2 bits (139), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 45/146 (30%), Positives = 73/146 (50%), Gaps = 22/146 (15%)

Query: 680 RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPG 739
           + + LVLDLD TL++S           E+ R     D +   R  F      ++ K RP 
Sbjct: 267 KAVTLVLDLDETLVHSTL---------EVCR-----DTDFSFRVTFNMQENTVYVKQRPY 312

Query: 740 IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVP 799
           ++ FLER  +LF + ++T  + +YA+++  +LDP G   + R     D     DG     
Sbjct: 313 LYRFLERVVELFHVVIFTASHSIYASQLLDILDPDGKFVSQRFYR--DSCILSDG----I 366

Query: 800 KSKDLEGVLGMESA-VVIIDDSVRVW 824
            +KDL  VLG++ A V I+D+  +V+
Sbjct: 367 YTKDL-TVLGLDLAKVAIVDNCPQVY 391


>gi|209882178|ref|XP_002142526.1| NLI interacting factor-like phosphatase family protein
           [Cryptosporidium muris RN66]
 gi|209558132|gb|EEA08177.1| NLI interacting factor-like phosphatase family protein
           [Cryptosporidium muris RN66]
          Length = 710

 Score = 58.2 bits (139), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 42/155 (27%), Positives = 74/155 (47%), Gaps = 20/155 (12%)

Query: 699 HEVDPVHDEILRKKEEQDREKPHRHLFRFPH-------MGMWT----KLRPGIWTFLERA 747
           H +D   D I   + E+ R+   + +F  P+       +  W+    KLRPG+   L R 
Sbjct: 251 HYIDE-EDNIFGLEAEKYRQLIEKLIFCIPYPNSSNNGIDNWSQGFYKLRPGVLNMLRRL 309

Query: 748 SKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGV 807
              FE+++YTMG +L+A    +++DP+   F  + +   ++G       +   SK L  +
Sbjct: 310 KDKFELYMYTMGTELHAYSALRIIDPEFRFFHPKRLFYRNNG------FKDCNSKSLSTL 363

Query: 808 LGME-SAVVIIDDSVRVWPHNKLNLIVVERYTYFP 841
              +   +++IDD  + W  N  +LI V  Y +FP
Sbjct: 364 FPYDHRTLIVIDDIEQAWS-NSNSLIKVYPYNFFP 397


>gi|297794619|ref|XP_002865194.1| NLI interacting factor family protein [Arabidopsis lyrata subsp.
           lyrata]
 gi|297311029|gb|EFH41453.1| NLI interacting factor family protein [Arabidopsis lyrata subsp.
           lyrata]
          Length = 447

 Score = 58.2 bits (139), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 45/146 (30%), Positives = 71/146 (48%), Gaps = 22/146 (15%)

Query: 680 RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPG 739
           + + LVLDLD TL++S             L      D     R  F      ++ K RP 
Sbjct: 278 KSVTLVLDLDETLVHST------------LESCNVADFS--FRVFFNMQENTVYVKQRPH 323

Query: 740 IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVP 799
           ++ FLER  +LF + ++T  + +YA+++  +LDP+G   + R     D     DG     
Sbjct: 324 LYRFLERVGELFHVVIFTASHNIYASQLLDILDPEGKFISQRFYR--DSCILLDG----I 377

Query: 800 KSKDLEGVLGMESA-VVIIDDSVRVW 824
            +KDL  VLG++ A V IID+  +V+
Sbjct: 378 YTKDL-TVLGLDLAKVAIIDNCPQVY 402


>gi|68068525|ref|XP_676173.1| hypothetical protein [Plasmodium berghei strain ANKA]
 gi|56495746|emb|CAI00611.1| conserved hypothetical protein [Plasmodium berghei]
          Length = 953

 Score = 58.2 bits (139), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 67/267 (25%), Positives = 115/267 (43%), Gaps = 41/267 (15%)

Query: 716 DREKPHRHLFRFPHMGM--WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDP 773
           + ++P  + F  P+     + K RP +  FLE  S  +E+ +YT   + YA  +  +LDP
Sbjct: 598 ENDEPELYKFFLPYYNFFYYLKFRPYVRQFLEILSLYYELSIYTNATREYADVVIAILDP 657

Query: 774 KGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVL-GMESAVVI-IDDSVRVW---PHNK 828
              +FA R+++R       D DE    +K  E +   ++   VI  DD   VW   PH+ 
Sbjct: 658 DRTIFADRIVAR---CSSVDRDE----NKHFEKIYPNVDPKYVIAFDDRKDVWFDIPHS- 709

Query: 829 LNLIVVERYTYFPCSRRQF--GLLGPSLLE---IDHDERSEDGTLASSLGVIERLHKIFF 883
            +++  E Y +F  S+         PS  +   +D D       L   + +  ++HK FF
Sbjct: 710 -HILRAEHYNFFELSKYDIISHFKEPSTCKKRFVDMDMH-----LHYMIKIFLKIHKQFF 763

Query: 884 SHQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQT---------AEQ 934
            +    +VDV  I+       L+   + F+      + + +L  +  T         A +
Sbjct: 764 ENPL--NVDVGKIIDNIMLSTLSNVGLYFT---GFRKNSKNLQNVISTDCEERQKEIALE 818

Query: 935 FGAVCTKHIDD-QVTHVVANSLGTDKV 960
            GA    + DD  VTH++A    TD +
Sbjct: 819 LGAKIYSNYDDPGVTHIIAAKNCTDNL 845


>gi|395837830|ref|XP_003791832.1| PREDICTED: CTD small phosphatase-like protein 2 isoform 1 [Otolemur
           garnettii]
 gi|395837832|ref|XP_003791833.1| PREDICTED: CTD small phosphatase-like protein 2 isoform 2 [Otolemur
           garnettii]
          Length = 466

 Score = 57.8 bits (138), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 60/234 (25%), Positives = 103/234 (44%), Gaps = 24/234 (10%)

Query: 607 IKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQK 664
           I +G +      N D         P++G    H ++ + +   +F+ Y   +      ++
Sbjct: 211 INNGLEEAEETVNRDIPPLTAPVTPDSGYSSTHAEATYEEDWEVFDPYYFIKHVPPLTEE 270

Query: 665 ERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHL 724
           +  R+     K  S  +  LVLDLD TL           VH  +    E +D       L
Sbjct: 271 QLNRKPALPLKTRSTPEFSLVLDLDETL-----------VHCSL---NELEDAALTFPVL 316

Query: 725 FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 784
           F+     ++ +LRP    FLER S+++E+ L+T   K+YA ++  +LDPK  L   R+  
Sbjct: 317 FQDVIYQVYVRLRPFFREFLERMSQMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR 376

Query: 785 RGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 837
             +      G+      KDL  +LG + S  +IID+S + + +   N I +E +
Sbjct: 377 --EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 423


>gi|84994102|ref|XP_951773.1| CTD-like phosphatase [Theileria annulata strain Ankara]
 gi|65301934|emb|CAI74041.1| CTD-like phosphatase, putative [Theileria annulata]
          Length = 767

 Score = 57.8 bits (138), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 37/119 (31%), Positives = 55/119 (46%), Gaps = 14/119 (11%)

Query: 727 FPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG 786
           F ++  + KLRP I  FL+  S  +EM +YT   K YA  +  +LDP   LF  R+++R 
Sbjct: 338 FANVNYYMKLRPCIREFLQILSLYYEMSIYTNATKEYADVVISILDPDRSLFMDRIVARN 397

Query: 787 --DDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVW---PHNKLNLIVVERYTYF 840
             D+ D      R+    D   +L         DD   VW   PH +  ++  E Y +F
Sbjct: 398 SVDEKDLLKSASRLYPDLDTRFILAF-------DDRRDVWSDIPHKQ--VVRAEHYDFF 447


>gi|403332687|gb|EJY65381.1| hypothetical protein OXYTRI_14465 [Oxytricha trifallax]
          Length = 927

 Score = 57.8 bits (138), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 45/168 (26%), Positives = 84/168 (50%), Gaps = 21/168 (12%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 730
           ++Q+K+++     L+LD+D TL+   +     P + +I++            H      +
Sbjct: 468 KQQQKLYT-----LILDMDETLIYCRQ--NPYPGYQDIIQATSS-------AHNTYSCQV 513

Query: 731 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 790
            ++T  RP +  FLE+ S++FE+ ++T   K YA  +   +DP+   F+ R+    D   
Sbjct: 514 QIFTSYRPNLRKFLEQVSQIFEVVIFTASEKSYADLILDKIDPRNEFFSKRLYR--DSCL 571

Query: 791 PFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 837
           P  G + V   KDL  +LG + S  +I+D+S+  + +N  N I +  Y
Sbjct: 572 PTPGGQYV---KDL-TILGRDLSRTIIVDNSIMAFAYNISNGIPIPSY 615


>gi|255547724|ref|XP_002514919.1| conserved hypothetical protein [Ricinus communis]
 gi|223545970|gb|EEF47473.1| conserved hypothetical protein [Ricinus communis]
          Length = 455

 Score = 57.8 bits (138), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 50/158 (31%), Positives = 75/158 (47%), Gaps = 28/158 (17%)

Query: 684 LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTF 743
           LVLDLD TL++S     ++P  D         D   P    F      ++ + RP +  F
Sbjct: 265 LVLDLDETLVHST----LEPCGD--------ADFTFPVN--FNLQEHTVYVRCRPFLKDF 310

Query: 744 LERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD---DGDPFDGDERVPK 800
           +ER S LFE+ ++T    +YA ++  VLDPK  +F  RV        +G+          
Sbjct: 311 MERVSSLFEIIIFTASQSIYAEQLLNVLDPKRKVFRHRVFRESCVYVEGNYL-------- 362

Query: 801 SKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERY 837
            KDL  VLG + A V+IID+S + +     N I +E +
Sbjct: 363 -KDL-SVLGRDLARVIIIDNSPQAFGFQVDNGIPIESW 398


>gi|294875260|ref|XP_002767242.1| hypothetical protein Pmar_PMAR022745 [Perkinsus marinus ATCC 50983]
 gi|239868797|gb|EEQ99959.1| hypothetical protein Pmar_PMAR022745 [Perkinsus marinus ATCC 50983]
          Length = 215

 Score = 57.8 bits (138), Expect = 3e-05,   Method: Composition-based stats.
 Identities = 45/152 (29%), Positives = 73/152 (48%), Gaps = 23/152 (15%)

Query: 692 LLNSAKFHEVDPVHDE----ILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFL-ER 746
           ++NS +  + DP   E    IL++  + +   P  +L     + + TK+RP    F+ E 
Sbjct: 1   MVNSYEIGKADPSQSESVTPILQEVYKDEEGLPELYLCVISDVKVLTKIRPHARAFIREL 60

Query: 747 ASKL---FEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKD 803
            SK      + +YT G++ Y   + K+LDP G L  GR++SR D     +     P  KD
Sbjct: 61  VSKTGCGVVLSIYTKGSRRYMEVIKKMLDPSGELIKGRLVSRED-----EPSNMTPLEKD 115

Query: 804 LEGVLGMESAV----------VIIDDSVRVWP 825
            + ++  +SAV          V++DDS  VWP
Sbjct: 116 PDFIINADSAVGTEELRRRWFVVLDDSPEVWP 147


>gi|355681384|gb|AER96789.1| CTD small phosphatase like 2 [Mustela putorius furo]
          Length = 465

 Score = 57.8 bits (138), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 60/234 (25%), Positives = 103/234 (44%), Gaps = 24/234 (10%)

Query: 607 IKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQK 664
           I +G +      N D         P++G    H ++ + +   +F+ Y   +      ++
Sbjct: 211 INNGLEEAEETVNRDIPPLTAPVTPDSGYSSTHAEATYEEDWEVFDPYYFIKHVPPLTEE 270

Query: 665 ERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHL 724
           +  R+     K  S  +  LVLDLD TL           VH  +    E +D       L
Sbjct: 271 QLNRKPALPLKTRSTPEFSLVLDLDETL-----------VHCSL---NELEDAALTFPVL 316

Query: 725 FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 784
           F+     ++ +LRP    FLER S+++E+ L+T   K+YA ++  +LDPK  L   R+  
Sbjct: 317 FQDVIYQVYVRLRPFFREFLERMSQMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR 376

Query: 785 RGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 837
             +      G+      KDL  +LG + S  +IID+S + + +   N I +E +
Sbjct: 377 --EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 423


>gi|356540144|ref|XP_003538550.1| PREDICTED: CTD small phosphatase-like protein 2-like [Glycine max]
          Length = 462

 Score = 57.8 bits (138), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 56/174 (32%), Positives = 80/174 (45%), Gaps = 28/174 (16%)

Query: 668 RRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRF 727
           RRL   K+  S     LVLDLD TL++S             L   E+ D   P    F  
Sbjct: 259 RRLLLPKQTRSCPSTTLVLDLDETLVHST------------LEPCEDVDFTFPVN--FNS 304

Query: 728 PHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD 787
               ++ + RP +  FLER S LFE+ ++T    +YA ++  VLDPK  +F  RV     
Sbjct: 305 EEHIVYVRCRPHLKDFLERVSGLFEIIIFTASQSIYAEQLLNVLDPKRKIFRHRVYRESC 364

Query: 788 ---DGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERY 837
              +G+           KDL  VLG + A V+IID+S + +     N I +E +
Sbjct: 365 VYVEGNYL---------KDL-TVLGRDLAHVMIIDNSPQAFGFQVDNGIPIESW 408


>gi|281338163|gb|EFB13747.1| hypothetical protein PANDA_001000 [Ailuropoda melanoleuca]
          Length = 445

 Score = 57.4 bits (137), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 56/210 (26%), Positives = 97/210 (46%), Gaps = 24/210 (11%)

Query: 631 PEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQKERTRRLEEQKKMFSARKLCLVLDL 688
           P++G    H ++ + +   +F+ Y   +      +++  R+     K  S  +  LVLDL
Sbjct: 235 PDSGYSSTHAEATYEEDWEVFDPYYFIKHVPPLTEEQLNRKPALPLKTRSTPEFSLVLDL 294

Query: 689 DHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERAS 748
           D TL           VH  +    E +D       LF+     ++ +LRP    FLER S
Sbjct: 295 DETL-----------VHCSL---NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLERMS 340

Query: 749 KLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVL 808
           +++E+ L+T   K+YA ++  +LDPK  L   R+    +      G+      KDL  +L
Sbjct: 341 QMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR--EHCVCVQGN----YIKDL-NIL 393

Query: 809 GME-SAVVIIDDSVRVWPHNKLNLIVVERY 837
           G + S  +IID+S + + +   N I +E +
Sbjct: 394 GRDLSKTIIIDNSPQAFAYQLSNGIPIESW 423


>gi|55740289|gb|AAV63947.1| putative nuclear LIM interactor-interacting protein [Phytophthora
           sojae]
          Length = 261

 Score = 57.4 bits (137), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 51/147 (34%), Positives = 73/147 (49%), Gaps = 28/147 (19%)

Query: 678 SARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHL---FRFPHMGMWT 734
           +A K+CLVLDLD TL++ +    VD V             + PH      F      +  
Sbjct: 75  NAPKICLVLDLDETLVHCS----VDEV-------------KNPHMQFPVTFNGVEYTVNV 117

Query: 735 KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDG 794
           K RP +  FL+R SKLFE+ ++T  +K+YA ++  +LDP       R + R D  D F  
Sbjct: 118 KKRPHLEYFLKRVSKLFEIVVFTASHKVYAEKLMNMLDPNRNFIKYR-LYREDCLDVFGN 176

Query: 795 DERVPKSKDLEGVLGME-SAVVIIDDS 820
                  KDL  VLG + S VV++D+S
Sbjct: 177 -----YLKDL-NVLGRDLSKVVLVDNS 197


>gi|301754747|ref|XP_002913218.1| PREDICTED: CTD small phosphatase-like protein 2-like [Ailuropoda
           melanoleuca]
          Length = 466

 Score = 57.4 bits (137), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 60/234 (25%), Positives = 103/234 (44%), Gaps = 24/234 (10%)

Query: 607 IKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQK 664
           I +G +      N D         P++G    H ++ + +   +F+ Y   +      ++
Sbjct: 211 INNGLEEAEETVNRDIPPLTAPVTPDSGYSSTHAEATYEEDWEVFDPYYFIKHVPPLTEE 270

Query: 665 ERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHL 724
           +  R+     K  S  +  LVLDLD TL           VH  +    E +D       L
Sbjct: 271 QLNRKPALPLKTRSTPEFSLVLDLDETL-----------VHCSL---NELEDAALTFPVL 316

Query: 725 FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 784
           F+     ++ +LRP    FLER S+++E+ L+T   K+YA ++  +LDPK  L   R+  
Sbjct: 317 FQDVIYQVYVRLRPFFREFLERMSQMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR 376

Query: 785 RGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 837
             +      G+      KDL  +LG + S  +IID+S + + +   N I +E +
Sbjct: 377 --EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 423


>gi|358335312|dbj|GAA53844.1| CTD small phosphatase-like protein 2 [Clonorchis sinensis]
          Length = 498

 Score = 57.4 bits (137), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 51/189 (26%), Positives = 91/189 (48%), Gaps = 31/189 (16%)

Query: 674 KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGM- 732
           K+  SA + CLVLDLD TL++ +    + P+ D               + +F+    G+ 
Sbjct: 299 KRTRSAPEFCLVLDLDETLVHCS----LTPLPDA--------------QFIFQVVFQGVV 340

Query: 733 ---WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 789
              + ++RP ++ FL R S+ FE+ L+T   K+YA  +  ++DPK      R+    +  
Sbjct: 341 YMVYVRIRPHLYEFLSRVSERFEVVLFTASTKVYADRLVNLIDPKKKWIKHRLFR--EHC 398

Query: 790 DPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFG 848
              +G+      KDL  VLG +    VI+D+S + + +   N + +E + +   + R+  
Sbjct: 399 VCVNGN----YVKDLR-VLGRDLRKTVIVDNSPQAFGYQLDNGVPIESW-FVDSNDRELL 452

Query: 849 LLGPSLLEI 857
            L P L E+
Sbjct: 453 NLLPFLFEV 461


>gi|7022613|dbj|BAA91664.1| unnamed protein product [Homo sapiens]
          Length = 286

 Score = 57.4 bits (137), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 58/234 (24%), Positives = 104/234 (44%), Gaps = 24/234 (10%)

Query: 607 IKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQK 664
           + +G +      N D         P++G   AH ++ + +   +F+ Y   +      ++
Sbjct: 31  LNNGLEEAEETVNRDIPPLTAPVTPDSGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEE 90

Query: 665 ERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHL 724
           +  R+     K  S  +  LVLDLD TL++ +                E +D       L
Sbjct: 91  QLNRKPALPLKTRSTPEFSLVLDLDETLVHCSL--------------NELEDAALTFPVL 136

Query: 725 FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 784
           F+     ++ +LRP    FLER S+++E+ L+T   K+YA ++  +LDPK  L   R+  
Sbjct: 137 FQDVVYQVYVRLRPFFREFLERMSQMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR 196

Query: 785 RGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 837
             +      G+      KDL  +LG + S  +IID+S + + +   N I +E +
Sbjct: 197 --EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 243


>gi|225716618|gb|ACO14155.1| RNA polymerase II subunit A C-terminal domain phosphatase [Esox
           lucius]
          Length = 266

 Score = 57.4 bits (137), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 33/98 (33%), Positives = 56/98 (57%), Gaps = 12/98 (12%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 730
           E+Q+++   +KL L++DLD TL+++ + H     +  I            H  L R   M
Sbjct: 167 EDQQRLHRNKKLVLMVDLDQTLIHTTEQHCQRMSNKGIF-----------HFQLGRGEPM 215

Query: 731 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMA 768
            + T+LRP    FLE+ +KL+E+H++T G++LYA  +A
Sbjct: 216 -LHTRLRPHCKEFLEKIAKLYELHVFTFGSRLYAHTIA 252


>gi|432851772|ref|XP_004067077.1| PREDICTED: CTD small phosphatase-like protein 2-A-like [Oryzias
           latipes]
          Length = 474

 Score = 57.4 bits (137), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 88/359 (24%), Positives = 149/359 (41%), Gaps = 61/359 (16%)

Query: 496 MDELGKVRMKPRDPRRVLHGNALQRSGSLGPEFKTDGPSAPCTQGSK------ENLNFQK 549
           ++E   V +    PR  L G          P F    P+   + GS       E     K
Sbjct: 117 LEEASAVEVTASPPRTTLLGTIF------SPVFNFFSPAKNASSGSDSPDQALEAEEIVK 170

Query: 550 QLGAPEAKPVLSQSVLQPDI--TQQFTKNLKHIADFMSVSQPLTSEPMVSQNSP-IQPGQ 606
           QL   E   + S +V Q D+  T  F  ++ H+        P    P + + SP I   +
Sbjct: 171 QLDMEEVVEMPSSTVTQ-DVCATTHFYSSVSHL--------PSLRPPHMLEASPTIDEAE 221

Query: 607 IKSGADMKAVVTNHDDKQTGTGSGPEAG-----PVGAHPQSAWGDVEHLFEGYDDQQKA- 660
           +++ AD+  +        T  G+ PE       P    P+ ++ +   +F+ Y   +   
Sbjct: 222 LEADADLPPL--------TAPGASPEMTYVDVPPPAVPPEVSYEEDWEVFDPYFFIKHVP 273

Query: 661 -AIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREK 719
              +++ TR+     K  S  +  LVLDLD TL++ +                E +D   
Sbjct: 274 PLTEEQLTRKPALPLKTRSTPEFSLVLDLDETLVHCSL--------------NELEDAAL 319

Query: 720 PHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA 779
               LF+     ++ +LRP    FLER S+++E+ L+T   K+YA ++  +LDPK  L  
Sbjct: 320 TFPVLFQDVIYQVYVRLRPFFREFLERMSQIYEIILFTASKKVYADKLLNILDPKKQLVR 379

Query: 780 GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 837
            R+    +      G+      KDL  +LG + S  +IID+S + + +   N I +E +
Sbjct: 380 HRLFR--EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 431


>gi|125527169|gb|EAY75283.1| hypothetical protein OsI_03170 [Oryza sativa Indica Group]
          Length = 507

 Score = 57.4 bits (137), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 48/167 (28%), Positives = 81/167 (48%), Gaps = 36/167 (21%)

Query: 679 ARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP-------HMG 731
           ARK+ LVLDLD TL++S                      E+   + F FP       HM 
Sbjct: 325 ARKVTLVLDLDETLVHSTT--------------------EQCDDYDFTFPVFFDLKEHM- 363

Query: 732 MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 791
           ++ + RP +  FL++ +++FE+ ++T    +YA ++  +LDP+  LF+ R          
Sbjct: 364 VYVRKRPHLHMFLQKMAEMFEVVIFTASQSVYADQLLDILDPEKKLFSRRYFRESCVF-- 421

Query: 792 FDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERY 837
                    +KDL  V+G++ A VVIID++ +V+     N I +E +
Sbjct: 422 ----TNTSYTKDL-TVVGVDLAKVVIIDNTPQVFQLQVNNGIPIESW 463


>gi|340380578|ref|XP_003388799.1| PREDICTED: hypothetical protein LOC100637093 [Amphimedon
           queenslandica]
          Length = 532

 Score = 57.0 bits (136), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 53/176 (30%), Positives = 83/176 (47%), Gaps = 37/176 (21%)

Query: 666 RTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD---REKPHR 722
           RTRR  E          CLVLDLD TL++ +            L K E  +   + +   
Sbjct: 348 RTRRTPE---------FCLVLDLDETLVHCS------------LSKLELANFTFKVEYSN 386

Query: 723 HLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 782
            LF      ++ +LRP    FLER SK FE+ L+T   K+YA ++  ++DP   L   R+
Sbjct: 387 QLF-----DVYVRLRPYFHEFLERVSKQFEVILFTASTKVYADKLLDLIDPSRRLVKHRL 441

Query: 783 ISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERY 837
               D     DG+      K+L G+LG + A  +I+D+S + + +   N + +E +
Sbjct: 442 FR--DHCVCVDGN----FIKEL-GILGRDLAKTIIVDNSPQAFGYQLSNGVPIESW 490


>gi|357463015|ref|XP_003601789.1| CTD small phosphatase-like protein [Medicago truncatula]
 gi|355490837|gb|AES72040.1| CTD small phosphatase-like protein [Medicago truncatula]
          Length = 885

 Score = 57.0 bits (136), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 46/144 (31%), Positives = 72/144 (50%), Gaps = 22/144 (15%)

Query: 682 LCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIW 741
           + LVLDLD TL++S+               K  +D +      F+     ++ + RP + 
Sbjct: 694 ITLVLDLDETLVHSSL--------------KPSEDVDFTFTVNFKSEEYIVYVRCRPHLK 739

Query: 742 TFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKS 801
            FLER S LFE+ ++T    +YA ++  +LDPK  +F  RV    +     +G+      
Sbjct: 740 EFLERVSGLFEIIIFTASQSIYAEQLLNLLDPKRKIFRHRVFR--ESCVKVEGNYL---- 793

Query: 802 KDLEGVLGMESA-VVIIDDSVRVW 824
           KDL  VLG + A V+IID+S R +
Sbjct: 794 KDL-TVLGCDLAHVMIIDNSRRAF 816



 Score = 57.0 bits (136), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 54/171 (31%), Positives = 86/171 (50%), Gaps = 22/171 (12%)

Query: 668 RRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRF 727
           RR+   K+  S   + LVLDLD TL++S+    ++P  D         + E+   H+   
Sbjct: 265 RRMLLPKQTRSCPPITLVLDLDETLVHSS----LEPCEDVDFTFTVNFNSEE---HI--- 314

Query: 728 PHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD 787
               ++ + RP +  FLER S LFE+ ++T    +YA ++  VLDPK  +F  RV    +
Sbjct: 315 ----VYVRCRPHLKEFLERVSGLFEIIIFTASQSIYAEQLLNVLDPKRKIFRHRVFR--E 368

Query: 788 DGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERY 837
                +G+      KDL  VLG + A V+IID+S + +     N I +E +
Sbjct: 369 SCVYVEGNYL----KDL-TVLGRDLAHVMIIDNSPQAFGFQVDNGIPIESW 414


>gi|359473746|ref|XP_002271611.2| PREDICTED: CTD small phosphatase-like protein 2-like [Vitis
           vinifera]
 gi|297738449|emb|CBI27650.3| unnamed protein product [Vitis vinifera]
          Length = 503

 Score = 57.0 bits (136), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 51/165 (30%), Positives = 76/165 (46%), Gaps = 42/165 (25%)

Query: 684 LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP-------HMGMWTKL 736
           LVLDLD TL++S     ++P  D                  F FP       HM ++ + 
Sbjct: 313 LVLDLDETLVHST----LEPCDDAD----------------FTFPVNFNLKEHM-VYVRC 351

Query: 737 RPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD---DGDPFD 793
           RP +  F+ER + LFE+ ++T    +YA ++  VLDPK   F  RV        +G+   
Sbjct: 352 RPHLKDFMERVASLFEIIIFTASQSIYAEQLLNVLDPKRRFFRHRVYRESCVFVEGNYL- 410

Query: 794 GDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERY 837
                   KDL  VLG + A V+IID+S + +     N I +E +
Sbjct: 411 --------KDL-SVLGRDLAHVIIIDNSPQAFGFQVDNGIPIESW 446


>gi|297597322|ref|NP_001043795.2| Os01g0665300 [Oryza sativa Japonica Group]
 gi|55773815|dbj|BAD72353.1| Chain A, Three-Dimensional Structure Of A Rna-Polymerase Ii Binding
           Protein With Associated Ligand-like [Oryza sativa
           Japonica Group]
 gi|125571492|gb|EAZ13007.1| hypothetical protein OsJ_02926 [Oryza sativa Japonica Group]
 gi|255673527|dbj|BAF05709.2| Os01g0665300 [Oryza sativa Japonica Group]
          Length = 439

 Score = 57.0 bits (136), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 48/167 (28%), Positives = 81/167 (48%), Gaps = 36/167 (21%)

Query: 679 ARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP-------HMG 731
           ARK+ LVLDLD TL++S                      E+   + F FP       HM 
Sbjct: 257 ARKVTLVLDLDETLVHSTT--------------------EQCDDYDFTFPVFFDMKEHM- 295

Query: 732 MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 791
           ++ + RP +  FL++ +++FE+ ++T    +YA ++  +LDP+  LF+ R          
Sbjct: 296 VYVRKRPHLHMFLQKMAEMFEVVIFTASQSVYADQLLDILDPEKKLFSRRYFRESCVF-- 353

Query: 792 FDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERY 837
                    +KDL  V+G++ A VVIID++ +V+     N I +E +
Sbjct: 354 ----TNTSYTKDL-TVVGVDLAKVVIIDNTPQVFQLQVNNGIPIESW 395


>gi|395503570|ref|XP_003756137.1| PREDICTED: CTD small phosphatase-like protein 2 [Sarcophilus
           harrisii]
          Length = 395

 Score = 56.6 bits (135), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 36/118 (30%), Positives = 64/118 (54%), Gaps = 8/118 (6%)

Query: 721 HRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAG 780
           H++L ++    ++ +LRP    FLER S+++E+ L+T   K+YA ++  +LDPK  L   
Sbjct: 242 HKNLKKYIDSNVYVRLRPFFREFLERMSQIYEIILFTASKKVYADKLLNILDPKKQLVRH 301

Query: 781 RVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 837
           R+    +      G+      KDL  +LG + S  +IID+S + + +   N I +E +
Sbjct: 302 RLFR--EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 352


>gi|348523113|ref|XP_003449068.1| PREDICTED: CTD small phosphatase-like protein 2-A-like [Oreochromis
           niloticus]
          Length = 378

 Score = 56.6 bits (135), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 43/156 (27%), Positives = 77/156 (49%), Gaps = 22/156 (14%)

Query: 684 LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTF 743
           LV+DL+ TL+    F  ++ +          +D E      F+     ++  LRP +  F
Sbjct: 202 LVVDLEETLM----FSSLNVI----------EDAEYTFHAAFQDHQYKVYMVLRPHVKEF 247

Query: 744 LERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKD 803
           L+  +K++E+ +YT   K YA ++  +LDP+  LF  R+    DD     G       KD
Sbjct: 248 LQAMAKIYELFVYTCAKKEYAEKILDILDPQRKLFRHRLYQ--DDCACVLGH----YIKD 301

Query: 804 LEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYT 838
           L  +LG +    V++D++   +P+N LN I ++ ++
Sbjct: 302 L-SILGRDLKKTVVLDNAPHTYPYNLLNTIPIKSWS 336


>gi|355692677|gb|EHH27280.1| CTD small phosphatase-like protein 2 [Macaca mulatta]
          Length = 466

 Score = 56.6 bits (135), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 60/234 (25%), Positives = 102/234 (43%), Gaps = 24/234 (10%)

Query: 607 IKSGADMKAVVTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQK 664
           I +G +      N D         P++G   AH ++ + +   +F+ Y   +      ++
Sbjct: 211 INNGLEEAEETVNRDIPPLTAPVTPDSGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEE 270

Query: 665 ERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHL 724
           +  R+     K  S  +  LVLDLD TL           VH  +    E +D       L
Sbjct: 271 QLNRKPALPLKTRSTPEFSLVLDLDETL-----------VHCSL---NELEDAALTFPVL 316

Query: 725 FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 784
           F+     ++ +LRP    FLER S+++E+ L+T   K+YA ++  +LDPK  L       
Sbjct: 317 FQDVIYQVYVRLRPFFREFLERMSQMYEIILFTASKKVYADKLLNILDPKKQLVRHHFFC 376

Query: 785 RGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 837
             +      G+      KDL  +LG + S  +IID+S + + +   N I +E +
Sbjct: 377 --EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 423


>gi|148233948|ref|NP_001082795.1| CTD small phosphatase-like protein 2-B [Danio rerio]
 gi|187471000|sp|A4QNX6.1|CTL2B_DANRE RecName: Full=CTD small phosphatase-like protein 2-B;
           Short=CTDSP-like 2-B
 gi|141796856|gb|AAI39561.1| Zgc:162265 protein [Danio rerio]
          Length = 460

 Score = 56.6 bits (135), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 61/220 (27%), Positives = 99/220 (45%), Gaps = 28/220 (12%)

Query: 625 TGTGSGPEAGPVGAHPQS-AWGDVEHLFEGYD-----DQQKAAIQKERTRRLEEQKKMFS 678
           T  GS    G V A     A G  E  +E +D            +++ TR+     K  S
Sbjct: 219 TAPGSPATGGYVDASITVPAEGSYEEEWEVFDPYFFIKHVPPLTEEQLTRKPALPLKTRS 278

Query: 679 ARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRP 738
             +  LVLDLD TL++ +  +E+D             D       LF+     ++ +LRP
Sbjct: 279 TPEFSLVLDLDETLVHCS-LNELD-------------DAALTFPVLFQDVIYQVYVRLRP 324

Query: 739 GIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERV 798
               FLER S+++E+ L+T   K+YA ++  +LDP+  L   R+    +      G+   
Sbjct: 325 FFREFLERMSQIYEIILFTASKKVYADKLLNILDPRKQLVRHRLFR--EHCVCVQGN--- 379

Query: 799 PKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 837
              KDL  +LG + S  +IID+S + + +   N I +E +
Sbjct: 380 -YIKDL-NILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 417


>gi|326671582|ref|XP_700009.2| PREDICTED: CTD small phosphatase-like protein 2-like [Danio rerio]
          Length = 358

 Score = 56.2 bits (134), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 47/154 (30%), Positives = 72/154 (46%), Gaps = 20/154 (12%)

Query: 684 LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTF 743
           LVLDLD TL+ S+    V P            D E      F+     ++  LRP +  F
Sbjct: 182 LVLDLDETLVFSSL--NVIP------------DAEYTFNTRFQDHKYKVYVILRPHVREF 227

Query: 744 LERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKD 803
           L+  +K FEM +YT   K YA ++  +LDP   LF  R+    DD     G       KD
Sbjct: 228 LQAMTKHFEMFVYTSAKKEYAEKIVDILDPNKKLFRHRLYQ--DDCACVLGH----YIKD 281

Query: 804 LEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERY 837
           L  +    S  VI+D++   +P++ +N+I ++ +
Sbjct: 282 LTILERDLSKTVILDNAPHTFPYHLMNMIPIKSW 315


>gi|55740281|gb|AAV63942.1| putative nuclear LIM factor interactor-interacting protein hyphal
           form [Phytophthora infestans]
          Length = 211

 Score = 56.2 bits (134), Expect = 8e-05,   Method: Composition-based stats.
 Identities = 57/177 (32%), Positives = 86/177 (48%), Gaps = 28/177 (15%)

Query: 665 ERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHL 724
           E  R +   ++  +A K+CLVLDLD TL++ +    VD V             + PH   
Sbjct: 12  EGKRPISLPERSHNAPKICLVLDLDETLVHCS----VDEV-------------KNPHMQF 54

Query: 725 ---FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGR 781
              F      +  K RP +  FL+R SKLFE+ ++T  +K+YA ++  +LDP   L   R
Sbjct: 55  PVTFNGVEYIVNVKKRPHMEYFLKRVSKLFEIVVFTASHKVYAEKLTNMLDPHRNLIKYR 114

Query: 782 VISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 837
            + R D  D F         KDL  VLG + S VV++D+S   + +   N I +E +
Sbjct: 115 -LYRDDCLDVFGN-----YLKDLN-VLGRDLSKVVLVDNSPHAFGYQVNNGIPIETW 164


>gi|186529839|ref|NP_001119383.1| SCP1-like small phosphatase 4 [Arabidopsis thaliana]
 gi|332007998|gb|AED95381.1| SCP1-like small phosphatase 4 [Arabidopsis thaliana]
          Length = 456

 Score = 56.2 bits (134), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 44/146 (30%), Positives = 70/146 (47%), Gaps = 22/146 (15%)

Query: 680 RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPG 739
           + + LVLDLD TL++S             L      D     R  F      ++ + RP 
Sbjct: 282 KSVTLVLDLDETLVHST------------LESCNVADFS--FRVFFNMQENTVYVRQRPH 327

Query: 740 IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVP 799
           ++ FLER  +LF + ++T  + +YA+++  +LDP G   + R     D     DG     
Sbjct: 328 LYRFLERVGELFHVVIFTASHSIYASQLLDILDPDGKFISQRFYR--DSCILLDG----I 381

Query: 800 KSKDLEGVLGMESA-VVIIDDSVRVW 824
            +KDL  VLG++ A V IID+  +V+
Sbjct: 382 YTKDL-TVLGLDLAKVAIIDNCPQVY 406


>gi|241249809|ref|XP_002403164.1| RNA polymerase II ctd phosphatase, putative [Ixodes scapularis]
 gi|215496447|gb|EEC06087.1| RNA polymerase II ctd phosphatase, putative [Ixodes scapularis]
          Length = 185

 Score = 56.2 bits (134), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 36/104 (34%), Positives = 54/104 (51%), Gaps = 2/104 (1%)

Query: 892 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVV 951
           D+++++   +RK+L G  +VFS V P  +  P     WQTA   GA  +  +   VTH+V
Sbjct: 11  DLKHVVPYVRRKVLKGSHLVFSGVVPTNQ-EPEKSRAWQTARALGARVSSDLCPGVTHLV 69

Query: 952 ANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQDFAIK 994
           A   GT KVN A  T +  VV P W+   A  +   +E  F ++
Sbjct: 70  AARPGTAKVNRARRTRQLHVVSPAWLWCCAERWEHVHEALFPLE 113


>gi|294877772|ref|XP_002768119.1| hypothetical protein Pmar_PMAR002906 [Perkinsus marinus ATCC 50983]
 gi|239870316|gb|EER00837.1| hypothetical protein Pmar_PMAR002906 [Perkinsus marinus ATCC 50983]
          Length = 161

 Score = 56.2 bits (134), Expect = 9e-05,   Method: Composition-based stats.
 Identities = 45/152 (29%), Positives = 73/152 (48%), Gaps = 23/152 (15%)

Query: 692 LLNSAKFHEVDPVHDE----ILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFL-ER 746
           ++NS +  + DP   E    IL++  + +   P  +L     + + TK+RP    F+ E 
Sbjct: 1   MVNSYEIGKADPSQSESVTPILQEVYKDEEGLPELYLCVISDVKVLTKIRPHARAFIREL 60

Query: 747 ASKL---FEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKD 803
            SK      + +YT G++ Y   + K+LDP G L  GR++SR D     +     P  KD
Sbjct: 61  VSKTGCGVVLSIYTKGSRRYMEVIKKMLDPSGELIKGRLVSRED-----EPSNMTPLEKD 115

Query: 804 LEGVLGMESAV----------VIIDDSVRVWP 825
            + ++  +SAV          V++DDS  VWP
Sbjct: 116 PDFIINADSAVGTEELRRRWFVVLDDSPEVWP 147


>gi|22327621|ref|NP_199453.2| SCP1-like small phosphatase 4 [Arabidopsis thaliana]
 gi|18377616|gb|AAL66958.1| unknown protein [Arabidopsis thaliana]
 gi|20465765|gb|AAM20371.1| unknown protein [Arabidopsis thaliana]
 gi|332007997|gb|AED95380.1| SCP1-like small phosphatase 4 [Arabidopsis thaliana]
          Length = 453

 Score = 56.2 bits (134), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 44/146 (30%), Positives = 70/146 (47%), Gaps = 22/146 (15%)

Query: 680 RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPG 739
           + + LVLDLD TL++S             L      D     R  F      ++ + RP 
Sbjct: 279 KSVTLVLDLDETLVHST------------LESCNVADFS--FRVFFNMQENTVYVRQRPH 324

Query: 740 IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVP 799
           ++ FLER  +LF + ++T  + +YA+++  +LDP G   + R     D     DG     
Sbjct: 325 LYRFLERVGELFHVVIFTASHSIYASQLLDILDPDGKFISQRFYR--DSCILLDG----I 378

Query: 800 KSKDLEGVLGMESA-VVIIDDSVRVW 824
            +KDL  VLG++ A V IID+  +V+
Sbjct: 379 YTKDL-TVLGLDLAKVAIIDNCPQVY 403


>gi|293331055|ref|NP_001170732.1| uncharacterized protein LOC100384823 [Zea mays]
 gi|238007228|gb|ACR34649.1| unknown [Zea mays]
          Length = 254

 Score = 56.2 bits (134), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 37/114 (32%), Positives = 53/114 (46%), Gaps = 24/114 (21%)

Query: 674 KKMFSARKLCLVLDLDHTLLNSAKFHEVD-----PVHDEILRKKEEQDREKPHRHLFRFP 728
           K+  S   + LVLDLD TL++S   H  D     PVH                   F F 
Sbjct: 60  KQTRSCPTMTLVLDLDETLVHSTLEHCEDADFTFPVH-------------------FNFR 100

Query: 729 HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 782
              ++ + RP +  FL+R + +FE  ++T    +YA ++  VLDPK  LF  RV
Sbjct: 101 EHTIYVRCRPYLKEFLDRVASVFETIIFTASQSIYAEQLLNVLDPKRKLFRHRV 154


>gi|26449836|dbj|BAC42041.1| unknown protein [Arabidopsis thaliana]
          Length = 453

 Score = 56.2 bits (134), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 44/146 (30%), Positives = 70/146 (47%), Gaps = 22/146 (15%)

Query: 680 RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPG 739
           + + LVLDLD TL++S             L      D     R  F      ++ + RP 
Sbjct: 279 KSVTLVLDLDETLVHST------------LESCNVADFS--FRVFFNMQENTVYVRQRPH 324

Query: 740 IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVP 799
           ++ FLER  +LF + ++T  + +YA+++  +LDP G   + R     D     DG     
Sbjct: 325 LYRFLERVGELFHVVIFTASHSIYASQLLDILDPDGKFISQRFYR--DSCILLDG----I 378

Query: 800 KSKDLEGVLGMESA-VVIIDDSVRVW 824
            +KDL  VLG++ A V IID+  +V+
Sbjct: 379 YTKDL-TVLGLDLAKVAIIDNCPQVY 403


>gi|302806561|ref|XP_002985030.1| hypothetical protein SELMODRAFT_4374 [Selaginella moellendorffii]
 gi|300147240|gb|EFJ13905.1| hypothetical protein SELMODRAFT_4374 [Selaginella moellendorffii]
          Length = 177

 Score = 55.8 bits (133), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 43/141 (30%), Positives = 70/141 (49%), Gaps = 21/141 (14%)

Query: 681 KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 740
           K  L+LDLD TL+ +++   +    D ++   E    E+P           +W   RPG+
Sbjct: 11  KPTLILDLDGTLIATSRQASLHACFDFVV---EFDSEEQP-----------VWVSKRPGL 56

Query: 741 WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPK 800
             FL +AS+++E+ ++++G K Y  +M + +DP G LF    ++R    D   G   +  
Sbjct: 57  EDFLRQASEIYEVVVFSLGRKSYVEKMREAIDPSG-LFVATWLAR----DSCSGSSEIKD 111

Query: 801 SKDLEG-VLGME-SAVVIIDD 819
            KDL    LG E   VV +DD
Sbjct: 112 YKDLNSPKLGRELRKVVWVDD 132


>gi|389584495|dbj|GAB67227.1| hypothetical protein PCYB_112480 [Plasmodium cynomolgi strain B]
          Length = 1447

 Score = 55.8 bits (133), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 62/261 (23%), Positives = 111/261 (42%), Gaps = 35/261 (13%)

Query: 719  KPHRHLFRFPHMGM--WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 776
            +P  + F  P+     + K RP +  FL+  S  +E+ +YT   + YA  +  +LDP   
Sbjct: 1018 EPELYKFFLPYYNFFYYLKFRPYVRQFLQILSLYYELSIYTNATREYADVVIAILDPDRT 1077

Query: 777  LFAGRVISRGDDGDPFDGDERVPKSKDLEGVL-GMESAVVI-IDDSVRVW---PHNKLNL 831
            LFA R+++R    D         ++K+   +   ++S  +I  DD   VW   PH+  ++
Sbjct: 1078 LFADRIVARCSSADR-------EENKNFSKIYPNVDSKYIIAFDDRKDVWTDIPHS--HI 1128

Query: 832  IVVERYTYFPCSRRQF--GLLGPSLLE---IDHDERSEDGTLASSLGVIERLHKIFFSHQ 886
            +  E Y +F  S+         P+  +   +D D       L     V+ +LHK FF   
Sbjct: 1129 LKAEHYNFFELSKYDIISHFKEPTTCKKRFVDMDMH-----LHFMTKVLLKLHKHFFERP 1183

Query: 887  SLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHL------HPLWQTAEQFGA-VC 939
               +VDV  ++       L+   + F+      + + H+          + A + GA + 
Sbjct: 1184 L--EVDVGTLIENIMLSTLSNVGVYFTGFRKNSKNSQHVLSSDCEDRQKEIALELGAKIY 1241

Query: 940  TKHIDDQVTHVVANSLGTDKV 960
            T +    VTH++A    TD +
Sbjct: 1242 TNYDMPGVTHIIAAKNCTDNL 1262


>gi|156101293|ref|XP_001616340.1| hypothetical protein [Plasmodium vivax Sal-1]
 gi|148805214|gb|EDL46613.1| hypothetical protein, conserved [Plasmodium vivax]
          Length = 1544

 Score = 55.8 bits (133), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 64/261 (24%), Positives = 110/261 (42%), Gaps = 35/261 (13%)

Query: 719  KPHRHLFRFPHMGM--WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 776
            +P  + F  P+     + K RP +  FL+  S  +E+ +YT   + YA  +  +LDP   
Sbjct: 1108 EPELYKFFLPYYNFFYYLKFRPYVRQFLQILSLYYELSIYTNATREYADVVIAILDPDRT 1167

Query: 777  LFAGRVISRGDDGDPFDGDERVPKSKDLEGVL-GMESAVVI-IDDSVRVW---PHNKLNL 831
            LFA R+++R    D         ++K+   +   ++S  VI  DD   VW   PH+  ++
Sbjct: 1168 LFADRIVARCSSAD-------REENKNFSKIYPNVDSKYVIAFDDRKDVWTDIPHS--HI 1218

Query: 832  IVVERYTYFPCSRRQF--GLLGPSLLE---IDHDERSEDGTLASSLGVIERLHKIFFSHQ 886
            +  E Y +F  S+         PS  +   +D D       L     V+ +LHK FF   
Sbjct: 1219 LKAEHYNFFELSKYDIISHFKEPSTCKKRFVDMDMH-----LHFMTKVLLKLHKQFFERP 1273

Query: 887  SLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHL------HPLWQTAEQFGA-VC 939
               +VDV  ++       L    + F+      + + H+          + A + GA + 
Sbjct: 1274 L--EVDVGRLIDDIMLSTLCNVGVYFTGFRKNSKNSQHVLSSDCEDRQKEIALELGAKIY 1331

Query: 940  TKHIDDQVTHVVANSLGTDKV 960
            T +    VTH++A    TD +
Sbjct: 1332 TNYDMPGVTHIIAAKNCTDNL 1352


>gi|47230493|emb|CAF99686.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 2418

 Score = 55.8 bits (133), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 52/176 (29%), Positives = 84/176 (47%), Gaps = 22/176 (12%)

Query: 663 QKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR 722
           +++ TR+     K  S  +  LVLDLD TL           VH  +    E +D      
Sbjct: 305 EEQLTRKPALPLKTRSTPEFSLVLDLDETL-----------VHCSL---NELEDAALTFP 350

Query: 723 HLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 782
            LF+     ++ +LRP    FLER S+++E+ L+T   K+YA ++  +LDPK  L   R+
Sbjct: 351 VLFQDVIYQVYVRLRPFFREFLERMSQIYEIILFTASKKVYADKLLNILDPKKQLVRHRL 410

Query: 783 ISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 837
               +      G+      KDL  +LG + S  +IID+S + + +   N I +E +
Sbjct: 411 FR--EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 459


>gi|115456605|ref|NP_001051903.1| Os03g0850100 [Oryza sativa Japonica Group]
 gi|27573336|gb|AAO20054.1| putative NLI interacting factor [Oryza sativa Japonica Group]
 gi|28269415|gb|AAO37958.1| putative NLI-interacting factor [Oryza sativa Japonica Group]
 gi|108712119|gb|ABF99914.1| NLI interacting factor, putative, expressed [Oryza sativa Japonica
           Group]
 gi|113550374|dbj|BAF13817.1| Os03g0850100 [Oryza sativa Japonica Group]
 gi|125588650|gb|EAZ29314.1| hypothetical protein OsJ_13375 [Oryza sativa Japonica Group]
          Length = 444

 Score = 55.5 bits (132), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 49/168 (29%), Positives = 77/168 (45%), Gaps = 28/168 (16%)

Query: 674 KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMW 733
           K+  S  +  LVLDLD TL++S             L   E+ D   P    F      ++
Sbjct: 252 KQTRSCPRTTLVLDLDETLVHST------------LEPCEDSDFTFPVH--FNLREHTIY 297

Query: 734 TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD---DGD 790
            + RP +  FLE  + +FE+ ++T    +YA ++  +LDPK  LF  RV        +G+
Sbjct: 298 VRCRPYLKEFLETVASMFEIIIFTASQSIYAEQLLNILDPKRRLFRHRVYRESCLFVEGN 357

Query: 791 PFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERY 837
                      KDL  VLG + A VVI+D+S + +     N + +E +
Sbjct: 358 YL---------KDL-SVLGRDLARVVIVDNSPQAFGFQLDNGVPIESW 395


>gi|218194116|gb|EEC76543.1| hypothetical protein OsI_14336 [Oryza sativa Indica Group]
          Length = 444

 Score = 55.5 bits (132), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 49/168 (29%), Positives = 77/168 (45%), Gaps = 28/168 (16%)

Query: 674 KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMW 733
           K+  S  +  LVLDLD TL++S             L   E+ D   P    F      ++
Sbjct: 252 KQTRSCPRTTLVLDLDETLVHST------------LEPCEDSDFTFPVH--FNLREHTIY 297

Query: 734 TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD---DGD 790
            + RP +  FLE  + +FE+ ++T    +YA ++  +LDPK  LF  RV        +G+
Sbjct: 298 VRCRPYLKEFLETVASMFEIIIFTASQSIYAEQLLNILDPKRRLFRHRVYRESCLFVEGN 357

Query: 791 PFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERY 837
                      KDL  VLG + A VVI+D+S + +     N + +E +
Sbjct: 358 YL---------KDL-SVLGRDLARVVIVDNSPQAFGFQLDNGVPIESW 395


>gi|387015310|gb|AFJ49774.1| CTD small phosphatase [Crotalus adamanteus]
          Length = 466

 Score = 55.5 bits (132), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 54/211 (25%), Positives = 97/211 (45%), Gaps = 24/211 (11%)

Query: 630 GPEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQKERTRRLEEQKKMFSARKLCLVLD 687
            P++G   AH ++ + +   +F+ Y   +      +++  R+     K  S  +  LVLD
Sbjct: 234 SPDSGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEEQLNRKPALPLKTRSTPEFSLVLD 293

Query: 688 LDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERA 747
           LD TL++ +                E +D       LF+     ++ +LRP    FLE  
Sbjct: 294 LDETLVHCSL--------------NELEDAALTFPVLFQDVIYQVYVRLRPFFREFLECM 339

Query: 748 SKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGV 807
           S+++E+ L+T   K+YA ++  +LDPK  L   R+    +      G+      KDL  +
Sbjct: 340 SQIYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR--EHCVCVQGN----YIKDL-NI 392

Query: 808 LGME-SAVVIIDDSVRVWPHNKLNLIVVERY 837
           LG + S  +IID+S + + +   N I +E +
Sbjct: 393 LGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 423


>gi|299472381|emb|CBN77569.1| putative nuclear LIM interactor-interacting protein [Ectocarpus
           siliculosus]
          Length = 602

 Score = 55.1 bits (131), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 49/169 (28%), Positives = 80/169 (47%), Gaps = 22/169 (13%)

Query: 670 LEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPH 729
           L E++K    ++L LVLDLD TL++      V+P H           R + H   F    
Sbjct: 351 LPEKRKTRHGKELTLVLDLDETLVHCTVDPIVNPDH-----------RFEVH---FNGEE 396

Query: 730 MGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 789
             ++ + RP +  FLE  S+LFE+ ++T   ++YA  +  ++DP+      R+    D  
Sbjct: 397 FQVYVRKRPHLDAFLEAVSELFEVVVFTASQQVYAERLLNMIDPQKKFVKYRLYR--DAC 454

Query: 790 DPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 837
              +G+      KDL  VLG + S V I+D+S   +     N I +E +
Sbjct: 455 MALEGNYL----KDLN-VLGRDLSKVAIVDNSPYAYGFQIDNGIPIESW 498


>gi|353230275|emb|CCD76446.1| nuclear lim interactor-interacting factor-related [Schistosoma
           mansoni]
          Length = 429

 Score = 55.1 bits (131), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 45/166 (27%), Positives = 81/166 (48%), Gaps = 30/166 (18%)

Query: 674 KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGM- 732
           KK  S+ + CLVLDLD TL++ +    ++P+ D               + +F+    G+ 
Sbjct: 285 KKTRSSPEFCLVLDLDETLVHCS----LNPLLDA--------------QFIFQVVFQGVV 326

Query: 733 ---WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 789
              + ++RP ++ FL   S+ FE+ L+T   K+YA  +  ++DPK      R+    +  
Sbjct: 327 YMVYVRIRPHLYEFLTNVSEHFEVVLFTASTKVYADRLVNLIDPKKKWIKHRLFR--EHC 384

Query: 790 DPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVV 834
              +G+      KDL  VLG +    VIID+S + + +    L+++
Sbjct: 385 VCVNGN----YVKDLR-VLGRDLRKTVIIDNSPQAFGYQVFGLLLL 425


>gi|356510404|ref|XP_003523928.1| PREDICTED: uncharacterized protein LOC100810756 [Glycine max]
          Length = 469

 Score = 54.7 bits (130), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 47/146 (32%), Positives = 73/146 (50%), Gaps = 22/146 (15%)

Query: 680 RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPG 739
           +K+ LVLDLD TL++S+   + D   D     K   DRE           + ++ + RP 
Sbjct: 295 KKVTLVLDLDETLIHSS-MGQCDGAAD--FTFKMITDRE-----------LTVYVRKRPF 340

Query: 740 IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVP 799
           +  FL + S++FE+ ++T   ++YA  +  VLDP    F+ RV            D R  
Sbjct: 341 LQEFLVKVSEMFEIIIFTASKRMYAETLLDVLDPDKKFFSRRVYRESCTW----KDRRCV 396

Query: 800 KSKDLEGVLGMESA-VVIIDDSVRVW 824
             KDL  VLG++ A V IID++  V+
Sbjct: 397 --KDL-TVLGIDLAKVCIIDNTPEVF 419


>gi|125526935|gb|EAY75049.1| hypothetical protein OsI_02945 [Oryza sativa Indica Group]
          Length = 577

 Score = 54.7 bits (130), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 44/153 (28%), Positives = 78/153 (50%), Gaps = 34/153 (22%)

Query: 679 ARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRP 738
           ++++ LVLDLD TL++S   H  D V D  L+              F   +  ++ + RP
Sbjct: 399 SKQITLVLDLDETLVHSTLDH-CDNV-DFTLQV------------FFNMKNHTVYVRQRP 444

Query: 739 GIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGR------VISRGDDGDPF 792
            +  FLE+ +++FE+ ++T   ++YA ++   LDP G L + R      + S G      
Sbjct: 445 HLKMFLEKVAQMFELVIFTASQRIYAEQLIDRLDPDGRLISHRIYRESCIFSEG------ 498

Query: 793 DGDERVPKSKDLEGVLGMESA-VVIIDDSVRVW 824
                   +KDL  +LG++ A VVI+D++ +V+
Sbjct: 499 ------CYTKDLT-ILGVDLAKVVIVDNTPQVF 524


>gi|149490347|ref|XP_001511004.1| PREDICTED: CTD small phosphatase-like protein 2-like
           [Ornithorhynchus anatinus]
          Length = 374

 Score = 54.3 bits (129), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 40/149 (26%), Positives = 73/149 (48%), Gaps = 16/149 (10%)

Query: 631 PEAGPVGAHPQSAWGDVEHLFEGYDDQQKA--AIQKERTRRLEEQKKMFSARKLCLVLDL 688
           P++G   AH ++ + +   +F+ Y   +      +++  R+     K  S  +  LVLDL
Sbjct: 238 PDSGYSSAHAEATYEEDWEVFDPYYFIKHVPPLTEEQLNRKPALPLKTRSTPEFSLVLDL 297

Query: 689 DHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERAS 748
           D TL++ +    ++ + D  L              LF+     ++ +LRP    FLER S
Sbjct: 298 DETLVHCS----LNELEDAALT----------FPVLFQDVIYQVYVRLRPFFREFLERMS 343

Query: 749 KLFEMHLYTMGNKLYATEMAKVLDPKGVL 777
           +++E+ L+T   K+YA ++  +LDPK  L
Sbjct: 344 QIYEIILFTASKKVYADKLLNILDPKKQL 372


>gi|357130565|ref|XP_003566918.1| PREDICTED: uncharacterized protein LOC100830008 [Brachypodium
           distachyon]
          Length = 510

 Score = 54.3 bits (129), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 44/156 (28%), Positives = 74/156 (47%), Gaps = 28/156 (17%)

Query: 673 QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGM 732
           QK     + + LVLDLD TL++S   H    + D  ++              F      +
Sbjct: 326 QKSPVRTKHVTLVLDLDETLVHSTLDHC--DIADFTIQV------------FFNMKDHTV 371

Query: 733 WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG---DDG 789
           + + RP +  FLE+ +++FE+ ++T   K+YA ++   LDP G L + R+        DG
Sbjct: 372 YVRQRPHLKMFLEKVAQMFELVIFTASQKIYAEQIIDRLDPDGKLISQRIYRESCIFSDG 431

Query: 790 DPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVW 824
                      +KDL  +LG+  A V IID++ +V+
Sbjct: 432 ---------SYTKDL-TILGVHLAKVAIIDNTPQVF 457


>gi|256083671|ref|XP_002578064.1| nuclear lim interactor-interacting factor-related [Schistosoma
           mansoni]
          Length = 441

 Score = 53.9 bits (128), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 47/165 (28%), Positives = 80/165 (48%), Gaps = 32/165 (19%)

Query: 674 KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGM- 732
           KK  S+ + CLVLDLD TL++ +    ++P+ D               + +F+    G+ 
Sbjct: 285 KKTRSSPEFCLVLDLDETLVHCS----LNPLLDA--------------QFIFQVVFQGVV 326

Query: 733 ---WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 789
              + ++RP ++ FL   S+ FE+ L+T   K+YA  +  ++DPK      R+    +  
Sbjct: 327 YMVYVRIRPHLYEFLTNVSEHFEVVLFTASTKVYADRLVNLIDPKKKWIKHRLFR--EHC 384

Query: 790 DPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRV--WPHNKLNL 831
              +G+      KDL  VLG +    VIID+S +   + HN+  L
Sbjct: 385 VCVNGN----YVKDLR-VLGRDLRKTVIIDNSPQAFGYQHNERKL 424


>gi|297799336|ref|XP_002867552.1| hypothetical protein ARALYDRAFT_913891 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297313388|gb|EFH43811.1| hypothetical protein ARALYDRAFT_913891 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 113

 Score = 53.9 bits (128), Expect = 5e-04,   Method: Composition-based stats.
 Identities = 29/84 (34%), Positives = 47/84 (55%), Gaps = 3/84 (3%)

Query: 811 ESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLAS 870
           E  V+I+DD+V +WPH+K NL+ + +Y YF  +         S  E+  DE   +G+LA+
Sbjct: 21  ELRVIIVDDTVDIWPHDKRNLLQITKYIYFSVA-VSIDKRWRSYAEVKRDESLSNGSLAN 79

Query: 871 SLGVIERLHKIFFSHQSLDDVDVR 894
            L  +  +HK +   + LD  D+R
Sbjct: 80  VLKFLVYVHKRY--EKKLDSKDLR 101


>gi|325180168|emb|CCA14570.1| nuclear LIM factor interactorinteracting protein hyphal form
           putative [Albugo laibachii Nc14]
          Length = 418

 Score = 53.5 bits (127), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 48/161 (29%), Positives = 78/161 (48%), Gaps = 28/161 (17%)

Query: 681 KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHL---FRFPHMGMWTKLR 737
           K+CLVLDLD TL           VH  +      ++ E P+      F   +  +   LR
Sbjct: 235 KICLVLDLDETL-----------VHCSV------EEIENPNFQFDVFFNGTNYNVNVSLR 277

Query: 738 PGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDER 797
           P +  FL+R +K FE+ ++T   ++YA ++  +LDP   L   R+    +D    DG+  
Sbjct: 278 PHMHHFLKRVTKQFELVVFTASQRVYAEKLLNLLDPNRDLIKYRLYR--EDCLEVDGN-- 333

Query: 798 VPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERY 837
               KDL  VLG + A V+++D+S   + +   N I +E +
Sbjct: 334 --FLKDL-NVLGRDLARVILVDNSPHAFGYQVNNGIPIESW 371


>gi|326513088|dbj|BAK06784.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 445

 Score = 53.5 bits (127), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 45/164 (27%), Positives = 79/164 (48%), Gaps = 30/164 (18%)

Query: 679 ARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRP 738
           ARK+ LVLDLD TL++S             L   ++ D   P    F      ++ + RP
Sbjct: 272 ARKVTLVLDLDETLVHST------------LEHCDDADFSFPVS--FGLKEHVVYVRKRP 317

Query: 739 GIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG----DDGDPFDG 794
            +  FL++ +++F++ ++T    +YA ++   LDP+  LF+ R         + G     
Sbjct: 318 HLHMFLQKMAEMFDVVIFTASQSVYADQLLDRLDPENTLFSKRFFRESCVFTESG----- 372

Query: 795 DERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERY 837
                 +KDL  V+G++ A V IID++ +V+     N I +E +
Sbjct: 373 -----YTKDLT-VIGVDLAKVAIIDNTPQVFQLQVNNGIPIESW 410


>gi|449678335|ref|XP_002165480.2| PREDICTED: CTD small phosphatase-like protein 2-like [Hydra
           magnipapillata]
          Length = 421

 Score = 53.5 bits (127), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 32/103 (31%), Positives = 53/103 (51%), Gaps = 14/103 (13%)

Query: 681 KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 740
           ++ LVLDLD TL++ +            L K E  +       +F      ++ KLRP +
Sbjct: 243 QMTLVLDLDETLVHCS------------LSKLEAYNM--TFNVVFDNVTYQLFVKLRPHL 288

Query: 741 WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVI 783
             FLER SKL+E+ L+T   ++YA ++  ++DP+   F  R+ 
Sbjct: 289 LEFLERVSKLYEVILFTASRRVYADKLLNIIDPRRQFFRHRLF 331


>gi|37538060|gb|AAQ92971.1| CTD-phosphatase-like protein [Hordeum vulgare subsp. vulgare]
 gi|37538062|gb|AAQ92972.1| CTD-phosphatase-like protein [Hordeum vulgare subsp. vulgare]
          Length = 445

 Score = 53.5 bits (127), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 45/164 (27%), Positives = 79/164 (48%), Gaps = 30/164 (18%)

Query: 679 ARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRP 738
           ARK+ LVLDLD TL++S             L   ++ D   P    F      ++ + RP
Sbjct: 272 ARKVTLVLDLDETLVHST------------LEHCDDADFSFPVS--FGLKEHVVYVRKRP 317

Query: 739 GIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG----DDGDPFDG 794
            +  FL++ +++F++ ++T    +YA ++   LDP+  LF+ R         + G     
Sbjct: 318 HLHMFLQKMAEMFDVVIFTASQSVYADQLLDRLDPENTLFSKRFFRESCVFTESG----- 372

Query: 795 DERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERY 837
                 +KDL  V+G++ A V IID++ +V+     N I +E +
Sbjct: 373 -----YTKDLT-VIGVDLAKVAIIDNTPQVFQLQVNNGIPIESW 410


>gi|123496394|ref|XP_001326959.1| NLI interacting factor-like phosphatase family protein [Trichomonas
           vaginalis G3]
 gi|121909881|gb|EAY14736.1| NLI interacting factor-like phosphatase family protein [Trichomonas
           vaginalis G3]
          Length = 397

 Score = 53.5 bits (127), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 65/271 (23%), Positives = 109/271 (40%), Gaps = 42/271 (15%)

Query: 735 KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDG 794
           +LRPGI  FL+  ++ F++ L +  ++    E+ K LDP    F  R I R  D D F  
Sbjct: 102 RLRPGIDNFLKSITEHFKIVLISGLDQYIINEVVKRLDPMHTYFGNR-IYRDQDFD-FKN 159

Query: 795 DER----VPKSKDLEGVLGMESAVVIIDDSVRVWPHNK----LNLIVVERYTYFPCSRRQ 846
           DE      P S DL          VI+D S   W ++        + V  Y YF  S   
Sbjct: 160 DENNYYLFPPSSDL---------TVILDTSRDHWKNDNGFTFSGFVFVCPYNYFQPS--- 207

Query: 847 FGLLGPSLLEIDHD-----ERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQ 901
                 +++ +  D         D  L      +E++H +F+  + +D + +   +   Q
Sbjct: 208 ------TIINVSEDIALPLVSKLDSVLFGLSRYLEQIH-LFYYVKKVDSIVLS--IGFAQ 258

Query: 902 RKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVN 961
             IL GC I    + P    N  +        +FG       D   TH+V +S  +  + 
Sbjct: 259 HPILKGCYICIPDLSPEEAKNFDI-----LIARFGGKFMPSYDSAATHLVISSKDSPAIA 313

Query: 962 WALS-TGRFVVHPGWVEASALLYRRANEQDF 991
            A    G ++   GW+  +   Y + +E+++
Sbjct: 314 QASQYNGVYITTIGWIVDTFTHYEKKDERNY 344


>gi|302775067|ref|XP_002970950.1| hypothetical protein SELMODRAFT_4536 [Selaginella moellendorffii]
 gi|300161661|gb|EFJ28276.1| hypothetical protein SELMODRAFT_4536 [Selaginella moellendorffii]
          Length = 177

 Score = 53.1 bits (126), Expect = 7e-04,   Method: Composition-based stats.
 Identities = 42/141 (29%), Positives = 70/141 (49%), Gaps = 21/141 (14%)

Query: 681 KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 740
           K  L+LDLD TL+ +++  ++    D ++   E    E+P           +W   RPG+
Sbjct: 11  KPTLILDLDGTLIATSRQAKLHACFDFVV---EFDSEEQP-----------VWVSKRPGL 56

Query: 741 WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPK 800
             FL +AS+++E+ ++++G K Y  +M + +DP G  F    ++R    D   G   +  
Sbjct: 57  DDFLRQASEIYEVVVFSLGRKSYVEKMREAIDPSGS-FVATWLAR----DSCSGSSEIKD 111

Query: 801 SKDLEG-VLGME-SAVVIIDD 819
            KDL    LG E   VV +DD
Sbjct: 112 YKDLNSPKLGRELRKVVWVDD 132


>gi|224116766|ref|XP_002331872.1| predicted protein [Populus trichocarpa]
 gi|222875390|gb|EEF12521.1| predicted protein [Populus trichocarpa]
          Length = 502

 Score = 53.1 bits (126), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 49/173 (28%), Positives = 75/173 (43%), Gaps = 35/173 (20%)

Query: 664 KERTRRLEEQKKMFSARKLCLVLDLDHTL--------LNSAKFHEVDPVHDEILRKKEEQ 715
           KE  RR          + + LVLDLD           L  A       VH  +   +   
Sbjct: 303 KESCRR----------KSVTLVLDLDELCPMYNTKVELQMAFLFSETLVHSTL---EHCD 349

Query: 716 DREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKG 775
           D +      F      ++ K RP + TFLER +++FE+ ++T    +YA ++  +LDP  
Sbjct: 350 DADFTFTVFFNMKEHIVYVKQRPHLHTFLERVAEMFEVVIFTASQSIYAAQLLDILDPDR 409

Query: 776 VLFAGRVISRG---DDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVW 824
            L + R+        DG           +KDL  VLG++ A V IID+S +V+
Sbjct: 410 KLISQRLYRESCIFSDG---------SYTKDL-TVLGVDLAKVAIIDNSPQVF 452


>gi|55740279|gb|AAV63941.1| putative nuclear LIM factor interactor-interacting protein hyphal
           form [Phytophthora infestans]
          Length = 237

 Score = 53.1 bits (126), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 33/105 (31%), Positives = 57/105 (54%), Gaps = 2/105 (1%)

Query: 679 ARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRP 738
           + ++ LVLD+D  L++S   +EV+    E  R ++ ++       +       +  K RP
Sbjct: 38  SERIALVLDMDECLVHSKFQNEVEYRQSEY-RPEQLEEYSDSFEIVMDDGERAIVNK-RP 95

Query: 739 GIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVI 783
           G+  FLE A+K ++++++T G + Y   +   LDPKG LFAGR  
Sbjct: 96  GLDRFLEEAAKHYDVYVFTAGLEAYGKPILDALDPKGNLFAGRFF 140


>gi|297843870|ref|XP_002889816.1| hypothetical protein ARALYDRAFT_888325 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297335658|gb|EFH66075.1| hypothetical protein ARALYDRAFT_888325 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 100

 Score = 53.1 bits (126), Expect = 7e-04,   Method: Composition-based stats.
 Identities = 42/116 (36%), Positives = 58/116 (50%), Gaps = 23/116 (19%)

Query: 769 KVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNK 828
           K+LDPKG  F+ R+ISR       DG  R  KS D   V+G E AV+ +D+S  VW    
Sbjct: 7   KLLDPKGKYFSDRIISRD------DGTVRHKKSLD---VMGNEEAVLFVDESKIVWQ--- 54

Query: 829 LNLIVVERY-TYFPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFF 883
                 ++Y  +F  S +QF      L     DE   DG L++ L V+++ H I F
Sbjct: 55  ------KKYGEFFASSCKQFKEDSKLL----PDESESDGALSTVLNVLKQTHGILF 100


>gi|384246936|gb|EIE20424.1| hypothetical protein COCSUDRAFT_67358 [Coccomyxa subellipsoidea
           C-169]
          Length = 676

 Score = 53.1 bits (126), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 36/102 (35%), Positives = 49/102 (48%), Gaps = 17/102 (16%)

Query: 684 LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKL--RPGIW 741
           LVLDLDHTL+ S  F+   P  D               R +F     G  T    RP + 
Sbjct: 108 LVLDLDHTLIRSTLFNPHKPAKDS--------------REVF-VTGDGARTAFERRPHLT 152

Query: 742 TFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVI 783
            FLE  S LFE+ ++T G++ YA  +  +LDP+  LF  R+ 
Sbjct: 153 HFLESVSTLFEIVVFTAGSQSYAGPLLDILDPERRLFEHRLF 194


>gi|225681687|gb|EEH19971.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb03]
          Length = 869

 Score = 53.1 bits (126), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 57/201 (28%), Positives = 84/201 (41%), Gaps = 60/201 (29%)

Query: 656 DQQKAAIQKERTRRLEEQKKMFSARKLCL--VLDLDHTLLNSAKFHEVDPVHDEILRKKE 713
           D     + K    R+EE  K        L  V+DLD T++++     VDP   E      
Sbjct: 132 DNSSLTVSKSEATRVEEDAKRRLLSSRRLSLVVDLDQTIIHAT----VDPTVAEW----- 182

Query: 714 EQDREKPHRHLFR----------FPHM-GMW--TKLRPGIWTFLERASKLFEMHLYTMGN 760
           +QDR+ P+    +           P M G W   KLRPG+  FL+  S L+E+H+YTMG 
Sbjct: 183 QQDRDNPNHEAVKDVRAFQLVDDGPGMKGCWYYIKLRPGLQEFLQEISALYELHIYTMGT 242

Query: 761 KLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDD 819
           +                 AG +                  +K+L+ +  +++  VVIIDD
Sbjct: 243 R-----------------AGSLT-----------------AKNLQRLFPVDTKMVVIIDD 268

Query: 820 SVRVWPHNKLNLIVVERYTYF 840
              VW  +  NLI V  Y +F
Sbjct: 269 RGDVWKWSD-NLIKVSPYDFF 288


>gi|125571265|gb|EAZ12780.1| hypothetical protein OsJ_02697 [Oryza sativa Japonica Group]
          Length = 576

 Score = 53.1 bits (126), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 43/153 (28%), Positives = 78/153 (50%), Gaps = 34/153 (22%)

Query: 679 ARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRP 738
           ++++ LVLDLD TL++S   H  D V D  L+              F   +  ++ + RP
Sbjct: 398 SKQITLVLDLDETLVHSTLDH-CDNV-DFTLQV------------FFNMKNHTVYVRQRP 443

Query: 739 GIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGR------VISRGDDGDPF 792
            +  FLE+ +++F++ ++T   ++YA ++   LDP G L + R      + S G      
Sbjct: 444 HLKMFLEKVAQMFDLVIFTASQRIYAEQLIDRLDPDGRLISHRIYRESCIFSEG------ 497

Query: 793 DGDERVPKSKDLEGVLGMESA-VVIIDDSVRVW 824
                   +KDL  +LG++ A VVI+D++ +V+
Sbjct: 498 ------CYTKDLT-ILGVDLAKVVIVDNTPQVF 523


>gi|224072608|ref|XP_002303804.1| predicted protein [Populus trichocarpa]
 gi|222841236|gb|EEE78783.1| predicted protein [Populus trichocarpa]
          Length = 244

 Score = 53.1 bits (126), Expect = 8e-04,   Method: Composition-based stats.
 Identities = 53/168 (31%), Positives = 80/168 (47%), Gaps = 28/168 (16%)

Query: 674 KKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMW 733
           K+  S     LVLDLD TL++SA    ++P +D         D   P    F      ++
Sbjct: 44  KQTRSCPPTTLVLDLDETLVHSA----LEPCND--------ADFTFPVN--FNLQEHTVF 89

Query: 734 TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG---DDGD 790
            + RP +  F+ER S LFE+ ++T    +YA ++  VLDPK  +F  RV        +G+
Sbjct: 90  VRCRPYLRDFMERVSSLFEIIIFTASQSIYAEQLLNVLDPKRRIFRHRVFRESCVFVEGN 149

Query: 791 PFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERY 837
                      KDL  VLG + A V+IID+S + +     N I +E +
Sbjct: 150 YL---------KDLS-VLGRDLARVIIIDNSPQAFGFQVDNGIPIESW 187


>gi|301118476|ref|XP_002906966.1| CTD small phosphatase-like protein, putative [Phytophthora
           infestans T30-4]
 gi|301126789|ref|XP_002909873.1| CTD small phosphatase-like protein, putative [Phytophthora
           infestans T30-4]
 gi|262101427|gb|EEY59479.1| CTD small phosphatase-like protein, putative [Phytophthora
           infestans T30-4]
 gi|262108315|gb|EEY66367.1| CTD small phosphatase-like protein, putative [Phytophthora
           infestans T30-4]
          Length = 237

 Score = 53.1 bits (126), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 33/103 (32%), Positives = 56/103 (54%), Gaps = 2/103 (1%)

Query: 681 KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 740
           ++ LVLD+D  L++S   +EV+    E  R ++ ++       +       +  K RPG+
Sbjct: 40  RIALVLDMDECLVHSKFQNEVEYRQSEY-RPEQLEEYSDSFEIVMDDGERAIVNK-RPGL 97

Query: 741 WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVI 783
             FLE A+K ++++++T G + Y   +   LDPKG LFAGR  
Sbjct: 98  DRFLEEAAKHYDVYVFTAGLEAYGKPILDALDPKGNLFAGRFF 140


>gi|302847022|ref|XP_002955046.1| hypothetical protein VOLCADRAFT_121370 [Volvox carteri f.
           nagariensis]
 gi|300259574|gb|EFJ43800.1| hypothetical protein VOLCADRAFT_121370 [Volvox carteri f.
           nagariensis]
          Length = 1180

 Score = 53.1 bits (126), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 65/230 (28%), Positives = 100/230 (43%), Gaps = 53/230 (23%)

Query: 680 RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPG 739
           +++ LVLDLD TL+ S       PV  +    +E            RF    +W  LRPG
Sbjct: 564 QRMTLVLDLDGTLIASEDEPHA-PVPFDYCVDEE------------RF----VW--LRPG 604

Query: 740 IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVP 799
           +  FL+     FE+ L+T   + +AT   + +DP GV+F  R+       D     +  P
Sbjct: 605 LRRFLDSVRPHFEVVLFTAAGESWATSALQRIDPDGVIFDSRLYR-----DHTVSHDDWP 659

Query: 800 KSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEID 858
             KDL   LG + A VVI+DD+  ++ +   N + V  Y              P L    
Sbjct: 660 WVKDLS-RLGRDLARVVIVDDNPLMFMYQPDNALHVAAYD-------------PQL--TG 703

Query: 859 HDERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQRKILAGC 908
           H+    D  L  +L V+  +HK+  ++      DVR +L + +  I A C
Sbjct: 704 HN----DDVLEQALDVL--MHKVLIAN------DVREVLRSIKEPITASC 741


>gi|298705179|emb|CBJ28610.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 482

 Score = 53.1 bits (126), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 41/119 (34%), Positives = 61/119 (51%), Gaps = 15/119 (12%)

Query: 659 KAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDRE 718
           K  + K+R   L +Q   F+ RK  L+LDLD TL++S+ F  V P  D I+    +    
Sbjct: 281 KGGLGKKRRSLLPKQLPEFAGRK-QLILDLDETLVHSS-FKPV-PGADFIMDIMVDGTFY 337

Query: 719 KPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 777
           K            ++   RPG+  FLER +KL+E+ ++T     YA  +  VLDPKG +
Sbjct: 338 K------------VFVLKRPGVDAFLERVAKLYEVIIFTASLPQYANPLLDVLDPKGTI 384


>gi|148909957|gb|ABR18063.1| unknown [Picea sitchensis]
          Length = 517

 Score = 52.8 bits (125), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 32/106 (30%), Positives = 49/106 (46%), Gaps = 24/106 (22%)

Query: 682 LCLVLDLDHTLLNSAKFHEVD-----PVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKL 736
           + LVLDLD TL++S   H  D     PVH                   F      ++ + 
Sbjct: 320 ITLVLDLDETLVHSTLEHCDDADFTFPVH-------------------FNLKEHTVYVRC 360

Query: 737 RPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 782
           RP +  F++R + +FE+ ++T    +YA ++  VLDPK  L   RV
Sbjct: 361 RPHLQLFMDRVADMFEIIVFTASQSVYAEQLLNVLDPKRKLIRHRV 406


>gi|123454430|ref|XP_001314970.1| NLI interacting factor-like phosphatase family protein [Trichomonas
           vaginalis G3]
 gi|121897632|gb|EAY02747.1| NLI interacting factor-like phosphatase family protein [Trichomonas
           vaginalis G3]
          Length = 218

 Score = 52.8 bits (125), Expect = 0.001,   Method: Composition-based stats.
 Identities = 42/158 (26%), Positives = 74/158 (46%), Gaps = 25/158 (15%)

Query: 684 LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTF 743
           LVLDLD TL++++ F    P H ++                 +F     +  LRP +  F
Sbjct: 44  LVLDLDETLVHTSTF----PPHSDV--------------EALKFDDTNEYVFLRPNVKKF 85

Query: 744 LERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKD 803
           LER S+LFE+ ++T G ++YA  +     P+      ++     D   F G++     KD
Sbjct: 86  LERVSELFEVFIFTAGTQIYAERILDSFCPQ----IDQMHRFYRDSCKFSGNK---CKKD 138

Query: 804 LEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFP 841
           L       + VV++DD+ ++  +   N I ++R++  P
Sbjct: 139 LNKFGRPLTKVVMVDDNYQMRSYYPQNTIYIDRWSGTP 176


>gi|242009525|ref|XP_002425534.1| conserved hypothetical protein [Pediculus humanus corporis]
 gi|212509409|gb|EEB12796.1| conserved hypothetical protein [Pediculus humanus corporis]
          Length = 834

 Score = 52.8 bits (125), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 48/155 (30%), Positives = 74/155 (47%), Gaps = 22/155 (14%)

Query: 684 LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTF 743
           LVLDLD TL           VH  +   +E QD       LF+     ++ + RP    F
Sbjct: 670 LVLDLDETL-----------VHCSL---QELQDASFTFPVLFQDCAYTVFVRTRPYFREF 715

Query: 744 LERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKD 803
           LER S LFE+ L+T   ++YA ++  +LDPK      R+    +     +G+      KD
Sbjct: 716 LERVSSLFEVILFTASKRVYADKLMNLLDPKKRWIKYRLFR--EHCVCVNGN----YIKD 769

Query: 804 LEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 837
           L  +LG + S  +IID+S + + +   N I +E +
Sbjct: 770 L-TILGRDLSKTIIIDNSPQAFGYQLENGIPIESW 803


>gi|410921774|ref|XP_003974358.1| PREDICTED: CTD small phosphatase-like protein 2-B-like [Takifugu
           rubripes]
          Length = 381

 Score = 52.4 bits (124), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 46/172 (26%), Positives = 84/172 (48%), Gaps = 23/172 (13%)

Query: 684 LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTF 743
           LV+DL+ TL+ S+  + +D             + E      F+     ++ KLRP +  F
Sbjct: 205 LVVDLEETLMFSS-LNVID-------------EAEYTFDTTFQDHQYKVYMKLRPHVKEF 250

Query: 744 LERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKD 803
           L+  +K +E+ +YT   + YA ++  +LDP+  +F  R+    +D     G       KD
Sbjct: 251 LQSVAKNYELFVYTCAKREYAEKILNILDPQRKVFRHRLYQ--EDCICVLGH----YIKD 304

Query: 804 LEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSL 854
           L  +LG + +  V++D+    +P++ LN I ++ +T  P   R+   L P+L
Sbjct: 305 L-SILGRDLTKTVVLDNMPHTYPYHLLNTIPIKSWTGEP-EDRELQKLVPTL 354


>gi|414881093|tpg|DAA58224.1| TPA: hypothetical protein ZEAMMB73_373456 [Zea mays]
 gi|414881094|tpg|DAA58225.1| TPA: hypothetical protein ZEAMMB73_373456 [Zea mays]
          Length = 442

 Score = 52.4 bits (124), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 45/164 (27%), Positives = 74/164 (45%), Gaps = 30/164 (18%)

Query: 679 ARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRP 738
            R + LVLDLD TL++S                K   D +      +      ++ K RP
Sbjct: 264 TRNVTLVLDLDETLVHSTM--------------KHCDDADFTFSMFYDMKEHVVYVKKRP 309

Query: 739 GIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG----DDGDPFDG 794
            +  FL+R  ++FE+ ++T    +YA ++  +LDP+  LF+ R         D G     
Sbjct: 310 HVHMFLQRMVEMFEVVIFTASQSVYADQLLDMLDPEKKLFSKRFFRESCLITDSG----- 364

Query: 795 DERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERY 837
                  KDL  V+G++ A V IID++ +V+     N I +E +
Sbjct: 365 -----YRKDL-TVVGVDLAKVAIIDNTPQVFELQVNNGIPIESW 402


>gi|215695024|dbj|BAG90215.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 269

 Score = 52.4 bits (124), Expect = 0.001,   Method: Composition-based stats.
 Identities = 46/153 (30%), Positives = 77/153 (50%), Gaps = 36/153 (23%)

Query: 679 ARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP-------HMG 731
           ARK+ LVLDLD TL++S                      E+   + F FP       HM 
Sbjct: 143 ARKVTLVLDLDETLVHSTT--------------------EQCDDYDFTFPVFFDMKEHM- 181

Query: 732 MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 791
           ++ + RP +  FL++ +++FE+ ++T    +YA ++  +LDP+  LF+ R      +   
Sbjct: 182 VYVRKRPHLHMFLQKMAEMFEVVIFTASQSVYADQLLDILDPEKKLFSRRYFR---ESCV 238

Query: 792 FDGDERVPKSKDLEGVLGMESA-VVIIDDSVRV 823
           F        +KDL  V+G++ A VVIID++ +V
Sbjct: 239 F---TNTSYTKDLT-VVGVDLAKVVIIDNTPQV 267


>gi|55740293|gb|AAV63948.1| putative nuclear LIM interactor-interacting protein [Phytophthora
           sojae]
 gi|348665891|gb|EGZ05719.1| hypothetical protein PHYSODRAFT_551168 [Phytophthora sojae]
          Length = 237

 Score = 52.4 bits (124), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 33/103 (32%), Positives = 56/103 (54%), Gaps = 2/103 (1%)

Query: 681 KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 740
           ++ LVLD+D  L++S   +EV+    E  R ++ ++       +       +  K RPG+
Sbjct: 40  RIALVLDMDECLVHSKFQNEVEYRQSEY-RPEQLEEYGDSFEIVMDDGERAVVNK-RPGL 97

Query: 741 WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVI 783
             FLE A+K ++++++T G + Y   +   LDPKG LFAGR  
Sbjct: 98  DRFLEEAAKHYDVYVFTAGLEAYGKPILDALDPKGNLFAGRFF 140


>gi|302812229|ref|XP_002987802.1| hypothetical protein SELMODRAFT_126751 [Selaginella moellendorffii]
 gi|302817447|ref|XP_002990399.1| hypothetical protein SELMODRAFT_131611 [Selaginella moellendorffii]
 gi|300141784|gb|EFJ08492.1| hypothetical protein SELMODRAFT_131611 [Selaginella moellendorffii]
 gi|300144421|gb|EFJ11105.1| hypothetical protein SELMODRAFT_126751 [Selaginella moellendorffii]
          Length = 253

 Score = 52.4 bits (124), Expect = 0.001,   Method: Composition-based stats.
 Identities = 47/160 (29%), Positives = 75/160 (46%), Gaps = 32/160 (20%)

Query: 684 LVLDLDHTLLNSAKFHEVD-----PVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRP 738
           LVLDLD TL++S   H  D     PV+                   F +    ++ + RP
Sbjct: 59  LVLDLDETLVHSTLEHCADADFSFPVY-------------------FNYQEHTVYVRRRP 99

Query: 739 GIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERV 798
            +  FLE+ ++LFE+ ++T    +YA ++  +LDPK  L   R+    D     DG+   
Sbjct: 100 HLQVFLEKVAQLFEIIIFTASQSVYAEQLLNILDPKRKLIRHRIFR--DSCVYVDGNYL- 156

Query: 799 PKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 837
              KDL  +LG + S V I+D+S + +     N I +E +
Sbjct: 157 ---KDLS-ILGRDLSKVAIVDNSPQAFGFQVDNGIPIESW 192


>gi|67463585|ref|XP_648443.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
 gi|56464600|gb|EAL43056.1| hypothetical protein EHI_121510 [Entamoeba histolytica HM-1:IMSS]
 gi|449705880|gb|EMD45836.1| RNA polymerase II ctd phosphatase, putative [Entamoeba histolytica
           KU27]
          Length = 428

 Score = 52.0 bits (123), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 50/209 (23%), Positives = 89/209 (42%), Gaps = 45/209 (21%)

Query: 725 FRFPHMG--MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDP-KGVLF-AG 780
           F  P     ++ + R GI TF+E+ SKL+++H+ T+G K YA  +   ++  + + F  G
Sbjct: 98  FEIPEQNAKVFIRFRDGIVTFMEKVSKLYDIHVVTLGQKEYAFAIVNAINKLRNIPFITG 157

Query: 781 RVISR--------GDDGDPFDG-------DERVPKSKDLEGVLGMESAVVIIDDSVRVWP 825
            +++          D+ D  DG       +ER    + +   +G E   VI+DD + VW 
Sbjct: 158 DLVTAEDCSSVIVCDEKDTNDGLIDREETNERRSVKRSI-PTMGKEEMQVIVDDRIDVWD 216

Query: 826 HNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSH 885
           +      VV+   Y P + +                   D  L     V++ ++  F+  
Sbjct: 217 NKN----VVQICEYVPSTNQV------------------DTELVRVTEVLQNIYTKFYDE 254

Query: 886 QSLDDVDVRNILAAEQRKILAGCRIVFSR 914
                 DV+ IL   ++KIL    + F+R
Sbjct: 255 HI---EDVKEILHLFRKKILENKNLYFNR 280


>gi|403338554|gb|EJY68521.1| Dullard-like phosphatase domain containing protein [Oxytricha
           trifallax]
          Length = 615

 Score = 52.0 bits (123), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 42/159 (26%), Positives = 71/159 (44%), Gaps = 24/159 (15%)

Query: 680 RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPG 739
           ++L +VLDLD+TL+     H V+ V                 ++ F      ++   RP 
Sbjct: 437 KRLIVVLDLDNTLI-----HSVNSVPTS------------SDQNYFAIRD-NIYVYKRPH 478

Query: 740 IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVP 799
           +  FL   +K  +++++T   K YA ++  V+DPK     G+   R D       DER  
Sbjct: 479 MEYFLAEIAKFADIYIFTASMKDYADQIMDVIDPKKTF--GKCFYRTD----CKKDERRQ 532

Query: 800 KSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYT 838
             KDL  V    + +++IDD+      N LN   ++ +T
Sbjct: 533 IYKDLSTVSDDLTQLIMIDDNEINCTKNPLNTFKIKHWT 571


>gi|432854554|ref|XP_004067958.1| PREDICTED: CTD small phosphatase-like protein 2-like [Oryzias
           latipes]
          Length = 381

 Score = 52.0 bits (123), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 30/107 (28%), Positives = 59/107 (55%), Gaps = 8/107 (7%)

Query: 733 WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPF 792
           +  LRP +  FL+  +K++E+ +YT   K YA ++ ++ DP+  LF  R+    DD    
Sbjct: 240 YMILRPHVREFLQAMAKIYELFVYTCAKKEYAEKILEIFDPQKKLFRHRLYQ--DDCACV 297

Query: 793 DGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYT 838
            G       KDL  +LG + +  V++D++   +P++ +N I ++ ++
Sbjct: 298 LGH----YIKDL-SILGRDLTKTVVLDNAPHTYPYHLMNTIPIKSWS 339


>gi|300121382|emb|CBK21762.2| unnamed protein product [Blastocystis hominis]
          Length = 399

 Score = 52.0 bits (123), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 65/251 (25%), Positives = 107/251 (42%), Gaps = 37/251 (14%)

Query: 585 SVSQPLTSEPMVSQNS-PIQPGQIKSGADMKA--VVTNHDDKQTG------TGSGPEAGP 635
           S S  +   PMV+ +  P  P  I  G D ++  +V++H  ++          S PE   
Sbjct: 118 SASTIIRQYPMVTLSCFPSSPKVIIRGKDKQSSDLVSSHHKEEIEPASPNLITSAPETQQ 177

Query: 636 VGAHPQSAWGDVEHL--FEGYDDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLL 693
             A P+S +  V  +     YD    AA+Q+    R+    K  +A K  LVLDLD TL+
Sbjct: 178 -EAEPESTFNPVIIIKNLPPYD-SLPAALQE----RVLLPPKSPTAPKYTLVLDLDETLV 231

Query: 694 NSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEM 753
           + +   E DP  D     + E  R              ++  +RP ++  L+R +  +E+
Sbjct: 232 HCSM--ERDPSADLAFSIRHEGQR------------FTIYANVRPFLFYLLKRVAPYYEI 277

Query: 754 HLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA 813
            +YT   K YA  +  +LD +  L   R+    +     DG+      KDL  +    S 
Sbjct: 278 VIYTASQKCYADRLLDILDSEQHLITHRLY--REHCLNIDGN----YIKDLNALNRDLSK 331

Query: 814 VVIIDDSVRVW 824
            VI+D+ +  +
Sbjct: 332 TVIVDNYISCF 342


>gi|302811311|ref|XP_002987345.1| hypothetical protein SELMODRAFT_125729 [Selaginella moellendorffii]
 gi|300144980|gb|EFJ11660.1| hypothetical protein SELMODRAFT_125729 [Selaginella moellendorffii]
          Length = 240

 Score = 51.6 bits (122), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 43/138 (31%), Positives = 65/138 (47%), Gaps = 22/138 (15%)

Query: 684 LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTF 743
           LVLDLD TL++S   H  +      L                 F    ++ + RP +  F
Sbjct: 47  LVLDLDETLVHSTTDHCGNADFSFSLHAN--------------FQRQTVYVRRRPHLQMF 92

Query: 744 LERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKD 803
           +ER ++LFE+ ++T     YA ++  +LDPK  +F  R+    D     DG+      KD
Sbjct: 93  MERVAQLFEIIVFTASQSTYAEKLLNILDPKRKVFRHRIFR--DSCVLVDGN----YLKD 146

Query: 804 LEGVLGME-SAVVIIDDS 820
           L  VLG + S  VI+D+S
Sbjct: 147 L-SVLGRDLSKTVIVDNS 163


>gi|302814947|ref|XP_002989156.1| hypothetical protein SELMODRAFT_129286 [Selaginella moellendorffii]
 gi|300143056|gb|EFJ09750.1| hypothetical protein SELMODRAFT_129286 [Selaginella moellendorffii]
          Length = 245

 Score = 51.6 bits (122), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 43/138 (31%), Positives = 65/138 (47%), Gaps = 22/138 (15%)

Query: 684 LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTF 743
           LVLDLD TL++S   H  +      L                 F    ++ + RP +  F
Sbjct: 47  LVLDLDETLVHSTTDHCGNADFSFSLHAN--------------FQRQTVYVRRRPHLQMF 92

Query: 744 LERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKD 803
           +ER ++LFE+ ++T     YA ++  +LDPK  +F  R+    D     DG+      KD
Sbjct: 93  MERVAQLFEIIVFTASQSTYAEKLLNILDPKRKVFRHRIFR--DSCVLVDGN----YLKD 146

Query: 804 LEGVLGME-SAVVIIDDS 820
           L  VLG + S  VI+D+S
Sbjct: 147 L-SVLGRDLSKTVIVDNS 163


>gi|449505979|ref|XP_004162620.1| PREDICTED: uncharacterized protein LOC101226452 [Cucumis sativus]
          Length = 470

 Score = 51.6 bits (122), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 44/152 (28%), Positives = 73/152 (48%), Gaps = 42/152 (27%)

Query: 684 LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP-------HMGMWTKL 736
           LVLDLD TL++S             L  +++ D        FRF        H+ ++ K 
Sbjct: 300 LVLDLDETLVHST------------LEPQDDAD--------FRFTVCLNMKEHI-VYVKR 338

Query: 737 RPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG---DDGDPFD 793
           RP +  FL+R +++FE+ ++T    +YA ++   LDP   + + R+        DG    
Sbjct: 339 RPYLQIFLDRVAEMFEVAIFTASQSIYAEQVLNKLDPDNCIISRRLYRESCIFSDG---- 394

Query: 794 GDERVPKSKDLEGVLGMESA-VVIIDDSVRVW 824
                  +KDL  VLG++ A VVI+D+  +V+
Sbjct: 395 -----CYTKDL-TVLGIDLAKVVIVDNYPQVF 420


>gi|449433585|ref|XP_004134578.1| PREDICTED: uncharacterized protein LOC101215257 [Cucumis sativus]
          Length = 484

 Score = 51.2 bits (121), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 44/152 (28%), Positives = 73/152 (48%), Gaps = 42/152 (27%)

Query: 684 LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP-------HMGMWTKL 736
           LVLDLD TL++S             L  +++ D        FRF        H+ ++ K 
Sbjct: 314 LVLDLDETLVHST------------LEPQDDAD--------FRFTVCLNMKEHI-VYVKR 352

Query: 737 RPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG---DDGDPFD 793
           RP +  FL+R +++FE+ ++T    +YA ++   LDP   + + R+        DG    
Sbjct: 353 RPYLQIFLDRVAEMFEVAIFTASQSIYAEQVLNKLDPDNCIISRRLYRESCIFSDG---- 408

Query: 794 GDERVPKSKDLEGVLGMESA-VVIIDDSVRVW 824
                  +KDL  VLG++ A VVI+D+  +V+
Sbjct: 409 -----CYTKDL-TVLGIDLAKVVIVDNYPQVF 434


>gi|226506682|ref|NP_001149415.1| CTD-phosphatase-like protein [Zea mays]
 gi|195627078|gb|ACG35369.1| CTD-phosphatase-like protein [Zea mays]
 gi|414881341|tpg|DAA58472.1| TPA: CTD-phosphatase-like protein [Zea mays]
          Length = 460

 Score = 51.2 bits (121), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 40/150 (26%), Positives = 72/150 (48%), Gaps = 28/150 (18%)

Query: 679 ARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRP 738
            + + LVLDLD TL++S             L + +  D        F   +  ++ K RP
Sbjct: 284 TKHVTLVLDLDETLVHST------------LDQCDSADFTL--EVFFNMKNHTVYVKKRP 329

Query: 739 GIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG---DDGDPFDGD 795
            +  FLE+ +++FE+ ++T   ++YA ++   LDP G   + R+        DG      
Sbjct: 330 YLKVFLEKVAQMFELVIFTASQRIYAEQLIDKLDPDGKYISRRIYRESCIFSDG------ 383

Query: 796 ERVPKSKDLEGVLGMESA-VVIIDDSVRVW 824
                +KDL  +LG++ A V I+D++ +V+
Sbjct: 384 ---CYTKDL-TILGIDLAKVAIVDNTPQVF 409


>gi|356515353|ref|XP_003526365.1| PREDICTED: uncharacterized protein LOC100813300 [Glycine max]
          Length = 467

 Score = 51.2 bits (121), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 44/146 (30%), Positives = 72/146 (49%), Gaps = 23/146 (15%)

Query: 680 RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPG 739
           +K+ L LDLD TL++S+   + D         K   DRE+            ++ + RP 
Sbjct: 294 KKVTLALDLDETLIHSS-MEQCDGAD---FTFKMITDRERT-----------VYVRKRPF 338

Query: 740 IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVP 799
           +  FL + S++FE+ ++T   ++YA  +  VLDP    F+ RV       +     +R  
Sbjct: 339 LQEFLAKVSEMFEIIIFTASKRMYAETLLDVLDPDKKFFSRRVCR-----ESCTWKDRCC 393

Query: 800 KSKDLEGVLGMESA-VVIIDDSVRVW 824
             KDL  VLG++ A V IID++  V+
Sbjct: 394 -VKDL-TVLGIDLAKVCIIDNTPEVF 417


>gi|302817700|ref|XP_002990525.1| hypothetical protein SELMODRAFT_131775 [Selaginella moellendorffii]
 gi|302817706|ref|XP_002990528.1| hypothetical protein SELMODRAFT_131706 [Selaginella moellendorffii]
 gi|300141693|gb|EFJ08402.1| hypothetical protein SELMODRAFT_131775 [Selaginella moellendorffii]
 gi|300141696|gb|EFJ08405.1| hypothetical protein SELMODRAFT_131706 [Selaginella moellendorffii]
          Length = 213

 Score = 51.2 bits (121), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 37/121 (30%), Positives = 64/121 (52%), Gaps = 19/121 (15%)

Query: 684 LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTF 743
           L+LDLD TL+ +++   +    D ++   E   +E+P           +W   RPG+  F
Sbjct: 20  LILDLDGTLIATSRQAGLHAKLDFVV---EFDPQEQP-----------VWVCKRPGLDDF 65

Query: 744 LERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKD 803
           L +AS+LFE+ ++++G + Y  +M + +DP G L A   +SR    D   G + + + KD
Sbjct: 66  LSKASQLFEVVVFSLGKRAYVEKMREKIDPSGSLVAF-WLSR----DSCSGSDAIKEYKD 120

Query: 804 L 804
           L
Sbjct: 121 L 121


>gi|156549638|ref|XP_001604265.1| PREDICTED: RNA polymerase II subunit A C-terminal domain
           phosphatase-like, partial [Nasonia vitripennis]
          Length = 512

 Score = 51.2 bits (121), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 32/91 (35%), Positives = 50/91 (54%), Gaps = 10/91 (10%)

Query: 752 EMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVL--G 809
           E+H+ T G + YA  +A +LD  G LF+ R++SR +  DP        K+ +L+ +   G
Sbjct: 1   ELHICTFGARQYAHRVAAILDNDGKLFSHRILSRDECFDP------QSKTANLKALFPCG 54

Query: 810 MESAVVIIDDSVRVWPHNKLNLIVVERYTYF 840
           ++  V IIDD   VW     NL+ V+ Y +F
Sbjct: 55  VD-MVCIIDDRDDVW-QGCANLVQVKPYHFF 83



 Score = 49.3 bits (116), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 33/126 (26%), Positives = 64/126 (50%), Gaps = 8/126 (6%)

Query: 874 VIERLHKIFFSHQSLD--DVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHP--LW 929
           +++R+H  F++    +     ++ I+   + ++L G  + FS + P    N  LH    +
Sbjct: 267 ILKRIHTEFYAEGDSEPRSRTLKEIIPKVRSRVLKGLCLTFSGLVP---NNQKLHQSRAY 323

Query: 930 QTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANE 988
           + A  FGA  ++ + +Q TH+VA   GT KV  A   G+  +V+P W+   A  +   +E
Sbjct: 324 KVARAFGAQASQDLTEQTTHLVAIQPGTVKVREAKRQGKVKIVNPDWLWTCAERWEHVDE 383

Query: 989 QDFAIK 994
           + + +K
Sbjct: 384 RLYPLK 389


>gi|449494439|ref|XP_004159546.1| PREDICTED: CTD small phosphatase-like protein 2-like [Cucumis
           sativus]
          Length = 446

 Score = 51.2 bits (121), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 47/158 (29%), Positives = 74/158 (46%), Gaps = 28/158 (17%)

Query: 684 LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTF 743
           LVLDLD TL++S     ++P  D         D   P    F      ++ + RP +  F
Sbjct: 251 LVLDLDETLVHST----LEPCVD--------ADFTFPVN--FNLQEHTVYVRCRPYLRDF 296

Query: 744 LERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD---DGDPFDGDERVPK 800
           +E  ++ FE+ ++T    +YA ++  VLDPK  +F  RV        DG+          
Sbjct: 297 MEAVARHFEIIIFTASQSIYAEQLLNVLDPKRKIFRHRVFRESCVFVDGNYL-------- 348

Query: 801 SKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERY 837
            KDL  VLG + A V+I+D+S + +     N I +E +
Sbjct: 349 -KDL-SVLGRDLARVIIVDNSPQAFGFQVDNGIPIESW 384


>gi|407043726|gb|EKE42114.1| NLI interacting factor family phosphatase domain containing protein
           [Entamoeba nuttalli P19]
          Length = 428

 Score = 50.8 bits (120), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 51/209 (24%), Positives = 88/209 (42%), Gaps = 45/209 (21%)

Query: 725 FRFPHMG--MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDP-KGVLF-AG 780
           F  P     +  + R GI TF+E+ SKL+++H+ T+G K YA  +   ++  + V F  G
Sbjct: 98  FEIPEQNAKVLIRFRDGIVTFMEKVSKLYDIHVVTLGQKEYAFAIVNAINKLRDVPFITG 157

Query: 781 RVISR--------GDDGDPFDG-------DERVPKSKDLEGVLGMESAVVIIDDSVRVWP 825
            +++          D+ D  DG       +ER    + +   +G E   VI+DD + VW 
Sbjct: 158 DLVTAEDCSSVIVCDEKDTNDGLIDREETNERRSVKRSI-PTMGKEEMQVIVDDRIDVWD 216

Query: 826 HNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSH 885
           +      VV+   Y P + +                   D  L     V++ ++  F+  
Sbjct: 217 NKN----VVQICEYVPSTNQV------------------DTELVRVTEVLQNIYTKFYDE 254

Query: 886 QSLDDVDVRNILAAEQRKILAGCRIVFSR 914
                 DV+ IL   ++KIL    + F+R
Sbjct: 255 HI---EDVKEILHLFRKKILENKNLYFNR 280


>gi|449450582|ref|XP_004143041.1| PREDICTED: uncharacterized protein LOC101204959 [Cucumis sativus]
          Length = 1024

 Score = 50.8 bits (120), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 47/158 (29%), Positives = 74/158 (46%), Gaps = 28/158 (17%)

Query: 684 LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTF 743
           LVLDLD TL++S     ++P  D         D   P    F      ++ + RP +  F
Sbjct: 244 LVLDLDETLVHST----LEPCVD--------ADFTFPVN--FNLQEHTVYVRCRPYLRDF 289

Query: 744 LERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD---DGDPFDGDERVPK 800
           +E  ++ FE+ ++T    +YA ++  VLDPK  +F  RV        DG+          
Sbjct: 290 MEAVARHFEIIIFTASQSIYAEQLLNVLDPKRKIFRHRVFRESCVFVDGNYL-------- 341

Query: 801 SKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERY 837
            KDL  VLG + A V+I+D+S + +     N I +E +
Sbjct: 342 -KDL-SVLGRDLARVIIVDNSPQAFGFQVDNGIPIESW 377


>gi|367002193|ref|XP_003685831.1| hypothetical protein TPHA_0E03070 [Tetrapisispora phaffii CBS 4417]
 gi|357524130|emb|CCE63397.1| hypothetical protein TPHA_0E03070 [Tetrapisispora phaffii CBS 4417]
          Length = 494

 Score = 50.8 bits (120), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 32/104 (30%), Positives = 53/104 (50%), Gaps = 14/104 (13%)

Query: 679 ARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRP 738
            RK CL+LDLD TL++S+ F  VD   D ++      D +  H ++ +          RP
Sbjct: 321 GRKKCLILDLDETLVHSS-FKYVDSA-DFVIPVT--IDNQTHHVYVIK----------RP 366

Query: 739 GIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 782
           G+  FL+R S+L+E+ ++T     Y   +  +LDP   +   R+
Sbjct: 367 GVDEFLKRVSELYEVVVFTASVSRYGDPLLNILDPANTIIHHRL 410


>gi|224035555|gb|ACN36853.1| unknown [Zea mays]
 gi|414881338|tpg|DAA58469.1| TPA: hypothetical protein ZEAMMB73_648049 [Zea mays]
 gi|414881339|tpg|DAA58470.1| TPA: hypothetical protein ZEAMMB73_648049 [Zea mays]
 gi|414881340|tpg|DAA58471.1| TPA: hypothetical protein ZEAMMB73_648049 [Zea mays]
          Length = 397

 Score = 50.4 bits (119), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 40/150 (26%), Positives = 72/150 (48%), Gaps = 28/150 (18%)

Query: 679 ARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRP 738
            + + LVLDLD TL++S             L + +  D        F   +  ++ K RP
Sbjct: 221 TKHVTLVLDLDETLVHST------------LDQCDSADFTL--EVFFNMKNHTVYVKKRP 266

Query: 739 GIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG---DDGDPFDGD 795
            +  FLE+ +++FE+ ++T   ++YA ++   LDP G   + R+        DG      
Sbjct: 267 YLKVFLEKVAQMFELVIFTASQRIYAEQLIDKLDPDGKYISRRIYRESCIFSDG------ 320

Query: 796 ERVPKSKDLEGVLGMESA-VVIIDDSVRVW 824
                +KDL  +LG++ A V I+D++ +V+
Sbjct: 321 ---CYTKDL-TILGIDLAKVAIVDNTPQVF 346


>gi|157125124|ref|XP_001660632.1| hypothetical protein AaeL_AAEL010078 [Aedes aegypti]
 gi|108873763|gb|EAT37988.1| AAEL010078-PA [Aedes aegypti]
          Length = 678

 Score = 50.4 bits (119), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 47/164 (28%), Positives = 79/164 (48%), Gaps = 22/164 (13%)

Query: 675 KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWT 734
           K  S+ +  LVLDLD TL           VH  +   +E  D       LF+     ++ 
Sbjct: 493 KTRSSPEFSLVLDLDETL-----------VHCSL---QELSDASFKFPVLFQECKYTVFV 538

Query: 735 KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDG 794
           + RP    FLE+ S++FE+ L+T   ++YA ++  +LDP+  L   R+    +     +G
Sbjct: 539 RTRPFFREFLEKVSQIFEVILFTASKRVYADKLLNLLDPERRLIKYRLFR--EHCVLVNG 596

Query: 795 DERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 837
           +      KDL  +LG + S  +IID+S + + +   N I +E +
Sbjct: 597 N----YIKDLT-ILGRDLSKTIIIDNSPQAFGYQLENGIPIESW 635


>gi|224057698|ref|XP_002299297.1| predicted protein [Populus trichocarpa]
 gi|222846555|gb|EEE84102.1| predicted protein [Populus trichocarpa]
          Length = 256

 Score = 50.4 bits (119), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 30/91 (32%), Positives = 45/91 (49%), Gaps = 14/91 (15%)

Query: 684 LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTF 743
           LVLDLD TL++S             L   ++ D   P    F      ++ + RP +  F
Sbjct: 81  LVLDLDETLVHST------------LEPCDDADFTFPVN--FNLQQHTVFVRCRPYLRDF 126

Query: 744 LERASKLFEMHLYTMGNKLYATEMAKVLDPK 774
           +ER S LFE+ ++T    +YA ++  VLDPK
Sbjct: 127 MERVSSLFEIIIFTASQSIYAEQLLNVLDPK 157


>gi|118354395|ref|XP_001010460.1| NLI interacting factor-like phosphatase family protein [Tetrahymena
           thermophila]
 gi|89292227|gb|EAR90215.1| NLI interacting factor-like phosphatase family protein [Tetrahymena
           thermophila SB210]
          Length = 540

 Score = 50.4 bits (119), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 45/178 (25%), Positives = 81/178 (45%), Gaps = 31/178 (17%)

Query: 679 ARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREK--PHRHLFRFPHMG----- 731
           ++KL L+LDLD TL+N+        V +E   K EE  R K   ++++    +       
Sbjct: 201 SQKLNLILDLDETLVNTV------WVTNENQSKLEEIYRYKMPSNKNVITIQYSNNEGAQ 254

Query: 732 --MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDG 789
               T LRP +  F+    K F + +Y+ G K Y  ++   +DP+  LF    I +    
Sbjct: 255 KEFITILRPHLKEFVTEMKKYFNILVYSHGRKDYVLKLLDKIDPRRDLFNRNNIFKN--- 311

Query: 790 DPFDGDERVPKSKDLEGVL------GMESAV---VIIDDSVRVWPHNKL-NLIVVERY 837
              +G   +   KD++ ++       +E A+   +IIDD   +W      N++ ++R+
Sbjct: 312 ---EGQVNIKTQKDIKNIIECDSPSALEKALKSSIIIDDIFEIWLEETFPNVVPIKRF 366


>gi|308485158|ref|XP_003104778.1| CRE-SCPL-3 protein [Caenorhabditis remanei]
 gi|308257476|gb|EFP01429.1| CRE-SCPL-3 protein [Caenorhabditis remanei]
          Length = 292

 Score = 50.4 bits (119), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 31/112 (27%), Positives = 56/112 (50%), Gaps = 26/112 (23%)

Query: 678 SARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM------G 731
           S+ +  LVLDLD TL++ +    + P+ +  +                 FP M       
Sbjct: 61  SSAEYTLVLDLDETLVHCS----LTPLDNATMI----------------FPVMFQDITYQ 100

Query: 732 MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVI 783
           ++ +LRP + TFL R SK+FE+ ++T   K+YA ++  ++DP+  +   R+ 
Sbjct: 101 VYVRLRPHLRTFLRRMSKIFEIIIFTASKKVYANKLCDIIDPQKTMIRHRLF 152


>gi|426378923|ref|XP_004056157.1| PREDICTED: CTD small phosphatase-like protein 2 [Gorilla gorilla
           gorilla]
          Length = 398

 Score = 50.1 bits (118), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 33/104 (31%), Positives = 56/104 (53%), Gaps = 8/104 (7%)

Query: 724 LFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVI 783
           LF+     ++ +LRP    FLER S+++E+ L+T   K+YA ++  +LDPK  L   R+ 
Sbjct: 280 LFQDVIYQVYVRLRPFFREFLERMSQMYEIILFTASKKVYADKLLNILDPKKQLVRHRLF 339

Query: 784 SRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPH 826
              +      G+      KDL  +LG + S  +IID+S + + +
Sbjct: 340 R--EHCVCVQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAY 376


>gi|260789874|ref|XP_002589969.1| hypothetical protein BRAFLDRAFT_224775 [Branchiostoma floridae]
 gi|229275156|gb|EEN45980.1| hypothetical protein BRAFLDRAFT_224775 [Branchiostoma floridae]
          Length = 232

 Score = 50.1 bits (118), Expect = 0.006,   Method: Composition-based stats.
 Identities = 38/120 (31%), Positives = 60/120 (50%), Gaps = 14/120 (11%)

Query: 663 QKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR 722
           ++ R R+     K  S  +  LVLDLD TL++ +            L + E+ +   P  
Sbjct: 35  EEMRQRQPALPLKTRSTPEFSLVLDLDETLVHCS------------LNELEDANLTFPV- 81

Query: 723 HLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 782
            LF+     ++ + RP    FLER SKL+E+ L+T   K+YA ++  +LDPK  L   R+
Sbjct: 82  -LFQDVTYQVYVRTRPYYREFLERMSKLYEIILFTASKKVYADKLMNILDPKKELVRHRL 140


>gi|390356060|ref|XP_003728694.1| PREDICTED: CTD small phosphatase-like protein 2-like isoform 1
           [Strongylocentrotus purpuratus]
          Length = 514

 Score = 50.1 bits (118), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 46/158 (29%), Positives = 75/158 (47%), Gaps = 22/158 (13%)

Query: 681 KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 740
           K  LVLDLD TL++ +            L + E      P    F+     ++ + RP  
Sbjct: 335 KYSLVLDLDETLVHCS------------LAEMENCTMSFPV--YFQDNEYQVYVRTRPFF 380

Query: 741 WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPK 800
             FLER SK+FE+ L+T   ++YA ++  +LDP+  L   R+    +      G+     
Sbjct: 381 RDFLERMSKIFEIILFTASKRVYADKLLNLLDPEKKLVRHRLFR--EHCICVQGN----Y 434

Query: 801 SKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 837
            KDL  +LG + +  VIID+S + + +   N I +E +
Sbjct: 435 IKDL-NILGRDLTKTVIIDNSPQAFGYQLENGIPIESW 471


>gi|293332237|ref|NP_001167877.1| uncharacterized protein LOC100381584 [Zea mays]
 gi|223944585|gb|ACN26376.1| unknown [Zea mays]
 gi|413950698|gb|AFW83347.1| hypothetical protein ZEAMMB73_634755 [Zea mays]
          Length = 419

 Score = 50.1 bits (118), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 38/149 (25%), Positives = 71/149 (47%), Gaps = 28/149 (18%)

Query: 680 RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPG 739
           + + LVLDLD TL++S   H                + +      F   +  ++ + RP 
Sbjct: 242 KHVTLVLDLDETLVHSTLDHC--------------DNADFTLEVFFNMKNHTVYVRKRPY 287

Query: 740 IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG---DDGDPFDGDE 796
           +  FLE+ +++FE+ ++T   ++YA ++   LDP G   + R+        DG       
Sbjct: 288 LKMFLEKVAQMFEVVIFTASQRVYAEQLIDKLDPDGKYISRRIYRESCVFSDG------- 340

Query: 797 RVPKSKDLEGVLGMESA-VVIIDDSVRVW 824
               +KDL  +LG++ A V I+D++ +V+
Sbjct: 341 --CYTKDL-TILGIDLAKVAIVDNTPQVF 366


>gi|219109563|ref|XP_002176536.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217411071|gb|EEC50999.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 809

 Score = 50.1 bits (118), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 61/253 (24%), Positives = 101/253 (39%), Gaps = 33/253 (13%)

Query: 769 KVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVW---- 824
           KVL+ +  LF  R++SR D  D     + + +     G++      V++DD   VW    
Sbjct: 510 KVLELRQRLFGSRIVSRTDVRDLGQNVKSLKRIFPCGGIMA-----VVMDDREDVWANAA 564

Query: 825 ---------PHNKLNLIVVERYTYFPCSRRQFGLLGPSLL-EIDHDERSEDGTLASSLGV 874
                    P + L L+    ++ F          G  L  E +  +   D  L  SL +
Sbjct: 565 DILTVRKGEPPDNLLLVRPYHWSSFLGFADVNNASGADLSGESEAGDVETDEQLLWSLDI 624

Query: 875 IERLHKIFF----SHQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHP--- 927
           ++R+H+ F+    S        V +I+   + + L G  +VFS + P+      L     
Sbjct: 625 LQRVHRRFYESDGSFLGALTQTVPDIVKQLRAETLHGAHLVFSGMVPLHRQQQQLESGDK 684

Query: 928 ------LWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALST-GRFVVHPGWVEASA 980
                 + + AE  GA     +   +THVVA   GTDK+  A    G  +V PGW+    
Sbjct: 685 VVPRPTVIRYAETLGAKVWSKVTPVLTHVVAAKDGTDKILAARKLPGCRIVKPGWLMECV 744

Query: 981 LLYRRANEQDFAI 993
               R +E  + +
Sbjct: 745 WSLTRRDEGRYLL 757



 Score = 45.1 bits (105), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 35/126 (27%), Positives = 58/126 (46%), Gaps = 26/126 (20%)

Query: 662 IQKERTRRLEEQ--KKMFSARKLCLVLDLDHTLL---NSAKFHEVDPVHDEI-------L 709
           + +   +R+ +Q  +++   +KL LVLDLDHTL+   N  +  +     D++       L
Sbjct: 236 VSRAEGQRMAQQDAERLQKRKKLSLVLDLDHTLVHATNDTRAQQFCKSRDDVRTLILPML 295

Query: 710 RKKEEQDREKPHRHLFRFPHMGMWT----KLRPGIWTFLERASKLFEMHLYTMGNKLYAT 765
           R   E           R P    WT    K+RP +  FL  A   +E+ +YT G + YA 
Sbjct: 296 RPNGEP----------RQPQHPEWTQHFVKMRPHVEVFLNEAQDQYEIGVYTAGTRDYAE 345

Query: 766 EMAKVL 771
           ++  +L
Sbjct: 346 QICILL 351


>gi|145502170|ref|XP_001437064.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124404211|emb|CAK69667.1| unnamed protein product [Paramecium tetraurelia]
          Length = 334

 Score = 50.1 bits (118), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 54/221 (24%), Positives = 94/221 (42%), Gaps = 40/221 (18%)

Query: 684 LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTF 743
           LV+DLD TL++S+   E   V+D I+       + K            ++  +RPG   F
Sbjct: 48  LVIDLDETLVHSS--FEPMKVNDLIVEVTMNDQKYK------------IYVNIRPGAHDF 93

Query: 744 LERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKS-- 801
           +E ASK FE+ ++T     YA  +   LDP G++          D   F  +  V K   
Sbjct: 94  IEEASKYFELIIFTASISEYANSVIDFLDPHGLV----------DLRLFRENCTVYKDIL 143

Query: 802 -KDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHD 860
            KDL  +     +V++ID+SV  +    +N + +  Y +   + ++  LL P L      
Sbjct: 144 VKDLSLLKRKLDSVILIDNSVNSFMFQPMNAVHILNY-FEDKTDQELTLLIPFL------ 196

Query: 861 ERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQ 901
                  L S    +  +H+  F +   D  +  +++  +Q
Sbjct: 197 ------KLLSQFQDVRPVHEWLFKYAHFDKFEYVDVMGIKQ 231


>gi|413950699|gb|AFW83348.1| hypothetical protein ZEAMMB73_634755 [Zea mays]
          Length = 400

 Score = 50.1 bits (118), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 38/149 (25%), Positives = 71/149 (47%), Gaps = 28/149 (18%)

Query: 680 RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPG 739
           + + LVLDLD TL++S   H                + +      F   +  ++ + RP 
Sbjct: 223 KHVTLVLDLDETLVHSTLDHC--------------DNADFTLEVFFNMKNHTVYVRKRPY 268

Query: 740 IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG---DDGDPFDGDE 796
           +  FLE+ +++FE+ ++T   ++YA ++   LDP G   + R+        DG       
Sbjct: 269 LKMFLEKVAQMFEVVIFTASQRVYAEQLIDKLDPDGKYISRRIYRESCVFSDG------- 321

Query: 797 RVPKSKDLEGVLGMESA-VVIIDDSVRVW 824
               +KDL  +LG++ A V I+D++ +V+
Sbjct: 322 --CYTKDL-TILGIDLAKVAIVDNTPQVF 347


>gi|390356058|ref|XP_788296.3| PREDICTED: CTD small phosphatase-like protein 2-like isoform 2
           [Strongylocentrotus purpuratus]
          Length = 485

 Score = 50.1 bits (118), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 46/158 (29%), Positives = 75/158 (47%), Gaps = 22/158 (13%)

Query: 681 KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 740
           K  LVLDLD TL++ +            L + E      P    F+     ++ + RP  
Sbjct: 306 KYSLVLDLDETLVHCS------------LAEMENCTMSFPV--YFQDNEYQVYVRTRPFF 351

Query: 741 WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPK 800
             FLER SK+FE+ L+T   ++YA ++  +LDP+  L   R+    +      G+     
Sbjct: 352 RDFLERMSKIFEIILFTASKRVYADKLLNLLDPEKKLVRHRLFR--EHCICVQGN----Y 405

Query: 801 SKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 837
            KDL  +LG + +  VIID+S + + +   N I +E +
Sbjct: 406 IKDL-NILGRDLTKTVIIDNSPQAFGYQLENGIPIESW 442


>gi|340508046|gb|EGR33849.1| NLI interacting factor-like phosphatase family protein, putative
           [Ichthyophthirius multifiliis]
          Length = 280

 Score = 50.1 bits (118), Expect = 0.007,   Method: Composition-based stats.
 Identities = 32/118 (27%), Positives = 64/118 (54%), Gaps = 19/118 (16%)

Query: 662 IQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSA--KFHEVDPVHDEILRKKEEQDREK 719
           ++K+R ++L +QK+    +K  L+LDLD TL++S+  + +E D   + +++    Q    
Sbjct: 11  VKKQRIKQLGQQKQSCIGKK-TLILDLDETLVHSSFQQINEYDFQFEIVVKNIPYQ---- 65

Query: 720 PHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 777
                       ++ K RPGI  FL++ S+ +E+ +YT     YA ++  ++D + V+
Sbjct: 66  ------------IYVKKRPGIHIFLQKLSEKYEIVIYTASISEYANQVCNIIDQQDVI 111


>gi|118390259|ref|XP_001028120.1| NLI interacting factor-like phosphatase family protein [Tetrahymena
           thermophila]
 gi|89309890|gb|EAS07878.1| NLI interacting factor-like phosphatase family protein [Tetrahymena
           thermophila SB210]
          Length = 623

 Score = 49.7 bits (117), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 40/153 (26%), Positives = 69/153 (45%), Gaps = 22/153 (14%)

Query: 680 RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPG 739
           +K  L+LDLD TL++  +   +D   D IL      D +   + +     +     +RP 
Sbjct: 432 KKKTLILDLDETLIHCNE--SLDNSSDFIL------DIQADSKEV-----VQAGINVRPF 478

Query: 740 IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD--DGDPFDGDER 797
              FLE  S L+E+ ++T    +YA E+   LDP+      R+           +  D R
Sbjct: 479 AKQFLEEMSHLYEIVIFTASRSVYANEVINKLDPQNKFIFKRLFRENCIYKNRIYIKDLR 538

Query: 798 VPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLN 830
           + K++D++        +VI+D+    + HN LN
Sbjct: 539 IFKNRDIKN-------LVIVDNCCLSFCHNILN 564


>gi|357135834|ref|XP_003569513.1| PREDICTED: uncharacterized protein LOC100822852 [Brachypodium
           distachyon]
          Length = 447

 Score = 49.7 bits (117), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 40/150 (26%), Positives = 71/150 (47%), Gaps = 30/150 (20%)

Query: 680 RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPG 739
           +K+ LVLDLD TL++S   H                D +      F      ++ + RP 
Sbjct: 270 KKVTLVLDLDETLVHSTMEHC--------------SDADFTFPVFFDMKEHVVYVRKRPH 315

Query: 740 IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG----DDGDPFDGD 795
           +  FL++ +++F++ ++T    +YA ++   LDP+  LF  R         + G      
Sbjct: 316 LHIFLQKMAEMFDVVIFTASQSVYADQLLDRLDPEKTLFCKRFFRESCVFTESG------ 369

Query: 796 ERVPKSKDLEGVLGMESA-VVIIDDSVRVW 824
                +KDL  V+G++ A VVIID++ +V+
Sbjct: 370 ----YTKDLT-VVGVDLAKVVIIDNTPQVF 394


>gi|413948975|gb|AFW81624.1| hypothetical protein ZEAMMB73_313891 [Zea mays]
          Length = 322

 Score = 49.7 bits (117), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 46/142 (32%), Positives = 60/142 (42%), Gaps = 30/142 (21%)

Query: 684 LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRF-PHMG-----MWTKLR 737
           L LDLD TL++S                   Q    P R  F   P +G      +   R
Sbjct: 148 LFLDLDETLIHS-------------------QTEPPPSRFDFTVRPVIGGHAVTFYVVKR 188

Query: 738 PGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDER 797
           PG+  FL  A+  FE+ ++T G + YA+ +   LDP G +FA R + RG   D  DG   
Sbjct: 189 PGVEAFLRAAADAFEVVVFTAGLQEYASLVLDRLDPDGEVFAHR-LYRGACRDAGDGT-- 245

Query: 798 VPKSKDLEGVLGMESAVVIIDD 819
               KDL          VIIDD
Sbjct: 246 --LVKDLAATGRALDRAVIIDD 265


>gi|431896052|gb|ELK05470.1| CTD small phosphatase-like protein 2 [Pteropus alecto]
          Length = 282

 Score = 49.7 bits (117), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 34/107 (31%), Positives = 58/107 (54%), Gaps = 8/107 (7%)

Query: 732 MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 791
           ++ +LRP    FLER S+++E+ L+T   K+YA ++  +LDPK  L   R+    +    
Sbjct: 140 VYVRLRPFFREFLERMSQMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFR--EHCVC 197

Query: 792 FDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 837
             G+      KDL  +LG + S  +IID+S + + +   N I +E +
Sbjct: 198 VQGN----YIKDL-NILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 239


>gi|357487783|ref|XP_003614179.1| CTD small phosphatase-like protein [Medicago truncatula]
 gi|355515514|gb|AES97137.1| CTD small phosphatase-like protein [Medicago truncatula]
          Length = 306

 Score = 49.7 bits (117), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 53/184 (28%), Positives = 85/184 (46%), Gaps = 24/184 (13%)

Query: 684 LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTF 743
           LVL LD TL           VH  +++ KE+ D        F      ++ + RP +  F
Sbjct: 127 LVLGLDGTL-----------VHSTLVKPKEDHDL--TFTVSFNSVKEDVYVRYRPHLKEF 173

Query: 744 LERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKD 803
           L+  S +FE+ ++T G ++YA ++   LDP   +F  R+          + DE+    KD
Sbjct: 174 LDEVSGIFEIIVFTAGQRIYADKLLNKLDPSRKIFRHRLFRES----CVNVDEKY--VKD 227

Query: 804 LEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLL--GPSLLEIDHD 860
           L  +LG + A V +ID S   +     N I +E +   P   +   L+    SL+E+D D
Sbjct: 228 L-SILGRDLARVTMIDSSPHSFGFQVENGIPIETWFADPSDNKLLSLIPFLESLVEVD-D 285

Query: 861 ERSE 864
            R+E
Sbjct: 286 VRTE 289


>gi|242089885|ref|XP_002440775.1| hypothetical protein SORBIDRAFT_09g006400 [Sorghum bicolor]
 gi|241946060|gb|EES19205.1| hypothetical protein SORBIDRAFT_09g006400 [Sorghum bicolor]
          Length = 319

 Score = 49.7 bits (117), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 46/143 (32%), Positives = 67/143 (46%), Gaps = 30/143 (20%)

Query: 684 LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRF-PHMG-----MWTKLR 737
           L LDLD TL++S    + DP                P R  F   P +G      +   R
Sbjct: 145 LFLDLDETLIHS----QTDP---------------PPSRFDFTVRPVIGGHAVTFYVVKR 185

Query: 738 PGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDER 797
           PG+  FL  A+++F++ ++T G + YA+ +   LDP G +FA R + RG   D   GD R
Sbjct: 186 PGVEAFLRAAAEIFDVVVFTAGLQEYASLVLDRLDPDGEVFAHR-LYRGACRDA--GDGR 242

Query: 798 VPKSKDLEGVLGMESAVVIIDDS 820
           +   KDL          VI+DD+
Sbjct: 243 L--VKDLAATGRALDRAVIVDDN 263


>gi|195996503|ref|XP_002108120.1| hypothetical protein TRIADDRAFT_18774 [Trichoplax adhaerens]
 gi|190588896|gb|EDV28918.1| hypothetical protein TRIADDRAFT_18774, partial [Trichoplax
           adhaerens]
          Length = 208

 Score = 49.7 bits (117), Expect = 0.008,   Method: Composition-based stats.
 Identities = 48/159 (30%), Positives = 76/159 (47%), Gaps = 24/159 (15%)

Query: 681 KLCLVLDLDHTLLN-SAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPG 739
           +  LV+DLD TL++ S    E   +H  I  K    D               ++ +LRP 
Sbjct: 30  EFTLVIDLDETLVHCSLSLLEDANLHFPIYFKNNNYD---------------VYVRLRPY 74

Query: 740 IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVP 799
              FLER SK++E+ L+T   K+YA ++  ++DP   L   R+     +   F     V 
Sbjct: 75  YREFLERVSKIYEVILFTASKKVYANKLMDIIDPGRKLVKHRLFR---EHCVFVHGNYV- 130

Query: 800 KSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 837
             KDL G+LG + S  VI+D+S + + +   N I +E +
Sbjct: 131 --KDL-GILGRDLSKTVIVDNSPQAFGYQLSNGIPIESW 166


>gi|156404147|ref|XP_001640269.1| predicted protein [Nematostella vectensis]
 gi|156227402|gb|EDO48206.1| predicted protein [Nematostella vectensis]
          Length = 289

 Score = 49.7 bits (117), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 54/193 (27%), Positives = 90/193 (46%), Gaps = 32/193 (16%)

Query: 666 RTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLF 725
           RTRR  E           LVLDLD TL++ +            L K E+     P    +
Sbjct: 97  RTRRTPE---------FSLVLDLDETLVHCS------------LNKLEDATLSFPVS--Y 133

Query: 726 RFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISR 785
           +     ++ + RP +  FLER SK+FE+ L+T   ++YA ++  +LDP+   F  R+   
Sbjct: 134 QDITYQVFVRTRPHLKYFLERVSKVFEVILFTASKRVYADKLLNILDPEKKYFRHRLFR- 192

Query: 786 GDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSR 844
            +      G+      KDL  +LG + S  +I+D+S + + +   N I +E + +   + 
Sbjct: 193 -EHCVCVQGN----YIKDL-SILGRDLSKTMIVDNSPQAFAYQIFNGIPIESW-FVDQTD 245

Query: 845 RQFGLLGPSLLEI 857
           R+   L P L E+
Sbjct: 246 RELVELLPFLEEL 258


>gi|290992214|ref|XP_002678729.1| predicted protein [Naegleria gruberi]
 gi|284092343|gb|EFC45985.1| predicted protein [Naegleria gruberi]
          Length = 181

 Score = 49.7 bits (117), Expect = 0.009,   Method: Composition-based stats.
 Identities = 53/188 (28%), Positives = 88/188 (46%), Gaps = 31/188 (16%)

Query: 681 KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMG------MWT 734
           K  +VLDLD TL+ S  F++   V+D  +                 FPHMG      ++ 
Sbjct: 9   KKTIVLDLDETLIKS--FYQEPEVYDFSID--------------IEFPHMGNLIQQHVYI 52

Query: 735 KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDG 794
           K RPG+  FL+  ++ FE+ ++T    +YA  + K +DP   LF+  V+ R       +G
Sbjct: 53  KKRPGLENFLQTLAEKFELIMFTAALPVYADAILKHIDPSAELFS-HVLYRHH----CNG 107

Query: 795 DERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPS 853
               P  KDL  +LG      +++DD V  +   K N ++++ +      R    ++ P 
Sbjct: 108 SGMFP-GKDLR-ILGRNLDHTLLVDDGVMNFLQPK-NGLLIKSFKGEEGDRILADIIAPF 164

Query: 854 LLEIDHDE 861
           LL++D  E
Sbjct: 165 LLQLDDPE 172


>gi|15929101|gb|AAH15010.1| Unknown (protein for IMAGE:4864552), partial [Homo sapiens]
          Length = 565

 Score = 49.3 bits (116), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 36/139 (25%), Positives = 67/139 (48%), Gaps = 23/139 (16%)

Query: 874 VIERLHKIFFS------HQSLDDV-DVRNILAAEQRKILAGCRIVFSRV----FPVGEAN 922
           ++ R+H  +++      ++ +++  D+R I+   + K+LA   I+FS +    FP+ +  
Sbjct: 294 ILVRVHTDYYAKYDRYLNKEIEEAPDIRKIVPELKSKVLADVAIIFSGLHPTNFPIEKTR 353

Query: 923 PHLHPLWQTAEQFGA------VCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGW 975
            H H     A   GA      V +    D+ TH++A   GT+KV  A   G   VV+P W
Sbjct: 354 EHYH-----ATALGAKILTRLVLSPDAPDRATHLIAARAGTEKVLQAQECGHLHVVNPDW 408

Query: 976 VEASALLYRRANEQDFAIK 994
           + +    + +  EQ F ++
Sbjct: 409 LWSCLERWDKVEEQLFPLR 427


>gi|85001578|ref|XP_955502.1| ctd-like phosphatase [Theileria annulata strain Ankara]
 gi|65303648|emb|CAI76026.1| ctd-like phosphatase, putative [Theileria annulata]
          Length = 832

 Score = 49.3 bits (116), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 19/54 (35%), Positives = 32/54 (59%)

Query: 733 WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG 786
           + KLRPGI+ F  +    F + L+T G K +A    +++DP+ + F+ R+ SR 
Sbjct: 301 YFKLRPGIFNFFHQIRDKFTLFLFTTGTKQHAESALQIIDPQLIYFSNRIFSRS 354


>gi|159473212|ref|XP_001694733.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158276545|gb|EDP02317.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 215

 Score = 49.3 bits (116), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 48/149 (32%), Positives = 69/149 (46%), Gaps = 34/149 (22%)

Query: 679 ARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP--HMGM---- 732
           AR+  LVLDLD TL++S+                E  DR       F FP    GM    
Sbjct: 32  ARRKTLVLDLDETLVHSSL---------------EAVDRSD-----FNFPVTFNGMDHTV 71

Query: 733 WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPF 792
           + + RP +  F+ R + LFE+ ++T   ++YA  +  +LDP   L   R+    D     
Sbjct: 72  YVRQRPHLHDFMARVAALFEVVVFTASQRIYAERLLDILDPGQALVRHRIYR--DSCVVV 129

Query: 793 DGDERVPKSKDLEGVLGMESA-VVIIDDS 820
           DG+      KDL  VLG + A  VI+D+S
Sbjct: 130 DGN----YLKDL-SVLGRDLAHTVIVDNS 153


>gi|223943303|gb|ACN25735.1| unknown [Zea mays]
          Length = 342

 Score = 49.3 bits (116), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 38/149 (25%), Positives = 71/149 (47%), Gaps = 28/149 (18%)

Query: 680 RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPG 739
           + + LVLDLD TL++S   H                + +      F   +  ++ + RP 
Sbjct: 165 KHVTLVLDLDETLVHSTLDHC--------------DNADFTLEVFFNMKNHTVYVRKRPY 210

Query: 740 IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG---DDGDPFDGDE 796
           +  FLE+ +++FE+ ++T   ++YA ++   LDP G   + R+        DG       
Sbjct: 211 LKMFLEKVAQMFEVVIFTASQRVYAEQLIDKLDPDGKYISRRIYRESCVFSDG------- 263

Query: 797 RVPKSKDLEGVLGMESA-VVIIDDSVRVW 824
               +KDL  +LG++ A V I+D++ +V+
Sbjct: 264 --CYTKDL-TILGIDLAKVAIVDNTPQVF 289


>gi|391328122|ref|XP_003738541.1| PREDICTED: CTD small phosphatase-like protein 2-like [Metaseiulus
           occidentalis]
          Length = 236

 Score = 49.3 bits (116), Expect = 0.012,   Method: Composition-based stats.
 Identities = 37/108 (34%), Positives = 54/108 (50%), Gaps = 14/108 (12%)

Query: 675 KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWT 734
           K  SA +  LVLDLD TL++ +            L + E      P   LF+     ++ 
Sbjct: 51  KTRSAPEFSLVLDLDETLVHCS------------LMELEGATFTFPV--LFQGIEYKVYV 96

Query: 735 KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 782
           + RP    FLER SK+FE+ L+T   K+YA ++  +LDPK  L   R+
Sbjct: 97  RTRPFFREFLERVSKMFEVILFTASKKVYADKLLDLLDPKRHLIRYRL 144


>gi|225711928|gb|ACO11810.1| Probable C-terminal domain small phosphatase [Lepeophtheirus
           salmonis]
          Length = 265

 Score = 48.9 bits (115), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 47/161 (29%), Positives = 78/161 (48%), Gaps = 22/161 (13%)

Query: 678 SARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLR 737
           S+ +  LVLDLD TL++ +   E+D             D       +F+     ++ + R
Sbjct: 82  SSPRFSLVLDLDETLVHCS-LQELD-------------DASLSFPVVFQDTTYRVFVRTR 127

Query: 738 PGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDER 797
           P I  FLER SK FE+ L+T   K+YA ++  +LDP+      R+    +     +G+  
Sbjct: 128 PRIREFLERVSKNFEVTLFTASKKVYADKLLNLLDPERKWIKYRLFR--EHCVCVNGN-- 183

Query: 798 VPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 837
               KDL  +LG + S  +IID+S + + +   N I +E +
Sbjct: 184 --YIKDL-NILGRDLSKTIIIDNSPQAFGYQLENGIPIESW 221


>gi|145523063|ref|XP_001447370.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124414881|emb|CAK79973.1| unnamed protein product [Paramecium tetraurelia]
          Length = 336

 Score = 48.9 bits (115), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 52/219 (23%), Positives = 98/219 (44%), Gaps = 34/219 (15%)

Query: 684 LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTF 743
           LV+DLD TL++S+   E   ++D I+    +  + K            ++  +RPG   F
Sbjct: 48  LVIDLDETLVHSS--FEPMKINDLIVEVTMKDQKYK------------IYVNIRPGAQEF 93

Query: 744 LERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKD 803
           ++  SKLFE+ ++T     YA  +   +DP G L   R+    ++   ++G       KD
Sbjct: 94  IKETSKLFELIIFTASISEYANSVIDFIDPHG-LVDLRLFR--ENCTVYNG----VLVKD 146

Query: 804 LEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERS 863
           L  +     +V++ID+SV  +    +N I +  Y +   + ++  LL P L         
Sbjct: 147 LSLLKRNLDSVILIDNSVNSFMFQPMNAIHILNY-FEDKTDQELILLIPFL--------- 196

Query: 864 EDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQR 902
               L S    +  +H+  F +   D  +  +I+ ++Q+
Sbjct: 197 ---KLISQFQDVRPVHEWLFKYAHFDKFEYVDIMGSKQQ 232


>gi|302824588|ref|XP_002993936.1| hypothetical protein SELMODRAFT_137904 [Selaginella moellendorffii]
 gi|300138208|gb|EFJ04983.1| hypothetical protein SELMODRAFT_137904 [Selaginella moellendorffii]
          Length = 159

 Score = 48.9 bits (115), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 50/174 (28%), Positives = 78/174 (44%), Gaps = 36/174 (20%)

Query: 684 LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTF 743
           LVLDLD TL++ A        HDE          E   R + +          RPG+  F
Sbjct: 2   LVLDLDQTLVSVAD-------HDE----------ETLLRCVTK----------RPGLDRF 34

Query: 744 LERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKD 803
           L+  S+++E+ +++     Y  ++   LDP G +F+   +  G D D   G +RV   + 
Sbjct: 35  LKDMSQVYEIVIFSASGASYVKKIVSSLDPTGEIFSA--VFTGSDTDWLSG-QRVKDLRK 91

Query: 804 LEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEI 857
           L         +V IDD+  ++P+N  N I V  +   P +    G L P LLE+
Sbjct: 92  LN-----RDKIVWIDDNASLYPYNPKNGIQVPPFHGDP-NDSILGALTPLLLEV 139


>gi|405966502|gb|EKC31780.1| CTD small phosphatase-like protein 2 [Crassostrea gigas]
          Length = 402

 Score = 48.9 bits (115), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 53/192 (27%), Positives = 90/192 (46%), Gaps = 25/192 (13%)

Query: 675 KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWT 734
           K  S+ +  LVLDLD TL++ +                E +D       LF      ++ 
Sbjct: 218 KTRSSPEFSLVLDLDETLVHCSL--------------TELEDAAFTFPVLFEDVTYKVFV 263

Query: 735 KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDG 794
           + RP    FLE  S++FE+ L+T   K+YA ++  +LDP+  L   R+    +     +G
Sbjct: 264 RTRPHFREFLETVSEMFEVILFTASKKVYADKLVNILDPQKQLIKHRLFR--EHCVCING 321

Query: 795 DERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPS 853
           +      KDL  +LG + S  +I+D+S + + +   N I +E + +   + R+   L P 
Sbjct: 322 N----YIKDL-TILGRDLSRTIIVDNSPQAFGYQLDNGIPIESW-FVDKNDRELLNLVPF 375

Query: 854 LLEIDHDERSED 865
           L  + H  R+ED
Sbjct: 376 LQSLVH--RNED 385


>gi|403353558|gb|EJY76317.1| NLI interacting factor-like phosphatase family protein [Oxytricha
           trifallax]
          Length = 1037

 Score = 48.9 bits (115), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 42/151 (27%), Positives = 72/151 (47%), Gaps = 31/151 (20%)

Query: 679 ARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPH----MGMWT 734
           ++K  L+ D+D TL+     H VD +      + E+ D   P      FP     +    
Sbjct: 654 SKKKTLIFDMDETLI-----HCVDDI------ESEDPDVIIP----IDFPDEDEIVNAGI 698

Query: 735 KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDG 794
            +RP ++  LE A+KLF++ ++T  +K YA  +   LDP+   F  R+  R +     +G
Sbjct: 699 NIRPYLYECLEEANKLFQVIVFTASHKAYADAILDYLDPENKYFQYRLY-RDNCVQTREG 757

Query: 795 ----DERVPKSKDLEGVLGMESAVVIIDDSV 821
               D R+  ++DL+        ++IID+SV
Sbjct: 758 YYVKDLRIINNRDLKD-------LIIIDNSV 781


>gi|345479753|ref|XP_001603378.2| PREDICTED: hypothetical protein LOC100119644 [Nasonia vitripennis]
          Length = 563

 Score = 48.9 bits (115), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 50/170 (29%), Positives = 77/170 (45%), Gaps = 34/170 (20%)

Query: 675 KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP------ 728
           K  S+ +  LVLDLD TL           VH  +   +E  D        FRFP      
Sbjct: 378 KTRSSPEFSLVLDLDETL-----------VHCSL---QELSDAS------FRFPVVFQNI 417

Query: 729 HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD 788
              ++ + RP    FLE  S L+E+ L+T   ++YA ++  +LDP   L   R+    + 
Sbjct: 418 TYTVFVRTRPFFREFLEHVSSLYEVILFTASKRVYANKLMNLLDPTRKLIKYRLFR--EH 475

Query: 789 GDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 837
               +G+      KDL  +LG + S  VIID+S + + +   N I +E +
Sbjct: 476 CVCVNGN----YIKDL-SILGRDLSKTVIIDNSPQAFGYQLENGIPIESW 520


>gi|313226803|emb|CBY21948.1| unnamed protein product [Oikopleura dioica]
          Length = 444

 Score = 48.5 bits (114), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 50/182 (27%), Positives = 84/182 (46%), Gaps = 33/182 (18%)

Query: 657 QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 716
           + +AAI   +TR+  E           LVLDLD TL           VH  +    E Q 
Sbjct: 237 KNRAAILPCKTRKTPE---------YTLVLDLDETL-----------VHCSLC---ELQM 273

Query: 717 REKPHRHLFRFPHMG--MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 774
           R+       RF ++   ++ K RP +  FLER  + FE+ ++T   K+YA ++  ++DP 
Sbjct: 274 RDYEFTFPIRFQNVDYDVYVKTRPYLRDFLERMCEHFEIIIFTASKKVYADKLISIIDPN 333

Query: 775 GVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIV 833
             L   R+    +      G+      KDL  +LG + +  +I+D+S + + ++  N I 
Sbjct: 334 KKLVRHRLFR--EHCMLVQGN----YIKDL-TILGRDLTKTIIVDNSPQAFSYHMDNGIP 386

Query: 834 VE 835
           +E
Sbjct: 387 IE 388


>gi|291239709|ref|XP_002739764.1| PREDICTED: CTD small phosphatase-like protein 2-like [Saccoglossus
           kowalevskii]
          Length = 526

 Score = 48.5 bits (114), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 49/176 (27%), Positives = 84/176 (47%), Gaps = 22/176 (12%)

Query: 663 QKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR 722
           + E+ RR     K  S+ +  LVLDLD TL++ +  +E+D             D      
Sbjct: 329 EAEKNRRPVLPLKTRSSPEYSLVLDLDETLVHCS-LNELD-------------DANLTFP 374

Query: 723 HLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 782
            +F+     ++ + RP    FLE  S+ FE+ L+T   K+YA ++  +LDP+      R+
Sbjct: 375 VVFQDITYQVFVRTRPYFKEFLEAVSQQFEVILFTASKKVYADKLFNLLDPQKKYVKYRL 434

Query: 783 ISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 837
               +      G+      KDL G+LG + S V+I+D+S + + +   N I +E +
Sbjct: 435 FR--EHCVCVQGN----YIKDL-GILGRDLSRVIIVDNSPQAFGYQLSNGIPIESW 483


>gi|335281247|ref|XP_001925490.3| PREDICTED: RNA polymerase II subunit A C-terminal domain
           phosphatase [Sus scrofa]
          Length = 666

 Score = 48.5 bits (114), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 33/134 (24%), Positives = 64/134 (47%), Gaps = 13/134 (9%)

Query: 874 VIERLHKIFFSH-------QSLDDVDVRNILAAEQRKILAGCRIVFSRV----FPVGEAN 922
           ++ R+H  +++        ++ +  D+R I+   + ++LA   I+FS +    FP+ +  
Sbjct: 297 ILARVHSDYYAKYDRFLCGETQEAPDIRKIVPELRSRVLADVAIIFSGLHPTNFPIEKTR 356

Query: 923 PHLHPLWQTAEQFGAVCTKHID-DQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASA 980
            H H     A     +     D D+ TH++A   GT+KV  A   G+  VV+P W+ +  
Sbjct: 357 EHYHATALGARILTQLVLDPDDPDRPTHLIAARAGTEKVRQAQECGQLHVVNPDWLWSCL 416

Query: 981 LLYRRANEQDFAIK 994
             + +  EQ F ++
Sbjct: 417 ERWDKVEEQLFPLQ 430


>gi|70921595|ref|XP_734099.1| hypothetical protein [Plasmodium chabaudi chabaudi]
 gi|56506520|emb|CAH86297.1| hypothetical protein PC301933.00.0 [Plasmodium chabaudi chabaudi]
          Length = 212

 Score = 48.5 bits (114), Expect = 0.018,   Method: Composition-based stats.
 Identities = 35/136 (25%), Positives = 62/136 (45%), Gaps = 14/136 (10%)

Query: 714 EQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDP 773
           E D  + ++    + +   + K RP +  FLE  S  +E+ +YT   + YA  +  +LDP
Sbjct: 5   ENDELELYKFFLPYYNFFYYLKFRPYVRQFLEILSLYYELSIYTNATREYADVVIAILDP 64

Query: 774 KGVLFAGRVISRGD--DGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVW---PHNK 828
              +FA R+++R    D D     E++  + D          V+  DD   VW   P + 
Sbjct: 65  DRTIFADRIVARCSSVDRDENKHFEKIYPNVD-------PKYVIAFDDRKDVWYDIPDS- 116

Query: 829 LNLIVVERYTYFPCSR 844
            +++  E Y +F  S+
Sbjct: 117 -HILRAEHYNFFELSK 131


>gi|242218013|ref|XP_002474801.1| predicted protein [Postia placenta Mad-698-R]
 gi|220726049|gb|EED80012.1| predicted protein [Postia placenta Mad-698-R]
          Length = 925

 Score = 48.5 bits (114), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 21/64 (32%), Positives = 36/64 (56%)

Query: 928 LWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRAN 987
           L++ A + GAVC   + D+VTH++A   G+ K   AL     ++HP W+  S  ++ R +
Sbjct: 77  LFKQALELGAVCLSDLTDRVTHLLAVEPGSAKYRCALENKITIMHPSWITESHEIWLRGD 136

Query: 988 EQDF 991
           + D 
Sbjct: 137 DVDL 140


>gi|242205780|ref|XP_002468747.1| predicted protein [Postia placenta Mad-698-R]
 gi|220732132|gb|EED85970.1| predicted protein [Postia placenta Mad-698-R]
          Length = 925

 Score = 48.5 bits (114), Expect = 0.020,   Method: Compositional matrix adjust.
 Identities = 21/64 (32%), Positives = 36/64 (56%)

Query: 928 LWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRAN 987
           L++ A + GAVC   + D+VTH++A   G+ K   AL     ++HP W+  S  ++ R +
Sbjct: 77  LFKQALELGAVCLSDLTDRVTHLLAVEPGSAKYRCALENKITIMHPSWITESHEIWLRGD 136

Query: 988 EQDF 991
           + D 
Sbjct: 137 DVDL 140


>gi|195125303|ref|XP_002007120.1| GI12759 [Drosophila mojavensis]
 gi|193918729|gb|EDW17596.1| GI12759 [Drosophila mojavensis]
          Length = 245

 Score = 48.5 bits (114), Expect = 0.021,   Method: Composition-based stats.
 Identities = 32/119 (26%), Positives = 58/119 (48%), Gaps = 14/119 (11%)

Query: 673 QKKMFSARKLCLVLDLDHTLLNSAKFH------EVDPVHDEILRKKEEQDREKPHRHLFR 726
           +K++   R+  LVLD+D TL++S   +      E  P  +   + K +     P+ + F 
Sbjct: 19  KKRLLMVRRKTLVLDMDETLISSVILYRVKSLLEAGPEDNRRYKAKSKIVHSTPYDYSFY 78

Query: 727 FP--HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVI 783
            P     ++   RP +  FL+R SK + + ++T  ++ YA+++   LD      AGR I
Sbjct: 79  IPMSEASVYVYKRPYVDLFLDRVSKWYNLVVFTAASEAYASQVLDFLD------AGRNI 131


>gi|385301990|gb|EIF46141.1| putative rna pol ii ctd phosphatase component [Dekkera bruxellensis
           AWRI1499]
          Length = 527

 Score = 48.5 bits (114), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 36/144 (25%), Positives = 65/144 (45%), Gaps = 11/144 (7%)

Query: 859 HDERSEDGTLASSLG-VIERLHKIFFSHQSLDDV-----DVRNILAAEQRKILAGCRIVF 912
           H+   +D T   +LG  + ++H  F+  +  D       DV++I+   +  +  GC  + 
Sbjct: 225 HNLLCDDDTELQTLGKALTKIHDKFYLEKZRDVASXEVPDVKDIMNTMKSVVFQGCVFLL 284

Query: 913 SRVFPVGE-ANPHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALST--GRF 969
           S + P+G   +     +W  A  +GA         VTHV+  +  T KV  A S      
Sbjct: 285 SGILPLGTPLDSADIVIW--ARSYGAQFVSEYSTSVTHVICKNPXTFKVRLAKSVDPNVK 342

Query: 970 VVHPGWVEASALLYRRANEQDFAI 993
           +V+P W+     ++RR  E+D+ +
Sbjct: 343 IVNPDWLFKCMSMWRRVPEEDYTV 366


>gi|397609378|gb|EJK60336.1| hypothetical protein THAOC_19327, partial [Thalassiosira oceanica]
          Length = 334

 Score = 48.1 bits (113), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 34/101 (33%), Positives = 50/101 (49%), Gaps = 17/101 (16%)

Query: 874 VIERLHKIFFSHQSLDDVDVR---NILAAEQRKILAG----CRIVFSRVFPVGEAN---- 922
           V+ RLH  F+S     D   R   ++L + +R++L G    C +VFS V P+ + N    
Sbjct: 238 VLRRLHARFYSPSPGSDAGRRTVPDLLRSMRREVLGGHPRAC-LVFSGVVPLNQQNVALR 296

Query: 923 --PHLHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVN 961
             PHL    + AE+ G      +   VTHVVA   G++K N
Sbjct: 297 VRPHL---VRYAEELGGTVLPAVGPDVTHVVAKKDGSEKCN 334


>gi|325180325|emb|CCA14728.1| putative nuclear LIM factor interactorinteracting protein hyphal
           form [Albugo laibachii Nc14]
          Length = 228

 Score = 48.1 bits (113), Expect = 0.024,   Method: Composition-based stats.
 Identities = 32/104 (30%), Positives = 52/104 (50%), Gaps = 13/104 (12%)

Query: 680 RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKE--EQDREKPHRHLFRFPHMGMWTKLR 737
           R+L LVLD+D  L++S   H  D ++      K+  E    +  R +            R
Sbjct: 31  RRLALVLDMDECLIHSIFQH--DNIYQRYPSYKDSFEISTSEGERAI---------VNKR 79

Query: 738 PGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGR 781
           PG+  FL  A+K F+++++T G ++Y   +   LDPKG +F  R
Sbjct: 80  PGLDAFLREAAKSFDLYVFTAGLRVYGEPILDALDPKGTIFKDR 123


>gi|412985397|emb|CCO18843.1| predicted protein [Bathycoccus prasinos]
          Length = 601

 Score = 48.1 bits (113), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 52/177 (29%), Positives = 80/177 (45%), Gaps = 33/177 (18%)

Query: 681 KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP------HMGMWT 734
           K  LVLDLD TL           VH  +     E++   P    F FP         +  
Sbjct: 405 KNTLVLDLDETL-----------VHSNL-----EEEEGTPD---FTFPVQFNNETHAVNV 445

Query: 735 KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDG 794
           ++RP +  F++R SK FE+ ++T   K+YA ++   LDP+ V F+ R+    D     +G
Sbjct: 446 RIRPHLEEFMKRVSKKFEVVIFTASQKVYADKLLDHLDPEHVYFSHRLFR--DSCVLVEG 503

Query: 795 DERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLL 850
           +      KDL  VLG + S  +IID+S + +     N + +E +   P       LL
Sbjct: 504 N----YLKDL-SVLGRDLSRTLIIDNSPQAFGFQVENGVPIESWYDDPTDDHLLRLL 555


>gi|242053713|ref|XP_002456002.1| hypothetical protein SORBIDRAFT_03g028730 [Sorghum bicolor]
 gi|241927977|gb|EES01122.1| hypothetical protein SORBIDRAFT_03g028730 [Sorghum bicolor]
          Length = 400

 Score = 48.1 bits (113), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 26/104 (25%), Positives = 50/104 (48%), Gaps = 14/104 (13%)

Query: 679 ARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRP 738
            + + LVLDLD TL++S   H                + +      F   +  ++ + RP
Sbjct: 222 TKHVTLVLDLDETLVHSTLDHC--------------DNADFTLEVFFNMKNHTVYVRKRP 267

Query: 739 GIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 782
            +  FLE+ +++FE+ ++T   ++YA ++   LDP G   + R+
Sbjct: 268 YLKMFLEKVAQMFEVVIFTASQRIYAEQLIDKLDPDGKYISRRI 311


>gi|332020757|gb|EGI61161.1| CTD small phosphatase-like protein 2 [Acromyrmex echinatior]
          Length = 593

 Score = 48.1 bits (113), Expect = 0.026,   Method: Compositional matrix adjust.
 Identities = 50/170 (29%), Positives = 77/170 (45%), Gaps = 34/170 (20%)

Query: 675 KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP------ 728
           K  S+ +  LVLDLD TL           VH  +   +E  D        FRFP      
Sbjct: 408 KTRSSPEFSLVLDLDETL-----------VHCSL---QELSDAA------FRFPVVFQDV 447

Query: 729 HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD 788
              ++ + RP    FLE  S L+E+ L+T   ++YA ++  +LDP   L   R+    + 
Sbjct: 448 TYTVFVRTRPYFREFLEHVSSLYEVILFTASKRVYANKLMNLLDPTRKLIKYRLFR--EH 505

Query: 789 GDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 837
               +G+      KDL  +LG + S  VIID+S + + +   N I +E +
Sbjct: 506 CVCVNGN----YIKDL-SILGRDLSKTVIIDNSPQAFGYQLENGIPIESW 550


>gi|322779051|gb|EFZ09448.1| hypothetical protein SINV_03717 [Solenopsis invicta]
          Length = 568

 Score = 48.1 bits (113), Expect = 0.026,   Method: Compositional matrix adjust.
 Identities = 47/170 (27%), Positives = 75/170 (44%), Gaps = 34/170 (20%)

Query: 675 KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP------ 728
           K  S+ +  LVLDLD TL++ +     D                      FRFP      
Sbjct: 383 KTRSSPEFSLVLDLDETLVHCSLQELSDAA--------------------FRFPVVFQDV 422

Query: 729 HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD 788
              ++ + RP    FLE  S L+E+ L+T   ++YA ++  +LDP   L   R+    + 
Sbjct: 423 TYTVFVRTRPYFREFLEHVSSLYEVILFTASKRVYANKLMNLLDPTRKLIKYRLFR--EH 480

Query: 789 GDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 837
               +G+      KDL  +LG + S  VIID+S + + +   N I +E +
Sbjct: 481 CVCVNGN----YIKDL-SILGRDLSKTVIIDNSPQAFGYQLENGIPIESW 525


>gi|428182825|gb|EKX51685.1| hypothetical protein GUITHDRAFT_65993, partial [Guillardia theta
           CCMP2712]
          Length = 179

 Score = 48.1 bits (113), Expect = 0.026,   Method: Composition-based stats.
 Identities = 32/99 (32%), Positives = 50/99 (50%), Gaps = 16/99 (16%)

Query: 685 VLDLDHTLLN-SAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTF 743
           VLDLD TL++ S +F E   +  ++  K  EQD               +W K+RP    F
Sbjct: 1   VLDLDETLVHASLEFMEQSHLQFDVTFK--EQDYH-------------VWVKIRPHCLEF 45

Query: 744 LERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 782
           LER ++ FE+ ++T    +YA ++  ++DP   L   RV
Sbjct: 46  LERLAEKFEIIVFTASQSIYADKLLNLIDPDSRLIKHRV 84


>gi|268566879|ref|XP_002639837.1| C. briggsae CBR-SCPL-3 protein [Caenorhabditis briggsae]
          Length = 294

 Score = 48.1 bits (113), Expect = 0.026,   Method: Compositional matrix adjust.
 Identities = 18/52 (34%), Positives = 33/52 (63%)

Query: 732 MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVI 783
           ++ ++RP + TFL R SK+FE+ ++T   K YA ++  +LDP+  +   R+ 
Sbjct: 101 VYVRIRPFLRTFLTRMSKVFEIIVFTASKKCYANKLCDILDPQKTIIKHRLF 152


>gi|307165882|gb|EFN60237.1| CTD small phosphatase-like protein 2 [Camponotus floridanus]
          Length = 568

 Score = 47.8 bits (112), Expect = 0.029,   Method: Compositional matrix adjust.
 Identities = 47/170 (27%), Positives = 75/170 (44%), Gaps = 34/170 (20%)

Query: 675 KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP------ 728
           K  S+ +  LVLDLD TL++ +     D                      FRFP      
Sbjct: 383 KTRSSPEFSLVLDLDETLVHCSLQELSDAA--------------------FRFPVVFQDV 422

Query: 729 HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD 788
              ++ + RP    FLE  S L+E+ L+T   ++YA ++  +LDP   L   R+    + 
Sbjct: 423 TYTVFVRTRPYFREFLEHVSSLYEVILFTASKRVYANKLMNLLDPTRKLIKYRLFR--EH 480

Query: 789 GDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 837
               +G+      KDL  +LG + S  VIID+S + + +   N I +E +
Sbjct: 481 CVCVNGN----YIKDL-SILGRDLSKTVIIDNSPQAFGYQLENGIPIESW 525


>gi|341876625|gb|EGT32560.1| hypothetical protein CAEBREN_01530 [Caenorhabditis brenneri]
          Length = 286

 Score = 47.8 bits (112), Expect = 0.029,   Method: Compositional matrix adjust.
 Identities = 19/52 (36%), Positives = 33/52 (63%)

Query: 732 MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVI 783
           ++ +LRP + TFL R +K FE+ ++T   K+YA ++  +LDP+  L   R+ 
Sbjct: 101 VYVRLRPHLRTFLNRMAKTFEIIIFTASKKVYANKLCDILDPRKNLIRHRLF 152


>gi|328874828|gb|EGG23193.1| CTD small phosphatase-like protein 2 [Dictyostelium fasciculatum]
          Length = 692

 Score = 47.8 bits (112), Expect = 0.029,   Method: Compositional matrix adjust.
 Identities = 44/151 (29%), Positives = 71/151 (47%), Gaps = 31/151 (20%)

Query: 675 KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWT 734
           K     K+ LVLDLD TL++ +     DP+          +D +      F      ++ 
Sbjct: 508 KTLDTPKISLVLDLDETLVHCS----TDPI----------EDPDLTFLVTFNAIEYKVYA 553

Query: 735 KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDP----KGVLFAGRVISRGDDGD 790
           K RP    FL +AS+LFE+ ++T   ++YA ++  ++DP    K  LF    +       
Sbjct: 554 KKRPFFEEFLVKASELFEVIIFTASQEVYANKLLNMIDPNNHVKYRLFRDSCVY------ 607

Query: 791 PFDGDERVPKSKDLEGVLGME-SAVVIIDDS 820
             +G+      KDL  +LG + S VVI+D+S
Sbjct: 608 -VEGNYL----KDL-SILGRDLSQVVIVDNS 632


>gi|224002358|ref|XP_002290851.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220974273|gb|EED92603.1| predicted protein, partial [Thalassiosira pseudonana CCMP1335]
          Length = 196

 Score = 47.8 bits (112), Expect = 0.029,   Method: Compositional matrix adjust.
 Identities = 44/157 (28%), Positives = 76/157 (48%), Gaps = 22/157 (14%)

Query: 682 LCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIW 741
           +CLVLDLD TL++      V+PV D  +    E          F      +  + RP + 
Sbjct: 19  ICLVLDLDETLVHCT----VEPVSDADMIFPVE----------FNGMEYTVHVRCRPFLT 64

Query: 742 TFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKS 801
            FLE+ S+ FE+ ++T   ++YA ++  ++DP+G     R+    D   P +G+      
Sbjct: 65  EFLEKVSEDFEVVVFTASQQVYADKLLDMIDPEGKFIKHRMFR--DSCLPVEGN----FL 118

Query: 802 KDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 837
           KDL  +LG +    V++D+S   + +   N I +E +
Sbjct: 119 KDL-TILGRDLRRAVLVDNSPHAFGYQVDNGIPIESW 154


>gi|70945368|ref|XP_742511.1| hypothetical protein [Plasmodium chabaudi chabaudi]
 gi|56521536|emb|CAH80727.1| conserved hypothetical protein [Plasmodium chabaudi chabaudi]
          Length = 359

 Score = 47.8 bits (112), Expect = 0.031,   Method: Compositional matrix adjust.
 Identities = 35/118 (29%), Positives = 57/118 (48%), Gaps = 14/118 (11%)

Query: 732 MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 791
            + K RP +  FLE  S  +E+ +YT   + YA  +  +LDP   +FA R+++R      
Sbjct: 2   YYLKFRPYVRQFLEILSLYYELSIYTNATREYADVVIAILDPDRTIFADRIVAR---CSS 58

Query: 792 FDGDERVPKSKDLEGVL-GMESAVVI-IDDSVRVW---PHNKLNLIVVERYTYFPCSR 844
            D DE    +K  E +   ++   VI  DD   VW   P +  +++  E Y +F  S+
Sbjct: 59  VDRDE----NKHFEKIYPNVDPKYVIAFDDRKDVWYDIPDS--HILRAEHYNFFELSK 110


>gi|302833726|ref|XP_002948426.1| hypothetical protein VOLCADRAFT_58281 [Volvox carteri f.
           nagariensis]
 gi|300266113|gb|EFJ50301.1| hypothetical protein VOLCADRAFT_58281 [Volvox carteri f.
           nagariensis]
          Length = 215

 Score = 47.8 bits (112), Expect = 0.031,   Method: Composition-based stats.
 Identities = 46/160 (28%), Positives = 77/160 (48%), Gaps = 22/160 (13%)

Query: 679 ARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRP 738
           AR+  LVLDLD TL++S+            L   +  D   P   +F      ++ + RP
Sbjct: 33  ARRKTLVLDLDETLVHSS------------LEAVDRSDFSFPV--IFNGTEHQVYVRQRP 78

Query: 739 GIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERV 798
            +  F+ R + LFE+ ++T   ++YA ++  +LDP+  L   R+    D     DG+   
Sbjct: 79  YLREFMVRVAALFEVVVFTASQRIYAEKLLDILDPQQQLVRHRIYR--DSCVVVDGNYL- 135

Query: 799 PKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERY 837
              KDL  +LG + A  VI+D+S + +     N I +E +
Sbjct: 136 ---KDLS-ILGRDLATTVIVDNSPQAFGFQVDNGIPIESW 171


>gi|307194093|gb|EFN76554.1| CTD small phosphatase-like protein 2 [Harpegnathos saltator]
          Length = 546

 Score = 47.8 bits (112), Expect = 0.033,   Method: Compositional matrix adjust.
 Identities = 50/170 (29%), Positives = 77/170 (45%), Gaps = 34/170 (20%)

Query: 675 KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP------ 728
           K  S+ +  LVLDLD TL           VH  +   +E  D        FRFP      
Sbjct: 361 KTRSSPEFSLVLDLDETL-----------VHCSL---QELSDAA------FRFPVVFQDV 400

Query: 729 HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD 788
              ++ + RP    FLE  S L+E+ L+T   ++YA ++  +LDP   L   R+    + 
Sbjct: 401 TYTVFVRTRPYFREFLEHVSSLYEVILFTASKRVYANKLMNLLDPTRKLIKYRLFR--EH 458

Query: 789 GDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 837
               +G+      KDL  +LG + S  VIID+S + + +   N I +E +
Sbjct: 459 CVCVNGN----YIKDL-SILGRDLSKTVIIDNSPQAFGYQLENGIPIESW 503


>gi|170050634|ref|XP_001861399.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167872200|gb|EDS35583.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 627

 Score = 47.8 bits (112), Expect = 0.033,   Method: Compositional matrix adjust.
 Identities = 29/100 (29%), Positives = 50/100 (50%), Gaps = 14/100 (14%)

Query: 675 KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWT 734
           K  S+ +  LVLDLD TL++ +               +E  D       LF+     ++ 
Sbjct: 482 KTRSSPEFSLVLDLDETLVHCSL--------------QELSDASFKFPVLFQECQYTVFV 527

Query: 735 KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 774
           + RP    FLE+ S++FE+ L+T   ++YA ++  +LDP+
Sbjct: 528 RTRPFFREFLEKVSQIFEVILFTASKRVYADKLLNLLDPE 567


>gi|225718796|gb|ACO15244.1| Probable C-terminal domain small phosphatase [Caligus clemensi]
          Length = 314

 Score = 47.8 bits (112), Expect = 0.033,   Method: Compositional matrix adjust.
 Identities = 32/97 (32%), Positives = 50/97 (51%), Gaps = 14/97 (14%)

Query: 678 SARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLR 737
           S+ +  LVLDLD TL++ +   E+D             D       +F+     ++ + R
Sbjct: 131 SSPRFSLVLDLDETLVHCS-LQELD-------------DASLSFPVVFQDTTYRVFVRTR 176

Query: 738 PGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 774
           P I  FLER SK FE+ L+T   K+YA ++  +LDP+
Sbjct: 177 PRIREFLERVSKNFEVTLFTASKKVYADKLLNLLDPE 213


>gi|308801351|ref|XP_003077989.1| double-stranded RNA-binding domain (ISS) [Ostreococcus tauri]
 gi|116056440|emb|CAL52729.1| double-stranded RNA-binding domain (ISS) [Ostreococcus tauri]
          Length = 793

 Score = 47.8 bits (112), Expect = 0.034,   Method: Compositional matrix adjust.
 Identities = 38/123 (30%), Positives = 53/123 (43%), Gaps = 28/123 (22%)

Query: 728 PHMGMWTKLRPGIWTFL-------ERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAG 780
           P   M   +RPG W  L       +R SK  E  + TM N  YA EM ++LDP G +F  
Sbjct: 334 PATAMLVHIRPG-WGELRSYLSGSDRGSKRAETFVCTMANIDYAREMCRLLDPHGTVF-- 390

Query: 781 RVISRGDDGDPFDGDERVP-----KSKDLEGVLGM---ESAVVIIDDSVRVW-PHNKLNL 831
                    DP   D+R+      + K L    G+       VI+DD   VW P  + ++
Sbjct: 391 ---------DPAQLDKRIKSVKPDELKSLSDTCGLHFPSELAVIVDDRTAVWEPSAQSHI 441

Query: 832 IVV 834
           + V
Sbjct: 442 LAV 444


>gi|395517551|ref|XP_003762939.1| PREDICTED: CTD small phosphatase-like protein-like [Sarcophilus
           harrisii]
          Length = 461

 Score = 47.8 bits (112), Expect = 0.035,   Method: Compositional matrix adjust.
 Identities = 49/188 (26%), Positives = 78/188 (41%), Gaps = 31/188 (16%)

Query: 655 DDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEE 714
           D +Q   I     + L  + K+    K C+V+DLD TL++S+      P+ +       E
Sbjct: 186 DQRQVIPIPSPSAKYLLPELKLSDYGKKCMVIDLDETLVHSS----FKPISNADFIVPVE 241

Query: 715 QDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 774
            D      ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  
Sbjct: 242 IDGTVHQVYVLKRPHVD----------EFLQRMGQLFECVLFTASLAKYADPVADLLDRW 291

Query: 775 GVLFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKL 829
           GV  A       V  RG+              KDL  +    S V+IID+S   +  +  
Sbjct: 292 GVFRARLFRESCVFHRGN------------YVKDLSQLGRELSKVIIIDNSPASYIFHPE 339

Query: 830 NLIVVERY 837
           N + V+ +
Sbjct: 340 NAVPVQSW 347


>gi|124513824|ref|XP_001350268.1| protein phosphatase, putative [Plasmodium falciparum 3D7]
 gi|23615685|emb|CAD52677.1| protein phosphatase, putative [Plasmodium falciparum 3D7]
          Length = 1288

 Score = 47.4 bits (111), Expect = 0.037,   Method: Compositional matrix adjust.
 Identities = 34/127 (26%), Positives = 54/127 (42%), Gaps = 16/127 (12%)

Query: 725  FRFPHMGM--WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 782
            F  P      + K RP +  FL+  S  +E+ +YT   + YA  +  +LDP   +F+ R+
Sbjct: 888  FYLPQYNFFYYLKFRPYVRQFLQILSLYYELAIYTNATREYADVVIAILDPDRTIFSDRI 947

Query: 783  ISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVW-----PHNKLNLIVVERY 837
            ++R    D    DE    S+    V      V+  DD   VW      H    ++  E Y
Sbjct: 948  VARCSSTDR---DENKYFSRIYPNV--DPKYVIAFDDRKDVWIDIPQSH----ILKAEHY 998

Query: 838  TYFPCSR 844
             +F  S+
Sbjct: 999  NFFELSK 1005


>gi|149392655|gb|ABR26130.1| ctd-phosphatase-like protein [Oryza sativa Indica Group]
          Length = 187

 Score = 47.4 bits (111), Expect = 0.041,   Method: Composition-based stats.
 Identities = 43/154 (27%), Positives = 78/154 (50%), Gaps = 28/154 (18%)

Query: 675 KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWT 734
           K   ++++ LVLDLD TL++S   H  D V D  L+              F   +  ++ 
Sbjct: 5   KSARSKQITLVLDLDETLVHSTLDH-CDNV-DFTLQV------------FFNMKNHTVYV 50

Query: 735 KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG---DDGDP 791
           + RP +  FLE+ +++FE+ ++T   ++YA ++   LDP   L + R+        +G  
Sbjct: 51  RQRPHLKMFLEKVAQMFELVIFTASQRIYAEQLIDRLDPDERLISHRIYRESCIFSEG-- 108

Query: 792 FDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVW 824
                    +KDL  +LG++ A VVI+D++ +V+
Sbjct: 109 -------CYTKDLT-ILGVDLAKVVIVDNTPQVF 134


>gi|290986065|ref|XP_002675745.1| NLI interacting factor domain-containing protein [Naegleria
           gruberi]
 gi|284089343|gb|EFC43001.1| NLI interacting factor domain-containing protein [Naegleria
           gruberi]
          Length = 510

 Score = 47.4 bits (111), Expect = 0.041,   Method: Compositional matrix adjust.
 Identities = 57/231 (24%), Positives = 96/231 (41%), Gaps = 39/231 (16%)

Query: 645 GDVEHLF--EGYDDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHE-- 700
           GD+ H F  E   ++++A  Q  +T  L  Q+     +K  LVLDLD TL++S   H   
Sbjct: 271 GDLIHQFYEEVKTNKKRAHKQAPQTALLPPQRPHVQGKK-TLVLDLDETLVHSVFVHTDQ 329

Query: 701 ---VDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYT 757
              V P+         E D      ++ +          RPG+  +L    + +E+ ++T
Sbjct: 330 ADFVIPI---------EMDGRTYSCYVLK----------RPGVDEYLRELGQYYEIIIFT 370

Query: 758 MGNKLYATEMAKVLDPKGVLFAGRVISR--GDDGDPFDGDERVPKSKDLEGVLGMESAVV 815
               LYA  +  +LD  GV+  GR+        GD +         KDL  +       +
Sbjct: 371 ASLSLYANPLLDILDKHGVI-EGRLFREHCTKVGDTY--------IKDLSRLGRDLDQTI 421

Query: 816 IIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERSEDG 866
           I+D+S   +     N +    + Y   + R+ GLL   L  ++ ++   DG
Sbjct: 422 IVDNSPSCYAMQPQNALACTTW-YDDPNDRELGLLADCLKRLEREKAVYDG 471


>gi|351697455|gb|EHB00374.1| CTD small phosphatase-like protein [Heterocephalus glaber]
          Length = 356

 Score = 47.4 bits (111), Expect = 0.042,   Method: Compositional matrix adjust.
 Identities = 46/186 (24%), Positives = 78/186 (41%), Gaps = 31/186 (16%)

Query: 657 QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 716
           ++   +QK   + L  +  +    K C+V+DLD TL++S+      P+ +       E D
Sbjct: 162 EENGGLQKPPAKYLLPEVTVLDYGKKCVVIDLDETLVHSS----FKPISNADFIVPVEID 217

Query: 717 REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 776
                 ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  GV
Sbjct: 218 GTIHQVYVLKRPHVD----------EFLQRMGQLFECVLFTASLAKYADPVADLLDRWGV 267

Query: 777 LFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNL 831
             A       V  RG+              KDL  +    S V+I+D+S   +  +  N 
Sbjct: 268 FRARLFRESCVFHRGN------------YVKDLSRLGRELSKVIIVDNSPASYIFHPENA 315

Query: 832 IVVERY 837
           + V+ +
Sbjct: 316 VPVQSW 321


>gi|71649764|ref|XP_813595.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
 gi|70878493|gb|EAN91744.1| hypothetical protein, conserved [Trypanosoma cruzi]
          Length = 446

 Score = 47.4 bits (111), Expect = 0.044,   Method: Compositional matrix adjust.
 Identities = 47/173 (27%), Positives = 79/173 (45%), Gaps = 24/173 (13%)

Query: 662 IQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDP-VHDEILRKKEEQDREKP 720
           I K     L  Q + +  +K  L+LDLD TL++S+    + P  HD IL  K E +    
Sbjct: 239 IAKNHASLLPLQMRQYHGKK-TLILDLDETLVHSSL--TLQPKQHDLILSMKTEPEVTT- 294

Query: 721 HRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAG 780
                      ++   RP +  F++  + LFE+ ++T    +Y   +   +DP+G+L + 
Sbjct: 295 -----------IYVAYRPFLHEFIQAVAGLFEVVIFTASVSMYCNPVMDAVDPEGILGSL 343

Query: 781 RVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLI 832
           R+    +     +G       KDL  +LG E S V I+D+S   +   + N I
Sbjct: 344 RLYR--EHCSILNG----AYVKDL-SLLGRELSQVAIVDNSPVTYLFQQRNAI 389


>gi|145508145|ref|XP_001440022.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124407228|emb|CAK72625.1| unnamed protein product [Paramecium tetraurelia]
          Length = 506

 Score = 47.4 bits (111), Expect = 0.045,   Method: Compositional matrix adjust.
 Identities = 37/145 (25%), Positives = 62/145 (42%), Gaps = 25/145 (17%)

Query: 684 LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTF 743
           LV DLD TL++    +     H  ++             H+   P   +   +RP     
Sbjct: 313 LVFDLDETLIHCNDINNNSTDHTTVI-------------HIPNEPETEIRFNIRPHCQQM 359

Query: 744 LERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG----DDGDPFDGDERVP 799
           L+  S+ +E+ L+T   K YA ++ + +DPKG LF+ R         ++G     D RV 
Sbjct: 360 LKALSQYYELILFTASYKEYADKILEYIDPKGNLFSYRFYRESCLELEEG-LLVKDLRVI 418

Query: 800 KSKDLEGVLGMESAVVIIDDSVRVW 824
           + + LE        + IID+S   +
Sbjct: 419 EGRKLEN-------MAIIDNSAYCY 436


>gi|66803905|ref|XP_635771.1| CTD small phosphatase-like protein 2 [Dictyostelium discoideum AX4]
 gi|74851880|sp|Q54GB2.1|CTSL2_DICDI RecName: Full=CTD small phosphatase-like protein 2;
           Short=CTDSP-like 2
 gi|60464148|gb|EAL62309.1| CTD small phosphatase-like protein 2 [Dictyostelium discoideum AX4]
          Length = 567

 Score = 47.0 bits (110), Expect = 0.049,   Method: Compositional matrix adjust.
 Identities = 44/150 (29%), Positives = 70/150 (46%), Gaps = 29/150 (19%)

Query: 675 KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHL---FRFPHMG 731
           K  S+ K+ LVLDLD TL++ +     +P+             E+PH      F      
Sbjct: 384 KEHSSPKISLVLDLDETLVHCS----TEPL-------------EQPHLTFPVFFNNTEYQ 426

Query: 732 MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 791
           ++ K RP    FL + S +FE+ ++T   ++YA ++  ++DP   +   +     D    
Sbjct: 427 VFAKKRPFFEEFLHKVSDIFEVIIFTASQEVYANKLLNMIDPNNKI---KYRLYRDSCVY 483

Query: 792 FDGDERVPKSKDLEGVLGME-SAVVIIDDS 820
            DG+      KDL  VLG +   VVIID+S
Sbjct: 484 VDGNYL----KDL-SVLGRDLKQVVIIDNS 508


>gi|407846470|gb|EKG02580.1| hypothetical protein TCSYLVIO_006391 [Trypanosoma cruzi]
          Length = 447

 Score = 47.0 bits (110), Expect = 0.051,   Method: Compositional matrix adjust.
 Identities = 47/173 (27%), Positives = 79/173 (45%), Gaps = 24/173 (13%)

Query: 662 IQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDP-VHDEILRKKEEQDREKP 720
           I K     L  Q + +  +K  L+LDLD TL++S+    + P  HD IL  K E +    
Sbjct: 240 IAKNHASLLPLQMRQYHGKK-TLILDLDETLVHSSL--TLQPKQHDLILSMKTEPEVTT- 295

Query: 721 HRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAG 780
                      ++   RP +  F++  + LFE+ ++T    +Y   +   +DP+G+L + 
Sbjct: 296 -----------IYVAYRPFLHEFIQAVAGLFEVVIFTASVSMYCNPVMDAVDPEGILGSL 344

Query: 781 RVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLI 832
           R+    +     +G       KDL  +LG E S V I+D+S   +   + N I
Sbjct: 345 RLYR--EHCSILNG----AYVKDL-SLLGRELSQVAIVDNSPVTYLFQQRNAI 390


>gi|413917756|gb|AFW57688.1| hypothetical protein ZEAMMB73_437679 [Zea mays]
          Length = 186

 Score = 47.0 bits (110), Expect = 0.055,   Method: Composition-based stats.
 Identities = 21/58 (36%), Positives = 33/58 (56%)

Query: 725 FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 782
           F F    ++ + RP +  FL+R + +FE  ++T    +YA ++  VLDPK  LF  RV
Sbjct: 33  FNFREHTIYVRCRPYLKEFLDRVASVFETIIFTASQSIYAEQLLNVLDPKRKLFRHRV 90


>gi|299470416|emb|CBN80177.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 613

 Score = 47.0 bits (110), Expect = 0.055,   Method: Compositional matrix adjust.
 Identities = 18/50 (36%), Positives = 32/50 (64%)

Query: 732 MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGR 781
            + +LRPG+  FLE+ + ++E+ ++T   + YA  +  +LDP G +FA R
Sbjct: 446 YYVQLRPGLARFLEKVAAIYELVVWTASGRSYADAIIDLLDPAGDIFAER 495


>gi|224031885|gb|ACN35018.1| unknown [Zea mays]
          Length = 190

 Score = 47.0 bits (110), Expect = 0.055,   Method: Composition-based stats.
 Identities = 21/58 (36%), Positives = 33/58 (56%)

Query: 725 FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 782
           F F    ++ + RP +  FL+R + +FE  ++T    +YA ++  VLDPK  LF  RV
Sbjct: 33  FNFREHTIYVRCRPYLKEFLDRVASVFETIIFTASQSIYAEQLLNVLDPKRKLFRHRV 90


>gi|440293350|gb|ELP86476.1| carboxy-terminal domain RNA polymerase II polypeptide A small
           phosphatase, putative [Entamoeba invadens IP1]
          Length = 213

 Score = 47.0 bits (110), Expect = 0.057,   Method: Composition-based stats.
 Identities = 39/160 (24%), Positives = 70/160 (43%), Gaps = 25/160 (15%)

Query: 681 KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 740
           +L ++ DLD TL+++      D  H     K   Q++E               T +RPG 
Sbjct: 46  RLTVIFDLDETLIHTHSLLPEDSKHSRETCKVVVQNKEYT-------------TSIRPGA 92

Query: 741 WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS---RGDDGDPFDGDER 797
             FL + SK  E+ L+T   ++YA ++   ++  G +F  ++     +   G  +    +
Sbjct: 93  IQFLRQLSKTCEVVLFTASKQVYADQIIDYMEKDGKIFEHKLYQQSCKNKFGRVYKDATK 152

Query: 798 VPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERY 837
           +   +D++        VVI DD   VW   +  L+V +RY
Sbjct: 153 L--GRDIKN-------VVIFDDCELVWTMTQDKLVVCKRY 183


>gi|401624712|gb|EJS42762.1| psr2p [Saccharomyces arboricola H-6]
          Length = 391

 Score = 47.0 bits (110), Expect = 0.059,   Method: Compositional matrix adjust.
 Identities = 33/105 (31%), Positives = 49/105 (46%), Gaps = 19/105 (18%)

Query: 675 KMFSARKLCLVLDLDHTLLNSA--KFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGM 732
           + F  RK CLVLDLD TL++S+    H  D      +   E  D+              +
Sbjct: 216 QAFQQRK-CLVLDLDETLVHSSFKYMHTAD-----FVLPVEIDDQVH-----------NV 258

Query: 733 WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 777
           +   RPG+  FL R S+L+E+ ++T     YA  +   LDP G +
Sbjct: 259 YVIKRPGVDEFLHRVSQLYEVVVFTASVSRYANPLLDTLDPNGTI 303


>gi|389583329|dbj|GAB66064.1| hypothetical protein PCYB_082250 [Plasmodium cynomolgi strain B]
          Length = 821

 Score = 47.0 bits (110), Expect = 0.059,   Method: Compositional matrix adjust.
 Identities = 33/95 (34%), Positives = 44/95 (46%), Gaps = 11/95 (11%)

Query: 748 SKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGV 807
           +K +E++LYTMG   +A     +LDP    F  RV SR D          V   K L  +
Sbjct: 2   NKKYEIYLYTMGTLEHAKSCLLLLDPLKKFFGNRVFSRKD---------SVNGLKHLNRI 52

Query: 808 L-GMESAVVIIDDSVRVWPHNKLNLIVVERYTYFP 841
           L    S  + IDDS  +W  +  + I V  Y YFP
Sbjct: 53  LPTYRSVSLCIDDSDYMWKESS-SCIKVHGYNYFP 86


>gi|302824592|ref|XP_002993938.1| hypothetical protein SELMODRAFT_137927 [Selaginella moellendorffii]
 gi|300138210|gb|EFJ04985.1| hypothetical protein SELMODRAFT_137927 [Selaginella moellendorffii]
          Length = 163

 Score = 46.6 bits (109), Expect = 0.062,   Method: Compositional matrix adjust.
 Identities = 36/121 (29%), Positives = 60/121 (49%), Gaps = 9/121 (7%)

Query: 737 RPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDE 796
           RPG+  FL+  S+++E+ +++     Y  ++   LDP G +F+   +  G D D   G +
Sbjct: 32  RPGLDRFLKDMSQVYEIVVFSASGASYVKKIVSSLDPTGEIFSA--VFTGSDTDWLSG-Q 88

Query: 797 RVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLE 856
           RV   + L         +V IDD+  ++P+N  N I V  +   P +    G L P LLE
Sbjct: 89  RVKDLRKLN-----RDKIVWIDDNASLYPYNPKNGIQVPPFHGDP-NDSILGALTPLLLE 142

Query: 857 I 857
           +
Sbjct: 143 V 143


>gi|321474368|gb|EFX85333.1| hypothetical protein DAPPUDRAFT_300415 [Daphnia pulex]
          Length = 867

 Score = 46.6 bits (109), Expect = 0.062,   Method: Compositional matrix adjust.
 Identities = 23/65 (35%), Positives = 35/65 (53%), Gaps = 3/65 (4%)

Query: 930 QTAEQFGAVCT--KHIDDQVTHVVANSL-GTDKVNWALSTGRFVVHPGWVEASALLYRRA 986
           QT E  G +    KH D   TH+V   L   DK+  +++ G+++VHPGW+  S   Y   
Sbjct: 662 QTIEVLGGIAIDGKHYDSTCTHLVVAKLECNDKLMTSIAAGKWIVHPGWIAKSEQTYHFV 721

Query: 987 NEQDF 991
           +E+ F
Sbjct: 722 DERIF 726


>gi|157873633|ref|XP_001685322.1| conserved hypothetical protein [Leishmania major strain Friedlin]
 gi|68128394|emb|CAJ08450.1| conserved hypothetical protein [Leishmania major strain Friedlin]
          Length = 231

 Score = 46.6 bits (109), Expect = 0.063,   Method: Composition-based stats.
 Identities = 34/113 (30%), Positives = 58/113 (51%), Gaps = 7/113 (6%)

Query: 728 PHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD 787
           P +  +   RPG+  FL + ++ ++M ++T G  LYA  + + L P  ++   R  +R  
Sbjct: 60  PSVTYYAFRRPGLNEFLYQCAEHYDMRIFTAGEDLYARTLLRWLLPDNLIDESRWYTR-- 117

Query: 788 DGDPFDGDERVPKSKD-LEGVLGMESAVVIIDDSV--RVWPHNKLNLIVVERY 837
           D    DG  R+ K+   L+G    E A +I+DDS    V+PH   N + + R+
Sbjct: 118 DACVGDGYGRLIKNLSMLDGFQFEERATLILDDSAPDNVYPHQ--NALAIPRF 168


>gi|301623726|ref|XP_002941162.1| PREDICTED: CTD small phosphatase-like protein 2-A-like [Xenopus
           (Silurana) tropicalis]
          Length = 353

 Score = 46.6 bits (109), Expect = 0.075,   Method: Compositional matrix adjust.
 Identities = 53/203 (26%), Positives = 88/203 (43%), Gaps = 41/203 (20%)

Query: 657 QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNS-------AKFHEVDPVHDEIL 709
           QQ    Q++R + +  + +  SA +  LVLDLD  L++S       A F  + P  D   
Sbjct: 153 QQTPCDQRQRGKDIPFKTR--SAPESTLVLDLDEILVDSSLLPLTGADFTFLIPFQDT-- 208

Query: 710 RKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAK 769
                              +  ++ KLRP    FLE   K++E+ ++T   K YA ++  
Sbjct: 209 -------------------YYKVYVKLRPHAMEFLETLCKVYEIFVFTTAKKEYAEKILD 249

Query: 770 VLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNK 828
           +LDP+  L   R+    D     +G       KDL G+L  + A  V +D +    P++ 
Sbjct: 250 LLDPQKKLIRHRLFQ--DQCLCVEGHY----VKDL-GILQRDLAKTVALDTAPHTIPYHL 302

Query: 829 LNLIVVERYTYFPCSRRQFGLLG 851
            N I ++ +     S++  GLL 
Sbjct: 303 ANRIPIQSWK---GSKKDRGLLS 322


>gi|350421965|ref|XP_003493014.1| PREDICTED: hypothetical protein LOC100746789 isoform 1 [Bombus
           impatiens]
          Length = 558

 Score = 46.6 bits (109), Expect = 0.076,   Method: Compositional matrix adjust.
 Identities = 47/170 (27%), Positives = 75/170 (44%), Gaps = 34/170 (20%)

Query: 675 KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP------ 728
           K  S+ +  LVLDLD TL++ +     D                      FRFP      
Sbjct: 373 KTRSSPEFSLVLDLDETLVHCSLQELSDAA--------------------FRFPVVFQDV 412

Query: 729 HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD 788
              ++ + RP    FLE  S L+E+ L+T   ++YA ++  +LDP   L   R+    + 
Sbjct: 413 TYTVFVRTRPYFREFLEHVSSLYEVILFTASKRVYANKLMNLLDPTRKLIKYRLFR--EH 470

Query: 789 GDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 837
               +G+      KDL  +LG + S  VIID+S + + +   N I +E +
Sbjct: 471 CVCVNGN----YIKDL-SILGRDLSKTVIIDNSPQAFGYQLENGIPIESW 515


>gi|219126682|ref|XP_002183580.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217404817|gb|EEC44762.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 224

 Score = 46.6 bits (109), Expect = 0.076,   Method: Composition-based stats.
 Identities = 34/108 (31%), Positives = 51/108 (47%), Gaps = 14/108 (12%)

Query: 675 KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWT 734
           K   A  + LVLDLD TL++      V+PV +  L    +        H+          
Sbjct: 37  KTAGAPPITLVLDLDETLVHCT----VEPVENADLTFPVDFHNVTYQVHV---------- 82

Query: 735 KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 782
           +LRP ++TFL R    +E+ L+T   K+YA E+   +DP G  F  R+
Sbjct: 83  RLRPHLFTFLSRIEGQYEIVLFTASQKVYANELLNRIDPDGKYFHHRL 130


>gi|145549970|ref|XP_001460664.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124428494|emb|CAK93267.1| unnamed protein product [Paramecium tetraurelia]
          Length = 346

 Score = 46.6 bits (109), Expect = 0.077,   Method: Compositional matrix adjust.
 Identities = 37/146 (25%), Positives = 67/146 (45%), Gaps = 26/146 (17%)

Query: 679 ARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRP 738
            +K  + +DLD TL           VH E L         KP+R  + F ++   T +RP
Sbjct: 160 TKKYSIAIDLDETL-----------VHSEEL---------KPNRR-YDFQNLQFGTFIRP 198

Query: 739 GIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERV 798
               FL+  +K   + ++T  N  YAT + ++LDP+  LF G      D       + +V
Sbjct: 199 YCLQFLQLLNKHANLFVFTSSNIKYATTIMQILDPQKDLFQGLFFR--DHCTILQDNSQV 256

Query: 799 PKSKDLEGVLGMESAVVIIDDSVRVW 824
              KD++ +    + +++ID++ + +
Sbjct: 257 ---KDIKIISSDLTKIILIDNNPQCF 279


>gi|240273650|gb|EER37170.1| RNA Polymerase II CTD phosphatase Fcp1 [Ajellomyces capsulatus
           H143]
          Length = 592

 Score = 46.6 bits (109), Expect = 0.079,   Method: Compositional matrix adjust.
 Identities = 38/119 (31%), Positives = 54/119 (45%), Gaps = 24/119 (20%)

Query: 656 DQQKAAIQKERTRRLEEQKKMFSARKLCL--VLDLDHTLLNSAKFHEVDPVHDEILRKKE 713
           D     + +    R+EE  K        L  V+DLD T++++     VDP   E      
Sbjct: 245 DNSALTVSENEATRVEEDAKRRLLSSRRLSLVVDLDQTIIHAT----VDPTVAEW----- 295

Query: 714 EQDREKPHRHLFR----------FPHM-GMW--TKLRPGIWTFLERASKLFEMHLYTMG 759
           +QD++ P+    +           P M G W   KLRPG+  FL   S LFE+H+YTMG
Sbjct: 296 QQDKDNPNHEAVKDVRAFQLVDDGPGMKGCWYYIKLRPGLEEFLRNISTLFELHIYTMG 354


>gi|349603764|gb|AEP99509.1| CTD small phosphatase-like protein 2-like protein, partial [Equus
           caballus]
          Length = 159

 Score = 46.6 bits (109), Expect = 0.079,   Method: Composition-based stats.
 Identities = 35/113 (30%), Positives = 58/113 (51%), Gaps = 20/113 (17%)

Query: 732 MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGR------VISR 785
           ++ +LRP    FLER S+++E+ L+T   K+YA ++  +LDPK  L   R      V  +
Sbjct: 17  VYVRLRPFFREFLERMSQMYEIILFTASKKVYADKLLNILDPKKQLVRHRLFREHCVCVQ 76

Query: 786 GDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 837
           G+              KDL  +LG + S  +IID+S + + +   N I +E +
Sbjct: 77  GN------------YIKDLN-ILGRDLSKTIIIDNSPQAFAYQLSNGIPIESW 116


>gi|32564286|ref|NP_871854.1| Protein SCPL-3, isoform b [Caenorhabditis elegans]
 gi|351059572|emb|CCD67162.1| Protein SCPL-3, isoform b [Caenorhabditis elegans]
          Length = 312

 Score = 46.6 bits (109), Expect = 0.079,   Method: Compositional matrix adjust.
 Identities = 17/43 (39%), Positives = 30/43 (69%)

Query: 732 MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 774
           ++ +LRP + TFL R +K FE+ ++T   K+YA ++  +LDP+
Sbjct: 101 VYVRLRPHLRTFLSRMAKTFEIIIFTASKKVYANKLCDILDPR 143


>gi|407407114|gb|EKF31076.1| hypothetical protein MOQ_005093 [Trypanosoma cruzi marinkellei]
          Length = 463

 Score = 46.2 bits (108), Expect = 0.082,   Method: Compositional matrix adjust.
 Identities = 46/173 (26%), Positives = 79/173 (45%), Gaps = 24/173 (13%)

Query: 662 IQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDP-VHDEILRKKEEQDREKP 720
           I K     L  Q + +  +K  L+LDLD TL++S+    + P  HD +L  K E +    
Sbjct: 256 IAKNHASLLPLQMRQYHGKK-TLILDLDETLVHSSL--TLQPKQHDLVLSMKTEPEITT- 311

Query: 721 HRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAG 780
                      ++   RP +  F++  + LFE+ ++T    +Y   +   +DP+G+L + 
Sbjct: 312 -----------IYVAYRPFLHEFIQAVAGLFEVVIFTASVSMYCNPVMDAVDPEGILGSL 360

Query: 781 RVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLI 832
           R+    +     +G       KDL  +LG E S V I+D+S   +   + N I
Sbjct: 361 RLYR--EHCSILNG----AYVKDL-SLLGRELSQVAIVDNSPVTYLFQQRNAI 406


>gi|444322726|ref|XP_004182004.1| hypothetical protein TBLA_0H01990 [Tetrapisispora blattae CBS 6284]
 gi|387515050|emb|CCH62485.1| hypothetical protein TBLA_0H01990 [Tetrapisispora blattae CBS 6284]
          Length = 688

 Score = 46.2 bits (108), Expect = 0.083,   Method: Compositional matrix adjust.
 Identities = 36/107 (33%), Positives = 50/107 (46%), Gaps = 29/107 (27%)

Query: 673 QKKMFSARKLCLVLDLDHTL-------LNSAKFHEVDPVHDEILRKKEEQDREKPHRHLF 725
           Q ++FS +K CL+LDLD TL       L SA F  V PV  +          E+ H    
Sbjct: 511 QNQIFSGKK-CLILDLDETLVHSSFKYLTSADF--VIPVDID----------EQIH---- 553

Query: 726 RFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLD 772
                 ++   RPG+  FLE  SK+FE+ ++T     Y   +  VLD
Sbjct: 554 -----NVYVIKRPGVDQFLETVSKIFEVVVFTASVSRYGDPLLDVLD 595


>gi|443696004|gb|ELT96785.1| hypothetical protein CAPTEDRAFT_124156, partial [Capitella teleta]
          Length = 209

 Score = 46.2 bits (108), Expect = 0.086,   Method: Composition-based stats.
 Identities = 32/108 (29%), Positives = 52/108 (48%), Gaps = 14/108 (12%)

Query: 675 KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWT 734
           K  S  +  LVLDLD TL++ +                E +D       LF+     ++ 
Sbjct: 24  KTRSTPEFSLVLDLDETLVHCSL--------------NELEDAAFSFPVLFQDVTYQVFV 69

Query: 735 KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 782
           + RP    FLER +K+FE+ ++T   K+YA ++  +LDP+  L   R+
Sbjct: 70  RTRPRFREFLERVAKIFEVTVFTASKKVYANKLLNLLDPEKKLIRHRL 117


>gi|350421968|ref|XP_003493015.1| PREDICTED: hypothetical protein LOC100746789 isoform 2 [Bombus
           impatiens]
          Length = 457

 Score = 46.2 bits (108), Expect = 0.086,   Method: Compositional matrix adjust.
 Identities = 48/170 (28%), Positives = 77/170 (45%), Gaps = 34/170 (20%)

Query: 675 KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP------ 728
           K  S+ +  LVLDLD TL++ +               +E  D        FRFP      
Sbjct: 272 KTRSSPEFSLVLDLDETLVHCSL--------------QELSDAA------FRFPVVFQDV 311

Query: 729 HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDD 788
              ++ + RP    FLE  S L+E+ L+T   ++YA ++  +LDP   L   R+    + 
Sbjct: 312 TYTVFVRTRPYFREFLEHVSSLYEVILFTASKRVYANKLMNLLDPTRKLIKYRLFR--EH 369

Query: 789 GDPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 837
               +G+      KDL  +LG + S  VIID+S + + +   N I +E +
Sbjct: 370 CVCVNGN----YIKDL-SILGRDLSKTVIIDNSPQAFGYQLENGIPIESW 414


>gi|330794863|ref|XP_003285496.1| hypothetical protein DICPUDRAFT_91512 [Dictyostelium purpureum]
 gi|325084587|gb|EGC38012.1| hypothetical protein DICPUDRAFT_91512 [Dictyostelium purpureum]
          Length = 558

 Score = 46.2 bits (108), Expect = 0.086,   Method: Compositional matrix adjust.
 Identities = 43/148 (29%), Positives = 68/148 (45%), Gaps = 37/148 (25%)

Query: 681 KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHL---FRFPHMGMWTKLR 737
           K+ LVLDLD TL++ +     +P++             +PH      F      ++ K R
Sbjct: 381 KISLVLDLDETLVHCS----TEPLN-------------QPHLIFPVFFNNTEYQVFAKKR 423

Query: 738 PGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDP----KGVLFAGRVISRGDDGDPFD 793
           P    FL + S +FE+ ++T   ++YA ++  ++DP    K  LF    +    DG+   
Sbjct: 424 PFFEEFLHKVSTIFEVIIFTASQEVYANKLLNIIDPCKKIKHRLFRDSCVYV--DGNYL- 480

Query: 794 GDERVPKSKDLEGVLGME-SAVVIIDDS 820
                   KDL  VLG +   VVIID+S
Sbjct: 481 --------KDL-SVLGRDLKQVVIIDNS 499


>gi|401411253|ref|XP_003885074.1| hypothetical protein NCLIV_054710 [Neospora caninum Liverpool]
 gi|325119493|emb|CBZ55046.1| hypothetical protein NCLIV_054710 [Neospora caninum Liverpool]
          Length = 630

 Score = 46.2 bits (108), Expect = 0.090,   Method: Compositional matrix adjust.
 Identities = 34/98 (34%), Positives = 51/98 (52%), Gaps = 16/98 (16%)

Query: 681 KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR-HLFRFPHMGMWTKLRPG 739
           +  LVLDLD TL++S+ F  V      I  + E     KPH+ H+ +          RPG
Sbjct: 470 RTTLVLDLDETLVHSS-FRPVSVAAFVITVEVEG----KPHKIHVCK----------RPG 514

Query: 740 IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 777
           +  FLE  S L+E+ ++T   + YA  +  +LDPKG+ 
Sbjct: 515 VDRFLEVVSSLYEVVIFTASLQTYADPLIDLLDPKGLC 552


>gi|17509983|ref|NP_491348.1| Protein SCPL-3, isoform a [Caenorhabditis elegans]
 gi|75023288|sp|Q9N4V4.1|SCPL3_CAEEL RecName: Full=CTD small phosphatase-like protein 3;
           Short=CTDSP-like 3
 gi|351059571|emb|CCD67161.1| Protein SCPL-3, isoform a [Caenorhabditis elegans]
          Length = 287

 Score = 46.2 bits (108), Expect = 0.095,   Method: Compositional matrix adjust.
 Identities = 17/43 (39%), Positives = 30/43 (69%)

Query: 732 MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 774
           ++ +LRP + TFL R +K FE+ ++T   K+YA ++  +LDP+
Sbjct: 101 VYVRLRPHLRTFLSRMAKTFEIIIFTASKKVYANKLCDILDPR 143


>gi|323308065|gb|EGA61318.1| Psr2p [Saccharomyces cerevisiae FostersO]
          Length = 319

 Score = 46.2 bits (108), Expect = 0.099,   Method: Compositional matrix adjust.
 Identities = 29/105 (27%), Positives = 47/105 (44%), Gaps = 28/105 (26%)

Query: 680 RKLCLVLDLDHTLLNSA--KFHEVD-----PVHDEILRKKEEQDREKPHRHLFRFPHMGM 732
           +K CL+LDLD TL++S+    H  D      + D++                       +
Sbjct: 226 QKKCLILDLDETLVHSSFKYMHSADFVLPVEIDDQV---------------------HNV 264

Query: 733 WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 777
           +   RPG+  FL R S+L+E+ ++T     YA  +   LDP G +
Sbjct: 265 YVIKRPGVDEFLNRVSQLYEVVVFTASVSRYANPLLDTLDPNGTI 309


>gi|190406060|gb|EDV09327.1| hypothetical protein SCRG_05007 [Saccharomyces cerevisiae RM11-1a]
          Length = 397

 Score = 46.2 bits (108), Expect = 0.099,   Method: Compositional matrix adjust.
 Identities = 32/103 (31%), Positives = 50/103 (48%), Gaps = 24/103 (23%)

Query: 680 RKLCLVLDLDHTLLNSA--KFHEVD---PVHDEILRKKEEQDREKPHRHLFRFPHMGMWT 734
           +K CL+LDLD TL++S+    H  D   PV         E D +  + ++ +        
Sbjct: 226 QKKCLILDLDETLVHSSFKYMHSADFVLPV---------EIDDQVHNVYVIK-------- 268

Query: 735 KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 777
             RPG+  FL R S+L+E+ ++T     YA  +   LDP G +
Sbjct: 269 --RPGVDEFLNRVSQLYEVVVFTASVSRYANPLLDTLDPNGTI 309


>gi|51013613|gb|AAT93100.1| YLR019W [Saccharomyces cerevisiae]
          Length = 397

 Score = 46.2 bits (108), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 32/103 (31%), Positives = 50/103 (48%), Gaps = 24/103 (23%)

Query: 680 RKLCLVLDLDHTLLNSA--KFHEVD---PVHDEILRKKEEQDREKPHRHLFRFPHMGMWT 734
           +K CL+LDLD TL++S+    H  D   PV         E D +  + ++ +        
Sbjct: 226 QKKCLILDLDETLVHSSFKYMHSADFVLPV---------EIDDQVHNVYVIK-------- 268

Query: 735 KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 777
             RPG+  FL R S+L+E+ ++T     YA  +   LDP G +
Sbjct: 269 --RPGVDEFLNRVSQLYEVVVFTASVSRYANPLLDTLDPNGTI 309


>gi|6323047|ref|NP_013119.1| Psr2p [Saccharomyces cerevisiae S288c]
 gi|55583862|sp|Q07949.1|PSR2_YEAST RecName: Full=Probable phosphatase PSR2; AltName: Full=Plasma
           membrane sodium response protein 2
 gi|1360322|emb|CAA97541.1| unnamed protein product [Saccharomyces cerevisiae]
 gi|151941187|gb|EDN59565.1| protein phosphatase [Saccharomyces cerevisiae YJM789]
 gi|207343198|gb|EDZ70734.1| YLR019Wp-like protein [Saccharomyces cerevisiae AWRI1631]
 gi|256269170|gb|EEU04502.1| Psr2p [Saccharomyces cerevisiae JAY291]
 gi|285813441|tpg|DAA09337.1| TPA: Psr2p [Saccharomyces cerevisiae S288c]
 gi|323332484|gb|EGA73892.1| Psr2p [Saccharomyces cerevisiae AWRI796]
 gi|323353905|gb|EGA85758.1| Psr2p [Saccharomyces cerevisiae VL3]
          Length = 397

 Score = 46.2 bits (108), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 32/103 (31%), Positives = 50/103 (48%), Gaps = 24/103 (23%)

Query: 680 RKLCLVLDLDHTLLNSA--KFHEVD---PVHDEILRKKEEQDREKPHRHLFRFPHMGMWT 734
           +K CL+LDLD TL++S+    H  D   PV         E D +  + ++ +        
Sbjct: 226 QKKCLILDLDETLVHSSFKYMHSADFVLPV---------EIDDQVHNVYVIK-------- 268

Query: 735 KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 777
             RPG+  FL R S+L+E+ ++T     YA  +   LDP G +
Sbjct: 269 --RPGVDEFLNRVSQLYEVVVFTASVSRYANPLLDTLDPNGTI 309


>gi|290990355|ref|XP_002677802.1| nuclear lim interactor-interacting protein [Naegleria gruberi]
 gi|284091411|gb|EFC45058.1| nuclear lim interactor-interacting protein [Naegleria gruberi]
          Length = 332

 Score = 46.2 bits (108), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 43/151 (28%), Positives = 67/151 (44%), Gaps = 22/151 (14%)

Query: 675 KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWT 734
           K  S   + LVLDLD TL++ +     +P+           D +     LF      ++ 
Sbjct: 145 KELSQPDITLVLDLDETLVHCS----TEPI----------PDPDFTFTVLFHGVEYTVYV 190

Query: 735 KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDG 794
           + RP    FLE  SK+FE+ ++T    +YA ++  +LDP+      RV            
Sbjct: 191 RKRPYFVEFLEAVSKIFEVVVFTASQSVYADKLLSILDPERKYIKYRVFRNS------CI 244

Query: 795 DERVPKSKDLEGVLGME-SAVVIIDDSVRVW 824
           D      KDLE VLG + S   I+D+S + +
Sbjct: 245 DVERNYLKDLE-VLGRDLSKTCIVDNSPQAY 274


>gi|323336572|gb|EGA77838.1| Psr2p [Saccharomyces cerevisiae Vin13]
          Length = 398

 Score = 45.8 bits (107), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 32/103 (31%), Positives = 50/103 (48%), Gaps = 24/103 (23%)

Query: 680 RKLCLVLDLDHTLLNSA--KFHEVD---PVHDEILRKKEEQDREKPHRHLFRFPHMGMWT 734
           +K CL+LDLD TL++S+    H  D   PV         E D +  + ++ +        
Sbjct: 227 QKKCLILDLDETLVHSSFKYMHSADFVLPV---------EIDDQVHNVYVIK-------- 269

Query: 735 KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 777
             RPG+  FL R S+L+E+ ++T     YA  +   LDP G +
Sbjct: 270 --RPGVDEFLNRVSQLYEVVVFTASVSRYANPLLDTLDPNGTI 310


>gi|392297996|gb|EIW09095.1| Psr2p [Saccharomyces cerevisiae CEN.PK113-7D]
          Length = 397

 Score = 45.8 bits (107), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 32/103 (31%), Positives = 50/103 (48%), Gaps = 24/103 (23%)

Query: 680 RKLCLVLDLDHTLLNSA--KFHEVD---PVHDEILRKKEEQDREKPHRHLFRFPHMGMWT 734
           +K CL+LDLD TL++S+    H  D   PV         E D +  + ++ +        
Sbjct: 226 QKKCLILDLDETLVHSSFKYMHSADFVLPV---------EIDDQVHNVYVIK-------- 268

Query: 735 KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 777
             RPG+  FL R S+L+E+ ++T     YA  +   LDP G +
Sbjct: 269 --RPGVDEFLNRVSQLYEVVVFTASVSRYANPLLDTLDPNGTI 309


>gi|91086797|ref|XP_973406.1| PREDICTED: similar to CTD (carboxy-terminal domain, RNA polymerase
           II, polypeptide A) small phosphatase like 2 [Tribolium
           castaneum]
 gi|270009707|gb|EFA06155.1| hypothetical protein TcasGA2_TC009000 [Tribolium castaneum]
          Length = 451

 Score = 45.8 bits (107), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 46/168 (27%), Positives = 79/168 (47%), Gaps = 36/168 (21%)

Query: 678 SARKLCLVLDLDHTL-------LNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 730
           S+ +  LVLDLD TL       L+ A FH   PV                   LF+    
Sbjct: 269 SSPEFSLVLDLDETLVHCSLQELSDASFHF--PV-------------------LFQDCSY 307

Query: 731 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGD 790
            ++ + RP    F+E+ S++FE+ L+T   ++YA ++  +LDP+      R+    +   
Sbjct: 308 TVYVRTRPYFREFMEKVSQMFEVILFTASKRVYADKLLNLLDPERKWIKYRLFR--EHCV 365

Query: 791 PFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 837
             +G+      KDL  +LG + S  +IID+S + + ++  N I +E +
Sbjct: 366 CVNGN----YIKDL-SILGRDLSKTIIIDNSPQAFGYHLNNGIPIESW 408


>gi|349579745|dbj|GAA24906.1| K7_Psr2p [Saccharomyces cerevisiae Kyokai no. 7]
          Length = 397

 Score = 45.8 bits (107), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 32/103 (31%), Positives = 50/103 (48%), Gaps = 24/103 (23%)

Query: 680 RKLCLVLDLDHTLLNSA--KFHEVD---PVHDEILRKKEEQDREKPHRHLFRFPHMGMWT 734
           +K CL+LDLD TL++S+    H  D   PV         E D +  + ++ +        
Sbjct: 226 QKKCLILDLDETLVHSSFKYMHSADFVLPV---------EIDDQVHNVYVIK-------- 268

Query: 735 KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 777
             RPG+  FL R S+L+E+ ++T     YA  +   LDP G +
Sbjct: 269 --RPGVDEFLNRVSQLYEVVVFTASVSRYANPLLDTLDPNGTI 309


>gi|259148008|emb|CAY81257.1| Psr2p [Saccharomyces cerevisiae EC1118]
          Length = 397

 Score = 45.8 bits (107), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 32/103 (31%), Positives = 50/103 (48%), Gaps = 24/103 (23%)

Query: 680 RKLCLVLDLDHTLLNSA--KFHEVD---PVHDEILRKKEEQDREKPHRHLFRFPHMGMWT 734
           +K CL+LDLD TL++S+    H  D   PV         E D +  + ++ +        
Sbjct: 226 QKKCLILDLDETLVHSSFKYMHSADFVLPV---------EIDDQVHNVYVIK-------- 268

Query: 735 KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 777
             RPG+  FL R S+L+E+ ++T     YA  +   LDP G +
Sbjct: 269 --RPGVDEFLNRVSQLYEVVVFTASVSRYANPLLDTLDPNGTI 309


>gi|154342859|ref|XP_001567375.1| conserved hypothetical protein [Leishmania braziliensis
           MHOM/BR/75/M2904]
 gi|134064707|emb|CAM42811.1| conserved hypothetical protein [Leishmania braziliensis
           MHOM/BR/75/M2904]
          Length = 228

 Score = 45.8 bits (107), Expect = 0.12,   Method: Composition-based stats.
 Identities = 37/134 (27%), Positives = 65/134 (48%), Gaps = 18/134 (13%)

Query: 718 EKP------HRHLFRF-----PHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATE 766
           EKP      + + F+F     P +  +   RPG+  FL + ++ ++M ++T G  LYA  
Sbjct: 39  EKPTFFIDTNENFFQFTLEDDPSVTYYAFRRPGLSEFLHQCAEHYDMRIFTAGEDLYART 98

Query: 767 MAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDL-EGVLGMESAVVIIDDSV--RV 823
           + + L P  ++   R  +R  D    DG  R+ K+  + +G    E   +I+DDS    V
Sbjct: 99  LLRWLLPDNLIDESRWYTR--DACVGDGYGRLIKNLSMIDGFQFEERTALILDDSAPDNV 156

Query: 824 WPHNKLNLIVVERY 837
           +PH   N + + R+
Sbjct: 157 YPHQ--NALAIPRF 168


>gi|403346652|gb|EJY72729.1| NLI interacting factor-like phosphatase family protein [Oxytricha
           trifallax]
          Length = 368

 Score = 45.8 bits (107), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 32/118 (27%), Positives = 62/118 (52%), Gaps = 13/118 (11%)

Query: 729 HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAK-VLD--PKGVL-FAGRVIS 784
            M M+  LRPGI+TFL+  S+ FE+ L+  GN+ Y   + K +LD  P G   +   VIS
Sbjct: 179 QMKMFAYLRPGIYTFLDTLSEHFEIVLFNNGNQEYTENLVKLILDNSPSGRRDYFSYVIS 238

Query: 785 RGDDGDPFDGDERVPKSKDLEGVLGMESA-----VVIIDDSVRVWPHNKLNLIVVERY 837
           +    D    ++   + K++E     +S       +I+D+++  +  +  N ++++++
Sbjct: 239 K----DKCSINDSGHEIKNIEHFCNFDSNREISDCLIVDNNIYSFQKHLTNGLLIDKF 292


>gi|357129571|ref|XP_003566435.1| PREDICTED: uncharacterized protein C2F7.02c-like [Brachypodium
           distachyon]
          Length = 302

 Score = 45.8 bits (107), Expect = 0.13,   Method: Composition-based stats.
 Identities = 48/146 (32%), Positives = 69/146 (47%), Gaps = 30/146 (20%)

Query: 681 KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRF-PHMG-----MWT 734
           K  L LDLD TL++S    + DPV               P R+ F   P +G      + 
Sbjct: 125 KKTLFLDLDETLIHS----QTDPV---------------PARYDFTVRPVIGGQAITFYV 165

Query: 735 KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDG 794
             RPG+  FL  A++ FE+ ++T G + YA+ +   LDP G + A R + RG   D  DG
Sbjct: 166 TKRPGVDEFLRAAAEAFEVVVFTAGLEQYASLVLDRLDPDGAVIAHR-LYRGACRD--DG 222

Query: 795 DERVPKSKDLEGVLGMESAVVIIDDS 820
           D R+   KDL          +I+DD+
Sbjct: 223 DGRL--VKDLAATGRALDCAIIVDDN 246


>gi|221487382|gb|EEE25614.1| protein phosphatase, putative [Toxoplasma gondii GT1]
          Length = 621

 Score = 45.8 bits (107), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 35/95 (36%), Positives = 52/95 (54%), Gaps = 16/95 (16%)

Query: 684 LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR-HLFRFPHMGMWTKLRPGIWT 742
           LVLDLD TL++S+ F  V PV    +  + E    KPH  H+ +          RPG+  
Sbjct: 446 LVLDLDETLVHSS-FRPV-PVSAFAITVEVEG---KPHTIHVCK----------RPGVDR 490

Query: 743 FLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 777
           FLE  S+L+E+ ++T   + YA  +  +LDPKG+ 
Sbjct: 491 FLEVVSRLYEVVIFTASLQTYADPLIDLLDPKGLC 525


>gi|158293726|ref|XP_315066.4| AGAP004967-PA [Anopheles gambiae str. PEST]
 gi|157016584|gb|EAA10342.4| AGAP004967-PA [Anopheles gambiae str. PEST]
          Length = 226

 Score = 45.8 bits (107), Expect = 0.13,   Method: Composition-based stats.
 Identities = 48/164 (29%), Positives = 78/164 (47%), Gaps = 22/164 (13%)

Query: 675 KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWT 734
           K  S+ +  LVLDLD TL           VH  ++   E  D       LF+     ++ 
Sbjct: 41  KTRSSPEFSLVLDLDETL-----------VHCSLM---ELSDASFKFPVLFQECKYTVFV 86

Query: 735 KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDG 794
           + RP    FLER S++FE+ L+T   ++YA ++  +LDP   L   R+    +     +G
Sbjct: 87  RTRPYFREFLERVSQMFEVILFTASKRVYADKLLNLLDPDRRLIKYRLFR--EHCVLVNG 144

Query: 795 DERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 837
           +      KDL  +LG + S  +IID+S + + +   N I +E +
Sbjct: 145 N----YIKDLT-ILGRDLSKTIIIDNSPQAFGYQLENGIPIESW 183


>gi|328767798|gb|EGF77846.1| hypothetical protein BATDEDRAFT_13622 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 192

 Score = 45.4 bits (106), Expect = 0.14,   Method: Composition-based stats.
 Identities = 33/101 (32%), Positives = 51/101 (50%), Gaps = 16/101 (15%)

Query: 674 KKMFSARKLCLVLDLDHTLLNSAKFHEVDPV-HDEILRKKEEQDREKPHRHLFRFPHMGM 732
           KK  S+  + LVLDLD TL++ +      P+ H +I    E           F      +
Sbjct: 23  KKTRSSPPITLVLDLDETLVHCS----TSPLDHCDITFPVE-----------FNNITYTV 67

Query: 733 WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDP 773
             +LRP   TFLER S++FE+ ++T   K+YA  +  ++DP
Sbjct: 68  SGRLRPHYKTFLERCSEIFEVVVFTASQKIYADRLLNIIDP 108


>gi|308811648|ref|XP_003083132.1| TFIIF-interacting CTD phosphatase, including NLI-interacting factor
           (involved in RNA polymerase II regulation) (ISS)
           [Ostreococcus tauri]
 gi|116055010|emb|CAL57087.1| TFIIF-interacting CTD phosphatase, including NLI-interacting factor
           (involved in RNA polymerase II regulation) (ISS)
           [Ostreococcus tauri]
          Length = 485

 Score = 45.4 bits (106), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 47/156 (30%), Positives = 72/156 (46%), Gaps = 21/156 (13%)

Query: 683 CLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWT 742
            LVLDLD TL           VH  +     + D   P   +F      +  + RP + T
Sbjct: 288 TLVLDLDETL-----------VHSNLENTGGKSDFSFPV--VFNGEIHQVNVRTRPHLQT 334

Query: 743 FLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSK 802
           F+E  SK +E+ ++T   ++YA ++  +LDPK    A RV    D     +G+      K
Sbjct: 335 FMETVSKKYEIVVFTASQQIYADKLLDLLDPKREWIAHRVFR--DSCVQIEGN----YMK 388

Query: 803 DLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERY 837
           DL  VLG + S  +IID+S + +     N I +E +
Sbjct: 389 DLR-VLGRDLSKTIIIDNSPQAFGLQVENGIPIESW 423


>gi|365985822|ref|XP_003669743.1| hypothetical protein NDAI_0D01860 [Naumovozyma dairenensis CBS 421]
 gi|343768512|emb|CCD24500.1| hypothetical protein NDAI_0D01860 [Naumovozyma dairenensis CBS 421]
          Length = 514

 Score = 45.4 bits (106), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 33/111 (29%), Positives = 47/111 (42%), Gaps = 28/111 (25%)

Query: 680 RKLCLVLDLDHTLLNS-------AKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGM 732
            K CLVLDLD TL++S       A F+    + D+I                       +
Sbjct: 342 HKKCLVLDLDETLVHSSFKYLPNADFNLPVNIDDQI---------------------HNV 380

Query: 733 WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVI 783
           +   RPG+  FLE+  KLFE+ ++T     Y   +   LDPKG     R+ 
Sbjct: 381 YVIKRPGVDEFLEKVGKLFEVVIFTASVSRYGDPLLDRLDPKGKSIHHRLF 431


>gi|67588036|ref|XP_665317.1| hypothetical protein [Cryptosporidium hominis TU502]
 gi|54655944|gb|EAL35087.1| hypothetical protein Chro.80553 [Cryptosporidium hominis]
          Length = 364

 Score = 45.4 bits (106), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 33/114 (28%), Positives = 57/114 (50%), Gaps = 11/114 (9%)

Query: 733 WTKLRPGIWTFLERASK-LFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 791
           + KLRPG+   L   SK  +E+++YTMG + +A    ++LDP+   F  + I   ++G  
Sbjct: 181 YYKLRPGVINMLRTLSKDKYEIYMYTMGTEYHAYTSLRILDPELRFFHSKRIFYRNNG-- 238

Query: 792 FDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLN-LIVVERYTYFPCS 843
                +    K L  +   +   +VI+DD  + W    +N L+ V  Y +FP +
Sbjct: 239 ----FKETSIKSLNTLFPYDHRTLVILDDIEQAWT--DINSLLKVYPYNFFPSN 286


>gi|237830029|ref|XP_002364312.1| NLI interacting factor-like phosphatase domain-containing protein
           [Toxoplasma gondii ME49]
 gi|211961976|gb|EEA97171.1| NLI interacting factor-like phosphatase domain-containing protein
           [Toxoplasma gondii ME49]
 gi|221507180|gb|EEE32784.1| dullard protein, putative [Toxoplasma gondii VEG]
          Length = 621

 Score = 45.4 bits (106), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 35/98 (35%), Positives = 53/98 (54%), Gaps = 16/98 (16%)

Query: 681 KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHR-HLFRFPHMGMWTKLRPG 739
           +  LVLDLD TL++S+ F  V PV    +  + E    KPH  H+ +          RPG
Sbjct: 443 RTTLVLDLDETLVHSS-FRPV-PVSAFAITVEVEG---KPHTIHVCK----------RPG 487

Query: 740 IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 777
           +  FLE  S+L+E+ ++T   + YA  +  +LDPKG+ 
Sbjct: 488 VDRFLEVVSRLYEVVIFTASLQTYADPLIDLLDPKGLC 525


>gi|195383304|ref|XP_002050366.1| GJ22116 [Drosophila virilis]
 gi|194145163|gb|EDW61559.1| GJ22116 [Drosophila virilis]
          Length = 703

 Score = 45.4 bits (106), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 32/121 (26%), Positives = 62/121 (51%), Gaps = 4/121 (3%)

Query: 874 VIERLHKIFFS--HQSLDDVDVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQT 931
           ++  +HK F++   ++ +  D++ I+   + ++L G  +VFS + P  +        +  
Sbjct: 382 ILRNIHKRFYAIYDETTEIPDLKIIVPKIRCEVLRGQNLVFSGLVPT-QMKLEQSRAYFI 440

Query: 932 AEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQD 990
           A+  GA    +I+  +TH+VA + GT KVN A    +  VV+  W+ A A  +   +E+ 
Sbjct: 441 AKSLGAEVQSNINKDITHLVAVNAGTYKVNAAKKESKIKVVNANWLWACAERWEHVDERL 500

Query: 991 F 991
           F
Sbjct: 501 F 501


>gi|66361684|ref|XP_627365.1| RNA pol II carboxy terminal domain phosphatase of the HAD
           superfamily with a BRCT domain at the C-terminus
           [Cryptosporidium parvum Iowa II]
 gi|46228744|gb|EAK89614.1| RNA pol II carboxy terminal domain phosphatase of the HAD
           superfamily with a BRCT domain at the C-terminus
           [Cryptosporidium parvum Iowa II]
          Length = 762

 Score = 45.1 bits (105), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 33/112 (29%), Positives = 56/112 (50%), Gaps = 11/112 (9%)

Query: 733 WTKLRPGIWTFLERASK-LFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDP 791
           + KLRPG+   L   SK  +E+++YTMG + +A    ++LDP+   F  + I   ++G  
Sbjct: 348 YYKLRPGVINMLRTLSKDKYEIYMYTMGTEYHAYTSLRILDPELRFFHSKRIFYRNNG-- 405

Query: 792 FDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLN-LIVVERYTYFP 841
                +    K L  +   +   +VI+DD  + W    +N L+ V  Y +FP
Sbjct: 406 ----FKETSIKSLNTLFPYDHRTLVILDDIEQAWT--DINSLLKVYPYNFFP 451


>gi|327260340|ref|XP_003214992.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
           small phosphatase 1-like [Anolis carolinensis]
          Length = 345

 Score = 45.1 bits (105), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 34/121 (28%), Positives = 55/121 (45%), Gaps = 14/121 (11%)

Query: 657 QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 716
           ++  ++ K   R L  + K   A K+C+V+DLD TL++S+      PV++       E D
Sbjct: 150 EENGSVTKATVRYLLPEIKPQDANKICVVIDLDETLVHSS----FKPVNNADFIIPVEID 205

Query: 717 REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 776
                 ++ + PH+            FL R  +LFE  L+T     YA  +A +LD  G 
Sbjct: 206 GVMHQVYVLKRPHVD----------EFLRRMGELFECVLFTASLAKYADPVADLLDKWGA 255

Query: 777 L 777
            
Sbjct: 256 F 256


>gi|401426731|ref|XP_003877849.1| conserved hypothetical protein [Leishmania mexicana
           MHOM/GT/2001/U1103]
 gi|322494096|emb|CBZ29393.1| conserved hypothetical protein [Leishmania mexicana
           MHOM/GT/2001/U1103]
          Length = 231

 Score = 45.1 bits (105), Expect = 0.19,   Method: Composition-based stats.
 Identities = 33/113 (29%), Positives = 57/113 (50%), Gaps = 7/113 (6%)

Query: 728 PHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD 787
           P +  +   RPG+  FL + ++ ++M ++T G  LYA  + + L P  ++   R  +R  
Sbjct: 60  PSVTYYAFRRPGLNEFLYQCAEHYDMRIFTAGEDLYARTLLRWLLPANLIDESRWYTR-- 117

Query: 788 DGDPFDGDERVPKSKD-LEGVLGMESAVVIIDDSV--RVWPHNKLNLIVVERY 837
           D    DG  R+ K+   L+G    E   +I+DDS    V+PH   N + + R+
Sbjct: 118 DACVGDGYGRLIKNLSMLDGFQFEERTALILDDSAPDNVYPHQ--NALAIPRF 168


>gi|344253634|gb|EGW09738.1| Carboxy-terminal domain RNA polymerase II polypeptide A small
           phosphatase 1 [Cricetulus griseus]
          Length = 354

 Score = 45.1 bits (105), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 35/123 (28%), Positives = 57/123 (46%), Gaps = 14/123 (11%)

Query: 657 QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 716
           ++  AI K   + L  + K   + K+C+V+DLD TL++S+      PV++       E D
Sbjct: 159 EENGAIPKTPVQYLLPEAKAQDSDKICVVIDLDETLVHSS----FKPVNNADFIIPVEID 214

Query: 717 REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 776
                 ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  G 
Sbjct: 215 GVIHQVYVLKRPHVD----------EFLQRMGELFECVLFTASLAKYADPVADLLDKWGA 264

Query: 777 LFA 779
             A
Sbjct: 265 FRA 267


>gi|145538816|ref|XP_001455108.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124422896|emb|CAK87711.1| unnamed protein product [Paramecium tetraurelia]
          Length = 282

 Score = 45.1 bits (105), Expect = 0.20,   Method: Composition-based stats.
 Identities = 40/132 (30%), Positives = 62/132 (46%), Gaps = 23/132 (17%)

Query: 650 LFEGYDDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEIL 709
           LF+   +  K   QK       +  K +S +K+ LVLDLD TL           VH E  
Sbjct: 6   LFKNCYEHFKGKFQKNYIS--AQTPKQYSQKKV-LVLDLDETL-----------VHCEF- 50

Query: 710 RKKEEQDREKPHRHLFRFPHMG----MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYAT 765
             KE ++ +  H  L    H G    ++ K RP +  FL+ ASK +E+ ++T G + Y  
Sbjct: 51  --KENENFQ--HEVLLEVIHKGQLYTVYLKARPYLNQFLQEASKDYEIFIFTAGYEAYCQ 106

Query: 766 EMAKVLDPKGVL 777
           E+   +D K ++
Sbjct: 107 EVLSFIDKKKII 118


>gi|146096062|ref|XP_001467692.1| conserved hypothetical protein [Leishmania infantum JPCM5]
 gi|398020532|ref|XP_003863429.1| hypothetical protein, conserved [Leishmania donovani]
 gi|134072058|emb|CAM70757.1| conserved hypothetical protein [Leishmania infantum JPCM5]
 gi|322501662|emb|CBZ36743.1| hypothetical protein, conserved [Leishmania donovani]
          Length = 231

 Score = 45.1 bits (105), Expect = 0.20,   Method: Composition-based stats.
 Identities = 33/113 (29%), Positives = 57/113 (50%), Gaps = 7/113 (6%)

Query: 728 PHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD 787
           P +  +   RPG+  FL + ++ ++M ++T G  LYA  + + L P  ++   R  +R  
Sbjct: 60  PSVTYYAFRRPGLNEFLYQCAEHYDMRIFTAGEDLYARTLLRWLLPDNLIDESRWYTR-- 117

Query: 788 DGDPFDGDERVPKSKD-LEGVLGMESAVVIIDDSV--RVWPHNKLNLIVVERY 837
           D    DG  R+ K+   L+G    E   +I+DDS    V+PH   N + + R+
Sbjct: 118 DACVGDGYGRLIKNLSMLDGFQFEERTALILDDSAPDNVYPHQ--NALAIPRF 168


>gi|403368592|gb|EJY84135.1| Putative tfiif-interacting component of the c-terminal domain
           phosphatase [Oxytricha trifallax]
          Length = 525

 Score = 45.1 bits (105), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 61/241 (25%), Positives = 91/241 (37%), Gaps = 82/241 (34%)

Query: 675 KMFSARKLCLVLDLDHTLLNSAKFHE-------VDPVHDEILRKKEEQDREKPHRHLFRF 727
           ++   RKL LVLDLD+TLL++    E        DP    ++          P + ++  
Sbjct: 3   QLLQDRKLVLVLDLDNTLLHTKSIEEREFQTKSRDPTFINLI---------DPLKSIYEI 53

Query: 728 PHM--GMWTKLRPGIWTFLERA--SKLFEMHLYTMGNK----------------LYATEM 767
                G  TKLRP ++ FL++    + FE++ YT G K                L+  E 
Sbjct: 54  KLFRGGFHTKLRPFLFEFLKKVFDERKFEIYFYTAGTKDYGMLIIDIFKMEITRLFGKEY 113

Query: 768 AKVLDPKGVLFAGRVISRGD--------DGDPFDGD----------ERVPKSKDLEGVLG 809
           AK +  +  L   ++ISR D          +  D D          E     +  +  L 
Sbjct: 114 AKQISEE--LSHRKLISRCDKERFANKNSSNEIDIDSMQQQLYQQIENQQMGQGGDATLN 171

Query: 810 M--------------ESAVVIIDDSVRVWPH-------NKL-----NLIVVERYTYFPCS 843
           M              ES  +IIDD   VW         NKL     NLI++  Y Y+  S
Sbjct: 172 MKHFIKSLSSLAGGDESIFIIIDDRSDVWTEEVKDQNGNKLRRVSDNLILIPEYFYWETS 231

Query: 844 R 844
           +
Sbjct: 232 Q 232


>gi|384250655|gb|EIE24134.1| hypothetical protein COCSUDRAFT_41430 [Coccomyxa subellipsoidea
           C-169]
          Length = 1029

 Score = 45.1 bits (105), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 59/197 (29%), Positives = 85/197 (43%), Gaps = 45/197 (22%)

Query: 681 KLCLVLDLDHTLL-----NSAKFH------EVDP-----VHDEILRKKEEQDREKPHRHL 724
           +L LVLDLD TLL     N  + H      E+D      V  ++  K+E+  +E+ +  L
Sbjct: 120 RLPLVLDLDETLLEAFTANQLRKHIKDLSAEIDGGNWSNVEKKLQLKREKAFKEEDYNLL 179

Query: 725 FRFPHMGMWTKLRPGI---WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLF--- 778
            +F      T L   I   W   ER    FE+++ T  ++ YA E  + LDP  +L    
Sbjct: 180 VQFIQTNSVT-LNGQIHKAWPGRER----FEVYVCTTADRSYALEAWRHLDPSALLIPYA 234

Query: 779 --AGRVISRGDDGDPFDGDERVPKSKDLEGVLGM------------ESAV---VIIDDSV 821
               R  +   D D  D D  V   KDL  V+G+             SA+   VIIDD  
Sbjct: 235 DRRKRFHNVHQDKDSKDKDGNVKPVKDLAHVMGLLGHPWSAPCTPPNSAMPLAVIIDDQP 294

Query: 822 RVW-PHNKLNLIVVERY 837
            VW   ++  L  VE++
Sbjct: 295 AVWTAESQGQLYQVEKF 311


>gi|340508012|gb|EGR33824.1| NLI interacting factor-like phosphatase family protein, putative
           [Ichthyophthirius multifiliis]
          Length = 222

 Score = 45.1 bits (105), Expect = 0.21,   Method: Composition-based stats.
 Identities = 35/126 (27%), Positives = 67/126 (53%), Gaps = 17/126 (13%)

Query: 684 LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTF 743
           L+LDLD TL++S   +E +P  D ++  +EE + +K  +  FR         +RP    F
Sbjct: 61  LLLDLDETLIHSCGLNE-NP--DAVIMAQEEYNSQKQFQIAFR---------IRPYCIEF 108

Query: 744 LERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDG----DERVP 799
           L++ SK ++++++T  +  YA  +   LD +   +  +V++R +  +  +G    D R+ 
Sbjct: 109 LQQVSKYWDIYVFTASSASYANAIVNYLDSQQE-YIHQVLTRQNCMETKNGFFIKDLRII 167

Query: 800 KSKDLE 805
           K  DL+
Sbjct: 168 KDIDLQ 173


>gi|341894763|gb|EGT50698.1| hypothetical protein CAEBREN_25349 [Caenorhabditis brenneri]
          Length = 250

 Score = 45.1 bits (105), Expect = 0.21,   Method: Composition-based stats.
 Identities = 30/108 (27%), Positives = 56/108 (51%), Gaps = 14/108 (12%)

Query: 675 KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWT 734
           K  ++ +  LVLDLD TL++ +    + P+ +  +              +F+     ++ 
Sbjct: 22  KTRASAEYTLVLDLDETLVHCS----LTPLDNATM----------IFPVVFQNITYQVYV 67

Query: 735 KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 782
           +LRP + TFL R +K FE+ ++T   K+YA ++  +LDP+  L   R+
Sbjct: 68  RLRPHLRTFLNRMAKTFEIIIFTASKKVYANKLCDILDPRKNLIRHRL 115


>gi|254578106|ref|XP_002495039.1| ZYRO0B01958p [Zygosaccharomyces rouxii]
 gi|238937929|emb|CAR26106.1| ZYRO0B01958p [Zygosaccharomyces rouxii]
          Length = 336

 Score = 45.1 bits (105), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 49/174 (28%), Positives = 76/174 (43%), Gaps = 20/174 (11%)

Query: 668 RRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRF 727
           ++L  Q  +F+ RK  LV+DLD TL++SA             R     +  + H    RF
Sbjct: 140 KKLVPQSILFAERKKRLVVDLDETLIHSAT------------RSVSHSNSAQGHMVEVRF 187

Query: 728 PHMGM----WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVI 783
           P   +    +   RP    FL + SK +++ ++T   K YA  +   L+     F G+  
Sbjct: 188 PPSSISTLYYVHKRPHCDLFLSKVSKWYDLIIFTASMKEYADPVIDWLESS---FTGKFC 244

Query: 784 SRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERY 837
            R    +     E V   KDL  V  +   VV+ID+S   +  N+ N I VE +
Sbjct: 245 KRLYRHNCV-VREGVGYIKDLSVVTEVLDEVVLIDNSPTSYARNEDNAIQVEGW 297


>gi|403266874|ref|XP_003925585.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
           small phosphatase 1 isoform 1 [Saimiri boliviensis
           boliviensis]
          Length = 262

 Score = 45.1 bits (105), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 35/123 (28%), Positives = 57/123 (46%), Gaps = 14/123 (11%)

Query: 657 QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 716
           ++  AI K   + L  + K   + K+C+V+DLD TL++S+      PV++       E D
Sbjct: 67  EENGAIPKTPVQYLLPEAKAQDSDKICVVIDLDETLVHSS----FKPVNNADFIIPVEID 122

Query: 717 REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 776
                 ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  G 
Sbjct: 123 GVVHQVYVLKRPHVD----------EFLQRMGELFECVLFTASLAKYADPVADLLDKWGA 172

Query: 777 LFA 779
             A
Sbjct: 173 FRA 175


>gi|157870945|ref|XP_001684022.1| nuclear lim interactor-interacting factor-like protein [Leishmania
           major strain Friedlin]
 gi|68127090|emb|CAJ04515.1| nuclear lim interactor-interacting factor-like protein [Leishmania
           major strain Friedlin]
          Length = 290

 Score = 45.1 bits (105), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 42/160 (26%), Positives = 70/160 (43%), Gaps = 18/160 (11%)

Query: 678 SARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLR 737
           S  K+ LVLD+D TL++S      D V+D++L    E                 +  K R
Sbjct: 109 SVPKVTLVLDVDETLVHSTFQPSSDVVYDKVLLVPSEGK------------TYTVSVKYR 156

Query: 738 PGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDER 797
           P +  FL   S+ FE+ ++T   + Y  ++   +DP G+L   R+     +   F     
Sbjct: 157 PYLEDFLRFVSRRFEVVIFTASMRAYCDKLMDEIDPHGILGNLRLFR---EHCTFSERSY 213

Query: 798 VPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERY 837
           V   KDL  +      VVI+D+S   +   + N I ++ +
Sbjct: 214 V---KDLHRLGRDLRRVVILDNSPAAYSFQQRNAIPIKTW 250


>gi|170587764|ref|XP_001898644.1| NLI interacting factor-like phosphatase family protein [Brugia
           malayi]
 gi|158593914|gb|EDP32508.1| NLI interacting factor-like phosphatase family protein [Brugia
           malayi]
          Length = 314

 Score = 45.1 bits (105), Expect = 0.22,   Method: Composition-based stats.
 Identities = 46/168 (27%), Positives = 78/168 (46%), Gaps = 32/168 (19%)

Query: 678 SARKLCLVLDLDHTLLNSAKFHEVD-----PVHDEILRKKEEQDREKPHRHLFRFPHMGM 732
           S  +  LVLDLD TL++ +     D     PVH                   F+     +
Sbjct: 126 STPEFSLVLDLDETLVHCSLTELPDASLTFPVH-------------------FQENTYQV 166

Query: 733 WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPF 792
           + ++RP +  FLER S+ FE+ L+T   ++YA ++  +LDP   L   R+     +   F
Sbjct: 167 YVRVRPHLQEFLERLSRSFEIILFTASKRVYADKLLNLLDPGKRLIRHRLFR---EHCVF 223

Query: 793 DGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYTY 839
                +   KDL  +LG + S  +IID+S++ + +   N I +E + +
Sbjct: 224 VYGNYI---KDLT-ILGRDLSKTIIIDNSLQSFAYQIDNGIPIESWFF 267


>gi|393909936|gb|EFO24836.2| SCP small domain phosphatase [Loa loa]
          Length = 321

 Score = 45.1 bits (105), Expect = 0.23,   Method: Composition-based stats.
 Identities = 46/168 (27%), Positives = 78/168 (46%), Gaps = 32/168 (19%)

Query: 678 SARKLCLVLDLDHTLLNSAKFHEVD-----PVHDEILRKKEEQDREKPHRHLFRFPHMGM 732
           S  +  LVLDLD TL++ +     D     PVH                   F+     +
Sbjct: 133 STPEFSLVLDLDETLVHCSLTELPDASLTFPVH-------------------FQENTYQV 173

Query: 733 WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPF 792
           + ++RP +  FLER S+ FE+ L+T   ++YA ++  +LDP   L   R+     +   F
Sbjct: 174 YVRVRPHLQEFLERLSRSFEIILFTASKRIYADKLLNLLDPGKRLIRHRLFR---EHCVF 230

Query: 793 DGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYTY 839
                +   KDL  +LG + S  +IID+S++ + +   N I +E + +
Sbjct: 231 VYGNYI---KDLT-ILGRDLSKTIIIDNSLQSFAYQIDNGIPIESWFF 274


>gi|126644240|ref|XP_001388239.1| hypothetical protein [Cryptosporidium parvum Iowa II]
 gi|126117312|gb|EAZ51412.1| hypothetical protein cgd2_3810 [Cryptosporidium parvum Iowa II]
          Length = 475

 Score = 45.1 bits (105), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 40/151 (26%), Positives = 67/151 (44%), Gaps = 18/151 (11%)

Query: 624 QTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKAAIQKE-RTRRLEEQKKMFSARKL 682
           + G       G  G   +S  GD E +   YD+  ++ ++   +   LE Q++ +  RK 
Sbjct: 285 EAGIYDKDNLGRNGFSLRSITGDRESII--YDEDYESYLESTIKEPFLEPQRREYIGRK- 341

Query: 683 CLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWT 742
            LVLDLD TL++S+      P+ +       E D +          +  ++   RPG+  
Sbjct: 342 TLVLDLDETLIHSS----FQPIRNASFTINIEIDGD----------YYDVYVLKRPGVDK 387

Query: 743 FLERASKLFEMHLYTMGNKLYATEMAKVLDP 773
           FL   S +FE+ ++T     YA  +   LDP
Sbjct: 388 FLNIVSAIFEVVIFTASLSKYANPLLDRLDP 418


>gi|145490634|ref|XP_001431317.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124398421|emb|CAK63919.1| unnamed protein product [Paramecium tetraurelia]
          Length = 473

 Score = 45.1 bits (105), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 39/147 (26%), Positives = 67/147 (45%), Gaps = 20/147 (13%)

Query: 680 RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPG 739
           R+  LV+DLD TL++          ++  L  K+ Q      + LF          +RP 
Sbjct: 280 RQKTLVIDLDETLVH---------CNESKLMPKDLQ------KQLFEAYSNQAEISVRPY 324

Query: 740 IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVP 799
              FL++ +K FE+ +YT  N+ YA ++ + LDP   L   R + R D  +  +G     
Sbjct: 325 AQQFLQKMAKHFEIMIYTASNEDYANQIIEYLDPTKSLVKYR-LYRNDCINLSEG----C 379

Query: 800 KSKDLEGVLGMESAVVIIDDSVRVWPH 826
             KDL  +      +++ID+S   + +
Sbjct: 380 HIKDLRTLNRNLKDIILIDNSAYSFAY 406


>gi|281210104|gb|EFA84272.1| CTD small phosphatase-like protein 2 [Polysphondylium pallidum
           PN500]
          Length = 539

 Score = 44.7 bits (104), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 43/148 (29%), Positives = 71/148 (47%), Gaps = 37/148 (25%)

Query: 681 KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLF-RFPHM--GMWTKLR 737
           K+ LVLDLD TL++ +     +P+             ++P    F  F ++   ++ K R
Sbjct: 362 KISLVLDLDETLVHCS----TEPI-------------DEPDLTFFVTFNNVEYKVFAKKR 404

Query: 738 PGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDP----KGVLFAGRVISRGDDGDPFD 793
           P    FL +AS LFE+ ++T   ++YA ++  ++DP    K  L+    +         D
Sbjct: 405 PFFEDFLSKASSLFELIIFTASQEVYANKLLNMIDPNKHIKYRLYRDSCVC-------VD 457

Query: 794 GDERVPKSKDLEGVLGME-SAVVIIDDS 820
           G       KDL  +LG + S VVI+D+S
Sbjct: 458 G----TYLKDL-SILGRDLSQVVIVDNS 480


>gi|399218895|emb|CCF75782.1| unnamed protein product [Babesia microti strain RI]
          Length = 460

 Score = 44.7 bits (104), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 31/101 (30%), Positives = 48/101 (47%), Gaps = 6/101 (5%)

Query: 681 KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 740
           KL +VLD+D TL++    H    VH +      E  + +    +    H  M+  LRPG+
Sbjct: 199 KLLVVLDMDETLIH---MHTNPHVHYDYTINLLEHSKSEDPMFISCNIHPTMYISLRPGV 255

Query: 741 WTFLERAS---KLFEMHLYTMGNKLYATEMAKVLDPKGVLF 778
             FL   S     +E+ L+T G +LYA  + + LDP   + 
Sbjct: 256 KEFLRYLSVNSDFYEVALFTAGTQLYADAVLEGLDPNCTII 296


>gi|390464816|ref|XP_003733289.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
           small phosphatase 1 isoform 2 [Callithrix jacchus]
          Length = 260

 Score = 44.7 bits (104), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 35/123 (28%), Positives = 57/123 (46%), Gaps = 14/123 (11%)

Query: 657 QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 716
           ++  AI K   + L  + K   + K+C+V+DLD TL++S+      PV++       E D
Sbjct: 65  EENGAIPKTPVQYLLPEAKAQDSDKICVVIDLDETLVHSS----FKPVNNADFIIPVEID 120

Query: 717 REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 776
                 ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  G 
Sbjct: 121 GVVHQVYVLKRPHVD----------EFLQRMGELFECVLFTASLAKYADPVADLLDKWGA 170

Query: 777 LFA 779
             A
Sbjct: 171 FRA 173


>gi|321470826|gb|EFX81801.1| hypothetical protein DAPPUDRAFT_49973 [Daphnia pulex]
          Length = 237

 Score = 44.7 bits (104), Expect = 0.26,   Method: Composition-based stats.
 Identities = 30/100 (30%), Positives = 49/100 (49%), Gaps = 14/100 (14%)

Query: 675 KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWT 734
           K  S+    LVLDLD TL++ +               +E +D        F+     ++ 
Sbjct: 52  KTRSSPTFSLVLDLDETLVHCSL--------------EELEDAAFSFPVFFQDTTYQVFV 97

Query: 735 KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 774
           + RP    FLER S++FE+ L+T   K+YA ++  +LDP+
Sbjct: 98  RTRPHFREFLERVSQIFEVILFTASKKVYADKLLNLLDPQ 137


>gi|194866038|ref|XP_001971726.1| GG14268 [Drosophila erecta]
 gi|190653509|gb|EDV50752.1| GG14268 [Drosophila erecta]
          Length = 260

 Score = 44.7 bits (104), Expect = 0.27,   Method: Composition-based stats.
 Identities = 34/120 (28%), Positives = 60/120 (50%), Gaps = 19/120 (15%)

Query: 664 KERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKP--- 720
           ++R   + +++     RK  LVLD+D T++ S             L+K  ++ R KP   
Sbjct: 54  EDRLSPVSKRRLSLVGRK-TLVLDMDETMITSW------------LKKSGKKPRNKPRVA 100

Query: 721 HRHLFRFP--HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDP-KGVL 777
           H   F  P     ++   RP +  FL+R SK +++ ++T G +LYA+ +   LD  +G+L
Sbjct: 101 HDFKFYLPAYEATIYVYKRPYLDHFLDRVSKWYDLTVFTAGAELYASPILDFLDRGRGIL 160


>gi|145516326|ref|XP_001444057.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124411457|emb|CAK76660.1| unnamed protein product [Paramecium tetraurelia]
          Length = 411

 Score = 44.7 bits (104), Expect = 0.28,   Method: Compositional matrix adjust.
 Identities = 28/100 (28%), Positives = 51/100 (51%), Gaps = 9/100 (9%)

Query: 717 REKPHRHLFRFPHMG----MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLD 772
           REK    +      G    ++  +RP    FL++ S L+ +++YT  +  YA  + K LD
Sbjct: 247 REKSQTCILAVSEQGEEARIYLNVRPFCQWFLQQMSLLYTIYVYTASSSAYANTIVKYLD 306

Query: 773 PKGVLFAGRVISRGDDGDPFDG----DERVPKSKDLEGVL 808
           PKG   +G ++SR +  +  +G    D R+  +K ++ +L
Sbjct: 307 PKGQWISG-ILSRQNCLETKNGFYIKDLRIIANKQIKNML 345


>gi|339237973|ref|XP_003380541.1| nuclear envelope morphology protein 1 [Trichinella spiralis]
 gi|316976534|gb|EFV59811.1| nuclear envelope morphology protein 1 [Trichinella spiralis]
          Length = 281

 Score = 44.7 bits (104), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 50/170 (29%), Positives = 83/170 (48%), Gaps = 29/170 (17%)

Query: 671 EEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 730
           E+ KK+++A     VLDLD TL++S    + DP + +I+   +   R             
Sbjct: 97  EKSKKLYTA-----VLDLDQTLVHSRSKRKGDPRY-KIVNIPQATRR------------- 137

Query: 731 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMA-KVLDPKGVLFAGRVISRGDDG 789
             +T +RP    FLE  S+ +E+ L+T G   YA  +  +++DP+   F+        D 
Sbjct: 138 -FYTAVRPCCAEFLESISEFYEVILFTAGTPRYAAAVIDQLVDPEHKYFSN--FYYRPDC 194

Query: 790 DPFDGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYT 838
            P D +      KDL  +LG + S  VI+DD++  +  +  N I+VE +T
Sbjct: 195 APVDHE----FVKDL-SILGRDLSKTVIMDDNMMSFCCHIDNGILVEPWT 239


>gi|402889397|ref|XP_003908003.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
           small phosphatase 1 isoform 2 [Papio anubis]
          Length = 260

 Score = 44.3 bits (103), Expect = 0.31,   Method: Compositional matrix adjust.
 Identities = 35/123 (28%), Positives = 57/123 (46%), Gaps = 14/123 (11%)

Query: 657 QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 716
           ++  AI K   + L  + K   + K+C+V+DLD TL++S+      PV++       E D
Sbjct: 65  EENGAIPKTPVQYLLPEAKAQDSDKICVVIDLDETLVHSS----FKPVNNADFIIPVEID 120

Query: 717 REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 776
                 ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  G 
Sbjct: 121 GVVHQVYVLKRPHVD----------EFLQRMGELFECVLFTASLAKYADPVADLLDKWGA 170

Query: 777 LFA 779
             A
Sbjct: 171 FRA 173


>gi|118378638|ref|XP_001022493.1| NLI interacting factor-like phosphatase family protein [Tetrahymena
            thermophila]
 gi|89304260|gb|EAS02248.1| NLI interacting factor-like phosphatase family protein [Tetrahymena
            thermophila SB210]
          Length = 1393

 Score = 44.3 bits (103), Expect = 0.32,   Method: Compositional matrix adjust.
 Identities = 40/156 (25%), Positives = 68/156 (43%), Gaps = 28/156 (17%)

Query: 681  KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPH---MGMWTKLR 737
            K  LV DLD TL++  +   +    D IL          P     RFP    +     +R
Sbjct: 1197 KKTLVFDLDETLIHCNENANIPS--DVIL----------P----IRFPTGEVIEAGINVR 1240

Query: 738  PGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG----DDGDPFD 793
            P     L+  SK +E+ ++T  +  YA  +   LDPKG    GR+        ++G  + 
Sbjct: 1241 PYCMEILQELSKFYEIIVFTASHSCYANVVLDYLDPKGQYITGRLFRENCVTTEEG-VYI 1299

Query: 794  GDERVPKSKDLEGVLGMESAVVI----IDDSVRVWP 825
             D RV  +++L  ++ +++A       ID+ + + P
Sbjct: 1300 KDLRVIANRNLSDIVLVDNAAYSFGFQIDNGIPIIP 1335


>gi|326430493|gb|EGD76063.1| hypothetical protein PTSG_00772 [Salpingoeca sp. ATCC 50818]
          Length = 840

 Score = 44.3 bits (103), Expect = 0.32,   Method: Compositional matrix adjust.
 Identities = 25/70 (35%), Positives = 38/70 (54%), Gaps = 7/70 (10%)

Query: 932 AEQFGAVCTKHIDDQVTHVVA----NSL---GTDKVNWALSTGRFVVHPGWVEASALLYR 984
           A+  GA      D+ VTHV+A    N L    T KV   +++G+++V P W+ A     R
Sbjct: 635 ADAMGATVLSKFDESVTHVLAALDENHLLISNTLKVMMGIASGKWIVSPAWLYACDTENR 694

Query: 985 RANEQDFAIK 994
           RANE+ F ++
Sbjct: 695 RANEEAFEMQ 704


>gi|312072812|ref|XP_003139236.1| SCP small domain phosphatase [Loa loa]
          Length = 321

 Score = 44.3 bits (103), Expect = 0.32,   Method: Composition-based stats.
 Identities = 46/168 (27%), Positives = 78/168 (46%), Gaps = 32/168 (19%)

Query: 678 SARKLCLVLDLDHTLLNSAKFHEVD-----PVHDEILRKKEEQDREKPHRHLFRFPHMGM 732
           S  +  LVLDLD TL++ +     D     PVH                   F+     +
Sbjct: 133 STPEFSLVLDLDETLVHCSLTELPDASLTFPVH-------------------FQENTYQV 173

Query: 733 WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPF 792
           + ++RP +  FLER S+ FE+ L+T   ++YA ++  +LDP   L   R+     +   F
Sbjct: 174 YVRVRPHLQEFLERLSRSFEIILFTASKRIYADKLLNLLDPGKRLIRHRLFR---EHCVF 230

Query: 793 DGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYTY 839
                +   KDL  +LG + S  +IID+S++ + +   N I +E + +
Sbjct: 231 VYGNYI---KDLT-ILGRDLSKTIIIDNSLQSFAYQIDNGIPIESWFF 274


>gi|449542561|gb|EMD33540.1| hypothetical protein CERSUDRAFT_160532 [Ceriporiopsis subvermispora
           B]
          Length = 965

 Score = 44.3 bits (103), Expect = 0.33,   Method: Compositional matrix adjust.
 Identities = 27/99 (27%), Positives = 44/99 (44%)

Query: 892 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVV 951
           D  ++L   Q  IL  C   F  +            L++ A + GA     + D +TH+V
Sbjct: 43  DDEDVLGETQASILESCPRPFRGITLCTTGISDKTTLFKQAIELGAQYDSDLRDNITHLV 102

Query: 952 ANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQD 990
           A   G+ K   AL+    ++HP WV  S  ++ R ++ D
Sbjct: 103 AEVPGSAKYKCALANKIPIMHPSWVTDSYEIWLRGDDVD 141


>gi|340501300|gb|EGR28100.1| NLI interacting factor-like phosphatase family protein, putative
           [Ichthyophthirius multifiliis]
          Length = 306

 Score = 44.3 bits (103), Expect = 0.33,   Method: Composition-based stats.
 Identities = 26/90 (28%), Positives = 47/90 (52%), Gaps = 12/90 (13%)

Query: 684 LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTF 743
           L LDLD TL++S + +E     +  ++ K  +D      +L +F       ++RP    F
Sbjct: 111 LFLDLDETLIHSCRINE-----NYNVQIKAFEDNNSQQEYLIQF-------RIRPYCMEF 158

Query: 744 LERASKLFEMHLYTMGNKLYATEMAKVLDP 773
           L++ SK ++++L+T  +  YA  +   LDP
Sbjct: 159 LQKISKYWDIYLFTASSTTYANAIVNYLDP 188


>gi|302806320|ref|XP_002984910.1| hypothetical protein SELMODRAFT_121210 [Selaginella moellendorffii]
 gi|300147496|gb|EFJ14160.1| hypothetical protein SELMODRAFT_121210 [Selaginella moellendorffii]
          Length = 198

 Score = 44.3 bits (103), Expect = 0.33,   Method: Compositional matrix adjust.
 Identities = 20/46 (43%), Positives = 29/46 (63%)

Query: 737 RPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 782
           RPG+ TFL   S+++E+ ++T   KLYA  +   LDP G LF  R+
Sbjct: 65  RPGVDTFLNEMSQIYEIVVFTRAVKLYADRILDRLDPAGNLFTHRL 110


>gi|159483225|ref|XP_001699661.1| hypothetical protein CHLREDRAFT_111940 [Chlamydomonas reinhardtii]
 gi|158281603|gb|EDP07357.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 202

 Score = 44.3 bits (103), Expect = 0.34,   Method: Composition-based stats.
 Identities = 37/124 (29%), Positives = 55/124 (44%), Gaps = 28/124 (22%)

Query: 673 QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPH-MG 731
           Q++   A KLC++LDLD TL++S                     R  P    +   H +G
Sbjct: 2   QQRPEHAGKLCVLLDLDGTLVSSY------------------TPRRAPRLPSYVRTHVVG 43

Query: 732 MWTKL---------RPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 782
           M +KL         RPG+  FLE  +   E+ ++T G + YA  +   +DP   LFA R+
Sbjct: 44  MGSKLNPAGVFVVERPGLTEFLEELATFAEVIIFTAGLEDYAKPIIDAIDPSNRLFAHRI 103

Query: 783 ISRG 786
              G
Sbjct: 104 YREG 107


>gi|340504501|gb|EGR30938.1| NLI interacting factor-like phosphatase family protein, putative
           [Ichthyophthirius multifiliis]
          Length = 230

 Score = 44.3 bits (103), Expect = 0.34,   Method: Composition-based stats.
 Identities = 40/157 (25%), Positives = 76/157 (48%), Gaps = 25/157 (15%)

Query: 684 LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTF 743
           L LDLD TL++S   +E +P  D IL+  E  +           P   +  ++RP    F
Sbjct: 49  LFLDLDETLIHSCSLNE-NP--DVILKVGEINE-----------PQFHIGFRIRPYCMDF 94

Query: 744 LERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDG----DERVP 799
           L+   + ++++++T  +  Y+  +   LDP+     G +++R +  +  +G    D R+ 
Sbjct: 95  LKALVEYWDIYIFTASSSTYSNAIINYLDPERKYING-ILNRSNCMETKNGFFIKDLRIA 153

Query: 800 KSKDLEGVLGME----SAVVIIDDSVRV--WPHNKLN 830
           K KDL  ++ ++    S    ID+ + +  W HNK +
Sbjct: 154 KGKDLRKIILVDNLSHSFGFQIDNGIPILEWHHNKYD 190


>gi|332308973|ref|NP_001193807.1| carboxy-terminal domain RNA polymerase II polypeptide A small
           phosphatase 1 isoform 3 [Homo sapiens]
 gi|397495664|ref|XP_003818667.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
           small phosphatase 1 isoform 2 [Pan paniscus]
 gi|410036206|ref|XP_003950023.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
           small phosphatase 1 [Pan troglodytes]
 gi|426338591|ref|XP_004033259.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
           small phosphatase 1 isoform 2 [Gorilla gorilla gorilla]
          Length = 260

 Score = 44.3 bits (103), Expect = 0.34,   Method: Compositional matrix adjust.
 Identities = 35/123 (28%), Positives = 57/123 (46%), Gaps = 14/123 (11%)

Query: 657 QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 716
           ++  AI K   + L  + K   + K+C+V+DLD TL++S+      PV++       E D
Sbjct: 65  EENGAIPKTPVQYLLPEAKAQDSDKICVVIDLDETLVHSS----FKPVNNADFIIPVEID 120

Query: 717 REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 776
                 ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  G 
Sbjct: 121 GVVHQVYVLKRPHVD----------EFLQRMGELFECVLFTASLAKYADPVADLLDKWGA 170

Query: 777 LFA 779
             A
Sbjct: 171 FRA 173


>gi|123404051|ref|XP_001302356.1| NLI interacting factor-like phosphatase family protein [Trichomonas
           vaginalis G3]
 gi|121883637|gb|EAX89426.1| NLI interacting factor-like phosphatase family protein [Trichomonas
           vaginalis G3]
          Length = 205

 Score = 44.3 bits (103), Expect = 0.35,   Method: Composition-based stats.
 Identities = 27/102 (26%), Positives = 51/102 (50%), Gaps = 18/102 (17%)

Query: 673 QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGM 732
           ++ + +  +  LVLDLD TL++++ F    P H ++                 +F     
Sbjct: 21  RRTIVTDSRKALVLDLDETLIHTSTF----PPHSDV--------------ESLKFDDSPD 62

Query: 733 WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 774
           +  LRP +  FL++ S+LFE+ ++T G + YA  +  +L P+
Sbjct: 63  YVFLRPNVRIFLDKVSELFEVFIFTAGTQNYAERILDLLCPQ 104


>gi|363736290|ref|XP_003641697.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
           small phosphatase 1-like [Gallus gallus]
          Length = 275

 Score = 44.3 bits (103), Expect = 0.36,   Method: Compositional matrix adjust.
 Identities = 35/123 (28%), Positives = 56/123 (45%), Gaps = 14/123 (11%)

Query: 657 QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 716
           ++   + K   + L  + K   A KLC+V+DLD TL++S+      PV++       E D
Sbjct: 80  EENGTVPKAAVKHLLPEIKPQDASKLCVVIDLDETLVHSS----FKPVNNADFIIPVEID 135

Query: 717 REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 776
                 ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  G 
Sbjct: 136 GIMHQVYVLKRPHVD----------EFLQRMGELFECVLFTASLAKYADPVADLLDKWGA 185

Query: 777 LFA 779
             A
Sbjct: 186 FRA 188


>gi|359323950|ref|XP_003640241.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
           small phosphatase 1-like [Canis lupus familiaris]
          Length = 260

 Score = 44.3 bits (103), Expect = 0.37,   Method: Compositional matrix adjust.
 Identities = 35/123 (28%), Positives = 57/123 (46%), Gaps = 14/123 (11%)

Query: 657 QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 716
           ++  A+ K   + L  + K   A K+C+V+DLD TL++S+      PV++       E D
Sbjct: 65  EENGAVPKTPVQYLLPEAKAQDADKICVVIDLDETLVHSS----FKPVNNADFIIPVEID 120

Query: 717 REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 776
                 ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  G 
Sbjct: 121 GVVHQVYVLKRPHVD----------EFLQRMGELFECVLFTASLAKYADPVADLLDKWGA 170

Query: 777 LFA 779
             A
Sbjct: 171 FRA 173


>gi|290981094|ref|XP_002673266.1| BRCT domain-containing protein [Naegleria gruberi]
 gi|284086848|gb|EFC40522.1| BRCT domain-containing protein [Naegleria gruberi]
          Length = 706

 Score = 44.3 bits (103), Expect = 0.38,   Method: Compositional matrix adjust.
 Identities = 31/92 (33%), Positives = 45/92 (48%), Gaps = 6/92 (6%)

Query: 903 KILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHID-DQVTHVVANSLGTDKVN 961
           KIL G  I  S     G  NP    L +TA + GA  T      + TH++A    TDK  
Sbjct: 239 KILPGVTIAIS-----GIVNPERTTLRETAMKMGAKYTPEFQAGKTTHMIAAFSNTDKTK 293

Query: 962 WALSTGRFVVHPGWVEASALLYRRANEQDFAI 993
            A++ G FVV   WV   + L +R +E+ +++
Sbjct: 294 EAINAGIFVVSKQWVYDCSTLSKRMDERRYSM 325


>gi|146089360|ref|XP_001470364.1| nuclear lim interactor-interacting factor-like protein [Leishmania
           infantum JPCM5]
 gi|134070397|emb|CAM68734.1| nuclear lim interactor-interacting factor-like protein [Leishmania
           infantum JPCM5]
          Length = 290

 Score = 44.3 bits (103), Expect = 0.38,   Method: Compositional matrix adjust.
 Identities = 29/100 (29%), Positives = 47/100 (47%), Gaps = 12/100 (12%)

Query: 678 SARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLR 737
           S  K+ LVLD+D TL++S      D V+D++L    E                 +  K R
Sbjct: 109 SVPKVTLVLDVDETLVHSTFQPSSDVVYDKVLLVASEGK------------TYTVSVKYR 156

Query: 738 PGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 777
           P +  FL   S+ FE+ ++T   + Y  ++   +DP G+L
Sbjct: 157 PYLEDFLRFVSRRFEVVIFTASMRAYCDKLMDEIDPHGIL 196


>gi|395816723|ref|XP_003781843.1| PREDICTED: CTD small phosphatase-like protein isoform 1 [Otolemur
           garnettii]
          Length = 265

 Score = 44.3 bits (103), Expect = 0.38,   Method: Compositional matrix adjust.
 Identities = 47/170 (27%), Positives = 73/170 (42%), Gaps = 33/170 (19%)

Query: 657 QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 716
           ++   +QK   + L  +  +    K C+V+DLD TL++S+      P+ +       E D
Sbjct: 71  EENGGLQKPPAKYLLPEVTVLDYGKKCVVIDLDETLVHSS----FKPISNADFIVPVEID 126

Query: 717 REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 776
                 ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  GV
Sbjct: 127 GTIHQVYVLKRPHVD----------EFLQRMGQLFECVLFTASLAKYADPVADLLDRWGV 176

Query: 777 LFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 820
             A       V  RG+              KDL   LG E S V+I+D+S
Sbjct: 177 FRARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIVDNS 213


>gi|335298853|ref|XP_003132160.2| PREDICTED: CTD small phosphatase-like protein-like [Sus scrofa]
          Length = 265

 Score = 44.3 bits (103), Expect = 0.39,   Method: Compositional matrix adjust.
 Identities = 47/170 (27%), Positives = 73/170 (42%), Gaps = 33/170 (19%)

Query: 657 QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 716
           ++   +QK   + L  +  +    K C+V+DLD TL++S+      P+ +       E D
Sbjct: 71  EENGGLQKPPAKYLLPEVTVLDYGKKCVVIDLDETLVHSS----FKPISNADFIVPVEID 126

Query: 717 REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 776
                 ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  GV
Sbjct: 127 GTIHQVYVLKRPHVD----------EFLQRMGQLFECVLFTASLAKYADPVADLLDRWGV 176

Query: 777 LFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 820
             A       V  RG+              KDL   LG E S V+I+D+S
Sbjct: 177 FRARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIVDNS 213


>gi|388579583|gb|EIM19905.1| hypothetical protein WALSEDRAFT_58475 [Wallemia sebi CBS 633.66]
          Length = 798

 Score = 43.9 bits (102), Expect = 0.40,   Method: Compositional matrix adjust.
 Identities = 20/58 (34%), Positives = 32/58 (55%)

Query: 926 HPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLY 983
           H L++ A + GA    +  D VTH++A   G+ K N A+  G  V+ P ++E S L +
Sbjct: 66  HELYKKAMEMGATYESNFTDNVTHLIAIEFGSPKYNCAIQLGATVLLPAFIEQSYLEW 123


>gi|145512870|ref|XP_001442346.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124409699|emb|CAK74949.1| unnamed protein product [Paramecium tetraurelia]
          Length = 400

 Score = 43.9 bits (102), Expect = 0.40,   Method: Compositional matrix adjust.
 Identities = 33/135 (24%), Positives = 64/135 (47%), Gaps = 19/135 (14%)

Query: 684 LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTF 743
           + LDLD TL+++    E   V    L+++ E   E           +G+   +RP    F
Sbjct: 220 IFLDLDETLVHACHSRETHTVK---LQQQNEDGSE-----------IGVGINIRPYTTYF 265

Query: 744 LERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDG----DERVP 799
           L+  ++ + +++YT  ++ YA  +   LDP     +G ++SR +  +  +G    D R+ 
Sbjct: 266 LQELAQYYTIYIYTASSQPYAQTIVNYLDPHKQYISG-ILSRSNCMETKNGFFIKDLRII 324

Query: 800 KSKDLEGVLGMESAV 814
           +  DL   L +++ V
Sbjct: 325 QDIDLNRTLIIDNLV 339


>gi|410224860|gb|JAA09649.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
           small phosphatase 1 [Pan troglodytes]
          Length = 260

 Score = 43.9 bits (102), Expect = 0.44,   Method: Compositional matrix adjust.
 Identities = 35/123 (28%), Positives = 57/123 (46%), Gaps = 14/123 (11%)

Query: 657 QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 716
           ++  AI K   + L  + K   + K+C+V+DLD TL++S+      PV++       E D
Sbjct: 65  EENGAIPKTPVQYLLPEAKAQDSDKICVVIDLDETLVHSS----FKPVNNADFIIPVEID 120

Query: 717 REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 776
                 ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  G 
Sbjct: 121 GVVHQVYVLKRPHVD----------EFLQRMGELFECVLFTASLAKYADPVADLLDKWGA 170

Query: 777 LFA 779
             A
Sbjct: 171 FRA 173


>gi|145513150|ref|XP_001442486.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124409839|emb|CAK75089.1| unnamed protein product [Paramecium tetraurelia]
          Length = 425

 Score = 43.9 bits (102), Expect = 0.45,   Method: Compositional matrix adjust.
 Identities = 33/134 (24%), Positives = 61/134 (45%), Gaps = 27/134 (20%)

Query: 683 CLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKL----RP 738
            L+LDLD TL++S                     RE P  ++      G   K+    RP
Sbjct: 244 ALILDLDETLIHSCA------------------QRENPQVYVTAVGDFGEEAKIGINIRP 285

Query: 739 GIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDG---- 794
               FL++ S+ + +++YT  ++ YA  +   LDP     +G +++R +  +  +G    
Sbjct: 286 YTSLFLQQLSQYYTIYIYTASSQAYAQAIINYLDPTKQYISG-IMTRNNCMETKNGFFIK 344

Query: 795 DERVPKSKDLEGVL 808
           D R+  +K+L+ +L
Sbjct: 345 DLRLISNKELKDML 358


>gi|15220552|ref|NP_174271.1| haloacid dehalogenase-like hydrolase [Arabidopsis thaliana]
 gi|124301096|gb|ABN04800.1| At1g29780 [Arabidopsis thaliana]
 gi|332193007|gb|AEE31128.1| haloacid dehalogenase-like hydrolase [Arabidopsis thaliana]
          Length = 221

 Score = 43.9 bits (102), Expect = 0.46,   Method: Composition-based stats.
 Identities = 31/109 (28%), Positives = 57/109 (52%), Gaps = 13/109 (11%)

Query: 669 RLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP 728
           +LE+    ++  K  ++LDLD TL++ A  H     HD ++  K E++            
Sbjct: 37  KLEDPLTGYTNMKRTIILDLDETLVH-ATTHLPGVKHDFMVMVKMEREI----------- 84

Query: 729 HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 777
            M ++   RPG+  FLER  + +++ ++T G + YA+++   LD  GV+
Sbjct: 85  -MPIFVVKRPGVTEFLERLGENYKVVVFTAGLEEYASQVLDKLDKNGVI 132


>gi|55740285|gb|AAV63944.1| nuclear LIM factor interactor-interacting protein [Phytophthora
           sojae]
 gi|348684603|gb|EGZ24418.1| hypothetical protein PHYSODRAFT_296510 [Phytophthora sojae]
          Length = 336

 Score = 43.9 bits (102), Expect = 0.46,   Method: Compositional matrix adjust.
 Identities = 31/97 (31%), Positives = 45/97 (46%), Gaps = 14/97 (14%)

Query: 681 KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 740
           K+CLVLDLD TL++S+      P  +       E D    H          ++   RPG 
Sbjct: 165 KMCLVLDLDETLVHSS----FRPTPNPDFVIPVEIDGTIHH----------VFVAKRPGA 210

Query: 741 WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 777
             FL   +K +E+ +YT     YA  +   LDP+GV+
Sbjct: 211 EEFLVEMAKYYEIVIYTASLSKYADPLLDQLDPEGVI 247


>gi|145483633|ref|XP_001427839.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124394922|emb|CAK60441.1| unnamed protein product [Paramecium tetraurelia]
          Length = 308

 Score = 43.9 bits (102), Expect = 0.48,   Method: Composition-based stats.
 Identities = 34/96 (35%), Positives = 54/96 (56%), Gaps = 15/96 (15%)

Query: 679 ARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRP 738
           ARKLC VLDLD TL++S    + D  +D +L    +         LF+     ++  +RP
Sbjct: 54  ARKLC-VLDLDETLVHSQ--FKGDNGYDFLLDIIVQS-------QLFK-----VFVTVRP 98

Query: 739 GIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 774
           G+ TFLE+ S+ F++ L+T   K YA  +  ++DP+
Sbjct: 99  GVETFLEQLSEHFDIVLWTASLKEYADPVIDIIDPQ 134


>gi|119591022|gb|EAW70616.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
           small phosphatase 1, isoform CRA_b [Homo sapiens]
 gi|119591023|gb|EAW70617.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
           small phosphatase 1, isoform CRA_b [Homo sapiens]
          Length = 255

 Score = 43.9 bits (102), Expect = 0.48,   Method: Compositional matrix adjust.
 Identities = 35/123 (28%), Positives = 57/123 (46%), Gaps = 14/123 (11%)

Query: 657 QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 716
           ++  AI K   + L  + K   + K+C+V+DLD TL++S+      PV++       E D
Sbjct: 60  EENGAIPKTPVQYLLPEAKAQDSDKICVVIDLDETLVHSS----FKPVNNADFIIPVEID 115

Query: 717 REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 776
                 ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  G 
Sbjct: 116 GVVHQVYVLKRPHVD----------EFLQRMGELFECVLFTASLAKYADPVADLLDKWGA 165

Query: 777 LFA 779
             A
Sbjct: 166 FRA 168


>gi|209882797|ref|XP_002142834.1| NLI interacting factor-like phosphatase family protein
           [Cryptosporidium muris RN66]
 gi|209558440|gb|EEA08485.1| NLI interacting factor-like phosphatase family protein
           [Cryptosporidium muris RN66]
          Length = 536

 Score = 43.9 bits (102), Expect = 0.49,   Method: Compositional matrix adjust.
 Identities = 34/107 (31%), Positives = 52/107 (48%), Gaps = 15/107 (14%)

Query: 670 LEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPH 729
           LE QK  +  RK  LVLDLD TL++S+      P+         E + E  + ++ +   
Sbjct: 332 LEPQKPEYFGRKT-LVLDLDETLVHSS----FQPIRAASFVISVEIEYEMYNVYVLK--- 383

Query: 730 MGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 776
                  RPG+  FLE  S L+E+ ++T     YA  +   LDP+G+
Sbjct: 384 -------RPGVDKFLEVVSSLYEVVIFTASLSKYANPLLDKLDPRGL 423


>gi|55740291|gb|AAV63946.1| nuclear LIM factor interactor-interacting protein [Phytophthora
           sojae]
 gi|348684596|gb|EGZ24411.1| hypothetical protein PHYSODRAFT_325530 [Phytophthora sojae]
          Length = 336

 Score = 43.9 bits (102), Expect = 0.49,   Method: Compositional matrix adjust.
 Identities = 31/97 (31%), Positives = 45/97 (46%), Gaps = 14/97 (14%)

Query: 681 KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 740
           K+CLVLDLD TL++S+      P  +       E D    H          ++   RPG 
Sbjct: 165 KMCLVLDLDETLVHSS----FRPTPNPDFVIPVEIDGTIHH----------VFVAKRPGA 210

Query: 741 WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 777
             FL   +K +E+ +YT     YA  +   LDP+GV+
Sbjct: 211 EEFLVEMAKYYEIVIYTASLSKYADPLLDQLDPEGVI 247


>gi|392345559|ref|XP_003749304.1| PREDICTED: LOW QUALITY PROTEIN: protein ECT2 [Rattus norvegicus]
          Length = 913

 Score = 43.9 bits (102), Expect = 0.49,   Method: Compositional matrix adjust.
 Identities = 21/59 (35%), Positives = 31/59 (52%), Gaps = 3/59 (5%)

Query: 934 QFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFA 992
             G V  K  + +VTH+VAN    +K   A+S G  ++ PGW+  +   + R NEQ F 
Sbjct: 201 HMGGVIRKECNSKVTHLVANCTQGEKFRVAVSLGTPIMRPGWIYEA---WERRNEQCFC 256


>gi|403420598|ref|NP_001102017.2| protein ECT2 [Rattus norvegicus]
 gi|392338694|ref|XP_003753608.1| PREDICTED: protein ECT2 isoform 2 [Rattus norvegicus]
          Length = 913

 Score = 43.9 bits (102), Expect = 0.49,   Method: Compositional matrix adjust.
 Identities = 21/59 (35%), Positives = 31/59 (52%), Gaps = 3/59 (5%)

Query: 934 QFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFA 992
             G V  K  + +VTH+VAN    +K   A+S G  ++ PGW+  +   + R NEQ F 
Sbjct: 201 HMGGVIRKECNSKVTHLVANCTQGEKFRVAVSLGTPIMRPGWIYEA---WERRNEQCFC 256


>gi|297597243|ref|NP_001043640.2| Os01g0629400 [Oryza sativa Japonica Group]
 gi|255673485|dbj|BAF05554.2| Os01g0629400, partial [Oryza sativa Japonica Group]
          Length = 177

 Score = 43.9 bits (102), Expect = 0.49,   Method: Composition-based stats.
 Identities = 28/105 (26%), Positives = 57/105 (54%), Gaps = 14/105 (13%)

Query: 724 LFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVI 783
            F   +  ++ + RP +  FLE+ +++F++ ++T   ++YA ++   LDP G L + R+ 
Sbjct: 30  FFNMKNHTVYVRQRPHLKMFLEKVAQMFDLVIFTASQRIYAEQLIDRLDPDGRLISHRIY 89

Query: 784 SRG---DDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVW 824
                  +G           +KDL  +LG++ A VVI+D++ +V+
Sbjct: 90  RESCIFSEG---------CYTKDLT-ILGVDLAKVVIVDNTPQVF 124


>gi|392338696|ref|XP_003753609.1| PREDICTED: protein ECT2 isoform 3 [Rattus norvegicus]
          Length = 754

 Score = 43.9 bits (102), Expect = 0.50,   Method: Compositional matrix adjust.
 Identities = 21/59 (35%), Positives = 31/59 (52%), Gaps = 3/59 (5%)

Query: 934 QFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFA 992
             G V  K  + +VTH+VAN    +K   A+S G  ++ PGW+  +   + R NEQ F 
Sbjct: 42  HMGGVIRKECNSKVTHLVANCTQGEKFRVAVSLGTPIMRPGWIYEA---WERRNEQCFC 97


>gi|398016831|ref|XP_003861603.1| nuclear lim interactor-interacting factor-like protein [Leishmania
           donovani]
 gi|322499830|emb|CBZ34903.1| nuclear lim interactor-interacting factor-like protein [Leishmania
           donovani]
          Length = 290

 Score = 43.9 bits (102), Expect = 0.50,   Method: Compositional matrix adjust.
 Identities = 29/100 (29%), Positives = 47/100 (47%), Gaps = 12/100 (12%)

Query: 678 SARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLR 737
           S  K+ LVLD+D TL++S      D V+D++L    E                 +  K R
Sbjct: 109 SVPKVTLVLDVDETLVHSTFQPSSDVVYDKVLLVASEGK------------TYTVSVKYR 156

Query: 738 PGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 777
           P +  FL   S+ FE+ ++T   + Y  ++   +DP G+L
Sbjct: 157 PYLEDFLRFVSRRFEVVIFTASMRAYCDKLMDEIDPHGIL 196


>gi|32813443|ref|NP_872580.1| carboxy-terminal domain RNA polymerase II polypeptide A small
           phosphatase 1 isoform 2 [Homo sapiens]
 gi|31074175|gb|AAP34397.1| small CTD phosphatase 1 [Homo sapiens]
 gi|410351181|gb|JAA42194.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
           small phosphatase 1 [Pan troglodytes]
          Length = 260

 Score = 43.9 bits (102), Expect = 0.50,   Method: Compositional matrix adjust.
 Identities = 35/123 (28%), Positives = 57/123 (46%), Gaps = 14/123 (11%)

Query: 657 QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 716
           ++  AI K   + L  + K   + K+C+V+DLD TL++S+      PV++       E D
Sbjct: 65  EENGAIPKTPVQYLLPEAKAQDSDKICVVIDLDETLVHSS----FKPVNNADFIIPVEID 120

Query: 717 REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 776
                 ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  G 
Sbjct: 121 GVVHQVYVLKRPHVD----------EFLQRMGELFECVLFTASLAKYADPVADLLDKWGA 170

Query: 777 LFA 779
             A
Sbjct: 171 FRA 173


>gi|395823467|ref|XP_003785008.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
           small phosphatase 1 [Otolemur garnettii]
          Length = 260

 Score = 43.9 bits (102), Expect = 0.51,   Method: Compositional matrix adjust.
 Identities = 35/123 (28%), Positives = 57/123 (46%), Gaps = 14/123 (11%)

Query: 657 QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 716
           ++  AI K   + L  + K   + K+C+V+DLD TL++S+      PV++       E D
Sbjct: 65  EENGAIPKTPVQYLLPEAKAQDSDKICVVIDLDETLVHSS----FKPVNNADFIIPVEID 120

Query: 717 REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 776
                 ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  G 
Sbjct: 121 GVVHQVYVLKRPHVD----------EFLQRMGELFECVLFTASLAKYADPVADLLDKWGA 170

Query: 777 LFA 779
             A
Sbjct: 171 FRA 173


>gi|255087422|ref|XP_002505634.1| predicted protein [Micromonas sp. RCC299]
 gi|226520904|gb|ACO66892.1| predicted protein [Micromonas sp. RCC299]
          Length = 548

 Score = 43.5 bits (101), Expect = 0.53,   Method: Compositional matrix adjust.
 Identities = 33/109 (30%), Positives = 50/109 (45%), Gaps = 27/109 (24%)

Query: 681 KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFP-------HMGMW 733
           K  LVLDLD TL           VH  + +  EE D        F FP       H+ + 
Sbjct: 354 KNTLVLDLDETL-----------VHSNLEQTIEEAD--------FSFPVTFNGQQHI-VN 393

Query: 734 TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 782
            + RP +  F+E A++ FE+ ++T   ++YA  +   +DP  VL   R+
Sbjct: 394 VRRRPYLTEFMEFAARHFEVVVFTASQRVYAERLLNKIDPNQVLIKHRL 442


>gi|340376943|ref|XP_003386990.1| PREDICTED: hypothetical protein LOC100641299 [Amphimedon
           queenslandica]
          Length = 1244

 Score = 43.5 bits (101), Expect = 0.55,   Method: Compositional matrix adjust.
 Identities = 43/166 (25%), Positives = 73/166 (43%), Gaps = 16/166 (9%)

Query: 673 QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGM 732
           + ++FS R+  +VLDLD TL++S         HD  L    E   +   R       +  
Sbjct: 123 ETRLFSVRRKIMVLDLDETLIHSH--------HDNTLLPATEMLPDFYVRVYIENHPVKF 174

Query: 733 WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDP-KGVLFAGRVISRGDDGDP 791
           +   RP +  FL   S+ +++ ++T   + Y  E+A  LD  KG+L   R   R D    
Sbjct: 175 YVYKRPHVDYFLSVVSQWYDLVIFTASMQKYGMEVANHLDQNKGIL--PRRYFRQDCTMD 232

Query: 792 FDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERY 837
            +G      +K+L  +    S + I+D+S   +  N  N I +  +
Sbjct: 233 MNG-----YTKNLSMISEDLSNIFILDNSPSAYRGNPDNAIPITSW 273


>gi|410969412|ref|XP_003991189.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
           small phosphatase 1 [Felis catus]
          Length = 259

 Score = 43.5 bits (101), Expect = 0.55,   Method: Compositional matrix adjust.
 Identities = 34/123 (27%), Positives = 56/123 (45%), Gaps = 14/123 (11%)

Query: 657 QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 716
           ++  A+ K   + L  + K     K+C+V+DLD TL++S+      PV++       E D
Sbjct: 64  EENGAVPKTPVQYLLPEAKAQDVDKICVVIDLDETLVHSS----FKPVNNADFIIPVEID 119

Query: 717 REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 776
                 ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  G 
Sbjct: 120 GVVHQVYVLKRPHVD----------EFLQRMGELFECVLFTASLAKYADPVADLLDKWGA 169

Query: 777 LFA 779
             A
Sbjct: 170 FRA 172


>gi|422294377|gb|EKU21677.1| nli interacting factor family protein [Nannochloropsis gaditana
           CCMP526]
          Length = 378

 Score = 43.5 bits (101), Expect = 0.56,   Method: Compositional matrix adjust.
 Identities = 40/149 (26%), Positives = 61/149 (40%), Gaps = 47/149 (31%)

Query: 670 LEEQKKMFSA----RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHL- 724
           ++ Q + FS      KL +VLD+D  LL+S +F E           ++E+ RE  H+ L 
Sbjct: 109 MQGQGRRFSVYRKREKLTVVLDMDECLLHS-RFEE---------DMRDERGRELAHQLLP 158

Query: 725 ------FRFPHMG----------------MWTKL----------RPGIWTFLERASKLFE 752
                 F + H                   W +L          RPG+  FL+R S  + 
Sbjct: 159 NGDSESFHYQHQADVGEALGDVRHRSVDYFWLELEEGERVRVNLRPGVEAFLQRLSDEYN 218

Query: 753 MHLYTMGNKLYATEMAKVLDPKGVLFAGR 781
           + ++T   + YA  +   LDP G L  GR
Sbjct: 219 VFVFTAATETYARPVLDRLDPTGSLLDGR 247


>gi|392338692|ref|XP_003753607.1| PREDICTED: protein ECT2 isoform 1 [Rattus norvegicus]
          Length = 882

 Score = 43.5 bits (101), Expect = 0.56,   Method: Compositional matrix adjust.
 Identities = 21/59 (35%), Positives = 31/59 (52%), Gaps = 3/59 (5%)

Query: 934 QFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFA 992
             G V  K  + +VTH+VAN    +K   A+S G  ++ PGW+  +   + R NEQ F 
Sbjct: 170 HMGGVIRKECNSKVTHLVANCTQGEKFRVAVSLGTPIMRPGWIYEA---WERRNEQCFC 225


>gi|148236343|ref|NP_001091042.1| protein ECT2 [Bos taurus]
 gi|146186917|gb|AAI40474.1| ECT2 protein [Bos taurus]
          Length = 914

 Score = 43.5 bits (101), Expect = 0.56,   Method: Compositional matrix adjust.
 Identities = 21/57 (36%), Positives = 31/57 (54%), Gaps = 3/57 (5%)

Query: 936 GAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFA 992
           G V  K  + +VTH+VAN    +K   A+S G  ++ P W+  +   + R NEQDF 
Sbjct: 203 GGVIRKDFNSKVTHLVANCTQGEKFRVAVSLGTPIMKPEWIHKA---WERRNEQDFC 256


>gi|45184666|ref|NP_982384.1| AAL158Wp [Ashbya gossypii ATCC 10895]
 gi|44980012|gb|AAS50208.1| AAL158Wp [Ashbya gossypii ATCC 10895]
          Length = 478

 Score = 43.5 bits (101), Expect = 0.57,   Method: Compositional matrix adjust.
 Identities = 33/105 (31%), Positives = 50/105 (47%), Gaps = 25/105 (23%)

Query: 673 QKKMFSARKLCLVLDLDHTLLNSA--KFHEVD---PVHDEILRKKEEQDREKPHRHLFRF 727
           Q+  F  RK CLVLDLD TL++S+    H  D   PV         E D +  + ++ + 
Sbjct: 301 QRPEFRGRK-CLVLDLDETLVHSSFKYLHTADFVIPV---------EIDNQVHNVYVIK- 349

Query: 728 PHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLD 772
                    RPG+  FL+R  +LFE+ ++T     Y   +  +LD
Sbjct: 350 ---------RPGVDEFLKRVGELFEVVVFTASVSRYGDPLLDILD 385


>gi|296491169|tpg|DAA33242.1| TPA: epithelial cell transforming sequence 2 oncogene [Bos taurus]
          Length = 914

 Score = 43.5 bits (101), Expect = 0.57,   Method: Compositional matrix adjust.
 Identities = 21/57 (36%), Positives = 31/57 (54%), Gaps = 3/57 (5%)

Query: 936 GAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFA 992
           G V  K  + +VTH+VAN    +K   A+S G  ++ P W+  +   + R NEQDF 
Sbjct: 203 GGVIRKDFNSKVTHLVANCTQGEKFRVAVSLGTPIMKPEWIHKA---WERRNEQDFC 256


>gi|410258922|gb|JAA17427.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
           small phosphatase 1 [Pan troglodytes]
 gi|410290720|gb|JAA23960.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
           small phosphatase 1 [Pan troglodytes]
          Length = 260

 Score = 43.5 bits (101), Expect = 0.58,   Method: Compositional matrix adjust.
 Identities = 35/123 (28%), Positives = 57/123 (46%), Gaps = 14/123 (11%)

Query: 657 QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 716
           ++  AI K   + L  + K   + K+C+V+DLD TL++S+      PV++       E D
Sbjct: 65  EENGAIPKTPVQYLLPEAKAQDSDKICVVIDLDETLVHSS----FKPVNNADFIIPVEID 120

Query: 717 REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 776
                 ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  G 
Sbjct: 121 GVVHQVYVLKRPHVD----------EFLQRMGELFECVLFTASLAKYADPVADLLDKWGA 170

Query: 777 LFA 779
             A
Sbjct: 171 FRA 173


>gi|354547373|emb|CCE44108.1| hypothetical protein CPAR2_503330 [Candida parapsilosis]
          Length = 373

 Score = 43.5 bits (101), Expect = 0.60,   Method: Compositional matrix adjust.
 Identities = 38/115 (33%), Positives = 51/115 (44%), Gaps = 33/115 (28%)

Query: 678 SARKLCLVLDLDHTL-------LNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHM 730
           +A K CLVLDLD TL       L SA F  V PV         E D +  H ++ +    
Sbjct: 200 TANKKCLVLDLDETLVHSSFKYLRSADF--VIPV---------EIDGQVHHVYVIK---- 244

Query: 731 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISR 785
                 RPG+  FLER  KL+E+ ++T     Y   +   LD     F+  V+ R
Sbjct: 245 ------RPGVDEFLERVGKLYEVVVFTASVSKYGDPLLNKLD-----FSQSVLHR 288


>gi|384247176|gb|EIE20663.1| hypothetical protein COCSUDRAFT_30404 [Coccomyxa subellipsoidea
           C-169]
          Length = 243

 Score = 43.5 bits (101), Expect = 0.60,   Method: Composition-based stats.
 Identities = 51/173 (29%), Positives = 81/173 (46%), Gaps = 22/173 (12%)

Query: 666 RTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLF 725
           R RR    ++    ++  LVLDLD TL++S             L   +E D   P    F
Sbjct: 43  RWRRSLLPRQTRQCKRKTLVLDLDETLVHST------------LDGCDEPDFSFPVA--F 88

Query: 726 RFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISR 785
                 +  + RP +  FL+R ++LFE+ ++T   K+YA ++  +LDP   L   RV   
Sbjct: 89  NGREHRVHVRRRPHLQHFLQRCAELFEVVVFTASQKVYAEQLLNILDPTRTLIRHRVFR- 147

Query: 786 GDDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERY 837
            D     +G+      KDL  VLG + A  VI+D+S + + +   N I +E +
Sbjct: 148 -DSCVFVEGNYL----KDLS-VLGRDLAHTVIVDNSPQAFGYQLPNGIPIESW 194


>gi|154339115|ref|XP_001562249.1| nuclear lim interactor-interacting factor-like protein [Leishmania
           braziliensis MHOM/BR/75/M2904]
 gi|134062832|emb|CAM39277.1| nuclear lim interactor-interacting factor-like protein [Leishmania
           braziliensis MHOM/BR/75/M2904]
          Length = 290

 Score = 43.5 bits (101), Expect = 0.61,   Method: Compositional matrix adjust.
 Identities = 33/104 (31%), Positives = 53/104 (50%), Gaps = 20/104 (19%)

Query: 678 SARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMG-MWT-- 734
           S  K+ LVLD+D TL++S      D V+D++L             H+   P  G M+T  
Sbjct: 109 SVPKVTLVLDVDETLVHSTFQPSSDVVYDKVL-------------HV---PSDGRMYTVS 152

Query: 735 -KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 777
            K RP +  FL   S+ FE+ ++T   + Y  ++   +DP+G+L
Sbjct: 153 VKYRPYLEDFLRFISRRFEVVVFTASMRAYCDKLMDEIDPQGIL 196


>gi|2289786|dbj|BAA21667.1| HYA22 [Homo sapiens]
          Length = 340

 Score = 43.5 bits (101), Expect = 0.62,   Method: Compositional matrix adjust.
 Identities = 44/169 (26%), Positives = 71/169 (42%), Gaps = 31/169 (18%)

Query: 657 QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 716
           ++   +QK   + L  +  +    K C+V+DLD TL++S+      P+ +       E D
Sbjct: 146 EENGGLQKPPAKYLLPEVTVLDYGKKCVVIDLDETLVHSS----FKPISNADFIVPVEID 201

Query: 717 REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 776
                 ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  GV
Sbjct: 202 GTIHQVYVLKRPHVD----------EFLQRMGQLFECVLFTASLAKYADPVADLLDRWGV 251

Query: 777 LFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDS 820
             A       V  RG+              KDL  +    S V+I+D+S
Sbjct: 252 FRARLFRESCVFHRGN------------YVKDLSRLGRELSKVIIVDNS 288


>gi|159485684|ref|XP_001700874.1| hypothetical protein CHLREDRAFT_142839 [Chlamydomonas reinhardtii]
 gi|158281373|gb|EDP07128.1| predicted protein, partial [Chlamydomonas reinhardtii]
          Length = 418

 Score = 43.5 bits (101), Expect = 0.62,   Method: Compositional matrix adjust.
 Identities = 34/103 (33%), Positives = 52/103 (50%), Gaps = 7/103 (6%)

Query: 736 LRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGD 795
           LRPG+  FLE    +FE+ L+T   + +AT   + +DP G +F  R+       D     
Sbjct: 34  LRPGLRRFLESVRPMFEVVLFTAAGESWATCAMQRIDPDGRIFDTRLYR-----DHTVSH 88

Query: 796 ERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERY 837
           +  P  KDL   LG + A VVI+DD+  ++ +   N + V  Y
Sbjct: 89  DDWPWVKDLSR-LGRDLARVVIVDDNPLMFMYQPDNALHVAPY 130


>gi|440910832|gb|ELR60586.1| Protein ECT2 [Bos grunniens mutus]
          Length = 934

 Score = 43.5 bits (101), Expect = 0.62,   Method: Compositional matrix adjust.
 Identities = 21/57 (36%), Positives = 31/57 (54%), Gaps = 3/57 (5%)

Query: 936 GAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFA 992
           G V  K  + +VTH+VAN    +K   A+S G  ++ P W+  +   + R NEQDF 
Sbjct: 203 GGVIRKDFNSKVTHLVANCTQGEKFRVAVSLGTPIMKPEWIHKA---WERRNEQDFC 256


>gi|396479189|ref|XP_003840695.1| similar to phosphoprotein phosphatase [Leptosphaeria maculans JN3]
 gi|312217268|emb|CBX97216.1| similar to phosphoprotein phosphatase [Leptosphaeria maculans JN3]
          Length = 511

 Score = 43.5 bits (101), Expect = 0.63,   Method: Compositional matrix adjust.
 Identities = 50/189 (26%), Positives = 81/189 (42%), Gaps = 45/189 (23%)

Query: 606 QIKSGADMKAV-VTNHDDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKAAIQK 664
           QI    +MK V ++ HD  Q G     +A P   HP+     V+        +++ A+Q 
Sbjct: 263 QIDDDIEMKDVPLSTHDVHQEGEDQSTDAQP--DHPK-----VDLPPPPPLVERQHAVQS 315

Query: 665 ERT---RRLEEQKKM-------FSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEE 714
           + T      E QK +       F  +K CLVLDLD TL++S+          +IL     
Sbjct: 316 QVTDASEASEPQKYLLGPIAPRFKGKK-CLVLDLDETLVHSSF---------KIL----- 360

Query: 715 QDREKPHRHLFRFP------HMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMA 768
                 H+  F  P      +  ++   RPG+  F++R  +L+E+ ++T     Y   + 
Sbjct: 361 ------HQADFTIPVEIEGQYHNVYVIKRPGVDQFMKRVGELYEVVVFTASVSKYGDPLL 414

Query: 769 KVLDPKGVL 777
             LD  GV+
Sbjct: 415 DQLDIHGVV 423


>gi|410970977|ref|XP_003991950.1| PREDICTED: LOW QUALITY PROTEIN: protein ECT2 [Felis catus]
          Length = 914

 Score = 43.5 bits (101), Expect = 0.63,   Method: Compositional matrix adjust.
 Identities = 21/59 (35%), Positives = 31/59 (52%), Gaps = 3/59 (5%)

Query: 934 QFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFA 992
             G V  K  + +VTH+VAN    +K   A+S G  ++ P W+  +   + R NEQDF 
Sbjct: 201 HMGGVIRKDFNSKVTHLVANCTQGEKFRIAVSLGTPIMKPEWIYKA---WERRNEQDFC 256


>gi|302819617|ref|XP_002991478.1| hypothetical protein SELMODRAFT_133639 [Selaginella moellendorffii]
 gi|300140680|gb|EFJ07400.1| hypothetical protein SELMODRAFT_133639 [Selaginella moellendorffii]
          Length = 219

 Score = 43.5 bits (101), Expect = 0.63,   Method: Composition-based stats.
 Identities = 32/102 (31%), Positives = 47/102 (46%), Gaps = 17/102 (16%)

Query: 681 KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 740
           K  LVLD+D TL+++         H  I   K    +  P +              RPG+
Sbjct: 2   KPTLVLDMDETLIHA---------HKAIASLKLFSGKTLPLKRYL--------VAKRPGV 44

Query: 741 WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 782
            TFL+  SK++E+ ++T   K YA  +   LDP G LF  R+
Sbjct: 45  NTFLDEMSKIYEIVVFTRVVKPYADRILDRLDPVGNLFTHRL 86


>gi|395527897|ref|XP_003766073.1| PREDICTED: protein ECT2 isoform 2 [Sarcophilus harrisii]
          Length = 915

 Score = 43.5 bits (101), Expect = 0.65,   Method: Compositional matrix adjust.
 Identities = 24/68 (35%), Positives = 33/68 (48%), Gaps = 3/68 (4%)

Query: 925 LHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYR 984
           L  L   A   G    K  + +VTH+VAN    +K   A+S G  +V P W+  +   + 
Sbjct: 192 LVKLVTLAHYMGGTIRKDFNSKVTHLVANCTQGEKFRVAVSLGTPIVKPEWIFKA---WE 248

Query: 985 RANEQDFA 992
           R NEQDF 
Sbjct: 249 RRNEQDFC 256


>gi|145533625|ref|XP_001452557.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124420256|emb|CAK85160.1| unnamed protein product [Paramecium tetraurelia]
          Length = 343

 Score = 43.5 bits (101), Expect = 0.66,   Method: Compositional matrix adjust.
 Identities = 27/103 (26%), Positives = 50/103 (48%), Gaps = 21/103 (20%)

Query: 678 SARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLR 737
           S +++ +V DLD TL++S +  + D V+D                  F+    G++  +R
Sbjct: 156 SQKQIKIVFDLDETLVHSEEVQK-DKVYD------------------FQNNEFGLF--VR 194

Query: 738 PGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAG 780
           P     L+  S+L ++ +YT  N+ YA  +  ++DP+   F G
Sbjct: 195 PYCCHVLKELSQLADLFVYTSANQKYAKTIINLIDPENTFFKG 237


>gi|401841683|gb|EJT44034.1| PSR2-like protein [Saccharomyces kudriavzevii IFO 1802]
          Length = 392

 Score = 43.5 bits (101), Expect = 0.67,   Method: Compositional matrix adjust.
 Identities = 28/99 (28%), Positives = 49/99 (49%), Gaps = 16/99 (16%)

Query: 680 RKLCLVLDLDHTLLNSA-KFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRP 738
           +K CL+LDLD TL++S+ K+ +      + +   E  D+              ++   RP
Sbjct: 221 QKKCLILDLDETLVHSSFKYMQTA----DFVLPVEIDDQVH-----------NVYVIKRP 265

Query: 739 GIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 777
           G+  FL R S+++E+ ++T     YA  +   LDP G +
Sbjct: 266 GVDEFLHRVSQVYEVVVFTASVSRYANPLLDTLDPNGTI 304


>gi|358421543|ref|XP_003585009.1| PREDICTED: protein ECT2-like [Bos taurus]
          Length = 524

 Score = 43.1 bits (100), Expect = 0.69,   Method: Compositional matrix adjust.
 Identities = 21/57 (36%), Positives = 31/57 (54%), Gaps = 3/57 (5%)

Query: 936 GAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFA 992
           G V  K  + +VTH+VAN    +K   A+S G  ++ P W+  +   + R NEQDF 
Sbjct: 203 GGVIRKDFNSKVTHLVANCTQGEKFRVAVSLGTPIMKPEWIHKA---WERRNEQDFC 256


>gi|351709551|gb|EHB12470.1| Protein ECT2, partial [Heterocephalus glaber]
          Length = 856

 Score = 43.1 bits (100), Expect = 0.69,   Method: Compositional matrix adjust.
 Identities = 21/59 (35%), Positives = 31/59 (52%), Gaps = 3/59 (5%)

Query: 934 QFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFA 992
             G V  K  + +VTH+VAN    +K   A+S G  ++ P W+  +   + R NEQDF 
Sbjct: 170 HMGGVIRKDFNSKVTHLVANCTQGEKFRVAVSLGTPIMKPDWIYKA---WERRNEQDFC 225


>gi|432102147|gb|ELK29956.1| Protein ECT2 [Myotis davidii]
          Length = 882

 Score = 43.1 bits (100), Expect = 0.70,   Method: Compositional matrix adjust.
 Identities = 21/57 (36%), Positives = 31/57 (54%), Gaps = 3/57 (5%)

Query: 936 GAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFA 992
           G V  K  + +VTH+VAN    +K   A+S G  ++ P W+  +   + R NEQDF 
Sbjct: 171 GGVIRKDFNSKVTHLVANCTQGEKFRIAVSLGTPIMKPEWIYKA---WERRNEQDFC 224


>gi|395527895|ref|XP_003766072.1| PREDICTED: protein ECT2 isoform 1 [Sarcophilus harrisii]
          Length = 884

 Score = 43.1 bits (100), Expect = 0.71,   Method: Compositional matrix adjust.
 Identities = 24/68 (35%), Positives = 33/68 (48%), Gaps = 3/68 (4%)

Query: 925 LHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYR 984
           L  L   A   G    K  + +VTH+VAN    +K   A+S G  +V P W+  +   + 
Sbjct: 161 LVKLVTLAHYMGGTIRKDFNSKVTHLVANCTQGEKFRVAVSLGTPIVKPEWIFKA---WE 217

Query: 985 RANEQDFA 992
           R NEQDF 
Sbjct: 218 RRNEQDFC 225


>gi|159112651|ref|XP_001706554.1| Nuclear LIM interactor-interacting factor 1 [Giardia lamblia ATCC
           50803]
 gi|157434651|gb|EDO78880.1| Nuclear LIM interactor-interacting factor 1 [Giardia lamblia ATCC
           50803]
          Length = 432

 Score = 43.1 bits (100), Expect = 0.71,   Method: Compositional matrix adjust.
 Identities = 29/97 (29%), Positives = 49/97 (50%), Gaps = 8/97 (8%)

Query: 681 KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 740
           K  LVLDLD TL++S+ F++VD     I    E+   +    H        ++   RP +
Sbjct: 258 KKLLVLDLDETLVHSS-FNKVDNADMIIPLSIEDPVSKATISH-------QVYVYKRPYV 309

Query: 741 WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 777
             FLE  +K +E+ ++T   ++Y   + + LDP G+ 
Sbjct: 310 DEFLETMAKYYELAIFTASLRVYCDAVMEKLDPNGLC 346


>gi|374105582|gb|AEY94493.1| FAAL158Wp [Ashbya gossypii FDAG1]
          Length = 478

 Score = 43.1 bits (100), Expect = 0.73,   Method: Compositional matrix adjust.
 Identities = 33/105 (31%), Positives = 50/105 (47%), Gaps = 25/105 (23%)

Query: 673 QKKMFSARKLCLVLDLDHTLLNSA--KFHEVD---PVHDEILRKKEEQDREKPHRHLFRF 727
           Q+  F  RK CLVLDLD TL++S+    H  D   PV         E D +  + ++ + 
Sbjct: 301 QRPEFRGRK-CLVLDLDETLVHSSFKYLHTADFVIPV---------EIDNQVHNVYVIK- 349

Query: 728 PHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLD 772
                    RPG+  FL+R  +LFE+ ++T     Y   +  +LD
Sbjct: 350 ---------RPGVDEFLKRVGELFEVVVFTASVSRYGDPLLDILD 385


>gi|145513909|ref|XP_001442865.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124410226|emb|CAK75468.1| unnamed protein product [Paramecium tetraurelia]
          Length = 392

 Score = 43.1 bits (100), Expect = 0.74,   Method: Compositional matrix adjust.
 Identities = 31/133 (23%), Positives = 62/133 (46%), Gaps = 28/133 (21%)

Query: 684 LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTF 743
           L+LDLD TL           +H  I  + ++++R              +   +RP    F
Sbjct: 221 LILDLDETL-----------IHITITLQDDDEER------------FDLCFNVRPFCNEF 257

Query: 744 LERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDG----DERVP 799
           L+  SK + +HL+T  ++LYA  +   LDPK   +   ++ R +  +  +G    D R+ 
Sbjct: 258 LKEMSKYYNIHLFTASSELYANAIVNHLDPKRQ-YINEILCRNNCFETKNGFFIKDLRII 316

Query: 800 KSKDLEGVLGMES 812
            ++ L+ ++ +++
Sbjct: 317 TNRTLKDIVIVDN 329


>gi|145534239|ref|XP_001452864.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124420563|emb|CAK85467.1| unnamed protein product [Paramecium tetraurelia]
          Length = 414

 Score = 43.1 bits (100), Expect = 0.76,   Method: Compositional matrix adjust.
 Identities = 34/135 (25%), Positives = 64/135 (47%), Gaps = 16/135 (11%)

Query: 684 LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTF 743
           + LDLD TL+++    E   V    L+++ E   E           M +   +RP    F
Sbjct: 231 IFLDLDETLIHACHARETPSVK---LKQQNEDGSETDS--------MQVGINVRPYTGYF 279

Query: 744 LERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDG----DERVP 799
           L+  ++ + +++YT  ++ YA  +   LDP     +G ++SR +  +  +G    D R+ 
Sbjct: 280 LQELAQYYTIYIYTASSQQYAQTIVNYLDPLKQYISG-ILSRSNCMETKNGFFIKDLRII 338

Query: 800 KSKDLEGVLGMESAV 814
           K  DL+  L +++ V
Sbjct: 339 KDLDLDRTLIVDNLV 353


>gi|291392229|ref|XP_002712521.1| PREDICTED: CTD (carboxy-terminal domain, RNA polymerase II,
           polypeptide A) small phosphatase 1 [Oryctolagus
           cuniculus]
          Length = 260

 Score = 43.1 bits (100), Expect = 0.76,   Method: Compositional matrix adjust.
 Identities = 35/123 (28%), Positives = 56/123 (45%), Gaps = 14/123 (11%)

Query: 657 QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 716
           ++  AI K   + L  + K   + K+C+V+DLD TL++S+      PV +       E D
Sbjct: 65  EENGAIPKTPVQYLLPEAKAQDSDKICVVIDLDETLVHSS----FKPVSNADFIIPVEID 120

Query: 717 REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 776
                 ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  G 
Sbjct: 121 GVVHQVYVLKRPHVD----------EFLQRMGELFECVLFTASLAKYADPVADLLDKWGA 170

Query: 777 LFA 779
             A
Sbjct: 171 FRA 173


>gi|145544070|ref|XP_001457720.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124425538|emb|CAK90323.1| unnamed protein product [Paramecium tetraurelia]
          Length = 659

 Score = 43.1 bits (100), Expect = 0.77,   Method: Compositional matrix adjust.
 Identities = 60/249 (24%), Positives = 89/249 (35%), Gaps = 44/249 (17%)

Query: 621 DDKQTGTGSGPEAGPVGAHPQSAWGDVEHLFEGYDDQQKAAIQKERTRRLEEQKKMFSAR 680
           D  QT      E        +    +VE L   Y D     I KE          +   R
Sbjct: 193 DSSQTCNHLKIENNYCLICNEKVIRNVESLDLNYSDDISKKISKEIVL------DILKKR 246

Query: 681 KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMG--------- 731
           KL +VLDLD T+L++ K       + E   K+ +  +         F  +G         
Sbjct: 247 KLIMVLDLDQTILHAIKVSTTFNKY-EFCEKQNKMIQADSEAQFNGFQQLGFNIKEHLLD 305

Query: 732 --------MWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA---- 779
                      KLRP    F      LF++ +YT  +K YA  +   +  +   F     
Sbjct: 306 MTCDQQSKFIIKLRPYFEQFFLTLIPLFDIFIYTKASKSYADFILSFITHRLNEFIPEHK 365

Query: 780 -----GRVISRGDDGDPFDGDERVPKSKDLEGVL--GMES-AVVIIDDSVRVWPHNKLNL 831
                 RV+SR D             SK L  +   G+ +  +VI+DD+  +W   K NL
Sbjct: 366 PFFPPQRVLSREDTI--------CSNSKSLNRLFYPGIATNLLVILDDNAGMWNQFKENL 417

Query: 832 IVVERYTYF 840
           I  + + YF
Sbjct: 418 IHTKPFVYF 426


>gi|390476118|ref|XP_003735075.1| PREDICTED: protein ECT2 isoform 2 [Callithrix jacchus]
          Length = 914

 Score = 43.1 bits (100), Expect = 0.77,   Method: Compositional matrix adjust.
 Identities = 21/59 (35%), Positives = 31/59 (52%), Gaps = 3/59 (5%)

Query: 934 QFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFA 992
             G V  K  + +VTH+VAN    +K   A+S G  ++ P W+  +   + R NEQDF 
Sbjct: 201 HMGGVIRKDFNSKVTHLVANCTQGEKFRVAVSLGTPIMKPEWIYKA---WERRNEQDFC 256


>gi|145527362|ref|XP_001449481.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124417069|emb|CAK82084.1| unnamed protein product [Paramecium tetraurelia]
          Length = 249

 Score = 43.1 bits (100), Expect = 0.77,   Method: Composition-based stats.
 Identities = 51/184 (27%), Positives = 76/184 (41%), Gaps = 32/184 (17%)

Query: 654 YDDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKE 713
           +DD+ K  I  ++T +           +  LVLDLD TL++S        + +EIL K  
Sbjct: 55  FDDECKDKITAKKTEK-----------EFTLVLDLDETLIHS-DMERTSFLDEEILVKIG 102

Query: 714 EQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDP 773
               +              + K+RP    FL+  S  FE+ ++T   K YA ++   LDP
Sbjct: 103 NTIEK-------------YYVKIRPFARDFLKALSNYFELVIFTAAIKEYADKVIDYLDP 149

Query: 774 KGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIV 833
            G  F  R   R D     DG       KDL  V        IID+S+     N  N I+
Sbjct: 150 SG--FIKRRFYR-DSCTKKDG----VFYKDLTKVNSNLDKTFIIDNSLSGMSLNPQNGIL 202

Query: 834 VERY 837
           ++ +
Sbjct: 203 IKSW 206


>gi|365759524|gb|EHN01307.1| Psr2p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
          Length = 338

 Score = 43.1 bits (100), Expect = 0.79,   Method: Compositional matrix adjust.
 Identities = 28/99 (28%), Positives = 48/99 (48%), Gaps = 16/99 (16%)

Query: 680 RKLCLVLDLDHTLLNSA-KFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRP 738
            K CL+LDLD TL++S+ K+ +      + +   E  D+              ++   RP
Sbjct: 167 HKKCLILDLDETLVHSSFKYMQTA----DFVLPVEIDDQVH-----------NVYVIKRP 211

Query: 739 GIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 777
           G+  FL R S+++E+ ++T     YA  +   LDP G +
Sbjct: 212 GVDEFLHRVSQVYEVVVFTASVSRYANPLLDTLDPNGTI 250


>gi|403265922|ref|XP_003925160.1| PREDICTED: protein ECT2 isoform 2 [Saimiri boliviensis boliviensis]
          Length = 914

 Score = 43.1 bits (100), Expect = 0.80,   Method: Compositional matrix adjust.
 Identities = 21/59 (35%), Positives = 31/59 (52%), Gaps = 3/59 (5%)

Query: 934 QFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFA 992
             G V  K  + +VTH+VAN    +K   A+S G  ++ P W+  +   + R NEQDF 
Sbjct: 201 HMGGVIRKDFNSKVTHLVANCTQGEKFRVAVSLGTPIMKPEWIYKA---WERRNEQDFC 256


>gi|194222593|ref|XP_001494218.2| PREDICTED: LOW QUALITY PROTEIN: protein ECT2 isoform 1 [Equus
           caballus]
          Length = 883

 Score = 43.1 bits (100), Expect = 0.80,   Method: Compositional matrix adjust.
 Identities = 21/59 (35%), Positives = 31/59 (52%), Gaps = 3/59 (5%)

Query: 934 QFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFA 992
             G V  K  + +VTH+VAN    +K   A+S G  ++ P W+  +   + R NEQDF 
Sbjct: 170 HMGGVIRKDCNSKVTHLVANCTQGEKFRIAVSLGTPIMKPEWIYKA---WERRNEQDFC 225


>gi|449275333|gb|EMC84205.1| Carboxy-terminal domain RNA polymerase II polypeptide A small
           phosphatase 1, partial [Columba livia]
          Length = 230

 Score = 43.1 bits (100), Expect = 0.81,   Method: Compositional matrix adjust.
 Identities = 37/127 (29%), Positives = 58/127 (45%), Gaps = 15/127 (11%)

Query: 657 QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 716
           ++  A+ K   R L  + K   A  LC+V+DLD TL++S+      PV++       E D
Sbjct: 35  EENGALPKAAVRHLLPEIKPQDASNLCVVIDLDETLVHSS----FKPVNNADFIIPVEID 90

Query: 717 REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 776
                 ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  G 
Sbjct: 91  GIMHQVYVLKRPHVD----------EFLQRMGELFECVLFTASLAKYADPVADLLDKWGA 140

Query: 777 LFAGRVI 783
            F  R+ 
Sbjct: 141 -FRARLF 146


>gi|441633012|ref|XP_004089721.1| PREDICTED: protein ECT2 [Nomascus leucogenys]
          Length = 914

 Score = 43.1 bits (100), Expect = 0.81,   Method: Compositional matrix adjust.
 Identities = 21/59 (35%), Positives = 31/59 (52%), Gaps = 3/59 (5%)

Query: 934 QFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFA 992
             G V  K  + +VTH+VAN    +K   A+S G  ++ P W+  +   + R NEQDF 
Sbjct: 201 HMGGVIRKDFNSKVTHLVANCTQGEKFRVAVSLGTPIMKPEWIYKA---WERRNEQDFC 256


>gi|146185627|ref|XP_001032201.2| NLI interacting factor-like phosphatase family protein [Tetrahymena
           thermophila]
 gi|146142847|gb|EAR84538.2| NLI interacting factor-like phosphatase family protein [Tetrahymena
           thermophila SB210]
          Length = 446

 Score = 43.1 bits (100), Expect = 0.82,   Method: Compositional matrix adjust.
 Identities = 26/107 (24%), Positives = 59/107 (55%), Gaps = 14/107 (13%)

Query: 706 DEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYAT 765
           D I++ K + D +  ++       +G+  ++RP    FL++ ++ ++++++T  +  YA+
Sbjct: 282 DHIIKAKADNDDKVGYQ-------IGL--RVRPYCLEFLQKLAQYWDIYIFTASSPTYAS 332

Query: 766 EMAKVLDPKGVLFAGRVISRGDDGDPFDG----DERVPKSKDLEGVL 808
            + K LDP+G    G +++R +  +  +G    D R+ K KDL+  +
Sbjct: 333 AIVKFLDPEGKYING-ILNRSNCMETKNGFFIKDLRIVKGKDLKKTV 378


>gi|402860934|ref|XP_003894870.1| PREDICTED: protein ECT2 isoform 2 [Papio anubis]
          Length = 917

 Score = 43.1 bits (100), Expect = 0.83,   Method: Compositional matrix adjust.
 Identities = 21/59 (35%), Positives = 31/59 (52%), Gaps = 3/59 (5%)

Query: 934 QFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFA 992
             G V  K  + +VTH+VAN    +K   A+S G  ++ P W+  +   + R NEQDF 
Sbjct: 201 HMGGVIRKDFNSKVTHLVANCTQGEKFRVAVSLGTPIMKPEWIYKA---WERRNEQDFC 256


>gi|357610246|gb|EHJ66893.1| hypothetical protein KGM_16951 [Danaus plexippus]
          Length = 673

 Score = 43.1 bits (100), Expect = 0.83,   Method: Compositional matrix adjust.
 Identities = 29/97 (29%), Positives = 44/97 (45%), Gaps = 28/97 (28%)

Query: 684 LVLDLDHTLLNS-------AKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKL 736
           LVLDLD TL++        A FH   PV                   LF+     ++ + 
Sbjct: 498 LVLDLDETLVHCSLQELPDASFHF--PV-------------------LFQDCRYTVFVRT 536

Query: 737 RPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDP 773
           RP    FL + S+L+E+ L+T   ++YA  +  +LDP
Sbjct: 537 RPHFAEFLSKVSRLYEVILFTASKRVYADRLLNLLDP 573


>gi|300794122|ref|NP_001179369.1| carboxy-terminal domain RNA polymerase II polypeptide A small
           phosphatase 1 [Bos taurus]
 gi|296490317|tpg|DAA32430.1| TPA: CTD (carboxy-terminal domain, RNA polymerase II, polypeptide
           A) small phosphatase 1-like [Bos taurus]
          Length = 260

 Score = 43.1 bits (100), Expect = 0.83,   Method: Compositional matrix adjust.
 Identities = 34/123 (27%), Positives = 57/123 (46%), Gaps = 14/123 (11%)

Query: 657 QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 716
           ++  A+ K   + L  + K   + K+C+V+DLD TL++S+      PV++       E D
Sbjct: 65  EENGAVPKTPVQYLLPEAKAQDSDKICVVIDLDETLVHSS----FKPVNNADFIIPVEID 120

Query: 717 REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 776
                 ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  G 
Sbjct: 121 GVVHQVYVLKRPHVD----------EFLQRMGELFECVLFTASLAKYADPVADLLDKWGA 170

Query: 777 LFA 779
             A
Sbjct: 171 FRA 173


>gi|390476116|ref|XP_002759424.2| PREDICTED: protein ECT2 isoform 1 [Callithrix jacchus]
          Length = 883

 Score = 43.1 bits (100), Expect = 0.86,   Method: Compositional matrix adjust.
 Identities = 21/57 (36%), Positives = 31/57 (54%), Gaps = 3/57 (5%)

Query: 936 GAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFA 992
           G V  K  + +VTH+VAN    +K   A+S G  ++ P W+  +   + R NEQDF 
Sbjct: 172 GGVIRKDFNSKVTHLVANCTQGEKFRVAVSLGTPIMKPEWIYKA---WERRNEQDFC 225


>gi|332214798|ref|XP_003256522.1| PREDICTED: protein ECT2 isoform 1 [Nomascus leucogenys]
 gi|441633016|ref|XP_003256523.2| PREDICTED: protein ECT2 isoform 2 [Nomascus leucogenys]
          Length = 883

 Score = 43.1 bits (100), Expect = 0.86,   Method: Compositional matrix adjust.
 Identities = 21/57 (36%), Positives = 31/57 (54%), Gaps = 3/57 (5%)

Query: 936 GAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFA 992
           G V  K  + +VTH+VAN    +K   A+S G  ++ P W+  +   + R NEQDF 
Sbjct: 172 GGVIRKDFNSKVTHLVANCTQGEKFRVAVSLGTPIMKPEWIYKA---WERRNEQDFC 225


>gi|403265920|ref|XP_003925159.1| PREDICTED: protein ECT2 isoform 1 [Saimiri boliviensis boliviensis]
          Length = 883

 Score = 42.7 bits (99), Expect = 0.89,   Method: Compositional matrix adjust.
 Identities = 21/57 (36%), Positives = 31/57 (54%), Gaps = 3/57 (5%)

Query: 936 GAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFA 992
           G V  K  + +VTH+VAN    +K   A+S G  ++ P W+  +   + R NEQDF 
Sbjct: 172 GGVIRKDFNSKVTHLVANCTQGEKFRVAVSLGTPIMKPEWIYKA---WERRNEQDFC 225


>gi|402860932|ref|XP_003894869.1| PREDICTED: protein ECT2 isoform 1 [Papio anubis]
 gi|402860936|ref|XP_003894871.1| PREDICTED: protein ECT2 isoform 3 [Papio anubis]
          Length = 886

 Score = 42.7 bits (99), Expect = 0.89,   Method: Compositional matrix adjust.
 Identities = 21/59 (35%), Positives = 31/59 (52%), Gaps = 3/59 (5%)

Query: 934 QFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFA 992
             G V  K  + +VTH+VAN    +K   A+S G  ++ P W+  +   + R NEQDF 
Sbjct: 170 HMGGVIRKDFNSKVTHLVANCTQGEKFRVAVSLGTPIMKPEWIYKA---WERRNEQDFC 225


>gi|365759502|gb|EHN01285.1| Psr1p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
          Length = 410

 Score = 42.7 bits (99), Expect = 0.89,   Method: Compositional matrix adjust.
 Identities = 34/104 (32%), Positives = 47/104 (45%), Gaps = 28/104 (26%)

Query: 681 KLCLVLDLDHTL-------LNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMW 733
           K CL+LDLD TL       L SA F  V PV         E D +  + ++ +       
Sbjct: 240 KKCLILDLDETLVHSSFKYLRSADF--VLPV---------EIDDQVHNVYVIK------- 281

Query: 734 TKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 777
              RPG+  FLER  KLFE+ ++T     Y   +  +LD   V+
Sbjct: 282 ---RPGVEEFLERVGKLFEVVVFTASVSRYGDPLLDILDTNKVI 322


>gi|253741456|gb|EES98325.1| Nuclear LIM interactor-interacting factor 1 [Giardia intestinalis
           ATCC 50581]
          Length = 429

 Score = 42.7 bits (99), Expect = 0.89,   Method: Compositional matrix adjust.
 Identities = 29/97 (29%), Positives = 49/97 (50%), Gaps = 8/97 (8%)

Query: 681 KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 740
           K  LVLDLD TL++S+ F++VD     I    E+   +    H        ++   RP +
Sbjct: 255 KKLLVLDLDETLVHSS-FNKVDNADMIIPLSIEDPVSKATISH-------QVYVYKRPYV 306

Query: 741 WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 777
             FLE  +K +E+ ++T   ++Y   + + LDP G+ 
Sbjct: 307 DEFLETMAKYYELAIFTASLQVYCDAVMEKLDPSGLC 343


>gi|383416341|gb|AFH31384.1| protein ECT2 [Macaca mulatta]
          Length = 883

 Score = 42.7 bits (99), Expect = 0.89,   Method: Compositional matrix adjust.
 Identities = 21/57 (36%), Positives = 31/57 (54%), Gaps = 3/57 (5%)

Query: 936 GAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFA 992
           G V  K  + +VTH+VAN    +K   A+S G  ++ P W+  +   + R NEQDF 
Sbjct: 172 GGVIRKDFNSKVTHLVANCTQGEKFRVAVSLGTPIMKPEWIYKA---WERRNEQDFC 225


>gi|380815184|gb|AFE79466.1| carboxy-terminal domain RNA polymerase II polypeptide A small
           phosphatase 1 isoform 2 [Macaca mulatta]
 gi|383420375|gb|AFH33401.1| carboxy-terminal domain RNA polymerase II polypeptide A small
           phosphatase 1 isoform 2 [Macaca mulatta]
 gi|384948522|gb|AFI37866.1| carboxy-terminal domain RNA polymerase II polypeptide A small
           phosphatase 1 isoform 2 [Macaca mulatta]
          Length = 260

 Score = 42.7 bits (99), Expect = 0.90,   Method: Compositional matrix adjust.
 Identities = 35/123 (28%), Positives = 56/123 (45%), Gaps = 14/123 (11%)

Query: 657 QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 716
           ++  AI K   + L    K   + K+C+V+DLD TL++S+      PV++       E D
Sbjct: 65  EENGAIPKTPVQYLLPAAKAQDSDKICVVIDLDETLVHSS----FKPVNNADFIIPVEID 120

Query: 717 REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 776
                 ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  G 
Sbjct: 121 GVVHQVYVLKRPHVD----------EFLQRMGELFECVLFTASLAKYADPVADLLDKWGA 170

Query: 777 LFA 779
             A
Sbjct: 171 FRA 173


>gi|345788882|ref|XP_851254.2| PREDICTED: CTD (carboxy-terminal domain, RNA polymerase II,
           polypeptide A) small phosphatase-like [Canis lupus
           familiaris]
          Length = 328

 Score = 42.7 bits (99), Expect = 0.90,   Method: Compositional matrix adjust.
 Identities = 48/172 (27%), Positives = 72/172 (41%), Gaps = 33/172 (19%)

Query: 655 DDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEE 714
           D +Q   I     + L  +  +    K C+V+DLD TL++S+      P+ +       E
Sbjct: 132 DQRQVIPIPSPPAKYLLPEVTVLDYGKKCVVIDLDETLVHSS----FKPISNADFIVPVE 187

Query: 715 QDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 774
            D      ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  
Sbjct: 188 IDGTIHQVYVLKRPHVD----------EFLQRMGQLFECVLFTASLAKYADPVADLLDRW 237

Query: 775 GVLFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 820
           GV  A       V  RG+              KDL   LG E S V+I+D+S
Sbjct: 238 GVFRARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIVDNS 276


>gi|403333806|gb|EJY66027.1| hypothetical protein OXYTRI_13811 [Oxytricha trifallax]
          Length = 509

 Score = 42.7 bits (99), Expect = 0.92,   Method: Compositional matrix adjust.
 Identities = 39/168 (23%), Positives = 76/168 (45%), Gaps = 17/168 (10%)

Query: 681 KLCLVLDLDHTLLNSAKFHEVDPVH--DEILRKKEEQDREKP----HRHLFRFPHMGMWT 734
           K  LVLD+D TL++ +    ++P +   E++     QD  KP       ++    + ++ 
Sbjct: 297 KKTLVLDMDETLIHCS----LEPFYGYQEVIHVM--QDTYKPISPDSDLIYSQKSLQIYV 350

Query: 735 KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV-----ISRGDDG 789
             RP +  FLE+ S  +E+ ++T  +K YA  +   +DP    F+ R+     +    + 
Sbjct: 351 AYRPYLIHFLEKVSSQYEVVVFTASDKSYADVILDKIDPYHKYFSYRLYRDSCLQVNINA 410

Query: 790 DPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERY 837
                 +     KDL  +    S  +I+D+S++ + +   N I +  Y
Sbjct: 411 KNSSSQQTTLFVKDLSALGRDLSQTIIVDNSIQAFGYQLSNGIPIPSY 458


>gi|403336757|gb|EJY67573.1| NLI interacting factor-like phosphatase family protein [Oxytricha
           trifallax]
          Length = 515

 Score = 42.7 bits (99), Expect = 0.95,   Method: Compositional matrix adjust.
 Identities = 24/82 (29%), Positives = 43/82 (52%), Gaps = 3/82 (3%)

Query: 736 LRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISR---GDDGDPF 792
           +RP     LE  +K FE+ ++T  +K YA  +   +DP G L   R+          + +
Sbjct: 249 IRPYTQECLEFVNKYFEVVVFTASHKFYADVILDYIDPTGTLIQHRLYREHCIKTQDNVY 308

Query: 793 DGDERVPKSKDLEGVLGMESAV 814
             D RV K++DL+ ++ +++AV
Sbjct: 309 IKDLRVFKNRDLKDLIIVDNAV 330


>gi|391344643|ref|XP_003746605.1| PREDICTED: phosphatase PSR1-like [Metaseiulus occidentalis]
          Length = 239

 Score = 42.7 bits (99), Expect = 0.97,   Method: Composition-based stats.
 Identities = 31/96 (32%), Positives = 48/96 (50%), Gaps = 12/96 (12%)

Query: 679 ARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRP 738
           A+K+ LVLDLD TL++    + + P +D     K  Q +            M ++  +RP
Sbjct: 45  AKKILLVLDLDETLIHGT--YCMPPKYDFRFELKLPQSKRV----------MNVYVLVRP 92

Query: 739 GIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 774
            +  FLE A K FE+  YT    +YA ++   +DPK
Sbjct: 93  YLQDFLEFAHKWFEVMAYTASLPIYADKILDEIDPK 128


>gi|302792499|ref|XP_002978015.1| hypothetical protein SELMODRAFT_108059 [Selaginella moellendorffii]
 gi|300154036|gb|EFJ20672.1| hypothetical protein SELMODRAFT_108059 [Selaginella moellendorffii]
          Length = 131

 Score = 42.7 bits (99), Expect = 0.97,   Method: Composition-based stats.
 Identities = 39/127 (30%), Positives = 61/127 (48%), Gaps = 11/127 (8%)

Query: 737 RPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDE 796
           RP +  FLER +KLF++  +T   +  A  +  VLDP    F+ R+          D  +
Sbjct: 5   RPHLGKFLERMAKLFDVVAFTSRAQRRAETILDVLDPAKEFFSRRLY--------LDSCK 56

Query: 797 RVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLG--PSL 854
           +  K KDL  +    + V+I+DD+      N  NL++V R+     S  + GLL   P L
Sbjct: 57  KGGKVKDLAVLERPLNRVIIVDDTSSKCVLNPDNLVLVSRFKRHNLSTDR-GLLDLIPFL 115

Query: 855 LEIDHDE 861
            E+  +E
Sbjct: 116 EELSKEE 122


>gi|344238378|gb|EGV94481.1| CTD small phosphatase-like protein [Cricetulus griseus]
          Length = 239

 Score = 42.7 bits (99), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 47/170 (27%), Positives = 74/170 (43%), Gaps = 33/170 (19%)

Query: 657 QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 716
           ++ + +QK   + L  +  +    K C+V+DLD TL++S+      P+ +       E D
Sbjct: 45  EENSGLQKPPAKSLLPEVTVLDYGKKCVVIDLDETLVHSS----FKPISNADFIVPVEID 100

Query: 717 REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 776
                 ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  GV
Sbjct: 101 GTIHQVYVLKRPHVD----------EFLQRMGQLFECVLFTASLAKYADPVADLLDRWGV 150

Query: 777 LFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 820
             A       V  RG+              KDL   LG E S V+I+D+S
Sbjct: 151 FRARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIVDNS 187


>gi|361067247|gb|AEW07935.1| Pinus taeda anonymous locus 0_14860_01 genomic sequence
 gi|383149610|gb|AFG56720.1| Pinus taeda anonymous locus 0_14860_01 genomic sequence
 gi|383149612|gb|AFG56721.1| Pinus taeda anonymous locus 0_14860_01 genomic sequence
 gi|383149614|gb|AFG56722.1| Pinus taeda anonymous locus 0_14860_01 genomic sequence
 gi|383149616|gb|AFG56723.1| Pinus taeda anonymous locus 0_14860_01 genomic sequence
 gi|383149618|gb|AFG56724.1| Pinus taeda anonymous locus 0_14860_01 genomic sequence
 gi|383149620|gb|AFG56725.1| Pinus taeda anonymous locus 0_14860_01 genomic sequence
 gi|383149622|gb|AFG56726.1| Pinus taeda anonymous locus 0_14860_01 genomic sequence
 gi|383149624|gb|AFG56727.1| Pinus taeda anonymous locus 0_14860_01 genomic sequence
 gi|383149628|gb|AFG56729.1| Pinus taeda anonymous locus 0_14860_01 genomic sequence
 gi|383149630|gb|AFG56730.1| Pinus taeda anonymous locus 0_14860_01 genomic sequence
 gi|383149632|gb|AFG56731.1| Pinus taeda anonymous locus 0_14860_01 genomic sequence
 gi|383149634|gb|AFG56732.1| Pinus taeda anonymous locus 0_14860_01 genomic sequence
 gi|383149636|gb|AFG56733.1| Pinus taeda anonymous locus 0_14860_01 genomic sequence
 gi|383149638|gb|AFG56734.1| Pinus taeda anonymous locus 0_14860_01 genomic sequence
          Length = 140

 Score = 42.7 bits (99), Expect = 1.1,   Method: Composition-based stats.
 Identities = 34/117 (29%), Positives = 56/117 (47%), Gaps = 14/117 (11%)

Query: 725 FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVIS 784
           F      ++ + RP +  F++R + +FE+ ++T    +YA ++  VLDPK  L   RV  
Sbjct: 18  FNLKEHTVYVRCRPHLQLFMDRVADMFEIIVFTASQSVYAEQLLNVLDPKRKLIRHRVYR 77

Query: 785 RG---DDGDPFDGDERVPKSKDLEGVLGMESA-VVIIDDSVRVWPHNKLNLIVVERY 837
                 +G+           KDL  VLG + A V IID+S + +     N I +E +
Sbjct: 78  ESCVFVEGNYL---------KDLT-VLGRDLAQVAIIDNSPQAFGFQVDNGIPIESW 124


>gi|355750837|gb|EHH55164.1| hypothetical protein EGM_04316, partial [Macaca fascicularis]
          Length = 237

 Score = 42.7 bits (99), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 35/123 (28%), Positives = 56/123 (45%), Gaps = 14/123 (11%)

Query: 657 QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 716
           ++  AI K   + L    K   + K+C+V+DLD TL++S+      PV++       E D
Sbjct: 42  EENGAIPKTPVQYLLPAAKAQDSDKICVVIDLDETLVHSS----FKPVNNADFIIPVEID 97

Query: 717 REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 776
                 ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  G 
Sbjct: 98  GVVHQVYVLKRPHVD----------EFLQRMGELFECVLFTASLAKYADPVADLLDKWGA 147

Query: 777 LFA 779
             A
Sbjct: 148 FRA 150


>gi|308163009|gb|EFO65375.1| Nuclear LIM interactor-interacting factor 1 [Giardia lamblia P15]
          Length = 432

 Score = 42.7 bits (99), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 29/97 (29%), Positives = 49/97 (50%), Gaps = 8/97 (8%)

Query: 681 KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 740
           K  LVLDLD TL++S+ F++VD     I    E+   +    H        ++   RP +
Sbjct: 258 KKLLVLDLDETLVHSS-FNKVDNADMIIPLSIEDPVSKATISH-------QVYVYKRPYV 309

Query: 741 WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 777
             FLE  +K +E+ ++T   ++Y   + + LDP G+ 
Sbjct: 310 DEFLETMAKYYELAIFTASLQVYCDAVMEKLDPNGLC 346


>gi|387219521|gb|AFJ69469.1| rna polymerase ii ctd phosphatase, partial [Nannochloropsis
           gaditana CCMP526]
          Length = 268

 Score = 42.7 bits (99), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 31/98 (31%), Positives = 48/98 (48%), Gaps = 2/98 (2%)

Query: 892 DVRNILAAEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHIDDQVTHVV 951
           +V   L++ +R++LAG  I+FS V P    +P    L   A   GA   +     VTH+V
Sbjct: 80  NVAGCLSSVRRQVLAGVTILFSGVLPR-NVDPRRSDLGYMALSLGARIVEDFSPTVTHLV 138

Query: 952 ANSLGTDKVNWA-LSTGRFVVHPGWVEASALLYRRANE 988
           A +  T+KV  A    G ++V   W++ S     R +E
Sbjct: 139 AENASTEKVFRARRQGGMWIVARTWLQLSFFHCDRKDE 176


>gi|268565001|ref|XP_002639300.1| C. briggsae CBR-SCPL-1 protein [Caenorhabditis briggsae]
          Length = 484

 Score = 42.7 bits (99), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 33/105 (31%), Positives = 50/105 (47%), Gaps = 15/105 (14%)

Query: 679 ARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRP 738
           ++K CLV+DLD TL++S+      PV +       E D  +   ++ +          RP
Sbjct: 329 SKKKCLVIDLDETLVHSS----FKPVKNPDFVIPVEIDGVEHQVYVLK----------RP 374

Query: 739 GIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVI 783
            +  FL R  + FE  L+T     YA  +A +LD K V F GR+ 
Sbjct: 375 YVDEFLARVGEHFECILFTASLAKYADPVADLLDKKKV-FRGRLF 418


>gi|301617231|ref|XP_002938048.1| PREDICTED: CTD small phosphatase-like protein-like [Xenopus
           (Silurana) tropicalis]
          Length = 276

 Score = 42.7 bits (99), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 49/172 (28%), Positives = 73/172 (42%), Gaps = 33/172 (19%)

Query: 655 DDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEE 714
           D  Q   I     + L  + K+    K C+V+DLD TL++S+      P+++       E
Sbjct: 80  DQTQALTIPSPPAKYLLPELKVSDYGKKCVVIDLDETLVHSS----FKPINNADFIVPVE 135

Query: 715 QDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 774
            D      ++ + PH+            FL++  +LFE  L+T     YA  +A +LD  
Sbjct: 136 IDGTIHQVYVLKRPHVD----------EFLQKMGELFECVLFTASLAKYADPVADLLDRW 185

Query: 775 GVLFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 820
           GV  A       V  RG+              KDL   LG E S V+IID+S
Sbjct: 186 GVFNARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIIDNS 224


>gi|109044251|ref|XP_001083830.1| PREDICTED: protein ECT2 [Macaca mulatta]
          Length = 927

 Score = 42.7 bits (99), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 21/57 (36%), Positives = 31/57 (54%), Gaps = 3/57 (5%)

Query: 936 GAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFA 992
           G V  K  + +VTH+VAN    +K   A+S G  ++ P W+  +   + R NEQDF 
Sbjct: 172 GGVIRKDFNSKVTHLVANCTQGEKFRVAVSLGTPIMKPEWIYKA---WERRNEQDFC 225


>gi|355759094|gb|EHH61572.1| hypothetical protein EGM_19509 [Macaca fascicularis]
          Length = 927

 Score = 42.7 bits (99), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 21/57 (36%), Positives = 31/57 (54%), Gaps = 3/57 (5%)

Query: 936 GAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFA 992
           G V  K  + +VTH+VAN    +K   A+S G  ++ P W+  +   + R NEQDF 
Sbjct: 172 GGVIRKDFNSKVTHLVANCTQGEKFRVAVSLGTPIMKPEWIYKA---WERRNEQDFC 225


>gi|355559865|gb|EHH16593.1| hypothetical protein EGK_11894 [Macaca mulatta]
          Length = 927

 Score = 42.7 bits (99), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 21/57 (36%), Positives = 31/57 (54%), Gaps = 3/57 (5%)

Query: 936 GAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDFA 992
           G V  K  + +VTH+VAN    +K   A+S G  ++ P W+  +   + R NEQDF 
Sbjct: 172 GGVIRKDFNSKVTHLVANCTQGEKFRVAVSLGTPIMKPEWIYKA---WERRNEQDFC 225


>gi|385198079|ref|NP_001245244.1| protein ECT2 isoform a [Homo sapiens]
 gi|357529579|sp|Q9H8V3.4|ECT2_HUMAN RecName: Full=Protein ECT2; AltName: Full=Epithelial
           cell-transforming sequence 2 oncogene
 gi|111550251|gb|ABH10140.1| epithelial cell transforming sequence 2 oncogene protein splice
           variant b [Homo sapiens]
          Length = 914

 Score = 42.7 bits (99), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 21/58 (36%), Positives = 31/58 (53%), Gaps = 3/58 (5%)

Query: 934 QFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDF 991
             G V  K  + +VTH+VAN    +K   A+S G  ++ P W+  +   + R NEQDF
Sbjct: 201 HMGGVIRKDFNSKVTHLVANCTQGEKFRVAVSLGTPIMKPEWIYKA---WERRNEQDF 255


>gi|426342904|ref|XP_004038068.1| PREDICTED: protein ECT2 isoform 2 [Gorilla gorilla gorilla]
          Length = 914

 Score = 42.4 bits (98), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 21/56 (37%), Positives = 31/56 (55%), Gaps = 3/56 (5%)

Query: 936 GAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDF 991
           G V  K  + +VTH+VAN    +K   A+S G  ++ P W+  +   + R NEQDF
Sbjct: 203 GGVIRKDFNSKVTHLVANCTQGEKFRVAVSLGTPIMKPEWIYKA---WERRNEQDF 255


>gi|397523977|ref|XP_003831992.1| PREDICTED: protein ECT2 isoform 2 [Pan paniscus]
 gi|410037742|ref|XP_003950278.1| PREDICTED: protein ECT2 [Pan troglodytes]
 gi|410339813|gb|JAA38853.1| epithelial cell transforming sequence 2 oncogene [Pan troglodytes]
          Length = 914

 Score = 42.4 bits (98), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 21/58 (36%), Positives = 31/58 (53%), Gaps = 3/58 (5%)

Query: 934 QFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDF 991
             G V  K  + +VTH+VAN    +K   A+S G  ++ P W+  +   + R NEQDF
Sbjct: 201 HMGGVIRKDFNSKVTHLVANCTQGEKFRVAVSLGTPIMKPEWIYKA---WERRNEQDF 255


>gi|388852402|emb|CCF54017.1| related to DNA topoisomerase II binding protein [Ustilago hordei]
          Length = 1166

 Score = 42.4 bits (98), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 17/49 (34%), Positives = 29/49 (59%)

Query: 928 LWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWV 976
           L Q A + GA    ++ + VTH++A+  G++K  +AL  G  +V P W+
Sbjct: 105 LTQYARELGARVEGNLTEDVTHLIADRPGSEKYRYALELGMHIVSPNWI 153


>gi|300122949|emb|CBK23956.2| Tim50 [Blastocystis hominis]
          Length = 348

 Score = 42.4 bits (98), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 49/171 (28%), Positives = 74/171 (43%), Gaps = 35/171 (20%)

Query: 670 LEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPH 729
           L EQ      RK  LVLDLD TL++S  F   D     ++  + E D             
Sbjct: 163 LGEQSSANQGRKT-LVLDLDETLVHST-FQPTDDC-SYVIPVEIEGDL------------ 207

Query: 730 MGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV----LFAGRVISR 785
             ++  LRPG   FL R S+++E+ +YT    +YA  +   +DP  +    LF    +  
Sbjct: 208 YNVYVYLRPGTTEFLRRMSEIYEVVVYTASLPVYADPLLDKIDPNNLISARLFRDHCVQS 267

Query: 786 GDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDD---SVRVWPHNKLNLI 832
           G               KDL G+LG    +VV+ID+   S +  P+N +  +
Sbjct: 268 GG-----------ILVKDL-GLLGRSLDSVVMIDNSAVSFQFQPNNGIECV 306


>gi|426342902|ref|XP_004038067.1| PREDICTED: protein ECT2 isoform 1 [Gorilla gorilla gorilla]
 gi|426342906|ref|XP_004038069.1| PREDICTED: protein ECT2 isoform 3 [Gorilla gorilla gorilla]
          Length = 883

 Score = 42.4 bits (98), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 21/56 (37%), Positives = 31/56 (55%), Gaps = 3/56 (5%)

Query: 936 GAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDF 991
           G V  K  + +VTH+VAN    +K   A+S G  ++ P W+  +   + R NEQDF
Sbjct: 172 GGVIRKDFNSKVTHLVANCTQGEKFRVAVSLGTPIMKPEWIYKA---WERRNEQDF 224


>gi|324518550|gb|ADY47137.1| CTD small phosphatase-like protein 2 [Ascaris suum]
          Length = 248

 Score = 42.4 bits (98), Expect = 1.2,   Method: Composition-based stats.
 Identities = 46/168 (27%), Positives = 78/168 (46%), Gaps = 32/168 (19%)

Query: 678 SARKLCLVLDLDHTLLNSAKFHEVD-----PVHDEILRKKEEQDREKPHRHLFRFPHMGM 732
           S  +  LVLDLD TL++ +     D     PVH                   F+     +
Sbjct: 60  STPEFALVLDLDETLVHCSLTELPDASLTFPVH-------------------FQDNTYQV 100

Query: 733 WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPF 792
           + ++RP +  FLER S+ FE+ L+T   ++YA ++  +LDP   L   R+     +   F
Sbjct: 101 YVRVRPHLHEFLERLSQSFEIILFTASKRVYADKLLNLLDPGKRLIRHRLFR---EHCVF 157

Query: 793 DGDERVPKSKDLEGVLGME-SAVVIIDDSVRVWPHNKLNLIVVERYTY 839
                +   KDL  +LG + S  +IID+S++ + +   N I +E + +
Sbjct: 158 VYGNYI---KDLT-ILGRDLSKTIIIDNSLQSFAYQIDNGIPIESWFF 201


>gi|34978932|gb|AAQ83675.1| epithelial cell transforming 2 [Homo sapiens]
          Length = 882

 Score = 42.4 bits (98), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 21/56 (37%), Positives = 31/56 (55%), Gaps = 3/56 (5%)

Query: 936 GAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDF 991
           G V  K  + +VTH+VAN    +K   A+S G  ++ P W+  +   + R NEQDF
Sbjct: 171 GGVIRKDFNSKVTHLVANCTQGEKFRVAVSLGTPIMKPEWIYKA---WERRNEQDF 223


>gi|21735572|ref|NP_060568.3| protein ECT2 isoform b [Homo sapiens]
 gi|385198081|ref|NP_001245245.1| protein ECT2 isoform b [Homo sapiens]
 gi|85566758|gb|AAI12087.1| Epithelial cell transforming sequence 2 oncogene [Homo sapiens]
 gi|189069491|dbj|BAG37157.1| unnamed protein product [Homo sapiens]
 gi|261859358|dbj|BAI46201.1| Protein ECT2 [synthetic construct]
          Length = 883

 Score = 42.4 bits (98), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 21/56 (37%), Positives = 31/56 (55%), Gaps = 3/56 (5%)

Query: 936 GAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDF 991
           G V  K  + +VTH+VAN    +K   A+S G  ++ P W+  +   + R NEQDF
Sbjct: 172 GGVIRKDFNSKVTHLVANCTQGEKFRVAVSLGTPIMKPEWIYKA---WERRNEQDF 224


>gi|387539470|gb|AFJ70362.1| CTD small phosphatase-like protein isoform 1 [Macaca mulatta]
          Length = 276

 Score = 42.4 bits (98), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 48/172 (27%), Positives = 72/172 (41%), Gaps = 33/172 (19%)

Query: 655 DDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEE 714
           D +Q   I     + L  +  +    K C+V+DLD TL++S+      P+ +       E
Sbjct: 80  DQRQVIPIPSPPAKYLLPEVTVLDYGKKCVVIDLDETLVHSS----FKPISNADFIVPVE 135

Query: 715 QDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 774
            D      ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  
Sbjct: 136 IDGTIHQVYVLKRPHVD----------EFLQRMGQLFECVLFTASLAKYADPVADLLDRW 185

Query: 775 GVLFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 820
           GV  A       V  RG+              KDL   LG E S V+I+D+S
Sbjct: 186 GVFRARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIVDNS 224


>gi|302794308|ref|XP_002978918.1| hypothetical protein SELMODRAFT_418692 [Selaginella moellendorffii]
 gi|300153236|gb|EFJ19875.1| hypothetical protein SELMODRAFT_418692 [Selaginella moellendorffii]
          Length = 218

 Score = 42.4 bits (98), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 20/46 (43%), Positives = 29/46 (63%)

Query: 737 RPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 782
           RPG+  FL+  SK++E+ ++T   K YA  +   LDP G LFA R+
Sbjct: 81  RPGVDIFLDEMSKIYEIVVFTRAVKPYADRILDRLDPAGNLFAHRL 126


>gi|332818395|ref|XP_516880.3| PREDICTED: protein ECT2 isoform 2 [Pan troglodytes]
 gi|397523975|ref|XP_003831991.1| PREDICTED: protein ECT2 isoform 1 [Pan paniscus]
 gi|397523979|ref|XP_003831993.1| PREDICTED: protein ECT2 isoform 3 [Pan paniscus]
 gi|410037744|ref|XP_003310160.2| PREDICTED: protein ECT2 isoform 1 [Pan troglodytes]
 gi|410223152|gb|JAA08795.1| epithelial cell transforming sequence 2 oncogene [Pan troglodytes]
 gi|410258448|gb|JAA17191.1| epithelial cell transforming sequence 2 oncogene [Pan troglodytes]
 gi|410306608|gb|JAA31904.1| epithelial cell transforming sequence 2 oncogene [Pan troglodytes]
 gi|410339811|gb|JAA38852.1| epithelial cell transforming sequence 2 oncogene [Pan troglodytes]
          Length = 883

 Score = 42.4 bits (98), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 21/56 (37%), Positives = 31/56 (55%), Gaps = 3/56 (5%)

Query: 936 GAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDF 991
           G V  K  + +VTH+VAN    +K   A+S G  ++ P W+  +   + R NEQDF
Sbjct: 172 GGVIRKDFNSKVTHLVANCTQGEKFRVAVSLGTPIMKPEWIYKA---WERRNEQDF 224


>gi|297286147|ref|XP_001086442.2| PREDICTED: CTD small phosphatase-like protein-like [Macaca mulatta]
          Length = 260

 Score = 42.4 bits (98), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 47/170 (27%), Positives = 73/170 (42%), Gaps = 33/170 (19%)

Query: 657 QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 716
           ++   +QK   + L  +  +    K C+V+DLD TL++S+      P+ +       E D
Sbjct: 66  EENGGLQKPPAKYLLPEVTVLDYGKKCVVIDLDETLVHSS----FKPISNADFIVPVEID 121

Query: 717 REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 776
                 ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  GV
Sbjct: 122 GTIHQVYVLKRPHVD----------EFLQRMGQLFECVLFTASLAKYADPVADLLDRWGV 171

Query: 777 LFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 820
             A       V  RG+              KDL   LG E S V+I+D+S
Sbjct: 172 FRARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIVDNS 208


>gi|302850140|ref|XP_002956598.1| hypothetical protein VOLCADRAFT_36131 [Volvox carteri f.
           nagariensis]
 gi|300258125|gb|EFJ42365.1| hypothetical protein VOLCADRAFT_36131 [Volvox carteri f.
           nagariensis]
          Length = 119

 Score = 42.4 bits (98), Expect = 1.3,   Method: Composition-based stats.
 Identities = 21/56 (37%), Positives = 31/56 (55%)

Query: 731 GMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRG 786
           G++   RPG+  FLE  +   E+ ++T G + YA  +   LDP G LFA R+   G
Sbjct: 5   GVFVVERPGLQEFLEELASFAEVVIFTAGLEDYAKPIIDALDPSGKLFAHRIYREG 60


>gi|335298855|ref|XP_003358412.1| PREDICTED: CTD small phosphatase-like protein-like [Sus scrofa]
          Length = 256

 Score = 42.4 bits (98), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 48/172 (27%), Positives = 72/172 (41%), Gaps = 33/172 (19%)

Query: 655 DDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEE 714
           D +Q   I     + L  +  +    K C+V+DLD TL++S+      P+ +       E
Sbjct: 60  DQRQVIPIPSPPAKYLLPEVTVLDYGKKCVVIDLDETLVHSS----FKPISNADFIVPVE 115

Query: 715 QDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 774
            D      ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  
Sbjct: 116 IDGTIHQVYVLKRPHVD----------EFLQRMGQLFECVLFTASLAKYADPVADLLDRW 165

Query: 775 GVLFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 820
           GV  A       V  RG+              KDL   LG E S V+I+D+S
Sbjct: 166 GVFRARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIVDNS 204


>gi|334347358|ref|XP_003341918.1| PREDICTED: protein ECT2 isoform 2 [Monodelphis domestica]
          Length = 912

 Score = 42.4 bits (98), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 23/68 (33%), Positives = 33/68 (48%), Gaps = 3/68 (4%)

Query: 925 LHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYR 984
           L  L   A   G    K  + +VTH+VAN    +K   A+S G  ++ P W+  +   + 
Sbjct: 189 LVKLVTLAHHMGGTIRKDFNSKVTHLVANCTQGEKFRVAVSLGTPIMKPEWIFKA---WE 245

Query: 985 RANEQDFA 992
           R NEQDF 
Sbjct: 246 RRNEQDFC 253


>gi|156848006|ref|XP_001646886.1| hypothetical protein Kpol_2002p100 [Vanderwaltozyma polyspora DSM
           70294]
 gi|156117567|gb|EDO19028.1| hypothetical protein Kpol_2002p100 [Vanderwaltozyma polyspora DSM
           70294]
          Length = 477

 Score = 42.4 bits (98), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 29/103 (28%), Positives = 51/103 (49%), Gaps = 15/103 (14%)

Query: 670 LEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPH 729
           L ++  +F  +K CLVLDLD TL++S+ F  +D     +    ++Q  +           
Sbjct: 297 LAKKDPVFKNKK-CLVLDLDETLVHSS-FKYIDTADFVLPVTIDDQTHQ----------- 343

Query: 730 MGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLD 772
             ++   RPG+  FL+R  K+FE+ ++T     Y   +  +LD
Sbjct: 344 --VYVIKRPGVDEFLKRVGKIFEVVVFTASVSRYGDPLLDILD 384


>gi|10435126|dbj|BAB14498.1| unnamed protein product [Homo sapiens]
          Length = 798

 Score = 42.4 bits (98), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 21/56 (37%), Positives = 31/56 (55%), Gaps = 3/56 (5%)

Query: 936 GAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDF 991
           G V  K  + +VTH+VAN    +K   A+S G  ++ P W+  +   + R NEQDF
Sbjct: 172 GGVIRKDFNSKVTHLVANCTQGEKFRVAVSLGTPIMKPEWIYKA---WERRNEQDF 224


>gi|324525869|gb|ADY48608.1| Serine/threonine-protein phosphatase dullard, partial [Ascaris
           suum]
          Length = 257

 Score = 42.4 bits (98), Expect = 1.4,   Method: Composition-based stats.
 Identities = 46/161 (28%), Positives = 72/161 (44%), Gaps = 31/161 (19%)

Query: 680 RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD-------REKPHRHLFRFPHMGM 732
           R+  LVLDLD TL++S         HD I+R   +         R    RH  RF     
Sbjct: 73  RRKILVLDLDETLIHSH--------HDGIIRPMVKPGTPPDFILRVNIDRHPVRF----- 119

Query: 733 WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDP-KGVLFAGRVISRGDDGDP 791
           +   RP +  FL   S+ F++ ++T   ++Y + +A  LD  KG+L   R   R      
Sbjct: 120 FVHCRPHVDYFLSMVSQWFDLVVFTASMEIYGSSVADKLDNGKGIL--QRRYFRQHCTMD 177

Query: 792 FDGDERVPKSKDLEGVLGMESAVVIIDDS---VRVWPHNKL 829
           + G      +KDL  +    S++ I+D+S    R +P N +
Sbjct: 178 YGG-----YTKDLSAIHADLSSIFILDNSPGAYRKFPQNAI 213


>gi|302766621|ref|XP_002966731.1| hypothetical protein SELMODRAFT_85337 [Selaginella moellendorffii]
 gi|300166151|gb|EFJ32758.1| hypothetical protein SELMODRAFT_85337 [Selaginella moellendorffii]
          Length = 131

 Score = 42.4 bits (98), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 31/101 (30%), Positives = 50/101 (49%), Gaps = 8/101 (7%)

Query: 737 RPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDE 796
           RP +  FLER +KLF++  +T   +  A  +  VLDP    F+ R+          D  +
Sbjct: 5   RPHLSKFLERMAKLFDVVAFTSRAQRRAETILDVLDPAKEFFSRRLY--------LDSCK 56

Query: 797 RVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERY 837
           +  K KDL  +    + V+I+DD+      N  NL++V R+
Sbjct: 57  KGGKVKDLAVLERPLNRVIIVDDTSSKCVLNPDNLVLVSRF 97


>gi|302808549|ref|XP_002985969.1| hypothetical protein SELMODRAFT_122967 [Selaginella moellendorffii]
 gi|300146476|gb|EFJ13146.1| hypothetical protein SELMODRAFT_122967 [Selaginella moellendorffii]
          Length = 198

 Score = 42.4 bits (98), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 33/102 (32%), Positives = 50/102 (49%), Gaps = 5/102 (4%)

Query: 737 RPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFD-GD 795
           RPG+ TFL   S+++E+ ++T   K YA  +   LDP G LF  R+    D   P + G 
Sbjct: 65  RPGVDTFLNEMSEIYEIVVFTRAVKPYADRILDRLDPAGNLFTHRLYR--DSCSPKEVGG 122

Query: 796 ERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERY 837
            +V   KDL  +       VI+DD +  +     N IV+  +
Sbjct: 123 RKV--VKDLSRLGRDLRHTVIVDDKLESFCLQPSNGIVIRAF 162


>gi|383149626|gb|AFG56728.1| Pinus taeda anonymous locus 0_14860_01 genomic sequence
          Length = 140

 Score = 42.4 bits (98), Expect = 1.4,   Method: Composition-based stats.
 Identities = 18/58 (31%), Positives = 32/58 (55%)

Query: 725 FRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 782
           F      ++ + RP +  F++R + +FE+ ++T    +YA ++  VLDPK  L   RV
Sbjct: 18  FNLKEHTVYVRCRPHLQLFMDRVADMFEIIVFTASQSVYAEQLLNVLDPKRKLIRHRV 75


>gi|355746816|gb|EHH51430.1| hypothetical protein EGM_10796, partial [Macaca fascicularis]
          Length = 270

 Score = 42.4 bits (98), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 48/172 (27%), Positives = 72/172 (41%), Gaps = 33/172 (19%)

Query: 655 DDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEE 714
           D +Q   I     + L  +  +    K C+V+DLD TL++S+      P+ +       E
Sbjct: 74  DQRQVIPIPSPPAKYLLPEVTVLDYGKKCVVIDLDETLVHSS----FKPISNADFIVPVE 129

Query: 715 QDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 774
            D      ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  
Sbjct: 130 IDGTIHQVYVLKRPHVD----------EFLQRMGQLFECVLFTASLAKYADPVADLLDRW 179

Query: 775 GVLFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 820
           GV  A       V  RG+              KDL   LG E S V+I+D+S
Sbjct: 180 GVFRARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIVDNS 218


>gi|164658690|ref|XP_001730470.1| hypothetical protein MGL_2266 [Malassezia globosa CBS 7966]
 gi|159104366|gb|EDP43256.1| hypothetical protein MGL_2266 [Malassezia globosa CBS 7966]
          Length = 280

 Score = 42.4 bits (98), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 23/63 (36%), Positives = 32/63 (50%), Gaps = 1/63 (1%)

Query: 932 AEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRF-VVHPGWVEASALLYRRANEQD 990
           AE FGA C + ++D +TH+VA S  T K   A    +  VV P W+  S   + R  E  
Sbjct: 2   AEDFGARCHRTLNDNITHLVATSPRTSKAQQAFRKQKIQVVWPSWLNDSICRWVRQGEVP 61

Query: 991 FAI 993
           + I
Sbjct: 62  YLI 64


>gi|392578955|gb|EIW72082.1| hypothetical protein TREMEDRAFT_41494 [Tremella mesenterica DSM
           1558]
          Length = 193

 Score = 42.4 bits (98), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 30/96 (31%), Positives = 47/96 (48%), Gaps = 14/96 (14%)

Query: 683 CLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWT 742
           CLVLDLD TLL+S+   ++ P  D I+  + E                 ++   RPG+  
Sbjct: 27  CLVLDLDETLLHSS--FKMLPSADYIVPVEIEGQVHN------------VYVIKRPGVDR 72

Query: 743 FLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLF 778
           FL    K++E+ ++T     YA  +  +LDP GV+ 
Sbjct: 73  FLYEMGKIYEVVVFTASLSKYADPVLDMLDPNGVVL 108


>gi|145533244|ref|XP_001452372.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124420060|emb|CAK84975.1| unnamed protein product [Paramecium tetraurelia]
          Length = 250

 Score = 42.4 bits (98), Expect = 1.4,   Method: Composition-based stats.
 Identities = 31/98 (31%), Positives = 50/98 (51%), Gaps = 14/98 (14%)

Query: 680 RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPG 739
           ++  LVLDLD TL++S    E   + DE +  K  ++ EK             + K+RP 
Sbjct: 71  KEFTLVLDLDETLIHSDL--ERTSILDEEIIVKIGENIEK------------YYIKVRPY 116

Query: 740 IWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 777
              FL+  S+LF++ ++T   K YA ++   LDP G +
Sbjct: 117 AREFLQSLSQLFDLVIFTAALKEYADKVIDFLDPCGFI 154


>gi|56549681|ref|NP_005799.2| CTD small phosphatase-like protein isoform 2 [Homo sapiens]
 gi|31074181|gb|AAP34400.1| small CTD phosphatase 3 [Homo sapiens]
 gi|34392245|emb|CAE11804.1| RB serine phosphatase [Homo sapiens]
 gi|34596234|gb|AAQ76797.1| HYA22 [Homo sapiens]
 gi|187252491|gb|AAI66643.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
           small phosphatase-like [synthetic construct]
 gi|410228142|gb|JAA11290.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
           small phosphatase-like [Pan troglodytes]
 gi|410291072|gb|JAA24136.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
           small phosphatase-like [Pan troglodytes]
 gi|410334183|gb|JAA36038.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
           small phosphatase-like [Pan troglodytes]
          Length = 265

 Score = 42.4 bits (98), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 47/170 (27%), Positives = 73/170 (42%), Gaps = 33/170 (19%)

Query: 657 QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 716
           ++   +QK   + L  +  +    K C+V+DLD TL++S+      P+ +       E D
Sbjct: 71  EENGGLQKPPAKYLLPEVTVLDYGKKCVVIDLDETLVHSS----FKPISNADFIVPVEID 126

Query: 717 REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 776
                 ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  GV
Sbjct: 127 GTIHQVYVLKRPHVD----------EFLQRMGQLFECVLFTASLAKYADPVADLLDRWGV 176

Query: 777 LFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 820
             A       V  RG+              KDL   LG E S V+I+D+S
Sbjct: 177 FRARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIVDNS 213


>gi|389749543|gb|EIM90714.1| hypothetical protein STEHIDRAFT_144286 [Stereum hirsutum FP-91666
           SS1]
          Length = 1202

 Score = 42.4 bits (98), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 21/64 (32%), Positives = 34/64 (53%)

Query: 928 LWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRAN 987
           L+  A + GA  T    D+VTH++A + G  K   AL     ++ P WV  S  +++R +
Sbjct: 75  LFSCASKLGANTTSDFTDRVTHLIAITHGGAKYMCALERKIPIMQPSWVHDSFEIWQRGD 134

Query: 988 EQDF 991
           + DF
Sbjct: 135 DVDF 138


>gi|126338058|ref|XP_001362218.1| PREDICTED: protein ECT2 isoform 1 [Monodelphis domestica]
          Length = 881

 Score = 42.4 bits (98), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 23/68 (33%), Positives = 33/68 (48%), Gaps = 3/68 (4%)

Query: 925 LHPLWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYR 984
           L  L   A   G    K  + +VTH+VAN    +K   A+S G  ++ P W+  +   + 
Sbjct: 158 LVKLVTLAHHMGGTIRKDFNSKVTHLVANCTQGEKFRVAVSLGTPIMKPEWIFKA---WE 214

Query: 985 RANEQDFA 992
           R NEQDF 
Sbjct: 215 RRNEQDFC 222


>gi|119598867|gb|EAW78461.1| epithelial cell transforming sequence 2 oncogene, isoform CRA_b
           [Homo sapiens]
          Length = 927

 Score = 42.4 bits (98), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 21/56 (37%), Positives = 31/56 (55%), Gaps = 3/56 (5%)

Query: 936 GAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDF 991
           G V  K  + +VTH+VAN    +K   A+S G  ++ P W+  +   + R NEQDF
Sbjct: 172 GGVIRKDFNSKVTHLVANCTQGEKFRVAVSLGTPIMKPEWIYKA---WERRNEQDF 224


>gi|403333986|gb|EJY66132.1| Serine/threonine-protein phosphatase dullard [Oxytricha trifallax]
          Length = 913

 Score = 42.0 bits (97), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 53/188 (28%), Positives = 84/188 (44%), Gaps = 21/188 (11%)

Query: 673 QKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGM 732
           Q+K ++  K  LVLDLD TL++S+      P  D +L  + E +               +
Sbjct: 113 QQKPWATGKKTLVLDLDETLVHSSFKPPAKP--DIVLPVEIEGNV------------CNV 158

Query: 733 WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPF 792
           +  +RPG   FL+R +K +E+ +YT     YA  +  +LD K        + R +    F
Sbjct: 159 FVLIRPGTEFFLQRLAKCYEIVIYTASLSKYADPLIDILDNKTQKIIDYRLFR-EHCTFF 217

Query: 793 DGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGL--L 850
            G     K   L G L  +S  +IID+S   +  ++ N + +  +   P  R  F L  L
Sbjct: 218 QG--VFIKDMSLPGRLLQDS--IIIDNSPTSYAFHQENALPILSWYDDPKDRCLFELIPL 273

Query: 851 GPSLLEID 858
             SL E+D
Sbjct: 274 LESLAEVD 281


>gi|426221551|ref|XP_004004972.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
           small phosphatase 1 [Ovis aries]
          Length = 260

 Score = 42.0 bits (97), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 34/123 (27%), Positives = 56/123 (45%), Gaps = 14/123 (11%)

Query: 657 QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 716
           ++  A+ K   + L  + K     K+C+V+DLD TL++S+      PV++       E D
Sbjct: 65  EENGAVPKAPVQYLLPEAKAQDLDKICVVIDLDETLVHSS----FKPVNNADFIIPVEID 120

Query: 717 REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 776
                 ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  G 
Sbjct: 121 GVVHQVYVLKRPHVD----------EFLQRMGELFECVLFTASLAKYADPVADLLDKWGA 170

Query: 777 LFA 779
             A
Sbjct: 171 FRA 173


>gi|402860629|ref|XP_003894728.1| PREDICTED: CTD small phosphatase-like protein [Papio anubis]
          Length = 276

 Score = 42.0 bits (97), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 48/172 (27%), Positives = 72/172 (41%), Gaps = 33/172 (19%)

Query: 655 DDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEE 714
           D +Q   I     + L  +  +    K C+V+DLD TL++S+      P+ +       E
Sbjct: 80  DQRQVIPIPSPPAKYLLPEVTVLDYGKKCVVIDLDETLVHSS----FKPISNADFIVPVE 135

Query: 715 QDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 774
            D      ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  
Sbjct: 136 IDGTIHQVYVLKRPHVD----------EFLQRMGQLFECVLFTASLAKYADPVADLLDRW 185

Query: 775 GVLFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 820
           GV  A       V  RG+              KDL   LG E S V+I+D+S
Sbjct: 186 GVFRARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIVDNS 224


>gi|443896177|dbj|GAC73521.1| nucleotide excision repair factor NEF2, RAD4/CUT5 component
           [Pseudozyma antarctica T-34]
          Length = 1956

 Score = 42.0 bits (97), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 18/51 (35%), Positives = 29/51 (56%)

Query: 928 LWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEA 978
           L Q A + GA    ++ + VTH++A+  G+ K  +AL  G  +V P W+ A
Sbjct: 100 LTQYARELGARVEGNLTEDVTHLIADRPGSQKYRFALDLGMHIVSPDWIHA 150


>gi|431919455|gb|ELK17974.1| CTD small phosphatase-like protein [Pteropus alecto]
          Length = 309

 Score = 42.0 bits (97), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 48/172 (27%), Positives = 72/172 (41%), Gaps = 33/172 (19%)

Query: 655 DDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEE 714
           D +Q   I     + L  +  +    K C+V+DLD TL++S+      P+ +       E
Sbjct: 113 DQRQVIPIPSPPAKYLLPEVTVLDYGKKCVVIDLDETLVHSS----FKPISNADFIVPVE 168

Query: 715 QDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 774
            D      ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  
Sbjct: 169 IDGTIHQVYVLKRPHVD----------EFLQRMGQLFECVLFTASLAKYADPVADLLDRW 218

Query: 775 GVLFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 820
           GV  A       V  RG+              KDL   LG E S V+I+D+S
Sbjct: 219 GVFRARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIVDNS 257


>gi|395816725|ref|XP_003781844.1| PREDICTED: CTD small phosphatase-like protein isoform 2 [Otolemur
           garnettii]
          Length = 276

 Score = 42.0 bits (97), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 48/172 (27%), Positives = 72/172 (41%), Gaps = 33/172 (19%)

Query: 655 DDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEE 714
           D +Q   I     + L  +  +    K C+V+DLD TL++S+      P+ +       E
Sbjct: 80  DQRQVIPIPSPPAKYLLPEVTVLDYGKKCVVIDLDETLVHSS----FKPISNADFIVPVE 135

Query: 715 QDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 774
            D      ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  
Sbjct: 136 IDGTIHQVYVLKRPHVD----------EFLQRMGQLFECVLFTASLAKYADPVADLLDRW 185

Query: 775 GVLFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 820
           GV  A       V  RG+              KDL   LG E S V+I+D+S
Sbjct: 186 GVFRARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIVDNS 224


>gi|335298851|ref|XP_003358411.1| PREDICTED: CTD small phosphatase-like protein-like [Sus scrofa]
          Length = 276

 Score = 42.0 bits (97), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 48/172 (27%), Positives = 72/172 (41%), Gaps = 33/172 (19%)

Query: 655 DDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEE 714
           D +Q   I     + L  +  +    K C+V+DLD TL++S+      P+ +       E
Sbjct: 80  DQRQVIPIPSPPAKYLLPEVTVLDYGKKCVVIDLDETLVHSS----FKPISNADFIVPVE 135

Query: 715 QDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 774
            D      ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  
Sbjct: 136 IDGTIHQVYVLKRPHVD----------EFLQRMGQLFECVLFTASLAKYADPVADLLDRW 185

Query: 775 GVLFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 820
           GV  A       V  RG+              KDL   LG E S V+I+D+S
Sbjct: 186 GVFRARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIVDNS 224


>gi|71026803|ref|XP_763045.1| nuclear LIM interactor-interacting factor 1 [Theileria parva strain
           Muguga]
 gi|68349998|gb|EAN30762.1| nuclear LIM interactor-interacting factor 1, putative [Theileria
           parva]
          Length = 254

 Score = 42.0 bits (97), Expect = 1.7,   Method: Composition-based stats.
 Identities = 59/219 (26%), Positives = 98/219 (44%), Gaps = 33/219 (15%)

Query: 675 KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWT 734
           K+ + RK  LVLDLD TL++S+     +P ++    +  +   E+            ++ 
Sbjct: 64  KVCTVRKKMLVLDLDETLIHSS----FEPSNNSFPMQLMQNGVERT-----------IYI 108

Query: 735 KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDG 794
             RP +  FL   S  +E+ ++T G K YA  +   +DP GV    R + R D    ++G
Sbjct: 109 GKRPYLSEFLSVVSNFYEIVIFTAGLKSYADPVIDFIDPDGV--CKRRLFR-DSCKYWNG 165

Query: 795 DERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSL 854
                  KDLE +      VV ID+S   +  N  N I +E + +   +  +   L P L
Sbjct: 166 ----YYIKDLEILNKPLKDVVTIDNSPCCYCLNPENAIPIETW-FNDENDSELCDLVPLL 220

Query: 855 LEIDHDERSEDGTLASSLGVIERL--HKIFFSHQSLDDV 891
             + H   +ED T      +I  L  H+IFF+  ++  +
Sbjct: 221 RRLAH---TEDVT-----NIIPSLFNHEIFFTKVTISQI 251


>gi|410971731|ref|XP_003992318.1| PREDICTED: CTD small phosphatase-like protein [Felis catus]
          Length = 307

 Score = 42.0 bits (97), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 48/172 (27%), Positives = 72/172 (41%), Gaps = 33/172 (19%)

Query: 655 DDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEE 714
           D +Q   I     + L  +  +    K C+V+DLD TL++S+      P+ +       E
Sbjct: 111 DQRQVIPIPSPPAKYLLPEATVLDYGKKCVVIDLDETLVHSS----FKPISNADFIVPVE 166

Query: 715 QDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 774
            D      ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  
Sbjct: 167 IDGTIHQVYVLKRPHVD----------EFLQRMGQLFECVLFTASLAKYADPVADLLDRW 216

Query: 775 GVLFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 820
           GV  A       V  RG+              KDL   LG E S V+I+D+S
Sbjct: 217 GVFRARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIVDNS 255


>gi|195587438|ref|XP_002083469.1| GD13336 [Drosophila simulans]
 gi|194195478|gb|EDX09054.1| GD13336 [Drosophila simulans]
          Length = 253

 Score = 42.0 bits (97), Expect = 1.8,   Method: Composition-based stats.
 Identities = 37/142 (26%), Positives = 70/142 (49%), Gaps = 22/142 (15%)

Query: 644 WGDVEHLFEGYDDQQKAAIQKE-RTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVD 702
           W  +E ++  + +      Q E R   + + +    ARK  LVLD+D+T++ S       
Sbjct: 33  WFFLERVYRDFMEYTPIIYQSEDRLSPVSKSRLSLVARK-TLVLDMDNTMITSW------ 85

Query: 703 PVHDEILRKKEEQDREKPH-RHLFRFPHMG-----MWTKLRPGIWTFLERASKLFEMHLY 756
                   K+ ++   KP   H F+F ++G     ++   RP +  FL+R SK +++ ++
Sbjct: 86  ------FIKRGKKPENKPRIAHDFKF-YLGAYGATIYVYKRPYLDHFLDRVSKWYDLTVF 138

Query: 757 TMGNKLYATEMAKVLDP-KGVL 777
           T G ++YA+ +   LD  +G+L
Sbjct: 139 TSGAEIYASPILDFLDRGRGIL 160


>gi|118371686|ref|XP_001019041.1| NLI interacting factor-like phosphatase family protein [Tetrahymena
           thermophila]
 gi|89300808|gb|EAR98796.1| NLI interacting factor-like phosphatase family protein [Tetrahymena
           thermophila SB210]
          Length = 379

 Score = 42.0 bits (97), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 36/138 (26%), Positives = 68/138 (49%), Gaps = 23/138 (16%)

Query: 684 LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTF 743
           LVLDLD TL++    +E     D  +   + Q+++K +           +   R  +  F
Sbjct: 212 LVLDLDETLIHC---NEKSLNDDSSIITVQFQNQQKNY-----------YLHQRGYLQEF 257

Query: 744 LERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKD 803
           LE+ +  F +++YT   + YA E+ K++DP+ V+   +V  R      +DG + +   K 
Sbjct: 258 LEQCALNFNIYIYTASTRDYAEEVVKIIDPRSVI--KKVYDRS--SCFYDGKQYLKSLK- 312

Query: 804 LEGVLGME-SAVVIIDDS 820
               LG++ +  V+ID++
Sbjct: 313 ---TLGLDLNRTVMIDNN 327


>gi|194752999|ref|XP_001958806.1| GF12569 [Drosophila ananassae]
 gi|190620104|gb|EDV35628.1| GF12569 [Drosophila ananassae]
          Length = 282

 Score = 42.0 bits (97), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 36/141 (25%), Positives = 63/141 (44%), Gaps = 7/141 (4%)

Query: 636 VGAHPQSAWGDVEHLFEGYDDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNS 695
           +G++ +S +  +    E Y       I KE     E Q+++    +  LVLDLD TL++S
Sbjct: 50  LGSYVRSFFSLIATKVESYLRPVTEPIYKEVPLSPESQRRLRQVGRKTLVLDLDETLVHS 109

Query: 696 AKFHEVDPVHDEILRKKEEQDREKPHRHLFR----FPHMGMWTKLRPGIWTFLERASKLF 751
                 DP  +E++         KP   L         +      RP +  FL+ ASK +
Sbjct: 110 CY---SDPETNELVGCSLVPQTAKPDYELSVTLEGLDPIAFQVYKRPHVDVFLKFASKWY 166

Query: 752 EMHLYTMGNKLYATEMAKVLD 772
           ++ ++T   ++YA ++   LD
Sbjct: 167 DLVIFTASLEVYAAQVVDRLD 187


>gi|302808547|ref|XP_002985968.1| hypothetical protein SELMODRAFT_5469 [Selaginella moellendorffii]
 gi|300146475|gb|EFJ13145.1| hypothetical protein SELMODRAFT_5469 [Selaginella moellendorffii]
          Length = 134

 Score = 42.0 bits (97), Expect = 1.9,   Method: Composition-based stats.
 Identities = 46/156 (29%), Positives = 68/156 (43%), Gaps = 24/156 (15%)

Query: 684 LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKP-HRHLFRFPHMGMWTKLRPGIWT 742
           LVLD+D TL+++         H  I   K    +  P  R+L            RPG+ T
Sbjct: 1   LVLDMDETLIHA---------HKAIASLKLFSGKTLPLQRYL---------VAKRPGVDT 42

Query: 743 FLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFD-GDERVPKS 801
           FL   S+++E+ ++T   K YA  +   LDP G LF  R+    D   P + G  +V   
Sbjct: 43  FLNEMSEIYEIVVFTRAVKPYADRILDRLDPAGNLFTHRLYR--DSCSPKEVGGRKV--V 98

Query: 802 KDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERY 837
           KDL  +       VI+DD    +     N IV+  +
Sbjct: 99  KDLSRLGRDLRHTVIVDDKPESFCLQPSNGIVIRAF 134


>gi|145528305|ref|XP_001449952.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124417541|emb|CAK82555.1| unnamed protein product [Paramecium tetraurelia]
          Length = 460

 Score = 42.0 bits (97), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 32/115 (27%), Positives = 56/115 (48%), Gaps = 17/115 (14%)

Query: 669 RLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVD-PVHDEILRKKEEQDREKPHRHLFRF 727
           +L++Q K   +  + L+ DLD TL++    +E D  ++D IL             +L + 
Sbjct: 249 KLDKQNKYKDS--ITLIFDLDETLIHC---NERDHKLYDAILT-----------VNLNKT 292

Query: 728 PHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRV 782
             +     +RP     L + S+ FE+ ++T  NK+YA  +   LDP   +FA R+
Sbjct: 293 QQVQAKINVRPNAVEILRKLSENFELIVFTASNKIYAKSVIDYLDPNKDIFAHRL 347


>gi|145497555|ref|XP_001434766.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124401894|emb|CAK67369.1| unnamed protein product [Paramecium tetraurelia]
          Length = 249

 Score = 42.0 bits (97), Expect = 1.9,   Method: Composition-based stats.
 Identities = 48/160 (30%), Positives = 68/160 (42%), Gaps = 21/160 (13%)

Query: 678 SARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLR 737
           + ++  LVLDLD TL+ S    E     DE +  K     EK             + K+R
Sbjct: 68  TEKEFTLVLDLDETLIRSEM--ERTSFLDEEIIVKIGNTIEK------------YYVKIR 113

Query: 738 PGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDER 797
           P    FL+  SK FE+ ++T   K YA ++   LDP G  F  R   R D     DG   
Sbjct: 114 PFARDFLKALSKYFELVIFTAALKEYADKVIDYLDPSG--FIKRRFYR-DSCTKKDG--- 167

Query: 798 VPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERY 837
               KDL  V        IID+S+     N  N ++++ +
Sbjct: 168 -VFYKDLTKVNSNLEKTFIIDNSLSGMSLNPQNGLLIKSW 206


>gi|291234069|ref|XP_002736972.1| PREDICTED: CTD (carboxy-terminal domain, RNA polymerase II,
           polypeptide A) small phosphatase 1-like [Saccoglossus
           kowalevskii]
          Length = 251

 Score = 42.0 bits (97), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 45/141 (31%), Positives = 66/141 (46%), Gaps = 23/141 (16%)

Query: 681 KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 740
           KLC+V+DLD TL++S+      PV +       E D      ++ +          RP +
Sbjct: 62  KLCIVIDLDETLVHSS----FKPVSNADFVVPVEIDGTVHQVYVLK----------RPFV 107

Query: 741 WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPK 800
             FL++  +LFE  L+T     YA  +A +LD  GV F  R+     D   F    R   
Sbjct: 108 DEFLQKMGELFECVLFTASLSKYADPVADLLDKWGV-FRARLFR---DSCVF---HRGNY 160

Query: 801 SKDLEGVLGME-SAVVIIDDS 820
            KDL G LG +   +VI+D+S
Sbjct: 161 VKDL-GRLGRDLKKIVIVDNS 180


>gi|296475139|tpg|DAA17254.1| TPA: small CTD phosphatase 3-like [Bos taurus]
          Length = 276

 Score = 42.0 bits (97), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 48/172 (27%), Positives = 72/172 (41%), Gaps = 33/172 (19%)

Query: 655 DDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEE 714
           D +Q   I     + L  +  +    K C+V+DLD TL++S+      P+ +       E
Sbjct: 80  DQRQIIPIPSPPAKYLLPEVTVLDYGKKCVVIDLDETLVHSS----FKPISNADFIVPVE 135

Query: 715 QDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 774
            D      ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  
Sbjct: 136 IDGTIHQVYVLKRPHVD----------EFLQRMGQLFECVLFTASLAKYADPVADLLDRW 185

Query: 775 GVLFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 820
           GV  A       V  RG+              KDL   LG E S V+I+D+S
Sbjct: 186 GVFRARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIVDNS 224


>gi|397610231|gb|EJK60724.1| hypothetical protein THAOC_18872 [Thalassiosira oceanica]
          Length = 231

 Score = 42.0 bits (97), Expect = 2.0,   Method: Composition-based stats.
 Identities = 47/166 (28%), Positives = 70/166 (42%), Gaps = 20/166 (12%)

Query: 682 LCLVLDLDHTLLNSAK-----FHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKL 736
           L +  DLDHT+L S        + VD      LR  ++ D + P    F        T L
Sbjct: 31  LDIFFDLDHTILCSISPLPISDNGVDNGVGRTLRWFDQIDDDFP----FEGNSPNTRTFL 86

Query: 737 RPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDE 796
           RP     +   S +  +H+YT   K Y   + KV+DP+  LF G+V+ R D  D      
Sbjct: 87  RPLSTATIYFCSVIGRVHVYTAAQKSYTDNILKVIDPRRTLF-GQVLHRDDHPDI----- 140

Query: 797 RVPKSKDL----EGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYT 838
            V   KDL     G +G+  +V+  D      P    N +++  +T
Sbjct: 141 -VRNGKDLLFAGGGEVGLRRSVLFDDKFSNFVPQQYRNGVLIRPFT 185


>gi|354467729|ref|XP_003496321.1| PREDICTED: CTD small phosphatase-like protein-like [Cricetulus
           griseus]
          Length = 342

 Score = 41.6 bits (96), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 44/146 (30%), Positives = 64/146 (43%), Gaps = 33/146 (22%)

Query: 681 KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 740
           K C+V+DLD TL++S+      P+ +       E D      ++ + PH+          
Sbjct: 172 KKCVVIDLDETLVHSS----FKPISNADFIVPVEIDGTIHQVYVLKRPHVD--------- 218

Query: 741 WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA-----GRVISRGDDGDPFDGD 795
             FL+R  +LFE  L+T     YA  +A +LD  GV  A       V  RG+        
Sbjct: 219 -EFLQRMGQLFECVLFTASLAKYADPVADLLDRWGVFRARLFRESCVFHRGN-------- 269

Query: 796 ERVPKSKDLEGVLGME-SAVVIIDDS 820
                 KDL   LG E S V+I+D+S
Sbjct: 270 ----YVKDLSR-LGRELSKVIIVDNS 290


>gi|338714770|ref|XP_001489080.3| PREDICTED: CTD small phosphatase-like protein-like [Equus caballus]
          Length = 268

 Score = 41.6 bits (96), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 48/172 (27%), Positives = 72/172 (41%), Gaps = 33/172 (19%)

Query: 655 DDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEE 714
           D +Q   I     + L  +  +    K C+V+DLD TL++S+      P+ +       E
Sbjct: 72  DQRQVIPIPSPPAKYLLPEVTVLDYGKKCVVIDLDETLVHSS----FKPISNADFIVPVE 127

Query: 715 QDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 774
            D      ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  
Sbjct: 128 IDGTIHQVYVLKRPHVD----------EFLQRMGQLFECVLFTASLAKYADPVADLLDRW 177

Query: 775 GVLFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 820
           GV  A       V  RG+              KDL   LG E S V+I+D+S
Sbjct: 178 GVFRARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIVDNS 216


>gi|330688428|ref|NP_001180010.2| CTD small phosphatase-like protein [Bos taurus]
          Length = 276

 Score = 41.6 bits (96), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 48/172 (27%), Positives = 72/172 (41%), Gaps = 33/172 (19%)

Query: 655 DDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEE 714
           D +Q   I     + L  +  +    K C+V+DLD TL++S+      P+ +       E
Sbjct: 80  DQRQIIPIPSPPAKYLLPEVTVLDYGKKCVVIDLDETLVHSS----FKPISNADFIVPVE 135

Query: 715 QDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 774
            D      ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  
Sbjct: 136 IDGTIHQVYVLKRPHVD----------EFLQRMGQLFECVLFTASLAKYADPVADLLDRW 185

Query: 775 GVLFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 820
           GV  A       V  RG+              KDL   LG E S V+I+D+S
Sbjct: 186 GVFRARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIVDNS 224


>gi|390476487|ref|XP_002759760.2| PREDICTED: CTD small phosphatase-like protein-like [Callithrix
           jacchus]
          Length = 451

 Score = 41.6 bits (96), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 48/172 (27%), Positives = 72/172 (41%), Gaps = 33/172 (19%)

Query: 655 DDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEE 714
           D +Q   I     + L  +  +    K C+V+DLD TL++S+      P+ +       E
Sbjct: 255 DQRQVIPIPSPPAKYLLPEVTVLDYGKKCVVIDLDETLVHSS----FKPISNADFIVPVE 310

Query: 715 QDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 774
            D      ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  
Sbjct: 311 IDGTIHQVYVLKRPHVD----------EFLQRMGQLFECVLFTASLAKYADPVADLLDRW 360

Query: 775 GVLFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 820
           GV  A       V  RG+              KDL   LG E S V+I+D+S
Sbjct: 361 GVFRARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIVDNS 399


>gi|354502403|ref|XP_003513276.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
           small phosphatase 1-like [Cricetulus griseus]
          Length = 342

 Score = 41.6 bits (96), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 35/124 (28%), Positives = 58/124 (46%), Gaps = 15/124 (12%)

Query: 657 QQKAAIQKER-TRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQ 715
           ++  AI K+   + L  + K   + K+C+V+DLD TL++S+      PV++       E 
Sbjct: 146 EENGAIPKQTPVQYLLPEAKAQDSDKICVVIDLDETLVHSS----FKPVNNADFIIPVEI 201

Query: 716 DREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKG 775
           D      ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  G
Sbjct: 202 DGVIHQVYVLKRPHVD----------EFLQRMGELFECVLFTASLAKYADPVADLLDKWG 251

Query: 776 VLFA 779
              A
Sbjct: 252 AFRA 255


>gi|401423666|ref|XP_003876319.1| nuclear lim interactor-interacting factor-like protein [Leishmania
           mexicana MHOM/GT/2001/U1103]
 gi|322492561|emb|CBZ27838.1| nuclear lim interactor-interacting factor-like protein [Leishmania
           mexicana MHOM/GT/2001/U1103]
          Length = 290

 Score = 41.6 bits (96), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 28/100 (28%), Positives = 47/100 (47%), Gaps = 12/100 (12%)

Query: 678 SARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLR 737
           S  K+ LVLD+D TL++S      D V+D++L    E                 +  K R
Sbjct: 109 SVPKVTLVLDVDETLVHSTFQPSSDVVYDKVLLVPSEGK------------TYTVSVKYR 156

Query: 738 PGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 777
           P +  FL   S+ FE+ ++T   + Y  ++   +D +G+L
Sbjct: 157 PYLEDFLRFVSRRFEIVVFTASMRAYCDKLMDEIDTQGIL 196


>gi|449546309|gb|EMD37278.1| hypothetical protein CERSUDRAFT_49651 [Ceriporiopsis subvermispora
           B]
          Length = 295

 Score = 41.6 bits (96), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 30/91 (32%), Positives = 45/91 (49%), Gaps = 14/91 (15%)

Query: 683 CLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWT 742
           CLVLDLD TL++S+    + P  D I+  + EQ           F H   +   RPG+  
Sbjct: 127 CLVLDLDETLVHSSLRAVLSP--DYIVPVEIEQ-----------FWH-NFYVLKRPGVDD 172

Query: 743 FLERASKLFEMHLYTMGNKLYATEMAKVLDP 773
           FL R  +++E+ ++T     YA  +   LDP
Sbjct: 173 FLRRMGEIYEVVVFTASLSKYADPLLDRLDP 203


>gi|148232046|ref|NP_001084286.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
           small phosphatase-like [Xenopus laevis]
 gi|32396218|gb|AAP43959.1| NIF [Xenopus laevis]
 gi|114107822|gb|AAI23152.1| NIF protein [Xenopus laevis]
          Length = 276

 Score = 41.6 bits (96), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 49/172 (28%), Positives = 74/172 (43%), Gaps = 33/172 (19%)

Query: 655 DDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEE 714
           D  Q   I    T+ L  + K+    K C+V+DLD TL++S+      P+++       E
Sbjct: 80  DQTQALTIPSPPTKYLLPELKVSEYGKKCVVIDLDETLVHSS----FKPINNADFIVPVE 135

Query: 715 QDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 774
            D      ++ + PH+            FL++  ++FE  L+T     YA  +A +LD  
Sbjct: 136 IDGTIHQVYVLKRPHVD----------EFLQKMGEMFECVLFTASLAKYADPVADLLDRW 185

Query: 775 GVLFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 820
           GV  A       V  RG+              KDL   LG E S V+IID+S
Sbjct: 186 GVFNARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIIDNS 224


>gi|380799053|gb|AFE71402.1| CTD small phosphatase-like protein isoform 1, partial [Macaca
           mulatta]
          Length = 249

 Score = 41.6 bits (96), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 48/172 (27%), Positives = 72/172 (41%), Gaps = 33/172 (19%)

Query: 655 DDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEE 714
           D +Q   I     + L  +  +    K C+V+DLD TL++S+      P+ +       E
Sbjct: 53  DQRQVIPIPSPPAKYLLPEVTVLDYGKKCVVIDLDETLVHSS----FKPISNADFIVPVE 108

Query: 715 QDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 774
            D      ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  
Sbjct: 109 IDGTIHQVYVLKRPHVD----------EFLQRMGQLFECVLFTASLAKYADPVADLLDRW 158

Query: 775 GVLFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 820
           GV  A       V  RG+              KDL   LG E S V+I+D+S
Sbjct: 159 GVFRARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIVDNS 197


>gi|9972366|gb|AAG10616.1|AC008030_16 Hypothetical protein [Arabidopsis thaliana]
          Length = 247

 Score = 41.6 bits (96), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 45/166 (27%), Positives = 82/166 (49%), Gaps = 24/166 (14%)

Query: 666 RTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLF 725
           +  +LE+    ++  K  ++LDLD TL++ A  H     HD ++  K E++         
Sbjct: 60  KFEKLEDPLTGYTNMKRTIILDLDETLVH-ATTHLPGVKHDFMVMVKMEREI-------- 110

Query: 726 RFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISR 785
               M ++   RPG+  FLER  + +++ ++T G + YA+++   LD  GV+ + R+   
Sbjct: 111 ----MPIFVVKRPGVTEFLERLGENYKVVVFTAGLEEYASQVLDKLDKNGVI-SQRLYR- 164

Query: 786 GDDGDPFDGDERVPKSKDLEGVLG--MESAVVIIDD--SVRVWPHN 827
            D     +G       KDL  V+G  + SA+++ D+  S  + P N
Sbjct: 165 -DSCTEVNG----KYVKDLSLVVGKDLRSALIVDDNPSSYSLQPEN 205


>gi|148677300|gb|EDL09247.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
           small phosphatase-like [Mus musculus]
          Length = 288

 Score = 41.6 bits (96), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 44/146 (30%), Positives = 64/146 (43%), Gaps = 33/146 (22%)

Query: 681 KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 740
           K C+V+DLD TL++S+      P+ +       E D      ++ + PH+          
Sbjct: 118 KKCVVIDLDETLVHSS----FKPISNADFIVPVEIDGTIHQVYVLKRPHVD--------- 164

Query: 741 WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA-----GRVISRGDDGDPFDGD 795
             FL+R  +LFE  L+T     YA  +A +LD  GV  A       V  RG+        
Sbjct: 165 -EFLQRMGQLFECVLFTASLAKYADPVADLLDRWGVFRARLFRESCVFHRGN-------- 215

Query: 796 ERVPKSKDLEGVLGME-SAVVIIDDS 820
                 KDL   LG E S V+I+D+S
Sbjct: 216 ----YVKDLSR-LGRELSKVIIVDNS 236


>gi|145356819|ref|XP_001422622.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144582865|gb|ABP00939.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 190

 Score = 41.6 bits (96), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 43/138 (31%), Positives = 63/138 (45%), Gaps = 21/138 (15%)

Query: 684 LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTF 743
           LVLDLD TL           VH  +    E  D   P   +F      +  + RP + TF
Sbjct: 17  LVLDLDETL-----------VHSNLENTVERCDFSFPV--VFNGDMHRVNVRKRPHLSTF 63

Query: 744 LERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKD 803
           +E  SK +E+ ++T   ++YA ++  +LDP       R+    D     DG+      KD
Sbjct: 64  MELVSKQYEIVVFTASQQIYADKLLDILDPSQKWIKHRIFR--DSCVQIDGN----FMKD 117

Query: 804 LEGVLGME-SAVVIIDDS 820
           L  VLG + S  +IID+S
Sbjct: 118 LR-VLGRDLSRTIIIDNS 134


>gi|355559784|gb|EHH16512.1| hypothetical protein EGK_11800, partial [Macaca mulatta]
          Length = 250

 Score = 41.6 bits (96), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 48/172 (27%), Positives = 72/172 (41%), Gaps = 33/172 (19%)

Query: 655 DDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEE 714
           D +Q   I     + L  +  +    K C+V+DLD TL++S+      P+ +       E
Sbjct: 54  DQRQVIPIPSPPAKYLLPEVTVLDYGKKCVVIDLDETLVHSS----FKPISNADFIVPVE 109

Query: 715 QDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 774
            D      ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  
Sbjct: 110 IDGTIHQVYVLKRPHVD----------EFLQRMGQLFECVLFTASLAKYADPVADLLDRW 159

Query: 775 GVLFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 820
           GV  A       V  RG+              KDL   LG E S V+I+D+S
Sbjct: 160 GVFRARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIVDNS 198


>gi|401840826|gb|EJT43491.1| PSR1-like protein [Saccharomyces kudriavzevii IFO 1802]
          Length = 270

 Score = 41.6 bits (96), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 30/102 (29%), Positives = 48/102 (47%), Gaps = 24/102 (23%)

Query: 681 KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKE-----EQDREKPHRHLFRFPHMGMWTK 735
           K CL+LDLD TL++S+          + LR  +     E D +  + ++ +         
Sbjct: 100 KKCLILDLDETLVHSSF---------KYLRSADFVLPVEIDDQVHNVYVIK--------- 141

Query: 736 LRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVL 777
            RPG+  FLER  KLFE+ ++T     Y   +  +LD   V+
Sbjct: 142 -RPGVEEFLERVGKLFEVVVFTASVSRYGDPLLDILDTNKVI 182


>gi|47228058|emb|CAF97687.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 315

 Score = 41.6 bits (96), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 40/145 (27%), Positives = 63/145 (43%), Gaps = 31/145 (21%)

Query: 681 KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 740
           K C+V+DLD TL++S+      P+++       E D      ++ + PH+          
Sbjct: 95  KKCVVIDLDETLVHSS----FKPINNADFIVPVEIDGIVHQVYVLKRPHVD--------- 141

Query: 741 WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA-----GRVISRGDDGDPFDGD 795
             FL++  +LFE  L+T     YA  +A +LD  GV  A       V  RG+        
Sbjct: 142 -EFLQKMGELFECVLFTASLAKYADPVADLLDQWGVFRARLFRESCVFHRGN-------- 192

Query: 796 ERVPKSKDLEGVLGMESAVVIIDDS 820
                 KDL  +    S V+I+D+S
Sbjct: 193 ----YVKDLSRLGRKLSNVIIVDNS 213


>gi|380804483|gb|AFE74117.1| protein ECT2, partial [Macaca mulatta]
          Length = 186

 Score = 41.6 bits (96), Expect = 2.5,   Method: Composition-based stats.
 Identities = 21/58 (36%), Positives = 31/58 (53%), Gaps = 3/58 (5%)

Query: 934 QFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDF 991
             G V  K  + +VTH+VAN    +K   A+S G  ++ P W+  +   + R NEQDF
Sbjct: 120 HMGGVIRKDFNSKVTHLVANCTQGEKFRVAVSLGTPIMKPEWIYKA---WERRNEQDF 174


>gi|170583107|ref|XP_001896433.1| NLI interacting factor-like phosphatase family protein [Brugia
           malayi]
 gi|158596360|gb|EDP34717.1| NLI interacting factor-like phosphatase family protein [Brugia
           malayi]
          Length = 259

 Score = 41.6 bits (96), Expect = 2.5,   Method: Composition-based stats.
 Identities = 46/161 (28%), Positives = 72/161 (44%), Gaps = 31/161 (19%)

Query: 680 RKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD-------REKPHRHLFRFPHMGM 732
           ++  LVLDLD TL++S         HD I+R   +         R    RH  RF     
Sbjct: 75  KRKILVLDLDETLIHSH--------HDGIIRPMVKPGTPPDFVLRVNIDRHPVRF----- 121

Query: 733 WTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDP-KGVLFAGRVISRGDDGDP 791
           +   RP +  FL   S+ F++ ++T   ++Y + +A  LD  KG+L   R   R      
Sbjct: 122 FVHCRPHVDYFLSMVSQWFDLVIFTASMEIYGSSVADKLDNGKGIL--QRRYFRQHCTMD 179

Query: 792 FDGDERVPKSKDLEGVLGMESAVVIIDDS---VRVWPHNKL 829
           + G      +KDL  V    S++ I+D+S    R +P N +
Sbjct: 180 YGG-----YTKDLSAVHADLSSIFILDNSPSAYRKFPQNAI 215


>gi|156839904|ref|XP_001643638.1| hypothetical protein Kpol_478p16 [Vanderwaltozyma polyspora DSM
           70294]
 gi|156114257|gb|EDO15780.1| hypothetical protein Kpol_478p16 [Vanderwaltozyma polyspora DSM
           70294]
          Length = 350

 Score = 41.6 bits (96), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 29/101 (28%), Positives = 46/101 (45%), Gaps = 14/101 (13%)

Query: 683 CLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWT 742
           CLVLDLD TL++S+ F  V      +    ++Q                ++   RPG+  
Sbjct: 181 CLVLDLDETLVHSS-FKYVSTADFVLPVDIDDQ-------------FQNVYVIKRPGVDA 226

Query: 743 FLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVI 783
           FL+  SKLFE+ ++T   + Y   +  +LD    L   R+ 
Sbjct: 227 FLQYTSKLFEVVIFTASVEKYGNPLLDILDSTNDLVHHRLF 267


>gi|145494426|ref|XP_001433207.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124400324|emb|CAK65810.1| unnamed protein product [Paramecium tetraurelia]
          Length = 223

 Score = 41.2 bits (95), Expect = 2.7,   Method: Composition-based stats.
 Identities = 43/168 (25%), Positives = 77/168 (45%), Gaps = 34/168 (20%)

Query: 664 KERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRH 723
           K R  RL+E     + ++  LVLDLD TL++S    +   +   I      QD ++P   
Sbjct: 27  KSRFVRLKESN---NRKQKILVLDLDETLIHSCTHRDFPHITITI------QDNDEPIDI 77

Query: 724 LFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVI 783
            F          +RP    F++  S  + ++L+T  +++YA  +   LDPK   +   ++
Sbjct: 78  AF---------NVRPYCKEFIKEMSNYYTIYLFTASSEMYARAIVNHLDPKR-QYITDIL 127

Query: 784 SRGDDGDPFDG----DERVPKSKDLEGVLGMESAVVIIDDSVRVWPHN 827
            R +  +  +G    D R+  ++DL+        +VIID+     PH+
Sbjct: 128 CRNNCFETKNGFFIKDLRIITNRDLKD-------IVIIDN----LPHS 164


>gi|171460950|ref|NP_598471.3| CTD small phosphatase-like protein [Mus musculus]
 gi|408360295|sp|P58465.3|CTDSL_MOUSE RecName: Full=CTD small phosphatase-like protein; Short=CTDSP-like;
           AltName: Full=Carboxy-terminal domain RNA polymerase II
           polypeptide A small phosphatase 3; AltName:
           Full=NIF-like protein; AltName: Full=Nuclear LIM
           interactor-interacting factor 1; Short=NLI-interacting
           factor 1; AltName: Full=Small C-terminal domain
           phosphatase 3; Short=SCP3; Short=Small CTD phosphatase 3
 gi|62948141|gb|AAH94289.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
           small phosphatase-like [Mus musculus]
          Length = 276

 Score = 41.2 bits (95), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 44/146 (30%), Positives = 64/146 (43%), Gaps = 33/146 (22%)

Query: 681 KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 740
           K C+V+DLD TL++S+      P+ +       E D      ++ + PH+          
Sbjct: 106 KKCVVIDLDETLVHSS----FKPISNADFIVPVEIDGTIHQVYVLKRPHVD--------- 152

Query: 741 WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA-----GRVISRGDDGDPFDGD 795
             FL+R  +LFE  L+T     YA  +A +LD  GV  A       V  RG+        
Sbjct: 153 -EFLQRMGQLFECVLFTASLAKYADPVADLLDRWGVFRARLFRESCVFHRGN-------- 203

Query: 796 ERVPKSKDLEGVLGME-SAVVIIDDS 820
                 KDL   LG E S V+I+D+S
Sbjct: 204 ----YVKDLSR-LGRELSKVIIVDNS 224


>gi|417397992|gb|JAA46029.1| Putative carboxy-terminal domain rna polymerase ii polypeptide a
           small phosphatase 1 isoform 2 [Desmodus rotundus]
          Length = 260

 Score = 41.2 bits (95), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 34/123 (27%), Positives = 57/123 (46%), Gaps = 14/123 (11%)

Query: 657 QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 716
           ++  A+ K   + L  + K   + K+C+V+DLD TL++S+      PV++       E D
Sbjct: 65  EENGAVPKTPVQYLLPEAKPQDSDKICVVIDLDETLVHSS----FKPVNNADFIIPVEID 120

Query: 717 REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 776
                 ++ +          RP +  FL+R  +LFE  L+T     YA  +A +LD  G 
Sbjct: 121 GVVHQVYVLK----------RPYVDEFLQRMGELFECVLFTASLAKYADPVADLLDKWGA 170

Query: 777 LFA 779
             A
Sbjct: 171 FRA 173


>gi|432849192|ref|XP_004066577.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
           small phosphatase 1-like [Oryzias latipes]
          Length = 262

 Score = 41.2 bits (95), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 32/121 (26%), Positives = 55/121 (45%), Gaps = 14/121 (11%)

Query: 657 QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 716
           ++   + K + + L    K   A K+C+V+DLD TL++S+      PV++       E D
Sbjct: 67  EENGTLSKAQVKPLLPPVKSKDAGKICVVIDLDETLVHSS----FKPVNNADFIIPVEID 122

Query: 717 REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 776
                 ++ + PH+            FL+R  +LFE  L+T     YA  ++ +LD  G 
Sbjct: 123 GTVHQVYVLKRPHVD----------EFLKRMGELFECVLFTASLAKYADPVSDLLDKWGA 172

Query: 777 L 777
            
Sbjct: 173 F 173


>gi|32812783|emb|CAC69078.2| Mya22 protein [Mus musculus]
          Length = 276

 Score = 41.2 bits (95), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 44/146 (30%), Positives = 64/146 (43%), Gaps = 33/146 (22%)

Query: 681 KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 740
           K C+V+DLD TL++S+      P+ +       E D      ++ + PH+          
Sbjct: 106 KKCVVIDLDETLVHSS----FKPISNADFIVPVEIDGTIHQVYVLKRPHVD--------- 152

Query: 741 WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA-----GRVISRGDDGDPFDGD 795
             FL+R  +LFE  L+T     YA  +A +LD  GV  A       V  RG+        
Sbjct: 153 -EFLQRMGQLFECVLFTASLAKYADPVADLLDRWGVFRARLFRESCVFHRGN-------- 203

Query: 796 ERVPKSKDLEGVLGME-SAVVIIDDS 820
                 KDL   LG E S V+I+D+S
Sbjct: 204 ----YVKDLSR-LGRELSKVIIVDNS 224


>gi|348552620|ref|XP_003462125.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
           small phosphatase 1-like [Cavia porcellus]
          Length = 261

 Score = 41.2 bits (95), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 35/124 (28%), Positives = 58/124 (46%), Gaps = 15/124 (12%)

Query: 657 QQKAAIQKER-TRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQ 715
           ++  AI K+   + L  + K   + K+C+V+DLD TL++S+      PV++       E 
Sbjct: 65  EENGAIPKQTPVQYLLPEAKAQDSDKICVVIDLDETLVHSS----FKPVNNADFIIPVEI 120

Query: 716 DREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKG 775
           D      ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  G
Sbjct: 121 DGVIHQVYVLKRPHVD----------EFLQRMGELFECVLFTASLAKYADPVADLLDKWG 170

Query: 776 VLFA 779
              A
Sbjct: 171 AFRA 174


>gi|345321149|ref|XP_001521318.2| PREDICTED: CTD small phosphatase-like protein-like, partial
           [Ornithorhynchus anatinus]
          Length = 295

 Score = 41.2 bits (95), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 41/145 (28%), Positives = 62/145 (42%), Gaps = 31/145 (21%)

Query: 681 KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 740
           K C+V+DLD TL++S+      P+ +       E D      ++ + PH+          
Sbjct: 126 KKCVVIDLDETLVHSS----FKPISNADFIVPVEIDGTVHQVYVLKRPHVD--------- 172

Query: 741 WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA-----GRVISRGDDGDPFDGD 795
             FL+R  +LFE  L+T     YA  +A +LD  GV  A       V  RG+        
Sbjct: 173 -EFLQRMGQLFECVLFTASLAKYADPVADLLDRWGVFRARLFRESCVFHRGN-------- 223

Query: 796 ERVPKSKDLEGVLGMESAVVIIDDS 820
                 KDL  +    S V+I+D+S
Sbjct: 224 ----YVKDLSRLGRELSKVIIVDNS 244


>gi|145514934|ref|XP_001443372.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124410750|emb|CAK75975.1| unnamed protein product [Paramecium tetraurelia]
          Length = 401

 Score = 41.2 bits (95), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 29/133 (21%), Positives = 65/133 (48%), Gaps = 20/133 (15%)

Query: 684 LVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGIWTF 743
           L+LDLD TL++S  F +   V    +  ++++D+            + ++  +RP    F
Sbjct: 222 LILDLDETLIHSCTFRDSPQV---TITLQDDEDK------------VDLFFNVRPFCKEF 266

Query: 744 LERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDG----DERVP 799
           L   S  + ++++T  ++LYA  +   LDP    +   V+ R +  +  +G    D R+ 
Sbjct: 267 LREMSNYYNIYIFTASSELYANAIVNHLDP-NRQYINDVLCRNNCFETKNGFFIKDLRII 325

Query: 800 KSKDLEGVLGMES 812
            ++ L+ ++ +++
Sbjct: 326 TNRHLKDIVIVDN 338


>gi|452823685|gb|EME30693.1| putative CTD small phosphatase [Galdieria sulphuraria]
          Length = 397

 Score = 41.2 bits (95), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 42/156 (26%), Positives = 67/156 (42%), Gaps = 21/156 (13%)

Query: 681 KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 740
           K  LVLDLD TL++S  F       D +L  + E              ++ ++ K+RP +
Sbjct: 203 KKTLVLDLDETLVHSG-FEGSRETSDFVLSMQVEN------------TNLQLFVKMRPYL 249

Query: 741 WTFLERASKLFEMHLYTMGNKLYATEMAKVL-DPKGVLF---AGRVISRGDDGDPFDGDE 796
             FL+  +K FE+ ++T     YA  +  ++ D  GV       R+     + DP    E
Sbjct: 250 KEFLQEVTKHFEIVIFTASMVTYADPVIDLMFDATGVAHIPETHRLFRESCEYDP----E 305

Query: 797 RVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLI 832
                KDL  +      V+I+D+S   +  N  N I
Sbjct: 306 TCSFHKDLMALGRDIKKVIIVDNSPTAYTKNPYNAI 341


>gi|440900046|gb|ELR51261.1| CTD small phosphatase-like protein, partial [Bos grunniens mutus]
          Length = 250

 Score = 41.2 bits (95), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 48/172 (27%), Positives = 72/172 (41%), Gaps = 33/172 (19%)

Query: 655 DDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEE 714
           D +Q   I     + L  +  +    K C+V+DLD TL++S+      P+ +       E
Sbjct: 54  DQRQIIPIPSPPAKYLLPEVTVLDYGKKCVVIDLDETLVHSS----FKPISNADFIVPVE 109

Query: 715 QDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 774
            D      ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  
Sbjct: 110 IDGTIHQVYVLKRPHVD----------EFLQRMGQLFECVLFTASLAKYADPVADLLDRW 159

Query: 775 GVLFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 820
           GV  A       V  RG+              KDL   LG E S V+I+D+S
Sbjct: 160 GVFRARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIVDNS 198


>gi|323507821|emb|CBQ67692.1| related to DNA topoisomerase II binding protein [Sporisorium
           reilianum SRZ2]
          Length = 1201

 Score = 41.2 bits (95), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 17/49 (34%), Positives = 29/49 (59%)

Query: 928 LWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWV 976
           L Q A + GA    ++ + VTH++A+  G++K  +AL  G  +V P W+
Sbjct: 92  LTQYARELGARVEGNLTEDVTHLIADRPGSEKYRFALELGMHIVSPDWI 140


>gi|308503210|ref|XP_003113789.1| CRE-SCPL-2 protein [Caenorhabditis remanei]
 gi|308263748|gb|EFP07701.1| CRE-SCPL-2 protein [Caenorhabditis remanei]
          Length = 243

 Score = 41.2 bits (95), Expect = 2.9,   Method: Composition-based stats.
 Identities = 44/165 (26%), Positives = 73/165 (44%), Gaps = 29/165 (17%)

Query: 675 KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD-------REKPHRHLFRF 727
           ++ S ++  LVLDLD TL++S         HD +LR+  +         R    RH  +F
Sbjct: 51  RLLSVKRKILVLDLDETLIHSH--------HDGVLRQTVKPGTPSDFTIRVVIDRHPVKF 102

Query: 728 PHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGRVISRGD 787
                    RP +  FL   S+ +E+ ++T   ++Y + +A  LD       GR I +  
Sbjct: 103 S-----VHERPHVDYFLSTVSQWYELVVFTASMEVYGSSVADKLD------RGRGILKRR 151

Query: 788 DGDPFDGDERVPKSKDLEGVLGMESAVVIIDDS---VRVWPHNKL 829
                   E    +KDL  +    S++ I+D+S    R +PHN +
Sbjct: 152 YFRQHCTMEVGGYTKDLSAIHPDLSSICILDNSPGAYRKFPHNAI 196


>gi|344288137|ref|XP_003415807.1| PREDICTED: CTD small phosphatase-like protein-like [Loxodonta
           africana]
          Length = 281

 Score = 41.2 bits (95), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 44/146 (30%), Positives = 64/146 (43%), Gaps = 33/146 (22%)

Query: 681 KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 740
           K C+V+DLD TL++S+      P+ +       E D      ++ + PH+          
Sbjct: 111 KKCVVIDLDETLVHSS----FKPISNADFIVPVEIDGTIHQVYVLKRPHVD--------- 157

Query: 741 WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA-----GRVISRGDDGDPFDGD 795
             FL+R  +LFE  L+T     YA  +A +LD  GV  A       V  RG+        
Sbjct: 158 -EFLQRMGQLFECVLFTASLAKYADPVADLLDRWGVFRARLFRESCVFHRGN-------- 208

Query: 796 ERVPKSKDLEGVLGME-SAVVIIDDS 820
                 KDL   LG E S V+I+D+S
Sbjct: 209 ----YVKDLSR-LGRELSKVIIVDNS 229


>gi|92399505|gb|ABE76501.1| SCP3-like protein [Mustela putorius furo]
          Length = 276

 Score = 41.2 bits (95), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 44/146 (30%), Positives = 64/146 (43%), Gaps = 33/146 (22%)

Query: 681 KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 740
           K C+V+DLD TL++S+      P+ +       E D      ++ + PH+          
Sbjct: 106 KKCVVIDLDETLVHSS----FKPISNADFIVPVEIDGTIHQVYVLKRPHVD--------- 152

Query: 741 WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA-----GRVISRGDDGDPFDGD 795
             FL+R  +LFE  L+T     YA  +A +LD  GV  A       V  RG+        
Sbjct: 153 -EFLQRMGQLFECVLFTASLAKYADPVADLLDRWGVFRARLFRESCVFHRGN-------- 203

Query: 796 ERVPKSKDLEGVLGME-SAVVIIDDS 820
                 KDL   LG E S V+I+D+S
Sbjct: 204 ----YVKDLSR-LGRELSKVIIVDNS 224


>gi|410906319|ref|XP_003966639.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
           small phosphatase 1-like [Takifugu rubripes]
          Length = 262

 Score = 41.2 bits (95), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 31/121 (25%), Positives = 55/121 (45%), Gaps = 14/121 (11%)

Query: 657 QQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQD 716
           ++   + K + + L    K   + K+C+V+DLD TL++S+      PV++       E D
Sbjct: 67  EENGTVSKVQVKPLLPPAKSKDSGKICVVIDLDETLVHSS----FKPVNNADFIIPVEID 122

Query: 717 REKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGV 776
                 ++ + PH+            FL+R  +LFE  L+T     YA  ++ +LD  G 
Sbjct: 123 GTVHQVYVLKRPHVD----------EFLKRMGELFECVLFTASLAKYADPVSDLLDKWGA 172

Query: 777 L 777
            
Sbjct: 173 F 173


>gi|351699531|gb|EHB02450.1| Carboxy-terminal domain RNA polymerase II polypeptide A small
           phosphatase 1 [Heterocephalus glaber]
          Length = 261

 Score = 41.2 bits (95), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 35/124 (28%), Positives = 58/124 (46%), Gaps = 15/124 (12%)

Query: 657 QQKAAIQKER-TRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQ 715
           ++  AI K+   + L  + K   + K+C+V+DLD TL++S+      PV++       E 
Sbjct: 65  EENGAIPKQSPVQYLLPEAKAQDSDKICVVIDLDETLVHSS----FKPVNNADFIIPVEI 120

Query: 716 DREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKG 775
           D      ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  G
Sbjct: 121 DGVVHQVYVLKRPHVD----------EFLQRMGELFECVLFTASLAKYADPVADLLDKWG 170

Query: 776 VLFA 779
              A
Sbjct: 171 AFRA 174


>gi|348575309|ref|XP_003473432.1| PREDICTED: CTD small phosphatase-like protein-like [Cavia
           porcellus]
          Length = 294

 Score = 41.2 bits (95), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 44/146 (30%), Positives = 64/146 (43%), Gaps = 33/146 (22%)

Query: 681 KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 740
           K C+V+DLD TL++S+      P+ +       E D      ++ + PH+          
Sbjct: 124 KKCVVIDLDETLVHSS----FKPISNADFIVPVEIDGTIHQVYVLKRPHVD--------- 170

Query: 741 WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA-----GRVISRGDDGDPFDGD 795
             FL+R  +LFE  L+T     YA  +A +LD  GV  A       V  RG+        
Sbjct: 171 -EFLQRMGQLFECVLFTASLAKYADPVADLLDRWGVFRARLFRESCVFHRGN-------- 221

Query: 796 ERVPKSKDLEGVLGME-SAVVIIDDS 820
                 KDL   LG E S V+I+D+S
Sbjct: 222 ----YVKDLSR-LGRELSKVIIVDNS 242


>gi|145498624|ref|XP_001435299.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124402430|emb|CAK67902.1| unnamed protein product [Paramecium tetraurelia]
          Length = 417

 Score = 41.2 bits (95), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 31/111 (27%), Positives = 54/111 (48%), Gaps = 16/111 (14%)

Query: 717 REKPHRHLFRFPHMGMWTKL----RPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLD 772
           RE P   +  F   G   K+    RP    FL++ ++L+ +++YT  +  YA  +   LD
Sbjct: 253 RENPQVTVTAFGEYGEEAKIHFNIRPFCTWFLQQMNQLYTIYVYTASSSAYANAIVNYLD 312

Query: 773 PKGVLFAGRVISRGDDGDPFDG----DERVPKSKDLEGVLGMESAVVIIDD 819
           PK     G ++SRG+  +  +G    D R+  +K L+        +VI+D+
Sbjct: 313 PKRQWIMG-ILSRGNCMETKNGFFIKDLRIIGNKQLKD-------MVIVDN 355


>gi|417409172|gb|JAA51106.1| Putative ctd carboxy-terminal domain rna polymer, partial [Desmodus
           rotundus]
          Length = 265

 Score = 41.2 bits (95), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 44/146 (30%), Positives = 64/146 (43%), Gaps = 33/146 (22%)

Query: 681 KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 740
           K C+V+DLD TL++S+      P+ +       E D      ++ + PH+          
Sbjct: 95  KKCVVIDLDETLVHSS----FKPISNADFIVPVEIDGTTHQVYVLKRPHVD--------- 141

Query: 741 WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA-----GRVISRGDDGDPFDGD 795
             FL+R  +LFE  L+T     YA  +A +LD  GV  A       V  RG+        
Sbjct: 142 -EFLQRMGQLFECVLFTASLAKYADPVADLLDRWGVFRARLFRESCVFHRGN-------- 192

Query: 796 ERVPKSKDLEGVLGME-SAVVIIDDS 820
                 KDL   LG E S V+I+D+S
Sbjct: 193 ----YVKDLSR-LGRELSKVIIVDNS 213


>gi|403266876|ref|XP_003925586.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
           small phosphatase 1 isoform 2 [Saimiri boliviensis
           boliviensis]
          Length = 248

 Score = 41.2 bits (95), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 35/124 (28%), Positives = 58/124 (46%), Gaps = 15/124 (12%)

Query: 657 QQKAAIQKER-TRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQ 715
           ++  AI K+   + L  + K   + K+C+V+DLD TL++S+      PV++       E 
Sbjct: 52  EENGAIPKQTPVQYLLPEAKAQDSDKICVVIDLDETLVHSS----FKPVNNADFIIPVEI 107

Query: 716 DREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKG 775
           D      ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  G
Sbjct: 108 DGVVHQVYVLKRPHVD----------EFLQRMGELFECVLFTASLAKYADPVADLLDKWG 157

Query: 776 VLFA 779
              A
Sbjct: 158 AFRA 161


>gi|426339960|ref|XP_004033903.1| PREDICTED: CTD small phosphatase-like protein [Gorilla gorilla
           gorilla]
          Length = 369

 Score = 41.2 bits (95), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 44/146 (30%), Positives = 64/146 (43%), Gaps = 33/146 (22%)

Query: 681 KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 740
           K C+V+DLD TL++S+      P+ +       E D      ++ + PH+          
Sbjct: 168 KKCVVIDLDETLVHSS----FKPISNADFIVPVEIDGTIHQVYVLKRPHVD--------- 214

Query: 741 WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA-----GRVISRGDDGDPFDGD 795
             FL+R  +LFE  L+T     YA  +A +LD  GV  A       V  RG+        
Sbjct: 215 -EFLQRMGQLFECVLFTASLAKYADPVADLLDRWGVFRARLFRESCVFHRGN-------- 265

Query: 796 ERVPKSKDLEGVLGME-SAVVIIDDS 820
                 KDL   LG E S V+I+D+S
Sbjct: 266 ----YVKDLSR-LGRELSKVIIVDNS 286


>gi|6572954|gb|AAF17482.1|AF189774_1 NLI-interacting factor isoform T2 [Gallus gallus]
          Length = 293

 Score = 41.2 bits (95), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 44/146 (30%), Positives = 64/146 (43%), Gaps = 33/146 (22%)

Query: 681 KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 740
           K C+V+DLD TL++S+      P+ +       E D      ++ + PH+          
Sbjct: 94  KKCVVIDLDETLVHSS----FKPISNADFIVPVEIDGTIHQVYVLKRPHVD--------- 140

Query: 741 WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA-----GRVISRGDDGDPFDGD 795
             FL+R  +LFE  L+T     YA  +A +LD  GV  A       V  RG+        
Sbjct: 141 -EFLQRMGELFECVLFTASLAKYADPVADLLDRWGVFRARLFRESCVFHRGN-------- 191

Query: 796 ERVPKSKDLEGVLGME-SAVVIIDDS 820
                 KDL   LG E S V+I+D+S
Sbjct: 192 ----YVKDLSR-LGRELSKVIIVDNS 212


>gi|326921452|ref|XP_003206973.1| PREDICTED: CTD small phosphatase-like protein-like [Meleagris
           gallopavo]
          Length = 275

 Score = 41.2 bits (95), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 44/146 (30%), Positives = 64/146 (43%), Gaps = 33/146 (22%)

Query: 681 KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 740
           K C+V+DLD TL++S+      P+ +       E D      ++ + PH+          
Sbjct: 105 KKCVVIDLDETLVHSS----FKPISNADFIVPVEIDGTIHQVYVLKRPHVD--------- 151

Query: 741 WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA-----GRVISRGDDGDPFDGD 795
             FL+R  +LFE  L+T     YA  +A +LD  GV  A       V  RG+        
Sbjct: 152 -EFLQRMGELFECVLFTASLAKYADPVADLLDRWGVFRARLFRESCVFHRGN-------- 202

Query: 796 ERVPKSKDLEGVLGME-SAVVIIDDS 820
                 KDL   LG E S V+I+D+S
Sbjct: 203 ----YVKDLSR-LGRELSKVIIVDNS 223


>gi|47604964|ref|NP_001001316.1| CTD small phosphatase-like protein [Gallus gallus]
 gi|224044589|ref|XP_002196491.1| PREDICTED: uncharacterized protein LOC100232268 isoform 1
           [Taeniopygia guttata]
 gi|17865514|sp|Q9PTJ6.2|CTDSL_CHICK RecName: Full=CTD small phosphatase-like protein; Short=CTDSP-like;
           AltName: Full=Nuclear LIM interactor-interacting factor
           1; Short=NLI-interacting factor 1; AltName: Full=Small
           C-terminal domain phosphatase 3
 gi|6572952|gb|AAF17481.1|AF189773_1 NLI-interacting factor isoform T1 [Gallus gallus]
 gi|449273796|gb|EMC83183.1| CTD small phosphatase-like protein [Columba livia]
          Length = 275

 Score = 41.2 bits (95), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 44/146 (30%), Positives = 64/146 (43%), Gaps = 33/146 (22%)

Query: 681 KLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWTKLRPGI 740
           K C+V+DLD TL++S+      P+ +       E D      ++ + PH+          
Sbjct: 105 KKCVVIDLDETLVHSS----FKPISNADFIVPVEIDGTIHQVYVLKRPHVD--------- 151

Query: 741 WTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFA-----GRVISRGDDGDPFDGD 795
             FL+R  +LFE  L+T     YA  +A +LD  GV  A       V  RG+        
Sbjct: 152 -EFLQRMGELFECVLFTASLAKYADPVADLLDRWGVFRARLFRESCVFHRGN-------- 202

Query: 796 ERVPKSKDLEGVLGME-SAVVIIDDS 820
                 KDL   LG E S V+I+D+S
Sbjct: 203 ----YVKDLSR-LGRELSKVIIVDNS 223


>gi|384484378|gb|EIE76558.1| hypothetical protein RO3G_01262 [Rhizopus delemar RA 99-880]
          Length = 348

 Score = 41.2 bits (95), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 50/180 (27%), Positives = 77/180 (42%), Gaps = 39/180 (21%)

Query: 675 KMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPHRHLFRFPHMGMWT 734
           ++   RKL LVLDLD TL        V  V +E  R   E D  K    +      G   
Sbjct: 108 ELVRHRKLPLVLDLDDTL--------VRLVGNENGRFVSESDIPKCKDRVAVLKD-GKRV 158

Query: 735 KLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDP-----KGVLFAGR-----VIS 784
            L   +  FLE A +L+++ + ++G++ Y   +  VLDP     KG+L++ R     + S
Sbjct: 159 VLTERVREFLEWAQQLYDISICSLGDQNYVDSVIDVLDPTRSWVKGILYSARAEHDYIRS 218

Query: 785 RGDDGDPFDGDERVPKSKDLEGVLGM-----------ESAVVIIDDSVRVWPHNKLNLIV 833
             D G P          KDL+ +               S  +I+DD  R+WP  + + I+
Sbjct: 219 SPDPGRP---------PKDLQALYSFCALRDQTLGSGFSLPLILDDETRMWPAEQHDNII 269


>gi|432100877|gb|ELK29230.1| CTD small phosphatase-like protein [Myotis davidii]
          Length = 280

 Score = 40.8 bits (94), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 48/172 (27%), Positives = 72/172 (41%), Gaps = 33/172 (19%)

Query: 655 DDQQKAAIQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEE 714
           D +Q   I     + L  +  +    K C+V+DLD TL++S+      P+ +       E
Sbjct: 84  DQRQVIPIPSPPAKYLLPEVTVLDYGKKCVVIDLDETLVHSS----FKPISNADFIVPVE 139

Query: 715 QDREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPK 774
            D      ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  
Sbjct: 140 IDGTIHQVYVLKRPHVD----------EFLQRMGQLFECVLFTASLAKYADPVADLLDRW 189

Query: 775 GVLFA-----GRVISRGDDGDPFDGDERVPKSKDLEGVLGME-SAVVIIDDS 820
           GV  A       V  RG+              KDL   LG E S V+I+D+S
Sbjct: 190 GVFRARLFRESCVFHRGN------------YVKDLSR-LGRELSKVIIVDNS 228


>gi|296205578|ref|XP_002749828.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
           small phosphatase 1 isoform 1 [Callithrix jacchus]
          Length = 261

 Score = 40.8 bits (94), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 35/124 (28%), Positives = 58/124 (46%), Gaps = 15/124 (12%)

Query: 657 QQKAAIQKER-TRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQ 715
           ++  AI K+   + L  + K   + K+C+V+DLD TL++S+      PV++       E 
Sbjct: 65  EENGAIPKQTPVQYLLPEAKAQDSDKICVVIDLDETLVHSS----FKPVNNADFIIPVEI 120

Query: 716 DREKPHRHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKG 775
           D      ++ + PH+            FL+R  +LFE  L+T     YA  +A +LD  G
Sbjct: 121 DGVVHQVYVLKRPHVD----------EFLQRMGELFECVLFTASLAKYADPVADLLDKWG 170

Query: 776 VLFA 779
              A
Sbjct: 171 AFRA 174


>gi|71003542|ref|XP_756437.1| hypothetical protein UM00290.1 [Ustilago maydis 521]
 gi|46096042|gb|EAK81275.1| hypothetical protein UM00290.1 [Ustilago maydis 521]
          Length = 1206

 Score = 40.8 bits (94), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 16/49 (32%), Positives = 29/49 (59%)

Query: 928 LWQTAEQFGAVCTKHIDDQVTHVVANSLGTDKVNWALSTGRFVVHPGWV 976
           L Q A + GA    ++ + VTH++A+  G++K  +AL  G  ++ P W+
Sbjct: 99  LTQYARELGARVEGNLTEDVTHLIADRPGSEKYRFALQLGMHIIRPDWI 147


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.314    0.131    0.383 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 16,692,420,396
Number of Sequences: 23463169
Number of extensions: 776417305
Number of successful extensions: 1745328
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 685
Number of HSP's successfully gapped in prelim test: 794
Number of HSP's that attempted gapping in prelim test: 1740990
Number of HSP's gapped (non-prelim): 3356
length of query: 995
length of database: 8,064,228,071
effective HSP length: 153
effective length of query: 842
effective length of database: 8,769,330,510
effective search space: 7383776289420
effective search space used: 7383776289420
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 82 (36.2 bits)