BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 033679
         (113 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255570505|ref|XP_002526210.1| RNA polymerase II ctd phosphatase, putative [Ricinus communis]
 gi|223534449|gb|EEF36151.1| RNA polymerase II ctd phosphatase, putative [Ricinus communis]
          Length = 478

 Score = 86.3 bits (212), Expect = 2e-15,   Method: Composition-based stats.
 Identities = 37/58 (63%), Positives = 47/58 (81%)

Query: 17  QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEK 74
           +++VLKGCK+VFS  FP++F A  H+LWK+ EQLGATCS E+DPSVTHVVS +   EK
Sbjct: 379 RKDVLKGCKIVFSRVFPTQFQADNHHLWKMAEQLGATCSREVDPSVTHVVSAEAGTEK 436


>gi|296090640|emb|CBI41034.3| unnamed protein product [Vitis vinifera]
          Length = 264

 Score = 85.5 bits (210), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 37/58 (63%), Positives = 46/58 (79%)

Query: 17  QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEK 74
           ++EVLKGCK+VFS  FP++F A  H+LW++ EQLGATC+ ELDPSVTHVVS     EK
Sbjct: 167 RKEVLKGCKIVFSRVFPTRFQAENHHLWRMAEQLGATCATELDPSVTHVVSTDAGTEK 224


>gi|359494894|ref|XP_003634864.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
           4-like [Vitis vinifera]
          Length = 278

 Score = 85.1 bits (209), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 37/58 (63%), Positives = 46/58 (79%)

Query: 17  QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEK 74
           ++EVLKGCK+VFS  FP++F A  H+LW++ EQLGATC+ ELDPSVTHVVS     EK
Sbjct: 181 RKEVLKGCKIVFSRVFPTRFQAENHHLWRMAEQLGATCATELDPSVTHVVSTDAGTEK 238


>gi|147774299|emb|CAN76945.1| hypothetical protein VITISV_002430 [Vitis vinifera]
          Length = 641

 Score = 84.7 bits (208), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 36/58 (62%), Positives = 46/58 (79%)

Query: 17  QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEK 74
           +++VLKGCK+VFS  FP++F A  H+LW++ EQLGATC+ ELDPSVTHVVS     EK
Sbjct: 167 RKDVLKGCKIVFSRVFPTRFQAENHHLWRMAEQLGATCATELDPSVTHVVSTDAGTEK 224


>gi|296088193|emb|CBI35709.3| unnamed protein product [Vitis vinifera]
          Length = 638

 Score = 84.7 bits (208), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 36/58 (62%), Positives = 46/58 (79%)

Query: 17  QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEK 74
           +++VLKGCK+VFS  FP++F A  H+LW++ EQLGATC+ ELDPSVTHVVS     EK
Sbjct: 167 RKDVLKGCKIVFSRVFPTRFQAENHHLWRMAEQLGATCATELDPSVTHVVSTDAGTEK 224


>gi|359497210|ref|XP_003635453.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
           4-like [Vitis vinifera]
          Length = 278

 Score = 84.0 bits (206), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 36/58 (62%), Positives = 46/58 (79%)

Query: 17  QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEK 74
           +++VLKGCK+VFS  FP++F A  H+LW++ EQLGATC+ ELDPSVTHVVS     EK
Sbjct: 181 RKDVLKGCKIVFSRVFPTRFQAENHHLWRMAEQLGATCATELDPSVTHVVSTDAGTEK 238


>gi|224142399|ref|XP_002324546.1| predicted protein [Populus trichocarpa]
 gi|222865980|gb|EEF03111.1| predicted protein [Populus trichocarpa]
          Length = 312

 Score = 83.6 bits (205), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 38/67 (56%), Positives = 48/67 (71%)

Query: 17  QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVS 76
           +++VLKGCK+VFS  FP++  A  H+LW++ EQLGATCS ELDPSVTHVVS     EK  
Sbjct: 221 RKDVLKGCKIVFSRVFPTQSQADNHHLWRMAEQLGATCSTELDPSVTHVVSKDSGTEKSH 280

Query: 77  LGSKGGQ 83
             SK  +
Sbjct: 281 WASKHNK 287


>gi|449532013|ref|XP_004172979.1| PREDICTED: LOW QUALITY PROTEIN: RNA polymerase II C-terminal domain
           phosphatase-like 4-like, partial [Cucumis sativus]
          Length = 340

 Score = 82.0 bits (201), Expect = 4e-14,   Method: Composition-based stats.
 Identities = 37/56 (66%), Positives = 42/56 (75%)

Query: 19  EVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEK 74
           EVL+GCK+VFS  FP+KF A  H LWK+VEQLG TCS ELD SVTHVV+     EK
Sbjct: 240 EVLEGCKVVFSRVFPTKFQAENHQLWKMVEQLGGTCSTELDQSVTHVVATDAGTEK 295


>gi|224142401|ref|XP_002324547.1| predicted protein [Populus trichocarpa]
 gi|222865981|gb|EEF03112.1| predicted protein [Populus trichocarpa]
          Length = 266

 Score = 81.6 bits (200), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 36/58 (62%), Positives = 45/58 (77%)

Query: 17  QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEK 74
           +++VLKGCK+VFS  FP++  A  H+LW++ EQLGATCS ELDPSVTHVVS     EK
Sbjct: 170 RKDVLKGCKIVFSRVFPTQSQADNHHLWRMAEQLGATCSTELDPSVTHVVSKDSGTEK 227


>gi|449447765|ref|XP_004141638.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
           4-like [Cucumis sativus]
          Length = 452

 Score = 80.9 bits (198), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 37/56 (66%), Positives = 42/56 (75%)

Query: 19  EVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEK 74
           EVL+GCK+VFS  FP+KF A  H LWK+VEQLG TCS ELD SVTHVV+     EK
Sbjct: 352 EVLEGCKVVFSRVFPTKFQAENHQLWKMVEQLGGTCSTELDQSVTHVVATDAGTEK 407


>gi|145334837|ref|NP_001078764.1| RNA polymerase II C-terminal domain phosphatase-like 4 [Arabidopsis
           thaliana]
 gi|122154038|sp|Q00IB6.1|CPL4_ARATH RecName: Full=RNA polymerase II C-terminal domain phosphatase-like
           4; Short=FCP-like 4; AltName: Full=Carboxyl-terminal
           phosphatase-like 4; Short=AtCPL4; Short=CTD
           phosphatase-like 4
 gi|95115186|gb|ABF55959.1| carboxyl-terminal phosphatase-like 4 [Arabidopsis thaliana]
 gi|332009601|gb|AED96984.1| RNA polymerase II C-terminal domain phosphatase-like 4 [Arabidopsis
           thaliana]
          Length = 440

 Score = 74.7 bits (182), Expect = 7e-12,   Method: Composition-based stats.
 Identities = 32/58 (55%), Positives = 42/58 (72%)

Query: 17  QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEK 74
           ++E+LKGCK+VFS  FP+K     H LWK+ E+LGATC+ E+D SVTHVV+     EK
Sbjct: 338 RKEILKGCKIVFSRVFPTKAKPEDHPLWKMAEELGATCATEVDASVTHVVAMDVGTEK 395


>gi|297793317|ref|XP_002864543.1| hypothetical protein ARALYDRAFT_332090 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297310378|gb|EFH40802.1| hypothetical protein ARALYDRAFT_332090 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 1006

 Score = 73.2 bits (178), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 33/58 (56%), Positives = 42/58 (72%)

Query: 17  QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEK 74
           ++EVLKGCK+VFS  FP+K     H LWK+ E+LGATC+ E+D SVTHVV+     EK
Sbjct: 903 RKEVLKGCKVVFSRVFPTKAKPEDHPLWKMAEELGATCATEVDASVTHVVAMDVGTEK 960


>gi|9758369|dbj|BAB08870.1| unnamed protein product [Arabidopsis thaliana]
          Length = 1065

 Score = 73.2 bits (178), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 32/58 (55%), Positives = 42/58 (72%)

Query: 17   QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEK 74
            ++E+LKGCK+VFS  FP+K     H LWK+ E+LGATC+ E+D SVTHVV+     EK
Sbjct: 963  RKEILKGCKIVFSRVFPTKAKPEDHPLWKMAEELGATCATEVDASVTHVVAMDVGTEK 1020


>gi|326518250|dbj|BAK07377.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 488

 Score = 71.6 bits (174), Expect = 5e-11,   Method: Composition-based stats.
 Identities = 32/58 (55%), Positives = 40/58 (68%)

Query: 17  QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEK 74
           ++EVL+GCKLVFS  FPS   +    +WK+ EQLGA C  E+DPSVTHVV+     EK
Sbjct: 378 RQEVLQGCKLVFSRVFPSDCRSQDQIMWKMAEQLGAVCCSEVDPSVTHVVAVHAGTEK 435


>gi|224053553|ref|XP_002297869.1| predicted protein [Populus trichocarpa]
 gi|222845127|gb|EEE82674.1| predicted protein [Populus trichocarpa]
          Length = 1117

 Score = 70.1 bits (170), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 33/92 (35%), Positives = 52/92 (56%), Gaps = 1/92 (1%)

Query: 17   QREVLKGCKLVFSHAFP-SKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKV 75
            QR++L GC+++FS  FP  +   H+H LW++ EQ GA C+ ++D  VTHVV+N    +KV
Sbjct: 1023 QRKILGGCRILFSRVFPVGEVNPHLHPLWQMAEQFGAVCTNQIDEQVTHVVANSLGTDKV 1082

Query: 76   SLGSKGGQVFGGSTVDRGSQLFVARATRREVS 107
            +     G++         S L   RA  ++ S
Sbjct: 1083 NWALSTGRIVVHPGWVEASALLYRRANEQDFS 1114


>gi|449487451|ref|XP_004157633.1| PREDICTED: LOW QUALITY PROTEIN: RNA polymerase II C-terminal domain
            phosphatase-like 3-like [Cucumis sativus]
          Length = 1249

 Score = 67.8 bits (164), Expect = 8e-10,   Method: Composition-based stats.
 Identities = 33/92 (35%), Positives = 51/92 (55%), Gaps = 1/92 (1%)

Query: 17   QREVLKGCKLVFSHAFP-SKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKV 75
            Q+++L GC++VFS  FP  +   H+H LW+  EQ GA C+ ++D  VTHVV+N    +KV
Sbjct: 1155 QQKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAQCTNQIDEQVTHVVANSLGTDKV 1214

Query: 76   SLGSKGGQVFGGSTVDRGSQLFVARATRREVS 107
            +     G+          S L   RAT ++ +
Sbjct: 1215 NWALSTGRFVVHPGWVEASALLYRRATEQDFA 1246


>gi|449445782|ref|XP_004140651.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            3-like [Cucumis sativus]
          Length = 1249

 Score = 67.8 bits (164), Expect = 8e-10,   Method: Composition-based stats.
 Identities = 33/92 (35%), Positives = 51/92 (55%), Gaps = 1/92 (1%)

Query: 17   QREVLKGCKLVFSHAFP-SKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKV 75
            Q+++L GC++VFS  FP  +   H+H LW+  EQ GA C+ ++D  VTHVV+N    +KV
Sbjct: 1155 QQKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAQCTNQIDEQVTHVVANSLGTDKV 1214

Query: 76   SLGSKGGQVFGGSTVDRGSQLFVARATRREVS 107
            +     G+          S L   RAT ++ +
Sbjct: 1215 NWALSTGRFVVHPGWVEASALLYRRATEQDFA 1246


>gi|255543174|ref|XP_002512650.1| RNA polymerase II ctd phosphatase, putative [Ricinus communis]
 gi|223548611|gb|EEF50102.1| RNA polymerase II ctd phosphatase, putative [Ricinus communis]
          Length = 1195

 Score = 67.8 bits (164), Expect = 9e-10,   Method: Composition-based stats.
 Identities = 33/92 (35%), Positives = 50/92 (54%), Gaps = 1/92 (1%)

Query: 17   QREVLKGCKLVFSHAFP-SKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKV 75
            QR++L GC++VFS  FP  +   H+H LW+  EQ GA C+ ++D  VTHVV+N    +KV
Sbjct: 1101 QRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTNQIDEQVTHVVANSLGTDKV 1160

Query: 76   SLGSKGGQVFGGSTVDRGSQLFVARATRREVS 107
            +     G+          S L   RA  ++ +
Sbjct: 1161 NWALSTGRFVVYPGWVEASALLYRRANEQDFA 1192


>gi|357478637|ref|XP_003609604.1| RNA polymerase II C-terminal domain phosphatase-like protein
            [Medicago truncatula]
 gi|355510659|gb|AES91801.1| RNA polymerase II C-terminal domain phosphatase-like protein
            [Medicago truncatula]
          Length = 1064

 Score = 67.4 bits (163), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 30/68 (44%), Positives = 45/68 (66%), Gaps = 1/68 (1%)

Query: 17   QREVLKGCKLVFSHAFP-SKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKV 75
            QR++L GC++VFS  FP  +   H+H LW+  EQ GA+C+ ++DP VTHVV+     +KV
Sbjct: 961  QRKILGGCRIVFSGVFPVGETNPHLHPLWRTAEQFGASCTNKVDPQVTHVVAQSPGTDKV 1020

Query: 76   SLGSKGGQ 83
            + G   G+
Sbjct: 1021 NWGISNGK 1028


>gi|356523718|ref|XP_003530482.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            3-like [Glycine max]
          Length = 1244

 Score = 67.0 bits (162), Expect = 1e-09,   Method: Composition-based stats.
 Identities = 33/92 (35%), Positives = 50/92 (54%), Gaps = 1/92 (1%)

Query: 17   QREVLKGCKLVFSHAFP-SKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKV 75
            QR++L GC++VFS  FP  +   H+H LW+  EQ GA C+ ++D  VTHVV+N    +KV
Sbjct: 1150 QRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTNQIDEQVTHVVANSPGTDKV 1209

Query: 76   SLGSKGGQVFGGSTVDRGSQLFVARATRREVS 107
            +     G+          S L   RA  ++ +
Sbjct: 1210 NWALNNGRFVVHPGWVEASALLYRRANEQDFA 1241


>gi|356567192|ref|XP_003551805.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            3-like [Glycine max]
          Length = 1221

 Score = 66.6 bits (161), Expect = 2e-09,   Method: Composition-based stats.
 Identities = 33/92 (35%), Positives = 50/92 (54%), Gaps = 1/92 (1%)

Query: 17   QREVLKGCKLVFSHAFP-SKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKV 75
            QR++L GC++VFS  FP  +   H+H LW+  EQ GA C+ ++D  VTHVV+N    +KV
Sbjct: 1127 QRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAFCTNQIDEQVTHVVANSPGTDKV 1186

Query: 76   SLGSKGGQVFGGSTVDRGSQLFVARATRREVS 107
            +     G+          S L   RA  ++ +
Sbjct: 1187 NWALNNGRFVVHPGWVEASALLYRRANEQDFA 1218


>gi|357156660|ref|XP_003577532.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            3-like [Brachypodium distachyon]
          Length = 1259

 Score = 66.6 bits (161), Expect = 2e-09,   Method: Composition-based stats.
 Identities = 33/92 (35%), Positives = 50/92 (54%), Gaps = 1/92 (1%)

Query: 17   QREVLKGCKLVFSHAFP-SKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKV 75
            QR +L GC++VFS  FP  +   H+H LW+  EQ GA C+ ++D  VTHVV+N    +KV
Sbjct: 1166 QRRILAGCRIVFSRIFPVGEANPHLHPLWQSAEQFGAVCTNQIDDRVTHVVANSLGTDKV 1225

Query: 76   SLGSKGGQVFGGSTVDRGSQLFVARATRREVS 107
            +   + G+          S L   RA+  + +
Sbjct: 1226 NWALQTGRYVVHPGWVEASALLYRRASEHDFA 1257


>gi|56547717|gb|AAV92930.1| putative transcription regulator CPL1 [Solanum lycopersicum]
          Length = 1227

 Score = 66.6 bits (161), Expect = 2e-09,   Method: Composition-based stats.
 Identities = 32/95 (33%), Positives = 51/95 (53%), Gaps = 1/95 (1%)

Query: 17   QREVLKGCKLVFSHAFP-SKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKV 75
            Q+++L GC++VFS  FP  +   H+H LW+  EQ GA C+ ++D  VTHVV+N    +KV
Sbjct: 1133 QKKILAGCRIVFSRVFPVGEASPHLHPLWQTAEQFGAVCTSQIDDQVTHVVANSLGTDKV 1192

Query: 76   SLGSKGGQVFGGSTVDRGSQLFVARATRREVSCEA 110
            +     G+          S L   RA   + + ++
Sbjct: 1193 NWALSTGRSVVHPGWVEASALLYRRANEHDFAIKS 1227


>gi|242093742|ref|XP_002437361.1| hypothetical protein SORBIDRAFT_10g025580 [Sorghum bicolor]
 gi|241915584|gb|EER88728.1| hypothetical protein SORBIDRAFT_10g025580 [Sorghum bicolor]
          Length = 558

 Score = 66.6 bits (161), Expect = 2e-09,   Method: Composition-based stats.
 Identities = 29/62 (46%), Positives = 39/62 (62%)

Query: 17  QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVS 76
           ++E+L+GCK+VFS  FP+        LWK+ E LGA CS ++D SVTHVV+     EK  
Sbjct: 378 RKEILQGCKIVFSRVFPNNTRPQEQMLWKMAEHLGAVCSTDVDSSVTHVVTVDLGTEKAR 437

Query: 77  LG 78
            G
Sbjct: 438 WG 439


>gi|359473774|ref|XP_002266931.2| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            3-like [Vitis vinifera]
          Length = 1238

 Score = 65.9 bits (159), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 32/92 (34%), Positives = 49/92 (53%), Gaps = 1/92 (1%)

Query: 17   QREVLKGCKLVFSHAFP-SKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKV 75
            QR++L GC++VFS  FP  +   H+H LW+  E  GA C+ ++D  VTHVV+N    +KV
Sbjct: 1144 QRKILAGCRIVFSRVFPVGEANPHLHPLWQTAESFGAVCTNQIDEQVTHVVANSLGTDKV 1203

Query: 76   SLGSKGGQVFGGSTVDRGSQLFVARATRREVS 107
            +     G+          S L   RA  ++ +
Sbjct: 1204 NWALSTGRFVVHPGWVEASALLYRRANEQDFA 1235


>gi|296088169|emb|CBI35661.3| unnamed protein product [Vitis vinifera]
          Length = 1184

 Score = 65.9 bits (159), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 32/92 (34%), Positives = 49/92 (53%), Gaps = 1/92 (1%)

Query: 17   QREVLKGCKLVFSHAFP-SKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKV 75
            QR++L GC++VFS  FP  +   H+H LW+  E  GA C+ ++D  VTHVV+N    +KV
Sbjct: 1090 QRKILAGCRIVFSRVFPVGEANPHLHPLWQTAESFGAVCTNQIDEQVTHVVANSLGTDKV 1149

Query: 76   SLGSKGGQVFGGSTVDRGSQLFVARATRREVS 107
            +     G+          S L   RA  ++ +
Sbjct: 1150 NWALSTGRFVVHPGWVEASALLYRRANEQDFA 1181


>gi|224075473|ref|XP_002304648.1| predicted protein [Populus trichocarpa]
 gi|222842080|gb|EEE79627.1| predicted protein [Populus trichocarpa]
          Length = 238

 Score = 65.9 bits (159), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 33/94 (35%), Positives = 51/94 (54%), Gaps = 1/94 (1%)

Query: 17  QREVLKGCKLVFSHAFP-SKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKV 75
           QR++L GC++VFS  FP  +   H+H LW+  EQ GA C+ ++D  VTHVV+N    +KV
Sbjct: 144 QRKILAGCRIVFSRVFPVGEVNPHLHPLWQSAEQFGAVCTNQIDEQVTHVVANSLGTDKV 203

Query: 76  SLGSKGGQVFGGSTVDRGSQLFVARATRREVSCE 109
           +     G+          S L   RA  ++ + +
Sbjct: 204 NWALSTGRFVVHPGWVEASALLYRRANEQDFAIK 237


>gi|357502711|ref|XP_003621644.1| RNA polymerase II C-terminal domain phosphatase-like protein
            [Medicago truncatula]
 gi|355496659|gb|AES77862.1| RNA polymerase II C-terminal domain phosphatase-like protein
            [Medicago truncatula]
          Length = 1213

 Score = 65.5 bits (158), Expect = 4e-09,   Method: Composition-based stats.
 Identities = 32/97 (32%), Positives = 53/97 (54%), Gaps = 1/97 (1%)

Query: 17   QREVLKGCKLVFSHAFP-SKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKV 75
            QR++L GC++VFS  FP      H+H LW+  EQ GA+C+ ++D  VTHVV++    +KV
Sbjct: 1117 QRKILDGCRIVFSRMFPVGDANPHLHPLWQTAEQFGASCTNQIDDQVTHVVAHSPGTDKV 1176

Query: 76   SLGSKGGQVFGGSTVDRGSQLFVARATRREVSCEANQ 112
            +     G+          S L   RA  ++ + + ++
Sbjct: 1177 NWAIANGKFVVHPGWVEASALLYRRANEQDFAIKLDK 1213


>gi|224091747|ref|XP_002309339.1| predicted protein [Populus trichocarpa]
 gi|222855315|gb|EEE92862.1| predicted protein [Populus trichocarpa]
          Length = 204

 Score = 65.5 bits (158), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 27/41 (65%), Positives = 35/41 (85%)

Query: 17  QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIE 57
           +R+VLKGCK+VFS  FP++F A  H+LW++VEQLGATCS E
Sbjct: 119 RRDVLKGCKIVFSRVFPTQFQADNHHLWRMVEQLGATCSTE 159


>gi|30685744|ref|NP_180912.2| RNA polymerase II C-terminal domain phosphatase-like 3 [Arabidopsis
            thaliana]
 gi|238055326|sp|Q8LL04.2|CPL3_ARATH RecName: Full=RNA polymerase II C-terminal domain phosphatase-like 3;
            Short=FCP-like 3; AltName: Full=Carboxyl-terminal
            phosphatase-like 3; Short=AtCPL3; Short=CTD
            phosphatase-like 3
 gi|330253756|gb|AEC08850.1| RNA polymerase II C-terminal domain phosphatase-like 3 [Arabidopsis
            thaliana]
          Length = 1241

 Score = 65.1 bits (157), Expect = 6e-09,   Method: Composition-based stats.
 Identities = 27/61 (44%), Positives = 40/61 (65%), Gaps = 1/61 (1%)

Query: 17   QREVLKGCKLVFSHAFP-SKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKV 75
            QR++L GC++VFS   P  +   H+H LW+  EQ GA C+ ++D  VTHVV+N    +KV
Sbjct: 1147 QRKILAGCRIVFSRIIPVGEAKPHLHPLWQTAEQFGAVCTTQVDEHVTHVVTNSLGTDKV 1206

Query: 76   S 76
            +
Sbjct: 1207 N 1207


>gi|297826809|ref|XP_002881287.1| hypothetical protein ARALYDRAFT_482300 [Arabidopsis lyrata subsp.
            lyrata]
 gi|297327126|gb|EFH57546.1| hypothetical protein ARALYDRAFT_482300 [Arabidopsis lyrata subsp.
            lyrata]
          Length = 1248

 Score = 65.1 bits (157), Expect = 6e-09,   Method: Composition-based stats.
 Identities = 27/61 (44%), Positives = 40/61 (65%), Gaps = 1/61 (1%)

Query: 17   QREVLKGCKLVFSHAFP-SKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKV 75
            QR++L GC++VFS   P  +   H+H LW+  EQ GA C+ ++D  VTHVV+N    +KV
Sbjct: 1154 QRKILAGCRIVFSRIIPVGEAKPHLHPLWQTAEQFGAVCTTQVDEHVTHVVTNSLGTDKV 1213

Query: 76   S 76
            +
Sbjct: 1214 N 1214


>gi|22212705|gb|AAM94371.1|AF486633_1 CTD phosphatase-like 3 [Arabidopsis thaliana]
          Length = 1241

 Score = 65.1 bits (157), Expect = 6e-09,   Method: Composition-based stats.
 Identities = 27/61 (44%), Positives = 40/61 (65%), Gaps = 1/61 (1%)

Query: 17   QREVLKGCKLVFSHAFP-SKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKV 75
            QR++L GC++VFS   P  +   H+H LW+  EQ GA C+ ++D  VTHVV+N    +KV
Sbjct: 1147 QRKILAGCRIVFSRIIPVGEAKPHLHPLWQTAEQFGAVCTTQVDEHVTHVVTNSLGTDKV 1206

Query: 76   S 76
            +
Sbjct: 1207 N 1207


>gi|413945235|gb|AFW77884.1| CPL3 [Zea mays]
          Length = 533

 Score = 64.7 bits (156), Expect = 6e-09,   Method: Composition-based stats.
 Identities = 28/62 (45%), Positives = 39/62 (62%)

Query: 17  QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVS 76
           ++E+L+GCK+VFS  FP+        +WK+ E LGA C  ++DPSVTHVV+     EK  
Sbjct: 376 RKEILQGCKIVFSRVFPNNTRPQEQMVWKMAEYLGAVCVKDVDPSVTHVVTVDLGTEKAR 435

Query: 77  LG 78
            G
Sbjct: 436 WG 437


>gi|242087817|ref|XP_002439741.1| hypothetical protein SORBIDRAFT_09g019310 [Sorghum bicolor]
 gi|241945026|gb|EES18171.1| hypothetical protein SORBIDRAFT_09g019310 [Sorghum bicolor]
          Length = 547

 Score = 64.7 bits (156), Expect = 7e-09,   Method: Composition-based stats.
 Identities = 28/62 (45%), Positives = 39/62 (62%)

Query: 17  QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVS 76
           ++E+L+GCK+VFS  FP+        +WK+ E LGA CS ++D SVTHVV+     EK  
Sbjct: 381 RKEILQGCKIVFSRVFPNNTRPQKQMVWKMAEYLGAVCSTDVDSSVTHVVTVDLGTEKAR 440

Query: 77  LG 78
            G
Sbjct: 441 WG 442


>gi|242068555|ref|XP_002449554.1| hypothetical protein SORBIDRAFT_05g019010 [Sorghum bicolor]
 gi|241935397|gb|EES08542.1| hypothetical protein SORBIDRAFT_05g019010 [Sorghum bicolor]
          Length = 1197

 Score = 64.3 bits (155), Expect = 8e-09,   Method: Composition-based stats.
 Identities = 37/93 (39%), Positives = 50/93 (53%), Gaps = 3/93 (3%)

Query: 17   QREVLKGCKLVFSHAFP-SKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKV 75
            QR +L GC++VFS  FP      H+H LW+  EQ GA C+  +D  VTHVV+N    +KV
Sbjct: 1104 QRRILAGCRIVFSRVFPVGDASPHLHPLWQTAEQFGAVCTNLVDDRVTHVVANSPGTDKV 1163

Query: 76   SLG-SKGGQVFGGSTVDRGSQLFVARATRREVS 107
            +   SKG  V     V+  S L   RA   + +
Sbjct: 1164 NWALSKGKFVVHPGWVE-ASALLYRRANEHDFA 1195


>gi|226497696|ref|NP_001152445.1| CPL3 [Zea mays]
 gi|195656359|gb|ACG47647.1| CPL3 [Zea mays]
          Length = 531

 Score = 64.3 bits (155), Expect = 1e-08,   Method: Composition-based stats.
 Identities = 28/62 (45%), Positives = 39/62 (62%)

Query: 17  QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVS 76
           ++E+L+GCK+VFS  FP+        +WK+ E LGA C  ++DPSVTHVV+     EK  
Sbjct: 374 RKEILQGCKIVFSRVFPNNTRPQEQMVWKMAEYLGAVCVKDVDPSVTHVVTVDLGTEKSR 433

Query: 77  LG 78
            G
Sbjct: 434 WG 435


>gi|413920930|gb|AFW60862.1| hypothetical protein ZEAMMB73_799152, partial [Zea mays]
          Length = 1234

 Score = 63.9 bits (154), Expect = 1e-08,   Method: Composition-based stats.
 Identities = 36/93 (38%), Positives = 50/93 (53%), Gaps = 3/93 (3%)

Query: 17   QREVLKGCKLVFSHAFP-SKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKV 75
            QR +L GC++VFS  FP      H+H LW+  EQ GA C+  +D  VTH+V+N    +KV
Sbjct: 1143 QRRILTGCRIVFSRVFPVGDASPHLHPLWQTAEQFGAVCTNLVDDRVTHIVANSPGTDKV 1202

Query: 76   SLG-SKGGQVFGGSTVDRGSQLFVARATRREVS 107
            +   SKG  V     V+  S L   RA   + +
Sbjct: 1203 NWALSKGKFVVHPGWVE-ASALLYRRANEHDFA 1234


>gi|326532556|dbj|BAK05207.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 891

 Score = 63.9 bits (154), Expect = 1e-08,   Method: Composition-based stats.
 Identities = 32/92 (34%), Positives = 48/92 (52%), Gaps = 1/92 (1%)

Query: 17  QREVLKGCKLVFSHAFP-SKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKV 75
           QR +L GC++VFS  FP  +    +H LW+  EQ GA C+ ++D  VTHVV+N    +KV
Sbjct: 798 QRRILAGCRIVFSRIFPVGEANPQLHPLWQTAEQFGAVCTNQIDDRVTHVVANSLGTDKV 857

Query: 76  SLGSKGGQVFGGSTVDRGSQLFVARATRREVS 107
           +   + G+          S L   RA   + +
Sbjct: 858 NWALQTGRFVVHPGWVEASALLYRRANEHDFA 889


>gi|77551160|gb|ABA93957.1| NLI interacting factor-like phosphatase family protein, expressed
            [Oryza sativa Japonica Group]
          Length = 1272

 Score = 63.9 bits (154), Expect = 1e-08,   Method: Composition-based stats.
 Identities = 32/87 (36%), Positives = 47/87 (54%), Gaps = 1/87 (1%)

Query: 17   QREVLKGCKLVFSHAFP-SKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKV 75
            Q+ +L GC++VFS  FP  +   H+H LW+  EQ GA C+ ++D  VTHVV+N    +KV
Sbjct: 1179 QQRILGGCRIVFSRIFPVGEANPHMHPLWQTAEQFGAVCTNQIDDRVTHVVANSLGTDKV 1238

Query: 76   SLGSKGGQVFGGSTVDRGSQLFVARAT 102
            +     G+          S L   RA+
Sbjct: 1239 NWALSTGRFVVHPGWVEASALLYRRAS 1265


>gi|222616055|gb|EEE52187.1| hypothetical protein OsJ_34058 [Oryza sativa Japonica Group]
          Length = 1267

 Score = 63.9 bits (154), Expect = 1e-08,   Method: Composition-based stats.
 Identities = 32/87 (36%), Positives = 47/87 (54%), Gaps = 1/87 (1%)

Query: 17   QREVLKGCKLVFSHAFP-SKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKV 75
            Q+ +L GC++VFS  FP  +   H+H LW+  EQ GA C+ ++D  VTHVV+N    +KV
Sbjct: 1174 QQRILGGCRIVFSRIFPVGEANPHMHPLWQTAEQFGAVCTNQIDDRVTHVVANSLGTDKV 1233

Query: 76   SLGSKGGQVFGGSTVDRGSQLFVARAT 102
            +     G+          S L   RA+
Sbjct: 1234 NWALSTGRFVVHPGWVEASALLYRRAS 1260


>gi|218185830|gb|EEC68257.1| hypothetical protein OsI_36281 [Oryza sativa Indica Group]
          Length = 1255

 Score = 63.9 bits (154), Expect = 1e-08,   Method: Composition-based stats.
 Identities = 32/87 (36%), Positives = 47/87 (54%), Gaps = 1/87 (1%)

Query: 17   QREVLKGCKLVFSHAFP-SKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKV 75
            Q+ +L GC++VFS  FP  +   H+H LW+  EQ GA C+ ++D  VTHVV+N    +KV
Sbjct: 1162 QQRILGGCRIVFSRIFPVGEANPHMHPLWQTAEQFGAVCTNQIDDRVTHVVANSLGTDKV 1221

Query: 76   SLGSKGGQVFGGSTVDRGSQLFVARAT 102
            +     G+          S L   RA+
Sbjct: 1222 NWALSTGRFVVHPGWVEASALLYRRAS 1248


>gi|115485681|ref|NP_001067984.1| Os11g0521900 [Oryza sativa Japonica Group]
 gi|113645206|dbj|BAF28347.1| Os11g0521900 [Oryza sativa Japonica Group]
          Length = 664

 Score = 63.9 bits (154), Expect = 1e-08,   Method: Composition-based stats.
 Identities = 32/87 (36%), Positives = 47/87 (54%), Gaps = 1/87 (1%)

Query: 17  QREVLKGCKLVFSHAFP-SKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKV 75
           Q+ +L GC++VFS  FP  +   H+H LW+  EQ GA C+ ++D  VTHVV+N    +KV
Sbjct: 571 QQRILGGCRIVFSRIFPVGEANPHMHPLWQTAEQFGAVCTNQIDDRVTHVVANSLGTDKV 630

Query: 76  SLGSKGGQVFGGSTVDRGSQLFVARAT 102
           +     G+          S L   RA+
Sbjct: 631 NWALSTGRFVVHPGWVEASALLYRRAS 657


>gi|357129281|ref|XP_003566293.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
           4-like [Brachypodium distachyon]
          Length = 492

 Score = 63.2 bits (152), Expect = 2e-08,   Method: Composition-based stats.
 Identities = 28/58 (48%), Positives = 38/58 (65%)

Query: 17  QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEK 74
           ++EVL+GCKLVFS  FPS        +WK+ E+LGA+C   +D +VTHVV+     EK
Sbjct: 377 RQEVLQGCKLVFSRVFPSNSCPQDQIIWKMAEKLGASCCAHVDSTVTHVVAVDVGTEK 434


>gi|168018017|ref|XP_001761543.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162687227|gb|EDQ73611.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 1984

 Score = 62.4 bits (150), Expect = 3e-08,   Method: Composition-based stats.
 Identities = 27/68 (39%), Positives = 43/68 (63%), Gaps = 1/68 (1%)

Query: 17   QREVLKGCKLVFSHAFP-SKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKV 75
            QR VL GC+++FS  FP  +   H+H LW++ EQ GA+C + ++  VTHVV+     +KV
Sbjct: 1723 QRRVLDGCRVLFSRIFPVGEANPHLHPLWRLAEQFGASCCLHINDKVTHVVAISLGTDKV 1782

Query: 76   SLGSKGGQ 83
            +  +  G+
Sbjct: 1783 NWAAATGR 1790


>gi|357163276|ref|XP_003579679.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
           4-like [Brachypodium distachyon]
          Length = 493

 Score = 62.4 bits (150), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 28/58 (48%), Positives = 38/58 (65%)

Query: 17  QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEK 74
           ++EVL+GCK+VFS  FPS        +WK+ EQLGA C  ++D +VTHVV+     EK
Sbjct: 377 RQEVLQGCKVVFSRVFPSSSRPQDQIIWKMAEQLGAICCADMDSTVTHVVAVDSGTEK 434


>gi|218196729|gb|EEC79156.1| hypothetical protein OsI_19829 [Oryza sativa Indica Group]
          Length = 574

 Score = 62.0 bits (149), Expect = 4e-08,   Method: Composition-based stats.
 Identities = 28/58 (48%), Positives = 37/58 (63%)

Query: 17  QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEK 74
           ++EVL+GCKLVF+  FP         LWK+ EQLGA C  ++D +VTHVV+     EK
Sbjct: 410 RQEVLQGCKLVFTRVFPLHQRQQDQMLWKMAEQLGAVCCTDVDSTVTHVVALDLGTEK 467


>gi|218196728|gb|EEC79155.1| hypothetical protein OsI_19828 [Oryza sativa Indica Group]
          Length = 430

 Score = 61.6 bits (148), Expect = 5e-08,   Method: Composition-based stats.
 Identities = 28/58 (48%), Positives = 37/58 (63%)

Query: 17  QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEK 74
           ++EVL+GCKLVF+  FP         LWK+ EQLGA C  ++D +VTHVV+     EK
Sbjct: 167 RQEVLQGCKLVFTRVFPLHQRPQDQMLWKMAEQLGAVCCTDVDSTVTHVVALDLGTEK 224


>gi|168040198|ref|XP_001772582.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162676137|gb|EDQ62624.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 1881

 Score = 61.2 bits (147), Expect = 7e-08,   Method: Composition-based stats.
 Identities = 27/68 (39%), Positives = 43/68 (63%), Gaps = 1/68 (1%)

Query: 17   QREVLKGCKLVFSHAFP-SKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKV 75
            QR VL GC+++FS  FP  +   H+H LW++ EQ GA+C + ++  VTHVV+     +KV
Sbjct: 1769 QRRVLDGCRVLFSRIFPVGEANPHLHPLWRLAEQFGASCCLYINDKVTHVVAISLGTDKV 1828

Query: 76   SLGSKGGQ 83
            +  +  G+
Sbjct: 1829 NWATATGR 1836


>gi|115463681|ref|NP_001055440.1| Os05g0390500 [Oryza sativa Japonica Group]
 gi|57863785|gb|AAS86390.2| unknown protein [Oryza sativa Japonica Group]
 gi|113578991|dbj|BAF17354.1| Os05g0390500 [Oryza sativa Japonica Group]
 gi|215695102|dbj|BAG90293.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222631469|gb|EEE63601.1| hypothetical protein OsJ_18418 [Oryza sativa Japonica Group]
          Length = 536

 Score = 60.8 bits (146), Expect = 8e-08,   Method: Composition-based stats.
 Identities = 27/58 (46%), Positives = 37/58 (63%)

Query: 17  QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEK 74
           ++EVL+GCKLVF+  FP         +WK+ EQLGA C  ++D +VTHVV+     EK
Sbjct: 384 RQEVLQGCKLVFTRVFPLHQRQQDQMIWKMAEQLGAVCCTDVDSTVTHVVALDLGTEK 441


>gi|302761896|ref|XP_002964370.1| hypothetical protein SELMODRAFT_405568 [Selaginella moellendorffii]
 gi|300168099|gb|EFJ34703.1| hypothetical protein SELMODRAFT_405568 [Selaginella moellendorffii]
          Length = 766

 Score = 59.7 bits (143), Expect = 2e-07,   Method: Composition-based stats.
 Identities = 29/90 (32%), Positives = 45/90 (50%), Gaps = 1/90 (1%)

Query: 17  QREVLKGCKLVFSHAFP-SKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKV 75
           QR +L GCK++FS  FP  +    +H LW++ EQ GA C+  ++  VTHVV+     +K 
Sbjct: 672 QRRILGGCKIIFSRVFPVEETQPQLHPLWRMAEQFGAVCTTRMEEDVTHVVAISMGTDKS 731

Query: 76  SLGSKGGQVFGGSTVDRGSQLFVARATRRE 105
           +     G+          S +   RA  R+
Sbjct: 732 NWALATGRFLVRPAWVEASTVLYRRANERD 761


>gi|302768485|ref|XP_002967662.1| hypothetical protein SELMODRAFT_440109 [Selaginella moellendorffii]
 gi|300164400|gb|EFJ31009.1| hypothetical protein SELMODRAFT_440109 [Selaginella moellendorffii]
          Length = 762

 Score = 59.7 bits (143), Expect = 2e-07,   Method: Composition-based stats.
 Identities = 29/90 (32%), Positives = 45/90 (50%), Gaps = 1/90 (1%)

Query: 17  QREVLKGCKLVFSHAFP-SKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKV 75
           QR +L GCK++FS  FP  +    +H LW++ EQ GA C+  ++  VTHVV+     +K 
Sbjct: 668 QRRILGGCKIIFSRVFPVEETQPQLHPLWRMAEQFGAVCTTRMEEDVTHVVAISMGTDKS 727

Query: 76  SLGSKGGQVFGGSTVDRGSQLFVARATRRE 105
           +     G+          S +   RA  R+
Sbjct: 728 NWALATGRFLVRPAWVEASTVLYRRANERD 757


>gi|356498756|ref|XP_003518215.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
           4-like [Glycine max]
          Length = 428

 Score = 57.8 bits (138), Expect = 7e-07,   Method: Composition-based stats.
 Identities = 28/58 (48%), Positives = 36/58 (62%), Gaps = 4/58 (6%)

Query: 17  QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEK 74
           +REVL GC ++FS       P+    L K+ EQ+GATC  E+DPSVTHVV+     EK
Sbjct: 337 RREVLSGCVIIFSRIVHGAIPS----LRKMAEQMGATCLTEIDPSVTHVVATDAGTEK 390


>gi|356564913|ref|XP_003550691.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
           4-like [Glycine max]
          Length = 442

 Score = 57.4 bits (137), Expect = 9e-07,   Method: Composition-based stats.
 Identities = 28/58 (48%), Positives = 36/58 (62%), Gaps = 4/58 (6%)

Query: 17  QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEK 74
           +REVL GC ++FS       P+    L K+ EQ+GATC  E+DPSVTHVV+     EK
Sbjct: 351 RREVLSGCVIIFSRIVHGAIPS----LRKMAEQMGATCLTEIDPSVTHVVATDAGTEK 404


>gi|242063380|ref|XP_002452979.1| hypothetical protein SORBIDRAFT_04g035920 [Sorghum bicolor]
 gi|241932810|gb|EES05955.1| hypothetical protein SORBIDRAFT_04g035920 [Sorghum bicolor]
          Length = 518

 Score = 55.5 bits (132), Expect = 5e-06,   Method: Composition-based stats.
 Identities = 24/51 (47%), Positives = 34/51 (66%)

Query: 17  QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVS 67
           + EVL+GC + FS   P +  A  H +WK+ EQLGA C+ + D +VTHVV+
Sbjct: 421 RSEVLRGCTVAFSRVIPLEGVAGDHPMWKLAEQLGAVCTADADATVTHVVA 471


>gi|125541462|gb|EAY87857.1| hypothetical protein OsI_09279 [Oryza sativa Indica Group]
          Length = 390

 Score = 54.7 bits (130), Expect = 7e-06,   Method: Composition-based stats.
 Identities = 29/70 (41%), Positives = 41/70 (58%), Gaps = 7/70 (10%)

Query: 17  QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVS 76
           +REVL+GC + F+ A  S      H +W+  EQLGATC+ ++ P+VTHVV+   +  K  
Sbjct: 300 RREVLRGCTVAFTRAIASD---DHHSVWRRTEQLGATCADDVGPAVTHVVATNPTTFKAV 356

Query: 77  LGSKGGQVFG 86
                 QVFG
Sbjct: 357 W----AQVFG 362


>gi|403161615|ref|XP_003321927.2| hypothetical protein PGTG_03464 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
 gi|375171855|gb|EFP77508.2| hypothetical protein PGTG_03464 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
          Length = 423

 Score = 53.5 bits (127), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 24/62 (38%), Positives = 34/62 (54%)

Query: 17  QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVS 76
           + +VL G  L FS  +P +  +   Y WK+ EQ GA C   L P VTH+++ K    KV+
Sbjct: 101 KHDVLHGLHLAFSSLWPMEAVSEQQYAWKLAEQFGARCYTHLSPKVTHLIAAKLGTSKVN 160

Query: 77  LG 78
           L 
Sbjct: 161 LA 162


>gi|291001899|ref|XP_002683516.1| TFIIF CTD phosphatase Fcp1 [Naegleria gruberi]
 gi|284097145|gb|EFC50772.1| TFIIF CTD phosphatase Fcp1 [Naegleria gruberi]
          Length = 592

 Score = 52.0 bits (123), Expect = 4e-05,   Method: Composition-based stats.
 Identities = 22/59 (37%), Positives = 35/59 (59%)

Query: 17  QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKV 75
           ++++LKG  +VFS   P K     H  WK+   LGA C  ++ P++TH+V+ +   EKV
Sbjct: 427 KKDILKGAHIVFSGVIPLKQQPETHIDWKIATDLGAKCYTDITPNMTHLVARQKGTEKV 485


>gi|307111295|gb|EFN59530.1| hypothetical protein CHLNCDRAFT_138191 [Chlorella variabilis]
          Length = 1156

 Score = 52.0 bits (123), Expect = 5e-05,   Method: Composition-based stats.
 Identities = 23/59 (38%), Positives = 33/59 (55%)

Query: 17  QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKV 75
           +++VL G  LVF+   P +     H LW++ +  GA CS  LD S THV++     EKV
Sbjct: 601 RQKVLAGVHLVFTRVIPLEMEPESHPLWRLAQSFGARCSGSLDASTTHVIAGASGTEKV 659


>gi|440804367|gb|ELR25244.1| FCP1like phosphatase, phosphatase subfamily protein [Acanthamoeba
           castellanii str. Neff]
          Length = 930

 Score = 52.0 bits (123), Expect = 5e-05,   Method: Composition-based stats.
 Identities = 27/76 (35%), Positives = 38/76 (50%)

Query: 9   IFFCTENGQREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSN 68
           I +C    +R VL+G  + FS  FP+        LW++ E+ GA CS    P  TH+V+ 
Sbjct: 601 IKYCLHVQRRRVLEGVHICFSSIFPTGSKPESTPLWRLSEEFGACCSNVFTPETTHLVAL 660

Query: 69  KCSNEKVSLGSKGGQV 84
               EKV L  + G V
Sbjct: 661 NERTEKVKLAHERGGV 676


>gi|125541461|gb|EAY87856.1| hypothetical protein OsI_09278 [Oryza sativa Indica Group]
          Length = 420

 Score = 51.2 bits (121), Expect = 7e-05,   Method: Composition-based stats.
 Identities = 25/56 (44%), Positives = 35/56 (62%), Gaps = 2/56 (3%)

Query: 16  GQREVLKGCKLVFSHAFPSK--FPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNK 69
            +REVL+GC + F+   PS     A  H +W+  EQLGATC+ ++   VTHVV+ K
Sbjct: 322 ARREVLRGCTVAFTGVIPSGDGGRASDHPVWRKAEQLGATCADDVGEGVTHVVAGK 377


>gi|255081919|ref|XP_002508178.1| predicted protein [Micromonas sp. RCC299]
 gi|226523454|gb|ACO69436.1| predicted protein [Micromonas sp. RCC299]
          Length = 318

 Score = 51.2 bits (121), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 23/67 (34%), Positives = 35/67 (52%)

Query: 17  QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVS 76
           +++VL G  LVFS  FP   P H   +W++ EQ GA C  +  P+ +HVV+      K  
Sbjct: 222 KKKVLAGTGLVFSGVFPLDAPPHEQKMWRLAEQFGARCETQPGPNTSHVVAKTWGTGKCQ 281

Query: 77  LGSKGGQ 83
              + G+
Sbjct: 282 WAKENGR 288


>gi|426201370|gb|EKV51293.1| hypothetical protein AGABI2DRAFT_114027 [Agaricus bisporus var.
           bisporus H97]
          Length = 814

 Score = 50.8 bits (120), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 24/68 (35%), Positives = 33/68 (48%)

Query: 17  QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVS 76
           + EVL G  LVFS   P   P      W++    GA C  +L P VTHV++ K   +KV 
Sbjct: 510 RSEVLGGLHLVFSGVIPLDTPPETTEFWRLARMFGAKCHTDLTPDVTHVITAKRGTKKVE 569

Query: 77  LGSKGGQV 84
              + G +
Sbjct: 570 TARQRGGI 577


>gi|409083591|gb|EKM83948.1| hypothetical protein AGABI1DRAFT_124274 [Agaricus bisporus var.
           burnettii JB137-S8]
          Length = 853

 Score = 50.8 bits (120), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 24/68 (35%), Positives = 33/68 (48%)

Query: 17  QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVS 76
           + EVL G  LVFS   P   P      W++    GA C  +L P VTHV++ K   +KV 
Sbjct: 549 RSEVLGGLHLVFSGVIPLDTPPETTEFWRLARMFGAKCHTDLTPDVTHVITAKRGTKKVE 608

Query: 77  LGSKGGQV 84
              + G +
Sbjct: 609 TARQRGGI 616


>gi|357450477|ref|XP_003595515.1| RNA polymerase II C-terminal domain phosphatase-like protein
           [Medicago truncatula]
 gi|355484563|gb|AES65766.1| RNA polymerase II C-terminal domain phosphatase-like protein
           [Medicago truncatula]
          Length = 382

 Score = 50.4 bits (119), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 26/56 (46%), Positives = 35/56 (62%), Gaps = 3/56 (5%)

Query: 19  EVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEK 74
           EVL GC +VFS AF       +  L ++ E+LGATC  EL P+VTH V+N+   E+
Sbjct: 292 EVLSGCIIVFSCAFNGH---DLRKLRRIAERLGATCLTELGPTVTHAVANELVTEE 344


>gi|392570766|gb|EIW63938.1| hypothetical protein TRAVEDRAFT_111329 [Trametes versicolor
           FP-101664 SS1]
          Length = 900

 Score = 50.4 bits (119), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 23/66 (34%), Positives = 31/66 (46%)

Query: 19  EVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVSLG 78
           E L GC ++FS   P         +WK     GA C  EL P +THVV+ K   +KV   
Sbjct: 581 ETLDGCHILFSSVIPLDTRPEATEIWKTAHAFGAKCYTELSPRITHVVAAKRGTQKVDAA 640

Query: 79  SKGGQV 84
            + G +
Sbjct: 641 RRRGGI 646


>gi|427782099|gb|JAA56501.1| Putative rna polymerase ii ctd phosphatase [Rhipicephalus
          pulchellus]
          Length = 360

 Score = 50.1 bits (118), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 26/60 (43%), Positives = 36/60 (60%)

Query: 17 QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVS 76
          +R+VLKG  +VFS   P   PA     W+V + LGAT S +L P VTH+V+ +    KV+
Sbjct: 20 RRKVLKGVHIVFSGVVPMNQPAEKSQAWQVAKSLGATVSRDLCPGVTHLVAARLGTAKVN 79


>gi|302764346|ref|XP_002965594.1| hypothetical protein SELMODRAFT_167775 [Selaginella moellendorffii]
 gi|300166408|gb|EFJ33014.1| hypothetical protein SELMODRAFT_167775 [Selaginella moellendorffii]
          Length = 411

 Score = 50.1 bits (118), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 24/52 (46%), Positives = 32/52 (61%), Gaps = 1/52 (1%)

Query: 17  QREVLKGCKLVFSHAFPSK-FPAHIHYLWKVVEQLGATCSIELDPSVTHVVS 67
           + E+L GCKLVFS  FP+      +  LW++   LGA C +  D SVTHVV+
Sbjct: 288 RSEILSGCKLVFSRIFPTDCLEPELTPLWRLCVDLGAECVLAHDDSVTHVVA 339


>gi|328772741|gb|EGF82779.1| hypothetical protein BATDEDRAFT_22917 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 868

 Score = 49.7 bits (117), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 21/64 (32%), Positives = 32/64 (50%)

Query: 17  QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVS 76
           +R +L+G  ++F+   P       H  W      GA C ++LDP VTHV++ K    KV+
Sbjct: 530 KRSILEGVHILFTSIIPLGLEPQKHEHWIAATSYGAVCHVDLDPEVTHVIAGKTGTAKVN 589

Query: 77  LGSK 80
              K
Sbjct: 590 AARK 593


>gi|302698337|ref|XP_003038847.1| hypothetical protein SCHCODRAFT_255670 [Schizophyllum commune H4-8]
 gi|300112544|gb|EFJ03945.1| hypothetical protein SCHCODRAFT_255670 [Schizophyllum commune H4-8]
          Length = 1207

 Score = 49.7 bits (117), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 22/65 (33%), Positives = 29/65 (44%)

Query: 20  VLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVSLGS 79
           V +GC + FS   P       H  W++    GA C   L P VTHVV+ K    KV    
Sbjct: 892 VFQGCHICFSSVIPLDIQPESHECWRIANMFGARCHATLAPEVTHVVAGKQGTAKVDEAR 951

Query: 80  KGGQV 84
           + G +
Sbjct: 952 RRGNI 956


>gi|47497024|dbj|BAD19077.1| phosphatase-like [Oryza sativa Japonica Group]
 gi|47497233|dbj|BAD19278.1| phosphatase-like [Oryza sativa Japonica Group]
 gi|125584004|gb|EAZ24935.1| hypothetical protein OsJ_08715 [Oryza sativa Japonica Group]
          Length = 420

 Score = 49.3 bits (116), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 24/56 (42%), Positives = 34/56 (60%), Gaps = 2/56 (3%)

Query: 16  GQREVLKGCKLVFSHAFPSK--FPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNK 69
            +REVL+GC + F+   PS     A  H +W+  EQLGATC+ ++   VTH V+ K
Sbjct: 322 ARREVLRGCTVAFTGVIPSGDGGRASDHPVWRRAEQLGATCADDVGEGVTHFVAGK 377


>gi|409051930|gb|EKM61406.1| hypothetical protein PHACADRAFT_204575 [Phanerochaete carnosa
           HHB-10118-sp]
          Length = 863

 Score = 48.9 bits (115), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 22/68 (32%), Positives = 33/68 (48%)

Query: 17  QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVS 76
           ++  L GC +VFS   P    A     W++    GA C  EL+P +TH+++ K    KV 
Sbjct: 560 RQNALAGCHVVFSSVIPLDTRAETSETWRIAVMFGAKCYTELNPRITHLIAAKRGTAKVD 619

Query: 77  LGSKGGQV 84
              + G V
Sbjct: 620 AARRQGGV 627


>gi|353236741|emb|CCA68729.1| related to FCP1-TFIIF interacting component of CTD phosphatase
           [Piriformospora indica DSM 11827]
          Length = 782

 Score = 48.9 bits (115), Expect = 4e-04,   Method: Composition-based stats.
 Identities = 23/67 (34%), Positives = 33/67 (49%)

Query: 19  EVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVSLG 78
           + L G  LVFS   P         +WK   + GATC ++++P VTH+V+NK    K    
Sbjct: 473 KTLAGVHLVFSGILPLDGRPERQPIWKAALEFGATCHVDINPQVTHLVTNKLGTVKADKA 532

Query: 79  SKGGQVF 85
              G +F
Sbjct: 533 FAQGNIF 539


>gi|336387157|gb|EGO28302.1| hypothetical protein SERLADRAFT_354339 [Serpula lacrymans var.
           lacrymans S7.9]
          Length = 874

 Score = 48.5 bits (114), Expect = 6e-04,   Method: Composition-based stats.
 Identities = 24/68 (35%), Positives = 32/68 (47%)

Query: 17  QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVS 76
           ++E L G  ++FS   P         +WKV E  GA C  EL   +THVV+ K    KV 
Sbjct: 566 RKETLDGIHILFSSVIPLDTKPETTEIWKVAEMFGAQCCTELSSRITHVVAAKHGTVKVD 625

Query: 77  LGSKGGQV 84
              K G +
Sbjct: 626 AARKRGGI 633


>gi|336374248|gb|EGO02585.1| hypothetical protein SERLA73DRAFT_102556 [Serpula lacrymans var.
           lacrymans S7.3]
          Length = 811

 Score = 48.5 bits (114), Expect = 6e-04,   Method: Composition-based stats.
 Identities = 24/68 (35%), Positives = 32/68 (47%)

Query: 17  QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVS 76
           ++E L G  ++FS   P         +WKV E  GA C  EL   +THVV+ K    KV 
Sbjct: 503 RKETLDGIHILFSSVIPLDTKPETTEIWKVAEMFGAQCCTELSSRITHVVAAKHGTVKVD 562

Query: 77  LGSKGGQV 84
              K G +
Sbjct: 563 AARKRGGI 570


>gi|145346053|ref|XP_001417510.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144577737|gb|ABO95803.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 643

 Score = 48.1 bits (113), Expect = 6e-04,   Method: Composition-based stats.
 Identities = 25/68 (36%), Positives = 32/68 (47%)

Query: 17  QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVS 76
           +++VL   ++VFS  FP       H LW + E  GATC   L    THVV    S +KV 
Sbjct: 551 RKKVLADVRIVFSRVFPIDADPTTHPLWILAEDFGATCGRTLCDDTTHVVGTASSTDKVK 610

Query: 77  LGSKGGQV 84
                G V
Sbjct: 611 AAKARGNV 618


>gi|402220046|gb|EJU00119.1| hypothetical protein DACRYDRAFT_81791 [Dacryopinax sp. DJM-731 SS1]
          Length = 855

 Score = 48.1 bits (113), Expect = 6e-04,   Method: Composition-based stats.
 Identities = 27/64 (42%), Positives = 30/64 (46%)

Query: 19  EVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVSLG 78
           EVL G  LVFS   P   P     LW+   Q GA C   +   VTHVV+ K   EKV  G
Sbjct: 558 EVLSGVHLVFSSLIPIDMPHQNTDLWRQALQFGAACYTRVAREVTHVVAAKRGTEKVRQG 617

Query: 79  SKGG 82
              G
Sbjct: 618 VARG 621


>gi|170084539|ref|XP_001873493.1| predicted protein [Laccaria bicolor S238N-H82]
 gi|164651045|gb|EDR15285.1| predicted protein [Laccaria bicolor S238N-H82]
          Length = 845

 Score = 48.1 bits (113), Expect = 6e-04,   Method: Composition-based stats.
 Identities = 23/68 (33%), Positives = 34/68 (50%)

Query: 17  QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVS 76
           + EVL+G  ++FS   P         +W++    GA CS EL   +THVV+ K    KV 
Sbjct: 541 RSEVLEGVHILFSSVIPLDTKPETTEIWRMAHMFGARCSTELTSDITHVVAAKRGTVKVD 600

Query: 77  LGSKGGQV 84
           +  K G +
Sbjct: 601 MARKRGGI 608


>gi|255540901|ref|XP_002511515.1| RNA polymerase II ctd phosphatase, putative [Ricinus communis]
 gi|223550630|gb|EEF52117.1| RNA polymerase II ctd phosphatase, putative [Ricinus communis]
          Length = 405

 Score = 47.8 bits (112), Expect = 8e-04,   Method: Composition-based stats.
 Identities = 23/51 (45%), Positives = 33/51 (64%), Gaps = 2/51 (3%)

Query: 17  QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVS 67
           Q  +L+GCKL+      +K+   +  L K+ E+LGA C  ELDP+VTHVV+
Sbjct: 306 QAGILQGCKLILRKNLTAKY--KLDNLSKMAEKLGAICVSELDPTVTHVVT 354


>gi|357501219|ref|XP_003620898.1| RNA polymerase II C-terminal domain phosphatase-like protein
           [Medicago truncatula]
 gi|355495913|gb|AES77116.1| RNA polymerase II C-terminal domain phosphatase-like protein
           [Medicago truncatula]
          Length = 720

 Score = 47.8 bits (112), Expect = 9e-04,   Method: Composition-based stats.
 Identities = 23/48 (47%), Positives = 32/48 (66%), Gaps = 4/48 (8%)

Query: 19  EVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVV 66
           EVL+GC +VFS      F   +  L ++ E+LGATC  +LDP+VTHV+
Sbjct: 431 EVLRGCVIVFS----LNFHGDLRILRRIAERLGATCLKKLDPTVTHVI 474


>gi|392597598|gb|EIW86920.1| hypothetical protein CONPUDRAFT_95946 [Coniophora puteana
           RWD-64-598 SS2]
          Length = 830

 Score = 47.4 bits (111), Expect = 0.001,   Method: Composition-based stats.
 Identities = 22/66 (33%), Positives = 31/66 (46%)

Query: 19  EVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVSLG 78
           +V  G  ++FS   P   P     +WKV    GA C  EL  S+THVV+ +    KV   
Sbjct: 546 KVFDGVHILFSSVIPLDTPPETTEIWKVAHMFGAKCYTELSSSITHVVAARLGTVKVDAA 605

Query: 79  SKGGQV 84
            + G +
Sbjct: 606 RRRGGI 611


>gi|281206665|gb|EFA80851.1| putative tfiif-interacting component of the c-terminal domain
           phosphatase [Polysphondylium pallidum PN500]
          Length = 881

 Score = 47.4 bits (111), Expect = 0.001,   Method: Composition-based stats.
 Identities = 25/79 (31%), Positives = 45/79 (56%), Gaps = 1/79 (1%)

Query: 17  QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVS 76
           ++++L G  LVFS  +P + PAH   L  + E+LGAT   ++  + THVV+ +    KV 
Sbjct: 585 KKKILNGVNLVFSGVYPLQLPAHRQPLRLLAEELGATVQNDITNTTTHVVAARKGTSKVH 644

Query: 77  LG-SKGGQVFGGSTVDRGS 94
              SKG ++   + +++ +
Sbjct: 645 KAISKGLKIVNQNWIEQSA 663


>gi|388580688|gb|EIM21001.1| FCP1-like phosphatase [Wallemia sebi CBS 633.66]
          Length = 510

 Score = 46.6 bits (109), Expect = 0.002,   Method: Composition-based stats.
 Identities = 25/65 (38%), Positives = 33/65 (50%)

Query: 17  QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVS 76
           +R+VL G KLVFS   P   P  I  ++ +  + GAT     +  VTHVV+ K    KV 
Sbjct: 323 KRKVLHGLKLVFSSVIPLGMPLEISGIYNLASKFGATIDHNYNEKVTHVVAAKKGTAKVE 382

Query: 77  LGSKG 81
              KG
Sbjct: 383 DAKKG 387


>gi|443896478|dbj|GAC73822.1| TFIIF-interacting CTD phosphatases [Pseudozyma antarctica T-34]
          Length = 751

 Score = 46.2 bits (108), Expect = 0.003,   Method: Composition-based stats.
 Identities = 26/63 (41%), Positives = 32/63 (50%), Gaps = 1/63 (1%)

Query: 19  EVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSV-THVVSNKCSNEKVSL 77
           +VLKGC +VFS   P    A    LW      GAT + E++P V THVVS +    KV  
Sbjct: 513 QVLKGCVIVFSSMIPVGHDAAKSELWATARAFGATPAAEIEPGVTTHVVSARMGTAKVHQ 572

Query: 78  GSK 80
             K
Sbjct: 573 AMK 575


>gi|390333352|ref|XP_791406.3| PREDICTED: RNA polymerase II subunit A C-terminal domain
           phosphatase-like [Strongylocentrotus purpuratus]
          Length = 673

 Score = 45.8 bits (107), Expect = 0.003,   Method: Composition-based stats.
 Identities = 25/76 (32%), Positives = 35/76 (46%), Gaps = 10/76 (13%)

Query: 20  VLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIEL----------DPSVTHVVSNK 69
           VLKGC +VFS  FP+  P     +WKV   LGA  S ++            + TH+V+ K
Sbjct: 371 VLKGCNIVFSSVFPTNMPPEQSRVWKVALALGAKVSPQIVTKSKEEQAKGRASTHLVAAK 430

Query: 70  CSNEKVSLGSKGGQVF 85
               KV    +   +F
Sbjct: 431 VGTSKVHAARRSKSIF 446


>gi|308802952|ref|XP_003078789.1| putative transcription regulator CPL1 (ISS) [Ostreococcus tauri]
 gi|116057242|emb|CAL51669.1| putative transcription regulator CPL1 (ISS) [Ostreococcus tauri]
          Length = 457

 Score = 45.8 bits (107), Expect = 0.004,   Method: Composition-based stats.
 Identities = 25/71 (35%), Positives = 32/71 (45%)

Query: 14  ENGQREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNE 73
           E  ++ VL G  +VFS  FP         LW + E  GA CS E+    THVV    +  
Sbjct: 359 EERRKVVLSGVHVVFSRVFPLHVKPEEQPLWILAENFGANCSSEITSHTTHVVGTSKATA 418

Query: 74  KVSLGSKGGQV 84
           KV    K G +
Sbjct: 419 KVREALKRGGI 429


>gi|242093894|ref|XP_002437437.1| hypothetical protein SORBIDRAFT_10g027050 [Sorghum bicolor]
 gi|241915660|gb|EER88804.1| hypothetical protein SORBIDRAFT_10g027050 [Sorghum bicolor]
          Length = 271

 Score = 45.4 bits (106), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 23/67 (34%), Positives = 37/67 (55%), Gaps = 3/67 (4%)

Query: 17  QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVS 76
           +R+VL  C +VFS+     FP     +W + E+LGA C  ++D +VTHVV+     +K  
Sbjct: 178 RRQVLPVCTVVFSYL--EDFPEDT-LMWTLAERLGAACQKDVDETVTHVVAEDPGTQKAQ 234

Query: 77  LGSKGGQ 83
              + G+
Sbjct: 235 WAREHGK 241


>gi|242063378|ref|XP_002452978.1| hypothetical protein SORBIDRAFT_04g035900 [Sorghum bicolor]
 gi|241932809|gb|EES05954.1| hypothetical protein SORBIDRAFT_04g035900 [Sorghum bicolor]
          Length = 464

 Score = 45.4 bits (106), Expect = 0.004,   Method: Composition-based stats.
 Identities = 23/69 (33%), Positives = 38/69 (55%), Gaps = 3/69 (4%)

Query: 17  QREVLKGCKLVFSH--AFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEK 74
           +R+VL  C +VFS+   +   FP     +W + E+LGA C  ++D +VTHVV+     +K
Sbjct: 367 RRQVLPVCTVVFSYLEEYMEDFPEDT-LMWTLAERLGAACQKDVDETVTHVVAEDPGTQK 425

Query: 75  VSLGSKGGQ 83
                + G+
Sbjct: 426 AQWAREHGK 434


>gi|357451355|ref|XP_003595954.1| RNA polymerase II subunit A C-terminal domain phosphatase [Medicago
           truncatula]
 gi|355485002|gb|AES66205.1| RNA polymerase II subunit A C-terminal domain phosphatase [Medicago
           truncatula]
          Length = 239

 Score = 45.1 bits (105), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 27/60 (45%), Positives = 36/60 (60%), Gaps = 3/60 (5%)

Query: 9   IFFCTENGQREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSN 68
           +F    + + EVL GC +VFS AF       +  L K+ E+LGAT   EL P+VTHVV+N
Sbjct: 175 VFHVLSSLRGEVLSGCVIVFSCAFHGH---DLRKLRKIAERLGATHLTELRPTVTHVVAN 231


>gi|168059994|ref|XP_001781984.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162666557|gb|EDQ53208.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 563

 Score = 44.7 bits (104), Expect = 0.006,   Method: Composition-based stats.
 Identities = 22/65 (33%), Positives = 33/65 (50%), Gaps = 1/65 (1%)

Query: 19  EVLKGCKLVFSHAFPSKFP-AHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVSL 77
           ++L GC +VFS  FP+       H  W++  +LGA CS   D + THVV+     +K   
Sbjct: 407 KLLSGCHIVFSRIFPTGLQNPEFHPFWQLAVELGARCSTVCDHTTTHVVALDRGTDKARW 466

Query: 78  GSKGG 82
             + G
Sbjct: 467 AKQHG 471


>gi|393218252|gb|EJD03740.1| hypothetical protein FOMMEDRAFT_105888 [Fomitiporia mediterranea
           MF3/22]
          Length = 921

 Score = 44.7 bits (104), Expect = 0.007,   Method: Composition-based stats.
 Identities = 23/77 (29%), Positives = 37/77 (48%), Gaps = 2/77 (2%)

Query: 11  FCTENGQREVLKGCKLVFSHAFPS--KFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSN 68
           F   N ++E  K   ++FS   P+  +       +W++    GATC  +LD  VTHVV++
Sbjct: 570 FIIPNIRKETFKDVHILFSGVIPTNIRMDHEATEIWRMARAFGATCHRDLDKEVTHVVTS 629

Query: 69  KCSNEKVSLGSKGGQVF 85
           K   +KV        +F
Sbjct: 630 KRGTQKVEKARSQPNIF 646


>gi|384247094|gb|EIE20582.1| hypothetical protein COCSUDRAFT_57726 [Coccomyxa subellipsoidea
           C-169]
          Length = 1018

 Score = 44.7 bits (104), Expect = 0.007,   Method: Composition-based stats.
 Identities = 24/68 (35%), Positives = 38/68 (55%), Gaps = 2/68 (2%)

Query: 17  QREVLKGCKLVFSHAFP-SKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKV 75
           +++VL G +++FS  FP  + P+   Y WK  E  GA+C+ +LD  VTHVV+      K 
Sbjct: 925 RKQVLLGVRVLFSKVFPLGQAPSEQLY-WKQAEAYGASCTSQLDEHVTHVVALSRGTHKA 983

Query: 76  SLGSKGGQ 83
               + G+
Sbjct: 984 QWALQAGK 991


>gi|241249809|ref|XP_002403164.1| RNA polymerase II ctd phosphatase, putative [Ixodes scapularis]
 gi|215496447|gb|EEC06087.1| RNA polymerase II ctd phosphatase, putative [Ixodes scapularis]
          Length = 185

 Score = 44.7 bits (104), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 24/68 (35%), Positives = 35/68 (51%)

Query: 17 QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVS 76
          +R+VLKG  LVFS   P+         W+    LGA  S +L P VTH+V+ +    KV+
Sbjct: 20 RRKVLKGSHLVFSGVVPTNQEPEKSRAWQTARALGARVSSDLCPGVTHLVAARPGTAKVN 79

Query: 77 LGSKGGQV 84
             +  Q+
Sbjct: 80 RARRTRQL 87


>gi|302816075|ref|XP_002989717.1| hypothetical protein SELMODRAFT_23521 [Selaginella moellendorffii]
 gi|302824047|ref|XP_002993670.1| hypothetical protein SELMODRAFT_23523 [Selaginella moellendorffii]
 gi|300138493|gb|EFJ05259.1| hypothetical protein SELMODRAFT_23523 [Selaginella moellendorffii]
 gi|300142494|gb|EFJ09194.1| hypothetical protein SELMODRAFT_23521 [Selaginella moellendorffii]
          Length = 312

 Score = 44.3 bits (103), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 27/86 (31%), Positives = 40/86 (46%), Gaps = 3/86 (3%)

Query: 20  VLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVSLGS 79
           +L+GCKL FS   P         LW + E LGA C +E+D SVTHVV+    + +     
Sbjct: 226 ILEGCKLAFSSVVPIDCEDS---LWILCEGLGAECVLEIDDSVTHVVAMDPESARARWAV 282

Query: 80  KGGQVFGGSTVDRGSQLFVARATRRE 105
           + G+     +  R +   + R    E
Sbjct: 283 ENGKHLVNPSWMRAAAFRLGRPRESE 308


>gi|358057984|dbj|GAA96229.1| hypothetical protein E5Q_02893 [Mixia osmundae IAM 14324]
          Length = 760

 Score = 43.9 bits (102), Expect = 0.012,   Method: Composition-based stats.
 Identities = 21/66 (31%), Positives = 32/66 (48%)

Query: 15  NGQREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEK 74
           N +  VL+GCK+ FS   P         +WK+ +  GA CS +++   TH+V+      K
Sbjct: 511 NIKESVLRGCKIAFSSMIPLGTNPEAADIWKLAKMFGAYCSSDVNSKTTHLVARNPGTVK 570

Query: 75  VSLGSK 80
           V    K
Sbjct: 571 VQQAQK 576


>gi|58271496|ref|XP_572904.1| protein phosphatase [Cryptococcus neoformans var. neoformans JEC21]
 gi|134115316|ref|XP_773956.1| hypothetical protein CNBH4080 [Cryptococcus neoformans var.
           neoformans B-3501A]
 gi|50256584|gb|EAL19309.1| hypothetical protein CNBH4080 [Cryptococcus neoformans var.
           neoformans B-3501A]
 gi|57229163|gb|AAW45597.1| protein phosphatase, putative [Cryptococcus neoformans var.
           neoformans JEC21]
          Length = 955

 Score = 43.9 bits (102), Expect = 0.013,   Method: Composition-based stats.
 Identities = 21/56 (37%), Positives = 28/56 (50%)

Query: 19  EVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEK 74
           EVL GC LVFS   P +       +W+  E  GA  +  L P  TH+V+   + EK
Sbjct: 649 EVLDGCNLVFSGMIPREANPSTTAIWQTAESFGALITPSLTPRTTHLVTALLNTEK 704


>gi|405122085|gb|AFR96852.1| hypothetical protein CNAG_04120 [Cryptococcus neoformans var.
           grubii H99]
          Length = 921

 Score = 43.9 bits (102), Expect = 0.013,   Method: Composition-based stats.
 Identities = 21/56 (37%), Positives = 28/56 (50%)

Query: 19  EVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEK 74
           EVL GC LVFS   P +       +W+  E  GA  +  L P  TH+V+   + EK
Sbjct: 613 EVLDGCSLVFSGMIPRESNPSTTTIWQTAESFGALITPSLTPRTTHLVTALLNTEK 668


>gi|406695220|gb|EKC98531.1| protein phosphatase [Trichosporon asahii var. asahii CBS 8904]
          Length = 917

 Score = 43.9 bits (102), Expect = 0.014,   Method: Composition-based stats.
 Identities = 21/62 (33%), Positives = 29/62 (46%)

Query: 19  EVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVSLG 78
           +VL GC +VF+             +W+  E  GA C +ELD  VTH V      EK+   
Sbjct: 603 QVLSGCVIVFTGVIAINQKPQDSEIWQQAEAFGAQCQVELDERVTHCVIGSIGTEKMRRA 662

Query: 79  SK 80
           S+
Sbjct: 663 SR 664


>gi|242015474|ref|XP_002428378.1| RNA polymerase II ctd phosphatase, putative [Pediculus humanus
           corporis]
 gi|212512990|gb|EEB15640.1| RNA polymerase II ctd phosphatase, putative [Pediculus humanus
           corporis]
          Length = 781

 Score = 43.1 bits (100), Expect = 0.023,   Method: Composition-based stats.
 Identities = 23/71 (32%), Positives = 35/71 (49%)

Query: 15  NGQREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEK 74
           N ++  LKGC LVFS   PS  P      + V   LGA  S ++  + TH+V+ +    K
Sbjct: 515 NFKKNTLKGCHLVFSGLVPSHIPLQESRAYLVAISLGAIVSADISSNCTHLVAARPGTAK 574

Query: 75  VSLGSKGGQVF 85
           V+   +   +F
Sbjct: 575 VNSSRRHKGIF 585


>gi|325179818|emb|CCA14221.1| conserved hypothetical protein [Albugo laibachii Nc14]
          Length = 694

 Score = 42.7 bits (99), Expect = 0.027,   Method: Composition-based stats.
 Identities = 22/53 (41%), Positives = 32/53 (60%), Gaps = 3/53 (5%)

Query: 17  QREVLKGCKLVFSHAFPSKFP--AHIHYLWKVVEQLGATCSIELDP-SVTHVV 66
           QR++L+GC +VFS  FP   P     H LW++   +GA  S+ +D   VTH+V
Sbjct: 380 QRKILQGCFIVFSGVFPVSDPRGPKSHSLWRLAADMGAVPSLVIDDFPVTHLV 432


>gi|403416935|emb|CCM03635.1| predicted protein [Fibroporia radiculosa]
          Length = 580

 Score = 42.7 bits (99), Expect = 0.029,   Method: Composition-based stats.
 Identities = 21/66 (31%), Positives = 29/66 (43%)

Query: 19  EVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVSLG 78
           + L G  +VFS   P         +W+     GA C  EL   VTHVV+ K   +KV   
Sbjct: 279 DTLAGVHIVFSSVIPLDTRPEATEIWRTAHAFGAKCYTELSNRVTHVVAAKRGTQKVDAA 338

Query: 79  SKGGQV 84
            + G +
Sbjct: 339 RRSGGI 344


>gi|326437795|gb|EGD83365.1| hypothetical protein PTSG_03974 [Salpingoeca sp. ATCC 50818]
          Length = 864

 Score = 42.7 bits (99), Expect = 0.031,   Method: Composition-based stats.
 Identities = 21/65 (32%), Positives = 34/65 (52%)

Query: 20  VLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVSLGS 79
           +L+G ++VF+   P    A+ H  W++   +GA    ++D  VTHVV+     +KV    
Sbjct: 641 ILEGVRIVFTGVIPRGQSAYTHPAWRMAVNMGAVVVDQVDERVTHVVARVDGTDKVRQAR 700

Query: 80  KGGQV 84
           K G V
Sbjct: 701 KMGGV 705


>gi|401886990|gb|EJT50998.1| protein phosphatase [Trichosporon asahii var. asahii CBS 2479]
          Length = 922

 Score = 42.4 bits (98), Expect = 0.034,   Method: Composition-based stats.
 Identities = 20/62 (32%), Positives = 29/62 (46%)

Query: 19  EVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVSLG 78
           ++L GC +VF+             +W+  E  GA C +ELD  VTH V      EK+   
Sbjct: 608 QMLSGCVIVFTGVIAINQKPQDSEIWQQAEAFGAQCQVELDERVTHCVIGSIGTEKMRRA 667

Query: 79  SK 80
           S+
Sbjct: 668 SR 669


>gi|449551315|gb|EMD42279.1| hypothetical protein CERSUDRAFT_148004 [Ceriporiopsis subvermispora
           B]
          Length = 875

 Score = 42.0 bits (97), Expect = 0.051,   Method: Composition-based stats.
 Identities = 20/66 (30%), Positives = 30/66 (45%)

Query: 19  EVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVSLG 78
           + L+G  ++FS   P      +  +W+     GA C  EL   +THVV+ K    KV   
Sbjct: 566 KALEGVHILFSSVIPLDTRPEVTEVWRTAHAFGAQCHTELSSRITHVVAAKRGTVKVDAA 625

Query: 79  SKGGQV 84
            K G +
Sbjct: 626 RKQGGI 631


>gi|413924219|gb|AFW64151.1| hypothetical protein ZEAMMB73_480827 [Zea mays]
          Length = 490

 Score = 41.6 bits (96), Expect = 0.054,   Method: Composition-based stats.
 Identities = 22/69 (31%), Positives = 36/69 (52%), Gaps = 3/69 (4%)

Query: 17  QREVLKGCKLVFSHA--FPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEK 74
           +R+VL  C + FS+       FP +   +W + E+LGA C  ++D +VTHVV+     +K
Sbjct: 368 RRQVLPECTIAFSYLDDCMEDFPENT-LMWTLAERLGAVCRKDVDETVTHVVAEDPGTQK 426

Query: 75  VSLGSKGGQ 83
                  G+
Sbjct: 427 AQWARDHGK 435


>gi|226498568|ref|NP_001149751.1| CPL3 [Zea mays]
 gi|195631558|gb|ACG36674.1| CPL3 [Zea mays]
          Length = 493

 Score = 41.6 bits (96), Expect = 0.067,   Method: Composition-based stats.
 Identities = 22/69 (31%), Positives = 36/69 (52%), Gaps = 3/69 (4%)

Query: 17  QREVLKGCKLVFSHA--FPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEK 74
           +R+VL  C + FS+       FP +   +W + E+LGA C  ++D +VTHVV+     +K
Sbjct: 371 RRQVLPECTVAFSYLDDCMEDFPENT-LMWTLAERLGAVCRKDVDETVTHVVAEDPGTQK 429

Query: 75  VSLGSKGGQ 83
                  G+
Sbjct: 430 AQWARDHGK 438


>gi|449018404|dbj|BAM81806.1| similar to TFIIF interacting component of CTD phosphatase Fcp1p
            [Cyanidioschyzon merolae strain 10D]
          Length = 1640

 Score = 40.8 bits (94), Expect = 0.10,   Method: Composition-based stats.
 Identities = 18/54 (33%), Positives = 31/54 (57%), Gaps = 2/54 (3%)

Query: 17   QREVLKGCKLVFSHAFP--SKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSN 68
            +R VL GC+L F+  F   +      H LW++  + GA C  E+ P V+H++++
Sbjct: 1397 RRSVLTGCELCFTGVFAKHAGMAPEDHELWRLAVRFGAVCHREVLPQVSHLIAD 1450


>gi|321262398|ref|XP_003195918.1| carboxy-terminal domain (CTD) phosphatase; Fcp1p [Cryptococcus
           gattii WM276]
 gi|317462392|gb|ADV24131.1| Carboxy-terminal domain (CTD) phosphatase, putative; Fcp1p
           [Cryptococcus gattii WM276]
          Length = 952

 Score = 40.8 bits (94), Expect = 0.12,   Method: Composition-based stats.
 Identities = 20/56 (35%), Positives = 27/56 (48%)

Query: 19  EVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEK 74
           EVL GC LVFS   P +       +W+  E  GA  +  L    TH+V+   + EK
Sbjct: 643 EVLDGCSLVFSGMIPREADPSTTTIWQTAESFGALITPSLTSRTTHLVTALLNTEK 698


>gi|393240595|gb|EJD48120.1| hypothetical protein AURDEDRAFT_85955 [Auricularia delicata
           TFB-10046 SS5]
          Length = 796

 Score = 40.4 bits (93), Expect = 0.12,   Method: Composition-based stats.
 Identities = 19/65 (29%), Positives = 30/65 (46%)

Query: 19  EVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVSLG 78
           +   G   +FS   P +       +WK   + GA C  E+ P +THV++ K S  KV   
Sbjct: 521 QTFAGMHFLFSSLIPLEDKPEESPIWKQAREFGAICHSEVSPRLTHVITAKRSTAKVDAA 580

Query: 79  SKGGQ 83
            + G+
Sbjct: 581 RRRGE 585


>gi|242066826|ref|XP_002454702.1| hypothetical protein SORBIDRAFT_04g035880 [Sorghum bicolor]
 gi|241934533|gb|EES07678.1| hypothetical protein SORBIDRAFT_04g035880 [Sorghum bicolor]
          Length = 462

 Score = 40.4 bits (93), Expect = 0.12,   Method: Composition-based stats.
 Identities = 22/67 (32%), Positives = 35/67 (52%), Gaps = 3/67 (4%)

Query: 19  EVLKGCKLVFSHAFP--SKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVS 76
           +VL+GC + FS+        P     LW + E+LGA C  ++D +VTHVV+     +K  
Sbjct: 341 QVLRGCTVAFSYLEQRMEDSPDDTR-LWTLAERLGAVCRKDVDETVTHVVAEDPGTQKAQ 399

Query: 77  LGSKGGQ 83
              + G+
Sbjct: 400 WAREHGK 406


>gi|307106534|gb|EFN54779.1| hypothetical protein CHLNCDRAFT_134722 [Chlorella variabilis]
          Length = 513

 Score = 40.0 bits (92), Expect = 0.16,   Method: Composition-based stats.
 Identities = 21/59 (35%), Positives = 31/59 (52%), Gaps = 1/59 (1%)

Query: 16  GQREVLKGCKLVFSHAFPSKFP-AHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNE 73
            +R VL  C+L+FS   P        H LW++  +LGA C  E    VTHVV+   +++
Sbjct: 342 ARRAVLAECRLLFSRVMPLDCADPSAHPLWQLALKLGAECVRETGQGVTHVVATDTTDK 400


>gi|395334832|gb|EJF67208.1| hypothetical protein DICSQDRAFT_142769 [Dichomitus squalens
           LYAD-421 SS1]
          Length = 953

 Score = 40.0 bits (92), Expect = 0.17,   Method: Composition-based stats.
 Identities = 19/66 (28%), Positives = 30/66 (45%)

Query: 19  EVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVSLG 78
           + L G  ++F+   P         +WK     GA C  +L   +THVV+NK + +KV   
Sbjct: 616 DTLAGVHILFTGVIPLNQRPETAEIWKTATAFGAQCHTDLGKHITHVVTNKDNTQKVDAA 675

Query: 79  SKGGQV 84
            +   V
Sbjct: 676 RRYADV 681


>gi|302793512|ref|XP_002978521.1| hypothetical protein SELMODRAFT_418187 [Selaginella moellendorffii]
 gi|300153870|gb|EFJ20507.1| hypothetical protein SELMODRAFT_418187 [Selaginella moellendorffii]
          Length = 346

 Score = 40.0 bits (92), Expect = 0.18,   Method: Composition-based stats.
 Identities = 25/71 (35%), Positives = 40/71 (56%), Gaps = 9/71 (12%)

Query: 18  REV----LKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTH-VVSNKCSN 72
           REV    L GCK+V      +K  A    LW   ++LGA C +++D +VTH VV++K   
Sbjct: 251 REVKGHALSGCKIVIC----AKSQAAHELLWDSCQELGAECVVDIDDTVTHVVVASKQQP 306

Query: 73  EKVSLGSKGGQ 83
           + + L ++ G+
Sbjct: 307 QGLELSAQAGK 317


>gi|291234950|ref|XP_002737409.1| PREDICTED: RNA polymerase II ctd phosphatase, putative-like
           [Saccoglossus kowalevskii]
          Length = 896

 Score = 39.7 bits (91), Expect = 0.21,   Method: Composition-based stats.
 Identities = 23/68 (33%), Positives = 30/68 (44%), Gaps = 10/68 (14%)

Query: 18  REVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDP----------SVTHVVS 67
           R+VLKG  ++FS  FP+         WKV + LGA       P          + THVV+
Sbjct: 576 RQVLKGTNILFSGVFPTNMSPEKSRAWKVAQTLGANVQSSFVPKLKDKTNAATATTHVVA 635

Query: 68  NKCSNEKV 75
            K    KV
Sbjct: 636 AKAGTVKV 643


>gi|71004098|ref|XP_756715.1| hypothetical protein UM00568.1 [Ustilago maydis 521]
 gi|46095984|gb|EAK81217.1| hypothetical protein UM00568.1 [Ustilago maydis 521]
          Length = 779

 Score = 39.7 bits (91), Expect = 0.22,   Method: Composition-based stats.
 Identities = 22/58 (37%), Positives = 31/58 (53%), Gaps = 1/58 (1%)

Query: 19  EVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSV-THVVSNKCSNEKV 75
           +VLKGC +VFS   P         LW +  + GAT + E++  V THVV+ +    KV
Sbjct: 518 QVLKGCTIVFSSMIPFGHNVEKSDLWAMAREFGATPASEIEVGVTTHVVAARPGTAKV 575


>gi|341882050|gb|EGT37985.1| hypothetical protein CAEBREN_32558 [Caenorhabditis brenneri]
          Length = 673

 Score = 39.7 bits (91), Expect = 0.23,   Method: Composition-based stats.
 Identities = 19/59 (32%), Positives = 33/59 (55%)

Query: 17  QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKV 75
           +R+VL GC +VFS   P+        ++++ +Q GAT   E+   VTH+V  +   +K+
Sbjct: 359 RRKVLDGCVIVFSGIVPTGEKLERTDIYRLCQQFGATILPEVTDQVTHIVGARYGTQKI 417


>gi|196002231|ref|XP_002110983.1| hypothetical protein TRIADDRAFT_54465 [Trichoplax adhaerens]
 gi|190586934|gb|EDV26987.1| hypothetical protein TRIADDRAFT_54465 [Trichoplax adhaerens]
          Length = 766

 Score = 39.7 bits (91), Expect = 0.25,   Method: Composition-based stats.
 Identities = 30/79 (37%), Positives = 38/79 (48%), Gaps = 10/79 (12%)

Query: 17  QREVLKGCKLVFSHAFPSKFPA-HIHYLWKVVEQLGA--TCSIELDPS--VTHVVSNKCS 71
           +R VLK  K+VFS   PS  P+    Y W + E LGA  T      PS   THVV+ + +
Sbjct: 546 RRNVLKDVKIVFSAIIPSGHPSPEKTYEWILAESLGAKVTHKFHTSPSRKTTHVVTKRVA 605

Query: 72  -----NEKVSLGSKGGQVF 85
                 +KV L  K   VF
Sbjct: 606 FQSGYTQKVHLAMKTAGVF 624


>gi|323508124|emb|CBQ67995.1| related to FCP1-TFIIF interacting component of CTD phosphatase
           [Sporisorium reilianum SRZ2]
          Length = 773

 Score = 39.7 bits (91), Expect = 0.26,   Method: Composition-based stats.
 Identities = 25/72 (34%), Positives = 34/72 (47%), Gaps = 1/72 (1%)

Query: 20  VLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSV-THVVSNKCSNEKVSLG 78
           VL+GC +VFS   P         LW +  + GAT S E++  V THVV+ +    KV   
Sbjct: 516 VLQGCTIVFSSMIPFGHDPEKSDLWAMAREFGATPSSEIEAGVTTHVVAARPGTAKVHQA 575

Query: 79  SKGGQVFGGSTV 90
            +  Q   G  V
Sbjct: 576 LRLAQKSAGLEV 587


>gi|302774062|ref|XP_002970448.1| hypothetical protein SELMODRAFT_411029 [Selaginella moellendorffii]
 gi|300161964|gb|EFJ28578.1| hypothetical protein SELMODRAFT_411029 [Selaginella moellendorffii]
          Length = 346

 Score = 39.3 bits (90), Expect = 0.29,   Method: Composition-based stats.
 Identities = 25/71 (35%), Positives = 39/71 (54%), Gaps = 9/71 (12%)

Query: 18  REV----LKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTH-VVSNKCSN 72
           REV    L GCK+V      +K  A    LW   + LGA C +++D +VTH VV++K   
Sbjct: 251 REVKGHALSGCKIVIC----AKTQAAHELLWDSCQALGAECVVDIDDTVTHVVVASKQQP 306

Query: 73  EKVSLGSKGGQ 83
           + + L ++ G+
Sbjct: 307 QGLELSAQAGK 317


>gi|388858248|emb|CCF48177.1| related to FCP1-TFIIF interacting component of CTD phosphatase
           [Ustilago hordei]
          Length = 774

 Score = 39.3 bits (90), Expect = 0.29,   Method: Composition-based stats.
 Identities = 21/60 (35%), Positives = 32/60 (53%), Gaps = 1/60 (1%)

Query: 17  QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSV-THVVSNKCSNEKV 75
           + +VL GC +VFS   P+        LW +  + GAT + E++  V THVV+ +    KV
Sbjct: 505 KTKVLAGCTIVFSSMIPTGHNPETSDLWALAREFGATPAFEVEEGVTTHVVAARQGTLKV 564


>gi|91087589|ref|XP_971974.1| PREDICTED: similar to RNA polymerase II subunit A C-terminal domain
           phosphatase [Tribolium castaneum]
 gi|270010700|gb|EFA07148.1| hypothetical protein TcasGA2_TC010139 [Tribolium castaneum]
          Length = 760

 Score = 39.3 bits (90), Expect = 0.29,   Method: Composition-based stats.
 Identities = 20/64 (31%), Positives = 35/64 (54%)

Query: 17  QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVS 76
           + +VL+G KLVFS   P+         +++ + LGA  + EL+   TH+V+ +    KV+
Sbjct: 484 RSQVLQGYKLVFSGLVPTHIKLEQSKAYQIAKSLGAEVTQELEDDTTHLVAVRPGTAKVN 543

Query: 77  LGSK 80
            G +
Sbjct: 544 AGRR 547


>gi|389751366|gb|EIM92439.1| hypothetical protein STEHIDRAFT_136328 [Stereum hirsutum FP-91666
           SS1]
          Length = 1075

 Score = 39.3 bits (90), Expect = 0.30,   Method: Composition-based stats.
 Identities = 20/65 (30%), Positives = 28/65 (43%)

Query: 21  LKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVSLGSK 80
           L G  ++FS   P         +W++    GA C  EL   +THVV+ K    KV    K
Sbjct: 672 LFGVHILFSSVIPLDTRPETTEVWRLAHAFGAKCYTELSSKITHVVAAKRGTVKVDQARK 731

Query: 81  GGQVF 85
            G + 
Sbjct: 732 RGNIL 736


>gi|387219521|gb|AFJ69469.1| rna polymerase ii ctd phosphatase, partial [Nannochloropsis
           gaditana CCMP526]
          Length = 268

 Score = 38.9 bits (89), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 21/64 (32%), Positives = 31/64 (48%)

Query: 12  CTENGQREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCS 71
           C  + +R+VL G  ++FS   P         L  +   LGA    +  P+VTH+V+   S
Sbjct: 84  CLSSVRRQVLAGVTILFSGVLPRNVDPRRSDLGYMALSLGARIVEDFSPTVTHLVAENAS 143

Query: 72  NEKV 75
            EKV
Sbjct: 144 TEKV 147


>gi|302838991|ref|XP_002951053.1| hypothetical protein VOLCADRAFT_91454 [Volvox carteri f.
           nagariensis]
 gi|300263748|gb|EFJ47947.1| hypothetical protein VOLCADRAFT_91454 [Volvox carteri f.
           nagariensis]
          Length = 699

 Score = 38.9 bits (89), Expect = 0.37,   Method: Composition-based stats.
 Identities = 23/63 (36%), Positives = 31/63 (49%), Gaps = 5/63 (7%)

Query: 17  QREVLK----GCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSV-THVVSNKCS 71
           +RE+L+    GC + FS  +P         LW++   LGA C    DP V THVV+    
Sbjct: 566 RREILQLMPQGCCITFSRCWPQDRNPLREPLWQLAMSLGANCLTTYDPGVTTHVVAAAGG 625

Query: 72  NEK 74
            EK
Sbjct: 626 TEK 628


>gi|170036997|ref|XP_001846347.1| RNA polymerase II subunit A C-terminal domain phosphatase [Culex
           quinquefasciatus]
 gi|167879975|gb|EDS43358.1| RNA polymerase II subunit A C-terminal domain phosphatase [Culex
           quinquefasciatus]
          Length = 764

 Score = 38.9 bits (89), Expect = 0.38,   Method: Composition-based stats.
 Identities = 19/51 (37%), Positives = 28/51 (54%)

Query: 17  QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVS 67
           + +VL G KLVFS   P+    H    ++V   LGAT +   +P  TH+V+
Sbjct: 466 KSQVLVGHKLVFSGLVPNSMKLHQSKAFQVARSLGATVTQSFEPDTTHLVA 516


>gi|328874143|gb|EGG22509.1| hypothetical protein DFA_04637 [Dictyostelium fasciculatum]
          Length = 397

 Score = 38.5 bits (88), Expect = 0.46,   Method: Composition-based stats.
 Identities = 23/68 (33%), Positives = 33/68 (48%), Gaps = 3/68 (4%)

Query: 20  VLKGCKLVFSHAFPSKFPA---HIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVS 76
           VL  C +VFS  FP +  A   H   + ++ E  GA    ++ P+ TH++  K    KV 
Sbjct: 299 VLMDCNIVFSGIFPKQIDATKLHQTRIVQMAESFGAQVHQDITPTTTHLIFIKEGTSKVI 358

Query: 77  LGSKGGQV 84
              K GQV
Sbjct: 359 QAVKQGQV 366


>gi|307213748|gb|EFN89086.1| Ornithine decarboxylase [Harpegnathos saltator]
          Length = 409

 Score = 37.7 bits (86), Expect = 0.77,   Method: Composition-based stats.
 Identities = 26/71 (36%), Positives = 38/71 (53%), Gaps = 11/71 (15%)

Query: 21  LKGCKLVFSHAFPSKFPAHIHYLWKV-VEQLGATCSIELD------PSVTHVVSNKC--S 71
           +KG +++F+H  P+K P+HI Y  KV VE++      EL       P    V+  +C   
Sbjct: 96  VKGERIIFAH--PAKLPSHIKYARKVGVERMTVDGETELSKIQEFFPEAKVVLRIRCDAK 153

Query: 72  NEKVSLGSKGG 82
           N  VSLG+K G
Sbjct: 154 NSPVSLGTKFG 164


>gi|299470348|emb|CBN78397.1| Similar to RNA Polymerase II CTD phosphatase Fcp1, putative
           [Ectocarpus siliculosus]
          Length = 985

 Score = 37.7 bits (86), Expect = 0.77,   Method: Composition-based stats.
 Identities = 25/69 (36%), Positives = 33/69 (47%), Gaps = 3/69 (4%)

Query: 20  VLKGCKLVFSHAFP-SKFPA--HIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVS 76
           VL G ++VFS   P S  PA    H LW + E  GAT   ++    THVV+ +    K  
Sbjct: 724 VLTGVRMVFSGVIPVSGAPADPRTHRLWMMAESHGATVERDIGRHTTHVVAVRLGTAKTK 783

Query: 77  LGSKGGQVF 85
            G +   VF
Sbjct: 784 TGLRMPGVF 792


>gi|322785368|gb|EFZ12041.1| hypothetical protein SINV_00693 [Solenopsis invicta]
          Length = 759

 Score = 37.7 bits (86), Expect = 0.79,   Method: Composition-based stats.
 Identities = 21/68 (30%), Positives = 32/68 (47%)

Query: 17  QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVS 76
           + +VLKG  L FS   P+    H    +KV    GA  + EL    TH+V+ +    K +
Sbjct: 479 RSQVLKGVCLTFSGLIPTHQKLHQSRAYKVARAFGAEVTQELTEKTTHLVAIRKGTAKAN 538

Query: 77  LGSKGGQV 84
              K G++
Sbjct: 539 AAKKHGKI 546


>gi|330796177|ref|XP_003286145.1| hypothetical protein DICPUDRAFT_87022 [Dictyostelium purpureum]
 gi|325083890|gb|EGC37331.1| hypothetical protein DICPUDRAFT_87022 [Dictyostelium purpureum]
          Length = 793

 Score = 37.7 bits (86), Expect = 0.80,   Method: Composition-based stats.
 Identities = 22/62 (35%), Positives = 34/62 (54%), Gaps = 3/62 (4%)

Query: 17  QREVLKGCKLVFSHAFPSKF-PAHIHY--LWKVVEQLGATCSIELDPSVTHVVSNKCSNE 73
           +  VL  C +VFS  FP +  P+ + +  + K+ E  GA+ S E+D + THV+  K    
Sbjct: 688 RSSVLMDCNIVFSGIFPKQIDPSKLCHTRVSKITESFGASISQEIDSNTTHVIFIKEGTS 747

Query: 74  KV 75
           KV
Sbjct: 748 KV 749


>gi|168012675|ref|XP_001759027.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162689726|gb|EDQ76096.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 389

 Score = 37.7 bits (86), Expect = 0.83,   Method: Composition-based stats.
 Identities = 19/59 (32%), Positives = 30/59 (50%), Gaps = 8/59 (13%)

Query: 9   IFFCTENGQREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVS 67
           +FF   + + ++L GC +V            IH  W++  +LGA CS   D + THVV+
Sbjct: 213 LFFVIRSLRAKLLAGCNVVLG--------PEIHPFWQLPAELGARCSTFCDHTTTHVVA 263


>gi|198438317|ref|XP_002131972.1| PREDICTED: similar to MGC81710 protein [Ciona intestinalis]
          Length = 895

 Score = 37.4 bits (85), Expect = 1.2,   Method: Composition-based stats.
 Identities = 22/69 (31%), Positives = 34/69 (49%), Gaps = 2/69 (2%)

Query: 17  QREVLKGCKLVFSHAFPSKFPA--HIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEK 74
           + +VL GC +V +   P+ F A  H+H    V  QLGA  +  +D + TH++  K    K
Sbjct: 582 RSKVLYGCCIVLTGIIPNNFKAAPHMHRAHIVARQLGAAINSTVDENTTHLIGAKKGTAK 641

Query: 75  VSLGSKGGQ 83
                K G+
Sbjct: 642 YQDALKMGK 650


>gi|303276827|ref|XP_003057707.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226460364|gb|EEH57658.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 692

 Score = 37.4 bits (85), Expect = 1.2,   Method: Composition-based stats.
 Identities = 22/57 (38%), Positives = 31/57 (54%), Gaps = 4/57 (7%)

Query: 17  QREVLKGCKLVFSHAFP---SKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKC 70
           ++ VL+G ++VFS  F           H LW++ E+LGA    E   S THVV+ KC
Sbjct: 591 RKNVLRGVEIVFSGVFDHNDKTLTPREHPLWRLAERLGARVVSEPGTSTTHVVA-KC 646


>gi|320163842|gb|EFW40741.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
          Length = 933

 Score = 37.4 bits (85), Expect = 1.3,   Method: Composition-based stats.
 Identities = 15/24 (62%), Positives = 17/24 (70%)

Query: 43  LWKVVEQLGATCSIELDPSVTHVV 66
           LW +VE  G TCS ELD S TH+V
Sbjct: 114 LWGIVEYFGGTCSAELDSSCTHLV 137


>gi|328859642|gb|EGG08750.1| hypothetical protein MELLADRAFT_115868 [Melampsora larici-populina
           98AG31]
          Length = 736

 Score = 37.0 bits (84), Expect = 1.3,   Method: Composition-based stats.
 Identities = 15/29 (51%), Positives = 18/29 (62%)

Query: 44  WKVVEQLGATCSIELDPSVTHVVSNKCSN 72
           WK+ EQ GA C   L P VTH+V+ K  N
Sbjct: 538 WKLAEQFGAQCYTRLTPRVTHLVAAKAIN 566


>gi|268566337|ref|XP_002639695.1| C. briggsae CBR-FCP-1 protein [Caenorhabditis briggsae]
          Length = 723

 Score = 37.0 bits (84), Expect = 1.3,   Method: Composition-based stats.
 Identities = 20/67 (29%), Positives = 34/67 (50%)

Query: 17  QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVS 76
           + +VL GC +VFS   P+        ++++  Q GAT   E+   VTHVV  +   +K+ 
Sbjct: 366 RHKVLDGCVIVFSGIVPTGEKLERTDIYRLCMQFGATIVPEVTDEVTHVVGARYGTQKIH 425

Query: 77  LGSKGGQ 83
              + G+
Sbjct: 426 QAHRLGK 432


>gi|66824241|ref|XP_645475.1| hypothetical protein DDB_G0271690 [Dictyostelium discoideum AX4]
 gi|60473594|gb|EAL71535.1| hypothetical protein DDB_G0271690 [Dictyostelium discoideum AX4]
          Length = 782

 Score = 37.0 bits (84), Expect = 1.5,   Method: Composition-based stats.
 Identities = 20/69 (28%), Positives = 37/69 (53%), Gaps = 1/69 (1%)

Query: 17  QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVS 76
           ++++LKG  +VFS  +P   P     L  + E+ G+    +++   THV++ +    KV+
Sbjct: 431 KKDILKGTYIVFSGVYPLGTPIQKQPLRWLAEEFGSVVQNDINNETTHVIAQRKGTSKVN 490

Query: 77  LG-SKGGQV 84
              SKG +V
Sbjct: 491 KALSKGLKV 499


>gi|328713585|ref|XP_001947680.2| PREDICTED: RNA polymerase II subunit A C-terminal domain
           phosphatase-like [Acyrthosiphon pisum]
          Length = 736

 Score = 36.6 bits (83), Expect = 1.9,   Method: Composition-based stats.
 Identities = 21/64 (32%), Positives = 31/64 (48%)

Query: 17  QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVS 76
           + ++L G KLVFS   P+  P      +KV   LGA  +  + P  TH+V+ +    K S
Sbjct: 419 RSKILAGKKLVFSGLVPTPVPLTESRAYKVARLLGAEVTENIKPDSTHLVAVRQGTLKAS 478

Query: 77  LGSK 80
              K
Sbjct: 479 AARK 482


>gi|66805733|ref|XP_636588.1| hypothetical protein DDB_G0288707 [Dictyostelium discoideum AX4]
 gi|60464974|gb|EAL63085.1| hypothetical protein DDB_G0288707 [Dictyostelium discoideum AX4]
          Length = 985

 Score = 36.6 bits (83), Expect = 1.9,   Method: Composition-based stats.
 Identities = 18/53 (33%), Positives = 31/53 (58%), Gaps = 3/53 (5%)

Query: 17  QREVLKGCKLVFSHAFPSKF-PAHIHY--LWKVVEQLGATCSIELDPSVTHVV 66
           +  VL  C +VFS  FP +  P+ + +  + K+ E  GA  S+E+D + TH++
Sbjct: 879 RSSVLMDCNIVFSGIFPKQIDPSKLCHTRVSKITESFGAKISLEIDSTTTHLI 931


>gi|449675210|ref|XP_002161785.2| PREDICTED: RNA polymerase II subunit A C-terminal domain
           phosphatase-like [Hydra magnipapillata]
          Length = 718

 Score = 36.6 bits (83), Expect = 2.0,   Method: Composition-based stats.
 Identities = 20/64 (31%), Positives = 29/64 (45%), Gaps = 6/64 (9%)

Query: 17  QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIEL------DPSVTHVVSNKC 70
           +R+ LKGC +VF+   P+  P      WK    LGA  + E+          THVV+ + 
Sbjct: 444 RRQTLKGCNIVFTGVIPTNCPLEKSKAWKTAVSLGARVTSEVVGKEEDGLRTTHVVAARH 503

Query: 71  SNEK 74
              K
Sbjct: 504 GTHK 507


>gi|159483481|ref|XP_001699789.1| hypothetical protein CHLREDRAFT_141879 [Chlamydomonas reinhardtii]
 gi|158281731|gb|EDP07485.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 375

 Score = 36.6 bits (83), Expect = 2.0,   Method: Composition-based stats.
 Identities = 21/57 (36%), Positives = 28/57 (49%), Gaps = 1/57 (1%)

Query: 20  VLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSV-THVVSNKCSNEKV 75
           +L G  + FS  +          LW++ E LGATC    DP+V THVV+      KV
Sbjct: 309 ILTGVHITFSRCWAQDKDPRKEPLWQLAEGLGATCLPAYDPAVTTHVVAAGGGTAKV 365


>gi|392578708|gb|EIW71836.1| hypothetical protein TREMEDRAFT_67978 [Tremella mesenterica DSM
           1558]
          Length = 944

 Score = 36.2 bits (82), Expect = 2.3,   Method: Composition-based stats.
 Identities = 19/62 (30%), Positives = 23/62 (37%)

Query: 19  EVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVSLG 78
           +V  GC  VFS              W+  E  GA C   L    TH ++     EKV   
Sbjct: 672 QVFDGCYFVFSGIIARDVEPETTSHWQWAEMFGARCQPTLTRKTTHCITTNAGTEKVYQA 731

Query: 79  SK 80
           SK
Sbjct: 732 SK 733


>gi|330799899|ref|XP_003287978.1| hypothetical protein DICPUDRAFT_55168 [Dictyostelium purpureum]
 gi|325082002|gb|EGC35499.1| hypothetical protein DICPUDRAFT_55168 [Dictyostelium purpureum]
          Length = 730

 Score = 36.2 bits (82), Expect = 2.3,   Method: Composition-based stats.
 Identities = 21/69 (30%), Positives = 37/69 (53%), Gaps = 1/69 (1%)

Query: 17  QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVS 76
           ++E+LK   +VFS  +P   P +   L  + E+ GA+   ++    THV++ +    KV+
Sbjct: 443 KKEILKDQFIVFSGVYPLGTPVNKQPLRYLAEEFGASVENDITSKTTHVIAQRKGTSKVN 502

Query: 77  LG-SKGGQV 84
              SKG +V
Sbjct: 503 KAISKGLKV 511


>gi|383859141|ref|XP_003705055.1| PREDICTED: RNA polymerase II subunit A C-terminal domain
           phosphatase-like isoform 2 [Megachile rotundata]
          Length = 759

 Score = 35.8 bits (81), Expect = 3.4,   Method: Composition-based stats.
 Identities = 20/62 (32%), Positives = 28/62 (45%)

Query: 19  EVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVSLG 78
           +VLKG  + FS   P+    H    +KV    GA  S EL    TH+V+ +    K +  
Sbjct: 474 QVLKGVHITFSGLIPTHQKIHQSRAYKVARAFGAEVSQELTDKTTHLVAIRPGTAKANAA 533

Query: 79  SK 80
            K
Sbjct: 534 KK 535


>gi|383859139|ref|XP_003705054.1| PREDICTED: RNA polymerase II subunit A C-terminal domain
           phosphatase-like isoform 1 [Megachile rotundata]
          Length = 760

 Score = 35.8 bits (81), Expect = 3.4,   Method: Composition-based stats.
 Identities = 20/62 (32%), Positives = 28/62 (45%)

Query: 19  EVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVSLG 78
           +VLKG  + FS   P+    H    +KV    GA  S EL    TH+V+ +    K +  
Sbjct: 474 QVLKGVHITFSGLIPTHQKIHQSRAYKVARAFGAEVSQELTDKTTHLVAIRPGTAKANAA 533

Query: 79  SK 80
            K
Sbjct: 534 KK 535


>gi|24762673|ref|NP_611934.1| Fcp1 [Drosophila melanogaster]
 gi|7291810|gb|AAF47230.1| Fcp1 [Drosophila melanogaster]
          Length = 880

 Score = 35.8 bits (81), Expect = 3.5,   Method: Composition-based stats.
 Identities = 19/64 (29%), Positives = 31/64 (48%)

Query: 17  QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVS 76
           + EVL+G  LVFS   P++        + + + LGA     +D  +TH+V+      KV+
Sbjct: 587 RSEVLRGKNLVFSGLVPTQMKLEQSRAYFIAKSLGAEVKPNIDKEITHLVAVNAGTYKVN 646

Query: 77  LGSK 80
              K
Sbjct: 647 AAKK 650


>gi|21483550|gb|AAM52750.1| SD01014p [Drosophila melanogaster]
          Length = 896

 Score = 35.8 bits (81), Expect = 3.5,   Method: Composition-based stats.
 Identities = 19/64 (29%), Positives = 31/64 (48%)

Query: 17  QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVS 76
           + EVL+G  LVFS   P++        + + + LGA     +D  +TH+V+      KV+
Sbjct: 603 RSEVLRGKNLVFSGLVPTQMKLEQSRAYFIAKSLGAEVKPNIDKEITHLVAVNAGTYKVN 662

Query: 77  LGSK 80
              K
Sbjct: 663 AAKK 666


>gi|342320998|gb|EGU12936.1| RNA Polymerase II CTD phosphatase Fcp1, putative [Rhodotorula
           glutinis ATCC 204091]
          Length = 817

 Score = 35.4 bits (80), Expect = 3.8,   Method: Composition-based stats.
 Identities = 23/59 (38%), Positives = 30/59 (50%), Gaps = 4/59 (6%)

Query: 19  EVLKGCKLVFSH--AFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKV 75
           + L+   LVFS   A  S+ P    Y WK+    GA CS +L  S TH+V+N     KV
Sbjct: 509 QTLRDTHLVFSGLVALGSR-PEDSEY-WKLARTFGARCSADLSSSTTHLVANGWGTAKV 565


>gi|307212079|gb|EFN87962.1| RNA polymerase II subunit A C-terminal domain phosphatase
           [Harpegnathos saltator]
          Length = 734

 Score = 35.4 bits (80), Expect = 3.9,   Method: Composition-based stats.
 Identities = 25/85 (29%), Positives = 36/85 (42%), Gaps = 14/85 (16%)

Query: 10  FFCT---ENGQR-----------EVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCS 55
           F+CT    NG+R           +VLKG  L FS   P+    H    +KV    GA  +
Sbjct: 480 FYCTLDKGNGRRSLRDIIPRVRSQVLKGLYLTFSGLIPTHQKLHQSRAYKVARAFGAEVT 539

Query: 56  IELDPSVTHVVSNKCSNEKVSLGSK 80
            +L    TH+V+ +    K +   K
Sbjct: 540 QDLTEKTTHLVAIRKGTAKANAAKK 564


>gi|156549638|ref|XP_001604265.1| PREDICTED: RNA polymerase II subunit A C-terminal domain
           phosphatase-like, partial [Nasonia vitripennis]
          Length = 512

 Score = 35.4 bits (80), Expect = 4.0,   Method: Composition-based stats.
 Identities = 22/68 (32%), Positives = 31/68 (45%)

Query: 17  QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVS 76
           +  VLKG  L FS   P+    H    +KV    GA  S +L    TH+V+ +    KV 
Sbjct: 296 RSRVLKGLCLTFSGLVPNNQKLHQSRAYKVARAFGAQASQDLTEQTTHLVAIQPGTVKVR 355

Query: 77  LGSKGGQV 84
              + G+V
Sbjct: 356 EAKRQGKV 363


>gi|380022133|ref|XP_003694908.1| PREDICTED: LOW QUALITY PROTEIN: RNA polymerase II subunit A
           C-terminal domain phosphatase-like [Apis florea]
          Length = 749

 Score = 35.4 bits (80), Expect = 4.2,   Method: Composition-based stats.
 Identities = 19/66 (28%), Positives = 29/66 (43%)

Query: 19  EVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVSLG 78
           +VLKG  L FS   P+    H    +KV    GA  + +L    TH+V+ +    K +  
Sbjct: 472 QVLKGVHLTFSGLIPTHQKLHQSRAYKVARAFGAEVAQDLSEKTTHLVAIRPGTAKANTA 531

Query: 79  SKGGQV 84
            K   +
Sbjct: 532 KKNSNI 537


>gi|424513770|emb|CCO66392.1| predicted protein [Bathycoccus prasinos]
          Length = 546

 Score = 35.4 bits (80), Expect = 4.5,   Method: Composition-based stats.
 Identities = 25/65 (38%), Positives = 36/65 (55%), Gaps = 3/65 (4%)

Query: 20  VLKGCKLVFSHAFPS--KFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVSL 77
           +LKGC ++ S   PS  + P   H L  V   LGAT +  ++ +VTHV++   + EKV  
Sbjct: 456 LLKGCVILPSGITPSNDERPDR-HPLLLVAVGLGATIATAMNDNVTHVLARADNTEKVKW 514

Query: 78  GSKGG 82
           G K G
Sbjct: 515 GRKRG 519


>gi|157109625|ref|XP_001650754.1| RNA polymerase ii ctd phosphatase [Aedes aegypti]
 gi|108868428|gb|EAT32653.1| AAEL015142-PA, partial [Aedes aegypti]
          Length = 569

 Score = 35.0 bits (79), Expect = 5.4,   Method: Composition-based stats.
 Identities = 20/68 (29%), Positives = 31/68 (45%)

Query: 17  QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVS 76
           + +VL G  LVFS   P+         ++V   LGAT + +  P  TH+V+      KV 
Sbjct: 458 KSQVLVGFNLVFSGLVPNSMKLEESKAYQVARSLGATVTQDFTPDTTHLVAVTFGTSKVH 517

Query: 77  LGSKGGQV 84
              K  ++
Sbjct: 518 NARKNPKI 525


>gi|452820283|gb|EME27327.1| phosphoprotein phosphatase [Galdieria sulphuraria]
          Length = 734

 Score = 35.0 bits (79), Expect = 6.2,   Method: Composition-based stats.
 Identities = 13/50 (26%), Positives = 27/50 (54%)

Query: 17  QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVV 66
           +  VL+ C L F+  F  +    +  +W++ E+ GA C+ ++    TH++
Sbjct: 517 RHRVLRNCYLSFTGIFRLEESPEVSTVWRLAEEFGAICNKQVTSQTTHLI 566


>gi|328792425|ref|XP_623605.2| PREDICTED: RNA polymerase II subunit A C-terminal domain
           phosphatase-like [Apis mellifera]
          Length = 745

 Score = 34.7 bits (78), Expect = 6.9,   Method: Composition-based stats.
 Identities = 19/62 (30%), Positives = 28/62 (45%)

Query: 19  EVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVSLG 78
           +VLKG  L FS   P+    H    +KV    GA  + +L    TH+V+ +    K +  
Sbjct: 469 QVLKGVHLTFSGLIPTHQKLHQSRAYKVARAFGAEVAQDLSKKTTHLVAIRPGTAKANTA 528

Query: 79  SK 80
            K
Sbjct: 529 KK 530


>gi|118784887|ref|XP_314000.3| AGAP005119-PA [Anopheles gambiae str. PEST]
 gi|116128258|gb|EAA09414.3| AGAP005119-PA [Anopheles gambiae str. PEST]
          Length = 822

 Score = 34.7 bits (78), Expect = 7.2,   Method: Composition-based stats.
 Identities = 19/65 (29%), Positives = 30/65 (46%)

Query: 20  VLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVSLGS 79
           VL G KL FS   P+         + +   LGA  +  L+P+ TH+V+      KV+   
Sbjct: 528 VLVGAKLCFSGLIPNNVKLEQSKAYLIARSLGAAVTQNLEPTTTHLVAVTIGTSKVNNAR 587

Query: 80  KGGQV 84
           K  ++
Sbjct: 588 KNPKI 592


>gi|195383304|ref|XP_002050366.1| GJ22116 [Drosophila virilis]
 gi|194145163|gb|EDW61559.1| GJ22116 [Drosophila virilis]
          Length = 703

 Score = 34.7 bits (78), Expect = 7.9,   Method: Composition-based stats.
 Identities = 18/66 (27%), Positives = 32/66 (48%)

Query: 19  EVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVSLG 78
           EVL+G  LVFS   P++        + + + LGA     ++  +TH+V+      KV+  
Sbjct: 413 EVLRGQNLVFSGLVPTQMKLEQSRAYFIAKSLGAEVQSNINKDITHLVAVNAGTYKVNAA 472

Query: 79  SKGGQV 84
            K  ++
Sbjct: 473 KKESKI 478


>gi|357601986|gb|EHJ63229.1| putative RNA polymerase II subunit A C-terminal domain phosphatase
           [Danaus plexippus]
          Length = 683

 Score = 34.7 bits (78), Expect = 8.2,   Method: Composition-based stats.
 Identities = 19/67 (28%), Positives = 32/67 (47%)

Query: 17  QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVS 76
           + +VL G  LVFS   P+         ++V + LGA  + +     TH+V+ +    KV+
Sbjct: 430 KSQVLAGSSLVFSGLVPTHQRLETSRAYQVAKTLGAEVTQDFTDKTTHLVAMRAGTAKVN 489

Query: 77  LGSKGGQ 83
              K G+
Sbjct: 490 ASKKLGE 496


>gi|195029035|ref|XP_001987380.1| GH21892 [Drosophila grimshawi]
 gi|193903380|gb|EDW02247.1| GH21892 [Drosophila grimshawi]
          Length = 889

 Score = 34.7 bits (78), Expect = 8.3,   Method: Composition-based stats.
 Identities = 18/68 (26%), Positives = 32/68 (47%)

Query: 17  QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVS 76
           + EVL+G  LVFS   P++        + + + LGA     ++  +TH+V+      KV+
Sbjct: 595 RSEVLRGQNLVFSGLVPTQMKMEQSRAYFIAKSLGAEVKSNINKDITHLVAVNAGTYKVN 654

Query: 77  LGSKGGQV 84
              K   +
Sbjct: 655 AAKKEANI 662


>gi|313234471|emb|CBY24671.1| unnamed protein product [Oikopleura dioica]
          Length = 614

 Score = 34.3 bits (77), Expect = 8.8,   Method: Composition-based stats.
 Identities = 18/69 (26%), Positives = 31/69 (44%)

Query: 17  QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVS 76
           ++ +LKGC+LVFS   P+      H   K    +GA     +  + TH++  +    K +
Sbjct: 377 RKNILKGCQLVFSGVVPNGCRMEEHRAVKNARAMGAVIHERIQKNTTHLICARPGTAKHN 436

Query: 77  LGSKGGQVF 85
              +   VF
Sbjct: 437 EAKRKANVF 445


>gi|307168754|gb|EFN61749.1| RNA polymerase II subunit A C-terminal domain phosphatase
           [Camponotus floridanus]
          Length = 721

 Score = 34.3 bits (77), Expect = 9.5,   Method: Composition-based stats.
 Identities = 19/68 (27%), Positives = 31/68 (45%)

Query: 17  QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVS 76
           + +VLKG  L FS   P+    H   ++KV    GA  + +L    TH+V+ +    K +
Sbjct: 477 RSQVLKGLCLTFSGLIPTHQKLHQSRVYKVARAFGAEITQDLTEKTTHLVAIRKGTAKAN 536

Query: 77  LGSKGGQV 84
              K   +
Sbjct: 537 AARKDANI 544


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.321    0.133    0.413 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,744,306,824
Number of Sequences: 23463169
Number of extensions: 62356907
Number of successful extensions: 119952
Number of sequences better than 100.0: 171
Number of HSP's better than 100.0 without gapping: 140
Number of HSP's successfully gapped in prelim test: 31
Number of HSP's that attempted gapping in prelim test: 119767
Number of HSP's gapped (non-prelim): 172
length of query: 113
length of database: 8,064,228,071
effective HSP length: 81
effective length of query: 32
effective length of database: 6,163,711,382
effective search space: 197238764224
effective search space used: 197238764224
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 69 (31.2 bits)