BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 033679
(113 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255570505|ref|XP_002526210.1| RNA polymerase II ctd phosphatase, putative [Ricinus communis]
gi|223534449|gb|EEF36151.1| RNA polymerase II ctd phosphatase, putative [Ricinus communis]
Length = 478
Score = 86.3 bits (212), Expect = 2e-15, Method: Composition-based stats.
Identities = 37/58 (63%), Positives = 47/58 (81%)
Query: 17 QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEK 74
+++VLKGCK+VFS FP++F A H+LWK+ EQLGATCS E+DPSVTHVVS + EK
Sbjct: 379 RKDVLKGCKIVFSRVFPTQFQADNHHLWKMAEQLGATCSREVDPSVTHVVSAEAGTEK 436
>gi|296090640|emb|CBI41034.3| unnamed protein product [Vitis vinifera]
Length = 264
Score = 85.5 bits (210), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 37/58 (63%), Positives = 46/58 (79%)
Query: 17 QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEK 74
++EVLKGCK+VFS FP++F A H+LW++ EQLGATC+ ELDPSVTHVVS EK
Sbjct: 167 RKEVLKGCKIVFSRVFPTRFQAENHHLWRMAEQLGATCATELDPSVTHVVSTDAGTEK 224
>gi|359494894|ref|XP_003634864.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
4-like [Vitis vinifera]
Length = 278
Score = 85.1 bits (209), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 37/58 (63%), Positives = 46/58 (79%)
Query: 17 QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEK 74
++EVLKGCK+VFS FP++F A H+LW++ EQLGATC+ ELDPSVTHVVS EK
Sbjct: 181 RKEVLKGCKIVFSRVFPTRFQAENHHLWRMAEQLGATCATELDPSVTHVVSTDAGTEK 238
>gi|147774299|emb|CAN76945.1| hypothetical protein VITISV_002430 [Vitis vinifera]
Length = 641
Score = 84.7 bits (208), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 36/58 (62%), Positives = 46/58 (79%)
Query: 17 QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEK 74
+++VLKGCK+VFS FP++F A H+LW++ EQLGATC+ ELDPSVTHVVS EK
Sbjct: 167 RKDVLKGCKIVFSRVFPTRFQAENHHLWRMAEQLGATCATELDPSVTHVVSTDAGTEK 224
>gi|296088193|emb|CBI35709.3| unnamed protein product [Vitis vinifera]
Length = 638
Score = 84.7 bits (208), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 36/58 (62%), Positives = 46/58 (79%)
Query: 17 QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEK 74
+++VLKGCK+VFS FP++F A H+LW++ EQLGATC+ ELDPSVTHVVS EK
Sbjct: 167 RKDVLKGCKIVFSRVFPTRFQAENHHLWRMAEQLGATCATELDPSVTHVVSTDAGTEK 224
>gi|359497210|ref|XP_003635453.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
4-like [Vitis vinifera]
Length = 278
Score = 84.0 bits (206), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 36/58 (62%), Positives = 46/58 (79%)
Query: 17 QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEK 74
+++VLKGCK+VFS FP++F A H+LW++ EQLGATC+ ELDPSVTHVVS EK
Sbjct: 181 RKDVLKGCKIVFSRVFPTRFQAENHHLWRMAEQLGATCATELDPSVTHVVSTDAGTEK 238
>gi|224142399|ref|XP_002324546.1| predicted protein [Populus trichocarpa]
gi|222865980|gb|EEF03111.1| predicted protein [Populus trichocarpa]
Length = 312
Score = 83.6 bits (205), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 38/67 (56%), Positives = 48/67 (71%)
Query: 17 QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVS 76
+++VLKGCK+VFS FP++ A H+LW++ EQLGATCS ELDPSVTHVVS EK
Sbjct: 221 RKDVLKGCKIVFSRVFPTQSQADNHHLWRMAEQLGATCSTELDPSVTHVVSKDSGTEKSH 280
Query: 77 LGSKGGQ 83
SK +
Sbjct: 281 WASKHNK 287
>gi|449532013|ref|XP_004172979.1| PREDICTED: LOW QUALITY PROTEIN: RNA polymerase II C-terminal domain
phosphatase-like 4-like, partial [Cucumis sativus]
Length = 340
Score = 82.0 bits (201), Expect = 4e-14, Method: Composition-based stats.
Identities = 37/56 (66%), Positives = 42/56 (75%)
Query: 19 EVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEK 74
EVL+GCK+VFS FP+KF A H LWK+VEQLG TCS ELD SVTHVV+ EK
Sbjct: 240 EVLEGCKVVFSRVFPTKFQAENHQLWKMVEQLGGTCSTELDQSVTHVVATDAGTEK 295
>gi|224142401|ref|XP_002324547.1| predicted protein [Populus trichocarpa]
gi|222865981|gb|EEF03112.1| predicted protein [Populus trichocarpa]
Length = 266
Score = 81.6 bits (200), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 36/58 (62%), Positives = 45/58 (77%)
Query: 17 QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEK 74
+++VLKGCK+VFS FP++ A H+LW++ EQLGATCS ELDPSVTHVVS EK
Sbjct: 170 RKDVLKGCKIVFSRVFPTQSQADNHHLWRMAEQLGATCSTELDPSVTHVVSKDSGTEK 227
>gi|449447765|ref|XP_004141638.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
4-like [Cucumis sativus]
Length = 452
Score = 80.9 bits (198), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 37/56 (66%), Positives = 42/56 (75%)
Query: 19 EVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEK 74
EVL+GCK+VFS FP+KF A H LWK+VEQLG TCS ELD SVTHVV+ EK
Sbjct: 352 EVLEGCKVVFSRVFPTKFQAENHQLWKMVEQLGGTCSTELDQSVTHVVATDAGTEK 407
>gi|145334837|ref|NP_001078764.1| RNA polymerase II C-terminal domain phosphatase-like 4 [Arabidopsis
thaliana]
gi|122154038|sp|Q00IB6.1|CPL4_ARATH RecName: Full=RNA polymerase II C-terminal domain phosphatase-like
4; Short=FCP-like 4; AltName: Full=Carboxyl-terminal
phosphatase-like 4; Short=AtCPL4; Short=CTD
phosphatase-like 4
gi|95115186|gb|ABF55959.1| carboxyl-terminal phosphatase-like 4 [Arabidopsis thaliana]
gi|332009601|gb|AED96984.1| RNA polymerase II C-terminal domain phosphatase-like 4 [Arabidopsis
thaliana]
Length = 440
Score = 74.7 bits (182), Expect = 7e-12, Method: Composition-based stats.
Identities = 32/58 (55%), Positives = 42/58 (72%)
Query: 17 QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEK 74
++E+LKGCK+VFS FP+K H LWK+ E+LGATC+ E+D SVTHVV+ EK
Sbjct: 338 RKEILKGCKIVFSRVFPTKAKPEDHPLWKMAEELGATCATEVDASVTHVVAMDVGTEK 395
>gi|297793317|ref|XP_002864543.1| hypothetical protein ARALYDRAFT_332090 [Arabidopsis lyrata subsp.
lyrata]
gi|297310378|gb|EFH40802.1| hypothetical protein ARALYDRAFT_332090 [Arabidopsis lyrata subsp.
lyrata]
Length = 1006
Score = 73.2 bits (178), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 33/58 (56%), Positives = 42/58 (72%)
Query: 17 QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEK 74
++EVLKGCK+VFS FP+K H LWK+ E+LGATC+ E+D SVTHVV+ EK
Sbjct: 903 RKEVLKGCKVVFSRVFPTKAKPEDHPLWKMAEELGATCATEVDASVTHVVAMDVGTEK 960
>gi|9758369|dbj|BAB08870.1| unnamed protein product [Arabidopsis thaliana]
Length = 1065
Score = 73.2 bits (178), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 32/58 (55%), Positives = 42/58 (72%)
Query: 17 QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEK 74
++E+LKGCK+VFS FP+K H LWK+ E+LGATC+ E+D SVTHVV+ EK
Sbjct: 963 RKEILKGCKIVFSRVFPTKAKPEDHPLWKMAEELGATCATEVDASVTHVVAMDVGTEK 1020
>gi|326518250|dbj|BAK07377.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 488
Score = 71.6 bits (174), Expect = 5e-11, Method: Composition-based stats.
Identities = 32/58 (55%), Positives = 40/58 (68%)
Query: 17 QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEK 74
++EVL+GCKLVFS FPS + +WK+ EQLGA C E+DPSVTHVV+ EK
Sbjct: 378 RQEVLQGCKLVFSRVFPSDCRSQDQIMWKMAEQLGAVCCSEVDPSVTHVVAVHAGTEK 435
>gi|224053553|ref|XP_002297869.1| predicted protein [Populus trichocarpa]
gi|222845127|gb|EEE82674.1| predicted protein [Populus trichocarpa]
Length = 1117
Score = 70.1 bits (170), Expect = 2e-10, Method: Composition-based stats.
Identities = 33/92 (35%), Positives = 52/92 (56%), Gaps = 1/92 (1%)
Query: 17 QREVLKGCKLVFSHAFP-SKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKV 75
QR++L GC+++FS FP + H+H LW++ EQ GA C+ ++D VTHVV+N +KV
Sbjct: 1023 QRKILGGCRILFSRVFPVGEVNPHLHPLWQMAEQFGAVCTNQIDEQVTHVVANSLGTDKV 1082
Query: 76 SLGSKGGQVFGGSTVDRGSQLFVARATRREVS 107
+ G++ S L RA ++ S
Sbjct: 1083 NWALSTGRIVVHPGWVEASALLYRRANEQDFS 1114
>gi|449487451|ref|XP_004157633.1| PREDICTED: LOW QUALITY PROTEIN: RNA polymerase II C-terminal domain
phosphatase-like 3-like [Cucumis sativus]
Length = 1249
Score = 67.8 bits (164), Expect = 8e-10, Method: Composition-based stats.
Identities = 33/92 (35%), Positives = 51/92 (55%), Gaps = 1/92 (1%)
Query: 17 QREVLKGCKLVFSHAFP-SKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKV 75
Q+++L GC++VFS FP + H+H LW+ EQ GA C+ ++D VTHVV+N +KV
Sbjct: 1155 QQKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAQCTNQIDEQVTHVVANSLGTDKV 1214
Query: 76 SLGSKGGQVFGGSTVDRGSQLFVARATRREVS 107
+ G+ S L RAT ++ +
Sbjct: 1215 NWALSTGRFVVHPGWVEASALLYRRATEQDFA 1246
>gi|449445782|ref|XP_004140651.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
3-like [Cucumis sativus]
Length = 1249
Score = 67.8 bits (164), Expect = 8e-10, Method: Composition-based stats.
Identities = 33/92 (35%), Positives = 51/92 (55%), Gaps = 1/92 (1%)
Query: 17 QREVLKGCKLVFSHAFP-SKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKV 75
Q+++L GC++VFS FP + H+H LW+ EQ GA C+ ++D VTHVV+N +KV
Sbjct: 1155 QQKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAQCTNQIDEQVTHVVANSLGTDKV 1214
Query: 76 SLGSKGGQVFGGSTVDRGSQLFVARATRREVS 107
+ G+ S L RAT ++ +
Sbjct: 1215 NWALSTGRFVVHPGWVEASALLYRRATEQDFA 1246
>gi|255543174|ref|XP_002512650.1| RNA polymerase II ctd phosphatase, putative [Ricinus communis]
gi|223548611|gb|EEF50102.1| RNA polymerase II ctd phosphatase, putative [Ricinus communis]
Length = 1195
Score = 67.8 bits (164), Expect = 9e-10, Method: Composition-based stats.
Identities = 33/92 (35%), Positives = 50/92 (54%), Gaps = 1/92 (1%)
Query: 17 QREVLKGCKLVFSHAFP-SKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKV 75
QR++L GC++VFS FP + H+H LW+ EQ GA C+ ++D VTHVV+N +KV
Sbjct: 1101 QRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTNQIDEQVTHVVANSLGTDKV 1160
Query: 76 SLGSKGGQVFGGSTVDRGSQLFVARATRREVS 107
+ G+ S L RA ++ +
Sbjct: 1161 NWALSTGRFVVYPGWVEASALLYRRANEQDFA 1192
>gi|357478637|ref|XP_003609604.1| RNA polymerase II C-terminal domain phosphatase-like protein
[Medicago truncatula]
gi|355510659|gb|AES91801.1| RNA polymerase II C-terminal domain phosphatase-like protein
[Medicago truncatula]
Length = 1064
Score = 67.4 bits (163), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 30/68 (44%), Positives = 45/68 (66%), Gaps = 1/68 (1%)
Query: 17 QREVLKGCKLVFSHAFP-SKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKV 75
QR++L GC++VFS FP + H+H LW+ EQ GA+C+ ++DP VTHVV+ +KV
Sbjct: 961 QRKILGGCRIVFSGVFPVGETNPHLHPLWRTAEQFGASCTNKVDPQVTHVVAQSPGTDKV 1020
Query: 76 SLGSKGGQ 83
+ G G+
Sbjct: 1021 NWGISNGK 1028
>gi|356523718|ref|XP_003530482.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
3-like [Glycine max]
Length = 1244
Score = 67.0 bits (162), Expect = 1e-09, Method: Composition-based stats.
Identities = 33/92 (35%), Positives = 50/92 (54%), Gaps = 1/92 (1%)
Query: 17 QREVLKGCKLVFSHAFP-SKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKV 75
QR++L GC++VFS FP + H+H LW+ EQ GA C+ ++D VTHVV+N +KV
Sbjct: 1150 QRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTNQIDEQVTHVVANSPGTDKV 1209
Query: 76 SLGSKGGQVFGGSTVDRGSQLFVARATRREVS 107
+ G+ S L RA ++ +
Sbjct: 1210 NWALNNGRFVVHPGWVEASALLYRRANEQDFA 1241
>gi|356567192|ref|XP_003551805.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
3-like [Glycine max]
Length = 1221
Score = 66.6 bits (161), Expect = 2e-09, Method: Composition-based stats.
Identities = 33/92 (35%), Positives = 50/92 (54%), Gaps = 1/92 (1%)
Query: 17 QREVLKGCKLVFSHAFP-SKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKV 75
QR++L GC++VFS FP + H+H LW+ EQ GA C+ ++D VTHVV+N +KV
Sbjct: 1127 QRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAFCTNQIDEQVTHVVANSPGTDKV 1186
Query: 76 SLGSKGGQVFGGSTVDRGSQLFVARATRREVS 107
+ G+ S L RA ++ +
Sbjct: 1187 NWALNNGRFVVHPGWVEASALLYRRANEQDFA 1218
>gi|357156660|ref|XP_003577532.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
3-like [Brachypodium distachyon]
Length = 1259
Score = 66.6 bits (161), Expect = 2e-09, Method: Composition-based stats.
Identities = 33/92 (35%), Positives = 50/92 (54%), Gaps = 1/92 (1%)
Query: 17 QREVLKGCKLVFSHAFP-SKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKV 75
QR +L GC++VFS FP + H+H LW+ EQ GA C+ ++D VTHVV+N +KV
Sbjct: 1166 QRRILAGCRIVFSRIFPVGEANPHLHPLWQSAEQFGAVCTNQIDDRVTHVVANSLGTDKV 1225
Query: 76 SLGSKGGQVFGGSTVDRGSQLFVARATRREVS 107
+ + G+ S L RA+ + +
Sbjct: 1226 NWALQTGRYVVHPGWVEASALLYRRASEHDFA 1257
>gi|56547717|gb|AAV92930.1| putative transcription regulator CPL1 [Solanum lycopersicum]
Length = 1227
Score = 66.6 bits (161), Expect = 2e-09, Method: Composition-based stats.
Identities = 32/95 (33%), Positives = 51/95 (53%), Gaps = 1/95 (1%)
Query: 17 QREVLKGCKLVFSHAFP-SKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKV 75
Q+++L GC++VFS FP + H+H LW+ EQ GA C+ ++D VTHVV+N +KV
Sbjct: 1133 QKKILAGCRIVFSRVFPVGEASPHLHPLWQTAEQFGAVCTSQIDDQVTHVVANSLGTDKV 1192
Query: 76 SLGSKGGQVFGGSTVDRGSQLFVARATRREVSCEA 110
+ G+ S L RA + + ++
Sbjct: 1193 NWALSTGRSVVHPGWVEASALLYRRANEHDFAIKS 1227
>gi|242093742|ref|XP_002437361.1| hypothetical protein SORBIDRAFT_10g025580 [Sorghum bicolor]
gi|241915584|gb|EER88728.1| hypothetical protein SORBIDRAFT_10g025580 [Sorghum bicolor]
Length = 558
Score = 66.6 bits (161), Expect = 2e-09, Method: Composition-based stats.
Identities = 29/62 (46%), Positives = 39/62 (62%)
Query: 17 QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVS 76
++E+L+GCK+VFS FP+ LWK+ E LGA CS ++D SVTHVV+ EK
Sbjct: 378 RKEILQGCKIVFSRVFPNNTRPQEQMLWKMAEHLGAVCSTDVDSSVTHVVTVDLGTEKAR 437
Query: 77 LG 78
G
Sbjct: 438 WG 439
>gi|359473774|ref|XP_002266931.2| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
3-like [Vitis vinifera]
Length = 1238
Score = 65.9 bits (159), Expect = 3e-09, Method: Composition-based stats.
Identities = 32/92 (34%), Positives = 49/92 (53%), Gaps = 1/92 (1%)
Query: 17 QREVLKGCKLVFSHAFP-SKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKV 75
QR++L GC++VFS FP + H+H LW+ E GA C+ ++D VTHVV+N +KV
Sbjct: 1144 QRKILAGCRIVFSRVFPVGEANPHLHPLWQTAESFGAVCTNQIDEQVTHVVANSLGTDKV 1203
Query: 76 SLGSKGGQVFGGSTVDRGSQLFVARATRREVS 107
+ G+ S L RA ++ +
Sbjct: 1204 NWALSTGRFVVHPGWVEASALLYRRANEQDFA 1235
>gi|296088169|emb|CBI35661.3| unnamed protein product [Vitis vinifera]
Length = 1184
Score = 65.9 bits (159), Expect = 3e-09, Method: Composition-based stats.
Identities = 32/92 (34%), Positives = 49/92 (53%), Gaps = 1/92 (1%)
Query: 17 QREVLKGCKLVFSHAFP-SKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKV 75
QR++L GC++VFS FP + H+H LW+ E GA C+ ++D VTHVV+N +KV
Sbjct: 1090 QRKILAGCRIVFSRVFPVGEANPHLHPLWQTAESFGAVCTNQIDEQVTHVVANSLGTDKV 1149
Query: 76 SLGSKGGQVFGGSTVDRGSQLFVARATRREVS 107
+ G+ S L RA ++ +
Sbjct: 1150 NWALSTGRFVVHPGWVEASALLYRRANEQDFA 1181
>gi|224075473|ref|XP_002304648.1| predicted protein [Populus trichocarpa]
gi|222842080|gb|EEE79627.1| predicted protein [Populus trichocarpa]
Length = 238
Score = 65.9 bits (159), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 33/94 (35%), Positives = 51/94 (54%), Gaps = 1/94 (1%)
Query: 17 QREVLKGCKLVFSHAFP-SKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKV 75
QR++L GC++VFS FP + H+H LW+ EQ GA C+ ++D VTHVV+N +KV
Sbjct: 144 QRKILAGCRIVFSRVFPVGEVNPHLHPLWQSAEQFGAVCTNQIDEQVTHVVANSLGTDKV 203
Query: 76 SLGSKGGQVFGGSTVDRGSQLFVARATRREVSCE 109
+ G+ S L RA ++ + +
Sbjct: 204 NWALSTGRFVVHPGWVEASALLYRRANEQDFAIK 237
>gi|357502711|ref|XP_003621644.1| RNA polymerase II C-terminal domain phosphatase-like protein
[Medicago truncatula]
gi|355496659|gb|AES77862.1| RNA polymerase II C-terminal domain phosphatase-like protein
[Medicago truncatula]
Length = 1213
Score = 65.5 bits (158), Expect = 4e-09, Method: Composition-based stats.
Identities = 32/97 (32%), Positives = 53/97 (54%), Gaps = 1/97 (1%)
Query: 17 QREVLKGCKLVFSHAFP-SKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKV 75
QR++L GC++VFS FP H+H LW+ EQ GA+C+ ++D VTHVV++ +KV
Sbjct: 1117 QRKILDGCRIVFSRMFPVGDANPHLHPLWQTAEQFGASCTNQIDDQVTHVVAHSPGTDKV 1176
Query: 76 SLGSKGGQVFGGSTVDRGSQLFVARATRREVSCEANQ 112
+ G+ S L RA ++ + + ++
Sbjct: 1177 NWAIANGKFVVHPGWVEASALLYRRANEQDFAIKLDK 1213
>gi|224091747|ref|XP_002309339.1| predicted protein [Populus trichocarpa]
gi|222855315|gb|EEE92862.1| predicted protein [Populus trichocarpa]
Length = 204
Score = 65.5 bits (158), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 27/41 (65%), Positives = 35/41 (85%)
Query: 17 QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIE 57
+R+VLKGCK+VFS FP++F A H+LW++VEQLGATCS E
Sbjct: 119 RRDVLKGCKIVFSRVFPTQFQADNHHLWRMVEQLGATCSTE 159
>gi|30685744|ref|NP_180912.2| RNA polymerase II C-terminal domain phosphatase-like 3 [Arabidopsis
thaliana]
gi|238055326|sp|Q8LL04.2|CPL3_ARATH RecName: Full=RNA polymerase II C-terminal domain phosphatase-like 3;
Short=FCP-like 3; AltName: Full=Carboxyl-terminal
phosphatase-like 3; Short=AtCPL3; Short=CTD
phosphatase-like 3
gi|330253756|gb|AEC08850.1| RNA polymerase II C-terminal domain phosphatase-like 3 [Arabidopsis
thaliana]
Length = 1241
Score = 65.1 bits (157), Expect = 6e-09, Method: Composition-based stats.
Identities = 27/61 (44%), Positives = 40/61 (65%), Gaps = 1/61 (1%)
Query: 17 QREVLKGCKLVFSHAFP-SKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKV 75
QR++L GC++VFS P + H+H LW+ EQ GA C+ ++D VTHVV+N +KV
Sbjct: 1147 QRKILAGCRIVFSRIIPVGEAKPHLHPLWQTAEQFGAVCTTQVDEHVTHVVTNSLGTDKV 1206
Query: 76 S 76
+
Sbjct: 1207 N 1207
>gi|297826809|ref|XP_002881287.1| hypothetical protein ARALYDRAFT_482300 [Arabidopsis lyrata subsp.
lyrata]
gi|297327126|gb|EFH57546.1| hypothetical protein ARALYDRAFT_482300 [Arabidopsis lyrata subsp.
lyrata]
Length = 1248
Score = 65.1 bits (157), Expect = 6e-09, Method: Composition-based stats.
Identities = 27/61 (44%), Positives = 40/61 (65%), Gaps = 1/61 (1%)
Query: 17 QREVLKGCKLVFSHAFP-SKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKV 75
QR++L GC++VFS P + H+H LW+ EQ GA C+ ++D VTHVV+N +KV
Sbjct: 1154 QRKILAGCRIVFSRIIPVGEAKPHLHPLWQTAEQFGAVCTTQVDEHVTHVVTNSLGTDKV 1213
Query: 76 S 76
+
Sbjct: 1214 N 1214
>gi|22212705|gb|AAM94371.1|AF486633_1 CTD phosphatase-like 3 [Arabidopsis thaliana]
Length = 1241
Score = 65.1 bits (157), Expect = 6e-09, Method: Composition-based stats.
Identities = 27/61 (44%), Positives = 40/61 (65%), Gaps = 1/61 (1%)
Query: 17 QREVLKGCKLVFSHAFP-SKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKV 75
QR++L GC++VFS P + H+H LW+ EQ GA C+ ++D VTHVV+N +KV
Sbjct: 1147 QRKILAGCRIVFSRIIPVGEAKPHLHPLWQTAEQFGAVCTTQVDEHVTHVVTNSLGTDKV 1206
Query: 76 S 76
+
Sbjct: 1207 N 1207
>gi|413945235|gb|AFW77884.1| CPL3 [Zea mays]
Length = 533
Score = 64.7 bits (156), Expect = 6e-09, Method: Composition-based stats.
Identities = 28/62 (45%), Positives = 39/62 (62%)
Query: 17 QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVS 76
++E+L+GCK+VFS FP+ +WK+ E LGA C ++DPSVTHVV+ EK
Sbjct: 376 RKEILQGCKIVFSRVFPNNTRPQEQMVWKMAEYLGAVCVKDVDPSVTHVVTVDLGTEKAR 435
Query: 77 LG 78
G
Sbjct: 436 WG 437
>gi|242087817|ref|XP_002439741.1| hypothetical protein SORBIDRAFT_09g019310 [Sorghum bicolor]
gi|241945026|gb|EES18171.1| hypothetical protein SORBIDRAFT_09g019310 [Sorghum bicolor]
Length = 547
Score = 64.7 bits (156), Expect = 7e-09, Method: Composition-based stats.
Identities = 28/62 (45%), Positives = 39/62 (62%)
Query: 17 QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVS 76
++E+L+GCK+VFS FP+ +WK+ E LGA CS ++D SVTHVV+ EK
Sbjct: 381 RKEILQGCKIVFSRVFPNNTRPQKQMVWKMAEYLGAVCSTDVDSSVTHVVTVDLGTEKAR 440
Query: 77 LG 78
G
Sbjct: 441 WG 442
>gi|242068555|ref|XP_002449554.1| hypothetical protein SORBIDRAFT_05g019010 [Sorghum bicolor]
gi|241935397|gb|EES08542.1| hypothetical protein SORBIDRAFT_05g019010 [Sorghum bicolor]
Length = 1197
Score = 64.3 bits (155), Expect = 8e-09, Method: Composition-based stats.
Identities = 37/93 (39%), Positives = 50/93 (53%), Gaps = 3/93 (3%)
Query: 17 QREVLKGCKLVFSHAFP-SKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKV 75
QR +L GC++VFS FP H+H LW+ EQ GA C+ +D VTHVV+N +KV
Sbjct: 1104 QRRILAGCRIVFSRVFPVGDASPHLHPLWQTAEQFGAVCTNLVDDRVTHVVANSPGTDKV 1163
Query: 76 SLG-SKGGQVFGGSTVDRGSQLFVARATRREVS 107
+ SKG V V+ S L RA + +
Sbjct: 1164 NWALSKGKFVVHPGWVE-ASALLYRRANEHDFA 1195
>gi|226497696|ref|NP_001152445.1| CPL3 [Zea mays]
gi|195656359|gb|ACG47647.1| CPL3 [Zea mays]
Length = 531
Score = 64.3 bits (155), Expect = 1e-08, Method: Composition-based stats.
Identities = 28/62 (45%), Positives = 39/62 (62%)
Query: 17 QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVS 76
++E+L+GCK+VFS FP+ +WK+ E LGA C ++DPSVTHVV+ EK
Sbjct: 374 RKEILQGCKIVFSRVFPNNTRPQEQMVWKMAEYLGAVCVKDVDPSVTHVVTVDLGTEKSR 433
Query: 77 LG 78
G
Sbjct: 434 WG 435
>gi|413920930|gb|AFW60862.1| hypothetical protein ZEAMMB73_799152, partial [Zea mays]
Length = 1234
Score = 63.9 bits (154), Expect = 1e-08, Method: Composition-based stats.
Identities = 36/93 (38%), Positives = 50/93 (53%), Gaps = 3/93 (3%)
Query: 17 QREVLKGCKLVFSHAFP-SKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKV 75
QR +L GC++VFS FP H+H LW+ EQ GA C+ +D VTH+V+N +KV
Sbjct: 1143 QRRILTGCRIVFSRVFPVGDASPHLHPLWQTAEQFGAVCTNLVDDRVTHIVANSPGTDKV 1202
Query: 76 SLG-SKGGQVFGGSTVDRGSQLFVARATRREVS 107
+ SKG V V+ S L RA + +
Sbjct: 1203 NWALSKGKFVVHPGWVE-ASALLYRRANEHDFA 1234
>gi|326532556|dbj|BAK05207.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 891
Score = 63.9 bits (154), Expect = 1e-08, Method: Composition-based stats.
Identities = 32/92 (34%), Positives = 48/92 (52%), Gaps = 1/92 (1%)
Query: 17 QREVLKGCKLVFSHAFP-SKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKV 75
QR +L GC++VFS FP + +H LW+ EQ GA C+ ++D VTHVV+N +KV
Sbjct: 798 QRRILAGCRIVFSRIFPVGEANPQLHPLWQTAEQFGAVCTNQIDDRVTHVVANSLGTDKV 857
Query: 76 SLGSKGGQVFGGSTVDRGSQLFVARATRREVS 107
+ + G+ S L RA + +
Sbjct: 858 NWALQTGRFVVHPGWVEASALLYRRANEHDFA 889
>gi|77551160|gb|ABA93957.1| NLI interacting factor-like phosphatase family protein, expressed
[Oryza sativa Japonica Group]
Length = 1272
Score = 63.9 bits (154), Expect = 1e-08, Method: Composition-based stats.
Identities = 32/87 (36%), Positives = 47/87 (54%), Gaps = 1/87 (1%)
Query: 17 QREVLKGCKLVFSHAFP-SKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKV 75
Q+ +L GC++VFS FP + H+H LW+ EQ GA C+ ++D VTHVV+N +KV
Sbjct: 1179 QQRILGGCRIVFSRIFPVGEANPHMHPLWQTAEQFGAVCTNQIDDRVTHVVANSLGTDKV 1238
Query: 76 SLGSKGGQVFGGSTVDRGSQLFVARAT 102
+ G+ S L RA+
Sbjct: 1239 NWALSTGRFVVHPGWVEASALLYRRAS 1265
>gi|222616055|gb|EEE52187.1| hypothetical protein OsJ_34058 [Oryza sativa Japonica Group]
Length = 1267
Score = 63.9 bits (154), Expect = 1e-08, Method: Composition-based stats.
Identities = 32/87 (36%), Positives = 47/87 (54%), Gaps = 1/87 (1%)
Query: 17 QREVLKGCKLVFSHAFP-SKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKV 75
Q+ +L GC++VFS FP + H+H LW+ EQ GA C+ ++D VTHVV+N +KV
Sbjct: 1174 QQRILGGCRIVFSRIFPVGEANPHMHPLWQTAEQFGAVCTNQIDDRVTHVVANSLGTDKV 1233
Query: 76 SLGSKGGQVFGGSTVDRGSQLFVARAT 102
+ G+ S L RA+
Sbjct: 1234 NWALSTGRFVVHPGWVEASALLYRRAS 1260
>gi|218185830|gb|EEC68257.1| hypothetical protein OsI_36281 [Oryza sativa Indica Group]
Length = 1255
Score = 63.9 bits (154), Expect = 1e-08, Method: Composition-based stats.
Identities = 32/87 (36%), Positives = 47/87 (54%), Gaps = 1/87 (1%)
Query: 17 QREVLKGCKLVFSHAFP-SKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKV 75
Q+ +L GC++VFS FP + H+H LW+ EQ GA C+ ++D VTHVV+N +KV
Sbjct: 1162 QQRILGGCRIVFSRIFPVGEANPHMHPLWQTAEQFGAVCTNQIDDRVTHVVANSLGTDKV 1221
Query: 76 SLGSKGGQVFGGSTVDRGSQLFVARAT 102
+ G+ S L RA+
Sbjct: 1222 NWALSTGRFVVHPGWVEASALLYRRAS 1248
>gi|115485681|ref|NP_001067984.1| Os11g0521900 [Oryza sativa Japonica Group]
gi|113645206|dbj|BAF28347.1| Os11g0521900 [Oryza sativa Japonica Group]
Length = 664
Score = 63.9 bits (154), Expect = 1e-08, Method: Composition-based stats.
Identities = 32/87 (36%), Positives = 47/87 (54%), Gaps = 1/87 (1%)
Query: 17 QREVLKGCKLVFSHAFP-SKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKV 75
Q+ +L GC++VFS FP + H+H LW+ EQ GA C+ ++D VTHVV+N +KV
Sbjct: 571 QQRILGGCRIVFSRIFPVGEANPHMHPLWQTAEQFGAVCTNQIDDRVTHVVANSLGTDKV 630
Query: 76 SLGSKGGQVFGGSTVDRGSQLFVARAT 102
+ G+ S L RA+
Sbjct: 631 NWALSTGRFVVHPGWVEASALLYRRAS 657
>gi|357129281|ref|XP_003566293.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
4-like [Brachypodium distachyon]
Length = 492
Score = 63.2 bits (152), Expect = 2e-08, Method: Composition-based stats.
Identities = 28/58 (48%), Positives = 38/58 (65%)
Query: 17 QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEK 74
++EVL+GCKLVFS FPS +WK+ E+LGA+C +D +VTHVV+ EK
Sbjct: 377 RQEVLQGCKLVFSRVFPSNSCPQDQIIWKMAEKLGASCCAHVDSTVTHVVAVDVGTEK 434
>gi|168018017|ref|XP_001761543.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162687227|gb|EDQ73611.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1984
Score = 62.4 bits (150), Expect = 3e-08, Method: Composition-based stats.
Identities = 27/68 (39%), Positives = 43/68 (63%), Gaps = 1/68 (1%)
Query: 17 QREVLKGCKLVFSHAFP-SKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKV 75
QR VL GC+++FS FP + H+H LW++ EQ GA+C + ++ VTHVV+ +KV
Sbjct: 1723 QRRVLDGCRVLFSRIFPVGEANPHLHPLWRLAEQFGASCCLHINDKVTHVVAISLGTDKV 1782
Query: 76 SLGSKGGQ 83
+ + G+
Sbjct: 1783 NWAAATGR 1790
>gi|357163276|ref|XP_003579679.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
4-like [Brachypodium distachyon]
Length = 493
Score = 62.4 bits (150), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 28/58 (48%), Positives = 38/58 (65%)
Query: 17 QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEK 74
++EVL+GCK+VFS FPS +WK+ EQLGA C ++D +VTHVV+ EK
Sbjct: 377 RQEVLQGCKVVFSRVFPSSSRPQDQIIWKMAEQLGAICCADMDSTVTHVVAVDSGTEK 434
>gi|218196729|gb|EEC79156.1| hypothetical protein OsI_19829 [Oryza sativa Indica Group]
Length = 574
Score = 62.0 bits (149), Expect = 4e-08, Method: Composition-based stats.
Identities = 28/58 (48%), Positives = 37/58 (63%)
Query: 17 QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEK 74
++EVL+GCKLVF+ FP LWK+ EQLGA C ++D +VTHVV+ EK
Sbjct: 410 RQEVLQGCKLVFTRVFPLHQRQQDQMLWKMAEQLGAVCCTDVDSTVTHVVALDLGTEK 467
>gi|218196728|gb|EEC79155.1| hypothetical protein OsI_19828 [Oryza sativa Indica Group]
Length = 430
Score = 61.6 bits (148), Expect = 5e-08, Method: Composition-based stats.
Identities = 28/58 (48%), Positives = 37/58 (63%)
Query: 17 QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEK 74
++EVL+GCKLVF+ FP LWK+ EQLGA C ++D +VTHVV+ EK
Sbjct: 167 RQEVLQGCKLVFTRVFPLHQRPQDQMLWKMAEQLGAVCCTDVDSTVTHVVALDLGTEK 224
>gi|168040198|ref|XP_001772582.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162676137|gb|EDQ62624.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1881
Score = 61.2 bits (147), Expect = 7e-08, Method: Composition-based stats.
Identities = 27/68 (39%), Positives = 43/68 (63%), Gaps = 1/68 (1%)
Query: 17 QREVLKGCKLVFSHAFP-SKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKV 75
QR VL GC+++FS FP + H+H LW++ EQ GA+C + ++ VTHVV+ +KV
Sbjct: 1769 QRRVLDGCRVLFSRIFPVGEANPHLHPLWRLAEQFGASCCLYINDKVTHVVAISLGTDKV 1828
Query: 76 SLGSKGGQ 83
+ + G+
Sbjct: 1829 NWATATGR 1836
>gi|115463681|ref|NP_001055440.1| Os05g0390500 [Oryza sativa Japonica Group]
gi|57863785|gb|AAS86390.2| unknown protein [Oryza sativa Japonica Group]
gi|113578991|dbj|BAF17354.1| Os05g0390500 [Oryza sativa Japonica Group]
gi|215695102|dbj|BAG90293.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222631469|gb|EEE63601.1| hypothetical protein OsJ_18418 [Oryza sativa Japonica Group]
Length = 536
Score = 60.8 bits (146), Expect = 8e-08, Method: Composition-based stats.
Identities = 27/58 (46%), Positives = 37/58 (63%)
Query: 17 QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEK 74
++EVL+GCKLVF+ FP +WK+ EQLGA C ++D +VTHVV+ EK
Sbjct: 384 RQEVLQGCKLVFTRVFPLHQRQQDQMIWKMAEQLGAVCCTDVDSTVTHVVALDLGTEK 441
>gi|302761896|ref|XP_002964370.1| hypothetical protein SELMODRAFT_405568 [Selaginella moellendorffii]
gi|300168099|gb|EFJ34703.1| hypothetical protein SELMODRAFT_405568 [Selaginella moellendorffii]
Length = 766
Score = 59.7 bits (143), Expect = 2e-07, Method: Composition-based stats.
Identities = 29/90 (32%), Positives = 45/90 (50%), Gaps = 1/90 (1%)
Query: 17 QREVLKGCKLVFSHAFP-SKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKV 75
QR +L GCK++FS FP + +H LW++ EQ GA C+ ++ VTHVV+ +K
Sbjct: 672 QRRILGGCKIIFSRVFPVEETQPQLHPLWRMAEQFGAVCTTRMEEDVTHVVAISMGTDKS 731
Query: 76 SLGSKGGQVFGGSTVDRGSQLFVARATRRE 105
+ G+ S + RA R+
Sbjct: 732 NWALATGRFLVRPAWVEASTVLYRRANERD 761
>gi|302768485|ref|XP_002967662.1| hypothetical protein SELMODRAFT_440109 [Selaginella moellendorffii]
gi|300164400|gb|EFJ31009.1| hypothetical protein SELMODRAFT_440109 [Selaginella moellendorffii]
Length = 762
Score = 59.7 bits (143), Expect = 2e-07, Method: Composition-based stats.
Identities = 29/90 (32%), Positives = 45/90 (50%), Gaps = 1/90 (1%)
Query: 17 QREVLKGCKLVFSHAFP-SKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKV 75
QR +L GCK++FS FP + +H LW++ EQ GA C+ ++ VTHVV+ +K
Sbjct: 668 QRRILGGCKIIFSRVFPVEETQPQLHPLWRMAEQFGAVCTTRMEEDVTHVVAISMGTDKS 727
Query: 76 SLGSKGGQVFGGSTVDRGSQLFVARATRRE 105
+ G+ S + RA R+
Sbjct: 728 NWALATGRFLVRPAWVEASTVLYRRANERD 757
>gi|356498756|ref|XP_003518215.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
4-like [Glycine max]
Length = 428
Score = 57.8 bits (138), Expect = 7e-07, Method: Composition-based stats.
Identities = 28/58 (48%), Positives = 36/58 (62%), Gaps = 4/58 (6%)
Query: 17 QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEK 74
+REVL GC ++FS P+ L K+ EQ+GATC E+DPSVTHVV+ EK
Sbjct: 337 RREVLSGCVIIFSRIVHGAIPS----LRKMAEQMGATCLTEIDPSVTHVVATDAGTEK 390
>gi|356564913|ref|XP_003550691.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
4-like [Glycine max]
Length = 442
Score = 57.4 bits (137), Expect = 9e-07, Method: Composition-based stats.
Identities = 28/58 (48%), Positives = 36/58 (62%), Gaps = 4/58 (6%)
Query: 17 QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEK 74
+REVL GC ++FS P+ L K+ EQ+GATC E+DPSVTHVV+ EK
Sbjct: 351 RREVLSGCVIIFSRIVHGAIPS----LRKMAEQMGATCLTEIDPSVTHVVATDAGTEK 404
>gi|242063380|ref|XP_002452979.1| hypothetical protein SORBIDRAFT_04g035920 [Sorghum bicolor]
gi|241932810|gb|EES05955.1| hypothetical protein SORBIDRAFT_04g035920 [Sorghum bicolor]
Length = 518
Score = 55.5 bits (132), Expect = 5e-06, Method: Composition-based stats.
Identities = 24/51 (47%), Positives = 34/51 (66%)
Query: 17 QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVS 67
+ EVL+GC + FS P + A H +WK+ EQLGA C+ + D +VTHVV+
Sbjct: 421 RSEVLRGCTVAFSRVIPLEGVAGDHPMWKLAEQLGAVCTADADATVTHVVA 471
>gi|125541462|gb|EAY87857.1| hypothetical protein OsI_09279 [Oryza sativa Indica Group]
Length = 390
Score = 54.7 bits (130), Expect = 7e-06, Method: Composition-based stats.
Identities = 29/70 (41%), Positives = 41/70 (58%), Gaps = 7/70 (10%)
Query: 17 QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVS 76
+REVL+GC + F+ A S H +W+ EQLGATC+ ++ P+VTHVV+ + K
Sbjct: 300 RREVLRGCTVAFTRAIASD---DHHSVWRRTEQLGATCADDVGPAVTHVVATNPTTFKAV 356
Query: 77 LGSKGGQVFG 86
QVFG
Sbjct: 357 W----AQVFG 362
>gi|403161615|ref|XP_003321927.2| hypothetical protein PGTG_03464 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375171855|gb|EFP77508.2| hypothetical protein PGTG_03464 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 423
Score = 53.5 bits (127), Expect = 1e-05, Method: Composition-based stats.
Identities = 24/62 (38%), Positives = 34/62 (54%)
Query: 17 QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVS 76
+ +VL G L FS +P + + Y WK+ EQ GA C L P VTH+++ K KV+
Sbjct: 101 KHDVLHGLHLAFSSLWPMEAVSEQQYAWKLAEQFGARCYTHLSPKVTHLIAAKLGTSKVN 160
Query: 77 LG 78
L
Sbjct: 161 LA 162
>gi|291001899|ref|XP_002683516.1| TFIIF CTD phosphatase Fcp1 [Naegleria gruberi]
gi|284097145|gb|EFC50772.1| TFIIF CTD phosphatase Fcp1 [Naegleria gruberi]
Length = 592
Score = 52.0 bits (123), Expect = 4e-05, Method: Composition-based stats.
Identities = 22/59 (37%), Positives = 35/59 (59%)
Query: 17 QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKV 75
++++LKG +VFS P K H WK+ LGA C ++ P++TH+V+ + EKV
Sbjct: 427 KKDILKGAHIVFSGVIPLKQQPETHIDWKIATDLGAKCYTDITPNMTHLVARQKGTEKV 485
>gi|307111295|gb|EFN59530.1| hypothetical protein CHLNCDRAFT_138191 [Chlorella variabilis]
Length = 1156
Score = 52.0 bits (123), Expect = 5e-05, Method: Composition-based stats.
Identities = 23/59 (38%), Positives = 33/59 (55%)
Query: 17 QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKV 75
+++VL G LVF+ P + H LW++ + GA CS LD S THV++ EKV
Sbjct: 601 RQKVLAGVHLVFTRVIPLEMEPESHPLWRLAQSFGARCSGSLDASTTHVIAGASGTEKV 659
>gi|440804367|gb|ELR25244.1| FCP1like phosphatase, phosphatase subfamily protein [Acanthamoeba
castellanii str. Neff]
Length = 930
Score = 52.0 bits (123), Expect = 5e-05, Method: Composition-based stats.
Identities = 27/76 (35%), Positives = 38/76 (50%)
Query: 9 IFFCTENGQREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSN 68
I +C +R VL+G + FS FP+ LW++ E+ GA CS P TH+V+
Sbjct: 601 IKYCLHVQRRRVLEGVHICFSSIFPTGSKPESTPLWRLSEEFGACCSNVFTPETTHLVAL 660
Query: 69 KCSNEKVSLGSKGGQV 84
EKV L + G V
Sbjct: 661 NERTEKVKLAHERGGV 676
>gi|125541461|gb|EAY87856.1| hypothetical protein OsI_09278 [Oryza sativa Indica Group]
Length = 420
Score = 51.2 bits (121), Expect = 7e-05, Method: Composition-based stats.
Identities = 25/56 (44%), Positives = 35/56 (62%), Gaps = 2/56 (3%)
Query: 16 GQREVLKGCKLVFSHAFPSK--FPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNK 69
+REVL+GC + F+ PS A H +W+ EQLGATC+ ++ VTHVV+ K
Sbjct: 322 ARREVLRGCTVAFTGVIPSGDGGRASDHPVWRKAEQLGATCADDVGEGVTHVVAGK 377
>gi|255081919|ref|XP_002508178.1| predicted protein [Micromonas sp. RCC299]
gi|226523454|gb|ACO69436.1| predicted protein [Micromonas sp. RCC299]
Length = 318
Score = 51.2 bits (121), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 23/67 (34%), Positives = 35/67 (52%)
Query: 17 QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVS 76
+++VL G LVFS FP P H +W++ EQ GA C + P+ +HVV+ K
Sbjct: 222 KKKVLAGTGLVFSGVFPLDAPPHEQKMWRLAEQFGARCETQPGPNTSHVVAKTWGTGKCQ 281
Query: 77 LGSKGGQ 83
+ G+
Sbjct: 282 WAKENGR 288
>gi|426201370|gb|EKV51293.1| hypothetical protein AGABI2DRAFT_114027 [Agaricus bisporus var.
bisporus H97]
Length = 814
Score = 50.8 bits (120), Expect = 1e-04, Method: Composition-based stats.
Identities = 24/68 (35%), Positives = 33/68 (48%)
Query: 17 QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVS 76
+ EVL G LVFS P P W++ GA C +L P VTHV++ K +KV
Sbjct: 510 RSEVLGGLHLVFSGVIPLDTPPETTEFWRLARMFGAKCHTDLTPDVTHVITAKRGTKKVE 569
Query: 77 LGSKGGQV 84
+ G +
Sbjct: 570 TARQRGGI 577
>gi|409083591|gb|EKM83948.1| hypothetical protein AGABI1DRAFT_124274 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 853
Score = 50.8 bits (120), Expect = 1e-04, Method: Composition-based stats.
Identities = 24/68 (35%), Positives = 33/68 (48%)
Query: 17 QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVS 76
+ EVL G LVFS P P W++ GA C +L P VTHV++ K +KV
Sbjct: 549 RSEVLGGLHLVFSGVIPLDTPPETTEFWRLARMFGAKCHTDLTPDVTHVITAKRGTKKVE 608
Query: 77 LGSKGGQV 84
+ G +
Sbjct: 609 TARQRGGI 616
>gi|357450477|ref|XP_003595515.1| RNA polymerase II C-terminal domain phosphatase-like protein
[Medicago truncatula]
gi|355484563|gb|AES65766.1| RNA polymerase II C-terminal domain phosphatase-like protein
[Medicago truncatula]
Length = 382
Score = 50.4 bits (119), Expect = 1e-04, Method: Composition-based stats.
Identities = 26/56 (46%), Positives = 35/56 (62%), Gaps = 3/56 (5%)
Query: 19 EVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEK 74
EVL GC +VFS AF + L ++ E+LGATC EL P+VTH V+N+ E+
Sbjct: 292 EVLSGCIIVFSCAFNGH---DLRKLRRIAERLGATCLTELGPTVTHAVANELVTEE 344
>gi|392570766|gb|EIW63938.1| hypothetical protein TRAVEDRAFT_111329 [Trametes versicolor
FP-101664 SS1]
Length = 900
Score = 50.4 bits (119), Expect = 1e-04, Method: Composition-based stats.
Identities = 23/66 (34%), Positives = 31/66 (46%)
Query: 19 EVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVSLG 78
E L GC ++FS P +WK GA C EL P +THVV+ K +KV
Sbjct: 581 ETLDGCHILFSSVIPLDTRPEATEIWKTAHAFGAKCYTELSPRITHVVAAKRGTQKVDAA 640
Query: 79 SKGGQV 84
+ G +
Sbjct: 641 RRRGGI 646
>gi|427782099|gb|JAA56501.1| Putative rna polymerase ii ctd phosphatase [Rhipicephalus
pulchellus]
Length = 360
Score = 50.1 bits (118), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 26/60 (43%), Positives = 36/60 (60%)
Query: 17 QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVS 76
+R+VLKG +VFS P PA W+V + LGAT S +L P VTH+V+ + KV+
Sbjct: 20 RRKVLKGVHIVFSGVVPMNQPAEKSQAWQVAKSLGATVSRDLCPGVTHLVAARLGTAKVN 79
>gi|302764346|ref|XP_002965594.1| hypothetical protein SELMODRAFT_167775 [Selaginella moellendorffii]
gi|300166408|gb|EFJ33014.1| hypothetical protein SELMODRAFT_167775 [Selaginella moellendorffii]
Length = 411
Score = 50.1 bits (118), Expect = 2e-04, Method: Composition-based stats.
Identities = 24/52 (46%), Positives = 32/52 (61%), Gaps = 1/52 (1%)
Query: 17 QREVLKGCKLVFSHAFPSK-FPAHIHYLWKVVEQLGATCSIELDPSVTHVVS 67
+ E+L GCKLVFS FP+ + LW++ LGA C + D SVTHVV+
Sbjct: 288 RSEILSGCKLVFSRIFPTDCLEPELTPLWRLCVDLGAECVLAHDDSVTHVVA 339
>gi|328772741|gb|EGF82779.1| hypothetical protein BATDEDRAFT_22917 [Batrachochytrium
dendrobatidis JAM81]
Length = 868
Score = 49.7 bits (117), Expect = 2e-04, Method: Composition-based stats.
Identities = 21/64 (32%), Positives = 32/64 (50%)
Query: 17 QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVS 76
+R +L+G ++F+ P H W GA C ++LDP VTHV++ K KV+
Sbjct: 530 KRSILEGVHILFTSIIPLGLEPQKHEHWIAATSYGAVCHVDLDPEVTHVIAGKTGTAKVN 589
Query: 77 LGSK 80
K
Sbjct: 590 AARK 593
>gi|302698337|ref|XP_003038847.1| hypothetical protein SCHCODRAFT_255670 [Schizophyllum commune H4-8]
gi|300112544|gb|EFJ03945.1| hypothetical protein SCHCODRAFT_255670 [Schizophyllum commune H4-8]
Length = 1207
Score = 49.7 bits (117), Expect = 2e-04, Method: Composition-based stats.
Identities = 22/65 (33%), Positives = 29/65 (44%)
Query: 20 VLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVSLGS 79
V +GC + FS P H W++ GA C L P VTHVV+ K KV
Sbjct: 892 VFQGCHICFSSVIPLDIQPESHECWRIANMFGARCHATLAPEVTHVVAGKQGTAKVDEAR 951
Query: 80 KGGQV 84
+ G +
Sbjct: 952 RRGNI 956
>gi|47497024|dbj|BAD19077.1| phosphatase-like [Oryza sativa Japonica Group]
gi|47497233|dbj|BAD19278.1| phosphatase-like [Oryza sativa Japonica Group]
gi|125584004|gb|EAZ24935.1| hypothetical protein OsJ_08715 [Oryza sativa Japonica Group]
Length = 420
Score = 49.3 bits (116), Expect = 3e-04, Method: Composition-based stats.
Identities = 24/56 (42%), Positives = 34/56 (60%), Gaps = 2/56 (3%)
Query: 16 GQREVLKGCKLVFSHAFPSK--FPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNK 69
+REVL+GC + F+ PS A H +W+ EQLGATC+ ++ VTH V+ K
Sbjct: 322 ARREVLRGCTVAFTGVIPSGDGGRASDHPVWRRAEQLGATCADDVGEGVTHFVAGK 377
>gi|409051930|gb|EKM61406.1| hypothetical protein PHACADRAFT_204575 [Phanerochaete carnosa
HHB-10118-sp]
Length = 863
Score = 48.9 bits (115), Expect = 3e-04, Method: Composition-based stats.
Identities = 22/68 (32%), Positives = 33/68 (48%)
Query: 17 QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVS 76
++ L GC +VFS P A W++ GA C EL+P +TH+++ K KV
Sbjct: 560 RQNALAGCHVVFSSVIPLDTRAETSETWRIAVMFGAKCYTELNPRITHLIAAKRGTAKVD 619
Query: 77 LGSKGGQV 84
+ G V
Sbjct: 620 AARRQGGV 627
>gi|353236741|emb|CCA68729.1| related to FCP1-TFIIF interacting component of CTD phosphatase
[Piriformospora indica DSM 11827]
Length = 782
Score = 48.9 bits (115), Expect = 4e-04, Method: Composition-based stats.
Identities = 23/67 (34%), Positives = 33/67 (49%)
Query: 19 EVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVSLG 78
+ L G LVFS P +WK + GATC ++++P VTH+V+NK K
Sbjct: 473 KTLAGVHLVFSGILPLDGRPERQPIWKAALEFGATCHVDINPQVTHLVTNKLGTVKADKA 532
Query: 79 SKGGQVF 85
G +F
Sbjct: 533 FAQGNIF 539
>gi|336387157|gb|EGO28302.1| hypothetical protein SERLADRAFT_354339 [Serpula lacrymans var.
lacrymans S7.9]
Length = 874
Score = 48.5 bits (114), Expect = 6e-04, Method: Composition-based stats.
Identities = 24/68 (35%), Positives = 32/68 (47%)
Query: 17 QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVS 76
++E L G ++FS P +WKV E GA C EL +THVV+ K KV
Sbjct: 566 RKETLDGIHILFSSVIPLDTKPETTEIWKVAEMFGAQCCTELSSRITHVVAAKHGTVKVD 625
Query: 77 LGSKGGQV 84
K G +
Sbjct: 626 AARKRGGI 633
>gi|336374248|gb|EGO02585.1| hypothetical protein SERLA73DRAFT_102556 [Serpula lacrymans var.
lacrymans S7.3]
Length = 811
Score = 48.5 bits (114), Expect = 6e-04, Method: Composition-based stats.
Identities = 24/68 (35%), Positives = 32/68 (47%)
Query: 17 QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVS 76
++E L G ++FS P +WKV E GA C EL +THVV+ K KV
Sbjct: 503 RKETLDGIHILFSSVIPLDTKPETTEIWKVAEMFGAQCCTELSSRITHVVAAKHGTVKVD 562
Query: 77 LGSKGGQV 84
K G +
Sbjct: 563 AARKRGGI 570
>gi|145346053|ref|XP_001417510.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144577737|gb|ABO95803.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 643
Score = 48.1 bits (113), Expect = 6e-04, Method: Composition-based stats.
Identities = 25/68 (36%), Positives = 32/68 (47%)
Query: 17 QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVS 76
+++VL ++VFS FP H LW + E GATC L THVV S +KV
Sbjct: 551 RKKVLADVRIVFSRVFPIDADPTTHPLWILAEDFGATCGRTLCDDTTHVVGTASSTDKVK 610
Query: 77 LGSKGGQV 84
G V
Sbjct: 611 AAKARGNV 618
>gi|402220046|gb|EJU00119.1| hypothetical protein DACRYDRAFT_81791 [Dacryopinax sp. DJM-731 SS1]
Length = 855
Score = 48.1 bits (113), Expect = 6e-04, Method: Composition-based stats.
Identities = 27/64 (42%), Positives = 30/64 (46%)
Query: 19 EVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVSLG 78
EVL G LVFS P P LW+ Q GA C + VTHVV+ K EKV G
Sbjct: 558 EVLSGVHLVFSSLIPIDMPHQNTDLWRQALQFGAACYTRVAREVTHVVAAKRGTEKVRQG 617
Query: 79 SKGG 82
G
Sbjct: 618 VARG 621
>gi|170084539|ref|XP_001873493.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164651045|gb|EDR15285.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 845
Score = 48.1 bits (113), Expect = 6e-04, Method: Composition-based stats.
Identities = 23/68 (33%), Positives = 34/68 (50%)
Query: 17 QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVS 76
+ EVL+G ++FS P +W++ GA CS EL +THVV+ K KV
Sbjct: 541 RSEVLEGVHILFSSVIPLDTKPETTEIWRMAHMFGARCSTELTSDITHVVAAKRGTVKVD 600
Query: 77 LGSKGGQV 84
+ K G +
Sbjct: 601 MARKRGGI 608
>gi|255540901|ref|XP_002511515.1| RNA polymerase II ctd phosphatase, putative [Ricinus communis]
gi|223550630|gb|EEF52117.1| RNA polymerase II ctd phosphatase, putative [Ricinus communis]
Length = 405
Score = 47.8 bits (112), Expect = 8e-04, Method: Composition-based stats.
Identities = 23/51 (45%), Positives = 33/51 (64%), Gaps = 2/51 (3%)
Query: 17 QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVS 67
Q +L+GCKL+ +K+ + L K+ E+LGA C ELDP+VTHVV+
Sbjct: 306 QAGILQGCKLILRKNLTAKY--KLDNLSKMAEKLGAICVSELDPTVTHVVT 354
>gi|357501219|ref|XP_003620898.1| RNA polymerase II C-terminal domain phosphatase-like protein
[Medicago truncatula]
gi|355495913|gb|AES77116.1| RNA polymerase II C-terminal domain phosphatase-like protein
[Medicago truncatula]
Length = 720
Score = 47.8 bits (112), Expect = 9e-04, Method: Composition-based stats.
Identities = 23/48 (47%), Positives = 32/48 (66%), Gaps = 4/48 (8%)
Query: 19 EVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVV 66
EVL+GC +VFS F + L ++ E+LGATC +LDP+VTHV+
Sbjct: 431 EVLRGCVIVFS----LNFHGDLRILRRIAERLGATCLKKLDPTVTHVI 474
>gi|392597598|gb|EIW86920.1| hypothetical protein CONPUDRAFT_95946 [Coniophora puteana
RWD-64-598 SS2]
Length = 830
Score = 47.4 bits (111), Expect = 0.001, Method: Composition-based stats.
Identities = 22/66 (33%), Positives = 31/66 (46%)
Query: 19 EVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVSLG 78
+V G ++FS P P +WKV GA C EL S+THVV+ + KV
Sbjct: 546 KVFDGVHILFSSVIPLDTPPETTEIWKVAHMFGAKCYTELSSSITHVVAARLGTVKVDAA 605
Query: 79 SKGGQV 84
+ G +
Sbjct: 606 RRRGGI 611
>gi|281206665|gb|EFA80851.1| putative tfiif-interacting component of the c-terminal domain
phosphatase [Polysphondylium pallidum PN500]
Length = 881
Score = 47.4 bits (111), Expect = 0.001, Method: Composition-based stats.
Identities = 25/79 (31%), Positives = 45/79 (56%), Gaps = 1/79 (1%)
Query: 17 QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVS 76
++++L G LVFS +P + PAH L + E+LGAT ++ + THVV+ + KV
Sbjct: 585 KKKILNGVNLVFSGVYPLQLPAHRQPLRLLAEELGATVQNDITNTTTHVVAARKGTSKVH 644
Query: 77 LG-SKGGQVFGGSTVDRGS 94
SKG ++ + +++ +
Sbjct: 645 KAISKGLKIVNQNWIEQSA 663
>gi|388580688|gb|EIM21001.1| FCP1-like phosphatase [Wallemia sebi CBS 633.66]
Length = 510
Score = 46.6 bits (109), Expect = 0.002, Method: Composition-based stats.
Identities = 25/65 (38%), Positives = 33/65 (50%)
Query: 17 QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVS 76
+R+VL G KLVFS P P I ++ + + GAT + VTHVV+ K KV
Sbjct: 323 KRKVLHGLKLVFSSVIPLGMPLEISGIYNLASKFGATIDHNYNEKVTHVVAAKKGTAKVE 382
Query: 77 LGSKG 81
KG
Sbjct: 383 DAKKG 387
>gi|443896478|dbj|GAC73822.1| TFIIF-interacting CTD phosphatases [Pseudozyma antarctica T-34]
Length = 751
Score = 46.2 bits (108), Expect = 0.003, Method: Composition-based stats.
Identities = 26/63 (41%), Positives = 32/63 (50%), Gaps = 1/63 (1%)
Query: 19 EVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSV-THVVSNKCSNEKVSL 77
+VLKGC +VFS P A LW GAT + E++P V THVVS + KV
Sbjct: 513 QVLKGCVIVFSSMIPVGHDAAKSELWATARAFGATPAAEIEPGVTTHVVSARMGTAKVHQ 572
Query: 78 GSK 80
K
Sbjct: 573 AMK 575
>gi|390333352|ref|XP_791406.3| PREDICTED: RNA polymerase II subunit A C-terminal domain
phosphatase-like [Strongylocentrotus purpuratus]
Length = 673
Score = 45.8 bits (107), Expect = 0.003, Method: Composition-based stats.
Identities = 25/76 (32%), Positives = 35/76 (46%), Gaps = 10/76 (13%)
Query: 20 VLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIEL----------DPSVTHVVSNK 69
VLKGC +VFS FP+ P +WKV LGA S ++ + TH+V+ K
Sbjct: 371 VLKGCNIVFSSVFPTNMPPEQSRVWKVALALGAKVSPQIVTKSKEEQAKGRASTHLVAAK 430
Query: 70 CSNEKVSLGSKGGQVF 85
KV + +F
Sbjct: 431 VGTSKVHAARRSKSIF 446
>gi|308802952|ref|XP_003078789.1| putative transcription regulator CPL1 (ISS) [Ostreococcus tauri]
gi|116057242|emb|CAL51669.1| putative transcription regulator CPL1 (ISS) [Ostreococcus tauri]
Length = 457
Score = 45.8 bits (107), Expect = 0.004, Method: Composition-based stats.
Identities = 25/71 (35%), Positives = 32/71 (45%)
Query: 14 ENGQREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNE 73
E ++ VL G +VFS FP LW + E GA CS E+ THVV +
Sbjct: 359 EERRKVVLSGVHVVFSRVFPLHVKPEEQPLWILAENFGANCSSEITSHTTHVVGTSKATA 418
Query: 74 KVSLGSKGGQV 84
KV K G +
Sbjct: 419 KVREALKRGGI 429
>gi|242093894|ref|XP_002437437.1| hypothetical protein SORBIDRAFT_10g027050 [Sorghum bicolor]
gi|241915660|gb|EER88804.1| hypothetical protein SORBIDRAFT_10g027050 [Sorghum bicolor]
Length = 271
Score = 45.4 bits (106), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 23/67 (34%), Positives = 37/67 (55%), Gaps = 3/67 (4%)
Query: 17 QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVS 76
+R+VL C +VFS+ FP +W + E+LGA C ++D +VTHVV+ +K
Sbjct: 178 RRQVLPVCTVVFSYL--EDFPEDT-LMWTLAERLGAACQKDVDETVTHVVAEDPGTQKAQ 234
Query: 77 LGSKGGQ 83
+ G+
Sbjct: 235 WAREHGK 241
>gi|242063378|ref|XP_002452978.1| hypothetical protein SORBIDRAFT_04g035900 [Sorghum bicolor]
gi|241932809|gb|EES05954.1| hypothetical protein SORBIDRAFT_04g035900 [Sorghum bicolor]
Length = 464
Score = 45.4 bits (106), Expect = 0.004, Method: Composition-based stats.
Identities = 23/69 (33%), Positives = 38/69 (55%), Gaps = 3/69 (4%)
Query: 17 QREVLKGCKLVFSH--AFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEK 74
+R+VL C +VFS+ + FP +W + E+LGA C ++D +VTHVV+ +K
Sbjct: 367 RRQVLPVCTVVFSYLEEYMEDFPEDT-LMWTLAERLGAACQKDVDETVTHVVAEDPGTQK 425
Query: 75 VSLGSKGGQ 83
+ G+
Sbjct: 426 AQWAREHGK 434
>gi|357451355|ref|XP_003595954.1| RNA polymerase II subunit A C-terminal domain phosphatase [Medicago
truncatula]
gi|355485002|gb|AES66205.1| RNA polymerase II subunit A C-terminal domain phosphatase [Medicago
truncatula]
Length = 239
Score = 45.1 bits (105), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 27/60 (45%), Positives = 36/60 (60%), Gaps = 3/60 (5%)
Query: 9 IFFCTENGQREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSN 68
+F + + EVL GC +VFS AF + L K+ E+LGAT EL P+VTHVV+N
Sbjct: 175 VFHVLSSLRGEVLSGCVIVFSCAFHGH---DLRKLRKIAERLGATHLTELRPTVTHVVAN 231
>gi|168059994|ref|XP_001781984.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666557|gb|EDQ53208.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 563
Score = 44.7 bits (104), Expect = 0.006, Method: Composition-based stats.
Identities = 22/65 (33%), Positives = 33/65 (50%), Gaps = 1/65 (1%)
Query: 19 EVLKGCKLVFSHAFPSKFP-AHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVSL 77
++L GC +VFS FP+ H W++ +LGA CS D + THVV+ +K
Sbjct: 407 KLLSGCHIVFSRIFPTGLQNPEFHPFWQLAVELGARCSTVCDHTTTHVVALDRGTDKARW 466
Query: 78 GSKGG 82
+ G
Sbjct: 467 AKQHG 471
>gi|393218252|gb|EJD03740.1| hypothetical protein FOMMEDRAFT_105888 [Fomitiporia mediterranea
MF3/22]
Length = 921
Score = 44.7 bits (104), Expect = 0.007, Method: Composition-based stats.
Identities = 23/77 (29%), Positives = 37/77 (48%), Gaps = 2/77 (2%)
Query: 11 FCTENGQREVLKGCKLVFSHAFPS--KFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSN 68
F N ++E K ++FS P+ + +W++ GATC +LD VTHVV++
Sbjct: 570 FIIPNIRKETFKDVHILFSGVIPTNIRMDHEATEIWRMARAFGATCHRDLDKEVTHVVTS 629
Query: 69 KCSNEKVSLGSKGGQVF 85
K +KV +F
Sbjct: 630 KRGTQKVEKARSQPNIF 646
>gi|384247094|gb|EIE20582.1| hypothetical protein COCSUDRAFT_57726 [Coccomyxa subellipsoidea
C-169]
Length = 1018
Score = 44.7 bits (104), Expect = 0.007, Method: Composition-based stats.
Identities = 24/68 (35%), Positives = 38/68 (55%), Gaps = 2/68 (2%)
Query: 17 QREVLKGCKLVFSHAFP-SKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKV 75
+++VL G +++FS FP + P+ Y WK E GA+C+ +LD VTHVV+ K
Sbjct: 925 RKQVLLGVRVLFSKVFPLGQAPSEQLY-WKQAEAYGASCTSQLDEHVTHVVALSRGTHKA 983
Query: 76 SLGSKGGQ 83
+ G+
Sbjct: 984 QWALQAGK 991
>gi|241249809|ref|XP_002403164.1| RNA polymerase II ctd phosphatase, putative [Ixodes scapularis]
gi|215496447|gb|EEC06087.1| RNA polymerase II ctd phosphatase, putative [Ixodes scapularis]
Length = 185
Score = 44.7 bits (104), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 35/68 (51%)
Query: 17 QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVS 76
+R+VLKG LVFS P+ W+ LGA S +L P VTH+V+ + KV+
Sbjct: 20 RRKVLKGSHLVFSGVVPTNQEPEKSRAWQTARALGARVSSDLCPGVTHLVAARPGTAKVN 79
Query: 77 LGSKGGQV 84
+ Q+
Sbjct: 80 RARRTRQL 87
>gi|302816075|ref|XP_002989717.1| hypothetical protein SELMODRAFT_23521 [Selaginella moellendorffii]
gi|302824047|ref|XP_002993670.1| hypothetical protein SELMODRAFT_23523 [Selaginella moellendorffii]
gi|300138493|gb|EFJ05259.1| hypothetical protein SELMODRAFT_23523 [Selaginella moellendorffii]
gi|300142494|gb|EFJ09194.1| hypothetical protein SELMODRAFT_23521 [Selaginella moellendorffii]
Length = 312
Score = 44.3 bits (103), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 27/86 (31%), Positives = 40/86 (46%), Gaps = 3/86 (3%)
Query: 20 VLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVSLGS 79
+L+GCKL FS P LW + E LGA C +E+D SVTHVV+ + +
Sbjct: 226 ILEGCKLAFSSVVPIDCEDS---LWILCEGLGAECVLEIDDSVTHVVAMDPESARARWAV 282
Query: 80 KGGQVFGGSTVDRGSQLFVARATRRE 105
+ G+ + R + + R E
Sbjct: 283 ENGKHLVNPSWMRAAAFRLGRPRESE 308
>gi|358057984|dbj|GAA96229.1| hypothetical protein E5Q_02893 [Mixia osmundae IAM 14324]
Length = 760
Score = 43.9 bits (102), Expect = 0.012, Method: Composition-based stats.
Identities = 21/66 (31%), Positives = 32/66 (48%)
Query: 15 NGQREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEK 74
N + VL+GCK+ FS P +WK+ + GA CS +++ TH+V+ K
Sbjct: 511 NIKESVLRGCKIAFSSMIPLGTNPEAADIWKLAKMFGAYCSSDVNSKTTHLVARNPGTVK 570
Query: 75 VSLGSK 80
V K
Sbjct: 571 VQQAQK 576
>gi|58271496|ref|XP_572904.1| protein phosphatase [Cryptococcus neoformans var. neoformans JEC21]
gi|134115316|ref|XP_773956.1| hypothetical protein CNBH4080 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|50256584|gb|EAL19309.1| hypothetical protein CNBH4080 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|57229163|gb|AAW45597.1| protein phosphatase, putative [Cryptococcus neoformans var.
neoformans JEC21]
Length = 955
Score = 43.9 bits (102), Expect = 0.013, Method: Composition-based stats.
Identities = 21/56 (37%), Positives = 28/56 (50%)
Query: 19 EVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEK 74
EVL GC LVFS P + +W+ E GA + L P TH+V+ + EK
Sbjct: 649 EVLDGCNLVFSGMIPREANPSTTAIWQTAESFGALITPSLTPRTTHLVTALLNTEK 704
>gi|405122085|gb|AFR96852.1| hypothetical protein CNAG_04120 [Cryptococcus neoformans var.
grubii H99]
Length = 921
Score = 43.9 bits (102), Expect = 0.013, Method: Composition-based stats.
Identities = 21/56 (37%), Positives = 28/56 (50%)
Query: 19 EVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEK 74
EVL GC LVFS P + +W+ E GA + L P TH+V+ + EK
Sbjct: 613 EVLDGCSLVFSGMIPRESNPSTTTIWQTAESFGALITPSLTPRTTHLVTALLNTEK 668
>gi|406695220|gb|EKC98531.1| protein phosphatase [Trichosporon asahii var. asahii CBS 8904]
Length = 917
Score = 43.9 bits (102), Expect = 0.014, Method: Composition-based stats.
Identities = 21/62 (33%), Positives = 29/62 (46%)
Query: 19 EVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVSLG 78
+VL GC +VF+ +W+ E GA C +ELD VTH V EK+
Sbjct: 603 QVLSGCVIVFTGVIAINQKPQDSEIWQQAEAFGAQCQVELDERVTHCVIGSIGTEKMRRA 662
Query: 79 SK 80
S+
Sbjct: 663 SR 664
>gi|242015474|ref|XP_002428378.1| RNA polymerase II ctd phosphatase, putative [Pediculus humanus
corporis]
gi|212512990|gb|EEB15640.1| RNA polymerase II ctd phosphatase, putative [Pediculus humanus
corporis]
Length = 781
Score = 43.1 bits (100), Expect = 0.023, Method: Composition-based stats.
Identities = 23/71 (32%), Positives = 35/71 (49%)
Query: 15 NGQREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEK 74
N ++ LKGC LVFS PS P + V LGA S ++ + TH+V+ + K
Sbjct: 515 NFKKNTLKGCHLVFSGLVPSHIPLQESRAYLVAISLGAIVSADISSNCTHLVAARPGTAK 574
Query: 75 VSLGSKGGQVF 85
V+ + +F
Sbjct: 575 VNSSRRHKGIF 585
>gi|325179818|emb|CCA14221.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 694
Score = 42.7 bits (99), Expect = 0.027, Method: Composition-based stats.
Identities = 22/53 (41%), Positives = 32/53 (60%), Gaps = 3/53 (5%)
Query: 17 QREVLKGCKLVFSHAFPSKFP--AHIHYLWKVVEQLGATCSIELDP-SVTHVV 66
QR++L+GC +VFS FP P H LW++ +GA S+ +D VTH+V
Sbjct: 380 QRKILQGCFIVFSGVFPVSDPRGPKSHSLWRLAADMGAVPSLVIDDFPVTHLV 432
>gi|403416935|emb|CCM03635.1| predicted protein [Fibroporia radiculosa]
Length = 580
Score = 42.7 bits (99), Expect = 0.029, Method: Composition-based stats.
Identities = 21/66 (31%), Positives = 29/66 (43%)
Query: 19 EVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVSLG 78
+ L G +VFS P +W+ GA C EL VTHVV+ K +KV
Sbjct: 279 DTLAGVHIVFSSVIPLDTRPEATEIWRTAHAFGAKCYTELSNRVTHVVAAKRGTQKVDAA 338
Query: 79 SKGGQV 84
+ G +
Sbjct: 339 RRSGGI 344
>gi|326437795|gb|EGD83365.1| hypothetical protein PTSG_03974 [Salpingoeca sp. ATCC 50818]
Length = 864
Score = 42.7 bits (99), Expect = 0.031, Method: Composition-based stats.
Identities = 21/65 (32%), Positives = 34/65 (52%)
Query: 20 VLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVSLGS 79
+L+G ++VF+ P A+ H W++ +GA ++D VTHVV+ +KV
Sbjct: 641 ILEGVRIVFTGVIPRGQSAYTHPAWRMAVNMGAVVVDQVDERVTHVVARVDGTDKVRQAR 700
Query: 80 KGGQV 84
K G V
Sbjct: 701 KMGGV 705
>gi|401886990|gb|EJT50998.1| protein phosphatase [Trichosporon asahii var. asahii CBS 2479]
Length = 922
Score = 42.4 bits (98), Expect = 0.034, Method: Composition-based stats.
Identities = 20/62 (32%), Positives = 29/62 (46%)
Query: 19 EVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVSLG 78
++L GC +VF+ +W+ E GA C +ELD VTH V EK+
Sbjct: 608 QMLSGCVIVFTGVIAINQKPQDSEIWQQAEAFGAQCQVELDERVTHCVIGSIGTEKMRRA 667
Query: 79 SK 80
S+
Sbjct: 668 SR 669
>gi|449551315|gb|EMD42279.1| hypothetical protein CERSUDRAFT_148004 [Ceriporiopsis subvermispora
B]
Length = 875
Score = 42.0 bits (97), Expect = 0.051, Method: Composition-based stats.
Identities = 20/66 (30%), Positives = 30/66 (45%)
Query: 19 EVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVSLG 78
+ L+G ++FS P + +W+ GA C EL +THVV+ K KV
Sbjct: 566 KALEGVHILFSSVIPLDTRPEVTEVWRTAHAFGAQCHTELSSRITHVVAAKRGTVKVDAA 625
Query: 79 SKGGQV 84
K G +
Sbjct: 626 RKQGGI 631
>gi|413924219|gb|AFW64151.1| hypothetical protein ZEAMMB73_480827 [Zea mays]
Length = 490
Score = 41.6 bits (96), Expect = 0.054, Method: Composition-based stats.
Identities = 22/69 (31%), Positives = 36/69 (52%), Gaps = 3/69 (4%)
Query: 17 QREVLKGCKLVFSHA--FPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEK 74
+R+VL C + FS+ FP + +W + E+LGA C ++D +VTHVV+ +K
Sbjct: 368 RRQVLPECTIAFSYLDDCMEDFPENT-LMWTLAERLGAVCRKDVDETVTHVVAEDPGTQK 426
Query: 75 VSLGSKGGQ 83
G+
Sbjct: 427 AQWARDHGK 435
>gi|226498568|ref|NP_001149751.1| CPL3 [Zea mays]
gi|195631558|gb|ACG36674.1| CPL3 [Zea mays]
Length = 493
Score = 41.6 bits (96), Expect = 0.067, Method: Composition-based stats.
Identities = 22/69 (31%), Positives = 36/69 (52%), Gaps = 3/69 (4%)
Query: 17 QREVLKGCKLVFSHA--FPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEK 74
+R+VL C + FS+ FP + +W + E+LGA C ++D +VTHVV+ +K
Sbjct: 371 RRQVLPECTVAFSYLDDCMEDFPENT-LMWTLAERLGAVCRKDVDETVTHVVAEDPGTQK 429
Query: 75 VSLGSKGGQ 83
G+
Sbjct: 430 AQWARDHGK 438
>gi|449018404|dbj|BAM81806.1| similar to TFIIF interacting component of CTD phosphatase Fcp1p
[Cyanidioschyzon merolae strain 10D]
Length = 1640
Score = 40.8 bits (94), Expect = 0.10, Method: Composition-based stats.
Identities = 18/54 (33%), Positives = 31/54 (57%), Gaps = 2/54 (3%)
Query: 17 QREVLKGCKLVFSHAFP--SKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSN 68
+R VL GC+L F+ F + H LW++ + GA C E+ P V+H++++
Sbjct: 1397 RRSVLTGCELCFTGVFAKHAGMAPEDHELWRLAVRFGAVCHREVLPQVSHLIAD 1450
>gi|321262398|ref|XP_003195918.1| carboxy-terminal domain (CTD) phosphatase; Fcp1p [Cryptococcus
gattii WM276]
gi|317462392|gb|ADV24131.1| Carboxy-terminal domain (CTD) phosphatase, putative; Fcp1p
[Cryptococcus gattii WM276]
Length = 952
Score = 40.8 bits (94), Expect = 0.12, Method: Composition-based stats.
Identities = 20/56 (35%), Positives = 27/56 (48%)
Query: 19 EVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEK 74
EVL GC LVFS P + +W+ E GA + L TH+V+ + EK
Sbjct: 643 EVLDGCSLVFSGMIPREADPSTTTIWQTAESFGALITPSLTSRTTHLVTALLNTEK 698
>gi|393240595|gb|EJD48120.1| hypothetical protein AURDEDRAFT_85955 [Auricularia delicata
TFB-10046 SS5]
Length = 796
Score = 40.4 bits (93), Expect = 0.12, Method: Composition-based stats.
Identities = 19/65 (29%), Positives = 30/65 (46%)
Query: 19 EVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVSLG 78
+ G +FS P + +WK + GA C E+ P +THV++ K S KV
Sbjct: 521 QTFAGMHFLFSSLIPLEDKPEESPIWKQAREFGAICHSEVSPRLTHVITAKRSTAKVDAA 580
Query: 79 SKGGQ 83
+ G+
Sbjct: 581 RRRGE 585
>gi|242066826|ref|XP_002454702.1| hypothetical protein SORBIDRAFT_04g035880 [Sorghum bicolor]
gi|241934533|gb|EES07678.1| hypothetical protein SORBIDRAFT_04g035880 [Sorghum bicolor]
Length = 462
Score = 40.4 bits (93), Expect = 0.12, Method: Composition-based stats.
Identities = 22/67 (32%), Positives = 35/67 (52%), Gaps = 3/67 (4%)
Query: 19 EVLKGCKLVFSHAFP--SKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVS 76
+VL+GC + FS+ P LW + E+LGA C ++D +VTHVV+ +K
Sbjct: 341 QVLRGCTVAFSYLEQRMEDSPDDTR-LWTLAERLGAVCRKDVDETVTHVVAEDPGTQKAQ 399
Query: 77 LGSKGGQ 83
+ G+
Sbjct: 400 WAREHGK 406
>gi|307106534|gb|EFN54779.1| hypothetical protein CHLNCDRAFT_134722 [Chlorella variabilis]
Length = 513
Score = 40.0 bits (92), Expect = 0.16, Method: Composition-based stats.
Identities = 21/59 (35%), Positives = 31/59 (52%), Gaps = 1/59 (1%)
Query: 16 GQREVLKGCKLVFSHAFPSKFP-AHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNE 73
+R VL C+L+FS P H LW++ +LGA C E VTHVV+ +++
Sbjct: 342 ARRAVLAECRLLFSRVMPLDCADPSAHPLWQLALKLGAECVRETGQGVTHVVATDTTDK 400
>gi|395334832|gb|EJF67208.1| hypothetical protein DICSQDRAFT_142769 [Dichomitus squalens
LYAD-421 SS1]
Length = 953
Score = 40.0 bits (92), Expect = 0.17, Method: Composition-based stats.
Identities = 19/66 (28%), Positives = 30/66 (45%)
Query: 19 EVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVSLG 78
+ L G ++F+ P +WK GA C +L +THVV+NK + +KV
Sbjct: 616 DTLAGVHILFTGVIPLNQRPETAEIWKTATAFGAQCHTDLGKHITHVVTNKDNTQKVDAA 675
Query: 79 SKGGQV 84
+ V
Sbjct: 676 RRYADV 681
>gi|302793512|ref|XP_002978521.1| hypothetical protein SELMODRAFT_418187 [Selaginella moellendorffii]
gi|300153870|gb|EFJ20507.1| hypothetical protein SELMODRAFT_418187 [Selaginella moellendorffii]
Length = 346
Score = 40.0 bits (92), Expect = 0.18, Method: Composition-based stats.
Identities = 25/71 (35%), Positives = 40/71 (56%), Gaps = 9/71 (12%)
Query: 18 REV----LKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTH-VVSNKCSN 72
REV L GCK+V +K A LW ++LGA C +++D +VTH VV++K
Sbjct: 251 REVKGHALSGCKIVIC----AKSQAAHELLWDSCQELGAECVVDIDDTVTHVVVASKQQP 306
Query: 73 EKVSLGSKGGQ 83
+ + L ++ G+
Sbjct: 307 QGLELSAQAGK 317
>gi|291234950|ref|XP_002737409.1| PREDICTED: RNA polymerase II ctd phosphatase, putative-like
[Saccoglossus kowalevskii]
Length = 896
Score = 39.7 bits (91), Expect = 0.21, Method: Composition-based stats.
Identities = 23/68 (33%), Positives = 30/68 (44%), Gaps = 10/68 (14%)
Query: 18 REVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDP----------SVTHVVS 67
R+VLKG ++FS FP+ WKV + LGA P + THVV+
Sbjct: 576 RQVLKGTNILFSGVFPTNMSPEKSRAWKVAQTLGANVQSSFVPKLKDKTNAATATTHVVA 635
Query: 68 NKCSNEKV 75
K KV
Sbjct: 636 AKAGTVKV 643
>gi|71004098|ref|XP_756715.1| hypothetical protein UM00568.1 [Ustilago maydis 521]
gi|46095984|gb|EAK81217.1| hypothetical protein UM00568.1 [Ustilago maydis 521]
Length = 779
Score = 39.7 bits (91), Expect = 0.22, Method: Composition-based stats.
Identities = 22/58 (37%), Positives = 31/58 (53%), Gaps = 1/58 (1%)
Query: 19 EVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSV-THVVSNKCSNEKV 75
+VLKGC +VFS P LW + + GAT + E++ V THVV+ + KV
Sbjct: 518 QVLKGCTIVFSSMIPFGHNVEKSDLWAMAREFGATPASEIEVGVTTHVVAARPGTAKV 575
>gi|341882050|gb|EGT37985.1| hypothetical protein CAEBREN_32558 [Caenorhabditis brenneri]
Length = 673
Score = 39.7 bits (91), Expect = 0.23, Method: Composition-based stats.
Identities = 19/59 (32%), Positives = 33/59 (55%)
Query: 17 QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKV 75
+R+VL GC +VFS P+ ++++ +Q GAT E+ VTH+V + +K+
Sbjct: 359 RRKVLDGCVIVFSGIVPTGEKLERTDIYRLCQQFGATILPEVTDQVTHIVGARYGTQKI 417
>gi|196002231|ref|XP_002110983.1| hypothetical protein TRIADDRAFT_54465 [Trichoplax adhaerens]
gi|190586934|gb|EDV26987.1| hypothetical protein TRIADDRAFT_54465 [Trichoplax adhaerens]
Length = 766
Score = 39.7 bits (91), Expect = 0.25, Method: Composition-based stats.
Identities = 30/79 (37%), Positives = 38/79 (48%), Gaps = 10/79 (12%)
Query: 17 QREVLKGCKLVFSHAFPSKFPA-HIHYLWKVVEQLGA--TCSIELDPS--VTHVVSNKCS 71
+R VLK K+VFS PS P+ Y W + E LGA T PS THVV+ + +
Sbjct: 546 RRNVLKDVKIVFSAIIPSGHPSPEKTYEWILAESLGAKVTHKFHTSPSRKTTHVVTKRVA 605
Query: 72 -----NEKVSLGSKGGQVF 85
+KV L K VF
Sbjct: 606 FQSGYTQKVHLAMKTAGVF 624
>gi|323508124|emb|CBQ67995.1| related to FCP1-TFIIF interacting component of CTD phosphatase
[Sporisorium reilianum SRZ2]
Length = 773
Score = 39.7 bits (91), Expect = 0.26, Method: Composition-based stats.
Identities = 25/72 (34%), Positives = 34/72 (47%), Gaps = 1/72 (1%)
Query: 20 VLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSV-THVVSNKCSNEKVSLG 78
VL+GC +VFS P LW + + GAT S E++ V THVV+ + KV
Sbjct: 516 VLQGCTIVFSSMIPFGHDPEKSDLWAMAREFGATPSSEIEAGVTTHVVAARPGTAKVHQA 575
Query: 79 SKGGQVFGGSTV 90
+ Q G V
Sbjct: 576 LRLAQKSAGLEV 587
>gi|302774062|ref|XP_002970448.1| hypothetical protein SELMODRAFT_411029 [Selaginella moellendorffii]
gi|300161964|gb|EFJ28578.1| hypothetical protein SELMODRAFT_411029 [Selaginella moellendorffii]
Length = 346
Score = 39.3 bits (90), Expect = 0.29, Method: Composition-based stats.
Identities = 25/71 (35%), Positives = 39/71 (54%), Gaps = 9/71 (12%)
Query: 18 REV----LKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTH-VVSNKCSN 72
REV L GCK+V +K A LW + LGA C +++D +VTH VV++K
Sbjct: 251 REVKGHALSGCKIVIC----AKTQAAHELLWDSCQALGAECVVDIDDTVTHVVVASKQQP 306
Query: 73 EKVSLGSKGGQ 83
+ + L ++ G+
Sbjct: 307 QGLELSAQAGK 317
>gi|388858248|emb|CCF48177.1| related to FCP1-TFIIF interacting component of CTD phosphatase
[Ustilago hordei]
Length = 774
Score = 39.3 bits (90), Expect = 0.29, Method: Composition-based stats.
Identities = 21/60 (35%), Positives = 32/60 (53%), Gaps = 1/60 (1%)
Query: 17 QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSV-THVVSNKCSNEKV 75
+ +VL GC +VFS P+ LW + + GAT + E++ V THVV+ + KV
Sbjct: 505 KTKVLAGCTIVFSSMIPTGHNPETSDLWALAREFGATPAFEVEEGVTTHVVAARQGTLKV 564
>gi|91087589|ref|XP_971974.1| PREDICTED: similar to RNA polymerase II subunit A C-terminal domain
phosphatase [Tribolium castaneum]
gi|270010700|gb|EFA07148.1| hypothetical protein TcasGA2_TC010139 [Tribolium castaneum]
Length = 760
Score = 39.3 bits (90), Expect = 0.29, Method: Composition-based stats.
Identities = 20/64 (31%), Positives = 35/64 (54%)
Query: 17 QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVS 76
+ +VL+G KLVFS P+ +++ + LGA + EL+ TH+V+ + KV+
Sbjct: 484 RSQVLQGYKLVFSGLVPTHIKLEQSKAYQIAKSLGAEVTQELEDDTTHLVAVRPGTAKVN 543
Query: 77 LGSK 80
G +
Sbjct: 544 AGRR 547
>gi|389751366|gb|EIM92439.1| hypothetical protein STEHIDRAFT_136328 [Stereum hirsutum FP-91666
SS1]
Length = 1075
Score = 39.3 bits (90), Expect = 0.30, Method: Composition-based stats.
Identities = 20/65 (30%), Positives = 28/65 (43%)
Query: 21 LKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVSLGSK 80
L G ++FS P +W++ GA C EL +THVV+ K KV K
Sbjct: 672 LFGVHILFSSVIPLDTRPETTEVWRLAHAFGAKCYTELSSKITHVVAAKRGTVKVDQARK 731
Query: 81 GGQVF 85
G +
Sbjct: 732 RGNIL 736
>gi|387219521|gb|AFJ69469.1| rna polymerase ii ctd phosphatase, partial [Nannochloropsis
gaditana CCMP526]
Length = 268
Score = 38.9 bits (89), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 21/64 (32%), Positives = 31/64 (48%)
Query: 12 CTENGQREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCS 71
C + +R+VL G ++FS P L + LGA + P+VTH+V+ S
Sbjct: 84 CLSSVRRQVLAGVTILFSGVLPRNVDPRRSDLGYMALSLGARIVEDFSPTVTHLVAENAS 143
Query: 72 NEKV 75
EKV
Sbjct: 144 TEKV 147
>gi|302838991|ref|XP_002951053.1| hypothetical protein VOLCADRAFT_91454 [Volvox carteri f.
nagariensis]
gi|300263748|gb|EFJ47947.1| hypothetical protein VOLCADRAFT_91454 [Volvox carteri f.
nagariensis]
Length = 699
Score = 38.9 bits (89), Expect = 0.37, Method: Composition-based stats.
Identities = 23/63 (36%), Positives = 31/63 (49%), Gaps = 5/63 (7%)
Query: 17 QREVLK----GCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSV-THVVSNKCS 71
+RE+L+ GC + FS +P LW++ LGA C DP V THVV+
Sbjct: 566 RREILQLMPQGCCITFSRCWPQDRNPLREPLWQLAMSLGANCLTTYDPGVTTHVVAAAGG 625
Query: 72 NEK 74
EK
Sbjct: 626 TEK 628
>gi|170036997|ref|XP_001846347.1| RNA polymerase II subunit A C-terminal domain phosphatase [Culex
quinquefasciatus]
gi|167879975|gb|EDS43358.1| RNA polymerase II subunit A C-terminal domain phosphatase [Culex
quinquefasciatus]
Length = 764
Score = 38.9 bits (89), Expect = 0.38, Method: Composition-based stats.
Identities = 19/51 (37%), Positives = 28/51 (54%)
Query: 17 QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVS 67
+ +VL G KLVFS P+ H ++V LGAT + +P TH+V+
Sbjct: 466 KSQVLVGHKLVFSGLVPNSMKLHQSKAFQVARSLGATVTQSFEPDTTHLVA 516
>gi|328874143|gb|EGG22509.1| hypothetical protein DFA_04637 [Dictyostelium fasciculatum]
Length = 397
Score = 38.5 bits (88), Expect = 0.46, Method: Composition-based stats.
Identities = 23/68 (33%), Positives = 33/68 (48%), Gaps = 3/68 (4%)
Query: 20 VLKGCKLVFSHAFPSKFPA---HIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVS 76
VL C +VFS FP + A H + ++ E GA ++ P+ TH++ K KV
Sbjct: 299 VLMDCNIVFSGIFPKQIDATKLHQTRIVQMAESFGAQVHQDITPTTTHLIFIKEGTSKVI 358
Query: 77 LGSKGGQV 84
K GQV
Sbjct: 359 QAVKQGQV 366
>gi|307213748|gb|EFN89086.1| Ornithine decarboxylase [Harpegnathos saltator]
Length = 409
Score = 37.7 bits (86), Expect = 0.77, Method: Composition-based stats.
Identities = 26/71 (36%), Positives = 38/71 (53%), Gaps = 11/71 (15%)
Query: 21 LKGCKLVFSHAFPSKFPAHIHYLWKV-VEQLGATCSIELD------PSVTHVVSNKC--S 71
+KG +++F+H P+K P+HI Y KV VE++ EL P V+ +C
Sbjct: 96 VKGERIIFAH--PAKLPSHIKYARKVGVERMTVDGETELSKIQEFFPEAKVVLRIRCDAK 153
Query: 72 NEKVSLGSKGG 82
N VSLG+K G
Sbjct: 154 NSPVSLGTKFG 164
>gi|299470348|emb|CBN78397.1| Similar to RNA Polymerase II CTD phosphatase Fcp1, putative
[Ectocarpus siliculosus]
Length = 985
Score = 37.7 bits (86), Expect = 0.77, Method: Composition-based stats.
Identities = 25/69 (36%), Positives = 33/69 (47%), Gaps = 3/69 (4%)
Query: 20 VLKGCKLVFSHAFP-SKFPA--HIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVS 76
VL G ++VFS P S PA H LW + E GAT ++ THVV+ + K
Sbjct: 724 VLTGVRMVFSGVIPVSGAPADPRTHRLWMMAESHGATVERDIGRHTTHVVAVRLGTAKTK 783
Query: 77 LGSKGGQVF 85
G + VF
Sbjct: 784 TGLRMPGVF 792
>gi|322785368|gb|EFZ12041.1| hypothetical protein SINV_00693 [Solenopsis invicta]
Length = 759
Score = 37.7 bits (86), Expect = 0.79, Method: Composition-based stats.
Identities = 21/68 (30%), Positives = 32/68 (47%)
Query: 17 QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVS 76
+ +VLKG L FS P+ H +KV GA + EL TH+V+ + K +
Sbjct: 479 RSQVLKGVCLTFSGLIPTHQKLHQSRAYKVARAFGAEVTQELTEKTTHLVAIRKGTAKAN 538
Query: 77 LGSKGGQV 84
K G++
Sbjct: 539 AAKKHGKI 546
>gi|330796177|ref|XP_003286145.1| hypothetical protein DICPUDRAFT_87022 [Dictyostelium purpureum]
gi|325083890|gb|EGC37331.1| hypothetical protein DICPUDRAFT_87022 [Dictyostelium purpureum]
Length = 793
Score = 37.7 bits (86), Expect = 0.80, Method: Composition-based stats.
Identities = 22/62 (35%), Positives = 34/62 (54%), Gaps = 3/62 (4%)
Query: 17 QREVLKGCKLVFSHAFPSKF-PAHIHY--LWKVVEQLGATCSIELDPSVTHVVSNKCSNE 73
+ VL C +VFS FP + P+ + + + K+ E GA+ S E+D + THV+ K
Sbjct: 688 RSSVLMDCNIVFSGIFPKQIDPSKLCHTRVSKITESFGASISQEIDSNTTHVIFIKEGTS 747
Query: 74 KV 75
KV
Sbjct: 748 KV 749
>gi|168012675|ref|XP_001759027.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689726|gb|EDQ76096.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 389
Score = 37.7 bits (86), Expect = 0.83, Method: Composition-based stats.
Identities = 19/59 (32%), Positives = 30/59 (50%), Gaps = 8/59 (13%)
Query: 9 IFFCTENGQREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVS 67
+FF + + ++L GC +V IH W++ +LGA CS D + THVV+
Sbjct: 213 LFFVIRSLRAKLLAGCNVVLG--------PEIHPFWQLPAELGARCSTFCDHTTTHVVA 263
>gi|198438317|ref|XP_002131972.1| PREDICTED: similar to MGC81710 protein [Ciona intestinalis]
Length = 895
Score = 37.4 bits (85), Expect = 1.2, Method: Composition-based stats.
Identities = 22/69 (31%), Positives = 34/69 (49%), Gaps = 2/69 (2%)
Query: 17 QREVLKGCKLVFSHAFPSKFPA--HIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEK 74
+ +VL GC +V + P+ F A H+H V QLGA + +D + TH++ K K
Sbjct: 582 RSKVLYGCCIVLTGIIPNNFKAAPHMHRAHIVARQLGAAINSTVDENTTHLIGAKKGTAK 641
Query: 75 VSLGSKGGQ 83
K G+
Sbjct: 642 YQDALKMGK 650
>gi|303276827|ref|XP_003057707.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226460364|gb|EEH57658.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 692
Score = 37.4 bits (85), Expect = 1.2, Method: Composition-based stats.
Identities = 22/57 (38%), Positives = 31/57 (54%), Gaps = 4/57 (7%)
Query: 17 QREVLKGCKLVFSHAFP---SKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKC 70
++ VL+G ++VFS F H LW++ E+LGA E S THVV+ KC
Sbjct: 591 RKNVLRGVEIVFSGVFDHNDKTLTPREHPLWRLAERLGARVVSEPGTSTTHVVA-KC 646
>gi|320163842|gb|EFW40741.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 933
Score = 37.4 bits (85), Expect = 1.3, Method: Composition-based stats.
Identities = 15/24 (62%), Positives = 17/24 (70%)
Query: 43 LWKVVEQLGATCSIELDPSVTHVV 66
LW +VE G TCS ELD S TH+V
Sbjct: 114 LWGIVEYFGGTCSAELDSSCTHLV 137
>gi|328859642|gb|EGG08750.1| hypothetical protein MELLADRAFT_115868 [Melampsora larici-populina
98AG31]
Length = 736
Score = 37.0 bits (84), Expect = 1.3, Method: Composition-based stats.
Identities = 15/29 (51%), Positives = 18/29 (62%)
Query: 44 WKVVEQLGATCSIELDPSVTHVVSNKCSN 72
WK+ EQ GA C L P VTH+V+ K N
Sbjct: 538 WKLAEQFGAQCYTRLTPRVTHLVAAKAIN 566
>gi|268566337|ref|XP_002639695.1| C. briggsae CBR-FCP-1 protein [Caenorhabditis briggsae]
Length = 723
Score = 37.0 bits (84), Expect = 1.3, Method: Composition-based stats.
Identities = 20/67 (29%), Positives = 34/67 (50%)
Query: 17 QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVS 76
+ +VL GC +VFS P+ ++++ Q GAT E+ VTHVV + +K+
Sbjct: 366 RHKVLDGCVIVFSGIVPTGEKLERTDIYRLCMQFGATIVPEVTDEVTHVVGARYGTQKIH 425
Query: 77 LGSKGGQ 83
+ G+
Sbjct: 426 QAHRLGK 432
>gi|66824241|ref|XP_645475.1| hypothetical protein DDB_G0271690 [Dictyostelium discoideum AX4]
gi|60473594|gb|EAL71535.1| hypothetical protein DDB_G0271690 [Dictyostelium discoideum AX4]
Length = 782
Score = 37.0 bits (84), Expect = 1.5, Method: Composition-based stats.
Identities = 20/69 (28%), Positives = 37/69 (53%), Gaps = 1/69 (1%)
Query: 17 QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVS 76
++++LKG +VFS +P P L + E+ G+ +++ THV++ + KV+
Sbjct: 431 KKDILKGTYIVFSGVYPLGTPIQKQPLRWLAEEFGSVVQNDINNETTHVIAQRKGTSKVN 490
Query: 77 LG-SKGGQV 84
SKG +V
Sbjct: 491 KALSKGLKV 499
>gi|328713585|ref|XP_001947680.2| PREDICTED: RNA polymerase II subunit A C-terminal domain
phosphatase-like [Acyrthosiphon pisum]
Length = 736
Score = 36.6 bits (83), Expect = 1.9, Method: Composition-based stats.
Identities = 21/64 (32%), Positives = 31/64 (48%)
Query: 17 QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVS 76
+ ++L G KLVFS P+ P +KV LGA + + P TH+V+ + K S
Sbjct: 419 RSKILAGKKLVFSGLVPTPVPLTESRAYKVARLLGAEVTENIKPDSTHLVAVRQGTLKAS 478
Query: 77 LGSK 80
K
Sbjct: 479 AARK 482
>gi|66805733|ref|XP_636588.1| hypothetical protein DDB_G0288707 [Dictyostelium discoideum AX4]
gi|60464974|gb|EAL63085.1| hypothetical protein DDB_G0288707 [Dictyostelium discoideum AX4]
Length = 985
Score = 36.6 bits (83), Expect = 1.9, Method: Composition-based stats.
Identities = 18/53 (33%), Positives = 31/53 (58%), Gaps = 3/53 (5%)
Query: 17 QREVLKGCKLVFSHAFPSKF-PAHIHY--LWKVVEQLGATCSIELDPSVTHVV 66
+ VL C +VFS FP + P+ + + + K+ E GA S+E+D + TH++
Sbjct: 879 RSSVLMDCNIVFSGIFPKQIDPSKLCHTRVSKITESFGAKISLEIDSTTTHLI 931
>gi|449675210|ref|XP_002161785.2| PREDICTED: RNA polymerase II subunit A C-terminal domain
phosphatase-like [Hydra magnipapillata]
Length = 718
Score = 36.6 bits (83), Expect = 2.0, Method: Composition-based stats.
Identities = 20/64 (31%), Positives = 29/64 (45%), Gaps = 6/64 (9%)
Query: 17 QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIEL------DPSVTHVVSNKC 70
+R+ LKGC +VF+ P+ P WK LGA + E+ THVV+ +
Sbjct: 444 RRQTLKGCNIVFTGVIPTNCPLEKSKAWKTAVSLGARVTSEVVGKEEDGLRTTHVVAARH 503
Query: 71 SNEK 74
K
Sbjct: 504 GTHK 507
>gi|159483481|ref|XP_001699789.1| hypothetical protein CHLREDRAFT_141879 [Chlamydomonas reinhardtii]
gi|158281731|gb|EDP07485.1| predicted protein [Chlamydomonas reinhardtii]
Length = 375
Score = 36.6 bits (83), Expect = 2.0, Method: Composition-based stats.
Identities = 21/57 (36%), Positives = 28/57 (49%), Gaps = 1/57 (1%)
Query: 20 VLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSV-THVVSNKCSNEKV 75
+L G + FS + LW++ E LGATC DP+V THVV+ KV
Sbjct: 309 ILTGVHITFSRCWAQDKDPRKEPLWQLAEGLGATCLPAYDPAVTTHVVAAGGGTAKV 365
>gi|392578708|gb|EIW71836.1| hypothetical protein TREMEDRAFT_67978 [Tremella mesenterica DSM
1558]
Length = 944
Score = 36.2 bits (82), Expect = 2.3, Method: Composition-based stats.
Identities = 19/62 (30%), Positives = 23/62 (37%)
Query: 19 EVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVSLG 78
+V GC VFS W+ E GA C L TH ++ EKV
Sbjct: 672 QVFDGCYFVFSGIIARDVEPETTSHWQWAEMFGARCQPTLTRKTTHCITTNAGTEKVYQA 731
Query: 79 SK 80
SK
Sbjct: 732 SK 733
>gi|330799899|ref|XP_003287978.1| hypothetical protein DICPUDRAFT_55168 [Dictyostelium purpureum]
gi|325082002|gb|EGC35499.1| hypothetical protein DICPUDRAFT_55168 [Dictyostelium purpureum]
Length = 730
Score = 36.2 bits (82), Expect = 2.3, Method: Composition-based stats.
Identities = 21/69 (30%), Positives = 37/69 (53%), Gaps = 1/69 (1%)
Query: 17 QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVS 76
++E+LK +VFS +P P + L + E+ GA+ ++ THV++ + KV+
Sbjct: 443 KKEILKDQFIVFSGVYPLGTPVNKQPLRYLAEEFGASVENDITSKTTHVIAQRKGTSKVN 502
Query: 77 LG-SKGGQV 84
SKG +V
Sbjct: 503 KAISKGLKV 511
>gi|383859141|ref|XP_003705055.1| PREDICTED: RNA polymerase II subunit A C-terminal domain
phosphatase-like isoform 2 [Megachile rotundata]
Length = 759
Score = 35.8 bits (81), Expect = 3.4, Method: Composition-based stats.
Identities = 20/62 (32%), Positives = 28/62 (45%)
Query: 19 EVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVSLG 78
+VLKG + FS P+ H +KV GA S EL TH+V+ + K +
Sbjct: 474 QVLKGVHITFSGLIPTHQKIHQSRAYKVARAFGAEVSQELTDKTTHLVAIRPGTAKANAA 533
Query: 79 SK 80
K
Sbjct: 534 KK 535
>gi|383859139|ref|XP_003705054.1| PREDICTED: RNA polymerase II subunit A C-terminal domain
phosphatase-like isoform 1 [Megachile rotundata]
Length = 760
Score = 35.8 bits (81), Expect = 3.4, Method: Composition-based stats.
Identities = 20/62 (32%), Positives = 28/62 (45%)
Query: 19 EVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVSLG 78
+VLKG + FS P+ H +KV GA S EL TH+V+ + K +
Sbjct: 474 QVLKGVHITFSGLIPTHQKIHQSRAYKVARAFGAEVSQELTDKTTHLVAIRPGTAKANAA 533
Query: 79 SK 80
K
Sbjct: 534 KK 535
>gi|24762673|ref|NP_611934.1| Fcp1 [Drosophila melanogaster]
gi|7291810|gb|AAF47230.1| Fcp1 [Drosophila melanogaster]
Length = 880
Score = 35.8 bits (81), Expect = 3.5, Method: Composition-based stats.
Identities = 19/64 (29%), Positives = 31/64 (48%)
Query: 17 QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVS 76
+ EVL+G LVFS P++ + + + LGA +D +TH+V+ KV+
Sbjct: 587 RSEVLRGKNLVFSGLVPTQMKLEQSRAYFIAKSLGAEVKPNIDKEITHLVAVNAGTYKVN 646
Query: 77 LGSK 80
K
Sbjct: 647 AAKK 650
>gi|21483550|gb|AAM52750.1| SD01014p [Drosophila melanogaster]
Length = 896
Score = 35.8 bits (81), Expect = 3.5, Method: Composition-based stats.
Identities = 19/64 (29%), Positives = 31/64 (48%)
Query: 17 QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVS 76
+ EVL+G LVFS P++ + + + LGA +D +TH+V+ KV+
Sbjct: 603 RSEVLRGKNLVFSGLVPTQMKLEQSRAYFIAKSLGAEVKPNIDKEITHLVAVNAGTYKVN 662
Query: 77 LGSK 80
K
Sbjct: 663 AAKK 666
>gi|342320998|gb|EGU12936.1| RNA Polymerase II CTD phosphatase Fcp1, putative [Rhodotorula
glutinis ATCC 204091]
Length = 817
Score = 35.4 bits (80), Expect = 3.8, Method: Composition-based stats.
Identities = 23/59 (38%), Positives = 30/59 (50%), Gaps = 4/59 (6%)
Query: 19 EVLKGCKLVFSH--AFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKV 75
+ L+ LVFS A S+ P Y WK+ GA CS +L S TH+V+N KV
Sbjct: 509 QTLRDTHLVFSGLVALGSR-PEDSEY-WKLARTFGARCSADLSSSTTHLVANGWGTAKV 565
>gi|307212079|gb|EFN87962.1| RNA polymerase II subunit A C-terminal domain phosphatase
[Harpegnathos saltator]
Length = 734
Score = 35.4 bits (80), Expect = 3.9, Method: Composition-based stats.
Identities = 25/85 (29%), Positives = 36/85 (42%), Gaps = 14/85 (16%)
Query: 10 FFCT---ENGQR-----------EVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCS 55
F+CT NG+R +VLKG L FS P+ H +KV GA +
Sbjct: 480 FYCTLDKGNGRRSLRDIIPRVRSQVLKGLYLTFSGLIPTHQKLHQSRAYKVARAFGAEVT 539
Query: 56 IELDPSVTHVVSNKCSNEKVSLGSK 80
+L TH+V+ + K + K
Sbjct: 540 QDLTEKTTHLVAIRKGTAKANAAKK 564
>gi|156549638|ref|XP_001604265.1| PREDICTED: RNA polymerase II subunit A C-terminal domain
phosphatase-like, partial [Nasonia vitripennis]
Length = 512
Score = 35.4 bits (80), Expect = 4.0, Method: Composition-based stats.
Identities = 22/68 (32%), Positives = 31/68 (45%)
Query: 17 QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVS 76
+ VLKG L FS P+ H +KV GA S +L TH+V+ + KV
Sbjct: 296 RSRVLKGLCLTFSGLVPNNQKLHQSRAYKVARAFGAQASQDLTEQTTHLVAIQPGTVKVR 355
Query: 77 LGSKGGQV 84
+ G+V
Sbjct: 356 EAKRQGKV 363
>gi|380022133|ref|XP_003694908.1| PREDICTED: LOW QUALITY PROTEIN: RNA polymerase II subunit A
C-terminal domain phosphatase-like [Apis florea]
Length = 749
Score = 35.4 bits (80), Expect = 4.2, Method: Composition-based stats.
Identities = 19/66 (28%), Positives = 29/66 (43%)
Query: 19 EVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVSLG 78
+VLKG L FS P+ H +KV GA + +L TH+V+ + K +
Sbjct: 472 QVLKGVHLTFSGLIPTHQKLHQSRAYKVARAFGAEVAQDLSEKTTHLVAIRPGTAKANTA 531
Query: 79 SKGGQV 84
K +
Sbjct: 532 KKNSNI 537
>gi|424513770|emb|CCO66392.1| predicted protein [Bathycoccus prasinos]
Length = 546
Score = 35.4 bits (80), Expect = 4.5, Method: Composition-based stats.
Identities = 25/65 (38%), Positives = 36/65 (55%), Gaps = 3/65 (4%)
Query: 20 VLKGCKLVFSHAFPS--KFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVSL 77
+LKGC ++ S PS + P H L V LGAT + ++ +VTHV++ + EKV
Sbjct: 456 LLKGCVILPSGITPSNDERPDR-HPLLLVAVGLGATIATAMNDNVTHVLARADNTEKVKW 514
Query: 78 GSKGG 82
G K G
Sbjct: 515 GRKRG 519
>gi|157109625|ref|XP_001650754.1| RNA polymerase ii ctd phosphatase [Aedes aegypti]
gi|108868428|gb|EAT32653.1| AAEL015142-PA, partial [Aedes aegypti]
Length = 569
Score = 35.0 bits (79), Expect = 5.4, Method: Composition-based stats.
Identities = 20/68 (29%), Positives = 31/68 (45%)
Query: 17 QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVS 76
+ +VL G LVFS P+ ++V LGAT + + P TH+V+ KV
Sbjct: 458 KSQVLVGFNLVFSGLVPNSMKLEESKAYQVARSLGATVTQDFTPDTTHLVAVTFGTSKVH 517
Query: 77 LGSKGGQV 84
K ++
Sbjct: 518 NARKNPKI 525
>gi|452820283|gb|EME27327.1| phosphoprotein phosphatase [Galdieria sulphuraria]
Length = 734
Score = 35.0 bits (79), Expect = 6.2, Method: Composition-based stats.
Identities = 13/50 (26%), Positives = 27/50 (54%)
Query: 17 QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVV 66
+ VL+ C L F+ F + + +W++ E+ GA C+ ++ TH++
Sbjct: 517 RHRVLRNCYLSFTGIFRLEESPEVSTVWRLAEEFGAICNKQVTSQTTHLI 566
>gi|328792425|ref|XP_623605.2| PREDICTED: RNA polymerase II subunit A C-terminal domain
phosphatase-like [Apis mellifera]
Length = 745
Score = 34.7 bits (78), Expect = 6.9, Method: Composition-based stats.
Identities = 19/62 (30%), Positives = 28/62 (45%)
Query: 19 EVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVSLG 78
+VLKG L FS P+ H +KV GA + +L TH+V+ + K +
Sbjct: 469 QVLKGVHLTFSGLIPTHQKLHQSRAYKVARAFGAEVAQDLSKKTTHLVAIRPGTAKANTA 528
Query: 79 SK 80
K
Sbjct: 529 KK 530
>gi|118784887|ref|XP_314000.3| AGAP005119-PA [Anopheles gambiae str. PEST]
gi|116128258|gb|EAA09414.3| AGAP005119-PA [Anopheles gambiae str. PEST]
Length = 822
Score = 34.7 bits (78), Expect = 7.2, Method: Composition-based stats.
Identities = 19/65 (29%), Positives = 30/65 (46%)
Query: 20 VLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVSLGS 79
VL G KL FS P+ + + LGA + L+P+ TH+V+ KV+
Sbjct: 528 VLVGAKLCFSGLIPNNVKLEQSKAYLIARSLGAAVTQNLEPTTTHLVAVTIGTSKVNNAR 587
Query: 80 KGGQV 84
K ++
Sbjct: 588 KNPKI 592
>gi|195383304|ref|XP_002050366.1| GJ22116 [Drosophila virilis]
gi|194145163|gb|EDW61559.1| GJ22116 [Drosophila virilis]
Length = 703
Score = 34.7 bits (78), Expect = 7.9, Method: Composition-based stats.
Identities = 18/66 (27%), Positives = 32/66 (48%)
Query: 19 EVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVSLG 78
EVL+G LVFS P++ + + + LGA ++ +TH+V+ KV+
Sbjct: 413 EVLRGQNLVFSGLVPTQMKLEQSRAYFIAKSLGAEVQSNINKDITHLVAVNAGTYKVNAA 472
Query: 79 SKGGQV 84
K ++
Sbjct: 473 KKESKI 478
>gi|357601986|gb|EHJ63229.1| putative RNA polymerase II subunit A C-terminal domain phosphatase
[Danaus plexippus]
Length = 683
Score = 34.7 bits (78), Expect = 8.2, Method: Composition-based stats.
Identities = 19/67 (28%), Positives = 32/67 (47%)
Query: 17 QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVS 76
+ +VL G LVFS P+ ++V + LGA + + TH+V+ + KV+
Sbjct: 430 KSQVLAGSSLVFSGLVPTHQRLETSRAYQVAKTLGAEVTQDFTDKTTHLVAMRAGTAKVN 489
Query: 77 LGSKGGQ 83
K G+
Sbjct: 490 ASKKLGE 496
>gi|195029035|ref|XP_001987380.1| GH21892 [Drosophila grimshawi]
gi|193903380|gb|EDW02247.1| GH21892 [Drosophila grimshawi]
Length = 889
Score = 34.7 bits (78), Expect = 8.3, Method: Composition-based stats.
Identities = 18/68 (26%), Positives = 32/68 (47%)
Query: 17 QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVS 76
+ EVL+G LVFS P++ + + + LGA ++ +TH+V+ KV+
Sbjct: 595 RSEVLRGQNLVFSGLVPTQMKMEQSRAYFIAKSLGAEVKSNINKDITHLVAVNAGTYKVN 654
Query: 77 LGSKGGQV 84
K +
Sbjct: 655 AAKKEANI 662
>gi|313234471|emb|CBY24671.1| unnamed protein product [Oikopleura dioica]
Length = 614
Score = 34.3 bits (77), Expect = 8.8, Method: Composition-based stats.
Identities = 18/69 (26%), Positives = 31/69 (44%)
Query: 17 QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVS 76
++ +LKGC+LVFS P+ H K +GA + + TH++ + K +
Sbjct: 377 RKNILKGCQLVFSGVVPNGCRMEEHRAVKNARAMGAVIHERIQKNTTHLICARPGTAKHN 436
Query: 77 LGSKGGQVF 85
+ VF
Sbjct: 437 EAKRKANVF 445
>gi|307168754|gb|EFN61749.1| RNA polymerase II subunit A C-terminal domain phosphatase
[Camponotus floridanus]
Length = 721
Score = 34.3 bits (77), Expect = 9.5, Method: Composition-based stats.
Identities = 19/68 (27%), Positives = 31/68 (45%)
Query: 17 QREVLKGCKLVFSHAFPSKFPAHIHYLWKVVEQLGATCSIELDPSVTHVVSNKCSNEKVS 76
+ +VLKG L FS P+ H ++KV GA + +L TH+V+ + K +
Sbjct: 477 RSQVLKGLCLTFSGLIPTHQKLHQSRVYKVARAFGAEITQDLTEKTTHLVAIRKGTAKAN 536
Query: 77 LGSKGGQV 84
K +
Sbjct: 537 AARKDANI 544
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.321 0.133 0.413
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,744,306,824
Number of Sequences: 23463169
Number of extensions: 62356907
Number of successful extensions: 119952
Number of sequences better than 100.0: 171
Number of HSP's better than 100.0 without gapping: 140
Number of HSP's successfully gapped in prelim test: 31
Number of HSP's that attempted gapping in prelim test: 119767
Number of HSP's gapped (non-prelim): 172
length of query: 113
length of database: 8,064,228,071
effective HSP length: 81
effective length of query: 32
effective length of database: 6,163,711,382
effective search space: 197238764224
effective search space used: 197238764224
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 69 (31.2 bits)