BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 040058
(326 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255570505|ref|XP_002526210.1| RNA polymerase II ctd phosphatase, putative [Ricinus communis]
gi|223534449|gb|EEF36151.1| RNA polymerase II ctd phosphatase, putative [Ricinus communis]
Length = 478
Score = 210 bits (535), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 128/360 (35%), Positives = 202/360 (56%), Gaps = 60/360 (16%)
Query: 25 LSCAHTTVRDSRCIFCSQAMNDSFGLSFDYMLRGLRYSEQE--------------ERKLQ 70
++C H CI C + + + G++F Y+ +GLR + E RKL
Sbjct: 109 VACTHPGSFGDMCILCGERLIEETGVTFGYIHKGLRLANDEIVRLRNTDMKNLLRHRKLY 168
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFI----GSLFQMA-NDKLVKLRPFVRTFL 125
LVL+LDHTLL+ + L++ E+YLK QI S GSLF + + KLRPF+RTFL
Sbjct: 169 LVLDLDHTLLNSTQLMHLTAEEEYLKSQIDSMQDVSNGSLFMVDFMHMMTKLRPFIRTFL 228
Query: 126 EQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVRGQER 185
++AS + ++Y+ TM R YA K LD +YF++R+I+R+D + +K D+V GQE
Sbjct: 229 KEASQMFEMYIYTMGDRAYALEMAKFLDPGREYFNARVISRDDGTQRHQKGLDIVLGQES 288
Query: 186 GIVILDDTESVWSDHTENLIVLGKYVYFRD--KELNGDHKSYSETLTDESENEEALANVL 243
++ILDDTE+ W+ H +NLI++ +Y +F ++ + KS S+ +DE+E++ ALA+VL
Sbjct: 289 AVLILDDTENAWTKHKDNLILMERYHFFASSCRQFGFECKSLSQLKSDENESDGALASVL 348
Query: 244 RVLKTIHRLFF----DSVCG-DVRTYLPKVRSEFSRDV-LYFSAIF-------RDCLW-- 288
+VL+ IH +FF D++ G DVR L VR + + + FS +F LW
Sbjct: 349 KVLRRIHHIFFDELEDAIDGRDVRQVLSTVRKDVLKGCKIVFSRVFPTQFQADNHHLWKM 408
Query: 289 AEQ------------------------EEKFLVQEKKFLVHPRWIDAYYFLWRRRPEDDY 324
AEQ + ++ ++ KFLVHPRWI+A ++W+R+PE+++
Sbjct: 409 AEQLGATCSREVDPSVTHVVSAEAGTEKSRWALKNDKFLVHPRWIEATNYMWQRQPEENF 468
>gi|449447765|ref|XP_004141638.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
4-like [Cucumis sativus]
Length = 452
Score = 209 bits (532), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 127/358 (35%), Positives = 195/358 (54%), Gaps = 60/358 (16%)
Query: 27 CAHTTVRDSRCIFCSQAMNDSFGLSFDYMLRGLRYSEQE--------------ERKLQLV 72
C+H + CI C Q +++ G++F Y+ + LR + E +KL LV
Sbjct: 82 CSHPGSFGNMCIICGQRLDEESGVTFGYIHKELRLNNDEINRMRNKEMKELLQRKKLILV 141
Query: 73 LNLDHTLLHCRNIKSLSSGEKYLKKQIHSF----IGSLFQMAN-DKLVKLRPFVRTFLEQ 127
L+LDHTLL+ ++ L+ E+YL+ Q S GSLF + + + KLRPFV +FL++
Sbjct: 142 LDLDHTLLNSTELRYLTVEEEYLRSQTDSLDDVTKGSLFLLNSVHTMTKLRPFVHSFLKE 201
Query: 128 ASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVRGQERGI 187
AS L ++Y+ TM R YA KLLD +YFSS++I+R+D K +K D+V G+E +
Sbjct: 202 ASKLFEMYIYTMGERRYAFEMAKLLDPKKEYFSSKVISRDDGTQKHQKGLDVVLGKESAV 261
Query: 188 VILDDTESVWSDHTENLIVLGKYVYFRD--KELNGDHKSYSETLTDESENEEALANVLRV 245
+ILDDTE+ W+ H ENLI++ +Y +F ++ + KS SE DESE + AL +L+V
Sbjct: 262 LILDDTENAWTKHKENLILMERYHFFASSCRQFGFNCKSLSELKNDESETDGALTTILKV 321
Query: 246 LKTIHRLFFDSVCG-----DVRTYLPKVRSEFSRDV-LYFSAIFRDCLWAE--------- 290
LK +H +FF+ V G DVR L VR+E + FS +F AE
Sbjct: 322 LKQVHHMFFNEVSGDLVDRDVRQVLKTVRAEVLEGCKVVFSRVFPTKFQAENHQLWKMVE 381
Query: 291 ------------------------QEEKFLVQEKKFLVHPRWIDAYYFLWRRRPEDDY 324
++ ++ ++EKKFLVHPRWI+A + W+R+ E+++
Sbjct: 382 QLGGTCSTELDQSVTHVVATDAGTEKSRWALKEKKFLVHPRWIEASNYFWKRQMEENF 439
>gi|356564913|ref|XP_003550691.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
4-like [Glycine max]
Length = 442
Score = 208 bits (529), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 134/380 (35%), Positives = 206/380 (54%), Gaps = 60/380 (15%)
Query: 5 SCKECVGKT-KFVIKRKCEQS----LSCAHTTVRDSRCIFCSQAMNDSFGLSFDYMLRGL 59
S +E G T + ++KR E S + C H + CI C Q ++ G++F Y+ +GL
Sbjct: 57 SIEETEGSTSEGIVKRSLEASSEVDVCCTHPGSFGNMCIRCGQKLDGESGVTFGYIHKGL 116
Query: 60 RYSEQE--------------ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFI-- 103
R ++E +KL LVL+LDHTLL+ ++ L+S E +L Q S
Sbjct: 117 RLHDEEISRLRNTDMKSLLGRKKLYLVLDLDHTLLNSTHLAQLTSEELHLLNQTDSLTNV 176
Query: 104 --GSLFQMAN-DKLVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFS 160
GSLF++ + + + KLRPFVR FL++AS + ++Y+ TM R YA KLLD +YF+
Sbjct: 177 SKGSLFKLEHMNMMTKLRPFVRPFLKEASEMFEMYIYTMGDRPYALEMAKLLDPQGEYFN 236
Query: 161 SRIIAREDFNGKDRKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRD--KEL 218
+++I+R+D K +K D+V GQE ++ILDDTE W H +NLI++ +Y +F ++
Sbjct: 237 AKVISRDDGTQKHQKGLDVVLGQESAVIILDDTEHAWMKHKDNLILMERYHFFGSSCRQF 296
Query: 219 NGDHKSYSETLTDESENEEALANVLRVLKTIHRLFFDSVCG----DVRTYLPKVRSE-FS 273
+ KS +E +DE E + ALA +L+VLK +H +FFD DVR L VR E S
Sbjct: 297 GFNCKSLAELKSDEDETDGALAKILKVLKQVHCMFFDKQEDFDDQDVRQVLSSVRREVLS 356
Query: 274 RDVLYFSAIFRDCL-----WAEQEE------------------------KFLVQEKKFLV 304
V+ FS I + AEQ ++ V+EKKF+V
Sbjct: 357 GCVIIFSRIVHGAIPSLRKMAEQMGATCLTEIDPSVTHVVATDAGTEKCRWAVKEKKFVV 416
Query: 305 HPRWIDAYYFLWRRRPEDDY 324
HP WI+A + W+++PE+++
Sbjct: 417 HPLWIEAANYFWQKQPEENF 436
>gi|356498756|ref|XP_003518215.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
4-like [Glycine max]
Length = 428
Score = 203 bits (517), Expect = 7e-50, Method: Compositional matrix adjust.
Identities = 131/374 (35%), Positives = 203/374 (54%), Gaps = 58/374 (15%)
Query: 10 VGKTKFVIKRKCEQSLS---CAHTTVRDSRCIFCSQAMNDSFGLSFDYMLRGLRYSEQE- 65
+ + KF + E S S C H + CI C Q ++ G++F Y+ +GLR ++E
Sbjct: 50 IKRRKFESIEETEGSTSEGVCTHPGSFGNMCIRCGQKLDGESGVTFGYIHKGLRLHDEEI 109
Query: 66 -------------ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSF----IGSLFQ 108
+KL LVL+LDHTLL+ ++ L+S E +L Q S GSLF+
Sbjct: 110 SRLRNTDMKSLLCRKKLYLVLDLDHTLLNSTHLAHLTSEESHLLNQTDSLRDVSKGSLFK 169
Query: 109 MAN-DKLVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE 167
+ + + + KLRPFVR FL++AS + ++Y+ TM R YA KLLD +YF++++I+R+
Sbjct: 170 LEHMNMMTKLRPFVRPFLKEASEMFEMYIYTMGDRPYALEMAKLLDPQGEYFNAKVISRD 229
Query: 168 DFNGKDRKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRD--KELNGDHKSY 225
D K +K D+V GQE ++ILDDTE W H +NLI++ +Y +F ++ + KS
Sbjct: 230 DGTQKHQKGLDVVLGQESAVLILDDTEHAWMKHKDNLILMERYHFFGSSCRQFGFNCKSL 289
Query: 226 SETLTDESENEEALANVLRVLKTIHRLFFDSVCG----DVRTYLPKVRSE-FSRDVLYFS 280
+E +DE+E + ALA +L+VLK +H +FFD DVR L VR E S V+ FS
Sbjct: 290 AELKSDENETDGALAKILKVLKQVHCMFFDKQEDFDDRDVRQMLSLVRREVLSGCVIIFS 349
Query: 281 AIFRDCL-----WAEQEE------------------------KFLVQEKKFLVHPRWIDA 311
I + AEQ ++ V+EKKF+VHP WI+A
Sbjct: 350 RIVHGAIPSLRKMAEQMGATCLTEIDPSVTHVVATDAGTEKCRWAVKEKKFVVHPLWIEA 409
Query: 312 YYFLWRRRPEDDYL 325
+ W+++PE++++
Sbjct: 410 ANYFWQKQPEENFI 423
>gi|9758369|dbj|BAB08870.1| unnamed protein product [Arabidopsis thaliana]
Length = 1065
Score = 201 bits (512), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 132/361 (36%), Positives = 189/361 (52%), Gaps = 64/361 (17%)
Query: 27 CAHTTVRDSRCIFCSQAMNDSFGLSFDYMLRGLRYSE--------------QEERKLQLV 72
C H + C C Q + ++ G+SF Y+ + +R +E Q +RKL LV
Sbjct: 693 CEHPGSFGNMCFVCGQKLEET-GVSFRYIHKEMRLNEDEISRLRDSDSRFLQRQRKLYLV 751
Query: 73 LNLDHTLLHCRNIKSLSSGEKYLKKQIHSFI-------GSLFQMA-NDKLVKLRPFVRTF 124
L+LDHTLL+ ++ L E+YLK HS GSLF + + KLRPFV +F
Sbjct: 752 LDLDHTLLNTTILRDLKPEEEYLKSHTHSLQDGCNVSGGSLFLLEFMQMMTKLRPFVHSF 811
Query: 125 LEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVRGQE 184
L++AS + +Y+ TM R YA KLLD +YF R+I+R+D + K+ D+V GQE
Sbjct: 812 LKEASEMFVMYIYTMGDRNYARQMAKLLDPKGEYFGDRVISRDDGTVRHEKSLDVVLGQE 871
Query: 185 RGIVILDDTESVWSDHTENLIVLGKYVYFRDKELNGDH--KSYSETLTDESENEEALANV 242
++ILDDTE+ W H +NLIV+ +Y +F DH KS SE +DESE + ALA V
Sbjct: 872 SAVLILDDTENAWPKHKDNLIVIERYHFFSSSCRQFDHRYKSLSELKSDESEPDGALATV 931
Query: 243 LRVLKTIHRLFFDSV-----CGDVRTYLPKVRSEFSRDV-LYFSAIFRD-------CLWA 289
L+VLK H LFF++V DVR L +VR E + + FS +F LW
Sbjct: 932 LKVLKQAHALFFENVDEGISNRDVRLMLKQVRKEILKGCKIVFSRVFPTKAKPEDHPLWK 991
Query: 290 EQEE--------------------------KFLVQEKKFLVHPRWIDAYYFLWRRRPEDD 323
EE ++ V+EKK++VH WIDA +LW ++PE++
Sbjct: 992 MAEELGATCATEVDASVTHVVAMDVGTEKARWAVREKKYVVHRGWIDAANYLWMKQPEEN 1051
Query: 324 Y 324
+
Sbjct: 1052 F 1052
>gi|145334837|ref|NP_001078764.1| RNA polymerase II C-terminal domain phosphatase-like 4 [Arabidopsis
thaliana]
gi|122154038|sp|Q00IB6.1|CPL4_ARATH RecName: Full=RNA polymerase II C-terminal domain phosphatase-like
4; Short=FCP-like 4; AltName: Full=Carboxyl-terminal
phosphatase-like 4; Short=AtCPL4; Short=CTD
phosphatase-like 4
gi|95115186|gb|ABF55959.1| carboxyl-terminal phosphatase-like 4 [Arabidopsis thaliana]
gi|332009601|gb|AED96984.1| RNA polymerase II C-terminal domain phosphatase-like 4 [Arabidopsis
thaliana]
Length = 440
Score = 201 bits (510), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 132/361 (36%), Positives = 189/361 (52%), Gaps = 64/361 (17%)
Query: 27 CAHTTVRDSRCIFCSQAMNDSFGLSFDYMLRGLRYSE--------------QEERKLQLV 72
C H + C C Q + ++ G+SF Y+ + +R +E Q +RKL LV
Sbjct: 68 CEHPGSFGNMCFVCGQKLEET-GVSFRYIHKEMRLNEDEISRLRDSDSRFLQRQRKLYLV 126
Query: 73 LNLDHTLLHCRNIKSLSSGEKYLKKQIHSFI-------GSLFQMA-NDKLVKLRPFVRTF 124
L+LDHTLL+ ++ L E+YLK HS GSLF + + KLRPFV +F
Sbjct: 127 LDLDHTLLNTTILRDLKPEEEYLKSHTHSLQDGCNVSGGSLFLLEFMQMMTKLRPFVHSF 186
Query: 125 LEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVRGQE 184
L++AS + +Y+ TM R YA KLLD +YF R+I+R+D + K+ D+V GQE
Sbjct: 187 LKEASEMFVMYIYTMGDRNYARQMAKLLDPKGEYFGDRVISRDDGTVRHEKSLDVVLGQE 246
Query: 185 RGIVILDDTESVWSDHTENLIVLGKYVYFRDKELNGDH--KSYSETLTDESENEEALANV 242
++ILDDTE+ W H +NLIV+ +Y +F DH KS SE +DESE + ALA V
Sbjct: 247 SAVLILDDTENAWPKHKDNLIVIERYHFFSSSCRQFDHRYKSLSELKSDESEPDGALATV 306
Query: 243 LRVLKTIHRLFFDSV-----CGDVRTYLPKVRSEFSRDV-LYFSAIFRD-------CLWA 289
L+VLK H LFF++V DVR L +VR E + + FS +F LW
Sbjct: 307 LKVLKQAHALFFENVDEGISNRDVRLMLKQVRKEILKGCKIVFSRVFPTKAKPEDHPLWK 366
Query: 290 EQEE--------------------------KFLVQEKKFLVHPRWIDAYYFLWRRRPEDD 323
EE ++ V+EKK++VH WIDA +LW ++PE++
Sbjct: 367 MAEELGATCATEVDASVTHVVAMDVGTEKARWAVREKKYVVHRGWIDAANYLWMKQPEEN 426
Query: 324 Y 324
+
Sbjct: 427 F 427
>gi|449532013|ref|XP_004172979.1| PREDICTED: LOW QUALITY PROTEIN: RNA polymerase II C-terminal domain
phosphatase-like 4-like, partial [Cucumis sativus]
Length = 340
Score = 191 bits (484), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 115/303 (37%), Positives = 172/303 (56%), Gaps = 46/303 (15%)
Query: 68 KLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSF----IGSLFQMAN-DKLVKLRPFVR 122
KL LVL+LDHTLL+ ++ L+ E+YL+ Q S GSLF + + + KLRPFV
Sbjct: 25 KLILVLDLDHTLLNSTELRYLTVEEEYLRSQTDSLDDVTKGSLFLLNSVHTMTKLRPFVH 84
Query: 123 TFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVRG 182
+FL++AS L ++Y+ TM R YA KLLD +YFSS++I+R+D K +K D+V G
Sbjct: 85 SFLKEASKLFEMYIYTMGERRYAFEMAKLLDPKKEYFSSKVISRDDGTQKHQKGLDVVLG 144
Query: 183 QERGIVILDDTESVWSDHTENLIVLGKYVYFRD--KELNGDHKSYSETLTDESENEEALA 240
+E ++ILDDTE+ W+ H ENLI++ +Y +F ++ + KS SE DESE + AL
Sbjct: 145 KESAVLILDDTENAWTKHKENLILMERYHFFASSCRQFGFNCKSLSELKNDESETDGALT 204
Query: 241 NVLRVLKTIHRLFFDSVCG-----DVRTYLPKVRSEFSRDV-LYFSAIFRDCLWAE---- 290
+L+VLK +H +FF+ V G DVR L VR+E + FS +F AE
Sbjct: 205 TILKVLKQVHHMFFNEVSGDLVDRDVRQVLKTVRAEVLEGCKVVFSRVFPTKFQAENHQL 264
Query: 291 -----------------------------QEEKFLVQEKKFLVHPRWIDAYYFLWRRRPE 321
++ ++ ++EKKFLVHPRWI+A + W+R+ E
Sbjct: 265 WKMVEQLGGTCSTELDQSVTHVVATDAGTEKSRWALKEKKFLVHPRWIEASNYFWKRQME 324
Query: 322 DDY 324
+++
Sbjct: 325 ENF 327
>gi|326518250|dbj|BAK07377.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 488
Score = 190 bits (483), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 134/383 (34%), Positives = 192/383 (50%), Gaps = 70/383 (18%)
Query: 7 KECVGKTKFVIKRKCEQSLSCAHTTVRDSRCIFC--SQAMNDSFGLSFDYMLRGLRYSEQ 64
++ +G K +KC H CI C SQ D G++F Y+ +GLR
Sbjct: 90 EDVIGSVKDAQIKKCP-----PHPGFFGGLCINCGKSQDEEDVPGVAFGYIHKGLRLGTS 144
Query: 65 E--------------ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIG------ 104
E ERKL L+L+LDHTL++ + +S+ E L Q +
Sbjct: 145 EMDRLRESEVKNLLRERKLVLILDLDHTLINSTRLHDISAAEMDLGIQTAASKNADDPER 204
Query: 105 SLFQMAN-DKLVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
SLF + L KLRPFVR FLE+AS++ D+Y+ TM + YA KLLD + YF S++
Sbjct: 205 SLFTLQGMHMLTKLRPFVRKFLEEASNMFDMYIYTMGDKAYAIEIAKLLDPGNVYFDSKV 264
Query: 164 IAREDFNGKDRKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRD--KELNGD 221
I+ D + +K D+V G ++ VI+DDTE VW H ENLI++ +Y YF ++
Sbjct: 265 ISNSDCTQRHQKGLDVVLGDDKVAVIIDDTEHVWQKHKENLILMERYHYFAASCRQFGFS 324
Query: 222 HKSYSETLTDESENEEALANVLRVLKTIHRLFFDSVCG------DVRTYLPKVRSEFSRD 275
+S SE + DE E++ ALA +L VLK IH +FFDS DVR + +VR E +
Sbjct: 325 DQSLSELMQDERESDGALATILDVLKRIHTIFFDSGVETALSSRDVRQVIKRVRQEVLQG 384
Query: 276 V-LYFSAIF-RDC------LW--AEQ------------------------EEKFLVQEKK 301
L FS +F DC +W AEQ + ++ KK
Sbjct: 385 CKLVFSRVFPSDCRSQDQIMWKMAEQLGAVCCSEVDPSVTHVVAVHAGTEKARWAAGNKK 444
Query: 302 FLVHPRWIDAYYFLWRRRPEDDY 324
FL+HPRWI+A + W R+PE+D+
Sbjct: 445 FLLHPRWIEACNYRWHRQPEEDF 467
>gi|242093742|ref|XP_002437361.1| hypothetical protein SORBIDRAFT_10g025580 [Sorghum bicolor]
gi|241915584|gb|EER88728.1| hypothetical protein SORBIDRAFT_10g025580 [Sorghum bicolor]
Length = 558
Score = 189 bits (479), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 125/365 (34%), Positives = 185/365 (50%), Gaps = 63/365 (17%)
Query: 23 QSLSCAHTTVRDSRCIFCSQAMNDS--FGLSFDYMLRGLRYSEQE--------------E 66
Q +C H C C + ++ G++F Y+ +GLR E E
Sbjct: 103 QVEACPHPGYFGGLCFRCGKPQDEENVSGVAFGYIHKGLRLGTSEIDRLRGADLKNLLRE 162
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIG----SLFQM-ANDKLVKLRPFV 121
RKL L+L+LDHTL++ ++ +SS EK L Q + S+F + + L KLRPFV
Sbjct: 163 RKLVLILDLDHTLINSTKLQDISSAEKDLGIQTAASKDDPNRSIFSLDSMQMLTKLRPFV 222
Query: 122 RTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVR 181
R FL++AS++ ++Y+ TM + YA KLLD + YF S++I+ D + +K D++
Sbjct: 223 REFLKEASNMFEMYIYTMGDKAYAIEIAKLLDPSNIYFPSKVISNSDCTQRHQKGLDVIL 282
Query: 182 GQERGIVILDDTESVWSDHTENLIVLGKYVYFRD--KELNGDHKSYSETLTDESENEEAL 239
G E VILDDTE VW H ENLI++ +Y +F ++ +S SE++ DE E++ AL
Sbjct: 283 GAESVAVILDDTEYVWQKHKENLILMERYHFFASSCRQFGFGVRSLSESMQDERESDGAL 342
Query: 240 ANVLRVLKTIHRLFFDSVC------GDVRTYLPKVRSEFSRDV-LYFSAIFRD------- 285
A VL VLK IH +FFD DVR + VR E + + FS +F +
Sbjct: 343 ATVLDVLKRIHSIFFDLAVETDLSSQDVRQVIKAVRKEILQGCKIVFSRVFPNNTRPQEQ 402
Query: 286 CLW--------------------------AEQEEKFLVQEKKFLVHPRWIDAYYFLWRRR 319
LW ++ ++ V KKFLVHPRWI+A F W R+
Sbjct: 403 MLWKMAEHLGAVCSTDVDSSVTHVVTVDLGTEKARWGVANKKFLVHPRWIEAANFRWHRQ 462
Query: 320 PEDDY 324
PE+D+
Sbjct: 463 PEEDF 467
>gi|413945235|gb|AFW77884.1| CPL3 [Zea mays]
Length = 533
Score = 188 bits (478), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 126/372 (33%), Positives = 184/372 (49%), Gaps = 71/372 (19%)
Query: 20 KCEQSLSCAHTTVRDSRCIFCSQAMN--DSFGLSFDYMLRGLRYSEQE------------ 65
K Q +C H CI C + + D G++F Y+ +GLR E
Sbjct: 98 KIVQVEACPHPGHFGGLCIICGKPQDEEDVSGVAFGYIHKGLRLGTSEIDRLRGADLKNL 157
Query: 66 --ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHS---------FIGSLFQMANDKL 114
ERKL L+L+LDHTL++ ++ +SS EK L Q + F L M L
Sbjct: 158 LRERKLVLILDLDHTLINSTKLQDISSAEKDLGIQSAASKDDPNRSIFALDLMPM----L 213
Query: 115 VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDR 174
KLRPFVR FL++AS++ ++Y+ TM + YA KLLD + YF S++I+ D + +
Sbjct: 214 TKLRPFVREFLKEASNMFEMYIYTMGDKAYAIEIAKLLDPSNIYFPSKVISNSDCTQRHQ 273
Query: 175 KNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRD--KELNGDHKSYSETLTDE 232
K D++ G E VILDDTE VW H ENLI++ +Y +F ++ +S SE+L DE
Sbjct: 274 KGLDVILGAESVAVILDDTEYVWQKHKENLILMERYHFFASSCRQFGFGVRSLSESLQDE 333
Query: 233 SENEEALANVLRVLKTIHRLFFDSVCG------DVRTYLPKVRSEFSRDV-LYFSAIFRD 285
E++ ALA VL VLK IH FFD D+R + +R E + + FS +F +
Sbjct: 334 RESDGALATVLDVLKRIHATFFDMAAETDLSSRDIRQVIKTLRKEILQGCKIVFSRVFPN 393
Query: 286 -------CLW--------------------------AEQEEKFLVQEKKFLVHPRWIDAY 312
+W ++ ++ + KKFLVHPRWI+A
Sbjct: 394 NTRPQEQMVWKMAEYLGAVCVKDVDPSVTHVVTVDLGTEKARWGLNNKKFLVHPRWIEAA 453
Query: 313 YFLWRRRPEDDY 324
F W R+PE+D+
Sbjct: 454 NFRWHRQPEEDF 465
>gi|226497696|ref|NP_001152445.1| CPL3 [Zea mays]
gi|195656359|gb|ACG47647.1| CPL3 [Zea mays]
Length = 531
Score = 187 bits (476), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 126/372 (33%), Positives = 184/372 (49%), Gaps = 71/372 (19%)
Query: 20 KCEQSLSCAHTTVRDSRCIFCSQAMN--DSFGLSFDYMLRGLRYSEQE------------ 65
K Q +C H CI C + + D G++F Y+ +GLR E
Sbjct: 96 KIVQVEACPHPGHFGGLCIICGKPQDEEDVSGVAFGYIHKGLRLGTSEIDRLRGADLKNL 155
Query: 66 --ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHS---------FIGSLFQMANDKL 114
ERKL L+L+LDHTL++ ++ +SS EK L Q + F L M L
Sbjct: 156 LRERKLVLILDLDHTLINSTKLQDISSAEKDLGIQSAASKDDPNRSIFALDLMPM----L 211
Query: 115 VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDR 174
KLRPFVR FL++AS++ ++Y+ TM + YA KLLD + YF S++I+ D + +
Sbjct: 212 TKLRPFVREFLKEASNMFEMYIYTMGDKAYAIEIAKLLDPSNIYFPSKVISNSDCTQRHQ 271
Query: 175 KNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRD--KELNGDHKSYSETLTDE 232
K D++ G E VILDDTE VW H ENLI++ +Y +F ++ +S SE+L DE
Sbjct: 272 KGLDVILGAESVAVILDDTEYVWQKHKENLILMERYHFFASSCRQFGFGVRSLSESLQDE 331
Query: 233 SENEEALANVLRVLKTIHRLFFDSVCG------DVRTYLPKVRSEFSRDV-LYFSAIFRD 285
E++ ALA VL VLK IH FFD D+R + +R E + + FS +F +
Sbjct: 332 RESDGALATVLDVLKRIHATFFDMAAETDLSSRDIRQVIKTLRKEILQGCKIVFSRVFPN 391
Query: 286 -------CLW--------------------------AEQEEKFLVQEKKFLVHPRWIDAY 312
+W ++ ++ + KKFLVHPRWI+A
Sbjct: 392 NTRPQEQMVWKMAEYLGAVCVKDVDPSVTHVVTVDLGTEKSRWGLNNKKFLVHPRWIEAA 451
Query: 313 YFLWRRRPEDDY 324
F W R+PE+D+
Sbjct: 452 NFRWHRQPEEDF 463
>gi|297793317|ref|XP_002864543.1| hypothetical protein ARALYDRAFT_332090 [Arabidopsis lyrata subsp.
lyrata]
gi|297310378|gb|EFH40802.1| hypothetical protein ARALYDRAFT_332090 [Arabidopsis lyrata subsp.
lyrata]
Length = 1006
Score = 186 bits (472), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 124/362 (34%), Positives = 186/362 (51%), Gaps = 71/362 (19%)
Query: 27 CAHTTVRDSRCIFCSQAMNDSFGLSFDYMLRGLRYSE--------------QEERKLQLV 72
C H + C C Q + ++ G+SF Y+ + +R +E Q +RKL LV
Sbjct: 638 CQHPGSFGNMCFVCGQKLEET-GVSFRYIHKEMRLNEDEISRLRDSDSRFLQRQRKLYLV 696
Query: 73 LNLDHTLLHCRNIKSLSSGEKYLKKQIHSFI-------------GSLFQMA-NDKLVKLR 118
L+LDHTLL+ ++ L E+YLK HS GSLF + + KLR
Sbjct: 697 LDLDHTLLNSTVLRDLKPEEEYLKSHTHSLQEPFDFLLISDVSGGSLFMLEFMHMMTKLR 756
Query: 119 PFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPD 178
PFV +FL++AS + +Y+ TM R YA KLLD +YF RII+R+D + +K+ D
Sbjct: 757 PFVHSFLKEASEMFVMYIYTMGDRAYARQMAKLLDPRGEYFGDRIISRDDGTVRHQKSLD 816
Query: 179 LVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRD--KELNGDHKSYSETLTDESENE 236
+V GQE ++ILDDTE+ W +H +NLIV+ +Y +F ++ + +KS SE +DESE +
Sbjct: 817 VVLGQESAVLILDDTENAWPNHKDNLIVIERYHFFASSCRQFDHKYKSLSELKSDESEPD 876
Query: 237 EALANVLRVLKTIHRLFFDSVCGDVRTYLPKVRSEFSRDV-LYFSAIFRD-------CLW 288
ALA VL+ + D DVR+ L +VR E + + FS +F LW
Sbjct: 877 GALATVLKNVDE------DISNRDVRSMLKQVRKEVLKGCKVVFSRVFPTKAKPEDHPLW 930
Query: 289 AEQEE--------------------------KFLVQEKKFLVHPRWIDAYYFLWRRRPED 322
EE ++ V+EKK++VH WIDA +LW+++PE+
Sbjct: 931 KMAEELGATCATEVDASVTHVVAMDVGTEKARWAVREKKYVVHRGWIDAANYLWKKQPEE 990
Query: 323 DY 324
+
Sbjct: 991 KF 992
>gi|115463681|ref|NP_001055440.1| Os05g0390500 [Oryza sativa Japonica Group]
gi|57863785|gb|AAS86390.2| unknown protein [Oryza sativa Japonica Group]
gi|113578991|dbj|BAF17354.1| Os05g0390500 [Oryza sativa Japonica Group]
gi|215695102|dbj|BAG90293.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222631469|gb|EEE63601.1| hypothetical protein OsJ_18418 [Oryza sativa Japonica Group]
Length = 536
Score = 186 bits (472), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 135/387 (34%), Positives = 189/387 (48%), Gaps = 76/387 (19%)
Query: 5 SCKECVGKTKFVIKRKCEQSLSCAHTTVRDSRCIFCS--QAMNDSFGLSFDYMLRGLRYS 62
S ++ VG +K V +C H C C Q D G++F Y+ +GLR
Sbjct: 96 SDEDTVGSSKDVKIDECP-----PHPGFFGGLCYRCGKRQDEEDVPGVAFGYIHKGLRLG 150
Query: 63 EQE--------------ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHS------- 101
E ERKL L+L+LDHTL++ + LS+ E L Q +
Sbjct: 151 TTEIDRLRGADLKNLLRERKLVLILDLDHTLINSTKLFDLSAAENELGIQSAAKEVVPDR 210
Query: 102 --FIGSLFQMANDKLVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYF 159
F QM L KLRPFVR FL++AS + ++Y+ TM + YA KLLD D+ YF
Sbjct: 211 SLFTLETMQM----LTKLRPFVRRFLKEASDMFEMYIYTMGDKAYAIEIAKLLDPDNVYF 266
Query: 160 SSRIIAREDFNGKDRKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRD--KE 217
S++I+ D + +K D+V G E VILDDTE VW H ENLI++ +Y YF ++
Sbjct: 267 GSKVISNSDCTQRHQKGLDVVLGDESVAVILDDTEYVWQKHKENLILMERYHYFASSCRQ 326
Query: 218 LNGDHKSYSETLTDESENEEALANVLRVLKTIHRLFFDS------VCGDVRTYLPKVRSE 271
+S SET+ DE EN+ ALA +L VL+ IH +FFD DVR + +VR E
Sbjct: 327 FGFGARSLSETMQDERENDGALATILDVLERIHTIFFDPDDQKPLSSRDVRQVIKRVRQE 386
Query: 272 FSRDV-LYFSAIFR-------DCLW--AEQ------------------------EEKFLV 297
+ L F+ +F +W AEQ + ++ V
Sbjct: 387 VLQGCKLVFTRVFPLHQRQQDQMIWKMAEQLGAVCCTDVDSTVTHVVALDLGTEKARWAV 446
Query: 298 QEKKFLVHPRWIDAYYFLWRRRPEDDY 324
KKFLVHPRWI+A F W+R+ E+D+
Sbjct: 447 SNKKFLVHPRWIEAANFRWQRQQEEDF 473
>gi|218196729|gb|EEC79156.1| hypothetical protein OsI_19829 [Oryza sativa Indica Group]
Length = 574
Score = 185 bits (470), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 129/355 (36%), Positives = 178/355 (50%), Gaps = 71/355 (20%)
Query: 37 CIFCS--QAMNDSFGLSFDYMLRGLRYSEQE--------------ERKLQLVLNLDHTLL 80
C C Q D G++F Y+ +GLR E ERKL L+L+LDHTL+
Sbjct: 149 CYRCGKRQDEEDVPGVAFGYIHKGLRLGTTEIDRLRGADLKNLLRERKLVLILDLDHTLI 208
Query: 81 HCRNIKSLSSGEKYLKKQIHS---------FIGSLFQMANDKLVKLRPFVRTFLEQASSL 131
+ + LS+ E L Q + F QM L KLRPFVR FL++AS +
Sbjct: 209 NSTKLFDLSAAENELGIQSAAKEVVPDRSLFTLETMQM----LTKLRPFVRRFLKEASDM 264
Query: 132 VDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVRGQERGIVILD 191
++Y+ TM + YA KLLD D+ YF S++I+ D + +K D+V G E VILD
Sbjct: 265 FEMYIYTMGDKAYAIEIAKLLDPDNVYFGSKVISNSDCTQRHQKGLDVVLGDESVAVILD 324
Query: 192 DTESVWSDHTENLIVLGKYVYFRD--KELNGDHKSYSETLTDESENEEALANVLRVLKTI 249
DTE VW H ENLI++ +Y YF ++ +S SET+ DE EN+ ALA +L VL+ I
Sbjct: 325 DTEYVWQKHKENLILMERYHYFASSCRQFGFGARSLSETMQDERENDGALATILDVLERI 384
Query: 250 HRLFFDS------VCGDVRTYLPKVRSEFSRDV-LYFSAIFR-------DCLW--AEQ-- 291
H +FFD DVR + +VR E + L F+ +F LW AEQ
Sbjct: 385 HTIFFDPDDQKPLSSRDVRQVIKRVRQEVLQGCKLVFTRVFPLHQRQQDQMLWKMAEQLG 444
Query: 292 ----------------------EEKFLVQEKKFLVHPRWIDAYYFLWRRRPEDDY 324
+ ++ V KKFLVHPRWI+A F W+R+ E+D+
Sbjct: 445 AVCCTDVDSTVTHVVALDLGTEKARWAVSNKKFLVHPRWIEAANFRWQRQQEEDF 499
>gi|242087817|ref|XP_002439741.1| hypothetical protein SORBIDRAFT_09g019310 [Sorghum bicolor]
gi|241945026|gb|EES18171.1| hypothetical protein SORBIDRAFT_09g019310 [Sorghum bicolor]
Length = 547
Score = 184 bits (468), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 123/366 (33%), Positives = 185/366 (50%), Gaps = 64/366 (17%)
Query: 23 QSLSCAHTTVRDSRCIFCSQAMNDSF--GLSFDYMLRGLRYSEQE--------------E 66
Q +C H C C ++ + G++ DY+ +GLR E E
Sbjct: 105 QVEACPHPGYIRGLCYICGNPQDEEYISGVALDYIDKGLRLRTSEIDRLRCADLKNLLRE 164
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIG----SLFQMANDKLV-KLRPFV 121
RKL L+L+LDHTL++ ++++SS EK L Q + S+F + + +L+ KLRPFV
Sbjct: 165 RKLVLILDLDHTLINSTKLQNISSAEKDLGIQTAASKDDPNRSIFALESMQLLTKLRPFV 224
Query: 122 RTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVR 181
R FL++AS++ ++Y+ TM + YA KLLD + YF ++I+ D + +K D++
Sbjct: 225 REFLKEASNMFEMYIYTMGDKAYAIEIAKLLDPSNIYFPLKVISNSDCTKRHQKGLDVIL 284
Query: 182 GQERGIVILDDTESVWSDHTENLIVLGKYVYFRD--KELNGDHKSYSETLTDESENEEAL 239
G VILDDTE VW H ENLI++ +Y +F +E +S SE + DE E++ AL
Sbjct: 285 GAASVAVILDDTEFVWKKHKENLILMERYHFFASSCREFGFAVRSLSELMQDERESDGAL 344
Query: 240 ANVLRVLKTIHRLFFDSVCG-------DVRTYLPKVRSEFSRDV-LYFSAIF-------R 284
A VL VLK IH +FFD DVR + VR E + + FS +F +
Sbjct: 345 ATVLDVLKRIHAIFFDMAVETDDLSSRDVRQVIKAVRKEILQGCKIVFSRVFPNNTRPQK 404
Query: 285 DCLW--------------------------AEQEEKFLVQEKKFLVHPRWIDAYYFLWRR 318
+W ++ ++ V KKFLVHPRWI+A F W R
Sbjct: 405 QMVWKMAEYLGAVCSTDVDSSVTHVVTVDLGTEKARWGVANKKFLVHPRWIEAANFRWHR 464
Query: 319 RPEDDY 324
+PE+D+
Sbjct: 465 QPEEDF 470
>gi|15217916|ref|NP_173457.1| haloacid dehalogenase-like hydrolase [Arabidopsis thaliana]
gi|9558594|gb|AAF88157.1|AC026234_8 Contains similarity to a FCP1 serine phosphatase from Xenopus
laevis gi|6689545 [Arabidopsis thaliana]
gi|332191840|gb|AEE29961.1| haloacid dehalogenase-like hydrolase [Arabidopsis thaliana]
Length = 342
Score = 181 bits (458), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 104/246 (42%), Positives = 143/246 (58%), Gaps = 18/246 (7%)
Query: 24 SLSCAHTTVRDSRCIFCSQAMNDSFGLSFDYMLRGLRYSEQ---------------EERK 68
+L C H VR C C ++ +G +FDY++ GL+ S + ERK
Sbjct: 17 TLICGHFFVRYGICCNCRSTVDRDYGRAFDYLVHGLQLSHKAVAVTKSLTTQLACLNERK 76
Query: 69 LQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQA 128
L LVL+LDHTLLH I LS GEKYL + F L+ + + L+KLRPFV FL++A
Sbjct: 77 LHLVLDLDHTLLHSIMISRLSEGEKYLLGE-SDFREDLWTLDREMLIKLRPFVHEFLKEA 135
Query: 129 SSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVRGQERGIV 188
+ + +Y+ TM R YA+A +K +D YF R+I R++ K DLV E G+V
Sbjct: 136 NEIFSMYVYTMGNRDYAQAVLKWIDPKKVYFGDRVITRDESGFS--KTLDLVLADECGVV 193
Query: 189 ILDDTESVWSDHTENLIVLGKYVYFRDKELNGDHKSYSETLTDESENEEALANVLRVLKT 248
I+DDT VW DH NL+ + KY YFRD + + KSY+E DES N+ +LANVL+VLK
Sbjct: 194 IVDDTRHVWPDHERNLLQITKYSYFRDYSHDKESKSYAEEKRDESRNQGSLANVLKVLKD 253
Query: 249 IHRLFF 254
+H+ FF
Sbjct: 254 VHQEFF 259
>gi|357163276|ref|XP_003579679.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
4-like [Brachypodium distachyon]
Length = 493
Score = 180 bits (456), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 126/353 (35%), Positives = 178/353 (50%), Gaps = 67/353 (18%)
Query: 37 CIFCS--QAMNDSFGLSFDYMLRGLRYSEQE--------------ERKLQLVLNLDHTLL 80
C C Q D G++F Y+ +GLR E ERKL L+L+LDHTL+
Sbjct: 116 CFRCGKRQDEEDVPGVAFGYIHKGLRLGTSEIDRLRGSNVKSLLRERKLVLILDLDHTLI 175
Query: 81 HCRNIKSLSSGEKYLKKQIHSFIG------SLFQM-ANDKLVKLRPFVRTFLEQASSLVD 133
+ + +S+ E+ L I +F SLF + A L KLRPFV FL++AS++ +
Sbjct: 176 NSTKLHDISAAERDLG--IQTFASEDAPEKSLFTLEAMQMLTKLRPFVCKFLKEASNMFE 233
Query: 134 IYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVRGQERGIVILDDT 193
+Y+ TM + YA KLLD + YF S++I+ D + +K D+V G E +ILDDT
Sbjct: 234 MYIYTMGDKAYAIEIAKLLDPGNVYFGSKVISNSDCTQRHQKGLDVVLGAENVAIILDDT 293
Query: 194 ESVWSDHTENLIVLGKYVYFRD--KELNGDHKSYSETLTDESENEEALANVLRVLKTIHR 251
E VW H ENLI++ +Y YF ++ K+ SE++ DE E++ ALA L VLK IH
Sbjct: 294 EYVWQKHKENLILMERYHYFASSCRQFGFSVKALSESMQDERESDGALATTLDVLKRIHT 353
Query: 252 LFFDSVCG------DVRTYLPKVRSEFSRDV-LYFSAIFRDC-------LW--AEQ---- 291
LFFDS DVR + KVR E + + FS +F +W AEQ
Sbjct: 354 LFFDSAVETALSSRDVRQVIKKVRQEVLQGCKVVFSRVFPSSSRPQDQIIWKMAEQLGAI 413
Query: 292 --------------------EEKFLVQEKKFLVHPRWIDAYYFLWRRRPEDDY 324
+ ++ V K LVHPRWI+A F W R+ E+D+
Sbjct: 414 CCADMDSTVTHVVAVDSGTEKARWAVGNNKILVHPRWIEASNFRWHRQQEEDF 466
>gi|224142399|ref|XP_002324546.1| predicted protein [Populus trichocarpa]
gi|222865980|gb|EEF03111.1| predicted protein [Populus trichocarpa]
Length = 312
Score = 178 bits (452), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 109/300 (36%), Positives = 170/300 (56%), Gaps = 42/300 (14%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFI----GSLFQMANDKLV-KLRPFV 121
+KL L+L+LDHTLL+ + ++ E+YL Q S GSLF +++ +++ KLRPFV
Sbjct: 11 KKLYLILDLDHTLLNSTQLMHMTLDEEYLNGQTDSLQDVSKGSLFMLSSMQMMTKLRPFV 70
Query: 122 RTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVR 181
RTFL++AS + ++Y+ TM R YA KLLD +YF++++I+R+D + +K D+V
Sbjct: 71 RTFLKEASQMFEMYIYTMGDRAYALEMAKLLDPGREYFNAKVISRDDGTQRHQKGLDVVL 130
Query: 182 GQERGIVILDDTESVWSDHTENLIVLGKYVYFRDK--ELNGDHKSYSETLTDESENEEAL 239
GQE ++ILDDTE+ W H +NLI++ +Y +F + + KS SE TDESE+E AL
Sbjct: 131 GQESAVLILDDTENAWMKHKDNLILMERYHFFASSCHQFGFNCKSLSEQKTDESESEGAL 190
Query: 240 ANVLRVLKTIHRLFF-DSVCGDVRTYLPKVRSEFSRDV-LYFSAIF-------RDCLW-- 288
A++L+VL+ IH++FF D L VR + + + FS +F LW
Sbjct: 191 ASILKVLRKIHQIFFEDHTLSLALQVLKTVRKDVLKGCKIVFSRVFPTQSQADNHHLWRM 250
Query: 289 AEQ------------------------EEKFLVQEKKFLVHPRWIDAYYFLWRRRPEDDY 324
AEQ + + + KFLV P WI+A + W+R+PE+++
Sbjct: 251 AEQLGATCSTELDPSVTHVVSKDSGTEKSHWASKHNKFLVQPGWIEATNYFWQRQPEENF 310
>gi|357129281|ref|XP_003566293.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
4-like [Brachypodium distachyon]
Length = 492
Score = 176 bits (446), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 119/350 (34%), Positives = 176/350 (50%), Gaps = 62/350 (17%)
Query: 37 CIFCS--QAMNDSFGLSFDYMLRGLRYSEQE--------------ERKLQLVLNLDHTLL 80
CI C Q D G++ Y+ GLR E ERKL L+L+LDHTL+
Sbjct: 117 CIKCGKIQDEEDVPGVACGYIHEGLRLGTSEIERLRGSDLKKLLRERKLVLILDLDHTLI 176
Query: 81 HCRNIKSLSSGEKYLKKQIHSFIG----SLFQMAN-DKLVKLRPFVRTFLEQASSLVDIY 135
+ + +S+ E L Q + SLF + L KLRPFVR FL++AS++ ++Y
Sbjct: 177 NSTRLHDISAAEMDLGIQTAALKDDPDRSLFTLERMHMLTKLRPFVRRFLKEASNMFEMY 236
Query: 136 LCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVRGQERGIVILDDTES 195
+ TM + Y+ KLLD + YF S++I+ D + +K D+V G E VILDDTE
Sbjct: 237 IYTMGDKAYSIEVAKLLDPGNVYFGSKVISNSDCTQRHQKGLDVVLGAESIAVILDDTED 296
Query: 196 VWSDHTENLIVLGKYVYFRD--KELNGDHKSYSETLTDESENEEALANVLRVLKTIHRLF 253
VW H ENLI++ +Y YF ++ +S SE + DE E++ AL+ +L VLK IH +F
Sbjct: 297 VWQKHKENLILMERYHYFASSCRQFGFSVRSLSELMVDERESDGALSTILDVLKRIHTIF 356
Query: 254 FDS-----VCGDVRTYLPKVRSEFSRDV-LYFSAIFRD-------CLW------------ 288
FDS + + +VR E + L FS +F +W
Sbjct: 357 FDSGVETALSSRTLMVIKRVRQEVLQGCKLVFSRVFPSNSCPQDQIIWKMAEKLGASCCA 416
Query: 289 --------------AEQEEKFLVQEKKFLVHPRWIDAYYFLWRRRPEDDY 324
++ ++ V+ KKFL+HPRWI+A + WRR+PE+D+
Sbjct: 417 HVDSTVTHVVAVDVGTEKARWAVENKKFLLHPRWIEASNYRWRRQPEEDF 466
>gi|297850432|ref|XP_002893097.1| hypothetical protein ARALYDRAFT_472260 [Arabidopsis lyrata subsp.
lyrata]
gi|297338939|gb|EFH69356.1| hypothetical protein ARALYDRAFT_472260 [Arabidopsis lyrata subsp.
lyrata]
Length = 281
Score = 176 bits (446), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 103/245 (42%), Positives = 141/245 (57%), Gaps = 19/245 (7%)
Query: 25 LSCAHTTVRDSRCIFCSQAMNDSFGLSFDYMLRGLRYSEQ---------------EERKL 69
L+C H VR C C ++ +G +FDY++ GL+ S + ERKL
Sbjct: 18 LNCGHFFVRYGICCNCRSKVDREYGRAFDYLVHGLQLSHKAVAVTKSLTTQLACLNERKL 77
Query: 70 QLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQAS 129
+VL+LDHTLLH + LS GEKYL ++ L+ + + L+KLRPFV FL +A+
Sbjct: 78 HVVLDLDHTLLHSVMVSRLSEGEKYLLRE-SDLREDLWTLDREMLIKLRPFVHEFLNEAN 136
Query: 130 SLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVRGQERGIVI 189
+Y+ TM R YA+A +KL+D YF R+I R++ K DLV E G+VI
Sbjct: 137 EFFSMYVYTMGNRDYAQAVLKLIDPKKVYFGDRVITRDESGFS--KTLDLVLADECGVVI 194
Query: 190 LDDTESVWSDHTENLIVLGKYVYFRDKELNGDHKSYSETLTDESENEEALANVLRVLKTI 249
+DDT VW DH NL+ + KY YFRD D KSY+E DES ++ +LANVL+VLK I
Sbjct: 195 VDDTRHVWPDHERNLLQITKYSYFRDYN-QEDSKSYAEEKRDESRSQGSLANVLKVLKKI 253
Query: 250 HRLFF 254
H+ FF
Sbjct: 254 HQEFF 258
>gi|297834668|ref|XP_002885216.1| NLI interacting factor family protein [Arabidopsis lyrata subsp.
lyrata]
gi|297331056|gb|EFH61475.1| NLI interacting factor family protein [Arabidopsis lyrata subsp.
lyrata]
Length = 296
Score = 175 bits (443), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 103/246 (41%), Positives = 144/246 (58%), Gaps = 23/246 (9%)
Query: 27 CAHTTVRDSRCIFCSQAMNDSFGLSFDYMLRGLRYSEQ---------------EERKLQL 71
C H VR CI C +N G +FDY+++GL+ S + E+KL L
Sbjct: 29 CGHWYVRYGVCIACKSTVNKRQGRAFDYLVQGLQLSHEAAAFTKRFTTEFYCLNEKKLHL 88
Query: 72 VLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFI-GSLFQMANDKLVKLRPFVRTFLEQASS 130
VL+LDHTLLH + LS E+YL ++ S L+++ D L KLRPFV FL++A+
Sbjct: 89 VLDLDHTLLHSIRVSILSETERYLIEEACSTTREDLWKLDIDYLTKLRPFVHEFLKEANE 148
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVRGQERGIVIL 190
+ +Y+ TM TR YAE+ +KL+D YF R+I R++ K DLV ERG+VI+
Sbjct: 149 MFTMYVYTMGTRVYAESLLKLIDPKRIYFGDRVITRDE--SPYVKTLDLVLADERGVVIV 206
Query: 191 DDTESVWSDHTENLIVLGKYVYFRDKELNG--DHKSYSETLTDESENEEALANVLRVLKT 248
DDT VW+ H NL+ + +Y YFR +NG + KSY+E DES+N LANVL++LK
Sbjct: 207 DDTRDVWTHHKSNLVEINEYHYFR---VNGPEESKSYTEEKRDESKNSGGLANVLKLLKE 263
Query: 249 IHRLFF 254
+H FF
Sbjct: 264 VHYGFF 269
>gi|15229069|ref|NP_188382.1| haloacid dehalogenase-like hydrolase domain-containing protein
[Arabidopsis thaliana]
gi|9294142|dbj|BAB02044.1| unnamed protein product [Arabidopsis thaliana]
gi|332642446|gb|AEE75967.1| haloacid dehalogenase-like hydrolase domain-containing protein
[Arabidopsis thaliana]
Length = 296
Score = 172 bits (436), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 102/246 (41%), Positives = 144/246 (58%), Gaps = 23/246 (9%)
Query: 27 CAHTTVRDSRCIFCSQAMNDSFGLSFDYMLRGLRYSEQ---------------EERKLQL 71
C H VR CI C +N G +FDY+++GL+ S + E+KL L
Sbjct: 29 CGHWYVRYGVCIACKSTVNKRHGRAFDYLVQGLQLSHEAAAFTKRFTTQFYCLNEKKLNL 88
Query: 72 VLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFI-GSLFQMANDKLVKLRPFVRTFLEQASS 130
VL+LDHTLLH + LS EK L ++ S L+++ +D L KLRPFV FL++A+
Sbjct: 89 VLDLDHTLLHSIRVSLLSETEKCLIEEACSTTREDLWKLDSDYLTKLRPFVHEFLKEANE 148
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVRGQERGIVIL 190
L +Y+ TM TR YAE+ +KL+D YF R+I R++ K DLV +ERG+VI+
Sbjct: 149 LFTMYVYTMGTRVYAESLLKLIDPKRIYFGDRVITRDE--SPYVKTLDLVLAEERGVVIV 206
Query: 191 DDTESVWSDHTENLIVLGKYVYFRDKELNG--DHKSYSETLTDESENEEALANVLRVLKT 248
DDT VW+ H NL+ + +Y +FR +NG + SY+E DES+N LANVL++LK
Sbjct: 207 DDTSDVWTHHKSNLVEINEYHFFR---VNGPEESNSYTEEKRDESKNNGGLANVLKLLKE 263
Query: 249 IHRLFF 254
+H FF
Sbjct: 264 VHYGFF 269
>gi|297834870|ref|XP_002885317.1| NLI interacting factor family protein [Arabidopsis lyrata subsp.
lyrata]
gi|297331157|gb|EFH61576.1| NLI interacting factor family protein [Arabidopsis lyrata subsp.
lyrata]
Length = 592
Score = 171 bits (432), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 100/253 (39%), Positives = 139/253 (54%), Gaps = 33/253 (13%)
Query: 27 CAHTTVRDSRCIFCSQAMNDSFGLSFDYMLRGLRYSEQ---------------EERKLQL 71
C H V CI C +N S G +FDY+ GL+ S + ++KL L
Sbjct: 334 CGHWYVFHGICIACKSTVNKSQGRAFDYIFNGLQLSHEAVALTKCFTTKFSCLNDKKLHL 393
Query: 72 VLNLDHTLLHCRNIKSLSSGEKYLKKQIHSF----------IGSLFQMANDKLVKLRPFV 121
VL+LDHTLLH + SLS EKYL ++ S IG + L KLRPFV
Sbjct: 394 VLDLDHTLLHTVMVPSLSQAEKYLLEEAGSATREDLWKIKAIGDPMEF----LTKLRPFV 449
Query: 122 RTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVR 181
R FL++A+ + +Y+ T +R YA+ ++L+D YF R+I + + K DLV
Sbjct: 450 REFLKEANQMFTMYVYTKGSRGYAKQVLELIDPKKLYFEDRVITKNE--SPHMKTLDLVL 507
Query: 182 GQERGIVILDDTESVWSDHTENLIVLGKYVYFRDKELNGDHKSYSETLTDESENEEALAN 241
+ERG+VI+DD +VW DH NL+ + KY YFR K + YSE +TDESE++ LAN
Sbjct: 508 AEERGVVIVDDMRTVWPDHKSNLVDISKYTYFRLK--GQESMPYSEEMTDESESDGGLAN 565
Query: 242 VLRVLKTIHRLFF 254
VL++LK +H FF
Sbjct: 566 VLKLLKEVHSRFF 578
Score = 160 bits (406), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 100/256 (39%), Positives = 138/256 (53%), Gaps = 35/256 (13%)
Query: 26 SCAHTTVRDSRCIFCSQAMNDSF-GLSFDYMLRGLRYSEQ---------------EERKL 69
+C H +R CI C ++ + G FD GL+ S + +KL
Sbjct: 34 NCGHWYIRHGVCIVCKSTVDKNIQGRVFD----GLQLSSEALALTKRLTTKFSCLNMKKL 89
Query: 70 QLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFI----------GSLFQMANDKLVKLRP 119
LVL+LDHTLLH ++ LS EKYL ++ S G + + L KLRP
Sbjct: 90 HLVLDLDHTLLHSVRVQFLSEAEKYLIEEAGSTTREDLWKMKVKGDPIPITIEYLTKLRP 149
Query: 120 FVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDL 179
F+R FL++A+ L +Y+ T TR YA+A +KL+D YF R+I R + K DL
Sbjct: 150 FLREFLKEANKLFTMYVYTKGTRRYAKAILKLIDPKKLYFGHRVITRNE--SPHTKTLDL 207
Query: 180 VRGQERGIVILDDTESVWSDHTENLIVLGKYVYFR-DKELNGDHKSYSETLTDESENEEA 238
V ERG+VI+DDT ++W +H NL+V+GKY YFR + + H E TDESEN
Sbjct: 208 VLADERGVVIVDDTRNIWPNHKSNLVVIGKYKYFRFEGRVLKPHS--EEKTTDESENNGG 265
Query: 239 LANVLRVLKTIHRLFF 254
LANVL++LK +HR FF
Sbjct: 266 LANVLKLLKEVHRKFF 281
>gi|15224433|ref|NP_178570.1| haloacid dehalogenase-like hydrolase domain-containing protein
[Arabidopsis thaliana]
gi|4585924|gb|AAD25584.1| hypothetical protein [Arabidopsis thaliana]
gi|330250795|gb|AEC05889.1| haloacid dehalogenase-like hydrolase domain-containing protein
[Arabidopsis thaliana]
Length = 277
Score = 170 bits (431), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 99/253 (39%), Positives = 144/253 (56%), Gaps = 33/253 (13%)
Query: 27 CAHTTVRDSRCIFCSQAMNDSFGLSFDYMLRGLRYSEQ---------------EERKLQL 71
C H V CI C ++ S FDY+ +GL+ S + E+KL L
Sbjct: 10 CGHWYVFQGICIGCKSKVHKSQFRKFDYIFKGLQLSNEAVALTKSLTTKHSCLNEKKLHL 69
Query: 72 VLNLDHTLLHCRNIKSLSSGEKYLKKQIHS----------FIGSLFQMANDKLVKLRPFV 121
VL+LDHTLLH + + +LS E+YL ++ S IG D+L+KLRPFV
Sbjct: 70 VLDLDHTLLHSKLVSNLSQAERYLIQEASSRTREDLWKFRPIGHPI----DRLIKLRPFV 125
Query: 122 RTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVR 181
R FL++A+ + +++ TM +R YA+A ++++D YF +R+I +++ K +LV
Sbjct: 126 RDFLKEANEMFTMFVYTMGSRIYAKAILEMIDPKKLYFGNRVITKDE--SPRMKTLNLVL 183
Query: 182 GQERGIVILDDTESVWSDHTENLIVLGKYVYFRDKELNGDHKSYSETLTDESENEEALAN 241
+ERG+VI+DDT +W H NLI + KY YFR L D SYSE TDE EN+ LAN
Sbjct: 184 AEERGVVIVDDTRDIWPHHKNNLIQIRKYKYFRRSGL--DSNSYSEKKTDEGENDGGLAN 241
Query: 242 VLRVLKTIHRLFF 254
VL++L+ +HR FF
Sbjct: 242 VLKLLREVHRRFF 254
>gi|357450477|ref|XP_003595515.1| RNA polymerase II C-terminal domain phosphatase-like protein
[Medicago truncatula]
gi|355484563|gb|AES65766.1| RNA polymerase II C-terminal domain phosphatase-like protein
[Medicago truncatula]
Length = 382
Score = 170 bits (430), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 122/372 (32%), Positives = 186/372 (50%), Gaps = 73/372 (19%)
Query: 27 CAHTTVRDSRCIFCSQAMNDSFGLSFDYM----------------LRGLRYSEQE----- 65
C H + CI C Q ++ GL+F Y+ +GLR E+E
Sbjct: 6 CRHPGSFECLCIRCGQKIDGDSGLTFGYIHKKLGRTPRWSILFLYAQGLRLHEEEISRVR 65
Query: 66 ---------ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSF----IGSLFQMAN- 111
RKL LVL+LDHTLL+ ++ LS E +LK S G LF + +
Sbjct: 66 SLHTRNLLNRRKLCLVLDLDHTLLNTTSLHRLSPEEMHLKTCTDSLEDIARGRLFVLEHR 125
Query: 112 DKLVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNG 171
++ KLRPFVRTFL++AS + ++Y+ TM R Y+ +LLD K+F ++I+R+D
Sbjct: 126 QRMAKLRPFVRTFLKEASKMFEMYIYTMGDRRYSLEMARLLDPQGKFFKDKVISRDDGTE 185
Query: 172 KDRKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRD--KELNGDHKSYSETL 229
K+ +LV G E I+ILDD + VW H +NLI++ +Y +F +E + + KS +E
Sbjct: 186 MKEKDLNLVLGTESSILILDDNKKVWRMHKDNLILMERYHFFNSSCQEFDLNCKSLAELH 245
Query: 230 TDESENEEALANVLRVLKTIHRLFFDSVCG-----DVRTYLPKVRSE-FSRDVLYFSAIF 283
DE+E + ALA +L+VL+ I+ FFD + G DVR L +R E S ++ FS F
Sbjct: 246 IDENETDGALARILKVLRHINSKFFDELQGDLVDRDVRQVLSSLRGEVLSGCIIVFSCAF 305
Query: 284 RD----------------CL--------------WAEQEEKFLVQEKKFLVHPRWIDAYY 313
CL +E + +E KFLV+ RW++A
Sbjct: 306 NGHDLRKLRRIAERLGATCLTELGPTVTHAVANELVTEESMWAEKENKFLVNRRWLEASN 365
Query: 314 FLWRRRPEDDYL 325
F +++PE++Y+
Sbjct: 366 FFLQKQPEENYI 377
>gi|359494894|ref|XP_003634864.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
4-like [Vitis vinifera]
Length = 278
Score = 169 bits (428), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 102/264 (38%), Positives = 150/264 (56%), Gaps = 43/264 (16%)
Query: 104 GSLFQMAN-DKLVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSR 162
G+LF + L KLRP+V TFL++AS + ++Y+ TM R YA KLLD + YFSSR
Sbjct: 7 GNLFMLNTMHMLTKLRPYVHTFLKEASKMFEMYIYTMGERSYALEMAKLLDPERVYFSSR 66
Query: 163 IIAREDFNGKDRKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRD--KELNG 220
+I++ D + +K D+V GQE ++ILDDTESVW H +NLI++ +Y +F ++
Sbjct: 67 VISQADCTQRHQKGLDVVLGQESAVLILDDTESVWQKHKDNLILMERYHFFASSCRQFGF 126
Query: 221 DHKSYSETLTDESENEEALANVLRVLKTIHRLFFDSVCG------DVRTYLPKVRSEFSR 274
+ KS SE +DESE + ALA VL+VL+ IH +FFD G DVR + +VR E +
Sbjct: 127 NCKSLSELKSDESEPDGALATVLKVLQRIHSMFFDPELGDDFSGRDVRQVVKRVRKEVLK 186
Query: 275 DV-LYFSAIF-------RDCLW--AEQ------------------------EEKFLVQEK 300
+ FS +F LW AEQ + ++ +QEK
Sbjct: 187 GCKIVFSRVFPTRFQAENHHLWRMAEQLGATCATELDPSVTHVVSTDAGTEKSRWALQEK 246
Query: 301 KFLVHPRWIDAYYFLWRRRPEDDY 324
KFLVHP WI+A + W+++PE+++
Sbjct: 247 KFLVHPGWIEAANYFWQKQPEENF 270
>gi|359497210|ref|XP_003635453.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
4-like [Vitis vinifera]
Length = 278
Score = 167 bits (424), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 101/264 (38%), Positives = 150/264 (56%), Gaps = 43/264 (16%)
Query: 104 GSLFQMAN-DKLVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSR 162
G+LF + L KLRP+V TFL++AS + ++Y+ TM R YA KLLD + YFSSR
Sbjct: 7 GNLFMLNTMHMLTKLRPYVHTFLKEASKMFEMYIYTMGERSYALEMAKLLDPERVYFSSR 66
Query: 163 IIAREDFNGKDRKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRD--KELNG 220
+I++ D + +K D+V GQE ++ILDDTESVW H +NLI++ +Y +F ++
Sbjct: 67 VISQADCTQRHQKGLDVVLGQESAVLILDDTESVWQKHKDNLILMERYHFFASSCRQFGF 126
Query: 221 DHKSYSETLTDESENEEALANVLRVLKTIHRLFFDSVCG------DVRTYLPKVRSEFSR 274
+ KS SE +DESE + ALA VL+VL+ IH +FFD G DVR + +VR + +
Sbjct: 127 NCKSLSELKSDESEPDGALATVLKVLQRIHSMFFDPELGDDFSGRDVRQVVKRVRKDVLK 186
Query: 275 DV-LYFSAIF-------RDCLW--AEQ------------------------EEKFLVQEK 300
+ FS +F LW AEQ + ++ +QEK
Sbjct: 187 GCKIVFSRVFPTRFQAENHHLWRMAEQLGATCATELDPSVTHVVSTDAGTEKSRWALQEK 246
Query: 301 KFLVHPRWIDAYYFLWRRRPEDDY 324
KFLVHP WI+A + W+++PE+++
Sbjct: 247 KFLVHPGWIEAANYFWQKQPEENF 270
>gi|296090640|emb|CBI41034.3| unnamed protein product [Vitis vinifera]
Length = 264
Score = 167 bits (424), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 99/253 (39%), Positives = 145/253 (57%), Gaps = 42/253 (16%)
Query: 114 LVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKD 173
L KLRP+V TFL++AS + ++Y+ TM R YA KLLD + YFSSR+I++ D +
Sbjct: 4 LTKLRPYVHTFLKEASKMFEMYIYTMGERSYALEMAKLLDPERVYFSSRVISQADCTQRH 63
Query: 174 RKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRD--KELNGDHKSYSETLTD 231
+K D+V GQE ++ILDDTESVW H +NLI++ +Y +F ++ + KS SE +D
Sbjct: 64 QKGLDVVLGQESAVLILDDTESVWQKHKDNLILMERYHFFASSCRQFGFNCKSLSELKSD 123
Query: 232 ESENEEALANVLRVLKTIHRLFFDSVCG------DVRTYLPKVRSEFSRDV-LYFSAIF- 283
ESE + ALA VL+VL+ IH +FFD G DVR + +VR E + + FS +F
Sbjct: 124 ESEPDGALATVLKVLQRIHSMFFDPELGDDFSGRDVRQVVKRVRKEVLKGCKIVFSRVFP 183
Query: 284 ------RDCLW--AEQ------------------------EEKFLVQEKKFLVHPRWIDA 311
LW AEQ + ++ +QEKKFLVHP WI+A
Sbjct: 184 TRFQAENHHLWRMAEQLGATCATELDPSVTHVVSTDAGTEKSRWALQEKKFLVHPGWIEA 243
Query: 312 YYFLWRRRPEDDY 324
+ W+++PE+++
Sbjct: 244 ANYFWQKQPEENF 256
>gi|357501219|ref|XP_003620898.1| RNA polymerase II C-terminal domain phosphatase-like protein
[Medicago truncatula]
gi|355495913|gb|AES77116.1| RNA polymerase II C-terminal domain phosphatase-like protein
[Medicago truncatula]
Length = 720
Score = 167 bits (424), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 113/301 (37%), Positives = 165/301 (54%), Gaps = 42/301 (13%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSF----IGSLFQMANDK-LVKLRPFV 121
RKL LVL+LDHTLL+ ++ LS E +LK S GSLF + + + + KLRPFV
Sbjct: 215 RKLCLVLDLDHTLLNTTSLHRLSPEEMHLKTHTDSLEDISKGSLFMLEHVQVMTKLRPFV 274
Query: 122 RTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVR 181
RTFL++AS + ++Y+ TM R Y+ +LLD +YF ++I+R+D K+ K+ DLV
Sbjct: 275 RTFLKEASEMFEMYIYTMGDRQYSLEMARLLDPQGEYFKDKVISRDDGTQKNVKDLDLVL 334
Query: 182 GQERGIVILDDTESVWSDHTENLIVLGKYVYFRD--KELNGDHKSYSETLTDESENEEAL 239
G E IVILDD E VW + +NLI++ +Y +F ++ KS + DE+E + AL
Sbjct: 335 GTENSIVILDDKEEVWPKYRDNLILMERYHFFNSSCQDFGLQCKSLAALNIDENEIDGAL 394
Query: 240 ANVLRVLKTIHRLFFDSVCG-----DVRTYLPKVRSEFSRD-VLYFSAIFRD-------- 285
A +L VL+ I+ FFD + G DVR L R E R V+ FS F
Sbjct: 395 AKILEVLRQINYKFFDELQGDLVDRDVRQVLSSFRGEVLRGCVIVFSLNFHGDLRILRRI 454
Query: 286 -------CL--------------WAEQEEKFLVQEKKFLVHPRWIDAYYFLWRRRPEDDY 324
CL + +E ++ VQEKKFLV RW++A F +++PE+++
Sbjct: 455 AERLGATCLKKLDPTVTHVIGTDFVTKESRWAVQEKKFLVSRRWLEAANFFLQKQPEENF 514
Query: 325 L 325
L
Sbjct: 515 L 515
>gi|186510238|ref|NP_001118664.1| haloacid dehalogenase-like hydrolase domain-containing protein
[Arabidopsis thaliana]
gi|9294424|dbj|BAB02544.1| unnamed protein product [Arabidopsis thaliana]
gi|332642743|gb|AEE76264.1| haloacid dehalogenase-like hydrolase domain-containing protein
[Arabidopsis thaliana]
Length = 307
Score = 167 bits (423), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 104/276 (37%), Positives = 147/276 (53%), Gaps = 37/276 (13%)
Query: 27 CAHTTVRDSRCIFCSQAMNDSFGLSFDYMLRGLRYSEQ---------------EERKLQL 71
C H + CI C + S G +FDY+ GL+ S + E+KL L
Sbjct: 35 CGHWYICHGICIGCKSTVKKSQGRAFDYIFDGLQLSHEAVALTKCFTTKLSCLNEKKLHL 94
Query: 72 VLNLDHTLLHCRNIKSLSSGEKYLKKQIHSF----------IGSLFQMANDKLVKLRPFV 121
VL+LDHTLLH + SLS EKYL ++ S +G + L KLRPF+
Sbjct: 95 VLDLDHTLLHTVMVPSLSQAEKYLIEEAGSATRDDLWKIKAVGDPMEF----LTKLRPFL 150
Query: 122 RTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVR 181
R FL++A+ +Y+ T +R YA+ ++L+D YF R+I + + K D V
Sbjct: 151 RDFLKEANEFFTMYVYTKGSRVYAKQVLELIDPKKLYFGDRVITKTE--SPHMKTLDFVL 208
Query: 182 GQERGIVILDDTESVWSDHTENLIVLGKYVYFRDKELNGDHKSYSETLTDESENEEALAN 241
+ERG+VI+DDT +VW DH NL+ + KY YFR K D YSE TDESE+E LAN
Sbjct: 209 AEERGVVIVDDTRNVWPDHKSNLVDISKYSYFRLK--GQDSMPYSEEKTDESESEGGLAN 266
Query: 242 VLRVLKTIHRLFF----DSVCGDVRTYLPKVRSEFS 273
VL++LK +H+ FF + DVR+ L ++ E +
Sbjct: 267 VLKLLKEVHQRFFRVEEELESKDVRSLLQEIDFELN 302
>gi|255540901|ref|XP_002511515.1| RNA polymerase II ctd phosphatase, putative [Ricinus communis]
gi|223550630|gb|EEF52117.1| RNA polymerase II ctd phosphatase, putative [Ricinus communis]
Length = 405
Score = 167 bits (423), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 100/249 (40%), Positives = 142/249 (57%), Gaps = 24/249 (9%)
Query: 26 SCAHTTVRDSRCIFCSQAMNDSFGLSFDYMLRGLRYSEQE--------------ERKLQL 71
+C+H V C C Q M++ +GL FDY++ GLR SE + ++KL L
Sbjct: 42 TCSHPLVMKLVCTTCGQKMSNFYGLPFDYIMGGLRLSETKADWTRDAETDFVLSKKKLFL 101
Query: 72 VLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMA-----NDKLVKLRPFVRTFLE 126
VL+LD TLLH + L+ E YLK Q+ S + +F++ + KLRPFVR FL+
Sbjct: 102 VLDLDQTLLH--STVDLTPEENYLKNQMDS-LQDIFKLITREGFSPSYAKLRPFVRNFLQ 158
Query: 127 QASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVRGQERG 186
+AS++ +Y+ T + + YA V LLD D+ YF SR+I RED +KN D+V GQER
Sbjct: 159 EASTMFKMYVYTNANKSYARKMVNLLDPDNIYFKSRLITREDSTVSCQKNLDVVMGQERA 218
Query: 187 IVILDDTESVWSDHTENLIVLGKYVYFRDKELNGDHKSYSETLTDESENEEALANVLRVL 246
+VILDD VW H +NLI + +Y YF + KS+++ DES + +A L +L
Sbjct: 219 VVILDDRTDVWPMHKDNLIQVQRYKYFASTANWSNSKSFAQREVDES--TDIMATYLEIL 276
Query: 247 KTIHRLFFD 255
K IH FFD
Sbjct: 277 KKIHSQFFD 285
>gi|225194907|gb|ACN81954.1| C-terminal domain phosphatase-like 5 [Arabidopsis thaliana]
Length = 601
Score = 167 bits (422), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 104/276 (37%), Positives = 147/276 (53%), Gaps = 37/276 (13%)
Query: 27 CAHTTVRDSRCIFCSQAMNDSFGLSFDYMLRGLRYSEQ---------------EERKLQL 71
C H + CI C + S G +FDY+ GL+ S + E+KL L
Sbjct: 329 CGHWYICHGICIGCKSTVKKSQGRAFDYIFDGLQLSHEAVALTKCFTTKLSCLNEKKLHL 388
Query: 72 VLNLDHTLLHCRNIKSLSSGEKYLKKQIHSF----------IGSLFQMANDKLVKLRPFV 121
VL+LDHTLLH + SLS EKYL ++ S +G + L KLRPF+
Sbjct: 389 VLDLDHTLLHTVMVPSLSQAEKYLIEEAGSATRDDLWKIKAVGDPMEF----LTKLRPFL 444
Query: 122 RTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVR 181
R FL++A+ +Y+ T +R YA+ ++L+D YF R+I + + K D V
Sbjct: 445 RDFLKEANEFFTMYVYTKGSRVYAKQVLELIDPKKLYFGDRVITKTE--SPHMKTLDFVL 502
Query: 182 GQERGIVILDDTESVWSDHTENLIVLGKYVYFRDKELNGDHKSYSETLTDESENEEALAN 241
+ERG+VI+DDT +VW DH NL+ + KY YFR K D YSE TDESE+E LAN
Sbjct: 503 AEERGVVIVDDTRNVWPDHKSNLVDISKYSYFRLK--GQDSMPYSEEKTDESESEGGLAN 560
Query: 242 VLRVLKTIHRLFF----DSVCGDVRTYLPKVRSEFS 273
VL++LK +H+ FF + DVR+ L ++ E +
Sbjct: 561 VLKLLKEVHQRFFRVEEELESKDVRSLLQEIDFELN 596
Score = 149 bits (376), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 95/255 (37%), Positives = 134/255 (52%), Gaps = 34/255 (13%)
Query: 26 SCAHTTVRDSRCIFCSQAMNDSF-GLSFDYMLRGLRYSEQ---------------EERKL 69
+C H +R CI C ++ + G FD GL S + +KL
Sbjct: 34 NCGHWYIRYGFCIVCKSTVDKTIEGRVFD----GLHLSSEALALTKRLITKFSCLNMKKL 89
Query: 70 QLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFI----------GSLFQMANDKLVKLRP 119
LVL+LD TL+H + LS EKYL ++ S G + + LVKLRP
Sbjct: 90 HLVLDLDLTLIHSVRVPCLSEAEKYLIEEAGSTTREDLWKMKVRGDPISITIEHLVKLRP 149
Query: 120 FVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDL 179
F+ FL++A+ + +Y+ T TR YAEA +KL+D YF R+I R + K D+
Sbjct: 150 FLCEFLKEANEMFTMYVYTKGTRPYAEAILKLIDPKKLYFGHRVITRNE--SPHTKTLDM 207
Query: 180 VRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRDKELNGDHKSYSETLTDESENEEAL 239
V ERG+VI+DDT W ++ NL+++G+Y YFR + + K +SE TDESEN L
Sbjct: 208 VLADERGVVIVDDTRKAWPNNKSNLVLIGRYNYFRSQ--SRVLKPHSEEKTDESENNGGL 265
Query: 240 ANVLRVLKTIHRLFF 254
ANVL++LK IH FF
Sbjct: 266 ANVLKLLKGIHHKFF 280
>gi|296088193|emb|CBI35709.3| unnamed protein product [Vitis vinifera]
Length = 638
Score = 167 bits (422), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 98/253 (38%), Positives = 145/253 (57%), Gaps = 42/253 (16%)
Query: 114 LVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKD 173
L KLRP+V TFL++AS + ++Y+ TM R YA KLLD + YFSSR+I++ D +
Sbjct: 4 LTKLRPYVHTFLKEASKMFEMYIYTMGERSYALEMAKLLDPERVYFSSRVISQADCTQRH 63
Query: 174 RKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRD--KELNGDHKSYSETLTD 231
+K D+V GQE ++ILDDTESVW H +NLI++ +Y +F ++ + KS SE +D
Sbjct: 64 QKGLDVVLGQESAVLILDDTESVWQKHKDNLILMERYHFFASSCRQFGFNCKSLSELKSD 123
Query: 232 ESENEEALANVLRVLKTIHRLFFDSVCG------DVRTYLPKVRSEFSRDV-LYFSAIF- 283
ESE + ALA VL+VL+ IH +FFD G DVR + +VR + + + FS +F
Sbjct: 124 ESEPDGALATVLKVLQRIHSMFFDPELGDDFSGRDVRQVVKRVRKDVLKGCKIVFSRVFP 183
Query: 284 ------RDCLW--AEQ------------------------EEKFLVQEKKFLVHPRWIDA 311
LW AEQ + ++ +QEKKFLVHP WI+A
Sbjct: 184 TRFQAENHHLWRMAEQLGATCATELDPSVTHVVSTDAGTEKSRWALQEKKFLVHPGWIEA 243
Query: 312 YYFLWRRRPEDDY 324
+ W+++PE+++
Sbjct: 244 ANYFWQKQPEENF 256
>gi|242063380|ref|XP_002452979.1| hypothetical protein SORBIDRAFT_04g035920 [Sorghum bicolor]
gi|241932810|gb|EES05955.1| hypothetical protein SORBIDRAFT_04g035920 [Sorghum bicolor]
Length = 518
Score = 164 bits (416), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 111/305 (36%), Positives = 157/305 (51%), Gaps = 47/305 (15%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHS---FIGSLFQMANDK---LVKLRPF 120
RKL L+L+LDHTLL+ + LS E+ H+ LF++ + L KLRPF
Sbjct: 206 RKLTLILDLDHTLLNSTGLDDLSPAEQANGLTRHTKGDPTAGLFRLGRARFRMLTKLRPF 265
Query: 121 VRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLV 180
R FLEQAS++ ++ + T+ R YA A VKLLD D YF R+++ ++ +DRK+ D+V
Sbjct: 266 ARGFLEQASAMFEMSVYTLGDRGYARAVVKLLDPDGAYFGGRVVSSDESTRRDRKSLDVV 325
Query: 181 RGQE-RGIVILDDTESVWSDHTENLIVLGKYVYFRD--KELNGDHKSYSETLTDESENEE 237
G E +VILDD+ VW +H ENLIV+ +Y+YF D + S +E DE E++
Sbjct: 326 PGAEAAAVVILDDSSHVWPEHQENLIVMDRYLYFADSCRTYGCGVSSLAELRRDEREHDG 385
Query: 238 ALANVLRVLKTIHRLFFDSVCG----DVRTYLPKVRSE--------FSRDVLYFSAIFRD 285
ALA L+VL +H+ FFDSV G DVR + VRSE FSR +
Sbjct: 386 ALAVALQVLTRVHQGFFDSVLGGRFSDVREVIRAVRSEVLRGCTVAFSRVIPLEGVAGDH 445
Query: 286 CLW--AEQ------------------------EEKFLVQEKKFLVHPRWIDAYYFLWRRR 319
+W AEQ + ++ KFLV+P+WI A W R
Sbjct: 446 PMWKLAEQLGAVCTADADATVTHVVALDPGTDKARWARDNCKFLVNPKWIMAASIRWCRP 505
Query: 320 PEDDY 324
E ++
Sbjct: 506 CEQEF 510
>gi|297835808|ref|XP_002885786.1| hypothetical protein ARALYDRAFT_899317 [Arabidopsis lyrata subsp.
lyrata]
gi|297331626|gb|EFH62045.1| hypothetical protein ARALYDRAFT_899317 [Arabidopsis lyrata subsp.
lyrata]
Length = 285
Score = 161 bits (407), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 97/249 (38%), Positives = 137/249 (55%), Gaps = 25/249 (10%)
Query: 27 CAHTTVRDSRCIFCSQAMNDSFGLSFDYMLRGLRYSEQ---------------EERKLQL 71
C H CI C ++ S +FDY+ GL+ S + E+KL L
Sbjct: 10 CGHWYGFHGVCIGCKSIVHKSQWRAFDYIFNGLQLSHEAVALTKSRTTNNSCLNEKKLHL 69
Query: 72 VLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGS------LFQMANDKLVKLRPFVRTFL 125
VL+LDHTLLH + + LS E YL ++ S L D+L+KLRPFVR FL
Sbjct: 70 VLDLDHTLLHMKKVPCLSRAEMYLIQEACSVTREDIWKIRLLGDPIDRLIKLRPFVRDFL 129
Query: 126 EQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVRGQER 185
++A+ + +Y+ T TR YA+A ++L+D + YF R+I +++ +K DLV +ER
Sbjct: 130 KEANEMFTMYVYTKGTRKYAKAVLELIDPNRLYFGDRVITKDE--SPHQKTLDLVLAEER 187
Query: 186 GIVILDDTESVWSDHTENLIVLGKYVYFRDKELNGDHKSYSETLTDESENEEALANVLRV 245
G+VI+DD +W H NLI + KY YFR + SYSE TDESE + LANVL++
Sbjct: 188 GVVIVDDRRDIWPHHKSNLIEISKYKYFRVSGQGSN--SYSEKKTDESEKDGGLANVLKL 245
Query: 246 LKTIHRLFF 254
LK +H FF
Sbjct: 246 LKQVHCRFF 254
>gi|218196728|gb|EEC79155.1| hypothetical protein OsI_19828 [Oryza sativa Indica Group]
Length = 430
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 98/253 (38%), Positives = 137/253 (54%), Gaps = 42/253 (16%)
Query: 114 LVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKD 173
L KLRPFVR FL++AS + ++Y+ TM + YA KLLD D+ YF S++I+ D +
Sbjct: 4 LTKLRPFVRRFLKEASDMFEMYIYTMGDKAYAIEIAKLLDPDNVYFGSKVISNSDCTQRH 63
Query: 174 RKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRD--KELNGDHKSYSETLTD 231
+K D+V G E VILDDTE VW H ENLI++ +Y YF ++ +S SET+ D
Sbjct: 64 QKGLDVVLGDESVAVILDDTEYVWQKHKENLILMERYHYFASSCRQFGFGARSLSETMQD 123
Query: 232 ESENEEALANVLRVLKTIHRLFFDS------VCGDVRTYLPKVRSEFSRDV-LYFSAIFR 284
E EN+ ALA +L VL+ IH +FFD DVR + +VR E + L F+ +F
Sbjct: 124 ERENDGALATILDVLERIHTIFFDPDDQKPLSSRDVRQVIKRVRQEVLQGCKLVFTRVFP 183
Query: 285 -------DCLW--AEQ------------------------EEKFLVQEKKFLVHPRWIDA 311
LW AEQ + ++ + KKFLVHPRWI+A
Sbjct: 184 LHQRPQDQMLWKMAEQLGAVCCTDVDSTVTHVVALDLGTEKARWAISNKKFLVHPRWIEA 243
Query: 312 YYFLWRRRPEDDY 324
F W+R+ E+D+
Sbjct: 244 ANFRWQRQQEEDF 256
>gi|224142401|ref|XP_002324547.1| predicted protein [Populus trichocarpa]
gi|222865981|gb|EEF03112.1| predicted protein [Populus trichocarpa]
Length = 266
Score = 158 bits (400), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 92/252 (36%), Positives = 143/252 (56%), Gaps = 41/252 (16%)
Query: 114 LVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKD 173
+ KLRPFVRTFL++AS + ++Y+ TM R YA KLLD +YF++++I+R+D +
Sbjct: 8 MTKLRPFVRTFLKEASQMFEMYIYTMGDRAYALEMAKLLDPGREYFNAKVISRDDGTQRH 67
Query: 174 RKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRDK--ELNGDHKSYSETLTD 231
+K D+V GQE ++ILDDTE+ W H +NLI++ +Y +F + + KS SE TD
Sbjct: 68 QKGLDVVLGQESAVLILDDTENAWMKHKDNLILMERYHFFASSCHQFGFNCKSLSEQKTD 127
Query: 232 ESENEEALANVLRVLKTIHRLFFDSV-----CGDVRTYLPKVRSEFSRDV-LYFSAIF-- 283
ESE+E ALA++L+VL+ IH++FF+ + DVR L VR + + + FS +F
Sbjct: 128 ESESEGALASILKVLRKIHQIFFEELEENMDGRDVRQVLKTVRKDVLKGCKIVFSRVFPT 187
Query: 284 -----RDCLW--AEQ------------------------EEKFLVQEKKFLVHPRWIDAY 312
LW AEQ + + ++ KFLV P WI+A
Sbjct: 188 QSQADNHHLWRMAEQLGATCSTELDPSVTHVVSKDSGTEKSHWALKHNKFLVQPGWIEAA 247
Query: 313 YFLWRRRPEDDY 324
+ W+R+PE+++
Sbjct: 248 NYFWQRQPEENF 259
>gi|9294260|dbj|BAB02162.1| unnamed protein product [Arabidopsis thaliana]
Length = 288
Score = 158 bits (400), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 94/251 (37%), Positives = 134/251 (53%), Gaps = 21/251 (8%)
Query: 26 SCAHTTVRDSRCIFCSQAMNDSFGLSFDYMLRGLRYSEQE---------------ERKLQ 70
+C+H VR C C + ++ G F Y+ GLR S + +KL
Sbjct: 19 NCSHLFVRHGICFACKKKVSCVHGREFGYLFSGLRLSHEAVSFTKHLTTLVSVYGRKKLH 78
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
LVL+LDHTL+H +LS EKYL K+ S + N++LVK RPFV FL++A+
Sbjct: 79 LVLDLDHTLIHSMKTSNLSKAEKYLIKEEKSGSRKDLRKYNNRLVKFRPFVEEFLKEANK 138
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVRGQERGIVIL 190
L + T Y +A V+++D + YF RII R++ D K DLV ERGIVI+
Sbjct: 139 LFTMTAYTKGGSTYGQAVVRMIDPNKIYFGDRIITRKE--SPDLKTLDLVLADERGIVIV 196
Query: 191 DDTESVWSDHTENLIVLGKYVYFRD--KELNGDHKSYSETLTDESENEEALANVLRVLKT 248
D+T +VW H NL+ + Y YF++ K + SY+E +DES + AL N+L+ LK
Sbjct: 197 DNTPNVWPHHKRNLLEITSYFYFKNDGKNMMRSRLSYAERKSDESRTKRALVNLLKFLKE 256
Query: 249 IHRLFFDSVCG 259
+H FF CG
Sbjct: 257 VHNGFF--TCG 265
>gi|326510557|dbj|BAJ87495.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 384
Score = 156 bits (395), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 98/244 (40%), Positives = 137/244 (56%), Gaps = 23/244 (9%)
Query: 37 CIFCS--QAMNDSFGLSFDYMLRGLRYSEQE--------------ERKLQLVLNLDHTLL 80
C C Q D G++F Y+ +GLR E ERKL L+L+LDHTL+
Sbjct: 117 CFRCGKRQDEEDVPGVAFGYVHKGLRLGTSEIDRLRGSDLKNLLRERKLILILDLDHTLI 176
Query: 81 HCRNIKSLSSGEKYLKKQIHSFI----GSLFQMAN-DKLVKLRPFVRTFLEQASSLVDIY 135
+ + +S+ E L Q + GSLF + L KLRPFVR FL++AS++ ++Y
Sbjct: 177 NSTKLHDISAAENNLGIQAAASKDDPNGSLFTLEGMQMLTKLRPFVRKFLKEASNMFEMY 236
Query: 136 LCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVRGQERGIVILDDTES 195
+ TM + YA KLLD + YF+S++I+ D + +K D+V G E VILDDTE
Sbjct: 237 IYTMGDKAYAIEIAKLLDPRNVYFNSKVISNSDCTQRHQKGLDMVLGAESVAVILDDTEY 296
Query: 196 VWSDHTENLIVLGKYVYFRD--KELNGDHKSYSETLTDESENEEALANVLRVLKTIHRLF 253
VW H ENLI++ +Y YF ++ KS SE + DE ++ ALA +L VLK IH +F
Sbjct: 297 VWQKHKENLILMERYHYFASSCRQFGFSVKSLSELMQDERGSDGALATILDVLKRIHTIF 356
Query: 254 FDSV 257
FDSV
Sbjct: 357 FDSV 360
>gi|302764346|ref|XP_002965594.1| hypothetical protein SELMODRAFT_167775 [Selaginella moellendorffii]
gi|300166408|gb|EFJ33014.1| hypothetical protein SELMODRAFT_167775 [Selaginella moellendorffii]
Length = 411
Score = 156 bits (395), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 121/352 (34%), Positives = 174/352 (49%), Gaps = 73/352 (20%)
Query: 44 MNDSFGLSFDYMLRGLRYSEQEE----RKLQLVLNLDHTLLHCR---------------- 83
+++ F L+ D + R +R E + RKL LVL+LDHTLL+
Sbjct: 29 IHEEFELAGDVLAR-VREDELRQVLGKRKLFLVLDLDHTLLNSARWMEVFPDETAYLEHT 87
Query: 84 -------NIKSLSSGEKYLKKQIHSFIGSLFQMANDKL-VKLRPFVRTFLEQASSLVDIY 135
I +LS+G + I G L ++ +L KLRPF FLE+AS L ++Y
Sbjct: 88 YMNVPEDKIPALSNGAPAVAGVIQPGGGGLHRIHGMQLWTKLRPFAHKFLEEASKLFEMY 147
Query: 136 LCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVRGQERGIVILDDTES 195
+ TM R YA LLD K+F R+I++ D + K+ D+V G + ++ILDDTE+
Sbjct: 148 VYTMGERMYAVTMAHLLDPTGKFFKGRVISQRDSTCRQTKDLDIVLGADSAVLILDDTEA 207
Query: 196 VWSDHTENLIVLGKYVYFRD--KELNGDHKSYSETLTDESENEEALANVLRVLKTIHRLF 253
VW H NLIV+ +Y +F+ ++ ++ S ++ DES++E ALANVL+VL+ IH F
Sbjct: 208 VWPKHRANLIVMERYHFFQSSCRQFGLENPSLTKAERDESKDEGALANVLKVLQRIHSDF 267
Query: 254 F----DS--VCGDVRTYLPKVRSE-FSRDVLYFSAIF-RDCLWAE--------------- 290
F DS C DVR VRSE S L FS IF DCL E
Sbjct: 268 FMESDDSRYTC-DVRDITSVVRSEILSGCKLVFSRIFPTDCLEPELTPLWRLCVDLGAEC 326
Query: 291 ------------------QEEKFLVQEKKFLVHPRWIDAYYFLWRRRPEDDY 324
+ K+ + +KFLVHP W++A + LWRR E ++
Sbjct: 327 VLAHDDSVTHVVALDRFTDKAKWAKEHRKFLVHPAWVEAAHSLWRRPNELEF 378
>gi|168012675|ref|XP_001759027.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689726|gb|EDQ76096.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 389
Score = 153 bits (386), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 98/299 (32%), Positives = 148/299 (49%), Gaps = 48/299 (16%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKL-VKLRPFVRTFL 125
+KL LV++LDHT+L+ + G ++ ++ + SL QM L KLRPF FL
Sbjct: 11 KKLLLVVDLDHTVLNSARFADVPVGMTWIAGELQAGGSSLHQMTKLGLWTKLRPFAHEFL 70
Query: 126 EQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVRGQER 185
++AS L ++Y+ TM R YA+ KLLD + F+ RII++ D + K+ D+V G +
Sbjct: 71 QEASKLYEMYIYTMGERKYAKKMAKLLDPTRQLFADRIISQNDSTKRYTKDLDVVLGADS 130
Query: 186 GIVILDDTESVWSDHTENLIVLGKYVYFRD--KELNGDHKSYSETLTDESENEEALANVL 243
+VILDDTE+VW H NLI++ +Y +F + + S ++ DESE E LA L
Sbjct: 131 AVVILDDTEAVWPSHKSNLILMERYHFFSSSCSQFGVNSASLAQLYRDESETEGTLATTL 190
Query: 244 RVLKTIHRLFFDS------------VCGDVRTYL---------PKVR------------- 269
+ L+ IH +F+ V +R L P++
Sbjct: 191 KTLRAIHHEYFNGKVYFFKQLSLFFVIRSLRAKLLAGCNVVLGPEIHPFWQLPAELGARC 250
Query: 270 SEF----SRDVLYFSAIFRDCLWAEQEEKFLVQEKKFLVHPRWIDAYYFLWRRRPEDDY 324
S F + V+ LWA++ + FLVHPRW+DA +LW R PE+DY
Sbjct: 251 STFCDHTTTHVVALDPGTDQALWAKEHD-------VFLVHPRWVDATSYLWSRPPEEDY 302
>gi|9294425|dbj|BAB02545.1| unnamed protein product [Arabidopsis thaliana]
Length = 314
Score = 149 bits (377), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 95/255 (37%), Positives = 134/255 (52%), Gaps = 34/255 (13%)
Query: 26 SCAHTTVRDSRCIFCSQAMNDSF-GLSFDYMLRGLRYSEQ---------------EERKL 69
+C H +R CI C ++ + G FD GL S + +KL
Sbjct: 34 NCGHWYIRYGFCIVCKSTVDKTIEGRVFD----GLHLSSEALALTKRLITKFSCLNMKKL 89
Query: 70 QLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFI----------GSLFQMANDKLVKLRP 119
LVL+LD TL+H + LS EKYL ++ S G + + LVKLRP
Sbjct: 90 HLVLDLDLTLIHSVRVPCLSEAEKYLIEEAGSTTREDLWKMKVRGDPISITIEHLVKLRP 149
Query: 120 FVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDL 179
F+ FL++A+ + +Y+ T TR YAEA +KL+D YF R+I R + K D+
Sbjct: 150 FLCEFLKEANEMFTMYVYTKGTRPYAEAILKLIDPKKLYFGHRVITRNE--SPHTKTLDM 207
Query: 180 VRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRDKELNGDHKSYSETLTDESENEEAL 239
V ERG+VI+DDT W ++ NL+++G+Y YFR + + K +SE TDESEN L
Sbjct: 208 VLADERGVVIVDDTRKAWPNNKSNLVLIGRYNYFRSQ--SRVLKPHSEEKTDESENNGGL 265
Query: 240 ANVLRVLKTIHRLFF 254
ANVL++LK IH FF
Sbjct: 266 ANVLKLLKGIHHKFF 280
>gi|334185470|ref|NP_188594.3| haloacid dehalogenase-like hydrolase domain-containing protein
[Arabidopsis thaliana]
gi|332642744|gb|AEE76265.1| haloacid dehalogenase-like hydrolase domain-containing protein
[Arabidopsis thaliana]
Length = 302
Score = 149 bits (377), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 95/255 (37%), Positives = 134/255 (52%), Gaps = 34/255 (13%)
Query: 26 SCAHTTVRDSRCIFCSQAMNDSF-GLSFDYMLRGLRYSEQ---------------EERKL 69
+C H +R CI C ++ + G FD GL S + +KL
Sbjct: 34 NCGHWYIRYGFCIVCKSTVDKTIEGRVFD----GLHLSSEALALTKRLITKFSCLNMKKL 89
Query: 70 QLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFI----------GSLFQMANDKLVKLRP 119
LVL+LD TL+H + LS EKYL ++ S G + + LVKLRP
Sbjct: 90 HLVLDLDLTLIHSVRVPCLSEAEKYLIEEAGSTTREDLWKMKVRGDPISITIEHLVKLRP 149
Query: 120 FVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDL 179
F+ FL++A+ + +Y+ T TR YAEA +KL+D YF R+I R + K D+
Sbjct: 150 FLCEFLKEANEMFTMYVYTKGTRPYAEAILKLIDPKKLYFGHRVITRNE--SPHTKTLDM 207
Query: 180 VRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRDKELNGDHKSYSETLTDESENEEAL 239
V ERG+VI+DDT W ++ NL+++G+Y YFR + + K +SE TDESEN L
Sbjct: 208 VLADERGVVIVDDTRKAWPNNKSNLVLIGRYNYFRSQ--SRVLKPHSEEKTDESENNGGL 265
Query: 240 ANVLRVLKTIHRLFF 254
ANVL++LK IH FF
Sbjct: 266 ANVLKLLKGIHHKFF 280
>gi|226498568|ref|NP_001149751.1| CPL3 [Zea mays]
gi|195631558|gb|ACG36674.1| CPL3 [Zea mays]
Length = 493
Score = 148 bits (373), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 103/304 (33%), Positives = 154/304 (50%), Gaps = 46/304 (15%)
Query: 66 ERKLQLVLNLDHTLLHCRNIKSLSSGEK------YLKKQIHSFIGSL-FQMANDKLVKLR 118
ERKL LVL+LD TL++ + S+ EK Y + H + L + KL KLR
Sbjct: 159 ERKLILVLDLDSTLVNSARLCDFSAQEKRNGFTRYTGDKPHMDLFRLKYSNKARKLTKLR 218
Query: 119 PFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPD 178
PFVR FLEQASS+ ++++ T++ R YA+A + LLD + YF R+++R+D +D K+ D
Sbjct: 219 PFVRGFLEQASSMFEMHVYTLAKRAYAKAVIDLLDPNGVYFGGRVVSRKDSTRRDMKSLD 278
Query: 179 LVRGQER-GIVILDDTESVWSDHTENLIVLGKYVYFRD--KELNGDHKSYSETLTDESEN 235
++ G + +VILDDT+ VW H +NLI++ +Y YF ++ D S +E DE E
Sbjct: 279 VIPGADPVAVVILDDTD-VWPAHQDNLILMDRYHYFASTCRKFRYDIPSLAEQGRDEREQ 337
Query: 236 EEALANVLRVLKTIHRLFFDSVCGDVRTYLPKVRSEFSRDVLYFSAIFRDC--------- 286
+ +LA VL VL+ IH+ FFD DVR + +VR + + + DC
Sbjct: 338 DNSLAVVLNVLRRIHQDFFDGDQADVREVIREVRRQVLPECTVAFSYLDDCMEDFPENTL 397
Query: 287 LW--------------------------AEQEEKFLVQEKKFLVHPRWIDAYYFLWRRRP 320
+W Q+ ++ KFLV+P WI A F W R
Sbjct: 398 MWTLAERLGAVCRKDVDETVTHVVAEDPGTQKAQWARDHGKFLVNPEWIKASGFRWCRVD 457
Query: 321 EDDY 324
E +
Sbjct: 458 EQGF 461
>gi|413924219|gb|AFW64151.1| hypothetical protein ZEAMMB73_480827 [Zea mays]
Length = 490
Score = 147 bits (372), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 103/304 (33%), Positives = 154/304 (50%), Gaps = 46/304 (15%)
Query: 66 ERKLQLVLNLDHTLLHCRNIKSLSSGEK------YLKKQIHSFIGSL-FQMANDKLVKLR 118
ERKL LVL+LD TL++ + S+ EK Y + H + L + KL KLR
Sbjct: 156 ERKLILVLDLDSTLVNSARLCDFSAQEKRNGFTRYTGDKPHMDLFRLKYSNKARKLTKLR 215
Query: 119 PFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPD 178
PFVR FLEQASS+ ++++ T++ R YA+A + LLD + YF R+++R+D +D K+ D
Sbjct: 216 PFVRGFLEQASSMFEMHVYTLAKRAYAKAVIDLLDPNGVYFGGRVVSRKDSTRRDMKSLD 275
Query: 179 LVRGQER-GIVILDDTESVWSDHTENLIVLGKYVYFRD--KELNGDHKSYSETLTDESEN 235
++ G + +VILDDT+ VW H +NLI++ +Y YF ++ D S +E DE E
Sbjct: 276 VIPGADPVAVVILDDTD-VWPAHQDNLILMDRYHYFASTCRKFRYDIPSLAEQGRDEREQ 334
Query: 236 EEALANVLRVLKTIHRLFFDSVCGDVRTYLPKVRSEFSRDVLYFSAIFRDC--------- 286
+ +LA VL VL+ IH+ FFD DVR + +VR + + + DC
Sbjct: 335 DNSLAVVLNVLRRIHQDFFDGDQADVREVIREVRRQVLPECTIAFSYLDDCMEDFPENTL 394
Query: 287 LW--------------------------AEQEEKFLVQEKKFLVHPRWIDAYYFLWRRRP 320
+W Q+ ++ KFLV+P WI A F W R
Sbjct: 395 MWTLAERLGAVCRKDVDETVTHVVAEDPGTQKAQWARDHGKFLVNPEWIKASGFRWCRVD 454
Query: 321 EDDY 324
E +
Sbjct: 455 EQGF 458
>gi|15239576|ref|NP_200232.1| haloacid dehalogenase-like hydrolase domain-containing protein
[Arabidopsis thaliana]
gi|9759494|dbj|BAB10744.1| unnamed protein product [Arabidopsis thaliana]
gi|332009084|gb|AED96467.1| haloacid dehalogenase-like hydrolase domain-containing protein
[Arabidopsis thaliana]
Length = 306
Score = 147 bits (371), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 96/254 (37%), Positives = 137/254 (53%), Gaps = 28/254 (11%)
Query: 24 SLSCAHTTVRDSRCIFCSQAMNDSFGLSFDYMLRGLRYSEQ---------------EERK 68
S +C H VR C C + G SFDY++ GL+ S+ ++K
Sbjct: 29 STNCDHFFVRYGICCNCRSNVERHRGRSFDYLVDGLQLSDIAVTVTKRVTTQITCFNDKK 88
Query: 69 LQLVLNLDHTLLHCRNIKSLSSGEKYL------KKQIHSFIGSLFQMANDKLVKLRPFVR 122
L LVL+LDHTLLH I +L+ E YL ++ + G +++ L+KLRPFV
Sbjct: 89 LHLVLDLDHTLLHTVMISNLTKEETYLIEEEDSREDLRRLNGG---YSSEFLIKLRPFVH 145
Query: 123 TFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVRG 182
FL++A+ + +Y+ TM R YA + L+D + YF R+I R + K DLV
Sbjct: 146 EFLKEANKMFSMYVYTMGDRDYAMNVLNLIDPEKVYFGDRVITRNE--SPYIKTLDLVLA 203
Query: 183 QERGIVILDDTESVWSDHTENLIVLGKYVYFRDKELNGDH--KSYSETLTDESENEEALA 240
E G+VI+DDT VW DH NL+ + KY YF DK + KSY+E DES N+ +LA
Sbjct: 204 DECGVVIVDDTPHVWPDHKRNLLEITKYNYFSDKTRHDVKYTKSYAEEKRDESRNDGSLA 263
Query: 241 NVLRVLKTIHRLFF 254
NVL+V+K ++ FF
Sbjct: 264 NVLKVIKQVYEGFF 277
>gi|242063378|ref|XP_002452978.1| hypothetical protein SORBIDRAFT_04g035900 [Sorghum bicolor]
gi|241932809|gb|EES05954.1| hypothetical protein SORBIDRAFT_04g035900 [Sorghum bicolor]
Length = 464
Score = 146 bits (369), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 106/304 (34%), Positives = 163/304 (53%), Gaps = 45/304 (14%)
Query: 66 ERKLQLVLNLDHTLLHCRNIKSLSSGEKY------LKKQIHSFIGSLFQMANDK-LVKLR 118
ERKL LVL+LDHTLL+ ++ LS+ E+ + ++H + L N + L KLR
Sbjct: 154 ERKLILVLDLDHTLLNSTRLQDLSALEQRNGFTPDTEDELHMELFRLEYSDNVRMLTKLR 213
Query: 119 PFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPD 178
PFVR FL+QASS ++++ T+ + YA+A + LLD D YF R+++R++ +D K+ D
Sbjct: 214 PFVRGFLDQASSRFEMHVYTLGRQDYAKAVIDLLDPDGVYFRGRVVSRKESTQRDVKSLD 273
Query: 179 LVRGQERG-IVILDDTESVWSDHTENLIVLGKYVYF--RDKELNGDHKSYSETLTDESEN 235
++ G + +VILDDT+S W H +NLI++ +Y YF ++ + S +E DE E+
Sbjct: 274 VIPGADPAAVVILDDTDSAWPGHQDNLILMDRYHYFACTCRKFRYNIPSMAEQARDEREH 333
Query: 236 EEALANVLRVLKTIHRLFFDSVCGDVRTYLPKVR------------------SEFSRDVL 277
+ +LA VL VL IH+ FFD DVR + +VR +F D L
Sbjct: 334 DGSLAVVLGVLNRIHQAFFDDDRADVREVIAEVRRQVLPVCTVVFSYLEEYMEDFPEDTL 393
Query: 278 YFS-------AIFRDC------LWAE----QEEKFLVQEKKFLVHPRWIDAYYFLWRRRP 320
++ A +D + AE Q+ ++ + KFLV+P WI A F W R
Sbjct: 394 MWTLAERLGAACQKDVDETVTHVVAEDPGTQKAQWAREHGKFLVNPEWIKAVNFRWCRVD 453
Query: 321 EDDY 324
E D+
Sbjct: 454 ERDF 457
>gi|297792855|ref|XP_002864312.1| NLI interacting factor family protein [Arabidopsis lyrata subsp.
lyrata]
gi|297310147|gb|EFH40571.1| NLI interacting factor family protein [Arabidopsis lyrata subsp.
lyrata]
Length = 305
Score = 145 bits (365), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 94/251 (37%), Positives = 133/251 (52%), Gaps = 28/251 (11%)
Query: 27 CAHTTVRDSRCIFCSQAMNDSFGLSFDYMLRGLRYSEQ---------------EERKLQL 71
C H VR C C + G +FDY++ GL S+ ++KL L
Sbjct: 34 CGHFFVRYGICCHCRSNVERHGGRAFDYLVDGLELSDVAVKVTKRVTTQITCFNDKKLHL 93
Query: 72 VLNLDHTLLHCRNIKSLSSGEKYL------KKQIHSFIGSLFQMANDKLVKLRPFVRTFL 125
VL+LDHTLLH + +LS E YL ++ + F G +++ L+KLRP+V FL
Sbjct: 94 VLDLDHTLLHTVMVSNLSKEETYLIGEADSREDLWKFNGG---YSSEFLIKLRPYVHEFL 150
Query: 126 EQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVRGQER 185
++A+ + +Y+ TM R YA +KL+D + YF R+I R + K DLV E
Sbjct: 151 KEANEMFSMYVYTMGDRDYANNVLKLIDPEKIYFGHRVITRNE--SPYIKTLDLVLADEC 208
Query: 186 GIVILDDTESVWSDHTENLIVLGKYVYFRDKELNGDH--KSYSETLTDESENEEALANVL 243
G+VI+DDT VW D NL+ + KY YF DK KSY+E DE N+ +LANVL
Sbjct: 209 GVVIVDDTPQVWPDDKRNLLEITKYNYFSDKTRRDVKYSKSYAEEKRDEGRNDGSLANVL 268
Query: 244 RVLKTIHRLFF 254
+V+K I+ FF
Sbjct: 269 KVIKEIYEGFF 279
>gi|168059994|ref|XP_001781984.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666557|gb|EDQ53208.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 563
Score = 142 bits (357), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 105/347 (30%), Positives = 156/347 (44%), Gaps = 70/347 (20%)
Query: 46 DSFGLSFDYMLRGLRYSEQE--------------ERKLQLVLNLDHTLLHCRNIKSLSSG 91
D GL Y+ GL SE E ++KL LV++LDHT+L+ + +
Sbjct: 151 DRVGLR--YIHEGLEVSELEAARVRNAELRRVTGKQKLLLVVDLDHTMLNSARFSEVPAE 208
Query: 92 EK----YLKKQIHSFIGSLFQMANDKL-VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAE 146
E+ + Q H + SL Q+ + KLRPF FLE+AS L ++Y+ TM + YA+
Sbjct: 209 ERIYLTWTAGQQHGRVSSLHQLTKLGMWTKLRPFAHKFLEEASKLYEMYVYTMGEKIYAQ 268
Query: 147 AAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVRGQERGIVILDDTESVWSDHTENLIV 206
A +LLD + F RII++ D + K+ D+V G E +VILDDTE+VW +H NLI+
Sbjct: 269 AMAELLDPTGQLFGGRIISQTDSTKRHTKDLDVVLGAESAVVILDDTEAVWPNHRSNLIL 328
Query: 207 LGKYVYFRDK--ELNGDHKSYSETLTDESENEEALANVLRVLKTIHRLFFDSVCGDVRTY 264
+ +Y +F + S ++ DE E + LA L+ L+ IH FF+ G
Sbjct: 329 MERYHFFTSSCHQFRVRAPSLAQMHRDECEIDGTLATTLKTLQAIHHEFFNGHKGKSMKR 388
Query: 265 LPKVRSEFSRDV-------------LYFSAIFRDCL--------W--------------- 288
P + RDV + FS IF L W
Sbjct: 389 RPPLELPDVRDVIRSIRGKLLSGCHIVFSRIFPTGLQNPEFHPFWQLAVELGARCSTVCD 448
Query: 289 -----------AEQEEKFLVQEKKFLVHPRWIDAYYFLWRRRPEDDY 324
+ ++ Q LVHPRW++A +LW+R E D+
Sbjct: 449 HTTTHVVALDRGTDKARWAKQHGISLVHPRWVEAASYLWKRPREKDF 495
>gi|147774299|emb|CAN76945.1| hypothetical protein VITISV_002430 [Vitis vinifera]
Length = 641
Score = 141 bits (356), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 81/196 (41%), Positives = 119/196 (60%), Gaps = 9/196 (4%)
Query: 114 LVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKD 173
L KLRP+V TFL++AS + ++Y+ TM R YA KLLD + YFSSR+I++ D +
Sbjct: 4 LTKLRPYVHTFLKEASKMFEMYIYTMGERSYALEMAKLLDPERVYFSSRVISQADCTQRH 63
Query: 174 RKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRD--KELNGDHKSYSETLTD 231
+K D+V GQE ++ILDDTESVW H +NLI++ +Y +F ++ + KS SE +D
Sbjct: 64 QKGLDVVLGQESAVLILDDTESVWQKHKDNLILMERYHFFASSCRQFGFNCKSLSELKSD 123
Query: 232 ESENEEALANVLRVLKTIHRLFFDSVCG------DVRTYLPKVRSEFSRDV-LYFSAIFR 284
ESE + ALA VL+VL+ IH +FFD G DVR + +VR + + + FS +F
Sbjct: 124 ESEPDGALATVLKVLQRIHSMFFDPELGDDFSGRDVRQVVKRVRKDVLKGCKIVFSRVFP 183
Query: 285 DCLWAEQEEKFLVQEK 300
AE + + E+
Sbjct: 184 TRFQAENHHLWRMAEQ 199
>gi|297808347|ref|XP_002872057.1| NLI interacting factor family protein [Arabidopsis lyrata subsp.
lyrata]
gi|297317894|gb|EFH48316.1| NLI interacting factor family protein [Arabidopsis lyrata subsp.
lyrata]
Length = 302
Score = 140 bits (354), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 100/279 (35%), Positives = 145/279 (51%), Gaps = 41/279 (14%)
Query: 24 SLSCAHTTVRDSRCIFCSQAMNDSFGLSFDYMLRGLRYSEQ---------------EERK 68
S +C H VR+ CI C+ ++ G SFDY+ +G+ S + E++K
Sbjct: 27 SRNCEHWFVRNKICISCNTTLDKYDGRSFDYLYKGMHMSHEALVFTKRVISQTSWLEDKK 86
Query: 69 LQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSF-----IGSLFQMANDKLVKLRPFVRT 123
L LVL+LDHTL+H L EK L +++ S S F ++ L+KLRPFV
Sbjct: 87 LHLVLDLDHTLVHTIKASQLYESEKCLTEEVGSRKDLWRFNSGF--PDESLIKLRPFVHQ 144
Query: 124 FLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVRGQ 183
FL++ + + +Y+ T YA+ ++L+D + YF +R+I R + D K DLV
Sbjct: 145 FLKECNEMFSMYVYTKGGCDYAQVVLELIDPEKIYFGNRVITRRE--SPDLKTLDLVLAD 202
Query: 184 ERGIVILDDTESVWSDHTENLIVLGKYVYFRDKELNGDHKSYSETLT--DESENEEALAN 241
ERG+VI+DD SVW +NL+ + KY YF D+ S+SE DESE + L
Sbjct: 203 ERGVVIVDDKCSVWPHDKKNLLQIAKYKYFGDQSC-----SFSECKNKRDESEEKGPLDI 257
Query: 242 VLRVLKTIHRLFF--------DSVCGDVRTYLPKVRSEF 272
VLR LK +H FF DSV DVR L ++ S +
Sbjct: 258 VLRFLKDVHNEFFCDWSRKDLDSV--DVRPLLKEISSRW 294
>gi|226498676|ref|NP_001145873.1| hypothetical protein [Zea mays]
gi|219884795|gb|ACL52772.1| unknown [Zea mays]
gi|413939308|gb|AFW73859.1| hypothetical protein ZEAMMB73_968817 [Zea mays]
Length = 425
Score = 140 bits (352), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 85/221 (38%), Positives = 135/221 (61%), Gaps = 9/221 (4%)
Query: 60 RYSEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGS---LFQMANDKL-- 114
R + ERKL L+L+LDHTLL+ ++ LS E+ ++F + LF++ D L
Sbjct: 203 RATLMRERKLILILDLDHTLLNSTSLYDLSPVEQAKGFTPYTFGDTSIDLFRVDIDNLSM 262
Query: 115 -VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKD 173
VKL F R FL+QA++L ++++ T+ R YA AAV+LLD + YF RI++R + ++
Sbjct: 263 LVKLGAFARGFLKQANALFEMHVYTLGIRAYARAAVRLLDPNGIYFGGRIVSRNESTKEN 322
Query: 174 RKNPDLVRGQERG-IVILDDTESVWSDHTENLIVLGKYVYFRD--KELNGDHKSYSETLT 230
K+ D+++G + +VILDDT+ VW + +NLI++ +Y YF + + D S +E
Sbjct: 323 TKSLDVIQGADPAMVVILDDTDGVWPGYPDNLILMDRYRYFASTCRTFDYDIPSLAEQGL 382
Query: 231 DESENEEALANVLRVLKTIHRLFFDSVCGDVRTYLPKVRSE 271
+E E++ +LA VL L+ IH+ FFD DVR + KVRS+
Sbjct: 383 EEREHDGSLAVVLGALQRIHQGFFDGHRADVREVIAKVRSQ 423
>gi|15237769|ref|NP_197738.1| haloacid dehalogenase-like hydrolase domain-containing protein
[Arabidopsis thaliana]
gi|9759085|dbj|BAB09563.1| unnamed protein product [Arabidopsis thaliana]
gi|332005790|gb|AED93173.1| haloacid dehalogenase-like hydrolase domain-containing protein
[Arabidopsis thaliana]
Length = 302
Score = 139 bits (351), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 98/282 (34%), Positives = 141/282 (50%), Gaps = 43/282 (15%)
Query: 24 SLSCAHTTVRDSRCIFCSQAMNDSFGLSFDYMLRGLRYSEQ---------------EERK 68
S +C H VR+ CI C +++ G SFDY+ +G++ S + E++K
Sbjct: 27 SPNCNHWFVRNKICISCYTTVDNFEGRSFDYLYKGMQMSNEALGFTKGLISQTSWLEDKK 86
Query: 69 LQLVLNLDHTLLHCRNIKSLSSGEKYL------KKQIHSFIGSLFQMANDKLVKLRPFVR 122
L LVL+LD TL+H L EKY+ +K I F + L+KLRPFV
Sbjct: 87 LHLVLDLDQTLIHTIKTSLLYESEKYIIEEVESRKDIKRFNTGF---PEESLIKLRPFVH 143
Query: 123 TFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVRG 182
FL++ + + +Y+ T YA ++++D D YF +R+I R + G K DLV
Sbjct: 144 QFLKECNEMFSMYVYTKGGYDYARLVLEMIDPDKFYFGNRVITRRESPG--FKTLDLVLA 201
Query: 183 QERGIVILDDTESVWSDHTENLIVLGKYVYFRDKE--LNGDHKSYSETLTDESENEEALA 240
ERGIVI+DDT SVW +NL+ + +Y YF DK + D K DES+ + L
Sbjct: 202 DERGIVIVDDTSSVWPHDKKNLLQIARYKYFGDKSCLFSEDKKK-----IDESDEKGPLN 256
Query: 241 NVLRVLKTIHRLFF--------DSVCGDVRTYLPKVRSEFSR 274
LR LK +H FF DSV DVR L ++ + R
Sbjct: 257 TALRFLKDVHEEFFYDWSKKDLDSV--DVRPLLKEISLRWKR 296
>gi|47497024|dbj|BAD19077.1| phosphatase-like [Oryza sativa Japonica Group]
gi|47497233|dbj|BAD19278.1| phosphatase-like [Oryza sativa Japonica Group]
gi|125584004|gb|EAZ24935.1| hypothetical protein OsJ_08715 [Oryza sativa Japonica Group]
Length = 420
Score = 135 bits (340), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 106/319 (33%), Positives = 151/319 (47%), Gaps = 67/319 (21%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEK---YLKKQIHSFIGSLFQMANDKLV-KLRPFVR 122
RKL LV++LDHTL++ LS EK + ++ LF+M +++ KLRPFV
Sbjct: 105 RKLILVVDLDHTLINSTRFAHLSDDEKANGFTERTGDDRSRGLFRMGLFRMITKLRPFVH 164
Query: 123 TFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVRG 182
FL +AS++ ++++ T+ R YA A KLLD D YF RII+ + + DRK+ V G
Sbjct: 165 EFLREASAMFEMHVYTLGNRNYATAVAKLLDPDGAYFGERIISSGESSQPDRKSLGDVFG 224
Query: 183 -----QERGIVILDDTESVWSDHTENLIVLGKYVYFRDK--ELNGDHKSYSETLTDESEN 235
+ +VILDDT VW + +NLI + +Y+YF + +S +E DESE
Sbjct: 225 WAPEMERAAVVILDDTAEVWKGYRDNLIEMERYLYFASSRGKFGIAVRSLAERNRDESER 284
Query: 236 EEALANVLRVLKTIHRLFF-DSVC----GDVRTYLPKVRSEFSR---------------- 274
E ALA LRVL+ +H FF SVC DVR + + R E R
Sbjct: 285 EGALAVALRVLRRVHGEFFSGSVCSGSFADVREVIRQARREVLRGCTVAFTGVIPSGDGG 344
Query: 275 ------------------------DVLYFSA---IFRDCLWAEQEEKFLVQEKKFLVHPR 307
V +F A + R LWA+ KFLV +
Sbjct: 345 RASDHPVWRRAEQLGATCADDVGEGVTHFVAGKPVTRKALWAQTHGKFLVDTE------- 397
Query: 308 WIDAYYFLWRRRPEDDYLP 326
WI+A +F W +PE+ P
Sbjct: 398 WINAAHFRW-SKPEERMYP 415
>gi|15226925|ref|NP_178335.1| Haloacid dehalogenase-like hydrolase-like protein [Arabidopsis
thaliana]
gi|3894162|gb|AAC78512.1| hypothetical protein [Arabidopsis thaliana]
gi|330250469|gb|AEC05563.1| Haloacid dehalogenase-like hydrolase-like protein [Arabidopsis
thaliana]
Length = 302
Score = 134 bits (338), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 92/279 (32%), Positives = 143/279 (51%), Gaps = 37/279 (13%)
Query: 24 SLSCAHTTVRDSRCIFCSQAMNDSFGLSFDYMLRGLRYSEQ---------------EERK 68
S +C+H VR+ C C+ +++ G SFDY+ G++ S + E++K
Sbjct: 27 SRNCSHWFVRNKVCASCNTIVDNYQGRSFDYLYTGIQMSNEALGFTKRLISQTSWLEDKK 86
Query: 69 LQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSF-----IGSLFQMANDKLVKLRPFVRT 123
L LVL+LDHTL+H + LS EKY+ +++ S + F + L+KLR FV
Sbjct: 87 LHLVLDLDHTLVHTIKVSQLSESEKYITEEVESRKDLRRFNTGF--PEESLIKLRSFVHQ 144
Query: 124 FLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVRGQ 183
FL++ + + +Y+ T YA+ ++++D D YF +R+I R + G K DLV
Sbjct: 145 FLKECNEMFSLYVYTKGGYDYAQLVLEMIDPDKIYFGNRVITRRESPG--FKTLDLVLAD 202
Query: 184 ERGIVILDDTESVWSDHTENLIVLGKYVYFRDKELNGDHKSYSETLTDESENEEALANVL 243
ERGIV++DD SVW +NL+ + +Y YF D+ S + DES+ + L L
Sbjct: 203 ERGIVVVDDKSSVWPHDKKNLLQIARYKYFGDQSC---LLSECKKKIDESDEKGPLNTAL 259
Query: 244 RVLKTIHRLFF--------DSVCGDVRTYLPKVRSEFSR 274
R L +H FF DSV DVR L ++ + R
Sbjct: 260 RFLMDVHEEFFCDWSRKDLDSV--DVRPLLKEISLRWKR 296
>gi|297846748|ref|XP_002891255.1| NLI interacting factor family protein [Arabidopsis lyrata subsp.
lyrata]
gi|297337097|gb|EFH67514.1| NLI interacting factor family protein [Arabidopsis lyrata subsp.
lyrata]
Length = 210
Score = 134 bits (337), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 83/206 (40%), Positives = 120/206 (58%), Gaps = 10/206 (4%)
Query: 66 ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDK----LVKLRPFV 121
++KL LVL+LDHTL+H + LS EKYL ++ S L++ D ++KLRPFV
Sbjct: 2 KKKLHLVLDLDHTLIHTVLVSDLSEREKYLLEEADSR-QDLWRCNKDSPYEFIIKLRPFV 60
Query: 122 RTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVR 181
FL +A+ L +++ TM CYA+ +KL+D D YF +R+I RE K DL+
Sbjct: 61 HEFLLEANKLFTMHVYTMGNSCYAQDVLKLIDPDKVYFGNRVITRE--ASPCNKTLDLLV 118
Query: 182 GQERGIVILDDTESVWSDHTENLIVLGKYVYFRDKELNGDHKSYSETLTDESENEEALAN 241
R +VI+DDT SVW H NL+ + KY+YFR D SY+E DES +LAN
Sbjct: 119 ADTRRVVIVDDTISVWPHHKRNLLQITKYIYFRVDGTKWD--SYAEEKKDESRKSGSLAN 176
Query: 242 VLRVLKTIHRLFFDSV-CGDVRTYLP 266
VL+ L+ +H+ F + + D+R +P
Sbjct: 177 VLKFLEDVHKRFEEDLDSKDLRLLIP 202
>gi|125541461|gb|EAY87856.1| hypothetical protein OsI_09278 [Oryza sativa Indica Group]
Length = 420
Score = 133 bits (335), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 104/312 (33%), Positives = 152/312 (48%), Gaps = 53/312 (16%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEK---YLKKQIHSFIGSLFQMANDKLV-KLRPFVR 122
RKL LV++LDHTL++ LS EK + ++ LF+M +++ KLRPFV
Sbjct: 105 RKLILVVDLDHTLINSTRFAHLSDDEKANGFTERTGDDRSRGLFRMGLFRMITKLRPFVH 164
Query: 123 TFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVRG 182
FL +AS++ ++++ T+ R YA A KLLD D YF RII+ + + DRK+ V G
Sbjct: 165 EFLREASAMFEMHVYTLGNRNYATAVAKLLDPDGAYFGERIISSGESSQPDRKSLGDVFG 224
Query: 183 -----QERGIVILDDTESVWSDHTENLIVLGKYVYFRDK--ELNGDHKSYSETLTDESEN 235
+ +VILDDT VW + +NLI + +Y+YF + +S +E DESE
Sbjct: 225 WAPEMERAAVVILDDTAEVWKGYRDNLIEMERYLYFASSRGKFGIAARSLAERNRDESER 284
Query: 236 EEALANVLRVLKTIHRLFF-DSVC----GDVRTYLPKVRSEFSRD-VLYFSAIFRDC--- 286
E ALA LRVL+ +H FF SVC DVR + + R E R + F+ +
Sbjct: 285 EGALAVALRVLRRVHGEFFSGSVCSGSFADVREVIRQARREVLRGCTVAFTGVIPSGDGG 344
Query: 287 ------LWAEQEE-------------KFLVQEK-------------KFLVHPRWIDAYYF 314
+W + E+ +V K KFLV WI+A +F
Sbjct: 345 RASDHPVWRKAEQLGATCADDVGEGVTHVVAGKPVTGKALWAQTHGKFLVDTEWINAAHF 404
Query: 315 LWRRRPEDDYLP 326
W +PE+ P
Sbjct: 405 RW-SKPEERMYP 415
>gi|15218405|ref|NP_175026.1| NLI interacting factor (NIF) family protein [Arabidopsis thaliana]
gi|91805923|gb|ABE65690.1| NLI interacting factor family protein [Arabidopsis thaliana]
gi|332193852|gb|AEE31973.1| NLI interacting factor (NIF) family protein [Arabidopsis thaliana]
Length = 255
Score = 133 bits (334), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 86/224 (38%), Positives = 130/224 (58%), Gaps = 13/224 (5%)
Query: 49 GLSFDYMLRGLRYSEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQ 108
G F L +S +++KL LVL+LDHTLLH + LS EKYL ++ S L++
Sbjct: 33 GAWFKKHLTTQLFSVTKKKKLHLVLDLDHTLLHSVLVSDLSKREKYLLEETDS-RQDLWR 91
Query: 109 MANDK---LVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIA 165
D ++KLRPF+ FL +A+ L +++ TM + YA+ +KL+D D YF R+I
Sbjct: 92 RNVDGYEFIIKLRPFLHEFLLEANKLFTMHVYTMGSSSYAKQVLKLIDPDKVYFGKRVIT 151
Query: 166 RE--DFNGKDRKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRDKELNGDHK 223
RE FN K+ DL+ +R +VI+DDT VW H NL+ + KY+YF+ D
Sbjct: 152 REASPFN----KSLDLLAADKRRVVIVDDTVHVWPFHKRNLLQITKYIYFKVDGTKWD-- 205
Query: 224 SYSETLTDESENEEALANVLRVLKTIHRLFFDSVC-GDVRTYLP 266
SY+E DES++ +LANVL+ L+ +H+ F + + D+R +P
Sbjct: 206 SYAEAKKDESQSNGSLANVLKFLEVVHKRFEEDLGFKDLRLLIP 249
>gi|116830952|gb|ABK28432.1| unknown [Arabidopsis thaliana]
Length = 256
Score = 133 bits (334), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 86/224 (38%), Positives = 130/224 (58%), Gaps = 13/224 (5%)
Query: 49 GLSFDYMLRGLRYSEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQ 108
G F L +S +++KL LVL+LDHTLLH + LS EKYL ++ S L++
Sbjct: 33 GAWFKKHLTTQLFSVTKKKKLHLVLDLDHTLLHSVLVSDLSKREKYLLEETDS-RQDLWR 91
Query: 109 MANDK---LVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIA 165
D ++KLRPF+ FL +A+ L +++ TM + YA+ +KL+D D YF R+I
Sbjct: 92 RNVDGYEFIIKLRPFLHEFLLEANKLFTMHVYTMGSSSYAKQVLKLIDPDKVYFGKRVIT 151
Query: 166 RE--DFNGKDRKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRDKELNGDHK 223
RE FN K+ DL+ +R +VI+DDT VW H NL+ + KY+YF+ D
Sbjct: 152 REASPFN----KSLDLLAADKRRVVIVDDTVHVWPFHKRNLLQITKYIYFKVDGTKWD-- 205
Query: 224 SYSETLTDESENEEALANVLRVLKTIHRLFFDSVC-GDVRTYLP 266
SY+E DES++ +LANVL+ L+ +H+ F + + D+R +P
Sbjct: 206 SYAEAKKDESQSNGSLANVLKFLEVVHKRFEEDLGFKDLRLLIP 249
>gi|15218404|ref|NP_175025.1| NLI interacting factor (NIF) family protein [Arabidopsis thaliana]
gi|117958727|gb|ABK59679.1| At1g43600 [Arabidopsis thaliana]
gi|332193851|gb|AEE31972.1| NLI interacting factor (NIF) family protein [Arabidopsis thaliana]
Length = 221
Score = 127 bits (319), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 82/204 (40%), Positives = 121/204 (59%), Gaps = 13/204 (6%)
Query: 69 LQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDK---LVKLRPFVRTFL 125
L LVL+LDHTLLH + LS EKYL ++ S L++ D ++KLRPF+ FL
Sbjct: 19 LHLVLDLDHTLLHSVLVSDLSKREKYLLEETDS-RQDLWRRNVDGYEFIIKLRPFLHEFL 77
Query: 126 EQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE--DFNGKDRKNPDLVRGQ 183
+A+ L +++ TM + YA+ +KL+D D YF R+I RE FN K+ DL+
Sbjct: 78 LEANKLFTMHVYTMGSSSYAKQVLKLIDPDKVYFGKRVITREASPFN----KSLDLLAAD 133
Query: 184 ERGIVILDDTESVWSDHTENLIVLGKYVYFRDKELNGDHKSYSETLTDESENEEALANVL 243
+R +VI+DDT VW H NL+ + KYVYF+ D SY+E DES++ +LANVL
Sbjct: 134 KRRVVIVDDTVHVWPFHKRNLLQITKYVYFKVDGTKWD--SYAEAKKDESQSNGSLANVL 191
Query: 244 RVLKTIHRLFFDSVC-GDVRTYLP 266
+ L+ +H+ F + + D+R +P
Sbjct: 192 KFLEDVHKRFEEDLGFKDLRLLIP 215
>gi|302769312|ref|XP_002968075.1| hypothetical protein SELMODRAFT_67516 [Selaginella moellendorffii]
gi|300163719|gb|EFJ30329.1| hypothetical protein SELMODRAFT_67516 [Selaginella moellendorffii]
Length = 141
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 61/141 (43%), Positives = 90/141 (63%), Gaps = 2/141 (1%)
Query: 116 KLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRK 175
KLRPF FLE+AS L ++Y+ TM R YA LLD K+F R+I++ D + K
Sbjct: 1 KLRPFAHKFLEEASKLFEMYVYTMGERMYAVTMAHLLDPTGKFFKGRVISQRDSTCRQTK 60
Query: 176 NPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRD--KELNGDHKSYSETLTDES 233
+ D+V G + ++ILDDTE+VW H NLIV+ +Y +F+ ++ ++ S ++ DES
Sbjct: 61 DLDIVLGADSAVLILDDTEAVWPKHRANLIVMERYHFFQSSCRQFGLENPSLTKAERDES 120
Query: 234 ENEEALANVLRVLKTIHRLFF 254
++E ALANVL+VL+ IH FF
Sbjct: 121 KDEGALANVLKVLQRIHSDFF 141
>gi|297830094|ref|XP_002882929.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297328769|gb|EFH59188.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 270
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 88/252 (34%), Positives = 126/252 (50%), Gaps = 34/252 (13%)
Query: 26 SCAHTTVRDSRCIFCSQAMNDSFGLSFDYMLRGLRYSEQE---ERKLQLVLNL------- 75
+C+H VR C C ++ G +FDY+ GLR S + ++L ++++
Sbjct: 12 NCSHLFVRHGICFTCKTKVSYVEGRAFDYLFSGLRLSHEAVSFTKQLTTLVSVYGHKKLH 71
Query: 76 ------DHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQAS 129
DHTL+H +LS+ EKYL K+ S + ND+LVK RPFV FL++A+
Sbjct: 72 LLVLDLDHTLIHSMKTLNLSNAEKYLIKEEKSGSRKDLRKYNDRLVKFRPFVEEFLKEAN 131
Query: 130 SLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVRGQERGIVI 189
L + T YA+A V++LD + YF RII R++ D K DLV ERGIVI
Sbjct: 132 KLFTMTAYTRGGSTYAKAVVRMLDPNKIYFGDRIITRKE--SPDLKTLDLVLADERGIVI 189
Query: 190 LDDTESVWSDHTENLIVLGKYVYFRDKELN--GDHKSYSETLTDESENEEALANVLRVLK 247
NL+ + Y YF++ N SY+E TDES + AL +L+ LK
Sbjct: 190 ------------RNLLEITSYFYFKNDHRNIMRSRLSYAERKTDESRTKRALVKLLKFLK 237
Query: 248 TIHRLFFDSVCG 259
+H FF CG
Sbjct: 238 EVHNGFF--TCG 247
>gi|297819962|ref|XP_002877864.1| hypothetical protein ARALYDRAFT_906616 [Arabidopsis lyrata subsp.
lyrata]
gi|297323702|gb|EFH54123.1| hypothetical protein ARALYDRAFT_906616 [Arabidopsis lyrata subsp.
lyrata]
Length = 284
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 88/279 (31%), Positives = 121/279 (43%), Gaps = 44/279 (15%)
Query: 25 LSCAHTTVRDSRCIFCSQAMNDSFG--LSFDYMLRGLRYSEQ--------------EERK 68
++C H VR C C A++ + F Y+ GL++ + +E++
Sbjct: 1 MACIHDIVRHGFCSQCKSAVDARHYALIPFSYLGNGLQFRPEFVGTTKRHVWMKSLKEKR 60
Query: 69 LQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIG-----SLFQMANDKLVKLRPFVRT 123
L LVL L TL R + LS GE YL ++ S F + L KLRPFV
Sbjct: 61 LTLVLGLHGTLYDSRLVSQLSDGENYLTGEVKSRFDLRRSKKFFPNQGEVLFKLRPFVHE 120
Query: 124 FLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVRGQ 183
FL +A+ L + + + + E + LD YF RII D + KN DLV
Sbjct: 121 FLREANKLFQMTVFELCSPEQGEEVISFLDPHGTYFEKRIITNRD---SEMKNLDLVLAD 177
Query: 184 ERGIVILDDTESV-WSDHTENLIVLGKYVYFRDKELN-------------------GDHK 223
ERGIVILDD W D T NL+ + Y +F+ N D K
Sbjct: 178 ERGIVILDDKHVYWWPDDTTNLLQIAPYHFFKRNNNNTWITKLVNFFKKTLSIDDESDPK 237
Query: 224 SYSETLTDESENEEALANVLRVLKTIHRLFFDSVCGDVR 262
SY+E DE + L N L +LK +H+ FFD D R
Sbjct: 238 SYAEERRDEDAEDGGLENALELLKEVHKNFFDEEDEDSR 276
>gi|297819964|ref|XP_002877865.1| hypothetical protein ARALYDRAFT_906617 [Arabidopsis lyrata subsp.
lyrata]
gi|297323703|gb|EFH54124.1| hypothetical protein ARALYDRAFT_906617 [Arabidopsis lyrata subsp.
lyrata]
Length = 345
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 100/309 (32%), Positives = 150/309 (48%), Gaps = 48/309 (15%)
Query: 6 CKECVGKTKFVIKRKCEQSLS--CAHTTVRDSRCIFCSQAMND-SFGLSFDYMLRGLRYS 62
CK V K E SL+ C H ++ RC C ++ F +F+Y+ + L S
Sbjct: 18 CKSPVKTYDANTKVAKETSLNPNCRHRLYQNRRCCRCGYYLDTWYFARAFNYIAKSLSMS 77
Query: 63 EQEE--------------RKLQLVLNLDHTLLHCRNIKSLSSGEKY--LKKQIHSFIGSL 106
+ E RKL LVL+L+HTL+ ++ LS ++Y L++ L
Sbjct: 78 PEFEATTKKQKLGIALGKRKLHLVLSLEHTLIDLISVSKLSEIDRYHLLEEADSGSRDDL 137
Query: 107 FQMAN------DKLVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFS 160
F++AN D LVK RPFVR FL +A + +++ T A+ VKLLD YF
Sbjct: 138 FRLANESFYSSDALVKFRPFVREFLREAEKIFTMHVYTNYGPGLAKKVVKLLDPHMIYFG 197
Query: 161 SRIIAREDFNGKDRKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRD----- 215
+RII +D NG D K+ +LV + RG++I+D +W N+I + KYVYF++
Sbjct: 198 NRIITSKDSNG-DLKSLELVLAEPRGVLIVDYDHRLWKSPGHNVIFMSKYVYFKEISNED 256
Query: 216 ----KELN--------GDHK-----SYSETLTDESENEEALANVLRVLKTIHRLFFDSVC 258
K LN GD+K SE + + ++E L +LR LK +H LFF+
Sbjct: 257 GVLAKTLNLLKKISLTGDYKVVDLEGKSEGESPDDDDELLLKVLLRSLKELHELFFNGGY 316
Query: 259 GDVRTYLPK 267
+V LP+
Sbjct: 317 QEVNPLLPR 325
>gi|302816075|ref|XP_002989717.1| hypothetical protein SELMODRAFT_23521 [Selaginella moellendorffii]
gi|302824047|ref|XP_002993670.1| hypothetical protein SELMODRAFT_23523 [Selaginella moellendorffii]
gi|300138493|gb|EFJ05259.1| hypothetical protein SELMODRAFT_23523 [Selaginella moellendorffii]
gi|300142494|gb|EFJ09194.1| hypothetical protein SELMODRAFT_23521 [Selaginella moellendorffii]
Length = 312
Score = 114 bits (285), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 99/309 (32%), Positives = 147/309 (47%), Gaps = 50/309 (16%)
Query: 65 EERKLQLVLNLDHTLLHCRNIKSLSSGEK------YLKKQIHSFIGSLFQMANDKL-VKL 117
E RKL LVL+LDHTL++ + + + EK Y + L ++ + +L K+
Sbjct: 2 EHRKLMLVLDLDHTLVNSASFDEVCAEEKPFLESMYARDPPKGRSKLLHKLDDLQLWTKI 61
Query: 118 RPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED---FNGKDR 174
RPF FL QAS L D+Y+ TM TR YAEA +KLLD F +++R D + +DR
Sbjct: 62 RPFALEFLAQASKLFDLYVYTMGTRIYAEAMLKLLDPTGVLFKG-LVSRNDNDLTDHRDR 120
Query: 175 KNPDLVRGQERGIVILDDTESVWSDHT-ENLIVLGKYVYFRD--KELNGDH-KSYSETLT 230
K+ D V GQE ++I+DD W + +NLI + +Y +F K D S +
Sbjct: 121 KDLDTVLGQESSVLIVDDLPEAWPEEQHKNLIQIDRYHFFSSSCKSFGFDESSSLARRGI 180
Query: 231 DESENEEALANVLRVLKTIHRLFFD----SVCGDVRTYLPKVRSEFSRDV-LYFSAIF-- 283
DES + +LA++L+ L+TIHR FF S DVR + ++RS L FS++
Sbjct: 181 DESHSGGSLASLLQGLETIHRDFFQYGEFSFLEDVRDTVSELRSHILEGCKLAFSSVVPI 240
Query: 284 --RDCLW--------------------------AEQEEKFLVQEKKFLVHPRWIDAYYFL 315
D LW ++ V+ K LV+P W+ A F
Sbjct: 241 DCEDSLWILCEGLGAECVLEIDDSVTHVVAMDPESARARWAVENGKHLVNPSWMRAAAFR 300
Query: 316 WRRRPEDDY 324
R E ++
Sbjct: 301 LGRPRESEF 309
>gi|255543174|ref|XP_002512650.1| RNA polymerase II ctd phosphatase, putative [Ricinus communis]
gi|223548611|gb|EEF50102.1| RNA polymerase II ctd phosphatase, putative [Ricinus communis]
Length = 1195
Score = 114 bits (285), Expect = 6e-23, Method: Composition-based stats.
Identities = 99/328 (30%), Positives = 149/328 (45%), Gaps = 62/328 (18%)
Query: 57 RGLRYSEQEE----RKLQLVLNLDHTLLHCRNI--------KSLSSGEKYLKKQIHSFIG 104
R R EQ++ RKL LVL+LDHTLL+ + L E+ +++ H +
Sbjct: 866 RARRIEEQKKLFSARKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKAHRHLF 925
Query: 105 SLFQMANDKLVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRII 164
M KLRP + FLE+AS L +++L TM + YA K+LD F+ R+I
Sbjct: 926 RFPHMG--MWTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPTGVLFNGRVI 983
Query: 165 ARED----FNGKDR--KNPDL--VRGQERGIVILDDTESVWSDHTENLIVLGKYVYF--R 214
+R D F+G +R K+ DL V G E G+VI+DD+ VW + NLIV+ +Y+YF
Sbjct: 984 SRGDDGEPFDGDERIPKSKDLEGVLGMESGVVIMDDSVRVWPHNKLNLIVVERYIYFPCS 1043
Query: 215 DKELNGDHKSYSETLTDESENEEALANVLRVLKTIHRLFFDSVC---GDVRTYL-PKVRS 270
++ S E DE + LA L V++ IH+ FF DVR L + R
Sbjct: 1044 RRQFGLPGPSLLEIDHDERPEDGTLACSLAVIERIHQNFFTHPSLDEADVRNILASEQRK 1103
Query: 271 EFSRDVLYFSAIFR--------DCLWAEQEE--------------------------KFL 296
+ + FS +F LW E+ +
Sbjct: 1104 ILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTNQIDEQVTHVVANSLGTDKVNWA 1163
Query: 297 VQEKKFLVHPRWIDAYYFLWRRRPEDDY 324
+ +F+V+P W++A L+RR E D+
Sbjct: 1164 LSTGRFVVYPGWVEASALLYRRANEQDF 1191
>gi|307106534|gb|EFN54779.1| hypothetical protein CHLNCDRAFT_134722 [Chlorella variabilis]
Length = 513
Score = 114 bits (284), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 90/282 (31%), Positives = 128/282 (45%), Gaps = 41/282 (14%)
Query: 29 HTTVRDSRCIFCS--QAMNDSFGLSFDYMLRGLRYSEQE--------------ERKLQLV 72
H CI C + + G++ Y+ RGL S+ E RKL L+
Sbjct: 62 HPGFMGGICIRCGALKGEAEEQGVALTYIHRGLVVSKHEAERVRQGTADRLLAHRKLLLI 121
Query: 73 LNLDHTLLHCRNIKSLS----------SGEKYLKKQIHS---FIGSLFQMANDKL-VKLR 118
L+LDHTLL+ + GE+ L+ Q+ + L+ + + ++ KLR
Sbjct: 122 LDLDHTLLNSTRFTEVPPQGAVTEQREGGEQALRAQLEAQPKGAPMLYCLPHMRMWTKLR 181
Query: 119 PFVRTFLEQASS------LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGK 172
P VR FLE A ++ + TM R YA KLLD F RII+ D +
Sbjct: 182 PGVREFLEAAKDRQVGQVGFELAVYTMGDRDYAGEMAKLLDPAGSLFHGRIISSGDSTQR 241
Query: 173 DRKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRDKELNGDHKSYS--ETLT 230
K+ D+V G+ER ++ILDDTE VW H +NL+ + +Y+YF +S S E
Sbjct: 242 YVKDLDVVLGRERCVLILDDTEGVWPRHRDNLVQIERYLYFPADAARFGFRSQSLLERAV 301
Query: 231 DESENEEALANVLRVLKTIHRLFF---DSVCGDVRTYLPKVR 269
DE ALA LRV+ + + FF D DVR L R
Sbjct: 302 DEEGGGGALATCLRVMSGVQQQFFEQGDPGAADVRPLLGAAR 343
>gi|218185830|gb|EEC68257.1| hypothetical protein OsI_36281 [Oryza sativa Indica Group]
Length = 1255
Score = 113 bits (283), Expect = 1e-22, Method: Composition-based stats.
Identities = 103/333 (30%), Positives = 147/333 (44%), Gaps = 72/333 (21%)
Query: 57 RGLRYSEQEE----RKLQLVLNLDHTLLHCRNIKSLSS--GEKYLKKQ--------IHSF 102
R R EQ + RKL LVL+LDHTLL+ + GE KK+ H F
Sbjct: 927 RARRIKEQHKMFAARKLCLVLDLDHTLLNSAKFIEVDHIHGEILRKKEEQDRERAERHLF 986
Query: 103 IGSLFQMANDKLVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSR 162
+ M KLRP + FLE+AS L +++L TM + YA K+LD F+ R
Sbjct: 987 CFNHMGM----WTKLRPGIWNFLEKASKLYELHLYTMGNKVYATEMAKVLDPTGTLFAGR 1042
Query: 163 IIARED----FNGKDR----KNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF- 213
+I+R D F+ +R K+ D V G E +VI+DD+ VW + NLIV+ +Y YF
Sbjct: 1043 VISRGDDGDPFDSDERVPKSKDLDGVLGMESAVVIIDDSVRVWPHNKHNLIVVERYTYFP 1102
Query: 214 -RDKELNGDHKSYSETLTDESENEEALANVLRVLKTIHRLFFDSVC---GDVRTYLPKVR 269
++ S E DE + LA+ L V++ IH+ FF DVR+ L
Sbjct: 1103 CSRRQFGLPGPSLLEIDRDERPEDGTLASSLTVIERIHKNFFSHPNLNDADVRSILA--- 1159
Query: 270 SEFSRDV----LYFSAIF--------RDCLWAEQEE------------------------ 293
SE R + + FS IF LW E+
Sbjct: 1160 SEQQRILGGCRIVFSRIFPVGEANPHMHPLWQTAEQFGAVCTNQIDDRVTHVVANSLGTD 1219
Query: 294 --KFLVQEKKFLVHPRWIDAYYFLWRRRPEDDY 324
+ + +F+VHP W++A L+RR E D+
Sbjct: 1220 KVNWALSTGRFVVHPGWVEASALLYRRASELDF 1252
>gi|242068555|ref|XP_002449554.1| hypothetical protein SORBIDRAFT_05g019010 [Sorghum bicolor]
gi|241935397|gb|EES08542.1| hypothetical protein SORBIDRAFT_05g019010 [Sorghum bicolor]
Length = 1197
Score = 113 bits (283), Expect = 1e-22, Method: Composition-based stats.
Identities = 99/326 (30%), Positives = 152/326 (46%), Gaps = 58/326 (17%)
Query: 57 RGLRYSEQEE----RKLQLVLNLDHTLLH-CRNIKSLSSGEKYLKKQIHSFIG----SLF 107
R R +EQ + RKL LVL+LDHTLL+ + I+ E+ L+K+ L+
Sbjct: 869 RARRITEQHKMFSARKLCLVLDLDHTLLNSAKFIEVEPIHEEMLRKKEEQDRTLPERHLY 928
Query: 108 QMANDKL-VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAR 166
+ + + KLRP + FLE+AS+L +++L TM + YA K+LD F+ R+I+R
Sbjct: 929 RFHHMNMWTKLRPGIWNFLEKASNLFELHLYTMGNKLYATEMAKVLDPTGTLFAGRVISR 988
Query: 167 ED----FNGKDR----KNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF--RDK 216
D F+ +R K+ D V G E +VI+DD+ VW + NLIV+ +Y YF +
Sbjct: 989 GDDGDPFDSDERVPKSKDLDGVLGMESAVVIIDDSVRVWPHNRHNLIVVERYTYFPCSRR 1048
Query: 217 ELNGDHKSYSETLTDESENEEALANVLRVLKTIHRLFFDSVC---GDVRTYL-PKVRSEF 272
+ S E DE + LA+ L V++ IH FF DVR+ L + R
Sbjct: 1049 QFGLPGPSLLEIDRDERPEDGTLASSLAVIERIHHNFFSHPNLNEADVRSILASEQRRIL 1108
Query: 273 SRDVLYFSAIFR--------DCLWAEQEE--------------------------KFLVQ 298
+ + FS +F LW E+ + +
Sbjct: 1109 AGCRIVFSRVFPVGDASPHLHPLWQTAEQFGAVCTNLVDDRVTHVVANSPGTDKVNWALS 1168
Query: 299 EKKFLVHPRWIDAYYFLWRRRPEDDY 324
+ KF+VHP W++A L+RR E D+
Sbjct: 1169 KGKFVVHPGWVEASALLYRRANEHDF 1194
>gi|77551160|gb|ABA93957.1| NLI interacting factor-like phosphatase family protein, expressed
[Oryza sativa Japonica Group]
Length = 1272
Score = 113 bits (283), Expect = 1e-22, Method: Composition-based stats.
Identities = 103/333 (30%), Positives = 147/333 (44%), Gaps = 72/333 (21%)
Query: 57 RGLRYSEQEE----RKLQLVLNLDHTLLHCRNIKSLSS--GEKYLKKQ--------IHSF 102
R R EQ + RKL LVL+LDHTLL+ + GE KK+ H F
Sbjct: 944 RARRIKEQHKMFAARKLCLVLDLDHTLLNSAKFIEVDHIHGEILRKKEEQDRERAERHLF 1003
Query: 103 IGSLFQMANDKLVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSR 162
+ M KLRP + FLE+AS L +++L TM + YA K+LD F+ R
Sbjct: 1004 CFNHMGM----WTKLRPGIWNFLEKASKLYELHLYTMGNKVYATEMAKVLDPTGTLFAGR 1059
Query: 163 IIARED----FNGKDR----KNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF- 213
+I+R D F+ +R K+ D V G E +VI+DD+ VW + NLIV+ +Y YF
Sbjct: 1060 VISRGDDGDPFDSDERVPKSKDLDGVLGMESAVVIIDDSVRVWPHNKHNLIVVERYTYFP 1119
Query: 214 -RDKELNGDHKSYSETLTDESENEEALANVLRVLKTIHRLFFDSVC---GDVRTYLPKVR 269
++ S E DE + LA+ L V++ IH+ FF DVR+ L
Sbjct: 1120 CSRRQFGLPGPSLLEIDRDERPEDGTLASSLAVIERIHKNFFSHPNLNDADVRSILA--- 1176
Query: 270 SEFSRDV----LYFSAIF--------RDCLWAEQEE------------------------ 293
SE R + + FS IF LW E+
Sbjct: 1177 SEQQRILGGCRIVFSRIFPVGEANPHMHPLWQTAEQFGAVCTNQIDDRVTHVVANSLGTD 1236
Query: 294 --KFLVQEKKFLVHPRWIDAYYFLWRRRPEDDY 324
+ + +F+VHP W++A L+RR E D+
Sbjct: 1237 KVNWALSTGRFVVHPGWVEASALLYRRASELDF 1269
>gi|222616055|gb|EEE52187.1| hypothetical protein OsJ_34058 [Oryza sativa Japonica Group]
Length = 1267
Score = 113 bits (283), Expect = 1e-22, Method: Composition-based stats.
Identities = 103/333 (30%), Positives = 147/333 (44%), Gaps = 72/333 (21%)
Query: 57 RGLRYSEQEE----RKLQLVLNLDHTLLHCRNIKSLSS--GEKYLKKQ--------IHSF 102
R R EQ + RKL LVL+LDHTLL+ + GE KK+ H F
Sbjct: 939 RARRIKEQHKMFAARKLCLVLDLDHTLLNSAKFIEVDHIHGEILRKKEEQDRERAERHLF 998
Query: 103 IGSLFQMANDKLVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSR 162
+ M KLRP + FLE+AS L +++L TM + YA K+LD F+ R
Sbjct: 999 CFNHMGM----WTKLRPGIWNFLEKASKLYELHLYTMGNKVYATEMAKVLDPTGTLFAGR 1054
Query: 163 IIARED----FNGKDR----KNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF- 213
+I+R D F+ +R K+ D V G E +VI+DD+ VW + NLIV+ +Y YF
Sbjct: 1055 VISRGDDGDPFDSDERVPKSKDLDGVLGMESAVVIIDDSVRVWPHNKHNLIVVERYTYFP 1114
Query: 214 -RDKELNGDHKSYSETLTDESENEEALANVLRVLKTIHRLFFDSVC---GDVRTYLPKVR 269
++ S E DE + LA+ L V++ IH+ FF DVR+ L
Sbjct: 1115 CSRRQFGLPGPSLLEIDRDERPEDGTLASSLAVIERIHKNFFSHPNLNDADVRSILA--- 1171
Query: 270 SEFSRDV----LYFSAIF--------RDCLWAEQEE------------------------ 293
SE R + + FS IF LW E+
Sbjct: 1172 SEQQRILGGCRIVFSRIFPVGEANPHMHPLWQTAEQFGAVCTNQIDDRVTHVVANSLGTD 1231
Query: 294 --KFLVQEKKFLVHPRWIDAYYFLWRRRPEDDY 324
+ + +F+VHP W++A L+RR E D+
Sbjct: 1232 KVNWALSTGRFVVHPGWVEASALLYRRASELDF 1264
>gi|242066826|ref|XP_002454702.1| hypothetical protein SORBIDRAFT_04g035880 [Sorghum bicolor]
gi|241934533|gb|EES07678.1| hypothetical protein SORBIDRAFT_04g035880 [Sorghum bicolor]
Length = 462
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 93/308 (30%), Positives = 145/308 (47%), Gaps = 57/308 (18%)
Query: 66 ERKLQLVLNLDHTLLHCRNIKSLSSGEKYL---KKQIHSFIGSLFQMANDKL---VKLRP 119
ERKL LVL+LD TLL+ + + S GE++ +F++ +D L KLRP
Sbjct: 127 ERKLILVLDLDRTLLNSARLDAFSVGEEWFGFTPDTGDKVDMDIFRLDSDNLGMLTKLRP 186
Query: 120 FVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGK-DRKNPD 178
FVR S+ +++L T+ YA+AA+ LLD + YF R+++R+D + + K+ D
Sbjct: 187 FVR------GSMFEMHLYTLGNLVYAKAAIHLLDPNGVYFGGRVVSRDDESTQGGTKSLD 240
Query: 179 LVRGQERGIVI----LDDTESVWSDHTENLIVLGKYVYFRD--KELNGDHKSYSETLTDE 232
++ G + + LDDT+ W +H +NLI+ +Y YF ++ D S +E DE
Sbjct: 241 VIPGADPVAAVILDALDDTDVAWPEHQDNLILTNRYRYFASTCRKSRHDIPSLAELRRDE 300
Query: 233 -SENEEALANVLRVLKTIHRLFFDS-VCGDVRTYLPKVRSEFSRDVL----YFSAIFRDC 286
E+ +LA L VLK +H FFD DVR + ++R + R Y D
Sbjct: 301 KGEHGGSLAVALGVLKRVHDAFFDGRPHADVREVIAELRGQVLRGCTVAFSYLEQRMEDS 360
Query: 287 -----LW--------------------------AEQEEKFLVQEKKFLVHPRWIDAYYFL 315
LW Q+ ++ + KFLV+P WI A F
Sbjct: 361 PDDTRLWTLAERLGAVCRKDVDETVTHVVAEDPGTQKAQWAREHGKFLVNPEWIKAASFR 420
Query: 316 W-RRRPED 322
W R+ P++
Sbjct: 421 WCRQDPQE 428
>gi|413920930|gb|AFW60862.1| hypothetical protein ZEAMMB73_799152, partial [Zea mays]
Length = 1234
Score = 112 bits (280), Expect = 2e-22, Method: Composition-based stats.
Identities = 98/326 (30%), Positives = 152/326 (46%), Gaps = 58/326 (17%)
Query: 57 RGLRYSEQEE----RKLQLVLNLDHTLLH-CRNIKSLSSGEKYLKKQIHSFIG----SLF 107
R R +EQ + RKL LVL+LDHTLL+ + I+ E+ L+K+ L+
Sbjct: 908 RARRITEQHKMFSARKLCLVLDLDHTLLNSAKFIEVEPIHEEMLRKKEEQDRTLPERHLY 967
Query: 108 QMANDKL-VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAR 166
+ + + KLRP + FL++AS+L +++L TM + YA K+LD F+ R+I+R
Sbjct: 968 RFHHMNMWTKLRPGIWNFLQKASNLFELHLYTMGNKLYATEMAKVLDPTGTLFAGRVISR 1027
Query: 167 ED----FNGKDR----KNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF--RDK 216
D F+ +R K+ D V G E +VI+DD+ VW + NLIV+ +Y YF +
Sbjct: 1028 GDDGDPFDSDERVPKSKDLDGVLGMESAVVIIDDSVRVWPHNRHNLIVVERYTYFPCSRR 1087
Query: 217 ELNGDHKSYSETLTDESENEEALANVLRVLKTIHRLFFDSVC---GDVRTYL-PKVRSEF 272
+ S E DE + LA+ L V++ IH FF DVR+ L + R
Sbjct: 1088 QFGLPGPSLLEIDRDERPEDGTLASSLAVIERIHHNFFSHPNLNEADVRSILASEQRRIL 1147
Query: 273 SRDVLYFSAIFR--------DCLWAEQEE--------------------------KFLVQ 298
+ + FS +F LW E+ + +
Sbjct: 1148 TGCRIVFSRVFPVGDASPHLHPLWQTAEQFGAVCTNLVDDRVTHIVANSPGTDKVNWALS 1207
Query: 299 EKKFLVHPRWIDAYYFLWRRRPEDDY 324
+ KF+VHP W++A L+RR E D+
Sbjct: 1208 KGKFVVHPGWVEASALLYRRANEHDF 1233
>gi|384251210|gb|EIE24688.1| carboxyl-terminal phosphatase-like 4 [Coccomyxa subellipsoidea
C-169]
Length = 439
Score = 111 bits (278), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 80/216 (37%), Positives = 116/216 (53%), Gaps = 19/216 (8%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYL----KKQIHSFIGSLFQMANDKL-VKLRPFV 121
RKL LVL+LDHTLL+ E+ L + + SL+ + + +L KLRP+V
Sbjct: 78 RKLLLVLDLDHTLLNSTRFDEAVGFEEQLAAIQRARPEDQPVSLYHLEHMRLWTKLRPYV 137
Query: 122 RTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVR 181
R FLE+A + ++++ T YA +LLD ++F+ RII++ D K K+ D+V
Sbjct: 138 REFLEKAHEVSEMHIYTHGNAEYAIEMARLLDPTKRFFAERIISQGDSTVKHVKDLDVVL 197
Query: 182 GQERGIVILDDTESVWSDHTENLIVLGKYVYF----RDKELNGDHKSYSETLTDESENEE 237
G E +VILDDT VW H +NL+ + +YV+F R +LN +S E DE E
Sbjct: 198 GAETAVVILDDTAGVWPSHQQNLLQVERYVFFPACARRFQLNV--QSLLELGRDEDEQHG 255
Query: 238 ALANVLRVLKTIHRLFFDSVCG----DVRTYLPKVR 269
LA+ LRV H FF + G DVR +L +R
Sbjct: 256 MLASALRV----HSRFFGASAGGGQQDVRQHLQALR 287
>gi|356523718|ref|XP_003530482.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
3-like [Glycine max]
Length = 1244
Score = 111 bits (278), Expect = 4e-22, Method: Composition-based stats.
Identities = 99/328 (30%), Positives = 148/328 (45%), Gaps = 62/328 (18%)
Query: 57 RGLRYSEQEE----RKLQLVLNLDHTLLHCRNI--------KSLSSGEKYLKKQIHSFIG 104
R R EQ + RKL LVL+LDHTLL+ + L E+ +++ H +
Sbjct: 915 RARRIEEQNKMFAARKLCLVLDLDHTLLNSAKFVEVDPLHDEILRKKEEQDREKPHRHLF 974
Query: 105 SLFQMANDKLVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRII 164
M KLRP + FLE+AS L +++L TM + YA K+LD F+ R+I
Sbjct: 975 RFPHMG--MWTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVI 1032
Query: 165 ARED----FNGKDR--KNPDL--VRGQERGIVILDDTESVWSDHTENLIVLGKYVYF--R 214
+R D +G++R K+ DL V G E +VI+DD+ VW + NLIV+ +Y YF
Sbjct: 1033 SRGDDTDSVDGEERVPKSKDLEGVLGMESSVVIIDDSVRVWPHNKLNLIVVERYTYFPCS 1092
Query: 215 DKELNGDHKSYSETLTDESENEEALANVLRVLKTIHRLFFDSVC---GDVRTYL-PKVRS 270
++ S E DE LA+ L V++ IH++FF S DVR L + R
Sbjct: 1093 RRQFGLPGPSLLEIDHDERPEAGTLASSLAVIEKIHQIFFASQSLEEVDVRNILASEQRK 1152
Query: 271 EFSRDVLYFSAIFR--------DCLWAEQEE--------------------------KFL 296
+ + FS +F LW E+ +
Sbjct: 1153 ILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTNQIDEQVTHVVANSPGTDKVNWA 1212
Query: 297 VQEKKFLVHPRWIDAYYFLWRRRPEDDY 324
+ +F+VHP W++A L+RR E D+
Sbjct: 1213 LNNGRFVVHPGWVEASALLYRRANEQDF 1240
>gi|297830092|ref|XP_002882928.1| hypothetical protein ARALYDRAFT_897808 [Arabidopsis lyrata subsp.
lyrata]
gi|297328768|gb|EFH59187.1| hypothetical protein ARALYDRAFT_897808 [Arabidopsis lyrata subsp.
lyrata]
Length = 295
Score = 110 bits (275), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 96/293 (32%), Positives = 137/293 (46%), Gaps = 51/293 (17%)
Query: 25 LSCAHTTVRDSRCIFCSQAM---NDSFGLSFDYMLRGLRYSEQ--------------EER 67
+SC H + + C C ++ ND F F+ + GL S + E++
Sbjct: 1 MSCNHRIIVEGICRECRSSVTQPNDDFQ-HFNNLANGLSLSHEFVGSLKSHVSKNSLEKK 59
Query: 68 KLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQM---ANDKLVKLRPFVRTF 124
KL LVLNL T + LS+ EKYLK +++S L+Q +D L+KLRPFV F
Sbjct: 60 KLHLVLNLYGTFFDSQAFPCLSNKEKYLKGKVNS-RNDLWQTRIRGHDVLIKLRPFVHEF 118
Query: 125 LEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVRGQE 184
L +A+ L +++ T+ YA+ +KLLD YF +RII+ + K D V E
Sbjct: 119 LREANKLFILHVTTLCIPEYADFVLKLLDPHQLYFGNRIISLSK-HVIWEKTLDQVLVGE 177
Query: 185 RGIVILDDTESVWS-DHTENLIVLGKYVYFR---------------------------DK 216
R ++ILDD VWS ++ NL+ + Y YF+ D
Sbjct: 178 REVIILDDRYDVWSPENRSNLLQITTYSYFKATKKRNSIDGGMFQNLFKYFLKIFSRDDD 237
Query: 217 ELNGDHKSYSETLTDESENEEALANVLRVLKTIHRLFFDSVCGDVRTYLPKVR 269
L D SYSE DES ++ ALAN LR L IH+ FF+ + Y VR
Sbjct: 238 NLLSDSNSYSEERKDESVDDGALANALRFLFKIHQDFFNHHYSENDIYKRDVR 290
>gi|115485681|ref|NP_001067984.1| Os11g0521900 [Oryza sativa Japonica Group]
gi|113645206|dbj|BAF28347.1| Os11g0521900 [Oryza sativa Japonica Group]
Length = 664
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 106/335 (31%), Positives = 148/335 (44%), Gaps = 76/335 (22%)
Query: 57 RGLRYSEQEE----RKLQLVLNLDHTLLHCRNIKSLSS--GEKYLKKQI--------HSF 102
R R EQ + RKL LVL+LDHTLL+ + GE KK+ H F
Sbjct: 336 RARRIKEQHKMFAARKLCLVLDLDHTLLNSAKFIEVDHIHGEILRKKEEQDRERAERHLF 395
Query: 103 IGSLFQMANDKLVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSR 162
+ M KLRP + FLE+AS L +++L TM + YA K+LD F+ R
Sbjct: 396 CFNHMGM----WTKLRPGIWNFLEKASKLYELHLYTMGNKVYATEMAKVLDPTGTLFAGR 451
Query: 163 IIARED----FNGKDR----KNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF- 213
+I+R D F+ +R K+ D V G E +VI+DD+ VW + NLIV+ +Y YF
Sbjct: 452 VISRGDDGDPFDSDERVPKSKDLDGVLGMESAVVIIDDSVRVWPHNKHNLIVVERYTYFP 511
Query: 214 ---RDKELNGDHKSYSETLTDESENEEALANVLRVLKTIHRLFFDSVC---GDVRTYLPK 267
R L G S E DE + LA+ L V++ IH+ FF DVR+ L
Sbjct: 512 CSRRQFGLPG--PSLLEIDRDERPEDGTLASSLAVIERIHKNFFSHPNLNDADVRSIL-- 567
Query: 268 VRSEFSRDV----LYFSAIFR--------DCLWAEQEE---------------------- 293
SE R + + FS IF LW E+
Sbjct: 568 -ASEQQRILGGCRIVFSRIFPVGEANPHMHPLWQTAEQFGAVCTNQIDDRVTHVVANSLG 626
Query: 294 ----KFLVQEKKFLVHPRWIDAYYFLWRRRPEDDY 324
+ + +F+VHP W++A L+RR E D+
Sbjct: 627 TDKVNWALSTGRFVVHPGWVEASALLYRRASELDF 661
>gi|357478637|ref|XP_003609604.1| RNA polymerase II C-terminal domain phosphatase-like protein
[Medicago truncatula]
gi|355510659|gb|AES91801.1| RNA polymerase II C-terminal domain phosphatase-like protein
[Medicago truncatula]
Length = 1064
Score = 109 bits (273), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 102/323 (31%), Positives = 151/323 (46%), Gaps = 57/323 (17%)
Query: 57 RGLRYSEQEE----RKLQLVLNLDHTLLH-CRNIKSLSSGEKYLKKQIHSFIGS----LF 107
R R EQ + RKL LVL++DHTLL+ + ++ +K L+K+ G LF
Sbjct: 731 RARRLEEQNKMFAARKLCLVLDIDHTLLNSAKFVEVDPEHDKILRKKEKQERGKPRRHLF 790
Query: 108 QMANDKL-VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAR 166
++ + + KLRP V FLE+AS L +++L TM + YA K+LD + F+ R+I+R
Sbjct: 791 RLPHMGMWTKLRPGVWNFLEKASKLFEMHLYTMGNKLYATEMAKVLDPNGVLFAGRVISR 850
Query: 167 -EDFNGKDRKNPDL--VRGQERGIVILDDTESVWSDHTENLIVLGKYVYF----RDKELN 219
+D D K DL V G E +VI+DD+ VW + NLI + +Y+YF R L+
Sbjct: 851 GDDPETVDIKCKDLEGVLGLESSVVIIDDSPRVWPHNQLNLITVERYIYFLCSRRQFGLS 910
Query: 220 GDHKSYSETLTDESENEEALANVLRVLKTIHRLFFDSVC---GDVRTYLP-KVRSEFSRD 275
G S E DE LA+ L V++ IH+ FF S DVR L + R
Sbjct: 911 G--PSLFEIDHDERPGAGTLASSLGVIERIHQNFFASQSLEEMDVRNILASEQRKILGGC 968
Query: 276 VLYFSAIFR--------DCLWAEQEE--------------------------KFLVQEKK 301
+ FS +F LW E+ + + K
Sbjct: 969 RIVFSGVFPVGETNPHLHPLWRTAEQFGASCTNKVDPQVTHVVAQSPGTDKVNWGISNGK 1028
Query: 302 FLVHPRWIDAYYFLWRRRPEDDY 324
F+V+P W++A L+RR E D+
Sbjct: 1029 FVVYPNWVEASTLLYRRMNEQDF 1051
>gi|356567192|ref|XP_003551805.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
3-like [Glycine max]
Length = 1221
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 102/330 (30%), Positives = 149/330 (45%), Gaps = 66/330 (20%)
Query: 57 RGLRYSEQEE----RKLQLVLNLDHTLLHCRNI--------KSLSSGEKYLKKQIHSFIG 104
R R EQ + RKL LVL+LDHTLL+ + L E+ +++ H +
Sbjct: 892 RARRIEEQNKMFAARKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKPHRHLF 951
Query: 105 SLFQMANDKLVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRII 164
M KLRP + FLE+AS L +++L TM + YA K+LD F+ R+I
Sbjct: 952 RFPHMG--MWTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGLLFAGRVI 1009
Query: 165 ARED----FNGKDR--KNPDL--VRGQERGIVILDDTESVWSDHTENLIVLGKYVYF--- 213
+R D +G++R K+ DL V G E +VI+DD+ VW + NLIV+ +Y YF
Sbjct: 1010 SRGDDTDSVDGEERAPKSKDLEGVLGMESSVVIIDDSVRVWPHNKLNLIVVERYTYFPCS 1069
Query: 214 -RDKELNGDHKSYSETLTDESENEEALANVLRVLKTIHRLFFDSVC---GDVRTYLP-KV 268
R L G S E DE LA+ L V++ IH++FF S DVR L +
Sbjct: 1070 RRQFGLPG--PSLLEIDHDERPEAGTLASSLAVIEKIHQIFFASRSLEEVDVRNILASEQ 1127
Query: 269 RSEFSRDVLYFSAIFR--------DCLWAEQEE--------------------------K 294
R + + FS +F LW E+
Sbjct: 1128 RKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAFCTNQIDEQVTHVVANSPGTDKVN 1187
Query: 295 FLVQEKKFLVHPRWIDAYYFLWRRRPEDDY 324
+ + +F+VHP W++A L+RR E D+
Sbjct: 1188 WALNNGRFVVHPGWVEASALLYRRANEQDF 1217
>gi|308802003|ref|XP_003078315.1| CTD phosphatase-like protein 3 (ISS) [Ostreococcus tauri]
gi|116056766|emb|CAL53055.1| CTD phosphatase-like protein 3 (ISS) [Ostreococcus tauri]
Length = 480
Score = 108 bits (270), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 92/297 (30%), Positives = 128/297 (43%), Gaps = 55/297 (18%)
Query: 26 SCAHTTVRDSRCIFCSQ--------------------AMNDSFGLSFDYMLRGLRYS--- 62
+CAH C+ C + A+ F S Y+ GL S
Sbjct: 81 TCAHPAFMFEICVVCGERKRDDGGGSKGEMRSGSGEEALRGHFTTSMRYIHEGLTLSNAE 140
Query: 63 ------EQEER-----KLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQ-IHSFIGSLFQMA 110
E++ER KL L+L+LDHTLL+ K L+ + L Q I L +
Sbjct: 141 LEKAKREEKERVLKDGKLTLILDLDHTLLNSAQFKELTQEQHDLLHQCIAQEANGLAERE 200
Query: 111 NDKL---------VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSS 161
L KLRP V FLE+ S + Y+ TM + YA+ VKL+D + K F
Sbjct: 201 RPMLYCLRHMGFFTKLRPHVFEFLEEVSQICQPYVYTMGDKAYAKEMVKLIDPEGKIFHG 260
Query: 162 RIIAREDFNGKDRKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRDKELNGD 221
R+I+ D K+ D+V G E VI+DDTE VW + NLI L +Y +F +
Sbjct: 261 RVISNNDSTSSHVKDLDIVLGGETSAVIVDDTERVWPANHGNLIRLDRYHFFPSSAASFQ 320
Query: 222 HKSYS---ETLTDESE-----NEEALANVLRVLKTIHRLFFDSVC---GDVRTYLPK 267
K S ++ DE E L +VL V+++ HR +F DVRT L K
Sbjct: 321 QKGQSVMERSMVDEGELGSMGARAVLLDVLAVIQSAHRSYFKHASIEEPDVRTLLVK 377
>gi|357502711|ref|XP_003621644.1| RNA polymerase II C-terminal domain phosphatase-like protein
[Medicago truncatula]
gi|355496659|gb|AES77862.1| RNA polymerase II C-terminal domain phosphatase-like protein
[Medicago truncatula]
Length = 1213
Score = 108 bits (269), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 103/327 (31%), Positives = 139/327 (42%), Gaps = 65/327 (19%)
Query: 57 RGLRYSEQEE----RKLQLVLNLDHTLLHCRNIKSLSS----------GEKYLKKQIHSF 102
R R EQ++ RKL LVL+LDHTLL+ + E K Q H F
Sbjct: 887 RSRRLEEQKKMFAARKLCLVLDLDHTLLNSAKFVEVDPVHDEMLRKKEQEDREKPQRHLF 946
Query: 103 IGSLFQMANDKLVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSR 162
M KLRP V FLE+A L +++L TM + YA K+LD F+ R
Sbjct: 947 RFPHMGM----WTKLRPGVWNFLEKAGKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGR 1002
Query: 163 IIAR-EDFNGKDRKNPDL--VRGQERGIVILDDTESVWSDHTENLIVLGKYVYF----RD 215
+I+R +D D K+ DL V G E +VI+DD+ VW + NLIV+ +Y YF R
Sbjct: 1003 VISRGDDAETADTKSKDLEGVLGMESSVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQ 1062
Query: 216 KELNGDHKSYSETLTDESENEEALANVLRVLKTIHRLFFDS-----------VCGDVRTY 264
L G S E DE LA+ L V++ IH+ FF S + + R
Sbjct: 1063 FGLPG--PSLLEIDHDERPESGTLASSLGVIERIHQNFFASQSLEEVDVRNILASEQRKI 1120
Query: 265 LPKVRSEFSRDVLYFSA-IFRDCLWAEQEE--------------------------KFLV 297
L R FSR A LW E+ + +
Sbjct: 1121 LDGCRIVFSRMFPVGDANPHLHPLWQTAEQFGASCTNQIDDQVTHVVAHSPGTDKVNWAI 1180
Query: 298 QEKKFLVHPRWIDAYYFLWRRRPEDDY 324
KF+VHP W++A L+RR E D+
Sbjct: 1181 ANGKFVVHPGWVEASALLYRRANEQDF 1207
>gi|326532556|dbj|BAK05207.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 891
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 106/342 (30%), Positives = 149/342 (43%), Gaps = 83/342 (24%)
Query: 57 RGLRYSEQ----EERKLQLVLNLDHTLLH-CRNIKSLSSGEKYL---------KKQIHSF 102
R R EQ RKL LVL+LDHTLL+ + I+ E+ L + + H F
Sbjct: 556 RARRIMEQHTMFSSRKLCLVLDLDHTLLNSAKFIEVDPIHEEILWKKEEQDRERSERHLF 615
Query: 103 IGSLFQMANDKLVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSR 162
QM KLRP + FLE+AS L +++L TM + YA K+LD F+ R
Sbjct: 616 RFHHMQM----WTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPSGTLFAGR 671
Query: 163 IIAR-----------EDFNGKDR----KNPDLVRGQERGIVILDDTESVWSDHTENLIVL 207
+I+R + F+ DR K+ D V G E +VI+DD+ VW + N+IV+
Sbjct: 672 VISRGGDGISRGGDGDTFDSDDRVPKSKDLDGVLGMESAVVIIDDSVRVWPHNKNNMIVV 731
Query: 208 GKYVYF----RDKELNGDHKSYSETLTDESENEEALANVLRVLKTIHRLFFDSVC---GD 260
+Y YF R L G S E DE + LA+ L V+ IH+ FF D
Sbjct: 732 ERYTYFPCSRRQFGLPG--PSLLEIDRDERPEDGTLASSLAVIGRIHQNFFSHPNLNDAD 789
Query: 261 VRTYLPKVRSEFSRDV----LYFSAIFR--------DCLWAEQEE--------------- 293
VR+ L SE R + + FS IF LW E+
Sbjct: 790 VRSIL---ASEQRRILAGCRIVFSRIFPVGEANPQLHPLWQTAEQFGAVCTNQIDDRVTH 846
Query: 294 -----------KFLVQEKKFLVHPRWIDAYYFLWRRRPEDDY 324
+ +Q +F+VHP W++A L+RR E D+
Sbjct: 847 VVANSLGTDKVNWALQTGRFVVHPGWVEASALLYRRANEHDF 888
>gi|357156660|ref|XP_003577532.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
3-like [Brachypodium distachyon]
Length = 1259
Score = 106 bits (264), Expect = 2e-20, Method: Composition-based stats.
Identities = 96/336 (28%), Positives = 145/336 (43%), Gaps = 71/336 (21%)
Query: 57 RGLRYSEQEE----RKLQLVLNLDHTLLHCRNI--------KSLSSGEKYLKKQIHSFIG 104
R R EQ++ RKL LVL+LDHTLL+ + L E+ +++ +
Sbjct: 924 RARRIMEQQKMFSARKLCLVLDLDHTLLNSAKFLEVDPIHEEILRKKEEQDRERPERHLF 983
Query: 105 SLFQMANDKLVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRII 164
L M+ KLRP + FLE+AS L +++L TM + YA K+LD F R+I
Sbjct: 984 RLHHMS--MWTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPTGALFEGRVI 1041
Query: 165 AREDFNGKDR----------------KNPDLVRGQERGIVILDDTESVWSDHTENLIVLG 208
+R +G R K+ D V G E +VI+DD+ VW + N+IV+
Sbjct: 1042 SRGG-DGTSRGGDGDSFDSDDRVPKSKDLDGVLGMESAVVIIDDSVRVWPHNKNNMIVVE 1100
Query: 209 KYVYF--RDKELNGDHKSYSETLTDESENEEALANVLRVLKTIHRLFFDSVC---GDVRT 263
+Y YF ++ S E DE + LA+ L V+ IH+ FF DVR+
Sbjct: 1101 RYTYFPCSRRQFGLPGPSLLEIDRDERPEDGTLASSLAVIGRIHQNFFSHPNLNDADVRS 1160
Query: 264 YL-PKVRSEFSRDVLYFSAIFR--------DCLWAEQEE--------------------- 293
L + R + + FS IF LW E+
Sbjct: 1161 ILASEQRRILAGCRIVFSRIFPVGEANPHLHPLWQSAEQFGAVCTNQIDDRVTHVVANSL 1220
Query: 294 -----KFLVQEKKFLVHPRWIDAYYFLWRRRPEDDY 324
+ +Q +++VHP W++A L+RR E D+
Sbjct: 1221 GTDKVNWALQTGRYVVHPGWVEASALLYRRASEHDF 1256
>gi|224053553|ref|XP_002297869.1| predicted protein [Populus trichocarpa]
gi|222845127|gb|EEE82674.1| predicted protein [Populus trichocarpa]
Length = 1117
Score = 103 bits (258), Expect = 9e-20, Method: Composition-based stats.
Identities = 97/326 (29%), Positives = 147/326 (45%), Gaps = 58/326 (17%)
Query: 57 RGLRYSEQEE----RKLQLVLNLDHTLLH-CRNIKSLSSGEKYLKKQ----IHSFIGSLF 107
R R EQ++ RKL LVL+LDHTLL+ + I S S ++ L+K+ +F
Sbjct: 788 RARRLEEQKKMFAARKLCLVLDLDHTLLNSAKAILSSSLHDEILRKKEEQDREKPYRHIF 847
Query: 108 QMANDKL-VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAR 166
++ + + KLRP + FLE+AS L +++L TM + YA K+LD F+ R+I+R
Sbjct: 848 RIPHMGMWTKLRPGIWNFLEKASKLFELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISR 907
Query: 167 EDFNGKDR------KNPDL--VRGQERGIVILDDTESVWSDHTENLIVLGKYVYF--RDK 216
D K+ DL V G E G+VI+DD+ VW + NLIV+ +Y+YF +
Sbjct: 908 GDDGDPFDGDERVPKSKDLEGVLGMESGVVIIDDSVRVWPHNKLNLIVVERYIYFPCSRR 967
Query: 217 ELNGDHKSYSETLTDESENEEALANVLRVLKTIHRLFFDSVC---GDVRTYL-PKVRSEF 272
+ S E DE + LA V++ IH+ FF DVR L + R
Sbjct: 968 QFGLPGPSLLEIDHDERPEDGTLACSFAVIEKIHQNFFTHRSLDEADVRNILASEQRKIL 1027
Query: 273 SRDVLYFSAIFR--------DCLWAEQEE--------------------------KFLVQ 298
+ FS +F LW E+ + +
Sbjct: 1028 GGCRILFSRVFPVGEVNPHLHPLWQMAEQFGAVCTNQIDEQVTHVVANSLGTDKVNWALS 1087
Query: 299 EKKFLVHPRWIDAYYFLWRRRPEDDY 324
+ +VHP W++A L+RR E D+
Sbjct: 1088 TGRIVVHPGWVEASALLYRRANEQDF 1113
>gi|449487451|ref|XP_004157633.1| PREDICTED: LOW QUALITY PROTEIN: RNA polymerase II C-terminal domain
phosphatase-like 3-like [Cucumis sativus]
Length = 1249
Score = 103 bits (258), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 104/335 (31%), Positives = 147/335 (43%), Gaps = 76/335 (22%)
Query: 57 RGLRYSEQEE----RKLQLVLNLDHTLLHCRNIKSLSSGEKYL----------KKQIHSF 102
R R EQ++ RKL LVL+LDHTLL+ + + K Q H
Sbjct: 920 RARRIEEQKKMFAARKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKAQRH-- 977
Query: 103 IGSLFQMANDKL-VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSS 161
LF+ + + KLRP V FLE+AS L +++L TM + YA K+LD F+
Sbjct: 978 ---LFRFPHMGMWTKLRPGVWNFLEKASELYELHLYTMGNKLYATEMAKVLDPKGVLFAG 1034
Query: 162 RIIAREDFNGKDR------KNPDL--VRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
R+I+R D K+ DL V G E G+VI+DD+ VW + NLIV+ +Y YF
Sbjct: 1035 RVISRGDDGDPLDGDDRVPKSKDLEGVLGMESGVVIIDDSIRVWPHNKMNLIVVERYTYF 1094
Query: 214 ----RDKELNGDHKSYSETLTDESENEEALANVLRVLKTIHRLFF-----DSVCGDVRTY 264
R L G S E DE + LA+ L V++ IH+ FF D V DVRT
Sbjct: 1095 PCSRRQFGLLG--PSLLEIDHDERPEDGTLASSLGVIQRIHQXFFSNPELDQV--DVRTI 1150
Query: 265 LPKVRSEFSRDV-LYFSAIFR--------DCLWAEQEE---------------------- 293
L + + + FS +F LW E+
Sbjct: 1151 LSAEQQKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAQCTNQIDEQVTHVVANSLG 1210
Query: 294 ----KFLVQEKKFLVHPRWIDAYYFLWRRRPEDDY 324
+ + +F+VHP W++A L+RR E D+
Sbjct: 1211 TDKVNWALSTGRFVVHPGWVEASALLYRRATEQDF 1245
>gi|449445782|ref|XP_004140651.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
3-like [Cucumis sativus]
Length = 1249
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 104/335 (31%), Positives = 147/335 (43%), Gaps = 76/335 (22%)
Query: 57 RGLRYSEQEE----RKLQLVLNLDHTLLHCRNIKSLSSGEKYL----------KKQIHSF 102
R R EQ++ RKL LVL+LDHTLL+ + + K Q H
Sbjct: 920 RARRIEEQKKMFAARKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKAQRH-- 977
Query: 103 IGSLFQMANDKL-VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSS 161
LF+ + + KLRP V FLE+AS L +++L TM + YA K+LD F+
Sbjct: 978 ---LFRFPHMGMWTKLRPGVWNFLEKASELYELHLYTMGNKLYATEMAKVLDPKGVLFAG 1034
Query: 162 RIIAREDFNGKDR------KNPDL--VRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
R+I+R D K+ DL V G E G+VI+DD+ VW + NLIV+ +Y YF
Sbjct: 1035 RVISRGDDGDPLDGDDRVPKSKDLEGVLGMESGVVIIDDSIRVWPHNKMNLIVVERYTYF 1094
Query: 214 ----RDKELNGDHKSYSETLTDESENEEALANVLRVLKTIHRLFF-----DSVCGDVRTY 264
R L G S E DE + LA+ L V++ IH+ FF D V DVRT
Sbjct: 1095 PCSRRQFGLLG--PSLLEIDHDERPEDGTLASSLGVIQRIHQSFFSNPELDQV--DVRTI 1150
Query: 265 LPKVRSEFSRDV-LYFSAIFR--------DCLWAEQEE---------------------- 293
L + + + FS +F LW E+
Sbjct: 1151 LSAEQQKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAQCTNQIDEQVTHVVANSLG 1210
Query: 294 ----KFLVQEKKFLVHPRWIDAYYFLWRRRPEDDY 324
+ + +F+VHP W++A L+RR E D+
Sbjct: 1211 TDKVNWALSTGRFVVHPGWVEASALLYRRATEQDF 1245
>gi|303276827|ref|XP_003057707.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226460364|gb|EEH57658.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 692
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 73/206 (35%), Positives = 107/206 (51%), Gaps = 18/206 (8%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGE--------KYLKKQIHSFIGSLFQMANDKL-VKL 117
R+L LVL+LDHTLL+ + +S G + L+ S +L ++ + L KL
Sbjct: 303 RRLTLVLDLDHTLLNSESFESKDGGRLQRGLLEIERLESTKDSNDRTLHRLNHIGLWTKL 362
Query: 118 RPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFN--GKDRK 175
RP V+TFL +AS++ +I++ TM ++ YA++ +LLD +I F+ G +
Sbjct: 363 RPGVQTFLHKASAMFEIHISTMGSQPYADSIRRLLDPCRNVIKGSVIGLGGFDEFGAFKS 422
Query: 176 NP-----DLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRD--KELNGDHKSYSET 228
P ++ G E VILDDT VW+ ++ENLIV +Y+YF K S E
Sbjct: 423 PPQKKLEGVLAGTEPAAVILDDTAEVWTGYSENLIVCERYMYFPSACKNFGVVGPSLLER 482
Query: 229 LTDESENEEALANVLRVLKTIHRLFF 254
DESE LA VL VL +H FF
Sbjct: 483 GVDESEKSGTLATVLEVLTRVHSEFF 508
>gi|302768485|ref|XP_002967662.1| hypothetical protein SELMODRAFT_440109 [Selaginella moellendorffii]
gi|300164400|gb|EFJ31009.1| hypothetical protein SELMODRAFT_440109 [Selaginella moellendorffii]
Length = 762
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 97/337 (28%), Positives = 146/337 (43%), Gaps = 75/337 (22%)
Query: 57 RGLRYSEQE----ERKLQLVLNLDHTLLH-----------------CRNIKSLSSGEKYL 95
R R EQ+ E+KL LVL+LDHTLL+ I+ ++
Sbjct: 428 RQRRMDEQDKMLSEKKLCLVLDLDHTLLNSAKFMEIEQEWDRFLRATETIERNKDAKEGT 487
Query: 96 KKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLD 155
+++++ F M KLRP + FL +AS L +++L TM + YA KLLD
Sbjct: 488 RRELYRF--PYMSM----WTKLRPGIWRFLARASQLYELHLYTMGNKAYATEMAKLLDPT 541
Query: 156 SKYFSSRIIAREDFNGK---DRKNP-----DLVRGQERGIVILDDTESVWSDHTENLIVL 207
F+ R+I++ D D K P D V G E ++I+DD+ VW H +NLIV+
Sbjct: 542 GVLFAGRVISKGDDGDALYGDEKTPRSKDLDGVLGMESAVLIIDDSARVWPHHKDNLIVV 601
Query: 208 GKYVYF--RDKELNGDHKSYSETLTDESENEEALANVLRVLKTIHRLFFDSVCG---DVR 262
+Y+YF K+ S E DE E + LA++L V++ +H F+ D+R
Sbjct: 602 ERYMYFPCSRKQFGLPGPSLLEVGHDEREADGMLASILGVVERVHEEFYSRPLPKEVDIR 661
Query: 263 TYLPKV-RSEFSRDVLYFSAIFR--------DCLW--AEQ-------------------- 291
L V R + FS +F LW AEQ
Sbjct: 662 EVLSVVQRRILGGCKIIFSRVFPVEETQPQLHPLWRMAEQFGAVCTTRMEEDVTHVVAIS 721
Query: 292 ----EEKFLVQEKKFLVHPRWIDAYYFLWRRRPEDDY 324
+ + + +FLV P W++A L+RR E D+
Sbjct: 722 MGTDKSNWALATGRFLVRPAWVEASTVLYRRANERDF 758
>gi|359473774|ref|XP_002266931.2| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
3-like [Vitis vinifera]
Length = 1238
Score = 102 bits (253), Expect = 3e-19, Method: Composition-based stats.
Identities = 99/328 (30%), Positives = 145/328 (44%), Gaps = 62/328 (18%)
Query: 57 RGLRYSEQEE----RKLQLVLNLDHTLLHCRNIKSLSS--GEKYLKKQIHSFIGS---LF 107
R R EQ++ RKL LVL+LDHTLL+ + E KK+ S LF
Sbjct: 909 RARRIEEQKKMFSARKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKSQRHLF 968
Query: 108 QMANDKL-VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAR 166
+ + + KLRP + FLE+AS L +++L TM + YA K+LD F+ R+I++
Sbjct: 969 RFPHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVISK 1028
Query: 167 EDFNGKDR------KNPDL--VRGQERGIVILDDTESVWSDHTENLIVLGKYVYF--RDK 216
D K+ DL V G E +VI+DD+ VW + NLIV+ +Y YF +
Sbjct: 1029 GDDGDVLDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRR 1088
Query: 217 ELNGDHKSYSETLTDESENEEALANVLRVLKTIHRLFF-----DSVCGDVRTYL-PKVRS 270
+ S E DE + LA+ L V++ IH+ FF D V DVR L + R
Sbjct: 1089 QFGLPGPSLLEIDHDERPEDGTLASSLAVIERIHQSFFSNRALDEV--DVRNILASEQRK 1146
Query: 271 EFSRDVLYFSAIFR--------DCLWAEQEE--------------------------KFL 296
+ + FS +F LW E +
Sbjct: 1147 ILAGCRIVFSRVFPVGEANPHLHPLWQTAESFGAVCTNQIDEQVTHVVANSLGTDKVNWA 1206
Query: 297 VQEKKFLVHPRWIDAYYFLWRRRPEDDY 324
+ +F+VHP W++A L+RR E D+
Sbjct: 1207 LSTGRFVVHPGWVEASALLYRRANEQDF 1234
>gi|302761896|ref|XP_002964370.1| hypothetical protein SELMODRAFT_405568 [Selaginella moellendorffii]
gi|300168099|gb|EFJ34703.1| hypothetical protein SELMODRAFT_405568 [Selaginella moellendorffii]
Length = 766
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 97/337 (28%), Positives = 146/337 (43%), Gaps = 75/337 (22%)
Query: 57 RGLRYSEQE----ERKLQLVLNLDHTLLH-----------------CRNIKSLSSGEKYL 95
R R EQ+ E+KL LVL+LDHTLL+ I+ ++
Sbjct: 432 RQRRMDEQDKMLSEKKLCLVLDLDHTLLNSAKFMEIEQEWDRFLRATETIERNKDAKEGT 491
Query: 96 KKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLD 155
+++++ F M KLRP + FL +AS L +++L TM + YA KLLD
Sbjct: 492 RRELYRF--PYMSM----WTKLRPGIWRFLARASQLYELHLYTMGNKAYATEMAKLLDPT 545
Query: 156 SKYFSSRIIAREDFNGK---DRKNP-----DLVRGQERGIVILDDTESVWSDHTENLIVL 207
F+ R+I++ D D K P D V G E ++I+DD+ VW H +NLIV+
Sbjct: 546 GVLFAGRVISKGDDGDALYGDEKTPRSKDLDGVLGMESAVLIIDDSARVWPHHKDNLIVV 605
Query: 208 GKYVYF--RDKELNGDHKSYSETLTDESENEEALANVLRVLKTIHRLFFDSVCG---DVR 262
+Y+YF K+ S E DE E + LA++L V++ +H F+ D+R
Sbjct: 606 ERYMYFPCSRKQFGLPGPSLLEVGHDEREADGMLASILGVVERVHEEFYSRPLPKEVDIR 665
Query: 263 TYLPKV-RSEFSRDVLYFSAIFR--------DCLW--AEQ-------------------- 291
L V R + FS +F LW AEQ
Sbjct: 666 EVLSVVQRRILGGCKIIFSRVFPVEETQPQLHPLWRMAEQFGAVCTTRMEEDVTHVVAIS 725
Query: 292 ----EEKFLVQEKKFLVHPRWIDAYYFLWRRRPEDDY 324
+ + + +FLV P W++A L+RR E D+
Sbjct: 726 MGTDKSNWALATGRFLVRPAWVEASTVLYRRANERDF 762
>gi|30685744|ref|NP_180912.2| RNA polymerase II C-terminal domain phosphatase-like 3 [Arabidopsis
thaliana]
gi|238055326|sp|Q8LL04.2|CPL3_ARATH RecName: Full=RNA polymerase II C-terminal domain phosphatase-like 3;
Short=FCP-like 3; AltName: Full=Carboxyl-terminal
phosphatase-like 3; Short=AtCPL3; Short=CTD
phosphatase-like 3
gi|330253756|gb|AEC08850.1| RNA polymerase II C-terminal domain phosphatase-like 3 [Arabidopsis
thaliana]
Length = 1241
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 95/314 (30%), Positives = 141/314 (44%), Gaps = 58/314 (18%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSS-GEKYLKKQIHS----FIGSLFQMANDKL-VKLRPF 120
+KL LVL++DHTLL+ + S E+ L+K+ LF+ + + KLRP
Sbjct: 926 QKLSLVLDIDHTLLNSAKFNEVESRHEEILRKKEEQDREKPYRHLFRFLHMGMWTKLRPG 985
Query: 121 VRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDR------ 174
+ FLE+AS L +++L TM + YA KLLD F+ R+I++ D
Sbjct: 986 IWNFLEKASKLYELHLYTMGNKLYATEMAKLLDPKGVLFNGRVISKGDDGDPLDGDERVP 1045
Query: 175 KNPDL--VRGQERGIVILDDTESVWSDHTENLIVLGKYVYF----RDKELNGDHKSYSET 228
K+ DL V G E +VI+DD+ VW H NLI + +Y+YF R L G S E
Sbjct: 1046 KSKDLEGVMGMESSVVIIDDSVRVWPQHKMNLIAVERYLYFPCSRRQFGLLG--PSLLEL 1103
Query: 229 LTDESENEEALANVLRVLKTIHRLFF-----------DSVCGDVRTYLPKVRSEFSRDVL 277
DE E LA+ L V++ IH+ FF + + + R L R FSR +
Sbjct: 1104 DRDEVPEEGTLASSLAVIEKIHQNFFSHTSLDEVDVRNILASEQRKILAGCRIVFSRIIP 1163
Query: 278 YFSA-IFRDCLWAEQEE--------------------------KFLVQEKKFLVHPRWID 310
A LW E+ + + +F+VHP W++
Sbjct: 1164 VGEAKPHLHPLWQTAEQFGAVCTTQVDEHVTHVVTNSLGTDKVNWALTRGRFVVHPGWVE 1223
Query: 311 AYYFLWRRRPEDDY 324
A FL++R E+ Y
Sbjct: 1224 ASAFLYQRANENLY 1237
>gi|145344421|ref|XP_001416731.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144576957|gb|ABO95024.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 248
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 74/222 (33%), Positives = 105/222 (47%), Gaps = 26/222 (11%)
Query: 68 KLQLVLNLDHTLLHCRNIKSLSSGE------------KYLKKQIHSFIGSLFQMANDKLV 115
KL L+L+LDHTLL+ K L+ + + LK+ + L M
Sbjct: 29 KLTLILDLDHTLLNSTQFKELTQEQHDLLHECIAREAEGLKEGQRPMLYCLRHMGF--FT 86
Query: 116 KLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRK 175
KLRP V FLE S + Y+ TM + YA VKL+D + F R+I+ D K
Sbjct: 87 KLRPHVFEFLESVSKICQPYVYTMGDKPYAREMVKLIDPEGTIFHGRVISNNDSTSSHVK 146
Query: 176 NPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRDKELNGDHKSYS---ETLTDE 232
+ D+V G E +I+DDTE VW + NLI L +Y +F + K S ++ DE
Sbjct: 147 DLDIVLGGEASAIIVDDTERVWPQNQGNLIRLDRYHFFPGSASSFQQKGQSVMESSMVDE 206
Query: 233 SE-----NEEALANVLRVLKTIHRLFF----DSVCGDVRTYL 265
E + L +VL V++++HR FF D DVR L
Sbjct: 207 GELGSVGSRAVLLDVLAVIESVHRSFFKNTDDGEEPDVRKLL 248
>gi|297826809|ref|XP_002881287.1| hypothetical protein ARALYDRAFT_482300 [Arabidopsis lyrata subsp.
lyrata]
gi|297327126|gb|EFH57546.1| hypothetical protein ARALYDRAFT_482300 [Arabidopsis lyrata subsp.
lyrata]
Length = 1248
Score = 98.2 bits (243), Expect = 5e-18, Method: Composition-based stats.
Identities = 94/326 (28%), Positives = 145/326 (44%), Gaps = 58/326 (17%)
Query: 57 RGLRYSEQEE----RKLQLVLNLDHTLLHCRNIKSLS-SGEKYLKKQIHSF----IGSLF 107
R R EQ++ +KL LVL++DHTLL+ + E+ L+K+ LF
Sbjct: 919 RVRRLEEQKKMFASQKLSLVLDIDHTLLNSAKFNEVEFRHEEILRKKEEQDREKPYRHLF 978
Query: 108 QMANDKL-VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAR 166
+ + + KLRP + FLE+AS L +++L TM + YA KLLD F+ R+I++
Sbjct: 979 RFPHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKLLDPKGILFNGRVISK 1038
Query: 167 EDFNGKDR------KNPDL--VRGQERGIVILDDTESVWSDHTENLIVLGKYVYF--RDK 216
D K+ DL V G E +VI+DD+ VW + NLI + +Y+YF +
Sbjct: 1039 GDDGDPLDGDERVPKSKDLEGVMGMESSVVIIDDSVRVWPYNKMNLIAVERYLYFPRSRR 1098
Query: 217 ELNGDHKSYSETLTDESENEEALANVLRVLKTIHRLFF-----------DSVCGDVRTYL 265
+ S E DE E LA+ L V++ IH+ FF + + + R L
Sbjct: 1099 QFGLLGPSLLELDRDEVPEEGTLASSLAVIEKIHKNFFSHTSLDEVDVRNILASEQRKIL 1158
Query: 266 PKVRSEFSRDVLYFSAI-FRDCLWAEQEE--------------------------KFLVQ 298
R FSR + A LW E+ + +
Sbjct: 1159 AGCRIVFSRIIPVGEAKPHLHPLWQTAEQFGAVCTTQVDEHVTHVVTNSLGTDKVNWALT 1218
Query: 299 EKKFLVHPRWIDAYYFLWRRRPEDDY 324
+F+VHP W++A FL++R E+ Y
Sbjct: 1219 RGRFVVHPGWVEASAFLYQRANENLY 1244
>gi|22212705|gb|AAM94371.1|AF486633_1 CTD phosphatase-like 3 [Arabidopsis thaliana]
Length = 1241
Score = 97.8 bits (242), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 94/314 (29%), Positives = 140/314 (44%), Gaps = 58/314 (18%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSS-GEKYLKKQIHS----FIGSLFQMANDKL-VKLRPF 120
+KL LVL++DHTLL+ + S E+ L+K+ LF+ + + KLRP
Sbjct: 926 QKLSLVLDIDHTLLNSAKFNEVESRHEEILRKKEEQDREKPYRHLFRFLHMGMWTKLRPG 985
Query: 121 VRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDR------ 174
+ FLE+AS L +++L TM + Y KLLD F+ R+I++ D
Sbjct: 986 IWNFLEKASKLYELHLYTMGNKLYVTEMAKLLDPKGVLFNGRVISKGDDGDPLDGDERVP 1045
Query: 175 KNPDL--VRGQERGIVILDDTESVWSDHTENLIVLGKYVYF----RDKELNGDHKSYSET 228
K+ DL V G E +VI+DD+ VW H NLI + +Y+YF R L G S E
Sbjct: 1046 KSKDLEGVMGMESSVVIIDDSVRVWPQHKMNLIAVERYLYFPCSRRQFGLLG--PSLLEL 1103
Query: 229 LTDESENEEALANVLRVLKTIHRLFF-----------DSVCGDVRTYLPKVRSEFSRDVL 277
DE E LA+ L V++ IH+ FF + + + R L R FSR +
Sbjct: 1104 DRDEVPEEGTLASSLAVIEKIHQNFFSHTSLDEVDVRNILASEQRKILAGCRIVFSRIIP 1163
Query: 278 YFSA-IFRDCLWAEQEE--------------------------KFLVQEKKFLVHPRWID 310
A LW E+ + + +F+VHP W++
Sbjct: 1164 VGEAKPHLHPLWQTAEQFGAVCTTQVDEHVTHVVTNSLGTDKVNWALTRGRFVVHPGWVE 1223
Query: 311 AYYFLWRRRPEDDY 324
A FL++R E+ Y
Sbjct: 1224 ASAFLYQRANENLY 1237
>gi|125541462|gb|EAY87857.1| hypothetical protein OsI_09279 [Oryza sativa Indica Group]
Length = 390
Score = 97.4 bits (241), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 87/303 (28%), Positives = 137/303 (45%), Gaps = 51/303 (16%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLE 126
RKL LV++LDHTL++ +S G +Y+ + + A +RP++ E
Sbjct: 93 RKLILVVDLDHTLVNSTADYDIS-GTEYVNGLAELLVLGVHHQAQ----AVRPWLPARSE 147
Query: 127 QASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVRGQ--- 183
+ + D + T+ R YA A KLLD + YF RII+R++ DRK+ D+V G
Sbjct: 148 R--HVRDARVYTLGDRDYAAAVAKLLDPEGVYFGERIISRDESPQPDRKSLDVVFGSAPA 205
Query: 184 ----ERGIVILDDTESVWSDHTENLIVLGKYVYFRDKELN-GDHKSYSETLTDESENEEA 238
+VILDDT VW +++NLI + +Y YF + G + +L++ +E
Sbjct: 206 SAAERAAVVILDDTAEVWEGNSDNLIEMERYHYFASSCRDFGSPWECTHSLSERGVDESE 265
Query: 239 LANVLRVLKTIH----RLFFDSVCGDVRTYLPKVRSEFSRD--VLYFSAIFRD---CLWA 289
A LRVL+ +H S DVR + + R E R V + AI D +W
Sbjct: 266 RAAALRVLRRVHAGFFAGGGGSFVADVREVIRRTRREVLRGCTVAFTRAIASDDHHSVWR 325
Query: 290 EQEE-------------KFLVQEK-------------KFLVHPRWIDAYYFLWRRRPEDD 323
E+ +V KFLV+P WI+ +F W +P+++
Sbjct: 326 RTEQLGATCADDVGPAVTHVVATNPTTFKAVWAQVFGKFLVNPEWINTAHFRW-SKPKEE 384
Query: 324 YLP 326
+ P
Sbjct: 385 HFP 387
>gi|168040198|ref|XP_001772582.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162676137|gb|EDQ62624.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1881
Score = 97.4 bits (241), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 71/209 (33%), Positives = 107/209 (51%), Gaps = 28/209 (13%)
Query: 68 KLQLVLNLDHTLLHCRNI------------------KSLSSGEKYLKKQIHSFIGSLFQM 109
KL LVL+LDHTLL+ +S S+ + +K++++ F M
Sbjct: 1549 KLCLVLDLDHTLLNSAKFSEIEPEFEARLRQAENMERSRSTKDPNMKQELYRFP----HM 1604
Query: 110 ANDKLVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED- 168
+ KLRP + FL +AS L ++++ TM + YA KLLD FS R+I++ D
Sbjct: 1605 S--MWTKLRPGIWKFLAKASELYELHVYTMGNKAYATEMAKLLDPTGILFSGRVISKGDE 1662
Query: 169 FNGKDR-KNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRD--KELNGDHKSY 225
+G D+ K+ D V G E +VI+DD+ VW H ENLIV+ +Y+YF ++ S
Sbjct: 1663 VDGSDKSKDLDGVLGMESAVVIIDDSSRVWPHHRENLIVVERYMYFPSSRRQFGLLGPSL 1722
Query: 226 SETLTDESENEEALANVLRVLKTIHRLFF 254
E DE + L++ V+ IHR FF
Sbjct: 1723 LEVGHDERAVDGMLSSASGVIDRIHRNFF 1751
>gi|296088169|emb|CBI35661.3| unnamed protein product [Vitis vinifera]
Length = 1184
Score = 97.1 bits (240), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 101/335 (30%), Positives = 146/335 (43%), Gaps = 76/335 (22%)
Query: 57 RGLRYSEQEE----RKLQLVLNLDHTLLHCRNIKSLSSGEKYL----------KKQIHSF 102
R R EQ++ RKL LVL+LDHTLL+ + + K Q H
Sbjct: 855 RARRIEEQKKMFSARKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKSQRH-- 912
Query: 103 IGSLFQMANDKL-VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSS 161
LF+ + + KLRP + FLE+AS L +++L TM + YA K+LD F+
Sbjct: 913 ---LFRFPHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAG 969
Query: 162 RIIAREDFNG------KDRKNPDL--VRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
R+I++ D + K+ DL V G E +VI+DD+ VW + NLIV+ +Y YF
Sbjct: 970 RVISKGDDGDVLDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYF 1029
Query: 214 ----RDKELNGDHKSYSETLTDESENEEALANVLRVLKTIHRLFF-----DSVCGDVRTY 264
R L G S E DE + LA+ L V++ IH+ FF D V DVR
Sbjct: 1030 PCSRRQFGLPG--PSLLEIDHDERPEDGTLASSLAVIERIHQSFFSNRALDEV--DVRNI 1085
Query: 265 LP-KVRSEFSRDVLYFSAIFR--------DCLWAEQEE---------------------- 293
L + R + + FS +F LW E
Sbjct: 1086 LASEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAESFGAVCTNQIDEQVTHVVANSLG 1145
Query: 294 ----KFLVQEKKFLVHPRWIDAYYFLWRRRPEDDY 324
+ + +F+VHP W++A L+RR E D+
Sbjct: 1146 TDKVNWALSTGRFVVHPGWVEASALLYRRANEQDF 1180
>gi|302793512|ref|XP_002978521.1| hypothetical protein SELMODRAFT_418187 [Selaginella moellendorffii]
gi|300153870|gb|EFJ20507.1| hypothetical protein SELMODRAFT_418187 [Selaginella moellendorffii]
Length = 346
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 94/311 (30%), Positives = 147/311 (47%), Gaps = 57/311 (18%)
Query: 65 EERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGS-------LFQMANDKL-VK 116
+++KL LVL+LDHTLL+ + + E+ ++I+ + L ++ + ++ K
Sbjct: 34 QQQKLILVLDLDHTLLNSASFSKVDEEERLYLEKIYDWQEKAPKRRKLLHKVESLQVWTK 93
Query: 117 LRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKN 176
+RPF FLE+AS D+++ T YAE KLLD F I +R+ K K+
Sbjct: 94 IRPFAFKFLEEASKFFDLHIYTNGREIYAETMAKLLDPTGSLFKGHIFSRDHNCMKAMKD 153
Query: 177 PDLVRGQERGIVILDDTESVWS-DHTENLI-VLGKYVYFRDKE-LNGDHKSYSETL--TD 231
D V G E +I+DD++ VW H +NLI V +Y++FR L G +S S T D
Sbjct: 154 LDTVPGDESITLIVDDSDCVWPKKHHKNLIPVYDRYLFFRSSTGLFGLRESSSLTSKKKD 213
Query: 232 ESENEEALANVLRVLKTIHRLFF-DSVC--GDVRTYLPKV--------------RSEFSR 274
E + LA +L LK IH FF +S C GDVR + +V +S+ +
Sbjct: 214 EVATKATLAKLLEGLKRIHSEFFQESGCFAGDVRQTMREVKGHALSGCKIVICAKSQAAH 273
Query: 275 DVLYFS--------------AIFRDCLWAEQEEKFL---VQEKKFLVHPRWI-------- 309
++L+ S + + ++Q+ + L Q K+LV P WI
Sbjct: 274 ELLWDSCQELGAECVVDIDDTVTHVVVASKQQPQGLELSAQAGKYLVWPSWIHTAHYRCC 333
Query: 310 --DAYYFLWRR 318
D FLWR+
Sbjct: 334 RPDEAAFLWRK 344
>gi|255540897|ref|XP_002511513.1| conserved hypothetical protein [Ricinus communis]
gi|223550628|gb|EEF52115.1| conserved hypothetical protein [Ricinus communis]
Length = 161
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 54/151 (35%), Positives = 80/151 (52%), Gaps = 9/151 (5%)
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVRGQERGIVIL 190
+ ++Y+ T S++ A + LD ++YF+SR+I RE KNPD+V G ER +VIL
Sbjct: 1 MFEMYVYTSSSQVNARKMMSFLDPANRYFNSRLIVREGSTVMALKNPDVVLGHERAVVIL 60
Query: 191 DDTESVWSDHTENLIVLGKYVYFRDKELNGDHKSYSETLTDESENEEALANVLRVLKTIH 250
DD +S W H N+I + KY YF + + KS S + E+ +A LR+L+ IH
Sbjct: 61 DDRKSAWPMHKANVINVEKYNYFASNQSDPGSKSKSLAERKKDEHTRVMAAYLRILRKIH 120
Query: 251 RLFFD---------SVCGDVRTYLPKVRSEF 272
R FFD DVR + VR++
Sbjct: 121 RQFFDPKLEAIVTAGAARDVREVMRMVRAKI 151
>gi|168018017|ref|XP_001761543.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162687227|gb|EDQ73611.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1984
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 67/203 (33%), Positives = 105/203 (51%), Gaps = 16/203 (7%)
Query: 68 KLQLVLNLDHTLLHCRN-----------IKSLSSGEKYLKKQIHSFIGSLFQMANDKL-V 115
KL LVL+LDHTLL+ ++ + E+ + S L++ + +
Sbjct: 1503 KLCLVLDLDHTLLNSAKFSEIEPEWEARLRQAENMERSRALKDPSMKQELYRFPHMSMWT 1562
Query: 116 KLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED-FNGKDR 174
KLRP + FL +AS L ++++ TM + YA KLLD F+ R+I++ D +G D+
Sbjct: 1563 KLRPGIWKFLAKASELYELHVYTMGNKAYATEMAKLLDPTGTLFAGRVISKGDEVDGSDK 1622
Query: 175 -KNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRD--KELNGDHKSYSETLTD 231
K+ D V G E +VI+DD+ VW H ENLIV+ +Y+YF ++ S E D
Sbjct: 1623 SKDLDGVLGMESAVVIIDDSSRVWPHHRENLIVVERYMYFPSSRRQFGLLGPSLLEVGHD 1682
Query: 232 ESENEEALANVLRVLKTIHRLFF 254
E + L++ V+ IH+ FF
Sbjct: 1683 ERAADGMLSSASGVIDRIHKNFF 1705
>gi|302774062|ref|XP_002970448.1| hypothetical protein SELMODRAFT_411029 [Selaginella moellendorffii]
gi|300161964|gb|EFJ28578.1| hypothetical protein SELMODRAFT_411029 [Selaginella moellendorffii]
Length = 346
Score = 95.1 bits (235), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 93/311 (29%), Positives = 146/311 (46%), Gaps = 57/311 (18%)
Query: 65 EERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGS-------LFQMANDKL-VK 116
+++KL LVL+LDHTLL+ + + E+ ++I+ + L ++ + ++ K
Sbjct: 34 QQQKLILVLDLDHTLLNSASFSKVDEEERLYLEKIYDWQEKAPKRRKLLHKVESLQVWTK 93
Query: 117 LRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKN 176
+RPF FLE+AS D+++ T YAE KLLD F I +R+ K K+
Sbjct: 94 IRPFAFKFLEEASKFFDLHIYTNGREIYAETMAKLLDPTGSLFKGHIFSRDHNCMKAMKD 153
Query: 177 PDLVRGQERGIVILDDTESVWS-DHTENLI-VLGKYVYFRDKE-LNGDHKSYSETL--TD 231
D V G E +I+DD++ VW H +NLI V +Y +FR L G +S S T D
Sbjct: 154 LDTVPGDESITLIVDDSDYVWPKKHHKNLIPVYDQYRFFRSSTGLFGLRESSSLTSKKKD 213
Query: 232 ESENEEALANVLRVLKTIHRLFFDS---VCGDVRTYLPKV--------------RSEFSR 274
E + LA +L LK IH FF GDVR + +V +++ +
Sbjct: 214 EVATKATLAKLLEGLKRIHSEFFQEYGCFAGDVRQTMREVKGHALSGCKIVICAKTQAAH 273
Query: 275 DVLYFS--AIFRDC------------LWAEQEEKFL---VQEKKFLVHPRWI-------- 309
++L+ S A+ +C + ++Q+ + L Q K+LV P WI
Sbjct: 274 ELLWDSCQALGAECVVDIDDTVTHVVVASKQQPQGLELSAQAGKYLVWPSWIHTAHYRCC 333
Query: 310 --DAYYFLWRR 318
D FLWR+
Sbjct: 334 RPDEAAFLWRK 344
>gi|297830090|ref|XP_002882927.1| hypothetical protein ARALYDRAFT_897807 [Arabidopsis lyrata subsp.
lyrata]
gi|297328767|gb|EFH59186.1| hypothetical protein ARALYDRAFT_897807 [Arabidopsis lyrata subsp.
lyrata]
Length = 287
Score = 94.0 bits (232), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 84/268 (31%), Positives = 121/268 (45%), Gaps = 57/268 (21%)
Query: 26 SCAHTTVRDSRCIFCSQAMNDSFGLSFDYMLRGLRYSEQ--------------EERKLQL 71
+C H+ C C + +++S G F+Y+ +G +S + +RKL L
Sbjct: 44 NCDHSMSYRGYCSRCCRKVDESNGEFFNYISQGQHFSYKYIAYMKRQRFGIGYGQRKLHL 103
Query: 72 VLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASSL 131
V++L H LL +N LVKLRPF R FL +A+ L
Sbjct: 104 VVDLQHVLLD----------------------------SNGVLVKLRPFAREFLREANEL 135
Query: 132 VDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVRGQERGIVILD 191
IY T S A + +KLLD +F SR I + + +K+ + V +ERG+VILD
Sbjct: 136 FTIYAYTKSDPKQARSFIKLLDPLKIFFPSRFITIAE-EKRKKKSLEFVLAEERGVVILD 194
Query: 192 DTESVW-SDHTENLIVLGKYVYFRDKE---------LNGDHKSYSETLTDESENEE---- 237
W D NL+++ Y YF+ E +N +KS SE +E E E+
Sbjct: 195 CKSETWEKDDERNLLLIKSYDYFKGMEYQQGFITKFINFFNKSSSEEKRNEKEEEDDDDG 254
Query: 238 ALANVLRVLKTIHRLFFDSVCGDVRTYL 265
L + L LKTIH+ FF C DVR L
Sbjct: 255 VLVDALNSLKTIHQRFFHGQCKDVRLLL 282
>gi|145346053|ref|XP_001417510.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144577737|gb|ABO95803.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 643
Score = 94.0 bits (232), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 67/213 (31%), Positives = 104/213 (48%), Gaps = 27/213 (12%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIH--------------SFIGSLFQMAN- 111
RKL LVL+LDHTLL+ + L +L+ + S+F + +
Sbjct: 308 RKLALVLDLDHTLLNSVLVPDLRMDSNWLRNAMRLLDADVKRAEDANDPLKRSVFHLQHF 367
Query: 112 DKLVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNG 171
D L KLRP VR FLE+AS L +I++ TM ++ YA+ V+LLD + ++ + + G
Sbjct: 368 DLLTKLRPGVRRFLERASRLFEIHINTMGSQAYADQMVELLDPEKRWIHGTVRGLGEMEG 427
Query: 172 KDRKNP------DLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF----RDKELNGD 221
P + +I DDT SVW H NL+ +Y++F R L+G
Sbjct: 428 GKLWAPAEKTLDGALEHLADACLIFDDTASVWESHRRNLVTCERYLFFPQARRQFGLSG- 486
Query: 222 HKSYSETLTDESENEEALANVLRVLKTIHRLFF 254
S E DESE+E L+ ++V +++H +F
Sbjct: 487 -MSLLEIGQDESEDEGMLSTAMKVFESVHSAYF 518
>gi|56547717|gb|AAV92930.1| putative transcription regulator CPL1 [Solanum lycopersicum]
Length = 1227
Score = 90.5 bits (223), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 78/260 (30%), Positives = 115/260 (44%), Gaps = 52/260 (20%)
Query: 115 VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNG--- 171
KLRP + FLE+AS+L +++L TM + YA KLLD F+ R+I+R D
Sbjct: 966 TKLRPGIWNFLEKASNLFELHLYTMGNKLYATEMAKLLDPKGDLFAGRVISRGDDGDPFD 1025
Query: 172 ---KDRKNPDL--VRGQERGIVILDDTESVWSDHTENLIVLGKYVYF----RDKELNGDH 222
+ K+ DL V G E +VI+DD+ VW + NLIV+ +Y+YF R L G
Sbjct: 1026 GDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYIYFPCSRRQFGLPG-- 1083
Query: 223 KSYSETLTDESENEEALANVLRVLKTIHRLFFDSVC---GDVRTYLPKVRSEFSRDV-LY 278
S E DE + LA+ L V++ IH+ FF DVR L + + +
Sbjct: 1084 PSLLEIDHDERPEDGTLASCLGVIQRIHQNFFTHRSIDEADVRNILATEQKKILAGCRIV 1143
Query: 279 FSAIFR--------DCLWAEQEE--------------------------KFLVQEKKFLV 304
FS +F LW E+ + + + +V
Sbjct: 1144 FSRVFPVGEASPHLHPLWQTAEQFGAVCTSQIDDQVTHVVANSLGTDKVNWALSTGRSVV 1203
Query: 305 HPRWIDAYYFLWRRRPEDDY 324
HP W++A L+RR E D+
Sbjct: 1204 HPGWVEASALLYRRANEHDF 1223
>gi|325179818|emb|CCA14221.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 694
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 71/263 (26%), Positives = 124/263 (47%), Gaps = 37/263 (14%)
Query: 27 CAHTTVRDSRCIFC-----SQAMNDSFGLSFDYMLRG--LRYSEQEERK----------- 68
C H + S C+ C + + D S + + G LR + E +K
Sbjct: 92 CIHPLMSGSTCMMCLAIVTDEELVDGAHGSVNIVSHGQVLRLNSAEAKKFDSHTMERQLI 151
Query: 69 ---LQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSF-IGSLFQMANDKLVKLRPFVRTF 124
L LVL+LDHTLLH + L +IH F I + M + +VKLRP + F
Sbjct: 152 AKKLSLVLDLDHTLLHAVYVADLLEQRPTASDEIHYFKIPGVMTM--EYVVKLRPGLHQF 209
Query: 125 LEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVRGQ- 183
L+ D+++ T TR YAEA +++D D F RI+AR D D K+ L+
Sbjct: 210 LKSLREQYDLFIYTHGTRIYAEAIAEIIDPDDTLFRHRIVARTDTPDIDHKSLKLLFPSC 269
Query: 184 -ERGIVILDDTESVWSDHTENLIVLGKYVYFR-DKEL-NGDHKSYSETLTDESENEEALA 240
+ I+ILDD VW ++ N++++ + +F E+ N ++ S + + ++++ + +
Sbjct: 270 DDSMILILDDRLDVWKENEGNVLLIKPFHFFNCTAEINNAPGETISPSASSQNQDSDPVE 329
Query: 241 N---------VLRVLKTIHRLFF 254
+L++L+ +H+ F+
Sbjct: 330 PTKMDTDFEYILKILQRVHQAFY 352
>gi|147770504|emb|CAN75676.1| hypothetical protein VITISV_003260 [Vitis vinifera]
Length = 205
Score = 88.2 bits (217), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 50/124 (40%), Positives = 73/124 (58%), Gaps = 2/124 (1%)
Query: 139 MSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVRGQERGIVILDDTESVWS 198
M + YA VK+LD + YFSS +I++ D + +K D+V G + ++ILDDTE W
Sbjct: 1 MGEQFYALEMVKVLDPRTVYFSSSVISQADSTQRHQKGLDVVLGPKSXVLILDDTERAWK 60
Query: 199 DHTENLIVLGKYVYFRDK-ELNGDH-KSYSETLTDESENEEALANVLRVLKTIHRLFFDS 256
+H +NLI++ +Y +F G H KS SE +DESE + ALA +L+VL+ H FD
Sbjct: 61 NHKDNLILMERYHFFASSCHQFGFHCKSLSELKSDESEPDGALATILKVLQQTHSTLFDP 120
Query: 257 VCGD 260
D
Sbjct: 121 ELSD 124
>gi|297741470|emb|CBI32601.3| unnamed protein product [Vitis vinifera]
Length = 147
Score = 87.8 bits (216), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 55/146 (37%), Positives = 80/146 (54%), Gaps = 8/146 (5%)
Query: 139 MSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVRGQERGIVILDDTESVWS 198
M + YA VK+LD + YFSS +I++ D + +K D+V G + ++ILDDTE W
Sbjct: 1 MGEQFYALEMVKVLDPRTVYFSSSVISQADSTQRHQKGLDVVLGPKSAVLILDDTERAWK 60
Query: 199 DHTENLIVLGKYVYFRDK-ELNGDH-KSYSETLTDESENEEALANVLRVLKTIHRLFFDS 256
+H +NLI++ +Y +F G H KS SE +DESE + ALA +L+VL+ H FD
Sbjct: 61 NHKDNLILMERYHFFASSCHQFGFHCKSLSELKSDESEPDGALATILKVLQQTHSTLFDP 120
Query: 257 VCG------DVRTYLPKVRSEFSRDV 276
DVR L + + RD
Sbjct: 121 ELSDNFSGRDVRQVLNRFGGKSRRDA 146
>gi|308802952|ref|XP_003078789.1| putative transcription regulator CPL1 (ISS) [Ostreococcus tauri]
gi|116057242|emb|CAL51669.1| putative transcription regulator CPL1 (ISS) [Ostreococcus tauri]
Length = 457
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 82/324 (25%), Positives = 137/324 (42%), Gaps = 66/324 (20%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIH--------------SFIGSLFQMAN- 111
RKL LVL+LDHTLL+ + SL + L+ + S F + +
Sbjct: 129 RKLALVLDLDHTLLNSVLVPSLRTEANSLQNAMRLLDHDVARAERTGDPLQRSCFHLPHF 188
Query: 112 DKLVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNG 171
D KLRP VR+FLE+AS L +I++ TM ++ YA+ V LLD K+ + + +
Sbjct: 189 DLFTKLRPGVRSFLERASKLFEIHISTMGSQAYADQMVALLDPAKKWINGTVKGLGEMEN 248
Query: 172 KDRKNP------DLVRGQERGI-VILDDTESVWSDHTENLIVLGKYVYFRD--KELNGDH 222
P D G+ + VI DDT VW+ + ++L +Y++F ++
Sbjct: 249 GRLIAPRYKSLDDCGLGELTDVSVIFDDTTDVWAQNLKSLFTCERYLFFPQARRQFGLLG 308
Query: 223 KSYSETLTDESENEEALANVLRVLKTIHRLFF---DSVCGDVRTYLPKVRSEFSRDVL-- 277
S E DESE+E L + V +++H +F D++ G + + E + VL
Sbjct: 309 SSLLEVGQDESESEGMLMTAINVFESVHAEYFKRRDALKGKKSPCMQDILEERRKVVLSG 368
Query: 278 ---YFSAIFRDCLWAEQEEKFLVQE-------------------------------KKFL 303
FS +F + E++ +++ E K+
Sbjct: 369 VHVVFSRVFPLHVKPEEQPLWILAENFGANCSSEITSHTTHVVGTSKATAKVREALKRGG 428
Query: 304 VH---PRWIDAYYFLWRRRPEDDY 324
+H P W++ WRR E ++
Sbjct: 429 IHAVTPHWLECSMLFWRRASEKNF 452
>gi|430812451|emb|CCJ30145.1| unnamed protein product [Pneumocystis jirovecii]
Length = 741
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 61/188 (32%), Positives = 97/188 (51%), Gaps = 24/188 (12%)
Query: 65 EERKLQLVLNLDHTLLHC---------------RNIKSLSSGEKYLKKQIHSFIGSLFQM 109
+E KL L+++LD T+LH ++ ++ +K+ K+ +S IG+ +
Sbjct: 191 KEMKLSLIVDLDQTILHATVDPIVGEWLSNPSSKHYLAVQDVQKFCLKENNSGIGNWY-- 248
Query: 110 ANDKLVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDF 169
VK+RP + FLE S L ++++ TM TR YA + L+D D KYF RI++R++
Sbjct: 249 ----YVKMRPGLEQFLENISKLYEMHIYTMGTRAYAASIAHLIDKDKKYFGDRILSRDES 304
Query: 170 NGKDRKNPD-LVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRD-KELNGDHKSYSE 227
RKN L +VI+DD VW + NLI + Y +F ++NGD+ S
Sbjct: 305 GSTTRKNIQRLFPVDTSMVVIIDDRADVWQ-WSPNLIKVTPYEFFVGIGDINGDYLSNKP 363
Query: 228 TLTDESEN 235
TL + S N
Sbjct: 364 TLHNFSPN 371
>gi|307111295|gb|EFN59530.1| hypothetical protein CHLNCDRAFT_138191 [Chlorella variabilis]
Length = 1156
Score = 85.5 bits (210), Expect = 3e-14, Method: Composition-based stats.
Identities = 91/329 (27%), Positives = 138/329 (41%), Gaps = 74/329 (22%)
Query: 68 KLQLVLNLDHTLLHCRNIKSLS-SGEKYLKKQIHSFIGSL-------FQMANDKL-VKLR 118
KL LVL+LDHTLL+ + + LK + S +L F++ K+ KLR
Sbjct: 368 KLCLVLDLDHTLLNSATFAEVGPTLHDSLKARAASEAATLPEDQRLLFRIDGIKMWTKLR 427
Query: 119 PFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKN-- 176
P V FL++A+ +++ T R YA++ V+LLD F RIIA+ G +R +
Sbjct: 428 PGVHKFLQRAARYYQLWIHTNGNRAYADSVVRLLDRGGAIFGDRIIAQ----GAERVDQM 483
Query: 177 -PD----LVRG---QERGIVILDDTESVWSDHTENLIVLGKYVYFRDKELNGDHKSYS-- 226
PD L++G +E VI+DD+ SVWS H NL+ + +Y+YF + K S
Sbjct: 484 VPDQAKRLMQGLDERESITVIVDDSHSVWSQHRHNLVAVERYIYFPSSRASLGLKGPSLL 543
Query: 227 ETLTDESENEEALANVLRVLKTIHRLFFDSVCG---------------DVRTYLPKVRSE 271
+ DE + L L VL +H ++ D R L + R +
Sbjct: 544 DANRDECPEQGMLMVALSVLVRVHGAVMRALAAPPTVLPGGEVVFQNWDARQALAQERQK 603
Query: 272 FSRDV-LYFSAIF-------RDCLW------------------------AEQEEKFLVQE 299
V L F+ + LW A EK L
Sbjct: 604 VLAGVHLVFTRVIPLEMEPESHPLWRLAQSFGARCSGSLDASTTHVIAGASGTEKVLSAR 663
Query: 300 K--KFLVHPRWIDAYYFLWRRRPEDDYLP 326
K++V P W++ LW+R E+ +LP
Sbjct: 664 SMGKWVVTPAWLECSCILWKRAHEERFLP 692
>gi|224091747|ref|XP_002309339.1| predicted protein [Populus trichocarpa]
gi|222855315|gb|EEE92862.1| predicted protein [Populus trichocarpa]
Length = 204
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 62/248 (25%), Positives = 105/248 (42%), Gaps = 88/248 (35%)
Query: 114 LVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKD 173
++K RPF R FL++AS + +Y+ T+ YA KLLD ++F++++ +R+D +
Sbjct: 2 MIKSRPFARMFLKEASQMFGLYMYTLGDPAYALEMAKLLDPGGEFFNAKVTSRDDGTQRH 61
Query: 174 RKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRDKELNGDHKSYSETLTDES 233
+K D+++ +DES
Sbjct: 62 QKGHDVLK------------------------------------------------SDES 73
Query: 234 ENEEALANVLRVLKTIHRLFFDSVC----------GDVRTYLPKVRSEFSRDVL-----Y 278
E+ ALA+VL+ L+ +H +FF+ DVR L VR RDVL
Sbjct: 74 ESGGALASVLKALRKVHHIFFEGTLLQELEENPDGRDVRKVLKTVR----RDVLKGCKIV 129
Query: 279 FSAIF-------RDCLW--------------AEQEEKFLVQEKKFLVHPRWIDAYYFLWR 317
FS +F LW ++ + ++ KFLVHP WI+A + W+
Sbjct: 130 FSRVFPTQFQADNHHLWRMVEQLGATCSTEAGTEKSRRALKHNKFLVHPGWIEATNYFWQ 189
Query: 318 RRPEDDYL 325
++PE++ +
Sbjct: 190 KQPEENRI 197
>gi|242093894|ref|XP_002437437.1| hypothetical protein SORBIDRAFT_10g027050 [Sorghum bicolor]
gi|241915660|gb|EER88804.1| hypothetical protein SORBIDRAFT_10g027050 [Sorghum bicolor]
Length = 271
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 93/302 (30%), Positives = 129/302 (42%), Gaps = 83/302 (27%)
Query: 64 QEERKLQLVLNLDHTLLHC----RNIKSLSSGEKYLKKQIHSFIGSLFQMANDK----LV 115
+ ERKL LVL+LDHTLL+ +++ +L + LF++ L
Sbjct: 5 KRERKLILVLDLDHTLLNSTRLHQDLSALEQRNGFTPDTEDELHMELFRLEYSDNVRMLT 64
Query: 116 KLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRK 175
KLRPFVR FLEQASS T S AAV
Sbjct: 65 KLRPFVRGFLEQASS----RASTSSRAPIDPAAV-------------------------- 94
Query: 176 NPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF--RDKELNGDHKSYSETLTDES 233
VILDDT+S W H +NLI++ +Y YF ++ + S +E DE
Sbjct: 95 ------------VILDDTDSAWPGHQDNLILMDRYHYFACTCRKFRYNIPSMAEQARDER 142
Query: 234 ENEEALANVLRVLKTIHRLFFDSVCGDVRTYLPKVR--------------SEFSRDVLYF 279
E++ +LA VL VL IH+ FFD DVR + +VR +F D L +
Sbjct: 143 EHDGSLAVVLGVLNRIHQAFFDDDRADVREVIAEVRRQVLPVCTVVFSYLEDFPEDTLMW 202
Query: 280 S-------AIFRDC------LWAE----QEEKFLVQEKKFLVHPRWIDAYYFLWRRRPED 322
+ A +D + AE Q+ ++ + KFLV+P WI A F W R E
Sbjct: 203 TLAERLGAACQKDVDETVTHVVAEDPGTQKAQWAREHGKFLVNPEWIKAVNFRWCRVDER 262
Query: 323 DY 324
D+
Sbjct: 263 DF 264
>gi|297792863|ref|XP_002864316.1| hypothetical protein ARALYDRAFT_918545 [Arabidopsis lyrata subsp.
lyrata]
gi|297310151|gb|EFH40575.1| hypothetical protein ARALYDRAFT_918545 [Arabidopsis lyrata subsp.
lyrata]
Length = 142
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 50/118 (42%), Positives = 67/118 (56%), Gaps = 4/118 (3%)
Query: 139 MSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVRGQERGIVILDDTESVWS 198
M R YA+ +KL+D + YF R+I R + K DLV E G+VI+DDT VW
Sbjct: 1 MGDRDYAKNVLKLIDPEKVYFGDRVITRNE--SPYIKTLDLVLADECGVVIVDDTAQVWP 58
Query: 199 DHTENLIVLGKYVYFRDKELNGDH--KSYSETLTDESENEEALANVLRVLKTIHRLFF 254
DH NL+ + KY YF DK KSY+E DE N+ +L NVL+V+K ++ FF
Sbjct: 59 DHKRNLLEITKYNYFSDKTRRDVKYSKSYAEEKRDEGRNDGSLGNVLKVIKEVYERFF 116
>gi|255080370|ref|XP_002503765.1| predicted protein [Micromonas sp. RCC299]
gi|226519032|gb|ACO65023.1| predicted protein [Micromonas sp. RCC299]
Length = 574
Score = 84.7 bits (208), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 51/148 (34%), Positives = 73/148 (49%), Gaps = 5/148 (3%)
Query: 114 LVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKD 173
KLRP FL AS L +Y+ TM R YA KLLD + F+ R+I D +
Sbjct: 229 FTKLRPHAHAFLRAASQLCTMYIYTMGDRNYAREMAKLLDPTGELFNGRVIGSGDSTSQY 288
Query: 174 RKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRDKELNGDHKSYS---ETLT 230
+K+ D+V G E ++I DDT+ VW + NLI + +Y +F+ S
Sbjct: 289 KKDLDIVLGAEPTVLITDDTDRVWPKNLANLIRIDRYHFFKQSAAGFRQPGRSVMERQWR 348
Query: 231 DESENEE--ALANVLRVLKTIHRLFFDS 256
DE +N + L +VL V+ HR FF+
Sbjct: 349 DEGDNGDRAQLRDVLAVIAAAHRRFFEG 376
>gi|291001899|ref|XP_002683516.1| TFIIF CTD phosphatase Fcp1 [Naegleria gruberi]
gi|284097145|gb|EFC50772.1| TFIIF CTD phosphatase Fcp1 [Naegleria gruberi]
Length = 592
Score = 84.7 bits (208), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 55/189 (29%), Positives = 96/189 (50%), Gaps = 25/189 (13%)
Query: 45 NDSFGLSFDYMLRGLRYSEQ---EERKLQLVLNLDHTLLHCRN-------------IKSL 88
N + ++++ L + ++Q E++KL LVL+LDHTLLH N +
Sbjct: 164 NVGYTIAYEKGLERGKANQQRLIEKKKLSLVLDLDHTLLHTINDFEYRREHHKVTYFNDI 223
Query: 89 SSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAA 148
+ L+K IH F F + VK RP + +FL++ S + ++++ T R YA+
Sbjct: 224 YNNSPELQKHIHKF----FMRGSYHFVKFRPRLESFLKRCSEIFELHVFTHGERAYADQI 279
Query: 149 VKLLDLDSKYFSSRIIARE---DFNGKDRKNPDLVRGQERGIVILDDTESVWSDHTENLI 205
K+LD F+ RI++R+ D N K + ++ ++++DD VW D+ +N+I
Sbjct: 280 GKMLDSSKSLFADRILSRDECPDINTKTLSQ--VFPYSDKSVLVIDDKTDVWKDNVDNVI 337
Query: 206 VLGKYVYFR 214
+ Y YFR
Sbjct: 338 QIAPYDYFR 346
>gi|330799899|ref|XP_003287978.1| hypothetical protein DICPUDRAFT_55168 [Dictyostelium purpureum]
gi|325082002|gb|EGC35499.1| hypothetical protein DICPUDRAFT_55168 [Dictyostelium purpureum]
Length = 730
Score = 84.3 bits (207), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 55/159 (34%), Positives = 83/159 (52%), Gaps = 16/159 (10%)
Query: 66 ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQ-----IHSFIGSLFQMANDKL---VKL 117
ERKL LVL+LDHTL+H + L+S + + IH+ N + +K
Sbjct: 134 ERKLSLVLDLDHTLIHAVTEQGLNSSPNWKNRNRKDYDIHNI------TVNGPMTYCIKK 187
Query: 118 RPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKN- 176
RP + FLE + ++++ TM TR YA KL+D D F RI++R+D NG + K
Sbjct: 188 RPHLNDFLENVNKNFELHIYTMGTRNYANEIAKLIDPDQTLFKERILSRDDGNGINFKTL 247
Query: 177 PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRD 215
L + ++I+DD VW ++NLI + YV+F D
Sbjct: 248 QRLFPCDDSMVLIVDDRSDVWK-KSKNLIQISPYVFFTD 285
>gi|424513770|emb|CCO66392.1| predicted protein [Bathycoccus prasinos]
Length = 546
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 73/242 (30%), Positives = 109/242 (45%), Gaps = 47/242 (19%)
Query: 59 LRYSEQEER-------KLQLVLNLDHTLLHCRNIKSLSSGE------KYLKKQIHSFIGS 105
LR ++ EER KL LVL+LDHTLL+ L+ E K K++ + S
Sbjct: 168 LREAKNEERMATLNQGKLFLVLDLDHTLLNSCRFDELNDEERESLDRKVEKREEEDELRS 227
Query: 106 ---------------------LFQMAN-DKLVKLRPFVRTFLEQASSLVDIYLCTMSTRC 143
L+ +++ KLRP+V FLEQAS + +++ TM +
Sbjct: 228 KLLGLVGGGDAGGGRRPRFPDLYCLSHFSTYTKLRPYVFEFLEQASKICRMHVYTMGDKN 287
Query: 144 YAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVRGQERGIVILDDTESVWSDHTEN 203
YA L+D + KYF RII D K+ D+V G + +I+DDT VW H N
Sbjct: 288 YAHEMASLIDPEGKYFHGRIIGNSDSTCSKTKDLDIVLGGDDCTMIVDDTSRVWPRHARN 347
Query: 204 LIVLGKYVYFRDKELN------------GDHKSYSETLTDESENEEALANVLRVLKTIHR 251
LI + +Y +FR + G + +E +++ E L +VL VL HR
Sbjct: 348 LIRVDRYHFFRKSATSFREMEKSSVMERGLDEGEAEEEGAPAKHREVLKDVLAVLTVAHR 407
Query: 252 LF 253
+
Sbjct: 408 MM 409
>gi|303389951|ref|XP_003073207.1| Fcp1-like phosphatase [Encephalitozoon intestinalis ATCC 50506]
gi|303302352|gb|ADM11847.1| Fcp1-like phosphatase [Encephalitozoon intestinalis ATCC 50506]
Length = 407
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 59/158 (37%), Positives = 84/158 (53%), Gaps = 15/158 (9%)
Query: 64 QEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKL-VKLRPFVR 122
+ + KL LVL+LD T+LH Y + +IH + F M K VKLRP +
Sbjct: 56 ETQMKLILVLDLDQTILHT----------TYGESRIHGTVR--FIMDGSKYCVKLRPNLD 103
Query: 123 TFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKN-PDLVR 181
L + S L +I++ TM TR YAE V ++D KYF RII R++ G K L
Sbjct: 104 HMLRKISRLYEIHVYTMGTRAYAERIVGIVDPSGKYFQDRIITRDENEGVLVKRLSRLFP 163
Query: 182 GQERGIVILDDTESVWSDHTENLIVLGKYVYFRDKELN 219
+ IVILDD VW D++ENL+++ + YF ++N
Sbjct: 164 HNHKNIVILDDRPDVW-DYSENLLLVRPFWYFNRTDIN 200
>gi|66824241|ref|XP_645475.1| hypothetical protein DDB_G0271690 [Dictyostelium discoideum AX4]
gi|60473594|gb|EAL71535.1| hypothetical protein DDB_G0271690 [Dictyostelium discoideum AX4]
Length = 782
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 62/210 (29%), Positives = 100/210 (47%), Gaps = 25/210 (11%)
Query: 27 CAHTTVRDSRCIFCSQAMNDSF-GLSFDYMLRGLRYSEQE--------------ERKLQL 71
C H C C + + D+ LS + L S +E E+KL L
Sbjct: 79 CTHDIQFSGLCATCGRELTDTQESLSILHGHSHLTVSHKEAQRIGDINTKRLLMEKKLSL 138
Query: 72 VLNLDHTLLHCRNIKSLSSGEKYLKKQ-----IHSFIGSLFQMANDKLVKLRPFVRTFLE 126
VL+LDHT++H + +S ++ K IH+ + +K RP + FL
Sbjct: 139 VLDLDHTVIHAVTEQGFNSSPEWRNKDKNKNGIHTIT---VNGPMNYCIKKRPHLVKFLT 195
Query: 127 QASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPD-LVRGQER 185
+ + + ++++ TM TR YA KL+D +S F RI++R+D NG + K+ L +
Sbjct: 196 EVNKIYELHIYTMGTRNYANEIAKLIDPESSIFKERILSRDDGNGINFKSLQRLFPCDDS 255
Query: 186 GIVILDDTESVWSDHTENLIVLGKYVYFRD 215
++I+DD VW ++NLI + YVYF D
Sbjct: 256 MVLIVDDRSDVWK-KSKNLIQISPYVYFTD 284
>gi|384247094|gb|EIE20582.1| hypothetical protein COCSUDRAFT_57726 [Coccomyxa subellipsoidea
C-169]
Length = 1018
Score = 82.8 bits (203), Expect = 2e-13, Method: Composition-based stats.
Identities = 81/319 (25%), Positives = 135/319 (42%), Gaps = 60/319 (18%)
Query: 66 ERKLQLVLNLDHTLLHCRNIKSLSSGE-KYLKKQIHSFIG------SLFQMANDKL-VKL 117
+R+L LVL+LDHTL++ + K L++Q+ L ++ + L
Sbjct: 696 QRRLCLVLDLDHTLVNSAKFSEVEPEHLKLLERQLQREAALPAEEKRLHRLDRIAMWTAL 755
Query: 118 RPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAR-EDFNGKDRKN 176
RP +R L + L +++ T ++R YA A +LLD + F RII++ +D + +
Sbjct: 756 RPGLRQMLAAVAPLFQLWIQTNASRAYALAMAELLDPTGELFGQRIISKGDDGSALINHS 815
Query: 177 PDLVRGQERG---IVILDDTESVWSDHTENLIVLGKYVYF--RDKELNGDHKSYSETLTD 231
L++G E +I+DD++ VW H NL+ + +Y YF ++LN S+ E D
Sbjct: 816 KRLMQGLEECEAVCIIVDDSDDVWRHHAHNLLHVERYTYFPSSRRQLNLRGPSFLEAHKD 875
Query: 232 ESENEEALANVLRVLKTIHRLFFDSVCG------------DVRTYLPKVRSEFSRDV-LY 278
E + LA L VL +H F ++ DVR L +R + V +
Sbjct: 876 ECDKTGILAVTLGVLLRVHIAVFAALDAPPTAGIREEHHWDVRHVLGLLRKQVLLGVRVL 935
Query: 279 FSAIF-------RDCLWAEQE--------------------------EKFLVQEKKFLVH 305
FS +F W + E ++ +Q K +V
Sbjct: 936 FSKVFPLGQAPSEQLYWKQAEAYGASCTSQLDEHVTHVVALSRGTHKAQWALQAGKHVVS 995
Query: 306 PRWIDAYYFLWRRRPEDDY 324
P W++ LW+R E Y
Sbjct: 996 PAWLECSCTLWQRAKERAY 1014
>gi|396081720|gb|AFN83335.1| Fcp1-like phosphatase [Encephalitozoon romaleae SJ-2008]
Length = 408
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 66/187 (35%), Positives = 96/187 (51%), Gaps = 16/187 (8%)
Query: 68 KLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKL-VKLRPFVRTFLE 126
KL LVL+LD T+LH +S EK + + F M K VKLRP ++ L
Sbjct: 60 KLILVLDLDQTVLHT---AYGASSEKGIVR---------FTMDGCKYSVKLRPNLKRMLR 107
Query: 127 QASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKN-PDLVRGQER 185
+ S L +I++ TM TR YAE V+++D KYF RII R++ G K L +
Sbjct: 108 KVSRLYEIHVYTMGTRPYAERIVRIIDPTRKYFHDRIITRDENQGVLVKRLSRLFPYNHK 167
Query: 186 GIVILDDTESVWSDHTENLIVLGKYVYFRDKELNGDHKSYSETLTDESENEEALANVLRV 245
IVILDD VW D+ ENL+++ + YF ++N D + ESE + L + +R
Sbjct: 168 NIVILDDRADVW-DYCENLVLIKPFWYFNRVDIN-DPLRLKRKIEKESEECKELGDSVRK 225
Query: 246 LKTIHRL 252
K + +
Sbjct: 226 RKKVEEV 232
>gi|452820283|gb|EME27327.1| phosphoprotein phosphatase [Galdieria sulphuraria]
Length = 734
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 70/252 (27%), Positives = 111/252 (44%), Gaps = 43/252 (17%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDK------------- 113
+KL LVL+LD+TL+H + + ++ + + ++Q A +K
Sbjct: 228 KKLSLVLDLDNTLIHATLVS-------HFPQEWYQYKQEIYQQATEKALECSAPLMEDIH 280
Query: 114 ---------LVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRII 164
LVKLRP VR FLE+ ++++ TM +R YA+A LLD F RI+
Sbjct: 281 ELDLDGSISLVKLRPNVRRFLEKIHQRYELHIYTMGSRSYADAIATLLDPSGNLFQRRIV 340
Query: 165 AREDFNGKDRKNPDLVR---GQERGIVILDDTESVWSDHTE-----NLIVLGKYVYF-RD 215
+R+DF L R + ++I+DD E VW DH + NLI Y++F +D
Sbjct: 341 SRDDFVEGMMNRKSLRRIFPCDDSMVIIVDDREDVWMDHNQGEMVPNLIRAKPYLFFVQD 400
Query: 216 KELN-GDHKSYSETLT----DESENEEALANVLRVLKTIHRLFFDSVCGDVRTYLPKVRS 270
N +H + T T ++E+ AN+ + T + G YLP V+
Sbjct: 401 VHENMNNHLVWDSTTTSIHPSSESHKESFANISTCMLTCLNWKENLESGCYFPYLPWVQK 460
Query: 271 EFSRDVLYFSAI 282
D Y +
Sbjct: 461 TVESDENYLGRL 472
>gi|340377687|ref|XP_003387360.1| PREDICTED: hypothetical protein LOC100639785 [Amphimedon
queenslandica]
Length = 913
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 65/218 (29%), Positives = 102/218 (46%), Gaps = 41/218 (18%)
Query: 27 CAHTTVRDSRCIFCS-----------QAMNDSFGLSFDYMLRGLRYSEQE---------- 65
C H+ V C FC + D +S + + ++ +++E
Sbjct: 83 CDHSVVALDLCAFCGLDLRSISSVSDRGTEDHANVSMLHGMPQVKVNKKEAQRLGNLDKE 142
Query: 66 ----ERKLQLVLNLDHTLLHC---RNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLR 118
RKL L+++LD TL+H RNI E+ L +HSF +L + +LR
Sbjct: 143 CLLKNRKLALIIDLDQTLIHTSIDRNI------ERGLP-DVHSF--TLPGHSCVYHCRLR 193
Query: 119 PFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE---DFNGKDRK 175
P+VR FL S ++++ TM TR YA+A K+LD + K FS R+I+R D + K +
Sbjct: 194 PYVREFLNHISQYYELHVATMGTRDYADAITKILDQEKKLFSHRVISRNELLDPHSKAVR 253
Query: 176 NPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
+ + + I+DD VW H NLI + YV+F
Sbjct: 254 LKSVFPCGDEMVAIMDDRGDVWG-HRPNLIHVKAYVFF 290
>gi|346326901|gb|EGX96497.1| RNA Polymerase II CTD phosphatase Fcp1, putative [Cordyceps
militaris CM01]
Length = 780
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 52/169 (30%), Positives = 89/169 (52%), Gaps = 14/169 (8%)
Query: 66 ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSL--FQMANDKL--------- 114
+RKL LV++LD T++H ++ ++ HS + + FQ+ +D
Sbjct: 156 QRKLSLVVDLDQTIIHACIEPTVGEWQRDPSNPNHSAVKDVRSFQLKDDGPRGLASGCTY 215
Query: 115 -VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKD 173
+KLRP +R FLE+ S + ++++ TM TR YA K++D D K F +R+I+R++
Sbjct: 216 YIKLRPGLRDFLEEVSKMYELHVYTMGTRAYALNIAKIVDPDRKLFGNRVISRDENGSIT 275
Query: 174 RKN-PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRD-KELNG 220
K+ L +VI+DD VW + NLI + Y +F+ ++NG
Sbjct: 276 AKSLARLFPVSTDMVVIIDDRADVWPMNKANLIKVAAYDFFKGIGDING 324
>gi|260949511|ref|XP_002619052.1| hypothetical protein CLUG_00211 [Clavispora lusitaniae ATCC 42720]
gi|238846624|gb|EEQ36088.1| hypothetical protein CLUG_00211 [Clavispora lusitaniae ATCC 42720]
Length = 776
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 64/241 (26%), Positives = 105/241 (43%), Gaps = 51/241 (21%)
Query: 21 CEQSLSCAHTTVRDSRCIFCSQAMND-------------SFGLSFDYMLRGLRYSEQE-- 65
C+ C+HT C C +A+ D + +S D GLR S E
Sbjct: 90 CQIEEPCSHTVQYGGLCALCGKAVEDEKDYTGYNYEDRATIAMSHDNT--GLRISLDEAT 147
Query: 66 ------------ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIG--SLFQMAN 111
++KL LV++LD T++H ++ ++ + + F+ LF +
Sbjct: 148 KIEQSSTERLAADKKLILVVDLDQTVIHATVDPTVGEWQRDPQNPNYPFVKDVQLFSLEE 207
Query: 112 DKLV------------------KLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLD 153
+ +V KLRP ++ FL + S L ++++ TM+TR YA A ++D
Sbjct: 208 EPIVPPGWVGPRPPPTKCWYYVKLRPGLKEFLAEVSKLYELHIYTMATRNYALAIASIID 267
Query: 154 LDSKYFSSRIIAREDFNGKDRKN-PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVY 212
D KYF RI++R++ KN L + +VI+DD VW NLI + Y +
Sbjct: 268 PDGKYFGDRILSRDESGSLTHKNLRRLFPVDQSMVVIIDDRGDVWQ-WEANLIKVVPYDF 326
Query: 213 F 213
F
Sbjct: 327 F 327
>gi|308464266|ref|XP_003094401.1| hypothetical protein CRE_07009 [Caenorhabditis remanei]
gi|308247823|gb|EFO91775.1| hypothetical protein CRE_07009 [Caenorhabditis remanei]
Length = 754
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 55/198 (27%), Positives = 100/198 (50%), Gaps = 19/198 (9%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKY---LKKQIHSFIGSLFQMANDKLVKLRPFVRT 123
RKL L+++LD T++H + + EK+ K +HS + + KLRP
Sbjct: 238 RKLVLLVDLDQTIIHTSDKLMSADAEKHKDITKYNLHSRVYT---------TKLRPHTTE 288
Query: 124 FLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRK--NPDLVR 181
FL + S++ ++++ T R YA K+LD D++ F RI++R + + K N L
Sbjct: 289 FLNKMSAMYEMHIVTFGERKYALRIAKILDPDARLFGQRILSRNELSSAQHKTENKALFP 348
Query: 182 GQERGIVILDDTESVWSDHTENLIVLGKYVYFRD-KELNGDHKSYSE---TLTDESENEE 237
+ +VI+DD VW ++E LI + Y +F++ ++N S + + D++ +
Sbjct: 349 CGDNLVVIIDDRADVWQ-YSEALIQIKPYRFFKEVGDINAPKHSKEQMPVQIEDDAHEDR 407
Query: 238 ALANVLRVLKTIHRLFFD 255
L + RVL IH +++
Sbjct: 408 VLEEIERVLTNIHNKYYE 425
>gi|367004465|ref|XP_003686965.1| hypothetical protein TPHA_0I00240 [Tetrapisispora phaffii CBS 4417]
gi|357525268|emb|CCE64531.1| hypothetical protein TPHA_0I00240 [Tetrapisispora phaffii CBS 4417]
Length = 732
Score = 78.6 bits (192), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 61/234 (26%), Positives = 107/234 (45%), Gaps = 44/234 (18%)
Query: 21 CEQSLSCAHTTVRDSRCIFCSQAMND----SFGLSFDYMLRGLRYSEQE----------- 65
CE C H V C C + +++ S L+ + L+ S QE
Sbjct: 102 CEVKRPCDHDIVYAGICTMCGKEVDERDQVSANLTISHTDTNLKVSRQEANNIGQTNKSR 161
Query: 66 ---ERKLQLVLNLDHTLLHC---------------------RNIKSLSSGEKYLKKQIHS 101
+KL LV++LD T++HC RN+KS E+ + +
Sbjct: 162 LIRSKKLILVVDLDQTVIHCGVDPTISEWKNDPSNPNYETLRNVKSFVLEEEAILPPM-- 219
Query: 102 FIGSLFQMAN-DKLVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFS 160
++G + VK+RP ++ F E+ + + ++++ TM+TR YAE K++D D F
Sbjct: 220 YMGPKPPVHKCSYYVKVRPGLKEFFEKVAPIYEMHIYTMATRAYAEEIAKIIDPDGSLFG 279
Query: 161 SRIIAREDFNGKDRKNPD-LVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
+RI++R++ K+ + L + +VI+DD VW + + NLI + Y +F
Sbjct: 280 NRILSRDENGSLTHKSLERLFPTDQSMVVIIDDRGDVW-NWSPNLIKVTPYNFF 332
>gi|303280109|ref|XP_003059347.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226459183|gb|EEH56479.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 136
Score = 78.6 bits (192), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 48/136 (35%), Positives = 69/136 (50%), Gaps = 5/136 (3%)
Query: 116 KLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRK 175
KLRP R FL AS++ +Y+ TM + YA K+LD + F+ R+IA D K
Sbjct: 1 KLRPRAREFLRAASAMCQLYVYTMGDKNYAREMAKILDPTGELFNGRVIANSDSTCSRTK 60
Query: 176 NPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRDKELNGDHKSYS---ETLTDE 232
+ D+V G E ++I+DDT+ VW + NLI + +Y +F S DE
Sbjct: 61 DLDIVLGAEGSVLIVDDTDRVWPHNLANLIRIDRYHFFPQSAAGFRQPGRSVLERAWKDE 120
Query: 233 SEN--EEALANVLRVL 246
N E L +VLRV+
Sbjct: 121 GANGDREQLRDVLRVI 136
>gi|19074511|ref|NP_586017.1| similarity to HYPOTHETICAL TRANSMEMBRANE PROTEINS YHG4_yeast
[Encephalitozoon cuniculi GB-M1]
gi|51701436|sp|Q8SV03.1|FCP1_ENCCU RecName: Full=RNA polymerase II subunit A C-terminal domain
phosphatase; AltName: Full=CTD phosphatase FCP1
gi|19069153|emb|CAD25621.1| similarity to HYPOTHETICAL TRANSMEMBRANE PROTEINS YHG4_yeast
[Encephalitozoon cuniculi GB-M1]
gi|449329538|gb|AGE95809.1| hypothetical protein ECU07_0890 [Encephalitozoon cuniculi]
Length = 411
Score = 78.6 bits (192), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 57/154 (37%), Positives = 79/154 (51%), Gaps = 15/154 (9%)
Query: 68 KLQLVLNLDHTLLHCR-NIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLE 126
KL LVL+LD T+LH SL K++ + VKLRP + L
Sbjct: 60 KLILVLDLDQTVLHTTYGTSSLEGTVKFVIDRCRY------------CVKLRPNLDYMLR 107
Query: 127 QASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKN-PDLVRGQER 185
+ S L +I++ TM TR YAE V+++D KYF RII R++ G K L R
Sbjct: 108 RISKLYEIHVYTMGTRAYAERIVEIIDPSGKYFDDRIITRDENQGVLVKRLSRLFPHDHR 167
Query: 186 GIVILDDTESVWSDHTENLIVLGKYVYFRDKELN 219
IVILDD VW D+ ENL+++ + YF ++N
Sbjct: 168 NIVILDDRPDVW-DYCENLVLIRPFWYFNRVDIN 200
>gi|2459436|gb|AAB80671.1| unknown protein [Arabidopsis thaliana]
Length = 1066
Score = 78.6 bits (192), Expect = 4e-12, Method: Composition-based stats.
Identities = 60/172 (34%), Positives = 88/172 (51%), Gaps = 18/172 (10%)
Query: 57 RGLRYSEQEE----RKLQLVLNLDHTLLHCRNIKSLSS-GEKYLKKQIHSF----IGSLF 107
R R EQ + +KL LVL++DHTLL+ + S E+ L+K+ LF
Sbjct: 889 RVRRLEEQNKMFASQKLSLVLDIDHTLLNSAKFNEVESRHEEILRKKEEQDREKPYRHLF 948
Query: 108 QMANDKL-VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAR 166
+ + + KLRP + FLE+AS L +++L TM + YA KLLD F+ R+I++
Sbjct: 949 RFLHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKLLDPKGVLFNGRVISK 1008
Query: 167 EDFNGKDR------KNPDL--VRGQERGIVILDDTESVWSDHTENLIVLGKY 210
D K+ DL V G E +VI+DD+ VW H NLI + +Y
Sbjct: 1009 GDDGDPLDGDERVPKSKDLEGVMGMESSVVIIDDSVRVWPQHKMNLIAVERY 1060
>gi|255081919|ref|XP_002508178.1| predicted protein [Micromonas sp. RCC299]
gi|226523454|gb|ACO69436.1| predicted protein [Micromonas sp. RCC299]
Length = 318
Score = 77.8 bits (190), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 50/158 (31%), Positives = 79/158 (50%), Gaps = 15/158 (9%)
Query: 115 VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIA---REDFNG 171
KLRP V+ FL Q +S+ ++++ TM T+ YA+ +L+D ++ +I ++F
Sbjct: 37 TKLRPGVKKFLRQVASMFEVHVITMGTQSYADEMRQLIDPGRQHIKGSVIGLGQMDEFGE 96
Query: 172 KDRKNPDLVRGQERGI----VILDDTESVWSDHTENLIVLGKYVYFRD--KEL----NGD 221
+ + G+ G+ V+LDD VW DH ENLI + +Y+YF K+ NG
Sbjct: 97 LQPADKKRLDGELSGLDSIAVVLDDHVGVWPDHEENLIEIDRYLYFPSALKQFGVWRNG- 155
Query: 222 HKSYSETLTDESENEEALANVLRVLKTIHRLFFDSVCG 259
S E DE + LA VL+ +H+ FF G
Sbjct: 156 -ASLLEKKVDEIADRSTLAAAFEVLRRVHQDFFAERAG 192
>gi|321460734|gb|EFX71774.1| hypothetical protein DAPPUDRAFT_308742 [Daphnia pulex]
Length = 798
Score = 77.8 bits (190), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 58/212 (27%), Positives = 98/212 (46%), Gaps = 30/212 (14%)
Query: 26 SCAHTTVRDSRCIFCSQAMNDS------FGLSFDYMLRGLRYSEQE-------------- 65
SC+H TV C C + ++ ++ + + L S +E
Sbjct: 95 SCSHPTVMKEMCAECGADLRETDQRSQTAAVAMVHNIPELMVSMKEATKLGKKDEERLLK 154
Query: 66 ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFL 125
+RKL L+++LD TL+H N + ++ E Q+H + +LRPF + L
Sbjct: 155 DRKLVLLVDLDQTLIHTTNDEIPANIEDVFHFQLHGPNSPWYH------TRLRPFTKELL 208
Query: 126 EQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPD---LVRG 182
SSL ++++CT +R YA LD +YFS RI++R++ K + L
Sbjct: 209 CSMSSLYELHICTFGSRTYAHMIANFLDEKGRYFSHRILSRDECFSAHSKTANLKALFPC 268
Query: 183 QERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
++ +VI+DD E VW + NLI + Y +F+
Sbjct: 269 GDQMVVIIDDREDVW-NFAPNLIHVRPYHFFQ 299
>gi|70999518|ref|XP_754478.1| RNA Polymerase II CTD phosphatase Fcp1 [Aspergillus fumigatus
Af293]
gi|66852115|gb|EAL92440.1| RNA Polymerase II CTD phosphatase Fcp1, putative [Aspergillus
fumigatus Af293]
Length = 827
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 50/158 (31%), Positives = 81/158 (51%), Gaps = 12/158 (7%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSL--FQMANDKL--------VK 116
RKL LV++LD T++H ++ + H +G + FQ+ +D VK
Sbjct: 158 RKLSLVVDLDQTIIHATVDPTVGEWMEDKDNPNHDALGDVRAFQLVDDGPGMRGCWYYVK 217
Query: 117 LRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKN 176
LRP + +FL+ S L ++++ TM TR YA+ ++D D K F RI++R++ KN
Sbjct: 218 LRPGLESFLQNVSELFELHIYTMGTRAYAQHIAGIIDPDRKLFGDRILSRDESGSLTAKN 277
Query: 177 -PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
L + +VI+DD VW + NLI + Y +F
Sbjct: 278 LQRLFPVDTKMVVIIDDRGDVWR-WSPNLIKVSPYDFF 314
>gi|66363226|ref|XP_628579.1| RNA pol II carboxy terminal domain phosphatase of the HAD
superfamily with a BRCT domain at the C-terminus
[Cryptosporidium parvum Iowa II]
gi|46229587|gb|EAK90405.1| RNA pol II carboxy terminal domain phosphatase of the HAD
superfamily with a BRCT domain at the C-terminus
[Cryptosporidium parvum Iowa II]
gi|323509333|dbj|BAJ77559.1| cgd7_4250 [Cryptosporidium parvum]
gi|323509917|dbj|BAJ77851.1| cgd7_4250 [Cryptosporidium parvum]
Length = 595
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 78/257 (30%), Positives = 112/257 (43%), Gaps = 60/257 (23%)
Query: 66 ERKLQLVLNLDHTLLHCRNIKSL-----------SSGEKYLKKQIHSFIGSLFQMANDKL 114
+ KL +L+LD+TLLH N + SSG+ +++ F+ L Q N
Sbjct: 171 QNKLVAILDLDNTLLHAYNSTKIGCNINLEDFISSSGDP----EMYKFV--LPQDLNTPY 224
Query: 115 -VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKD 173
+KLRP VR FL + + +CT +TR YA+ +LD F RI+ARE +G+D
Sbjct: 225 YLKLRPGVREFLNTIAPYYIMGICTNATREYADVIRAVLDPQRDKFGDRIVARESVDGRD 284
Query: 174 RKNPDL----VRGQERGIVILDDTESVWSDHTENLIVLGK-YVYF--RDKELNGDHKSYS 226
+ D V + R IV+LDD VW E+ +V + Y YF R L + S S
Sbjct: 285 TQK-DFRKICVDVETRAIVLLDDRSDVWDSSLESQVVKAQTYEYFEQRKDALKSHYPSLS 343
Query: 227 ETLTDESENEEALANVL---------------------------RVLKTIHRLFF---DS 256
S N A ++L RV K +H FF ++
Sbjct: 344 SGANSISANSSAPGDILSAALSSLSNASGGNSIADYDRHLDYLIRVFKELHTRFFQNPET 403
Query: 257 VC-GDVRTYLPKVRSEF 272
C GD+ L K+RSE
Sbjct: 404 ACVGDI---LKKMRSEI 417
>gi|401827003|ref|XP_003887594.1| TFIIF-interacting CTD phosphatase [Encephalitozoon hellem ATCC
50504]
gi|392998600|gb|AFM98613.1| TFIIF-interacting CTD phosphatase [Encephalitozoon hellem ATCC
50504]
Length = 408
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 59/165 (35%), Positives = 80/165 (48%), Gaps = 29/165 (17%)
Query: 68 KLQLVLNLDHTLLH-------CRNI-KSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRP 119
KL LVL+LD T+LH C+ I K G KY VKLRP
Sbjct: 60 KLILVLDLDQTVLHTTYGTSDCKGIVKFTMDGCKYS-------------------VKLRP 100
Query: 120 FVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKN-PD 178
+ L + S L +I++ TM TR YAE + ++D KYF RII R++ G K
Sbjct: 101 HLNRMLRRVSKLYEIHVYTMGTRPYAERIIGIIDPAGKYFHDRIITRDENQGVLVKRLSR 160
Query: 179 LVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRDKELNGDHK 223
L + IVILDD VW D+ ENL+++ + YF ++N K
Sbjct: 161 LFPYNHKNIVILDDRADVW-DYNENLVLVKPFWYFNRVDINDPSK 204
>gi|405966173|gb|EKC31485.1| RNA polymerase II subunit A C-terminal domain phosphatase
[Crassostrea gigas]
Length = 837
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 62/223 (27%), Positives = 102/223 (45%), Gaps = 45/223 (20%)
Query: 26 SCAHTTVRDSRCIFCSQAMNDSFGLSFD------------YMLRGLRYSEQEE------- 66
SC H TV C C + G++ + + + L SE++
Sbjct: 79 SCTHPTVMKDMCADCGADLRKEAGIAGNRKEPVSASVAMVHNIPELIISEKQALELGKMD 138
Query: 67 -------RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLV---- 115
RKL L+++LD TL+H N + LK H FQ+++ ++
Sbjct: 139 EDRLLRTRKLVLLVDLDQTLIHTTNDNIPPN----LKDVYH------FQLSHGNMMPWYH 188
Query: 116 -KLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDR 174
++RP FLE S L ++++CT +R YA K LD D KYFS RI++R++ ++
Sbjct: 189 TRIRPRTEKFLENVSKLYELHICTFGSRMYAHIIAKFLDPDGKYFSHRILSRDECFNQNS 248
Query: 175 KNPD---LVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
K + L + + I+DD E VW + + NLI + Y +F+
Sbjct: 249 KMANLKALFPCGDSMVCIIDDREDVW-NFSPNLIHVKPYRFFQ 290
>gi|302838991|ref|XP_002951053.1| hypothetical protein VOLCADRAFT_91454 [Volvox carteri f.
nagariensis]
gi|300263748|gb|EFJ47947.1| hypothetical protein VOLCADRAFT_91454 [Volvox carteri f.
nagariensis]
Length = 699
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 65/213 (30%), Positives = 104/213 (48%), Gaps = 36/213 (16%)
Query: 74 NLDHTLL---HCRNIKSLSSGE--KYLKKQIHSFIGS---LFQMANDKL-VKLRPFVRTF 124
+LDHTLL H + ++ + + L+++ + +G L ++A +KL KLRP V F
Sbjct: 377 DLDHTLLNSVHTSEVGPDTATQLAEVLRREEEANLGPRRLLHRLAENKLWTKLRPGVFEF 436
Query: 125 LEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVRGQE 184
LE ++++ TM + YA KLLD K FSS +IA++ K+ D++ +
Sbjct: 437 LEGLRDDYEMHIYTMGDKTYAAEVRKLLDPTGKLFSS-VIAKDHSTTATAKDLDVLLSAD 495
Query: 185 RGIVILDDTESVWSDHTENLIVLGKYVYFRDKELNGDHKSYSETLTDESENEEALANVLR 244
++LDDTE+VW H NL+ +D +DES + ALA +R
Sbjct: 496 ELALVLDDTEAVWPGHRRNLL--------QD--------------SDESATDGALAAHMR 533
Query: 245 VLKTIHRLFFDSVCGDVRTYLPKVRSEFSRDVL 277
VL+ +H FF + LP + RD+L
Sbjct: 534 VLRAVHTRFFSA----DDPSLPPLERRDVRDIL 562
>gi|169600911|ref|XP_001793878.1| hypothetical protein SNOG_03310 [Phaeosphaeria nodorum SN15]
gi|160705543|gb|EAT90041.2| hypothetical protein SNOG_03310 [Phaeosphaeria nodorum SN15]
Length = 810
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 46/159 (28%), Positives = 91/159 (57%), Gaps = 13/159 (8%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSL--FQMANDKL---------V 115
+KL L+++LD T++H ++++ + + + + + FQ+A+D L V
Sbjct: 159 KKLTLIVDLDQTVIHTTCERTVAEWQADPENPNYEAVKDVKGFQLADDNLSNVAANWYYV 218
Query: 116 KLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAR-EDFNGKDR 174
K+RP ++ F ++ S L ++++ TM+TR YA+A +K++D D KYF RI++R E++ K +
Sbjct: 219 KMRPGLKEFFDKMSKLYEMHVYTMATRAYAQAIMKIIDPDRKYFGDRILSRDENYTDKLK 278
Query: 175 KNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
L +VI+DD VW ++ +L+ + + +F
Sbjct: 279 NLTRLFYQNTAMVVIIDDRADVWQ-YSPHLVRVPVFNFF 316
>gi|189211133|ref|XP_001941897.1| RNA polymerase II subunit A C-terminal domain phosphatase
[Pyrenophora tritici-repentis Pt-1C-BFP]
gi|187977990|gb|EDU44616.1| RNA polymerase II subunit A C-terminal domain phosphatase
[Pyrenophora tritici-repentis Pt-1C-BFP]
Length = 774
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 46/159 (28%), Positives = 89/159 (55%), Gaps = 13/159 (8%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSL--FQMANDKL---------V 115
RKL L+++LD T++H ++++ + + H + + FQ+A+D + V
Sbjct: 159 RKLTLIVDLDQTVIHTTCERTIAEWQADPENPNHDAVKDVQGFQLADDNVSNVAANWYYV 218
Query: 116 KLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAR-EDFNGKDR 174
K+RP ++ F ++ S L ++++ TM+TR YA+A K++D + KYF RI++R E++ K +
Sbjct: 219 KMRPGLKDFFDRVSKLYEMHVYTMATRAYAQAVAKIIDPERKYFGDRILSRDENYTDKLK 278
Query: 175 KNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
L VI+DD VW ++ +L+ + + +F
Sbjct: 279 SLTRLFYQNTAMCVIIDDRADVWQ-YSPHLVRVPVFNFF 316
>gi|123490666|ref|XP_001325656.1| NLI interacting factor-like phosphatase family protein [Trichomonas
vaginalis G3]
gi|121908559|gb|EAY13433.1| NLI interacting factor-like phosphatase family protein [Trichomonas
vaginalis G3]
Length = 474
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 68/270 (25%), Positives = 120/270 (44%), Gaps = 44/270 (16%)
Query: 24 SLSCAHTTVRDSRCIFCSQAMNDSF---------------GLSFDYMLRGLRYSEQ---E 65
S C H+ V + C+ C + M+ ++ +SF+ EQ +
Sbjct: 3 SEECKHSVVINYSCVQCGKPMDQTYLDKNYVRADPNSSVVMISFEEARNRNLQEEQRLID 62
Query: 66 ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQ--MANDKLVKLRPFVRT 123
+KL LV++LD TL+ ++ S E K H+ F+ M + L++ RP VR
Sbjct: 63 AKKLSLVIDLDKTLIDTTEVRDHSEVEAIKKLDPHATEDDFFEFNMNQNLLIRYRPHVRE 122
Query: 124 FLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAR--EDF----NGKDRKNP 177
FL + D+ + T++ YA A + +D D K F +RI +R EDF R
Sbjct: 123 FLASIAPYFDLQIYTLALPSYAHAILSKIDPDDKLFKNRIFSRTAEDFAMLREEAMRNRT 182
Query: 178 DLVRGQ---------ERGIVILDDTESVW--SDHT--ENLIVLGKYVYFRDKELNGDHKS 224
D+V + ++ +++LDD+ VW D+ + L+ + +Y YF + N
Sbjct: 183 DIVHKKNIKKLFPYSDKLVLVLDDSPEVWYCDDNKLFKGLVQIKRYSYFTRQGPN----- 237
Query: 225 YSETLTDESENEEALANVLRVLKTIHRLFF 254
+ T+ + ++ L + VL +H LF+
Sbjct: 238 FPPTVNPDYVEDDILIQMRSVLIEVHDLFY 267
>gi|115396432|ref|XP_001213855.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|114193424|gb|EAU35124.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 820
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 50/158 (31%), Positives = 81/158 (51%), Gaps = 12/158 (7%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSL--FQMANDKL--------VK 116
RKL LV++LD T++H ++ + + H + + FQ+ +D VK
Sbjct: 158 RKLSLVVDLDQTIIHATVDPTVGEWMEDKENPNHQALSDVRAFQLVDDGPGMRGCWYYVK 217
Query: 117 LRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKN 176
LRP + TFLE + L ++++ TM TR YA+ ++D D K F RI++R++ KN
Sbjct: 218 LRPGLETFLENVAELFELHIYTMGTRAYAQHIASIIDPDRKLFGDRILSRDESGSLTAKN 277
Query: 177 -PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
L + +VI+DD VW + NLI + Y +F
Sbjct: 278 LHRLFPVDTKMVVIIDDRGDVWR-WSPNLIKVSPYDFF 314
>gi|330930047|ref|XP_003302870.1| hypothetical protein PTT_14854 [Pyrenophora teres f. teres 0-1]
gi|311321498|gb|EFQ89046.1| hypothetical protein PTT_14854 [Pyrenophora teres f. teres 0-1]
Length = 803
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 46/159 (28%), Positives = 89/159 (55%), Gaps = 13/159 (8%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSL--FQMANDKL---------V 115
RKL L+++LD T++H ++++ + + H + + FQ+A+D + V
Sbjct: 159 RKLTLIVDLDQTVIHTTCERTIAEWQADPENPNHDAVKDVQGFQLADDNVSNVAANWYYV 218
Query: 116 KLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAR-EDFNGKDR 174
K+RP ++ F ++ S L ++++ TM+TR YA+A K++D + KYF RI++R E++ K +
Sbjct: 219 KMRPGLKDFFDRVSKLYEMHVYTMATRAYAQAVAKIIDPERKYFGDRILSRDENYTDKLK 278
Query: 175 KNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
L VI+DD VW ++ +L+ + + +F
Sbjct: 279 SLTRLFYQNTAMCVIIDDRADVWQ-YSPHLVRVPVFNFF 316
>gi|302306421|ref|NP_982820.2| ABL127Wp [Ashbya gossypii ATCC 10895]
gi|299788508|gb|AAS50644.2| ABL127Wp [Ashbya gossypii ATCC 10895]
gi|374106022|gb|AEY94932.1| FABL127Wp [Ashbya gossypii FDAG1]
Length = 728
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 62/232 (26%), Positives = 102/232 (43%), Gaps = 45/232 (19%)
Query: 26 SCAHTTVRDSRCIFCSQAMNDSFG---------LSFDYMLRGLRYSEQ------------ 64
+C H C+ C QA+ D G L+ + +R SE+
Sbjct: 100 ACPHDVTYGGLCVQCGQAVEDEAGAADGVEQAKLTVSHTNTHIRVSERQAASLGQSAQLK 159
Query: 65 --EERKLQLVLNLDHTLLHCRNIKSLSSGEKYLK-------KQIHSF------IGSLFQM 109
E RKL LV++LD T++HC ++ K K + SF + F M
Sbjct: 160 LREARKLVLVVDLDQTVIHCGVDPTIGEWSKDPNNPNYEALKDVQSFSLDEEPVLPPFYM 219
Query: 110 ANDKL-------VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSR 162
VKLRP ++ F + + ++++ TM+TR YA K++D D K F R
Sbjct: 220 GPKPPTRKCWYYVKLRPGLKEFFAKIAPHFELHIYTMATRAYALEIAKIIDPDGKLFGDR 279
Query: 163 IIAREDFNGKDRKNPD-LVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
I++R++ +K+ + L + +V++DD VW + ENLI + Y +F
Sbjct: 280 ILSRDENGSLTQKSLERLFPMDQSMVVVIDDRGDVW-NWCENLIKVVPYDFF 330
>gi|340518072|gb|EGR48314.1| predicted protein [Trichoderma reesei QM6a]
Length = 594
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 47/162 (29%), Positives = 84/162 (51%), Gaps = 13/162 (8%)
Query: 66 ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSL--FQMANDKL--------- 114
+RKL LV++LD T++H ++ ++ H + + FQ+ +D
Sbjct: 156 QRKLSLVVDLDQTIIHACIEPTIGEWQRDPTNPNHEAVKDVKSFQLNDDGPRGLASGCTY 215
Query: 115 -VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKD 173
+KLRP ++ FLE S+ ++++ TM TR YA +++D D K F +R+I+R++
Sbjct: 216 YIKLRPGLKEFLEAVSTKYELHVYTMGTRAYALNIARIVDPDKKLFGNRVISRDENGSIT 275
Query: 174 RKN-PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
K+ L +VI+DD VW ++ NLI + Y +F+
Sbjct: 276 AKSLQRLFPVSTDMVVIIDDRADVWPNNRPNLIKVAPYDFFK 317
>gi|449675210|ref|XP_002161785.2| PREDICTED: RNA polymerase II subunit A C-terminal domain
phosphatase-like [Hydra magnipapillata]
Length = 718
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 55/161 (34%), Positives = 81/161 (50%), Gaps = 14/161 (8%)
Query: 60 RYSEQE---ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVK 116
+Y EQ+ RKL LV++LD TL+H ++ E K F L + K
Sbjct: 143 KYDEQQLLRARKLVLVVDLDMTLIH-------TTVEPTPKNTKDVFSFKLPGHQYEYHTK 195
Query: 117 LRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKN 176
LRP R FLE S ++++ TM +R YA K LD D K+F+ RI +R++F K
Sbjct: 196 LRPGARKFLESISKFYELHIFTMGSRLYAHTVAKCLDPDGKFFAHRIRSRDEFINSFSKF 255
Query: 177 PD---LVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
D L + + I+DD E VW ++ NLI + Y +F+
Sbjct: 256 HDLKALFPCGDHMVCIIDDREDVW-NYAPNLITVKPYKFFK 295
>gi|451853161|gb|EMD66455.1| hypothetical protein COCSADRAFT_112846 [Cochliobolus sativus
ND90Pr]
Length = 803
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 46/159 (28%), Positives = 89/159 (55%), Gaps = 13/159 (8%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSL--FQMANDKL---------V 115
RKL L+++LD T++H ++++ + + H + + FQ+A+D + V
Sbjct: 159 RKLTLIVDLDQTVIHTTCERTIAEWQADPENPNHDAVKDVQGFQLADDNVSNVAANWYYV 218
Query: 116 KLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAR-EDFNGKDR 174
K+RP ++ F ++ S L ++++ TM+TR YA+A K++D + KYF RI++R E++ K +
Sbjct: 219 KMRPGLKDFFDRVSKLYEMHVYTMATRAYAQAVAKIIDPERKYFGDRILSRDENYTDKLK 278
Query: 175 KNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
L VI+DD VW ++ +L+ + + +F
Sbjct: 279 SLTRLFYQNTAMCVIIDDRADVWQ-YSPHLVRVPVFNFF 316
>gi|400603434|gb|EJP71032.1| FCP1-like phosphatase [Beauveria bassiana ARSEF 2860]
Length = 774
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 53/169 (31%), Positives = 87/169 (51%), Gaps = 16/169 (9%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSL--FQMANDKL---------- 114
RKL LV++LD T++H ++ ++ HS + + FQ+ +D
Sbjct: 157 RKLSLVVDLDQTIIHACIEPTVGEWQRDPSNPNHSAVKDVRSFQLNDDGPRGLASGCTYY 216
Query: 115 VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGK-- 172
+KLRP + FLE+ S + ++++ TM TR YA K++D D K F +R+I+R D NG
Sbjct: 217 IKLRPGLSEFLEEISKMYELHVYTMGTRAYALNIAKIVDPDRKLFGNRVISR-DENGSIT 275
Query: 173 DRKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRD-KELNG 220
+ L +VI+DD VW + NLI + Y +F+ ++NG
Sbjct: 276 SKSLARLFPVSTDMVVIIDDRADVWPMNRPNLIKVVPYDFFKGIGDING 324
>gi|357601986|gb|EHJ63229.1| putative RNA polymerase II subunit A C-terminal domain phosphatase
[Danaus plexippus]
Length = 683
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 60/212 (28%), Positives = 93/212 (43%), Gaps = 29/212 (13%)
Query: 27 CAHTTVRDSRCIFCSQAMN-------DSFGLSFDYMLRGLRYSEQ--------------E 65
C H TV C C + D + + + L+ SE+ +
Sbjct: 82 CRHPTVMKEMCAECGADLRSGESQKRDVAVVPMVHSVPELKVSEELAQKLGREDADRLLK 141
Query: 66 ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFL 125
+RKL L+++LD TL+H N + +K +H F+ +LRP FL
Sbjct: 142 DRKLVLLVDLDQTLVHTTN----DNIPPNIKDVLHFFLRGPGNQGRWCHTRLRPKTHEFL 197
Query: 126 EQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE---DFNGKDRKNPDLVRG 182
E A+ ++++CT R YA A +LLD K+FS RI++R+ D K L
Sbjct: 198 ESAAKNYELHVCTFGARQYAHAITELLDPQKKFFSHRILSRDECFDARTKSANLKALFPC 257
Query: 183 QERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+ + I+DD E VW H NLI + Y +F+
Sbjct: 258 GDNMVCIIDDREDVWR-HASNLIQVRPYSFFQ 288
>gi|452004576|gb|EMD97032.1| hypothetical protein COCHEDRAFT_1163398 [Cochliobolus
heterostrophus C5]
Length = 803
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 46/159 (28%), Positives = 89/159 (55%), Gaps = 13/159 (8%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSL--FQMANDKL---------V 115
RKL L+++LD T++H ++++ + + H + + FQ+A+D + V
Sbjct: 159 RKLTLIVDLDQTVIHTTCERTIAEWQADPENPNHDAVKDVQGFQLADDNVSNVAANWYYV 218
Query: 116 KLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAR-EDFNGKDR 174
K+RP ++ F ++ S L ++++ TM+TR YA+A K++D + KYF RI++R E++ K +
Sbjct: 219 KMRPGLKDFFDRVSKLYEMHVYTMATRAYAQAVAKIIDPERKYFGDRILSRDENYTDKLK 278
Query: 175 KNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
L VI+DD VW ++ +L+ + + +F
Sbjct: 279 SLTRLFYQNTAMCVIIDDRADVWQ-YSPHLVRVPVFNFF 316
>gi|429963056|gb|ELA42600.1| FCP1-like phosphatase, phosphatase domain-containing protein
[Vittaforma corneae ATCC 50505]
Length = 445
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 60/213 (28%), Positives = 100/213 (46%), Gaps = 44/213 (20%)
Query: 26 SCAHTTVRDSRCIFCSQAMNDSFGLS--FDYMLRGLRYSEQ-------------EERKLQ 70
+C H+ DS C C + L + R + SE+ EE+K+
Sbjct: 24 NCTHSLRIDSLCAICGAEILKGTDLVPVLHHTDRVFQTSEEARKLQKIRNKQLNEEKKMI 83
Query: 71 LVLNLDHTLLH-------CRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRT 123
L+L+LD T+LH C S+SS Y VKLRP +
Sbjct: 84 LILDLDQTILHTTLWKIDCDFTFSISSTMFY--------------------VKLRPHLNR 123
Query: 124 FLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVRGQ 183
FLE+ S + +I++ TM TR Y K +D + YF RI++R + + +K+ + +
Sbjct: 124 FLEKISKMFEIHIYTMGTREYVTEICKAIDPNGIYFGDRIVSRNENFNELKKSIERITCI 183
Query: 184 ERGIVILDDTESVWSDHTENLIVLGKYVYFRDK 216
R +VI+DD VW ++++NL+++ + ++RDK
Sbjct: 184 SRNVVIIDDRADVW-NYSKNLVLIRPF-WYRDK 214
>gi|443696103|gb|ELT96883.1| hypothetical protein CAPTEDRAFT_23527, partial [Capitella teleta]
Length = 562
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 61/215 (28%), Positives = 94/215 (43%), Gaps = 33/215 (15%)
Query: 26 SCAHTTVRDSRCIFCSQAMNDSFG---------------------LSFDYMLRGLRYSEQ 64
C H TV C C + D +S L + EQ
Sbjct: 75 GCTHPTVMKDMCAECGADLRDGTPGKRKNPSDASVAMVHSIPELIISQKVTLELGKADEQ 134
Query: 65 E---ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFV 121
++KL L+++LD TL+H N K ++ + Q+H L+ K RP
Sbjct: 135 RLIRDKKLVLLVDLDQTLIHTTNDKVPANLKDVHHFQLHHGRNLLWYH-----TKFRPGT 189
Query: 122 RTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED-FNGKDRKN--PD 178
FLE+ S L ++++CT R YA KLLD D KYFS RI++R++ FN +
Sbjct: 190 EKFLERISKLYELHICTFGVRMYAHTIAKLLDPDGKYFSHRILSRDECFNPTSKTGNLKA 249
Query: 179 LVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
L + + I+DD E VW + +L+ + Y++F
Sbjct: 250 LFPCGDSMVCIIDDREDVWR-FSPSLVHVKPYLFF 283
>gi|67624539|ref|XP_668552.1| NLI interacting factor [Cryptosporidium hominis TU502]
gi|54659751|gb|EAL38315.1| NLI interacting factor [Cryptosporidium hominis]
Length = 595
Score = 75.1 bits (183), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 77/257 (29%), Positives = 111/257 (43%), Gaps = 60/257 (23%)
Query: 66 ERKLQLVLNLDHTLLHCRNIKSL-----------SSGEKYLKKQIHSFIGSLFQMANDKL 114
+ KL +L+LD+TLLH N + SSG+ +++ F+ L Q N
Sbjct: 171 QNKLVAILDLDNTLLHAYNSTKIGCNINLEDFISSSGDP----EMYKFV--LPQDLNTPY 224
Query: 115 -VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKD 173
+KLRP VR FL + + +CT +TR YA+ +LD F RI+ARE +G+D
Sbjct: 225 YLKLRPGVREFLNTIAPYYIMGICTNATREYADVIRAVLDPQRDKFGDRIVARESVDGRD 284
Query: 174 RKNPDL----VRGQERGIVILDDTESVWSDHTENLIVLGK-YVYF--RDKELNGDHKSYS 226
+ D V + R IV+LDD VW E+ +V + Y YF R L + S
Sbjct: 285 TQK-DFRKICVDVETRAIVLLDDRSDVWDSSLESQVVKAQTYEYFEQRKDALKSHYPPLS 343
Query: 227 ETLTDESENEEALANVL---------------------------RVLKTIHRLFF---DS 256
S N A ++L RV K +H FF ++
Sbjct: 344 SGANSISANSSAPGDILSAALSSLSNASGGNSIADYDRHLDYLIRVFKELHTRFFQNPET 403
Query: 257 VC-GDVRTYLPKVRSEF 272
C GD+ L K+RSE
Sbjct: 404 ACVGDI---LKKMRSEI 417
>gi|303317134|ref|XP_003068569.1| NLI interacting factor-like phosphatase family protein
[Coccidioides posadasii C735 delta SOWgp]
gi|240108250|gb|EER26424.1| NLI interacting factor-like phosphatase family protein
[Coccidioides posadasii C735 delta SOWgp]
gi|320038484|gb|EFW20419.1| RNA Polymerase II CTD phosphatase Fcp1 [Coccidioides posadasii str.
Silveira]
Length = 868
Score = 75.1 bits (183), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 49/158 (31%), Positives = 83/158 (52%), Gaps = 12/158 (7%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSL--FQMANDKL--------VK 116
RKL LV++LD T++H +++ ++ H + + FQ+ +D +K
Sbjct: 158 RKLSLVVDLDQTIIHATVDPTVAEWQEDKTNPNHEAVKDVRAFQLVDDGPGMRGCWYYIK 217
Query: 117 LRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKN 176
LRP + FL SSL ++++ TM TR YA+ ++D D K F RI++R++ KN
Sbjct: 218 LRPGLEDFLRSISSLYELHIYTMGTRAYAQNIANIVDPDRKIFGDRILSRDESGSLTAKN 277
Query: 177 -PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
L + +VI+DD VW + ++NLI + Y +F
Sbjct: 278 LQRLFPVDTKMVVIIDDRGDVW-NWSDNLIRVHPYDFF 314
>gi|398396164|ref|XP_003851540.1| hypothetical protein MYCGRDRAFT_44229 [Zymoseptoria tritici IPO323]
gi|339471420|gb|EGP86516.1| hypothetical protein MYCGRDRAFT_44229 [Zymoseptoria tritici IPO323]
Length = 822
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 52/162 (32%), Positives = 83/162 (51%), Gaps = 9/162 (5%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSL--FQMANDK----LVKLRPF 120
RKL LV++LD T++ ++ + + + FQ+A+D VKLRP
Sbjct: 164 RKLSLVVDLDQTIIQANVEPTIGEWKNDPTNPNWKALQDVCQFQLADDGRTWYYVKLRPG 223
Query: 121 VRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKN-PDL 179
++ FL S L ++++ TM TR YA+ K++D D K F RI++R++ KN L
Sbjct: 224 LKDFLRDMSELYELHIYTMGTRAYADNIAKIVDPDRKVFGDRILSRDENGSMTVKNLKRL 283
Query: 180 VRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRD-KELNG 220
R +VI+DD VW T NLI + + +F ++NG
Sbjct: 284 FHADTRMVVIIDDRADVWH-WTPNLIKVNAFEFFPGVGDING 324
>gi|402080254|gb|EJT75399.1| RNA polymerase II subunit A domain phosphatase [Gaeumannomyces
graminis var. tritici R3-111a-1]
Length = 850
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 47/163 (28%), Positives = 86/163 (52%), Gaps = 13/163 (7%)
Query: 65 EERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSL--FQMANDKL-------- 114
++RKL LV++LD T++H ++ ++ H + + FQ+ +D
Sbjct: 166 DQRKLILVVDLDQTIIHACIEPTIGDWQRDPTNPNHEAVKDVKSFQLNDDGPRGLASGCW 225
Query: 115 --VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGK 172
+K+RP + FLE+ +++ ++++ TM TR YA K++D D K F +R+I+R++
Sbjct: 226 YYIKMRPGLVDFLEKIATMYELHVYTMGTRAYAMNIAKIVDPDQKLFGNRVISRDENGSM 285
Query: 173 DRKN-PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
K+ L R +VI+DD VW + NLI + Y +F+
Sbjct: 286 TAKSLQRLFPVSTRMVVIIDDRADVWPRNRPNLIKVVPYDFFK 328
>gi|392870961|gb|EAS32809.2| FCP1-like phosphatase, phosphatase domain-containing protein
[Coccidioides immitis RS]
Length = 868
Score = 74.7 bits (182), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 49/158 (31%), Positives = 83/158 (52%), Gaps = 12/158 (7%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSL--FQMANDKL--------VK 116
RKL LV++LD T++H +++ ++ H + + FQ+ +D +K
Sbjct: 158 RKLSLVVDLDQTIIHATVDPTVAEWQEDKTNPNHEAVKDVRAFQLVDDGPGMRGCWYYIK 217
Query: 117 LRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKN 176
LRP + FL SSL ++++ TM TR YA+ ++D D K F RI++R++ KN
Sbjct: 218 LRPGLEDFLRSISSLYELHIYTMGTRAYAQNIANIVDPDRKIFGDRILSRDESGSLTAKN 277
Query: 177 -PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
L + +VI+DD VW + ++NLI + Y +F
Sbjct: 278 LQRLFPVDTKMVVIIDDRGDVW-NWSDNLIRVHPYDFF 314
>gi|119187277|ref|XP_001244245.1| hypothetical protein CIMG_03686 [Coccidioides immitis RS]
Length = 839
Score = 74.7 bits (182), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 49/158 (31%), Positives = 83/158 (52%), Gaps = 12/158 (7%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSL--FQMANDKL--------VK 116
RKL LV++LD T++H +++ ++ H + + FQ+ +D +K
Sbjct: 129 RKLSLVVDLDQTIIHATVDPTVAEWQEDKTNPNHEAVKDVRAFQLVDDGPGMRGCWYYIK 188
Query: 117 LRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKN 176
LRP + FL SSL ++++ TM TR YA+ ++D D K F RI++R++ KN
Sbjct: 189 LRPGLEDFLRSISSLYELHIYTMGTRAYAQNIANIVDPDRKIFGDRILSRDESGSLTAKN 248
Query: 177 -PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
L + +VI+DD VW + ++NLI + Y +F
Sbjct: 249 LQRLFPVDTKMVVIIDDRGDVW-NWSDNLIRVHPYDFF 285
>gi|195121496|ref|XP_002005256.1| GI20391 [Drosophila mojavensis]
gi|193910324|gb|EDW09191.1| GI20391 [Drosophila mojavensis]
Length = 880
Score = 74.7 bits (182), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 63/226 (27%), Positives = 104/226 (46%), Gaps = 32/226 (14%)
Query: 14 KFVIKRKCEQSLS-CAHTTVRDSRCIFCSQAM--NDSFGLS-----FDYMLRGLRYSEQ- 64
+ VIK LS C HTTV C C + ND+ S + + L+ +++
Sbjct: 112 EIVIKGDALLELSECIHTTVIKDMCADCGADLRQNDNGQTSEASVPMVHTMPDLKVTQKL 171
Query: 65 -------------EERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMAN 111
+RKL L+++LD T++H N + + Q++ +
Sbjct: 172 AQKLGHDDTRRLLTDRKLVLLVDLDQTVIHTTNDTVPDNIKGIYHFQLYGPQSPWYH--- 228
Query: 112 DKLVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED-FN 170
+LRP FLE+ S L ++++CT R YA +LLD D K+FS RI++R++ FN
Sbjct: 229 ---TRLRPGTAEFLEKMSELYELHICTFGARNYAHMIAQLLDPDGKFFSHRILSRDECFN 285
Query: 171 GKDRKN--PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+ + L + + I+DD E VW + NLI + Y +F+
Sbjct: 286 ATSKTDNLKALFPNGDSMVCIIDDREDVW-NMASNLIQVKPYHFFQ 330
>gi|195029035|ref|XP_001987380.1| GH21892 [Drosophila grimshawi]
gi|193903380|gb|EDW02247.1| GH21892 [Drosophila grimshawi]
Length = 889
Score = 74.7 bits (182), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 59/212 (27%), Positives = 98/212 (46%), Gaps = 31/212 (14%)
Query: 27 CAHTTVRDSRCIFCSQAM--NDSFGLS-----FDYMLRGLRYSEQ--------------E 65
C HTTV C C + ND+ S + + L+ +++
Sbjct: 122 CIHTTVIKDMCADCGADLRQNDNGQTSEASVPMVHTMPDLKVTQKLAQKLGHDDTRRLLA 181
Query: 66 ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFL 125
+RKL L+++LD T++H N + + Q++ + +LRP FL
Sbjct: 182 DRKLVLLVDLDQTVIHTTNDTVPDNIKGIYHFQLYGPQSPWYH------TRLRPGTAEFL 235
Query: 126 EQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED-FNGKDRKN--PDLVRG 182
E+ S L ++++CT R YA +LLD D K+FS RI++R++ FN + + L
Sbjct: 236 ERMSQLYELHICTFGARNYAHMIAQLLDPDGKFFSHRILSRDECFNATSKTDNLKALFPN 295
Query: 183 QERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+ + I+DD E VWS NLI + Y +F+
Sbjct: 296 GDSMVCIIDDREDVWS-MASNLIQVKPYHFFQ 326
>gi|449299873|gb|EMC95886.1| hypothetical protein BAUCODRAFT_71386 [Baudoinia compniacensis UAMH
10762]
Length = 790
Score = 74.7 bits (182), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 50/160 (31%), Positives = 85/160 (53%), Gaps = 16/160 (10%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSL--FQMANDKL--------VK 116
RKL LV++LD T++H +++ + H+ + + FQ+ +D +K
Sbjct: 159 RKLSLVVDLDQTIIHATVDPTVAEWQADETNPNHAAVKGVRKFQLVDDGPGGRGTWYYIK 218
Query: 117 LRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED---FNGKD 173
LRP + FL+ S ++++ TM+TR YAE KL+D K F++RI++R++ N K
Sbjct: 219 LRPGLSDFLQLVSQYYELHIYTMATRAYAEEIAKLVDPGRKLFANRILSRDENGSMNSKS 278
Query: 174 RKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
K L + +VI+DD VWS + NL+ + Y +F
Sbjct: 279 LKR--LFPVDTKMVVIIDDRGDVWS-WSPNLVKVSAYDFF 315
>gi|322706326|gb|EFY97907.1| RNA Polymerase II CTD phosphatase Fcp1 [Metarhizium anisopliae
ARSEF 23]
Length = 807
Score = 74.3 bits (181), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 49/163 (30%), Positives = 85/163 (52%), Gaps = 15/163 (9%)
Query: 66 ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSL--FQMANDKL--------- 114
+RKL LV++LD T++H ++ +K H + + FQ+ +D
Sbjct: 156 QRKLSLVVDLDQTIIHACIEPTIGEWQKDESNPNHEAVKDVKSFQLNDDGPRGLASGCTY 215
Query: 115 -VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGK- 172
+KLRP ++ FLE+ +++ ++++ TM TR YA +++D D K F +R+I+R D NG
Sbjct: 216 YIKLRPGLQEFLEEIATMYELHVYTMGTRAYALNIARIVDPDRKLFGNRVISR-DENGSI 274
Query: 173 -DRKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+ L +VI+DD VW + NLI + Y +F+
Sbjct: 275 TSKSLQRLFPVSTNMVVIIDDRADVWPRNRPNLIKVVPYDFFK 317
>gi|254568460|ref|XP_002491340.1| hypothetical protein [Komagataella pastoris GS115]
gi|238031137|emb|CAY69060.1| hypothetical protein PAS_chr2-1_0845 [Komagataella pastoris GS115]
gi|328352145|emb|CCA38544.1| hypothetical protein PP7435_Chr2-0862 [Komagataella pastoris CBS
7435]
Length = 733
Score = 74.3 bits (181), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 65/237 (27%), Positives = 109/237 (45%), Gaps = 56/237 (23%)
Query: 27 CAHTTVRDSRCIFCSQAM--NDSFGLSFD-----YMLRG-----LRYSEQE--------- 65
C+H+ C C A+ ND G S+D M G + +E E
Sbjct: 107 CSHSVQYGGLCALCGSAVEGNDYTGFSYDKQAPVVMSHGSADLKISLTEAEKIEQTSSKR 166
Query: 66 ---ERKLQLVLNLDHTLLHC---------------------RNIKSLSSGEKYLKKQIHS 101
E+KL LV++LD T++H ++++S S E+ + + +
Sbjct: 167 LLKEKKLSLVVDLDQTVIHATVDPTVGEWMKDPNNANYPAVKDVRSFSLKEEVILPE--N 224
Query: 102 FIGSLFQMANDKL----VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSK 157
++G Q + VKLRP +R FLE S ++++ TM+TR YA+ K++D D K
Sbjct: 225 YVG---QKPPATVCWYYVKLRPHLREFLEHVSERYELHIYTMATRQYAKEIAKIIDPDEK 281
Query: 158 YFSSRIIAREDFNGKDRKNPD-LVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
YF RI++R++ +K+ L +V++DD VW + + NLI + Y +F
Sbjct: 282 YFGDRILSRDESGSLTQKSLQRLFPVDTSMVVVIDDRGDVW-NWSSNLIKVVPYDFF 337
>gi|367009794|ref|XP_003679398.1| hypothetical protein TDEL_0B00580 [Torulaspora delbrueckii]
gi|359747056|emb|CCE90187.1| hypothetical protein TDEL_0B00580 [Torulaspora delbrueckii]
Length = 713
Score = 74.3 bits (181), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 62/232 (26%), Positives = 105/232 (45%), Gaps = 40/232 (17%)
Query: 21 CEQSLSCAHTTVRDSRCIFCSQAMNDS----FGLSFDYMLRGLRYSEQE----------- 65
CE S C H V C C + +++S L+ + L+ S +E
Sbjct: 95 CEISRPCNHDIVYGGLCTLCGKEVDESEQFNGNLAISHTDVNLKVSRKEATDIENNLKTR 154
Query: 66 ---ERKLQLVLNLDHTLLHCRNIKSL------SSGEKYLK-KQIHSF------IGSLFQM 109
+KL LV++LD T++HC ++ SS Y K + SF I L M
Sbjct: 155 LRESKKLVLVVDLDQTVIHCGVDPTIGEWKRDSSNPNYEALKDVQSFALDEEPILPLLYM 214
Query: 110 ANDK-------LVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSR 162
VK+RP ++ F ++ + L ++++ TM+TR YA K++D D F R
Sbjct: 215 GPKPPVRKCWYYVKVRPGLKEFFDKVAPLFEMHIYTMATRAYALEIAKIIDPDGSLFGDR 274
Query: 163 IIAREDFNGKDRKNPD-LVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
I++R++ +K+ + L + +V++DD VW + NLI + Y +F
Sbjct: 275 ILSRDENGSITQKSLERLFPTDQSMVVVIDDRGDVW-NWCPNLIKVVPYNFF 325
>gi|308500103|ref|XP_003112237.1| CRE-FCP-1 protein [Caenorhabditis remanei]
gi|308268718|gb|EFP12671.1| CRE-FCP-1 protein [Caenorhabditis remanei]
Length = 664
Score = 74.3 bits (181), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 53/199 (26%), Positives = 100/199 (50%), Gaps = 20/199 (10%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKY---LKKQIHSFIGSLFQMANDKLVKLRPFVRT 123
RKL L+++LD T++H + + EK+ K +HS + + KLRP
Sbjct: 142 RKLVLLVDLDQTIIHTSDKPMSADAEKHKDITKYNLHSRVYT---------TKLRPHTTE 192
Query: 124 FLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDF---NGKDRKNPDLV 180
FL + +++ ++++ T R YA ++LD D++ F RI++R++ K R L
Sbjct: 193 FLNKMAAMYEMHIVTYGQRQYAHRIAQILDPDARLFGQRILSRDELFSAQHKTRNLKALF 252
Query: 181 RGQERGIVILDDTESVWSDHTENLIVLGKYVYFRD-KELNGDHKSYSE---TLTDESENE 236
+ +VI+DD VW ++E LI + Y +F++ ++N S + + D++ +
Sbjct: 253 PCGDNLVVIIDDRADVWQ-YSEALIQIKPYRFFKEVGDINAPKDSKEQMPVQIEDDAHED 311
Query: 237 EALANVLRVLKTIHRLFFD 255
L + RVL IH +++
Sbjct: 312 RVLEEIERVLTNIHDKYYE 330
>gi|46126951|ref|XP_388029.1| hypothetical protein FG07853.1 [Gibberella zeae PH-1]
Length = 765
Score = 74.3 bits (181), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 50/163 (30%), Positives = 83/163 (50%), Gaps = 15/163 (9%)
Query: 66 ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSL--FQMANDKL--------- 114
+RKL LV++LD T++H ++ ++ H + + FQ+ +D
Sbjct: 156 QRKLSLVVDLDQTIIHACIEPTIGEWQRDPSNPNHDAVKDVKSFQLNDDGPRGVTSGCTY 215
Query: 115 -VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGK- 172
+KLRP + FLE+ S + ++++ TM TR YA K++D D K F +R+I+R D NG
Sbjct: 216 YIKLRPGLMEFLEEVSKMYELHVYTMGTRAYALNIAKIVDPDKKLFGNRVISR-DENGSI 274
Query: 173 -DRKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+ L +VI+DD VW + NLI + Y +F+
Sbjct: 275 TSKSLQRLFPVSTDMVVIIDDRADVWPMNRPNLIKVVPYDFFK 317
>gi|408390401|gb|EKJ69801.1| hypothetical protein FPSE_10001 [Fusarium pseudograminearum CS3096]
Length = 765
Score = 74.3 bits (181), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 50/163 (30%), Positives = 83/163 (50%), Gaps = 15/163 (9%)
Query: 66 ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSL--FQMANDKL--------- 114
+RKL LV++LD T++H ++ ++ H + + FQ+ +D
Sbjct: 156 QRKLSLVVDLDQTIIHACIEPTIGEWQRDPSNPNHDAVKDVKSFQLNDDGPRGVTSGCTY 215
Query: 115 -VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGK- 172
+KLRP + FLE+ S + ++++ TM TR YA K++D D K F +R+I+R D NG
Sbjct: 216 YIKLRPGLMEFLEEVSKMYELHVYTMGTRAYALNIAKIVDPDKKLFGNRVISR-DENGSI 274
Query: 173 -DRKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+ L +VI+DD VW + NLI + Y +F+
Sbjct: 275 TSKSLQRLFPVSTDMVVIIDDRADVWPMNRPNLIKVVPYDFFK 317
>gi|159127495|gb|EDP52610.1| RNA Polymerase II CTD phosphatase Fcp1, putative [Aspergillus
fumigatus A1163]
Length = 827
Score = 73.9 bits (180), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 49/158 (31%), Positives = 80/158 (50%), Gaps = 12/158 (7%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSL--FQMANDKL--------VK 116
RKL LV++LD T++H ++ + H + + FQ+ +D VK
Sbjct: 158 RKLSLVVDLDQTIIHATVDPTVGEWMEDKDNPNHDALSDVRAFQLVDDGPGMRGCWYYVK 217
Query: 117 LRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKN 176
LRP + +FL+ S L ++++ TM TR YA+ ++D D K F RI++R++ KN
Sbjct: 218 LRPGLESFLQNVSELFELHIYTMGTRAYAQHIAGIIDPDRKLFGDRILSRDESGSLTAKN 277
Query: 177 -PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
L + +VI+DD VW + NLI + Y +F
Sbjct: 278 LQRLFPVDTKMVVIIDDRGDVWR-WSPNLIKVSPYDFF 314
>gi|83767703|dbj|BAE57842.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 820
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 49/158 (31%), Positives = 80/158 (50%), Gaps = 12/158 (7%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSL--FQMANDKL--------VK 116
RKL LV++LD T++H ++ + H + + FQ+ +D VK
Sbjct: 158 RKLSLVVDLDQTIIHATVDPTVGEWMEDKDNPNHQALSDVRAFQLVDDGPGMRGCWYYVK 217
Query: 117 LRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKN 176
LRP + +FL+ S L ++++ TM TR YA+ ++D D K F RI++R++ KN
Sbjct: 218 LRPGLESFLQNVSELFELHIYTMGTRAYAQHIASIIDPDRKLFGDRILSRDESGSLTAKN 277
Query: 177 -PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
L + +VI+DD VW + NLI + Y +F
Sbjct: 278 LHRLFPVDTKMVVIIDDRGDVWR-WSPNLIKVSPYDFF 314
>gi|391867600|gb|EIT76846.1| TFIIF-interacting CTD phosphatase [Aspergillus oryzae 3.042]
Length = 820
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 49/158 (31%), Positives = 80/158 (50%), Gaps = 12/158 (7%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSL--FQMANDKL--------VK 116
RKL LV++LD T++H ++ + H + + FQ+ +D VK
Sbjct: 158 RKLSLVVDLDQTIIHATVDPTVGEWMEDKDNPNHQALSDVRAFQLVDDGPGMRGCWYYVK 217
Query: 117 LRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKN 176
LRP + +FL+ S L ++++ TM TR YA+ ++D D K F RI++R++ KN
Sbjct: 218 LRPGLESFLQNVSELFELHIYTMGTRAYAQHIASIIDPDRKLFGDRILSRDESGSLTAKN 277
Query: 177 -PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
L + +VI+DD VW + NLI + Y +F
Sbjct: 278 LHRLFPVDTKMVVIIDDRGDVWR-WSPNLIKVSPYDFF 314
>gi|350634686|gb|EHA23048.1| hypothetical protein ASPNIDRAFT_197473 [Aspergillus niger ATCC
1015]
Length = 824
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 64/226 (28%), Positives = 98/226 (43%), Gaps = 41/226 (18%)
Query: 26 SCAHTTVRDSRCIFCSQAMND-SFGLSFDYMLRG----------LRYSEQE--------- 65
CAH C C + M D S+ + R L SEQE
Sbjct: 92 PCAHEVQFGGLCAICGKDMTDFSYNTEVTDVHRAPIQMAHDNTTLTVSEQEATRVEEDAK 151
Query: 66 -----ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIG----SLFQMANDKL-- 114
RKL LV++LD T++H ++ GE K+ ++ FQ+ +D
Sbjct: 152 RRLLANRKLSLVVDLDQTIIHATVDPTV--GEWMQDKENPNYQALSDVRAFQLVDDGPGM 209
Query: 115 ------VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED 168
VKLRP + +FL+ S + ++++ TM TR YA+ ++D D K F RI++R++
Sbjct: 210 RGCWYYVKLRPGLESFLQNVSEMYELHIYTMGTRSYAQHIASIIDPDRKLFGDRILSRDE 269
Query: 169 FNGKDRKN-PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
KN L + +VI+DD VW NLI + Y +F
Sbjct: 270 SGSLVAKNLHRLFPVDTKMVVIIDDRGDVWR-WNPNLIKVSPYDFF 314
>gi|156837042|ref|XP_001642557.1| hypothetical protein Kpol_1068p9 [Vanderwaltozyma polyspora DSM
70294]
gi|156113100|gb|EDO14699.1| hypothetical protein Kpol_1068p9 [Vanderwaltozyma polyspora DSM
70294]
Length = 745
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 57/233 (24%), Positives = 105/233 (45%), Gaps = 42/233 (18%)
Query: 21 CEQSLSCAHTTVRDSRCIFCSQAMNDS----FGLSFDYMLRGLRYSEQE----------- 65
CE C H V C C + +++S L+ + L+ S +E
Sbjct: 99 CEIVRPCNHDIVYAGICTMCGKEVDESDQVSANLTISHTDTNLKVSRREANDIGQGIKKR 158
Query: 66 ---ERKLQLVLNLDHTLLHC---------------RNIKSLSSGEKYLKKQIHSFIGSLF 107
E+KL LV++LD T++HC N ++L + ++ ++ + ++
Sbjct: 159 LIREKKLILVVDLDQTVIHCGVDPTIAEWKNDPTNPNFETLRDVKSFVLEE-EPILPPMY 217
Query: 108 QMANDKL------VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSS 161
VK+RP ++ F E+ S L ++++ TM+TR YA+ K++D D F+
Sbjct: 218 MGPKPPTHKCWYYVKIRPGLKEFFEEVSKLYEMHIYTMATRSYAQEIAKIIDPDGTLFAD 277
Query: 162 RIIAREDFNGKDRKNPD-LVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
RI++R + K+ + L + +V++DD VW + NLI + Y +F
Sbjct: 278 RILSRNENGSLTHKSLERLFPTDQSMVVVIDDRGDVW-NWCPNLIKVTPYNFF 329
>gi|300701489|ref|XP_002994977.1| hypothetical protein NCER_102325 [Nosema ceranae BRL01]
gi|239603396|gb|EEQ81306.1| hypothetical protein NCER_102325 [Nosema ceranae BRL01]
Length = 200
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 59/212 (27%), Positives = 100/212 (47%), Gaps = 27/212 (12%)
Query: 25 LSCAHTTVRDSRCIFCSQAMNDSFGL-SFDYMLRGLRYSEQE--------------ERKL 69
+SC H+ S C C + ++D L S + ++ SE E +KL
Sbjct: 1 MSCLHSLRIGSLCCDCGEEVHDDKKLFSVLHNNSDIKLSEDEALLRDKKKLERLHKNKKL 60
Query: 70 QLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQAS 129
LVL+LD T+LH K G +FI + VK RP++ LE
Sbjct: 61 VLVLDLDQTILHTTITKEYMEGYS-------NFIINDISYC----VKFRPYLNYMLECLY 109
Query: 130 SLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVRGQERGIVI 189
+I++ TM + YA VKL+D KY +RI+ R++ +K+ + + +VI
Sbjct: 110 KKYEIHVYTMGNKVYANKIVKLIDPTRKYIGNRILTRDENGIGFKKDLNRLFSIHSNVVI 169
Query: 190 LDDTESVWSDHTENLIVLGKYVYFRDKELNGD 221
LDD + +W D+++NLI++ Y ++ ++N +
Sbjct: 170 LDDRDDIW-DYSDNLILVKPYFFWNIGDINSE 200
>gi|358372260|dbj|GAA88864.1| RNA Polymerase II CTD phosphatase Fcp1 [Aspergillus kawachii IFO
4308]
Length = 825
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 64/226 (28%), Positives = 98/226 (43%), Gaps = 41/226 (18%)
Query: 26 SCAHTTVRDSRCIFCSQAMND-SFGLSFDYMLRG----------LRYSEQE--------- 65
CAH C C + M D S+ + R L SEQE
Sbjct: 92 PCAHEVQFGGLCAICGKDMTDFSYNTEVTDVHRAPIQMAHDNTTLTVSEQEATRVEEDAK 151
Query: 66 -----ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIG----SLFQMANDKL-- 114
RKL LV++LD T++H ++ GE K+ ++ FQ+ +D
Sbjct: 152 RRLLANRKLSLVVDLDQTIIHATVDPTV--GEWMQDKENPNYQALSDVRAFQLVDDGPGM 209
Query: 115 ------VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED 168
VKLRP + +FL+ S + ++++ TM TR YA+ ++D D K F RI++R++
Sbjct: 210 RGCWYYVKLRPGLESFLQNVSEMYELHIYTMGTRSYAQHIASIIDPDRKLFGDRILSRDE 269
Query: 169 FNGKDRKN-PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
KN L + +VI+DD VW NLI + Y +F
Sbjct: 270 SGSLVAKNLHRLFPVDTKMVVIIDDRGDVWR-WNPNLIKVSPYDFF 314
>gi|238486788|ref|XP_002374632.1| RNA Polymerase II CTD phosphatase Fcp1, putative [Aspergillus
flavus NRRL3357]
gi|220699511|gb|EED55850.1| RNA Polymerase II CTD phosphatase Fcp1, putative [Aspergillus
flavus NRRL3357]
Length = 698
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 49/158 (31%), Positives = 80/158 (50%), Gaps = 12/158 (7%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSL--FQMANDKL--------VK 116
RKL LV++LD T++H ++ + H + + FQ+ +D VK
Sbjct: 36 RKLSLVVDLDQTIIHATVDPTVGEWMEDKDNPNHQALSDVRAFQLVDDGPGMRGCWYYVK 95
Query: 117 LRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKN 176
LRP + +FL+ S L ++++ TM TR YA+ ++D D K F RI++R++ KN
Sbjct: 96 LRPGLESFLQNVSELFELHIYTMGTRAYAQHIASIIDPDRKLFGDRILSRDESGSLTAKN 155
Query: 177 -PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
L + +VI+DD VW + NLI + Y +F
Sbjct: 156 LHRLFPVDTKMVVIIDDRGDVWR-WSPNLIKVSPYDFF 192
>gi|452981165|gb|EME80925.1| hypothetical protein MYCFIDRAFT_115122, partial [Pseudocercospora
fijiensis CIRAD86]
Length = 770
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 48/158 (30%), Positives = 83/158 (52%), Gaps = 11/158 (6%)
Query: 65 EERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSL--FQMANDKL-----VKL 117
+ R+L LV++LD T++H +++ + + + + FQ+ +DK +K
Sbjct: 158 QSRRLSLVVDLDQTIIHASVEPTIAEWQNDPSNPNYEALQDVQKFQLDDDKPNTWYYIKP 217
Query: 118 RPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKN- 176
RP ++ FL S + ++++ TM TR YAE+ K++D + K F RI++R + KN
Sbjct: 218 RPGLKQFLSTLSEIYEMHIYTMGTRAYAESVAKIIDPEKKIFGDRILSRNESGSMTAKNL 277
Query: 177 PDLVRGQERGIVILDDTESVWSDH-TENLIVLGKYVYF 213
L R +VI+DD VW H T NLI + + +F
Sbjct: 278 KRLFPVDTRMVVIIDDRADVW--HWTSNLIKVNVFEFF 313
>gi|366991271|ref|XP_003675401.1| hypothetical protein NCAS_0C00420 [Naumovozyma castellii CBS 4309]
gi|342301266|emb|CCC69032.1| hypothetical protein NCAS_0C00420 [Naumovozyma castellii CBS 4309]
Length = 725
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 57/232 (24%), Positives = 107/232 (46%), Gaps = 46/232 (19%)
Query: 26 SCAHTTVRDSRCIFCSQAM----NDSFG----LSFDYMLRGLRYSEQE------------ 65
C H V C C + + ND+ G L+ + L+ S +E
Sbjct: 104 PCNHDVVYGGLCTLCGEEVDEDDNDASGSGANLTISHTDTNLKISTREALDIGLNVRTRL 163
Query: 66 --ERKLQLVLNLDHTLLHC---------------RNIKSLSSGEKYLKKQIHSFIGSLFQ 108
E+KL LV++LD T++HC N ++L +++ ++ + +L+
Sbjct: 164 RKEKKLVLVVDLDQTVIHCGVDPTIGEWKNDPKNPNFETLKDVKQFSLEE-EPILPTLYM 222
Query: 109 MANDKL------VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSR 162
L VK+RP ++ FLE+ + L ++++ TM+TR YA K++D + F R
Sbjct: 223 GPKPPLRKCWYYVKVRPGLKEFLEKIAPLFEMHIYTMATRAYASEIAKIIDPNGDLFGDR 282
Query: 163 IIAREDFNGKDRKNPD-LVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
I++R++ K+ + L + ++++DD VW + + NLI + Y +F
Sbjct: 283 ILSRDENGSMTTKSLERLFPTDQSMVIVIDDRGDVW-NWSPNLIKVVPYNFF 333
>gi|242015474|ref|XP_002428378.1| RNA polymerase II ctd phosphatase, putative [Pediculus humanus
corporis]
gi|212512990|gb|EEB15640.1| RNA polymerase II ctd phosphatase, putative [Pediculus humanus
corporis]
Length = 781
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 61/211 (28%), Positives = 95/211 (45%), Gaps = 30/211 (14%)
Query: 27 CAHTTVRDSRCIFCSQAM--NDSFG----LSFDYMLRGLRYSEQE--------------E 66
C H TV C C + N+ F + + + L+ SE++ +
Sbjct: 81 CNHPTVMKDMCAECGADLRKNEQFSTNASVPMVHSIPELKVSEEQAQIIGKADENRLLND 140
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLE 126
RKL L+++LD TL+H N + LK H + QM+ ++RP FLE
Sbjct: 141 RKLVLLVDLDQTLIHTTN----DNIPPNLKDVYHFRL--YGQMSPWYHTRIRPRTHKFLE 194
Query: 127 QASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE---DFNGKDRKNPDLVRGQ 183
+ S ++++CT R YA LD D KYFS RI++R+ + N K L
Sbjct: 195 EISKYYELHICTFGARNYAHMIAMFLDPDGKYFSHRILSRDECFNANSKTANLKALFPCG 254
Query: 184 ERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+ + I+DD E VW + NLI + Y +F+
Sbjct: 255 DNMVCIIDDREDVW-NFAANLIHVKPYHFFK 284
>gi|194757423|ref|XP_001960964.1| GF11242 [Drosophila ananassae]
gi|190622262|gb|EDV37786.1| GF11242 [Drosophila ananassae]
Length = 854
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 55/212 (25%), Positives = 93/212 (43%), Gaps = 31/212 (14%)
Query: 27 CAHTTVRDSRCIFCS-------QAMNDSFGLSFDYMLRGLRYSEQ--------------E 65
C HTTV C C + + + L+ +++
Sbjct: 139 CIHTTVIKDMCADCGADLRQNENGQTSEASVPMVHTMPDLKVTQKLAQKLGHDDTRRLLA 198
Query: 66 ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFL 125
+RKL L+++LD T++H N + + Q++ + +LRP FL
Sbjct: 199 DRKLVLLVDLDQTVIHTTNDTVPENIKGIYHFQLYGPQSPWYH------TRLRPGTAEFL 252
Query: 126 EQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED-FNGKDRKN--PDLVRG 182
E S L ++++CT R YA +LLD D K+FS RI++R++ FN + + L
Sbjct: 253 ESMSQLYELHICTFGARNYAHMIAQLLDPDGKFFSHRILSRDECFNATSKTDNLKALFPN 312
Query: 183 QERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+ + I+DD E VW + NLI + Y +F+
Sbjct: 313 GDSMVCIIDDREDVW-NMASNLIQVKPYHFFQ 343
>gi|198460927|ref|XP_001361849.2| GA11510 [Drosophila pseudoobscura pseudoobscura]
gi|198137180|gb|EAL26428.2| GA11510 [Drosophila pseudoobscura pseudoobscura]
Length = 873
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 57/212 (26%), Positives = 98/212 (46%), Gaps = 31/212 (14%)
Query: 27 CAHTTVRDSRCIFCSQAM-NDSFGLSFD------YMLRGLRYSEQ--------------E 65
C HTTV C C + D G + + + + L+ +++
Sbjct: 128 CIHTTVIKDMCADCGADLRKDDNGQTSEASVPMVHTMPDLKVTQKLAQKLGHDDTRRLLA 187
Query: 66 ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFL 125
+RKL L+++LD T++H N + + Q++ + +LRP FL
Sbjct: 188 DRKLVLLVDLDQTVIHTTNDTVPENIKGIYHFQLYGPQSPWYH------TRLRPGTAEFL 241
Query: 126 EQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED-FNGKDRKN--PDLVRG 182
E+ S L ++++CT R YA +LLD D K+FS RI++R++ FN + + L
Sbjct: 242 ERMSQLYELHICTFGARNYAHMIAQLLDPDGKFFSHRILSRDECFNATSKTDNLKALFPN 301
Query: 183 QERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+ + I+DD E VW + NLI + Y +F+
Sbjct: 302 GDSMVCIIDDREDVW-NMASNLIQVKPYHFFQ 332
>gi|258563858|ref|XP_002582674.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
gi|237908181|gb|EEP82582.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
Length = 897
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 49/158 (31%), Positives = 84/158 (53%), Gaps = 12/158 (7%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSL--FQMANDKL--------VK 116
RKL LV++LD T++H +++ + H + ++ FQ+ +D +K
Sbjct: 183 RKLSLVVDLDQTIIHATVDPTVAEWREDKTNPNHEAVKNVRSFQLIDDGPGMRGCWYYIK 242
Query: 117 LRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKN 176
LRP + FL+ SSL ++++ TM+TR YA+ ++D D K F RI++R++ KN
Sbjct: 243 LRPGLEEFLKNISSLYELHIYTMATRAYAQNIANIVDPDRKIFGDRILSRDESGSLTAKN 302
Query: 177 -PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
L + +VI+DD VW ++NLI + Y +F
Sbjct: 303 LHRLFPVDTKMVVIIDDRGDVWK-WSDNLIRVFPYDFF 339
>gi|336466789|gb|EGO54953.1| hypothetical protein NEUTE1DRAFT_84976 [Neurospora tetrasperma FGSC
2508]
gi|350288620|gb|EGZ69856.1| hypothetical protein NEUTE2DRAFT_160171 [Neurospora tetrasperma
FGSC 2509]
Length = 867
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 45/162 (27%), Positives = 86/162 (53%), Gaps = 12/162 (7%)
Query: 65 EERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSL--FQMANDK--------- 113
+ RKL LV++LD T++H ++ +K + + ++ FQ+ +
Sbjct: 159 QHRKLSLVVDLDQTIIHACIDPTVGEWQKDPSNPNYPSVRNVKSFQLDDGPRGVANNCWY 218
Query: 114 LVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAR-EDFNGK 172
+K+RP + FL++ S++ ++++ TM TR YA+ +++D D K F +R+I+R E+ N
Sbjct: 219 YIKMRPGLEDFLKKISTMYELHVYTMGTRAYAQNVARIVDPDKKLFGNRVISRDENGNMY 278
Query: 173 DRKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+ L + +VI+DD VW + NLI + Y +F+
Sbjct: 279 AKSLQRLFPVSTKMVVIIDDRADVWPRNRPNLIKVSPYDFFK 320
>gi|164429292|ref|XP_958446.2| hypothetical protein NCU11408 [Neurospora crassa OR74A]
gi|157073422|gb|EAA29210.2| conserved hypothetical protein [Neurospora crassa OR74A]
Length = 868
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 45/162 (27%), Positives = 86/162 (53%), Gaps = 12/162 (7%)
Query: 65 EERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSL--FQMANDK--------- 113
+ RKL LV++LD T++H ++ +K + + ++ FQ+ +
Sbjct: 159 QHRKLSLVVDLDQTIIHACIDPTVGEWQKDPSNPNYPSVRNVKSFQLDDGPRGVANNCWY 218
Query: 114 LVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAR-EDFNGK 172
+K+RP + FL++ S++ ++++ TM TR YA+ +++D D K F +R+I+R E+ N
Sbjct: 219 YIKMRPGLEDFLKKISTMYELHVYTMGTRAYAQNVARIVDPDKKLFGNRVISRDENGNMY 278
Query: 173 DRKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+ L + +VI+DD VW + NLI + Y +F+
Sbjct: 279 AKSLQRLFPVSTKMVVIIDDRADVWPRNRPNLIKVSPYDFFK 320
>gi|67524889|ref|XP_660506.1| hypothetical protein AN2902.2 [Aspergillus nidulans FGSC A4]
gi|40744297|gb|EAA63473.1| hypothetical protein AN2902.2 [Aspergillus nidulans FGSC A4]
gi|259486161|tpe|CBF83781.1| TPA: CTD phosphatase-related (Eurofung) [Aspergillus nidulans FGSC
A4]
Length = 829
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 53/183 (28%), Positives = 89/183 (48%), Gaps = 13/183 (7%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSL--FQMANDKL--------VK 116
RKL LV++LD T++H ++ H+ + + FQ+ +D VK
Sbjct: 158 RKLSLVVDLDQTIIHAAVDPTIGEWMADKDNPNHAAVSDVRAFQLVDDGPGMRGCWYYVK 217
Query: 117 LRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKN 176
LRP + FLE + + ++++ TM TR YA+A ++D D K F RI++R++ KN
Sbjct: 218 LRPGLEEFLENVAEMYELHIYTMGTRSYAQAIANIIDPDRKLFGDRILSRDESGSLSVKN 277
Query: 177 PDLV-RGQERGIVILDDTESVWSDHTENLIVLGKYVYFRD-KELNGDHKSYSETLTDESE 234
+ + +VI+DD VW + NLI + Y +F ++N + L E
Sbjct: 278 LHRIFPVDTKMVVIIDDRGDVWR-WSPNLIKVIPYDFFVGIGDINSSFLPKKQELETPGE 336
Query: 235 NEE 237
N+E
Sbjct: 337 NQE 339
>gi|344301528|gb|EGW31840.1| hypothetical protein SPAPADRAFT_140004 [Spathaspora passalidarum
NRRL Y-27907]
Length = 770
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 72/241 (29%), Positives = 107/241 (44%), Gaps = 51/241 (21%)
Query: 21 CEQSLSCAHTTVRDSRCIFCSQAMNDSFGLS-FDYMLR----------GLRYS------- 62
C C HT C C +++ D S ++Y R GLR S
Sbjct: 100 CTIKEPCTHTVQYGGLCALCGKSLEDERDYSGYNYEDRATISMAHDNTGLRISLDEATKI 159
Query: 63 EQ-------EERKLQLVLNLDHTLLHCR------NIKSLSSGEKYLK-KQIHSF------ 102
EQ EE+KL LV++LD T++H +S S Y K + SF
Sbjct: 160 EQSTTDRLTEEKKLILVVDLDQTVIHATVDPTVGEWQSDPSNPNYPAVKDVKSFCLEEDP 219
Query: 103 ------IGSLFQMANDK---LVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLD 153
G ++A K VK+RP + FLEQ S+ ++++ TM+TR YA A ++D
Sbjct: 220 ITPPNWTGP--KLAPTKCWYYVKVRPGLAEFLEQVSNKYEMHIYTMATRNYALAIANIID 277
Query: 154 LDSKYFSSRIIAREDFNGKDRKN-PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVY 212
+ KYF RI++R++ KN L + +VI+DD VW + NLI + Y +
Sbjct: 278 PEGKYFGDRILSRDESGSLTHKNLKRLFPVDQSMVVIIDDRGDVWQWES-NLIKVVPYDF 336
Query: 213 F 213
F
Sbjct: 337 F 337
>gi|294658166|ref|XP_460501.2| DEHA2F03102p [Debaryomyces hansenii CBS767]
gi|202952923|emb|CAG88814.2| DEHA2F03102p [Debaryomyces hansenii CBS767]
Length = 795
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 70/234 (29%), Positives = 102/234 (43%), Gaps = 47/234 (20%)
Query: 26 SCAHTTVRDSRCIFCSQAMNDSFGLS-FDYMLR----------GLRYSEQE--------- 65
CAH C C +A+ D S ++Y R GL+ S E
Sbjct: 98 PCAHAVQYGGLCALCGKAVEDEKDYSGYNYEDRATISMSHDNTGLKISLDEATKIEHNTT 157
Query: 66 -----ERKLQLVLNLDHTLLHCR------NIKSLSSGEKYLK-KQIHSFIGSLFQMA--- 110
E+KL LV++LD T++H +S S Y K + SF +A
Sbjct: 158 DRLSREKKLILVVDLDQTVIHATVDPTVGEWQSDPSNPNYPAVKNVRSFCLEEDPIAPPG 217
Query: 111 --NDKL--------VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFS 160
KL VKLRP + FL AS L ++++ TM+TR YA A K++D + +YF
Sbjct: 218 WTGPKLPPSKCWYYVKLRPGLEEFLRSASDLYEMHIYTMATRNYALAIAKIIDPEGEYFG 277
Query: 161 SRIIAREDFNGKDRKN-PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
RI++R++ KN L + +VI+DD VW NLI + Y +F
Sbjct: 278 DRILSRDESGSLTHKNLKRLFPVDQSMVVIIDDRGDVWQ-WENNLIKVVPYDFF 330
>gi|396499223|ref|XP_003845421.1| similar to RNA polymerase II subunit A C-terminal domain
phosphatase [Leptosphaeria maculans JN3]
gi|312222002|emb|CBY01942.1| similar to RNA polymerase II subunit A C-terminal domain
phosphatase [Leptosphaeria maculans JN3]
Length = 887
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 45/159 (28%), Positives = 89/159 (55%), Gaps = 14/159 (8%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSL--FQMANDKL---------V 115
+KL L+++LD T++H ++++ + + H + + FQ+A+D + V
Sbjct: 242 KKLTLIVDLDQTVIHTTCERTIAEWQADPENPNHGAVKDVEGFQLADDNVSNVAANWYYV 301
Query: 116 KLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAR-EDFNGKDR 174
K RP + F ++ S L ++++ TM+TR YA+A K++D D +YF RI++R E++ K +
Sbjct: 302 KKRPGLEDFFKRMSKLYEMHVYTMATRAYAQAVCKIIDPDRRYFGDRILSRDENYTDKTK 361
Query: 175 KNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
L + +VI+DD VW ++ +L+ + + +F
Sbjct: 362 SLSRLFQNTTM-VVIIDDRADVWQ-YSPHLVRVPVFNFF 398
>gi|328713585|ref|XP_001947680.2| PREDICTED: RNA polymerase II subunit A C-terminal domain
phosphatase-like [Acyrthosiphon pisum]
Length = 736
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 62/219 (28%), Positives = 98/219 (44%), Gaps = 44/219 (20%)
Query: 26 SCAHTTVRDSRCIFCS------QAMNDSFGLSFDYMLRGLRYSEQE-------------- 65
C+H+TV C C A + +S + + L+ SEQ
Sbjct: 81 GCSHSTVVSDLCADCGADLRIDNASKPTASVSMVHSVPDLKVSEQSALLLGKADEKRLLG 140
Query: 66 ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKL-VKLRPFVRTF 124
++KL L+++LD TL+H N ++ K IH F L+ + +LRP F
Sbjct: 141 DKKLVLLVDLDQTLIHTTNDNIPNN-----IKDIHHF--QLYGPNSPWYHTRLRPGTYNF 193
Query: 125 LEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED-FNGKDRKNPDLVRGQ 183
L S L ++++CT R YA +LD K FS R+++R++ F NP+ G
Sbjct: 194 LSSISELYELHICTFGARNYAHTITHILDPKGKLFSHRVLSRDECF------NPNSKTGN 247
Query: 184 ERG--------IVILDDTESVWSDHTENLIVLGKYVYFR 214
+G + I+DD E VW D+ NLI + Y +F+
Sbjct: 248 LKGLFPCGDNMVCIIDDREDVW-DYALNLIHVKPYHFFQ 285
>gi|320164786|gb|EFW41685.1| conserved hypothetical protein [Capsaspora owczarzaki ATCC 30864]
Length = 877
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 50/158 (31%), Positives = 83/158 (52%), Gaps = 12/158 (7%)
Query: 65 EERKLQLVLNLDHTLLHCRNIKSLSSGEKYLK------KQIHSFIGSLFQMANDKLVKLR 118
+ +KL L+++LD TL+H + ++L+ K+I +F SL + +KLR
Sbjct: 228 QSKKLVLIVDLDQTLIHAVVSSQVPWIGQFLRDNVELQKEIFNF--SLPNHPHLYYIKLR 285
Query: 119 PFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPD 178
P R FL QA+ L ++++ TM +R YA +LD D F SRI++R++ + K+
Sbjct: 286 PGAREFLAQATKLFELHIFTMGSRMYASRVAAVLDPDGALFGSRIMSRDESKSANFKHTQ 345
Query: 179 LVRGQERG---IVILDDTESVWSDHTENLIVLGKYVYF 213
L + G + +LDD VW+ N+I + Y YF
Sbjct: 346 LSQLFPSGHNMVAVLDDRIDVWA-RLGNVIQISPYEYF 382
>gi|21914376|gb|AAM81360.1|AF522873_3 RNA polymerase II C-terminal domain phosphatase component
[Leptosphaeria maculans]
Length = 804
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 45/159 (28%), Positives = 89/159 (55%), Gaps = 14/159 (8%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSL--FQMANDKL---------V 115
+KL L+++LD T++H ++++ + + H + + FQ+A+D + V
Sbjct: 159 KKLTLIVDLDQTVIHTTCERTIAEWQADPENPNHGAVKDVEGFQLADDNVSNVAANWYYV 218
Query: 116 KLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAR-EDFNGKDR 174
K RP + F ++ S L ++++ TM+TR YA+A K++D D +YF RI++R E++ K +
Sbjct: 219 KKRPGLEDFFKRMSKLYEMHVYTMATRAYAQAVCKIIDPDRRYFGDRILSRDENYTDKTK 278
Query: 175 KNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
L + +VI+DD VW ++ +L+ + + +F
Sbjct: 279 SLSRLFQNTTM-VVIIDDRADVWQ-YSPHLVRVPVFNFF 315
>gi|156087501|ref|XP_001611157.1| protein phosphatase family protein [Babesia bovis]
gi|154798411|gb|EDO07589.1| protein phosphatase family protein [Babesia bovis]
Length = 806
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 43/121 (35%), Positives = 68/121 (56%), Gaps = 5/121 (4%)
Query: 95 LKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDL 154
+ + ++ GSLF KLRP V FL +++ L ++YL TM TR +A AA+K+LD
Sbjct: 324 MTRTLNEMDGSLFV----NYYKLRPGVYDFLRRSAELYELYLFTMGTRAHANAALKILDP 379
Query: 155 DSKYFSSRIIAREDFNGKDRKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
D KYF +R+ +R + N + + ++ILDD+E++W D LI + Y +F
Sbjct: 380 DGKYFGARVFSRSETNNCFKSLCRIFPKYRNHLLILDDSENIWLD-APGLIKVYPYYFFT 438
Query: 215 D 215
D
Sbjct: 439 D 439
>gi|167520468|ref|XP_001744573.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163776904|gb|EDQ90522.1| predicted protein [Monosiga brevicollis MX1]
Length = 858
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 61/203 (30%), Positives = 112/203 (55%), Gaps = 20/203 (9%)
Query: 65 EERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKL-VKLRPFVRT 123
E RKL L+L+LD TL+H I S++S +L++ ++ F + K+RP +
Sbjct: 63 EARKLILILDLDKTLIHS-TIDSIAS--HWLREGVYDIFH--FDLGKHTYYTKVRPGLHA 117
Query: 124 FLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAR-EDFNGKDR-KNPD-LV 180
FLE ++++ TM R YAE ++++D +++FS+RI+ + E F+ +++ KN D L+
Sbjct: 118 FLEDLYPYYEMHIYTMGRRNYAERILRIIDPSNRFFSTRILTQDESFSIENKAKNLDALL 177
Query: 181 RGQERGIVILDDTESVWSDHTENLIVLGKYVYFRD-KELNGDHKSYSETLTDESENEEAL 239
G + VILDD +VW D N++ Y +F+ +E+N + S++ + EAL
Sbjct: 178 PGGDSMAVILDDLPAVW-DFQTNVVPALPYEFFKHVEEVNAIPQQRSQSDRRMARKHEAL 236
Query: 240 -----ANVLRV----LKTIHRLF 253
+N +R+ ++ ++R F
Sbjct: 237 QRMHASNAIRITDRLIEPLYRAF 259
>gi|358390781|gb|EHK40186.1| hypothetical protein TRIATDRAFT_89336 [Trichoderma atroviride IMI
206040]
Length = 768
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 47/162 (29%), Positives = 83/162 (51%), Gaps = 13/162 (8%)
Query: 66 ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSL--FQMANDKL--------- 114
+RKL LV++LD T++H ++ ++ H + + FQ+ +D
Sbjct: 156 QRKLSLVVDLDQTIIHACIEPTVGEWQRDKANPNHEAVKDVKSFQLNDDGPRGLASGCTY 215
Query: 115 -VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKD 173
+KLRP + FLE S++ ++++ TM TR YA +++D D K F +R+I+R++
Sbjct: 216 YIKLRPGLHEFLETVSTMYELHVYTMGTRAYALNIARIVDPDKKLFGNRVISRDENGSIT 275
Query: 174 RKN-PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
K+ L +VI+DD VW + NLI + Y +F+
Sbjct: 276 AKSLQRLFPVSTDMVVIIDDRSDVWPMNRPNLIKVVPYDFFK 317
>gi|196002231|ref|XP_002110983.1| hypothetical protein TRIADDRAFT_54465 [Trichoplax adhaerens]
gi|190586934|gb|EDV26987.1| hypothetical protein TRIADDRAFT_54465 [Trichoplax adhaerens]
Length = 766
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 52/157 (33%), Positives = 88/157 (56%), Gaps = 11/157 (7%)
Query: 67 RKLQLVLNLDHTLLHCR----NIK-SLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFV 121
+KL L+++LD TL+H R +IK S + EK + H F G + + + L KLRP V
Sbjct: 226 KKLVLIVDLDLTLIHTRMASPDIKLSNLTEEKQIYYTCHMFPG--YNVYHQYLTKLRPHV 283
Query: 122 RTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVR 181
FL+ AS+L ++++ TM +R YA+ V +LD F +RI++R++ + K+ +L +
Sbjct: 284 EEFLKVASTLFELHVVTMGSRSYAQDIVGILDPTGSLFYNRILSRDELKSQLLKSTNLNQ 343
Query: 182 GQERG---IVILDDTESVWSDHTENLIVLGKYVYFRD 215
G + I+DD +W+ H + I + Y YF +
Sbjct: 344 LFPLGDNLVCIIDDRPEMWAFHP-SCIPVPPYSYFAN 379
>gi|452840538|gb|EME42476.1| hypothetical protein DOTSEDRAFT_73343 [Dothistroma septosporum
NZE10]
Length = 855
Score = 72.0 bits (175), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 49/160 (30%), Positives = 80/160 (50%), Gaps = 12/160 (7%)
Query: 65 EERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSL--FQMANDKL-------- 114
E R+L LV++LD T++H ++ + H + + FQ+A+D
Sbjct: 159 EARRLSLVVDLDQTVIHACVEPTIGEWQSDPTNPNHEAVKDVCKFQLADDAPGRPGTWYY 218
Query: 115 VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDR 174
+KLRP ++ FL S ++++ TM TR YAE K++D D F RI++R++
Sbjct: 219 IKLRPGLKEFLTTMSQYYEMHIYTMGTRAYAENIAKIIDPDRSVFGDRILSRDESGSMQA 278
Query: 175 KN-PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
KN L + +VI+DD VWS NLI + + +F
Sbjct: 279 KNLKRLFPVDTKMVVIIDDRADVWS-WISNLIKVKVFEFF 317
>gi|119491655|ref|XP_001263322.1| RNA Polymerase II CTD phosphatase Fcp1, putative [Neosartorya
fischeri NRRL 181]
gi|119411482|gb|EAW21425.1| RNA Polymerase II CTD phosphatase Fcp1, putative [Neosartorya
fischeri NRRL 181]
Length = 824
Score = 72.0 bits (175), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 48/158 (30%), Positives = 80/158 (50%), Gaps = 12/158 (7%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSL--FQMANDKL--------VK 116
RKL LV++LD T++H ++ + H + + FQ+ ++ VK
Sbjct: 158 RKLSLVVDLDQTIIHATVDPTVGEWMEDKDNPNHEALSDVRAFQLVDEGPGMRGCWYYVK 217
Query: 117 LRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKN 176
LRP + +FL+ S L ++++ TM TR YA+ ++D D K F RI++R++ KN
Sbjct: 218 LRPGLESFLQNVSELFELHIYTMGTRAYAQHIAGIIDPDRKLFGDRILSRDESGSLTAKN 277
Query: 177 -PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
L + +VI+DD VW + NLI + Y +F
Sbjct: 278 LQRLFPVDTKMVVIIDDRGDVWR-WSPNLIKVSPYDFF 314
>gi|242781762|ref|XP_002479866.1| RNA Polymerase II CTD phosphatase Fcp1, putative [Talaromyces
stipitatus ATCC 10500]
gi|218720013|gb|EED19432.1| RNA Polymerase II CTD phosphatase Fcp1, putative [Talaromyces
stipitatus ATCC 10500]
Length = 822
Score = 72.0 bits (175), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 46/158 (29%), Positives = 80/158 (50%), Gaps = 12/158 (7%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSL--FQMANDKL--------VK 116
++L LV++LD T++H ++ ++ H + + FQ+ +D +K
Sbjct: 158 KRLSLVVDLDQTIIHATVDPTVGEWKEDKNNPNHEAVKDVRAFQLTDDGPGMRGCWYYIK 217
Query: 117 LRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKN 176
LRP + +FL+ S L ++++ TM TR YA+ ++D D K F RI++R++ KN
Sbjct: 218 LRPGLESFLQNISKLYELHIYTMGTRAYAQNIANIIDPDRKLFGDRILSRDESGSLTAKN 277
Query: 177 -PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
L + +VI+DD VW NLI + Y +F
Sbjct: 278 LQRLFPVDTKMVVIIDDRGDVWK-WNPNLIKVSPYDFF 314
>gi|212526776|ref|XP_002143545.1| RNA Polymerase II CTD phosphatase Fcp1, putative [Talaromyces
marneffei ATCC 18224]
gi|210072943|gb|EEA27030.1| RNA Polymerase II CTD phosphatase Fcp1, putative [Talaromyces
marneffei ATCC 18224]
Length = 829
Score = 71.6 bits (174), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 46/158 (29%), Positives = 80/158 (50%), Gaps = 12/158 (7%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSL--FQMANDKL--------VK 116
++L LV++LD T++H ++ ++ H + + FQ+ +D +K
Sbjct: 158 KRLSLVVDLDQTIIHATVDPTVGEWKEDKNNPNHDAVKDVRAFQLTDDGPGMRGCWYYIK 217
Query: 117 LRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKN 176
LRP + +FL+ S L ++++ TM TR YA+ ++D D K F RI++R++ KN
Sbjct: 218 LRPGLESFLQNISELYELHIYTMGTRAYAQHIANIIDPDRKLFGDRILSRDESGSLTAKN 277
Query: 177 -PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
L + +VI+DD VW NLI + Y +F
Sbjct: 278 LQRLFPVDTKMVVIIDDRGDVWK-WNPNLIKVSPYDFF 314
>gi|50294127|ref|XP_449475.1| hypothetical protein [Candida glabrata CBS 138]
gi|49528789|emb|CAG62451.1| unnamed protein product [Candida glabrata]
Length = 758
Score = 71.6 bits (174), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 62/232 (26%), Positives = 101/232 (43%), Gaps = 40/232 (17%)
Query: 21 CEQSLSCAHTTVRDSRCIFCSQAMND----SFGLSFDYMLRGLRYSEQE----------- 65
CE C H V C C + +++ L+ + LR S +E
Sbjct: 102 CEIKRPCNHDIVYGGLCTMCGKEVDEYDQVDANLTISHTDTNLRVSRKEAIDLDKQITTR 161
Query: 66 ---ERKLQLVLNLDHTLLHC------RNIKSLSSGEKYLK-KQIHSF------IGSLFQM 109
E+KL LV++LD T++HC K+ S Y K + F I L M
Sbjct: 162 LKNEKKLVLVVDLDQTVIHCGVDPTIGEWKADPSNPNYETLKDVKCFSLEEEPILPLIYM 221
Query: 110 ANDKL-------VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSR 162
VK+RP ++ F E+ + L ++++ TM+TR YA K++D D F R
Sbjct: 222 GPKPPVRTCWYYVKIRPGLKEFFEKIAPLYEMHIYTMATRAYALEIAKIIDPDKSLFGDR 281
Query: 163 IIAREDFNGKDRKN-PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
I++R++ +K+ L + +V++DD VW+ NLI + Y +F
Sbjct: 282 ILSRDENGSLTQKSLTRLFPTDQSMVVVIDDRGDVWN-WCPNLIKVVPYNFF 332
>gi|115533721|ref|NP_492423.2| Protein FCP-1 [Caenorhabditis elegans]
gi|82658167|emb|CAC70088.2| Protein FCP-1 [Caenorhabditis elegans]
Length = 659
Score = 71.6 bits (174), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 53/199 (26%), Positives = 102/199 (51%), Gaps = 20/199 (10%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKY---LKKQIHSFIGSLFQMANDKLVKLRPFVRT 123
RKL L+++LD T++H + E + K +HS + + KLRP
Sbjct: 142 RKLVLLVDLDQTIIHTSDKPMTVDTENHKDITKYNLHSRVYT---------TKLRPHTTE 192
Query: 124 FLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED-FNGKDRKN--PDLV 180
FL + S++ ++++ T R YA ++LD D++ F RI++R++ F+ + + N L
Sbjct: 193 FLNKMSNMYEMHIVTYGQRQYAHRIAQILDPDARLFEQRILSRDELFSAQHKTNNLKALF 252
Query: 181 RGQERGIVILDDTESVWSDHTENLIVLGKYVYFRD-KELNGDHKSYSE---TLTDESENE 236
+ +VI+DD VW ++E LI + Y +F++ ++N S + + D++ +
Sbjct: 253 PCGDNLVVIIDDRSDVWM-YSEALIQIKPYRFFKEVGDINAPKNSKEQMPVQIEDDAHED 311
Query: 237 EALANVLRVLKTIHRLFFD 255
+ L + RVL IH +++
Sbjct: 312 KVLEEIERVLTNIHDKYYE 330
>gi|336259270|ref|XP_003344437.1| hypothetical protein SMAC_08633 [Sordaria macrospora k-hell]
gi|380087533|emb|CCC05319.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 878
Score = 71.6 bits (174), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 44/162 (27%), Positives = 86/162 (53%), Gaps = 12/162 (7%)
Query: 65 EERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSL--FQMANDK--------- 113
+ RKL LV++LD T++H ++ +K + + ++ FQ+ +
Sbjct: 159 QHRKLSLVVDLDQTIIHACIDPTVGEWQKDPSNPNYPSVRNVKSFQLDDGPRGVANNCWY 218
Query: 114 LVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAR-EDFNGK 172
+K+RP + FL++ S++ ++++ TM TR YA+ +++D + K F +R+I+R E+ N
Sbjct: 219 YIKMRPGLEDFLKKISTMYELHVYTMGTRAYAQNVARIVDPEKKLFGNRVISRDENGNMY 278
Query: 173 DRKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+ L + +VI+DD VW + NLI + Y +F+
Sbjct: 279 AKSLQRLFPVSTKMVVIIDDRADVWPRNRPNLIKVSPYDFFK 320
>gi|21483550|gb|AAM52750.1| SD01014p [Drosophila melanogaster]
Length = 896
Score = 71.6 bits (174), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 54/212 (25%), Positives = 94/212 (44%), Gaps = 31/212 (14%)
Query: 27 CAHTTVRDSRCIFCS-------QAMNDSFGLSFDYMLRGLRYSEQ--------------E 65
C HTTV C C + + + L+ +++
Sbjct: 161 CIHTTVIKDMCADCGADLRQNENGQTSEASVPMVHTMPDLKVTQKLAQKLGHDDTRRLLA 220
Query: 66 ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFL 125
+RKL L+++LD T++H N + + Q++ + +LRP FL
Sbjct: 221 DRKLVLLVDLDQTVIHTTNDTVPDNIKGIYHFQLYGPHSPWYH------TRLRPGTAEFL 274
Query: 126 EQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED-FNGKDRKN--PDLVRG 182
E+ S L ++++CT R YA +LLD + K+FS RI++R++ FN + + L
Sbjct: 275 ERMSQLYELHICTFGARNYAHMIAQLLDPEGKFFSHRILSRDECFNATSKTDNLKALFPN 334
Query: 183 QERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+ + I+DD E VW + NLI + Y +F+
Sbjct: 335 GDSMVCIIDDREDVW-NMASNLIQVKPYHFFQ 365
>gi|194886507|ref|XP_001976627.1| GG19916 [Drosophila erecta]
gi|190659814|gb|EDV57027.1| GG19916 [Drosophila erecta]
Length = 876
Score = 71.6 bits (174), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 54/212 (25%), Positives = 94/212 (44%), Gaps = 31/212 (14%)
Query: 27 CAHTTVRDSRCIFCS-------QAMNDSFGLSFDYMLRGLRYSEQ--------------E 65
C HTTV C C + + + L+ +++
Sbjct: 141 CIHTTVIKDMCADCGADLRQNENGQTSEASVPMVHTMPDLKVTQKLAQKLGHDDTRRLLA 200
Query: 66 ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFL 125
+RKL L+++LD T++H N + + Q++ + +LRP FL
Sbjct: 201 DRKLVLLVDLDQTVIHTTNDTVPDNIKGIYHFQLYGPHSPWYH------TRLRPGTAEFL 254
Query: 126 EQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED-FNGKDRKN--PDLVRG 182
E+ S L ++++CT R YA +LLD + K+FS RI++R++ FN + + L
Sbjct: 255 ERMSQLYELHICTFGARNYAHMIAQLLDPEGKFFSHRILSRDECFNATSKTDNLKALFPN 314
Query: 183 QERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+ + I+DD E VW + NLI + Y +F+
Sbjct: 315 GDSMVCIIDDREDVW-NMASNLIQVKPYHFFQ 345
>gi|412985958|emb|CCO17158.1| RNA Polymerase II CTD phosphatase Fcp1 [Bathycoccus prasinos]
Length = 490
Score = 71.6 bits (174), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 59/219 (26%), Positives = 104/219 (47%), Gaps = 34/219 (15%)
Query: 68 KLQLVLNLDHTLLHC------------------RNIKSLSSGEKYLKKQIHSFIGSLFQM 109
KL LVL+LD TLLH +K + +K ++ ++ S F +
Sbjct: 101 KLPLVLDLDSTLLHSVEKTKFLFPNPGESNTSEEEMKIIKQAQKKIESRLESSPDKFFYV 160
Query: 110 ANDKLVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEA-AVKLLDLDSKYFS---SRIIA 165
+ K+RP R FL + S + ++Y+ T ++ YAEA A ++LD KYF+ +RI
Sbjct: 161 NDQYFTKIRPQARRFLSELSEMYELYIVTAGSQAYAEAIANQVLDPLGKYFNRDVNRIKG 220
Query: 166 REDFNG--------KDRKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRDKE 217
+ +N + + D + G E ++++D +W D ++ + Y YF +
Sbjct: 221 MKQWNSEVNQWVDVRTKIVNDALEGAESVTIVVEDKPEMW-DGECAVMQVKPYYYFPES- 278
Query: 218 LNGDHKSYSETLTDESENEEA--LANVLRVLKTIHRLFF 254
L S+ +TDESE ++ + N+L L+ +HR+ F
Sbjct: 279 LEELKLSHFYNMTDESEKNDSYLVDNILPRLRNVHRMMF 317
>gi|334325963|ref|XP_001374906.2| PREDICTED: RNA polymerase II subunit A C-terminal domain
phosphatase-like [Monodelphis domestica]
Length = 1208
Score = 71.6 bits (174), Expect = 5e-10, Method: Composition-based stats.
Identities = 49/157 (31%), Positives = 82/157 (52%), Gaps = 16/157 (10%)
Query: 64 QEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLV---KLRPF 120
RKL L+++LD TL+H + E++ ++ + I FQ+ + + +LRP
Sbjct: 401 HRNRKLVLMVDLDQTLIH--------TTEQHCQQMSNKGIFH-FQLGRGEPMLHTRLRPH 451
Query: 121 VRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE---DFNGKDRKNP 177
+ FLE+ + L ++++ T +R YA LD + K FS RI++R+ D K
Sbjct: 452 CKEFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECIDPFSKTGNLR 511
Query: 178 DLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+L + + I+DD E VW + NLI + KYVYF+
Sbjct: 512 NLFPCGDSMVCIIDDREDVWK-YAPNLITVKKYVYFQ 547
>gi|358383388|gb|EHK21054.1| hypothetical protein TRIVIDRAFT_90991 [Trichoderma virens Gv29-8]
Length = 758
Score = 71.2 bits (173), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 47/162 (29%), Positives = 83/162 (51%), Gaps = 13/162 (8%)
Query: 66 ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSL--FQMANDKL--------- 114
+RKL LV++LD T++H ++ ++ H + + FQ+ +D
Sbjct: 156 QRKLSLVVDLDQTIIHACIEPTIGEWQRDPTNPNHEAVKDVKSFQLNDDGPRGLASGCTY 215
Query: 115 -VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKD 173
+KLRP ++ FLE S+ ++++ TM TR YA +++D D K F +R+I+R++
Sbjct: 216 YIKLRPGLQEFLEAVSTKYELHVYTMGTRAYALNIARIVDPDRKLFGNRVISRDENGSIT 275
Query: 174 RKN-PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
K+ L +VI+DD VW + NLI + Y +F+
Sbjct: 276 AKSLQRLFPVSTDMVVIIDDRADVWPMNRPNLIKVVPYDFFK 317
>gi|195170374|ref|XP_002025988.1| GL10108 [Drosophila persimilis]
gi|194110852|gb|EDW32895.1| GL10108 [Drosophila persimilis]
Length = 757
Score = 71.2 bits (173), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 46/150 (30%), Positives = 77/150 (51%), Gaps = 10/150 (6%)
Query: 68 KLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQ 127
KL L+++LD T++H N + + Q++ + +LRP FLE+
Sbjct: 88 KLVLLVDLDQTVIHTTNDTVPENIKGIYHFQLYGPQSPWYH------TRLRPGTAEFLER 141
Query: 128 ASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED-FNGKDRKN--PDLVRGQE 184
S L ++++CT R YA +LLD D K+FS RI++R++ FN + + L +
Sbjct: 142 MSQLYELHICTFGARNYAHMIAQLLDPDGKFFSHRILSRDECFNATSKTDNLKALFPNGD 201
Query: 185 RGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+ I+DD E VW + NLI + Y +F+
Sbjct: 202 SMVCIIDDREDVW-NMASNLIQVKPYHFFQ 230
>gi|195586452|ref|XP_002082988.1| GD24941 [Drosophila simulans]
gi|194194997|gb|EDX08573.1| GD24941 [Drosophila simulans]
Length = 877
Score = 71.2 bits (173), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 54/212 (25%), Positives = 94/212 (44%), Gaps = 31/212 (14%)
Query: 27 CAHTTVRDSRCIFCS-------QAMNDSFGLSFDYMLRGLRYSEQ--------------E 65
C HTTV C C + + + L+ +++
Sbjct: 142 CIHTTVIKDMCADCGADLRQNENGQTSEASVPMVHTMPDLKVTQKLAQKLGHDDTRRLLA 201
Query: 66 ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFL 125
+RKL L+++LD T++H N + + Q++ + +LRP FL
Sbjct: 202 DRKLVLLVDLDQTVIHTTNDTVPDNIKGIYHFQLYGPHSPWYH------TRLRPGTAEFL 255
Query: 126 EQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED-FNGKDRKN--PDLVRG 182
E+ S L ++++CT R YA +LLD + K+FS RI++R++ FN + + L
Sbjct: 256 ERMSQLYELHICTFGARNYAHMIAQLLDPEGKFFSHRILSRDECFNATSKTDNLKALFPN 315
Query: 183 QERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+ + I+DD E VW + NLI + Y +F+
Sbjct: 316 GDSMVCIIDDREDVW-NMASNLIQVKPYHFFQ 346
>gi|91087589|ref|XP_971974.1| PREDICTED: similar to RNA polymerase II subunit A C-terminal domain
phosphatase [Tribolium castaneum]
gi|270010700|gb|EFA07148.1| hypothetical protein TcasGA2_TC010139 [Tribolium castaneum]
Length = 760
Score = 71.2 bits (173), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 56/210 (26%), Positives = 93/210 (44%), Gaps = 29/210 (13%)
Query: 27 CAHTTVRDSRCIFCSQAM--ND---SFGLSFDYMLRGLRYSEQ--------------EER 67
C H TV + C C + ND + + + + L+ SE+ +R
Sbjct: 82 CTHPTVMNDMCAECGTDLRKNDVSVAASVPMVHAIPDLKVSEELAQKLGKADVDRLIRDR 141
Query: 68 KLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQ 127
KL L+++LD TL+H N + + + Q++ + +LRP FL
Sbjct: 142 KLVLLVDLDQTLIHTTNDHIQPNIKDIYRFQLYGPNSPWY------FTRLRPGTHQFLNN 195
Query: 128 ASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE---DFNGKDRKNPDLVRGQE 184
++++CT R YA +LD D K+FS+RI++R+ D K L +
Sbjct: 196 IYPFYELHICTFGARNYAHMIAAVLDRDQKFFSNRILSRDECFDPTSKKANLKALFPCGD 255
Query: 185 RGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+ I+DD E VWS + NLI + Y +F+
Sbjct: 256 NMVCIIDDREDVWS-NAANLIHVKPYHFFQ 284
>gi|24762673|ref|NP_611934.1| Fcp1 [Drosophila melanogaster]
gi|7291810|gb|AAF47230.1| Fcp1 [Drosophila melanogaster]
Length = 880
Score = 71.2 bits (173), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 54/212 (25%), Positives = 94/212 (44%), Gaps = 31/212 (14%)
Query: 27 CAHTTVRDSRCIFCS-------QAMNDSFGLSFDYMLRGLRYSEQ--------------E 65
C HTTV C C + + + L+ +++
Sbjct: 145 CIHTTVIKDMCADCGADLRQNENGQTSEASVPMVHTMPDLKVTQKLAQKLGHDDTRRLLA 204
Query: 66 ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFL 125
+RKL L+++LD T++H N + + Q++ + +LRP FL
Sbjct: 205 DRKLVLLVDLDQTVIHTTNDTVPDNIKGIYHFQLYGPHSPWYH------TRLRPGTAEFL 258
Query: 126 EQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED-FNGKDRKN--PDLVRG 182
E+ S L ++++CT R YA +LLD + K+FS RI++R++ FN + + L
Sbjct: 259 ERMSQLYELHICTFGARNYAHMIAQLLDPEGKFFSHRILSRDECFNATSKTDNLKALFPN 318
Query: 183 QERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+ + I+DD E VW + NLI + Y +F+
Sbjct: 319 GDSMVCIIDDREDVW-NMASNLIQVKPYHFFQ 349
>gi|195353179|ref|XP_002043083.1| GM11819 [Drosophila sechellia]
gi|194127171|gb|EDW49214.1| GM11819 [Drosophila sechellia]
Length = 874
Score = 71.2 bits (173), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 54/212 (25%), Positives = 94/212 (44%), Gaps = 31/212 (14%)
Query: 27 CAHTTVRDSRCIFCS-------QAMNDSFGLSFDYMLRGLRYSEQ--------------E 65
C HTTV C C + + + L+ +++
Sbjct: 139 CIHTTVIKDMCADCGADLRQNENGQTSEASVPMVHTMPDLKVTQKLAQKLGHDDTRRLLA 198
Query: 66 ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFL 125
+RKL L+++LD T++H N + + Q++ + +LRP FL
Sbjct: 199 DRKLVLLVDLDQTVIHTTNDTVPDNIKGIYHFQLYGPHSPWYH------TRLRPGTAEFL 252
Query: 126 EQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED-FNGKDRKN--PDLVRG 182
E+ S L ++++CT R YA +LLD + K+FS RI++R++ FN + + L
Sbjct: 253 ERMSQLYELHICTFGARNYAHMIAQLLDPEGKFFSHRILSRDECFNATSKTDNLKALFPN 312
Query: 183 QERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+ + I+DD E VW + NLI + Y +F+
Sbjct: 313 GDSMVCIIDDREDVW-NMASNLIQVKPYHFFQ 343
>gi|428672202|gb|EKX73116.1| conserved hypothetical protein [Babesia equi]
Length = 739
Score = 71.2 bits (173), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 75/270 (27%), Positives = 114/270 (42%), Gaps = 68/270 (25%)
Query: 12 KTKFVIKRKCEQSLSCAHTTVRDSRCIFCSQAMN----DSFGL----------SFDYMLR 57
KTK I + E S C H V C++CS +N D + + SFD ++
Sbjct: 156 KTKSDILGRIESS-ECNHEVVIHGLCVYCSTLVNPPKEDDYDIDQSDPKRRCGSFDQVVP 214
Query: 58 G-----------------LRYSE----QEERKLQLVLNLDHTLLHCRNIKSLSS------ 90
G + Y+E ++RKL LVL+LD+TLLH + K S
Sbjct: 215 GFITNDSAMRINSSLAYDMEYNEILKVLQKRKLCLVLDLDNTLLHASSQKLPSDVYVDEI 274
Query: 91 -------------------GEKYLKKQIHSFIGSLF------QMANDKLVKLRPFVRTFL 125
G L+K+ S I M KLRP V FL
Sbjct: 275 DFLSKDADIFKDVQYNDDEGTLKLRKKFESSIIQTMVYNESETMCCKSYFKLRPGVFKFL 334
Query: 126 EQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVRGQER 185
++ S+ ++YL TM T+ +A +++K+LD YF +RI R D + + +
Sbjct: 335 KEMSAKFELYLFTMGTKQHASSSLKILDPKRIYFGNRIFCRNDSRSSMKSLDRIFPKHKN 394
Query: 186 GIVILDDTESVWSDHTENLIVLGKYVYFRD 215
++I+DDTE VW+ + LI + Y +F D
Sbjct: 395 LVLIVDDTEHVWTCNL-GLIKIHPYFFFPD 423
>gi|195489702|ref|XP_002092848.1| GE11441 [Drosophila yakuba]
gi|194178949|gb|EDW92560.1| GE11441 [Drosophila yakuba]
Length = 879
Score = 71.2 bits (173), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 54/212 (25%), Positives = 94/212 (44%), Gaps = 31/212 (14%)
Query: 27 CAHTTVRDSRCIFCS-------QAMNDSFGLSFDYMLRGLRYSEQ--------------E 65
C HTTV C C + + + L+ +++
Sbjct: 144 CIHTTVIKDMCADCGADLRQNENGQTSEASVPMVHTMPDLKVTQKLAQKLGHDDTRRLLA 203
Query: 66 ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFL 125
+RKL L+++LD T++H N + + Q++ + +LRP FL
Sbjct: 204 DRKLVLLVDLDQTVIHTTNDTVPDNIKGIYHFQLYGPHSPWYH------TRLRPGTAEFL 257
Query: 126 EQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED-FNGKDRKN--PDLVRG 182
E+ S L ++++CT R YA +LLD + K+FS RI++R++ FN + + L
Sbjct: 258 ERMSQLYELHICTFGARNYAHMIAQLLDPEGKFFSHRILSRDECFNATSKTDNLKALFPN 317
Query: 183 QERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+ + I+DD E VW + NLI + Y +F+
Sbjct: 318 GDSMVCIIDDREDVW-NMASNLIQVKPYHFFQ 348
>gi|50306333|ref|XP_453140.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
gi|49642274|emb|CAH00236.1| KLLA0D01595p [Kluyveromyces lactis]
Length = 719
Score = 71.2 bits (173), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 60/225 (26%), Positives = 101/225 (44%), Gaps = 39/225 (17%)
Query: 27 CAHTTVRDSRCIFCSQAMN---DSFGLSFDYMLRGLRYSEQ--------------EERKL 69
C H C+ C ++ +S L+ ++ ++ SEQ EE+KL
Sbjct: 99 CNHDITYGGLCVQCGNTVDEEDNSKNLTISHVNTNIKVSEQQAETLERSSLTRLREEKKL 158
Query: 70 QLVLNLDHTLLHC---------------RNIKSLSSGEKYL---KKQIHSFIGSLFQMAN 111
LV++LD T++HC N K+L + + + I SF A
Sbjct: 159 VLVVDLDQTVIHCGVDPTIGEWMRDPKNPNYKALQDVKSFTLEDEPIIPSFYFGPKPPAR 218
Query: 112 DK--LVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDF 169
VKLRP ++ F E S ++++ TM+TR YA K++D + F RI++R++
Sbjct: 219 KSWYYVKLRPGLKEFFEAVSPHFEMHIYTMATRSYAHEIAKIIDPTGELFGDRILSRDEN 278
Query: 170 NGKDRKNPD-LVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
K+ + L + +V++DD VW + ENLI + Y +F
Sbjct: 279 GSLTTKSLERLFPMDQSMVVVIDDRGDVW-NWFENLIKVVPYSFF 322
>gi|8778093|gb|AAF79202.1| CTD phosphatase-like protein [Emericella nidulans]
Length = 409
Score = 70.9 bits (172), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 52/183 (28%), Positives = 89/183 (48%), Gaps = 13/183 (7%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSL--FQMANDK--------LVK 116
RKL LV++LD T++H ++ H+ + + FQ+ +D L K
Sbjct: 55 RKLSLVVDLDQTIIHAAVDPTIGEWMADKDNPNHAPVSDVRAFQLVDDGPGMRGLLVLCK 114
Query: 117 LRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKN 176
LRP + FL+ + + ++++ TM TR YA+A ++D D K F RI++R++ KN
Sbjct: 115 LRPGLEEFLKNVADMYELHIYTMGTRSYAQAIANIIDPDRKLFGDRILSRDESGSLSVKN 174
Query: 177 PDLV-RGQERGIVILDDTESVWSDHTENLIVLGKYVYFRD-KELNGDHKSYSETLTDESE 234
+ + +VI+DD VW + NLI + Y +F ++N + L E
Sbjct: 175 LHRIFPVDTKMVVIIDDRGDVWR-WSPNLIKVIPYDFFVGIGDINSSFLPKKQELETPGE 233
Query: 235 NEE 237
N+E
Sbjct: 234 NQE 236
>gi|354545519|emb|CCE42247.1| hypothetical protein CPAR2_807960 [Candida parapsilosis]
Length = 786
Score = 70.9 bits (172), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 62/234 (26%), Positives = 102/234 (43%), Gaps = 47/234 (20%)
Query: 26 SCAHTTVRDSRCIFCSQAMNDSFGLS-FDYMLR----------GLRYSEQE--------- 65
+CAHT C C +++ + S +DY R GL+ S E
Sbjct: 98 ACAHTVQYGGLCALCGKSLEEERDYSGYDYEDRATIAMSHDNSGLKISFDEAAKIEHSTT 157
Query: 66 -----ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSL--FQMANDKLV--- 115
E+KL LV++LD T++H ++ + + + + F + D +V
Sbjct: 158 DRLNDEKKLILVVDLDQTVIHATVDPTVGEWQSDPSNPNYPAVKDVKTFCLEEDPIVPPG 217
Query: 116 ---------------KLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFS 160
K+RP + FLE+ + ++++ TM+TR YA A K++D D KYF
Sbjct: 218 WTGPKLAPTKCWYYVKVRPGLSEFLEKMDTKYEMHIYTMATRNYALAIAKIIDPDGKYFG 277
Query: 161 SRIIAREDFNGKDRKN-PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
RI++R++ KN L + +VI+DD VW NLI + Y +F
Sbjct: 278 DRILSRDESGSLTHKNLKRLFPVDQSMVVIIDDRGDVWQ-WENNLIKVVPYDFF 330
>gi|121705758|ref|XP_001271142.1| RNA Polymerase II CTD phosphatase Fcp1, putative [Aspergillus
clavatus NRRL 1]
gi|119399288|gb|EAW09716.1| RNA Polymerase II CTD phosphatase Fcp1, putative [Aspergillus
clavatus NRRL 1]
Length = 826
Score = 70.9 bits (172), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 47/158 (29%), Positives = 80/158 (50%), Gaps = 12/158 (7%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSL--FQMANDKL--------VK 116
+KL LV++LD T++H ++ + H + + FQ+ +D VK
Sbjct: 158 KKLSLVVDLDQTIIHATVDPTVREWMEDKDNPNHEALSDVRAFQLVDDGPGMRGCWYYVK 217
Query: 117 LRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKN 176
LRP + +FL+ + L ++++ TM TR YA+ ++D D K F RI++R++ KN
Sbjct: 218 LRPGLESFLQNVAELFELHIYTMGTRAYAQHIAAIIDPDRKLFGDRILSRDESGSLTAKN 277
Query: 177 -PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
L + +VI+DD VW + NLI + Y +F
Sbjct: 278 LQRLFPVDTKMVVIIDDRGDVWR-WSPNLIKVSPYDFF 314
>gi|326437795|gb|EGD83365.1| hypothetical protein PTSG_03974 [Salpingoeca sp. ATCC 50818]
Length = 864
Score = 70.9 bits (172), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 41/115 (35%), Positives = 64/115 (55%), Gaps = 7/115 (6%)
Query: 106 LFQMANDK---LVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSR 162
FQ+ D K+RP V+ FLE + ++++ TM TR YA+ ++D + YFS+R
Sbjct: 21 FFQIGGDPRFYYTKIRPGVKEFLEAVKDMYELHVYTMGTRAYAKEICNIIDPGAHYFSTR 80
Query: 163 IIAREDFNGKDRKNPDLVRGQERG---IVILDDTESVWSDHTENLIVLGKYVYFR 214
I+ +++ D K+ +L RG +VILDDT ++W D NLI Y YF+
Sbjct: 81 ILTQDESARIDTKSINLNHLFPRGDDMVVILDDTAAMW-DFRPNLIPAAPYDYFQ 134
>gi|359079164|ref|XP_003587804.1| PREDICTED: RNA polymerase II subunit A C-terminal domain
phosphatase-like [Bos taurus]
Length = 994
Score = 70.5 bits (171), Expect = 9e-10, Method: Composition-based stats.
Identities = 49/157 (31%), Positives = 81/157 (51%), Gaps = 16/157 (10%)
Query: 64 QEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLV---KLRPF 120
RKL L+++LD TL+H + E++ ++ + I FQ+ + + +LRP
Sbjct: 175 HRNRKLVLMVDLDQTLIH--------TTEQHCQQMSNKGIFH-FQLGRGEPMLHTRLRPH 225
Query: 121 VRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE---DFNGKDRKNP 177
+ FLE+ + L ++++ T +R YA LD + K FS RI++R+ D K
Sbjct: 226 CKEFLEKVARLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECIDPFSKTGNLR 285
Query: 178 DLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+L + + I+DD E VW NLI + KYVYF+
Sbjct: 286 NLFPCGDSMVCIIDDREDVWK-FAPNLITVKKYVYFQ 321
>gi|345324709|ref|XP_001509122.2| PREDICTED: RNA polymerase II subunit A C-terminal domain
phosphatase [Ornithorhynchus anatinus]
Length = 1168
Score = 70.5 bits (171), Expect = 1e-09, Method: Composition-based stats.
Identities = 49/157 (31%), Positives = 81/157 (51%), Gaps = 16/157 (10%)
Query: 64 QEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLV---KLRPF 120
RKL L+++LD TL+H + E++ ++ + I FQ+ + + +LRP
Sbjct: 184 HRNRKLVLMVDLDQTLIH--------TTEQHCQQMSNKGIFH-FQLGRGEPMLHTRLRPH 234
Query: 121 VRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE---DFNGKDRKNP 177
+ FLE+ + L ++++ T +R YA LD + K FS RI++R+ D K
Sbjct: 235 CKEFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECIDPFSKTGNLR 294
Query: 178 DLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+L + + I+DD E VW NLI + KYVYF+
Sbjct: 295 NLFPCGDSMVCIIDDREDVWK-FAPNLITVKKYVYFQ 330
>gi|395830784|ref|XP_003788497.1| PREDICTED: RNA polymerase II subunit A C-terminal domain
phosphatase [Otolemur garnettii]
Length = 1290
Score = 70.5 bits (171), Expect = 1e-09, Method: Composition-based stats.
Identities = 50/156 (32%), Positives = 79/156 (50%), Gaps = 16/156 (10%)
Query: 64 QEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLV---KLRPF 120
RKL L+++LD TL+H + E++ + + I FQ+ + + +LRP
Sbjct: 178 HRNRKLVLMVDLDQTLIH--------TTEQHCPQMSNKGIFH-FQLGRGEPMLHTRLRPH 228
Query: 121 VRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE---DFNGKDRKNP 177
R FLE+ + L ++++ T +R YA LD + K FS RI++R+ D K
Sbjct: 229 CRDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECIDPFSKTGNLR 288
Query: 178 DLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
+L + + I+DD E VW NLI + KYVYF
Sbjct: 289 NLFPCGDSMVCIIDDREDVWK-FAPNLITVKKYVYF 323
>gi|441603466|ref|XP_004087808.1| PREDICTED: LOW QUALITY PROTEIN: RNA polymerase II subunit A
C-terminal domain phosphatase [Nomascus leucogenys]
Length = 1236
Score = 70.5 bits (171), Expect = 1e-09, Method: Composition-based stats.
Identities = 49/157 (31%), Positives = 81/157 (51%), Gaps = 16/157 (10%)
Query: 64 QEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLV---KLRPF 120
RKL L+++LD TL+H + E++ ++ + I FQ+ + + +LRP
Sbjct: 178 HRNRKLVLMVDLDQTLIH--------TTEQHCQQMSNKGIFH-FQLGRGEPMLHTRLRPH 228
Query: 121 VRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE---DFNGKDRKNP 177
+ FLE+ + L ++++ T +R YA LD + K FS RI++R+ D K
Sbjct: 229 CKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECIDPFSKTGNLR 288
Query: 178 DLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+L + + I+DD E VW NLI + KYVYF+
Sbjct: 289 NLFPCGDSMVCIIDDREDVWK-FAPNLITVKKYVYFQ 324
>gi|332850750|ref|XP_001144243.2| PREDICTED: RNA polymerase II subunit A C-terminal domain
phosphatase isoform 2 [Pan troglodytes]
Length = 1026
Score = 70.5 bits (171), Expect = 1e-09, Method: Composition-based stats.
Identities = 49/157 (31%), Positives = 81/157 (51%), Gaps = 16/157 (10%)
Query: 64 QEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLV---KLRPF 120
RKL L+++LD TL+H + E++ ++ + I FQ+ + + +LRP
Sbjct: 178 HRNRKLVLMVDLDQTLIH--------TTEQHCQQMSNKGIFH-FQLGRGEPMLHTRLRPH 228
Query: 121 VRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE---DFNGKDRKNP 177
+ FLE+ + L ++++ T +R YA LD + K FS RI++R+ D K
Sbjct: 229 CKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECIDPFSKTGNLR 288
Query: 178 DLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+L + + I+DD E VW NLI + KYVYF+
Sbjct: 289 NLFPCGDSMVCIIDDREDVWK-FAPNLITVKKYVYFQ 324
>gi|296222911|ref|XP_002757404.1| PREDICTED: RNA polymerase II subunit A C-terminal domain
phosphatase [Callithrix jacchus]
Length = 1053
Score = 70.5 bits (171), Expect = 1e-09, Method: Composition-based stats.
Identities = 49/157 (31%), Positives = 81/157 (51%), Gaps = 16/157 (10%)
Query: 64 QEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLV---KLRPF 120
RKL L+++LD TL+H + E++ ++ + I FQ+ + + +LRP
Sbjct: 178 HRNRKLVLMVDLDQTLIH--------TTEQHCQQMSNKGIFH-FQLGRGEPMLHTRLRPH 228
Query: 121 VRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE---DFNGKDRKNP 177
+ FLE+ + L ++++ T +R YA LD + K FS RI++R+ D K
Sbjct: 229 CKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECIDPFSKTGNLR 288
Query: 178 DLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+L + + I+DD E VW NLI + KYVYF+
Sbjct: 289 NLFPCGDSMVCIIDDREDVWK-FAPNLITVKKYVYFQ 324
>gi|123401628|ref|XP_001301902.1| NLI interacting factor-like phosphatase family protein [Trichomonas
vaginalis G3]
gi|121883137|gb|EAX88972.1| NLI interacting factor-like phosphatase family protein [Trichomonas
vaginalis G3]
Length = 461
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 73/301 (24%), Positives = 131/301 (43%), Gaps = 47/301 (15%)
Query: 27 CAHTTVRDSRCIFCSQAM---------------NDSFGLSFDYMLRGLRYSEQ---EERK 68
C+H V + C CS + N +SF+ R EQ + +K
Sbjct: 6 CSHPVVINGICTTCSSQIDQKLLDTNYVRADPNNSIVMISFEEAKRKNLEEEQRLIDAKK 65
Query: 69 LQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQ--MANDKLVKLRPFVRTFLE 126
L LV++LD TL+ +++ + + K + F+ M + L++ RP VR FL
Sbjct: 66 LSLVIDLDKTLIDTTEVRNRAEVDAIKKLDPAATEDDFFEFNMNQNLLIRYRPHVRQFLA 125
Query: 127 QASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAR--EDF---NGKDRKNPDLVR 181
+ D+ + T+++ YA A + +D + K F +RI +R EDF R D+V
Sbjct: 126 SIAPYFDMQIYTLASPAYAHAILSKIDPEDKLFKNRIFSRTAEDFAMIKEAMRNQTDIVN 185
Query: 182 GQ---------ERGIVILDDTESVW-SDHT---ENLIVLGKYVYFRDKELNGDHKSYSET 228
+ ++ +++LDD+ VW D+ + L+ + +Y YF + N T
Sbjct: 186 KKNIKKIFPYSDKLVLVLDDSPEVWFCDNNKLFKGLVQIKRYSYFTRQGPNS-----PPT 240
Query: 229 LTDESENEEALANVLRVLKTIHRLF---FDSVCGDVRTYLPKVRSE-FSRDVLYFSAIFR 284
+ + N++ L + VL +H +F +D V L + +++ F YFS +
Sbjct: 241 VNPDYVNDDILIQMRSVLIDVHDMFYKNYDPEESHVIMTLHQRKAQVFEGKTFYFSGLSE 300
Query: 285 D 285
D
Sbjct: 301 D 301
>gi|339254478|ref|XP_003372462.1| conserved hypothetical protein [Trichinella spiralis]
gi|316967111|gb|EFV51594.1| conserved hypothetical protein [Trichinella spiralis]
Length = 683
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 43/152 (28%), Positives = 79/152 (51%), Gaps = 10/152 (6%)
Query: 66 ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFL 125
++KL L+++LD TL+H S Q+ + +LRP+ R FL
Sbjct: 229 QKKLALLVDLDLTLIHTSETSDDSDALDVYHYQMEGPNSPWYH------TRLRPYARYFL 282
Query: 126 EQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPD---LVRG 182
++ + ++++ T R YAE VK+LD ++ F RI++R++ + K P+ L G
Sbjct: 283 KKINEYFELHIITHGNRKYAEKVVKMLDPNNVLFGDRILSRDECFDPNMKAPNLKALFPG 342
Query: 183 QERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+ + I+DD E VW ++ EN++ + Y +F+
Sbjct: 343 GDDLVCIIDDREDVW-NYAENVVRVRPYRFFK 373
>gi|198438317|ref|XP_002131972.1| PREDICTED: similar to MGC81710 protein [Ciona intestinalis]
Length = 895
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 61/216 (28%), Positives = 107/216 (49%), Gaps = 27/216 (12%)
Query: 6 CKECVGKTKFVIKRKCEQSLSCAHTTVRDSRC-IFCSQAMNDSFGLSFDYMLRGLRYSEQ 64
C EC G ++KR+CE+ AH ++ S + S+ + G L L
Sbjct: 92 CAEC-GVDLRMVKRRCEKQ---AHVSMIPSVPELKISKQQAEEIGNQDKSRLHKLN---- 143
Query: 65 EERKLQLVLNLDHTLLHC---RNIKSLSSGEK-YLKKQIHSFIGSLFQMANDKLVKLRPF 120
KL L+++LD TL+H + ++ S EK + Q+H +L+ KLRP+
Sbjct: 144 ---KLVLLVDLDQTLIHTTQNQAFAAMCSEEKDFFTFQLHKNEPTLY-------TKLRPY 193
Query: 121 VRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPD-- 178
R FL++ S ++ + T +R YA + +D K+F++RI++R++ +K+ +
Sbjct: 194 CREFLQEISKCYELQVVTFGSRLYAHKIAEFIDPKKKFFANRILSRDECINPMKKSGNLR 253
Query: 179 -LVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
L + + I+DD + VWS NL+++ KY YF
Sbjct: 254 HLFPCGDSMVCIIDDRDDVWSS-APNLVMVKKYSYF 288
>gi|224075473|ref|XP_002304648.1| predicted protein [Populus trichocarpa]
gi|222842080|gb|EEE79627.1| predicted protein [Populus trichocarpa]
Length = 238
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 64/234 (27%), Positives = 98/234 (41%), Gaps = 48/234 (20%)
Query: 139 MSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNG------KDRKNPDL--VRGQERGIVIL 190
M + YA K+LD F+ R+++R D + K+ DL V G E G+VI+
Sbjct: 1 MGNKLYATEMAKVLDPKGVLFAGRVVSRGDDGDLLDGDERVPKSKDLEGVLGMESGVVII 60
Query: 191 DDTESVWSDHTENLIVLGKYVYF--RDKELNGDHKSYSETLTDESENEEALANVLRVLKT 248
DD+ VW + NLIV+ +Y+YF ++ S E DE + LA L V++
Sbjct: 61 DDSLRVWPHNKLNLIVVERYIYFPCSRRQFGLPGPSLLEIDHDERPEDGTLACSLAVIER 120
Query: 249 IHRLFFDSVC---GDVRTYLP-KVRSEFSRDVLYFSAIFR--------DCLWAEQEE--- 293
IH+ FF DVR L + R + + FS +F LW E+
Sbjct: 121 IHQNFFTHHSLDEADVRNILASEQRKILAGCRIVFSRVFPVGEVNPHLHPLWQSAEQFGA 180
Query: 294 -----------------------KFLVQEKKFLVHPRWIDAYYFLWRRRPEDDY 324
+ + +F+VHP W++A L+RR E D+
Sbjct: 181 VCTNQIDEQVTHVVANSLGTDKVNWALSTGRFVVHPGWVEASALLYRRANEQDF 234
>gi|406865754|gb|EKD18795.1| FCP1-like phosphatase [Marssonina brunnea f. sp. 'multigermtubi'
MB_m1]
Length = 863
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 59/226 (26%), Positives = 99/226 (43%), Gaps = 38/226 (16%)
Query: 26 SCAHTTVRDSRCIFCSQAMND----SFG-------LSFDYMLRGLRYSEQE--------- 65
+C H+ C C + M + SFG ++ + L+ S+ E
Sbjct: 98 TCPHSVQFQGLCGMCGKDMTEVTFASFGDDTARANINMIHDHTSLKVSQDEASKAEDELQ 157
Query: 66 -----ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSL--FQMANDKL---- 114
RKL LV++LD T++H ++ ++ + + + FQ+ +D
Sbjct: 158 RRLLKHRKLSLVVDLDQTIIHACIEPTVGEWQRDKNSPNYEAVKDVKSFQLNDDGPRGLA 217
Query: 115 ------VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAR-E 167
+K+RP + FL S L ++++ TM TR YA K++D D K F RII+R E
Sbjct: 218 SGCWYYIKMRPGLAEFLAHISELYELHVYTMGTRAYAINIAKIVDPDKKLFGDRIISRDE 277
Query: 168 DFNGKDRKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
+ N + L + +VI+DD VW + NLI + Y +F
Sbjct: 278 NGNVTAKSLARLFPVDTKMVVIIDDRADVWPQNRPNLIKVVPYDFF 323
>gi|150866706|ref|XP_001386384.2| hypothetical protein PICST_63097 [Scheffersomyces stipitis CBS
6054]
gi|149387962|gb|ABN68355.2| predicted protein [Scheffersomyces stipitis CBS 6054]
Length = 790
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 67/239 (28%), Positives = 106/239 (44%), Gaps = 47/239 (19%)
Query: 21 CEQSLSCAHTTVRDSRCIFCSQAMNDS-----------FGLSFDYMLRGLRYS------- 62
C C+HT C C +A+ D +S + GL+ S
Sbjct: 93 CSIREPCSHTVQYGGLCALCGKAVEDEKDYSGYTFEDRATISMSHDNTGLKISLDEAAKI 152
Query: 63 EQ-------EERKLQLVLNLDHTLLHCR------NIKSLSSGEKYLK-KQIHSFI----- 103
EQ EE+KL LV++LD T++H +S S Y K + +F
Sbjct: 153 EQSTTDRLNEEKKLILVVDLDQTVIHATVDPTVGEWQSDPSNPNYPAIKDVKTFCLEEEA 212
Query: 104 -----GSLFQMANDK---LVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLD 155
+ ++A K VK+RP + FLE+ +L ++++ TM+TR YA A K++D
Sbjct: 213 IVPPGWTGPRLAPTKCWYYVKVRPGLSDFLEEIVNLYEMHIYTMATRNYALAIAKIIDPT 272
Query: 156 SKYFSSRIIAREDFNGKDRKN-PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
KYF RI++R++ KN L + +VI+DD +W + NLI + Y +F
Sbjct: 273 GKYFGDRILSRDESGSLTHKNLKRLFPVDQSMVVIIDDRGDIWQWES-NLIKVVPYDFF 330
>gi|302889251|ref|XP_003043511.1| predicted protein [Nectria haematococca mpVI 77-13-4]
gi|256724428|gb|EEU37798.1| predicted protein [Nectria haematococca mpVI 77-13-4]
Length = 765
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 48/162 (29%), Positives = 82/162 (50%), Gaps = 14/162 (8%)
Query: 66 ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSL--FQMANDK---------L 114
+RKL LV++LD T++H ++ ++ H + + FQ+ +
Sbjct: 156 QRKLTLVVDLDQTIIHACIEPTIGEWQRDPTNPNHQAVKDVKSFQLDDGPRGLASGCTYY 215
Query: 115 VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGK-- 172
+KLRP + FLE+ S + ++++ TM TR YA +++D D K F +R+I+R D NG
Sbjct: 216 IKLRPGLAEFLEEISKMYELHVYTMGTRAYALNIARIVDPDKKLFGNRVISR-DENGSIT 274
Query: 173 DRKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+ L +VI+DD VW + NLI + Y +F+
Sbjct: 275 SKSLQRLFPVSTDMVVIIDDRADVWPLNRPNLIKVVPYDFFK 316
>gi|195440020|ref|XP_002067857.1| GK12500 [Drosophila willistoni]
gi|194163942|gb|EDW78843.1| GK12500 [Drosophila willistoni]
Length = 657
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 55/212 (25%), Positives = 98/212 (46%), Gaps = 31/212 (14%)
Query: 27 CAHTTVRDSRCIFCSQAM--NDSFGLS-----FDYMLRGLRYSEQ--------------E 65
C H TV C C + ND+ +S + + L+ +++
Sbjct: 129 CLHNTVMRDMCADCGADLRQNDNAQMSEASVPMVHTMPDLKVTQKLAQKLGHDDTRRLLN 188
Query: 66 ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFL 125
+RKL L+++LD T++H N + + Q++ + LRP FL
Sbjct: 189 DRKLVLLVDLDQTIIHTTNDPVPENIKGIHHFQLYGSQSPWYHTC------LRPGTTQFL 242
Query: 126 EQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED-FNGKDRKN--PDLVRG 182
E+ S + ++++CT R YA +L+D + K FS RI++R++ FN + + L
Sbjct: 243 ERMSQMYELHICTFGARKYAHMIAQLIDPEGKLFSHRILSRDECFNATSKMDNLKALFPN 302
Query: 183 QERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
++ + I+DD E VW+ T NLI + Y +F+
Sbjct: 303 GDKMVCIIDDREDVWNMAT-NLIQVKPYHFFQ 333
>gi|393225696|gb|EJD33619.1| HAD-like protein, partial [Auricularia delicata TFB-10046 SS5]
Length = 155
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 47/154 (30%), Positives = 76/154 (49%), Gaps = 17/154 (11%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLE 126
RKL LV++LD+T++H I + E+ + Q H+ + F + RP +R FL+
Sbjct: 11 RKLSLVVDLDNTIVH--TIVVRTDDERMARMQDHNHGSTTFTGS------CRPGLRAFLQ 62
Query: 127 QASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKN-----PDLVR 181
S + + TM TR YAE +D D + F RI +R++ G K+ P +
Sbjct: 63 TISEKYEPTVYTMGTRGYAEKVCAAVDGDERVFGGRIFSRDENEGNSTKSLSRLFPPCDK 122
Query: 182 GQERGIVILDDTESVWSDHTENLIVLGKYVYFRD 215
I+DD+ VW D +N++ + YV+F D
Sbjct: 123 SM---TAIIDDSRKVWEDK-KNIVSVQPYVFFGD 152
>gi|432884093|ref|XP_004074439.1| PREDICTED: RNA polymerase II subunit A C-terminal domain
phosphatase-like [Oryzias latipes]
Length = 1129
Score = 69.7 bits (169), Expect = 2e-09, Method: Composition-based stats.
Identities = 48/157 (30%), Positives = 81/157 (51%), Gaps = 16/157 (10%)
Query: 64 QEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLV---KLRPF 120
RKL L+++LD TL+H + E++ ++ + I FQ+ + + +LRP
Sbjct: 168 HRNRKLVLMVDLDQTLIH--------TTEQHCQQMSNKGIFH-FQLGRGEPMLHTRLRPH 218
Query: 121 VRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPD-- 178
+ FLE+ + L ++++ T +R YA LD + K FS RI++R++ K +
Sbjct: 219 CKEFLEKTAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECIDPFSKTGNLR 278
Query: 179 -LVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
L + + I+DD E VW NLI + KYVYF+
Sbjct: 279 YLFPCGDSMVCIIDDREDVWK-FAPNLITVKKYVYFQ 314
>gi|296810642|ref|XP_002845659.1| RNA polymerase II subunit A C-terminal domain phosphatase
[Arthroderma otae CBS 113480]
gi|238843047|gb|EEQ32709.1| RNA polymerase II subunit A C-terminal domain phosphatase
[Arthroderma otae CBS 113480]
Length = 832
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 46/153 (30%), Positives = 79/153 (51%), Gaps = 12/153 (7%)
Query: 72 VLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSL--FQMANDKL--------VKLRPFV 121
V++LD T++H +++ ++ H + + FQ+ +D +KLRP +
Sbjct: 134 VVDLDQTIIHATVDPTVAEWQQDKDNPNHDAVKDVRCFQLVDDGPGMRGCWYYIKLRPGL 193
Query: 122 RTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKN-PDLV 180
FL+ SSL ++++ TM TR YA+ ++D D K F RI++R++ KN L
Sbjct: 194 EEFLKVVSSLYELHIYTMGTRAYAQNVANIVDPDRKIFGDRILSRDESGSLTAKNLHRLF 253
Query: 181 RGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
+ +VI+DD VW +ENLI + Y +F
Sbjct: 254 PVDTKMVVIIDDRGDVWK-WSENLIKVTPYDFF 285
>gi|268566337|ref|XP_002639695.1| C. briggsae CBR-FCP-1 protein [Caenorhabditis briggsae]
Length = 723
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 54/206 (26%), Positives = 102/206 (49%), Gaps = 19/206 (9%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSL-FQMANDK---------LVK 116
RKL L+++LD T++H + K +S + + ++ +L FQ + K
Sbjct: 141 RKLVLLVDLDQTIIHTSD-KPMSVDAEKRRNRVKPQDNNLNFQHKDITKYNLHSRVYTTK 199
Query: 117 LRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDF---NGKD 173
LRP FL + S++ ++++ T R YA ++LD D++ F RI++R++ K
Sbjct: 200 LRPHTTEFLNKMSAMYEMHIVTYGQRQYAHRIAQILDPDARLFGQRILSRDELFSAQHKT 259
Query: 174 RKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRD-KELNGDHKSYSET---L 229
R L + +VI+DD VW ++E LI + Y +F++ ++N S + +
Sbjct: 260 RNLKALFPCGDNLVVIIDDRADVWQ-YSEALIQIKPYRFFKEVGDINAPKNSKEQMPVQI 318
Query: 230 TDESENEEALANVLRVLKTIHRLFFD 255
D++ + L + RVL IH +++
Sbjct: 319 EDDAHEDRVLEEIERVLTNIHDKYYE 344
>gi|327296037|ref|XP_003232713.1| RNA Polymerase II CTD phosphatase [Trichophyton rubrum CBS 118892]
gi|326465024|gb|EGD90477.1| RNA Polymerase II CTD phosphatase [Trichophyton rubrum CBS 118892]
Length = 836
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 45/153 (29%), Positives = 79/153 (51%), Gaps = 12/153 (7%)
Query: 72 VLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSL--FQMANDKL--------VKLRPFV 121
V++LD T++H +++ ++ H + + FQ+ +D +KLRP +
Sbjct: 134 VVDLDQTIIHATVDPTVAEWQQDKDNPNHDAVKDVRCFQLVDDGPGMRGCWYYIKLRPGL 193
Query: 122 RTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKN-PDLV 180
FL+ S+L ++++ TM TR YA+ ++D D K F RI++R++ KN L
Sbjct: 194 EEFLKVISTLYELHIYTMGTRAYAQNVANIVDPDKKIFGDRILSRDESGSLTAKNLQRLF 253
Query: 181 RGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
+ +VI+DD VW +ENLI + Y +F
Sbjct: 254 PVDTKMVVIIDDRGDVWK-WSENLIKVSPYDFF 285
>gi|407929624|gb|EKG22436.1| BRCT domain-containing protein [Macrophomina phaseolina MS6]
Length = 861
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 46/158 (29%), Positives = 84/158 (53%), Gaps = 12/158 (7%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSL--FQMANDKL--------VK 116
+KL LV++LD T++H +++ +K + + + + FQ+ ++ +K
Sbjct: 159 KKLSLVVDLDQTIIHATVDPTVAEWQKDPENPNYEAVKDVQSFQLLDNGPGGRGCWYYIK 218
Query: 117 LRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKN 176
LRP +R FLE S + ++++ TM TR YA+ K++D + K F RI++R++ K
Sbjct: 219 LRPGLREFLENISKVYELHIYTMGTRAYAQNIAKIVDPNRKIFGDRILSRDESGSLTVKT 278
Query: 177 PDLV-RGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
+ + +VI+DD VWS + NLI + Y +F
Sbjct: 279 LHRIFPVDTKMVVIIDDRGDVWS-WSNNLIKVTPYDFF 315
>gi|344233336|gb|EGV65209.1| hypothetical protein CANTEDRAFT_104476 [Candida tenuis ATCC 10573]
gi|344233337|gb|EGV65210.1| hypothetical protein CANTEDRAFT_104476 [Candida tenuis ATCC 10573]
Length = 788
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 66/239 (27%), Positives = 104/239 (43%), Gaps = 47/239 (19%)
Query: 21 CEQSLSCAHTTVRDSRCIFCSQAMNDSFGLS-FDYMLR----------GLRYSEQE---- 65
C CAH C C +A+ D + F+Y R GL+ S +E
Sbjct: 93 CSIREPCAHAVQYGGLCALCGKAVEDEKDYTGFNYEDRATISMSHDNTGLKISYEEAAKI 152
Query: 66 ----------ERKLQLVLNLDHTLLHCR------NIKSLSSGEKYLK-KQIHSF-----I 103
+RKL LV++LD T++H +S S Y K + SF
Sbjct: 153 EQNSTTRLTQQRKLILVVDLDQTVIHATVDPTVGEWQSDPSNPNYRAVKDVQSFCLEEEP 212
Query: 104 GSLFQMANDKL--------VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLD 155
+ + KL VKLRP + FL + + + ++++ TM+TR YA A K++D +
Sbjct: 213 ITPPNWSGPKLSPTKCWYYVKLRPGLEEFLREMAEIYEMHIYTMATRNYALAIAKIIDPE 272
Query: 156 SKYFSSRIIAREDFNGKDRKN-PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
+YF RI++R++ KN L + + I+DD VW +NLI + Y +F
Sbjct: 273 GEYFGDRILSRDESGSLTHKNLKRLFPVDQSMVAIIDDRGDVWQ-WEDNLIKVVPYDFF 330
>gi|393909596|gb|EFO27947.2| hypothetical protein LOAG_00540 [Loa loa]
Length = 506
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 62/232 (26%), Positives = 102/232 (43%), Gaps = 26/232 (11%)
Query: 68 KLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQ 127
KL L+++LD TL+H N K + + D K+RP+ R FL +
Sbjct: 76 KLVLLVDLDQTLIHTTN--------HTFKVDKDTDVLHYKLKGTDFYTKIRPYAREFLRR 127
Query: 128 ASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDF---NGKDRKNPDLVRGQE 184
+ L ++++ + R YA + LD D YF RI++R++ K R L +
Sbjct: 128 MAELYEMHIISYGERQYAHRIAEFLDPDKIYFGHRILSRDELFCAMYKTRNMQALFPCGD 187
Query: 185 RGIVILDDTESVWSDHTENLIVLGKYVYFRD-KELNGDHKSYSETLTD--------ESEN 235
IV++DD VW +++ LI + Y +F++ ++N E + ESE+
Sbjct: 188 HMIVMIDDRPDVWQ-YSDALIQVKPYRFFKEIGDINAPRYEKGEPILSGSYAEQDMESED 246
Query: 236 EEALANVLRVLKTIHRLFFDSVCGDVRTYLPKVRSEFSRDVLYF-SAIFRDC 286
+E L V VL IH F++ G P ++ S Y + RDC
Sbjct: 247 DETLEYVAVVLTKIHNAFYELFDGAKINRFPDLKGIIS----YLRKQVLRDC 294
>gi|326477486|gb|EGE01496.1| RNA Polymerase II CTD phosphatase Fcp1 [Trichophyton equinum CBS
127.97]
Length = 866
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 45/153 (29%), Positives = 79/153 (51%), Gaps = 12/153 (7%)
Query: 72 VLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSL--FQMANDKL--------VKLRPFV 121
V++LD T++H +++ ++ H + + FQ+ +D +KLRP +
Sbjct: 163 VVDLDQTIIHATVDPTVAEWQQDKDNPNHDAVKDVRCFQLVDDGPGMRGCWYYIKLRPGL 222
Query: 122 RTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKN-PDLV 180
FL+ S+L ++++ TM TR YA+ ++D D K F RI++R++ KN L
Sbjct: 223 EEFLKVISTLYELHIYTMGTRAYAQNVANIVDPDKKIFGDRILSRDESGSLTAKNLQRLF 282
Query: 181 RGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
+ +VI+DD VW +ENLI + Y +F
Sbjct: 283 PVDTKMVVIIDDRGDVWK-WSENLIKVSPYDFF 314
>gi|302657133|ref|XP_003020296.1| hypothetical protein TRV_05607 [Trichophyton verrucosum HKI 0517]
gi|291184115|gb|EFE39678.1| hypothetical protein TRV_05607 [Trichophyton verrucosum HKI 0517]
Length = 865
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 45/153 (29%), Positives = 79/153 (51%), Gaps = 12/153 (7%)
Query: 72 VLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSL--FQMANDKL--------VKLRPFV 121
V++LD T++H +++ ++ H + + FQ+ +D +KLRP +
Sbjct: 163 VVDLDQTIIHATVDPTVAEWQQDKDNPNHDAVKDVRCFQLVDDGPGMRGCWYYIKLRPGL 222
Query: 122 RTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKN-PDLV 180
FL+ S+L ++++ TM TR YA+ ++D D K F RI++R++ KN L
Sbjct: 223 EEFLKVISTLYELHIYTMGTRAYAQNVANIVDPDKKIFGDRILSRDESGSLTAKNLQRLF 282
Query: 181 RGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
+ +VI+DD VW +ENLI + Y +F
Sbjct: 283 PVDTKMVVIIDDRGDVWK-WSENLIKVSPYDFF 314
>gi|295671060|ref|XP_002796077.1| conserved hypothetical protein [Paracoccidioides sp. 'lutzii' Pb01]
gi|226284210|gb|EEH39776.1| conserved hypothetical protein [Paracoccidioides sp. 'lutzii' Pb01]
Length = 829
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 44/153 (28%), Positives = 81/153 (52%), Gaps = 12/153 (7%)
Query: 72 VLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSL--FQMANDKL--------VKLRPFV 121
V++LD T++H +++ ++ H + + FQ+ +D +KLRP +
Sbjct: 259 VVDLDQTIIHATVDPTVAEWQQDRDNPNHEAVKDVRAFQLVDDGPGMKGCWYYIKLRPGL 318
Query: 122 RTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKN-PDLV 180
+ FL++ S+L ++++ TM TR YA+ ++D D K F RI++R++ KN L
Sbjct: 319 QEFLQEISALYELHIYTMGTRAYAQNIATIVDPDRKIFGDRILSRDESGSLTAKNLQRLF 378
Query: 181 RGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
+ +VI+DD VW ++NLI + Y +F
Sbjct: 379 PVDTKMVVIIDDRGDVWK-WSDNLIKVSPYDFF 410
>gi|395511850|ref|XP_003760164.1| PREDICTED: RNA polymerase II subunit A C-terminal domain
phosphatase [Sarcophilus harrisii]
Length = 1267
Score = 69.3 bits (168), Expect = 2e-09, Method: Composition-based stats.
Identities = 49/157 (31%), Positives = 81/157 (51%), Gaps = 16/157 (10%)
Query: 64 QEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLV---KLRPF 120
RKL L+++LD TL+H + E++ ++ + I FQ+ + + +LRP
Sbjct: 459 HRNRKLVLMVDLDQTLIH--------TTEQHCQQMSNKGIFH-FQLGRGEPMLHTRLRPH 509
Query: 121 VRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE---DFNGKDRKNP 177
+ FLE+ + L ++++ T +R YA LD + K FS RI++R+ D K
Sbjct: 510 CKEFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECIDPFSKTGNLR 569
Query: 178 DLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+L + + I+DD E VW NLI + KYVYF+
Sbjct: 570 NLFPCGDSMVCIIDDREDVWK-FAPNLITVKKYVYFQ 605
>gi|149241937|ref|XP_001526384.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
gi|146450507|gb|EDK44763.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
Length = 883
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 61/236 (25%), Positives = 102/236 (43%), Gaps = 51/236 (21%)
Query: 26 SCAHTTVRDSRCIFCSQAMND-------------SFGLSFDYMLRGLRYSE--------- 63
+CAHT C C ++++D S +S D + Y E
Sbjct: 98 ACAHTVQYGGLCALCGKSLDDEKDYSGYDYEERASIAMSHDNTELRISYDEAAKIEHNTT 157
Query: 64 ---QEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIG----SLFQMANDKL-- 114
+ERKL LV++LD T++H ++ GE L + ++ F + D +
Sbjct: 158 DRLNQERKLILVVDLDQTVIHATVDPTV--GEWQLDPENPNYPAVKDVRTFCLEEDPVAP 215
Query: 115 ----------------VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKY 158
VK+RP + FL++ ++++ TM+TR YA + K++D + KY
Sbjct: 216 PGWNGPKLAPTKCWYYVKVRPGLAEFLKKMDEKYEMHIYTMATRNYALSIAKIIDPEGKY 275
Query: 159 FSSRIIAREDFNGKDRKN-PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
F RI++R++ KN L + +VI+DD VW NLI + Y +F
Sbjct: 276 FGDRILSRDESGSLTHKNLKRLFPVDQSMVVIIDDRGDVWQ-WENNLIKVVPYDFF 330
>gi|326475449|gb|EGD99458.1| RNA Polymerase II CTD phosphatase [Trichophyton tonsurans CBS
112818]
Length = 866
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 45/153 (29%), Positives = 79/153 (51%), Gaps = 12/153 (7%)
Query: 72 VLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSL--FQMANDKL--------VKLRPFV 121
V++LD T++H +++ ++ H + + FQ+ +D +KLRP +
Sbjct: 163 VVDLDQTIIHATVDPTVAEWQQDKDNPNHDAVKDVRCFQLVDDGPGMRGCWYYIKLRPGL 222
Query: 122 RTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKN-PDLV 180
FL+ S+L ++++ TM TR YA+ ++D D K F RI++R++ KN L
Sbjct: 223 EEFLKVISTLYELHIYTMGTRAYAQNVANIVDPDKKIFGDRILSRDESGSLTAKNLQRLF 282
Query: 181 RGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
+ +VI+DD VW +ENLI + Y +F
Sbjct: 283 PVDTKMVVIIDDRGDVWK-WSENLIKVSPYDFF 314
>gi|429854785|gb|ELA29772.1| RNA polymerase ii ctd phosphatase [Colletotrichum gloeosporioides
Nara gc5]
Length = 829
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 50/167 (29%), Positives = 86/167 (51%), Gaps = 22/167 (13%)
Query: 66 ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSL-----FQMANDKL------ 114
+RKL LV++LD T++H + GE +++ + ++ FQ+ ++
Sbjct: 160 QRKLSLVVDLDQTIIHA--CIEPTVGE-WMEDPTNPNYNAVKDVKKFQLNDEGPRGVVTS 216
Query: 115 -----VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDF 169
+K+RP ++ FLE+ S L ++++ TM TR YA +++D D K F +R+I+R D
Sbjct: 217 GCWYYIKMRPGLKEFLEKISELYELHVYTMGTRAYAMNIAQIVDPDRKLFGNRVISR-DE 275
Query: 170 NGK--DRKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
NG + L +VI+DD VW + NLI + Y +FR
Sbjct: 276 NGSMISKSLQRLFPVNTNMVVIIDDRADVWPRNRPNLIKVVPYDFFR 322
>gi|340931931|gb|EGS19464.1| hypothetical protein CTHT_0049250 [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 871
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 45/163 (27%), Positives = 82/163 (50%), Gaps = 14/163 (8%)
Query: 65 EERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSL--FQMANDK--------- 113
+ RKL LV++LD T++ ++ ++ H + + FQ+ +
Sbjct: 159 QSRKLSLVVDLDQTIIQACIDPTVGEWQRDPTNPNHDAVKDVKSFQLDDGPSALARKCWY 218
Query: 114 LVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGK- 172
+K+RP + FL++ S + ++++ TM TR YA+ +++D D K F +R+I+R D NG
Sbjct: 219 YIKMRPGLEGFLKRISEMYELHVYTMGTRAYAQNVARVVDPDRKLFGNRVISR-DENGNI 277
Query: 173 -DRKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+ L +VI+DD VW + NLI + Y +F+
Sbjct: 278 YTKSLQRLFPVSTNMVVIIDDRSDVWPRNRPNLIKVSPYEFFK 320
>gi|448520991|ref|XP_003868400.1| Fcp1 protein [Candida orthopsilosis Co 90-125]
gi|380352740|emb|CCG25496.1| Fcp1 protein [Candida orthopsilosis]
Length = 788
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 59/235 (25%), Positives = 100/235 (42%), Gaps = 49/235 (20%)
Query: 26 SCAHTTVRDSRCIFCSQAM----------------------NDSFGLSFDYMLRGLRYSE 63
+CAHT C C +++ N +SFD + + +S
Sbjct: 98 ACAHTVQYGGLCALCGKSLEEERDYSGYDYEDRATIAMSHDNSGLKISFDEAAK-IEHST 156
Query: 64 ----QEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSL--FQMANDKLV-- 115
EE KL LV++LD T++H ++ + + + + F + D +V
Sbjct: 157 TDRLNEEEKLILVVDLDQTVIHATVDPTVGEWQSDPSNPNYPAVKDVKTFCLEEDPIVPP 216
Query: 116 ----------------KLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYF 159
K+RP + FL++ + ++++ TM+TR YA A K++D D KYF
Sbjct: 217 GWTGPKLAPTKCWYYVKVRPGLSEFLQKMDTKYEMHIYTMATRNYALAIAKIIDPDGKYF 276
Query: 160 SSRIIAREDFNGKDRKN-PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
RI++R++ KN L + +VI+DD VW NLI + Y +F
Sbjct: 277 GDRILSRDESGSLTHKNLKRLFPVDQSMVVIIDDRGDVWQ-WENNLIKVVPYDFF 330
>gi|378731871|gb|EHY58330.1| protein phosphatase [Exophiala dermatitidis NIH/UT8656]
Length = 856
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 47/158 (29%), Positives = 83/158 (52%), Gaps = 12/158 (7%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSL--FQMANDKL--------VK 116
RKL LV++LD T++H +++ +K + + + FQ+ +D +K
Sbjct: 159 RKLSLVVDLDQTIIHAAVDPTIAEWQKDKDNPNYDAVKDVRSFQLIDDGPGMRGCWYYIK 218
Query: 117 LRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKN 176
LRP + FLE S L ++++ TM TR YA+ ++D + K+F RI++R++ KN
Sbjct: 219 LRPGLTEFLEHISQLYEMHIYTMGTRQYAQQIAAIVDPERKFFGDRILSRDESGSMVAKN 278
Query: 177 PD-LVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
+ L + +VI+DD VW + NLI + + +F
Sbjct: 279 LERLFPVDTKMVVIIDDRGDVWK-WSANLIRVRPFDFF 315
>gi|226288832|gb|EEH44344.1| RNA polymerase II C-terminal domain phosphatase component
[Paracoccidioides brasiliensis Pb18]
Length = 920
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 44/153 (28%), Positives = 81/153 (52%), Gaps = 12/153 (7%)
Query: 72 VLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSL--FQMANDKL--------VKLRPFV 121
V++LD T++H +++ ++ H + + FQ+ +D +KLRP +
Sbjct: 134 VVDLDQTIIHATVDPTVAEWQQDRDNPNHEAVKDVRAFQLVDDGPGMKGCWYYIKLRPGL 193
Query: 122 RTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKN-PDLV 180
+ FL++ S+L ++++ TM TR YA+ ++D D K F RI++R++ KN L
Sbjct: 194 QEFLQEISALYELHIYTMGTRAYAQNIAAIVDPDRKIFGDRILSRDESGSLTAKNLQRLF 253
Query: 181 RGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
+ +VI+DD VW ++NLI + Y +F
Sbjct: 254 PVDTKMVVIIDDRGDVWK-WSDNLIKVSPYDFF 285
>gi|312066139|ref|XP_003136128.1| hypothetical protein LOAG_00540 [Loa loa]
Length = 577
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 80/326 (24%), Positives = 129/326 (39%), Gaps = 54/326 (16%)
Query: 2 GAYSCKECVGKTKFVIKRK-CEQSL-SCAHTTVRDSRCIFCSQAMNDSFGLSF------- 52
G + V K K IK+ SL C+H V C C + + G S
Sbjct: 53 GVVTIDATVKKGKVNIKKGMIVASLRGCSHEIVIKDMCASCGKDLRSKPGTSGNLTEAST 112
Query: 53 ---------------DYMLRGLRYSEQE----ERKLQLVLNLDHTLLHCRNIKSLSSGEK 93
D + R + ++E KL L+++LD TL+H N
Sbjct: 113 ANVSMIHHVPELIVSDELARKIGSRDRELLLKAHKLVLLVDLDQTLIHTTN--------H 164
Query: 94 YLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLD 153
K + + D K+RP+ R FL + + L ++++ + R YA + LD
Sbjct: 165 TFKVDKDTDVLHYKLKGTDFYTKIRPYAREFLRRMAELYEMHIISYGERQYAHRIAEFLD 224
Query: 154 LDSKYFSSRIIAREDF---NGKDRKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKY 210
D YF RI++R++ K R L + IV++DD VW +++ LI + Y
Sbjct: 225 PDKIYFGHRILSRDELFCAMYKTRNMQALFPCGDHMIVMIDDRPDVWQ-YSDALIQVKPY 283
Query: 211 VYFRD-KELNGDHKSYSETLTD--------ESENEEALANVLRVLKTIHRLFFDSVCGDV 261
+F++ ++N E + ESE++E L V VL IH F++ G
Sbjct: 284 RFFKEIGDINAPRYEKGEPILSGSYAEQDMESEDDETLEYVAVVLTKIHNAFYELFDGAK 343
Query: 262 RTYLPKVRSEFSRDVLYF-SAIFRDC 286
P ++ S Y + RDC
Sbjct: 344 INRFPDLKGIIS----YLRKQVLRDC 365
>gi|432105445|gb|ELK31660.1| RNA polymerase II subunit A C-terminal domain phosphatase [Myotis
davidii]
Length = 823
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 50/151 (33%), Positives = 77/151 (50%), Gaps = 10/151 (6%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLE 126
RKL L+++LD TL+H + K + +H +G M + +LRP R FLE
Sbjct: 62 RKLVLMVDLDQTLIHTTEQQCQQMSNKGI---LHFQLGRGEPMLH---TRLRPHCREFLE 115
Query: 127 QASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE---DFNGKDRKNPDLVRGQ 183
+ + L ++++ T +R YA LD + K FS RI++R+ D K +L
Sbjct: 116 KVARLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECIDPFSKTGNLRNLFPCG 175
Query: 184 ERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+ + I+DD E VW NLI + KYVYF+
Sbjct: 176 DSMVCIIDDREDVWK-FAPNLITVKKYVYFQ 205
>gi|391345370|ref|XP_003746962.1| PREDICTED: RNA polymerase II subunit A C-terminal domain
phosphatase-like [Metaseiulus occidentalis]
Length = 475
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 54/172 (31%), Positives = 86/172 (50%), Gaps = 19/172 (11%)
Query: 66 ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLK-KQIHSFIGSLFQMANDKL--VKLRPFVR 122
++KL L+++LD TL+H +S Y K K +H F +N+ ++RP
Sbjct: 29 QKKLVLLVDLDQTLIHT------TSEPVYDKIKGVHHF---RLPSSNNAWYHTRIRPGTE 79
Query: 123 TFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED-FNGKDRKN--PDL 179
FL + S L ++++ T R YA LLD KYF RI++R++ FN + + L
Sbjct: 80 DFLRKISQLFELHIVTFGARPYANHIASLLDPGKKYFQYRILSRDECFNPQSKTANLKSL 139
Query: 180 VRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRDKELNGDHKSYSETLTD 231
++ + I+DD E VW + NLI + YV+FR GD + + L D
Sbjct: 140 FPCGDQMVCIIDDREDVW-NFASNLIAVKPYVFFRGA---GDINAPAGLLAD 187
>gi|302497759|ref|XP_003010879.1| hypothetical protein ARB_02918 [Arthroderma benhamiae CBS 112371]
gi|291174424|gb|EFE30239.1| hypothetical protein ARB_02918 [Arthroderma benhamiae CBS 112371]
Length = 1048
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 45/153 (29%), Positives = 79/153 (51%), Gaps = 12/153 (7%)
Query: 72 VLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSL--FQMANDKL--------VKLRPFV 121
V++LD T++H +++ ++ H + + FQ+ +D +KLRP +
Sbjct: 347 VVDLDQTIIHATVDPTVAEWQQDKDNPNHDAVKDVRCFQLVDDGPGMRGCWYYIKLRPGL 406
Query: 122 RTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKN-PDLV 180
FL+ S+L ++++ TM TR YA+ ++D D K F RI++R++ KN L
Sbjct: 407 EEFLKVISTLYELHIYTMGTRAYAQNVANIVDPDKKIFGDRILSRDESGSLTAKNLQRLF 466
Query: 181 RGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
+ +VI+DD VW +ENLI + Y +F
Sbjct: 467 PVDTKMVVIIDDRGDVWK-WSENLIKVSPYDFF 498
>gi|254586061|ref|XP_002498598.1| ZYRO0G14168p [Zygosaccharomyces rouxii]
gi|238941492|emb|CAR29665.1| ZYRO0G14168p [Zygosaccharomyces rouxii]
Length = 764
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 54/232 (23%), Positives = 102/232 (43%), Gaps = 40/232 (17%)
Query: 21 CEQSLSCAHTTVRDSRCIFCSQAMNDS----FGLSFDYMLRGLRYSEQE----------- 65
CE C H V C C + ++++ L+ + L+ S +E
Sbjct: 99 CEIMRPCNHDVVYGGLCTMCGKEVDENDQMEANLAISHTDTNLKVSRKEAEDMEHFLKQR 158
Query: 66 ---ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIG--SLFQMANDKL------ 114
+KL LV++LD T++HC ++ +K + + +F + + +
Sbjct: 159 LRQSKKLVLVVDLDQTVIHCGVDPTIGEWKKDPSNPNYETLKDVQMFSLEEEPIVPPMYM 218
Query: 115 ------------VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSR 162
VK+RP +R F Q + L ++++ TM+TR YA K++D D F R
Sbjct: 219 GPRLPERKCWYFVKVRPGLREFFAQLAPLYEMHIYTMATRTYALEIAKIIDPDGSLFGDR 278
Query: 163 IIAREDFNGKDRKNPD-LVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
I++R++ +K+ + L + ++++DD VW + NLI + Y +F
Sbjct: 279 ILSRDENGSLTQKSLERLFPTDQSMVIVIDDRGDVW-NWCPNLIKVVPYNFF 329
>gi|148677459|gb|EDL09406.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
phosphatase, subunit 1, isoform CRA_c [Mus musculus]
Length = 1000
Score = 68.9 bits (167), Expect = 3e-09, Method: Composition-based stats.
Identities = 49/156 (31%), Positives = 79/156 (50%), Gaps = 16/156 (10%)
Query: 64 QEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLV---KLRPF 120
RKL L+++LD TL+H + E++ + + I FQ+ + + +LRP
Sbjct: 218 HRNRKLVLMVDLDQTLIH--------TTEQHCPQMSNKGIFH-FQLGRGEPMLHTRLRPH 268
Query: 121 VRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE---DFNGKDRKNP 177
+ FLE+ + L ++++ T +R YA LD + K FS RI++R+ D K
Sbjct: 269 CKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECIDPFSKTGNLR 328
Query: 178 DLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
+L + + I+DD E VW NLI + KYVYF
Sbjct: 329 NLFPCGDSMVCIIDDREDVWK-FAPNLITVKKYVYF 363
>gi|444319376|ref|XP_004180345.1| hypothetical protein TBLA_0D03260 [Tetrapisispora blattae CBS 6284]
gi|387513387|emb|CCH60826.1| hypothetical protein TBLA_0D03260 [Tetrapisispora blattae CBS 6284]
Length = 768
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 57/232 (24%), Positives = 102/232 (43%), Gaps = 40/232 (17%)
Query: 21 CEQSLSCAHTTVRDSRCIFCSQAMNDS----FGLSFDYMLRGLRYSEQEER--------- 67
C+ C H V C C + ++DS LS + L+ S +E R
Sbjct: 117 CDIKRPCNHDIVYAGICTQCGKEVDDSDIMDASLSISHTDTNLKISRKEARDIDQSSMSR 176
Query: 68 -----KLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDK--------- 113
KL LV++LD T++HC ++ + K + + + + D+
Sbjct: 177 LKKIKKLILVVDLDQTVIHCGVDPTIGEWKNDPKNPNYETLKDVRSFSLDEEPILPPSYM 236
Query: 114 -----------LVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSR 162
VK+RP ++ F + + L ++++ TM+TR YA K++D D F SR
Sbjct: 237 GPRPPVRKCWYYVKVRPGLKEFFAKIAPLYEMHIYTMATRAYALEIAKIIDPDGSLFGSR 296
Query: 163 IIAREDFNGKDRKNPD-LVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
I++R++ +K+ + L + ++I+DD VW + NLI + Y +F
Sbjct: 297 ILSRDENGSLTQKSLERLFPTDQSMVIIIDDRGDVW-NWCNNLIKVIPYNFF 347
>gi|195429765|ref|XP_002062928.1| GK19439 [Drosophila willistoni]
gi|194159013|gb|EDW73914.1| GK19439 [Drosophila willistoni]
Length = 827
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 53/212 (25%), Positives = 94/212 (44%), Gaps = 31/212 (14%)
Query: 27 CAHTTVRDSRCIFCS-------QAMNDSFGLSFDYMLRGLRYSEQ--------------E 65
C HTTV C C + + + L+ +++
Sbjct: 128 CIHTTVIKDMCADCGADLRQNENGQTSEASVPIVHTMPDLKVTQKLAQKLGHDDTRRLLA 187
Query: 66 ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFL 125
+RKL L+++LD T++H N + + Q++ + +LRP FL
Sbjct: 188 DRKLVLLVDLDQTVIHTTNDVVPDNIKGIYHFQLYGPQSPWYH------TRLRPGTADFL 241
Query: 126 EQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED-FNGKDRKN--PDLVRG 182
++ S L ++++CT R YA +LLD + K+FS RI++R++ FN + + L
Sbjct: 242 DRMSHLYELHICTFGARNYAHMIAQLLDPEGKFFSHRILSRDECFNATSKTDNLKALFPN 301
Query: 183 QERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+ + I+DD E VW + NLI + Y +F+
Sbjct: 302 GDSMVCIIDDREDVW-NMASNLIQVKPYHFFQ 332
>gi|125584005|gb|EAZ24936.1| hypothetical protein OsJ_08716 [Oryza sativa Japonica Group]
Length = 364
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 44/119 (36%), Positives = 63/119 (52%), Gaps = 8/119 (6%)
Query: 140 STRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVRGQ-------ERGIVILDD 192
T YA A KLLD D YF RII+R++ DRK+ D+V G +VILDD
Sbjct: 99 GTEDYAAAVAKLLDPDGVYFGERIISRDESPQPDRKSLDVVFGSAPASAAERAAVVILDD 158
Query: 193 TESVWSDHTENLIVLGKYVYFRDKELN-GDHKSYSETLTDESENEEALANVLRVLKTIH 250
T VW +++NLI + +Y YF + G + +L++ +E A LRVL+ +H
Sbjct: 159 TAEVWEGNSDNLIEMERYHYFASSCRDFGSPWECTHSLSERGVDESERAAALRVLRRVH 217
>gi|317144011|ref|XP_001819844.2| RNA polymerase II subunit A C-terminal domain phosphatase
[Aspergillus oryzae RIB40]
Length = 799
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 44/148 (29%), Positives = 73/148 (49%), Gaps = 13/148 (8%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLE 126
RKL LV++LD T++H + + ++ + L LRP + +FL+
Sbjct: 158 RKLSLVVDLDQTIIHA-----------TVDPTVGEWMEDKDNPNHQALSDLRPGLESFLQ 206
Query: 127 QASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKN-PDLVRGQER 185
S L ++++ TM TR YA+ ++D D K F RI++R++ KN L +
Sbjct: 207 NVSELFELHIYTMGTRAYAQHIASIIDPDRKLFGDRILSRDESGSLTAKNLHRLFPVDTK 266
Query: 186 GIVILDDTESVWSDHTENLIVLGKYVYF 213
+VI+DD VW + NLI + Y +F
Sbjct: 267 MVVIIDDRGDVWR-WSPNLIKVSPYDFF 293
>gi|384501479|gb|EIE91970.1| hypothetical protein RO3G_16681 [Rhizopus delemar RA 99-880]
Length = 494
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 63/224 (28%), Positives = 108/224 (48%), Gaps = 38/224 (16%)
Query: 65 EERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLV---KLRPFV 121
+++KL L+L+LD T++H + +S + +Q F + LV KLRP +
Sbjct: 28 DQKKLSLILDLDQTIVHASCDQRISQWQNPDIRQ--------FNLPRSPLVYYIKLRPGL 79
Query: 122 RTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNG--KDRKNPDL 179
FL++ L ++++ TM T+ YA+A K +D + F RI++R D +G +K +
Sbjct: 80 IEFLKEIEELYELHIYTMGTKDYAKAVAKEIDPEGCLFKERILSR-DESGCLTQKKLQRI 138
Query: 180 VRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRDKELNGDHKSYSETLTDESENEEAL 239
+V+LDD VWS ++ NL+ + Y YF GD S ++
Sbjct: 139 FPCDTSMVVVLDDRSDVWS-YSPNLVRIKPYEYFIG---TGDIHSPTKN----------- 183
Query: 240 ANVLRVLKTIHRLFF-DSVCGDVRTYLPKVRSEFSRDVLYFSAI 282
++LK IH+ F+ + GDV +P ++ R VL+ I
Sbjct: 184 ----KILKKIHQEFYKNKKEGDVTKIIPNMK----RQVLHHCII 219
>gi|342878347|gb|EGU79693.1| hypothetical protein FOXB_09806 [Fusarium oxysporum Fo5176]
Length = 769
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 46/162 (28%), Positives = 82/162 (50%), Gaps = 13/162 (8%)
Query: 66 ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSL--FQMANDKL--------- 114
+RKL LV++LD T++H ++ + + + + FQ+ +D
Sbjct: 156 QRKLSLVVDLDQTIIHACIEPTIGEWKNDPTNPNYEAVKDVRDFQLNDDGPRGLTSGCTY 215
Query: 115 -VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKD 173
+KLRP + FL++ S + ++++ TM TR YA K++D D K F +R+I+R++
Sbjct: 216 YIKLRPGLMEFLDEVSKMYELHVYTMGTRAYALNIAKIVDPDQKLFGNRVISRDENGSIT 275
Query: 174 RKN-PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
K+ L +VI+DD VW + NLI + Y +F+
Sbjct: 276 AKSLQRLFPVSTDMVVIIDDRADVWPMNRPNLIKVVPYDFFK 317
>gi|417412899|gb|JAA52807.1| Putative rna polymerase ii subunit a c-terminal domain phosphatase,
partial [Desmodus rotundus]
Length = 845
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 51/153 (33%), Positives = 81/153 (52%), Gaps = 14/153 (9%)
Query: 67 RKLQLVLNLDHTLLHC--RNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTF 124
RKL L+++LD TL+H ++ + +S+ K +H +G M + +LRP R F
Sbjct: 75 RKLVLMVDLDQTLIHTTEQHCQQMSN-----KGILHFQLGRGEPMLH---TRLRPHCRQF 126
Query: 125 LEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE---DFNGKDRKNPDLVR 181
LE+ + L ++++ T +R YA LD + K FS RI++R+ D K +L
Sbjct: 127 LEKVARLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECIDPFSKTGNLRNLFP 186
Query: 182 GQERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+ + I+DD E VW NLI + KYVYF+
Sbjct: 187 CGDSMVCIIDDREDVWK-FAPNLITVKKYVYFQ 218
>gi|315051428|ref|XP_003175088.1| RNA polymerase II subunit A domain phosphatase [Arthroderma gypseum
CBS 118893]
gi|311340403|gb|EFQ99605.1| RNA polymerase II subunit A domain phosphatase [Arthroderma gypseum
CBS 118893]
Length = 867
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 45/153 (29%), Positives = 78/153 (50%), Gaps = 12/153 (7%)
Query: 72 VLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSL--FQMANDKL--------VKLRPFV 121
V++LD T++H ++ ++ H + + FQ+ +D +KLRP +
Sbjct: 163 VVDLDQTIIHATVDPTVGEWQQDKDNPNHDAVKDVRCFQLVDDGPGMRGCWYYIKLRPGL 222
Query: 122 RTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKN-PDLV 180
FL+ S+L ++++ TM TR YA+ ++D D K F RI++R++ KN L
Sbjct: 223 EEFLKVISTLYELHIYTMGTRAYAQNVANIVDPDRKIFGDRILSRDESGSLTAKNLQRLF 282
Query: 181 RGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
+ +VI+DD VW +ENLI + Y +F
Sbjct: 283 PVDTKMVVIIDDRGDVWK-WSENLIKVTPYDFF 314
>gi|170036997|ref|XP_001846347.1| RNA polymerase II subunit A C-terminal domain phosphatase [Culex
quinquefasciatus]
gi|167879975|gb|EDS43358.1| RNA polymerase II subunit A C-terminal domain phosphatase [Culex
quinquefasciatus]
Length = 764
Score = 68.2 bits (165), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 54/212 (25%), Positives = 93/212 (43%), Gaps = 31/212 (14%)
Query: 27 CAHTTVRDSRCIFCS-------QAMNDSFGLSFDYMLRGLRYSEQ--------------E 65
C+HTTV C C QA + + + L+ +E+
Sbjct: 82 CSHTTVIKDMCADCGADLRQDEQAGGSEASVPMIHSVPELKVTEKLAKKLGQADTERLLR 141
Query: 66 ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFL 125
+RKL L+++LD TL+H N ++ + Q++ + +LRP FL
Sbjct: 142 DRKLVLLVDLDQTLIHTTNDNVPNNLKDVYHFQLYGPNSPWYH------TRLRPGALQFL 195
Query: 126 EQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED-FNGKDRKN--PDLVRG 182
+ ++++CT R YA + LD +YFS RI++R++ FN + + L
Sbjct: 196 AKMDPFYELHICTFGARNYAHMIAQFLDEKGRYFSHRILSRDECFNATSKTDNLKALFPC 255
Query: 183 QERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+ + I+DD E VW + NLI + Y +F+
Sbjct: 256 GDSMVCIIDDREDVW-NMAANLIQVKPYHFFQ 286
>gi|428183780|gb|EKX52637.1| hypothetical protein GUITHDRAFT_101798 [Guillardia theta CCMP2712]
Length = 749
Score = 68.2 bits (165), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 98/198 (49%), Gaps = 33/198 (16%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSF---IGSLFQMANDKLVKLRPFVRTFLEQ 127
LVL+LDHTLLH ++ E+ + + +H + L A KLRP +R FL +
Sbjct: 117 LVLDLDHTLLHTTLPRT--EMEEMIMQTLHEQCKDVHVLQVSAARYYTKLRPGIRNFLSE 174
Query: 128 ASSLVDIYLCT--MSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVR---- 181
S L ++Y+ T M ++ YAEA +LD + F RII+R+D+ ++ L +
Sbjct: 175 MSRLFELYIYTAGMGSQQYAEAVAHMLDESGRMFRGRIISRDDYTDVSLEHKKLDKVFPI 234
Query: 182 GQERG-IVILDDTESVWSDH--------TENLIVLGKYVYF-RD----------KELNG- 220
+ R ++ILDD W DH ENLI + KY ++ RD +E G
Sbjct: 235 DEHRALVIILDDNAETW-DHQYSDGRNSQENLIQVDKYSFWPRDLGEGHNPVAAREWQGA 293
Query: 221 DHKSYSETLTDESENEEA 238
+ S+S +L + + EEA
Sbjct: 294 ESSSFSWSLNEAQKQEEA 311
>gi|190346120|gb|EDK38128.2| hypothetical protein PGUG_02226 [Meyerozyma guilliermondii ATCC
6260]
Length = 732
Score = 68.2 bits (165), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 65/196 (33%), Positives = 96/196 (48%), Gaps = 29/196 (14%)
Query: 45 NDSFGL--SFDYMLRGLRYSEQE----ERKLQLVLNLDHTLLHCR------NIKSLSSGE 92
+DS GL SFD + L S E ERKL LV++LD T++H +S S
Sbjct: 89 HDSTGLKISFDEAAK-LEQSTSERLTSERKLILVVDLDQTVIHATVDPTVGEWQSDPSNP 147
Query: 93 KYLK-KQIHSFI----------GSLFQMANDK---LVKLRPFVRTFLEQASSLVDIYLCT 138
Y K + SF S +M K VK+RP + FL++ S L ++++ T
Sbjct: 148 NYRAVKDVRSFCLEEDPIAPPGWSGPKMTPTKCWYYVKVRPGLEDFLKRVSQLYEMHVYT 207
Query: 139 MSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKN-PDLVRGQERGIVILDDTESVW 197
M+TR YA A ++D D +YF RI++R++ KN L + +VI+DD VW
Sbjct: 208 MATRNYALAIAHIIDPDGRYFGDRILSRDESGSLTHKNLRRLFPVDQSMVVIIDDRGDVW 267
Query: 198 SDHTENLIVLGKYVYF 213
+NLI + Y +F
Sbjct: 268 Q-WEKNLIKVVPYEFF 282
>gi|448111257|ref|XP_004201796.1| Piso0_001998 [Millerozyma farinosa CBS 7064]
gi|359464785|emb|CCE88490.1| Piso0_001998 [Millerozyma farinosa CBS 7064]
Length = 830
Score = 68.2 bits (165), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 56/170 (32%), Positives = 83/170 (48%), Gaps = 22/170 (12%)
Query: 65 EERKLQLVLNLDHTLLHCR------NIKSLSSGEKYLK-KQIHSFIGSLFQMA-----ND 112
+E+KL LV++LD T++H +S S Y K + SF +A
Sbjct: 162 DEKKLILVVDLDQTVIHATVDPTVGEWQSDPSNPNYKAVKDVKSFCLEEESIAPLGWEGP 221
Query: 113 KL--------VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRII 164
KL VK+RP + FLEQ S L ++++ TM+TR YA K++D D KYF RI+
Sbjct: 222 KLPATKCWYYVKVRPGLEEFLEQISKLYEMHIYTMATRNYALEIAKIIDPDGKYFGDRIL 281
Query: 165 AREDFNGKDRKN-PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
+R++ KN L + + I+DD VW NLI + Y +F
Sbjct: 282 SRDESGSLTHKNLKRLFPVDQSMVAIIDDRGDVWQ-WENNLIKVVPYDFF 330
>gi|410076480|ref|XP_003955822.1| hypothetical protein KAFR_0B03910 [Kazachstania africana CBS 2517]
gi|372462405|emb|CCF56687.1| hypothetical protein KAFR_0B03910 [Kazachstania africana CBS 2517]
Length = 724
Score = 68.2 bits (165), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 60/234 (25%), Positives = 105/234 (44%), Gaps = 42/234 (17%)
Query: 21 CEQSLSCAHTTVRDSRCIFCSQAMNDS----FGLSF--DYMLRGLRYSEQE--------- 65
CE C H V C C + +++S FG +F + L+ S +E
Sbjct: 102 CEIKRPCNHDIVYGGLCTQCGKEVDESEQSQFGSNFTVSHTDTNLKISRKEALDIGEDFK 161
Query: 66 -----ERKLQLVLNLDHTLLHCR------NIKSLSSGEKY-LKKQIHSF------IGSLF 107
E+KL LV++LD T++HC KS + Y K + F +
Sbjct: 162 KRLRNEKKLVLVVDLDQTVIHCGVDPTIGEWKSDPNNPNYDTLKDVQMFALEEEPVLPFM 221
Query: 108 QMANDKL-------VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFS 160
M VK+RP ++ F ++ + L ++++ TM+TR YA K++D + F
Sbjct: 222 YMGPKPTPRKCWYYVKVRPGLKEFFKKVAPLFEMHIYTMATRAYALEITKIIDPTGELFG 281
Query: 161 SRIIAREDFNGKDRKNPD-LVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
+RI++R++ K+ + L + ++I+DD VW + + NLI + Y +F
Sbjct: 282 NRILSRDENGSLTSKSLERLFPTDQSMVIIIDDRGDVW-NWSPNLIKVVPYSFF 334
>gi|294868642|ref|XP_002765622.1| hypothetical protein Pmar_PMAR013688 [Perkinsus marinus ATCC 50983]
gi|239865701|gb|EEQ98339.1| hypothetical protein Pmar_PMAR013688 [Perkinsus marinus ATCC 50983]
Length = 956
Score = 68.2 bits (165), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 56/171 (32%), Positives = 81/171 (47%), Gaps = 20/171 (11%)
Query: 67 RKLQLVLNLDHTLLHCRN---------------IKSLSSGEKYLKKQIHSFIGSLFQMAN 111
++L VL++DHT+LH N + +G +K FIG+
Sbjct: 494 KRLVAVLDIDHTILHVTNKRIDLLFPDVTCYNLAPNRDTGRLDEEKVYQFFIGTSPTTTA 553
Query: 112 DKLVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSS--RIIAREDF 169
+KLRP TFLE+ L ++YL T TR YA +K LD ++YF S R+IAR
Sbjct: 554 CCYLKLRPGFYTFLEEILPLYELYLYTHGTREYAIRLLKALDPSARYFGSPPRLIARPTQ 613
Query: 170 NGKDRKN-PDLVRGQERGIVILDDTESVWS--DHTENLIVLGKYVYFRDKE 217
+ K + R VI+DD + VW D+ +LI + YV+F D E
Sbjct: 614 SALTCKTLSRIFPSNHRLAVIVDDRDDVWEAKDNEHSLIKVTPYVFFPDSE 664
>gi|209879341|ref|XP_002141111.1| NLI interacting factor-like phosphatase family protein
[Cryptosporidium muris RN66]
gi|209556717|gb|EEA06762.1| NLI interacting factor-like phosphatase family protein
[Cryptosporidium muris RN66]
Length = 590
Score = 68.2 bits (165), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 53/164 (32%), Positives = 77/164 (46%), Gaps = 22/164 (13%)
Query: 66 ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQ-----------MANDKL 114
+ KL +L+LD+TLLH N S + + FIG+ + M
Sbjct: 166 QNKLVAILDLDNTLLHAYN-----STKVGCNINLEDFIGANGEPEMYKFVLPQDMNTPYY 220
Query: 115 VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDR 174
+KLRP VR FL + + +CT +TR YA+ +LD F RI+ARE+ +G+D
Sbjct: 221 LKLRPGVREFLNTIAPYYIMGICTNATREYADVIRAVLDPKRDKFGDRIVARENVDGRDT 280
Query: 175 KNPDL----VRGQERGIVILDDTESVWSDHTENLIVLGK-YVYF 213
+ D + R IV+LDD VW E +V + Y YF
Sbjct: 281 QK-DFKKICIGIDTRAIVLLDDRSDVWDSSLEIQVVKAQTYEYF 323
>gi|328874143|gb|EGG22509.1| hypothetical protein DFA_04637 [Dictyostelium fasciculatum]
Length = 397
Score = 68.2 bits (165), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 49/174 (28%), Positives = 80/174 (45%), Gaps = 24/174 (13%)
Query: 64 QEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRT 123
++E K+ L++N+DH L H K+ S E Q S I + +N VK RP+ T
Sbjct: 53 KDEHKMNLIINIDHILFHS--TKNPESNET----QGESVIKCVVDESNTYYVKFRPYAAT 106
Query: 124 FLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDF-------------- 169
FL+ L ++ L ++ ++ Y ++LLDL++ F +II+RE F
Sbjct: 107 FLQSLQPLFNLILFSLYSKSYVFKLIELLDLNNNIF-KQIISRESFGESLPKQQVGKPYA 165
Query: 170 --NGKDRKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF-RDKELNG 220
N + ILDD E +W +NLI ++ YF ++ + NG
Sbjct: 166 LWNTPSHFTKIFKISAHESLAILDDREDIWRQFRDNLISPERFTYFTKEDDENG 219
>gi|431907029|gb|ELK11148.1| RNA polymerase II subunit A C-terminal domain phosphatase [Pteropus
alecto]
Length = 918
Score = 67.8 bits (164), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 51/153 (33%), Positives = 81/153 (52%), Gaps = 14/153 (9%)
Query: 67 RKLQLVLNLDHTLLHC--RNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTF 124
RKL L+++LD TL+H ++ + +S+ K +H +G M + +LRP R F
Sbjct: 154 RKLVLMVDLDQTLIHTTEQHCQRMSN-----KGILHFQLGRGEPMLH---TRLRPHCREF 205
Query: 125 LEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE---DFNGKDRKNPDLVR 181
LE+ + L ++++ T +R YA LD + K FS RI++R+ D K +L
Sbjct: 206 LEKVARLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECIDPFSKTGNLRNLFP 265
Query: 182 GQERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+ + I+DD E VW NLI + KYVYF+
Sbjct: 266 CGDSMVCIIDDREDVWK-FAPNLITVKKYVYFQ 297
>gi|154284394|ref|XP_001542992.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
gi|150406633|gb|EDN02174.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
Length = 654
Score = 67.8 bits (164), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 45/153 (29%), Positives = 78/153 (50%), Gaps = 12/153 (7%)
Query: 72 VLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSL--FQMANDKL--------VKLRPFV 121
V++LD T++H +++ ++ H + + FQ+ +D +KLRP +
Sbjct: 89 VVDLDQTIIHATVDPTVAEWQQDKDNPNHEAVKDVRAFQLVDDGPGMKGCWYYIKLRPGL 148
Query: 122 RTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKN-PDLV 180
FL S+L ++++ TM TR YA+ ++D D K F RI++R++ KN L
Sbjct: 149 EEFLRNISTLFELHIYTMGTRAYAQHIASIVDPDRKIFGDRILSRDESGSLTAKNLQRLF 208
Query: 181 RGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
+ +VI+DD VW T+NLI + Y +F
Sbjct: 209 PVDTKMVVIIDDRGDVWK-WTDNLIKVVPYDFF 240
>gi|448097224|ref|XP_004198617.1| Piso0_001998 [Millerozyma farinosa CBS 7064]
gi|359380039|emb|CCE82280.1| Piso0_001998 [Millerozyma farinosa CBS 7064]
Length = 830
Score = 67.8 bits (164), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 56/170 (32%), Positives = 83/170 (48%), Gaps = 22/170 (12%)
Query: 65 EERKLQLVLNLDHTLLHCR------NIKSLSSGEKYLK-KQIHSFIGSLFQMA-----ND 112
EE+KL LV++LD T++H +S S Y K + SF +A
Sbjct: 162 EEKKLILVVDLDQTVIHATVDPTVGEWQSDPSNPNYKAVKDVKSFCLEEESIAPLGWEGP 221
Query: 113 KL--------VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRII 164
KL VK+RP + FLEQ S L ++++ TM+TR YA K++D + KYF RI+
Sbjct: 222 KLPATKCWYYVKVRPGLEQFLEQISKLYEMHIYTMATRNYALEIAKIIDPNGKYFGDRIL 281
Query: 165 AREDFNGKDRKN-PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
+R++ KN L + + I+DD VW NLI + Y +F
Sbjct: 282 SRDESGSLTHKNLKRLFPVDQSMVAIIDDRGDVWQ-WENNLIKVVPYDFF 330
>gi|239606973|gb|EEQ83960.1| RNA Polymerase II CTD phosphatase Fcp1 [Ajellomyces dermatitidis
ER-3]
Length = 901
Score = 67.8 bits (164), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 45/153 (29%), Positives = 79/153 (51%), Gaps = 12/153 (7%)
Query: 72 VLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSL--FQMANDKL--------VKLRPFV 121
V++LD T++H +++ ++ H + + FQ+ +D +KLRP +
Sbjct: 134 VVDLDQTIIHATVDPTVAEWQQDKDNPNHEAVKDVRAFQLVDDGPGMRGCWYYIKLRPGL 193
Query: 122 RTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKN-PDLV 180
FL + S+L ++++ TM TR YA+ ++D D K F RI++R++ KN L
Sbjct: 194 EEFLREISTLFELHIYTMGTRAYAQHIANIVDPDRKIFGDRILSRDESGSLTAKNLQRLF 253
Query: 181 RGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
+ +VI+DD VW T+NLI + Y +F
Sbjct: 254 PVDTKMVVIIDDRGDVWK-WTDNLIKVLPYDFF 285
>gi|225556539|gb|EEH04827.1| RNA polymerase II C-terminal domain phosphatase component
[Ajellomyces capsulatus G186AR]
Length = 871
Score = 67.8 bits (164), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 45/153 (29%), Positives = 78/153 (50%), Gaps = 12/153 (7%)
Query: 72 VLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSL--FQMANDKL--------VKLRPFV 121
V++LD T++H +++ ++ H + + FQ+ +D +KLRP +
Sbjct: 134 VVDLDQTIIHATVDPTVAEWQQDKDNPNHEAVKDVRAFQLVDDGPGMKGCWYYIKLRPGL 193
Query: 122 RTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKN-PDLV 180
FL S+L ++++ TM TR YA+ ++D D K F RI++R++ KN L
Sbjct: 194 EEFLRNISTLFELHIYTMGTRAYAQHIASIVDPDRKIFGDRILSRDESGSLTAKNLQRLF 253
Query: 181 RGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
+ +VI+DD VW T+NLI + Y +F
Sbjct: 254 PVDTKMVVIIDDRGDVWK-WTDNLIKVVPYDFF 285
>gi|327358124|gb|EGE86981.1| RNA Polymerase II CTD phosphatase Fcp1 [Ajellomyces dermatitidis
ATCC 18188]
Length = 839
Score = 67.8 bits (164), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 45/153 (29%), Positives = 79/153 (51%), Gaps = 12/153 (7%)
Query: 72 VLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSL--FQMANDKL--------VKLRPFV 121
V++LD T++H +++ ++ H + + FQ+ +D +KLRP +
Sbjct: 62 VVDLDQTIIHATVDPTVAEWQQDKDNPNHEAVKDVRAFQLVDDGPGMRGCWYYIKLRPGL 121
Query: 122 RTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKN-PDLV 180
FL + S+L ++++ TM TR YA+ ++D D K F RI++R++ KN L
Sbjct: 122 EEFLREISTLFELHIYTMGTRAYAQHIANIVDPDRKIFGDRILSRDESGSLTAKNLQRLF 181
Query: 181 RGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
+ +VI+DD VW T+NLI + Y +F
Sbjct: 182 PVDTKMVVIIDDRGDVWK-WTDNLIKVLPYDFF 213
>gi|294935258|ref|XP_002781353.1| hypothetical protein Pmar_PMAR020737 [Perkinsus marinus ATCC 50983]
gi|239891934|gb|EER13148.1| hypothetical protein Pmar_PMAR020737 [Perkinsus marinus ATCC 50983]
Length = 979
Score = 67.8 bits (164), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 56/171 (32%), Positives = 81/171 (47%), Gaps = 20/171 (11%)
Query: 67 RKLQLVLNLDHTLLHCRN---------------IKSLSSGEKYLKKQIHSFIGSLFQMAN 111
++L VL++DHT+LH N + +G +K FIG+
Sbjct: 517 KRLVAVLDIDHTILHVTNKRIDLLFPDVTCYNLAPNRDTGRLDEEKVYQFFIGTSPTTTA 576
Query: 112 DKLVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSS--RIIAREDF 169
+KLRP TFLE+ L ++YL T TR YA +K LD ++YF S R+IAR
Sbjct: 577 CCYLKLRPGFYTFLEEILPLYELYLYTHGTREYAIRLLKALDPSARYFGSPPRLIARPTQ 636
Query: 170 NGKDRKN-PDLVRGQERGIVILDDTESVWS--DHTENLIVLGKYVYFRDKE 217
+ K + R VI+DD + VW D+ +LI + YV+F D E
Sbjct: 637 SALTCKTLSRIFPSNHRLAVIVDDRDDVWEAKDNEHSLIKVTPYVFFPDSE 687
>gi|354479392|ref|XP_003501894.1| PREDICTED: RNA polymerase II subunit A C-terminal domain
phosphatase [Cricetulus griseus]
Length = 978
Score = 67.8 bits (164), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 50/150 (33%), Positives = 75/150 (50%), Gaps = 10/150 (6%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLE 126
RKL L+++LD TL+H + K + H +G M + +LRP R FLE
Sbjct: 192 RKLVLMVDLDQTLIHTTEQQCPQMSNKGI---FHFQLGRGEPMLH---TRLRPHCRDFLE 245
Query: 127 QASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE---DFNGKDRKNPDLVRGQ 183
+ + L ++++ T +R YA LD + K FS RI++R+ D K +L
Sbjct: 246 KIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECIDPFSKTGNLRNLFPCG 305
Query: 184 ERGIVILDDTESVWSDHTENLIVLGKYVYF 213
+ + I+DD E VW NLI + KYVYF
Sbjct: 306 DSMVCIIDDREDVWK-FAPNLITVKKYVYF 334
>gi|296419837|ref|XP_002839498.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295635659|emb|CAZ83689.1| unnamed protein product [Tuber melanosporum]
Length = 896
Score = 67.8 bits (164), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 59/229 (25%), Positives = 102/229 (44%), Gaps = 50/229 (21%)
Query: 27 CAHTTVRDSRCIFCSQAM-------------------NDSFGLSFDYMLRGLRYSEQEER 67
C+H C C Q M +DS GL+ R E+ +R
Sbjct: 94 CSHEVQFAGLCSMCGQDMTLLDHGHFSNKDRATIHMVHDSMGLTV-SQDEATRLEEETKR 152
Query: 68 ------KLQLVLNLDHTLLH---------------CRNIKSLSSGEKY-LKKQIHSFIGS 105
KL LV++LD T++H C N +S+ + + L + I G+
Sbjct: 153 RLLKSKKLSLVVDLDQTIIHATVDPTVGDWKNDPFCINHESVKDVQAFKLDEDIIGGRGT 212
Query: 106 LFQMANDKLVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIA 165
+ VK+RP ++ FLE S L ++++ TM TR YA + K++D D + F R+++
Sbjct: 213 WY------YVKMRPGLKEFLEHISQLYELHIYTMGTRAYAMSVKKIVDPDGRIFGERVLS 266
Query: 166 REDFNGKDRKNPDLV-RGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
R++ +K+ + + +VI+DD VW ++NL+ + Y +F
Sbjct: 267 RDESGSMTQKSLHRIFPVDTKMVVIIDDRGDVWK-WSDNLVKVRPYDFF 314
>gi|281206665|gb|EFA80851.1| putative tfiif-interacting component of the c-terminal domain
phosphatase [Polysphondylium pallidum PN500]
Length = 881
Score = 67.8 bits (164), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 52/162 (32%), Positives = 80/162 (49%), Gaps = 27/162 (16%)
Query: 66 ERKLQLVLNLDHTLLHC------------RNIKSLSSGEKYLKKQIHSFIGSLFQMANDK 113
+RKL LVL++DHT++H RNI K+ I S + N K
Sbjct: 274 QRKLSLVLDIDHTIIHAIMEPHFMEVPYWRNIDCE-------KENIRSIT-----LGNMK 321
Query: 114 L-VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGK 172
+KLRPF+ FLE + ++++ TM TR YA KL+D + F RI++R+D
Sbjct: 322 YYIKLRPFLYKFLEDVNKKFELHIYTMGTRNYALEIAKLIDEKQELFKERILSRDDTTDM 381
Query: 173 DRKN-PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
K L + ++I+DD VW ++NL+ + Y+YF
Sbjct: 382 SFKTLQRLFPCDDSMVLIVDDRSDVWK-RSKNLVQISPYLYF 422
>gi|449493392|ref|XP_002190004.2| PREDICTED: RNA polymerase II subunit A C-terminal domain
phosphatase [Taeniopygia guttata]
Length = 871
Score = 67.8 bits (164), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 51/153 (33%), Positives = 80/153 (52%), Gaps = 14/153 (9%)
Query: 67 RKLQLVLNLDHTLLHC--RNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTF 124
RKL L+++LD TL+H ++ + +S+ K H +G M + +LRP + F
Sbjct: 62 RKLVLMVDLDQTLIHTTEQHCQQMSN-----KGIFHFQLGRGEPMLH---TRLRPHCKEF 113
Query: 125 LEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE---DFNGKDRKNPDLVR 181
LE+ + L ++++ T +R YA LD + K FS RI++R+ D K DL
Sbjct: 114 LEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECIDPFSKTGNLRDLFP 173
Query: 182 GQERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+ + I+DD E VW NLI + KYVYF+
Sbjct: 174 CGDSMVCIIDDREDVWK-FAPNLITVKKYVYFQ 205
>gi|453084575|gb|EMF12619.1| hypothetical protein SEPMUDRAFT_149240 [Mycosphaerella populorum
SO2202]
Length = 848
Score = 67.8 bits (164), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 45/157 (28%), Positives = 80/157 (50%), Gaps = 11/157 (7%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSL--FQMANDK-------LVKL 117
R+L LV++LD T++H S+ + + + + FQ+ +D +K
Sbjct: 160 RRLSLVVDLDQTIIHACVDPSIGEWQNDPSNPNYDALRDVQAFQLRDDNKPVATWYYIKQ 219
Query: 118 RPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAR-EDFNGKDRKN 176
RP +++FL+ S L ++++ TM TR YAE K++D D + F RI+ R E + K++
Sbjct: 220 RPGLQSFLKGLSELYEMHIYTMGTRTYAEGVAKIIDPDGRVFGDRIVTRTESGSDKEKSL 279
Query: 177 PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
L + +VI+DD VW NL+ + + +F
Sbjct: 280 KRLFPTDSKMVVIIDDRADVWR-WISNLVKVNVFEFF 315
>gi|261194090|ref|XP_002623450.1| RNA Polymerase II CTD phosphatase Fcp1 [Ajellomyces dermatitidis
SLH14081]
gi|239588464|gb|EEQ71107.1| RNA Polymerase II CTD phosphatase Fcp1 [Ajellomyces dermatitidis
SLH14081]
Length = 901
Score = 67.8 bits (164), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 45/153 (29%), Positives = 79/153 (51%), Gaps = 12/153 (7%)
Query: 72 VLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSL--FQMANDKL--------VKLRPFV 121
V++LD T++H +++ ++ H + + FQ+ +D +KLRP +
Sbjct: 134 VVDLDQTIIHATVDPTVAEWQQDKDNPNHEAVKDVRAFQLVDDGPGMRGCWYYIKLRPGL 193
Query: 122 RTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKN-PDLV 180
FL + S+L ++++ TM TR YA+ ++D D K F RI++R++ KN L
Sbjct: 194 EEFLREISTLFELHIYTMGTRAYAQHIANIVDPDRKIFGDRILSRDESGSLTAKNLQRLF 253
Query: 181 RGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
+ +VI+DD VW T+NLI + Y +F
Sbjct: 254 PVDTKMVVIIDDRGDVWK-WTDNLIKVLPYDFF 285
>gi|255936731|ref|XP_002559392.1| Pc13g09690 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211584012|emb|CAP92038.1| Pc13g09690 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 792
Score = 67.8 bits (164), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 55/204 (26%), Positives = 94/204 (46%), Gaps = 24/204 (11%)
Query: 26 SCAHTTVRDSRCIFCSQAMNDS---FGLSFDYMLRGLRYSEQEERKLQLVLNLDHTLLHC 82
CAH C C + M D+ + D R L R+L LV++LD T++H
Sbjct: 92 PCAHEIQFGGLCAECGKDMTDAREATRVEEDAKRRLLA-----SRRLTLVVDLDQTIIHA 146
Query: 83 RNIKSLSSGEKYLKKQIHSFIGSL--FQMANDKL--------VKLRPFVRTFLEQASSLV 132
++ + + H + + FQ+ +D +KLRP + FL+ + +
Sbjct: 147 TVDPTVGEWREDKQNPNHEAVRDVRQFQLIDDGPGMRGCWYYIKLRPGLEEFLQNVAEIY 206
Query: 133 DIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVR---GQERGIVI 189
++++ TM TR YA+ V ++D K F RI++R++ K DL R + +VI
Sbjct: 207 ELHIYTMGTRAYAQHIVDIIDPTRKLFGDRILSRDESGSLTVK--DLQRLFPVDTKMVVI 264
Query: 190 LDDTESVWSDHTENLIVLGKYVYF 213
+DD +W + NLI + Y +F
Sbjct: 265 IDDRGDIWR-WSPNLIKVSPYDFF 287
>gi|171680434|ref|XP_001905162.1| hypothetical protein [Podospora anserina S mat+]
gi|170939844|emb|CAP65069.1| unnamed protein product [Podospora anserina S mat+]
Length = 835
Score = 67.4 bits (163), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 46/165 (27%), Positives = 85/165 (51%), Gaps = 18/165 (10%)
Query: 65 EERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSL-----FQMANDK------ 113
E RKL LV++LD T++ + GE ++K + S+ FQ+ +
Sbjct: 161 ESRKLSLVVDLDQTVIQA--CIDPTVGE-WMKDPTNPNYDSVKNVKTFQLDDGPHAVVRK 217
Query: 114 ---LVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAR-EDF 169
+K+RP + FL++ S++ ++++ TM TR YA+ +++D + K F +R+I+R E+
Sbjct: 218 CWYYIKMRPGLEGFLKRISTMYELHVYTMGTRAYAQNVARVIDPEKKLFGNRVISRDENG 277
Query: 170 NGKDRKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
N + L +VI+DD VW + NL+ + Y +F+
Sbjct: 278 NMYSKSLQRLFPVSTNMVVIIDDRSDVWPHNRPNLVKVTPYEFFK 322
>gi|344242866|gb|EGV98969.1| hypothetical protein I79_008270 [Cricetulus griseus]
Length = 848
Score = 67.4 bits (163), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 50/150 (33%), Positives = 75/150 (50%), Gaps = 10/150 (6%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLE 126
RKL L+++LD TL+H + K + H +G M + +LRP R FLE
Sbjct: 62 RKLVLMVDLDQTLIHTTEQQCPQMSNKGI---FHFQLGRGEPMLH---TRLRPHCRDFLE 115
Query: 127 QASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE---DFNGKDRKNPDLVRGQ 183
+ + L ++++ T +R YA LD + K FS RI++R+ D K +L
Sbjct: 116 KIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECIDPFSKTGNLRNLFPCG 175
Query: 184 ERGIVILDDTESVWSDHTENLIVLGKYVYF 213
+ + I+DD E VW NLI + KYVYF
Sbjct: 176 DSMVCIIDDREDVWK-FAPNLITVKKYVYF 204
>gi|325087549|gb|EGC40859.1| RNA polymerase II C-terminal domain phosphatase component
[Ajellomyces capsulatus H88]
Length = 885
Score = 67.4 bits (163), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 45/153 (29%), Positives = 78/153 (50%), Gaps = 12/153 (7%)
Query: 72 VLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSL--FQMANDKL--------VKLRPFV 121
V++LD T++H +++ ++ H + + FQ+ +D +KLRP +
Sbjct: 134 VVDLDQTIIHATVDPTVAEWQQDKDNPNHEAVKDVRAFQLVDDGPGMKGCWYYIKLRPGL 193
Query: 122 RTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKN-PDLV 180
FL S+L ++++ TM TR YA+ ++D D K F RI++R++ KN L
Sbjct: 194 EEFLRNISTLFELHIYTMGTRAYAQHIASIVDPDRKIFGDRILSRDESGSLTAKNLQRLF 253
Query: 181 RGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
+ +VI+DD VW T+NLI + Y +F
Sbjct: 254 PVDTKMVVIIDDRGDVWK-WTDNLIKVVPYDFF 285
>gi|403222664|dbj|BAM40795.1| uncharacterized protein TOT_030000057 [Theileria orientalis strain
Shintoku]
Length = 656
Score = 67.4 bits (163), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 50/165 (30%), Positives = 79/165 (47%), Gaps = 23/165 (13%)
Query: 65 EERKLQLVLNLDHTLLHCRNIKSLS---------SGEKYLKKQIH-----SFIGSLFQMA 110
E+RKL LVL+LD+TL+H + + S LK ++ S+ S F
Sbjct: 194 EDRKLCLVLDLDNTLVHATSQSPPADIDVETIEISSSSVLKTIVYNETETSYCNSFF--- 250
Query: 111 NDKLVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFN 170
KLRP + F S ++L TM TR +A++A+++LD YF +R+ R D
Sbjct: 251 -----KLRPGIFKFFRSVSKRYKLFLFTMGTRQHAQSALRILDPQGVYFGNRVFCRNDSR 305
Query: 171 GKDRKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRD 215
+ L + ++++DD+E VW+ LI + Y YF D
Sbjct: 306 SCMKSLDRLFPNHKNLVLVMDDSEYVWTSKLA-LIKVHPYYYFSD 349
>gi|241953831|ref|XP_002419637.1| RNA polymerase II subunit a c-terminal domain phosphatase, putative
[Candida dubliniensis CD36]
gi|223642977|emb|CAX43233.1| RNA polymerase II subunit a c-terminal domain phosphatase, putative
[Candida dubliniensis CD36]
Length = 771
Score = 67.4 bits (163), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 66/236 (27%), Positives = 105/236 (44%), Gaps = 51/236 (21%)
Query: 26 SCAHTTVRDSRCIFCSQAMNDSFGLS-FDYMLR----------GLRYSEQE--------- 65
+C HT C C +++ + S ++Y R GL+ S E
Sbjct: 98 ACPHTVQYSGLCALCGKSLEEEKDYSGYNYEDRATIEMSHDNTGLKISFDEAAKIEHNTT 157
Query: 66 -----ERKLQLVLNLDHTLLHCR------NIKSLSSGEKYLK-KQIHSF----------- 102
ERKL LV++LD T++H +S + Y K + +F
Sbjct: 158 DRLIDERKLILVVDLDQTVIHATVDPTVGEWQSDPANPNYAAVKDVKTFCLEEEAIVPPG 217
Query: 103 -IGSLFQMANDK---LVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKY 158
G ++A K VKLRP + FLE+ + ++++ TM+TR YA + K++D D KY
Sbjct: 218 WTGP--KLAPTKCTYYVKLRPGLSEFLEKMAEKYEMHIYTMATRNYALSIAKIIDPDGKY 275
Query: 159 FSSRIIAREDFNGKDRKN-PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
F RI++R++ KN L + +VI+DD VW + NLI + Y +F
Sbjct: 276 FGDRILSRDESGSLTHKNLKRLFPVDQSMVVIIDDRGDVWQWES-NLIKVVPYDFF 330
>gi|346975758|gb|EGY19210.1| RNA polymerase II subunit A C-terminal domain phosphatase
[Verticillium dahliae VdLs.17]
Length = 818
Score = 67.4 bits (163), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 47/163 (28%), Positives = 82/163 (50%), Gaps = 15/163 (9%)
Query: 66 ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSL--FQMANDKL--------- 114
+RKL LV++LD T++H ++ + + + + FQ+ ++
Sbjct: 160 QRKLSLVVDLDQTIIHACIEPTVGEWMNDPENPNYDAVKDVEKFQLNDEGPRGVTQGCWY 219
Query: 115 -VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGK- 172
+K+RP +R FLE+ + L ++++ TM TR YA K++D K F +R+I+R D NG
Sbjct: 220 YIKMRPGLREFLEKVAELYELHVYTMGTRAYALNIAKIVDPQQKLFGNRVISR-DENGSI 278
Query: 173 -DRKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+ L +VI+DD VW + NLI + Y +F+
Sbjct: 279 TSKSLQRLFPVSTNMVVIIDDRADVWPRNRPNLIKVVPYDFFK 321
>gi|380022133|ref|XP_003694908.1| PREDICTED: LOW QUALITY PROTEIN: RNA polymerase II subunit A
C-terminal domain phosphatase-like [Apis florea]
Length = 749
Score = 67.4 bits (163), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 46/152 (30%), Positives = 73/152 (48%), Gaps = 10/152 (6%)
Query: 66 ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFL 125
+RKL L+++LD T++H N + + Q++ + +LRP R FL
Sbjct: 151 DRKLALLVDLDQTIVHTTNDNVPPNMKDVYHYQLYGPNSPWYH------TRLRPNTRHFL 204
Query: 126 EQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVRGQER 185
+ S L ++++CT R YA LLD D FS RI++R++ K +L
Sbjct: 205 SEMSRLYELHICTFGARNYAHTVAALLDKDGTLFSHRILSRDECFDPASKTANLKALFPC 264
Query: 186 G---IVILDDTESVWSDHTENLIVLGKYVYFR 214
G + I+DD E VW NL+ + Y +FR
Sbjct: 265 GDDLVCIIDDREDVWQG-CGNLVQVKPYHFFR 295
>gi|363730338|ref|XP_418905.3| PREDICTED: RNA polymerase II subunit A C-terminal domain
phosphatase [Gallus gallus]
Length = 958
Score = 67.4 bits (163), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 50/151 (33%), Positives = 75/151 (49%), Gaps = 10/151 (6%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLE 126
RKL L+++LD TL+H K + H +G M + +LRP + FLE
Sbjct: 146 RKLVLMVDLDQTLIHTTEQHCQQMSNKGI---FHFQLGRGEPMLH---TRLRPHCKEFLE 199
Query: 127 QASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE---DFNGKDRKNPDLVRGQ 183
+ + L ++++ T +R YA LD + K FS RI++R+ D K DL
Sbjct: 200 KIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECIDPFSKTGNLRDLFPCG 259
Query: 184 ERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+ + I+DD E VW NLI + KYVYF+
Sbjct: 260 DSMVCIIDDREDVWK-FAPNLITVKKYVYFQ 289
>gi|68472089|ref|XP_719840.1| potential RNA Pol II CTD phosphatase component [Candida albicans
SC5314]
gi|68472324|ref|XP_719723.1| potential RNA Pol II CTD phosphatase component [Candida albicans
SC5314]
gi|46441553|gb|EAL00849.1| potential RNA Pol II CTD phosphatase component [Candida albicans
SC5314]
gi|46441679|gb|EAL00974.1| potential RNA Pol II CTD phosphatase component [Candida albicans
SC5314]
Length = 768
Score = 67.4 bits (163), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 66/236 (27%), Positives = 104/236 (44%), Gaps = 51/236 (21%)
Query: 26 SCAHTTVRDSRCIFCSQAMNDSFGLS-FDYMLR----------GLRYSEQE--------- 65
+C HT C C +++ + S ++Y R GL+ S E
Sbjct: 98 ACPHTVQYSGLCALCGKSLEEEKDYSGYNYEDRATIEMSHDNTGLKISFDEAAKIEHNTT 157
Query: 66 -----ERKLQLVLNLDHTLLHCR------NIKSLSSGEKYLK-KQIHSF----------- 102
ERKL LV++LD T++H +S + Y K + +F
Sbjct: 158 DRLIDERKLILVVDLDQTVIHATVDPTVGEWQSDPANPNYAAVKDVKTFCLEEEAIVPPG 217
Query: 103 -IGSLFQMANDK---LVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKY 158
G ++A K VKLRP + FLE+ + ++++ TM+TR YA + K++D D KY
Sbjct: 218 WTGP--KLAPTKCTYYVKLRPGLSEFLEKMAEKYEMHIYTMATRNYALSIAKIIDPDGKY 275
Query: 159 FSSRIIAREDFNGKDRKN-PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
F RI++R++ KN L + +VI+DD VW NLI + Y +F
Sbjct: 276 FGDRILSRDESGSLTHKNLKRLFPVDQSMVVIIDDRGDVWQ-WESNLIKVVPYDFF 330
>gi|302404507|ref|XP_003000091.1| RNA polymerase II subunit A C-terminal domain phosphatase
[Verticillium albo-atrum VaMs.102]
gi|261361273|gb|EEY23701.1| RNA polymerase II subunit A C-terminal domain phosphatase
[Verticillium albo-atrum VaMs.102]
Length = 755
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 47/163 (28%), Positives = 82/163 (50%), Gaps = 15/163 (9%)
Query: 66 ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSL--FQMANDKL--------- 114
+RKL LV++LD T++H ++ + + + + FQ+ ++
Sbjct: 160 QRKLSLVVDLDQTIIHACIEPTVGEWMNDPENPNYDAVKDVQKFQLNDEGPRGVTQGCWY 219
Query: 115 -VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGK- 172
+K+RP +R FLE+ + L ++++ TM TR YA K++D K F +R+I+R D NG
Sbjct: 220 YIKMRPGLREFLERVAELYELHVYTMGTRAYALNIAKIVDPQQKLFGNRVISR-DENGSI 278
Query: 173 -DRKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+ L +VI+DD VW + NLI + Y +F+
Sbjct: 279 TSKSLQRLFPVSTNMVVIIDDRADVWPRNRPNLIKVVPYDFFK 321
>gi|383859141|ref|XP_003705055.1| PREDICTED: RNA polymerase II subunit A C-terminal domain
phosphatase-like isoform 2 [Megachile rotundata]
Length = 759
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 46/152 (30%), Positives = 74/152 (48%), Gaps = 10/152 (6%)
Query: 66 ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFL 125
+RKL L+++LD T++H N + + Q++ + +LRP R FL
Sbjct: 150 DRKLALLVDLDQTIVHTTNDNIPPNMKDVYHYQLYGPNSPWYH------TRLRPNTRHFL 203
Query: 126 EQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVRGQER 185
+ S L ++++CT R YA LLD D FS+RI++R++ K +L
Sbjct: 204 SEMSRLYELHICTFGARNYAHTVASLLDKDGILFSNRILSRDECFDPASKTANLKALFPC 263
Query: 186 G---IVILDDTESVWSDHTENLIVLGKYVYFR 214
G + I+DD E VW NL+ + Y +FR
Sbjct: 264 GDDLVCIIDDREDVWQG-CGNLVQVKPYHFFR 294
>gi|326916917|ref|XP_003204751.1| PREDICTED: RNA polymerase II subunit A C-terminal domain
phosphatase-like [Meleagris gallopavo]
Length = 1003
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 50/151 (33%), Positives = 75/151 (49%), Gaps = 10/151 (6%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLE 126
RKL L+++LD TL+H K + H +G M + +LRP + FLE
Sbjct: 192 RKLVLMVDLDQTLIHTTEQHCQQMSNKGI---FHFQLGRGEPMLH---TRLRPHCKEFLE 245
Query: 127 QASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE---DFNGKDRKNPDLVRGQ 183
+ + L ++++ T +R YA LD + K FS RI++R+ D K DL
Sbjct: 246 KIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECIDPFSKTGNLRDLFPCG 305
Query: 184 ERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+ + I+DD E VW NLI + KYVYF+
Sbjct: 306 DSMVCIIDDREDVWK-FAPNLITVKKYVYFQ 335
>gi|367032510|ref|XP_003665538.1| hypothetical protein MYCTH_2309412 [Myceliophthora thermophila ATCC
42464]
gi|347012809|gb|AEO60293.1| hypothetical protein MYCTH_2309412 [Myceliophthora thermophila ATCC
42464]
Length = 913
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 44/161 (27%), Positives = 80/161 (49%), Gaps = 14/161 (8%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSL--FQMANDKL---------V 115
RKL LV++LD T++ ++ +K H + FQ+ + +
Sbjct: 161 RKLSLVVDLDQTIIQACIDPTVGEWQKDPTNPNHELAKEVKSFQLDDGPTDLARRCWYYI 220
Query: 116 KLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGK--D 173
K+RP ++ FL++ + + ++++ TM TR YA+ +++D D K F +R+I+R D NG
Sbjct: 221 KMRPGLQDFLKRIAEMYELHVYTMGTRAYAQNVARVVDPDKKLFGNRVISR-DENGNIFA 279
Query: 174 RKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+ L + I+DD VW + NLI + Y +F+
Sbjct: 280 KSLHRLFPVSTHMVAIIDDRSDVWPRNRPNLIKVSPYEFFK 320
>gi|328792425|ref|XP_623605.2| PREDICTED: RNA polymerase II subunit A C-terminal domain
phosphatase-like [Apis mellifera]
Length = 745
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 46/152 (30%), Positives = 73/152 (48%), Gaps = 10/152 (6%)
Query: 66 ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFL 125
+RKL L+++LD T++H N + + Q++ + +LRP R FL
Sbjct: 151 DRKLALLVDLDQTIVHTTNDNVPPNMKDVYHYQLYGPNSPWYH------TRLRPNTRHFL 204
Query: 126 EQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVRGQER 185
+ S L ++++CT R YA LLD D FS RI++R++ K +L
Sbjct: 205 SEMSRLYELHICTFGARNYAHTVAALLDKDGTLFSHRILSRDECFDPASKTANLKALFPC 264
Query: 186 G---IVILDDTESVWSDHTENLIVLGKYVYFR 214
G + I+DD E VW NL+ + Y +FR
Sbjct: 265 GDDLVCIIDDREDVWQG-CGNLVQVKPYHFFR 295
>gi|328872613|gb|EGG20980.1| putative tfiif-interacting component of the c-terminal domain
phosphatase [Dictyostelium fasciculatum]
Length = 757
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 49/158 (31%), Positives = 80/158 (50%), Gaps = 15/158 (9%)
Query: 65 EERKLQLVLNLDHTLLHCRNIKSLSSGEKYL-----KKQIHSFIGSLFQMANDKLVKLRP 119
+ +KL LVL+LDHT++H + + K IH I + Q +KLRP
Sbjct: 203 DNKKLSLVLDLDHTIIHAIMEQHFMEVPYWRTIDRKKSNIHEIILNGNQRY---FIKLRP 259
Query: 120 FVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE----DFNGKDRK 175
+ FL + + L ++++ TM TR YA+ L+D + F R+++R+ D N K K
Sbjct: 260 HLYEFLREVNRLFELHIYTMGTRNYAQKIASLVDPKQRVFKERVLSRDDTPNDMNHKTLK 319
Query: 176 NPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
L + ++I+DD VW ++NLI + Y+YF
Sbjct: 320 R--LFPCDDSMVLIVDDRSDVWKK-SKNLIQIVPYLYF 354
>gi|383859139|ref|XP_003705054.1| PREDICTED: RNA polymerase II subunit A C-terminal domain
phosphatase-like isoform 1 [Megachile rotundata]
Length = 760
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 46/152 (30%), Positives = 74/152 (48%), Gaps = 10/152 (6%)
Query: 66 ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFL 125
+RKL L+++LD T++H N + + Q++ + +LRP R FL
Sbjct: 150 DRKLALLVDLDQTIVHTTNDNIPPNMKDVYHYQLYGPNSPWYH------TRLRPNTRHFL 203
Query: 126 EQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVRGQER 185
+ S L ++++CT R YA LLD D FS+RI++R++ K +L
Sbjct: 204 SEMSRLYELHICTFGARNYAHTVASLLDKDGILFSNRILSRDECFDPASKTANLKALFPC 263
Query: 186 G---IVILDDTESVWSDHTENLIVLGKYVYFR 214
G + I+DD E VW NL+ + Y +FR
Sbjct: 264 GDDLVCIIDDREDVWQG-CGNLVQVKPYHFFR 294
>gi|426253911|ref|XP_004020634.1| PREDICTED: LOW QUALITY PROTEIN: RNA polymerase II subunit A
C-terminal domain phosphatase, partial [Ovis aries]
Length = 820
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 49/154 (31%), Positives = 81/154 (52%), Gaps = 16/154 (10%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLV---KLRPFVRT 123
RKL L+++LD TL+H + E++ ++ + I FQ+ + + +LRP +
Sbjct: 90 RKLVLMVDLDQTLIH--------TTEQHCQQMSNKGI-FHFQLGRGEPMLHTRLRPHCKE 140
Query: 124 FLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE---DFNGKDRKNPDLV 180
FLE+ + L ++++ T +R YA LD + K FS RI++R+ D K +L
Sbjct: 141 FLEKVARLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECIDPFSKTGNLRNLF 200
Query: 181 RGQERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+ + I+DD E VW NLI + KYVYF+
Sbjct: 201 PCGDSMVCIIDDREDVWK-FAPNLITVKKYVYFQ 233
>gi|365991295|ref|XP_003672476.1| hypothetical protein NDAI_0K00420 [Naumovozyma dairenensis CBS 421]
gi|343771252|emb|CCD27233.1| hypothetical protein NDAI_0K00420 [Naumovozyma dairenensis CBS 421]
Length = 778
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 64/296 (21%), Positives = 125/296 (42%), Gaps = 62/296 (20%)
Query: 21 CEQSLSCAHTTVRDSRCIFCSQAMNDS-------FGLSFDYMLRGLRYSEQE-------- 65
CE C H + C C + ++++ L+ + L+ S +E
Sbjct: 143 CEIQRPCNHDVIYGGLCTLCGKEVDENDIDDLSGPNLTISHTDTNLKISTREAVDIGQSV 202
Query: 66 ------ERKLQLVLNLDHTLLHC---------------RNIKSLSSGEKYLKKQIHSFIG 104
++KL LV++LD T++HC N ++L +++ ++ I
Sbjct: 203 KKRLRDDKKLILVVDLDQTVIHCGVDPTIGEWKRDPTNPNFETLKDVKEFALEE--EPIL 260
Query: 105 SLFQMANDKL-------VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSK 157
L M VK+RP ++ F ++ + L ++++ TM+TR YA K++D
Sbjct: 261 PLMYMGPKPPARKCWYYVKVRPGLKDFFQKVAPLFEMHIYTMATRAYASEIAKIIDPTGD 320
Query: 158 YFSSRIIAREDFNGKDRKNPD-LVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRD- 215
F +RI++R++ K+ + L + ++I+DD VW + + NLI + Y +F
Sbjct: 321 LFGNRILSRDENGSLTTKSLERLFPTDQSMVIIIDDRGDVW-NWSPNLIKVIPYNFFVGV 379
Query: 216 KELN--------------GDHKSYSETLTDESENEEALANVLRVLKTIHRLFFDSV 257
++N G S E+ EN++ L +++ K + R + V
Sbjct: 380 GDINSNFLPKQQATMLQLGRRSSRGESKVSTKENDDLLTDIMDTEKVLQRKINEEV 435
>gi|116179414|ref|XP_001219556.1| hypothetical protein CHGG_00335 [Chaetomium globosum CBS 148.51]
gi|88184632|gb|EAQ92100.1| hypothetical protein CHGG_00335 [Chaetomium globosum CBS 148.51]
Length = 828
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 45/154 (29%), Positives = 80/154 (51%), Gaps = 18/154 (11%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSL--FQMANDKL---------V 115
RKL LV++LD T++ ++ +K H + S+ FQ+ + +
Sbjct: 161 RKLSLVVDLDQTIIQACIDPTVGDWQKDPTNPNHESVKSVKSFQLDDGPTQAANQCSYYI 220
Query: 116 KLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNG---- 171
K+RP + +FL++ + + ++++ TM TR YA+ +++D D K F +R+I+R D NG
Sbjct: 221 KMRPGLESFLKRIAQMYELHVYTMGTRAYAQNVARVVDPDKKLFGNRVISR-DENGSIYA 279
Query: 172 KDRKNPDLVRGQERGIVILDDTESVWSDHTENLI 205
KD + L + I+DD VW ++ NLI
Sbjct: 280 KDLQR--LFPISTHMVAIIDDRSDVWPNNRANLI 311
>gi|297834404|ref|XP_002885084.1| hypothetical protein ARALYDRAFT_897822 [Arabidopsis lyrata subsp.
lyrata]
gi|297330924|gb|EFH61343.1| hypothetical protein ARALYDRAFT_897822 [Arabidopsis lyrata subsp.
lyrata]
Length = 166
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 51/143 (35%), Positives = 78/143 (54%), Gaps = 8/143 (5%)
Query: 64 QEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSF--IGSLFQMANDKLVKLRPFV 121
QE++KL LVL L TL I LS EK+L ++ S + + +++ L+KLRPFV
Sbjct: 11 QEKKKLHLVLGLRGTLYDYIIISHLSDREKHLIGEVDSRDDLWRITAQSHEGLIKLRPFV 70
Query: 122 RTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVR 181
FL +A++ + Y ++S +++ +KLL YF R+I D K DLV
Sbjct: 71 AEFLREANNTLHAY--SLSRPEHSDYMLKLLHPHQTYFGRRVICSRDTC---MKTLDLVL 125
Query: 182 GQERGIVILDDTESV-WSDHTEN 203
ER +V++DD S W+DHT +
Sbjct: 126 VDERVLVVMDDQCSTWWTDHTNH 148
>gi|332029822|gb|EGI69691.1| RNA polymerase II subunit A C-terminal domain phosphatase
[Acromyrmex echinatior]
Length = 749
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 46/152 (30%), Positives = 73/152 (48%), Gaps = 10/152 (6%)
Query: 66 ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFL 125
+RKL L+++LD T++H N + + Q++ + +LRP R FL
Sbjct: 153 DRKLVLLVDLDQTIVHTTNDNIPPNLKDVFHFQLYGLNSPWYH------TRLRPNTRHFL 206
Query: 126 EQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVRGQER 185
+ S L ++++CT R YA LLD D FS RI++R++ K +L
Sbjct: 207 SEMSRLYELHICTFGARIYAHTVASLLDKDGVLFSHRILSRDECFDPASKTANLKALFPC 266
Query: 186 G---IVILDDTESVWSDHTENLIVLGKYVYFR 214
G + I+DD E VW NL+ + Y +FR
Sbjct: 267 GDDLVCIIDDREDVWQG-CGNLVQVKPYHFFR 297
>gi|340709144|ref|XP_003393173.1| PREDICTED: LOW QUALITY PROTEIN: RNA polymerase II subunit A
C-terminal domain phosphatase-like [Bombus terrestris]
Length = 751
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 46/152 (30%), Positives = 74/152 (48%), Gaps = 10/152 (6%)
Query: 66 ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFL 125
+RKL L+++LD T++H N S+ + Q++ + +LRP + FL
Sbjct: 151 DRKLALLVDLDQTIVHTTNDNIPSNIKDVYHYQLYGPNSPWYH------TRLRPNTKHFL 204
Query: 126 EQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVRGQER 185
+ S L ++++CT R YA LLD D FS RI++R++ K +L
Sbjct: 205 SEMSRLYELHICTFGARNYAHTVAALLDKDGTLFSHRILSRDECFDPASKTANLKALFPC 264
Query: 186 G---IVILDDTESVWSDHTENLIVLGKYVYFR 214
G + I+DD E VW NL+ + Y +FR
Sbjct: 265 GDDLVCIIDDREDVWQ-GCGNLVQVKPYHFFR 295
>gi|322785368|gb|EFZ12041.1| hypothetical protein SINV_00693 [Solenopsis invicta]
Length = 759
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 47/152 (30%), Positives = 74/152 (48%), Gaps = 10/152 (6%)
Query: 66 ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFL 125
+RKL L+++LD T++H N + + Q++ + +LRP R FL
Sbjct: 157 DRKLVLLVDLDQTIVHTTNDNIPPNLKDVFHFQLYGPNSPWYH------TRLRPNTRRFL 210
Query: 126 EQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVRGQER 185
+ SSL ++++CT R YA LLD D FS RI++R++ K +L
Sbjct: 211 SKMSSLYELHICTFGARIYAHTVASLLDKDKVLFSHRILSRDECFDPASKTANLKALFPC 270
Query: 186 G---IVILDDTESVWSDHTENLIVLGKYVYFR 214
G + I+DD E VW NL+ + Y +FR
Sbjct: 271 GDDLVCIIDDREDVWQ-GCGNLVQVKPYHFFR 301
>gi|350413080|ref|XP_003489872.1| PREDICTED: RNA polymerase II subunit A C-terminal domain
phosphatase-like [Bombus impatiens]
Length = 751
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 46/152 (30%), Positives = 74/152 (48%), Gaps = 10/152 (6%)
Query: 66 ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFL 125
+RKL L+++LD T++H N S+ + Q++ + +LRP + FL
Sbjct: 151 DRKLALLVDLDQTIVHTTNDNIPSNIKDVYHYQLYGPNSPWYH------TRLRPNTKHFL 204
Query: 126 EQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVRGQER 185
+ S L ++++CT R YA LLD D FS RI++R++ K +L
Sbjct: 205 SEMSRLYELHICTFGARNYAHTVAALLDKDGTLFSHRILSRDECFDPASKTANLKALFPC 264
Query: 186 G---IVILDDTESVWSDHTENLIVLGKYVYFR 214
G + I+DD E VW NL+ + Y +FR
Sbjct: 265 GDDLVCIIDDREDVWQ-GCGNLVQVKPYHFFR 295
>gi|323332189|gb|EGA73600.1| Fcp1p [Saccharomyces cerevisiae AWRI796]
Length = 646
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 60/237 (25%), Positives = 109/237 (45%), Gaps = 45/237 (18%)
Query: 21 CEQSLSCAHTTVRDSRCIFCSQAMN-DSF-GLSFDYMLR-GLRYSEQE------------ 65
CE C H V C C + ++ D+F G+ D + L+ SE E
Sbjct: 110 CEIKRPCNHDIVYGGLCTQCGKEVSADAFDGVPLDVVGDVDLQISETEAIRTGKALKEHL 169
Query: 66 --ERKLQLVLNLDHTLLHC---------------------RNIKSLSSGEKYLKKQIH-S 101
++KL LV++LD T++HC R++KS + E+ + ++ +
Sbjct: 170 RRDKKLILVVDLDQTIIHCGVDPTIAEWKNDPNNPNFETLRDVKSFTLDEELVLPLMYMN 229
Query: 102 FIGSLFQMANDK----LVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSK 157
GS+ + + VK+RP ++ F + + L ++++ TM+TR YA K++D +
Sbjct: 230 DDGSMLRPPPVRKCWYYVKVRPGLKEFFAKVAPLFEMHIYTMATRAYALQIAKIVDPTGE 289
Query: 158 YFSSRIIAREDFNGKDRKN-PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
F RI++R++ K+ L + +V++DD VW + NLI + Y +F
Sbjct: 290 LFGDRILSRDENGSLTTKSLAKLFPTDQSMVVVIDDRGDVW-NWCPNLIKVVPYNFF 345
>gi|348512639|ref|XP_003443850.1| PREDICTED: RNA polymerase II subunit A C-terminal domain
phosphatase-like [Oreochromis niloticus]
Length = 998
Score = 66.6 bits (161), Expect = 1e-08, Method: Composition-based stats.
Identities = 47/157 (29%), Positives = 81/157 (51%), Gaps = 16/157 (10%)
Query: 64 QEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLV---KLRPF 120
+KL L+++LD TL+H + E++ ++ + I FQ+ + + +LRP
Sbjct: 172 HRNKKLVLMVDLDQTLIH--------TTEQHCQRMSNKGIFH-FQLGRGEPMLHTRLRPH 222
Query: 121 VRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE---DFNGKDRKNP 177
+ FLE+ + L ++++ T +R YA LD + K FS RI++R+ D K
Sbjct: 223 CKEFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECIDPFSKTGNLR 282
Query: 178 DLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+L + + I+DD E VW NLI + KY+YF+
Sbjct: 283 NLFPCGDSMVCIIDDREDVWK-FAPNLITVKKYIYFQ 318
>gi|238881126|gb|EEQ44764.1| conserved hypothetical protein [Candida albicans WO-1]
Length = 525
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 65/234 (27%), Positives = 104/234 (44%), Gaps = 47/234 (20%)
Query: 26 SCAHTTVRDSRCIFCSQAMNDSFGLS-FDYMLR----------GLRYSEQE--------- 65
+C HT C C +++ + S ++Y R GL+ S E
Sbjct: 13 ACPHTVQYSGLCALCGKSLEEEKDYSGYNYEDRATIEMSHDNTGLKISFDEAAKIEHNTT 72
Query: 66 -----ERKLQLVLNLDHTLLHCR------NIKSLSSGEKYLK-KQIHSFI---------- 103
ERKL LV++LD T++H +S + Y K + +F
Sbjct: 73 DRLIDERKLILVVDLDQTVIHATVDPTVGEWQSDPANPNYAAVKDVKTFCLEEEAIVPPG 132
Query: 104 GSLFQMANDK---LVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFS 160
+ ++A K VKLRP + FLE+ + ++++ TM+TR YA + K++D D KYF
Sbjct: 133 WTGPKLAPTKCTYYVKLRPGLSEFLEKMAEKYEMHIYTMATRNYALSIAKIIDPDGKYFG 192
Query: 161 SRIIAREDFNGKDRKN-PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
RI++R++ KN L + +VI+DD VW NLI + Y +F
Sbjct: 193 DRILSRDESGSLTHKNLKRLFPVDQSMVVIIDDRGDVWQ-WESNLIKVVPYDFF 245
>gi|365758888|gb|EHN00710.1| Fcp1p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
Length = 677
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 60/237 (25%), Positives = 110/237 (46%), Gaps = 45/237 (18%)
Query: 21 CEQSLSCAHTTVRDSRCIFCSQAMN-DSF-GLSFDYML-RGLRYSEQE------------ 65
CE C H V C C + ++ D+F G+ D + L+ SE E
Sbjct: 60 CEIKRPCNHDIVYGGLCTQCGKEVSADAFDGVPLDVVGDMDLQISETEAIRSGEALKEHL 119
Query: 66 --ERKLQLVLNLDHTLLHC---------------------RNIKSLSSGEKYLKKQIH-S 101
++KL LV++LD T++HC R++KS + E+ + ++ +
Sbjct: 120 RRDKKLILVVDLDQTIIHCGVDPTIAEWKNDPNNPNFETLRDVKSFTLDEELVLPLMYMN 179
Query: 102 FIGSLFQMANDK----LVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSK 157
GS+ + + VK+RP ++ F ++ + L ++++ TM+TR YA K++D +
Sbjct: 180 EDGSVLKPPPVRKCWYYVKVRPGLKEFFDKVAPLFEMHIYTMATRAYAIQIAKIVDPTGE 239
Query: 158 YFSSRIIAREDFNGKDRKN-PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
F RI++R++ K+ L + +V++DD VW + NLI + Y +F
Sbjct: 240 LFGDRILSRDENGSLTTKSLAKLFPTDQSMVVVIDDRGDVW-NWCPNLIKVVPYNFF 295
>gi|194214772|ref|XP_001496059.2| PREDICTED: RNA polymerase II subunit A C-terminal domain
phosphatase [Equus caballus]
Length = 868
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 50/153 (32%), Positives = 80/153 (52%), Gaps = 14/153 (9%)
Query: 67 RKLQLVLNLDHTLLHC--RNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTF 124
RKL L+++LD TL+H ++ + +S+ K H +G M + +LRP + F
Sbjct: 89 RKLVLMVDLDQTLIHTTEQHCQQMSN-----KGIFHFQLGRGEPMLH---TRLRPHCKEF 140
Query: 125 LEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE---DFNGKDRKNPDLVR 181
LE+ + L ++++ T +R YA LD + K FS RI++R+ D K +L
Sbjct: 141 LEKTAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECIDPFSKTGNLRNLFP 200
Query: 182 GQERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+ + I+DD E VW NLI + KYVYF+
Sbjct: 201 CGDSMVCIIDDREDVWK-FAPNLITVKKYVYFQ 232
>gi|367047187|ref|XP_003653973.1| hypothetical protein THITE_2116513 [Thielavia terrestris NRRL 8126]
gi|347001236|gb|AEO67637.1| hypothetical protein THITE_2116513 [Thielavia terrestris NRRL 8126]
Length = 909
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 45/163 (27%), Positives = 81/163 (49%), Gaps = 14/163 (8%)
Query: 65 EERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSL--FQMANDK--------- 113
+ RKL LV++LD T++ ++ ++ H + + FQ+ +
Sbjct: 159 QSRKLSLVVDLDQTIIQACIDPTVGEWQRDPTNPNHESVKEVKSFQLDDGPSDLARRCSY 218
Query: 114 LVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGK- 172
+K+RP + FL++ S L ++++ TM TR YA+ +++D K F +R+I+R D NG
Sbjct: 219 YIKMRPGLEEFLKRISELYEMHVYTMGTRAYAQNVARVVDPQRKLFGNRVISR-DENGNM 277
Query: 173 -DRKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+ L +VI+DD VW + NLI + Y +F+
Sbjct: 278 FAKSLGRLFPVSTNMVVIIDDRSDVWPRNRPNLIKVSPYEFFK 320
>gi|363752479|ref|XP_003646456.1| hypothetical protein Ecym_4610 [Eremothecium cymbalariae
DBVPG#7215]
gi|356890091|gb|AET39639.1| hypothetical protein Ecym_4610 [Eremothecium cymbalariae
DBVPG#7215]
Length = 751
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 60/233 (25%), Positives = 97/233 (41%), Gaps = 46/233 (19%)
Query: 26 SCAHTTVRDSRCIFCSQAMND----------SFGLSFDYMLRGLRYSEQ----------- 64
C H C+ C Q + D L+ + +R SE+
Sbjct: 101 PCTHDVTYGGLCVQCGQTVEDEQTSGSLLDNQAKLTMSHTNMNIRISEKQAYTLEKSAQK 160
Query: 65 ---EERKLQLVLNLDHTLLHCRNIKSLSSGEKYLK-------KQIHSF------IGSLFQ 108
E RKL LV++LD T++HC ++ K K + SF + F
Sbjct: 161 QLREARKLVLVVDLDQTVIHCGVDPTIGEWSKDPDNPNYESLKDVRSFSLHEEPVLPPFY 220
Query: 109 MANDKL-------VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSS 161
M VKLRP ++ F + ++++ TM+TR YA K++D D F
Sbjct: 221 MGPKPPTRKCWYYVKLRPGLQDFFSNIAPHFELHIYTMATRTYALEIAKIIDPDGTLFGD 280
Query: 162 RIIAREDFNGKDRKNPD-LVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
RI++R++ +K+ + L + +VI+DD VW + ENLI + Y +F
Sbjct: 281 RILSRDENGSLTQKSLERLFPMDQSMVVIIDDRGDVW-NWCENLIKVVPYDFF 332
>gi|392578708|gb|EIW71836.1| hypothetical protein TREMEDRAFT_67978 [Tremella mesenterica DSM
1558]
Length = 944
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 38/103 (36%), Positives = 58/103 (56%), Gaps = 6/103 (5%)
Query: 114 LVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED---FN 170
K RP + FLE+ + L ++++ TM TR YAEA V ++D + KYF RI++R+D F
Sbjct: 354 FTKPRPGLAKFLEEMNKLYEMHVYTMGTRTYAEAIVGIVDPEGKYFGGRILSRDDSRNFT 413
Query: 171 GKDRKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
K+ K L +V++DD VW D NL+ + Y +F
Sbjct: 414 TKNLKR--LFPTDTSMVVVIDDRADVWGD-CPNLVKVRPYDFF 453
>gi|425767354|gb|EKV05928.1| RNA Polymerase II CTD phosphatase Fcp1, putative [Penicillium
digitatum PHI26]
gi|425779797|gb|EKV17828.1| RNA Polymerase II CTD phosphatase Fcp1, putative [Penicillium
digitatum Pd1]
Length = 817
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 54/194 (27%), Positives = 92/194 (47%), Gaps = 27/194 (13%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSL--FQMANDKL--------VK 116
R+L LV++LD T++H ++ + + H + + FQ+ +D +K
Sbjct: 158 RRLTLVVDLDQTIIHATVDPTVGEWREDKQNPNHEAVKDVRQFQLIDDGPGMRGCWYYIK 217
Query: 117 LRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKN 176
LRP + FL+ + + ++++ TM TR YA+ V ++D K F RI++R++ K
Sbjct: 218 LRPGLEEFLQNVAEIYELHIYTMGTRAYAQHIVDIIDPTRKLFGDRILSRDESGSLTVK- 276
Query: 177 PDLVR---GQERGIVILDDTESVWSDHTENLIVLGKYVYF-----------RDKELNGDH 222
DL R + +VI+DD +W + NLI + Y +F KE G +
Sbjct: 277 -DLQRLFPVDTKMVVIIDDRGDIWR-WSPNLIKVSPYDFFVGIGDINSSFLPKKEDIGAN 334
Query: 223 KSYSETLTDESENE 236
KS E T E+ E
Sbjct: 335 KSQIEAKTSENNQE 348
>gi|399215912|emb|CCF72600.1| unnamed protein product [Babesia microti strain RI]
Length = 545
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 68/238 (28%), Positives = 104/238 (43%), Gaps = 39/238 (16%)
Query: 25 LSCAHTTVRDSRCIFCSQAMN---DSFGLSFDYMLRGLRYSEQE---------------- 65
L+C H+ V C C++ ++ DSF + D + G +E
Sbjct: 106 LTCDHSVVVHGLCADCNEEIDITEDSFDID-DVVKPGFITNEASMSISATFVRQMEESNL 164
Query: 66 -----ERKLQLVLNLDHTLLHCRNI---KSLSSGEKYLKKQIHSFIGSLFQMANDKLVKL 117
+R L LVL+LD+TL+H + + + L S + + K I+ F G L +L
Sbjct: 165 HSLLIKRLLCLVLDLDNTLIHAKTLDKNEVLDSNDDF--KAIY-FGGRC------NLYRL 215
Query: 118 RPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNP 177
RP V FL+ S +YL TM T +A AA+ LLD K FS+RI +R D + RK
Sbjct: 216 RPGVSEFLDAMSKYYQLYLFTMGTSEHATAALSLLDPQGKLFSNRIFSRSD-SQNSRKTL 274
Query: 178 DLVRGQERGIV-ILDDTESVWSDHTENLIVLGKYVYFRDKELNGDHKSYSETLTDESE 234
+ +GIV ++DD E W + Y+ E + H + +T S
Sbjct: 275 SRIFPNYQGIVCVVDDCEHAWRADLSGAGFFKIHPYYYFSERSKQHNPLTAMITAASN 332
>gi|146421209|ref|XP_001486555.1| hypothetical protein PGUG_02226 [Meyerozyma guilliermondii ATCC
6260]
Length = 732
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 67/240 (27%), Positives = 101/240 (42%), Gaps = 50/240 (20%)
Query: 21 CEQSLSCAHTTVRDSRCIFCSQAMNDSFGLSFDYMLR----------GLRYSEQE----- 65
C C H C C ++D + Y R GL+ S E
Sbjct: 46 CRVKEPCGHEVQYGGLCAMCGLTVDDKDYSGYSYEDRATISMAHDSTGLKISFDEAAKLE 105
Query: 66 ---------ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLK---------KQIHSFI---- 103
ERKL LV++LD T++H ++ GE L K + SF
Sbjct: 106 QSTSERLTSERKLILVVDLDQTVIHATVDPTV--GEWQLDPLNPNYRAVKDVRSFCLEED 163
Query: 104 ------GSLFQMANDK---LVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDL 154
S +M K VK+RP + FL++ S L ++++ TM+TR YA A ++D
Sbjct: 164 PIAPPGWSGPKMTPTKCWYYVKVRPGLEDFLKRVSQLYEMHVYTMATRNYALAIAHIIDP 223
Query: 155 DSKYFSSRIIAREDFNGKDRKN-PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
D +YF RI++R++ KN L + +VI+DD VW +NLI + Y +F
Sbjct: 224 DGRYFGDRILSRDESGSLTHKNLRRLFPVDQLMVVIIDDRGDVWQ-WEKNLIKVVPYEFF 282
>gi|301118528|ref|XP_002906992.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262108341|gb|EEY66393.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 735
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 55/201 (27%), Positives = 88/201 (43%), Gaps = 54/201 (26%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLE 126
+KL LVL+LDHTLLH + +D + +++ V
Sbjct: 271 KKLSLVLDLDHTLLHAVRV-------------------------DDVVSEIKQTV----- 300
Query: 127 QASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVRGQ--- 183
L D+++ T TR YAE V ++D D YF +RI+AR D PD++
Sbjct: 301 ----LYDLFIYTHGTRLYAEKIVNIIDPDETYFKNRIVARTD-------TPDMLHKSLKL 349
Query: 184 ------ERGIVILDDTESVWSDHTENLIVLGKYVYFR-DKELN---GDHKSYSETLTDES 233
+ I++LDD VW ++ N+ ++ Y YF+ E+N G + E E+
Sbjct: 350 LFPSCDDSMILVLDDRIDVWKENEGNVFLIEPYHYFKCTSEINNASGRGVAGMEDSEAEA 409
Query: 234 ENEEALANVLRVLKTIHRLFF 254
+ LA VL+ +H F+
Sbjct: 410 SEDSHLAQSTTVLRHVHEAFY 430
>gi|320581076|gb|EFW95298.1| RNA Pol II CTD phosphatase component, putative [Ogataea
parapolymorpha DL-1]
Length = 743
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 58/233 (24%), Positives = 101/233 (43%), Gaps = 46/233 (19%)
Query: 26 SCAHTTVRDSRCIFCSQAMND----------SFGLSFDYMLRGLRYSEQE---------- 65
C+H+ C C + + D +S + L+ S +E
Sbjct: 105 PCSHSIQYGGLCALCGKNVEDLDYTGFNDKDRAPISMSHGTTNLKVSTKEAENIERSSTQ 164
Query: 66 ----ERKLQLVLNLDHTLLHCRNIKSLS------SGEKYLK-KQIHSF-------IGSLF 107
E KL LV++LD T++H ++ + Y K + SF + +
Sbjct: 165 RLLKEEKLSLVVDLDQTVIHATVDPTVGEWMSDPTNPNYESIKDVRSFCLEEEPILPPNY 224
Query: 108 QMANDK------LVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSS 161
+ VKLRP ++ FLE+ S L ++++ TM+TR YA++ K++D D YF
Sbjct: 225 KGPKPPSHKRWYYVKLRPGLQEFLEKVSKLYELHIYTMATRSYAKSIAKIIDPDGIYFGD 284
Query: 162 RIIAREDFNGKDRKN-PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
RI++R++ +K L +V++DD VW + + NLI + Y +F
Sbjct: 285 RILSRDESGSLTQKTLKRLFPVDTSMVVVIDDRGDVW-NWSPNLIKVVPYDFF 336
>gi|156050785|ref|XP_001591354.1| hypothetical protein SS1G_07980 [Sclerotinia sclerotiorum 1980]
gi|154692380|gb|EDN92118.1| hypothetical protein SS1G_07980 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 806
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 45/160 (28%), Positives = 79/160 (49%), Gaps = 13/160 (8%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSL--FQMANDKL---------- 114
RKL LV++LD T++H ++ ++ + + + + FQ+ +D
Sbjct: 160 RKLSLVVDLDQTIIHACIEPTVGEWQRDVNSPNYEAVKDVRSFQLNDDGPRGLASGCWYY 219
Query: 115 VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAR-EDFNGKD 173
+K+RP + FL + S + ++++ TM TR YA + K++D K F RII+R E+ N
Sbjct: 220 IKMRPGLAEFLTKISEMYELHVYTMGTRAYALSIAKIVDPGKKLFGDRIISRDENGNVTA 279
Query: 174 RKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
+ L + I+DD VW + NLI + Y +F
Sbjct: 280 KSLARLFPQSTHMVAIIDDRADVWPMNRPNLIKVVPYDFF 319
>gi|341882050|gb|EGT37985.1| hypothetical protein CAEBREN_32558 [Caenorhabditis brenneri]
Length = 673
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 49/203 (24%), Positives = 96/203 (47%), Gaps = 21/203 (10%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLE 126
RKL L+++LD T++H + EK H I + KLRP FL
Sbjct: 142 RKLVLLVDLDQTIIHTSDKPMSEDSEK------HKDITRYGLNHRKYITKLRPHTTEFLN 195
Query: 127 QASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPD-------- 178
+ +++ ++++ T R YA ++LD +++ F RI++R++ K +
Sbjct: 196 KMATMYEMHIVTYGQRQYAHKIAQILDPEARLFGQRILSRDELFSAQHKTRNLKVIILFQ 255
Query: 179 --LVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRD-KELNGDHKSYSE---TLTDE 232
L + +VI+DD VW +++ LI + Y +F++ ++N S + + D+
Sbjct: 256 KALFPCGDNLVVIIDDRADVWM-YSDALIQIKPYRFFKEVGDINAPQNSKEQMPVQIEDD 314
Query: 233 SENEEALANVLRVLKTIHRLFFD 255
+ ++ L + RVL IH +++
Sbjct: 315 AHEDKVLEEIERVLTNIHDKYYE 337
>gi|89269074|emb|CAJ81904.1| ctd (carboxy terminal domain rna polymerase 2 polypeptide a)
phosphatase subunit 1 [Xenopus (Silurana) tropicalis]
Length = 567
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 49/151 (32%), Positives = 75/151 (49%), Gaps = 10/151 (6%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLE 126
RKL L+++LD TL+H K + H +G M + +LRP + FLE
Sbjct: 176 RKLVLMVDLDQTLIHTTEQHCQHMSRKGI---FHFQLGRGEPMLH---TRLRPHCKEFLE 229
Query: 127 QASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE---DFNGKDRKNPDLVRGQ 183
+ + L ++++ T +R YA LD + K FS RI++R+ D K +L
Sbjct: 230 KIAKLFELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECIDPYSKTGNLRNLFPCG 289
Query: 184 ERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+ + I+DD E VW NLI + KYVYF+
Sbjct: 290 DSMVCIIDDREDVWK-FAPNLITVKKYVYFQ 319
>gi|255732778|ref|XP_002551312.1| hypothetical protein CTRG_05610 [Candida tropicalis MYA-3404]
gi|240131053|gb|EER30614.1| hypothetical protein CTRG_05610 [Candida tropicalis MYA-3404]
Length = 818
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 67/236 (28%), Positives = 104/236 (44%), Gaps = 51/236 (21%)
Query: 26 SCAHTTVRDSRCIFCSQAMNDSFGLS-FDYMLR----------GLRYSEQE--------- 65
+C HT C C +++ + S ++Y R GL+ S E
Sbjct: 98 ACPHTVQYGGLCALCGKSLEEEKDYSGYNYEDRATIEMSHDKTGLKISFDEAAKIEHSTT 157
Query: 66 -----ERKLQLVLNLDHTLLHCR------NIKSLSSGEKYLK-KQIHSF----------- 102
E+KL LV++LD T++H +S S Y K + SF
Sbjct: 158 DRLIDEKKLILVVDLDQTVIHATVDPTVGEWQSDPSNPNYRAVKDVRSFCLEEQPIVPPG 217
Query: 103 -IGSLFQMANDK---LVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKY 158
G ++A K VKLRP + FLE+ S ++++ TM+TR YA A K++D + KY
Sbjct: 218 WTGP--KLAPTKCTYYVKLRPGLSEFLERMSEKYEMHIYTMATRNYALAIAKIIDPEGKY 275
Query: 159 FSSRIIAREDFNGKDRKN-PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
F RI++R++ KN L + + I+DD VW + NLI + Y +F
Sbjct: 276 FGDRILSRDESGSLTHKNLKRLFPVDQSMVAIIDDRGDVWQWES-NLIKVVPYDFF 330
>gi|299470348|emb|CBN78397.1| Similar to RNA Polymerase II CTD phosphatase Fcp1, putative
[Ectocarpus siliculosus]
Length = 985
Score = 66.2 bits (160), Expect = 2e-08, Method: Composition-based stats.
Identities = 47/151 (31%), Positives = 74/151 (49%), Gaps = 6/151 (3%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLE 126
+KL LVL+LD+TLLHC + IH+ L + +KLRP +R FL
Sbjct: 258 KKLSLVLDLDNTLLHCSDHPDAGRVVVPGVDGIHAL--RLPNQQREYYIKLRPGLRRFLA 315
Query: 127 QASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVRGQERG 186
QA+++ ++ + T T YA+A +LD D F R + R L R G
Sbjct: 316 QAATMFEMTIYTAGTSQYADAVASVLDPDRSLFQGRHFSTCYTPDLGRNTKSLERIFPNG 375
Query: 187 I---VILDDTESVW-SDHTENLIVLGKYVYF 213
+ +I+DD + VW + +NL+++ Y +F
Sbjct: 376 LDMALIVDDRDDVWRGEQAKNLLLVRPYKFF 406
>gi|347836062|emb|CCD50634.1| similar to FCP1-like phosphatase [Botryotinia fuckeliana]
Length = 832
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 45/160 (28%), Positives = 78/160 (48%), Gaps = 13/160 (8%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSL--FQMANDKL---------- 114
RKL LV++LD T++H ++ ++ + + + + FQ+ +D
Sbjct: 160 RKLSLVVDLDQTIIHACIEPTVGEWQRDVNSPNYEAVKDVRSFQLNDDGPRGLASGCWYY 219
Query: 115 VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAR-EDFNGKD 173
+K+RP + FL + S + ++++ TM TR YA K++D K F RII+R E+ N
Sbjct: 220 IKMRPGLAEFLAKVSEMYELHVYTMGTRAYALNIAKIVDPGKKLFGDRIISRDENGNVTA 279
Query: 174 RKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
+ L + I+DD VW + NLI + Y +F
Sbjct: 280 KSLARLFPQSTHMVAIIDDRADVWPMNRPNLIKVVPYDFF 319
>gi|358418617|ref|XP_003583993.1| PREDICTED: RNA polymerase II subunit A C-terminal domain
phosphatase-like [Bos taurus]
Length = 864
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 49/154 (31%), Positives = 81/154 (52%), Gaps = 16/154 (10%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLV---KLRPFVRT 123
RKL L+++LD TL+H + E++ ++ + I FQ+ + + +LRP +
Sbjct: 178 RKLVLMVDLDQTLIH--------TTEQHCQQMSNKGI-FHFQLGRGEPMLHTRLRPHCKE 228
Query: 124 FLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE---DFNGKDRKNPDLV 180
FLE+ + L ++++ T +R YA LD + K FS RI++R+ D K +L
Sbjct: 229 FLEKVARLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECIDPFSKTGNLRNLF 288
Query: 181 RGQERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+ + I+DD E VW NLI + KYVYF+
Sbjct: 289 PCGDSMVCIIDDREDVWK-FAPNLITVKKYVYFQ 321
>gi|119587036|gb|EAW66632.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
phosphatase, subunit 1, isoform CRA_e [Homo sapiens]
Length = 748
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 50/153 (32%), Positives = 80/153 (52%), Gaps = 14/153 (9%)
Query: 67 RKLQLVLNLDHTLLHC--RNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTF 124
RKL L+++LD TL+H ++ + +S+ K H +G M + +LRP + F
Sbjct: 62 RKLVLMVDLDQTLIHTTEQHCQQMSN-----KGIFHFQLGRGEPMLH---TRLRPHCKDF 113
Query: 125 LEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE---DFNGKDRKNPDLVR 181
LE+ + L ++++ T +R YA LD + K FS RI++R+ D K +L
Sbjct: 114 LEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECIDPFSKTGNLRNLFP 173
Query: 182 GQERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+ + I+DD E VW NLI + KYVYF+
Sbjct: 174 CGDSMVCIIDDREDVWK-FAPNLITVKKYVYFQ 205
>gi|344269798|ref|XP_003406734.1| PREDICTED: LOW QUALITY PROTEIN: RNA polymerase II subunit A
C-terminal domain phosphatase-like [Loxodonta africana]
Length = 972
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 50/153 (32%), Positives = 80/153 (52%), Gaps = 14/153 (9%)
Query: 67 RKLQLVLNLDHTLLHC--RNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTF 124
RKL L+++LD TL+H ++ + +S+ K H +G M + +LRP + F
Sbjct: 187 RKLVLMVDLDQTLIHTTEQHCQQMSN-----KGIFHFQLGRGEPMLH---TRLRPHCKEF 238
Query: 125 LEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE---DFNGKDRKNPDLVR 181
LE+ + L ++++ T +R YA LD + K FS RI++R+ D K +L
Sbjct: 239 LEKVAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECIDPFSKTGNLRNLFP 298
Query: 182 GQERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+ + I+DD E VW NLI + KYVYF+
Sbjct: 299 CGDSMVCIIDDREDVWK-FAPNLITVKKYVYFQ 330
>gi|351695852|gb|EHA98770.1| hypothetical protein GW7_03722 [Heterocephalus glaber]
Length = 963
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 54/187 (28%), Positives = 88/187 (47%), Gaps = 11/187 (5%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLE 126
RKL L+++LD TL+H K + H +G M + +LRP + FLE
Sbjct: 179 RKLVLMVDLDQTLIHTTEQHCPQMSNKGI---FHFQLGRGEPMLH---TRLRPHCKDFLE 232
Query: 127 QASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE---DFNGKDRKNPDLVRGQ 183
+ + L ++++ T +R YA LD + K FS RI++R+ D K +L
Sbjct: 233 KIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECIDPFSKTGNLKNLFPCG 292
Query: 184 ERGIVILDDTESVWSDHTENLIVLGKYVYFRD-KELNGDHKSYSETLTDESENEEALANV 242
+ + I+DD E VW NLI + KYVYF ++N S + + + A+V
Sbjct: 293 DSMVCIIDDREDVWK-FAPNLITVKKYVYFPGTGDMNAPPGSRESQMRKKVNHSSKDADV 351
Query: 243 LRVLKTI 249
L + ++
Sbjct: 352 LEQVPSV 358
>gi|62858037|ref|NP_001017022.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
phosphatase, subunit 1 [Xenopus (Silurana) tropicalis]
Length = 570
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 49/151 (32%), Positives = 75/151 (49%), Gaps = 10/151 (6%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLE 126
RKL L+++LD TL+H K + H +G M + +LRP + FLE
Sbjct: 179 RKLVLMVDLDQTLIHTTEQHCQHMSRKGI---FHFQLGRGEPMLH---TRLRPHCKEFLE 232
Query: 127 QASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE---DFNGKDRKNPDLVRGQ 183
+ + L ++++ T +R YA LD + K FS RI++R+ D K +L
Sbjct: 233 KIAKLFELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECIDPYSKTGNLRNLFPCG 292
Query: 184 ERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+ + I+DD E VW NLI + KYVYF+
Sbjct: 293 DSMVCIIDDREDVWK-FAPNLITVKKYVYFQ 322
>gi|348665920|gb|EGZ05748.1| hypothetical protein PHYSODRAFT_566275 [Phytophthora sojae]
Length = 684
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 69/272 (25%), Positives = 104/272 (38%), Gaps = 75/272 (27%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLE 126
+KL LVL+LDHTLLH ++ +G + +
Sbjct: 272 KKLSLVLDLDHTLLHA--------------VRVDDVVGEIPKSG---------------- 301
Query: 127 QASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVRGQ--- 183
S+L D+++ T TR YAE VK++D D YF +RI+AR D PD++
Sbjct: 302 MLSALYDLFIYTHGTRLYAEQIVKIIDPDESYFKNRIVARTD-------TPDMLHKSLKL 354
Query: 184 ------ERGIVILDDTESVWSDHTENLIVLGKYVYFRDKELNGDHKSYSETLTDESENEE 237
+ I++LDD VW ++ N+ ++ Y YF+ T E N
Sbjct: 355 LFPSCDDSMILVLDDRIDVWKENEGNVFLIEPYHYFK--------------CTSEINNAS 400
Query: 238 ALANVLRVLKTIHRLFFDSVCGDVRTYLPKVRSEFSRDVLYFSAIFRDCLWAEQEEKFLV 297
+V H F+ +R K + L I + L ++ V
Sbjct: 401 GRGHV-------HETFYAGHETGMRDLGAKPSMTLNNFPLTHLVIHPERLGTQKH----V 449
Query: 298 QEKK----FLVHPRWIDAYYFLWRRRPEDDYL 325
Q KK +V P WI W R E D+L
Sbjct: 450 QAKKIPGVLIVTPDWIIKCARSWSRVSEQDFL 481
>gi|157109625|ref|XP_001650754.1| RNA polymerase ii ctd phosphatase [Aedes aegypti]
gi|108868428|gb|EAT32653.1| AAEL015142-PA, partial [Aedes aegypti]
Length = 569
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 56/214 (26%), Positives = 97/214 (45%), Gaps = 35/214 (16%)
Query: 27 CAHTTVRDSRCIFCSQAM--NDSFGLS---------------FDYMLRGLRYSEQE---- 65
C+HTTV + C C + +D G S + + + L ++ E
Sbjct: 83 CSHTTVINDMCADCGADLRQDDLAGGSEASVPMIHSVPELKVTETLAKKLGQADTERLLR 142
Query: 66 ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFL 125
++KL L+++LD TL+H N ++ + Q++ + +LRP FL
Sbjct: 143 DKKLVLLVDLDQTLIHTTNDNVPNNLKDVYHFQLYGSNSPWYH------TRLRPGALEFL 196
Query: 126 EQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED-FNGKDRKNPDLVRG-- 182
+ ++++CT R YA + LD D K FS RI++R++ FN + D +R
Sbjct: 197 AKMHPYYELHICTFGARNYAHMIAQFLDRDGKLFSHRILSRDECFNATSKT--DNLRALF 254
Query: 183 --QERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+ + I+DD E VW + NLI + Y +F+
Sbjct: 255 PCGDSMVCIIDDREDVW-NMAANLIQVKPYHFFQ 287
>gi|403268140|ref|XP_003926140.1| PREDICTED: RNA polymerase II subunit A C-terminal domain
phosphatase [Saimiri boliviensis boliviensis]
Length = 937
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 50/153 (32%), Positives = 80/153 (52%), Gaps = 14/153 (9%)
Query: 67 RKLQLVLNLDHTLLHC--RNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTF 124
RKL L+++LD TL+H ++ + +S+ K H +G M + +LRP + F
Sbjct: 156 RKLVLMVDLDQTLIHTTEQHCQQMSN-----KGIFHFQLGRGEPMLH---TRLRPHCKDF 207
Query: 125 LEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE---DFNGKDRKNPDLVR 181
LE+ + L ++++ T +R YA LD + K FS RI++R+ D K +L
Sbjct: 208 LEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECIDPFSKTGNLRNLFP 267
Query: 182 GQERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+ + I+DD E VW NLI + KYVYF+
Sbjct: 268 CGDSMVCIIDDREDVWK-FAPNLITVKKYVYFQ 299
>gi|190408503|gb|EDV11768.1| TFIIF interacting component of CTD phosphatase [Saccharomyces
cerevisiae RM11-1a]
Length = 732
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 60/237 (25%), Positives = 109/237 (45%), Gaps = 45/237 (18%)
Query: 21 CEQSLSCAHTTVRDSRCIFCSQAMN-DSF-GLSFDYMLR-GLRYSEQE------------ 65
CE C H V C C + ++ D+F G+ D + L+ SE E
Sbjct: 110 CEIKRPCNHDIVYGGLCTQCGKEVSADAFDGVPLDVVGDVDLQISETEAIRTGKALKEHL 169
Query: 66 --ERKLQLVLNLDHTLLHC---------------------RNIKSLSSGEKYLKKQIH-S 101
++KL LV++LD T++HC R++KS + E+ + ++ +
Sbjct: 170 RRDKKLILVVDLDQTIIHCGVDPTIAEWKNDPNNPNFETLRDVKSFTLDEELVLPLMYMN 229
Query: 102 FIGSLFQMANDK----LVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSK 157
GS+ + + VK+RP ++ F + + L ++++ TM+TR YA K++D +
Sbjct: 230 DDGSMLRPPPVRKCWYYVKVRPGLKEFFAKVAPLFEMHIYTMATRAYALQIAKIVDPTGE 289
Query: 158 YFSSRIIAREDFNGKDRKN-PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
F RI++R++ K+ L + +V++DD VW+ NLI + Y +F
Sbjct: 290 LFGDRILSRDENGSLTTKSLAKLFPTDQSMVVVIDDRGDVWN-WCPNLIKVVPYNFF 345
>gi|50838820|ref|NP_001002873.1| RNA polymerase II subunit A C-terminal domain phosphatase [Danio
rerio]
gi|49618915|gb|AAT68042.1| RNA polymerase II CTD phosphatase [Danio rerio]
Length = 947
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 48/154 (31%), Positives = 81/154 (52%), Gaps = 16/154 (10%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLV---KLRPFVRT 123
RKL L+++LD TL+H + E++ ++ + I FQ+ + + +LRP +
Sbjct: 168 RKLVLMVDLDQTLIH--------TTEQHCQRMSNKGI-FHFQLGRGEPMLHTRLRPHCKD 218
Query: 124 FLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE---DFNGKDRKNPDLV 180
FLE+ + L ++++ T +R YA LD + K FS RI++R+ D K +L
Sbjct: 219 FLEKIAKLFELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECIDPFSKTGNLKNLF 278
Query: 181 RGQERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+ + I+DD E VW NLI + KY+YF+
Sbjct: 279 PCGDSMVCIIDDREDVWK-FAPNLITVKKYIYFQ 311
>gi|410977919|ref|XP_003995346.1| PREDICTED: LOW QUALITY PROTEIN: RNA polymerase II subunit A
C-terminal domain phosphatase [Felis catus]
Length = 960
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 61/208 (29%), Positives = 96/208 (46%), Gaps = 32/208 (15%)
Query: 67 RKLQLVLNLDHTLLHC--RNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTF 124
RKL L+++LD TL+H ++ + +S+ K H +G M + ++RP R F
Sbjct: 200 RKLVLMVDLDQTLIHTTEQHCQQMSN-----KGIFHFQLGRGEPMLH---TRVRPHCREF 251
Query: 125 LEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE---DFNGKDRKNPDLVR 181
LE+ + L ++++ T +R YA LD + K FS RI++R+ D K +L
Sbjct: 252 LEKIARLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECIDPFSKTGNLRNLFP 311
Query: 182 GQERGIVILDDTESVWSDHTENLIVLGKYVYFRDKELNGDHKSYSETLTDESENEEALAN 241
+ + I+DD E VW NLI + KYVYF+ GD + S +
Sbjct: 312 CGDSMVCIIDDREDVWK-FAPNLITVKKYVYFQG---TGDINAPSGS------------- 354
Query: 242 VLRVLKTIHRLFFDSVCGDVRTYLPKVR 269
R + R+ S DV + P VR
Sbjct: 355 --RESQARRRVTQSSKAADVAEHAPSVR 380
>gi|310791724|gb|EFQ27251.1| FCP1-like phosphatase [Glomerella graminicola M1.001]
Length = 860
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 46/164 (28%), Positives = 82/164 (50%), Gaps = 16/164 (9%)
Query: 66 ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSL--FQMANDKL--------- 114
+RKL LV++LD T++H ++ + + + + FQ+ ++
Sbjct: 160 QRKLSLVVDLDQTIIHACIEPTVGEWMEDPSNPNYQAVKDVKKFQLNDEGPRGMVTSGCW 219
Query: 115 --VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGK 172
+K+RP + FLE+ + L ++++ TM TR YA K++D K F +R+I+R D NG
Sbjct: 220 YYIKMRPGLAEFLEKVAELYELHVYTMGTRAYALNIAKIVDPHQKLFGNRVISR-DENGS 278
Query: 173 --DRKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+ L +VI+DD VW ++ NLI + Y +F+
Sbjct: 279 MISKSLQRLFPVNTNMVVIIDDRADVWPNNRPNLIKVVPYDFFK 322
>gi|6323933|ref|NP_014004.1| Fcp1p [Saccharomyces cerevisiae S288c]
gi|2497216|sp|Q03254.1|FCP1_YEAST RecName: Full=RNA polymerase II subunit A C-terminal domain
phosphatase; AltName: Full=CTD phosphatase FCP1
gi|825543|emb|CAA89775.1| unknown [Saccharomyces cerevisiae]
gi|151945985|gb|EDN64217.1| protein phosphatase [Saccharomyces cerevisiae YJM789]
gi|256270710|gb|EEU05873.1| Fcp1p [Saccharomyces cerevisiae JAY291]
gi|259148865|emb|CAY82110.1| Fcp1p [Saccharomyces cerevisiae EC1118]
gi|285814283|tpg|DAA10178.1| TPA: Fcp1p [Saccharomyces cerevisiae S288c]
gi|323346974|gb|EGA81251.1| Fcp1p [Saccharomyces cerevisiae Lalvin QA23]
gi|323353207|gb|EGA85507.1| Fcp1p [Saccharomyces cerevisiae VL3]
gi|392297449|gb|EIW08549.1| Fcp1p [Saccharomyces cerevisiae CEN.PK113-7D]
Length = 732
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 60/237 (25%), Positives = 109/237 (45%), Gaps = 45/237 (18%)
Query: 21 CEQSLSCAHTTVRDSRCIFCSQAMN-DSF-GLSFDYMLR-GLRYSEQE------------ 65
CE C H V C C + ++ D+F G+ D + L+ SE E
Sbjct: 110 CEIKRPCNHDIVYGGLCTQCGKEVSADAFDGVPLDVVGDVDLQISETEAIRTGKALKEHL 169
Query: 66 --ERKLQLVLNLDHTLLHC---------------------RNIKSLSSGEKYLKKQIH-S 101
++KL LV++LD T++HC R++KS + E+ + ++ +
Sbjct: 170 RRDKKLILVVDLDQTIIHCGVDPTIAEWKNDPNNPNFETLRDVKSFTLDEELVLPLMYMN 229
Query: 102 FIGSLFQMANDK----LVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSK 157
GS+ + + VK+RP ++ F + + L ++++ TM+TR YA K++D +
Sbjct: 230 DDGSMLRPPPVRKCWYYVKVRPGLKEFFAKVAPLFEMHIYTMATRAYALQIAKIVDPTGE 289
Query: 158 YFSSRIIAREDFNGKDRKN-PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
F RI++R++ K+ L + +V++DD VW+ NLI + Y +F
Sbjct: 290 LFGDRILSRDENGSLTTKSLAKLFPTDQSMVVVIDDRGDVWN-WCPNLIKVVPYNFF 345
>gi|291414979|ref|XP_002723734.1| PREDICTED: CTD (carboxy-terminal domain, RNA polymerase II,
polypeptide A) phosphatase, subunit 1-like [Oryctolagus
cuniculus]
Length = 940
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 49/151 (32%), Positives = 76/151 (50%), Gaps = 10/151 (6%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLE 126
RKL L+++LD TL+H K + +H +G M + +LRP + FLE
Sbjct: 162 RKLVLMVDLDQTLIHTTEQHCPQMSNKGI---LHFQLGRGEPMLH---TRLRPHCKDFLE 215
Query: 127 QASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE---DFNGKDRKNPDLVRGQ 183
+ + L ++++ T +R YA LD + K FS RI++R+ D K +L
Sbjct: 216 KIARLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECIDPFSKTGNLRNLFPCG 275
Query: 184 ERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+ + I+DD E VW NLI + KYVYF+
Sbjct: 276 DSMVCIIDDREDVWK-FAPNLITVKKYVYFQ 305
>gi|323307594|gb|EGA60861.1| Fcp1p [Saccharomyces cerevisiae FostersO]
Length = 732
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 60/237 (25%), Positives = 109/237 (45%), Gaps = 45/237 (18%)
Query: 21 CEQSLSCAHTTVRDSRCIFCSQAMN-DSF-GLSFDYMLR-GLRYSEQE------------ 65
CE C H V C C + ++ D+F G+ D + L+ SE E
Sbjct: 110 CEIKRPCNHDIVYGGLCTQCGKEVSADAFDGVPLDVVGDVDLQISETEAIRTGKALKEHL 169
Query: 66 --ERKLQLVLNLDHTLLHC---------------------RNIKSLSSGEKYLKKQIH-S 101
++KL LV++LD T++HC R++KS + E+ + ++ +
Sbjct: 170 RRDKKLILVVDLDQTIIHCGVDPTIAEWKNDPNNPNFETLRDVKSFTLDEELVLPLMYMN 229
Query: 102 FIGSLFQMANDK----LVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSK 157
GS+ + + VK+RP ++ F + + L ++++ TM+TR YA K++D +
Sbjct: 230 DDGSMLRPPPVRKCWYYVKVRPGLKEFFAKVAPLFEMHIYTMATRAYALQIAKIVDPTGE 289
Query: 158 YFSSRIIAREDFNGKDRKN-PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
F RI++R++ K+ L + +V++DD VW+ NLI + Y +F
Sbjct: 290 LFGDRILSRDENGSLTTKSLAKLFPTDQSMVVVIDDRGDVWN-WCPNLIKVVPYNFF 345
>gi|50552035|ref|XP_503492.1| YALI0E03278p [Yarrowia lipolytica]
gi|49649361|emb|CAG79071.1| YALI0E03278p [Yarrowia lipolytica CLIB122]
Length = 750
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 55/232 (23%), Positives = 101/232 (43%), Gaps = 45/232 (19%)
Query: 26 SCAHTTVRDSRCIFCSQAMNDS-----------FGLSFDYMLRGLRYSEQE--------- 65
C H C +C ++ D +S + GL S E
Sbjct: 103 PCTHAVQYGGMCAWCGASVADEKDYTDFSNKDRAPISMSHSTAGLTVSLSEAQRLEEGST 162
Query: 66 -----ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKL------ 114
+RKL LV++LD T++H ++ +K + + + + +++
Sbjct: 163 KQLLKQRKLILVVDLDQTVIHVTVDPTVGEWKKDPSNPNYDAVKDVRVFSLEEMTMVSYD 222
Query: 115 ------------VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSR 162
VKLRP ++ FLE S ++++ TM+TR YA+A +++D D +YF R
Sbjct: 223 GGKPVPQLCYYYVKLRPHLKEFLEVVSEKYELHIYTMATRAYAKAIAEIIDPDGRYFGDR 282
Query: 163 IIAREDFNGKDRKNPDLVRGQERGIV-ILDDTESVWSDHTENLIVLGKYVYF 213
I++R++ +K+ + + +V I+DD VW ++NLI + Y +F
Sbjct: 283 ILSRDESGSLTQKSLQRLFPVDTSMVAIIDDRGDVWK-WSKNLIRVVPYDFF 333
>gi|321262398|ref|XP_003195918.1| carboxy-terminal domain (CTD) phosphatase; Fcp1p [Cryptococcus
gattii WM276]
gi|317462392|gb|ADV24131.1| Carboxy-terminal domain (CTD) phosphatase, putative; Fcp1p
[Cryptococcus gattii WM276]
Length = 952
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 37/102 (36%), Positives = 60/102 (58%), Gaps = 6/102 (5%)
Query: 115 VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED---FNG 171
K RP ++ FL++ S L ++++ TM TR YA+A VK++D D K F RI++R++ F+
Sbjct: 309 TKPRPGLQKFLDEMSQLYEMHVYTMGTRTYADAIVKVIDPDGKIFGGRILSRDESGSFSS 368
Query: 172 KDRKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
K+ K L +V++DD VW D NL+ + Y +F
Sbjct: 369 KNLKR--LFPTDTSMVVVIDDRSDVWGD-CPNLVKVVPYDFF 407
>gi|269860082|ref|XP_002649764.1| carboxy-terminal domain (CTD) phosphatase [Enterocytozoon bieneusi
H348]
gi|220066823|gb|EED44294.1| carboxy-terminal domain (CTD) phosphatase [Enterocytozoon bieneusi
H348]
Length = 409
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 52/224 (23%), Positives = 99/224 (44%), Gaps = 34/224 (15%)
Query: 61 YSEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPF 120
Y +KL L L+LD TL+H +LS ++H+ + +K RP
Sbjct: 97 YELYHNKKLILFLDLDQTLIHA----TLSKKPCNFSFKLHNI---------EFFIKKRPG 143
Query: 121 VRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLV 180
+ FL + S + ++ TM TR YA K+LD + +F RI+ R + N +K + +
Sbjct: 144 LDKFLSKLSRFFEFHVYTMGTREYANYICKILDPNKIFFGDRIVTRTENNKMFKKYLERI 203
Query: 181 RGQERGIVILDDTESVWSDHTENLIVLGKYVYFRDKELN--------------------G 220
++ILDD VW + N+ ++ + Y+ ++N
Sbjct: 204 TNFSNNVIILDDRVDVWG-FSPNVFLIKPFYYYDTNDINCTISKQIHTNNKLNNIAKQVN 262
Query: 221 DHKSYSETLTDESENEEALANVLRVLKTIHRLFFDSVCGDVRTY 264
+Y+ +S+N++ L V + L+ IH+ +F + ++++
Sbjct: 263 FQNNYTTKYFKKSKNDKELNFVYKKLRKIHKEYFRQLDSCIKSF 306
>gi|355755122|gb|EHH58989.1| RNA polymerase II subunit A C-terminal domain phosphatase, partial
[Macaca fascicularis]
Length = 861
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 50/153 (32%), Positives = 80/153 (52%), Gaps = 14/153 (9%)
Query: 67 RKLQLVLNLDHTLLHC--RNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTF 124
RKL L+++LD TL+H ++ + +S+ K H +G M + +LRP + F
Sbjct: 76 RKLVLMVDLDQTLIHTTEQHCQQMSN-----KGIFHFQLGRGEPMLH---TRLRPHCKDF 127
Query: 125 LEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE---DFNGKDRKNPDLVR 181
LE+ + L ++++ T +R YA LD + K FS RI++R+ D K +L
Sbjct: 128 LEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECIDPFSKTGNLRNLFP 187
Query: 182 GQERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+ + I+DD E VW NLI + KYVYF+
Sbjct: 188 CGDSMVCIIDDREDVWK-FAPNLITVKKYVYFQ 219
>gi|30962890|gb|AAH52576.1| CTDP1 protein, partial [Homo sapiens]
Length = 874
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 50/153 (32%), Positives = 80/153 (52%), Gaps = 14/153 (9%)
Query: 67 RKLQLVLNLDHTLLHC--RNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTF 124
RKL L+++LD TL+H ++ + +S+ K H +G M + +LRP + F
Sbjct: 94 RKLVLMVDLDQTLIHTTEQHCQQMSN-----KGIFHFQLGRGEPMLH---TRLRPHCKDF 145
Query: 125 LEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE---DFNGKDRKNPDLVR 181
LE+ + L ++++ T +R YA LD + K FS RI++R+ D K +L
Sbjct: 146 LEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECIDPISKTGNLRNLFP 205
Query: 182 GQERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+ + I+DD E VW NLI + KYVYF+
Sbjct: 206 CGDSMVCIIDDREDVWK-FAPNLITVKKYVYFQ 237
>gi|355681363|gb|AER96784.1| CTD phosphatase, subunit 1 [Mustela putorius furo]
Length = 819
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 50/153 (32%), Positives = 80/153 (52%), Gaps = 14/153 (9%)
Query: 67 RKLQLVLNLDHTLLHC--RNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTF 124
RKL L+++LD TL+H ++ + +S+ K H +G M + ++RP R F
Sbjct: 62 RKLVLMVDLDQTLIHTTEQHCQQMSN-----KGIFHFQLGRGEPMLH---TRVRPHCREF 113
Query: 125 LEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE---DFNGKDRKNPDLVR 181
LE+ + L ++++ T +R YA LD + K FS RI++R+ D K +L
Sbjct: 114 LEKIARLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECIDPFSKTGNLRNLFP 173
Query: 182 GQERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+ + I+DD E VW NLI + KYVYF+
Sbjct: 174 CGDSMVCIIDDREDVWK-FAPNLITVKKYVYFQ 205
>gi|349580569|dbj|GAA25729.1| K7_Fcp1p [Saccharomyces cerevisiae Kyokai no. 7]
Length = 732
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 60/237 (25%), Positives = 109/237 (45%), Gaps = 45/237 (18%)
Query: 21 CEQSLSCAHTTVRDSRCIFCSQAMN-DSF-GLSFDYMLR-GLRYSEQE------------ 65
CE C H V C C + ++ D+F G+ D + L+ SE E
Sbjct: 110 CEIKRPCNHDIVYGGLCTQCGKEVSADAFDGVPLDVVGDVDLQISETEAIRTGKALKEHL 169
Query: 66 --ERKLQLVLNLDHTLLHC---------------------RNIKSLSSGEKYLKKQIH-S 101
++KL LV++LD T++HC R++KS + E+ + ++ +
Sbjct: 170 RRDKKLILVVDLDQTIIHCGVDPTIAEWKNDPNNPNFETLRDVKSFTLDEELVLPLMYMN 229
Query: 102 FIGSLFQMANDK----LVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSK 157
GS+ + + VK+RP ++ F + + L ++++ TM+TR YA K++D +
Sbjct: 230 DDGSMLRPPPVRKCWYYVKVRPGLKEFFAKVAPLFEMHIYTMATRAYALQIAKIVDPTGE 289
Query: 158 YFSSRIIAREDFNGKDRKN-PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
F RI++R++ K+ L + +V++DD VW+ NLI + Y +F
Sbjct: 290 LFGDRILSRDENGSLTTKSLTKLFPTDQSMVVVIDDRGDVWN-WCPNLIKVVPYNFF 345
>gi|426386293|ref|XP_004059621.1| PREDICTED: RNA polymerase II subunit A C-terminal domain
phosphatase isoform 1 [Gorilla gorilla gorilla]
gi|426386295|ref|XP_004059622.1| PREDICTED: RNA polymerase II subunit A C-terminal domain
phosphatase isoform 2 [Gorilla gorilla gorilla]
Length = 842
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 50/153 (32%), Positives = 80/153 (52%), Gaps = 14/153 (9%)
Query: 67 RKLQLVLNLDHTLLHC--RNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTF 124
RKL L+++LD TL+H ++ + +S+ K H +G M + +LRP + F
Sbjct: 62 RKLVLMVDLDQTLIHTTEQHCQQMSN-----KGIFHFQLGRGEPMLH---TRLRPHCKDF 113
Query: 125 LEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE---DFNGKDRKNPDLVR 181
LE+ + L ++++ T +R YA LD + K FS RI++R+ D K +L
Sbjct: 114 LEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECIDPFSKTGNLRNLFP 173
Query: 182 GQERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+ + I+DD E VW NLI + KYVYF+
Sbjct: 174 CGDSMVCIIDDREDVWK-FAPNLITVKKYVYFQ 205
>gi|402903421|ref|XP_003914564.1| PREDICTED: RNA polymerase II subunit A C-terminal domain
phosphatase isoform 3 [Papio anubis]
Length = 846
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 50/153 (32%), Positives = 80/153 (52%), Gaps = 14/153 (9%)
Query: 67 RKLQLVLNLDHTLLHC--RNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTF 124
RKL L+++LD TL+H ++ + +S+ K H +G M + +LRP + F
Sbjct: 62 RKLVLMVDLDQTLIHTTEQHCQQMSN-----KGIFHFQLGRGEPMLH---TRLRPHCKDF 113
Query: 125 LEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE---DFNGKDRKNPDLVR 181
LE+ + L ++++ T +R YA LD + K FS RI++R+ D K +L
Sbjct: 114 LEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECIDPFSKTGNLRNLFP 173
Query: 182 GQERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+ + I+DD E VW NLI + KYVYF+
Sbjct: 174 CGDSMVCIIDDREDVWK-FAPNLITVKKYVYFQ 205
>gi|380472901|emb|CCF46552.1| FCP1-like phosphatase, partial [Colletotrichum higginsianum]
Length = 740
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 46/164 (28%), Positives = 81/164 (49%), Gaps = 16/164 (9%)
Query: 66 ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSL--FQMANDKL--------- 114
+RKL LV++LD T++H ++ + + + + FQ+ ++
Sbjct: 160 QRKLSLVVDLDQTIIHACIEPTVGEWMEDPSNPNYEAVKDVKKFQLNDEGPRGMVTSGCW 219
Query: 115 --VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGK 172
+K+RP + FLE+ + L ++++ TM TR YA K++D K F +R+I+R D NG
Sbjct: 220 YYIKMRPGLAEFLERVAELYELHVYTMGTRAYALNIAKIVDPQQKLFGNRVISR-DENGS 278
Query: 173 --DRKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+ L +VI+DD VW + NLI + Y +F+
Sbjct: 279 MISKSLQRLFPVNTNMVVIIDDRADVWPSNRPNLIKVVPYDFFK 322
>gi|321267522|ref|NP_001189433.1| RNA polymerase II subunit A C-terminal domain phosphatase isoform 3
[Homo sapiens]
Length = 842
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 50/153 (32%), Positives = 80/153 (52%), Gaps = 14/153 (9%)
Query: 67 RKLQLVLNLDHTLLHC--RNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTF 124
RKL L+++LD TL+H ++ + +S+ K H +G M + +LRP + F
Sbjct: 62 RKLVLMVDLDQTLIHTTEQHCQQMSN-----KGIFHFQLGRGEPMLH---TRLRPHCKDF 113
Query: 125 LEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE---DFNGKDRKNPDLVR 181
LE+ + L ++++ T +R YA LD + K FS RI++R+ D K +L
Sbjct: 114 LEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECIDPFSKTGNLRNLFP 173
Query: 182 GQERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+ + I+DD E VW NLI + KYVYF+
Sbjct: 174 CGDSMVCIIDDREDVWK-FAPNLITVKKYVYFQ 205
>gi|119587034|gb|EAW66630.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
phosphatase, subunit 1, isoform CRA_c [Homo sapiens]
Length = 948
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 50/153 (32%), Positives = 80/153 (52%), Gaps = 14/153 (9%)
Query: 67 RKLQLVLNLDHTLLHC--RNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTF 124
RKL L+++LD TL+H ++ + +S+ K H +G M + +LRP + F
Sbjct: 181 RKLVLMVDLDQTLIHTTEQHCQQMSN-----KGIFHFQLGRGEPMLH---TRLRPHCKDF 232
Query: 125 LEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE---DFNGKDRKNPDLVR 181
LE+ + L ++++ T +R YA LD + K FS RI++R+ D K +L
Sbjct: 233 LEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECIDPFSKTGNLRNLFP 292
Query: 182 GQERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+ + I+DD E VW NLI + KYVYF+
Sbjct: 293 CGDSMVCIIDDREDVWK-FAPNLITVKKYVYFQ 324
>gi|402903417|ref|XP_003914562.1| PREDICTED: RNA polymerase II subunit A C-terminal domain
phosphatase isoform 1 [Papio anubis]
Length = 965
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 50/153 (32%), Positives = 80/153 (52%), Gaps = 14/153 (9%)
Query: 67 RKLQLVLNLDHTLLHC--RNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTF 124
RKL L+++LD TL+H ++ + +S+ K H +G M + +LRP + F
Sbjct: 181 RKLVLMVDLDQTLIHTTEQHCQQMSN-----KGIFHFQLGRGEPMLH---TRLRPHCKDF 232
Query: 125 LEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE---DFNGKDRKNPDLVR 181
LE+ + L ++++ T +R YA LD + K FS RI++R+ D K +L
Sbjct: 233 LEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECIDPFSKTGNLRNLFP 292
Query: 182 GQERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+ + I+DD E VW NLI + KYVYF+
Sbjct: 293 CGDSMVCIIDDREDVWK-FAPNLITVKKYVYFQ 324
>gi|297702856|ref|XP_002828379.1| PREDICTED: LOW QUALITY PROTEIN: RNA polymerase II subunit A
C-terminal domain phosphatase [Pongo abelii]
Length = 962
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 50/153 (32%), Positives = 80/153 (52%), Gaps = 14/153 (9%)
Query: 67 RKLQLVLNLDHTLLHC--RNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTF 124
RKL L+++LD TL+H ++ + +S+ K H +G M + +LRP + F
Sbjct: 181 RKLVLMVDLDQTLIHTTEQHCQQMSN-----KGIFHFQLGRGEPMLH---TRLRPHCKDF 232
Query: 125 LEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE---DFNGKDRKNPDLVR 181
LE+ + L ++++ T +R YA LD + K FS RI++R+ D K +L
Sbjct: 233 LEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECIDPFSKTGNLRNLFP 292
Query: 182 GQERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+ + I+DD E VW NLI + KYVYF+
Sbjct: 293 CGDSMVCIIDDREDVWK-FAPNLITVKKYVYFQ 324
>gi|397467065|ref|XP_003805250.1| PREDICTED: RNA polymerase II subunit A C-terminal domain
phosphatase [Pan paniscus]
Length = 842
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 50/153 (32%), Positives = 80/153 (52%), Gaps = 14/153 (9%)
Query: 67 RKLQLVLNLDHTLLHC--RNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTF 124
RKL L+++LD TL+H ++ + +S+ K H +G M + +LRP + F
Sbjct: 62 RKLVLMVDLDQTLIHTTEQHCQQMSN-----KGIFHFQLGRGEPMLH---TRLRPHCKDF 113
Query: 125 LEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE---DFNGKDRKNPDLVR 181
LE+ + L ++++ T +R YA LD + K FS RI++R+ D K +L
Sbjct: 114 LEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECIDPFSKTGNLRNLFP 173
Query: 182 GQERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+ + I+DD E VW NLI + KYVYF+
Sbjct: 174 CGDSMVCIIDDREDVWK-FAPNLITVKKYVYFQ 205
>gi|355702027|gb|EHH29380.1| RNA polymerase II subunit A C-terminal domain phosphatase, partial
[Macaca mulatta]
Length = 861
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 50/153 (32%), Positives = 80/153 (52%), Gaps = 14/153 (9%)
Query: 67 RKLQLVLNLDHTLLHC--RNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTF 124
RKL L+++LD TL+H ++ + +S+ K H +G M + +LRP + F
Sbjct: 76 RKLVLMVDLDQTLIHTTEQHCQQMSN-----KGIFHFQLGRGEPMLH---TRLRPHCKDF 127
Query: 125 LEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE---DFNGKDRKNPDLVR 181
LE+ + L ++++ T +R YA LD + K FS RI++R+ D K +L
Sbjct: 128 LEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECIDPFSKTGNLRNLFP 187
Query: 182 GQERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+ + I+DD E VW NLI + KYVYF+
Sbjct: 188 CGDSMVCIIDDREDVWK-FAPNLITVKKYVYFQ 219
>gi|39645774|gb|AAH63447.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
phosphatase, subunit 1 [Homo sapiens]
Length = 867
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 49/154 (31%), Positives = 81/154 (52%), Gaps = 16/154 (10%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLV---KLRPFVRT 123
RKL L+++LD TL+H + E++ ++ + I FQ+ + + +LRP +
Sbjct: 181 RKLVLMVDLDQTLIH--------TTEQHCQQMSNKGI-FHFQLGRGEPMLHTRLRPHCKD 231
Query: 124 FLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE---DFNGKDRKNPDLV 180
FLE+ + L ++++ T +R YA LD + K FS RI++R+ D K +L
Sbjct: 232 FLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECIDPFSKTGNLRNLF 291
Query: 181 RGQERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+ + I+DD E VW NLI + KYVYF+
Sbjct: 292 PCGDSMVCIIDDREDVWK-FAPNLITVKKYVYFQ 324
>gi|67188550|ref|NP_430255.2| RNA polymerase II subunit A C-terminal domain phosphatase isoform 2
[Homo sapiens]
gi|119587035|gb|EAW66631.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
phosphatase, subunit 1, isoform CRA_d [Homo sapiens]
Length = 867
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 49/154 (31%), Positives = 81/154 (52%), Gaps = 16/154 (10%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLV---KLRPFVRT 123
RKL L+++LD TL+H + E++ ++ + I FQ+ + + +LRP +
Sbjct: 181 RKLVLMVDLDQTLIH--------TTEQHCQQMSNKGI-FHFQLGRGEPMLHTRLRPHCKD 231
Query: 124 FLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE---DFNGKDRKNPDLV 180
FLE+ + L ++++ T +R YA LD + K FS RI++R+ D K +L
Sbjct: 232 FLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECIDPFSKTGNLRNLF 291
Query: 181 RGQERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+ + I+DD E VW NLI + KYVYF+
Sbjct: 292 PCGDSMVCIIDDREDVWK-FAPNLITVKKYVYFQ 324
>gi|47224149|emb|CAG13069.1| unnamed protein product [Tetraodon nigroviridis]
Length = 159
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 43/153 (28%), Positives = 76/153 (49%), Gaps = 13/153 (8%)
Query: 64 QEERKLQLVLNLDHTLLHCRNIK-SLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVR 122
+ RKL L+++LD+TL+H I LS + K ++ + V+LRP+ +
Sbjct: 15 HQSRKLVLMVDLDNTLIHTTEIPCQLSPKKNVFKMKLEG--------SPTYYVRLRPYYK 66
Query: 123 TFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED--FNGKDRKNPDLV 180
FLE+ S L ++ + T + + YA+ LD D+ +F+ RII+R++ + N
Sbjct: 67 EFLEKISELFELNIFTFACQSYAKTVAGFLDPDNTFFAQRIISRDNCFYPATKMANVRFF 126
Query: 181 RG-QERGIVILDDTESVWSDHTENLIVLGKYVY 212
E ++DD E VW + L+ + Y+Y
Sbjct: 127 SPCGESMTCMIDDREDVW-NFAPGLVAVKPYMY 158
>gi|3769521|gb|AAC64549.1| serine phosphatase FCP1a [Homo sapiens]
Length = 842
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 50/153 (32%), Positives = 80/153 (52%), Gaps = 14/153 (9%)
Query: 67 RKLQLVLNLDHTLLHC--RNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTF 124
RKL L+++LD TL+H ++ + +S+ K H +G M + +LRP + F
Sbjct: 62 RKLVLMVDLDQTLIHTTEQHCQQMSN-----KGIFHFQLGRGEPMLH---TRLRPHCKDF 113
Query: 125 LEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE---DFNGKDRKNPDLVR 181
LE+ + L ++++ T +R YA LD + K FS RI++R+ D K +L
Sbjct: 114 LEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECIDPFSKTGNLRNLFP 173
Query: 182 GQERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+ + I+DD E VW NLI + KYVYF+
Sbjct: 174 CGDSMVCIIDDREDVWK-FAPNLITVKKYVYFQ 205
>gi|410215194|gb|JAA04816.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
phosphatase, subunit 1 [Pan troglodytes]
gi|410254644|gb|JAA15289.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
phosphatase, subunit 1 [Pan troglodytes]
gi|410331971|gb|JAA34932.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
phosphatase, subunit 1 [Pan troglodytes]
Length = 961
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 50/153 (32%), Positives = 80/153 (52%), Gaps = 14/153 (9%)
Query: 67 RKLQLVLNLDHTLLHC--RNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTF 124
RKL L+++LD TL+H ++ + +S+ K H +G M + +LRP + F
Sbjct: 181 RKLVLMVDLDQTLIHTTEQHCQQMSN-----KGIFHFQLGRGEPMLH---TRLRPHCKDF 232
Query: 125 LEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE---DFNGKDRKNPDLVR 181
LE+ + L ++++ T +R YA LD + K FS RI++R+ D K +L
Sbjct: 233 LEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECIDPFSKTGNLRNLFP 292
Query: 182 GQERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+ + I+DD E VW NLI + KYVYF+
Sbjct: 293 CGDSMVCIIDDREDVWK-FAPNLITVKKYVYFQ 324
>gi|109122558|ref|XP_001088601.1| PREDICTED: RNA polymerase II subunit A C-terminal domain
phosphatase isoform 2 [Macaca mulatta]
Length = 964
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 50/153 (32%), Positives = 80/153 (52%), Gaps = 14/153 (9%)
Query: 67 RKLQLVLNLDHTLLHC--RNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTF 124
RKL L+++LD TL+H ++ + +S+ K H +G M + +LRP + F
Sbjct: 181 RKLVLMVDLDQTLIHTTEQHCQQMSN-----KGIFHFQLGRGEPMLH---TRLRPHCKDF 232
Query: 125 LEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE---DFNGKDRKNPDLVR 181
LE+ + L ++++ T +R YA LD + K FS RI++R+ D K +L
Sbjct: 233 LEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECIDPFSKTGNLRNLFP 292
Query: 182 GQERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+ + I+DD E VW NLI + KYVYF+
Sbjct: 293 CGDSMVCIIDDREDVWK-FAPNLITVKKYVYFQ 324
>gi|440638319|gb|ELR08238.1| hypothetical protein GMDG_03040 [Geomyces destructans 20631-21]
Length = 1765
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 55/226 (24%), Positives = 97/226 (42%), Gaps = 38/226 (16%)
Query: 26 SCAHTTVRDSRCIFCSQAMNDS-------------FGLSFDYMLRGL------RYSEQ-- 64
+C+H C C + MN++ + D L + R EQ
Sbjct: 94 TCSHAVQYAGLCALCGKDMNETSWATDTVDAQRAQINMIHDQTLLSVSQDEASRAEEQLQ 153
Query: 65 ----EERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSL--FQMANDK----- 113
+ RKL LV++LD T++H ++ ++ + + + FQ+ +D
Sbjct: 154 RRLLKNRKLSLVVDLDQTIIHACIEPTIGEWQRDPTSPNYEAVKDVKSFQLHDDGPRGLA 213
Query: 114 -----LVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED 168
+K+RP + FL + ++++ TM TR YA+ K++D + K F RII+R++
Sbjct: 214 SGCWYYIKMRPGLAHFLTTIAEKYELHVYTMGTRAYAQEIAKIVDPEHKLFGDRIISRDE 273
Query: 169 FNGKDRKN-PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
K L + +VI+DD VW + NLI + Y +F
Sbjct: 274 NGSLTAKTLSRLFPVDTKMVVIIDDRADVWPRNRSNLIKVVPYDFF 319
>gi|410294550|gb|JAA25875.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
phosphatase, subunit 1 [Pan troglodytes]
Length = 961
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 50/153 (32%), Positives = 80/153 (52%), Gaps = 14/153 (9%)
Query: 67 RKLQLVLNLDHTLLHC--RNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTF 124
RKL L+++LD TL+H ++ + +S+ K H +G M + +LRP + F
Sbjct: 181 RKLVLMVDLDQTLIHTTEQHCQQMSN-----KGIFHFQLGRGEPMLH---TRLRPHCKDF 232
Query: 125 LEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE---DFNGKDRKNPDLVR 181
LE+ + L ++++ T +R YA LD + K FS RI++R+ D K +L
Sbjct: 233 LEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECIDPFSKTGNLRNLFP 292
Query: 182 GQERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+ + I+DD E VW NLI + KYVYF+
Sbjct: 293 CGDSMVCIIDDREDVWK-FAPNLITVKKYVYFQ 324
>gi|67188445|ref|NP_004706.3| RNA polymerase II subunit A C-terminal domain phosphatase isoform 1
[Homo sapiens]
gi|327478586|sp|Q9Y5B0.3|CTDP1_HUMAN RecName: Full=RNA polymerase II subunit A C-terminal domain
phosphatase; AltName: Full=TFIIF-associating CTD
phosphatase
gi|119587032|gb|EAW66628.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
phosphatase, subunit 1, isoform CRA_a [Homo sapiens]
Length = 961
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 50/153 (32%), Positives = 80/153 (52%), Gaps = 14/153 (9%)
Query: 67 RKLQLVLNLDHTLLHC--RNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTF 124
RKL L+++LD TL+H ++ + +S+ K H +G M + +LRP + F
Sbjct: 181 RKLVLMVDLDQTLIHTTEQHCQQMSN-----KGIFHFQLGRGEPMLH---TRLRPHCKDF 232
Query: 125 LEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE---DFNGKDRKNPDLVR 181
LE+ + L ++++ T +R YA LD + K FS RI++R+ D K +L
Sbjct: 233 LEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECIDPFSKTGNLRNLFP 292
Query: 182 GQERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+ + I+DD E VW NLI + KYVYF+
Sbjct: 293 CGDSMVCIIDDREDVWK-FAPNLITVKKYVYFQ 324
>gi|402903419|ref|XP_003914563.1| PREDICTED: RNA polymerase II subunit A C-terminal domain
phosphatase isoform 2 [Papio anubis]
Length = 871
Score = 65.1 bits (157), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 49/154 (31%), Positives = 80/154 (51%), Gaps = 16/154 (10%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLV---KLRPFVRT 123
RKL L+++LD TL+H E++ ++ + I FQ+ + + +LRP +
Sbjct: 181 RKLVLMVDLDQTLIHTT--------EQHCQQMSNKGI-FHFQLGRGEPMLHTRLRPHCKD 231
Query: 124 FLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE---DFNGKDRKNPDLV 180
FLE+ + L ++++ T +R YA LD + K FS RI++R+ D K +L
Sbjct: 232 FLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECIDPFSKTGNLRNLF 291
Query: 181 RGQERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+ + I+DD E VW NLI + KYVYF+
Sbjct: 292 PCGDSMVCIIDDREDVWK-FAPNLITVKKYVYFQ 324
>gi|157823025|ref|NP_001099601.1| RNA polymerase II subunit A C-terminal domain phosphatase [Rattus
norvegicus]
gi|149015915|gb|EDL75222.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
phosphatase, subunit 1 (predicted), isoform CRA_a
[Rattus norvegicus]
Length = 969
Score = 65.1 bits (157), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 49/150 (32%), Positives = 74/150 (49%), Gaps = 10/150 (6%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLE 126
RKL L+++LD TL+H K + H +G M + +LRP + FLE
Sbjct: 177 RKLVLMVDLDQTLIHTTEQHCPQMSNKGI---FHFQLGRGEPMLH---TRLRPHCKDFLE 230
Query: 127 QASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE---DFNGKDRKNPDLVRGQ 183
+ + L ++++ T +R YA LD + K FS RI++R+ D K +L
Sbjct: 231 KIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECIDPFSKTGNLRNLFPCG 290
Query: 184 ERGIVILDDTESVWSDHTENLIVLGKYVYF 213
+ + I+DD E VW NLI + KYVYF
Sbjct: 291 DSMVCIIDDREDVWK-FAPNLITVKKYVYF 319
>gi|34328280|ref|NP_080571.2| RNA polymerase II subunit A C-terminal domain phosphatase [Mus
musculus]
gi|46395722|sp|Q7TSG2.1|CTDP1_MOUSE RecName: Full=RNA polymerase II subunit A C-terminal domain
phosphatase; AltName: Full=TFIIF-associating CTD
phosphatase
gi|31419683|gb|AAH53435.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
phosphatase, subunit 1 [Mus musculus]
Length = 960
Score = 65.1 bits (157), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 49/150 (32%), Positives = 74/150 (49%), Gaps = 10/150 (6%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLE 126
RKL L+++LD TL+H K + H +G M + +LRP + FLE
Sbjct: 181 RKLVLMVDLDQTLIHTTEQHCPQMSNKGI---FHFQLGRGEPMLH---TRLRPHCKDFLE 234
Query: 127 QASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE---DFNGKDRKNPDLVRGQ 183
+ + L ++++ T +R YA LD + K FS RI++R+ D K +L
Sbjct: 235 KIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECIDPFSKTGNLRNLFPCG 294
Query: 184 ERGIVILDDTESVWSDHTENLIVLGKYVYF 213
+ + I+DD E VW NLI + KYVYF
Sbjct: 295 DSMVCIIDDREDVWK-FAPNLITVKKYVYF 323
>gi|74140094|dbj|BAE33777.1| unnamed protein product [Mus musculus]
Length = 960
Score = 65.1 bits (157), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 49/150 (32%), Positives = 74/150 (49%), Gaps = 10/150 (6%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLE 126
RKL L+++LD TL+H K + H +G M + +LRP + FLE
Sbjct: 181 RKLVLMVDLDQTLIHTTEQHCPQMSNKGI---FHFQLGRGEPMLH---TRLRPHCKDFLE 234
Query: 127 QASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE---DFNGKDRKNPDLVRGQ 183
+ + L ++++ T +R YA LD + K FS RI++R+ D K +L
Sbjct: 235 KIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECIDPFSKTGNLRNLFPCG 294
Query: 184 ERGIVILDDTESVWSDHTENLIVLGKYVYF 213
+ + I+DD E VW NLI + KYVYF
Sbjct: 295 DSMVCIIDDREDVWK-FAPNLITVKKYVYF 323
>gi|148677457|gb|EDL09404.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
phosphatase, subunit 1, isoform CRA_a [Mus musculus]
Length = 956
Score = 65.1 bits (157), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 49/150 (32%), Positives = 74/150 (49%), Gaps = 10/150 (6%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLE 126
RKL L+++LD TL+H K + H +G M + +LRP + FLE
Sbjct: 177 RKLVLMVDLDQTLIHTTEQHCPQMSNKGI---FHFQLGRGEPMLH---TRLRPHCKDFLE 230
Query: 127 QASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE---DFNGKDRKNPDLVRGQ 183
+ + L ++++ T +R YA LD + K FS RI++R+ D K +L
Sbjct: 231 KIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECIDPFSKTGNLRNLFPCG 290
Query: 184 ERGIVILDDTESVWSDHTENLIVLGKYVYF 213
+ + I+DD E VW NLI + KYVYF
Sbjct: 291 DSMVCIIDDREDVWK-FAPNLITVKKYVYF 319
>gi|148236185|ref|NP_001090168.1| CTD phosphatase [Xenopus laevis]
gi|13487713|gb|AAK27686.1| CTD phosphatase [Xenopus laevis]
Length = 980
Score = 65.1 bits (157), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 48/151 (31%), Positives = 75/151 (49%), Gaps = 10/151 (6%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLE 126
+KL L+++LD TL+H K + H +G M + +LRP + FLE
Sbjct: 174 KKLVLMVDLDQTLIHTTEQHCQHMSRKGI---FHFQLGRGEPMLH---TRLRPHCKEFLE 227
Query: 127 QASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE---DFNGKDRKNPDLVRGQ 183
+ + L ++++ T +R YA LD + K FS RI++R+ D K +L
Sbjct: 228 KIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECIDPYSKTGNLRNLFPCG 287
Query: 184 ERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+ + I+DD E VW NLI + KYVYF+
Sbjct: 288 DSMVCIIDDREDVWK-FAPNLITVKKYVYFQ 317
>gi|73945347|ref|XP_533365.2| PREDICTED: RNA polymerase II subunit A C-terminal domain
phosphatase isoform 1 [Canis lupus familiaris]
Length = 933
Score = 65.1 bits (157), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 49/154 (31%), Positives = 81/154 (52%), Gaps = 16/154 (10%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLV---KLRPFVRT 123
RKL L+++LD TL+H + E++ ++ + I FQ+ + + ++RP R
Sbjct: 178 RKLVLMVDLDQTLIH--------TTEQHCQQMSNKGI-FHFQLGRGEPMLHTRVRPHCRE 228
Query: 124 FLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE---DFNGKDRKNPDLV 180
FLE+ + L ++++ T +R YA LD + K FS RI++R+ D K +L
Sbjct: 229 FLEKIARLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECIDPFSKTGNLRNLF 288
Query: 181 RGQERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+ + I+DD E VW NLI + KYVYF+
Sbjct: 289 PCGDSMVCIIDDREDVWK-FAPNLITVKKYVYFQ 321
>gi|348555132|ref|XP_003463378.1| PREDICTED: RNA polymerase II subunit A C-terminal domain
phosphatase [Cavia porcellus]
Length = 970
Score = 65.1 bits (157), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 49/150 (32%), Positives = 74/150 (49%), Gaps = 10/150 (6%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLE 126
RKL L+++LD TL+H K + H +G M + +LRP + FLE
Sbjct: 181 RKLVLMVDLDQTLIHTTEQHCPQMSNKGI---FHFQLGRGEPMLH---TRLRPHCKDFLE 234
Query: 127 QASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE---DFNGKDRKNPDLVRGQ 183
+ + L ++++ T +R YA LD + K FS RI++R+ D K +L
Sbjct: 235 KIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECIDPFSKTGNLRNLFPCG 294
Query: 184 ERGIVILDDTESVWSDHTENLIVLGKYVYF 213
+ + I+DD E VW NLI + KYVYF
Sbjct: 295 DSMVCIIDDREDVWK-FAPNLITVKKYVYF 323
>gi|444518074|gb|ELV11938.1| RNA polymerase II subunit A C-terminal domain phosphatase [Tupaia
chinensis]
Length = 876
Score = 65.1 bits (157), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 49/154 (31%), Positives = 79/154 (51%), Gaps = 16/154 (10%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLV---KLRPFVRT 123
RKL L+++LD TL+H E++ + + I FQ+ + + +LRP +
Sbjct: 26 RKLVLMVDLDQTLIHTT--------EQHCAQMSNRGI-FHFQLGRGEPMLHTRLRPHCKD 76
Query: 124 FLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE---DFNGKDRKNPDLV 180
FLE+ + L ++++ T +R YA LD + K FS RI++R+ D K +L
Sbjct: 77 FLEKVAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECIDPFSKTGNLRNLF 136
Query: 181 RGQERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+ + I+DD E VW NLI + KYVYF+
Sbjct: 137 PCGDSMVCIIDDREDVWK-FAPNLITVKKYVYFQ 169
>gi|389637610|ref|XP_003716438.1| RNA polymerase II subunit A domain phosphatase [Magnaporthe oryzae
70-15]
gi|351642257|gb|EHA50119.1| RNA polymerase II subunit A domain phosphatase [Magnaporthe oryzae
70-15]
gi|440471327|gb|ELQ40350.1| RNA polymerase II subunit A C-terminal domain phosphatase
[Magnaporthe oryzae Y34]
gi|440487323|gb|ELQ67117.1| RNA polymerase II subunit A C-terminal domain phosphatase
[Magnaporthe oryzae P131]
Length = 866
Score = 64.7 bits (156), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 39/164 (23%), Positives = 83/164 (50%), Gaps = 10/164 (6%)
Query: 66 ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSL--FQMANDK--------LV 115
+RKL LV++LD T++ ++ +K + + + F++ ++ V
Sbjct: 168 QRKLVLVVDLDQTVIQTACEPTIGEWQKDPSNPNYEALKEVRSFELPSEDGPRRNYTYYV 227
Query: 116 KLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRK 175
K RP FL + S+L ++++ TM+TR YAE ++++D F +R+I+R + G ++
Sbjct: 228 KCRPGTHEFLNKVSNLFEMHVYTMATRAYAEHILRIIDPKKNLFGNRVISRNENKGIEKT 287
Query: 176 NPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRDKELN 219
+ + + ++DD VW + N+I + Y ++ ++N
Sbjct: 288 LQRIFPTSTKMVAVIDDRTDVWPQNRSNVIKVVPYNFYMIGDIN 331
>gi|378756636|gb|EHY66660.1| hypothetical protein NERG_00300 [Nematocida sp. 1 ERTm2]
Length = 507
Score = 64.7 bits (156), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 53/176 (30%), Positives = 80/176 (45%), Gaps = 6/176 (3%)
Query: 115 VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED-FNGKD 173
VKLR + FL++A ++++ TM + YA A VK+LD K F SRII R+D F D
Sbjct: 205 VKLRDRLEWFLKEAEKYCEMHIYTMGNKAYATAIVKILDPTGKLFGSRIITRDDNFGCFD 264
Query: 174 RKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRDKELNGDHKSYSETLTDES 233
+ L + ++ILDD VW +NL + Y +F ++N + L D
Sbjct: 265 KDIKRLFPTNSKHVIILDDRPDVWG-FVDNLYPIKPYYFFETDDINSPEALQNGYLPDVG 323
Query: 234 ENEEALANVLRVLKTIH----RLFFDSVCGDVRTYLPKVRSEFSRDVLYFSAIFRD 285
N +L+ I R FD+ V L +V +EF + I R+
Sbjct: 324 MPVSIPNNKEDLLEEISIECIRNPFDNELEKVLRGLVEVHAEFFAGTYSIAHILRE 379
>gi|388580688|gb|EIM21001.1| FCP1-like phosphatase [Wallemia sebi CBS 633.66]
Length = 510
Score = 64.7 bits (156), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 54/218 (24%), Positives = 94/218 (43%), Gaps = 32/218 (14%)
Query: 27 CAHTTVRDSRCIFCS-----QAMNDSFGLSFDYMLRGLRYSEQE------------ERKL 69
C H C C + ++S+ +S + Y E + KL
Sbjct: 7 CTHPVQLSGLCAICGKDVSQEQQSESYHISHSTANLTVSYDEAQRIGKTSKHTLLKSSKL 66
Query: 70 QLVLNLDHTLLHCR---NIKSLSSGEKYLKK----QIHSFIGSLFQMANDK------LVK 116
L+++LD T++H + L + K +H F F + N VK
Sbjct: 67 ALIVDLDQTIIHATVDPTVNELLQDPTLVYKGALNDVHKFKLGDFGLVNHHEFGSWYFVK 126
Query: 117 LRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKN 176
RP + FL+ + L ++++ TM TR YA A +L+D KYF RI++R++ +K+
Sbjct: 127 FRPGLMEFLDNMNKLFEMHVYTMGTRSYALAICQLIDPSGKYFGERILSRDESGSFTQKS 186
Query: 177 PDLVRGQERGI-VILDDTESVWSDHTENLIVLGKYVYF 213
+ + + VI+DD VW D + NL+ + + +F
Sbjct: 187 LQRLFPTDTSMCVIIDDRADVWGD-SPNLVKVIPFEFF 223
>gi|327270066|ref|XP_003219812.1| PREDICTED: RNA polymerase II subunit A C-terminal domain
phosphatase-like [Anolis carolinensis]
Length = 965
Score = 64.7 bits (156), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 49/153 (32%), Positives = 80/153 (52%), Gaps = 14/153 (9%)
Query: 67 RKLQLVLNLDHTLLHC--RNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTF 124
RKL L+++LD TL+H ++ + +S+ + H +G M + +LRP + F
Sbjct: 164 RKLVLMVDLDQTLIHTTEQHCQQMSN-----RGIFHYQLGRGEPMLH---TRLRPHCKEF 215
Query: 125 LEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE---DFNGKDRKNPDLVR 181
LE+ + L ++++ T +R YA LD + K FS RI++R+ D K +L
Sbjct: 216 LEKIAKLYELHVFTFGSRLYAHTIAAFLDSEKKLFSHRILSRDECIDPFSKTGNLRNLFP 275
Query: 182 GQERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+ + I+DD E VW NLI + KYVYF+
Sbjct: 276 CGDSMVCIIDDREDVWK-FAPNLITVKKYVYFQ 307
>gi|324504080|gb|ADY41763.1| RNA polymerase II subunit A C-terminal domain phosphatase [Ascaris
suum]
Length = 490
Score = 64.7 bits (156), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 54/204 (26%), Positives = 95/204 (46%), Gaps = 23/204 (11%)
Query: 65 EERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTF 124
E R+L L+++LD TL+H N + K + + A D K+RP+ TF
Sbjct: 56 ESRRLVLLVDLDQTLIHTTN-------HAFDMKDSVDVVHYKLRGA-DFYTKIRPYTHTF 107
Query: 125 LEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPD---LVR 181
L + S L ++++ + R YA ++LD D +YF RI++R++ K + L
Sbjct: 108 LRRMSELYEMHIISYGERQYAHKIAEILDPDKRYFGHRILSRDELFSAMYKTGNMKALFP 167
Query: 182 GQERGIVILDDTESVWSDHTENLIVLGKYVYFRDKE-------LNGDHKSYSE----TLT 230
++ I I+DD VW +++ LI + Y +F++ N +S + +
Sbjct: 168 CGDQLIAIIDDRPDVWQ-YSDALIQVKPYRFFKETGDINAPTICNAQQQSLVQERIAQVN 226
Query: 231 DESENEEALANVLRVLKTIHRLFF 254
E + +E L V VL +H F+
Sbjct: 227 VEGDGDETLEFVATVLTRVHTTFY 250
>gi|256073745|ref|XP_002573189.1| rna polymerase II ctd phosphatase [Schistosoma mansoni]
gi|360045501|emb|CCD83049.1| putative rna polymerase II ctd phosphatase [Schistosoma mansoni]
Length = 1345
Score = 64.7 bits (156), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 46/155 (29%), Positives = 77/155 (49%), Gaps = 17/155 (10%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLV---KLRPFVRT 123
RKL L+++LD T++H N + + K +H + ++ LV +LRP +
Sbjct: 149 RKLVLLVDLDQTIIHTTN-----DPQAFKYKNVHRY-----RLPGSPLVYHTRLRPHLEK 198
Query: 124 FLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVRGQ 183
L+ S +++CT R YA ++D +YFS RI++R++ K+ +L
Sbjct: 199 VLDCLSQYYQMHICTFGNRVYAHQLASMIDPKRRYFSQRILSRDECFNPVTKSANLKALF 258
Query: 184 ERG---IVILDDTESVWSDHTENLIVLGKYVYFRD 215
RG + I+DD VW D + NLI + Y +F D
Sbjct: 259 PRGLNLVCIIDDRGEVW-DWSSNLIHVKPYRFFPD 292
>gi|148227040|ref|NP_001081726.1| FCP1 serine phosphatase [Xenopus laevis]
gi|62185667|gb|AAH92306.1| Fcp1 protein [Xenopus laevis]
Length = 979
Score = 64.7 bits (156), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 48/151 (31%), Positives = 75/151 (49%), Gaps = 10/151 (6%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLE 126
+KL L+++LD TL+H K + H +G M + +LRP + FLE
Sbjct: 174 QKLVLMVDLDQTLIHTTEQHCQHMSRKGI---FHFQLGRGEPMLH---TRLRPHCKEFLE 227
Query: 127 QASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE---DFNGKDRKNPDLVRGQ 183
+ + L ++++ T +R YA LD + K FS RI++R+ D K +L
Sbjct: 228 KIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECIDPYSKTGNLRNLFPCG 287
Query: 184 ERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+ + I+DD E VW NLI + KYVYF+
Sbjct: 288 DSMVCIIDDREDVWK-FAPNLITVKKYVYFQ 317
>gi|118784887|ref|XP_314000.3| AGAP005119-PA [Anopheles gambiae str. PEST]
gi|116128258|gb|EAA09414.3| AGAP005119-PA [Anopheles gambiae str. PEST]
Length = 822
Score = 64.3 bits (155), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 44/152 (28%), Positives = 75/152 (49%), Gaps = 10/152 (6%)
Query: 66 ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFL 125
+RKL L+++LD TL+H N ++ + Q++ + +LRP FL
Sbjct: 144 DRKLVLLVDLDQTLIHTTNDNVPNNLKDVYHFQLYGPNSPWYH------TRLRPGALEFL 197
Query: 126 EQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED-FNGKDRKN--PDLVRG 182
+ ++++CT R YA + LD D +FS RI++R++ FN + + L
Sbjct: 198 AKMHPYYELHICTFGARNYAHMIAQFLDKDGNFFSHRILSRDECFNATSKTDNLKALFPC 257
Query: 183 QERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+ + I+DD E VW + NLI + Y +FR
Sbjct: 258 GDSMVCIIDDREDVW-NMASNLIQVKPYHFFR 288
>gi|324508774|gb|ADY43701.1| RNA polymerase II subunit A C-terminal domain phosphatase [Ascaris
suum]
Length = 576
Score = 64.3 bits (155), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 54/204 (26%), Positives = 95/204 (46%), Gaps = 23/204 (11%)
Query: 65 EERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTF 124
E R+L L+++LD TL+H N + K + + A D K+RP+ TF
Sbjct: 142 ESRRLVLLVDLDQTLIHTTN-------HAFDMKDSVDVVHYKLRGA-DFYTKIRPYTHTF 193
Query: 125 LEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPD---LVR 181
L + S L ++++ + R YA ++LD D +YF RI++R++ K + L
Sbjct: 194 LRRMSELYEMHIISYGERQYAHKIAEILDPDKRYFGHRILSRDELFSAMYKTGNMKALFP 253
Query: 182 GQERGIVILDDTESVWSDHTENLIVLGKYVYFRDKE-------LNGDHKSYSE----TLT 230
++ I I+DD VW +++ LI + Y +F++ N +S + +
Sbjct: 254 CGDQLIAIIDDRPDVWQ-YSDALIQVKPYRFFKETGDINAPTICNAQQQSLVQERIAQVN 312
Query: 231 DESENEEALANVLRVLKTIHRLFF 254
E + +E L V VL +H F+
Sbjct: 313 VEGDGDETLEFVATVLTRVHTTFY 336
>gi|358057984|dbj|GAA96229.1| hypothetical protein E5Q_02893 [Mixia osmundae IAM 14324]
Length = 760
Score = 64.3 bits (155), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 52/186 (27%), Positives = 89/186 (47%), Gaps = 31/186 (16%)
Query: 58 GLRYSEQEE--------------RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFI 103
GL SEQE +KL L+++LD T++ ++ + HS +
Sbjct: 178 GLTVSEQEAARLEDASTTRLRKAKKLSLIVDLDQTIIQATVDPTVGDWMRDGTNPNHSAL 237
Query: 104 GSL--FQMAN--DKLV-----------KLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAA 148
+ F++ DK V KLRP ++ FL + + L ++++ TM TR YA A
Sbjct: 238 KDVCVFKLGTQEDKEVVADVDGCWYYLKLRPGLQAFLRKMADLYEMHVYTMGTRSYAMAV 297
Query: 149 VKLLDLDSKYFSSRIIAREDFNGKDRKNPD-LVRGQERGIVILDDTESVWSDHTENLIVL 207
+++D D YFS+RI++R++ RK+ + L VI+DD VW + NL+ +
Sbjct: 298 CRIIDPDGTYFSTRILSRDESGSLTRKSLERLFPCDTSMAVIIDDRSDVWH-WSPNLVKV 356
Query: 208 GKYVYF 213
+ +F
Sbjct: 357 EPFEFF 362
>gi|170578206|ref|XP_001894313.1| NLI interacting factor-like phosphatase family protein [Brugia
malayi]
gi|158599134|gb|EDP36825.1| NLI interacting factor-like phosphatase family protein [Brugia
malayi]
Length = 576
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 70/294 (23%), Positives = 122/294 (41%), Gaps = 50/294 (17%)
Query: 2 GAYSCKECVGKTKFVIKRKCEQSL-SCAHTTVRDSRCIFCSQAMNDSFGLSFDY------ 54
G S + K + K SL +C+H V C C + + G S D
Sbjct: 53 GVVSIDTTIKKGNKLKKGMTVASLRACSHAIVIKDMCASCGKDLRGKPGTSGDLAEASTA 112
Query: 55 ----------------MLRGLRYSEQE----ERKLQLVLNLDHTLLHCRNIK-SLSSGEK 93
+ R + ++E KL L+++LD TL+H N +L +
Sbjct: 113 NVSMIHHVPELIVSDELARKIGNRDRELLLKAHKLVLLVDLDQTLIHTTNHTFNLENDTD 172
Query: 94 YLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLD 153
L ++ D K+RP FL + +SL ++++ + R YA + LD
Sbjct: 173 VLHYKLK---------GTDFYTKIRPHAHEFLRRMASLYEMHIISYGERQYAHRIAEFLD 223
Query: 154 LDSKYFSSRIIARED-FNG--KDRKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKY 210
+ YF RI++R++ F+ K R L + IV++DD VW +++ LI + Y
Sbjct: 224 PEKIYFGHRILSRDELFSAMYKTRNMQALFPCGDHMIVMIDDRPDVWQ-YSDALIQVKPY 282
Query: 211 VYFRD-KELNGDHKSYSETLTD--------ESENEEALANVLRVLKTIHRLFFD 255
+F++ ++N E + ESE++E L + VL +H F++
Sbjct: 283 RFFKEIGDINAPRNEKGEPILSGSYAEQDMESEDDETLEYIALVLTKVHSAFYE 336
>gi|312373985|gb|EFR21645.1| hypothetical protein AND_16677 [Anopheles darlingi]
Length = 857
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 54/213 (25%), Positives = 94/213 (44%), Gaps = 32/213 (15%)
Query: 27 CAHTTVRDSRCIFCSQAM-NDSFGLS-----------------FDYMLRGLRYSEQE--- 65
C HTTV C C + D G + + + + L ++ E
Sbjct: 94 CNHTTVIKDMCADCGADLRQDEPGANSSKASVPMVHSVPELKVTETLAKKLGQADTERLL 153
Query: 66 -ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTF 124
+RKL L+++LD TL+H N ++ + Q++ + +LRP F
Sbjct: 154 NDRKLVLLVDLDQTLIHTTNDNVPNNLKDVYHFQLYGPNSPWYH------TRLRPGALEF 207
Query: 125 LEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED-FNGKDRKN--PDLVR 181
L + ++++CT R YA + LD D ++FS RI++R++ FN + + L
Sbjct: 208 LAKMHPYYELHICTFGARNYAHMIAQFLDKDGRFFSHRILSRDECFNATSKTDNLKALFP 267
Query: 182 GQERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+ + I+DD E VW + NLI + Y +F+
Sbjct: 268 CGDSMVCIIDDREDVW-NMASNLIQVKPYHFFQ 299
>gi|47217775|emb|CAG05997.1| unnamed protein product [Tetraodon nigroviridis]
Length = 979
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 53/181 (29%), Positives = 92/181 (50%), Gaps = 19/181 (10%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLV---KLRPFVRT 123
+KL L+++LD TL+H + E++ + + I FQ+ + + +LRP +
Sbjct: 175 KKLVLMVDLDQTLIH--------TTEQHCHRMSNKGI-FHFQLGRGEPMLHTRLRPHCKE 225
Query: 124 FLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE---DFNGKDRKNPDLV 180
FLE+ + L ++++ T +R YA LD + K FS RI++R+ D K +L
Sbjct: 226 FLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECIDPFSKTGNLRNLF 285
Query: 181 RGQERGIVILDDTESVWSDHTENLIVLGKYVYFRDKELNGDHKSYSETLTDESENEEALA 240
+ + I+DD E VW NLI + KYVYF+ GD + + ++E + AL+
Sbjct: 286 PCGDSMVCIIDDREDVWK-FAPNLITVKKYVYFQG---TGDINAPPGSREAQTERKGALS 341
Query: 241 N 241
+
Sbjct: 342 S 342
>gi|410911388|ref|XP_003969172.1| PREDICTED: RNA polymerase II subunit A C-terminal domain
phosphatase-like [Takifugu rubripes]
Length = 905
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 47/154 (30%), Positives = 81/154 (52%), Gaps = 16/154 (10%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLV---KLRPFVRT 123
+KL L+++LD TL+H + E++ ++ + I FQ+ + + +LRP +
Sbjct: 175 KKLVLMVDLDQTLIH--------TTEQHCQRMSNKGI-LHFQLGRGEPMLHTRLRPHCKE 225
Query: 124 FLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE---DFNGKDRKNPDLV 180
FLE+ + L ++++ T +R YA LD + K FS RI++R+ D K +L
Sbjct: 226 FLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECIDPFSKTGNLRNLF 285
Query: 181 RGQERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+ + I+DD E VW NL+ + KYVYF+
Sbjct: 286 PCGDSMVCIIDDREDVWK-FAPNLVTVKKYVYFQ 318
>gi|148236996|ref|NP_001087852.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
phosphatase, subunit 1 [Xenopus laevis]
gi|51950264|gb|AAH82378.1| MGC81710 protein [Xenopus laevis]
Length = 977
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 47/151 (31%), Positives = 75/151 (49%), Gaps = 10/151 (6%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLE 126
+K+ L+++LD TL+H K + H +G M + +LRP + FLE
Sbjct: 172 KKVVLMVDLDQTLIHTTEQHCQHMSRKGI---FHFQLGRGEPMLH---TRLRPHCKEFLE 225
Query: 127 QASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE---DFNGKDRKNPDLVRGQ 183
+ + L ++++ T +R YA LD + K FS RI++R+ D K +L
Sbjct: 226 KIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECIDPYSKTGNLRNLFPCG 285
Query: 184 ERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+ + I+DD E VW NLI + KYVYF+
Sbjct: 286 DSMVCIIDDREDVWK-FAPNLITVKKYVYFQ 315
>gi|320591286|gb|EFX03725.1| RNA polymerase 2 ctd phosphatase [Grosmannia clavigera kw1407]
Length = 923
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 45/162 (27%), Positives = 81/162 (50%), Gaps = 14/162 (8%)
Query: 66 ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSL--FQMAND---------KL 114
+RKL LV++LD T++H ++ ++ + + + FQ+
Sbjct: 167 QRKLSLVVDLDQTIIHACIDPTIGEWQQDPSNPNYEALKDVRRFQLEEGFQGLARGCWYY 226
Query: 115 VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGK-- 172
+K+RP + FLE+ S++ ++++ TM TR YA +++D + K F +R+I+R D NG
Sbjct: 227 IKMRPHLTEFLEKISTMYELHVYTMGTRTYATNIAQIVDPNQKLFGNRVISR-DENGNII 285
Query: 173 DRKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+ L VI+DD VW + NLI + Y +F+
Sbjct: 286 AKSLQRLFPVSTNMAVIIDDRADVWPYNRHNLIKVNPYDFFK 327
>gi|307168754|gb|EFN61749.1| RNA polymerase II subunit A C-terminal domain phosphatase
[Camponotus floridanus]
Length = 721
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 45/152 (29%), Positives = 72/152 (47%), Gaps = 10/152 (6%)
Query: 66 ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFL 125
+RKL L+++LD T++H N + + Q++ + + RP R FL
Sbjct: 154 DRKLVLLVDLDQTIVHTTNDNIPPNLKDVFHFQLYGPNSPWYH------TRFRPNTRHFL 207
Query: 126 EQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVRGQER 185
+ S L ++++CT R YA LLD D FS RI++R++ K +L
Sbjct: 208 SEMSHLYELHICTFGARIYAHTVASLLDKDGILFSHRILSRDECFDPASKTANLKALFPC 267
Query: 186 G---IVILDDTESVWSDHTENLIVLGKYVYFR 214
G + I+DD E VW NL+ + Y +FR
Sbjct: 268 GDDLVCIIDDREDVWQG-CGNLVQVKPYHFFR 298
>gi|5326898|gb|AAD42088.1| RNA polymerase II CTD phosphatase [Homo sapiens]
Length = 961
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 49/153 (32%), Positives = 80/153 (52%), Gaps = 14/153 (9%)
Query: 67 RKLQLVLNLDHTLLHC--RNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTF 124
RKL L+++LD TL+H ++ + +S+ K H +G M + +LRP + F
Sbjct: 181 RKLVLMVDLDQTLIHTTEQHCQQMSN-----KGIFHFQLGRGEPMLH---TRLRPHCKDF 232
Query: 125 LEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE---DFNGKDRKNPDLVR 181
LE+ + L ++++ T +R YA LD + K FS RI++R+ D K +L
Sbjct: 233 LEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECIDPFSKTGNLRNLFP 292
Query: 182 GQERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+ + I+DD + VW NLI + KYVYF+
Sbjct: 293 CGDSMVCIIDDRKDVWK-FAPNLITVKKYVYFQ 324
>gi|345568228|gb|EGX51125.1| hypothetical protein AOL_s00054g501 [Arthrobotrys oligospora ATCC
24927]
Length = 854
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 56/224 (25%), Positives = 101/224 (45%), Gaps = 38/224 (16%)
Query: 26 SCAHTTVRDSRCIFCSQAMNDSFGLSFDYMLR----------GLRYSEQEE--------- 66
C H V +++C C M++ ++F + GL+ S E
Sbjct: 93 PCPHPVVWNNQCAVCGMDMSEQTYINFHNLETANINVTHDNTGLKISRGEAENIEKEAKK 152
Query: 67 -----RKLQLVLNLDHTLLHCRNIKSL-------SSGEKYLKKQIHSFIGSLFQMANDK- 113
+KL LV++LD T++ ++ S+ + K + +F L + A +
Sbjct: 153 RLLSAKKLSLVVDLDQTIIQATVDPTVGEWRDDPSNPNYHAVKDVEAF-QLLDEGAGGRG 211
Query: 114 ---LVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFN 170
VKLRP ++ FL S + + ++ TM TR YA + K++D + F RI++R++
Sbjct: 212 CWYYVKLRPGLKRFLSNISKIYECHIYTMGTRAYAMSIAKIVDPEGSIFGERILSRDESG 271
Query: 171 GKDRKNPD-LVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
K+ + L + +VI+DD VW ++NLI + Y +F
Sbjct: 272 SLTSKSLERLFPVDTKMVVIIDDRGDVWK-WSDNLIKVTPYDFF 314
>gi|358253094|dbj|GAA51983.1| RNA polymerase II subunit A C-terminal domain phosphatase
[Clonorchis sinensis]
Length = 1535
Score = 62.4 bits (150), Expect = 3e-07, Method: Composition-based stats.
Identities = 45/155 (29%), Positives = 77/155 (49%), Gaps = 17/155 (10%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLV---KLRPFVRT 123
RKL L+++LD T+LH N + Y K + S + + LV RP ++
Sbjct: 185 RKLVLLVDLDETVLHTTN-----DPQAYRYKNV-----SRYCLPGSPLVYHTSFRPHLKA 234
Query: 124 FLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVRGQ 183
L++ S +++CT R YA ++D +YFS RI++R++ K+ +L
Sbjct: 235 VLDRLSKYYQMHICTFGNRMYAHQLAGMIDPKRRYFSHRILSRDECFNPVTKSANLKALF 294
Query: 184 ERG---IVILDDTESVWSDHTENLIVLGKYVYFRD 215
RG + I+DD VW + + +LI + Y +F+D
Sbjct: 295 PRGLNLVCIIDDRGEVW-EWSPHLIQVKPYRFFQD 328
>gi|443896478|dbj|GAC73822.1| TFIIF-interacting CTD phosphatases [Pseudozyma antarctica T-34]
Length = 751
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 60/238 (25%), Positives = 100/238 (42%), Gaps = 55/238 (23%)
Query: 27 CAHTTVRDSRCIFCSQAMN----DSFGLSFDYMLRGLRYSEQE--------------ERK 68
C H C C Q ++ S LS + ++ S +E +RK
Sbjct: 8 CKHPVQLFGMCAVCGQPVDADSDQSASLSVMHSSASVKVSAEEAQRLDSESTSHLLSQRK 67
Query: 69 LQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKL-------------- 114
L L+++LD T++H ++ GE +++ + +L + +L
Sbjct: 68 LALIVDLDQTVIHATVDPTV--GE-WMRDDTNPNYDALKSVGKFRLGIDGEEIKDDDDPT 124
Query: 115 ------------------VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDS 156
VK RP V T L+Q S +++ TM TR YA KL+D D+
Sbjct: 125 APKDAAAALRASRACWYYVKPRPGVPTILKQLSQKYQLHVYTMGTRSYANCVCKLIDPDA 184
Query: 157 KYFSSRIIAREDFNGKDRKN-PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
F +RI++R++ RK+ L +VI+DD E VWS ++ NL+ + Y +F
Sbjct: 185 SIFGNRILSRDENGSLVRKSLSRLFPVDHSMVVIIDDREDVWS-NSPNLLPVLPYEFF 241
>gi|164658688|ref|XP_001730469.1| hypothetical protein MGL_2265 [Malassezia globosa CBS 7966]
gi|159104365|gb|EDP43255.1| hypothetical protein MGL_2265 [Malassezia globosa CBS 7966]
Length = 364
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 50/181 (27%), Positives = 87/181 (48%), Gaps = 33/181 (18%)
Query: 65 EERKLQLVLNLDHTLLHCR---NIKSLSSGEK-----YLKKQIHSFIGS----------- 105
E+RKL L+++LD T++H +K + K LK + +GS
Sbjct: 40 EQRKLALIVDLDQTIIHVTVDPTVKEWAHDPKNPNWCMLKDVVAFQLGSDGKTVSHQPER 99
Query: 106 -------LFQMANDK-----LVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLD 153
F D+ VKLRP ++ FL+ S + ++++ TM TR YA+ +++D
Sbjct: 100 MDQHDVKSFATDGDENGCWYYVKLRPGLQAFLQSVSPMYEMHVYTMGTRSYADCICRIVD 159
Query: 154 LDSKYFSSRIIAREDFNGKDRKN-PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVY 212
D F +RI++R++ + +K+ L +V++DD VWS + NLI + Y +
Sbjct: 160 PDGHLFGARILSRDENGNEVQKSLSRLFPISTDMVVVIDDRADVWS-WSPNLIKVEPYEF 218
Query: 213 F 213
F
Sbjct: 219 F 219
>gi|406602036|emb|CCH46356.1| RNA polymerase II subunit A C-terminal domain phosphatase
[Wickerhamomyces ciferrii]
Length = 720
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 54/241 (22%), Positives = 103/241 (42%), Gaps = 54/241 (22%)
Query: 26 SCAHTTVRDSRCIFCSQAMN-----------DSFGLSFDYMLRGLRYSEQE--------- 65
C H+ C C ++++ D +S + L+ S+ E
Sbjct: 107 PCTHSIQYGGLCALCGKSLDEETDYSGFKYEDRAPISMSHGTSDLKISKSEAQKVEQLMT 166
Query: 66 -----ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSL-----FQMANDKL- 114
E KL LV++LD T++H ++ +++ Q + SL F + + +
Sbjct: 167 KNLIKENKLILVVDLDQTVIHATVDPTIG---EWMNDQSNPNFPSLKDVQYFSLEEEPIL 223
Query: 115 -----------------VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSK 157
VK+RP + FL++ + + ++++ TM T+ YA + K++D D +
Sbjct: 224 PPGYQGPRPPTHKRWYYVKMRPGLEDFLKRIAKIYELHIYTMGTKEYARSIAKIIDPDGE 283
Query: 158 YFSSRIIAREDFNGKDRKNPD-LVRGQERGIVILDDTESV--WSDHTENLIVLGKYVYFR 214
YF RI++R++ +K+ + L +VI+DD V WSDH ++ +V
Sbjct: 284 YFGERILSRDESGSLTQKSLERLFPTDTSMVVIIDDRGDVWNWSDHLIKVVPFDFFVGIG 343
Query: 215 D 215
D
Sbjct: 344 D 344
>gi|405122085|gb|AFR96852.1| hypothetical protein CNAG_04120 [Cryptococcus neoformans var.
grubii H99]
Length = 921
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 33/89 (37%), Positives = 53/89 (59%), Gaps = 5/89 (5%)
Query: 114 LVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED---FN 170
K RP ++ FL++ S L ++++ TM TR YA+A VK++D D K F RI++R++ F+
Sbjct: 287 FTKPRPGLQKFLDEMSQLYEMHVYTMGTRTYADAIVKVIDPDGKIFGGRILSRDESGSFS 346
Query: 171 GKDRKNPDLVRGQERGIVILDDTESVWSD 199
K+ K L +V++DD VW D
Sbjct: 347 SKNLKR--LFPTDTSMVVVIDDRSDVWGD 373
>gi|58271496|ref|XP_572904.1| protein phosphatase [Cryptococcus neoformans var. neoformans JEC21]
gi|134115316|ref|XP_773956.1| hypothetical protein CNBH4080 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|50256584|gb|EAL19309.1| hypothetical protein CNBH4080 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|57229163|gb|AAW45597.1| protein phosphatase, putative [Cryptococcus neoformans var.
neoformans JEC21]
Length = 955
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 59/103 (57%), Gaps = 6/103 (5%)
Query: 114 LVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED---FN 170
K RP ++ FL++ L ++++ TM TR YA+A VK++D D K F RI++R++ F+
Sbjct: 307 FTKPRPGLQRFLDEMCQLYEMHVYTMGTRTYADAIVKVIDPDGKIFGGRILSRDESGSFS 366
Query: 171 GKDRKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
K+ K L +V++DD VW D NL+ + Y +F
Sbjct: 367 SKNLKR--LFPTDTSMVVVIDDRSDVWGD-CPNLVKVVPYDFF 406
>gi|307212079|gb|EFN87962.1| RNA polymerase II subunit A C-terminal domain phosphatase
[Harpegnathos saltator]
Length = 734
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 45/152 (29%), Positives = 73/152 (48%), Gaps = 10/152 (6%)
Query: 66 ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFL 125
+RKL L+++LD T++H N + + Q++ + +LRP R FL
Sbjct: 151 DRKLVLLVDLDQTIVHTTNDHIPPNLKDVHHFQLYGPNSPWYH------TRLRPNTRHFL 204
Query: 126 EQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVRGQER 185
+ S L ++++C+ R YA LLD D FS RI++R++ K +L
Sbjct: 205 SEMSHLYELHICSFGARIYAHTIASLLDKDGVLFSHRILSRDECFDPASKTANLKALFPC 264
Query: 186 G---IVILDDTESVWSDHTENLIVLGKYVYFR 214
G + I+DD E VW NL+ + Y +FR
Sbjct: 265 GDDLVCIIDDREDVWQ-GCGNLVQVKPYHFFR 295
>gi|384488044|gb|EIE80224.1| hypothetical protein RO3G_04929 [Rhizopus delemar RA 99-880]
Length = 433
Score = 62.0 bits (149), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 37/107 (34%), Positives = 62/107 (57%), Gaps = 6/107 (5%)
Query: 65 EERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTF 124
E RKL L+L+LD T++H +S + ++I F +L + +KLRP +R F
Sbjct: 28 ESRKLSLILDLDQTIVHASCDPRISH---WKNEEIRQF--TLPKSPTMYYIKLRPGLREF 82
Query: 125 LEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNG 171
L++ +L D+++ TM T+ YA+A + +D + F RI++R D NG
Sbjct: 83 LKEIENLYDLHIYTMGTKDYAKAVAREMDPEGSLFKERILSR-DENG 128
>gi|118369793|ref|XP_001018099.1| NLI interacting factor-like phosphatase family protein [Tetrahymena
thermophila]
gi|89299866|gb|EAR97854.1| NLI interacting factor-like phosphatase family protein [Tetrahymena
thermophila SB210]
Length = 874
Score = 62.0 bits (149), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 51/218 (23%), Positives = 100/218 (45%), Gaps = 36/218 (16%)
Query: 27 CAHTTV-RDSRCIFCSQAMNDSFGLSF-------DYMLRGLRYSE----------QEERK 68
C+H + +++ C++C Q + + +L G Y+E +K
Sbjct: 222 CSHQKIDQNNSCVYCYQDLPKHTNKVYAGLDQKDKSVLIGKEYAEYSKKLAHQQLHSNQK 281
Query: 69 LQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDK----LVKLRPFVRTF 124
L LVL+LD+T+LH ++ + + L F+ +++ ++K RP+++ F
Sbjct: 282 LILVLDLDNTILH-----AVPAIKNALFDNADGIQQDSFKEFHNRYSKYVIKFRPYMKEF 336
Query: 125 LEQASSLVDIYLCTMSTRCYAEAAVKLL---------DLDSKYFSSRIIAREDFNGKDRK 175
L+ +IY+ TM+ YA+ L D + RII+RE F+ ++
Sbjct: 337 LQTVLPHYEIYIFTMAMLDYAKCVCDYLKQTYKDILDDYPMTFNYDRIISREQFSSNNKD 396
Query: 176 NPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
++ E+ ++ILDD + VW+ + NL+ Y+Y+
Sbjct: 397 LQQILPNSEKIMLILDDRDDVWAKNKMNLVTTLPYIYW 434
>gi|401408967|ref|XP_003883932.1| hypothetical protein NCLIV_036820 [Neospora caninum Liverpool]
gi|325118349|emb|CBZ53900.1| hypothetical protein NCLIV_036820 [Neospora caninum Liverpool]
Length = 1149
Score = 62.0 bits (149), Expect = 3e-07, Method: Composition-based stats.
Identities = 37/114 (32%), Positives = 64/114 (56%), Gaps = 8/114 (7%)
Query: 115 VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED-FNGKD 173
+KLRP++RTFL++ ++ + T +T+ YA+ + +LD + + F RI+AR+ F G+
Sbjct: 691 MKLRPYLRTFLKKLEPFYEMSVYTNATQEYADIVIAILDDNRQLFQDRIVARDSGFRGEA 750
Query: 174 RKNPDLVRGQE----RGIVILDDTESVWSDHTENLIVLGKYVYFRDK---ELNG 220
+N + R E R IV DD +++W+D +V ++ F D ELN
Sbjct: 751 SENKAVRRLYEGMDKRCIVAFDDRQNIWTDLPLTHVVKAQHYDFFDSHKAELNA 804
>gi|317027693|ref|XP_001399857.2| RNA polymerase II subunit A C-terminal domain phosphatase
[Aspergillus niger CBS 513.88]
Length = 800
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 57/214 (26%), Positives = 90/214 (42%), Gaps = 41/214 (19%)
Query: 26 SCAHTTVRDSRCIFCSQAMND-SFGLSFDYMLRG----------LRYSEQE--------- 65
CAH C C + M D S+ + R L SEQE
Sbjct: 92 PCAHEVQFGGLCAICGKDMTDFSYNTEVTDVHRAPIQMAHDNTTLTVSEQEATRVEEDAK 151
Query: 66 -----ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPF 120
RKL LV++LD T++H ++ GE K+ ++ S +
Sbjct: 152 RRLLANRKLSLVVDLDQTIIHATVDPTV--GEWMEDKENPNYQAS------------ERW 197
Query: 121 VRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKN-PDL 179
+ +FL+ S + ++++ TM TR YA+ ++D D K F RI++R++ KN L
Sbjct: 198 LESFLQNVSEMYELHIYTMGTRSYAQHIASIIDPDRKLFGDRILSRDESGSLVAKNLHRL 257
Query: 180 VRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
+ +VI+DD VW NLI + Y +F
Sbjct: 258 FPVDTKMVVIIDDRGDVWR-WNPNLIKVSPYDFF 290
>gi|388853856|emb|CCF52577.1| related to FCP1-TFIIF interacting component of CTD phosphatase
[Ustilago hordei]
Length = 471
Score = 61.6 bits (148), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 52/181 (28%), Positives = 85/181 (46%), Gaps = 37/181 (20%)
Query: 66 ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKL----------- 114
+RKL LV++LD T++H ++ GE +++ + + +L +A +L
Sbjct: 28 QRKLALVVDLDQTIIHTAVDPTV--GE-WMEDESNPNYEALKSVAKFRLGIGGEEIKDDD 84
Query: 115 ---------------------VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLD 153
VKLRP V L++ S +++ TM TR YA KL+D
Sbjct: 85 DPPAPKDSAAALKASRACWYYVKLRPGVPEILKKLSEKYQLHVYTMGTRSYANLVCKLID 144
Query: 154 LDSKYFSSRIIAREDFNGKDRKNPD-LVRGQERGIVILDDTESVWSDHTENLIVLGKYVY 212
D+ F +RI++R + RK+ D L +VI+DD E VWS + NL+ + Y +
Sbjct: 145 PDASIFGNRIVSRNENGSLVRKSLDKLFPMDHSMVVIIDDREDVWS-KSPNLLQVVPYEF 203
Query: 213 F 213
F
Sbjct: 204 F 204
>gi|145544070|ref|XP_001457720.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124425538|emb|CAK90323.1| unnamed protein product [Paramecium tetraurelia]
Length = 659
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 66/271 (24%), Positives = 117/271 (43%), Gaps = 54/271 (19%)
Query: 10 VGKTKFVIKRK-----CEQSLSCAHTTVRDSRCIFCSQ-AMNDSFGLSFDY-------ML 56
+ KTK ++ R + S +C H + ++ C+ C++ + + L +Y +
Sbjct: 176 LAKTKIILPRNYALMVIDSSQTCNHLKIENNYCLICNEKVIRNVESLDLNYSDDISKKIS 235
Query: 57 RGLRYSEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIH--------SFIG---- 104
+ + ++RKL +VL+LD T+LH + + + ++ +KQ F G
Sbjct: 236 KEIVLDILKKRKLIMVLDLDQTILHAIKVSTTFNKYEFCEKQNKMIQADSEAQFNGFQQL 295
Query: 105 ------SLFQMANDK----LVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDL 154
L M D+ ++KLRP+ F L DI++ T +++ YA+ + +
Sbjct: 296 GFNIKEHLLDMTCDQQSKFIIKLRPYFEQFFLTLIPLFDIFIYTKASKSYADFILSFITH 355
Query: 155 DSKYF---------SSRIIAREDFNGKDRKNPDLVRGQERGI-----VILDDTESVWSDH 200
F R+++RED + K+ L R GI VILDD +W+
Sbjct: 356 RLNEFIPEHKPFFPPQRVLSREDTICSNSKS--LNRLFYPGIATNLLVILDDNAGMWNQF 413
Query: 201 TENLIVLGKYVYFRDKELNGDHKSYSETLTD 231
ENLI +VYF + +G K +TD
Sbjct: 414 KENLIHTKPFVYFNE---HGSTKDGQGIVTD 441
>gi|255712225|ref|XP_002552395.1| KLTH0C03894p [Lachancea thermotolerans]
gi|238933774|emb|CAR21957.1| KLTH0C03894p [Lachancea thermotolerans CBS 6340]
Length = 745
Score = 60.8 bits (146), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 48/173 (27%), Positives = 84/173 (48%), Gaps = 26/173 (15%)
Query: 64 QEERKLQLVLNLDHTLLHC---------------------RNIKSLSSGEKYLKKQIHSF 102
+E +KL LV++LD T++HC +N+K+ S E + +
Sbjct: 161 REHKKLVLVVDLDQTVIHCGVDPTIHEWANDPSNPNYDALKNVKTFSLDEDPILPPF--Y 218
Query: 103 IGSLFQMAN-DKLVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSS 161
+G VKLRP ++ F ++ + ++++ TM+TR YA K++D + F
Sbjct: 219 MGPRPPPRKCQYYVKLRPGLQEFFDKIAPHFELHIYTMATRAYALEIAKIIDPKGELFGD 278
Query: 162 RIIAREDFNGKDRKNPD-LVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
RI++R++ K+ + L + +VI+DD VWS ENLI + Y +F
Sbjct: 279 RILSRDENGSLTHKSLERLFPMDQSMVVIIDDRGDVWS-WCENLIKVVPYNFF 330
>gi|323453463|gb|EGB09334.1| putative formate/nitrite transporter [Aureococcus anophagefferens]
Length = 1144
Score = 60.8 bits (146), Expect = 9e-07, Method: Composition-based stats.
Identities = 46/156 (29%), Positives = 76/156 (48%), Gaps = 12/156 (7%)
Query: 66 ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFL 125
+R+LQLVL+LDHTLL C ++ ++ + +G++ V+LRP + F
Sbjct: 346 KRQLQLVLDLDHTLLECSTDPRAAALAAAPGSRVRA-LGAV--AGRPHWVRLRPRLEEFF 402
Query: 126 EQASSLVDIYLCTMSTRCYAEAAVKLLDLDSK--YFSSRIIARE---DFNGKDRKNPDLV 180
+ L ++ + T +R YAEA L+ + F R+++R+ D G+
Sbjct: 403 AAVAPLYELAIYTHGSRQYAEAVRAALEAEVPGLSFGGRVVSRDCCPDLRGEKSLERLFP 462
Query: 181 RGQERGIVILDDTESVWS---DHTENLIVLGKYVYF 213
G R + ILDD VW+ D T ++V+ Y YF
Sbjct: 463 GGAARAL-ILDDRLDVWTRGEDQTPRVLVVQPYTYF 497
>gi|391332118|ref|XP_003740485.1| PREDICTED: RNA polymerase II subunit A C-terminal domain
phosphatase-like [Metaseiulus occidentalis]
Length = 646
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 38/120 (31%), Positives = 59/120 (49%), Gaps = 7/120 (5%)
Query: 115 VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDR 174
++RP FL + S L ++++ T R YA V LLD KYF RI+ R++
Sbjct: 182 TRIRPGTEDFLRKISQLFELHIVTFGARPYANHIVSLLDPGKKYFQYRILTRDECFHPQS 241
Query: 175 KNPD---LVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRDKELNGDHKSYSETLTD 231
K + L ++ + I+DD E VW + NL+ + YV+FR GD + + L D
Sbjct: 242 KTANLKSLFPCGDQMVCIIDDREDVW-NFASNLVAVKPYVFFRGA---GDINAPAGLLAD 297
>gi|213403530|ref|XP_002172537.1| RNA polymerase II subunit A C-terminal domain phosphatase
[Schizosaccharomyces japonicus yFS275]
gi|212000584|gb|EEB06244.1| RNA polymerase II subunit A C-terminal domain phosphatase
[Schizosaccharomyces japonicus yFS275]
Length = 723
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 49/225 (21%), Positives = 95/225 (42%), Gaps = 45/225 (20%)
Query: 26 SCAHTTVRDSRCIFCSQAMNDSFGLSFDYMLR----------GLRYSE----------QE 65
C+H C C Q + + + F + R GL + Q+
Sbjct: 98 PCSHEVHYGGLCAICGQNITNQDYMGFSDLSRATINMTHGSGGLTEARRLETETAIRLQK 157
Query: 66 ERKLQLVLNLDHTLLHCRNIKSLSSGEK----------------YLKKQIHSFIGSLFQM 109
+++L L+++LD T++H ++ K YL++ + +
Sbjct: 158 QKRLSLIVDLDQTIIHATVDPTVGEWMKDPNNVNYKVLRDVHYFYLREGTSGYTSCYY-- 215
Query: 110 ANDKLVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDF 169
+K RP ++ FL S L ++++ TM T+ YA K++D D + F R+++R+D
Sbjct: 216 -----IKPRPGLQEFLHNVSKLYELHIYTMGTKAYATEVAKVIDPDGELFQDRVLSRDDS 270
Query: 170 NGKDRKN-PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
+K+ L +V++DD VW + + NLI + + +F
Sbjct: 271 GNLTQKSIRRLFPCDTSMVVVIDDRGDVW-NWSSNLIKVYPFEFF 314
>gi|134056779|emb|CAK37687.1| unnamed protein product [Aspergillus niger]
Length = 788
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 35/100 (35%), Positives = 55/100 (55%), Gaps = 2/100 (2%)
Query: 115 VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDR 174
VKLRP + +FL+ S + ++++ TM TR YA+ ++D D K F RI++R++
Sbjct: 180 VKLRPGLESFLQNVSEMYELHIYTMGTRSYAQHIASIIDPDRKLFGDRILSRDESGSLVA 239
Query: 175 KN-PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
KN L + +VI+DD VW NLI + Y +F
Sbjct: 240 KNLHRLFPVDTKMVVIIDDRGDVWR-WNPNLIKVSPYDFF 278
>gi|221488107|gb|EEE26321.1| RNA polymerase II phosphatase, putative [Toxoplasma gondii GT1]
gi|221508626|gb|EEE34195.1| RNA polymerase II phosphatase, putative [Toxoplasma gondii VEG]
Length = 1139
Score = 60.5 bits (145), Expect = 1e-06, Method: Composition-based stats.
Identities = 37/114 (32%), Positives = 63/114 (55%), Gaps = 8/114 (7%)
Query: 115 VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED-FNGKD 173
+KLRP +RTFL++ ++ + T +T+ YA+ + +LD + + F RI+AR+ F G+
Sbjct: 681 MKLRPHLRTFLKKLEPFYEMSVYTNATQEYADIVIAILDGNRQLFQDRIVARDSGFRGEA 740
Query: 174 RKNPDLVRGQE----RGIVILDDTESVWSDHTENLIVLGKYVYFRDK---ELNG 220
+N + R E R IV DD +++W+D +V ++ F D ELN
Sbjct: 741 SENKAVRRLYEGMDKRCIVAFDDRQNIWTDLPLTHVVKAQHYDFFDSHKTELNA 794
>gi|237832707|ref|XP_002365651.1| NLI interacting factor-like phosphatase domain-containing protein
[Toxoplasma gondii ME49]
gi|211963315|gb|EEA98510.1| NLI interacting factor-like phosphatase domain-containing protein
[Toxoplasma gondii ME49]
Length = 1139
Score = 60.5 bits (145), Expect = 1e-06, Method: Composition-based stats.
Identities = 37/114 (32%), Positives = 63/114 (55%), Gaps = 8/114 (7%)
Query: 115 VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED-FNGKD 173
+KLRP +RTFL++ ++ + T +T+ YA+ + +LD + + F RI+AR+ F G+
Sbjct: 681 MKLRPHLRTFLKKLEPFYEMSVYTNATQEYADIVIAILDGNRQLFQDRIVARDSGFRGEA 740
Query: 174 RKNPDLVRGQE----RGIVILDDTESVWSDHTENLIVLGKYVYFRDK---ELNG 220
+N + R E R IV DD +++W+D +V ++ F D ELN
Sbjct: 741 SENKAVRRLYEGMDKRCIVAFDDRQNIWTDLPLTHVVKAQHYDFFDSHKTELNA 794
>gi|145536530|ref|XP_001453987.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124421731|emb|CAK86590.1| unnamed protein product [Paramecium tetraurelia]
Length = 659
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 68/255 (26%), Positives = 113/255 (44%), Gaps = 55/255 (21%)
Query: 10 VGKTKFVIKRK-----CEQSLSCAHTTVRDSRCIFCSQAM---NDSFGLSFD-----YML 56
+ KTK ++ R + + +C H + + C+ C++ + +S L++ +
Sbjct: 176 LAKTKTILSRNDVLLVIDIAQTCNHLKIEKNYCVICNEKVIRYEESLDLNYSDDISKKIS 235
Query: 57 RGLRYSEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKY-----LKKQIHS-----FIG-- 104
+ + ++RKL +VL+LD T+LH IK +S KY K + S F G
Sbjct: 236 KEIVLDILKKRKLIMVLDLDQTILHA--IKVTNSFNKYDFCEKQNKMLQSDSDGQFNGFN 293
Query: 105 --------SLFQMANDK----LVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLL 152
+MA D ++KLRP+ F L DI++ T ++R YAE + +
Sbjct: 294 QLGFNIKEHFLEMACDSQCKFIIKLRPYFEQFFLTLIPLFDIFIYTKASRSYAEFILNFI 353
Query: 153 D-------LDSKYF--SSRIIAREDFNGKDRKNPDLVRGQERGI-----VILDDTESVWS 198
+ K F R+++R+D + K+ L R GI VILDD +W+
Sbjct: 354 SKRLNEVIPEHKPFFPPQRVLSRDDTICSNSKS--LNRLFYPGIATNLLVILDDNAGMWN 411
Query: 199 DHTENLIVLGKYVYF 213
ENLI +VYF
Sbjct: 412 QFKENLIHTKPFVYF 426
>gi|159483481|ref|XP_001699789.1| hypothetical protein CHLREDRAFT_141879 [Chlamydomonas reinhardtii]
gi|158281731|gb|EDP07485.1| predicted protein [Chlamydomonas reinhardtii]
Length = 375
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 43/141 (30%), Positives = 73/141 (51%), Gaps = 10/141 (7%)
Query: 74 NLDHTLLHCRNIKSLSSG-----EKYLKKQIHSFIGS---LFQMANDKL-VKLRPFVRTF 124
+LDHTLL+ ++ + + +++ + +G L +A+ KL KLRP V F
Sbjct: 133 DLDHTLLNSVHMNEVGEDVAPRLAELQRREQEANLGPRRLLHCLADKKLWTKLRPGVFEF 192
Query: 125 LEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVRGQE 184
LE ++++ TM + YA +LLD + FSS +IA++ K+ D++ +
Sbjct: 193 LEGLRDAYEMHIYTMGDKTYAAEVRRLLDPTGRLFSS-VIAKDHSTTATAKHLDVLLSAD 251
Query: 185 RGIVILDDTESVWSDHTENLI 205
++LDDTE VW H NL+
Sbjct: 252 ELALVLDDTEVVWPGHRRNLL 272
>gi|328772741|gb|EGF82779.1| hypothetical protein BATDEDRAFT_22917 [Batrachochytrium
dendrobatidis JAM81]
Length = 868
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 35/104 (33%), Positives = 57/104 (54%), Gaps = 8/104 (7%)
Query: 65 EERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTF 124
+ERKL LVL+LD T++H ++ GE +F ++ P R F
Sbjct: 165 DERKLSLVLDLDQTVIHATVDPTV--GEWMADPNNPNFPALTVWATHE------PGTREF 216
Query: 125 LEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED 168
L + ++ ++++ TM TR YA+A K+LD D +YF RI++R+D
Sbjct: 217 LRELNAKYEMHIYTMGTRNYAKAVSKILDPDKRYFKDRILSRDD 260
>gi|156083399|ref|XP_001609183.1| hypothetical protein [Babesia bovis T2Bo]
gi|154796434|gb|EDO05615.1| hypothetical protein BBOV_IV000150 [Babesia bovis]
Length = 692
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 53/176 (30%), Positives = 87/176 (49%), Gaps = 19/176 (10%)
Query: 115 VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKD- 173
+KLRP +R FL+ S ++ + T +T+ YA+ V +LD D F RI+AR +D
Sbjct: 314 MKLRPGLRGFLQVLSLYYEMSIYTNATKEYADVVVSILDPDRSLFMDRIVARTSAGERDL 373
Query: 174 -----RKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRD------KELNGDH 222
R P+L R +V DD VW+D N +V ++ F D +L G
Sbjct: 374 QKTAARLYPNL---DPRFVVAFDDRADVWADVPHNQVVKAEHYDFFDSHIAELSDLYGIV 430
Query: 223 KSYSE-TLTDESENEEALANVLRVLKTIHRLFF-DSVCGDVRTYLPKVRSEFSRDV 276
S +E TL +S+ L ++++V +H+ FF D +V T + +++S +D
Sbjct: 431 NSSTENTLYIDSDRH--LDHMVKVFLELHKRFFNDPFKSNVGTLVQEIQSNVLKDT 484
>gi|323508124|emb|CBQ67995.1| related to FCP1-TFIIF interacting component of CTD phosphatase
[Sporisorium reilianum SRZ2]
Length = 773
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 59/238 (24%), Positives = 100/238 (42%), Gaps = 55/238 (23%)
Query: 27 CAHTTVRDSRCIFCSQAMN----DSFGLSFDYMLRGLRYSEQE--------------ERK 68
C H C C Q ++ +S LS + ++ S +E +RK
Sbjct: 8 CKHPVQLFGMCAVCGQPVDADSEESASLSVMHSSAAVKVSAEEAQRLDSESTSHLLSQRK 67
Query: 69 LQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKL-------------- 114
L L+++LD T++H ++ GE +++ + + +L + +L
Sbjct: 68 LALIVDLDQTVIHATVDPTV--GE-WMRDESNPNYDALQSVGKFRLGIDGEEIKDDDDES 124
Query: 115 ------------------VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDS 156
VK RP V L+Q S +++ TM TR YA KL+D D+
Sbjct: 125 APRDSAAALRASRACWYYVKPRPGVPKVLKQLSEKYQLHVYTMGTRSYANCVCKLIDPDA 184
Query: 157 KYFSSRIIAREDFNGKDRKN-PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
F +RI++R++ RK+ L +VI+DD E VWS + NL+ + Y +F
Sbjct: 185 SIFGNRILSRDENGSLVRKSLSRLFPVDHSMVVIIDDREDVWS-RSPNLLPVLPYEFF 241
>gi|449018404|dbj|BAM81806.1| similar to TFIIF interacting component of CTD phosphatase Fcp1p
[Cyanidioschyzon merolae strain 10D]
Length = 1640
Score = 59.3 bits (142), Expect = 2e-06, Method: Composition-based stats.
Identities = 41/144 (28%), Positives = 73/144 (50%), Gaps = 16/144 (11%)
Query: 110 ANDKL--VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE 167
AN L +KLRP + FL + ++++ TM +R YA+ ++D D + F RI +R+
Sbjct: 516 ANTSLYYIKLRPGLHEFLRTIADRFELHIYTMGSRPYADTVASIIDSDERLFQGRITSRD 575
Query: 168 DF-NGK-DRKN-PDLVRGQERGIVILDDTESVW--------SDHTENLIVLGKYVYFRDK 216
DF +G+ ++KN + + ++++DD E VW H NLI Y +FR
Sbjct: 576 DFEDGRLNQKNLKHVFPCDDSMVLVVDDREDVWVAQDQSLHGRHFPNLIRARPYYFFRGL 635
Query: 217 E---LNGDHKSYSETLTDESENEE 237
E H + ++ LT+ ++ +
Sbjct: 636 EETFQREQHTATTDILTNTHDHSD 659
>gi|388858248|emb|CCF48177.1| related to FCP1-TFIIF interacting component of CTD phosphatase
[Ustilago hordei]
Length = 774
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 58/238 (24%), Positives = 101/238 (42%), Gaps = 55/238 (23%)
Query: 27 CAHTTVRDSRCIFCSQAMN----DSFGLSFDYMLRGLRYSEQE--------------ERK 68
C H C C Q ++ +S LS + ++ S +E +RK
Sbjct: 9 CKHPVQLFGMCALCGQPVDTESEESASLSVMHSHAAVKVSAEEAQRLDSETTSHLLSQRK 68
Query: 69 LQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKL-------------- 114
L L+++LD T++H ++ GE ++K + + +L + +L
Sbjct: 69 LALIVDLDQTVIHATVDPTV--GE-WMKDESNPNYEALKSVGKFRLGIDGEEIKDDDDDS 125
Query: 115 ------------------VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDS 156
VK RP V +++ S +++ TM TR YA KL+D D+
Sbjct: 126 APKDSAAALKASRACWYYVKPRPGVPEIVKKLSEKYQLHVYTMGTRSYANCVCKLIDPDA 185
Query: 157 KYFSSRIIAREDFNGKDRKNPD-LVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
F +RI++R++ RK+ + L +VI+DD E VWS + NL+ + Y +F
Sbjct: 186 SIFGNRILSRDENGSLVRKSLNRLFPVDHSMVVIIDDREDVWS-RSPNLLPVVPYEFF 242
>gi|300176006|emb|CBK22223.2| unnamed protein product [Blastocystis hominis]
Length = 680
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 43/160 (26%), Positives = 75/160 (46%), Gaps = 13/160 (8%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLE 126
R+L LV +LD+TL+ + S + IH + + LRP V++ L
Sbjct: 19 RRLGLVFDLDNTLMEQSDDPRCSVAPSFGIPNIHFIQFKRNNQLSKHTIILRPEVQSILT 78
Query: 127 QASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKN------PD-- 178
+ S ++ + T R YA+A ++ +D + F SR+IAR+D N P
Sbjct: 79 ELSKYYELSIYTNGVRTYAQAIIESIDPKHQLFGSRVIARDDVPDNSETNFFNNFLPASK 138
Query: 179 ----LVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
++ G ER V++DD+ VW D ++ + K+ ++R
Sbjct: 139 DISFVLPGLERLGVVVDDSVEVWKDRA-IVLHIPKFCFWR 177
>gi|19115680|ref|NP_594768.1| CTD phosphatase Fcp1 [Schizosaccharomyces pombe 972h-]
gi|26393804|sp|Q9P376.1|FCP1_SCHPO RecName: Full=RNA polymerase II subunit A C-terminal domain
phosphatase; AltName: Full=CTD phosphatase fcp1
gi|9588462|emb|CAC00553.1| CTD phosphatase Fcp1 [Schizosaccharomyces pombe]
Length = 723
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 42/172 (24%), Positives = 82/172 (47%), Gaps = 35/172 (20%)
Query: 64 QEERKLQLVLNLDHTLLHC---------------------RNIKSLSSGEKYLKKQIHSF 102
++E++L L+++LD T++H R+++S + L++ +
Sbjct: 160 RQEKRLSLIVDLDQTIIHATVDPTVGEWMSDPGNVNYDVLRDVRSFN-----LQEGPSGY 214
Query: 103 IGSLFQMANDKLVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSR 162
+ +K RP + FL++ S L ++++ TM T+ YA+ K++D K F R
Sbjct: 215 TSCYY-------IKFRPGLAQFLQKISELYELHIYTMGTKAYAKEVAKIIDPTGKLFQDR 267
Query: 163 IIAREDFNGKDRKN-PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
+++R+D +K+ L +V++DD VW D NLI + Y +F
Sbjct: 268 VLSRDDSGSLAQKSLRRLFPCDTSMVVVIDDRGDVW-DWNPNLIKVVPYEFF 318
>gi|390333352|ref|XP_791406.3| PREDICTED: RNA polymerase II subunit A C-terminal domain
phosphatase-like [Strongylocentrotus purpuratus]
Length = 673
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 48/163 (29%), Positives = 79/163 (48%), Gaps = 22/163 (13%)
Query: 67 RKLQLVLNLDHTLLHCR--NIKSLSSGEKYLKKQIHSFI---GSLFQMANDKLVKLRPFV 121
RKL L+++LD TL+H + + G +H F G +F + ++R
Sbjct: 30 RKLVLLVDLDQTLIHTTLDEVPADMPG-------VHHFQLRKGPMFPWYH---TRIRDNY 79
Query: 122 RTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVR 181
+ FL+ S +++ TM R YA +++D + K+FS RI++R++ K +L
Sbjct: 80 QQFLDLISQFYQLHIFTMGVRLYAHTVAEIIDPEGKFFSHRILSRDECVDPHSKKANLRS 139
Query: 182 GQERG---IVILDDTESVWSDHTENLIVLGKYVYFRDKELNGD 221
RG + I+DD + VW + NLI + Y YF E GD
Sbjct: 140 IFPRGDKMVCIIDDRDDVW-NFAPNLIQVPPYRYF---EGTGD 178
>gi|215794709|pdb|3EF0|A Chain A, The Structure Of Fcp1, An Essential Rna Polymerase Ii Ctd
Phosphatase
Length = 372
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 42/161 (26%), Positives = 79/161 (49%), Gaps = 13/161 (8%)
Query: 64 QEERKLQLVLNLDHTLLHCR---NIKSLSSGEKYLKKQIHSFIGSLFQMANDK------- 113
++E++L L+++LD T++H + S + + + S F +
Sbjct: 14 RQEKRLSLIVDLDQTIIHATVDPTVGEWMSDPGNVNYDVLRDVRS-FNLQEGPSGYTSCY 72
Query: 114 LVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKD 173
+K RP + FL++ S L ++++ TM T+ YA+ K++D K F R+++R+D
Sbjct: 73 YIKFRPGLAQFLQKISELYELHIYTMGTKAYAKEVAKIIDPTGKLFQDRVLSRDDSGSLA 132
Query: 174 RKN-PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
+K+ L +V++DD VW D NLI + Y +F
Sbjct: 133 QKSLRRLFPCDTSMVVVIDDRGDVW-DWNPNLIKVVPYEFF 172
>gi|440804367|gb|ELR25244.1| FCP1like phosphatase, phosphatase subfamily protein [Acanthamoeba
castellanii str. Neff]
Length = 930
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 57/237 (24%), Positives = 98/237 (41%), Gaps = 46/237 (19%)
Query: 27 CAHTTVRDSRCIFCSQAMNDSFGLSFDYML---------RGLRYSEQEE--------RKL 69
CAH V C C + +N S + ++ R + + E +KL
Sbjct: 91 CAHEMVFADLCAICGKTINSSDKQATISLIPSQPALTVSRAVAERDAERTAERLTAAKKL 150
Query: 70 QLVLNLDHTLLH------------------------CRNIKSLSSGEKYLKKQIHSFIGS 105
LVL+LD TL+H C + E ++ F +
Sbjct: 151 SLVLDLDQTLVHATQDAEVETLFGTDAAEAKGGSITCALPNPPAGPEDVPAAHLYRF--T 208
Query: 106 LFQMANDKLVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIA 165
L + +KLRP + FL L ++++ TM +R YA +++D + K F I++
Sbjct: 209 LEGNPHKFYLKLRPHLEEFLMGVKDLFELHIYTMGSRSYARKVAQIIDPEQKLFRENIVS 268
Query: 166 RED-FNGKDRKNPDLVRGQERGIV-ILDDTESVWSDHTENLIVLGKYVYFRDKELNG 220
R++ N + KN + + +V I+DD VW ++NLI + Y +F D ++N
Sbjct: 269 RDECGNVMNLKNLQRIFPVDDSMVMIIDDRVDVWGT-SKNLIKIEPYYFFNDAKVNA 324
>gi|297843870|ref|XP_002889816.1| hypothetical protein ARALYDRAFT_888325 [Arabidopsis lyrata subsp.
lyrata]
gi|297335658|gb|EFH66075.1| hypothetical protein ARALYDRAFT_888325 [Arabidopsis lyrata subsp.
lyrata]
Length = 100
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 40/109 (36%), Positives = 60/109 (55%), Gaps = 11/109 (10%)
Query: 146 EAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVRGQERGIVILDDTESVWSDHTENLI 205
E +KLLD KYFS RII+R+D + +K+ D V G E ++ +D+++ VW
Sbjct: 3 ERWLKLLDPKGKYFSDRIISRDDGTVRHKKSLD-VMGNEEAVLFVDESKIVWQKK----- 56
Query: 206 VLGKYVYFRDKELNGDHKSYSETLTDESENEEALANVLRVLKTIHRLFF 254
G++ K+ D S+ L DESE++ AL+ VL VLK H + F
Sbjct: 57 -YGEFFASSCKQFKED----SKLLPDESESDGALSTVLNVLKQTHGILF 100
>gi|291234950|ref|XP_002737409.1| PREDICTED: RNA polymerase II ctd phosphatase, putative-like
[Saccoglossus kowalevskii]
Length = 896
Score = 58.5 bits (140), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 47/163 (28%), Positives = 78/163 (47%), Gaps = 22/163 (13%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKL-----VKLRPFV 121
RKL +++LD T++H ++ + + LK H FQ+ + ++RP
Sbjct: 178 RKLVCIVDLDQTIIHT----TMDNVPENLKDVYH------FQLWSGPQYPWFHTRIRPKC 227
Query: 122 RTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE---DFNGKDRKNPD 178
+ FLE+ S L ++++ T R YA +D D K FS RI++R+ D + K
Sbjct: 228 KEFLEKISKLYELHIFTFGARLYAHMIAGFIDPDKKLFSHRIVSRDECFDASSKTANLQA 287
Query: 179 LVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRDKELNGD 221
+ + + I+DD E VW + N+I + Y YF E GD
Sbjct: 288 IFPCGDNMVCIIDDREDVW-NFAPNMIHVKPYHYF---EGTGD 326
>gi|84994102|ref|XP_951773.1| CTD-like phosphatase [Theileria annulata strain Ankara]
gi|65301934|emb|CAI74041.1| CTD-like phosphatase, putative [Theileria annulata]
Length = 767
Score = 58.2 bits (139), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 42/143 (29%), Positives = 67/143 (46%), Gaps = 12/143 (8%)
Query: 115 VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKD- 173
+KLRP +R FL+ S ++ + T +T+ YA+ + +LD D F RI+AR + KD
Sbjct: 345 MKLRPCIREFLQILSLYYEMSIYTNATKEYADVVISILDPDRSLFMDRIVARNSVDEKDL 404
Query: 174 -----RKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRDK---ELNGDHKSY 225
R PDL R I+ DD VWSD +V ++ F + ELN ++ S
Sbjct: 405 LKSASRLYPDL---DTRFILAFDDRRDVWSDIPHKQVVRAEHYDFFESYITELNNNYSSS 461
Query: 226 SETLTDESENEEALANVLRVLKT 248
++ + + + V T
Sbjct: 462 PSPPNKQTPESNSFNSTINVSST 484
>gi|71004098|ref|XP_756715.1| hypothetical protein UM00568.1 [Ustilago maydis 521]
gi|46095984|gb|EAK81217.1| hypothetical protein UM00568.1 [Ustilago maydis 521]
Length = 779
Score = 58.2 bits (139), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 62/239 (25%), Positives = 100/239 (41%), Gaps = 57/239 (23%)
Query: 27 CAHTTVRDSRCIFCSQAMN----DSFGLSFDYMLRGLRYSEQE--------------ERK 68
C H C C Q ++ +S LS + ++ S +E +RK
Sbjct: 8 CKHPVQLFGMCAVCGQPVDADSEESASLSVMHSSSAVKVSAEEAQRLDSETTSHLLSQRK 67
Query: 69 LQLVLNLDHTLLHCR---------------NIKSLSS---------GEKYLKKQIHSFIG 104
L L+++LD T++H N ++L S GE+ ++ G
Sbjct: 68 LALIVDLDQTVIHATVDPTVGEWMRDESNPNYEALQSVGKFRLGIDGEEIKDEED----G 123
Query: 105 SLFQMANDKL---------VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLD 155
S + L VK RP V L+ S ++++ TM TR YA KL+D D
Sbjct: 124 SEPKDPAAALKASRACWYYVKPRPGVPQVLKHLSEKYELHVYTMGTRSYANCVCKLIDPD 183
Query: 156 SKYFSSRIIAREDFNGKDRKN-PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
+ F +RI++R++ RK+ L +VI+DD E VWS + NL+ + Y +F
Sbjct: 184 ASIFGNRILSRDENGSLVRKSLSRLFPVDHSMVVIIDDREDVWS-RSPNLLPVLPYEFF 241
>gi|330796177|ref|XP_003286145.1| hypothetical protein DICPUDRAFT_87022 [Dictyostelium purpureum]
gi|325083890|gb|EGC37331.1| hypothetical protein DICPUDRAFT_87022 [Dictyostelium purpureum]
Length = 793
Score = 58.2 bits (139), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 49/170 (28%), Positives = 75/170 (44%), Gaps = 29/170 (17%)
Query: 68 KLQLVLNLDHTLLHCRNIKSLSSGEKYL--KKQIHSFIGSLFQMANDKL-VKLRPFVRTF 124
K+ L++++DHTL+H +GE Y K +H F N+ VK RP F
Sbjct: 416 KMHLIVDIDHTLIHST---KDPNGESYFLKDKTVHKI---SFPETNETFYVKERPNAIEF 469
Query: 125 LEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI------------IAREDFNGK 172
L S IY+ + + Y E +LD S FS I I RE+ N +
Sbjct: 470 LRTLSQQFYIYVYSFHPKYYVERVASILDPHSNIFSKVISKEIIESIENIKICRENNNSQ 529
Query: 173 -------DRKNPDLVRGQE-RGIVILDDTESVWSDHTENLIVLGKYVYFR 214
++ P + + + ++ILDD E VW + +NLI+L + YF
Sbjct: 530 KPFIVFNEQNVPKIFKFESINQLIILDDREDVWRNFQDNLILLDTFKYFN 579
>gi|215794710|pdb|3EF1|A Chain A, The Structure Of Fcp1, An Essential Rna Polymerase Ii Ctd
Phosphatase
Length = 442
Score = 57.8 bits (138), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 42/172 (24%), Positives = 81/172 (47%), Gaps = 35/172 (20%)
Query: 64 QEERKLQLVLNLDHTLLHC---------------------RNIKSLSSGEKYLKKQIHSF 102
++E++L L++ LD T++H R+++S + L++ +
Sbjct: 22 RQEKRLSLIVXLDQTIIHATVDPTVGEWMSDPGNVNYDVLRDVRSFN-----LQEGPSGY 76
Query: 103 IGSLFQMANDKLVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSR 162
+ +K RP + FL++ S L ++++ TM T+ YA+ K++D K F R
Sbjct: 77 TSCYY-------IKFRPGLAQFLQKISELYELHIYTMGTKAYAKEVAKIIDPTGKLFQDR 129
Query: 163 IIAREDFNGKDRKN-PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
+++R+D +K+ L +V++DD VW D NLI + Y +F
Sbjct: 130 VLSRDDSGSLAQKSLRRLFPCDTSMVVVIDDRGDVW-DWNPNLIKVVPYEFF 180
>gi|356510404|ref|XP_003523928.1| PREDICTED: uncharacterized protein LOC100810756 [Glycine max]
Length = 469
Score = 57.8 bits (138), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 52/163 (31%), Positives = 79/163 (48%), Gaps = 21/163 (12%)
Query: 56 LRGLRYSEQEERK-LQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDK- 113
L L +E +RK + LVL+LD TL+H S G Q F+M D+
Sbjct: 283 LPALLINETSKRKKVTLVLDLDETLIHS------SMG------QCDGAADFTFKMITDRE 330
Query: 114 ---LVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFN 170
V+ RPF++ FL + S + +I + T S R YAE + +LD D K+FS R+ RE
Sbjct: 331 LTVYVRKRPFLQEFLVKVSEMFEIIIFTASKRMYAETLLDVLDPDKKFFSRRVY-RESCT 389
Query: 171 GKDR---KNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKY 210
KDR K+ ++ + I+D+T V+ N I + +
Sbjct: 390 WKDRRCVKDLTVLGIDLAKVCIIDNTPEVFRFQVNNGIPIKSW 432
>gi|403222586|dbj|BAM40718.1| CTD-like phosphatase [Theileria orientalis strain Shintoku]
Length = 763
Score = 57.4 bits (137), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 42/135 (31%), Positives = 65/135 (48%), Gaps = 10/135 (7%)
Query: 115 VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKD- 173
+KLRP +R FL+ S ++ + T +T+ YA+ + +LD D F RI+AR + KD
Sbjct: 343 MKLRPCIREFLQILSLYYEMSIYTNATKEYADVVISILDPDRSLFMDRIVARNSVDEKDL 402
Query: 174 -----RKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRDKELNGDHKSYSET 228
R PDL R I+ DD VWSD +V ++ F + L + +Y+ +
Sbjct: 403 LKSASRLYPDL---DPRFILAFDDRRDVWSDIPHKQVVRAEHYDFFESYLTELNNNYTSS 459
Query: 229 LTD-ESENEEALANV 242
+D N E N
Sbjct: 460 GSDFNKANGEGSTNT 474
>gi|393240595|gb|EJD48120.1| hypothetical protein AURDEDRAFT_85955 [Auricularia delicata
TFB-10046 SS5]
Length = 796
Score = 57.4 bits (137), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 33/100 (33%), Positives = 55/100 (55%), Gaps = 2/100 (2%)
Query: 115 VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDR 174
+K RP ++ FLE S ++++ TM TR YAE +D D + F RI++R++
Sbjct: 261 IKPRPGLQAFLEAISQKYEMHVYTMGTRAYAEKVCAAIDPDGRMFGRRILSRDESGSLTA 320
Query: 175 KNPD-LVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
K+ + L +VI+DD VW D + NL+ + +Y +F
Sbjct: 321 KSLERLFPCDTSMVVIIDDRSDVW-DRSPNLVEVVRYDFF 359
>gi|302698337|ref|XP_003038847.1| hypothetical protein SCHCODRAFT_255670 [Schizophyllum commune H4-8]
gi|300112544|gb|EFJ03945.1| hypothetical protein SCHCODRAFT_255670 [Schizophyllum commune H4-8]
Length = 1207
Score = 57.0 bits (136), Expect = 1e-05, Method: Composition-based stats.
Identities = 34/104 (32%), Positives = 56/104 (53%), Gaps = 5/104 (4%)
Query: 115 VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDR 174
+K RP + F+ S+ ++++ TM TR YA A +LD D + F RI++R++ +
Sbjct: 620 IKPRPGWQEFMNNMSAKYEMHVYTMGTRAYAMAVCNVLDPDGRLFGERILSRDESGSLTQ 679
Query: 175 KNPD-LVRGQERGIVILDDTESVWSDHTE----NLIVLGKYVYF 213
K+ D L + +VI+DD VWS + NLI + Y +F
Sbjct: 680 KSLDRLFPTDQSMVVIIDDRADVWSGGLQFWSPNLIKVVPYDFF 723
>gi|300122627|emb|CBK23195.2| unnamed protein product [Blastocystis hominis]
Length = 598
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 66/266 (24%), Positives = 109/266 (40%), Gaps = 76/266 (28%)
Query: 67 RKLQLVLNLDHTLLHCRNIK-------------SLSSGE----KYLKKQIHSFIGSLFQM 109
+KL L+++LD TL+H + + S S+ E K LK Q+HS LF +
Sbjct: 151 KKLILIIDLDMTLVHAIHEEESIGLFLNWLHGASESNEEDEWKKTLKDQVHSI--ELFYV 208
Query: 110 ANDK-------LVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSR 162
++ L+K+RP VR L+ ++ ++ + T YAE ++++D D+ F R
Sbjct: 209 DDNGSARMSKLLIKIRPGVRAMLQMLANSYEMIVYTQGENQYAEKVMQIVDPDNTLFKKR 268
Query: 163 IIAREDFNGKDRKNP-------------DLVRGQE---------------------RGIV 188
IAR G+ R P VR Q R ++
Sbjct: 269 FIAR----GETRNEPQKKLLSKIVDCWNQYVRKQNVYDPANPTPESLPELTLEEMCRRLL 324
Query: 189 ILDDTESVWSDHTENLIVLG---------KYVYFRDKELNGDHKSYSETLTDESENEEAL 239
ILDD + VW H E+ ++L YV+F K D ++ + E ++ +
Sbjct: 325 ILDDKDEVWGMHEESGMILNPTSSLIKCFPYVFFDTK---SDLYNFEKLSAYEGVEQQYI 381
Query: 240 ANVLRVLKTIHRLFFDSVCGDVRTYL 265
+ + + IH+ F DVR L
Sbjct: 382 LRLSEIFRDIHQTFTLENAEDVRKTL 407
>gi|66805733|ref|XP_636588.1| hypothetical protein DDB_G0288707 [Dictyostelium discoideum AX4]
gi|60464974|gb|EAL63085.1| hypothetical protein DDB_G0288707 [Dictyostelium discoideum AX4]
Length = 985
Score = 57.0 bits (136), Expect = 1e-05, Method: Composition-based stats.
Identities = 48/177 (27%), Positives = 79/177 (44%), Gaps = 24/177 (13%)
Query: 68 KLQLVLNLDHTLLHCRNIKSLSSGEKYLK-KQIHSFIGSLFQMANDKLVKLRPFVRTFLE 126
K+ L++++DHTLLH + K ++ YLK I+ F ++ + VK RP FL
Sbjct: 574 KMYLIVDIDHTLLH--STKDPNAESYYLKDNSINKF--TITETNETFYVKQRPNAIEFLS 629
Query: 127 QASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVRGQE-- 184
SS IYL + + Y E +LD + F +++I +E + P G+
Sbjct: 630 SLSSQFKIYLYSFHPKYYVEQLALILDPNRSIF-TKVITKEVIEPVEPLPPINSIGKPYI 688
Query: 185 ----------------RGIVILDDTESVWSDHTENLIVLGKYVYFRDKELNGDHKSY 225
++ILDD E VW + +NLI+L + +F N ++Y
Sbjct: 689 VFNNQNFSKIFNFEAINQMIILDDREDVWRNFQDNLILLDTFKFFNTNSSNTSGRNY 745
>gi|389751366|gb|EIM92439.1| hypothetical protein STEHIDRAFT_136328 [Stereum hirsutum FP-91666
SS1]
Length = 1075
Score = 57.0 bits (136), Expect = 1e-05, Method: Composition-based stats.
Identities = 34/100 (34%), Positives = 53/100 (53%), Gaps = 2/100 (2%)
Query: 115 VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDR 174
VK RP R FL + ++++ TM TR YAE +D D K+F RI++R++ +
Sbjct: 308 VKPRPGTREFLSSVAEKYEMHVYTMGTRAYAEEVCAAIDPDGKFFGGRILSRDESGSMTQ 367
Query: 175 KN-PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
K+ L +VI+DD VW + + NLI + Y +F
Sbjct: 368 KSLRRLFPVDTSMVVIIDDRADVW-EWSPNLIKVIPYDFF 406
>gi|357451355|ref|XP_003595954.1| RNA polymerase II subunit A C-terminal domain phosphatase [Medicago
truncatula]
gi|355485002|gb|AES66205.1| RNA polymerase II subunit A C-terminal domain phosphatase [Medicago
truncatula]
Length = 239
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 27/60 (45%), Positives = 40/60 (66%)
Query: 104 GSLFQMANDKLVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
GSLF + ++ KLRPFVRTFL++AS + ++Y+ TM R Y+ KLLD +YF ++
Sbjct: 58 GSLFVLDMQRMNKLRPFVRTFLKEASEVFEMYIYTMGIRQYSLEMAKLLDPQVEYFKDKV 117
>gi|342320998|gb|EGU12936.1| RNA Polymerase II CTD phosphatase Fcp1, putative [Rhodotorula
glutinis ATCC 204091]
Length = 817
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 38/134 (28%), Positives = 65/134 (48%), Gaps = 14/134 (10%)
Query: 115 VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDR 174
+K+RP + FL++ + + ++++ TM TR YA K++D D F RI++R++ R
Sbjct: 252 IKMRPGLPDFLKRVAEMYEMHVYTMGTRAYASEVCKVIDPDGGLFGGRILSRDESGSMTR 311
Query: 175 KN-PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF------------RDKELNGD 221
K+ L +VI+DD VW D + +L+ + Y +F + KEL+
Sbjct: 312 KSLQRLFPCDTNMVVIIDDRADVW-DGSPHLVKVIPYEFFVGIGDINAAFLPKKKELHPP 370
Query: 222 HKSYSETLTDESEN 235
K ESE
Sbjct: 371 PKPKDAQAAPESEG 384
>gi|255540899|ref|XP_002511514.1| hypothetical protein RCOM_1513430 [Ricinus communis]
gi|223550629|gb|EEF52116.1| hypothetical protein RCOM_1513430 [Ricinus communis]
Length = 149
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 31/87 (35%), Positives = 47/87 (54%), Gaps = 13/87 (14%)
Query: 26 SCAHTTVRDSRCIFCSQAMNDSFGLSFDYMLRGLRYSEQE-------------ERKLQLV 72
SC+H V C C Q + D +GL F Y+++ LR S+ E +KL LV
Sbjct: 40 SCSHPIVLKLMCTICGQDVPDGYGLPFGYIMKDLRLSKIEADRQRYIETTNILSKKLILV 99
Query: 73 LNLDHTLLHCRNIKSLSSGEKYLKKQI 99
L+L+ TLL + ++L+ EKY++ QI
Sbjct: 100 LDLNKTLLQSKYPEALTPEEKYMENQI 126
>gi|71031738|ref|XP_765511.1| hypothetical protein [Theileria parva strain Muguga]
gi|68352467|gb|EAN33228.1| hypothetical protein TP02_0943 [Theileria parva]
Length = 769
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 46/140 (32%), Positives = 67/140 (47%), Gaps = 15/140 (10%)
Query: 115 VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKD- 173
+KLRP +R FL+ S ++ + T +T+ YA+ + +LD D F RI+AR + KD
Sbjct: 346 MKLRPCIREFLQILSLYYEMSIYTNATKEYADVVISILDPDRSLFMDRIVARNSVDEKDL 405
Query: 174 -----RKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRD---KELNGDHKS- 224
R PDL R I+ DD VWSD +V ++ F + ELN ++ S
Sbjct: 406 LKSASRLYPDL---DTRFILAFDDRRDVWSDIPHKQVVRAEHYDFFESYISELNNNYSSS 462
Query: 225 --YSETLTDESENEEALANV 242
S T ES + NV
Sbjct: 463 PTPSNKQTPESNSFNLTTNV 482
>gi|6689545|emb|CAB65510.1| FCP1 serine phosphatase [Xenopus laevis]
Length = 867
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 45/151 (29%), Positives = 72/151 (47%), Gaps = 10/151 (6%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLE 126
+KL L+++LD TL+H K + H +G M + +LRP + FLE
Sbjct: 62 QKLVLMVDLDQTLIHTTEQHCQHMSRKGI---FHFQLGRGEPMLH---TRLRPHCKEFLE 115
Query: 127 QASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE---DFNGKDRKNPDLVRGQ 183
+ + L ++++ T +R YA LD + K FS RI++R+ D K +L
Sbjct: 116 KIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECIDPYSKTGNLRNLFPCG 175
Query: 184 ERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+ + I+DD E VW NLI + K F+
Sbjct: 176 DSMVCIIDDREDVWK-FAPNLITVKKMCIFQ 205
>gi|356515353|ref|XP_003526365.1| PREDICTED: uncharacterized protein LOC100813300 [Glycine max]
Length = 467
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 47/151 (31%), Positives = 74/151 (49%), Gaps = 21/151 (13%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDK----LVKLRPFVR 122
+K+ L L+LD TL+H SS E+ F+M D+ V+ RPF++
Sbjct: 294 KKVTLALDLDETLIH-------SSMEQCDGADF------TFKMITDRERTVYVRKRPFLQ 340
Query: 123 TFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDR---KNPDL 179
FL + S + +I + T S R YAE + +LD D K+FS R + RE KDR K+ +
Sbjct: 341 EFLAKVSEMFEIIIFTASKRMYAETLLDVLDPDKKFFSRR-VCRESCTWKDRCCVKDLTV 399
Query: 180 VRGQERGIVILDDTESVWSDHTENLIVLGKY 210
+ + I+D+T V+ N I + +
Sbjct: 400 LGIDLAKVCIIDNTPEVFRFQVNNGIPIKSW 430
>gi|170084539|ref|XP_001873493.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164651045|gb|EDR15285.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 845
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 32/100 (32%), Positives = 56/100 (56%), Gaps = 2/100 (2%)
Query: 115 VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDR 174
+K RP + FL++AS+ ++++ TM TR YAE +D D K F R+++R++ +
Sbjct: 262 IKPRPGWKEFLQEASTKYEMHVYTMGTRAYAEQVCAAIDPDGKLFGGRVLSRDESGSLTQ 321
Query: 175 KN-PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
K+ L +VI+DD VW + + NL+ + Y +F
Sbjct: 322 KSLQRLFPCDTSMVVIIDDRADVW-EWSPNLLKVVPYDFF 360
>gi|294898997|ref|XP_002776453.1| NLI interacting factor, putative [Perkinsus marinus ATCC 50983]
gi|294900793|ref|XP_002777118.1| NLI interacting factor, putative [Perkinsus marinus ATCC 50983]
gi|239883444|gb|EER08269.1| NLI interacting factor, putative [Perkinsus marinus ATCC 50983]
gi|239884575|gb|EER08934.1| NLI interacting factor, putative [Perkinsus marinus ATCC 50983]
Length = 370
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 48/164 (29%), Positives = 72/164 (43%), Gaps = 25/164 (15%)
Query: 114 LVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYF--SSRIIAR-EDFN 170
VKLRP V FLE + + Y+ T +TR Y E ++ LD K F + + +R +D
Sbjct: 31 FVKLRPGVHQFLEALQPMYEFYIHTKATRVYLEYVMEALDPHKKGFFRNDNVFSRCDDMK 90
Query: 171 GKDRKNPDL----VRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRDKEL-------- 218
+N D+ R +E ++ILDD + +W D N+I Y Y K L
Sbjct: 91 HGSNENKDIRAVCSRPREE-VIILDDKDKIWLDFQPNVIKCPPYKYMDQKLLQVVRALKQ 149
Query: 219 -------NGDHKSYSET-LTDESENEEA-LANVLRVLKTIHRLF 253
G Y + L D S+N + L ++RV IH +
Sbjct: 150 TSDWIKEGGPESGYPKPELDDASKNFDGYLPAMVRVFTEIHHRY 193
>gi|68525545|ref|XP_723632.1| NLI interacting factor [Plasmodium yoelii yoelii 17XNL]
gi|23477988|gb|EAA15197.1| NLI interacting factor, putative [Plasmodium yoelii yoelii]
Length = 1251
Score = 56.2 bits (134), Expect = 2e-05, Method: Composition-based stats.
Identities = 42/116 (36%), Positives = 64/116 (55%), Gaps = 3/116 (2%)
Query: 116 KLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED-FNGKDR 174
KLRP V FL++ + +IYL TM T +A++ + LLD K+F +RI +R+D NG
Sbjct: 431 KLRPGVIEFLQKMNQKYEIYLYTMGTIEHAKSCLFLLDPLKKFFGNRIFSRKDCTNGMKH 490
Query: 175 KNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRDKELNGDHKSYSETLT 230
N L + I + DD+E +W + T + I + Y YF + + GD K + LT
Sbjct: 491 LNRILPTYRSISICV-DDSEYIWKE-TNSCIKVHAYNYFPEIQFLGDIKKKTYFLT 544
>gi|402220046|gb|EJU00119.1| hypothetical protein DACRYDRAFT_81791 [Dacryopinax sp. DJM-731 SS1]
Length = 855
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 34/106 (32%), Positives = 54/106 (50%), Gaps = 1/106 (0%)
Query: 115 VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDR 174
+K RP + FL + S L ++++ TM TR YA V+L+D F SR+++R++
Sbjct: 243 IKPRPGLHAFLSRLSELYEMHVYTMGTRSYASQVVRLIDPLGNLFGSRVLSRDESGSLTF 302
Query: 175 KN-PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRDKELN 219
KN L VI+DD VW NL+ + Y +F ++N
Sbjct: 303 KNLTRLFPCNTSSAVIIDDRADVWDLSRANLVKVVPYDFFSVGDIN 348
>gi|403217618|emb|CCK72111.1| hypothetical protein KNAG_0J00280 [Kazachstania naganishii CBS
8797]
Length = 742
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 40/169 (23%), Positives = 82/169 (48%), Gaps = 23/169 (13%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSL--FQMANDKL---------- 114
+KL LV++LD T++HC ++ ++ + + + + F + + +
Sbjct: 178 QKLVLVVDLDQTVVHCGVDPTIGEWKRDPRNPNYEALRDVQSFALEEEPILPFLYVGGKR 237
Query: 115 ---------VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIA 165
VK+RP ++ F ++ + L ++++ TM+TR YA K++D D F RI++
Sbjct: 238 PAPRKCWYYVKVRPGLKQFFKRLAPLFEMHIYTMATRAYALEIAKIIDPDKSLFGDRILS 297
Query: 166 REDFNGKDRKNPD-LVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
R++ K+ + L + + ++DD VW + NLI + Y +F
Sbjct: 298 RDENGSLTHKSLERLFPTDQSMVTVIDDRGDVW-NWCANLIKVVPYNFF 345
>gi|345479753|ref|XP_001603378.2| PREDICTED: hypothetical protein LOC100119644 [Nasonia vitripennis]
Length = 563
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 49/155 (31%), Positives = 72/155 (46%), Gaps = 13/155 (8%)
Query: 68 KLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQ 127
+ LVL+LD TL+HC +++ LS + ++F V+ RPF R FLE
Sbjct: 384 EFSLVLDLDETLVHC-SLQELSDASFRFPVVFQNITYTVF-------VRTRPFFREFLEH 435
Query: 128 ASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED---FNGKDRKNPDLVRGQE 184
SSL ++ L T S R YA + LLD K R+ RE NG K+ ++
Sbjct: 436 VSSLYEVILFTASKRVYANKLMNLLDPTRKLIKYRLF-REHCVCVNGNYIKDLSILGRDL 494
Query: 185 RGIVILDDTESVWSDHTENLIVLGKYVYFR-DKEL 218
VI+D++ + EN I + + R D EL
Sbjct: 495 SKTVIIDNSPQAFGYQLENGIPIESWFADRTDSEL 529
>gi|401886990|gb|EJT50998.1| protein phosphatase [Trichosporon asahii var. asahii CBS 2479]
Length = 922
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 34/101 (33%), Positives = 53/101 (52%), Gaps = 15/101 (14%)
Query: 115 VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYF--SSRIIAREDFNGK 172
K RP + FLE S L ++++ TM TR YA+A K++D + KYF S++ + R
Sbjct: 309 TKPRPGLNKFLEDMSKLYEMHVYTMGTRSYADAICKIVDPEGKYFAMSAKSLVR------ 362
Query: 173 DRKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
L + +VI+DD VW D + NL+ + Y +F
Sbjct: 363 ------LFPHDQSMVVIIDDRSDVWGD-SPNLVKVVPYDFF 396
>gi|406695220|gb|EKC98531.1| protein phosphatase [Trichosporon asahii var. asahii CBS 8904]
Length = 917
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 34/101 (33%), Positives = 53/101 (52%), Gaps = 15/101 (14%)
Query: 115 VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYF--SSRIIAREDFNGK 172
K RP + FLE S L ++++ TM TR YA+A K++D + KYF S++ + R
Sbjct: 309 TKPRPGLNKFLEDMSKLYEMHVYTMGTRSYADAICKIVDPEGKYFAMSAKSLVR------ 362
Query: 173 DRKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
L + +VI+DD VW D + NL+ + Y +F
Sbjct: 363 ------LFPHDQSMVVIIDDRSDVWGD-SPNLVKVVPYDFF 396
>gi|167384602|ref|XP_001737021.1| RNA polymerase II ctd phosphatase [Entamoeba dispar SAW760]
gi|165900378|gb|EDR26711.1| RNA polymerase II ctd phosphatase, putative [Entamoeba dispar
SAW760]
Length = 429
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 69/294 (23%), Positives = 124/294 (42%), Gaps = 61/294 (20%)
Query: 27 CAHTTVRDSR-CIFCSQAMND---------SFGLSFDYMLRGLRYSEQ---EERKLQLVL 73
C H + D C+ C Q + D +G++ Y R + +E+KL L+L
Sbjct: 7 CPHNKINDQNYCVDCYQLIEDVDDYIRTSGGYGITKSYAEEQKRSVSERLLKEKKLSLIL 66
Query: 74 NLDHTLLHCRN--IKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASSL 131
+LD T++ L + E+ + + F + + L+K R + TF+E+ S L
Sbjct: 67 DLDGTIVFTNPELCVPLENEEEPITPE-QGFYFEIPEQNAKVLIKFRDGIVTFMEKVSKL 125
Query: 132 VDIYLCTMSTRCYAEAAVKLLDL--DSKYFSSRIIAREDFNGK-------------DRKN 176
DI++ T+ + YA A V ++ D+ + + ++ ED + DR+
Sbjct: 126 YDIHVVTLGQKEYAFAIVNAINKLRDTPFITGDLVTAEDCSSVIVCDEKDTNDGLIDREE 185
Query: 177 PDLVRGQERGI---------VILDDTESVWSDHTENLIVLGKYVYFRDKELNGDHKSYSE 227
+ R +R I VI+DD VW + +N++ + +YV
Sbjct: 186 TNERRSVKRSIPTMGKEEMQVIVDDRIDVWDN--KNVVQICEYV---------------- 227
Query: 228 TLTDESENEEALANVLRVLKTIHRLFFDSVCGDVRTYLPKVRSEFSRDV-LYFS 280
T++ + E L V VL+ I+ F+D DV+ L R + + LYF+
Sbjct: 228 PSTNQVDTE--LLRVTEVLQNIYNKFYDEHIEDVKEILHSFRKKILENKNLYFN 279
>gi|328859642|gb|EGG08750.1| hypothetical protein MELLADRAFT_115868 [Melampsora larici-populina
98AG31]
Length = 736
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 39/152 (25%), Positives = 76/152 (50%), Gaps = 32/152 (21%)
Query: 65 EERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTF 124
++ KL L+++LD T++H + + +I L + F+RT
Sbjct: 269 KDTKLSLIVDLDQTIVHA-----------TVDPTVGEWIPGLSE-----------FLRTL 306
Query: 125 LEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVR--- 181
E+ ++++ TM TR YA+A +++D S+ F SR+++R++ +K+ L R
Sbjct: 307 AEK----YEMHVYTMGTRAYADAVCRIIDPTSELFGSRVLSRDESGSMTQKS--LTRLFP 360
Query: 182 GQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
+VI+DD VW +++ NL+ + Y +F
Sbjct: 361 VDTSMVVIIDDRGDVW-EYSPNLVSVVPYNFF 391
>gi|350421968|ref|XP_003493015.1| PREDICTED: hypothetical protein LOC100746789 isoform 2 [Bombus
impatiens]
Length = 457
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 57/208 (27%), Positives = 90/208 (43%), Gaps = 40/208 (19%)
Query: 68 KLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQ 127
+ LVL+LD TL+HC +++ LS ++F V+ RP+ R FLE
Sbjct: 278 EFSLVLDLDETLVHC-SLQELSDAAFRFPVVFQDVTYTVF-------VRTRPYFREFLEH 329
Query: 128 ASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED---FNGKDRKNPDLVRGQE 184
SSL ++ L T S R YA + LLD K R+ RE NG K+ ++
Sbjct: 330 VSSLYEVILFTASKRVYANKLMNLLDPTRKLIKYRLF-REHCVCVNGNYIKDLSILGRDL 388
Query: 185 RGIVILDDTESVWSDHTENLIVLGKYVYFRDKELNGDHKSYSETLTDESENEEALANVLR 244
VI+D++ + EN I + + D S+NE +++
Sbjct: 389 SKTVIIDNSPQAFGYQLENGIPIESW------------------FADRSDNE-----LMK 425
Query: 245 VLKTIHRLFFDSVCGDVRTYLPKVRSEF 272
+L + L + GDVR P++R +F
Sbjct: 426 LLPFLENLV--NWGGDVR---PRIREQF 448
>gi|353236741|emb|CCA68729.1| related to FCP1-TFIIF interacting component of CTD phosphatase
[Piriformospora indica DSM 11827]
Length = 782
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 49/178 (27%), Positives = 80/178 (44%), Gaps = 33/178 (18%)
Query: 67 RKLQLVLNLDHTLLHC-------RNIKSLSSGEKYLKKQIHSFIGSL------------- 106
RKL L+++LD T+LH IK+ + EK
Sbjct: 155 RKLSLIVDLDQTILHATFDPTVGEWIKAKDAFEKRRSTTPPDHDPPPESVNWPALEDVIS 214
Query: 107 FQMANDK---------LVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSK 157
FQ+ +D VK RP ++ F+ S L ++++ TM R YA A LD
Sbjct: 215 FQLPSDHGHMGHSERYYVKPRPGLQRFMNNLSELYEMHVYTMGVRSYANAICAALDPSGA 274
Query: 158 YFSSRIIAREDFNGKDR-KN-PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
+F SR+++R + +G DR KN L + +V++DD VW + + NL+ + + +F
Sbjct: 275 WFGSRVLSRNE-SGSDRVKNLKRLFPSDQSMVVVIDDRADVW-NWSPNLVRVIPFEFF 330
>gi|91086797|ref|XP_973406.1| PREDICTED: similar to CTD (carboxy-terminal domain, RNA polymerase
II, polypeptide A) small phosphatase like 2 [Tribolium
castaneum]
gi|270009707|gb|EFA06155.1| hypothetical protein TcasGA2_TC009000 [Tribolium castaneum]
Length = 451
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 50/177 (28%), Positives = 84/177 (47%), Gaps = 17/177 (9%)
Query: 49 GLSFDYMLR--GLRYSEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSL 106
L+FD + L + + LVL+LD TL+HC +++ LS + L
Sbjct: 251 PLTFDMRSKCPALPLKTRSSPEFSLVLDLDETLVHC-SLQELSDASFHFP--------VL 301
Query: 107 FQMANDKL-VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIA 165
FQ + + V+ RP+ R F+E+ S + ++ L T S R YA+ + LLD + K+ R+
Sbjct: 302 FQDCSYTVYVRTRPYFREFMEKVSQMFEVILFTASKRVYADKLLNLLDPERKWIKYRLF- 360
Query: 166 RED---FNGKDRKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFR-DKEL 218
RE NG K+ ++ +I+D++ + H N I + + R D EL
Sbjct: 361 REHCVCVNGNYIKDLSILGRDLSKTIIIDNSPQAFGYHLNNGIPIESWFVDRTDSEL 417
>gi|429964988|gb|ELA46985.1| FCP1-like phosphatase, phosphatase domain-containing protein,
partial [Vavraia culicis 'floridensis']
Length = 231
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 50/221 (22%), Positives = 95/221 (42%), Gaps = 40/221 (18%)
Query: 25 LSCAHTTVRDSRCIFCSQAMNDSFGLSFDYMLRG---------------LRYSEQ--EER 67
+ C H + C C Q + D+ F L +RY ++ +++
Sbjct: 1 MPCQHPIKLNKLCALCGQEVQDTENTKFYNALHSNSRLRVDKSTIDGMYVRYRDELIQKK 60
Query: 68 KLQLVLNLDHTLLHCRNIKSLSSGE------------------KYLKKQIHSFIGSLFQ- 108
K+ LV++LD T+LH +K G+ + L+ + + S F
Sbjct: 61 KMILVVDLDQTILHSIEVKGGRVGDNGSRNRNGECGGRGITNKQLLQARPRQPLPSSFTY 120
Query: 109 -MANDKL-VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAR 166
+A+ + LRP + TFL + + + +++ TM T Y ++D D F RI+ R
Sbjct: 121 TLASTTMKTTLRPHLHTFLTELNEMFHMHIYTMGTSEYVHQITNVIDRDRSLFGDRIVTR 180
Query: 167 EDFNGKDRKNPDLVRGQERGIVILDDTESVWSDHTENLIVL 207
+D ++ L +E +V++DD VW ++ NL+++
Sbjct: 181 DD-EVLVKRLERLFGDREDMVVVIDDRGDVW-EYCGNLVMI 219
>gi|350421965|ref|XP_003493014.1| PREDICTED: hypothetical protein LOC100746789 isoform 1 [Bombus
impatiens]
Length = 558
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 57/208 (27%), Positives = 90/208 (43%), Gaps = 40/208 (19%)
Query: 68 KLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQ 127
+ LVL+LD TL+HC +++ LS ++F V+ RP+ R FLE
Sbjct: 379 EFSLVLDLDETLVHC-SLQELSDAAFRFPVVFQDVTYTVF-------VRTRPYFREFLEH 430
Query: 128 ASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED---FNGKDRKNPDLVRGQE 184
SSL ++ L T S R YA + LLD K R+ RE NG K+ ++
Sbjct: 431 VSSLYEVILFTASKRVYANKLMNLLDPTRKLIKYRLF-REHCVCVNGNYIKDLSILGRDL 489
Query: 185 RGIVILDDTESVWSDHTENLIVLGKYVYFRDKELNGDHKSYSETLTDESENEEALANVLR 244
VI+D++ + EN I + + D S+NE +++
Sbjct: 490 SKTVIIDNSPQAFGYQLENGIPIESW------------------FADRSDNE-----LMK 526
Query: 245 VLKTIHRLFFDSVCGDVRTYLPKVRSEF 272
+L + L + GDVR P++R +F
Sbjct: 527 LLPFLENLV--NWGGDVR---PRIREQF 549
>gi|307194093|gb|EFN76554.1| CTD small phosphatase-like protein 2 [Harpegnathos saltator]
Length = 546
Score = 55.1 bits (131), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 57/208 (27%), Positives = 89/208 (42%), Gaps = 40/208 (19%)
Query: 68 KLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQ 127
+ LVL+LD TL+HC +++ LS ++F V+ RP+ R FLE
Sbjct: 367 EFSLVLDLDETLVHC-SLQELSDAAFRFPVVFQDVTYTVF-------VRTRPYFREFLEH 418
Query: 128 ASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED---FNGKDRKNPDLVRGQE 184
SSL ++ L T S R YA + LLD K R+ RE NG K+ ++
Sbjct: 419 VSSLYEVILFTASKRVYANKLMNLLDPTRKLIKYRLF-REHCVCVNGNYIKDLSILGRDL 477
Query: 185 RGIVILDDTESVWSDHTENLIVLGKYVYFRDKELNGDHKSYSETLTDESENEEALANVLR 244
VI+D++ + EN I + + D S+NE +++
Sbjct: 478 SKTVIIDNSPQAFGYQLENGIPIESW------------------FADRSDNE-----LMK 514
Query: 245 VLKTIHRLFFDSVCGDVRTYLPKVRSEF 272
+L + L + GDVR P +R +F
Sbjct: 515 LLPFLENLV--NWGGDVR---PHIREQF 537
>gi|322779051|gb|EFZ09448.1| hypothetical protein SINV_03717 [Solenopsis invicta]
Length = 568
Score = 55.1 bits (131), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 57/208 (27%), Positives = 90/208 (43%), Gaps = 40/208 (19%)
Query: 68 KLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQ 127
+ LVL+LD TL+HC +++ LS ++F V+ RP+ R FLE
Sbjct: 389 EFSLVLDLDETLVHC-SLQELSDAAFRFPVVFQDVTYTVF-------VRTRPYFREFLEH 440
Query: 128 ASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED---FNGKDRKNPDLVRGQE 184
SSL ++ L T S R YA + LLD K R+ RE NG K+ ++
Sbjct: 441 VSSLYEVILFTASKRVYANKLMNLLDPTRKLIKYRLF-REHCVCVNGNYIKDLSILGRDL 499
Query: 185 RGIVILDDTESVWSDHTENLIVLGKYVYFRDKELNGDHKSYSETLTDESENEEALANVLR 244
VI+D++ + EN I + + D S+NE +++
Sbjct: 500 SKTVIIDNSPQAFGYQLENGIPIESW------------------FADRSDNE-----LMK 536
Query: 245 VLKTIHRLFFDSVCGDVRTYLPKVRSEF 272
+L + L + GDVR P++R +F
Sbjct: 537 LLPFLENLV--NWGGDVR---PRIREQF 559
>gi|313234471|emb|CBY24671.1| unnamed protein product [Oikopleura dioica]
Length = 614
Score = 55.1 bits (131), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 52/198 (26%), Positives = 93/198 (46%), Gaps = 21/198 (10%)
Query: 64 QEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRT 123
+ RKL L+++LD T++H + + K L K SF L + +LRPF
Sbjct: 68 HDNRKLVLLVDLDQTVIH-----TTQNRPKKLTKNTISF--QLTRQDPWLWTRLRPFCAK 120
Query: 124 FLEQASSLVDIYLCTMSTRCYAEAAVKL--------LDLDSK--YFSSRIIAREDFNGKD 173
F+ + S ++++ T +R YA ++ L+LDS +FS RI++R++
Sbjct: 121 FIHEMSEKYELHIVTFGSRQYAHKIAEILEDQTRRQLNLDSNKSFFSHRILSRDECVDPF 180
Query: 174 RKNPDLVRGQERG---IVILDDTESVWSDHTENLIVLGKYVYFRDKELNGDHKSYSETLT 230
K+ +L G I+DD VW ++ N I++ KY +F D D ++ TL
Sbjct: 181 HKSGNLEHLFPCGDSMCAIIDDRGDVWR-YSPNCILVKKYHFFTDTGDINDPHAFKSTLP 239
Query: 231 DESENEEALANVLRVLKT 248
S+ + L + + + +
Sbjct: 240 PTSQTQNELPDKDKAISS 257
>gi|156381374|ref|XP_001632240.1| predicted protein [Nematostella vectensis]
gi|156219293|gb|EDO40177.1| predicted protein [Nematostella vectensis]
Length = 122
Score = 54.7 bits (130), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 32/103 (31%), Positives = 54/103 (52%), Gaps = 4/103 (3%)
Query: 115 VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED-FNGKD 173
K RP+ FL++ + ++++ TM TR YA ++LD D F RI +R+D FN
Sbjct: 5 TKFRPWAHKFLQKIAKFYELHIFTMGTRMYAHTIARMLDPDLSLFGYRIRSRDDCFNAFS 64
Query: 174 RKNP--DLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFR 214
+ N L + + I+DD VW++ +LI + Y +F+
Sbjct: 65 KFNDLRSLFPCGDSMVCIIDDRADVWNN-APSLIKVKPYQFFK 106
>gi|332020757|gb|EGI61161.1| CTD small phosphatase-like protein 2 [Acromyrmex echinatior]
Length = 593
Score = 54.7 bits (130), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 57/208 (27%), Positives = 90/208 (43%), Gaps = 40/208 (19%)
Query: 68 KLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQ 127
+ LVL+LD TL+HC +++ LS ++F V+ RP+ R FLE
Sbjct: 414 EFSLVLDLDETLVHC-SLQELSDAAFRFPVVFQDVTYTVF-------VRTRPYFREFLEH 465
Query: 128 ASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED---FNGKDRKNPDLVRGQE 184
SSL ++ L T S R YA + LLD K R+ RE NG K+ ++
Sbjct: 466 VSSLYEVILFTASKRVYANKLMNLLDPTRKLIKYRLF-REHCVCVNGNYIKDLSILGRDL 524
Query: 185 RGIVILDDTESVWSDHTENLIVLGKYVYFRDKELNGDHKSYSETLTDESENEEALANVLR 244
VI+D++ + EN I + + D S+NE +++
Sbjct: 525 SKTVIIDNSPQAFGYQLENGIPIESW------------------FADRSDNE-----LMK 561
Query: 245 VLKTIHRLFFDSVCGDVRTYLPKVRSEF 272
+L + L + GDVR P++R +F
Sbjct: 562 LLPFLENLV--NWGGDVR---PRIREQF 584
>gi|221486680|gb|EEE24941.1| RNA polymerase II phosphatase, putative [Toxoplasma gondii GT1]
Length = 1234
Score = 54.7 bits (130), Expect = 5e-05, Method: Composition-based stats.
Identities = 29/100 (29%), Positives = 54/100 (54%), Gaps = 1/100 (1%)
Query: 116 KLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRK 175
KLRP FL + S ++Y+ TM T +A A+++LD ++F R+ +R+D +
Sbjct: 632 KLRPGCLDFLRRVSQTFELYMYTMGTALHAATALRILDPKRRFFGRRVFSRQDAVNGLKA 691
Query: 176 NPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRD 215
+ ++ ++++DD E +WS ++ I + Y YF D
Sbjct: 692 IERIFPHDQKMVLVVDDLECMWS-YSPCCIKVQGYHYFAD 730
>gi|307165882|gb|EFN60237.1| CTD small phosphatase-like protein 2 [Camponotus floridanus]
Length = 568
Score = 54.7 bits (130), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 58/218 (26%), Positives = 92/218 (42%), Gaps = 40/218 (18%)
Query: 58 GLRYSEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKL 117
L + + LVL+LD TL+HC +++ LS ++F V+
Sbjct: 379 ALPLKTRSSPEFSLVLDLDETLVHC-SLQELSDAAFRFPVVFQDVTYTVF-------VRT 430
Query: 118 RPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED---FNGKDR 174
RP+ R FLE SSL ++ L T S R YA + LLD K R+ RE NG
Sbjct: 431 RPYFREFLEHVSSLYEVILFTASKRVYANKLMNLLDPTRKLIKYRLF-REHCVCVNGNYI 489
Query: 175 KNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRDKELNGDHKSYSETLTDESE 234
K+ ++ VI+D++ + EN I + + D S+
Sbjct: 490 KDLSILGRDLSKTVIIDNSPQAFGYQLENGIPIESW------------------FADRSD 531
Query: 235 NEEALANVLRVLKTIHRLFFDSVCGDVRTYLPKVRSEF 272
NE ++++L + L + GDVR P++R +F
Sbjct: 532 NE-----LMKLLPFLENLV--NWGGDVR---PRIREQF 559
>gi|221508436|gb|EEE34023.1| RNA polymerase II phosphatase, putative [Toxoplasma gondii VEG]
Length = 1228
Score = 54.7 bits (130), Expect = 6e-05, Method: Composition-based stats.
Identities = 29/100 (29%), Positives = 54/100 (54%), Gaps = 1/100 (1%)
Query: 116 KLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRK 175
KLRP FL + S ++Y+ TM T +A A+++LD ++F R+ +R+D +
Sbjct: 626 KLRPGCLDFLRRVSQTFELYMYTMGTALHAATALRILDPKRRFFGRRVFSRQDAVNGLKA 685
Query: 176 NPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRD 215
+ ++ ++++DD E +WS ++ I + Y YF D
Sbjct: 686 IERIFPHDQKMVLVVDDLECMWS-YSPCCIKVQGYHYFAD 724
>gi|428672173|gb|EKX73087.1| conserved hypothetical protein [Babesia equi]
Length = 937
Score = 54.7 bits (130), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 35/105 (33%), Positives = 55/105 (52%), Gaps = 9/105 (8%)
Query: 115 VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKD- 173
+KLRP +R FL+ S ++ + T +T+ YA+ + +LD D F RI+AR + KD
Sbjct: 511 MKLRPCIREFLQVLSLYYEMSIYTNATKEYADVVISILDPDRTLFMDRIVARNSVDEKDL 570
Query: 174 -----RKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
R PDL R R ++ DD + VW+D +V ++ F
Sbjct: 571 LKSAARLYPDLNR---RFVLAFDDRKDVWADIPHRQVVRAEHYDF 612
>gi|389584495|dbj|GAB67227.1| hypothetical protein PCYB_112480 [Plasmodium cynomolgi strain B]
Length = 1447
Score = 54.3 bits (129), Expect = 7e-05, Method: Composition-based stats.
Identities = 40/150 (26%), Positives = 75/150 (50%), Gaps = 12/150 (8%)
Query: 115 VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAR-EDFNGKD 173
+K RP+VR FL+ S ++ + T +TR YA+ + +LD D F+ RI+AR + ++
Sbjct: 1035 LKFRPYVRQFLQILSLYYELSIYTNATREYADVVIAILDPDRTLFADRIVARCSSADREE 1094
Query: 174 RKN-----PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRDKELNGDHKSYSET 228
KN P++ + I+ DD + VW+D + I+ ++ F + + E
Sbjct: 1095 NKNFSKIYPNV---DSKYIIAFDDRKDVWTDIPHSHILKAEHYNFFELSKYDIISHFKEP 1151
Query: 229 LTDES---ENEEALANVLRVLKTIHRLFFD 255
T + + + L + +VL +H+ FF+
Sbjct: 1152 TTCKKRFVDMDMHLHFMTKVLLKLHKHFFE 1181
>gi|299756470|ref|XP_002912206.1| RNA polymerase II subunit A domain phosphatase [Coprinopsis cinerea
okayama7#130]
gi|298411691|gb|EFI28712.1| RNA polymerase II subunit A domain phosphatase [Coprinopsis cinerea
okayama7#130]
Length = 801
Score = 54.3 bits (129), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 32/100 (32%), Positives = 55/100 (55%), Gaps = 2/100 (2%)
Query: 115 VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDR 174
+K RP + FLE A+ ++++ TM TR YA+ +D D K F SR+++R++ +
Sbjct: 271 IKPRPGWKEFLENAAKKYEMHVYTMGTRAYAQEVCAAIDPDGKLFGSRLLSRDESGSLTQ 330
Query: 175 KN-PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
K+ L +VI+DD VW + + NL+ + Y +F
Sbjct: 331 KSLQRLFPCDTSMVVIIDDRADVW-EWSPNLLKVIPYDFF 369
>gi|392597598|gb|EIW86920.1| hypothetical protein CONPUDRAFT_95946 [Coniophora puteana
RWD-64-598 SS2]
Length = 830
Score = 54.3 bits (129), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 34/100 (34%), Positives = 54/100 (54%), Gaps = 2/100 (2%)
Query: 115 VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDR 174
VK RP + F ++ S ++++ TM TR YAE +D DSK F RI++R++ +
Sbjct: 264 VKPRPGWKEFFQELSKKYEMHVYTMGTRAYAEEVCAAIDPDSKIFGGRILSRDESGSLTQ 323
Query: 175 KN-PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
K+ L +VI+DD VW + + NLI + Y +F
Sbjct: 324 KSLQRLFPCDTSMVVIIDDRADVW-EWSPNLIKVIPYDFF 362
>gi|70952066|ref|XP_745226.1| hypothetical protein [Plasmodium chabaudi chabaudi]
gi|56525483|emb|CAH77992.1| conserved hypothetical protein [Plasmodium chabaudi chabaudi]
Length = 1224
Score = 54.3 bits (129), Expect = 7e-05, Method: Composition-based stats.
Identities = 41/116 (35%), Positives = 63/116 (54%), Gaps = 3/116 (2%)
Query: 116 KLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED-FNGKDR 174
KLRP V FL++ + +IYL TM T +A++ + LLD K+F +RI +R+D NG
Sbjct: 432 KLRPGVIEFLQKMNQKYEIYLYTMGTIEHAKSCLFLLDPLKKFFGNRIFSRKDCTNGMKH 491
Query: 175 KNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRDKELNGDHKSYSETLT 230
N L + I + DD+E +W + + I + Y YF + + GD K + LT
Sbjct: 492 LNRILPTYRSISICV-DDSEYIWKE-ANSCIKVHAYNYFPEIQFLGDIKKKTYFLT 545
>gi|449678335|ref|XP_002165480.2| PREDICTED: CTD small phosphatase-like protein 2-like [Hydra
magnipapillata]
Length = 421
Score = 54.3 bits (129), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 35/96 (36%), Positives = 52/96 (54%), Gaps = 8/96 (8%)
Query: 68 KLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQ 127
++ LVL+LD TL+HC SLS E Y F +Q+ VKLRP + FLE+
Sbjct: 243 QMTLVLDLDETLVHC----SLSKLEAYNMTFNVVFDNVTYQL----FVKLRPHLLEFLER 294
Query: 128 ASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
S L ++ L T S R YA+ + ++D ++F R+
Sbjct: 295 VSKLYEVILFTASRRVYADKLLNIIDPRRQFFRHRL 330
>gi|336374248|gb|EGO02585.1| hypothetical protein SERLA73DRAFT_102556 [Serpula lacrymans var.
lacrymans S7.3]
Length = 811
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 46/169 (27%), Positives = 78/169 (46%), Gaps = 29/169 (17%)
Query: 67 RKLQLVLNLDHTLLHC-----------------RNIKSLSSGEKY-LKKQIHSFI---GS 105
RKL L+++LD T++H N ++L K+ L K FI G
Sbjct: 159 RKLSLIVDLDQTIVHATVDPTVATDSESDDECNPNWEALKDVRKFQLVKGKQKFIENEGC 218
Query: 106 LFQMANDKLVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIA 165
++ +K RP + FL ++ ++++ TM TR YAE +D D F RI++
Sbjct: 219 MY------YIKPRPGWQHFLHSIANKYEMHVYTMGTRAYAEEVCAAIDPDGTIFGGRILS 272
Query: 166 REDFNGKDRKN-PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
R++ +K+ L +VI+DD VW + + NL+ + Y +F
Sbjct: 273 RDESGSLTQKSLQRLFPCDTSMVVIIDDRADVW-EWSPNLVKVIPYDFF 320
>gi|156101293|ref|XP_001616340.1| hypothetical protein [Plasmodium vivax Sal-1]
gi|148805214|gb|EDL46613.1| hypothetical protein, conserved [Plasmodium vivax]
Length = 1544
Score = 53.9 bits (128), Expect = 1e-04, Method: Composition-based stats.
Identities = 39/150 (26%), Positives = 75/150 (50%), Gaps = 12/150 (8%)
Query: 115 VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAR-EDFNGKD 173
+K RP+VR FL+ S ++ + T +TR YA+ + +LD D F+ RI+AR + ++
Sbjct: 1125 LKFRPYVRQFLQILSLYYELSIYTNATREYADVVIAILDPDRTLFADRIVARCSSADREE 1184
Query: 174 RKN-----PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRDKELNGDHKSYSET 228
KN P++ + ++ DD + VW+D + I+ ++ F + + E
Sbjct: 1185 NKNFSKIYPNV---DSKYVIAFDDRKDVWTDIPHSHILKAEHYNFFELSKYDIISHFKEP 1241
Query: 229 LTDES---ENEEALANVLRVLKTIHRLFFD 255
T + + + L + +VL +H+ FF+
Sbjct: 1242 STCKKRFVDMDMHLHFMTKVLLKLHKQFFE 1271
>gi|156404147|ref|XP_001640269.1| predicted protein [Nematostella vectensis]
gi|156227402|gb|EDO48206.1| predicted protein [Nematostella vectensis]
Length = 289
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 34/96 (35%), Positives = 52/96 (54%), Gaps = 8/96 (8%)
Query: 68 KLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQ 127
+ LVL+LD TL+HC SL+ L+ SF S + V+ RP ++ FLE+
Sbjct: 103 EFSLVLDLDETLVHC----SLNK----LEDATLSFPVSYQDITYQVFVRTRPHLKYFLER 154
Query: 128 ASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
S + ++ L T S R YA+ + +LD + KYF R+
Sbjct: 155 VSKVFEVILFTASKRVYADKLLNILDPEKKYFRHRL 190
>gi|221057654|ref|XP_002261335.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
knowlesi strain H]
gi|194247340|emb|CAQ40740.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
knowlesi strain H]
Length = 1389
Score = 53.5 bits (127), Expect = 1e-04, Method: Composition-based stats.
Identities = 42/150 (28%), Positives = 75/150 (50%), Gaps = 12/150 (8%)
Query: 115 VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDR 174
+K RP+VR FL+ S ++ + T +TR YA+ + +LD D F+ RI+AR N DR
Sbjct: 965 LKFRPYVRQFLQILSLYYELSIYTNATREYADVVIAILDPDRTLFADRIVAR--CNSADR 1022
Query: 175 -KNPDLVR----GQERGIVILDDTESVWSD--HTENLIVLGKYVYFR--DKELNGDHKSY 225
+N + + + ++ DD + VW+D H+ N++ Y +F ++ K
Sbjct: 1023 EENKNFSKIYPNVDSKYVIAFDDRKDVWTDIPHS-NILKAEHYNFFELSKYDIISHFKEP 1081
Query: 226 SETLTDESENEEALANVLRVLKTIHRLFFD 255
S + + L + +VL +H+ FF+
Sbjct: 1082 STCKKRFVDMDMHLHFMTKVLLKLHKHFFE 1111
>gi|145495300|ref|XP_001433643.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124400762|emb|CAK66246.1| unnamed protein product [Paramecium tetraurelia]
Length = 477
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 42/155 (27%), Positives = 77/155 (49%), Gaps = 17/155 (10%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
+V +LD TL+HC ++L + YL S G Q + +RP+ + L++ S
Sbjct: 289 VVFDLDETLIHCNENQNLK-ADVYLPITFPS--GDTAQAG----INIRPYAKWILQELSQ 341
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSR------IIAREDFNGKDRKNPDLVRGQE 184
L ++ + T S +CYA +K LD +S S + +++++ + KD + ++
Sbjct: 342 LCEVIVFTASHQCYASQVIKFLDPNSNLLSGQLFRDRCVLSQDGVHIKDLR---VLNRDP 398
Query: 185 RGIVILDDTESVWSDHTENLI-VLGKYVYFRDKEL 218
+ IV++D+ + H EN I ++ Y DKEL
Sbjct: 399 KDIVLVDNAAYSFGVHLENGIPIIPFYDNKEDKEL 433
>gi|402584910|gb|EJW78851.1| hypothetical protein WUBG_10241, partial [Wuchereria bancrofti]
Length = 278
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 54/226 (23%), Positives = 88/226 (38%), Gaps = 38/226 (16%)
Query: 2 GAYSCKECVGKTKFVIKRKCEQSL-SCAHTTVRDSRCIFCSQAMNDSFGLSFDY------ 54
G S + K + K SL +C+H V C C + + G S D
Sbjct: 53 GVVSIDTTIKKGNKLKKGMTVASLRACSHAIVIKDMCASCGKDLRGKPGTSGDLTEASTA 112
Query: 55 ----------------MLRGLRYSEQE----ERKLQLVLNLDHTLLHCRNIKSLSSGEKY 94
+ R + ++E RKL L+++LD TL+H N
Sbjct: 113 NVSMIHHVPELIVSDELARKIGSRDRELLLKARKLVLLVDLDQTLIHTTN--------HT 164
Query: 95 LKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDL 154
K + + + D K+RP R FL + + L ++++ + R YA + LD
Sbjct: 165 FKLEKDTDVLHYKLKGTDFYTKIRPHAREFLRRMAGLYEMHIISYGERQYAHRIAEFLDP 224
Query: 155 DSKYFSSRIIAREDF---NGKDRKNPDLVRGQERGIVILDDTESVW 197
+ YF RI++R++ K R L + IV++DD VW
Sbjct: 225 EKIYFGHRILSRDELFCAMYKTRNMQALFPCGDHMIVMIDDRPDVW 270
>gi|351699228|gb|EHB02147.1| CTD small phosphatase-like protein 2 [Heterocephalus glaber]
Length = 465
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 35/100 (35%), Positives = 54/100 (54%), Gaps = 16/100 (16%)
Query: 68 KLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFI----GSLFQMANDKLVKLRPFVRT 123
K LVL+LD TL+HC SL+ L+ H+F G ++Q+ V+LRPF R
Sbjct: 286 KFSLVLDLDETLVHC----SLNE----LEDAAHTFPVLFQGVIYQV----YVRLRPFFRE 333
Query: 124 FLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
FLE+ S + +I + T + + YAE + +LD + R+
Sbjct: 334 FLERMSKMYEIIVFTAAKKVYAEKLLNILDPKKQLVRHRL 373
>gi|158293726|ref|XP_315066.4| AGAP004967-PA [Anopheles gambiae str. PEST]
gi|157016584|gb|EAA10342.4| AGAP004967-PA [Anopheles gambiae str. PEST]
Length = 226
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 43/157 (27%), Positives = 71/157 (45%), Gaps = 14/157 (8%)
Query: 58 GLRYSEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMAN-DKLVK 116
L + + LVL+LD TL+HC ++ + K+ LFQ V+
Sbjct: 37 ALPLKTRSSPEFSLVLDLDETLVHCSLMELSDASFKF---------PVLFQECKYTVFVR 87
Query: 117 LRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED---FNGKD 173
RP+ R FLE+ S + ++ L T S R YA+ + LLD D + R+ RE NG
Sbjct: 88 TRPYFREFLERVSQMFEVILFTASKRVYADKLLNLLDPDRRLIKYRLF-REHCVLVNGNY 146
Query: 174 RKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKY 210
K+ ++ +I+D++ + EN I + +
Sbjct: 147 IKDLTILGRDLSKTIIIDNSPQAFGYQLENGIPIESW 183
>gi|145501228|ref|XP_001436596.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124403737|emb|CAK69199.1| unnamed protein product [Paramecium tetraurelia]
Length = 483
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 45/157 (28%), Positives = 78/157 (49%), Gaps = 21/157 (13%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
+V +LD TL+HC +SL + YL S G Q + +RPF + L++ S
Sbjct: 295 VVFDLDETLIHCNENQSLK-ADVYLPITFPS--GDTVQAG----INIRPFAKWILQELSQ 347
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSR------IIAREDFNGKDRK--NPDLVRG 182
+ ++ + T S +CYA ++ LD ++ S++ +++ + + KD K N DL
Sbjct: 348 ICEVIVFTASHQCYASQVIQYLDPKNQLLSAQLFRDKCVLSPDGVHIKDLKIFNRDL--- 404
Query: 183 QERGIVILDDTESVWSDHTENLIVLGKYVYFR-DKEL 218
+ IV++D+ + H EN I + Y + DKEL
Sbjct: 405 --KDIVLVDNAAYSFGVHLENGIPIIPYYDNKDDKEL 439
>gi|207342073|gb|EDZ69950.1| YMR277Wp-like protein [Saccharomyces cerevisiae AWRI1631]
Length = 544
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 55/101 (54%), Gaps = 4/101 (3%)
Query: 115 VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGK-- 172
VK+RP ++ F + + L ++++ TM+TR YA K++D + F RI++R D NG
Sbjct: 59 VKVRPGLKEFFAKVAPLFEMHIYTMATRAYALQIAKIVDPTGELFGDRILSR-DENGSLT 117
Query: 173 DRKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
+ L + +V++DD VW+ NLI + Y +F
Sbjct: 118 TKSLAKLFPTDQSMVVVIDDRGDVWN-WCPNLIKVVPYNFF 157
>gi|68074755|ref|XP_679294.1| hypothetical protein [Plasmodium berghei strain ANKA]
gi|56500009|emb|CAH99961.1| conserved hypothetical protein [Plasmodium berghei]
Length = 983
Score = 52.8 bits (125), Expect = 2e-04, Method: Composition-based stats.
Identities = 39/109 (35%), Positives = 60/109 (55%), Gaps = 3/109 (2%)
Query: 116 KLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED-FNGKDR 174
KLRP V FL++ + +IYL TM T +A++ + LLD K+F +RI +R+D NG
Sbjct: 236 KLRPGVIEFLQKMNQKYEIYLYTMGTIEHAKSCLFLLDPLKKFFGNRIFSRKDCTNGMKH 295
Query: 175 KNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRDKELNGDHK 223
N L + I + DD+E +W + + I + Y YF + + GD K
Sbjct: 296 LNRILPTYRSISICV-DDSEYIWKE-ANSCIKVHAYNYFPEIQFLGDIK 342
>gi|392570766|gb|EIW63938.1| hypothetical protein TRAVEDRAFT_111329 [Trametes versicolor
FP-101664 SS1]
Length = 900
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 31/100 (31%), Positives = 55/100 (55%), Gaps = 2/100 (2%)
Query: 115 VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDR 174
+K RP + FLE ++ ++++ TM TR YAE +D K F +RI++R++ +
Sbjct: 264 IKPRPGLPEFLETMATKYEMHVYTMGTRAYAEEVCAAIDPGGKIFGNRILSRDESGSLTQ 323
Query: 175 KN-PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
K+ L + +VI+DD VW + + NL+ + Y +F
Sbjct: 324 KSLQRLFPCDQSMVVIIDDRADVW-EWSPNLVKVIPYDFF 362
>gi|237834315|ref|XP_002366455.1| NLI interacting factor-like phosphatase domain-containing protein
[Toxoplasma gondii ME49]
gi|211964119|gb|EEA99314.1| NLI interacting factor-like phosphatase domain-containing protein
[Toxoplasma gondii ME49]
Length = 1225
Score = 52.8 bits (125), Expect = 2e-04, Method: Composition-based stats.
Identities = 28/100 (28%), Positives = 53/100 (53%), Gaps = 1/100 (1%)
Query: 116 KLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRK 175
KLRP FL + S ++Y+ TM T +A A+++LD ++F R+ +R+D +
Sbjct: 623 KLRPGCLDFLRRVSQTFELYMYTMGTALHAATALRILDPKRRFFGRRVFSRQDAVNGLKA 682
Query: 176 NPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRD 215
+ ++ ++++DD E +W ++ I + Y YF D
Sbjct: 683 IERIFPHDQKMVLVVDDLECMWR-YSPCCIKVQGYHYFAD 721
>gi|393218252|gb|EJD03740.1| hypothetical protein FOMMEDRAFT_105888 [Fomitiporia mediterranea
MF3/22]
Length = 921
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 33/100 (33%), Positives = 53/100 (53%), Gaps = 2/100 (2%)
Query: 115 VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDR 174
VK RP + FL +S ++++ TM TR YAE +D D + F RI++R++ +
Sbjct: 274 VKPRPGWKEFLSSVASRYEMHVYTMGTRAYAEKVCAAIDPDGRLFGGRILSRDESGSLTQ 333
Query: 175 KN-PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
K+ L +VI+DD VW + + NLI + Y +F
Sbjct: 334 KSLRRLFPCDTSMVVIIDDRADVW-EWSPNLIKVIPYDFF 372
>gi|409083591|gb|EKM83948.1| hypothetical protein AGABI1DRAFT_124274 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 853
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 33/100 (33%), Positives = 53/100 (53%), Gaps = 2/100 (2%)
Query: 115 VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDR 174
+K RP + FL ++ D+++ TM TR YAE +D D F SRI++R++ +
Sbjct: 270 IKPRPGWKEFLMDMATKYDMHVYTMGTRAYAEEVCAAIDPDGSVFKSRILSRDESGSLTQ 329
Query: 175 KN-PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
K+ L +VI+DD VW + + NLI + Y +F
Sbjct: 330 KSLQRLFPCDTSMVVIIDDRADVW-EWSPNLIKVIPYDFF 368
>gi|422292668|gb|EKU19970.1| rna polymerase ii ctd phosphatase, partial [Nannochloropsis
gaditana CCMP526]
Length = 419
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 39/115 (33%), Positives = 61/115 (53%), Gaps = 20/115 (17%)
Query: 117 LRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKN 176
LRP +RTFL QA +L + + T R YA +LLD D F RI++R+D
Sbjct: 254 LRPHLRTFLSQAHALYVLTIYTHGRRDYAHQVARLLDPDRTLFEDRIVSRDDC------- 306
Query: 177 PDLVRGQER-------GI---VILDDTESVW-SDHTENLIVLGKYVYFRD-KELN 219
PDL GQ+ GI +ILDD+ VW + + +L+ + + ++ + +E+N
Sbjct: 307 PDL-HGQKSLQRLFPGGIEMALILDDSPQVWQGEQSRHLLPVLPFKFYTEFEEVN 360
>gi|387196292|gb|AFJ68751.1| rna polymerase ii ctd phosphatase, partial [Nannochloropsis
gaditana CCMP526]
Length = 414
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 39/115 (33%), Positives = 61/115 (53%), Gaps = 20/115 (17%)
Query: 117 LRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKN 176
LRP +RTFL QA +L + + T R YA +LLD D F RI++R+D
Sbjct: 249 LRPHLRTFLSQAHALYVLTIYTHGRRDYAHQVARLLDPDRTLFEDRIVSRDDC------- 301
Query: 177 PDLVRGQER-------GI---VILDDTESVW-SDHTENLIVLGKYVYFRD-KELN 219
PDL GQ+ GI +ILDD+ VW + + +L+ + + ++ + +E+N
Sbjct: 302 PDL-HGQKSLQRLFPGGIEMALILDDSPQVWQGEQSRHLLPVLPFKFYTEFEEVN 355
>gi|401409326|ref|XP_003884111.1| hypothetical protein NCLIV_045130 [Neospora caninum Liverpool]
gi|325118529|emb|CBZ54080.1| hypothetical protein NCLIV_045130 [Neospora caninum Liverpool]
Length = 1185
Score = 52.4 bits (124), Expect = 3e-04, Method: Composition-based stats.
Identities = 29/100 (29%), Positives = 53/100 (53%), Gaps = 1/100 (1%)
Query: 116 KLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRK 175
KLRP FL + S ++Y+ TM T +A A+++LD ++F R+ +R+D +
Sbjct: 649 KLRPGCLDFLRRVSQTFELYMYTMGTALHAATALRILDPGRRFFGRRVFSRQDAVNGLKA 708
Query: 176 NPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRD 215
+ + ++++DD + +WS + + V G Y YF D
Sbjct: 709 IERIFPHDRKMVLVVDDLDCMWSYNPCCIKVQG-YHYFAD 747
>gi|390356058|ref|XP_788296.3| PREDICTED: CTD small phosphatase-like protein 2-like isoform 2
[Strongylocentrotus purpuratus]
Length = 485
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 46/143 (32%), Positives = 69/143 (48%), Gaps = 12/143 (8%)
Query: 68 KLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQ 127
K LVL+LD TL+HC SL+ E F + +Q+ V+ RPF R FLE+
Sbjct: 306 KYSLVLDLDETLVHC----SLAEMENCTMSFPVYFQDNEYQV----YVRTRPFFRDFLER 357
Query: 128 ASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED---FNGKDRKNPDLVRGQE 184
S + +I L T S R YA+ + LLD + K R+ RE G K+ +++
Sbjct: 358 MSKIFEIILFTASKRVYADKLLNLLDPEKKLVRHRLF-REHCICVQGNYIKDLNILGRDL 416
Query: 185 RGIVILDDTESVWSDHTENLIVL 207
VI+D++ + EN I +
Sbjct: 417 TKTVIIDNSPQAFGYQLENGIPI 439
>gi|145498355|ref|XP_001435165.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124402295|emb|CAK67768.1| unnamed protein product [Paramecium tetraurelia]
Length = 485
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 45/154 (29%), Positives = 73/154 (47%), Gaps = 15/154 (9%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
+V +LD TL+HC ++L + YL S G Q + +RPF + L++ S
Sbjct: 297 VVFDLDETLIHCNENQNLK-ADIYLPITFPS--GDTAQAG----INIRPFAKWILQELSQ 349
Query: 131 LVDIYLCTMSTRCYAEAAVKLLD-----LDSKYFSSRIIAREDFNGKDRKNPDLVRGQER 185
L ++ + T S +CYA +K LD L + F R + D G K+ ++ +
Sbjct: 350 LCEVIVFTASHQCYASQVIKYLDPHSTLLQGQLFRDRCVLSPD--GVHIKDLRVLNRDLK 407
Query: 186 GIVILDDTESVWSDHTENLI-VLGKYVYFRDKEL 218
IV++D+ + H EN I ++ Y DKEL
Sbjct: 408 DIVLIDNAAYSFGVHLENGIPIIPYYDNKEDKEL 441
>gi|224035555|gb|ACN36853.1| unknown [Zea mays]
gi|414881338|tpg|DAA58469.1| TPA: hypothetical protein ZEAMMB73_648049 [Zea mays]
gi|414881339|tpg|DAA58470.1| TPA: hypothetical protein ZEAMMB73_648049 [Zea mays]
gi|414881340|tpg|DAA58471.1| TPA: hypothetical protein ZEAMMB73_648049 [Zea mays]
Length = 397
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 34/98 (34%), Positives = 52/98 (53%), Gaps = 10/98 (10%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKL-VKLRPFVRTFL 125
+ + LVL+LD TL+H + S + L+ F M N + VK RP+++ FL
Sbjct: 222 KHVTLVLDLDETLVHS-TLDQCDSADFTLE--------VFFNMKNHTVYVKKRPYLKVFL 272
Query: 126 EQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
E+ + + ++ + T S R YAE + LD D KY S RI
Sbjct: 273 EKVAQMFELVIFTASQRIYAEQLIDKLDPDGKYISRRI 310
>gi|440293350|gb|ELP86476.1| carboxy-terminal domain RNA polymerase II polypeptide A small
phosphatase, putative [Entamoeba invadens IP1]
Length = 213
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 41/170 (24%), Positives = 79/170 (46%), Gaps = 11/170 (6%)
Query: 45 NDSFGLSFDYM--LRGLRYSEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSF 102
+D+F DY L +++ +L ++ +LD TL+H ++ L K+ ++
Sbjct: 21 SDAFVFKIDYTPKLTETLLPPKDDERLTVIFDLDETLIHTHSL--LPEDSKHSRETCKVV 78
Query: 103 IGSLFQMANDKLVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSR 162
+ + + +RP FL Q S ++ L T S + YA+ + ++ D K F +
Sbjct: 79 VQN-----KEYTTSIRPGAIQFLRQLSKTCEVVLFTASKQVYADQIIDYMEKDGKIFEHK 133
Query: 163 IIAREDFNGKDRKNPDLVR-GQE-RGIVILDDTESVWSDHTENLIVLGKY 210
+ + N R D + G++ + +VI DD E VW+ + L+V +Y
Sbjct: 134 LYQQSCKNKFGRVYKDATKLGRDIKNVVIFDDCELVWTMTQDKLVVCKRY 183
>gi|226506682|ref|NP_001149415.1| CTD-phosphatase-like protein [Zea mays]
gi|195627078|gb|ACG35369.1| CTD-phosphatase-like protein [Zea mays]
gi|414881341|tpg|DAA58472.1| TPA: CTD-phosphatase-like protein [Zea mays]
Length = 460
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 34/98 (34%), Positives = 52/98 (53%), Gaps = 10/98 (10%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKL-VKLRPFVRTFL 125
+ + LVL+LD TL+H + S + L+ F M N + VK RP+++ FL
Sbjct: 285 KHVTLVLDLDETLVH-STLDQCDSADFTLE--------VFFNMKNHTVYVKKRPYLKVFL 335
Query: 126 EQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
E+ + + ++ + T S R YAE + LD D KY S RI
Sbjct: 336 EKVAQMFELVIFTASQRIYAEQLIDKLDPDGKYISRRI 373
>gi|390356060|ref|XP_003728694.1| PREDICTED: CTD small phosphatase-like protein 2-like isoform 1
[Strongylocentrotus purpuratus]
Length = 514
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 46/143 (32%), Positives = 69/143 (48%), Gaps = 12/143 (8%)
Query: 68 KLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQ 127
K LVL+LD TL+HC SL+ E F + +Q+ V+ RPF R FLE+
Sbjct: 335 KYSLVLDLDETLVHC----SLAEMENCTMSFPVYFQDNEYQV----YVRTRPFFRDFLER 386
Query: 128 ASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED---FNGKDRKNPDLVRGQE 184
S + +I L T S R YA+ + LLD + K R+ RE G K+ +++
Sbjct: 387 MSKIFEIILFTASKRVYADKLLNLLDPEKKLVRHRLF-REHCICVQGNYIKDLNILGRDL 445
Query: 185 RGIVILDDTESVWSDHTENLIVL 207
VI+D++ + EN I +
Sbjct: 446 TKTVIIDNSPQAFGYQLENGIPI 468
>gi|242009525|ref|XP_002425534.1| conserved hypothetical protein [Pediculus humanus corporis]
gi|212509409|gb|EEB12796.1| conserved hypothetical protein [Pediculus humanus corporis]
Length = 834
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 51/154 (33%), Positives = 76/154 (49%), Gaps = 17/154 (11%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFI-GSLFQ-MANDKLVKLRPFVRTFLEQA 128
LVL+LD TL+HC +++ L Q SF LFQ A V+ RP+ R FLE+
Sbjct: 670 LVLDLDETLVHC-SLQEL---------QDASFTFPVLFQDCAYTVFVRTRPYFREFLERV 719
Query: 129 SSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED---FNGKDRKNPDLVRGQER 185
SSL ++ L T S R YA+ + LLD ++ R+ RE NG K+ ++
Sbjct: 720 SSLFEVILFTASKRVYADKLMNLLDPKKRWIKYRLF-REHCVCVNGNYIKDLTILGRDLS 778
Query: 186 GIVILDDTESVWSDHTENLIVLGKYVYFR-DKEL 218
+I+D++ + EN I + + R D EL
Sbjct: 779 KTIIIDNSPQAFGYQLENGIPIESWFVDRNDNEL 812
>gi|242053713|ref|XP_002456002.1| hypothetical protein SORBIDRAFT_03g028730 [Sorghum bicolor]
gi|241927977|gb|EES01122.1| hypothetical protein SORBIDRAFT_03g028730 [Sorghum bicolor]
Length = 400
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 43/148 (29%), Positives = 74/148 (50%), Gaps = 14/148 (9%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKL-VKLRPFVRTFL 125
+ + LVL+LD TL+H + + + L+ F M N + V+ RP+++ FL
Sbjct: 223 KHVTLVLDLDETLVHS-TLDHCDNADFTLE--------VFFNMKNHTVYVRKRPYLKMFL 273
Query: 126 EQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED---FNGKDRKNPDLVRG 182
E+ + + ++ + T S R YAE + LD D KY S RI RE +G K+ ++R
Sbjct: 274 EKVAQMFEVVIFTASQRIYAEQLIDKLDPDGKYISRRIY-RESCIFSDGCYTKDLTILRI 332
Query: 183 QERGIVILDDTESVWSDHTENLIVLGKY 210
+ I+D+T V+ +N I + +
Sbjct: 333 DLAKVAIVDNTPQVFQLQVDNGIPIKSW 360
>gi|124513824|ref|XP_001350268.1| protein phosphatase, putative [Plasmodium falciparum 3D7]
gi|23615685|emb|CAD52677.1| protein phosphatase, putative [Plasmodium falciparum 3D7]
Length = 1288
Score = 51.6 bits (122), Expect = 5e-04, Method: Composition-based stats.
Identities = 32/105 (30%), Positives = 55/105 (52%), Gaps = 9/105 (8%)
Query: 115 VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKD- 173
+K RP+VR FL+ S ++ + T +TR YA+ + +LD D FS RI+AR +D
Sbjct: 899 LKFRPYVRQFLQILSLYYELAIYTNATREYADVVIAILDPDRTIFSDRIVARCSSTDRDE 958
Query: 174 -----RKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
R P++ + ++ DD + VW D ++ I+ ++ F
Sbjct: 959 NKYFSRIYPNV---DPKYVIAFDDRKDVWIDIPQSHILKAEHYNF 1000
>gi|449551315|gb|EMD42279.1| hypothetical protein CERSUDRAFT_148004 [Ceriporiopsis subvermispora
B]
Length = 875
Score = 51.6 bits (122), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 30/100 (30%), Positives = 55/100 (55%), Gaps = 2/100 (2%)
Query: 115 VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDR 174
+K RP + FL+ ++ ++++ TM TR YAE +D D K F R+++R++ +
Sbjct: 265 IKPRPGWQDFLQDMATKYEMHVYTMGTRAYAEEVCATIDPDGKIFGGRLLSRDESGSLTQ 324
Query: 175 KN-PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
K+ L + +VI+DD VW + + NL+ + Y +F
Sbjct: 325 KSLQRLFPCDQSMVVIIDDRADVW-EWSPNLVKVIPYDFF 363
>gi|390604450|gb|EIN13841.1| hypothetical protein PUNSTDRAFT_95201 [Punctularia strigosozonata
HHB-11173 SS5]
Length = 1229
Score = 51.6 bits (122), Expect = 5e-04, Method: Composition-based stats.
Identities = 32/100 (32%), Positives = 53/100 (53%), Gaps = 2/100 (2%)
Query: 115 VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDR 174
+K RP FL S ++++ TM TR YAE K +D + + F +RI++R++ +
Sbjct: 619 IKPRPGWHEFLHTLSEKYEMHVYTMGTRAYAEEVCKAIDPEGQIFGNRILSRDESGSLTQ 678
Query: 175 KN-PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
K+ L +VI+DD VW + + NLI + Y +F
Sbjct: 679 KSLQRLFPCDTSMVVIIDDRADVW-EWSPNLIKVIPYDFF 717
>gi|145529323|ref|XP_001450450.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124418061|emb|CAK83053.1| unnamed protein product [Paramecium tetraurelia]
Length = 442
Score = 51.6 bits (122), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 41/155 (26%), Positives = 77/155 (49%), Gaps = 17/155 (10%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
+V +LD TL+HC +SL + Y+ + S G + + +RPF + L + S
Sbjct: 254 VVFDLDETLIHCNENQSLK-ADVYIPIKFPS--GDVVSAG----INVRPFAKWILTELSK 306
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSR------IIAREDFNGKDRKNPDLVRGQE 184
L ++ + T S +CYA + LD +++ S++ +++ E + KD + + +
Sbjct: 307 LCEVIVFTASHQCYASQVIAHLDPKNQFLSAQVFRDGCVLSTEGVHVKDLR---IFKRDL 363
Query: 185 RGIVILDDTESVWSDHTENLI-VLGKYVYFRDKEL 218
+ IV++D+ + H EN I ++ Y DKEL
Sbjct: 364 KDIVLVDNAAYSFGMHLENGIPIIPYYDNQEDKEL 398
>gi|47230493|emb|CAF99686.1| unnamed protein product [Tetraodon nigroviridis]
Length = 2418
Score = 51.2 bits (121), Expect = 6e-04, Method: Composition-based stats.
Identities = 45/155 (29%), Positives = 74/155 (47%), Gaps = 14/155 (9%)
Query: 68 KLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQ 127
+ LVL+LD TL+HC SL+ E F ++Q+ V+LRPF R FLE+
Sbjct: 323 EFSLVLDLDETLVHC----SLNELEDAALTFPVLFQDVIYQV----YVRLRPFFREFLER 374
Query: 128 ASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED---FNGKDRKNPDLVRGQE 184
S + +I L T S + YA+ + +LD + R+ RE G K+ +++
Sbjct: 375 MSQIYEIILFTASKKVYADKLLNILDPKKQLVRHRLF-REHCVCVQGNYIKDLNILGRDL 433
Query: 185 RGIVILDDTESVWSDHTENLIVLGKYVYFRDKELN 219
+I+D++ ++ N I + +F DK N
Sbjct: 434 SKTIIIDNSPQAFAYQLSNGIPIES--WFMDKNDN 466
>gi|118371686|ref|XP_001019041.1| NLI interacting factor-like phosphatase family protein [Tetrahymena
thermophila]
gi|89300808|gb|EAR98796.1| NLI interacting factor-like phosphatase family protein [Tetrahymena
thermophila SB210]
Length = 379
Score = 51.2 bits (121), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 36/89 (40%), Positives = 50/89 (56%), Gaps = 8/89 (8%)
Query: 65 EERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTF 124
EE LVL+LD TL+HC N KSL+ + Q + Q N L + R +++ F
Sbjct: 206 EEHPNNLVLDLDETLIHC-NEKSLNDDSSIITVQFQN------QQKNYYLHQ-RGYLQEF 257
Query: 125 LEQASSLVDIYLCTMSTRCYAEAAVKLLD 153
LEQ + +IY+ T STR YAE VK++D
Sbjct: 258 LEQCALNFNIYIYTASTRDYAEEVVKIID 286
>gi|399215917|emb|CCF72605.1| unnamed protein product [Babesia microti strain RI]
Length = 664
Score = 51.2 bits (121), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 37/106 (34%), Positives = 52/106 (49%), Gaps = 10/106 (9%)
Query: 115 VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKD- 173
+KLRP +R FL S ++ + T +TR YA+ + +LD D F RIIAR N +
Sbjct: 229 LKLRPRLREFLHILSFYYEMSIYTNATREYADVVIAILDPDRSLFMDRIIARGGGNDRGL 288
Query: 174 -----RKNPDLVRGQERGIVILDDTESVWSDHTENLIVLG-KYVYF 213
R P L +R +V DD VW+D N ++ Y YF
Sbjct: 289 TKSARRLYPKL---SQRFVVSFDDRRDVWTDIDPNQVLKAHHYSYF 331
>gi|145513564|ref|XP_001442693.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124410046|emb|CAK75296.1| unnamed protein product [Paramecium tetraurelia]
Length = 351
Score = 51.2 bits (121), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 36/157 (22%), Positives = 74/157 (47%), Gaps = 18/157 (11%)
Query: 58 GLRYSEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKL 117
L+Y + ++KL++ +LD TL+H I+ K +++ + + F V +
Sbjct: 157 SLQYQGKSQKKLKIAFDLDETLIHTEPIQ---------KDKVYDYQNNEFG------VFI 201
Query: 118 RPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDR--- 174
RP+ R L++ S L D+++ T + + YA+ + L+D ++ YF + R
Sbjct: 202 RPYCRHVLKELSLLADLFVFTSANQKYAKTIINLIDPENTYFKGHFCRNHCITLQSRIQL 261
Query: 175 KNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYV 211
K+ ++ IVI+D++ + N I + Y+
Sbjct: 262 KHLGILSNDFSNIVIIDNSPIFYMGQPYNGIPIAPYI 298
>gi|147772503|emb|CAN60776.1| hypothetical protein VITISV_018840 [Vitis vinifera]
Length = 398
Score = 50.8 bits (120), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 31/95 (32%), Positives = 43/95 (45%), Gaps = 14/95 (14%)
Query: 26 SCAHTTVRDSRCIFCSQAMNDSFGLSFDYMLRGLRYSEQE--------------ERKLQL 71
+C H V CI C Q M G++F Y+ + LR E +KL L
Sbjct: 259 TCTHPGVFRELCIRCGQKMEGGSGVAFGYIHKDLRLGSDEIARLRDTDLKNLLRHKKLYL 318
Query: 72 VLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSL 106
VL+LDHTLL+ + ++ E YLK Q G +
Sbjct: 319 VLDLDHTLLNSTRLLDITPEELYLKNQTDPLQGMI 353
>gi|70945368|ref|XP_742511.1| hypothetical protein [Plasmodium chabaudi chabaudi]
gi|56521536|emb|CAH80727.1| conserved hypothetical protein [Plasmodium chabaudi chabaudi]
Length = 359
Score = 50.8 bits (120), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 47/185 (25%), Positives = 84/185 (45%), Gaps = 28/185 (15%)
Query: 115 VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDR 174
+K RP+VR FLE S ++ + T +TR YA+ + +LD D F+ RI+AR +D
Sbjct: 4 LKFRPYVRQFLEILSLYYELSIYTNATREYADVVIAILDPDRTIFADRIVARCSSVDRDE 63
Query: 175 KN------PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRD----------KEL 218
P++ + ++ DD + VW D ++ I+ ++ F + KE
Sbjct: 64 NKHFEKIYPNV---DPKYVIAFDDRKDVWYDIPDSHILRAEHYNFFELSKYDIISHFKEP 120
Query: 219 NGDHKSYSETLTDESENEEALANVLRVLKTIHRLFFDSVCG-DVRTYLPKVRSEFSRDV- 276
N K + + + L ++++ IH+ FF++ DV + + DV
Sbjct: 121 NTCKKRFVDM-------DMHLHYMIKIFLKIHKQFFENPLNVDVGKIIDNIMLSTLSDVG 173
Query: 277 LYFSA 281
LYF+
Sbjct: 174 LYFTG 178
>gi|125571265|gb|EAZ12780.1| hypothetical protein OsJ_02697 [Oryza sativa Japonica Group]
Length = 576
Score = 50.8 bits (120), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 46/157 (29%), Positives = 72/157 (45%), Gaps = 32/157 (20%)
Query: 67 RKLQLVLNLDHTLLH-----CRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKL-VKLRPF 120
+++ LVL+LD TL+H C N+ Q+ F M N + V+ RP
Sbjct: 399 KQITLVLDLDETLVHSTLDHCDNVD--------FTLQV------FFNMKNHTVYVRQRPH 444
Query: 121 VRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI-----IAREDFNGKDRK 175
++ FLE+ + + D+ + T S R YAE + LD D + S RI I E KD
Sbjct: 445 LKMFLEKVAQMFDLVIFTASQRIYAEQLIDRLDPDGRLISHRIYRESCIFSEGCYTKDLT 504
Query: 176 --NPDLVRGQERGIVILDDTESVWSDHTENLIVLGKY 210
DL + +VI+D+T V+ +N I + +
Sbjct: 505 ILGVDLAK-----VVIVDNTPQVFQLQVDNGIPIKSW 536
>gi|359494479|ref|XP_002266587.2| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
4-like isoform 2 [Vitis vinifera]
Length = 193
Score = 50.8 bits (120), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 30/87 (34%), Positives = 41/87 (47%), Gaps = 14/87 (16%)
Query: 26 SCAHTTVRDSRCIFCSQAMNDSFGLSFDYMLRGLRYSEQE--------------ERKLQL 71
+C H V CI C Q M G++F Y+ + LR E +KL L
Sbjct: 91 TCTHPGVFRELCIRCGQKMEGGSGVAFGYIHKDLRLGSDEIARLRDTDLKNLLRHKKLYL 150
Query: 72 VLNLDHTLLHCRNIKSLSSGEKYLKKQ 98
VL+LDHTLL+ + ++ E YLK Q
Sbjct: 151 VLDLDHTLLNSTRLLDITPEELYLKNQ 177
>gi|350579777|ref|XP_003122350.3| PREDICTED: RNA polymerase II subunit A C-terminal domain
phosphatase-like, partial [Sus scrofa]
Length = 284
Score = 50.8 bits (120), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 32/105 (30%), Positives = 59/105 (56%), Gaps = 12/105 (11%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLV---KLRPFVRT 123
RKL L+++LD TL+H + E++ ++ + I FQ+ + + +LRP +
Sbjct: 178 RKLVLMVDLDQTLIH--------TTEQHCQQMSNKGIFH-FQLGRGEPMLHTRLRPHCKE 228
Query: 124 FLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED 168
FLE+ + L ++++ T +R YA LD + K FS RI++R++
Sbjct: 229 FLEKIAQLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDE 273
>gi|223943303|gb|ACN25735.1| unknown [Zea mays]
Length = 342
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 33/106 (31%), Positives = 56/106 (52%), Gaps = 10/106 (9%)
Query: 59 LRYSEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKL-VKL 117
L + +++ + LVL+LD TL+H + + + L+ F M N + V+
Sbjct: 157 LSKTPVKKKHVTLVLDLDETLVHS-TLDHCDNADFTLE--------VFFNMKNHTVYVRK 207
Query: 118 RPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
RP+++ FLE+ + + ++ + T S R YAE + LD D KY S RI
Sbjct: 208 RPYLKMFLEKVAQMFEVVIFTASQRVYAEQLIDKLDPDGKYISRRI 253
>gi|413950699|gb|AFW83348.1| hypothetical protein ZEAMMB73_634755 [Zea mays]
Length = 400
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 33/106 (31%), Positives = 56/106 (52%), Gaps = 10/106 (9%)
Query: 59 LRYSEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKL-VKL 117
L + +++ + LVL+LD TL+H + + + L+ F M N + V+
Sbjct: 215 LSKTPVKKKHVTLVLDLDETLVHS-TLDHCDNADFTLE--------VFFNMKNHTVYVRK 265
Query: 118 RPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
RP+++ FLE+ + + ++ + T S R YAE + LD D KY S RI
Sbjct: 266 RPYLKMFLEKVAQMFEVVIFTASQRVYAEQLIDKLDPDGKYISRRI 311
>gi|395334832|gb|EJF67208.1| hypothetical protein DICSQDRAFT_142769 [Dichomitus squalens
LYAD-421 SS1]
Length = 953
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 30/100 (30%), Positives = 55/100 (55%), Gaps = 2/100 (2%)
Query: 115 VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDR 174
+K RP + FL+ ++ ++++ TM TR YAE +D K F +RI++R++ +
Sbjct: 288 IKPRPGLLDFLQTMATKYEMHVYTMGTRAYAEEVCAAIDPGGKIFGNRILSRDESGSLTQ 347
Query: 175 KN-PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
K+ L + +VI+DD VW + + NL+ + Y +F
Sbjct: 348 KSLQRLFPCDQSMVVIIDDRADVW-EWSPNLVKVIPYDFF 386
>gi|297799336|ref|XP_002867552.1| hypothetical protein ARALYDRAFT_913891 [Arabidopsis lyrata subsp.
lyrata]
gi|297313388|gb|EFH43811.1| hypothetical protein ARALYDRAFT_913891 [Arabidopsis lyrata subsp.
lyrata]
Length = 113
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 32/98 (32%), Positives = 53/98 (54%), Gaps = 6/98 (6%)
Query: 173 DRKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRDK-ELNGDHKSYSETLTD 231
D N ++ E ++I+DDT +W NL+ + KY+YF ++ +SY+E D
Sbjct: 10 DLSNHSILVVDELRVIIVDDTVDIWPHDKRNLLQITKYIYFSVAVSIDKRWRSYAEVKRD 69
Query: 232 ESENEEALANVLRVLKTIHRLF---FDSVCGDVRTYLP 266
ES + +LANVL+ L +H+ + DS D+R +P
Sbjct: 70 ESLSNGSLANVLKFLVYVHKRYEKKLDS--KDLRLLIP 105
>gi|293332237|ref|NP_001167877.1| uncharacterized protein LOC100381584 [Zea mays]
gi|223944585|gb|ACN26376.1| unknown [Zea mays]
gi|413950698|gb|AFW83347.1| hypothetical protein ZEAMMB73_634755 [Zea mays]
Length = 419
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 33/106 (31%), Positives = 56/106 (52%), Gaps = 10/106 (9%)
Query: 59 LRYSEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKL-VKL 117
L + +++ + LVL+LD TL+H + + + L+ F M N + V+
Sbjct: 234 LSKTPVKKKHVTLVLDLDETLVH-STLDHCDNADFTLE--------VFFNMKNHTVYVRK 284
Query: 118 RPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
RP+++ FLE+ + + ++ + T S R YAE + LD D KY S RI
Sbjct: 285 RPYLKMFLEKVAQMFEVVIFTASQRVYAEQLIDKLDPDGKYISRRI 330
>gi|268566879|ref|XP_002639837.1| C. briggsae CBR-SCPL-3 protein [Caenorhabditis briggsae]
Length = 294
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 33/93 (35%), Positives = 46/93 (49%), Gaps = 8/93 (8%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
LVL+LD TL+HC SL+ YL F M V++RPF+RTFL + S
Sbjct: 67 LVLDLDETLVHC----SLN----YLDNSNMVFPVDFQGMTYQVYVRIRPFLRTFLTRMSK 118
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
+ +I + T S +CYA +LD R+
Sbjct: 119 VFEIIVFTASKKCYANKLCDILDPQKTIIKHRL 151
>gi|82541597|ref|XP_725029.1| NLI interacting factor [Plasmodium yoelii yoelii 17XNL]
gi|23479881|gb|EAA16594.1| NLI interacting factor, putative [Plasmodium yoelii yoelii]
Length = 1177
Score = 50.4 bits (119), Expect = 0.001, Method: Composition-based stats.
Identities = 38/151 (25%), Positives = 72/151 (47%), Gaps = 12/151 (7%)
Query: 115 VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDR 174
+K RP+VR FLE S ++ + T +TR YA+ + +LD D F+ RI+AR +D
Sbjct: 779 LKFRPYVRQFLEILSLYYELSIYTNATREYADVVIAILDPDRTIFADRIVARCSSVDRDE 838
Query: 175 KN------PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRDKELNGDHKSYSET 228
P++ + ++ DD + VW D + I+ ++ F + + E
Sbjct: 839 NKHFEKIYPNV---DPKYVIAFDDRKDVWFDIPHSHILRAEHYNFFELSKYDIISHFKEP 895
Query: 229 LTDES---ENEEALANVLRVLKTIHRLFFDS 256
T + + + L ++++ IH+ FF++
Sbjct: 896 STCKKRFVDMDMHLHYMIKIFLKIHKQFFEN 926
>gi|297740632|emb|CBI30814.3| unnamed protein product [Vitis vinifera]
Length = 479
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 33/100 (33%), Positives = 55/100 (55%), Gaps = 10/100 (10%)
Query: 64 QEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKL-VKLRPFVR 122
++++ + LVL+LD TL+H S ++ +F F M + + VK RP++
Sbjct: 302 RKKKSITLVLDLDETLVH--------STLEHCDDADFTF-PVFFNMKDHTVYVKQRPYLH 352
Query: 123 TFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSR 162
TFLE+ + + +I + T S YAE + +LD D K+FS R
Sbjct: 353 TFLERVAEMFEIVVFTASQSIYAEQLLDILDPDGKFFSHR 392
>gi|70921595|ref|XP_734099.1| hypothetical protein [Plasmodium chabaudi chabaudi]
gi|56506520|emb|CAH86297.1| hypothetical protein PC301933.00.0 [Plasmodium chabaudi chabaudi]
Length = 212
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 47/185 (25%), Positives = 84/185 (45%), Gaps = 28/185 (15%)
Query: 115 VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDR 174
+K RP+VR FLE S ++ + T +TR YA+ + +LD D F+ RI+AR +D
Sbjct: 25 LKFRPYVRQFLEILSLYYELSIYTNATREYADVVIAILDPDRTIFADRIVARCSSVDRDE 84
Query: 175 KN------PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRD----------KEL 218
P++ + ++ DD + VW D ++ I+ ++ F + KE
Sbjct: 85 NKHFEKIYPNV---DPKYVIAFDDRKDVWYDIPDSHILRAEHYNFFELSKYDIISHFKEP 141
Query: 219 NGDHKSYSETLTDESENEEALANVLRVLKTIHRLFFDSVCG-DVRTYLPKVRSEFSRDV- 276
N K + + + L ++++ IH+ FF++ DV + + DV
Sbjct: 142 NTCKKRFVDM-------DMHLHYMIKIFLKIHKQFFENPLNVDVGKIIDNIMLSTLSDVG 194
Query: 277 LYFSA 281
LYF+
Sbjct: 195 LYFTG 199
>gi|409051930|gb|EKM61406.1| hypothetical protein PHACADRAFT_204575 [Phanerochaete carnosa
HHB-10118-sp]
Length = 863
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 53/100 (53%), Gaps = 2/100 (2%)
Query: 115 VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDR 174
+K RP FLE + ++++ TM TR YAE +D D K F R+++R++ +
Sbjct: 259 IKPRPGWNEFLEDMAEKYEMHVYTMGTRAYAEEVCAAIDPDGKIFGGRLLSRDESGSLTQ 318
Query: 175 KN-PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
K+ L + +V++DD VW + + NL+ + + +F
Sbjct: 319 KSLQRLFPCDQSMVVVIDDRADVW-EWSPNLVKVIPFEFF 357
>gi|145529526|ref|XP_001450546.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124418168|emb|CAK83149.1| unnamed protein product [Paramecium tetraurelia]
Length = 591
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 44/153 (28%), Positives = 72/153 (47%), Gaps = 14/153 (9%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
+V +LD TL+HC + + S E YL S G Q + +RP+ + L Q S
Sbjct: 402 VVFDLDETLIHCNEDQKMKS-EVYLPITFPS--GDTVQAG----INIRPWAKQILNQLSE 454
Query: 131 LVDIYLCTMSTRCYAEAAVKLLD----LDSKYFSSRIIAREDFNGKDRKNPDLVRGQERG 186
+ ++ + T S +CYA ++ LD L ++ F I D G K+ ++ +
Sbjct: 455 VCEVVVFTASHQCYASQVIQFLDHKKILTAQLFRESCIVTND--GVHIKDLRVLGRDMKD 512
Query: 187 IVILDDTESVWSDHTENLIVLGKYVYFR-DKEL 218
IV++D+ + H EN I + Y + DKEL
Sbjct: 513 IVLIDNAAYSFGYHIENGIPIIPYYDNKDDKEL 545
>gi|147839779|emb|CAN65912.1| hypothetical protein VITISV_035567 [Vitis vinifera]
Length = 482
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 33/100 (33%), Positives = 55/100 (55%), Gaps = 10/100 (10%)
Query: 64 QEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKL-VKLRPFVR 122
++++ + LVL+LD TL+H S ++ +F F M + + VK RP++
Sbjct: 305 RKKKSITLVLDLDETLVH--------STLEHCDDADFTF-PVFFNMKDHTVYVKQRPYLH 355
Query: 123 TFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSR 162
TFLE+ + + +I + T S YAE + +LD D K+FS R
Sbjct: 356 TFLERVAEMFEIVVFTASQSIYAEQLLDILDPDGKFFSHR 395
>gi|402467220|gb|EJW02558.1| FCP1-like phosphatase, phosphatase domain-containing protein
[Edhazardia aedis USNM 41457]
Length = 905
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 30/105 (28%), Positives = 55/105 (52%), Gaps = 2/105 (1%)
Query: 115 VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDR 174
+ LRPF+ L ++++ TM YA+ K++D F +RII R++ N +
Sbjct: 240 IALRPFLEKLL-SLDEKYEMHIYTMGNNQYAQKVKKIIDPTGTIFGNRIITRDENNQELF 298
Query: 175 KNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRDKELN 219
K+ D IV++DD VW+ + N++ + + +FRD ++N
Sbjct: 299 KSLDRFSTNHDNIVVIDDRIDVWN-FSVNVVGVRPFWFFRDGDIN 342
>gi|221055253|ref|XP_002258765.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
knowlesi strain H]
gi|193808835|emb|CAQ39537.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
knowlesi strain H]
Length = 1474
Score = 50.1 bits (118), Expect = 0.002, Method: Composition-based stats.
Identities = 35/99 (35%), Positives = 56/99 (56%), Gaps = 3/99 (3%)
Query: 116 KLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED-FNGKDR 174
KLRP V FL++ + +IYL TM T +A++ + LLD K+F +R+ +R+D NG
Sbjct: 557 KLRPGVIQFLQKMNKKYEIYLYTMGTLEHAKSCLLLLDPLKKFFGNRVFSRKDSVNGLKH 616
Query: 175 KNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
N L + + I DD++ +W + + + V G Y YF
Sbjct: 617 LNRILPTYRSVSLCI-DDSDYMWKESSSCIKVHG-YNYF 653
>gi|387594493|gb|EIJ89517.1| hypothetical protein NEQG_00287 [Nematocida parisii ERTm3]
gi|387596665|gb|EIJ94286.1| hypothetical protein NEPG_00953 [Nematocida parisii ERTm1]
Length = 310
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 43/144 (29%), Positives = 67/144 (46%), Gaps = 11/144 (7%)
Query: 139 MSTRCYAEAAVKLLDLDSKYFSSRIIARED-FNGKDRKNPDLVRGQERGIVILDDTESVW 197
M + YA + LLD K F SRII+R+D F D+ L + +VILDD VW
Sbjct: 1 MGNKSYACSIAGLLDPTGKLFGSRIISRDDNFGCFDKDIKRLFPTNSKHVVILDDRPDVW 60
Query: 198 SDHTENLIVLGKYVYFRDKELNGDH--KSYSETLTDESENE---EALANVLRVLKTIHR- 251
+NL + Y YF+ ++N + L+++ N E N +++ I R
Sbjct: 61 G-FVDNLYPIRPYYYFQTDDINSPEALQGMKSALSEDVRNSPVGEVFRNKNDLIELIDRE 119
Query: 252 ---LFFDSVCGDVRTYLPKVRSEF 272
+FD+ V + L +V +EF
Sbjct: 120 CILTYFDNELEKVLSGLKEVHTEF 143
>gi|225463384|ref|XP_002271705.1| PREDICTED: uncharacterized protein LOC100258847 [Vitis vinifera]
Length = 484
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 33/100 (33%), Positives = 55/100 (55%), Gaps = 10/100 (10%)
Query: 64 QEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKL-VKLRPFVR 122
++++ + LVL+LD TL+H S ++ +F F M + + VK RP++
Sbjct: 307 RKKKSITLVLDLDETLVH--------STLEHCDDADFTF-PVFFNMKDHTVYVKQRPYLH 357
Query: 123 TFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSR 162
TFLE+ + + +I + T S YAE + +LD D K+FS R
Sbjct: 358 TFLERVAEMFEIVVFTASQSIYAEQLLDILDPDGKFFSHR 397
>gi|299472381|emb|CBN77569.1| putative nuclear LIM interactor-interacting protein [Ectocarpus
siliculosus]
Length = 602
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 31/97 (31%), Positives = 53/97 (54%), Gaps = 8/97 (8%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLE 126
++L LVL+LD TL+HC ++ ++ ++H F G FQ+ V+ RP + FLE
Sbjct: 361 KELTLVLDLDETLVHCTVDPIVNPDHRF---EVH-FNGEEFQV----YVRKRPHLDAFLE 412
Query: 127 QASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
S L ++ + T S + YAE + ++D K+ R+
Sbjct: 413 AVSELFEVVVFTASQQVYAERLLNMIDPQKKFVKYRL 449
>gi|358335312|dbj|GAA53844.1| CTD small phosphatase-like protein 2 [Clonorchis sinensis]
Length = 498
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 46/171 (26%), Positives = 83/171 (48%), Gaps = 13/171 (7%)
Query: 52 FDYMLRGLRYSEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMAN 111
Y L L + + LVL+LD TL+HC ++ L + ++ + + F G ++ +
Sbjct: 290 LSYQLPALPKRTRSAPEFCLVLDLDETLVHC-SLTPLPDAQ-FIFQVV--FQGVVYMV-- 343
Query: 112 DKLVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED--- 168
V++RP + FL + S ++ L T ST+ YA+ V L+D K+ R+ RE
Sbjct: 344 --YVRIRPHLYEFLSRVSERFEVVLFTASTKVYADRLVNLIDPKKKWIKHRLF-REHCVC 400
Query: 169 FNGKDRKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGK-YVYFRDKEL 218
NG K+ ++ R VI+D++ + +N + + +V D+EL
Sbjct: 401 VNGNYVKDLRVLGRDLRKTVIVDNSPQAFGYQLDNGVPIESWFVDSNDREL 451
>gi|209877977|ref|XP_002140430.1| NLI interacting factor-like phosphatase family protein
[Cryptosporidium muris RN66]
gi|209556036|gb|EEA06081.1| NLI interacting factor-like phosphatase family protein
[Cryptosporidium muris RN66]
Length = 356
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 30/97 (30%), Positives = 54/97 (55%), Gaps = 7/97 (7%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLE 126
R L +VL++D TL+HC + + L +G + + + ++ V RP+++ FL+
Sbjct: 166 RSLFMVLDMDETLVHC-SFEILENGME------PDLLVDIIPFSSPWCVYFRPYLQLFLQ 218
Query: 127 QASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
AS L D+ + T ST+ YAE +K +D + KY ++
Sbjct: 219 YASYLGDLCIFTASTKTYAEKVLKSIDPNGKYIRYKL 255
>gi|156088257|ref|XP_001611535.1| Dullard-like phosphatase domain containing protein [Babesia bovis]
gi|154798789|gb|EDO07967.1| Dullard-like phosphatase domain containing protein [Babesia bovis]
Length = 278
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 50/170 (29%), Positives = 75/170 (44%), Gaps = 22/170 (12%)
Query: 57 RGLRYSEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIG-SLFQMANDKLV 115
+ YS RK LVL+LD TL+H ++ G+ +I G SL V
Sbjct: 80 KAATYSLDTPRKKTLVLDLDETLIHSSTFRT---GKHQTLVEIVGDTGISLVS------V 130
Query: 116 KLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAR------EDF 169
LRPF R F+ A+ + ++ + T + YA + LLD + RI AR F
Sbjct: 131 SLRPFAREFIAAATRMFEVVIFTAAGCKYANPIIDLLDCE-----RRIHARLFREHCTTF 185
Query: 170 NGKDRKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFR-DKEL 218
N K+ + + IVI+D+T + H N I + + R D+EL
Sbjct: 186 NQHIIKDLSMFDRDSKDIVIIDNTPISYFLHPHNAIPISSWHDNRSDREL 235
>gi|449270631|gb|EMC81290.1| CTD small phosphatase-like protein 2 [Columba livia]
Length = 468
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 34/93 (36%), Positives = 49/93 (52%), Gaps = 8/93 (8%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
LVL+LD TL+HC SL+ E F ++Q+ V+LRPF R FLE+ S
Sbjct: 292 LVLDLDETLVHC----SLNELEDAALTFPVLFQDVIYQV----YVRLRPFFREFLERMSQ 343
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
+ +I L T S + YA+ + +LD K R+
Sbjct: 344 IYEIILFTASKKVYADKLLNILDPKKKLVRHRL 376
>gi|125526935|gb|EAY75049.1| hypothetical protein OsI_02945 [Oryza sativa Indica Group]
Length = 577
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 45/157 (28%), Positives = 72/157 (45%), Gaps = 32/157 (20%)
Query: 67 RKLQLVLNLDHTLLH-----CRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKL-VKLRPF 120
+++ LVL+LD TL+H C N+ Q+ F M N + V+ RP
Sbjct: 400 KQITLVLDLDETLVHSTLDHCDNVD--------FTLQV------FFNMKNHTVYVRQRPH 445
Query: 121 VRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI-----IAREDFNGKDRK 175
++ FLE+ + + ++ + T S R YAE + LD D + S RI I E KD
Sbjct: 446 LKMFLEKVAQMFELVIFTASQRIYAEQLIDRLDPDGRLISHRIYRESCIFSEGCYTKDLT 505
Query: 176 --NPDLVRGQERGIVILDDTESVWSDHTENLIVLGKY 210
DL + +VI+D+T V+ +N I + +
Sbjct: 506 ILGVDLAK-----VVIVDNTPQVFQLQVDNGIPIKSW 537
>gi|68068525|ref|XP_676173.1| hypothetical protein [Plasmodium berghei strain ANKA]
gi|56495746|emb|CAI00611.1| conserved hypothetical protein [Plasmodium berghei]
Length = 953
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 31/105 (29%), Positives = 53/105 (50%), Gaps = 9/105 (8%)
Query: 115 VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDR 174
+K RP+VR FLE S ++ + T +TR YA+ + +LD D F+ RI+AR +D
Sbjct: 618 LKFRPYVRQFLEILSLYYELSIYTNATREYADVVIAILDPDRTIFADRIVARCSSVDRDE 677
Query: 175 KN------PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
P++ + ++ DD + VW D + I+ ++ F
Sbjct: 678 NKHFEKIYPNV---DPKYVIAFDDRKDVWFDIPHSHILRAEHYNF 719
>gi|157125124|ref|XP_001660632.1| hypothetical protein AaeL_AAEL010078 [Aedes aegypti]
gi|108873763|gb|EAT37988.1| AAEL010078-PA [Aedes aegypti]
Length = 678
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 44/144 (30%), Positives = 71/144 (49%), Gaps = 14/144 (9%)
Query: 68 KLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMAN-DKLVKLRPFVRTFLE 126
+ LVL+LD TL+HC +++ LS + K + LFQ V+ RPF R FLE
Sbjct: 499 EFSLVLDLDETLVHC-SLQELS--DASFKFPV------LFQECKYTVFVRTRPFFREFLE 549
Query: 127 QASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED---FNGKDRKNPDLVRGQ 183
+ S + ++ L T S R YA+ + LLD + + R+ RE NG K+ ++
Sbjct: 550 KVSQIFEVILFTASKRVYADKLLNLLDPERRLIKYRLF-REHCVLVNGNYIKDLTILGRD 608
Query: 184 ERGIVILDDTESVWSDHTENLIVL 207
+I+D++ + EN I +
Sbjct: 609 LSKTIIIDNSPQAFGYQLENGIPI 632
>gi|291403116|ref|XP_002717973.1| PREDICTED: CTD (carboxy-terminal domain, RNA polymerase II,
polypeptide A) small phosphatase like 2 [Oryctolagus
cuniculus]
Length = 286
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 45/155 (29%), Positives = 74/155 (47%), Gaps = 14/155 (9%)
Query: 68 KLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQ 127
+ LVL+LD TL+HC SL+ E F ++Q+ V+LRPF R FLE+
Sbjct: 107 EFSLVLDLDETLVHC----SLNELEDAALTFPVLFQDVIYQV----YVRLRPFFREFLER 158
Query: 128 ASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED---FNGKDRKNPDLVRGQE 184
S + +I L T S + YA+ + +LD + R+ RE G K+ +++
Sbjct: 159 MSQMYEIILFTASKKVYADKLLNILDPKKQLVRHRLF-REHCVCVQGNYIKDLNILGRDL 217
Query: 185 RGIVILDDTESVWSDHTENLIVLGKYVYFRDKELN 219
+I+D++ ++ N I + +F DK N
Sbjct: 218 SKTIIIDNSPQAFAYQLSNGIPIES--WFMDKNDN 250
>gi|124802229|ref|XP_001347409.1| protein phosphatase, putative [Plasmodium falciparum 3D7]
gi|23494988|gb|AAN35322.1| protein phosphatase, putative [Plasmodium falciparum 3D7]
Length = 1438
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 37/101 (36%), Positives = 55/101 (54%), Gaps = 3/101 (2%)
Query: 116 KLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED-FNGKDR 174
KLRP V FL S +IYL TM T +A++ + LLD K+F +R+ +R+D N
Sbjct: 575 KLRPGVIEFLRTMSEKYEIYLYTMGTLEHAKSCLFLLDPLRKFFGNRVFSRKDCLNSLKH 634
Query: 175 KNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRD 215
N L + I I DD++ +W +++ + V G Y YF D
Sbjct: 635 LNKILPTYRSVSICI-DDSDYIWKENSSCIKVHG-YNYFPD 673
>gi|118390259|ref|XP_001028120.1| NLI interacting factor-like phosphatase family protein [Tetrahymena
thermophila]
gi|89309890|gb|EAS07878.1| NLI interacting factor-like phosphatase family protein [Tetrahymena
thermophila SB210]
Length = 623
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 40/135 (29%), Positives = 67/135 (49%), Gaps = 22/135 (16%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLE 126
+K L+L+LD TL+HC +SL + ++ I + + Q + +RPF + FLE
Sbjct: 432 KKKTLILDLDETLIHCN--ESLDNSSDFIL-DIQADSKEVVQAG----INVRPFAKQFLE 484
Query: 127 QASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDR---------KNP 177
+ S L +I + T S YA + LD +K+ R+ RE+ K+R KN
Sbjct: 485 EMSHLYEIVIFTASRSVYANEVINKLDPQNKFIFKRLF-RENCIYKNRIYIKDLRIFKNR 543
Query: 178 DLVRGQERGIVILDD 192
D+ + +VI+D+
Sbjct: 544 DI-----KNLVIVDN 553
>gi|219126682|ref|XP_002183580.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217404817|gb|EEC44762.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 224
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 30/96 (31%), Positives = 49/96 (51%), Gaps = 8/96 (8%)
Query: 68 KLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQ 127
+ LVL+LD TL+HC ++ + + + H+ +Q+ V+LRP + TFL +
Sbjct: 43 PITLVLDLDETLVHC-TVEPVENADLTFPVDFHNVT---YQVH----VRLRPHLFTFLSR 94
Query: 128 ASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
+I L T S + YA + +D D KYF R+
Sbjct: 95 IEGQYEIVLFTASQKVYANELLNRIDPDGKYFHHRL 130
>gi|7022613|dbj|BAA91664.1| unnamed protein product [Homo sapiens]
Length = 286
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 45/155 (29%), Positives = 74/155 (47%), Gaps = 14/155 (9%)
Query: 68 KLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQ 127
+ LVL+LD TL+HC SL+ E F ++Q+ V+LRPF R FLE+
Sbjct: 107 EFSLVLDLDETLVHC----SLNELEDAALTFPVLFQDVVYQV----YVRLRPFFREFLER 158
Query: 128 ASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED---FNGKDRKNPDLVRGQE 184
S + +I L T S + YA+ + +LD + R+ RE G K+ +++
Sbjct: 159 MSQMYEIILFTASKKVYADKLLNILDPKKQLVRHRLF-REHCVCVQGNYIKDLNILGRDL 217
Query: 185 RGIVILDDTESVWSDHTENLIVLGKYVYFRDKELN 219
+I+D++ ++ N I + +F DK N
Sbjct: 218 SKTIIIDNSPQAFAYQLSNGIPIES--WFMDKNDN 250
>gi|414881093|tpg|DAA58224.1| TPA: hypothetical protein ZEAMMB73_373456 [Zea mays]
gi|414881094|tpg|DAA58225.1| TPA: hypothetical protein ZEAMMB73_373456 [Zea mays]
Length = 442
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 58/210 (27%), Positives = 91/210 (43%), Gaps = 42/210 (20%)
Query: 63 EQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLV--KLRPF 120
EQ R + LVL+LD TL+H S K+ +F S+F + +V K RP
Sbjct: 261 EQWTRNVTLVLDLDETLVH--------STMKHCDDADFTF--SMFYDMKEHVVYVKKRPH 310
Query: 121 VRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKD---RKNP 177
V FL++ + ++ + T S YA+ + +LD + K FS R RE D RK+
Sbjct: 311 VHMFLQRMVEMFEVVIFTASQSVYADQLLDMLDPEKKLFSKRFF-RESCLITDSGYRKDL 369
Query: 178 DLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRDKELNGDHKSYSETLTDESENEE 237
+V + I+D+T V+ N I + + YS L +E
Sbjct: 370 TVVGVDLAKVAIIDNTPQVFELQVNNGIPIESW--------------YSNPL------DE 409
Query: 238 ALANVLRVLKTIHRLFFDSVCGDVRTYLPK 267
AL ++ L+T+ +V DVR + K
Sbjct: 410 ALPQLIPFLETL------AVADDVRPIIAK 433
>gi|350578733|ref|XP_003480441.1| PREDICTED: CTD small phosphatase-like protein 2-like [Sus scrofa]
Length = 355
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 33/96 (34%), Positives = 50/96 (52%), Gaps = 8/96 (8%)
Query: 68 KLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQ 127
+ LVL+LD TL+HC SL+ E F ++Q+ V+LRPF R FLE+
Sbjct: 176 EFSLVLDLDETLVHC----SLNELEDAALTFPVLFQDVIYQV----YVRLRPFFREFLER 227
Query: 128 ASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
S + +I L T S + YA+ + +LD + R+
Sbjct: 228 MSQMYEIILFTASKKVYADKLLNILDPKKQLVRHRL 263
>gi|145511237|ref|XP_001441546.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124408796|emb|CAK74149.1| unnamed protein product [Paramecium tetraurelia]
Length = 470
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 42/157 (26%), Positives = 77/157 (49%), Gaps = 21/157 (13%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
+V +LD TL+HC +SL + Y+ S G + +RP+ + L++ S
Sbjct: 282 VVFDLDETLIHCNENQSLK-ADVYIPITFPS--GDTVSAG----INIRPYAKWILQELSQ 334
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSR------IIAREDFNGKDRK--NPDLVRG 182
+ ++ + T S +CYA ++ LD ++ S++ +++ + + KD K N DL
Sbjct: 335 ICEVVVFTASHQCYASQVIQQLDPKNQLLSAQLFRDNCVLSPDGVHIKDLKIFNRDL--- 391
Query: 183 QERGIVILDDTESVWSDHTENLIVLGKYVYFR-DKEL 218
+ IV++D+ + H EN I + Y + DKEL
Sbjct: 392 --KDIVLVDNAAYSFGVHLENGIPIIPYYENKDDKEL 426
>gi|336387157|gb|EGO28302.1| hypothetical protein SERLADRAFT_354339 [Serpula lacrymans var.
lacrymans S7.9]
Length = 874
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 30/100 (30%), Positives = 52/100 (52%), Gaps = 2/100 (2%)
Query: 115 VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDR 174
+K RP + FL ++ ++++ TM TR YAE +D D F RI++R++ +
Sbjct: 272 IKPRPGWQHFLHSIANKYEMHVYTMGTRAYAEEVCAAIDPDGTIFGGRILSRDESGSLTQ 331
Query: 175 KN-PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
K+ L +VI+DD VW + + NL+ + Y +F
Sbjct: 332 KSLQRLFPCDTSMVVIIDDRADVW-EWSPNLVKVIPYDFF 370
>gi|359487040|ref|XP_002265614.2| PREDICTED: uncharacterized protein LOC100267967 [Vitis vinifera]
Length = 522
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 34/107 (31%), Positives = 53/107 (49%), Gaps = 14/107 (13%)
Query: 59 LRYSEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHS--FIGSLFQMANDKL-V 115
L E + +++ LVL+LD TL+H L+ H+ F M + V
Sbjct: 340 LPEEESKRKRITLVLDLDETLVH-----------STLEPCDHADFTFPVFFNMKEHTIYV 388
Query: 116 KLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSR 162
+ RPF++ FLE+ + + +I + T S YAE + +LD D K FS R
Sbjct: 389 RQRPFLQMFLERVAEMFEIIVFTASQSIYAEQLLDILDPDRKLFSGR 435
>gi|344297040|ref|XP_003420208.1| PREDICTED: CTD small phosphatase-like protein 2 [Loxodonta
africana]
Length = 466
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 33/93 (35%), Positives = 49/93 (52%), Gaps = 8/93 (8%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
LVL+LD TL+HC SL+ E F ++Q+ V+LRPF R FLE+ S
Sbjct: 290 LVLDLDETLVHC----SLNELEDAALTFPVLFQDVIYQV----YVRLRPFFREFLERMSQ 341
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
+ +I L T S + YA+ + +LD + R+
Sbjct: 342 MYEIILFTASKKVYADKLLNILDPKKQLVRHRL 374
>gi|100815975|ref|NP_057480.2| CTD small phosphatase-like protein 2 [Homo sapiens]
gi|187471086|sp|Q05D32.2|CTSL2_HUMAN RecName: Full=CTD small phosphatase-like protein 2;
Short=CTDSP-like 2
gi|23273027|gb|AAH35744.1| CTDSPL2 protein [Homo sapiens]
gi|71835542|gb|AAZ42188.1| unknown [Homo sapiens]
gi|119597671|gb|EAW77265.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
small phosphatase like 2, isoform CRA_a [Homo sapiens]
gi|119597672|gb|EAW77266.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
small phosphatase like 2, isoform CRA_a [Homo sapiens]
gi|123994825|gb|ABM85014.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
small phosphatase like 2 [synthetic construct]
gi|157928777|gb|ABW03674.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
small phosphatase like 2 [synthetic construct]
gi|158255896|dbj|BAF83919.1| unnamed protein product [Homo sapiens]
gi|168278020|dbj|BAG10988.1| CTD small phosphatase like 2 [synthetic construct]
Length = 466
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 33/93 (35%), Positives = 49/93 (52%), Gaps = 8/93 (8%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
LVL+LD TL+HC SL+ E F ++Q+ V+LRPF R FLE+ S
Sbjct: 290 LVLDLDETLVHC----SLNELEDAALTFPVLFQDVIYQV----YVRLRPFFREFLERMSQ 341
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
+ +I L T S + YA+ + +LD + R+
Sbjct: 342 MYEIILFTASKKVYADKLLNILDPKKQLVRHRL 374
>gi|50949928|emb|CAH10508.1| hypothetical protein [Homo sapiens]
Length = 394
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 33/93 (35%), Positives = 49/93 (52%), Gaps = 8/93 (8%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
LVL+LD TL+HC SL+ E F ++Q+ V+LRPF R FLE+ S
Sbjct: 218 LVLDLDETLVHC----SLNELEDAALTFPVLFQDVIYQV----YVRLRPFFREFLERMSQ 269
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
+ +I L T S + YA+ + +LD + R+
Sbjct: 270 MYEIILFTASKKVYADKLLNILDPKKQLVRHRL 302
>gi|296213856|ref|XP_002753450.1| PREDICTED: CTD small phosphatase-like protein 2 isoform 3
[Callithrix jacchus]
Length = 466
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 33/93 (35%), Positives = 49/93 (52%), Gaps = 8/93 (8%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
LVL+LD TL+HC SL+ E F ++Q+ V+LRPF R FLE+ S
Sbjct: 290 LVLDLDETLVHC----SLNELEDAALTFPVLFQDVIYQV----YVRLRPFFREFLERMSQ 341
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
+ +I L T S + YA+ + +LD + R+
Sbjct: 342 MYEIILFTASKKVYADKLLNILDPKKQLVRHRL 374
>gi|395837830|ref|XP_003791832.1| PREDICTED: CTD small phosphatase-like protein 2 isoform 1 [Otolemur
garnettii]
gi|395837832|ref|XP_003791833.1| PREDICTED: CTD small phosphatase-like protein 2 isoform 2 [Otolemur
garnettii]
Length = 466
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 33/93 (35%), Positives = 49/93 (52%), Gaps = 8/93 (8%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
LVL+LD TL+HC SL+ E F ++Q+ V+LRPF R FLE+ S
Sbjct: 290 LVLDLDETLVHC----SLNELEDAALTFPVLFQDVIYQV----YVRLRPFFREFLERMSQ 341
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
+ +I L T S + YA+ + +LD + R+
Sbjct: 342 MYEIILFTASKKVYADKLLNILDPKKQLVRHRL 374
>gi|147798518|emb|CAN65472.1| hypothetical protein VITISV_037605 [Vitis vinifera]
Length = 506
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 34/107 (31%), Positives = 53/107 (49%), Gaps = 14/107 (13%)
Query: 59 LRYSEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHS--FIGSLFQMANDKL-V 115
L E + +++ LVL+LD TL+H L+ H+ F M + V
Sbjct: 324 LPEEESKRKRITLVLDLDETLVH-----------STLEPCDHADFTFPVFFNMKEHTIYV 372
Query: 116 KLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSR 162
+ RPF++ FLE+ + + +I + T S YAE + +LD D K FS R
Sbjct: 373 RQRPFLQMFLERVAEMFEIIVFTASQSIYAEQLLDILDPDRKLFSGR 419
>gi|410961377|ref|XP_003987259.1| PREDICTED: CTD small phosphatase-like protein 2 isoform 1 [Felis
catus]
gi|410961379|ref|XP_003987260.1| PREDICTED: CTD small phosphatase-like protein 2 isoform 2 [Felis
catus]
Length = 466
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 33/93 (35%), Positives = 49/93 (52%), Gaps = 8/93 (8%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
LVL+LD TL+HC SL+ E F ++Q+ V+LRPF R FLE+ S
Sbjct: 290 LVLDLDETLVHC----SLNELEDAALTFPVLFQDVIYQV----YVRLRPFFREFLERMSQ 341
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
+ +I L T S + YA+ + +LD + R+
Sbjct: 342 MYEIILFTASKKVYADKLLNILDPKKQLVRHRL 374
>gi|402874166|ref|XP_003900915.1| PREDICTED: CTD small phosphatase-like protein 2 isoform 1 [Papio
anubis]
gi|402874168|ref|XP_003900916.1| PREDICTED: CTD small phosphatase-like protein 2 isoform 2 [Papio
anubis]
Length = 466
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 33/93 (35%), Positives = 49/93 (52%), Gaps = 8/93 (8%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
LVL+LD TL+HC SL+ E F ++Q+ V+LRPF R FLE+ S
Sbjct: 290 LVLDLDETLVHC----SLNELEDAALTFPVLFQDVIYQV----YVRLRPFFREFLERMSQ 341
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
+ +I L T S + YA+ + +LD + R+
Sbjct: 342 MYEIILFTASKKVYADKLLNILDPKKQLVRHRL 374
>gi|403274413|ref|XP_003928971.1| PREDICTED: CTD small phosphatase-like protein 2 isoform 1 [Saimiri
boliviensis boliviensis]
gi|403274415|ref|XP_003928972.1| PREDICTED: CTD small phosphatase-like protein 2 isoform 2 [Saimiri
boliviensis boliviensis]
Length = 466
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 33/93 (35%), Positives = 49/93 (52%), Gaps = 8/93 (8%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
LVL+LD TL+HC SL+ E F ++Q+ V+LRPF R FLE+ S
Sbjct: 290 LVLDLDETLVHC----SLNELEDAALTFPVLFQDVIYQV----YVRLRPFFREFLERMSQ 341
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
+ +I L T S + YA+ + +LD + R+
Sbjct: 342 MYEIILFTASKKVYADKLLNILDPKKQLVRHRL 374
>gi|397480304|ref|XP_003811426.1| PREDICTED: CTD small phosphatase-like protein 2 isoform 1 [Pan
paniscus]
gi|397480306|ref|XP_003811427.1| PREDICTED: CTD small phosphatase-like protein 2 isoform 2 [Pan
paniscus]
Length = 466
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 33/93 (35%), Positives = 49/93 (52%), Gaps = 8/93 (8%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
LVL+LD TL+HC SL+ E F ++Q+ V+LRPF R FLE+ S
Sbjct: 290 LVLDLDETLVHC----SLNELEDAALTFPVLFQDVIYQV----YVRLRPFFREFLERMSQ 341
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
+ +I L T S + YA+ + +LD + R+
Sbjct: 342 MYEIILFTASKKVYADKLLNILDPKKQLVRHRL 374
>gi|296090552|emb|CBI40902.3| unnamed protein product [Vitis vinifera]
Length = 570
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 34/107 (31%), Positives = 53/107 (49%), Gaps = 14/107 (13%)
Query: 59 LRYSEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHS--FIGSLFQMANDKL-V 115
L E + +++ LVL+LD TL+H L+ H+ F M + V
Sbjct: 388 LPEEESKRKRITLVLDLDETLVH-----------STLEPCDHADFTFPVFFNMKEHTIYV 436
Query: 116 KLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSR 162
+ RPF++ FLE+ + + +I + T S YAE + +LD D K FS R
Sbjct: 437 RQRPFLQMFLERVAEMFEIIVFTASQSIYAEQLLDILDPDRKLFSGR 483
>gi|388453109|ref|NP_001253738.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
small phosphatase like 2 [Macaca mulatta]
gi|114656732|ref|XP_001161756.1| PREDICTED: CTD (carboxy-terminal domain, RNA polymerase II,
polypeptide A) small phosphatase like 2 isoform 3 [Pan
troglodytes]
gi|114656734|ref|XP_001161793.1| PREDICTED: CTD (carboxy-terminal domain, RNA polymerase II,
polypeptide A) small phosphatase like 2 isoform 4 [Pan
troglodytes]
gi|297696523|ref|XP_002825440.1| PREDICTED: CTD (carboxy-terminal domain, RNA polymerase II,
polypeptide A) small phosphatase like 2 isoform 1 [Pongo
abelii]
gi|395746659|ref|XP_003778487.1| PREDICTED: CTD (carboxy-terminal domain, RNA polymerase II,
polypeptide A) small phosphatase like 2 isoform 2 [Pongo
abelii]
gi|380813572|gb|AFE78660.1| CTD small phosphatase-like protein 2 [Macaca mulatta]
gi|383419005|gb|AFH32716.1| CTD small phosphatase-like protein 2 [Macaca mulatta]
gi|384947558|gb|AFI37384.1| CTD small phosphatase-like protein 2 [Macaca mulatta]
gi|410206686|gb|JAA00562.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
small phosphatase like 2 [Pan troglodytes]
gi|410253512|gb|JAA14723.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
small phosphatase like 2 [Pan troglodytes]
gi|410302524|gb|JAA29862.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
small phosphatase like 2 [Pan troglodytes]
gi|410341327|gb|JAA39610.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
small phosphatase like 2 [Pan troglodytes]
Length = 466
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 33/93 (35%), Positives = 49/93 (52%), Gaps = 8/93 (8%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
LVL+LD TL+HC SL+ E F ++Q+ V+LRPF R FLE+ S
Sbjct: 290 LVLDLDETLVHC----SLNELEDAALTFPVLFQDVIYQV----YVRLRPFFREFLERMSQ 341
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
+ +I L T S + YA+ + +LD + R+
Sbjct: 342 MYEIILFTASKKVYADKLLNILDPKKQLVRHRL 374
>gi|6841480|gb|AAF29093.1|AF161478_1 HSPC129 [Homo sapiens]
Length = 466
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 33/93 (35%), Positives = 49/93 (52%), Gaps = 8/93 (8%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
LVL+LD TL+HC SL+ E F ++Q+ V+LRPF R FLE+ S
Sbjct: 290 LVLDLDETLVHC----SLNELEDAALTFPVLFQDVIYQV----YVRLRPFFREFLERMSQ 341
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
+ +I L T S + YA+ + +LD + R+
Sbjct: 342 MYEIILFTASKKVYADKLLNILDPKKQLVRHRL 374
>gi|6841354|gb|AAF29030.1|AF161543_1 HSPC058 [Homo sapiens]
Length = 352
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 33/96 (34%), Positives = 50/96 (52%), Gaps = 8/96 (8%)
Query: 68 KLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQ 127
+ LVL+LD TL+HC SL+ E F ++Q+ V+LRPF R FLE+
Sbjct: 173 EFSLVLDLDETLVHC----SLNELEDAALTFPVLFQDVIYQV----YVRLRPFFREFLER 224
Query: 128 ASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
S + +I L T S + YA+ + +LD + R+
Sbjct: 225 MSQMYEIILFTASKKVYADKLLNILDPKKQLVRHRL 260
>gi|417401418|gb|JAA47595.1| Putative ctd carboxy-terminal domain rna polymer [Desmodus
rotundus]
Length = 466
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 33/93 (35%), Positives = 49/93 (52%), Gaps = 8/93 (8%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
LVL+LD TL+HC SL+ E F ++Q+ V+LRPF R FLE+ S
Sbjct: 290 LVLDLDETLVHC----SLNELEDAALTFPVLFQDVIYQV----YVRLRPFFREFLERMSQ 341
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
+ +I L T S + YA+ + +LD + R+
Sbjct: 342 MYEIILFTASKKVYADKLLNILDPKKQLVRHRL 374
>gi|225711928|gb|ACO11810.1| Probable C-terminal domain small phosphatase [Lepeophtheirus
salmonis]
Length = 265
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 39/143 (27%), Positives = 67/143 (46%), Gaps = 12/143 (8%)
Query: 68 KLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQ 127
+ LVL+LD TL+HC +++ L +F V+ RP +R FLE+
Sbjct: 85 RFSLVLDLDETLVHC-SLQELDDASLSFPVVFQDTTYRVF-------VRTRPRIREFLER 136
Query: 128 ASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED---FNGKDRKNPDLVRGQE 184
S ++ L T S + YA+ + LLD + K+ R+ RE NG K+ +++
Sbjct: 137 VSKNFEVTLFTASKKVYADKLLNLLDPERKWIKYRLF-REHCVCVNGNYIKDLNILGRDL 195
Query: 185 RGIVILDDTESVWSDHTENLIVL 207
+I+D++ + EN I +
Sbjct: 196 SKTIIIDNSPQAFGYQLENGIPI 218
>gi|149692003|ref|XP_001502897.1| PREDICTED: CTD (carboxy-terminal domain, RNA polymerase II,
polypeptide A) small phosphatase like 2 isoform 2 [Equus
caballus]
gi|149692005|ref|XP_001502892.1| PREDICTED: CTD (carboxy-terminal domain, RNA polymerase II,
polypeptide A) small phosphatase like 2 isoform 1 [Equus
caballus]
Length = 466
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 33/93 (35%), Positives = 49/93 (52%), Gaps = 8/93 (8%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
LVL+LD TL+HC SL+ E F ++Q+ V+LRPF R FLE+ S
Sbjct: 290 LVLDLDETLVHC----SLNELEDAALTFPVLFQDVIYQV----YVRLRPFFREFLERMSQ 341
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
+ +I L T S + YA+ + +LD + R+
Sbjct: 342 MYEIILFTASKKVYADKLLNILDPKKQLVRHRL 374
>gi|57108473|ref|XP_544655.1| PREDICTED: CTD (carboxy-terminal domain, RNA polymerase II,
polypeptide A) small phosphatase like 2 isoform 1 [Canis
lupus familiaris]
gi|73999941|ref|XP_860654.1| PREDICTED: CTD (carboxy-terminal domain, RNA polymerase II,
polypeptide A) small phosphatase like 2 isoform 4 [Canis
lupus familiaris]
Length = 466
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 33/93 (35%), Positives = 49/93 (52%), Gaps = 8/93 (8%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
LVL+LD TL+HC SL+ E F ++Q+ V+LRPF R FLE+ S
Sbjct: 290 LVLDLDETLVHC----SLNELEDAALTFPVLFQDVIYQV----YVRLRPFFREFLERMSQ 341
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
+ +I L T S + YA+ + +LD + R+
Sbjct: 342 MYEIILFTASKKVYADKLLNILDPKKQLVRHRL 374
>gi|355681384|gb|AER96789.1| CTD small phosphatase like 2 [Mustela putorius furo]
Length = 465
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 33/93 (35%), Positives = 49/93 (52%), Gaps = 8/93 (8%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
LVL+LD TL+HC SL+ E F ++Q+ V+LRPF R FLE+ S
Sbjct: 290 LVLDLDETLVHC----SLNELEDAALTFPVLFQDVIYQV----YVRLRPFFREFLERMSQ 341
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
+ +I L T S + YA+ + +LD + R+
Sbjct: 342 MYEIILFTASKKVYADKLLNILDPKKQLVRHRL 374
>gi|145527362|ref|XP_001449481.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124417069|emb|CAK82084.1| unnamed protein product [Paramecium tetraurelia]
Length = 249
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 48/174 (27%), Positives = 84/174 (48%), Gaps = 14/174 (8%)
Query: 49 GLSFDYMLRGLRYSEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQ 108
GL FD + +++ E++ LVL+LD TL+H ++ +L ++I IG+ +
Sbjct: 52 GLDFDDECKDKITAKKTEKEFTLVLDLDETLIHSDMERT-----SFLDEEILVKIGNTIE 106
Query: 109 MANDKLVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED 168
VK+RPF R FL+ S+ ++ + T + + YA+ + LD F R R+
Sbjct: 107 KY---YVKIRPFARDFLKALSNYFELVIFTAAIKEYADKVIDYLDPSG--FIKRRFYRDS 161
Query: 169 FNGKDR---KNPDLVRGQERGIVILDDTESVWSDHTENLIVLGK-YVYFRDKEL 218
KD K+ V I+D++ S S + +N I++ Y +D+EL
Sbjct: 162 CTKKDGVFYKDLTKVNSNLDKTFIIDNSLSGMSLNPQNGILIKSWYKDLKDQEL 215
>gi|26390099|dbj|BAC25842.1| unnamed protein product [Mus musculus]
Length = 351
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 45/155 (29%), Positives = 74/155 (47%), Gaps = 14/155 (9%)
Query: 68 KLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQ 127
+ LVL+LD TL+HC SL+ E F ++Q+ V+LRPF R FLE+
Sbjct: 172 EFSLVLDLDETLVHC----SLNELEDAALTFPVLFQDVIYQV----YVRLRPFFREFLER 223
Query: 128 ASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED---FNGKDRKNPDLVRGQE 184
S + +I L T S + YA+ + +LD + R+ RE G K+ +++
Sbjct: 224 MSQMYEIILFTASKKVYADKLLNILDPKKQLVRHRLF-REHCVCVQGNYIKDLNILGRDL 282
Query: 185 RGIVILDDTESVWSDHTENLIVLGKYVYFRDKELN 219
+I+D++ ++ N I + +F DK N
Sbjct: 283 SKTIIIDNSPQAFAYQLSNGIPIES--WFMDKNDN 315
>gi|332235387|ref|XP_003266885.1| PREDICTED: CTD small phosphatase-like protein 2 isoform 1 [Nomascus
leucogenys]
gi|332235389|ref|XP_003266886.1| PREDICTED: CTD small phosphatase-like protein 2 isoform 2 [Nomascus
leucogenys]
Length = 466
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 33/93 (35%), Positives = 49/93 (52%), Gaps = 8/93 (8%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
LVL+LD TL+HC SL+ E F ++Q+ V+LRPF R FLE+ S
Sbjct: 290 LVLDLDETLVHC----SLNELEDAALTFPVLFQDVIYQV----YVRLRPFFREFLERMSQ 341
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
+ +I L T S + YA+ + +LD + R+
Sbjct: 342 MYEIILFTASKKVYADKLLNILDPKKQLVRHRL 374
>gi|301754747|ref|XP_002913218.1| PREDICTED: CTD small phosphatase-like protein 2-like [Ailuropoda
melanoleuca]
Length = 466
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 33/93 (35%), Positives = 49/93 (52%), Gaps = 8/93 (8%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
LVL+LD TL+HC SL+ E F ++Q+ V+LRPF R FLE+ S
Sbjct: 290 LVLDLDETLVHC----SLNELEDAALTFPVLFQDVIYQV----YVRLRPFFREFLERMSQ 341
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
+ +I L T S + YA+ + +LD + R+
Sbjct: 342 MYEIILFTASKKVYADKLLNILDPKKQLVRHRL 374
>gi|330864811|ref|NP_001178334.1| CTD small phosphatase-like protein 2 [Bos taurus]
gi|296482877|tpg|DAA24992.1| TPA: CTD (carboxy-terminal domain, RNA polymerase II, polypeptide
A) small phosphatase like 2 [Bos taurus]
gi|440911957|gb|ELR61572.1| CTD small phosphatase-like protein 2 [Bos grunniens mutus]
Length = 466
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 33/93 (35%), Positives = 49/93 (52%), Gaps = 8/93 (8%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
LVL+LD TL+HC SL+ E F ++Q+ V+LRPF R FLE+ S
Sbjct: 290 LVLDLDETLVHC----SLNELEDAALTFPVLFQDVIYQV----YVRLRPFFREFLERMSQ 341
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
+ +I L T S + YA+ + +LD + R+
Sbjct: 342 MYEIILFTASKKVYADKLLNILDPKKQLVRHRL 374
>gi|126281910|ref|XP_001363358.1| PREDICTED: CTD (carboxy-terminal domain, RNA polymerase II,
polypeptide A) small phosphatase like 2 [Monodelphis
domestica]
Length = 466
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 33/93 (35%), Positives = 49/93 (52%), Gaps = 8/93 (8%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
LVL+LD TL+HC SL+ E F ++Q+ V+LRPF R FLE+ S
Sbjct: 290 LVLDLDETLVHC----SLNELEDAALTFPVLFQDVIYQV----YVRLRPFFREFLERMSQ 341
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
+ +I L T S + YA+ + +LD + R+
Sbjct: 342 IYEIILFTASKKVYADKLLNILDPKKQLVRHRL 374
>gi|149392655|gb|ABR26130.1| ctd-phosphatase-like protein [Oryza sativa Indica Group]
Length = 187
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 41/152 (26%), Positives = 70/152 (46%), Gaps = 22/152 (14%)
Query: 67 RKLQLVLNLDHTLLH-----CRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKL-VKLRPF 120
+++ LVL+LD TL+H C N+ + F M N + V+ RP
Sbjct: 10 KQITLVLDLDETLVHSTLDHCDNVDFT--------------LQVFFNMKNHTVYVRQRPH 55
Query: 121 VRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDL- 179
++ FLE+ + + ++ + T S R YAE + LD D + S RI + DL
Sbjct: 56 LKMFLEKVAQMFELVIFTASQRIYAEQLIDRLDPDERLISHRIYRESCIFSEGCYTKDLT 115
Query: 180 VRGQERG-IVILDDTESVWSDHTENLIVLGKY 210
+ G + +VI+D+T V+ +N I + +
Sbjct: 116 ILGVDLAKVVIVDNTPQVFQLQVDNGIPIKSW 147
>gi|30851260|gb|AAH52660.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
small phosphatase like 2 [Mus musculus]
Length = 465
Score = 48.1 bits (113), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 33/93 (35%), Positives = 49/93 (52%), Gaps = 8/93 (8%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
LVL+LD TL+HC SL+ E F ++Q+ V+LRPF R FLE+ S
Sbjct: 289 LVLDLDETLVHC----SLNELEDAALTFPVLFQDVIYQV----YVRLRPFFREFLERMSQ 340
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
+ +I L T S + YA+ + +LD + R+
Sbjct: 341 MYEIILFTASKKVYADKLLNILDPKKQLVRHRL 373
>gi|355692677|gb|EHH27280.1| CTD small phosphatase-like protein 2 [Macaca mulatta]
Length = 466
Score = 48.1 bits (113), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 32/83 (38%), Positives = 46/83 (55%), Gaps = 8/83 (9%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
LVL+LD TL+HC SL+ E F ++Q+ V+LRPF R FLE+ S
Sbjct: 290 LVLDLDETLVHC----SLNELEDAALTFPVLFQDVIYQV----YVRLRPFFREFLERMSQ 341
Query: 131 LVDIYLCTMSTRCYAEAAVKLLD 153
+ +I L T S + YA+ + +LD
Sbjct: 342 MYEIILFTASKKVYADKLLNILD 364
>gi|432861327|ref|XP_004069613.1| PREDICTED: CTD small phosphatase-like protein 2-A-like [Oryzias
latipes]
Length = 473
Score = 48.1 bits (113), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 47/152 (30%), Positives = 73/152 (48%), Gaps = 14/152 (9%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
LVL+LD TL+HC SL+ E F ++Q+ V+LRPF R FLE+ S
Sbjct: 297 LVLDLDETLVHC----SLNELEDAALTFPVLFQDVIYQV----YVRLRPFFREFLERMSQ 348
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED---FNGKDRKNPDLVRGQERGI 187
L +I L T S + YA+ + +LD + R+ RE G K+ +++
Sbjct: 349 LYEIILFTASKKVYADKLLNILDPKKQLVRHRLF-REHCVCVQGNYIKDLNILGRDLSKT 407
Query: 188 VILDDTESVWSDHTENLIVLGKYVYFRDKELN 219
VI+D++ ++ N I + +F DK N
Sbjct: 408 VIIDNSPQAFAYQLSNGIPIES--WFVDKNDN 437
>gi|47059059|ref|NP_997615.1| CTD small phosphatase-like protein 2 [Mus musculus]
gi|81873659|sp|Q8BG15.1|CTSL2_MOUSE RecName: Full=CTD small phosphatase-like protein 2;
Short=CTDSP-like 2
gi|26326063|dbj|BAC26775.1| unnamed protein product [Mus musculus]
gi|26329037|dbj|BAC28257.1| unnamed protein product [Mus musculus]
gi|26340192|dbj|BAC33759.1| unnamed protein product [Mus musculus]
gi|26349835|dbj|BAC38557.1| unnamed protein product [Mus musculus]
gi|148696133|gb|EDL28080.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
small phosphatase like 2, isoform CRA_b [Mus musculus]
Length = 465
Score = 48.1 bits (113), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 33/93 (35%), Positives = 49/93 (52%), Gaps = 8/93 (8%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
LVL+LD TL+HC SL+ E F ++Q+ V+LRPF R FLE+ S
Sbjct: 289 LVLDLDETLVHC----SLNELEDAALTFPVLFQDVIYQV----YVRLRPFFREFLERMSQ 340
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
+ +I L T S + YA+ + +LD + R+
Sbjct: 341 MYEIILFTASKKVYADKLLNILDPKKQLVRHRL 373
>gi|354471693|ref|XP_003498075.1| PREDICTED: CTD small phosphatase-like protein 2 [Cricetulus
griseus]
Length = 465
Score = 48.1 bits (113), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 33/93 (35%), Positives = 49/93 (52%), Gaps = 8/93 (8%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
LVL+LD TL+HC SL+ E F ++Q+ V+LRPF R FLE+ S
Sbjct: 289 LVLDLDETLVHC----SLNELEDAALTFPVLFQDVIYQV----YVRLRPFFREFLERMSQ 340
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
+ +I L T S + YA+ + +LD + R+
Sbjct: 341 MYEIILFTASKKVYADKLLNILDPKKQLVRHRL 373
>gi|327288817|ref|XP_003229121.1| PREDICTED: CTD small phosphatase-like protein 2-like [Anolis
carolinensis]
Length = 466
Score = 48.1 bits (113), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 33/93 (35%), Positives = 49/93 (52%), Gaps = 8/93 (8%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
LVL+LD TL+HC SL+ E F ++Q+ V+LRPF R FLE+ S
Sbjct: 290 LVLDLDETLVHC----SLNELEDAALTFPVLFQDVIYQV----YVRLRPFFREFLERMSQ 341
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
+ +I L T S + YA+ + +LD + R+
Sbjct: 342 IYEIILFTASKKVYADKLLNILDPKKQLVRHRL 374
>gi|229892336|ref|NP_001080602.1| CTD small phosphatase-like protein 2-A [Xenopus laevis]
gi|82176945|sp|Q801R4.1|CTL2A_XENLA RecName: Full=CTD small phosphatase-like protein 2-A;
Short=CTDSP-like 2-A
gi|28838482|gb|AAH47962.1| Ctdspl2a protein [Xenopus laevis]
gi|120538080|gb|AAI29525.1| Ctdspl2a protein [Xenopus laevis]
Length = 466
Score = 48.1 bits (113), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 33/96 (34%), Positives = 50/96 (52%), Gaps = 8/96 (8%)
Query: 68 KLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQ 127
+ LVL+LD TL+HC SL+ E F ++Q+ V+LRPF R FLE+
Sbjct: 287 EFSLVLDLDETLVHC----SLNELEDAALTFPVLFQDVIYQV----YVRLRPFFREFLER 338
Query: 128 ASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
S + +I L T S + YA+ + +LD + R+
Sbjct: 339 MSQIYEIILFTASKKVYADKLLNILDPKKRLVRHRL 374
>gi|351710351|gb|EHB13270.1| CTD small phosphatase-like protein 2 [Heterocephalus glaber]
Length = 466
Score = 48.1 bits (113), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 33/93 (35%), Positives = 49/93 (52%), Gaps = 8/93 (8%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
LVL+LD TL+HC SL+ E F ++Q+ V+LRPF R FLE+ S
Sbjct: 290 LVLDLDETLVHC----SLNELEDAALTFPVLFQDVIYQV----YVRLRPFFREFLERMSQ 341
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
+ +I L T S + YA+ + +LD + R+
Sbjct: 342 MYEIILFTASKKVYADKLLNILDPKKQLVRHRL 374
>gi|74190363|dbj|BAE37265.1| unnamed protein product [Mus musculus]
Length = 465
Score = 48.1 bits (113), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 33/93 (35%), Positives = 49/93 (52%), Gaps = 8/93 (8%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
LVL+LD TL+HC SL+ E F ++Q+ V+LRPF R FLE+ S
Sbjct: 289 LVLDLDETLVHC----SLNELEDAALTFPVLFQDVIYQV----YVRLRPFFREFLERMSQ 340
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
+ +I L T S + YA+ + +LD + R+
Sbjct: 341 MYEIILFTASKKVYADKLLNILDPKKQLVRHRL 373
>gi|148696132|gb|EDL28079.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
small phosphatase like 2, isoform CRA_a [Mus musculus]
Length = 465
Score = 48.1 bits (113), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 33/93 (35%), Positives = 49/93 (52%), Gaps = 8/93 (8%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
LVL+LD TL+HC SL+ E F ++Q+ V+LRPF R FLE+ S
Sbjct: 289 LVLDLDETLVHC----SLNELEDAALTFPVLFQDVIYQV----YVRLRPFFREFLERMSQ 340
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
+ +I L T S + YA+ + +LD + R+
Sbjct: 341 MYEIILFTASKKVYADKLLNILDPKKQLVRHRL 373
>gi|348512761|ref|XP_003443911.1| PREDICTED: CTD small phosphatase-like protein 2-A-like isoform 2
[Oreochromis niloticus]
Length = 471
Score = 48.1 bits (113), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 34/93 (36%), Positives = 49/93 (52%), Gaps = 8/93 (8%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
LVL+LD TL+HC SL+ E F ++Q+ V+LRPF R FLE+ S
Sbjct: 295 LVLDLDETLVHC----SLNELEDAALTFPVLFQDVIYQV----YVRLRPFFREFLERMSQ 346
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
L +I L T S + YA+ + +LD + R+
Sbjct: 347 LYEIILFTASKKVYADKLLNILDPKKQLVRHRL 379
>gi|147907092|ref|NP_001089935.1| CTD small phosphatase-like protein 2-B [Xenopus laevis]
gi|83405117|gb|AAI10767.1| Ctdspl2b protein [Xenopus laevis]
Length = 466
Score = 48.1 bits (113), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 33/96 (34%), Positives = 50/96 (52%), Gaps = 8/96 (8%)
Query: 68 KLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQ 127
+ LVL+LD TL+HC SL+ E F ++Q+ V+LRPF R FLE+
Sbjct: 287 EFSLVLDLDETLVHC----SLNELEDAALTFPVLFQDVIYQV----YVRLRPFFREFLER 338
Query: 128 ASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
S + +I L T S + YA+ + +LD + R+
Sbjct: 339 MSQIYEIILFTASKKVYADKLLNILDPKKRLVRHRL 374
>gi|62078827|ref|NP_001014070.1| CTD small phosphatase-like protein 2 [Rattus norvegicus]
gi|81883796|sp|Q5XIK8.1|CTSL2_RAT RecName: Full=CTD small phosphatase-like protein 2;
Short=CTDSP-like 2
gi|53734232|gb|AAH83672.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
small phosphatase like 2 [Rattus norvegicus]
gi|149023119|gb|EDL80013.1| similar to hypothetical protein HSPC129 [Rattus norvegicus]
Length = 465
Score = 48.1 bits (113), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 33/93 (35%), Positives = 49/93 (52%), Gaps = 8/93 (8%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
LVL+LD TL+HC SL+ E F ++Q+ V+LRPF R FLE+ S
Sbjct: 289 LVLDLDETLVHC----SLNELEDAALTFPVLFQDVIYQV----YVRLRPFFREFLERMSQ 340
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
+ +I L T S + YA+ + +LD + R+
Sbjct: 341 MYEIILFTASKKVYADKLLNILDPKKQLVRHRL 373
>gi|34596232|gb|AAQ76796.1| hypothetical protein [Homo sapiens]
Length = 466
Score = 48.1 bits (113), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 33/93 (35%), Positives = 49/93 (52%), Gaps = 8/93 (8%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
LVL+LD TL+HC SL+ E F ++Q+ V+LRPF R FLE+ S
Sbjct: 290 LVLDLDETLVHC----SLNELEDAALTFPVLFQDVVYQV----YVRLRPFFREFLERMSQ 341
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
+ +I L T S + YA+ + +LD + R+
Sbjct: 342 MYEIILFTASKKVYADKLLNILDPKKQLVRHRL 374
>gi|123900520|sp|Q3KQB6.1|CTL2B_XENLA RecName: Full=CTD small phosphatase-like protein 2-B;
Short=CTDSP-like 2-B
gi|76779483|gb|AAI06291.1| Ctdspl2b protein [Xenopus laevis]
Length = 466
Score = 48.1 bits (113), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 33/96 (34%), Positives = 50/96 (52%), Gaps = 8/96 (8%)
Query: 68 KLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQ 127
+ LVL+LD TL+HC SL+ E F ++Q+ V+LRPF R FLE+
Sbjct: 287 EFSLVLDLDETLVHC----SLNELEDAALTFPVLFQDVIYQV----YVRLRPFFREFLER 338
Query: 128 ASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
S + +I L T S + YA+ + +LD + R+
Sbjct: 339 MSQIYEIILFTASKKVYADKLLNILDPKKRLVRHRL 374
>gi|321470826|gb|EFX81801.1| hypothetical protein DAPPUDRAFT_49973 [Daphnia pulex]
Length = 237
Score = 48.1 bits (113), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 41/143 (28%), Positives = 67/143 (46%), Gaps = 12/143 (8%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
LVL+LD TL+HC SL E F + +Q+ V+ RP R FLE+ S
Sbjct: 61 LVLDLDETLVHC----SLEELEDAAFSFPVFFQDTTYQV----FVRTRPHFREFLERVSQ 112
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED---FNGKDRKNPDLVRGQERGI 187
+ ++ L T S + YA+ + LLD ++ R+ RE NG K+ ++
Sbjct: 113 IFEVILFTASKKVYADKLLNLLDPQRRWIKYRLF-REHCVCVNGNYIKDLTILGRDLSRT 171
Query: 188 VILDDTESVWSDHTENLIVLGKY 210
+I+D++ + EN I + +
Sbjct: 172 IIIDNSPQAFGYQLENGIPIESW 194
>gi|348512759|ref|XP_003443910.1| PREDICTED: CTD small phosphatase-like protein 2-A-like isoform 1
[Oreochromis niloticus]
Length = 474
Score = 48.1 bits (113), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 34/93 (36%), Positives = 49/93 (52%), Gaps = 8/93 (8%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
LVL+LD TL+HC SL+ E F ++Q+ V+LRPF R FLE+ S
Sbjct: 298 LVLDLDETLVHC----SLNELEDAALTFPVLFQDVIYQV----YVRLRPFFREFLERMSQ 349
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
L +I L T S + YA+ + +LD + R+
Sbjct: 350 LYEIILFTASKKVYADKLLNILDPKKQLVRHRL 382
>gi|56605878|ref|NP_001008438.1| CTD small phosphatase-like protein 2 [Xenopus (Silurana)
tropicalis]
gi|82181540|sp|Q66KM5.1|CTSL2_XENTR RecName: Full=CTD small phosphatase-like protein 2;
Short=CTDSP-like 2
gi|51512946|gb|AAH80328.1| MGC79498 protein [Xenopus (Silurana) tropicalis]
Length = 466
Score = 48.1 bits (113), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 33/96 (34%), Positives = 50/96 (52%), Gaps = 8/96 (8%)
Query: 68 KLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQ 127
+ LVL+LD TL+HC SL+ E F ++Q+ V+LRPF R FLE+
Sbjct: 287 EFSLVLDLDETLVHC----SLNELEDAALTFPVLFQDVIYQV----YVRLRPFFREFLER 338
Query: 128 ASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
S + +I L T S + YA+ + +LD + R+
Sbjct: 339 MSQIYEIILFTASKKVYADKLLNILDPKKRLVRHRL 374
>gi|452819366|gb|EME26426.1| CTD small phosphatase like isoform 1 [Galdieria sulphuraria]
gi|452819367|gb|EME26427.1| CTD small phosphatase like isoform 2 [Galdieria sulphuraria]
Length = 490
Score = 48.1 bits (113), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 55/207 (26%), Positives = 88/207 (42%), Gaps = 42/207 (20%)
Query: 66 ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLV--KLRPFVRT 123
+ ++ LVL+LD TL+HC S+ I ++ + LV K RPF+
Sbjct: 285 DPQITLVLDLDETLVHCSTDPCQSA----------DLIFPVYFGGTEYLVYAKKRPFLDY 334
Query: 124 FLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED---FNGKDRKNPDLV 180
FL + ++ + T S + YA+ + LLD + YF R R+ G K+ ++
Sbjct: 335 FLSEIRKYFEVIVFTASQQAYADTILNLLDPEGSYFRHRAF-RDSCVFIEGNFLKDLRVL 393
Query: 181 RGQERGIVILDDTESVWSDHTENLIVLGKYVYFRDKELNGDHKSYSETLTDESENEEALA 240
VILD++ + EN I + +V D+SE+ E L
Sbjct: 394 GRDLSKCVILDNSPQAFGLQVENGIPITTWV-------------------DDSEDRE-LL 433
Query: 241 NVLRVLKTIHRLFFDSVCGDVRTYLPK 267
++L LK + S C DVR +L K
Sbjct: 434 DLLPFLKQL------SNCEDVRPFLSK 454
>gi|61098234|ref|NP_001012790.1| CTD small phosphatase-like protein 2 [Gallus gallus]
gi|60098613|emb|CAH65137.1| hypothetical protein RCJMB04_4a24 [Gallus gallus]
Length = 468
Score = 48.1 bits (113), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 33/93 (35%), Positives = 49/93 (52%), Gaps = 8/93 (8%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
LVL+LD TL+HC SL+ E F ++Q+ V+LRPF R FLE+ S
Sbjct: 292 LVLDLDETLVHC----SLNELEDAALTFPVLFQDVIYQV----YVRLRPFFREFLERMSQ 343
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
+ +I L T S + YA+ + +LD + R+
Sbjct: 344 IYEIILFTASKKVYADKLLNILDPKKQLVRHRL 376
>gi|426233772|ref|XP_004010888.1| PREDICTED: CTD small phosphatase-like protein 2 [Ovis aries]
Length = 466
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 33/93 (35%), Positives = 49/93 (52%), Gaps = 8/93 (8%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
LVL+LD TL+HC SL+ E F ++Q+ V+LRPF R FLE+ S
Sbjct: 290 LVLDLDETLVHC----SLNELEDAALTFPVLFQDVIYQV----YVRLRPFFREFLERMSQ 341
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
+ +I L T S + YA+ + +LD + R+
Sbjct: 342 MYEIILFTASKKVYADKLLNILDPKKQLVRHRL 374
>gi|326926934|ref|XP_003209651.1| PREDICTED: CTD small phosphatase-like protein 2-like [Meleagris
gallopavo]
Length = 468
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 33/93 (35%), Positives = 49/93 (52%), Gaps = 8/93 (8%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
LVL+LD TL+HC SL+ E F ++Q+ V+LRPF R FLE+ S
Sbjct: 292 LVLDLDETLVHC----SLNELEDAALTFPVLFQDVIYQV----YVRLRPFFREFLERMSQ 343
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
+ +I L T S + YA+ + +LD + R+
Sbjct: 344 IYEIILFTASKKVYADKLLNILDPKKQLVRHRL 376
>gi|224062995|ref|XP_002187586.1| PREDICTED: CTD small phosphatase-like protein 2 [Taeniopygia
guttata]
Length = 467
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 33/93 (35%), Positives = 49/93 (52%), Gaps = 8/93 (8%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
LVL+LD TL+HC SL+ E F ++Q+ V+LRPF R FLE+ S
Sbjct: 291 LVLDLDETLVHC----SLNELEDAALTFPVLFQDVIYQV----YVRLRPFFREFLERMSQ 342
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
+ +I L T S + YA+ + +LD + R+
Sbjct: 343 IYEIILFTASKKVYADKLLNILDPKKQLVRHRL 375
>gi|187471087|sp|Q5F3Z7.2|CTSL2_CHICK RecName: Full=CTD small phosphatase-like protein 2;
Short=CTDSP-like 2
Length = 466
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 33/93 (35%), Positives = 49/93 (52%), Gaps = 8/93 (8%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
LVL+LD TL+HC SL+ E F ++Q+ V+LRPF R FLE+ S
Sbjct: 290 LVLDLDETLVHC----SLNELEDAALTFPVLFQDVIYQV----YVRLRPFFREFLERMSQ 341
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
+ +I L T S + YA+ + +LD + R+
Sbjct: 342 IYEIILFTASKKVYADKLLNILDPKKQLVRHRL 374
>gi|117606236|ref|NP_001071012.1| CTD small phosphatase-like protein 2-A [Danio rerio]
gi|123884286|sp|Q08BB5.1|CTL2A_DANRE RecName: Full=CTD small phosphatase-like protein 2-A;
Short=CTDSP-like 2-A
gi|115528634|gb|AAI24795.1| Zgc:154017 [Danio rerio]
Length = 469
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 52/175 (29%), Positives = 80/175 (45%), Gaps = 26/175 (14%)
Query: 68 KLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQ 127
+ LVL+LD TL+HC SL+ E F ++Q+ V+LRPF R FLE+
Sbjct: 290 EFSLVLDLDETLVHC----SLNELEDAALTFPVLFQDVIYQV----YVRLRPFFREFLER 341
Query: 128 ASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED---FNGKDRKNPDLVRGQE 184
S + +I L T S + YA+ + +LD + R+ RE G K+ +++
Sbjct: 342 MSQIYEIILFTASKKVYADKLLNILDPKKQLVRHRLF-REHCVCVQGNYIKDLNILGRDL 400
Query: 185 RGIVILDDTESVWSDHTENLIV------------LGKYVYFRDK--ELNGDHKSY 225
VI+D++ ++ N I L K V F +K ELN D + Y
Sbjct: 401 SKTVIIDNSPQAFAYQLSNGIPIESWFVDKNDNELLKLVPFLEKLVELNEDVRPY 455
>gi|26343511|dbj|BAC35412.1| unnamed protein product [Mus musculus]
Length = 464
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 33/93 (35%), Positives = 49/93 (52%), Gaps = 8/93 (8%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
LVL+LD TL+HC SL+ E F ++Q+ V+LRPF R FLE+ S
Sbjct: 288 LVLDLDETLVHC----SLNELEDAALTFPVLFQDVIYQV----YVRLRPFFREFLERMSQ 339
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
+ +I L T S + YA+ + +LD + R+
Sbjct: 340 MYEIILFTASKKVYADKLLNILDPKKQLVRHRL 372
>gi|145503264|ref|XP_001437609.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124404760|emb|CAK70212.1| unnamed protein product [Paramecium tetraurelia]
Length = 480
Score = 47.8 bits (112), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 50/167 (29%), Positives = 82/167 (49%), Gaps = 26/167 (15%)
Query: 64 QEERKLQ--LVLNLDHTLLHC---RNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLR 118
Q+ K Q +V +LD TL+HC +NIKS + YL S G Q + +R
Sbjct: 283 QKNTKFQKTVVFDLDETLIHCNENQNIKS----DVYLPITFPS--GDTVQAG----INIR 332
Query: 119 PFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLD----LDSKYF-SSRIIAREDFNGKD 173
P+ + L S + ++ + T S +CYA ++ LD L ++ F S I+ + + KD
Sbjct: 333 PWAKQILNLLSEVCEVVVFTASHQCYASQVIQFLDQKKILSAQLFRESCIVTNDGVHIKD 392
Query: 174 RKNPDLVRGQE-RGIVILDDTESVWSDHTENLI-VLGKYVYFRDKEL 218
+ V G++ + IV++D+ + H EN I ++ Y DKEL
Sbjct: 393 LR----VLGRDMKDIVLIDNAAYSFGYHIENGIPIIPYYDNKEDKEL 435
>gi|145552384|ref|XP_001461868.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124429704|emb|CAK94495.1| unnamed protein product [Paramecium tetraurelia]
Length = 411
Score = 47.8 bits (112), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 38/138 (27%), Positives = 59/138 (42%), Gaps = 21/138 (15%)
Query: 29 HTTVRDSRCIFCSQAMNDSFGLSFDYMLRGLRYSEQEERKLQLVLNLDHTLLHCRNIKSL 88
H T + C F Q ND R + ++ +R+L L +LD TL+HC S+
Sbjct: 179 HQTYQGLNCRFFPQNNND--------YNRSHKLPKKHQRQLTLFFDLDETLVHCNETPSI 230
Query: 89 SSG---EKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASSLVDIYLCTMSTRCYA 145
E + K H + + + +RP+ + L+ S+ +I + T S CYA
Sbjct: 231 PCDVVLEINVSK--HQIVKAG--------INVRPYAKEMLKNLSNHFEIIVFTASHSCYA 280
Query: 146 EAAVKLLDLDSKYFSSRI 163
E LD DS S R+
Sbjct: 281 EKVCNHLDPDSTIISHRL 298
>gi|114108339|gb|AAI23380.1| Ctdspl2a protein [Xenopus laevis]
Length = 536
Score = 47.8 bits (112), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 45/155 (29%), Positives = 75/155 (48%), Gaps = 13/155 (8%)
Query: 68 KLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQ 127
+ LVL+LD TL+HC SL+ E F ++Q+ V+LRPF R FLE+
Sbjct: 357 EFSLVLDLDETLVHC----SLNELEDAALTFPVLFQDVIYQV----YVRLRPFFREFLER 408
Query: 128 ASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED---FNGKDRKNPDLVRGQE 184
S + +I L T S + YA+ + +LD + R+ RE G K+ +++
Sbjct: 409 MSQIYEIILFTASKKVYADKLLNILDPKKRLVRHRLF-REHCVCVQGNYIKDLNILGRDL 467
Query: 185 RGIVILDDTESVWSDHTENLIVLGKYVYFR-DKEL 218
+I+D++ ++ N I + + + DKEL
Sbjct: 468 SKTIIIDNSPQAFAYQLSNGIPIESWFMDKNDKEL 502
>gi|281338163|gb|EFB13747.1| hypothetical protein PANDA_001000 [Ailuropoda melanoleuca]
Length = 445
Score = 47.8 bits (112), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 33/93 (35%), Positives = 49/93 (52%), Gaps = 8/93 (8%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
LVL+LD TL+HC SL+ E F ++Q+ V+LRPF R FLE+ S
Sbjct: 290 LVLDLDETLVHC----SLNELEDAALTFPVLFQDVIYQV----YVRLRPFFREFLERMSQ 341
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
+ +I L T S + YA+ + +LD + R+
Sbjct: 342 MYEIILFTASKKVYADKLLNILDPKKQLVRHRL 374
>gi|432851772|ref|XP_004067077.1| PREDICTED: CTD small phosphatase-like protein 2-A-like [Oryzias
latipes]
Length = 474
Score = 47.8 bits (112), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 45/155 (29%), Positives = 74/155 (47%), Gaps = 14/155 (9%)
Query: 68 KLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQ 127
+ LVL+LD TL+HC SL+ E F ++Q+ V+LRPF R FLE+
Sbjct: 295 EFSLVLDLDETLVHC----SLNELEDAALTFPVLFQDVIYQV----YVRLRPFFREFLER 346
Query: 128 ASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED---FNGKDRKNPDLVRGQE 184
S + +I L T S + YA+ + +LD + R+ RE G K+ +++
Sbjct: 347 MSQIYEIILFTASKKVYADKLLNILDPKKQLVRHRLF-REHCVCVQGNYIKDLNILGRDL 405
Query: 185 RGIVILDDTESVWSDHTENLIVLGKYVYFRDKELN 219
+I+D++ ++ N I + +F DK N
Sbjct: 406 SKTIIIDNSPQAFAYQLSNGIPIES--WFMDKNDN 438
>gi|225718796|gb|ACO15244.1| Probable C-terminal domain small phosphatase [Caligus clemensi]
Length = 314
Score = 47.8 bits (112), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 40/153 (26%), Positives = 69/153 (45%), Gaps = 12/153 (7%)
Query: 58 GLRYSEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKL 117
L + + LVL+LD TL+HC +++ L +F V+
Sbjct: 124 ALPLKTRSSPRFSLVLDLDETLVHC-SLQELDDASLSFPVVFQDTTYRVF-------VRT 175
Query: 118 RPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED---FNGKDR 174
RP +R FLE+ S ++ L T S + YA+ + LLD + K+ R+ RE NG
Sbjct: 176 RPRIREFLERVSKNFEVTLFTASKKVYADKLLNLLDPERKWIKYRLF-REHCVCVNGNYI 234
Query: 175 KNPDLVRGQERGIVILDDTESVWSDHTENLIVL 207
K+ +++ +I+D++ + EN I +
Sbjct: 235 KDLNILGRDLFKTIIIDNSPQAFGYQLENGIPI 267
>gi|198474069|ref|XP_002132618.1| GA25924 [Drosophila pseudoobscura pseudoobscura]
gi|198138234|gb|EDY70020.1| GA25924 [Drosophila pseudoobscura pseudoobscura]
Length = 306
Score = 47.8 bits (112), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 47/171 (27%), Positives = 77/171 (45%), Gaps = 12/171 (7%)
Query: 50 LSFDYMLRGLRYSEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIH-SFIGSLFQ 108
L DYM + +K LVL+LD TL+ +K G + KK+ ++ F+
Sbjct: 87 LHGDYMTSCSKRKLTLVKKKTLVLDLDETLMTSVFVKKGVKGGRGSKKKCKWHYVPVDFE 146
Query: 109 M-ANDKLVKL--RPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLD-----LDSKYFS 160
+D VK+ RPFV FL+Q S DI + T T YA + LD L + F
Sbjct: 147 FNLHDSTVKVYKRPFVDHFLDQVSKWFDIVVFTAGTEPYATPIIDYLDGGRNILGHRLFR 206
Query: 161 SRIIAREDFNGKDRKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYV 211
+ + + FN K +V + +++LD++ + +N I + Y+
Sbjct: 207 DKCVTVQGFNA---KFVSIVNDDKANVILLDNSIPECCFNVDNSIPIFDYI 254
>gi|410912504|ref|XP_003969729.1| PREDICTED: CTD small phosphatase-like protein 2-like [Takifugu
rubripes]
Length = 474
Score = 47.4 bits (111), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 33/96 (34%), Positives = 50/96 (52%), Gaps = 8/96 (8%)
Query: 68 KLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQ 127
+ LVL+LD TL+HC SL+ E F ++Q+ V+LRPF R FLE+
Sbjct: 295 EFSLVLDLDETLVHC----SLNELEDAALTFPVLFQDVIYQV----YVRLRPFFREFLER 346
Query: 128 ASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
S + +I L T S + YA+ + +LD + R+
Sbjct: 347 MSQIYEIILFTASKKVYADKLLNILDPKKQLVRHRL 382
>gi|149490347|ref|XP_001511004.1| PREDICTED: CTD small phosphatase-like protein 2-like
[Ornithorhynchus anatinus]
Length = 374
Score = 47.4 bits (111), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 32/83 (38%), Positives = 46/83 (55%), Gaps = 8/83 (9%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
LVL+LD TL+HC SL+ E F ++Q+ V+LRPF R FLE+ S
Sbjct: 293 LVLDLDETLVHC----SLNELEDAALTFPVLFQDVIYQV----YVRLRPFFREFLERMSQ 344
Query: 131 LVDIYLCTMSTRCYAEAAVKLLD 153
+ +I L T S + YA+ + +LD
Sbjct: 345 IYEIILFTASKKVYADKLLNILD 367
>gi|195122938|ref|XP_002005967.1| GI20773 [Drosophila mojavensis]
gi|193911035|gb|EDW09902.1| GI20773 [Drosophila mojavensis]
Length = 313
Score = 47.4 bits (111), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 44/170 (25%), Positives = 70/170 (41%), Gaps = 33/170 (19%)
Query: 71 LVLNLDHTLLH-CRNIKSLSSGEKYLKKQIHSFIGSLFQ--------------MANDKLV 115
L+L+LD TL+H C YL + H +G F +AN +
Sbjct: 123 LILDLDETLVHSC-----------YLDPETHDVVGCTFVPQTAVPDYILNIPILANLSPI 171
Query: 116 KL----RPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNG 171
+ RP+V FL+ S D+ + T S + YA + LD R + N
Sbjct: 172 EFQVFKRPYVDLFLDLVSKWYDVVIYTASLQAYASIVIDKLDAGRGILQRRFYRQHCVNT 231
Query: 172 KD--RKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVY-FRDKEL 218
KN +V ++I+D++ S + D EN + + Y+Y D+EL
Sbjct: 232 SSLVSKNLFVVNRDLNSVLIIDNSPSAYRDFPENALPIKSYIYDPNDREL 281
>gi|66361684|ref|XP_627365.1| RNA pol II carboxy terminal domain phosphatase of the HAD
superfamily with a BRCT domain at the C-terminus
[Cryptosporidium parvum Iowa II]
gi|46228744|gb|EAK89614.1| RNA pol II carboxy terminal domain phosphatase of the HAD
superfamily with a BRCT domain at the C-terminus
[Cryptosporidium parvum Iowa II]
Length = 762
Score = 47.4 bits (111), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 38/119 (31%), Positives = 61/119 (51%), Gaps = 5/119 (4%)
Query: 116 KLRPFVRTFLEQASS-LVDIYLCTMSTRCYAEAAVKLLDLDSKYF-SSRIIARED-FNGK 172
KLRP V L S +IY+ TM T +A ++++LD + ++F S RI R + F
Sbjct: 350 KLRPGVINMLRTLSKDKYEIYMYTMGTEYHAYTSLRILDPELRFFHSKRIFYRNNGFKET 409
Query: 173 DRKNPD-LVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRDKELNGDHKSYSETLT 230
K+ + L R +VILDD E W+D +L+ + Y +F + D S+S ++
Sbjct: 410 SIKSLNTLFPYDHRTLVILDDIEQAWTD-INSLLKVYPYNFFPSNSIPNDSSSFSRYIS 467
>gi|348509633|ref|XP_003442352.1| PREDICTED: CTD small phosphatase-like protein 2-A-like [Oreochromis
niloticus]
Length = 476
Score = 47.4 bits (111), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 33/96 (34%), Positives = 50/96 (52%), Gaps = 8/96 (8%)
Query: 68 KLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQ 127
+ LVL+LD TL+HC SL+ E F ++Q+ V+LRPF R FLE+
Sbjct: 297 EFSLVLDLDETLVHC----SLNELEDAALTFPVLFQDVIYQV----YVRLRPFFREFLER 348
Query: 128 ASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
S + +I L T S + YA+ + +LD + R+
Sbjct: 349 MSQIYEIILFTASKKVYADKLLNILDPKKQLVRHRL 384
>gi|209882178|ref|XP_002142526.1| NLI interacting factor-like phosphatase family protein
[Cryptosporidium muris RN66]
gi|209558132|gb|EEA08177.1| NLI interacting factor-like phosphatase family protein
[Cryptosporidium muris RN66]
Length = 710
Score = 47.4 bits (111), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 31/114 (27%), Positives = 57/114 (50%), Gaps = 4/114 (3%)
Query: 116 KLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKD-- 173
KLRP V L + ++Y+ TM T +A +A++++D + ++F + + + KD
Sbjct: 297 KLRPGVLNMLRRLKDKFELYMYTMGTELHAYSALRIIDPEFRFFHPKRLFYRNNGFKDCN 356
Query: 174 -RKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRDKELNGDHKSYS 226
+ L R ++++DD E WS ++ +LI + Y +F L D YS
Sbjct: 357 SKSLSTLFPYDHRTLIVIDDIEQAWS-NSNSLIKVYPYNFFPSAPLPVDASCYS 409
>gi|196002271|ref|XP_002111003.1| hypothetical protein TRIADDRAFT_15923 [Trichoplax adhaerens]
gi|190586954|gb|EDV27007.1| hypothetical protein TRIADDRAFT_15923, partial [Trichoplax
adhaerens]
Length = 174
Score = 47.4 bits (111), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 37/146 (25%), Positives = 71/146 (48%), Gaps = 17/146 (11%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLE 126
+K ++++LD TL+H + K + + + + +I + + +++ + RP + FLE
Sbjct: 13 KKKCVIIDLDETLVHS-SFKPVKNADYIVPVEIDNIVHTVYVLK-------RPHIDKFLE 64
Query: 127 QASSLVDIYLCTMSTRCYAEAAVKLLD----LDSKYFSSRIIAREDFNGKDRKNPDLVRG 182
+ L + L T S YAE KLLD D+K + + F KD G
Sbjct: 65 RMGQLFECVLFTASVSKYAEPVSKLLDKWNVFDNKLYRESCVYNRGFYVKDLSK----LG 120
Query: 183 QE-RGIVILDDTESVWSDHTENLIVL 207
++ + VILD++ + ++ H EN + +
Sbjct: 121 RDLKSTVILDNSPTSYAFHPENAVPI 146
>gi|291234069|ref|XP_002736972.1| PREDICTED: CTD (carboxy-terminal domain, RNA polymerase II,
polypeptide A) small phosphatase 1-like [Saccoglossus
kowalevskii]
Length = 251
Score = 47.4 bits (111), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 49/195 (25%), Positives = 91/195 (46%), Gaps = 24/195 (12%)
Query: 18 KRKCEQSLSCAHTTVRDSRCIFCSQAMNDSFGLSFDYMLRGLRYSEQEERKLQLVLNLDH 77
K + + SL +R + C+ S+ Y+L +R+SE KL +V++LD
Sbjct: 25 KLRLKSSLYAIDMYIRHAPCLSQSK-----------YLLPEVRHSEMH--KLCIVIDLDE 71
Query: 78 TLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASSLVDIYLC 137
TL+H + K +S+ + + +I + ++ + RPFV FL++ L + L
Sbjct: 72 TLVH-SSFKPVSNADFVVPVEIDGTVHQVYVLK-------RPFVDEFLQKMGELFECVLF 123
Query: 138 TMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVR-GQE-RGIVILDDTES 195
T S YA+ LLD F +R+ + DL R G++ + IVI+D++ +
Sbjct: 124 TASLSKYADPVADLLD-KWGVFRARLFRDSCVFHRGNYVKDLGRLGRDLKKIVIVDNSPA 182
Query: 196 VWSDHTENLIVLGKY 210
+ H +N + + +
Sbjct: 183 SYIFHPDNAVPVASW 197
>gi|391328122|ref|XP_003738541.1| PREDICTED: CTD small phosphatase-like protein 2-like [Metaseiulus
occidentalis]
Length = 236
Score = 47.4 bits (111), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 32/87 (36%), Positives = 48/87 (55%), Gaps = 10/87 (11%)
Query: 68 KLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKL-VKLRPFVRTFLE 126
+ LVL+LD TL+HC ++ L+ +F LFQ K+ V+ RPF R FLE
Sbjct: 57 EFSLVLDLDETLVHCSLME--------LEGATFTF-PVLFQGIEYKVYVRTRPFFREFLE 107
Query: 127 QASSLVDIYLCTMSTRCYAEAAVKLLD 153
+ S + ++ L T S + YA+ + LLD
Sbjct: 108 RVSKMFEVILFTASKKVYADKLLDLLD 134
>gi|397621029|gb|EJK66064.1| hypothetical protein THAOC_13029, partial [Thalassiosira oceanica]
Length = 518
Score = 47.4 bits (111), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 29/98 (29%), Positives = 50/98 (51%), Gaps = 8/98 (8%)
Query: 66 ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFL 125
+ + LVL+LD TL+HC + + + + F G +Q+ V+ RPF+R FL
Sbjct: 267 DPPVTLVLDLDETLVHC-TVDPVDDPDMVFGVE---FNGIDYQVH----VRYRPFLREFL 318
Query: 126 EQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
E S ++ + T S + YA+ + +D + KY R+
Sbjct: 319 EAVSERFEVVVFTASQQVYADKLLDRIDPEGKYIKHRM 356
>gi|170050634|ref|XP_001861399.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167872200|gb|EDS35583.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 627
Score = 47.4 bits (111), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 34/88 (38%), Positives = 47/88 (53%), Gaps = 12/88 (13%)
Query: 68 KLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSF-IGSLFQMAN-DKLVKLRPFVRTFL 125
+ LVL+LD TL+HC +++ LS SF LFQ V+ RPF R FL
Sbjct: 488 EFSLVLDLDETLVHC-SLQELSDA---------SFKFPVLFQECQYTVFVRTRPFFREFL 537
Query: 126 EQASSLVDIYLCTMSTRCYAEAAVKLLD 153
E+ S + ++ L T S R YA+ + LLD
Sbjct: 538 EKVSQIFEVILFTASKRVYADKLLNLLD 565
>gi|156095526|ref|XP_001613798.1| nif-like protein [Plasmodium vivax Sal-1]
gi|148802672|gb|EDL44071.1| nif-like protein, putative [Plasmodium vivax]
Length = 327
Score = 47.4 bits (111), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 43/158 (27%), Positives = 74/158 (46%), Gaps = 24/158 (15%)
Query: 69 LQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFI-GSLFQMANDKLVKLRPFVRTFLEQ 127
+ LVL+LD TL++C K S +K++ I G F + K RP++ F
Sbjct: 58 MTLVLDLDETLIYCTKKKKFSH-----QKEVDVLINGRYFSLYVCK----RPYIDLFFSV 108
Query: 128 ASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED-FNGKDR---KNPDLVRGQ 183
+ +I + T S + YA+A + ++D+D ++ + RED F + KN ++ +
Sbjct: 109 LNPFFEIVIFTTSIKSYADAVLNIIDVD--HYVDKKFYREDCFEVNQKIYLKNLQSIKKE 166
Query: 184 ERGIVILDDTESVWSDHTENLIVLGKYVYFRDKELNGD 221
IV++DD+ + EN YF K+ GD
Sbjct: 167 ISRIVLVDDSNVSGLKYPEN--------YFPIKKWQGD 196
>gi|118368774|ref|XP_001017593.1| NLI interacting factor-like phosphatase family protein [Tetrahymena
thermophila]
gi|89299360|gb|EAR97348.1| NLI interacting factor-like phosphatase family protein [Tetrahymena
thermophila SB210]
Length = 1131
Score = 47.0 bits (110), Expect = 0.011, Method: Composition-based stats.
Identities = 39/149 (26%), Positives = 68/149 (45%), Gaps = 16/149 (10%)
Query: 62 SEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFV 121
S Q + K L+L+LD TL+H +K KK +F + VK RP V
Sbjct: 163 SPQNKMKKTLILDLDETLIHSSQMKP--------KKYDLNFNIQTSTTKEEFFVKFRPNV 214
Query: 122 RTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRII-----AREDFNGKDRKN 176
FL ++ ++++ T S + YA+ + LD + S R+ + D+ KD
Sbjct: 215 SNFLRIMANYYEVFIWTASIKEYADVIINQLDPSGSFISYRLYRDSCRKKGDYYIKDLA- 273
Query: 177 PDLVRGQERGIVILDDTESVWSDHTENLI 205
L+ + ++I+D+ + ++ H EN I
Sbjct: 274 --LLNRNMKDVIIIDNLSTCFNLHQENGI 300
>gi|145509220|ref|XP_001440554.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124407771|emb|CAK73157.1| unnamed protein product [Paramecium tetraurelia]
Length = 489
Score = 47.0 bits (110), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 43/159 (27%), Positives = 75/159 (47%), Gaps = 21/159 (13%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKL----VKLRPFVRTFLE 126
LVL+LD TL+HC ++Q+ QM N ++ + +RP+ + FL
Sbjct: 300 LVLDLDETLMHCNE-----------QQQMKFDFKIPIQMPNGQVHEAGISVRPYAQQFLS 348
Query: 127 QASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED----FNGKDRKNPDLVRG 182
+ S +I + T S + YA+ + LD K+ S R+ RE+ G K+ ++
Sbjct: 349 ECSKHFEIIIFTASHQLYADKIIDKLDPSRKWVSHRLY-RENCIQTQQGIYVKDLRIINR 407
Query: 183 QERGIVILDDTESVWSDHTENLIVLGKYV-YFRDKELNG 220
+ IV++D+ ++ EN I + Y+ +D EL G
Sbjct: 408 DLKDIVLIDNAAYSYAFQIENGIPIIPYIDNVKDIELLG 446
>gi|407043726|gb|EKE42114.1| NLI interacting factor family phosphatase domain containing protein
[Entamoeba nuttalli P19]
Length = 428
Score = 47.0 bits (110), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 70/280 (25%), Positives = 118/280 (42%), Gaps = 64/280 (22%)
Query: 27 CAHTTVRDSR-CIFCSQAMND---------SFGLSFDY---MLRGLRYSEQEERKLQLVL 73
C H + D C+ C Q + D +G++ Y R + +E+KL L+L
Sbjct: 7 CPHNKINDQNYCVDCYQLIEDVDDYIRTSGGYGITKSYAEEQKRSVSEKLLKEKKLSLIL 66
Query: 74 NLDHTLLHCRN--IKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASSL 131
+LD T++ L S E+ + + F + + L++ R + TF+E+ S L
Sbjct: 67 DLDGTIVFTNPELCIPLESEEESITPE-QGFYFEIPEQNAKVLIRFRDGIVTFMEKVSKL 125
Query: 132 VDIYLCTMSTRCYAEAAV----KLLDL-------------------DSKYFSSRIIARED 168
DI++ T+ + YA A V KL D+ D K + +I RE+
Sbjct: 126 YDIHVVTLGQKEYAFAIVNAINKLRDVPFITGDLVTAEDCSSVIVCDEKDTNDGLIDREE 185
Query: 169 FNGK---DRKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRDKELNGDHKSY 225
N + R P + G+E VI+DD VW + +N++ + +YV
Sbjct: 186 TNERRSVKRSIPTM--GKEEMQVIVDDRIDVWDN--KNVVQICEYV-------------- 227
Query: 226 SETLTDESENEEALANVLRVLKTIHRLFFDSVCGDVRTYL 265
T++ + E L V VL+ I+ F+D DV+ L
Sbjct: 228 --PSTNQVDTE--LVRVTEVLQNIYTKFYDEHIEDVKEIL 263
>gi|281204241|gb|EFA78437.1| hypothetical protein PPL_09089 [Polysphondylium pallidum PN500]
Length = 1252
Score = 47.0 bits (110), Expect = 0.011, Method: Composition-based stats.
Identities = 32/114 (28%), Positives = 54/114 (47%), Gaps = 16/114 (14%)
Query: 115 VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNG--- 171
VK+RP+ TFL+ L +I L +++ + Y V+++D SK II E F
Sbjct: 935 VKIRPYTITFLKTLYPLFNITLFSLNHKSYVNKMVEIID-PSKTLFKNIITIESFGDNIP 993
Query: 172 KDRKN-------PDLVRG-----QERGIVILDDTESVWSDHTENLIVLGKYVYF 213
K + N P IV++DD E +W +NLI++ ++++F
Sbjct: 994 KQQTNRPYSLFTPSNFSSIFKIDSSESIVVIDDREDIWRQFRDNLIMVERFIHF 1047
>gi|340500514|gb|EGR27383.1| NLI interacting factor-like phosphatase family protein, putative
[Ichthyophthirius multifiliis]
Length = 345
Score = 47.0 bits (110), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 34/106 (32%), Positives = 59/106 (55%), Gaps = 6/106 (5%)
Query: 116 KLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFS---SRIIAREDFNGK 172
++RP+ + FLE DIY+ T S+ YA A VK LD + KY + +R E NG
Sbjct: 201 RVRPYCKEFLETMVQYWDIYVFTASSPSYASAIVKFLDSEGKYINGILNRSNCMETKNGF 260
Query: 173 DRKNPDLVRGQE-RGIVILDDTESVWSDHTENLIVLGKYVYFRDKE 217
K+ +++G++ + +VI+D+ + EN I + + +F+DK+
Sbjct: 261 FIKDLRILKGKDLKKMVIVDNLAHSFGFQIENGIPILE--WFQDKK 304
>gi|67588036|ref|XP_665317.1| hypothetical protein [Cryptosporidium hominis TU502]
gi|54655944|gb|EAL35087.1| hypothetical protein Chro.80553 [Cryptosporidium hominis]
Length = 364
Score = 47.0 bits (110), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 38/119 (31%), Positives = 61/119 (51%), Gaps = 5/119 (4%)
Query: 116 KLRPFVRTFLEQASS-LVDIYLCTMSTRCYAEAAVKLLDLDSKYF-SSRIIARED-FNGK 172
KLRP V L S +IY+ TM T +A ++++LD + ++F S RI R + F
Sbjct: 183 KLRPGVINMLRTLSKDKYEIYMYTMGTEYHAYTSLRILDPELRFFHSKRIFYRNNGFKET 242
Query: 173 DRKNPD-LVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRDKELNGDHKSYSETLT 230
K+ + L R +VILDD E W+D +L+ + Y +F + D S+S ++
Sbjct: 243 SIKSLNTLFPYDHRTLVILDDIEQAWTD-INSLLKVYPYNFFPSNSIPNDSSSFSRYIS 300
>gi|357130565|ref|XP_003566918.1| PREDICTED: uncharacterized protein LOC100830008 [Brachypodium
distachyon]
Length = 510
Score = 46.6 bits (109), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 42/160 (26%), Positives = 73/160 (45%), Gaps = 22/160 (13%)
Query: 59 LRYSEQEERKLQLVLNLDHTLLHCR----NIKSLSSGEKYLKKQIHSFIGSLFQMANDKL 114
L+ S + + LVL+LD TL+H +I + I F M + +
Sbjct: 325 LQKSPVRTKHVTLVLDLDETLVHSTLDHCDIADFT-------------IQVFFNMKDHTV 371
Query: 115 -VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED---FN 170
V+ RP ++ FLE+ + + ++ + T S + YAE + LD D K S RI RE +
Sbjct: 372 YVRQRPHLKMFLEKVAQMFELVIFTASQKIYAEQIIDRLDPDGKLISQRIY-RESCIFSD 430
Query: 171 GKDRKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKY 210
G K+ ++ + I+D+T V+ +N I + +
Sbjct: 431 GSYTKDLTILGVHLAKVAIIDNTPQVFQLQVDNGIPIKSW 470
>gi|195147580|ref|XP_002014757.1| GL19342 [Drosophila persimilis]
gi|194106710|gb|EDW28753.1| GL19342 [Drosophila persimilis]
Length = 274
Score = 46.6 bits (109), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 47/171 (27%), Positives = 76/171 (44%), Gaps = 12/171 (7%)
Query: 50 LSFDYMLRGLRYSEQEERKLQLVLNLDHTLLHCRNIK-SLSSGEKYLKKQIHSFIGSLFQ 108
L DYM + +K LVL+LD TL+ +K + G KK ++ F+
Sbjct: 55 LHGDYMTSCSKRKLTLVKKKTLVLDLDETLMTSVFVKKGVKGGRGSQKKCKWHYVPVDFE 114
Query: 109 M-ANDKLVKL--RPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLD-----LDSKYFS 160
+D VK+ RPFV FL+Q S DI + T T YA + LD L + F
Sbjct: 115 FNLHDSTVKVYKRPFVDHFLDQVSKWFDIVVFTAGTEPYATPIIDYLDGGRNILGHRLFR 174
Query: 161 SRIIAREDFNGKDRKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYV 211
+ + + FN K +V + +++LD++ + +N I + Y+
Sbjct: 175 DKCVTVQGFNA---KFVSIVNDDKANVILLDNSIPECCFNMDNSIPIFDYI 222
>gi|148233948|ref|NP_001082795.1| CTD small phosphatase-like protein 2-B [Danio rerio]
gi|187471000|sp|A4QNX6.1|CTL2B_DANRE RecName: Full=CTD small phosphatase-like protein 2-B;
Short=CTDSP-like 2-B
gi|141796856|gb|AAI39561.1| Zgc:162265 protein [Danio rerio]
Length = 460
Score = 46.6 bits (109), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 44/86 (51%), Gaps = 8/86 (9%)
Query: 68 KLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQ 127
+ LVL+LD TL+HC ++ L I ++ V+LRPF R FLE+
Sbjct: 281 EFSLVLDLDETLVHC-SLNELDDAALTFPVLFQDVIYQVY-------VRLRPFFREFLER 332
Query: 128 ASSLVDIYLCTMSTRCYAEAAVKLLD 153
S + +I L T S + YA+ + +LD
Sbjct: 333 MSQIYEIILFTASKKVYADKLLNILD 358
>gi|430814217|emb|CCJ28521.1| unnamed protein product [Pneumocystis jirovecii]
Length = 352
Score = 46.6 bits (109), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 38/143 (26%), Positives = 69/143 (48%), Gaps = 9/143 (6%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
L+L+LD TL+H SL G + + + L + A V RP+ +FL + S
Sbjct: 178 LILDLDETLIH-----SLVKGGRITSGHMVEVM--LGKHAILYYVHKRPYCDSFLRKVSK 230
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE-DF-NGKDRKNPDLVRGQERGIV 188
++ + T S + YA+ + L+ D K F +R + F NG K+ +V+ ++
Sbjct: 231 WYNVVIFTASVQEYADPVIDWLEQDRKLFKARFYRQHCTFRNGAYIKDLSIVQPDLSKVI 290
Query: 189 ILDDTESVWSDHTENLIVLGKYV 211
I+D++ +S H N I + ++
Sbjct: 291 IIDNSPVSYSMHENNAIPIQAWI 313
>gi|403332687|gb|EJY65381.1| hypothetical protein OXYTRI_14465 [Oxytricha trifallax]
Length = 927
Score = 46.2 bits (108), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 29/110 (26%), Positives = 54/110 (49%), Gaps = 21/110 (19%)
Query: 63 EQEERKLQLVLNLDHTLLHCRN---------IKSLSSGEKYLKKQIHSFIGSLFQMANDK 113
+Q+++ L+L++D TL++CR I++ SS Q+ F
Sbjct: 468 KQQQKLYTLILDMDETLIYCRQNPYPGYQDIIQATSSAHNTYSCQVQIF----------- 516
Query: 114 LVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
RP +R FLEQ S + ++ + T S + YA+ + +D +++FS R+
Sbjct: 517 -TSYRPNLRKFLEQVSQIFEVVIFTASEKSYADLILDKIDPRNEFFSKRL 565
>gi|403353558|gb|EJY76317.1| NLI interacting factor-like phosphatase family protein [Oxytricha
trifallax]
Length = 1037
Score = 46.2 bits (108), Expect = 0.019, Method: Composition-based stats.
Identities = 51/204 (25%), Positives = 97/204 (47%), Gaps = 30/204 (14%)
Query: 23 QSLSCAHTTVRDSRCIFCSQAMNDSFGLSFDYMLRGLRYSEQEERKLQLVLNLDHTLLHC 82
Q++S HT +RD + + ++ Y+ L +K L+ ++D TL+HC
Sbjct: 620 QTISALHT-IRDKITMPSDEEIH--------YLKINLPTPNHPSKKKTLIFDMDETLIHC 670
Query: 83 RNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASSLVDIYLCTMSTR 142
+ + S + + I ++ N + +RP++ LE+A+ L + + T S +
Sbjct: 671 --VDDIESEDPDVIIPID--FPDEDEIVNAG-INIRPYLYECLEEANKLFQVIVFTASHK 725
Query: 143 CYAEAAVKLLDLDSKYFSSRII------AREDFNGKDRK---NPDLVRGQERGIVILDDT 193
YA+A + LD ++KYF R+ RE + KD + N DL + ++I+D++
Sbjct: 726 AYADAILDYLDPENKYFQYRLYRDNCVQTREGYYVKDLRIINNRDL-----KDLIIIDNS 780
Query: 194 ESVWSDHTENLIVLGKYVYFRDKE 217
+S H +N I + ++ DKE
Sbjct: 781 VFSFSFHIDNGIPI--IPFYADKE 802
>gi|224072608|ref|XP_002303804.1| predicted protein [Populus trichocarpa]
gi|222841236|gb|EEE78783.1| predicted protein [Populus trichocarpa]
Length = 244
Score = 46.2 bits (108), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 42/152 (27%), Positives = 71/152 (46%), Gaps = 13/152 (8%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
LVL+LD TL+H S + +F + + V+ RP++R F+E+ SS
Sbjct: 54 LVLDLDETLVH--------SALEPCNDADFTFPVNFNLQEHTVFVRCRPYLRDFMERVSS 105
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED---FNGKDRKNPDLVRGQERGI 187
L +I + T S YAE + +LD + F R+ RE G K+ ++ +
Sbjct: 106 LFEIIIFTASQSIYAEQLLNVLDPKRRIFRHRVF-RESCVFVEGNYLKDLSVLGRDLARV 164
Query: 188 VILDDTESVWSDHTENLIVLGKYVYFR-DKEL 218
+I+D++ + +N I + + R DKEL
Sbjct: 165 IIIDNSPQAFGFQVDNGIPIESWFEDRSDKEL 196
>gi|403223458|dbj|BAM41589.1| RNA polymerase II carboxyterminal domain phosphatase [Theileria
orientalis strain Shintoku]
Length = 268
Score = 46.2 bits (108), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 40/147 (27%), Positives = 70/147 (47%), Gaps = 19/147 (12%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKL--RPFVRTF 124
++ LVL+LD TL+H S E Y SF + Q +K + + RPFV F
Sbjct: 90 KRKTLVLDLDETLIHS----SFEPIENY------SFTLPIMQDGVEKKIYVGKRPFVDEF 139
Query: 125 LEQASSLVDIYLCTMSTRCYAEAAVKLLDLDS----KYFSSRIIAREDFNGKDRKNPDLV 180
L+ S + DI + T + YA+ + LD++ ++F I FNG K+ +V
Sbjct: 140 LKTTSKIYDIVIFTAGLKSYADPVIDQLDVNKVCKRRFFRDSCIY---FNGYYIKDLTIV 196
Query: 181 RGQERGIVILDDTESVWSDHTENLIVL 207
+ ++I+D++ + + + N I +
Sbjct: 197 TKSLKDVIIIDNSPACYCLNPNNAIPI 223
>gi|145553118|ref|XP_001462234.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124430072|emb|CAK94861.1| unnamed protein product [Paramecium tetraurelia]
Length = 474
Score = 46.2 bits (108), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 42/155 (27%), Positives = 77/155 (49%), Gaps = 18/155 (11%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
+V +LD TL+HC N + S + L S G + Q + +RP+ R L++ S
Sbjct: 286 IVFDLDETLIHC-NESNTSRSDISLPITFPS--GDIVQAG----INIRPWAREILQKLSE 338
Query: 131 LVDIYLCTMSTRCYAEAAVKLLD----LDSKYFSSR-IIAREDFNGKDRKNPDLVRGQE- 184
+ ++ + T S +CYA ++ +D + + F + I+ E + KD + + G++
Sbjct: 339 VCEVVIFTASHQCYASQVIESIDKNKVVSATLFRDKCIVTNEGVHIKDLR----ILGRDM 394
Query: 185 RGIVILDDTESVWSDHTENLI-VLGKYVYFRDKEL 218
+ IV++D+ + H EN I ++ Y DKEL
Sbjct: 395 KDIVLVDNAAYSFGVHIENGIPIIPYYDNKEDKEL 429
>gi|145513150|ref|XP_001442486.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124409839|emb|CAK75089.1| unnamed protein product [Paramecium tetraurelia]
Length = 425
Score = 46.2 bits (108), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 41/143 (28%), Positives = 72/143 (50%), Gaps = 16/143 (11%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKL-VKLRPFVRTFLEQAS 129
L+L+LD TL+H S ++ + + +G + A K+ + +RP+ FL+Q S
Sbjct: 245 LILDLDETLIH-------SCAQRENPQVYVTAVGDFGEEA--KIGINIRPYTSLFLQQLS 295
Query: 130 SLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAR----EDFNGKDRKNPDLVRGQE- 184
IY+ T S++ YA+A + LD +Y S I+ R E NG K+ L+ +E
Sbjct: 296 QYYTIYIYTASSQAYAQAIINYLDPTKQYISG-IMTRNNCMETKNGFFIKDLRLISNKEL 354
Query: 185 RGIVILDDTESVWSDHTENLIVL 207
+ ++I+D+ + EN I +
Sbjct: 355 KDMLIVDNLAHSFGFQIENGIPI 377
>gi|452823685|gb|EME30693.1| putative CTD small phosphatase [Galdieria sulphuraria]
Length = 397
Score = 46.2 bits (108), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 48/219 (21%), Positives = 98/219 (44%), Gaps = 43/219 (19%)
Query: 62 SEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFV 121
+E+ + K LVL+LD TL+H S + + L Q+ + LF VK+RP++
Sbjct: 197 TEEMKEKKTLVLDLDETLVHSGFEGSRETSDFVLSMQVENTNLQLF-------VKMRPYL 249
Query: 122 RTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPD--- 178
+ FL++ + +I + T S YA+ + L+ D+ + F +P+
Sbjct: 250 KEFLQEVTKHFEIVIFTASMVTYADPVIDLM-FDATGVAHIPETHRLFRESCEYDPETCS 308
Query: 179 -----LVRGQE-RGIVILDDTESVWSDHTENLIVLGKYVYFRDKELNGDHKSYSETLTDE 232
+ G++ + ++I+D++ + ++ + N I + ++
Sbjct: 309 FHKDLMALGRDIKKVIIVDNSPTAYTKNPYNAIPIPTWM--------------------N 348
Query: 233 SENEEALANVLRVLKTIHRLFFDSVCGDVRTYLPKVRSE 271
EN+ +L +VL +LKT+ DVRT L +++ +
Sbjct: 349 DENDHSLLDVLSILKTL------IPVQDVRTVLKQLKEQ 381
>gi|195996503|ref|XP_002108120.1| hypothetical protein TRIADDRAFT_18774 [Trichoplax adhaerens]
gi|190588896|gb|EDV28918.1| hypothetical protein TRIADDRAFT_18774, partial [Trichoplax
adhaerens]
Length = 208
Score = 46.2 bits (108), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 34/97 (35%), Positives = 51/97 (52%), Gaps = 10/97 (10%)
Query: 68 KLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMAN-DKLVKLRPFVRTFLE 126
+ LV++LD TL+HC SLS E +H I F+ N D V+LRP+ R FLE
Sbjct: 30 EFTLVIDLDETLVHC----SLSLLED---ANLHFPI--YFKNNNYDVYVRLRPYYREFLE 80
Query: 127 QASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
+ S + ++ L T S + YA + ++D K R+
Sbjct: 81 RVSKIYEVILFTASKKVYANKLMDIIDPGRKLVKHRL 117
>gi|302847022|ref|XP_002955046.1| hypothetical protein VOLCADRAFT_121370 [Volvox carteri f.
nagariensis]
gi|300259574|gb|EFJ43800.1| hypothetical protein VOLCADRAFT_121370 [Volvox carteri f.
nagariensis]
Length = 1180
Score = 45.8 bits (107), Expect = 0.024, Method: Composition-based stats.
Identities = 46/194 (23%), Positives = 90/194 (46%), Gaps = 31/194 (15%)
Query: 65 EERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTF 124
+ +++ LVL+LD TL+ + + H+ + + + ++ V LRP +R F
Sbjct: 562 DPQRMTLVLDLDGTLIASED-------------EPHAPVPFDYCVDEERFVWLRPGLRRF 608
Query: 125 LEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI-----IAREDF---NGKDRKN 176
L+ ++ L T + +A +A++ +D D F SR+ ++ +D+ R
Sbjct: 609 LDSVRPHFEVVLFTAAGESWATSALQRIDPDGVIFDSRLYRDHTVSHDDWPWVKDLSRLG 668
Query: 177 PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRDKELNGDHKSYSETLTDESENE 236
DL R +VI+DD ++ +N + + Y D +L G + E D ++
Sbjct: 669 RDLAR-----VVIVDDNPLMFMYQPDNALHVAAY----DPQLTGHNDDVLEQALDVLMHK 719
Query: 237 EALANVLR-VLKTI 249
+AN +R VL++I
Sbjct: 720 VLIANDVREVLRSI 733
>gi|387015310|gb|AFJ49774.1| CTD small phosphatase [Crotalus adamanteus]
Length = 466
Score = 45.8 bits (107), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 33/93 (35%), Positives = 48/93 (51%), Gaps = 8/93 (8%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
LVL+LD TL+HC SL+ E F ++Q+ V+LRPF R FLE S
Sbjct: 290 LVLDLDETLVHC----SLNELEDAALTFPVLFQDVIYQV----YVRLRPFFREFLECMSQ 341
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
+ +I L T S + YA+ + +LD + R+
Sbjct: 342 IYEIILFTASKKVYADKLLNILDPKKQLVRHRL 374
>gi|313226803|emb|CBY21948.1| unnamed protein product [Oikopleura dioica]
Length = 444
Score = 45.8 bits (107), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 39/140 (27%), Positives = 67/140 (47%), Gaps = 10/140 (7%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
LVL+LD TL+HC S E ++ +F + D VK RP++R FLE+
Sbjct: 255 LVLDLDETLVHC------SLCELQMRDYEFTFPIRFQNVDYDVYVKTRPYLRDFLERMCE 308
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED---FNGKDRKNPDLVRGQERGI 187
+I + T S + YA+ + ++D + K R+ RE G K+ ++
Sbjct: 309 HFEIIIFTASKKVYADKLISIIDPNKKLVRHRLF-REHCMLVQGNYIKDLTILGRDLTKT 367
Query: 188 VILDDTESVWSDHTENLIVL 207
+I+D++ +S H +N I +
Sbjct: 368 IIVDNSPQAFSYHMDNGIPI 387
>gi|440493707|gb|ELQ76143.1| TFIIF-interacting CTD phosphatase, including NLI-interacting factor
[Trachipleistophora hominis]
Length = 466
Score = 45.8 bits (107), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 28/95 (29%), Positives = 51/95 (53%), Gaps = 2/95 (2%)
Query: 117 LRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKN 176
LRP + FL +AS L +++ TM T Y ++D D +F RI+ R+D + ++
Sbjct: 186 LRPHLHQFLTEASKLFHMHIYTMGTAEYVHQITNVIDKDGMFFGDRIVTRDD-EMQVKRL 244
Query: 177 PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYV 211
L + +VI+DD VW ++ NL+++ ++
Sbjct: 245 ERLFGDKVDMVVIVDDRGDVW-EYCGNLVMVRPFL 278
>gi|55740293|gb|AAV63948.1| putative nuclear LIM interactor-interacting protein [Phytophthora
sojae]
gi|348665891|gb|EGZ05719.1| hypothetical protein PHYSODRAFT_551168 [Phytophthora sojae]
Length = 237
Score = 45.8 bits (107), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 38/147 (25%), Positives = 66/147 (44%), Gaps = 8/147 (5%)
Query: 57 RGLRYSEQEERKLQLVLNLDHTLLHCRNIKSLSSGE-KYLKKQIHSFIGSLFQMAND--- 112
RG + ++ LVL++D L+H + + + +Y +Q+ + G F++ D
Sbjct: 29 RGAAHVRAPSERIALVLDMDECLVHSKFQNEVEYRQSEYRPEQLEEY-GDSFEIVMDDGE 87
Query: 113 -KLVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAR--EDF 169
+V RP + FLE+A+ D+Y+ T Y + + LD F+ R +
Sbjct: 88 RAVVNKRPGLDRFLEEAAKHYDVYVFTAGLEAYGKPILDALDPKGNLFAGRFFRESCQQR 147
Query: 170 NGKDRKNPDLVRGQERGIVILDDTESV 196
G K+ +VRG + VIL D V
Sbjct: 148 KGMFLKDLSVVRGGDLSRVILVDNNPV 174
>gi|67624693|ref|XP_668629.1| ENSANGP00000011443 [Cryptosporidium hominis TU502]
gi|54659821|gb|EAL38383.1| ENSANGP00000011443 [Cryptosporidium hominis]
Length = 392
Score = 45.8 bits (107), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 34/92 (36%), Positives = 49/92 (53%), Gaps = 9/92 (9%)
Query: 63 EQE-ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFV 121
EQE L +VL++D TL+HC N + L + L +I ++ F V RPF+
Sbjct: 198 EQEVSSGLFIVLDMDETLVHCTN-EMLKGVKPDLLVKIATYSTPWF-------VYYRPFL 249
Query: 122 RTFLEQASSLVDIYLCTMSTRCYAEAAVKLLD 153
+ FL+ AS L I + T STR YAE + +D
Sbjct: 250 KFFLQNASKLGSICVFTASTREYAEQVINSID 281
>gi|393247111|gb|EJD54619.1| NLI interacting factor [Auricularia delicata TFB-10046 SS5]
Length = 182
Score = 45.8 bits (107), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 42/142 (29%), Positives = 69/142 (48%), Gaps = 11/142 (7%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
LVL+LD TL+H + K + + + I Q+ N +VK RP V TFLE+
Sbjct: 17 LVLDLDETLVHS-SFKMIPQADYIIPVLIEH------QLHNVYVVK-RPGVDTFLEKMGE 68
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVR-GQE-RGIV 188
L ++ + T S YA+ + LD+ K S R+ +N K DL + G+ G +
Sbjct: 69 LYEVVVFTASLSMYADPVLDKLDI-HKAVSHRLFREHCYNHKGVYVKDLSQLGRPIEGTI 127
Query: 189 ILDDTESVWSDHTENLIVLGKY 210
ILD++ + + H N + + +
Sbjct: 128 ILDNSPASYIFHPNNAVPVSSW 149
>gi|66357454|ref|XP_625905.1| possible NLI interacting factor CTD-like phosphatase
[Cryptosporidium parvum Iowa II]
gi|46226829|gb|EAK87795.1| possible NLI interacting factor CTD-like phosphatase
[Cryptosporidium parvum Iowa II]
Length = 392
Score = 45.8 bits (107), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 34/92 (36%), Positives = 49/92 (53%), Gaps = 9/92 (9%)
Query: 63 EQE-ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFV 121
EQE L +VL++D TL+HC N + L + L +I ++ F V RPF+
Sbjct: 198 EQEVSSGLFIVLDMDETLVHCTN-EMLKGVKPDLLVKIATYSTPWF-------VYYRPFL 249
Query: 122 RTFLEQASSLVDIYLCTMSTRCYAEAAVKLLD 153
+ FL+ AS L I + T STR YAE + +D
Sbjct: 250 KFFLQNASKLGSICVFTASTREYAEQVINSID 281
>gi|426201370|gb|EKV51293.1| hypothetical protein AGABI2DRAFT_114027 [Agaricus bisporus var.
bisporus H97]
Length = 814
Score = 45.8 bits (107), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 23/59 (38%), Positives = 34/59 (57%), Gaps = 1/59 (1%)
Query: 115 VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKD 173
+K RP + FL ++ D+++ TM TR YAE +D D F SRI++R D +G D
Sbjct: 270 IKPRPGWKEFLMDMATKYDMHVYTMGTRAYAEEVCAAIDPDGSVFKSRILSR-DESGND 327
>gi|405966502|gb|EKC31780.1| CTD small phosphatase-like protein 2 [Crassostrea gigas]
Length = 402
Score = 45.4 bits (106), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 46/153 (30%), Positives = 74/153 (48%), Gaps = 15/153 (9%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKL-VKLRPFVRTFLEQAS 129
LVL+LD TL+HC SL+ L+ +F LF+ K+ V+ RP R FLE S
Sbjct: 227 LVLDLDETLVHC----SLTE----LEDAAFTF-PVLFEDVTYKVFVRTRPHFREFLETVS 277
Query: 130 SLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED---FNGKDRKNPDLVRGQERG 186
+ ++ L T S + YA+ V +LD + R+ RE NG K+ ++
Sbjct: 278 EMFEVILFTASKKVYADKLVNILDPQKQLIKHRLF-REHCVCINGNYIKDLTILGRDLSR 336
Query: 187 IVILDDTESVWSDHTENLIVLGK-YVYFRDKEL 218
+I+D++ + +N I + +V D+EL
Sbjct: 337 TIIVDNSPQAFGYQLDNGIPIESWFVDKNDREL 369
>gi|156096809|ref|XP_001614438.1| hypothetical protein [Plasmodium vivax Sal-1]
gi|148803312|gb|EDL44711.1| hypothetical protein, conserved [Plasmodium vivax]
Length = 1467
Score = 45.4 bits (106), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 34/99 (34%), Positives = 55/99 (55%), Gaps = 3/99 (3%)
Query: 116 KLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED-FNGKDR 174
KLRP V FL++ + +IYL TM T +A++ + LLD +F +R+ +R+D NG
Sbjct: 541 KLRPGVIQFLQKMNKKYEIYLYTMGTLEHAKSCLLLLDPLKNFFGNRVFSRKDSVNGLKH 600
Query: 175 KNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
N L + + I DD++ +W + + + V G Y YF
Sbjct: 601 LNRILPTYRSVSLCI-DDSDYMWKESSSCIKVHG-YNYF 637
>gi|340507950|gb|EGR33782.1| NLI interacting factor-like phosphatase family protein, putative
[Ichthyophthirius multifiliis]
Length = 226
Score = 45.4 bits (106), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 52/232 (22%), Positives = 98/232 (42%), Gaps = 34/232 (14%)
Query: 61 YSEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPF 120
Y ++ R+ LV +LD TL+HC + S + I G + + + +RP+
Sbjct: 23 YEIKKNRQKTLVFDLDETLIHCNENVQIPSD---VVLPIKFPTGEIIEAG----INIRPY 75
Query: 121 VRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLV 180
L++ S +I + T S CYA + LD +Y S R + RE+
Sbjct: 76 CYECLQELSKYYEIVVFTASHSCYANVVLDYLDPKGQYISYR-LYREN-----------C 123
Query: 181 RGQERGIVILDDTESVWSDHTENLIVLGKYVYFRDKELNGDHKSYSETLTDESENEEALA 240
E G+ I D + + + +++++ Y ++N +++N+ L
Sbjct: 124 VTTEEGVYI-KDLRVLQNRNMSDIVLVDNAAYSFGFQINN---GIPVIPFYDNKNDNELK 179
Query: 241 NVLRVLKTIHRLFFDSVCGDVRTYLPKVR-----SEFSRDVLYFSAIFRDCL 287
N++ +K+IH++ D R L KV SEF + S +F++ +
Sbjct: 180 NLINFMKSIHQV------KDFRDTLKKVLKINQFSEFQDPEMLLSTLFQELI 225
>gi|219109563|ref|XP_002176536.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217411071|gb|EEC50999.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 809
Score = 45.4 bits (106), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 50/102 (49%), Gaps = 16/102 (15%)
Query: 64 QEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQ--IHSFIGSLFQMANDK-------- 113
Q+ +KL LVL+LDHTL+H N + +++ K + + + I + + +
Sbjct: 253 QKRKKLSLVLDLDHTLVHATND---TRAQQFCKSRDDVRTLILPMLRPNGEPRQPQHPEW 309
Query: 114 ---LVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLL 152
VK+RP V FL +A +I + T TR YAE LL
Sbjct: 310 TQHFVKMRPHVEVFLNEAQDQYEIGVYTAGTRDYAEQICILL 351
Score = 37.4 bits (85), Expect = 8.4, Method: Compositional matrix adjust.
Identities = 41/183 (22%), Positives = 78/183 (42%), Gaps = 23/183 (12%)
Query: 150 KLLDLDSKYFSSRIIAREDFNGKDRKNPDLVRGQERG---IVILDDTESVWSD------- 199
K+L+L + F SRI++R D + L R G V++DD E VW++
Sbjct: 510 KVLELRQRLFGSRIVSRTDVRDLGQNVKSLKRIFPCGGIMAVVMDDREDVWANAADILTV 569
Query: 200 ----HTENLIVLGKYVY-----FRDKELNGDHKSYSETLTDESENEEALANVLRVLKTIH 250
+NL+++ Y + F D E+ + E +E L L +L+ +H
Sbjct: 570 RKGEPPDNLLLVRPYHWSSFLGFADVNNASGADLSGESEAGDVETDEQLLWSLDILQRVH 629
Query: 251 RLFFD---SVCGDVRTYLPKVRSEFSRDVLYFSA-IFRDCLWAEQEEKFLVQEKKFLVHP 306
R F++ S G + +P + + + L+ + +F + ++++ L K + P
Sbjct: 630 RRFYESDGSFLGALTQTVPDIVKQLRAETLHGAHLVFSGMVPLHRQQQQLESGDKVVPRP 689
Query: 307 RWI 309
I
Sbjct: 690 TVI 692
>gi|297597243|ref|NP_001043640.2| Os01g0629400 [Oryza sativa Japonica Group]
gi|255673485|dbj|BAF05554.2| Os01g0629400, partial [Oryza sativa Japonica Group]
Length = 177
Score = 45.4 bits (106), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 32/108 (29%), Positives = 53/108 (49%), Gaps = 3/108 (2%)
Query: 106 LFQMANDKL-VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRII 164
F M N + V+ RP ++ FLE+ + + D+ + T S R YAE + LD D + S RI
Sbjct: 30 FFNMKNHTVYVRQRPHLKMFLEKVAQMFDLVIFTASQRIYAEQLIDRLDPDGRLISHRIY 89
Query: 165 AREDFNGKDRKNPDL-VRGQERG-IVILDDTESVWSDHTENLIVLGKY 210
+ DL + G + +VI+D+T V+ +N I + +
Sbjct: 90 RESCIFSEGCYTKDLTILGVDLAKVVIVDNTPQVFQLQVDNGIPIKSW 137
>gi|145539710|ref|XP_001455545.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124423353|emb|CAK88148.1| unnamed protein product [Paramecium tetraurelia]
Length = 432
Score = 45.4 bits (106), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 66/249 (26%), Positives = 103/249 (41%), Gaps = 47/249 (18%)
Query: 6 CKECVGKTKFVIKRKCEQSLSCAHTTVRDS-------RCIFCSQAMNDSFGLSFDYMLRG 58
K+ V + EQS + +D IF +F RG
Sbjct: 159 TKQAVSMQNLNVNSDNEQSKKNSQNNAKDKLSNHPFRHLIFGPTINEQTFKKHLILTQRG 218
Query: 59 LRYSEQ----------EERKLQL-----------VLNLDHTLLHCRNIKSLSSGEKYLKK 97
L Y+ + + +K+QL VL+LD TL+H S S E
Sbjct: 219 LIYARKCLKGPSDKFIQSKKIQLSEANPKKDKTLVLDLDETLIH-----SCSQREN---P 270
Query: 98 QIH-SFIGSLFQMANDKL-VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLD 155
Q++ + +G + A K+ + +RP+ FL+Q S IY+ T S+ YA A + LD
Sbjct: 271 QVYVTAVGDFGEEA--KIGINIRPYTTLFLQQLSQHYTIYIYTASSSAYALAIINYLDPT 328
Query: 156 SKYFSSRIIAR----EDFNGKDRKNPDLVRGQE-RGIVILDDTESVWSDHTENLI-VLGK 209
+Y S I+ R E NG K+ L+ +E + I+I+D+ + EN I +L
Sbjct: 329 KQYISG-IMTRNNCMETKNGFFIKDLRLIGNKELKDILIVDNLAHSFGFQIENGIPILEW 387
Query: 210 YVYFRDKEL 218
Y D+EL
Sbjct: 388 YCDQNDQEL 396
>gi|145533993|ref|XP_001452741.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124420440|emb|CAK85344.1| unnamed protein product [Paramecium tetraurelia]
Length = 425
Score = 45.4 bits (106), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 42/144 (29%), Positives = 73/144 (50%), Gaps = 18/144 (12%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIH-SFIGSLFQMANDKL-VKLRPFVRTFLEQA 128
L+L+LD TL+H S + Q++ + +G + A K+ + +RP+ FL+Q
Sbjct: 245 LILDLDETLIH--------SCTQRENPQVYVTAVGDFGEEA--KIGINIRPYTSLFLQQL 294
Query: 129 SSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAR----EDFNGKDRKNPDLVRGQE 184
S IY+ T S+ YA+A ++ LD +Y S I+ R E NG K+ L+ +E
Sbjct: 295 SQYYTIYIYTASSSAYAQAIIQYLDPTKQYISG-IMTRNNCMETKNGFFIKDLRLISNKE 353
Query: 185 -RGIVILDDTESVWSDHTENLIVL 207
+ ++I+D+ + EN I +
Sbjct: 354 LKDMLIVDNLAHSFGFQIENGIPI 377
>gi|449668337|ref|XP_002155392.2| PREDICTED: CTD small phosphatase-like protein-like [Hydra
magnipapillata]
Length = 311
Score = 45.1 bits (105), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 46/191 (24%), Positives = 88/191 (46%), Gaps = 15/191 (7%)
Query: 54 YMLRGLRYSEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDK 113
Y+L L + Q++ K +V++LD TL+H + K + + + + +I + ++ +
Sbjct: 115 YLLPAL--TRQDQNKKCVVIDLDETLVH-SSFKPVENADFIVPVEIDGIVHQVYVLK--- 168
Query: 114 LVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED---FN 170
RPFV FL++ L + L T S YA+ LLD + F SR+ RE +
Sbjct: 169 ----RPFVDKFLKRMGELFECVLFTASLAKYADPVADLLD-KTTCFRSRLF-RESCVYYK 222
Query: 171 GKDRKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRDKELNGDHKSYSETLT 230
G K+ + ++I+D++ + + H EN + + + +D D + E+++
Sbjct: 223 GNYVKDLSKLGRDLHNVIIIDNSPASYIFHPENAVPVTSWFDDQDDTELMDLIPFLESIS 282
Query: 231 DESENEEALAN 241
AL N
Sbjct: 283 SAESCVTALQN 293
>gi|330936653|ref|XP_003305476.1| hypothetical protein PTT_18329 [Pyrenophora teres f. teres 0-1]
gi|311317492|gb|EFQ86437.1| hypothetical protein PTT_18329 [Pyrenophora teres f. teres 0-1]
Length = 464
Score = 45.1 bits (105), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 37/158 (23%), Positives = 79/158 (50%), Gaps = 27/158 (17%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKY-----LKKQIHSFIGSLFQMANDKL-----VKLRPF 120
L+++LD TL+H S+ +G ++ ++ ++ + +G+ Q+ ++ V RP+
Sbjct: 279 LIIDLDETLIH-----SIVNGGRFQTGHMVEVKLQASVGAGGQVIGPQVPLLYYVHKRPY 333
Query: 121 VRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI-----IAREDFNGKD-- 173
FL++ S ++ + T S + YA+ + L+++ KYF+ R R KD
Sbjct: 334 CDDFLKKVSKWYNLIIFTASVQEYADPVIDWLEVERKYFAGRYYRQHCTVRNGAYIKDLA 393
Query: 174 RKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYV 211
+ PDL + ++ILD++ + H +N I + ++
Sbjct: 394 QVEPDLSK-----VMILDNSPLSYGFHPDNAIPIEGWI 426
>gi|300175820|emb|CBK21816.2| unnamed protein product [Blastocystis hominis]
Length = 266
Score = 45.1 bits (105), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 48/191 (25%), Positives = 88/191 (46%), Gaps = 14/191 (7%)
Query: 63 EQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVR 122
E+ + LVL+LD TL+HC +Y++ + + + + + ++RP+
Sbjct: 80 ERGSKPFTLVLDLDETLVHC--------SLEYMENCHYCYHIIVDGVKHAVFARVRPYAN 131
Query: 123 TFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVR- 181
FLE S +I + T S + YA+ + LD + K+ R+ DL R
Sbjct: 132 QFLEYCSRFCEIVVFTASKQEYADRMLDFLDPEKKFIKHRLFRESCTKIGKVYVKDLNRL 191
Query: 182 GQE-RGIVILDDTESVWSDHTENLIVLGKYV-YFRDKEL-NGDHKSYSETLTDESENEEA 238
G++ R VI+D++ + H +N I + + ++D+EL N YS L +
Sbjct: 192 GRDLRRTVIIDNSIVSFGYHLDNGIPICSWFDNWKDQELYNAARIMYS--LQAVQDVRPY 249
Query: 239 LANVLRVLKTI 249
+ N+ R+ +TI
Sbjct: 250 ITNMFRLRETI 260
>gi|224002358|ref|XP_002290851.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220974273|gb|EED92603.1| predicted protein, partial [Thalassiosira pseudonana CCMP1335]
Length = 196
Score = 45.1 bits (105), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 26/93 (27%), Positives = 48/93 (51%), Gaps = 8/93 (8%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
LVL+LD TL+HC ++ +S + + + M V+ RPF+ FLE+ S
Sbjct: 21 LVLDLDETLVHC-TVEPVSDADMIFPVEFNG-------MEYTVHVRCRPFLTEFLEKVSE 72
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
++ + T S + YA+ + ++D + K+ R+
Sbjct: 73 DFEVVVFTASQQVYADKLLDMIDPEGKFIKHRM 105
>gi|67463585|ref|XP_648443.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
gi|56464600|gb|EAL43056.1| hypothetical protein EHI_121510 [Entamoeba histolytica HM-1:IMSS]
gi|449705880|gb|EMD45836.1| RNA polymerase II ctd phosphatase, putative [Entamoeba histolytica
KU27]
Length = 428
Score = 45.1 bits (105), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 66/280 (23%), Positives = 115/280 (41%), Gaps = 64/280 (22%)
Query: 27 CAHTTVRDSR-CIFCSQAMND---------SFGLSFDY---MLRGLRYSEQEERKLQLVL 73
C H + D C+ C Q + D +G++ Y R + +E+KL L+L
Sbjct: 7 CPHNKINDQNYCVDCYQLIEDVDDYIRTSGGYGITKSYAEEQKRSVSEKLLKEKKLSLIL 66
Query: 74 NLDHTLLHCRN--IKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASSL 131
+LD T++ L S E+ + + F + + ++ R + TF+E+ S L
Sbjct: 67 DLDGTIVFTNPELCIPLESEEEPITPE-QGFYFEIPEQNAKVFIRFRDGIVTFMEKVSKL 125
Query: 132 VDIYLCTMSTRCYAEAAVKLLD-----------------------LDSKYFSSRIIARED 168
DI++ T+ + YA A V ++ D K + +I RE+
Sbjct: 126 YDIHVVTLGQKEYAFAIVNAINKLRNIPFITGDLVTAEDCSSVIVCDEKDTNDGLIDREE 185
Query: 169 FNGK---DRKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRDKELNGDHKSY 225
N + R P + G+E VI+DD VW + +N++ + +YV
Sbjct: 186 TNERRSVKRSIPTM--GKEEMQVIVDDRIDVWDN--KNVVQICEYV-------------- 227
Query: 226 SETLTDESENEEALANVLRVLKTIHRLFFDSVCGDVRTYL 265
T++ + E L V VL+ I+ F+D DV+ L
Sbjct: 228 --PSTNQVDTE--LVRVTEVLQNIYTKFYDEHIEDVKEIL 263
>gi|357487783|ref|XP_003614179.1| CTD small phosphatase-like protein [Medicago truncatula]
gi|355515514|gb|AES97137.1| CTD small phosphatase-like protein [Medicago truncatula]
Length = 306
Score = 45.1 bits (105), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 41/153 (26%), Positives = 69/153 (45%), Gaps = 15/153 (9%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIH--SFIGSLFQMANDKLVKLRPFVRTFLEQA 128
LVL LD TL+H +K K+ H +F S + D V+ RP ++ FL++
Sbjct: 127 LVLGLDGTLVHSTLVKP---------KEDHDLTFTVSFNSVKEDVYVRYRPHLKEFLDEV 177
Query: 129 SSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDL-VRGQERG- 186
S + +I + T R YA+ + LD K F R+ N ++ DL + G++
Sbjct: 178 SGIFEIIVFTAGQRIYADKLLNKLDPSRKIFRHRLFRESCVNVDEKYVKDLSILGRDLAR 237
Query: 187 IVILDDTESVWSDHTENLIVLGKYVYFRDKELN 219
+ ++D + + EN I + +F D N
Sbjct: 238 VTMIDSSPHSFGFQVENGIPI--ETWFADPSDN 268
>gi|302422178|ref|XP_003008919.1| nuclear envelope morphology protein [Verticillium albo-atrum
VaMs.102]
gi|261352065|gb|EEY14493.1| nuclear envelope morphology protein [Verticillium albo-atrum
VaMs.102]
Length = 381
Score = 45.1 bits (105), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 43/165 (26%), Positives = 79/165 (47%), Gaps = 21/165 (12%)
Query: 63 EQEERKLQ--LVLNLDHTLLHCRNIKS-LSSGEKYLKKQIHSFIGSLFQMANDK------ 113
EQ +RK Q L+L+LD TL+H + +S+G + +++G+ Q +
Sbjct: 193 EQTDRKHQKTLILDLDETLIHSMSKGGRMSTGHMVEVRLNQTYVGAGGQTSLGPQHPILY 252
Query: 114 LVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI-----IARED 168
V RP+ FL + ++ + T S + YA+ + L+ + K+FS+R R+
Sbjct: 253 WVNKRPYCDDFLRRICKWYNLVVFTASVQEYADPVIDWLESERKFFSARYYRQHCTFRQG 312
Query: 169 FNGKDRKN--PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYV 211
KD + PDL R ++ILD++ + H +N I + ++
Sbjct: 313 AFIKDLSSVEPDLSR-----VMILDNSPLSYMFHQDNAIPIQGWI 352
>gi|145513758|ref|XP_001442790.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124410143|emb|CAK75393.1| unnamed protein product [Paramecium tetraurelia]
Length = 423
Score = 45.1 bits (105), Expect = 0.044, Method: Compositional matrix adjust.
Identities = 39/166 (23%), Positives = 77/166 (46%), Gaps = 19/166 (11%)
Query: 63 EQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKL----VKLR 118
+Q +++ LVL+LD TL+HC + Q+ QM N ++ + +R
Sbjct: 226 QQIKKQKTLVLDLDETLIHCNE-----------QPQMKFDFKVPIQMPNGQIHEAGISVR 274
Query: 119 PFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAR---EDFNGKDRK 175
PF + FL++ S ++ + T S YA+ + LD K+ + R+ + G K
Sbjct: 275 PFAQQFLQECSKHFEVMIFTASHPLYADKIIDKLDPTKKWVTCRLYREHCIQTQQGIYVK 334
Query: 176 NPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYV-YFRDKELNG 220
+ ++ + +V++D+ ++ +N I + Y+ +D EL G
Sbjct: 335 DLRILNRNLKDVVLIDNAAYSFAYQIDNGIPIIPYIDNAKDNELIG 380
>gi|428671109|gb|EKX72028.1| conserved hypothetical protein [Babesia equi]
Length = 267
Score = 45.1 bits (105), Expect = 0.047, Method: Compositional matrix adjust.
Identities = 45/163 (27%), Positives = 75/163 (46%), Gaps = 24/163 (14%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQ--MANDKLVKLRPFVRTF 124
+K LVL+LD TL+H S E Y S+ L Q + D V RPFV F
Sbjct: 76 KKKTLVLDLDETLIHS----SFDGIENY------SYSVQLLQDGIKRDVFVAKRPFVDEF 125
Query: 125 LEQASSLVDIYLCTMSTRCYAEAAVKLLDLDS----KYFSSRIIAREDFNGKDRKNPDLV 180
L Q S L ++ + T YA + +LD + +YF + ++G K+ +V
Sbjct: 126 LLQVSRLFEVVIFTAGISSYANPVIDVLDTNKVCKRRYFRDSCLF---YSGYYIKDLTIV 182
Query: 181 RGQERGIVILDDTESVWSDHTENLIVLGKYVYFRDKELNGDHK 223
+ + +VI+D++ + + N + + +F D+E DH+
Sbjct: 183 QKSLKDVVIIDNSPPCYCLNPNNAVPIES--WFDDEE---DHE 220
>gi|47220514|emb|CAG05540.1| unnamed protein product [Tetraodon nigroviridis]
Length = 473
Score = 45.1 bits (105), Expect = 0.047, Method: Compositional matrix adjust.
Identities = 32/83 (38%), Positives = 45/83 (54%), Gaps = 8/83 (9%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
LVL+LD TL+HC SL+ E F ++Q+ V+LRPF R FLE+ S
Sbjct: 297 LVLDLDETLVHC----SLNELEDAALTFPVLFQDVIYQV----YVRLRPFFREFLERMSQ 348
Query: 131 LVDIYLCTMSTRCYAEAAVKLLD 153
+I L T S + YA+ + +LD
Sbjct: 349 KYEIILFTASKKVYADKLLNILD 371
>gi|281210104|gb|EFA84272.1| CTD small phosphatase-like protein 2 [Polysphondylium pallidum
PN500]
Length = 539
Score = 45.1 bits (105), Expect = 0.047, Method: Compositional matrix adjust.
Identities = 29/95 (30%), Positives = 47/95 (49%), Gaps = 8/95 (8%)
Query: 59 LRYSEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLR 118
L +++ K+ LVL+LD TL+HC S E + + +F + + K R
Sbjct: 353 LPPKDEQTPKISLVLDLDETLVHC-------STEPIDEPDL-TFFVTFNNVEYKVFAKKR 404
Query: 119 PFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLD 153
PF FL +ASSL ++ + T S YA + ++D
Sbjct: 405 PFFEDFLSKASSLFELIIFTASQEVYANKLLNMID 439
>gi|260789874|ref|XP_002589969.1| hypothetical protein BRAFLDRAFT_224775 [Branchiostoma floridae]
gi|229275156|gb|EEN45980.1| hypothetical protein BRAFLDRAFT_224775 [Branchiostoma floridae]
Length = 232
Score = 45.1 bits (105), Expect = 0.047, Method: Compositional matrix adjust.
Identities = 32/87 (36%), Positives = 47/87 (54%), Gaps = 10/87 (11%)
Query: 68 KLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKL-VKLRPFVRTFLE 126
+ LVL+LD TL+HC SL+ E + LFQ ++ V+ RP+ R FLE
Sbjct: 53 EFSLVLDLDETLVHC----SLNELE-----DANLTFPVLFQDVTYQVYVRTRPYYREFLE 103
Query: 127 QASSLVDIYLCTMSTRCYAEAAVKLLD 153
+ S L +I L T S + YA+ + +LD
Sbjct: 104 RMSKLYEIILFTASKKVYADKLMNILD 130
>gi|189196298|ref|XP_001934487.1| NIF domain containing protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187980366|gb|EDU46992.1| NIF domain containing protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 451
Score = 45.1 bits (105), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 37/158 (23%), Positives = 79/158 (50%), Gaps = 27/158 (17%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKY-----LKKQIHSFIGSLFQMANDKL-----VKLRPF 120
L+++LD TL+H S+ +G ++ ++ ++ + +G+ Q+ ++ V RP+
Sbjct: 279 LIIDLDETLIH-----SIVNGGRFQTGHMVEVKLQASVGAGGQVIGPQVPLLYYVHKRPY 333
Query: 121 VRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI-----IAREDFNGKD-- 173
FL++ S ++ + T S + YA+ + L+++ KYF+ R R KD
Sbjct: 334 CDDFLKKVSKWYNLIIFTASVQEYADPVIDWLEVERKYFAGRYYRQHCTVRNGAYIKDLA 393
Query: 174 RKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYV 211
+ PDL + ++ILD++ + H +N I + ++
Sbjct: 394 QVEPDLSK-----VMILDNSPLSYGFHPDNAIPIEGWI 426
>gi|167376104|ref|XP_001733861.1| carboxy-terminal domain RNA polymerase II polypeptide A small
phosphatase [Entamoeba dispar SAW760]
gi|165904880|gb|EDR30013.1| carboxy-terminal domain RNA polymerase II polypeptide A small
phosphatase, putative [Entamoeba dispar SAW760]
Length = 208
Score = 45.1 bits (105), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 42/147 (28%), Positives = 67/147 (45%), Gaps = 17/147 (11%)
Query: 68 KLQLVLNLDHTLLHCR-NIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLE 126
+L +V +LD TL+H N +SLS ++ Q + V +RP R L+
Sbjct: 42 RLTIVFDLDETLVHTHVNTQSLSDDLITVELQGKQY-----------FVSVRPGARELLK 90
Query: 127 QASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRII---AREDFNGKDRKNPDLVRGQ 183
++ L T ST YA V L+ D + F ++ +E F + L R
Sbjct: 91 NLVGKYELILFTASTESYANQIVNDLERDGQIFDYKLYCHNCKEKFGQLFKDAHKLGRDL 150
Query: 184 ERGIVILDDTESVWSDHTENLIVLGKY 210
+R ++I DD+ VW+ +ENL V +Y
Sbjct: 151 DR-VIIFDDSTIVWTT-SENLFVCKRY 175
>gi|169603884|ref|XP_001795363.1| hypothetical protein SNOG_04951 [Phaeosphaeria nodorum SN15]
gi|160706473|gb|EAT87342.2| hypothetical protein SNOG_04951 [Phaeosphaeria nodorum SN15]
Length = 479
Score = 45.1 bits (105), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 40/158 (25%), Positives = 81/158 (51%), Gaps = 27/158 (17%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKY-----LKKQIHSFIGSLFQMANDKL-----VKLRPF 120
L+++LD TL+H S+S G ++ ++ ++ + +G+ Q+ ++ V RP+
Sbjct: 295 LIIDLDETLIH-----SMSKGGRFQTGRMVEVKLQASVGAGGQIIGPQVPILYYVHKRPY 349
Query: 121 VRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE-DF-NG---KD-- 173
FL++ S ++ + T S + YA+ + L+++ KYF R + F NG KD
Sbjct: 350 CDDFLKKVSKWYNLVIFTASVQEYADPVIDWLEVERKYFVGRYYRQHCTFRNGAYIKDLA 409
Query: 174 RKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYV 211
+ PDL + ++ILD++ + H +N I + ++
Sbjct: 410 QVEPDLSK-----VMILDNSPLSYIFHPDNAIPIEGWI 442
>gi|291239709|ref|XP_002739764.1| PREDICTED: CTD small phosphatase-like protein 2-like [Saccoglossus
kowalevskii]
Length = 526
Score = 45.1 bits (105), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 29/93 (31%), Positives = 42/93 (45%), Gaps = 8/93 (8%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
LVL+LD TL+HC ++ L +F V+ RP+ + FLE S
Sbjct: 350 LVLDLDETLVHC-SLNELDDANLTFPVVFQDITYQVF-------VRTRPYFKEFLEAVSQ 401
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
++ L T S + YA+ LLD KY R+
Sbjct: 402 QFEVILFTASKKVYADKLFNLLDPQKKYVKYRL 434
>gi|145514934|ref|XP_001443372.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124410750|emb|CAK75975.1| unnamed protein product [Paramecium tetraurelia]
Length = 401
Score = 45.1 bits (105), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 48/169 (28%), Positives = 80/169 (47%), Gaps = 23/169 (13%)
Query: 59 LRYSEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKL---V 115
LR S Q + K L+L+LD TL+H + + Q DK+
Sbjct: 211 LRESNQRKPKF-LILDLDETLIHSCTFRDSPQ------------VTITLQDDEDKVDLFF 257
Query: 116 KLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAR----EDFNG 171
+RPF + FL + S+ +IY+ T S+ YA A V LD + +Y + ++ R E NG
Sbjct: 258 NVRPFCKEFLREMSNYYNIYIFTASSELYANAIVNHLDPNRQYIND-VLCRNNCFETKNG 316
Query: 172 KDRKNPDLVRGQE-RGIVILDDTESVWSDHTENLIVLGKYV-YFRDKEL 218
K+ ++ + + IVI+D+ + EN I + +Y+ +D+EL
Sbjct: 317 FFIKDLRIITNRHLKDIVIVDNLPHSFGLQLENGIPILEYLCNPKDEEL 365
>gi|145497555|ref|XP_001434766.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124401894|emb|CAK67369.1| unnamed protein product [Paramecium tetraurelia]
Length = 249
Score = 45.1 bits (105), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 53/200 (26%), Positives = 94/200 (47%), Gaps = 21/200 (10%)
Query: 62 SEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFV 121
+++ E++ LVL+LD TL I+S +L ++I IG+ + VK+RPF
Sbjct: 65 AKETEKEFTLVLDLDETL-----IRSEMERTSFLDEEIIVKIGNTIEKY---YVKIRPFA 116
Query: 122 RTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDR---KNPD 178
R FL+ S ++ + T + + YA+ + LD F R R+ KD K+
Sbjct: 117 RDFLKALSKYFELVIFTAALKEYADKVIDYLDPSG--FIKRRFYRDSCTKKDGVFYKDLT 174
Query: 179 LVRGQERGIVILDDTESVWSDHTEN-LIVLGKYVYFRDKELNGDHKSYSETLTDESENEE 237
V I+D++ S S + +N L++ Y +D+EL K Y L +N +
Sbjct: 175 KVNSNLEKTFIIDNSLSGMSLNPQNGLLIKSWYDDLKDQEL----KIYDAML---KKNVK 227
Query: 238 ALANVLRVLKTIHRLFFDSV 257
N+++ +K + R + +V
Sbjct: 228 PKENIVQCIKQMKRKYPKNV 247
>gi|297597322|ref|NP_001043795.2| Os01g0665300 [Oryza sativa Japonica Group]
gi|55773815|dbj|BAD72353.1| Chain A, Three-Dimensional Structure Of A Rna-Polymerase Ii Binding
Protein With Associated Ligand-like [Oryza sativa
Japonica Group]
gi|125571492|gb|EAZ13007.1| hypothetical protein OsJ_02926 [Oryza sativa Japonica Group]
gi|255673527|dbj|BAF05709.2| Os01g0665300 [Oryza sativa Japonica Group]
Length = 439
Score = 44.7 bits (104), Expect = 0.053, Method: Compositional matrix adjust.
Identities = 43/153 (28%), Positives = 73/153 (47%), Gaps = 16/153 (10%)
Query: 63 EQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLR--PF 120
EQ RK+ LVL+LD TL+H S+ E+ + F +F + +V +R P
Sbjct: 254 EQGARKVTLVLDLDETLVH-------STTEQC---DDYDFTFPVFFDMKEHMVYVRKRPH 303
Query: 121 VRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED---FNGKDRKNP 177
+ FL++ + + ++ + T S YA+ + +LD + K FS R RE N K+
Sbjct: 304 LHMFLQKMAEMFEVVIFTASQSVYADQLLDILDPEKKLFSRRYF-RESCVFTNTSYTKDL 362
Query: 178 DLVRGQERGIVILDDTESVWSDHTENLIVLGKY 210
+V +VI+D+T V+ N I + +
Sbjct: 363 TVVGVDLAKVVIIDNTPQVFQLQVNNGIPIESW 395
>gi|125527169|gb|EAY75283.1| hypothetical protein OsI_03170 [Oryza sativa Indica Group]
Length = 507
Score = 44.7 bits (104), Expect = 0.053, Method: Compositional matrix adjust.
Identities = 43/153 (28%), Positives = 73/153 (47%), Gaps = 16/153 (10%)
Query: 63 EQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLR--PF 120
EQ RK+ LVL+LD TL+H S+ E+ + F +F + +V +R P
Sbjct: 322 EQGARKVTLVLDLDETLVH-------STTEQC---DDYDFTFPVFFDLKEHMVYVRKRPH 371
Query: 121 VRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED---FNGKDRKNP 177
+ FL++ + + ++ + T S YA+ + +LD + K FS R RE N K+
Sbjct: 372 LHMFLQKMAEMFEVVIFTASQSVYADQLLDILDPEKKLFSRRYF-RESCVFTNTSYTKDL 430
Query: 178 DLVRGQERGIVILDDTESVWSDHTENLIVLGKY 210
+V +VI+D+T V+ N I + +
Sbjct: 431 TVVGVDLAKVVIIDNTPQVFQLQVNNGIPIESW 463
>gi|357610246|gb|EHJ66893.1| hypothetical protein KGM_16951 [Danaus plexippus]
Length = 673
Score = 44.7 bits (104), Expect = 0.056, Method: Compositional matrix adjust.
Identities = 44/152 (28%), Positives = 70/152 (46%), Gaps = 13/152 (8%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
LVL+LD TL+HC +++ L + ++F V+ RP FL + S
Sbjct: 498 LVLDLDETLVHC-SLQELPDASFHFPVLFQDCRYTVF-------VRTRPHFAEFLSKVSR 549
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED---FNGKDRKNPDLVRGQERGI 187
L ++ L T S R YA+ + LLD ++ R+ RE NG K+ ++ R
Sbjct: 550 LYEVILFTASKRVYADRLLNLLDPARRWIKYRLF-REHCLLVNGNYVKDLSILGRDLRRT 608
Query: 188 VILDDTESVWSDHTENLIVLGKYVYFR-DKEL 218
VI+D++ + EN I + + R D EL
Sbjct: 609 VIVDNSPQAFGYQLENGIPIDSWFVDRSDNEL 640
>gi|189237962|ref|XP_001811853.1| PREDICTED: similar to CG5830 CG5830-PA [Tribolium castaneum]
gi|270006659|gb|EFA03107.1| hypothetical protein TcasGA2_TC013017 [Tribolium castaneum]
Length = 292
Score = 44.7 bits (104), Expect = 0.061, Method: Compositional matrix adjust.
Identities = 43/165 (26%), Positives = 81/165 (49%), Gaps = 15/165 (9%)
Query: 49 GLSFDYMLRGLRYSEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQ 108
G S Y+L +R+ Q+ K +V++LD TL+H + K +S+ + + +I + ++
Sbjct: 80 GSSCTYLLPPVRH--QDMHKKCMVIDLDETLVH-SSFKPISNADFVVPVEIDGTVHQVYV 136
Query: 109 MANDKLVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED 168
+ RP V FL++ L + L T S YA+ LLD F SR+ RE
Sbjct: 137 LK-------RPHVDDFLKRMGELYECVLFTASLAKYADPVADLLD-QWGVFRSRLF-RES 187
Query: 169 ---FNGKDRKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKY 210
+ G K+ + + + + IVI+D++ + + H +N + + +
Sbjct: 188 CVFYRGNYVKDLNKLGRELQQIVIVDNSPASYIFHPDNAVPVASW 232
>gi|116197703|ref|XP_001224663.1| hypothetical protein CHGG_07007 [Chaetomium globosum CBS 148.51]
gi|88178286|gb|EAQ85754.1| hypothetical protein CHGG_07007 [Chaetomium globosum CBS 148.51]
Length = 533
Score = 44.7 bits (104), Expect = 0.065, Method: Compositional matrix adjust.
Identities = 52/212 (24%), Positives = 91/212 (42%), Gaps = 39/212 (18%)
Query: 71 LVLNLDHTLLHCRNIKS-LSSG---EKYLKKQIHSFIGSLFQMANDKL---VKLRPFVRT 123
L+L+LD TL+H + +SSG E L S G + V RP
Sbjct: 335 LILDLDETLIHSMSKGGRMSSGHMVEVRLNTTYQSAGGQAAVGPQHPILYYVHKRPHCDE 394
Query: 124 FLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI-----IAREDFNGKDRKN-- 176
FL + S ++ + T S + YA+ + L+ + KYFS+R R KD +
Sbjct: 395 FLRRVSKWFNLVVFTASVQEYADPVIDWLEAERKYFSARYYRQHCTFRHGAFIKDLSSVE 454
Query: 177 PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRDKELNGDHKSYSETLTDESENE 236
PDL + ++ILD++ + H +N I + ++ +D ++++
Sbjct: 455 PDLSK-----VMILDNSPLSYMFHQDNAIPIQGWI------------------SDPTDSD 491
Query: 237 EALANVLRVLKTIHRLFFDSVCGDVRTYLPKV 268
L+N++ L+ +HR + V G + P V
Sbjct: 492 --LSNLIPFLEGLHRAGIERVYGGILDLEPPV 521
>gi|389584175|dbj|GAB66908.1| nif-like protein [Plasmodium cynomolgi strain B]
Length = 303
Score = 44.7 bits (104), Expect = 0.066, Method: Compositional matrix adjust.
Identities = 42/158 (26%), Positives = 73/158 (46%), Gaps = 24/158 (15%)
Query: 69 LQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFI-GSLFQMANDKLVKLRPFVRTFLEQ 127
+ LVL+LD TL++C K S +K++ I G F + V RP++ F
Sbjct: 58 MTLVLDLDETLIYCTKKKKFSH-----QKEVDVLINGRYFSLY----VCKRPYLDLFFSI 108
Query: 128 ASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED-FNGKDR---KNPDLVRGQ 183
+ +I + T S + YA+ + ++D+D ++ + RED F + KN ++ +
Sbjct: 109 LNPFFEIVIFTTSIKSYADTVLNIIDVD--HYIDKKFYREDCFEVNQKIYIKNLQNIKKE 166
Query: 184 ERGIVILDDTESVWSDHTENLIVLGKYVYFRDKELNGD 221
IV++DD+ + EN YF K+ GD
Sbjct: 167 VSKIVLIDDSNISGLKYPEN--------YFPIKKWQGD 196
>gi|406602671|emb|CCH45772.1| CTD small phosphatase-like protein 2-B [Wickerhamomyces ciferrii]
Length = 423
Score = 44.7 bits (104), Expect = 0.066, Method: Compositional matrix adjust.
Identities = 35/148 (23%), Positives = 76/148 (51%), Gaps = 11/148 (7%)
Query: 68 KLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQ 127
K L+L+LD TL+H + + + ++ ++ + + +L+ V RP+ FL+Q
Sbjct: 244 KKTLILDLDETLVHSLSRGTRMNNGHMIEVKLSNQVATLY------YVYKRPYCDHFLKQ 297
Query: 128 ASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDR----KNPDLVRGQ 183
S ++ + T S + YA+ + L+ + KYFS R R+ +D K+ ++V
Sbjct: 298 ISKWFNLVIFTASVKEYADPVIDWLESERKYFSKR-YYRDHCTLRDGQGYIKDLNIVDKN 356
Query: 184 ERGIVILDDTESVWSDHTENLIVLGKYV 211
+ ++I+D++ ++ H N I++ ++
Sbjct: 357 LQNLIIIDNSPISYAWHESNAIIVEGWI 384
>gi|146185627|ref|XP_001032201.2| NLI interacting factor-like phosphatase family protein [Tetrahymena
thermophila]
gi|146142847|gb|EAR84538.2| NLI interacting factor-like phosphatase family protein [Tetrahymena
thermophila SB210]
Length = 446
Score = 44.7 bits (104), Expect = 0.066, Method: Compositional matrix adjust.
Identities = 30/97 (30%), Positives = 53/97 (54%), Gaps = 4/97 (4%)
Query: 115 VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFS---SRIIAREDFNG 171
+++RP+ FL++ + DIY+ T S+ YA A VK LD + KY + +R E NG
Sbjct: 301 LRVRPYCLEFLQKLAQYWDIYIFTASSPTYASAIVKFLDPEGKYINGILNRSNCMETKNG 360
Query: 172 KDRKNPDLVRGQE-RGIVILDDTESVWSDHTENLIVL 207
K+ +V+G++ + V++D+ + EN I +
Sbjct: 361 FFIKDLRIVKGKDLKKTVLVDNLAHSFGFQIENGIPI 397
>gi|357450579|ref|XP_003595566.1| CTD small phosphatase-like protein [Medicago truncatula]
gi|355484614|gb|AES65817.1| CTD small phosphatase-like protein [Medicago truncatula]
Length = 469
Score = 44.7 bits (104), Expect = 0.068, Method: Compositional matrix adjust.
Identities = 40/149 (26%), Positives = 71/149 (47%), Gaps = 16/149 (10%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLV--KLRPFVRTF 124
+ + LVL+LD TL+H ++ + F ++F D +V K RPF+ F
Sbjct: 295 KSVTLVLDLDETLVH-STLEHCDDAD---------FTFNIFFNMKDYIVYVKQRPFLHKF 344
Query: 125 LEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED---FNGKDRKNPDLVR 181
LE+ S + ++ + T S YA + +LD D K+ S R+ RE +G K+ ++
Sbjct: 345 LERVSDMFEVVIFTASQSIYANQLLDILDPDEKFISRRLY-RESCMFSDGNYTKDLTILG 403
Query: 182 GQERGIVILDDTESVWSDHTENLIVLGKY 210
+VI+D++ V+ N I + +
Sbjct: 404 IDLAKVVIIDNSPQVFRLQVNNGIPIKSW 432
>gi|357450577|ref|XP_003595565.1| CTD small phosphatase-like protein [Medicago truncatula]
gi|355484613|gb|AES65816.1| CTD small phosphatase-like protein [Medicago truncatula]
Length = 460
Score = 44.7 bits (104), Expect = 0.068, Method: Compositional matrix adjust.
Identities = 40/149 (26%), Positives = 71/149 (47%), Gaps = 16/149 (10%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLV--KLRPFVRTF 124
+ + LVL+LD TL+H ++ + F ++F D +V K RPF+ F
Sbjct: 286 KSVTLVLDLDETLVH-STLEHCDDAD---------FTFNIFFNMKDYIVYVKQRPFLHKF 335
Query: 125 LEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED---FNGKDRKNPDLVR 181
LE+ S + ++ + T S YA + +LD D K+ S R+ RE +G K+ ++
Sbjct: 336 LERVSDMFEVVIFTASQSIYANQLLDILDPDEKFISRRLY-RESCMFSDGNYTKDLTILG 394
Query: 182 GQERGIVILDDTESVWSDHTENLIVLGKY 210
+VI+D++ V+ N I + +
Sbjct: 395 IDLAKVVIIDNSPQVFRLQVNNGIPIKSW 423
>gi|145515175|ref|XP_001443487.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124410876|emb|CAK76090.1| unnamed protein product [Paramecium tetraurelia]
Length = 411
Score = 44.3 bits (103), Expect = 0.068, Method: Compositional matrix adjust.
Identities = 33/135 (24%), Positives = 58/135 (42%), Gaps = 15/135 (11%)
Query: 29 HTTVRDSRCIFCSQAMNDSFGLSFDYMLRGLRYSEQEERKLQLVLNLDHTLLHCRNIKSL 88
H T + C F Q ND + + ++ +R+ L +LD TL+HC ++
Sbjct: 179 HQTYQGLNCRFFPQNNND--------YNKSHKLPKKHQRQFTLFFDLDETLVHCNETPTI 230
Query: 89 SSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAA 148
+ +I+ + + + +RP+ + L+ S+ +I + T S CYAE
Sbjct: 231 PCD---VVLEINVSKHQVVRAG----INVRPYAKELLKNLSNHFEIIVFTASHSCYAEKV 283
Query: 149 VKLLDLDSKYFSSRI 163
LD DS S R+
Sbjct: 284 CNYLDPDSTIISHRL 298
>gi|353230275|emb|CCD76446.1| nuclear lim interactor-interacting factor-related [Schistosoma
mansoni]
Length = 429
Score = 44.3 bits (103), Expect = 0.069, Method: Compositional matrix adjust.
Identities = 40/140 (28%), Positives = 68/140 (48%), Gaps = 12/140 (8%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
LVL+LD TL+HC ++ L + + F G ++ + V++RP + FL S
Sbjct: 295 LVLDLDETLVHC-SLNPLLDAQFIFQV---VFQGVVYMV----YVRIRPHLYEFLTNVSE 346
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED---FNGKDRKNPDLVRGQERGI 187
++ L T ST+ YA+ V L+D K+ R+ RE NG K+ ++ R
Sbjct: 347 HFEVVLFTASTKVYADRLVNLIDPKKKWIKHRLF-REHCVCVNGNYVKDLRVLGRDLRKT 405
Query: 188 VILDDTESVWSDHTENLIVL 207
VI+D++ + L++L
Sbjct: 406 VIIDNSPQAFGYQVFGLLLL 425
>gi|145533457|ref|XP_001452473.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124420172|emb|CAK85076.1| unnamed protein product [Paramecium tetraurelia]
Length = 481
Score = 44.3 bits (103), Expect = 0.074, Method: Compositional matrix adjust.
Identities = 39/161 (24%), Positives = 73/161 (45%), Gaps = 25/161 (15%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKL----VKLRPFVRTFLE 126
LVL+LD TL+HC + Q+ QM N ++ + +RPF + FL+
Sbjct: 292 LVLDLDETLIHCNE-----------QPQMKYDFKVPIQMPNGQIHEAGISVRPFAQQFLQ 340
Query: 127 QASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSR------IIAREDFNGKDRKNPDLV 180
+ S ++ + T S YA+ + LD K+ + R I ++ KD + ++
Sbjct: 341 ECSKHFEVMIFTASHPLYADKIIDKLDPTKKWVTCRLYREHCIQTQQGIYVKDLR---IL 397
Query: 181 RGQERGIVILDDTESVWSDHTENLIVLGKYV-YFRDKELNG 220
+ +V++D+ ++ +N I + Y+ +D EL G
Sbjct: 398 NRNLKDVVLIDNAAYSFAYQIDNGIPIIPYIDNPKDNELIG 438
>gi|221481692|gb|EEE20068.1| conserved hypothetical protein [Toxoplasma gondii GT1]
gi|221502239|gb|EEE27977.1| dullard protein, putative [Toxoplasma gondii VEG]
Length = 184
Score = 44.3 bits (103), Expect = 0.078, Method: Compositional matrix adjust.
Identities = 36/131 (27%), Positives = 65/131 (49%), Gaps = 14/131 (10%)
Query: 69 LQLVLNLDHTLLHCRNIKSLSSGEKYLKK-QIHSFIGSLFQMANDKLVKLRPFVRTFLEQ 127
+ LVL++D TL+HC K L +L + + +G ++ +RP+ + FL+
Sbjct: 1 MTLVLDMDETLMHCAT-KPLEKSPAFLVRFSDTNVLGHVY---------VRPYTKIFLDL 50
Query: 128 ASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE--DFNGKDRKNPDLVRGQER 185
AS + +I + T ST+ YA+ + LD D + R+ + NG K+ L+ G++
Sbjct: 51 ASQICEIVVFTASTQSYADQVLAHLDPDRRLVHHRLYRQHCTMINGGYVKDLRLL-GRDI 109
Query: 186 GIVILDDTESV 196
V+L D +
Sbjct: 110 SRVVLADNSPI 120
>gi|452005182|gb|EMD97638.1| hypothetical protein COCHEDRAFT_1200267 [Cochliobolus
heterostrophus C5]
Length = 467
Score = 44.3 bits (103), Expect = 0.081, Method: Compositional matrix adjust.
Identities = 38/158 (24%), Positives = 79/158 (50%), Gaps = 27/158 (17%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKY-----LKKQIHSFIGSLFQMANDKL-----VKLRPF 120
L+++LD TL+H S+ +G ++ ++ ++ + IG+ Q+ ++ V RP+
Sbjct: 282 LIIDLDETLIH-----SIVNGGRFQTGHMVEVKLQASIGADGQVIGPQVPLLYYVHKRPY 336
Query: 121 VRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI-----IAREDFNGKD-- 173
FL++ S ++ + T S + YA+ + L+++ KYF+ R R KD
Sbjct: 337 CDDFLKKVSKWYNLIIFTASVQEYADPVIDWLEVERKYFAGRYYRQHCTVRNGAYIKDLA 396
Query: 174 RKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYV 211
+ PDL + ++ILD++ + H +N I + ++
Sbjct: 397 QVEPDLSK-----VMILDNSPLSYVFHPDNAIPIEGWI 429
>gi|55742007|ref|NP_001006793.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
small phosphatase 2 [Xenopus (Silurana) tropicalis]
gi|49903624|gb|AAH76658.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
small phosphatase 1 [Xenopus (Silurana) tropicalis]
Length = 271
Score = 44.3 bits (103), Expect = 0.084, Method: Compositional matrix adjust.
Identities = 43/146 (29%), Positives = 74/146 (50%), Gaps = 11/146 (7%)
Query: 62 SEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFV 121
+ +++ K+ +V++LD TL+H + K +S+ + + +I G+ Q+ V RP+V
Sbjct: 95 APKDKEKICMVIDLDETLVH-SSFKPISNADFIVPVEIE---GTTHQV----YVLKRPYV 146
Query: 122 RTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVR 181
FLE+ L + L T S YA+ LLD S F SR+ + DL R
Sbjct: 147 DEFLERMGQLYECVLFTASLAKYADPVTDLLD-KSGVFRSRLFREACVFHQGCYVKDLSR 205
Query: 182 -GQE-RGIVILDDTESVWSDHTENLI 205
G++ + VILD++ + + H EN +
Sbjct: 206 LGRDLKKTVILDNSPASYIFHPENAV 231
>gi|159476674|ref|XP_001696436.1| cleavage and polyadenylation factor 6-related protein
[Chlamydomonas reinhardtii]
gi|158282661|gb|EDP08413.1| cleavage and polyadenylation factor 6-related protein
[Chlamydomonas reinhardtii]
Length = 2174
Score = 44.3 bits (103), Expect = 0.084, Method: Composition-based stats.
Identities = 21/51 (41%), Positives = 32/51 (62%)
Query: 115 VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIA 165
+KLRP R FL +A+ +++ T R YA+A V+LLD + F SR++A
Sbjct: 867 LKLRPGARAFLARAAERYELWARTRQGRPYADAVVELLDPHQQLFGSRVVA 917
>gi|340380578|ref|XP_003388799.1| PREDICTED: hypothetical protein LOC100637093 [Amphimedon
queenslandica]
Length = 532
Score = 44.3 bits (103), Expect = 0.085, Method: Compositional matrix adjust.
Identities = 31/93 (33%), Positives = 48/93 (51%), Gaps = 8/93 (8%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
LVL+LD TL+HC ++ L K + + LF D V+LRP+ FLE+ S
Sbjct: 357 LVLDLDETLVHC-SLSKLELANFTFKVE---YSNQLF----DVYVRLRPYFHEFLERVSK 408
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
++ L T ST+ YA+ + L+D + R+
Sbjct: 409 QFEVILFTASTKVYADKLLDLIDPSRRLVKHRL 441
>gi|339237973|ref|XP_003380541.1| nuclear envelope morphology protein 1 [Trichinella spiralis]
gi|316976534|gb|EFV59811.1| nuclear envelope morphology protein 1 [Trichinella spiralis]
Length = 281
Score = 44.3 bits (103), Expect = 0.085, Method: Compositional matrix adjust.
Identities = 57/215 (26%), Positives = 87/215 (40%), Gaps = 41/215 (19%)
Query: 62 SEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFV 121
E+ ++ VL+LD TL+H R S G+ K I ++ Q +RP
Sbjct: 96 PEKSKKLYTAVLDLDQTLVHSR---SKRKGDPRYK------IVNIPQATRRFYTAVRPCC 146
Query: 122 RTFLEQASSLVDIYLCTMSTRCYAEAAV-KLLDLDSKYFSSRIIAREDFNGKDR---KNP 177
FLE S ++ L T T YA A + +L+D + KYFS+ R D D K+
Sbjct: 147 AEFLESISEFYEVILFTAGTPRYAAAVIDQLVDPEHKYFSN-FYYRPDCAPVDHEFVKDL 205
Query: 178 DLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRDKELNGDHKSYSETLTDESENEE 237
++ VI+DD + H +N I++ E T + E+ E
Sbjct: 206 SILGRDLSKTVIMDDNMMSFCCHIDNGILV-------------------EPWTGDEEDRE 246
Query: 238 ALANVLRVLKTIHRLFFDSVCGDVRTYLPKVRSEF 272
LKT+ R F + V +V P +R F
Sbjct: 247 --------LKTMIRFFHEIVDSNVEDVRPFLRERF 273
>gi|215695024|dbj|BAG90215.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 269
Score = 44.3 bits (103), Expect = 0.085, Method: Compositional matrix adjust.
Identities = 41/139 (29%), Positives = 68/139 (48%), Gaps = 16/139 (11%)
Query: 63 EQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLR--PF 120
EQ RK+ LVL+LD TL+H S+ E+ + F +F + +V +R P
Sbjct: 140 EQGARKVTLVLDLDETLVH-------STTEQC---DDYDFTFPVFFDMKEHMVYVRKRPH 189
Query: 121 VRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED---FNGKDRKNP 177
+ FL++ + + ++ + T S YA+ + +LD + K FS R RE N K+
Sbjct: 190 LHMFLQKMAEMFEVVIFTASQSVYADQLLDILDPEKKLFSRRYF-RESCVFTNTSYTKDL 248
Query: 178 DLVRGQERGIVILDDTESV 196
+V +VI+D+T V
Sbjct: 249 TVVGVDLAKVVIIDNTPQV 267
>gi|340501300|gb|EGR28100.1| NLI interacting factor-like phosphatase family protein, putative
[Ichthyophthirius multifiliis]
Length = 306
Score = 43.9 bits (102), Expect = 0.094, Method: Compositional matrix adjust.
Identities = 43/142 (30%), Positives = 67/142 (47%), Gaps = 12/142 (8%)
Query: 71 LVLNLDHTLLH-CRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQAS 129
L L+LD TL+H CR E Y QI +F + Q ++RP+ FL++ S
Sbjct: 111 LFLDLDETLIHSCR------INENY-NVQIKAFEDNNSQQEYLIQFRIRPYCMEFLQKIS 163
Query: 130 SLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAR---EDFNGKDRKNPDLVRGQE-R 185
DIYL T S+ YA A V LD +Y + + + E NG K+ +V+G +
Sbjct: 164 KYWDIYLFTASSTTYANAIVNYLDPHRQYINQVLTRKNCMETKNGFFVKDLRIVKGINIK 223
Query: 186 GIVILDDTESVWSDHTENLIVL 207
+I+D+ + +N I +
Sbjct: 224 KAIIVDNLAHSFGLQIDNGIPI 245
>gi|326513088|dbj|BAK06784.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 445
Score = 43.9 bits (102), Expect = 0.099, Method: Compositional matrix adjust.
Identities = 41/147 (27%), Positives = 66/147 (44%), Gaps = 10/147 (6%)
Query: 63 EQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVR 122
EQ RK+ LVL+LD TL+H S ++ SF S + V+ RP +
Sbjct: 269 EQGARKVTLVLDLDETLVH--------STLEHCDDADFSFPVSFGLKEHVVYVRKRPHLH 320
Query: 123 TFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDL-VR 181
FL++ + + D+ + T S YA+ + LD ++ FS R + DL V
Sbjct: 321 MFLQKMAEMFDVVIFTASQSVYADQLLDRLDPENTLFSKRFFRESCVFTESGYTKDLTVI 380
Query: 182 GQERG-IVILDDTESVWSDHTENLIVL 207
G + + I+D+T V+ N I +
Sbjct: 381 GVDLAKVAIIDNTPQVFQLQVNNGIPI 407
>gi|37538060|gb|AAQ92971.1| CTD-phosphatase-like protein [Hordeum vulgare subsp. vulgare]
gi|37538062|gb|AAQ92972.1| CTD-phosphatase-like protein [Hordeum vulgare subsp. vulgare]
Length = 445
Score = 43.9 bits (102), Expect = 0.099, Method: Compositional matrix adjust.
Identities = 41/147 (27%), Positives = 66/147 (44%), Gaps = 10/147 (6%)
Query: 63 EQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVR 122
EQ RK+ LVL+LD TL+H S ++ SF S + V+ RP +
Sbjct: 269 EQGARKVTLVLDLDETLVH--------STLEHCDDADFSFPVSFGLKEHVVYVRKRPHLH 320
Query: 123 TFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDL-VR 181
FL++ + + D+ + T S YA+ + LD ++ FS R + DL V
Sbjct: 321 MFLQKMAEMFDVVIFTASQSVYADQLLDRLDPENTLFSKRFFRESCVFTESGYTKDLTVI 380
Query: 182 GQERG-IVILDDTESVWSDHTENLIVL 207
G + + I+D+T V+ N I +
Sbjct: 381 GVDLAKVAIIDNTPQVFQLQVNNGIPI 407
>gi|225681687|gb|EEH19971.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb03]
Length = 869
Score = 43.9 bits (102), Expect = 0.100, Method: Compositional matrix adjust.
Identities = 36/155 (23%), Positives = 69/155 (44%), Gaps = 42/155 (27%)
Query: 72 VLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSL--FQMANDKL--------VKLRPFV 121
V++LD T++H +++ ++ H + + FQ+ +D +KLRP +
Sbjct: 163 VVDLDQTIIHATVDPTVAEWQQDRDNPNHEAVKDVRAFQLVDDGPGMKGCWYYIKLRPGL 222
Query: 122 RTFLEQASSLVDIYLCTMSTRC---YAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPD 178
+ FL++ S+L ++++ TM TR A+ +L +D+K
Sbjct: 223 QEFLQEISALYELHIYTMGTRAGSLTAKNLQRLFPVDTKM-------------------- 262
Query: 179 LVRGQERGIVILDDTESVWSDHTENLIVLGKYVYF 213
+VI+DD VW ++NLI + Y +F
Sbjct: 263 --------VVIIDDRGDVWK-WSDNLIKVSPYDFF 288
>gi|145483633|ref|XP_001427839.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124394922|emb|CAK60441.1| unnamed protein product [Paramecium tetraurelia]
Length = 308
Score = 43.9 bits (102), Expect = 0.100, Method: Compositional matrix adjust.
Identities = 45/165 (27%), Positives = 80/165 (48%), Gaps = 13/165 (7%)
Query: 58 GLRYSEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKL 117
G+ + RKL VL+LD TL+H + K + + L + S + +F V +
Sbjct: 46 GIDTPKSHARKL-CVLDLDETLVHSQ-FKGDNGYDFLLDIIVQSQLFKVF-------VTV 96
Query: 118 RPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE--DFNGKDRK 175
RP V TFLEQ S DI L T S + YA+ + ++D + +R+ G K
Sbjct: 97 RPGVETFLEQLSEHFDIVLWTASLKEYADPVIDIID-PQRRIQTRLYRESCTPIRGGLTK 155
Query: 176 NPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFR-DKELN 219
N + + + ++I+D+++ + EN ++ ++ + DKEL+
Sbjct: 156 NLNKLGRNLKEVLIIDNSQMSFLFQPENGFLIKDFIQDKNDKELD 200
>gi|399215866|emb|CCF72554.1| unnamed protein product [Babesia microti strain RI]
Length = 248
Score = 43.9 bits (102), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 29/109 (26%), Positives = 54/109 (49%), Gaps = 16/109 (14%)
Query: 51 SFDYMLRGLRYSEQE----ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSL 106
+F L+ SE+ ++K LVL+LD TL+H +++ HSF ++
Sbjct: 35 TFQTQLKKFLTSEKPVTSGKKKFTLVLDLDETLIHS----------EFVTDGNHSFSTTI 84
Query: 107 FQMANDKLVKL--RPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLD 153
++ + + RP+ FLEQ + L ++ + T + YA+A + +LD
Sbjct: 85 KNDTENQTIYVYKRPYADEFLEQVAKLFEVVIFTAGSEPYAKAVIDILD 133
>gi|85001578|ref|XP_955502.1| ctd-like phosphatase [Theileria annulata strain Ankara]
gi|65303648|emb|CAI76026.1| ctd-like phosphatase, putative [Theileria annulata]
Length = 832
Score = 43.9 bits (102), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 26/86 (30%), Positives = 42/86 (48%)
Query: 109 MANDKLVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED 168
M + KLRP + F Q ++L T T+ +AE+A++++D YFS+RI +R
Sbjct: 296 MFTNTYFKLRPGIFNFFHQIRDKFTLFLFTTGTKQHAESALQIIDPQLIYFSNRIFSRSH 355
Query: 169 FNGKDRKNPDLVRGQERGIVILDDTE 194
N + N V G V+ T+
Sbjct: 356 SNILNGVNTVTVSGPTNITVVPGTTK 381
>gi|340508012|gb|EGR33824.1| NLI interacting factor-like phosphatase family protein, putative
[Ichthyophthirius multifiliis]
Length = 222
Score = 43.9 bits (102), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 30/98 (30%), Positives = 51/98 (52%), Gaps = 7/98 (7%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
L+L+LD TL+H + +++ +S FQ+A ++RP+ FL+Q S
Sbjct: 61 LLLDLDETLIHSCGLNENPDAVIMAQEEYNS--QKQFQIA----FRIRPYCIEFLQQVSK 114
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED 168
DIY+ T S+ YA A V LD +Y +++ R++
Sbjct: 115 YWDIYVFTASSASYANAIVNYLDSQQEYI-HQVLTRQN 151
>gi|403331662|gb|EJY64792.1| Dullard-like phosphatase domain containing protein [Oxytricha
trifallax]
Length = 1099
Score = 43.9 bits (102), Expect = 0.10, Method: Composition-based stats.
Identities = 47/165 (28%), Positives = 79/165 (47%), Gaps = 16/165 (9%)
Query: 56 LRGLRYSEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLV 115
L G R +E +K LVL+LD TL+H + K + L +I ++ V
Sbjct: 150 LLGPRMKGKENKK-TLVLDLDETLVH-SSFKPPEQPDIVLPVEIEGKTCYVY-------V 200
Query: 116 KLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED---FNGK 172
+RP TFLEQ S ++ + T S YAE +K+LD + F + RE +NG
Sbjct: 201 LIRPGAITFLEQLSEYYELVIFTASLSKYAEPLMKILDHGT--FCHYHLFREHCTFYNGI 258
Query: 173 DRKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRDKE 217
K+ + + + ++I+D++ S + EN + + ++ DKE
Sbjct: 259 FVKDMSQLGRRMQDVIIIDNSPSCYLFQPENALPI--LSWYDDKE 301
>gi|451846675|gb|EMD59984.1| hypothetical protein COCSADRAFT_151187 [Cochliobolus sativus
ND90Pr]
Length = 467
Score = 43.9 bits (102), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 38/158 (24%), Positives = 79/158 (50%), Gaps = 27/158 (17%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKY-----LKKQIHSFIGSLFQMANDKL-----VKLRPF 120
L+++LD TL+H S+ +G ++ ++ ++ + IG+ Q+ ++ V RP+
Sbjct: 282 LIIDLDETLIH-----SIVNGGRFQTGHMVEVKLQASIGADGQVIGPQVPLLYYVHKRPY 336
Query: 121 VRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI-----IAREDFNGKD-- 173
FL++ S ++ + T S + YA+ + L+++ KYF+ R R KD
Sbjct: 337 CDDFLKKVSKWYNLIIFTASVQEYADPVIDWLEVERKYFAGRYYRQHCTVRNGAYIKDLA 396
Query: 174 RKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYV 211
+ PDL + ++ILD++ + H +N I + ++
Sbjct: 397 QVEPDLSK-----VMILDNSPLSYVFHPDNAIPIEGWI 429
>gi|308811648|ref|XP_003083132.1| TFIIF-interacting CTD phosphatase, including NLI-interacting factor
(involved in RNA polymerase II regulation) (ISS)
[Ostreococcus tauri]
gi|116055010|emb|CAL57087.1| TFIIF-interacting CTD phosphatase, including NLI-interacting factor
(involved in RNA polymerase II regulation) (ISS)
[Ostreococcus tauri]
Length = 485
Score = 43.9 bits (102), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 54/100 (54%), Gaps = 7/100 (7%)
Query: 64 QEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRT 123
+++ + LVL+LD TL+H N+++ + + F G + Q+ V+ RP ++T
Sbjct: 282 KDDNRNTLVLDLDETLVHS-NLENTGGKSDFSFPVV--FNGEIHQVN----VRTRPHLQT 334
Query: 124 FLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
F+E S +I + T S + YA+ + LLD ++ + R+
Sbjct: 335 FMETVSKKYEIVVFTASQQIYADKLLDLLDPKREWIAHRV 374
>gi|221057037|ref|XP_002259656.1| nif-like protein [Plasmodium knowlesi strain H]
gi|193809728|emb|CAQ40430.1| nif-like protein, putative [Plasmodium knowlesi strain H]
Length = 327
Score = 43.9 bits (102), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 41/158 (25%), Positives = 73/158 (46%), Gaps = 24/158 (15%)
Query: 69 LQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFI-GSLFQMANDKLVKLRPFVRTFLEQ 127
+ LVL+LD TL++C K S +K++ I G F + K RP++ F
Sbjct: 58 MTLVLDLDETLIYCTKKKKFSH-----QKEVDVLINGRYFSLYVCK----RPYIDLFFSI 108
Query: 128 ASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED-FNGKDR---KNPDLVRGQ 183
+ +I + T S + YA+ + ++D+D ++ + RED F + KN ++ +
Sbjct: 109 LNPFFEIVIFTTSIKSYADTVLNIIDVD--HYIDKKFYREDCFEVSQKVYIKNLQSIKKE 166
Query: 184 ERGIVILDDTESVWSDHTENLIVLGKYVYFRDKELNGD 221
+V++DD+ + EN YF K+ GD
Sbjct: 167 ISKMVLIDDSNISGLKYPEN--------YFPIKKWQGD 196
>gi|148229304|ref|NP_001079929.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
small phosphatase 2 [Xenopus laevis]
gi|17046469|gb|AAL34532.1|AF441288_1 Os4 [Xenopus laevis]
gi|34784578|gb|AAH57696.1| MGC68415 protein [Xenopus laevis]
Length = 271
Score = 43.9 bits (102), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 43/146 (29%), Positives = 74/146 (50%), Gaps = 11/146 (7%)
Query: 62 SEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFV 121
+ +++ K+ +V++LD TL+H + K +S+ + + +I G+ Q+ V RP+V
Sbjct: 95 APKDKGKICMVIDLDETLVH-SSFKPISNADFIVPVEIE---GTTHQV----YVLKRPYV 146
Query: 122 RTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVR 181
FLE+ L + L T S YA+ LLD S F SR+ + DL R
Sbjct: 147 DEFLERMGQLYECVLFTASLAKYADPVTDLLD-KSGVFRSRLFREACVFHQGCYVKDLSR 205
Query: 182 -GQE-RGIVILDDTESVWSDHTENLI 205
G++ + VILD++ + + H EN +
Sbjct: 206 LGRDLKKTVILDNSPASYIFHPENAV 231
>gi|330794863|ref|XP_003285496.1| hypothetical protein DICPUDRAFT_91512 [Dictyostelium purpureum]
gi|325084587|gb|EGC38012.1| hypothetical protein DICPUDRAFT_91512 [Dictyostelium purpureum]
Length = 558
Score = 43.9 bits (102), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 29/97 (29%), Positives = 44/97 (45%), Gaps = 10/97 (10%)
Query: 58 GLRYSEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKL-VK 116
L + E K+ LVL+LD TL+HC S E Q H F ++ K
Sbjct: 371 ALPPKDHESPKISLVLDLDETLVHC-------STEPL--NQPHLIFPVFFNNTEYQVFAK 421
Query: 117 LRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLD 153
RPF FL + S++ ++ + T S YA + ++D
Sbjct: 422 KRPFFEEFLHKVSTIFEVIIFTASQEVYANKLLNIID 458
>gi|238480828|ref|NP_001031661.2| SCP1-like small phosphatase 4b [Arabidopsis thaliana]
gi|240255993|ref|NP_193548.7| SCP1-like small phosphatase 4b [Arabidopsis thaliana]
gi|332658601|gb|AEE84001.1| SCP1-like small phosphatase 4b [Arabidopsis thaliana]
gi|332658602|gb|AEE84002.1| SCP1-like small phosphatase 4b [Arabidopsis thaliana]
Length = 446
Score = 43.9 bits (102), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 34/112 (30%), Positives = 52/112 (46%), Gaps = 9/112 (8%)
Query: 52 FDYMLRGLRYSEQEERK-LQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMA 110
F+Y + + +RK + LVL+LD TL+H S + + SF +
Sbjct: 251 FNYFPDMQQPRDSPKRKAVTLVLDLDETLVH--------STLEVCRDTDFSFRVTFNMQE 302
Query: 111 NDKLVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSR 162
N VK RP++ FLE+ L + + T S YA + +LD D K+ S R
Sbjct: 303 NTVYVKQRPYLYRFLERVVELFHVVIFTASHSIYASQLLDILDPDGKFVSQR 354
>gi|442763025|gb|JAA73671.1| Putative tfiif-interacting ctd phosphat, partial [Ixodes ricinus]
Length = 260
Score = 43.9 bits (102), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 40/162 (24%), Positives = 81/162 (50%), Gaps = 19/162 (11%)
Query: 54 YMLRGLRYSEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDK 113
++L +R+ Q+ K+ L+++LD TL+H + K +S+ + + +I + ++ +
Sbjct: 73 FLLPPVRH--QDLHKICLIIDLDETLVHS-SFKPISNADFVVPVEIDGTVHQVYVLK--- 126
Query: 114 LVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKY--FSSRIIARED--- 168
RP+V FL++ D Y C + T A+ A + DL K+ F SR+ RE
Sbjct: 127 ----RPYVDEFLQRVG---DAYECVLFTASLAKYADPVADLLDKWGVFRSRLF-RESCVF 178
Query: 169 FNGKDRKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKY 210
+ G K+ + +VI+D++ + + H +N + +G +
Sbjct: 179 YRGNYVKDLGRLGRDLHRVVIIDNSPASYIFHPDNAVPVGSW 220
>gi|334186662|ref|NP_001190760.1| SCP1-like small phosphatase 4b [Arabidopsis thaliana]
gi|332658603|gb|AEE84003.1| SCP1-like small phosphatase 4b [Arabidopsis thaliana]
Length = 442
Score = 43.9 bits (102), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 34/112 (30%), Positives = 52/112 (46%), Gaps = 9/112 (8%)
Query: 52 FDYMLRGLRYSEQEERK-LQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMA 110
F+Y + + +RK + LVL+LD TL+H S + + SF +
Sbjct: 251 FNYFPDMQQPRDSPKRKAVTLVLDLDETLVH--------STLEVCRDTDFSFRVTFNMQE 302
Query: 111 NDKLVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSR 162
N VK RP++ FLE+ L + + T S YA + +LD D K+ S R
Sbjct: 303 NTVYVKQRPYLYRFLERVVELFHVVIFTASHSIYASQLLDILDPDGKFVSQR 354
>gi|347831182|emb|CCD46879.1| similar to NIF domain-containing protein [Botryotinia fuckeliana]
Length = 505
Score = 43.5 bits (101), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 36/146 (24%), Positives = 68/146 (46%), Gaps = 5/146 (3%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKL---VKLRPFVRTFLEQ 127
L+L+LD TL+H N S ++ QI + +G+ + V RP+ FL +
Sbjct: 319 LILDLDETLIHSMNYGGRMSAGHMVEVQITNLMGAGGAGPQHPILYYVNKRPYCDEFLRR 378
Query: 128 ASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDF--NGKDRKNPDLVRGQER 185
++ + T S + YA+ + L+ + K+FS+R + NG K+ V
Sbjct: 379 VCKWYNLVVFTASLQDYADPVIDWLEQERKFFSARYYRQHCTYRNGAFIKDLSSVEPDLS 438
Query: 186 GIVILDDTESVWSDHTENLIVLGKYV 211
++ILD++ + H +N I + ++
Sbjct: 439 KVMILDNSPVSYLFHQDNAIPIEGWI 464
>gi|224116454|ref|XP_002317305.1| predicted protein [Populus trichocarpa]
gi|222860370|gb|EEE97917.1| predicted protein [Populus trichocarpa]
Length = 377
Score = 43.5 bits (101), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 32/98 (32%), Positives = 48/98 (48%), Gaps = 10/98 (10%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKL-VKLRPFVRTFL 125
+ + LVL+LD TL+H S ++ +F F M + VK RP V TFL
Sbjct: 203 KSITLVLDLDETLVH--------STLEHCDDADFTFT-VFFNMKEHTVYVKQRPHVHTFL 253
Query: 126 EQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
E+ + + ++ + T S YA + +LD D K S RI
Sbjct: 254 ERVAEMFEVVIFTASQSIYAAQLLDMLDPDRKLISRRI 291
>gi|290990355|ref|XP_002677802.1| nuclear lim interactor-interacting protein [Naegleria gruberi]
gi|284091411|gb|EFC45058.1| nuclear lim interactor-interacting protein [Naegleria gruberi]
Length = 332
Score = 43.5 bits (101), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 8/105 (7%)
Query: 59 LRYSEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLR 118
L E + + LVL+LD TL+HC + + + + H +++ V+ R
Sbjct: 142 LPPKELSQPDITLVLDLDETLVHC-STEPIPDPDFTFTVLFHGVEYTVY-------VRKR 193
Query: 119 PFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
P+ FLE S + ++ + T S YA+ + +LD + KY R+
Sbjct: 194 PYFVEFLEAVSKIFEVVVFTASQSVYADKLLSILDPERKYIKYRV 238
>gi|171694335|ref|XP_001912092.1| hypothetical protein [Podospora anserina S mat+]
gi|170947116|emb|CAP73921.1| unnamed protein product [Podospora anserina S mat+]
Length = 529
Score = 43.5 bits (101), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 43/160 (26%), Positives = 73/160 (45%), Gaps = 19/160 (11%)
Query: 66 ERKLQLVLNLDHTLLHCRNIKS-LSSGEKYLKKQIHSFIGSLFQMANDK------LVKLR 118
E + L+L+LD TL+H + +SSG + +++G Q + V R
Sbjct: 335 EHQKTLILDLDETLIHSMSKGGRMSSGHMVEVRLNTTYVGVGGQNSIGPQHPILYYVHKR 394
Query: 119 PFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI-----IAREDFNGKD 173
P FL + S ++ + T S + YA+ + L+ D KYFS+R R KD
Sbjct: 395 PHCDEFLRRVSKWYNLVVFTASVQEYADPVIDWLEADRKYFSARYYRQHCTFRHGAFIKD 454
Query: 174 RKN--PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYV 211
+ PDL R ++ILD++ + H +N I + ++
Sbjct: 455 LSSVEPDLSR-----VMILDNSPLSYMFHQDNAIPIQGWI 489
>gi|407929015|gb|EKG21854.1| NLI interacting factor [Macrophomina phaseolina MS6]
Length = 510
Score = 43.5 bits (101), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 29/102 (28%), Positives = 54/102 (52%), Gaps = 15/102 (14%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKY-----LKKQIHSFIGSLFQMANDKL-----VKLRPF 120
L+L+LD TL+H S++ G +Y ++ +++ +GS Q+ ++ V RP
Sbjct: 332 LILDLDETLIH-----SMAKGGRYTTGHMVEVKLNQAMGSGNQVIGPQIPILYYVHKRPH 386
Query: 121 VRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSR 162
FL + S ++ + T S + YA+ + L+L+ KYF+ R
Sbjct: 387 CDDFLRKVSKWYNLIIFTASVQEYADPVIDWLELERKYFAGR 428
>gi|223648574|gb|ACN11045.1| Carboxy-terminal domain RNA polymerase II polypeptide A small
phosphatase 2 [Salmo salar]
Length = 260
Score = 43.5 bits (101), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 40/147 (27%), Positives = 72/147 (48%), Gaps = 17/147 (11%)
Query: 64 QEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRT 123
Q++ K+ +V++LD TL+H + K +S+ + + +I G+ Q+ V RP+V
Sbjct: 86 QDQGKICVVIDLDETLVH-SSFKPISNADFIVPVEIE---GTTHQV----YVLKRPYVDE 137
Query: 124 FLEQASSLVDIYLCTMSTRCYAEAAVKLLD----LDSKYFSSRIIAREDFNGKDRKNPDL 179
FL++ L + L T S YA+ LLD ++ F + + F KD
Sbjct: 138 FLQRMGELFECILFTASLAKYADPVTDLLDQCGVFRARLFRESCVFHQGFYVKDLS---- 193
Query: 180 VRGQE-RGIVILDDTESVWSDHTENLI 205
+ G+E +ILD++ + + H EN +
Sbjct: 194 ILGRELHKTLILDNSPASYIFHPENAV 220
>gi|395503570|ref|XP_003756137.1| PREDICTED: CTD small phosphatase-like protein 2 [Sarcophilus
harrisii]
Length = 395
Score = 43.5 bits (101), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 28/85 (32%), Positives = 42/85 (49%), Gaps = 10/85 (11%)
Query: 79 LLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASSLVDIYLCT 138
L +I S++ K LKK I S + V+LRPF R FLE+ S + +I L T
Sbjct: 229 LTGSSSIASIAQTHKNLKKYIDSNV----------YVRLRPFFREFLERMSQIYEIILFT 278
Query: 139 MSTRCYAEAAVKLLDLDSKYFSSRI 163
S + YA+ + +LD + R+
Sbjct: 279 ASKKVYADKLLNILDPKKQLVRHRL 303
>gi|222632581|gb|EEE64713.1| hypothetical protein OsJ_19569 [Oryza sativa Japonica Group]
Length = 485
Score = 43.5 bits (101), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 36/143 (25%), Positives = 63/143 (44%), Gaps = 10/143 (6%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLE 126
+ + LVL+LD TL+H + G + H + VK RP V TFL+
Sbjct: 308 KNITLVLDLDETLIHSSAVDR--DGADFSFPMYHGL------KEHTVYVKKRPHVDTFLQ 359
Query: 127 QASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE--DFNGKDRKNPDLVRGQE 184
+ S + + + T S YA + +LD + +F+ R +G K+ ++
Sbjct: 360 KVSEMFKVVIFTASLSSYANRLLDMLDPKNIFFTKRYFRDSCLPVDGSYLKDLTVIVADL 419
Query: 185 RGIVILDDTESVWSDHTENLIVL 207
+VI+D++ V+ EN I +
Sbjct: 420 AKVVIIDNSPEVFRLQEENGIPI 442
>gi|348685327|gb|EGZ25142.1| hypothetical protein PHYSODRAFT_311755 [Phytophthora sojae]
Length = 257
Score = 43.5 bits (101), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 33/101 (32%), Positives = 53/101 (52%), Gaps = 9/101 (8%)
Query: 68 KLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQ 127
K+ LVL+LD TL+HC S+ + + +F G + + VK RP + FL++
Sbjct: 75 KICLVLDLDETLVHC----SVDEVKNPHMQFPVTFNGVEYTVN----VKKRPHLEYFLKR 126
Query: 128 ASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED 168
S L +I + T S + YAE + +LD + + R+ RED
Sbjct: 127 VSKLFEIVVFTASHKVYAEKLMNMLDPNRNFIKYRLY-RED 166
>gi|218197280|gb|EEC79707.1| hypothetical protein OsI_21008 [Oryza sativa Indica Group]
Length = 485
Score = 43.5 bits (101), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 36/143 (25%), Positives = 63/143 (44%), Gaps = 10/143 (6%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLE 126
+ + LVL+LD TL+H + G + H + VK RP V TFL+
Sbjct: 308 KNITLVLDLDETLIHSSAVDR--DGADFSFPMYHGL------KEHTVYVKKRPHVDTFLQ 359
Query: 127 QASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE--DFNGKDRKNPDLVRGQE 184
+ S + + + T S YA + +LD + +F+ R +G K+ ++
Sbjct: 360 KVSEMFKVVIFTASLSSYANRLLDMLDPKNIFFTKRYFRDSCLPVDGSYLKDLTVIVADL 419
Query: 185 RGIVILDDTESVWSDHTENLIVL 207
+VI+D++ V+ EN I +
Sbjct: 420 AKVVIIDNSPEVFRLQEENGIPI 442
>gi|256083671|ref|XP_002578064.1| nuclear lim interactor-interacting factor-related [Schistosoma
mansoni]
Length = 441
Score = 43.5 bits (101), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 38/130 (29%), Positives = 66/130 (50%), Gaps = 12/130 (9%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
LVL+LD TL+HC L + +++ + + F G ++ + V++RP + FL S
Sbjct: 295 LVLDLDETLVHCSLNPLLDA--QFIFQVV--FQGVVYMV----YVRIRPHLYEFLTNVSE 346
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED---FNGKDRKNPDLVRGQERGI 187
++ L T ST+ YA+ V L+D K+ R+ RE NG K+ ++ R
Sbjct: 347 HFEVVLFTASTKVYADRLVNLIDPKKKWIKHRLF-REHCVCVNGNYVKDLRVLGRDLRKT 405
Query: 188 VILDDTESVW 197
VI+D++ +
Sbjct: 406 VIIDNSPQAF 415
>gi|427785179|gb|JAA58041.1| hypothetical protein [Rhipicephalus pulchellus]
Length = 285
Score = 43.5 bits (101), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 43/191 (22%), Positives = 90/191 (47%), Gaps = 19/191 (9%)
Query: 25 LSCAHTTVRDSRCIFCSQAMNDSFGLSFDYMLRGLRYSEQEERKLQLVLNLDHTLLHCRN 84
L C + + + + + S L ++L +R+ Q+ K+ L+++LD TL+H +
Sbjct: 43 LCCFGSNNQGNNPVIAEENGQYSPKLQGKFLLPPVRH--QDMHKICLIIDLDETLVHS-S 99
Query: 85 IKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASSLVDIYLCTMSTRCY 144
K +S+ + + +I + ++ + RP+V FL++ D Y C + T
Sbjct: 100 FKPISNADFVVPVEIDGTVHQVYVLK-------RPYVDEFLQRVG---DAYECVLFTASL 149
Query: 145 AEAAVKLLDLDSKY--FSSRIIARED---FNGKDRKNPDLVRGQERGIVILDDTESVWSD 199
A+ A + DL K+ F +R+ RE + G K+ + +VI+D++ + +
Sbjct: 150 AKYADPVADLLDKWGVFRARLF-RESCVFYRGNYVKDLGRLGRDLHRVVIIDNSPASYIF 208
Query: 200 HTENLIVLGKY 210
H +N + +G +
Sbjct: 209 HPDNAVPVGSW 219
>gi|301118476|ref|XP_002906966.1| CTD small phosphatase-like protein, putative [Phytophthora
infestans T30-4]
gi|301126789|ref|XP_002909873.1| CTD small phosphatase-like protein, putative [Phytophthora
infestans T30-4]
gi|262101427|gb|EEY59479.1| CTD small phosphatase-like protein, putative [Phytophthora
infestans T30-4]
gi|262108315|gb|EEY66367.1| CTD small phosphatase-like protein, putative [Phytophthora
infestans T30-4]
Length = 237
Score = 43.5 bits (101), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 35/135 (25%), Positives = 63/135 (46%), Gaps = 6/135 (4%)
Query: 68 KLQLVLNLDHTLLHCRNIKSLSSGE-KYLKKQIHSFIGSLFQMAND---KLVKLRPFVRT 123
++ LVL++D L+H + + + +Y +Q+ + S + +D +V RP +
Sbjct: 40 RIALVLDMDECLVHSKFQNEVEYRQSEYRPEQLEEYSDSFEIVMDDGERAIVNKRPGLDR 99
Query: 124 FLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAR--EDFNGKDRKNPDLVR 181
FLE+A+ D+Y+ T Y + + LD F+ R + G K+ ++VR
Sbjct: 100 FLEEAAKHYDVYVFTAGLEAYGKPILDALDPKGNLFAGRFFRESCQQRKGMFLKDLNVVR 159
Query: 182 GQERGIVILDDTESV 196
G + VIL D V
Sbjct: 160 GGDLSRVILVDNNPV 174
>gi|55740289|gb|AAV63947.1| putative nuclear LIM interactor-interacting protein [Phytophthora
sojae]
Length = 261
Score = 43.5 bits (101), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 37/142 (26%), Positives = 69/142 (48%), Gaps = 10/142 (7%)
Query: 68 KLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQ 127
K+ LVL+LD TL+HC S+ + + +F G + + VK RP + FL++
Sbjct: 78 KICLVLDLDETLVHC----SVDEVKNPHMQFPVTFNGVEYTVN----VKKRPHLEYFLKR 129
Query: 128 ASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE--DFNGKDRKNPDLVRGQER 185
S L +I + T S + YAE + +LD + + R+ + D G K+ +++
Sbjct: 130 VSKLFEIVVFTASHKVYAEKLMNMLDPNRNFIKYRLYREDCLDVFGNYLKDLNVLGRDLS 189
Query: 186 GIVILDDTESVWSDHTENLIVL 207
+V++D++ + N I +
Sbjct: 190 KVVLVDNSPHAFGYQVNNGIPI 211
>gi|55740279|gb|AAV63941.1| putative nuclear LIM factor interactor-interacting protein hyphal
form [Phytophthora infestans]
Length = 237
Score = 43.5 bits (101), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 35/135 (25%), Positives = 63/135 (46%), Gaps = 6/135 (4%)
Query: 68 KLQLVLNLDHTLLHCRNIKSLSSGE-KYLKKQIHSFIGSLFQMAND---KLVKLRPFVRT 123
++ LVL++D L+H + + + +Y +Q+ + S + +D +V RP +
Sbjct: 40 RIALVLDMDECLVHSKFQNEVEYRQSEYRPEQLEEYSDSFEIVMDDGERAIVNKRPGLDR 99
Query: 124 FLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAR--EDFNGKDRKNPDLVR 181
FLE+A+ D+Y+ T Y + + LD F+ R + G K+ ++VR
Sbjct: 100 FLEEAAKHYDVYVFTAGLEAYGKPILDALDPKGNLFAGRFFRESCQQRKGMFLKDLNVVR 159
Query: 182 GQERGIVILDDTESV 196
G + VIL D V
Sbjct: 160 GGDLSRVILVDNNPV 174
>gi|443696004|gb|ELT96785.1| hypothetical protein CAPTEDRAFT_124156, partial [Capitella teleta]
Length = 209
Score = 43.5 bits (101), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 42/144 (29%), Positives = 68/144 (47%), Gaps = 14/144 (9%)
Query: 68 KLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQ-MANDKLVKLRPFVRTFLE 126
+ LVL+LD TL+HC SL+ L+ SF LFQ + V+ RP R FLE
Sbjct: 30 EFSLVLDLDETLVHC----SLNE----LEDAAFSF-PVLFQDVTYQVFVRTRPRFREFLE 80
Query: 127 QASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED---FNGKDRKNPDLVRGQ 183
+ + + ++ + T S + YA + LLD + K R+ RE NG K+ ++
Sbjct: 81 RVAKIFEVTVFTASKKVYANKLLNLLDPEKKLIRHRLF-REHCVCVNGNYIKDLHILGRD 139
Query: 184 ERGIVILDDTESVWSDHTENLIVL 207
+I+D++ + N I +
Sbjct: 140 LDKTIIIDNSPQAFGYQLTNGIPI 163
>gi|255547724|ref|XP_002514919.1| conserved hypothetical protein [Ricinus communis]
gi|223545970|gb|EEF47473.1| conserved hypothetical protein [Ricinus communis]
Length = 455
Score = 43.5 bits (101), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 30/93 (32%), Positives = 46/93 (49%), Gaps = 8/93 (8%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
LVL+LD TL+H S + +F + + V+ RPF++ F+E+ SS
Sbjct: 265 LVLDLDETLVH--------STLEPCGDADFTFPVNFNLQEHTVYVRCRPFLKDFMERVSS 316
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
L +I + T S YAE + +LD K F R+
Sbjct: 317 LFEIIIFTASQSIYAEQLLNVLDPKRKVFRHRV 349
>gi|145539396|ref|XP_001455388.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124423196|emb|CAK87991.1| unnamed protein product [Paramecium tetraurelia]
Length = 410
Score = 43.5 bits (101), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 28/90 (31%), Positives = 46/90 (51%), Gaps = 8/90 (8%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
+ L+LD TL+H SLS +K + GS ++ + +RP+ + FL++ S
Sbjct: 231 IFLDLDETLVHA----SLSKDNSQVKINQINDDGSETEIG----INIRPYTQYFLQELSQ 282
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFS 160
+Y+ T S++ YA A V LD +Y S
Sbjct: 283 FYTVYIYTASSQQYASAIVNYLDPKRQYIS 312
>gi|384502027|gb|EIE92518.1| hypothetical protein RO3G_17116 [Rhizopus delemar RA 99-880]
Length = 224
Score = 43.1 bits (100), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 52/187 (27%), Positives = 86/187 (45%), Gaps = 18/187 (9%)
Query: 62 SEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFV 121
+++ E K LVL+LD TL+H + K++S + + +I ++F + RP V
Sbjct: 49 AKEYEGKKCLVLDLDETLVHS-SFKTVSRPDFVVPVEIEGHNHNVFVLK-------RPGV 100
Query: 122 RTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVR 181
F+++ S L +I + T S YA+ + DL K R+ N + DL R
Sbjct: 101 DEFMKRMSELYEIVIFTASLSKYADPVLDNFDL-HKVIQHRLFREACCNYRGGFIKDLSR 159
Query: 182 -GQE-RGIVILDDTESVWSDHTENLIVLGKYVYFRDKELNGDHKSYSETLTDESENEEAL 239
G++ +VILD+T + +S H N I + + N H S L E+ +
Sbjct: 160 LGRDLNHVVILDNTPASYSLHPSNAIPISTW-------FNDQHDSELLDLIPFLEDLAKV 212
Query: 240 ANVLRVL 246
NV+ VL
Sbjct: 213 DNVVEVL 219
>gi|391338474|ref|XP_003743583.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
small phosphatase 1-like [Metaseiulus occidentalis]
Length = 314
Score = 43.1 bits (100), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 42/146 (28%), Positives = 74/146 (50%), Gaps = 13/146 (8%)
Query: 65 EERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTF 124
++ K+ LV++LD TL+H + K +S+ + + +I GS+ Q+ V RP+V F
Sbjct: 98 DQGKICLVIDLDETLVHS-SFKPVSNPDFVVPVEIE---GSVHQV----YVLKRPYVDEF 149
Query: 125 LEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED---FNGKDRKNPDLVR 181
LE+ SL + L T S YA+ LLD F R+ RE + G K+ + +
Sbjct: 150 LEKVGSLYECVLFTASLSKYADPVADLLD-KWGVFRGRLF-RESCAFYRGNYVKDLNRLG 207
Query: 182 GQERGIVILDDTESVWSDHTENLIVL 207
+VI+D++ + + H +N + +
Sbjct: 208 RDVHRVVIIDNSPASYMFHPDNAMPV 233
>gi|66803905|ref|XP_635771.1| CTD small phosphatase-like protein 2 [Dictyostelium discoideum AX4]
gi|74851880|sp|Q54GB2.1|CTSL2_DICDI RecName: Full=CTD small phosphatase-like protein 2;
Short=CTDSP-like 2
gi|60464148|gb|EAL62309.1| CTD small phosphatase-like protein 2 [Dictyostelium discoideum AX4]
Length = 567
Score = 43.1 bits (100), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 30/101 (29%), Positives = 46/101 (45%), Gaps = 10/101 (9%)
Query: 58 GLRYSEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKL-VK 116
L E K+ LVL+LD TL+HC S E +Q H F ++ K
Sbjct: 380 ALPPKEHSSPKISLVLDLDETLVHC-------STEPL--EQPHLTFPVFFNNTEYQVFAK 430
Query: 117 LRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSK 157
RPF FL + S + ++ + T S YA + ++D ++K
Sbjct: 431 KRPFFEEFLHKVSDIFEVIIFTASQEVYANKLLNMIDPNNK 471
>gi|330843764|ref|XP_003293816.1| hypothetical protein DICPUDRAFT_95899 [Dictyostelium purpureum]
gi|325075819|gb|EGC29663.1| hypothetical protein DICPUDRAFT_95899 [Dictyostelium purpureum]
Length = 342
Score = 43.1 bits (100), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 51/209 (24%), Positives = 93/209 (44%), Gaps = 36/209 (17%)
Query: 59 LRYSEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLR 118
L S RK L+L+LD TL+H +K +S + I S + + V R
Sbjct: 158 LNLSNSAPRK-TLILDLDETLVHST-MKPVSHHHLTVNVLIESSYCTFY-------VIKR 208
Query: 119 PFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE--DFNGKDRKN 176
P V F+++ S D+ + T S + YA+ + LD++ K F R+ + +G K+
Sbjct: 209 PHVDYFIQKVSQWYDVVIFTASMQQYADPLLDQLDVN-KVFKKRLFRDSCLEKDGNYIKD 267
Query: 177 PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRDKELNGDHKSYSETLTDESENE 236
++ +I+D++ +S++ EN + + ++ GD +S N+
Sbjct: 268 LSMINQDLTSTIIIDNSPIAYSNNLENALPIDNWM--------GDMES----------ND 309
Query: 237 EALANVLRVLKTIHRLFFDSVCGDVRTYL 265
+L N+L L+ I + DVR+ L
Sbjct: 310 TSLLNLLPFLEIIRNV------TDVRSIL 332
>gi|397787628|gb|AFO66533.1| putative NLI interacting factor family protein [Brassica napus]
Length = 477
Score = 43.1 bits (100), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 38/147 (25%), Positives = 69/147 (46%), Gaps = 14/147 (9%)
Query: 68 KLQLVLNLDHTLLHCRNIKSLSS-GEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLE 126
+ LVL+LD TL+H SL GE +H + + V+ RP ++ F+E
Sbjct: 113 PISLVLDLDETLVH----SSLEPCGEVDFTFTVH-----FNEEEHMVYVRCRPHLKEFME 163
Query: 127 QASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED---FNGKDRKNPDLVRGQ 183
+ S L ++ + T S YAE + +LD K F R+ R+ F+G K+ ++
Sbjct: 164 RVSRLFEVIIFTASQSIYAEQLLNVLDPKRKLFRHRVY-RDSCVFFDGNYLKDLSVLGRD 222
Query: 184 ERGIVILDDTESVWSDHTENLIVLGKY 210
++I+D++ + EN + + +
Sbjct: 223 LSRVIIVDNSPQAFGFQVENGVPIESW 249
>gi|119389575|pdb|2GHQ|A Chain A, Ctd-Specific Phosphatase Scp1 In Complex With Peptide C-
Terminal Domain Of Rna Polymerase Ii
gi|119389576|pdb|2GHQ|B Chain B, Ctd-Specific Phosphatase Scp1 In Complex With Peptide C-
Terminal Domain Of Rna Polymerase Ii
gi|119389579|pdb|2GHT|A Chain A, Ctd-Specific Phosphatase Scp1 In Complex With Peptide From
C-Terminal Domain Of Rna Polymerase Ii
gi|119389580|pdb|2GHT|B Chain B, Ctd-Specific Phosphatase Scp1 In Complex With Peptide From
C-Terminal Domain Of Rna Polymerase Ii
Length = 181
Score = 43.1 bits (100), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 37/149 (24%), Positives = 72/149 (48%), Gaps = 11/149 (7%)
Query: 64 QEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRT 123
Q+ K+ +V+NLD TL+H + K +++ + + +I + ++ + RP V
Sbjct: 11 QDSDKICVVINLDETLVHS-SFKPVNNADFIIPVEIDGVVHQVYVLK-------RPHVDE 62
Query: 124 FLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVR-G 182
FL++ L + L T S YA+ LLD F +R+ + DL R G
Sbjct: 63 FLQRMGELFECVLFTASLAKYADPVADLLD-KWGAFRARLFRESCVFHRGNYVKDLSRLG 121
Query: 183 QE-RGIVILDDTESVWSDHTENLIVLGKY 210
++ R ++ILD++ + + H +N + + +
Sbjct: 122 RDLRRVLILDNSPASYVFHPDNAVPVASW 150
>gi|344300484|gb|EGW30805.1| hypothetical protein SPAPADRAFT_142199 [Spathaspora passalidarum
NRRL Y-27907]
Length = 335
Score = 43.1 bits (100), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 53/230 (23%), Positives = 97/230 (42%), Gaps = 58/230 (25%)
Query: 60 RYSEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIG--SLFQMANDKLVKL 117
R E+ RK L+L+LD TL+H SLS G HS + +L +++ V
Sbjct: 123 RNPERRRRKKILILDLDETLIH-----SLSKGSPRSFTSSHSKMIEITLNNISSLYYVHK 177
Query: 118 RPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLD--------SKY-------FSSR 162
RP+ FL++ S ++ + T S + YA+ + L+ D KY FS +
Sbjct: 178 RPYCDYFLQEISKWFELQIFTASVKEYADPIINWLESDLIDSRKQKHKYTSAEDMPFSPK 237
Query: 163 IIAREDFNGKDRKNPDL---------VRGQE-RGIVILDDTESVWSDHTENLIVLGKYVY 212
+ + + P + ++ +E + ++ILD++ +S H +N + + +V
Sbjct: 238 VFTKRYYRNDCTYRPGVGYIKDLSKFIKDEELKNVLILDNSPISYSLHEQNAVTIEGWV- 296
Query: 213 FRDKELNGDHKSYSETLTDESENEEALANVLRVLKTIHRLFFDSVCGDVR 262
N++ ++L +L +H L S+C DVR
Sbjct: 297 ----------------------NDQTDRDLLNLLPMLHSL---SLCIDVR 321
>gi|194752999|ref|XP_001958806.1| GF12569 [Drosophila ananassae]
gi|190620104|gb|EDV35628.1| GF12569 [Drosophila ananassae]
Length = 282
Score = 43.1 bits (100), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 45/180 (25%), Positives = 72/180 (40%), Gaps = 37/180 (20%)
Query: 62 SEQEERKLQ------LVLNLDHTLLH-------------CRNIKSLSSGEKYLKKQIHSF 102
S + +R+L+ LVL+LD TL+H C + + + L +
Sbjct: 83 SPESQRRLRQVGRKTLVLDLDETLVHSCYSDPETNELVGCSLVPQTAKPDYELSVTLEGL 142
Query: 103 IGSLFQMANDKLVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLD-----LDSK 157
FQ V RP V FL+ AS D+ + T S YA V LD + +
Sbjct: 143 DPIAFQ------VYKRPHVDVFLKFASKWYDLVIFTASLEVYAAQVVDRLDNGRGMIQKR 196
Query: 158 YFSSRIIAREDFNGKDRK--NPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRD 215
Y+ + KD NPD+ G I+D++ + + D +N I + ++Y D
Sbjct: 197 YYRQHCSSTTSMISKDLTVVNPDM-----SGTFIIDNSPNAYRDFPDNAIPIKTFIYDPD 251
>gi|320168222|gb|EFW45121.1| NLI interacting factor family protein [Capsaspora owczarzaki ATCC
30864]
Length = 380
Score = 42.7 bits (99), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 41/149 (27%), Positives = 63/149 (42%), Gaps = 10/149 (6%)
Query: 62 SEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFV 121
+ ++ LVL+LD TL+H G + QI I L + V RP+V
Sbjct: 202 PQPHVKRKTLVLDLDETLIHS---TLEPGGPRVHDMQIDVHIEKLVYVF---YVYKRPYV 255
Query: 122 RTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDF---NGKDRKNPD 178
FL+Q S D+ + T S Y + LDL F R+ RE NG K+
Sbjct: 256 DLFLKQTSHWYDLVIFTASLHQYGHPVIDSLDLGRGLFRHRLF-RESCVQENGNFMKDLT 314
Query: 179 LVRGQERGIVILDDTESVWSDHTENLIVL 207
LV + ++D++ ++ EN I +
Sbjct: 315 LVEPDLARVCLIDNSPGAYAIQPENGIPI 343
>gi|123404051|ref|XP_001302356.1| NLI interacting factor-like phosphatase family protein [Trichomonas
vaginalis G3]
gi|121883637|gb|EAX89426.1| NLI interacting factor-like phosphatase family protein [Trichomonas
vaginalis G3]
Length = 205
Score = 42.7 bits (99), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 29/82 (35%), Positives = 41/82 (50%), Gaps = 12/82 (14%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
LVL+LD TL+H HS + SL + V LRP VR FL++ S
Sbjct: 32 LVLDLDETLIHTSTFPP------------HSDVESLKFDDSPDYVFLRPNVRIFLDKVSE 79
Query: 131 LVDIYLCTMSTRCYAEAAVKLL 152
L ++++ T T+ YAE + LL
Sbjct: 80 LFEVFIFTAGTQNYAERILDLL 101
>gi|324518550|gb|ADY47137.1| CTD small phosphatase-like protein 2 [Ascaris suum]
Length = 248
Score = 42.7 bits (99), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 33/94 (35%), Positives = 50/94 (53%), Gaps = 10/94 (10%)
Query: 71 LVLNLDHTLLHCRNIKSLSS-GEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQAS 129
LVL+LD TL+HC SL+ + L +H F + +Q+ V++RP + FLE+ S
Sbjct: 66 LVLDLDETLVHC----SLTELPDASLTFPVH-FQDNTYQVY----VRVRPHLHEFLERLS 116
Query: 130 SLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
+I L T S R YA+ + LLD + R+
Sbjct: 117 QSFEIILFTASKRVYADKLLNLLDPGKRLIRHRL 150
>gi|440798568|gb|ELR19635.1| cterminal domain small phosphatase, putative [Acanthamoeba
castellanii str. Neff]
Length = 262
Score = 42.7 bits (99), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 40/177 (22%), Positives = 85/177 (48%), Gaps = 13/177 (7%)
Query: 36 RCIFCSQAMNDSFGLSFDYMLRGLRYSEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYL 95
R + N S LS DY+L L ++ K LVL+LD TL+H + K +++ + +
Sbjct: 63 RMTKVGASSNTSPHLSRDYLLPPLL--AEDSGKKTLVLDLDETLVHS-SFKPINNADFII 119
Query: 96 KKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLD 155
++ + ++ + RP V TF+++ + ++ + T S YA+ + LLD+
Sbjct: 120 PVEVEDQMHQVYVLK-------RPGVDTFMKRVGEIFEVVVFTASLAKYADPVLDLLDI- 171
Query: 156 SKYFSSRIIAREDFNGKDRKNPDLVR-GQE-RGIVILDDTESVWSDHTENLIVLGKY 210
+ +R+ K DL + G+E + ++I+D++ + + H + + + +
Sbjct: 172 HRVTRTRLFRESCVQHKGNFVKDLSKLGREMKNVIIIDNSPASYLFHPHHAVPIDSW 228
>gi|396461911|ref|XP_003835567.1| hypothetical protein LEMA_P049080.1 [Leptosphaeria maculans JN3]
gi|312212118|emb|CBX92202.1| hypothetical protein LEMA_P049080.1 [Leptosphaeria maculans JN3]
Length = 536
Score = 42.7 bits (99), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 37/153 (24%), Positives = 73/153 (47%), Gaps = 17/153 (11%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKL-----VKLRPFVRTFL 125
L+L+LD TL+H S ++ ++ + +G+ Q+ ++ V RP+ FL
Sbjct: 351 LILDLDETLIHSVVNNSRFQTGHMVEVKLQAAVGAGGQIIGPQVPLLYYVHKRPYCDDFL 410
Query: 126 EQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI-----IAREDFNGKD--RKNPD 178
++ S ++ + T S + YA+ + L+++ KYF R R KD + PD
Sbjct: 411 KKVSKWYNLVIFTASVQEYADPVIDWLEVERKYFVGRYYRQHCTLRNGAYIKDLAQIEPD 470
Query: 179 LVRGQERGIVILDDTESVWSDHTENLIVLGKYV 211
L + ++ILD++ + H +N I + ++
Sbjct: 471 LSK-----VMILDNSPLSYVFHPDNAIPIEGWI 498
>gi|426378923|ref|XP_004056157.1| PREDICTED: CTD small phosphatase-like protein 2 [Gorilla gorilla
gorilla]
Length = 398
Score = 42.7 bits (99), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 30/89 (33%), Positives = 45/89 (50%), Gaps = 8/89 (8%)
Query: 75 LDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASSLVDI 134
LD TL+HC SL+ E F ++Q+ V+LRPF R FLE+ S + +I
Sbjct: 258 LDETLVHC----SLNELEDAALTFPVLFQDVIYQV----YVRLRPFFREFLERMSQMYEI 309
Query: 135 YLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
L T S + YA+ + +LD + R+
Sbjct: 310 ILFTASKKVYADKLLNILDPKKQLVRHRL 338
>gi|145532723|ref|XP_001452117.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124419794|emb|CAK84720.1| unnamed protein product [Paramecium tetraurelia]
Length = 428
Score = 42.7 bits (99), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 35/150 (23%), Positives = 71/150 (47%), Gaps = 12/150 (8%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
++ ++D TL+HC + + K I G + + + +R F R +++ S
Sbjct: 240 IIFDMDETLIHCN---EDENDKCQFKIDIQFEDGEIIEAG----INIRNFAREIIQKLSD 292
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDR---KNPDLVRGQERGI 187
L ++ + T S YA + +LD ++K S RI + D K+ ++ + +
Sbjct: 293 LCEVMIFTASQDVYANKVINILDPNNK-LSYRIFRESCISVGDNNLIKHLGVLNRDLKNV 351
Query: 188 VILDDTESVWSDHTENLIVLGKYVYFRDKE 217
V++D++ ++ H EN I + Y Y+ DK+
Sbjct: 352 VLIDNSSYSFAHHLENGIPILPY-YYDDKD 380
>gi|145490634|ref|XP_001431317.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124398421|emb|CAK63919.1| unnamed protein product [Paramecium tetraurelia]
Length = 473
Score = 42.7 bits (99), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 29/95 (30%), Positives = 50/95 (52%), Gaps = 11/95 (11%)
Query: 60 RYSEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQM-ANDKLVKLR 118
+ + Q R+ LV++LD TL+HC K + K L+KQ LF+ +N + +R
Sbjct: 273 KINPQINRQKTLVIDLDETLVHCNESKLMP---KDLQKQ-------LFEAYSNQAEISVR 322
Query: 119 PFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLD 153
P+ + FL++ + +I + T S YA ++ LD
Sbjct: 323 PYAQQFLQKMAKHFEIMIYTASNEDYANQIIEYLD 357
>gi|432103407|gb|ELK30512.1| Carboxy-terminal domain RNA polymerase II polypeptide A small
phosphatase 1, partial [Myotis davidii]
Length = 239
Score = 42.7 bits (99), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 36/149 (24%), Positives = 73/149 (48%), Gaps = 11/149 (7%)
Query: 64 QEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRT 123
Q+ K+ +V++LD TL+H + K +++ + + +I + ++ + RP+V
Sbjct: 64 QDSDKICVVIDLDETLVHS-SFKPVNNADFIIPVEIDGVVHQVYVLK-------RPYVDE 115
Query: 124 FLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVR-G 182
FL++ L + L T S YA+ LLD F +R+ + DL R G
Sbjct: 116 FLQRMGELFECVLFTASLAKYADPVADLLD-KWGAFRARLFRESCVFHRGNYVKDLSRLG 174
Query: 183 QE-RGIVILDDTESVWSDHTENLIVLGKY 210
++ R ++ILD++ + + H +N + + +
Sbjct: 175 RDLRRVLILDNSPASYVFHPDNAVPVASW 203
>gi|346470919|gb|AEO35304.1| hypothetical protein [Amblyomma maculatum]
Length = 288
Score = 42.7 bits (99), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 39/162 (24%), Positives = 81/162 (50%), Gaps = 19/162 (11%)
Query: 54 YMLRGLRYSEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDK 113
++L +R+ Q+ K+ L+++LD TL+H + K +S+ + + +I + ++ +
Sbjct: 71 FLLPPVRH--QDMHKICLIIDLDETLVHS-SFKPISNADFVVPVEIDGTVHQVYVLK--- 124
Query: 114 LVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKY--FSSRIIARED--- 168
RP+V FL++ D Y C + T A+ A + DL K+ F +R+ RE
Sbjct: 125 ----RPYVDEFLQRVG---DAYECVLFTASLAKYADPVADLLDKWGVFRARLF-RESCVF 176
Query: 169 FNGKDRKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKY 210
+ G K+ + +VI+D++ + + H +N + +G +
Sbjct: 177 YRGNYVKDLGRLGRDLHRVVIIDNSPASYIFHPDNAVPVGSW 218
>gi|328767138|gb|EGF77189.1| hypothetical protein BATDEDRAFT_14325 [Batrachochytrium
dendrobatidis JAM81]
Length = 182
Score = 42.7 bits (99), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 56/204 (27%), Positives = 83/204 (40%), Gaps = 40/204 (19%)
Query: 66 ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKL--VKLRPFVRT 123
+RK LVL+LD TL+H S S G + H FI + ++ L V RP V
Sbjct: 11 QRKKTLVLDLDETLIH-----STSRGSRR-----HDFIVEVLVNSHICLYHVYKRPHVDL 60
Query: 124 FLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE--DFNGKDRKNPDLVR 181
FL +A+ I + T S YA+ + LD S R F G KN ++V
Sbjct: 61 FLRKATEWFKIVIFTASMPEYADPVIDWLDSTRTIVSKRYFRESCTSFFGTLTKNLEVVE 120
Query: 182 GQERGIVILDDTESVWSDHTENLIVLGKYVYFRDKELNGDHKSYSETLTDESENEEALAN 241
+ ++D+ + +LN D+ ET TD+ N+EAL +
Sbjct: 121 SDLSQVCLIDNAPLSY-------------------KLNPDNGIPIETWTDDP-NDEALLD 160
Query: 242 VLRVLKTIHRLFFDSVCGDVRTYL 265
+L L + DVR+ L
Sbjct: 161 LLPFLDALR------FADDVRSVL 178
>gi|312072812|ref|XP_003139236.1| SCP small domain phosphatase [Loa loa]
Length = 321
Score = 42.7 bits (99), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 33/97 (34%), Positives = 52/97 (53%), Gaps = 10/97 (10%)
Query: 68 KLQLVLNLDHTLLHCRNIKSLSS-GEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLE 126
+ LVL+LD TL+HC SL+ + L +H F + +Q+ V++RP ++ FLE
Sbjct: 136 EFSLVLDLDETLVHC----SLTELPDASLTFPVH-FQENTYQV----YVRVRPHLQEFLE 186
Query: 127 QASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
+ S +I L T S R YA+ + LLD + R+
Sbjct: 187 RLSRSFEIILFTASKRIYADKLLNLLDPGKRLIRHRL 223
>gi|452822754|gb|EME29770.1| phosphatase isoform 1 [Galdieria sulphuraria]
Length = 351
Score = 42.7 bits (99), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 44/146 (30%), Positives = 69/146 (47%), Gaps = 19/146 (13%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
LVL+LD TL+H ++ S + L+ + + S+F V RP++ FL S
Sbjct: 173 LVLDLDETLVHSTTRQN-SHFDIRLEVSVDN-CPSIFY------VNKRPYLDVFLRVVSQ 224
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDS----KYFSSRIIAREDFNGKDRK--NPDLVRGQE 184
D+ + T S + YA+ + LD+ +YF I + KD PDL
Sbjct: 225 WYDLVVYTASLQKYADPLIDALDVHGVIRERYFRDHCIQVGNNFVKDISIIEPDL----- 279
Query: 185 RGIVILDDTESVWSDHTENLIVLGKY 210
R IVI+D++ S + H EN I +G +
Sbjct: 280 RKIVIVDNSPSAYVLHEENAIPIGTW 305
>gi|452822755|gb|EME29771.1| phosphatase isoform 2 [Galdieria sulphuraria]
Length = 356
Score = 42.7 bits (99), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 44/146 (30%), Positives = 69/146 (47%), Gaps = 19/146 (13%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
LVL+LD TL+H ++ S + L+ + + S+F V RP++ FL S
Sbjct: 173 LVLDLDETLVHSTTRQN-SHFDIRLEVSVDN-CPSIFY------VNKRPYLDVFLRVVSQ 224
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDS----KYFSSRIIAREDFNGKDRK--NPDLVRGQE 184
D+ + T S + YA+ + LD+ +YF I + KD PDL
Sbjct: 225 WYDLVVYTASLQKYADPLIDALDVHGVIRERYFRDHCIQVGNNFVKDISIIEPDL----- 279
Query: 185 RGIVILDDTESVWSDHTENLIVLGKY 210
R IVI+D++ S + H EN I +G +
Sbjct: 280 RKIVIVDNSPSAYVLHEENAIPIGTW 305
>gi|397787605|gb|AFO66511.1| putative small phosphatase-like protein 2-B [Brassica napus]
Length = 262
Score = 42.7 bits (99), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 38/140 (27%), Positives = 66/140 (47%), Gaps = 14/140 (10%)
Query: 68 KLQLVLNLDHTLLHCRNIKSLSS-GEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLE 126
+ LVL+LD TL+H SL GE +H + + V+ RP ++ F+E
Sbjct: 68 PISLVLDLDETLVH----SSLEPCGEVDFTFTVH-----FNEEEHMVYVRCRPHLKEFME 118
Query: 127 QASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED---FNGKDRKNPDLVRGQ 183
+ S L ++ + T S YAE + +LD K F R+ R+ F+G K+ ++
Sbjct: 119 RVSRLFEVIIFTASQSIYAEQLLNVLDPKRKLFRHRVY-RDSCVFFDGNYLKDLSVLGRD 177
Query: 184 ERGIVILDDTESVWSDHTEN 203
++I+D++ + EN
Sbjct: 178 LSRVIIVDNSPQAFGFQVEN 197
>gi|170587764|ref|XP_001898644.1| NLI interacting factor-like phosphatase family protein [Brugia
malayi]
gi|158593914|gb|EDP32508.1| NLI interacting factor-like phosphatase family protein [Brugia
malayi]
Length = 314
Score = 42.4 bits (98), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 32/96 (33%), Positives = 51/96 (53%), Gaps = 8/96 (8%)
Query: 68 KLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQ 127
+ LVL+LD TL+HC ++ L L +H F + +Q+ V++RP ++ FLE+
Sbjct: 129 EFSLVLDLDETLVHC-SLTELPDAS--LTFPVH-FQENTYQVY----VRVRPHLQEFLER 180
Query: 128 ASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
S +I L T S R YA+ + LLD + R+
Sbjct: 181 LSRSFEIILFTASKRVYADKLLNLLDPGKRLIRHRL 216
>gi|393909936|gb|EFO24836.2| SCP small domain phosphatase [Loa loa]
Length = 321
Score = 42.4 bits (98), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 32/96 (33%), Positives = 51/96 (53%), Gaps = 8/96 (8%)
Query: 68 KLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQ 127
+ LVL+LD TL+HC ++ L L +H F + +Q+ V++RP ++ FLE+
Sbjct: 136 EFSLVLDLDETLVHC-SLTELPDAS--LTFPVH-FQENTYQV----YVRVRPHLQEFLER 187
Query: 128 ASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
S +I L T S R YA+ + LLD + R+
Sbjct: 188 LSRSFEIILFTASKRIYADKLLNLLDPGKRLIRHRL 223
>gi|320169548|gb|EFW46447.1| CTD small phosphatase [Capsaspora owczarzaki ATCC 30864]
Length = 257
Score = 42.4 bits (98), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 53/215 (24%), Positives = 97/215 (45%), Gaps = 23/215 (10%)
Query: 4 YSCKECVGKTKFVIKRKCEQSLSCAHTTVRDSRCIFCSQAMNDSFGLSFDYMLRGLRYSE 63
Y + + F R +++ S H T R +++ +D G + +L LR +
Sbjct: 32 YPARRGIWSLLFCCGRGTQEAESPEHVTDRT-----VTESQSDYNG---EPLLGPLR-KD 82
Query: 64 QEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRT 123
+ RK LVL+LD TL+H + + + + + + +I + ++ + RP+V
Sbjct: 83 DKGRKC-LVLDLDETLVHS-SFRPIPNPDYIIPVEIEGIVHQVYVLK-------RPYVDE 133
Query: 124 FLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVR-G 182
FL++ L + L T S YA+ LLD D + SR+ + DL R G
Sbjct: 134 FLKRVGQLFECVLFTASLAKYADPVSDLLDKD-RVLRSRLFRESCVQHRGNYVKDLSRLG 192
Query: 183 QERG-IVILDDTESVWSDHTENLIVLGKYVYFRDK 216
+E VI+D++ + ++ H + I + +F DK
Sbjct: 193 RELSQTVIIDNSPASYAFHPDYAIPI--VTWFDDK 225
>gi|213404738|ref|XP_002173141.1| nuclear envelope morphology protein [Schizosaccharomyces japonicus
yFS275]
gi|212001188|gb|EEB06848.1| nuclear envelope morphology protein [Schizosaccharomyces japonicus
yFS275]
Length = 449
Score = 42.4 bits (98), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 40/144 (27%), Positives = 65/144 (45%), Gaps = 10/144 (6%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
LVL+LD TL+H S +S ++ I L+ + RP + FL + S
Sbjct: 280 LVLDLDETLIHSVTRGSRTSSGHPVEVHIPGQHPILY------FIHKRPHLDKFLAKVSQ 333
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDR---KNPDLVRGQERGI 187
+ L T S + YA+ V L+ D K F +R R+ N D K+ + R I
Sbjct: 334 WYRLVLFTASVQAYADPIVDYLERDHKLFDARYY-RQHCNLVDSTYVKDISICRTHLSRI 392
Query: 188 VILDDTESVWSDHTENLIVLGKYV 211
+I+D++ + H EN I + ++
Sbjct: 393 MIIDNSPFSYKMHQENAIPIEGWI 416
>gi|302806322|ref|XP_002984911.1| hypothetical protein SELMODRAFT_121282 [Selaginella moellendorffii]
gi|300147497|gb|EFJ14161.1| hypothetical protein SELMODRAFT_121282 [Selaginella moellendorffii]
Length = 198
Score = 42.4 bits (98), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 44/148 (29%), Positives = 71/148 (47%), Gaps = 18/148 (12%)
Query: 66 ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFL 125
E K LVL++D TL+H KS +S + F G + + LV RP V TFL
Sbjct: 24 EEKPTLVLDMDETLIHAH--KSTAS--------LKLFSGKILPLQR-YLVAKRPGVDTFL 72
Query: 126 EQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRII----AREDFNGKDRKNPDLVR 181
+ S + +I + T + + YA+ + LD F+ R+ + ++ G+ + DL R
Sbjct: 73 NEMSQIYEIVVFTRAVKPYADRILDRLDPAGNLFTHRLYRDSCSPKEVGGR-KVVKDLSR 131
Query: 182 -GQE-RGIVILDDTESVWSDHTENLIVL 207
G++ R VI+DD + N IV+
Sbjct: 132 LGRDLRHTVIVDDKPESFCLQPSNGIVI 159
>gi|71026803|ref|XP_763045.1| nuclear LIM interactor-interacting factor 1 [Theileria parva strain
Muguga]
gi|68349998|gb|EAN30762.1| nuclear LIM interactor-interacting factor 1, putative [Theileria
parva]
Length = 254
Score = 42.4 bits (98), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 35/146 (23%), Positives = 67/146 (45%), Gaps = 18/146 (12%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKL--RPFVRTF 124
RK LVL+LD TL+H + +SF L Q ++ + + RP++ F
Sbjct: 69 RKKMLVLDLDETLIHSS-----------FEPSNNSFPMQLMQNGVERTIYIGKRPYLSEF 117
Query: 125 LEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED---FNGKDRKNPDLVR 181
L S+ +I + T + YA+ + +D D R + R+ +NG K+ +++
Sbjct: 118 LSVVSNFYEIVIFTAGLKSYADPVIDFIDPDG--VCKRRLFRDSCKYWNGYYIKDLEILN 175
Query: 182 GQERGIVILDDTESVWSDHTENLIVL 207
+ +V +D++ + + EN I +
Sbjct: 176 KPLKDVVTIDNSPCCYCLNPENAIPI 201
>gi|410908573|ref|XP_003967765.1| PREDICTED: CTD small phosphatase-like protein 2-A-like [Takifugu
rubripes]
Length = 474
Score = 42.4 bits (98), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 31/83 (37%), Positives = 44/83 (53%), Gaps = 8/83 (9%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
LVL+LD TL+HC SL+ E F ++Q+ V+LRPF R FLE+
Sbjct: 298 LVLDLDETLVHC----SLNELEDAALTFPVLFQDVIYQV----YVRLRPFFREFLERMCQ 349
Query: 131 LVDIYLCTMSTRCYAEAAVKLLD 153
+I L T S + YA+ + +LD
Sbjct: 350 KYEIILFTASKKVYADKLLNILD 372
>gi|323353885|gb|EGA85738.1| Psr1p [Saccharomyces cerevisiae VL3]
Length = 342
Score = 42.4 bits (98), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 41/143 (28%), Positives = 70/143 (48%), Gaps = 13/143 (9%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
L+L+LD TL+H + K L S + L +I Q+ N ++K RP V FLE+
Sbjct: 175 LILDLDETLVHS-SFKYLRSADFVLPVEIDD------QVHNVYVIK-RPGVEEFLERVGK 226
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE---DFNGKDRKNPDLVRGQERGI 187
L ++ + T S Y + + +LD D K R+ RE ++ G KN + I
Sbjct: 227 LFEVVVFTASVSRYGDPLLDILDTD-KVIHHRLF-REACYNYEGNYIKNLSQIGRPLSDI 284
Query: 188 VILDDTESVWSDHTENLIVLGKY 210
+ILD++ + + H ++ I + +
Sbjct: 285 IILDNSPASYIFHPQHAIPISSW 307
>gi|348539980|ref|XP_003457466.1| PREDICTED: CTD small phosphatase-like protein-like isoform 1
[Oreochromis niloticus]
Length = 276
Score = 42.4 bits (98), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 49/192 (25%), Positives = 90/192 (46%), Gaps = 15/192 (7%)
Query: 54 YMLRGLRYSEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDK 113
Y+L ++ S+ ++ + V++LD TL+H + K +S+ + + +I + ++ +
Sbjct: 94 YLLPEMKISDYGKKCV--VIDLDETLVH-SSFKPISNADFIVPVEIDGTVHQVYVLK--- 147
Query: 114 LVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKD 173
RP V FL++ L + L T S YA+ LLD F +R+ +
Sbjct: 148 ----RPHVDEFLQKMGELFECVLFTASLAKYADPVADLLD-QWGVFRARLFRESCVFHRG 202
Query: 174 RKNPDLVR-GQE-RGIVILDDTESVWSDHTENLIVLGKYV-YFRDKELNGDHKSYSETLT 230
DL R G+E R ++I+D++ + + H EN + + + D EL D + E L+
Sbjct: 203 NYVKDLSRLGRELRNVIIVDNSPASYIFHPENAVPVQSWFDDMNDTEL-LDLLPFFEGLS 261
Query: 231 DESENEEALANV 242
E E L N+
Sbjct: 262 KEEEVYGVLQNL 273
>gi|68075063|ref|XP_679448.1| nif-like protein [Plasmodium berghei strain ANKA]
gi|56500195|emb|CAI00043.1| nif-like protein, putative [Plasmodium berghei]
Length = 289
Score = 42.4 bits (98), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 35/150 (23%), Positives = 71/150 (47%), Gaps = 22/150 (14%)
Query: 69 LQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKL----RPFVRTF 124
+ LVL+LD TL++C KK+ + + + + N K + L RP++ F
Sbjct: 19 MTLVLDLDETLIYC------------TKKKKYDYQKEIDVLINGKYLSLYVCKRPYIDLF 66
Query: 125 LEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED---FNGKDR-KNPDLV 180
+I + T S + YA+ + ++D+D ++ + RED NGK KN +
Sbjct: 67 FSVLYPYYEIIIFTTSIKSYADTVLNIMDVD--HYIDKKFYREDCFEMNGKVYIKNLVNI 124
Query: 181 RGQERGIVILDDTESVWSDHTENLIVLGKY 210
+ + ++++DD+ + + +N + K+
Sbjct: 125 KKEISKMILIDDSNASGFKYPDNFFHIKKW 154
>gi|83286618|ref|XP_730240.1| NLI interacting factor [Plasmodium yoelii yoelii 17XNL]
gi|23489907|gb|EAA21805.1| NLI interacting factor, putative [Plasmodium yoelii yoelii]
Length = 328
Score = 42.4 bits (98), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 35/150 (23%), Positives = 71/150 (47%), Gaps = 22/150 (14%)
Query: 69 LQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKL----RPFVRTF 124
+ LVL+LD TL++C KK+ + + + + N K + L RP++ F
Sbjct: 58 MTLVLDLDETLIYCT------------KKKKYDYQKEIDVLINGKYLSLYVCKRPYIDLF 105
Query: 125 LEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED---FNGKDR-KNPDLV 180
+I + T S + YA+ + ++D+D ++ + RED NGK KN +
Sbjct: 106 FSVLYPYYEIIIFTTSIKSYADTVLNIMDVD--HYIDKKFYREDCFEMNGKVYIKNLVNI 163
Query: 181 RGQERGIVILDDTESVWSDHTENLIVLGKY 210
+ + ++++DD+ + + +N + K+
Sbjct: 164 KKEISKMILIDDSNTSGFKYPDNFFHIKKW 193
>gi|308485158|ref|XP_003104778.1| CRE-SCPL-3 protein [Caenorhabditis remanei]
gi|308257476|gb|EFP01429.1| CRE-SCPL-3 protein [Caenorhabditis remanei]
Length = 292
Score = 42.4 bits (98), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 26/93 (27%), Positives = 43/93 (46%), Gaps = 8/93 (8%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
LVL+LD TL+HC ++ L + ++ V+LRP +RTFL + S
Sbjct: 67 LVLDLDETLVHC-SLTPLDNATMIFPVMFQDITYQVY-------VRLRPHLRTFLRRMSK 118
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
+ +I + T S + YA ++D R+
Sbjct: 119 IFEIIIFTASKKVYANKLCDIIDPQKTMIRHRL 151
>gi|145538816|ref|XP_001455108.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124422896|emb|CAK87711.1| unnamed protein product [Paramecium tetraurelia]
Length = 282
Score = 42.4 bits (98), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 27/92 (29%), Positives = 49/92 (53%), Gaps = 6/92 (6%)
Query: 62 SEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFV 121
+Q +K LVL+LD TL+HC ++ + + L + IH G L+ + +K RP++
Sbjct: 28 PKQYSQKKVLVLDLDETLVHCEFKENENFQHEVLLEVIHK--GQLYTV----YLKARPYL 81
Query: 122 RTFLEQASSLVDIYLCTMSTRCYAEAAVKLLD 153
FL++AS +I++ T Y + + +D
Sbjct: 82 NQFLQEASKDYEIFIFTAGYEAYCQEVLSFID 113
>gi|348539982|ref|XP_003457467.1| PREDICTED: CTD small phosphatase-like protein-like isoform 2
[Oreochromis niloticus]
Length = 265
Score = 42.4 bits (98), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 49/192 (25%), Positives = 90/192 (46%), Gaps = 15/192 (7%)
Query: 54 YMLRGLRYSEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDK 113
Y+L ++ S+ ++ +V++LD TL+H + K +S+ + + +I + ++ +
Sbjct: 83 YLLPEMKISDYGKK--CVVIDLDETLVHS-SFKPISNADFIVPVEIDGTVHQVYVLK--- 136
Query: 114 LVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKD 173
RP V FL++ L + L T S YA+ LLD F +R+ +
Sbjct: 137 ----RPHVDEFLQKMGELFECVLFTASLAKYADPVADLLD-QWGVFRARLFRESCVFHRG 191
Query: 174 RKNPDLVR-GQE-RGIVILDDTESVWSDHTENLIVLGKYV-YFRDKELNGDHKSYSETLT 230
DL R G+E R ++I+D++ + + H EN + + + D EL D + E L+
Sbjct: 192 NYVKDLSRLGRELRNVIIVDNSPASYIFHPENAVPVQSWFDDMNDTEL-LDLLPFFEGLS 250
Query: 231 DESENEEALANV 242
E E L N+
Sbjct: 251 KEEEVYGVLQNL 262
>gi|357463015|ref|XP_003601789.1| CTD small phosphatase-like protein [Medicago truncatula]
gi|355490837|gb|AES72040.1| CTD small phosphatase-like protein [Medicago truncatula]
Length = 885
Score = 42.4 bits (98), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 32/95 (33%), Positives = 46/95 (48%), Gaps = 8/95 (8%)
Query: 69 LQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQA 128
+ LVL+LD TL+H SL E +F + + V+ RP ++ FLE+
Sbjct: 694 ITLVLDLDETLVHS----SLKPSEDVDFTFTVNFKSEEYIV----YVRCRPHLKEFLERV 745
Query: 129 SSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
S L +I + T S YAE + LLD K F R+
Sbjct: 746 SGLFEIIIFTASQSIYAEQLLNLLDPKRKIFRHRV 780
Score = 40.8 bits (94), Expect = 0.77, Method: Compositional matrix adjust.
Identities = 31/95 (32%), Positives = 46/95 (48%), Gaps = 8/95 (8%)
Query: 69 LQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQA 128
+ LVL+LD TL+H SL E +F + + V+ RP ++ FLE+
Sbjct: 279 ITLVLDLDETLVHS----SLEPCEDV----DFTFTVNFNSEEHIVYVRCRPHLKEFLERV 330
Query: 129 SSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
S L +I + T S YAE + +LD K F R+
Sbjct: 331 SGLFEIIIFTASQSIYAEQLLNVLDPKRKIFRHRV 365
>gi|195027101|ref|XP_001986422.1| GH21358 [Drosophila grimshawi]
gi|193902422|gb|EDW01289.1| GH21358 [Drosophila grimshawi]
Length = 294
Score = 42.0 bits (97), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 40/164 (24%), Positives = 71/164 (43%), Gaps = 21/164 (12%)
Query: 71 LVLNLDHTLLHC----RNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKL----RPFVR 122
LVL+LD TL+H +L + I ++ ++ +A+ + ++ RP+V
Sbjct: 107 LVLDLDETLVHSCYFDPETNNLIGCNLMPETAIPDYVINIPIVADIQPIEFQIFKRPYVD 166
Query: 123 TFLEQASSLVDIYLCTMSTRCYAEAAVKLLD-----LDSKYFSSRIIAREDFNGKD--RK 175
FL ++ + T S YA V LD +++ ++ F K+
Sbjct: 167 EFLSFVGRWYEVVIFTASMEAYASIVVDKLDDGRGIFQRRFYRQHCVSTSSFVSKNLFGV 226
Query: 176 NPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVY-FRDKEL 218
N DL + I+D++ S + D EN I + Y+Y D+EL
Sbjct: 227 NKDLA-----SVFIIDNSPSAYRDFPENAIPIKSYIYDLNDQEL 265
>gi|340505145|gb|EGR31502.1| NLI interacting factor-like phosphatase family protein, putative
[Ichthyophthirius multifiliis]
Length = 199
Score = 42.0 bits (97), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 42/141 (29%), Positives = 68/141 (48%), Gaps = 13/141 (9%)
Query: 68 KLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQ 127
K LVL+LD TL+H + +S Q+ F+ + + VK RP FLE+
Sbjct: 43 KKTLVLDLDETLVHSSFVYMQNSD-----FQLEIFVQDIRFIV---YVKKRPGCELFLEE 94
Query: 128 ASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED---FNGKDRKNPDLVRGQE 184
S +I + T S YA + L +D K +S + RE+ +NG K+ + Q
Sbjct: 95 LSKYYEIIIFTASLSEYANPVIDL--IDKKKVTSIRLFRENCTLYNGFFVKDLSKLERQL 152
Query: 185 RGIVILDDTESVWSDHTENLI 205
+ I+I+D++E+ + EN I
Sbjct: 153 KDIIIIDNSENSFLFQPENAI 173
>gi|237832281|ref|XP_002365438.1| NLI interacting factor-like phosphatase domain-containing protein
[Toxoplasma gondii ME49]
gi|211963102|gb|EEA98297.1| NLI interacting factor-like phosphatase domain-containing protein
[Toxoplasma gondii ME49]
Length = 184
Score = 42.0 bits (97), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 35/131 (26%), Positives = 64/131 (48%), Gaps = 14/131 (10%)
Query: 69 LQLVLNLDHTLLHCRNIKSLSSGEKYLKK-QIHSFIGSLFQMANDKLVKLRPFVRTFLEQ 127
+ LVL++D TL+HC K L +L + + +G ++ +RP+ + FL+
Sbjct: 1 MTLVLDMDETLMHCAT-KPLEKSPAFLVRFSDTNLLGHVY---------VRPYTKIFLDL 50
Query: 128 ASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE--DFNGKDRKNPDLVRGQER 185
AS + +I + T ST+ YA+ + LD + R+ + NG K+ L+ G++
Sbjct: 51 ASQICEIVVFTASTQSYADQVLAHLDPKRRLVHHRLYRQHCTMINGGYVKDLRLL-GRDI 109
Query: 186 GIVILDDTESV 196
V+L D +
Sbjct: 110 SRVVLADNSPI 120
>gi|145547036|ref|XP_001459200.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124427024|emb|CAK91803.1| unnamed protein product [Paramecium tetraurelia]
Length = 425
Score = 42.0 bits (97), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 40/142 (28%), Positives = 69/142 (48%), Gaps = 14/142 (9%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKL-VKLRPFVRTFLEQAS 129
L+L+LD TL+H + + Y+ + IG + A K+ + +RP+ FL S
Sbjct: 245 LILDLDETLIHS--CTPRENPQVYV-----TAIGDFGEEA--KIGINIRPYTSLFLSSLS 295
Query: 130 SLVDIYLCTMSTRCYAEAAVKLLDLDSKYFS---SRIIAREDFNGKDRKNPDLVRGQE-R 185
IY+ T S++ YA+A + LD +Y S SR E NG K+ L+ ++ +
Sbjct: 296 QFYTIYIYTASSQAYAQAIIGYLDPKKQYISGVLSRNNCMETKNGFFIKDLRLIGNKQLK 355
Query: 186 GIVILDDTESVWSDHTENLIVL 207
++I+D+ + EN I +
Sbjct: 356 DMLIIDNLAHSFGFQIENGIPI 377
>gi|403338921|gb|EJY68702.1| hypothetical protein OXYTRI_10682 [Oxytricha trifallax]
Length = 574
Score = 42.0 bits (97), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 32/104 (30%), Positives = 52/104 (50%), Gaps = 16/104 (15%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDK-----------LVKLRP 119
LVL++D TL+HC SL Y ++ IH + ++ + V RP
Sbjct: 366 LVLDMDETLIHC----SLEPFYGY-QEVIHVMQDTYKPISQNSDLIHSQKSLQIYVASRP 420
Query: 120 FVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
++ FLEQ SS ++ + T S + YA+ + +D +KYFS R+
Sbjct: 421 YLIHFLEQVSSQYEVVVFTASDKSYADVILDKIDPYNKYFSYRL 464
>gi|145539644|ref|XP_001455512.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124423320|emb|CAK88115.1| unnamed protein product [Paramecium tetraurelia]
Length = 390
Score = 42.0 bits (97), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 41/157 (26%), Positives = 71/157 (45%), Gaps = 16/157 (10%)
Query: 68 KLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQ 127
K ++ +LD TL+HC ++SS QI I + + +RPF ++
Sbjct: 196 KKTVIFDLDETLVHCNEEDNMSS-------QIVLPITFPTGEKVNAGINIRPFAEKMIKL 248
Query: 128 ASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE----DFNGKDR-KNPDLVRG 182
S + ++ + T S CYA + LD S+ R I R+ D N KN +++
Sbjct: 249 LSDICEVMIFTASHECYANEVINYLDPQSRV--KRRIFRDSCVTDINSNYYVKNLEVIDR 306
Query: 183 QERGIVILDDTESVWSDHTENLIVLGKYVYFRDKELN 219
+ IVI+D+ + H +N I + ++ DK+ N
Sbjct: 307 DLKDIVIVDNASYSFVHHIDNGIPI--ISFYDDKQDN 341
>gi|118378638|ref|XP_001022493.1| NLI interacting factor-like phosphatase family protein [Tetrahymena
thermophila]
gi|89304260|gb|EAS02248.1| NLI interacting factor-like phosphatase family protein [Tetrahymena
thermophila SB210]
Length = 1393
Score = 42.0 bits (97), Expect = 0.37, Method: Composition-based stats.
Identities = 33/130 (25%), Positives = 57/130 (43%), Gaps = 18/130 (13%)
Query: 49 GLSFDYMLRGLRYSEQEERKLQL----------VLNLDHTLLHCRNIKSLSSGEKYLKKQ 98
+SF ML+ +E+K+ L V +LD TL+HC ++ S +
Sbjct: 1168 AISFSRMLKPASQKVIDEKKVHLPIRRDNKKTLVFDLDETLIHCNENANIPSD---VILP 1224
Query: 99 IHSFIGSLFQMANDKLVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKY 158
I G + + + +RP+ L++ S +I + T S CYA + LD +Y
Sbjct: 1225 IRFPTGEVIEAG----INVRPYCMEILQELSKFYEIIVFTASHSCYANVVLDYLDPKGQY 1280
Query: 159 FSSRIIARED 168
+ R+ RE+
Sbjct: 1281 ITGRLF-REN 1289
>gi|145552922|ref|XP_001462136.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124429974|emb|CAK94763.1| unnamed protein product [Paramecium tetraurelia]
Length = 532
Score = 42.0 bits (97), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 30/96 (31%), Positives = 48/96 (50%), Gaps = 15/96 (15%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVK----LRPFVRTFLE 126
LV +LD TLLHC E H+ + M N+ +VK +RPF + L+
Sbjct: 336 LVFDLDETLLHC--------NENVNDPTDHTI---MVNMPNEGMVKTKINIRPFCQQMLK 384
Query: 127 QASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSR 162
S+ ++ L T + + YA+ A++L+D + K F R
Sbjct: 385 LLSNHFELILFTAAYQYYADKALELIDPERKLFQYR 420
>gi|145539087|ref|XP_001455238.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124423037|emb|CAK87841.1| unnamed protein product [Paramecium tetraurelia]
Length = 476
Score = 42.0 bits (97), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 43/171 (25%), Positives = 79/171 (46%), Gaps = 22/171 (12%)
Query: 57 RGLRYSEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKL-- 114
+ ++ +E + L+++LD TL+HC L S FI +F N+++
Sbjct: 271 KSIKVQLNQEIQKTLIIDLDETLVHCNEFSCLKSD---------FFIPVIF---NEQIYQ 318
Query: 115 --VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED---- 168
+ +RP+ + FL + +I + T S YA + LD K S R+ R+D
Sbjct: 319 VGISIRPYAQQFLRNMAKDYEIMVFTASNPDYANKIIDYLDPQHKLVSYRLF-RDDCIQI 377
Query: 169 FNGKDRKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFR-DKEL 218
N K+ ++ + IV++D++ ++ EN I + Y+ + DKEL
Sbjct: 378 SNNCHIKDLRILNRNMKDIVLVDNSAYSFAFQVENGIPIIPYLDDKNDKEL 428
>gi|193631995|ref|XP_001944419.1| PREDICTED: CTD small phosphatase-like protein-like [Acyrthosiphon
pisum]
Length = 288
Score = 42.0 bits (97), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 42/169 (24%), Positives = 82/169 (48%), Gaps = 16/169 (9%)
Query: 54 YMLRGLRYSEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDK 113
Y+L +R+ Q+ K +V++LD TL+H + K++++ + + +I + ++ +
Sbjct: 85 YLLPAIRH--QDMHKKCMVIDLDETLVHS-SFKAINNADFVVPVEIDGTVHQVYVLK--- 138
Query: 114 LVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED---FN 170
RP V FL++ L + L T S YA+ LLD F +R+ RE +
Sbjct: 139 ----RPHVDEFLQRMGELYECVLFTASLAKYADPVADLLD-KWGVFRARLF-RESCVFYR 192
Query: 171 GKDRKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYV-YFRDKEL 218
G K+ + + +VI+D++ + + H +N + + + DKEL
Sbjct: 193 GNYVKDLNKLGRALHKVVIIDNSPASYIFHPDNAVPVNSWFDDMTDKEL 241
>gi|59807669|gb|AAH89307.1| Ctdsp2 protein, partial [Mus musculus]
Length = 212
Score = 42.0 bits (97), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 41/150 (27%), Positives = 72/150 (48%), Gaps = 19/150 (12%)
Query: 62 SEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFV 121
+EQ++ ++ +V++LD TL+H + K +++ + + +I G+ Q+ V RP+V
Sbjct: 36 TEQDQGRICVVIDLDETLVH-SSFKPINNADFIVPVEIE---GTTHQV----YVLKRPYV 87
Query: 122 RTFLEQASSLVDIYLCTMSTRCYAEAAVKLLD----LDSKYFSSRIIAREDFNGKD--RK 175
FL + L + L T S YA+ LLD ++ F + + KD R
Sbjct: 88 DEFLRRMGELFECVLFTASLAKYADPVTDLLDRCGVFRARLFREACVFHQGCYVKDLSRL 147
Query: 176 NPDLVRGQERGIVILDDTESVWSDHTENLI 205
DL R VILD++ + + H EN +
Sbjct: 148 GRDL-----RKTVILDNSPASYIFHPENAV 172
>gi|156043075|ref|XP_001588094.1| hypothetical protein SS1G_10540 [Sclerotinia sclerotiorum 1980]
gi|154694928|gb|EDN94666.1| hypothetical protein SS1G_10540 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 506
Score = 42.0 bits (97), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 37/146 (25%), Positives = 69/146 (47%), Gaps = 5/146 (3%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKL---VKLRPFVRTFLEQ 127
LVL+LD TL+H S ++ QI + +G+ + V RP+ FL +
Sbjct: 320 LVLDLDETLIHSMIHGGRMSAGHMVEVQITNVVGTGGVAPQHPILYYVNKRPYCDDFLRR 379
Query: 128 ASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE-DF-NGKDRKNPDLVRGQER 185
++ + T S + YA+ + L+ + K+FS+R + F NG K+ V
Sbjct: 380 VCKWYNLVVFTASLQDYADPVIDWLEQERKFFSARYYRQHCTFRNGAYIKDLSSVEPDLS 439
Query: 186 GIVILDDTESVWSDHTENLIVLGKYV 211
++ILD++ + + H +N I + ++
Sbjct: 440 KVMILDNSPTSYLFHQDNAIPIEGWI 465
>gi|146100339|ref|XP_001468839.1| conserved hypothetical protein [Leishmania infantum JPCM5]
gi|398022901|ref|XP_003864612.1| hypothetical protein, conserved [Leishmania donovani]
gi|401429084|ref|XP_003879024.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|134073208|emb|CAM71928.1| conserved hypothetical protein [Leishmania infantum JPCM5]
gi|322495274|emb|CBZ30577.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|322502848|emb|CBZ37930.1| hypothetical protein, conserved [Leishmania donovani]
Length = 240
Score = 42.0 bits (97), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 42/151 (27%), Positives = 65/151 (43%), Gaps = 29/151 (19%)
Query: 62 SEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFV 121
+E + KL LVL+LD TL+ R SG Y + I F FQM DK +++ +
Sbjct: 44 AEIYQGKLVLVLDLDETLVFAR------SGPLYARPGIPEF----FQMCKDKGIEVVVWT 93
Query: 122 RTFLEQASSLV-DIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDR------ 174
A ++V +I C + C +K+F+ + R+D N R
Sbjct: 94 AGLKAYAQAIVSNIDTCNAVSHCIYR--------HNKWFNGQPGYRKDLNALGRPLDRVL 145
Query: 175 ---KNPDLVRG-QERGIVILDDTESVWSDHT 201
PD +RG Q+ GI++ D D+T
Sbjct: 146 IVENTPDCIRGYQDNGILVSDYEGGDGEDNT 176
>gi|398009710|ref|XP_003858054.1| hypothetical protein, conserved [Leishmania donovani]
gi|322496258|emb|CBZ31330.1| hypothetical protein, conserved [Leishmania donovani]
Length = 739
Score = 42.0 bits (97), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 27/91 (29%), Positives = 46/91 (50%), Gaps = 7/91 (7%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGS-LFQMANDKLVKLRPFVRTFL 125
R+ LV++LD TL H + +G + I + G+ LF V RP+ R FL
Sbjct: 309 RQKVLVIDLDETLCHVSTTTANMAGPPTFSEVIPTASGAELFH------VWERPYARLFL 362
Query: 126 EQASSLVDIYLCTMSTRCYAEAAVKLLDLDS 156
A+ L ++ L T +++ YA+ ++ +D D
Sbjct: 363 STAAKLFNLVLFTSASKPYADTILQRIDPDG 393
>gi|146075974|ref|XP_001462817.1| conserved hypothetical protein [Leishmania infantum JPCM5]
gi|134066897|emb|CAM60038.1| conserved hypothetical protein [Leishmania infantum JPCM5]
Length = 739
Score = 42.0 bits (97), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 27/91 (29%), Positives = 46/91 (50%), Gaps = 7/91 (7%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGS-LFQMANDKLVKLRPFVRTFL 125
R+ LV++LD TL H + +G + I + G+ LF V RP+ R FL
Sbjct: 309 RQKVLVIDLDETLCHVSTTTANMAGPPTFSEVIPTASGAELFH------VWERPYARLFL 362
Query: 126 EQASSLVDIYLCTMSTRCYAEAAVKLLDLDS 156
A+ L ++ L T +++ YA+ ++ +D D
Sbjct: 363 STAAKLFNLVLFTSASKPYADTILQRIDPDG 393
>gi|449018620|dbj|BAM82022.1| similar to nuclear LIM interactor-interacting factor
[Cyanidioschyzon merolae strain 10D]
Length = 611
Score = 42.0 bits (97), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 36/139 (25%), Positives = 68/139 (48%), Gaps = 10/139 (7%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
LVL+LD TL+HC + + +S + +H F G+ + + VK RPF++ L+ A+
Sbjct: 420 LVLDLDETLVHC-STEFMSDAD--FNFSVH-FEGTNYTV----YVKRRPFLQALLQYAAR 471
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFN--GKDRKNPDLVRGQERGIV 188
++ + T S + YA+ + +LD D R+ N G K+ ++ R +
Sbjct: 472 YFEVVVFTASQKAYADRLLNILDPDHTLIHHRLFRDACINVAGNYLKDLTVLSRDLRRTI 531
Query: 189 ILDDTESVWSDHTENLIVL 207
I+D++ + H N + +
Sbjct: 532 IVDNSPQAFGYHLGNGVPI 550
>gi|417397992|gb|JAA46029.1| Putative carboxy-terminal domain rna polymerase ii polypeptide a
small phosphatase 1 isoform 2 [Desmodus rotundus]
Length = 260
Score = 42.0 bits (97), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 36/150 (24%), Positives = 73/150 (48%), Gaps = 11/150 (7%)
Query: 63 EQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVR 122
Q+ K+ +V++LD TL+H + K +++ + + +I + ++ + RP+V
Sbjct: 84 PQDSDKICVVIDLDETLVHS-SFKPVNNADFIIPVEIDGVVHQVYVLK-------RPYVD 135
Query: 123 TFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVR- 181
FL++ L + L T S YA+ LLD F +R+ + DL R
Sbjct: 136 EFLQRMGELFECVLFTASLAKYADPVADLLD-KWGAFRARLFRESCVFHRGNYVKDLSRL 194
Query: 182 GQE-RGIVILDDTESVWSDHTENLIVLGKY 210
G++ R ++ILD++ + + H +N + + +
Sbjct: 195 GRDLRRVLILDNSPASYVFHPDNAVPVASW 224
>gi|195474791|ref|XP_002089673.1| GE22820 [Drosophila yakuba]
gi|194175774|gb|EDW89385.1| GE22820 [Drosophila yakuba]
Length = 294
Score = 42.0 bits (97), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 43/168 (25%), Positives = 65/168 (38%), Gaps = 36/168 (21%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGS-----------LFQMANDKLVKL-- 117
LVL+LD TL+H YL H +G + ++ D +V+
Sbjct: 98 LVLDLDETLVH----------SCYLDPDTHDNVGCSQLPDHAQPDYVLNVSIDPMVEPIV 147
Query: 118 -----RPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRII-----ARE 167
RP V FL+ S D+ + T S YA V LLD S R A
Sbjct: 148 FRVFKRPHVDEFLDCVSKWYDLVIYTASLEVYATQVVDLLDAGQGRMSRRFYRQHCRASS 207
Query: 168 DFNGKDRKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRD 215
KD LV G++I+D++ + D +N + + ++Y D
Sbjct: 208 PLVSKDLS---LVTPDMTGVLIIDNSPYAYRDFPDNAVPIKTFIYDPD 252
>gi|356556521|ref|XP_003546573.1| PREDICTED: uncharacterized protein LOC100799803 [Glycine max]
Length = 471
Score = 42.0 bits (97), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 28/99 (28%), Positives = 52/99 (52%), Gaps = 12/99 (12%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLV--KLRPFVRTF 124
+ + LVL+LD TL+H ++ + F ++F + +V K RP++ TF
Sbjct: 297 KSITLVLDLDETLVH-STLEHCDDAD---------FTFTVFFNLKEYIVYVKQRPYLHTF 346
Query: 125 LEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
LE+ S + ++ + T S YA+ + +LD D ++ S R+
Sbjct: 347 LERVSEMFEVVIFTASQSIYAKQLLDILDPDGRFISRRM 385
>gi|118384086|ref|XP_001025196.1| NLI interacting factor-like phosphatase family protein [Tetrahymena
thermophila]
gi|89306963|gb|EAS04951.1| NLI interacting factor-like phosphatase family protein [Tetrahymena
thermophila SB210]
Length = 426
Score = 42.0 bits (97), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 33/101 (32%), Positives = 49/101 (48%), Gaps = 4/101 (3%)
Query: 98 QIHSFIGSLFQMANDKLVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSK 157
QI F + + N L ++RPF FL++ + DI++ T S+ YAEA + +D K
Sbjct: 263 QILKFKNEIGETQNIGL-RIRPFCYEFLQKMTQFWDIFIFTASSSTYAEAIINFIDPTRK 321
Query: 158 YFS---SRIIAREDFNGKDRKNPDLVRGQERGIVILDDTES 195
Y S +R E NG K+ +V G + IL D S
Sbjct: 322 YISGILNRSNCMETKNGFFIKDLRIVSGSDLRYTILVDNLS 362
>gi|403338554|gb|EJY68521.1| Dullard-like phosphatase domain containing protein [Oxytricha
trifallax]
Length = 615
Score = 42.0 bits (97), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 28/95 (29%), Positives = 49/95 (51%), Gaps = 13/95 (13%)
Query: 59 LRYSEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLR 118
+R++ +R L +VL+LD+TL+H N SS + Y F + ++ V R
Sbjct: 430 MRFTHTNKR-LIVVLDLDNTLIHSVNSVPTSSDQNY------------FAIRDNIYVYKR 476
Query: 119 PFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLD 153
P + FL + + DIY+ T S + YA+ + ++D
Sbjct: 477 PHMEYFLAEIAKFADIYIFTASMKDYADQIMDVID 511
>gi|386770484|ref|NP_001246593.1| CG12078, isoform B [Drosophila melanogaster]
gi|383291721|gb|AFH04264.1| CG12078, isoform B [Drosophila melanogaster]
Length = 236
Score = 42.0 bits (97), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 39/143 (27%), Positives = 65/143 (45%), Gaps = 5/143 (3%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
LVL++D+T++ IK K + + H F L V RP++ FL++ S
Sbjct: 56 LVLDMDNTMITSWFIKR-GKKPKNIPRIAHDFKFYLPAYGATIYVYKRPYLDHFLDRVSK 114
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAR---EDFNGKDRKNPDLVRGQERGI 187
D+ + T YA + LD +SR+ + E F GK K+ L +
Sbjct: 115 WYDLTVFTSGAEIYASPILDFLDRGRGILNSRLYRQHCIEQF-GKWSKSVLLACPDLSNV 173
Query: 188 VILDDTESVWSDHTENLIVLGKY 210
V+LD++ + S + EN I++ Y
Sbjct: 174 VLLDNSSTECSFNAENAILIKSY 196
>gi|154344393|ref|XP_001568138.1| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134065475|emb|CAM43240.1| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 240
Score = 42.0 bits (97), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 42/151 (27%), Positives = 65/151 (43%), Gaps = 29/151 (19%)
Query: 62 SEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFV 121
+E + KL LVL+LD TL+ R SG Y + I F FQM DK +++ +
Sbjct: 44 AEIYQGKLVLVLDLDETLVFAR------SGPLYARPGIPEF----FQMCKDKGIEVVVWT 93
Query: 122 RTFLEQASSLV-DIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDR------ 174
A ++V +I C + C +K+F+ + R+D N R
Sbjct: 94 AGLKAYAQAIVSNIDTCNAVSHCIYR--------HNKWFNGQPGYRKDLNALGRPLDRVL 145
Query: 175 ---KNPDLVRG-QERGIVILDDTESVWSDHT 201
PD +RG Q+ GI++ D D+T
Sbjct: 146 IVENTPDCIRGYQDNGILVSDYEGGDGEDNT 176
>gi|145542510|ref|XP_001456942.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124424756|emb|CAK89545.1| unnamed protein product [Paramecium tetraurelia]
Length = 492
Score = 42.0 bits (97), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 37/145 (25%), Positives = 69/145 (47%), Gaps = 13/145 (8%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
LV++LD TL+HC L S + Y+ QI++ +Q + +RP+ + FL +
Sbjct: 285 LVIDLDETLVHCNEYPQLKS-DFYIPVQINNIT---YQAG----ISVRPYAQEFLRSMAE 336
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED----FNGKDRKNPDLVRGQERG 186
+I + T S YA + LD S R+ RED +G K+ ++ +
Sbjct: 337 YYEIIIFTASNEDYANQIIDYLDPTGTLVSGRLF-REDCIRVESGCHVKDLRILNRDLKD 395
Query: 187 IVILDDTESVWSDHTENLIVLGKYV 211
+V++D++ ++ +N I + Y+
Sbjct: 396 VVLIDNSAFSYAFQIDNGIPIIPYL 420
>gi|389592649|ref|XP_003721765.1| conserved hypothetical protein [Leishmania major strain Friedlin]
gi|321438298|emb|CBZ12051.1| conserved hypothetical protein [Leishmania major strain Friedlin]
Length = 738
Score = 42.0 bits (97), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 27/90 (30%), Positives = 46/90 (51%), Gaps = 7/90 (7%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGS-LFQMANDKLVKLRPFVRTFL 125
R+ LV++LD TL H + +G + I + G+ LF V RP+ R FL
Sbjct: 309 RQKVLVIDLDETLCHVSTTTANMAGPPTFSEVIPTASGAELFH------VWERPYARLFL 362
Query: 126 EQASSLVDIYLCTMSTRCYAEAAVKLLDLD 155
A+ L ++ L T +++ YA+ ++ +D D
Sbjct: 363 STAAKLFNLVLFTSASKPYADTILQRIDPD 392
>gi|146162237|ref|XP_001009046.2| NLI interacting factor-like phosphatase family protein [Tetrahymena
thermophila]
gi|146146485|gb|EAR88801.2| NLI interacting factor-like phosphatase family protein [Tetrahymena
thermophila SB210]
Length = 937
Score = 42.0 bits (97), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 26/92 (28%), Positives = 42/92 (45%), Gaps = 7/92 (7%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
L+ ++D TL+HC S S + + G Q + +RP+ L++ S
Sbjct: 745 LIFDMDETLIHCNESASTPSD---VIVDVRFPTGEFIQAG----INIRPYAIEILQELSE 797
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSR 162
+I + T S CYA+A ++ LD KY R
Sbjct: 798 EFEIVIFTASHSCYAQAVIEYLDPHRKYVHHR 829
>gi|328767798|gb|EGF77846.1| hypothetical protein BATDEDRAFT_13622 [Batrachochytrium
dendrobatidis JAM81]
Length = 192
Score = 42.0 bits (97), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 27/106 (25%), Positives = 51/106 (48%), Gaps = 8/106 (7%)
Query: 58 GLRYSEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKL 117
L + + LVL+LD TL+HC + L + + ++ ++ +L
Sbjct: 20 ALPKKTRSSPPITLVLDLDETLVHC-STSPLDHCDITFPVEFNNITYTVSG-------RL 71
Query: 118 RPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
RP +TFLE+ S + ++ + T S + YA+ + ++D KY R+
Sbjct: 72 RPHYKTFLERCSEIFEVVVFTASQKIYADRLLNIIDPTHKYIKYRL 117
>gi|145494426|ref|XP_001433207.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124400324|emb|CAK65810.1| unnamed protein product [Paramecium tetraurelia]
Length = 223
Score = 42.0 bits (97), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 43/157 (27%), Positives = 71/157 (45%), Gaps = 16/157 (10%)
Query: 57 RGLRYSEQEERKLQ-LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLV 115
R +R E RK + LVL+LD TL+H + ++ D
Sbjct: 29 RFVRLKESNNRKQKILVLDLDETLIHSCTHRDFPHITITIQDNDEPI---------DIAF 79
Query: 116 KLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAR----EDFNG 171
+RP+ + F+++ S+ IYL T S+ YA A V LD +Y + I+ R E NG
Sbjct: 80 NVRPYCKEFIKEMSNYYTIYLFTASSEMYARAIVNHLDPKRQYITD-ILCRNNCFETKNG 138
Query: 172 KDRKNPDLVRGQE-RGIVILDDTESVWSDHTENLIVL 207
K+ ++ ++ + IVI+D+ + EN I +
Sbjct: 139 FFIKDLRIITNRDLKDIVIIDNLPHSFGLQLENGIPI 175
>gi|293348636|ref|XP_002727004.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
small phosphatase 2-like [Rattus norvegicus]
gi|392349440|ref|XP_003750378.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
small phosphatase 2-like [Rattus norvegicus]
Length = 357
Score = 42.0 bits (97), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 41/155 (26%), Positives = 74/155 (47%), Gaps = 19/155 (12%)
Query: 62 SEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFV 121
+EQ++ ++ +V++LD TL+H + K +++ + + +I G+ Q+ V RP+V
Sbjct: 181 TEQDQGRICVVIDLDETLVH-SSFKPINNADFIVPVEIE---GTTHQV----YVLKRPYV 232
Query: 122 RTFLEQASSLVDIYLCTMSTRCYAEAAVKLLD----LDSKYFSSRIIAREDFNGKD--RK 175
FL + L + L T S YA+ LLD ++ F + + KD R
Sbjct: 233 DEFLRRMGELFECVLFTASLAKYADPVTDLLDRCGVFRARLFRESCVFHQGCYVKDLSRL 292
Query: 176 NPDLVRGQERGIVILDDTESVWSDHTENLIVLGKY 210
DL R VILD++ + + H EN + + +
Sbjct: 293 GRDL-----RKTVILDNSPASYIFHPENAVPVQSW 322
>gi|145533625|ref|XP_001452557.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124420256|emb|CAK85160.1| unnamed protein product [Paramecium tetraurelia]
Length = 343
Score = 41.6 bits (96), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 30/157 (19%), Positives = 74/157 (47%), Gaps = 18/157 (11%)
Query: 58 GLRYSEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKL 117
L Y + ++++++V +LD TL+H ++ K +++ F + F + +
Sbjct: 149 SLLYYGKSQKQIKIVFDLDETLVHSEEVQ---------KDKVYDFQNNEFGLF------V 193
Query: 118 RPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDR--- 174
RP+ L++ S L D+++ T + + YA+ + L+D ++ +F + + +
Sbjct: 194 RPYCCHVLKELSQLADLFVYTSANQKYAKTIINLIDPENTFFKGHFYRNNCVSLQSKMQI 253
Query: 175 KNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYV 211
K+ ++ IVI+D++ + N I + ++
Sbjct: 254 KHLGILSNNYSKIVIIDNSPIFYMGQPYNGIPIAPFI 290
>gi|300121382|emb|CBK21762.2| unnamed protein product [Blastocystis hominis]
Length = 399
Score = 41.6 bits (96), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 42/159 (26%), Positives = 75/159 (47%), Gaps = 17/159 (10%)
Query: 68 KLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQ 127
K LVL+LD TL+HC + S+ + + G F + +RPF+ L++
Sbjct: 219 KYTLVLDLDETLVHCSMERDPSADLAFSIRHE----GQRFTI----YANVRPFLFYLLKR 270
Query: 128 ASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE---DFNGKDRKNPDLVRGQE 184
+ +I + T S +CYA+ + +LD + + R+ RE + +G K+ + +
Sbjct: 271 VAPYYEIVIYTASQKCYADRLLDILDSEQHLITHRLY-REHCLNIDGNYIKDLNALNRDL 329
Query: 185 RGIVILDDTESVWSDHTENLIVLGKYVYFRDKELNGDHK 223
VI+D+ S + H +N I + +F DK DH+
Sbjct: 330 SKTVIVDNYISCFGYHLDNGIPIIS--WFSDK---ADHE 363
>gi|403333806|gb|EJY66027.1| hypothetical protein OXYTRI_13811 [Oxytricha trifallax]
Length = 509
Score = 41.6 bits (96), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 33/109 (30%), Positives = 53/109 (48%), Gaps = 16/109 (14%)
Query: 66 ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDK-----------L 114
+ K LVL++D TL+HC SL Y ++ IH + ++ D
Sbjct: 295 QSKKTLVLDMDETLIHC----SLEPFYGY-QEVIHVMQDTYKPISPDSDLIYSQKSLQIY 349
Query: 115 VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
V RP++ FLE+ SS ++ + T S + YA+ + +D KYFS R+
Sbjct: 350 VAYRPYLIHFLEKVSSQYEVVVFTASDKSYADVILDKIDPYHKYFSYRL 398
>gi|291392229|ref|XP_002712521.1| PREDICTED: CTD (carboxy-terminal domain, RNA polymerase II,
polypeptide A) small phosphatase 1 [Oryctolagus
cuniculus]
Length = 260
Score = 41.6 bits (96), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 37/149 (24%), Positives = 72/149 (48%), Gaps = 11/149 (7%)
Query: 64 QEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRT 123
Q+ K+ +V++LD TL+H + K +S+ + + +I + ++ + RP V
Sbjct: 85 QDSDKICVVIDLDETLVHS-SFKPVSNADFIIPVEIDGVVHQVYVLK-------RPHVDE 136
Query: 124 FLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVR-G 182
FL++ L + L T S YA+ LLD F +R+ + DL R G
Sbjct: 137 FLQRMGELFECVLFTASLAKYADPVADLLD-KWGAFRARLFRESCVFHRGNYVKDLSRLG 195
Query: 183 QE-RGIVILDDTESVWSDHTENLIVLGKY 210
++ R ++ILD++ + + H +N + + +
Sbjct: 196 RDLRRVLILDNSPASYVFHPDNAVPVASW 224
>gi|340507775|gb|EGR33687.1| NLI interacting factor-like phosphatase family protein, putative
[Ichthyophthirius multifiliis]
Length = 286
Score = 41.6 bits (96), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 41/153 (26%), Positives = 65/153 (42%), Gaps = 29/153 (18%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKL----VKLRPFVRTFLE 126
+V +LD TL+HC + + S I + N ++ V +RPF R L+
Sbjct: 60 IVFDLDETLIHCNE-----------NQDVQSDITIQIKFPNQEVIEAGVNIRPFCREVLK 108
Query: 127 QASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSR------IIAREDFNGKDR---KNP 177
+ S +I + T S CYA+ + LD ++ R I E + KD KN
Sbjct: 109 ELSKSFEIIVFTASHSCYADKVLDYLDPNNDIIDYRLFRESCIQTAEGVHIKDLRIFKNR 168
Query: 178 DLVRGQERGIVILDDTESVWSDHTENLIVLGKY 210
DL + IV++D+ + EN I + Y
Sbjct: 169 DL-----KDIVLVDNAAYSFGYQIENGIPIIPY 196
>gi|340507407|gb|EGR33377.1| hypothetical protein IMG5_055200 [Ichthyophthirius multifiliis]
Length = 226
Score = 41.6 bits (96), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 42/124 (33%), Positives = 62/124 (50%), Gaps = 11/124 (8%)
Query: 45 NDSFG-LSFDYMLRGLRYSEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFI 103
N+ F L+F ++ L S E++ LVL+LD TL+H NIK L+S + FI
Sbjct: 32 NNVFNELNFKHIDINLLISLYEKKPNNLVLDLDETLIHS-NIKQLNS------QGFKIFI 84
Query: 104 GSLFQMANDKLVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
S Q+ L K R ++ FL ++ +IY+ T S YAE +K +D +I
Sbjct: 85 ESKNQIKTYYLHK-RQYLEYFLINSAKNYNIYIYTSSQSNYAEEVIK--HIDPLNIIKKI 141
Query: 164 IARE 167
ARE
Sbjct: 142 FARE 145
>gi|401414521|ref|XP_003871758.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|322487977|emb|CBZ23223.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 643
Score = 41.6 bits (96), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 27/90 (30%), Positives = 46/90 (51%), Gaps = 7/90 (7%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGS-LFQMANDKLVKLRPFVRTFL 125
R+ LV++LD TL H + +G + I + G+ LF V RP+ R FL
Sbjct: 225 RQKVLVIDLDETLCHVSTTTANMAGPPTFSEVIPTASGAELFH------VWERPYARLFL 278
Query: 126 EQASSLVDIYLCTMSTRCYAEAAVKLLDLD 155
A+ L ++ L T +++ YA+ ++ +D D
Sbjct: 279 STAAKLFNLVLFTSASKPYADTILQRIDPD 308
>gi|145525990|ref|XP_001448806.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124416372|emb|CAK81409.1| unnamed protein product [Paramecium tetraurelia]
Length = 477
Score = 41.6 bits (96), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 33/109 (30%), Positives = 53/109 (48%), Gaps = 15/109 (13%)
Query: 59 LRYSEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVK-- 116
LR ++ + K+ L+ +LD TL+HC E L+K S I Q++ +++VK
Sbjct: 272 LRQKDKYKNKISLIFDLDETLVHC--------NESLLQK---SDIVLNIQVSPNEIVKAG 320
Query: 117 --LRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
+RP LE +I + T S CYA+ + LD + K S R+
Sbjct: 321 VNIRPGAIELLESLVDDFEIIVFTASHSCYAQQVLDYLDPEKKLISHRL 369
>gi|118375320|ref|XP_001020845.1| NLI interacting factor-like phosphatase family protein [Tetrahymena
thermophila]
gi|89302612|gb|EAS00600.1| NLI interacting factor-like phosphatase family protein [Tetrahymena
thermophila SB210]
Length = 699
Score = 41.6 bits (96), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 38/147 (25%), Positives = 73/147 (49%), Gaps = 13/147 (8%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
L+L+LD TL+H + + + + + L ++ + ++ VK RP V FLE+AS
Sbjct: 178 LILDLDETLVHS-SFQPMGNSDYTLSIKVQNIPFTIH-------VKKRPGVEYFLEKASE 229
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE--DFNGKDRKNPDLVRGQERGIV 188
++ + T S YA+ L+D +Y S R+ ++ G K+ + + I+
Sbjct: 230 YFEVVIYTASLAEYADPVCDLID-PKRYVSYRLFRENCTNYQGLFVKDLSKIGRDMKDIL 288
Query: 189 ILDDTESVWSDHTENLIVLGKYVYFRD 215
I+D++E+ + EN I + +F+D
Sbjct: 289 IVDNSETSFLFQPENAIQISN--FFQD 313
>gi|209156204|gb|ACI34334.1| Carboxy-terminal domain RNA polymerase II polypeptide A small
phosphatase 2 [Salmo salar]
gi|209737868|gb|ACI69803.1| Carboxy-terminal domain RNA polymerase II polypeptide A small
phosphatase 2 [Salmo salar]
Length = 260
Score = 41.6 bits (96), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 41/147 (27%), Positives = 73/147 (49%), Gaps = 13/147 (8%)
Query: 62 SEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFV 121
+ Q++ K+ +V++LD TL+H + K +S+ + + +I G+ Q+ V RP+V
Sbjct: 84 TPQDQGKICVVIDLDETLVH-SSFKPISNADFIVPVEIE---GTTHQV----YVLKRPYV 135
Query: 122 RTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED---FNGKDRKNPD 178
FL++ L + L T S YA+ LLD F +R+ RE G K+
Sbjct: 136 DEFLQRMGELFECILFTASLAKYADPVTDLLD-QCGVFRARLF-RESCVFHQGCYVKDLS 193
Query: 179 LVRGQERGIVILDDTESVWSDHTENLI 205
L+ + +ILD++ + + H EN +
Sbjct: 194 LLGRELHKTLILDNSPASYIFHPENAV 220
>gi|66808307|ref|XP_637876.1| dullard-like phosphatase domain containing protein [Dictyostelium
discoideum AX4]
gi|60466304|gb|EAL64365.1| dullard-like phosphatase domain containing protein [Dictyostelium
discoideum AX4]
Length = 375
Score = 41.6 bits (96), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 42/177 (23%), Positives = 79/177 (44%), Gaps = 19/177 (10%)
Query: 56 LRGLRYSEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLV 115
+ L + K L+L+LD TL+H +K ++ + +K I + + V
Sbjct: 189 INSLNIQNLNQPKKTLILDLDETLVH-STLKPVTHHQITVKVLIEDMDCTFY-------V 240
Query: 116 KLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE--DFNGKD 173
RP V FLE+ S DI + T S + YA+ + LD K F R+ + +G
Sbjct: 241 IKRPHVDYFLEKVSQWYDIVIFTASMQQYADPLLDQLDT-HKVFKKRLFRDSCLEKHGNF 299
Query: 174 RKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRDKELNGDHKSYSETLT 230
K+ ++ +I+D++ +S++ EN + + ++ GD+ S + L+
Sbjct: 300 VKDLSMIDQDLTSTIIIDNSPIAYSNNLENALPIDNWM--------GDNPSDTSLLS 348
>gi|363736290|ref|XP_003641697.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
small phosphatase 1-like [Gallus gallus]
Length = 275
Score = 41.6 bits (96), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 37/150 (24%), Positives = 72/150 (48%), Gaps = 11/150 (7%)
Query: 63 EQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVR 122
Q+ KL +V++LD TL+H + K +++ + + +I + ++ + RP V
Sbjct: 99 PQDASKLCVVIDLDETLVH-SSFKPVNNADFIIPVEIDGIMHQVYVLK-------RPHVD 150
Query: 123 TFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVR- 181
FL++ L + L T S YA+ LLD F +R+ + DL R
Sbjct: 151 EFLQRMGELFECVLFTASLAKYADPVADLLD-KWGAFRARLFRESCVFHRGNYVKDLSRL 209
Query: 182 GQE-RGIVILDDTESVWSDHTENLIVLGKY 210
G++ R I+I+D++ + + H +N + + +
Sbjct: 210 GRDLRRIIIVDNSPASYIFHPDNAVPVASW 239
>gi|31074177|gb|AAP34398.1| small CTD phosphatase 1 splice variant [Homo sapiens]
Length = 213
Score = 41.6 bits (96), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 36/149 (24%), Positives = 72/149 (48%), Gaps = 11/149 (7%)
Query: 64 QEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRT 123
Q+ K+ +V++LD TL+H + K +++ + + +I + ++ + RP V
Sbjct: 38 QDSDKICVVIDLDETLVHS-SFKPVNNADFIIPVEIDGVVHQVYVLK-------RPHVDE 89
Query: 124 FLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVR-G 182
FL++ L + L T S YA+ LLD F +R+ + DL R G
Sbjct: 90 FLQRMGELFECVLFTASLAKYADPVADLLD-KWGAFRARLFRESCVFHRGNYVKDLSRLG 148
Query: 183 QE-RGIVILDDTESVWSDHTENLIVLGKY 210
++ R ++ILD++ + + H +N + + +
Sbjct: 149 RDLRRVLILDNSPASYVFHPDNAVPVASW 177
>gi|164698411|ref|NP_001106941.1| carboxy-terminal domain RNA polymerase II polypeptide A small
phosphatase 2 isoform a [Mus musculus]
gi|51701335|sp|Q8BX07.1|CTDS2_MOUSE RecName: Full=Carboxy-terminal domain RNA polymerase II polypeptide
A small phosphatase 2; AltName: Full=Small C-terminal
domain phosphatase 2; AltName: Full=Small CTD
phosphatase 2; Short=SCP2
gi|26339972|dbj|BAC33649.1| unnamed protein product [Mus musculus]
gi|55154141|gb|AAH85142.1| Ctdsp2 protein [Mus musculus]
gi|148692510|gb|EDL24457.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
small phosphatase 2 [Mus musculus]
Length = 270
Score = 41.6 bits (96), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 41/150 (27%), Positives = 72/150 (48%), Gaps = 19/150 (12%)
Query: 62 SEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFV 121
+EQ++ ++ +V++LD TL+H + K +++ + + +I G+ Q+ V RP+V
Sbjct: 94 TEQDQGRICVVIDLDETLVH-SSFKPINNADFIVPVEIE---GTTHQV----YVLKRPYV 145
Query: 122 RTFLEQASSLVDIYLCTMSTRCYAEAAVKLLD----LDSKYFSSRIIAREDFNGKD--RK 175
FL + L + L T S YA+ LLD ++ F + + KD R
Sbjct: 146 DEFLRRMGELFECVLFTASLAKYADPVTDLLDRCGVFRARLFREACVFHQGCYVKDLSRL 205
Query: 176 NPDLVRGQERGIVILDDTESVWSDHTENLI 205
DL R VILD++ + + H EN +
Sbjct: 206 GRDL-----RKTVILDNSPASYIFHPENAV 230
>gi|85726465|ref|NP_647795.2| CG12078, isoform A [Drosophila melanogaster]
gi|66771487|gb|AAY55055.1| IP07723p [Drosophila melanogaster]
gi|84796078|gb|AAF47748.2| CG12078, isoform A [Drosophila melanogaster]
Length = 253
Score = 41.6 bits (96), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 39/143 (27%), Positives = 65/143 (45%), Gaps = 5/143 (3%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
LVL++D+T++ IK K + + H F L V RP++ FL++ S
Sbjct: 73 LVLDMDNTMITSWFIKR-GKKPKNIPRIAHDFKFYLPAYGATIYVYKRPYLDHFLDRVSK 131
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAR---EDFNGKDRKNPDLVRGQERGI 187
D+ + T YA + LD +SR+ + E F GK K+ L +
Sbjct: 132 WYDLTVFTSGAEIYASPILDFLDRGRGILNSRLYRQHCIEQF-GKWSKSVLLACPDLSNV 190
Query: 188 VILDDTESVWSDHTENLIVLGKY 210
V+LD++ + S + EN I++ Y
Sbjct: 191 VLLDNSSTECSFNAENAILIKSY 213
>gi|354490868|ref|XP_003507578.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
small phosphatase 2-like [Cricetulus griseus]
Length = 252
Score = 41.6 bits (96), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 40/146 (27%), Positives = 73/146 (50%), Gaps = 11/146 (7%)
Query: 62 SEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFV 121
+EQ++ ++ +V++LD TL+H + K +++ + + +I G+ Q+ V RP+V
Sbjct: 76 TEQDQGRICVVIDLDETLVH-SSFKPINNADFIVPVEIE---GTTHQV----YVLKRPYV 127
Query: 122 RTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVR 181
FL + L + L T S YA+ LLD F +R+ + DL R
Sbjct: 128 DEFLRRMGELFECVLFTASLAKYADPVTDLLD-QCGVFRARLFRESCVFHQGCYVKDLSR 186
Query: 182 -GQE-RGIVILDDTESVWSDHTENLI 205
G++ R +ILD++ + + H EN +
Sbjct: 187 LGRDLRKTLILDNSPASYIFHPENAV 212
>gi|302833726|ref|XP_002948426.1| hypothetical protein VOLCADRAFT_58281 [Volvox carteri f.
nagariensis]
gi|300266113|gb|EFJ50301.1| hypothetical protein VOLCADRAFT_58281 [Volvox carteri f.
nagariensis]
Length = 215
Score = 41.6 bits (96), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 28/97 (28%), Positives = 48/97 (49%), Gaps = 8/97 (8%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLE 126
R+ LVL+LD TL+H S + + + SF + V+ RP++R F+
Sbjct: 34 RRKTLVLDLDETLVH--------SSLEAVDRSDFSFPVIFNGTEHQVYVRQRPYLREFMV 85
Query: 127 QASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
+ ++L ++ + T S R YAE + +LD + RI
Sbjct: 86 RVAALFEVVVFTASQRIYAEKLLDILDPQQQLVRHRI 122
>gi|224000223|ref|XP_002289784.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220974992|gb|EED93321.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 179
Score = 41.6 bits (96), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 38/144 (26%), Positives = 69/144 (47%), Gaps = 13/144 (9%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
LVL+LD TL+H + +++ + + QI + ++ V RP V FL + +
Sbjct: 15 LVLDLDETLVHS-SFRAVPGADFVIPVQIEDVVHFVY-------VAKRPGVDEFLTEMAK 66
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED---FNGKDRKNPDLVRGQERGI 187
+I + T S YA+ + LLD ++ +R+ RE + G K+ L+
Sbjct: 67 HYEIVVYTASLNKYADPLLDLLD-PNRVIRTRLF-RESCVFYEGNYVKDMSLLNRDLSQA 124
Query: 188 VILDDTESVWSDHTENLIVLGKYV 211
+I+D++ S + H EN I G ++
Sbjct: 125 IIIDNSPSSYLFHPENAIDCGSFI 148
>gi|294877772|ref|XP_002768119.1| hypothetical protein Pmar_PMAR002906 [Perkinsus marinus ATCC 50983]
gi|239870316|gb|EER00837.1| hypothetical protein Pmar_PMAR002906 [Perkinsus marinus ATCC 50983]
Length = 161
Score = 41.2 bits (95), Expect = 0.58, Method: Compositional matrix adjust.
Identities = 29/106 (27%), Positives = 47/106 (44%), Gaps = 21/106 (19%)
Query: 114 LVKLRPFVRTFLEQASS------LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE 167
L K+RP R F+ + S ++ IY T +R Y E K+LD + R+++RE
Sbjct: 46 LTKIRPHARAFIRELVSKTGCGVVLSIY--TKGSRRYMEVIKKMLDPSGELIKGRLVSRE 103
Query: 168 DFNGKD---RKNPDLVRGQERGI----------VILDDTESVWSDH 200
D K+PD + + + V+LDD+ VW +
Sbjct: 104 DEPSNMTPLEKDPDFIINADSAVGTEELRRRWFVVLDDSPEVWPEE 149
>gi|15239800|ref|NP_196747.1| SCP1-like small phosphatase 5 [Arabidopsis thaliana]
gi|30683828|ref|NP_850809.1| SCP1-like small phosphatase 5 [Arabidopsis thaliana]
gi|42573341|ref|NP_974767.1| SCP1-like small phosphatase 5 [Arabidopsis thaliana]
gi|145334381|ref|NP_001078572.1| SCP1-like small phosphatase 5 [Arabidopsis thaliana]
gi|7573353|emb|CAB87659.1| putative protein [Arabidopsis thaliana]
gi|21553575|gb|AAM62668.1| unknown [Arabidopsis thaliana]
gi|56550687|gb|AAV97797.1| At5g11860 [Arabidopsis thaliana]
gi|332004345|gb|AED91728.1| SCP1-like small phosphatase 5 [Arabidopsis thaliana]
gi|332004346|gb|AED91729.1| SCP1-like small phosphatase 5 [Arabidopsis thaliana]
gi|332004347|gb|AED91730.1| SCP1-like small phosphatase 5 [Arabidopsis thaliana]
gi|332004348|gb|AED91731.1| SCP1-like small phosphatase 5 [Arabidopsis thaliana]
Length = 305
Score = 41.2 bits (95), Expect = 0.59, Method: Compositional matrix adjust.
Identities = 40/155 (25%), Positives = 73/155 (47%), Gaps = 13/155 (8%)
Query: 68 KLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQ 127
+ LVL+LD TL+H S + + +F + + + V+ RP ++ F+E+
Sbjct: 111 PISLVLDLDETLVH--------STLEPCGEVDFTFPVNFNEEEHMVYVRCRPHLKEFMER 162
Query: 128 ASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED---FNGKDRKNPDLVRGQE 184
S L +I + T S YAE + +LD K F R+ R+ F+G K+ ++
Sbjct: 163 VSRLFEIIIFTASQSIYAEQLLNVLDPKRKLFRHRVY-RDSCVFFDGNYLKDLSVLGRDL 221
Query: 185 RGIVILDDTESVWSDHTENLIVLGKYVY-FRDKEL 218
++I+D++ + EN + + + DKEL
Sbjct: 222 SRVIIVDNSPQAFGFQVENGVPIESWFNDPSDKEL 256
>gi|297811303|ref|XP_002873535.1| NLI interacting factor family protein [Arabidopsis lyrata subsp.
lyrata]
gi|297319372|gb|EFH49794.1| NLI interacting factor family protein [Arabidopsis lyrata subsp.
lyrata]
Length = 305
Score = 41.2 bits (95), Expect = 0.59, Method: Compositional matrix adjust.
Identities = 40/155 (25%), Positives = 73/155 (47%), Gaps = 13/155 (8%)
Query: 68 KLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQ 127
+ LVL+LD TL+H S + + +F + + + V+ RP ++ F+E+
Sbjct: 111 PISLVLDLDETLVH--------STLEPCGEVDFTFPVNFNEEEHMVYVRCRPHLKEFMER 162
Query: 128 ASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED---FNGKDRKNPDLVRGQE 184
S L +I + T S YAE + +LD K F R+ R+ F+G K+ ++
Sbjct: 163 VSRLFEIIIFTASQSIYAEQLLNVLDPKRKLFRHRVY-RDSCVFFDGNYLKDLSVLGRDL 221
Query: 185 RGIVILDDTESVWSDHTENLIVLGKYVY-FRDKEL 218
++I+D++ + EN + + + DKEL
Sbjct: 222 SRVIIVDNSPQAFGFQVENGVPIESWFNDPSDKEL 256
>gi|401840826|gb|EJT43491.1| PSR1-like protein [Saccharomyces kudriavzevii IFO 1802]
Length = 270
Score = 41.2 bits (95), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 44/160 (27%), Positives = 74/160 (46%), Gaps = 13/160 (8%)
Query: 62 SEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFV 121
E + K L+L+LD TL+H + K L S + L +I Q+ N ++K RP V
Sbjct: 94 GESTKGKKCLILDLDETLVHS-SFKYLRSADFVLPVEIDD------QVHNVYVIK-RPGV 145
Query: 122 RTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFN--GKDRKNPDL 179
FLE+ L ++ + T S Y + + +LD + K R+ +N G KN
Sbjct: 146 EEFLERVGKLFEVVVFTASVSRYGDPLLDILDTN-KVIHHRLFREACYNYEGNYIKNLSQ 204
Query: 180 VRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRDKELN 219
+ I+ILD++ + + H ++ I + +F D N
Sbjct: 205 IGRPLSDIIILDNSPASYIFHPQHAIPISS--WFSDTHDN 242
>gi|348552620|ref|XP_003462125.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
small phosphatase 1-like [Cavia porcellus]
Length = 261
Score = 41.2 bits (95), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 37/149 (24%), Positives = 72/149 (48%), Gaps = 11/149 (7%)
Query: 64 QEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRT 123
Q+ K+ +V++LD TL+H + K +++ + + +I I ++ + RP V
Sbjct: 86 QDSDKICVVIDLDETLVHS-SFKPVNNADFIIPVEIDGVIHQVYVLK-------RPHVDE 137
Query: 124 FLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVR-G 182
FL++ L + L T S YA+ LLD F +R+ + DL R G
Sbjct: 138 FLQRMGELFECVLFTASLAKYADPVADLLD-KWGAFRARLFRESCVFHRGNYVKDLSRLG 196
Query: 183 QE-RGIVILDDTESVWSDHTENLIVLGKY 210
++ R ++ILD++ + + H +N + + +
Sbjct: 197 RDLRRVLILDNSPASYVFHPDNAVPVASW 225
>gi|67472775|ref|XP_652175.1| nuclear LIM interactor-interacting factor 3 [Entamoeba histolytica
HM-1:IMSS]
gi|56468992|gb|EAL46789.1| nuclear LIM interactor-interacting factor 3 [Entamoeba histolytica
HM-1:IMSS]
gi|449705336|gb|EMD45405.1| nuclear LIM interactorinteracting factor 3, putative [Entamoeba
histolytica KU27]
Length = 226
Score = 41.2 bits (95), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 41/146 (28%), Positives = 64/146 (43%), Gaps = 15/146 (10%)
Query: 68 KLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQ 127
KL +V +LD TL+H S + I FQ V +RP R L+
Sbjct: 60 KLTIVFDLDETLIHTHVTSQNLSDD---------LITIEFQ-GKQYFVSVRPGARELLKS 109
Query: 128 ASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRII---AREDFNGKDRKNPDLVRGQE 184
+ ++ L T ST YA + L+ D + F ++ +E F + L R +
Sbjct: 110 LAGKYELILFTASTEGYATQIINNLERDGQIFDYKLYCHNCKEKFGQLFKDVHKLGRDLD 169
Query: 185 RGIVILDDTESVWSDHTENLIVLGKY 210
R ++I DD+ VW+ +ENL V +Y
Sbjct: 170 R-VLIFDDSTIVWTT-SENLFVCKRY 193
>gi|156839904|ref|XP_001643638.1| hypothetical protein Kpol_478p16 [Vanderwaltozyma polyspora DSM
70294]
gi|156114257|gb|EDO15780.1| hypothetical protein Kpol_478p16 [Vanderwaltozyma polyspora DSM
70294]
Length = 350
Score = 41.2 bits (95), Expect = 0.61, Method: Compositional matrix adjust.
Identities = 42/151 (27%), Positives = 69/151 (45%), Gaps = 12/151 (7%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
LVL+LD TL+H + K +S+ + L I Q N ++K RP V FL+ S
Sbjct: 182 LVLDLDETLVH-SSFKYVSTADFVLPVDIDD------QFQNVYVIK-RPGVDAFLQYTSK 233
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRII--AREDFNGKDRKNPDLVRGQERGIV 188
L ++ + T S Y + +LD + R+ A ++NG KN + I+
Sbjct: 234 LFEVVIFTASVEKYGNPLLDILDSTNDLVHHRLFRDACYNYNGNYIKNLAQLGRPLSDII 293
Query: 189 ILDDTESVWSDHTENLIVLGKYVYFRDKELN 219
ILD++ + + H + I + +F D N
Sbjct: 294 ILDNSPTSYLFHPNHAIPISS--WFSDAHDN 322
>gi|145516326|ref|XP_001444057.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124411457|emb|CAK76660.1| unnamed protein product [Paramecium tetraurelia]
Length = 411
Score = 41.2 bits (95), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 21/54 (38%), Positives = 33/54 (61%), Gaps = 1/54 (1%)
Query: 115 VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED 168
+ +RPF + FL+Q S L IY+ T S+ YA VK LD ++ S I++R++
Sbjct: 268 LNVRPFCQWFLQQMSLLYTIYVYTASSSAYANTIVKYLDPKGQWISG-ILSRQN 320
>gi|302834483|ref|XP_002948804.1| hypothetical protein VOLCADRAFT_89056 [Volvox carteri f. nagariensis]
gi|300265995|gb|EFJ50184.1| hypothetical protein VOLCADRAFT_89056 [Volvox carteri f. nagariensis]
Length = 2442
Score = 41.2 bits (95), Expect = 0.64, Method: Composition-based stats.
Identities = 20/52 (38%), Positives = 31/52 (59%)
Query: 115 VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAR 166
+KLRP R FL +A +++ + R YA+A V+LLD F SR++A+
Sbjct: 2116 LKLRPGARAFLARAHERFELWAHSRQGRPYADAVVELLDPSLALFGSRVVAQ 2167
>gi|256272313|gb|EEU07297.1| Psr1p [Saccharomyces cerevisiae JAY291]
Length = 396
Score = 41.2 bits (95), Expect = 0.64, Method: Compositional matrix adjust.
Identities = 41/143 (28%), Positives = 70/143 (48%), Gaps = 13/143 (9%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
L+L+LD TL+H + K L S + L +I Q+ N ++K RP V FLE+
Sbjct: 229 LILDLDETLVHS-SFKYLRSADFVLPVEIDD------QVHNVYVIK-RPGVEEFLERVGK 280
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE---DFNGKDRKNPDLVRGQERGI 187
L ++ + T S Y + + +LD D K R+ RE ++ G KN + I
Sbjct: 281 LFEVVVFTASVSRYGDPLLDILDTD-KVIHHRLF-REACYNYEGNYIKNLSQIGRPLSDI 338
Query: 188 VILDDTESVWSDHTENLIVLGKY 210
+ILD++ + + H ++ I + +
Sbjct: 339 IILDNSPASYIFHPQHAIPISSW 361
>gi|281204367|gb|EFA78563.1| hypothetical protein PPL_09215 [Polysphondylium pallidum PN500]
Length = 374
Score = 41.2 bits (95), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 38/151 (25%), Positives = 73/151 (48%), Gaps = 11/151 (7%)
Query: 62 SEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFV 121
SE+ + K +V++LD TL+H K S + L ++ + + + + + RP+V
Sbjct: 193 SEEFKGKKTIVIDLDETLVHSY-FKPTSEPDIILPIEMDNGVVTFY-------INKRPYV 244
Query: 122 RTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVR 181
+ + +I + T S YA+ + L+D +K SSR+ ++ K DL R
Sbjct: 245 QELFDFLHGKFEIVIFTASISRYADKVLDLID-PNKVISSRLFRESCYHHKGNYIKDLSR 303
Query: 182 -GQE-RGIVILDDTESVWSDHTENLIVLGKY 210
G++ R +I+D++ + H EN I + +
Sbjct: 304 LGRDLRNTIIVDNSPHAYFLHPENAIPITSW 334
>gi|85000055|ref|XP_954746.1| RNA polymerase II carboxyterminal domain (CTD) phosphatase
[Theileria annulata strain Ankara]
gi|65302892|emb|CAI75270.1| RNA polymerase II carboxyterminal domain (CTD) phosphatase,
putative [Theileria annulata]
Length = 246
Score = 41.2 bits (95), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 48/202 (23%), Positives = 87/202 (43%), Gaps = 46/202 (22%)
Query: 58 GLRYSEQEERKLQ--------LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQM 109
GL+Y RK LVL+LD TL+H + I+SF L Q
Sbjct: 44 GLKYGATVLRKSATLIPKRKTLVLDLDETLIHSS-----------FEPSINSFTMPLMQN 92
Query: 110 ANDKLVKL--RPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE 167
++ + + RP++ FL S + DI + T + YA+ + +D++ K R+ R+
Sbjct: 93 GVERTIYINKRPYLDEFLSIISDIYDIVIFTAGLKSYADPVIDAIDVN-KVCKKRLF-RD 150
Query: 168 D---FNGKDRKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRDKELNGDHKS 224
+NG K+ +++ + ++ +D++ + LN D+
Sbjct: 151 SCKFWNGYYIKDLEILNRPMKDVITIDNSPCCYC-------------------LNPDNAI 191
Query: 225 YSETLTDESENEEALANVLRVL 246
ET D+ EN+ LAN++ +L
Sbjct: 192 PIETWFDD-ENDSQLANLVPLL 212
>gi|340503354|gb|EGR29951.1| NLI interacting factor-like phosphatase family protein, putative
[Ichthyophthirius multifiliis]
Length = 316
Score = 41.2 bits (95), Expect = 0.66, Method: Compositional matrix adjust.
Identities = 39/145 (26%), Positives = 65/145 (44%), Gaps = 26/145 (17%)
Query: 71 LVLNLDHTLLHCRNI---KSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQ 127
L L+LD TL+H I EKY+ +K+RPF + FL++
Sbjct: 144 LYLDLDETLIHVCQIWDNPDFIIYEKYIIP-----------------IKIRPFCKEFLQK 186
Query: 128 ASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDF----NGKDRKNPDLVRGQ 183
+ DIY+ T S + YA A LD +Y I+ RE+ NG K+ +++ +
Sbjct: 187 IAQYWDIYIFTASQKKYANAVCDFLDPQREYIID-ILTRENCMETKNGLFIKDLRIIKDK 245
Query: 184 E-RGIVILDDTESVWSDHTENLIVL 207
+ + + I+D+ + EN I +
Sbjct: 246 DIKKMAIVDNLSHSYGFQIENGIPI 270
>gi|325533975|pdb|3PGL|A Chain A, Crystal Structure Of Human Small C-Terminal Domain
Phosphatase 1 (Scp1) Bound To Rabeprazole
gi|325533976|pdb|3PGL|B Chain B, Crystal Structure Of Human Small C-Terminal Domain
Phosphatase 1 (Scp1) Bound To Rabeprazole
Length = 180
Score = 41.2 bits (95), Expect = 0.66, Method: Compositional matrix adjust.
Identities = 36/149 (24%), Positives = 72/149 (48%), Gaps = 11/149 (7%)
Query: 64 QEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRT 123
Q+ K+ +V++LD TL+H + K +++ + + +I + ++ + RP V
Sbjct: 10 QDSDKICVVIDLDETLVHS-SFKPVNNADFIIPVEIDGVVHQVYVLK-------RPHVDE 61
Query: 124 FLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVR-G 182
FL++ L + L T S YA+ LLD F +R+ + DL R G
Sbjct: 62 FLQRMGELFECVLFTASLAKYADPVADLLD-KWGAFRARLFRESCVFHRGNYVKDLSRLG 120
Query: 183 QE-RGIVILDDTESVWSDHTENLIVLGKY 210
++ R ++ILD++ + + H +N + + +
Sbjct: 121 RDLRRVLILDNSPASYVFHPDNAVPVASW 149
>gi|342180265|emb|CCC89742.1| conserved hypothetical protein [Trypanosoma congolense IL3000]
Length = 569
Score = 41.2 bits (95), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 32/95 (33%), Positives = 48/95 (50%), Gaps = 7/95 (7%)
Query: 62 SEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGS-LFQMANDKLVKLRPF 120
S Q R+ L+L+LD TL S SS + I + G+ LF V RP+
Sbjct: 301 SYQATRQKVLILDLDETLCFVSTNLSASSQPPSFSEVIPTASGAELFH------VWERPY 354
Query: 121 VRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLD 155
V+ FL S L ++ L T ST+ YA++ ++ +D D
Sbjct: 355 VKLFLRTMSKLFNLVLFTSSTKPYADSILRRIDPD 389
>gi|123454430|ref|XP_001314970.1| NLI interacting factor-like phosphatase family protein [Trichomonas
vaginalis G3]
gi|121897632|gb|EAY02747.1| NLI interacting factor-like phosphatase family protein [Trichomonas
vaginalis G3]
Length = 218
Score = 41.2 bits (95), Expect = 0.70, Method: Compositional matrix adjust.
Identities = 26/76 (34%), Positives = 39/76 (51%), Gaps = 12/76 (15%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
LVL+LD TL+H HS + +L ++ V LRP V+ FLE+ S
Sbjct: 44 LVLDLDETLVHTSTFPP------------HSDVEALKFDDTNEYVFLRPNVKKFLERVSE 91
Query: 131 LVDIYLCTMSTRCYAE 146
L ++++ T T+ YAE
Sbjct: 92 LFEVFIFTAGTQIYAE 107
>gi|320588951|gb|EFX01419.1| nif domain containing protein [Grosmannia clavigera kw1407]
Length = 585
Score = 41.2 bits (95), Expect = 0.70, Method: Compositional matrix adjust.
Identities = 43/165 (26%), Positives = 75/165 (45%), Gaps = 21/165 (12%)
Query: 63 EQEERKLQ--LVLNLDHTLLHCRNIKS-LSSGEKYLKKQIHSFIGSLFQMANDK------ 113
E +R Q L+L+LD TL+H + +S+G + +F+G Q +
Sbjct: 386 ETADRTHQKTLILDLDETLIHSMSKGGRMSTGHMVEVRLNTTFVGMGGQPSAGPQHPILY 445
Query: 114 LVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI-----IARED 168
V RP+ FL + S ++ + T S + YA+ + L+ + KYFS+R R
Sbjct: 446 YVHKRPYCDEFLRRVSKWYNLVVFTASVQEYADPVIDWLESERKYFSARYYRQHCTFRHG 505
Query: 169 FNGKDRK--NPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYV 211
KD PDL + ++ILD++ + H +N I + ++
Sbjct: 506 AFIKDLSAVEPDLSK-----VMILDNSPLSYMFHQDNAIPIQGWI 545
>gi|145538780|ref|XP_001455090.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124422878|emb|CAK87693.1| unnamed protein product [Paramecium tetraurelia]
Length = 554
Score = 41.2 bits (95), Expect = 0.70, Method: Compositional matrix adjust.
Identities = 28/93 (30%), Positives = 44/93 (47%), Gaps = 7/93 (7%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
LV +LD TL+HC S+ G+ L I G Q + + +RP+ + L+ S
Sbjct: 360 LVFDLDETLIHCNESTSIP-GDIILP--ITFPTGETIQAS----INIRPYAQQILQTLSR 412
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
+I + T S CYA + LD ++ S R+
Sbjct: 413 HFEIIVFTASHSCYANIVLDYLDPKKQWISHRL 445
>gi|124087766|ref|XP_001346866.1| CTD-like phosphatase [Paramecium tetraurelia strain d4-2]
gi|145474907|ref|XP_001423476.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|50057255|emb|CAH03239.1| CTD-like phosphatase, putative [Paramecium tetraurelia]
gi|124390536|emb|CAK56078.1| unnamed protein product [Paramecium tetraurelia]
Length = 276
Score = 41.2 bits (95), Expect = 0.71, Method: Compositional matrix adjust.
Identities = 39/156 (25%), Positives = 75/156 (48%), Gaps = 17/156 (10%)
Query: 62 SEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFV 121
+ Q RK LVL+LD TL+HC ++ + + + H G L+ + K RP++
Sbjct: 31 NSQVRRKKTLVLDLDETLVHCEFKENPNFHYETILDVWHR--GVLYTVYLCK----RPYL 84
Query: 122 RTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLD---SKYFSSRIIARED---FNGKDRK 175
R FL+Q S+ +I + T Y + ++ +D+D S YF AR + NG K
Sbjct: 85 REFLQQLSAYYEIIVFTAGYESYCDKVLQHIDIDRHISDYF-----ARSNCRFVNGICLK 139
Query: 176 NPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYV 211
+ ++ ++ +D+ + + EN +++ ++
Sbjct: 140 DLSILDRPLDQLIFIDNNANAFEMQPENGLLIPSFL 175
>gi|323336549|gb|EGA77815.1| Psr1p [Saccharomyces cerevisiae Vin13]
Length = 423
Score = 41.2 bits (95), Expect = 0.73, Method: Compositional matrix adjust.
Identities = 39/143 (27%), Positives = 69/143 (48%), Gaps = 13/143 (9%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
L+L+LD TL+H + K L S + L +I + +++ V RP V FLE+
Sbjct: 256 LILDLDETLVHS-SFKYLRSADFVLPVEIDDQVHNVY-------VIKRPGVEEFLERVGK 307
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE---DFNGKDRKNPDLVRGQERGI 187
L ++ + T S Y + + +LD D K R+ RE ++ G KN + I
Sbjct: 308 LFEVVVFTASVSRYGDPLLDILDTD-KVIHHRLF-REACYNYEGNYIKNLSQIGRPLSDI 365
Query: 188 VILDDTESVWSDHTENLIVLGKY 210
+ILD++ + + H ++ I + +
Sbjct: 366 IILDNSPASYIFHPQHAIPISSW 388
>gi|294875260|ref|XP_002767242.1| hypothetical protein Pmar_PMAR022745 [Perkinsus marinus ATCC 50983]
gi|239868797|gb|EEQ99959.1| hypothetical protein Pmar_PMAR022745 [Perkinsus marinus ATCC 50983]
Length = 215
Score = 41.2 bits (95), Expect = 0.73, Method: Compositional matrix adjust.
Identities = 27/104 (25%), Positives = 46/104 (44%), Gaps = 17/104 (16%)
Query: 114 LVKLRP----FVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDF 169
L K+RP F+R + + V + + T +R Y E K+LD + R+++RED
Sbjct: 46 LTKIRPHARAFIRELVSKTGCGVVLSIYTKGSRRYMEVIKKMLDPSGELIKGRLVSREDE 105
Query: 170 NGKD---RKNPDLVRGQERGI----------VILDDTESVWSDH 200
K+PD + + + V+LDD+ VW +
Sbjct: 106 PSNMTPLEKDPDFIINADSAVGTEELRRRWFVVLDDSPEVWPEE 149
>gi|151941159|gb|EDN59537.1| protein phosphatase [Saccharomyces cerevisiae YJM789]
gi|190406033|gb|EDV09300.1| phosphatase PSR1 [Saccharomyces cerevisiae RM11-1a]
gi|259147980|emb|CAY81229.1| Psr1p [Saccharomyces cerevisiae EC1118]
Length = 423
Score = 41.2 bits (95), Expect = 0.73, Method: Compositional matrix adjust.
Identities = 39/143 (27%), Positives = 69/143 (48%), Gaps = 13/143 (9%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
L+L+LD TL+H + K L S + L +I + +++ V RP V FLE+
Sbjct: 256 LILDLDETLVHS-SFKYLRSADFVLPVEIDDQVHNVY-------VIKRPGVEEFLERVGK 307
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE---DFNGKDRKNPDLVRGQERGI 187
L ++ + T S Y + + +LD D K R+ RE ++ G KN + I
Sbjct: 308 LFEVVVFTASVSRYGDPLLDILDTD-KVIHHRLF-REACYNYEGNYIKNLSQIGRPLSDI 365
Query: 188 VILDDTESVWSDHTENLIVLGKY 210
+ILD++ + + H ++ I + +
Sbjct: 366 IILDNSPASYIFHPQHAIPISSW 388
>gi|349579717|dbj|GAA24878.1| K7_Psr1p [Saccharomyces cerevisiae Kyokai no. 7]
gi|392297965|gb|EIW09064.1| Psr1p [Saccharomyces cerevisiae CEN.PK113-7D]
Length = 433
Score = 41.2 bits (95), Expect = 0.74, Method: Compositional matrix adjust.
Identities = 39/143 (27%), Positives = 69/143 (48%), Gaps = 13/143 (9%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
L+L+LD TL+H + K L S + L +I + +++ V RP V FLE+
Sbjct: 266 LILDLDETLVHS-SFKYLRSADFVLPVEIDDQVHNVY-------VIKRPGVEEFLERVGK 317
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE---DFNGKDRKNPDLVRGQERGI 187
L ++ + T S Y + + +LD D K R+ RE ++ G KN + I
Sbjct: 318 LFEVVVFTASVSRYGDPLLDILDTD-KVIHHRLF-REACYNYEGNYIKNLSQIGRPLSDI 375
Query: 188 VILDDTESVWSDHTENLIVLGKY 210
+ILD++ + + H ++ I + +
Sbjct: 376 IILDNSPASYIFHPQHAIPISSW 398
>gi|355750837|gb|EHH55164.1| hypothetical protein EGM_04316, partial [Macaca fascicularis]
Length = 237
Score = 40.8 bits (94), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 38/159 (23%), Positives = 76/159 (47%), Gaps = 13/159 (8%)
Query: 54 YMLRGLRYSEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDK 113
Y+L + Q+ K+ +V++LD TL+H + K +++ + + +I + ++ +
Sbjct: 54 YLLPAAK--AQDSDKICVVIDLDETLVHS-SFKPVNNADFIIPVEIDGVVHQVYVLK--- 107
Query: 114 LVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKD 173
RP V FL++ L + L T S YA+ LLD F +R+ +
Sbjct: 108 ----RPHVDEFLQRMGELFECVLFTASLAKYADPVADLLD-KWGAFRARLFRESCVFHRG 162
Query: 174 RKNPDLVR-GQE-RGIVILDDTESVWSDHTENLIVLGKY 210
DL R G++ R ++ILD++ + + H +N + + +
Sbjct: 163 NYVKDLSRLGRDLRRVLILDNSPASYVFHPDNAVPVASW 201
>gi|323303946|gb|EGA57726.1| Psr1p [Saccharomyces cerevisiae FostersB]
Length = 423
Score = 40.8 bits (94), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 39/143 (27%), Positives = 69/143 (48%), Gaps = 13/143 (9%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
L+L+LD TL+H + K L S + L +I + +++ V RP V FLE+
Sbjct: 256 LILDLDETLVHS-SFKYLRSADFVLPVEIDDQVHNVY-------VIKRPGVEEFLERVGK 307
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE---DFNGKDRKNPDLVRGQERGI 187
L ++ + T S Y + + +LD D K R+ RE ++ G KN + I
Sbjct: 308 LFEVVVFTASVSRYGDPLLDILDTD-KVIHHRLF-REACYNYEGNYIKNLSQIGRPLSDI 365
Query: 188 VILDDTESVWSDHTENLIVLGKY 210
+ILD++ + + H ++ I + +
Sbjct: 366 IILDNSPASYIFHPQHAIPISSW 388
>gi|313224860|emb|CBY20652.1| unnamed protein product [Oikopleura dioica]
Length = 271
Score = 40.8 bits (94), Expect = 0.80, Method: Compositional matrix adjust.
Identities = 38/155 (24%), Positives = 75/155 (48%), Gaps = 15/155 (9%)
Query: 65 EERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTF 124
E +K+ V++LD TL+H + K +++ + ++ +I + + ++ + RP+V F
Sbjct: 85 EPKKICCVIDLDETLVHS-SFKPIANADFHVPVEIENMVHQVYVLK-------RPYVDEF 136
Query: 125 LEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLV---R 181
L + L + L T S YA+ +D +++ FSSR+ + DL R
Sbjct: 137 LAKVGELFECVLFTASLAKYADEVANEIDPNNE-FSSRLFRESCVYDRGNYVKDLTKLGR 195
Query: 182 GQERGIVILDDTESVWSDHTENLIVLGKYVYFRDK 216
+R I+I D++ + + +N I + +F DK
Sbjct: 196 PLDRTIII-DNSPASYLFQPQNAIPVSS--WFEDK 227
>gi|17509983|ref|NP_491348.1| Protein SCPL-3, isoform a [Caenorhabditis elegans]
gi|75023288|sp|Q9N4V4.1|SCPL3_CAEEL RecName: Full=CTD small phosphatase-like protein 3;
Short=CTDSP-like 3
gi|351059571|emb|CCD67161.1| Protein SCPL-3, isoform a [Caenorhabditis elegans]
Length = 287
Score = 40.8 bits (94), Expect = 0.80, Method: Compositional matrix adjust.
Identities = 26/93 (27%), Positives = 44/93 (47%), Gaps = 8/93 (8%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
LVL+LD TL+HC ++ L + + ++ V+LRP +RTFL + +
Sbjct: 67 LVLDLDETLVHC-SLTPLDNATMVFPVVFQNITYQVY-------VRLRPHLRTFLSRMAK 118
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
+I + T S + YA +LD + R+
Sbjct: 119 TFEIIIFTASKKVYANKLCDILDPRKNHIRHRL 151
>gi|390464816|ref|XP_003733289.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
small phosphatase 1 isoform 2 [Callithrix jacchus]
Length = 260
Score = 40.8 bits (94), Expect = 0.82, Method: Compositional matrix adjust.
Identities = 37/149 (24%), Positives = 72/149 (48%), Gaps = 11/149 (7%)
Query: 64 QEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRT 123
Q+ K+ +V++LD TL+H + K +++ + + +I + ++ V RP V
Sbjct: 85 QDSDKICVVIDLDETLVHS-SFKPVNNADFIIPVEIDGVVHQVY-------VLKRPHVDE 136
Query: 124 FLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVR-G 182
FL++ L + L T S YA+ LLD F +R+ + DL R G
Sbjct: 137 FLQRMGELFECVLFTASLAKYADPVADLLD-KWGAFRARLFRESCVFHRGNYVKDLSRLG 195
Query: 183 QE-RGIVILDDTESVWSDHTENLIVLGKY 210
++ R ++ILD++ + + H +N + + +
Sbjct: 196 RDLRRVLILDNSPASYVFHPDNAVPVASW 224
>gi|154331705|ref|XP_001561670.1| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134058989|emb|CAM36816.1| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 738
Score = 40.8 bits (94), Expect = 0.82, Method: Compositional matrix adjust.
Identities = 27/90 (30%), Positives = 45/90 (50%), Gaps = 7/90 (7%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGS-LFQMANDKLVKLRPFVRTFL 125
R+ LV++LD TL H + G + I + G+ LF V RP+ R FL
Sbjct: 321 RQKVLVMDLDETLCHVSTTTANMEGPPTFSEVIPTASGAELFH------VWERPYTRLFL 374
Query: 126 EQASSLVDIYLCTMSTRCYAEAAVKLLDLD 155
A+ L ++ L T +++ YA+ ++ +D D
Sbjct: 375 STAAKLFNLVLFTSASKPYADTILQRIDPD 404
>gi|145513909|ref|XP_001442865.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124410226|emb|CAK75468.1| unnamed protein product [Paramecium tetraurelia]
Length = 392
Score = 40.8 bits (94), Expect = 0.83, Method: Compositional matrix adjust.
Identities = 44/161 (27%), Positives = 73/161 (45%), Gaps = 24/161 (14%)
Query: 57 RGLRYSEQEERKLQL-VLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLV 115
R +R E ++K +L +L+LD TL+H E++ D
Sbjct: 206 RYIRLKEPNQKKSKLLILDLDETLIHITITLQDDDEERF-----------------DLCF 248
Query: 116 KLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAR----EDFNG 171
+RPF FL++ S +I+L T S+ YA A V LD +Y + I+ R E NG
Sbjct: 249 NVRPFCNEFLKEMSKYYNIHLFTASSELYANAIVNHLDPKRQYINE-ILCRNNCFETKNG 307
Query: 172 KDRKNPDLVRGQE-RGIVILDDTESVWSDHTENLIVLGKYV 211
K+ ++ + + IVI+D+ + EN I + +Y+
Sbjct: 308 FFIKDLRIITNRTLKDIVIVDNLPHSFGLQLENGIPILEYL 348
>gi|6323019|ref|NP_013091.1| Psr1p [Saccharomyces cerevisiae S288c]
gi|55583861|sp|Q07800.1|PSR1_YEAST RecName: Full=Phosphatase PSR1; AltName: Full=Plasma membrane
sodium response protein 1
gi|1360175|emb|CAA97454.1| unnamed protein product [Saccharomyces cerevisiae]
gi|1495214|emb|CAA62782.1| L1341 protein [Saccharomyces cerevisiae]
gi|285813412|tpg|DAA09308.1| TPA: Psr1p [Saccharomyces cerevisiae S288c]
Length = 427
Score = 40.8 bits (94), Expect = 0.84, Method: Compositional matrix adjust.
Identities = 39/143 (27%), Positives = 69/143 (48%), Gaps = 13/143 (9%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
L+L+LD TL+H + K L S + L +I + +++ V RP V FLE+
Sbjct: 260 LILDLDETLVHS-SFKYLRSADFVLSVEIDDQVHNVY-------VIKRPGVEEFLERVGK 311
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE---DFNGKDRKNPDLVRGQERGI 187
L ++ + T S Y + + +LD D K R+ RE ++ G KN + I
Sbjct: 312 LFEVVVFTASVSRYGDPLLDILDTD-KVIHHRLF-REACYNYEGNYIKNLSQIGRPLSDI 369
Query: 188 VILDDTESVWSDHTENLIVLGKY 210
+ILD++ + + H ++ I + +
Sbjct: 370 IILDNSPASYIFHPQHAIPISSW 392
>gi|119591022|gb|EAW70616.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
small phosphatase 1, isoform CRA_b [Homo sapiens]
gi|119591023|gb|EAW70617.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
small phosphatase 1, isoform CRA_b [Homo sapiens]
Length = 255
Score = 40.8 bits (94), Expect = 0.84, Method: Compositional matrix adjust.
Identities = 36/149 (24%), Positives = 72/149 (48%), Gaps = 11/149 (7%)
Query: 64 QEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRT 123
Q+ K+ +V++LD TL+H + K +++ + + +I + ++ + RP V
Sbjct: 80 QDSDKICVVIDLDETLVHS-SFKPVNNADFIIPVEIDGVVHQVYVLK-------RPHVDE 131
Query: 124 FLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVR-G 182
FL++ L + L T S YA+ LLD F +R+ + DL R G
Sbjct: 132 FLQRMGELFECVLFTASLAKYADPVADLLD-KWGAFRARLFRESCVFHRGNYVKDLSRLG 190
Query: 183 QE-RGIVILDDTESVWSDHTENLIVLGKY 210
++ R ++ILD++ + + H +N + + +
Sbjct: 191 RDLRRVLILDNSPASYVFHPDNAVPVASW 219
>gi|145475985|ref|XP_001424015.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124391077|emb|CAK56617.1| unnamed protein product [Paramecium tetraurelia]
Length = 552
Score = 40.8 bits (94), Expect = 0.84, Method: Compositional matrix adjust.
Identities = 29/93 (31%), Positives = 43/93 (46%), Gaps = 7/93 (7%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
LV +LD TL+HC N G+ L I G Q + + +RPF + L+ S
Sbjct: 358 LVFDLDETLIHC-NESIAVPGDIVLP--ISFPTGETIQAS----INIRPFAQQILQTLSR 410
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
+I + T S CYA + LD ++ S R+
Sbjct: 411 HFEIIVFTASHSCYANIVLDYLDPKKQWISHRL 443
>gi|365764281|gb|EHN05805.1| Psr1p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
Length = 423
Score = 40.8 bits (94), Expect = 0.85, Method: Compositional matrix adjust.
Identities = 39/143 (27%), Positives = 69/143 (48%), Gaps = 13/143 (9%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
L+L+LD TL+H + K L S + L +I + +++ V RP V FLE+
Sbjct: 256 LILDLDETLVHS-SFKYLRSADFVLPVEIDDQVHNVY-------VIKRPGVEEFLERVGK 307
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE---DFNGKDRKNPDLVRGQERGI 187
L ++ + T S Y + + +LD D K R+ RE ++ G KN + I
Sbjct: 308 LFEVVVFTASVSRYGDPLLDILDTD-KVIHHRLF-REACYNYEGNYIKNLSQIGRPLSDI 365
Query: 188 VILDDTESVWSDHTENLIVLGKY 210
+ILD++ + + H ++ I + +
Sbjct: 366 IILDNSPASYIFHPQHAIPISSW 388
>gi|148667909|gb|EDL00326.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
small phosphatase 1, isoform CRA_b [Mus musculus]
Length = 209
Score = 40.8 bits (94), Expect = 0.85, Method: Compositional matrix adjust.
Identities = 36/149 (24%), Positives = 72/149 (48%), Gaps = 11/149 (7%)
Query: 64 QEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRT 123
Q+ K+ +V++LD TL+H + K +++ + + +I + ++ + RP V
Sbjct: 34 QDSDKICVVIDLDETLVHS-SFKPVNNADFIIPVEIDGVVHQVYVLK-------RPHVDE 85
Query: 124 FLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVR-G 182
FL++ L + L T S YA+ LLD F +R+ + DL R G
Sbjct: 86 FLQRMGELFECVLFTASLAKYADPVADLLD-KWGAFRARLFRESCVFHRGNYVKDLSRLG 144
Query: 183 QE-RGIVILDDTESVWSDHTENLIVLGKY 210
++ R ++ILD++ + + H +N + + +
Sbjct: 145 RDLRRVLILDNSPASYVFHPDNAVPVASW 173
>gi|145504064|ref|XP_001438004.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124405165|emb|CAK70607.1| unnamed protein product [Paramecium tetraurelia]
Length = 419
Score = 40.8 bits (94), Expect = 0.86, Method: Compositional matrix adjust.
Identities = 49/202 (24%), Positives = 89/202 (44%), Gaps = 39/202 (19%)
Query: 29 HTTVRDSRCIFCSQAMNDSFGLSFDYMLRGLRYSEQEERKLQ--LVLNLDHTLLHCRNIK 86
++ ++ S+ I C Q SF + Q ++K+Q L+++LD TL+HC
Sbjct: 197 YSNLQKSKLIVCPQQY--SFSIKI-----------QPQKKIQKTLIIDLDETLVHCNEFS 243
Query: 87 SLSSGEKYLKKQIHSFIGSL-----FQMANDKLVKLRPFVRTFLEQASSLVDIYLCTMST 141
L S FI + FQ+ + +RP + FL + + +I + T S
Sbjct: 244 CLKSD---------FFIPLVYGDKSFQVG----ISIRPHAQQFLRNMAKVYEIIVFTASN 290
Query: 142 RCYAEAAVKLLDLDSKYFSSRIIARED----FNGKDRKNPDLVRGQERGIVILDDTESVW 197
YA + LD + S R+ R+D N K+ ++ + IV++D++ +
Sbjct: 291 PDYANKIIDYLDPEQNLVSYRLF-RDDCIQISNNCHIKDLRILNRNMQDIVLVDNSAYSF 349
Query: 198 SDHTENLIVLGKYVYFR-DKEL 218
+ +N I + Y+ + DKEL
Sbjct: 350 AFQIDNGIPIIPYLDNKNDKEL 371
>gi|403266874|ref|XP_003925585.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
small phosphatase 1 isoform 1 [Saimiri boliviensis
boliviensis]
Length = 262
Score = 40.8 bits (94), Expect = 0.89, Method: Compositional matrix adjust.
Identities = 36/149 (24%), Positives = 72/149 (48%), Gaps = 11/149 (7%)
Query: 64 QEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRT 123
Q+ K+ +V++LD TL+H + K +++ + + +I + ++ + RP V
Sbjct: 87 QDSDKICVVIDLDETLVH-SSFKPVNNADFIIPVEIDGVVHQVYVLK-------RPHVDE 138
Query: 124 FLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVR-G 182
FL++ L + L T S YA+ LLD F +R+ + DL R G
Sbjct: 139 FLQRMGELFECVLFTASLAKYADPVADLLD-KWGAFRARLFRESCVFHRGNYVKDLSRLG 197
Query: 183 QE-RGIVILDDTESVWSDHTENLIVLGKY 210
++ R ++ILD++ + + H +N + + +
Sbjct: 198 RDLRRVLILDNSPASYVFHPDNAVPVASW 226
>gi|365759502|gb|EHN01285.1| Psr1p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
Length = 410
Score = 40.8 bits (94), Expect = 0.89, Method: Compositional matrix adjust.
Identities = 46/161 (28%), Positives = 77/161 (47%), Gaps = 15/161 (9%)
Query: 62 SEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFV 121
SE + K L+L+LD TL+H + K L S + L +I Q+ N ++K RP V
Sbjct: 234 SESTKGKKCLILDLDETLVHS-SFKYLRSADFVLPVEIDD------QVHNVYVIK-RPGV 285
Query: 122 RTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE---DFNGKDRKNPD 178
FLE+ L ++ + T S Y + + +LD + K R+ RE ++ G KN
Sbjct: 286 EEFLERVGKLFEVVVFTASVSRYGDPLLDILDTN-KVIHHRLF-REACYNYEGNYIKNLS 343
Query: 179 LVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRDKELN 219
+ I+ILD++ + + H ++ I + +F D N
Sbjct: 344 QIGRPLSDIIILDNSPASYIFHPQHAIPISS--WFSDTHDN 382
>gi|387018216|gb|AFJ51226.1| Carboxy-terminal domain RNA polymerase II polypeptide [Crotalus
adamanteus]
Length = 271
Score = 40.8 bits (94), Expect = 0.89, Method: Compositional matrix adjust.
Identities = 37/146 (25%), Positives = 71/146 (48%), Gaps = 11/146 (7%)
Query: 62 SEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFV 121
++Q++ ++ +V++LD TL+H + K +++ + + +I ++ + RPFV
Sbjct: 95 TQQDQGRICVVIDLDETLVH-SSFKPINNADFIVPVEIEGTTHEVYVLK-------RPFV 146
Query: 122 RTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVR 181
FL + L + L T S YA+ LLD F +R+ + DL R
Sbjct: 147 DEFLRRMGELFECVLFTASLAKYADPVTDLLD-KCGVFRTRLFRESCVFHQGCYVKDLSR 205
Query: 182 -GQE-RGIVILDDTESVWSDHTENLI 205
G++ R +ILD++ + + H EN +
Sbjct: 206 LGRDLRKTLILDNSPASYIFHPENAV 231
>gi|357156637|ref|XP_003577524.1| PREDICTED: CTD small phosphatase-like protein 2-like isoform 2
[Brachypodium distachyon]
Length = 443
Score = 40.8 bits (94), Expect = 0.89, Method: Compositional matrix adjust.
Identities = 42/155 (27%), Positives = 71/155 (45%), Gaps = 13/155 (8%)
Query: 68 KLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQ 127
+ LVL+LD TL+H S + + +F + V+ RP+++ FLE+
Sbjct: 258 RTTLVLDLDETLVH--------STLEPCEDSDFTFPVHFNLREHTIYVRCRPYLKEFLER 309
Query: 128 ASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED---FNGKDRKNPDLVRGQE 184
+S+ +I + T S YAE + +LD K F R+ RE G K+ ++
Sbjct: 310 VASMFEIIIFTASQSIYAEQLLNVLDPKRKLFRHRVY-RESCVYVEGNYLKDLSVLGRDL 368
Query: 185 RGIVILDDTESVWSDHTENLIVLGKYV-YFRDKEL 218
+VI+D++ + EN I + + DKEL
Sbjct: 369 ARVVIVDNSPQAFGFQLENGIPIESWFDDPNDKEL 403
>gi|32564286|ref|NP_871854.1| Protein SCPL-3, isoform b [Caenorhabditis elegans]
gi|351059572|emb|CCD67162.1| Protein SCPL-3, isoform b [Caenorhabditis elegans]
Length = 312
Score = 40.8 bits (94), Expect = 0.90, Method: Compositional matrix adjust.
Identities = 26/93 (27%), Positives = 44/93 (47%), Gaps = 8/93 (8%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
LVL+LD TL+HC ++ L + + ++ V+LRP +RTFL + +
Sbjct: 67 LVLDLDETLVHC-SLTPLDNATMVFPVVFQNITYQVY-------VRLRPHLRTFLSRMAK 118
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
+I + T S + YA +LD + R+
Sbjct: 119 TFEIIIFTASKKVYANKLCDILDPRKNHIRHRL 151
>gi|126343824|ref|XP_001380778.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
small phosphatase 2-like [Monodelphis domestica]
Length = 317
Score = 40.8 bits (94), Expect = 0.91, Method: Compositional matrix adjust.
Identities = 37/151 (24%), Positives = 73/151 (48%), Gaps = 11/151 (7%)
Query: 62 SEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFV 121
++Q++ ++ +V++LD TL+H + K +++ + + +I ++ V RP+V
Sbjct: 141 TQQDQGRICVVIDLDETLVH-SSFKPINNADFIVPVEIEGITHQVY-------VLKRPYV 192
Query: 122 RTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVR 181
FL + L + L T S YA+ LLD F +R+ + DL R
Sbjct: 193 DEFLRRMGELFECVLFTASLAKYADPVTDLLD-QCGVFRARLFRESCVFHQGCYVKDLSR 251
Query: 182 -GQE-RGIVILDDTESVWSDHTENLIVLGKY 210
G++ R +ILD++ + + H EN + + +
Sbjct: 252 LGRDLRKTLILDNSPASYIFHPENAVPVQSW 282
>gi|145517051|ref|XP_001444414.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124411825|emb|CAK77017.1| unnamed protein product [Paramecium tetraurelia]
Length = 477
Score = 40.8 bits (94), Expect = 0.92, Method: Compositional matrix adjust.
Identities = 36/112 (32%), Positives = 55/112 (49%), Gaps = 21/112 (18%)
Query: 59 LRYSEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVK-- 116
L+ E+ K+ ++ +LD TL+HC E L+K S I Q+ +++VK
Sbjct: 272 LKQKEKYRNKISVIFDLDETLVHC--------NESLLQK---SDIVLNIQVGPNEMVKAG 320
Query: 117 --LRPFVRTFLEQASSLVD---IYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
+RP LE SLVD I + T S CYA+ + LD ++K S R+
Sbjct: 321 VNIRPGAVELLE---SLVDDFEIIVFTASHSCYAQQVLDYLDPENKLISHRL 369
>gi|126337836|ref|XP_001365381.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
small phosphatase 1-like [Monodelphis domestica]
Length = 346
Score = 40.8 bits (94), Expect = 0.92, Method: Compositional matrix adjust.
Identities = 37/149 (24%), Positives = 72/149 (48%), Gaps = 11/149 (7%)
Query: 64 QEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRT 123
Q+ K+ +V++LD TL+H + K +S+ + + +I + ++ + RP V
Sbjct: 171 QDLGKICVVIDLDETLVHS-SFKPVSNADFIIPVEIDGMVHQVYVLK-------RPHVDE 222
Query: 124 FLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVR-G 182
FL++ L + L T S YA+ LLD F +R+ + DL R G
Sbjct: 223 FLQRMGELFECVLFTASLAKYADPVADLLD-KWGSFRARLFRESCVFHRGNYVKDLSRLG 281
Query: 183 QE-RGIVILDDTESVWSDHTENLIVLGKY 210
++ R ++ILD++ + + H +N + + +
Sbjct: 282 RDLRRVLILDNSPASYVFHPDNAVPVASW 310
>gi|340504501|gb|EGR30938.1| NLI interacting factor-like phosphatase family protein, putative
[Ichthyophthirius multifiliis]
Length = 230
Score = 40.8 bits (94), Expect = 0.94, Method: Compositional matrix adjust.
Identities = 38/142 (26%), Positives = 68/142 (47%), Gaps = 14/142 (9%)
Query: 71 LVLNLDHTLLH-CRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQAS 129
L L+LD TL+H C SL+ + K +G + + ++RP+ FL+
Sbjct: 49 LFLDLDETLIHSC----SLNENPDVILK-----VGEINEPQFHIGFRIRPYCMDFLKALV 99
Query: 130 SLVDIYLCTMSTRCYAEAAVKLLDLDSKYFS---SRIIAREDFNGKDRKNPDLVRGQE-R 185
DIY+ T S+ Y+ A + LD + KY + +R E NG K+ + +G++ R
Sbjct: 100 EYWDIYIFTASSSTYSNAIINYLDPERKYINGILNRSNCMETKNGFFIKDLRIAKGKDLR 159
Query: 186 GIVILDDTESVWSDHTENLIVL 207
I+++D+ + +N I +
Sbjct: 160 KIILVDNLSHSFGFQIDNGIPI 181
>gi|349603764|gb|AEP99509.1| CTD small phosphatase-like protein 2-like protein, partial [Equus
caballus]
Length = 159
Score = 40.8 bits (94), Expect = 0.95, Method: Compositional matrix adjust.
Identities = 31/108 (28%), Positives = 53/108 (49%), Gaps = 6/108 (5%)
Query: 115 VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED---FNG 171
V+LRPF R FLE+ S + +I L T S + YA+ + +LD + R+ RE G
Sbjct: 19 VRLRPFFREFLERMSQMYEIILFTASKKVYADKLLNILDPKKQLVRHRLF-REHCVCVQG 77
Query: 172 KDRKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRDKELN 219
K+ +++ +I+D++ ++ N I + +F DK N
Sbjct: 78 NYIKDLNILGRDLSKTIIIDNSPQAFAYQLSNGIPIES--WFMDKNDN 123
>gi|403266876|ref|XP_003925586.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
small phosphatase 1 isoform 2 [Saimiri boliviensis
boliviensis]
Length = 248
Score = 40.8 bits (94), Expect = 0.95, Method: Compositional matrix adjust.
Identities = 36/149 (24%), Positives = 72/149 (48%), Gaps = 11/149 (7%)
Query: 64 QEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRT 123
Q+ K+ +V++LD TL+H + K +++ + + +I + ++ + RP V
Sbjct: 73 QDSDKICVVIDLDETLVHS-SFKPVNNADFIIPVEIDGVVHQVYVLK-------RPHVDE 124
Query: 124 FLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVR-G 182
FL++ L + L T S YA+ LLD F +R+ + DL R G
Sbjct: 125 FLQRMGELFECVLFTASLAKYADPVADLLD-KWGAFRARLFRESCVFHRGNYVKDLSRLG 183
Query: 183 QE-RGIVILDDTESVWSDHTENLIVLGKY 210
++ R ++ILD++ + + H +N + + +
Sbjct: 184 RDLRRVLILDNSPASYVFHPDNAVPVASW 212
>gi|384484378|gb|EIE76558.1| hypothetical protein RO3G_01262 [Rhizopus delemar RA 99-880]
Length = 348
Score = 40.8 bits (94), Expect = 0.95, Method: Compositional matrix adjust.
Identities = 32/115 (27%), Positives = 53/115 (46%), Gaps = 3/115 (2%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLE 126
RKL LVL+LD TL+ + +G + I + + + K V L VR FLE
Sbjct: 113 RKLPLVLDLDDTLV---RLVGNENGRFVSESDIPKCKDRVAVLKDGKRVVLTERVREFLE 169
Query: 127 QASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVR 181
A L DI +C++ + Y ++ + +LD + + + + R +PD R
Sbjct: 170 WAQQLYDISICSLGDQNYVDSVIDVLDPTRSWVKGILYSARAEHDYIRSSPDPGR 224
>gi|332308973|ref|NP_001193807.1| carboxy-terminal domain RNA polymerase II polypeptide A small
phosphatase 1 isoform 3 [Homo sapiens]
gi|397495664|ref|XP_003818667.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
small phosphatase 1 isoform 2 [Pan paniscus]
gi|410036206|ref|XP_003950023.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
small phosphatase 1 [Pan troglodytes]
gi|426338591|ref|XP_004033259.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
small phosphatase 1 isoform 2 [Gorilla gorilla gorilla]
Length = 260
Score = 40.8 bits (94), Expect = 0.96, Method: Compositional matrix adjust.
Identities = 37/149 (24%), Positives = 72/149 (48%), Gaps = 11/149 (7%)
Query: 64 QEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRT 123
Q+ K+ +V++LD TL+H + K +++ + + +I + ++ V RP V
Sbjct: 85 QDSDKICVVIDLDETLVHS-SFKPVNNADFIIPVEIDGVVHQVY-------VLKRPHVDE 136
Query: 124 FLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVR-G 182
FL++ L + L T S YA+ LLD F +R+ + DL R G
Sbjct: 137 FLQRMGELFECVLFTASLAKYADPVADLLD-KWGAFRARLFRESCVFHRGNYVKDLSRLG 195
Query: 183 QE-RGIVILDDTESVWSDHTENLIVLGKY 210
++ R ++ILD++ + + H +N + + +
Sbjct: 196 RDLRRVLILDNSPASYVFHPDNAVPVASW 224
>gi|355565181|gb|EHH21670.1| hypothetical protein EGK_04793 [Macaca mulatta]
Length = 270
Score = 40.4 bits (93), Expect = 1.00, Method: Compositional matrix adjust.
Identities = 38/159 (23%), Positives = 76/159 (47%), Gaps = 13/159 (8%)
Query: 54 YMLRGLRYSEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDK 113
Y+L + Q+ K+ +V++LD TL+H + K +++ + + +I + ++ +
Sbjct: 87 YLLPAAK--AQDSDKICVVIDLDETLVHS-SFKPVNNADFIIPVEIDGVVHQVYVLK--- 140
Query: 114 LVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKD 173
RP V FL++ L + L T S YA+ LLD F +R+ +
Sbjct: 141 ----RPHVDEFLQRMGELFECVLFTASLAKYADPVADLLD-KWGAFRARLFRESCVFHRG 195
Query: 174 RKNPDLVR-GQE-RGIVILDDTESVWSDHTENLIVLGKY 210
DL R G++ R ++ILD++ + + H +N + + +
Sbjct: 196 NYVKDLSRLGRDLRRVLILDNSPASYVFHPDNAVPVASW 234
>gi|341876625|gb|EGT32560.1| hypothetical protein CAEBREN_01530 [Caenorhabditis brenneri]
Length = 286
Score = 40.4 bits (93), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 25/83 (30%), Positives = 41/83 (49%), Gaps = 8/83 (9%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
LVL+LD TL+HC ++ L + + ++ V+LRP +RTFL + +
Sbjct: 67 LVLDLDETLVHC-SLTPLDNATMIFPVVFQNITYQVY-------VRLRPHLRTFLNRMAK 118
Query: 131 LVDIYLCTMSTRCYAEAAVKLLD 153
+I + T S + YA +LD
Sbjct: 119 TFEIIIFTASKKVYANKLCDILD 141
>gi|145489835|ref|XP_001430919.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124398020|emb|CAK63521.1| unnamed protein product [Paramecium tetraurelia]
Length = 253
Score = 40.4 bits (93), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 31/95 (32%), Positives = 47/95 (49%), Gaps = 4/95 (4%)
Query: 117 LRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKY---FSSRIIAREDFNGKD 173
+RPF FL+Q S L IY+ T S+ YA A V LD ++ SR E NG
Sbjct: 112 IRPFCAWFLQQMSQLYTIYVFTASSSAYANAIVNYLDPKRQWILGILSRGNCMETKNGFF 171
Query: 174 RKNPDLVRGQE-RGIVILDDTESVWSDHTENLIVL 207
K+ +V ++ + +VI+D+ + EN I +
Sbjct: 172 IKDLRIVGNKQLKDMVIVDNLAHSFGFQIENGIPI 206
>gi|302808565|ref|XP_002985977.1| hypothetical protein SELMODRAFT_123069 [Selaginella moellendorffii]
gi|300146484|gb|EFJ13154.1| hypothetical protein SELMODRAFT_123069 [Selaginella moellendorffii]
Length = 214
Score = 40.4 bits (93), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 41/154 (26%), Positives = 67/154 (43%), Gaps = 30/154 (19%)
Query: 66 ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFL 125
E K LVL++D TL+H K+ +S + F G + LV RP V TFL
Sbjct: 40 EEKPTLVLDIDETLIHAH--KATAS--------LKLFSGKTLPLQR-YLVAKRPGVDTFL 88
Query: 126 EQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVRGQE- 184
++ S + +I + T + + YA+ + LD F+ + +D +P VRG++
Sbjct: 89 DEMSKIYEIVVFTRAVKPYADRILDRLDPTGNLFTHHLY-------RDSCSPKEVRGKKV 141
Query: 185 -----------RGIVILDDTESVWSDHTENLIVL 207
R VI+DD + N +V+
Sbjct: 142 VKDLSRLGRDLRHTVIVDDKPESFCLQPSNGLVI 175
>gi|403351246|gb|EJY75109.1| hypothetical protein OXYTRI_03508 [Oxytricha trifallax]
Length = 500
Score = 40.4 bits (93), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 31/152 (20%), Positives = 73/152 (48%), Gaps = 25/152 (16%)
Query: 118 RPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLD----LDSKYFSSRIIAREDFNGKD 173
RP++ TFL+ S + I + T T+ YA+ + +D + +Y+ R + D +G
Sbjct: 363 RPYLDTFLKDLSKMGQISIFTAGTQEYADPIIDEIDPQGLIKGRYY--REHCKLDKHGNQ 420
Query: 174 RKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRDKELNGDHKSYSETLTDES 233
K +++ + +VI++D + + + +N I++ ++ T+ +
Sbjct: 421 LKPMEIITKNLKKLVIIEDQKIIKEKYPKNTILVPEF-------------------TNNN 461
Query: 234 ENEEALANVLRVLKTIHRLFFDSVCGDVRTYL 265
+ ++AL VL VL+ ++++ V D+ + +
Sbjct: 462 KKDKALLQVLNVLEQLYQMNTKDVSADLNSVI 493
>gi|402889397|ref|XP_003908003.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
small phosphatase 1 isoform 2 [Papio anubis]
Length = 260
Score = 40.4 bits (93), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 37/149 (24%), Positives = 72/149 (48%), Gaps = 11/149 (7%)
Query: 64 QEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRT 123
Q+ K+ +V++LD TL+H + K +++ + + +I + ++ V RP V
Sbjct: 85 QDSDKICVVIDLDETLVHS-SFKPVNNADFIIPVEIDGVVHQVY-------VLKRPHVDE 136
Query: 124 FLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVR-G 182
FL++ L + L T S YA+ LLD F +R+ + DL R G
Sbjct: 137 FLQRMGELFECVLFTASLAKYADPVADLLD-KWGAFRARLFRESCVFHRGNYVKDLSRLG 195
Query: 183 QE-RGIVILDDTESVWSDHTENLIVLGKY 210
++ R ++ILD++ + + H +N + + +
Sbjct: 196 RDLRRVLILDNSPASYVFHPDNAVPVASW 224
>gi|303281306|ref|XP_003059945.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226458600|gb|EEH55897.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 199
Score = 40.4 bits (93), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 51/100 (51%), Gaps = 7/100 (7%)
Query: 64 QEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRT 123
+ E K LVL+LD TL+H N+++ + SF + + V+ RP++R
Sbjct: 16 KAEPKNTLVLDLDETLVHS-NLEATEDACDF------SFPVTFNNQQHIVNVRKRPYLRE 68
Query: 124 FLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
F+E A++ ++ + T S R YAE + +D + + R+
Sbjct: 69 FMEFAAARFEVVVFTASQRVYAERLLNTIDPEKRLIKHRL 108
>gi|431914074|gb|ELK15336.1| Carboxy-terminal domain RNA polymerase II polypeptide A small
phosphatase 2 [Pteropus alecto]
Length = 271
Score = 40.4 bits (93), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 40/150 (26%), Positives = 72/150 (48%), Gaps = 19/150 (12%)
Query: 62 SEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFV 121
+EQ++ ++ +V++LD TL+H + K +++ + + +I G+ Q+ V RP+V
Sbjct: 95 TEQDQGRICVVIDLDETLVH-SSFKPINNADFIVPVEIE---GTTHQV----YVLKRPYV 146
Query: 122 RTFLEQASSLVDIYLCTMSTRCYAEAAVKLLD----LDSKYFSSRIIAREDFNGKD--RK 175
FL + L + L T S YA+ LLD ++ F + + KD R
Sbjct: 147 DEFLRRMGELFECVLFTASLAKYADPVTDLLDRCGVFRARLFRESCVFHQGCYVKDLSRL 206
Query: 176 NPDLVRGQERGIVILDDTESVWSDHTENLI 205
DL R +ILD++ + + H EN +
Sbjct: 207 GRDL-----RKTLILDNSPASYIFHPENAV 231
>gi|115495067|ref|NP_001070083.1| CTD small phosphatase-like protein [Danio rerio]
gi|115313384|gb|AAI24543.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
small phosphatase-like b [Danio rerio]
Length = 266
Score = 40.4 bits (93), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 36/142 (25%), Positives = 68/142 (47%), Gaps = 11/142 (7%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
+V++LD TL+H + K +S+ + + +I + ++ + RP V FL++
Sbjct: 99 VVIDLDETLVHS-SFKPISNADFIVPVEIAGTVHQVYVLK-------RPHVDEFLQKMGE 150
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVR-GQE-RGIV 188
L + L T S YA+ LLD F +R+ + DL R G+E R ++
Sbjct: 151 LFECVLFTASLAKYADPVADLLD-QWGVFRARLFRESCVFHRGNYVKDLSRLGRELRNVI 209
Query: 189 ILDDTESVWSDHTENLIVLGKY 210
I+D++ + + H EN + + +
Sbjct: 210 IVDNSPASYIFHPENAVPVQSW 231
>gi|389594387|ref|XP_003722416.1| conserved hypothetical protein [Leishmania major strain Friedlin]
gi|323363644|emb|CBZ12649.1| conserved hypothetical protein [Leishmania major strain Friedlin]
Length = 240
Score = 40.4 bits (93), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 41/151 (27%), Positives = 65/151 (43%), Gaps = 29/151 (19%)
Query: 62 SEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFV 121
+E + KL LVL+LD TL+ R SG Y + I F FQM D+ +++ +
Sbjct: 44 AEIYQGKLVLVLDLDETLVFAR------SGPLYARPGIPEF----FQMCKDEGIEVVVWT 93
Query: 122 RTFLEQASSLV-DIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDR------ 174
A ++V +I C + C +K+F+ + R+D N R
Sbjct: 94 AGLKAYAQAIVSNIDTCNAVSHCIYR--------HNKWFNGQPGYRKDLNALGRPLDRVL 145
Query: 175 ---KNPDLVRG-QERGIVILDDTESVWSDHT 201
PD +RG Q+ GI++ D D+T
Sbjct: 146 IVENTPDCIRGYQDNGILVSDYEGGDGEDNT 176
>gi|345561635|gb|EGX44723.1| hypothetical protein AOL_s00188g61 [Arthrobotrys oligospora ATCC
24927]
Length = 443
Score = 40.4 bits (93), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 40/152 (26%), Positives = 70/152 (46%), Gaps = 26/152 (17%)
Query: 71 LVLNLDHTLLHCRN----IKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLE 126
L+L+LD TL+H + + S E L KQ H+ + V RPF FL+
Sbjct: 270 LILDLDETLIHSMSKGGSMASAHMVEVKLDKQ-HAIL---------YYVHKRPFCDEFLK 319
Query: 127 QASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI-----IAREDFNGKDRK--NPDL 179
+ ++ + T S + YA+ + LD + KYF +R R+ KD PDL
Sbjct: 320 KVCKWYNVVIFTASVQEYADPVIDWLDQEHKYFRARYYRQHCTFRDGVYIKDLSVVEPDL 379
Query: 180 VRGQERGIVILDDTESVWSDHTENLIVLGKYV 211
+ ++I+D++ + + H +N I + ++
Sbjct: 380 SK-----VMIVDNSPTSYIFHKDNAIPIEGWI 406
>gi|340504114|gb|EGR30595.1| NLI interacting factor-like phosphatase family protein, putative
[Ichthyophthirius multifiliis]
Length = 318
Score = 40.4 bits (93), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 42/143 (29%), Positives = 68/143 (47%), Gaps = 13/143 (9%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
LV++LD TL+HC K + + L I + + D VK RP FLE S
Sbjct: 6 LVIDLDETLVHCY-FKEVEDYDFTLTINIQN-------IKFDIYVKKRPGCELFLEILSQ 57
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED---FNGKDRKNPDLVRGQERGI 187
+I + T S YA + +D +K +SRI RE+ NG K+ ++ + I
Sbjct: 58 YYEIIIFTASLGEYANPVIDQID-KNKVVASRIF-RENCTFHNGIFVKDLSKLKRDLKDI 115
Query: 188 VILDDTESVWSDHTENLIVLGKY 210
+I+D++E + EN I++ +
Sbjct: 116 IIIDNSECSFLFQKENAILIDSF 138
>gi|340914979|gb|EGS18320.1| putative nuclear envelope morphology protein [Chaetomium
thermophilum var. thermophilum DSM 1495]
Length = 532
Score = 40.4 bits (93), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 47/99 (47%), Gaps = 7/99 (7%)
Query: 71 LVLNLDHTLLHCRNIKS-LSSGEKYLKKQIHSFIGSLFQMANDK------LVKLRPFVRT 123
L+L+LD TL+H + +SSG + +++G Q V RP
Sbjct: 379 LILDLDETLIHSMSKGGRMSSGHMVEVRLNTTYVGVGGQATIGPQHPILYYVHKRPHCDE 438
Query: 124 FLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSR 162
FL + S ++ + T S + YA+ + L+ D KYFS+R
Sbjct: 439 FLRRVSKWYNLVVFTASVQEYADPVIDWLEADRKYFSAR 477
>gi|327263870|ref|XP_003216740.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
small phosphatase 2-like [Anolis carolinensis]
Length = 427
Score = 40.4 bits (93), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 40/146 (27%), Positives = 73/146 (50%), Gaps = 11/146 (7%)
Query: 62 SEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFV 121
++Q++ ++ +V++LD TL+H + K +++ + + +I G+ Q+ V RPFV
Sbjct: 251 TQQDQGRICVVIDLDETLVH-SSFKPINNADFIVPVEIE---GTTHQV----YVLKRPFV 302
Query: 122 RTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVR 181
FL + L + L T S YA+ LLD F +R+ + DL R
Sbjct: 303 DEFLRRMGELFECVLFTASLAKYADPVTDLLD-KCGVFRTRLFRESCVFHQGCYVKDLSR 361
Query: 182 -GQE-RGIVILDDTESVWSDHTENLI 205
G++ R +ILD++ + + H EN +
Sbjct: 362 LGRDLRKTLILDNSPASYIFHPENAV 387
>gi|325180168|emb|CCA14570.1| nuclear LIM factor interactorinteracting protein hyphal form
putative [Albugo laibachii Nc14]
Length = 418
Score = 40.4 bits (93), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 41/155 (26%), Positives = 74/155 (47%), Gaps = 13/155 (8%)
Query: 68 KLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQ 127
K+ LVL+LD TL+HC +++ + + Q F N V LRP + FL++
Sbjct: 235 KICLVLDLDETLVHC-SVEEIENP----NFQFDVFFNGTNYNVN---VSLRPHMHHFLKR 286
Query: 128 ASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED---FNGKDRKNPDLVRGQE 184
+ ++ + T S R YAE + LLD + R+ RED +G K+ +++
Sbjct: 287 VTKQFELVVFTASQRVYAEKLLNLLDPNRDLIKYRLY-REDCLEVDGNFLKDLNVLGRDL 345
Query: 185 RGIVILDDTESVWSDHTENLIVLGKYVY-FRDKEL 218
++++D++ + N I + + RD+EL
Sbjct: 346 ARVILVDNSPHAFGYQVNNGIPIESWFNDERDREL 380
>gi|123496080|ref|XP_001326885.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121909806|gb|EAY14662.1| hypothetical protein TVAG_460790 [Trichomonas vaginalis G3]
Length = 288
Score = 40.4 bits (93), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 34/164 (20%), Positives = 73/164 (44%), Gaps = 13/164 (7%)
Query: 51 SFDYMLRGLRYSEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMA 110
S ++ + L ++ K+ L+L+LD TL+H + ++ +L + + I
Sbjct: 106 SLEHNCKELLPPPKDPSKISLILDLDETLIHSSFVPIQNANFTFLLNAVPAPIPV----- 160
Query: 111 NDKLVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE--- 167
V +RP F+ ++ + T S + YA+ ++ +D K+ + RE
Sbjct: 161 ---SVLIRPHAEEFITSLGEKFELIVFTASNKDYADYCIE--QIDPKHLVKYKLYRESCS 215
Query: 168 DFNGKDRKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYV 211
D NG K+ L+ + ++I+D++ + H N I + ++
Sbjct: 216 DLNGATVKDLGLLNRNLKKLIIIDNSPMSYLLHPYNAIPITTWM 259
>gi|23346509|ref|NP_694728.1| carboxy-terminal domain RNA polymerase II polypeptide A small
phosphatase 1 [Mus musculus]
gi|17865506|sp|P58466.1|CTDS1_MOUSE RecName: Full=Carboxy-terminal domain RNA polymerase II polypeptide
A small phosphatase 1; AltName: Full=Golli-interacting
protein; Short=GIP; AltName: Full=Nuclear LIM
interactor-interacting factor 3; Short=NLI-interacting
factor 3; AltName: Full=Small C-terminal domain
phosphatase 1; Short=SCP1; Short=Small CTD phosphatase 1
gi|15145799|gb|AAK83555.1| golli-interacting protein [Mus musculus]
gi|40796195|gb|AAH65158.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
small phosphatase 1 [Mus musculus]
gi|51258970|gb|AAH79638.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
small phosphatase 1 [Mus musculus]
gi|57169202|gb|AAH49184.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
small phosphatase 1 [Mus musculus]
gi|74191312|dbj|BAE39480.1| unnamed protein product [Mus musculus]
gi|148667908|gb|EDL00325.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
small phosphatase 1, isoform CRA_a [Mus musculus]
Length = 261
Score = 40.4 bits (93), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 36/149 (24%), Positives = 72/149 (48%), Gaps = 11/149 (7%)
Query: 64 QEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRT 123
Q+ K+ +V++LD TL+H + K +++ + + +I + ++ + RP V
Sbjct: 86 QDSDKICVVIDLDETLVHS-SFKPVNNADFIIPVEIDGVVHQVYVLK-------RPHVDE 137
Query: 124 FLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVR-G 182
FL++ L + L T S YA+ LLD F +R+ + DL R G
Sbjct: 138 FLQRMGELFECVLFTASLAKYADPVADLLD-KWGAFRARLFRESCVFHRGNYVKDLSRLG 196
Query: 183 QE-RGIVILDDTESVWSDHTENLIVLGKY 210
++ R ++ILD++ + + H +N + + +
Sbjct: 197 RDLRRVLILDNSPASYVFHPDNAVPVASW 225
>gi|395823467|ref|XP_003785008.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
small phosphatase 1 [Otolemur garnettii]
Length = 260
Score = 40.4 bits (93), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 36/149 (24%), Positives = 72/149 (48%), Gaps = 11/149 (7%)
Query: 64 QEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRT 123
Q+ K+ +V++LD TL+H + K +++ + + +I + ++ + RP V
Sbjct: 85 QDSDKICVVIDLDETLVHS-SFKPVNNADFIIPVEIDGVVHQVYVLK-------RPHVDE 136
Query: 124 FLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVR-G 182
FL++ L + L T S YA+ LLD F +R+ + DL R G
Sbjct: 137 FLQRMGELFECVLFTASLAKYADPVADLLD-KWGAFRARLFRESCVFHRGNYVKDLSRLG 195
Query: 183 QE-RGIVILDDTESVWSDHTENLIVLGKY 210
++ R ++ILD++ + + H +N + + +
Sbjct: 196 RDLRRVLILDNSPASYVFHPDNAVPVASW 224
>gi|302773411|ref|XP_002970123.1| hypothetical protein SELMODRAFT_5881 [Selaginella moellendorffii]
gi|302807202|ref|XP_002985314.1| hypothetical protein SELMODRAFT_5876 [Selaginella moellendorffii]
gi|300147142|gb|EFJ13808.1| hypothetical protein SELMODRAFT_5876 [Selaginella moellendorffii]
gi|300162634|gb|EFJ29247.1| hypothetical protein SELMODRAFT_5881 [Selaginella moellendorffii]
Length = 126
Score = 40.4 bits (93), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 40/130 (30%), Positives = 62/130 (47%), Gaps = 18/130 (13%)
Query: 66 ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFL 125
E K LVL+LD TL++ IK E+ G F +A RP V FL
Sbjct: 10 EGKGTLVLDLDETLVY---IKC----ERGCPFNCQCGEGDGFYVAK------RPCVDDFL 56
Query: 126 EQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVR-GQE 184
+ ++ ++ L T S + YAEAA+ LLD + + F R+ + G D+ R G+E
Sbjct: 57 QLMAARFELVLWTASPQAYAEAALGLLDPEGRIFEHRLYRQHCVGGLK----DISRLGRE 112
Query: 185 RGIVILDDTE 194
+V++ D +
Sbjct: 113 LNMVVVVDDQ 122
>gi|145479543|ref|XP_001425794.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124392866|emb|CAK58396.1| unnamed protein product [Paramecium tetraurelia]
Length = 419
Score = 40.4 bits (93), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 38/157 (24%), Positives = 71/157 (45%), Gaps = 16/157 (10%)
Query: 68 KLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQ 127
K ++ +LD TL+HC +S+ S QI I + + +RPF ++
Sbjct: 225 KKTVIFDLDETLVHCNEDESMPS-------QIVLPITFPTGEKVNAGINIRPFAEKMIQL 277
Query: 128 ASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDR-----KNPDLVRG 182
S++ ++ + T S CYA + LD ++ R I R+ + KN +++
Sbjct: 278 LSNVCEVMIFTASHECYANEVINHLDPQTRV--KRRIFRDSCVTDENSIYYIKNLEVIDR 335
Query: 183 QERGIVILDDTESVWSDHTENLIVLGKYVYFRDKELN 219
+ +VI+D+ + H EN I + ++ DK+ N
Sbjct: 336 DLKDVVIVDNASYSFFHHLENGIPIVS--FYDDKQDN 370
>gi|410224860|gb|JAA09649.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
small phosphatase 1 [Pan troglodytes]
Length = 260
Score = 40.4 bits (93), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 36/149 (24%), Positives = 72/149 (48%), Gaps = 11/149 (7%)
Query: 64 QEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRT 123
Q+ K+ +V++LD TL+H + K +++ + + +I + ++ + RP V
Sbjct: 85 QDSDKICVVIDLDETLVHS-SFKPVNNADFIIPVEIDGVVHQVYVLK-------RPHVDE 136
Query: 124 FLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVR-G 182
FL++ L + L T S YA+ LLD F +R+ + DL R G
Sbjct: 137 FLQRMGELFECVLFTASLAKYADPVADLLD-KWGAFRARLFRESCVFHRGNYVKDLSRLG 195
Query: 183 QE-RGIVILDDTESVWSDHTENLIVLGKY 210
++ R ++ILD++ + + H +N + + +
Sbjct: 196 RDLRRVLILDNSPASYVFHPDNAVPVASW 224
>gi|156407316|ref|XP_001641490.1| predicted protein [Nematostella vectensis]
gi|156228629|gb|EDO49427.1| predicted protein [Nematostella vectensis]
Length = 177
Score = 40.4 bits (93), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 39/151 (25%), Positives = 74/151 (49%), Gaps = 15/151 (9%)
Query: 64 QEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRT 123
Q+ K +V++LD TL+H + K +S+ + + +I + ++ + RP V
Sbjct: 16 QDLNKKCIVIDLDETLVH-SSFKPVSNADFIVPVEIDGTVHQVYVLK-------RPHVDE 67
Query: 124 FLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKY--FSSRIIAREDFNGKDRKNPDLVR 181
FL++ + + L T S YA+ LLD KY F +R+ + DL +
Sbjct: 68 FLKRVGQIYECVLFTASLAKYADPVADLLD---KYNTFRARLFRESCVFHRGNYVKDLSK 124
Query: 182 -GQE-RGIVILDDTESVWSDHTENLIVLGKY 210
G++ + ++ILD++ + +S H EN I + +
Sbjct: 125 LGRDLKKVLILDNSPASYSFHPENAIPVTSW 155
>gi|452842521|gb|EME44457.1| hypothetical protein DOTSEDRAFT_72062 [Dothistroma septosporum
NZE10]
Length = 501
Score = 40.4 bits (93), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 47/173 (27%), Positives = 80/173 (46%), Gaps = 19/173 (10%)
Query: 59 LRYSEQEERKLQLVLNLDHTLLHCRNIKS-LSSGEKYLKKQIHSFIGSLFQMANDKL--- 114
L YS +K L+++LD TL+H +S+G + + S Q+
Sbjct: 305 LAYSPDTPKKT-LIIDLDETLIHSMAKGGRMSTGHMVEVRLVGQVSSSGVQIGPGVPILY 363
Query: 115 -VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE-DF-NG 171
V RP FL +A ++ + T S + YA+ + L+ ++KYFS R + F NG
Sbjct: 364 YVHERPGCHEFLRKARKWYNLIVFTASVQEYADPVIDWLERETKYFSGRYYRQHCTFRNG 423
Query: 172 ---KD--RKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVY-FRDKEL 218
KD + PDL + ++ILD++ + H +N I + ++ D+EL
Sbjct: 424 AYIKDLAQVEPDLSK-----VMILDNSPMSYIFHEDNAIPIEGWISDPTDREL 471
>gi|300794122|ref|NP_001179369.1| carboxy-terminal domain RNA polymerase II polypeptide A small
phosphatase 1 [Bos taurus]
gi|296490317|tpg|DAA32430.1| TPA: CTD (carboxy-terminal domain, RNA polymerase II, polypeptide
A) small phosphatase 1-like [Bos taurus]
Length = 260
Score = 40.4 bits (93), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 36/149 (24%), Positives = 72/149 (48%), Gaps = 11/149 (7%)
Query: 64 QEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRT 123
Q+ K+ +V++LD TL+H + K +++ + + +I + ++ + RP V
Sbjct: 85 QDSDKICVVIDLDETLVHS-SFKPVNNADFIIPVEIDGVVHQVYVLK-------RPHVDE 136
Query: 124 FLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVR-G 182
FL++ L + L T S YA+ LLD F +R+ + DL R G
Sbjct: 137 FLQRMGELFECVLFTASLAKYADPVADLLD-KWGAFRARLFRESCVFHRGNYVKDLSRLG 195
Query: 183 QE-RGIVILDDTESVWSDHTENLIVLGKY 210
++ R ++ILD++ + + H +N + + +
Sbjct: 196 RDLRRVLILDNSPASYVFHPDNAVPVASW 224
>gi|209156250|gb|ACI34357.1| Carboxy-terminal domain RNA polymerase II polypeptide A small
phosphatase 2 [Salmo salar]
Length = 271
Score = 40.4 bits (93), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 43/161 (26%), Positives = 75/161 (46%), Gaps = 14/161 (8%)
Query: 62 SEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFV 121
+ Q+E K+ +V++LD TL+H + K +S+ + + +I ++ + RP V
Sbjct: 95 TSQDEGKICVVIDLDETLVH-SSFKPISNADFIVPVEIEGTTHQVYVLK-------RPHV 146
Query: 122 RTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED---FNGKDRKNPD 178
FL++ L + L T S YA+ LLD F +R+ RE G K+
Sbjct: 147 DQFLQRMGELFECVLFTASLAKYADPVTDLLD-QCGVFGTRLF-RESCVFHQGCYVKDLS 204
Query: 179 LVRGQERGIVILDDTESVWSDHTENLI-VLGKYVYFRDKEL 218
+ Q +ILD++ + + H EN + V+ + D EL
Sbjct: 205 RLGRQLNKTLILDNSPASYIFHPENAVPVVSWFDDLEDTEL 245
>gi|348511669|ref|XP_003443366.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
small phosphatase 1-like [Oreochromis niloticus]
Length = 264
Score = 40.4 bits (93), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 37/149 (24%), Positives = 71/149 (47%), Gaps = 11/149 (7%)
Query: 64 QEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRT 123
+E K+ +V++LD TL+H + K +++ + + +I + ++ + RP V
Sbjct: 88 NDEGKICVVIDLDETLVHS-SFKPVNNADFIIPVEIDGTVHQVYVLK-------RPHVDE 139
Query: 124 FLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVR-G 182
FL++ + + L T S YA+ LLD F SR+ K DL R G
Sbjct: 140 FLKRMGEMFECVLFTASLSKYADPVSDLLD-KWGAFRSRLFREACVFHKGNYVKDLSRLG 198
Query: 183 QE-RGIVILDDTESVWSDHTENLIVLGKY 210
++ ++ILD++ + + H EN + + +
Sbjct: 199 RDLNKVIILDNSPASYIFHPENAVPVASW 227
>gi|189303571|ref|NP_001121551.1| carboxy-terminal domain RNA polymerase II polypeptide A small
phosphatase 1 [Rattus norvegicus]
gi|149016108|gb|EDL75354.1| rCG23761 [Rattus norvegicus]
gi|171846749|gb|AAI61976.1| Ctdsp1 protein [Rattus norvegicus]
Length = 261
Score = 40.4 bits (93), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 36/149 (24%), Positives = 72/149 (48%), Gaps = 11/149 (7%)
Query: 64 QEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRT 123
Q+ K+ +V++LD TL+H + K +++ + + +I + ++ + RP V
Sbjct: 86 QDSDKICVVIDLDETLVHS-SFKPVNNADFIIPVEIDGVVHQVYVLK-------RPHVDE 137
Query: 124 FLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVR-G 182
FL++ L + L T S YA+ LLD F +R+ + DL R G
Sbjct: 138 FLQRMGELFECVLFTASLAKYADPVADLLD-KWGAFRARLFRESCVFHRGNYVKDLSRLG 196
Query: 183 QE-RGIVILDDTESVWSDHTENLIVLGKY 210
++ R ++ILD++ + + H +N + + +
Sbjct: 197 RDLRRVLILDNSPASYVFHPDNAVPVASW 225
>gi|410258922|gb|JAA17427.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
small phosphatase 1 [Pan troglodytes]
gi|410290720|gb|JAA23960.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
small phosphatase 1 [Pan troglodytes]
Length = 260
Score = 40.4 bits (93), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 36/149 (24%), Positives = 72/149 (48%), Gaps = 11/149 (7%)
Query: 64 QEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRT 123
Q+ K+ +V++LD TL+H + K +++ + + +I + ++ + RP V
Sbjct: 85 QDSDKICVVIDLDETLVHS-SFKPVNNADFIIPVEIDGVVHQVYVLK-------RPHVDE 136
Query: 124 FLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVR-G 182
FL++ L + L T S YA+ LLD F +R+ + DL R G
Sbjct: 137 FLQRMGELFECVLFTASLAKYADPVADLLD-KWGAFRARLFRESCVFHRGNYVKDLSRLG 195
Query: 183 QE-RGIVILDDTESVWSDHTENLIVLGKY 210
++ R ++ILD++ + + H +N + + +
Sbjct: 196 RDLRRVLILDNSPASYVFHPDNAVPVASW 224
>gi|380815184|gb|AFE79466.1| carboxy-terminal domain RNA polymerase II polypeptide A small
phosphatase 1 isoform 2 [Macaca mulatta]
gi|383420375|gb|AFH33401.1| carboxy-terminal domain RNA polymerase II polypeptide A small
phosphatase 1 isoform 2 [Macaca mulatta]
gi|384948522|gb|AFI37866.1| carboxy-terminal domain RNA polymerase II polypeptide A small
phosphatase 1 isoform 2 [Macaca mulatta]
Length = 260
Score = 40.4 bits (93), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 38/159 (23%), Positives = 76/159 (47%), Gaps = 13/159 (8%)
Query: 54 YMLRGLRYSEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDK 113
Y+L + Q+ K+ +V++LD TL+H + K +++ + + +I + ++ +
Sbjct: 77 YLLPAAK--AQDSDKICVVIDLDETLVHS-SFKPVNNADFIIPVEIDGVVHQVYVLK--- 130
Query: 114 LVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKD 173
RP V FL++ L + L T S YA+ LLD F +R+ +
Sbjct: 131 ----RPHVDEFLQRMGELFECVLFTASLAKYADPVADLLD-KWGAFRARLFRESCVFHRG 185
Query: 174 RKNPDLVR-GQE-RGIVILDDTESVWSDHTENLIVLGKY 210
DL R G++ R ++ILD++ + + H +N + + +
Sbjct: 186 NYVKDLSRLGRDLRRVLILDNSPASYVFHPDNAVPVASW 224
>gi|255557435|ref|XP_002519748.1| conserved hypothetical protein [Ricinus communis]
gi|223541165|gb|EEF42721.1| conserved hypothetical protein [Ricinus communis]
Length = 474
Score = 40.4 bits (93), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 38/148 (25%), Positives = 68/148 (45%), Gaps = 12/148 (8%)
Query: 66 ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFL 125
++ + LVL+LD TL+H S ++ +F + VK RP + TFL
Sbjct: 299 KKSVTLVLDLDETLVH--------STLEHCDDADFTFTVFFNLKEHTVYVKRRPHLHTFL 350
Query: 126 EQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED---FNGKDRKNPDLVRG 182
E+ + L ++ + T S YA + +LD + K S R+ RE +G K+ ++
Sbjct: 351 ERVAELFEVVIFTASQSIYAAQLLDILDPEKKLISRRVY-RESCIFTDGSYTKDLTVLGV 409
Query: 183 QERGIVILDDTESVWSDHTENLIVLGKY 210
+ I+D++ V+S N I + +
Sbjct: 410 DLAKVAIIDNSPQVFSLQVNNGIPIKSW 437
>gi|440911023|gb|ELR60752.1| Carboxy-terminal domain RNA polymerase II polypeptide A small
phosphatase 1 [Bos grunniens mutus]
Length = 261
Score = 40.4 bits (93), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 36/149 (24%), Positives = 72/149 (48%), Gaps = 11/149 (7%)
Query: 64 QEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRT 123
Q+ K+ +V++LD TL+H + K +++ + + +I + ++ + RP V
Sbjct: 86 QDSDKICVVIDLDETLVHS-SFKPVNNADFIIPVEIDGVVHQVYVLK-------RPHVDE 137
Query: 124 FLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVR-G 182
FL++ L + L T S YA+ LLD F +R+ + DL R G
Sbjct: 138 FLQRMGELFECVLFTASLAKYADPVADLLD-KWGAFRARLFRESCVFHRGNYVKDLSRLG 196
Query: 183 QE-RGIVILDDTESVWSDHTENLIVLGKY 210
++ R ++ILD++ + + H +N + + +
Sbjct: 197 RDLRRVLILDNSPASYVFHPDNAVPVASW 225
>gi|296205578|ref|XP_002749828.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
small phosphatase 1 isoform 1 [Callithrix jacchus]
Length = 261
Score = 40.4 bits (93), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 36/149 (24%), Positives = 72/149 (48%), Gaps = 11/149 (7%)
Query: 64 QEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRT 123
Q+ K+ +V++LD TL+H + K +++ + + +I + ++ + RP V
Sbjct: 86 QDSDKICVVIDLDETLVHS-SFKPVNNADFIIPVEIDGVVHQVYVLK-------RPHVDE 137
Query: 124 FLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVR-G 182
FL++ L + L T S YA+ LLD F +R+ + DL R G
Sbjct: 138 FLQRMGELFECVLFTASLAKYADPVADLLD-KWGAFRARLFRESCVFHRGNYVKDLSRLG 196
Query: 183 QE-RGIVILDDTESVWSDHTENLIVLGKY 210
++ R ++ILD++ + + H +N + + +
Sbjct: 197 RDLRRVLILDNSPASYVFHPDNAVPVASW 225
>gi|114583310|ref|XP_001156881.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
small phosphatase 1 isoform 2 [Pan troglodytes]
Length = 261
Score = 40.4 bits (93), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 36/149 (24%), Positives = 72/149 (48%), Gaps = 11/149 (7%)
Query: 64 QEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRT 123
Q+ K+ +V++LD TL+H + K +++ + + +I + ++ + RP V
Sbjct: 86 QDSDKICVVIDLDETLVHS-SFKPVNNADFIIPVEIDGVVHQVYVLK-------RPHVDE 137
Query: 124 FLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVR-G 182
FL++ L + L T S YA+ LLD F +R+ + DL R G
Sbjct: 138 FLQRMGELFECVLFTASLAKYADPVADLLD-KWGAFRARLFRESCVFHRGNYVKDLSRLG 196
Query: 183 QE-RGIVILDDTESVWSDHTENLIVLGKY 210
++ R ++ILD++ + + H +N + + +
Sbjct: 197 RDLRRVLILDNSPASYVFHPDNAVPVASW 225
>gi|32813443|ref|NP_872580.1| carboxy-terminal domain RNA polymerase II polypeptide A small
phosphatase 1 isoform 2 [Homo sapiens]
gi|31074175|gb|AAP34397.1| small CTD phosphatase 1 [Homo sapiens]
gi|410351181|gb|JAA42194.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
small phosphatase 1 [Pan troglodytes]
Length = 260
Score = 40.4 bits (93), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 36/149 (24%), Positives = 72/149 (48%), Gaps = 11/149 (7%)
Query: 64 QEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRT 123
Q+ K+ +V++LD TL+H + K +++ + + +I + ++ + RP V
Sbjct: 85 QDSDKICVVIDLDETLVHS-SFKPVNNADFIIPVEIDGVVHQVYVLK-------RPHVDE 136
Query: 124 FLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVR-G 182
FL++ L + L T S YA+ LLD F +R+ + DL R G
Sbjct: 137 FLQRMGELFECVLFTASLAKYADPVADLLD-KWGAFRARLFRESCVFHRGNYVKDLSRLG 195
Query: 183 QE-RGIVILDDTESVWSDHTENLIVLGKY 210
++ R ++ILD++ + + H +N + + +
Sbjct: 196 RDLRRVLILDNSPASYVFHPDNAVPVASW 224
>gi|449275333|gb|EMC84205.1| Carboxy-terminal domain RNA polymerase II polypeptide A small
phosphatase 1, partial [Columba livia]
Length = 230
Score = 40.4 bits (93), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 36/149 (24%), Positives = 71/149 (47%), Gaps = 11/149 (7%)
Query: 64 QEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRT 123
Q+ L +V++LD TL+H + K +++ + + +I + ++ + RP V
Sbjct: 55 QDASNLCVVIDLDETLVH-SSFKPVNNADFIIPVEIDGIMHQVYVLK-------RPHVDE 106
Query: 124 FLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVR-G 182
FL++ L + L T S YA+ LLD F +R+ + DL R G
Sbjct: 107 FLQRMGELFECVLFTASLAKYADPVADLLD-KWGAFRARLFRESCVFHRGNYVKDLSRLG 165
Query: 183 QE-RGIVILDDTESVWSDHTENLIVLGKY 210
++ R I+I+D++ + + H +N + + +
Sbjct: 166 RDLRRIIIVDNSPASYIFHPDNAVPVASW 194
>gi|10864009|ref|NP_067021.1| carboxy-terminal domain RNA polymerase II polypeptide A small
phosphatase 1 isoform 1 [Homo sapiens]
gi|397495662|ref|XP_003818666.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
small phosphatase 1 isoform 1 [Pan paniscus]
gi|402889395|ref|XP_003908002.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
small phosphatase 1 isoform 1 [Papio anubis]
gi|426338589|ref|XP_004033258.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
small phosphatase 1 isoform 1 [Gorilla gorilla gorilla]
gi|17865510|sp|Q9GZU7.1|CTDS1_HUMAN RecName: Full=Carboxy-terminal domain RNA polymerase II polypeptide
A small phosphatase 1; AltName: Full=Nuclear LIM
interactor-interacting factor 3; Short=NLI-IF;
Short=NLI-interacting factor 3; AltName: Full=Small
C-terminal domain phosphatase 1; Short=SCP1; Short=Small
CTD phosphatase 1
gi|10257407|gb|AAG15402.1|AF229162_1 nuclear LIM interactor-interacting factor [Homo sapiens]
gi|10257410|gb|AAG15404.1| nuclear LIM interactor-interacting factor [Homo sapiens]
gi|15278033|gb|AAH12977.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
small phosphatase 1 [Homo sapiens]
gi|119591021|gb|EAW70615.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
small phosphatase 1, isoform CRA_a [Homo sapiens]
gi|119591024|gb|EAW70618.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
small phosphatase 1, isoform CRA_a [Homo sapiens]
gi|167773945|gb|ABZ92407.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
small phosphatase 1 [synthetic construct]
gi|208966090|dbj|BAG73059.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
small phosphatase 1 [synthetic construct]
Length = 261
Score = 40.4 bits (93), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 36/149 (24%), Positives = 72/149 (48%), Gaps = 11/149 (7%)
Query: 64 QEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRT 123
Q+ K+ +V++LD TL+H + K +++ + + +I + ++ + RP V
Sbjct: 86 QDSDKICVVIDLDETLVHS-SFKPVNNADFIIPVEIDGVVHQVYVLK-------RPHVDE 137
Query: 124 FLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVR-G 182
FL++ L + L T S YA+ LLD F +R+ + DL R G
Sbjct: 138 FLQRMGELFECVLFTASLAKYADPVADLLD-KWGAFRARLFRESCVFHRGNYVKDLSRLG 196
Query: 183 QE-RGIVILDDTESVWSDHTENLIVLGKY 210
++ R ++ILD++ + + H +N + + +
Sbjct: 197 RDLRRVLILDNSPASYVFHPDNAVPVASW 225
>gi|156549638|ref|XP_001604265.1| PREDICTED: RNA polymerase II subunit A C-terminal domain
phosphatase-like, partial [Nasonia vitripennis]
Length = 512
Score = 40.4 bits (93), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 26/84 (30%), Positives = 42/84 (50%), Gaps = 4/84 (4%)
Query: 133 DIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVRGQERG---IVI 189
++++CT R YA +LD D K FS RI++R++ K +L G + I
Sbjct: 1 ELHICTFGARQYAHRVAAILDNDGKLFSHRILSRDECFDPQSKTANLKALFPCGVDMVCI 60
Query: 190 LDDTESVWSDHTENLIVLGKYVYF 213
+DD + VW NL+ + Y +F
Sbjct: 61 IDDRDDVWQ-GCANLVQVKPYHFF 83
>gi|302808545|ref|XP_002985967.1| hypothetical protein SELMODRAFT_123223 [Selaginella moellendorffii]
gi|300146474|gb|EFJ13144.1| hypothetical protein SELMODRAFT_123223 [Selaginella moellendorffii]
Length = 198
Score = 40.4 bits (93), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 41/148 (27%), Positives = 67/148 (45%), Gaps = 18/148 (12%)
Query: 66 ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFL 125
E K LVL++D TL+H + + F G + LV RP V TFL
Sbjct: 24 EEKPTLVLDMDETLIHAHKATA----------SLKLFSGKTLPLQR-YLVAKRPGVDTFL 72
Query: 126 EQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRII----AREDFNGKDRKNPDLVR 181
+ S + +I + T + + YA+ + LD F+ R+ + ++ G+ + DL R
Sbjct: 73 NEMSEIYEIVVFTRAVKPYADRILDRLDPAGNLFTHRLYRDSCSPKEVGGR-KVVKDLSR 131
Query: 182 -GQE-RGIVILDDTESVWSDHTENLIVL 207
G++ R VI+DD + N IV+
Sbjct: 132 LGRDLRHTVIVDDKPESFCLQPSNGIVI 159
>gi|359323950|ref|XP_003640241.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
small phosphatase 1-like [Canis lupus familiaris]
Length = 260
Score = 40.4 bits (93), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 36/149 (24%), Positives = 72/149 (48%), Gaps = 11/149 (7%)
Query: 64 QEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRT 123
Q+ K+ +V++LD TL+H + K +++ + + +I + ++ + RP V
Sbjct: 85 QDADKICVVIDLDETLVHS-SFKPVNNADFIIPVEIDGVVHQVYVLK-------RPHVDE 136
Query: 124 FLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVR-G 182
FL++ L + L T S YA+ LLD F +R+ + DL R G
Sbjct: 137 FLQRMGELFECVLFTASLAKYADPVADLLD-KWGAFRARLFRESCVFHRGNYVKDLSRLG 195
Query: 183 QE-RGIVILDDTESVWSDHTENLIVLGKY 210
++ R ++ILD++ + + H +N + + +
Sbjct: 196 RDLRRVLILDNSPASYVFHPDNAVPVASW 224
>gi|302564542|ref|NP_001180802.1| carboxy-terminal domain RNA polymerase II polypeptide A small
phosphatase 1 [Macaca mulatta]
gi|387542952|gb|AFJ72103.1| carboxy-terminal domain RNA polymerase II polypeptide A small
phosphatase 1 isoform 1 [Macaca mulatta]
Length = 261
Score = 40.4 bits (93), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 38/159 (23%), Positives = 76/159 (47%), Gaps = 13/159 (8%)
Query: 54 YMLRGLRYSEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDK 113
Y+L + Q+ K+ +V++LD TL+H + K +++ + + +I + ++ +
Sbjct: 78 YLLPAAK--AQDSDKICVVIDLDETLVHS-SFKPVNNADFIIPVEIDGVVHQVYVLK--- 131
Query: 114 LVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKD 173
RP V FL++ L + L T S YA+ LLD F +R+ +
Sbjct: 132 ----RPHVDEFLQRMGELFECVLFTASLAKYADPVADLLD-KWGAFRARLFRESCVFHRG 186
Query: 174 RKNPDLVR-GQE-RGIVILDDTESVWSDHTENLIVLGKY 210
DL R G++ R ++ILD++ + + H +N + + +
Sbjct: 187 NYVKDLSRLGRDLRRVLILDNSPASYVFHPDNAVPVASW 225
>gi|344268533|ref|XP_003406112.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
small phosphatase 1-like [Loxodonta africana]
Length = 261
Score = 40.4 bits (93), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 36/149 (24%), Positives = 72/149 (48%), Gaps = 11/149 (7%)
Query: 64 QEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRT 123
Q+ K+ +V++LD TL+H + K +++ + + +I + ++ + RP V
Sbjct: 86 QDSDKICVVIDLDETLVHS-SFKPVNNADFIIPVEIDGVVHQVYVLK-------RPHVDE 137
Query: 124 FLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVR-G 182
FL++ L + L T S YA+ LLD F +R+ + DL R G
Sbjct: 138 FLQRMGELFECVLFTASLAKYADPVADLLD-KWGAFRARLFRESCVFHRGNYVKDLSRLG 196
Query: 183 QE-RGIVILDDTESVWSDHTENLIVLGKY 210
++ R ++ILD++ + + H +N + + +
Sbjct: 197 RDLRRVLILDNSPASYVFHPDNAVPVASW 225
>gi|346970080|gb|EGY13532.1| nuclear envelope morphology protein [Verticillium dahliae VdLs.17]
Length = 452
Score = 40.4 bits (93), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 38/155 (24%), Positives = 73/155 (47%), Gaps = 19/155 (12%)
Query: 71 LVLNLDHTLLHCRNIKS-LSSGEKYLKKQIHSFIGSLFQMANDK------LVKLRPFVRT 123
L+L+LD TL+H + +S+G + +++G+ Q + V RP+
Sbjct: 263 LILDLDETLIHSMSKGGRMSTGHMVEVRLNQTYVGAGGQTSLGPQHPILYWVNKRPYCDD 322
Query: 124 FLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI-----IAREDFNGKDRKN-- 176
FL + ++ + T S + YA+ + L+ + K+FS+R R+ KD +
Sbjct: 323 FLRRICKWYNLVVFTASVQEYADPVIDWLESERKFFSARYYRQHCTFRQGAFIKDLSSVE 382
Query: 177 PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYV 211
PDL R ++ILD++ + H +N I + ++
Sbjct: 383 PDLSR-----VMILDNSPLSYMFHQDNAIPIQGWI 412
>gi|340052675|emb|CCC46957.1| conserved hypothetical protein [Trypanosoma vivax Y486]
Length = 401
Score = 40.4 bits (93), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 36/146 (24%), Positives = 64/146 (43%), Gaps = 11/146 (7%)
Query: 68 KLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQ 127
K+ L+L+LD TL+H + L ++ S ++ V RPF++ FL+
Sbjct: 229 KVSLILDLDETLVHSSLTLQPRHYDLMLDVRVESATTRVY-------VAFRPFMQEFLQA 281
Query: 128 ASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED---FNGKDRKNPDLVRGQE 184
+ L ++ + T S Y + +D D+ S R+ RE NG K+ L+
Sbjct: 282 VAPLFEVIIFTASVSAYCNDVMNAIDPDNILGSLRLF-REHCSILNGAYVKDLSLLGRDL 340
Query: 185 RGIVILDDTESVWSDHTENLIVLGKY 210
+VILD++ + N I + +
Sbjct: 341 EKVVILDNSPVAYLFQPRNAIPITSW 366
>gi|328874828|gb|EGG23193.1| CTD small phosphatase-like protein 2 [Dictyostelium fasciculatum]
Length = 692
Score = 40.0 bits (92), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 25/92 (27%), Positives = 44/92 (47%), Gaps = 8/92 (8%)
Query: 65 EERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTF 124
+ K+ LVL+LD TL+HC ++ +F+ + + K RPF F
Sbjct: 511 DTPKISLVLDLDETLVHC--------STDPIEDPDLTFLVTFNAIEYKVYAKKRPFFEEF 562
Query: 125 LEQASSLVDIYLCTMSTRCYAEAAVKLLDLDS 156
L +AS L ++ + T S YA + ++D ++
Sbjct: 563 LVKASELFEVIIFTASQEVYANKLLNMIDPNN 594
>gi|431917984|gb|ELK17213.1| Carboxy-terminal domain RNA polymerase II polypeptide A small
phosphatase 1 [Pteropus alecto]
Length = 261
Score = 40.0 bits (92), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 36/149 (24%), Positives = 72/149 (48%), Gaps = 11/149 (7%)
Query: 64 QEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRT 123
Q+ K+ +V++LD TL+H + K +++ + + +I + ++ + RP V
Sbjct: 86 QDSDKICVVIDLDETLVHS-SFKPVNNADFIIPVEIDGVVHQVYVLK-------RPHVDE 137
Query: 124 FLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVR-G 182
FL++ L + L T S YA+ LLD F +R+ + DL R G
Sbjct: 138 FLQRMGELFECVLFTASLAKYADPVADLLD-KWGAFRARLFRESCVFHRGNYVKDLSRLG 196
Query: 183 QE-RGIVILDDTESVWSDHTENLIVLGKY 210
++ R ++ILD++ + + H +N + + +
Sbjct: 197 RDLRRVLILDNSPASYVFHPDNAVPVASW 225
>gi|52695708|pdb|1TA0|A Chain A, Three-Dimensional Structure Of A Rna-Polymerase Ii Binding
Protein With Associated Ligand
Length = 197
Score = 40.0 bits (92), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 36/149 (24%), Positives = 71/149 (47%), Gaps = 11/149 (7%)
Query: 64 QEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRT 123
Q+ K+ +V+ LD TL+H + K +++ + + +I + ++ + RP V
Sbjct: 11 QDSDKICVVIXLDETLVHS-SFKPVNNADFIIPVEIDGVVHQVYVLK-------RPHVDE 62
Query: 124 FLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVR-G 182
FL++ L + L T S YA+ LLD F +R+ + DL R G
Sbjct: 63 FLQRMGELFECVLFTASLAKYADPVADLLD-KWGAFRARLFRESCVFHRGNYVKDLSRLG 121
Query: 183 QE-RGIVILDDTESVWSDHTENLIVLGKY 210
++ R ++ILD++ + + H +N + + +
Sbjct: 122 RDLRRVLILDNSPASYVFHPDNAVPVASW 150
>gi|403342064|gb|EJY70343.1| hypothetical protein OXYTRI_08908 [Oxytricha trifallax]
Length = 378
Score = 40.0 bits (92), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 27/97 (27%), Positives = 50/97 (51%), Gaps = 6/97 (6%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLE 126
R L+ +LD TL+H + I + E+ + K + + + V +RP+V+ LE
Sbjct: 194 RHKTLIFDLDETLIHSQMITQ--NQEQEIVKDFEISLSNNVKFG----VAVRPYVQQCLE 247
Query: 127 QASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
SS ++ + T + + YA+ + +D + KYFS R+
Sbjct: 248 HLSSYYEMAIFTAAEQQYADLIIDRIDPEKKYFSQRL 284
>gi|321474691|gb|EFX85656.1| hypothetical protein DAPPUDRAFT_313811 [Daphnia pulex]
Length = 314
Score = 40.0 bits (92), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 41/162 (25%), Positives = 76/162 (46%), Gaps = 13/162 (8%)
Query: 51 SFDYMLRGLRYSEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMA 110
S Y+L Y Q+ ++ +V++LD TL+H + K +S+ + + +I + ++ +
Sbjct: 106 SAKYLLPVPHY--QDSQRKCMVIDLDETLVHS-SFKPISNADFIVPVEIDGTVHQVYVLK 162
Query: 111 NDKLVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFN 170
RP V FL + L + L T S YA+ LLD F SR+
Sbjct: 163 -------RPHVDEFLRKMGELYECVLFTASLAKYADPVADLLD-QWGVFRSRLFRESCVF 214
Query: 171 GKDRKNPDLVR-GQE-RGIVILDDTESVWSDHTENLIVLGKY 210
+ DL R G+E + +VI+D++ + + H +N + + +
Sbjct: 215 HRGNYVKDLSRLGRELQKVVIIDNSPASYIFHPDNAVPVASW 256
>gi|336472042|gb|EGO60202.1| hypothetical protein NEUTE1DRAFT_74992 [Neurospora tetrasperma FGSC
2508]
gi|350294753|gb|EGZ75838.1| hypothetical protein NEUTE2DRAFT_84748 [Neurospora tetrasperma FGSC
2509]
Length = 531
Score = 40.0 bits (92), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 41/155 (26%), Positives = 70/155 (45%), Gaps = 19/155 (12%)
Query: 71 LVLNLDHTLLHCRNIKS-LSSGEKYLKKQIHSFIGSLFQMANDK------LVKLRPFVRT 123
L+L+LD TL+H + +SSG + +++G Q V RP
Sbjct: 342 LILDLDETLIHSMSKGGRMSSGHMVEVRLNTTYVGVGGQQTIGPQHPILYYVHKRPHCDE 401
Query: 124 FLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI-----IAREDFNGKDRKN-- 176
FL + S ++ + T S + YA+ + L+ D KYFS+R R KD +
Sbjct: 402 FLRRVSKWYNLVVFTASVQEYADPVIDWLESDRKYFSARYYRQHCTFRHGAFIKDLSSVE 461
Query: 177 PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYV 211
PDL + ++ILD++ + H +N I + ++
Sbjct: 462 PDLSK-----VMILDNSPLSYMFHQDNAIPIQGWI 491
>gi|302808549|ref|XP_002985969.1| hypothetical protein SELMODRAFT_122967 [Selaginella moellendorffii]
gi|300146476|gb|EFJ13146.1| hypothetical protein SELMODRAFT_122967 [Selaginella moellendorffii]
Length = 198
Score = 40.0 bits (92), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 42/148 (28%), Positives = 70/148 (47%), Gaps = 18/148 (12%)
Query: 66 ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFL 125
E K LVL++D TL+H K+++S + F G + LV RP V TFL
Sbjct: 24 EEKPTLVLDMDETLIHAH--KAIAS--------LKLFSGKTLPLQR-YLVAKRPGVDTFL 72
Query: 126 EQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVR---- 181
+ S + +I + T + + YA+ + LD F+ R+ R+ + K+ +V+
Sbjct: 73 NEMSEIYEIVVFTRAVKPYADRILDRLDPAGNLFTHRLY-RDSCSPKEVGGRKVVKDLSR 131
Query: 182 -GQE-RGIVILDDTESVWSDHTENLIVL 207
G++ R VI+DD + N IV+
Sbjct: 132 LGRDLRHTVIVDDKLESFCLQPSNGIVI 159
>gi|164423757|ref|XP_960672.2| hypothetical protein NCU08948 [Neurospora crassa OR74A]
gi|28950150|emb|CAD71008.1| related to nuclear envelope protein NEM1 [Neurospora crassa]
gi|157070223|gb|EAA31436.2| conserved hypothetical protein [Neurospora crassa OR74A]
Length = 531
Score = 40.0 bits (92), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 41/155 (26%), Positives = 70/155 (45%), Gaps = 19/155 (12%)
Query: 71 LVLNLDHTLLHCRNIKS-LSSGEKYLKKQIHSFIGSLFQMANDK------LVKLRPFVRT 123
L+L+LD TL+H + +SSG + +++G Q V RP
Sbjct: 342 LILDLDETLIHSMSKGGRMSSGHMVEVRLNTTYVGVGGQQTIGPQHPILYYVHKRPHCDE 401
Query: 124 FLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI-----IAREDFNGKDRKN-- 176
FL + S ++ + T S + YA+ + L+ D KYFS+R R KD +
Sbjct: 402 FLRRVSKWYNLVVFTASVQEYADPVIDWLESDRKYFSARYYRQHCTFRHGAFIKDLSSVE 461
Query: 177 PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYV 211
PDL + ++ILD++ + H +N I + ++
Sbjct: 462 PDLSK-----VMILDNSPLSYMFHQDNAIPIQGWI 491
>gi|145533244|ref|XP_001452372.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124420060|emb|CAK84975.1| unnamed protein product [Paramecium tetraurelia]
Length = 250
Score = 40.0 bits (92), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 25/88 (28%), Positives = 48/88 (54%), Gaps = 8/88 (9%)
Query: 66 ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFL 125
+++ LVL+LD TL+H +++ S ++ + +I I + +K+RP+ R FL
Sbjct: 70 QKEFTLVLDLDETLIHS-DLERTSILDEEIIVKIGENIEKYY-------IKVRPYAREFL 121
Query: 126 EQASSLVDIYLCTMSTRCYAEAAVKLLD 153
+ S L D+ + T + + YA+ + LD
Sbjct: 122 QSLSQLFDLVIFTAALKEYADKVIDFLD 149
>gi|357156635|ref|XP_003577523.1| PREDICTED: CTD small phosphatase-like protein 2-like isoform 1
[Brachypodium distachyon]
Length = 411
Score = 40.0 bits (92), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 42/155 (27%), Positives = 71/155 (45%), Gaps = 13/155 (8%)
Query: 68 KLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQ 127
+ LVL+LD TL+H S + + +F + V+ RP+++ FLE+
Sbjct: 226 RTTLVLDLDETLVH--------STLEPCEDSDFTFPVHFNLREHTIYVRCRPYLKEFLER 277
Query: 128 ASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED---FNGKDRKNPDLVRGQE 184
+S+ +I + T S YAE + +LD K F R+ RE G K+ ++
Sbjct: 278 VASMFEIIIFTASQSIYAEQLLNVLDPKRKLFRHRVY-RESCVYVEGNYLKDLSVLGRDL 336
Query: 185 RGIVILDDTESVWSDHTENLIVLGKYV-YFRDKEL 218
+VI+D++ + EN I + + DKEL
Sbjct: 337 ARVVIVDNSPQAFGFQLENGIPIESWFDDPNDKEL 371
>gi|302811311|ref|XP_002987345.1| hypothetical protein SELMODRAFT_125729 [Selaginella moellendorffii]
gi|300144980|gb|EFJ11660.1| hypothetical protein SELMODRAFT_125729 [Selaginella moellendorffii]
Length = 240
Score = 40.0 bits (92), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 30/102 (29%), Positives = 48/102 (47%), Gaps = 20/102 (19%)
Query: 68 KLQLVLNLDHTLLH-----CRNIK-SLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFV 121
+ LVL+LD TL+H C N S S + ++ ++ V+ RP +
Sbjct: 44 PVALVLDLDETLVHSTTDHCGNADFSFSLHANFQRQTVY--------------VRRRPHL 89
Query: 122 RTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
+ F+E+ + L +I + T S YAE + +LD K F RI
Sbjct: 90 QMFMERVAQLFEIIVFTASQSTYAEKLLNILDPKRKVFRHRI 131
>gi|356566193|ref|XP_003551319.1| PREDICTED: CTD small phosphatase-like protein 2-like [Glycine max]
Length = 403
Score = 40.0 bits (92), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 29/93 (31%), Positives = 46/93 (49%), Gaps = 8/93 (8%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
LVL+LD TL+H S ++ + +F + + V+ RP ++ FLE+ S
Sbjct: 214 LVLDLDETLVH--------STLEHCEDVDFTFPVNFNSEEHIVYVRCRPHLKDFLERVSG 265
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
L +I + T S YAE + +LD K F R+
Sbjct: 266 LFEIIIFTASQSIYAEQLLNVLDPKRKIFRHRV 298
>gi|431896052|gb|ELK05470.1| CTD small phosphatase-like protein 2 [Pteropus alecto]
Length = 282
Score = 40.0 bits (92), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 19/49 (38%), Positives = 29/49 (59%)
Query: 115 VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
V+LRPF R FLE+ S + +I L T S + YA+ + +LD + R+
Sbjct: 142 VRLRPFFREFLERMSQMYEIILFTASKKVYADKLLNILDPKKQLVRHRL 190
>gi|341894763|gb|EGT50698.1| hypothetical protein CAEBREN_25349 [Caenorhabditis brenneri]
Length = 250
Score = 40.0 bits (92), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 25/83 (30%), Positives = 41/83 (49%), Gaps = 8/83 (9%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
LVL+LD TL+HC ++ L + + ++ V+LRP +RTFL + +
Sbjct: 31 LVLDLDETLVHC-SLTPLDNATMIFPVVFQNITYQVY-------VRLRPHLRTFLNRMAK 82
Query: 131 LVDIYLCTMSTRCYAEAAVKLLD 153
+I + T S + YA +LD
Sbjct: 83 TFEIIIFTASKKVYANKLCDILD 105
>gi|301755758|ref|XP_002913748.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
small phosphatase 1-like, partial [Ailuropoda
melanoleuca]
Length = 252
Score = 40.0 bits (92), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 36/149 (24%), Positives = 72/149 (48%), Gaps = 11/149 (7%)
Query: 64 QEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRT 123
Q+ K+ +V++LD TL+H + K +++ + + +I + ++ + RP V
Sbjct: 77 QDVDKICVVIDLDETLVHS-SFKPVNNADFIIPVEIDGVVHQVYVLK-------RPHVDE 128
Query: 124 FLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVR-G 182
FL++ L + L T S YA+ LLD F +R+ + DL R G
Sbjct: 129 FLQRMGELFECVLFTASLAKYADPVADLLD-KWGAFRARLFRESCVFHRGNYVKDLSRLG 187
Query: 183 QE-RGIVILDDTESVWSDHTENLIVLGKY 210
++ R ++ILD++ + + H +N + + +
Sbjct: 188 RDLRRVLILDNSPASYVFHPDNAVPVASW 216
>gi|336268969|ref|XP_003349246.1| hypothetical protein SMAC_05530 [Sordaria macrospora k-hell]
gi|380089819|emb|CCC12352.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 532
Score = 40.0 bits (92), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 41/155 (26%), Positives = 70/155 (45%), Gaps = 19/155 (12%)
Query: 71 LVLNLDHTLLHCRNIKS-LSSGEKYLKKQIHSFIGSLFQMANDK------LVKLRPFVRT 123
L+L+LD TL+H + +SSG + +++G Q V RP
Sbjct: 343 LILDLDETLIHSMSKGGRMSSGHMVEVRLNTTYVGVGGQQTIGPQHPILYYVHKRPHCDE 402
Query: 124 FLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI-----IAREDFNGKDRKN-- 176
FL + S ++ + T S + YA+ + L+ D KYFS+R R KD +
Sbjct: 403 FLRRVSKWYNLVVFTASVQEYADPVIDWLESDRKYFSARYYRQHCTFRHGAFIKDLSSVE 462
Query: 177 PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYV 211
PDL + ++ILD++ + H +N I + ++
Sbjct: 463 PDLSK-----VMILDNSPLSYMFHQDNAIPIQGWI 492
>gi|145533471|ref|XP_001452480.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124420179|emb|CAK85083.1| unnamed protein product [Paramecium tetraurelia]
Length = 592
Score = 40.0 bits (92), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 32/94 (34%), Positives = 46/94 (48%), Gaps = 9/94 (9%)
Query: 115 VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAR-----EDF 169
V RPF+ TFL+Q S L + L T YA + + + KYF+ + + +DF
Sbjct: 453 VHQRPFLLTFLKQMSRLYQLILFTAGLESYANRILSQITI-KKYFTHLLFRQHTNIYQDF 511
Query: 170 NGKDRKNPDLVRGQERGIVILDDTESVWSDHTEN 203
GKD + L R R I+I D+T +S EN
Sbjct: 512 YGKDLR--KLGRLLSRTIII-DNTPECFSLQPEN 542
>gi|328772991|gb|EGF83028.1| hypothetical protein BATDEDRAFT_8275, partial [Batrachochytrium
dendrobatidis JAM81]
Length = 184
Score = 40.0 bits (92), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 41/159 (25%), Positives = 77/159 (48%), Gaps = 13/159 (8%)
Query: 54 YMLRGLRYSEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDK 113
Y+L+ L +E RK LVL+LD TL+H + K ++ + + +I I +++ +
Sbjct: 1 YLLKELA-AEDVGRKC-LVLDLDETLVHS-SFKPVAKADFIIPVEIDKTIHNVYVLK--- 54
Query: 114 LVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRII--AREDFNG 171
RP V TFL++ + ++ + T S YA+ + +LD K R+ A G
Sbjct: 55 ----RPGVDTFLQRLGTQFEVVVFTASLAKYADPVLDMLD-KHKVVKHRLFREACIHHKG 109
Query: 172 KDRKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKY 210
K+ L+ + ++I+D++ S + H N I + +
Sbjct: 110 NYVKDLSLLGRNLKDVIIIDNSPSCYLFHPANAIPITSW 148
>gi|148232046|ref|NP_001084286.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
small phosphatase-like [Xenopus laevis]
gi|32396218|gb|AAP43959.1| NIF [Xenopus laevis]
gi|114107822|gb|AAI23152.1| NIF protein [Xenopus laevis]
Length = 276
Score = 40.0 bits (92), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 43/168 (25%), Positives = 81/168 (48%), Gaps = 14/168 (8%)
Query: 54 YMLRGLRYSEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDK 113
Y+L L+ SE ++ +V++LD TL+H + K +++ + + +I I ++ +
Sbjct: 94 YLLPELKVSEYGKK--CVVIDLDETLVH-SSFKPINNADFIVPVEIDGTIHQVYVLK--- 147
Query: 114 LVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKD 173
RP V FL++ + + L T S YA+ LLD F++R+ +
Sbjct: 148 ----RPHVDEFLQKMGEMFECVLFTASLAKYADPVADLLD-RWGVFNARLFRESCVFHRG 202
Query: 174 RKNPDLVR-GQERG-IVILDDTESVWSDHTENLI-VLGKYVYFRDKEL 218
DL R G+E ++I+D++ + + H EN + V+ + D EL
Sbjct: 203 NYVKDLSRLGRELSKVIIIDNSPASYIFHPENAVPVMSWFDDMADTEL 250
>gi|66808305|ref|XP_637875.1| dullard-like phosphatase domain containing protein [Dictyostelium
discoideum AX4]
gi|60466303|gb|EAL64364.1| dullard-like phosphatase domain containing protein [Dictyostelium
discoideum AX4]
Length = 344
Score = 40.0 bits (92), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 41/165 (24%), Positives = 76/165 (46%), Gaps = 19/165 (11%)
Query: 68 KLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQ 127
K L+L+LD TL+H +K ++ + +K I + + V RP V FLE+
Sbjct: 171 KKTLILDLDETLVHST-LKPVTHHQITVKVLIEDMDCTFY-------VIKRPHVDYFLEK 222
Query: 128 ASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE--DFNGKDRKNPDLVRGQER 185
S DI + T S + YA+ + LD K F R+ + +G K+ ++
Sbjct: 223 VSQWYDIVIFTASMQQYADPLLDQLDT-HKVFKKRLFRDSCLEKDGNFVKDLSMIDQDLT 281
Query: 186 GIVILDDTESVWSDHTENLIVLGKYVYFRDKELNGDHKSYSETLT 230
+I+D++ +S++ EN + + ++ GD+ S + L+
Sbjct: 282 STIIIDNSPIAYSNNLENALPIDNWM--------GDNPSDTSLLS 318
>gi|432112038|gb|ELK35066.1| Carboxy-terminal domain RNA polymerase II polypeptide A small
phosphatase 2 [Myotis davidii]
Length = 262
Score = 40.0 bits (92), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 39/151 (25%), Positives = 75/151 (49%), Gaps = 11/151 (7%)
Query: 62 SEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFV 121
+E+++ ++ +V++LD TL+H + K +++ + + +I G+ Q+ V RP+V
Sbjct: 92 TEEDQGRICVVIDLDETLVH-SSFKPINNADFIVPVEIE---GTTHQV----YVLKRPYV 143
Query: 122 RTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVR 181
FL + L + L T S YA+ LLD F +R+ + DL R
Sbjct: 144 DEFLRRMGELFECVLFTASLAKYADPVTDLLD-RCGVFRARLFRESCVFHQGCYVKDLSR 202
Query: 182 -GQE-RGIVILDDTESVWSDHTENLIVLGKY 210
G++ R +ILD++ + + H EN + + +
Sbjct: 203 LGRDLRKTLILDNSPASYIFHPENAVPVQSW 233
>gi|299470416|emb|CBN80177.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 613
Score = 40.0 bits (92), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 18/48 (37%), Positives = 30/48 (62%)
Query: 115 VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSR 162
V+LRP + FLE+ +++ ++ + T S R YA+A + LLD F+ R
Sbjct: 448 VQLRPGLARFLEKVAAIYELVVWTASGRSYADAIIDLLDPAGDIFAER 495
>gi|281340231|gb|EFB15815.1| hypothetical protein PANDA_001554 [Ailuropoda melanoleuca]
Length = 243
Score = 40.0 bits (92), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 36/149 (24%), Positives = 72/149 (48%), Gaps = 11/149 (7%)
Query: 64 QEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRT 123
Q+ K+ +V++LD TL+H + K +++ + + +I + ++ + RP V
Sbjct: 68 QDVDKICVVIDLDETLVHS-SFKPVNNADFIIPVEIDGVVHQVYVLK-------RPHVDE 119
Query: 124 FLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVR-G 182
FL++ L + L T S YA+ LLD F +R+ + DL R G
Sbjct: 120 FLQRMGELFECVLFTASLAKYADPVADLLD-KWGAFRARLFRESCVFHRGNYVKDLSRLG 178
Query: 183 QE-RGIVILDDTESVWSDHTENLIVLGKY 210
++ R ++ILD++ + + H +N + + +
Sbjct: 179 RDLRRVLILDNSPASYVFHPDNAVPVASW 207
>gi|302806326|ref|XP_002984913.1| hypothetical protein SELMODRAFT_5868 [Selaginella moellendorffii]
gi|300147499|gb|EFJ14163.1| hypothetical protein SELMODRAFT_5868 [Selaginella moellendorffii]
Length = 173
Score = 40.0 bits (92), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 42/146 (28%), Positives = 70/146 (47%), Gaps = 18/146 (12%)
Query: 68 KLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQ 127
K LVL++D TL+H K+ +S + F G + + LV RP V TFL +
Sbjct: 2 KPTLVLDMDETLIHAH--KATAS--------LKLFSGKILPLQR-YLVAKRPGVDTFLNE 50
Query: 128 ASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRII----AREDFNGKDRKNPDLVR-G 182
S + +I + T + + YA+ + LD F+ R+ + ++ G+ + DL R G
Sbjct: 51 MSQIYEIVVFTRAVKPYADRILDRLDPAGNLFTHRLYRDSCSPKEVGGR-KVVKDLSRLG 109
Query: 183 QE-RGIVILDDTESVWSDHTENLIVL 207
++ R VI+DD + N IV+
Sbjct: 110 RDLRHTVIVDDKPESFCLQPSNGIVI 135
>gi|145500510|ref|XP_001436238.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124403377|emb|CAK68841.1| unnamed protein product [Paramecium tetraurelia]
Length = 494
Score = 40.0 bits (92), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 28/95 (29%), Positives = 40/95 (42%), Gaps = 7/95 (7%)
Query: 68 KLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQ 127
K +V +LD TL+HC+ S I G Q + LRP+ R L
Sbjct: 305 KKTIVFDLDETLIHCQESNDDPSDTVLT---IKFPTGETVQAG----INLRPYCREMLAI 357
Query: 128 ASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSR 162
S +I + T S CYA+ + +D D K+ R
Sbjct: 358 LSQKYEIIVFTASHECYAQKVINYIDPDKKWIHHR 392
>gi|145488647|ref|XP_001430327.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124397424|emb|CAK62929.1| unnamed protein product [Paramecium tetraurelia]
Length = 571
Score = 40.0 bits (92), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 28/93 (30%), Positives = 43/93 (46%), Gaps = 7/93 (7%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
+V +LD TL+HC N G+ L I G Q + + +RPF + L+ S
Sbjct: 377 VVFDLDETLIHC-NESVAVPGDVVLP--ITFPTGETIQAS----INIRPFAQQILQTLSR 429
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
+I + T S CYA + LD ++ S R+
Sbjct: 430 HFEIIVFTASHSCYANVVLDYLDPKKQWISHRL 462
>gi|328868172|gb|EGG16552.1| dullard-like phosphatase domain containing protein [Dictyostelium
fasciculatum]
Length = 297
Score = 40.0 bits (92), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 42/148 (28%), Positives = 68/148 (45%), Gaps = 15/148 (10%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLE 126
K LVL+LD TL+H + K +++ + + +I I +F V RP V FL
Sbjct: 126 NKKTLVLDLDETLVHS-SFKPVANPDFVVPVEIEGIIHQVF-------VVKRPHVDEFLR 177
Query: 127 QASSLVDIYLCTMSTRCYAEAAVKLLDLDSKY--FSSRIIAREDFNGKDRKNPDLVR-GQ 183
+I + T S YA+ + LLD KY R+ N K DL R G+
Sbjct: 178 AVGEHFEIVVFTASLAKYADPVLNLLD---KYQVVHWRLFRESCHNHKGNYVKDLSRIGR 234
Query: 184 E-RGIVILDDTESVWSDHTENLIVLGKY 210
+ + +I+D++ + + H EN I + +
Sbjct: 235 DLKSTIIIDNSPTSYMFHPENAIPVDSW 262
>gi|145526783|ref|XP_001449197.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124416774|emb|CAK81800.1| unnamed protein product [Paramecium tetraurelia]
Length = 495
Score = 40.0 bits (92), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 28/95 (29%), Positives = 41/95 (43%), Gaps = 7/95 (7%)
Query: 68 KLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQ 127
K +V +LD TL+HC+ S + I G Q + LRP+ R L
Sbjct: 306 KKTIVFDLDETLIHCQESNDDPSD---IVLTIKFPTGETVQAG----INLRPYCREMLAI 358
Query: 128 ASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSR 162
S +I + T S CYA+ + +D D K+ R
Sbjct: 359 LSQKYEIIVFTASHECYAQKVINYIDPDKKWIHHR 393
>gi|302814947|ref|XP_002989156.1| hypothetical protein SELMODRAFT_129286 [Selaginella moellendorffii]
gi|300143056|gb|EFJ09750.1| hypothetical protein SELMODRAFT_129286 [Selaginella moellendorffii]
Length = 245
Score = 40.0 bits (92), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 30/102 (29%), Positives = 48/102 (47%), Gaps = 20/102 (19%)
Query: 68 KLQLVLNLDHTLLH-----CRNIK-SLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFV 121
+ LVL+LD TL+H C N S S + ++ ++ V+ RP +
Sbjct: 44 PVALVLDLDETLVHSTTDHCGNADFSFSLHANFQRQTVY--------------VRRRPHL 89
Query: 122 RTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
+ F+E+ + L +I + T S YAE + +LD K F RI
Sbjct: 90 QMFMERVAQLFEIIVFTASQSTYAEKLLNILDPKRKVFRHRI 131
>gi|260807745|ref|XP_002598669.1| hypothetical protein BRAFLDRAFT_67070 [Branchiostoma floridae]
gi|229283942|gb|EEN54681.1| hypothetical protein BRAFLDRAFT_67070 [Branchiostoma floridae]
Length = 258
Score = 40.0 bits (92), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 38/168 (22%), Positives = 80/168 (47%), Gaps = 13/168 (7%)
Query: 45 NDSFGLSFDYMLRGLRYSEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIG 104
N S + Y+L +R+ Q+ K +V++LD TL+H + K +++ + + +I +
Sbjct: 65 NGSAKVPQKYLLPPVRH--QDMHKKCIVIDLDETLVH-SSFKPVTNADFIVPVEIDGTVH 121
Query: 105 SLFQMANDKLVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRII 164
++ + RP+V FL++ + + L T S YA+ LLD F +R+
Sbjct: 122 QVYVLK-------RPYVDEFLQKMGEMFECVLFTASLAKYADPVADLLD-KWGVFRARLF 173
Query: 165 AREDFNGKDRKNPDLVR-GQER-GIVILDDTESVWSDHTENLIVLGKY 210
+ DL R G++ ++I+D++ + + H +N + + +
Sbjct: 174 RDSCVFHRGNYVKDLSRLGRDLCKVIIVDNSPASYIFHPDNAVPVASW 221
>gi|410969412|ref|XP_003991189.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
small phosphatase 1 [Felis catus]
Length = 259
Score = 40.0 bits (92), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 36/149 (24%), Positives = 72/149 (48%), Gaps = 11/149 (7%)
Query: 64 QEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRT 123
Q+ K+ +V++LD TL+H + K +++ + + +I + ++ + RP V
Sbjct: 84 QDVDKICVVIDLDETLVHS-SFKPVNNADFIIPVEIDGVVHQVYVLK-------RPHVDE 135
Query: 124 FLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVR-G 182
FL++ L + L T S YA+ LLD F +R+ + DL R G
Sbjct: 136 FLQRMGELFECVLFTASLAKYADPVADLLD-KWGAFRARLFRESCVFHRGNYVKDLSRLG 194
Query: 183 QE-RGIVILDDTESVWSDHTENLIVLGKY 210
++ R ++ILD++ + + H +N + + +
Sbjct: 195 RDLRRVLILDNSPASYVFHPDNAVPVASW 223
>gi|354502403|ref|XP_003513276.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
small phosphatase 1-like [Cricetulus griseus]
Length = 342
Score = 40.0 bits (92), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 39/153 (25%), Positives = 70/153 (45%), Gaps = 19/153 (12%)
Query: 64 QEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRT 123
Q+ K+ +V++LD TL+H + K +++ + + +I I ++ V RP V
Sbjct: 167 QDSDKICVVIDLDETLVHS-SFKPVNNADFIIPVEIDGVIHQVY-------VLKRPHVDE 218
Query: 124 FLEQASSLVDIYLCTMSTRCYAEAAVKLLD----LDSKYFSSRIIAREDFNGKD--RKNP 177
FL++ L + L T S YA+ LLD ++ F + KD R
Sbjct: 219 FLQRMGELFECVLFTASLAKYADPVADLLDKWGAFRARLFRESCVFHRGNYVKDLSRLGR 278
Query: 178 DLVRGQERGIVILDDTESVWSDHTENLIVLGKY 210
DL RG +ILD++ + + H +N + + +
Sbjct: 279 DLRRG-----LILDNSPASYVFHPDNAVPVASW 306
>gi|195127712|ref|XP_002008312.1| GI13418 [Drosophila mojavensis]
gi|193919921|gb|EDW18788.1| GI13418 [Drosophila mojavensis]
Length = 331
Score = 39.7 bits (91), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 39/160 (24%), Positives = 78/160 (48%), Gaps = 15/160 (9%)
Query: 54 YMLRGLRYSEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDK 113
Y+L +R+S+ ++ + V++LD TL+H + K + + + + +I I ++ +
Sbjct: 75 YLLPQIRHSDMHKKCM--VIDLDETLVHS-SFKPIPNADFIVPVEIDGTIHQVYVLK--- 128
Query: 114 LVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED---FN 170
RP V FL++ L + L T S YA+ LLD F +R+ RE +
Sbjct: 129 ----RPHVDEFLQKMGELYECVLFTASLAKYADPVADLLD-KWNVFRARLF-RESCVYYR 182
Query: 171 GKDRKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKY 210
G K+ + + + IVI+D++ + + H +N + + +
Sbjct: 183 GNYIKDLNRLGRDLQKIVIVDNSPASYIFHPDNAVPVKSW 222
>gi|302794308|ref|XP_002978918.1| hypothetical protein SELMODRAFT_418692 [Selaginella moellendorffii]
gi|300153236|gb|EFJ19875.1| hypothetical protein SELMODRAFT_418692 [Selaginella moellendorffii]
Length = 218
Score = 39.7 bits (91), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 42/148 (28%), Positives = 71/148 (47%), Gaps = 18/148 (12%)
Query: 66 ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFL 125
E K LVL++D TL+H K+ +S + F G + + LV RP V FL
Sbjct: 40 EEKPTLVLDMDETLIHAH--KATAS--------LKLFSGKILPLER-YLVAKRPGVDIFL 88
Query: 126 EQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRII----AREDFNGKDRKNPDLVR 181
++ S + +I + T + + YA+ + LD F+ R+ + ++ G+ + DL R
Sbjct: 89 DEMSKIYEIVVFTRAVKPYADRILDRLDPAGNLFAHRLYRDSCSTKEVGGR-KVVKDLSR 147
Query: 182 -GQE-RGIVILDDTESVWSDHTENLIVL 207
G++ R VI+DD + N IV+
Sbjct: 148 LGRDLRHTVIVDDKPESFFLQPNNGIVI 175
>gi|281209812|gb|EFA83980.1| dullard-like phosphatase domain containing protein [Polysphondylium
pallidum PN500]
Length = 270
Score = 39.7 bits (91), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 40/145 (27%), Positives = 71/145 (48%), Gaps = 11/145 (7%)
Query: 68 KLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQ 127
K LVL+LD TL+H + K ++ + + +I G L Q+ V RP V F++
Sbjct: 76 KKTLVLDLDETLVHS-SFKPVAKADFIVPVEIE---GQLHQV----YVSKRPHVDEFMQA 127
Query: 128 ASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVR-GQE-R 185
S +I + T S YA+ + LLD +++ R+ + K DL R G++ +
Sbjct: 128 ISQKFEIVVFTASLAKYADPVLDLLD-PNRFVHHRLFREACHHHKGNFVKDLSRLGRDLK 186
Query: 186 GIVILDDTESVWSDHTENLIVLGKY 210
+I+D++ + + H EN I + +
Sbjct: 187 TTIIIDNSPTSYLFHPENAIPIDSW 211
>gi|326429212|gb|EGD74782.1| hypothetical protein PTSG_07015 [Salpingoeca sp. ATCC 50818]
Length = 797
Score = 39.7 bits (91), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 40/145 (27%), Positives = 74/145 (51%), Gaps = 16/145 (11%)
Query: 68 KLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQ 127
++ LVL+LD TL+H ++ + H G ++ ++RP R FL +
Sbjct: 307 RMTLVLDLDETLVHSLTTP-VADADVAFDISAH---GQSLRI----YTRVRPHARDFLRR 358
Query: 128 ASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE--DFN-GKDRKNPDLVR-GQ 183
+ ++ L T S + YA+A ++ LD +++F R+ RE DF G KN L R G+
Sbjct: 359 VAQRYEVVLFTASMQVYADALLEQLDPHNEFFHHRLF-REHCDFQFGIHLKN--LTRLGR 415
Query: 184 E-RGIVILDDTESVWSDHTENLIVL 207
+ R ++++D++ V++ N I +
Sbjct: 416 DLRRVMLVDNSPQVFAYQLSNGIPI 440
>gi|123434330|ref|XP_001308790.1| NLI interacting factor-like phosphatase family protein [Trichomonas
vaginalis G3]
gi|121890487|gb|EAX95860.1| NLI interacting factor-like phosphatase family protein [Trichomonas
vaginalis G3]
Length = 324
Score = 39.7 bits (91), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 38/156 (24%), Positives = 66/156 (42%), Gaps = 21/156 (13%)
Query: 62 SEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFV 121
S ++ K+ LVL+LD TL+H + + + F + Q V +RP
Sbjct: 151 SSEDRGKICLVLDLDETLVHSSFLA--------IPHADYRFNIGVEQNPVGVFVCVRPGA 202
Query: 122 RTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRII-------AREDFNGKDR 174
FL + SL +I + T S + YA+ + +D R++ A DFNG
Sbjct: 203 EKFLRELGSLYEIIIFTASCQVYADPVIDFID------KGRVVKYRLYREACTDFNGSFV 256
Query: 175 KNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKY 210
K+ + I+I+D++ + N I +G +
Sbjct: 257 KDLSRLNRPLEKIIIIDNSSVAYLLQPYNAIPIGSW 292
>gi|344253634|gb|EGW09738.1| Carboxy-terminal domain RNA polymerase II polypeptide A small
phosphatase 1 [Cricetulus griseus]
Length = 354
Score = 39.7 bits (91), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 39/153 (25%), Positives = 70/153 (45%), Gaps = 19/153 (12%)
Query: 64 QEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRT 123
Q+ K+ +V++LD TL+H + K +++ + + +I I ++ V RP V
Sbjct: 179 QDSDKICVVIDLDETLVHS-SFKPVNNADFIIPVEIDGVIHQVY-------VLKRPHVDE 230
Query: 124 FLEQASSLVDIYLCTMSTRCYAEAAVKLLD----LDSKYFSSRIIAREDFNGKD--RKNP 177
FL++ L + L T S YA+ LLD ++ F + KD R
Sbjct: 231 FLQRMGELFECVLFTASLAKYADPVADLLDKWGAFRARLFRESCVFHRGNYVKDLSRLGR 290
Query: 178 DLVRGQERGIVILDDTESVWSDHTENLIVLGKY 210
DL RG +ILD++ + + H +N + + +
Sbjct: 291 DLRRG-----LILDNSPASYVFHPDNAVPVASW 318
>gi|393215753|gb|EJD01244.1| NIF-domain-containing protein [Fomitiporia mediterranea MF3/22]
Length = 507
Score = 39.7 bits (91), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 57/217 (26%), Positives = 86/217 (39%), Gaps = 55/217 (25%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKL------------- 117
LVL+LD TL+H + L SG + + S IG +V++
Sbjct: 319 LVLDLDETLIHS-TTRPLPSGGRNGLFNLGSLIGFGHNRKAGHIVEVVMNNRSTLYHVYK 377
Query: 118 RPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED----FNGKD 173
RPFV FL + S+ + + T S + YA+ + LD S R RE NG
Sbjct: 378 RPFVDYFLRKVSAWYTLVIFTASMKEYADPVIDWLDAGRGILSLRFF-REHCTQLPNGSY 436
Query: 174 RK-----NPDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRDKELNGDHKSYSET 228
K N DL R I ++D++ + +S + N I + + H Y
Sbjct: 437 SKDLSILNEDLAR-----ICLIDNSPASYSINKANGIPIEGWT----------HDPY--- 478
Query: 229 LTDESENEEALANVLRVLKTIHRLFFDSVCGDVRTYL 265
+EAL ++L VL ++ GDVR L
Sbjct: 479 -------DEALLDLLPVLDSLR------FTGDVRHIL 502
>gi|302806328|ref|XP_002984914.1| hypothetical protein SELMODRAFT_121036 [Selaginella moellendorffii]
gi|302806330|ref|XP_002984915.1| hypothetical protein SELMODRAFT_121271 [Selaginella moellendorffii]
gi|300147500|gb|EFJ14164.1| hypothetical protein SELMODRAFT_121036 [Selaginella moellendorffii]
gi|300147501|gb|EFJ14165.1| hypothetical protein SELMODRAFT_121271 [Selaginella moellendorffii]
Length = 198
Score = 39.7 bits (91), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 41/148 (27%), Positives = 67/148 (45%), Gaps = 18/148 (12%)
Query: 66 ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFL 125
E K LVL++D TL+H + + F G + LV RP V TFL
Sbjct: 24 EEKPTLVLDMDETLIHAHKATA----------SLKLFSGRTLPLQR-YLVAKRPGVDTFL 72
Query: 126 EQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRII----AREDFNGKDRKNPDLVR 181
+ S + +I + T + + YA+ + LD F+ R+ + ++ G+ + DL R
Sbjct: 73 NEMSQIYEIVVFTRAVKPYADRILDRLDPAGNLFTHRLYRDSCSPKEVGGR-KVVKDLSR 131
Query: 182 -GQE-RGIVILDDTESVWSDHTENLIVL 207
G++ R VI+DD + N IV+
Sbjct: 132 LGRDLRHTVIVDDKPESFCLQPSNGIVI 159
>gi|351699531|gb|EHB02450.1| Carboxy-terminal domain RNA polymerase II polypeptide A small
phosphatase 1 [Heterocephalus glaber]
Length = 261
Score = 39.7 bits (91), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 37/153 (24%), Positives = 70/153 (45%), Gaps = 19/153 (12%)
Query: 64 QEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRT 123
Q+ K+ +V++LD TL+H + K +++ + + +I + ++ + RP V
Sbjct: 86 QDSDKICVVIDLDETLVHS-SFKPVNNADFIIPVEIDGVVHQVYVLK-------RPHVDE 137
Query: 124 FLEQASSLVDIYLCTMSTRCYAEAAVKLLD----LDSKYFSSRIIAREDFNGKD--RKNP 177
FL++ L + L T S YA+ LLD ++ F + KD R
Sbjct: 138 FLQRMGELFECVLFTASLAKYADPVADLLDKWGAFRARLFRESCVFHRGNYVKDLSRLGR 197
Query: 178 DLVRGQERGIVILDDTESVWSDHTENLIVLGKY 210
DL RG +ILD++ + + H +N + + +
Sbjct: 198 DLRRG-----LILDNSPASYVFHPDNAVPVASW 225
>gi|26449836|dbj|BAC42041.1| unknown protein [Arabidopsis thaliana]
Length = 453
Score = 39.7 bits (91), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 28/98 (28%), Positives = 48/98 (48%), Gaps = 10/98 (10%)
Query: 66 ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKL-VKLRPFVRTF 124
++ + LVL+LD TL+H ++S + + + F M + + V+ RP + F
Sbjct: 278 KKSVTLVLDLDETLVHS-TLESCNVADFSFR--------VFFNMQENTVYVRQRPHLYRF 328
Query: 125 LEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSR 162
LE+ L + + T S YA + +LD D K+ S R
Sbjct: 329 LERVGELFHVVIFTASHSIYASQLLDILDPDGKFISQR 366
>gi|195382318|ref|XP_002049877.1| GJ20507 [Drosophila virilis]
gi|194144674|gb|EDW61070.1| GJ20507 [Drosophila virilis]
Length = 305
Score = 39.7 bits (91), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 39/158 (24%), Positives = 65/158 (41%), Gaps = 10/158 (6%)
Query: 71 LVLNLDHTLLHC----RNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKL----RPFVR 122
LVL+LD TL+H + + + + ++ + +AN ++ RP+V
Sbjct: 117 LVLDLDETLVHSCYLDPDTNDVVGCNFVPETAVPDYVMHIPILANFHPIEFQVFKRPYVD 176
Query: 123 TFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKD-RKNPDLVR 181
FL D+ + T S YA + LD R+ + + KN V
Sbjct: 177 EFLNFVGRWYDLVIYTASLEAYASNVIDRLDAGRGILQRRLYRQHCISTTVVTKNLYAVN 236
Query: 182 GQERGIVILDDTESVWSDHTENLIVLGKYVY-FRDKEL 218
I I+D++ S + D EN I + Y+Y D+EL
Sbjct: 237 QDLTSIFIIDNSPSAYRDFPENAIPIKSYIYDPNDQEL 274
>gi|124506237|ref|XP_001351716.1| protein phosphatase, putative [Plasmodium falciparum 3D7]
gi|23504645|emb|CAD51523.1| protein phosphatase, putative [Plasmodium falciparum 3D7]
Length = 328
Score = 39.7 bits (91), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 32/150 (21%), Positives = 70/150 (46%), Gaps = 22/150 (14%)
Query: 69 LQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKL----RPFVRTF 124
+ LVL+LD TL++C KK+ + + + + N K + L RP++ F
Sbjct: 58 MTLVLDLDETLIYCT------------KKRKYHYQKEVDVLINGKYLPLYVCKRPYIDLF 105
Query: 125 LEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED-FNGKDR---KNPDLV 180
+I + T + + YA+ + ++D+D ++ + RED + ++ KN +
Sbjct: 106 FSSLYPFYEIIIFTTAIKSYADTVLNIIDVD--HYIDKKFYREDCYEMNEKLYIKNLTNI 163
Query: 181 RGQERGIVILDDTESVWSDHTENLIVLGKY 210
+ + I+++DD+ + +N + K+
Sbjct: 164 KKELSKIILIDDSNISGFQYPDNFFPIKKW 193
>gi|22327621|ref|NP_199453.2| SCP1-like small phosphatase 4 [Arabidopsis thaliana]
gi|18377616|gb|AAL66958.1| unknown protein [Arabidopsis thaliana]
gi|20465765|gb|AAM20371.1| unknown protein [Arabidopsis thaliana]
gi|332007997|gb|AED95380.1| SCP1-like small phosphatase 4 [Arabidopsis thaliana]
Length = 453
Score = 39.7 bits (91), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 28/98 (28%), Positives = 48/98 (48%), Gaps = 10/98 (10%)
Query: 66 ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKL-VKLRPFVRTF 124
++ + LVL+LD TL+H ++S + + + F M + + V+ RP + F
Sbjct: 278 KKSVTLVLDLDETLVHS-TLESCNVADFSFR--------VFFNMQENTVYVRQRPHLYRF 328
Query: 125 LEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSR 162
LE+ L + + T S YA + +LD D K+ S R
Sbjct: 329 LERVGELFHVVIFTASHSIYASQLLDILDPDGKFISQR 366
>gi|355681366|gb|AER96785.1| CTD small phosphatase 1 [Mustela putorius furo]
Length = 260
Score = 39.7 bits (91), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 36/149 (24%), Positives = 72/149 (48%), Gaps = 11/149 (7%)
Query: 64 QEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRT 123
Q+ K+ +V++LD TL+H + K +++ + + +I + ++ + RP V
Sbjct: 86 QDVDKICVVIDLDETLVHS-SFKPVNNADFIIPVEIDGVVHQVYVLK-------RPHVDE 137
Query: 124 FLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVR-G 182
FL++ L + L T S YA+ LLD F +R+ + DL R G
Sbjct: 138 FLQRMGELFECVLFTASLAKYADPVADLLD-KWGAFRARLFRESCVFHRGNYVKDLSRLG 196
Query: 183 QE-RGIVILDDTESVWSDHTENLIVLGKY 210
++ R ++ILD++ + + H +N + + +
Sbjct: 197 RDLRRVLILDNSPASYVFHPDNAVPVASW 225
>gi|186529839|ref|NP_001119383.1| SCP1-like small phosphatase 4 [Arabidopsis thaliana]
gi|332007998|gb|AED95381.1| SCP1-like small phosphatase 4 [Arabidopsis thaliana]
Length = 456
Score = 39.7 bits (91), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 28/98 (28%), Positives = 48/98 (48%), Gaps = 10/98 (10%)
Query: 66 ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKL-VKLRPFVRTF 124
++ + LVL+LD TL+H ++S + + + F M + + V+ RP + F
Sbjct: 281 KKSVTLVLDLDETLVHS-TLESCNVADFSFR--------VFFNMQENTVYVRQRPHLYRF 331
Query: 125 LEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSR 162
LE+ L + + T S YA + +LD D K+ S R
Sbjct: 332 LERVGELFHVVIFTASHSIYASQLLDILDPDGKFISQR 369
>gi|395527645|ref|XP_003765953.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
small phosphatase 1 isoform 1 [Sarcophilus harrisii]
Length = 257
Score = 39.7 bits (91), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 36/149 (24%), Positives = 72/149 (48%), Gaps = 11/149 (7%)
Query: 64 QEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRT 123
Q+ K+ +V++LD TL+H + K +++ + + +I + ++ + RP V
Sbjct: 82 QDLGKICVVIDLDETLVHS-SFKPVNNADFIIPVEIDGMVHQVYVLK-------RPHVDE 133
Query: 124 FLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVR-G 182
FL++ L + L T S YA+ LLD F +R+ + DL R G
Sbjct: 134 FLQRMGELFECVLFTASLAKYADPVADLLD-KWGSFRARLFRESCVFHRGNYVKDLSRLG 192
Query: 183 QE-RGIVILDDTESVWSDHTENLIVLGKY 210
++ R ++ILD++ + + H +N + + +
Sbjct: 193 RDLRRVLILDNSPASYVFHPDNAVPVASW 221
>gi|195019148|ref|XP_001984920.1| GH16757 [Drosophila grimshawi]
gi|193898402|gb|EDV97268.1| GH16757 [Drosophila grimshawi]
Length = 341
Score = 39.7 bits (91), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 39/160 (24%), Positives = 77/160 (48%), Gaps = 15/160 (9%)
Query: 54 YMLRGLRYSEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDK 113
Y+L +R+S+ + + V++LD TL+H + K + + + + +I I ++ +
Sbjct: 75 YLLPQVRHSDMHRKCM--VIDLDETLVHS-SFKPIPNADFIVPVEIDGTIHQVYVLK--- 128
Query: 114 LVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED---FN 170
RP V FL++ L + L T S YA+ LLD F +R+ RE +
Sbjct: 129 ----RPHVDEFLQKMGELYECVLFTASLAKYADPVADLLD-KWNVFRARLF-RESCVYYR 182
Query: 171 GKDRKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKY 210
G K+ + + + IVI+D++ + + H +N + + +
Sbjct: 183 GNYIKDLNRLGRDLQKIVIVDNSPASYIFHPDNAVPVKSW 222
>gi|395835349|ref|XP_003790644.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
small phosphatase 2 [Otolemur garnettii]
Length = 271
Score = 39.7 bits (91), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 39/150 (26%), Positives = 72/150 (48%), Gaps = 19/150 (12%)
Query: 62 SEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFV 121
+E+++ ++ +V++LD TL+H + K +++ + + +I G+ Q+ V RP+V
Sbjct: 95 TEEDQGRICVVIDLDETLVH-SSFKPINNADFIVPVEIE---GTTHQV----YVLKRPYV 146
Query: 122 RTFLEQASSLVDIYLCTMSTRCYAEAAVKLLD----LDSKYFSSRIIAREDFNGKD--RK 175
FL + L + L T S YA+ LLD ++ F + + KD R
Sbjct: 147 DEFLRRMGELFECVLFTASLAKYADPVTDLLDRCGVFRARLFRESCVFHQGCYVKDLSRL 206
Query: 176 NPDLVRGQERGIVILDDTESVWSDHTENLI 205
DL R +ILD++ + + H EN +
Sbjct: 207 GRDL-----RKTLILDNSPASYIFHPENAV 231
>gi|302812229|ref|XP_002987802.1| hypothetical protein SELMODRAFT_126751 [Selaginella moellendorffii]
gi|302817447|ref|XP_002990399.1| hypothetical protein SELMODRAFT_131611 [Selaginella moellendorffii]
gi|300141784|gb|EFJ08492.1| hypothetical protein SELMODRAFT_131611 [Selaginella moellendorffii]
gi|300144421|gb|EFJ11105.1| hypothetical protein SELMODRAFT_126751 [Selaginella moellendorffii]
Length = 253
Score = 39.7 bits (91), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 32/109 (29%), Positives = 49/109 (44%), Gaps = 10/109 (9%)
Query: 57 RGLRYSEQEER--KLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKL 114
R + +Q R + LVL+LD TL+H S ++ SF +
Sbjct: 43 RPMLLPKQTRRCPPVTLVLDLDETLVH--------STLEHCADADFSFPVYFNYQEHTVY 94
Query: 115 VKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
V+ RP ++ FLE+ + L +I + T S YAE + +LD K RI
Sbjct: 95 VRRRPHLQVFLEKVAQLFEIIIFTASQSVYAEQLLNILDPKRKLIRHRI 143
>gi|313212699|emb|CBY36636.1| unnamed protein product [Oikopleura dioica]
Length = 271
Score = 39.7 bits (91), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 37/155 (23%), Positives = 75/155 (48%), Gaps = 15/155 (9%)
Query: 65 EERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTF 124
+ +K+ V++LD TL+H + K +++ + ++ +I + + ++ + RP+V F
Sbjct: 85 DPKKICCVIDLDETLVHS-SFKPIANADFHVPVEIENMVHQVYVLK-------RPYVDEF 136
Query: 125 LEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLV---R 181
L + L + L T S YA+ +D +++ FSSR+ + DL R
Sbjct: 137 LAKVGELFECVLFTASLAKYADEVANEIDPNNE-FSSRLFRESCVYDRGNYVKDLTKLGR 195
Query: 182 GQERGIVILDDTESVWSDHTENLIVLGKYVYFRDK 216
+R I+I D++ + + +N I + +F DK
Sbjct: 196 PLDRTIII-DNSPASYLFQPQNAIPVSS--WFEDK 227
>gi|302806318|ref|XP_002984909.1| hypothetical protein SELMODRAFT_423987 [Selaginella moellendorffii]
gi|300147495|gb|EFJ14159.1| hypothetical protein SELMODRAFT_423987 [Selaginella moellendorffii]
Length = 214
Score = 39.7 bits (91), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 41/147 (27%), Positives = 64/147 (43%), Gaps = 16/147 (10%)
Query: 66 ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFL 125
E K LVL++D TL+H K+ +S + F G + LV RP V TFL
Sbjct: 40 EEKPTLVLDMDETLIHAH--KATAS--------LKLFSGRTLPLQR-YLVAKRPGVDTFL 88
Query: 126 EQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRII-----AREDFNGKDRKNPDLV 180
+ S + +I + T + + YA+ + LD F+ R+ +E K KN +
Sbjct: 89 NEMSQIYEIVVFTRAVKPYADRILDRLDPAGNLFTHRLYRDLCSPKEVGGRKVVKNLSRL 148
Query: 181 RGQERGIVILDDTESVWSDHTENLIVL 207
+ VI+DD + N IV+
Sbjct: 149 GRDLKHTVIVDDKPESFCLQPSNGIVI 175
>gi|195377848|ref|XP_002047699.1| GJ11778 [Drosophila virilis]
gi|194154857|gb|EDW70041.1| GJ11778 [Drosophila virilis]
Length = 329
Score = 39.7 bits (91), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 39/160 (24%), Positives = 77/160 (48%), Gaps = 15/160 (9%)
Query: 54 YMLRGLRYSEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDK 113
Y+L +R+S+ + + V++LD TL+H + K + + + + +I I ++ +
Sbjct: 74 YLLPQVRHSDMHRKCM--VIDLDETLVHS-SFKPIPNADFIVPVEIDGTIHQVYVLK--- 127
Query: 114 LVKLRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARED---FN 170
RP V FL++ L + L T S YA+ LLD F +R+ RE +
Sbjct: 128 ----RPHVDEFLQKMGELYECVLFTASLAKYADPVADLLD-KWNVFRARLF-RESCVYYR 181
Query: 171 GKDRKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKY 210
G K+ + + + IVI+D++ + + H +N + + +
Sbjct: 182 GNYIKDLNRLGRDLQKIVIVDNSPASYIFHPDNAVPVKSW 221
>gi|403416935|emb|CCM03635.1| predicted protein [Fibroporia radiculosa]
Length = 580
Score = 39.7 bits (91), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 23/76 (30%), Positives = 41/76 (53%), Gaps = 2/76 (2%)
Query: 139 MSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKN-PDLVRGQERGIVILDDTESVW 197
M TR YAE +D + K+F R+++R++ +K+ L + +VI+DD VW
Sbjct: 1 MGTRAYAEEVCAAIDPEGKFFGGRLLSRDESGSLTQKSLQRLFPTDQSMVVIIDDRADVW 60
Query: 198 SDHTENLIVLGKYVYF 213
+ + NL+ + Y +F
Sbjct: 61 -EWSPNLVKVIPYDFF 75
>gi|395527647|ref|XP_003765954.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
small phosphatase 1 isoform 2 [Sarcophilus harrisii]
Length = 258
Score = 39.7 bits (91), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 36/149 (24%), Positives = 72/149 (48%), Gaps = 11/149 (7%)
Query: 64 QEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRT 123
Q+ K+ +V++LD TL+H + K +++ + + +I + ++ + RP V
Sbjct: 83 QDLGKICVVIDLDETLVHS-SFKPVNNADFIIPVEIDGMVHQVYVLK-------RPHVDE 134
Query: 124 FLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVR-G 182
FL++ L + L T S YA+ LLD F +R+ + DL R G
Sbjct: 135 FLQRMGELFECVLFTASLAKYADPVADLLD-KWGSFRARLFRESCVFHRGNYVKDLSRLG 193
Query: 183 QE-RGIVILDDTESVWSDHTENLIVLGKY 210
++ R ++ILD++ + + H +N + + +
Sbjct: 194 RDLRRVLILDNSPASYVFHPDNAVPVASW 222
>gi|339250888|ref|XP_003374429.1| carboxy- domain RNA polymerase II polypeptide A small phosphatase 1
[Trichinella spiralis]
gi|316969260|gb|EFV53388.1| carboxy- domain RNA polymerase II polypeptide A small phosphatase 1
[Trichinella spiralis]
Length = 284
Score = 39.7 bits (91), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 35/137 (25%), Positives = 68/137 (49%), Gaps = 11/137 (8%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
L+++LD TL+H + K + + + + +I + ++ + RP+V FL+Q S+
Sbjct: 87 LIVDLDETLVH-SSFKPVKNPDFVIPVEIDGVVHQVYVLK-------RPYVDEFLQQISA 138
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVR-GQE-RGIV 188
+ L T S YA+ LLD F SR+ K DL R G++ + ++
Sbjct: 139 NFECILFTASLAKYADPVADLLD-RWGVFRSRLFREACVFHKGNYVKDLNRLGRDLKHVL 197
Query: 189 ILDDTESVWSDHTENLI 205
I+D++ + ++ H +N +
Sbjct: 198 IVDNSPASYAFHPDNAV 214
>gi|356530555|ref|XP_003533846.1| PREDICTED: uncharacterized protein LOC100786602 [Glycine max]
Length = 470
Score = 39.3 bits (90), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 27/99 (27%), Positives = 50/99 (50%), Gaps = 12/99 (12%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKL--VKLRPFVRTF 124
+ + LVL+LD TL+H ++ + F ++F + VK RP++ F
Sbjct: 296 KSITLVLDLDETLVH-STLEPCDDAD---------FTFTVFFNLKEYTVYVKQRPYLHAF 345
Query: 125 LEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
LE+ S + ++ + T S YA+ + +LD D ++ S R+
Sbjct: 346 LERVSEMFEVVIFTASQSIYAKQLLDILDPDGRFISRRM 384
>gi|403368592|gb|EJY84135.1| Putative tfiif-interacting component of the c-terminal domain
phosphatase [Oxytricha trifallax]
Length = 525
Score = 39.3 bits (90), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 48/99 (48%), Gaps = 8/99 (8%)
Query: 65 EERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKL------VKLR 118
++RKL LVL+LD+TLLH ++I+ K + I L + KL KLR
Sbjct: 6 QDRKLVLVLDLDNTLLHTKSIEEREFQTKSRDPTFINLIDPLKSIYEIKLFRGGFHTKLR 65
Query: 119 PFVRTFLEQA--SSLVDIYLCTMSTRCYAEAAVKLLDLD 155
PF+ FL++ +IY T T+ Y + + ++
Sbjct: 66 PFLFEFLKKVFDERKFEIYFYTAGTKDYGMLIIDIFKME 104
>gi|338711176|ref|XP_001504815.3| PREDICTED: CTD nuclear envelope phosphatase 1-like [Equus caballus]
Length = 296
Score = 39.3 bits (90), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 43/157 (27%), Positives = 67/157 (42%), Gaps = 19/157 (12%)
Query: 64 QEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGS--LFQMANDK-----LVK 116
Q +RK+ LVL+LD TL+H S + L+ + + ++ DK V
Sbjct: 110 QVKRKI-LVLDLDETLIH-------SHHDGVLRPTVRPGTPPDFILKVVIDKHPVRFFVH 161
Query: 117 LRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFN---GKD 173
RP V FLE S ++ + T S Y A LD +S+ R R+ G
Sbjct: 162 KRPHVDFFLEVVSQWYELVVFTASMEIYGSAVADKLD-NSRSILKRRYYRQHCTLELGSY 220
Query: 174 RKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKY 210
K+ +V IVILD++ + H +N I + +
Sbjct: 221 IKDLSVVHSDLSSIVILDNSPGAYRSHPDNAIPIKSW 257
>gi|322710332|gb|EFZ01907.1| NIF domain protein [Metarhizium anisopliae ARSEF 23]
Length = 500
Score = 39.3 bits (90), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 38/154 (24%), Positives = 68/154 (44%), Gaps = 18/154 (11%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGE------KYLKKQIHSFIGSLFQMANDKLVKLRPFVRTF 124
L+L+LD TL+H + SSG + + G Q V RP+ F
Sbjct: 312 LILDLDETLIHSMSKGGRSSGHMVEVRLNTASLGMGTAPGGAAQHPILYWVNKRPYCDEF 371
Query: 125 LEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI-----IAREDFNGKDRKN--P 177
L + ++ + T S + YA+ + L+ + K+FS+R R+ KD + P
Sbjct: 372 LRRICKWFNLVIFTASVQEYADPVIDWLEAERKFFSARYYRQHCTYRQGAYIKDLSSVEP 431
Query: 178 DLVRGQERGIVILDDTESVWSDHTENLIVLGKYV 211
DL + ++ILD++ + H +N I + ++
Sbjct: 432 DLSK-----VMILDNSPLSYLFHEDNAIPIQGWI 460
>gi|301761366|ref|XP_002916075.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
small phosphatase 2-like [Ailuropoda melanoleuca]
gi|410964959|ref|XP_003989020.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
small phosphatase 2-like [Felis catus]
Length = 271
Score = 39.3 bits (90), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 39/150 (26%), Positives = 72/150 (48%), Gaps = 19/150 (12%)
Query: 62 SEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFV 121
+E+++ ++ +V++LD TL+H + K +++ + + +I G+ Q+ V RP+V
Sbjct: 95 TEEDQGRICVVIDLDETLVH-SSFKPINNADFIVPVEIE---GTTHQV----YVLKRPYV 146
Query: 122 RTFLEQASSLVDIYLCTMSTRCYAEAAVKLLD----LDSKYFSSRIIAREDFNGKD--RK 175
FL + L + L T S YA+ LLD ++ F + + KD R
Sbjct: 147 DEFLRRMGELFECVLFTASLAKYADPVTDLLDRCGVFRARLFRESCVFHQGCYVKDLSRL 206
Query: 176 NPDLVRGQERGIVILDDTESVWSDHTENLI 205
DL R +ILD++ + + H EN +
Sbjct: 207 GRDL-----RKTLILDNSPASYIFHPENAV 231
>gi|159473212|ref|XP_001694733.1| predicted protein [Chlamydomonas reinhardtii]
gi|158276545|gb|EDP02317.1| predicted protein [Chlamydomonas reinhardtii]
Length = 215
Score = 39.3 bits (90), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 27/97 (27%), Positives = 47/97 (48%), Gaps = 8/97 (8%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLE 126
R+ LVL+LD TL+H S + + + +F + M + V+ RP + F+
Sbjct: 33 RRKTLVLDLDETLVH--------SSLEAVDRSDFNFPVTFNGMDHTVYVRQRPHLHDFMA 84
Query: 127 QASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
+ ++L ++ + T S R YAE + +LD RI
Sbjct: 85 RVAALFEVVVFTASQRIYAERLLDILDPGQALVRHRI 121
>gi|73968605|ref|XP_538256.2| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
small phosphatase 2 isoform 1 [Canis lupus familiaris]
Length = 271
Score = 39.3 bits (90), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 39/150 (26%), Positives = 72/150 (48%), Gaps = 19/150 (12%)
Query: 62 SEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFV 121
+E+++ ++ +V++LD TL+H + K +++ + + +I G+ Q+ V RP+V
Sbjct: 95 TEEDQGRICVVIDLDETLVH-SSFKPINNADFVVPVEIE---GTTHQV----YVLKRPYV 146
Query: 122 RTFLEQASSLVDIYLCTMSTRCYAEAAVKLLD----LDSKYFSSRIIAREDFNGKD--RK 175
FL + L + L T S YA+ LLD ++ F + + KD R
Sbjct: 147 DEFLRRMGELFECVLFTASLAKYADPVTDLLDRCGVFRARLFRESCVFHQGCYVKDLSRL 206
Query: 176 NPDLVRGQERGIVILDDTESVWSDHTENLI 205
DL R +ILD++ + + H EN +
Sbjct: 207 GRDL-----RKTLILDNSPASYIFHPENAV 231
>gi|72386761|ref|XP_843805.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
gi|62359817|gb|AAX80246.1| hypothetical protein, conserved [Trypanosoma brucei]
gi|70800337|gb|AAZ10246.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|261326894|emb|CBH09867.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
DAL972]
Length = 423
Score = 39.3 bits (90), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 49/99 (49%), Gaps = 13/99 (13%)
Query: 68 KLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKL---VKLRPFVRTF 124
K+ L+L+LD TL+H SL+S ++ H + + +M N V RPF+R F
Sbjct: 236 KITLILDLDETLVHS----SLTSQSRH-----HDLVLDV-RMENTSTTVYVAFRPFMREF 285
Query: 125 LEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI 163
L+ + L ++ + T S Y + +D D+ S R+
Sbjct: 286 LQAVAPLFEVIIFTASVSVYCNQLMDAIDTDNILGSLRL 324
>gi|432914367|ref|XP_004079077.1| PREDICTED: CTD small phosphatase-like protein-like isoform 1
[Oryzias latipes]
Length = 263
Score = 39.3 bits (90), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 45/174 (25%), Positives = 79/174 (45%), Gaps = 13/174 (7%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
+V++LD TL+H + K +S+ + + +I + ++ + RP V FL++
Sbjct: 96 VVIDLDETLVHS-SFKPISNADFIVPVEIDGTVHQVYVLK-------RPHVDEFLQKMGE 147
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVR-GQE-RGIV 188
L + L T S YA+ LLD F +R+ + DL R G+E ++
Sbjct: 148 LFECVLFTASLAKYADPVADLLD-QWGVFRARLFRESCVFHRGNYVKDLSRLGRELNNVI 206
Query: 189 ILDDTESVWSDHTENLIVLGKYV-YFRDKELNGDHKSYSETLTDESENEEALAN 241
I+D++ + + H EN + + + D EL D + E L+ E E L N
Sbjct: 207 IVDNSPASYIFHPENAVPVQSWFDDMNDTEL-LDLLPFFEGLSKEEEVYGVLQN 259
>gi|348580807|ref|XP_003476170.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
small phosphatase 2-like [Cavia porcellus]
Length = 271
Score = 39.3 bits (90), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 39/150 (26%), Positives = 72/150 (48%), Gaps = 19/150 (12%)
Query: 62 SEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFV 121
+E+++ ++ +V++LD TL+H + K +++ + + +I G+ Q+ V RP+V
Sbjct: 95 TEEDQGRICVVIDLDETLVH-SSFKPINNADFIVPVEIE---GTTHQV----YVLKRPYV 146
Query: 122 RTFLEQASSLVDIYLCTMSTRCYAEAAVKLLD----LDSKYFSSRIIAREDFNGKD--RK 175
FL + L + L T S YA+ LLD ++ F + + KD R
Sbjct: 147 DEFLRRMGELFECVLFTASLAKYADPVTDLLDRCGVFRARLFRESCVFHQGCYVKDLSRL 206
Query: 176 NPDLVRGQERGIVILDDTESVWSDHTENLI 205
DL R +ILD++ + + H EN +
Sbjct: 207 GRDL-----RKTLILDNSPASYIFHPENAV 231
>gi|432914369|ref|XP_004079078.1| PREDICTED: CTD small phosphatase-like protein-like isoform 2
[Oryzias latipes]
Length = 274
Score = 39.3 bits (90), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 45/174 (25%), Positives = 79/174 (45%), Gaps = 13/174 (7%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
+V++LD TL+H + K +S+ + + +I + ++ + RP V FL++
Sbjct: 107 VVIDLDETLVHS-SFKPISNADFIVPVEIDGTVHQVYVLK-------RPHVDEFLQKMGE 158
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVR-GQE-RGIV 188
L + L T S YA+ LLD F +R+ + DL R G+E ++
Sbjct: 159 LFECVLFTASLAKYADPVADLLD-QWGVFRARLFRESCVFHRGNYVKDLSRLGRELNNVI 217
Query: 189 ILDDTESVWSDHTENLIVLGKYV-YFRDKELNGDHKSYSETLTDESENEEALAN 241
I+D++ + + H EN + + + D EL D + E L+ E E L N
Sbjct: 218 IVDNSPASYIFHPENAVPVQSWFDDMNDTEL-LDLLPFFEGLSKEEEVYGVLQN 270
>gi|351704703|gb|EHB07622.1| Carboxy-terminal domain RNA polymerase II polypeptide A small
phosphatase 2 [Heterocephalus glaber]
Length = 271
Score = 39.3 bits (90), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 39/150 (26%), Positives = 72/150 (48%), Gaps = 19/150 (12%)
Query: 62 SEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFV 121
+E+++ ++ +V++LD TL+H + K +++ + + +I G+ Q+ V RP+V
Sbjct: 95 TEEDQGRICVVIDLDETLVH-SSFKPINNADFIVPVEIE---GTTHQV----YVLKRPYV 146
Query: 122 RTFLEQASSLVDIYLCTMSTRCYAEAAVKLLD----LDSKYFSSRIIAREDFNGKD--RK 175
FL + L + L T S YA+ LLD ++ F + + KD R
Sbjct: 147 DEFLRRMGELFECVLFTASLAKYADPVTDLLDRCGVFRARLFRESCVFHQGCYVKDLSRL 206
Query: 176 NPDLVRGQERGIVILDDTESVWSDHTENLI 205
DL R +ILD++ + + H EN +
Sbjct: 207 GRDL-----RKTLILDNSPASYIFHPENAV 231
>gi|426221551|ref|XP_004004972.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
small phosphatase 1 [Ovis aries]
Length = 260
Score = 39.3 bits (90), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 36/149 (24%), Positives = 72/149 (48%), Gaps = 11/149 (7%)
Query: 64 QEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRT 123
Q+ K+ +V++LD TL+H + K +++ + + +I + ++ + RP V
Sbjct: 85 QDLDKICVVIDLDETLVHS-SFKPVNNADFIIPVEIDGVVHQVYVLK-------RPHVDE 136
Query: 124 FLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVR-G 182
FL++ L + L T S YA+ LLD F +R+ + DL R G
Sbjct: 137 FLQRMGELFECVLFTASLAKYADPVADLLD-KWGAFRARLFRESCVFHRGNYVKDLSRLG 195
Query: 183 QE-RGIVILDDTESVWSDHTENLIVLGKY 210
++ R ++ILD++ + + H +N + + +
Sbjct: 196 RDLRRVLILDNSPASYVFHPDNAVPVASW 224
>gi|344266297|ref|XP_003405217.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
small phosphatase 2-like [Loxodonta africana]
Length = 271
Score = 39.3 bits (90), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 39/150 (26%), Positives = 72/150 (48%), Gaps = 19/150 (12%)
Query: 62 SEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFV 121
+E+++ ++ +V++LD TL+H + K +++ + + +I G+ Q+ V RP+V
Sbjct: 95 TEEDQGRICVVIDLDETLVH-SSFKPINNADFIVPVEIE---GTTHQV----YVLKRPYV 146
Query: 122 RTFLEQASSLVDIYLCTMSTRCYAEAAVKLLD----LDSKYFSSRIIAREDFNGKD--RK 175
FL + L + L T S YA+ LLD ++ F + + KD R
Sbjct: 147 DEFLRRMGELFECVLFTASLAKYADPVTDLLDRCGVFRARLFRESCVFHQGCYVKDLSRL 206
Query: 176 NPDLVRGQERGIVILDDTESVWSDHTENLI 205
DL R +ILD++ + + H EN +
Sbjct: 207 GRDL-----RKTLILDNSPASYIFHPENAV 231
>gi|302806320|ref|XP_002984910.1| hypothetical protein SELMODRAFT_121210 [Selaginella moellendorffii]
gi|300147496|gb|EFJ14160.1| hypothetical protein SELMODRAFT_121210 [Selaginella moellendorffii]
Length = 198
Score = 39.3 bits (90), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 41/148 (27%), Positives = 66/148 (44%), Gaps = 18/148 (12%)
Query: 66 ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFL 125
E K LVL++D TL+H K+ +S + F G + LV RP V TFL
Sbjct: 24 EEKPTLVLDMDETLIHAH--KATAS--------LKLFSGKTLPLQR-YLVAKRPGVDTFL 72
Query: 126 EQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVRGQER 185
+ S + +I + T + + YA+ + LD F+ R+ R+ + K+ +V+ R
Sbjct: 73 NEMSQIYEIVVFTRAVKLYADRILDRLDPAGNLFTHRLY-RDSCSPKEVGGRKVVKDLSR 131
Query: 186 ------GIVILDDTESVWSDHTENLIVL 207
VI+DD + N IV+
Sbjct: 132 LGRDLKHTVIVDDKPESFCLQPSNGIVI 159
>gi|145540281|ref|XP_001455830.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124423639|emb|CAK88433.1| unnamed protein product [Paramecium tetraurelia]
Length = 291
Score = 39.3 bits (90), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 48/160 (30%), Positives = 79/160 (49%), Gaps = 21/160 (13%)
Query: 66 ERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSF---IGSLFQMANDKL-VKLRPFV 121
+RK+ +VL+LD TL+H S +Y SF I Q N K+ V +RP V
Sbjct: 52 QRKI-IVLDLDETLVH--------SQFEYF----DSFDFTINIAVQSQNFKVYVIVRPGV 98
Query: 122 RTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVR 181
+ F+EQ + DI T S + YA A + +D D K R+ K+ DL +
Sbjct: 99 KKFIEQLNHFYDIIFWTASIKEYAMAVIDYIDPDGKAV-ERLFRDSCTPLKNSFTKDLTK 157
Query: 182 -GQE-RGIVILDDTESVWSDHTENLIVLGKYVYFR-DKEL 218
G++ + ++I+D++ + + EN + + + Y + DKEL
Sbjct: 158 LGRDLKDVIIVDNSVFSFIMNPENGLKINDFFYDKYDKEL 197
>gi|301115156|ref|XP_002905307.1| nuclear LIM factor interactor-interacting protein hyphal form,
putative [Phytophthora infestans T30-4]
gi|262110096|gb|EEY68148.1| nuclear LIM factor interactor-interacting protein hyphal form,
putative [Phytophthora infestans T30-4]
Length = 422
Score = 39.3 bits (90), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 31/87 (35%), Positives = 43/87 (49%), Gaps = 10/87 (11%)
Query: 68 KLQLVLNLDHTLLHCRNIKSLSSGE-KYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLE 126
K+ LVL+LD TL+HC S E K Q + + N VK RP + FL+
Sbjct: 239 KICLVLDLDETLVHC------SVDEVKNPHMQFPVTFNGVEYIVN---VKKRPHMEYFLK 289
Query: 127 QASSLVDIYLCTMSTRCYAEAAVKLLD 153
+ S L +I + T S + YAE +LD
Sbjct: 290 RVSKLFEIVVFTASHKVYAEKLTNMLD 316
>gi|225710872|gb|ACO11282.1| Serine/threonine-protein phosphatase dullard-A [Caligus
rogercresseyi]
Length = 261
Score = 39.3 bits (90), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 39/151 (25%), Positives = 62/151 (41%), Gaps = 9/151 (5%)
Query: 67 RKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDK-----LVKLRPFV 121
+K LVL+LD TL+H + +L S + KQ ++ ++ D+ V RP V
Sbjct: 76 KKKILVLDLDETLIHSHHDGTLRSSGPH--KQPNTQPDFTLKITLDRHPVRCFVHKRPHV 133
Query: 122 RTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE--DFNGKDRKNPDL 179
FL S ++ + T S Y A L+ S R + NG RK+ L
Sbjct: 134 DLFLSVVSQWFELVVFTASMEVYGTAVADKLESKSGILKGRYYRQHCTLINGSYRKDISL 193
Query: 180 VRGQERGIVILDDTESVWSDHTENLIVLGKY 210
V I ILD++ + N + + +
Sbjct: 194 VNKDLSSIFILDNSPGAYRSFPRNAVPIQSW 224
>gi|55740281|gb|AAV63942.1| putative nuclear LIM factor interactor-interacting protein hyphal
form [Phytophthora infestans]
Length = 211
Score = 39.3 bits (90), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 31/87 (35%), Positives = 43/87 (49%), Gaps = 10/87 (11%)
Query: 68 KLQLVLNLDHTLLHCRNIKSLSSGE-KYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLE 126
K+ LVL+LD TL+HC S E K Q + + N VK RP + FL+
Sbjct: 28 KICLVLDLDETLVHC------SVDEVKNPHMQFPVTFNGVEYIVN---VKKRPHMEYFLK 78
Query: 127 QASSLVDIYLCTMSTRCYAEAAVKLLD 153
+ S L +I + T S + YAE +LD
Sbjct: 79 RVSKLFEIVVFTASHKVYAEKLTNMLD 105
>gi|444722948|gb|ELW63620.1| CTD nuclear envelope phosphatase 1 [Tupaia chinensis]
Length = 352
Score = 39.3 bits (90), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 43/157 (27%), Positives = 66/157 (42%), Gaps = 19/157 (12%)
Query: 64 QEERKLQLVLNLDHTLLHCRN-------IKSLSSGEKYLKKQIHSFIGSLFQMANDKLVK 116
Q +RK+ LVL+LD TL+H + ++ + + LK I F V
Sbjct: 58 QVKRKI-LVLDLDETLIHSHHDGVLRPTVRPGTPPDFILKVVIDKHPVRFF-------VH 109
Query: 117 LRPFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFN---GKD 173
RP V FLE S ++ + T S Y A LD +S+ R R+ G
Sbjct: 110 KRPHVDFFLEVVSQWYELVVFTASMEIYGSAVADKLD-NSRSILKRRYYRQHCTLELGSY 168
Query: 174 RKNPDLVRGQERGIVILDDTESVWSDHTENLIVLGKY 210
K+ +V IVILD++ + H +N I + +
Sbjct: 169 IKDLSVVHSDLSSIVILDNSPGAYRSHPDNAIPIKSW 205
>gi|444509388|gb|ELV09225.1| Carboxy-terminal domain RNA polymerase II polypeptide A small
phosphatase 2 [Tupaia chinensis]
Length = 271
Score = 39.3 bits (90), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 39/150 (26%), Positives = 72/150 (48%), Gaps = 19/150 (12%)
Query: 62 SEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFV 121
+E+++ ++ +V++LD TL+H + K +++ + + +I G+ Q+ V RP+V
Sbjct: 95 TEEDQGRICVVIDLDETLVH-SSFKPINNADFIVPVEIE---GTTHQV----YVLKRPYV 146
Query: 122 RTFLEQASSLVDIYLCTMSTRCYAEAAVKLLD----LDSKYFSSRIIAREDFNGKD--RK 175
FL + L + L T S YA+ LLD ++ F + + KD R
Sbjct: 147 DEFLRRMGELFECVLFTASLAKYADPVTDLLDRCGVFRARLFRESCVFHQGCYVKDLSRL 206
Query: 176 NPDLVRGQERGIVILDDTESVWSDHTENLI 205
DL R +ILD++ + + H EN +
Sbjct: 207 GRDL-----RKTLILDNSPASYIFHPENAV 231
>gi|312084146|ref|XP_003144155.1| hypothetical protein LOAG_08577 [Loa loa]
Length = 152
Score = 39.3 bits (90), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 33/137 (24%), Positives = 66/137 (48%), Gaps = 11/137 (8%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
L+++LD TL+H + K + + + + +I + I ++ + RP+V FLE+
Sbjct: 25 LIIDLDETLVH-SSFKPVKNPDFIIPVEIDNVIHQVYVLK-------RPYVDEFLERIGD 76
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVR-GQE-RGIV 188
+ L T S YA+ LD F +R+ K DL R G++ + ++
Sbjct: 77 KFECVLFTASLAKYADPVADFLD-KRGVFRARLFRESCVFHKGNYVKDLTRLGRDLKKVI 135
Query: 189 ILDDTESVWSDHTENLI 205
I+D++ + ++ H +N +
Sbjct: 136 IVDNSPASYAFHPDNAV 152
>gi|367026037|ref|XP_003662303.1| hypothetical protein MYCTH_2302800 [Myceliophthora thermophila ATCC
42464]
gi|347009571|gb|AEO57058.1| hypothetical protein MYCTH_2302800 [Myceliophthora thermophila ATCC
42464]
Length = 524
Score = 39.3 bits (90), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 41/160 (25%), Positives = 71/160 (44%), Gaps = 29/160 (18%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQ-IHSFIGSLFQMANDKL-----------VKLR 118
L+L+LD TL+H SLS G + + + + +Q A + V R
Sbjct: 335 LILDLDETLIH-----SLSKGGRMGSGHMVEVRLNTTYQSAGGQTAIGPQHPILYYVHKR 389
Query: 119 PFVRTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI-----IAREDFNGKD 173
P FL + S ++ + T S + YA+ + L+ + KYFS+R R KD
Sbjct: 390 PHCDEFLRRVSKWYNLVVFTASVQEYADPVIDWLEAERKYFSARYYRQHCTFRHGAFIKD 449
Query: 174 RKN--PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYV 211
+ PDL + ++ILD++ + H +N I + ++
Sbjct: 450 LSSVEPDLSK-----VMILDNSPLSYMFHQDNAIPIQGWI 484
>gi|119617494|gb|EAW97088.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
small phosphatase 2, isoform CRA_e [Homo sapiens]
Length = 260
Score = 39.3 bits (90), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 39/150 (26%), Positives = 72/150 (48%), Gaps = 19/150 (12%)
Query: 62 SEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFV 121
+E+++ ++ +V++LD TL+H + K +++ + + +I G+ Q+ V RP+V
Sbjct: 101 TEEDQGRICVVIDLDETLVH-SSFKPINNADFIVPIEIE---GTTHQV----YVLKRPYV 152
Query: 122 RTFLEQASSLVDIYLCTMSTRCYAEAAVKLLD----LDSKYFSSRIIAREDFNGKD--RK 175
FL + L + L T S YA+ LLD ++ F + + KD R
Sbjct: 153 DEFLRRMGELFECVLFTASLAKYADPVTDLLDRCGVFRARLFRESCVFHQGCYVKDLSRL 212
Query: 176 NPDLVRGQERGIVILDDTESVWSDHTENLI 205
DL R +ILD++ + + H EN +
Sbjct: 213 GRDL-----RKTLILDNSPASYIFHPENAV 237
>gi|66799565|ref|XP_628708.1| hypothetical protein DDB_G0294376 [Dictyostelium discoideum AX4]
gi|74849923|sp|Q9XYL0.1|CTDS_DICDI RecName: Full=Probable C-terminal domain small phosphatase;
AltName: Full=Developmental gene 1148 protein
gi|4731912|gb|AAD28548.1|AF111941_1 development protein DG1148 [Dictyostelium discoideum]
gi|60462033|gb|EAL60295.1| hypothetical protein DDB_G0294376 [Dictyostelium discoideum AX4]
Length = 306
Score = 39.3 bits (90), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 39/142 (27%), Positives = 66/142 (46%), Gaps = 11/142 (7%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
LVL+LD TL+H + K + + + + +I I ++ V RPFV FL +
Sbjct: 139 LVLDLDETLVHS-SFKPVHNPDFIVPVEIEGTIHQVY-------VVKRPFVDDFLRAIAE 190
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVR-GQE-RGIV 188
+I + T S YA+ + LD + R+ N K DL R G++ + +
Sbjct: 191 KFEIVVFTASLAKYADPVLDFLDT-GRVIHYRLFRESCHNHKGNYVKDLSRLGRDLKSTI 249
Query: 189 ILDDTESVWSDHTENLIVLGKY 210
I+D++ S + H EN I + +
Sbjct: 250 IVDNSPSSYLFHPENAIPIDSW 271
>gi|114052134|ref|NP_001039400.1| carboxy-terminal domain RNA polymerase II polypeptide A small
phosphatase 2 [Bos taurus]
gi|86823928|gb|AAI05532.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
small phosphatase 2 [Bos taurus]
gi|126010770|gb|AAI33617.1| CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A)
small phosphatase 2 [Bos taurus]
gi|296487636|tpg|DAA29749.1| TPA: CTD (carboxy-terminal domain, RNA polymerase II, polypeptide
A) small phosphatase 2 [Bos taurus]
Length = 271
Score = 39.3 bits (90), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 39/150 (26%), Positives = 72/150 (48%), Gaps = 19/150 (12%)
Query: 62 SEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFV 121
+E+++ ++ +V++LD TL+H + K +++ + + +I G+ Q+ V RP+V
Sbjct: 95 TEEDQGRICVVIDLDETLVH-SSFKPINNADFIVPVEIE---GTTHQV----YVLKRPYV 146
Query: 122 RTFLEQASSLVDIYLCTMSTRCYAEAAVKLLD----LDSKYFSSRIIAREDFNGKD--RK 175
FL + L + L T S YA+ LLD ++ F + + KD R
Sbjct: 147 DEFLRRMGELFECVLFTASLAKYADPVTDLLDRCGVFRARLFRESCVFHQGCYVKDLSRL 206
Query: 176 NPDLVRGQERGIVILDDTESVWSDHTENLI 205
DL R +ILD++ + + H EN +
Sbjct: 207 GRDL-----RKTLILDNSPASYIFHPENAV 231
>gi|317419953|emb|CBN81989.1| CTD small phosphatase-like protein [Dicentrarchus labrax]
Length = 301
Score = 39.3 bits (90), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 36/142 (25%), Positives = 67/142 (47%), Gaps = 11/142 (7%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
+V++LD TL+H + K +S+ + + +I + ++ + RP V FL++
Sbjct: 134 VVIDLDETLVH-SSFKPISNADFIVPVEIDGTVHQVYVLK-------RPHVDEFLQKMGE 185
Query: 131 LVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDLVR-GQERG-IV 188
L + L T S YA+ LLD F SR+ + DL R G+E ++
Sbjct: 186 LFECVLFTASLAKYADPVADLLD-QWGVFRSRLFRESCVFHRGNYVKDLSRLGRELSKVI 244
Query: 189 ILDDTESVWSDHTENLIVLGKY 210
I+D++ + + H EN + + +
Sbjct: 245 IIDNSPASYIFHPENAVPVQSW 266
>gi|291409394|ref|XP_002720975.1| PREDICTED: nuclear LIM interactor-interacting factor 2 [Oryctolagus
cuniculus]
Length = 271
Score = 39.3 bits (90), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 39/150 (26%), Positives = 72/150 (48%), Gaps = 19/150 (12%)
Query: 62 SEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFV 121
+E+++ ++ +V++LD TL+H + K +++ + + +I G+ Q+ V RP+V
Sbjct: 95 TEEDQGRICVVIDLDETLVH-SSFKPINNADFIVPVEIE---GTTHQV----YVLKRPYV 146
Query: 122 RTFLEQASSLVDIYLCTMSTRCYAEAAVKLLD----LDSKYFSSRIIAREDFNGKD--RK 175
FL + L + L T S YA+ LLD ++ F + + KD R
Sbjct: 147 DEFLRRMGELFECVLFTASLAKYADPVTDLLDRCGVFRARLFRESCVFHQGCYVKDLSRL 206
Query: 176 NPDLVRGQERGIVILDDTESVWSDHTENLI 205
DL R +ILD++ + + H EN +
Sbjct: 207 GRDL-----RKTLILDNSPASYIFHPENAV 231
>gi|224057698|ref|XP_002299297.1| predicted protein [Populus trichocarpa]
gi|222846555|gb|EEE84102.1| predicted protein [Populus trichocarpa]
Length = 256
Score = 39.3 bits (90), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 27/83 (32%), Positives = 42/83 (50%), Gaps = 8/83 (9%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQASS 130
LVL+LD TL+H S + +F + + V+ RP++R F+E+ SS
Sbjct: 81 LVLDLDETLVH--------STLEPCDDADFTFPVNFNLQQHTVFVRCRPYLRDFMERVSS 132
Query: 131 LVDIYLCTMSTRCYAEAAVKLLD 153
L +I + T S YAE + +LD
Sbjct: 133 LFEIIIFTASQSIYAEQLLNVLD 155
>gi|357135834|ref|XP_003569513.1| PREDICTED: uncharacterized protein LOC100822852 [Brachypodium
distachyon]
Length = 447
Score = 39.3 bits (90), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 40/151 (26%), Positives = 67/151 (44%), Gaps = 12/151 (7%)
Query: 63 EQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKL-VKLRPFV 121
EQ +K+ LVL+LD TL+H S ++ +F F M + V+ RP +
Sbjct: 266 EQGTKKVTLVLDLDETLVH--------STMEHCSDADFTF-PVFFDMKEHVVYVRKRPHL 316
Query: 122 RTFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIAREDFNGKDRKNPDL-V 180
FL++ + + D+ + T S YA+ + LD + F R + DL V
Sbjct: 317 HIFLQKMAEMFDVVIFTASQSVYADQLLDRLDPEKTLFCKRFFRESCVFTESGYTKDLTV 376
Query: 181 RGQERG-IVILDDTESVWSDHTENLIVLGKY 210
G + +VI+D+T V+ N I + +
Sbjct: 377 VGVDLAKVVIIDNTPQVFQLQVNNGIPIQSW 407
>gi|302806324|ref|XP_002984912.1| hypothetical protein SELMODRAFT_48489 [Selaginella moellendorffii]
gi|300147498|gb|EFJ14162.1| hypothetical protein SELMODRAFT_48489 [Selaginella moellendorffii]
Length = 171
Score = 39.3 bits (90), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 42/146 (28%), Positives = 70/146 (47%), Gaps = 18/146 (12%)
Query: 68 KLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRTFLEQ 127
K LVL++D TL+H K+ +S + F G + + LV RP V TFL +
Sbjct: 1 KPTLVLDMDETLIHAH--KATAS--------LKLFSGKILPLQR-YLVAKRPGVDTFLNE 49
Query: 128 ASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRII----AREDFNGKDRKNPDLVR-G 182
S + +I + T + + YA+ + LD F+ R+ + ++ G+ + DL R G
Sbjct: 50 MSQIYEIVVFTRAVKPYADRILDRLDPVGNLFTHRLYRDSCSPKEVGGR-KVVKDLSRLG 108
Query: 183 QE-RGIVILDDTESVWSDHTENLIVL 207
++ R VI+DD + N IV+
Sbjct: 109 RDLRHTVIVDDKPESFCLQPSNGIVI 134
>gi|322692835|gb|EFY84722.1| NIF domain protein [Metarhizium acridum CQMa 102]
Length = 501
Score = 39.3 bits (90), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 38/154 (24%), Positives = 68/154 (44%), Gaps = 18/154 (11%)
Query: 71 LVLNLDHTLLHCRNIKSLSSGE------KYLKKQIHSFIGSLFQMANDKLVKLRPFVRTF 124
L+L+LD TL+H + SSG + + G Q V RP+ F
Sbjct: 313 LILDLDETLIHSMSKGGRSSGHMVEVRLNTASLGMGTAPGGAAQHPILYWVNKRPYCDEF 372
Query: 125 LEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRI-----IAREDFNGKDRKN--P 177
L + ++ + T S + YA+ + L+ + K+FS+R R+ KD + P
Sbjct: 373 LRRICKWFNLVIFTASVQEYADPVIDWLEAERKFFSARYYRQHCTYRQGAYIKDLSSVEP 432
Query: 178 DLVRGQERGIVILDDTESVWSDHTENLIVLGKYV 211
DL + ++ILD++ + H +N I + ++
Sbjct: 433 DLSK-----VMILDNSPLSYLFHEDNAIPIQGWI 461
>gi|389585986|dbj|GAB68715.1| phosphatase [Plasmodium cynomolgi strain B]
Length = 1263
Score = 39.3 bits (90), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 30/94 (31%), Positives = 46/94 (48%), Gaps = 8/94 (8%)
Query: 63 EQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVR 122
E+E + +VL+LD TL+H S GE+Y +IH +G + V RP V
Sbjct: 1082 EEERGRKTIVLDLDETLVH-----STLRGERYNSFRIHIELGDGRCVI---YVNKRPGVE 1133
Query: 123 TFLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDS 156
F ++ S ++ + T S YA A + LD D+
Sbjct: 1134 HFFKEISKHYEVVIFTASLPKYANAVIDKLDKDN 1167
>gi|426224809|ref|XP_004006561.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
small phosphatase 2-like [Ovis aries]
Length = 271
Score = 39.3 bits (90), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 39/150 (26%), Positives = 72/150 (48%), Gaps = 19/150 (12%)
Query: 62 SEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFV 121
+E+++ ++ +V++LD TL+H + K +++ + + +I G+ Q+ V RP+V
Sbjct: 95 TEEDQGRICVVIDLDETLVH-SSFKPINNADFIVPVEIE---GTTHQV----YVLKRPYV 146
Query: 122 RTFLEQASSLVDIYLCTMSTRCYAEAAVKLLD----LDSKYFSSRIIAREDFNGKD--RK 175
FL + L + L T S YA+ LLD ++ F + + KD R
Sbjct: 147 DEFLRRMGELFECVLFTASLAKYADPVTDLLDRCGVFRARLFRESCVFHQGCYVKDLSRL 206
Query: 176 NPDLVRGQERGIVILDDTESVWSDHTENLI 205
DL R +ILD++ + + H EN +
Sbjct: 207 GRDL-----RKTLILDNSPASYIFHPENAV 231
>gi|347300364|ref|NP_001231476.1| carboxy-terminal domain RNA polymerase II polypeptide A small
phosphatase 2 [Sus scrofa]
Length = 271
Score = 39.3 bits (90), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 39/150 (26%), Positives = 72/150 (48%), Gaps = 19/150 (12%)
Query: 62 SEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFV 121
+E+++ ++ +V++LD TL+H + K +++ + + +I G+ Q+ V RP+V
Sbjct: 95 TEEDQGRICVVIDLDETLVH-SSFKPINNADFIVPVEIE---GTTHQV----YVLKRPYV 146
Query: 122 RTFLEQASSLVDIYLCTMSTRCYAEAAVKLLD----LDSKYFSSRIIAREDFNGKD--RK 175
FL + L + L T S YA+ LLD ++ F + + KD R
Sbjct: 147 DEFLRRMGELFECVLFTASLAKYADPVTDLLDRCGVFRARLFRESCVFHQGCYVKDLSRL 206
Query: 176 NPDLVRGQERGIVILDDTESVWSDHTENLI 205
DL R +ILD++ + + H EN +
Sbjct: 207 GRDL-----RKTLILDNSPASYIFHPENAV 231
>gi|417398162|gb|JAA46114.1| Putative carboxy-terminal domain rna polymerase ii polypeptide a
small phosphatase 2-like isoform 1 [Desmodus rotundus]
Length = 271
Score = 39.3 bits (90), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 39/150 (26%), Positives = 72/150 (48%), Gaps = 19/150 (12%)
Query: 62 SEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFV 121
+E+++ ++ +V++LD TL+H + K +++ + + +I G+ Q+ V RP+V
Sbjct: 95 TEEDQGRICVVIDLDETLVH-SSFKPINNADFVVPVEIE---GTTHQV----YVLKRPYV 146
Query: 122 RTFLEQASSLVDIYLCTMSTRCYAEAAVKLLD----LDSKYFSSRIIAREDFNGKD--RK 175
FL + L + L T S YA+ LLD ++ F + + KD R
Sbjct: 147 DEFLRRMGELFECVLFTASLAKYADPVTDLLDRCGVFRARLFRESCVFHQGCYVKDLSRL 206
Query: 176 NPDLVRGQERGIVILDDTESVWSDHTENLI 205
DL R +ILD++ + + H EN +
Sbjct: 207 GRDL-----RKTLILDNSPASYIFHPENAV 231
>gi|296212190|ref|XP_002752719.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
small phosphatase 2 [Callithrix jacchus]
gi|403269004|ref|XP_003926550.1| PREDICTED: carboxy-terminal domain RNA polymerase II polypeptide A
small phosphatase 2 [Saimiri boliviensis boliviensis]
Length = 271
Score = 39.3 bits (90), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 39/150 (26%), Positives = 72/150 (48%), Gaps = 19/150 (12%)
Query: 62 SEQEERKLQLVLNLDHTLLHCRNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFV 121
+E+++ ++ +V++LD TL+H + K +++ + + +I G+ Q+ V RP+V
Sbjct: 95 TEEDQGRICVVIDLDETLVH-SSFKPINNADFIVPIEIE---GTTHQV----YVLKRPYV 146
Query: 122 RTFLEQASSLVDIYLCTMSTRCYAEAAVKLLD----LDSKYFSSRIIAREDFNGKD--RK 175
FL + L + L T S YA+ LLD ++ F + + KD R
Sbjct: 147 DEFLRRMGELFECVLFTASLAKYADPVTDLLDRCGVFRARLFRESCVFHQGCYVKDLSRL 206
Query: 176 NPDLVRGQERGIVILDDTESVWSDHTENLI 205
DL R +ILD++ + + H EN +
Sbjct: 207 GRDL-----RKTLILDNSPASYIFHPENAV 231
>gi|391332323|ref|XP_003740585.1| PREDICTED: CTD nuclear envelope phosphatase 1-like [Metaseiulus
occidentalis]
Length = 243
Score = 39.3 bits (90), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 54/209 (25%), Positives = 76/209 (36%), Gaps = 52/209 (24%)
Query: 71 LVLNLDHTLLHC-------RNIKSLSSGEKYLKKQIHSFIGSLFQMANDKLVKLRPFVRT 123
LVL+LD TL+H + + S + LK I F V RP V
Sbjct: 63 LVLDLDETLIHSYHDGMLRQTVPSGTPPNFVLKVTIERHPVRFF-------VHKRPHVDY 115
Query: 124 FLEQASSLVDIYLCTMSTRCYAEAAVKLLDLDSKYFSSRIIARE---DFNGKDRK----N 176
FLE S ++ + T S Y A LD R + D+ G + N
Sbjct: 116 FLEVVSQWYELVVFTASMEIYGAAVADRLDNGRGVMRRRFFRQHCTLDYGGYTKDLCAIN 175
Query: 177 PDLVRGQERGIVILDDTESVWSDHTENLIVLGKYVYFRDKELNGDHKSYSETLTDESENE 236
PDL + ILD++ S + +N I + +F D N+
Sbjct: 176 PDL-----SSVFILDNSPSAYKLFPDNAIPIKS--WFND------------------PND 210
Query: 237 EALANVLRVLKTIHRLFFDSVCGDVRTYL 265
AL N+L VL + C DVR+ L
Sbjct: 211 TALLNLLPVLDAL------RFCSDVRSIL 233
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.323 0.138 0.415
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,874,887,134
Number of Sequences: 23463169
Number of extensions: 193111748
Number of successful extensions: 446697
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 646
Number of HSP's successfully gapped in prelim test: 765
Number of HSP's that attempted gapping in prelim test: 444166
Number of HSP's gapped (non-prelim): 1776
length of query: 326
length of database: 8,064,228,071
effective HSP length: 142
effective length of query: 184
effective length of database: 9,027,425,369
effective search space: 1661046267896
effective search space used: 1661046267896
T: 11
A: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 77 (34.3 bits)
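The footer figures above can be cross-checked against the hits listed in this report. The sketch below is not part of the BLAST output. It assumes the standard Karlin-Altschul relations: bit score = (Lambda * raw score - ln K) / ln 2, and Expect = effective search space * 2^(-bit score). It also assumes that the second Lambda/K pair (0.267, 0.0410) is the gapped pair that applies to the reported scores. The function and constant names are illustrative only. Because the report prints rounded parameters, agreement is approximate.

```python
import math

# Values copied from the statistics footer of this report.
QUERY_LENGTH = 326
EFFECTIVE_HSP_LENGTH = 142
EFFECTIVE_DB_LENGTH = 9_027_425_369
# Assumption: the second Lambda/K block is the gapped pair.
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410


def effective_search_space(query_len, hsp_len, eff_db_len):
    """Effective query length times effective database length."""
    return (query_len - hsp_len) * eff_db_len


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to bits."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(bits, search_space):
    """Expected number of chance hits scoring at least `bits`."""
    return search_space * 2.0 ** (-bits)


space = effective_search_space(
    QUERY_LENGTH, EFFECTIVE_HSP_LENGTH, EFFECTIVE_DB_LENGTH
)
print(space)                                # 1661046267896, as in the footer
print(round(bit_score(90), 1))              # raw score 90 from the hit lines
print(round(expect_value(39.3, space), 1))  # Expect for a 39.3-bit hit
```

For the hits above with "Score = 39.3 bits (90)", this gives roughly 39.3 bits and an Expect near 2.5, in line with the reported values of 2.5 to 2.8.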