BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy13345
(276 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|345480428|ref|XP_001601407.2| PREDICTED: probable cleavage and polyadenylation specificity factor
subunit 2-like [Nasonia vitripennis]
Length = 739
Score = 218 bits (554), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 103/139 (74%), Positives = 122/139 (87%), Gaps = 3/139 (2%)
Query: 24 LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTP 81
+ +++ E AK IEWMSDKLMKSFEGARNNPF FKH++LCHS+AEL +VPSPKVVL STP
Sbjct: 272 VSYNVVEFAKSQIEWMSDKLMKSFEGARNNPFQFKHLQLCHSMAELNQVPSPKVVLASTP 331
Query: 82 DMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEE 141
DMECGFSRDLF QWCS+P+NSIIIT+RTSPGTLARDL+E GGNR +TL++KK++RLEG E
Sbjct: 332 DMECGFSRDLFLQWCSNPQNSIIITSRTSPGTLARDLVENGGNRNITLEIKKKVRLEGAE 391
Query: 142 LEEYQKK-KDKEAKDKQEK 159
LEEY KK K K+ + KQEK
Sbjct: 392 LEEYMKKEKVKQEQLKQEK 410
Score = 79.7 bits (195), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 45/117 (38%), Positives = 60/117 (51%), Gaps = 39/117 (33%)
Query: 160 EKIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECG 219
E I + T+FINEL+LSDFKQ L + I EF GVL CC T+AVRR
Sbjct: 662 EVIRVYQTTFINELKLSDFKQILTKANIPSEFSGGVLWCCNNTIAVRR------------ 709
Query: 220 FSRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
++ I++EGCLS++YYRV++LLY+QYAI+
Sbjct: 710 ---------------------------HEAGKIIMEGCLSEDYYRVKELLYEQYAIV 739
Score = 70.9 bits (172), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 29/36 (80%), Positives = 33/36 (91%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
+VVL STPDMECGFSRDLF QWCS+P+NSIIIT+R
Sbjct: 324 KVVLASTPDMECGFSRDLFLQWCSNPQNSIIITSRT 359
>gi|332028657|gb|EGI68691.1| Putative cleavage and polyadenylation specificity factor subunit 2
[Acromyrmex echinatior]
Length = 737
Score = 212 bits (540), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 99/139 (71%), Positives = 123/139 (88%), Gaps = 3/139 (2%)
Query: 24 LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTP 81
+ +++ E AK IEWMSDKLM+SFEGARNNPF FKH++LCHS+AEL +VPSPKVVL STP
Sbjct: 272 VSYNVVEFAKSQIEWMSDKLMRSFEGARNNPFQFKHLQLCHSMAELNQVPSPKVVLASTP 331
Query: 82 DMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEE 141
DMECGFSR+LF QWCS+P+NSII+T+RTSPGTLARDL+E GGNR +TL+VK+R++LEG E
Sbjct: 332 DMECGFSRELFLQWCSNPQNSIILTSRTSPGTLARDLVEKGGNRNITLEVKRRVKLEGIE 391
Query: 142 LEEYQKK-KDKEAKDKQEK 159
LEEYQK+ K K+ + KQE+
Sbjct: 392 LEEYQKREKLKQEQLKQEQ 410
Score = 84.0 bits (206), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 43/116 (37%), Positives = 62/116 (53%), Gaps = 39/116 (33%)
Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
++P H T+FINEL+LSDFKQ L ++ I EF GVL CC T+AVRR
Sbjct: 661 EVPGHQTTFINELKLSDFKQVLNKSNIPSEFSGGVLWCCNNTIAVRR------------- 707
Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
++ ++LEGC+S++YY+V++LLY+QYAI+
Sbjct: 708 --------------------------HEAGKVILEGCISEDYYKVRELLYEQYAIV 737
Score = 67.8 bits (164), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 27/36 (75%), Positives = 33/36 (91%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
+VVL STPDMECGFSR+LF QWCS+P+NSII+T+R
Sbjct: 324 KVVLASTPDMECGFSRELFLQWCSNPQNSIILTSRT 359
>gi|340713940|ref|XP_003395491.1| PREDICTED: probable cleavage and polyadenylation specificity factor
subunit 2-like isoform 1 [Bombus terrestris]
gi|340713942|ref|XP_003395492.1| PREDICTED: probable cleavage and polyadenylation specificity factor
subunit 2-like isoform 2 [Bombus terrestris]
Length = 737
Score = 211 bits (538), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 99/139 (71%), Positives = 122/139 (87%), Gaps = 3/139 (2%)
Query: 24 LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTP 81
+ +++ E AK IEWMSDKLM+SFEGARNNPF FKH++LCHS+AEL +VPSPKVVL STP
Sbjct: 272 VSYNVVEFAKSQIEWMSDKLMRSFEGARNNPFQFKHLQLCHSMAELNQVPSPKVVLASTP 331
Query: 82 DMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEE 141
DMECGFSR+LF QWC +P+NSII+T+RTSPGTLARDL+E GGNR +TL+VK+RI+LEG E
Sbjct: 332 DMECGFSRELFLQWCGNPQNSIILTSRTSPGTLARDLVEKGGNRNITLEVKRRIKLEGLE 391
Query: 142 LEEYQKK-KDKEAKDKQEK 159
LEEYQ+K K K+ + KQE+
Sbjct: 392 LEEYQRKEKLKQEQLKQEQ 410
Score = 83.6 bits (205), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 43/116 (37%), Positives = 62/116 (53%), Gaps = 39/116 (33%)
Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
++P H T+FINEL+LSDFKQ L ++ I EF GVL CC T+AVRR
Sbjct: 661 EVPGHQTTFINELKLSDFKQILNKSNIPSEFSGGVLWCCNNTIAVRR------------- 707
Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
++ ++LEGC+S++YY+V++LLY+QYAI+
Sbjct: 708 --------------------------HEAGKVILEGCISEDYYKVRELLYEQYAIV 737
Score = 67.4 bits (163), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 48/82 (58%), Gaps = 16/82 (19%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRNRAD---------DSDSNV------ 251
+VVL STPDMECGFSR+LF QWC +P+NSII+T+R + N+
Sbjct: 324 KVVLASTPDMECGFSRELFLQWCGNPQNSIILTSRTSPGTLARDLVEKGGNRNITLEVKR 383
Query: 252 -IVLEGCLSDEYYRVQQLLYDQ 272
I LEG +EY R ++L +Q
Sbjct: 384 RIKLEGLELEEYQRKEKLKQEQ 405
>gi|328780437|ref|XP_394940.3| PREDICTED: probable cleavage and polyadenylation specificity factor
subunit 2 [Apis mellifera]
Length = 730
Score = 211 bits (538), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 99/139 (71%), Positives = 122/139 (87%), Gaps = 3/139 (2%)
Query: 24 LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTP 81
+ +++ E AK IEWMSDKLM+SFEGARNNPF FKH++LCHS+AEL +VPSPKVVL STP
Sbjct: 272 VSYNVVEFAKSQIEWMSDKLMRSFEGARNNPFQFKHLQLCHSMAELNQVPSPKVVLASTP 331
Query: 82 DMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEE 141
DMECGFSR+LF QWC +P+NSII+T+RTSPGTLARDL+E GGNR +TL+VK+RI+LEG E
Sbjct: 332 DMECGFSRELFLQWCGNPQNSIILTSRTSPGTLARDLVEKGGNRNITLEVKRRIKLEGLE 391
Query: 142 LEEYQKK-KDKEAKDKQEK 159
LEEYQ+K K K+ + KQE+
Sbjct: 392 LEEYQRKEKLKQEQLKQEQ 410
Score = 67.4 bits (163), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 48/82 (58%), Gaps = 16/82 (19%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRNRAD---------DSDSNV------ 251
+VVL STPDMECGFSR+LF QWC +P+NSII+T+R + N+
Sbjct: 324 KVVLASTPDMECGFSRELFLQWCGNPQNSIILTSRTSPGTLARDLVEKGGNRNITLEVKR 383
Query: 252 -IVLEGCLSDEYYRVQQLLYDQ 272
I LEG +EY R ++L +Q
Sbjct: 384 RIKLEGLELEEYQRKEKLKQEQ 405
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 35/104 (33%), Positives = 50/104 (48%), Gaps = 39/104 (37%)
Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
++P H T+FINEL+LSDFKQ L ++ I EF GVL CC T+AVRR
Sbjct: 661 EVPGHQTTFINELKLSDFKQILNKSNIPSEFSGGVLWCCNNTIAVRR------------- 707
Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYR 264
++ ++LEGC+S++YY+
Sbjct: 708 --------------------------HEAGKVILEGCISEDYYK 725
>gi|350400562|ref|XP_003485880.1| PREDICTED: probable cleavage and polyadenylation specificity factor
subunit 2-like [Bombus impatiens]
Length = 737
Score = 211 bits (538), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 99/139 (71%), Positives = 122/139 (87%), Gaps = 3/139 (2%)
Query: 24 LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTP 81
+ +++ E AK IEWMSDKLM+SFEGARNNPF FKH++LCHS+AEL +VPSPKVVL STP
Sbjct: 272 VSYNVVEFAKSQIEWMSDKLMRSFEGARNNPFQFKHLQLCHSMAELNQVPSPKVVLASTP 331
Query: 82 DMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEE 141
DMECGFSR+LF QWC +P+NSII+T+RTSPGTLARDL+E GGNR +TL+VK+RI+LEG E
Sbjct: 332 DMECGFSRELFLQWCGNPQNSIILTSRTSPGTLARDLVEKGGNRNITLEVKRRIKLEGLE 391
Query: 142 LEEYQKK-KDKEAKDKQEK 159
LEEYQ+K K K+ + KQE+
Sbjct: 392 LEEYQRKEKLKQEQLKQEQ 410
Score = 83.6 bits (205), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 43/116 (37%), Positives = 62/116 (53%), Gaps = 39/116 (33%)
Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
++P H T+FINEL+LSDFKQ L ++ I EF GVL CC T+AVRR
Sbjct: 661 EVPGHQTTFINELKLSDFKQILNKSNIPSEFSGGVLWCCNNTIAVRR------------- 707
Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
++ ++LEGC+S++YY+V++LLY+QYAI+
Sbjct: 708 --------------------------HEAGKVILEGCISEDYYKVRELLYEQYAIV 737
Score = 67.4 bits (163), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 48/82 (58%), Gaps = 16/82 (19%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRNRAD---------DSDSNV------ 251
+VVL STPDMECGFSR+LF QWC +P+NSII+T+R + N+
Sbjct: 324 KVVLASTPDMECGFSRELFLQWCGNPQNSIILTSRTSPGTLARDLVEKGGNRNITLEVKR 383
Query: 252 -IVLEGCLSDEYYRVQQLLYDQ 272
I LEG +EY R ++L +Q
Sbjct: 384 RIKLEGLELEEYQRKEKLKQEQ 405
>gi|380025109|ref|XP_003696322.1| PREDICTED: probable cleavage and polyadenylation specificity factor
subunit 2-like [Apis florea]
Length = 737
Score = 211 bits (538), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 99/139 (71%), Positives = 122/139 (87%), Gaps = 3/139 (2%)
Query: 24 LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTP 81
+ +++ E AK IEWMSDKLM+SFEGARNNPF FKH++LCHS+AEL +VPSPKVVL STP
Sbjct: 272 VSYNVVEFAKSQIEWMSDKLMRSFEGARNNPFQFKHLQLCHSMAELNQVPSPKVVLASTP 331
Query: 82 DMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEE 141
DMECGFSR+LF QWC +P+NSII+T+RTSPGTLARDL+E GGNR +TL+VK+RI+LEG E
Sbjct: 332 DMECGFSRELFLQWCGNPQNSIILTSRTSPGTLARDLVEKGGNRNITLEVKRRIKLEGLE 391
Query: 142 LEEYQKK-KDKEAKDKQEK 159
LEEYQ+K K K+ + KQE+
Sbjct: 392 LEEYQRKEKLKQEQLKQEQ 410
Score = 83.6 bits (205), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 43/116 (37%), Positives = 62/116 (53%), Gaps = 39/116 (33%)
Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
++P H T+FINEL+LSDFKQ L ++ I EF GVL CC T+AVRR
Sbjct: 661 EVPGHQTTFINELKLSDFKQILNKSNIPSEFSGGVLWCCNNTIAVRR------------- 707
Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
++ ++LEGC+S++YY+V++LLY+QYAI+
Sbjct: 708 --------------------------HEAGKVILEGCISEDYYKVRELLYEQYAIV 737
Score = 67.4 bits (163), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 48/82 (58%), Gaps = 16/82 (19%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRNRAD---------DSDSNV------ 251
+VVL STPDMECGFSR+LF QWC +P+NSII+T+R + N+
Sbjct: 324 KVVLASTPDMECGFSRELFLQWCGNPQNSIILTSRTSPGTLARDLVEKGGNRNITLEVKR 383
Query: 252 -IVLEGCLSDEYYRVQQLLYDQ 272
I LEG +EY R ++L +Q
Sbjct: 384 RIKLEGLELEEYQRKEKLKQEQ 405
>gi|307189918|gb|EFN74154.1| Probable cleavage and polyadenylation specificity factor subunit 2
[Camponotus floridanus]
Length = 737
Score = 210 bits (535), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 98/139 (70%), Positives = 121/139 (87%), Gaps = 3/139 (2%)
Query: 24 LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTP 81
+ +++ E AK IEWMSDKLM+SFEGARNNPF FKH++LCHS+ EL +VPSPKVVL STP
Sbjct: 272 VSYNVVEFAKSQIEWMSDKLMRSFEGARNNPFQFKHLQLCHSMVELNQVPSPKVVLASTP 331
Query: 82 DMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEE 141
DMECGFSR+LF QWC++P+NSIIIT+RTSPGTLARDL+E GGNR +TL VK+R++LEG E
Sbjct: 332 DMECGFSRELFLQWCTNPQNSIIITSRTSPGTLARDLVEKGGNRNITLDVKRRVKLEGIE 391
Query: 142 LEEYQKK-KDKEAKDKQEK 159
LEEYQK+ K K+ + KQE+
Sbjct: 392 LEEYQKREKLKQEQMKQEQ 410
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 42/116 (36%), Positives = 62/116 (53%), Gaps = 39/116 (33%)
Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
++P H T+FINEL+LSDFKQ L ++ I EF GVL CC T+AVRR
Sbjct: 661 EVPGHQTTFINELKLSDFKQVLNKSNIPSEFSGGVLWCCNNTIAVRR------------- 707
Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
++ ++LEGC+S++YY+V++LL++QYAI+
Sbjct: 708 --------------------------HEAGKVILEGCISEDYYKVRELLFEQYAIV 737
Score = 68.2 bits (165), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 27/35 (77%), Positives = 33/35 (94%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 241
+VVL STPDMECGFSR+LF QWC++P+NSIIIT+R
Sbjct: 324 KVVLASTPDMECGFSRELFLQWCTNPQNSIIITSR 358
>gi|322783252|gb|EFZ10838.1| hypothetical protein SINV_80021 [Solenopsis invicta]
Length = 737
Score = 208 bits (530), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 98/139 (70%), Positives = 121/139 (87%), Gaps = 3/139 (2%)
Query: 24 LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTP 81
+ +++ E AK IEWMSDKLM+SFEGARNNPF FKH++LCHS+AEL +VPSPKVVL STP
Sbjct: 272 VSYNVVEFAKSQIEWMSDKLMRSFEGARNNPFQFKHLQLCHSMAELNQVPSPKVVLASTP 331
Query: 82 DMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEE 141
DMECGFSR+LF QWCS+ +NSII+T+RTSPGTLARDL+E GGNR +TL VK+R++LEG E
Sbjct: 332 DMECGFSRELFLQWCSNTQNSIILTSRTSPGTLARDLVEKGGNRNITLDVKRRVKLEGIE 391
Query: 142 LEEYQKK-KDKEAKDKQEK 159
LEEYQK+ K K+ + KQE+
Sbjct: 392 LEEYQKREKLKQEQMKQEQ 410
Score = 84.0 bits (206), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 43/116 (37%), Positives = 62/116 (53%), Gaps = 39/116 (33%)
Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
++P H T+FINEL+LSDFKQ L ++ I EF GVL CC T+AVRR
Sbjct: 661 EVPGHQTTFINELKLSDFKQVLNKSNIPSEFSGGVLWCCNNTIAVRR------------- 707
Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
++ ++LEGC+S++YY+V++LLY+QYAI+
Sbjct: 708 --------------------------HEAGKVILEGCISEDYYKVRELLYEQYAIV 737
Score = 64.7 bits (156), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 26/36 (72%), Positives = 32/36 (88%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
+VVL STPDMECGFSR+LF QWCS+ +NSII+T+R
Sbjct: 324 KVVLASTPDMECGFSRELFLQWCSNTQNSIILTSRT 359
>gi|383852782|ref|XP_003701904.1| PREDICTED: probable cleavage and polyadenylation specificity factor
subunit 2-like [Megachile rotundata]
Length = 737
Score = 208 bits (530), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 98/139 (70%), Positives = 121/139 (87%), Gaps = 3/139 (2%)
Query: 24 LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTP 81
+ +++ E AK IEWMSDKLM+SFEGARNNPF FKH++LCHS+AEL +VPSPKVVL STP
Sbjct: 272 VSYNVVEFAKSQIEWMSDKLMRSFEGARNNPFQFKHLQLCHSMAELNQVPSPKVVLASTP 331
Query: 82 DMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEE 141
DMECGFSR+LF QWC + +NSII+T+RTSPGTLARDL+E GGNR +TL+VK+RI+LEG E
Sbjct: 332 DMECGFSRELFLQWCGNSQNSIILTSRTSPGTLARDLVEKGGNRNITLEVKRRIKLEGLE 391
Query: 142 LEEYQKK-KDKEAKDKQEK 159
LEEYQ+K K K+ + KQE+
Sbjct: 392 LEEYQRKEKLKQEQLKQEQ 410
Score = 83.6 bits (205), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 43/116 (37%), Positives = 62/116 (53%), Gaps = 39/116 (33%)
Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
++P H T+FINEL+LSDFKQ L ++ I EF GVL CC T+AVRR
Sbjct: 661 EVPGHQTTFINELKLSDFKQILNKSNIPSEFSGGVLWCCNNTIAVRR------------- 707
Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
++ ++LEGC+S++YY+V++LLY+QYAI+
Sbjct: 708 --------------------------HEAGKVILEGCISEDYYKVRELLYEQYAIV 737
Score = 63.9 bits (154), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 35/82 (42%), Positives = 47/82 (57%), Gaps = 16/82 (19%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRNRAD---------DSDSNV------ 251
+VVL STPDMECGFSR+LF QWC + +NSII+T+R + N+
Sbjct: 324 KVVLASTPDMECGFSRELFLQWCGNSQNSIILTSRTSPGTLARDLVEKGGNRNITLEVKR 383
Query: 252 -IVLEGCLSDEYYRVQQLLYDQ 272
I LEG +EY R ++L +Q
Sbjct: 384 RIKLEGLELEEYQRKEKLKQEQ 405
>gi|270010824|gb|EFA07272.1| hypothetical protein TcasGA2_TC014506 [Tribolium castaneum]
Length = 733
Score = 197 bits (501), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 91/128 (71%), Positives = 108/128 (84%), Gaps = 2/128 (1%)
Query: 24 LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTP 81
+ +++ E AK IEWMSDKLM+SFEGARNNPF FKH++LCHSL EL KV SPKVVL S+P
Sbjct: 272 VSYNVVEFAKSQIEWMSDKLMRSFEGARNNPFQFKHLQLCHSLHELQKVSSPKVVLASSP 331
Query: 82 DMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEE 141
DME GFSR+LF QWCS+P NSIIIT RTSPGTLARDL++ GGNR + L VK+R++LEG E
Sbjct: 332 DMESGFSRELFLQWCSNPNNSIIITTRTSPGTLARDLVDNGGNRQIDLVVKRRVKLEGSE 391
Query: 142 LEEYQKKK 149
LEEYQK +
Sbjct: 392 LEEYQKSQ 399
Score = 84.0 bits (206), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 43/113 (38%), Positives = 62/113 (54%), Gaps = 39/113 (34%)
Query: 164 PHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGFSRD 223
PHDT FINEL+LS+FKQ L ++ I+ EF GVL C GT+A+RRV
Sbjct: 660 PHDTVFINELKLSEFKQILAKSNINSEFSGGVLWCSNGTLAIRRV--------------- 704
Query: 224 LFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
++ ++LEGC+S++YY+V++LLY+QYA+L
Sbjct: 705 ------------------------ETGRVILEGCISEDYYKVKELLYEQYAVL 733
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 27/39 (69%), Positives = 32/39 (82%)
Query: 203 VAVRRVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 241
V+ +VVL S+PDME GFSR+LF QWCS+P NSIIIT R
Sbjct: 320 VSSPKVVLASSPDMESGFSRELFLQWCSNPNNSIIITTR 358
>gi|242021798|ref|XP_002431330.1| Cleavage and polyadenylation specificity factor 100 kDa subunit,
putative [Pediculus humanus corporis]
gi|212516598|gb|EEB18592.1| Cleavage and polyadenylation specificity factor 100 kDa subunit,
putative [Pediculus humanus corporis]
Length = 731
Score = 194 bits (493), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 89/126 (70%), Positives = 110/126 (87%), Gaps = 2/126 (1%)
Query: 24 LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTP 81
+ ++ E AK IEWMS+KLM+SFEGARNNPF FK+V+LCHS +EL+KVPSPKVVL STP
Sbjct: 272 VSYNTVEFAKSQIEWMSEKLMRSFEGARNNPFQFKYVQLCHSFSELSKVPSPKVVLASTP 331
Query: 82 DMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEE 141
DME GFSR+LF QW S+P NSII+T+RTSPGTLARDLIE GG+R +++++KKR++LEGEE
Sbjct: 332 DMESGFSRELFLQWSSNPLNSIILTSRTSPGTLARDLIENGGDRIISIEIKKRVKLEGEE 391
Query: 142 LEEYQK 147
LEEY K
Sbjct: 392 LEEYFK 397
Score = 90.5 bits (223), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 48/117 (41%), Positives = 64/117 (54%), Gaps = 39/117 (33%)
Query: 160 EKIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECG 219
E IP H+TS+INEL+LSDFKQ L +N I+CEF GVL CC G+VAVRR
Sbjct: 654 EDIPGHETSYINELKLSDFKQILNKNNINCEFSGGVLRCCHGSVAVRR------------ 701
Query: 220 FSRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
++ ++LEGCLS++YY+V++LL QYAI+
Sbjct: 702 ---------------------------HEAGRVILEGCLSEDYYKVKELLCQQYAIV 731
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 25/36 (69%), Positives = 30/36 (83%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
+VVL STPDME GFSR+LF QW S+P NSII+T+R
Sbjct: 324 KVVLASTPDMESGFSRELFLQWSSNPLNSIILTSRT 359
>gi|346465041|gb|AEO32365.1| hypothetical protein [Amblyomma maculatum]
Length = 644
Score = 192 bits (488), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 86/140 (61%), Positives = 112/140 (80%), Gaps = 2/140 (1%)
Query: 24 LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTP 81
+ +++ E AK +EWMSDK+M+SFEGARNNPF F+H++LCH LAELA+VP PKVVL S
Sbjct: 180 VSYNVVEFAKSQVEWMSDKVMRSFEGARNNPFQFRHLQLCHGLAELARVPEPKVVLASMA 239
Query: 82 DMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEE 141
DMECGFSRDLF QWCSSP NS+++T+RT+PGTLAR LIE + LT+ +KKR+RLEG E
Sbjct: 240 DMECGFSRDLFIQWCSSPRNSVVLTSRTAPGTLARQLIENPHQQALTITMKKRVRLEGSE 299
Query: 142 LEEYQKKKDKEAKDKQEKEK 161
LEEY +K+ + A + + E+
Sbjct: 300 LEEYMRKEKELAAARHKAER 319
Score = 72.0 bits (175), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 41/114 (35%), Positives = 57/114 (50%), Gaps = 40/114 (35%)
Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
++P H T FINE++LSDFKQ L RNG+ EF GVL C G VAVRR
Sbjct: 571 QVPGHATIFINEIKLSDFKQVLLRNGVQAEFSGGVLY-CNGIVAVRR------------- 616
Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYA 274
+++ I +EGCL ++Y++V+++LY QYA
Sbjct: 617 --------------------------NEAGRINIEGCLCEDYFKVREILYQQYA 644
Score = 64.3 bits (155), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 29/52 (55%), Positives = 34/52 (65%), Gaps = 5/52 (9%)
Query: 196 LICCRGTVAVRRV-----VLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
L C G + RV VL S DMECGFSRDLF QWCSSP NS+++T+R
Sbjct: 216 LQLCHGLAELARVPEPKVVLASMADMECGFSRDLFIQWCSSPRNSVVLTSRT 267
>gi|427789025|gb|JAA59964.1| Putative mrna cleavage and polyadenylation factor ii complex
subunit cft2 cpsf subunit [Rhipicephalus pulchellus]
Length = 646
Score = 190 bits (482), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 84/140 (60%), Positives = 113/140 (80%), Gaps = 2/140 (1%)
Query: 24 LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTP 81
+ +++ E AK +EWMSDK+M+SFEGARNNPF F+H++LCH +AELA+VP PKVVL S
Sbjct: 180 VSYNVVEFAKSQVEWMSDKVMRSFEGARNNPFQFRHLQLCHGMAELARVPEPKVVLASMA 239
Query: 82 DMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEE 141
DMECGFSR+LF QWCSSP NS+++T+R++PGTLAR LIE ++LT+ VKKR+RLEG E
Sbjct: 240 DMECGFSRELFIQWCSSPRNSVVLTSRSAPGTLARQLIENPHQQSLTITVKKRVRLEGSE 299
Query: 142 LEEYQKKKDKEAKDKQEKEK 161
LEEY +K+ + A + + E+
Sbjct: 300 LEEYMRKEKELAAARHKAER 319
Score = 75.1 bits (183), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 42/116 (36%), Positives = 59/116 (50%), Gaps = 40/116 (34%)
Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
++P H T F+NEL+LSDFKQ L RNG+ EF GVL C G VAVRR
Sbjct: 571 QVPGHATIFVNELKLSDFKQVLLRNGVQAEFSGGVLY-CNGIVAVRR------------- 616
Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
+++ I +EGCL ++Y++V+++LY QYAI+
Sbjct: 617 --------------------------NEAGRINIEGCLCEDYFKVREILYQQYAII 646
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 28/52 (53%), Positives = 35/52 (67%), Gaps = 5/52 (9%)
Query: 196 LICCRGTVAVRRV-----VLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
L C G + RV VL S DMECGFSR+LF QWCSSP NS+++T+R+
Sbjct: 216 LQLCHGMAELARVPEPKVVLASMADMECGFSRELFIQWCSSPRNSVVLTSRS 267
>gi|157112944|ref|XP_001657690.1| cleavage and polyadenylation specificity factor [Aedes aegypti]
gi|108884656|gb|EAT48881.1| AAEL000118-PA [Aedes aegypti]
Length = 744
Score = 183 bits (464), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 88/127 (69%), Positives = 105/127 (82%), Gaps = 3/127 (2%)
Query: 24 LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTP 81
+ +++ E AK IEWMSDKLMKSFEGARNNPF FKH++LCH++AELAKVPSPKVVL S+
Sbjct: 272 VSYNVVEFAKSQIEWMSDKLMKSFEGARNNPFQFKHLRLCHTMAELAKVPSPKVVLASSA 331
Query: 82 DMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGN-RTLTLQVKKRIRLEGE 140
DME GFSR+LF QW S+ NSIIIT R+SPGTLARDLIE GGN R + L V++R+ LEG
Sbjct: 332 DMESGFSRELFVQWASNVNNSIIITCRSSPGTLARDLIENGGNGRKIELDVRRRVELEGA 391
Query: 141 ELEEYQK 147
ELEEY +
Sbjct: 392 ELEEYMR 398
Score = 79.7 bits (195), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 43/117 (36%), Positives = 60/117 (51%), Gaps = 39/117 (33%)
Query: 160 EKIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECG 219
+++P H + FINEL+L DFKQ L + I EF GVL C GTVA+RRV
Sbjct: 667 DELPAHHSVFINELKLIDFKQVLMKANISSEFSGGVLWCNNGTVALRRV----------- 715
Query: 220 FSRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
D+ + +EGCLS+EYY++++LLY+QYAI+
Sbjct: 716 ----------------------------DTGKVTVEGCLSEEYYKIRELLYEQYAIV 744
Score = 50.8 bits (120), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 23/36 (63%), Positives = 28/36 (77%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
+VVL S+ DME GFSR+LF QW S+ NSIIIT R+
Sbjct: 324 KVVLASSADMESGFSRELFVQWASNVNNSIIITCRS 359
>gi|170046825|ref|XP_001850949.1| cleavage and polyadenylation specificity factor subunit 2 [Culex
quinquefasciatus]
gi|167869453|gb|EDS32836.1| cleavage and polyadenylation specificity factor subunit 2 [Culex
quinquefasciatus]
Length = 747
Score = 182 bits (462), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 86/127 (67%), Positives = 105/127 (82%), Gaps = 3/127 (2%)
Query: 24 LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTP 81
+ +++ E AK IEWMSDKLMKSFEGARNNPF FKH++LCH++A+LAKVPSPKVVL S+P
Sbjct: 272 VSYNVVEFAKSQIEWMSDKLMKSFEGARNNPFQFKHLRLCHTMADLAKVPSPKVVLASSP 331
Query: 82 DMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGN-RTLTLQVKKRIRLEGE 140
DME GFSR+LF QW + NSIIIT R+SPGTLARDLI+ GGN R L L V++R+ LEG
Sbjct: 332 DMESGFSRELFVQWAGNVNNSIIITCRSSPGTLARDLIDNGGNGRKLELDVRRRVELEGA 391
Query: 141 ELEEYQK 147
EL+EY +
Sbjct: 392 ELDEYMR 398
Score = 80.9 bits (198), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 46/142 (32%), Positives = 73/142 (51%), Gaps = 41/142 (28%)
Query: 137 LEGEELEEYQKKKDKE--AKDKQEKEKIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDG 194
++ +++E + K DK+ D +++P H++ FINEL+L DFKQ L + I EF G
Sbjct: 645 MDVDQVEITEDKSDKQILTLDPLLNDQLPAHNSVFINELKLIDFKQVLMKANIASEFSGG 704
Query: 195 VLICCRGTVAVRRVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRNRADDSDSNVIVL 254
VL C GT+A+RR+ D+ + +
Sbjct: 705 VLWCSNGTLALRRI---------------------------------------DTGKVTI 725
Query: 255 EGCLSDEYYRVQQLLYDQYAIL 276
EGCLS++YYR+++LLY+QYAI+
Sbjct: 726 EGCLSEDYYRIRELLYEQYAIV 747
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 32/77 (41%), Positives = 40/77 (51%), Gaps = 17/77 (22%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRNRAD-------DSDSNV-------- 251
+VVL S+PDME GFSR+LF QW + NSIIIT R+ D+ N
Sbjct: 324 KVVLASSPDMESGFSRELFVQWAGNVNNSIIITCRSSPGTLARDLIDNGGNGRKLELDVR 383
Query: 252 --IVLEGCLSDEYYRVQ 266
+ LEG DEY R +
Sbjct: 384 RRVELEGAELDEYMRTE 400
>gi|158290938|ref|XP_312464.4| AGAP002474-PA [Anopheles gambiae str. PEST]
gi|157018137|gb|EAA08192.4| AGAP002474-PA [Anopheles gambiae str. PEST]
Length = 745
Score = 182 bits (462), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 84/126 (66%), Positives = 105/126 (83%), Gaps = 3/126 (2%)
Query: 25 CHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPD 82
+++ E AK IEWMSDKLMKSFEGARNNPF FKH++LCH++A+LAKVPSPKVVL S+PD
Sbjct: 273 SYNVVEFAKSQIEWMSDKLMKSFEGARNNPFTFKHLRLCHTMADLAKVPSPKVVLASSPD 332
Query: 83 MECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGN-RTLTLQVKKRIRLEGEE 141
+E GFSR+LF QW + NSIIIT+R+SPGTLARDLIE GGN R + + +++R+ LEG E
Sbjct: 333 LESGFSRELFIQWAPNASNSIIITSRSSPGTLARDLIENGGNGRKIEMDIRRRVELEGAE 392
Query: 142 LEEYQK 147
LEEY +
Sbjct: 393 LEEYMR 398
Score = 84.0 bits (206), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 50/147 (34%), Positives = 76/147 (51%), Gaps = 42/147 (28%)
Query: 133 KRI-RLEGEELEEYQKKKDKE--AKDKQEKEKIPPHDTSFINELQLSDFKQTLQRNGIDC 189
KRI +E ++++ K DK+ + +E +PPH+ FINEL+L DFKQ L ++ I
Sbjct: 638 KRIDTMEVDDVDTIDDKMDKQILTLEPLAQEDLPPHNPVFINELKLIDFKQILMKSNIAS 697
Query: 190 EFMDGVLICCRGTVAVRRVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRNRADDSDS 249
EF GVL C GTVA+RRV D+
Sbjct: 698 EFSGGVLWCSNGTVALRRV---------------------------------------DT 718
Query: 250 NVIVLEGCLSDEYYRVQQLLYDQYAIL 276
+ +EGC+S++YY++++LLY+QYAI+
Sbjct: 719 GRVTIEGCISEDYYKIRELLYEQYAII 745
Score = 53.5 bits (127), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 22/36 (61%), Positives = 29/36 (80%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
+VVL S+PD+E GFSR+LF QW + NSIIIT+R+
Sbjct: 324 KVVLASSPDLESGFSRELFIQWAPNASNSIIITSRS 359
>gi|312375001|gb|EFR22454.1| hypothetical protein AND_15244 [Anopheles darlingi]
Length = 772
Score = 182 bits (462), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 85/127 (66%), Positives = 104/127 (81%), Gaps = 3/127 (2%)
Query: 24 LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTP 81
+ +++ E AK IEWMSDKLMKSFEGARNNPF FKH++LCH++A+LAKVPSPKVVL S+
Sbjct: 272 VSYNVVEFAKSQIEWMSDKLMKSFEGARNNPFTFKHLRLCHTMADLAKVPSPKVVLASSA 331
Query: 82 DMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGN-RTLTLQVKKRIRLEGE 140
DME GFSR+LF QW NSIIITNR+SPGTLARDLI+ GGN R + + V++R+ LEG
Sbjct: 332 DMESGFSRELFIQWAPQATNSIIITNRSSPGTLARDLIDNGGNGRKIEMDVRRRVELEGA 391
Query: 141 ELEEYQK 147
ELEEY +
Sbjct: 392 ELEEYMR 398
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 44/117 (37%), Positives = 62/117 (52%), Gaps = 39/117 (33%)
Query: 160 EKIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECG 219
E++PPH+ FINEL+L DFKQ L R+ I EF GVL C GTVA+RRV
Sbjct: 695 EELPPHNPVFINELKLIDFKQVLMRSNITSEFSGGVLWCSNGTVALRRV----------- 743
Query: 220 FSRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
D+ + +EGC+S++YY++++LLY+QYAI+
Sbjct: 744 ----------------------------DTGRVTIEGCISEDYYKIRELLYEQYAII 772
Score = 53.5 bits (127), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 23/36 (63%), Positives = 27/36 (75%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
+VVL S+ DME GFSR+LF QW NSIIITNR+
Sbjct: 324 KVVLASSADMESGFSRELFIQWAPQATNSIIITNRS 359
>gi|357610700|gb|EHJ67102.1| putative cleavage and polyadenylation specificity factor 100 kDa
subunit [Danaus plexippus]
Length = 818
Score = 174 bits (441), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 80/154 (51%), Positives = 113/154 (73%), Gaps = 2/154 (1%)
Query: 24 LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTP 81
+ +++ E AK IEWMSDKL ++FEGAR+NPF +H++LCHS+ E+ + P PKVVL S P
Sbjct: 272 VSYNVVEFAKSQIEWMSDKLTRAFEGARSNPFALRHLQLCHSVVEVTRTPGPKVVLASFP 331
Query: 82 DMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEE 141
D+E GF+RDLF QW + +NSI++T RTSPGTLARDLIE GG+RT+ L V++R+RLEG E
Sbjct: 332 DLETGFARDLFLQWAPNSQNSIVLTARTSPGTLARDLIEKGGDRTIELTVRRRVRLEGAE 391
Query: 142 LEEYQKKKDKEAKDKQEKEKIPPHDTSFINELQL 175
LEE+ +++ K +E+ D+ EL++
Sbjct: 392 LEEFMQQRVKVNNSVKEETGGISSDSESEGELEM 425
Score = 71.6 bits (174), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 36/113 (31%), Positives = 55/113 (48%), Gaps = 39/113 (34%)
Query: 164 PHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGFSRD 223
PH F+N ++LS+ + L RNG+ EF G L CC GT+A+RR+
Sbjct: 652 PHGAWFVNSVRLSELRAALARNGLGAEFSAGALECCNGTIAIRRL--------------- 696
Query: 224 LFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
++ + LEG LS+EY++V++LLYDQ+AI+
Sbjct: 697 ------------------------ENGRVALEGVLSEEYFKVRELLYDQFAIV 725
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 25/52 (48%), Positives = 31/52 (59%), Gaps = 5/52 (9%)
Query: 196 LICCRGTVAVRR-----VVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
L C V V R VVL S PD+E GF+RDLF QW + +NSI++T R
Sbjct: 308 LQLCHSVVEVTRTPGPKVVLASFPDLETGFARDLFLQWAPNSQNSIVLTART 359
>gi|193676458|ref|XP_001951701.1| PREDICTED: probable cleavage and polyadenylation specificity factor
subunit 2-like [Acyrthosiphon pisum]
Length = 729
Score = 173 bits (439), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 87/143 (60%), Positives = 107/143 (74%), Gaps = 5/143 (3%)
Query: 24 LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTP 81
+ ++ E AK IEWMSDKLMKSFEGARNNPF FKHVKLCH++ +L KV PKVVL S
Sbjct: 272 VSYNTVEFAKSQIEWMSDKLMKSFEGARNNPFFFKHVKLCHNMNDLNKVSEPKVVLASNG 331
Query: 82 DMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEE 141
D+E GFSR++F W S+ +NSII+T+RT+PGTLARDLI+ GG+R + L VKKR+ L+ E
Sbjct: 332 DLESGFSREVFIMWASNSKNSIILTDRTAPGTLARDLIDEGGDRNIKLIVKKRVPLDDNE 391
Query: 142 LEEYQKKKDKEAKDKQEKEKIPP 164
LEEY K D+E K E KI P
Sbjct: 392 LEEYNIKHDEE---KMEASKIDP 411
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 39/118 (33%), Positives = 53/118 (44%), Gaps = 39/118 (33%)
Query: 159 KEKIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMEC 218
+ +I P T FIN L+LSDFKQ L +N I CE GVL CC TV VRR
Sbjct: 651 EHEIHPRKTVFINRLKLSDFKQILSKNNIPCELSKGVLWCCNRTVCVRR----------- 699
Query: 219 GFSRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
+ S +++EG +S +YY ++ LLY Q+ I+
Sbjct: 700 ----------------------------NSSGKVLMEGIISRQYYYIRSLLYSQFIII 729
Score = 46.2 bits (108), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 20/39 (51%), Positives = 29/39 (74%)
Query: 203 VAVRRVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 241
V+ +VVL S D+E GFSR++F W S+ +NSII+T+R
Sbjct: 320 VSEPKVVLASNGDLESGFSREVFIMWASNSKNSIILTDR 358
>gi|223648270|gb|ACN10893.1| Cleavage and polyadenylation specificity factor subunit 2 [Salmo
salar]
Length = 796
Score = 173 bits (438), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 82/139 (58%), Positives = 107/139 (76%), Gaps = 2/139 (1%)
Query: 32 AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
+++EWMSDKLM+ FE RNNPF F+H+ LCHSLA+LA+VPSPKVVL S PD+E GFSR+L
Sbjct: 282 SQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHSLADLARVPSPKVVLCSQPDLESGFSREL 341
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKK-- 149
F QWC +NS+I+T RT+PGTL R LI+ G + L L+++KR++LEG ELEEY +K+
Sbjct: 342 FIQWCQDAKNSVILTYRTTPGTLGRYLIDNPGEKMLDLEIRKRVKLEGRELEEYLEKERM 401
Query: 150 DKEAKDKQEKEKIPPHDTS 168
KEA K E+EK D+S
Sbjct: 402 KKEAAKKLEQEKEVDVDSS 420
Score = 73.9 bits (180), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 43/116 (37%), Positives = 58/116 (50%), Gaps = 40/116 (34%)
Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
+IP H + FINE +LSDFKQ L R GI EF+ GVL+ C VAVRR
Sbjct: 721 EIPGHQSVFINEPRLSDFKQVLLREGIQAEFVGGVLV-CNNMVAVRR------------- 766
Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
+++ I LEGCL D+YY++++LLY QYA++
Sbjct: 767 --------------------------TEAGRIGLEGCLCDDYYKIRELLYQQYAVV 796
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 21/36 (58%), Positives = 27/36 (75%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
+VVL S PD+E GFSR+LF QWC +NS+I+T R
Sbjct: 324 KVVLCSQPDLESGFSRELFIQWCQDAKNSVILTYRT 359
>gi|213514628|ref|NP_001134023.1| cleavage and polyadenylation specificity factor subunit 2 [Salmo
salar]
gi|209156194|gb|ACI34329.1| Cleavage and polyadenylation specificity factor subunit 2 [Salmo
salar]
Length = 796
Score = 172 bits (437), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 82/139 (58%), Positives = 107/139 (76%), Gaps = 2/139 (1%)
Query: 32 AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
+++EWMSDKLM+ FE RNNPF F+H+ LCHSLA+LA+VPSPKVVL S PD+E GFSR+L
Sbjct: 282 SQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHSLADLARVPSPKVVLCSQPDLESGFSREL 341
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKK-- 149
F QWC +NS+I+T RT+PGTL R LI+ G + L L+++KR++LEG ELEEY +K+
Sbjct: 342 FIQWCQEAKNSVILTYRTTPGTLGRYLIDNPGEKMLDLEIRKRVKLEGRELEEYLEKERM 401
Query: 150 DKEAKDKQEKEKIPPHDTS 168
KEA K E+EK D+S
Sbjct: 402 KKEAAKKLEQEKEVDVDSS 420
Score = 73.9 bits (180), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 43/116 (37%), Positives = 58/116 (50%), Gaps = 40/116 (34%)
Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
+IP H + FINE +LSDFKQ L R GI EF+ GVL+ C VAVRR
Sbjct: 721 EIPGHQSVFINEPRLSDFKQVLLREGIQAEFVGGVLV-CNNIVAVRR------------- 766
Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
+++ I LEGCL D+YY++++LLY QYA++
Sbjct: 767 --------------------------TEAGRIGLEGCLCDDYYKIRELLYQQYAVV 796
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 21/36 (58%), Positives = 27/36 (75%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
+VVL S PD+E GFSR+LF QWC +NS+I+T R
Sbjct: 324 KVVLCSQPDLESGFSRELFIQWCQEAKNSVILTYRT 359
>gi|443725188|gb|ELU12868.1| hypothetical protein CAPTEDRAFT_155355 [Capitella teleta]
Length = 728
Score = 172 bits (435), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 81/156 (51%), Positives = 115/156 (73%), Gaps = 11/156 (7%)
Query: 24 LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTP 81
+ +++ E AK +EWMSDK+M+SFE RNNPF FKH++LCHS+AELAKVP PKVVL STP
Sbjct: 272 VAYNVVEFAKSQVEWMSDKIMRSFEERRNNPFQFKHLQLCHSMAELAKVPEPKVVLASTP 331
Query: 82 DMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEE 141
D++ GFSR+LF QWCS+P+N II+TNRT+P TL R LI+ ++ L+VK+R+RLEG
Sbjct: 332 DLQTGFSRELFVQWCSNPKNCIILTNRTAPPTLCRQLIDYPNRGSVRLEVKRRVRLEGRA 391
Query: 142 LEEYQK---------KKDKEAKDKQEKEKIPPHDTS 168
LE++ + +++K K+++E+E + D S
Sbjct: 392 LEDFLRAERERKAEVEREKAEKERREREGLESSDDS 427
Score = 68.9 bits (167), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 46/140 (32%), Positives = 66/140 (47%), Gaps = 41/140 (29%)
Query: 138 EGEELEEYQKKKDKEAKDKQ-EKEKIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVL 196
+ E +E+ + +D+EA +I PH F+NE +LSDFKQTL G+ EF GVL
Sbjct: 629 QAEAMEDGENPEDEEAVPVVIPTSQIRPHGAVFVNEPKLSDFKQTLVNLGVQAEFSAGVL 688
Query: 197 ICCRGTVAVRRVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEG 256
IC NS++ +N A + LEG
Sbjct: 689 IC-----------------------------------NSVVAVRKNEA-----GRLQLEG 708
Query: 257 CLSDEYYRVQQLLYDQYAIL 276
L D+YYR++QLLY+Q+AI+
Sbjct: 709 TLCDDYYRIRQLLYEQFAIV 728
Score = 60.5 bits (145), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 24/36 (66%), Positives = 31/36 (86%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
+VVL STPD++ GFSR+LF QWCS+P+N II+TNR
Sbjct: 324 KVVLASTPDLQTGFSRELFVQWCSNPKNCIILTNRT 359
>gi|50539828|ref|NP_001002384.1| cleavage and polyadenylation specificity factor subunit 2 [Danio
rerio]
gi|49903850|gb|AAH76029.1| Cleavage and polyadenylation specific factor 2 [Danio rerio]
Length = 790
Score = 170 bits (430), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 81/139 (58%), Positives = 106/139 (76%), Gaps = 2/139 (1%)
Query: 32 AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
+++EWMSDKLM+ FE RNNPF F+H+ LCHSL++LA+VPSPKVVL S PD+E GFSR+L
Sbjct: 282 SQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHSLSDLARVPSPKVVLCSQPDLESGFSREL 341
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKK-- 149
F QWC +NS+I+T RT+PGTLAR LI+ G + + L+++KR RLEG ELEEY +K+
Sbjct: 342 FIQWCQDAKNSVILTYRTTPGTLARYLIDNPGEKRIELEIRKRCRLEGRELEEYMEKERM 401
Query: 150 DKEAKDKQEKEKIPPHDTS 168
KEA K E+ K D+S
Sbjct: 402 KKEAAKKLEQAKEVDLDSS 420
Score = 72.8 bits (177), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 42/116 (36%), Positives = 58/116 (50%), Gaps = 40/116 (34%)
Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
++P H + FINE +LSDFKQ L R GI EF+ GVL+ C VAVRR
Sbjct: 715 EVPGHQSVFINEPRLSDFKQVLLREGIQAEFVGGVLV-CNNLVAVRR------------- 760
Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
+++ I LEGC D+YYR+++LLY+QYA++
Sbjct: 761 --------------------------TEAGRICLEGCHCDDYYRIRELLYEQYAVV 790
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 21/36 (58%), Positives = 27/36 (75%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
+VVL S PD+E GFSR+LF QWC +NS+I+T R
Sbjct: 324 KVVLCSQPDLESGFSRELFIQWCQDAKNSVILTYRT 359
>gi|47224566|emb|CAG03550.1| unnamed protein product [Tetraodon nigroviridis]
Length = 765
Score = 169 bits (429), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 82/132 (62%), Positives = 102/132 (77%), Gaps = 2/132 (1%)
Query: 32 AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
+++EWMSDKLM+ FE RNNPF F+H+ LCHSLA+LA+VPSPKVVL S PD+E GFSR+L
Sbjct: 258 SQVEWMSDKLMRCFEDKRNNPFQFRHLTLCHSLADLARVPSPKVVLCSQPDLESGFSREL 317
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKK-- 149
F QW +NSII+T RT+PGTLAR LI+ G + L L+V+KR+RLEG ELEEY +K
Sbjct: 318 FIQWSKDSKNSIILTYRTTPGTLARYLIDNPGEKHLDLEVRKRVRLEGRELEEYLEKDRI 377
Query: 150 DKEAKDKQEKEK 161
KEA K E+ K
Sbjct: 378 KKEAAKKLEQAK 389
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 41/116 (35%), Positives = 58/116 (50%), Gaps = 40/116 (34%)
Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
++P H + FINE +LSDFKQ L R GI EF+ GVL+ C VAVRR
Sbjct: 690 EVPGHQSVFINEPRLSDFKQVLLREGIQAEFVGGVLV-CNNMVAVRR------------- 735
Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
+++ I LEGCL ++YY++++LLY QYA++
Sbjct: 736 --------------------------TEAGRISLEGCLCEDYYKIRELLYQQYAVV 765
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/36 (58%), Positives = 26/36 (72%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
+VVL S PD+E GFSR+LF QW +NSII+T R
Sbjct: 300 KVVLCSQPDLESGFSRELFIQWSKDSKNSIILTYRT 335
>gi|74194185|dbj|BAE24650.1| unnamed protein product [Mus musculus]
Length = 396
Score = 167 bits (423), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 72/114 (63%), Positives = 94/114 (82%)
Query: 32 AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
+++EWMSDKLM+ FE RNNPF F+H+ LCH L++LA+VPSPKVVL S PD+ECGFSRDL
Sbjct: 282 SQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDL 341
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEY 145
F QWC P+NSII+T RT+PGTLAR LI+ + ++++KR++LEG+ELEEY
Sbjct: 342 FIQWCQDPKNSIILTYRTTPGTLARFLIDNPTEKVTEIELRKRVKLEGKELEEY 395
Score = 62.0 bits (149), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 25/36 (69%), Positives = 29/36 (80%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
+VVL S PD+ECGFSRDLF QWC P+NSII+T R
Sbjct: 324 KVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 359
>gi|410916717|ref|XP_003971833.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
2-like [Takifugu rubripes]
Length = 787
Score = 167 bits (422), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 80/132 (60%), Positives = 102/132 (77%), Gaps = 2/132 (1%)
Query: 32 AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
+++EWMSDKLM+ FE RNNPF F+H+ LCHSLA+LA+VPSPKVVL S PD+E GFSR+L
Sbjct: 282 SQVEWMSDKLMRCFEDKRNNPFQFRHLTLCHSLADLARVPSPKVVLCSQPDLESGFSREL 341
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKK-- 149
F QW +NSII+T RT+PGTL+R LI+ G + L L+V+KR++LEG ELEEY +K
Sbjct: 342 FIQWSKDAKNSIILTYRTTPGTLSRYLIDNPGEKHLDLEVRKRVKLEGRELEEYLEKDRV 401
Query: 150 DKEAKDKQEKEK 161
KEA K E+ K
Sbjct: 402 KKEAAKKLEQAK 413
Score = 71.6 bits (174), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 41/116 (35%), Positives = 58/116 (50%), Gaps = 40/116 (34%)
Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
++P H + FINE +LSDFKQ L R GI EF+ GVL+ C VAVRR
Sbjct: 712 EVPGHQSVFINEPRLSDFKQVLLREGIQAEFVGGVLV-CNNMVAVRR------------- 757
Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
+++ I LEGCL ++YY++++LLY QYA++
Sbjct: 758 --------------------------TEAGRIGLEGCLCEDYYKIRELLYQQYAVV 787
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/36 (58%), Positives = 26/36 (72%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
+VVL S PD+E GFSR+LF QW +NSII+T R
Sbjct: 324 KVVLCSQPDLESGFSRELFIQWSKDAKNSIILTYRT 359
>gi|328722057|ref|XP_001949295.2| PREDICTED: probable cleavage and polyadenylation specificity factor
subunit 2-like [Acyrthosiphon pisum]
Length = 724
Score = 167 bits (422), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 84/143 (58%), Positives = 106/143 (74%), Gaps = 5/143 (3%)
Query: 24 LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTP 81
+ ++ E AK IEWMSDKLMKSFEGARNNPF FKHVKLCH++ +L KV PKVVL S
Sbjct: 272 VSYNTVEFAKSQIEWMSDKLMKSFEGARNNPFIFKHVKLCHNMNDLKKVSEPKVVLASHG 331
Query: 82 DMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEE 141
D+E GFSR++F W S+P+NSII+T+R +PGTLAR+LI+ G +R + L VKKR+ L+ E
Sbjct: 332 DLENGFSREVFIMWASNPKNSIILTDRAAPGTLARNLIDGGSDRNIKLIVKKRVPLDENE 391
Query: 142 LEEYQKKKDKEAKDKQEKEKIPP 164
LEEY K +KE K E K+ P
Sbjct: 392 LEEYNIKYEKE---KMEGSKMDP 411
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 39/116 (33%), Positives = 52/116 (44%), Gaps = 39/116 (33%)
Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
+I PH T FIN L+LSDFKQ L + I CE GVL CC TV VRR
Sbjct: 648 EIHPHKTVFINRLKLSDFKQILSKKNIPCELSKGVLWCCNRTVCVRR------------- 694
Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
+ S +++EG +S +YY ++ LLY Q+ I+
Sbjct: 695 --------------------------NSSGKVLMEGIISRQYYYIRSLLYSQFIII 724
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 23/50 (46%), Positives = 33/50 (66%)
Query: 203 VAVRRVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRNRADDSDSNVI 252
V+ +VVL S D+E GFSR++F W S+P+NSII+T+R N+I
Sbjct: 320 VSEPKVVLASHGDLENGFSREVFIMWASNPKNSIILTDRAAPGTLARNLI 369
>gi|390333491|ref|XP_780045.3| PREDICTED: cleavage and polyadenylation specificity factor subunit
2 isoform 1 [Strongylocentrotus purpuratus]
Length = 773
Score = 166 bits (419), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 80/136 (58%), Positives = 104/136 (76%), Gaps = 3/136 (2%)
Query: 24 LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTP 81
+ +++ E AK +EWMSDK+M++FE RNNPF FKH+KLCH+L ELAKVP PKVVL S P
Sbjct: 272 VSYNVVEFAKSQVEWMSDKVMRAFEDRRNNPFQFKHLKLCHNLKELAKVPDPKVVLASVP 331
Query: 82 DMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEE 141
D+ECG+SR+LF QW +NS+I+TNRTS GTLAR LIE L L+V KR++LE EE
Sbjct: 332 DLECGYSRELFIQWSGDAKNSVILTNRTSHGTLARRLIETPNPNQLKLRVSKRVKLEKEE 391
Query: 142 LEEYQ-KKKDKEAKDK 156
L+EY+ +K+KE + K
Sbjct: 392 LDEYRIHEKEKERQRK 407
Score = 61.2 bits (147), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 36/122 (29%), Positives = 55/122 (45%), Gaps = 40/122 (32%)
Query: 155 DKQEKEKIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTP 214
D + ++ PH ++N + DFKQ L +NGI EF GVL+ C TVA++R
Sbjct: 692 DAVQTNEVTPHRQVYVNPPRFLDFKQVLAKNGIRAEFTGGVLV-CNNTVAIKR------- 743
Query: 215 DMECGFSRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYA 274
++ + LEG + D+YY V++LLY+QYA
Sbjct: 744 --------------------------------NEKGHLTLEGAVCDDYYTVRELLYEQYA 771
Query: 275 IL 276
I+
Sbjct: 772 IV 773
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 21/35 (60%), Positives = 28/35 (80%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 241
+VVL S PD+ECG+SR+LF QW +NS+I+TNR
Sbjct: 324 KVVLASVPDLECGYSRELFIQWSGDAKNSVILTNR 358
>gi|260822471|ref|XP_002606625.1| hypothetical protein BRAFLDRAFT_209615 [Branchiostoma floridae]
gi|229291969|gb|EEN62635.1| hypothetical protein BRAFLDRAFT_209615 [Branchiostoma floridae]
Length = 607
Score = 165 bits (418), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 74/124 (59%), Positives = 96/124 (77%), Gaps = 2/124 (1%)
Query: 24 LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTP 81
+ +++ E AK +EWMSDK+M+ FE RNNPF FKH+KLCHSL+EL KVP PKVVL S P
Sbjct: 89 VAYNVVEFAKSQVEWMSDKIMRVFEDNRNNPFQFKHLKLCHSLSELHKVPDPKVVLASVP 148
Query: 82 DMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEE 141
D+E GFSR+LF QWC + +N++++T+R PGTL R LI+ +T TLQ +KR+RLEG E
Sbjct: 149 DLESGFSRELFVQWCQNQKNTVVLTSRPGPGTLGRMLIDNPKMKTFTLQARKRVRLEGPE 208
Query: 142 LEEY 145
LEEY
Sbjct: 209 LEEY 212
Score = 70.5 bits (171), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 40/116 (34%), Positives = 58/116 (50%), Gaps = 40/116 (34%)
Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
+IP H+ FIN+ +LSD KQ LQ+ GI EF GVLIC
Sbjct: 532 QIPGHEAVFINKPRLSDIKQVLQKEGIQAEFSGGVLIC---------------------- 569
Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
N+++ RN +S I +EGC+ ++YY+V++LLY+QYAI+
Sbjct: 570 -------------NNVVALKRN-----ESGRIGMEGCICEDYYKVRKLLYEQYAIV 607
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 19/35 (54%), Positives = 29/35 (82%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 241
+VVL S PD+E GFSR+LF QWC + +N++++T+R
Sbjct: 141 KVVLASVPDLESGFSRELFVQWCQNQKNTVVLTSR 175
>gi|195145330|ref|XP_002013649.1| GL24248 [Drosophila persimilis]
gi|194102592|gb|EDW24635.1| GL24248 [Drosophila persimilis]
Length = 583
Score = 165 bits (417), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 77/126 (61%), Positives = 99/126 (78%), Gaps = 4/126 (3%)
Query: 24 LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPS-PKVVLVST 80
+ +++ E AK IEWMSDKL K+FEGARNNPF FKH++LCH+LA++ K+P+ PKVVL ST
Sbjct: 98 VSYNVVEFAKSQIEWMSDKLTKAFEGARNNPFQFKHIQLCHTLADVYKLPAGPKVVLAST 157
Query: 81 PDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIE-LGGNRTLTLQVKKRIRLEG 139
PD+E GF+RDLF QW S+ NSII+T RTSPGTLA +L+E R + L V++R+ LEG
Sbjct: 158 PDLESGFTRDLFIQWASNANNSIILTTRTSPGTLAMELVENYAPGRQIELDVRRRVELEG 217
Query: 140 EELEEY 145
ELEEY
Sbjct: 218 AELEEY 223
Score = 87.8 bits (216), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 46/119 (38%), Positives = 65/119 (54%), Gaps = 39/119 (32%)
Query: 158 EKEKIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDME 217
E+++IP H++ INEL+LSDFKQ L RN I+ EF GVL C GT+A+RRV
Sbjct: 504 EEDEIPVHNSVLINELKLSDFKQILLRNNINSEFSGGVLWCTNGTLALRRV--------- 554
Query: 218 CGFSRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
D+ + +EGCLS+EYY++++LLY+QYAI+
Sbjct: 555 ------------------------------DAGKVAMEGCLSEEYYKIRELLYEQYAIV 583
Score = 57.0 bits (136), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 23/35 (65%), Positives = 28/35 (80%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 241
+VVL STPD+E GF+RDLF QW S+ NSII+T R
Sbjct: 151 KVVLASTPDLESGFTRDLFIQWASNANNSIILTTR 185
>gi|195449222|ref|XP_002071979.1| GK22564 [Drosophila willistoni]
gi|194168064|gb|EDW82965.1| GK22564 [Drosophila willistoni]
Length = 757
Score = 163 bits (413), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 78/130 (60%), Positives = 100/130 (76%), Gaps = 4/130 (3%)
Query: 24 LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPS-PKVVLVST 80
+ +++ E AK IEWMSDKL K+FEGARNNPF FKH+ LCHSLA++ K+P+ PKVVL ST
Sbjct: 272 VSYNVIEFAKSQIEWMSDKLTKAFEGARNNPFQFKHINLCHSLADVFKLPAGPKVVLAST 331
Query: 81 PDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELG-GNRTLTLQVKKRIRLEG 139
PDME GF+RDLF QW ++P NSII T RTSPG+LA +L+E R + L V++R+ LEG
Sbjct: 332 PDMESGFTRDLFVQWAANPNNSIIFTTRTSPGSLAMELVENAVPGRKIELDVRRRVELEG 391
Query: 140 EELEEYQKKK 149
ELEEY + +
Sbjct: 392 PELEEYLRTQ 401
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 43/119 (36%), Positives = 64/119 (53%), Gaps = 39/119 (32%)
Query: 158 EKEKIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDME 217
E+++IP H++ INEL+LSDFKQ L RN I+ EF GVL C T+A+RR+
Sbjct: 678 EEDEIPVHNSVLINELKLSDFKQILMRNNINSEFSGGVLWCSNNTLALRRI--------- 728
Query: 218 CGFSRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
D+ + +EGCLS++YY++++LLY+QYAI+
Sbjct: 729 ------------------------------DAGKVSMEGCLSEDYYKIRELLYEQYAIV 757
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 24/35 (68%), Positives = 28/35 (80%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 241
+VVL STPDME GF+RDLF QW ++P NSII T R
Sbjct: 325 KVVLASTPDMESGFTRDLFVQWAANPNNSIIFTTR 359
>gi|198452192|ref|XP_002137430.1| GA26549 [Drosophila pseudoobscura pseudoobscura]
gi|198131825|gb|EDY67988.1| GA26549 [Drosophila pseudoobscura pseudoobscura]
Length = 757
Score = 163 bits (412), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 76/126 (60%), Positives = 98/126 (77%), Gaps = 4/126 (3%)
Query: 24 LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPS-PKVVLVST 80
+ +++ E AK IEWMSDKL K+FEGARNNPF FKH++LCH+LA++ K+P+ PKVVL ST
Sbjct: 272 VSYNVVEFAKSQIEWMSDKLTKAFEGARNNPFQFKHIQLCHTLADVYKLPAGPKVVLAST 331
Query: 81 PDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIE-LGGNRTLTLQVKKRIRLEG 139
PD+E GF+RDLF QW + NSII+T RTSPGTLA +L+E R + L V++R+ LEG
Sbjct: 332 PDLESGFTRDLFIQWAGNANNSIILTTRTSPGTLAMELVENYAPGRQIELDVRRRVELEG 391
Query: 140 EELEEY 145
ELEEY
Sbjct: 392 AELEEY 397
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 22/35 (62%), Positives = 27/35 (77%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 241
+VVL STPD+E GF+RDLF QW + NSII+T R
Sbjct: 325 KVVLASTPDLESGFTRDLFIQWAGNANNSIILTTR 359
Score = 45.4 bits (106), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 16/29 (55%), Positives = 26/29 (89%)
Query: 248 DSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
D+ + +EGCLS+EYY++++LLY+QYAI+
Sbjct: 729 DAGKVAMEGCLSEEYYKIRELLYEQYAIV 757
>gi|24650920|ref|NP_733264.1| cleavage and polyadenylation specificity factor 100, isoform B
[Drosophila melanogaster]
gi|23172526|gb|AAN14148.1| cleavage and polyadenylation specificity factor 100, isoform B
[Drosophila melanogaster]
Length = 664
Score = 162 bits (410), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 77/126 (61%), Positives = 99/126 (78%), Gaps = 4/126 (3%)
Query: 24 LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPS-PKVVLVST 80
+ +++ E AK IEWMSDKL K+FEGARNNPF FKH++LCHSLA++ K+P+ PKVVL ST
Sbjct: 180 VSYNVIEFAKSQIEWMSDKLTKAFEGARNNPFQFKHIQLCHSLADVYKLPAGPKVVLAST 239
Query: 81 PDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIE-LGGNRTLTLQVKKRIRLEG 139
PD+E GF+RDLF QW S+ NSII+T RTSPGTLA +L+E + + L V++R+ LEG
Sbjct: 240 PDLESGFTRDLFVQWASNANNSIILTTRTSPGTLAMELVENCAPGKQIELDVRRRVDLEG 299
Query: 140 EELEEY 145
ELEEY
Sbjct: 300 AELEEY 305
Score = 88.2 bits (217), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 46/117 (39%), Positives = 64/117 (54%), Gaps = 39/117 (33%)
Query: 160 EKIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECG 219
++IP H++ INEL+LSDFKQTL RN I+ EF GVL C GT+A+RRV
Sbjct: 587 DEIPIHNSVLINELKLSDFKQTLMRNNINSEFSGGVLWCSNGTLALRRV----------- 635
Query: 220 FSRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
D+ + +EGCLS+EYY++++LLY+QYAI+
Sbjct: 636 ----------------------------DAGKVAMEGCLSEEYYKIRELLYEQYAIV 664
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 23/35 (65%), Positives = 28/35 (80%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 241
+VVL STPD+E GF+RDLF QW S+ NSII+T R
Sbjct: 233 KVVLASTPDLESGFTRDLFVQWASNANNSIILTTR 267
>gi|194906654|ref|XP_001981406.1| GG11633 [Drosophila erecta]
gi|190656044|gb|EDV53276.1| GG11633 [Drosophila erecta]
Length = 756
Score = 162 bits (410), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 77/130 (59%), Positives = 101/130 (77%), Gaps = 4/130 (3%)
Query: 24 LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPS-PKVVLVST 80
+ +++ E AK IEWMSDKL K+FEGARNNPF FKH++LCHSLA++ K+P+ PKVVL ST
Sbjct: 272 VSYNVIEFAKSQIEWMSDKLTKAFEGARNNPFQFKHIQLCHSLADVYKLPAGPKVVLAST 331
Query: 81 PDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIE-LGGNRTLTLQVKKRIRLEG 139
PD+E GF+RDLF QW S+ NSII+T RTSPGTLA +L+E + + L V++R+ LEG
Sbjct: 332 PDLESGFTRDLFVQWASNANNSIILTTRTSPGTLAMELVENCAPGKQIELDVRRRVELEG 391
Query: 140 EELEEYQKKK 149
ELEEY + +
Sbjct: 392 AELEEYLRTQ 401
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 45/119 (37%), Positives = 64/119 (53%), Gaps = 39/119 (32%)
Query: 158 EKEKIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDME 217
+ ++IP H++ INEL+LSDFKQ L RN I+ EF GVL C GT+A+RRV
Sbjct: 677 DDDEIPIHNSVLINELKLSDFKQILMRNNINSEFSGGVLWCSNGTLALRRV--------- 727
Query: 218 CGFSRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
D+ + +EGCLS+EYY++++LLY+QYAI+
Sbjct: 728 ------------------------------DAGKVAMEGCLSEEYYKIRELLYEQYAIV 756
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 23/35 (65%), Positives = 28/35 (80%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 241
+VVL STPD+E GF+RDLF QW S+ NSII+T R
Sbjct: 325 KVVLASTPDLESGFTRDLFVQWASNANNSIILTTR 359
>gi|195341087|ref|XP_002037143.1| GM12754 [Drosophila sechellia]
gi|194131259|gb|EDW53302.1| GM12754 [Drosophila sechellia]
Length = 743
Score = 162 bits (410), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 77/130 (59%), Positives = 101/130 (77%), Gaps = 4/130 (3%)
Query: 24 LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPS-PKVVLVST 80
+ +++ E AK IEWMSDKL K+FEGARNNPF FKH++LCHSLA++ K+P+ PKVVL ST
Sbjct: 272 VSYNVIEFAKSQIEWMSDKLTKAFEGARNNPFQFKHIQLCHSLADVYKLPAGPKVVLAST 331
Query: 81 PDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIE-LGGNRTLTLQVKKRIRLEG 139
PD+E GF+RDLF QW S+ NSII+T RTSPGTLA +L+E + + L V++R+ LEG
Sbjct: 332 PDLESGFTRDLFVQWASNANNSIILTTRTSPGTLAMELVENCAPGKQIELDVRRRVDLEG 391
Query: 140 EELEEYQKKK 149
ELEEY + +
Sbjct: 392 AELEEYLRTQ 401
Score = 87.0 bits (214), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 46/117 (39%), Positives = 64/117 (54%), Gaps = 39/117 (33%)
Query: 160 EKIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECG 219
++IP H++ INEL+LSDFKQTL RN I+ EF GVL C GT+A+RRV
Sbjct: 666 DEIPIHNSVLINELKLSDFKQTLLRNNINSEFSGGVLWCSNGTLALRRV----------- 714
Query: 220 FSRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
D+ + +EGCLS+EYY++++LLY+QYAI+
Sbjct: 715 ----------------------------DAGKVAMEGCLSEEYYKIRELLYEQYAIV 743
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 23/35 (65%), Positives = 28/35 (80%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 241
+VVL STPD+E GF+RDLF QW S+ NSII+T R
Sbjct: 325 KVVLASTPDLESGFTRDLFVQWASNANNSIILTTR 359
>gi|195503417|ref|XP_002098643.1| GE26465, isoform A [Drosophila yakuba]
gi|194184744|gb|EDW98355.1| GE26465, isoform A [Drosophila yakuba]
Length = 756
Score = 162 bits (410), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 77/130 (59%), Positives = 101/130 (77%), Gaps = 4/130 (3%)
Query: 24 LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPS-PKVVLVST 80
+ +++ E AK IEWMSDKL K+FEGARNNPF FKH++LCHSLA++ K+P+ PKVVL ST
Sbjct: 272 VSYNVIEFAKSQIEWMSDKLTKAFEGARNNPFQFKHIQLCHSLADVYKLPAGPKVVLAST 331
Query: 81 PDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIE-LGGNRTLTLQVKKRIRLEG 139
PD+E GF+RDLF QW S+ NSII+T RTSPGTLA +L+E + + L V++R+ LEG
Sbjct: 332 PDLESGFTRDLFVQWASNANNSIILTTRTSPGTLAMELVENCAPGKQIELDVRRRVELEG 391
Query: 140 EELEEYQKKK 149
ELEEY + +
Sbjct: 392 AELEEYLRTQ 401
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 45/117 (38%), Positives = 63/117 (53%), Gaps = 39/117 (33%)
Query: 160 EKIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECG 219
++IP H++ INEL+LSDFKQ L RN I+ EF GVL C GT+A+RRV
Sbjct: 679 DEIPIHNSVLINELKLSDFKQILMRNNINSEFSGGVLWCSNGTLALRRV----------- 727
Query: 220 FSRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
D+ + +EGCLS+EYY++++LLY+QYAI+
Sbjct: 728 ----------------------------DAGKVAMEGCLSEEYYKIRELLYEQYAIV 756
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 23/35 (65%), Positives = 28/35 (80%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 241
+VVL STPD+E GF+RDLF QW S+ NSII+T R
Sbjct: 325 KVVLASTPDLESGFTRDLFVQWASNANNSIILTTR 359
>gi|157822735|ref|NP_001100223.1| cleavage and polyadenylation specificity factor subunit 2 [Rattus
norvegicus]
gi|149025374|gb|EDL81741.1| cleavage and polyadenylation specific factor 2 (predicted) [Rattus
norvegicus]
Length = 782
Score = 162 bits (409), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 69/111 (62%), Positives = 91/111 (81%)
Query: 32 AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
+++EWMSDKLM+ FE RNNPF F+H+ LCH L++LA+VPSPKVVL S PD+ECGFSRDL
Sbjct: 282 SQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDL 341
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEEL 142
F QWC P+NSII+T RT+PGTLAR LI+ + ++++KR++LEG+EL
Sbjct: 342 FIQWCQDPKNSIILTYRTTPGTLARFLIDNPSEKVTEIELRKRVKLEGKEL 392
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 41/116 (35%), Positives = 57/116 (49%), Gaps = 40/116 (34%)
Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
++P H + F+NE +LSDFKQ L R GI EF+ GVL+ C VAVRR
Sbjct: 707 EVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNQVAVRR------------- 752
Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
+++ I LEGCL ++YR++ LLY+QYAI+
Sbjct: 753 --------------------------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 25/36 (69%), Positives = 29/36 (80%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
+VVL S PD+ECGFSRDLF QWC P+NSII+T R
Sbjct: 324 KVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 359
>gi|354494117|ref|XP_003509185.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
2 [Cricetulus griseus]
Length = 782
Score = 162 bits (409), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 69/111 (62%), Positives = 91/111 (81%)
Query: 32 AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
+++EWMSDKLM+ FE RNNPF F+H+ LCH L++LA+VPSPKVVL S PD+ECGFSRDL
Sbjct: 282 SQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDL 341
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEEL 142
F QWC P+NSII+T RT+PGTLAR LI+ + ++++KR++LEG+EL
Sbjct: 342 FIQWCQDPKNSIILTYRTTPGTLARFLIDNPSEKVTEIELRKRVKLEGKEL 392
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 41/116 (35%), Positives = 57/116 (49%), Gaps = 40/116 (34%)
Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
++P H + F+NE +LSDFKQ L R GI EF+ GVL+ C VAVRR
Sbjct: 707 EVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNQVAVRR------------- 752
Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
+++ I LEGCL ++YR++ LLY+QYAI+
Sbjct: 753 --------------------------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 25/36 (69%), Positives = 29/36 (80%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
+VVL S PD+ECGFSRDLF QWC P+NSII+T R
Sbjct: 324 KVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 359
>gi|195503420|ref|XP_002098644.1| GE26465, isoform B [Drosophila yakuba]
gi|194184745|gb|EDW98356.1| GE26465, isoform B [Drosophila yakuba]
Length = 548
Score = 162 bits (409), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 77/130 (59%), Positives = 101/130 (77%), Gaps = 4/130 (3%)
Query: 24 LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPS-PKVVLVST 80
+ +++ E AK IEWMSDKL K+FEGARNNPF FKH++LCHSLA++ K+P+ PKVVL ST
Sbjct: 272 VSYNVIEFAKSQIEWMSDKLTKAFEGARNNPFQFKHIQLCHSLADVYKLPAGPKVVLAST 331
Query: 81 PDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIE-LGGNRTLTLQVKKRIRLEG 139
PD+E GF+RDLF QW S+ NSII+T RTSPGTLA +L+E + + L V++R+ LEG
Sbjct: 332 PDLESGFTRDLFVQWASNANNSIILTTRTSPGTLAMELVENCAPGKQIELDVRRRVELEG 391
Query: 140 EELEEYQKKK 149
ELEEY + +
Sbjct: 392 AELEEYLRTQ 401
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 23/35 (65%), Positives = 28/35 (80%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 241
+VVL STPD+E GF+RDLF QW S+ NSII+T R
Sbjct: 325 KVVLASTPDLESGFTRDLFVQWASNANNSIILTTR 359
>gi|21358013|ref|NP_651658.1| cleavage and polyadenylation specificity factor 100, isoform A
[Drosophila melanogaster]
gi|18203548|sp|Q9V3D6.1|CPSF2_DROME RecName: Full=Probable cleavage and polyadenylation specificity
factor subunit 2; AltName: Full=Cleavage and
polyadenylation specificity factor 100 kDa subunit;
Short=CPSF 100 kDa subunit
gi|5679134|gb|AAD46873.1|AF160933_1 LD14168p [Drosophila melanogaster]
gi|7301732|gb|AAF56844.1| cleavage and polyadenylation specificity factor 100, isoform A
[Drosophila melanogaster]
Length = 756
Score = 162 bits (409), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 77/130 (59%), Positives = 101/130 (77%), Gaps = 4/130 (3%)
Query: 24 LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPS-PKVVLVST 80
+ +++ E AK IEWMSDKL K+FEGARNNPF FKH++LCHSLA++ K+P+ PKVVL ST
Sbjct: 272 VSYNVIEFAKSQIEWMSDKLTKAFEGARNNPFQFKHIQLCHSLADVYKLPAGPKVVLAST 331
Query: 81 PDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIE-LGGNRTLTLQVKKRIRLEG 139
PD+E GF+RDLF QW S+ NSII+T RTSPGTLA +L+E + + L V++R+ LEG
Sbjct: 332 PDLESGFTRDLFVQWASNANNSIILTTRTSPGTLAMELVENCAPGKQIELDVRRRVDLEG 391
Query: 140 EELEEYQKKK 149
ELEEY + +
Sbjct: 392 AELEEYLRTQ 401
Score = 87.8 bits (216), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 46/117 (39%), Positives = 64/117 (54%), Gaps = 39/117 (33%)
Query: 160 EKIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECG 219
++IP H++ INEL+LSDFKQTL RN I+ EF GVL C GT+A+RRV
Sbjct: 679 DEIPIHNSVLINELKLSDFKQTLMRNNINSEFSGGVLWCSNGTLALRRV----------- 727
Query: 220 FSRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
D+ + +EGCLS+EYY++++LLY+QYAI+
Sbjct: 728 ----------------------------DAGKVAMEGCLSEEYYKIRELLYEQYAIV 756
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 23/35 (65%), Positives = 28/35 (80%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 241
+VVL STPD+E GF+RDLF QW S+ NSII+T R
Sbjct: 325 KVVLASTPDLESGFTRDLFVQWASNANNSIILTTR 359
>gi|13938095|gb|AAH07163.1| Cpsf2 protein, partial [Mus musculus]
Length = 732
Score = 161 bits (408), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 69/111 (62%), Positives = 91/111 (81%)
Query: 32 AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
+++EWMSDKLM+ FE RNNPF F+H+ LCH L++LA+VPSPKVVL S PD+ECGFSRDL
Sbjct: 232 SQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDL 291
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEEL 142
F QWC P+NSII+T RT+PGTLAR LI+ + ++++KR++LEG+EL
Sbjct: 292 FIQWCQDPKNSIILTYRTTPGTLARFLIDNPTEKVTEIELRKRVKLEGKEL 342
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 41/116 (35%), Positives = 57/116 (49%), Gaps = 40/116 (34%)
Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
++P H + F+NE +LSDFKQ L R GI EF+ GVL+ C VAVRR
Sbjct: 657 EVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNQVAVRR------------- 702
Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
+++ I LEGCL ++YR++ LLY+QYAI+
Sbjct: 703 --------------------------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 732
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 25/36 (69%), Positives = 29/36 (80%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
+VVL S PD+ECGFSRDLF QWC P+NSII+T R
Sbjct: 274 KVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 309
>gi|8393762|ref|NP_058552.1| cleavage and polyadenylation specificity factor subunit 2 [Mus
musculus]
gi|18202027|sp|O35218.1|CPSF2_MOUSE RecName: Full=Cleavage and polyadenylation specificity factor
subunit 2; AltName: Full=Cleavage and polyadenylation
specificity factor 100 kDa subunit; Short=CPSF 100 kDa
subunit
gi|2331036|gb|AAB66830.1| cleavage and polyadenylation specificity factor [Mus musculus]
gi|15489017|gb|AAH13628.1| Cleavage and polyadenylation specific factor 2 [Mus musculus]
gi|148686924|gb|EDL18871.1| cleavage and polyadenylation specific factor 2 [Mus musculus]
Length = 782
Score = 161 bits (408), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 69/111 (62%), Positives = 91/111 (81%)
Query: 32 AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
+++EWMSDKLM+ FE RNNPF F+H+ LCH L++LA+VPSPKVVL S PD+ECGFSRDL
Sbjct: 282 SQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDL 341
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEEL 142
F QWC P+NSII+T RT+PGTLAR LI+ + ++++KR++LEG+EL
Sbjct: 342 FIQWCQDPKNSIILTYRTTPGTLARFLIDNPTEKVTEIELRKRVKLEGKEL 392
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 41/116 (35%), Positives = 57/116 (49%), Gaps = 40/116 (34%)
Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
++P H + F+NE +LSDFKQ L R GI EF+ GVL+ C VAVRR
Sbjct: 707 EVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNQVAVRR------------- 752
Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
+++ I LEGCL ++YR++ LLY+QYAI+
Sbjct: 753 --------------------------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 25/36 (69%), Positives = 29/36 (80%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
+VVL S PD+ECGFSRDLF QWC P+NSII+T R
Sbjct: 324 KVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 359
>gi|195392300|ref|XP_002054797.1| GJ24636 [Drosophila virilis]
gi|194152883|gb|EDW68317.1| GJ24636 [Drosophila virilis]
Length = 693
Score = 161 bits (408), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 76/130 (58%), Positives = 99/130 (76%), Gaps = 4/130 (3%)
Query: 24 LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPS-PKVVLVST 80
+ +++ E AK IEWMSDKL K+FEGARNNPF FKH+ LCH+LA++ K+P+ PKVVL ST
Sbjct: 272 VSYNVIEFAKSQIEWMSDKLTKAFEGARNNPFQFKHIHLCHTLADIYKLPAGPKVVLAST 331
Query: 81 PDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGG-NRTLTLQVKKRIRLEG 139
PDME GF+RDLF QW S+P NSII T RT PG+L+ +L+E R + L V++R+ LEG
Sbjct: 332 PDMESGFTRDLFVQWASNPNNSIIFTTRTGPGSLSMELVENSTPGRQIELDVRRRVELEG 391
Query: 140 EELEEYQKKK 149
ELEEY + +
Sbjct: 392 AELEEYLRTQ 401
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 45/119 (37%), Positives = 65/119 (54%), Gaps = 39/119 (32%)
Query: 158 EKEKIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDME 217
E+++IP H++ INEL+LSDFKQ L RN I+ EF GVL C GT+A+RRV
Sbjct: 614 EEDEIPVHNSVLINELKLSDFKQVLMRNNINSEFSGGVLWCSNGTLALRRV--------- 664
Query: 218 CGFSRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
D+ + +EGCLS++YY++++LLY+QYAI+
Sbjct: 665 ------------------------------DAGKVAMEGCLSEDYYKIRELLYEQYAIV 693
Score = 60.1 bits (144), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 25/35 (71%), Positives = 28/35 (80%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 241
+VVL STPDME GF+RDLF QW S+P NSII T R
Sbjct: 325 KVVLASTPDMESGFTRDLFVQWASNPNNSIIFTTR 359
>gi|195109795|ref|XP_001999467.1| GI23051 [Drosophila mojavensis]
gi|193916061|gb|EDW14928.1| GI23051 [Drosophila mojavensis]
Length = 754
Score = 161 bits (408), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 75/130 (57%), Positives = 98/130 (75%), Gaps = 4/130 (3%)
Query: 24 LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPS-PKVVLVST 80
+ +++ E AK IEWMSDKL K+FEGARNNPF FKH+ LCH+LA++ K+P+ PKVVL ST
Sbjct: 272 VSYNVIEFAKSQIEWMSDKLTKAFEGARNNPFQFKHINLCHTLADIYKLPAGPKVVLAST 331
Query: 81 PDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIE-LGGNRTLTLQVKKRIRLEG 139
PDME GF+RDLF QW +P NSII T RT PG+L+ DL+E R + L +++R+ LEG
Sbjct: 332 PDMESGFTRDLFVQWAGNPNNSIIFTTRTGPGSLSMDLVENYSPGRQIELDLRRRVELEG 391
Query: 140 EELEEYQKKK 149
ELEEY + +
Sbjct: 392 AELEEYLRTQ 401
Score = 88.6 bits (218), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 46/119 (38%), Positives = 65/119 (54%), Gaps = 39/119 (32%)
Query: 158 EKEKIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDME 217
E+++IP H++ INEL+LSDFKQ L RN I+ EF GVL CC GT+A+RRV
Sbjct: 675 EEDEIPVHNSVLINELKLSDFKQVLMRNNINSEFSGGVLWCCNGTLALRRV--------- 725
Query: 218 CGFSRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
D + +EGCLS++YY++++LLY+QYAI+
Sbjct: 726 ------------------------------DVGKVAMEGCLSEDYYKIRELLYEQYAIV 754
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 24/35 (68%), Positives = 27/35 (77%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 241
+VVL STPDME GF+RDLF QW +P NSII T R
Sbjct: 325 KVVLASTPDMESGFTRDLFVQWAGNPNNSIIFTTR 359
>gi|194745794|ref|XP_001955372.1| GF16269 [Drosophila ananassae]
gi|190628409|gb|EDV43933.1| GF16269 [Drosophila ananassae]
Length = 756
Score = 161 bits (407), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 76/130 (58%), Positives = 101/130 (77%), Gaps = 4/130 (3%)
Query: 24 LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPS-PKVVLVST 80
+ +++ E AK IEWMSDKL K+FEGARNNPF FKH++LCHSLA++ K+P+ PKVVL ST
Sbjct: 272 VSYNVIEFAKSQIEWMSDKLTKAFEGARNNPFQFKHIQLCHSLADIYKLPAGPKVVLAST 331
Query: 81 PDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIE-LGGNRTLTLQVKKRIRLEG 139
PD+E GF+RDLF QW S+ NSII+T RTSPGTLA +L+E R + L +++R+ LEG
Sbjct: 332 PDLESGFTRDLFVQWASNSNNSIILTTRTSPGTLAMELVENCTPGRQIELDIRRRVELEG 391
Query: 140 EELEEYQKKK 149
EL+EY + +
Sbjct: 392 AELDEYLRTQ 401
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 45/118 (38%), Positives = 64/118 (54%), Gaps = 39/118 (33%)
Query: 159 KEKIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMEC 218
+++IP H++ INEL+LSDFKQ L RN I+ EF GVL C GT+A+RRV
Sbjct: 678 EDEIPVHNSVLINELKLSDFKQILMRNNINSEFSGGVLWCSNGTLALRRV---------- 727
Query: 219 GFSRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
D+ + +EGCLS+EYY++++LLY+QYAI+
Sbjct: 728 -----------------------------DAGKVAMEGCLSEEYYKIRELLYEQYAIV 756
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 31/77 (40%), Positives = 38/77 (49%), Gaps = 17/77 (22%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRNRADDSDSNV--------------- 251
+VVL STPD+E GF+RDLF QW S+ NSII+T R +
Sbjct: 325 KVVLASTPDLESGFTRDLFVQWASNSNNSIILTTRTSPGTLAMELVENCTPGRQIELDIR 384
Query: 252 --IVLEGCLSDEYYRVQ 266
+ LEG DEY R Q
Sbjct: 385 RRVELEGAELDEYLRTQ 401
>gi|344253621|gb|EGW09725.1| Sodium/potassium/calcium exchanger 4 [Cricetulus griseus]
Length = 1206
Score = 161 bits (407), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 69/111 (62%), Positives = 91/111 (81%)
Query: 32 AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
+++EWMSDKLM+ FE RNNPF F+H+ LCH L++LA+VPSPKVVL S PD+ECGFSRDL
Sbjct: 200 SQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDL 259
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEEL 142
F QWC P+NSII+T RT+PGTLAR LI+ + ++++KR++LEG+EL
Sbjct: 260 FIQWCQDPKNSIILTYRTTPGTLARFLIDNPSEKVTEIELRKRVKLEGKEL 310
Score = 61.6 bits (148), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 25/36 (69%), Positives = 29/36 (80%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
+VVL S PD+ECGFSRDLF QWC P+NSII+T R
Sbjct: 242 KVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 277
>gi|74183852|dbj|BAE24504.1| unnamed protein product [Mus musculus]
Length = 493
Score = 161 bits (407), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 69/111 (62%), Positives = 91/111 (81%)
Query: 32 AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
+++EWMSDKLM+ FE RNNPF F+H+ LCH L++LA+VPSPKVVL S PD+ECGFSRDL
Sbjct: 282 SQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDL 341
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEEL 142
F QWC P+NSII+T RT+PGTLAR LI+ + ++++KR++LEG+EL
Sbjct: 342 FIQWCQDPKNSIILTYRTTPGTLARFLIDNPTEKVTEIELRKRVKLEGKEL 392
Score = 62.0 bits (149), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 25/36 (69%), Positives = 29/36 (80%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
+VVL S PD+ECGFSRDLF QWC P+NSII+T R
Sbjct: 324 KVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 359
>gi|74188762|dbj|BAE28111.1| unnamed protein product [Mus musculus]
Length = 412
Score = 160 bits (406), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 69/111 (62%), Positives = 91/111 (81%)
Query: 32 AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
+++EWMSDKLM+ FE RNNPF F+H+ LCH L++LA+VPSPKVVL S PD+ECGFSRDL
Sbjct: 282 SQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDL 341
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEEL 142
F QWC P+NSII+T RT+PGTLAR LI+ + ++++KR++LEG+EL
Sbjct: 342 FIQWCQDPKNSIILTYRTTPGTLARFLIDNPTEKVTEIELRKRVKLEGKEL 392
Score = 62.0 bits (149), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 25/36 (69%), Positives = 29/36 (80%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
+VVL S PD+ECGFSRDLF QWC P+NSII+T R
Sbjct: 324 KVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 359
>gi|432115811|gb|ELK36959.1| Cleavage and polyadenylation specificity factor subunit 2 [Myotis
davidii]
Length = 687
Score = 160 bits (406), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 72/124 (58%), Positives = 94/124 (75%), Gaps = 10/124 (8%)
Query: 32 AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
+++EWMSDKLM+ FE RNNPF F+H+ LCH L++LA+VPSPKVVL S PD+ECGFSRDL
Sbjct: 200 SQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDL 259
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLIELG----------GNRTLTLQVKKRIRLEGEE 141
F QWC P+NSII+T RT+PGTLAR LI+ + ++++KR++LEG+E
Sbjct: 260 FIQWCEDPKNSIILTYRTTPGTLARFLIDNPLPHPSPSLHFAEKVTEIELRKRVKLEGKE 319
Query: 142 LEEY 145
LEEY
Sbjct: 320 LEEY 323
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 26/40 (65%), Positives = 30/40 (75%)
Query: 203 VAVRRVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
V +VVL S PD+ECGFSRDLF QWC P+NSII+T R
Sbjct: 238 VPSPKVVLASQPDLECGFSRDLFIQWCEDPKNSIILTYRT 277
Score = 45.4 bits (106), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 21/47 (44%), Positives = 28/47 (59%)
Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRR 207
++P H + F+NE +L DFKQ L R I EF+ GVL+C RR
Sbjct: 635 EVPGHQSVFMNEPRLFDFKQVLLREWIQAEFVGGVLVCNNQISVCRR 681
>gi|195574631|ref|XP_002105288.1| GD21403 [Drosophila simulans]
gi|194201215|gb|EDX14791.1| GD21403 [Drosophila simulans]
Length = 664
Score = 160 bits (406), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 76/126 (60%), Positives = 98/126 (77%), Gaps = 4/126 (3%)
Query: 24 LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPS-PKVVLVST 80
+ +++ E AK IEWMSDKL K+FEGARNNPF FKH++LCHSLA++ +P+ PKVVL ST
Sbjct: 180 VSYNVIEFAKSQIEWMSDKLTKAFEGARNNPFQFKHIQLCHSLADVYNLPAGPKVVLAST 239
Query: 81 PDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIE-LGGNRTLTLQVKKRIRLEG 139
PD+E GF+RDLF QW S+ NSII+T RTSPGTLA +L+E + + L V++R+ LEG
Sbjct: 240 PDLESGFTRDLFVQWASNANNSIILTTRTSPGTLAMELVENCAPGKQIELDVRRRVDLEG 299
Query: 140 EELEEY 145
ELEEY
Sbjct: 300 AELEEY 305
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 46/117 (39%), Positives = 64/117 (54%), Gaps = 39/117 (33%)
Query: 160 EKIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECG 219
++IP H++ INEL+LSDFKQTL RN I+ EF GVL C GT+A+RRV
Sbjct: 587 DEIPIHNSVLINELKLSDFKQTLMRNNINSEFSGGVLWCSNGTLALRRV----------- 635
Query: 220 FSRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
D+ + +EGCLS+EYY++++LLY+QYAI+
Sbjct: 636 ----------------------------DAGKVAMEGCLSEEYYKIRELLYEQYAIV 664
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 23/35 (65%), Positives = 28/35 (80%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 241
+VVL STPD+E GF+RDLF QW S+ NSII+T R
Sbjct: 233 KVVLASTPDLESGFTRDLFVQWASNANNSIILTTR 267
>gi|348517622|ref|XP_003446332.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
2-like [Oreochromis niloticus]
Length = 787
Score = 160 bits (405), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 71/108 (65%), Positives = 90/108 (83%)
Query: 32 AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
+++EWMSDKLM+ FE RNNPF F+H+ LCHSLA+LA+VPSPKVVL S PD+E GFSR+L
Sbjct: 282 SQVEWMSDKLMRCFEDKRNNPFQFRHLTLCHSLADLARVPSPKVVLCSQPDLESGFSREL 341
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEG 139
F QWC + +NSII+T RT+PGTLAR LI+ G + L L+VKKR++LEG
Sbjct: 342 FIQWCQNAKNSIILTYRTTPGTLARYLIDNPGEKMLDLEVKKRVKLEG 389
Score = 72.0 bits (175), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 42/114 (36%), Positives = 56/114 (49%), Gaps = 40/114 (35%)
Query: 163 PPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGFSR 222
P H + FINE +LSDFKQ L R GI EF+ GVL+ C VAVRR
Sbjct: 714 PGHQSVFINEPRLSDFKQVLLREGIQAEFVGGVLV-CNNMVAVRR--------------- 757
Query: 223 DLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
+++ I LEGCL D+YY++++LLY QYA++
Sbjct: 758 ------------------------TEAGRIGLEGCLCDDYYKIRELLYQQYAVV 787
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 22/36 (61%), Positives = 28/36 (77%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
+VVL S PD+E GFSR+LF QWC + +NSII+T R
Sbjct: 324 KVVLCSQPDLESGFSRELFIQWCQNAKNSIILTYRT 359
>gi|147901518|ref|NP_001081123.1| cleavage and polyadenylation specificity factor subunit 2 [Xenopus
laevis]
gi|18203567|sp|Q9W799.1|CPSF2_XENLA RecName: Full=Cleavage and polyadenylation specificity factor
subunit 2; AltName: Full=Cleavage and polyadenylation
specificity factor 100 kDa subunit; Short=CPSF 100 kDa
subunit
gi|4927240|gb|AAD33061.1|AF139986_1 cleavage and polyadenylation specificity factor 100 kDa subunit
[Xenopus laevis]
Length = 783
Score = 160 bits (405), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 67/111 (60%), Positives = 91/111 (81%)
Query: 32 AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
+++EWMSDKLM+ FE RNNPF F+H+ LCH ++LA+VPSPKVVL S PD+ECGFSR+L
Sbjct: 282 SQVEWMSDKLMRCFEDKRNNPFQFRHLTLCHGYSDLARVPSPKVVLASQPDLECGFSREL 341
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEEL 142
F QWC P+NS+I+T RT+PGTLAR LI+ R + ++++KR++LEG+EL
Sbjct: 342 FIQWCQDPKNSVILTYRTTPGTLARFLIDHPSERIIDIELRKRVKLEGKEL 392
Score = 67.8 bits (164), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 39/116 (33%), Positives = 59/116 (50%), Gaps = 40/116 (34%)
Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
++P H + F+NE +LSDFKQ L R GI EF+ GVL+ C VAVRR
Sbjct: 708 EVPGHQSVFMNEPRLSDFKQVLLREGIHAEFVGGVLV-CNNMVAVRR------------- 753
Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
+++ I LEGCL ++++++++LLY+QYAI+
Sbjct: 754 --------------------------TETGRIGLEGCLCEDFFKIRELLYEQYAIV 783
Score = 60.1 bits (144), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 27/52 (51%), Positives = 33/52 (63%), Gaps = 5/52 (9%)
Query: 196 LICCRGTVAVRRV-----VLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
L C G + RV VL S PD+ECGFSR+LF QWC P+NS+I+T R
Sbjct: 308 LTLCHGYSDLARVPSPKVVLASQPDLECGFSRELFIQWCQDPKNSVILTYRT 359
>gi|195054718|ref|XP_001994270.1| GH10247 [Drosophila grimshawi]
gi|193896140|gb|EDV95006.1| GH10247 [Drosophila grimshawi]
Length = 754
Score = 160 bits (404), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 76/130 (58%), Positives = 97/130 (74%), Gaps = 4/130 (3%)
Query: 24 LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVP-SPKVVLVST 80
+ +++ E AK IEWMSDKL K+FEGARNNPF FKH+ LCH+LA++ K+P PKVVL ST
Sbjct: 272 VSYNVIEFAKSQIEWMSDKLTKAFEGARNNPFQFKHINLCHTLADVYKLPVGPKVVLAST 331
Query: 81 PDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELG-GNRTLTLQVKKRIRLEG 139
PDME GF+RDLF QW +P NSII T RT PG+L+ +L+E R L L V++R+ LEG
Sbjct: 332 PDMESGFTRDLFVQWAGNPNNSIIFTTRTGPGSLSMELVENSVPGRQLELDVRRRVELEG 391
Query: 140 EELEEYQKKK 149
ELEEY + +
Sbjct: 392 AELEEYLRTQ 401
Score = 88.6 bits (218), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 45/118 (38%), Positives = 65/118 (55%), Gaps = 39/118 (33%)
Query: 159 KEKIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMEC 218
+++IP H++ INEL+LSDFKQ L RN I+ EF GVL CC GT+A+RRV
Sbjct: 676 EDEIPVHNSVLINELKLSDFKQVLMRNSINSEFSGGVLWCCNGTLALRRV---------- 725
Query: 219 GFSRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
D+ + +EGC+S+EYY++++LLY+QYAI+
Sbjct: 726 -----------------------------DTGKVAMEGCISEEYYKIRELLYEQYAIV 754
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 24/35 (68%), Positives = 27/35 (77%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 241
+VVL STPDME GF+RDLF QW +P NSII T R
Sbjct: 325 KVVLASTPDMESGFTRDLFVQWAGNPNNSIIFTTR 359
>gi|327259138|ref|XP_003214395.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
2-like [Anolis carolinensis]
Length = 783
Score = 159 bits (403), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 66/108 (61%), Positives = 89/108 (82%)
Query: 32 AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
+++EWMSDKLM+ FE RNNPF F+H+ LCH L++LA+VPSPKVVL S PD++CGFSRDL
Sbjct: 282 SQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLDCGFSRDL 341
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEG 139
F QWC P+NSII+T RT+PGTLAR LI+ + + ++++KR++LEG
Sbjct: 342 FIQWCQDPKNSIILTYRTTPGTLARFLIDNSSEKVIDMELRKRVKLEG 389
Score = 68.9 bits (167), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 40/116 (34%), Positives = 58/116 (50%), Gaps = 40/116 (34%)
Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
++P H + F+NE +LSDFKQ L R GI EF+ GVL+ C VAVRR
Sbjct: 708 EVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNLVAVRR------------- 753
Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
+++ I LEGCL +++Y+++ LLY+QYAI+
Sbjct: 754 --------------------------TETGRIGLEGCLCEDFYKIRDLLYEQYAIV 783
Score = 61.2 bits (147), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 29/57 (50%), Positives = 36/57 (63%), Gaps = 8/57 (14%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRNRA--------DDSDSNVIVLE 255
+VVL S PD++CGFSRDLF QWC P+NSII+T R D+S VI +E
Sbjct: 324 KVVLASQPDLDCGFSRDLFIQWCQDPKNSIILTYRTTPGTLARFLIDNSSEKVIDME 380
>gi|387015290|gb|AFJ49764.1| Cleavage and polyadenylation specificity factor subunit 2-like
[Crotalus adamanteus]
Length = 783
Score = 159 bits (402), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 67/108 (62%), Positives = 88/108 (81%)
Query: 32 AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
+++EWMSDKLM+ FE RNNPF F+H+ LCH LA+LA+VPSPKVVL S PD++CGFSRDL
Sbjct: 282 SQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLADLARVPSPKVVLASQPDLDCGFSRDL 341
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEG 139
F QWC P+NSII+T RT+PGTLAR LI+ + + ++ +KR++LEG
Sbjct: 342 FIQWCQDPKNSIILTYRTTPGTLARFLIDNPSEKVIDIEFRKRVKLEG 389
Score = 68.2 bits (165), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 39/116 (33%), Positives = 58/116 (50%), Gaps = 40/116 (34%)
Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
++P H + F+NE +LSDFKQ L R G+ EF+ GVL+ C VAVRR
Sbjct: 708 EVPGHQSVFMNEPRLSDFKQVLLREGVQAEFVGGVLV-CNNLVAVRR------------- 753
Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
+++ I LEGCL +++Y+++ LLY+QYAI+
Sbjct: 754 --------------------------TETGRIGLEGCLCEDFYKIRDLLYEQYAIV 783
Score = 61.6 bits (148), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 28/52 (53%), Positives = 33/52 (63%), Gaps = 5/52 (9%)
Query: 196 LICCRGTVAVRRV-----VLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
L C G + RV VL S PD++CGFSRDLF QWC P+NSII+T R
Sbjct: 308 LSLCHGLADLARVPSPKVVLASQPDLDCGFSRDLFIQWCQDPKNSIILTYRT 359
>gi|7243115|dbj|BAA92605.1| KIAA1367 protein [Homo sapiens]
Length = 579
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 69/118 (58%), Positives = 93/118 (78%), Gaps = 2/118 (1%)
Query: 24 LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTP 81
+ +++ E +K +EWMSDKLM+ FE RNNPF F+H+ LCH L++LA+VPSPKVVL S P
Sbjct: 69 VSYNVVEFSKSQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQP 128
Query: 82 DMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEG 139
D+ECGFSRDLF QWC P+NSII+T RT+PGTLAR LI+ + ++++KR++LEG
Sbjct: 129 DLECGFSRDLFIQWCQDPKNSIILTYRTTPGTLARFLIDNPSEKITEIELRKRVKLEG 186
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 41/116 (35%), Positives = 57/116 (49%), Gaps = 40/116 (34%)
Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
++P H + F+NE +LSDFKQ L R GI EF+ GVL+ C VAVRR
Sbjct: 504 EVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNQVAVRR------------- 549
Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
+++ I LEGCL ++YR++ LLY+QYAI+
Sbjct: 550 --------------------------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 579
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 25/36 (69%), Positives = 29/36 (80%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
+VVL S PD+ECGFSRDLF QWC P+NSII+T R
Sbjct: 121 KVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 156
>gi|67968123|dbj|BAE00542.1| unnamed protein product [Macaca fascicularis]
Length = 592
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 69/118 (58%), Positives = 93/118 (78%), Gaps = 2/118 (1%)
Query: 24 LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTP 81
+ +++ E +K +EWMSDKLM+ FE RNNPF F+H+ LCH L++LA+VPSPKVVL S P
Sbjct: 82 VSYNVVEFSKSQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQP 141
Query: 82 DMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEG 139
D+ECGFSRDLF QWC P+NSII+T RT+PGTLAR LI+ + ++++KR++LEG
Sbjct: 142 DLECGFSRDLFIQWCQDPKNSIILTYRTTPGTLARFLIDNPSEKITEIELRKRVKLEG 199
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 41/116 (35%), Positives = 57/116 (49%), Gaps = 40/116 (34%)
Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
++P H + F+NE +LSDFKQ L R GI EF+ GVL+ C VAVRR
Sbjct: 517 EVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNQVAVRR------------- 562
Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
+++ I LEGCL ++YR++ LLY+QYAI+
Sbjct: 563 --------------------------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 592
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 26/40 (65%), Positives = 30/40 (75%)
Query: 203 VAVRRVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
V +VVL S PD+ECGFSRDLF QWC P+NSII+T R
Sbjct: 130 VPSPKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 169
>gi|291406601|ref|XP_002719640.1| PREDICTED: cleavage and polyadenylation specific factor 2
[Oryctolagus cuniculus]
Length = 782
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 67/108 (62%), Positives = 88/108 (81%)
Query: 32 AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
+++EWMSDKLM+ FE RNNPF F+H+ LCH L++LA+VPSPKVVL S PD+ECGFSRDL
Sbjct: 282 SQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDL 341
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEG 139
F QWC P+NSII+T RT+PGTLAR LI+ + ++++KR++LEG
Sbjct: 342 FIQWCQDPKNSIILTYRTTPGTLARFLIDNPSEKVTEIELRKRVKLEG 389
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 41/116 (35%), Positives = 57/116 (49%), Gaps = 40/116 (34%)
Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
++P H + F+NE +LSDFKQ L R GI EF+ GVL+ C VAVRR
Sbjct: 707 EVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNQVAVRR------------- 752
Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
+++ I LEGCL ++YR++ LLY+QYAI+
Sbjct: 753 --------------------------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 25/36 (69%), Positives = 29/36 (80%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
+VVL S PD+ECGFSRDLF QWC P+NSII+T R
Sbjct: 324 KVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 359
>gi|28461235|ref|NP_787002.1| cleavage and polyadenylation specificity factor subunit 2 [Bos
taurus]
gi|426248504|ref|XP_004018003.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
2 [Ovis aries]
gi|1706103|sp|Q10568.1|CPSF2_BOVIN RecName: Full=Cleavage and polyadenylation specificity factor
subunit 2; AltName: Full=Cleavage and polyadenylation
specificity factor 100 kDa subunit; Short=CPSF 100 kDa
subunit
gi|599683|emb|CAA53535.1| Cleavage and Polyadenylation specificity factor (CPSF) 100kD
subunit [Bos taurus]
gi|296475169|tpg|DAA17284.1| TPA: cleavage and polyadenylation specificity factor subunit 2 [Bos
taurus]
gi|440892550|gb|ELR45701.1| Cleavage and polyadenylation specificity factor subunit 2 [Bos
grunniens mutus]
Length = 782
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 67/108 (62%), Positives = 88/108 (81%)
Query: 32 AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
+++EWMSDKLM+ FE RNNPF F+H+ LCH L++LA+VPSPKVVL S PD+ECGFSRDL
Sbjct: 282 SQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDL 341
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEG 139
F QWC P+NSII+T RT+PGTLAR LI+ + ++++KR++LEG
Sbjct: 342 FIQWCQDPKNSIILTYRTTPGTLARFLIDNPSEKVTEIELRKRVKLEG 389
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 41/116 (35%), Positives = 57/116 (49%), Gaps = 40/116 (34%)
Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
++P H + F+NE +LSDFKQ L R GI EF+ GVL+ C VAVRR
Sbjct: 707 EVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNQVAVRR------------- 752
Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
+++ I LEGCL ++YR++ LLY+QYAI+
Sbjct: 753 --------------------------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 25/36 (69%), Positives = 29/36 (80%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
+VVL S PD+ECGFSRDLF QWC P+NSII+T R
Sbjct: 324 KVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 359
>gi|426377790|ref|XP_004055637.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
2 [Gorilla gorilla gorilla]
gi|193785772|dbj|BAG51207.1| unnamed protein product [Homo sapiens]
Length = 690
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 69/118 (58%), Positives = 93/118 (78%), Gaps = 2/118 (1%)
Query: 24 LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTP 81
+ +++ E +K +EWMSDKLM+ FE RNNPF F+H+ LCH L++LA+VPSPKVVL S P
Sbjct: 180 VSYNVVEFSKSQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQP 239
Query: 82 DMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEG 139
D+ECGFSRDLF QWC P+NSII+T RT+PGTLAR LI+ + ++++KR++LEG
Sbjct: 240 DLECGFSRDLFIQWCQDPKNSIILTYRTTPGTLARFLIDNPSEKITEIELRKRVKLEG 297
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 41/116 (35%), Positives = 57/116 (49%), Gaps = 40/116 (34%)
Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
++P H + F+NE +LSDFKQ L R GI EF+ GVL+ C VAVRR
Sbjct: 615 EVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNQVAVRR------------- 660
Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
+++ I LEGCL ++YR++ LLY+QYAI+
Sbjct: 661 --------------------------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 690
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 25/36 (69%), Positives = 29/36 (80%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
+VVL S PD+ECGFSRDLF QWC P+NSII+T R
Sbjct: 232 KVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 267
>gi|10241720|emb|CAC09445.1| hypothetical protein [Homo sapiens]
Length = 504
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 67/108 (62%), Positives = 88/108 (81%)
Query: 32 AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
+++EWMSDKLM+ FE RNNPF F+H+ LCH L++LA+VPSPKVVL S PD+ECGFSRDL
Sbjct: 4 SQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDL 63
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEG 139
F QWC P+NSII+T RT+PGTLAR LI+ + ++++KR++LEG
Sbjct: 64 FIQWCQDPKNSIILTYRTTPGTLARFLIDNPSEKITEIELRKRVKLEG 111
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 41/116 (35%), Positives = 57/116 (49%), Gaps = 40/116 (34%)
Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
++P H + F+NE +LSDFKQ L R GI EF+ GVL+ C VAVRR
Sbjct: 429 EVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNQVAVRR------------- 474
Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
+++ I LEGCL ++YR++ LLY+QYAI+
Sbjct: 475 --------------------------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 504
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 26/40 (65%), Positives = 30/40 (75%)
Query: 203 VAVRRVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
V +VVL S PD+ECGFSRDLF QWC P+NSII+T R
Sbjct: 42 VPSPKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 81
>gi|410962841|ref|XP_003987977.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
2 [Felis catus]
Length = 690
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 69/118 (58%), Positives = 93/118 (78%), Gaps = 2/118 (1%)
Query: 24 LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTP 81
+ +++ E +K +EWMSDKLM+ FE RNNPF F+H+ LCH L++LA+VPSPKVVL S P
Sbjct: 180 VSYNVVEFSKSQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQP 239
Query: 82 DMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEG 139
D+ECGFSRDLF QWC P+NSII+T RT+PGTLAR LI+ + ++++KR++LEG
Sbjct: 240 DLECGFSRDLFIQWCQDPKNSIILTYRTTPGTLARFLIDNPSEKITEIELRKRVKLEG 297
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 41/116 (35%), Positives = 57/116 (49%), Gaps = 40/116 (34%)
Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
++P H + F+NE +LSDFKQ L R GI EF+ GVL+ C VAVRR
Sbjct: 615 EVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNQVAVRR------------- 660
Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
+++ I LEGCL ++YR++ LLY+QYAI+
Sbjct: 661 --------------------------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 690
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 25/36 (69%), Positives = 29/36 (80%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
+VVL S PD+ECGFSRDLF QWC P+NSII+T R
Sbjct: 232 KVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 267
>gi|432944969|ref|XP_004083472.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
2-like [Oryzias latipes]
Length = 787
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 69/108 (63%), Positives = 89/108 (82%)
Query: 32 AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
+++EWMSDKLM+ FE RNNPF F+H+ LCHSLA+LA+VPSPKVVL S PD+E GFSR+L
Sbjct: 282 SQVEWMSDKLMRCFEDKRNNPFQFRHLNLCHSLADLARVPSPKVVLCSQPDLESGFSREL 341
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEG 139
F QWC + +NSII+T RT+PGTL R LI+ G + L L+V+KR++LEG
Sbjct: 342 FIQWCQNSKNSIILTYRTTPGTLGRYLIDHPGEKMLDLEVRKRVKLEG 389
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 43/116 (37%), Positives = 57/116 (49%), Gaps = 40/116 (34%)
Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
+IP H FINE +LSDFKQ L R GI EF+ GVL+ C VAVRR
Sbjct: 712 EIPGHQAVFINEPRLSDFKQVLLREGIQAEFVGGVLV-CNNMVAVRR------------- 757
Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
+++ I LEGCL D+YY++++LLY QYA++
Sbjct: 758 --------------------------TEAGRIGLEGCLCDDYYKIRELLYQQYAVV 787
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 22/36 (61%), Positives = 28/36 (77%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
+VVL S PD+E GFSR+LF QWC + +NSII+T R
Sbjct: 324 KVVLCSQPDLESGFSRELFIQWCQNSKNSIILTYRT 359
>gi|193786016|dbj|BAG50992.1| unnamed protein product [Homo sapiens]
Length = 644
Score = 158 bits (400), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 67/108 (62%), Positives = 88/108 (81%)
Query: 32 AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
+++EWMSDKLM+ FE RNNPF F+H+ LCH L++LA+VPSPKVVL S PD+ECGFSRDL
Sbjct: 144 SQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDL 203
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEG 139
F QWC P+NSII+T RT+PGTLAR LI+ + ++++KR++LEG
Sbjct: 204 FIQWCQDPKNSIILTYRTTPGTLARFLIDNPSEKITEIELRKRVKLEG 251
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 41/116 (35%), Positives = 57/116 (49%), Gaps = 40/116 (34%)
Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
++P H + F+NE +LSDFKQ L R GI EF+ GVL+ C VAVRR
Sbjct: 569 EVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNQVAVRR------------- 614
Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
+++ I LEGCL ++YR++ LLY+QYAI+
Sbjct: 615 --------------------------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 644
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 26/40 (65%), Positives = 30/40 (75%)
Query: 203 VAVRRVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
V +VVL S PD+ECGFSRDLF QWC P+NSII+T R
Sbjct: 182 VPSPKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 221
>gi|403298151|ref|XP_003939898.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
2 isoform 2 [Saimiri boliviensis boliviensis]
Length = 648
Score = 158 bits (400), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 67/108 (62%), Positives = 88/108 (81%)
Query: 32 AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
+++EWMSDKLM+ FE RNNPF F+H+ LCH L++LA+VPSPKVVL S PD+ECGFSRDL
Sbjct: 148 SQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDL 207
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEG 139
F QWC P+NSII+T RT+PGTLAR LI+ + ++++KR++LEG
Sbjct: 208 FIQWCQDPKNSIILTYRTTPGTLARFLIDNPSEKITEIELRKRVKLEG 255
Score = 69.3 bits (168), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 41/116 (35%), Positives = 57/116 (49%), Gaps = 40/116 (34%)
Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
++P H + F+NE +LSDFKQ L R GI EF+ GVL+ C VAVRR
Sbjct: 573 EVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNQVAVRR------------- 618
Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
+++ I LEGCL ++YR++ LLY+QYAI+
Sbjct: 619 --------------------------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 648
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 26/40 (65%), Positives = 30/40 (75%)
Query: 203 VAVRRVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
V +VVL S PD+ECGFSRDLF QWC P+NSII+T R
Sbjct: 186 VPSPKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 225
>gi|297695726|ref|XP_002825082.1| PREDICTED: LOW QUALITY PROTEIN: cleavage and polyadenylation
specificity factor subunit 2 [Pongo abelii]
Length = 747
Score = 158 bits (400), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 69/118 (58%), Positives = 93/118 (78%), Gaps = 2/118 (1%)
Query: 24 LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTP 81
+ +++ E +K +EWMSDKLM+ FE RNNPF F+H+ LCH L++LA+VPSPKVVL S P
Sbjct: 237 VSYNVVEFSKSQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQP 296
Query: 82 DMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEG 139
D+ECGFSRDLF QWC P+NSII+T RT+PGTLAR LI+ + ++++KR++LEG
Sbjct: 297 DLECGFSRDLFIQWCQDPKNSIILTYRTTPGTLARFLIDNPSEKITEIELRKRVKLEG 354
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 41/116 (35%), Positives = 57/116 (49%), Gaps = 40/116 (34%)
Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
++P H + F+NE +LSDFKQ L R GI EF+ GVL+ C VAVRR
Sbjct: 672 EVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNQVAVRR------------- 717
Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
+++ I LEGCL ++YR++ LLY+QYAI+
Sbjct: 718 --------------------------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 747
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 26/40 (65%), Positives = 30/40 (75%)
Query: 203 VAVRRVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
V +VVL S PD+ECGFSRDLF QWC P+NSII+T R
Sbjct: 285 VPSPKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 324
>gi|431839217|gb|ELK01144.1| Cleavage and polyadenylation specificity factor subunit 2 [Pteropus
alecto]
Length = 782
Score = 158 bits (400), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 67/108 (62%), Positives = 88/108 (81%)
Query: 32 AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
+++EWMSDKLM+ FE RNNPF F+H+ LCH L++LA+VPSPKVVL S PD+ECGFSRDL
Sbjct: 282 SQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDL 341
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEG 139
F QWC P+NSII+T RT+PGTLAR LI+ + ++++KR++LEG
Sbjct: 342 FIQWCQDPKNSIILTYRTTPGTLARFLIDNPSEKITEIELRKRVKLEG 389
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 41/116 (35%), Positives = 57/116 (49%), Gaps = 40/116 (34%)
Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
++P H + F+NE +LSDFKQ L R GI EF+ GVL+ C VAVRR
Sbjct: 707 EVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNQVAVRR------------- 752
Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
+++ I LEGCL ++YR++ LLY+QYAI+
Sbjct: 753 --------------------------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 25/36 (69%), Positives = 29/36 (80%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
+VVL S PD+ECGFSRDLF QWC P+NSII+T R
Sbjct: 324 KVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 359
>gi|73962293|ref|XP_537353.2| PREDICTED: cleavage and polyadenylation specificity factor subunit
2 isoform 1 [Canis lupus familiaris]
Length = 782
Score = 158 bits (400), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 67/108 (62%), Positives = 88/108 (81%)
Query: 32 AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
+++EWMSDKLM+ FE RNNPF F+H+ LCH L++LA+VPSPKVVL S PD+ECGFSRDL
Sbjct: 282 SQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDL 341
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEG 139
F QWC P+NSII+T RT+PGTLAR LI+ + ++++KR++LEG
Sbjct: 342 FIQWCQDPKNSIILTYRTTPGTLARFLIDNPSEKITEIELRKRVKLEG 389
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 41/116 (35%), Positives = 57/116 (49%), Gaps = 40/116 (34%)
Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
++P H + F+NE +LSDFKQ L R GI EF+ GVL+ C VAVRR
Sbjct: 707 EVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNQVAVRR------------- 752
Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
+++ I LEGCL ++YR++ LLY+QYAI+
Sbjct: 753 --------------------------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 25/36 (69%), Positives = 29/36 (80%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
+VVL S PD+ECGFSRDLF QWC P+NSII+T R
Sbjct: 324 KVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 359
>gi|351699560|gb|EHB02479.1| Cleavage and polyadenylation specificity factor subunit 2
[Heterocephalus glaber]
Length = 782
Score = 158 bits (400), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 67/108 (62%), Positives = 88/108 (81%)
Query: 32 AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
+++EWMSDKLM+ FE RNNPF F+H+ LCH L++LA+VPSPKVVL S PD+ECGFSRDL
Sbjct: 282 SQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDL 341
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEG 139
F QWC P+NSII+T RT+PGTLAR LI+ + ++++KR++LEG
Sbjct: 342 FIQWCQDPKNSIILTYRTTPGTLARFLIDNPSEKITEIELRKRVKLEG 389
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 41/116 (35%), Positives = 57/116 (49%), Gaps = 40/116 (34%)
Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
++P H + F+NE +LSDFKQ L R GI EF+ GVL+ C VAVRR
Sbjct: 707 EVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNQVAVRR------------- 752
Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
+++ I LEGCL ++YR++ LLY+QYAI+
Sbjct: 753 --------------------------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 25/36 (69%), Positives = 29/36 (80%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
+VVL S PD+ECGFSRDLF QWC P+NSII+T R
Sbjct: 324 KVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 359
>gi|344274144|ref|XP_003408878.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
2 [Loxodonta africana]
Length = 782
Score = 158 bits (400), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 67/108 (62%), Positives = 88/108 (81%)
Query: 32 AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
+++EWMSDKLM+ FE RNNPF F+H+ LCH L++LA+VPSPKVVL S PD+ECGFSRDL
Sbjct: 282 SQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDL 341
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEG 139
F QWC P+NSII+T RT+PGTLAR LI+ + ++++KR++LEG
Sbjct: 342 FIQWCQDPKNSIILTYRTTPGTLARFLIDNPSEKITEIELRKRVKLEG 389
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 41/116 (35%), Positives = 57/116 (49%), Gaps = 40/116 (34%)
Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
++P H + F+NE +LSDFKQ L R GI EF+ GVL+ C VAVRR
Sbjct: 707 EVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNQVAVRR------------- 752
Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
+++ I LEGCL ++YR++ LLY+QYAI+
Sbjct: 753 --------------------------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 25/36 (69%), Positives = 29/36 (80%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
+VVL S PD+ECGFSRDLF QWC P+NSII+T R
Sbjct: 324 KVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 359
>gi|296215760|ref|XP_002754257.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
2 [Callithrix jacchus]
gi|403298149|ref|XP_003939897.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
2 isoform 1 [Saimiri boliviensis boliviensis]
Length = 782
Score = 158 bits (400), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 67/108 (62%), Positives = 88/108 (81%)
Query: 32 AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
+++EWMSDKLM+ FE RNNPF F+H+ LCH L++LA+VPSPKVVL S PD+ECGFSRDL
Sbjct: 282 SQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDL 341
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEG 139
F QWC P+NSII+T RT+PGTLAR LI+ + ++++KR++LEG
Sbjct: 342 FIQWCQDPKNSIILTYRTTPGTLARFLIDNPSEKITEIELRKRVKLEG 389
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 41/116 (35%), Positives = 57/116 (49%), Gaps = 40/116 (34%)
Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
++P H + F+NE +LSDFKQ L R GI EF+ GVL+ C VAVRR
Sbjct: 707 EVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNQVAVRR------------- 752
Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
+++ I LEGCL ++YR++ LLY+QYAI+
Sbjct: 753 --------------------------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 25/36 (69%), Positives = 29/36 (80%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
+VVL S PD+ECGFSRDLF QWC P+NSII+T R
Sbjct: 324 KVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 359
>gi|34101288|ref|NP_059133.1| cleavage and polyadenylation specificity factor subunit 2 [Homo
sapiens]
gi|114654441|ref|XP_001147277.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
2 isoform 3 [Pan troglodytes]
gi|397525769|ref|XP_003832826.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
2 [Pan paniscus]
gi|51338827|sp|Q9P2I0.2|CPSF2_HUMAN RecName: Full=Cleavage and polyadenylation specificity factor
subunit 2; AltName: Full=Cleavage and polyadenylation
specificity factor 100 kDa subunit; Short=CPSF 100 kDa
subunit
gi|119601886|gb|EAW81480.1| cleavage and polyadenylation specific factor 2, 100kDa, isoform
CRA_a [Homo sapiens]
gi|119601888|gb|EAW81482.1| cleavage and polyadenylation specific factor 2, 100kDa, isoform
CRA_a [Homo sapiens]
gi|193786082|dbj|BAG50953.1| unnamed protein product [Homo sapiens]
gi|410221574|gb|JAA08006.1| cleavage and polyadenylation specific factor 2, 100kDa [Pan
troglodytes]
gi|410221576|gb|JAA08007.1| cleavage and polyadenylation specific factor 2, 100kDa [Pan
troglodytes]
gi|410221578|gb|JAA08008.1| cleavage and polyadenylation specific factor 2, 100kDa [Pan
troglodytes]
gi|410252002|gb|JAA13968.1| cleavage and polyadenylation specific factor 2, 100kDa [Pan
troglodytes]
gi|410307320|gb|JAA32260.1| cleavage and polyadenylation specific factor 2, 100kDa [Pan
troglodytes]
gi|410307322|gb|JAA32261.1| cleavage and polyadenylation specific factor 2, 100kDa [Pan
troglodytes]
gi|410339303|gb|JAA38598.1| cleavage and polyadenylation specific factor 2, 100kDa [Pan
troglodytes]
gi|410339305|gb|JAA38599.1| cleavage and polyadenylation specific factor 2, 100kDa [Pan
troglodytes]
gi|410339307|gb|JAA38600.1| cleavage and polyadenylation specific factor 2, 100kDa [Pan
troglodytes]
gi|410339309|gb|JAA38601.1| cleavage and polyadenylation specific factor 2, 100kDa [Pan
troglodytes]
gi|410339311|gb|JAA38602.1| cleavage and polyadenylation specific factor 2, 100kDa [Pan
troglodytes]
Length = 782
Score = 158 bits (400), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 67/108 (62%), Positives = 88/108 (81%)
Query: 32 AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
+++EWMSDKLM+ FE RNNPF F+H+ LCH L++LA+VPSPKVVL S PD+ECGFSRDL
Sbjct: 282 SQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDL 341
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEG 139
F QWC P+NSII+T RT+PGTLAR LI+ + ++++KR++LEG
Sbjct: 342 FIQWCQDPKNSIILTYRTTPGTLARFLIDNPSEKITEIELRKRVKLEG 389
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 41/116 (35%), Positives = 57/116 (49%), Gaps = 40/116 (34%)
Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
++P H + F+NE +LSDFKQ L R GI EF+ GVL+ C VAVRR
Sbjct: 707 EVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNQVAVRR------------- 752
Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
+++ I LEGCL ++YR++ LLY+QYAI+
Sbjct: 753 --------------------------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 25/36 (69%), Positives = 29/36 (80%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
+VVL S PD+ECGFSRDLF QWC P+NSII+T R
Sbjct: 324 KVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 359
>gi|417404575|gb|JAA49034.1| Putative mrna cleavage and polyadenylation factor ii complex
subunit cft2 cpsf subunit [Desmodus rotundus]
Length = 782
Score = 158 bits (400), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 67/108 (62%), Positives = 88/108 (81%)
Query: 32 AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
+++EWMSDKLM+ FE RNNPF F+H+ LCH L++LA+VPSPKVVL S PD+ECGFSRDL
Sbjct: 282 SQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDL 341
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEG 139
F QWC P+NSII+T RT+PGTLAR LI+ + ++++KR++LEG
Sbjct: 342 FIQWCEDPKNSIILTYRTTPGTLARFLIDNPSEKITEIELRKRVKLEG 389
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 41/116 (35%), Positives = 57/116 (49%), Gaps = 40/116 (34%)
Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
++P H + F+NE +LSDFKQ L R GI EF+ GVL+ C VAVRR
Sbjct: 707 EVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNQVAVRR------------- 752
Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
+++ I LEGCL ++YR++ LLY+QYAI+
Sbjct: 753 --------------------------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 25/36 (69%), Positives = 29/36 (80%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
+VVL S PD+ECGFSRDLF QWC P+NSII+T R
Sbjct: 324 KVVLASQPDLECGFSRDLFIQWCEDPKNSIILTYRT 359
>gi|348553776|ref|XP_003462702.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
2-like [Cavia porcellus]
Length = 782
Score = 158 bits (400), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 67/108 (62%), Positives = 88/108 (81%)
Query: 32 AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
+++EWMSDKLM+ FE RNNPF F+H+ LCH L++LA+VPSPKVVL S PD+ECGFSRDL
Sbjct: 282 SQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDL 341
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEG 139
F QWC P+NSII+T RT+PGTLAR LI+ + ++++KR++LEG
Sbjct: 342 FIQWCQDPKNSIILTYRTTPGTLARFLIDNPSEKITEIELRKRVKLEG 389
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 41/116 (35%), Positives = 57/116 (49%), Gaps = 40/116 (34%)
Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
++P H + F+NE +LSDFKQ L R GI EF+ GVL+ C VAVRR
Sbjct: 707 EVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNQVAVRR------------- 752
Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
+++ I LEGCL ++YR++ LLY+QYAI+
Sbjct: 753 --------------------------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 25/36 (69%), Positives = 29/36 (80%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
+VVL S PD+ECGFSRDLF QWC P+NSII+T R
Sbjct: 324 KVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 359
>gi|383872268|ref|NP_001244509.1| cleavage and polyadenylation specificity factor subunit 2 [Macaca
mulatta]
gi|402876992|ref|XP_003902228.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
2 [Papio anubis]
gi|355693514|gb|EHH28117.1| hypothetical protein EGK_18472 [Macaca mulatta]
gi|355778801|gb|EHH63837.1| hypothetical protein EGM_16889 [Macaca fascicularis]
gi|380783537|gb|AFE63644.1| cleavage and polyadenylation specificity factor subunit 2 [Macaca
mulatta]
gi|383412079|gb|AFH29253.1| cleavage and polyadenylation specificity factor subunit 2 [Macaca
mulatta]
gi|384942144|gb|AFI34677.1| cleavage and polyadenylation specificity factor subunit 2 [Macaca
mulatta]
Length = 782
Score = 158 bits (400), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 67/108 (62%), Positives = 88/108 (81%)
Query: 32 AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
+++EWMSDKLM+ FE RNNPF F+H+ LCH L++LA+VPSPKVVL S PD+ECGFSRDL
Sbjct: 282 SQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDL 341
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEG 139
F QWC P+NSII+T RT+PGTLAR LI+ + ++++KR++LEG
Sbjct: 342 FIQWCQDPKNSIILTYRTTPGTLARFLIDNPSEKITEIELRKRVKLEG 389
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 41/116 (35%), Positives = 57/116 (49%), Gaps = 40/116 (34%)
Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
++P H + F+NE +LSDFKQ L R GI EF+ GVL+ C VAVRR
Sbjct: 707 EVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNQVAVRR------------- 752
Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
+++ I LEGCL ++YR++ LLY+QYAI+
Sbjct: 753 --------------------------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 25/36 (69%), Positives = 29/36 (80%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
+VVL S PD+ECGFSRDLF QWC P+NSII+T R
Sbjct: 324 KVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 359
>gi|456753050|gb|JAA74086.1| cleavage and polyadenylation specific factor 2, 100kDa [Sus scrofa]
Length = 782
Score = 158 bits (400), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 67/108 (62%), Positives = 88/108 (81%)
Query: 32 AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
+++EWMSDKLM+ FE RNNPF F+H+ LCH L++LA+VPSPKVVL S PD+ECGFSRDL
Sbjct: 282 SQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDL 341
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEG 139
F QWC P+NSII+T RT+PGTLAR LI+ + ++++KR++LEG
Sbjct: 342 FIQWCQDPKNSIILTYRTTPGTLARFLIDNPSEKITEIELRKRVKLEG 389
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 41/116 (35%), Positives = 57/116 (49%), Gaps = 40/116 (34%)
Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
++P H + F+NE +LSDFKQ L R GI EF+ GVL+ C VAVRR
Sbjct: 707 EVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNQVAVRR------------- 752
Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
+++ I LEGCL ++YR++ LLY+QYAI+
Sbjct: 753 --------------------------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 25/36 (69%), Positives = 29/36 (80%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
+VVL S PD+ECGFSRDLF QWC P+NSII+T R
Sbjct: 324 KVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 359
>gi|332223568|ref|XP_003260944.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
2 isoform 1 [Nomascus leucogenys]
Length = 782
Score = 158 bits (400), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 67/108 (62%), Positives = 88/108 (81%)
Query: 32 AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
+++EWMSDKLM+ FE RNNPF F+H+ LCH L++LA+VPSPKVVL S PD+ECGFSRDL
Sbjct: 282 SQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDL 341
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEG 139
F QWC P+NSII+T RT+PGTLAR LI+ + ++++KR++LEG
Sbjct: 342 FIQWCQDPKNSIILTYRTTPGTLARFLIDNPSEKITEIELRKRVKLEG 389
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 41/116 (35%), Positives = 57/116 (49%), Gaps = 40/116 (34%)
Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
++P H + F+NE +LSDFKQ L R GI EF+ GVL+ C VAVRR
Sbjct: 707 EVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNQVAVRR------------- 752
Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
+++ I LEGCL ++YR++ LLY+QYAI+
Sbjct: 753 --------------------------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 25/36 (69%), Positives = 29/36 (80%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
+VVL S PD+ECGFSRDLF QWC P+NSII+T R
Sbjct: 324 KVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 359
>gi|149737455|ref|XP_001497134.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
2-like isoform 1 [Equus caballus]
Length = 782
Score = 158 bits (400), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 67/108 (62%), Positives = 88/108 (81%)
Query: 32 AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
+++EWMSDKLM+ FE RNNPF F+H+ LCH L++LA+VPSPKVVL S PD+ECGFSRDL
Sbjct: 282 SQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDL 341
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEG 139
F QWC P+NSII+T RT+PGTLAR LI+ + ++++KR++LEG
Sbjct: 342 FIQWCQDPKNSIILTYRTTPGTLARFLIDNPSEKITEIELRKRVKLEG 389
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 41/116 (35%), Positives = 57/116 (49%), Gaps = 40/116 (34%)
Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
++P H + F+NE +LSDFKQ L R GI EF+ GVL+ C VAVRR
Sbjct: 707 EVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNQVAVRR------------- 752
Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
+++ I LEGCL ++YR++ LLY+QYAI+
Sbjct: 753 --------------------------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 25/36 (69%), Positives = 29/36 (80%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
+VVL S PD+ECGFSRDLF QWC P+NSII+T R
Sbjct: 324 KVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 359
>gi|149531954|ref|XP_001507374.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
2-like [Ornithorhynchus anatinus]
Length = 782
Score = 158 bits (400), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 67/108 (62%), Positives = 88/108 (81%)
Query: 32 AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
+++EWMSDKLM+ FE RNNPF F+H+ LCH L++LA+VPSPKVVL S PD+ECGFSRDL
Sbjct: 282 SQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDL 341
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEG 139
F QWC P+NSII+T RT+PGTLAR LI+ + ++++KR++LEG
Sbjct: 342 FIQWCQDPKNSIILTYRTTPGTLARFLIDNPSEKITEIELRKRVKLEG 389
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 41/116 (35%), Positives = 57/116 (49%), Gaps = 40/116 (34%)
Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
++P H + F+NE +LSDFKQ L R GI EF+ GVL+ C VAVRR
Sbjct: 707 EVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNLVAVRR------------- 752
Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
+++ I LEGCL ++YR++ LLY+QYAI+
Sbjct: 753 --------------------------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 26/40 (65%), Positives = 30/40 (75%)
Query: 203 VAVRRVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
V +VVL S PD+ECGFSRDLF QWC P+NSII+T R
Sbjct: 320 VPSPKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 359
>gi|355680846|gb|AER96660.1| cleavage and polyadenylation specific factor 2, 100kDa [Mustela
putorius furo]
Length = 569
Score = 158 bits (399), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 67/108 (62%), Positives = 88/108 (81%)
Query: 32 AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
+++EWMSDKLM+ FE RNNPF F+H+ LCH L++LA+VPSPKVVL S PD+ECGFSRDL
Sbjct: 282 SQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDL 341
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEG 139
F QWC P+NSII+T RT+PGTLAR LI+ + ++++KR++LEG
Sbjct: 342 FIQWCQDPKNSIILTYRTTPGTLARFLIDNPSEKITEIELRKRVKLEG 389
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 26/40 (65%), Positives = 30/40 (75%)
Query: 203 VAVRRVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
V +VVL S PD+ECGFSRDLF QWC P+NSII+T R
Sbjct: 320 VPSPKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 359
>gi|224051637|ref|XP_002200593.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
2 [Taeniopygia guttata]
Length = 782
Score = 158 bits (399), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 67/108 (62%), Positives = 89/108 (82%)
Query: 32 AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
+++EWMSDKLM+ FE RNNPF F+H+ LCHSL++LA+VPSPKVVL S PD+ECGFSRDL
Sbjct: 282 SQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHSLSDLARVPSPKVVLASQPDLECGFSRDL 341
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEG 139
F QWC +NSII+T RT+PGTLAR LI+ + + ++++KR++LEG
Sbjct: 342 FIQWCQDSKNSIILTYRTTPGTLARFLIDNPSEKVIDIELRKRVKLEG 389
Score = 68.2 bits (165), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 43/122 (35%), Positives = 59/122 (48%), Gaps = 45/122 (36%)
Query: 160 EKIPPHD-----TSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTP 214
E +PPH+ + F+NE +LSDFKQ L R GI EF+ GVL+ C VAVRR
Sbjct: 701 EPMPPHEVLGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNLVAVRR------- 752
Query: 215 DMECGFSRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYA 274
+++ I LEGCL ++YR++ LLY+QYA
Sbjct: 753 --------------------------------TETGRIGLEGCLCQDFYRIRDLLYEQYA 780
Query: 275 IL 276
I+
Sbjct: 781 IV 782
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 24/36 (66%), Positives = 28/36 (77%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
+VVL S PD+ECGFSRDLF QWC +NSII+T R
Sbjct: 324 KVVLASQPDLECGFSRDLFIQWCQDSKNSIILTYRT 359
>gi|187608214|ref|NP_001120452.1| cleavage and polyadenylation specific factor 2, 100kDa [Xenopus
(Silurana) tropicalis]
gi|170285004|gb|AAI61233.1| LOC100145546 protein [Xenopus (Silurana) tropicalis]
Length = 783
Score = 157 bits (397), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 65/108 (60%), Positives = 88/108 (81%)
Query: 32 AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
+++EWMSDKLM+ FE RNNPF F+H+ LCH ++LA+VPSPKVVL S PD+ECGFSR+L
Sbjct: 282 SQVEWMSDKLMRCFEDKRNNPFQFRHLTLCHGFSDLARVPSPKVVLASQPDLECGFSREL 341
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEG 139
F QWC P+NS+I+T RT+PGTLAR LI+ R + ++++KR++LEG
Sbjct: 342 FIQWCQDPKNSVILTYRTTPGTLARFLIDHPSERIIDIELRKRVKLEG 389
Score = 68.2 bits (165), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 39/116 (33%), Positives = 59/116 (50%), Gaps = 40/116 (34%)
Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
++P H + F+NE +LSDFKQ L R GI EF+ GVL+ C VAVRR
Sbjct: 708 EVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNMVAVRR------------- 753
Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
+++ I LEGCL ++++++++LLY+QYAI+
Sbjct: 754 --------------------------TETGRIGLEGCLCEDFFKIRELLYEQYAIV 783
Score = 60.1 bits (144), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 27/52 (51%), Positives = 33/52 (63%), Gaps = 5/52 (9%)
Query: 196 LICCRG-----TVAVRRVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
L C G V +VVL S PD+ECGFSR+LF QWC P+NS+I+T R
Sbjct: 308 LTLCHGFSDLARVPSPKVVLASQPDLECGFSRELFIQWCQDPKNSVILTYRT 359
>gi|326920924|ref|XP_003206716.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
2-like [Meleagris gallopavo]
Length = 782
Score = 157 bits (396), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 66/108 (61%), Positives = 89/108 (82%)
Query: 32 AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
+++EWMSDKLM+ FE RNNPF F+H+ LCHSL++LA+VPSPKVVL S PD+ECGFSRDL
Sbjct: 282 SQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHSLSDLARVPSPKVVLASQPDLECGFSRDL 341
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEG 139
F QWC +NSII+T RT+PGTLAR LI+ + + +++++R++LEG
Sbjct: 342 FIQWCQDSKNSIILTYRTTPGTLARFLIDNPSEKVIDIELRRRVKLEG 389
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 41/116 (35%), Positives = 57/116 (49%), Gaps = 40/116 (34%)
Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
++P H + F+NE +LSDFKQ L R GI EF+ GVL+ C VAVRR
Sbjct: 707 EVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNMVAVRR------------- 752
Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
+++ I LEGCL ++YR+++LLY QYAI+
Sbjct: 753 --------------------------TETGRIGLEGCLCQDFYRIRELLYKQYAIV 782
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 24/36 (66%), Positives = 28/36 (77%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
+VVL S PD+ECGFSRDLF QWC +NSII+T R
Sbjct: 324 KVVLASQPDLECGFSRDLFIQWCQDSKNSIILTYRT 359
>gi|449280731|gb|EMC87967.1| Cleavage and polyadenylation specificity factor subunit 2 [Columba
livia]
Length = 782
Score = 156 bits (394), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 65/108 (60%), Positives = 89/108 (82%)
Query: 32 AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
+++EWMSDKL++ FE RNNPF F+H+ LCHSL++LA+VPSPKVVL S PD+ECGFSRDL
Sbjct: 282 SQVEWMSDKLLRCFEDKRNNPFQFRHLSLCHSLSDLARVPSPKVVLASQPDLECGFSRDL 341
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEG 139
F QWC +NS+I+T RT+PGTLAR LI+ + + ++++KR++LEG
Sbjct: 342 FIQWCQDSKNSVILTYRTTPGTLARFLIDNPSEKVIDIELRKRVKLEG 389
Score = 67.4 bits (163), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 43/122 (35%), Positives = 58/122 (47%), Gaps = 45/122 (36%)
Query: 160 EKIPPHD-----TSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTP 214
E +PPH+ + F+NE +LSDFKQ L R GI EF+ GVL+ C VAVRR
Sbjct: 701 EPLPPHEVIGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNLVAVRR------- 752
Query: 215 DMECGFSRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYA 274
+++ I LEGCL ++YR++ LLY QYA
Sbjct: 753 --------------------------------TETGRIGLEGCLCQDFYRIRDLLYKQYA 780
Query: 275 IL 276
I+
Sbjct: 781 IV 782
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 23/36 (63%), Positives = 28/36 (77%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
+VVL S PD+ECGFSRDLF QWC +NS+I+T R
Sbjct: 324 KVVLASQPDLECGFSRDLFIQWCQDSKNSVILTYRT 359
>gi|47125306|gb|AAH70095.1| Cleavage and polyadenylation specific factor 2, 100kDa [Homo
sapiens]
Length = 782
Score = 155 bits (393), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 66/108 (61%), Positives = 87/108 (80%)
Query: 32 AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
+++EWMS KLM+ FE RNNPF F+H+ LCH L++LA+VPSPKVVL S PD+ECGFSRDL
Sbjct: 282 SQVEWMSGKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDL 341
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEG 139
F QWC P+NSII+T RT+PGTLAR LI+ + ++++KR++LEG
Sbjct: 342 FIQWCQDPKNSIILTYRTTPGTLARFLIDNPSEKITEIELRKRVKLEG 389
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 41/116 (35%), Positives = 57/116 (49%), Gaps = 40/116 (34%)
Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
++P H + F+NE +LSDFKQ L R GI EF+ GVL+ C VAVRR
Sbjct: 707 EVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNQVAVRR------------- 752
Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
+++ I LEGCL ++YR++ LLY+QYAI+
Sbjct: 753 --------------------------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 25/36 (69%), Positives = 29/36 (80%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
+VVL S PD+ECGFSRDLF QWC P+NSII+T R
Sbjct: 324 KVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 359
>gi|449662070|ref|XP_004205466.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
2-like, partial [Hydra magnipapillata]
Length = 568
Score = 155 bits (392), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 75/136 (55%), Positives = 101/136 (74%), Gaps = 5/136 (3%)
Query: 24 LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTP 81
+ +++ E AK +EWMSD++MKSFE R NPF FKH+ LCH L EL ++PSPKVVL S
Sbjct: 90 VSYNVVEFAKSQVEWMSDRMMKSFEVDRRNPFAFKHITLCHFLKELDQLPSPKVVLASAA 149
Query: 82 DMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEE 141
DM CGFS+DLF QW S+P+NS+I T +TSPG+LAR LI+ ++ L+V KR+RLEG E
Sbjct: 150 DMNCGFSKDLFVQWASNPKNSVIFTFKTSPGSLARTLIDNPKIESVELEVFKRVRLEGVE 209
Query: 142 LEEY---QKKKDKEAK 154
L +Y +K+K ++AK
Sbjct: 210 LSQYLEVEKEKARQAK 225
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 54/166 (32%), Positives = 76/166 (45%), Gaps = 42/166 (25%)
Query: 113 TLARD--LIELGGNRTLTLQVKKRIRLEGEELEEYQKKKDKEAKDKQEKEKIPPHDTSFI 170
+ARD L + G + + +K ++E E E+ +K+ ++ E IP H T FI
Sbjct: 443 AVARDTELAWVDGQLVMEARGEKFNQIEQENSEKVEKQDVVPVLEQLPPEMIPGHATVFI 502
Query: 171 NELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGFSRDLFFQWCS 230
+E +LSDFKQ L + GI EF GVL+ C VAVRR
Sbjct: 503 DEPRLSDFKQVLTKAGIQAEFTGGVLV-CNNVVAVRR----------------------- 538
Query: 231 SPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
+ I +EG L +EYY ++QLLYDQYAI+
Sbjct: 539 ----------------GEQGKISIEGGLCEEYYVIRQLLYDQYAIV 568
Score = 53.5 bits (127), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 21/33 (63%), Positives = 26/33 (78%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIIT 239
+VVL S DM CGFS+DLF QW S+P+NS+I T
Sbjct: 142 KVVLASAADMNCGFSKDLFVQWASNPKNSVIFT 174
>gi|395503674|ref|XP_003756188.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
2 [Sarcophilus harrisii]
Length = 782
Score = 155 bits (391), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 66/108 (61%), Positives = 87/108 (80%)
Query: 32 AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
+++EWMSDKLM+ FE RNNPF F+H+ LCH L++LA+VPSPKVVL S PD+ECGFSRDL
Sbjct: 282 SQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDL 341
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEG 139
F QWC +NSII+T RT+PGTLAR LI+ + ++++KR++LEG
Sbjct: 342 FIQWCQDSKNSIILTYRTTPGTLARFLIDNPSEKITEIELRKRVKLEG 389
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 41/116 (35%), Positives = 57/116 (49%), Gaps = 40/116 (34%)
Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
++P H + F+NE +LSDFKQ L R GI EF+ GVL+ C VAVRR
Sbjct: 707 EVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNLVAVRR------------- 752
Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
+++ I LEGCL ++YR++ LLY+QYAI+
Sbjct: 753 --------------------------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 24/36 (66%), Positives = 28/36 (77%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
+VVL S PD+ECGFSRDLF QWC +NSII+T R
Sbjct: 324 KVVLASQPDLECGFSRDLFIQWCQDSKNSIILTYRT 359
>gi|126282067|ref|XP_001365312.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
2 isoform 1 [Monodelphis domestica]
Length = 782
Score = 155 bits (391), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 66/108 (61%), Positives = 87/108 (80%)
Query: 32 AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
+++EWMSDKLM+ FE RNNPF F+H+ LCH L++LA+VPSPKVVL S PD+ECGFSRDL
Sbjct: 282 SQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDL 341
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEG 139
F QWC +NSII+T RT+PGTLAR LI+ + ++++KR++LEG
Sbjct: 342 FIQWCQDSKNSIILTYRTTPGTLARFLIDNPSEKITEIELRKRVKLEG 389
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 41/116 (35%), Positives = 57/116 (49%), Gaps = 40/116 (34%)
Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
++P H + F+NE +LSDFKQ L R GI EF+ GVL+ C VAVRR
Sbjct: 707 EVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNLVAVRR------------- 752
Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
+++ I LEGCL ++YR++ LLY+QYAI+
Sbjct: 753 --------------------------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 24/36 (66%), Positives = 28/36 (77%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
+VVL S PD+ECGFSRDLF QWC +NSII+T R
Sbjct: 324 KVVLASQPDLECGFSRDLFIQWCQDSKNSIILTYRT 359
>gi|340370496|ref|XP_003383782.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
2-like [Amphimedon queenslandica]
Length = 730
Score = 154 bits (390), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 75/142 (52%), Positives = 103/142 (72%), Gaps = 5/142 (3%)
Query: 24 LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTP 81
+ +++ E AK +EWMS+KLM++FE +R NPF F+H+ LCH+L ELAKV +PK VLVS P
Sbjct: 272 VSYNVVEFAKSQVEWMSEKLMRTFEDSRTNPFQFQHINLCHNLEELAKVSNPKAVLVSPP 331
Query: 82 DMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEE 141
D+ECGFSRDLF W ++P NSII T++T+ TLAR L++ T+ + VK+R+ LEG E
Sbjct: 332 DLECGFSRDLFLHWSNNPHNSIIFTSKTAHNTLARTLVDNLKIITIDMDVKRRVPLEGAE 391
Query: 142 LEEY---QKKKDKEAKDKQEKE 160
LEEY +K+K K A D K+
Sbjct: 392 LEEYLMKEKEKAKTANDDDAKD 413
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 36/117 (30%), Positives = 54/117 (46%), Gaps = 40/117 (34%)
Query: 160 EKIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECG 219
+++ H FIN +LSDFK TL + GI EF G LI C G VA++R
Sbjct: 654 KQVADHKNVFINPPRLSDFKNTLTKAGIQAEFCGGALI-CNGVVAIKR------------ 700
Query: 220 FSRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
++ I +EG +SD+YY +++LLY+Q+AI+
Sbjct: 701 ---------------------------TEGGKISIEGSVSDDYYLIRKLLYEQFAIV 730
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 23/40 (57%), Positives = 30/40 (75%)
Query: 203 VAVRRVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
V+ + VLVS PD+ECGFSRDLF W ++P NSII T++
Sbjct: 320 VSNPKAVLVSPPDLECGFSRDLFLHWSNNPHNSIIFTSKT 359
>gi|156399337|ref|XP_001638458.1| predicted protein [Nematostella vectensis]
gi|156225579|gb|EDO46395.1| predicted protein [Nematostella vectensis]
Length = 737
Score = 154 bits (390), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 72/134 (53%), Positives = 100/134 (74%), Gaps = 2/134 (1%)
Query: 24 LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTP 81
+ +++ E AK +EWMSDK+MK+FE RNNP+ F++ LCHSLA+LA+VP PKVVL S
Sbjct: 272 VSYNVIEFAKSQVEWMSDKIMKAFEIGRNNPYQFRYCHLCHSLADLARVPEPKVVLASMM 331
Query: 82 DMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEE 141
D+ GFSRDLF +W +P+N++I T R+SPGTLAR LI+ + + L+VK+R+RL GEE
Sbjct: 332 DLTAGFSRDLFVEWADNPKNTVIFTARSSPGTLARTLIDNLELKQVELEVKQRVRLGGEE 391
Query: 142 LEEYQKKKDKEAKD 155
LE Y ++ K+ KD
Sbjct: 392 LERYLEENKKKEKD 405
Score = 61.2 bits (147), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 38/116 (32%), Positives = 53/116 (45%), Gaps = 40/116 (34%)
Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
KI H + FINE +LSDFKQ L + GI EF GVLIC
Sbjct: 662 KIAGHVSVFINEPRLSDFKQVLNKAGIQAEFAGGVLIC---------------------- 699
Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
N+++ RN ++ + LEG + ++YY ++ LLY QYAI+
Sbjct: 700 -------------NNVVCVRRN-----ETGRVGLEGTVCEDYYTIRDLLYSQYAIV 737
Score = 46.6 bits (109), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 18/36 (50%), Positives = 26/36 (72%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
+VVL S D+ GFSRDLF +W +P+N++I T R+
Sbjct: 324 KVVLASMMDLTAGFSRDLFVEWADNPKNTVIFTARS 359
>gi|71894931|ref|NP_001026379.1| cleavage and polyadenylation specificity factor subunit 2 [Gallus
gallus]
gi|60098929|emb|CAH65295.1| hypothetical protein RCJMB04_15m16 [Gallus gallus]
Length = 782
Score = 154 bits (390), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 65/108 (60%), Positives = 88/108 (81%)
Query: 32 AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
+++EWMSDKLM+ FE RNNPF F+H+ LCHSL++LA+VP PKVVL S PD+ECGFSRDL
Sbjct: 282 SQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHSLSDLARVPCPKVVLASQPDLECGFSRDL 341
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEG 139
F QWC +NSII+T RT+PGTLAR LI+ + + +++++R++LEG
Sbjct: 342 FIQWCQDSKNSIILTYRTTPGTLARFLIDNPSEKVIDIELRRRVKLEG 389
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 41/116 (35%), Positives = 57/116 (49%), Gaps = 40/116 (34%)
Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
++P H + F+NE +LSDFKQ L R GI EF+ GVL+ C VAVRR
Sbjct: 707 EVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNMVAVRR------------- 752
Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
+++ I LEGCL ++YR+++LLY QYAI+
Sbjct: 753 --------------------------TETGRIGLEGCLCQDFYRIRELLYKQYAIV 782
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 25/40 (62%), Positives = 29/40 (72%)
Query: 203 VAVRRVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
V +VVL S PD+ECGFSRDLF QWC +NSII+T R
Sbjct: 320 VPCPKVVLASQPDLECGFSRDLFIQWCQDSKNSIILTYRT 359
>gi|321462132|gb|EFX73157.1| hypothetical protein DAPPUDRAFT_58164 [Daphnia pulex]
Length = 735
Score = 152 bits (385), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 70/128 (54%), Positives = 100/128 (78%), Gaps = 2/128 (1%)
Query: 24 LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTP 81
+ +++ E AK IEWMSDKLMKSFEGARNNPF FK+++LCH+L E+ ++ KVVL S P
Sbjct: 272 VAYNVNEFAKSQIEWMSDKLMKSFEGARNNPFGFKYLQLCHTLPEVLRIAGSKVVLSSCP 331
Query: 82 DMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEE 141
D+ECGF+RDLF WCS NSII+T+R+ GTL + L + +++TL++K+R++LEG E
Sbjct: 332 DLECGFARDLFALWCSDARNSIILTSRSGQGTLGQRLHDQRNLKSVTLELKQRVKLEGAE 391
Query: 142 LEEYQKKK 149
LEE+++K+
Sbjct: 392 LEEFRRKE 399
Score = 96.7 bits (239), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 51/119 (42%), Positives = 66/119 (55%), Gaps = 39/119 (32%)
Query: 158 EKEKIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDME 217
E + IP H+TS+INEL+LSDFKQ L RNGI EF+ GVL CC G VA+RR
Sbjct: 656 EPKDIPGHETSYINELKLSDFKQVLVRNGISSEFIGGVLWCCNGNVALRR---------- 705
Query: 218 CGFSRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
++S + LEGC+SD+YYRV++LLY+QYAI+
Sbjct: 706 -----------------------------NESGRVTLEGCISDDYYRVRELLYEQYAII 735
Score = 57.4 bits (137), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 24/40 (60%), Positives = 31/40 (77%)
Query: 203 VAVRRVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
+A +VVL S PD+ECGF+RDLF WCS NSII+T+R+
Sbjct: 320 IAGSKVVLSSCPDLECGFARDLFALWCSDARNSIILTSRS 359
>gi|198428144|ref|XP_002129804.1| PREDICTED: similar to cleavage and polyadenylation specific factor
2 [Ciona intestinalis]
Length = 784
Score = 152 bits (384), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 66/128 (51%), Positives = 97/128 (75%), Gaps = 2/128 (1%)
Query: 24 LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTP 81
+ +++ E AK +EWMSDK++ SF RNNPFHFKH+KLCH+L +LA+VP PK VL ST
Sbjct: 280 VTYNVVEFAKFMVEWMSDKIINSFTDQRNNPFHFKHLKLCHNLGDLAQVPQPKCVLASTA 339
Query: 82 DMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEE 141
DMECGF+R LF +W + P N++IIT+R++ GTL+R L++ L L++KKR+ + GEE
Sbjct: 340 DMECGFARQLFIRWAADPRNTVIITSRSTKGTLSRTLVDDPTVSRLKLEMKKRVPIIGEE 399
Query: 142 LEEYQKKK 149
L++Y++ +
Sbjct: 400 LDQYERNR 407
Score = 67.0 bits (162), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 40/117 (34%), Positives = 54/117 (46%), Gaps = 40/117 (34%)
Query: 160 EKIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECG 219
E P H T FINEL+LSDFKQ L + G EF+ GVL+C
Sbjct: 708 EDTPGHQTCFINELRLSDFKQVLTKEGYQAEFIGGVLVC--------------------- 746
Query: 220 FSRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
N+++ RN+ D LEG L++EYY ++ LLY QYA++
Sbjct: 747 --------------NNMLAIRRNQQGHID-----LEGTLTEEYYAIRDLLYQQYAVV 784
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 20/36 (55%), Positives = 28/36 (77%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
+ VL ST DMECGF+R LF +W + P N++IIT+R+
Sbjct: 332 KCVLASTADMECGFARQLFIRWAADPRNTVIITSRS 367
>gi|119601887|gb|EAW81481.1| cleavage and polyadenylation specific factor 2, 100kDa, isoform
CRA_b [Homo sapiens]
Length = 496
Score = 150 bits (379), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 65/103 (63%), Positives = 83/103 (80%)
Query: 37 MSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWC 96
MSDKLM+ FE RNNPF F+H+ LCH L++LA+VPSPKVVL S PD+ECGFSRDLF QWC
Sbjct: 1 MSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDLFIQWC 60
Query: 97 SSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEG 139
P+NSII+T RT+PGTLAR LI+ + ++++KR++LEG
Sbjct: 61 QDPKNSIILTYRTTPGTLARFLIDNPSEKITEIELRKRVKLEG 103
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 41/116 (35%), Positives = 57/116 (49%), Gaps = 40/116 (34%)
Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
++P H + F+NE +LSDFKQ L R GI EF+ GVL+ C VAVRR
Sbjct: 421 EVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNQVAVRR------------- 466
Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
+++ I LEGCL ++YR++ LLY+QYAI+
Sbjct: 467 --------------------------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 496
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 26/40 (65%), Positives = 30/40 (75%)
Query: 203 VAVRRVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
V +VVL S PD+ECGFSRDLF QWC P+NSII+T R
Sbjct: 34 VPSPKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 73
>gi|209420822|gb|ACI46951.1| cyclin B [Fenneropenaeus penicillatus]
Length = 475
Score = 150 bits (379), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 66/124 (53%), Positives = 95/124 (76%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
IEWMS+KL K+F+ R NPF FKH+K CH+L +L+++PSPKVVL S PD+ CG++R+LF
Sbjct: 2 IEWMSEKLTKAFDSLRTNPFSFKHLKFCHNLTDLSRLPSPKVVLASFPDLGCGYARELFV 61
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKDKEA 153
QW ++P+N+II+T+RT P TLAR LI+ RT L K+R++LEG EL+E+ + K +E
Sbjct: 62 QWATNPKNTIILTSRTGPDTLARRLIDNPQIRTFKLLEKRRMKLEGSELDEHYRMKREEE 121
Query: 154 KDKQ 157
+ +Q
Sbjct: 122 QQQQ 125
Score = 57.0 bits (136), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 28/77 (36%), Positives = 44/77 (57%), Gaps = 16/77 (20%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRNRADDSDSNVI-------------- 252
+VVL S PD+ CG++R+LF QW ++P+N+II+T+R D +I
Sbjct: 42 KVVLASFPDLGCGYARELFVQWATNPKNTIILTSRTGPDTLARRLIDNPQIRTFKLLEKR 101
Query: 253 --VLEGCLSDEYYRVQQ 267
LEG DE+YR+++
Sbjct: 102 RMKLEGSELDEHYRMKR 118
>gi|324503279|gb|ADY41427.1| Cleavage and polyadenylation specificity factor subunit 2 [Ascaris
suum]
Length = 841
Score = 148 bits (373), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 75/155 (48%), Positives = 101/155 (65%), Gaps = 14/155 (9%)
Query: 19 FKHVKLCH---SLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSP 73
+ V L H S+ E AK +EWMSDK++KSFE R NPF F+HV+LCH+ +L ++ SP
Sbjct: 263 YNLVMLSHVASSVVEFAKSQVEWMSDKILKSFEVGRYNPFQFRHVQLCHTHMDLLRIRSP 322
Query: 74 KVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGG---------N 124
KVVLVS DMECGFSR++F +WC+ N++I+T R+ TL LI + N
Sbjct: 323 KVVLVSGLDMECGFSREIFLEWCADVRNTVIVTGRSGDRTLGARLIRMAEQMAENPSTVN 382
Query: 125 RTLTLQVKKRIRLEGEELEEYQKKKDKEAKDKQEK 159
R LTL+VK+RIRLEG ELE Y+ KK + ++ K
Sbjct: 383 RNLTLEVKRRIRLEGVELENYRAKKRADEREAARK 417
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 20/36 (55%), Positives = 29/36 (80%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
+VVLVS DMECGFSR++F +WC+ N++I+T R+
Sbjct: 323 KVVLVSGLDMECGFSREIFLEWCADVRNTVIVTGRS 358
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 32/115 (27%), Positives = 47/115 (40%), Gaps = 40/115 (34%)
Query: 162 IPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGFS 221
IP H F+N+ +LSD KQ L NG EF GVL
Sbjct: 767 IPSHQAVFVNDPKLSDMKQLLMANGFHAEFSSGVLYI----------------------- 803
Query: 222 RDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
N++ RN A +EGC S++YY+++ ++Y Q+AI+
Sbjct: 804 ------------NNVASIRRNEAGR-----FHVEGCASEDYYKIRDIVYAQFAIV 841
>gi|312084310|ref|XP_003144223.1| cleavage and polyadenylation specificity factor subunit 2 [Loa loa]
Length = 837
Score = 147 bits (371), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 77/152 (50%), Positives = 100/152 (65%), Gaps = 15/152 (9%)
Query: 19 FKHVKLCH---SLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSP 73
+ V L H S+ E AK +EWMSDK++KSFE R NPF F+HV+LCH+ +L +V SP
Sbjct: 263 YNLVMLSHVASSVVEFAKSQVEWMSDKVLKSFEVGRYNPFQFRHVQLCHTHIDLLRVRSP 322
Query: 74 KVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGG---------- 123
KVVLVS DME GFSR+LF +WC+ +NS+I+T R+ TL LI +
Sbjct: 323 KVVLVSGLDMESGFSRELFLEWCTDIKNSVIVTGRSGDRTLGARLIRMAEQAAENPNGTI 382
Query: 124 NRTLTLQVKKRIRLEGEELEEYQKKKDKEAKD 155
NR LTL+VK+RIRLEG ELE Y+ KK E ++
Sbjct: 383 NRNLTLEVKRRIRLEGAELENYRAKKRAEERE 414
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 21/36 (58%), Positives = 29/36 (80%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
+VVLVS DME GFSR+LF +WC+ +NS+I+T R+
Sbjct: 323 KVVLVSGLDMESGFSRELFLEWCTDIKNSVIVTGRS 358
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 32/115 (27%), Positives = 48/115 (41%), Gaps = 40/115 (34%)
Query: 162 IPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGFS 221
IPPH T F+N+ +LSD KQ L NG EF G+L
Sbjct: 763 IPPHQTVFVNDPKLSDVKQLLASNGFRAEFSSGILYI----------------------- 799
Query: 222 RDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
N+I RN A +EGC ++YY+++ ++Y Q+A++
Sbjct: 800 ------------NNIASIRRNEA-----GRFHVEGCACEDYYKIRDIVYAQFAVV 837
>gi|393910520|gb|EJD75913.1| cleavage and polyadenylation specificity factor subunit 2, variant
[Loa loa]
Length = 664
Score = 147 bits (371), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 77/152 (50%), Positives = 100/152 (65%), Gaps = 15/152 (9%)
Query: 19 FKHVKLCH---SLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSP 73
+ V L H S+ E AK +EWMSDK++KSFE R NPF F+HV+LCH+ +L +V SP
Sbjct: 263 YNLVMLSHVASSVVEFAKSQVEWMSDKVLKSFEVGRYNPFQFRHVQLCHTHIDLLRVRSP 322
Query: 74 KVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGG---------- 123
KVVLVS DME GFSR+LF +WC+ +NS+I+T R+ TL LI +
Sbjct: 323 KVVLVSGLDMESGFSRELFLEWCTDIKNSVIVTGRSGDRTLGARLIRMAEQAAENPNGTI 382
Query: 124 NRTLTLQVKKRIRLEGEELEEYQKKKDKEAKD 155
NR LTL+VK+RIRLEG ELE Y+ KK E ++
Sbjct: 383 NRNLTLEVKRRIRLEGAELENYRAKKRAEERE 414
Score = 51.6 bits (122), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 21/36 (58%), Positives = 29/36 (80%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
+VVLVS DME GFSR+LF +WC+ +NS+I+T R+
Sbjct: 323 KVVLVSGLDMESGFSRELFLEWCTDIKNSVIVTGRS 358
>gi|393910519|gb|EFO19846.2| cleavage and polyadenylation specificity factor subunit 2 [Loa loa]
Length = 828
Score = 147 bits (371), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 77/152 (50%), Positives = 100/152 (65%), Gaps = 15/152 (9%)
Query: 19 FKHVKLCH---SLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSP 73
+ V L H S+ E AK +EWMSDK++KSFE R NPF F+HV+LCH+ +L +V SP
Sbjct: 263 YNLVMLSHVASSVVEFAKSQVEWMSDKVLKSFEVGRYNPFQFRHVQLCHTHIDLLRVRSP 322
Query: 74 KVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGG---------- 123
KVVLVS DME GFSR+LF +WC+ +NS+I+T R+ TL LI +
Sbjct: 323 KVVLVSGLDMESGFSRELFLEWCTDIKNSVIVTGRSGDRTLGARLIRMAEQAAENPNGTI 382
Query: 124 NRTLTLQVKKRIRLEGEELEEYQKKKDKEAKD 155
NR LTL+VK+RIRLEG ELE Y+ KK E ++
Sbjct: 383 NRNLTLEVKRRIRLEGAELENYRAKKRAEERE 414
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 21/36 (58%), Positives = 29/36 (80%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
+VVLVS DME GFSR+LF +WC+ +NS+I+T R+
Sbjct: 323 KVVLVSGLDMESGFSRELFLEWCTDIKNSVIVTGRS 358
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 32/115 (27%), Positives = 48/115 (41%), Gaps = 40/115 (34%)
Query: 162 IPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGFS 221
IPPH T F+N+ +LSD KQ L NG EF G+L
Sbjct: 754 IPPHQTVFVNDPKLSDVKQLLASNGFRAEFSSGILYI----------------------- 790
Query: 222 RDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
N+I RN A +EGC ++YY+++ ++Y Q+A++
Sbjct: 791 ------------NNIASIRRNEA-----GRFHVEGCACEDYYKIRDIVYAQFAVV 828
>gi|349604123|gb|AEP99763.1| Cleavage and polyadenylation specificity factor subunit 2-like
protein, partial [Equus caballus]
Length = 281
Score = 146 bits (369), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 62/89 (69%), Positives = 76/89 (85%)
Query: 32 AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
+++EWMSDKLM+ FE RNNPF F+H+ LCH L++LA+VPSPKVVL S PD+ECGFSRDL
Sbjct: 189 SQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDL 248
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLIE 120
F QWC P+NSII+T RT+PGTLAR LI+
Sbjct: 249 FIQWCQDPKNSIILTYRTTPGTLARFLID 277
Score = 61.6 bits (148), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 25/35 (71%), Positives = 29/35 (82%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 241
+VVL S PD+ECGFSRDLF QWC P+NSII+T R
Sbjct: 231 KVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYR 265
>gi|350587145|ref|XP_001926907.3| PREDICTED: cleavage and polyadenylation specificity factor subunit
2 [Sus scrofa]
Length = 438
Score = 145 bits (367), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 62/89 (69%), Positives = 76/89 (85%)
Query: 32 AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
+++EWMSDKLM+ FE RNNPF F+H+ LCH L++LA+VPSPKVVL S PD+ECGFSRDL
Sbjct: 282 SQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDL 341
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLIE 120
F QWC P+NSII+T RT+PGTLAR LI+
Sbjct: 342 FIQWCQDPKNSIILTYRTTPGTLARFLID 370
Score = 61.6 bits (148), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 25/36 (69%), Positives = 29/36 (80%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
+VVL S PD+ECGFSRDLF QWC P+NSII+T R
Sbjct: 324 KVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 359
>gi|170581110|ref|XP_001895540.1| cleavage and polyadenylation specificity factor [Brugia malayi]
gi|158597460|gb|EDP35606.1| cleavage and polyadenylation specificity factor, putative [Brugia
malayi]
Length = 831
Score = 145 bits (367), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 76/152 (50%), Positives = 100/152 (65%), Gaps = 15/152 (9%)
Query: 19 FKHVKLCH---SLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSP 73
+ V L H S+ E AK +EWMSDK++KSFE R NPF F+HV+LCH+ +L +V SP
Sbjct: 263 YNLVMLSHVASSVVEFAKSQVEWMSDKVLKSFEVGRYNPFQFRHVQLCHTHIDLMRVRSP 322
Query: 74 KVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGG---------- 123
KVVLVS DME GFSR+LF +WC+ +NS+I+T R+ TL LI +
Sbjct: 323 KVVLVSGLDMESGFSRELFLEWCTDIKNSVIVTGRSGDRTLGARLIRMAEQAAENPNGTI 382
Query: 124 NRTLTLQVKKRIRLEGEELEEYQKKKDKEAKD 155
NR LTL+VK+RIRL+G ELE Y+ KK E ++
Sbjct: 383 NRNLTLEVKRRIRLDGVELENYRAKKRAEERE 414
Score = 51.6 bits (122), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 21/36 (58%), Positives = 29/36 (80%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
+VVLVS DME GFSR+LF +WC+ +NS+I+T R+
Sbjct: 323 KVVLVSGLDMESGFSRELFLEWCTDIKNSVIVTGRS 358
Score = 50.4 bits (119), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 32/115 (27%), Positives = 47/115 (40%), Gaps = 40/115 (34%)
Query: 162 IPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGFS 221
IPPH F+N+ +LSD KQ L NG EF GVL
Sbjct: 757 IPPHQAVFVNDPKLSDMKQLLASNGFRAEFSSGVLYI----------------------- 793
Query: 222 RDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
N+I RN A +EGC ++YY+++ ++Y Q+A++
Sbjct: 794 ------------NNIASIRRNEA-----GRFHVEGCACEDYYKIRDIVYAQFAVV 831
>gi|391325231|ref|XP_003737142.1| PREDICTED: probable cleavage and polyadenylation specificity factor
subunit 2-like isoform 1 [Metaseiulus occidentalis]
Length = 741
Score = 144 bits (362), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 69/129 (53%), Positives = 97/129 (75%), Gaps = 9/129 (6%)
Query: 32 AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
+++EWMSDK+M+SFEGAR+NPF FK++ CHS E+ V PKVVL S PD+E G+ RDL
Sbjct: 280 SQVEWMSDKVMRSFEGARSNPFQFKYLIPCHSHGEIQSVSEPKVVLASMPDLESGYGRDL 339
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRT--LTLQVKKRIRLEGEELEEYQK-- 147
F W S+P+NS+I+T+R+SPGTLAR+L++ NR + L +K+R+ LE +ELEE+ +
Sbjct: 340 FMLWASNPKNSVILTSRSSPGTLARNLVD---NRPKFVHLTLKQRVALEADELEEHVRNE 396
Query: 148 --KKDKEAK 154
KK+KE K
Sbjct: 397 RLKKEKETK 405
Score = 74.7 bits (182), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 48/147 (32%), Positives = 70/147 (47%), Gaps = 47/147 (31%)
Query: 138 EGEELEEYQKKKDKEAKDK--------QEKEKIPPHDTSFINELQLSDFKQTLQRNGIDC 189
E EE + K D+E K+K + PH+ FIN+L+LSDFKQ L ++GI
Sbjct: 634 EPEEDNDLVGKGDEETKEKLMVLQPLGDSNRVVAPHNPLFINDLKLSDFKQVLVKSGISA 693
Query: 190 EFMDGVLICCRGTVAVRRVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRNRADDSDS 249
EF GVL C +VAV+R +++
Sbjct: 694 EFSGGVLYCNNCSVAVKR---------------------------------------NET 714
Query: 250 NVIVLEGCLSDEYYRVQQLLYDQYAIL 276
+ +EG L+D+Y+R+++LLYDQYAIL
Sbjct: 715 GRLSVEGALTDDYFRIRELLYDQYAIL 741
Score = 53.5 bits (127), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 35/51 (68%)
Query: 202 TVAVRRVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRNRADDSDSNVI 252
+V+ +VVL S PD+E G+ RDLF W S+P+NS+I+T+R+ N++
Sbjct: 317 SVSEPKVVLASMPDLESGYGRDLFMLWASNPKNSVILTSRSSPGTLARNLV 367
>gi|391325235|ref|XP_003737144.1| PREDICTED: probable cleavage and polyadenylation specificity factor
subunit 2-like isoform 3 [Metaseiulus occidentalis]
Length = 754
Score = 143 bits (361), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 69/129 (53%), Positives = 97/129 (75%), Gaps = 9/129 (6%)
Query: 32 AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
+++EWMSDK+M+SFEGAR+NPF FK++ CHS E+ V PKVVL S PD+E G+ RDL
Sbjct: 280 SQVEWMSDKVMRSFEGARSNPFQFKYLIPCHSHGEIQSVSEPKVVLASMPDLESGYGRDL 339
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRT--LTLQVKKRIRLEGEELEEYQK-- 147
F W S+P+NS+I+T+R+SPGTLAR+L++ NR + L +K+R+ LE +ELEE+ +
Sbjct: 340 FMLWASNPKNSVILTSRSSPGTLARNLVD---NRPKFVHLTLKQRVALEADELEEHVRNE 396
Query: 148 --KKDKEAK 154
KK+KE K
Sbjct: 397 RLKKEKETK 405
Score = 74.7 bits (182), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 48/147 (32%), Positives = 70/147 (47%), Gaps = 47/147 (31%)
Query: 138 EGEELEEYQKKKDKEAKDK--------QEKEKIPPHDTSFINELQLSDFKQTLQRNGIDC 189
E EE + K D+E K+K + PH+ FIN+L+LSDFKQ L ++GI
Sbjct: 647 EPEEDNDLVGKGDEETKEKLMVLQPLGDSNRVVAPHNPLFINDLKLSDFKQVLVKSGISA 706
Query: 190 EFMDGVLICCRGTVAVRRVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRNRADDSDS 249
EF GVL C +VAV+R +++
Sbjct: 707 EFSGGVLYCNNCSVAVKR---------------------------------------NET 727
Query: 250 NVIVLEGCLSDEYYRVQQLLYDQYAIL 276
+ +EG L+D+Y+R+++LLYDQYAIL
Sbjct: 728 GRLSVEGALTDDYFRIRELLYDQYAIL 754
Score = 53.9 bits (128), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 35/51 (68%)
Query: 202 TVAVRRVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRNRADDSDSNVI 252
+V+ +VVL S PD+E G+ RDLF W S+P+NS+I+T+R+ N++
Sbjct: 317 SVSEPKVVLASMPDLESGYGRDLFMLWASNPKNSVILTSRSSPGTLARNLV 367
>gi|391325233|ref|XP_003737143.1| PREDICTED: probable cleavage and polyadenylation specificity factor
subunit 2-like isoform 2 [Metaseiulus occidentalis]
Length = 745
Score = 143 bits (361), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 69/129 (53%), Positives = 97/129 (75%), Gaps = 9/129 (6%)
Query: 32 AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
+++EWMSDK+M+SFEGAR+NPF FK++ CHS E+ V PKVVL S PD+E G+ RDL
Sbjct: 280 SQVEWMSDKVMRSFEGARSNPFQFKYLIPCHSHGEIQSVSEPKVVLASMPDLESGYGRDL 339
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRT--LTLQVKKRIRLEGEELEEYQK-- 147
F W S+P+NS+I+T+R+SPGTLAR+L++ NR + L +K+R+ LE +ELEE+ +
Sbjct: 340 FMLWASNPKNSVILTSRSSPGTLARNLVD---NRPKFVHLTLKQRVALEADELEEHVRNE 396
Query: 148 --KKDKEAK 154
KK+KE K
Sbjct: 397 RLKKEKETK 405
Score = 74.7 bits (182), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 48/147 (32%), Positives = 70/147 (47%), Gaps = 47/147 (31%)
Query: 138 EGEELEEYQKKKDKEAKDK--------QEKEKIPPHDTSFINELQLSDFKQTLQRNGIDC 189
E EE + K D+E K+K + PH+ FIN+L+LSDFKQ L ++GI
Sbjct: 638 EPEEDNDLVGKGDEETKEKLMVLQPLGDSNRVVAPHNPLFINDLKLSDFKQVLVKSGISA 697
Query: 190 EFMDGVLICCRGTVAVRRVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRNRADDSDS 249
EF GVL C +VAV+R +++
Sbjct: 698 EFSGGVLYCNNCSVAVKR---------------------------------------NET 718
Query: 250 NVIVLEGCLSDEYYRVQQLLYDQYAIL 276
+ +EG L+D+Y+R+++LLYDQYAIL
Sbjct: 719 GRLSVEGALTDDYFRIRELLYDQYAIL 745
Score = 53.5 bits (127), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 35/51 (68%)
Query: 202 TVAVRRVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRNRADDSDSNVI 252
+V+ +VVL S PD+E G+ RDLF W S+P+NS+I+T+R+ N++
Sbjct: 317 SVSEPKVVLASMPDLESGYGRDLFMLWASNPKNSVILTSRSSPGTLARNLV 367
>gi|196012036|ref|XP_002115881.1| hypothetical protein TRIADDRAFT_30006 [Trichoplax adhaerens]
gi|190581657|gb|EDV21733.1| hypothetical protein TRIADDRAFT_30006 [Trichoplax adhaerens]
Length = 745
Score = 137 bits (346), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 62/124 (50%), Positives = 89/124 (71%), Gaps = 3/124 (2%)
Query: 24 LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTP 81
+ +++ E AK +EWM DK++ +FE RNNPF FK+++LCHSL EL+ +P PKVVL S+P
Sbjct: 271 VSYNVVEFAKSQVEWMCDKVLVAFEDNRNNPFQFKYIQLCHSLNELSGLPEPKVVLASSP 330
Query: 82 DMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEE 141
D+ CGF+RDLF QW + +N I T R+SPGTL R +++ +++ + VK R+ L G E
Sbjct: 331 DLTCGFARDLFLQWAGNSKNLTIFTGRSSPGTLGRHILD-ERPQSIDVTVKTRVELSGNE 389
Query: 142 LEEY 145
LEEY
Sbjct: 390 LEEY 393
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 37/118 (31%), Positives = 56/118 (47%), Gaps = 40/118 (33%)
Query: 159 KEKIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMEC 218
+ +IP H + FIN+ +LSD KQTL + GI EF+ GV++ C +AVRR
Sbjct: 668 QSEIPGHKSVFINDPRLSDLKQTLTKAGIQAEFVGGVIV-CNDKIAVRR----------- 715
Query: 219 GFSRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
+++ I LEG + ++YY V+ +LY QYAI+
Sbjct: 716 ----------------------------TETGKITLEGAICNDYYTVRDILYQQYAII 745
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 19/36 (52%), Positives = 26/36 (72%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
+VVL S+PD+ CGF+RDLF QW + +N I T R+
Sbjct: 323 KVVLASSPDLTCGFARDLFLQWAGNSKNLTIFTGRS 358
>gi|395827898|ref|XP_003787126.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
2 [Otolemur garnettii]
Length = 750
Score = 135 bits (339), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 57/95 (60%), Positives = 75/95 (78%)
Query: 45 FEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSII 104
+ RNNPF F+H+ LCH L++LA+VPSPKVVL S PD+ECGFSRDLF QWC P+NSII
Sbjct: 294 YXDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDLFIQWCQDPKNSII 353
Query: 105 ITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEG 139
+T RT+PGTLAR LI+ + ++++KR++LEG
Sbjct: 354 LTYRTTPGTLARFLIDNPSEKITEIELRKRVKLEG 388
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 41/116 (35%), Positives = 57/116 (49%), Gaps = 40/116 (34%)
Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
++P H + F+NE +LSDFKQ L R GI EF+ GVL+ C VAVRR
Sbjct: 675 EVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNQVAVRR------------- 720
Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
+++ I LEGCL ++YR++ LLY+QYAI+
Sbjct: 721 --------------------------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 750
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 25/36 (69%), Positives = 29/36 (80%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
+VVL S PD+ECGFSRDLF QWC P+NSII+T R
Sbjct: 323 KVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 358
>gi|307203591|gb|EFN82620.1| Probable cleavage and polyadenylation specificity factor subunit 2
[Harpegnathos saltator]
Length = 685
Score = 134 bits (336), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 67/104 (64%), Positives = 85/104 (81%), Gaps = 6/104 (5%)
Query: 60 LCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLI 119
L +SLA L V VL STPDMECGFSR+LF QWC++P+NSII+T+RTSPGTLARDL+
Sbjct: 262 LAYSLALLNNV-----VLASTPDMECGFSRELFLQWCTNPQNSIILTSRTSPGTLARDLV 316
Query: 120 ELGGNRTLTLQVKKRIRLEGEELEEYQKK-KDKEAKDKQEKEKI 162
E GGNR +TL+VK+R++LEG ELEEYQK+ K K+ + KQE+ +I
Sbjct: 317 EKGGNRNITLEVKRRVKLEGIELEEYQKREKLKQEQLKQEQMEI 360
Score = 84.3 bits (207), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 43/116 (37%), Positives = 62/116 (53%), Gaps = 39/116 (33%)
Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
++P H T+FINEL+LSDFKQ L ++ I EF GVL CC T+AVRR
Sbjct: 609 EVPGHQTTFINELKLSDFKQVLNKSNISSEFSGGVLWCCNNTIAVRR------------- 655
Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
++ ++LEGC+S++YY+V++LLY+QYAI+
Sbjct: 656 --------------------------HEAGKVILEGCISEDYYKVRELLYEQYAIV 685
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 26/35 (74%), Positives = 32/35 (91%)
Query: 208 VVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
VVL STPDMECGFSR+LF QWC++P+NSII+T+R
Sbjct: 272 VVLASTPDMECGFSRELFLQWCTNPQNSIILTSRT 306
>gi|332672684|gb|AEE87269.1| cleavage and polyadenylation specific factor 2 [Sepia officinalis]
Length = 129
Score = 133 bits (335), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 55/81 (67%), Positives = 71/81 (87%)
Query: 32 AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
+++EWMSDK+M++FE RNNPFHFKH+KLCH+LAEL+KV PKVVL S PD++CGF+RDL
Sbjct: 49 SQVEWMSDKIMRAFEDHRNNPFHFKHLKLCHNLAELSKVMEPKVVLASMPDLQCGFARDL 108
Query: 92 FFQWCSSPENSIIITNRTSPG 112
F WC + +NSII+T+RTSPG
Sbjct: 109 FMGWCGNAKNSIILTSRTSPG 129
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 21/35 (60%), Positives = 29/35 (82%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 241
+VVL S PD++CGF+RDLF WC + +NSII+T+R
Sbjct: 91 KVVLASMPDLQCGFARDLFMGWCGNAKNSIILTSR 125
>gi|17559452|ref|NP_504822.1| Protein CPSF-2 [Caenorhabditis elegans]
gi|18201967|sp|O17403.1|CPSF2_CAEEL RecName: Full=Probable cleavage and polyadenylation specificity
factor subunit 2; AltName: Full=Cleavage and
polyadenylation specificity factor 100 kDa subunit;
Short=CPSF 100 kDa subunit
gi|351057814|emb|CCD64424.1| Protein CPSF-2 [Caenorhabditis elegans]
Length = 843
Score = 125 bits (313), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 67/139 (48%), Positives = 88/139 (63%), Gaps = 13/139 (9%)
Query: 24 LCHSLAELAK--IEWMSDKLMK-SFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVST 80
+ S+ + AK +EWM++KL K AR NPF KHV LCHS EL +V SPKVVL S+
Sbjct: 271 VASSVVQFAKSQLEWMNEKLFKYDSSSARYNPFTLKHVTLCHSHQELMRVRSPKVVLCSS 330
Query: 81 PDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELG----------GNRTLTLQ 130
DME GFSR+LF WCS P N +I+T R + TLA L+ + +R ++L
Sbjct: 331 QDMESGFSRELFLDWCSDPRNGVILTARPASFTLAAKLVNMAERANDGVLKHEDRLISLV 390
Query: 131 VKKRIRLEGEELEEYQKKK 149
VKKR+ LEGEEL EY+++K
Sbjct: 391 VKKRVALEGEELLEYKRRK 409
Score = 57.0 bits (136), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 33/122 (27%), Positives = 51/122 (41%), Gaps = 39/122 (31%)
Query: 155 DKQEKEKIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTP 214
D K IP H F+N+ +LSDFK L G EF+ G L+ G ++RR
Sbjct: 761 DPLPKRLIPIHQAVFVNDPKLSDFKNLLTDKGYKAEFLSGTLLINGGNCSIRR------- 813
Query: 215 DMECGFSRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYA 274
+D+ V +EG + +YY++++L YDQ+A
Sbjct: 814 --------------------------------NDTGVFQMEGAFTKDYYKLRRLFYDQFA 841
Query: 275 IL 276
+L
Sbjct: 842 VL 843
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 21/35 (60%), Positives = 26/35 (74%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 241
+VVL S+ DME GFSR+LF WCS P N +I+T R
Sbjct: 324 KVVLCSSQDMESGFSRELFLDWCSDPRNGVILTAR 358
>gi|308480408|ref|XP_003102411.1| CRE-CPSF-2 protein [Caenorhabditis remanei]
gi|308262077|gb|EFP06030.1| CRE-CPSF-2 protein [Caenorhabditis remanei]
Length = 850
Score = 123 bits (308), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 67/139 (48%), Positives = 87/139 (62%), Gaps = 13/139 (9%)
Query: 24 LCHSLAELAK--IEWMSDKLMK-SFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVST 80
+ S+ + AK +EWM++KL K AR NPF KH+ LCHS EL +V SPKVVL S+
Sbjct: 271 VASSVVQFAKSQLEWMNEKLFKYDSNSARYNPFTLKHITLCHSHQELMRVRSPKVVLCSS 330
Query: 81 PDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELG----------GNRTLTLQ 130
DME GFSR+LF WCS N +I+T R S TLA L+ L +R ++L
Sbjct: 331 QDMESGFSRELFLDWCSDSRNGVILTARPSSFTLAAKLVNLAERANDGVLRNEDRLISLS 390
Query: 131 VKKRIRLEGEELEEYQKKK 149
VKKR+ LEGEEL EY+++K
Sbjct: 391 VKKRVPLEGEELLEYKRRK 409
Score = 57.8 bits (138), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 37/118 (31%), Positives = 51/118 (43%), Gaps = 40/118 (33%)
Query: 159 KEKIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMEC 218
K+ IP H F+N+ +LSDFK L G EF+ G L+ G ++RR E
Sbjct: 773 KKLIPIHQAIFVNDPKLSDFKNLLVEKGYKAEFLSGTLLINGGKCSIRR--------GEM 824
Query: 219 GFSRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
GFS +EG LS +YY+++ L YDQ+AIL
Sbjct: 825 GFS--------------------------------MEGALSKDYYKLRNLFYDQFAIL 850
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/35 (57%), Positives = 25/35 (71%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 241
+VVL S+ DME GFSR+LF WCS N +I+T R
Sbjct: 324 KVVLCSSQDMESGFSRELFLDWCSDSRNGVILTAR 358
>gi|440797154|gb|ELR18249.1| cleavage and polyadenylation specificity factor subunit 2, putative
[Acanthamoeba castellanii str. Neff]
Length = 799
Score = 119 bits (299), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 62/144 (43%), Positives = 93/144 (64%), Gaps = 5/144 (3%)
Query: 23 KLCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVST 80
K ++ E AK +EWMS + KSF+ R NPF FK V+LCHS+ EL +P P VVL +T
Sbjct: 232 KQAYNTIEFAKSQLEWMSTAVQKSFDLDRVNPFEFKFVRLCHSVEELEALPKPLVVLATT 291
Query: 81 PDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGE 140
+E GF+RDLF +W S+P +++I T+R PGTL L+ L L++ +R+ LEG
Sbjct: 292 ASLEWGFARDLFVEWSSNPRHAVIFTDRPQPGTLGH-LVLTQQPPALGLELHRRVPLEGA 350
Query: 141 ELEEYQKKKDKEAKDK--QEKEKI 162
EL E+++K+ +E K +E++K+
Sbjct: 351 ELREWRQKQQEEKARKLLEEQQKV 374
Score = 62.0 bits (149), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 47/176 (26%), Positives = 69/176 (39%), Gaps = 59/176 (33%)
Query: 101 NSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKDKEAKDKQEKE 160
N +++ + + G R I + G EG ELE Q + E
Sbjct: 683 NGVVMLKKETEGLSGRSKISVNG--------------EGMELELEQAPE----------E 718
Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
IPPHD F+ E++LSDFK L ++G EF GVLIC
Sbjct: 719 IIPPHDPVFVGEVRLSDFKDILTQHGFRTEFAAGVLIC---------------------- 756
Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
N +++ + S + I + G L D+Y+ V+ LLY Q+ IL
Sbjct: 757 -------------NGVVMLKKETEGLSGRSKISVNGALCDDYFAVRDLLYSQFHIL 799
Score = 45.4 bits (106), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 17/34 (50%), Positives = 26/34 (76%)
Query: 208 VVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 241
VVL +T +E GF+RDLF +W S+P +++I T+R
Sbjct: 286 VVLATTASLEWGFARDLFVEWSSNPRHAVIFTDR 319
>gi|320163729|gb|EFW40628.1| cleavage and polyadenylation specificity factor [Capsaspora
owczarzaki ATCC 30864]
Length = 744
Score = 119 bits (298), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 62/155 (40%), Positives = 93/155 (60%), Gaps = 6/155 (3%)
Query: 1 MSDKLMKSFEGARNNPFH-FKHVKLCHSLAELAKIEWMSDKLMKSFEGARNNPFHFKHVK 59
++ L K +E +N P H+ +A+IEWMS K+ K FE R+NPF F H+K
Sbjct: 166 LAQVLDKQWEKYKNFPIVVLSHISRTVMNFAMAQIEWMSAKMQKEFETTRSNPFSFAHIK 225
Query: 60 LCHSLAELAKVP---SPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLAR 116
+C ++ ELA+V +P VVL S + GF+RDL +W +P+N II N + LA+
Sbjct: 226 MCQTMEELAQVAKEGTPVVVLASMEGLTSGFARDLMLKWAENPKNLIIFPNNSPASDLAK 285
Query: 117 DLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKDK 151
L+E NR + + VK RI LEGEEL+EY +++++
Sbjct: 286 SLVE--KNRQIVIDVKTRIALEGEELDEYLREQEE 318
Score = 44.7 bits (104), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 29/93 (31%), Positives = 42/93 (45%), Gaps = 16/93 (17%)
Query: 189 CEFMDGVLICCRGTVAVRRVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRNRADDSD 248
C+ M+ + + V VVL S + GF+RDL +W +P+N II N + A D
Sbjct: 227 CQTMEELAQVAKEGTPV--VVLASMEGLTSGFARDLMLKWAENPKNLIIFPNNSPASDLA 284
Query: 249 SNV--------------IVLEGCLSDEYYRVQQ 267
++ I LEG DEY R Q+
Sbjct: 285 KSLVEKNRQIVIDVKTRIALEGEELDEYLREQE 317
>gi|341883504|gb|EGT39439.1| CBN-CPSF-2 protein [Caenorhabditis brenneri]
Length = 822
Score = 115 bits (289), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 65/139 (46%), Positives = 84/139 (60%), Gaps = 13/139 (9%)
Query: 24 LCHSLAELAK--IEWMSDKLMK-SFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVST 80
+ S+ + AK +EWM + L K R NPF K+V LCHS EL +V SPKVVL S+
Sbjct: 271 VASSVVQFAKSQLEWMHESLFKYDSNSTRYNPFTLKNVTLCHSHQELLRVRSPKVVLCSS 330
Query: 81 PDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELG----------GNRTLTLQ 130
DME GFSR+LF WCS N +I+T R S TLA L+ L +R ++L
Sbjct: 331 QDMEAGFSRELFLDWCSDSRNGVILTARPSSFTLAAKLVNLAERANDGILRPEDRLISLL 390
Query: 131 VKKRIRLEGEELEEYQKKK 149
VKKR+ LEGEEL EY+++K
Sbjct: 391 VKKRVPLEGEELLEYKRRK 409
Score = 57.0 bits (136), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 32/122 (26%), Positives = 52/122 (42%), Gaps = 39/122 (31%)
Query: 155 DKQEKEKIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTP 214
D ++ +IP H F+N+ ++SDFK L G EF+ G LI G ++RR
Sbjct: 740 DPLQRSRIPVHQAIFVNDPKMSDFKNLLVERGYKAEFLSGTLIINGGKCSIRR------- 792
Query: 215 DMECGFSRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYA 274
S++ +EG + +YY+V++L YDQ+A
Sbjct: 793 --------------------------------SETGSFQMEGAFTKDYYKVRKLFYDQFA 820
Query: 275 IL 276
+L
Sbjct: 821 VL 822
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/35 (57%), Positives = 25/35 (71%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 241
+VVL S+ DME GFSR+LF WCS N +I+T R
Sbjct: 324 KVVLCSSQDMEAGFSRELFLDWCSDSRNGVILTAR 358
>gi|348689662|gb|EGZ29476.1| hypothetical protein PHYSODRAFT_552782 [Phytophthora sojae]
Length = 513
Score = 114 bits (286), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 52/134 (38%), Positives = 83/134 (61%), Gaps = 4/134 (2%)
Query: 29 AELAKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFS 88
A A +EW +D++ K+F+ R NPF F H+ L H+L EL +PSPKVVL ++P +ECGF+
Sbjct: 57 AAQAMLEWCNDRIAKNFDVGRQNPFQFSHIHLVHTLEELDALPSPKVVLATSPSLECGFA 116
Query: 89 RDLFFQWCSSPENSIIITNRTSPGTLARDLIELG----GNRTLTLQVKKRIRLEGEELEE 144
+D+F +W P NSII T+ T + A ++++ + ++ V K++ LEG EL
Sbjct: 117 KDIFIRWAPDPRNSIIFTSTTPETSFASRVLKIAKDPSAAKVISCTVTKKVFLEGAELAL 176
Query: 145 YQKKKDKEAKDKQE 158
Y+ K+ K + + E
Sbjct: 177 YEVKERKRLRTEAE 190
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 17/34 (50%), Positives = 26/34 (76%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITN 240
+VVL ++P +ECGF++D+F +W P NSII T+
Sbjct: 102 KVVLATSPSLECGFAKDIFIRWAPDPRNSIIFTS 135
>gi|268558798|ref|XP_002637390.1| Hypothetical protein CBG19097 [Caenorhabditis briggsae]
Length = 838
Score = 114 bits (284), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 63/139 (45%), Positives = 86/139 (61%), Gaps = 13/139 (9%)
Query: 24 LCHSLAELAK--IEWMSDKLMK-SFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVST 80
+ S+ + AK +EWM +KL + AR NPF K+V L HS EL K+ SPKVVL S+
Sbjct: 271 VASSVVQFAKSQLEWMDEKLFRYDSSSARYNPFTLKNVNLVHSHLELIKIRSPKVVLCSS 330
Query: 81 PDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELG----------GNRTLTLQ 130
DME GFSR+LF WC+ N +I+T R + TLA L+EL ++ L+L
Sbjct: 331 QDMETGFSRELFLDWCADQRNGVILTARPASFTLAARLVELAERANDGVLRNEDKHLSLL 390
Query: 131 VKKRIRLEGEELEEYQKKK 149
V+KR+ LEGEEL EY+++K
Sbjct: 391 VRKRVPLEGEELLEYKRRK 409
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 30/118 (25%), Positives = 49/118 (41%), Gaps = 41/118 (34%)
Query: 159 KEKIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMEC 218
K++IP H F+N+ +LS+FK L G EF G L+ G ++R E
Sbjct: 762 KKQIPVHQAIFVNDPKLSEFKNLLVDKGYKAEFFSGTLLINGGKCSIRG---------ET 812
Query: 219 GFSRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
GF +EG + ++Y++++L YDQ+A+L
Sbjct: 813 GFQ--------------------------------MEGAFTKDFYKLRKLFYDQFAVL 838
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 19/35 (54%), Positives = 25/35 (71%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 241
+VVL S+ DME GFSR+LF WC+ N +I+T R
Sbjct: 324 KVVLCSSQDMETGFSRELFLDWCADQRNGVILTAR 358
>gi|301092283|ref|XP_002997000.1| cleavage and polyadenylation specificity factor subunit, putative
[Phytophthora infestans T30-4]
gi|262112189|gb|EEY70241.1| cleavage and polyadenylation specificity factor subunit, putative
[Phytophthora infestans T30-4]
Length = 513
Score = 114 bits (284), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 51/134 (38%), Positives = 84/134 (62%), Gaps = 4/134 (2%)
Query: 29 AELAKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFS 88
A A +EW +D++ K+F+ R NPF F H+ L H+L EL +P+PKVVL ++P +ECGF+
Sbjct: 57 AAQAMLEWCNDRIAKNFDVGRQNPFQFTHIHLVHTLEELDALPNPKVVLATSPSLECGFA 116
Query: 89 RDLFFQWCSSPENSIIITNRTSPGTLARDLIEL----GGNRTLTLQVKKRIRLEGEELEE 144
+D+F +W P NSII ++ TS + A +++L + ++ V +++ LEG EL
Sbjct: 117 KDIFIRWAPDPRNSIIFSSTTSETSFASRVVKLSKDPSAEKNISCTVTQKVFLEGAELAL 176
Query: 145 YQKKKDKEAKDKQE 158
Y+ K+ K + + E
Sbjct: 177 YEVKERKRLRTEAE 190
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 30/48 (62%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRNRADDSDSNVIVL 254
+VVL ++P +ECGF++D+F +W P NSII ++ S V+ L
Sbjct: 102 KVVLATSPSLECGFAKDIFIRWAPDPRNSIIFSSTTSETSFASRVVKL 149
>gi|229553940|sp|A8XUS3.2|CPSF2_CAEBR RecName: Full=Probable cleavage and polyadenylation specificity
factor subunit 2; AltName: Full=Cleavage and
polyadenylation specificity factor 100 kDa subunit;
Short=CPSF 100 kDa subunit
Length = 842
Score = 114 bits (284), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 63/139 (45%), Positives = 86/139 (61%), Gaps = 13/139 (9%)
Query: 24 LCHSLAELAK--IEWMSDKLMK-SFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVST 80
+ S+ + AK +EWM +KL + AR NPF K+V L HS EL K+ SPKVVL S+
Sbjct: 271 VASSVVQFAKSQLEWMDEKLFRYDSSSARYNPFTLKNVNLVHSHLELIKIRSPKVVLCSS 330
Query: 81 PDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELG----------GNRTLTLQ 130
DME GFSR+LF WC+ N +I+T R + TLA L+EL ++ L+L
Sbjct: 331 QDMETGFSRELFLDWCADQRNGVILTARPASFTLAARLVELAERANDGVLRNEDKHLSLL 390
Query: 131 VKKRIRLEGEELEEYQKKK 149
V+KR+ LEGEEL EY+++K
Sbjct: 391 VRKRVPLEGEELLEYKRRK 409
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 19/35 (54%), Positives = 25/35 (71%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 241
+VVL S+ DME GFSR+LF WC+ N +I+T R
Sbjct: 324 KVVLCSSQDMETGFSRELFLDWCADQRNGVILTAR 358
Score = 37.7 bits (86), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 17/47 (36%), Positives = 26/47 (55%)
Query: 159 KEKIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAV 205
K++IP H F+N+ +LS+FK L G EF G L+ G ++
Sbjct: 764 KKQIPVHQAIFVNDPKLSEFKNLLVDKGYKAEFFSGTLLINGGKCSI 810
>gi|339247939|ref|XP_003375603.1| cleavage and polyadenylation specificity factor subunit 2
[Trichinella spiralis]
gi|316971010|gb|EFV54853.1| cleavage and polyadenylation specificity factor subunit 2
[Trichinella spiralis]
Length = 1188
Score = 113 bits (283), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 58/137 (42%), Positives = 84/137 (61%), Gaps = 20/137 (14%)
Query: 24 LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSP-------- 73
+ ++ E AK +EWMS+++++ FE R+NPF F+H +LCHSLAEL ++ SP
Sbjct: 272 VAFNVVEFAKSQVEWMSERMLRMFEEGRSNPFQFRHAQLCHSLAELTRLRSPKVLSFRDA 331
Query: 74 ----KVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELG------G 123
KVVL S PD++ GFSR+LF WC +N II+T+R G+L LIE+ G
Sbjct: 332 FFSDKVVLASQPDLDSGFSRELFLDWCIDAKNCIILTSRARIGSLCSKLIEMVSSPERIG 391
Query: 124 NRTLTLQVKKRIRLEGE 140
+ +T+QVK+R GE
Sbjct: 392 TKQITVQVKRRFDDYGE 408
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/37 (54%), Positives = 27/37 (72%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRNR 243
+VVL S PD++ GFSR+LF WC +N II+T+R R
Sbjct: 336 KVVLASQPDLDSGFSRELFLDWCIDAKNCIILTSRAR 372
Score = 40.8 bits (94), Expect = 0.70, Method: Compositional matrix adjust.
Identities = 24/78 (30%), Positives = 34/78 (43%), Gaps = 9/78 (11%)
Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLIC---------CRGTVAVRRVVLV 211
+IP HD F+ + +LSD KQ L +G EF GVL+ G + V +V
Sbjct: 613 EIPSHDAVFVGDPKLSDLKQALMLDGFQAEFSHGVLVVNNVLSIRKRADGQLHVEGIVCK 672
Query: 212 STPDMECGFSRDLFFQWC 229
+ F + FF C
Sbjct: 673 DYYAIRDQFHANYFFYIC 690
>gi|384484008|gb|EIE76188.1| hypothetical protein RO3G_00892 [Rhizopus delemar RA 99-880]
Length = 657
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 57/167 (34%), Positives = 90/167 (53%), Gaps = 40/167 (23%)
Query: 14 NNPFHFKHVKLCHSLAELAKI--EWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVP 71
N +H H AKI EWM ++L + F +R NP+ FK+V+LCH + +L P
Sbjct: 226 NTSYHTVH---------FAKIMLEWMGEELTRKFSQSRENPYEFKYVRLCHKIEDLDNYP 276
Query: 72 SPKVVLVSTPDMECGFSRDLFFQWCSS-PENSIIITNRTSPGTLARDLIE---------- 120
PK+V+ S +E GF+R+LF +W ++ P+N++I+T+R++PGTLAR L +
Sbjct: 277 GPKIVMASHHSLETGFARELFLRWMTNDPQNTLILTDRSAPGTLARRLYDDWEQQTNKTA 336
Query: 121 ------------------LGGNRTLTLQVKKRIRLEGEELEEYQKKK 149
+ T+ L+V KR+ LEG EL+EY+ +
Sbjct: 337 TTTTVVNNNRTKVLVKPAIAYENTIDLRVYKRVPLEGAELQEYEAAQ 383
Score = 39.7 bits (91), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 14/37 (37%), Positives = 29/37 (78%), Gaps = 1/37 (2%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSS-PENSIIITNRN 242
++V+ S +E GF+R+LF +W ++ P+N++I+T+R+
Sbjct: 279 KIVMASHHSLETGFARELFLRWMTNDPQNTLILTDRS 315
>gi|402591052|gb|EJW84982.1| cleavage and polyadenylation specificity factor subunit 2
[Wuchereria bancrofti]
Length = 809
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 68/144 (47%), Positives = 85/144 (59%), Gaps = 20/144 (13%)
Query: 19 FKHVKLCH---SLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSP 73
+ V L H S+ E AK +EWMSDK++KSFE R NPF F+HV+LCH+ +L +V SP
Sbjct: 263 YNLVMLSHVASSVVEFAKSQVEWMSDKVLKSFEVGRYNPFQFRHVQLCHTHIDLMRVRSP 322
Query: 74 KVVLVSTPDMECGFS--RDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQV 131
KVVLVS DME G S R L + E T GT+ NR LTL+V
Sbjct: 323 KVVLVSGLDMESGRSGDRTLGARLIRMAEQ----TAENPNGTI---------NRNLTLEV 369
Query: 132 KKRIRLEGEELEEYQKKKDKEAKD 155
K+RIRLEG ELE Y+ KK E ++
Sbjct: 370 KRRIRLEGVELENYRAKKRAEERE 393
Score = 46.6 bits (109), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 31/115 (26%), Positives = 46/115 (40%), Gaps = 40/115 (34%)
Query: 162 IPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGFS 221
IPPH F+N+ +LSD KQ L NG EF GVL
Sbjct: 735 IPPHQAVFVNDPKLSDMKQLLASNGFRAEFSSGVLYI----------------------- 771
Query: 222 RDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
N+I RN A +EG ++YY+++ ++Y Q+A++
Sbjct: 772 ------------NNIASIRRNEAGR-----FHVEGYACEDYYKIRDIVYAQFAVV 809
>gi|358338982|dbj|GAA43367.2| cleavage and polyadenylation specificity factor subunit 2, partial
[Clonorchis sinensis]
Length = 995
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 42/77 (54%), Positives = 59/77 (76%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+EWMS+K+M++FE R NPFHF+HV+LCH+L +L VP PKVVL S D+ CGF+R LF
Sbjct: 259 VEWMSEKVMRTFEDQRTNPFHFRHVQLCHTLEQLDTVPEPKVVLASASDLSCGFARQLFA 318
Query: 94 QWCSSPENSIIITNRTS 110
+W + N++I+T+R S
Sbjct: 319 EWADNDLNTVILTSRES 335
Score = 47.0 bits (110), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 33/111 (29%), Positives = 48/111 (43%), Gaps = 40/111 (36%)
Query: 162 IPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGFS 221
+ H T F+NE +LSD KQ L NG+ EF+ GVL+ VA++R
Sbjct: 806 VGAHKTVFVNEPKLSDLKQLLLANGLVAEFVSGVLV-VDNCVAIKR-------------- 850
Query: 222 RDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQ 272
S++ ++LEG LS Y+ V+Q+LY Q
Sbjct: 851 -------------------------SEAGKLLLEGLLSRTYFTVRQVLYQQ 876
Score = 43.9 bits (102), Expect = 0.068, Method: Compositional matrix adjust.
Identities = 18/40 (45%), Positives = 27/40 (67%)
Query: 202 TVAVRRVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 241
TV +VVL S D+ CGF+R LF +W + N++I+T+R
Sbjct: 294 TVPEPKVVLASASDLSCGFARQLFAEWADNDLNTVILTSR 333
>gi|302819854|ref|XP_002991596.1| hypothetical protein SELMODRAFT_429848 [Selaginella moellendorffii]
gi|300140629|gb|EFJ07350.1| hypothetical protein SELMODRAFT_429848 [Selaginella moellendorffii]
Length = 715
Score = 104 bits (259), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 52/142 (36%), Positives = 87/142 (61%), Gaps = 5/142 (3%)
Query: 32 AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVP-SPKVVLVSTPDMECGFSRD 90
+ +EWM+D + K+FE R NPF + VKLC S +L ++P P+VVL S +E GF+++
Sbjct: 278 SSLEWMNDAIGKAFEQNRENPFALRSVKLCTSRKQLEELPPGPRVVLASMASLETGFAKE 337
Query: 91 LFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKD 150
LF +W P+N ++ T R GTLAR L + + + + K++ L GEELE Y++++
Sbjct: 338 LFLEWAVDPKNLVLFTERAQVGTLARQLQVEPPPKIVKITISKKVLLVGEELEAYEREQS 397
Query: 151 K---EAKD-KQEKEKIPPHDTS 168
+ EA++ ++E + P +S
Sbjct: 398 RLREEARNAASQQEPVQPASSS 419
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 37/112 (33%), Positives = 50/112 (44%), Gaps = 40/112 (35%)
Query: 165 HDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGFSRDL 224
H T F+ +L+L+DFKQ L GI EF GVL C +AVR+
Sbjct: 644 HKTVFVGDLRLADFKQLLATKGIQAEFAGGVLRCA-DNIAVRK----------------- 685
Query: 225 FFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
S +V+EG LSD+YY+V++LLY QY I+
Sbjct: 686 ----------------------SGGQQLVIEGSLSDDYYKVRELLYSQYHIV 715
Score = 40.8 bits (94), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 15/35 (42%), Positives = 23/35 (65%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 241
RVVL S +E GF+++LF +W P+N ++ T R
Sbjct: 321 RVVLASMASLETGFAKELFLEWAVDPKNLVLFTER 355
>gi|302776792|ref|XP_002971541.1| hypothetical protein SELMODRAFT_441578 [Selaginella moellendorffii]
gi|300160673|gb|EFJ27290.1| hypothetical protein SELMODRAFT_441578 [Selaginella moellendorffii]
Length = 721
Score = 104 bits (259), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 52/142 (36%), Positives = 87/142 (61%), Gaps = 5/142 (3%)
Query: 32 AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVP-SPKVVLVSTPDMECGFSRD 90
+ +EWM+D + K+FE R NPF + VKLC S +L ++P P+VVL S +E GF+++
Sbjct: 278 SSLEWMNDAIGKAFEQNRENPFALRSVKLCTSRKQLDELPPGPRVVLASMASLETGFAKE 337
Query: 91 LFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKD 150
LF +W P+N ++ T R GTLAR L + + + + K++ L GEELE Y++++
Sbjct: 338 LFLEWAVDPKNLVLFTERAQVGTLARQLQVEPPPKIVKITISKKVLLVGEELEAYEREQS 397
Query: 151 K---EAKD-KQEKEKIPPHDTS 168
+ EA++ ++E + P +S
Sbjct: 398 RLREEARNAASQQEPVQPASSS 419
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 37/112 (33%), Positives = 50/112 (44%), Gaps = 40/112 (35%)
Query: 165 HDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGFSRDL 224
H T F+ +L+L+DFKQ L GI EF GVL C +AVR+
Sbjct: 650 HKTVFVGDLRLADFKQLLATKGIQAEFAGGVLRCA-DNIAVRK----------------- 691
Query: 225 FFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
S +V+EG LSD+YY+V++LLY QY I+
Sbjct: 692 ----------------------SGGQQLVIEGSLSDDYYKVRELLYSQYHIV 721
Score = 40.8 bits (94), Expect = 0.58, Method: Compositional matrix adjust.
Identities = 15/35 (42%), Positives = 23/35 (65%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 241
RVVL S +E GF+++LF +W P+N ++ T R
Sbjct: 321 RVVLASMASLETGFAKELFLEWAVDPKNLVLFTER 355
>gi|328768987|gb|EGF79032.1| hypothetical protein BATDEDRAFT_12823 [Batrachochytrium
dendrobatidis JAM81]
Length = 719
Score = 103 bits (258), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 52/123 (42%), Positives = 74/123 (60%), Gaps = 3/123 (2%)
Query: 32 AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
+ +EWM D + ++F AR PF FK +K+ HS+ EL + PKVVL S P M GFS+DL
Sbjct: 282 STLEWMGDGIAQAFT-ARELPFEFKCLKMIHSIDELDNLMGPKVVLASFPGMMTGFSQDL 340
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLIE--LGGNRTLTLQVKKRIRLEGEELEEYQKKK 149
+W S P N II+ +R PGTL R + + + + +KK++ L G+EL+EY KK
Sbjct: 341 LIEWGSDPRNMIILPDRAQPGTLGRMMFDDWFESAKMADMNLKKQVPLVGDELDEYMSKK 400
Query: 150 DKE 152
E
Sbjct: 401 QAE 403
Score = 43.5 bits (101), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 24/37 (64%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRNR 243
+VVL S P M GFS+DL +W S P N II+ +R +
Sbjct: 323 KVVLASFPGMMTGFSQDLLIEWGSDPRNMIILPDRAQ 359
>gi|325187176|emb|CCA21717.1| cleavage and polyadenylation specificity factor subunit putative
[Albugo laibachii Nc14]
gi|325187319|emb|CCA21858.1| cleavage and polyadenylation specificity factor subunit putative
[Albugo laibachii Nc14]
Length = 731
Score = 103 bits (258), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 47/134 (35%), Positives = 82/134 (61%), Gaps = 3/134 (2%)
Query: 29 AELAKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFS 88
A A +EW ++++ ++F+ R NPF F H+ L HS+ EL K+ SPKVVL ++ +ECG++
Sbjct: 276 AAEAMLEWCNEQIARNFDAGRQNPFQFSHIHLIHSIEELEKLSSPKVVLATSATLECGYA 335
Query: 89 RDLFFQWCSSPENSIIITNRTSPGTLARDLIEL---GGNRTLTLQVKKRIRLEGEELEEY 145
++LF ++ + NSII T P + ++++ +R +T V KR+ LEG EL Y
Sbjct: 336 KELFIKYAADTRNSIIFTTTPPPRSFGARILDMNKKNDSRVVTCSVAKRVLLEGTELALY 395
Query: 146 QKKKDKEAKDKQEK 159
+ K+ + + + E+
Sbjct: 396 EAKERRRLRLEAEQ 409
>gi|330803886|ref|XP_003289932.1| hypothetical protein DICPUDRAFT_80682 [Dictyostelium purpureum]
gi|325079974|gb|EGC33550.1| hypothetical protein DICPUDRAFT_80682 [Dictyostelium purpureum]
Length = 752
Score = 102 bits (255), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 48/123 (39%), Positives = 78/123 (63%), Gaps = 6/123 (4%)
Query: 32 AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
+++E+MS FE +NPF F+ +K+ +L E+ + +PKV+L S+ D+E G+SRDL
Sbjct: 284 SQLEFMSSSASVRFEQKIDNPFTFRQIKIFSTLEEIPETNTPKVILTSSQDLETGYSRDL 343
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLIEL------GGNRTLTLQVKKRIRLEGEELEEY 145
F +W S P+N I+ TN G+LA +I + G N+T+ +Q R+ L+GEEL EY
Sbjct: 344 FIKWSSDPKNLILFTNYIPEGSLASKVINIASNKSSGSNKTIEIQQGSRVPLQGEELLEY 403
Query: 146 QKK 148
+++
Sbjct: 404 EQR 406
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 37/134 (27%), Positives = 59/134 (44%), Gaps = 38/134 (28%)
Query: 144 EYQKKKDKEAKDKQEKEKIPPHDTSFINELQLSDFKQTLQRNGID-CEFMDGVLICCRGT 202
EY + + D E+ HD SFI +++L+D KQ L + GI +F G++ C
Sbjct: 656 EYNPENNSSYLDIIPSEQNNGHDESFIGDIKLADLKQVLVKAGIKKVQFDQGIINC---- 711
Query: 203 VAVRRVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEY 262
DL + W R D +++I ++G +SDEY
Sbjct: 712 -------------------NDLVYIW--------------REDVGGNSIINVDGIISDEY 738
Query: 263 YRVQQLLYDQYAIL 276
Y V++LLY Q+ I+
Sbjct: 739 YLVKELLYRQFQIV 752
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 18/34 (52%), Positives = 26/34 (76%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITN 240
+V+L S+ D+E G+SRDLF +W S P+N I+ TN
Sbjct: 326 KVILTSSQDLETGYSRDLFIKWSSDPKNLILFTN 359
>gi|172087214|ref|XP_001913149.1| cleavage and polyadenylation factor [Oikopleura dioica]
gi|18029276|gb|AAL56454.1| cleavage and polyadenylation factor-like protein [Oikopleura
dioica]
Length = 765
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 50/121 (41%), Positives = 78/121 (64%), Gaps = 2/121 (1%)
Query: 24 LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTP 81
+ S E AK IEWMS+K++ F R N F +KL S+ E+A++P PKV+L +
Sbjct: 271 VATSTIEAAKGMIEWMSEKIISKFTHKRENIFDLTKMKLRSSIQEIARIPEPKVILATPM 330
Query: 82 DMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEE 141
DM+ GFSR+LF + P+N++I++ R++ G+L R +IE G ++TL++ KR+ L G E
Sbjct: 331 DMDTGFSRELFVMMAAHPKNAVIMSGRSTKGSLCRKIIENEGMSSITLEMNKRLPLVGPE 390
Query: 142 L 142
L
Sbjct: 391 L 391
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 39/171 (22%), Positives = 71/171 (41%), Gaps = 51/171 (29%)
Query: 117 DLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKDKEAKDKQEKEKIPP-----------H 165
+ + +G +++ R+ G LE + + + K E + IP H
Sbjct: 635 NFVRVGSKDIEVARIRGRVDYFGGRLELEAENGENDEPKKLEIDDIPTLQPVTNNYSSGH 694
Query: 166 DTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGFSRDLF 225
D+ FIN+ +L++ K L G+ EF+ G L+ C V+++R
Sbjct: 695 DSIFINDTKLTELKSNLIDCGMQAEFIGGNLV-CNNKVSIKR------------------ 735
Query: 226 FQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
S + VI +EG LS++Y+ V++++YD YAI+
Sbjct: 736 ---------------------SANGVIQVEGTLSEDYFIVRKMVYDNYAIV 765
Score = 39.7 bits (91), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 18/53 (33%), Positives = 31/53 (58%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLS 259
+V+L + DM+ GFSR+LF + P+N++I++ R+ +I EG S
Sbjct: 323 KVILATPMDMDTGFSRELFVMMAAHPKNAVIMSGRSTKGSLCRKIIENEGMSS 375
>gi|313232558|emb|CBY19228.1| unnamed protein product [Oikopleura dioica]
Length = 764
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 50/121 (41%), Positives = 78/121 (64%), Gaps = 2/121 (1%)
Query: 24 LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTP 81
+ S E AK IEWMS+K++ F R N F +KL S+ E+A++P PKV+L +
Sbjct: 270 VATSTIEAAKGMIEWMSEKIISKFTHKRENIFDLTKMKLRSSIQEIARIPEPKVILATPM 329
Query: 82 DMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEE 141
DM+ GFSR+LF + P+N++I++ R++ G+L R +IE G ++TL++ KR+ L G E
Sbjct: 330 DMDTGFSRELFVMMAAHPKNAVIMSGRSTKGSLCRKIIENEGMSSITLEMNKRLPLVGPE 389
Query: 142 L 142
L
Sbjct: 390 L 390
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 39/171 (22%), Positives = 71/171 (41%), Gaps = 51/171 (29%)
Query: 117 DLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKDKEAKDKQEKEKIPP-----------H 165
+ + +G +++ R+ G LE + + + K E + IP H
Sbjct: 634 NFVRVGSKDIEVARIRGRVDYFGGRLELEAENGENDEPKKLEIDDIPTLQPVTNNYSSGH 693
Query: 166 DTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGFSRDLF 225
D+ FIN+ +L++ K L G+ EF+ G L+ C V+++R
Sbjct: 694 DSIFINDTKLTELKSNLIDCGMHAEFIGGNLV-CNNKVSIKR------------------ 734
Query: 226 FQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
S + VI +EG LS++Y+ V++++YD YAI+
Sbjct: 735 ---------------------SANGVIQVEGTLSEDYFIVRKMVYDNYAIV 764
Score = 39.7 bits (91), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 18/53 (33%), Positives = 31/53 (58%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLS 259
+V+L + DM+ GFSR+LF + P+N++I++ R+ +I EG S
Sbjct: 322 KVILATPMDMDTGFSRELFVMMAAHPKNAVIMSGRSTKGSLCRKIIENEGMSS 374
>gi|256077072|ref|XP_002574832.1| cleavage and polyadenylation specificity factor [Schistosoma
mansoni]
Length = 1063
Score = 100 bits (249), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 42/77 (54%), Positives = 59/77 (76%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+EWMS+K+M+SFE R+NPFHF+H++LCH+L +L V PKVVL S D+ CGFSR LF
Sbjct: 85 VEWMSEKVMRSFEDQRSNPFHFRHMQLCHTLEQLDAVSEPKVVLSSLSDLSCGFSRQLFA 144
Query: 94 QWCSSPENSIIITNRTS 110
+W + N+II+T++ S
Sbjct: 145 EWADNDLNTIILTSQES 161
Score = 41.2 bits (95), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 18/39 (46%), Positives = 27/39 (69%)
Query: 203 VAVRRVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 241
V+ +VVL S D+ CGFSR LF +W + N+II+T++
Sbjct: 121 VSEPKVVLSSLSDLSCGFSRQLFAEWADNDLNTIILTSQ 159
Score = 39.3 bits (90), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 28/46 (60%), Gaps = 1/46 (2%)
Query: 162 IPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRR 207
I H T F+NE +LSD KQ L G+ EF+ G+L+ VA++R
Sbjct: 655 IGQHKTVFVNEPKLSDLKQLLLSQGLMAEFVSGILV-VDNCVAIKR 699
>gi|225464483|ref|XP_002268591.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
2 [Vitis vinifera]
gi|302143847|emb|CBI22708.3| unnamed protein product [Vitis vinifera]
Length = 740
Score = 100 bits (249), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 52/134 (38%), Positives = 79/134 (58%), Gaps = 1/134 (0%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVP-SPKVVLVSTPDMECGFSRDLF 92
+EWMSD + KSFE R+N F KHV L S +EL KVP PK+VL S +E GFS D+F
Sbjct: 280 LEWMSDSIAKSFEHTRDNAFLLKHVTLLISKSELEKVPDGPKIVLASMASLEAGFSHDIF 339
Query: 93 FQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKDKE 152
+W + +N ++ + R TLAR L + + + + KR+ L GEEL Y++++++
Sbjct: 340 VEWATDAKNLVLFSERGQFATLARMLQADPPPKAVKVTMSKRVPLVGEELAAYEEEQERI 399
Query: 153 AKDKQEKEKIPPHD 166
K++ K + D
Sbjct: 400 KKEEALKASLSKED 413
Score = 54.7 bits (130), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 36/112 (32%), Positives = 50/112 (44%), Gaps = 30/112 (26%)
Query: 165 HDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGFSRDL 224
HDT F+ +++++DFKQ L GI EF G L C V +R+V
Sbjct: 659 HDTVFVGDIKMADFKQFLASKGIQVEFSGGALRCGE-YVTLRKV---------------- 701
Query: 225 FFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
S II IV+EG L DEYY++++ LY QY +L
Sbjct: 702 --GDASQKGGGAIIQQ-----------IVMEGPLCDEYYKIREYLYSQYYLL 740
>gi|350646480|emb|CCD58879.1| cleavage and polyadenylation specificity factor,putative
[Schistosoma mansoni]
Length = 729
Score = 100 bits (248), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 42/77 (54%), Positives = 59/77 (76%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+EWMS+K+M+SFE R+NPFHF+H++LCH+L +L V PKVVL S D+ CGFSR LF
Sbjct: 85 VEWMSEKVMRSFEDQRSNPFHFRHMQLCHTLEQLDAVSEPKVVLSSLSDLSCGFSRQLFA 144
Query: 94 QWCSSPENSIIITNRTS 110
+W + N+II+T++ S
Sbjct: 145 EWADNDLNTIILTSQES 161
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 33/115 (28%), Positives = 50/115 (43%), Gaps = 40/115 (34%)
Query: 162 IPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGFS 221
I H T F+NE +LSD KQ L G+ EF+ G+L+ VA++R
Sbjct: 655 IGQHKTVFVNEPKLSDLKQLLLSQGLMAEFVSGILV-VDNCVAIKR-------------- 699
Query: 222 RDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
S++ ++LEG L Y+ V+++LY Q+AIL
Sbjct: 700 -------------------------SEAGKLLLEGLLCGTYFEVRRILYQQFAIL 729
Score = 41.2 bits (95), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 21/55 (38%), Positives = 33/55 (60%), Gaps = 4/55 (7%)
Query: 203 VAVRRVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRNR----ADDSDSNVIV 253
V+ +VVL S D+ CGFSR LF +W + N+II+T++ D+S S ++
Sbjct: 121 VSEPKVVLSSLSDLSCGFSRQLFAEWADNDLNTIILTSQESVITGVDESSSQSLI 175
>gi|256077070|ref|XP_002574831.1| cleavage and polyadenylation specificity factor [Schistosoma
mansoni]
Length = 928
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 42/77 (54%), Positives = 59/77 (76%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+EWMS+K+M+SFE R+NPFHF+H++LCH+L +L V PKVVL S D+ CGFSR LF
Sbjct: 284 VEWMSEKVMRSFEDQRSNPFHFRHMQLCHTLEQLDAVSEPKVVLSSLSDLSCGFSRQLFA 343
Query: 94 QWCSSPENSIIITNRTS 110
+W + N+II+T++ S
Sbjct: 344 EWADNDLNTIILTSQES 360
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 33/115 (28%), Positives = 50/115 (43%), Gaps = 40/115 (34%)
Query: 162 IPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGFS 221
I H T F+NE +LSD KQ L G+ EF+ G+L+ VA++R
Sbjct: 854 IGQHKTVFVNEPKLSDLKQLLLSQGLMAEFVSGILV-VDNCVAIKR-------------- 898
Query: 222 RDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
S++ ++LEG L Y+ V+++LY Q+AIL
Sbjct: 899 -------------------------SEAGKLLLEGLLCGTYFEVRRILYQQFAIL 928
Score = 41.6 bits (96), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 21/56 (37%), Positives = 33/56 (58%), Gaps = 4/56 (7%)
Query: 202 TVAVRRVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRNR----ADDSDSNVIV 253
V+ +VVL S D+ CGFSR LF +W + N+II+T++ D+S S ++
Sbjct: 319 AVSEPKVVLSSLSDLSCGFSRQLFAEWADNDLNTIILTSQESVITGVDESSSQSLI 374
>gi|452822529|gb|EME29547.1| cleavage and polyadenylation specificity factor subunit 2
[Galdieria sulphuraria]
Length = 747
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 55/153 (35%), Positives = 87/153 (56%), Gaps = 14/153 (9%)
Query: 2 SDKLMKSFEGARNNPFHFKHVKLCHSLAELAKIEWMSDKLMKSFEGARNNPFHFKHVKLC 61
++KL S+ A F + S+ +EWMSD ++ F+ R NPFH KH+ LC
Sbjct: 273 TEKLGSSYAVAIIEHVSFNTIDFAKSM-----MEWMSDAVINKFDTTRENPFHLKHIHLC 327
Query: 62 HSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIEL 121
HS +EL+ + SPKV+L S +ECGFSR+L + S+ +N +I+ +R P TLA + +
Sbjct: 328 HSRSELSSLLSPKVILTSVASLECGFSRELVVEMVSNKKNKLILVDRLEPNTLAHSIYNV 387
Query: 122 GGNRT---------LTLQVKKRIRLEGEELEEY 145
+ + + L++ +R+ L+G ELEEY
Sbjct: 388 LEDESEGKTVQLPRIALRLNRRVPLQGAELEEY 420
>gi|168010331|ref|XP_001757858.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162691134|gb|EDQ77498.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 724
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 50/126 (39%), Positives = 75/126 (59%), Gaps = 3/126 (2%)
Query: 24 LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPS-PKVVLVST 80
+ +S E AK +EWMSD + +SF +R N F K++KLCH E ++PS PKVV S
Sbjct: 268 VSYSTVEFAKSLLEWMSDSIARSFGSSRENSFLLKYLKLCHDRKEFDELPSGPKVVFASM 327
Query: 81 PDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGE 140
+E GF+RDLF +W + N ++ T R GTLA+ L + + + + ++I L GE
Sbjct: 328 ASLEGGFARDLFVEWATDSRNLVLFTERGQMGTLAKKLQAEPPPKIVKVTMSQKIPLTGE 387
Query: 141 ELEEYQ 146
EL+ Y+
Sbjct: 388 ELQAYE 393
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 38/117 (32%), Positives = 51/117 (43%), Gaps = 39/117 (33%)
Query: 160 EKIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECG 219
E PPH + F+ +L+L+DFKQ L GI EF GVL C AVRR
Sbjct: 647 ETPPPHKSVFVGDLRLADFKQLLATKGIQAEFAGGVLRCGDA-FAVRR------------ 693
Query: 220 FSRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
S +V+EG LS+EYY+++ LLY Q+ +L
Sbjct: 694 --------------------------SGGSQQLVIEGPLSEEYYKLRDLLYSQFYML 724
>gi|255070137|ref|XP_002507150.1| predicted protein [Micromonas sp. RCC299]
gi|255070139|ref|XP_002507151.1| predicted protein [Micromonas sp. RCC299]
gi|226522425|gb|ACO68408.1| predicted protein [Micromonas sp. RCC299]
gi|226522426|gb|ACO68409.1| predicted protein [Micromonas sp. RCC299]
Length = 808
Score = 98.2 bits (243), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 49/136 (36%), Positives = 82/136 (60%), Gaps = 8/136 (5%)
Query: 22 VKLCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVP-SPKVVLV 78
K+ ++ E A+ +EWM + + + F+ R+N F+ +H+KLCHS+ E +P PKVVL
Sbjct: 271 TKVAYNTLEFARSHLEWMGEHVGQYFDRERHNAFNTRHLKLCHSIDEFRALPQGPKVVLA 330
Query: 79 STPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIEL-----GGNRTLTLQVKK 133
S ++ G SR +F +W P N I+ T+R PG+L+R++ L G L + + +
Sbjct: 331 SFGSLDAGASRHIFVEWAPDPRNLIVFTDRLQPGSLSREVCRLSQLPPGARLPLRISLSQ 390
Query: 134 RIRLEGEELEEYQKKK 149
R++L G+EL E+Q K+
Sbjct: 391 RLKLVGDELLEWQGKE 406
>gi|449446027|ref|XP_004140773.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
2-like [Cucumis sativus]
Length = 738
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 66/207 (31%), Positives = 100/207 (48%), Gaps = 27/207 (13%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVP-SPKVVLVSTPDMECGFSRDLF 92
+EWMSD + KSFE RNN F KHV L + +EL P PKVVL S +E G+S D+F
Sbjct: 280 LEWMSDTIAKSFEHTRNNAFLLKHVTLLINKSELDNAPDGPKVVLASMASLEAGYSHDIF 339
Query: 93 FQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKDKE 152
W +N ++ + R GTLAR L + + + V KR+ L G+EL Y+++++++
Sbjct: 340 VDWAMDAKNLVLFSERGQFGTLARMLQADPPPKAVKVTVSKRVPLTGDELIAYEEEQNRK 399
Query: 153 AKDKQEKEKIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVS 212
K+E K S + E +Q+ +G D + D ++I VA
Sbjct: 400 ---KEEALK-----ASLLKE------EQSKASHGADNDTGDPMIIDASSNVA-------- 437
Query: 213 TPDM---ECGFSRDLFFQWCSSPENSI 236
PD+ G RD+ P +
Sbjct: 438 -PDVGSSHGGAYRDILIDGFVPPSTGV 463
>gi|449528453|ref|XP_004171219.1| PREDICTED: LOW QUALITY PROTEIN: cleavage and polyadenylation
specificity factor subunit 2-like, partial [Cucumis
sativus]
Length = 501
Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 66/207 (31%), Positives = 99/207 (47%), Gaps = 27/207 (13%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVP-SPKVVLVSTPDMECGFSRDLF 92
+EWMSD + KSFE RNN F KHV L + +EL P PKVVL S +E G+S D F
Sbjct: 280 LEWMSDTIAKSFEHTRNNAFLLKHVTLLINKSELDNAPDGPKVVLASMASLEAGYSHDXF 339
Query: 93 FQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKDKE 152
W +N ++ + R GTLAR L + + + V KR+ L G+EL Y+++++++
Sbjct: 340 VDWAMDAKNLVLFSERGQFGTLARMLQADPPPKAVKVTVSKRVPLTGDELIAYEEEQNRK 399
Query: 153 AKDKQEKEKIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVS 212
K+E K S + E +Q+ +G D + D ++I VA
Sbjct: 400 ---KEEALK-----ASLLKE------EQSKASHGADNDTGDPMIIDASSNVA-------- 437
Query: 213 TPDM---ECGFSRDLFFQWCSSPENSI 236
PD+ G RD+ P +
Sbjct: 438 -PDVGSSHGGAYRDILIDGFVPPSTGV 463
>gi|449018596|dbj|BAM81998.1| cleavage and polyadenylation specific factor 2, 100kD subunit
[Cyanidioschyzon merolae strain 10D]
Length = 884
Score = 94.4 bits (233), Expect = 4e-17, Method: Composition-based stats.
Identities = 53/137 (38%), Positives = 85/137 (62%), Gaps = 7/137 (5%)
Query: 20 KHVKLCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVL 77
+HV + H++ E AK IEWMSD ++ +F+ R NPF +HV++ H L + +PSPKVV+
Sbjct: 270 QHVGI-HTI-EFAKSLIEWMSDAVVSAFDSRRENPFRLRHVQVVHGLDQADALPSPKVVM 327
Query: 78 VSTPDMECGFSRDLFFQWCSS-PENSIIITNRTSPGTLA-RDLIELGGNRTLT-LQVKKR 134
P ++ GFSR LF Q ++ P +++++R GT A R +E R L +R
Sbjct: 328 APLPSLDYGFSRVLFLQRIAADPRAMVLMSDRLESGTFAFRLAVEKEKLRVREPLTYAER 387
Query: 135 IRLEGEELEEYQKKKDK 151
+ L+GEELE +Q++++K
Sbjct: 388 VPLQGEELERWQREQEK 404
>gi|15237845|ref|NP_197776.1| cleavage and polyadenylation specificity factor subunit 2
[Arabidopsis thaliana]
gi|18203240|sp|Q9LKF9.2|CPSF2_ARATH RecName: Full=Cleavage and polyadenylation specificity factor
subunit 2; AltName: Full=Cleavage and polyadenylation
specificity factor 100 kDa subunit; Short=AtCPSF100;
Short=CPSF 100 kDa subunit; AltName: Full=Protein EMBRYO
DEFECTIVE 1265; AltName: Full=Protein ENHANCED SILENCING
PHENOTYPE 5
gi|10176855|dbj|BAB10061.1| cleavage and polyadenylation specificity factor [Arabidopsis
thaliana]
gi|14334618|gb|AAK59487.1| putative cleavage and polyadenylation specificity factor
[Arabidopsis thaliana]
gi|28393921|gb|AAO42368.1| putative cleavage and polyadenylation specificity factor
[Arabidopsis thaliana]
gi|332005845|gb|AED93228.1| cleavage and polyadenylation specificity factor subunit 2
[Arabidopsis thaliana]
Length = 739
Score = 94.4 bits (233), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 46/119 (38%), Positives = 72/119 (60%), Gaps = 1/119 (0%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVP-SPKVVLVSTPDMECGFSRDLF 92
+EWMSD + KSFE +R+N F +HV L + +L P PKVVL S +E GF+R++F
Sbjct: 281 LEWMSDSISKSFETSRDNAFLLRHVTLLINKTDLDNAPPGPKVVLASMASLEAGFAREIF 340
Query: 93 FQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKDK 151
+W + P N ++ T GTLAR L + + + + KR+ L GEEL Y++++++
Sbjct: 341 VEWANDPRNLVLFTETGQFGTLARMLQSAPPPKFVKVTMSKRVPLAGEELIAYEEEQNR 399
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 30/113 (26%), Positives = 50/113 (44%), Gaps = 29/113 (25%)
Query: 164 PHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGFSRD 223
PH + +L+++DFKQ L G+ EF G + C V +R+V P + G
Sbjct: 656 PHKPVLVGDLKIADFKQFLSSKGVQVEFAGGGALRCGEYVTLRKV----GPTGQKG---- 707
Query: 224 LFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
S P+ I++EG L ++YY+++ LY Q+ +L
Sbjct: 708 ----GASGPQQ-----------------ILIEGPLCEDYYKIRDYLYSQFYLL 739
Score = 38.5 bits (88), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 13/37 (35%), Positives = 23/37 (62%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRNR 243
+VVL S +E GF+R++F +W + P N ++ T +
Sbjct: 322 KVVLASMASLEAGFAREIFVEWANDPRNLVLFTETGQ 358
>gi|297808393|ref|XP_002872080.1| CPSF100 [Arabidopsis lyrata subsp. lyrata]
gi|297317917|gb|EFH48339.1| CPSF100 [Arabidopsis lyrata subsp. lyrata]
Length = 739
Score = 94.4 bits (233), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 46/119 (38%), Positives = 72/119 (60%), Gaps = 1/119 (0%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVP-SPKVVLVSTPDMECGFSRDLF 92
+EWMSD + KSFE +R+N F +HV L + +L P PKVVL S +E GF+R++F
Sbjct: 281 LEWMSDSISKSFETSRDNAFLLRHVTLLINKTDLDNAPPGPKVVLASMASLEAGFAREIF 340
Query: 93 FQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKDK 151
+W + P N ++ T GTLAR L + + + + KR+ L GEEL Y++++++
Sbjct: 341 VEWANDPRNLVLFTETGQFGTLARMLQSAPPPKFVKVTMSKRVPLAGEELIAYEEEQNR 399
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 30/113 (26%), Positives = 50/113 (44%), Gaps = 29/113 (25%)
Query: 164 PHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGFSRD 223
PH + +L+++DFKQ L G+ EF G + C V +R+V P + G
Sbjct: 656 PHKPVLVGDLKIADFKQFLSSKGVQVEFAGGGALRCGEYVTLRKV----GPTGQKG---- 707
Query: 224 LFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
S P+ I++EG L ++YY+++ LY Q+ +L
Sbjct: 708 ----GASGPQQ-----------------ILIEGPLCEDYYKIRDYLYSQFYLL 739
Score = 38.5 bits (88), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 13/37 (35%), Positives = 23/37 (62%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRNR 243
+VVL S +E GF+R++F +W + P N ++ T +
Sbjct: 322 KVVLASMASLEAGFAREIFVEWANDPRNLVLFTETGQ 358
>gi|9082326|gb|AAF82809.1|AF283277_1 polyadenylation cleavage/specificity factor 100 kDa subunit
[Arabidopsis thaliana]
Length = 739
Score = 94.0 bits (232), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 46/119 (38%), Positives = 72/119 (60%), Gaps = 1/119 (0%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVP-SPKVVLVSTPDMECGFSRDLF 92
+EWMSD + KSFE +R+N F +HV L + +L P PKVVL S +E GF+R++F
Sbjct: 281 LEWMSDSISKSFETSRDNAFLLRHVTLLINKTDLDNAPPGPKVVLASMASLEAGFAREIF 340
Query: 93 FQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKDK 151
+W + P N ++ T GTLAR L + + + + KR+ L GEEL Y++++++
Sbjct: 341 VEWANDPRNLVLFTETGQFGTLARMLQSAPPPKFVKVTMSKRVPLAGEELIAYEEEQNR 399
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 30/113 (26%), Positives = 50/113 (44%), Gaps = 29/113 (25%)
Query: 164 PHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGFSRD 223
PH + +L+++DFKQ L G+ EF G + C V +R+V P + G
Sbjct: 656 PHKPVLVGDLKIADFKQFLSSKGVQVEFAGGGALRCGEYVTLRKV----GPTGQKG---- 707
Query: 224 LFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
S P+ I++EG L ++YY+++ LY Q+ +L
Sbjct: 708 ----GASGPQQ-----------------ILIEGPLCEDYYKIRDYLYSQFYLL 739
Score = 38.5 bits (88), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 13/37 (35%), Positives = 23/37 (62%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRNR 243
+VVL S +E GF+R++F +W + P N ++ T +
Sbjct: 322 KVVLASMASLEAGFAREIFVEWANDPRNLVLFTETGQ 358
>gi|255553723|ref|XP_002517902.1| cleavage and polyadenylation specificity factor, putative [Ricinus
communis]
gi|223542884|gb|EEF44420.1| cleavage and polyadenylation specificity factor, putative [Ricinus
communis]
Length = 740
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 48/125 (38%), Positives = 76/125 (60%), Gaps = 1/125 (0%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPS-PKVVLVSTPDMECGFSRDLF 92
+EWMSD + KSFE +R+N F KHV L + EL P+ PKVVL S +E GFS D+F
Sbjct: 281 LEWMSDSIAKSFETSRDNAFLLKHVTLLINKNELDNAPNVPKVVLASMASLEAGFSHDIF 340
Query: 93 FQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKDKE 152
+W + +N ++ T R GTLAR L + + + + +R+ L G+EL Y++++ +
Sbjct: 341 VEWAADVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRRVPLVGDELIAYEEEQKRL 400
Query: 153 AKDKQ 157
K+++
Sbjct: 401 KKEEE 405
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 30/114 (26%), Positives = 52/114 (45%), Gaps = 30/114 (26%)
Query: 163 PPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGFSR 222
PPH + + +L+++DFKQ L G+ EF G L C V +R+V ++
Sbjct: 657 PPHKSVLVGDLKMADFKQFLASKGVQVEFAGGALRCGE-YVTLRKVGNIN---------- 705
Query: 223 DLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
+ S + IV+EG L ++YY++++ LY Q+ +L
Sbjct: 706 -------------------QKGGGSGTQQIVIEGPLCEDYYKIREYLYSQFYLL 740
Score = 39.7 bits (91), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 24/39 (61%)
Query: 205 VRRVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRNR 243
V +VVL S +E GFS D+F +W + +N ++ T R +
Sbjct: 320 VPKVVLASMASLEAGFSHDIFVEWAADVKNLVLFTERGQ 358
>gi|66826811|ref|XP_646760.1| beta-lactamase domain-containing protein [Dictyostelium discoideum
AX4]
gi|74858209|sp|Q55BS1.1|CPSF2_DICDI RecName: Full=Cleavage and polyadenylation specificity factor
subunit 2; AltName: Full=Cleavage and polyadenylation
specificity factor 100 kDa subunit; Short=CPSF 100 kDa
subunit
gi|60474609|gb|EAL72546.1| beta-lactamase domain-containing protein [Dictyostelium discoideum
AX4]
Length = 784
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 49/132 (37%), Positives = 80/132 (60%), Gaps = 6/132 (4%)
Query: 32 AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVP-SPKVVLVSTPDMECGFSRD 90
+++E+MS FE NPF FKH+K+ SL EL ++P + KV+L S+ D+E GFSR+
Sbjct: 285 SQLEFMSSTASVKFEQNIENPFSFKHIKILSSLEELQELPDTNKVILTSSQDLETGFSRE 344
Query: 91 LFFQWCSSPENSIIITNRTSPGTLARDLIEL-----GGNRTLTLQVKKRIRLEGEELEEY 145
LF QWCS P+ I+ T + +LA LI+ G + + + R+ L G+EL +Y
Sbjct: 345 LFIQWCSDPKTLILFTQKIPKDSLADKLIKQYSTPNGRGKCIEIVQGSRVPLTGDELLQY 404
Query: 146 QKKKDKEAKDKQ 157
+ ++ K+ ++K+
Sbjct: 405 EMEQAKQREEKR 416
Score = 51.6 bits (122), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 33/112 (29%), Positives = 50/112 (44%), Gaps = 36/112 (32%)
Query: 165 HDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGFSRDL 224
HD SFI +++LSD KQ L GI +F G+L C G V + R
Sbjct: 709 HDESFIGDIKLSDLKQVLVNAGIQVQFDQGIL-NCGGLVYIWR----------------- 750
Query: 225 FFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
D +++I ++G +SDEYY +++LLY Q+ I+
Sbjct: 751 ------------------DEDHGGNSIINVDGIISDEYYLIKELLYKQFQIV 784
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 18/35 (51%), Positives = 26/35 (74%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 241
+V+L S+ D+E GFSR+LF QWCS P+ I+ T +
Sbjct: 328 KVILTSSQDLETGFSRELFIQWCSDPKTLILFTQK 362
>gi|224121102|ref|XP_002330904.1| predicted protein [Populus trichocarpa]
gi|222872726|gb|EEF09857.1| predicted protein [Populus trichocarpa]
Length = 740
Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 54/162 (33%), Positives = 89/162 (54%), Gaps = 4/162 (2%)
Query: 5 LMKSFEGAR--NNPFHFKHVKLCHSLAELAK-IEWMSDKLMKSFEGARNNPFHFKHVKLC 61
+++ F G R N P F ++ + +EWMSD + KSFE +R+N F KHV L
Sbjct: 248 ILEQFWGQRFLNYPIFFLSYVSSSTIDYIKSFLEWMSDSIAKSFETSRDNAFLMKHVTLL 307
Query: 62 HSLAELAKVPS-PKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIE 120
S EL + PKVVL S +E GFS D+F +W + +N ++ T R GTLAR L
Sbjct: 308 ISKDELDNASTGPKVVLASVASLEAGFSHDIFAEWAADVKNLVLFTERGQFGTLARMLQA 367
Query: 121 LGGNRTLTLQVKKRIRLEGEELEEYQKKKDKEAKDKQEKEKI 162
+ + + + +R+ L G+EL Y++++ + ++++ K +
Sbjct: 368 DPPPKAVKMTMSRRVPLVGDELIAYEEEQKRLKREEELKASL 409
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 31/114 (27%), Positives = 52/114 (45%), Gaps = 30/114 (26%)
Query: 163 PPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGFSR 222
PPH + + +L+++DFKQ L G+ EF G L C V +R+ V P + G
Sbjct: 657 PPHKSVLVGDLKMADFKQFLASKGVQVEFAGGALRCGE-YVTLRK---VGNPSQKGG--- 709
Query: 223 DLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
S + I++EG L ++YY++++ LY Q+ +L
Sbjct: 710 -----------------------ASGTQQIIIEGPLCEDYYKIREYLYSQFYLL 740
Score = 38.5 bits (88), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 14/37 (37%), Positives = 23/37 (62%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRNR 243
+VVL S +E GFS D+F +W + +N ++ T R +
Sbjct: 321 KVVLASVASLEAGFSHDIFAEWAADVKNLVLFTERGQ 357
>gi|145340766|ref|XP_001415490.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144575713|gb|ABO93782.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 715
Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 49/135 (36%), Positives = 75/135 (55%), Gaps = 5/135 (3%)
Query: 32 AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAEL-AKVPSPKVVLVSTPDMECGFSRD 90
+ +EWM D + +FE R NPF+ K + LCHS+ EL A P PKVVL S +E G SR
Sbjct: 262 SHLEWMGDHVTNAFERRRENPFNTKFLTLCHSMEELQALPPGPKVVLASFGSLEAGPSRH 321
Query: 91 LFFQWCSSPENSIIITNRTSPGTLARDLIELG----GNRTLTLQVKKRIRLEGEELEEYQ 146
LF +W N +I+T + G+L +++L + + L + +RI LEG EL E++
Sbjct: 322 LFAEWAEDKSNLVILTGQPEHGSLTEQVVQLSAKATAKKKIKLTLSRRIPLEGSELAEHE 381
Query: 147 KKKDKEAKDKQEKEK 161
+ + EK++
Sbjct: 382 SSRKSSTSTELEKKE 396
Score = 65.5 bits (158), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 37/108 (34%), Positives = 51/108 (47%), Gaps = 39/108 (36%)
Query: 169 FINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGFSRDLFFQW 228
F+ +L+LSDF+Q L + GI EF DGVL+C G V VR+
Sbjct: 647 FLGDLRLSDFRQALAQEGIIAEFADGVLVCANGRVTVRK--------------------- 685
Query: 229 CSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
D D ++V EG LS +Y+ V+Q+LY QY+IL
Sbjct: 686 -----------------DGDEKLVV-EGALSQDYFEVRQILYSQYSIL 715
>gi|290981012|ref|XP_002673225.1| predicted protein [Naegleria gruberi]
gi|284086807|gb|EFC40481.1| predicted protein [Naegleria gruberi]
Length = 808
Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 50/153 (32%), Positives = 81/153 (52%), Gaps = 38/153 (24%)
Query: 33 KIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAEL------------------------- 67
++EWMSDK+MK F+ R NPF F++ +CH++ EL
Sbjct: 327 QLEWMSDKIMKGFDEKRENPFKFQYFSVCHNVEELMDKLQKKEQMRMMMENQMNDEDEET 386
Query: 68 --AKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLI------ 119
+P VVL S+ ++ G++R+LF +WC N ++ R++P +L+R LI
Sbjct: 387 ATTGKHTPMVVLASSNTLDYGYARELFVKWCEDQRNLVMFIERSAPNSLSRKLINKLRAK 446
Query: 120 ---ELGGNRTLTLQVKKRIRLEGEELEEYQKKK 149
L N +LTL +R+ L+GEELE+Y+K++
Sbjct: 447 KSERLDENMSLTLY--RRVALKGEELEKYEKEQ 477
>gi|384251490|gb|EIE24968.1| hypothetical protein COCSUDRAFT_83661 [Coccomyxa subellipsoidea
C-169]
Length = 731
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 50/148 (33%), Positives = 86/148 (58%), Gaps = 9/148 (6%)
Query: 24 LCHSLAELA--KIEWMSDKLMKSFEGARNNPFHF---KHVKLCHSLAELAKVP-SPKVVL 77
+ +++ ELA ++EWMS + + FE + NPF K +KLC + ELAK+P P+VV+
Sbjct: 270 MAYNVLELASSQLEWMSHYIGQMFERTKQNPFSVRQAKKLKLCRTTEELAKLPPGPRVVM 329
Query: 78 VSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDL---IELGGNRTLTLQVKKR 134
+ P +E G SR L +W ++P N I+ R TLA L ++ G T+ +++ KR
Sbjct: 330 ATLPSLEAGASRQLLTEWATNPANLILFPGRAPNDTLAGLLQQNMQSGQPFTVPIRLSKR 389
Query: 135 IRLEGEELEEYQKKKDKEAKDKQEKEKI 162
+ L+G EL+ +Q+ + +++E+ I
Sbjct: 390 MPLQGAELQAWQESQTAHVLEEEEEPAI 417
>gi|356530856|ref|XP_003533995.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
2-like isoform 1 [Glycine max]
Length = 736
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 49/123 (39%), Positives = 71/123 (57%), Gaps = 1/123 (0%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVP-SPKVVLVSTPDMECGFSRDLF 92
+EWMSD + KSFE R N F K+V L + EL P PKVVL S +E GFS D+F
Sbjct: 280 LEWMSDTIAKSFEKTRENIFLLKYVTLLINKTELDNAPDGPKVVLASMASLEAGFSHDIF 339
Query: 93 FQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKDKE 152
+W + +N ++ T R TLAR L + + + V KR+ L GEEL Y++++++
Sbjct: 340 VEWANDVKNLVLFTERGQFATLARMLQADPPPKAVKVVVSKRVPLVGEELIAYEEEQNRI 399
Query: 153 AKD 155
K+
Sbjct: 400 KKE 402
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 31/114 (27%), Positives = 49/114 (42%), Gaps = 30/114 (26%)
Query: 163 PPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGFSR 222
PPH + + +L+L+D KQ L G+ EF G L C V +R+V S
Sbjct: 653 PPHKSVLVGDLKLADIKQFLSSKGVQVEFAGGALRCGE-YVTLRKVGDAS---------- 701
Query: 223 DLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
+ S + IV+EG L ++YY+++ LY Q+ +L
Sbjct: 702 -------------------QKGGGSGAQQIVIEGPLCEDYYKIRDYLYSQFYLL 736
Score = 38.1 bits (87), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 14/37 (37%), Positives = 23/37 (62%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRNR 243
+VVL S +E GFS D+F +W + +N ++ T R +
Sbjct: 321 KVVLASMASLEAGFSHDIFVEWANDVKNLVLFTERGQ 357
>gi|356530858|ref|XP_003533996.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
2-like isoform 2 [Glycine max]
Length = 742
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 49/123 (39%), Positives = 71/123 (57%), Gaps = 1/123 (0%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVP-SPKVVLVSTPDMECGFSRDLF 92
+EWMSD + KSFE R N F K+V L + EL P PKVVL S +E GFS D+F
Sbjct: 283 LEWMSDTIAKSFEKTRENIFLLKYVTLLINKTELDNAPDGPKVVLASMASLEAGFSHDIF 342
Query: 93 FQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKDKE 152
+W + +N ++ T R TLAR L + + + V KR+ L GEEL Y++++++
Sbjct: 343 VEWANDVKNLVLFTERGQFATLARMLQADPPPKAVKVVVSKRVPLVGEELIAYEEEQNRI 402
Query: 153 AKD 155
K+
Sbjct: 403 KKE 405
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 31/114 (27%), Positives = 49/114 (42%), Gaps = 30/114 (26%)
Query: 163 PPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGFSR 222
PPH + + +L+L+D KQ L G+ EF G L C V +R+V S
Sbjct: 659 PPHKSVLVGDLKLADIKQFLSSKGVQVEFAGGALRCGE-YVTLRKVGDAS---------- 707
Query: 223 DLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
+ S + IV+EG L ++YY+++ LY Q+ +L
Sbjct: 708 -------------------QKGGGSGAQQIVIEGPLCEDYYKIRDYLYSQFYLL 742
Score = 38.1 bits (87), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 14/37 (37%), Positives = 23/37 (62%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRNR 243
+VVL S +E GFS D+F +W + +N ++ T R +
Sbjct: 324 KVVLASMASLEAGFSHDIFVEWANDVKNLVLFTERGQ 360
>gi|326436560|gb|EGD82130.1| hypothetical protein PTSG_02804 [Salpingoeca sp. ATCC 50818]
Length = 630
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 43/117 (36%), Positives = 67/117 (57%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
++++SD L + PF F ++K SL + P PKVV+ + +E GFSR LF
Sbjct: 195 VDFLSDGLQAQMNEHKAMPFRFPNIKRVQSLDAINAHPGPKVVVTAELGLEAGFSRQLFA 254
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKD 150
QW ++P+N+II T R TLA + TL L++ R+ LEGEELE ++ +++
Sbjct: 255 QWAANPDNAIIFTRRPDEDTLAHSIYHNTAPDTLQLRLGARVELEGEELEAHRAERE 311
Score = 43.9 bits (102), Expect = 0.068, Method: Compositional matrix adjust.
Identities = 18/39 (46%), Positives = 26/39 (66%)
Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRNRAD 245
+VV+ + +E GFSR LF QW ++P+N+II T R D
Sbjct: 235 KVVVTAELGLEAGFSRQLFAQWAANPDNAIIFTRRPDED 273
>gi|356559788|ref|XP_003548179.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
2-like isoform 1 [Glycine max]
Length = 738
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 48/123 (39%), Positives = 71/123 (57%), Gaps = 1/123 (0%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVP-SPKVVLVSTPDMECGFSRDLF 92
+EWMSD + KSFE R N F K+V L + EL P PKVVL S +E GFS ++F
Sbjct: 280 LEWMSDTIAKSFEKTRENIFLLKYVTLLINKTELDNAPDGPKVVLASMASLEAGFSHEIF 339
Query: 93 FQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKDKE 152
+W + +N ++ T R TLAR L + + + V KR+ L GEEL Y++++++
Sbjct: 340 VEWANDVKNLVLFTERGQFATLARMLQADPPPKAVKVVVSKRVALVGEELIAYEEEQNRI 399
Query: 153 AKD 155
K+
Sbjct: 400 KKE 402
Score = 50.4 bits (119), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 31/114 (27%), Positives = 49/114 (42%), Gaps = 30/114 (26%)
Query: 163 PPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGFSR 222
PPH + + +L+L+D KQ L G+ EF G L C V +R+V S
Sbjct: 655 PPHKSVLVGDLKLADIKQFLSSKGVQVEFAGGALRCGE-YVTLRKVGDAS---------- 703
Query: 223 DLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
+ S + IV+EG L ++YY+++ LY Q+ +L
Sbjct: 704 -------------------QKGGGSGAQQIVIEGPLCEDYYKIRDYLYSQFYLL 738
>gi|356559790|ref|XP_003548180.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
2-like isoform 2 [Glycine max]
Length = 743
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 48/123 (39%), Positives = 71/123 (57%), Gaps = 1/123 (0%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVP-SPKVVLVSTPDMECGFSRDLF 92
+EWMSD + KSFE R N F K+V L + EL P PKVVL S +E GFS ++F
Sbjct: 285 LEWMSDTIAKSFEKTRENIFLLKYVTLLINKTELDNAPDGPKVVLASMASLEAGFSHEIF 344
Query: 93 FQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKDKE 152
+W + +N ++ T R TLAR L + + + V KR+ L GEEL Y++++++
Sbjct: 345 VEWANDVKNLVLFTERGQFATLARMLQADPPPKAVKVVVSKRVALVGEELIAYEEEQNRI 404
Query: 153 AKD 155
K+
Sbjct: 405 KKE 407
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 31/114 (27%), Positives = 49/114 (42%), Gaps = 30/114 (26%)
Query: 163 PPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGFSR 222
PPH + + +L+L+D KQ L G+ EF G L C V +R+V S
Sbjct: 660 PPHKSVLVGDLKLADIKQFLSSKGVQVEFAGGALRCGE-YVTLRKVGDAS---------- 708
Query: 223 DLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
+ S + IV+EG L ++YY+++ LY Q+ +L
Sbjct: 709 -------------------QKGGGSGAQQIVIEGPLCEDYYKIRDYLYSQFYLL 743
>gi|428169733|gb|EKX38664.1| hypothetical protein GUITHDRAFT_89302 [Guillardia theta CCMP2712]
Length = 770
Score = 87.8 bits (216), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 46/129 (35%), Positives = 78/129 (60%), Gaps = 6/129 (4%)
Query: 31 LAKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMEC-GFSR 89
+++ EW+S + + F +R+N FH ++V C S EL ++P P+VVL S D+E FS
Sbjct: 281 MSQTEWLSKHVNQRFIQSRHNVFHLENVHRCCSREELGRLPYPQVVLASGLDLETSSFSL 340
Query: 90 DLFFQWCSSPENSIIITNRTSPGTLARDLIELGGN-----RTLTLQVKKRIRLEGEELEE 144
DLF +W +N +++T + PG+ AR +L G+ L LQ+ +R+ LEG EL E
Sbjct: 341 DLFAEWAPDSKNLVLLTQKARPGSRARQFQDLMGSGLPLPSNLMLQMHRRVPLEGRELRE 400
Query: 145 YQKKKDKEA 153
+++++ +A
Sbjct: 401 HEEQERLKA 409
>gi|326495752|dbj|BAJ85972.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 726
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 50/143 (34%), Positives = 84/143 (58%), Gaps = 8/143 (5%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVP-SPKVVLVSTPDMECGFSRDLF 92
+EWMSD + KSFE R+N F +HV L + EL K+ +PKVVL S +E GFS D+F
Sbjct: 268 LEWMSDSISKSFEHTRDNAFLLRHVSLIINKEELEKLGDAPKVVLASMSSLEVGFSHDIF 327
Query: 93 FQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKDKE 152
+ + +N ++ T + GTLAR L + + + + KR+ L G+EL+ Y++++++
Sbjct: 328 VEMANEAKNLVLFTEKGQFGTLARMLQVDPPPKAVKVTMSKRVPLVGDELKAYEEEQERI 387
Query: 153 AKDK-------QEKEKIPPHDTS 168
K++ +EKE H+++
Sbjct: 388 KKEEVLKASLSKEKELKASHESN 410
>gi|226492345|ref|NP_001151557.1| LOC100285191 [Zea mays]
gi|195647682|gb|ACG43309.1| cleavage and polyadenylation specificity factor, 100 kDa subunit
[Zea mays]
Length = 673
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 49/130 (37%), Positives = 75/130 (57%), Gaps = 1/130 (0%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVP-SPKVVLVSTPDMECGFSRDLF 92
+EWM D++ KSFE +R N F K V L + EL K+ +PKVVL S +E GFS D+F
Sbjct: 216 LEWMGDQIAKSFESSRANAFLLKKVTLIINKEELEKLGDAPKVVLASMASLEVGFSHDIF 275
Query: 93 FQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKDKE 152
+ + N ++ T + GTLAR L + L + + KRI L G EL+ Y++++++
Sbjct: 276 VEMANEARNLVLFTEKGQFGTLARMLQVDPPPKALKVTMSKRIPLVGNELKAYEEEQERI 335
Query: 153 AKDKQEKEKI 162
K+K K +
Sbjct: 336 KKEKSLKASL 345
Score = 46.2 bits (108), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 32/108 (29%), Positives = 50/108 (46%), Gaps = 31/108 (28%)
Query: 169 FINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGFSRDLFFQW 228
+ +L+LSDFKQ L+ G EF G L C + VR+V
Sbjct: 597 LVGDLKLSDFKQFLENKGWQVEFAGGALRCGE-YIMVRKV-------------------- 635
Query: 229 CSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
+SI+ + S IV+EG L ++YY++++LLY Q+ +L
Sbjct: 636 ----GDSIL------KGSTGSQQIVIEGPLCEDYYKIRELLYSQFYLL 673
>gi|281208327|gb|EFA82503.1| beta-lactamase domain-containing protein [Polysphondylium pallidum
PN500]
Length = 738
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 49/127 (38%), Positives = 75/127 (59%), Gaps = 6/127 (4%)
Query: 24 LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHS---LAELAKVPSPKVVLV 78
+ +++ E AK +E+MS FE N F FK++KLCHS L L + S VVL
Sbjct: 280 VTYNVCEFAKSQLEFMSTAAGLKFEQRNENIFAFKNIKLCHSIYDLENLMGLSSNYVVLA 339
Query: 79 STPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
S D+E G++R+LF +W + +N I++T+ GTLA L+ ++TL++ +R+ LE
Sbjct: 340 SGKDLESGYARELFIKWAADSKNLILMTDSVEEGTLASHLLN-DQPESVTLELGRRVELE 398
Query: 139 GEELEEY 145
GEEL Y
Sbjct: 399 GEELRAY 405
Score = 53.9 bits (128), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 34/113 (30%), Positives = 53/113 (46%), Gaps = 34/113 (30%)
Query: 164 PHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGFSRD 223
PH++SFI +++LS+FKQ L +G +F G++ T
Sbjct: 660 PHNSSFIGDIKLSEFKQLLIDSGYQVQFDQGIIAVSMKTT-------------------- 699
Query: 224 LFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
L + W R D + S I ++G LSDEYY+V++LLY Q+ I+
Sbjct: 700 LIYIW------------REEVDGNSS--IQIDGILSDEYYQVRELLYQQFQII 738
>gi|219886123|gb|ACL53436.1| unknown [Zea mays]
gi|414881946|tpg|DAA59077.1| TPA: cleavage and polyadenylation specificity factor, subunit
isoform 1 [Zea mays]
gi|414881947|tpg|DAA59078.1| TPA: cleavage and polyadenylation specificity factor, subunit
isoform 2 [Zea mays]
gi|414881948|tpg|DAA59079.1| TPA: cleavage and polyadenylation specificity factor, subunit
isoform 3 [Zea mays]
Length = 737
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 49/130 (37%), Positives = 75/130 (57%), Gaps = 1/130 (0%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVP-SPKVVLVSTPDMECGFSRDLF 92
+EWM D++ KSFE +R N F K V L + EL K+ +PKVVL S +E GFS D+F
Sbjct: 280 LEWMGDQIAKSFESSRANAFLLKKVTLIINKEELEKLGDAPKVVLASMASLEVGFSHDIF 339
Query: 93 FQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKDKE 152
+ + N ++ T + GTLAR L + L + + KRI L G EL+ Y++++++
Sbjct: 340 VEMANEARNLVLFTEKGQFGTLARMLQVDPPPKALKVTMSKRIPLVGNELKAYEEEQERI 399
Query: 153 AKDKQEKEKI 162
K+K K +
Sbjct: 400 KKEKSLKASL 409
Score = 46.2 bits (108), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 32/108 (29%), Positives = 50/108 (46%), Gaps = 31/108 (28%)
Query: 169 FINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGFSRDLFFQW 228
+ +L+LSDFKQ L+ G EF G L C + VR+V
Sbjct: 661 LVGDLKLSDFKQFLENKGWQVEFAGGALRCGE-YIMVRKV-------------------- 699
Query: 229 CSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
+SI+ + S IV+EG L ++YY++++LLY Q+ +L
Sbjct: 700 ----GDSIL------KGSTGSQQIVIEGPLCEDYYKIRELLYSQFYLL 737
>gi|308799055|ref|XP_003074308.1| polyadenylation cleavage/specificity factor 100 kDa subunit (ISS)
[Ostreococcus tauri]
gi|116000479|emb|CAL50159.1| polyadenylation cleavage/specificity factor 100 kDa subunit (ISS)
[Ostreococcus tauri]
Length = 807
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 40/121 (33%), Positives = 68/121 (56%), Gaps = 5/121 (4%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAEL-AKVPSPKVVLVSTPDMECGFSRDLF 92
+EWM D + +FE R NPF+ K + +CH++ EL A P PKVVL S +E G +R LF
Sbjct: 310 LEWMGDLVTSAFERRRENPFNTKFITICHTMDELKALPPGPKVVLASFGSLEAGPARHLF 369
Query: 93 FQWCSSPENSIIITNRTSPGTLARDLIEL----GGNRTLTLQVKKRIRLEGEELEEYQKK 148
+W N +++T + G+L +++ + + + + +R+ LEGEEL ++
Sbjct: 370 AEWAGDKSNLVVLTGQPEEGSLMEEVVRVSSKPAAKKNVKFTLSRRVPLEGEELATHEST 429
Query: 149 K 149
+
Sbjct: 430 R 430
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 33/108 (30%), Positives = 49/108 (45%), Gaps = 39/108 (36%)
Query: 169 FINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGFSRDLFFQW 228
F+ +L+L DF+Q L + GI EF GVL+C G V +R+
Sbjct: 739 FLGDLRLVDFRQALAQEGITAEFSGGVLVCADGRVTIRK--------------------- 777
Query: 229 CSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
DSD +V+EG LS +++ ++Q+LY QY IL
Sbjct: 778 -----------------DSDEK-LVIEGALSQDFFEIRQILYSQYQIL 807
>gi|357160194|ref|XP_003578687.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
2-like [Brachypodium distachyon]
Length = 738
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 47/130 (36%), Positives = 78/130 (60%), Gaps = 1/130 (0%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVP-SPKVVLVSTPDMECGFSRDLF 92
+EWMSD + KSFE R+N F ++V L + EL K+ +PKVVL S +E GFS D+F
Sbjct: 280 LEWMSDSISKSFEHTRDNAFLLRYVSLIINKEELEKLGDAPKVVLASMASLEVGFSHDIF 339
Query: 93 FQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKDKE 152
+ + +N ++ T + GTLAR L + + + + KRI L G+EL+ Y++++++
Sbjct: 340 VEMANEAKNLVLFTEKGQFGTLARMLQVDPPPKAVKVTMGKRIPLVGDELKAYEEEQERI 399
Query: 153 AKDKQEKEKI 162
K++ K +
Sbjct: 400 KKEELLKASL 409
>gi|357127861|ref|XP_003565596.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
2-like [Brachypodium distachyon]
Length = 738
Score = 84.7 bits (208), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 47/130 (36%), Positives = 78/130 (60%), Gaps = 1/130 (0%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVP-SPKVVLVSTPDMECGFSRDLF 92
+EWMSD + KSFE R+N F ++V L + EL K+ +PKVVL S +E GFS D+F
Sbjct: 280 LEWMSDSISKSFEHTRDNAFLLRYVSLIINKEELEKLGDAPKVVLASMASLEVGFSHDIF 339
Query: 93 FQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKDKE 152
+ + +N ++ T + GTLAR L + + + + KRI L G+EL+ Y++++++
Sbjct: 340 VEMANEAKNLVLFTEKGQFGTLARMLQVDPPPKAVKVTMGKRIPLVGDELKAYEEEQERI 399
Query: 153 AKDKQEKEKI 162
K++ K +
Sbjct: 400 KKEELLKASL 409
>gi|405969098|gb|EKC34107.1| Cleavage and polyadenylation specificity factor subunit 2
[Crassostrea gigas]
Length = 135
Score = 84.0 bits (206), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 37/51 (72%), Positives = 45/51 (88%), Gaps = 2/51 (3%)
Query: 27 SLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKV 75
++ E AK +EWMSDK+M+SFE ARNNPFHFKHVKLCH+LAELA++P PKV
Sbjct: 23 NVVEFAKSQVEWMSDKIMRSFEEARNNPFHFKHVKLCHNLAELARIPEPKV 73
Score = 67.4 bits (163), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 29/34 (85%), Positives = 33/34 (97%)
Query: 1 MSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKI 34
MSDK+M+SFE ARNNPFHFKHVKLCH+LAELA+I
Sbjct: 35 MSDKIMRSFEEARNNPFHFKHVKLCHNLAELARI 68
>gi|242037469|ref|XP_002466129.1| hypothetical protein SORBIDRAFT_01g001930 [Sorghum bicolor]
gi|241919983|gb|EER93127.1| hypothetical protein SORBIDRAFT_01g001930 [Sorghum bicolor]
Length = 738
Score = 84.0 bits (206), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 48/130 (36%), Positives = 75/130 (57%), Gaps = 1/130 (0%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVP-SPKVVLVSTPDMECGFSRDLF 92
+EWM D++ KSFE R N F K V L + EL K+ +PKVVL S +E GFS D+F
Sbjct: 280 LEWMRDQIAKSFESNRANAFLLKKVMLIINKEELEKLGDAPKVVLASMASLEVGFSHDIF 339
Query: 93 FQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKDKE 152
+ + N ++ T + GTLAR L + + + + KRI L G+EL+ Y++++++
Sbjct: 340 VEMANEARNLVLFTEKGQFGTLARMLQVDPPPKAVKVTMSKRIPLVGDELKAYEEEQERI 399
Query: 153 AKDKQEKEKI 162
K+K K +
Sbjct: 400 KKEKALKASL 409
Score = 47.4 bits (111), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 30/108 (27%), Positives = 49/108 (45%), Gaps = 31/108 (28%)
Query: 169 FINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGFSRDLFFQW 228
+ +L+LSDFKQ L+ G EF G L C + VR+
Sbjct: 662 LVGDLKLSDFKQFLENKGWQVEFAGGALRCGE-YIMVRK--------------------- 699
Query: 229 CSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
I + ++ + S IV+EG L ++YY++++LLY Q+ +L
Sbjct: 700 ---------IGDSSQKGSTGSQQIVIEGPLCEDYYKIRELLYSQFYLL 738
>gi|159465769|ref|XP_001691095.1| predicted protein [Chlamydomonas reinhardtii]
gi|158279781|gb|EDP05541.1| predicted protein [Chlamydomonas reinhardtii]
Length = 389
Score = 80.9 bits (198), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 32/86 (37%), Positives = 54/86 (62%)
Query: 33 KIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLF 92
++E++ +++++F R PF F+ + + L +L +P PKVVL + P ++CG +R L
Sbjct: 304 QLEYLGSEMVQAFSHKRTIPFTFRKLAVITRLEDLGAIPGPKVVLATLPSLDCGPARQLL 363
Query: 93 FQWCSSPENSIIITNRTSPGTLARDL 118
W ++P N+II T R +PGTLA L
Sbjct: 364 VDWAAAPRNTIIFTERANPGTLAHAL 389
Score = 43.5 bits (101), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 16/41 (39%), Positives = 25/41 (60%)
Query: 201 GTVAVRRVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 241
G + +VVL + P ++CG +R L W ++P N+II T R
Sbjct: 339 GAIPGPKVVLATLPSLDCGPARQLLVDWAAAPRNTIIFTER 379
>gi|414881949|tpg|DAA59080.1| TPA: hypothetical protein ZEAMMB73_548570 [Zea mays]
Length = 766
Score = 80.9 bits (198), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 49/136 (36%), Positives = 75/136 (55%), Gaps = 7/136 (5%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVP-SPKVVLVSTPDMECGFSRDLF 92
+EWM D++ KSFE +R N F K V L + EL K+ +PKVVL S +E GFS D+F
Sbjct: 280 LEWMGDQIAKSFESSRANAFLLKKVTLIINKEELEKLGDAPKVVLASMASLEVGFSHDIF 339
Query: 93 FQWCSSPENSIIITNRTSP------GTLARDLIELGGNRTLTLQVKKRIRLEGEELEEYQ 146
+ + N ++ T + GTLAR L + L + + KRI L G EL+ Y+
Sbjct: 340 VEMANEARNLVLFTEKGQKIFALQFGTLARMLQVDPPPKALKVTMSKRIPLVGNELKAYE 399
Query: 147 KKKDKEAKDKQEKEKI 162
+++++ K+K K +
Sbjct: 400 EEQERIKKEKSLKASL 415
>gi|115480769|ref|NP_001063978.1| Os09g0569400 [Oryza sativa Japonica Group]
gi|75253249|sp|Q652P4.1|CPSF2_ORYSJ RecName: Full=Cleavage and polyadenylation specificity factor
subunit 2; AltName: Full=Cleavage and polyadenylation
specificity factor 100 kDa subunit; Short=CPSF 100 kDa
subunit
gi|52077178|dbj|BAD46223.1| putative cleavage and polyadenylation specificity factor [Oryza
sativa Japonica Group]
gi|113632211|dbj|BAF25892.1| Os09g0569400 [Oryza sativa Japonica Group]
Length = 738
Score = 78.6 bits (192), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 46/130 (35%), Positives = 75/130 (57%), Gaps = 1/130 (0%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVP-SPKVVLVSTPDMECGFSRDLF 92
+EWM+D + KSFE R+N F K V + EL K+ +PKVVL S +E GFS D+F
Sbjct: 280 LEWMNDSISKSFEHTRDNAFLLKCVTQIINKDELEKLGDAPKVVLASMASLEVGFSHDIF 339
Query: 93 FQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKDKE 152
+ +N ++ T + GTLAR L + + + + KRI L G+EL+ Y++++++
Sbjct: 340 VDMANEAKNLVLFTEKGQFGTLARMLQVDPPPKAVKVTMSKRIPLVGDELKAYEEEQERI 399
Query: 153 AKDKQEKEKI 162
K++ K +
Sbjct: 400 KKEEALKASL 409
>gi|218202664|gb|EEC85091.1| hypothetical protein OsI_32459 [Oryza sativa Indica Group]
Length = 1195
Score = 78.6 bits (192), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 46/130 (35%), Positives = 75/130 (57%), Gaps = 1/130 (0%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVP-SPKVVLVSTPDMECGFSRDLF 92
+EWM+D + KSFE R+N F K V + EL K+ +PKVVL S +E GFS D+F
Sbjct: 280 LEWMNDSISKSFEHTRDNAFLLKCVTQIINKDELEKLGDAPKVVLASMASLEVGFSHDIF 339
Query: 93 FQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKDKE 152
+ +N ++ T + GTLAR L + + + + KRI L G+EL+ Y++++++
Sbjct: 340 VDMANEAKNLVLFTEKGQFGTLARMLQVDPPPKAVKVTMSKRIPLVGDELKAYEEEQERI 399
Query: 153 AKDKQEKEKI 162
K++ K +
Sbjct: 400 KKEEALKASL 409
>gi|222642134|gb|EEE70266.1| hypothetical protein OsJ_30409 [Oryza sativa Japonica Group]
Length = 1073
Score = 78.6 bits (192), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 46/130 (35%), Positives = 75/130 (57%), Gaps = 1/130 (0%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVP-SPKVVLVSTPDMECGFSRDLF 92
+EWM+D + KSFE R+N F K V + EL K+ +PKVVL S +E GFS D+F
Sbjct: 280 LEWMNDSISKSFEHTRDNAFLLKCVTQIINKDELEKLGDAPKVVLASMASLEVGFSHDIF 339
Query: 93 FQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKDKE 152
+ +N ++ T + GTLAR L + + + + KRI L G+EL+ Y++++++
Sbjct: 340 VDMANEAKNLVLFTEKGQFGTLARMLQVDPPPKAVKVTMSKRIPLVGDELKAYEEEQERI 399
Query: 153 AKDKQEKEKI 162
K++ K +
Sbjct: 400 KKEEALKASL 409
>gi|167535876|ref|XP_001749611.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163772003|gb|EDQ85662.1| predicted protein [Monosiga brevicollis MX1]
Length = 770
Score = 77.0 bits (188), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 42/126 (33%), Positives = 71/126 (56%), Gaps = 1/126 (0%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
IE+MSDK++ + +NPF ++K ++ + + KVV+V +E GF R+L F
Sbjct: 281 IEFMSDKMLVKLQTMTHNPFACPNIKCQKTVDGVMRSAGAKVVIVPHSSLEAGFGRELLF 340
Query: 94 QWCSSPENSIIITNRTSPGTL-ARDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKDKE 152
+ N + R P +L AR L + G T+ + + R++LEGEEL+ Y++ K +E
Sbjct: 341 RLAGEARNRFLFIARPPPHSLGARLLAKSGQIHTIQFEHRFRVQLEGEELKAYRQHKAEE 400
Query: 153 AKDKQE 158
AK ++E
Sbjct: 401 AKQQKE 406
>gi|422293869|gb|EKU21169.1| cleavage and polyadenylation specificity factor subunit 2
[Nannochloropsis gaditana CCMP526]
Length = 925
Score = 75.5 bits (184), Expect = 3e-11, Method: Composition-based stats.
Identities = 44/134 (32%), Positives = 71/134 (52%), Gaps = 6/134 (4%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAEL----AKVPSPKVVLVSTPDMECGFSR 89
+E+M ++++ F+ + NPF KHV S+ EL + +P VVL S M+ GFSR
Sbjct: 353 VEYMHPEVIRDFDRSLRNPFSLKHVVPAQSMLELEAAMGEYRNPVVVLASDEGMDTGFSR 412
Query: 90 DLFFQWCSSPENSIIITNRTSPGTLARDLIELGG--NRTLTLQVKKRIRLEGEELEEYQK 147
L +W S PEN++++ G+LA +L L+ V R+ GEEL ++
Sbjct: 413 ALATRWASGPENALLLCGHLRKGSLAESFWKLRHLPKAALSFSVPVIERIVGEELAGLRE 472
Query: 148 KKDKEAKDKQEKEK 161
K+D+E + E E+
Sbjct: 473 KEDRERRKALEAEE 486
Score = 38.1 bits (87), Expect = 3.7, Method: Composition-based stats.
Identities = 19/61 (31%), Positives = 33/61 (54%), Gaps = 11/61 (18%)
Query: 208 VVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQ 267
VVL S M+ GFSR L +W S PEN++++ R +G L++ +++++
Sbjct: 398 VVLASDEGMDTGFSRALATRWASGPENALLLCGHLR-----------KGSLAESFWKLRH 446
Query: 268 L 268
L
Sbjct: 447 L 447
>gi|328866931|gb|EGG15314.1| beta-lactamase domain-containing protein [Dictyostelium
fasciculatum]
Length = 768
Score = 74.7 bits (182), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 36/100 (36%), Positives = 61/100 (61%), Gaps = 5/100 (5%)
Query: 24 LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPK---VVLV 78
+ + + E AK +E+MS FE N F+F+++K+C+S +L ++P+ VVL
Sbjct: 278 VTYYVREFAKSQLEFMSTAASSKFEQKNENIFNFRNIKICNSFKQLEELPNLTRNYVVLA 337
Query: 79 STPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDL 118
S+ D+E GF++DLF QW + P+N +++T+ GTL L
Sbjct: 338 SSKDLETGFAKDLFIQWANDPKNMVMLTDNMDEGTLGDQL 377
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 30/112 (26%), Positives = 49/112 (43%), Gaps = 37/112 (33%)
Query: 165 HDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGFSRDL 224
H+ SF+ +++LS+FK L R G +F G+L C L
Sbjct: 694 HNVSFVGDVKLSEFKDVLTREGFQVQFDKGILSC-----------------------NGL 730
Query: 225 FFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
+ W R + ++ I ++G +S+EYY V++LLY Q+ IL
Sbjct: 731 VYLW--------------REEVDGNSCINIDGVMSEEYYLVKELLYSQFKIL 768
Score = 46.2 bits (108), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 23/59 (38%), Positives = 35/59 (59%), Gaps = 11/59 (18%)
Query: 208 VVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQ 266
VVL S+ D+E GF++DLF QW + P+N +++T D+ D EG L D+ + Q
Sbjct: 334 VVLASSKDLETGFAKDLFIQWANDPKNMVMLT-----DNMD------EGTLGDQLSKCQ 381
>gi|298708373|emb|CBJ48436.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 997
Score = 74.7 bits (182), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 34/89 (38%), Positives = 57/89 (64%), Gaps = 1/89 (1%)
Query: 32 AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVP-SPKVVLVSTPDMECGFSRD 90
+++EWMS+ + ++F+ R+NPF ++V + HSL EL ++ PKVV+ + ++ GFS+
Sbjct: 415 SQLEWMSEDIGRAFDLQRSNPFELRNVHIMHSLEELDELGDDPKVVMATDMSLDFGFSKA 474
Query: 91 LFFQWCSSPENSIIITNRTSPGTLARDLI 119
L +W S N+I++T R T AR LI
Sbjct: 475 LLLRWASGGANTILLTGRGHGNTTARTLI 503
>gi|302833565|ref|XP_002948346.1| hypothetical protein VOLCADRAFT_31342 [Volvox carteri f.
nagariensis]
gi|300266566|gb|EFJ50753.1| hypothetical protein VOLCADRAFT_31342 [Volvox carteri f.
nagariensis]
Length = 375
Score = 70.1 bits (170), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 30/80 (37%), Positives = 47/80 (58%)
Query: 33 KIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLF 92
++E++ +L+++F R+ PF F+ + + L EL P PKVVL P +E G +R+L
Sbjct: 296 QLEYLGSELVQAFSLKRSVPFSFRKLSVITRLEELGAFPGPKVVLAPMPSLESGPARELL 355
Query: 93 FQWCSSPENSIIITNRTSPG 112
QW + P N+II T R G
Sbjct: 356 VQWGALPRNTIIFTERAQVG 375
>gi|67968624|dbj|BAE00671.1| unnamed protein product [Macaca fascicularis]
Length = 341
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 41/116 (35%), Positives = 57/116 (49%), Gaps = 40/116 (34%)
Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
++P H + F+NE +LSDFKQ L R GI EF+ GVL+ C VAVRR
Sbjct: 266 EVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNQVAVRR------------- 311
Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
+++ I LEGCL ++YR++ LLY+QYAI+
Sbjct: 312 --------------------------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 341
>gi|301776931|ref|XP_002923884.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
2-like [Ailuropoda melanoleuca]
Length = 146
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 41/116 (35%), Positives = 57/116 (49%), Gaps = 40/116 (34%)
Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
++P H + F+NE +LSDFKQ L R GI EF+ GVL+ C VAVRR
Sbjct: 71 EVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNQVAVRR------------- 116
Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
+++ I LEGCL ++YR++ LLY+QYAI+
Sbjct: 117 --------------------------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 146
>gi|119601889|gb|EAW81483.1| cleavage and polyadenylation specific factor 2, 100kDa, isoform
CRA_c [Homo sapiens]
Length = 690
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 41/116 (35%), Positives = 57/116 (49%), Gaps = 40/116 (34%)
Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
++P H + F+NE +LSDFKQ L R GI EF+ GVL+ C VAVRR
Sbjct: 615 EVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNQVAVRR------------- 660
Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
+++ I LEGCL ++YR++ LLY+QYAI+
Sbjct: 661 --------------------------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 690
>gi|349603401|gb|AEP99246.1| Cleavage and polyadenylation specificity factor subunit 2-like
protein, partial [Equus caballus]
Length = 327
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 41/116 (35%), Positives = 57/116 (49%), Gaps = 40/116 (34%)
Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
++P H + F+NE +LSDFKQ L R GI EF+ GVL+ C VAVRR
Sbjct: 252 EVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNQVAVRR------------- 297
Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
+++ I LEGCL ++YR++ LLY+QYAI+
Sbjct: 298 --------------------------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 327
>gi|328908757|gb|AEB61046.1| cleavage and polyadenylation specificity factor subunit 2-like
protein, partial [Equus caballus]
Length = 256
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 41/116 (35%), Positives = 57/116 (49%), Gaps = 40/116 (34%)
Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
++P H + F+NE +LSDFKQ L R GI EF+ GVL+ C VAVRR
Sbjct: 181 EVPGHQSVFMNEPRLSDFKQALLREGIQAEFVGGVLV-CNNQVAVRR------------- 226
Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
+++ I LEGCL ++YR++ LLY+QYAI+
Sbjct: 227 --------------------------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 256
>gi|328908891|gb|AEB61113.1| cleavage and polyadenylation specificity factor subunit 2-like
protein, partial [Equus caballus]
Length = 108
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 41/116 (35%), Positives = 57/116 (49%), Gaps = 40/116 (34%)
Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
++P H + F+NE +LSDFKQ L R GI EF+ GVL+ C VAVRR
Sbjct: 33 EVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNQVAVRR------------- 78
Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
+++ I LEGCL ++YR++ LLY+QYAI+
Sbjct: 79 --------------------------TETGPIGLEGCLCQDFYRIRDLLYEQYAIV 108
>gi|350587135|ref|XP_003482353.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
2-like [Sus scrofa]
Length = 272
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 41/116 (35%), Positives = 57/116 (49%), Gaps = 40/116 (34%)
Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
++P H + F+NE +LSDFKQ L R GI EF+ GVL+ C VAVRR
Sbjct: 197 EVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNQVAVRR------------- 242
Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
+++ I LEGCL ++YR++ LLY+QYAI+
Sbjct: 243 --------------------------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 272
>gi|449512224|ref|XP_002198279.2| PREDICTED: cleavage and polyadenylation specificity factor subunit
2-like [Taeniopygia guttata]
Length = 272
Score = 67.8 bits (164), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 43/122 (35%), Positives = 59/122 (48%), Gaps = 45/122 (36%)
Query: 160 EKIPPHD-----TSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTP 214
E +PPH+ + F+NE +LSDFKQ L R GI EF+ GVL+ C VAVRR
Sbjct: 191 EPMPPHEVLGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNLVAVRR------- 242
Query: 215 DMECGFSRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYA 274
+++ I LEGCL ++YR++ LLY+QYA
Sbjct: 243 --------------------------------TETGRIGLEGCLCQDFYRIRDLLYEQYA 270
Query: 275 IL 276
I+
Sbjct: 271 IV 272
>gi|430813604|emb|CCJ29043.1| unnamed protein product [Pneumocystis jirovecii]
gi|430813606|emb|CCJ29045.1| unnamed protein product [Pneumocystis jirovecii]
Length = 772
Score = 67.4 bits (163), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 50/169 (29%), Positives = 87/169 (51%), Gaps = 34/169 (20%)
Query: 32 AKIEWMSDKLMKSFEGARNNPFHFKHVK-LCHSLAELAKVPSPKVVLVSTPDMECGFSRD 90
+ IEWMSD ++ + G+ + F F +VK + + L+ V PKV+L ++ +M+CGFS+
Sbjct: 272 SMIEWMSDSIISEY-GSSGSVFEFTYVKVITNEFQFLSMVSGPKVILATSSNMDCGFSQK 330
Query: 91 LFFQWCSSPENSIIITNRTS---PGTLARDLIE------------------LGGNRTLTL 129
+F + +++I ++ S +L++DL++ L NRT+T
Sbjct: 331 IFLDSIAKDSKNLVILSQKSIYYENSLSKDLLDRWNLAIEHSDQLIPPAVILNFNRTVT- 389
Query: 130 QVKKRIRLEGEELEEYQKKKDKEAKDKQEKEKIPPHDTSFINELQLSDF 178
++ + L G ELE+YQ+K+ K ++EKE I ELQ D
Sbjct: 390 -IRTSVPLVGSELEKYQEKE----KLRREKEA-----AKLIMELQNRDL 428
>gi|412994069|emb|CCO14580.1| predicted protein [Bathycoccus prasinos]
Length = 1092
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 40/135 (29%), Positives = 71/135 (52%), Gaps = 16/135 (11%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVP-SPKVVLVSTPDMECGFSRDLF 92
+EWM++ + F+ R N F K + CHS + ++P PKVVL S +E GF+R LF
Sbjct: 485 LEWMNEDVGLKFDSTRQNVFDVKRLFPCHSYEDFTRLPPGPKVVLASLASLEGGFARKLF 544
Query: 93 FQWCSSPENSIIITNRTSPGT-LARDLIE---LGG-----------NRTLTLQVKKRIRL 137
+W S +N I + LAR+++E GG + + +++ +R L
Sbjct: 545 VEWASDAKNCFIWPDEIGRQVGLAREIVEKCSKGGAKTTSSKTKKKDVIMKVELARRELL 604
Query: 138 EGEELEEYQKKKDKE 152
G+ELE ++ +++++
Sbjct: 605 SGKELEAWEHEQEEK 619
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 32/111 (28%), Positives = 55/111 (49%), Gaps = 39/111 (35%)
Query: 167 TSFINELQLSDFKQTLQRNGIDCEFMD-GVLICCRGTVAVRRVVLVSTPDMECGFSRDLF 225
++F+ +++LS+F++ LQR G+ EF + G L+C G V VRR
Sbjct: 1020 SAFVGDVRLSEFRRYLQRMGVPAEFGEGGALVCANGQVVVRR------------------ 1061
Query: 226 FQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
RA+D + +++EG +SD Y+ V+ +LY QY+I+
Sbjct: 1062 -----------------RAEDDE---LIVEGSISDAYFNVRDMLYAQYSII 1092
>gi|297808389|ref|XP_002872078.1| hypothetical protein ARALYDRAFT_910398 [Arabidopsis lyrata subsp.
lyrata]
gi|297317915|gb|EFH48337.1| hypothetical protein ARALYDRAFT_910398 [Arabidopsis lyrata subsp.
lyrata]
Length = 544
Score = 65.1 bits (157), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 37/118 (31%), Positives = 61/118 (51%), Gaps = 26/118 (22%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+EWMSD + KSFE +R+N F L+S +E GF+R++F
Sbjct: 205 LEWMSDSISKSFETSRDNAF-----------------------LLS---LEAGFAREIFV 238
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKDK 151
+W + P N ++ T GTLAR L + + + + KR+ L GEEL Y++++++
Sbjct: 239 EWANDPRNLVLFTETGQFGTLARMLQSAPPPKFVKVTMSKRVPLAGEELIAYEEEQNR 296
>gi|422294077|gb|EKU21377.1| cleavage and polyadenylation specificity factor subunit 2, partial
[Nannochloropsis gaditana CCMP526]
Length = 429
Score = 64.7 bits (156), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 39/115 (33%), Positives = 60/115 (52%), Gaps = 6/115 (5%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAEL----AKVPSPKVVLVSTPDMECGFSR 89
+E+M ++++ F+ + NPF KHV S+ EL + +P VVL S M+ GFSR
Sbjct: 310 VEYMHPEVIRDFDRSLRNPFSLKHVVPAQSMLELEAAMGEYRNPVVVLASDEGMDTGFSR 369
Query: 90 DLFFQWCSSPENSIIITNRTSPGTLARDLIELG--GNRTLTLQVKKRIRLEGEEL 142
L +W S PEN++++ G+LA +L L+ V R+ GEEL
Sbjct: 370 ALATRWASGPENALLLCGHLRKGSLAESFWKLRHLPKAALSFSVPVIERIVGEEL 424
Score = 39.3 bits (90), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 19/61 (31%), Positives = 33/61 (54%), Gaps = 11/61 (18%)
Query: 208 VVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQ 267
VVL S M+ GFSR L +W S PEN++++ R +G L++ +++++
Sbjct: 355 VVLASDEGMDTGFSRALATRWASGPENALLLCGHLR-----------KGSLAESFWKLRH 403
Query: 268 L 268
L
Sbjct: 404 L 404
>gi|115456655|ref|NP_001051928.1| Os03g0852900 [Oryza sativa Japonica Group]
gi|27573349|gb|AAO20067.1| putative cleavage and polyadenylation specifity factor protein
[Oryza sativa Japonica Group]
gi|29126360|gb|AAO66552.1| putative cleavage and polyadenylation specifity factor [Oryza
sativa Japonica Group]
gi|108712151|gb|ABF99946.1| Cleavage and polyadenylation specificity factor, 73 kDa subunit,
putative, expressed [Oryza sativa Japonica Group]
gi|113550399|dbj|BAF13842.1| Os03g0852900 [Oryza sativa Japonica Group]
gi|125588676|gb|EAZ29340.1| hypothetical protein OsJ_13407 [Oryza sativa Japonica Group]
Length = 700
Score = 64.3 bits (155), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 32/86 (37%), Positives = 51/86 (59%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I M++++ F A++NPFHFKH++ +S+ V P VV+ S ++ G SR LF
Sbjct: 305 INSMNERIRNQF--AQSNPFHFKHIESLNSIDNFHDV-GPSVVMASPGGLQSGLSRQLFD 361
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
+WC+ +NS +I GTLA+ +I
Sbjct: 362 KWCTDKKNSCVIPGYVVEGTLAKTII 387
>gi|125546484|gb|EAY92623.1| hypothetical protein OsI_14368 [Oryza sativa Indica Group]
Length = 700
Score = 64.3 bits (155), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 32/86 (37%), Positives = 51/86 (59%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I M++++ F A++NPFHFKH++ +S+ V P VV+ S ++ G SR LF
Sbjct: 305 INSMNERIRNQF--AQSNPFHFKHIESLNSIDNFHDV-GPSVVMASPGGLQSGLSRQLFD 361
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
+WC+ +NS +I GTLA+ +I
Sbjct: 362 KWCTDKKNSCVIPGYVVEGTLAKTII 387
>gi|387219447|gb|AFJ69432.1| cleavage and polyadenylation specificity factor subunit 2, partial
[Nannochloropsis gaditana CCMP526]
Length = 217
Score = 63.9 bits (154), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 39/115 (33%), Positives = 60/115 (52%), Gaps = 6/115 (5%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAEL----AKVPSPKVVLVSTPDMECGFSR 89
+E+M ++++ F+ + NPF KHV S+ EL + +P VVL S M+ GFSR
Sbjct: 98 VEYMHPEVIRDFDRSLRNPFSLKHVVPAQSMLELEAAMGEYRNPVVVLASDEGMDTGFSR 157
Query: 90 DLFFQWCSSPENSIIITNRTSPGTLARDLIELGG--NRTLTLQVKKRIRLEGEEL 142
L +W S PEN++++ G+LA +L L+ V R+ GEEL
Sbjct: 158 ALATRWASGPENALLLCGHLRKGSLAESFWKLRHLPKAALSFSVPVIERIVGEEL 212
Score = 40.0 bits (92), Expect = 0.96, Method: Compositional matrix adjust.
Identities = 19/61 (31%), Positives = 33/61 (54%), Gaps = 11/61 (18%)
Query: 208 VVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQ 267
VVL S M+ GFSR L +W S PEN++++ R +G L++ +++++
Sbjct: 143 VVLASDEGMDTGFSRALATRWASGPENALLLCGHLR-----------KGSLAESFWKLRH 191
Query: 268 L 268
L
Sbjct: 192 L 192
>gi|224009389|ref|XP_002293653.1| cleavage and polyadenylation specificity factor [Thalassiosira
pseudonana CCMP1335]
gi|220971053|gb|EED89389.1| cleavage and polyadenylation specificity factor [Thalassiosira
pseudonana CCMP1335]
Length = 347
Score = 63.9 bits (154), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 28/80 (35%), Positives = 48/80 (60%), Gaps = 5/80 (6%)
Query: 32 AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKV-----PSPKVVLVSTPDMECG 86
+++EWM++ L F+ R +P+ K V++C S+AEL V +P VL S ++ G
Sbjct: 268 SQLEWMAEPLGAQFDSQRGHPYALKSVRICSSVAELESVIESSNGNPTAVLASGSSLDHG 327
Query: 87 FSRDLFFQWCSSPENSIIIT 106
+RDL +W +P+N ++IT
Sbjct: 328 PARDLLLKWGDNPDNLVLIT 347
>gi|145350779|ref|XP_001419775.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144580007|gb|ABO98068.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 767
Score = 63.5 bits (153), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 40/110 (36%), Positives = 57/110 (51%), Gaps = 12/110 (10%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I ++ + +FE A NPF F HVK +EL V P VVL + ++ G SR+LF
Sbjct: 287 INVLNSDMKAAFEEA--NPFVFNHVKHVSKSSELDDV-GPCVVLATPSMLQSGLSRELFE 343
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELE 143
WC P+N +II + GTLAR+++ V K I +G EL+
Sbjct: 344 SWCEDPKNGVIIADFAVQGTLAREILS---------DVNKIIARDGRELQ 384
>gi|320163324|gb|EFW40223.1| CPSF3 protein [Capsaspora owczarzaki ATCC 30864]
Length = 802
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 36/119 (30%), Positives = 64/119 (53%), Gaps = 6/119 (5%)
Query: 3 DKLMKSFEGARNNPFHFKH--VKLCHSLAELAKIEWMSDKLMKSFEGARNNPFHFKHVKL 60
D+ +S N P ++ + C ++ + I M++++ + F A +NPF FKH+
Sbjct: 280 DEFWESKPALHNIPIYYASSLARKCMAIYQ-TYINQMNERIRRQF--AISNPFMFKHIAS 336
Query: 61 CHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLI 119
S +E+ + P V++ S ++ G SRDLF QWC N +I+T + GTLA+ ++
Sbjct: 337 IKSASEIDQ-SGPMVMMASPGMLQNGLSRDLFEQWCPDSRNGVIVTGYSVEGTLAKSIL 394
>gi|226497180|ref|NP_001146407.1| uncharacterized protein LOC100279987 [Zea mays]
gi|219887045|gb|ACL53897.1| unknown [Zea mays]
gi|414873991|tpg|DAA52548.1| TPA: hypothetical protein ZEAMMB73_264007 [Zea mays]
Length = 697
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 51/86 (59%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I M++++ F A++NPFHFKH++ +S+ V P VV+ S ++ G SR LF
Sbjct: 302 INSMNERIRNQF--AQSNPFHFKHIESLNSIDNFHDV-GPSVVMASPGGLQSGLSRQLFD 358
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
+WC+ +N+ +I GTLA+ +I
Sbjct: 359 KWCTDKKNACVIPGYVVEGTLAKTII 384
>gi|242032211|ref|XP_002463500.1| hypothetical protein SORBIDRAFT_01g000850 [Sorghum bicolor]
gi|241917354|gb|EER90498.1| hypothetical protein SORBIDRAFT_01g000850 [Sorghum bicolor]
Length = 695
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 51/86 (59%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I M++++ F A++NPFHFKH++ +S+ V P VV+ S ++ G SR LF
Sbjct: 302 INSMNERIRNQF--AQSNPFHFKHIESLNSIDNFHDV-GPSVVMASPGGLQSGLSRQLFD 358
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
+WC+ +N+ +I GTLA+ +I
Sbjct: 359 KWCTDKKNACVIPGYVVEGTLAKTII 384
>gi|357117889|ref|XP_003560694.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
3-I-like [Brachypodium distachyon]
Length = 690
Score = 62.8 bits (151), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 51/86 (59%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I M++++ F A++NPFHFKH++ +S+ V P VV+ S ++ G SR LF
Sbjct: 295 INSMNERIRNQF--AQSNPFHFKHIEPLNSIDNFHDV-GPSVVMASPGSLQSGLSRQLFD 351
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
+WC+ +N+ +I GTLA+ +I
Sbjct: 352 KWCTDKKNTCVIPGYVIEGTLAKTII 377
>gi|156840674|ref|XP_001643716.1| hypothetical protein Kpol_1009p4 [Vanderwaltozyma polyspora DSM
70294]
gi|156114339|gb|EDO15858.1| hypothetical protein Kpol_1009p4 [Vanderwaltozyma polyspora DSM
70294]
Length = 778
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 34/108 (31%), Positives = 56/108 (51%), Gaps = 4/108 (3%)
Query: 11 GARNNPFHFKH--VKLCHSLAELAKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELA 68
G+ P ++ K C S+ + + M+D + K F ++ NPF FKH+ +L E
Sbjct: 267 GSGQVPIYYASNLAKKCMSVYQ-TYVNMMNDDIRKKFRDSQTNPFIFKHISYLKNLDEFQ 325
Query: 69 KVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLAR 116
P V+L S ++ G SRDL +WC +N ++IT + GT+A+
Sbjct: 326 DF-GPSVMLASPGMLQNGLSRDLLEKWCPEDKNMVLITGYSVEGTMAK 372
>gi|226505292|ref|NP_001151522.1| cleavage and polyadenylation specificity factor, 73 kDa subunit
[Zea mays]
gi|195647398|gb|ACG43167.1| cleavage and polyadenylation specificity factor, 73 kDa subunit
[Zea mays]
gi|224034229|gb|ACN36190.1| unknown [Zea mays]
gi|413932397|gb|AFW66948.1| cleavage and polyadenylation specificity factor, subunit [Zea mays]
Length = 694
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 50/86 (58%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I M++++ F A++NPFHFKH++ +S+ V P VV+ S ++ G SR LF
Sbjct: 302 INSMNERIRNQF--AQSNPFHFKHIESLNSIDNFHDV-GPSVVMASPSGLQSGLSRQLFD 358
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
+WC+ N+ +I GTLA+ +I
Sbjct: 359 KWCTDKRNACVIPGYVVEGTLAKTII 384
>gi|156379813|ref|XP_001631650.1| predicted protein [Nematostella vectensis]
gi|156218694|gb|EDO39587.1| predicted protein [Nematostella vectensis]
Length = 688
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 33/86 (38%), Positives = 49/86 (56%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+DK+ K A +NPF FKH+ S+ + + P VV+ S M+ G SR+LF
Sbjct: 295 VNAMNDKIKKQI--AISNPFVFKHISNLKSIDQFDDI-GPSVVMASPGMMQSGLSRELFE 351
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
QWC+ N +II GTLA++L+
Sbjct: 352 QWCTDRRNGVIIAGYCVEGTLAKNLM 377
>gi|301787149|ref|XP_002928998.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
2-like, partial [Ailuropoda melanoleuca]
Length = 145
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 23/38 (60%), Positives = 32/38 (84%)
Query: 32 AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAK 69
+++EWMSDKLM+ FE RNNPF F+H+ LCH L++LA+
Sbjct: 108 SQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLAR 145
Score = 53.9 bits (128), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 21/33 (63%), Positives = 27/33 (81%)
Query: 1 MSDKLMKSFEGARNNPFHFKHVKLCHSLAELAK 33
MSDKLM+ FE RNNPF F+H+ LCH L++LA+
Sbjct: 113 MSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLAR 145
>gi|308807807|ref|XP_003081214.1| mRNA cleavage and polyadenylation factor II complex, BRR5 (CPSF
subunit) (ISS) [Ostreococcus tauri]
gi|116059676|emb|CAL55383.1| mRNA cleavage and polyadenylation factor II complex, BRR5 (CPSF
subunit) (ISS) [Ostreococcus tauri]
Length = 572
Score = 62.0 bits (149), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 39/110 (35%), Positives = 57/110 (51%), Gaps = 12/110 (10%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I ++ + +FE A NPF F HVK +EL V P VVL + ++ G SR+LF
Sbjct: 275 INVLNADMKAAFEEA--NPFVFNHVKHISKASELDDV-GPCVVLATPSMLQSGLSRELFE 331
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELE 143
WC P+N +II + GTLAR+++ V K +G+EL+
Sbjct: 332 SWCEEPKNGVIIADFAVQGTLAREILS---------DVNKVTARDGQELQ 372
>gi|385305954|gb|EIF49896.1| mrna cleavage and polyadenylation specificity factor complex
subunit ysh1 [Dekkera bruxellensis AWRI1499]
Length = 295
Score = 62.0 bits (149), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 37/120 (30%), Positives = 64/120 (53%), Gaps = 6/120 (5%)
Query: 3 DKLMKSFEGARNNPFHFKH--VKLCHSLAELAKIEWMSDKLMKSFEGARNNPFHFKHVK- 59
D+ + + +N P ++ K C ++ E I M+D + K F NPFHFK++K
Sbjct: 169 DEYWQRHKDLQNVPIYYASSLAKKCMAVYE-RYINMMNDSIRKKFTETNENPFHFKYIKN 227
Query: 60 LCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLI 119
+ H A+ +P V++ S ++ G SR L +WC P N++I+T + GT+A+ L+
Sbjct: 228 VAH--ADRIDDLNPCVMIASPGMLQNGVSRQLLEKWCPDPRNTVIMTGYSVDGTMAKKLL 285
>gi|356543411|ref|XP_003540154.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
3-I-like [Glycine max]
Length = 689
Score = 61.6 bits (148), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 37/109 (33%), Positives = 59/109 (54%), Gaps = 8/109 (7%)
Query: 13 RNNPFHFKH--VKLCHSLAELAKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKV 70
+N P ++ K C ++ E + M+D++ + A++NPF FKHV S+ E+ K
Sbjct: 273 QNIPIYYASPLAKKCLTVYETYTLS-MNDRI----QNAKSNPFSFKHVSALSSI-EVFKD 326
Query: 71 PSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLI 119
P VV+ S ++ G SR LF WCS +NS ++ GTLA+ +I
Sbjct: 327 VGPSVVMASPGGLQSGLSRQLFDMWCSDKKNSCVLPGYVVEGTLAKTII 375
>gi|170060909|ref|XP_001866010.1| cleavage and polyadenylation specificity factor [Culex
quinquefasciatus]
gi|167879247|gb|EDS42630.1| cleavage and polyadenylation specificity factor [Culex
quinquefasciatus]
Length = 688
Score = 61.6 bits (148), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 32/86 (37%), Positives = 47/86 (54%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I M+DK+ + A NNPF F+H+ + + P VV+ S M+ G SR+LF
Sbjct: 293 INAMNDKIRRQI--AVNNPFVFRHISNLKGIDHFEDI-GPCVVMASPGMMQSGLSRELFE 349
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WCS P+N +II GTLA+ ++
Sbjct: 350 TWCSDPKNGVIIAGYCVEGTLAKTVL 375
Score = 40.0 bits (92), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 16/33 (48%), Positives = 21/33 (63%)
Query: 208 VVLVSTPDMECGFSRDLFFQWCSSPENSIIITN 240
VV+ S M+ G SR+LF WCS P+N +II
Sbjct: 331 VVMASPGMMQSGLSRELFETWCSDPKNGVIIAG 363
>gi|50549403|ref|XP_502172.1| YALI0C23232p [Yarrowia lipolytica]
gi|49648039|emb|CAG82492.1| YALI0C23232p [Yarrowia lipolytica CLIB122]
Length = 799
Score = 61.6 bits (148), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 48/156 (30%), Positives = 76/156 (48%), Gaps = 20/156 (12%)
Query: 22 VKLCHSLAEL-----AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKV---PSP 73
V L H A L + +EWMS ++ +E +PF ++ K+ HS+ + KV +
Sbjct: 283 VLLTHLGARLLEYASSMLEWMSPSIIAEWEKKNESPFQTRNFKIVHSMDQFDKVVKGGNG 342
Query: 74 KVVLVSTP-DMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDL------IELGGNRT 126
+ V+VS D+E GFSR LF + S NS++ T R+ +LA +L E GN
Sbjct: 343 QFVVVSVGEDLESGFSRLLFNRLASDERNSVLFTERSEGNSLATELQDKWEKTERDGNSA 402
Query: 127 -----LTLQVKKRIRLEGEELEEYQKKKDKEAKDKQ 157
TL++ L E++EY+ + + KD Q
Sbjct: 403 KMDFQTTLKMPTYTPLSEAEMKEYRTTVESQQKDLQ 438
>gi|363750442|ref|XP_003645438.1| hypothetical protein Ecym_3113 [Eremothecium cymbalariae
DBVPG#7215]
gi|356889072|gb|AET38621.1| Hypothetical protein Ecym_3113 [Eremothecium cymbalariae
DBVPG#7215]
Length = 773
Score = 61.6 bits (148), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 49/86 (56%), Gaps = 1/86 (1%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+DK+ K F ++ NPF FK++ +L E P V+L S ++ G SRDL
Sbjct: 295 VNMMNDKIRKKFRDSQTNPFIFKNISYLKNLDEFQDF-GPSVMLASPGMLQNGLSRDLLE 353
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
+WC +N ++IT + GT+A+ LI
Sbjct: 354 KWCPDEKNLVLITGYSVEGTMAKFLI 379
>gi|156343760|ref|XP_001621104.1| hypothetical protein NEMVEDRAFT_v1g222359 [Nematostella vectensis]
gi|156206741|gb|EDO29004.1| predicted protein [Nematostella vectensis]
Length = 388
Score = 61.2 bits (147), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 32/85 (37%), Positives = 48/85 (56%), Gaps = 3/85 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+DK+ K A +NPF FKH+ S+ + + P VV+ S M+ G SR+LF
Sbjct: 221 VNAMNDKIKKQI--AISNPFVFKHISNLKSIDQFDDI-GPSVVMASPGMMQSGLSRELFE 277
Query: 94 QWCSSPENSIIITNRTSPGTLARDL 118
QWC+ N +II GTLA+++
Sbjct: 278 QWCTDRRNGVIIAGYCVEGTLAKEV 302
>gi|326487902|dbj|BAJ89790.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 704
Score = 61.2 bits (147), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 30/86 (34%), Positives = 50/86 (58%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I M++++ F A++NPFHFKH+ +S+ V P VV+ S ++ G SR LF
Sbjct: 310 INSMNERIRNQF--AQSNPFHFKHIDPLNSIDNFHDV-GPSVVMASPGSLQSGLSRQLFD 366
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
+WC+ +N+ +I G+LA+ +I
Sbjct: 367 KWCTDKKNTCVIPGYAVEGSLAKTII 392
>gi|326495416|dbj|BAJ85804.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 704
Score = 61.2 bits (147), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 30/86 (34%), Positives = 50/86 (58%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I M++++ F A++NPFHFKH+ +S+ V P VV+ S ++ G SR LF
Sbjct: 310 INSMNERIRNQF--AQSNPFHFKHIDPLNSIDNFHDV-GPSVVMASPGSLQSGLSRQLFD 366
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
+WC+ +N+ +I G+LA+ +I
Sbjct: 367 KWCTDKKNTCVIPGYAVEGSLAKTII 392
>gi|326508058|dbj|BAJ86772.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 704
Score = 61.2 bits (147), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 30/86 (34%), Positives = 50/86 (58%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I M++++ F A++NPFHFKH+ +S+ V P VV+ S ++ G SR LF
Sbjct: 310 INSMNERIRNQF--AQSNPFHFKHIDPLNSIDNFHDV-GPSVVMASPGSLQSGLSRQLFD 366
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
+WC+ +N+ +I G+LA+ +I
Sbjct: 367 KWCTDKKNTCVIPGYAVEGSLAKTII 392
>gi|213512037|ref|NP_001133354.1| cleavage and polyadenylation specificity factor subunit 3 [Salmo
salar]
gi|209151738|gb|ACI33081.1| Cleavage and polyadenylation specificity factor subunit 3 [Salmo
salar]
Length = 690
Score = 61.2 bits (147), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 47/86 (54%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+DK+ K+ NNPF FKH+ S+ + P VV+ S M+ G SR+LF
Sbjct: 293 VNAMNDKIRKAIN--VNNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 349
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ N +II + GTLA+ ++
Sbjct: 350 SWCTDKRNGVIIAGYSVEGTLAKHIM 375
>gi|157117185|ref|XP_001652976.1| cleavage and polyadenylation specificity factor [Aedes aegypti]
gi|108876120|gb|EAT40345.1| AAEL007904-PA [Aedes aegypti]
Length = 687
Score = 60.8 bits (146), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 47/86 (54%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I M+DK+ + A NNPF F+H+ + + P VV+ S M+ G SR+LF
Sbjct: 293 INAMNDKIRRQI--AVNNPFVFRHISNLKGIDHFEDI-GPCVVMASPGMMQSGLSRELFE 349
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ P+N +II GTLA+ ++
Sbjct: 350 TWCTDPKNGVIIAGYCVEGTLAKTIL 375
>gi|378733596|gb|EHY60055.1| hypothetical protein HMPREF1120_08027 [Exophiala dermatitidis
NIH/UT8656]
Length = 948
Score = 60.5 bits (145), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 52/196 (26%), Positives = 80/196 (40%), Gaps = 60/196 (30%)
Query: 4 KLMKSFEGARNNPFHFKHVKLCHSLAELAKIEWMSDKLMKSFEGARNNP----------- 52
K +K + +R+ +H + + +EWM D +++ FEG NP
Sbjct: 304 KDVKVYMASRSGNATLRHAR--------SLLEWMDDSIVREFEGEDENPTTQPYNRRGGN 355
Query: 53 -----------FHFKHVKLC---HSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSS 98
F FK+VK+ H L +L KV P+V+L S ++ GFSR L
Sbjct: 356 KAAGTNKPSRPFEFKNVKVVERKHQLEKLLKVEGPRVILASDVTLDWGFSRSLLEHVVQK 415
Query: 99 PENSIIITNR-------TSPG------------TLARDLIELG--------GNRTLTLQV 131
PEN +I+T R SPG +A + E G G R L L+
Sbjct: 416 PENLVILTERLNVRPGSESPGQAFWQWFEQRQDGVALERTEGGGQLEQVHSGGRMLKLKN 475
Query: 132 KKRIRLEGEELEEYQK 147
++ L +E + YQ+
Sbjct: 476 PEKAPLSAQESQRYQQ 491
>gi|374110195|gb|AEY99100.1| FAGR279Cp [Ashbya gossypii FDAG1]
Length = 771
Score = 60.5 bits (145), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 30/86 (34%), Positives = 49/86 (56%), Gaps = 1/86 (1%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+DK+ K F ++ NPF FK++ +L E P V+L S ++ G SRDL
Sbjct: 295 VNMMNDKIRKKFRDSQTNPFIFKNISYLKNLDEFQDF-GPSVMLASPGMLQNGLSRDLLE 353
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
+WC +N ++IT + GT+A+ L+
Sbjct: 354 KWCPDEKNLVLITGYSVEGTMAKFLM 379
>gi|302309512|ref|NP_986945.2| AGR279Cp [Ashbya gossypii ATCC 10895]
gi|442570103|sp|Q74ZC0.2|YSH1_ASHGO RecName: Full=Endoribonuclease YSH1; AltName: Full=mRNA
3'-end-processing protein YSH1
gi|299788393|gb|AAS54769.2| AGR279Cp [Ashbya gossypii ATCC 10895]
Length = 771
Score = 60.5 bits (145), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 30/86 (34%), Positives = 49/86 (56%), Gaps = 1/86 (1%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+DK+ K F ++ NPF FK++ +L E P V+L S ++ G SRDL
Sbjct: 295 VNMMNDKIRKKFRDSQTNPFIFKNISYLKNLDEFQDF-GPSVMLASPGMLQNGLSRDLLE 353
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
+WC +N ++IT + GT+A+ L+
Sbjct: 354 KWCPDEKNLVLITGYSVEGTMAKFLM 379
>gi|240975718|ref|XP_002402161.1| cleavage and polyadenylation specificity factor, putative [Ixodes
scapularis]
gi|215491113|gb|EEC00754.1| cleavage and polyadenylation specificity factor, putative [Ixodes
scapularis]
Length = 694
Score = 60.5 bits (145), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 29/70 (41%), Positives = 40/70 (57%), Gaps = 1/70 (1%)
Query: 50 NNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRT 109
NNPF FKH+ S+ V P VV+ S M+ G SR+LF WC+ P+N +II
Sbjct: 313 NNPFVFKHISNLKSIEHFEDV-GPCVVMASPGMMQSGLSRELFESWCTDPKNGVIIAGYC 371
Query: 110 SPGTLARDLI 119
GTLA+ ++
Sbjct: 372 VEGTLAKTIL 381
Score = 38.9 bits (89), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 15/33 (45%), Positives = 21/33 (63%)
Query: 208 VVLVSTPDMECGFSRDLFFQWCSSPENSIIITN 240
VV+ S M+ G SR+LF WC+ P+N +II
Sbjct: 337 VVMASPGMMQSGLSRELFESWCTDPKNGVIIAG 369
>gi|321461562|gb|EFX72593.1| hypothetical protein DAPPUDRAFT_308207 [Daphnia pulex]
Length = 689
Score = 60.5 bits (145), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 32/86 (37%), Positives = 47/86 (54%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I M+DK+ + A NNPF FKH+ + + V P V++ S M+ G SR+LF
Sbjct: 292 INAMNDKIRRQI--AINNPFIFKHISSLKGIDQFEDV-GPCVIMASPGMMQSGLSRELFE 348
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ P+N II GTLA+ ++
Sbjct: 349 AWCTDPKNGCIIAGYCVEGTLAKHVL 374
>gi|385304166|gb|EIF48196.1| subunit of mrna cleavage and polyadenylation [Dekkera bruxellensis
AWRI1499]
Length = 664
Score = 60.5 bits (145), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 46/179 (25%), Positives = 80/179 (44%), Gaps = 20/179 (11%)
Query: 13 RNNPFHFKHVKLCHSLAELAK-IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVP 71
R PF+ SL A +EWMS + K ++ PF ++LC ++ +L+K
Sbjct: 41 REVPFYLVAYTGIQSLKSSASMLEWMSPDVTKDWDNENQTPFDSTRIELC-TVEDLSKHT 99
Query: 72 SPKVVLVSTPDMECG-FSRDLFFQWCSSPENSIIITNRTSPGTLARDL-------IELGG 123
PKVV V + + G +R +F + CS ++ +T R GT+ D+ +
Sbjct: 100 GPKVVFVESLGFDEGSLARSVFVELCSKQNTALFLTERPPDGTVLYDIYNTWEETVXXDS 159
Query: 124 N----------RTLTLQVKKRIRLEGEELEEYQKKKDKEAKDKQEKEKIPPHDTSFINE 172
N + LTL + K L G+EL+ Y D+ +Q++E ++ +N+
Sbjct: 160 NLKDGSLIIFQKQLTLSMIKETXLRGQELKAYLSXVDQRRXKRQKEELEEXKNSHLLND 218
>gi|365990355|ref|XP_003672007.1| hypothetical protein NDAI_0I01950 [Naumovozyma dairenensis CBS 421]
gi|343770781|emb|CCD26764.1| hypothetical protein NDAI_0I01950 [Naumovozyma dairenensis CBS 421]
Length = 757
Score = 60.5 bits (145), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 49/86 (56%), Gaps = 1/86 (1%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+D + K F ++ NPF FK++ +L E + P V+L S ++ G SRDL
Sbjct: 258 VNMMNDDIRKKFRDSQTNPFIFKNISYLKNLEEFQDL-GPSVMLASPGMLQSGLSRDLLE 316
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
+WC +N ++IT + GT+A+ L+
Sbjct: 317 RWCPDEKNLVLITGYSIEGTMAKYLM 342
>gi|224140921|ref|XP_002323825.1| predicted protein [Populus trichocarpa]
gi|222866827|gb|EEF03958.1| predicted protein [Populus trichocarpa]
Length = 696
Score = 60.5 bits (145), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 30/83 (36%), Positives = 48/83 (57%), Gaps = 3/83 (3%)
Query: 37 MSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWC 96
M++++ F A +NPF FKH+ +S+ + + V P VV+ S ++ G SR LF WC
Sbjct: 302 MNERIRNQF--ANSNPFKFKHISPLNSIEDFSDV-GPSVVMASPGGLQSGLSRQLFDMWC 358
Query: 97 SSPENSIIITNRTSPGTLARDLI 119
S +N+ +I GTLA+ +I
Sbjct: 359 SDKKNACVIPGYVVEGTLAKTII 381
>gi|412990885|emb|CCO18257.1| predicted protein [Bathycoccus prasinos]
Length = 825
Score = 60.5 bits (145), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 30/86 (34%), Positives = 50/86 (58%), Gaps = 2/86 (2%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I ++ + + FE NPF+FKHV+ + ++L P VVL + ++ G SR+LF
Sbjct: 338 INVLNADIKRQFE--EKNPFNFKHVQSLNRASDLDGNTGPCVVLATPSMLQSGTSRELFE 395
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC S +N ++I + GTLAR+++
Sbjct: 396 NWCESSDNGVVICDFAVQGTLAREIL 421
>gi|328704356|ref|XP_001945120.2| PREDICTED: cleavage and polyadenylation specificity factor subunit
3-like [Acyrthosiphon pisum]
Length = 694
Score = 60.1 bits (144), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 47/86 (54%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I M+D++ + A NNPF FKH+ S+ + P V++ S ME G SR+LF
Sbjct: 296 INAMNDRIKRQI--AVNNPFVFKHITNLKSIDHFEDI-GPCVIMASPGVMESGLSRELFE 352
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ +N +II GTLA+ ++
Sbjct: 353 MWCTDSKNGVIIAGYVVQGTLAKAIL 378
>gi|19112240|ref|NP_595448.1| cleavage factor two Cft2/polyadenylation factor CPSF-73 (predicted)
[Schizosaccharomyces pombe 972h-]
gi|74582548|sp|O74740.1|CFT2_SCHPO RecName: Full=Cleavage factor two protein 2
gi|3738153|emb|CAA21254.1| cleavage factor two Cft2/polyadenylation factor CPSF-73 (predicted)
[Schizosaccharomyces pombe]
Length = 797
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 40/145 (27%), Positives = 70/145 (48%), Gaps = 24/145 (16%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKV-PSPKVVLVSTPDMECGFSRDLF 92
IEWM D +++ F G N F+++ ++++ + P PKV+L + +ECGFS+ +
Sbjct: 275 IEWMGDNIVRDF-GINENLLEFRNINTITDFSQISHIGPGPKVILATALTLECGFSQRIL 333
Query: 93 FQWCSSPENSIII---TNRTSPGTLARDLIE---------------LGGNRTLTLQVKKR 134
S N +I+ +R +LA I +G +++K +
Sbjct: 334 LDLMSENSNDLILFTQRSRCPQNSLANQFIRYWERASKKKRDIPHPVGLYAEQAVKIKTK 393
Query: 135 IRLEGEELEEYQK----KKDKEAKD 155
LEGEEL YQ+ K++K+A+D
Sbjct: 394 EPLEGEELRSYQELEFSKRNKDAED 418
>gi|432954006|ref|XP_004085503.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
3-like [Oryzias latipes]
Length = 686
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 32/86 (37%), Positives = 46/86 (53%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I M+DK+ K+ NNPF FKH+ S+ + P VV+ S M+ G SR+LF
Sbjct: 293 INAMNDKIRKAIN--VNNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 349
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ N +II GTLA+ ++
Sbjct: 350 SWCTDKRNGVIIAGYCVEGTLAKHIM 375
>gi|190346159|gb|EDK38177.2| hypothetical protein PGUG_02275 [Meyerozyma guilliermondii ATCC
6260]
Length = 770
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 27/78 (34%), Positives = 48/78 (61%), Gaps = 1/78 (1%)
Query: 42 MKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPEN 101
+ S G ++NPF FKH+KL SL + + P VV+ + ++ G SR+L +W +N
Sbjct: 307 VSSASGGKSNPFQFKHIKLIRSLDKFQDI-GPCVVVAAPGMLQNGVSRELLERWAPDAKN 365
Query: 102 SIIITNRTSPGTLARDLI 119
++I+T + GT+A++L+
Sbjct: 366 AVIMTGYSVEGTMAKELL 383
>gi|357445375|ref|XP_003592965.1| Cleavage and polyadenylation specificity factor subunit 3-I
[Medicago truncatula]
gi|357445453|ref|XP_003593004.1| Cleavage and polyadenylation specificity factor subunit 3-I
[Medicago truncatula]
gi|355482013|gb|AES63216.1| Cleavage and polyadenylation specificity factor subunit 3-I
[Medicago truncatula]
gi|355482052|gb|AES63255.1| Cleavage and polyadenylation specificity factor subunit 3-I
[Medicago truncatula]
Length = 690
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 35/109 (32%), Positives = 59/109 (54%), Gaps = 8/109 (7%)
Query: 13 RNNPFHFKH--VKLCHSLAELAKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKV 70
+N P ++ K C ++ E + M+D++ + A++NPF FKH+ S+ ++ K
Sbjct: 274 QNIPIYYASPLAKKCLTVYETYTLS-MNDRI----QNAKSNPFAFKHISALSSI-DIFKD 327
Query: 71 PSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLI 119
P VV+ S ++ G SR LF WCS +NS +I GTLA+ ++
Sbjct: 328 VGPSVVMASPGGLQSGLSRQLFDMWCSDKKNSCVIPGYVVEGTLAKTIL 376
>gi|410928245|ref|XP_003977511.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
3-like [Takifugu rubripes]
Length = 696
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 32/86 (37%), Positives = 46/86 (53%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I M+DK+ K+ NNPF FKH+ S+ + P VV+ S M+ G SR+LF
Sbjct: 293 INAMNDKIRKAIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 349
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ N +II GTLA+ ++
Sbjct: 350 SWCTDKRNGVIIAGYCVEGTLAKHIM 375
>gi|357114659|ref|XP_003559115.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
3-I-like [Brachypodium distachyon]
Length = 768
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 30/86 (34%), Positives = 51/86 (59%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I M++++ F A++NPFHFKH++ +S+ V P VV+ S ++ G SR LF
Sbjct: 373 INSMNERIRNQF--AQSNPFHFKHIEPLNSIDNFHDV-GPSVVMASPGTLQSGLSRQLFD 429
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
+WC+ +N+ +I GTL++ +I
Sbjct: 430 KWCTDKKNTCVIPGFVIEGTLSKTII 455
>gi|427779921|gb|JAA55412.1| Putative cleavage and polyadenylation specificity factor cpsf
subunit [Rhipicephalus pulchellus]
Length = 737
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 28/70 (40%), Positives = 40/70 (57%), Gaps = 1/70 (1%)
Query: 50 NNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRT 109
NNPF FKH+ S+ + P VV+ S M+ G SR+LF WC+ P+N +II
Sbjct: 356 NNPFVFKHISNLKSIEHFEDI-GPCVVMASPGMMQSGLSRELFESWCTDPKNGVIIAGYC 414
Query: 110 SPGTLARDLI 119
GTLA+ ++
Sbjct: 415 VEGTLAKTIL 424
Score = 38.9 bits (89), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 15/33 (45%), Positives = 21/33 (63%)
Query: 208 VVLVSTPDMECGFSRDLFFQWCSSPENSIIITN 240
VV+ S M+ G SR+LF WC+ P+N +II
Sbjct: 380 VVMASPGMMQSGLSRELFESWCTDPKNGVIIAG 412
>gi|346466613|gb|AEO33151.1| hypothetical protein [Amblyomma maculatum]
Length = 618
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 28/70 (40%), Positives = 40/70 (57%), Gaps = 1/70 (1%)
Query: 50 NNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRT 109
NNPF FKH+ S+ + P VV+ S M+ G SR+LF WC+ P+N +II
Sbjct: 237 NNPFVFKHISNLKSIEHFEDI-GPCVVMASPGMMQSGLSRELFESWCTDPKNGVIIAGYC 295
Query: 110 SPGTLARDLI 119
GTLA+ ++
Sbjct: 296 VEGTLAKTIL 305
Score = 38.9 bits (89), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 15/33 (45%), Positives = 21/33 (63%)
Query: 208 VVLVSTPDMECGFSRDLFFQWCSSPENSIIITN 240
VV+ S M+ G SR+LF WC+ P+N +II
Sbjct: 261 VVMASPGMMQSGLSRELFESWCTDPKNGVIIAG 293
>gi|395828536|ref|XP_003787428.1| PREDICTED: LOW QUALITY PROTEIN: cleavage and polyadenylation
specificity factor subunit 3 [Otolemur garnettii]
Length = 634
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 32/86 (37%), Positives = 45/86 (52%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+DK+ K NNPF FKH+ S+ + P VV+ S M+ G SR+LF
Sbjct: 285 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 341
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ N +II GTLA+ L+
Sbjct: 342 SWCTDKRNGVIIAGYCVEGTLAKILV 367
>gi|384252038|gb|EIE25515.1| Metallo-hydrolase/oxidoreductase [Coccomyxa subellipsoidea C-169]
Length = 696
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 41/155 (26%), Positives = 72/155 (46%), Gaps = 4/155 (2%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I M+D + +F G NPF+FK++ + L V P VVL + ++ G SR+LF
Sbjct: 285 IAMMNDDIKAAF-GQSANPFNFKYITELKTQGGLDDV-GPCVVLATPSMLQSGLSRELFD 342
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKDKEA 153
WC N +II + GTLARD++ + K +R+ E + + D +
Sbjct: 343 AWCEDKRNGVIIADFAVQGTLARDILASPSHVLTKAGAKVPLRMSVEHI-SFSAHADFDQ 401
Query: 154 KDKQEKEKIPPHDTSFINE-LQLSDFKQTLQRNGI 187
+ + PPH E +++ ++ L++ +
Sbjct: 402 TSQFVELLDPPHVILVHGEAVEMGRLRKALEQQAV 436
>gi|344232758|gb|EGV64631.1| Metallo-hydrolase/oxidoreductase [Candida tenuis ATCC 10573]
Length = 782
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 27/71 (38%), Positives = 42/71 (59%), Gaps = 1/71 (1%)
Query: 49 RNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 108
R NPF FK++K SL P VV+ S ++ GFSR L +W P+N++++T
Sbjct: 313 RKNPFTFKYIKTLKSLDRFQDF-GPSVVIASPGMLQSGFSRQLLEKWAPDPKNTVLMTGY 371
Query: 109 TSPGTLARDLI 119
+ GT+A+DL+
Sbjct: 372 SVEGTMAKDLL 382
>gi|195343244|ref|XP_002038208.1| GM18692 [Drosophila sechellia]
gi|194133058|gb|EDW54626.1| GM18692 [Drosophila sechellia]
Length = 684
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 47/86 (54%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I M+D++ + A NNPF F+H+ + + P V++ S M+ G SR+LF
Sbjct: 292 INAMNDRIRRQI--AVNNPFVFRHISNLKGIDHFEDI-GPCVIMASPGMMQSGLSRELFE 348
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ P+N +II GTLA+ ++
Sbjct: 349 SWCTDPKNGVIIAGYCVEGTLAKTVL 374
>gi|194743214|ref|XP_001954095.1| GF18101 [Drosophila ananassae]
gi|190627132|gb|EDV42656.1| GF18101 [Drosophila ananassae]
Length = 684
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 47/86 (54%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I M+D++ + A NNPF F+H+ + + P V++ S M+ G SR+LF
Sbjct: 292 INAMNDRIRRQI--AVNNPFVFRHISNLKGIDHFEDI-GPCVIMASPGMMQSGLSRELFE 348
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ P+N +II GTLA+ ++
Sbjct: 349 SWCTDPKNGVIIAGYCVEGTLAKTIL 374
>gi|195569857|ref|XP_002102925.1| GD20157 [Drosophila simulans]
gi|194198852|gb|EDX12428.1| GD20157 [Drosophila simulans]
Length = 684
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 47/86 (54%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I M+D++ + A NNPF F+H+ + + P V++ S M+ G SR+LF
Sbjct: 292 INAMNDRIRRQI--AVNNPFVFRHISNLKGIDHFEDI-GPCVIMASPGMMQSGLSRELFE 348
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ P+N +II GTLA+ ++
Sbjct: 349 SWCTDPKNGVIIAGYCVEGTLAKTVL 374
>gi|198451826|ref|XP_001358526.2| GA20526 [Drosophila pseudoobscura pseudoobscura]
gi|198131664|gb|EAL27667.2| GA20526 [Drosophila pseudoobscura pseudoobscura]
Length = 684
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 47/86 (54%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I M+D++ + A NNPF F+H+ + + P V++ S M+ G SR+LF
Sbjct: 292 INAMNDRIRRQI--AVNNPFVFRHISNLKGIDHFEDI-GPCVIMASPGMMQSGLSRELFE 348
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ P+N +II GTLA+ ++
Sbjct: 349 SWCTDPKNGVIIAGYCVEGTLAKTIL 374
>gi|195145744|ref|XP_002013850.1| GL23169 [Drosophila persimilis]
gi|194102793|gb|EDW24836.1| GL23169 [Drosophila persimilis]
Length = 684
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 47/86 (54%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I M+D++ + A NNPF F+H+ + + P V++ S M+ G SR+LF
Sbjct: 292 INAMNDRIRRQI--AVNNPFVFRHISNLKGIDHFEDI-GPCVIMASPGMMQSGLSRELFE 348
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ P+N +II GTLA+ ++
Sbjct: 349 SWCTDPKNGVIIAGYCVEGTLAKTIL 374
>gi|221106537|ref|XP_002161150.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
3-like [Hydra magnipapillata]
Length = 677
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 47/86 (54%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I M++K+ + + +NPF FKH+ + + P VVL S M+ G SR+LF
Sbjct: 284 ISAMNEKIRRQI--SISNPFVFKHISNLKGIDSFDDI-GPSVVLASPGMMQSGLSRELFE 340
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ P N +II GTLA++L+
Sbjct: 341 TWCTDPRNGVIIAGYCVEGTLAKELM 366
Score = 39.3 bits (90), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 16/33 (48%), Positives = 20/33 (60%)
Query: 208 VVLVSTPDMECGFSRDLFFQWCSSPENSIIITN 240
VVL S M+ G SR+LF WC+ P N +II
Sbjct: 322 VVLASPGMMQSGLSRELFETWCTDPRNGVIIAG 354
>gi|410898094|ref|XP_003962533.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
3-like [Takifugu rubripes]
Length = 691
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 46/86 (53%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+DK+ K+ NNPF FKH+ S+ + P VV+ S M+ G SR+LF
Sbjct: 293 VNAMNDKIRKAIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 349
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ N +II GTLA+ ++
Sbjct: 350 SWCTDKRNGVIIAGYCVEGTLAKHIM 375
>gi|348518441|ref|XP_003446740.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
3-like [Oreochromis niloticus]
Length = 686
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 32/86 (37%), Positives = 46/86 (53%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I M+DK+ K+ NNPF FKH+ S+ + P VV+ S M+ G SR+LF
Sbjct: 293 INAMNDKIRKAIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 349
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ N +II GTLA+ ++
Sbjct: 350 SWCTDRRNGVIIAGYCVEGTLAKHIM 375
>gi|209877326|ref|XP_002140105.1| hypothetical protein [Cryptosporidium muris RN66]
gi|209555711|gb|EEA05756.1| hypothetical protein, conserved [Cryptosporidium muris RN66]
Length = 1097
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 34/86 (39%), Positives = 46/86 (53%), Gaps = 4/86 (4%)
Query: 34 IEWMSDKLMKSFEGARNNPFH-FKHVKLCHSLAELAKVPS---PKVVLVSTPDMECGFSR 89
IEWMS + F R NPFH K++ + ++L + PS PKV+ S M+ G+SR
Sbjct: 326 IEWMSLDIRSEFCNTRFNPFHGLKNILIENTLNNVRTGPSAKLPKVIFASPASMDYGYSR 385
Query: 90 DLFFQWCSSPENSIIITNRTSPGTLA 115
+LF SSP NS+I T T A
Sbjct: 386 ELFADLASSPNNSVIFTREPKLNTFA 411
>gi|24648013|ref|NP_650738.1| cleavage and polyadenylation specificity factor 73 [Drosophila
melanogaster]
gi|21430620|gb|AAM50988.1| RE31408p [Drosophila melanogaster]
gi|23171662|gb|AAF55578.2| cleavage and polyadenylation specificity factor 73 [Drosophila
melanogaster]
gi|220948314|gb|ACL86700.1| CG7698-PA [synthetic construct]
Length = 684
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 47/86 (54%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I M+D++ + A NNPF F+H+ + + P V++ S M+ G SR+LF
Sbjct: 292 INAMNDRIRRQI--AVNNPFVFRHISNLKGIDHFEDI-GPCVIMASPGMMQSGLSRELFE 348
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ P+N +II GTLA+ ++
Sbjct: 349 SWCTDPKNGVIIAGYCVEGTLAKAVL 374
>gi|348531581|ref|XP_003453287.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
3-like [Oreochromis niloticus]
Length = 690
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 46/86 (53%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+DK+ K+ NNPF FKH+ S+ + P VV+ S M+ G SR+LF
Sbjct: 293 VNAMNDKIRKAIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 349
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ N +II GTLA+ ++
Sbjct: 350 SWCTDKRNGVIIAGYCVEGTLAKHIM 375
>gi|359486187|ref|XP_002271646.2| PREDICTED: cleavage and polyadenylation specificity factor subunit
3-I-like [Vitis vinifera]
Length = 693
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 48/180 (26%), Positives = 78/180 (43%), Gaps = 8/180 (4%)
Query: 13 RNNPFHFKH--VKLCHSLAELAKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKV 70
N P ++ K C ++ + I M++++ F A +NPF FKH+ S+ V
Sbjct: 275 HNVPIYYASPLAKRCMAVYQ-TYINSMNERIRNQF--ANSNPFDFKHISPLKSIENFNDV 331
Query: 71 PSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQ 130
P VV+ S ++ G SR LF WCS +N+ +I GTLA+ +I TL
Sbjct: 332 -GPSVVMASPGGLQSGLSRQLFDMWCSDKKNACVIPGYVVGGTLAKTIINEPKEVTLMNG 390
Query: 131 VKKRIRLEGEELEEYQKKKDKEAKDKQEKEKIPPHDTSFINEL-QLSDFKQTLQRNGIDC 189
+ + ++ + + D KE +PP+ E ++ KQ L DC
Sbjct: 391 LTAPLNMQVHYI-SFSAHADFAQTSTFLKELMPPNIILVHGEANEMGRLKQKLITQFADC 449
>gi|194900154|ref|XP_001979622.1| GG16362 [Drosophila erecta]
gi|190651325|gb|EDV48580.1| GG16362 [Drosophila erecta]
Length = 684
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 47/86 (54%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I M+D++ + A NNPF F+H+ + + P V++ S M+ G SR+LF
Sbjct: 292 INAMNDRIRRQI--AVNNPFVFRHISNLKGIDHFEDI-GPCVIMASPGMMQSGLSRELFE 348
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ P+N +II GTLA+ ++
Sbjct: 349 SWCTDPKNGVIIAGYCVEGTLAKAVL 374
>gi|116283804|gb|AAH30988.1| CPSF3 protein [Homo sapiens]
Length = 554
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 32/83 (38%), Positives = 44/83 (53%), Gaps = 3/83 (3%)
Query: 37 MSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWC 96
M+DK+ K NNPF FKH+ S+ + P VV+ S M+ G SR+LF WC
Sbjct: 289 MNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFESWC 345
Query: 97 SSPENSIIITNRTSPGTLARDLI 119
+ N +II GTLA+ L+
Sbjct: 346 TDKRNGVIIAGYCVEGTLAKILV 368
>gi|297267035|ref|XP_001114469.2| PREDICTED: cleavage and polyadenylation specificity factor subunit
3-like [Macaca mulatta]
Length = 234
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 40/148 (27%), Positives = 68/148 (45%), Gaps = 4/148 (2%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+DK+ K NNPF FKH+ S+ + P VV+ S M+ G SR+LF
Sbjct: 42 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 98
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKDKEA 153
WC+ N +II GTLA+ ++ T K +++ + + + D +
Sbjct: 99 SWCTDKRNGVIIAGYCVEGTLAKHIMSEPEEITTMSGQKLPLKMSVDYI-SFSAHTDYQQ 157
Query: 154 KDKQEKEKIPPHDTSFINELQLSDFKQT 181
+ + PPH S +N + + K++
Sbjct: 158 TSEFIRALKPPHVVSLMNLIYCTKGKRS 185
>gi|195497711|ref|XP_002096215.1| GE25184 [Drosophila yakuba]
gi|194182316|gb|EDW95927.1| GE25184 [Drosophila yakuba]
Length = 684
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 47/86 (54%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I M+D++ + A NNPF F+H+ + + P V++ S M+ G SR+LF
Sbjct: 292 INAMNDRIRRQI--AVNNPFVFRHISNLKGIDHFEDI-GPCVIMASPGMMQSGLSRELFE 348
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ P+N +II GTLA+ ++
Sbjct: 349 SWCTDPKNGVIIAGYCVEGTLAKAVL 374
>gi|297837375|ref|XP_002886569.1| hypothetical protein ARALYDRAFT_475225 [Arabidopsis lyrata subsp.
lyrata]
gi|297332410|gb|EFH62828.1| hypothetical protein ARALYDRAFT_475225 [Arabidopsis lyrata subsp.
lyrata]
Length = 693
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/83 (37%), Positives = 47/83 (56%), Gaps = 3/83 (3%)
Query: 37 MSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWC 96
M+D++ F A +NPF FKH+ +S+ + V P VV+ + ++ G SR LF WC
Sbjct: 299 MNDRIRNQF--ANSNPFVFKHISPLNSIDDFNDV-GPSVVMATPGGLQSGLSRQLFDSWC 355
Query: 97 SSPENSIIITNRTSPGTLARDLI 119
S +N+ II GTLA+ +I
Sbjct: 356 SDKKNACIIPGYMVEGTLAKTII 378
>gi|195037533|ref|XP_001990215.1| GH19212 [Drosophila grimshawi]
gi|193894411|gb|EDV93277.1| GH19212 [Drosophila grimshawi]
Length = 686
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 47/86 (54%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I M+D++ + A NNPF F+H+ + + P V++ S M+ G SR+LF
Sbjct: 294 INAMNDRIRRQI--AVNNPFVFRHISNLKGIDHFDDI-GPCVIMASPGMMQSGLSRELFE 350
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ P+N +II GTLA+ ++
Sbjct: 351 SWCTDPKNGVIIAGYCVEGTLAKTIL 376
>gi|195395198|ref|XP_002056223.1| GJ10819 [Drosophila virilis]
gi|194142932|gb|EDW59335.1| GJ10819 [Drosophila virilis]
Length = 686
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 47/86 (54%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I M+D++ + A NNPF F+H+ + + P V++ S M+ G SR+LF
Sbjct: 294 INAMNDRIRRQI--AVNNPFVFRHISNLKGIDHFDDI-GPCVIMASPGMMQSGLSRELFE 350
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ P+N +II GTLA+ ++
Sbjct: 351 SWCTDPKNGVIIAGYCVEGTLAKTIL 376
>gi|15219848|ref|NP_176297.1| cleavage and polyadenylation specificity factor subunit 3-I
[Arabidopsis thaliana]
gi|30696512|ref|NP_849835.1| cleavage and polyadenylation specificity factor subunit 3-I
[Arabidopsis thaliana]
gi|79320389|ref|NP_001031215.1| cleavage and polyadenylation specificity factor subunit 3-I
[Arabidopsis thaliana]
gi|75262219|sp|Q9C952.1|CPSF3_ARATH RecName: Full=Cleavage and polyadenylation specificity factor
subunit 3-I; AltName: Full=Cleavage and polyadenylation
specificity factor 73 kDa subunit I; Short=AtCPSF73-I;
Short=CPSF 73 kDa subunit I
gi|12323330|gb|AAG51638.1|AC018908_4 putative cleavage and polyadenylation specificity factor;
72745-70039 [Arabidopsis thaliana]
gi|23297661|gb|AAN13003.1| putative cleavage and polyadenylation specificity factor
[Arabidopsis thaliana]
gi|24415578|gb|AAN41458.1| putative cleavage and polyadenylation specificity factor 73 kDa
subunit [Arabidopsis thaliana]
gi|222422865|dbj|BAH19419.1| AT1G61010 [Arabidopsis thaliana]
gi|222423059|dbj|BAH19511.1| AT1G61010 [Arabidopsis thaliana]
gi|332195645|gb|AEE33766.1| cleavage and polyadenylation specificity factor subunit 3-I
[Arabidopsis thaliana]
gi|332195646|gb|AEE33767.1| cleavage and polyadenylation specificity factor subunit 3-I
[Arabidopsis thaliana]
gi|332195647|gb|AEE33768.1| cleavage and polyadenylation specificity factor subunit 3-I
[Arabidopsis thaliana]
Length = 693
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/83 (37%), Positives = 47/83 (56%), Gaps = 3/83 (3%)
Query: 37 MSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWC 96
M+D++ F A +NPF FKH+ +S+ + V P VV+ + ++ G SR LF WC
Sbjct: 299 MNDRIRNQF--ANSNPFVFKHISPLNSIDDFNDV-GPSVVMATPGGLQSGLSRQLFDSWC 355
Query: 97 SSPENSIIITNRTSPGTLARDLI 119
S +N+ II GTLA+ +I
Sbjct: 356 SDKKNACIIPGYMVEGTLAKTII 378
>gi|50287519|ref|XP_446189.1| hypothetical protein [Candida glabrata CBS 138]
gi|74637743|sp|Q6FUA5.1|YSH1_CANGA RecName: Full=Endoribonuclease YSH1; AltName: Full=mRNA
3'-end-processing protein YSH1
gi|49525496|emb|CAG59113.1| unnamed protein product [Candida glabrata]
Length = 771
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 48/86 (55%), Gaps = 1/86 (1%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+D + K F ++ NPF FK++ +L E P V+L S ++ G SRDL
Sbjct: 293 VNMMNDNIRKKFRDSQTNPFIFKNIAYIKNLDEFQDF-GPSVMLASPGMLQNGLSRDLLE 351
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
+WC +N ++IT + GT+A+ L+
Sbjct: 352 RWCPDEKNLVLITGYSVEGTMAKYLL 377
>gi|195108751|ref|XP_001998956.1| GI24246 [Drosophila mojavensis]
gi|193915550|gb|EDW14417.1| GI24246 [Drosophila mojavensis]
Length = 686
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 47/86 (54%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I M+D++ + A NNPF F+H+ + + P V++ S M+ G SR+LF
Sbjct: 294 INAMNDRIRRQI--AVNNPFVFRHISNLKGIDHFDDI-GPCVIMASPGMMQSGLSRELFE 350
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ P+N +II GTLA+ ++
Sbjct: 351 SWCTDPKNGVIIAGYCVEGTLAKTIL 376
>gi|326503296|dbj|BAJ99273.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 693
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 50/86 (58%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I M++++ F A++NPFHFKH++ +S+ V P VV+ S ++ G SR LF
Sbjct: 298 INSMNERIRNQF--AQSNPFHFKHIEPLNSIDNFHDV-GPSVVMASPGSLQSGLSRQLFD 354
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
+WC+ +N+ +I G+L + +I
Sbjct: 355 KWCTDKKNTCVIPGFAVEGSLVKTII 380
>gi|47213099|emb|CAF89519.1| unnamed protein product [Tetraodon nigroviridis]
Length = 810
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 32/86 (37%), Positives = 46/86 (53%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I M+DK+ K+ NNPF FKH+ S+ + P VV+ S M+ G SR+LF
Sbjct: 342 INAMNDKIRKAIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 398
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ N +II GTLA+ ++
Sbjct: 399 SWCTDKRNGVIIAGYCVEGTLAKHIM 424
>gi|47230093|emb|CAG10507.1| unnamed protein product [Tetraodon nigroviridis]
Length = 730
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 46/86 (53%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+DK+ K+ NNPF FKH+ S+ + P VV+ S M+ G SR+LF
Sbjct: 283 VNAMNDKIRKAIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 339
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ N +II GTLA+ ++
Sbjct: 340 SWCTDKRNGVIIAGYCVEGTLAKHIM 365
>gi|302808975|ref|XP_002986181.1| hypothetical protein SELMODRAFT_234972 [Selaginella moellendorffii]
gi|300146040|gb|EFJ12712.1| hypothetical protein SELMODRAFT_234972 [Selaginella moellendorffii]
Length = 684
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 48/86 (55%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I M+DK+ +E +NPF+FKH+ S+ + V P +V+ S ++ G SR LF
Sbjct: 297 INSMNDKIKSQYE--NSNPFNFKHISPLKSIEQFEDV-GPSIVMASPSGLQSGLSRQLFD 353
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
+WC +N+ +I GTLA+ ++
Sbjct: 354 RWCQDRKNACVIPGYVVEGTLAKTIL 379
>gi|18377654|gb|AAL66977.1| putative cleavage and polyadenylation specificity factor
[Arabidopsis thaliana]
Length = 693
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/83 (37%), Positives = 47/83 (56%), Gaps = 3/83 (3%)
Query: 37 MSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWC 96
M+D++ F A +NPF FKH+ +S+ + V P VV+ + ++ G SR LF WC
Sbjct: 299 MNDRIRNQF--ANSNPFVFKHISPLNSIDDFNDV-GPSVVMATPGGLQSGLSRQLFDSWC 355
Query: 97 SSPENSIIITNRTSPGTLARDLI 119
S +N+ II GTLA+ +I
Sbjct: 356 SDKKNACIIPGYMVEGTLAKTII 378
>gi|302806483|ref|XP_002984991.1| hypothetical protein SELMODRAFT_234671 [Selaginella moellendorffii]
gi|302825687|ref|XP_002994439.1| hypothetical protein SELMODRAFT_236963 [Selaginella moellendorffii]
gi|300137630|gb|EFJ04498.1| hypothetical protein SELMODRAFT_236963 [Selaginella moellendorffii]
gi|300147201|gb|EFJ13866.1| hypothetical protein SELMODRAFT_234671 [Selaginella moellendorffii]
Length = 677
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 48/86 (55%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I M+DK+ +E + NPF+FKH+ S+ + V P +V+ S ++ G SR LF
Sbjct: 290 INSMNDKIKSQYENS--NPFNFKHISPLKSIEQFEDV-GPSIVMASPSGLQSGLSRQLFD 346
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
+WC +N+ +I GTLA+ ++
Sbjct: 347 RWCQDRKNACVIPGYVVEGTLAKTIL 372
>gi|55250298|gb|AAH85402.1| Cleavage and polyadenylation specific factor 3 [Danio rerio]
gi|182889046|gb|AAI64567.1| Cpsf3 protein [Danio rerio]
Length = 690
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 46/86 (53%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+DK+ K+ NNPF FKH+ S+ + P VV+ S M+ G SR+LF
Sbjct: 293 VNAMNDKIRKAIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 349
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ N +II GTLA+ ++
Sbjct: 350 SWCTDKRNGVIIAGYCVEGTLAKHIM 375
>gi|51467896|ref|NP_001003836.1| cleavage and polyadenylation specificity factor subunit 3 [Danio
rerio]
gi|49619053|gb|AAT68111.1| cleavage and polyadenylation specificity factor 3 [Danio rerio]
Length = 690
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 46/86 (53%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+DK+ K+ NNPF FKH+ S+ + P VV+ S M+ G SR+LF
Sbjct: 293 VNAMNDKIRKAIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 349
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ N +II GTLA+ ++
Sbjct: 350 SWCTDKRNGVIIAGYCVEGTLAKHIM 375
>gi|444731702|gb|ELW72051.1| Cleavage and polyadenylation specificity factor subunit 3 [Tupaia
chinensis]
Length = 587
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+DK+ K NNPF FKH+ S+ + P VV+ S M+ G SR+LF
Sbjct: 189 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 245
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ N +II GTLA+ ++
Sbjct: 246 SWCTDKRNGVIIAGYCVEGTLAKHIM 271
>gi|335775092|gb|AEH58456.1| cleavage and polyadenylation specificity facto subunit 3-like
protein, partial [Equus caballus]
Length = 499
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+DK+ K NNPF FKH+ S+ + P VV+ S M+ G SR+LF
Sbjct: 101 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 157
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ N +II GTLA+ ++
Sbjct: 158 SWCTDKRNGVIIAGYCVEGTLAKHIM 183
>gi|355565449|gb|EHH21878.1| hypothetical protein EGK_05038 [Macaca mulatta]
Length = 650
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+DK+ K NNPF FKH+ S+ + P VV+ S M+ G SR+LF
Sbjct: 283 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 339
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ N +II GTLA+ ++
Sbjct: 340 SWCTDKRNGVIIAGYCVEGTLAKHIM 365
>gi|195452860|ref|XP_002073532.1| GK13096 [Drosophila willistoni]
gi|194169617|gb|EDW84518.1| GK13096 [Drosophila willistoni]
Length = 684
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 28/86 (32%), Positives = 47/86 (54%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I M+D++ + A NNPF F+H+ + + P V++ S M+ G SR+LF
Sbjct: 292 INAMNDRIRRQI--AVNNPFVFRHISNLKGIDHFEDI-GPCVIMASPGMMQSGLSRELFE 348
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ P+N +I+ GTLA+ ++
Sbjct: 349 SWCTDPKNGVIVAGYCVEGTLAKTIL 374
>gi|89267474|emb|CAJ83498.1| cleavage and polyadenylation specific factor 3 [Xenopus (Silurana)
tropicalis]
Length = 692
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+DK+ K NNPF FKH+ S+ + P VV+ S M+ G SR+LF
Sbjct: 293 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 349
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ N +II GTLA+ ++
Sbjct: 350 SWCTDKRNGVIIAGYCVEGTLAKHIM 375
>gi|403216468|emb|CCK70965.1| hypothetical protein KNAG_0F03030 [Kazachstania naganishii CBS
8797]
Length = 820
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 28/83 (33%), Positives = 46/83 (55%), Gaps = 1/83 (1%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+D + K F ++ NPF FK++ +L E P V+L S ++ G SRDL
Sbjct: 292 VNMMNDDIRKKFRDSQTNPFIFKNISYLKNLEEFQDF-GPSVMLASPGMLQSGLSRDLLE 350
Query: 94 QWCSSPENSIIITNRTSPGTLAR 116
+WC +N ++IT + GT+A+
Sbjct: 351 KWCPEQKNLVLITGYSVEGTMAK 373
>gi|55741994|ref|NP_001006770.1| cleavage and polyadenylation specificity factor 3 [Xenopus
(Silurana) tropicalis]
gi|49522504|gb|AAH75564.1| cleavage and polyadenylation specific factor 3, 73kDa [Xenopus
(Silurana) tropicalis]
Length = 692
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+DK+ K NNPF FKH+ S+ + P VV+ S M+ G SR+LF
Sbjct: 293 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 349
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ N +II GTLA+ ++
Sbjct: 350 SWCTDKRNGVIIAGYCVEGTLAKHIM 375
>gi|410074967|ref|XP_003955066.1| hypothetical protein KAFR_0A04950 [Kazachstania africana CBS 2517]
gi|372461648|emb|CCF55931.1| hypothetical protein KAFR_0A04950 [Kazachstania africana CBS 2517]
Length = 769
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 28/83 (33%), Positives = 46/83 (55%), Gaps = 1/83 (1%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+D + K F ++ NPF FK++ +L E P V+L S ++ G SRDL
Sbjct: 291 VNMMNDDIRKKFRDSQTNPFIFKNISYLKNLEEFQDF-GPSVMLASPGMLQSGISRDLLE 349
Query: 94 QWCSSPENSIIITNRTSPGTLAR 116
+WC +N ++IT + GT+A+
Sbjct: 350 RWCPDDKNLVLITGYSVEGTMAK 372
>gi|410955844|ref|XP_003984560.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
3 [Felis catus]
Length = 686
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+DK+ K NNPF FKH+ S+ + P VV+ S M+ G SR+LF
Sbjct: 286 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 342
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ N +II GTLA+ ++
Sbjct: 343 SWCTDKRNGVIIAGYCVEGTLAKHIM 368
>gi|149641381|ref|XP_001505542.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
3-like, partial [Ornithorhynchus anatinus]
Length = 595
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+DK+ K NNPF FKH+ S+ + P VV+ S M+ G SR+LF
Sbjct: 197 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 253
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ N +II GTLA+ ++
Sbjct: 254 SWCTDKRNGVIIAGYCVEGTLAKHIM 279
>gi|2394306|gb|AAB70268.1| 73 kDA subunit of cleavage and polyadenylation specificity factor
[Homo sapiens]
Length = 379
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+DK+ K NNPF FKH+ S+ + P VV+ S M+ G SR+LF
Sbjct: 167 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 223
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ N +II GTLA+ ++
Sbjct: 224 SWCTDKRNGVIIAGYCVEGTLAKHIM 249
>gi|291412514|ref|XP_002722528.1| PREDICTED: cleavage and polyadenylation specific factor 3, 73kDa
[Oryctolagus cuniculus]
Length = 684
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+DK+ K NNPF FKH+ S+ + P VV+ S M+ G SR+LF
Sbjct: 286 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 342
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ N +II GTLA+ ++
Sbjct: 343 SWCTDKRNGVIIAGYCVEGTLAKHIM 368
>gi|431911821|gb|ELK13965.1| Cleavage and polyadenylation specificity factor subunit 3, partial
[Pteropus alecto]
Length = 667
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+DK+ K NNPF FKH+ S+ + P VV+ S M+ G SR+LF
Sbjct: 269 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 325
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ N +II GTLA+ ++
Sbjct: 326 SWCTDKRNGVIIAGYCVEGTLAKHIM 351
>gi|27805863|ref|NP_776709.1| cleavage and polyadenylation specificity factor subunit 3 [Bos
taurus]
gi|426223116|ref|XP_004005724.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
3 [Ovis aries]
gi|18202362|sp|P79101.1|CPSF3_BOVIN RecName: Full=Cleavage and polyadenylation specificity factor
subunit 3; AltName: Full=Cleavage and polyadenylation
specificity factor 73 kDa subunit; Short=CPSF 73 kDa
subunit; AltName: Full=mRNA 3'-end-processing
endonuclease CPSF-73
gi|1707412|emb|CAA65151.1| Cleavage and Polyadenylation Specifity Factor protein [Bos taurus]
gi|75773721|gb|AAI04554.1| Cleavage and polyadenylation specific factor 3, 73kDa [Bos taurus]
gi|296482248|tpg|DAA24363.1| TPA: cleavage and polyadenylation specificity factor subunit 3 [Bos
taurus]
gi|440897562|gb|ELR49218.1| Cleavage and polyadenylation specificity factor subunit 3 [Bos
grunniens mutus]
Length = 684
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+DK+ K NNPF FKH+ S+ + P VV+ S M+ G SR+LF
Sbjct: 286 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 342
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ N +II GTLA+ ++
Sbjct: 343 SWCTDKRNGVIIAGYCVEGTLAKHIM 368
>gi|147905468|ref|NP_001088278.1| cleavage and polyadenylation specific factor 3, 73kDa [Xenopus
laevis]
gi|54038587|gb|AAH84286.1| LOC495111 protein [Xenopus laevis]
Length = 692
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+DK+ K NNPF FKH+ S+ + P VV+ S M+ G SR+LF
Sbjct: 293 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 349
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ N +II GTLA+ ++
Sbjct: 350 SWCTDKRNGVIIAGYCVEGTLAKHIM 375
>gi|432100623|gb|ELK29151.1| Cleavage and polyadenylation specificity factor subunit 3 [Myotis
davidii]
Length = 684
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+DK+ K NNPF FKH+ S+ + P VV+ S M+ G SR+LF
Sbjct: 286 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 342
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ N +II GTLA+ ++
Sbjct: 343 SWCTDKRNGVIIAGYCVEGTLAKHIM 368
>gi|403270697|ref|XP_003927303.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
3 [Saimiri boliviensis boliviensis]
Length = 658
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+DK+ K NNPF FKH+ S+ + P VV+ S M+ G SR+LF
Sbjct: 290 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 346
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ N +II GTLA+ ++
Sbjct: 347 SWCTDKRNGVIIAGYCVEGTLAKHIM 372
>gi|350539083|ref|NP_001233296.1| cleavage and polyadenylation specificity factor subunit 3 [Pan
troglodytes]
gi|397513374|ref|XP_003826991.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
3 [Pan paniscus]
gi|426334660|ref|XP_004028859.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
3 [Gorilla gorilla gorilla]
gi|343961085|dbj|BAK62132.1| cleavage and polyadenylation specificity factor 73 kDa subunit [Pan
troglodytes]
gi|343961781|dbj|BAK62478.1| cleavage and polyadenylation specificity factor 73 kDa subunit [Pan
troglodytes]
gi|410254182|gb|JAA15058.1| cleavage and polyadenylation specific factor 3, 73kDa [Pan
troglodytes]
gi|410291448|gb|JAA24324.1| cleavage and polyadenylation specific factor 3, 73kDa [Pan
troglodytes]
gi|410339611|gb|JAA38752.1| cleavage and polyadenylation specific factor 3, 73kDa [Pan
troglodytes]
Length = 684
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+DK+ K NNPF FKH+ S+ + P VV+ S M+ G SR+LF
Sbjct: 286 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 342
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ N +II GTLA+ ++
Sbjct: 343 SWCTDKRNGVIIAGYCVEGTLAKHIM 368
>gi|194220982|ref|XP_001502516.2| PREDICTED: cleavage and polyadenylation specificity factor subunit
3-like [Equus caballus]
gi|301775721|ref|XP_002923277.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
3-like [Ailuropoda melanoleuca]
Length = 684
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+DK+ K NNPF FKH+ S+ + P VV+ S M+ G SR+LF
Sbjct: 286 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 342
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ N +II GTLA+ ++
Sbjct: 343 SWCTDKRNGVIIAGYCVEGTLAKHIM 368
>gi|402890043|ref|XP_003908303.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
3 [Papio anubis]
Length = 684
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+DK+ K NNPF FKH+ S+ + P VV+ S M+ G SR+LF
Sbjct: 286 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 342
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ N +II GTLA+ ++
Sbjct: 343 SWCTDKRNGVIIAGYCVEGTLAKHIM 368
>gi|351704796|gb|EHB07715.1| Cleavage and polyadenylation specificity factor subunit 3
[Heterocephalus glaber]
Length = 692
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+DK+ K NNPF FKH+ S+ + P VV+ S M+ G SR+LF
Sbjct: 286 VHAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 342
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ N +II GTLA+ ++
Sbjct: 343 SWCTDKRNGVIIAGYCVEGTLAKHIM 368
>gi|296224527|ref|XP_002758090.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
3 [Callithrix jacchus]
Length = 684
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+DK+ K NNPF FKH+ S+ + P VV+ S M+ G SR+LF
Sbjct: 286 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 342
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ N +II GTLA+ ++
Sbjct: 343 SWCTDKRNGVIIAGYCVEGTLAKHIM 368
>gi|119621395|gb|EAX00990.1| cleavage and polyadenylation specific factor 3, 73kDa, isoform
CRA_b [Homo sapiens]
Length = 647
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+DK+ K NNPF FKH+ S+ + P VV+ S M+ G SR+LF
Sbjct: 249 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 305
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ N +II GTLA+ ++
Sbjct: 306 SWCTDKRNGVIIAGYCVEGTLAKHIM 331
>gi|332247248|ref|XP_003272765.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
3 [Nomascus leucogenys]
gi|67969340|dbj|BAE01022.1| unnamed protein product [Macaca fascicularis]
gi|355751093|gb|EHH55348.1| hypothetical protein EGM_04543 [Macaca fascicularis]
gi|380813676|gb|AFE78712.1| cleavage and polyadenylation specificity factor subunit 3 [Macaca
mulatta]
gi|383419123|gb|AFH32775.1| cleavage and polyadenylation specificity factor subunit 3 [Macaca
mulatta]
gi|384940728|gb|AFI33969.1| cleavage and polyadenylation specificity factor subunit 3 [Macaca
mulatta]
Length = 684
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+DK+ K NNPF FKH+ S+ + P VV+ S M+ G SR+LF
Sbjct: 286 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 342
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ N +II GTLA+ ++
Sbjct: 343 SWCTDKRNGVIIAGYCVEGTLAKHIM 368
>gi|62898706|dbj|BAD97207.1| cleavage and polyadenylation specific factor 3, 73kDa variant [Homo
sapiens]
Length = 684
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+DK+ K NNPF FKH+ S+ + P VV+ S M+ G SR+LF
Sbjct: 286 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 342
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ N +II GTLA+ ++
Sbjct: 343 SWCTDKRNGVIIAGYCVEGTLAKHIM 368
>gi|15079675|gb|AAH11654.1| Cleavage and polyadenylation specific factor 3, 73kDa [Homo
sapiens]
gi|157929136|gb|ABW03853.1| cleavage and polyadenylation specific factor 3, 73kDa [synthetic
construct]
Length = 684
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+DK+ K NNPF FKH+ S+ + P VV+ S M+ G SR+LF
Sbjct: 286 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 342
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ N +II GTLA+ ++
Sbjct: 343 SWCTDKRNGVIIAGYCVEGTLAKHIM 368
>gi|7706427|ref|NP_057291.1| cleavage and polyadenylation specificity factor subunit 3 [Homo
sapiens]
gi|18203503|sp|Q9UKF6.1|CPSF3_HUMAN RecName: Full=Cleavage and polyadenylation specificity factor
subunit 3; AltName: Full=Cleavage and polyadenylation
specificity factor 73 kDa subunit; Short=CPSF 73 kDa
subunit; AltName: Full=mRNA 3'-end-processing
endonuclease CPSF-73
gi|6002955|gb|AAF00224.1|AF171877_1 cleavage and polyadenylation specificity factor 73 kDa subunit
[Homo sapiens]
gi|18044212|gb|AAH20211.1| Cleavage and polyadenylation specific factor 3, 73kDa [Homo
sapiens]
gi|62822309|gb|AAY14858.1| unknown [Homo sapiens]
gi|119621394|gb|EAX00989.1| cleavage and polyadenylation specific factor 3, 73kDa, isoform
CRA_a [Homo sapiens]
Length = 684
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+DK+ K NNPF FKH+ S+ + P VV+ S M+ G SR+LF
Sbjct: 286 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 342
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ N +II GTLA+ ++
Sbjct: 343 SWCTDKRNGVIIAGYCVEGTLAKHIM 368
>gi|363732494|ref|XP_419942.3| PREDICTED: cleavage and polyadenylation specificity factor subunit
3 [Gallus gallus]
Length = 672
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+DK+ K NNPF FKH+ S+ + P VV+ S M+ G SR+LF
Sbjct: 273 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 329
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ N +II GTLA+ ++
Sbjct: 330 SWCTDKRNGVIIAGYCVEGTLAKHIM 355
>gi|348558392|ref|XP_003465002.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
3-like [Cavia porcellus]
Length = 684
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+DK+ K NNPF FKH+ S+ + P VV+ S M+ G SR+LF
Sbjct: 286 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 342
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ N +II GTLA+ ++
Sbjct: 343 SWCTDKRNGVIIAGYCVEGTLAKHIM 368
>gi|335285899|ref|XP_003354974.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
3-like [Sus scrofa]
Length = 684
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+DK+ K NNPF FKH+ S+ + P VV+ S M+ G SR+LF
Sbjct: 286 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 342
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ N +II GTLA+ ++
Sbjct: 343 SWCTDKRNGVIIAGYCVEGTLAKHIM 368
>gi|294883712|ref|XP_002771037.1| cleavage and polyadenylation specificity factor, putative
[Perkinsus marinus ATCC 50983]
gi|239874243|gb|EER02853.1| cleavage and polyadenylation specificity factor, putative
[Perkinsus marinus ATCC 50983]
Length = 1050
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 30/81 (37%), Positives = 49/81 (60%), Gaps = 6/81 (7%)
Query: 33 KIEWMSDKLMKSFE---GARNNPFHFKHVKLCHSLAELAK-VPS--PKVVLVSTPDMECG 86
++EWMS + F+ G +PF F+HV+LC S + A+ P+ PKVVL S+ +E G
Sbjct: 339 RMEWMSRVVHNDFDTSMGFMYHPFLFQHVQLCSSFQDFAQNYPARKPKVVLASSASLEIG 398
Query: 87 FSRDLFFQWCSSPENSIIITN 107
+R++F + C P +++I T
Sbjct: 399 DAREIFCRMCGDPNSTVIFTT 419
>gi|197102904|ref|NP_001127045.1| cleavage and polyadenylation specificity factor subunit 3 [Pongo
abelii]
gi|55733623|emb|CAH93488.1| hypothetical protein [Pongo abelii]
Length = 647
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+DK+ K NNPF FKH+ S+ + P VV+ S M+ G SR+LF
Sbjct: 249 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 305
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ N +II GTLA+ ++
Sbjct: 306 SWCTDKRNGVIIAGYCVEGTLAKHIM 331
>gi|294945374|ref|XP_002784648.1| cleavage and polyadenylation specificity factor, putative
[Perkinsus marinus ATCC 50983]
gi|239897833|gb|EER16444.1| cleavage and polyadenylation specificity factor, putative
[Perkinsus marinus ATCC 50983]
Length = 1115
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 30/81 (37%), Positives = 49/81 (60%), Gaps = 6/81 (7%)
Query: 33 KIEWMSDKLMKSFE---GARNNPFHFKHVKLCHSLAELAK-VPS--PKVVLVSTPDMECG 86
++EWMS + F+ G +PF F+HV+LC S + A+ P+ PKVVL S+ +E G
Sbjct: 384 RMEWMSRVVHNDFDTSMGFMYHPFLFQHVQLCSSFQDFAQNYPARKPKVVLASSASLEIG 443
Query: 87 FSRDLFFQWCSSPENSIIITN 107
+R++F + C P +++I T
Sbjct: 444 DAREIFCRMCGDPNSTVIFTT 464
>gi|67969643|dbj|BAE01170.1| unnamed protein product [Macaca fascicularis]
Length = 684
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+DK+ K NNPF FKH+ S+ + P VV+ S M+ G SR+LF
Sbjct: 286 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 342
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ N +II GTLA+ ++
Sbjct: 343 SWCTDKRNGVIIAGYCVEGTLAKHIM 368
>gi|224140917|ref|XP_002323823.1| predicted protein [Populus trichocarpa]
gi|222866825|gb|EEF03956.1| predicted protein [Populus trichocarpa]
Length = 250
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 29/83 (34%), Positives = 48/83 (57%), Gaps = 3/83 (3%)
Query: 37 MSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWC 96
M++++ F A +NPF FKH+ +S+ + + V P VV+ S ++ G SR LF WC
Sbjct: 170 MNERIRNQF--ANSNPFKFKHISPLNSIEDFSDV-GPSVVMASPGGLQSGLSRQLFDMWC 226
Query: 97 SSPENSIIITNRTSPGTLARDLI 119
S +N+ ++ GTLA+ +I
Sbjct: 227 SDKKNACVLPGYVVEGTLAKTII 249
>gi|210075949|ref|XP_504965.2| YALI0F03817p [Yarrowia lipolytica]
gi|223634672|sp|Q6C2Z7.2|YSH1_YARLI RecName: Full=Endoribonuclease YSH1; AltName: Full=mRNA
3'-end-processing protein YSH1
gi|199424917|emb|CAG77772.2| YALI0F03817p [Yarrowia lipolytica CLIB122]
Length = 827
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 32/109 (29%), Positives = 59/109 (54%), Gaps = 5/109 (4%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I M+D + + F + NPF FK++K +L + P V++ S ++ G SR L
Sbjct: 312 INMMNDNIRRRFRDQKTNPFRFKYIKNIKNLDRFDDM-GPCVMVASPGMLQSGVSRSLLE 370
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI----ELGGNRTLTLQVKKRIRLE 138
+W P+N++I+T + GT+A+ +I E+ + L+V +R+ +E
Sbjct: 371 RWAPDPKNTLILTGYSVEGTMAKQIINEPNEIPSAQNPDLKVPRRLAVE 419
>gi|281351872|gb|EFB27456.1| hypothetical protein PANDA_012399 [Ailuropoda melanoleuca]
Length = 648
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+DK+ K NNPF FKH+ S+ + P VV+ S M+ G SR+LF
Sbjct: 250 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 306
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ N +II GTLA+ ++
Sbjct: 307 SWCTDKRNGVIIAGYCVEGTLAKHIM 332
>gi|417412420|gb|JAA52597.1| Putative cleavage and polyadenylation specificity factor cpsf
subunit, partial [Desmodus rotundus]
Length = 714
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+DK+ K NNPF FKH+ S+ + P VV+ S M+ G SR+LF
Sbjct: 316 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 372
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ N +II GTLA+ ++
Sbjct: 373 SWCTDKRNGVIIAGYCVEGTLAKHIM 398
>gi|126303222|ref|XP_001371997.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
3-like [Monodelphis domestica]
Length = 684
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+DK+ K NNPF FKH+ S+ + P VV+ S M+ G SR+LF
Sbjct: 286 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 342
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ N +II GTLA+ ++
Sbjct: 343 SWCTDKRNGVIIAGYCVEGTLAKHIM 368
>gi|297739612|emb|CBI29794.3| unnamed protein product [Vitis vinifera]
Length = 581
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 35/109 (32%), Positives = 55/109 (50%), Gaps = 6/109 (5%)
Query: 13 RNNPFHFKH--VKLCHSLAELAKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKV 70
N P ++ K C ++ + I M++++ F A +NPF FKH+ S+ V
Sbjct: 275 HNVPIYYASPLAKRCMAVYQ-TYINSMNERIRNQF--ANSNPFDFKHISPLKSIENFNDV 331
Query: 71 PSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLI 119
P VV+ S ++ G SR LF WCS +N+ +I GTLA+ +I
Sbjct: 332 -GPSVVMASPGGLQSGLSRQLFDMWCSDKKNACVIPGYVVGGTLAKTII 379
>gi|443725897|gb|ELU13297.1| hypothetical protein CAPTEDRAFT_184406 [Capitella teleta]
Length = 668
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 36/110 (32%), Positives = 54/110 (49%), Gaps = 11/110 (10%)
Query: 23 KLCHSLAELAKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPD 82
K C S+ + I M+DK+ + NNPF FKH+ S+ + P VV+ S
Sbjct: 264 KKCMSVYQ-TYINAMNDKIKRQIN--TNNPFVFKHISNLKSMEHFDDI-GPSVVMASPGM 319
Query: 83 MECGFSRDLFFQWCSSPENSIIITNRTSPGTLAR-------DLIELGGNR 125
M+ G SR+LF WC+ N II GTLA+ +++ +GG +
Sbjct: 320 MQSGLSRELFENWCTDKRNGCIIAGYCVEGTLAKHILSEPEEIVTMGGQK 369
>gi|395507218|ref|XP_003757924.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
3 [Sarcophilus harrisii]
Length = 684
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+DK+ K NNPF FKH+ S+ + P VV+ S M+ G SR+LF
Sbjct: 286 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 342
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ N +II GTLA+ ++
Sbjct: 343 SWCTDKRNGVIIAGYCVEGTLAKHIM 368
>gi|147787280|emb|CAN71414.1| hypothetical protein VITISV_029216 [Vitis vinifera]
Length = 687
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 46/86 (53%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I M++++ F A +NPF FKH+ S+ V P VV+ S ++ G SR LF
Sbjct: 291 INSMNERIRNQF--ANSNPFDFKHISPLKSIENFNDV-GPSVVMASPSGLQSGLSRQLFD 347
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WCS +N+ +I GTLA+ +I
Sbjct: 348 MWCSDKKNACVIPGYVVEGTLAKTII 373
>gi|255718827|ref|XP_002555694.1| KLTH0G15202p [Lachancea thermotolerans]
gi|238937078|emb|CAR25257.1| KLTH0G15202p [Lachancea thermotolerans CBS 6340]
Length = 755
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 28/83 (33%), Positives = 47/83 (56%), Gaps = 1/83 (1%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+D + K F +++NPF FK++ +L E P V+L S ++ G SRDL
Sbjct: 296 VNMMNDDIRKKFRDSQSNPFIFKNISYLKNLDEFQDF-GPSVMLASPGMLQNGLSRDLLE 354
Query: 94 QWCSSPENSIIITNRTSPGTLAR 116
+WC +N ++IT + GT+A+
Sbjct: 355 KWCPGEKNLVLITGYSVEGTMAK 377
>gi|359486185|ref|XP_003633408.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
3-I-like [Vitis vinifera]
Length = 694
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 46/86 (53%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I M++++ F A +NPF FKH+ S+ V P VV+ S ++ G SR LF
Sbjct: 298 INSMNERIRNQF--ANSNPFDFKHISPLKSIENFNDV-GPSVVMASPSGLQSGLSRQLFD 354
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WCS +N+ +I GTLA+ +I
Sbjct: 355 MWCSDKKNACVIPGYVVEGTLAKTII 380
>gi|326916480|ref|XP_003204535.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
3-like [Meleagris gallopavo]
Length = 759
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+DK+ K NNPF FKH+ S+ + P VV+ S M+ G SR+LF
Sbjct: 360 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 416
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ N +II GTLA+ ++
Sbjct: 417 SWCTDKRNGVIIAGYCVEGTLAKHIM 442
>gi|146421308|ref|XP_001486604.1| hypothetical protein PGUG_02275 [Meyerozyma guilliermondii ATCC
6260]
Length = 770
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 27/78 (34%), Positives = 47/78 (60%), Gaps = 1/78 (1%)
Query: 42 MKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPEN 101
+ S G + NPF FKH+KL SL + + P VV+ + ++ G SR+L +W +N
Sbjct: 307 VSSASGGKLNPFQFKHIKLIRSLDKFQDI-GPCVVVAAPGMLQNGVSRELLERWAPDAKN 365
Query: 102 SIIITNRTSPGTLARDLI 119
++I+T + GT+A++L+
Sbjct: 366 AVIMTGYSVEGTMAKELL 383
>gi|359321645|ref|XP_003639652.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
3-like [Canis lupus familiaris]
Length = 717
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+DK+ K NNPF FKH+ S+ + P VV+ S M+ G SR+LF
Sbjct: 319 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 375
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ N +II GTLA+ ++
Sbjct: 376 SWCTDKRNGVIIAGYCVEGTLAKHIM 401
>gi|327261273|ref|XP_003215455.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
3-like [Anolis carolinensis]
Length = 651
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+DK+ K NNPF FKH+ S+ + P VV+ S M+ G SR+LF
Sbjct: 252 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 308
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ N +II GTLA+ ++
Sbjct: 309 SWCTDKRNGVIIAGYCVEGTLAKHIM 334
>gi|355680849|gb|AER96661.1| cleavage and polyadenylation specific factor 3, 73kDa [Mustela
putorius furo]
Length = 600
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+DK+ K NNPF FKH+ S+ + P VV+ S M+ G SR+LF
Sbjct: 235 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 291
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ N +II GTLA+ ++
Sbjct: 292 SWCTDKRNGVIIAGYCVEGTLAKHIM 317
>gi|297739590|emb|CBI29772.3| unnamed protein product [Vitis vinifera]
Length = 680
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 35/109 (32%), Positives = 55/109 (50%), Gaps = 6/109 (5%)
Query: 13 RNNPFHFKH--VKLCHSLAELAKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKV 70
N P ++ K C ++ + I M++++ F A +NPF FKH+ S+ V
Sbjct: 276 HNIPIYYASPLAKRCMAVYQ-TYINSMNERIRNQF--ANSNPFDFKHISPLKSIENFNDV 332
Query: 71 PSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLI 119
P VV+ S ++ G SR LF WCS +N+ +I GTLA+ +I
Sbjct: 333 -GPSVVMASPSGLQSGLSRQLFDMWCSDKKNACVIPGYVVEGTLAKTII 380
>gi|300676780|gb|ADK26656.1| cleavage and polyadenylation specific factor 3, 73kDa [Zonotrichia
albicollis]
Length = 721
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+DK+ K NNPF FKH+ S+ + P VV+ S M+ G SR+LF
Sbjct: 323 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 379
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ N +II GTLA+ ++
Sbjct: 380 SWCTDKRNGVIIAGYCVEGTLAKHIM 405
>gi|302832928|ref|XP_002948028.1| hypothetical protein VOLCADRAFT_79885 [Volvox carteri f.
nagariensis]
gi|300266830|gb|EFJ51016.1| hypothetical protein VOLCADRAFT_79885 [Volvox carteri f.
nagariensis]
Length = 728
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 33/105 (31%), Positives = 57/105 (54%), Gaps = 5/105 (4%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAE-LAKVPSPKVVLVSTPDMECGFSRDLF 92
+E +++ + K F NPF F+HV+ + A +A P V++ + ++ G SRD F
Sbjct: 299 VESLNEDIKKVFH--DRNPFKFRHVQTLKNPAHFIADYSGPCVIMATPSGLQSGASRDFF 356
Query: 93 FQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRL 137
WC N+ II + GTLA+++ LGG ++T + +R+ L
Sbjct: 357 EAWCEDARNTCIICDFAVQGTLAKEI--LGGPSSITTREGRRVPL 399
>gi|444315239|ref|XP_004178277.1| hypothetical protein TBLA_0A09750 [Tetrapisispora blattae CBS 6284]
gi|387511316|emb|CCH58758.1| hypothetical protein TBLA_0A09750 [Tetrapisispora blattae CBS 6284]
Length = 781
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 33/111 (29%), Positives = 56/111 (50%), Gaps = 4/111 (3%)
Query: 11 GARNNPFHFKH--VKLCHSLAELAKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELA 68
G P ++ K C S+ + + M+D + K F ++ NPF FK++ +L
Sbjct: 277 GGGQVPIYYASSLAKKCMSVFQ-TYVNMMNDDIRKKFRDSQTNPFIFKNISYLRNLDNFE 335
Query: 69 KVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLI 119
P V+L S ++ G SRDL +WC +N ++IT + GT+A+ L+
Sbjct: 336 DF-GPSVLLASPGMLQSGISRDLLERWCPEDKNMVLITGYSVEGTMAKYLM 385
>gi|213407230|ref|XP_002174386.1| cleavage factor two Cft2/polyadenylation factor CPSF-73
[Schizosaccharomyces japonicus yFS275]
gi|212002433|gb|EEB08093.1| cleavage factor two Cft2/polyadenylation factor CPSF-73
[Schizosaccharomyces japonicus yFS275]
Length = 786
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 43/168 (25%), Positives = 83/168 (49%), Gaps = 26/168 (15%)
Query: 16 PFHFKHVKLCHSLAEL-----AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKV 70
P F L +S A+ + IEWM D +++ F G + F+H++ ++L+++
Sbjct: 252 PLSFPIYFLSYSSAKTIGYAKSMIEWMGDNIVRDF-GMNESLLEFRHIQTITHPSQLSQI 310
Query: 71 -PSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIII---TNRTSPGTLAR---------- 116
P PKV++ ++ +E GFS+++ N++I+ +R S +LA+
Sbjct: 311 SPGPKVIIATSLTLESGFSQNVLLDIMPDNSNNLILLTQKSRYSENSLAKQFYRYWERAS 370
Query: 117 -----DLIELGGNRTLTLQVKKRIRLEGEELEEYQ-KKKDKEAKDKQE 158
+ +G ++QVK L+GEEL E+Q K++ K +D ++
Sbjct: 371 RKSPENFSSVGMYFEQSIQVKHSEPLQGEELREFQEKEQSKRTRDAED 418
>gi|449498153|ref|XP_002196255.2| PREDICTED: LOW QUALITY PROTEIN: cleavage and polyadenylation
specificity factor subunit 3 [Taeniopygia guttata]
Length = 746
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+DK+ K NNPF FKH+ S+ + P VV+ S M+ G SR+LF
Sbjct: 348 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 404
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ N +II GTLA+ ++
Sbjct: 405 SWCTDKRNGVIIAGYCVEGTLAKHIM 430
>gi|126030713|pdb|2I7T|A Chain A, Structure Of Human Cpsf-73
gi|126030714|pdb|2I7V|A Chain A, Structure Of Human Cpsf-73
Length = 459
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 31/83 (37%), Positives = 44/83 (53%), Gaps = 3/83 (3%)
Query: 37 MSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWC 96
M+DK+ K NNPF FKH+ S+ + P VV+ S M+ G SR+LF WC
Sbjct: 289 MNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFESWC 345
Query: 97 SSPENSIIITNRTSPGTLARDLI 119
+ N +II GTLA+ ++
Sbjct: 346 TDKRNGVIIAGYCVEGTLAKHIM 368
>gi|367005895|ref|XP_003687679.1| hypothetical protein TPHA_0K01110 [Tetrapisispora phaffii CBS 4417]
gi|357525984|emb|CCE65245.1| hypothetical protein TPHA_0K01110 [Tetrapisispora phaffii CBS 4417]
Length = 790
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 26/86 (30%), Positives = 48/86 (55%), Gaps = 1/86 (1%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+D + K F ++ NPF FK++ ++ P V+L + ++ G SRDL
Sbjct: 291 VNMMNDSIRKKFRDSKTNPFIFKNISYLKNIDSFQDF-GPSVMLAAPGMLQNGLSRDLLE 349
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
+WC P+N ++IT + G++A+ L+
Sbjct: 350 KWCPEPKNMVLITGYSVEGSMAKYLM 375
>gi|320583131|gb|EFW97347.1| Putative endoribonuclease [Ogataea parapolymorpha DL-1]
Length = 702
Score = 58.2 bits (139), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 36/117 (30%), Positives = 65/117 (55%), Gaps = 14/117 (11%)
Query: 13 RNNPFHFKHVKLCHSLAELAK---------IEWMSDKLMKSFEGARNNPFHFKHVKLCHS 63
+NNP ++V + ++ ++LAK + M+D + K F NPFHFK++K +
Sbjct: 257 QNNP-ELQNVPIYYA-SDLAKKCMAVYQRYVNMMNDSIRKKFTETNQNPFHFKYIKNITN 314
Query: 64 LAELAKVPSPKVVLVSTPDM-ECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLI 119
+ ++ + S VL+++P M + G SR + +W P NS I+T + GT+A+ L+
Sbjct: 315 IEKINDLDSS--VLIASPGMLQNGISRKILEKWSPDPRNSCILTGYSVEGTMAKILL 369
>gi|159488791|ref|XP_001702386.1| subunit of mRNA cleavage and polyadenylation specificity factor
[Chlamydomonas reinhardtii]
gi|158271180|gb|EDO97006.1| subunit of mRNA cleavage and polyadenylation specificity factor
[Chlamydomonas reinhardtii]
Length = 690
Score = 58.2 bits (139), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 34/105 (32%), Positives = 57/105 (54%), Gaps = 5/105 (4%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPD-MECGFSRDLF 92
+E ++D + + F NPF F+HV+ + A S V+++TP ++ G SRD F
Sbjct: 274 VESLNDDIKRVFH--ERNPFKFRHVQTLKNPAHFISDYSGPCVIMATPSGLQSGASRDFF 331
Query: 93 FQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRL 137
WC N+ II + GTLA+++ LGG ++T + +R+ L
Sbjct: 332 EAWCEDSRNTCIICDFAVQGTLAKEI--LGGPSSITTREGRRVPL 374
>gi|367031802|ref|XP_003665184.1| hypothetical protein MYCTH_2308652 [Myceliophthora thermophila ATCC
42464]
gi|347012455|gb|AEO59939.1| hypothetical protein MYCTH_2308652 [Myceliophthora thermophila ATCC
42464]
Length = 1035
Score = 58.2 bits (139), Expect = 4e-06, Method: Composition-based stats.
Identities = 48/172 (27%), Positives = 70/172 (40%), Gaps = 58/172 (33%)
Query: 34 IEWMSDKLMKSFE----GARNN----------------PFHFKHVKLCHSLAELAKV--- 70
+EWM D +++ FE G R PF FKH++L A++ +V
Sbjct: 331 LEWMDDSIVREFEAVAGGTRTGNSGGGAGSGAKGKEAGPFDFKHLRLLERKAQVERVLQQ 390
Query: 71 --------PSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTS--PG--TLARDL 118
P +V+L + +E GFS+D+ P N +I+T + S PG ++AR L
Sbjct: 391 ATATDDAEPRGRVILATDSSLEWGFSKDVMRAIAEDPRNLVILTEKPSLNPGKPSIARML 450
Query: 119 IE-----------------------LGGNRTLTLQVKKRIRLEGEELEEYQK 147
E GG R L L R LEG EL+ YQ+
Sbjct: 451 WEWWRERKDGVAVEQTSGGDTFEQVYGGGRELELTDATRQALEGTELDVYQQ 502
>gi|323307973|gb|EGA61229.1| Ysh1p [Saccharomyces cerevisiae FostersO]
Length = 727
Score = 58.2 bits (139), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 27/83 (32%), Positives = 46/83 (55%), Gaps = 1/83 (1%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+D + K F ++ NPF FK++ +L + P V+L S ++ G SRDL
Sbjct: 260 VNMMNDDIXKKFRDSQTNPFIFKNISYLRNLEDFQDF-GPSVMLASPGMLQSGLSRDLLE 318
Query: 94 QWCSSPENSIIITNRTSPGTLAR 116
+WC +N ++IT + GT+A+
Sbjct: 319 RWCPEDKNLVLITGYSIEGTMAK 341
>gi|344302811|gb|EGW33085.1| hypothetical protein SPAPADRAFT_66091 [Spathaspora passalidarum
NRRL Y-27907]
Length = 762
Score = 58.2 bits (139), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 50/86 (58%), Gaps = 3/86 (3%)
Query: 37 MSDK--LMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQ 94
M+DK L S G ++NPF FK++K L + + P VV+ + ++ G SR L +
Sbjct: 298 MNDKIRLSASSSGHKSNPFDFKYIKSIRDLGKFQDM-GPSVVIAAPGMLQAGISRQLLEK 356
Query: 95 WCSSPENSIIITNRTSPGTLARDLIE 120
W P+N +I+T + GT+A++L++
Sbjct: 357 WAPDPKNLVILTGYSVEGTMAKELLK 382
>gi|449283675|gb|EMC90280.1| Cleavage and polyadenylation specificity factor subunit 3, partial
[Columba livia]
Length = 667
Score = 58.2 bits (139), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 33/97 (34%), Positives = 50/97 (51%), Gaps = 4/97 (4%)
Query: 23 KLCHSLAELAKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPD 82
K C S+ + + M+DK+ K NNPF FKH+ S+ + P +V+ S
Sbjct: 259 KKCMSVYQ-TYVNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSIVMASPGM 314
Query: 83 MECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLI 119
M+ G SR+LF WC+ N +II GTLA+ ++
Sbjct: 315 MQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHVM 351
>gi|195995883|ref|XP_002107810.1| hypothetical protein TRIADDRAFT_19764 [Trichoplax adhaerens]
gi|190588586|gb|EDV28608.1| hypothetical protein TRIADDRAFT_19764 [Trichoplax adhaerens]
Length = 636
Score = 58.2 bits (139), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 36/132 (27%), Positives = 63/132 (47%), Gaps = 4/132 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+DK+ A +NPF FKH+ + + P VV+ S M+ G SR+LF
Sbjct: 297 VSAMNDKIRNQI--AISNPFIFKHISNLKGIDHFDDI-GPCVVMASPGMMQSGLSRELFE 353
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKDKEA 153
+WC+ +N ++I GTLA++++ L++ +K R + + D E
Sbjct: 354 KWCTDSKNGVVIAGYCVEGTLAKEVMS-EPEEVLSMSGQKLARKLSVDYISFSAHTDYEQ 412
Query: 154 KDKQEKEKIPPH 165
+ + +PPH
Sbjct: 413 TSEFIRMLMPPH 424
>gi|380480161|emb|CCF42595.1| RNA-metabolising metallo-beta-lactamase [Colletotrichum
higginsianum]
Length = 979
Score = 58.2 bits (139), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 49/166 (29%), Positives = 71/166 (42%), Gaps = 52/166 (31%)
Query: 34 IEWMSDKLMKSFEGA-----RNN----------PFHFKHVKLCHSLAELAKVPSP----- 73
+EWM D +++ FE R N PF F+++KL A++ K+ S
Sbjct: 332 LEWMDDNIVREFESVADGQRRTNGAEAKSKEGVPFDFRYLKLVERRAQIEKLLSGSGDNV 391
Query: 74 ----KVVLVSTPDMECGFSRDLFFQWCSSPENSIIITN-----RTSPGTLARDL------ 118
+V+L S +E GFS+DL N +I+T+ R ++AR L
Sbjct: 392 QAEGRVILASDDTLEWGFSKDLIRGLAKDSRNLVILTDKPAKSRAEQPSIARTLWDWWTE 451
Query: 119 ---------------IEL--GGNRTLTLQVKKRIRLEGEELEEYQK 147
IEL GG R L +Q KR LEG+EL YQ+
Sbjct: 452 RRDGVSVEQSSNGNSIELVYGGGRELEVQEAKRQALEGDELNVYQQ 497
>gi|50304897|ref|XP_452404.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
gi|74636942|sp|Q6CUI5.1|YSH1_KLULA RecName: Full=Endoribonuclease YSH1; AltName: Full=mRNA
3'-end-processing protein YSH1
gi|49641537|emb|CAH01255.1| KLLA0C04598p [Kluyveromyces lactis]
Length = 764
Score = 58.2 bits (139), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 27/86 (31%), Positives = 49/86 (56%), Gaps = 1/86 (1%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+D + K F+ ++ NPF FK++ +L E P V+L S ++ G SRD+
Sbjct: 298 VNMMNDDIRKKFKDSQTNPFIFKNISYLKNLDEFEDF-GPSVMLASPGMLQNGLSRDILE 356
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
+WC +N +++T + GT+A+ L+
Sbjct: 357 KWCPEEKNLVLVTGYSVEGTMAKYLL 382
>gi|365764103|gb|EHN05628.1| Ysh1p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
Length = 699
Score = 58.2 bits (139), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 27/83 (32%), Positives = 46/83 (55%), Gaps = 1/83 (1%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+D + K F ++ NPF FK++ +L + P V+L S ++ G SRDL
Sbjct: 214 VNMMNDDIRKKFRDSQTNPFIFKNISYLRNLEDFQDF-GPSVMLASPGMLQSGLSRDLLE 272
Query: 94 QWCSSPENSIIITNRTSPGTLAR 116
+WC +N ++IT + GT+A+
Sbjct: 273 RWCPEDKNLVLITGYSIEGTMAK 295
>gi|323336337|gb|EGA77605.1| Ysh1p [Saccharomyces cerevisiae Vin13]
Length = 745
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 27/83 (32%), Positives = 46/83 (55%), Gaps = 1/83 (1%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+D + K F ++ NPF FK++ +L + P V+L S ++ G SRDL
Sbjct: 260 VNMMNDDIRKKFRDSQTNPFIFKNISYLRNLEDFQDF-GPSVMLASPGMLQSGLSRDLLE 318
Query: 94 QWCSSPENSIIITNRTSPGTLAR 116
+WC +N ++IT + GT+A+
Sbjct: 319 RWCPEDKNLVLITGYSIEGTMAK 341
>gi|168007963|ref|XP_001756677.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162692273|gb|EDQ78631.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 682
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 47/86 (54%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I M+D++ K FE +NPF FKH++ ++ + P VV+ S ++ G SR LF
Sbjct: 292 INAMNDRIQKQFE--VSNPFDFKHIQPLKNIDGFDDI-GPAVVMASPGGLQSGLSRQLFD 348
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC +NS II GTLA+ ++
Sbjct: 349 IWCQDKKNSCIIPGYVVEGTLAKAIM 374
>gi|356525973|ref|XP_003531594.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
3-I-like [Glycine max]
Length = 688
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 34/109 (31%), Positives = 59/109 (54%), Gaps = 8/109 (7%)
Query: 13 RNNPFHFKH--VKLCHSLAELAKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKV 70
N P ++ K C ++ E + M+D++ + A++NPF FKH+ S+ E+ K
Sbjct: 272 HNIPIYYASPLAKKCLTVYETYTLS-MNDRV----QNAKSNPFSFKHISALSSI-EVFKD 325
Query: 71 PSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLI 119
P VV+ S ++ G SR LF +WCS +N+ ++ GTLA+ ++
Sbjct: 326 VGPSVVMASPGGLQSGLSRQLFDKWCSDKKNTCVLPGFVVEGTLAKTIM 374
>gi|307199387|gb|EFN80012.1| Cleavage and polyadenylation specificity factor subunit 3
[Harpegnathos saltator]
Length = 685
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 30/86 (34%), Positives = 46/86 (53%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+DK+ + A NNPF FKH+ + + P VV+ S M+ G SR+LF
Sbjct: 292 VNAMNDKIRRQI--AINNPFVFKHISNLKGIDHFEDI-GPCVVMASPGMMQSGLSRELFE 348
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ +N +II GTLA+ ++
Sbjct: 349 SWCTDAKNGVIIAGYCVEGTLAKGIL 374
>gi|349579985|dbj|GAA25146.1| K7_Ysh1p [Saccharomyces cerevisiae Kyokai no. 7]
Length = 779
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 27/83 (32%), Positives = 46/83 (55%), Gaps = 1/83 (1%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+D + K F ++ NPF FK++ +L + P V+L S ++ G SRDL
Sbjct: 294 VNMMNDDIRKKFRDSQTNPFIFKNISYLRNLEDFQDF-GPSVMLASPGMLQSGLSRDLLE 352
Query: 94 QWCSSPENSIIITNRTSPGTLAR 116
+WC +N ++IT + GT+A+
Sbjct: 353 RWCPEDKNLVLITGYSIEGTMAK 375
>gi|323303815|gb|EGA57598.1| Ysh1p [Saccharomyces cerevisiae FostersB]
Length = 727
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 27/83 (32%), Positives = 46/83 (55%), Gaps = 1/83 (1%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+D + K F ++ NPF FK++ +L + P V+L S ++ G SRDL
Sbjct: 260 VNMMNDDIRKKFRDSQTNPFIFKNISYLRNLEDFQDF-GPSVMLASPGMLQSGLSRDLLE 318
Query: 94 QWCSSPENSIIITNRTSPGTLAR 116
+WC +N ++IT + GT+A+
Sbjct: 319 RWCPEDKNLVLITGYSIEGTMAK 341
>gi|6323307|ref|NP_013379.1| Ysh1p [Saccharomyces cerevisiae S288c]
gi|74644951|sp|Q06224.1|YSH1_YEAST RecName: Full=Endoribonuclease YSH1; AltName: Full=Yeast 73 kDa
homolog 1; AltName: Full=mRNA 3'-end-processing protein
YSH1
gi|577190|gb|AAB67367.1| Ysh1p: subunit of polyadenylation factor I (PF I) [Saccharomyces
cerevisiae]
gi|151940984|gb|EDN59365.1| cleavage factor II (CF II) component [Saccharomyces cerevisiae
YJM789]
gi|190405336|gb|EDV08603.1| hypothetical protein SCRG_04228 [Saccharomyces cerevisiae RM11-1a]
gi|256269831|gb|EEU05091.1| Ysh1p [Saccharomyces cerevisiae JAY291]
gi|285813694|tpg|DAA09590.1| TPA: Ysh1p [Saccharomyces cerevisiae S288c]
gi|323332373|gb|EGA73782.1| Ysh1p [Saccharomyces cerevisiae AWRI796]
Length = 779
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 27/83 (32%), Positives = 46/83 (55%), Gaps = 1/83 (1%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+D + K F ++ NPF FK++ +L + P V+L S ++ G SRDL
Sbjct: 294 VNMMNDDIRKKFRDSQTNPFIFKNISYLRNLEDFQDF-GPSVMLASPGMLQSGLSRDLLE 352
Query: 94 QWCSSPENSIIITNRTSPGTLAR 116
+WC +N ++IT + GT+A+
Sbjct: 353 RWCPEDKNLVLITGYSIEGTMAK 375
>gi|259148260|emb|CAY81507.1| Ysh1p [Saccharomyces cerevisiae EC1118]
Length = 779
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 27/83 (32%), Positives = 46/83 (55%), Gaps = 1/83 (1%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+D + K F ++ NPF FK++ +L + P V+L S ++ G SRDL
Sbjct: 294 VNMMNDDIRKKFRDSQTNPFIFKNISYLRNLEDFQDF-GPSVMLASPGMLQSGLSRDLLE 352
Query: 94 QWCSSPENSIIITNRTSPGTLAR 116
+WC +N ++IT + GT+A+
Sbjct: 353 RWCPEDKNLVLITGYSIEGTMAK 375
>gi|392297785|gb|EIW08884.1| Ysh1p [Saccharomyces cerevisiae CEN.PK113-7D]
Length = 772
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 27/83 (32%), Positives = 46/83 (55%), Gaps = 1/83 (1%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+D + K F ++ NPF FK++ +L + P V+L S ++ G SRDL
Sbjct: 287 VNMMNDDIRKKFRDSQTNPFIFKNISYLRNLEDFQDF-GPSVMLASPGMLQSGLSRDLLE 345
Query: 94 QWCSSPENSIIITNRTSPGTLAR 116
+WC +N ++IT + GT+A+
Sbjct: 346 RWCPEDKNLVLITGYSIEGTMAK 368
>gi|322786053|gb|EFZ12664.1| hypothetical protein SINV_01905 [Solenopsis invicta]
Length = 686
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 30/86 (34%), Positives = 46/86 (53%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+DK+ + A NNPF FKH+ + + P VV+ S M+ G SR+LF
Sbjct: 293 VNAMNDKIRRQI--AINNPFVFKHISNLKGIDHFEDI-GPCVVMASPGMMQSGLSRELFE 349
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ +N +II GTLA+ ++
Sbjct: 350 SWCTDAKNGVIIAGYCVEGTLAKTIL 375
>gi|194386428|dbj|BAG61024.1| unnamed protein product [Homo sapiens]
Length = 406
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 30/86 (34%), Positives = 44/86 (51%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+DK+ K NNPF FKH+ S+ + P VV+ S M+ G SR+LF
Sbjct: 8 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 64
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ N +II GT A+ ++
Sbjct: 65 SWCTDKRNGVIIAGYCVEGTFAKHIM 90
>gi|401837471|gb|EJT41396.1| YSH1-like protein [Saccharomyces kudriavzevii IFO 1802]
Length = 779
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 27/83 (32%), Positives = 46/83 (55%), Gaps = 1/83 (1%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+D + K F ++ NPF FK++ +L + P V+L S ++ G SRDL
Sbjct: 294 VNMMNDDIRKKFRDSQTNPFIFKNISYLRNLEDFQDF-GPSVMLASPGMLQSGLSRDLLE 352
Query: 94 QWCSSPENSIIITNRTSPGTLAR 116
+WC +N ++IT + GT+A+
Sbjct: 353 RWCPEDKNLVLITGYSIEGTMAK 375
>gi|344280152|ref|XP_003411849.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
3-like [Loxodonta africana]
Length = 903
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+DK+ K NNPF FKH+ S+ + P VV+ S M+ G SR+LF
Sbjct: 505 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 561
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ N +II GTLA+ ++
Sbjct: 562 SWCTDKRNGVIIAGYCVEGTLAKHIM 587
>gi|380012076|ref|XP_003690115.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
3-like [Apis florea]
Length = 686
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 30/86 (34%), Positives = 46/86 (53%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+DK+ + A NNPF FKH+ + + P VV+ S M+ G SR+LF
Sbjct: 293 VNAMNDKIRRQI--AINNPFVFKHISNLKGIDHFEDI-GPCVVMASPGMMQSGLSRELFE 349
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ +N +II GTLA+ ++
Sbjct: 350 SWCTDAKNGVIIAGYCVEGTLAKTIL 375
>gi|310799284|gb|EFQ34177.1| RNA-metabolising metallo-beta-lactamase [Glomerella graminicola
M1.001]
Length = 984
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 47/166 (28%), Positives = 70/166 (42%), Gaps = 52/166 (31%)
Query: 34 IEWMSDKLMKSFEGARNN---------------PFHFKHVKLCHSLAELAKVPSP----- 73
+EWM D +++ FE + PF F+++KL A++ K+ S
Sbjct: 332 LEWMDDNIVREFESVADGQRKANGTEAKSKEGVPFDFRYLKLVERRAQIEKLLSGSGDNV 391
Query: 74 ----KVVLVSTPDMECGFSRDLFFQWCSSPENSIIITN-----RTSPGTLARDL------ 118
+V+L S +E GFS+DL N +I+T+ R ++AR L
Sbjct: 392 QAEGRVILASDDTLEWGFSKDLIRGLAKDSRNLVILTDKPAKSRAEQPSIARTLWDWWTE 451
Query: 119 ---------------IEL--GGNRTLTLQVKKRIRLEGEELEEYQK 147
+EL GG R L +Q KR LEGEEL YQ+
Sbjct: 452 RRDGVAVEQSSNGNNLELVYGGGRELEVQEAKRQALEGEELNVYQQ 497
>gi|347965534|ref|XP_321933.5| AGAP001224-PA [Anopheles gambiae str. PEST]
gi|333470467|gb|EAA01794.5| AGAP001224-PA [Anopheles gambiae str. PEST]
Length = 690
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 46/86 (53%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I M+DK+ + A NNPF F+ + + V P VV+ S M+ G SR+LF
Sbjct: 294 INAMNDKIRRQI--AINNPFVFRFISNLKGIDHFDDV-GPCVVMASPGMMQSGLSRELFE 350
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ P+N +II GTLA+ ++
Sbjct: 351 SWCTDPKNGVIIAGYCVEGTLAKTIL 376
Score = 38.9 bits (89), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 15/33 (45%), Positives = 21/33 (63%)
Query: 208 VVLVSTPDMECGFSRDLFFQWCSSPENSIIITN 240
VV+ S M+ G SR+LF WC+ P+N +II
Sbjct: 332 VVMASPGMMQSGLSRELFESWCTDPKNGVIIAG 364
>gi|401624491|gb|EJS42547.1| ysh1p [Saccharomyces arboricola H-6]
Length = 779
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 30/94 (31%), Positives = 51/94 (54%), Gaps = 2/94 (2%)
Query: 23 KLCHSLAELAKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPD 82
K C S+ + + M+D + K F ++ NPF FK++ +L + P V+L S
Sbjct: 284 KKCMSVFQ-TYVNMMNDDIRKKFRDSQTNPFIFKNISYLRNLEDFQDF-GPSVMLASPGM 341
Query: 83 MECGFSRDLFFQWCSSPENSIIITNRTSPGTLAR 116
++ G SRDL +WC +N ++IT + GT+A+
Sbjct: 342 LQSGLSRDLLERWCPEDKNLVLITGYSIEGTMAK 375
>gi|332019331|gb|EGI59837.1| Cleavage and polyadenylation specificity factor subunit 3
[Acromyrmex echinatior]
Length = 685
Score = 57.8 bits (138), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 30/86 (34%), Positives = 46/86 (53%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+DK+ + A NNPF FKH+ + + P VV+ S M+ G SR+LF
Sbjct: 293 VNAMNDKIRRQI--AINNPFVFKHISNLKGIDHFEDI-GPCVVMASPGMMQSGLSRELFE 349
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ +N +II GTLA+ ++
Sbjct: 350 SWCTDAKNGVIIAGYCVEGTLAKTIL 375
>gi|383861262|ref|XP_003706105.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
3-like [Megachile rotundata]
Length = 686
Score = 57.4 bits (137), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 30/86 (34%), Positives = 46/86 (53%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+DK+ + A NNPF FKH+ + + P VV+ S M+ G SR+LF
Sbjct: 293 VNAMNDKIRRQI--AINNPFVFKHISNLKGIDHFEDI-GPCVVMASPGMMQSGLSRELFE 349
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ +N +II GTLA+ ++
Sbjct: 350 SWCTDAKNGVIIAGYCVEGTLAKTIL 375
>gi|156552097|ref|XP_001605081.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
3-like [Nasonia vitripennis]
Length = 688
Score = 57.4 bits (137), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 30/86 (34%), Positives = 46/86 (53%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+DK+ + A NNPF FKH+ + + P VV+ S M+ G SR+LF
Sbjct: 296 VNAMNDKIRRQI--AINNPFVFKHISNLKGIDHFDDI-GPCVVMASPGMMQSGLSRELFE 352
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ +N +II GTLA+ ++
Sbjct: 353 SWCTDAKNGVIIAGYCVEGTLAKTIL 378
>gi|366992944|ref|XP_003676237.1| hypothetical protein NCAS_0D02950 [Naumovozyma castellii CBS 4309]
gi|342302103|emb|CCC69876.1| hypothetical protein NCAS_0D02950 [Naumovozyma castellii CBS 4309]
Length = 771
Score = 57.4 bits (137), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 31/108 (28%), Positives = 56/108 (51%), Gaps = 4/108 (3%)
Query: 11 GARNNPFHFKH--VKLCHSLAELAKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELA 68
G+ P ++ K C S+ + + M+D + + F ++ NPF FK++ +L E
Sbjct: 268 GSGQVPIYYASNLAKKCMSVFQ-TYVNMMNDDIRRKFRDSQTNPFIFKNISYLRNLEEFQ 326
Query: 69 KVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLAR 116
P V+L S ++ G SRD+ +WC +N ++IT + GT+A+
Sbjct: 327 DF-GPSVMLASPGMLQSGLSRDVLEKWCPDEKNLVLITGYSIEGTMAK 373
>gi|312372474|gb|EFR20427.1| hypothetical protein AND_20124 [Anopheles darlingi]
Length = 692
Score = 57.4 bits (137), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 46/86 (53%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I M+DK+ + A NNPF F+ + + V P VV+ S M+ G SR+LF
Sbjct: 296 INAMNDKIRRQI--AINNPFVFRFISNLKGIDHFDDV-GPCVVMASPGMMQSGLSRELFE 352
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ P+N +II GTLA+ ++
Sbjct: 353 TWCTDPKNGVIIAGYCVEGTLAKTIL 378
>gi|224140919|ref|XP_002323824.1| predicted protein [Populus trichocarpa]
gi|222866826|gb|EEF03957.1| predicted protein [Populus trichocarpa]
Length = 699
Score = 57.4 bits (137), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 29/83 (34%), Positives = 47/83 (56%), Gaps = 3/83 (3%)
Query: 37 MSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWC 96
M++++ F A +NPF FKH+ +S+ + V P VV+ + ++ G SR LF WC
Sbjct: 302 MNERIRNQF--ADSNPFKFKHISPLNSIEDFTDV-GPSVVMATPGGLQSGLSRQLFDMWC 358
Query: 97 SSPENSIIITNRTSPGTLARDLI 119
S +N+ +I GTLA+ +I
Sbjct: 359 SDKKNACVIPGFLVEGTLAKTII 381
>gi|336261956|ref|XP_003345764.1| hypothetical protein SMAC_05921 [Sordaria macrospora k-hell]
gi|380090100|emb|CCC12183.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 1003
Score = 57.4 bits (137), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 46/167 (27%), Positives = 72/167 (43%), Gaps = 53/167 (31%)
Query: 34 IEWMSDKLMKSFEG-----ARNN------------PFHFKHVKLCHSLAELAKV------ 70
+EWM D ++K FE RNN PF FK+++L A++ K+
Sbjct: 333 LEWMDDNIIKEFEAFADESRRNNRRDEGNHQTGPGPFDFKYLRLLERKAQIEKILKQSED 392
Query: 71 --PSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTS--PG---TLARDLIE--- 120
P KV+L S ++ GFS+D+ + N +I+T + + P ++AR L E
Sbjct: 393 TEPRAKVILASDASLDWGFSKDILKSIAADARNLVILTEKPNFEPNHKPSIARTLWEWWK 452
Query: 121 --------------------LGGNRTLTLQVKKRIRLEGEELEEYQK 147
GNR L ++ +R LEG+EL YQ+
Sbjct: 453 ERRDGVATERTSNGDTFEQVYAGNRELEVETAERKGLEGDELNVYQQ 499
>gi|342320223|gb|EGU12165.1| Cleavage and polyadenylation specificity factor subunit
[Rhodotorula glutinis ATCC 204091]
Length = 1010
Score = 57.4 bits (137), Expect = 7e-06, Method: Composition-based stats.
Identities = 38/144 (26%), Positives = 72/144 (50%), Gaps = 28/144 (19%)
Query: 35 EWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKV--PS-PKVVLVSTPDMECGFSRDL 91
EWM ++ G+ + F ++++ SL E+A PS PK++L + G+SR L
Sbjct: 400 EWMGGQI----AGSGADKLKFANLRIFSSLDEIATTIPPSVPKLILTVPSTLSYGYSRAL 455
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLI---------------------ELGGNRTLTLQ 130
F + + N +++T + PG+LAR L E+ ++T+ L+
Sbjct: 456 FLDFARNAANLVLLTGLSEPGSLARWLAREVWEPQQEKGCKYGEGKVGKEVKMDQTIELE 515
Query: 131 VKKRIRLEGEELEEYQKKKDKEAK 154
+K+++ LEG+ELE + + + A+
Sbjct: 516 IKRKVYLEGDELEAHLAAEREAAE 539
>gi|403373777|gb|EJY86813.1| Cleavage and polyadenylation specificity factor subunit 3
[Oxytricha trifallax]
Length = 755
Score = 57.4 bits (137), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 33/98 (33%), Positives = 51/98 (52%), Gaps = 10/98 (10%)
Query: 37 MSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWC 96
M D+L E NNPFHF+ + + +E P V++ S ++ G SRDLF +W
Sbjct: 304 MGDQLRMELESG-NNPFHFEPITTFNDESEF-----PLVIMASPGMLQNGQSRDLFVKWA 357
Query: 97 SSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKR 134
P+N I+ T + GTLA+ ++ NR L V ++
Sbjct: 358 PDPKNGIVFTGYSVEGTLAKSVM----NRPKILTVAEQ 391
>gi|307177772|gb|EFN66769.1| Cleavage and polyadenylation specificity factor subunit 3 [Camponotus
floridanus]
Length = 1750
Score = 57.0 bits (136), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 30/86 (34%), Positives = 46/86 (53%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+DK+ + A NNPF FKH+ + + P VV+ S M+ G SR+LF
Sbjct: 1357 VNAMNDKIRRQI--AINNPFVFKHISNLKGIDHFEDI-GPCVVMASPGMMQSGLSRELFE 1413
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ +N +II GTLA+ ++
Sbjct: 1414 SWCTDAKNGVIIAGYCVEGTLAKTIL 1439
>gi|388579716|gb|EIM20037.1| hypothetical protein WALSEDRAFT_61199 [Wallemia sebi CBS 633.66]
Length = 844
Score = 57.0 bits (136), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 35/99 (35%), Positives = 52/99 (52%), Gaps = 18/99 (18%)
Query: 66 ELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIE----- 120
E A +PK+VL M G SR LF + ++P N+I++T+R +PG+LARDL +
Sbjct: 361 EAAYPGTPKLVLAVPYSMSYGGSRRLFHSFSNNPGNAIVLTSRGAPGSLARDLFDRWNGK 420
Query: 121 ---------LG----GNRTLTLQVKKRIRLEGEELEEYQ 146
LG G+ + + ++ L GEELE YQ
Sbjct: 421 QNDKWGSGKLGEAVQGDWNIPITEHSKVPLLGEELEAYQ 459
>gi|12846103|dbj|BAB27031.1| unnamed protein product [Mus musculus]
Length = 406
Score = 57.0 bits (136), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 30/86 (34%), Positives = 45/86 (52%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+DK+ K NNPF FKH+ S+ + P VV+ S ++ G SR+LF
Sbjct: 8 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMIQNGLSRELFE 64
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ N +II GTLA+ ++
Sbjct: 65 SWCTDKRNGVIIAGYCVEGTLAKHIM 90
>gi|308492421|ref|XP_003108401.1| CRE-CPSF-3 protein [Caenorhabditis remanei]
gi|308249249|gb|EFO93201.1| CRE-CPSF-3 protein [Caenorhabditis remanei]
Length = 712
Score = 57.0 bits (136), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 45/160 (28%), Positives = 68/160 (42%), Gaps = 21/160 (13%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+ ++ K A NPF FKHV + + P VVL + ++ GFSR+LF
Sbjct: 287 VNGMNSRIQKQI--AVKNPFIFKHVSTLRGMDQFEDA-GPCVVLATPGMLQSGFSRELFE 343
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELE--------EY 145
WCS +N II GTLAR ++ + ++ + L GE+L +
Sbjct: 344 NWCSDSKNGCIIAGYCVEGTLARHILT---------EPEEIVSLSGEKLPMRMQVGYVSF 394
Query: 146 QKKKDKEAKDKQEKEKIPPHDTSFINEL-QLSDFKQTLQR 184
D K PPH EL ++S K ++R
Sbjct: 395 SAHTDFNQTSNFVKALKPPHLVLVHGELHEMSRLKAGIER 434
>gi|320581695|gb|EFW95914.1| Ca2+/calmodulin-dependent protein kinase [Ogataea parapolymorpha
DL-1]
Length = 1184
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 41/146 (28%), Positives = 70/146 (47%), Gaps = 20/146 (13%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPS-PKVVLVSTPDM-ECGFSRDL 91
+EWMS + K++E PF +KL SL +LA + PK++ V D+ E SRD
Sbjct: 316 LEWMSPDITKNWENQNQTPFESNRLKLI-SLRDLASLDHRPKIIFVDGTDLNEGSLSRDC 374
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDL-------------IELGG----NRTLTLQVKKR 134
F + CS ++I+T R T A D+ ++ G + ++L +
Sbjct: 375 FIELCSKHNTALIMTERPEVNTTAYDVYKEWESKVKNDNNLKDGALTILEKQMSLSATRE 434
Query: 135 IRLEGEELEEYQKKKDKEAKDKQEKE 160
+L G EL Y+K ++ + ++E+E
Sbjct: 435 EKLRGSELNAYKKSVEERRQRRKEQE 460
>gi|260942735|ref|XP_002615666.1| hypothetical protein CLUG_04548 [Clavispora lusitaniae ATCC 42720]
gi|238850956|gb|EEQ40420.1| hypothetical protein CLUG_04548 [Clavispora lusitaniae ATCC 42720]
Length = 797
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 25/71 (35%), Positives = 44/71 (61%), Gaps = 1/71 (1%)
Query: 49 RNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 108
+ NPF FKH+K S+ ++ + P VV+ S ++ G SR L +W P+N++I+T
Sbjct: 321 KTNPFQFKHIKSIKSIDKIQDM-GPCVVVASPGMLQSGVSRQLLERWAPDPKNAVILTGY 379
Query: 109 TSPGTLARDLI 119
+ GT+A++L+
Sbjct: 380 SVEGTMAKELL 390
>gi|313216448|emb|CBY37756.1| unnamed protein product [Oikopleura dioica]
Length = 690
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 27/82 (32%), Positives = 41/82 (50%), Gaps = 2/82 (2%)
Query: 37 MSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWC 96
M+ K+ K+ NPF F+H+ + P VVL S M+ G SR+LF +WC
Sbjct: 287 MNSKIQKAI--TTRNPFQFRHISNLKGMEAFDDDIGPSVVLASPGMMQSGLSRELFEKWC 344
Query: 97 SSPENSIIITNRTSPGTLARDL 118
++ N +I+ GTLA +
Sbjct: 345 TNKRNGVILAGYAVEGTLAHQI 366
>gi|406601461|emb|CCH46911.1| hypothetical protein BN7_6516 [Wickerhamomyces ciferrii]
Length = 679
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 30/86 (34%), Positives = 47/86 (54%), Gaps = 1/86 (1%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I M+D + K F +NPF FK++K +L + P VV+ S ++ G SR+L
Sbjct: 251 INMMNDNIRKQFRDQNSNPFQFKYIKNIKNLDKFDDF-GPCVVVASPGMLQNGVSRELLE 309
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
+W NS+I+T + GTLA+ L+
Sbjct: 310 RWAPDSRNSVILTGYSVEGTLAKTLL 335
>gi|443713763|gb|ELU06463.1| hypothetical protein CAPTEDRAFT_141423, partial [Capitella teleta]
Length = 369
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 35/110 (31%), Positives = 54/110 (49%), Gaps = 11/110 (10%)
Query: 23 KLCHSLAELAKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPD 82
K C S+ + I M+DK+ + NNPF FKH+ + + + P VV+ S
Sbjct: 57 KKCMSVYQ-TYINAMNDKIKRQIN--TNNPFVFKHISNLKASSHFDDI-GPSVVMASPGM 112
Query: 83 MECGFSRDLFFQWCSSPENSIIITNRTSPGTLAR-------DLIELGGNR 125
M+ G SR+LF WC+ N II GTLA+ +++ +GG +
Sbjct: 113 MQSGLSRELFENWCTDKRNGCIIAGYCVEGTLAKHILSEPEEIVTMGGQK 162
>gi|313244184|emb|CBY15021.1| unnamed protein product [Oikopleura dioica]
Length = 690
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 27/82 (32%), Positives = 41/82 (50%), Gaps = 2/82 (2%)
Query: 37 MSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWC 96
M+ K+ K+ NPF F+H+ + P VVL S M+ G SR+LF +WC
Sbjct: 287 MNSKIQKAI--TTRNPFQFRHISNLKGMEAFDDDIGPSVVLASPGMMQSGLSRELFEKWC 344
Query: 97 SSPENSIIITNRTSPGTLARDL 118
++ N +I+ GTLA +
Sbjct: 345 TNKRNGVILAGYAVEGTLAHQI 366
>gi|74178650|dbj|BAE33998.1| unnamed protein product [Mus musculus]
Length = 684
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 30/86 (34%), Positives = 45/86 (52%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+DK+ K NNPF FKH+ S+ + P VV+ S ++ G SR+LF
Sbjct: 286 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMIQNGLSRELFE 342
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ N +II GTLA+ ++
Sbjct: 343 SWCTDKRNGVIIAGYCVEGTLAKHIM 368
>gi|31980904|ref|NP_061283.2| cleavage and polyadenylation specificity factor subunit 3 [Mus
musculus]
gi|341940395|sp|Q9QXK7.2|CPSF3_MOUSE RecName: Full=Cleavage and polyadenylation specificity factor
subunit 3; AltName: Full=Cleavage and polyadenylation
specificity factor 73 kDa subunit; Short=CPSF 73 kDa
subunit; Short=mRNA 3'-end-processing endonuclease
CPSF-73
gi|23271024|gb|AAH23297.1| Cleavage and polyadenylation specificity factor 3 [Mus musculus]
Length = 684
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 30/86 (34%), Positives = 45/86 (52%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+DK+ K NNPF FKH+ S+ + P VV+ S ++ G SR+LF
Sbjct: 286 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMIQNGLSRELFE 342
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ N +II GTLA+ ++
Sbjct: 343 SWCTDKRNGVIIAGYCVEGTLAKHIM 368
>gi|367016955|ref|XP_003682976.1| hypothetical protein TDEL_0G03980 [Torulaspora delbrueckii]
gi|359750639|emb|CCE93765.1| hypothetical protein TDEL_0G03980 [Torulaspora delbrueckii]
Length = 775
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 31/111 (27%), Positives = 57/111 (51%), Gaps = 4/111 (3%)
Query: 11 GARNNPFHFKH--VKLCHSLAELAKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELA 68
G P ++ K C S+ + + M+D + K F ++ NPF FK++ ++ +
Sbjct: 267 GGGQVPIYYASNLAKKCMSVFQ-TYVNMMNDDIRKKFRDSQTNPFVFKNISYLRNIDDFQ 325
Query: 69 KVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLI 119
P V+L S ++ G SRD+ +WC +N ++IT + GT+A+ L+
Sbjct: 326 DF-GPSVMLASPGMLQSGLSRDVLEKWCPEDKNLVLITGYSVEGTMAKFLM 375
>gi|74221128|dbj|BAE42066.1| unnamed protein product [Mus musculus]
Length = 684
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 30/86 (34%), Positives = 45/86 (52%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+DK+ K NNPF FKH+ S+ + P VV+ S ++ G SR+LF
Sbjct: 286 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMIQNGLSRELFE 342
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ N +II GTLA+ ++
Sbjct: 343 SWCTDKRNGVIIAGYCVEGTLAKHIM 368
>gi|354504216|ref|XP_003514173.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
3 [Cricetulus griseus]
Length = 684
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 30/86 (34%), Positives = 45/86 (52%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+DK+ K NNPF FKH+ S+ + P VV+ S ++ G SR+LF
Sbjct: 286 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMIQNGLSRELFE 342
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ N +II GTLA+ ++
Sbjct: 343 SWCTDKRNGVIIAGYCVEGTLAKHIM 368
>gi|74211665|dbj|BAE29190.1| unnamed protein product [Mus musculus]
Length = 684
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 30/86 (34%), Positives = 45/86 (52%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+DK+ K NNPF FKH+ S+ + P VV+ S ++ G SR+LF
Sbjct: 286 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMIQNGLSRELFE 342
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ N +II GTLA+ ++
Sbjct: 343 SWCTDKRNGVIIAGYCVEGTLAKHIM 368
>gi|149050991|gb|EDM03164.1| cleavage and polyadenylation specificity factor 3, isoform CRA_a
[Rattus norvegicus]
Length = 685
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 30/86 (34%), Positives = 45/86 (52%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+DK+ K NNPF FKH+ S+ + P VV+ S ++ G SR+LF
Sbjct: 286 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMIQNGLSRELFE 342
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ N +II GTLA+ ++
Sbjct: 343 SWCTDKRNGVIIAGYCVEGTLAKHIM 368
>gi|71795627|ref|NP_001025201.1| cleavage and polyadenylation specificity factor subunit 3 [Rattus
norvegicus]
gi|71121802|gb|AAH99817.1| Cleavage and polyadenylation specificity factor 3 [Rattus
norvegicus]
Length = 685
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 30/86 (34%), Positives = 45/86 (52%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+DK+ K NNPF FKH+ S+ + P VV+ S ++ G SR+LF
Sbjct: 286 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMIQNGLSRELFE 342
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ N +II GTLA+ ++
Sbjct: 343 SWCTDKRNGVIIAGYCVEGTLAKHIM 368
>gi|148702078|gb|EDL34025.1| cleavage and polyadenylation specificity factor 3, isoform CRA_b
[Mus musculus]
Length = 701
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 30/86 (34%), Positives = 45/86 (52%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+DK+ K NNPF FKH+ S+ + P VV+ S ++ G SR+LF
Sbjct: 303 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMIQNGLSRELFE 359
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ N +II GTLA+ ++
Sbjct: 360 SWCTDKRNGVIIAGYCVEGTLAKHIM 385
>gi|290978816|ref|XP_002672131.1| predicted protein [Naegleria gruberi]
gi|284085705|gb|EFC39387.1| predicted protein [Naegleria gruberi]
Length = 749
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 41/165 (24%), Positives = 78/165 (47%), Gaps = 7/165 (4%)
Query: 3 DKLMKSFEGARNNPFHFKH--VKLCHSLAELAKIEWMSDKLMKSFEGARNNPFHFKHVKL 60
D+ ++ + ++ P ++ K C ++ + I M+DK+ K F+ +NPF FKH+
Sbjct: 281 DEFWETHQDLQHIPIYYASSLAKKCMTIFQ-TYINMMNDKIRKQFD--IHNPFVFKHISN 337
Query: 61 CHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIE 120
S+ + P V++ S ++ G S++LF WC +N +II + GTLA+ ++
Sbjct: 338 LRSIEDFQD-NGPCVIMASPGMLQSGLSKELFELWCQDAKNGVIIAGYSVDGTLAKKIMS 396
Query: 121 LGGNRTLTLQVKKRIRLEGEELEEYQKKKDKEAKDKQEKEKIPPH 165
TL+ +R+ + + DK ++ PPH
Sbjct: 397 EPETVTLSNGNTVPLRMSVRTI-SFSAHSDKAQTEEFIGTIKPPH 440
>gi|344257704|gb|EGW13808.1| Cleavage and polyadenylation specificity factor subunit 3
[Cricetulus griseus]
Length = 647
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 30/86 (34%), Positives = 45/86 (52%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+DK+ K NNPF FKH+ S+ + P VV+ S ++ G SR+LF
Sbjct: 249 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMIQNGLSRELFE 305
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ N +II GTLA+ ++
Sbjct: 306 SWCTDKRNGVIIAGYCVEGTLAKHIM 331
>gi|6625904|gb|AAF19420.1|AF203969_1 cleavage and polyadenylation specificity factor 73 kDa subunit [Mus
musculus]
Length = 684
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 30/83 (36%), Positives = 44/83 (53%), Gaps = 3/83 (3%)
Query: 37 MSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWC 96
M+DK+ K NNPF FKH+ S+ + P VV+ S ++ G SR+LF WC
Sbjct: 289 MNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMIQNGLSRELFESWC 345
Query: 97 SSPENSIIITNRTSPGTLARDLI 119
+ N +II GTLA+ ++
Sbjct: 346 TDKRNGVIIAGYCVEGTLAKHIM 368
>gi|328350068|emb|CCA36468.1| hypothetical protein PP7435_Chr1-0308 [Komagataella pastoris CBS
7435]
Length = 741
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 29/109 (26%), Positives = 58/109 (53%), Gaps = 5/109 (4%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I M++ + K F NPF F+++K +L++ P VV+ S ++ G SR L
Sbjct: 238 INMMNENIRKKFRDTNKNPFQFQYIKNIKNLSKFDDF-QPSVVVASPGMLQNGVSRALLE 296
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI----ELGGNRTLTLQVKKRIRLE 138
+W P N++I+T + GT+A++++ E+ + + + +R+ +E
Sbjct: 297 KWAPDPRNTLIMTGYSVEGTMAKEILLEPTEIPSQQNPDVLIPRRMTVE 345
>gi|302694097|ref|XP_003036727.1| hypothetical protein SCHCODRAFT_72177 [Schizophyllum commune H4-8]
gi|300110424|gb|EFJ01825.1| hypothetical protein SCHCODRAFT_72177 [Schizophyllum commune H4-8]
Length = 913
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 36/120 (30%), Positives = 62/120 (51%), Gaps = 23/120 (19%)
Query: 53 FHFKHVKLCHSLAELAKVPS---PKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRT 109
F+H++ + L + S PK++L ++ G SR +F ++ + P+N I++T R
Sbjct: 384 LRFQHLEFFPTPQALLQTYSSKDPKLILAVPLNLSHGPSRSIFSEFAAIPDNVILLTQRG 443
Query: 110 SPGTLARDLIE----------------LGGN----RTLTLQVKKRIRLEGEELEEYQKKK 149
PGTLAR L E +G N LTL++++++ L+G+ELE Y K+
Sbjct: 444 DPGTLARALFEKWNDSQRAEAKWDKGKVGSNVMLDDNLTLKMRRKVPLQGDELEAYLAKE 503
>gi|387594760|gb|EIJ89784.1| cleavage and polyadenylation specificity factor 3 [Nematocida
parisii ERTm3]
gi|387596392|gb|EIJ94013.1| cleavage and polyadenylation specificity factor 3 [Nematocida
parisii ERTm1]
Length = 696
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 29/70 (41%), Positives = 40/70 (57%), Gaps = 1/70 (1%)
Query: 50 NNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRT 109
+NPFHFKHV+ ++ E + P V++ S ++ G SRDLF WC N II
Sbjct: 298 SNPFHFKHVQSIKNI-EAYEDRGPCVMMASPGMLQNGLSRDLFEMWCGDKRNGCIIPGYC 356
Query: 110 SPGTLARDLI 119
GTLA+DL+
Sbjct: 357 VEGTLAKDLL 366
>gi|322708414|gb|EFY99991.1| cleavage and polyadenylylation specificity factor, putative
[Metarhizium anisopliae ARSEF 23]
Length = 960
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 45/167 (26%), Positives = 69/167 (41%), Gaps = 53/167 (31%)
Query: 34 IEWMSDKLMKSFE----GAR----------NNPFHFKHVKLCHSLAELAK---------- 69
+EWM D +++ FE G R PF FK+++L A+++K
Sbjct: 332 LEWMDDNIVQEFEAFAEGQRKANGAVEKKEGGPFDFKYLRLLERKAQVSKLLDQVASAQG 391
Query: 70 -VPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPG-----TLARDLIEL-- 121
V +V+L S ME GFS+D+ P N +I+T+R + ++AR L E
Sbjct: 392 EVAKGRVILASDTSMEWGFSKDVLKGLAKDPNNLVILTDRPTIAKDDRPSIARTLWEWWR 451
Query: 122 ---------------------GGNRTLTLQVKKRIRLEGEELEEYQK 147
G R L ++ R LEG+EL YQ+
Sbjct: 452 ERKDGVSVEQTSTGDNIETVQAGGRELEIREASRQALEGDELAVYQQ 498
Score = 42.4 bits (98), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 29/102 (28%), Positives = 50/102 (49%), Gaps = 11/102 (10%)
Query: 140 EELEEYQKKKDKEAKDKQEKEKIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICC 199
+E E + + + ++A EK++ P D ++ + L+R + +D V
Sbjct: 341 QEFEAFAEGQ-RKANGAVEKKEGGPFDFKYL---------RLLERKAQVSKLLDQV-ASA 389
Query: 200 RGTVAVRRVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 241
+G VA RV+L S ME GFS+D+ P N +I+T+R
Sbjct: 390 QGEVAKGRVILASDTSMEWGFSKDVLKGLAKDPNNLVILTDR 431
>gi|150865856|ref|XP_001385241.2| hypothetical protein PICST_89936 [Scheffersomyces stipitis CBS
6054]
gi|149387112|gb|ABN67212.2| predicted protein [Scheffersomyces stipitis CBS 6054]
Length = 793
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 27/84 (32%), Positives = 50/84 (59%), Gaps = 2/84 (2%)
Query: 37 MSDKL-MKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQW 95
M+D + + S G +++PF FK++K L + + + P VV+ S ++ G SR L +W
Sbjct: 301 MNDNIRLSSRIGQKSSPFDFKYIKSIKDLGKFSDM-GPSVVVASPGMLQAGVSRQLLEKW 359
Query: 96 CSSPENSIIITNRTSPGTLARDLI 119
P+N +++T + GT+A+DL+
Sbjct: 360 APDPKNLVVMTGYSVEGTMAKDLL 383
>gi|149050992|gb|EDM03165.1| cleavage and polyadenylation specificity factor 3, isoform CRA_b
[Rattus norvegicus]
Length = 605
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 30/86 (34%), Positives = 45/86 (52%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+DK+ K NNPF FKH+ S+ + P VV+ S ++ G SR+LF
Sbjct: 286 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMIQNGLSRELFE 342
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ N +II GTLA+ ++
Sbjct: 343 SWCTDKRNGVIIAGYCVEGTLAKHIM 368
>gi|428172766|gb|EKX41673.1| hypothetical protein GUITHDRAFT_74597 [Guillardia theta CCMP2712]
Length = 615
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 33/88 (37%), Positives = 50/88 (56%), Gaps = 2/88 (2%)
Query: 51 NPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTS 110
NPF F+H++ ++AE P VV+ S ++ G SRDLF +WC NS++IT
Sbjct: 315 NPFEFRHIRYMTTMAEFQD-NCPCVVMASPGMLQNGPSRDLFDRWCEYRHNSVVITGYCV 373
Query: 111 PGTLARDLIELGGNRTLTLQVKKRIRLE 138
TLA++L++ T TLQ K + L+
Sbjct: 374 QNTLAKELLD-AQPATHTLQDGKEVPLK 400
>gi|378756364|gb|EHY66388.1| cleavage and polyadenylation specificity factor [Nematocida sp. 1
ERTm2]
Length = 692
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 29/70 (41%), Positives = 40/70 (57%), Gaps = 1/70 (1%)
Query: 50 NNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRT 109
+NPFHFKHV+ ++ E + P V++ S ++ G SRDLF WC N II
Sbjct: 298 SNPFHFKHVQNIKNI-EAYEDRGPCVMMASPGMLQNGLSRDLFEMWCGDKRNGCIIPGYC 356
Query: 110 SPGTLARDLI 119
GTLA+DL+
Sbjct: 357 VEGTLAKDLL 366
>gi|392568293|gb|EIW61467.1| hypothetical protein TRAVEDRAFT_162694 [Trametes versicolor
FP-101664 SS1]
Length = 943
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 40/140 (28%), Positives = 64/140 (45%), Gaps = 29/140 (20%)
Query: 53 FHFKHVKLCHSLAELAKVPS---PKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRT 109
F+H++ S L S PK++L + G SR LF + P+N +++T R+
Sbjct: 387 LRFRHLEFFSSPQALMSTYSTKDPKLILAVPATLSHGPSRSLFAHFAEIPDNVVLLTGRS 446
Query: 110 SPGTLARDLIE----------------LGGN----RTLTLQVKKRIRLEGEELEEYQKKK 149
PGTL R L + +G N L L++ K++ L+G+ELEE+
Sbjct: 447 EPGTLGRILFDKWNNSQREEAKWDRGKIGNNIMMDGVLRLEIHKKVPLQGDELEEFL--- 503
Query: 150 DKEAKDKQEKEKIPPHDTSF 169
AK++ KEK H +
Sbjct: 504 ---AKERAVKEKEAAHQAAL 520
Score = 37.0 bits (84), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 30/121 (24%), Positives = 52/121 (42%), Gaps = 32/121 (26%)
Query: 159 KEKIPPHDTSFINELQLSDFKQTLQRNGIDCEFM-DGVLICCRGTVAVRRVVLVSTPDME 217
+ K P ++ I EL+L+ K L + G+ E + +GVLIC +
Sbjct: 850 RPKQPLPQSTMIGELKLTSLKARLAQVGVQAELVGEGVLICGAAAKKGASADAL------ 903
Query: 218 CGFSRDLFFQWCSSPENSIII--TNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAI 275
E+S+ + T R R + LEG +SD YY+V++ +Y +A+
Sbjct: 904 ---------------EDSVAVRKTGRGRVE--------LEGSISDIYYKVRKEIYALHAL 940
Query: 276 L 276
+
Sbjct: 941 V 941
>gi|71682600|gb|AAI00570.1| Cpsf3 protein [Mus musculus]
Length = 512
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 30/83 (36%), Positives = 44/83 (53%), Gaps = 3/83 (3%)
Query: 37 MSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWC 96
M+DK+ K NNPF FKH+ S+ + P VV+ S ++ G SR+LF WC
Sbjct: 289 MNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMIQNGLSRELFESWC 345
Query: 97 SSPENSIIITNRTSPGTLARDLI 119
+ N +II GTLA+ ++
Sbjct: 346 TDKRNGVIIAGYCVEGTLAKHIM 368
>gi|291233360|ref|XP_002736621.1| PREDICTED: cleavage and polyadenylation specific factor 3,
73kDa-like [Saccoglossus kowalevskii]
Length = 715
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 39/143 (27%), Positives = 65/143 (45%), Gaps = 5/143 (3%)
Query: 23 KLCHSLAELAKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPD 82
K C S+ + I M+DK+ + +NPF FKH+ + + P VV+ S
Sbjct: 283 KKCMSVYQ-TYINAMNDKIKRQI--TISNPFVFKHISNLRGMDHFDDI-GPSVVMASPGM 338
Query: 83 MECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEEL 142
M+ G SR+LF WC+ N +II GTLA+ ++ T K +++ + +
Sbjct: 339 MQSGLSRELFESWCTDRRNGVIIAGYCVEGTLAKHILSQPEEVTTMSGQKLPLKMSVDYI 398
Query: 143 EEYQKKKDKEAKDKQEKEKIPPH 165
+ D + ++ IPPH
Sbjct: 399 -SFSAHADYQQISDFIRKLIPPH 420
>gi|432117051|gb|ELK37610.1| Cleavage and polyadenylation specificity factor subunit 2 [Myotis
davidii]
Length = 121
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 34/114 (29%), Positives = 50/114 (43%), Gaps = 40/114 (35%)
Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
++P H + F+NE +L DFKQ L R I EF+ GVL+C
Sbjct: 46 EVPGHQSVFMNEPRLFDFKQVLLREWIQAEFVGGVLVC---------------------- 83
Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYA 274
N+ +++ I LEGCL ++YR++ LLY+QYA
Sbjct: 84 ------------------NNQISVCRTETGRIGLEGCLCQDFYRIRDLLYEQYA 119
>gi|429857613|gb|ELA32471.1| cleavage and polyadenylylation specificity [Colletotrichum
gloeosporioides Nara gc5]
Length = 962
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 45/166 (27%), Positives = 69/166 (41%), Gaps = 52/166 (31%)
Query: 34 IEWMSDKLMKSFEGARNN---------------PFHFKHVKLCHSLAELAKVPSP----- 73
+EWM D +++ FE + PF F+++KL A++ K+ S
Sbjct: 332 LEWMDDNIVREFESVADGQRKANGADGKTKEAVPFDFRYLKLVERRAQIEKLLSSSGGNV 391
Query: 74 ----KVVLVSTPDMECGFSRDLFFQWCSSPENSIIITN-----RTSPGTLAR-------- 116
+V+L S +E GFS+DL N +++T+ R ++AR
Sbjct: 392 QSEGRVILASDDTLEWGFSKDLIKGLAKDSRNLVVLTDKPPKSRAEQPSIARTLWDWWTE 451
Query: 117 -------------DLIEL--GGNRTLTLQVKKRIRLEGEELEEYQK 147
D IE GG R L +Q KR LEG+EL YQ+
Sbjct: 452 RQDGATVEQTSSGDSIEFVYGGGRELEIQEAKRQALEGDELTVYQQ 497
>gi|312080023|ref|XP_003142424.1| cpsf3-prov protein [Loa loa]
Length = 715
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 32/86 (37%), Positives = 45/86 (52%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+ ++ K A NNPF FKHV S+ V P VVL S ++ G SR+LF
Sbjct: 311 VSGMNSRIQKQI--ALNNPFVFKHVSNLKSIDHFEDV-GPCVVLASPGMLQNGLSRELFE 367
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ +N II GTLA+ ++
Sbjct: 368 NWCTDSKNGCIIAGYCVEGTLAKHIL 393
>gi|360043111|emb|CCD78523.1| cleavage and polyadenylation specificity factor-related
[Schistosoma mansoni]
Length = 670
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/74 (37%), Positives = 39/74 (52%), Gaps = 1/74 (1%)
Query: 48 ARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITN 107
A NNPF F+H+ S+ P VV+ S M+ G SR+LF WC+ N +II
Sbjct: 302 ANNNPFCFRHISNLKSIEHFDD-SGPCVVMASPGMMQSGLSRELFENWCTDKRNGVIIAG 360
Query: 108 RTSPGTLARDLIEL 121
GTLA+ ++ L
Sbjct: 361 YCVEGTLAKQILSL 374
>gi|242007002|ref|XP_002424331.1| Cleavage and polyadenylation specificity factor 73 kDa subunit,
putative [Pediculus humanus corporis]
gi|212507731|gb|EEB11593.1| Cleavage and polyadenylation specificity factor 73 kDa subunit,
putative [Pediculus humanus corporis]
Length = 692
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 46/86 (53%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+DK+ + A NNPF F+H+ + + P VV+ S M+ G SR+LF
Sbjct: 294 VNAMNDKIRRQI--AINNPFIFRHIHNLKGIDHFDDI-GPCVVMASPGMMQSGLSRELFE 350
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ +N +II GTLA+ ++
Sbjct: 351 LWCTDSKNGVIIAGYCVEGTLAKQIL 376
>gi|402594378|gb|EJW88304.1| cleavage and polyadenylation specificity factor subunit 3
[Wuchereria bancrofti]
Length = 694
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 32/86 (37%), Positives = 45/86 (52%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+ ++ K A NNPF FKHV S+ V P VVL S ++ G SR+LF
Sbjct: 284 VSGMNSRIQKQI--ALNNPFVFKHVSNLKSIDHFEDV-GPCVVLASPGMLQNGLSRELFE 340
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ +N II GTLA+ ++
Sbjct: 341 NWCTDSKNGCIIAGYCVEGTLAKHIL 366
>gi|170587204|ref|XP_001898368.1| cpsf3-prov protein [Brugia malayi]
gi|158594194|gb|EDP32780.1| cpsf3-prov protein, putative [Brugia malayi]
Length = 700
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 32/86 (37%), Positives = 45/86 (52%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+ ++ K A NNPF FKHV S+ V P VVL S ++ G SR+LF
Sbjct: 290 VSGMNSRIQKQI--ALNNPFVFKHVSNLKSIDHFEDV-GPCVVLASPGMLQNGLSRELFE 346
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ +N II GTLA+ ++
Sbjct: 347 NWCTDSKNGCIIAGYCVEGTLAKHIL 372
>gi|256086716|ref|XP_002579538.1| cleavage and polyadenylation specificity factor-related
[Schistosoma mansoni]
Length = 670
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 28/74 (37%), Positives = 39/74 (52%), Gaps = 1/74 (1%)
Query: 48 ARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITN 107
A NNPF F+H+ S+ P VV+ S M+ G SR+LF WC+ N +II
Sbjct: 302 ASNNPFCFRHISNLKSIEHFDD-SGPCVVMASPGMMQSGLSRELFENWCTDKRNGVIIAG 360
Query: 108 RTSPGTLARDLIEL 121
GTLA+ ++ L
Sbjct: 361 YCVEGTLAKQILSL 374
>gi|303275006|ref|XP_003056813.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226461165|gb|EEH58458.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 803
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I ++ + +FE A NPF F HV+ L V P VVL + ++ G SR+LF
Sbjct: 292 INVLNSDMKAAFEEA--NPFVFNHVQHLSHAGGLDDV-GPCVVLATPSMLQSGLSRELFE 348
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC N +II + GTLAR+++
Sbjct: 349 MWCGDANNGVIIADFAVQGTLAREIL 374
>gi|255084461|ref|XP_002508805.1| predicted protein [Micromonas sp. RCC299]
gi|226524082|gb|ACO70063.1| predicted protein [Micromonas sp. RCC299]
Length = 728
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 47/86 (54%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I ++ + +FE +NPF F HV + + L V P VVL + ++ G SRDLF
Sbjct: 302 INVLNSDMKAAFE--ESNPFVFNHVNHLANSSGLDDV-GPCVVLATPSMLQSGLSRDLFE 358
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC +N +II + GTLAR+++
Sbjct: 359 SWCGDSKNGVIICDFAVQGTLAREIL 384
>gi|268552491|ref|XP_002634228.1| Hypothetical protein CBG01798 [Caenorhabditis briggsae]
Length = 722
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 44/160 (27%), Positives = 68/160 (42%), Gaps = 21/160 (13%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+ ++ K A NPF FKHV + + P VVL + ++ GFSR+LF
Sbjct: 287 VNGMNSRIQKQI--AIKNPFIFKHVSTLRGMDQFEDA-GPCVVLATPGMLQSGFSRELFE 343
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELE--------EY 145
WCS +N II GTLA+ ++ + ++ + L GE+L +
Sbjct: 344 NWCSDSKNGCIIAGYCVEGTLAKHILT---------EPEEIVSLSGEKLPMRMQVGYVSF 394
Query: 146 QKKKDKEAKDKQEKEKIPPHDTSFINEL-QLSDFKQTLQR 184
D K PPH EL ++S K ++R
Sbjct: 395 SAHTDFNQTSNFVKTLKPPHLVLVHGELHEMSRLKAGIER 434
>gi|336466927|gb|EGO55091.1| hypothetical protein NEUTE1DRAFT_130968 [Neurospora tetrasperma
FGSC 2508]
Length = 1051
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 44/167 (26%), Positives = 72/167 (43%), Gaps = 53/167 (31%)
Query: 34 IEWMSDKLMKSFEG-----ARNN------------PFHFKHVKLCHSLAELAKV------ 70
+EWM D +++ FE RNN PF FK+++L A++ K+
Sbjct: 382 LEWMDDSIIREFEAFADESRRNNRRDEGNHQTGPGPFDFKYLRLLERKAQIDKILQQSDD 441
Query: 71 --PSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTS--PG---TLARDLIE--- 120
P KV+L S ++ GFS+D+ + N +I+T + + P +++R L E
Sbjct: 442 AEPRAKVILASDTSLDWGFSKDILKSIAADARNLVILTEKPNLEPNQKPSISRTLWEWWK 501
Query: 121 --------------------LGGNRTLTLQVKKRIRLEGEELEEYQK 147
GNR L ++ +R LEG+EL YQ+
Sbjct: 502 ERRDGVATERTSNGDTFEQVYAGNRELEIETAERKGLEGDELNVYQQ 548
>gi|164424681|ref|XP_958078.2| hypothetical protein NCU06869 [Neurospora crassa OR74A]
gi|157070616|gb|EAA28842.2| conserved hypothetical protein [Neurospora crassa OR74A]
Length = 986
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 44/167 (26%), Positives = 72/167 (43%), Gaps = 53/167 (31%)
Query: 34 IEWMSDKLMKSFEG-----ARNN------------PFHFKHVKLCHSLAELAKV------ 70
+EWM D +++ FE RNN PF FK+++L A++ K+
Sbjct: 333 LEWMDDSIIREFEAFADESRRNNRRDEGNHQTGPGPFDFKYLRLLERKAQIDKILQQSDD 392
Query: 71 --PSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTS--PG---TLARDLIE--- 120
P KV+L S ++ GFS+D+ + N +I+T + + P +++R L E
Sbjct: 393 AEPRAKVILASDTSLDWGFSKDILKSIAADARNLVILTEKPNLEPNQKPSISRTLWEWWK 452
Query: 121 --------------------LGGNRTLTLQVKKRIRLEGEELEEYQK 147
GNR L ++ +R LEG+EL YQ+
Sbjct: 453 ERRDGVATERTSNGDTFEQVYAGNRELEIETAERKGLEGDELNVYQQ 499
>gi|448118544|ref|XP_004203525.1| Piso0_001136 [Millerozyma farinosa CBS 7064]
gi|448120951|ref|XP_004204108.1| Piso0_001136 [Millerozyma farinosa CBS 7064]
gi|359384393|emb|CCE79097.1| Piso0_001136 [Millerozyma farinosa CBS 7064]
gi|359384976|emb|CCE78511.1| Piso0_001136 [Millerozyma farinosa CBS 7064]
Length = 809
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 26/73 (35%), Positives = 43/73 (58%), Gaps = 1/73 (1%)
Query: 47 GARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIIT 106
G + NPF FK++K SL + P VV+ S ++ G SR+L +W P N++I+T
Sbjct: 321 GEKRNPFQFKYIKNIRSLDKFQDF-GPCVVVASPGMLQNGVSRELLERWAPDPRNAVIMT 379
Query: 107 NRTSPGTLARDLI 119
+ GT+A++L+
Sbjct: 380 GYSVEGTMAKELL 392
>gi|168026077|ref|XP_001765559.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162683197|gb|EDQ69609.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 682
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 47/86 (54%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I M++++ K FE +NPF FKH++ ++ E + P VV+ S ++ G SR LF
Sbjct: 292 INAMNERIQKQFE--VSNPFDFKHIQPLKNIDEFDDI-GPAVVMASPGGLQSGLSRQLFD 348
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC +NS +I GT A+ ++
Sbjct: 349 IWCQDKKNSCVIPGYVVEGTPAKAIM 374
>gi|393912282|gb|EJD76666.1| cleavage and polyadenylation specific factor 3 [Loa loa]
Length = 475
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 32/86 (37%), Positives = 45/86 (52%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+ ++ K A NNPF FKHV S+ V P VVL S ++ G SR+LF
Sbjct: 71 VSGMNSRIQKQI--ALNNPFVFKHVSNLKSIDHFEDV-GPCVVLASPGMLQNGLSRELFE 127
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ +N II GTLA+ ++
Sbjct: 128 NWCTDSKNGCIIAGYCVEGTLAKHIL 153
>gi|350288464|gb|EGZ69700.1| hypothetical protein NEUTE2DRAFT_152270 [Neurospora tetrasperma
FGSC 2509]
Length = 1070
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 44/167 (26%), Positives = 72/167 (43%), Gaps = 53/167 (31%)
Query: 34 IEWMSDKLMKSFEG-----ARNN------------PFHFKHVKLCHSLAELAKV------ 70
+EWM D +++ FE RNN PF FK+++L A++ K+
Sbjct: 401 LEWMDDSIIREFEAFADESRRNNRRDEGNHQTGPGPFDFKYLRLLERKAQIDKILQQSDD 460
Query: 71 --PSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTS--PG---TLARDLIE--- 120
P KV+L S ++ GFS+D+ + N +I+T + + P +++R L E
Sbjct: 461 AEPRAKVILASDTSLDWGFSKDILKSIAADARNLVILTEKPNLEPNQKPSISRTLWEWWK 520
Query: 121 --------------------LGGNRTLTLQVKKRIRLEGEELEEYQK 147
GNR L ++ +R LEG+EL YQ+
Sbjct: 521 ERRDGVATERTSNGDTFEQVYAGNRELEIETAERKGLEGDELNVYQQ 567
>gi|299116292|emb|CBN76100.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 752
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 31/105 (29%), Positives = 53/105 (50%), Gaps = 3/105 (2%)
Query: 17 FHFKHVKLCHSLAELAKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVV 76
H + LA I M++ + + + A NPF F+H+ S+ + P VV
Sbjct: 161 LHHIPIYYASRLASKTYINMMNEHIRQQMDVA--NPFKFQHITNLKSIDQFDD-SGPSVV 217
Query: 77 LVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIEL 121
+ S ++ G SR LF +WC+ +NS++I + GTLA+ L+ +
Sbjct: 218 MASPGMLQSGVSRMLFDRWCTDDKNSVLIPGYSVEGTLAKKLLSM 262
>gi|440795785|gb|ELR16901.1| putative cleavage and polyadenylation specificity factor, putative
[Acanthamoeba castellanii str. Neff]
Length = 589
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 30/86 (34%), Positives = 47/86 (54%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I M++ + K F A +NPF FKH+ + P VV+ S ++ G SR+LF
Sbjct: 166 INMMNENIRKQF--AVSNPFVFKHISNLKGMQHFDD-SGPCVVMASPGMLQSGLSRELFE 222
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
+WCS+ +N +II GTLA+ ++
Sbjct: 223 KWCSNAKNGVIIPGYCVEGTLAKHIM 248
>gi|324506922|gb|ADY42942.1| Cleavage and polyadenylation specificity factor subunit 3 [Ascaris
suum]
Length = 706
Score = 54.7 bits (130), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+ ++ K A NNPF F+HV S+ V P VVL S ++ G SR+LF
Sbjct: 299 VSGMNSRIQKQI--ALNNPFVFRHVSNLKSIEHFEDV-GPCVVLASPGMLQNGLSRELFE 355
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ +N II GTLA+ ++
Sbjct: 356 NWCTDSKNGCIIAGYCVEGTLAKHIL 381
>gi|322700762|gb|EFY92515.1| cleavage and polyadenylylation specificity factor, putative
[Metarhizium acridum CQMa 102]
Length = 960
Score = 54.7 bits (130), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 43/167 (25%), Positives = 69/167 (41%), Gaps = 53/167 (31%)
Query: 34 IEWMSDKLMKSFE----GAR----------NNPFHFKHVKLCHSLAELAKV--------- 70
+EWM D +++ FE G R PF FK+++L A+++K+
Sbjct: 332 LEWMDDNIVQEFEAFAEGQRKANGTVEKKDGGPFDFKYLRLLERKAQVSKLLDQVASAQG 391
Query: 71 --PSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPG-----TLARDLIEL-- 121
+V+L S ME GFS+D+ + P N +I+T+R + ++ R L E
Sbjct: 392 EAAKGRVILASDASMEWGFSKDVLRELAKDPNNLVILTDRPTIAKDDRPSIGRTLWEWWK 451
Query: 122 ---------------------GGNRTLTLQVKKRIRLEGEELEEYQK 147
G R L ++ R LEG+EL YQ+
Sbjct: 452 ERKDGVSVEQTSTGDNIETVQAGGRELEIREASRQALEGDELAVYQQ 498
Score = 42.4 bits (98), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 24/67 (35%), Positives = 36/67 (53%), Gaps = 3/67 (4%)
Query: 177 DFK--QTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGFSRDLFFQWCSSPEN 234
DFK + L+R + +D V +G A RV+L S ME GFS+D+ + P N
Sbjct: 366 DFKYLRLLERKAQVSKLLDQV-ASAQGEAAKGRVILASDASMEWGFSKDVLRELAKDPNN 424
Query: 235 SIIITNR 241
+I+T+R
Sbjct: 425 LVILTDR 431
>gi|408391611|gb|EKJ70983.1| hypothetical protein FPSE_08842 [Fusarium pseudograminearum CS3096]
Length = 963
Score = 54.7 bits (130), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 48/167 (28%), Positives = 74/167 (44%), Gaps = 55/167 (32%)
Query: 34 IEWMSDKLMKSFE----------GARN----NPFHFKHVKLCHSLAELAKVPS------- 72
+EWM D +++ FE GA N PF FK+++L A++A++ S
Sbjct: 332 LEWMDDSIVQEFEAFAEDQRRVNGANNKKEGGPFDFKYLRLLERKAQIARLLSQNVENAG 391
Query: 73 --PKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPG-------TLARDL----- 118
+V+L S +E GFS+DL N +I+T++ PG ++AR L
Sbjct: 392 TEGRVILASDSSIEWGFSKDLIKGLAQDSRNLVILTDK--PGLSKNDNPSIARTLWDWWK 449
Query: 119 ----------------IEL--GGNRTLTLQVKKRIRLEGEELEEYQK 147
IEL G R L ++ +R LEG+EL YQ+
Sbjct: 450 ERKDGVSVEQNCNGENIELVYAGGRELEIREPQRHALEGDELALYQQ 496
>gi|255721479|ref|XP_002545674.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
gi|240136163|gb|EER35716.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
Length = 870
Score = 54.7 bits (130), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 42/147 (28%), Positives = 71/147 (48%), Gaps = 25/147 (17%)
Query: 35 EWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECG-FSRDLFF 93
+WMS+ L K +E PF+ V L AEL K+ PK+V S D++ G S + F
Sbjct: 287 DWMSNTLSKQWEELSTVPFNPSKVDLLLDPAELIKLSGPKIVFCSGIDLKDGDISSEAFQ 346
Query: 94 QWCSSPENSIIITNRT-----------------------SPGTLARDLIELGGNRTLTLQ 130
C+ ++I+T ++ S G A+D I + +RT++LQ
Sbjct: 347 YLCNDTSTTVILTEKSCIDSRNGLGAELYKEWYTSASNKSTGNGAKDGIAVPIDRTISLQ 406
Query: 131 VK-KRIRLEGEELEEYQKKKDKEAKDK 156
+ K + L G++L +Q+K ++ K+K
Sbjct: 407 NQTKEVDLTGQDLLNFQEKVAQKRKEK 433
>gi|32566029|ref|NP_502553.2| Protein CPSF-3 [Caenorhabditis elegans]
gi|26985920|emb|CAC44310.2| Protein CPSF-3 [Caenorhabditis elegans]
Length = 707
Score = 54.7 bits (130), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 49/193 (25%), Positives = 81/193 (41%), Gaps = 24/193 (12%)
Query: 3 DKLMKSFEGARNNPFHFKH--VKLCHSLAELAKIEWMSDKLMKSFEGARNNPFHFKHVKL 60
D+ +S + + P ++ K C S+ + + M+ ++ K A NPF FKHV
Sbjct: 255 DEYWESHQELHDIPVYYASSLAKKCMSVYQ-TFVNGMNSRIQKQI--AVKNPFIFKHVST 311
Query: 61 CHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIE 120
+ + P VVL + ++ GFSR+LF WC +N II GTLA+ ++
Sbjct: 312 LRGMDQFEDA-GPCVVLATPGMLQSGFSRELFESWCPDTKNGCIIAGYCVEGTLAKHILS 370
Query: 121 LGGNRTLTLQVKKRIRLEGEELE--------EYQKKKDKEAKDKQEKEKIPPHDTSFINE 172
+ ++ + L GE+L + D K PPH E
Sbjct: 371 ---------EPEEIVSLSGEKLPMRMQVGYVSFSAHTDYHQTSNFVKALKPPHLVLVHGE 421
Query: 173 L-QLSDFKQTLQR 184
L ++S K ++R
Sbjct: 422 LHEMSRLKSGIER 434
>gi|444714932|gb|ELW55806.1| Cleavage and polyadenylation specificity factor subunit 2 [Tupaia
chinensis]
Length = 723
Score = 54.7 bits (130), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 26/47 (55%), Positives = 33/47 (70%), Gaps = 1/47 (2%)
Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRR 207
++P H + F+NE +LSDFKQ L R GI EF+ GVL+ C VAVRR
Sbjct: 610 EVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNQVAVRR 655
>gi|46138561|ref|XP_390971.1| hypothetical protein FG10795.1 [Gibberella zeae PH-1]
Length = 964
Score = 54.3 bits (129), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 48/168 (28%), Positives = 74/168 (44%), Gaps = 56/168 (33%)
Query: 34 IEWMSDKLMKSFE----------GARN-----NPFHFKHVKLCHSLAELAKVPS------ 72
+EWM D +++ FE GA N PF FK+++L A++A++ S
Sbjct: 332 LEWMDDSIVQEFEAFAEDQRRVNGANNKKEGGGPFDFKYLRLLERKAQIARLLSQNVENA 391
Query: 73 ---PKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPG-------TLARDL---- 118
+V+L S +E GFS+DL N +I+T++ PG ++AR L
Sbjct: 392 GTEGRVILASDSSIEWGFSKDLIKGLAQDSRNLVILTDK--PGLSKNGNPSIARTLWDWW 449
Query: 119 -----------------IEL--GGNRTLTLQVKKRIRLEGEELEEYQK 147
IEL G R L ++ +R LEG+EL YQ+
Sbjct: 450 KERKDGVSVEQNSNGDNIELVYAGGRELEIREPQRHALEGDELALYQQ 497
>gi|346327110|gb|EGX96706.1| cleavage and polyadenylylation specificity factor, putative
[Cordyceps militaris CM01]
Length = 1024
Score = 54.3 bits (129), Expect = 5e-05, Method: Composition-based stats.
Identities = 48/187 (25%), Positives = 82/187 (43%), Gaps = 33/187 (17%)
Query: 34 IEWMSDKLMKSFEG-----ARNN---------PFHFKHVKLCHSLAELAKVPSP------ 73
+EWM D +++ FE R N P F+ ++L A++AK+ S
Sbjct: 366 LEWMDDGIVQEFEAFAEGQKRTNGASDKKVGGPLDFRFMRLLDRKAQIAKLLSTAVNNGE 425
Query: 74 ---KVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR------TSPGTLARDLIELGGN 124
+V+L S M+ GFS+DL S P N +I+T++ +SP ++AR L L
Sbjct: 426 SKGRVILASDTSMDWGFSKDLLRGLASDPNNVVILTDKPGMAQESSP-SIARTLWSLWKE 484
Query: 125 R---TLTLQVKKRIRLEGEELEEYQKKKDKEAKDKQEKEKIPPHDTSFINELQLSDFKQT 181
R TLQ +E Y + ++ + E++ + + QL +QT
Sbjct: 485 RKDGVATLQTANGDTIEYVNTSGYDLEIRDAKREPLQGEELAVYQQWLATQRQLQATQQT 544
Query: 182 LQRNGID 188
+ +GI+
Sbjct: 545 VGADGIE 551
>gi|392512873|emb|CAD25809.2| similarity to HYPOTHETICAL PROTEIN Y162_METJA [Encephalitozoon
cuniculi GB-M1]
Length = 643
Score = 54.3 bits (129), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 26/69 (37%), Positives = 40/69 (57%), Gaps = 1/69 (1%)
Query: 51 NPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTS 110
NPF FK+VK + P V++ S ++ G SRDLF +WCS +N++II
Sbjct: 296 NPFAFKYVKNLKGIDSFDD-EGPCVIMASPGMLQSGLSRDLFERWCSDSKNAVIIPGYCV 354
Query: 111 PGTLARDLI 119
GTLA++++
Sbjct: 355 DGTLAKEIL 363
>gi|401882746|gb|EJT46990.1| cleavage and polyadenylation specificity factor [Trichosporon
asahii var. asahii CBS 2479]
gi|406700483|gb|EKD03650.1| cleavage and polyadenylation specificity factor [Trichosporon
asahii var. asahii CBS 8904]
Length = 738
Score = 54.3 bits (129), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 28/71 (39%), Positives = 39/71 (54%), Gaps = 1/71 (1%)
Query: 49 RNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 108
R+NPF FK+VK L P VV+ S M G SR+L +W P+N +I+T
Sbjct: 343 RDNPFDFKYVKWLKDPKRLDH-KQPCVVMASAQFMSFGLSRELLEEWAPDPKNGVIVTGY 401
Query: 109 TSPGTLARDLI 119
+ GT+AR L+
Sbjct: 402 SIEGTMARTLL 412
>gi|396082284|gb|AFN83894.1| putative beta-lactamase fold-containing exonuclease
[Encephalitozoon romaleae SJ-2008]
Length = 643
Score = 54.3 bits (129), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 26/69 (37%), Positives = 40/69 (57%), Gaps = 1/69 (1%)
Query: 51 NPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTS 110
NPF FK+VK + P V++ S ++ G SRDLF +WCS +N++II
Sbjct: 296 NPFAFKYVKNLKGIDSFDD-EGPCVIMASPGMLQSGLSRDLFERWCSDSKNAVIIPGYCV 354
Query: 111 PGTLARDLI 119
GTLA++++
Sbjct: 355 DGTLAKEIL 363
>gi|388853919|emb|CCF52417.1| uncharacterized protein [Ustilago hordei]
Length = 1033
Score = 54.3 bits (129), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 40/137 (29%), Positives = 67/137 (48%), Gaps = 24/137 (17%)
Query: 47 GARNNPFHFKHVKLCHSLAEL-AKVPS--PKVVLVSTPDMECGFSRDLFFQWCSSPENSI 103
A ++P FK+VK+ SL + +P KVVL P M G SR L ++ +P + +
Sbjct: 405 AAASSPLDFKYVKVYSSLQAMDEAIPQDQAKVVLAVPPSMTHGPSRRLLARFAKNPNDVV 464
Query: 104 IITNRTSPGTLARDLIE----------------LG----GNRTLTLQVKKRIRLEGEELE 143
++ +R PG+L R L + LG N + ++K R+ LEGEEL
Sbjct: 465 VLISRGEPGSLCRQLWDAWNTNQGKGFAWAQGKLGEAVTPNTRVRFELKSRVPLEGEELR 524
Query: 144 EYQKKKDKEAKDKQEKE 160
+ + + E +D+Q ++
Sbjct: 525 AHLEAEQAE-RDRQTQQ 540
>gi|301111988|ref|XP_002905073.1| cleavage and polyadenylation specificity factor subunit 3
[Phytophthora infestans T30-4]
gi|262095403|gb|EEY53455.1| cleavage and polyadenylation specificity factor subunit 3
[Phytophthora infestans T30-4]
Length = 724
Score = 54.3 bits (129), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 46/86 (53%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I M+D++ K A +NPF F+H+ S+ + P VV+ S ++ G SR LF
Sbjct: 291 INMMNDRIRKQI--AISNPFQFEHISNLKSMDDFDD-SGPSVVMASPGMLQSGVSRQLFE 347
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
+WCS N+ +I GTLA+ ++
Sbjct: 348 RWCSDKRNACLIPGYVVEGTLAKKIL 373
>gi|405963469|gb|EKC29039.1| Cleavage and polyadenylation specificity factor subunit 3
[Crassostrea gigas]
Length = 686
Score = 54.3 bits (129), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 27/70 (38%), Positives = 37/70 (52%), Gaps = 1/70 (1%)
Query: 50 NNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRT 109
+NPF FKH+ S+ + P VVL S M+ G SR+LF WC+ N II
Sbjct: 305 SNPFVFKHISNLKSMEHFEDI-GPSVVLASPGMMQSGLSRELFESWCTDKRNGCIIAGYC 363
Query: 110 SPGTLARDLI 119
GTLA+ ++
Sbjct: 364 VEGTLAKHIL 373
>gi|303391080|ref|XP_003073770.1| putative beta-lactamase fold-containing exonuclease
[Encephalitozoon intestinalis ATCC 50506]
gi|303302918|gb|ADM12410.1| putative beta-lactamase fold-containing exonuclease
[Encephalitozoon intestinalis ATCC 50506]
Length = 696
Score = 54.3 bits (129), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 26/69 (37%), Positives = 40/69 (57%), Gaps = 1/69 (1%)
Query: 51 NPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTS 110
NPF FK+VK + P V++ S ++ G SRDLF +WCS +N++II
Sbjct: 296 NPFAFKYVKNLKGIDSFDD-EGPCVIMASPGMLQSGLSRDLFERWCSDSKNAVIIPGYCV 354
Query: 111 PGTLARDLI 119
GTLA++++
Sbjct: 355 DGTLAKEIL 363
>gi|348686031|gb|EGZ25846.1| hypothetical protein PHYSODRAFT_478942 [Phytophthora sojae]
Length = 733
Score = 53.9 bits (128), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 46/86 (53%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I M+D++ K A +NPF F+H+ S+ + P VV+ S ++ G SR LF
Sbjct: 291 INMMNDRIRKQI--AISNPFQFEHISNLKSMDDFDD-SGPSVVMASPGMLQSGVSRQLFE 347
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
+WCS N+ +I GTLA+ ++
Sbjct: 348 RWCSDKRNACLIPGYVVEGTLAKKIL 373
>gi|198421242|ref|XP_002128016.1| PREDICTED: similar to Cleavage and polyadenylation specificity
factor subunit 3 (Cleavage and polyadenylation
specificity factor 73 kDa subunit) (CPSF 73 kDa subunit)
(mRNA 3-end-processing endonuclease CPSF-73) [Ciona
intestinalis]
Length = 690
Score = 53.9 bits (128), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 25/70 (35%), Positives = 37/70 (52%), Gaps = 1/70 (1%)
Query: 50 NNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRT 109
+NPF FKH+ + V P VV+ S M+ G SR+LF WC+ N +I+
Sbjct: 305 SNPFQFKHISNLKGMEHFDDV-GPSVVMASPGMMQSGLSRELFESWCNDRRNGVIVAGYC 363
Query: 110 SPGTLARDLI 119
GTLA+ ++
Sbjct: 364 VEGTLAKHIL 373
>gi|401827745|ref|XP_003888165.1| putative RNA-processing beta-lactamase-fold exonuclease
[Encephalitozoon hellem ATCC 50504]
gi|392999365|gb|AFM99184.1| putative RNA-processing beta-lactamase-fold exonuclease
[Encephalitozoon hellem ATCC 50504]
Length = 643
Score = 53.9 bits (128), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 26/69 (37%), Positives = 39/69 (56%), Gaps = 1/69 (1%)
Query: 51 NPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTS 110
NPF FK+VK + P V++ S ++ G SRDLF +WCS N++II
Sbjct: 296 NPFAFKYVKNLKGIDSFDD-EGPCVIMASPGMLQSGLSRDLFERWCSDSRNAVIIPGYCV 354
Query: 111 PGTLARDLI 119
GTLA++++
Sbjct: 355 DGTLAKEIL 363
>gi|425768274|gb|EKV06801.1| Cleavage and polyadenylylation specificity factor, putative
[Penicillium digitatum Pd1]
gi|425770355|gb|EKV08828.1| Cleavage and polyadenylylation specificity factor, putative
[Penicillium digitatum PHI26]
Length = 1001
Score = 53.9 bits (128), Expect = 7e-05, Method: Composition-based stats.
Identities = 29/96 (30%), Positives = 45/96 (46%), Gaps = 23/96 (23%)
Query: 34 IEWMSDKLMKSFEGARNN--------------------PFHFKHVKLCHSLAELAKV--- 70
+EWM + +++ FE A ++ PF FKH+K+ L K+
Sbjct: 344 LEWMDENIVREFEAAESSDVTNGQRTGAQEKSSNKGGGPFTFKHLKIIERKKRLEKLLAE 403
Query: 71 PSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIIT 106
P PKV+L S M+ GFS+D Q P N +++T
Sbjct: 404 PGPKVILASDTSMDWGFSKDALRQVAEGPNNLLLLT 439
>gi|300706475|ref|XP_002995499.1| hypothetical protein NCER_101581 [Nosema ceranae BRL01]
gi|239604633|gb|EEQ81828.1| hypothetical protein NCER_101581 [Nosema ceranae BRL01]
Length = 671
Score = 53.9 bits (128), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 30/86 (34%), Positives = 49/86 (56%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I M+D + KS NPF F++VK S+ + + SP V++ S ++ G SR+LF
Sbjct: 305 INMMNDHIKKS--SLIKNPFAFQYVKNLKSI-DFFEDNSPCVIMASPGMLQSGLSRELFE 361
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
+WC N +II + GTLA++++
Sbjct: 362 KWCGDRRNGVIIPGYSVDGTLAKEIL 387
>gi|402696939|gb|AFQ90658.1| 73kDa cleavage and polyadenylation specific factor 3, partial
[Draco beccarii]
Length = 220
Score = 53.9 bits (128), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 29/77 (37%), Positives = 39/77 (50%), Gaps = 3/77 (3%)
Query: 37 MSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWC 96
M+DK+ K NNPF FKH+ S+ + P VV+ S M+ G SR+LF WC
Sbjct: 147 MNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFESWC 203
Query: 97 SSPENSIIITNRTSPGT 113
+ N +II GT
Sbjct: 204 TDKRNGVIIAGYCVEGT 220
>gi|19074699|ref|NP_586205.1| similarity to HYPOTHETICAL PROTEIN Y162_METJA [Encephalitozoon
cuniculi GB-M1]
Length = 730
Score = 53.9 bits (128), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 26/69 (37%), Positives = 40/69 (57%), Gaps = 1/69 (1%)
Query: 51 NPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTS 110
NPF FK+VK + P V++ S ++ G SRDLF +WCS +N++II
Sbjct: 383 NPFAFKYVKNLKGIDSFDD-EGPCVIMASPGMLQSGLSRDLFERWCSDSKNAVIIPGYCV 441
Query: 111 PGTLARDLI 119
GTLA++++
Sbjct: 442 DGTLAKEIL 450
>gi|66820693|ref|XP_643926.1| beta-lactamase domain-containing protein [Dictyostelium discoideum
AX4]
gi|74860395|sp|Q86A79.1|CPSF3_DICDI RecName: Full=Cleavage and polyadenylation specificity factor
subunit 3; Short=Cleavage and polyadenylation
specificity factor 3
gi|60472339|gb|EAL70292.1| beta-lactamase domain-containing protein [Dictyostelium discoideum
AX4]
Length = 774
Score = 53.9 bits (128), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 45/86 (52%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I M+D++ F+ + NPF FKH+K + P V + S ++ G SR LF
Sbjct: 313 INMMNDRVRAQFDVS--NPFEFKHIKNIKGIESFDD-RGPCVFMASPGMLQSGLSRQLFE 369
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
+WCS N I+I + GTLA+ ++
Sbjct: 370 RWCSDKRNGIVIPGYSVEGTLAKHIM 395
>gi|254582142|ref|XP_002497056.1| ZYRO0D14410p [Zygosaccharomyces rouxii]
gi|238939948|emb|CAR28123.1| ZYRO0D14410p [Zygosaccharomyces rouxii]
Length = 772
Score = 53.9 bits (128), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 25/86 (29%), Positives = 48/86 (55%), Gaps = 1/86 (1%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+D + + F ++ NPF FK++ ++ E P V+L S ++ G SR++
Sbjct: 292 VNMMNDDIRRKFRDSQTNPFVFKNISYLKNIDEFQDF-GPSVMLASPGMLQNGLSREVLE 350
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
+WC +N ++IT + GT+A+ L+
Sbjct: 351 RWCPEGKNLVLITGYSVEGTMAKFLM 376
>gi|296418744|ref|XP_002838985.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295634979|emb|CAZ83176.1| unnamed protein product [Tuber melanosporum]
Length = 783
Score = 53.9 bits (128), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 28/88 (31%), Positives = 51/88 (57%), Gaps = 4/88 (4%)
Query: 34 IEWMSDKLMKS-FEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDL 91
I M+DK+ ++ FEG NP+ F+ V+ ++ V V++++P M + G SR+L
Sbjct: 297 INTMNDKIKRAMFEGEGRNPWDFRWVRSLKTIDRFEDVGG--CVMLASPGMLQNGVSREL 354
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLI 119
+WC P N ++IT + GT+A+ ++
Sbjct: 355 LERWCPDPRNGLVITGYSVEGTMAKQIM 382
>gi|340545979|gb|AEK51788.1| cleavage and polyadenylation specific factor 3 [Heteronotia binoei]
gi|402696941|gb|AFQ90659.1| 73kDa cleavage and polyadenylation specific factor 3, partial
[Malaclemys terrapin]
gi|402696943|gb|AFQ90660.1| 73kDa cleavage and polyadenylation specific factor 3, partial
[Testudo hermanni]
Length = 220
Score = 53.9 bits (128), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 29/77 (37%), Positives = 39/77 (50%), Gaps = 3/77 (3%)
Query: 37 MSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWC 96
M+DK+ K NNPF FKH+ S+ + P VV+ S M+ G SR+LF WC
Sbjct: 147 MNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFESWC 203
Query: 97 SSPENSIIITNRTSPGT 113
+ N +II GT
Sbjct: 204 TDKRNGVIIAGYCVEGT 220
>gi|339237605|ref|XP_003380357.1| cleavage and polyadenylation specificity factor subunit 3
[Trichinella spiralis]
gi|316976818|gb|EFV60027.1| cleavage and polyadenylation specificity factor subunit 3
[Trichinella spiralis]
Length = 687
Score = 53.5 bits (127), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 44/86 (51%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I M+ + K A NPF FKHV+ S+ + P VVL S ++ G SR+LF
Sbjct: 294 ISGMNQNIQKQI--AVQNPFLFKHVRSLRSIDFFEDI-GPCVVLASPGMLQSGLSRELFE 350
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ +N II GTLA+ ++
Sbjct: 351 MWCTDTKNGCIIAGYCVEGTLAKHIL 376
>gi|71017515|ref|XP_758988.1| hypothetical protein UM02841.1 [Ustilago maydis 521]
gi|46098766|gb|EAK83999.1| hypothetical protein UM02841.1 [Ustilago maydis 521]
Length = 979
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 34/130 (26%), Positives = 63/130 (48%), Gaps = 24/130 (18%)
Query: 47 GARNNPFHFKHVKLCHSLAEL-AKVPS--PKVVLVSTPDMECGFSRDLFFQWCSSPENSI 103
A ++P FK+VK+ +L + +P KVVL P M G SR L ++ +P + +
Sbjct: 423 AAASSPLDFKYVKVFPTLQAMDEAIPQDQAKVVLAVPPSMTHGPSRKLLARFAQNPNDVV 482
Query: 104 IITNRTSPGTLARDLIE---------------------LGGNRTLTLQVKKRIRLEGEEL 142
++ +R PG+L R+L + + N +L ++K ++ LEG+EL
Sbjct: 483 VLISRGEPGSLCRELWDAWNTNQSKGFSWSQGKLGQAVVASNTSLRFELKSKVPLEGDEL 542
Query: 143 EEYQKKKDKE 152
+++ + E
Sbjct: 543 RAHREAEQAE 552
>gi|393217572|gb|EJD03061.1| Metallo-hydrolase/oxidoreductase [Fomitiporia mediterranea MF3/22]
Length = 826
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 33/89 (37%), Positives = 45/89 (50%), Gaps = 4/89 (4%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAK--VPSPKVVLVSTPDM-ECGFSRD 90
I M+ + F R+NPF FKHV + K P V++ TP M + G SR+
Sbjct: 284 IHTMNSNIRSRF-AKRDNPFVFKHVSNIPQVRGWEKRIAEGPPCVILCTPGMLQPGPSRE 342
Query: 91 LFFQWCSSPENSIIITNRTSPGTLARDLI 119
L W P N +IIT + GTLARD++
Sbjct: 343 LLELWAPDPRNGLIITGYSVEGTLARDIV 371
>gi|391348443|ref|XP_003748457.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
3-like [Metaseiulus occidentalis]
Length = 673
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 26/72 (36%), Positives = 39/72 (54%), Gaps = 1/72 (1%)
Query: 48 ARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITN 107
A +NPF FKH+ S+ V P V++ + M+ G SR+LF WC +N +II
Sbjct: 296 AISNPFVFKHISNLKSIDHFDDV-GPCVIMATPGMMQSGLSRELFEAWCGDTKNGVIIAG 354
Query: 108 RTSPGTLARDLI 119
GTLA+ ++
Sbjct: 355 YCVEGTLAKQIL 366
>gi|443898849|dbj|GAC76183.1| mRNA cleavage and polyadenylation factor II complex, subunit CFT2
[Pseudozyma antarctica T-34]
Length = 1135
Score = 53.5 bits (127), Expect = 1e-04, Method: Composition-based stats.
Identities = 36/132 (27%), Positives = 67/132 (50%), Gaps = 27/132 (20%)
Query: 50 NNPFHFKHVKLCHSLAELAK-VPS--PKVVLVSTPDMECGFSRDLFFQWCSSPENSIIIT 106
++P FK+V++ SL ++ + +P +VVL P M G SR L ++ +P ++I++
Sbjct: 503 SSPLDFKYVRVFASLQQMDEAIPQDQARVVLAVPPSMTHGPSRRLLARFARNPNDAIVLI 562
Query: 107 NRTSPGTLARDLIE--------------------LGGNRTLTLQVKKRIRLEGEE----L 142
+R PG+L R L + + G T+ +++ ++ LEGEE L
Sbjct: 563 SRGEPGSLCRQLWDAWNQRQPKGFSWTKGKLGEVVSGEATVRYELQSKVPLEGEELRLHL 622
Query: 143 EEYQKKKDKEAK 154
E Q ++D+ A+
Sbjct: 623 ESEQAERDRLAQ 634
>gi|426197081|gb|EKV47008.1| hypothetical protein AGABI2DRAFT_203789 [Agaricus bisporus var.
bisporus H97]
Length = 794
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 31/75 (41%), Positives = 41/75 (54%), Gaps = 5/75 (6%)
Query: 49 RNNPFHFKHVKLCHSL----AELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSII 104
R+NPF FKH+ ++A P P VVL S M+ G SR+LF WC N +I
Sbjct: 308 RDNPFVFKHISNVPQTRGWEKKIADGP-PCVVLASPGFMQVGPSRELFEHWCPDARNGLI 366
Query: 105 ITNRTSPGTLARDLI 119
IT + GT ARD++
Sbjct: 367 ITGYSIEGTPARDIM 381
>gi|321264788|ref|XP_003197111.1| cleavage and polyadenylation specificity factor [Cryptococcus
gattii WM276]
gi|317463589|gb|ADV25324.1| Cleavage and polyadenylation specificity factor, putative
[Cryptococcus gattii WM276]
Length = 778
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 32/93 (34%), Positives = 46/93 (49%), Gaps = 8/93 (8%)
Query: 49 RNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 108
R+NPF F+ VK +L + P V++ S M G SRDL +W +N +I+T
Sbjct: 323 RDNPFDFRFVKWLKDPQKLRENKGPCVIMSSPQFMSFGLSRDLLEEWAPDSKNGVIVTGY 382
Query: 109 TSPGTLARDLIE--------LGGNRTLTLQVKK 133
+ GT+AR L+ GGN L VK+
Sbjct: 383 SIEGTMARTLLSEPDHIESLKGGNIPRRLTVKE 415
>gi|409080187|gb|EKM80547.1| hypothetical protein AGABI1DRAFT_70926 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 841
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 31/75 (41%), Positives = 41/75 (54%), Gaps = 5/75 (6%)
Query: 49 RNNPFHFKHVKLCHSL----AELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSII 104
R+NPF FKH+ ++A P P VVL S M+ G SR+LF WC N +I
Sbjct: 308 RDNPFVFKHISNVPQTRGWEKKIADGP-PCVVLASPGFMQVGPSRELFEHWCPDARNGLI 366
Query: 105 ITNRTSPGTLARDLI 119
IT + GT ARD++
Sbjct: 367 ITGYSIEGTPARDIM 381
>gi|238882385|gb|EEQ46023.1| hypothetical protein CAWG_04366 [Candida albicans WO-1]
Length = 783
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 30/86 (34%), Positives = 52/86 (60%), Gaps = 4/86 (4%)
Query: 37 MSDKL-MKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFFQ 94
M+DK+ + S ++NPF FK++K L++ + P VV V+TP M + G SR L +
Sbjct: 300 MNDKIRLSSASSEKSNPFDFKYIKSIKDLSKFQDM-GPSVV-VATPGMLQAGVSRQLLEK 357
Query: 95 WCSSPENSIIITNRTSPGTLARDLIE 120
W +N +I+T + GT+A++L++
Sbjct: 358 WAPDGKNLVILTGYSVEGTMAKELLK 383
>gi|213409816|ref|XP_002175678.1| endoribonuclease ysh1 [Schizosaccharomyces japonicus yFS275]
gi|212003725|gb|EEB09385.1| endoribonuclease ysh1 [Schizosaccharomyces japonicus yFS275]
Length = 771
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 27/83 (32%), Positives = 47/83 (56%), Gaps = 3/83 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I M+DK+ K+F A NPF F+++K S+ + + P V+L S ++ G SR L
Sbjct: 264 INMMNDKIRKAF--AERNPFIFRYIKSLRSIDKFDDI-GPSVILASPGMLQNGVSRTLLE 320
Query: 94 QWCSSPENSIIITNRTSPGTLAR 116
+W N++++T + GT+A+
Sbjct: 321 RWAPDARNTLLLTGYSVEGTMAK 343
>gi|393245131|gb|EJD52642.1| Metallo-hydrolase/oxidoreductase [Auricularia delicata TFB-10046
SS5]
Length = 751
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 29/89 (32%), Positives = 48/89 (53%), Gaps = 4/89 (4%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAK--VPSPKVVLVSTPDM-ECGFSRD 90
I M+ + + F R+NPF FKH+ L + P V++++P M + G SR+
Sbjct: 284 IHTMNSNIRQRF-ARRDNPFIFKHISHLPQTRGLERKIADGPPCVVLASPGMLQSGTSRE 342
Query: 91 LFFQWCSSPENSIIITNRTSPGTLARDLI 119
L W P N++++T + GTLARD++
Sbjct: 343 LLELWAPDPRNALVVTGYSVEGTLARDIL 371
>gi|58270576|ref|XP_572444.1| hypothetical protein CNH02710 [Cryptococcus neoformans var.
neoformans JEC21]
gi|134118056|ref|XP_772409.1| hypothetical protein CNBL2750 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|338819805|sp|P0CM89.1|YSH1_CRYNB RecName: Full=Endoribonuclease YSH1; AltName: Full=mRNA
3'-end-processing protein YSH1
gi|338819806|sp|P0CM88.1|YSH1_CRYNJ RecName: Full=Endoribonuclease YSH1; AltName: Full=mRNA
3'-end-processing protein YSH1
gi|50255022|gb|EAL17762.1| hypothetical protein CNBL2750 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|57228702|gb|AAW45137.1| hypothetical protein CNH02710 [Cryptococcus neoformans var.
neoformans JEC21]
Length = 773
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 32/93 (34%), Positives = 46/93 (49%), Gaps = 8/93 (8%)
Query: 49 RNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 108
R+NPF F+ VK +L + P V++ S M G SRDL +W +N +I+T
Sbjct: 323 RDNPFDFRFVKWLKDPQKLRENKGPCVIMSSPQFMSFGLSRDLLEEWAPDSKNGVIVTGY 382
Query: 109 TSPGTLARDLIE--------LGGNRTLTLQVKK 133
+ GT+AR L+ GGN L VK+
Sbjct: 383 SIEGTMARTLLSEPDHIESLKGGNVPRRLTVKE 415
>gi|68489322|ref|XP_711502.1| hypothetical protein CaO19.12941 [Candida albicans SC5314]
gi|68489371|ref|XP_711478.1| hypothetical protein CaO19.5486 [Candida albicans SC5314]
gi|74584420|sp|Q59P50.1|YSH1_CANAL RecName: Full=Endoribonuclease YSH1; AltName: Full=mRNA
3'-end-processing protein YSH1
gi|46432783|gb|EAK92250.1| hypothetical protein CaO19.5486 [Candida albicans SC5314]
gi|46432809|gb|EAK92275.1| hypothetical protein CaO19.12941 [Candida albicans SC5314]
Length = 870
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 30/86 (34%), Positives = 52/86 (60%), Gaps = 4/86 (4%)
Query: 37 MSDKL-MKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFFQ 94
M+DK+ + S ++NPF FK++K L++ + P VV V+TP M + G SR L +
Sbjct: 387 MNDKIRLSSASSEKSNPFDFKYIKSIKDLSKFQDM-GPSVV-VATPGMLQAGVSRQLLEK 444
Query: 95 WCSSPENSIIITNRTSPGTLARDLIE 120
W +N +I+T + GT+A++L++
Sbjct: 445 WAPDGKNLVILTGYSVEGTMAKELLK 470
>gi|405124298|gb|AFR99060.1| endoribonuclease YSH1 [Cryptococcus neoformans var. grubii H99]
Length = 770
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 32/93 (34%), Positives = 46/93 (49%), Gaps = 8/93 (8%)
Query: 49 RNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 108
R+NPF F+ VK +L + P V++ S M G SRDL +W +N +I+T
Sbjct: 323 RDNPFDFRFVKWLKDPQKLRENKGPCVIMSSPQFMSFGLSRDLLEEWAPDSKNGVIVTGY 382
Query: 109 TSPGTLARDLIE--------LGGNRTLTLQVKK 133
+ GT+AR L+ GGN L VK+
Sbjct: 383 SIEGTMARTLLSEPDHIESLKGGNIPRRLTVKE 415
>gi|429963183|gb|ELA42727.1| hypothetical protein VICG_00042 [Vittaforma corneae ATCC 50505]
Length = 642
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 33/113 (29%), Positives = 60/113 (53%), Gaps = 12/113 (10%)
Query: 15 NPFHFKHVKLCHSLAELAK--------IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAE 66
NP H K + + ++ A K I M++++ K+ NPF F++V+ +
Sbjct: 255 NP-HLKGIPIYYASALAQKCMSVYQTYINMMNERIQKA--SLVKNPFDFRNVESIKDIQS 311
Query: 67 LAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLI 119
K P V++ S ++ GFSR+LF +WCS+ +N ++I GTLA++++
Sbjct: 312 F-KDTGPCVMMASPGMLQSGFSRELFEKWCSNEKNGVVIPGYCVEGTLAKEIL 363
>gi|170093225|ref|XP_001877834.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164647693|gb|EDR11937.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 772
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 34/86 (39%), Positives = 45/86 (52%), Gaps = 9/86 (10%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I M++ + F R+NPF FK C +AE P VVL S M+ G SR+LF
Sbjct: 284 IHTMNNNIRSRF-AKRDNPFVFK----CKKIAE----GPPCVVLASPGFMQVGPSRELFE 334
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
W N +IIT + GTLARD++
Sbjct: 335 LWAPDARNGLIITGYSIEGTLARDIM 360
>gi|164663111|ref|XP_001732677.1| hypothetical protein MGL_0452 [Malassezia globosa CBS 7966]
gi|159106580|gb|EDP45463.1| hypothetical protein MGL_0452 [Malassezia globosa CBS 7966]
Length = 862
Score = 52.8 bits (125), Expect = 2e-04, Method: Composition-based stats.
Identities = 35/137 (25%), Positives = 66/137 (48%), Gaps = 28/137 (20%)
Query: 52 PFHFKHVKLCHSLAELAKVPSP---KVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 108
P F ++ S+ L + +P KVVL + P + G SR L ++ P+ +I+T+R
Sbjct: 379 PLDFSGLRFYSSVEALHQALTPSQVKVVLATPPALSHGLSRQLLPEFLCDPDALLILTSR 438
Query: 109 TSPGTLARDL----------------------IELGGNRTLTLQVKKRIRLEGEELEEY- 145
+P +L R+L + +GG L+ ++++R+ L G+EL Y
Sbjct: 439 GTPSSLVRNLWDRWNAKQADRDAWRQGHVGVPVSVGGQ--LSYELRRRVPLAGDELRTYV 496
Query: 146 QKKKDKEAKDKQEKEKI 162
+++K +EA + +I
Sbjct: 497 ERQKAREAAADAPRARI 513
>gi|440632320|gb|ELR02239.1| hypothetical protein GMDG_05312 [Geomyces destructans 20631-21]
Length = 988
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 43/171 (25%), Positives = 64/171 (37%), Gaps = 57/171 (33%)
Query: 34 IEWMSDKLMKSFEG-----------------ARNNPFHFKHVKLCHSLAELAKV------ 70
+EWM D +++ FE A PF FKH++L ++ V
Sbjct: 334 LEWMDDGIIREFEAIAGGIDRQPNKPSEPRQAGAGPFDFKHLRLIEKKGGVSAVLNNDAT 393
Query: 71 ----PSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSP--------------- 111
P KV+L S ++ GFS+D+ + N +I+T + S
Sbjct: 394 KDGKPMAKVILASDRSLDWGFSKDILRNIAADSRNLVILTEKMSKLPDVGRSSGVSRTLW 453
Query: 112 --------------GTLARDLIEL-GGNRTLTLQVKKRIRLEGEELEEYQK 147
G+ ++ GG R LTL KR LEG EL YQ+
Sbjct: 454 SWWDERKDGVALEQGSTGESFAQVYGGGRELTLADVKREALEGNELLSYQQ 504
>gi|294658126|ref|XP_460457.2| DEHA2F02134p [Debaryomyces hansenii CBS767]
gi|218511903|sp|Q6BMW3.2|YSH1_DEBHA RecName: Full=Endoribonuclease YSH1; AltName: Full=mRNA
3'-end-processing protein YSH1
gi|202952895|emb|CAG88764.2| DEHA2F02134p [Debaryomyces hansenii CBS767]
Length = 815
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 25/71 (35%), Positives = 42/71 (59%), Gaps = 1/71 (1%)
Query: 49 RNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 108
+ NPF FK +K +L + P VV+ S ++ G SR+L +W P+N++I+T
Sbjct: 323 KQNPFQFKFIKSIKNLDKFQDF-GPCVVVASPGMLQNGVSRELLERWAPDPKNAVIMTGY 381
Query: 109 TSPGTLARDLI 119
+ GT+A+DL+
Sbjct: 382 SVEGTMAKDLL 392
>gi|395330425|gb|EJF62808.1| hypothetical protein DICSQDRAFT_135076 [Dichomitus squalens
LYAD-421 SS1]
Length = 943
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 36/116 (31%), Positives = 54/116 (46%), Gaps = 23/116 (19%)
Query: 53 FHFKHVKLCHSLAELAKVPS---PKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRT 109
FKHV+ S L S PK++L + G SR +F ++ P+N +++T R
Sbjct: 386 LRFKHVEFFLSPQALMSTYSSKDPKLILAVPATLSHGPSRAIFAEFAEIPDNVVLLTGRG 445
Query: 110 SPGTLARDLIE----------------LGGN----RTLTLQVKKRIRLEGEELEEY 145
PGTL R L + +G N L L++ ++ L+GEELEEY
Sbjct: 446 EPGTLGRLLFDKWNDSQREEAKWDRGKIGNNIMMDGVLRLEMHSKVPLQGEELEEY 501
>gi|448517227|ref|XP_003867743.1| endoribonuclease [Candida orthopsilosis Co 90-125]
gi|380352082|emb|CCG22306.1| endoribonuclease [Candida orthopsilosis]
Length = 769
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 27/71 (38%), Positives = 45/71 (63%), Gaps = 3/71 (4%)
Query: 51 NPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFFQWCSSPENSIIITNRT 109
NPF FK++K +L++ + + P VV V+TP M + G SR L +W +N +I+T +
Sbjct: 318 NPFDFKYIKSIRNLSKFSDL-GPSVV-VATPGMLQAGVSRQLLEKWAPEQKNLVILTGYS 375
Query: 110 SPGTLARDLIE 120
GT+A+DL++
Sbjct: 376 VEGTMAKDLLK 386
>gi|326426580|gb|EGD72150.1| cleavage and polyadenylation specificity factor subunit 3
[Salpingoeca sp. ATCC 50818]
Length = 790
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 24/73 (32%), Positives = 44/73 (60%), Gaps = 2/73 (2%)
Query: 50 NNPFHFKHVKLCHSLAEL-AKVPSPKVVLVSTPDM-ECGFSRDLFFQWCSSPENSIIITN 107
NNPF+F+HV +L + + V++++P M + G SR++F +W S+ N +++
Sbjct: 339 NNPFNFRHVSYIRNLHQFDGEYGGGPCVMLASPGMLQSGLSREIFERWASNKANCVLLAG 398
Query: 108 RTSPGTLARDLIE 120
GTLA+DL++
Sbjct: 399 YVVNGTLAKDLLK 411
>gi|343429654|emb|CBQ73226.1| conserved hypothetical protein [Sporisorium reilianum SRZ2]
Length = 1039
Score = 52.4 bits (124), Expect = 2e-04, Method: Composition-based stats.
Identities = 39/136 (28%), Positives = 64/136 (47%), Gaps = 28/136 (20%)
Query: 47 GARNNPFHFKHVKLCHSLAELAK-VP--SPKVVLVSTPDMECGFSRDLFFQWCSSPENSI 103
A ++P FK+V++ SL + + +P KVVL P M G SR L ++ +P + +
Sbjct: 427 AAASSPLDFKYVRVFPSLQAMDEAIPHDQAKVVLAVPPSMTHGPSRRLLARFAQNPNDVV 486
Query: 104 IITNRTSPGTLARDL---------------------IELGGNRTLTLQVKKRIRLEGEE- 141
++ +R PG+L R+L I L ++K ++ LEGEE
Sbjct: 487 VLISRGEPGSLCRELWNAWNTHQSKGFSWAQGKLGQIVTPTKTALRFELKSKVPLEGEEL 546
Query: 142 ---LEEYQKKKDKEAK 154
LE Q ++DK A+
Sbjct: 547 RAHLEAEQAERDKLAQ 562
>gi|255724858|ref|XP_002547358.1| hypothetical protein CTRG_01665 [Candida tropicalis MYA-3404]
gi|240135249|gb|EER34803.1| hypothetical protein CTRG_01665 [Candida tropicalis MYA-3404]
Length = 783
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 37/110 (33%), Positives = 60/110 (54%), Gaps = 12/110 (10%)
Query: 37 MSDKL-MKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFFQ 94
M+DK+ + S G ++NPF K +K L++ + P VV V+TP M + G SR L +
Sbjct: 300 MNDKIRLSSSSGEKSNPFDLKFIKSIKDLSKFQDM-GPSVV-VATPGMLQAGVSRQLLEK 357
Query: 95 WCSSPENSIIITNRTSPGTLARDL------IELGGNRTLTLQVKKRIRLE 138
W +N +I+T + GT+A++L I+ N LT + +RI +E
Sbjct: 358 WAPDNKNLVILTGYSVEGTMAKELLKEPTMIQSATNPDLT--IPRRIGIE 405
>gi|392575747|gb|EIW68879.1| hypothetical protein TREMEDRAFT_44189 [Tremella mesenterica DSM
1558]
Length = 738
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 27/71 (38%), Positives = 37/71 (52%)
Query: 49 RNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 108
R+NPF FK+VK SP VV+ S M G SR+L W +N +I+T
Sbjct: 319 RDNPFDFKYVKPLKDGRRGDNFKSPCVVMASAQFMSFGLSRELLEDWAPGEKNGVIVTGY 378
Query: 109 TSPGTLARDLI 119
+ GT+AR L+
Sbjct: 379 SIEGTMARTLL 389
>gi|396500483|ref|XP_003845730.1| similar to cleavage and polyadenylation specificity factor subunit
2 [Leptosphaeria maculans JN3]
gi|312222311|emb|CBY02251.1| similar to cleavage and polyadenylation specificity factor subunit
2 [Leptosphaeria maculans JN3]
Length = 954
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 30/92 (32%), Positives = 44/92 (47%), Gaps = 15/92 (16%)
Query: 34 IEWMSDKLMKSFE---GARNN---------PFHFKHVKLCHS---LAELAKVPSPKVVLV 78
+EWM + ++K FE GA N PF FKH+ L +A + P+V+L
Sbjct: 336 LEWMEEGIVKEFEVASGANNGKDDSKAARVPFDFKHITLLERKTRVARMLATSGPRVILA 395
Query: 79 STPDMECGFSRDLFFQWCSSPENSIIITNRTS 110
S +E GFS+D S +N I++T R
Sbjct: 396 SDTTLEWGFSKDAIKSLASDEKNLILLTERAG 427
>gi|260815130|ref|XP_002602327.1| hypothetical protein BRAFLDRAFT_282200 [Branchiostoma floridae]
gi|229287635|gb|EEN58339.1| hypothetical protein BRAFLDRAFT_282200 [Branchiostoma floridae]
Length = 687
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 44/86 (51%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I M++K+ K + +NPF FKH+ + + P VV+ S M+ G SR+LF
Sbjct: 293 INAMNEKIRKQI--SVSNPFVFKHISNLKGMDHFDDI-GPSVVMASPGMMQSGLSRELFE 349
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
WC+ N II GTLA+ ++
Sbjct: 350 SWCTDRRNGCIIAGYCVEGTLAKHIM 375
>gi|241951638|ref|XP_002418541.1| cleavage and polyadenylation factor specificity complex subunit,
putative; endonuclease, putative [Candida dubliniensis
CD36]
gi|223641880|emb|CAX43843.1| cleavage and polyadenylation factor specificity complex subunit,
putative [Candida dubliniensis CD36]
Length = 787
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 37/110 (33%), Positives = 60/110 (54%), Gaps = 12/110 (10%)
Query: 37 MSDKL-MKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFFQ 94
M+DK+ + S ++NPF FK +K L++ + P VV V+TP M + G SR L +
Sbjct: 301 MNDKIRLSSASSKKSNPFDFKFIKSIKDLSKFQDM-GPSVV-VATPGMLQAGVSRQLLEK 358
Query: 95 WCSSPENSIIITNRTSPGTLARDL------IELGGNRTLTLQVKKRIRLE 138
W +N +I+T + GT+A++L I+ N LT + +RI +E
Sbjct: 359 WAPDGKNLVILTGYSVEGTMAKELLKEPTMIQSATNPDLT--IPRRIGIE 406
>gi|67597128|ref|XP_666125.1| cleavage and polyadenylation specifity factor protein
[Cryptosporidium hominis TU502]
gi|54657058|gb|EAL35898.1| cleavage and polyadenylation specifity factor protein
[Cryptosporidium hominis]
Length = 484
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 33/99 (33%), Positives = 46/99 (46%), Gaps = 17/99 (17%)
Query: 37 MSDKLMKSFEGARN--------------NPFHFKHVKLCHSLAELAKV---PSPKVVLVS 79
MS K M+ FE N NPF F ++K +SL E+ + P P VV+ +
Sbjct: 1 MSIKCMRVFETYINQCGESVRRQADLGINPFQFNYIKTVNSLNEIKDIIYNPGPCVVMAA 60
Query: 80 TPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDL 118
++ G SRD+F W N II+T GT A +L
Sbjct: 61 PGMLQNGTSRDIFEIWAPDKRNGIILTGYAVRGTPAYEL 99
>gi|269860830|ref|XP_002650133.1| cleavage and polyadenylation specificity factor, 73 kDa subunit
[Enterocytozoon bieneusi H348]
gi|220066453|gb|EED43934.1| cleavage and polyadenylation specificity factor, 73 kDa subunit
[Enterocytozoon bieneusi H348]
Length = 657
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 27/84 (32%), Positives = 48/84 (57%), Gaps = 2/84 (2%)
Query: 37 MSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWC 96
M ++ M+ RN PF FK+V+ A+ + P V++ S ++ G SRD+F +WC
Sbjct: 286 MMNERMQKLNLTRN-PFDFKNVENIKD-AKTVRDGGPCVIMASPGMLQSGVSRDIFERWC 343
Query: 97 SSPENSIIITNRTSPGTLARDLIE 120
S +N ++I GTLA+++++
Sbjct: 344 SDSKNGVVIAGYCVEGTLAKEVLK 367
>gi|429243009|ref|NP_594263.2| mRNA cleavage and polyadenylation specificity factor complex
endoribonuclease subunit Ysh1 [Schizosaccharomyces pombe
972h-]
gi|384872669|sp|O13794.2|YSH1_SCHPO RecName: Full=Endoribonuclease ysh1; AltName: Full=mRNA
3'-end-processing protein ysh1
gi|347834169|emb|CAB16227.2| mRNA cleavage and polyadenylation specificity factor complex
endoribonuclease subunit Ysh1 [Schizosaccharomyces
pombe]
Length = 757
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 27/85 (31%), Positives = 46/85 (54%), Gaps = 3/85 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+D + K F A NPF F+ VK +L + + P V+L S ++ G SR L
Sbjct: 293 VNMMNDNIRKIF--AERNPFIFRFVKSLRNLEKFDDI-GPSVILASPGMLQNGVSRTLLE 349
Query: 94 QWCSSPENSIIITNRTSPGTLARDL 118
+W P N++++T + GT+A+ +
Sbjct: 350 RWAPDPRNTLLLTGYSVEGTMAKQI 374
>gi|328867689|gb|EGG16071.1| beta-lactamase domain-containing protein [Dictyostelium
fasciculatum]
Length = 786
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 28/86 (32%), Positives = 47/86 (54%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I+ M++++ F A +NPF FKH+K + + P V + S ++ G SR LF
Sbjct: 306 IQMMNERIRAQF--AVSNPFIFKHIKDINGIDNFND-NGPCVFMASPGMLQSGLSRQLFE 362
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
+WCS N ++I + GTLA+ ++
Sbjct: 363 RWCSDRRNGVVIPGYSVEGTLAKHIM 388
>gi|164658265|ref|XP_001730258.1| hypothetical protein MGL_2640 [Malassezia globosa CBS 7966]
gi|159104153|gb|EDP43044.1| hypothetical protein MGL_2640 [Malassezia globosa CBS 7966]
Length = 741
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 26/71 (36%), Positives = 40/71 (56%), Gaps = 1/71 (1%)
Query: 49 RNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 108
R+NPF FKHV SL + P V++ S M+ G SR+L +W N +I++
Sbjct: 221 RDNPFVFKHVSNLRSLDKFDD-KGPCVMMASPGFMQSGISRELLERWAPDKRNGVIVSGY 279
Query: 109 TSPGTLARDLI 119
+ GT+ARD++
Sbjct: 280 SVEGTMARDIL 290
>gi|389746898|gb|EIM88077.1| hypothetical protein STEHIDRAFT_94995 [Stereum hirsutum FP-91666
SS1]
Length = 968
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 35/132 (26%), Positives = 67/132 (50%), Gaps = 24/132 (18%)
Query: 53 FHFKHVKL---CHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRT 109
FKH++ +L + PK++L + G SR LF + S P+N +++T+R
Sbjct: 388 LRFKHLEFFLNPQALQQTYSSKDPKLILAVPASLSHGPSRSLFADFASIPDNVVLLTSRG 447
Query: 110 SPGTLARDLIE----------------LGGN----RTLTLQVKKRIRLEGEELEEYQKKK 149
GTL+R L + +G N +L+++++ ++ L+G ELE + ++
Sbjct: 448 EEGTLSRVLFDRWNNAQRAEDKWDKGKIGSNVMMDGSLSIELRSKVALQGTELEIFL-QR 506
Query: 150 DKEAKDKQEKEK 161
++ AK+KQ E+
Sbjct: 507 ERAAKEKQTAEQ 518
>gi|66356658|ref|XP_625507.1| cleavage and polyadenylation specifity factor protein, CPSF
metallobeta-lactamase [Cryptosporidium parvum Iowa II]
gi|46226496|gb|EAK87490.1| cleavage and polyadenylation specifity factor protein, CPSF
metallobeta-lactamase [Cryptosporidium parvum Iowa II]
Length = 780
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 33/99 (33%), Positives = 46/99 (46%), Gaps = 17/99 (17%)
Query: 37 MSDKLMKSFEGARN--------------NPFHFKHVKLCHSLAELAKV---PSPKVVLVS 79
MS K M+ FE N NPF F ++K +SL E+ + P P VV+ +
Sbjct: 297 MSIKCMRVFETYINQCGDSVRRQADLGINPFQFNYIKTVNSLNEIKDIIYNPGPCVVMAA 356
Query: 80 TPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDL 118
++ G SRD+F W N II+T GT A +L
Sbjct: 357 PGMLQNGTSRDIFEIWAPDKRNGIILTGYAVRGTPAYEL 395
>gi|361125691|gb|EHK97723.1| putative Cleavage factor two protein 2 [Glarea lozoyensis 74030]
Length = 835
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 39/168 (23%), Positives = 66/168 (39%), Gaps = 55/168 (32%)
Query: 34 IEWMSDKLMKSFEGARN----------------NPFHFKHVKLCHSLAELAKVPS----- 72
+EWM + +++ FE PF FKH++L +++ K+
Sbjct: 210 LEWMDESIVREFEAVAGQNKQDDDPDAKLRGIGGPFDFKHLRLLERKSQIDKIMQEVDNH 269
Query: 73 ----PKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR-------------------- 108
KV+L S +E GFS+++F + C N +I T R
Sbjct: 270 GRSIGKVILASDTSLEWGFSKEVFRRICDDRRNLVIFTERMGQPKMENPKLGMARTLWSW 329
Query: 109 --------TSPGTLARDLIE--LGGNRTLTLQVKKRIRLEGEELEEYQ 146
+ + D++E GG R L ++ R+ LEG++L YQ
Sbjct: 330 WEDRSDGVATETAASGDVLEQVYGGGRQLEMRETTRVALEGDDLAAYQ 377
>gi|308473777|ref|XP_003099112.1| hypothetical protein CRE_27737 [Caenorhabditis remanei]
gi|308267766|gb|EFP11719.1| hypothetical protein CRE_27737 [Caenorhabditis remanei]
Length = 125
Score = 51.6 bits (122), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 31/118 (26%), Positives = 48/118 (40%), Gaps = 40/118 (33%)
Query: 159 KEKIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMEC 218
K+ IP H F+N+ +LSDFK G EF+ G L+ G ++RR E
Sbjct: 48 KKLIPIHQAIFVNDPKLSDFKSLFVEKGFKAEFLSGTLLIKGGKCSIRR--------GEM 99
Query: 219 GFSRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
GFS +EG L +YY+++ L ++Q+ +L
Sbjct: 100 GFS--------------------------------MEGALEKDYYKLRNLFFNQFGVL 125
>gi|336371935|gb|EGO00275.1| hypothetical protein SERLA73DRAFT_73000 [Serpula lacrymans var.
lacrymans S7.3]
gi|336384684|gb|EGO25832.1| hypothetical protein SERLADRAFT_437559 [Serpula lacrymans var.
lacrymans S7.9]
Length = 748
Score = 51.6 bits (122), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 31/78 (39%), Positives = 44/78 (56%), Gaps = 11/78 (14%)
Query: 49 RNNPFHFKHVKLCHSLAE-------LAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPEN 101
R+NPF FKH+ +LA+ +A P P VVL S ++ G SR+L W P N
Sbjct: 298 RDNPFVFKHIS---NLAQPRGWERKIADGP-PCVVLASPGFLQSGPSRELLELWAPDPRN 353
Query: 102 SIIITNRTSPGTLARDLI 119
+I+T + GTLARD++
Sbjct: 354 GLIVTGYSVEGTLARDIM 371
>gi|219123319|ref|XP_002181974.1| cleavage and polyadenylation specific factor [Phaeodactylum
tricornutum CCAP 1055/1]
gi|217406575|gb|EEC46514.1| cleavage and polyadenylation specific factor [Phaeodactylum
tricornutum CCAP 1055/1]
Length = 1001
Score = 51.2 bits (121), Expect = 4e-04, Method: Composition-based stats.
Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 5/81 (6%)
Query: 32 AKIEWMSDKLMKSFEG-ARNNPFHFKHVKLCHSLAELAKV----PSPKVVLVSTPDMECG 86
+++EWM KL F+ A +P HV +C + EL K P+P V+ S +E G
Sbjct: 325 SQLEWMGTKLGHVFDAQAGPHPLTLPHVHVCTNTRELEKFLAENPNPACVVASGLSLEGG 384
Query: 87 FSRDLFFQWCSSPENSIIITN 107
+RDL W + +N+I+ T+
Sbjct: 385 PARDLLLSWADNVDNAILFTD 405
>gi|302415331|ref|XP_003005497.1| cleavage and polyadenylation specificity factor subunit 2
[Verticillium albo-atrum VaMs.102]
gi|261354913|gb|EEY17341.1| cleavage and polyadenylation specificity factor subunit 2
[Verticillium albo-atrum VaMs.102]
Length = 739
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 43/167 (25%), Positives = 70/167 (41%), Gaps = 53/167 (31%)
Query: 34 IEWMSDKLMKSFEGARNN----------------PFHFKHVKLCHSLAELAKVPS----- 72
+EWM D +++ FE + PF F+ ++L A++ K+ S
Sbjct: 299 LEWMDDNIVREFEATADGQRKANGNDGKHAKDAAPFDFRFMRLVEREAQIRKLLSQTSEN 358
Query: 73 ----PKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPG-----TLAR------- 116
+V++ S +E GFS +L + N +I+T++ S ++AR
Sbjct: 359 VRSDGRVIVASDNSLEWGFSHELLRELAKDSRNLLILTDKPSLAQSGQPSIARILWDWWQ 418
Query: 117 --------------DLIEL--GGNRTLTLQVKKRIRLEGEELEEYQK 147
D IEL GG R LT+ +R LEG+EL YQ+
Sbjct: 419 ERRDGVSIDQSDSNDSIELVYGGGRALTVTDARRQGLEGDELSTYQQ 465
>gi|116203607|ref|XP_001227614.1| hypothetical protein CHGG_09687 [Chaetomium globosum CBS 148.51]
gi|88175815|gb|EAQ83283.1| hypothetical protein CHGG_09687 [Chaetomium globosum CBS 148.51]
Length = 956
Score = 51.2 bits (121), Expect = 5e-04, Method: Composition-based stats.
Identities = 42/166 (25%), Positives = 67/166 (40%), Gaps = 52/166 (31%)
Query: 34 IEWMSDKLMKSFEGA----------------RNNPFHFKHVKLCHSLAELAKV------- 70
+EWM D +++ FE PF FK+++L A++ +V
Sbjct: 331 LEWMDDSIVREFEAVAGGQRGNGGSGGGKGKDAGPFDFKYLRLLERKAQVERVLQQAADA 390
Query: 71 --PSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTS--PGT--LARDLIE---- 120
P +V++ + +E GFS+++ P N +I+T + S P T +AR L E
Sbjct: 391 SEPKGRVIVATDSSLEWGFSKEVMRAIAGDPRNLVILTEKPSLNPNTPSIARTLWEWWRE 450
Query: 121 -------------------LGGNRTLTLQVKKRIRLEGEELEEYQK 147
GG R L + R LEG EL+ YQ+
Sbjct: 451 RKDGVAVEQTSNGDTFEQVYGGGRDLEVGHATRQALEGGELDIYQQ 496
>gi|346976151|gb|EGY19603.1| cleavage and polyadenylation specificity factor subunit 2
[Verticillium dahliae VdLs.17]
Length = 972
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 43/167 (25%), Positives = 70/167 (41%), Gaps = 53/167 (31%)
Query: 34 IEWMSDKLMKSFEGARNN----------------PFHFKHVKLCHSLAELAKVPS----- 72
+EWM D +++ FE + PF F+ ++L A++ K+ S
Sbjct: 333 LEWMDDNIVREFEATADGQRKANGNDGKHAKDAAPFDFRFMRLVEREAQIRKLLSQTSDN 392
Query: 73 ----PKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPG-----TLAR------- 116
+V++ S +E GFS+ L + N +I+T++ S ++AR
Sbjct: 393 VQSEGRVIVASDNSLEWGFSQQLLRELAKDSRNLLILTDKPSLAQSGQPSIARTLWDWWQ 452
Query: 117 --------------DLIEL--GGNRTLTLQVKKRIRLEGEELEEYQK 147
D IEL GG R L++ KR LEG+EL YQ+
Sbjct: 453 ERKDGVSIDQSDSNDSIELVYGGGRALSVTDAKRQGLEGDELSTYQQ 499
>gi|302899216|ref|XP_003048005.1| predicted protein [Nectria haematococca mpVI 77-13-4]
gi|256728937|gb|EEU42292.1| predicted protein [Nectria haematococca mpVI 77-13-4]
Length = 958
Score = 50.8 bits (120), Expect = 5e-04, Method: Composition-based stats.
Identities = 46/167 (27%), Positives = 72/167 (43%), Gaps = 55/167 (32%)
Query: 34 IEWMSDKLMKSFE----GARN----------NPFHFKHVKLCHSLAELAKVPSP------ 73
+EWM D +++ FE G R PF FK+++L A++ ++ S
Sbjct: 332 LEWMDDTIVQEFEAFAEGQRKVNGAGDKKEGGPFDFKYLRLLERKAQIVRLLSRGFENVE 391
Query: 74 ---KVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPG-------TLAR------- 116
+V+L S +E GFS+DL N +I+T++ PG ++AR
Sbjct: 392 TEGRVILASDSSIEWGFSKDLIKGLARDSRNLVILTDK--PGLSKDDKPSIARTLWDWWK 449
Query: 117 --------------DLIEL--GGNRTLTLQVKKRIRLEGEELEEYQK 147
D IEL G R L ++ +R LEG+EL YQ+
Sbjct: 450 ERKDGVSVEQTSSGDSIELVYAGGRELEIREAQRQALEGDELAVYQQ 496
>gi|189192102|ref|XP_001932390.1| cleavage and polyadenylation specificity factor subunit 2
[Pyrenophora tritici-repentis Pt-1C-BFP]
gi|187973996|gb|EDU41495.1| cleavage and polyadenylation specificity factor subunit 2
[Pyrenophora tritici-repentis Pt-1C-BFP]
Length = 954
Score = 50.8 bits (120), Expect = 5e-04, Method: Composition-based stats.
Identities = 39/159 (24%), Positives = 67/159 (42%), Gaps = 46/159 (28%)
Query: 34 IEWMSDKLMKSFEGARNN-----------------PFHFKHVKLCHSLAELAKV---PSP 73
+EWM + ++K FE + + PF F+H+ L +A++ P
Sbjct: 336 LEWMEEGIVKEFEASAADQDRRTKEGQEEERVAKVPFDFRHITLLERKTRVARMLAGAGP 395
Query: 74 KVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRT-----------------------S 110
+V+L S +E GFS+D S +N +I+T R+ S
Sbjct: 396 RVILASDATLEWGFSKDAIRSLASDEKNLVILTERSGELGSQKKGLGRYLWDLWNQRNAS 455
Query: 111 PGTLA--RDLIELGGNRTLTLQVKKRIRLEGEELEEYQK 147
PG A +I+ GN+ L + + L+G+E+ YQ+
Sbjct: 456 PGEDAPSTTVIDASGNQA-PLDTVRTVALQGDEVPLYQQ 493
>gi|407851025|gb|EKG05159.1| cleavage and polyadenylation specificity factor, putative
[Trypanosoma cruzi]
Length = 762
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 31/88 (35%), Positives = 46/88 (52%), Gaps = 6/88 (6%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKV--PSPKVVLVSTPDMECGFSRDL 91
+ M+D++ + RN PF FK++ HSL E P VVL S ++ G S +L
Sbjct: 310 VSAMNDRVKQQHANHRN-PFVFKYI---HSLMETRSFEDTGPCVVLASPGMLQSGISLEL 365
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLI 119
F +WC N III GT+A+D++
Sbjct: 366 FERWCGDRRNGIIIAGYCVDGTIAKDIL 393
>gi|323453344|gb|EGB09216.1| hypothetical protein AURANDRAFT_71470 [Aureococcus anophagefferens]
Length = 1101
Score = 50.8 bits (120), Expect = 6e-04, Method: Composition-based stats.
Identities = 26/70 (37%), Positives = 40/70 (57%), Gaps = 1/70 (1%)
Query: 50 NNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRT 109
+NPF F HV+ S+ +L P VVL + ++ G SR LF +W SS N ++I +
Sbjct: 198 SNPFKFDHVQNLASIDDLDD-SGPVVVLAAPGMLQSGVSRQLFDRWASSERNGVVIAGYS 256
Query: 110 SPGTLARDLI 119
GTLA+ ++
Sbjct: 257 VEGTLAKQIL 266
>gi|71005902|ref|XP_757617.1| hypothetical protein UM01470.1 [Ustilago maydis 521]
gi|74703664|sp|Q4PEJ3.1|YSH1_USTMA RecName: Full=Endoribonuclease YSH1; AltName: Full=mRNA
3'-end-processing protein YSH1
gi|46097110|gb|EAK82343.1| hypothetical protein UM01470.1 [Ustilago maydis 521]
Length = 880
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 28/86 (32%), Positives = 46/86 (53%), Gaps = 2/86 (2%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I M+D + F R+NPF FKH+ SL + P V++ S M+ G SR+L
Sbjct: 302 IHTMNDHIRTRF-NRRDNPFVFKHISNLRSLEKFEDR-GPCVMMASPGFMQSGVSRELLE 359
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
+W N +I++ + GT+AR+++
Sbjct: 360 RWAPDKRNGLIVSGYSVEGTMARNIL 385
>gi|388852694|emb|CCF53612.1| related to YSH1-component of pre-mRNA polyadenylation factor PF I
[Ustilago hordei]
Length = 888
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 28/86 (32%), Positives = 46/86 (53%), Gaps = 2/86 (2%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I M+D + F R+NPF FKH+ SL + P V++ S M+ G SR+L
Sbjct: 302 IHTMNDHIRTRFN-RRDNPFVFKHISNLRSLEKFEDR-GPCVMMASPGFMQSGVSRELLE 359
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
+W N +I++ + GT+AR+++
Sbjct: 360 RWAPDKRNGLIVSGYSVEGTMARNIL 385
>gi|50363261|gb|AAT75333.1| cleavage polyadenylation specificity factor CPSF73 [Trypanosoma
cruzi]
Length = 762
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 31/88 (35%), Positives = 46/88 (52%), Gaps = 6/88 (6%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKV--PSPKVVLVSTPDMECGFSRDL 91
+ M+D++ + RN PF FK++ HSL E P VVL S ++ G S +L
Sbjct: 310 VSAMNDRVKQQHANHRN-PFVFKYI---HSLMETRSFEDTGPCVVLASPGMLQSGISLEL 365
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLI 119
F +WC N III GT+A+D++
Sbjct: 366 FERWCGDRRNGIIIAGYCVDGTIAKDIL 393
>gi|299752177|ref|XP_001830756.2| mRNA 3'-end-processing protein YSH1 [Coprinopsis cinerea
okayama7#130]
gi|298409712|gb|EAU91125.2| mRNA 3'-end-processing protein YSH1 [Coprinopsis cinerea
okayama7#130]
Length = 846
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 31/75 (41%), Positives = 44/75 (58%), Gaps = 5/75 (6%)
Query: 49 RNNPFHFKHVK-LCHSLA---ELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSII 104
R+NPF FK++ L + ++A+ P P VVL S M+ G SR+LF W N +I
Sbjct: 298 RDNPFVFKYISNLPQTRGWEKKIAEGP-PCVVLASPGFMQVGPSRELFELWAPDARNGLI 356
Query: 105 ITNRTSPGTLARDLI 119
IT + GTLARD++
Sbjct: 357 ITGYSIEGTLARDIM 371
>gi|71661559|ref|XP_817799.1| cleavage and polyadenylation specificity factor [Trypanosoma cruzi
strain CL Brener]
gi|70883012|gb|EAN95948.1| cleavage and polyadenylation specificity factor, putative
[Trypanosoma cruzi]
Length = 625
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 31/88 (35%), Positives = 46/88 (52%), Gaps = 6/88 (6%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKV--PSPKVVLVSTPDMECGFSRDL 91
+ M+D++ + RN PF FK++ HSL E P VVL S ++ G S +L
Sbjct: 173 VSAMNDRVKQQHANHRN-PFVFKYI---HSLMETRSFEDTGPCVVLASPGMLQSGISLEL 228
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLI 119
F +WC N III GT+A+D++
Sbjct: 229 FERWCGDRRNGIIIAGYCVDGTIAKDIL 256
>gi|169599735|ref|XP_001793290.1| hypothetical protein SNOG_02691 [Phaeosphaeria nodorum SN15]
gi|160705309|gb|EAT89422.2| hypothetical protein SNOG_02691 [Phaeosphaeria nodorum SN15]
Length = 957
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 45/194 (23%), Positives = 78/194 (40%), Gaps = 58/194 (29%)
Query: 4 KLMKSFEGARNNPFHFKHVKLCHSLAELAKIEWMSDKLMKSFE---------GARN---- 50
K K++ +R ++V+ + +EWM + ++K FE G RN
Sbjct: 313 KNAKAYLASRTGGATMRYVR--------SMLEWMGEGIVKEFEAASGAAEGQGQRNVRGA 364
Query: 51 ----------NPFHFKHVKLCHSLAELAKV---PSPKVVLVSTPDMECGFSRDLFFQWCS 97
PF F+H+ L A + ++ P+V+L S +E GFS+D S
Sbjct: 365 PGRDDGRGIRTPFDFQHITLLEKKARVTRMLNATEPRVILASDTSLEWGFSKDAIRSLAS 424
Query: 98 SPENSIIITNR-----TSPGTLARDLIELGGNRTL-------------------TLQVKK 133
+N +I+T R T L R L +L R++ ++ +
Sbjct: 425 DEKNLVILTERVGELGTQEKGLGRYLWDLWNERSVNSGDDSLDSTMVDVSGQQASISTVR 484
Query: 134 RIRLEGEELEEYQK 147
+ LEG+E+ YQ+
Sbjct: 485 TVALEGDEVPLYQQ 498
>gi|403419016|emb|CCM05716.1| predicted protein [Fibroporia radiculosa]
Length = 828
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 31/73 (42%), Positives = 39/73 (53%), Gaps = 3/73 (4%)
Query: 49 RNNPFHFKHVK-LCHSLAELAKVPS--PKVVLVSTPDMECGFSRDLFFQWCSSPENSIII 105
R+NPF FKH+ L H+ KV P VVL S + G SR+L W N III
Sbjct: 298 RDNPFVFKHISNLPHTRGWERKVADGPPCVVLASPGFVTVGASRELLEMWAPDSRNGIII 357
Query: 106 TNRTSPGTLARDL 118
T + GT+ARD+
Sbjct: 358 TGYSIEGTMARDI 370
>gi|407411604|gb|EKF33594.1| cleavage and polyadenylation specificity factor, putative
[Trypanosoma cruzi marinkellei]
Length = 763
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 31/88 (35%), Positives = 46/88 (52%), Gaps = 6/88 (6%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKV--PSPKVVLVSTPDMECGFSRDL 91
+ M+D++ + RN PF FK++ HSL E P VVL S ++ G S +L
Sbjct: 311 VSAMNDRVKQQHANHRN-PFVFKYI---HSLMETRSFEDTGPCVVLASPGMLQSGISLEL 366
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLI 119
F +WC N III GT+A+D++
Sbjct: 367 FERWCGDRRNGIIIAGYCVDGTIAKDIL 394
>gi|443899092|dbj|GAC76423.1| mRNA cleavage and polyadenylation factor II complex, BRR5
[Pseudozyma antarctica T-34]
Length = 884
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 28/86 (32%), Positives = 46/86 (53%), Gaps = 2/86 (2%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I M+D + F R+NPF FKH+ SL + P V++ S M+ G SR+L
Sbjct: 302 IHTMNDHIRTRFN-RRDNPFVFKHISNLRSLEKFEDR-GPCVMMASPGFMQSGVSRELLE 359
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
+W N +I++ + GT+AR+++
Sbjct: 360 RWAPDKRNGLIVSGYSVEGTMARNIL 385
>gi|343428147|emb|CBQ71677.1| related to YSH1-component of pre-mRNA polyadenylation factor PF I
[Sporisorium reilianum SRZ2]
Length = 878
Score = 50.8 bits (120), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 28/86 (32%), Positives = 46/86 (53%), Gaps = 2/86 (2%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I M+D + F R+NPF FKH+ SL + P V++ S M+ G SR+L
Sbjct: 302 IHTMNDHIRTRFN-RRDNPFVFKHISNLRSLEKFED-RGPCVMMASPGFMQSGVSRELLE 359
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
+W N +I++ + GT+AR+++
Sbjct: 360 RWAPDKRNGLIVSGYSVEGTMARNIL 385
>gi|325186851|emb|CCA21396.1| cleavage and polyadenylation specific factor 3 puta [Albugo
laibachii Nc14]
Length = 759
Score = 50.8 bits (120), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 45/86 (52%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I M+D++ K A +NPF F H+ S+ + P VV+ S ++ G SR LF
Sbjct: 292 INMMNDRIRKQI--AVSNPFLFDHISNLKSMDDFDD-SGPCVVMASPGMLQSGVSRQLFE 348
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
+WCS N+ +I GTLA+ ++
Sbjct: 349 RWCSDKRNACLIPGYVVEGTLAKKIL 374
>gi|402080824|gb|EJT75969.1| hypothetical protein GGTG_05894 [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 974
Score = 50.8 bits (120), Expect = 7e-04, Method: Composition-based stats.
Identities = 49/179 (27%), Positives = 71/179 (39%), Gaps = 57/179 (31%)
Query: 26 HSLAELAK--IEWMSDKLMKSFE------GARNN------------PFHFKHVKLCHSLA 65
+S LAK EWM D +++ FE G R N PF FK++++ A
Sbjct: 321 NSTIRLAKSMFEWMDDSIVQEFEAVADQGGKRTNGNTDGGRGRDAGPFDFKYLRVLDRKA 380
Query: 66 ELAKVPSP---------KVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPG---- 112
++ KV S KV+L S +E GFS+D+ + N +I+T + S
Sbjct: 381 QVEKVLSQSSTPNELRGKVILASDTSLEWGFSKDVMARIADDSRNLVILTEKPSASSKEK 440
Query: 113 -TLARDLIE-----------------------LGGNRTLTLQVKKRIRLEGEELEEYQK 147
++AR L E G R L L KR L+ EL YQ+
Sbjct: 441 PSVARQLWEWWKERRDGTADEASSGPGSAEQIYAGGRELRLMDSKRAALDESELPLYQQ 499
>gi|340383473|ref|XP_003390242.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
3-like [Amphimedon queenslandica]
Length = 726
Score = 50.8 bits (120), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 23/70 (32%), Positives = 37/70 (52%), Gaps = 1/70 (1%)
Query: 50 NNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRT 109
+NPF FKH+ ++ + P V+L S M+ G SR LF WC+ N +++
Sbjct: 351 SNPFVFKHISSLKNIDNFDDI-GPCVILASPGMMQSGLSRQLFESWCTDKRNGVVVAGYC 409
Query: 110 SPGTLARDLI 119
GTLA+ ++
Sbjct: 410 VEGTLAKHIL 419
>gi|402696937|gb|AFQ90657.1| 73kDa cleavage and polyadenylation specific factor 3, partial
[Dibamus sp. JJF-2012]
Length = 220
Score = 50.8 bits (120), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 28/77 (36%), Positives = 38/77 (49%), Gaps = 3/77 (3%)
Query: 37 MSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWC 96
M+DK+ K NNPF KH+ S+ + P VV+ S M+ G SR+LF WC
Sbjct: 147 MNDKIRKXX--XINNPFVJKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFESWC 203
Query: 97 SSPENSIIITNRTSPGT 113
+ N +II GT
Sbjct: 204 TDKRNGVIIAGYCVEGT 220
>gi|71654879|ref|XP_816051.1| cleavage and polyadenylation specificity factor [Trypanosoma cruzi
strain CL Brener]
gi|70881152|gb|EAN94200.1| cleavage and polyadenylation specificity factor, putative
[Trypanosoma cruzi]
Length = 430
Score = 50.8 bits (120), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 31/88 (35%), Positives = 46/88 (52%), Gaps = 6/88 (6%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKV--PSPKVVLVSTPDMECGFSRDL 91
+ M+D++ + RN PF FK++ HSL E P VVL S ++ G S +L
Sbjct: 310 VSAMNDRVKQQHANHRN-PFVFKYI---HSLMETRSFEDTGPCVVLASPGMLQSGISLEL 365
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLI 119
F +WC N III GT+A+D++
Sbjct: 366 FERWCGDRRNGIIIAGYCVDGTIAKDIL 393
>gi|330920784|ref|XP_003299151.1| hypothetical protein PTT_10086 [Pyrenophora teres f. teres 0-1]
gi|311327303|gb|EFQ92764.1| hypothetical protein PTT_10086 [Pyrenophora teres f. teres 0-1]
Length = 953
Score = 50.4 bits (119), Expect = 7e-04, Method: Composition-based stats.
Identities = 39/159 (24%), Positives = 67/159 (42%), Gaps = 46/159 (28%)
Query: 34 IEWMSDKLMKSFEGARNN-----------------PFHFKHVKLCHSLAELAKV---PSP 73
+EWM + ++K FE + + PF F+H+ L +A++ P
Sbjct: 336 LEWMEEGIVKEFEASAADQDRRTKGGKEDERVAKVPFDFRHITLLERKTRVARMLAGAGP 395
Query: 74 KVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRT-----------------------S 110
+V+L S +E GFS+D S +N +I+T R+ S
Sbjct: 396 RVILASDATLEWGFSKDAIRTLASDEKNLVILTERSGELGSQKKGLGRYLWDLWNQRNAS 455
Query: 111 PGTLA--RDLIELGGNRTLTLQVKKRIRLEGEELEEYQK 147
PG A +I+ GN+ L + + L+G+E+ YQ+
Sbjct: 456 PGEDAPSTTVIDASGNQA-PLDTIRTVALQGDEVPLYQQ 493
>gi|255934198|ref|XP_002558380.1| Pc12g15810 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211582999|emb|CAP81208.1| Pc12g15810 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 893
Score = 50.4 bits (119), Expect = 8e-04, Method: Composition-based stats.
Identities = 28/96 (29%), Positives = 43/96 (44%), Gaps = 23/96 (23%)
Query: 34 IEWMSDKLMKSFEGARN--------------------NPFHFKHVKLCHSLAELAKV--- 70
+EWM + +++ FE A + PF FKH+K+ L K+
Sbjct: 344 LEWMDENIVREFEAAESADVTNGQRTGGQDKSTSKGGGPFTFKHLKIIERKKRLEKLLAE 403
Query: 71 PSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIIT 106
P PKV+L S M+ GFS+ Q P N +++T
Sbjct: 404 PGPKVILASDTSMDWGFSKHALRQVAEGPNNLLLMT 439
>gi|307110126|gb|EFN58363.1| hypothetical protein CHLNCDRAFT_142438 [Chlorella variabilis]
Length = 709
Score = 50.4 bits (119), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 29/88 (32%), Positives = 42/88 (47%), Gaps = 17/88 (19%)
Query: 32 AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
A IE M++ + ++F A NPF FKH+ S A + G SR+L
Sbjct: 297 AYIEMMNEDIKRAFTVA--NPFEFKHISHLKSAAHFD---------------DSGMSREL 339
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLI 119
F WC N ++I + GTLARD++
Sbjct: 340 FEAWCEDARNCVVIADFAVQGTLARDIL 367
>gi|409044817|gb|EKM54298.1| hypothetical protein PHACADRAFT_146128 [Phanerochaete carnosa
HHB-10118-sp]
Length = 869
Score = 50.4 bits (119), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 31/74 (41%), Positives = 43/74 (58%), Gaps = 5/74 (6%)
Query: 49 RNNPFHFKHVK-LCHSLA---ELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSII 104
R+NPF FKH+ + HS ++A+ PS VVL S ME G SR+L W N +I
Sbjct: 299 RDNPFVFKHISNVPHSRGWERKIAEGPS-CVVLASPGFMESGPSRELLELWAPDSRNGVI 357
Query: 105 ITNRTSPGTLARDL 118
+T + GT+ARD+
Sbjct: 358 LTGYSIEGTMARDI 371
>gi|328773999|gb|EGF84036.1| hypothetical protein BATDEDRAFT_9083 [Batrachochytrium
dendrobatidis JAM81]
Length = 669
Score = 50.4 bits (119), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 24/70 (34%), Positives = 37/70 (52%), Gaps = 1/70 (1%)
Query: 50 NNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRT 109
+NPF FKH+ S+A+ V P V++ S ++ G SR+L WC N +II
Sbjct: 347 SNPFQFKHISNLKSIAQFDDV-GPCVMMASPGMLQSGLSRELLELWCVDKRNGVIIPGYV 405
Query: 110 SPGTLARDLI 119
GTL + ++
Sbjct: 406 VEGTLGKQIL 415
>gi|403337788|gb|EJY68117.1| Integrator complex subunit 11 [Oxytricha trifallax]
Length = 771
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 31/110 (28%), Positives = 54/110 (49%), Gaps = 13/110 (11%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W ++K+ K+F N F F+HVK + + K P V S + G+S +F
Sbjct: 300 INWTNEKIKKTFT-KNQNMFQFQHVKTLDTAS--IKSDQPMVCFASPGMLHGGYSLQIFK 356
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELE 143
W +N++II PGT+ GN+ L+ + K+I ++G +++
Sbjct: 357 DWAGQEKNTLIIPGYCMPGTV--------GNKLLSGE--KKINIDGRDID 396
>gi|115479027|ref|NP_001063107.1| Os09g0397900 [Oryza sativa Japonica Group]
gi|50252615|dbj|BAD28786.1| putative FEG protein [Oryza sativa Japonica Group]
gi|113631340|dbj|BAF25021.1| Os09g0397900 [Oryza sativa Japonica Group]
gi|218202115|gb|EEC84542.1| hypothetical protein OsI_31281 [Oryza sativa Indica Group]
gi|222641522|gb|EEE69654.1| hypothetical protein OsJ_29268 [Oryza sativa Japonica Group]
Length = 559
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 31/87 (35%), Positives = 43/87 (49%), Gaps = 6/87 (6%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLF 92
I W S K+ S+ +NPF FKHV CH P P VL +TP M GFS ++F
Sbjct: 285 IGWTSQKIKNSY--TVHNPFDFKHV--CHFERSFINNPGP-CVLFATPGMISGGFSLEVF 339
Query: 93 FQWCSSPENSIIITNRTSPGTLARDLI 119
+W S +N + + GT+ L+
Sbjct: 340 KKWAPSEKNLVTLPGYCVAGTIGHKLM 366
>gi|400602286|gb|EJP69888.1| RNA-metabolising metallo-beta-lactamase [Beauveria bassiana ARSEF
2860]
Length = 962
Score = 50.1 bits (118), Expect = 0.001, Method: Composition-based stats.
Identities = 46/167 (27%), Positives = 71/167 (42%), Gaps = 55/167 (32%)
Query: 34 IEWMSDKLMKSFEG-----ARNN---------PFHFKHVKLCHSLAELAKVPSP------ 73
+EWM D +++ FE R N PF F+ ++L A++AK+ +
Sbjct: 332 LEWMDDSIVQEFEAFAEGQKRTNGNSDKKVGGPFDFRFMRLLDRKAQIAKLLTTAVNNGE 391
Query: 74 ---KVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPG-------TLARDLIEL-- 121
+V+L S M+ GFS+DL S N +I+T++ PG ++AR L L
Sbjct: 392 SRGRVILASDTCMDWGFSKDLLRGLASDANNVVILTDK--PGLAQENVPSIARTLWSLWK 449
Query: 122 ---------------------GGNRTLTLQVKKRIRLEGEELEEYQK 147
G+ L ++ KR LEGEEL YQ+
Sbjct: 450 ERKDGVTTVQTTTGDDLEYVNTGSYDLEIRDAKREPLEGEELTIYQQ 496
>gi|452004821|gb|EMD97277.1| hypothetical protein COCHEDRAFT_1163978 [Cochliobolus
heterostrophus C5]
Length = 948
Score = 50.1 bits (118), Expect = 0.001, Method: Composition-based stats.
Identities = 42/158 (26%), Positives = 68/158 (43%), Gaps = 45/158 (28%)
Query: 34 IEWMSDKLMKSFEGA------RNN----------PFHFKHVKLCHSLAELAKVPS---PK 74
+EWM + ++K FE + RN PF F+HV L +A++ + P+
Sbjct: 335 LEWMEEGIVKEFEASAADQDRRNKGGKDEDRAKIPFDFRHVTLLERKTRVARMLAADGPR 394
Query: 75 VVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR-----------------------TSP 111
V+L S +E GFS+D S +N +I+T R TSP
Sbjct: 395 VILASDTTLEWGFSKDALRSLASDEKNLVILTERSGELGEKRKGLGRYLWDLWNQRNTSP 454
Query: 112 GTLA--RDLIELGGNRTLTLQVKKRIRLEGEELEEYQK 147
G A +I G++ + + + LEG+E+ YQ+
Sbjct: 455 GQDAPSTTVINASGDQ-IPWKTVRAAALEGDEVPLYQQ 491
>gi|353239750|emb|CCA71648.1| related to YSH1-component of pre-mRNA polyadenylation factor PF I
[Piriformospora indica DSM 11827]
Length = 756
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 37/116 (31%), Positives = 57/116 (49%), Gaps = 10/116 (8%)
Query: 12 ARNNPFHFKHVKLCHSLAE------LAKIEWMSDKLMKSFEGARNNPFHFKHV-KLCHSL 64
+RN H + SLA+ I M+ K+ + ++ + NPF FK++ L +
Sbjct: 255 SRNPDLHGVPIYYVSSLAKKCMAVYQTNISSMNSKIQERWK-KQENPFVFKYITNLPQTR 313
Query: 65 AELAKVPS--PKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDL 118
KV P VVL S M+ G SR+L W P N++I+T + GT+ARD+
Sbjct: 314 GAEKKVAEGPPCVVLASPGFMDNGSSRELLELWAPDPRNAVIVTGYSVEGTMARDI 369
>gi|354543719|emb|CCE40441.1| hypothetical protein CPAR2_104770 [Candida parapsilosis]
Length = 776
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 26/70 (37%), Positives = 44/70 (62%), Gaps = 3/70 (4%)
Query: 52 PFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFFQWCSSPENSIIITNRTS 110
PF FK++K +L++ + + P VV V+TP M + G SR L +W +N +I+T +
Sbjct: 322 PFDFKYIKSIRNLSKFSDL-GPSVV-VATPGMLQAGVSRQLLEKWAPEQKNLVILTGYSV 379
Query: 111 PGTLARDLIE 120
GT+A+DL++
Sbjct: 380 EGTMAKDLLK 389
>gi|451853389|gb|EMD66683.1| hypothetical protein COCSADRAFT_35187 [Cochliobolus sativus ND90Pr]
Length = 948
Score = 50.1 bits (118), Expect = 0.001, Method: Composition-based stats.
Identities = 32/111 (28%), Positives = 51/111 (45%), Gaps = 28/111 (25%)
Query: 34 IEWMSDKLMKSFEGA------RNN----------PFHFKHVKLCHSLAELAKVPS---PK 74
+EWM + ++K FE + RN PF F+H+ L +A++ + P+
Sbjct: 335 LEWMEEGIVKEFEASAADQDRRNKGGKDEDRAKIPFDFRHITLLERKTRVARMLAADGPR 394
Query: 75 VVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNR 125
V+L S +E GFS+D S +N +I+T R+ ELG R
Sbjct: 395 VILASDTTLEWGFSKDALRSLASDEKNLVILTERSG---------ELGAQR 436
>gi|403346510|gb|EJY72653.1| putative cleavage and polyadenylation specificity factor subunit 2
[Oxytricha trifallax]
Length = 853
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 54/100 (54%), Gaps = 7/100 (7%)
Query: 36 WMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVP-SPKVVLVSTPDMECGFSRDLFFQ 94
WM+ + K F+ NP +F+ VK +L E K+ SP++++ S G+++ L ++
Sbjct: 347 WMNFQDNKVFQDIDENPINFQFVKDIFTLDEYRKLEHSPRIIVTSLASFSQGYTKQLIYE 406
Query: 95 WCSSPENSIIITNRTSP----GTLARDLIELGGNRTLTLQ 130
+ P+N I+ +SP +LA +L L G + +T++
Sbjct: 407 FSQVPKNEIVFLQTSSPQVFKDSLAYNL--LRGQKEITMK 444
>gi|145478255|ref|XP_001425150.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124392218|emb|CAK57752.1| unnamed protein product [Paramecium tetraurelia]
Length = 690
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 23/69 (33%), Positives = 35/69 (50%)
Query: 51 NPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTS 110
N F F H+ L ++ PKVV+ S ++ G S+ ++ WC +N +IIT
Sbjct: 297 NLFKFNHINTKKHLQDILNNQKPKVVMASPGLLQSGHSKQIYEYWCKDEKNQVIITGPAV 356
Query: 111 PGTLARDLI 119
GT+A LI
Sbjct: 357 QGTIAHQLI 365
>gi|357158307|ref|XP_003578085.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
3-II-like [Brachypodium distachyon]
Length = 553
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 31/87 (35%), Positives = 43/87 (49%), Gaps = 6/87 (6%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLF 92
I W S K+ S+ NPF FKHV CH P P VL +TP M GFS ++F
Sbjct: 286 IGWTSQKIKDSY--TVQNPFDFKHV--CHFERSFINDPGP-CVLFATPGMISGGFSLEVF 340
Query: 93 FQWCSSPENSIIITNRTSPGTLARDLI 119
+W +S +N + + GT+ L+
Sbjct: 341 KRWATSDKNLVTLPGYCVAGTIGHKLM 367
>gi|330842661|ref|XP_003293292.1| hypothetical protein DICPUDRAFT_158104 [Dictyostelium purpureum]
gi|325076396|gb|EGC30185.1| hypothetical protein DICPUDRAFT_158104 [Dictyostelium purpureum]
Length = 789
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 26/86 (30%), Positives = 45/86 (52%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I M+D++ F+ + NPF FK++ + P V + S ++ G SR LF
Sbjct: 308 INMMNDRVRAQFDVS--NPFEFKYISNIKGIESFDD-NGPCVFMASPGMLQSGLSRQLFE 364
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
+WC+S N ++I + GTLA+ ++
Sbjct: 365 RWCTSKRNGVVIPGYSVEGTLAKHIM 390
>gi|72387720|ref|XP_844284.1| cleavage and polyadenylation specificity factor subunit
[Trypanosoma brucei brucei strain 927/4 GUTat10.1]
gi|62359436|gb|AAX79873.1| cleavage and polyadenylation specificity factor subunit, putative
[Trypanosoma brucei]
gi|70800817|gb|AAZ10725.1| cleavage and polyadenylation specificity factor subunit, putative
[Trypanosoma brucei brucei strain 927/4 GUTat10.1]
Length = 770
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 30/88 (34%), Positives = 47/88 (53%), Gaps = 6/88 (6%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKV--PSPKVVLVSTPDMECGFSRDL 91
+ M+D++ K E RN PF FK+++ SL + P VVL S ++ G S +L
Sbjct: 311 VSAMNDRVKKQHENHRN-PFVFKYIQ---SLLDTRSFEDTGPCVVLASPGMLQSGISLEL 366
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLI 119
F +WC N II+ GT+A+D++
Sbjct: 367 FERWCGDKRNGIIVAGYCVDGTIAKDIL 394
>gi|340521586|gb|EGR51820.1| predicted protein [Trichoderma reesei QM6a]
Length = 887
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 26/87 (29%), Positives = 48/87 (55%), Gaps = 3/87 (3%)
Query: 35 EWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFF 93
E M++ +N P+ FK+++ +L V V++++P M + G SR+LF
Sbjct: 352 ERMAEAEASGDSAGKNGPWDFKYIRSLKNLDRFDDVGG--CVMLASPGMLQNGVSRELFE 409
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIE 120
+W S +N +IIT + GT+AR +++
Sbjct: 410 RWAPSEKNGVIITGYSVEGTMARQIMQ 436
>gi|261327437|emb|CBH10412.1| cleavage and polyadenylation specificity factor subunit, putative
[Trypanosoma brucei gambiense DAL972]
Length = 770
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 30/88 (34%), Positives = 47/88 (53%), Gaps = 6/88 (6%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKV--PSPKVVLVSTPDMECGFSRDL 91
+ M+D++ K E RN PF FK+++ SL + P VVL S ++ G S +L
Sbjct: 311 VSAMNDRVKKQHENHRN-PFVFKYIQ---SLLDTRSFEDTGPCVVLASPGMLQSGISLEL 366
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLI 119
F +WC N II+ GT+A+D++
Sbjct: 367 FERWCGDKRNGIIVAGYCVDGTIAKDIL 394
>gi|32564696|ref|NP_495706.2| Protein F10B5.8 [Caenorhabditis elegans]
gi|26985793|emb|CAB54223.2| Protein F10B5.8 [Caenorhabditis elegans]
Length = 608
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 48/149 (32%), Positives = 63/149 (42%), Gaps = 19/149 (12%)
Query: 12 ARNNPFHFKHVKLCHSLAELAK------IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLA 65
A N P +F LAE A I W ++ + K+F N F FKH+K
Sbjct: 265 ALNVPIYFSQ-----GLAERANQYYRLFISWTNENIKKTF--VERNMFEFKHIKPMEKGC 317
Query: 66 ELAKVPSPKVVLVSTPDM-ECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGN 124
E P P+V L STP M G S +F +WCS P N II+ GT+ +I G
Sbjct: 318 E--DQPGPQV-LFSTPGMLHGGQSLKVFKKWCSDPLNMIIMPGYCVAGTVGARVIN-GEK 373
Query: 125 RTLTLQVKKRIRLEGEELEEYQKKKDKEA 153
+ Q IRL G E + D +
Sbjct: 374 KIEIDQKMHEIRL-GVEYMSFSAHADAKG 401
>gi|395332776|gb|EJF65154.1| Metallo-hydrolase/oxidoreductase [Dichomitus squalens LYAD-421 SS1]
Length = 809
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 34/92 (36%), Positives = 49/92 (53%), Gaps = 9/92 (9%)
Query: 49 RNNPFHFKHVKLCHSLA----ELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSII 104
R+NPF FKH+ ++A+ P P VVL S M G SR+L W S +N I
Sbjct: 298 RDNPFVFKHITNVPGTRGWERKIAEGP-PCVVLASPGFMNSGPSRELLELWASDSKNGCI 356
Query: 105 ITNRTSPGTLARDLI----ELGGNRTLTLQVK 132
+T + GT+ARD++ E+ G + T+Q K
Sbjct: 357 VTGYSVEGTMARDILNEPSEIVGMKGNTIQRK 388
>gi|367047989|ref|XP_003654374.1| hypothetical protein THITE_2117338 [Thielavia terrestris NRRL 8126]
gi|347001637|gb|AEO68038.1| hypothetical protein THITE_2117338 [Thielavia terrestris NRRL 8126]
Length = 1015
Score = 49.7 bits (117), Expect = 0.001, Method: Composition-based stats.
Identities = 40/167 (23%), Positives = 66/167 (39%), Gaps = 53/167 (31%)
Query: 34 IEWMSDKLMKSFE-----------------GARNNPFHFKHVKLCHSLAELAKV------ 70
+EWM D +++ FE G PF FK+++L A++ ++
Sbjct: 331 LEWMDDSIVREFEAVAGGTRGANSGAGGGKGRDAGPFDFKYLRLLERKAQVERILQQEAG 390
Query: 71 ---PSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPG----TLARDLIE--- 120
P KV+L + +E GFS+++ N +++T + + ++AR L E
Sbjct: 391 DSEPKGKVILATDASLEWGFSKEVLKAIAGDARNLVVLTEKPNLSHGRTSIARTLWEWWK 450
Query: 121 --------------------LGGNRTLTLQVKKRIRLEGEELEEYQK 147
GG R L L R LEG+EL YQ+
Sbjct: 451 ERKDGVAVEQTSSGDTFEQVYGGGRELELTETTRQALEGDELGLYQQ 497
>gi|336373839|gb|EGO02177.1| hypothetical protein SERLA73DRAFT_86401 [Serpula lacrymans var.
lacrymans S7.3]
gi|336386654|gb|EGO27800.1| hypothetical protein SERLADRAFT_447017 [Serpula lacrymans var.
lacrymans S7.9]
Length = 930
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 37/128 (28%), Positives = 61/128 (47%), Gaps = 24/128 (18%)
Query: 53 FHFKHVKL---CHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRT 109
FKH++ +L + PK++L + G SR LF + P+N +++T+R
Sbjct: 384 LRFKHLEFFPNPQALLQTYSSKDPKLILAVPASLSHGPSRLLFSDFAVVPDNVVLLTSRG 443
Query: 110 SPGTLARDLIE----------------LGGN----RTLTLQVKKRIRLEGEELEEYQKKK 149
GTL R L + +G N T+ L++ +I L+G ELEEY K
Sbjct: 444 EEGTLGRILFDKWNDSQRADDKWDKGKIGSNIMMDGTMKLKINSKIPLQGAELEEYL-AK 502
Query: 150 DKEAKDKQ 157
++ AK+K+
Sbjct: 503 ERVAKEKE 510
>gi|358378169|gb|EHK15851.1| hypothetical protein TRIVIDRAFT_65314 [Trichoderma virens Gv29-8]
Length = 873
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 26/87 (29%), Positives = 48/87 (55%), Gaps = 3/87 (3%)
Query: 35 EWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFF 93
E M++ +N P+ FK+++ +L V V++++P M + G SR+LF
Sbjct: 330 ERMAEAEASGDAAGKNGPWDFKYIRSLKNLDRFDDVGG--CVMLASPGMLQNGVSRELFE 387
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIE 120
+W S +N +IIT + GT+AR +++
Sbjct: 388 RWAPSEKNGVIITGYSVEGTMARQIMQ 414
>gi|409049761|gb|EKM59238.1| hypothetical protein PHACADRAFT_249539 [Phanerochaete carnosa
HHB-10118-sp]
Length = 951
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 37/136 (27%), Positives = 65/136 (47%), Gaps = 27/136 (19%)
Query: 53 FHFKHVKLCHSLAELAKVPS---PKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRT 109
F H+++ + A + + S PK++L + G SR LF ++ P+N +++T R
Sbjct: 394 LRFPHLEIFPNPAAMMQRYSSKDPKLILAVPSSLSHGPSRALFSEFAEIPDNVVLLTGRG 453
Query: 110 SPGTLARDLIE----------------LGGN----RTLTLQVKKRIRLEGEELEEY---- 145
GTL R L E +G N TL L++ ++ L+G ELEE+
Sbjct: 454 EEGTLGRILFERWDNSQRDDTKWDRGKIGNNVMMDGTLHLKISSKVPLQGAELEEHLARE 513
Query: 146 QKKKDKEAKDKQEKEK 161
+ K++EA K +++
Sbjct: 514 RAAKEREAAKKAAEDR 529
>gi|358396914|gb|EHK46289.1| hypothetical protein TRIATDRAFT_132454 [Trichoderma atroviride IMI
206040]
Length = 881
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 27/79 (34%), Positives = 48/79 (60%), Gaps = 4/79 (5%)
Query: 44 SFEGA-RNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFFQWCSSPEN 101
S +GA +N P+ FK+++ +L V V++++P M + G SR+LF +W S +N
Sbjct: 334 SGDGAGKNGPWDFKYIRSLKNLDRFDDVGG--CVMLASPGMLQNGVSRELFERWAPSEKN 391
Query: 102 SIIITNRTSPGTLARDLIE 120
+IIT + GT+AR +++
Sbjct: 392 GVIITGYSVEGTMARQIMQ 410
>gi|242053629|ref|XP_002455960.1| hypothetical protein SORBIDRAFT_03g028040 [Sorghum bicolor]
gi|241927935|gb|EES01080.1| hypothetical protein SORBIDRAFT_03g028040 [Sorghum bicolor]
Length = 558
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 33/87 (37%), Positives = 42/87 (48%), Gaps = 6/87 (6%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLF 92
I W S K+ S A +NPF FKHV CH P P VL +TP M GFS + F
Sbjct: 285 IGWTSQKIKDSH--AVHNPFDFKHV--CHFERSFINNPGP-CVLFATPGMISGGFSLEAF 339
Query: 93 FQWCSSPENSIIITNRTSPGTLARDLI 119
+W S +N I + GT+ L+
Sbjct: 340 KKWAPSEKNLITLPGYCVSGTIGHKLM 366
>gi|357440001|ref|XP_003590278.1| Cleavage and polyadenylation specificity factor subunit [Medicago
truncatula]
gi|355479326|gb|AES60529.1| Cleavage and polyadenylation specificity factor subunit [Medicago
truncatula]
Length = 196
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 30/113 (26%), Positives = 48/113 (42%), Gaps = 30/113 (26%)
Query: 164 PHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGFSRD 223
PH + + +L+L+DFKQ L G+ EF G L C V VR+V
Sbjct: 114 PHKSVLVGDLKLADFKQFLSTKGVPVEFAGGALRCGE-YVTVRKVG-------------- 158
Query: 224 LFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
+ S + I++EG L ++YY+++ LY Q+ +L
Sbjct: 159 ---------------DATQKGAGSGTQQIIIEGPLCEDYYKIRDYLYSQFYLL 196
>gi|68471691|ref|XP_720152.1| hypothetical protein CaO19.7957 [Candida albicans SC5314]
gi|68471954|ref|XP_720020.1| hypothetical protein CaO19.325 [Candida albicans SC5314]
gi|46441870|gb|EAL01164.1| hypothetical protein CaO19.325 [Candida albicans SC5314]
gi|46442007|gb|EAL01300.1| hypothetical protein CaO19.7957 [Candida albicans SC5314]
Length = 931
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 39/147 (26%), Positives = 67/147 (45%), Gaps = 24/147 (16%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECG-FSRDLF 92
++WMS K +E + PF+ V L +EL K+ PK+V S D+ G S + F
Sbjct: 286 LDWMSKSFTKEWEELSSVPFNPSKVDLLLDPSELLKLSGPKIVFCSGIDLRSGDISAEAF 345
Query: 93 FQWCSSPENSIIITNRTS----------------------PGTLARDLIELGGNRTLTLQ 130
C+ +II+T +T+ G + D I + ++ ++L+
Sbjct: 346 QYLCNDEHTTIILTEKTTMNFASSLSSVLYTEWDSLAKKRGGGESEDGIAVPIDKNISLK 405
Query: 131 -VKKRIRLEGEELEEYQKKKDKEAKDK 156
K + L G EL E+Q+K ++ K+K
Sbjct: 406 NWTKEVELTGTELTEFQEKVAQKRKEK 432
>gi|268530366|ref|XP_002630309.1| Hypothetical protein CBG00745 [Caenorhabditis briggsae]
Length = 637
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 43/136 (31%), Positives = 60/136 (44%), Gaps = 27/136 (19%)
Query: 12 ARNNPFHFKHVKLCHSLAELAK------IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLA 65
A N P +F LAE A I W ++ + K+F N F FKH++
Sbjct: 291 ALNVPIYFSQ-----GLAERANQYYRLFISWTNENIKKTF--VERNMFEFKHIRPMEKGC 343
Query: 66 ELAKVPSPKVVLVSTPDM-ECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGN 124
E P P+V L STP M G S +F +WCS P N II+ GT+ +I
Sbjct: 344 E--DQPGPQV-LFSTPGMLHGGQSLKVFKKWCSDPLNMIIMPGYCVAGTVGARVI----- 395
Query: 125 RTLTLQVKKRIRLEGE 140
+KRI ++G+
Sbjct: 396 -----NGEKRIEIDGK 406
>gi|238880762|gb|EEQ44400.1| conserved hypothetical protein [Candida albicans WO-1]
Length = 931
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 39/147 (26%), Positives = 67/147 (45%), Gaps = 24/147 (16%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECG-FSRDLF 92
++WMS K +E + PF+ V L +EL K+ PK+V S D+ G S + F
Sbjct: 286 LDWMSKSFTKEWEELSSVPFNPSKVDLLLDPSELLKLSGPKIVFCSGIDLRSGDISAEAF 345
Query: 93 FQWCSSPENSIIITNRTS----------------------PGTLARDLIELGGNRTLTLQ 130
C+ +II+T +T+ G + D I + ++ ++L+
Sbjct: 346 QYLCNDERTTIILTEKTTMNFASSLSSVLYTEWDSLAKKRGGGESEDGIAVPIDKNISLK 405
Query: 131 -VKKRIRLEGEELEEYQKKKDKEAKDK 156
K + L G EL E+Q+K ++ K+K
Sbjct: 406 NWTKEVELTGTELTEFQEKVAQKRKEK 432
>gi|341890123|gb|EGT46058.1| hypothetical protein CAEBREN_05882 [Caenorhabditis brenneri]
Length = 618
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 35/108 (32%), Positives = 52/108 (48%), Gaps = 16/108 (14%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLF 92
I W ++ + K+F N F FKH++ E +P P+V L STP M G S +F
Sbjct: 288 ISWTNENIKKTF--VERNMFEFKHIRPMEKGCE--DMPGPQV-LFSTPGMLHGGQSLKVF 342
Query: 93 FQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGE 140
+WCS P N II+ GT+ +I +KRI ++G+
Sbjct: 343 KKWCSDPINMIIMPGYCVAGTVGAKVI----------NGEKRIEIDGK 380
>gi|327288530|ref|XP_003228979.1| PREDICTED: integrator complex subunit 11-like [Anolis carolinensis]
Length = 600
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 29/105 (27%), Positives = 51/105 (48%), Gaps = 6/105 (5%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K A P P VV + + G S +F
Sbjct: 283 ITWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFK 338
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
+W + +N +I+ GT+ + LGG R L ++ ++ + ++
Sbjct: 339 KWAGNEKNMVIMPGYCVQGTVGHKI--LGGQRKLEMEGRQILEVK 381
>gi|149245028|ref|XP_001527048.1| hypothetical protein LELG_01877 [Lodderomyces elongisporus NRRL
YB-4239]
gi|146449442|gb|EDK43698.1| hypothetical protein LELG_01877 [Lodderomyces elongisporus NRRL
YB-4239]
Length = 812
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 25/70 (35%), Positives = 44/70 (62%), Gaps = 3/70 (4%)
Query: 52 PFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFFQWCSSPENSIIITNRTS 110
PF FK++K +L++ + + P VV V+TP M + G SR L +W +N +I+T +
Sbjct: 335 PFDFKYIKSIKNLSKFSDL-GPSVV-VATPGMLQAGVSRQLLEKWAPEQKNLVILTGYSV 392
Query: 111 PGTLARDLIE 120
GT+A+D+++
Sbjct: 393 EGTMAKDIMK 402
>gi|409079696|gb|EKM80057.1| hypothetical protein AGABI1DRAFT_72888 [Agaricus bisporus var.
burnettii JB137-S8]
gi|426198540|gb|EKV48466.1| hypothetical protein AGABI2DRAFT_220282 [Agaricus bisporus var.
bisporus H97]
Length = 919
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 34/128 (26%), Positives = 61/128 (47%), Gaps = 24/128 (18%)
Query: 53 FHFKHVKL---CHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRT 109
FKH++ +L + PK++L + G SR+LF + P+N +++T R
Sbjct: 384 LRFKHLEFFPNPQALLQTYSSKDPKLILAVPASLSHGPSRNLFVDFAVVPDNVVLLTGRG 443
Query: 110 SPGTLARDLI----------------ELGGNRTLT----LQVKKRIRLEGEELEEYQKKK 149
G+L R L ++G N L ++++ ++ L+G ELE Y ++
Sbjct: 444 EEGSLGRALFNKWNDRQRVDDKWDKGKIGSNIMLDGGFRMKMRSKVPLQGAELEAYL-QQ 502
Query: 150 DKEAKDKQ 157
+KE KDK+
Sbjct: 503 EKEKKDKE 510
>gi|123476407|ref|XP_001321376.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121904201|gb|EAY09153.1| hypothetical protein TVAG_363680 [Trichomonas vaginalis G3]
Length = 700
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 33/86 (38%), Positives = 45/86 (52%), Gaps = 1/86 (1%)
Query: 75 VVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKR 134
VV+ ST +E GFSR LF +S +N II T R P +LA L +RT +K R
Sbjct: 320 VVISSTDTLERGFSRKLFLDKANS-DNLIIFTQREPPYSLAEALRTNNAHRTFRFIIKHR 378
Query: 135 IRLEGEELEEYQKKKDKEAKDKQEKE 160
L GEEL ++ +K+ + E E
Sbjct: 379 EPLTGEELVKFMEKQSALQEKANEIE 404
>gi|341903207|gb|EGT59142.1| hypothetical protein CAEBREN_31222 [Caenorhabditis brenneri]
Length = 571
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 35/108 (32%), Positives = 52/108 (48%), Gaps = 16/108 (14%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLF 92
I W ++ + K+F N F FKH++ E +P P+V L STP M G S +F
Sbjct: 291 ISWTNENIKKTF--VERNMFEFKHIRPMEKGCE--DMPGPQV-LFSTPGMLHGGQSLKVF 345
Query: 93 FQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGE 140
+WCS P N II+ GT+ +I +KRI ++G+
Sbjct: 346 KKWCSDPINMIIMPGYCVAGTVGAKVI----------NGEKRIEIDGK 383
>gi|302679538|ref|XP_003029451.1| hypothetical protein SCHCODRAFT_59058 [Schizophyllum commune H4-8]
gi|300103141|gb|EFI94548.1| hypothetical protein SCHCODRAFT_59058 [Schizophyllum commune H4-8]
Length = 786
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 37/118 (31%), Positives = 55/118 (46%), Gaps = 11/118 (9%)
Query: 3 DKLMKSFEGARNNPFHFKHVKLCHSLAEL-AKIEWMSDKLMKSFEGARNNPFHFKHVKLC 61
D+ K N P ++ S+A I M+ + F R+NPF FK
Sbjct: 252 DEYWKKHPDLHNVPIYYASGLARKSMAVYQTYIHTMNSNIRSRF-AKRDNPFVFK----- 305
Query: 62 HSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLI 119
++A+ P P VVL + M+ G SR+LF W N +I+T + GTLARD++
Sbjct: 306 ---CKIAEGP-PCVVLATPGFMQTGSSRELFELWAPDSRNGLIVTGYSVEGTLARDIM 359
>gi|154322621|ref|XP_001560625.1| hypothetical protein BC1G_00653 [Botryotinia fuckeliana B05.10]
gi|347837188|emb|CCD51760.1| similar to cleavage and polyadenylation specifity factor
[Botryotinia fuckeliana]
Length = 828
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/87 (29%), Positives = 47/87 (54%), Gaps = 3/87 (3%)
Query: 35 EWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFF 93
E M++ S G R P+ FK+++ +L V V++++P M + G SR L
Sbjct: 315 ERMAEAEANSTSGGRGGPWDFKYIRSLKNLDRFDDVGG--CVILASPGMLQNGISRQLLE 372
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIE 120
+W S +N +IIT + GT+A+ +++
Sbjct: 373 RWAPSDKNGVIITGYSVEGTMAKQIMQ 399
>gi|281206064|gb|EFA80253.1| beta-lactamase domain-containing protein [Polysphondylium pallidum
PN500]
Length = 656
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/86 (30%), Positives = 44/86 (51%), Gaps = 3/86 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I M++++ F+ +NPF FKH++ + P V + S ++ G SR LF
Sbjct: 191 INMMNERIRAQFD--LSNPFSFKHIENISGIERFTD-DGPCVFMASPGMLQSGLSRQLFE 247
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
+WCS N ++I GTLA+ ++
Sbjct: 248 RWCSDKMNGVVIPGYNVEGTLAKHIM 273
>gi|169861678|ref|XP_001837473.1| cleavage and polyadenylation specificity factor subunit
[Coprinopsis cinerea okayama7#130]
gi|116501494|gb|EAU84389.1| cleavage and polyadenylation specificity factor subunit
[Coprinopsis cinerea okayama7#130]
Length = 926
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 35/132 (26%), Positives = 62/132 (46%), Gaps = 24/132 (18%)
Query: 53 FHFKHVKL---CHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRT 109
FKH++ +L + PK++L + G SR LF + + P+N +++T R
Sbjct: 391 LRFKHLEFFPNPQALLQRHSSKDPKLILAVPASLSHGPSRQLFADFAAVPDNVVLLTTRG 450
Query: 110 SPGTLARDLIE----------------LGGNRTLTLQVK----KRIRLEGEELEEYQKKK 149
+ GTL R L + +G N + +K ++ L+G ELEEY K
Sbjct: 451 AEGTLGRALFDKWNNSQRGDDKWDKGRIGRNVMMDGAIKIKMYHKVPLQGAELEEYL-AK 509
Query: 150 DKEAKDKQEKEK 161
++ AK+K+ ++
Sbjct: 510 ERAAKEKEAAQQ 521
>gi|358333242|dbj|GAA51791.1| cleavage and polyadenylation specificity factor subunit 3
[Clonorchis sinensis]
Length = 697
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 35/125 (28%), Positives = 53/125 (42%), Gaps = 25/125 (20%)
Query: 12 ARNNPFHFKHVK-----LCHSLAE-----LAKIEWMSDK-----LMKSFEGARNNPFHFK 56
A NNPF F+H+ +S++E LA W+ ++ + N P +
Sbjct: 242 ANNNPFCFRHISNLKAMRSYSISEQTEHALASKAWLYVAYSRFPVIGTVAAGTNVPTSIE 301
Query: 57 HVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLAR 116
H P VV+ S M+ G SR+LF WC+ N +II GTLA+
Sbjct: 302 HFDDS----------GPCVVMASPGMMQSGMSRELFENWCTDRRNGVIIAGYCVEGTLAK 351
Query: 117 DLIEL 121
++ L
Sbjct: 352 QILSL 356
>gi|294953157|ref|XP_002787623.1| hypothetical protein Pmar_PMAR012397 [Perkinsus marinus ATCC 50983]
gi|239902647|gb|EER19419.1| hypothetical protein Pmar_PMAR012397 [Perkinsus marinus ATCC 50983]
Length = 802
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 28/77 (36%), Positives = 39/77 (50%), Gaps = 1/77 (1%)
Query: 61 CHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLI 119
C ++ P V+++ P M + G SR+LF QW P+N +IIT + GTLA DL
Sbjct: 6 CKEWEDVVLAPGMPCVVMAAPGMLQSGTSRELFEQWAPDPKNGVIITGYSVSGTLAHDLQ 65
Query: 120 ELGGNRTLTLQVKKRIR 136
TLT K +R
Sbjct: 66 NDPDTLTLTDGRKLPVR 82
>gi|156064885|ref|XP_001598364.1| conserved hypothetical protein [Sclerotinia sclerotiorum 1980]
gi|154691312|gb|EDN91050.1| conserved hypothetical protein [Sclerotinia sclerotiorum 1980
UF-70]
Length = 820
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 26/87 (29%), Positives = 47/87 (54%), Gaps = 3/87 (3%)
Query: 35 EWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFF 93
E M++ S G R P+ FK+++ +L V V++++P M + G SR L
Sbjct: 315 ERMAEAEANSTSGGRGGPWDFKYIRSLKNLDRFDDVGG--CVILASPGMLQNGISRQLLE 372
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIE 120
+W S +N +IIT + GT+A+ +++
Sbjct: 373 RWAPSDKNGVIITGYSVEGTMAKQIMQ 399
>gi|145507230|ref|XP_001439570.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124406765|emb|CAK72173.1| unnamed protein product [Paramecium tetraurelia]
Length = 484
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 27/84 (32%), Positives = 43/84 (51%), Gaps = 4/84 (4%)
Query: 36 WMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQW 95
W S+K+ +SF +N F+FK+++ L K P V+L + + G S +F +W
Sbjct: 272 WESEKIQQSF--LDDNIFNFKYIQPFDR--NLIKSPLSMVLLATPGMLHGGLSMQVFKEW 327
Query: 96 CSSPENSIIITNRTSPGTLARDLI 119
C NS++I PGTL L+
Sbjct: 328 CGCANNSLVIPGYCVPGTLGNKLL 351
>gi|449016323|dbj|BAM79725.1| cleavage and polyadenylation specifity factor protein
[Cyanidioschyzon merolae strain 10D]
Length = 749
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 26/70 (37%), Positives = 37/70 (52%), Gaps = 1/70 (1%)
Query: 50 NNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRT 109
NPF FK+V SL E P V + S ++ G SR LF +WCS N +I+ +
Sbjct: 302 GNPFAFKYVMNIRSLDEFED-SGPCVFMASPGMLQSGMSRRLFEKWCSDRRNGVILPGYS 360
Query: 110 SPGTLARDLI 119
GTLA+ ++
Sbjct: 361 VQGTLAKYIL 370
>gi|443694305|gb|ELT95478.1| hypothetical protein CAPTEDRAFT_151615 [Capitella teleta]
Length = 600
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 29/105 (27%), Positives = 50/105 (47%), Gaps = 6/105 (5%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ +F + N F FKH+K + A P P VV + + G S +F
Sbjct: 283 ITWTNQKIKNTF--VQRNMFDFKHIKPFDKV--YADNPGPMVVFATPGMLHGGLSLQIFK 338
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
+WC +N +I+ GT+ + L G R + ++ K+ I ++
Sbjct: 339 KWCGGEKNMVIMPGYCVSGTIGWKI--LNGQRKIEMENKQIIEVK 381
>gi|342882935|gb|EGU83499.1| hypothetical protein FOXB_05909 [Fusarium oxysporum Fo5176]
Length = 950
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 41/165 (24%), Positives = 71/165 (43%), Gaps = 51/165 (30%)
Query: 34 IEWMSDKLMKSFE----------GARN----NPFHFKHVKLCHSLAELAKVPS------- 72
+EWM + +++ FE GA + PF FK+++L A++A++ S
Sbjct: 332 LEWMDESIVQEFEAFAEGQRKVNGANDKKEGGPFDFKYLRLLERKAQIARLLSQNPDNVS 391
Query: 73 --PKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITN-----RTSPGTLARDLIE----- 120
+V+L S +E GFS+DL N +I+T+ +T ++AR L +
Sbjct: 392 TEGRVILASDSSIEWGFSKDLIKGLARDSRNLVILTDKPGLSKTDKPSIARTLWDWWKER 451
Query: 121 ------------------LGGNRTLTLQVKKRIRLEGEELEEYQK 147
G R L ++ +R L+G+EL YQ+
Sbjct: 452 KDGVSVEQNANGENLEFVYAGGRELEIREAQRHALDGDELALYQQ 496
>gi|389601462|ref|XP_001565522.2| putative cleavage and polyadenylation specificity factor
[Leishmania braziliensis MHOM/BR/75/M2904]
gi|322505052|emb|CAM39016.2| putative cleavage and polyadenylation specificity factor
[Leishmania braziliensis MHOM/BR/75/M2904]
Length = 829
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 32/113 (28%), Positives = 53/113 (46%), Gaps = 13/113 (11%)
Query: 55 FKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSS----PENSIIITNRTS 110
F +V C S E+ + PK+ + ++ G S +L + ++ I++T
Sbjct: 310 FANVLTCRSAEEVLPIQGPKICVADGASLDFGPSAELLEYFVKGNRDGADHLIVLTEPPL 369
Query: 111 PGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEY---------QKKKDKEAK 154
PGT A + G L Q+ +R RL GEELEEY Q++++ EA+
Sbjct: 370 PGTNATVVTAAGDGERLHFQITRRSRLSGEELEEYYIDLEHDVEQRRRELEAQ 422
>gi|401885166|gb|EJT49292.1| cleavage and polyadenylation specificity factor subunit
[Trichosporon asahii var. asahii CBS 2479]
Length = 958
Score = 48.5 bits (114), Expect = 0.003, Method: Composition-based stats.
Identities = 43/142 (30%), Positives = 63/142 (44%), Gaps = 27/142 (19%)
Query: 47 GARNNPFHFKHVKLCHSLAEL-AKVPS--PKVVLVSTPDMECGFSRDLFFQWCSSPENSI 103
G+ F HV+ + EL K P+ PK+VL P M G SR LF S P N +
Sbjct: 338 GSEYGALDFSHVQFFATPEELLEKYPANRPKLVLAIPPTMSHGPSRTLFASMASVPGNVV 397
Query: 104 IITNRTSPGTLARDLIELG---------------GNRT-----LTLQVKKRIRLEGEELE 143
++T TLAR+L G+ T L L++ + L GEELE
Sbjct: 398 LLTGHGEDRTLARELYARWEAHQDEGAHYGHGKIGHATPMEGRLELELDAKEPLSGEELE 457
Query: 144 EYQ----KKKDKEAKDKQEKEK 161
Y+ +K+++EA + E+
Sbjct: 458 AYETAEREKREREAAHQAALER 479
>gi|358394479|gb|EHK43872.1| hypothetical protein TRIATDRAFT_79096 [Trichoderma atroviride IMI
206040]
Length = 957
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 43/165 (26%), Positives = 69/165 (41%), Gaps = 51/165 (30%)
Query: 34 IEWMSDKLMKSFE----GARNN----------PFHFKHVKLCHSLAELAKVPS------- 72
+EWM + +++ FE G R PF FK+++L A++AK+ S
Sbjct: 332 LEWMDNNIVQEFEAFAEGQRKTNGGSEKKEGAPFDFKYLRLLERKAQIAKLLSQSIENGE 391
Query: 73 --PKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR-----TSPGTLARDLIE----- 120
+V+L S M+ GFS+DL N +I+T R T +++R + E
Sbjct: 392 TQGRVILASDVSMDWGFSKDLIKGLAKDTRNLVILTERPSLANTDAPSISRMMWEWWKER 451
Query: 121 ------------------LGGNRTLTLQVKKRIRLEGEELEEYQK 147
G R L ++ +R LEG+EL YQ+
Sbjct: 452 RDGISTEHASNGDSLETIYSGGRELEVREARREPLEGDELAIYQQ 496
>gi|452819966|gb|EME27015.1| cleavage and polyadenylation specifity factor protein [Galdieria
sulphuraria]
Length = 717
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 37/120 (30%), Positives = 61/120 (50%), Gaps = 13/120 (10%)
Query: 23 KLCHSLAELAKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPD 82
K C S+ + I M+D + K +E + NPF FK+V ++ + P V + S
Sbjct: 288 KRCMSVYQ-TYINMMNDNIRKRYEVS--NPFAFKYVLNVKNIQDFDD-SGPCVFMASPGM 343
Query: 83 MECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEEL 142
++ G SR+L +WC+ N II+ + GTLA+ + L T+T RL+G+E+
Sbjct: 344 LQSGLSRELCERWCTDRRNGIILPGYSVEGTLAKHI--LSEPSTIT-------RLDGKEV 394
>gi|449549925|gb|EMD40890.1| hypothetical protein CERSUDRAFT_111471 [Ceriporiopsis subvermispora
B]
Length = 934
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 33/120 (27%), Positives = 55/120 (45%), Gaps = 23/120 (19%)
Query: 53 FHFKHVKL---CHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRT 109
F+H++ +L + PK++L + G SR LF Q+ P+N +++T R+
Sbjct: 387 LRFRHLEFFPNPQALMQTYSSKDPKLILAVPATLSHGPSRALFTQFAEMPDNVVLLTGRS 446
Query: 110 SPGTLARDLIE----------------LGGN----RTLTLQVKKRIRLEGEELEEYQKKK 149
GTL R L + +G N TL L++ ++ L+G ELE Y K+
Sbjct: 447 EEGTLGRILFDRWNAAQRDEAKWDRGKIGSNVMMDGTLRLKMNSKVPLQGAELEVYLAKE 506
>gi|294931418|ref|XP_002779869.1| hypothetical protein Pmar_PMAR016530 [Perkinsus marinus ATCC 50983]
gi|239889578|gb|EER11664.1| hypothetical protein Pmar_PMAR016530 [Perkinsus marinus ATCC 50983]
Length = 80
Score = 48.1 bits (113), Expect = 0.004, Method: Composition-based stats.
Identities = 26/69 (37%), Positives = 36/69 (52%), Gaps = 1/69 (1%)
Query: 61 CHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLI 119
C ++ P V+++ P M + G SR+LF QW P+N +IIT + GTLA DL
Sbjct: 6 CKEWEDVVLAPGMPCVVMAAPGMLQSGTSRELFEQWAPDPKNGVIITGYSVSGTLAHDLQ 65
Query: 120 ELGGNRTLT 128
TLT
Sbjct: 66 NDPDTLTLT 74
>gi|345563625|gb|EGX46611.1| hypothetical protein AOL_s00097g515 [Arthrobotrys oligospora ATCC
24927]
Length = 791
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 36/153 (23%), Positives = 65/153 (42%), Gaps = 20/153 (13%)
Query: 50 NNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFFQWCSSPENSIIITNR 108
NPF + V+ SL V V+++ P M + G SR+L +WC P+N +I+T
Sbjct: 330 TNPFEMRWVRSLKSLDRFDDVGG--CVMLAAPGMMQNGVSRELLERWCPDPKNGVILTGY 387
Query: 109 TSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKDKEAKDKQEKEKIPPHDTS 168
+ GTLA+ ++ E E++ ++K+ ++E +++
Sbjct: 388 SVEGTLAKSILN-----------------EPTEIQAFKKEHTSRRSGREEADRVMIPRRC 430
Query: 169 FINELQLSDFKQTLQRNGIDCEFMDGVLICCRG 201
I+EL + Q + E V+I G
Sbjct: 431 SIDELSFAAHVDYGQNSSFIEEVGAKVIILVHG 463
>gi|242220452|ref|XP_002475992.1| predicted protein [Postia placenta Mad-698-R]
gi|220724781|gb|EED78801.1| predicted protein [Postia placenta Mad-698-R]
Length = 825
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 39/121 (32%), Positives = 57/121 (47%), Gaps = 7/121 (5%)
Query: 3 DKLMKSFEGARNNPFHFKHVKLCHSLAELAK-IEWMSDKLMKSFEGARNNPFHFKHVK-L 60
D+ K +N P ++ S+A I M+ + F R+NPF FKH+ L
Sbjct: 243 DEYWKKHPDLQNVPIYYASSLARKSMAVYQTYIHTMNSNVRSRF-AKRDNPFVFKHISNL 301
Query: 61 CHSLA---ELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARD 117
S ++A+ P P VVL S M G SR+L W N +IIT + GT+AR+
Sbjct: 302 PQSKGWERKIAEGP-PCVVLASPGFMTSGASRELLELWAPDSRNGVIITGYSIEGTMARE 360
Query: 118 L 118
+
Sbjct: 361 I 361
>gi|313215108|emb|CBY42824.1| unnamed protein product [Oikopleura dioica]
Length = 323
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 39/171 (22%), Positives = 71/171 (41%), Gaps = 51/171 (29%)
Query: 117 DLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKDKEAKDKQEKEKIPP-----------H 165
+ + +G +++ R+ G LE + + + K E + IP H
Sbjct: 193 NFVRVGSKDIEVARIRGRVDYFGGRLELEAENGENDEPKKLEIDDIPTLQPVTNNYSSGH 252
Query: 166 DTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGFSRDLF 225
D+ FIN+ +L++ K L G+ EF+ G L+ C V+++R
Sbjct: 253 DSIFINDTKLTELKSNLIDCGMHAEFIGGNLV-CNNKVSIKR------------------ 293
Query: 226 FQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
S + VI +EG LS++Y+ V++++YD YAI+
Sbjct: 294 ---------------------SANGVIQVEGTLSEDYFIVRKMVYDNYAIV 323
>gi|254565077|ref|XP_002489649.1| Putative endoribonuclease [Komagataella pastoris GS115]
gi|238029445|emb|CAY67368.1| Putative endoribonuclease [Komagataella pastoris GS115]
Length = 784
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 23/73 (31%), Positives = 39/73 (53%), Gaps = 1/73 (1%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I M++ + K F NPF F+++K +L++ P VV+ S ++ G SR L
Sbjct: 291 INMMNENIRKKFRDTNKNPFQFQYIKNIKNLSKFDDF-QPSVVVASPGMLQNGVSRALLE 349
Query: 94 QWCSSPENSIIIT 106
+W P N++I+T
Sbjct: 350 KWAPDPRNTLIMT 362
>gi|401428833|ref|XP_003878899.1| cleavage and polyadenylation specificity factor,putative
[Leishmania mexicana MHOM/GT/2001/U1103]
gi|322495148|emb|CBZ30452.1| cleavage and polyadenylation specificity factor,putative
[Leishmania mexicana MHOM/GT/2001/U1103]
Length = 756
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 29/88 (32%), Positives = 46/88 (52%), Gaps = 6/88 (6%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVP--SPKVVLVSTPDMECGFSRDL 91
+ M+D++ K +NPF FK++ HSL + P VVL S ++ G S +L
Sbjct: 298 VSAMNDRV-KQQHANHHNPFVFKYI---HSLMDTKSFEDNGPCVVLASPGMLQSGISLEL 353
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLI 119
F +WC N II+ GT+A+D++
Sbjct: 354 FERWCGDRRNGIIMAGYCVDGTIAKDVL 381
>gi|196007172|ref|XP_002113452.1| hypothetical protein TRIADDRAFT_57642 [Trichoplax adhaerens]
gi|190583856|gb|EDV23926.1| hypothetical protein TRIADDRAFT_57642 [Trichoplax adhaerens]
Length = 596
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 30/102 (29%), Positives = 50/102 (49%), Gaps = 6/102 (5%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ ++F ++N F FKH+K L P+P VV + + G S +F
Sbjct: 283 ITWTNQKIRRTF--VQHNMFEFKHIKPFDR--ALIDNPNPMVVFATPGMLHGGLSLQIFK 338
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRI 135
+W +N +I+ GT+ + L G RT+ L+ K+ I
Sbjct: 339 KWAPDDKNMVILPGYCVAGTVGNKI--LSGQRTVELENKQII 378
>gi|350585498|ref|XP_003127541.3| PREDICTED: integrator complex subunit 11-like [Sus scrofa]
Length = 599
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 29/105 (27%), Positives = 50/105 (47%), Gaps = 6/105 (5%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K A P P VV + + G S +F
Sbjct: 283 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADSPGPMVVFATPGMLHAGQSLQIFR 338
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
+W + +N +I+ GT+ + L G R L L+ ++ + ++
Sbjct: 339 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLELEGRQVLEVK 381
>gi|340966678|gb|EGS22185.1| putative cleavage and polyadenylation protein [Chaetomium
thermophilum var. thermophilum DSM 1495]
Length = 998
Score = 47.8 bits (112), Expect = 0.005, Method: Composition-based stats.
Identities = 46/174 (26%), Positives = 71/174 (40%), Gaps = 60/174 (34%)
Query: 34 IEWMSDKLMKSFE----GARNN----------------PFHFKHVKLCHSLAELAKV--- 70
+EWM D +++ FE GAR PF FK+++L A++ +V
Sbjct: 334 LEWMDDSIVREFEAVAGGARTTNGGANASGGNKAKEAGPFDFKYLRLLERKAQIERVLQQ 393
Query: 71 -------PSPK--VVLVSTPDMECGFSRDLFFQWCSSPENSIIIT---NRTSPG--TLAR 116
SPK V+L + ++ GFS+D+ S N +I+T N +P ++AR
Sbjct: 394 ATSPPEGESPKGTVILATDTSLDWGFSKDVLKAIASDARNLVILTEKPNLANPDRPSIAR 453
Query: 117 DLIE-----------------------LGGNRTLTLQVKKRIRLEGEELEEYQK 147
L + GG R L++ R LEG EL YQ+
Sbjct: 454 MLWDWWRERRDGVAVEQTASGDTFEQVYGGGRELSVPESTRHPLEGSELTVYQQ 507
>gi|241953057|ref|XP_002419250.1| subunit of mRNA cleavage and polyadenylation factor, putative
[Candida dubliniensis CD36]
gi|223642590|emb|CAX42840.1| subunit of mRNA cleavage and polyadenylation factor, putative
[Candida dubliniensis CD36]
Length = 930
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 41/147 (27%), Positives = 68/147 (46%), Gaps = 24/147 (16%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECG-FSRDLF 92
++WMS K +E + PF+ V L +EL + PK+V S D+ G S + F
Sbjct: 286 LDWMSKSFTKEWEELSSVPFNPSKVDLLLDPSELLNLSGPKIVFCSGIDLRSGDISAEAF 345
Query: 93 FQWCSSPENSIIITNRTSP--------------GTLAR--------DLIELGGNRTLTLQ 130
C+ +II+T +T+ TLA+ D I + ++ ++L+
Sbjct: 346 QYLCNDERTTIILTEKTTMSLESSLSSILYTEWDTLAKKRGGGESADGIAVPIDKNISLK 405
Query: 131 -VKKRIRLEGEELEEYQKKKDKEAKDK 156
K I L G EL E+Q+K ++ K+K
Sbjct: 406 NWTKEIELTGTELTEFQEKVAQKRKEK 432
>gi|340518710|gb|EGR48950.1| predicted protein [Trichoderma reesei QM6a]
Length = 962
Score = 47.8 bits (112), Expect = 0.005, Method: Composition-based stats.
Identities = 44/165 (26%), Positives = 68/165 (41%), Gaps = 51/165 (30%)
Query: 34 IEWMSDKLMKSFE----GARN----------NPFHFKHVKLCHSLAELAKVPSP------ 73
+EWM + +++ FE G R PF FK+++L A++ K+ S
Sbjct: 332 LEWMDNNIVQEFEAFAEGQRKVNGGSEKKEGAPFDFKYLRLLERKAQIIKLLSQNIENGE 391
Query: 74 ---KVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPG-----TLARDLIE----- 120
+V+L S ME GFS+DL N +I+T R S +++R L E
Sbjct: 392 THGRVILASDITMEWGFSKDLVKGLARDSRNLVILTERPSLAREDSPSISRTLWEWWKER 451
Query: 121 ------------------LGGNRTLTLQVKKRIRLEGEELEEYQK 147
GG R L ++ R LEG++L YQ+
Sbjct: 452 RDGVSTEHASTGDTLEMVYGGGRELEVREAVREPLEGDDLAIYQQ 496
>gi|328854195|gb|EGG03329.1| hypothetical protein MELLADRAFT_90299 [Melampsora larici-populina
98AG31]
Length = 695
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 41/155 (26%), Positives = 71/155 (45%), Gaps = 31/155 (20%)
Query: 35 EWMS-----DKLMKSF---EGARNNPFHFKHVKLCHSLAEL---AKVPSPKVVLVSTPDM 83
EWMS + +KS EG +++K +S+ L PK +L M
Sbjct: 108 EWMSPLTPTESQLKSRARDEGPGGIALRLRNLKFFNSIEALESQTAAIQPKCILAVPLTM 167
Query: 84 ECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIE------------------LGGNR 125
GFSR +F + P N +++T+ +L R L + + N
Sbjct: 168 AYGFSRRMFTRHVGKPGNLVVLTSMGEKESLTRWLADQVNEKSEAKYGSGTIPEPIDLNT 227
Query: 126 TLTLQVKKRIRLEGEELEEY--QKKKDKEAKDKQE 158
++++++K+++ LEGEELE+Y K++ KE + K E
Sbjct: 228 SVSVELKRKVVLEGEELEQYLEDKQRAKERRTKHE 262
>gi|154336691|ref|XP_001564581.1| putative cleavage and polyadenylation specificity factor
[Leishmania braziliensis MHOM/BR/75/M2904]
gi|134061616|emb|CAM38647.1| putative cleavage and polyadenylation specificity factor
[Leishmania braziliensis MHOM/BR/75/M2904]
Length = 756
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 29/88 (32%), Positives = 46/88 (52%), Gaps = 6/88 (6%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVP--SPKVVLVSTPDMECGFSRDL 91
+ M+D++ K +NPF FK++ HSL + P VVL S ++ G S +L
Sbjct: 298 VSAMNDRV-KQQHANHHNPFVFKYI---HSLIDTKSFEDNGPCVVLASPGMLQSGISLEL 353
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLI 119
F +WC N II+ GT+A+D++
Sbjct: 354 FERWCGDRRNGIIMAGYCVDGTIAKDVL 381
>gi|398022636|ref|XP_003864480.1| cleavage and polyadenylation specificity factor, putative
[Leishmania donovani]
gi|322502715|emb|CBZ37798.1| cleavage and polyadenylation specificity factor, putative
[Leishmania donovani]
Length = 756
Score = 47.4 bits (111), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 29/88 (32%), Positives = 46/88 (52%), Gaps = 6/88 (6%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVP--SPKVVLVSTPDMECGFSRDL 91
+ M+D++ K +NPF FK++ HSL + P VVL S ++ G S +L
Sbjct: 298 VSAMNDRV-KQQHANHHNPFVFKYI---HSLMDTKSFEDNGPCVVLASPGMLQSGISLEL 353
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLI 119
F +WC N II+ GT+A+D++
Sbjct: 354 FERWCGDRRNGIIMAGYCVDGTIAKDVL 381
>gi|223997482|ref|XP_002288414.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220975522|gb|EED93850.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 557
Score = 47.4 bits (111), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 27/91 (29%), Positives = 43/91 (47%), Gaps = 10/91 (10%)
Query: 50 NNPFHFKHVKLCHSL-AELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 108
NPFHF H++ S+ P VV S ++ G SR LF +W P+N +++
Sbjct: 308 GNPFHFSHIRNLKSIDVNNFDDRGPSVVFASPGMLQSGVSRQLFDRWAGDPKNGVMLAGY 367
Query: 109 TSPGTLARDLIELGGNRTLTLQVKKRIRLEG 139
TLA++++ Q K+ + LEG
Sbjct: 368 AVEHTLAKEIMS---------QPKEVVTLEG 389
>gi|146099573|ref|XP_001468678.1| putative cleavage and polyadenylation specificity factor
[Leishmania infantum JPCM5]
gi|134073046|emb|CAM71766.1| putative cleavage and polyadenylation specificity factor
[Leishmania infantum JPCM5]
Length = 756
Score = 47.4 bits (111), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 29/88 (32%), Positives = 46/88 (52%), Gaps = 6/88 (6%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVP--SPKVVLVSTPDMECGFSRDL 91
+ M+D++ K +NPF FK++ HSL + P VVL S ++ G S +L
Sbjct: 298 VSAMNDRV-KQQHANHHNPFVFKYI---HSLMDTKSFEDNGPCVVLASPGMLQSGISLEL 353
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLI 119
F +WC N II+ GT+A+D++
Sbjct: 354 FERWCGDRRNGIIMAGYCVDGTIAKDVL 381
>gi|390601510|gb|EIN10904.1| hypothetical protein PUNSTDRAFT_112695 [Punctularia strigosozonata
HHB-11173 SS5]
Length = 937
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 37/132 (28%), Positives = 61/132 (46%), Gaps = 24/132 (18%)
Query: 53 FHFKHVKLCHSLAELAKVPS---PKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRT 109
F+H++ + L S PK++L + G SR LF + + P+N II+T R
Sbjct: 388 LRFRHLEFFPNPQALLNTYSSKDPKLILAVPASLSHGPSRALFSTFAAVPDNVIILTQRG 447
Query: 110 SPGTLARDLI----------------ELGGNRTL----TLQVKKRIRLEGEELEEYQKKK 149
GTL DL ++G N L L++ ++ L+G+ELE + K
Sbjct: 448 EEGTLGNDLFKKWNNSQRAEHKWDKGKIGSNVMLDGNMILKMNSKVPLQGDELEAFL-AK 506
Query: 150 DKEAKDKQEKEK 161
++ A +K+ EK
Sbjct: 507 ERAAMEKEAAEK 518
>gi|322710530|gb|EFZ02104.1| cleavage and polyadenylation specifity factor [Metarhizium
anisopliae ARSEF 23]
Length = 831
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 26/79 (32%), Positives = 47/79 (59%), Gaps = 4/79 (5%)
Query: 44 SFEGA-RNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFFQWCSSPEN 101
S +GA + P+ FK+++ +L V V++++P M + G SR+LF +W S +N
Sbjct: 326 SGDGAGQGGPWDFKYIRSLKNLDRFDDVGG--CVMLASPGMLQSGVSRELFERWAPSEKN 383
Query: 102 SIIITNRTSPGTLARDLIE 120
+IIT + GT+AR +++
Sbjct: 384 GVIITGYSVEGTMARQIMQ 402
>gi|324504608|gb|ADY41989.1| Integrator complex subunit 11 [Ascaris suum]
Length = 588
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 38/125 (30%), Positives = 60/125 (48%), Gaps = 22/125 (17%)
Query: 26 HSLAELAK------IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVS 79
H LAE A I W ++K+ ++F N F FKH++ + P P +VL S
Sbjct: 270 HGLAEKATQYYRLFISWTNEKIKRTF--VHRNMFDFKHIRPFDQ--SFSDSPGP-MVLFS 324
Query: 80 TPDM-ECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
TP M G S +F +WCS +N +I+ GT+ +I GG K++ ++
Sbjct: 325 TPGMLHGGQSLRVFKKWCSDEKNMVIMPGFCVAGTIGAKVI--GG--------AKKVEID 374
Query: 139 GEELE 143
G+ L+
Sbjct: 375 GKMLD 379
>gi|400600571|gb|EJP68245.1| metallo-beta-lactamase superfamily protein [Beauveria bassiana
ARSEF 2860]
Length = 866
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 25/78 (32%), Positives = 44/78 (56%), Gaps = 3/78 (3%)
Query: 44 SFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFFQWCSSPENS 102
S E P+ FK+++ +L V V++++P M + G SR+LF +W S +N
Sbjct: 324 SGEAGAGGPWDFKYIRSLKNLDRFDDVGG--CVMLASPGMLQNGVSRELFERWAPSDKNG 381
Query: 103 IIITNRTSPGTLARDLIE 120
+IIT + GT+AR +++
Sbjct: 382 VIITGYSVEGTMARQIMK 399
>gi|344283025|ref|XP_003413273.1| PREDICTED: integrator complex subunit 11-like [Loxodonta africana]
Length = 719
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 44/193 (22%), Positives = 79/193 (40%), Gaps = 15/193 (7%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K A P P VV + + G S +F
Sbjct: 402 ITWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 457
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGE-ELEEYQKKKDKE 152
+W + +N +I+ GT+ + L G R L ++ ++ + ++ + E + D +
Sbjct: 458 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAK 515
Query: 153 AKDKQEKEKIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVS 212
+ + P H E + +F + I+ EF + C V L +
Sbjct: 516 GIMQLIGQAEPEHVLLVHGEAKKMEF----LKQKIEQEFR----VNCYMPANGETVTLPT 567
Query: 213 TPDMECGFSRDLF 225
+P + G S L
Sbjct: 568 SPSIPVGISLGLL 580
>gi|432090010|gb|ELK23618.1| Integrator complex subunit 11 [Myotis davidii]
Length = 561
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 31/111 (27%), Positives = 53/111 (47%), Gaps = 8/111 (7%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K A P P VV + + G S +F
Sbjct: 317 ITWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 372
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKK--RIRLEGEEL 142
+W + +N +I+ GT+ + L G R L ++ ++ IR++ E +
Sbjct: 373 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEIRMQVEYM 421
>gi|426240429|ref|XP_004014105.1| PREDICTED: integrator complex subunit 11 [Ovis aries]
Length = 515
Score = 47.4 bits (111), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K A P P VV + + G S +F
Sbjct: 283 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADSPGPMVVFATPGMLHAGQSLQIFR 338
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
+W + +N +I+ GT+ + L G R L ++ ++ + ++
Sbjct: 339 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 381
>gi|74220481|dbj|BAE31460.1| unnamed protein product [Mus musculus]
Length = 600
Score = 47.4 bits (111), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K A P P VV + + G S +F
Sbjct: 283 ITWTNQKIRKTF--VQGNMFEFKHIKAFDR--TFADNPGPMVVFATPGMLHAGQSLQIFR 338
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
+W + +N +I+ GT+ + L G R L ++ ++ + ++
Sbjct: 339 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQMLEVK 381
>gi|343958192|dbj|BAK62951.1| protein related to CPSF subunits 68 kDa [Pan troglodytes]
Length = 600
Score = 47.4 bits (111), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 28/105 (26%), Positives = 51/105 (48%), Gaps = 6/105 (5%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K A P P VV + + G S +F
Sbjct: 283 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 338
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
+W + +N +I+ GT+ ++ L G R L ++ ++ + ++
Sbjct: 339 KWAGNEKNMVIMPGYCVQGTVGHEI--LSGQRKLEMEGRQVLEVK 381
>gi|389740019|gb|EIM81211.1| mRNA 3'-end-processing protein YSH1 [Stereum hirsutum FP-91666 SS1]
Length = 841
Score = 47.0 bits (110), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 29/75 (38%), Positives = 42/75 (56%), Gaps = 5/75 (6%)
Query: 49 RNNPFHFKHVK-LCHSLA---ELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSII 104
R+NPF FKH+ + S ++A+ P P VVL S M+ G SR L W N +I
Sbjct: 298 RDNPFVFKHISNMPQSSGWERKIAEGP-PCVVLASPGFMQSGPSRQLLELWAPDSRNGLI 356
Query: 105 ITNRTSPGTLARDLI 119
+T + GTLAR+++
Sbjct: 357 VTGYSVEGTLAREIM 371
>gi|449546825|gb|EMD37794.1| hypothetical protein CERSUDRAFT_154677 [Ceriporiopsis subvermispora
B]
Length = 820
Score = 47.0 bits (110), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 28/75 (37%), Positives = 40/75 (53%), Gaps = 5/75 (6%)
Query: 49 RNNPFHFKHVKLCHSLA----ELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSII 104
R+NPF FKH+ ++A+ P P VVL S + G SR+L W N II
Sbjct: 298 RDNPFVFKHISNVPQARGWERKIAEGP-PCVVLASPGFVTSGPSRELLELWAPDSRNGII 356
Query: 105 ITNRTSPGTLARDLI 119
+T + GT+ARD++
Sbjct: 357 VTGYSVEGTMARDIL 371
>gi|414881435|tpg|DAA58566.1| TPA: putative RNA-metabolising metallo-beta-lactamase [Zea mays]
Length = 558
Score = 47.0 bits (110), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 31/87 (35%), Positives = 41/87 (47%), Gaps = 6/87 (6%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLF 92
I W S K+ S +NPF FKHV CH P P VL +TP M GFS + F
Sbjct: 285 IGWTSQKIKDSH--TVHNPFDFKHV--CHFERSFINNPGP-CVLFATPGMITGGFSLEAF 339
Query: 93 FQWCSSPENSIIITNRTSPGTLARDLI 119
+W S +N + + GT+ L+
Sbjct: 340 KKWAPSEKNLVTLPGYCVSGTIGHKLM 366
>gi|339244969|ref|XP_003378410.1| putative metallo-beta-lactamase domain protein [Trichinella
spiralis]
gi|316972680|gb|EFV56345.1| putative metallo-beta-lactamase domain protein [Trichinella
spiralis]
Length = 562
Score = 47.0 bits (110), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 30/97 (30%), Positives = 46/97 (47%), Gaps = 6/97 (6%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ W S+K+ K+F + N F FKHV A P P VV + + G S +F
Sbjct: 250 VTWTSEKIKKTF--VKRNMFDFKHVLPFED--SFADTPGPMVVFATPGMLHSGQSLKIFK 305
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQ 130
+W ++ +N +I+ GT+ LI G R L L+
Sbjct: 306 KWATNEKNMVIMPGYCVQGTVGSKLI--AGVRKLDLE 340
>gi|274326663|ref|NP_001094578.1| integrator complex subunit 11 [Bos taurus]
gi|152941100|gb|ABS44987.1| related to CPSF subunits 68 kDa [Bos taurus]
Length = 599
Score = 47.0 bits (110), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K A P P VV + + G S +F
Sbjct: 283 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADSPGPMVVFATPGMLHAGQSLQIFR 338
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
+W + +N +I+ GT+ + L G R L ++ ++ + ++
Sbjct: 339 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 381
>gi|355680857|gb|AER96662.1| cleavage and polyadenylation specific factor 3-like protein
[Mustela putorius furo]
Length = 440
Score = 47.0 bits (110), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K A P P VV + + G S +F
Sbjct: 292 ITWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 347
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
+W + +N +I+ GT+ + L G R L ++ ++ + ++
Sbjct: 348 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 390
>gi|118572556|sp|Q2YDM2.2|INT11_BOVIN RecName: Full=Integrator complex subunit 11; Short=Int11; AltName:
Full=Cleavage and polyadenylation-specific factor 3-like
protein; Short=CPSF3-like protein
gi|158455110|gb|AAI10156.2| CPSF3L protein [Bos taurus]
Length = 599
Score = 47.0 bits (110), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K A P P VV + + G S +F
Sbjct: 283 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADSPGPMVVFATPGMLHAGQSLQIFR 338
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
+W + +N +I+ GT+ + L G R L ++ ++ + ++
Sbjct: 339 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 381
>gi|440911726|gb|ELR61363.1| Integrator complex subunit 11 [Bos grunniens mutus]
Length = 599
Score = 47.0 bits (110), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K A P P VV + + G S +F
Sbjct: 283 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADSPGPMVVFATPGMLHAGQSLQIFR 338
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
+W + +N +I+ GT+ + L G R L ++ ++ + ++
Sbjct: 339 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 381
>gi|328873132|gb|EGG21499.1| integrator complex subunit 11 [Dictyostelium fasciculatum]
Length = 645
Score = 47.0 bits (110), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 40/132 (30%), Positives = 60/132 (45%), Gaps = 18/132 (13%)
Query: 14 NNPFHFKHVKLCHSLAELAK------IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAEL 67
N P +F LAE A I W + K+ ++F + N F FKH+K L
Sbjct: 261 NVPIYF-----SEGLAEKANFYYKLFITWTNQKIKQTF--VKRNMFDFKHIKPFDR--HL 311
Query: 68 AKVPSPKVVLVSTPDM-ECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRT 126
A P P +VL +TP M G S ++F +W + N II GT+ L+ G
Sbjct: 312 ADAPGP-MVLFATPGMLHAGASLEVFKKWAPNELNMTIIPGYCVVGTVGNKLLSNAGGPQ 370
Query: 127 LTLQVKKRIRLE 138
+ +++ K+ LE
Sbjct: 371 M-VEIDKKTTLE 381
>gi|296479091|tpg|DAA21206.1| TPA: cleavage and polyadenylation specific factor 3-like [Bos
taurus]
Length = 599
Score = 47.0 bits (110), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K A P P VV + + G S +F
Sbjct: 283 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADSPGPMVVFATPGMLHAGQSLQIFR 338
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
+W + +N +I+ GT+ + L G R L ++ ++ + ++
Sbjct: 339 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 381
>gi|134083194|emb|CAK42833.1| unnamed protein product [Aspergillus niger]
Length = 865
Score = 47.0 bits (110), Expect = 0.009, Method: Composition-based stats.
Identities = 24/80 (30%), Positives = 41/80 (51%), Gaps = 7/80 (8%)
Query: 34 IEWMSDKLMKSFEGARNN----PFHFKHVKLCHSLAELAKVPS---PKVVLVSTPDMECG 86
+EWM + +++ FE A PF FKH+++ L K+ S PKV+L S ++ G
Sbjct: 325 LEWMDENIVREFEAAEEGKGVGPFTFKHLRILERKKRLEKILSDQKPKVILASDTSLDWG 384
Query: 87 FSRDLFFQWCSSPENSIIIT 106
F++D N +++T
Sbjct: 385 FAKDSLRLVAEGANNLLLLT 404
>gi|440638117|gb|ELR08036.1| hypothetical protein GMDG_02874 [Geomyces destructans 20631-21]
Length = 831
Score = 47.0 bits (110), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 24/74 (32%), Positives = 41/74 (55%), Gaps = 1/74 (1%)
Query: 47 GARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIIT 106
G P+ FK+++ SL V S V+L S M+ G SR+L +W S +N ++IT
Sbjct: 325 GTNGGPWDFKYIRSLKSLERFDDVGS-CVMLASPGMMQNGVSRELLERWAPSDKNGVVIT 383
Query: 107 NRTSPGTLARDLIE 120
+ GT+A+ +++
Sbjct: 384 GYSVEGTMAKSIMQ 397
>gi|348551496|ref|XP_003461566.1| PREDICTED: integrator complex subunit 11 [Cavia porcellus]
Length = 600
Score = 47.0 bits (110), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 45/195 (23%), Positives = 79/195 (40%), Gaps = 15/195 (7%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K A P P VV + + G S +F
Sbjct: 283 ITWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 338
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGE-ELEEYQKKKDKE 152
+W + +N +I+ GT+ + L G R L ++ ++ + ++ + E + D +
Sbjct: 339 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAK 396
Query: 153 AKDKQEKEKIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVS 212
+ + P E + +F R I+ EF + C V L +
Sbjct: 397 GIMQLVGQAEPESVLLVHGEAKKMEF----LRQKIEQEFR----VSCYMPANGETVTLPT 448
Query: 213 TPDMECGFSRDLFFQ 227
+P + G S L Q
Sbjct: 449 SPSIPVGISLGLLKQ 463
>gi|221484558|gb|EEE22852.1| cleavage and polyadenylation specificity factor, putative
[Toxoplasma gondii GT1]
Length = 1100
Score = 47.0 bits (110), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 26/72 (36%), Positives = 40/72 (55%), Gaps = 5/72 (6%)
Query: 51 NPFHFKHVKLCHSLAELAKV----PSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIIT 106
NPF F+ VK S+ E A+V P VV+ + ++ G SR++F W +N +I+T
Sbjct: 423 NPFAFRFVKNVKSV-EAARVYIHHDGPAVVMAAPGMLQSGASREIFEAWAPDAKNGVILT 481
Query: 107 NRTSPGTLARDL 118
+ GTLA +L
Sbjct: 482 GYSVKGTLADEL 493
>gi|301618510|ref|XP_002938656.1| PREDICTED: integrator complex subunit 11 isoform 1 [Xenopus
(Silurana) tropicalis]
gi|301618512|ref|XP_002938657.1| PREDICTED: integrator complex subunit 11 isoform 2 [Xenopus
(Silurana) tropicalis]
Length = 600
Score = 47.0 bits (110), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 32/108 (29%), Positives = 50/108 (46%), Gaps = 12/108 (11%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K A P P VV + + G S +F
Sbjct: 283 ITWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 338
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTL------TLQVKKRI 135
+W + +N +I+ GT+ + L G R L TL+VK ++
Sbjct: 339 KWAGNDKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQTLEVKMQV 384
>gi|384499309|gb|EIE89800.1| hypothetical protein RO3G_14511 [Rhizopus delemar RA 99-880]
Length = 654
Score = 47.0 bits (110), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 27/83 (32%), Positives = 44/83 (53%), Gaps = 3/83 (3%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I M+ ++ K F A +NPF FKH+ ++ + P V++ S ++ G SR+LF
Sbjct: 277 INMMNARIRKQF--AISNPFVFKHISNLKNVEQFED-SGPCVMMASPGMLQNGLSRELFE 333
Query: 94 QWCSSPENSIIITNRTSPGTLAR 116
+W +N ++IT TLAR
Sbjct: 334 RWAPDKKNGLVITGYCVENTLAR 356
>gi|221504752|gb|EEE30417.1| cleavage and polyadenylation specificity factor, putative
[Toxoplasma gondii VEG]
Length = 1100
Score = 47.0 bits (110), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 26/72 (36%), Positives = 40/72 (55%), Gaps = 5/72 (6%)
Query: 51 NPFHFKHVKLCHSLAELAKV----PSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIIT 106
NPF F+ VK S+ E A+V P VV+ + ++ G SR++F W +N +I+T
Sbjct: 423 NPFAFRFVKNVKSV-EAARVYIHHDGPAVVMAAPGMLQSGASREIFEAWAPDAKNGVILT 481
Query: 107 NRTSPGTLARDL 118
+ GTLA +L
Sbjct: 482 GYSVKGTLADEL 493
>gi|237839761|ref|XP_002369178.1| cleavage and polyadenylation specificity factor, putative
[Toxoplasma gondii ME49]
gi|211966842|gb|EEB02038.1| cleavage and polyadenylation specificity factor, putative
[Toxoplasma gondii ME49]
Length = 1100
Score = 47.0 bits (110), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 26/72 (36%), Positives = 40/72 (55%), Gaps = 5/72 (6%)
Query: 51 NPFHFKHVKLCHSLAELAKV----PSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIIT 106
NPF F+ VK S+ E A+V P VV+ + ++ G SR++F W +N +I+T
Sbjct: 423 NPFAFRFVKNVKSV-EAARVYIHHDGPAVVMAAPGMLQSGASREIFEAWAPDAKNGVILT 481
Query: 107 NRTSPGTLARDL 118
+ GTLA +L
Sbjct: 482 GYSVKGTLADEL 493
>gi|76559911|ref|NP_001029064.1| integrator complex subunit 11 [Rattus norvegicus]
gi|119371245|sp|Q3MHC2.1|INT11_RAT RecName: Full=Integrator complex subunit 11; Short=Int11; AltName:
Full=Cleavage and polyadenylation-specific factor 3-like
protein; Short=CPSF3-like protein
gi|75867808|gb|AAI05304.1| Cleavage and polyadenylation specific factor 3-like [Rattus
norvegicus]
Length = 600
Score = 47.0 bits (110), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K A P P VV + + G S +F
Sbjct: 283 ITWTNQKIRKTF--VQRNMFEFKHIKAFDR--TFADNPGPMVVFATPGMLHAGQSLQIFR 338
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
+W + +N +I+ GT+ + L G R L ++ ++ + ++
Sbjct: 339 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQMLEVK 381
>gi|149024842|gb|EDL81339.1| similar to RIKEN cDNA 2410006F12 [Rattus norvegicus]
Length = 601
Score = 47.0 bits (110), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K A P P VV + + G S +F
Sbjct: 284 ITWTNQKIRKTF--VQRNMFEFKHIKAFDR--TFADNPGPMVVFATPGMLHAGQSLQIFR 339
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
+W + +N +I+ GT+ + L G R L ++ ++ + ++
Sbjct: 340 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQMLEVK 382
>gi|21312614|ref|NP_082296.1| integrator complex subunit 11 [Mus musculus]
gi|81904239|sp|Q9CWS4.1|INT11_MOUSE RecName: Full=Integrator complex subunit 11; Short=Int11; AltName:
Full=Cleavage and polyadenylation-specific factor 3-like
protein; Short=CPSF3-like protein
gi|12845859|dbj|BAB26928.1| unnamed protein product [Mus musculus]
gi|26355309|dbj|BAC41135.1| unnamed protein product [Mus musculus]
gi|74192536|dbj|BAE43054.1| unnamed protein product [Mus musculus]
gi|74219576|dbj|BAE29558.1| unnamed protein product [Mus musculus]
gi|148683102|gb|EDL15049.1| cleavage and polyadenylation specific factor 3-like, isoform CRA_b
[Mus musculus]
Length = 600
Score = 47.0 bits (110), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K A P P VV + + G S +F
Sbjct: 283 ITWTNQKIRKTF--VQRNMFEFKHIKAFDR--TFADNPGPMVVFATPGMLHAGQSLQIFR 338
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
+W + +N +I+ GT+ + L G R L ++ ++ + ++
Sbjct: 339 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQMLEVK 381
>gi|74198351|dbj|BAE39661.1| unnamed protein product [Mus musculus]
Length = 600
Score = 47.0 bits (110), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K A P P VV + + G S +F
Sbjct: 283 ITWTNQKIRKTF--VQRNMFEFKHIKAFDR--TFADNPGPMVVFATPGMLHAGQSLQIFR 338
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
+W + +N +I+ GT+ + L G R L ++ ++ + ++
Sbjct: 339 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQMLEVK 381
>gi|15029864|gb|AAH11155.1| Cleavage and polyadenylation specific factor 3-like [Mus musculus]
Length = 600
Score = 47.0 bits (110), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K A P P VV + + G S +F
Sbjct: 283 ITWTNQKIRKTF--VQRNMFEFKHIKAFDR--TFADNPGPMVVFATPGMLHAGQSLQIFR 338
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
+W + +N +I+ GT+ + L G R L ++ ++ + ++
Sbjct: 339 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQMLEVK 381
>gi|345563127|gb|EGX46131.1| hypothetical protein AOL_s00110g295 [Arthrobotrys oligospora ATCC
24927]
Length = 982
Score = 47.0 bits (110), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 31/111 (27%), Positives = 46/111 (41%), Gaps = 29/111 (26%)
Query: 32 AKIEWMSDKLMKSFEGARNN--------------------------PFHFKHVKLC---H 62
A +EWM + ++K FE +N PF F+H+KL H
Sbjct: 310 ALLEWMDESMVKEFESVTHNNNPSSRRKPKSSNTGAGDKEDDKLYGPFDFRHLKLVEHKH 369
Query: 63 SLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGT 113
L ++ KVV+ S +E GFS ++ N I++T R S GT
Sbjct: 370 QLTKILSRKGGKVVITSDKSLEWGFSTEVVKSIADDERNLIVLTERGSEGT 420
>gi|357618297|gb|EHJ71333.1| putative cleavage and polyadenylation specificity factor [Danaus
plexippus]
Length = 385
Score = 46.6 bits (109), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 29/48 (60%)
Query: 72 SPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLI 119
P V++ S M+ G SR+LF WC+ P+N +II GTLA+ ++
Sbjct: 29 GPCVIMASPGMMQSGLSRELFESWCTDPKNGVIIAGYCVEGTLAKTIL 76
>gi|281348165|gb|EFB23749.1| hypothetical protein PANDA_020173 [Ailuropoda melanoleuca]
Length = 591
Score = 46.6 bits (109), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K A P P VV + + G S +F
Sbjct: 274 ITWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 329
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
+W + +N +I+ GT+ + L G R L ++ ++ + ++
Sbjct: 330 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 372
>gi|254567914|ref|XP_002491067.1| hypothetical protein [Komagataella pastoris GS115]
gi|238030864|emb|CAY68787.1| hypothetical protein PAS_chr2-1_0816 [Komagataella pastoris GS115]
gi|328352406|emb|CCA38805.1| Cleavage and polyadenylation specificity factor subunit 2
[Komagataella pastoris CBS 7435]
Length = 854
Score = 46.6 bits (109), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 37/156 (23%), Positives = 70/156 (44%), Gaps = 20/156 (12%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECG-FSRDLF 92
+EWMS K++K +E PF V+L + +L ++P KVV + D+ G +
Sbjct: 284 LEWMSPKIIKDWEERNETPFDPSRVQLV-DVDDLVQLPGAKVVFTADADLTIGSTAHSTL 342
Query: 93 FQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVK---------------KRI-- 135
C +N+II T R + + + E+ TL K R+
Sbjct: 343 ASICIDEKNTIIFTERPTNSSFGASIYEIWEKLTLERNGKLEDGFPVPFEKLLTFSRVTL 402
Query: 136 -RLEGEELEEYQKKKDKEAKDKQEKEKIPPHDTSFI 170
+L G EL +Y + ++ ++K++K ++ +T+ +
Sbjct: 403 KKLTGLELAQYTEIVNERKQEKRKKRQVEKMNTTIL 438
>gi|334321967|ref|XP_001364674.2| PREDICTED: integrator complex subunit 11-like [Monodelphis
domestica]
Length = 600
Score = 46.6 bits (109), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 43/193 (22%), Positives = 81/193 (41%), Gaps = 15/193 (7%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K A P P VV + + G S +F
Sbjct: 283 ITWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 338
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGE-ELEEYQKKKDKE 152
+W + +N +I+ GT+ + L G R L ++ ++ + ++ + E + D +
Sbjct: 339 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQILEVKMQVEYMSFSAHADAK 396
Query: 153 AKDKQEKEKIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVS 212
+ ++ P + E + +F + I+ EF + C V L++
Sbjct: 397 GIMQLVRQAEPDNVLLVHGEAKKMEF----LKQKIEQEFH----VNCYMPANGETVTLLT 448
Query: 213 TPDMECGFSRDLF 225
P++ G S L
Sbjct: 449 HPNIPVGISLGLL 461
>gi|443926404|gb|ELU45071.1| mRNA 3'-end-processing protein YSH1 [Rhizoctonia solani AG-1 IA]
Length = 409
Score = 46.6 bits (109), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 28/78 (35%), Positives = 40/78 (51%), Gaps = 11/78 (14%)
Query: 49 RNNPFHFKHVKLCHS-------LAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPEN 101
++NPF FKH+ + +AE P V+L S M G SR+L W +N
Sbjct: 266 KDNPFVFKHISHLPATRGWERKIAEAG----PCVILASPGFMSSGPSRELLELWAPDAKN 321
Query: 102 SIIITNRTSPGTLARDLI 119
+IIT + GT+ARD+I
Sbjct: 322 GVIITGYSIEGTMARDII 339
>gi|417403203|gb|JAA48419.1| Putative mrna cleavage and polyadenylation factor ii complex brr5
cpsf subunit [Desmodus rotundus]
Length = 603
Score = 46.6 bits (109), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 30/111 (27%), Positives = 53/111 (47%), Gaps = 8/111 (7%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K A P P VV + + G S +F
Sbjct: 283 ITWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 338
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKK--RIRLEGEEL 142
+W + +N +I+ GT+ + L G R L ++ ++ +R++ E +
Sbjct: 339 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVRMQVEYM 387
>gi|444519369|gb|ELV12789.1| Integrator complex subunit 11 [Tupaia chinensis]
Length = 601
Score = 46.6 bits (109), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K A P P VV + + G S +F
Sbjct: 284 ITWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 339
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
+W + +N +I+ GT+ + L G R L ++ ++ + ++
Sbjct: 340 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 382
>gi|359319514|ref|XP_003639102.1| PREDICTED: integrator complex subunit 11-like [Canis lupus
familiaris]
Length = 600
Score = 46.6 bits (109), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K A P P VV + + G S +F
Sbjct: 283 ITWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 338
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
+W + +N +I+ GT+ + L G R L ++ ++ + ++
Sbjct: 339 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 381
>gi|414881433|tpg|DAA58564.1| TPA: putative RNA-metabolising metallo-beta-lactamase [Zea mays]
Length = 400
Score = 46.6 bits (109), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 31/87 (35%), Positives = 41/87 (47%), Gaps = 6/87 (6%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLF 92
I W S K+ S +NPF FKHV CH P P VL +TP M GFS + F
Sbjct: 285 IGWTSQKIKDSH--TVHNPFDFKHV--CHFERSFINNPGP-CVLFATPGMITGGFSLEAF 339
Query: 93 FQWCSSPENSIIITNRTSPGTLARDLI 119
+W S +N + + GT+ L+
Sbjct: 340 KKWAPSEKNLVTLPGYCVSGTIGHKLM 366
>gi|190346294|gb|EDK38344.2| hypothetical protein PGUG_02442 [Meyerozyma guilliermondii ATCC
6260]
Length = 821
Score = 46.6 bits (109), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 42/156 (26%), Positives = 69/156 (44%), Gaps = 33/156 (21%)
Query: 34 IEWMSDKLMKSFEGA---------RNN-PFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM 83
+EWMS L+K +E A +NN PF V L EL ++ PK+VL + DM
Sbjct: 173 LEWMSTSLVKEWEAASSASMNSTNKNNFPFDPSKVDLLSDPKELIQLSGPKIVLCAGIDM 232
Query: 84 ECG-FSRDLFFQWCSSPENSIIITNRTSPGT---------------------LARDLIEL 121
G S ++ CS +N++++T +T G A D + +
Sbjct: 233 NSGDVSFEVLKYLCSDQKNTVLLTEKTHFGADFSINAQLFTDWVRLSREKYGNAEDGLAI 292
Query: 122 GGNRTLTLQ-VKKRIRLEGEELEEYQKKKDKEAKDK 156
G T+ L+ + + L G EL +Q++ + + K K
Sbjct: 293 GYEGTIPLRGLSREDPLSGSELTSFQERINHQRKKK 328
>gi|338722203|ref|XP_001496423.3| PREDICTED: integrator complex subunit 11 [Equus caballus]
Length = 571
Score = 46.6 bits (109), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K A P P VV + + G S +F
Sbjct: 254 ITWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 309
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
+W + +N +I+ GT+ + L G R L ++ ++ + ++
Sbjct: 310 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 352
>gi|395840791|ref|XP_003793235.1| PREDICTED: integrator complex subunit 11 isoform 1 [Otolemur
garnettii]
Length = 600
Score = 46.6 bits (109), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K A P P VV + + G S +F
Sbjct: 283 ITWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 338
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
+W + +N +I+ GT+ + L G R L ++ ++ + ++
Sbjct: 339 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 381
>gi|301788922|ref|XP_002929872.1| PREDICTED: integrator complex subunit 11-like [Ailuropoda
melanoleuca]
Length = 600
Score = 46.6 bits (109), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K A P P VV + + G S +F
Sbjct: 283 ITWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 338
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
+W + +N +I+ GT+ + L G R L ++ ++ + ++
Sbjct: 339 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 381
>gi|296815164|ref|XP_002847919.1| cleavage and polyadenylation specificity factor subunit 2
[Arthroderma otae CBS 113480]
gi|238840944|gb|EEQ30606.1| cleavage and polyadenylation specificity factor subunit 2
[Arthroderma otae CBS 113480]
Length = 1000
Score = 46.6 bits (109), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 33/125 (26%), Positives = 50/125 (40%), Gaps = 32/125 (25%)
Query: 13 RNNPFHFKHVKLCHSLAELAK--IEWMSDKLMKSFEGARN-------------------- 50
+N P + K H LA+ +EWM + +++ FEG
Sbjct: 309 KNAPLYLAGKK-AHGTMRLARSMLEWMDENIVREFEGNDGVEVGAGKSGGGAANQPSKSA 367
Query: 51 ------NPFHFKHVKLCHSLAELAKV---PSPKVVLVSTPDMECGFSRDLFFQWCSSPEN 101
PF FKH+ L A+L + PKV+L +E G SR + Q + +N
Sbjct: 368 QGQKSLGPFTFKHLNLVEHKAKLDSILDSKGPKVILSPDASLEWGLSRHVLRQIAAGSDN 427
Query: 102 SIIIT 106
II+T
Sbjct: 428 LIIMT 432
>gi|414881434|tpg|DAA58565.1| TPA: putative RNA-metabolising metallo-beta-lactamase [Zea mays]
Length = 400
Score = 46.6 bits (109), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 31/87 (35%), Positives = 41/87 (47%), Gaps = 6/87 (6%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLF 92
I W S K+ S +NPF FKHV CH P P VL +TP M GFS + F
Sbjct: 285 IGWTSQKIKDSH--TVHNPFDFKHV--CHFERSFINNPGP-CVLFATPGMITGGFSLEAF 339
Query: 93 FQWCSSPENSIIITNRTSPGTLARDLI 119
+W S +N + + GT+ L+
Sbjct: 340 KKWAPSEKNLVTLPGYCVSGTIGHKLM 366
>gi|158256210|dbj|BAF84076.1| unnamed protein product [Homo sapiens]
Length = 606
Score = 46.6 bits (109), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K A P P VV + + G S +F
Sbjct: 289 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 344
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
+W + +N +I+ GT+ + L G R L ++ ++ + ++
Sbjct: 345 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 387
>gi|431922648|gb|ELK19568.1| Integrator complex subunit 11 [Pteropus alecto]
Length = 603
Score = 46.6 bits (109), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K A P P VV + + G S +F
Sbjct: 283 ITWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 338
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
+W + +N +I+ GT+ + L G R L ++ ++ + ++
Sbjct: 339 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 381
>gi|355744837|gb|EHH49462.1| hypothetical protein EGM_00117, partial [Macaca fascicularis]
Length = 592
Score = 46.6 bits (109), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K A P P VV + + G S +F
Sbjct: 275 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 330
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
+W + +N +I+ GT+ + L G R L ++ ++ + ++
Sbjct: 331 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 373
>gi|342180524|emb|CCC90000.1| putative cleavage and polyadenylation specificity factor subunit
[Trypanosoma congolense IL3000]
Length = 766
Score = 46.6 bits (109), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 28/88 (31%), Positives = 47/88 (53%), Gaps = 6/88 (6%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKV--PSPKVVLVSTPDMECGFSRDL 91
+ M+D++ + E RN PF FK+++ SL + P VVL S ++ G S +L
Sbjct: 310 VSAMNDRVKEQHENHRN-PFVFKYIQ---SLLDTRSFEDTGPCVVLASPGMLQSGISLEL 365
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLI 119
F +WC N II+ GT+A++++
Sbjct: 366 FERWCGDKRNGIIVAGYCVDGTIAKEIL 393
>gi|397476278|ref|XP_003809534.1| PREDICTED: integrator complex subunit 11 isoform 2 [Pan paniscus]
Length = 606
Score = 46.6 bits (109), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K A P P VV + + G S +F
Sbjct: 289 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 344
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
+W + +N +I+ GT+ + L G R L ++ ++ + ++
Sbjct: 345 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 387
>gi|193786492|dbj|BAG51775.1| unnamed protein product [Homo sapiens]
Length = 606
Score = 46.6 bits (109), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K A P P VV + + G S +F
Sbjct: 289 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 344
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
+W + +N +I+ GT+ + L G R L ++ ++ + ++
Sbjct: 345 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 387
>gi|374253819|ref|NP_001243385.1| integrator complex subunit 11 isoform 1 [Homo sapiens]
gi|119576642|gb|EAW56238.1| cleavage and polyadenylation specific factor 3-like, isoform CRA_f
[Homo sapiens]
Length = 606
Score = 46.6 bits (109), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K A P P VV + + G S +F
Sbjct: 289 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 344
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
+W + +N +I+ GT+ + L G R L ++ ++ + ++
Sbjct: 345 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 387
>gi|296206477|ref|XP_002750225.1| PREDICTED: integrator complex subunit 11 isoform 1 [Callithrix
jacchus]
Length = 600
Score = 46.6 bits (109), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K A P P VV + + G S +F
Sbjct: 283 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 338
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
+W + +N +I+ GT+ + L G R L ++ ++ + ++
Sbjct: 339 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 381
>gi|374253821|ref|NP_001243389.1| integrator complex subunit 11 isoform 3 [Homo sapiens]
gi|194386866|dbj|BAG59799.1| unnamed protein product [Homo sapiens]
Length = 571
Score = 46.6 bits (109), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K A P P VV + + G S +F
Sbjct: 254 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 309
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
+W + +N +I+ GT+ + L G R L ++ ++ + ++
Sbjct: 310 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 352
>gi|426327392|ref|XP_004024502.1| PREDICTED: integrator complex subunit 11 isoform 2 [Gorilla gorilla
gorilla]
Length = 606
Score = 46.6 bits (109), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K A P P VV + + G S +F
Sbjct: 289 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 344
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
+W + +N +I+ GT+ + L G R L ++ ++ + ++
Sbjct: 345 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 387
>gi|326932364|ref|XP_003212289.1| PREDICTED: integrator complex subunit 11-like [Meleagris gallopavo]
Length = 600
Score = 46.6 bits (109), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 47/100 (47%), Gaps = 6/100 (6%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K A P P VV + + G S +F
Sbjct: 283 ITWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 338
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKK 133
+W + +N +I+ GT+ + L G R L ++ ++
Sbjct: 339 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQ 376
>gi|449268484|gb|EMC79348.1| Integrator complex subunit 11 [Columba livia]
Length = 600
Score = 46.6 bits (109), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K A P P VV + + G S +F
Sbjct: 283 ITWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 338
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
+W + +N +I+ GT+ + L G R L ++ ++ + ++
Sbjct: 339 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQILEVK 381
>gi|430813249|emb|CCJ29377.1| unnamed protein product [Pneumocystis jirovecii]
Length = 574
Score = 46.6 bits (109), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 37/69 (53%), Gaps = 1/69 (1%)
Query: 51 NPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTS 110
NPF F+++ SL + P V+L S ++ G SR L +WC P+N +I+
Sbjct: 183 NPFIFRYISSLKSLDRFEDI-GPCVMLASPGMLQSGVSRALLEKWCPDPKNGLIVAGYCV 241
Query: 111 PGTLARDLI 119
GT+A+ ++
Sbjct: 242 EGTMAKHIL 250
>gi|395840793|ref|XP_003793236.1| PREDICTED: integrator complex subunit 11 isoform 2 [Otolemur
garnettii]
Length = 499
Score = 46.6 bits (109), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K A P P VV + + G S +F
Sbjct: 182 ITWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 237
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
+W + +N +I+ GT+ + L G R L ++ ++ + ++
Sbjct: 238 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 280
>gi|61098197|ref|NP_001012854.1| integrator complex subunit 11 [Gallus gallus]
gi|75571225|sp|Q5ZIH0.1|INT11_CHICK RecName: Full=Integrator complex subunit 11; Short=Int11; AltName:
Full=Cleavage and polyadenylation-specific factor 3-like
protein; Short=CPSF3-like protein
gi|53135966|emb|CAG32473.1| hypothetical protein RCJMB04_26e19 [Gallus gallus]
Length = 600
Score = 46.6 bits (109), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 47/100 (47%), Gaps = 6/100 (6%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K A P P VV + + G S +F
Sbjct: 283 ITWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 338
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKK 133
+W + +N +I+ GT+ + L G R L ++ ++
Sbjct: 339 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQ 376
>gi|426327394|ref|XP_004024503.1| PREDICTED: integrator complex subunit 11 isoform 3 [Gorilla gorilla
gorilla]
Length = 571
Score = 46.2 bits (108), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K A P P VV + + G S +F
Sbjct: 254 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 309
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
+W + +N +I+ GT+ + L G R L ++ ++ + ++
Sbjct: 310 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 352
>gi|403297738|ref|XP_003939709.1| PREDICTED: integrator complex subunit 11 isoform 1 [Saimiri
boliviensis boliviensis]
Length = 600
Score = 46.2 bits (108), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K A P P VV + + G S +F
Sbjct: 283 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 338
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
+W + +N +I+ GT+ + L G R L ++ ++ + ++
Sbjct: 339 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 381
>gi|402465801|gb|EJW01455.1| hypothetical protein EDEG_00447 [Edhazardia aedis USNM 41457]
Length = 774
Score = 46.2 bits (108), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 23/71 (32%), Positives = 39/71 (54%), Gaps = 3/71 (4%)
Query: 49 RNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 108
+N F FK+++ ++ + P VV+ S ++ G SRDLF +WC N +II
Sbjct: 284 KNVDFKFKYIRNINTFDDRN---LPCVVMASPGMLQSGLSRDLFEKWCEDKRNGVIIAGY 340
Query: 109 TSPGTLARDLI 119
GTLA++++
Sbjct: 341 CVQGTLAKEIL 351
>gi|402852593|ref|XP_003891002.1| PREDICTED: integrator complex subunit 11 isoform 1 [Papio anubis]
gi|355557446|gb|EHH14226.1| hypothetical protein EGK_00111 [Macaca mulatta]
gi|387540112|gb|AFJ70683.1| integrator complex subunit 11 [Macaca mulatta]
Length = 600
Score = 46.2 bits (108), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K A P P VV + + G S +F
Sbjct: 283 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 338
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
+W + +N +I+ GT+ + L G R L ++ ++ + ++
Sbjct: 339 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 381
>gi|296206479|ref|XP_002750226.1| PREDICTED: integrator complex subunit 11 isoform 2 [Callithrix
jacchus]
Length = 499
Score = 46.2 bits (108), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K A P P VV + + G S +F
Sbjct: 182 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 237
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
+W + +N +I+ GT+ + L G R L ++ ++ + ++
Sbjct: 238 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 280
>gi|33300633|ref|NP_060341.2| integrator complex subunit 11 isoform 2 [Homo sapiens]
gi|118572557|sp|Q5TA45.2|INT11_HUMAN RecName: Full=Integrator complex subunit 11; Short=Int11; AltName:
Full=Cleavage and polyadenylation-specific factor 3-like
protein; Short=CPSF3-like protein; AltName: Full=Protein
related to CPSF subunits of 68 kDa; Short=RC-68
gi|14124912|gb|AAH07978.1| Cleavage and polyadenylation specific factor 3-like [Homo sapiens]
gi|60650138|tpg|DAA05669.1| TPA_exp: beta-lactamase fold protein family member RC-68 [Homo
sapiens]
gi|78100161|tpg|DAA05728.1| TPA_exp: integrator complex subunit 11 [Homo sapiens]
gi|119576636|gb|EAW56232.1| cleavage and polyadenylation specific factor 3-like, isoform CRA_a
[Homo sapiens]
gi|119576638|gb|EAW56234.1| cleavage and polyadenylation specific factor 3-like, isoform CRA_a
[Homo sapiens]
Length = 600
Score = 46.2 bits (108), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K A P P VV + + G S +F
Sbjct: 283 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 338
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
+W + +N +I+ GT+ + L G R L ++ ++ + ++
Sbjct: 339 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 381
>gi|12053137|emb|CAB66747.1| hypothetical protein [Homo sapiens]
gi|49065540|emb|CAG38588.1| FLJ20542 [Homo sapiens]
gi|117645260|emb|CAL38096.1| hypothetical protein [synthetic construct]
gi|208966056|dbj|BAG73042.1| cleavage and polyadenylation specific factor 3-like [synthetic
construct]
Length = 600
Score = 46.2 bits (108), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K A P P VV + + G S +F
Sbjct: 283 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 338
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
+W + +N +I+ GT+ + L G R L ++ ++ + ++
Sbjct: 339 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 381
>gi|354495797|ref|XP_003510015.1| PREDICTED: integrator complex subunit 11-like [Cricetulus griseus]
gi|344251677|gb|EGW07781.1| Integrator complex subunit 11 [Cricetulus griseus]
Length = 600
Score = 46.2 bits (108), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K A P P VV + + G S +F
Sbjct: 283 ITWTNQKIRKTF--VQRNMFEFKHIKAFDR--TFADNPGPMVVFATPGMLHAGQSLQIFR 338
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
+W + +N +I+ GT+ + L G R L ++ ++ + ++
Sbjct: 339 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 381
>gi|224079882|ref|XP_002197797.1| PREDICTED: integrator complex subunit 11 [Taeniopygia guttata]
Length = 600
Score = 46.2 bits (108), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 47/100 (47%), Gaps = 6/100 (6%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K A P P VV + + G S +F
Sbjct: 283 ITWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 338
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKK 133
+W + +N +I+ GT+ + L G R L ++ ++
Sbjct: 339 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQ 376
>gi|380798915|gb|AFE71333.1| integrator complex subunit 11 isoform 2, partial [Macaca mulatta]
Length = 588
Score = 46.2 bits (108), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K A P P VV + + G S +F
Sbjct: 271 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 326
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
+W + +N +I+ GT+ + L G R L ++ ++ + ++
Sbjct: 327 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 369
>gi|34783058|gb|AAH00675.2| CPSF3L protein, partial [Homo sapiens]
Length = 473
Score = 46.2 bits (108), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K A P P VV + + G S +F
Sbjct: 156 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 211
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
+W + +N +I+ GT+ + L G R L ++ ++ + ++
Sbjct: 212 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 254
>gi|426327390|ref|XP_004024501.1| PREDICTED: integrator complex subunit 11 isoform 1 [Gorilla gorilla
gorilla]
Length = 600
Score = 46.2 bits (108), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K A P P VV + + G S +F
Sbjct: 283 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 338
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
+W + +N +I+ GT+ + L G R L ++ ++ + ++
Sbjct: 339 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 381
>gi|14165488|gb|AAH08041.1| CPSF3L protein [Homo sapiens]
gi|18044933|gb|AAH20199.1| CPSF3L protein [Homo sapiens]
Length = 389
Score = 46.2 bits (108), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K A P P VV + + G S +F
Sbjct: 163 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 218
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
+W + +N +I+ GT+ + L G R L ++ ++ + ++
Sbjct: 219 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 261
>gi|402852595|ref|XP_003891003.1| PREDICTED: integrator complex subunit 11 isoform 2 [Papio anubis]
Length = 499
Score = 46.2 bits (108), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K A P P VV + + G S +F
Sbjct: 182 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 237
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
+W + +N +I+ GT+ + L G R L ++ ++ + ++
Sbjct: 238 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 280
>gi|207079923|ref|NP_001128922.1| DKFZP459J1110 protein [Pongo abelii]
gi|56403907|emb|CAI29738.1| hypothetical protein [Pongo abelii]
Length = 600
Score = 46.2 bits (108), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K A P P VV + + G S +F
Sbjct: 283 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 338
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
+W + +N +I+ GT+ + L G R L ++ ++ + ++
Sbjct: 339 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 381
>gi|403418874|emb|CCM05574.1| predicted protein [Fibroporia radiculosa]
Length = 826
Score = 46.2 bits (108), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 31/120 (25%), Positives = 55/120 (45%), Gaps = 23/120 (19%)
Query: 53 FHFKHVKLC---HSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRT 109
F+H++ +L PK++L + G SR LF ++ +P+N +++T R
Sbjct: 277 LRFRHLEFFPNPQALLHTYSSKDPKLILAVPASLSHGPSRVLFTEFAETPDNVVLLTGRG 336
Query: 110 SPGTLARDLIE----------------LGGN----RTLTLQVKKRIRLEGEELEEYQKKK 149
GTL R L + +G N L L++ ++ L+G ELEE+ +K+
Sbjct: 337 EEGTLGRILFDRWNNAQRDDVKWDRGRIGSNVMLDGILHLEIHSKVPLQGTELEEFLQKQ 396
>gi|403297740|ref|XP_003939710.1| PREDICTED: integrator complex subunit 11 isoform 2 [Saimiri
boliviensis boliviensis]
Length = 499
Score = 46.2 bits (108), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K A P P VV + + G S +F
Sbjct: 182 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 237
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
+W + +N +I+ GT+ + L G R L ++ ++ + ++
Sbjct: 238 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 280
>gi|397476280|ref|XP_003809535.1| PREDICTED: integrator complex subunit 11 isoform 3 [Pan paniscus]
Length = 499
Score = 46.2 bits (108), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K A P P VV + + G S +F
Sbjct: 182 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 237
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
+W + +N +I+ GT+ + L G R L ++ ++ + ++
Sbjct: 238 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 280
>gi|118572558|sp|Q5NVE6.2|INT11_PONAB RecName: Full=Integrator complex subunit 11; Short=Int11; AltName:
Full=Cleavage and polyadenylation-specific factor 3-like
protein; Short=CPSF3-like protein
Length = 600
Score = 46.2 bits (108), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K A P P VV + + G S +F
Sbjct: 283 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 338
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
+W + +N +I+ GT+ + L G R L ++ ++ + ++
Sbjct: 339 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 381
>gi|389638668|ref|XP_003716967.1| hypothetical protein MGG_06570 [Magnaporthe oryzae 70-15]
gi|351642786|gb|EHA50648.1| hypothetical protein MGG_06570 [Magnaporthe oryzae 70-15]
gi|440474177|gb|ELQ42934.1| cleavage and polyadenylation specificity factor subunit 2
[Magnaporthe oryzae Y34]
gi|440484966|gb|ELQ64966.1| cleavage and polyadenylation specificity factor subunit 2
[Magnaporthe oryzae P131]
Length = 962
Score = 46.2 bits (108), Expect = 0.014, Method: Composition-based stats.
Identities = 42/180 (23%), Positives = 72/180 (40%), Gaps = 59/180 (32%)
Query: 26 HSLAELAK--IEWMSDKLMKSFE--------------------GARNNPFHFKHVKLCHS 63
HS +LAK EWM + +++ FE G PF FK+++L
Sbjct: 320 HSTIKLAKSMFEWMDNSIVQEFEAGADQGFRRTNGAGGNADAKGKDGGPFDFKYLRLLDR 379
Query: 64 LAELAKVPSP-------KVVLVSTPDMECGFSRDLFFQWCSSPENSIII------TNRTS 110
A++ K+ P KV+L + +E GFS+D+ + N +I+ ++R +
Sbjct: 380 KAQVLKLLEPSTDELRGKVILATDTSLEWGFSKDIISAIANDSRNMVILPEKPAESSRDN 439
Query: 111 PGTLARDLIE-----------------------LGGNRTLTLQVKKRIRLEGEELEEYQK 147
P +++R L G R L ++ K++ L EL YQ+
Sbjct: 440 P-SISRQLWRWWKERRDGVADEQSSGAGSAEQVFAGGRELQIRESKKVPLADSELSIYQQ 498
>gi|10433243|dbj|BAB13943.1| unnamed protein product [Homo sapiens]
Length = 499
Score = 46.2 bits (108), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K A P P VV + + G S +F
Sbjct: 182 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 237
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
+W + +N +I+ GT+ + L G R L ++ ++ + ++
Sbjct: 238 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 280
>gi|397476276|ref|XP_003809533.1| PREDICTED: integrator complex subunit 11 isoform 1 [Pan paniscus]
gi|410206788|gb|JAA00613.1| cleavage and polyadenylation specific factor 3-like [Pan
troglodytes]
gi|410251172|gb|JAA13553.1| cleavage and polyadenylation specific factor 3-like [Pan
troglodytes]
gi|410297680|gb|JAA27440.1| cleavage and polyadenylation specific factor 3-like [Pan
troglodytes]
gi|410349815|gb|JAA41511.1| cleavage and polyadenylation specific factor 3-like [Pan
troglodytes]
Length = 600
Score = 46.2 bits (108), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K A P P VV + + G S +F
Sbjct: 283 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 338
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
+W + +N +I+ GT+ + L G R L ++ ++ + ++
Sbjct: 339 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 381
>gi|410989914|ref|XP_004001198.1| PREDICTED: LOW QUALITY PROTEIN: integrator complex subunit 11
[Felis catus]
Length = 598
Score = 46.2 bits (108), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K A P P VV + + G S +F
Sbjct: 283 ITWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 338
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
+W + +N +I+ GT+ + L G R L ++ ++ + ++
Sbjct: 339 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 381
>gi|343958314|dbj|BAK63012.1| protein related to CPSF subunits 68 kDa [Pan troglodytes]
Length = 600
Score = 46.2 bits (108), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K A P P VV + + G S +F
Sbjct: 283 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 338
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
+W + +N +I+ GT+ + L G R L ++ ++ + ++
Sbjct: 339 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 381
>gi|66475248|ref|XP_627440.1| inactive CPSFs Cft2p metallobeta-lactamase [Cryptosporidium parvum
Iowa II]
gi|46228911|gb|EAK89760.1| inactive CPSFs Cft2p metallobeta-lactamase [Cryptosporidium parvum
Iowa II]
Length = 1196
Score = 46.2 bits (108), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 30/88 (34%), Positives = 46/88 (52%), Gaps = 8/88 (9%)
Query: 34 IEWMSDKLMKSFEGARNNPFH-FKHVKLCHSLAE-----LAKVPSPKVVLVSTPDMECGF 87
IEWMS ++ F +R NPFH K++ L +L L+KVP KV+ M+ G+
Sbjct: 335 IEWMSLEIRSEFCDSRFNPFHDLKNIILETNLKNIRSENLSKVP--KVIFAFPESMDYGY 392
Query: 88 SRDLFFQWCSSPENSIIITNRTSPGTLA 115
SR+LF + ++ N+I+ T A
Sbjct: 393 SRELFTELATNENNTIMFVREPKENTFA 420
>gi|297279172|ref|XP_001092173.2| PREDICTED: integrator complex subunit 11 isoform 3 [Macaca mulatta]
Length = 579
Score = 46.2 bits (108), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K A P P VV + + G S +F
Sbjct: 262 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 317
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
+W + +N +I+ GT+ + L G R L ++ ++ + ++
Sbjct: 318 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 360
>gi|119576648|gb|EAW56244.1| cleavage and polyadenylation specific factor 3-like, isoform CRA_j
[Homo sapiens]
Length = 476
Score = 46.2 bits (108), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K A P P VV + + G S +F
Sbjct: 159 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 214
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
+W + +N +I+ GT+ + L G R L ++ ++ + ++
Sbjct: 215 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 257
>gi|393215649|gb|EJD01140.1| cleavage and polyadenylation specificity factor subunit
[Fomitiporia mediterranea MF3/22]
Length = 922
Score = 46.2 bits (108), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 30/93 (32%), Positives = 47/93 (50%), Gaps = 20/93 (21%)
Query: 73 PKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIE------------ 120
PK++L + G SR +F ++ S +N +++T+ GTLAR L +
Sbjct: 397 PKLILAVPVSLSHGSSRSIFSEFASVADNVVLLTSPGEDGTLARTLFDMWNDEQREDDKW 456
Query: 121 ----LGGN----RTLTLQVKKRIRLEGEELEEY 145
LG N +TL L +K ++ L+G ELEEY
Sbjct: 457 NKGKLGRNVMLDKTLKLTMKSKVPLQGVELEEY 489
>gi|405958713|gb|EKC24813.1| Integrator complex subunit 11 [Crassostrea gigas]
Length = 575
Score = 46.2 bits (108), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 30/100 (30%), Positives = 45/100 (45%), Gaps = 6/100 (6%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W S K+ K+F + N F FKH+K P P VV + + G S +F
Sbjct: 256 ITWTSQKIKKTF--VQRNMFEFKHIKPFDR--AFIDNPGPMVVFATPGMLHAGLSLQIFK 311
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKK 133
+W + N +I+ GT+ + L G R L L+ K+
Sbjct: 312 KWAPNELNMVIMPGYCVAGTVGHKI--LNGARKLELENKQ 349
>gi|347838796|emb|CCD53368.1| similar to cleavage and polyadenylation specificity factor subunit
2 [Botryotinia fuckeliana]
Length = 934
Score = 46.2 bits (108), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 42/169 (24%), Positives = 67/169 (39%), Gaps = 56/169 (33%)
Query: 35 EWMSDKLMKSFEGA------------RN--------NPFHFKHVKLCHSLAELAKV---- 70
EWM + +++ FE RN PF FKH++L ++ ++
Sbjct: 309 EWMDEAIIREFEAQPGHEEQRTGQQRRNAEEAKQHIGPFEFKHLRLLGRKGQIDRMLNET 368
Query: 71 -----PSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGT----LARDLIE- 120
KV+L S +E GFS+++ + +N +I+T R +P + L R L
Sbjct: 369 DNLGRSVGKVILASDTSIEWGFSKEVLCKIADDDKNLLILTERLNPISGAPGLGRTLWSW 428
Query: 121 ----------------------LGGNRTLTLQVKKRIRLEGEELEEYQK 147
GG R L ++ KRI LEG +L YQ+
Sbjct: 429 WEERRDGVISEPSSNGGVLEQVYGGGRDLEIKEPKRIPLEGNDLTVYQQ 477
>gi|340509014|gb|EGR34593.1| hypothetical protein IMG5_006210 [Ichthyophthirius multifiliis]
Length = 456
Score = 46.2 bits (108), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 27/87 (31%), Positives = 44/87 (50%), Gaps = 6/87 (6%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLF 92
+ W ++K+ + +N F+FKH+K A +P +VL +TP M G S +F
Sbjct: 307 VNWTNEKIKSCY--LIDNMFNFKHIKPFQKSLIKANMP---MVLFATPGMLHAGLSMQVF 361
Query: 93 FQWCSSPENSIIITNRTSPGTLARDLI 119
+WC +N++II GTL L+
Sbjct: 362 KEWCYDSKNTLIIPGYCVAGTLGNKLL 388
>gi|374253828|ref|NP_001243392.1| integrator complex subunit 11 isoform 5 [Homo sapiens]
gi|119576639|gb|EAW56235.1| cleavage and polyadenylation specific factor 3-like, isoform CRA_c
[Homo sapiens]
gi|119576644|gb|EAW56240.1| cleavage and polyadenylation specific factor 3-like, isoform CRA_c
[Homo sapiens]
Length = 499
Score = 46.2 bits (108), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K A P P VV + + G S +F
Sbjct: 182 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 237
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
+W + +N +I+ GT+ + L G R L ++ ++ + ++
Sbjct: 238 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 280
>gi|426327396|ref|XP_004024504.1| PREDICTED: integrator complex subunit 11 isoform 4 [Gorilla gorilla
gorilla]
Length = 499
Score = 46.2 bits (108), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K A P P VV + + G S +F
Sbjct: 182 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 237
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
+W + +N +I+ GT+ + L G R L ++ ++ + ++
Sbjct: 238 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 280
>gi|406866779|gb|EKD19818.1| metallo-beta-lactamase superfamily protein [Marssonina brunnea f.
sp. 'multigermtubi' MB_m1]
Length = 823
Score = 46.2 bits (108), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 24/87 (27%), Positives = 49/87 (56%), Gaps = 3/87 (3%)
Query: 35 EWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFF 93
E M++ S A+ P+ FK+++ +L V + V++++P M + G SR+L
Sbjct: 317 ERMAEAEASSDTAAKGGPWDFKYIRSLKNLDRFDDVG--RCVMLASPGMLQNGVSRELLE 374
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIE 120
+W S +N ++IT + GT+A+ +++
Sbjct: 375 RWAPSEKNGVVITGYSVEGTMAKQIMQ 401
>gi|154292337|ref|XP_001546744.1| hypothetical protein BC1G_14624 [Botryotinia fuckeliana B05.10]
Length = 901
Score = 46.2 bits (108), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 42/169 (24%), Positives = 67/169 (39%), Gaps = 56/169 (33%)
Query: 35 EWMSDKLMKSFEGA------------RN--------NPFHFKHVKLCHSLAELAKV---- 70
EWM + +++ FE RN PF FKH++L ++ ++
Sbjct: 309 EWMDEAIIREFEAQPGHEEQRTGQQRRNAEEAKQHIGPFEFKHLRLLGRKGQIDRMLNET 368
Query: 71 -----PSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGT----LARDLIE- 120
KV+L S +E GFS+++ + +N +I+T R +P + L R L
Sbjct: 369 DNLGRSVGKVILASDTSIEWGFSKEVLCKIADDDKNLLILTERLNPISGAPGLGRTLWSW 428
Query: 121 ----------------------LGGNRTLTLQVKKRIRLEGEELEEYQK 147
GG R L ++ KRI LEG +L YQ+
Sbjct: 429 WEERRDGVISEPSSNGGVLEQVYGGGRDLEIKEPKRIPLEGNDLTVYQQ 477
>gi|119576641|gb|EAW56237.1| cleavage and polyadenylation specific factor 3-like, isoform CRA_e
[Homo sapiens]
Length = 578
Score = 46.2 bits (108), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K A P P VV + + G S +F
Sbjct: 261 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 316
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
+W + +N +I+ GT+ + L G R L ++ ++ + ++
Sbjct: 317 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 359
>gi|449435478|ref|XP_004135522.1| PREDICTED: LOW QUALITY PROTEIN: cleavage and polyadenylation
specificity factor subunit 3-I-like [Cucumis sativus]
Length = 481
Score = 46.2 bits (108), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 29/92 (31%), Positives = 48/92 (52%), Gaps = 8/92 (8%)
Query: 14 NNPFHFKH--VKLCHSLAELAKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVP 71
N P ++ K C ++ E + M+D++ + A++NPF FK++ S+ E+ K
Sbjct: 278 NIPIYYASPLAKRCLTVYETYTLS-MNDRI----QNAKSNPFRFKYISPLKSI-EVFKDV 331
Query: 72 SPKVVLVSTPDMECGFSRDLFFQWCSSPENSI 103
P VV+ S ++ G SR LF WCS S+
Sbjct: 332 GPSVVMASPSGLQSGLSRQLFEMWCSEKHVSL 363
>gi|322699261|gb|EFY91024.1| cleavage and polyadenylation specifity factor [Metarhizium acridum
CQMa 102]
Length = 829
Score = 46.2 bits (108), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 25/79 (31%), Positives = 47/79 (59%), Gaps = 4/79 (5%)
Query: 44 SFEGA-RNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFFQWCSSPEN 101
S +GA + P+ FK+++ +L V V++++P M + G SR+LF +W + +N
Sbjct: 326 SGDGAGQGGPWDFKYIRSLKNLDRFDDVGG--CVMLASPGMLQSGVSRELFERWAPNEKN 383
Query: 102 SIIITNRTSPGTLARDLIE 120
+IIT + GT+AR +++
Sbjct: 384 GVIITGYSVEGTMARQIMQ 402
>gi|441671688|ref|XP_004093259.1| PREDICTED: LOW QUALITY PROTEIN: integrator complex subunit 11
[Nomascus leucogenys]
Length = 585
Score = 46.2 bits (108), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K A P P VV + + G S +F
Sbjct: 283 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 338
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
+W + +N +I+ GT+ + L G R L ++ ++ + ++
Sbjct: 339 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 381
>gi|331212217|ref|XP_003307378.1| hypothetical protein PGTG_00328 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|309297781|gb|EFP74372.1| hypothetical protein PGTG_00328 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 950
Score = 46.2 bits (108), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 31/91 (34%), Positives = 46/91 (50%), Gaps = 14/91 (15%)
Query: 35 EWMSDKLMKSF-------EGARN----NPFHFKHVKLCHSLAEL-AKVP--SPKVVLVST 80
EWMS L +S G R+ P +H++ +S+ L A++P PKV+L
Sbjct: 356 EWMSPALARSSFDQNHHKRGNRDQNDQGPLRLRHIRFFNSVEALEAELPIRQPKVILAVP 415
Query: 81 PDMECGFSRDLFFQWCSSPENSIIITNRTSP 111
ME GFSR +F + N II+T+ +SP
Sbjct: 416 LSMEYGFSRAMFTRIAGVEGNLIILTSLSSP 446
>gi|291238246|ref|XP_002739041.1| PREDICTED: cleavage and polyadenylation specific factor 3-like
[Saccoglossus kowalevskii]
Length = 573
Score = 45.8 bits (107), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 37/159 (23%), Positives = 71/159 (44%), Gaps = 9/159 (5%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F F+H+K P P VV + + G S +F
Sbjct: 256 ITWTNQKIRKTF--VQRNMFEFRHIKPFDR--SYTDNPGPMVVFATPGMLHGGLSLHVFK 311
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE-GEELEEYQKKKDKE 152
+W S+ +N +I+ GT+ + L G R + L+ ++ I ++ + + D +
Sbjct: 312 KWASNEKNMVIMPGYCVAGTVGHKI--LNGQRKIELENRQTIDVKLSVQYMSFSAHADAK 369
Query: 153 AKDKQEKEKIPPHDTSFINELQLSDF--KQTLQRNGIDC 189
+ K+ P + E + DF ++ +Q+ G+ C
Sbjct: 370 GIMQLIKQCEPKNVMLVHGEAKKMDFLKQKIVQQFGVQC 408
>gi|39645207|gb|AAH13904.2| CPSF3L protein, partial [Homo sapiens]
Length = 429
Score = 45.8 bits (107), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K A P P VV + + G S +F
Sbjct: 112 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 167
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
+W + +N +I+ GT+ + L G R L ++ ++ + ++
Sbjct: 168 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 210
>gi|351697497|gb|EHB00416.1| Integrator complex subunit 11 [Heterocephalus glaber]
Length = 672
Score = 45.8 bits (107), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K A P P VV + + G S +F
Sbjct: 356 ITWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 411
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
+W + +N +I+ GT+ + L G R L ++ ++ + ++
Sbjct: 412 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 454
>gi|119576643|gb|EAW56239.1| cleavage and polyadenylation specific factor 3-like, isoform CRA_g
[Homo sapiens]
gi|119576646|gb|EAW56242.1| cleavage and polyadenylation specific factor 3-like, isoform CRA_g
[Homo sapiens]
Length = 342
Score = 45.8 bits (107), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K A P P VV + + G S +F
Sbjct: 25 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 80
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
+W + +N +I+ GT+ + L G R L ++ ++ + ++
Sbjct: 81 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 123
>gi|449296201|gb|EMC92221.1| hypothetical protein BAUCODRAFT_569527 [Baudoinia compniacensis
UAMH 10762]
Length = 834
Score = 45.8 bits (107), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 26/93 (27%), Positives = 49/93 (52%), Gaps = 9/93 (9%)
Query: 34 IEWMSDKLMKSFEGARNNP------FHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECG 86
I+ M++ + F+ A+ NP + F+H++ SL V V++++P M + G
Sbjct: 310 IDAMNENIRTKFQAAQANPDGVGGPWDFQHIRSLKSLERFDDVGG--CVMLASPGMLQNG 367
Query: 87 FSRDLFFQWCSSPENSIIITNRTSPGTLARDLI 119
SR L +W +N +IIT + GT+A+ ++
Sbjct: 368 VSRSLLERWAPDAKNGVIITGYSVEGTMAKSIM 400
>gi|397639513|gb|EJK73612.1| hypothetical protein THAOC_04754 [Thalassiosira oceanica]
Length = 454
Score = 45.8 bits (107), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 26/91 (28%), Positives = 44/91 (48%), Gaps = 10/91 (10%)
Query: 50 NNPFHFKHVKLCHSL-AELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 108
NPF F+H++ S+ P VV S ++ G SR LF +W + P+N ++I
Sbjct: 307 GNPFSFRHIRNLKSIDVNNFDDRGPSVVFASPGMLQSGVSRQLFDRWATDPKNGVLIAGY 366
Query: 109 TSPGTLARDLIELGGNRTLTLQVKKRIRLEG 139
TLA++++ Q K+ + +EG
Sbjct: 367 AVEHTLAKEIMS---------QPKEVVTMEG 388
>gi|209876680|ref|XP_002139782.1| cleavage and polyadenylation specificity factor subunit 3
[Cryptosporidium muris RN66]
gi|209555388|gb|EEA05433.1| cleavage and polyadenylation specificity factor subunit 3, putative
[Cryptosporidium muris RN66]
Length = 767
Score = 45.8 bits (107), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 33/109 (30%), Positives = 50/109 (45%), Gaps = 23/109 (21%)
Query: 37 MSDKLMKSFEGARN--------------NPFHFKHVKLCHSLAELAKV---PSPKVVLVS 79
MS K M+ FE N NPF+F+ VK +S++E+ P V++ +
Sbjct: 300 MSVKCMRVFETYINQCGDAVRKQADMGINPFNFQFVKTVNSISEIKDAIYSEGPCVIMAA 359
Query: 80 TPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLAR------DLIELG 122
++ G SRD+F W N +I+T GT A D+I+LG
Sbjct: 360 PGMLQNGTSRDIFEVWAPDKRNGVILTGYAIRGTPAYELRREPDIIQLG 408
>gi|410928941|ref|XP_003977858.1| PREDICTED: integrator complex subunit 11-like [Takifugu rubripes]
Length = 601
Score = 45.8 bits (107), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 28/97 (28%), Positives = 45/97 (46%), Gaps = 6/97 (6%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K A P P VV + + G S +F
Sbjct: 283 ITWTNQKIRKTF--VQRNMFEFKHIKAFDR--SYADNPGPMVVFATPGMLHAGQSLQIFK 338
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQ 130
+W + +N +I+ GT+ + L G R L ++
Sbjct: 339 KWAGNEKNMVIMPGYCVQGTIGHKI--LNGQRKLEME 373
>gi|432866809|ref|XP_004070946.1| PREDICTED: integrator complex subunit 11-like [Oryzias latipes]
Length = 599
Score = 45.8 bits (107), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 28/97 (28%), Positives = 45/97 (46%), Gaps = 6/97 (6%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K A P P VV + + G S +F
Sbjct: 283 ITWTNQKIRKTF--VQRNMFEFKHIKAFDR--SYADNPGPMVVFATPGMLHAGQSLQIFK 338
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQ 130
+W + +N +I+ GT+ + L G R L ++
Sbjct: 339 KWAGNEKNMVIMPGYCVQGTIGHKI--LNGQRKLEME 373
>gi|374253826|ref|NP_001243391.1| integrator complex subunit 11 isoform 4 [Homo sapiens]
Length = 502
Score = 45.4 bits (106), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K A P P VV + + G S +F
Sbjct: 185 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 240
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
+W + +N +I+ GT+ + L G R L ++ ++ + ++
Sbjct: 241 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 283
>gi|440298403|gb|ELP91039.1| Cleavage and polyadenylation specificity factor subunit, putative
[Entamoeba invadens IP1]
Length = 788
Score = 45.4 bits (106), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 23/72 (31%), Positives = 41/72 (56%), Gaps = 1/72 (1%)
Query: 49 RNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFFQWCSSPENSIIITN 107
+ F+FK ++ S + + + + V++++P M + GFSR LF +WC+ N +II
Sbjct: 304 KRQAFNFKFIRDGSSSVDDSTIDNHPCVVLASPGMLQDGFSRTLFERWCTDKNNGVIIPG 363
Query: 108 RTSPGTLARDLI 119
GTLA+ +I
Sbjct: 364 YCVEGTLAKQII 375
>gi|426327398|ref|XP_004024505.1| PREDICTED: integrator complex subunit 11 isoform 5 [Gorilla gorilla
gorilla]
Length = 502
Score = 45.4 bits (106), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K A P P VV + + G S +F
Sbjct: 185 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 240
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
+W + +N +I+ GT+ + L G R L ++ ++ + ++
Sbjct: 241 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 283
>gi|119576647|gb|EAW56243.1| cleavage and polyadenylation specific factor 3-like, isoform CRA_i
[Homo sapiens]
Length = 502
Score = 45.4 bits (106), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K A P P VV + + G S +F
Sbjct: 185 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 240
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
+W + +N +I+ GT+ + L G R L ++ ++ + ++
Sbjct: 241 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 283
>gi|348503157|ref|XP_003439132.1| PREDICTED: integrator complex subunit 11-like [Oreochromis
niloticus]
Length = 601
Score = 45.4 bits (106), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 28/97 (28%), Positives = 45/97 (46%), Gaps = 6/97 (6%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K A P P VV + + G S +F
Sbjct: 283 ITWTNQKIRKTF--VQRNMFEFKHIKAFDR--SYADNPGPMVVFATPGMLHAGQSLQIFK 338
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQ 130
+W + +N +I+ GT+ + L G R L ++
Sbjct: 339 KWAGNEKNMVIMPGYCVQGTIGHKI--LNGQRKLEME 373
>gi|30677952|ref|NP_178282.2| cleavage and polyadenylation specificity factor subunit 3-II
[Arabidopsis thaliana]
gi|332278175|sp|Q8GUU3.2|CPS3B_ARATH RecName: Full=Cleavage and polyadenylation specificity factor
subunit 3-II; AltName: Full=Cleavage and polyadenylation
specificity factor 73 kDa subunit II; Short=AtCPSF73-II;
Short=CPSF 73 kDa subunit II; AltName: Full=Protein
EMBRYO SAC DEVELOPMENT ARREST 26
gi|62320470|dbj|BAD94982.1| putative cleavage and polyadenylation specifity factor [Arabidopsis
thaliana]
gi|330250395|gb|AEC05489.1| cleavage and polyadenylation specificity factor subunit 3-II
[Arabidopsis thaliana]
Length = 613
Score = 45.4 bits (106), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 30/87 (34%), Positives = 42/87 (48%), Gaps = 6/87 (6%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMEC-GFSRDLF 92
I W S + + +NPF FK+VK L P P VL +TP M C GFS ++F
Sbjct: 282 ISWTSQNVKEKHN--THNPFDFKNVKDFDR--SLIHAPGP-CVLFATPGMLCAGFSLEVF 336
Query: 93 FQWCSSPENSIIITNRTSPGTLARDLI 119
W SP N + + + GT+ L+
Sbjct: 337 KHWAPSPLNLVALPGYSVAGTVGHKLM 363
>gi|260790823|ref|XP_002590440.1| hypothetical protein BRAFLDRAFT_289082 [Branchiostoma floridae]
gi|229275634|gb|EEN46451.1| hypothetical protein BRAFLDRAFT_289082 [Branchiostoma floridae]
Length = 597
Score = 45.4 bits (106), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 41/162 (25%), Positives = 67/162 (41%), Gaps = 11/162 (6%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K P P VV + + G S +F
Sbjct: 276 ITWTNQKIRKTF--VKRNMFEFKHIKAFDR--SYIDNPGPMVVFATPGMLHAGLSLQIFK 331
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKK--RIRLEGEELEEYQKKKDK 151
+W +N +I+ GT+ + L G R + + KK +R+ E + + D
Sbjct: 332 KWAPDSKNMVIMPGYCVAGTVGHKI--LNGIRKIEFENKKVLEVRMTVEYM-SFSAHADA 388
Query: 152 EAKDKQEKEKIPPHDTSFINELQLSDF--KQTLQRNGIDCEF 191
+ + + P + E Q DF K+ Q G+ C F
Sbjct: 389 QGIMQLIRYCEPRNVMLVHGEEQKMDFLSKKITQEFGVQCFF 430
>gi|297814408|ref|XP_002875087.1| hypothetical protein ARALYDRAFT_322516 [Arabidopsis lyrata subsp.
lyrata]
gi|297320925|gb|EFH51346.1| hypothetical protein ARALYDRAFT_322516 [Arabidopsis lyrata subsp.
lyrata]
Length = 819
Score = 45.4 bits (106), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 30/87 (34%), Positives = 42/87 (48%), Gaps = 6/87 (6%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMEC-GFSRDLF 92
I W S + + +NPF FK+VK L P P VL +TP M C GFS ++F
Sbjct: 282 ISWTSQNVKEKHN--THNPFDFKNVKDFDR--SLIHAPGP-CVLFATPGMLCAGFSLEVF 336
Query: 93 FQWCSSPENSIIITNRTSPGTLARDLI 119
W SP N + + + GT+ L+
Sbjct: 337 KHWAPSPLNLVALPGYSVAGTVGHKLM 363
>gi|157876175|ref|XP_001686447.1| putative cleavage and polyadenylation specificity factor
[Leishmania major strain Friedlin]
gi|68129521|emb|CAJ08064.1| putative cleavage and polyadenylation specificity factor
[Leishmania major strain Friedlin]
Length = 756
Score = 45.4 bits (106), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 28/88 (31%), Positives = 46/88 (52%), Gaps = 6/88 (6%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVP--SPKVVLVSTPDMECGFSRDL 91
+ M+D++ K +NPF FK+++ SL + P VVL S ++ G S +L
Sbjct: 298 VSAMNDRV-KQQHANHHNPFVFKYIR---SLMDTKSFEDNGPCVVLASPGMLQSGISLEL 353
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLI 119
F +WC N II+ GT+A+D++
Sbjct: 354 FERWCGDRRNGIIMAGYCVDGTIAKDVL 381
>gi|4220489|gb|AAD12712.1| putative cleavage and polyadenylation specifity factor [Arabidopsis
thaliana]
Length = 837
Score = 45.4 bits (106), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 30/87 (34%), Positives = 42/87 (48%), Gaps = 6/87 (6%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMEC-GFSRDLF 92
I W S + + +NPF FK+VK L P P VL +TP M C GFS ++F
Sbjct: 282 ISWTSQNVKEKHN--THNPFDFKNVKDFDR--SLIHAPGP-CVLFATPGMLCAGFSLEVF 336
Query: 93 FQWCSSPENSIIITNRTSPGTLARDLI 119
W SP N + + + GT+ L+
Sbjct: 337 KHWAPSPLNLVALPGYSVAGTVGHKLM 363
>gi|308509314|ref|XP_003116840.1| hypothetical protein CRE_01624 [Caenorhabditis remanei]
gi|308241754|gb|EFO85706.1| hypothetical protein CRE_01624 [Caenorhabditis remanei]
Length = 612
Score = 45.4 bits (106), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 30/87 (34%), Positives = 42/87 (48%), Gaps = 6/87 (6%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLF 92
I W ++ + K+F N F FKH++ E P P+V L STP M G S +F
Sbjct: 288 ISWTNENIKKTF--VERNMFEFKHIRPMEKGCE--DQPGPQV-LFSTPGMLHGGQSLKVF 342
Query: 93 FQWCSSPENSIIITNRTSPGTLARDLI 119
+WC P N II+ GT+ +I
Sbjct: 343 KKWCGDPLNMIIMPGYCVAGTVGARVI 369
>gi|260942135|ref|XP_002615366.1| hypothetical protein CLUG_04248 [Clavispora lusitaniae ATCC 42720]
gi|238850656|gb|EEQ40120.1| hypothetical protein CLUG_04248 [Clavispora lusitaniae ATCC 42720]
Length = 940
Score = 45.1 bits (105), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 57/124 (45%), Gaps = 13/124 (10%)
Query: 3 DKLMKSFEGARNNPFHFKHVKLCHSLAELAKI-EWMSDKLMKSFEGARNN-------PFH 54
D+ + + +GA P +F L+ A + +WMS +L+K +EG PF
Sbjct: 258 DEHLANLQGAAI-PVYFLSYSGTKVLSYAANLLDWMSSQLIKEYEGIAAEDRAYSRVPFE 316
Query: 55 FKHVKLCHSLAELAKVPSPKVVLVSTPDMECG-FSRDLFFQWCSSPENSIIITNRTSPGT 113
V L + EL ++P PK+V S D + G S C + +II+T ++S
Sbjct: 317 PSKVDLLSNPQELIQLPGPKIVFASGIDFKDGDMSTQALQLLCQDEKTTIILTEKSS--- 373
Query: 114 LARD 117
ARD
Sbjct: 374 FARD 377
>gi|240280758|gb|EER44262.1| cleavage and polyadenylation specificity factor subunit 2
[Ajellomyces capsulatus H143]
Length = 1010
Score = 45.1 bits (105), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 29/103 (28%), Positives = 51/103 (49%), Gaps = 13/103 (12%)
Query: 51 NPFHFKHVKLCHSLAELAKV---PSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITN 107
PF FKH+K+ A+L K+ +PKV+L S ++ G+S+ + + S EN +I+T
Sbjct: 401 GPFTFKHLKIVERKAKLEKILGSNTPKVILTSDTSLDWGYSKHVLQKIASGSENLVILTE 460
Query: 108 RTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKD 150
S + N+ + ++ R L E Y+++KD
Sbjct: 461 SFS----------VSPNKQMVDGIRSRPSLAHEIWTIYEERKD 493
>gi|198413502|ref|XP_002128796.1| PREDICTED: similar to cleavage and polyadenylation specific factor
3-like [Ciona intestinalis]
Length = 605
Score = 45.1 bits (105), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ +F N F FKH+K + P P VV + + G S ++F
Sbjct: 284 ITWTNQKIKDTF--VERNMFDFKHIKEFNR--SYIDNPGPMVVFATPGMLHGGLSLEIFK 339
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
+WC++ +N II+ GT+ + L G R + L+ + + ++
Sbjct: 340 RWCTNEKNMIIMPGYCVAGTVGNKI--LSGMRKIELKPGQVVEIK 382
>gi|124809291|ref|XP_001348538.1| cleavage and polyadenylation specificity factor protein, putative
[Plasmodium falciparum 3D7]
gi|23497434|gb|AAN36977.1| cleavage and polyadenylation specificity factor protein, putative
[Plasmodium falciparum 3D7]
Length = 876
Score = 45.1 bits (105), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 24/72 (33%), Positives = 41/72 (56%), Gaps = 4/72 (5%)
Query: 51 NPFHFKHVKLCHSLAELAKV----PSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIIT 106
NPF+FK+VK SL ++ +P V++ S ++ G S+++F S ++ +I+T
Sbjct: 347 NPFNFKYVKYAKSLESISSYLYQDNNPCVIMASPGMLQNGISKNIFNIIASDKKSGVILT 406
Query: 107 NRTSPGTLARDL 118
T GTLA +L
Sbjct: 407 GYTVKGTLADEL 418
>gi|67479721|ref|XP_655242.1| cleavage and polyadenylation specificity factor 73 kDa subunit
[Entamoeba histolytica HM-1:IMSS]
gi|56472366|gb|EAL49856.1| cleavage and polyadenylation specificity factor 73 kDa subunit,
putative [Entamoeba histolytica HM-1:IMSS]
gi|449703858|gb|EMD44220.1| cleavage and polyadenylation specificity factor 73 kDa subunit,
putative [Entamoeba histolytica KU27]
Length = 755
Score = 45.1 bits (105), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 27/85 (31%), Positives = 46/85 (54%), Gaps = 5/85 (5%)
Query: 36 WMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFFQ 94
+M+ +L K + A F FK ++ S + + V++++P M + G SR +F +
Sbjct: 295 FMNQELRKKTKQA----FDFKFIREGSSSVDDGAIDYKPCVVMASPGMLQDGISRKIFER 350
Query: 95 WCSSPENSIIITNRTSPGTLARDLI 119
WC+ +N +II GTLA+DLI
Sbjct: 351 WCTDKKNGVIIPGYCVEGTLAKDLI 375
>gi|167395302|ref|XP_001733549.1| Cleavage and polyadenylation specificity factor subunit [Entamoeba
dispar SAW760]
gi|165894214|gb|EDR22276.1| Cleavage and polyadenylation specificity factor subunit, putative
[Entamoeba dispar SAW760]
Length = 736
Score = 45.1 bits (105), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 23/68 (33%), Positives = 38/68 (55%), Gaps = 1/68 (1%)
Query: 53 FHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFFQWCSSPENSIIITNRTSP 111
F FK ++ S + + V++++P M + G SR +F +WC+ +N +II
Sbjct: 308 FDFKFIREGSSSVDDGAIDYKPCVVMASPGMLQDGISRKIFERWCTDKKNGVIIPGYCVE 367
Query: 112 GTLARDLI 119
GTLA+DLI
Sbjct: 368 GTLAKDLI 375
>gi|170090732|ref|XP_001876588.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164648081|gb|EDR12324.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 901
Score = 45.1 bits (105), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 30/126 (23%), Positives = 63/126 (50%), Gaps = 24/126 (19%)
Query: 55 FKHVKL---CHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSP 111
F+H++ +L + PK++L + G SR++F + + P+N +++T R+
Sbjct: 385 FRHLEFFPNPQALLQTYSSKDPKLILAVPASLSHGPSRNMFSDFAAVPDNVVLLTGRSEE 444
Query: 112 GTLARDLIE----------------LGGN----RTLTLQVKKRIRLEGEELEEYQKKKDK 151
GTL R L + +G N +T+++ ++ L+G ELE + ++++
Sbjct: 445 GTLGRALFDKWNNSQRPDDKWDKGKIGSNVMMDGAITIKMNHKVPLQGAELEAHL-QEER 503
Query: 152 EAKDKQ 157
AK+K+
Sbjct: 504 VAKEKE 509
>gi|58266278|ref|XP_570295.1| cleavage and polyadenylation specificity factor subunit
[Cryptococcus neoformans var. neoformans JEC21]
gi|134111080|ref|XP_775682.1| hypothetical protein CNBD4110 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|50258346|gb|EAL21035.1| hypothetical protein CNBD4110 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|57226528|gb|AAW42988.1| cleavage and polyadenylation specificity factor subunit, putative
[Cryptococcus neoformans var. neoformans JEC21]
Length = 899
Score = 45.1 bits (105), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 39/137 (28%), Positives = 63/137 (45%), Gaps = 27/137 (19%)
Query: 47 GARNNPFHFKHVKLCHSLAELAK---VPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSI 103
G+ F+HV+ + +L + + PK+VL P M G SR LF ++ N I
Sbjct: 362 GSEYGVLDFRHVQFFLNTTDLLQTYPLTRPKLVLAVPPTMSHGPSRFLFTAMANTEGNVI 421
Query: 104 IITNRTSPGTLARDLI------ELGGNR--------------TLTLQVKKRIRLEGEELE 143
++T R+ TLARDL + G++ L ++V ++ L G ELE
Sbjct: 422 MLTGRSEEQTLARDLYNRWERSQTTGSKWGEGKIGHLTQLEGKLQVEVDSKVPLSGAELE 481
Query: 144 EYQK----KKDKEAKDK 156
+ + +K+KEA K
Sbjct: 482 AHVESERLQKEKEAAHK 498
>gi|320590943|gb|EFX03384.1| polyadenylation specificity factor [Grosmannia clavigera kw1407]
Length = 1036
Score = 45.1 bits (105), Expect = 0.038, Method: Composition-based stats.
Identities = 40/132 (30%), Positives = 59/132 (44%), Gaps = 36/132 (27%)
Query: 50 NNPFHFKHVKLCHSLAELAKV---PSP----KVVLVSTPDMECGFSRDLFFQWCSSPENS 102
+ PF KH++L A++ +V SP KV+L S ME GFS+++ + P N
Sbjct: 390 SGPFDMKHLRLLERRAQVERVLNSQSPPGGGKVILASDASMEWGFSKEVLRRIADKPRNL 449
Query: 103 IIITNRTSPG-------TLARDLIEL--------------GGN------RTLTLQVKKRI 135
+++T PG +LAR L E GGN R L Q ++
Sbjct: 450 VLLTE--VPGVLSANKRSLARTLWEWWKSEQGQEAASSGGGGNPVRAGGRHLQFQDGLKV 507
Query: 136 RLEGEELEEYQK 147
L+G EL YQ+
Sbjct: 508 ALDGAELLLYQQ 519
>gi|340053150|emb|CCC47437.1| putative cleavage and polyadenylation specificity factor, fragment
[Trypanosoma vivax Y486]
Length = 584
Score = 45.1 bits (105), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 27/86 (31%), Positives = 43/86 (50%), Gaps = 2/86 (2%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ M+D++ + RN PF FK+++ P VVL S ++ G S +LF
Sbjct: 125 VSAMNDRVKQQHANHRN-PFVFKYIQPLVDTRSFED-NGPCVVLASPGMLQSGISLELFE 182
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
+WC N III GT+A+D++
Sbjct: 183 KWCGDRRNGIIIAGYCIDGTIAKDVL 208
>gi|407041778|gb|EKE40943.1| cleavage and polyadenylation specificity factor 73 kDa subunit,
putative [Entamoeba nuttalli P19]
Length = 751
Score = 44.7 bits (104), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 27/85 (31%), Positives = 46/85 (54%), Gaps = 5/85 (5%)
Query: 36 WMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFFQ 94
+M+ +L K + A F FK ++ S + + V++++P M + G SR +F +
Sbjct: 295 FMNQELRKKTKQA----FDFKFIREGSSSVDDGAIDYKPCVVMASPGMLQDGISRKIFER 350
Query: 95 WCSSPENSIIITNRTSPGTLARDLI 119
WC+ +N +II GTLA+DLI
Sbjct: 351 WCTDKKNGVIIPGYCVEGTLAKDLI 375
>gi|443926973|gb|ELU45512.1| cleavage and polyadenylation specificity factor subunit
[Rhizoctonia solani AG-1 IA]
Length = 854
Score = 44.7 bits (104), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 36/118 (30%), Positives = 62/118 (52%), Gaps = 10/118 (8%)
Query: 53 FHFKHVKLCHSL-AELAKVPS--PKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRT 109
FKH+++ S A +++ PS PK++LV + G SR +F ++ S P N++I++ +
Sbjct: 371 LRFKHLEIFPSPDALVSRYPSSMPKLLLVVPATLSHGNSRRIFAEFASVPGNAVILSTPS 430
Query: 110 SPGTLARDLI---ELG---GNRTLTLQVKKRIRLEGEELEEYQKKKDKEAKDKQEKEK 161
PGTLA L LG R V + I+L Y +K++ AK++Q ++
Sbjct: 431 EPGTLANTLFNEWNLGQSDNERFGHGSVGQPIQLNSTMTLTYL-EKERAAKERQATQR 487
>gi|321468347|gb|EFX79332.1| hypothetical protein DAPPUDRAFT_304859 [Daphnia pulex]
Length = 597
Score = 44.7 bits (104), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 43/180 (23%), Positives = 80/180 (44%), Gaps = 16/180 (8%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH++ A P P VV + + G S LF
Sbjct: 282 ITWTNQKIRKTF--VQRNMFEFKHIRPFDK--SYADTPGPMVVFATPGMLHAGLSLQLFK 337
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKDKEA 153
+W + N +I+ GT+ + L G + + ++ ++ I ++ ++ +A
Sbjct: 338 KWAPNENNMLIMPGYCVSGTVGAKV--LSGAKKVEIENRQVIEVK-MSVQYMSFSAHADA 394
Query: 154 KD-KQEKEKIPPHDTSFIN--ELQLSDFKQTLQRN-GIDCEFM----DGVLICCRGTVAV 205
K Q + P + ++ +++ KQ +Q+ GIDC +M + I C+ V V
Sbjct: 395 KGIMQLIQYCEPQNVLLVHGEGVKMEFLKQKIQQEYGIDC-YMPANGESAYIPCKPVVPV 453
>gi|321457255|gb|EFX68345.1| hypothetical protein DAPPUDRAFT_218302 [Daphnia pulex]
Length = 597
Score = 44.7 bits (104), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 42/179 (23%), Positives = 78/179 (43%), Gaps = 14/179 (7%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH++ A P P VV + + G S LF
Sbjct: 282 ITWTNQKIRKTF--VQRNMFEFKHIRPFDK--SYADTPGPMVVFATPGMLHAGLSLQLFK 337
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKDKEA 153
+W + N +I+ GT+ + L G + + ++ ++ I ++ ++ +A
Sbjct: 338 KWAPNENNMLIMPGYCVSGTVGAKV--LSGAKKVEIENRQVIEVK-MSVQYMSFSAHADA 394
Query: 154 KD-KQEKEKIPPHDTSFIN--ELQLSDFKQTLQRN-GIDCEF---MDGVLICCRGTVAV 205
K Q + P + ++ +++ KQ +Q+ GIDC + I C+ V V
Sbjct: 395 KGIMQLIQYCEPQNVLLVHGEGVKMEFLKQKIQQEYGIDCYMPANGESAYIPCKPVVPV 453
>gi|406694795|gb|EKC98117.1| cleavage and polyadenylation specificity factor subunit
[Trichosporon asahii var. asahii CBS 8904]
Length = 958
Score = 44.7 bits (104), Expect = 0.041, Method: Composition-based stats.
Identities = 42/142 (29%), Positives = 62/142 (43%), Gaps = 27/142 (19%)
Query: 47 GARNNPFHFKHVKLCHSLAEL-AKVPS--PKVVLVSTPDMECGFSRDLFFQWCSSPENSI 103
G+ F HV+ + EL K P+ PK+VL P M G SR LF S N +
Sbjct: 338 GSEYGALDFSHVQFFATPEELLEKYPANRPKLVLAIPPTMSHGPSRTLFASMASVTGNVV 397
Query: 104 IITNRTSPGTLARDLIELG---------------GNRT-----LTLQVKKRIRLEGEELE 143
++T TLAR+L G+ T L L++ + L GEELE
Sbjct: 398 LLTGHGEDRTLARELYARWEAHQDEGAHYGHGKIGHATPMEGRLELELDAKEPLSGEELE 457
Query: 144 EYQ----KKKDKEAKDKQEKEK 161
Y+ +K+++EA + E+
Sbjct: 458 AYETAEREKREREAAHQAALER 479
>gi|68076685|ref|XP_680262.1| cleavage and polyadenylation specifity factor protein, [Plasmodium
berghei strain ANKA]
gi|56501171|emb|CAH99756.1| cleavage and polyadenylation specifity factor protein, putative
[Plasmodium berghei]
Length = 888
Score = 44.7 bits (104), Expect = 0.045, Method: Compositional matrix adjust.
Identities = 27/72 (37%), Positives = 39/72 (54%), Gaps = 4/72 (5%)
Query: 51 NPFHFKHVKLCHSLAE----LAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIIT 106
NPF+FK+VK SL L + P VV+ S ++ G S+ +F S ++ +IIT
Sbjct: 349 NPFNFKYVKYAKSLDSILNYLYQDNYPCVVMASPGMLQSGISKSIFNIIASDKKSGVIIT 408
Query: 107 NRTSPGTLARDL 118
T GTLA +L
Sbjct: 409 GYTIKGTLADEL 420
>gi|388579831|gb|EIM20151.1| Metallo-hydrolase/oxidoreductase [Wallemia sebi CBS 633.66]
Length = 626
Score = 44.7 bits (104), Expect = 0.050, Method: Compositional matrix adjust.
Identities = 28/90 (31%), Positives = 45/90 (50%), Gaps = 6/90 (6%)
Query: 32 AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKV--PSPKVVLVSTPDMECGFSR 89
A I M+ + + F+ N PF FK + L +L K P V+L S ++ G SR
Sbjct: 307 AYIRTMNKNINRRFDSGEN-PFQFKFIS---ELGDLRKWQDKGPCVMLASPGMLQSGTSR 362
Query: 90 DLFFQWCSSPENSIIITNRTSPGTLARDLI 119
+L +W P+N +II + GT+A ++
Sbjct: 363 ELLERWAPDPKNGLIICGYSVEGTMAHSIV 392
>gi|346323812|gb|EGX93410.1| cleavage and polyadenylation specifity factor, 73 kDa subunit,
putative [Cordyceps militaris CM01]
Length = 879
Score = 44.3 bits (103), Expect = 0.053, Method: Compositional matrix adjust.
Identities = 26/87 (29%), Positives = 48/87 (55%), Gaps = 4/87 (4%)
Query: 35 EWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFF 93
E M++ GA P+ FK+++ +L V V++++P M + G SR+LF
Sbjct: 325 ERMAEAETSGGAGA-GGPWDFKYIRSLKNLDRFDDVGG--CVMLASPGMLQNGVSRELFE 381
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIE 120
+W + +N +IIT + GT+AR +++
Sbjct: 382 RWAPNDKNGVIITGYSVEGTMARQIMK 408
>gi|171679503|ref|XP_001904698.1| hypothetical protein [Podospora anserina S mat+]
gi|170939377|emb|CAP64605.1| unnamed protein product [Podospora anserina S mat+]
Length = 967
Score = 44.3 bits (103), Expect = 0.055, Method: Compositional matrix adjust.
Identities = 45/199 (22%), Positives = 74/199 (37%), Gaps = 66/199 (33%)
Query: 14 NNPFHFKHVKL---CHSLAELAK-----IEWMSDKLMKSFEGARNN-------------- 51
NN F++ +L HS+ K EWM DK+++ FE A
Sbjct: 304 NNNETFRNAQLYLAGHSIGSTLKHARSLFEWMDDKIVREFEAAAGGKESHSRGQRGGHHH 363
Query: 52 ------PFHFKHVKLCHSLAELAKV---------PSPKVVLVSTPDMECGFSRDLFFQWC 96
PF FKH++L +++ V P +V+L + +E GFS+++
Sbjct: 364 DHKVAGPFDFKHLRLLERKGQVSWVLKQALEDLEPKGRVILATDSSLEWGFSKEVLKSIA 423
Query: 97 SSPENSIIIT-----NRTSPGTLARDLIE-----------------------LGGNRTLT 128
N +++T N P ++AR L + G R L
Sbjct: 424 GDARNLVLLTEKPALNENKP-SIARTLWDWWKERKDGVATEQTTSGETFEQVYAGGRELE 482
Query: 129 LQVKKRIRLEGEELEEYQK 147
+ + LEG +L YQ+
Sbjct: 483 IADASKQALEGNDLSVYQQ 501
>gi|82596170|ref|XP_726151.1| cleavage and polyadenylation specificity factor, 73 kDa subunit
[Plasmodium yoelii yoelii 17XNL]
gi|23481438|gb|EAA17716.1| cleavage and polyadenylation specificity factor, 73 kDa subunit
[Plasmodium yoelii yoelii]
Length = 942
Score = 44.3 bits (103), Expect = 0.056, Method: Compositional matrix adjust.
Identities = 26/72 (36%), Positives = 39/72 (54%), Gaps = 4/72 (5%)
Query: 51 NPFHFKHVKLCHSLAE----LAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIIT 106
NPF+FK+VK SL L + P V++ S ++ G S+ +F S ++ +IIT
Sbjct: 348 NPFNFKYVKYAKSLDSILNYLYQDNYPCVIMASPGMLQSGISKSIFNIIASDKKSGVIIT 407
Query: 107 NRTSPGTLARDL 118
T GTLA +L
Sbjct: 408 GYTIKGTLADEL 419
>gi|406865774|gb|EKD18815.1| RNA-metabolising metallo-beta-lactamase [Marssonina brunnea f. sp.
'multigermtubi' MB_m1]
Length = 1331
Score = 44.3 bits (103), Expect = 0.058, Method: Compositional matrix adjust.
Identities = 38/173 (21%), Positives = 65/173 (37%), Gaps = 59/173 (34%)
Query: 34 IEWMSDKLMKSFEG--------------------ARNNPFHFKHVKLCHSLAELAKVPS- 72
+EWM D +++ FE + PF FKH++L ++ ++
Sbjct: 334 LEWMDDAIIREFEANAGINQKETGSKAAGDAKGSSDGGPFDFKHLRLLERKGQIDRIMGQ 393
Query: 73 ----------PKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRT------SPGTLAR 116
KV+L S +E GFSRD+ N II+T + + +LA+
Sbjct: 394 TDIDRHGRSIGKVILASDASLEWGFSRDILKAVADDTRNLIILTEKMAKPQGEATSSLAK 453
Query: 117 DLIEL----------------------GGNRTLTLQVKKRIRLEGEELEEYQK 147
L G +++ ++ KR+ LEG +L YQ+
Sbjct: 454 TLWSWWEDRESVVSEETKSGLVLDRVNSGAKSVAIREAKRLALEGNDLSVYQQ 506
>gi|325088985|gb|EGC42295.1| cleavage and polyadenylation specific subunit [Ajellomyces
capsulatus H88]
Length = 1010
Score = 44.3 bits (103), Expect = 0.059, Method: Compositional matrix adjust.
Identities = 28/103 (27%), Positives = 51/103 (49%), Gaps = 13/103 (12%)
Query: 51 NPFHFKHVKLCHSLAELAKV---PSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITN 107
PF FKH+K+ A++ K+ +PKV+L S ++ G+S+ + + S EN +I+T
Sbjct: 401 GPFTFKHLKIVERKAKIEKILGSNTPKVILTSDTSLDWGYSKHVLQKIASGSENLVILTE 460
Query: 108 RTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKD 150
S + N+ + ++ R L E Y+++KD
Sbjct: 461 SFS----------VSPNKQMVDGIRSRPSLAHEIWTIYEERKD 493
>gi|197099184|ref|NP_001124760.1| integrator complex subunit 11 [Pongo abelii]
gi|55725797|emb|CAH89679.1| hypothetical protein [Pongo abelii]
Length = 655
Score = 44.3 bits (103), Expect = 0.060, Method: Compositional matrix adjust.
Identities = 27/105 (25%), Positives = 49/105 (46%), Gaps = 6/105 (5%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K P P VV + + G S +F
Sbjct: 283 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFTDNPGPMVVFATPGMLHAGQSLQIFR 338
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
+W + +N +I+ GT+ + L G R L ++ ++ + ++
Sbjct: 339 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 381
>gi|405120276|gb|AFR95047.1| cleavage and polyadenylation specificity factor subunit
[Cryptococcus neoformans var. grubii H99]
Length = 899
Score = 44.3 bits (103), Expect = 0.063, Method: Compositional matrix adjust.
Identities = 39/137 (28%), Positives = 62/137 (45%), Gaps = 27/137 (19%)
Query: 47 GARNNPFHFKHVKLCHSLAELAK---VPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSI 103
G+ F+HV + +L + + PK+VL P M G SR LF ++ N I
Sbjct: 362 GSEYGVLDFRHVLFFLNTTDLLQTYPLTRPKLVLAVPPTMSHGPSRFLFTAMANTEGNVI 421
Query: 104 IITNRTSPGTLARDLI------ELGGNR--------------TLTLQVKKRIRLEGEELE 143
++T R+ TLARDL + G++ L ++V ++ L G ELE
Sbjct: 422 MLTGRSEEQTLARDLYNRWERSQTAGSKWGEGKIGHLTRLEGKLQVEVDSKVPLSGAELE 481
Query: 144 EYQK----KKDKEAKDK 156
+ + +K+KEA K
Sbjct: 482 AHVESERLQKEKEAAHK 498
>gi|66472504|ref|NP_001018457.1| integrator complex subunit 11 [Danio rerio]
gi|82192739|sp|Q503E1.1|INT11_DANRE RecName: Full=Integrator complex subunit 11; Short=Int11; AltName:
Full=Cleavage and polyadenylation-specific factor 3-like
protein; Short=CPSF3-like protein
gi|63102425|gb|AAH95364.1| Zgc:110671 [Danio rerio]
Length = 598
Score = 44.3 bits (103), Expect = 0.063, Method: Compositional matrix adjust.
Identities = 28/105 (26%), Positives = 49/105 (46%), Gaps = 10/105 (9%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K A P P VV + + G S +F
Sbjct: 283 ITWTNQKIRKTF--VQRNMFEFKHIKAFDR--SYADNPGPMVVFATPGMLHAGQSLQIFK 338
Query: 94 QWCSSPENSIIITNRTSPGTLARDL------IELGGNRTLTLQVK 132
+W + +N +I+ GT+ + +E+ G TL ++++
Sbjct: 339 KWAGNEKNMVIMPGYCVQGTVGHKILNGQKKLEMEGRATLDVKLQ 383
>gi|392593709|gb|EIW83034.1| Metallo-hydrolase oxidoreductase [Coniophora puteana RWD-64-598
SS2]
Length = 770
Score = 44.3 bits (103), Expect = 0.065, Method: Compositional matrix adjust.
Identities = 27/83 (32%), Positives = 38/83 (45%), Gaps = 21/83 (25%)
Query: 49 RNNPFHFKHVKLCHSLAELAKVPSPK------------VVLVSTPDMECGFSRDLFFQWC 96
R+NPF FKH+ + +P PK VVL S + G SR+L W
Sbjct: 298 RDNPFVFKHI---------SNLPQPKGWERKIAEGPPCVVLASPGFCQSGPSRELLELWA 348
Query: 97 SSPENSIIITNRTSPGTLARDLI 119
N I+T + GT+ARD++
Sbjct: 349 PDARNGFILTGYSVEGTMARDIL 371
>gi|224108267|ref|XP_002314781.1| predicted protein [Populus trichocarpa]
gi|222863821|gb|EEF00952.1| predicted protein [Populus trichocarpa]
Length = 639
Score = 43.9 bits (102), Expect = 0.069, Method: Compositional matrix adjust.
Identities = 31/88 (35%), Positives = 42/88 (47%), Gaps = 8/88 (9%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSL-AELAKVPSPKVVLVSTPDM-ECGFSRDL 91
I W S K+ +++ A N F FKHV H+ L P P VL +TP M GFS ++
Sbjct: 282 ISWTSQKVKETY--ATRNAFDFKHV---HNFDRSLINAPGP-CVLFATPGMISGGFSLEV 335
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLI 119
F QW N I + GT+ L+
Sbjct: 336 FKQWAPCEMNLITLPGYCVAGTVGHKLM 363
>gi|452845681|gb|EME47614.1| hypothetical protein DOTSEDRAFT_146416 [Dothistroma septosporum
NZE10]
Length = 839
Score = 43.9 bits (102), Expect = 0.070, Method: Compositional matrix adjust.
Identities = 28/97 (28%), Positives = 46/97 (47%), Gaps = 7/97 (7%)
Query: 52 PFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSP 111
P+ F+ V+ + V VVL S ++ G SR L +W P+N ++IT +
Sbjct: 339 PWDFQFVRALKGVDRFEDVGG-SVVLASPGMLQNGPSRALLERWAPDPKNGVVITGYSVE 397
Query: 112 GTLAR------DLIELGGNRTLTLQVKKRIRLEGEEL 142
GT+A+ D I NRT + + KR E +++
Sbjct: 398 GTMAKQIMLEPDSIAAVTNRTANMSLGKRAGAEEQQM 434
>gi|384486005|gb|EIE78185.1| hypothetical protein RO3G_02889 [Rhizopus delemar RA 99-880]
Length = 613
Score = 43.9 bits (102), Expect = 0.071, Method: Compositional matrix adjust.
Identities = 23/86 (26%), Positives = 42/86 (48%), Gaps = 4/86 (4%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ +F ++ N F FKH+K + P PKV+ + + G S ++F
Sbjct: 260 INWTNQKIKSTF--SQRNMFDFKHIKTWNR--NYIDQPGPKVLFATPGMLNAGTSLEVFK 315
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
+W P+N +I+ GT+ ++
Sbjct: 316 KWAPDPKNMVIMPGFCVAGTVGSKVL 341
>gi|392569726|gb|EIW62899.1| mRNA 3'-end-processing protein YSH1 [Trametes versicolor FP-101664
SS1]
Length = 805
Score = 43.9 bits (102), Expect = 0.073, Method: Compositional matrix adjust.
Identities = 26/75 (34%), Positives = 40/75 (53%), Gaps = 5/75 (6%)
Query: 49 RNNPFHFKHVKLCHSLA----ELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSII 104
+NPF FKH+ ++A+ P P VVL S M+ G SR+L W N +I
Sbjct: 298 HDNPFVFKHITNVPGTRGWERKIAEGP-PCVVLASPGFMQTGPSRELLELWAPDGRNGLI 356
Query: 105 ITNRTSPGTLARDLI 119
+T + GT+AR+++
Sbjct: 357 VTGYSIEGTMAREIL 371
>gi|225560694|gb|EEH08975.1| cleavage and polyadenylation specificity factor subunit 2
[Ajellomyces capsulatus G186AR]
Length = 1010
Score = 43.9 bits (102), Expect = 0.074, Method: Compositional matrix adjust.
Identities = 21/59 (35%), Positives = 35/59 (59%), Gaps = 3/59 (5%)
Query: 51 NPFHFKHVKLCHSLAELAKV---PSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIIT 106
PF FKH+K+ A+L K+ +PKV+L S ++ G+S+ + + S EN +I+T
Sbjct: 401 GPFTFKHLKIVERKAKLEKILGSNTPKVILTSDTSLDWGYSKHVLQKIASGSENLVILT 459
>gi|308457231|ref|XP_003091005.1| hypothetical protein CRE_12341 [Caenorhabditis remanei]
gi|308258743|gb|EFP02696.1| hypothetical protein CRE_12341 [Caenorhabditis remanei]
Length = 312
Score = 43.9 bits (102), Expect = 0.075, Method: Compositional matrix adjust.
Identities = 33/100 (33%), Positives = 51/100 (51%), Gaps = 5/100 (5%)
Query: 140 EELEEYQKKKDKEAKDKQEKEKIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICC 199
+ELE+ + K + K K K+ F+N+L+LSDFK L G EF+ G L+
Sbjct: 210 KELEKMRNKIGR-VKRNTNKYKVQKQMAIFVNDLKLSDFKNWLVEKGYKAEFLSGTLLIN 268
Query: 200 RGTVAVRRVVLVSTPDMECGFSRDLFFQWCSSPENSIIIT 239
G ++RR + + ME S+D + S NSI+I+
Sbjct: 269 GGKCSIRRGEMGFS--MEGALSKDDYKLLNSI--NSIVIS 304
>gi|402084516|gb|EJT79534.1| endoribonuclease YSH1 [Gaeumannomyces graminis var. tritici
R3-111a-1]
Length = 868
Score = 43.9 bits (102), Expect = 0.077, Method: Compositional matrix adjust.
Identities = 21/74 (28%), Positives = 40/74 (54%), Gaps = 1/74 (1%)
Query: 47 GARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIIT 106
G P+ FK ++ +L + P V+L S ++ G SR+L +W S +N ++IT
Sbjct: 328 GTGGGPWDFKFIRSLKNLDRFDDL-GPCVMLASPGMLQTGVSRELLERWAPSDKNGVVIT 386
Query: 107 NRTSPGTLARDLIE 120
+ GT+A+ +++
Sbjct: 387 GYSVEGTMAKQIMQ 400
>gi|281201684|gb|EFA75892.1| integrator complex subunit 11 [Polysphondylium pallidum PN500]
Length = 648
Score = 43.9 bits (102), Expect = 0.078, Method: Compositional matrix adjust.
Identities = 32/106 (30%), Positives = 51/106 (48%), Gaps = 7/106 (6%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLF 92
I W + K+ ++F + N F FKH+K + L P P +VL +TP M G S ++F
Sbjct: 309 IGWTNQKIKQTF--VKRNMFDFKHIKPFDRM--LVDAPGP-MVLFATPGMLHAGASLEVF 363
Query: 93 FQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
+W S N II GT+ L+ + +++ K+ LE
Sbjct: 364 KKWAPSELNMTIIPGYCVVGTVGNKLLSNASGPQM-VEIDKKTTLE 408
>gi|408390480|gb|EKJ69876.1| hypothetical protein FPSE_09963 [Fusarium pseudograminearum CS3096]
Length = 833
Score = 43.9 bits (102), Expect = 0.080, Method: Compositional matrix adjust.
Identities = 24/79 (30%), Positives = 46/79 (58%), Gaps = 4/79 (5%)
Query: 44 SFEGA-RNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFFQWCSSPEN 101
S +GA + P+ FK+++ +L V V++++P M + G SR+L +W S +N
Sbjct: 324 SGDGAGKGGPWDFKYIRSLKNLDRFDDVGG--CVMLASPGMLQNGVSRELLERWAPSEKN 381
Query: 102 SIIITNRTSPGTLARDLIE 120
+IIT + GT+A+ +++
Sbjct: 382 GVIITGYSVEGTMAKQIMQ 400
>gi|367054168|ref|XP_003657462.1| hypothetical protein THITE_2123200 [Thielavia terrestris NRRL 8126]
gi|347004728|gb|AEO71126.1| hypothetical protein THITE_2123200 [Thielavia terrestris NRRL 8126]
Length = 859
Score = 43.9 bits (102), Expect = 0.080, Method: Compositional matrix adjust.
Identities = 24/87 (27%), Positives = 45/87 (51%), Gaps = 3/87 (3%)
Query: 35 EWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFF 93
E M++ + P+ FK ++ S+ V V++++P M + G SR+L
Sbjct: 316 ERMAEAEASGDAAGKGGPWDFKFIRSLKSIDRFDDVGG--CVMLASPGMLQNGVSRELLE 373
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIE 120
+W S +N +IIT + GT+A+ L++
Sbjct: 374 RWAPSEKNGVIITGYSVEGTMAKQLMQ 400
>gi|296424981|ref|XP_002842022.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295638279|emb|CAZ86213.1| unnamed protein product [Tuber melanosporum]
Length = 975
Score = 43.9 bits (102), Expect = 0.080, Method: Compositional matrix adjust.
Identities = 38/163 (23%), Positives = 61/163 (37%), Gaps = 50/163 (30%)
Query: 34 IEWMSDKLMKSFE----------------------GARNNPFHFKHVKLCHSLAELAKV- 70
+EWM + +++ FE G + PF F H+ L + L ++
Sbjct: 317 LEWMDEGVVREFESIAGGDRRGNRQRDDAEGKGNDGNKAGPFDFLHLNLVSTQGHLNRIL 376
Query: 71 ----PSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR------------------ 108
KV++ S + GFSR+ + S +N +++T R
Sbjct: 377 NDGNERGKVIIASDSSLGWGFSREALMRLASDEKNLVVLTERSDGKLGWAGNLWQQWKEK 436
Query: 109 TSPGTLARDL----IELGGNRTLTLQVKKRIRLEGEELEEYQK 147
T G A + L G R L + R LEG+ELE Y +
Sbjct: 437 TGSGGEANATDWQEVSLDGQRA-ELDIPHRTPLEGQELEAYNR 478
>gi|70945220|ref|XP_742453.1| cleavage and polyadenylation specifity factor protein, [Plasmodium
chabaudi chabaudi]
gi|56521446|emb|CAH74696.1| cleavage and polyadenylation specifity factor protein, putative
[Plasmodium chabaudi chabaudi]
Length = 564
Score = 43.9 bits (102), Expect = 0.082, Method: Compositional matrix adjust.
Identities = 27/83 (32%), Positives = 42/83 (50%), Gaps = 4/83 (4%)
Query: 40 KLMKSFEGARNNPFHFKHVKLCHSLAE----LAKVPSPKVVLVSTPDMECGFSRDLFFQW 95
+ +K NPF+FK+VK SL L + P V++ S ++ G S+ +F
Sbjct: 338 EFIKKIVNEGKNPFNFKYVKYAKSLDSILNYLYQDNYPCVIMASPGMLQSGISKSIFNII 397
Query: 96 CSSPENSIIITNRTSPGTLARDL 118
S ++ +IIT T GTLA +L
Sbjct: 398 ASDKKSGVIITGYTIKGTLADEL 420
>gi|442570104|sp|Q4IPN9.2|YSH1_GIBZE RecName: Full=Endoribonuclease YSH1; AltName: Full=mRNA
3'-end-processing protein YSH1
Length = 833
Score = 43.9 bits (102), Expect = 0.082, Method: Compositional matrix adjust.
Identities = 24/79 (30%), Positives = 46/79 (58%), Gaps = 4/79 (5%)
Query: 44 SFEGA-RNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFFQWCSSPEN 101
S +GA + P+ FK+++ +L V V++++P M + G SR+L +W S +N
Sbjct: 324 SGDGAGKGGPWDFKYIRSLKNLDRFDDVGG--CVMLASPGMLQNGVSRELLERWAPSEKN 381
Query: 102 SIIITNRTSPGTLARDLIE 120
+IIT + GT+A+ +++
Sbjct: 382 GVIITGYSVEGTMAKQIMQ 400
>gi|85079519|ref|XP_956368.1| hypothetical protein NCU03479 [Neurospora crassa OR74A]
gi|74630409|sp|Q8WZS6.1|YSH1_NEUCR RecName: Full=Endoribonuclease ysh-1; AltName: Full=mRNA
3'-end-processing protein ysh-1
gi|18376069|emb|CAD21097.1| related to BRR5 (component of pre-mRNA polyadenylation factor PF I)
[Neurospora crassa]
gi|28917429|gb|EAA27132.1| hypothetical protein NCU03479 [Neurospora crassa OR74A]
Length = 850
Score = 43.5 bits (101), Expect = 0.088, Method: Compositional matrix adjust.
Identities = 25/79 (31%), Positives = 45/79 (56%), Gaps = 4/79 (5%)
Query: 44 SFEGA-RNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFFQWCSSPEN 101
S +GA + P+ F+ ++ SL V V++++P M + G SR+L +W S +N
Sbjct: 323 SGDGAGKGGPWDFRFIRSLKSLDRFEDVGG--CVMLASPGMLQNGVSRELLERWAPSEKN 380
Query: 102 SIIITNRTSPGTLARDLIE 120
+IIT + GT+A+ L++
Sbjct: 381 GVIITGYSVEGTMAKQLLQ 399
>gi|146170679|ref|XP_001017643.2| metallo beta lactamase domain containing protein [Tetrahymena
thermophila]
gi|146145062|gb|EAR97398.2| metallo beta lactamase domain containing protein [Tetrahymena
thermophila SB210]
Length = 675
Score = 43.5 bits (101), Expect = 0.089, Method: Compositional matrix adjust.
Identities = 29/87 (33%), Positives = 44/87 (50%), Gaps = 6/87 (6%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLF 92
+ W ++K+ S+ +N F FK++K L K P +VL +TP M G S +F
Sbjct: 315 VNWTNEKIKSSY--LTDNMFDFKYIKPFSR--SLIKTNGP-MVLFATPGMLHAGLSMQVF 369
Query: 93 FQWCSSPENSIIITNRTSPGTLARDLI 119
+WC +N++II GTL LI
Sbjct: 370 KEWCYDEKNTLIIPGYCVAGTLGCVLI 396
>gi|399216276|emb|CCF72964.1| unnamed protein product [Babesia microti strain RI]
Length = 916
Score = 43.5 bits (101), Expect = 0.091, Method: Compositional matrix adjust.
Identities = 32/93 (34%), Positives = 42/93 (45%), Gaps = 1/93 (1%)
Query: 27 SLAELAKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTP-DMEC 85
SL IE+MS K+ F NP F +V L +L + V+ STP +
Sbjct: 280 SLIFSTSIEYMSSKVKSEFSRDLKNPLIFDNVIYLDKLEQLKPFTNVPCVIFSTPGSLNW 339
Query: 86 GFSRDLFFQWCSSPENSIIITNRTSPGTLARDL 118
GFS LF S N II++ + TLAR L
Sbjct: 340 GFSNALFAAIGSKKGNLIILSKEPTTKTLARKL 372
>gi|221059687|ref|XP_002260489.1| cleavage and polyadenylation specifity protein [Plasmodium knowlesi
strain H]
gi|193810562|emb|CAQ41756.1| cleavage and polyadenylation specifity protein,putative [Plasmodium
knowlesi strain H]
Length = 938
Score = 43.5 bits (101), Expect = 0.093, Method: Compositional matrix adjust.
Identities = 25/72 (34%), Positives = 40/72 (55%), Gaps = 4/72 (5%)
Query: 51 NPFHFKHVKLCHSLAE----LAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIIT 106
NPF+FK+VK SL L + +P VV+ S ++ G S+++F ++ +I+T
Sbjct: 355 NPFNFKYVKYAKSLDSILNYLYQDNNPCVVMASPGMLQNGISKNIFNIIAPDKKSGVILT 414
Query: 107 NRTSPGTLARDL 118
T GTLA +L
Sbjct: 415 GYTVKGTLAHEL 426
>gi|342879865|gb|EGU81098.1| hypothetical protein FOXB_08372 [Fusarium oxysporum Fo5176]
Length = 858
Score = 43.5 bits (101), Expect = 0.095, Method: Compositional matrix adjust.
Identities = 24/79 (30%), Positives = 46/79 (58%), Gaps = 4/79 (5%)
Query: 44 SFEGA-RNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFFQWCSSPEN 101
S +GA + P+ FK+++ +L V V++++P M + G SR+L +W S +N
Sbjct: 350 SGDGAGKGGPWDFKYIRSLKNLDRFDDVGG--CVMLASPGMLQNGVSRELLERWAPSEKN 407
Query: 102 SIIITNRTSPGTLARDLIE 120
+IIT + GT+A+ +++
Sbjct: 408 GVIITGYSVEGTMAKQIMQ 426
>gi|336468884|gb|EGO57047.1| hypothetical protein NEUTE1DRAFT_84705 [Neurospora tetrasperma FGSC
2508]
gi|350288819|gb|EGZ70044.1| Endoribonuclease ysh-1 [Neurospora tetrasperma FGSC 2509]
Length = 853
Score = 43.5 bits (101), Expect = 0.095, Method: Compositional matrix adjust.
Identities = 25/79 (31%), Positives = 45/79 (56%), Gaps = 4/79 (5%)
Query: 44 SFEGA-RNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFFQWCSSPEN 101
S +GA + P+ F+ ++ SL V V++++P M + G SR+L +W S +N
Sbjct: 323 SGDGAGKGGPWDFRFIRSLKSLDRFEDVGG--CVMLASPGMLQNGVSRELLERWAPSEKN 380
Query: 102 SIIITNRTSPGTLARDLIE 120
+IIT + GT+A+ L++
Sbjct: 381 GVIITGYSVEGTMAKQLLQ 399
>gi|46107872|ref|XP_380995.1| hypothetical protein FG00819.1 [Gibberella zeae PH-1]
Length = 864
Score = 43.5 bits (101), Expect = 0.095, Method: Compositional matrix adjust.
Identities = 24/79 (30%), Positives = 46/79 (58%), Gaps = 4/79 (5%)
Query: 44 SFEGA-RNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFFQWCSSPEN 101
S +GA + P+ FK+++ +L V V++++P M + G SR+L +W S +N
Sbjct: 355 SGDGAGKGGPWDFKYIRSLKNLDRFDDVGG--CVMLASPGMLQNGVSRELLERWAPSEKN 412
Query: 102 SIIITNRTSPGTLARDLIE 120
+IIT + GT+A+ +++
Sbjct: 413 GVIITGYSVEGTMAKQIMQ 431
>gi|358058074|dbj|GAA96053.1| hypothetical protein E5Q_02714 [Mixia osmundae IAM 14324]
Length = 896
Score = 43.5 bits (101), Expect = 0.099, Method: Composition-based stats.
Identities = 35/128 (27%), Positives = 65/128 (50%), Gaps = 24/128 (18%)
Query: 57 HVKLCHSLAEL-AKVPS--PKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGT 113
++K+ SL L A +P PKVVL + + G SR +F + + N +++T+ G+
Sbjct: 326 NLKIFPSLDSLYAVIPPHVPKVVLAAPLPLSYGSSRKVFTEMAAQAGNLLLLTSPGPAGS 385
Query: 114 LARDLI----------------ELGG----NRTLTLQVKKRIRLEGEELEEYQKKKDKEA 153
L+R L E+G N ++L ++ ++ L+GEEL+E+ + + A
Sbjct: 386 LSRSLFDKWNEAQTPAQRMGTGEIGQTITLNEAVSLPMRSKVILQGEELQEFLDNQ-RAA 444
Query: 154 KDKQEKEK 161
K++ K+K
Sbjct: 445 KERHAKQK 452
>gi|56403864|emb|CAI29717.1| hypothetical protein [Pongo abelii]
Length = 600
Score = 43.5 bits (101), Expect = 0.099, Method: Compositional matrix adjust.
Identities = 27/105 (25%), Positives = 49/105 (46%), Gaps = 6/105 (5%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+ + N F FKH+K A P P VV + + G S +F
Sbjct: 283 IPWTNQKIRKT--SVQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 338
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
+W + +N +I+ GT+ + L G R L ++ ++ + ++
Sbjct: 339 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 381
>gi|378730429|gb|EHY56888.1| endoribonuclease ysh1 [Exophiala dermatitidis NIH/UT8656]
Length = 868
Score = 43.5 bits (101), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 28/101 (27%), Positives = 47/101 (46%), Gaps = 12/101 (11%)
Query: 46 EGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFFQWCSSPENSII 104
+G P+ F+ V+ +L V V++++P M + G SR L +W P N +I
Sbjct: 332 KGVSAGPWDFRFVRSLRNLDRFDDVGG--CVMLASPGMLQSGMSRVLLERWAPDPRNGVI 389
Query: 105 ITNRTSPGTLARDL---------IELGGNRTLTLQVKKRIR 136
+T GT+AR + I GGN + Q+ ++ R
Sbjct: 390 MTGYNVEGTMARTILSEPEMIPAIMSGGNTGVGQQMGRKGR 430
>gi|302927041|ref|XP_003054415.1| predicted protein [Nectria haematococca mpVI 77-13-4]
gi|256735356|gb|EEU48702.1| predicted protein [Nectria haematococca mpVI 77-13-4]
Length = 827
Score = 43.5 bits (101), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 24/79 (30%), Positives = 46/79 (58%), Gaps = 4/79 (5%)
Query: 44 SFEGA-RNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFFQWCSSPEN 101
S +GA + P+ FK+++ +L V V++++P M + G SR+L +W S +N
Sbjct: 324 SGDGAGKGGPWDFKYIRSLKNLDRFDDVGG--CVMLASPGMLQNGVSRELLERWAPSEKN 381
Query: 102 SIIITNRTSPGTLARDLIE 120
+IIT + GT+A+ +++
Sbjct: 382 GVIITGYSVEGTMAKQIMQ 400
>gi|321257420|ref|XP_003193582.1| cleavage and polyadenylation specificity factor subunit
[Cryptococcus gattii WM276]
gi|317460052|gb|ADV21795.1| Cleavage and polyadenylation specificity factor subunit, putative
[Cryptococcus gattii WM276]
Length = 900
Score = 43.5 bits (101), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 40/137 (29%), Positives = 64/137 (46%), Gaps = 27/137 (19%)
Query: 47 GARNNPFHFKHVKLCHSLAELAK---VPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSI 103
G+ F+HV+ + +L + + PK+VL P M G SR LF ++ N I
Sbjct: 363 GSEYGVLDFRHVQFFLNPTDLLQTYPLTRPKLVLAIPPTMSHGPSRFLFTAMANTEGNVI 422
Query: 104 IITNRTSPGTLARDLIE-------LG--------GNRT-----LTLQVKKRIRLEGEELE 143
++T R+ TLARDL +G G+ T L +++ ++ L G ELE
Sbjct: 423 MLTGRSEEQTLARDLFNRWERSQTVGSKWGEGKIGHLTQLEGKLQVEMDSKVPLSGAELE 482
Query: 144 EYQK----KKDKEAKDK 156
+ + +K+KEA K
Sbjct: 483 AHMESERLQKEKEAAHK 499
>gi|358385845|gb|EHK23441.1| hypothetical protein TRIVIDRAFT_37526 [Trichoderma virens Gv29-8]
Length = 957
Score = 43.5 bits (101), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 31/107 (28%), Positives = 50/107 (46%), Gaps = 26/107 (24%)
Query: 34 IEWMSDKLMKSFE----GAR----------NNPFHFKHVKLCHSLAELAKVPS------- 72
+EWM + +++ FE G R PF FK+++L A++ K+ S
Sbjct: 332 LEWMDNNIVQEFEAFAEGQRKVNGGNEKKEGAPFDFKYLRLLERKAQITKLLSQNIENGE 391
Query: 73 --PKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARD 117
+V+L S M+ GFS+DL N +I+T R + LA+D
Sbjct: 392 TQGRVILASDVSMDWGFSKDLVKGLAKDSRNLVILTERPN---LAKD 435
>gi|353237084|emb|CCA69065.1| hypothetical protein PIIN_02923 [Piriformospora indica DSM 11827]
Length = 887
Score = 43.1 bits (100), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 31/120 (25%), Positives = 58/120 (48%), Gaps = 23/120 (19%)
Query: 53 FHFKHVKLCHSLAELAKVPS---PKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRT 109
FK V+ + E+ + S PK++L + G SR LF + + N +++T RT
Sbjct: 365 LRFKFVEFFSNPDEMINIFSSREPKLILSVPASLSHGPSRSLFADFAVNEGNMVVLTQRT 424
Query: 110 SPGTLARDLIE--------------------LGGNRTLTLQVKKRIRLEGEELEEYQKKK 149
GTL R L++ + +R + ++++ ++ L+G ELEEY++K+
Sbjct: 425 GMGTLNRFLLDRWEAGQEDSQRWQDGHIGVPVSLDRPIDMELRIKVPLQGVELEEYREKE 484
>gi|297569676|ref|YP_003691020.1| RNA-metabolising metallo-beta-lactamase [Desulfurivibrio
alkaliphilus AHT2]
gi|296925591|gb|ADH86401.1| RNA-metabolising metallo-beta-lactamase [Desulfurivibrio
alkaliphilus AHT2]
Length = 479
Score = 43.1 bits (100), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 24/87 (27%), Positives = 44/87 (50%), Gaps = 5/87 (5%)
Query: 39 DKLMKSF--EGARNNPFHFKHVKLCHSLA---ELAKVPSPKVVLVSTPDMECGFSRDLFF 93
D+ ++ F +G +NPF F ++ S+ +L + P V++ ++ ME G
Sbjct: 309 DREIRDFILKGNHDNPFGFDTLRYTQSVGDSKQLNALKKPAVIISASGMMEGGRILHHLR 368
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIE 120
S P N+I++T +P TL R ++E
Sbjct: 369 NRVSDPRNTILVTGWQAPNTLGRKIVE 395
>gi|116200035|ref|XP_001225829.1| hypothetical protein CHGG_08173 [Chaetomium globosum CBS 148.51]
gi|88179452|gb|EAQ86920.1| hypothetical protein CHGG_08173 [Chaetomium globosum CBS 148.51]
Length = 854
Score = 43.1 bits (100), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 24/79 (30%), Positives = 45/79 (56%), Gaps = 4/79 (5%)
Query: 44 SFEGA-RNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFFQWCSSPEN 101
S +GA + P+ FK ++ S+ V V++++P M + G SR+L +W S +N
Sbjct: 324 SGDGAGKGGPWDFKFIRSLKSIDRFEDVGG--CVMLASPGMLQNGVSRELLERWAPSEKN 381
Query: 102 SIIITNRTSPGTLARDLIE 120
+IIT + GT+A+ +++
Sbjct: 382 GVIITGYSVEGTMAKQIMQ 400
>gi|156042700|ref|XP_001587907.1| hypothetical protein SS1G_11148 [Sclerotinia sclerotiorum 1980]
gi|154695534|gb|EDN95272.1| hypothetical protein SS1G_11148 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 936
Score = 43.1 bits (100), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 39/171 (22%), Positives = 69/171 (40%), Gaps = 58/171 (33%)
Query: 35 EWMSDKLMKSFEG--------------------ARNN--PFHFKHVKLCHSLAELAKV-- 70
EWM + +++ FE ++ N PF FKH++L ++ ++
Sbjct: 309 EWMDEAIIREFEAQPGHEEQQTGQQRRHAYSDESKQNAGPFEFKHLRLLGRKGQIDRMLN 368
Query: 71 -------PSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR--------------- 108
KV+L S +E GFS+++ + +N +I+T +
Sbjct: 369 ETDNLGRSVGKVILASDTSIEWGFSKEVLRKIADDDKNLLILTEKLNRIDGVTGLGRTLW 428
Query: 109 -----------TSPGTLARDLIEL-GGNRTLTLQVKKRIRLEGEELEEYQK 147
T P + +L ++ GG R L ++ KRI LEG +L YQ+
Sbjct: 429 SWWEERRNGVATEPSSNGGNLEQVYGGGRDLEIREPKRIPLEGNDLTVYQQ 479
>gi|344229479|gb|EGV61364.1| hypothetical protein CANTEDRAFT_98614 [Candida tenuis ATCC 10573]
Length = 943
Score = 43.1 bits (100), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 30/90 (33%), Positives = 42/90 (46%), Gaps = 11/90 (12%)
Query: 34 IEWMSDKLMKSFE---------GARNN-PFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM 83
+EWMS L K E G RN+ PF V L + EL ++ PKVV S D+
Sbjct: 284 LEWMSPSLSKELENANSIVTNTGNRNHFPFDPSKVDLVSTPYELTQMAGPKVVFTSGVDL 343
Query: 84 ECG-FSRDLFFQWCSSPENSIIITNRTSPG 112
G S + C+ + +II+T +T G
Sbjct: 344 NSGELSSEALRVLCNDEKTTIILTEKTHFG 373
>gi|417403209|gb|JAA48422.1| Putative mrna cleavage and polyadenylation factor ii complex brr5
cpsf subunit [Desmodus rotundus]
Length = 604
Score = 43.1 bits (100), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 29/111 (26%), Positives = 52/111 (46%), Gaps = 7/111 (6%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K A P VV + + G S +F
Sbjct: 283 ITWTNQKIRKTF--VQRNMFEFKHIK-AFDRAXXXAHPCAMVVFATPGMLHAGQSLQIFR 339
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKK--RIRLEGEEL 142
+W + +N +I+ GT+ + L G R L ++ ++ +R++ E +
Sbjct: 340 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVRMQVEYM 388
>gi|429966185|gb|ELA48182.1| hypothetical protein VCUG_00420 [Vavraia culicis 'floridensis']
Length = 669
Score = 43.1 bits (100), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 21/71 (29%), Positives = 38/71 (53%), Gaps = 1/71 (1%)
Query: 49 RNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 108
+ + F+FKH++ S+ + + VV+ S ++ G SR+LF WC N +I
Sbjct: 307 KKDAFNFKHIRNLKSVDN-HLIKNACVVMASPGMLQSGLSRELFESWCEDANNGTVIPGY 365
Query: 109 TSPGTLARDLI 119
GTLA++++
Sbjct: 366 CVQGTLAKEIM 376
>gi|242778797|ref|XP_002479311.1| cleavage and polyadenylation specifity factor, 73 kDa subunit,
putative [Talaromyces stipitatus ATCC 10500]
gi|218722930|gb|EED22348.1| cleavage and polyadenylation specifity factor, 73 kDa subunit,
putative [Talaromyces stipitatus ATCC 10500]
Length = 861
Score = 43.1 bits (100), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 21/70 (30%), Positives = 40/70 (57%), Gaps = 3/70 (4%)
Query: 51 NPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFFQWCSSPENSIIITNRT 109
P+ F++V+ SL + S V++++P M + G SR+L +W S N +++T +
Sbjct: 333 GPWDFRYVRSLRSLERFDDIGS--CVMLASPGMLQTGTSRELLERWAPSERNGVVMTGYS 390
Query: 110 SPGTLARDLI 119
GT+A+ L+
Sbjct: 391 VEGTMAKQLL 400
>gi|212533753|ref|XP_002147033.1| cleavage and polyadenylation specifity factor, 73 kDa subunit,
putative [Talaromyces marneffei ATCC 18224]
gi|210072397|gb|EEA26486.1| cleavage and polyadenylation specifity factor, 73 kDa subunit,
putative [Talaromyces marneffei ATCC 18224]
Length = 866
Score = 43.1 bits (100), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 21/70 (30%), Positives = 40/70 (57%), Gaps = 3/70 (4%)
Query: 51 NPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFFQWCSSPENSIIITNRT 109
P+ F++V+ SL + S V++++P M + G SR+L +W S N +++T +
Sbjct: 333 GPWDFRYVRSLRSLERFDDIGS--CVMLASPGMLQTGTSRELLERWAPSERNGVVMTGYS 390
Query: 110 SPGTLARDLI 119
GT+A+ L+
Sbjct: 391 VEGTMAKQLL 400
>gi|440493393|gb|ELQ75870.1| mRNA cleavage and polyadenylation factor II complex, BRR5 (CPSF
subunit) [Trachipleistophora hominis]
Length = 801
Score = 43.1 bits (100), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 21/71 (29%), Positives = 38/71 (53%), Gaps = 1/71 (1%)
Query: 49 RNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 108
+ + F FKH++ S+ + + + VV+ S ++ G SR+LF WC N +I
Sbjct: 361 KKDAFTFKHIRNLKSV-DNHLIKNACVVMASPGMLQSGLSRELFESWCEDANNGTVIPGY 419
Query: 109 TSPGTLARDLI 119
GTLA++++
Sbjct: 420 CVQGTLAKEIM 430
>gi|346472285|gb|AEO35987.1| hypothetical protein [Amblyomma maculatum]
Length = 510
Score = 43.1 bits (100), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 37/159 (23%), Positives = 66/159 (41%), Gaps = 9/159 (5%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K P P VV + + G S +F
Sbjct: 254 ITWTNQKIRKTF--VQRNMFDFKHIKPFDR--AFIDNPGPMVVFATPGMLHAGLSLQIFK 309
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE-GEELEEYQKKKDKE 152
+W N +I+ GT+ + L G R + L+ ++ + ++ + + D +
Sbjct: 310 KWAPFEANMVIMPGYCVAGTVGHKV--LSGARKVELENRQVVEVKMSVQYMSFSAHADAK 367
Query: 153 AKDKQEKEKIPPHDTSFINELQLSDF--KQTLQRNGIDC 189
+ ++ P + E DF K+ LQ +DC
Sbjct: 368 GIMQLIQQCEPANVLLVHGEAGKMDFLRKKILQEFSVDC 406
>gi|66816359|ref|XP_642189.1| integrator complex subunit 11 [Dictyostelium discoideum AX4]
gi|74856745|sp|Q54YL3.1|INT11_DICDI RecName: Full=Integrator complex subunit 11 homolog
gi|60470287|gb|EAL68267.1| integrator complex subunit 11 [Dictyostelium discoideum AX4]
Length = 744
Score = 43.1 bits (100), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 29/90 (32%), Positives = 43/90 (47%), Gaps = 6/90 (6%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLF 92
I W + K+ ++F + N F FKH+K S L P +VL +TP M G S ++F
Sbjct: 283 INWTNQKIKQTF--VKRNMFDFKHIKPFQS--HLVDAPGA-MVLFATPGMLHAGASLEVF 337
Query: 93 FQWCSSPENSIIITNRTSPGTLARDLIELG 122
+W + N II GT+ L+ G
Sbjct: 338 KKWAPNELNMTIIPGYCVVGTVGNKLLTTG 367
>gi|13541495|ref|NP_111183.1| metal-dependent RNase [Thermoplasma volcanium GSS1]
gi|14324879|dbj|BAB59805.1| cleavage and polyadenylation specificity factor [Thermoplasma
volcanium GSS1]
Length = 639
Score = 43.1 bits (100), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 26/107 (24%), Positives = 54/107 (50%), Gaps = 10/107 (9%)
Query: 35 EWMSDKLMKSFEGARNNPFH---FKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
E+++ +L ++ + NPF FK V+ ++A+ P KVVL ++ M G +
Sbjct: 463 EYLNKELREAIMVRKENPFLSQIFKKVETREQREDIAEDPESKVVLATSGMMNGGPVMEY 522
Query: 92 FFQWCSSPENSIIITNRTSPGTL-------ARDLIELGGNRTLTLQV 131
F W +P+++++ + GT+ A+++ G +T+T Q+
Sbjct: 523 FKAWAGNPKHTLVFVGYQADGTMGKRIQSGAKEVTISEGGKTVTYQI 569
>gi|440801023|gb|ELR22048.1| cleavage and polyadenylation specific factor 3like, putative
[Acanthamoeba castellanii str. Neff]
Length = 657
Score = 43.1 bits (100), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 32/106 (30%), Positives = 50/106 (47%), Gaps = 7/106 (6%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLF 92
I W ++K+ ++F N F FKH+ LA P P +VL +TP M G S ++F
Sbjct: 282 IHWTNEKIKRTF--VHRNMFDFKHISTFER--GLADQPGP-MVLFATPGMLHAGTSLEVF 336
Query: 93 FQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
+W + +N +II GT+ L G + + + R LE
Sbjct: 337 KKWAPNEKNLVIIPGYCVVGTVGNKLAA-GRKGSFKVDLDSRTSLE 381
>gi|449299688|gb|EMC95701.1| hypothetical protein BAUCODRAFT_71003 [Baudoinia compniacensis UAMH
10762]
Length = 938
Score = 42.7 bits (99), Expect = 0.15, Method: Composition-based stats.
Identities = 28/106 (26%), Positives = 50/106 (47%), Gaps = 14/106 (13%)
Query: 34 IEWMSDKLMKSFEG------ARNNPFHFKHVKLCH---SLAELAKVPSPKVVLVSTPDME 84
+EW+ + + E NP +++HVKL +L + P V L S +E
Sbjct: 326 LEWVEESVRAEAEAHLTKTKGSTNPLNWQHVKLVERNSTLERAVQRSQPCVFLASDASLE 385
Query: 85 CGFSRDLFFQWCSSPENSIIITNRTSPG-----TLARDLIELGGNR 125
GFSR + +N +I+T +++PG +++R L +L +R
Sbjct: 386 WGFSRLALESLATDTKNLVILTEKSAPGSPPQSSISRQLWDLWQSR 431
>gi|313238583|emb|CBY13629.1| unnamed protein product [Oikopleura dioica]
Length = 618
Score = 42.7 bits (99), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 29/108 (26%), Positives = 49/108 (45%), Gaps = 8/108 (7%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ W ++K+ SF N F FK++K + + P+V + + G S ++F
Sbjct: 287 VNWTNEKIKSSF--VERNLFDFKYIKAFQKEIHMNQ-SGPQVCFATPGMLHAGMSLEIFQ 343
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTL---TLQVKKRIRLE 138
WC+ +N II+ GT+ L L G R + V RI++E
Sbjct: 344 NWCTDEKNCIIMPGYCVAGTVGHRL--LHGERHFKFNGVNVTSRIKVE 389
>gi|393241063|gb|EJD48587.1| hypothetical protein AURDEDRAFT_183466 [Auricularia delicata
TFB-10046 SS5]
Length = 893
Score = 42.7 bits (99), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 34/131 (25%), Positives = 62/131 (47%), Gaps = 27/131 (20%)
Query: 53 FHFKHVKLCHSLAELAKV-PS--PKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRT 109
F+H++ + A+L PS PK++L ++ G SR +F + S N +++T++
Sbjct: 376 LRFRHLEFFTTYAQLTSTYPSSKPKLILAVPQNISHGSSRAIFTDFASVVGNVVVLTSKG 435
Query: 110 SPGTLARDLI-----------ELGG---------NRTLTLQVKKRIRLEGEELEEYQKKK 149
GTL+R L + G N TL L++ ++ L+G ELE + + +
Sbjct: 436 EQGTLSRMLFDKWNEAQRDGDQYGAGTVGEPVTLNETLKLRMHTKVPLQGAELETHLQAE 495
Query: 150 DKEAKDKQEKE 160
+ QE+E
Sbjct: 496 ----RAAQERE 502
>gi|171689890|ref|XP_001909884.1| hypothetical protein [Podospora anserina S mat+]
gi|170944907|emb|CAP71018.1| unnamed protein product [Podospora anserina S mat+]
Length = 835
Score = 42.7 bits (99), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 24/79 (30%), Positives = 45/79 (56%), Gaps = 4/79 (5%)
Query: 44 SFEGA-RNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFFQWCSSPEN 101
S +GA + P+ FK ++ S+ V V++++P M + G SR+L +W S +N
Sbjct: 325 SGDGAGKGGPWDFKFIRSLKSIDRFEDVGG--CVMLASPGMLQNGVSRELLERWAPSEKN 382
Query: 102 SIIITNRTSPGTLARDLIE 120
+IIT + GT+A+ +++
Sbjct: 383 GVIITGYSVEGTMAKQIMQ 401
>gi|115396064|ref|XP_001213671.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|114193240|gb|EAU34940.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 1005
Score = 42.7 bits (99), Expect = 0.17, Method: Composition-based stats.
Identities = 40/181 (22%), Positives = 73/181 (40%), Gaps = 37/181 (20%)
Query: 26 HSLAELAK--IEWMSDKLMKSFEGARN---------------------NPFHFKHVKLCH 62
H+ LA+ +EWM + +++ FE A PF FK++KL
Sbjct: 339 HTTMRLARSMLEWMDENIVREFEAAEGVDATTGQSIQRPGGQKDEKGVGPFTFKNLKLVE 398
Query: 63 SLAELAKV---PSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLI 119
+L K+ +PKV+L S ++ GF+++ N +++T L D
Sbjct: 399 RRKKLEKILADQTPKVILASDSSLDWGFAKESLRLIAEGSNNLLLLTESLHKERLQED-- 456
Query: 120 ELGGNRTLTLQVKKRIRLEGEELEEYQKKKDKEAKDKQEKEKIPPHDTSFINELQLSDFK 179
+ +R L G + Y+++KD A +K ++ S EL +D +
Sbjct: 457 ---------QEGAQRRTLGGMIWDWYEERKDGVALEKAADGEMIEQVHSGGRELSWTDVR 507
Query: 180 Q 180
+
Sbjct: 508 R 508
>gi|341038970|gb|EGS23962.1| hypothetical protein CTHT_0006720 [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 894
Score = 42.7 bits (99), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 23/87 (26%), Positives = 45/87 (51%), Gaps = 3/87 (3%)
Query: 35 EWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFF 93
E M++ + P+ FK ++ S+ V V++++P M + G SR+L
Sbjct: 315 ERMAEAEASGDSAGKGGPWDFKFIRSLKSIDRFEDVGG--CVMLASPGMLQNGVSRELLE 372
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIE 120
+W + +N +IIT + GT+A+ L++
Sbjct: 373 RWAPNEKNGVIITGYSVEGTMAKQLMQ 399
>gi|167525469|ref|XP_001747069.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163774364|gb|EDQ87993.1| predicted protein [Monosiga brevicollis MX1]
Length = 730
Score = 42.7 bits (99), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 32/111 (28%), Positives = 52/111 (46%), Gaps = 9/111 (8%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVK-LCHSLAELAKVPSPKVVLVSTPDMECGFSRDLF 92
I W + KL ++F N F F+HV+ S E A P+V+ + + G S F
Sbjct: 288 ITWTNQKLKETF--VERNLFDFQHVQPFDRSYLEHA---GPQVLFATPGMLHAGTSLLAF 342
Query: 93 FQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKK-RIRLEGEEL 142
WC P N +I+ + GT+ +I G R L ++ + +R++ E L
Sbjct: 343 THWCEDPRNMVILPGYCTAGTVGAKII--AGIRELDIEGRHYTVRMDVEYL 391
>gi|367034742|ref|XP_003666653.1| hypothetical protein MYCTH_2311535 [Myceliophthora thermophila ATCC
42464]
gi|347013926|gb|AEO61408.1| hypothetical protein MYCTH_2311535 [Myceliophthora thermophila ATCC
42464]
Length = 879
Score = 42.4 bits (98), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 24/79 (30%), Positives = 45/79 (56%), Gaps = 4/79 (5%)
Query: 44 SFEGA-RNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFFQWCSSPEN 101
S +GA + P+ FK ++ S+ V V++++P M + G SR+L +W S +N
Sbjct: 324 SGDGAGKGGPWDFKFIRSLKSIDRFEDVGG--CVMLASPGMLQNGVSRELLERWAPSEKN 381
Query: 102 SIIITNRTSPGTLARDLIE 120
+IIT + GT+A+ +++
Sbjct: 382 GVIITGYSVEGTMAKHIMQ 400
>gi|336259697|ref|XP_003344648.1| hypothetical protein SMAC_07216 [Sordaria macrospora k-hell]
gi|380088385|emb|CCC13649.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 857
Score = 42.4 bits (98), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 24/79 (30%), Positives = 45/79 (56%), Gaps = 4/79 (5%)
Query: 44 SFEGA-RNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFFQWCSSPEN 101
S +GA + P+ FK ++ S+ V V++++P M + G SR+L +W S +N
Sbjct: 323 SGDGAGKGGPWDFKFIRSLKSIDRFEDVGG--CVMLASPGMLQNGVSRELLERWAPSEKN 380
Query: 102 SIIITNRTSPGTLARDLIE 120
+IIT + GT+A+ +++
Sbjct: 381 GVIITGYSVEGTMAKHIMQ 399
>gi|392593024|gb|EIW82350.1| hypothetical protein CONPUDRAFT_54247 [Coniophora puteana
RWD-64-598 SS2]
Length = 926
Score = 42.4 bits (98), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 30/109 (27%), Positives = 50/109 (45%), Gaps = 26/109 (23%)
Query: 73 PKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLI------------- 119
PK++L + G SR LF + + P+N +++T R GTL + L
Sbjct: 404 PKIILAVPASLSHGPSRSLFVDFAAVPDNVVLLTGRGEEGTLGQILFGRWNDSQRADDKW 463
Query: 120 ---ELGGN----RTLTLQVKKRIRLEGEELEEYQKKKDKEAKDKQEKEK 161
++G N + L++ ++ L+G ELE Y AK++ KEK
Sbjct: 464 DKGKIGRNVMMDGAMRLKMSSKVPLQGTELELYL------AKERATKEK 506
>gi|407929750|gb|EKG22561.1| RNA-metabolising metallo-beta-lactamase [Macrophomina phaseolina
MS6]
Length = 974
Score = 42.4 bits (98), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 42/192 (21%), Positives = 72/192 (37%), Gaps = 59/192 (30%)
Query: 14 NNPFHFKHVKLCHSLAELAK--IEWMSDKLMKSFEGAR---------------------- 49
N P + + C + A+ +EWM + +++ FE A
Sbjct: 308 NAPLYLAS-RTCAATMRYARSMLEWMDEGIVREFEAASSGQGTDDNKRSRTQQGSGRSKE 366
Query: 50 --------NNPFHFKHVKLCHSLAELAKV---PSPKVVLVSTPDMECGFSRDLFFQWCSS 98
N PF F+ ++L +++++ PKV+L S +E GFS++ +
Sbjct: 367 GKEDAKKPNAPFDFRSLRLVERRTQVSRMLAAEGPKVILASDVSLEWGFSKEAVRALAAD 426
Query: 99 PENSIIITNRT-----SPGTLARDLIELGGNRT------------------LTLQVKKRI 135
N +I+T R S L R L EL R+ ++V +
Sbjct: 427 SRNLVILTERIAQPSDSKKGLGRLLYELWAERSGANESAENPSPQHAAGTEAAVEVVQAA 486
Query: 136 RLEGEELEEYQK 147
LEG E+ YQ+
Sbjct: 487 PLEGNEVPLYQQ 498
>gi|146417489|ref|XP_001484713.1| hypothetical protein PGUG_02442 [Meyerozyma guilliermondii ATCC
6260]
Length = 821
Score = 42.4 bits (98), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 41/156 (26%), Positives = 68/156 (43%), Gaps = 33/156 (21%)
Query: 34 IEWMSDKLMKSFEGA---------RNN-PFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM 83
+EWMS L+K +E A +NN PF V L EL ++ PK+VL + DM
Sbjct: 173 LEWMSTLLVKEWEAASSASMNSTNKNNFPFDPSKVDLLLDPKELIQLSGPKIVLCAGIDM 232
Query: 84 ECG-FSRDLFFQWCSSPENSIIITNRTSPGT---------------------LARDLIEL 121
G S ++ C +N++++T +T G A D + +
Sbjct: 233 NSGDVSFEVLKYLCLDQKNTVLLTEKTHFGADFSINAQLFTDWVRLSREKYGNAEDGLAI 292
Query: 122 GGNRTLTLQ-VKKRIRLEGEELEEYQKKKDKEAKDK 156
G T+ L+ + + L G EL +Q++ + + K K
Sbjct: 293 GYEGTIPLRGLSREDPLSGSELTSFQERINHQRKKK 328
>gi|212543221|ref|XP_002151765.1| cleavage and polyadenylylation specificity factor, putative
[Talaromyces marneffei ATCC 18224]
gi|210066672|gb|EEA20765.1| cleavage and polyadenylylation specificity factor, putative
[Talaromyces marneffei ATCC 18224]
Length = 1015
Score = 42.4 bits (98), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 28/117 (23%), Positives = 49/117 (41%), Gaps = 35/117 (29%)
Query: 25 CHSLAELAK--IEWMSDKLMKSFE------------------------------GARNNP 52
HS LA+ +EWM + +++ FE G + P
Sbjct: 342 AHSTMRLARSMLEWMDEGIVREFEAVEGGDAAAVRGHKTTDSQNRNAGVTRDKQGTKLGP 401
Query: 53 FHFKHVKLCHSLAELAKVPS---PKVVLVSTPDMECGFSRDLFFQWCSSPENSIIIT 106
F KH+K+ +L KV + PKV++ S ++ G+S++ F +N I++T
Sbjct: 402 FTLKHLKIVEQKRKLEKVLADGIPKVIIASDTSLDWGYSKETFRTLAQGSQNLILLT 458
>gi|167526212|ref|XP_001747440.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163774275|gb|EDQ87907.1| predicted protein [Monosiga brevicollis MX1]
Length = 668
Score = 42.4 bits (98), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 24/72 (33%), Positives = 39/72 (54%), Gaps = 3/72 (4%)
Query: 50 NNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFFQWCSSPENSIIITNR 108
+NPF F+ + L E S V+++TP M + G SR+LF +W + N +I+
Sbjct: 312 SNPFQFQFIHNLRKLDEFDDHGSS--VVLATPGMLQNGLSRELFERWAPNRHNGVILAGY 369
Query: 109 TSPGTLARDLIE 120
GTLA +L++
Sbjct: 370 HVEGTLAHELLK 381
>gi|156083689|ref|XP_001609328.1| cleavage and polyadenylation specifity factor [Babesia bovis T2Bo]
gi|154796579|gb|EDO05760.1| cleavage and polyadenylation specifity factor [Babesia bovis]
Length = 709
Score = 42.0 bits (97), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 23/71 (32%), Positives = 38/71 (53%), Gaps = 3/71 (4%)
Query: 51 NPFHFKHVKLCHSLAELA---KVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITN 107
NPF+F VK S+AE++ + P +V+ S ++ G S +F + N +++T
Sbjct: 337 NPFNFTFVKYARSVAEISQYLQADGPCIVMTSPGMLQGGPSLQIFEKIAPDSRNGVVLTG 396
Query: 108 RTSPGTLARDL 118
T GTLA +L
Sbjct: 397 YTVKGTLADEL 407
>gi|330923041|ref|XP_003300074.1| hypothetical protein PTT_11224 [Pyrenophora teres f. teres 0-1]
gi|311325959|gb|EFQ91831.1| hypothetical protein PTT_11224 [Pyrenophora teres f. teres 0-1]
Length = 705
Score = 42.0 bits (97), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 23/72 (31%), Positives = 36/72 (50%), Gaps = 1/72 (1%)
Query: 48 ARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITN 107
R + FK V+ SL + V+L S M+ G SR+L +W P N +IIT
Sbjct: 328 GRRGAWDFKFVRSLKSLERFDDLGG-CVMLASPGMMQSGTSRELLERWAPDPRNGVIITG 386
Query: 108 RTSPGTLARDLI 119
+ GT+A+ ++
Sbjct: 387 YSVEGTMAKQIV 398
>gi|302501173|ref|XP_003012579.1| hypothetical protein ARB_01192 [Arthroderma benhamiae CBS 112371]
gi|291176138|gb|EFE31939.1| hypothetical protein ARB_01192 [Arthroderma benhamiae CBS 112371]
Length = 991
Score = 42.0 bits (97), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 35/131 (26%), Positives = 50/131 (38%), Gaps = 38/131 (29%)
Query: 13 RNNPFHFKHVKLCHSLAELAK--IEWMSDKLMKSFE--------------GARN------ 50
+N P + K H LA+ +EWM + +++ FE GA N
Sbjct: 295 KNTPLYLAGKK-AHDTMRLARSMLEWMDENIVREFEGNDGVEATTGKAAGGASNQPSKGV 353
Query: 51 ------------NPFHFKHVKLCHSLAELAKV---PSPKVVLVSTPDMECGFSRDLFFQW 95
PF FKH+ L A+L V PKV+L +E G S+ +
Sbjct: 354 QSQKSATGQKSLGPFTFKHLNLVEHKAKLDGVLESKGPKVILSPDTSLEWGLSKHVLKHI 413
Query: 96 CSSPENSIIIT 106
EN II+T
Sbjct: 414 AEGNENLIIMT 424
>gi|402217247|gb|EJT97328.1| Metallo-hydrolase/oxidoreductase [Dacryopinax sp. DJM-731 SS1]
Length = 780
Score = 42.0 bits (97), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 24/73 (32%), Positives = 38/73 (52%), Gaps = 3/73 (4%)
Query: 50 NNPFHFKHVK-LCHSLAELAKVPS--PKVVLVSTPDMECGFSRDLFFQWCSSPENSIIIT 106
NPF F+HV + S K+ P V+L S M+ G SR+L W N I++T
Sbjct: 306 QNPFVFRHVSHIPRSSGWEKKIGEGPPCVILASPGMMQSGASRELLEMWAPDRRNGIVLT 365
Query: 107 NRTSPGTLARDLI 119
+ G++AR+++
Sbjct: 366 GYSVEGSMARNIM 378
>gi|330796066|ref|XP_003286090.1| hypothetical protein DICPUDRAFT_30371 [Dictyostelium purpureum]
gi|325083909|gb|EGC37349.1| hypothetical protein DICPUDRAFT_30371 [Dictyostelium purpureum]
Length = 468
Score = 42.0 bits (97), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 31/106 (29%), Positives = 50/106 (47%), Gaps = 7/106 (6%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLF 92
I W + K+ ++F + N F FKH+K S L P +VL +TP M G S ++F
Sbjct: 283 INWTNQKIKQTF--VKRNMFDFKHIKPFQS--HLVDAPG-AMVLFATPGMLHAGASLEVF 337
Query: 93 FQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
+W + N II GT+ L+ + +++ K+ LE
Sbjct: 338 KKWAPNELNMTIIPGYCVVGTVGNKLLANNAGSQM-VEIDKKTTLE 382
>gi|189208340|ref|XP_001940503.1| endoribonuclease YSH1 [Pyrenophora tritici-repentis Pt-1C-BFP]
gi|187976596|gb|EDU43222.1| endoribonuclease YSH1 [Pyrenophora tritici-repentis Pt-1C-BFP]
Length = 871
Score = 42.0 bits (97), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 23/72 (31%), Positives = 36/72 (50%), Gaps = 1/72 (1%)
Query: 48 ARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITN 107
R + FK V+ SL + V+L S M+ G SR+L +W P N +IIT
Sbjct: 328 GRRGAWDFKFVRSLKSLERFDDLGG-CVMLASPGMMQSGTSRELLERWAPDPRNGVIITG 386
Query: 108 RTSPGTLARDLI 119
+ GT+A+ ++
Sbjct: 387 YSVEGTMAKQIV 398
>gi|428177137|gb|EKX46018.1| hypothetical protein GUITHDRAFT_70813 [Guillardia theta CCMP2712]
Length = 485
Score = 42.0 bits (97), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 27/91 (29%), Positives = 44/91 (48%), Gaps = 6/91 (6%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLF 92
I W + K+ +F + N F F+H++ P P +VL +TP M G S ++F
Sbjct: 284 ISWTNQKIKDTF--VKRNVFDFQHIQPFDR--AFIDRPGP-MVLFATPGMLHGGLSMEVF 338
Query: 93 FQWCSSPENSIIITNRTSPGTLARDLIELGG 123
+W S +N +I+ GTL ++ GG
Sbjct: 339 KKWAPSDKNLVIMPGYCVAGTLGHKVLSNGG 369
>gi|399216074|emb|CCF72762.1| unnamed protein product [Babesia microti strain RI]
Length = 725
Score = 42.0 bits (97), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 23/71 (32%), Positives = 37/71 (52%), Gaps = 3/71 (4%)
Query: 51 NPFHFKHVKLCHSLAELAK---VPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITN 107
NP+HF H+K ++ + +P V++ S ++ G SRD+F N +I+T
Sbjct: 308 NPYHFTHIKYAPTVDSVRSHLLRDAPCVIMTSPGMLQGGPSRDVFEIIAPDNRNGVILTG 367
Query: 108 RTSPGTLARDL 118
T GTLA +L
Sbjct: 368 YTVKGTLADEL 378
>gi|396488788|ref|XP_003842943.1| similar to cleavage and polyadenylation specifity factor
[Leptosphaeria maculans JN3]
gi|312219521|emb|CBX99464.1| similar to cleavage and polyadenylation specifity factor
[Leptosphaeria maculans JN3]
Length = 861
Score = 42.0 bits (97), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 23/72 (31%), Positives = 36/72 (50%), Gaps = 1/72 (1%)
Query: 48 ARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITN 107
R + FK V+ SL + V+L S M+ G SR+L +W P N +IIT
Sbjct: 328 GRRGAWDFKFVRSLKSLERFDDLGG-CVMLASPGMMQSGTSRELLERWAPDPRNGVIITG 386
Query: 108 RTSPGTLARDLI 119
+ GT+A+ ++
Sbjct: 387 YSVEGTMAKQIV 398
>gi|303310723|ref|XP_003065373.1| hypothetical protein CPC735_045980 [Coccidioides posadasii C735
delta SOWgp]
gi|240105035|gb|EER23228.1| hypothetical protein CPC735_045980 [Coccidioides posadasii C735
delta SOWgp]
Length = 1026
Score = 42.0 bits (97), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 28/130 (21%), Positives = 50/130 (38%), Gaps = 42/130 (32%)
Query: 23 KLCHSLAELAK--IEWMSDKLMKSFEGARNN----------------------------- 51
K HS LA+ +EWM + +++ FEG
Sbjct: 341 KKVHSTMRLARSMLEWMDESIVREFEGGDGGESLGAGRSSGAASGQQSKGTPGQTSDKKS 400
Query: 52 --------PFHFKHVKLCH---SLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPE 100
PF F+H+K+ L + + PKV++ S ++ GFS+++ E
Sbjct: 401 AGPHKGLGPFTFRHLKIIERKTKLENILRSEGPKVIIASDASLDWGFSKEILRHVAQGAE 460
Query: 101 NSIIITNRTS 110
N +I+T ++
Sbjct: 461 NLVILTESSA 470
>gi|389634325|ref|XP_003714815.1| endoribonuclease YSH1 [Magnaporthe oryzae 70-15]
gi|351647148|gb|EHA55008.1| endoribonuclease YSH1 [Magnaporthe oryzae 70-15]
gi|440467574|gb|ELQ36790.1| endoribonuclease YSH1 [Magnaporthe oryzae Y34]
gi|440483131|gb|ELQ63565.1| endoribonuclease YSH1 [Magnaporthe oryzae P131]
Length = 829
Score = 42.0 bits (97), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 20/69 (28%), Positives = 40/69 (57%), Gaps = 1/69 (1%)
Query: 52 PFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSP 111
P+ FK+++ +L + P V+L S ++ G SR+L +W S +N ++IT +
Sbjct: 334 PWDFKYIRSLKNLDRFDDL-GPCVMLASPGMLQNGVSRELLERWAPSDKNGVVITGYSVE 392
Query: 112 GTLARDLIE 120
GT+A+ +++
Sbjct: 393 GTMAKQIMQ 401
>gi|67517547|ref|XP_658594.1| hypothetical protein AN0990.2 [Aspergillus nidulans FGSC A4]
gi|74598547|sp|Q5BEP0.1|YSH1_EMENI RecName: Full=Endoribonuclease ysh1; AltName: Full=mRNA
3'-end-processing protein ysh1
gi|40746402|gb|EAA65558.1| hypothetical protein AN0990.2 [Aspergillus nidulans FGSC A4]
gi|259488717|tpe|CBF88384.1| TPA: Endoribonuclease ysh1 (EC 3.1.27.-)(mRNA 3'-end-processing
protein ysh1) [Source:UniProtKB/Swiss-Prot;Acc:Q5BEP0]
[Aspergillus nidulans FGSC A4]
Length = 884
Score = 42.0 bits (97), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 25/90 (27%), Positives = 47/90 (52%), Gaps = 9/90 (10%)
Query: 31 LAKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSR 89
+A+ E DK + + P+ FK+V+ SL V V++++P M + G SR
Sbjct: 318 MAEAEASGDKSVSA------GPWDFKYVRSLRSLERFDDVGG--CVMLASPGMLQTGTSR 369
Query: 90 DLFFQWCSSPENSIIITNRTSPGTLARDLI 119
+L +W + N +++T + GT+A+ L+
Sbjct: 370 ELLERWAPNERNGVVMTGYSVEGTMAKQLL 399
>gi|451852830|gb|EMD66124.1| hypothetical protein COCSADRAFT_34708 [Cochliobolus sativus ND90Pr]
Length = 872
Score = 42.0 bits (97), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 23/72 (31%), Positives = 36/72 (50%), Gaps = 1/72 (1%)
Query: 48 ARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITN 107
R + FK V+ SL + V+L S M+ G SR+L +W P N +IIT
Sbjct: 328 GRRGAWDFKFVRSLKSLERFDDLGG-CVMLASPGMMQSGTSRELLERWAPDPRNGVIITG 386
Query: 108 RTSPGTLARDLI 119
+ GT+A+ ++
Sbjct: 387 YSVEGTMAKQIV 398
>gi|396082329|gb|AFN83939.1| putative beta-lactamase fold-containingexonuclease [Encephalitozoon
romaleae SJ-2008]
Length = 496
Score = 42.0 bits (97), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 30/93 (32%), Positives = 43/93 (46%), Gaps = 5/93 (5%)
Query: 51 NPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFFQWCSSPENSIIITNRT 109
N F +KH+K K P +VL ++P M G S +F +WC +N +II
Sbjct: 293 NVFEYKHIKPFQKYYMENKGP---MVLFASPGMLHSGMSLRMFKEWCEDEKNLVIIPGYC 349
Query: 110 SPGTLARDLIELGGNRTLTLQVKKRIRLEGEEL 142
GT+ ++ G R L +K IRLE L
Sbjct: 350 VRGTIGEKVLN-GAKRLEVLGEEKDIRLEVRNL 381
>gi|156403103|ref|XP_001639929.1| predicted protein [Nematostella vectensis]
gi|156227060|gb|EDO47866.1| predicted protein [Nematostella vectensis]
Length = 527
Score = 42.0 bits (97), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 27/102 (26%), Positives = 44/102 (43%), Gaps = 6/102 (5%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ +F + N F F+H+K P P VV + + G S +F
Sbjct: 254 ITWTNQKIKNTF--VQRNMFEFEHIKPFDR--SYIDNPGPMVVFATPGMLHAGLSLQIFK 309
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRI 135
+W S+ N ++I GT+ + L G + + L K I
Sbjct: 310 KWASNENNMVVIPGYCVAGTVGHKV--LAGQKKIELDKKNVI 349
>gi|320034772|gb|EFW16715.1| cleavage and polyadenylylation specificity factor [Coccidioides
posadasii str. Silveira]
Length = 1026
Score = 42.0 bits (97), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 28/130 (21%), Positives = 50/130 (38%), Gaps = 42/130 (32%)
Query: 23 KLCHSLAELAK--IEWMSDKLMKSFEGARNN----------------------------- 51
K HS LA+ +EWM + +++ FEG
Sbjct: 341 KKVHSTMRLARSMLEWMDESIVREFEGGDGGESLGAGRSSGAASGQQSKGTPGQTSDKKS 400
Query: 52 --------PFHFKHVKLCH---SLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPE 100
PF F+H+K+ L + + PKV++ S ++ GFS+++ E
Sbjct: 401 AGPHKGLGPFTFRHLKIIERKTKLENILRSEGPKVIIASDASLDWGFSKEILRHVAQGAE 460
Query: 101 NSIIITNRTS 110
N +I+T ++
Sbjct: 461 NLVILTESSA 470
>gi|402226056|gb|EJU06116.1| hypothetical protein DACRYDRAFT_73414 [Dacryopinax sp. DJM-731 SS1]
Length = 925
Score = 42.0 bits (97), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 33/119 (27%), Positives = 54/119 (45%), Gaps = 23/119 (19%)
Query: 50 NNPFHFKHVKLCHS---LAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIIT 106
N FKH+++ S L + +PKV+L + M G SR +F + S N +++
Sbjct: 381 NMILRFKHLQIFPSPEALMDAIHPSAPKVILATPLTMSHGASRAMFESFSSMRNNLLLLV 440
Query: 107 NRTSPGTLARDLIEL-----------GGNRT---------LTLQVKKRIRLEGEELEEY 145
N GTLAR L ++ G R +++++ ++ L G ELEEY
Sbjct: 441 NIAEKGTLARSLWDIWQREQAETAKWGKGRLGAIVKAETDISVRMNAKVPLAGVELEEY 499
>gi|310796189|gb|EFQ31650.1| metallo-beta-lactamase superfamily protein [Glomerella graminicola
M1.001]
Length = 855
Score = 41.6 bits (96), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 22/87 (25%), Positives = 46/87 (52%), Gaps = 3/87 (3%)
Query: 35 EWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFF 93
E M++ + P+ FK+++ +L V V++++P M + G SR+L
Sbjct: 316 ERMAEAEASGDGSGKGGPWDFKYIRSLKNLDRFDDVGG--CVMLASPGMLQNGVSRELLE 373
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIE 120
+W + +N +IIT + GT+A+ +++
Sbjct: 374 RWAPNDKNGVIITGYSVEGTMAKQIMQ 400
>gi|241245173|ref|XP_002402434.1| cleavage and polyadenylation specificity factor, putative [Ixodes
scapularis]
gi|215496345|gb|EEC05985.1| cleavage and polyadenylation specificity factor, putative [Ixodes
scapularis]
Length = 596
Score = 41.6 bits (96), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 27/105 (25%), Positives = 46/105 (43%), Gaps = 6/105 (5%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K P P VV + + G S +F
Sbjct: 283 ITWTNQKIRKTF--VQRNMFDFKHIKPFDR--AFIDNPGPMVVFATPGMLHAGLSLQIFK 338
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
+W N +I+ GT+ + L G R + L ++ + ++
Sbjct: 339 KWAPVEGNMVIMPGYCVAGTVGHKI--LSGARKVELDNRQVVEVK 381
>gi|442751667|gb|JAA67993.1| Putative cleavage and polyadenylation specificity factor cpsf
subunit [Ixodes ricinus]
Length = 596
Score = 41.6 bits (96), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 27/105 (25%), Positives = 46/105 (43%), Gaps = 6/105 (5%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K P P VV + + G S +F
Sbjct: 283 ITWTNQKIRKTF--VQRNMFDFKHIKPFDR--AFIDNPGPMVVFATPGMLHAGLSLQIFK 338
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
+W N +I+ GT+ + L G R + L ++ + ++
Sbjct: 339 KWAPVEGNMVIMPGYCVAGTVGHKI--LSGARKVELDNRQVVEVK 381
>gi|429862463|gb|ELA37111.1| cleavage and polyadenylation specifity 73 kda [Colletotrichum
gloeosporioides Nara gc5]
Length = 831
Score = 41.6 bits (96), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 22/87 (25%), Positives = 46/87 (52%), Gaps = 3/87 (3%)
Query: 35 EWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFF 93
E M++ + P+ FK+++ +L V V++++P M + G SR+L
Sbjct: 312 ERMAEAEASGDGSGKGGPWDFKYIRSLKNLDRFDDVGG--CVMLASPGMLQNGVSRELLE 369
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIE 120
+W + +N +IIT + GT+A+ +++
Sbjct: 370 RWAPNEKNGVIITGYSVEGTMAKQIMQ 396
>gi|242786013|ref|XP_002480717.1| cleavage and polyadenylylation specificity factor, putative
[Talaromyces stipitatus ATCC 10500]
gi|218720864|gb|EED20283.1| cleavage and polyadenylylation specificity factor, putative
[Talaromyces stipitatus ATCC 10500]
Length = 1017
Score = 41.6 bits (96), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 30/119 (25%), Positives = 51/119 (42%), Gaps = 35/119 (29%)
Query: 23 KLCHSLAELAK--IEWMSDKLMKSFEG------------------------ARNN----- 51
K HS LA+ +EWM + +++ FE +R+N
Sbjct: 340 KKAHSTMRLARSMLEWMDEGIVREFEAVEGGDAAAARGHKRTDSQSRTTGSSRDNKATKL 399
Query: 52 -PFHFKHVKLCHSLAELAKVPS---PKVVLVSTPDMECGFSRDLFFQWCSSPENSIIIT 106
PF KH+K+ +L K+ PKV++ S ++ G+S++ F +N II+T
Sbjct: 400 GPFTLKHLKIVEQKRKLEKILGDGIPKVIIASDTSLDWGYSKETFRTLAEDSQNLIILT 458
>gi|302661813|ref|XP_003022569.1| hypothetical protein TRV_03308 [Trichophyton verrucosum HKI 0517]
gi|291186522|gb|EFE41951.1| hypothetical protein TRV_03308 [Trichophyton verrucosum HKI 0517]
Length = 1024
Score = 41.6 bits (96), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 34/131 (25%), Positives = 50/131 (38%), Gaps = 38/131 (29%)
Query: 13 RNNPFHFKHVKLCHSLAELAK--IEWMSDKLMKSFE--------------GARN------ 50
+N P + K H LA+ +EWM + +++ FE GA N
Sbjct: 328 KNTPLYLAGKK-AHGTMRLARSMLEWMDENIVREFEGNDGVEATTGKAAGGASNQPSKGA 386
Query: 51 ------------NPFHFKHVKLCHSLAELAKV---PSPKVVLVSTPDMECGFSRDLFFQW 95
PF FKH+ L A+L + PKV+L +E G S+ +
Sbjct: 387 QSQKSATGQKSLGPFTFKHLNLVEHKAKLDGILESKGPKVILSPDTSLEWGLSKHVLKHI 446
Query: 96 CSSPENSIIIT 106
EN II+T
Sbjct: 447 AEGNENLIIMT 457
>gi|452002411|gb|EMD94869.1| hypothetical protein COCHEDRAFT_1222148 [Cochliobolus
heterostrophus C5]
Length = 872
Score = 41.6 bits (96), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 23/72 (31%), Positives = 36/72 (50%), Gaps = 1/72 (1%)
Query: 48 ARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITN 107
R + FK V+ SL + V+L S M+ G SR+L +W P N +IIT
Sbjct: 328 GRRGAWDFKFVRSLKSLERFDDLGG-CVMLASPGMMQSGTSRELLERWAPDPRNGVIITG 386
Query: 108 RTSPGTLARDLI 119
+ GT+A+ ++
Sbjct: 387 YSVEGTMAKHIV 398
>gi|389585470|dbj|GAB68201.1| cleavage and polyadenylation specifity factor protein, partial
[Plasmodium cynomolgi strain B]
Length = 936
Score = 41.6 bits (96), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 25/72 (34%), Positives = 39/72 (54%), Gaps = 4/72 (5%)
Query: 51 NPFHFKHVKLCHSLAE----LAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIIT 106
NPF+FK VK SL L + +P VV+ S ++ G S+++F ++ +I+T
Sbjct: 355 NPFNFKFVKYAKSLDSILNYLYQDNNPCVVMASPGMLQNGISKNIFNIIAPDKKSGVILT 414
Query: 107 NRTSPGTLARDL 118
T GTLA +L
Sbjct: 415 GYTVKGTLADEL 426
>gi|427779771|gb|JAA55337.1| Putative mrna cleavage and polyadenylation factor ii complex brr5
cpsf subunit [Rhipicephalus pulchellus]
Length = 621
Score = 41.6 bits (96), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 27/105 (25%), Positives = 46/105 (43%), Gaps = 6/105 (5%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K P P VV + + G S +F
Sbjct: 317 ITWTNQKIRKTF--VQRNMFDFKHIKPFDR--AFIDNPGPMVVFATPGMLHAGLSLQIFK 372
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
+W N +I+ GT+ + L G R + L ++ + ++
Sbjct: 373 KWAPFEANMVIMPGYCVAGTVGHKI--LSGARKVELDNRQVVEVK 415
>gi|256084683|ref|XP_002578556.1| cleavage and polyadenylation specificity factor [Schistosoma
mansoni]
gi|350644758|emb|CCD60512.1| cleavage and polyadenylation specificity factor,putative
[Schistosoma mansoni]
Length = 619
Score = 41.6 bits (96), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 28/95 (29%), Positives = 43/95 (45%), Gaps = 8/95 (8%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAE-LAKVPSPKVVLVSTPDMECGFSRDLF 92
I W + K+ ++F + N F FKH+K L + P P VV + + G S +F
Sbjct: 283 ISWTNQKIKETF--VKRNMFDFKHIK---PLGQGTVDNPGPMVVFATPGMLHAGQSLHIF 337
Query: 93 FQWCSSPENSIIITNRTSPGTLARDLIELGGNRTL 127
+W S N ++I GT+ + L G R L
Sbjct: 338 RKWASDERNMVVIPGYCVAGTVGYKI--LNGVRRL 370
>gi|340381556|ref|XP_003389287.1| PREDICTED: integrator complex subunit 11-like [Amphimedon
queenslandica]
Length = 610
Score = 41.6 bits (96), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 22/86 (25%), Positives = 37/86 (43%), Gaps = 4/86 (4%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ +F N F FKH+K P P +V + + G S +F
Sbjct: 284 ISWTNQKIKNTF--IHRNMFDFKHIKAFDR--SYIDQPGPMIVFATPGMLHAGLSLQIFK 339
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
+W +N +I+ PGT+ ++
Sbjct: 340 KWAEDEKNMLIMPGYCVPGTIGHKVL 365
>gi|427785581|gb|JAA58242.1| Putative mrna cleavage and polyadenylation factor ii complex brr5
cpsf subunit [Rhipicephalus pulchellus]
Length = 587
Score = 41.6 bits (96), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 27/105 (25%), Positives = 46/105 (43%), Gaps = 6/105 (5%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K P P VV + + G S +F
Sbjct: 283 ITWTNQKIRKTF--VQRNMFDFKHIKPFDR--AFIDNPGPMVVFATPGMLHAGLSLQIFK 338
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
+W N +I+ GT+ + L G R + L ++ + ++
Sbjct: 339 KWAPFEANMVIMPGYCVAGTVGHKI--LSGARKVELDNRQVVEVK 381
>gi|380494427|emb|CCF33158.1| endoribonuclease YSH1 [Colletotrichum higginsianum]
Length = 846
Score = 41.2 bits (95), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 22/87 (25%), Positives = 46/87 (52%), Gaps = 3/87 (3%)
Query: 35 EWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFF 93
E M++ + P+ FK+++ +L V V++++P M + G SR+L
Sbjct: 316 ERMAEAEASGDGSGKGGPWDFKYIRSLKNLDRFDDVGG--CVMLASPGMLQNGVSRELLE 373
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIE 120
+W + +N +IIT + GT+A+ +++
Sbjct: 374 RWAPNDKNGVIITGYSVEGTMAKQIMQ 400
>gi|67525249|ref|XP_660686.1| hypothetical protein AN3082.2 [Aspergillus nidulans FGSC A4]
gi|40744477|gb|EAA63653.1| hypothetical protein AN3082.2 [Aspergillus nidulans FGSC A4]
gi|259485970|tpe|CBF83440.1| TPA: cleavage and polyadenylylation specificity factor, putative
(AFU_orthologue; AFUA_3G09720) [Aspergillus nidulans
FGSC A4]
Length = 1005
Score = 41.2 bits (95), Expect = 0.45, Method: Composition-based stats.
Identities = 26/102 (25%), Positives = 45/102 (44%), Gaps = 28/102 (27%)
Query: 34 IEWMSDKLMKSFEGA-------RNN------------------PFHFKHVKLCH---SLA 65
+EWM + +++ FE A +NN PF FKH+K L
Sbjct: 347 LEWMDESIVREFEAAEAADTAGQNNDGQRSDQRQGKTDNKGLGPFTFKHLKTVERKKKLE 406
Query: 66 ELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITN 107
+L P+PKV+L S ++ GF+++ N +++T+
Sbjct: 407 QLLNDPTPKVILASDSSLDWGFAKESLRLLAGGENNLLLLTD 448
>gi|320170221|gb|EFW47120.1| integrator complex subunit 11 [Capsaspora owczarzaki ATCC 30864]
Length = 661
Score = 41.2 bits (95), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 29/106 (27%), Positives = 50/106 (47%), Gaps = 8/106 (7%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLF 92
I W + K+ ++F N F FKH+K P P +VL +TP M G S D F
Sbjct: 282 ITWTNQKIKRTF--VERNMFEFKHIKPFDR--AFLDNPGP-MVLFATPGMLHAGMSLDAF 336
Query: 93 FQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
+W + +N +I+ GT+ + L G++ + + + R ++
Sbjct: 337 RKWAPNDKNMVILPGYCVAGTVGNKV--LAGHKQIEMPDRARTVID 380
>gi|27372065|gb|AAN87883.1| FEG protein [Arabidopsis thaliana]
Length = 613
Score = 41.2 bits (95), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 28/87 (32%), Positives = 40/87 (45%), Gaps = 6/87 (6%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMEC-GFSRDLF 92
I W S + + +NPF FK+VK L P P VL + P M C G S ++F
Sbjct: 282 ISWTSQNVKEKHN--THNPFDFKNVKDFDR--SLIHAPGP-CVLFAIPGMLCAGLSLEVF 336
Query: 93 FQWCSSPENSIIITNRTSPGTLARDLI 119
W SP N + + + GT+ L+
Sbjct: 337 KHWAPSPLNLVALLGYSVAGTVGHKLM 363
>gi|170595519|ref|XP_001902415.1| RNA-metabolising metallo-beta-lactamase family protein [Brugia
malayi]
gi|158589929|gb|EDP28737.1| RNA-metabolising metallo-beta-lactamase family protein [Brugia
malayi]
Length = 589
Score = 41.2 bits (95), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 29/88 (32%), Positives = 45/88 (51%), Gaps = 8/88 (9%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPK-VVLVSTPDM-ECGFSRDL 91
I W ++K+ ++F N F FKH++ E + + SP +VL STP M G S +
Sbjct: 284 ISWTNEKIKRTF--VERNMFDFKHIRPF----EQSYIESPGPMVLFSTPGMLHGGQSLRV 337
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLI 119
F +WCS +N II+ T+ +I
Sbjct: 338 FTKWCSDEKNLIIMPGFCVANTVGAKVI 365
>gi|169767492|ref|XP_001818217.1| cleavage and polyadenylylation specificity factor [Aspergillus
oryzae RIB40]
gi|83766072|dbj|BAE56215.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 1014
Score = 41.2 bits (95), Expect = 0.51, Method: Composition-based stats.
Identities = 43/175 (24%), Positives = 70/175 (40%), Gaps = 42/175 (24%)
Query: 34 IEWMSDKLMKSFEGA----------RNN------------------PFHFKHVKLCHSLA 65
+EWM + +++ FE A R N PF FKH+K+
Sbjct: 357 LEWMDENIVREFEAAEGVDAATGQSRANPGGQRSGQNQGKEEKGTGPFTFKHLKIVERKK 416
Query: 66 ELAKV---PSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELG 122
+L K+ +PKV+L S ++ GF+++ P N +++T LA D E
Sbjct: 417 KLEKILNNQAPKVILASDTSLDWGFAKESLRLVAGGPNNLLLLTESFHKERLA-DNQEDP 475
Query: 123 GNRTLTLQVKKRIRLEGEELEEYQKKKDKEAKDKQEKEKIPPHDTSFINELQLSD 177
+TL + E Y+++KD A +K ++ S EL SD
Sbjct: 476 YRKTLGNMI----------WEWYEERKDGVALEKSSDGEMIEQVHSGGRELSWSD 520
>gi|391871950|gb|EIT81099.1| mRNA cleavage and polyadenylation factor II complex, subunit CFT2
[Aspergillus oryzae 3.042]
Length = 1010
Score = 41.2 bits (95), Expect = 0.51, Method: Composition-based stats.
Identities = 43/175 (24%), Positives = 70/175 (40%), Gaps = 42/175 (24%)
Query: 34 IEWMSDKLMKSFEGA----------RNN------------------PFHFKHVKLCHSLA 65
+EWM + +++ FE A R N PF FKH+K+
Sbjct: 353 LEWMDENIVREFEAAEGVDAATGQSRANPGGQRSGQNQGKEEKGTGPFTFKHLKIVERKK 412
Query: 66 ELAKV---PSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELG 122
+L K+ +PKV+L S ++ GF+++ P N +++T LA D E
Sbjct: 413 KLEKILNNQAPKVILASDTSLDWGFAKESLRLVAGGPNNLLLLTESFHKERLA-DNQEDP 471
Query: 123 GNRTLTLQVKKRIRLEGEELEEYQKKKDKEAKDKQEKEKIPPHDTSFINELQLSD 177
+TL + E Y+++KD A +K ++ S EL SD
Sbjct: 472 YRKTLGNMI----------WEWYEERKDGVALEKSSDGEMIEQVHSGGRELSWSD 516
>gi|451946223|ref|YP_007466818.1| metallo-beta-lactamase and RNA-binding KH domain-containing
putative metal-dependent RNase [Desulfocapsa sulfexigens
DSM 10523]
gi|451905571|gb|AGF77165.1| metallo-beta-lactamase and RNA-binding KH domain-containing
putative metal-dependent RNase [Desulfocapsa sulfexigens
DSM 10523]
Length = 469
Score = 41.2 bits (95), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 23/79 (29%), Positives = 40/79 (50%), Gaps = 3/79 (3%)
Query: 45 FEGARNNPFHFKHVKLCHSLAE---LAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPEN 101
E NNPF F+ +K S+ E L+ P +++ ++ ME G +P+N
Sbjct: 307 LEDKNNNPFGFETLKYTSSVDESKNLSSSDKPAIIISASGMMEGGRILHHLKSRAGNPKN 366
Query: 102 SIIITNRTSPGTLARDLIE 120
+I+IT +P TL R +++
Sbjct: 367 TILITGWQAPFTLGRQIVD 385
>gi|327351648|gb|EGE80505.1| cleavage and polyadenylation specificity factor [Ajellomyces
dermatitidis ATCC 18188]
Length = 983
Score = 41.2 bits (95), Expect = 0.52, Method: Composition-based stats.
Identities = 37/161 (22%), Positives = 69/161 (42%), Gaps = 18/161 (11%)
Query: 1 MSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKIEWMSDKLMKSFEGARN--NPFHFKHV 58
M + +++ FE +P ++ + M DK +GA PF FK++
Sbjct: 311 MDEGIVREFEAGHGDPVAVSGKGRQDGPSQRNPLTGMPDK---RGDGAFKALGPFTFKYL 367
Query: 59 KLCHSLAELAKV---PSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLA 115
K+ A+L K+ +PKV+L S ++ G+S+ + + EN +I+T S
Sbjct: 368 KIVERKAKLDKILGSNTPKVILTSDTSLDWGYSKHVLQNIATGSENLVILTESFS----- 422
Query: 116 RDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKDKEAKDK 156
+ N+ + + L E Y+++KD A +K
Sbjct: 423 -----VSANKEMMDDGRSTTSLAHEIWSIYEERKDGVALEK 458
>gi|239610975|gb|EEQ87962.1| cleavage and polyadenylation specificity factor [Ajellomyces
dermatitidis ER-3]
Length = 983
Score = 41.2 bits (95), Expect = 0.52, Method: Composition-based stats.
Identities = 37/161 (22%), Positives = 69/161 (42%), Gaps = 18/161 (11%)
Query: 1 MSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKIEWMSDKLMKSFEGARN--NPFHFKHV 58
M + +++ FE +P ++ + M DK +GA PF FK++
Sbjct: 311 MDEGIVREFEAGHGDPVAVSGKGRQDGPSQRNPLTGMPDK---RGDGAFKALGPFTFKYL 367
Query: 59 KLCHSLAELAKV---PSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLA 115
K+ A+L K+ +PKV+L S ++ G+S+ + + EN +I+T S
Sbjct: 368 KIVERKAKLDKILGSNTPKVILTSDTSLDWGYSKHVLQNIATGSENLVILTESFS----- 422
Query: 116 RDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKDKEAKDK 156
+ N+ + + L E Y+++KD A +K
Sbjct: 423 -----VSANKEMMDDGRSTTSLAHEIWSIYEERKDGVALEK 458
>gi|261206112|ref|XP_002627793.1| cleavage and polyadenylylation specificity factor [Ajellomyces
dermatitidis SLH14081]
gi|239592852|gb|EEQ75433.1| cleavage and polyadenylylation specificity factor [Ajellomyces
dermatitidis SLH14081]
Length = 983
Score = 41.2 bits (95), Expect = 0.52, Method: Composition-based stats.
Identities = 37/161 (22%), Positives = 69/161 (42%), Gaps = 18/161 (11%)
Query: 1 MSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKIEWMSDKLMKSFEGARN--NPFHFKHV 58
M + +++ FE +P ++ + M DK +GA PF FK++
Sbjct: 311 MDEGIVREFEAGHGDPVAVSGKGRQDGPSQRNPLTGMPDK---RGDGAFKALGPFTFKYL 367
Query: 59 KLCHSLAELAKV---PSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLA 115
K+ A+L K+ +PKV+L S ++ G+S+ + + EN +I+T S
Sbjct: 368 KIVERKAKLDKILGSNTPKVILTSDTSLDWGYSKHVLQNIATGSENLVILTESFS----- 422
Query: 116 RDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKDKEAKDK 156
+ N+ + + L E Y+++KD A +K
Sbjct: 423 -----VSANKEMMDDGRSTTSLAHEIWSIYEERKDGVALEK 458
>gi|156092661|ref|XP_001612477.1| cleavage and polyadenylation specifity factor protein [Plasmodium
vivax Sal-1]
gi|148801279|gb|EDL42684.1| cleavage and polyadenylation specifity factor protein, putative
[Plasmodium vivax]
Length = 858
Score = 41.2 bits (95), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 25/72 (34%), Positives = 39/72 (54%), Gaps = 4/72 (5%)
Query: 51 NPFHFKHVKLCHSLAE----LAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIIT 106
NPF+FK VK SL L + +P VV+ S ++ G S+++F ++ +I+T
Sbjct: 355 NPFNFKFVKYAKSLDSILNYLYQDNNPCVVMASPGMLQNGISKNIFNIIAPDKKSGVILT 414
Query: 107 NRTSPGTLARDL 118
T GTLA +L
Sbjct: 415 GYTVKGTLADEL 426
>gi|169598186|ref|XP_001792516.1| hypothetical protein SNOG_01891 [Phaeosphaeria nodorum SN15]
gi|111068983|gb|EAT90103.1| hypothetical protein SNOG_01891 [Phaeosphaeria nodorum SN15]
Length = 833
Score = 41.2 bits (95), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 17/45 (37%), Positives = 27/45 (60%)
Query: 75 VVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLI 119
V+L S M+ G SR+L +W P N +IIT + GT+A+ ++
Sbjct: 355 VMLASPGMMQSGTSRELLERWAPDPRNGVIITGYSVEGTMAKQIV 399
>gi|356502382|ref|XP_003519998.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
3-II-like [Glycine max]
Length = 516
Score = 41.2 bits (95), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 26/87 (29%), Positives = 42/87 (48%), Gaps = 6/87 (6%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLF 92
I W K+ ++ +++N F FK+V+ + P P VL +TP M GFS ++F
Sbjct: 282 IRWTRQKIKDTY--SKHNAFDFKNVQKFER--SMIDAPGP-CVLFATPGMLSGGFSVEVF 336
Query: 93 FQWCSSPENSIIITNRTSPGTLARDLI 119
W S N + + PGT+ L+
Sbjct: 337 KHWAVSENNLVSLPGYCVPGTIGHKLM 363
>gi|21358523|ref|NP_651721.1| integrator 11 [Drosophila melanogaster]
gi|7301822|gb|AAF56931.1| integrator 11 [Drosophila melanogaster]
gi|16768852|gb|AAL28645.1| LD08814p [Drosophila melanogaster]
gi|220943570|gb|ACL84328.1| CG1972-PA [synthetic construct]
gi|220953494|gb|ACL89290.1| CG1972-PA [synthetic construct]
Length = 597
Score = 41.2 bits (95), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 28/107 (26%), Positives = 53/107 (49%), Gaps = 10/107 (9%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLV-STPDM-ECGFSRDL 91
I W + K+ K+F N F FKH+K + A + +P ++V +TP M G S +
Sbjct: 283 ITWTNQKIRKTF--VHRNMFDFKHIKPF----DKAYIDNPGAMVVFATPGMLHAGLSLQI 336
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
F +W + N +I+ GT+ + LGG + + + ++ + ++
Sbjct: 337 FKKWAPNENNMVIMPGYCVQGTVGNKI--LGGAKKVEFENRQVVEVK 381
>gi|156100557|ref|XP_001616006.1| cleavage and polyadenylation specifity factor protein [Plasmodium
vivax Sal-1]
gi|148804880|gb|EDL46279.1| cleavage and polyadenylation specifity factor protein, putative
[Plasmodium vivax]
Length = 934
Score = 41.2 bits (95), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 25/72 (34%), Positives = 39/72 (54%), Gaps = 4/72 (5%)
Query: 51 NPFHFKHVKLCHSLAE----LAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIIT 106
NPF+FK VK SL L + +P VV+ S ++ G S+++F ++ +I+T
Sbjct: 355 NPFNFKFVKYAKSLDSILNYLYQDNNPCVVMASPGMLQNGISKNIFNIIAPDKKSGVILT 414
Query: 107 NRTSPGTLARDL 118
T GTLA +L
Sbjct: 415 GYTVKGTLADEL 426
>gi|70996586|ref|XP_753048.1| cleavage and polyadenylation specifity factor, 73 kDa subunit,
putative [Aspergillus fumigatus Af293]
gi|74672067|sp|Q4WRC2.1|YSH1_ASPFU RecName: Full=Endoribonuclease ysh1; AltName: Full=mRNA
3'-end-processing protein ysh1
gi|66850683|gb|EAL91010.1| cleavage and polyadenylation specifity factor, 73 kDa subunit,
putative [Aspergillus fumigatus Af293]
gi|159131784|gb|EDP56897.1| cleavage and polyadenylation specifity factor, 73 kDa subunit,
putative [Aspergillus fumigatus A1163]
Length = 872
Score = 41.2 bits (95), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 26/90 (28%), Positives = 45/90 (50%), Gaps = 9/90 (10%)
Query: 31 LAKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSR 89
+A+ E DK A P+ FK V+ SL V V++++P M + G SR
Sbjct: 319 MAEAEASGDK------SASAGPWDFKFVRSLRSLERFDDVGG--CVMLASPGMLQTGTSR 370
Query: 90 DLFFQWCSSPENSIIITNRTSPGTLARDLI 119
+L +W + N +++T + GT+A+ L+
Sbjct: 371 ELLERWAPNERNGVVMTGYSVEGTMAKQLL 400
>gi|195503187|ref|XP_002098546.1| GE23879 [Drosophila yakuba]
gi|194184647|gb|EDW98258.1| GE23879 [Drosophila yakuba]
Length = 597
Score = 40.8 bits (94), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 28/107 (26%), Positives = 53/107 (49%), Gaps = 10/107 (9%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLV-STPDM-ECGFSRDL 91
I W + K+ K+F N F FKH+K + A + +P ++V +TP M G S +
Sbjct: 283 ITWTNQKIRKTF--VHRNMFDFKHIKPF----DKAYIDNPGAMVVFATPGMLHAGLSLQI 336
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
F +W + N +I+ GT+ + LGG + + + ++ + ++
Sbjct: 337 FKKWAPNENNMVIMPGYCVQGTVGNKI--LGGAKKVEFENRQVVEVK 381
>gi|195445135|ref|XP_002070189.1| GK11920 [Drosophila willistoni]
gi|194166274|gb|EDW81175.1| GK11920 [Drosophila willistoni]
Length = 597
Score = 40.8 bits (94), Expect = 0.58, Method: Compositional matrix adjust.
Identities = 28/107 (26%), Positives = 53/107 (49%), Gaps = 10/107 (9%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLV-STPDM-ECGFSRDL 91
I W + K+ K+F N F FKH+K + A + +P ++V +TP M G S +
Sbjct: 283 ITWTNQKIRKTF--VHRNMFDFKHIKPF----DKAYIDNPGAMVVFATPGMLHAGLSLQI 336
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
F +W + N +I+ GT+ + LGG + + + ++ + ++
Sbjct: 337 FKKWAPNENNMVIMPGYCVQGTVGNKI--LGGAKKVEFENRQVVEVK 381
>gi|195394529|ref|XP_002055895.1| GJ10637 [Drosophila virilis]
gi|194142604|gb|EDW59007.1| GJ10637 [Drosophila virilis]
Length = 597
Score = 40.8 bits (94), Expect = 0.58, Method: Compositional matrix adjust.
Identities = 28/107 (26%), Positives = 53/107 (49%), Gaps = 10/107 (9%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLV-STPDM-ECGFSRDL 91
I W + K+ K+F N F FKH+K + A + +P ++V +TP M G S +
Sbjct: 283 ITWTNQKIRKTF--VHRNMFDFKHIKPF----DKAYIDNPGAMVVFATPGMLHAGLSLQI 336
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
F +W + N +I+ GT+ + LGG + + + ++ + ++
Sbjct: 337 FKKWAPNENNMVIMPGYCVQGTVGNKI--LGGAKKVEFENRQVVEVK 381
>gi|194765324|ref|XP_001964777.1| GF23370 [Drosophila ananassae]
gi|190615049|gb|EDV30573.1| GF23370 [Drosophila ananassae]
Length = 597
Score = 40.8 bits (94), Expect = 0.59, Method: Compositional matrix adjust.
Identities = 28/107 (26%), Positives = 53/107 (49%), Gaps = 10/107 (9%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLV-STPDM-ECGFSRDL 91
I W + K+ K+F N F FKH+K + A + +P ++V +TP M G S +
Sbjct: 283 ITWTNQKIRKTF--VHRNMFDFKHIKPF----DKAYIDNPGAMVVFATPGMLHAGLSLQI 336
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
F +W + N +I+ GT+ + LGG + + + ++ + ++
Sbjct: 337 FKKWAPNENNMVIMPGYCVQGTVGNKI--LGGAKKVEFENRQVVEVK 381
>gi|195143691|ref|XP_002012831.1| GL23717 [Drosophila persimilis]
gi|194101774|gb|EDW23817.1| GL23717 [Drosophila persimilis]
Length = 597
Score = 40.8 bits (94), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 28/107 (26%), Positives = 53/107 (49%), Gaps = 10/107 (9%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLV-STPDM-ECGFSRDL 91
I W + K+ K+F N F FKH+K + A + +P ++V +TP M G S +
Sbjct: 283 ITWTNQKIRKTF--VHRNMFDFKHIKPF----DKAYIDNPGAMVVFATPGMLHAGLSLQI 336
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
F +W + N +I+ GT+ + LGG + + + ++ + ++
Sbjct: 337 FKKWAPNENNMVIMPGYCVQGTVGNKI--LGGAKKVEFENRQVVEVK 381
>gi|125773833|ref|XP_001358175.1| GA15164 [Drosophila pseudoobscura pseudoobscura]
gi|54637910|gb|EAL27312.1| GA15164 [Drosophila pseudoobscura pseudoobscura]
Length = 597
Score = 40.8 bits (94), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 28/107 (26%), Positives = 53/107 (49%), Gaps = 10/107 (9%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLV-STPDM-ECGFSRDL 91
I W + K+ K+F N F FKH+K + A + +P ++V +TP M G S +
Sbjct: 283 ITWTNQKIRKTF--VHRNMFDFKHIKPF----DKAYIDNPGAMVVFATPGMLHAGLSLQI 336
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
F +W + N +I+ GT+ + LGG + + + ++ + ++
Sbjct: 337 FKKWAPNENNMVIMPGYCVQGTVGNKI--LGGAKKVEFENRQVVEVK 381
>gi|406604299|emb|CCH44271.1| Cleavage and polyadenylation specificity factor subunit
[Wickerhamomyces ciferrii]
Length = 795
Score = 40.8 bits (94), Expect = 0.61, Method: Compositional matrix adjust.
Identities = 32/115 (27%), Positives = 51/115 (44%), Gaps = 14/115 (12%)
Query: 5 LMKSFEGARNNPFHFKHVKLCHSLAELAKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSL 64
LM S+ ARN L ++ + L EWM+ +++ +E PF V++
Sbjct: 256 LMLSYTKARN---------LTYAGSML---EWMAPAVIREWETRGQPPFDSSRVQVIEP- 302
Query: 65 AELAKVPSPKVVLVSTPDMECG-FSRDLFFQWCSSPENSIIITNRTSPGTLARDL 118
EL +P KVV S E G ++ C + +II+T +T T+ DL
Sbjct: 303 NELLNMPGAKVVFASGAGFEDGSVAQAALTTLCDDEKTTIILTEKTVENTIGNDL 357
>gi|402590428|gb|EJW84358.1| RNA-metabolising metallo-beta-lactamase [Wuchereria bancrofti]
Length = 579
Score = 40.8 bits (94), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 29/88 (32%), Positives = 45/88 (51%), Gaps = 8/88 (9%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPK-VVLVSTPDM-ECGFSRDL 91
I W ++K+ ++F N F FKH++ E + + SP +VL STP M G S +
Sbjct: 255 ISWTNEKIKRTF--VERNMFDFKHIRPF----EQSYIDSPGPMVLFSTPGMLHGGQSLRV 308
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLI 119
F +WCS +N II+ T+ +I
Sbjct: 309 FTKWCSDEKNLIIMPGFCVANTVGAKVI 336
>gi|194906134|ref|XP_001981318.1| GG11690 [Drosophila erecta]
gi|190655956|gb|EDV53188.1| GG11690 [Drosophila erecta]
Length = 597
Score = 40.8 bits (94), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 28/107 (26%), Positives = 53/107 (49%), Gaps = 10/107 (9%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLV-STPDM-ECGFSRDL 91
I W + K+ K+F N F FKH+K + A + +P ++V +TP M G S +
Sbjct: 283 ITWTNQKIRKTF--VHRNMFDFKHIKPF----DKAYIDNPGAMVVFATPGMLHAGLSLQI 336
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
F +W + N +I+ GT+ + LGG + + + ++ + ++
Sbjct: 337 FKKWAPNENNMVIMPGYCVQGTVGNKI--LGGAKKVEFENRQVVEVK 381
>gi|378756880|gb|EHY66904.1| cleavage and polyadenylation specificity factor subunit 3
[Nematocida sp. 1 ERTm2]
Length = 501
Score = 40.8 bits (94), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 34/122 (27%), Positives = 53/122 (43%), Gaps = 18/122 (14%)
Query: 18 HFKHVKLCHSLAELAKIEWMSDKLMKSF-----EGARN-----NPFHFKHVKLCHSLAEL 67
H++ KL + A + ++ + K F E RN N F F+HVK S L
Sbjct: 255 HWEKSKLSIPIYTSATLTHKANDIYKQFIDYTHEHIRNTMHKRNLFDFQHVKQFDS--NL 312
Query: 68 AKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDL------IEL 121
A + P ++ S + G S +F +WC P+N +I GT+ + IE+
Sbjct: 313 ASLEGPMILFSSPGMLHSGPSLSIFKKWCGDPKNMVIFPGYCVRGTIGERVLNGASQIEV 372
Query: 122 GG 123
GG
Sbjct: 373 GG 374
>gi|91086147|ref|XP_969343.1| PREDICTED: similar to CG1972 CG1972-PA [Tribolium castaneum]
gi|270009886|gb|EFA06334.1| hypothetical protein TcasGA2_TC009205 [Tribolium castaneum]
Length = 595
Score = 40.8 bits (94), Expect = 0.63, Method: Compositional matrix adjust.
Identities = 28/111 (25%), Positives = 46/111 (41%), Gaps = 14/111 (12%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K P P VV + + G S +F
Sbjct: 283 ITWTNQKIRKTF--VQRNMFDFKHIKPFDR--SYIDNPGPMVVFATPGMLHAGLSLQIFK 338
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELEE 144
+W + N +I+ GT+ + L KR+ E +++ E
Sbjct: 339 KWAPNENNMVIMPGFCVQGTVGHKI----------LNGAKRVEFENKQIVE 379
>gi|387594235|gb|EIJ89259.1| integrator complex subunit 11 [Nematocida parisii ERTm3]
gi|387594982|gb|EIJ92609.1| integrator complex subunit 11 [Nematocida parisii ERTm1]
Length = 502
Score = 40.8 bits (94), Expect = 0.63, Method: Compositional matrix adjust.
Identities = 29/113 (25%), Positives = 49/113 (43%), Gaps = 12/113 (10%)
Query: 19 FKHVKLCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVV 76
+ L H ++ K I++ + + + + N F F+HVK S LA + P ++
Sbjct: 266 YTSATLTHKANDIYKQFIDYTHEHIRSTLH--KRNLFDFRHVKQFDS--NLASLEGPMIL 321
Query: 77 LVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDL------IELGG 123
S + G S +F +WC P N +I GT+ + IE+GG
Sbjct: 322 FSSPGMLHSGPSLSIFKKWCGDPNNMVIFPGYCVRGTIGERVLNGASQIEVGG 374
>gi|328766828|gb|EGF76880.1| hypothetical protein BATDEDRAFT_14507, partial [Batrachochytrium
dendrobatidis JAM81]
Length = 475
Score = 40.8 bits (94), Expect = 0.66, Method: Compositional matrix adjust.
Identities = 27/98 (27%), Positives = 46/98 (46%), Gaps = 10/98 (10%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVK-LCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDL 91
I W ++ + + N F F H++ HS A+ +VL +TP M G S ++
Sbjct: 295 ISWTNENVKSAL--VERNMFDFAHIRSWSHSFAD----EPGAMVLFATPGMLHAGTSLEV 348
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTL 129
F +WC P+N II+ GT+ + L G + + +
Sbjct: 349 FKKWCHDPKNMIIMPGYCVAGTVGAKV--LAGEKVINV 384
>gi|390602470|gb|EIN11863.1| Metallo-hydrolase/oxidoreductase, partial [Punctularia
strigosozonata HHB-11173 SS5]
Length = 721
Score = 40.8 bits (94), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 27/77 (35%), Positives = 41/77 (53%), Gaps = 9/77 (11%)
Query: 49 RNNPFHFKHVKLCHSLA------ELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENS 102
R+NPF FKH+ H+ +LA+ P P V+L S ++ G SR+L N
Sbjct: 298 RDNPFVFKHI--SHAPQNRGWERKLAEGP-PCVILASPGMLQSGPSRELLELLAPDSRNG 354
Query: 103 IIITNRTSPGTLARDLI 119
+++T + GT ARD+I
Sbjct: 355 LVLTGYSVEGTPARDII 371
>gi|195062087|ref|XP_001996130.1| GH14325 [Drosophila grimshawi]
gi|193891922|gb|EDV90788.1| GH14325 [Drosophila grimshawi]
Length = 597
Score = 40.8 bits (94), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 28/107 (26%), Positives = 53/107 (49%), Gaps = 10/107 (9%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLV-STPDM-ECGFSRDL 91
I W + K+ K+F N F FKH+K + A + +P ++V +TP M G S +
Sbjct: 283 ITWTNQKIRKTF--VHRNMFDFKHIKPF----DKAYIDNPGAMVVFATPGMLHAGLSLQI 336
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
F +W + N +I+ GT+ + LGG + + + ++ + ++
Sbjct: 337 FKKWAPNENNMVIMPGYCVQGTVGNKI--LGGAKKVEFENRQVVEVK 381
>gi|295659367|ref|XP_002790242.1| cleavage and polyadenylation specific factor 2 [Paracoccidioides
sp. 'lutzii' Pb01]
gi|226281947|gb|EEH37513.1| cleavage and polyadenylation specific factor 2 [Paracoccidioides
sp. 'lutzii' Pb01]
Length = 999
Score = 40.4 bits (93), Expect = 0.78, Method: Compositional matrix adjust.
Identities = 24/79 (30%), Positives = 42/79 (53%), Gaps = 4/79 (5%)
Query: 51 NPFHFKHVKLCHSLAELAKV---PSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITN 107
PF F+H+K+ +L K+ +P+V+L S +E G+S+ + + + EN II+T
Sbjct: 401 GPFTFRHLKIVERKTKLDKILGSNAPQVILTSDTSLEWGYSKHVLQKIAAGSENLIILTE 460
Query: 108 RTSPGTLARDLIELGGNRT 126
P + E GG+R+
Sbjct: 461 -PYPISANAQSTEDGGSRS 478
>gi|401827835|ref|XP_003888210.1| putative RNA-processing beta-lactamase-fold exonuclease
[Encephalitozoon hellem ATCC 50504]
gi|392999410|gb|AFM99229.1| putative RNA-processing beta-lactamase-fold exonuclease
[Encephalitozoon hellem ATCC 50504]
Length = 496
Score = 40.4 bits (93), Expect = 0.78, Method: Compositional matrix adjust.
Identities = 29/93 (31%), Positives = 45/93 (48%), Gaps = 5/93 (5%)
Query: 51 NPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFFQWCSSPENSIIITNRT 109
N F +KH+K + P +VL ++P M G S +F +WCS +N +II
Sbjct: 293 NVFEYKHIKPFQKYYMDNEGP---MVLFASPGMLHSGMSLRMFKEWCSDEKNLVIIPGYC 349
Query: 110 SPGTLARDLIELGGNRTLTLQVKKRIRLEGEEL 142
GT+ ++ G R L +K IR+E + L
Sbjct: 350 VRGTIGEKVLN-GAKRLEILGEEKDIRIEIKNL 381
>gi|219121689|ref|XP_002181194.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217407180|gb|EEC47117.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 602
Score = 40.4 bits (93), Expect = 0.79, Method: Compositional matrix adjust.
Identities = 26/91 (28%), Positives = 43/91 (47%), Gaps = 10/91 (10%)
Query: 50 NNPFHFKHVKLCHSL-AELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 108
+NPF F+ ++ S+ P VV S ++ G SR LF +W S +N ++I
Sbjct: 296 SNPFRFRFIQNLKSIDVNSFDDSGPSVVFASPGMLQSGVSRQLFDRWASDHKNGVLIAGY 355
Query: 109 TSPGTLARDLIELGGNRTLTLQVKKRIRLEG 139
TLA++++ Q K+ + LEG
Sbjct: 356 AVEHTLAKEIMA---------QPKEVVTLEG 377
>gi|358333178|dbj|GAA51732.1| integrator complex subunit 11 [Clonorchis sinensis]
Length = 649
Score = 40.4 bits (93), Expect = 0.80, Method: Compositional matrix adjust.
Identities = 23/87 (26%), Positives = 40/87 (45%), Gaps = 6/87 (6%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAE-LAKVPSPKVVLVSTPDMECGFSRDLF 92
I W + K+ ++F + N F FKH+K L + + P P VV + + G S +F
Sbjct: 352 ISWTNQKIKETF--VKRNMFEFKHIK---PLGQGIVDNPGPMVVFATPGMLHAGQSLHIF 406
Query: 93 FQWCSSPENSIIITNRTSPGTLARDLI 119
+W N ++I GT+ ++
Sbjct: 407 RKWAPDERNMVVIPGYCVAGTVGYKIL 433
>gi|119195099|ref|XP_001248153.1| hypothetical protein CIMG_01924 [Coccidioides immitis RS]
Length = 1015
Score = 40.4 bits (93), Expect = 0.81, Method: Compositional matrix adjust.
Identities = 27/130 (20%), Positives = 49/130 (37%), Gaps = 42/130 (32%)
Query: 23 KLCHSLAELAK--IEWMSDKLMKSFEGARNN----------------------------- 51
K H LA+ +EWM + +++ FEG
Sbjct: 330 KKVHGTMRLARSMLEWMDESIVREFEGGDGGESLGAGRSSGAASGQQSKGTPGQTSDKKS 389
Query: 52 --------PFHFKHVKLCH---SLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPE 100
PF F+H+K+ L + + PKV++ S ++ GFS+++ E
Sbjct: 390 AGPHKGLGPFTFRHLKIIERKTKLENILRSEGPKVIIASDTSLDWGFSKEILRHVAQGAE 449
Query: 101 NSIIITNRTS 110
N +I+T ++
Sbjct: 450 NLVILTESSA 459
>gi|312083284|ref|XP_003143797.1| RNA-metabolising metallo-beta-lactamase [Loa loa]
gi|307761039|gb|EFO20273.1| RNA-metabolising metallo-beta-lactamase [Loa loa]
Length = 644
Score = 40.4 bits (93), Expect = 0.82, Method: Compositional matrix adjust.
Identities = 28/87 (32%), Positives = 42/87 (48%), Gaps = 6/87 (6%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLF 92
I W ++K+ ++F N F FKH++ P P +VL STP M G S +F
Sbjct: 339 ISWTNEKIKRTF--VERNMFDFKHIRPFEQ--SYTDSPGP-MVLFSTPGMLHGGQSLRVF 393
Query: 93 FQWCSSPENSIIITNRTSPGTLARDLI 119
+WCS +N II+ T+ +I
Sbjct: 394 TKWCSDEKNLIIMPGFCVANTVGAKVI 420
>gi|373495498|ref|ZP_09586082.1| hypothetical protein HMPREF0380_01720 [Eubacterium infirmum F0142]
gi|371964681|gb|EHO82189.1| hypothetical protein HMPREF0380_01720 [Eubacterium infirmum F0142]
Length = 556
Score = 40.4 bits (93), Expect = 0.83, Method: Compositional matrix adjust.
Identities = 33/99 (33%), Positives = 46/99 (46%), Gaps = 11/99 (11%)
Query: 50 NNPFHFKHVKLCHSLAELAKV---PSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIIT 106
+NP FK++K S E + PKV++ ++ E G R +P+NSI+
Sbjct: 312 DNPLEFKNLKFTRSSKESQDLNFNKEPKVIISASGMCEAGRIRHHLKHNLWNPKNSIVFV 371
Query: 107 NRTSPGTLARDLIELGGNRTLTL-----QVKKRIR-LEG 139
GTL R LIE G + +TL QV I LEG
Sbjct: 372 GYQGQGTLGRSLIE--GIKMVTLFGEEIQVNAEIHNLEG 408
>gi|398406895|ref|XP_003854913.1| hypothetical protein MYCGRDRAFT_55193, partial [Zymoseptoria
tritici IPO323]
gi|339474797|gb|EGP89889.1| hypothetical protein MYCGRDRAFT_55193 [Zymoseptoria tritici IPO323]
Length = 855
Score = 40.4 bits (93), Expect = 0.83, Method: Compositional matrix adjust.
Identities = 24/90 (26%), Positives = 42/90 (46%), Gaps = 8/90 (8%)
Query: 52 PFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSP 111
P+ F ++ S+ V P VVL S ++ G SR L +W +N +IIT +
Sbjct: 339 PWDFNFIRQLKSMDRYEDV-GPSVVLASPGMLQNGPSRTLLERWAPDAKNGVIITGYSVE 397
Query: 112 GTLARDLI-------ELGGNRTLTLQVKKR 134
GT+A+ ++ + G + + + KR
Sbjct: 398 GTMAKTIMTEPDSIPAVTGKKNTGMNIGKR 427
>gi|407919362|gb|EKG12612.1| Beta-lactamase-like protein [Macrophomina phaseolina MS6]
Length = 842
Score = 40.4 bits (93), Expect = 0.86, Method: Compositional matrix adjust.
Identities = 22/77 (28%), Positives = 41/77 (53%), Gaps = 1/77 (1%)
Query: 43 KSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENS 102
++ + ++ P+ FK+V+ SL V S V+L S M+ G SR+L +W N
Sbjct: 323 RNGDASKAGPWDFKYVRSLKSLERFDDVGS-CVMLASPGMMQNGVSRELLERWAPDQRNG 381
Query: 103 IIITNRTSPGTLARDLI 119
+I+T + GT+ + ++
Sbjct: 382 VIMTGYSVEGTMGKMIL 398
>gi|115397403|ref|XP_001214293.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|114192484|gb|EAU34184.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 870
Score = 40.4 bits (93), Expect = 0.86, Method: Compositional matrix adjust.
Identities = 26/90 (28%), Positives = 45/90 (50%), Gaps = 9/90 (10%)
Query: 31 LAKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSR 89
+A+ E DK A P+ F+ V+ SL V V++++P M + G SR
Sbjct: 319 MAEAEASGDK------NASAGPWDFRFVRSLRSLERFDDVGG--CVMLASPGMLQSGTSR 370
Query: 90 DLFFQWCSSPENSIIITNRTSPGTLARDLI 119
+L +W + N +I+T + GT+A+ L+
Sbjct: 371 ELLERWAPNERNGVIMTGYSVEGTMAKQLL 400
>gi|255957115|ref|XP_002569310.1| Pc21g23430 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211591021|emb|CAP97240.1| Pc21g23430 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 862
Score = 40.4 bits (93), Expect = 0.87, Method: Compositional matrix adjust.
Identities = 21/70 (30%), Positives = 38/70 (54%), Gaps = 3/70 (4%)
Query: 51 NPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFFQWCSSPENSIIITNRT 109
P+ F+ V+ SL V V++++P M + G SR+L +W S N +++T +
Sbjct: 333 GPWDFRFVRSLRSLERFDDVGG--CVMLASPGMLQTGTSRELLERWAPSDRNGVVMTGYS 390
Query: 110 SPGTLARDLI 119
GT+A+ L+
Sbjct: 391 VEGTMAKGLL 400
>gi|392862603|gb|EAS36741.2| cleavage and polyadenylylation specificity factor [Coccidioides
immitis RS]
Length = 1026
Score = 40.4 bits (93), Expect = 0.90, Method: Compositional matrix adjust.
Identities = 27/130 (20%), Positives = 49/130 (37%), Gaps = 42/130 (32%)
Query: 23 KLCHSLAELAK--IEWMSDKLMKSFEGARNN----------------------------- 51
K H LA+ +EWM + +++ FEG
Sbjct: 341 KKVHGTMRLARSMLEWMDESIVREFEGGDGGESLGAGRSSGAASGQQSKGTPGQTSDKKS 400
Query: 52 --------PFHFKHVKLCH---SLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPE 100
PF F+H+K+ L + + PKV++ S ++ GFS+++ E
Sbjct: 401 AGPHKGLGPFTFRHLKIIERKTKLENILRSEGPKVIIASDTSLDWGFSKEILRHVAQGAE 460
Query: 101 NSIIITNRTS 110
N +I+T ++
Sbjct: 461 NLVILTESSA 470
>gi|326435554|gb|EGD81124.1| integrator complex subunit 11 [Salpingoeca sp. ATCC 50818]
Length = 620
Score = 40.0 bits (92), Expect = 0.98, Method: Compositional matrix adjust.
Identities = 26/109 (23%), Positives = 47/109 (43%), Gaps = 5/109 (4%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
+ + + K+ +F N F FKH++ A P P+V+ + + G + ++F
Sbjct: 316 VMYTNQKIKDTF--VDRNLFDFKHIRAFDR--SYADQPGPQVLFATPGMLHAGVALEVFA 371
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEEL 142
+W P N +I+ GT+ +I G Q +RL+ E L
Sbjct: 372 KWAGDPRNMVILPGYCVAGTVGAQVIA-GKKEIRVGQQMVNVRLQVEYL 419
>gi|193608339|ref|XP_001949326.1| PREDICTED: integrator complex subunit 11-like isoform 1
[Acyrthosiphon pisum]
gi|328710634|ref|XP_003244318.1| PREDICTED: integrator complex subunit 11-like isoform 2
[Acyrthosiphon pisum]
Length = 603
Score = 40.0 bits (92), Expect = 0.98, Method: Compositional matrix adjust.
Identities = 26/105 (24%), Positives = 48/105 (45%), Gaps = 6/105 (5%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ ++F + N F FKH+K P P VV + + G S ++F
Sbjct: 286 ITWTNQKVRQTF--VQRNMFDFKHIKPFDK--TYMHNPGPMVVFATPGMLHAGLSLNIFK 341
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
+W +N +I+ GT+ + L G++ + + K I ++
Sbjct: 342 KWAPDEKNMLIVPGYCVSGTVGNKV--LSGSKKIEAEPNKFIDVK 384
>gi|195112455|ref|XP_002000788.1| GI10422 [Drosophila mojavensis]
gi|193917382|gb|EDW16249.1| GI10422 [Drosophila mojavensis]
Length = 597
Score = 40.0 bits (92), Expect = 0.99, Method: Compositional matrix adjust.
Identities = 29/107 (27%), Positives = 53/107 (49%), Gaps = 10/107 (9%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLV-STPDM-ECGFSRDL 91
I W + K+ K+F N F FKH+K + A + +P ++V +TP M G S +
Sbjct: 283 ITWTNQKIRKTF--VHRNMFDFKHIKPF----DKAYIDNPGAMVVFATPGMLHAGLSLQI 336
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
F +W + N +I+ GT+ + LGG + + + ++ I ++
Sbjct: 337 FKKWAPNENNMVIMPGYCVQGTVGNKI--LGGAKKVECENRQIIEVK 381
>gi|146305000|ref|YP_001192316.1| beta-lactamase domain-containing protein [Metallosphaera sedula DSM
5348]
gi|145703250|gb|ABP96392.1| beta-lactamase domain protein [Metallosphaera sedula DSM 5348]
Length = 642
Score = 40.0 bits (92), Expect = 0.99, Method: Compositional matrix adjust.
Identities = 25/98 (25%), Positives = 41/98 (41%), Gaps = 4/98 (4%)
Query: 19 FKHVKLCHSLAELAKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLV 78
F V H+ A EW+ ++ S NPF + K E P ++L
Sbjct: 445 FDEVTAIHT----AYPEWLGKEVRDSILFRDENPFTSEFFKRIEGYREDIAEGEPSIILA 500
Query: 79 STPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLAR 116
++ + G + + F Q P+NS+I + + GTL R
Sbjct: 501 TSGMLNGGPAVEFFKQLAPDPKNSLIFVSYQAEGTLGR 538
>gi|425780830|gb|EKV18826.1| Endoribonuclease ysh1 [Penicillium digitatum PHI26]
gi|425783067|gb|EKV20936.1| Endoribonuclease ysh1 [Penicillium digitatum Pd1]
Length = 862
Score = 40.0 bits (92), Expect = 1.00, Method: Compositional matrix adjust.
Identities = 21/70 (30%), Positives = 38/70 (54%), Gaps = 3/70 (4%)
Query: 51 NPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFFQWCSSPENSIIITNRT 109
P+ F+ V+ SL V V++++P M + G SR+L +W S N +++T +
Sbjct: 333 GPWDFRFVRSLRSLERFDDVGG--CVMLASPGMLQTGTSRELLERWAPSDRNGVVMTGYS 390
Query: 110 SPGTLARDLI 119
GT+A+ L+
Sbjct: 391 VEGTMAKGLL 400
>gi|303391170|ref|XP_003073815.1| putative beta-lactamase fold-containing exonuclease
[Encephalitozoon intestinalis ATCC 50506]
gi|303302963|gb|ADM12455.1| putative beta-lactamase fold-containing exonuclease
[Encephalitozoon intestinalis ATCC 50506]
Length = 496
Score = 40.0 bits (92), Expect = 1.00, Method: Compositional matrix adjust.
Identities = 33/116 (28%), Positives = 52/116 (44%), Gaps = 15/116 (12%)
Query: 38 SDKLMKSFEGARN----------NPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECG 86
++++ K F G N N F +KH+K K P +VL ++P M G
Sbjct: 270 ANEIYKKFIGYTNETVKRKIFERNVFEYKHIKPFQRYYMDNKGP---MVLFASPGMLHSG 326
Query: 87 FSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEEL 142
S +F +WC +N +II GT+ ++ G R L +K I+LE + L
Sbjct: 327 MSLRIFKEWCEDEKNLVIIPGYCVRGTIGEKILN-GAKRLEILGEEKDIKLEIKNL 381
>gi|284174985|ref|ZP_06388954.1| mRNA 3'-end processing factor, putative [Sulfolobus solfataricus
98/2]
Length = 638
Score = 40.0 bits (92), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 29/122 (23%), Positives = 50/122 (40%), Gaps = 6/122 (4%)
Query: 35 EWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQ 94
EW+ ++ + NPF +H K E P ++L ++ + G + + F
Sbjct: 461 EWLGREVREEILYKDENPFTSEHFKRIEGYKEDIAKGEPSIILATSGMLNGGPAVEFFKT 520
Query: 95 WCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKR-----IRLEGEELEEYQKKK 149
P+N+II + + GTL R + + G L R I +E E +E +
Sbjct: 521 MAPDPKNAIIFVSYQAEGTLGRKVRD-GAKEVQILDRDGRVESIQINMEVEAVEGFSGHS 579
Query: 150 DK 151
DK
Sbjct: 580 DK 581
>gi|308271258|emb|CBX27867.1| hypothetical protein N47_C19250 [uncultured Desulfobacterium sp.]
Length = 522
Score = 40.0 bits (92), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 26/91 (28%), Positives = 45/91 (49%), Gaps = 3/91 (3%)
Query: 35 EWMSDKLMKSFEGARNNPFHFKHVKLCHSLAE---LAKVPSPKVVLVSTPDMECGFSRDL 91
E SD+ +K+F +NPF FK +K S+ E L + +P +VL ++ E G
Sbjct: 339 EVYSDETVKAFLSIGDNPFAFKELKFVASVDESMVLMRDNTPHIVLAASGMCEAGRVLHH 398
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLIELG 122
+P+N+I++ + TL R + + G
Sbjct: 399 LRYKMHNPKNTILLVGFMAQNTLGRRIHDQG 429
>gi|15897661|ref|NP_342266.1| mRNA 3'-end processing factor [Sulfolobus solfataricus P2]
gi|6015715|emb|CAB57542.1| mRNA 3'-end polyadenylation factor [Sulfolobus solfataricus P2]
gi|13813930|gb|AAK41056.1| mRNA 3'-end processing factor, putative [Sulfolobus solfataricus
P2]
Length = 639
Score = 40.0 bits (92), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 29/122 (23%), Positives = 50/122 (40%), Gaps = 6/122 (4%)
Query: 35 EWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQ 94
EW+ ++ + NPF +H K E P ++L ++ + G + + F
Sbjct: 462 EWLGREVREEILYKDENPFTSEHFKRIEGYKEDIAKGEPSIILATSGMLNGGPAVEFFKT 521
Query: 95 WCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKR-----IRLEGEELEEYQKKK 149
P+N+II + + GTL R + + G L R I +E E +E +
Sbjct: 522 MAPDPKNAIIFVSYQAEGTLGRKVRD-GAKEVQILDRDGRVESIQINMEVEAVEGFSGHS 580
Query: 150 DK 151
DK
Sbjct: 581 DK 582
>gi|312381513|gb|EFR27247.1| hypothetical protein AND_06171 [Anopheles darlingi]
Length = 624
Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 27/107 (25%), Positives = 53/107 (49%), Gaps = 10/107 (9%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLV-STPDM-ECGFSRDL 91
I W + K+ K+F + N F FKH+K + + +P ++V +TP M G S +
Sbjct: 283 ITWTNQKIRKTF--VQRNMFDFKHIKPF----DKGYIDNPGAMVVFATPGMLHAGLSLQI 336
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
F +W + N +I+ GT+ + LGG + + + ++ + ++
Sbjct: 337 FKKWAPNENNMVIMPGYCVQGTVGHKI--LGGAKKVEFENRQVVEVK 381
>gi|157107341|ref|XP_001649735.1| cleavage and polyadenylation specificity factor [Aedes aegypti]
gi|108879612|gb|EAT43837.1| AAEL004757-PA [Aedes aegypti]
Length = 613
Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 27/107 (25%), Positives = 53/107 (49%), Gaps = 10/107 (9%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLV-STPDM-ECGFSRDL 91
I W + K+ K+F + N F FKH+K + + +P ++V +TP M G S +
Sbjct: 283 ITWTNQKIRKTF--VQRNMFDFKHIKPF----DKGYIDNPGAMVVFATPGMLHAGLSLQI 336
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
F +W + N +I+ GT+ + LGG + + + ++ + ++
Sbjct: 337 FKKWAPNENNMVIMPGYCVQGTVGHKI--LGGAKKVEFENRQVVEVK 381
>gi|384434275|ref|YP_005643633.1| KH-domain/beta-lactamase-domain-containing protein [Sulfolobus
solfataricus 98/2]
gi|261602429|gb|ACX92032.1| KH-domain/beta-lactamase-domain protein [Sulfolobus solfataricus
98/2]
Length = 635
Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 29/122 (23%), Positives = 50/122 (40%), Gaps = 6/122 (4%)
Query: 35 EWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQ 94
EW+ ++ + NPF +H K E P ++L ++ + G + + F
Sbjct: 458 EWLGREVREEILYKDENPFTSEHFKRIEGYKEDIAKGEPSIILATSGMLNGGPAVEFFKT 517
Query: 95 WCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKR-----IRLEGEELEEYQKKK 149
P+N+II + + GTL R + + G L R I +E E +E +
Sbjct: 518 MAPDPKNAIIFVSYQAEGTLGRKVRD-GAKEVQILDRDGRVESIQINMEVEAVEGFSGHS 576
Query: 150 DK 151
DK
Sbjct: 577 DK 578
>gi|121700651|ref|XP_001268590.1| cleavage and polyadenylation specifity factor, 73 kDa subunit,
putative [Aspergillus clavatus NRRL 1]
gi|119396733|gb|EAW07164.1| cleavage and polyadenylation specifity factor, 73 kDa subunit,
putative [Aspergillus clavatus NRRL 1]
Length = 878
Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 25/90 (27%), Positives = 45/90 (50%), Gaps = 9/90 (10%)
Query: 31 LAKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSR 89
+A+ E DK A P+ F+ V+ SL V V++++P M + G SR
Sbjct: 318 MAEAEASGDK------SASAGPWDFRFVRSLRSLERFDDVGG--CVMLASPGMLQTGTSR 369
Query: 90 DLFFQWCSSPENSIIITNRTSPGTLARDLI 119
+L +W + N +++T + GT+A+ L+
Sbjct: 370 ELLERWAPNERNGVVMTGYSVEGTMAKQLL 399
>gi|16082538|ref|NP_394086.1| metal-dependent RNase [Thermoplasma acidophilum DSM 1728]
Length = 639
Score = 40.0 bits (92), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 22/90 (24%), Positives = 45/90 (50%), Gaps = 3/90 (3%)
Query: 32 AKIEWMSDKLMKSFEGARNNPFH---FKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFS 88
A E+++ +L ++ + NPF FK V+ E+A+ P +VL ++ M G
Sbjct: 460 AYPEYLNKELREAIMVKKENPFLSPIFKKVETRDQREEIAENPETTIVLATSGMMNGGPV 519
Query: 89 RDLFFQWCSSPENSIIITNRTSPGTLARDL 118
+ F W + P+++++ + GTL + +
Sbjct: 520 MEYFKAWSADPKHTLVFVGYQADGTLGKKI 549
>gi|449686516|ref|XP_002164154.2| PREDICTED: integrator complex subunit 11-like [Hydra
magnipapillata]
Length = 526
Score = 40.0 bits (92), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 26/105 (24%), Positives = 45/105 (42%), Gaps = 6/105 (5%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ +F + N F FKH+K P P VV + + G S +F
Sbjct: 194 ITWTNQKIRNTF--VQRNLFDFKHIKAFDR--SYIDNPGPMVVFATPGMLHAGLSLQIFK 249
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
+W N++I+ GT+ + L G + + L K + ++
Sbjct: 250 KWAPDENNTLIMPGYCVIGTVGHKV--LSGQKKIELDKKTTVDVK 292
>gi|307170840|gb|EFN62951.1| Integrator complex subunit 11 [Camponotus floridanus]
Length = 595
Score = 40.0 bits (92), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 28/107 (26%), Positives = 53/107 (49%), Gaps = 10/107 (9%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLV-STPDM-ECGFSRDL 91
I W + K+ K+F + N F FKH+K + A + +P ++V +TP M G S +
Sbjct: 283 ITWTNQKIKKTF--VQRNMFEFKHIKPF----DKAYIDNPGAMVVFATPGMLHAGLSLQI 336
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
F +W + N +I+ GT+ + L G R + + ++ + ++
Sbjct: 337 FKKWAPNEANMVIMPGFCVQGTVGHKV--LNGTRRIEFENRQIVEVK 381
>gi|170052069|ref|XP_001862054.1| cleavage and polyadenylation specificity factor subunit 3 [Culex
quinquefasciatus]
gi|167873079|gb|EDS36462.1| cleavage and polyadenylation specificity factor subunit 3 [Culex
quinquefasciatus]
Length = 615
Score = 40.0 bits (92), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 27/107 (25%), Positives = 53/107 (49%), Gaps = 10/107 (9%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLV-STPDM-ECGFSRDL 91
I W + K+ K+F + N F FKH+K + + +P ++V +TP M G S +
Sbjct: 283 ITWTNQKIRKTF--VQRNMFDFKHIKPF----DKGYIDNPGAMVVFATPGMLHAGLSLQI 336
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
F +W + N +I+ GT+ + LGG + + + ++ + ++
Sbjct: 337 FKKWAPNENNMVIMPGYCVQGTVGHKI--LGGAKKVEFENRQVVEVK 381
>gi|258578481|ref|XP_002543422.1| predicted protein [Uncinocarpus reesii 1704]
gi|237903688|gb|EEP78089.1| predicted protein [Uncinocarpus reesii 1704]
Length = 875
Score = 40.0 bits (92), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 23/86 (26%), Positives = 43/86 (50%), Gaps = 3/86 (3%)
Query: 35 EWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFF 93
E M++ K + P+ FK V+ +L V V++++P M + G SR+L
Sbjct: 317 ERMAEAEAKGDKSTTAGPWDFKFVRSVRNLERFDDVGG--CVMLASPGMLQTGTSRELLE 374
Query: 94 QWCSSPENSIIITNRTSPGTLARDLI 119
+W S N +I+T + GT+ + ++
Sbjct: 375 RWAPSERNGVIMTGYSVEGTMGKQIL 400
>gi|19074744|ref|NP_586250.1| similarity to HYPOTHETICAL PROTEIN YO47_METJA [Encephalitozoon
cuniculi GB-M1]
gi|19069386|emb|CAD25854.1| similarity to HYPOTHETICAL PROTEIN YO47_METJA [Encephalitozoon
cuniculi GB-M1]
gi|449329879|gb|AGE96147.1| hypothetical protein ECU10_1350 [Encephalitozoon cuniculi]
Length = 496
Score = 39.7 bits (91), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 28/93 (30%), Positives = 44/93 (47%), Gaps = 5/93 (5%)
Query: 51 NPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFFQWCSSPENSIIITNRT 109
N F +KH+K +K P +VL ++P M G S +F +WC +N +II
Sbjct: 293 NMFEYKHIKPFQRHYMESKGP---MVLFASPGMLHSGMSLKIFKEWCEDEKNLVIIPGYC 349
Query: 110 SPGTLARDLIELGGNRTLTLQVKKRIRLEGEEL 142
GT+ ++ G + L + IRLE + L
Sbjct: 350 VRGTIGEKVLN-GATKLEILGESREIRLEVKNL 381
>gi|383859336|ref|XP_003705151.1| PREDICTED: integrator complex subunit 11-like isoform 1 [Megachile
rotundata]
Length = 595
Score = 39.7 bits (91), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 28/107 (26%), Positives = 54/107 (50%), Gaps = 10/107 (9%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLV-STPDM-ECGFSRDL 91
I W + K+ K+F + N F FKH+K + A + +P ++V +TP M G S +
Sbjct: 283 ITWTNQKIKKTF--VQRNMFDFKHIKPF----DKAYIDNPGAMVVFATPGMLHAGLSLQI 336
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
F +W + N +I+ GT+ + L G+R + + ++ + ++
Sbjct: 337 FKKWAPNEANMVIMPGFCVQGTVGHKV--LNGSRRIEFENRQIVEVK 381
>gi|328776642|ref|XP_003249190.1| PREDICTED: integrator complex subunit 11-like [Apis mellifera]
Length = 603
Score = 39.7 bits (91), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 28/107 (26%), Positives = 54/107 (50%), Gaps = 10/107 (9%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLV-STPDM-ECGFSRDL 91
I W + K+ K+F + N F FKH+K + A + +P ++V +TP M G S +
Sbjct: 283 ITWTNQKIKKTF--VQRNMFDFKHIKPF----DKAYIDNPGAMVVFATPGMLHAGLSLQI 336
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
F +W + N +I+ GT+ + L G+R + + ++ + ++
Sbjct: 337 FKKWAPNEANMVIMPGFCVQGTVGHKV--LNGSRRIEFENRQIVEVK 381
>gi|294656507|ref|XP_002770276.1| DEHA2D07304p [Debaryomyces hansenii CBS767]
gi|199431523|emb|CAR65632.1| DEHA2D07304p [Debaryomyces hansenii CBS767]
Length = 959
Score = 39.7 bits (91), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 42/157 (26%), Positives = 68/157 (43%), Gaps = 34/157 (21%)
Query: 34 IEWMSDKLMKSFEGARNN----------PFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM 83
+EWMS +L+K +E A + PF V L +EL ++ PK+V S D+
Sbjct: 286 LEWMSSQLVKEWEEASSVNNNSSNKNNFPFDPSKVDLLSDPSELVQLSGPKIVFCSGIDL 345
Query: 84 ECG-FSRDLFFQWCSSPENSIIITNRTSPG-------TLARDLIEL-----GGN--RTLT 128
+ G S + C + +I++T +T G L D L GG +
Sbjct: 346 KNGDMSSEALQYLCQDEKTTIVLTEKTHFGLDNTINSQLYHDWYNLTKQKQGGTVEDGVA 405
Query: 129 LQVKKRIRLE---------GEELEEYQKKKDKEAKDK 156
+ ++K I LE G EL ++Q+K + + K K
Sbjct: 406 VPLEKVISLENWNREEPLIGAELTDFQEKINLQRKQK 442
>gi|383859338|ref|XP_003705152.1| PREDICTED: integrator complex subunit 11-like isoform 2 [Megachile
rotundata]
Length = 494
Score = 39.7 bits (91), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 28/107 (26%), Positives = 54/107 (50%), Gaps = 10/107 (9%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLV-STPDM-ECGFSRDL 91
I W + K+ K+F + N F FKH+K + A + +P ++V +TP M G S +
Sbjct: 182 ITWTNQKIKKTF--VQRNMFDFKHIKPF----DKAYIDNPGAMVVFATPGMLHAGLSLQI 235
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
F +W + N +I+ GT+ + L G+R + + ++ + ++
Sbjct: 236 FKKWAPNEANMVIMPGFCVQGTVGHKV--LNGSRRIEFENRQIVEVK 280
>gi|385775961|ref|YP_005648529.1| beta-lactamase [Sulfolobus islandicus REY15A]
gi|323474709|gb|ADX85315.1| beta-lactamase domain protein [Sulfolobus islandicus REY15A]
Length = 638
Score = 39.7 bits (91), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 20/84 (23%), Positives = 37/84 (44%)
Query: 35 EWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQ 94
EW+ ++ + NPF +H K E P ++L ++ + G + + F
Sbjct: 461 EWLGREVREEILYKDENPFTSEHFKRIEGYKEDIAKGEPSIILATSGMLNGGPAVEFFKT 520
Query: 95 WCSSPENSIIITNRTSPGTLARDL 118
P+N+II + + GTL R +
Sbjct: 521 MAPDPKNAIIFVSYQAEGTLGRKV 544
>gi|380011463|ref|XP_003689822.1| PREDICTED: integrator complex subunit 11-like [Apis florea]
Length = 595
Score = 39.7 bits (91), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 28/107 (26%), Positives = 54/107 (50%), Gaps = 10/107 (9%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLV-STPDM-ECGFSRDL 91
I W + K+ K+F + N F FKH+K + A + +P ++V +TP M G S +
Sbjct: 283 ITWTNQKIKKTF--VQRNMFDFKHIKPF----DKAYIDNPGAMVVFATPGMLHAGLSLQI 336
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
F +W + N +I+ GT+ + L G+R + + ++ + ++
Sbjct: 337 FKKWAPNEANMVIMPGFCVQGTVGHKV--LNGSRRIEFENRQIVEVK 381
>gi|391330858|ref|XP_003739869.1| PREDICTED: integrator complex subunit 11-like [Metaseiulus
occidentalis]
Length = 601
Score = 39.7 bits (91), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 26/97 (26%), Positives = 42/97 (43%), Gaps = 6/97 (6%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ +F +N F FKH+K P P VV + + G S +F
Sbjct: 283 ITWTNQKIRNTF--VDHNMFDFKHIKPFDR--AYIDNPGPMVVFATPGMLHAGLSLQIFK 338
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQ 130
+W EN +I+ GT+ + L G R + ++
Sbjct: 339 KWAPFEENMVIMPGYCVSGTVGAKI--LNGARRVEIE 373
>gi|385773323|ref|YP_005645889.1| beta-lactamase [Sulfolobus islandicus HVE10/4]
gi|323477437|gb|ADX82675.1| beta-lactamase domain protein [Sulfolobus islandicus HVE10/4]
Length = 638
Score = 39.7 bits (91), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 20/84 (23%), Positives = 37/84 (44%)
Query: 35 EWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQ 94
EW+ ++ + NPF +H K E P ++L ++ + G + + F
Sbjct: 461 EWLGREVREEILYKDENPFTSEHFKRIEGYKEDIAKGEPSIILATSGMLNGGPAVEFFKT 520
Query: 95 WCSSPENSIIITNRTSPGTLARDL 118
P+N+II + + GTL R +
Sbjct: 521 MAPDPKNAIIFVSYQAEGTLGRKV 544
>gi|322790916|gb|EFZ15582.1| hypothetical protein SINV_14424 [Solenopsis invicta]
Length = 277
Score = 39.7 bits (91), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 28/107 (26%), Positives = 54/107 (50%), Gaps = 10/107 (9%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLV-STPDM-ECGFSRDL 91
I W + K+ K+F + N F FKH+K + A + +P ++V +TP M G S +
Sbjct: 60 ITWTNQKIKKTF--VQRNMFDFKHIKPF----DKAYIDNPGAMVVFATPGMLHAGLSLQI 113
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
F +W + N +I+ GT+ + L G+R + + ++ + ++
Sbjct: 114 FKKWAPNESNMVIMPGFCVQGTVGHKV--LNGSRRIEFENRQIVEVK 158
>gi|227827653|ref|YP_002829433.1| beta-lactamase [Sulfolobus islandicus M.14.25]
gi|229584857|ref|YP_002843359.1| beta-lactamase domain-containing protein [Sulfolobus islandicus
M.16.27]
gi|238619821|ref|YP_002914647.1| beta-lactamase domain-containing protein [Sulfolobus islandicus
M.16.4]
gi|227459449|gb|ACP38135.1| beta-lactamase domain protein [Sulfolobus islandicus M.14.25]
gi|228019907|gb|ACP55314.1| beta-lactamase domain protein [Sulfolobus islandicus M.16.27]
gi|238380891|gb|ACR41979.1| beta-lactamase domain protein [Sulfolobus islandicus M.16.4]
Length = 638
Score = 39.7 bits (91), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 20/84 (23%), Positives = 37/84 (44%)
Query: 35 EWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQ 94
EW+ ++ + NPF +H K E P ++L ++ + G + + F
Sbjct: 461 EWLGREVREEILYKDENPFTSEHFKRIEGYKEDIAKGEPSIILATSGMLNGGPAVEFFKT 520
Query: 95 WCSSPENSIIITNRTSPGTLARDL 118
P+N+II + + GTL R +
Sbjct: 521 MAPDPKNAIIFVSYQAEGTLGRKV 544
>gi|401404496|ref|XP_003881737.1| hypothetical protein NCLIV_014990 [Neospora caninum Liverpool]
gi|325116150|emb|CBZ51704.1| hypothetical protein NCLIV_014990 [Neospora caninum Liverpool]
Length = 1033
Score = 39.7 bits (91), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 24/72 (33%), Positives = 39/72 (54%), Gaps = 5/72 (6%)
Query: 51 NPFHFKHVKLCHSLAELAKV----PSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIIT 106
NPF F+ VK S+ E A+V P V++ + ++ G SR++F +N +I+T
Sbjct: 420 NPFAFRFVKNLKSV-ESARVYIHHDGPAVIMAAPGMLQSGASREIFEALAPESKNGVILT 478
Query: 107 NRTSPGTLARDL 118
+ GTLA +L
Sbjct: 479 GYSVKGTLADEL 490
>gi|307215032|gb|EFN89859.1| Integrator complex subunit 11 [Harpegnathos saltator]
Length = 594
Score = 39.7 bits (91), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 28/107 (26%), Positives = 53/107 (49%), Gaps = 10/107 (9%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLV-STPDM-ECGFSRDL 91
I W + K+ K+F + N F FKH+K + A + +P ++V +TP M G S +
Sbjct: 282 ITWTNQKIKKTF--VQRNMFDFKHIKPF----DKAYIDNPGAMVVFATPGMLHAGLSLQI 335
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
F +W + N +I+ GT+ + L G R + + ++ + ++
Sbjct: 336 FKKWAPNESNMVIMPGFCVQGTVGHKV--LNGTRRIEFENRQIVEVK 380
>gi|229579166|ref|YP_002837564.1| beta-lactamase domain-containing protein [Sulfolobus islandicus
Y.G.57.14]
gi|229582079|ref|YP_002840478.1| beta-lactamase domain-containing protein [Sulfolobus islandicus
Y.N.15.51]
gi|284997854|ref|YP_003419621.1| beta-lactamase domain-containing protein [Sulfolobus islandicus
L.D.8.5]
gi|228009880|gb|ACP45642.1| beta-lactamase domain protein [Sulfolobus islandicus Y.G.57.14]
gi|228012795|gb|ACP48556.1| beta-lactamase domain protein [Sulfolobus islandicus Y.N.15.51]
gi|284445749|gb|ADB87251.1| beta-lactamase domain protein [Sulfolobus islandicus L.D.8.5]
Length = 638
Score = 39.7 bits (91), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 20/84 (23%), Positives = 37/84 (44%)
Query: 35 EWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQ 94
EW+ ++ + NPF +H K E P ++L ++ + G + + F
Sbjct: 461 EWLGREVREEILYKDENPFTSEHFKRIEGYKEDIAKGEPSIILATSGMLNGGPAVEFFKT 520
Query: 95 WCSSPENSIIITNRTSPGTLARDL 118
P+N+II + + GTL R +
Sbjct: 521 MAPDPKNAIIFVSYQAEGTLGRKV 544
>gi|332030612|gb|EGI70300.1| Integrator complex subunit 11 [Acromyrmex echinatior]
Length = 536
Score = 39.7 bits (91), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 28/107 (26%), Positives = 53/107 (49%), Gaps = 10/107 (9%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLV-STPDM-ECGFSRDL 91
I W + K+ K+F + N F FKH+K + A + +P ++V +TP M G S +
Sbjct: 224 ITWTNQKIKKTF--VQRNMFDFKHIKPF----DKAYIDNPGAMVVFATPGMLHAGLSLQI 277
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
F +W + N +I+ GT+ + L G R + + ++ + ++
Sbjct: 278 FKKWAPNESNMVIMPGFCVQGTVGHKV--LNGTRRIEFENRQIVEVK 322
>gi|227830349|ref|YP_002832129.1| beta-lactamase domain-containing protein [Sulfolobus islandicus
L.S.2.15]
gi|227456797|gb|ACP35484.1| beta-lactamase domain protein [Sulfolobus islandicus L.S.2.15]
Length = 638
Score = 39.7 bits (91), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 20/84 (23%), Positives = 37/84 (44%)
Query: 35 EWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQ 94
EW+ ++ + NPF +H K E P ++L ++ + G + + F
Sbjct: 461 EWLGREVREEILYKDENPFTSEHFKRIEGYKEDIAKGEPSIILATSGMLNGGPAVEFFKT 520
Query: 95 WCSSPENSIIITNRTSPGTLARDL 118
P+N+II + + GTL R +
Sbjct: 521 MAPDPKNAIIFVSYQAEGTLGRKV 544
>gi|342319748|gb|EGU11695.1| Endoribonuclease YSH1 [Rhodotorula glutinis ATCC 204091]
Length = 857
Score = 39.7 bits (91), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 30/92 (32%), Positives = 44/92 (47%), Gaps = 9/92 (9%)
Query: 34 IEWMSDKLMKSFEGARN-NPFHFKHV-----KLCHSLAELAKVPSPKVVLVSTPDMECGF 87
I MS + F AR NPF FK L +++L +P VV+ S + G
Sbjct: 316 IHTMSPNVRSKF--ARGINPFDFKRKDSFIRPLDRGISKLNDR-NPCVVMASPGFLTSGV 372
Query: 88 SRDLFFQWCSSPENSIIITNRTSPGTLARDLI 119
SR+L +W P N +IIT + G +AR ++
Sbjct: 373 SRELLEKWAPDPRNGLIITGYSVEGVMARTIM 404
>gi|340728535|ref|XP_003402577.1| PREDICTED: integrator complex subunit 11-like [Bombus terrestris]
gi|350421011|ref|XP_003492700.1| PREDICTED: integrator complex subunit 11-like [Bombus impatiens]
Length = 595
Score = 39.7 bits (91), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 28/107 (26%), Positives = 54/107 (50%), Gaps = 10/107 (9%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLV-STPDM-ECGFSRDL 91
I W + K+ K+F + N F FKH+K + A + +P ++V +TP M G S +
Sbjct: 283 ITWTNQKIKKTF--VQRNMFDFKHIKPF----DKAYIDNPGAMVVFATPGMLHAGLSLQI 336
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
F +W + N +I+ GT+ + L G+R + + ++ + ++
Sbjct: 337 FKKWAPNEANMVIMPGFCVQGTVGHKV--LNGSRRIEFENRQIVEVK 381
>gi|320593246|gb|EFX05655.1| cleavage and polyadenylation specificity factor subunit [Grosmannia
clavigera kw1407]
Length = 857
Score = 39.7 bits (91), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 19/72 (26%), Positives = 40/72 (55%), Gaps = 3/72 (4%)
Query: 50 NNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFFQWCSSPENSIIITNR 108
P+ F+ ++ +L + V++++P M + G SR+L +W S +N +IIT
Sbjct: 330 GGPWDFRFIRSLKNLDRFEDIGG--CVMLASPGMLQNGVSRELLERWAPSDKNGVIITGY 387
Query: 109 TSPGTLARDLIE 120
+ GT+A+ +++
Sbjct: 388 SVEGTMAKQIMQ 399
>gi|358060736|dbj|GAA93507.1| hypothetical protein E5Q_00148 [Mixia osmundae IAM 14324]
Length = 1378
Score = 39.3 bits (90), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 27/73 (36%), Positives = 39/73 (53%), Gaps = 5/73 (6%)
Query: 50 NNPFHF---KHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFFQWCSSPENSIII 105
+NP+ F + VK S+ SP VV V+TP M + G SR+L +W N +I+
Sbjct: 343 DNPYDFGKGRFVKELRSIRRGVADKSPCVV-VATPGMLQPGTSRELLERWAGDRRNGLIL 401
Query: 106 TNRTSPGTLARDL 118
+ G+LARDL
Sbjct: 402 CGYSVEGSLARDL 414
>gi|119494361|ref|XP_001264076.1| cleavage and polyadenylation specifity factor, 73 kDa subunit,
putative [Neosartorya fischeri NRRL 181]
gi|119412238|gb|EAW22179.1| cleavage and polyadenylation specifity factor, 73 kDa subunit,
putative [Neosartorya fischeri NRRL 181]
Length = 878
Score = 39.3 bits (90), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 25/90 (27%), Positives = 45/90 (50%), Gaps = 9/90 (10%)
Query: 31 LAKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSR 89
+A+ E DK A P+ FK V+ SL + V++++P M + G SR
Sbjct: 319 MAEAEASGDK------SASAGPWDFKFVRSLRSLERFDDLGG--CVMLASPGMLQTGTSR 370
Query: 90 DLFFQWCSSPENSIIITNRTSPGTLARDLI 119
+L +W + N +++T + GT+A+ L+
Sbjct: 371 ELLERWAPNERNGVVMTGYSVEGTMAKQLL 400
>gi|158298905|ref|XP_319042.4| AGAP009923-PA [Anopheles gambiae str. PEST]
gi|157014111|gb|EAA13845.4| AGAP009923-PA [Anopheles gambiae str. PEST]
Length = 608
Score = 39.3 bits (90), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 27/104 (25%), Positives = 51/104 (49%), Gaps = 10/104 (9%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLV-STPDM-ECGFSRDL 91
I W + K+ K+F + N F FKH+K + + +P ++V +TP M G S +
Sbjct: 283 ITWTNQKIRKTF--VQRNMFDFKHIKPF----DKGYIDNPGAMVVFATPGMLHAGLSLQI 336
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRI 135
F +W + N +I+ GT+ + LGG + + + ++ +
Sbjct: 337 FKKWAPNENNMVIMPGYCVQGTVGHKI--LGGAKKVEFENRQVV 378
>gi|34762198|ref|ZP_00143205.1| hypothetical protein [Fusobacterium nucleatum subsp. vincentii ATCC
49256]
gi|27888159|gb|EAA25218.1| hypothetical protein [Fusobacterium nucleatum subsp. vincentii ATCC
49256]
Length = 209
Score = 39.3 bits (90), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 56/205 (27%), Positives = 90/205 (43%), Gaps = 43/205 (20%)
Query: 5 LMKSFE----GARNNPFHFKHVKLCHSLAELAKIEWMSDKLMKSFEGARNNPFHFKHVKL 60
+MK+ E G++ F F ++ + AKIE KL+ SF RNN F +
Sbjct: 7 VMKAVEEALAGSQLRKFDFASYEISDD--DKAKIEEQEVKLLNSFRKYRNNLFD-----I 59
Query: 61 CHSLAELAKVPSPKVVLVS-------TPDMECGFSR--DLFFQWCSSPENSI-------- 103
C SLAE+AK+ P ++ T DM F + DL+ W ++ I
Sbjct: 60 CSSLAEVAKLLKPSGSFMAWYESAGLTKDMVSVFLKRWDLYL-WQPDYKDKIFSLSDQAI 118
Query: 104 -IITNRT-----SPGTLARDLIELGGNRTLTLQVKK----RIRLEGEELEEYQKKKDKEA 153
II+N + G L D++++ + L Q KK +++E ++ + K K E
Sbjct: 119 KIISNNSLGLEVVKGILEADVLKVREIKKLLPQPKKEEKSEVKVERQKYFNFNKIKKMEK 178
Query: 154 KDK----QEKEKIPPHDTSFINELQ 174
+ K +EK + T +IN LQ
Sbjct: 179 RAKKLKDEEKTEYKKELTEYINSLQ 203
>gi|169767044|ref|XP_001817993.1| endoribonuclease ysh1 [Aspergillus oryzae RIB40]
gi|83765848|dbj|BAE55991.1| unnamed protein product [Aspergillus oryzae RIB40]
gi|391872741|gb|EIT81836.1| mRNA cleavage and polyadenylation factor II complex, BRR5
[Aspergillus oryzae 3.042]
Length = 870
Score = 39.3 bits (90), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 24/90 (26%), Positives = 46/90 (51%), Gaps = 9/90 (10%)
Query: 31 LAKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSR 89
+A+ E DK + + P+ F+ V+ SL V V++++P M + G SR
Sbjct: 319 MAEAEASGDKSISA------GPWDFRFVRSLRSLERFDDVGG--CVMLASPGMLQTGTSR 370
Query: 90 DLFFQWCSSPENSIIITNRTSPGTLARDLI 119
+L +W + N +++T + GT+A+ L+
Sbjct: 371 ELLERWAPNERNGVVMTGYSVEGTMAKQLL 400
>gi|238483863|ref|XP_002373170.1| cleavage and polyadenylation specifity factor, 73 kDa subunit
[Aspergillus flavus NRRL3357]
gi|220701220|gb|EED57558.1| cleavage and polyadenylation specifity factor, 73 kDa subunit
[Aspergillus flavus NRRL3357]
Length = 870
Score = 39.3 bits (90), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 24/90 (26%), Positives = 46/90 (51%), Gaps = 9/90 (10%)
Query: 31 LAKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSR 89
+A+ E DK + + P+ F+ V+ SL V V++++P M + G SR
Sbjct: 319 MAEAEASGDKSISA------GPWDFRFVRSLRSLERFDDVGG--CVMLASPGMLQTGTSR 370
Query: 90 DLFFQWCSSPENSIIITNRTSPGTLARDLI 119
+L +W + N +++T + GT+A+ L+
Sbjct: 371 ELLERWAPNERNGVVMTGYSVEGTMAKQLL 400
>gi|195341281|ref|XP_002037239.1| GM12816 [Drosophila sechellia]
gi|195574829|ref|XP_002105386.1| GD21460 [Drosophila simulans]
gi|194131355|gb|EDW53398.1| GM12816 [Drosophila sechellia]
gi|194201313|gb|EDX14889.1| GD21460 [Drosophila simulans]
Length = 597
Score = 39.3 bits (90), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 25/105 (23%), Positives = 46/105 (43%), Gaps = 6/105 (5%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F N F FKH+K P VV + + G S +F
Sbjct: 283 ITWTNQKIRKTF--VHRNMFDFKHIKPFDK--NYIDNPGAMVVFATPGMLHAGLSLQIFK 338
Query: 94 QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
+W + N +I+ GT+ + LGG + + + ++ + ++
Sbjct: 339 KWAPNENNMVIMPGYCVQGTVGNKI--LGGAKKVEFENRQVVEVK 381
>gi|403158620|ref|XP_003319317.2| hypothetical protein PGTG_01491 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375166386|gb|EFP74898.2| hypothetical protein PGTG_01491 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 778
Score = 39.3 bits (90), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 16/49 (32%), Positives = 30/49 (61%)
Query: 72 SPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIE 120
P VV+ S M+ G SR+L +W N++++T + GT+AR++++
Sbjct: 357 GPCVVMASPGFMQSGVSRELLERWAPDRRNALLVTGYSIEGTMAREMLK 405
>gi|300121266|emb|CBK21646.2| unnamed protein product [Blastocystis hominis]
Length = 400
Score = 38.9 bits (89), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 19/98 (19%), Positives = 45/98 (45%)
Query: 37 MSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWC 96
+++++++ A F + V + + + + KVV+ + P +E F++ L +WC
Sbjct: 291 LNERILQQSAEAERKAFDLQFVTCVSIVENVLESQASKVVIATLPGLETSFAQTLLKKWC 350
Query: 97 SSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKR 134
+ EN ++ P TL ++ T V+++
Sbjct: 351 TRSENLLLFVCSPPPDTLGYRILNSPEESTFEFIVREK 388
>gi|225679068|gb|EEH17352.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb03]
Length = 984
Score = 38.9 bits (89), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 18/59 (30%), Positives = 33/59 (55%), Gaps = 3/59 (5%)
Query: 51 NPFHFKHVKLCHSLAELAKV---PSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIIT 106
PF F+H+K+ +L K+ +P+V+L +E G+S+ + + + EN II+T
Sbjct: 361 GPFTFRHLKIVERKTKLDKILGSNAPQVILTPDTSLEWGYSKHVLQKIAAGSENLIILT 419
>gi|308198072|ref|XP_001387057.2| predicted protein [Scheffersomyces stipitis CBS 6054]
gi|149389019|gb|EAZ63034.2| predicted protein [Scheffersomyces stipitis CBS 6054]
Length = 934
Score = 38.9 bits (89), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 26/87 (29%), Positives = 42/87 (48%), Gaps = 7/87 (8%)
Query: 34 IEWMSDKLMKSFEGA------RNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECG- 86
I+WMS + +E A +N PF V L S EL ++ PK+V S D+ G
Sbjct: 286 IDWMSSSVQSQWEEAESSTNYKNLPFDPSKVDLLLSPEELIQLSGPKIVFCSGIDLRNGE 345
Query: 87 FSRDLFFQWCSSPENSIIITNRTSPGT 113
S + F C +++I++T ++ G
Sbjct: 346 LSAEAFQYLCQDEKSTILLTEKSLFGV 372
>gi|448124505|ref|XP_004204939.1| Piso0_000226 [Millerozyma farinosa CBS 7064]
gi|358249572|emb|CCE72638.1| Piso0_000226 [Millerozyma farinosa CBS 7064]
Length = 948
Score = 38.5 bits (88), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 40/157 (25%), Positives = 69/157 (43%), Gaps = 34/157 (21%)
Query: 34 IEWMSDKLMKSFEGARNN----------PFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM 83
+EWMS +L+K +E A ++ PF V L EL ++ PK+V S D
Sbjct: 286 LEWMSGQLIKEWEEASSSTNNSSNKNNFPFDPSKVDLLSDPNELIQLSGPKIVFCSGLDF 345
Query: 84 ECG-FSRDLFFQWCSSPENSIIITNRTSPGT-------LARDLIELGGNRT-------LT 128
+ G S ++ C + +II+T +T G+ L R+ EL R
Sbjct: 346 KDGDVSFEVLSYLCQDEKTTIILTEKTHFGSDDTINSQLYREWYELTKQRNGGLVEDGTA 405
Query: 129 LQVKKRIRLE---------GEELEEYQKKKDKEAKDK 156
+ ++K I L+ G EL ++Q++ ++ K +
Sbjct: 406 VPLEKIINLQHWTKEEPLIGTELSDFQERISQQRKQR 442
>gi|226288011|gb|EEH43524.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb18]
Length = 999
Score = 38.5 bits (88), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 23/79 (29%), Positives = 41/79 (51%), Gaps = 4/79 (5%)
Query: 51 NPFHFKHVKLCHSLAELAKV---PSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITN 107
PF F+H+K+ +L K+ +P+V+L +E G+S+ + + + EN II+T
Sbjct: 401 GPFTFRHLKIVERKTKLDKILGSNAPQVILTPDTSLEWGYSKHVLQKIAAGSENLIILTE 460
Query: 108 RTSPGTLARDLIELGGNRT 126
P + E GG+R+
Sbjct: 461 -PYPISANAQRTEDGGSRS 478
>gi|410730217|ref|XP_003671288.2| hypothetical protein NDAI_0G02680 [Naumovozyma dairenensis CBS 421]
gi|401780106|emb|CCD26045.2| hypothetical protein NDAI_0G02680 [Naumovozyma dairenensis CBS 421]
Length = 846
Score = 38.5 bits (88), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 49/179 (27%), Positives = 82/179 (45%), Gaps = 45/179 (25%)
Query: 9 FEGARNNPFHFKHVKLCHSLAELAK----------IEWMSDKLMKSFEGARNNPFHF--- 55
FE +N PF +KH+ + L +K +EW+S L+K++E +NP F
Sbjct: 269 FENGKN-PF-YKHLPVL--LVSYSKGRALTYTKSMLEWLSSSLLKTWESRSSNPSPFDLG 324
Query: 56 KHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSP-----ENSIIITNRTS 110
K+ S EL+K P+ K+ LVS D+ + C S + +I++T+ +
Sbjct: 325 NRFKVVTS-DELSKYPNSKICLVSNVDI---LLDETVAHLCDSKSQHQNKTTILLTSNMN 380
Query: 111 PGTLAR-------------DLIELGGNRTLTLQVKKRIRLEGEELEEY----QKKKDKE 152
G L DLI+ N+T+++ + L EEL EY +++K+KE
Sbjct: 381 NGILQNMKECWEEQKVKEGDLIKF--NKTISVHNIQLDPLNDEELSEYKSVLEERKNKE 437
>gi|71027889|ref|XP_763588.1| cleavage and polyadenylation specificity factor protein [Theileria
parva strain Muguga]
gi|68350541|gb|EAN31305.1| cleavage and polyadenylation specificity factor protein, putative
[Theileria parva]
Length = 708
Score = 38.5 bits (88), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 20/71 (28%), Positives = 36/71 (50%), Gaps = 3/71 (4%)
Query: 51 NPFHFKHVKLCHSLAELAKV---PSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITN 107
NPF+FK V+ S+ ++ P +++ S ++ G S ++F C N +++T
Sbjct: 312 NPFNFKFVRYARSIKQIRNYLLRDGPCIIMTSPGMLQGGPSLEVFELLCPDNRNGVVLTG 371
Query: 108 RTSPGTLARDL 118
GTLA +L
Sbjct: 372 YAVKGTLADEL 382
>gi|344301243|gb|EGW31555.1| hypothetical protein SPAPADRAFT_67601 [Spathaspora passalidarum
NRRL Y-27907]
Length = 1032
Score = 38.5 bits (88), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 38/158 (24%), Positives = 73/158 (46%), Gaps = 35/158 (22%)
Query: 34 IEWMSDKLMKSFE------------GARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTP 81
++WMS +KS++ + PF V L +EL ++ PK+V S
Sbjct: 286 LDWMSGSFIKSWDETSGDGGRGGGKALSSMPFDPSKVDLLLDPSELIQLSGPKIVFCSGI 345
Query: 82 DMECG-FSRDLFFQWCSSPENSIIITNRTS--PGTLARDLI---------ELGG------ 123
D++ G S + F C++ + ++I+T ++ G L L +LGG
Sbjct: 346 DIKSGDISSETFQYLCNNEKTTVILTEKSQLENGGLNSMLYKEWYELTKKKLGGKIEDGT 405
Query: 124 ----NRTLTLQVKKR-IRLEGEELEEYQKKKDKEAKDK 156
++T++++ R LEG EL ++Q++ ++ K+K
Sbjct: 406 AVPLDKTVSIEDWTRETNLEGRELSDFQERITQQRKEK 443
>gi|46360445|gb|AAS80153.1| ACT11D09.9 [Cucumis melo]
Length = 708
Score = 38.1 bits (87), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 29/88 (32%), Positives = 43/88 (48%), Gaps = 8/88 (9%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHV-KLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDL 91
I W S K+ +++ N F FK+V K S+ + P P VL +TP M GFS ++
Sbjct: 315 ISWTSQKVKETY--TTRNAFDFKNVQKFDRSMID---APGP-CVLFATPGMISSGFSLEV 368
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLI 119
F +W S N I + GT+ L+
Sbjct: 369 FKRWAPSKLNLITLPGYCVAGTVGHKLM 396
>gi|449460766|ref|XP_004148116.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
3-II-like [Cucumis sativus]
Length = 649
Score = 38.1 bits (87), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 29/88 (32%), Positives = 43/88 (48%), Gaps = 8/88 (9%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHV-KLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDL 91
I W S K+ +++ N F FK+V K S+ + P P VL +TP M GFS ++
Sbjct: 282 ISWTSQKVKETY--TTRNAFDFKNVQKFDRSMID---APGP-CVLFATPGMISSGFSLEV 335
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLI 119
F +W S N I + GT+ L+
Sbjct: 336 FKRWAPSKLNLITLPGYCVAGTVGHKLM 363
>gi|156546030|ref|XP_001608037.1| PREDICTED: integrator complex subunit 11-like isoform 1 [Nasonia
vitripennis]
gi|345498393|ref|XP_003428220.1| PREDICTED: integrator complex subunit 11-like isoform 2 [Nasonia
vitripennis]
gi|345498395|ref|XP_003428221.1| PREDICTED: integrator complex subunit 11-like isoform 3 [Nasonia
vitripennis]
Length = 595
Score = 38.1 bits (87), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 30/113 (26%), Positives = 51/113 (45%), Gaps = 18/113 (15%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLV-STPDM-ECGFSRDL 91
I W + K+ K+F + N F FKH+K + + + +P ++V +TP M G S +
Sbjct: 283 ITWTNQKIKKTF--VQRNMFDFKHIKPF----DKSYIDNPGAMVVFATPGMLHAGLSLQI 336
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELEE 144
F +W + N +I+ GT+ + L KRI E ++ E
Sbjct: 337 FKKWAPNEANMVIMPGFCVQGTVGHKV----------LNGAKRIEFENRQIVE 379
>gi|300706889|ref|XP_002995677.1| hypothetical protein NCER_101357 [Nosema ceranae BRL01]
gi|239604869|gb|EEQ82006.1| hypothetical protein NCER_101357 [Nosema ceranae BRL01]
Length = 500
Score = 38.1 bits (87), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 28/95 (29%), Positives = 46/95 (48%), Gaps = 7/95 (7%)
Query: 50 NNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFFQWCSSPENSIIITNR 108
+N F FKH +P +VL ++P M G S +F +WC+ P+N +II
Sbjct: 294 HNVFDFKHTTTFQKHFLDLNIP---MVLFASPGMLHSGMSLKVFKEWCTDPKNLVIIPGY 350
Query: 109 TSPGTLARDLIELGGNRTL-TLQVKKRIRLEGEEL 142
GT+ + L GN+ + L K I+++ + L
Sbjct: 351 CVKGTVGDKV--LNGNKEIEILGELKEIKIQVKNL 383
>gi|354543512|emb|CCE40231.1| hypothetical protein CPAR2_102690 [Candida parapsilosis]
Length = 938
Score = 38.1 bits (87), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 24/96 (25%), Positives = 47/96 (48%), Gaps = 5/96 (5%)
Query: 34 IEWMSDKLMKSF--EGARNN--PFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECG-FS 88
++WMS K++ +G N+ PF+ V L +EL + P K++ + D++ G S
Sbjct: 286 MDWMSSDFNKTWNTDGGNNSLLPFNPSKVDLLLDPSELTQTPGAKIIFCAGLDLKNGDLS 345
Query: 89 RDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGN 124
+F C+ ++I+T + S + + L G+
Sbjct: 346 SKVFSYLCNDERTTVILTEKPSSANVEAEGSGLSGD 381
>gi|407847992|gb|EKG03521.1| cleavage and polyadenylation specificity factor, putative
[Trypanosoma cruzi]
Length = 883
Score = 38.1 bits (87), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 26/108 (24%), Positives = 49/108 (45%), Gaps = 4/108 (3%)
Query: 55 FKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSP----ENSIIITNRTS 110
F +V C + E+ V PK+ + ++ G S +L + + EN +++T
Sbjct: 395 FANVVTCKTAEEVLSVAGPKICIADGETLDYGVSAELLGHFLQADADERENLVVLTGAPK 454
Query: 111 PGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKDKEAKDKQE 158
P T A + + L+ R L EELEEY + + E +++++
Sbjct: 455 PHTNAFTMAAAKKGDAIDLRYTIRSPLGKEELEEYYLQIELEMEEQRK 502
>gi|449518964|ref|XP_004166505.1| PREDICTED: LOW QUALITY PROTEIN: cleavage and polyadenylation
specificity factor subunit 3-II-like, partial [Cucumis
sativus]
Length = 472
Score = 38.1 bits (87), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 29/88 (32%), Positives = 43/88 (48%), Gaps = 8/88 (9%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHV-KLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDL 91
I W S K+ +++ N F FK+V K S+ + P P VL +TP M GFS ++
Sbjct: 126 ISWTSQKVKETY--TTRNAFDFKNVQKFDRSMID---APGP-CVLFATPGMISSGFSLEV 179
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLI 119
F +W S N I + GT+ L+
Sbjct: 180 FKRWAPSKLNLITLPGYCVAGTVGHKLM 207
>gi|350633583|gb|EHA21948.1| hypothetical protein ASPNIDRAFT_41125 [Aspergillus niger ATCC 1015]
Length = 1015
Score = 38.1 bits (87), Expect = 4.3, Method: Composition-based stats.
Identities = 24/103 (23%), Positives = 41/103 (39%), Gaps = 30/103 (29%)
Query: 34 IEWMSDKLMKSFEGARN---------------------------NPFHFKHVKLCHSLAE 66
+EWM + +++ FE A PF FKH+++
Sbjct: 349 LEWMDENIVREFEAAEGVDAATGQSNTEGQRAGQNQGKTEGKGVGPFTFKHLRILERKKR 408
Query: 67 LAKVPS---PKVVLVSTPDMECGFSRDLFFQWCSSPENSIIIT 106
L K+ S PKV+L S ++ GF++D N +++T
Sbjct: 409 LEKILSDQKPKVILASDTSLDWGFAKDSLRLVAEGANNLLLLT 451
>gi|317036117|ref|XP_001397647.2| cleavage and polyadenylylation specificity factor [Aspergillus
niger CBS 513.88]
Length = 1015
Score = 38.1 bits (87), Expect = 4.3, Method: Composition-based stats.
Identities = 24/103 (23%), Positives = 41/103 (39%), Gaps = 30/103 (29%)
Query: 34 IEWMSDKLMKSFEGARN---------------------------NPFHFKHVKLCHSLAE 66
+EWM + +++ FE A PF FKH+++
Sbjct: 349 LEWMDENIVREFEAAEGVDAATGQSNTEGQRAGQNQGKTEGKGVGPFTFKHLRILERKKR 408
Query: 67 LAKVPS---PKVVLVSTPDMECGFSRDLFFQWCSSPENSIIIT 106
L K+ S PKV+L S ++ GF++D N +++T
Sbjct: 409 LEKILSDQKPKVILASDTSLDWGFAKDSLRLVAEGANNLLLLT 451
>gi|358368318|dbj|GAA84935.1| cleavage and polyadenylylation specificity factor [Aspergillus
kawachii IFO 4308]
Length = 1015
Score = 38.1 bits (87), Expect = 4.5, Method: Composition-based stats.
Identities = 24/103 (23%), Positives = 41/103 (39%), Gaps = 30/103 (29%)
Query: 34 IEWMSDKLMKSFEGARN---------------------------NPFHFKHVKLCHSLAE 66
+EWM + +++ FE A PF FKH+++
Sbjct: 349 LEWMDENIVREFEAAEGVDAATGQSNTEGQRAGQNQGKAEGKGVGPFTFKHLRILERKKR 408
Query: 67 LAKVPS---PKVVLVSTPDMECGFSRDLFFQWCSSPENSIIIT 106
L K+ S PKV+L S ++ GF++D N +++T
Sbjct: 409 LEKILSDQKPKVILASDTSLDWGFAKDSLRLVAEGANNLLLLT 451
>gi|168034228|ref|XP_001769615.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162679157|gb|EDQ65608.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 563
Score = 38.1 bits (87), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 28/88 (31%), Positives = 41/88 (46%), Gaps = 8/88 (9%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSP-KVVLVSTPDM-ECGFSRDL 91
I W + K+ ++ N F FKHV E +K+ +P VL +TP M G S ++
Sbjct: 276 ISWTNQKVKDTY--VTRNTFDFKHV----IPFERSKIDAPGPCVLFATPGMLSGGLSLEV 329
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLI 119
F W S N II+ GT+ L+
Sbjct: 330 FKHWAPSESNMIILPGFCVAGTVGSKLM 357
>gi|357624104|gb|EHJ75000.1| hypothetical protein KGM_18742 [Danaus plexippus]
Length = 595
Score = 37.7 bits (86), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 24/88 (27%), Positives = 45/88 (51%), Gaps = 8/88 (9%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLV-STPDM-ECGFSRDL 91
I W + K+ K+F + N F FKH+K + + + +P ++V +TP M G S ++
Sbjct: 283 ITWTNQKIRKTF--VQRNMFDFKHIKPF----DKSYIDNPGAMVVFATPGMLHAGLSLNI 336
Query: 92 FFQWCSSPENSIIITNRTSPGTLARDLI 119
F +W +N +I+ GT+ ++
Sbjct: 337 FKKWAPYEQNMLIMPGFCVQGTVGHKIL 364
>gi|71656590|ref|XP_816840.1| cleavage and polyadenylation specificity factor [Trypanosoma cruzi
strain CL Brener]
gi|50363263|gb|AAT75334.1| cleavage polyadenylation specificity factor CPSF100 [Trypanosoma
cruzi]
gi|70881994|gb|EAN94989.1| cleavage and polyadenylation specificity factor, putative
[Trypanosoma cruzi]
Length = 802
Score = 37.7 bits (86), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 26/108 (24%), Positives = 49/108 (45%), Gaps = 4/108 (3%)
Query: 55 FKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSP----ENSIIITNRTS 110
F +V C + E+ V PK+ + ++ G S +L + + EN +++T
Sbjct: 314 FANVVTCKTAEEVLSVAGPKICIADGETLDYGVSAELLGHFLQADADERENLVVLTGAPK 373
Query: 111 PGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKDKEAKDKQE 158
P T A + + L+ R L EELEEY + + E +++++
Sbjct: 374 PHTNAFTMAAAKKGDAIDLRYTIRSPLGKEELEEYYLQIELEMEEQRK 421
>gi|448122146|ref|XP_004204382.1| Piso0_000226 [Millerozyma farinosa CBS 7064]
gi|358349921|emb|CCE73200.1| Piso0_000226 [Millerozyma farinosa CBS 7064]
Length = 948
Score = 37.7 bits (86), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 27/91 (29%), Positives = 44/91 (48%), Gaps = 11/91 (12%)
Query: 34 IEWMSDKLMKSFEGARNN----------PFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM 83
+EWMS +L+K +E A ++ PF V L EL ++ PK+V S D
Sbjct: 286 LEWMSGQLIKEWEEASSSTNNSSNKNNFPFDPSKVDLLSDPNELIQLSGPKIVFCSGLDF 345
Query: 84 ECG-FSRDLFFQWCSSPENSIIITNRTSPGT 113
+ G S ++ C + +II+T +T G+
Sbjct: 346 KDGDVSFEVLSYLCQDEKTTIILTEKTHFGS 376
>gi|14198352|gb|AAH08240.1| Cpsf3l protein [Mus musculus]
Length = 302
Score = 37.7 bits (86), Expect = 6.0, Method: Compositional matrix adjust.
Identities = 22/86 (25%), Positives = 40/86 (46%), Gaps = 4/86 (4%)
Query: 53 FHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPG 112
F FKH+K A P P VV + + G S +F +W + +N +I+ G
Sbjct: 2 FEFKHIKAFDR--TFADNPGPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIMPGYCVQG 59
Query: 113 TLARDLIELGGNRTLTLQVKKRIRLE 138
T+ + L G R L ++ ++ + ++
Sbjct: 60 TVGHKI--LSGQRKLEMEGRQMLEVK 83
>gi|156848581|ref|XP_001647172.1| hypothetical protein Kpol_1036p59 [Vanderwaltozyma polyspora DSM
70294]
gi|156117856|gb|EDO19314.1| hypothetical protein Kpol_1036p59 [Vanderwaltozyma polyspora DSM
70294]
Length = 821
Score = 37.4 bits (85), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 40/172 (23%), Positives = 75/172 (43%), Gaps = 28/172 (16%)
Query: 9 FEGARNNPFHFKHVKLCHSL------AELAKIEWMSDKLMKSFEGARN-NPFHFKHVKLC 61
+E +++ ++ HV L + +EW+S LMK++E N +PF
Sbjct: 266 YENSKSGSYNQTHVLLIAYFRGKVLTYARSMLEWLSSSLMKTWESRDNASPFDIGSKFKV 325
Query: 62 HSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTL------- 114
+E++ P KV VS D+ ++ + CS + ++++T+ + T
Sbjct: 326 IDPSEISNFPGSKVCFVSQVDI---LLNEVLTKLCSMNKTTVLMTSTNTNNTQILETMYE 382
Query: 115 ----ARDLIELGGNRTLT------LQVKKRIRLEGEELEEYQKKKDKEAKDK 156
A+ L +L T++ L++ L E+LEEY + KE +DK
Sbjct: 383 KWEKAKTLQKLQDGSTISFTDTVLLKIASYKPLVNEQLEEYNARL-KERRDK 433
>gi|407408786|gb|EKF32094.1| cleavage and polyadenylation specificity factor, putative
[Trypanosoma cruzi marinkellei]
Length = 632
Score = 37.4 bits (85), Expect = 6.9, Method: Compositional matrix adjust.
Identities = 26/108 (24%), Positives = 49/108 (45%), Gaps = 4/108 (3%)
Query: 55 FKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSP----ENSIIITNRTS 110
F +V C + E+ V PK+ + ++ G S +L + + EN +++T
Sbjct: 144 FSNVVTCKTAEEVLSVAGPKICIADGETLDYGVSAELLGHFLQADADERENLVVLTGAPK 203
Query: 111 PGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKDKEAKDKQE 158
P T A + + L+ R L EELEEY + + E +++++
Sbjct: 204 PHTNAFTMAAAKKGDAIDLRYTIRSPLGKEELEEYYLQLELEMEEQRK 251
>gi|291000374|ref|XP_002682754.1| predicted protein [Naegleria gruberi]
gi|284096382|gb|EFC50010.1| predicted protein [Naegleria gruberi]
Length = 458
Score = 37.4 bits (85), Expect = 7.0, Method: Compositional matrix adjust.
Identities = 22/87 (25%), Positives = 41/87 (47%), Gaps = 4/87 (4%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLF 92
I W ++K+ + + N F+F +++ L P P +VL +TP M G S ++F
Sbjct: 283 INWTNEKIKTTLFDQKRNLFNFSNIQSFERF--LMDNPGP-MVLFATPGMLHAGMSLEVF 339
Query: 93 FQWCSSPENSIIITNRTSPGTLARDLI 119
+W N +I+ GT+ ++
Sbjct: 340 KKWAPGENNKVILPGYCVEGTVGNKVL 366
>gi|440299064|gb|ELP91676.1| cleavage and polyadenylation specificity factor, putative
[Entamoeba invadens IP1]
Length = 566
Score = 37.4 bits (85), Expect = 7.2, Method: Compositional matrix adjust.
Identities = 28/122 (22%), Positives = 51/122 (41%), Gaps = 19/122 (15%)
Query: 86 GFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEY 145
GFS + F +P ++ IT+ + LTL RL+ L
Sbjct: 432 GFSNNRFEICPITPTKTVFITDELT--------------NVLTLS-----RLKEFNLGPI 472
Query: 146 QKKKDKEAKDKQEKEKIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAV 205
K A +KE+ H + ++ EL + + ++ GID + +G ++C +GTV +
Sbjct: 473 DAKVSDNALVPVKKEERSRHTSIYVGELPIKALTKQIEERGIDVKITEGTVVCAKGTVTI 532
Query: 206 RR 207
+
Sbjct: 533 SK 534
>gi|452981499|gb|EME81259.1| hypothetical protein MYCFIDRAFT_140021 [Pseudocercospora fijiensis
CIRAD86]
Length = 938
Score = 37.4 bits (85), Expect = 7.5, Method: Composition-based stats.
Identities = 26/99 (26%), Positives = 41/99 (41%), Gaps = 13/99 (13%)
Query: 34 IEWMSDKLMKSFEGARN----------NPFHFKHVKLCHSLAELAKV---PSPKVVLVST 80
+EWM D +++ E A N N +K VK + + SP ++L S
Sbjct: 327 LEWMDDTIIQDAERAMNKGQRDDDKAPNLLDWKFVKQIERQTQFDRALRRRSPCIMLASD 386
Query: 81 PDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLI 119
ME G+SR + + P N +++T S A I
Sbjct: 387 ASMEWGYSRQALEKLSADPRNLVVLTESISQSQAAHSSI 425
>gi|403222958|dbj|BAM41089.1| cleavage and polyadenylation specificty factor subunit [Theileria
orientalis strain Shintoku]
Length = 700
Score = 37.4 bits (85), Expect = 7.5, Method: Compositional matrix adjust.
Identities = 21/72 (29%), Positives = 37/72 (51%), Gaps = 3/72 (4%)
Query: 50 NNPFHFKHVKLCHSLAELAKV---PSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIIT 106
+NPF+FK VK S+ ++ P +++ S ++ G S ++F N +++T
Sbjct: 329 HNPFNFKFVKYARSVRQIRNYLLREGPCIIMTSPGMLQGGPSLEVFELISPDNRNGVVLT 388
Query: 107 NRTSPGTLARDL 118
T GTLA +L
Sbjct: 389 GYTVKGTLADEL 400
>gi|242013971|ref|XP_002427672.1| Endoribonuclease YSH1, putative [Pediculus humanus corporis]
gi|212512102|gb|EEB14934.1| Endoribonuclease YSH1, putative [Pediculus humanus corporis]
Length = 572
Score = 37.4 bits (85), Expect = 7.7, Method: Compositional matrix adjust.
Identities = 24/87 (27%), Positives = 40/87 (45%), Gaps = 6/87 (6%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVK-LCHSLAELAKVPSPKVVLVSTPDMECGFSRDLF 92
I W + K+ K+F + N F FKH+K S + A P VV + + G S +F
Sbjct: 260 ITWTNQKIRKTF--VQRNMFDFKHIKPFDRSYIDQA---WPMVVFATPGMLHAGLSLQIF 314
Query: 93 FQWCSSPENSIIITNRTSPGTLARDLI 119
+W + N +I+ GT+ ++
Sbjct: 315 KKWAPNENNMVIMPGFCVAGTVGHKIL 341
>gi|119576637|gb|EAW56233.1| cleavage and polyadenylation specific factor 3-like, isoform CRA_b
[Homo sapiens]
Length = 329
Score = 37.4 bits (85), Expect = 7.8, Method: Compositional matrix adjust.
Identities = 21/70 (30%), Positives = 33/70 (47%), Gaps = 4/70 (5%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K A P P VV + + G S +F
Sbjct: 261 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 316
Query: 94 QWCSSPENSI 103
+W + +N +
Sbjct: 317 KWAGNEKNMV 326
>gi|253742053|gb|EES98907.1| Cleavage and polyadenylation specificity factor, 73 kDa subunit
[Giardia intestinalis ATCC 50581]
Length = 757
Score = 37.4 bits (85), Expect = 8.1, Method: Compositional matrix adjust.
Identities = 36/129 (27%), Positives = 57/129 (44%), Gaps = 11/129 (8%)
Query: 69 KVPSPKVVLVSTPDM-ECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTL 127
K P V+ TP M + G S++++ + C SP+N +++T + TL L+E G
Sbjct: 353 KNPKKPYVMFCTPGMLQSGVSKEMYNELCGSPDNLLLVTGYATQDTLLYKLLE-GKPPGA 411
Query: 128 TLQVKKRIRLEGEELEEYQKKKDKEAKDKQEKEKIPPHDTSFIN--ELQLSDFKQTLQRN 185
K RI EEL + D K + P + FI+ E +S K+T+ N
Sbjct: 412 YADAKMRI----EELSFSAHSDYNQTLDVLRKTR--PRNVVFIHGSEKSISSLKRTIA-N 464
Query: 186 GIDCEFMDG 194
F +G
Sbjct: 465 ETRVSFSEG 473
>gi|47229058|emb|CAG03810.1| unnamed protein product [Tetraodon nigroviridis]
Length = 698
Score = 37.0 bits (84), Expect = 8.1, Method: Compositional matrix adjust.
Identities = 22/75 (29%), Positives = 34/75 (45%), Gaps = 4/75 (5%)
Query: 34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
I W + K+ K+F + N F FKH+K A P P VV + + G S +F
Sbjct: 351 ITWTNQKIRKTF--VQRNMFEFKHIKAFDR--SYADNPGPMVVFATPGMLHAGQSLQIFK 406
Query: 94 QWCSSPENSIIITNR 108
+W + +N + R
Sbjct: 407 KWAGNEKNMVQFLRR 421
>gi|300176709|emb|CBK24374.2| unnamed protein product [Blastocystis hominis]
Length = 711
Score = 37.0 bits (84), Expect = 8.2, Method: Compositional matrix adjust.
Identities = 30/91 (32%), Positives = 44/91 (48%), Gaps = 10/91 (10%)
Query: 51 NPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTS 110
+PF H + L + K+P P V+L S ME G SRD+ + S N +I+T
Sbjct: 287 SPFFSPHRQF---LRDFNKLP-PCVILCSPAMMENGTSRDVLERMASGENNLVILTGYCM 342
Query: 111 PGTLARDL------IELGGNRTLTLQVKKRI 135
T+AR L I L G + T+ VK ++
Sbjct: 343 ARTIARRLQDNDRIIPLSGKQMETIDVKCKV 373
>gi|159111399|ref|XP_001705931.1| Cleavage and polyadenylation specificity factor, 73 kDa subunit
[Giardia lamblia ATCC 50803]
gi|157434022|gb|EDO78257.1| Cleavage and polyadenylation specificity factor, 73 kDa subunit
[Giardia lamblia ATCC 50803]
Length = 757
Score = 37.0 bits (84), Expect = 8.4, Method: Compositional matrix adjust.
Identities = 36/129 (27%), Positives = 58/129 (44%), Gaps = 11/129 (8%)
Query: 69 KVPSPKVVLVSTPDM-ECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTL 127
K P V+ TP M + G S++++ + C SP+N +++T + TL L+E G
Sbjct: 353 KNPKKPYVMFCTPGMLQSGVSKEMYNELCGSPDNLLLVTGYATQDTLLYKLLE-GKPPGG 411
Query: 128 TLQVKKRIRLEGEELEEYQKKKDKEAKDKQEKEKIPPHDTSFIN--ELQLSDFKQTLQRN 185
K RI EEL + D K + P + +FI+ E +S K+T+ N
Sbjct: 412 YADAKMRI----EELSFSAHSDYNQTLDVLRKTR--PRNVAFIHGSEKSISSLKRTIA-N 464
Query: 186 GIDCEFMDG 194
F +G
Sbjct: 465 ETRVNFSEG 473
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.320 0.135 0.408
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,327,089,666
Number of Sequences: 23463169
Number of extensions: 174882336
Number of successful extensions: 763982
Number of sequences better than 100.0: 947
Number of HSP's better than 100.0 without gapping: 534
Number of HSP's successfully gapped in prelim test: 413
Number of HSP's that attempted gapping in prelim test: 761258
Number of HSP's gapped (non-prelim): 2616
length of query: 276
length of database: 8,064,228,071
effective HSP length: 140
effective length of query: 136
effective length of database: 9,074,351,707
effective search space: 1234111832152
effective search space used: 1234111832152
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 76 (33.9 bits)