BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= psy13345
         (276 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|345480428|ref|XP_001601407.2| PREDICTED: probable cleavage and polyadenylation specificity factor
           subunit 2-like [Nasonia vitripennis]
          Length = 739

 Score =  218 bits (554), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 103/139 (74%), Positives = 122/139 (87%), Gaps = 3/139 (2%)

Query: 24  LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTP 81
           + +++ E AK  IEWMSDKLMKSFEGARNNPF FKH++LCHS+AEL +VPSPKVVL STP
Sbjct: 272 VSYNVVEFAKSQIEWMSDKLMKSFEGARNNPFQFKHLQLCHSMAELNQVPSPKVVLASTP 331

Query: 82  DMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEE 141
           DMECGFSRDLF QWCS+P+NSIIIT+RTSPGTLARDL+E GGNR +TL++KK++RLEG E
Sbjct: 332 DMECGFSRDLFLQWCSNPQNSIIITSRTSPGTLARDLVENGGNRNITLEIKKKVRLEGAE 391

Query: 142 LEEYQKK-KDKEAKDKQEK 159
           LEEY KK K K+ + KQEK
Sbjct: 392 LEEYMKKEKVKQEQLKQEK 410



 Score = 79.7 bits (195), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 45/117 (38%), Positives = 60/117 (51%), Gaps = 39/117 (33%)

Query: 160 EKIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECG 219
           E I  + T+FINEL+LSDFKQ L +  I  EF  GVL CC  T+AVRR            
Sbjct: 662 EVIRVYQTTFINELKLSDFKQILTKANIPSEFSGGVLWCCNNTIAVRR------------ 709

Query: 220 FSRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                       ++  I++EGCLS++YYRV++LLY+QYAI+
Sbjct: 710 ---------------------------HEAGKIIMEGCLSEDYYRVKELLYEQYAIV 739



 Score = 70.9 bits (172), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 29/36 (80%), Positives = 33/36 (91%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
           +VVL STPDMECGFSRDLF QWCS+P+NSIIIT+R 
Sbjct: 324 KVVLASTPDMECGFSRDLFLQWCSNPQNSIIITSRT 359


>gi|332028657|gb|EGI68691.1| Putative cleavage and polyadenylation specificity factor subunit 2
           [Acromyrmex echinatior]
          Length = 737

 Score =  212 bits (540), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 99/139 (71%), Positives = 123/139 (88%), Gaps = 3/139 (2%)

Query: 24  LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTP 81
           + +++ E AK  IEWMSDKLM+SFEGARNNPF FKH++LCHS+AEL +VPSPKVVL STP
Sbjct: 272 VSYNVVEFAKSQIEWMSDKLMRSFEGARNNPFQFKHLQLCHSMAELNQVPSPKVVLASTP 331

Query: 82  DMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEE 141
           DMECGFSR+LF QWCS+P+NSII+T+RTSPGTLARDL+E GGNR +TL+VK+R++LEG E
Sbjct: 332 DMECGFSRELFLQWCSNPQNSIILTSRTSPGTLARDLVEKGGNRNITLEVKRRVKLEGIE 391

Query: 142 LEEYQKK-KDKEAKDKQEK 159
           LEEYQK+ K K+ + KQE+
Sbjct: 392 LEEYQKREKLKQEQLKQEQ 410



 Score = 84.0 bits (206), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 43/116 (37%), Positives = 62/116 (53%), Gaps = 39/116 (33%)

Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
           ++P H T+FINEL+LSDFKQ L ++ I  EF  GVL CC  T+AVRR             
Sbjct: 661 EVPGHQTTFINELKLSDFKQVLNKSNIPSEFSGGVLWCCNNTIAVRR------------- 707

Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                      ++  ++LEGC+S++YY+V++LLY+QYAI+
Sbjct: 708 --------------------------HEAGKVILEGCISEDYYKVRELLYEQYAIV 737



 Score = 67.8 bits (164), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 27/36 (75%), Positives = 33/36 (91%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
           +VVL STPDMECGFSR+LF QWCS+P+NSII+T+R 
Sbjct: 324 KVVLASTPDMECGFSRELFLQWCSNPQNSIILTSRT 359


>gi|340713940|ref|XP_003395491.1| PREDICTED: probable cleavage and polyadenylation specificity factor
           subunit 2-like isoform 1 [Bombus terrestris]
 gi|340713942|ref|XP_003395492.1| PREDICTED: probable cleavage and polyadenylation specificity factor
           subunit 2-like isoform 2 [Bombus terrestris]
          Length = 737

 Score =  211 bits (538), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 99/139 (71%), Positives = 122/139 (87%), Gaps = 3/139 (2%)

Query: 24  LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTP 81
           + +++ E AK  IEWMSDKLM+SFEGARNNPF FKH++LCHS+AEL +VPSPKVVL STP
Sbjct: 272 VSYNVVEFAKSQIEWMSDKLMRSFEGARNNPFQFKHLQLCHSMAELNQVPSPKVVLASTP 331

Query: 82  DMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEE 141
           DMECGFSR+LF QWC +P+NSII+T+RTSPGTLARDL+E GGNR +TL+VK+RI+LEG E
Sbjct: 332 DMECGFSRELFLQWCGNPQNSIILTSRTSPGTLARDLVEKGGNRNITLEVKRRIKLEGLE 391

Query: 142 LEEYQKK-KDKEAKDKQEK 159
           LEEYQ+K K K+ + KQE+
Sbjct: 392 LEEYQRKEKLKQEQLKQEQ 410



 Score = 83.6 bits (205), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 43/116 (37%), Positives = 62/116 (53%), Gaps = 39/116 (33%)

Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
           ++P H T+FINEL+LSDFKQ L ++ I  EF  GVL CC  T+AVRR             
Sbjct: 661 EVPGHQTTFINELKLSDFKQILNKSNIPSEFSGGVLWCCNNTIAVRR------------- 707

Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                      ++  ++LEGC+S++YY+V++LLY+QYAI+
Sbjct: 708 --------------------------HEAGKVILEGCISEDYYKVRELLYEQYAIV 737



 Score = 67.4 bits (163), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 36/82 (43%), Positives = 48/82 (58%), Gaps = 16/82 (19%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRNRAD---------DSDSNV------ 251
           +VVL STPDMECGFSR+LF QWC +P+NSII+T+R               + N+      
Sbjct: 324 KVVLASTPDMECGFSRELFLQWCGNPQNSIILTSRTSPGTLARDLVEKGGNRNITLEVKR 383

Query: 252 -IVLEGCLSDEYYRVQQLLYDQ 272
            I LEG   +EY R ++L  +Q
Sbjct: 384 RIKLEGLELEEYQRKEKLKQEQ 405


>gi|328780437|ref|XP_394940.3| PREDICTED: probable cleavage and polyadenylation specificity factor
           subunit 2 [Apis mellifera]
          Length = 730

 Score =  211 bits (538), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 99/139 (71%), Positives = 122/139 (87%), Gaps = 3/139 (2%)

Query: 24  LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTP 81
           + +++ E AK  IEWMSDKLM+SFEGARNNPF FKH++LCHS+AEL +VPSPKVVL STP
Sbjct: 272 VSYNVVEFAKSQIEWMSDKLMRSFEGARNNPFQFKHLQLCHSMAELNQVPSPKVVLASTP 331

Query: 82  DMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEE 141
           DMECGFSR+LF QWC +P+NSII+T+RTSPGTLARDL+E GGNR +TL+VK+RI+LEG E
Sbjct: 332 DMECGFSRELFLQWCGNPQNSIILTSRTSPGTLARDLVEKGGNRNITLEVKRRIKLEGLE 391

Query: 142 LEEYQKK-KDKEAKDKQEK 159
           LEEYQ+K K K+ + KQE+
Sbjct: 392 LEEYQRKEKLKQEQLKQEQ 410



 Score = 67.4 bits (163), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 36/82 (43%), Positives = 48/82 (58%), Gaps = 16/82 (19%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRNRAD---------DSDSNV------ 251
           +VVL STPDMECGFSR+LF QWC +P+NSII+T+R               + N+      
Sbjct: 324 KVVLASTPDMECGFSRELFLQWCGNPQNSIILTSRTSPGTLARDLVEKGGNRNITLEVKR 383

Query: 252 -IVLEGCLSDEYYRVQQLLYDQ 272
            I LEG   +EY R ++L  +Q
Sbjct: 384 RIKLEGLELEEYQRKEKLKQEQ 405



 Score = 65.9 bits (159), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 35/104 (33%), Positives = 50/104 (48%), Gaps = 39/104 (37%)

Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
           ++P H T+FINEL+LSDFKQ L ++ I  EF  GVL CC  T+AVRR             
Sbjct: 661 EVPGHQTTFINELKLSDFKQILNKSNIPSEFSGGVLWCCNNTIAVRR------------- 707

Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYR 264
                                      ++  ++LEGC+S++YY+
Sbjct: 708 --------------------------HEAGKVILEGCISEDYYK 725


>gi|350400562|ref|XP_003485880.1| PREDICTED: probable cleavage and polyadenylation specificity factor
           subunit 2-like [Bombus impatiens]
          Length = 737

 Score =  211 bits (538), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 99/139 (71%), Positives = 122/139 (87%), Gaps = 3/139 (2%)

Query: 24  LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTP 81
           + +++ E AK  IEWMSDKLM+SFEGARNNPF FKH++LCHS+AEL +VPSPKVVL STP
Sbjct: 272 VSYNVVEFAKSQIEWMSDKLMRSFEGARNNPFQFKHLQLCHSMAELNQVPSPKVVLASTP 331

Query: 82  DMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEE 141
           DMECGFSR+LF QWC +P+NSII+T+RTSPGTLARDL+E GGNR +TL+VK+RI+LEG E
Sbjct: 332 DMECGFSRELFLQWCGNPQNSIILTSRTSPGTLARDLVEKGGNRNITLEVKRRIKLEGLE 391

Query: 142 LEEYQKK-KDKEAKDKQEK 159
           LEEYQ+K K K+ + KQE+
Sbjct: 392 LEEYQRKEKLKQEQLKQEQ 410



 Score = 83.6 bits (205), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 43/116 (37%), Positives = 62/116 (53%), Gaps = 39/116 (33%)

Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
           ++P H T+FINEL+LSDFKQ L ++ I  EF  GVL CC  T+AVRR             
Sbjct: 661 EVPGHQTTFINELKLSDFKQILNKSNIPSEFSGGVLWCCNNTIAVRR------------- 707

Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                      ++  ++LEGC+S++YY+V++LLY+QYAI+
Sbjct: 708 --------------------------HEAGKVILEGCISEDYYKVRELLYEQYAIV 737



 Score = 67.4 bits (163), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 36/82 (43%), Positives = 48/82 (58%), Gaps = 16/82 (19%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRNRAD---------DSDSNV------ 251
           +VVL STPDMECGFSR+LF QWC +P+NSII+T+R               + N+      
Sbjct: 324 KVVLASTPDMECGFSRELFLQWCGNPQNSIILTSRTSPGTLARDLVEKGGNRNITLEVKR 383

Query: 252 -IVLEGCLSDEYYRVQQLLYDQ 272
            I LEG   +EY R ++L  +Q
Sbjct: 384 RIKLEGLELEEYQRKEKLKQEQ 405


>gi|380025109|ref|XP_003696322.1| PREDICTED: probable cleavage and polyadenylation specificity factor
           subunit 2-like [Apis florea]
          Length = 737

 Score =  211 bits (538), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 99/139 (71%), Positives = 122/139 (87%), Gaps = 3/139 (2%)

Query: 24  LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTP 81
           + +++ E AK  IEWMSDKLM+SFEGARNNPF FKH++LCHS+AEL +VPSPKVVL STP
Sbjct: 272 VSYNVVEFAKSQIEWMSDKLMRSFEGARNNPFQFKHLQLCHSMAELNQVPSPKVVLASTP 331

Query: 82  DMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEE 141
           DMECGFSR+LF QWC +P+NSII+T+RTSPGTLARDL+E GGNR +TL+VK+RI+LEG E
Sbjct: 332 DMECGFSRELFLQWCGNPQNSIILTSRTSPGTLARDLVEKGGNRNITLEVKRRIKLEGLE 391

Query: 142 LEEYQKK-KDKEAKDKQEK 159
           LEEYQ+K K K+ + KQE+
Sbjct: 392 LEEYQRKEKLKQEQLKQEQ 410



 Score = 83.6 bits (205), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 43/116 (37%), Positives = 62/116 (53%), Gaps = 39/116 (33%)

Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
           ++P H T+FINEL+LSDFKQ L ++ I  EF  GVL CC  T+AVRR             
Sbjct: 661 EVPGHQTTFINELKLSDFKQILNKSNIPSEFSGGVLWCCNNTIAVRR------------- 707

Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                      ++  ++LEGC+S++YY+V++LLY+QYAI+
Sbjct: 708 --------------------------HEAGKVILEGCISEDYYKVRELLYEQYAIV 737



 Score = 67.4 bits (163), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 36/82 (43%), Positives = 48/82 (58%), Gaps = 16/82 (19%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRNRAD---------DSDSNV------ 251
           +VVL STPDMECGFSR+LF QWC +P+NSII+T+R               + N+      
Sbjct: 324 KVVLASTPDMECGFSRELFLQWCGNPQNSIILTSRTSPGTLARDLVEKGGNRNITLEVKR 383

Query: 252 -IVLEGCLSDEYYRVQQLLYDQ 272
            I LEG   +EY R ++L  +Q
Sbjct: 384 RIKLEGLELEEYQRKEKLKQEQ 405


>gi|307189918|gb|EFN74154.1| Probable cleavage and polyadenylation specificity factor subunit 2
           [Camponotus floridanus]
          Length = 737

 Score =  210 bits (535), Expect = 4e-52,   Method: Compositional matrix adjust.
 Identities = 98/139 (70%), Positives = 121/139 (87%), Gaps = 3/139 (2%)

Query: 24  LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTP 81
           + +++ E AK  IEWMSDKLM+SFEGARNNPF FKH++LCHS+ EL +VPSPKVVL STP
Sbjct: 272 VSYNVVEFAKSQIEWMSDKLMRSFEGARNNPFQFKHLQLCHSMVELNQVPSPKVVLASTP 331

Query: 82  DMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEE 141
           DMECGFSR+LF QWC++P+NSIIIT+RTSPGTLARDL+E GGNR +TL VK+R++LEG E
Sbjct: 332 DMECGFSRELFLQWCTNPQNSIIITSRTSPGTLARDLVEKGGNRNITLDVKRRVKLEGIE 391

Query: 142 LEEYQKK-KDKEAKDKQEK 159
           LEEYQK+ K K+ + KQE+
Sbjct: 392 LEEYQKREKLKQEQMKQEQ 410



 Score = 82.4 bits (202), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 42/116 (36%), Positives = 62/116 (53%), Gaps = 39/116 (33%)

Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
           ++P H T+FINEL+LSDFKQ L ++ I  EF  GVL CC  T+AVRR             
Sbjct: 661 EVPGHQTTFINELKLSDFKQVLNKSNIPSEFSGGVLWCCNNTIAVRR------------- 707

Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                      ++  ++LEGC+S++YY+V++LL++QYAI+
Sbjct: 708 --------------------------HEAGKVILEGCISEDYYKVRELLFEQYAIV 737



 Score = 68.2 bits (165), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 27/35 (77%), Positives = 33/35 (94%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 241
           +VVL STPDMECGFSR+LF QWC++P+NSIIIT+R
Sbjct: 324 KVVLASTPDMECGFSRELFLQWCTNPQNSIIITSR 358


>gi|322783252|gb|EFZ10838.1| hypothetical protein SINV_80021 [Solenopsis invicta]
          Length = 737

 Score =  208 bits (530), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 98/139 (70%), Positives = 121/139 (87%), Gaps = 3/139 (2%)

Query: 24  LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTP 81
           + +++ E AK  IEWMSDKLM+SFEGARNNPF FKH++LCHS+AEL +VPSPKVVL STP
Sbjct: 272 VSYNVVEFAKSQIEWMSDKLMRSFEGARNNPFQFKHLQLCHSMAELNQVPSPKVVLASTP 331

Query: 82  DMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEE 141
           DMECGFSR+LF QWCS+ +NSII+T+RTSPGTLARDL+E GGNR +TL VK+R++LEG E
Sbjct: 332 DMECGFSRELFLQWCSNTQNSIILTSRTSPGTLARDLVEKGGNRNITLDVKRRVKLEGIE 391

Query: 142 LEEYQKK-KDKEAKDKQEK 159
           LEEYQK+ K K+ + KQE+
Sbjct: 392 LEEYQKREKLKQEQMKQEQ 410



 Score = 84.0 bits (206), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 43/116 (37%), Positives = 62/116 (53%), Gaps = 39/116 (33%)

Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
           ++P H T+FINEL+LSDFKQ L ++ I  EF  GVL CC  T+AVRR             
Sbjct: 661 EVPGHQTTFINELKLSDFKQVLNKSNIPSEFSGGVLWCCNNTIAVRR------------- 707

Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                      ++  ++LEGC+S++YY+V++LLY+QYAI+
Sbjct: 708 --------------------------HEAGKVILEGCISEDYYKVRELLYEQYAIV 737



 Score = 64.7 bits (156), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 26/36 (72%), Positives = 32/36 (88%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
           +VVL STPDMECGFSR+LF QWCS+ +NSII+T+R 
Sbjct: 324 KVVLASTPDMECGFSRELFLQWCSNTQNSIILTSRT 359


>gi|383852782|ref|XP_003701904.1| PREDICTED: probable cleavage and polyadenylation specificity factor
           subunit 2-like [Megachile rotundata]
          Length = 737

 Score =  208 bits (530), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 98/139 (70%), Positives = 121/139 (87%), Gaps = 3/139 (2%)

Query: 24  LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTP 81
           + +++ E AK  IEWMSDKLM+SFEGARNNPF FKH++LCHS+AEL +VPSPKVVL STP
Sbjct: 272 VSYNVVEFAKSQIEWMSDKLMRSFEGARNNPFQFKHLQLCHSMAELNQVPSPKVVLASTP 331

Query: 82  DMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEE 141
           DMECGFSR+LF QWC + +NSII+T+RTSPGTLARDL+E GGNR +TL+VK+RI+LEG E
Sbjct: 332 DMECGFSRELFLQWCGNSQNSIILTSRTSPGTLARDLVEKGGNRNITLEVKRRIKLEGLE 391

Query: 142 LEEYQKK-KDKEAKDKQEK 159
           LEEYQ+K K K+ + KQE+
Sbjct: 392 LEEYQRKEKLKQEQLKQEQ 410



 Score = 83.6 bits (205), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 43/116 (37%), Positives = 62/116 (53%), Gaps = 39/116 (33%)

Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
           ++P H T+FINEL+LSDFKQ L ++ I  EF  GVL CC  T+AVRR             
Sbjct: 661 EVPGHQTTFINELKLSDFKQILNKSNIPSEFSGGVLWCCNNTIAVRR------------- 707

Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                      ++  ++LEGC+S++YY+V++LLY+QYAI+
Sbjct: 708 --------------------------HEAGKVILEGCISEDYYKVRELLYEQYAIV 737



 Score = 63.9 bits (154), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 35/82 (42%), Positives = 47/82 (57%), Gaps = 16/82 (19%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRNRAD---------DSDSNV------ 251
           +VVL STPDMECGFSR+LF QWC + +NSII+T+R               + N+      
Sbjct: 324 KVVLASTPDMECGFSRELFLQWCGNSQNSIILTSRTSPGTLARDLVEKGGNRNITLEVKR 383

Query: 252 -IVLEGCLSDEYYRVQQLLYDQ 272
            I LEG   +EY R ++L  +Q
Sbjct: 384 RIKLEGLELEEYQRKEKLKQEQ 405


>gi|270010824|gb|EFA07272.1| hypothetical protein TcasGA2_TC014506 [Tribolium castaneum]
          Length = 733

 Score =  197 bits (501), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 91/128 (71%), Positives = 108/128 (84%), Gaps = 2/128 (1%)

Query: 24  LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTP 81
           + +++ E AK  IEWMSDKLM+SFEGARNNPF FKH++LCHSL EL KV SPKVVL S+P
Sbjct: 272 VSYNVVEFAKSQIEWMSDKLMRSFEGARNNPFQFKHLQLCHSLHELQKVSSPKVVLASSP 331

Query: 82  DMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEE 141
           DME GFSR+LF QWCS+P NSIIIT RTSPGTLARDL++ GGNR + L VK+R++LEG E
Sbjct: 332 DMESGFSRELFLQWCSNPNNSIIITTRTSPGTLARDLVDNGGNRQIDLVVKRRVKLEGSE 391

Query: 142 LEEYQKKK 149
           LEEYQK +
Sbjct: 392 LEEYQKSQ 399



 Score = 84.0 bits (206), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 43/113 (38%), Positives = 62/113 (54%), Gaps = 39/113 (34%)

Query: 164 PHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGFSRD 223
           PHDT FINEL+LS+FKQ L ++ I+ EF  GVL C  GT+A+RRV               
Sbjct: 660 PHDTVFINELKLSEFKQILAKSNINSEFSGGVLWCSNGTLAIRRV--------------- 704

Query: 224 LFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                   ++  ++LEGC+S++YY+V++LLY+QYA+L
Sbjct: 705 ------------------------ETGRVILEGCISEDYYKVKELLYEQYAVL 733



 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 27/39 (69%), Positives = 32/39 (82%)

Query: 203 VAVRRVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 241
           V+  +VVL S+PDME GFSR+LF QWCS+P NSIIIT R
Sbjct: 320 VSSPKVVLASSPDMESGFSRELFLQWCSNPNNSIIITTR 358


>gi|242021798|ref|XP_002431330.1| Cleavage and polyadenylation specificity factor 100 kDa subunit,
           putative [Pediculus humanus corporis]
 gi|212516598|gb|EEB18592.1| Cleavage and polyadenylation specificity factor 100 kDa subunit,
           putative [Pediculus humanus corporis]
          Length = 731

 Score =  194 bits (493), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 89/126 (70%), Positives = 110/126 (87%), Gaps = 2/126 (1%)

Query: 24  LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTP 81
           + ++  E AK  IEWMS+KLM+SFEGARNNPF FK+V+LCHS +EL+KVPSPKVVL STP
Sbjct: 272 VSYNTVEFAKSQIEWMSEKLMRSFEGARNNPFQFKYVQLCHSFSELSKVPSPKVVLASTP 331

Query: 82  DMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEE 141
           DME GFSR+LF QW S+P NSII+T+RTSPGTLARDLIE GG+R +++++KKR++LEGEE
Sbjct: 332 DMESGFSRELFLQWSSNPLNSIILTSRTSPGTLARDLIENGGDRIISIEIKKRVKLEGEE 391

Query: 142 LEEYQK 147
           LEEY K
Sbjct: 392 LEEYFK 397



 Score = 90.5 bits (223), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 48/117 (41%), Positives = 64/117 (54%), Gaps = 39/117 (33%)

Query: 160 EKIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECG 219
           E IP H+TS+INEL+LSDFKQ L +N I+CEF  GVL CC G+VAVRR            
Sbjct: 654 EDIPGHETSYINELKLSDFKQILNKNNINCEFSGGVLRCCHGSVAVRR------------ 701

Query: 220 FSRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                       ++  ++LEGCLS++YY+V++LL  QYAI+
Sbjct: 702 ---------------------------HEAGRVILEGCLSEDYYKVKELLCQQYAIV 731



 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 25/36 (69%), Positives = 30/36 (83%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
           +VVL STPDME GFSR+LF QW S+P NSII+T+R 
Sbjct: 324 KVVLASTPDMESGFSRELFLQWSSNPLNSIILTSRT 359


>gi|346465041|gb|AEO32365.1| hypothetical protein [Amblyomma maculatum]
          Length = 644

 Score =  192 bits (488), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 86/140 (61%), Positives = 112/140 (80%), Gaps = 2/140 (1%)

Query: 24  LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTP 81
           + +++ E AK  +EWMSDK+M+SFEGARNNPF F+H++LCH LAELA+VP PKVVL S  
Sbjct: 180 VSYNVVEFAKSQVEWMSDKVMRSFEGARNNPFQFRHLQLCHGLAELARVPEPKVVLASMA 239

Query: 82  DMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEE 141
           DMECGFSRDLF QWCSSP NS+++T+RT+PGTLAR LIE    + LT+ +KKR+RLEG E
Sbjct: 240 DMECGFSRDLFIQWCSSPRNSVVLTSRTAPGTLARQLIENPHQQALTITMKKRVRLEGSE 299

Query: 142 LEEYQKKKDKEAKDKQEKEK 161
           LEEY +K+ + A  + + E+
Sbjct: 300 LEEYMRKEKELAAARHKAER 319



 Score = 72.0 bits (175), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 41/114 (35%), Positives = 57/114 (50%), Gaps = 40/114 (35%)

Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
           ++P H T FINE++LSDFKQ L RNG+  EF  GVL  C G VAVRR             
Sbjct: 571 QVPGHATIFINEIKLSDFKQVLLRNGVQAEFSGGVLY-CNGIVAVRR------------- 616

Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYA 274
                                     +++  I +EGCL ++Y++V+++LY QYA
Sbjct: 617 --------------------------NEAGRINIEGCLCEDYFKVREILYQQYA 644



 Score = 64.3 bits (155), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 29/52 (55%), Positives = 34/52 (65%), Gaps = 5/52 (9%)

Query: 196 LICCRGTVAVRRV-----VLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
           L  C G   + RV     VL S  DMECGFSRDLF QWCSSP NS+++T+R 
Sbjct: 216 LQLCHGLAELARVPEPKVVLASMADMECGFSRDLFIQWCSSPRNSVVLTSRT 267


>gi|427789025|gb|JAA59964.1| Putative mrna cleavage and polyadenylation factor ii complex
           subunit cft2 cpsf subunit [Rhipicephalus pulchellus]
          Length = 646

 Score =  190 bits (482), Expect = 7e-46,   Method: Compositional matrix adjust.
 Identities = 84/140 (60%), Positives = 113/140 (80%), Gaps = 2/140 (1%)

Query: 24  LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTP 81
           + +++ E AK  +EWMSDK+M+SFEGARNNPF F+H++LCH +AELA+VP PKVVL S  
Sbjct: 180 VSYNVVEFAKSQVEWMSDKVMRSFEGARNNPFQFRHLQLCHGMAELARVPEPKVVLASMA 239

Query: 82  DMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEE 141
           DMECGFSR+LF QWCSSP NS+++T+R++PGTLAR LIE    ++LT+ VKKR+RLEG E
Sbjct: 240 DMECGFSRELFIQWCSSPRNSVVLTSRSAPGTLARQLIENPHQQSLTITVKKRVRLEGSE 299

Query: 142 LEEYQKKKDKEAKDKQEKEK 161
           LEEY +K+ + A  + + E+
Sbjct: 300 LEEYMRKEKELAAARHKAER 319



 Score = 75.1 bits (183), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 42/116 (36%), Positives = 59/116 (50%), Gaps = 40/116 (34%)

Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
           ++P H T F+NEL+LSDFKQ L RNG+  EF  GVL  C G VAVRR             
Sbjct: 571 QVPGHATIFVNELKLSDFKQVLLRNGVQAEFSGGVLY-CNGIVAVRR------------- 616

Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                     +++  I +EGCL ++Y++V+++LY QYAI+
Sbjct: 617 --------------------------NEAGRINIEGCLCEDYFKVREILYQQYAII 646



 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 28/52 (53%), Positives = 35/52 (67%), Gaps = 5/52 (9%)

Query: 196 LICCRGTVAVRRV-----VLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
           L  C G   + RV     VL S  DMECGFSR+LF QWCSSP NS+++T+R+
Sbjct: 216 LQLCHGMAELARVPEPKVVLASMADMECGFSRELFIQWCSSPRNSVVLTSRS 267


>gi|157112944|ref|XP_001657690.1| cleavage and polyadenylation specificity factor [Aedes aegypti]
 gi|108884656|gb|EAT48881.1| AAEL000118-PA [Aedes aegypti]
          Length = 744

 Score =  183 bits (464), Expect = 7e-44,   Method: Compositional matrix adjust.
 Identities = 88/127 (69%), Positives = 105/127 (82%), Gaps = 3/127 (2%)

Query: 24  LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTP 81
           + +++ E AK  IEWMSDKLMKSFEGARNNPF FKH++LCH++AELAKVPSPKVVL S+ 
Sbjct: 272 VSYNVVEFAKSQIEWMSDKLMKSFEGARNNPFQFKHLRLCHTMAELAKVPSPKVVLASSA 331

Query: 82  DMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGN-RTLTLQVKKRIRLEGE 140
           DME GFSR+LF QW S+  NSIIIT R+SPGTLARDLIE GGN R + L V++R+ LEG 
Sbjct: 332 DMESGFSRELFVQWASNVNNSIIITCRSSPGTLARDLIENGGNGRKIELDVRRRVELEGA 391

Query: 141 ELEEYQK 147
           ELEEY +
Sbjct: 392 ELEEYMR 398



 Score = 79.7 bits (195), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 43/117 (36%), Positives = 60/117 (51%), Gaps = 39/117 (33%)

Query: 160 EKIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECG 219
           +++P H + FINEL+L DFKQ L +  I  EF  GVL C  GTVA+RRV           
Sbjct: 667 DELPAHHSVFINELKLIDFKQVLMKANISSEFSGGVLWCNNGTVALRRV----------- 715

Query: 220 FSRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                       D+  + +EGCLS+EYY++++LLY+QYAI+
Sbjct: 716 ----------------------------DTGKVTVEGCLSEEYYKIRELLYEQYAIV 744



 Score = 50.8 bits (120), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 23/36 (63%), Positives = 28/36 (77%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
           +VVL S+ DME GFSR+LF QW S+  NSIIIT R+
Sbjct: 324 KVVLASSADMESGFSRELFVQWASNVNNSIIITCRS 359


>gi|170046825|ref|XP_001850949.1| cleavage and polyadenylation specificity factor subunit 2 [Culex
           quinquefasciatus]
 gi|167869453|gb|EDS32836.1| cleavage and polyadenylation specificity factor subunit 2 [Culex
           quinquefasciatus]
          Length = 747

 Score =  182 bits (462), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 86/127 (67%), Positives = 105/127 (82%), Gaps = 3/127 (2%)

Query: 24  LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTP 81
           + +++ E AK  IEWMSDKLMKSFEGARNNPF FKH++LCH++A+LAKVPSPKVVL S+P
Sbjct: 272 VSYNVVEFAKSQIEWMSDKLMKSFEGARNNPFQFKHLRLCHTMADLAKVPSPKVVLASSP 331

Query: 82  DMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGN-RTLTLQVKKRIRLEGE 140
           DME GFSR+LF QW  +  NSIIIT R+SPGTLARDLI+ GGN R L L V++R+ LEG 
Sbjct: 332 DMESGFSRELFVQWAGNVNNSIIITCRSSPGTLARDLIDNGGNGRKLELDVRRRVELEGA 391

Query: 141 ELEEYQK 147
           EL+EY +
Sbjct: 392 ELDEYMR 398



 Score = 80.9 bits (198), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 46/142 (32%), Positives = 73/142 (51%), Gaps = 41/142 (28%)

Query: 137 LEGEELEEYQKKKDKE--AKDKQEKEKIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDG 194
           ++ +++E  + K DK+    D    +++P H++ FINEL+L DFKQ L +  I  EF  G
Sbjct: 645 MDVDQVEITEDKSDKQILTLDPLLNDQLPAHNSVFINELKLIDFKQVLMKANIASEFSGG 704

Query: 195 VLICCRGTVAVRRVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRNRADDSDSNVIVL 254
           VL C  GT+A+RR+                                       D+  + +
Sbjct: 705 VLWCSNGTLALRRI---------------------------------------DTGKVTI 725

Query: 255 EGCLSDEYYRVQQLLYDQYAIL 276
           EGCLS++YYR+++LLY+QYAI+
Sbjct: 726 EGCLSEDYYRIRELLYEQYAIV 747



 Score = 53.1 bits (126), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 32/77 (41%), Positives = 40/77 (51%), Gaps = 17/77 (22%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRNRAD-------DSDSNV-------- 251
           +VVL S+PDME GFSR+LF QW  +  NSIIIT R+          D+  N         
Sbjct: 324 KVVLASSPDMESGFSRELFVQWAGNVNNSIIITCRSSPGTLARDLIDNGGNGRKLELDVR 383

Query: 252 --IVLEGCLSDEYYRVQ 266
             + LEG   DEY R +
Sbjct: 384 RRVELEGAELDEYMRTE 400


>gi|158290938|ref|XP_312464.4| AGAP002474-PA [Anopheles gambiae str. PEST]
 gi|157018137|gb|EAA08192.4| AGAP002474-PA [Anopheles gambiae str. PEST]
          Length = 745

 Score =  182 bits (462), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 84/126 (66%), Positives = 105/126 (83%), Gaps = 3/126 (2%)

Query: 25  CHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPD 82
            +++ E AK  IEWMSDKLMKSFEGARNNPF FKH++LCH++A+LAKVPSPKVVL S+PD
Sbjct: 273 SYNVVEFAKSQIEWMSDKLMKSFEGARNNPFTFKHLRLCHTMADLAKVPSPKVVLASSPD 332

Query: 83  MECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGN-RTLTLQVKKRIRLEGEE 141
           +E GFSR+LF QW  +  NSIIIT+R+SPGTLARDLIE GGN R + + +++R+ LEG E
Sbjct: 333 LESGFSRELFIQWAPNASNSIIITSRSSPGTLARDLIENGGNGRKIEMDIRRRVELEGAE 392

Query: 142 LEEYQK 147
           LEEY +
Sbjct: 393 LEEYMR 398



 Score = 84.0 bits (206), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 50/147 (34%), Positives = 76/147 (51%), Gaps = 42/147 (28%)

Query: 133 KRI-RLEGEELEEYQKKKDKE--AKDKQEKEKIPPHDTSFINELQLSDFKQTLQRNGIDC 189
           KRI  +E ++++    K DK+    +   +E +PPH+  FINEL+L DFKQ L ++ I  
Sbjct: 638 KRIDTMEVDDVDTIDDKMDKQILTLEPLAQEDLPPHNPVFINELKLIDFKQILMKSNIAS 697

Query: 190 EFMDGVLICCRGTVAVRRVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRNRADDSDS 249
           EF  GVL C  GTVA+RRV                                       D+
Sbjct: 698 EFSGGVLWCSNGTVALRRV---------------------------------------DT 718

Query: 250 NVIVLEGCLSDEYYRVQQLLYDQYAIL 276
             + +EGC+S++YY++++LLY+QYAI+
Sbjct: 719 GRVTIEGCISEDYYKIRELLYEQYAII 745



 Score = 53.5 bits (127), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 22/36 (61%), Positives = 29/36 (80%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
           +VVL S+PD+E GFSR+LF QW  +  NSIIIT+R+
Sbjct: 324 KVVLASSPDLESGFSRELFIQWAPNASNSIIITSRS 359


>gi|312375001|gb|EFR22454.1| hypothetical protein AND_15244 [Anopheles darlingi]
          Length = 772

 Score =  182 bits (462), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 85/127 (66%), Positives = 104/127 (81%), Gaps = 3/127 (2%)

Query: 24  LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTP 81
           + +++ E AK  IEWMSDKLMKSFEGARNNPF FKH++LCH++A+LAKVPSPKVVL S+ 
Sbjct: 272 VSYNVVEFAKSQIEWMSDKLMKSFEGARNNPFTFKHLRLCHTMADLAKVPSPKVVLASSA 331

Query: 82  DMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGN-RTLTLQVKKRIRLEGE 140
           DME GFSR+LF QW     NSIIITNR+SPGTLARDLI+ GGN R + + V++R+ LEG 
Sbjct: 332 DMESGFSRELFIQWAPQATNSIIITNRSSPGTLARDLIDNGGNGRKIEMDVRRRVELEGA 391

Query: 141 ELEEYQK 147
           ELEEY +
Sbjct: 392 ELEEYMR 398



 Score = 85.5 bits (210), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 44/117 (37%), Positives = 62/117 (52%), Gaps = 39/117 (33%)

Query: 160 EKIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECG 219
           E++PPH+  FINEL+L DFKQ L R+ I  EF  GVL C  GTVA+RRV           
Sbjct: 695 EELPPHNPVFINELKLIDFKQVLMRSNITSEFSGGVLWCSNGTVALRRV----------- 743

Query: 220 FSRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                       D+  + +EGC+S++YY++++LLY+QYAI+
Sbjct: 744 ----------------------------DTGRVTIEGCISEDYYKIRELLYEQYAII 772



 Score = 53.5 bits (127), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 23/36 (63%), Positives = 27/36 (75%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
           +VVL S+ DME GFSR+LF QW     NSIIITNR+
Sbjct: 324 KVVLASSADMESGFSRELFIQWAPQATNSIIITNRS 359


>gi|357610700|gb|EHJ67102.1| putative cleavage and polyadenylation specificity factor 100 kDa
           subunit [Danaus plexippus]
          Length = 818

 Score =  174 bits (441), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 80/154 (51%), Positives = 113/154 (73%), Gaps = 2/154 (1%)

Query: 24  LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTP 81
           + +++ E AK  IEWMSDKL ++FEGAR+NPF  +H++LCHS+ E+ + P PKVVL S P
Sbjct: 272 VSYNVVEFAKSQIEWMSDKLTRAFEGARSNPFALRHLQLCHSVVEVTRTPGPKVVLASFP 331

Query: 82  DMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEE 141
           D+E GF+RDLF QW  + +NSI++T RTSPGTLARDLIE GG+RT+ L V++R+RLEG E
Sbjct: 332 DLETGFARDLFLQWAPNSQNSIVLTARTSPGTLARDLIEKGGDRTIELTVRRRVRLEGAE 391

Query: 142 LEEYQKKKDKEAKDKQEKEKIPPHDTSFINELQL 175
           LEE+ +++ K     +E+      D+    EL++
Sbjct: 392 LEEFMQQRVKVNNSVKEETGGISSDSESEGELEM 425



 Score = 71.6 bits (174), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 36/113 (31%), Positives = 55/113 (48%), Gaps = 39/113 (34%)

Query: 164 PHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGFSRD 223
           PH   F+N ++LS+ +  L RNG+  EF  G L CC GT+A+RR+               
Sbjct: 652 PHGAWFVNSVRLSELRAALARNGLGAEFSAGALECCNGTIAIRRL--------------- 696

Query: 224 LFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                   ++  + LEG LS+EY++V++LLYDQ+AI+
Sbjct: 697 ------------------------ENGRVALEGVLSEEYFKVRELLYDQFAIV 725



 Score = 51.6 bits (122), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 25/52 (48%), Positives = 31/52 (59%), Gaps = 5/52 (9%)

Query: 196 LICCRGTVAVRR-----VVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
           L  C   V V R     VVL S PD+E GF+RDLF QW  + +NSI++T R 
Sbjct: 308 LQLCHSVVEVTRTPGPKVVLASFPDLETGFARDLFLQWAPNSQNSIVLTART 359


>gi|193676458|ref|XP_001951701.1| PREDICTED: probable cleavage and polyadenylation specificity factor
           subunit 2-like [Acyrthosiphon pisum]
          Length = 729

 Score =  173 bits (439), Expect = 6e-41,   Method: Compositional matrix adjust.
 Identities = 87/143 (60%), Positives = 107/143 (74%), Gaps = 5/143 (3%)

Query: 24  LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTP 81
           + ++  E AK  IEWMSDKLMKSFEGARNNPF FKHVKLCH++ +L KV  PKVVL S  
Sbjct: 272 VSYNTVEFAKSQIEWMSDKLMKSFEGARNNPFFFKHVKLCHNMNDLNKVSEPKVVLASNG 331

Query: 82  DMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEE 141
           D+E GFSR++F  W S+ +NSII+T+RT+PGTLARDLI+ GG+R + L VKKR+ L+  E
Sbjct: 332 DLESGFSREVFIMWASNSKNSIILTDRTAPGTLARDLIDEGGDRNIKLIVKKRVPLDDNE 391

Query: 142 LEEYQKKKDKEAKDKQEKEKIPP 164
           LEEY  K D+E   K E  KI P
Sbjct: 392 LEEYNIKHDEE---KMEASKIDP 411



 Score = 65.9 bits (159), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 39/118 (33%), Positives = 53/118 (44%), Gaps = 39/118 (33%)

Query: 159 KEKIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMEC 218
           + +I P  T FIN L+LSDFKQ L +N I CE   GVL CC  TV VRR           
Sbjct: 651 EHEIHPRKTVFINRLKLSDFKQILSKNNIPCELSKGVLWCCNRTVCVRR----------- 699

Query: 219 GFSRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                       + S  +++EG +S +YY ++ LLY Q+ I+
Sbjct: 700 ----------------------------NSSGKVLMEGIISRQYYYIRSLLYSQFIII 729



 Score = 46.2 bits (108), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 20/39 (51%), Positives = 29/39 (74%)

Query: 203 VAVRRVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 241
           V+  +VVL S  D+E GFSR++F  W S+ +NSII+T+R
Sbjct: 320 VSEPKVVLASNGDLESGFSREVFIMWASNSKNSIILTDR 358


>gi|223648270|gb|ACN10893.1| Cleavage and polyadenylation specificity factor subunit 2 [Salmo
           salar]
          Length = 796

 Score =  173 bits (438), Expect = 9e-41,   Method: Compositional matrix adjust.
 Identities = 82/139 (58%), Positives = 107/139 (76%), Gaps = 2/139 (1%)

Query: 32  AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
           +++EWMSDKLM+ FE  RNNPF F+H+ LCHSLA+LA+VPSPKVVL S PD+E GFSR+L
Sbjct: 282 SQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHSLADLARVPSPKVVLCSQPDLESGFSREL 341

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKK-- 149
           F QWC   +NS+I+T RT+PGTL R LI+  G + L L+++KR++LEG ELEEY +K+  
Sbjct: 342 FIQWCQDAKNSVILTYRTTPGTLGRYLIDNPGEKMLDLEIRKRVKLEGRELEEYLEKERM 401

Query: 150 DKEAKDKQEKEKIPPHDTS 168
            KEA  K E+EK    D+S
Sbjct: 402 KKEAAKKLEQEKEVDVDSS 420



 Score = 73.9 bits (180), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 43/116 (37%), Positives = 58/116 (50%), Gaps = 40/116 (34%)

Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
           +IP H + FINE +LSDFKQ L R GI  EF+ GVL+ C   VAVRR             
Sbjct: 721 EIPGHQSVFINEPRLSDFKQVLLREGIQAEFVGGVLV-CNNMVAVRR------------- 766

Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                     +++  I LEGCL D+YY++++LLY QYA++
Sbjct: 767 --------------------------TEAGRIGLEGCLCDDYYKIRELLYQQYAVV 796



 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 21/36 (58%), Positives = 27/36 (75%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
           +VVL S PD+E GFSR+LF QWC   +NS+I+T R 
Sbjct: 324 KVVLCSQPDLESGFSRELFIQWCQDAKNSVILTYRT 359


>gi|213514628|ref|NP_001134023.1| cleavage and polyadenylation specificity factor subunit 2 [Salmo
           salar]
 gi|209156194|gb|ACI34329.1| Cleavage and polyadenylation specificity factor subunit 2 [Salmo
           salar]
          Length = 796

 Score =  172 bits (437), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 82/139 (58%), Positives = 107/139 (76%), Gaps = 2/139 (1%)

Query: 32  AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
           +++EWMSDKLM+ FE  RNNPF F+H+ LCHSLA+LA+VPSPKVVL S PD+E GFSR+L
Sbjct: 282 SQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHSLADLARVPSPKVVLCSQPDLESGFSREL 341

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKK-- 149
           F QWC   +NS+I+T RT+PGTL R LI+  G + L L+++KR++LEG ELEEY +K+  
Sbjct: 342 FIQWCQEAKNSVILTYRTTPGTLGRYLIDNPGEKMLDLEIRKRVKLEGRELEEYLEKERM 401

Query: 150 DKEAKDKQEKEKIPPHDTS 168
            KEA  K E+EK    D+S
Sbjct: 402 KKEAAKKLEQEKEVDVDSS 420



 Score = 73.9 bits (180), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 43/116 (37%), Positives = 58/116 (50%), Gaps = 40/116 (34%)

Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
           +IP H + FINE +LSDFKQ L R GI  EF+ GVL+ C   VAVRR             
Sbjct: 721 EIPGHQSVFINEPRLSDFKQVLLREGIQAEFVGGVLV-CNNIVAVRR------------- 766

Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                     +++  I LEGCL D+YY++++LLY QYA++
Sbjct: 767 --------------------------TEAGRIGLEGCLCDDYYKIRELLYQQYAVV 796



 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 21/36 (58%), Positives = 27/36 (75%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
           +VVL S PD+E GFSR+LF QWC   +NS+I+T R 
Sbjct: 324 KVVLCSQPDLESGFSRELFIQWCQEAKNSVILTYRT 359


>gi|443725188|gb|ELU12868.1| hypothetical protein CAPTEDRAFT_155355 [Capitella teleta]
          Length = 728

 Score =  172 bits (435), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 81/156 (51%), Positives = 115/156 (73%), Gaps = 11/156 (7%)

Query: 24  LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTP 81
           + +++ E AK  +EWMSDK+M+SFE  RNNPF FKH++LCHS+AELAKVP PKVVL STP
Sbjct: 272 VAYNVVEFAKSQVEWMSDKIMRSFEERRNNPFQFKHLQLCHSMAELAKVPEPKVVLASTP 331

Query: 82  DMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEE 141
           D++ GFSR+LF QWCS+P+N II+TNRT+P TL R LI+     ++ L+VK+R+RLEG  
Sbjct: 332 DLQTGFSRELFVQWCSNPKNCIILTNRTAPPTLCRQLIDYPNRGSVRLEVKRRVRLEGRA 391

Query: 142 LEEYQK---------KKDKEAKDKQEKEKIPPHDTS 168
           LE++ +         +++K  K+++E+E +   D S
Sbjct: 392 LEDFLRAERERKAEVEREKAEKERREREGLESSDDS 427



 Score = 68.9 bits (167), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 46/140 (32%), Positives = 66/140 (47%), Gaps = 41/140 (29%)

Query: 138 EGEELEEYQKKKDKEAKDKQ-EKEKIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVL 196
           + E +E+ +  +D+EA        +I PH   F+NE +LSDFKQTL   G+  EF  GVL
Sbjct: 629 QAEAMEDGENPEDEEAVPVVIPTSQIRPHGAVFVNEPKLSDFKQTLVNLGVQAEFSAGVL 688

Query: 197 ICCRGTVAVRRVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEG 256
           IC                                   NS++   +N A       + LEG
Sbjct: 689 IC-----------------------------------NSVVAVRKNEA-----GRLQLEG 708

Query: 257 CLSDEYYRVQQLLYDQYAIL 276
            L D+YYR++QLLY+Q+AI+
Sbjct: 709 TLCDDYYRIRQLLYEQFAIV 728



 Score = 60.5 bits (145), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 24/36 (66%), Positives = 31/36 (86%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
           +VVL STPD++ GFSR+LF QWCS+P+N II+TNR 
Sbjct: 324 KVVLASTPDLQTGFSRELFVQWCSNPKNCIILTNRT 359


>gi|50539828|ref|NP_001002384.1| cleavage and polyadenylation specificity factor subunit 2 [Danio
           rerio]
 gi|49903850|gb|AAH76029.1| Cleavage and polyadenylation specific factor 2 [Danio rerio]
          Length = 790

 Score =  170 bits (430), Expect = 7e-40,   Method: Compositional matrix adjust.
 Identities = 81/139 (58%), Positives = 106/139 (76%), Gaps = 2/139 (1%)

Query: 32  AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
           +++EWMSDKLM+ FE  RNNPF F+H+ LCHSL++LA+VPSPKVVL S PD+E GFSR+L
Sbjct: 282 SQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHSLSDLARVPSPKVVLCSQPDLESGFSREL 341

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKK-- 149
           F QWC   +NS+I+T RT+PGTLAR LI+  G + + L+++KR RLEG ELEEY +K+  
Sbjct: 342 FIQWCQDAKNSVILTYRTTPGTLARYLIDNPGEKRIELEIRKRCRLEGRELEEYMEKERM 401

Query: 150 DKEAKDKQEKEKIPPHDTS 168
            KEA  K E+ K    D+S
Sbjct: 402 KKEAAKKLEQAKEVDLDSS 420



 Score = 72.8 bits (177), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 42/116 (36%), Positives = 58/116 (50%), Gaps = 40/116 (34%)

Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
           ++P H + FINE +LSDFKQ L R GI  EF+ GVL+ C   VAVRR             
Sbjct: 715 EVPGHQSVFINEPRLSDFKQVLLREGIQAEFVGGVLV-CNNLVAVRR------------- 760

Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                     +++  I LEGC  D+YYR+++LLY+QYA++
Sbjct: 761 --------------------------TEAGRICLEGCHCDDYYRIRELLYEQYAVV 790



 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 21/36 (58%), Positives = 27/36 (75%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
           +VVL S PD+E GFSR+LF QWC   +NS+I+T R 
Sbjct: 324 KVVLCSQPDLESGFSRELFIQWCQDAKNSVILTYRT 359


>gi|47224566|emb|CAG03550.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 765

 Score =  169 bits (429), Expect = 9e-40,   Method: Compositional matrix adjust.
 Identities = 82/132 (62%), Positives = 102/132 (77%), Gaps = 2/132 (1%)

Query: 32  AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
           +++EWMSDKLM+ FE  RNNPF F+H+ LCHSLA+LA+VPSPKVVL S PD+E GFSR+L
Sbjct: 258 SQVEWMSDKLMRCFEDKRNNPFQFRHLTLCHSLADLARVPSPKVVLCSQPDLESGFSREL 317

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKK-- 149
           F QW    +NSII+T RT+PGTLAR LI+  G + L L+V+KR+RLEG ELEEY +K   
Sbjct: 318 FIQWSKDSKNSIILTYRTTPGTLARYLIDNPGEKHLDLEVRKRVRLEGRELEEYLEKDRI 377

Query: 150 DKEAKDKQEKEK 161
            KEA  K E+ K
Sbjct: 378 KKEAAKKLEQAK 389



 Score = 72.4 bits (176), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 41/116 (35%), Positives = 58/116 (50%), Gaps = 40/116 (34%)

Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
           ++P H + FINE +LSDFKQ L R GI  EF+ GVL+ C   VAVRR             
Sbjct: 690 EVPGHQSVFINEPRLSDFKQVLLREGIQAEFVGGVLV-CNNMVAVRR------------- 735

Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                     +++  I LEGCL ++YY++++LLY QYA++
Sbjct: 736 --------------------------TEAGRISLEGCLCEDYYKIRELLYQQYAVV 765



 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 21/36 (58%), Positives = 26/36 (72%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
           +VVL S PD+E GFSR+LF QW    +NSII+T R 
Sbjct: 300 KVVLCSQPDLESGFSRELFIQWSKDSKNSIILTYRT 335


>gi|74194185|dbj|BAE24650.1| unnamed protein product [Mus musculus]
          Length = 396

 Score =  167 bits (423), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 72/114 (63%), Positives = 94/114 (82%)

Query: 32  AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
           +++EWMSDKLM+ FE  RNNPF F+H+ LCH L++LA+VPSPKVVL S PD+ECGFSRDL
Sbjct: 282 SQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDL 341

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEY 145
           F QWC  P+NSII+T RT+PGTLAR LI+    +   ++++KR++LEG+ELEEY
Sbjct: 342 FIQWCQDPKNSIILTYRTTPGTLARFLIDNPTEKVTEIELRKRVKLEGKELEEY 395



 Score = 62.0 bits (149), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 25/36 (69%), Positives = 29/36 (80%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
           +VVL S PD+ECGFSRDLF QWC  P+NSII+T R 
Sbjct: 324 KVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 359


>gi|410916717|ref|XP_003971833.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           2-like [Takifugu rubripes]
          Length = 787

 Score =  167 bits (422), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 80/132 (60%), Positives = 102/132 (77%), Gaps = 2/132 (1%)

Query: 32  AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
           +++EWMSDKLM+ FE  RNNPF F+H+ LCHSLA+LA+VPSPKVVL S PD+E GFSR+L
Sbjct: 282 SQVEWMSDKLMRCFEDKRNNPFQFRHLTLCHSLADLARVPSPKVVLCSQPDLESGFSREL 341

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKK-- 149
           F QW    +NSII+T RT+PGTL+R LI+  G + L L+V+KR++LEG ELEEY +K   
Sbjct: 342 FIQWSKDAKNSIILTYRTTPGTLSRYLIDNPGEKHLDLEVRKRVKLEGRELEEYLEKDRV 401

Query: 150 DKEAKDKQEKEK 161
            KEA  K E+ K
Sbjct: 402 KKEAAKKLEQAK 413



 Score = 71.6 bits (174), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 41/116 (35%), Positives = 58/116 (50%), Gaps = 40/116 (34%)

Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
           ++P H + FINE +LSDFKQ L R GI  EF+ GVL+ C   VAVRR             
Sbjct: 712 EVPGHQSVFINEPRLSDFKQVLLREGIQAEFVGGVLV-CNNMVAVRR------------- 757

Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                     +++  I LEGCL ++YY++++LLY QYA++
Sbjct: 758 --------------------------TEAGRIGLEGCLCEDYYKIRELLYQQYAVV 787



 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 21/36 (58%), Positives = 26/36 (72%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
           +VVL S PD+E GFSR+LF QW    +NSII+T R 
Sbjct: 324 KVVLCSQPDLESGFSRELFIQWSKDAKNSIILTYRT 359


>gi|328722057|ref|XP_001949295.2| PREDICTED: probable cleavage and polyadenylation specificity factor
           subunit 2-like [Acyrthosiphon pisum]
          Length = 724

 Score =  167 bits (422), Expect = 6e-39,   Method: Compositional matrix adjust.
 Identities = 84/143 (58%), Positives = 106/143 (74%), Gaps = 5/143 (3%)

Query: 24  LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTP 81
           + ++  E AK  IEWMSDKLMKSFEGARNNPF FKHVKLCH++ +L KV  PKVVL S  
Sbjct: 272 VSYNTVEFAKSQIEWMSDKLMKSFEGARNNPFIFKHVKLCHNMNDLKKVSEPKVVLASHG 331

Query: 82  DMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEE 141
           D+E GFSR++F  W S+P+NSII+T+R +PGTLAR+LI+ G +R + L VKKR+ L+  E
Sbjct: 332 DLENGFSREVFIMWASNPKNSIILTDRAAPGTLARNLIDGGSDRNIKLIVKKRVPLDENE 391

Query: 142 LEEYQKKKDKEAKDKQEKEKIPP 164
           LEEY  K +KE   K E  K+ P
Sbjct: 392 LEEYNIKYEKE---KMEGSKMDP 411



 Score = 66.6 bits (161), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 39/116 (33%), Positives = 52/116 (44%), Gaps = 39/116 (33%)

Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
           +I PH T FIN L+LSDFKQ L +  I CE   GVL CC  TV VRR             
Sbjct: 648 EIHPHKTVFINRLKLSDFKQILSKKNIPCELSKGVLWCCNRTVCVRR------------- 694

Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                     + S  +++EG +S +YY ++ LLY Q+ I+
Sbjct: 695 --------------------------NSSGKVLMEGIISRQYYYIRSLLYSQFIII 724



 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 23/50 (46%), Positives = 33/50 (66%)

Query: 203 VAVRRVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRNRADDSDSNVI 252
           V+  +VVL S  D+E GFSR++F  W S+P+NSII+T+R        N+I
Sbjct: 320 VSEPKVVLASHGDLENGFSREVFIMWASNPKNSIILTDRAAPGTLARNLI 369


>gi|390333491|ref|XP_780045.3| PREDICTED: cleavage and polyadenylation specificity factor subunit
           2 isoform 1 [Strongylocentrotus purpuratus]
          Length = 773

 Score =  166 bits (419), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 80/136 (58%), Positives = 104/136 (76%), Gaps = 3/136 (2%)

Query: 24  LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTP 81
           + +++ E AK  +EWMSDK+M++FE  RNNPF FKH+KLCH+L ELAKVP PKVVL S P
Sbjct: 272 VSYNVVEFAKSQVEWMSDKVMRAFEDRRNNPFQFKHLKLCHNLKELAKVPDPKVVLASVP 331

Query: 82  DMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEE 141
           D+ECG+SR+LF QW    +NS+I+TNRTS GTLAR LIE      L L+V KR++LE EE
Sbjct: 332 DLECGYSRELFIQWSGDAKNSVILTNRTSHGTLARRLIETPNPNQLKLRVSKRVKLEKEE 391

Query: 142 LEEYQ-KKKDKEAKDK 156
           L+EY+  +K+KE + K
Sbjct: 392 LDEYRIHEKEKERQRK 407



 Score = 61.2 bits (147), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 36/122 (29%), Positives = 55/122 (45%), Gaps = 40/122 (32%)

Query: 155 DKQEKEKIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTP 214
           D  +  ++ PH   ++N  +  DFKQ L +NGI  EF  GVL+ C  TVA++R       
Sbjct: 692 DAVQTNEVTPHRQVYVNPPRFLDFKQVLAKNGIRAEFTGGVLV-CNNTVAIKR------- 743

Query: 215 DMECGFSRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYA 274
                                           ++   + LEG + D+YY V++LLY+QYA
Sbjct: 744 --------------------------------NEKGHLTLEGAVCDDYYTVRELLYEQYA 771

Query: 275 IL 276
           I+
Sbjct: 772 IV 773



 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 21/35 (60%), Positives = 28/35 (80%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 241
           +VVL S PD+ECG+SR+LF QW    +NS+I+TNR
Sbjct: 324 KVVLASVPDLECGYSRELFIQWSGDAKNSVILTNR 358


>gi|260822471|ref|XP_002606625.1| hypothetical protein BRAFLDRAFT_209615 [Branchiostoma floridae]
 gi|229291969|gb|EEN62635.1| hypothetical protein BRAFLDRAFT_209615 [Branchiostoma floridae]
          Length = 607

 Score =  165 bits (418), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 74/124 (59%), Positives = 96/124 (77%), Gaps = 2/124 (1%)

Query: 24  LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTP 81
           + +++ E AK  +EWMSDK+M+ FE  RNNPF FKH+KLCHSL+EL KVP PKVVL S P
Sbjct: 89  VAYNVVEFAKSQVEWMSDKIMRVFEDNRNNPFQFKHLKLCHSLSELHKVPDPKVVLASVP 148

Query: 82  DMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEE 141
           D+E GFSR+LF QWC + +N++++T+R  PGTL R LI+    +T TLQ +KR+RLEG E
Sbjct: 149 DLESGFSRELFVQWCQNQKNTVVLTSRPGPGTLGRMLIDNPKMKTFTLQARKRVRLEGPE 208

Query: 142 LEEY 145
           LEEY
Sbjct: 209 LEEY 212



 Score = 70.5 bits (171), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 40/116 (34%), Positives = 58/116 (50%), Gaps = 40/116 (34%)

Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
           +IP H+  FIN+ +LSD KQ LQ+ GI  EF  GVLIC                      
Sbjct: 532 QIPGHEAVFINKPRLSDIKQVLQKEGIQAEFSGGVLIC---------------------- 569

Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                        N+++   RN     +S  I +EGC+ ++YY+V++LLY+QYAI+
Sbjct: 570 -------------NNVVALKRN-----ESGRIGMEGCICEDYYKVRKLLYEQYAIV 607



 Score = 53.1 bits (126), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 19/35 (54%), Positives = 29/35 (82%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 241
           +VVL S PD+E GFSR+LF QWC + +N++++T+R
Sbjct: 141 KVVLASVPDLESGFSRELFVQWCQNQKNTVVLTSR 175


>gi|195145330|ref|XP_002013649.1| GL24248 [Drosophila persimilis]
 gi|194102592|gb|EDW24635.1| GL24248 [Drosophila persimilis]
          Length = 583

 Score =  165 bits (417), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 77/126 (61%), Positives = 99/126 (78%), Gaps = 4/126 (3%)

Query: 24  LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPS-PKVVLVST 80
           + +++ E AK  IEWMSDKL K+FEGARNNPF FKH++LCH+LA++ K+P+ PKVVL ST
Sbjct: 98  VSYNVVEFAKSQIEWMSDKLTKAFEGARNNPFQFKHIQLCHTLADVYKLPAGPKVVLAST 157

Query: 81  PDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIE-LGGNRTLTLQVKKRIRLEG 139
           PD+E GF+RDLF QW S+  NSII+T RTSPGTLA +L+E     R + L V++R+ LEG
Sbjct: 158 PDLESGFTRDLFIQWASNANNSIILTTRTSPGTLAMELVENYAPGRQIELDVRRRVELEG 217

Query: 140 EELEEY 145
            ELEEY
Sbjct: 218 AELEEY 223



 Score = 87.8 bits (216), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 46/119 (38%), Positives = 65/119 (54%), Gaps = 39/119 (32%)

Query: 158 EKEKIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDME 217
           E+++IP H++  INEL+LSDFKQ L RN I+ EF  GVL C  GT+A+RRV         
Sbjct: 504 EEDEIPVHNSVLINELKLSDFKQILLRNNINSEFSGGVLWCTNGTLALRRV--------- 554

Query: 218 CGFSRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                         D+  + +EGCLS+EYY++++LLY+QYAI+
Sbjct: 555 ------------------------------DAGKVAMEGCLSEEYYKIRELLYEQYAIV 583



 Score = 57.0 bits (136), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 23/35 (65%), Positives = 28/35 (80%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 241
           +VVL STPD+E GF+RDLF QW S+  NSII+T R
Sbjct: 151 KVVLASTPDLESGFTRDLFIQWASNANNSIILTTR 185


>gi|195449222|ref|XP_002071979.1| GK22564 [Drosophila willistoni]
 gi|194168064|gb|EDW82965.1| GK22564 [Drosophila willistoni]
          Length = 757

 Score =  163 bits (413), Expect = 6e-38,   Method: Compositional matrix adjust.
 Identities = 78/130 (60%), Positives = 100/130 (76%), Gaps = 4/130 (3%)

Query: 24  LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPS-PKVVLVST 80
           + +++ E AK  IEWMSDKL K+FEGARNNPF FKH+ LCHSLA++ K+P+ PKVVL ST
Sbjct: 272 VSYNVIEFAKSQIEWMSDKLTKAFEGARNNPFQFKHINLCHSLADVFKLPAGPKVVLAST 331

Query: 81  PDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELG-GNRTLTLQVKKRIRLEG 139
           PDME GF+RDLF QW ++P NSII T RTSPG+LA +L+E     R + L V++R+ LEG
Sbjct: 332 PDMESGFTRDLFVQWAANPNNSIIFTTRTSPGSLAMELVENAVPGRKIELDVRRRVELEG 391

Query: 140 EELEEYQKKK 149
            ELEEY + +
Sbjct: 392 PELEEYLRTQ 401



 Score = 82.4 bits (202), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 43/119 (36%), Positives = 64/119 (53%), Gaps = 39/119 (32%)

Query: 158 EKEKIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDME 217
           E+++IP H++  INEL+LSDFKQ L RN I+ EF  GVL C   T+A+RR+         
Sbjct: 678 EEDEIPVHNSVLINELKLSDFKQILMRNNINSEFSGGVLWCSNNTLALRRI--------- 728

Query: 218 CGFSRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                         D+  + +EGCLS++YY++++LLY+QYAI+
Sbjct: 729 ------------------------------DAGKVSMEGCLSEDYYKIRELLYEQYAIV 757



 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 24/35 (68%), Positives = 28/35 (80%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 241
           +VVL STPDME GF+RDLF QW ++P NSII T R
Sbjct: 325 KVVLASTPDMESGFTRDLFVQWAANPNNSIIFTTR 359


>gi|198452192|ref|XP_002137430.1| GA26549 [Drosophila pseudoobscura pseudoobscura]
 gi|198131825|gb|EDY67988.1| GA26549 [Drosophila pseudoobscura pseudoobscura]
          Length = 757

 Score =  163 bits (412), Expect = 8e-38,   Method: Compositional matrix adjust.
 Identities = 76/126 (60%), Positives = 98/126 (77%), Gaps = 4/126 (3%)

Query: 24  LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPS-PKVVLVST 80
           + +++ E AK  IEWMSDKL K+FEGARNNPF FKH++LCH+LA++ K+P+ PKVVL ST
Sbjct: 272 VSYNVVEFAKSQIEWMSDKLTKAFEGARNNPFQFKHIQLCHTLADVYKLPAGPKVVLAST 331

Query: 81  PDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIE-LGGNRTLTLQVKKRIRLEG 139
           PD+E GF+RDLF QW  +  NSII+T RTSPGTLA +L+E     R + L V++R+ LEG
Sbjct: 332 PDLESGFTRDLFIQWAGNANNSIILTTRTSPGTLAMELVENYAPGRQIELDVRRRVELEG 391

Query: 140 EELEEY 145
            ELEEY
Sbjct: 392 AELEEY 397



 Score = 55.5 bits (132), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 22/35 (62%), Positives = 27/35 (77%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 241
           +VVL STPD+E GF+RDLF QW  +  NSII+T R
Sbjct: 325 KVVLASTPDLESGFTRDLFIQWAGNANNSIILTTR 359



 Score = 45.4 bits (106), Expect = 0.027,   Method: Compositional matrix adjust.
 Identities = 16/29 (55%), Positives = 26/29 (89%)

Query: 248 DSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
           D+  + +EGCLS+EYY++++LLY+QYAI+
Sbjct: 729 DAGKVAMEGCLSEEYYKIRELLYEQYAIV 757


>gi|24650920|ref|NP_733264.1| cleavage and polyadenylation specificity factor 100, isoform B
           [Drosophila melanogaster]
 gi|23172526|gb|AAN14148.1| cleavage and polyadenylation specificity factor 100, isoform B
           [Drosophila melanogaster]
          Length = 664

 Score =  162 bits (410), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 77/126 (61%), Positives = 99/126 (78%), Gaps = 4/126 (3%)

Query: 24  LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPS-PKVVLVST 80
           + +++ E AK  IEWMSDKL K+FEGARNNPF FKH++LCHSLA++ K+P+ PKVVL ST
Sbjct: 180 VSYNVIEFAKSQIEWMSDKLTKAFEGARNNPFQFKHIQLCHSLADVYKLPAGPKVVLAST 239

Query: 81  PDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIE-LGGNRTLTLQVKKRIRLEG 139
           PD+E GF+RDLF QW S+  NSII+T RTSPGTLA +L+E     + + L V++R+ LEG
Sbjct: 240 PDLESGFTRDLFVQWASNANNSIILTTRTSPGTLAMELVENCAPGKQIELDVRRRVDLEG 299

Query: 140 EELEEY 145
            ELEEY
Sbjct: 300 AELEEY 305



 Score = 88.2 bits (217), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 46/117 (39%), Positives = 64/117 (54%), Gaps = 39/117 (33%)

Query: 160 EKIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECG 219
           ++IP H++  INEL+LSDFKQTL RN I+ EF  GVL C  GT+A+RRV           
Sbjct: 587 DEIPIHNSVLINELKLSDFKQTLMRNNINSEFSGGVLWCSNGTLALRRV----------- 635

Query: 220 FSRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                       D+  + +EGCLS+EYY++++LLY+QYAI+
Sbjct: 636 ----------------------------DAGKVAMEGCLSEEYYKIRELLYEQYAIV 664



 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 23/35 (65%), Positives = 28/35 (80%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 241
           +VVL STPD+E GF+RDLF QW S+  NSII+T R
Sbjct: 233 KVVLASTPDLESGFTRDLFVQWASNANNSIILTTR 267


>gi|194906654|ref|XP_001981406.1| GG11633 [Drosophila erecta]
 gi|190656044|gb|EDV53276.1| GG11633 [Drosophila erecta]
          Length = 756

 Score =  162 bits (410), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 77/130 (59%), Positives = 101/130 (77%), Gaps = 4/130 (3%)

Query: 24  LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPS-PKVVLVST 80
           + +++ E AK  IEWMSDKL K+FEGARNNPF FKH++LCHSLA++ K+P+ PKVVL ST
Sbjct: 272 VSYNVIEFAKSQIEWMSDKLTKAFEGARNNPFQFKHIQLCHSLADVYKLPAGPKVVLAST 331

Query: 81  PDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIE-LGGNRTLTLQVKKRIRLEG 139
           PD+E GF+RDLF QW S+  NSII+T RTSPGTLA +L+E     + + L V++R+ LEG
Sbjct: 332 PDLESGFTRDLFVQWASNANNSIILTTRTSPGTLAMELVENCAPGKQIELDVRRRVELEG 391

Query: 140 EELEEYQKKK 149
            ELEEY + +
Sbjct: 392 AELEEYLRTQ 401



 Score = 85.5 bits (210), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 45/119 (37%), Positives = 64/119 (53%), Gaps = 39/119 (32%)

Query: 158 EKEKIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDME 217
           + ++IP H++  INEL+LSDFKQ L RN I+ EF  GVL C  GT+A+RRV         
Sbjct: 677 DDDEIPIHNSVLINELKLSDFKQILMRNNINSEFSGGVLWCSNGTLALRRV--------- 727

Query: 218 CGFSRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                         D+  + +EGCLS+EYY++++LLY+QYAI+
Sbjct: 728 ------------------------------DAGKVAMEGCLSEEYYKIRELLYEQYAIV 756



 Score = 56.2 bits (134), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 23/35 (65%), Positives = 28/35 (80%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 241
           +VVL STPD+E GF+RDLF QW S+  NSII+T R
Sbjct: 325 KVVLASTPDLESGFTRDLFVQWASNANNSIILTTR 359


>gi|195341087|ref|XP_002037143.1| GM12754 [Drosophila sechellia]
 gi|194131259|gb|EDW53302.1| GM12754 [Drosophila sechellia]
          Length = 743

 Score =  162 bits (410), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 77/130 (59%), Positives = 101/130 (77%), Gaps = 4/130 (3%)

Query: 24  LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPS-PKVVLVST 80
           + +++ E AK  IEWMSDKL K+FEGARNNPF FKH++LCHSLA++ K+P+ PKVVL ST
Sbjct: 272 VSYNVIEFAKSQIEWMSDKLTKAFEGARNNPFQFKHIQLCHSLADVYKLPAGPKVVLAST 331

Query: 81  PDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIE-LGGNRTLTLQVKKRIRLEG 139
           PD+E GF+RDLF QW S+  NSII+T RTSPGTLA +L+E     + + L V++R+ LEG
Sbjct: 332 PDLESGFTRDLFVQWASNANNSIILTTRTSPGTLAMELVENCAPGKQIELDVRRRVDLEG 391

Query: 140 EELEEYQKKK 149
            ELEEY + +
Sbjct: 392 AELEEYLRTQ 401



 Score = 87.0 bits (214), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 46/117 (39%), Positives = 64/117 (54%), Gaps = 39/117 (33%)

Query: 160 EKIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECG 219
           ++IP H++  INEL+LSDFKQTL RN I+ EF  GVL C  GT+A+RRV           
Sbjct: 666 DEIPIHNSVLINELKLSDFKQTLLRNNINSEFSGGVLWCSNGTLALRRV----------- 714

Query: 220 FSRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                       D+  + +EGCLS+EYY++++LLY+QYAI+
Sbjct: 715 ----------------------------DAGKVAMEGCLSEEYYKIRELLYEQYAIV 743



 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 23/35 (65%), Positives = 28/35 (80%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 241
           +VVL STPD+E GF+RDLF QW S+  NSII+T R
Sbjct: 325 KVVLASTPDLESGFTRDLFVQWASNANNSIILTTR 359


>gi|195503417|ref|XP_002098643.1| GE26465, isoform A [Drosophila yakuba]
 gi|194184744|gb|EDW98355.1| GE26465, isoform A [Drosophila yakuba]
          Length = 756

 Score =  162 bits (410), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 77/130 (59%), Positives = 101/130 (77%), Gaps = 4/130 (3%)

Query: 24  LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPS-PKVVLVST 80
           + +++ E AK  IEWMSDKL K+FEGARNNPF FKH++LCHSLA++ K+P+ PKVVL ST
Sbjct: 272 VSYNVIEFAKSQIEWMSDKLTKAFEGARNNPFQFKHIQLCHSLADVYKLPAGPKVVLAST 331

Query: 81  PDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIE-LGGNRTLTLQVKKRIRLEG 139
           PD+E GF+RDLF QW S+  NSII+T RTSPGTLA +L+E     + + L V++R+ LEG
Sbjct: 332 PDLESGFTRDLFVQWASNANNSIILTTRTSPGTLAMELVENCAPGKQIELDVRRRVELEG 391

Query: 140 EELEEYQKKK 149
            ELEEY + +
Sbjct: 392 AELEEYLRTQ 401



 Score = 85.5 bits (210), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 45/117 (38%), Positives = 63/117 (53%), Gaps = 39/117 (33%)

Query: 160 EKIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECG 219
           ++IP H++  INEL+LSDFKQ L RN I+ EF  GVL C  GT+A+RRV           
Sbjct: 679 DEIPIHNSVLINELKLSDFKQILMRNNINSEFSGGVLWCSNGTLALRRV----------- 727

Query: 220 FSRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                       D+  + +EGCLS+EYY++++LLY+QYAI+
Sbjct: 728 ----------------------------DAGKVAMEGCLSEEYYKIRELLYEQYAIV 756



 Score = 56.2 bits (134), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 23/35 (65%), Positives = 28/35 (80%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 241
           +VVL STPD+E GF+RDLF QW S+  NSII+T R
Sbjct: 325 KVVLASTPDLESGFTRDLFVQWASNANNSIILTTR 359


>gi|157822735|ref|NP_001100223.1| cleavage and polyadenylation specificity factor subunit 2 [Rattus
           norvegicus]
 gi|149025374|gb|EDL81741.1| cleavage and polyadenylation specific factor 2 (predicted) [Rattus
           norvegicus]
          Length = 782

 Score =  162 bits (409), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 69/111 (62%), Positives = 91/111 (81%)

Query: 32  AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
           +++EWMSDKLM+ FE  RNNPF F+H+ LCH L++LA+VPSPKVVL S PD+ECGFSRDL
Sbjct: 282 SQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDL 341

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEEL 142
           F QWC  P+NSII+T RT+PGTLAR LI+    +   ++++KR++LEG+EL
Sbjct: 342 FIQWCQDPKNSIILTYRTTPGTLARFLIDNPSEKVTEIELRKRVKLEGKEL 392



 Score = 69.7 bits (169), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 41/116 (35%), Positives = 57/116 (49%), Gaps = 40/116 (34%)

Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
           ++P H + F+NE +LSDFKQ L R GI  EF+ GVL+ C   VAVRR             
Sbjct: 707 EVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNQVAVRR------------- 752

Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                     +++  I LEGCL  ++YR++ LLY+QYAI+
Sbjct: 753 --------------------------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782



 Score = 62.4 bits (150), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 25/36 (69%), Positives = 29/36 (80%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
           +VVL S PD+ECGFSRDLF QWC  P+NSII+T R 
Sbjct: 324 KVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 359


>gi|354494117|ref|XP_003509185.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           2 [Cricetulus griseus]
          Length = 782

 Score =  162 bits (409), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 69/111 (62%), Positives = 91/111 (81%)

Query: 32  AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
           +++EWMSDKLM+ FE  RNNPF F+H+ LCH L++LA+VPSPKVVL S PD+ECGFSRDL
Sbjct: 282 SQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDL 341

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEEL 142
           F QWC  P+NSII+T RT+PGTLAR LI+    +   ++++KR++LEG+EL
Sbjct: 342 FIQWCQDPKNSIILTYRTTPGTLARFLIDNPSEKVTEIELRKRVKLEGKEL 392



 Score = 69.7 bits (169), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 41/116 (35%), Positives = 57/116 (49%), Gaps = 40/116 (34%)

Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
           ++P H + F+NE +LSDFKQ L R GI  EF+ GVL+ C   VAVRR             
Sbjct: 707 EVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNQVAVRR------------- 752

Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                     +++  I LEGCL  ++YR++ LLY+QYAI+
Sbjct: 753 --------------------------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782



 Score = 62.4 bits (150), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 25/36 (69%), Positives = 29/36 (80%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
           +VVL S PD+ECGFSRDLF QWC  P+NSII+T R 
Sbjct: 324 KVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 359


>gi|195503420|ref|XP_002098644.1| GE26465, isoform B [Drosophila yakuba]
 gi|194184745|gb|EDW98356.1| GE26465, isoform B [Drosophila yakuba]
          Length = 548

 Score =  162 bits (409), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 77/130 (59%), Positives = 101/130 (77%), Gaps = 4/130 (3%)

Query: 24  LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPS-PKVVLVST 80
           + +++ E AK  IEWMSDKL K+FEGARNNPF FKH++LCHSLA++ K+P+ PKVVL ST
Sbjct: 272 VSYNVIEFAKSQIEWMSDKLTKAFEGARNNPFQFKHIQLCHSLADVYKLPAGPKVVLAST 331

Query: 81  PDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIE-LGGNRTLTLQVKKRIRLEG 139
           PD+E GF+RDLF QW S+  NSII+T RTSPGTLA +L+E     + + L V++R+ LEG
Sbjct: 332 PDLESGFTRDLFVQWASNANNSIILTTRTSPGTLAMELVENCAPGKQIELDVRRRVELEG 391

Query: 140 EELEEYQKKK 149
            ELEEY + +
Sbjct: 392 AELEEYLRTQ 401



 Score = 56.2 bits (134), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 23/35 (65%), Positives = 28/35 (80%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 241
           +VVL STPD+E GF+RDLF QW S+  NSII+T R
Sbjct: 325 KVVLASTPDLESGFTRDLFVQWASNANNSIILTTR 359


>gi|21358013|ref|NP_651658.1| cleavage and polyadenylation specificity factor 100, isoform A
           [Drosophila melanogaster]
 gi|18203548|sp|Q9V3D6.1|CPSF2_DROME RecName: Full=Probable cleavage and polyadenylation specificity
           factor subunit 2; AltName: Full=Cleavage and
           polyadenylation specificity factor 100 kDa subunit;
           Short=CPSF 100 kDa subunit
 gi|5679134|gb|AAD46873.1|AF160933_1 LD14168p [Drosophila melanogaster]
 gi|7301732|gb|AAF56844.1| cleavage and polyadenylation specificity factor 100, isoform A
           [Drosophila melanogaster]
          Length = 756

 Score =  162 bits (409), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 77/130 (59%), Positives = 101/130 (77%), Gaps = 4/130 (3%)

Query: 24  LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPS-PKVVLVST 80
           + +++ E AK  IEWMSDKL K+FEGARNNPF FKH++LCHSLA++ K+P+ PKVVL ST
Sbjct: 272 VSYNVIEFAKSQIEWMSDKLTKAFEGARNNPFQFKHIQLCHSLADVYKLPAGPKVVLAST 331

Query: 81  PDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIE-LGGNRTLTLQVKKRIRLEG 139
           PD+E GF+RDLF QW S+  NSII+T RTSPGTLA +L+E     + + L V++R+ LEG
Sbjct: 332 PDLESGFTRDLFVQWASNANNSIILTTRTSPGTLAMELVENCAPGKQIELDVRRRVDLEG 391

Query: 140 EELEEYQKKK 149
            ELEEY + +
Sbjct: 392 AELEEYLRTQ 401



 Score = 87.8 bits (216), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 46/117 (39%), Positives = 64/117 (54%), Gaps = 39/117 (33%)

Query: 160 EKIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECG 219
           ++IP H++  INEL+LSDFKQTL RN I+ EF  GVL C  GT+A+RRV           
Sbjct: 679 DEIPIHNSVLINELKLSDFKQTLMRNNINSEFSGGVLWCSNGTLALRRV----------- 727

Query: 220 FSRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                       D+  + +EGCLS+EYY++++LLY+QYAI+
Sbjct: 728 ----------------------------DAGKVAMEGCLSEEYYKIRELLYEQYAIV 756



 Score = 56.2 bits (134), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 23/35 (65%), Positives = 28/35 (80%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 241
           +VVL STPD+E GF+RDLF QW S+  NSII+T R
Sbjct: 325 KVVLASTPDLESGFTRDLFVQWASNANNSIILTTR 359


>gi|13938095|gb|AAH07163.1| Cpsf2 protein, partial [Mus musculus]
          Length = 732

 Score =  161 bits (408), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 69/111 (62%), Positives = 91/111 (81%)

Query: 32  AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
           +++EWMSDKLM+ FE  RNNPF F+H+ LCH L++LA+VPSPKVVL S PD+ECGFSRDL
Sbjct: 232 SQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDL 291

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEEL 142
           F QWC  P+NSII+T RT+PGTLAR LI+    +   ++++KR++LEG+EL
Sbjct: 292 FIQWCQDPKNSIILTYRTTPGTLARFLIDNPTEKVTEIELRKRVKLEGKEL 342



 Score = 69.7 bits (169), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 41/116 (35%), Positives = 57/116 (49%), Gaps = 40/116 (34%)

Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
           ++P H + F+NE +LSDFKQ L R GI  EF+ GVL+ C   VAVRR             
Sbjct: 657 EVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNQVAVRR------------- 702

Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                     +++  I LEGCL  ++YR++ LLY+QYAI+
Sbjct: 703 --------------------------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 732



 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 25/36 (69%), Positives = 29/36 (80%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
           +VVL S PD+ECGFSRDLF QWC  P+NSII+T R 
Sbjct: 274 KVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 309


>gi|8393762|ref|NP_058552.1| cleavage and polyadenylation specificity factor subunit 2 [Mus
           musculus]
 gi|18202027|sp|O35218.1|CPSF2_MOUSE RecName: Full=Cleavage and polyadenylation specificity factor
           subunit 2; AltName: Full=Cleavage and polyadenylation
           specificity factor 100 kDa subunit; Short=CPSF 100 kDa
           subunit
 gi|2331036|gb|AAB66830.1| cleavage and polyadenylation specificity factor [Mus musculus]
 gi|15489017|gb|AAH13628.1| Cleavage and polyadenylation specific factor 2 [Mus musculus]
 gi|148686924|gb|EDL18871.1| cleavage and polyadenylation specific factor 2 [Mus musculus]
          Length = 782

 Score =  161 bits (408), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 69/111 (62%), Positives = 91/111 (81%)

Query: 32  AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
           +++EWMSDKLM+ FE  RNNPF F+H+ LCH L++LA+VPSPKVVL S PD+ECGFSRDL
Sbjct: 282 SQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDL 341

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEEL 142
           F QWC  P+NSII+T RT+PGTLAR LI+    +   ++++KR++LEG+EL
Sbjct: 342 FIQWCQDPKNSIILTYRTTPGTLARFLIDNPTEKVTEIELRKRVKLEGKEL 392



 Score = 69.7 bits (169), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 41/116 (35%), Positives = 57/116 (49%), Gaps = 40/116 (34%)

Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
           ++P H + F+NE +LSDFKQ L R GI  EF+ GVL+ C   VAVRR             
Sbjct: 707 EVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNQVAVRR------------- 752

Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                     +++  I LEGCL  ++YR++ LLY+QYAI+
Sbjct: 753 --------------------------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782



 Score = 62.4 bits (150), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 25/36 (69%), Positives = 29/36 (80%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
           +VVL S PD+ECGFSRDLF QWC  P+NSII+T R 
Sbjct: 324 KVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 359


>gi|195392300|ref|XP_002054797.1| GJ24636 [Drosophila virilis]
 gi|194152883|gb|EDW68317.1| GJ24636 [Drosophila virilis]
          Length = 693

 Score =  161 bits (408), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 76/130 (58%), Positives = 99/130 (76%), Gaps = 4/130 (3%)

Query: 24  LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPS-PKVVLVST 80
           + +++ E AK  IEWMSDKL K+FEGARNNPF FKH+ LCH+LA++ K+P+ PKVVL ST
Sbjct: 272 VSYNVIEFAKSQIEWMSDKLTKAFEGARNNPFQFKHIHLCHTLADIYKLPAGPKVVLAST 331

Query: 81  PDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGG-NRTLTLQVKKRIRLEG 139
           PDME GF+RDLF QW S+P NSII T RT PG+L+ +L+E     R + L V++R+ LEG
Sbjct: 332 PDMESGFTRDLFVQWASNPNNSIIFTTRTGPGSLSMELVENSTPGRQIELDVRRRVELEG 391

Query: 140 EELEEYQKKK 149
            ELEEY + +
Sbjct: 392 AELEEYLRTQ 401



 Score = 86.7 bits (213), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 45/119 (37%), Positives = 65/119 (54%), Gaps = 39/119 (32%)

Query: 158 EKEKIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDME 217
           E+++IP H++  INEL+LSDFKQ L RN I+ EF  GVL C  GT+A+RRV         
Sbjct: 614 EEDEIPVHNSVLINELKLSDFKQVLMRNNINSEFSGGVLWCSNGTLALRRV--------- 664

Query: 218 CGFSRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                         D+  + +EGCLS++YY++++LLY+QYAI+
Sbjct: 665 ------------------------------DAGKVAMEGCLSEDYYKIRELLYEQYAIV 693



 Score = 60.1 bits (144), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 25/35 (71%), Positives = 28/35 (80%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 241
           +VVL STPDME GF+RDLF QW S+P NSII T R
Sbjct: 325 KVVLASTPDMESGFTRDLFVQWASNPNNSIIFTTR 359


>gi|195109795|ref|XP_001999467.1| GI23051 [Drosophila mojavensis]
 gi|193916061|gb|EDW14928.1| GI23051 [Drosophila mojavensis]
          Length = 754

 Score =  161 bits (408), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 75/130 (57%), Positives = 98/130 (75%), Gaps = 4/130 (3%)

Query: 24  LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPS-PKVVLVST 80
           + +++ E AK  IEWMSDKL K+FEGARNNPF FKH+ LCH+LA++ K+P+ PKVVL ST
Sbjct: 272 VSYNVIEFAKSQIEWMSDKLTKAFEGARNNPFQFKHINLCHTLADIYKLPAGPKVVLAST 331

Query: 81  PDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIE-LGGNRTLTLQVKKRIRLEG 139
           PDME GF+RDLF QW  +P NSII T RT PG+L+ DL+E     R + L +++R+ LEG
Sbjct: 332 PDMESGFTRDLFVQWAGNPNNSIIFTTRTGPGSLSMDLVENYSPGRQIELDLRRRVELEG 391

Query: 140 EELEEYQKKK 149
            ELEEY + +
Sbjct: 392 AELEEYLRTQ 401



 Score = 88.6 bits (218), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 46/119 (38%), Positives = 65/119 (54%), Gaps = 39/119 (32%)

Query: 158 EKEKIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDME 217
           E+++IP H++  INEL+LSDFKQ L RN I+ EF  GVL CC GT+A+RRV         
Sbjct: 675 EEDEIPVHNSVLINELKLSDFKQVLMRNNINSEFSGGVLWCCNGTLALRRV--------- 725

Query: 218 CGFSRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                         D   + +EGCLS++YY++++LLY+QYAI+
Sbjct: 726 ------------------------------DVGKVAMEGCLSEDYYKIRELLYEQYAIV 754



 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 24/35 (68%), Positives = 27/35 (77%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 241
           +VVL STPDME GF+RDLF QW  +P NSII T R
Sbjct: 325 KVVLASTPDMESGFTRDLFVQWAGNPNNSIIFTTR 359


>gi|194745794|ref|XP_001955372.1| GF16269 [Drosophila ananassae]
 gi|190628409|gb|EDV43933.1| GF16269 [Drosophila ananassae]
          Length = 756

 Score =  161 bits (407), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 76/130 (58%), Positives = 101/130 (77%), Gaps = 4/130 (3%)

Query: 24  LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPS-PKVVLVST 80
           + +++ E AK  IEWMSDKL K+FEGARNNPF FKH++LCHSLA++ K+P+ PKVVL ST
Sbjct: 272 VSYNVIEFAKSQIEWMSDKLTKAFEGARNNPFQFKHIQLCHSLADIYKLPAGPKVVLAST 331

Query: 81  PDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIE-LGGNRTLTLQVKKRIRLEG 139
           PD+E GF+RDLF QW S+  NSII+T RTSPGTLA +L+E     R + L +++R+ LEG
Sbjct: 332 PDLESGFTRDLFVQWASNSNNSIILTTRTSPGTLAMELVENCTPGRQIELDIRRRVELEG 391

Query: 140 EELEEYQKKK 149
            EL+EY + +
Sbjct: 392 AELDEYLRTQ 401



 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 45/118 (38%), Positives = 64/118 (54%), Gaps = 39/118 (33%)

Query: 159 KEKIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMEC 218
           +++IP H++  INEL+LSDFKQ L RN I+ EF  GVL C  GT+A+RRV          
Sbjct: 678 EDEIPVHNSVLINELKLSDFKQILMRNNINSEFSGGVLWCSNGTLALRRV---------- 727

Query: 219 GFSRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                        D+  + +EGCLS+EYY++++LLY+QYAI+
Sbjct: 728 -----------------------------DAGKVAMEGCLSEEYYKIRELLYEQYAIV 756



 Score = 56.2 bits (134), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 31/77 (40%), Positives = 38/77 (49%), Gaps = 17/77 (22%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRNRADDSDSNV--------------- 251
           +VVL STPD+E GF+RDLF QW S+  NSII+T R         +               
Sbjct: 325 KVVLASTPDLESGFTRDLFVQWASNSNNSIILTTRTSPGTLAMELVENCTPGRQIELDIR 384

Query: 252 --IVLEGCLSDEYYRVQ 266
             + LEG   DEY R Q
Sbjct: 385 RRVELEGAELDEYLRTQ 401


>gi|344253621|gb|EGW09725.1| Sodium/potassium/calcium exchanger 4 [Cricetulus griseus]
          Length = 1206

 Score =  161 bits (407), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 69/111 (62%), Positives = 91/111 (81%)

Query: 32  AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
           +++EWMSDKLM+ FE  RNNPF F+H+ LCH L++LA+VPSPKVVL S PD+ECGFSRDL
Sbjct: 200 SQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDL 259

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEEL 142
           F QWC  P+NSII+T RT+PGTLAR LI+    +   ++++KR++LEG+EL
Sbjct: 260 FIQWCQDPKNSIILTYRTTPGTLARFLIDNPSEKVTEIELRKRVKLEGKEL 310



 Score = 61.6 bits (148), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 25/36 (69%), Positives = 29/36 (80%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
           +VVL S PD+ECGFSRDLF QWC  P+NSII+T R 
Sbjct: 242 KVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 277


>gi|74183852|dbj|BAE24504.1| unnamed protein product [Mus musculus]
          Length = 493

 Score =  161 bits (407), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 69/111 (62%), Positives = 91/111 (81%)

Query: 32  AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
           +++EWMSDKLM+ FE  RNNPF F+H+ LCH L++LA+VPSPKVVL S PD+ECGFSRDL
Sbjct: 282 SQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDL 341

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEEL 142
           F QWC  P+NSII+T RT+PGTLAR LI+    +   ++++KR++LEG+EL
Sbjct: 342 FIQWCQDPKNSIILTYRTTPGTLARFLIDNPTEKVTEIELRKRVKLEGKEL 392



 Score = 62.0 bits (149), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 25/36 (69%), Positives = 29/36 (80%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
           +VVL S PD+ECGFSRDLF QWC  P+NSII+T R 
Sbjct: 324 KVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 359


>gi|74188762|dbj|BAE28111.1| unnamed protein product [Mus musculus]
          Length = 412

 Score =  160 bits (406), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 69/111 (62%), Positives = 91/111 (81%)

Query: 32  AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
           +++EWMSDKLM+ FE  RNNPF F+H+ LCH L++LA+VPSPKVVL S PD+ECGFSRDL
Sbjct: 282 SQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDL 341

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEEL 142
           F QWC  P+NSII+T RT+PGTLAR LI+    +   ++++KR++LEG+EL
Sbjct: 342 FIQWCQDPKNSIILTYRTTPGTLARFLIDNPTEKVTEIELRKRVKLEGKEL 392



 Score = 62.0 bits (149), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 25/36 (69%), Positives = 29/36 (80%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
           +VVL S PD+ECGFSRDLF QWC  P+NSII+T R 
Sbjct: 324 KVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 359


>gi|432115811|gb|ELK36959.1| Cleavage and polyadenylation specificity factor subunit 2 [Myotis
           davidii]
          Length = 687

 Score =  160 bits (406), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 72/124 (58%), Positives = 94/124 (75%), Gaps = 10/124 (8%)

Query: 32  AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
           +++EWMSDKLM+ FE  RNNPF F+H+ LCH L++LA+VPSPKVVL S PD+ECGFSRDL
Sbjct: 200 SQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDL 259

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLIELG----------GNRTLTLQVKKRIRLEGEE 141
           F QWC  P+NSII+T RT+PGTLAR LI+              +   ++++KR++LEG+E
Sbjct: 260 FIQWCEDPKNSIILTYRTTPGTLARFLIDNPLPHPSPSLHFAEKVTEIELRKRVKLEGKE 319

Query: 142 LEEY 145
           LEEY
Sbjct: 320 LEEY 323



 Score = 62.4 bits (150), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 26/40 (65%), Positives = 30/40 (75%)

Query: 203 VAVRRVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
           V   +VVL S PD+ECGFSRDLF QWC  P+NSII+T R 
Sbjct: 238 VPSPKVVLASQPDLECGFSRDLFIQWCEDPKNSIILTYRT 277



 Score = 45.4 bits (106), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 21/47 (44%), Positives = 28/47 (59%)

Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRR 207
           ++P H + F+NE +L DFKQ L R  I  EF+ GVL+C       RR
Sbjct: 635 EVPGHQSVFMNEPRLFDFKQVLLREWIQAEFVGGVLVCNNQISVCRR 681


>gi|195574631|ref|XP_002105288.1| GD21403 [Drosophila simulans]
 gi|194201215|gb|EDX14791.1| GD21403 [Drosophila simulans]
          Length = 664

 Score =  160 bits (406), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 76/126 (60%), Positives = 98/126 (77%), Gaps = 4/126 (3%)

Query: 24  LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPS-PKVVLVST 80
           + +++ E AK  IEWMSDKL K+FEGARNNPF FKH++LCHSLA++  +P+ PKVVL ST
Sbjct: 180 VSYNVIEFAKSQIEWMSDKLTKAFEGARNNPFQFKHIQLCHSLADVYNLPAGPKVVLAST 239

Query: 81  PDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIE-LGGNRTLTLQVKKRIRLEG 139
           PD+E GF+RDLF QW S+  NSII+T RTSPGTLA +L+E     + + L V++R+ LEG
Sbjct: 240 PDLESGFTRDLFVQWASNANNSIILTTRTSPGTLAMELVENCAPGKQIELDVRRRVDLEG 299

Query: 140 EELEEY 145
            ELEEY
Sbjct: 300 AELEEY 305



 Score = 88.2 bits (217), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 46/117 (39%), Positives = 64/117 (54%), Gaps = 39/117 (33%)

Query: 160 EKIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECG 219
           ++IP H++  INEL+LSDFKQTL RN I+ EF  GVL C  GT+A+RRV           
Sbjct: 587 DEIPIHNSVLINELKLSDFKQTLMRNNINSEFSGGVLWCSNGTLALRRV----------- 635

Query: 220 FSRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                       D+  + +EGCLS+EYY++++LLY+QYAI+
Sbjct: 636 ----------------------------DAGKVAMEGCLSEEYYKIRELLYEQYAIV 664



 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 23/35 (65%), Positives = 28/35 (80%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 241
           +VVL STPD+E GF+RDLF QW S+  NSII+T R
Sbjct: 233 KVVLASTPDLESGFTRDLFVQWASNANNSIILTTR 267


>gi|348517622|ref|XP_003446332.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           2-like [Oreochromis niloticus]
          Length = 787

 Score =  160 bits (405), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 71/108 (65%), Positives = 90/108 (83%)

Query: 32  AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
           +++EWMSDKLM+ FE  RNNPF F+H+ LCHSLA+LA+VPSPKVVL S PD+E GFSR+L
Sbjct: 282 SQVEWMSDKLMRCFEDKRNNPFQFRHLTLCHSLADLARVPSPKVVLCSQPDLESGFSREL 341

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEG 139
           F QWC + +NSII+T RT+PGTLAR LI+  G + L L+VKKR++LEG
Sbjct: 342 FIQWCQNAKNSIILTYRTTPGTLARYLIDNPGEKMLDLEVKKRVKLEG 389



 Score = 72.0 bits (175), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 42/114 (36%), Positives = 56/114 (49%), Gaps = 40/114 (35%)

Query: 163 PPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGFSR 222
           P H + FINE +LSDFKQ L R GI  EF+ GVL+ C   VAVRR               
Sbjct: 714 PGHQSVFINEPRLSDFKQVLLREGIQAEFVGGVLV-CNNMVAVRR--------------- 757

Query: 223 DLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                   +++  I LEGCL D+YY++++LLY QYA++
Sbjct: 758 ------------------------TEAGRIGLEGCLCDDYYKIRELLYQQYAVV 787



 Score = 53.5 bits (127), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 22/36 (61%), Positives = 28/36 (77%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
           +VVL S PD+E GFSR+LF QWC + +NSII+T R 
Sbjct: 324 KVVLCSQPDLESGFSRELFIQWCQNAKNSIILTYRT 359


>gi|147901518|ref|NP_001081123.1| cleavage and polyadenylation specificity factor subunit 2 [Xenopus
           laevis]
 gi|18203567|sp|Q9W799.1|CPSF2_XENLA RecName: Full=Cleavage and polyadenylation specificity factor
           subunit 2; AltName: Full=Cleavage and polyadenylation
           specificity factor 100 kDa subunit; Short=CPSF 100 kDa
           subunit
 gi|4927240|gb|AAD33061.1|AF139986_1 cleavage and polyadenylation specificity factor 100 kDa subunit
           [Xenopus laevis]
          Length = 783

 Score =  160 bits (405), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 67/111 (60%), Positives = 91/111 (81%)

Query: 32  AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
           +++EWMSDKLM+ FE  RNNPF F+H+ LCH  ++LA+VPSPKVVL S PD+ECGFSR+L
Sbjct: 282 SQVEWMSDKLMRCFEDKRNNPFQFRHLTLCHGYSDLARVPSPKVVLASQPDLECGFSREL 341

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEEL 142
           F QWC  P+NS+I+T RT+PGTLAR LI+    R + ++++KR++LEG+EL
Sbjct: 342 FIQWCQDPKNSVILTYRTTPGTLARFLIDHPSERIIDIELRKRVKLEGKEL 392



 Score = 67.8 bits (164), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 39/116 (33%), Positives = 59/116 (50%), Gaps = 40/116 (34%)

Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
           ++P H + F+NE +LSDFKQ L R GI  EF+ GVL+ C   VAVRR             
Sbjct: 708 EVPGHQSVFMNEPRLSDFKQVLLREGIHAEFVGGVLV-CNNMVAVRR------------- 753

Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                     +++  I LEGCL ++++++++LLY+QYAI+
Sbjct: 754 --------------------------TETGRIGLEGCLCEDFFKIRELLYEQYAIV 783



 Score = 60.1 bits (144), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 27/52 (51%), Positives = 33/52 (63%), Gaps = 5/52 (9%)

Query: 196 LICCRGTVAVRRV-----VLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
           L  C G   + RV     VL S PD+ECGFSR+LF QWC  P+NS+I+T R 
Sbjct: 308 LTLCHGYSDLARVPSPKVVLASQPDLECGFSRELFIQWCQDPKNSVILTYRT 359


>gi|195054718|ref|XP_001994270.1| GH10247 [Drosophila grimshawi]
 gi|193896140|gb|EDV95006.1| GH10247 [Drosophila grimshawi]
          Length = 754

 Score =  160 bits (404), Expect = 7e-37,   Method: Compositional matrix adjust.
 Identities = 76/130 (58%), Positives = 97/130 (74%), Gaps = 4/130 (3%)

Query: 24  LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVP-SPKVVLVST 80
           + +++ E AK  IEWMSDKL K+FEGARNNPF FKH+ LCH+LA++ K+P  PKVVL ST
Sbjct: 272 VSYNVIEFAKSQIEWMSDKLTKAFEGARNNPFQFKHINLCHTLADVYKLPVGPKVVLAST 331

Query: 81  PDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELG-GNRTLTLQVKKRIRLEG 139
           PDME GF+RDLF QW  +P NSII T RT PG+L+ +L+E     R L L V++R+ LEG
Sbjct: 332 PDMESGFTRDLFVQWAGNPNNSIIFTTRTGPGSLSMELVENSVPGRQLELDVRRRVELEG 391

Query: 140 EELEEYQKKK 149
            ELEEY + +
Sbjct: 392 AELEEYLRTQ 401



 Score = 88.6 bits (218), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 45/118 (38%), Positives = 65/118 (55%), Gaps = 39/118 (33%)

Query: 159 KEKIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMEC 218
           +++IP H++  INEL+LSDFKQ L RN I+ EF  GVL CC GT+A+RRV          
Sbjct: 676 EDEIPVHNSVLINELKLSDFKQVLMRNSINSEFSGGVLWCCNGTLALRRV---------- 725

Query: 219 GFSRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                        D+  + +EGC+S+EYY++++LLY+QYAI+
Sbjct: 726 -----------------------------DTGKVAMEGCISEEYYKIRELLYEQYAIV 754



 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 24/35 (68%), Positives = 27/35 (77%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 241
           +VVL STPDME GF+RDLF QW  +P NSII T R
Sbjct: 325 KVVLASTPDMESGFTRDLFVQWAGNPNNSIIFTTR 359


>gi|327259138|ref|XP_003214395.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           2-like [Anolis carolinensis]
          Length = 783

 Score =  159 bits (403), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 66/108 (61%), Positives = 89/108 (82%)

Query: 32  AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
           +++EWMSDKLM+ FE  RNNPF F+H+ LCH L++LA+VPSPKVVL S PD++CGFSRDL
Sbjct: 282 SQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLDCGFSRDL 341

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEG 139
           F QWC  P+NSII+T RT+PGTLAR LI+    + + ++++KR++LEG
Sbjct: 342 FIQWCQDPKNSIILTYRTTPGTLARFLIDNSSEKVIDMELRKRVKLEG 389



 Score = 68.9 bits (167), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 40/116 (34%), Positives = 58/116 (50%), Gaps = 40/116 (34%)

Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
           ++P H + F+NE +LSDFKQ L R GI  EF+ GVL+ C   VAVRR             
Sbjct: 708 EVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNLVAVRR------------- 753

Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                     +++  I LEGCL +++Y+++ LLY+QYAI+
Sbjct: 754 --------------------------TETGRIGLEGCLCEDFYKIRDLLYEQYAIV 783



 Score = 61.2 bits (147), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 29/57 (50%), Positives = 36/57 (63%), Gaps = 8/57 (14%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRNRA--------DDSDSNVIVLE 255
           +VVL S PD++CGFSRDLF QWC  P+NSII+T R           D+S   VI +E
Sbjct: 324 KVVLASQPDLDCGFSRDLFIQWCQDPKNSIILTYRTTPGTLARFLIDNSSEKVIDME 380


>gi|387015290|gb|AFJ49764.1| Cleavage and polyadenylation specificity factor subunit 2-like
           [Crotalus adamanteus]
          Length = 783

 Score =  159 bits (402), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 67/108 (62%), Positives = 88/108 (81%)

Query: 32  AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
           +++EWMSDKLM+ FE  RNNPF F+H+ LCH LA+LA+VPSPKVVL S PD++CGFSRDL
Sbjct: 282 SQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLADLARVPSPKVVLASQPDLDCGFSRDL 341

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEG 139
           F QWC  P+NSII+T RT+PGTLAR LI+    + + ++ +KR++LEG
Sbjct: 342 FIQWCQDPKNSIILTYRTTPGTLARFLIDNPSEKVIDIEFRKRVKLEG 389



 Score = 68.2 bits (165), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 39/116 (33%), Positives = 58/116 (50%), Gaps = 40/116 (34%)

Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
           ++P H + F+NE +LSDFKQ L R G+  EF+ GVL+ C   VAVRR             
Sbjct: 708 EVPGHQSVFMNEPRLSDFKQVLLREGVQAEFVGGVLV-CNNLVAVRR------------- 753

Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                     +++  I LEGCL +++Y+++ LLY+QYAI+
Sbjct: 754 --------------------------TETGRIGLEGCLCEDFYKIRDLLYEQYAIV 783



 Score = 61.6 bits (148), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 28/52 (53%), Positives = 33/52 (63%), Gaps = 5/52 (9%)

Query: 196 LICCRGTVAVRRV-----VLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
           L  C G   + RV     VL S PD++CGFSRDLF QWC  P+NSII+T R 
Sbjct: 308 LSLCHGLADLARVPSPKVVLASQPDLDCGFSRDLFIQWCQDPKNSIILTYRT 359


>gi|7243115|dbj|BAA92605.1| KIAA1367 protein [Homo sapiens]
          Length = 579

 Score =  159 bits (401), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 69/118 (58%), Positives = 93/118 (78%), Gaps = 2/118 (1%)

Query: 24  LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTP 81
           + +++ E +K  +EWMSDKLM+ FE  RNNPF F+H+ LCH L++LA+VPSPKVVL S P
Sbjct: 69  VSYNVVEFSKSQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQP 128

Query: 82  DMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEG 139
           D+ECGFSRDLF QWC  P+NSII+T RT+PGTLAR LI+    +   ++++KR++LEG
Sbjct: 129 DLECGFSRDLFIQWCQDPKNSIILTYRTTPGTLARFLIDNPSEKITEIELRKRVKLEG 186



 Score = 70.1 bits (170), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 41/116 (35%), Positives = 57/116 (49%), Gaps = 40/116 (34%)

Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
           ++P H + F+NE +LSDFKQ L R GI  EF+ GVL+ C   VAVRR             
Sbjct: 504 EVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNQVAVRR------------- 549

Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                     +++  I LEGCL  ++YR++ LLY+QYAI+
Sbjct: 550 --------------------------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 579



 Score = 62.4 bits (150), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 25/36 (69%), Positives = 29/36 (80%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
           +VVL S PD+ECGFSRDLF QWC  P+NSII+T R 
Sbjct: 121 KVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 156


>gi|67968123|dbj|BAE00542.1| unnamed protein product [Macaca fascicularis]
          Length = 592

 Score =  159 bits (401), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 69/118 (58%), Positives = 93/118 (78%), Gaps = 2/118 (1%)

Query: 24  LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTP 81
           + +++ E +K  +EWMSDKLM+ FE  RNNPF F+H+ LCH L++LA+VPSPKVVL S P
Sbjct: 82  VSYNVVEFSKSQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQP 141

Query: 82  DMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEG 139
           D+ECGFSRDLF QWC  P+NSII+T RT+PGTLAR LI+    +   ++++KR++LEG
Sbjct: 142 DLECGFSRDLFIQWCQDPKNSIILTYRTTPGTLARFLIDNPSEKITEIELRKRVKLEG 199



 Score = 69.7 bits (169), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 41/116 (35%), Positives = 57/116 (49%), Gaps = 40/116 (34%)

Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
           ++P H + F+NE +LSDFKQ L R GI  EF+ GVL+ C   VAVRR             
Sbjct: 517 EVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNQVAVRR------------- 562

Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                     +++  I LEGCL  ++YR++ LLY+QYAI+
Sbjct: 563 --------------------------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 592



 Score = 62.4 bits (150), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 26/40 (65%), Positives = 30/40 (75%)

Query: 203 VAVRRVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
           V   +VVL S PD+ECGFSRDLF QWC  P+NSII+T R 
Sbjct: 130 VPSPKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 169


>gi|291406601|ref|XP_002719640.1| PREDICTED: cleavage and polyadenylation specific factor 2
           [Oryctolagus cuniculus]
          Length = 782

 Score =  159 bits (401), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 67/108 (62%), Positives = 88/108 (81%)

Query: 32  AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
           +++EWMSDKLM+ FE  RNNPF F+H+ LCH L++LA+VPSPKVVL S PD+ECGFSRDL
Sbjct: 282 SQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDL 341

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEG 139
           F QWC  P+NSII+T RT+PGTLAR LI+    +   ++++KR++LEG
Sbjct: 342 FIQWCQDPKNSIILTYRTTPGTLARFLIDNPSEKVTEIELRKRVKLEG 389



 Score = 69.7 bits (169), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 41/116 (35%), Positives = 57/116 (49%), Gaps = 40/116 (34%)

Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
           ++P H + F+NE +LSDFKQ L R GI  EF+ GVL+ C   VAVRR             
Sbjct: 707 EVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNQVAVRR------------- 752

Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                     +++  I LEGCL  ++YR++ LLY+QYAI+
Sbjct: 753 --------------------------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782



 Score = 62.4 bits (150), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 25/36 (69%), Positives = 29/36 (80%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
           +VVL S PD+ECGFSRDLF QWC  P+NSII+T R 
Sbjct: 324 KVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 359


>gi|28461235|ref|NP_787002.1| cleavage and polyadenylation specificity factor subunit 2 [Bos
           taurus]
 gi|426248504|ref|XP_004018003.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           2 [Ovis aries]
 gi|1706103|sp|Q10568.1|CPSF2_BOVIN RecName: Full=Cleavage and polyadenylation specificity factor
           subunit 2; AltName: Full=Cleavage and polyadenylation
           specificity factor 100 kDa subunit; Short=CPSF 100 kDa
           subunit
 gi|599683|emb|CAA53535.1| Cleavage and Polyadenylation specificity factor (CPSF) 100kD
           subunit [Bos taurus]
 gi|296475169|tpg|DAA17284.1| TPA: cleavage and polyadenylation specificity factor subunit 2 [Bos
           taurus]
 gi|440892550|gb|ELR45701.1| Cleavage and polyadenylation specificity factor subunit 2 [Bos
           grunniens mutus]
          Length = 782

 Score =  159 bits (401), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 67/108 (62%), Positives = 88/108 (81%)

Query: 32  AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
           +++EWMSDKLM+ FE  RNNPF F+H+ LCH L++LA+VPSPKVVL S PD+ECGFSRDL
Sbjct: 282 SQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDL 341

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEG 139
           F QWC  P+NSII+T RT+PGTLAR LI+    +   ++++KR++LEG
Sbjct: 342 FIQWCQDPKNSIILTYRTTPGTLARFLIDNPSEKVTEIELRKRVKLEG 389



 Score = 69.7 bits (169), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 41/116 (35%), Positives = 57/116 (49%), Gaps = 40/116 (34%)

Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
           ++P H + F+NE +LSDFKQ L R GI  EF+ GVL+ C   VAVRR             
Sbjct: 707 EVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNQVAVRR------------- 752

Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                     +++  I LEGCL  ++YR++ LLY+QYAI+
Sbjct: 753 --------------------------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782



 Score = 62.4 bits (150), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 25/36 (69%), Positives = 29/36 (80%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
           +VVL S PD+ECGFSRDLF QWC  P+NSII+T R 
Sbjct: 324 KVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 359


>gi|426377790|ref|XP_004055637.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           2 [Gorilla gorilla gorilla]
 gi|193785772|dbj|BAG51207.1| unnamed protein product [Homo sapiens]
          Length = 690

 Score =  159 bits (401), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 69/118 (58%), Positives = 93/118 (78%), Gaps = 2/118 (1%)

Query: 24  LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTP 81
           + +++ E +K  +EWMSDKLM+ FE  RNNPF F+H+ LCH L++LA+VPSPKVVL S P
Sbjct: 180 VSYNVVEFSKSQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQP 239

Query: 82  DMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEG 139
           D+ECGFSRDLF QWC  P+NSII+T RT+PGTLAR LI+    +   ++++KR++LEG
Sbjct: 240 DLECGFSRDLFIQWCQDPKNSIILTYRTTPGTLARFLIDNPSEKITEIELRKRVKLEG 297



 Score = 69.7 bits (169), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 41/116 (35%), Positives = 57/116 (49%), Gaps = 40/116 (34%)

Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
           ++P H + F+NE +LSDFKQ L R GI  EF+ GVL+ C   VAVRR             
Sbjct: 615 EVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNQVAVRR------------- 660

Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                     +++  I LEGCL  ++YR++ LLY+QYAI+
Sbjct: 661 --------------------------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 690



 Score = 62.4 bits (150), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 25/36 (69%), Positives = 29/36 (80%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
           +VVL S PD+ECGFSRDLF QWC  P+NSII+T R 
Sbjct: 232 KVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 267


>gi|10241720|emb|CAC09445.1| hypothetical protein [Homo sapiens]
          Length = 504

 Score =  159 bits (401), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 67/108 (62%), Positives = 88/108 (81%)

Query: 32  AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
           +++EWMSDKLM+ FE  RNNPF F+H+ LCH L++LA+VPSPKVVL S PD+ECGFSRDL
Sbjct: 4   SQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDL 63

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEG 139
           F QWC  P+NSII+T RT+PGTLAR LI+    +   ++++KR++LEG
Sbjct: 64  FIQWCQDPKNSIILTYRTTPGTLARFLIDNPSEKITEIELRKRVKLEG 111



 Score = 70.1 bits (170), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 41/116 (35%), Positives = 57/116 (49%), Gaps = 40/116 (34%)

Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
           ++P H + F+NE +LSDFKQ L R GI  EF+ GVL+ C   VAVRR             
Sbjct: 429 EVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNQVAVRR------------- 474

Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                     +++  I LEGCL  ++YR++ LLY+QYAI+
Sbjct: 475 --------------------------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 504



 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 26/40 (65%), Positives = 30/40 (75%)

Query: 203 VAVRRVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
           V   +VVL S PD+ECGFSRDLF QWC  P+NSII+T R 
Sbjct: 42  VPSPKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 81


>gi|410962841|ref|XP_003987977.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           2 [Felis catus]
          Length = 690

 Score =  159 bits (401), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 69/118 (58%), Positives = 93/118 (78%), Gaps = 2/118 (1%)

Query: 24  LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTP 81
           + +++ E +K  +EWMSDKLM+ FE  RNNPF F+H+ LCH L++LA+VPSPKVVL S P
Sbjct: 180 VSYNVVEFSKSQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQP 239

Query: 82  DMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEG 139
           D+ECGFSRDLF QWC  P+NSII+T RT+PGTLAR LI+    +   ++++KR++LEG
Sbjct: 240 DLECGFSRDLFIQWCQDPKNSIILTYRTTPGTLARFLIDNPSEKITEIELRKRVKLEG 297



 Score = 69.7 bits (169), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 41/116 (35%), Positives = 57/116 (49%), Gaps = 40/116 (34%)

Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
           ++P H + F+NE +LSDFKQ L R GI  EF+ GVL+ C   VAVRR             
Sbjct: 615 EVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNQVAVRR------------- 660

Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                     +++  I LEGCL  ++YR++ LLY+QYAI+
Sbjct: 661 --------------------------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 690



 Score = 62.4 bits (150), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 25/36 (69%), Positives = 29/36 (80%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
           +VVL S PD+ECGFSRDLF QWC  P+NSII+T R 
Sbjct: 232 KVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 267


>gi|432944969|ref|XP_004083472.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           2-like [Oryzias latipes]
          Length = 787

 Score =  159 bits (401), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 69/108 (63%), Positives = 89/108 (82%)

Query: 32  AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
           +++EWMSDKLM+ FE  RNNPF F+H+ LCHSLA+LA+VPSPKVVL S PD+E GFSR+L
Sbjct: 282 SQVEWMSDKLMRCFEDKRNNPFQFRHLNLCHSLADLARVPSPKVVLCSQPDLESGFSREL 341

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEG 139
           F QWC + +NSII+T RT+PGTL R LI+  G + L L+V+KR++LEG
Sbjct: 342 FIQWCQNSKNSIILTYRTTPGTLGRYLIDHPGEKMLDLEVRKRVKLEG 389



 Score = 73.2 bits (178), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 43/116 (37%), Positives = 57/116 (49%), Gaps = 40/116 (34%)

Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
           +IP H   FINE +LSDFKQ L R GI  EF+ GVL+ C   VAVRR             
Sbjct: 712 EIPGHQAVFINEPRLSDFKQVLLREGIQAEFVGGVLV-CNNMVAVRR------------- 757

Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                     +++  I LEGCL D+YY++++LLY QYA++
Sbjct: 758 --------------------------TEAGRIGLEGCLCDDYYKIRELLYQQYAVV 787



 Score = 53.5 bits (127), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 22/36 (61%), Positives = 28/36 (77%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
           +VVL S PD+E GFSR+LF QWC + +NSII+T R 
Sbjct: 324 KVVLCSQPDLESGFSRELFIQWCQNSKNSIILTYRT 359


>gi|193786016|dbj|BAG50992.1| unnamed protein product [Homo sapiens]
          Length = 644

 Score =  158 bits (400), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 67/108 (62%), Positives = 88/108 (81%)

Query: 32  AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
           +++EWMSDKLM+ FE  RNNPF F+H+ LCH L++LA+VPSPKVVL S PD+ECGFSRDL
Sbjct: 144 SQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDL 203

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEG 139
           F QWC  P+NSII+T RT+PGTLAR LI+    +   ++++KR++LEG
Sbjct: 204 FIQWCQDPKNSIILTYRTTPGTLARFLIDNPSEKITEIELRKRVKLEG 251



 Score = 69.3 bits (168), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 41/116 (35%), Positives = 57/116 (49%), Gaps = 40/116 (34%)

Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
           ++P H + F+NE +LSDFKQ L R GI  EF+ GVL+ C   VAVRR             
Sbjct: 569 EVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNQVAVRR------------- 614

Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                     +++  I LEGCL  ++YR++ LLY+QYAI+
Sbjct: 615 --------------------------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 644



 Score = 62.4 bits (150), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 26/40 (65%), Positives = 30/40 (75%)

Query: 203 VAVRRVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
           V   +VVL S PD+ECGFSRDLF QWC  P+NSII+T R 
Sbjct: 182 VPSPKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 221


>gi|403298151|ref|XP_003939898.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           2 isoform 2 [Saimiri boliviensis boliviensis]
          Length = 648

 Score =  158 bits (400), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 67/108 (62%), Positives = 88/108 (81%)

Query: 32  AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
           +++EWMSDKLM+ FE  RNNPF F+H+ LCH L++LA+VPSPKVVL S PD+ECGFSRDL
Sbjct: 148 SQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDL 207

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEG 139
           F QWC  P+NSII+T RT+PGTLAR LI+    +   ++++KR++LEG
Sbjct: 208 FIQWCQDPKNSIILTYRTTPGTLARFLIDNPSEKITEIELRKRVKLEG 255



 Score = 69.3 bits (168), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 41/116 (35%), Positives = 57/116 (49%), Gaps = 40/116 (34%)

Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
           ++P H + F+NE +LSDFKQ L R GI  EF+ GVL+ C   VAVRR             
Sbjct: 573 EVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNQVAVRR------------- 618

Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                     +++  I LEGCL  ++YR++ LLY+QYAI+
Sbjct: 619 --------------------------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 648



 Score = 62.4 bits (150), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 26/40 (65%), Positives = 30/40 (75%)

Query: 203 VAVRRVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
           V   +VVL S PD+ECGFSRDLF QWC  P+NSII+T R 
Sbjct: 186 VPSPKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 225


>gi|297695726|ref|XP_002825082.1| PREDICTED: LOW QUALITY PROTEIN: cleavage and polyadenylation
           specificity factor subunit 2 [Pongo abelii]
          Length = 747

 Score =  158 bits (400), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 69/118 (58%), Positives = 93/118 (78%), Gaps = 2/118 (1%)

Query: 24  LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTP 81
           + +++ E +K  +EWMSDKLM+ FE  RNNPF F+H+ LCH L++LA+VPSPKVVL S P
Sbjct: 237 VSYNVVEFSKSQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQP 296

Query: 82  DMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEG 139
           D+ECGFSRDLF QWC  P+NSII+T RT+PGTLAR LI+    +   ++++KR++LEG
Sbjct: 297 DLECGFSRDLFIQWCQDPKNSIILTYRTTPGTLARFLIDNPSEKITEIELRKRVKLEG 354



 Score = 69.7 bits (169), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 41/116 (35%), Positives = 57/116 (49%), Gaps = 40/116 (34%)

Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
           ++P H + F+NE +LSDFKQ L R GI  EF+ GVL+ C   VAVRR             
Sbjct: 672 EVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNQVAVRR------------- 717

Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                     +++  I LEGCL  ++YR++ LLY+QYAI+
Sbjct: 718 --------------------------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 747



 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 26/40 (65%), Positives = 30/40 (75%)

Query: 203 VAVRRVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
           V   +VVL S PD+ECGFSRDLF QWC  P+NSII+T R 
Sbjct: 285 VPSPKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 324


>gi|431839217|gb|ELK01144.1| Cleavage and polyadenylation specificity factor subunit 2 [Pteropus
           alecto]
          Length = 782

 Score =  158 bits (400), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 67/108 (62%), Positives = 88/108 (81%)

Query: 32  AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
           +++EWMSDKLM+ FE  RNNPF F+H+ LCH L++LA+VPSPKVVL S PD+ECGFSRDL
Sbjct: 282 SQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDL 341

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEG 139
           F QWC  P+NSII+T RT+PGTLAR LI+    +   ++++KR++LEG
Sbjct: 342 FIQWCQDPKNSIILTYRTTPGTLARFLIDNPSEKITEIELRKRVKLEG 389



 Score = 69.7 bits (169), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 41/116 (35%), Positives = 57/116 (49%), Gaps = 40/116 (34%)

Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
           ++P H + F+NE +LSDFKQ L R GI  EF+ GVL+ C   VAVRR             
Sbjct: 707 EVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNQVAVRR------------- 752

Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                     +++  I LEGCL  ++YR++ LLY+QYAI+
Sbjct: 753 --------------------------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782



 Score = 62.4 bits (150), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 25/36 (69%), Positives = 29/36 (80%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
           +VVL S PD+ECGFSRDLF QWC  P+NSII+T R 
Sbjct: 324 KVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 359


>gi|73962293|ref|XP_537353.2| PREDICTED: cleavage and polyadenylation specificity factor subunit
           2 isoform 1 [Canis lupus familiaris]
          Length = 782

 Score =  158 bits (400), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 67/108 (62%), Positives = 88/108 (81%)

Query: 32  AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
           +++EWMSDKLM+ FE  RNNPF F+H+ LCH L++LA+VPSPKVVL S PD+ECGFSRDL
Sbjct: 282 SQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDL 341

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEG 139
           F QWC  P+NSII+T RT+PGTLAR LI+    +   ++++KR++LEG
Sbjct: 342 FIQWCQDPKNSIILTYRTTPGTLARFLIDNPSEKITEIELRKRVKLEG 389



 Score = 69.7 bits (169), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 41/116 (35%), Positives = 57/116 (49%), Gaps = 40/116 (34%)

Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
           ++P H + F+NE +LSDFKQ L R GI  EF+ GVL+ C   VAVRR             
Sbjct: 707 EVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNQVAVRR------------- 752

Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                     +++  I LEGCL  ++YR++ LLY+QYAI+
Sbjct: 753 --------------------------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782



 Score = 62.4 bits (150), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 25/36 (69%), Positives = 29/36 (80%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
           +VVL S PD+ECGFSRDLF QWC  P+NSII+T R 
Sbjct: 324 KVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 359


>gi|351699560|gb|EHB02479.1| Cleavage and polyadenylation specificity factor subunit 2
           [Heterocephalus glaber]
          Length = 782

 Score =  158 bits (400), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 67/108 (62%), Positives = 88/108 (81%)

Query: 32  AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
           +++EWMSDKLM+ FE  RNNPF F+H+ LCH L++LA+VPSPKVVL S PD+ECGFSRDL
Sbjct: 282 SQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDL 341

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEG 139
           F QWC  P+NSII+T RT+PGTLAR LI+    +   ++++KR++LEG
Sbjct: 342 FIQWCQDPKNSIILTYRTTPGTLARFLIDNPSEKITEIELRKRVKLEG 389



 Score = 69.7 bits (169), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 41/116 (35%), Positives = 57/116 (49%), Gaps = 40/116 (34%)

Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
           ++P H + F+NE +LSDFKQ L R GI  EF+ GVL+ C   VAVRR             
Sbjct: 707 EVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNQVAVRR------------- 752

Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                     +++  I LEGCL  ++YR++ LLY+QYAI+
Sbjct: 753 --------------------------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782



 Score = 62.4 bits (150), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 25/36 (69%), Positives = 29/36 (80%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
           +VVL S PD+ECGFSRDLF QWC  P+NSII+T R 
Sbjct: 324 KVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 359


>gi|344274144|ref|XP_003408878.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           2 [Loxodonta africana]
          Length = 782

 Score =  158 bits (400), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 67/108 (62%), Positives = 88/108 (81%)

Query: 32  AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
           +++EWMSDKLM+ FE  RNNPF F+H+ LCH L++LA+VPSPKVVL S PD+ECGFSRDL
Sbjct: 282 SQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDL 341

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEG 139
           F QWC  P+NSII+T RT+PGTLAR LI+    +   ++++KR++LEG
Sbjct: 342 FIQWCQDPKNSIILTYRTTPGTLARFLIDNPSEKITEIELRKRVKLEG 389



 Score = 69.7 bits (169), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 41/116 (35%), Positives = 57/116 (49%), Gaps = 40/116 (34%)

Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
           ++P H + F+NE +LSDFKQ L R GI  EF+ GVL+ C   VAVRR             
Sbjct: 707 EVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNQVAVRR------------- 752

Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                     +++  I LEGCL  ++YR++ LLY+QYAI+
Sbjct: 753 --------------------------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782



 Score = 62.4 bits (150), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 25/36 (69%), Positives = 29/36 (80%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
           +VVL S PD+ECGFSRDLF QWC  P+NSII+T R 
Sbjct: 324 KVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 359


>gi|296215760|ref|XP_002754257.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           2 [Callithrix jacchus]
 gi|403298149|ref|XP_003939897.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           2 isoform 1 [Saimiri boliviensis boliviensis]
          Length = 782

 Score =  158 bits (400), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 67/108 (62%), Positives = 88/108 (81%)

Query: 32  AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
           +++EWMSDKLM+ FE  RNNPF F+H+ LCH L++LA+VPSPKVVL S PD+ECGFSRDL
Sbjct: 282 SQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDL 341

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEG 139
           F QWC  P+NSII+T RT+PGTLAR LI+    +   ++++KR++LEG
Sbjct: 342 FIQWCQDPKNSIILTYRTTPGTLARFLIDNPSEKITEIELRKRVKLEG 389



 Score = 69.7 bits (169), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 41/116 (35%), Positives = 57/116 (49%), Gaps = 40/116 (34%)

Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
           ++P H + F+NE +LSDFKQ L R GI  EF+ GVL+ C   VAVRR             
Sbjct: 707 EVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNQVAVRR------------- 752

Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                     +++  I LEGCL  ++YR++ LLY+QYAI+
Sbjct: 753 --------------------------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782



 Score = 62.4 bits (150), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 25/36 (69%), Positives = 29/36 (80%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
           +VVL S PD+ECGFSRDLF QWC  P+NSII+T R 
Sbjct: 324 KVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 359


>gi|34101288|ref|NP_059133.1| cleavage and polyadenylation specificity factor subunit 2 [Homo
           sapiens]
 gi|114654441|ref|XP_001147277.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           2 isoform 3 [Pan troglodytes]
 gi|397525769|ref|XP_003832826.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           2 [Pan paniscus]
 gi|51338827|sp|Q9P2I0.2|CPSF2_HUMAN RecName: Full=Cleavage and polyadenylation specificity factor
           subunit 2; AltName: Full=Cleavage and polyadenylation
           specificity factor 100 kDa subunit; Short=CPSF 100 kDa
           subunit
 gi|119601886|gb|EAW81480.1| cleavage and polyadenylation specific factor 2, 100kDa, isoform
           CRA_a [Homo sapiens]
 gi|119601888|gb|EAW81482.1| cleavage and polyadenylation specific factor 2, 100kDa, isoform
           CRA_a [Homo sapiens]
 gi|193786082|dbj|BAG50953.1| unnamed protein product [Homo sapiens]
 gi|410221574|gb|JAA08006.1| cleavage and polyadenylation specific factor 2, 100kDa [Pan
           troglodytes]
 gi|410221576|gb|JAA08007.1| cleavage and polyadenylation specific factor 2, 100kDa [Pan
           troglodytes]
 gi|410221578|gb|JAA08008.1| cleavage and polyadenylation specific factor 2, 100kDa [Pan
           troglodytes]
 gi|410252002|gb|JAA13968.1| cleavage and polyadenylation specific factor 2, 100kDa [Pan
           troglodytes]
 gi|410307320|gb|JAA32260.1| cleavage and polyadenylation specific factor 2, 100kDa [Pan
           troglodytes]
 gi|410307322|gb|JAA32261.1| cleavage and polyadenylation specific factor 2, 100kDa [Pan
           troglodytes]
 gi|410339303|gb|JAA38598.1| cleavage and polyadenylation specific factor 2, 100kDa [Pan
           troglodytes]
 gi|410339305|gb|JAA38599.1| cleavage and polyadenylation specific factor 2, 100kDa [Pan
           troglodytes]
 gi|410339307|gb|JAA38600.1| cleavage and polyadenylation specific factor 2, 100kDa [Pan
           troglodytes]
 gi|410339309|gb|JAA38601.1| cleavage and polyadenylation specific factor 2, 100kDa [Pan
           troglodytes]
 gi|410339311|gb|JAA38602.1| cleavage and polyadenylation specific factor 2, 100kDa [Pan
           troglodytes]
          Length = 782

 Score =  158 bits (400), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 67/108 (62%), Positives = 88/108 (81%)

Query: 32  AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
           +++EWMSDKLM+ FE  RNNPF F+H+ LCH L++LA+VPSPKVVL S PD+ECGFSRDL
Sbjct: 282 SQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDL 341

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEG 139
           F QWC  P+NSII+T RT+PGTLAR LI+    +   ++++KR++LEG
Sbjct: 342 FIQWCQDPKNSIILTYRTTPGTLARFLIDNPSEKITEIELRKRVKLEG 389



 Score = 69.7 bits (169), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 41/116 (35%), Positives = 57/116 (49%), Gaps = 40/116 (34%)

Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
           ++P H + F+NE +LSDFKQ L R GI  EF+ GVL+ C   VAVRR             
Sbjct: 707 EVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNQVAVRR------------- 752

Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                     +++  I LEGCL  ++YR++ LLY+QYAI+
Sbjct: 753 --------------------------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782



 Score = 62.4 bits (150), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 25/36 (69%), Positives = 29/36 (80%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
           +VVL S PD+ECGFSRDLF QWC  P+NSII+T R 
Sbjct: 324 KVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 359


>gi|417404575|gb|JAA49034.1| Putative mrna cleavage and polyadenylation factor ii complex
           subunit cft2 cpsf subunit [Desmodus rotundus]
          Length = 782

 Score =  158 bits (400), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 67/108 (62%), Positives = 88/108 (81%)

Query: 32  AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
           +++EWMSDKLM+ FE  RNNPF F+H+ LCH L++LA+VPSPKVVL S PD+ECGFSRDL
Sbjct: 282 SQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDL 341

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEG 139
           F QWC  P+NSII+T RT+PGTLAR LI+    +   ++++KR++LEG
Sbjct: 342 FIQWCEDPKNSIILTYRTTPGTLARFLIDNPSEKITEIELRKRVKLEG 389



 Score = 69.7 bits (169), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 41/116 (35%), Positives = 57/116 (49%), Gaps = 40/116 (34%)

Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
           ++P H + F+NE +LSDFKQ L R GI  EF+ GVL+ C   VAVRR             
Sbjct: 707 EVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNQVAVRR------------- 752

Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                     +++  I LEGCL  ++YR++ LLY+QYAI+
Sbjct: 753 --------------------------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782



 Score = 62.4 bits (150), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 25/36 (69%), Positives = 29/36 (80%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
           +VVL S PD+ECGFSRDLF QWC  P+NSII+T R 
Sbjct: 324 KVVLASQPDLECGFSRDLFIQWCEDPKNSIILTYRT 359


>gi|348553776|ref|XP_003462702.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           2-like [Cavia porcellus]
          Length = 782

 Score =  158 bits (400), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 67/108 (62%), Positives = 88/108 (81%)

Query: 32  AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
           +++EWMSDKLM+ FE  RNNPF F+H+ LCH L++LA+VPSPKVVL S PD+ECGFSRDL
Sbjct: 282 SQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDL 341

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEG 139
           F QWC  P+NSII+T RT+PGTLAR LI+    +   ++++KR++LEG
Sbjct: 342 FIQWCQDPKNSIILTYRTTPGTLARFLIDNPSEKITEIELRKRVKLEG 389



 Score = 69.7 bits (169), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 41/116 (35%), Positives = 57/116 (49%), Gaps = 40/116 (34%)

Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
           ++P H + F+NE +LSDFKQ L R GI  EF+ GVL+ C   VAVRR             
Sbjct: 707 EVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNQVAVRR------------- 752

Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                     +++  I LEGCL  ++YR++ LLY+QYAI+
Sbjct: 753 --------------------------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782



 Score = 62.4 bits (150), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 25/36 (69%), Positives = 29/36 (80%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
           +VVL S PD+ECGFSRDLF QWC  P+NSII+T R 
Sbjct: 324 KVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 359


>gi|383872268|ref|NP_001244509.1| cleavage and polyadenylation specificity factor subunit 2 [Macaca
           mulatta]
 gi|402876992|ref|XP_003902228.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           2 [Papio anubis]
 gi|355693514|gb|EHH28117.1| hypothetical protein EGK_18472 [Macaca mulatta]
 gi|355778801|gb|EHH63837.1| hypothetical protein EGM_16889 [Macaca fascicularis]
 gi|380783537|gb|AFE63644.1| cleavage and polyadenylation specificity factor subunit 2 [Macaca
           mulatta]
 gi|383412079|gb|AFH29253.1| cleavage and polyadenylation specificity factor subunit 2 [Macaca
           mulatta]
 gi|384942144|gb|AFI34677.1| cleavage and polyadenylation specificity factor subunit 2 [Macaca
           mulatta]
          Length = 782

 Score =  158 bits (400), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 67/108 (62%), Positives = 88/108 (81%)

Query: 32  AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
           +++EWMSDKLM+ FE  RNNPF F+H+ LCH L++LA+VPSPKVVL S PD+ECGFSRDL
Sbjct: 282 SQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDL 341

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEG 139
           F QWC  P+NSII+T RT+PGTLAR LI+    +   ++++KR++LEG
Sbjct: 342 FIQWCQDPKNSIILTYRTTPGTLARFLIDNPSEKITEIELRKRVKLEG 389



 Score = 69.7 bits (169), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 41/116 (35%), Positives = 57/116 (49%), Gaps = 40/116 (34%)

Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
           ++P H + F+NE +LSDFKQ L R GI  EF+ GVL+ C   VAVRR             
Sbjct: 707 EVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNQVAVRR------------- 752

Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                     +++  I LEGCL  ++YR++ LLY+QYAI+
Sbjct: 753 --------------------------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782



 Score = 62.4 bits (150), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 25/36 (69%), Positives = 29/36 (80%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
           +VVL S PD+ECGFSRDLF QWC  P+NSII+T R 
Sbjct: 324 KVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 359


>gi|456753050|gb|JAA74086.1| cleavage and polyadenylation specific factor 2, 100kDa [Sus scrofa]
          Length = 782

 Score =  158 bits (400), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 67/108 (62%), Positives = 88/108 (81%)

Query: 32  AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
           +++EWMSDKLM+ FE  RNNPF F+H+ LCH L++LA+VPSPKVVL S PD+ECGFSRDL
Sbjct: 282 SQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDL 341

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEG 139
           F QWC  P+NSII+T RT+PGTLAR LI+    +   ++++KR++LEG
Sbjct: 342 FIQWCQDPKNSIILTYRTTPGTLARFLIDNPSEKITEIELRKRVKLEG 389



 Score = 69.7 bits (169), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 41/116 (35%), Positives = 57/116 (49%), Gaps = 40/116 (34%)

Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
           ++P H + F+NE +LSDFKQ L R GI  EF+ GVL+ C   VAVRR             
Sbjct: 707 EVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNQVAVRR------------- 752

Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                     +++  I LEGCL  ++YR++ LLY+QYAI+
Sbjct: 753 --------------------------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782



 Score = 62.4 bits (150), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 25/36 (69%), Positives = 29/36 (80%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
           +VVL S PD+ECGFSRDLF QWC  P+NSII+T R 
Sbjct: 324 KVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 359


>gi|332223568|ref|XP_003260944.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           2 isoform 1 [Nomascus leucogenys]
          Length = 782

 Score =  158 bits (400), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 67/108 (62%), Positives = 88/108 (81%)

Query: 32  AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
           +++EWMSDKLM+ FE  RNNPF F+H+ LCH L++LA+VPSPKVVL S PD+ECGFSRDL
Sbjct: 282 SQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDL 341

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEG 139
           F QWC  P+NSII+T RT+PGTLAR LI+    +   ++++KR++LEG
Sbjct: 342 FIQWCQDPKNSIILTYRTTPGTLARFLIDNPSEKITEIELRKRVKLEG 389



 Score = 69.3 bits (168), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 41/116 (35%), Positives = 57/116 (49%), Gaps = 40/116 (34%)

Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
           ++P H + F+NE +LSDFKQ L R GI  EF+ GVL+ C   VAVRR             
Sbjct: 707 EVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNQVAVRR------------- 752

Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                     +++  I LEGCL  ++YR++ LLY+QYAI+
Sbjct: 753 --------------------------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782



 Score = 62.4 bits (150), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 25/36 (69%), Positives = 29/36 (80%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
           +VVL S PD+ECGFSRDLF QWC  P+NSII+T R 
Sbjct: 324 KVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 359


>gi|149737455|ref|XP_001497134.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           2-like isoform 1 [Equus caballus]
          Length = 782

 Score =  158 bits (400), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 67/108 (62%), Positives = 88/108 (81%)

Query: 32  AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
           +++EWMSDKLM+ FE  RNNPF F+H+ LCH L++LA+VPSPKVVL S PD+ECGFSRDL
Sbjct: 282 SQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDL 341

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEG 139
           F QWC  P+NSII+T RT+PGTLAR LI+    +   ++++KR++LEG
Sbjct: 342 FIQWCQDPKNSIILTYRTTPGTLARFLIDNPSEKITEIELRKRVKLEG 389



 Score = 69.3 bits (168), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 41/116 (35%), Positives = 57/116 (49%), Gaps = 40/116 (34%)

Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
           ++P H + F+NE +LSDFKQ L R GI  EF+ GVL+ C   VAVRR             
Sbjct: 707 EVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNQVAVRR------------- 752

Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                     +++  I LEGCL  ++YR++ LLY+QYAI+
Sbjct: 753 --------------------------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782



 Score = 62.4 bits (150), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 25/36 (69%), Positives = 29/36 (80%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
           +VVL S PD+ECGFSRDLF QWC  P+NSII+T R 
Sbjct: 324 KVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 359


>gi|149531954|ref|XP_001507374.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           2-like [Ornithorhynchus anatinus]
          Length = 782

 Score =  158 bits (400), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 67/108 (62%), Positives = 88/108 (81%)

Query: 32  AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
           +++EWMSDKLM+ FE  RNNPF F+H+ LCH L++LA+VPSPKVVL S PD+ECGFSRDL
Sbjct: 282 SQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDL 341

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEG 139
           F QWC  P+NSII+T RT+PGTLAR LI+    +   ++++KR++LEG
Sbjct: 342 FIQWCQDPKNSIILTYRTTPGTLARFLIDNPSEKITEIELRKRVKLEG 389



 Score = 69.3 bits (168), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 41/116 (35%), Positives = 57/116 (49%), Gaps = 40/116 (34%)

Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
           ++P H + F+NE +LSDFKQ L R GI  EF+ GVL+ C   VAVRR             
Sbjct: 707 EVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNLVAVRR------------- 752

Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                     +++  I LEGCL  ++YR++ LLY+QYAI+
Sbjct: 753 --------------------------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782



 Score = 62.4 bits (150), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 26/40 (65%), Positives = 30/40 (75%)

Query: 203 VAVRRVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
           V   +VVL S PD+ECGFSRDLF QWC  P+NSII+T R 
Sbjct: 320 VPSPKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 359


>gi|355680846|gb|AER96660.1| cleavage and polyadenylation specific factor 2, 100kDa [Mustela
           putorius furo]
          Length = 569

 Score =  158 bits (399), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 67/108 (62%), Positives = 88/108 (81%)

Query: 32  AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
           +++EWMSDKLM+ FE  RNNPF F+H+ LCH L++LA+VPSPKVVL S PD+ECGFSRDL
Sbjct: 282 SQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDL 341

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEG 139
           F QWC  P+NSII+T RT+PGTLAR LI+    +   ++++KR++LEG
Sbjct: 342 FIQWCQDPKNSIILTYRTTPGTLARFLIDNPSEKITEIELRKRVKLEG 389



 Score = 62.4 bits (150), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 26/40 (65%), Positives = 30/40 (75%)

Query: 203 VAVRRVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
           V   +VVL S PD+ECGFSRDLF QWC  P+NSII+T R 
Sbjct: 320 VPSPKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 359


>gi|224051637|ref|XP_002200593.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           2 [Taeniopygia guttata]
          Length = 782

 Score =  158 bits (399), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 67/108 (62%), Positives = 89/108 (82%)

Query: 32  AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
           +++EWMSDKLM+ FE  RNNPF F+H+ LCHSL++LA+VPSPKVVL S PD+ECGFSRDL
Sbjct: 282 SQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHSLSDLARVPSPKVVLASQPDLECGFSRDL 341

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEG 139
           F QWC   +NSII+T RT+PGTLAR LI+    + + ++++KR++LEG
Sbjct: 342 FIQWCQDSKNSIILTYRTTPGTLARFLIDNPSEKVIDIELRKRVKLEG 389



 Score = 68.2 bits (165), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 43/122 (35%), Positives = 59/122 (48%), Gaps = 45/122 (36%)

Query: 160 EKIPPHD-----TSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTP 214
           E +PPH+     + F+NE +LSDFKQ L R GI  EF+ GVL+ C   VAVRR       
Sbjct: 701 EPMPPHEVLGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNLVAVRR------- 752

Query: 215 DMECGFSRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYA 274
                                           +++  I LEGCL  ++YR++ LLY+QYA
Sbjct: 753 --------------------------------TETGRIGLEGCLCQDFYRIRDLLYEQYA 780

Query: 275 IL 276
           I+
Sbjct: 781 IV 782



 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 24/36 (66%), Positives = 28/36 (77%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
           +VVL S PD+ECGFSRDLF QWC   +NSII+T R 
Sbjct: 324 KVVLASQPDLECGFSRDLFIQWCQDSKNSIILTYRT 359


>gi|187608214|ref|NP_001120452.1| cleavage and polyadenylation specific factor 2, 100kDa [Xenopus
           (Silurana) tropicalis]
 gi|170285004|gb|AAI61233.1| LOC100145546 protein [Xenopus (Silurana) tropicalis]
          Length = 783

 Score =  157 bits (397), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 65/108 (60%), Positives = 88/108 (81%)

Query: 32  AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
           +++EWMSDKLM+ FE  RNNPF F+H+ LCH  ++LA+VPSPKVVL S PD+ECGFSR+L
Sbjct: 282 SQVEWMSDKLMRCFEDKRNNPFQFRHLTLCHGFSDLARVPSPKVVLASQPDLECGFSREL 341

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEG 139
           F QWC  P+NS+I+T RT+PGTLAR LI+    R + ++++KR++LEG
Sbjct: 342 FIQWCQDPKNSVILTYRTTPGTLARFLIDHPSERIIDIELRKRVKLEG 389



 Score = 68.2 bits (165), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 39/116 (33%), Positives = 59/116 (50%), Gaps = 40/116 (34%)

Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
           ++P H + F+NE +LSDFKQ L R GI  EF+ GVL+ C   VAVRR             
Sbjct: 708 EVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNMVAVRR------------- 753

Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                     +++  I LEGCL ++++++++LLY+QYAI+
Sbjct: 754 --------------------------TETGRIGLEGCLCEDFFKIRELLYEQYAIV 783



 Score = 60.1 bits (144), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 27/52 (51%), Positives = 33/52 (63%), Gaps = 5/52 (9%)

Query: 196 LICCRG-----TVAVRRVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
           L  C G      V   +VVL S PD+ECGFSR+LF QWC  P+NS+I+T R 
Sbjct: 308 LTLCHGFSDLARVPSPKVVLASQPDLECGFSRELFIQWCQDPKNSVILTYRT 359


>gi|326920924|ref|XP_003206716.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           2-like [Meleagris gallopavo]
          Length = 782

 Score =  157 bits (396), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 66/108 (61%), Positives = 89/108 (82%)

Query: 32  AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
           +++EWMSDKLM+ FE  RNNPF F+H+ LCHSL++LA+VPSPKVVL S PD+ECGFSRDL
Sbjct: 282 SQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHSLSDLARVPSPKVVLASQPDLECGFSRDL 341

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEG 139
           F QWC   +NSII+T RT+PGTLAR LI+    + + +++++R++LEG
Sbjct: 342 FIQWCQDSKNSIILTYRTTPGTLARFLIDNPSEKVIDIELRRRVKLEG 389



 Score = 69.3 bits (168), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 41/116 (35%), Positives = 57/116 (49%), Gaps = 40/116 (34%)

Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
           ++P H + F+NE +LSDFKQ L R GI  EF+ GVL+ C   VAVRR             
Sbjct: 707 EVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNMVAVRR------------- 752

Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                     +++  I LEGCL  ++YR+++LLY QYAI+
Sbjct: 753 --------------------------TETGRIGLEGCLCQDFYRIRELLYKQYAIV 782



 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 24/36 (66%), Positives = 28/36 (77%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
           +VVL S PD+ECGFSRDLF QWC   +NSII+T R 
Sbjct: 324 KVVLASQPDLECGFSRDLFIQWCQDSKNSIILTYRT 359


>gi|449280731|gb|EMC87967.1| Cleavage and polyadenylation specificity factor subunit 2 [Columba
           livia]
          Length = 782

 Score =  156 bits (394), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 65/108 (60%), Positives = 89/108 (82%)

Query: 32  AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
           +++EWMSDKL++ FE  RNNPF F+H+ LCHSL++LA+VPSPKVVL S PD+ECGFSRDL
Sbjct: 282 SQVEWMSDKLLRCFEDKRNNPFQFRHLSLCHSLSDLARVPSPKVVLASQPDLECGFSRDL 341

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEG 139
           F QWC   +NS+I+T RT+PGTLAR LI+    + + ++++KR++LEG
Sbjct: 342 FIQWCQDSKNSVILTYRTTPGTLARFLIDNPSEKVIDIELRKRVKLEG 389



 Score = 67.4 bits (163), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 43/122 (35%), Positives = 58/122 (47%), Gaps = 45/122 (36%)

Query: 160 EKIPPHD-----TSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTP 214
           E +PPH+     + F+NE +LSDFKQ L R GI  EF+ GVL+ C   VAVRR       
Sbjct: 701 EPLPPHEVIGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNLVAVRR------- 752

Query: 215 DMECGFSRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYA 274
                                           +++  I LEGCL  ++YR++ LLY QYA
Sbjct: 753 --------------------------------TETGRIGLEGCLCQDFYRIRDLLYKQYA 780

Query: 275 IL 276
           I+
Sbjct: 781 IV 782



 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 23/36 (63%), Positives = 28/36 (77%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
           +VVL S PD+ECGFSRDLF QWC   +NS+I+T R 
Sbjct: 324 KVVLASQPDLECGFSRDLFIQWCQDSKNSVILTYRT 359


>gi|47125306|gb|AAH70095.1| Cleavage and polyadenylation specific factor 2, 100kDa [Homo
           sapiens]
          Length = 782

 Score =  155 bits (393), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 66/108 (61%), Positives = 87/108 (80%)

Query: 32  AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
           +++EWMS KLM+ FE  RNNPF F+H+ LCH L++LA+VPSPKVVL S PD+ECGFSRDL
Sbjct: 282 SQVEWMSGKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDL 341

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEG 139
           F QWC  P+NSII+T RT+PGTLAR LI+    +   ++++KR++LEG
Sbjct: 342 FIQWCQDPKNSIILTYRTTPGTLARFLIDNPSEKITEIELRKRVKLEG 389



 Score = 69.7 bits (169), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 41/116 (35%), Positives = 57/116 (49%), Gaps = 40/116 (34%)

Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
           ++P H + F+NE +LSDFKQ L R GI  EF+ GVL+ C   VAVRR             
Sbjct: 707 EVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNQVAVRR------------- 752

Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                     +++  I LEGCL  ++YR++ LLY+QYAI+
Sbjct: 753 --------------------------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782



 Score = 62.4 bits (150), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 25/36 (69%), Positives = 29/36 (80%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
           +VVL S PD+ECGFSRDLF QWC  P+NSII+T R 
Sbjct: 324 KVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 359


>gi|449662070|ref|XP_004205466.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           2-like, partial [Hydra magnipapillata]
          Length = 568

 Score =  155 bits (392), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 75/136 (55%), Positives = 101/136 (74%), Gaps = 5/136 (3%)

Query: 24  LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTP 81
           + +++ E AK  +EWMSD++MKSFE  R NPF FKH+ LCH L EL ++PSPKVVL S  
Sbjct: 90  VSYNVVEFAKSQVEWMSDRMMKSFEVDRRNPFAFKHITLCHFLKELDQLPSPKVVLASAA 149

Query: 82  DMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEE 141
           DM CGFS+DLF QW S+P+NS+I T +TSPG+LAR LI+     ++ L+V KR+RLEG E
Sbjct: 150 DMNCGFSKDLFVQWASNPKNSVIFTFKTSPGSLARTLIDNPKIESVELEVFKRVRLEGVE 209

Query: 142 LEEY---QKKKDKEAK 154
           L +Y   +K+K ++AK
Sbjct: 210 LSQYLEVEKEKARQAK 225



 Score = 72.8 bits (177), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 54/166 (32%), Positives = 76/166 (45%), Gaps = 42/166 (25%)

Query: 113 TLARD--LIELGGNRTLTLQVKKRIRLEGEELEEYQKKKDKEAKDKQEKEKIPPHDTSFI 170
            +ARD  L  + G   +  + +K  ++E E  E+ +K+      ++   E IP H T FI
Sbjct: 443 AVARDTELAWVDGQLVMEARGEKFNQIEQENSEKVEKQDVVPVLEQLPPEMIPGHATVFI 502

Query: 171 NELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGFSRDLFFQWCS 230
           +E +LSDFKQ L + GI  EF  GVL+ C   VAVRR                       
Sbjct: 503 DEPRLSDFKQVLTKAGIQAEFTGGVLV-CNNVVAVRR----------------------- 538

Query: 231 SPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                            +   I +EG L +EYY ++QLLYDQYAI+
Sbjct: 539 ----------------GEQGKISIEGGLCEEYYVIRQLLYDQYAIV 568



 Score = 53.5 bits (127), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 21/33 (63%), Positives = 26/33 (78%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIIT 239
           +VVL S  DM CGFS+DLF QW S+P+NS+I T
Sbjct: 142 KVVLASAADMNCGFSKDLFVQWASNPKNSVIFT 174


>gi|395503674|ref|XP_003756188.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           2 [Sarcophilus harrisii]
          Length = 782

 Score =  155 bits (391), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 66/108 (61%), Positives = 87/108 (80%)

Query: 32  AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
           +++EWMSDKLM+ FE  RNNPF F+H+ LCH L++LA+VPSPKVVL S PD+ECGFSRDL
Sbjct: 282 SQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDL 341

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEG 139
           F QWC   +NSII+T RT+PGTLAR LI+    +   ++++KR++LEG
Sbjct: 342 FIQWCQDSKNSIILTYRTTPGTLARFLIDNPSEKITEIELRKRVKLEG 389



 Score = 69.3 bits (168), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 41/116 (35%), Positives = 57/116 (49%), Gaps = 40/116 (34%)

Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
           ++P H + F+NE +LSDFKQ L R GI  EF+ GVL+ C   VAVRR             
Sbjct: 707 EVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNLVAVRR------------- 752

Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                     +++  I LEGCL  ++YR++ LLY+QYAI+
Sbjct: 753 --------------------------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782



 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 24/36 (66%), Positives = 28/36 (77%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
           +VVL S PD+ECGFSRDLF QWC   +NSII+T R 
Sbjct: 324 KVVLASQPDLECGFSRDLFIQWCQDSKNSIILTYRT 359


>gi|126282067|ref|XP_001365312.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           2 isoform 1 [Monodelphis domestica]
          Length = 782

 Score =  155 bits (391), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 66/108 (61%), Positives = 87/108 (80%)

Query: 32  AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
           +++EWMSDKLM+ FE  RNNPF F+H+ LCH L++LA+VPSPKVVL S PD+ECGFSRDL
Sbjct: 282 SQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDL 341

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEG 139
           F QWC   +NSII+T RT+PGTLAR LI+    +   ++++KR++LEG
Sbjct: 342 FIQWCQDSKNSIILTYRTTPGTLARFLIDNPSEKITEIELRKRVKLEG 389



 Score = 69.3 bits (168), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 41/116 (35%), Positives = 57/116 (49%), Gaps = 40/116 (34%)

Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
           ++P H + F+NE +LSDFKQ L R GI  EF+ GVL+ C   VAVRR             
Sbjct: 707 EVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNLVAVRR------------- 752

Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                     +++  I LEGCL  ++YR++ LLY+QYAI+
Sbjct: 753 --------------------------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782



 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 24/36 (66%), Positives = 28/36 (77%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
           +VVL S PD+ECGFSRDLF QWC   +NSII+T R 
Sbjct: 324 KVVLASQPDLECGFSRDLFIQWCQDSKNSIILTYRT 359


>gi|340370496|ref|XP_003383782.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           2-like [Amphimedon queenslandica]
          Length = 730

 Score =  154 bits (390), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 75/142 (52%), Positives = 103/142 (72%), Gaps = 5/142 (3%)

Query: 24  LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTP 81
           + +++ E AK  +EWMS+KLM++FE +R NPF F+H+ LCH+L ELAKV +PK VLVS P
Sbjct: 272 VSYNVVEFAKSQVEWMSEKLMRTFEDSRTNPFQFQHINLCHNLEELAKVSNPKAVLVSPP 331

Query: 82  DMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEE 141
           D+ECGFSRDLF  W ++P NSII T++T+  TLAR L++     T+ + VK+R+ LEG E
Sbjct: 332 DLECGFSRDLFLHWSNNPHNSIIFTSKTAHNTLARTLVDNLKIITIDMDVKRRVPLEGAE 391

Query: 142 LEEY---QKKKDKEAKDKQEKE 160
           LEEY   +K+K K A D   K+
Sbjct: 392 LEEYLMKEKEKAKTANDDDAKD 413



 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 36/117 (30%), Positives = 54/117 (46%), Gaps = 40/117 (34%)

Query: 160 EKIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECG 219
           +++  H   FIN  +LSDFK TL + GI  EF  G LI C G VA++R            
Sbjct: 654 KQVADHKNVFINPPRLSDFKNTLTKAGIQAEFCGGALI-CNGVVAIKR------------ 700

Query: 220 FSRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                      ++   I +EG +SD+YY +++LLY+Q+AI+
Sbjct: 701 ---------------------------TEGGKISIEGSVSDDYYLIRKLLYEQFAIV 730



 Score = 56.2 bits (134), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 23/40 (57%), Positives = 30/40 (75%)

Query: 203 VAVRRVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
           V+  + VLVS PD+ECGFSRDLF  W ++P NSII T++ 
Sbjct: 320 VSNPKAVLVSPPDLECGFSRDLFLHWSNNPHNSIIFTSKT 359


>gi|156399337|ref|XP_001638458.1| predicted protein [Nematostella vectensis]
 gi|156225579|gb|EDO46395.1| predicted protein [Nematostella vectensis]
          Length = 737

 Score =  154 bits (390), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 72/134 (53%), Positives = 100/134 (74%), Gaps = 2/134 (1%)

Query: 24  LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTP 81
           + +++ E AK  +EWMSDK+MK+FE  RNNP+ F++  LCHSLA+LA+VP PKVVL S  
Sbjct: 272 VSYNVIEFAKSQVEWMSDKIMKAFEIGRNNPYQFRYCHLCHSLADLARVPEPKVVLASMM 331

Query: 82  DMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEE 141
           D+  GFSRDLF +W  +P+N++I T R+SPGTLAR LI+    + + L+VK+R+RL GEE
Sbjct: 332 DLTAGFSRDLFVEWADNPKNTVIFTARSSPGTLARTLIDNLELKQVELEVKQRVRLGGEE 391

Query: 142 LEEYQKKKDKEAKD 155
           LE Y ++  K+ KD
Sbjct: 392 LERYLEENKKKEKD 405



 Score = 61.2 bits (147), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 38/116 (32%), Positives = 53/116 (45%), Gaps = 40/116 (34%)

Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
           KI  H + FINE +LSDFKQ L + GI  EF  GVLIC                      
Sbjct: 662 KIAGHVSVFINEPRLSDFKQVLNKAGIQAEFAGGVLIC---------------------- 699

Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                        N+++   RN     ++  + LEG + ++YY ++ LLY QYAI+
Sbjct: 700 -------------NNVVCVRRN-----ETGRVGLEGTVCEDYYTIRDLLYSQYAIV 737



 Score = 46.6 bits (109), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 18/36 (50%), Positives = 26/36 (72%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
           +VVL S  D+  GFSRDLF +W  +P+N++I T R+
Sbjct: 324 KVVLASMMDLTAGFSRDLFVEWADNPKNTVIFTARS 359


>gi|71894931|ref|NP_001026379.1| cleavage and polyadenylation specificity factor subunit 2 [Gallus
           gallus]
 gi|60098929|emb|CAH65295.1| hypothetical protein RCJMB04_15m16 [Gallus gallus]
          Length = 782

 Score =  154 bits (390), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 65/108 (60%), Positives = 88/108 (81%)

Query: 32  AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
           +++EWMSDKLM+ FE  RNNPF F+H+ LCHSL++LA+VP PKVVL S PD+ECGFSRDL
Sbjct: 282 SQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHSLSDLARVPCPKVVLASQPDLECGFSRDL 341

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEG 139
           F QWC   +NSII+T RT+PGTLAR LI+    + + +++++R++LEG
Sbjct: 342 FIQWCQDSKNSIILTYRTTPGTLARFLIDNPSEKVIDIELRRRVKLEG 389



 Score = 69.3 bits (168), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 41/116 (35%), Positives = 57/116 (49%), Gaps = 40/116 (34%)

Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
           ++P H + F+NE +LSDFKQ L R GI  EF+ GVL+ C   VAVRR             
Sbjct: 707 EVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNMVAVRR------------- 752

Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                     +++  I LEGCL  ++YR+++LLY QYAI+
Sbjct: 753 --------------------------TETGRIGLEGCLCQDFYRIRELLYKQYAIV 782



 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 25/40 (62%), Positives = 29/40 (72%)

Query: 203 VAVRRVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
           V   +VVL S PD+ECGFSRDLF QWC   +NSII+T R 
Sbjct: 320 VPCPKVVLASQPDLECGFSRDLFIQWCQDSKNSIILTYRT 359


>gi|321462132|gb|EFX73157.1| hypothetical protein DAPPUDRAFT_58164 [Daphnia pulex]
          Length = 735

 Score =  152 bits (385), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 70/128 (54%), Positives = 100/128 (78%), Gaps = 2/128 (1%)

Query: 24  LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTP 81
           + +++ E AK  IEWMSDKLMKSFEGARNNPF FK+++LCH+L E+ ++   KVVL S P
Sbjct: 272 VAYNVNEFAKSQIEWMSDKLMKSFEGARNNPFGFKYLQLCHTLPEVLRIAGSKVVLSSCP 331

Query: 82  DMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEE 141
           D+ECGF+RDLF  WCS   NSII+T+R+  GTL + L +    +++TL++K+R++LEG E
Sbjct: 332 DLECGFARDLFALWCSDARNSIILTSRSGQGTLGQRLHDQRNLKSVTLELKQRVKLEGAE 391

Query: 142 LEEYQKKK 149
           LEE+++K+
Sbjct: 392 LEEFRRKE 399



 Score = 96.7 bits (239), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 51/119 (42%), Positives = 66/119 (55%), Gaps = 39/119 (32%)

Query: 158 EKEKIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDME 217
           E + IP H+TS+INEL+LSDFKQ L RNGI  EF+ GVL CC G VA+RR          
Sbjct: 656 EPKDIPGHETSYINELKLSDFKQVLVRNGISSEFIGGVLWCCNGNVALRR---------- 705

Query: 218 CGFSRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                        ++S  + LEGC+SD+YYRV++LLY+QYAI+
Sbjct: 706 -----------------------------NESGRVTLEGCISDDYYRVRELLYEQYAII 735



 Score = 57.4 bits (137), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 24/40 (60%), Positives = 31/40 (77%)

Query: 203 VAVRRVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
           +A  +VVL S PD+ECGF+RDLF  WCS   NSII+T+R+
Sbjct: 320 IAGSKVVLSSCPDLECGFARDLFALWCSDARNSIILTSRS 359


>gi|198428144|ref|XP_002129804.1| PREDICTED: similar to cleavage and polyadenylation specific factor
           2 [Ciona intestinalis]
          Length = 784

 Score =  152 bits (384), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 66/128 (51%), Positives = 97/128 (75%), Gaps = 2/128 (1%)

Query: 24  LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTP 81
           + +++ E AK  +EWMSDK++ SF   RNNPFHFKH+KLCH+L +LA+VP PK VL ST 
Sbjct: 280 VTYNVVEFAKFMVEWMSDKIINSFTDQRNNPFHFKHLKLCHNLGDLAQVPQPKCVLASTA 339

Query: 82  DMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEE 141
           DMECGF+R LF +W + P N++IIT+R++ GTL+R L++      L L++KKR+ + GEE
Sbjct: 340 DMECGFARQLFIRWAADPRNTVIITSRSTKGTLSRTLVDDPTVSRLKLEMKKRVPIIGEE 399

Query: 142 LEEYQKKK 149
           L++Y++ +
Sbjct: 400 LDQYERNR 407



 Score = 67.0 bits (162), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 40/117 (34%), Positives = 54/117 (46%), Gaps = 40/117 (34%)

Query: 160 EKIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECG 219
           E  P H T FINEL+LSDFKQ L + G   EF+ GVL+C                     
Sbjct: 708 EDTPGHQTCFINELRLSDFKQVLTKEGYQAEFIGGVLVC--------------------- 746

Query: 220 FSRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                         N+++   RN+    D     LEG L++EYY ++ LLY QYA++
Sbjct: 747 --------------NNMLAIRRNQQGHID-----LEGTLTEEYYAIRDLLYQQYAVV 784



 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 20/36 (55%), Positives = 28/36 (77%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
           + VL ST DMECGF+R LF +W + P N++IIT+R+
Sbjct: 332 KCVLASTADMECGFARQLFIRWAADPRNTVIITSRS 367


>gi|119601887|gb|EAW81481.1| cleavage and polyadenylation specific factor 2, 100kDa, isoform
           CRA_b [Homo sapiens]
          Length = 496

 Score =  150 bits (379), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 65/103 (63%), Positives = 83/103 (80%)

Query: 37  MSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWC 96
           MSDKLM+ FE  RNNPF F+H+ LCH L++LA+VPSPKVVL S PD+ECGFSRDLF QWC
Sbjct: 1   MSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDLFIQWC 60

Query: 97  SSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEG 139
             P+NSII+T RT+PGTLAR LI+    +   ++++KR++LEG
Sbjct: 61  QDPKNSIILTYRTTPGTLARFLIDNPSEKITEIELRKRVKLEG 103



 Score = 69.7 bits (169), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 41/116 (35%), Positives = 57/116 (49%), Gaps = 40/116 (34%)

Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
           ++P H + F+NE +LSDFKQ L R GI  EF+ GVL+ C   VAVRR             
Sbjct: 421 EVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNQVAVRR------------- 466

Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                     +++  I LEGCL  ++YR++ LLY+QYAI+
Sbjct: 467 --------------------------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 496



 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 26/40 (65%), Positives = 30/40 (75%)

Query: 203 VAVRRVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
           V   +VVL S PD+ECGFSRDLF QWC  P+NSII+T R 
Sbjct: 34  VPSPKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 73


>gi|209420822|gb|ACI46951.1| cyclin B [Fenneropenaeus penicillatus]
          Length = 475

 Score =  150 bits (379), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 66/124 (53%), Positives = 95/124 (76%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           IEWMS+KL K+F+  R NPF FKH+K CH+L +L+++PSPKVVL S PD+ CG++R+LF 
Sbjct: 2   IEWMSEKLTKAFDSLRTNPFSFKHLKFCHNLTDLSRLPSPKVVLASFPDLGCGYARELFV 61

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKDKEA 153
           QW ++P+N+II+T+RT P TLAR LI+    RT  L  K+R++LEG EL+E+ + K +E 
Sbjct: 62  QWATNPKNTIILTSRTGPDTLARRLIDNPQIRTFKLLEKRRMKLEGSELDEHYRMKREEE 121

Query: 154 KDKQ 157
           + +Q
Sbjct: 122 QQQQ 125



 Score = 57.0 bits (136), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 28/77 (36%), Positives = 44/77 (57%), Gaps = 16/77 (20%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRNRADDSDSNVI-------------- 252
           +VVL S PD+ CG++R+LF QW ++P+N+II+T+R   D     +I              
Sbjct: 42  KVVLASFPDLGCGYARELFVQWATNPKNTIILTSRTGPDTLARRLIDNPQIRTFKLLEKR 101

Query: 253 --VLEGCLSDEYYRVQQ 267
              LEG   DE+YR+++
Sbjct: 102 RMKLEGSELDEHYRMKR 118


>gi|324503279|gb|ADY41427.1| Cleavage and polyadenylation specificity factor subunit 2 [Ascaris
           suum]
          Length = 841

 Score =  148 bits (373), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 75/155 (48%), Positives = 101/155 (65%), Gaps = 14/155 (9%)

Query: 19  FKHVKLCH---SLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSP 73
           +  V L H   S+ E AK  +EWMSDK++KSFE  R NPF F+HV+LCH+  +L ++ SP
Sbjct: 263 YNLVMLSHVASSVVEFAKSQVEWMSDKILKSFEVGRYNPFQFRHVQLCHTHMDLLRIRSP 322

Query: 74  KVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGG---------N 124
           KVVLVS  DMECGFSR++F +WC+   N++I+T R+   TL   LI +           N
Sbjct: 323 KVVLVSGLDMECGFSREIFLEWCADVRNTVIVTGRSGDRTLGARLIRMAEQMAENPSTVN 382

Query: 125 RTLTLQVKKRIRLEGEELEEYQKKKDKEAKDKQEK 159
           R LTL+VK+RIRLEG ELE Y+ KK  + ++   K
Sbjct: 383 RNLTLEVKRRIRLEGVELENYRAKKRADEREAARK 417



 Score = 53.5 bits (127), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 20/36 (55%), Positives = 29/36 (80%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
           +VVLVS  DMECGFSR++F +WC+   N++I+T R+
Sbjct: 323 KVVLVSGLDMECGFSREIFLEWCADVRNTVIVTGRS 358



 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 32/115 (27%), Positives = 47/115 (40%), Gaps = 40/115 (34%)

Query: 162 IPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGFS 221
           IP H   F+N+ +LSD KQ L  NG   EF  GVL                         
Sbjct: 767 IPSHQAVFVNDPKLSDMKQLLMANGFHAEFSSGVLYI----------------------- 803

Query: 222 RDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                       N++    RN A         +EGC S++YY+++ ++Y Q+AI+
Sbjct: 804 ------------NNVASIRRNEAGR-----FHVEGCASEDYYKIRDIVYAQFAIV 841


>gi|312084310|ref|XP_003144223.1| cleavage and polyadenylation specificity factor subunit 2 [Loa loa]
          Length = 837

 Score =  147 bits (371), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 77/152 (50%), Positives = 100/152 (65%), Gaps = 15/152 (9%)

Query: 19  FKHVKLCH---SLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSP 73
           +  V L H   S+ E AK  +EWMSDK++KSFE  R NPF F+HV+LCH+  +L +V SP
Sbjct: 263 YNLVMLSHVASSVVEFAKSQVEWMSDKVLKSFEVGRYNPFQFRHVQLCHTHIDLLRVRSP 322

Query: 74  KVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGG---------- 123
           KVVLVS  DME GFSR+LF +WC+  +NS+I+T R+   TL   LI +            
Sbjct: 323 KVVLVSGLDMESGFSRELFLEWCTDIKNSVIVTGRSGDRTLGARLIRMAEQAAENPNGTI 382

Query: 124 NRTLTLQVKKRIRLEGEELEEYQKKKDKEAKD 155
           NR LTL+VK+RIRLEG ELE Y+ KK  E ++
Sbjct: 383 NRNLTLEVKRRIRLEGAELENYRAKKRAEERE 414



 Score = 52.0 bits (123), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 21/36 (58%), Positives = 29/36 (80%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
           +VVLVS  DME GFSR+LF +WC+  +NS+I+T R+
Sbjct: 323 KVVLVSGLDMESGFSRELFLEWCTDIKNSVIVTGRS 358



 Score = 51.6 bits (122), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 32/115 (27%), Positives = 48/115 (41%), Gaps = 40/115 (34%)

Query: 162 IPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGFS 221
           IPPH T F+N+ +LSD KQ L  NG   EF  G+L                         
Sbjct: 763 IPPHQTVFVNDPKLSDVKQLLASNGFRAEFSSGILYI----------------------- 799

Query: 222 RDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                       N+I    RN A         +EGC  ++YY+++ ++Y Q+A++
Sbjct: 800 ------------NNIASIRRNEA-----GRFHVEGCACEDYYKIRDIVYAQFAVV 837


>gi|393910520|gb|EJD75913.1| cleavage and polyadenylation specificity factor subunit 2, variant
           [Loa loa]
          Length = 664

 Score =  147 bits (371), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 77/152 (50%), Positives = 100/152 (65%), Gaps = 15/152 (9%)

Query: 19  FKHVKLCH---SLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSP 73
           +  V L H   S+ E AK  +EWMSDK++KSFE  R NPF F+HV+LCH+  +L +V SP
Sbjct: 263 YNLVMLSHVASSVVEFAKSQVEWMSDKVLKSFEVGRYNPFQFRHVQLCHTHIDLLRVRSP 322

Query: 74  KVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGG---------- 123
           KVVLVS  DME GFSR+LF +WC+  +NS+I+T R+   TL   LI +            
Sbjct: 323 KVVLVSGLDMESGFSRELFLEWCTDIKNSVIVTGRSGDRTLGARLIRMAEQAAENPNGTI 382

Query: 124 NRTLTLQVKKRIRLEGEELEEYQKKKDKEAKD 155
           NR LTL+VK+RIRLEG ELE Y+ KK  E ++
Sbjct: 383 NRNLTLEVKRRIRLEGAELENYRAKKRAEERE 414



 Score = 51.6 bits (122), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 21/36 (58%), Positives = 29/36 (80%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
           +VVLVS  DME GFSR+LF +WC+  +NS+I+T R+
Sbjct: 323 KVVLVSGLDMESGFSRELFLEWCTDIKNSVIVTGRS 358


>gi|393910519|gb|EFO19846.2| cleavage and polyadenylation specificity factor subunit 2 [Loa loa]
          Length = 828

 Score =  147 bits (371), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 77/152 (50%), Positives = 100/152 (65%), Gaps = 15/152 (9%)

Query: 19  FKHVKLCH---SLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSP 73
           +  V L H   S+ E AK  +EWMSDK++KSFE  R NPF F+HV+LCH+  +L +V SP
Sbjct: 263 YNLVMLSHVASSVVEFAKSQVEWMSDKVLKSFEVGRYNPFQFRHVQLCHTHIDLLRVRSP 322

Query: 74  KVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGG---------- 123
           KVVLVS  DME GFSR+LF +WC+  +NS+I+T R+   TL   LI +            
Sbjct: 323 KVVLVSGLDMESGFSRELFLEWCTDIKNSVIVTGRSGDRTLGARLIRMAEQAAENPNGTI 382

Query: 124 NRTLTLQVKKRIRLEGEELEEYQKKKDKEAKD 155
           NR LTL+VK+RIRLEG ELE Y+ KK  E ++
Sbjct: 383 NRNLTLEVKRRIRLEGAELENYRAKKRAEERE 414



 Score = 52.0 bits (123), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 21/36 (58%), Positives = 29/36 (80%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
           +VVLVS  DME GFSR+LF +WC+  +NS+I+T R+
Sbjct: 323 KVVLVSGLDMESGFSRELFLEWCTDIKNSVIVTGRS 358



 Score = 51.6 bits (122), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 32/115 (27%), Positives = 48/115 (41%), Gaps = 40/115 (34%)

Query: 162 IPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGFS 221
           IPPH T F+N+ +LSD KQ L  NG   EF  G+L                         
Sbjct: 754 IPPHQTVFVNDPKLSDVKQLLASNGFRAEFSSGILYI----------------------- 790

Query: 222 RDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                       N+I    RN A         +EGC  ++YY+++ ++Y Q+A++
Sbjct: 791 ------------NNIASIRRNEA-----GRFHVEGCACEDYYKIRDIVYAQFAVV 828


>gi|349604123|gb|AEP99763.1| Cleavage and polyadenylation specificity factor subunit 2-like
           protein, partial [Equus caballus]
          Length = 281

 Score =  146 bits (369), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 62/89 (69%), Positives = 76/89 (85%)

Query: 32  AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
           +++EWMSDKLM+ FE  RNNPF F+H+ LCH L++LA+VPSPKVVL S PD+ECGFSRDL
Sbjct: 189 SQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDL 248

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLIE 120
           F QWC  P+NSII+T RT+PGTLAR LI+
Sbjct: 249 FIQWCQDPKNSIILTYRTTPGTLARFLID 277



 Score = 61.6 bits (148), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 25/35 (71%), Positives = 29/35 (82%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 241
           +VVL S PD+ECGFSRDLF QWC  P+NSII+T R
Sbjct: 231 KVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYR 265


>gi|350587145|ref|XP_001926907.3| PREDICTED: cleavage and polyadenylation specificity factor subunit
           2 [Sus scrofa]
          Length = 438

 Score =  145 bits (367), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 62/89 (69%), Positives = 76/89 (85%)

Query: 32  AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
           +++EWMSDKLM+ FE  RNNPF F+H+ LCH L++LA+VPSPKVVL S PD+ECGFSRDL
Sbjct: 282 SQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDL 341

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLIE 120
           F QWC  P+NSII+T RT+PGTLAR LI+
Sbjct: 342 FIQWCQDPKNSIILTYRTTPGTLARFLID 370



 Score = 61.6 bits (148), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 25/36 (69%), Positives = 29/36 (80%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
           +VVL S PD+ECGFSRDLF QWC  P+NSII+T R 
Sbjct: 324 KVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 359


>gi|170581110|ref|XP_001895540.1| cleavage and polyadenylation specificity factor [Brugia malayi]
 gi|158597460|gb|EDP35606.1| cleavage and polyadenylation specificity factor, putative [Brugia
           malayi]
          Length = 831

 Score =  145 bits (367), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 76/152 (50%), Positives = 100/152 (65%), Gaps = 15/152 (9%)

Query: 19  FKHVKLCH---SLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSP 73
           +  V L H   S+ E AK  +EWMSDK++KSFE  R NPF F+HV+LCH+  +L +V SP
Sbjct: 263 YNLVMLSHVASSVVEFAKSQVEWMSDKVLKSFEVGRYNPFQFRHVQLCHTHIDLMRVRSP 322

Query: 74  KVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGG---------- 123
           KVVLVS  DME GFSR+LF +WC+  +NS+I+T R+   TL   LI +            
Sbjct: 323 KVVLVSGLDMESGFSRELFLEWCTDIKNSVIVTGRSGDRTLGARLIRMAEQAAENPNGTI 382

Query: 124 NRTLTLQVKKRIRLEGEELEEYQKKKDKEAKD 155
           NR LTL+VK+RIRL+G ELE Y+ KK  E ++
Sbjct: 383 NRNLTLEVKRRIRLDGVELENYRAKKRAEERE 414



 Score = 51.6 bits (122), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 21/36 (58%), Positives = 29/36 (80%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
           +VVLVS  DME GFSR+LF +WC+  +NS+I+T R+
Sbjct: 323 KVVLVSGLDMESGFSRELFLEWCTDIKNSVIVTGRS 358



 Score = 50.4 bits (119), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 32/115 (27%), Positives = 47/115 (40%), Gaps = 40/115 (34%)

Query: 162 IPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGFS 221
           IPPH   F+N+ +LSD KQ L  NG   EF  GVL                         
Sbjct: 757 IPPHQAVFVNDPKLSDMKQLLASNGFRAEFSSGVLYI----------------------- 793

Query: 222 RDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                       N+I    RN A         +EGC  ++YY+++ ++Y Q+A++
Sbjct: 794 ------------NNIASIRRNEA-----GRFHVEGCACEDYYKIRDIVYAQFAVV 831


>gi|391325231|ref|XP_003737142.1| PREDICTED: probable cleavage and polyadenylation specificity factor
           subunit 2-like isoform 1 [Metaseiulus occidentalis]
          Length = 741

 Score =  144 bits (362), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 69/129 (53%), Positives = 97/129 (75%), Gaps = 9/129 (6%)

Query: 32  AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
           +++EWMSDK+M+SFEGAR+NPF FK++  CHS  E+  V  PKVVL S PD+E G+ RDL
Sbjct: 280 SQVEWMSDKVMRSFEGARSNPFQFKYLIPCHSHGEIQSVSEPKVVLASMPDLESGYGRDL 339

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRT--LTLQVKKRIRLEGEELEEYQK-- 147
           F  W S+P+NS+I+T+R+SPGTLAR+L++   NR   + L +K+R+ LE +ELEE+ +  
Sbjct: 340 FMLWASNPKNSVILTSRSSPGTLARNLVD---NRPKFVHLTLKQRVALEADELEEHVRNE 396

Query: 148 --KKDKEAK 154
             KK+KE K
Sbjct: 397 RLKKEKETK 405



 Score = 74.7 bits (182), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 48/147 (32%), Positives = 70/147 (47%), Gaps = 47/147 (31%)

Query: 138 EGEELEEYQKKKDKEAKDK--------QEKEKIPPHDTSFINELQLSDFKQTLQRNGIDC 189
           E EE  +   K D+E K+K             + PH+  FIN+L+LSDFKQ L ++GI  
Sbjct: 634 EPEEDNDLVGKGDEETKEKLMVLQPLGDSNRVVAPHNPLFINDLKLSDFKQVLVKSGISA 693

Query: 190 EFMDGVLICCRGTVAVRRVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRNRADDSDS 249
           EF  GVL C   +VAV+R                                       +++
Sbjct: 694 EFSGGVLYCNNCSVAVKR---------------------------------------NET 714

Query: 250 NVIVLEGCLSDEYYRVQQLLYDQYAIL 276
             + +EG L+D+Y+R+++LLYDQYAIL
Sbjct: 715 GRLSVEGALTDDYFRIRELLYDQYAIL 741



 Score = 53.5 bits (127), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 22/51 (43%), Positives = 35/51 (68%)

Query: 202 TVAVRRVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRNRADDSDSNVI 252
           +V+  +VVL S PD+E G+ RDLF  W S+P+NS+I+T+R+       N++
Sbjct: 317 SVSEPKVVLASMPDLESGYGRDLFMLWASNPKNSVILTSRSSPGTLARNLV 367


>gi|391325235|ref|XP_003737144.1| PREDICTED: probable cleavage and polyadenylation specificity factor
           subunit 2-like isoform 3 [Metaseiulus occidentalis]
          Length = 754

 Score =  143 bits (361), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 69/129 (53%), Positives = 97/129 (75%), Gaps = 9/129 (6%)

Query: 32  AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
           +++EWMSDK+M+SFEGAR+NPF FK++  CHS  E+  V  PKVVL S PD+E G+ RDL
Sbjct: 280 SQVEWMSDKVMRSFEGARSNPFQFKYLIPCHSHGEIQSVSEPKVVLASMPDLESGYGRDL 339

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRT--LTLQVKKRIRLEGEELEEYQK-- 147
           F  W S+P+NS+I+T+R+SPGTLAR+L++   NR   + L +K+R+ LE +ELEE+ +  
Sbjct: 340 FMLWASNPKNSVILTSRSSPGTLARNLVD---NRPKFVHLTLKQRVALEADELEEHVRNE 396

Query: 148 --KKDKEAK 154
             KK+KE K
Sbjct: 397 RLKKEKETK 405



 Score = 74.7 bits (182), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 48/147 (32%), Positives = 70/147 (47%), Gaps = 47/147 (31%)

Query: 138 EGEELEEYQKKKDKEAKDK--------QEKEKIPPHDTSFINELQLSDFKQTLQRNGIDC 189
           E EE  +   K D+E K+K             + PH+  FIN+L+LSDFKQ L ++GI  
Sbjct: 647 EPEEDNDLVGKGDEETKEKLMVLQPLGDSNRVVAPHNPLFINDLKLSDFKQVLVKSGISA 706

Query: 190 EFMDGVLICCRGTVAVRRVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRNRADDSDS 249
           EF  GVL C   +VAV+R                                       +++
Sbjct: 707 EFSGGVLYCNNCSVAVKR---------------------------------------NET 727

Query: 250 NVIVLEGCLSDEYYRVQQLLYDQYAIL 276
             + +EG L+D+Y+R+++LLYDQYAIL
Sbjct: 728 GRLSVEGALTDDYFRIRELLYDQYAIL 754



 Score = 53.9 bits (128), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 22/51 (43%), Positives = 35/51 (68%)

Query: 202 TVAVRRVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRNRADDSDSNVI 252
           +V+  +VVL S PD+E G+ RDLF  W S+P+NS+I+T+R+       N++
Sbjct: 317 SVSEPKVVLASMPDLESGYGRDLFMLWASNPKNSVILTSRSSPGTLARNLV 367


>gi|391325233|ref|XP_003737143.1| PREDICTED: probable cleavage and polyadenylation specificity factor
           subunit 2-like isoform 2 [Metaseiulus occidentalis]
          Length = 745

 Score =  143 bits (361), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 69/129 (53%), Positives = 97/129 (75%), Gaps = 9/129 (6%)

Query: 32  AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
           +++EWMSDK+M+SFEGAR+NPF FK++  CHS  E+  V  PKVVL S PD+E G+ RDL
Sbjct: 280 SQVEWMSDKVMRSFEGARSNPFQFKYLIPCHSHGEIQSVSEPKVVLASMPDLESGYGRDL 339

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRT--LTLQVKKRIRLEGEELEEYQK-- 147
           F  W S+P+NS+I+T+R+SPGTLAR+L++   NR   + L +K+R+ LE +ELEE+ +  
Sbjct: 340 FMLWASNPKNSVILTSRSSPGTLARNLVD---NRPKFVHLTLKQRVALEADELEEHVRNE 396

Query: 148 --KKDKEAK 154
             KK+KE K
Sbjct: 397 RLKKEKETK 405



 Score = 74.7 bits (182), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 48/147 (32%), Positives = 70/147 (47%), Gaps = 47/147 (31%)

Query: 138 EGEELEEYQKKKDKEAKDK--------QEKEKIPPHDTSFINELQLSDFKQTLQRNGIDC 189
           E EE  +   K D+E K+K             + PH+  FIN+L+LSDFKQ L ++GI  
Sbjct: 638 EPEEDNDLVGKGDEETKEKLMVLQPLGDSNRVVAPHNPLFINDLKLSDFKQVLVKSGISA 697

Query: 190 EFMDGVLICCRGTVAVRRVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRNRADDSDS 249
           EF  GVL C   +VAV+R                                       +++
Sbjct: 698 EFSGGVLYCNNCSVAVKR---------------------------------------NET 718

Query: 250 NVIVLEGCLSDEYYRVQQLLYDQYAIL 276
             + +EG L+D+Y+R+++LLYDQYAIL
Sbjct: 719 GRLSVEGALTDDYFRIRELLYDQYAIL 745



 Score = 53.5 bits (127), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 22/51 (43%), Positives = 35/51 (68%)

Query: 202 TVAVRRVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRNRADDSDSNVI 252
           +V+  +VVL S PD+E G+ RDLF  W S+P+NS+I+T+R+       N++
Sbjct: 317 SVSEPKVVLASMPDLESGYGRDLFMLWASNPKNSVILTSRSSPGTLARNLV 367


>gi|196012036|ref|XP_002115881.1| hypothetical protein TRIADDRAFT_30006 [Trichoplax adhaerens]
 gi|190581657|gb|EDV21733.1| hypothetical protein TRIADDRAFT_30006 [Trichoplax adhaerens]
          Length = 745

 Score =  137 bits (346), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 62/124 (50%), Positives = 89/124 (71%), Gaps = 3/124 (2%)

Query: 24  LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTP 81
           + +++ E AK  +EWM DK++ +FE  RNNPF FK+++LCHSL EL+ +P PKVVL S+P
Sbjct: 271 VSYNVVEFAKSQVEWMCDKVLVAFEDNRNNPFQFKYIQLCHSLNELSGLPEPKVVLASSP 330

Query: 82  DMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEE 141
           D+ CGF+RDLF QW  + +N  I T R+SPGTL R +++    +++ + VK R+ L G E
Sbjct: 331 DLTCGFARDLFLQWAGNSKNLTIFTGRSSPGTLGRHILD-ERPQSIDVTVKTRVELSGNE 389

Query: 142 LEEY 145
           LEEY
Sbjct: 390 LEEY 393



 Score = 62.4 bits (150), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 37/118 (31%), Positives = 56/118 (47%), Gaps = 40/118 (33%)

Query: 159 KEKIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMEC 218
           + +IP H + FIN+ +LSD KQTL + GI  EF+ GV++ C   +AVRR           
Sbjct: 668 QSEIPGHKSVFINDPRLSDLKQTLTKAGIQAEFVGGVIV-CNDKIAVRR----------- 715

Query: 219 GFSRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                       +++  I LEG + ++YY V+ +LY QYAI+
Sbjct: 716 ----------------------------TETGKITLEGAICNDYYTVRDILYQQYAII 745



 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 19/36 (52%), Positives = 26/36 (72%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
           +VVL S+PD+ CGF+RDLF QW  + +N  I T R+
Sbjct: 323 KVVLASSPDLTCGFARDLFLQWAGNSKNLTIFTGRS 358


>gi|395827898|ref|XP_003787126.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           2 [Otolemur garnettii]
          Length = 750

 Score =  135 bits (339), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 57/95 (60%), Positives = 75/95 (78%)

Query: 45  FEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSII 104
           +   RNNPF F+H+ LCH L++LA+VPSPKVVL S PD+ECGFSRDLF QWC  P+NSII
Sbjct: 294 YXDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDLFIQWCQDPKNSII 353

Query: 105 ITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEG 139
           +T RT+PGTLAR LI+    +   ++++KR++LEG
Sbjct: 354 LTYRTTPGTLARFLIDNPSEKITEIELRKRVKLEG 388



 Score = 69.3 bits (168), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 41/116 (35%), Positives = 57/116 (49%), Gaps = 40/116 (34%)

Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
           ++P H + F+NE +LSDFKQ L R GI  EF+ GVL+ C   VAVRR             
Sbjct: 675 EVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNQVAVRR------------- 720

Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                     +++  I LEGCL  ++YR++ LLY+QYAI+
Sbjct: 721 --------------------------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 750



 Score = 62.4 bits (150), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 25/36 (69%), Positives = 29/36 (80%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
           +VVL S PD+ECGFSRDLF QWC  P+NSII+T R 
Sbjct: 323 KVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 358


>gi|307203591|gb|EFN82620.1| Probable cleavage and polyadenylation specificity factor subunit 2
           [Harpegnathos saltator]
          Length = 685

 Score =  134 bits (336), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 67/104 (64%), Positives = 85/104 (81%), Gaps = 6/104 (5%)

Query: 60  LCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLI 119
           L +SLA L  V     VL STPDMECGFSR+LF QWC++P+NSII+T+RTSPGTLARDL+
Sbjct: 262 LAYSLALLNNV-----VLASTPDMECGFSRELFLQWCTNPQNSIILTSRTSPGTLARDLV 316

Query: 120 ELGGNRTLTLQVKKRIRLEGEELEEYQKK-KDKEAKDKQEKEKI 162
           E GGNR +TL+VK+R++LEG ELEEYQK+ K K+ + KQE+ +I
Sbjct: 317 EKGGNRNITLEVKRRVKLEGIELEEYQKREKLKQEQLKQEQMEI 360



 Score = 84.3 bits (207), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 43/116 (37%), Positives = 62/116 (53%), Gaps = 39/116 (33%)

Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
           ++P H T+FINEL+LSDFKQ L ++ I  EF  GVL CC  T+AVRR             
Sbjct: 609 EVPGHQTTFINELKLSDFKQVLNKSNISSEFSGGVLWCCNNTIAVRR------------- 655

Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                      ++  ++LEGC+S++YY+V++LLY+QYAI+
Sbjct: 656 --------------------------HEAGKVILEGCISEDYYKVRELLYEQYAIV 685



 Score = 66.6 bits (161), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 26/35 (74%), Positives = 32/35 (91%)

Query: 208 VVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRN 242
           VVL STPDMECGFSR+LF QWC++P+NSII+T+R 
Sbjct: 272 VVLASTPDMECGFSRELFLQWCTNPQNSIILTSRT 306


>gi|332672684|gb|AEE87269.1| cleavage and polyadenylation specific factor 2 [Sepia officinalis]
          Length = 129

 Score =  133 bits (335), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 55/81 (67%), Positives = 71/81 (87%)

Query: 32  AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
           +++EWMSDK+M++FE  RNNPFHFKH+KLCH+LAEL+KV  PKVVL S PD++CGF+RDL
Sbjct: 49  SQVEWMSDKIMRAFEDHRNNPFHFKHLKLCHNLAELSKVMEPKVVLASMPDLQCGFARDL 108

Query: 92  FFQWCSSPENSIIITNRTSPG 112
           F  WC + +NSII+T+RTSPG
Sbjct: 109 FMGWCGNAKNSIILTSRTSPG 129



 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 21/35 (60%), Positives = 29/35 (82%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 241
           +VVL S PD++CGF+RDLF  WC + +NSII+T+R
Sbjct: 91  KVVLASMPDLQCGFARDLFMGWCGNAKNSIILTSR 125


>gi|17559452|ref|NP_504822.1| Protein CPSF-2 [Caenorhabditis elegans]
 gi|18201967|sp|O17403.1|CPSF2_CAEEL RecName: Full=Probable cleavage and polyadenylation specificity
           factor subunit 2; AltName: Full=Cleavage and
           polyadenylation specificity factor 100 kDa subunit;
           Short=CPSF 100 kDa subunit
 gi|351057814|emb|CCD64424.1| Protein CPSF-2 [Caenorhabditis elegans]
          Length = 843

 Score =  125 bits (313), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 67/139 (48%), Positives = 88/139 (63%), Gaps = 13/139 (9%)

Query: 24  LCHSLAELAK--IEWMSDKLMK-SFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVST 80
           +  S+ + AK  +EWM++KL K     AR NPF  KHV LCHS  EL +V SPKVVL S+
Sbjct: 271 VASSVVQFAKSQLEWMNEKLFKYDSSSARYNPFTLKHVTLCHSHQELMRVRSPKVVLCSS 330

Query: 81  PDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELG----------GNRTLTLQ 130
            DME GFSR+LF  WCS P N +I+T R +  TLA  L+ +            +R ++L 
Sbjct: 331 QDMESGFSRELFLDWCSDPRNGVILTARPASFTLAAKLVNMAERANDGVLKHEDRLISLV 390

Query: 131 VKKRIRLEGEELEEYQKKK 149
           VKKR+ LEGEEL EY+++K
Sbjct: 391 VKKRVALEGEELLEYKRRK 409



 Score = 57.0 bits (136), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 33/122 (27%), Positives = 51/122 (41%), Gaps = 39/122 (31%)

Query: 155 DKQEKEKIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTP 214
           D   K  IP H   F+N+ +LSDFK  L   G   EF+ G L+   G  ++RR       
Sbjct: 761 DPLPKRLIPIHQAVFVNDPKLSDFKNLLTDKGYKAEFLSGTLLINGGNCSIRR------- 813

Query: 215 DMECGFSRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYA 274
                                           +D+ V  +EG  + +YY++++L YDQ+A
Sbjct: 814 --------------------------------NDTGVFQMEGAFTKDYYKLRRLFYDQFA 841

Query: 275 IL 276
           +L
Sbjct: 842 VL 843



 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 21/35 (60%), Positives = 26/35 (74%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 241
           +VVL S+ DME GFSR+LF  WCS P N +I+T R
Sbjct: 324 KVVLCSSQDMESGFSRELFLDWCSDPRNGVILTAR 358


>gi|308480408|ref|XP_003102411.1| CRE-CPSF-2 protein [Caenorhabditis remanei]
 gi|308262077|gb|EFP06030.1| CRE-CPSF-2 protein [Caenorhabditis remanei]
          Length = 850

 Score =  123 bits (308), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 67/139 (48%), Positives = 87/139 (62%), Gaps = 13/139 (9%)

Query: 24  LCHSLAELAK--IEWMSDKLMK-SFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVST 80
           +  S+ + AK  +EWM++KL K     AR NPF  KH+ LCHS  EL +V SPKVVL S+
Sbjct: 271 VASSVVQFAKSQLEWMNEKLFKYDSNSARYNPFTLKHITLCHSHQELMRVRSPKVVLCSS 330

Query: 81  PDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELG----------GNRTLTLQ 130
            DME GFSR+LF  WCS   N +I+T R S  TLA  L+ L            +R ++L 
Sbjct: 331 QDMESGFSRELFLDWCSDSRNGVILTARPSSFTLAAKLVNLAERANDGVLRNEDRLISLS 390

Query: 131 VKKRIRLEGEELEEYQKKK 149
           VKKR+ LEGEEL EY+++K
Sbjct: 391 VKKRVPLEGEELLEYKRRK 409



 Score = 57.8 bits (138), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 37/118 (31%), Positives = 51/118 (43%), Gaps = 40/118 (33%)

Query: 159 KEKIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMEC 218
           K+ IP H   F+N+ +LSDFK  L   G   EF+ G L+   G  ++RR         E 
Sbjct: 773 KKLIPIHQAIFVNDPKLSDFKNLLVEKGYKAEFLSGTLLINGGKCSIRR--------GEM 824

Query: 219 GFSRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
           GFS                                +EG LS +YY+++ L YDQ+AIL
Sbjct: 825 GFS--------------------------------MEGALSKDYYKLRNLFYDQFAIL 850



 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 20/35 (57%), Positives = 25/35 (71%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 241
           +VVL S+ DME GFSR+LF  WCS   N +I+T R
Sbjct: 324 KVVLCSSQDMESGFSRELFLDWCSDSRNGVILTAR 358


>gi|440797154|gb|ELR18249.1| cleavage and polyadenylation specificity factor subunit 2, putative
           [Acanthamoeba castellanii str. Neff]
          Length = 799

 Score =  119 bits (299), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 62/144 (43%), Positives = 93/144 (64%), Gaps = 5/144 (3%)

Query: 23  KLCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVST 80
           K  ++  E AK  +EWMS  + KSF+  R NPF FK V+LCHS+ EL  +P P VVL +T
Sbjct: 232 KQAYNTIEFAKSQLEWMSTAVQKSFDLDRVNPFEFKFVRLCHSVEELEALPKPLVVLATT 291

Query: 81  PDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGE 140
             +E GF+RDLF +W S+P +++I T+R  PGTL   L+       L L++ +R+ LEG 
Sbjct: 292 ASLEWGFARDLFVEWSSNPRHAVIFTDRPQPGTLGH-LVLTQQPPALGLELHRRVPLEGA 350

Query: 141 ELEEYQKKKDKEAKDK--QEKEKI 162
           EL E+++K+ +E   K  +E++K+
Sbjct: 351 ELREWRQKQQEEKARKLLEEQQKV 374



 Score = 62.0 bits (149), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 47/176 (26%), Positives = 69/176 (39%), Gaps = 59/176 (33%)

Query: 101 NSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKDKEAKDKQEKE 160
           N +++  + + G   R  I + G              EG ELE  Q  +          E
Sbjct: 683 NGVVMLKKETEGLSGRSKISVNG--------------EGMELELEQAPE----------E 718

Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
            IPPHD  F+ E++LSDFK  L ++G   EF  GVLIC                      
Sbjct: 719 IIPPHDPVFVGEVRLSDFKDILTQHGFRTEFAAGVLIC---------------------- 756

Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                        N +++  +     S  + I + G L D+Y+ V+ LLY Q+ IL
Sbjct: 757 -------------NGVVMLKKETEGLSGRSKISVNGALCDDYFAVRDLLYSQFHIL 799



 Score = 45.4 bits (106), Expect = 0.027,   Method: Compositional matrix adjust.
 Identities = 17/34 (50%), Positives = 26/34 (76%)

Query: 208 VVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 241
           VVL +T  +E GF+RDLF +W S+P +++I T+R
Sbjct: 286 VVLATTASLEWGFARDLFVEWSSNPRHAVIFTDR 319


>gi|320163729|gb|EFW40628.1| cleavage and polyadenylation specificity factor [Capsaspora
           owczarzaki ATCC 30864]
          Length = 744

 Score =  119 bits (298), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 62/155 (40%), Positives = 93/155 (60%), Gaps = 6/155 (3%)

Query: 1   MSDKLMKSFEGARNNPFH-FKHVKLCHSLAELAKIEWMSDKLMKSFEGARNNPFHFKHVK 59
           ++  L K +E  +N P     H+        +A+IEWMS K+ K FE  R+NPF F H+K
Sbjct: 166 LAQVLDKQWEKYKNFPIVVLSHISRTVMNFAMAQIEWMSAKMQKEFETTRSNPFSFAHIK 225

Query: 60  LCHSLAELAKVP---SPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLAR 116
           +C ++ ELA+V    +P VVL S   +  GF+RDL  +W  +P+N II  N +    LA+
Sbjct: 226 MCQTMEELAQVAKEGTPVVVLASMEGLTSGFARDLMLKWAENPKNLIIFPNNSPASDLAK 285

Query: 117 DLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKDK 151
            L+E   NR + + VK RI LEGEEL+EY +++++
Sbjct: 286 SLVE--KNRQIVIDVKTRIALEGEELDEYLREQEE 318



 Score = 44.7 bits (104), Expect = 0.046,   Method: Compositional matrix adjust.
 Identities = 29/93 (31%), Positives = 42/93 (45%), Gaps = 16/93 (17%)

Query: 189 CEFMDGVLICCRGTVAVRRVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRNRADDSD 248
           C+ M+ +    +    V  VVL S   +  GF+RDL  +W  +P+N II  N + A D  
Sbjct: 227 CQTMEELAQVAKEGTPV--VVLASMEGLTSGFARDLMLKWAENPKNLIIFPNNSPASDLA 284

Query: 249 SNV--------------IVLEGCLSDEYYRVQQ 267
            ++              I LEG   DEY R Q+
Sbjct: 285 KSLVEKNRQIVIDVKTRIALEGEELDEYLREQE 317


>gi|341883504|gb|EGT39439.1| CBN-CPSF-2 protein [Caenorhabditis brenneri]
          Length = 822

 Score =  115 bits (289), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 65/139 (46%), Positives = 84/139 (60%), Gaps = 13/139 (9%)

Query: 24  LCHSLAELAK--IEWMSDKLMK-SFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVST 80
           +  S+ + AK  +EWM + L K      R NPF  K+V LCHS  EL +V SPKVVL S+
Sbjct: 271 VASSVVQFAKSQLEWMHESLFKYDSNSTRYNPFTLKNVTLCHSHQELLRVRSPKVVLCSS 330

Query: 81  PDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELG----------GNRTLTLQ 130
            DME GFSR+LF  WCS   N +I+T R S  TLA  L+ L            +R ++L 
Sbjct: 331 QDMEAGFSRELFLDWCSDSRNGVILTARPSSFTLAAKLVNLAERANDGILRPEDRLISLL 390

Query: 131 VKKRIRLEGEELEEYQKKK 149
           VKKR+ LEGEEL EY+++K
Sbjct: 391 VKKRVPLEGEELLEYKRRK 409



 Score = 57.0 bits (136), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 32/122 (26%), Positives = 52/122 (42%), Gaps = 39/122 (31%)

Query: 155 DKQEKEKIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTP 214
           D  ++ +IP H   F+N+ ++SDFK  L   G   EF+ G LI   G  ++RR       
Sbjct: 740 DPLQRSRIPVHQAIFVNDPKMSDFKNLLVERGYKAEFLSGTLIINGGKCSIRR------- 792

Query: 215 DMECGFSRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYA 274
                                           S++    +EG  + +YY+V++L YDQ+A
Sbjct: 793 --------------------------------SETGSFQMEGAFTKDYYKVRKLFYDQFA 820

Query: 275 IL 276
           +L
Sbjct: 821 VL 822



 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 20/35 (57%), Positives = 25/35 (71%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 241
           +VVL S+ DME GFSR+LF  WCS   N +I+T R
Sbjct: 324 KVVLCSSQDMEAGFSRELFLDWCSDSRNGVILTAR 358


>gi|348689662|gb|EGZ29476.1| hypothetical protein PHYSODRAFT_552782 [Phytophthora sojae]
          Length = 513

 Score =  114 bits (286), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 52/134 (38%), Positives = 83/134 (61%), Gaps = 4/134 (2%)

Query: 29  AELAKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFS 88
           A  A +EW +D++ K+F+  R NPF F H+ L H+L EL  +PSPKVVL ++P +ECGF+
Sbjct: 57  AAQAMLEWCNDRIAKNFDVGRQNPFQFSHIHLVHTLEELDALPSPKVVLATSPSLECGFA 116

Query: 89  RDLFFQWCSSPENSIIITNRTSPGTLARDLIELG----GNRTLTLQVKKRIRLEGEELEE 144
           +D+F +W   P NSII T+ T   + A  ++++       + ++  V K++ LEG EL  
Sbjct: 117 KDIFIRWAPDPRNSIIFTSTTPETSFASRVLKIAKDPSAAKVISCTVTKKVFLEGAELAL 176

Query: 145 YQKKKDKEAKDKQE 158
           Y+ K+ K  + + E
Sbjct: 177 YEVKERKRLRTEAE 190



 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 17/34 (50%), Positives = 26/34 (76%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITN 240
           +VVL ++P +ECGF++D+F +W   P NSII T+
Sbjct: 102 KVVLATSPSLECGFAKDIFIRWAPDPRNSIIFTS 135


>gi|268558798|ref|XP_002637390.1| Hypothetical protein CBG19097 [Caenorhabditis briggsae]
          Length = 838

 Score =  114 bits (284), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 63/139 (45%), Positives = 86/139 (61%), Gaps = 13/139 (9%)

Query: 24  LCHSLAELAK--IEWMSDKLMK-SFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVST 80
           +  S+ + AK  +EWM +KL +     AR NPF  K+V L HS  EL K+ SPKVVL S+
Sbjct: 271 VASSVVQFAKSQLEWMDEKLFRYDSSSARYNPFTLKNVNLVHSHLELIKIRSPKVVLCSS 330

Query: 81  PDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELG----------GNRTLTLQ 130
            DME GFSR+LF  WC+   N +I+T R +  TLA  L+EL            ++ L+L 
Sbjct: 331 QDMETGFSRELFLDWCADQRNGVILTARPASFTLAARLVELAERANDGVLRNEDKHLSLL 390

Query: 131 VKKRIRLEGEELEEYQKKK 149
           V+KR+ LEGEEL EY+++K
Sbjct: 391 VRKRVPLEGEELLEYKRRK 409



 Score = 48.1 bits (113), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 30/118 (25%), Positives = 49/118 (41%), Gaps = 41/118 (34%)

Query: 159 KEKIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMEC 218
           K++IP H   F+N+ +LS+FK  L   G   EF  G L+   G  ++R          E 
Sbjct: 762 KKQIPVHQAIFVNDPKLSEFKNLLVDKGYKAEFFSGTLLINGGKCSIRG---------ET 812

Query: 219 GFSRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
           GF                                 +EG  + ++Y++++L YDQ+A+L
Sbjct: 813 GFQ--------------------------------MEGAFTKDFYKLRKLFYDQFAVL 838



 Score = 47.8 bits (112), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 19/35 (54%), Positives = 25/35 (71%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 241
           +VVL S+ DME GFSR+LF  WC+   N +I+T R
Sbjct: 324 KVVLCSSQDMETGFSRELFLDWCADQRNGVILTAR 358


>gi|301092283|ref|XP_002997000.1| cleavage and polyadenylation specificity factor subunit, putative
           [Phytophthora infestans T30-4]
 gi|262112189|gb|EEY70241.1| cleavage and polyadenylation specificity factor subunit, putative
           [Phytophthora infestans T30-4]
          Length = 513

 Score =  114 bits (284), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 51/134 (38%), Positives = 84/134 (62%), Gaps = 4/134 (2%)

Query: 29  AELAKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFS 88
           A  A +EW +D++ K+F+  R NPF F H+ L H+L EL  +P+PKVVL ++P +ECGF+
Sbjct: 57  AAQAMLEWCNDRIAKNFDVGRQNPFQFTHIHLVHTLEELDALPNPKVVLATSPSLECGFA 116

Query: 89  RDLFFQWCSSPENSIIITNRTSPGTLARDLIEL----GGNRTLTLQVKKRIRLEGEELEE 144
           +D+F +W   P NSII ++ TS  + A  +++L       + ++  V +++ LEG EL  
Sbjct: 117 KDIFIRWAPDPRNSIIFSSTTSETSFASRVVKLSKDPSAEKNISCTVTQKVFLEGAELAL 176

Query: 145 YQKKKDKEAKDKQE 158
           Y+ K+ K  + + E
Sbjct: 177 YEVKERKRLRTEAE 190



 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 19/48 (39%), Positives = 30/48 (62%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRNRADDSDSNVIVL 254
           +VVL ++P +ECGF++D+F +W   P NSII ++        S V+ L
Sbjct: 102 KVVLATSPSLECGFAKDIFIRWAPDPRNSIIFSSTTSETSFASRVVKL 149


>gi|229553940|sp|A8XUS3.2|CPSF2_CAEBR RecName: Full=Probable cleavage and polyadenylation specificity
           factor subunit 2; AltName: Full=Cleavage and
           polyadenylation specificity factor 100 kDa subunit;
           Short=CPSF 100 kDa subunit
          Length = 842

 Score =  114 bits (284), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 63/139 (45%), Positives = 86/139 (61%), Gaps = 13/139 (9%)

Query: 24  LCHSLAELAK--IEWMSDKLMK-SFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVST 80
           +  S+ + AK  +EWM +KL +     AR NPF  K+V L HS  EL K+ SPKVVL S+
Sbjct: 271 VASSVVQFAKSQLEWMDEKLFRYDSSSARYNPFTLKNVNLVHSHLELIKIRSPKVVLCSS 330

Query: 81  PDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELG----------GNRTLTLQ 130
            DME GFSR+LF  WC+   N +I+T R +  TLA  L+EL            ++ L+L 
Sbjct: 331 QDMETGFSRELFLDWCADQRNGVILTARPASFTLAARLVELAERANDGVLRNEDKHLSLL 390

Query: 131 VKKRIRLEGEELEEYQKKK 149
           V+KR+ LEGEEL EY+++K
Sbjct: 391 VRKRVPLEGEELLEYKRRK 409



 Score = 47.8 bits (112), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 19/35 (54%), Positives = 25/35 (71%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 241
           +VVL S+ DME GFSR+LF  WC+   N +I+T R
Sbjct: 324 KVVLCSSQDMETGFSRELFLDWCADQRNGVILTAR 358



 Score = 37.7 bits (86), Expect = 5.0,   Method: Compositional matrix adjust.
 Identities = 17/47 (36%), Positives = 26/47 (55%)

Query: 159 KEKIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAV 205
           K++IP H   F+N+ +LS+FK  L   G   EF  G L+   G  ++
Sbjct: 764 KKQIPVHQAIFVNDPKLSEFKNLLVDKGYKAEFFSGTLLINGGKCSI 810


>gi|339247939|ref|XP_003375603.1| cleavage and polyadenylation specificity factor subunit 2
           [Trichinella spiralis]
 gi|316971010|gb|EFV54853.1| cleavage and polyadenylation specificity factor subunit 2
           [Trichinella spiralis]
          Length = 1188

 Score =  113 bits (283), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 58/137 (42%), Positives = 84/137 (61%), Gaps = 20/137 (14%)

Query: 24  LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSP-------- 73
           +  ++ E AK  +EWMS+++++ FE  R+NPF F+H +LCHSLAEL ++ SP        
Sbjct: 272 VAFNVVEFAKSQVEWMSERMLRMFEEGRSNPFQFRHAQLCHSLAELTRLRSPKVLSFRDA 331

Query: 74  ----KVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELG------G 123
               KVVL S PD++ GFSR+LF  WC   +N II+T+R   G+L   LIE+       G
Sbjct: 332 FFSDKVVLASQPDLDSGFSRELFLDWCIDAKNCIILTSRARIGSLCSKLIEMVSSPERIG 391

Query: 124 NRTLTLQVKKRIRLEGE 140
            + +T+QVK+R    GE
Sbjct: 392 TKQITVQVKRRFDDYGE 408



 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 20/37 (54%), Positives = 27/37 (72%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRNR 243
           +VVL S PD++ GFSR+LF  WC   +N II+T+R R
Sbjct: 336 KVVLASQPDLDSGFSRELFLDWCIDAKNCIILTSRAR 372



 Score = 40.8 bits (94), Expect = 0.70,   Method: Compositional matrix adjust.
 Identities = 24/78 (30%), Positives = 34/78 (43%), Gaps = 9/78 (11%)

Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLIC---------CRGTVAVRRVVLV 211
           +IP HD  F+ + +LSD KQ L  +G   EF  GVL+            G + V  +V  
Sbjct: 613 EIPSHDAVFVGDPKLSDLKQALMLDGFQAEFSHGVLVVNNVLSIRKRADGQLHVEGIVCK 672

Query: 212 STPDMECGFSRDLFFQWC 229
               +   F  + FF  C
Sbjct: 673 DYYAIRDQFHANYFFYIC 690


>gi|384484008|gb|EIE76188.1| hypothetical protein RO3G_00892 [Rhizopus delemar RA 99-880]
          Length = 657

 Score =  109 bits (272), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 57/167 (34%), Positives = 90/167 (53%), Gaps = 40/167 (23%)

Query: 14  NNPFHFKHVKLCHSLAELAKI--EWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVP 71
           N  +H  H          AKI  EWM ++L + F  +R NP+ FK+V+LCH + +L   P
Sbjct: 226 NTSYHTVH---------FAKIMLEWMGEELTRKFSQSRENPYEFKYVRLCHKIEDLDNYP 276

Query: 72  SPKVVLVSTPDMECGFSRDLFFQWCSS-PENSIIITNRTSPGTLARDLIE---------- 120
            PK+V+ S   +E GF+R+LF +W ++ P+N++I+T+R++PGTLAR L +          
Sbjct: 277 GPKIVMASHHSLETGFARELFLRWMTNDPQNTLILTDRSAPGTLARRLYDDWEQQTNKTA 336

Query: 121 ------------------LGGNRTLTLQVKKRIRLEGEELEEYQKKK 149
                             +    T+ L+V KR+ LEG EL+EY+  +
Sbjct: 337 TTTTVVNNNRTKVLVKPAIAYENTIDLRVYKRVPLEGAELQEYEAAQ 383



 Score = 39.7 bits (91), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 14/37 (37%), Positives = 29/37 (78%), Gaps = 1/37 (2%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSS-PENSIIITNRN 242
           ++V+ S   +E GF+R+LF +W ++ P+N++I+T+R+
Sbjct: 279 KIVMASHHSLETGFARELFLRWMTNDPQNTLILTDRS 315


>gi|402591052|gb|EJW84982.1| cleavage and polyadenylation specificity factor subunit 2
           [Wuchereria bancrofti]
          Length = 809

 Score =  108 bits (270), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 68/144 (47%), Positives = 85/144 (59%), Gaps = 20/144 (13%)

Query: 19  FKHVKLCH---SLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSP 73
           +  V L H   S+ E AK  +EWMSDK++KSFE  R NPF F+HV+LCH+  +L +V SP
Sbjct: 263 YNLVMLSHVASSVVEFAKSQVEWMSDKVLKSFEVGRYNPFQFRHVQLCHTHIDLMRVRSP 322

Query: 74  KVVLVSTPDMECGFS--RDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQV 131
           KVVLVS  DME G S  R L  +     E     T     GT+         NR LTL+V
Sbjct: 323 KVVLVSGLDMESGRSGDRTLGARLIRMAEQ----TAENPNGTI---------NRNLTLEV 369

Query: 132 KKRIRLEGEELEEYQKKKDKEAKD 155
           K+RIRLEG ELE Y+ KK  E ++
Sbjct: 370 KRRIRLEGVELENYRAKKRAEERE 393



 Score = 46.6 bits (109), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 31/115 (26%), Positives = 46/115 (40%), Gaps = 40/115 (34%)

Query: 162 IPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGFS 221
           IPPH   F+N+ +LSD KQ L  NG   EF  GVL                         
Sbjct: 735 IPPHQAVFVNDPKLSDMKQLLASNGFRAEFSSGVLYI----------------------- 771

Query: 222 RDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                       N+I    RN A         +EG   ++YY+++ ++Y Q+A++
Sbjct: 772 ------------NNIASIRRNEAGR-----FHVEGYACEDYYKIRDIVYAQFAVV 809


>gi|358338982|dbj|GAA43367.2| cleavage and polyadenylation specificity factor subunit 2, partial
           [Clonorchis sinensis]
          Length = 995

 Score =  105 bits (262), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 42/77 (54%), Positives = 59/77 (76%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +EWMS+K+M++FE  R NPFHF+HV+LCH+L +L  VP PKVVL S  D+ CGF+R LF 
Sbjct: 259 VEWMSEKVMRTFEDQRTNPFHFRHVQLCHTLEQLDTVPEPKVVLASASDLSCGFARQLFA 318

Query: 94  QWCSSPENSIIITNRTS 110
           +W  +  N++I+T+R S
Sbjct: 319 EWADNDLNTVILTSRES 335



 Score = 47.0 bits (110), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 33/111 (29%), Positives = 48/111 (43%), Gaps = 40/111 (36%)

Query: 162 IPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGFS 221
           +  H T F+NE +LSD KQ L  NG+  EF+ GVL+     VA++R              
Sbjct: 806 VGAHKTVFVNEPKLSDLKQLLLANGLVAEFVSGVLV-VDNCVAIKR-------------- 850

Query: 222 RDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQ 272
                                    S++  ++LEG LS  Y+ V+Q+LY Q
Sbjct: 851 -------------------------SEAGKLLLEGLLSRTYFTVRQVLYQQ 876



 Score = 43.9 bits (102), Expect = 0.068,   Method: Compositional matrix adjust.
 Identities = 18/40 (45%), Positives = 27/40 (67%)

Query: 202 TVAVRRVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 241
           TV   +VVL S  D+ CGF+R LF +W  +  N++I+T+R
Sbjct: 294 TVPEPKVVLASASDLSCGFARQLFAEWADNDLNTVILTSR 333


>gi|302819854|ref|XP_002991596.1| hypothetical protein SELMODRAFT_429848 [Selaginella moellendorffii]
 gi|300140629|gb|EFJ07350.1| hypothetical protein SELMODRAFT_429848 [Selaginella moellendorffii]
          Length = 715

 Score =  104 bits (259), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 52/142 (36%), Positives = 87/142 (61%), Gaps = 5/142 (3%)

Query: 32  AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVP-SPKVVLVSTPDMECGFSRD 90
           + +EWM+D + K+FE  R NPF  + VKLC S  +L ++P  P+VVL S   +E GF+++
Sbjct: 278 SSLEWMNDAIGKAFEQNRENPFALRSVKLCTSRKQLEELPPGPRVVLASMASLETGFAKE 337

Query: 91  LFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKD 150
           LF +W   P+N ++ T R   GTLAR L      + + + + K++ L GEELE Y++++ 
Sbjct: 338 LFLEWAVDPKNLVLFTERAQVGTLARQLQVEPPPKIVKITISKKVLLVGEELEAYEREQS 397

Query: 151 K---EAKD-KQEKEKIPPHDTS 168
           +   EA++   ++E + P  +S
Sbjct: 398 RLREEARNAASQQEPVQPASSS 419



 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 37/112 (33%), Positives = 50/112 (44%), Gaps = 40/112 (35%)

Query: 165 HDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGFSRDL 224
           H T F+ +L+L+DFKQ L   GI  EF  GVL C    +AVR+                 
Sbjct: 644 HKTVFVGDLRLADFKQLLATKGIQAEFAGGVLRCA-DNIAVRK----------------- 685

Query: 225 FFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                 S    +V+EG LSD+YY+V++LLY QY I+
Sbjct: 686 ----------------------SGGQQLVIEGSLSDDYYKVRELLYSQYHIV 715



 Score = 40.8 bits (94), Expect = 0.62,   Method: Compositional matrix adjust.
 Identities = 15/35 (42%), Positives = 23/35 (65%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 241
           RVVL S   +E GF+++LF +W   P+N ++ T R
Sbjct: 321 RVVLASMASLETGFAKELFLEWAVDPKNLVLFTER 355


>gi|302776792|ref|XP_002971541.1| hypothetical protein SELMODRAFT_441578 [Selaginella moellendorffii]
 gi|300160673|gb|EFJ27290.1| hypothetical protein SELMODRAFT_441578 [Selaginella moellendorffii]
          Length = 721

 Score =  104 bits (259), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 52/142 (36%), Positives = 87/142 (61%), Gaps = 5/142 (3%)

Query: 32  AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVP-SPKVVLVSTPDMECGFSRD 90
           + +EWM+D + K+FE  R NPF  + VKLC S  +L ++P  P+VVL S   +E GF+++
Sbjct: 278 SSLEWMNDAIGKAFEQNRENPFALRSVKLCTSRKQLDELPPGPRVVLASMASLETGFAKE 337

Query: 91  LFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKD 150
           LF +W   P+N ++ T R   GTLAR L      + + + + K++ L GEELE Y++++ 
Sbjct: 338 LFLEWAVDPKNLVLFTERAQVGTLARQLQVEPPPKIVKITISKKVLLVGEELEAYEREQS 397

Query: 151 K---EAKD-KQEKEKIPPHDTS 168
           +   EA++   ++E + P  +S
Sbjct: 398 RLREEARNAASQQEPVQPASSS 419



 Score = 59.7 bits (143), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 37/112 (33%), Positives = 50/112 (44%), Gaps = 40/112 (35%)

Query: 165 HDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGFSRDL 224
           H T F+ +L+L+DFKQ L   GI  EF  GVL C    +AVR+                 
Sbjct: 650 HKTVFVGDLRLADFKQLLATKGIQAEFAGGVLRCA-DNIAVRK----------------- 691

Query: 225 FFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                 S    +V+EG LSD+YY+V++LLY QY I+
Sbjct: 692 ----------------------SGGQQLVIEGSLSDDYYKVRELLYSQYHIV 721



 Score = 40.8 bits (94), Expect = 0.58,   Method: Compositional matrix adjust.
 Identities = 15/35 (42%), Positives = 23/35 (65%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 241
           RVVL S   +E GF+++LF +W   P+N ++ T R
Sbjct: 321 RVVLASMASLETGFAKELFLEWAVDPKNLVLFTER 355


>gi|328768987|gb|EGF79032.1| hypothetical protein BATDEDRAFT_12823 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 719

 Score =  103 bits (258), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 52/123 (42%), Positives = 74/123 (60%), Gaps = 3/123 (2%)

Query: 32  AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
           + +EWM D + ++F  AR  PF FK +K+ HS+ EL  +  PKVVL S P M  GFS+DL
Sbjct: 282 STLEWMGDGIAQAFT-ARELPFEFKCLKMIHSIDELDNLMGPKVVLASFPGMMTGFSQDL 340

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLIE--LGGNRTLTLQVKKRIRLEGEELEEYQKKK 149
             +W S P N II+ +R  PGTL R + +      +   + +KK++ L G+EL+EY  KK
Sbjct: 341 LIEWGSDPRNMIILPDRAQPGTLGRMMFDDWFESAKMADMNLKKQVPLVGDELDEYMSKK 400

Query: 150 DKE 152
             E
Sbjct: 401 QAE 403



 Score = 43.5 bits (101), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 18/37 (48%), Positives = 24/37 (64%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRNR 243
           +VVL S P M  GFS+DL  +W S P N II+ +R +
Sbjct: 323 KVVLASFPGMMTGFSQDLLIEWGSDPRNMIILPDRAQ 359


>gi|325187176|emb|CCA21717.1| cleavage and polyadenylation specificity factor subunit putative
           [Albugo laibachii Nc14]
 gi|325187319|emb|CCA21858.1| cleavage and polyadenylation specificity factor subunit putative
           [Albugo laibachii Nc14]
          Length = 731

 Score =  103 bits (258), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 47/134 (35%), Positives = 82/134 (61%), Gaps = 3/134 (2%)

Query: 29  AELAKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFS 88
           A  A +EW ++++ ++F+  R NPF F H+ L HS+ EL K+ SPKVVL ++  +ECG++
Sbjct: 276 AAEAMLEWCNEQIARNFDAGRQNPFQFSHIHLIHSIEELEKLSSPKVVLATSATLECGYA 335

Query: 89  RDLFFQWCSSPENSIIITNRTSPGTLARDLIEL---GGNRTLTLQVKKRIRLEGEELEEY 145
           ++LF ++ +   NSII T    P +    ++++     +R +T  V KR+ LEG EL  Y
Sbjct: 336 KELFIKYAADTRNSIIFTTTPPPRSFGARILDMNKKNDSRVVTCSVAKRVLLEGTELALY 395

Query: 146 QKKKDKEAKDKQEK 159
           + K+ +  + + E+
Sbjct: 396 EAKERRRLRLEAEQ 409


>gi|330803886|ref|XP_003289932.1| hypothetical protein DICPUDRAFT_80682 [Dictyostelium purpureum]
 gi|325079974|gb|EGC33550.1| hypothetical protein DICPUDRAFT_80682 [Dictyostelium purpureum]
          Length = 752

 Score =  102 bits (255), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 48/123 (39%), Positives = 78/123 (63%), Gaps = 6/123 (4%)

Query: 32  AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
           +++E+MS      FE   +NPF F+ +K+  +L E+ +  +PKV+L S+ D+E G+SRDL
Sbjct: 284 SQLEFMSSSASVRFEQKIDNPFTFRQIKIFSTLEEIPETNTPKVILTSSQDLETGYSRDL 343

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLIEL------GGNRTLTLQVKKRIRLEGEELEEY 145
           F +W S P+N I+ TN    G+LA  +I +      G N+T+ +Q   R+ L+GEEL EY
Sbjct: 344 FIKWSSDPKNLILFTNYIPEGSLASKVINIASNKSSGSNKTIEIQQGSRVPLQGEELLEY 403

Query: 146 QKK 148
           +++
Sbjct: 404 EQR 406



 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 37/134 (27%), Positives = 59/134 (44%), Gaps = 38/134 (28%)

Query: 144 EYQKKKDKEAKDKQEKEKIPPHDTSFINELQLSDFKQTLQRNGID-CEFMDGVLICCRGT 202
           EY  + +    D    E+   HD SFI +++L+D KQ L + GI   +F  G++ C    
Sbjct: 656 EYNPENNSSYLDIIPSEQNNGHDESFIGDIKLADLKQVLVKAGIKKVQFDQGIINC---- 711

Query: 203 VAVRRVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEY 262
                               DL + W              R D   +++I ++G +SDEY
Sbjct: 712 -------------------NDLVYIW--------------REDVGGNSIINVDGIISDEY 738

Query: 263 YRVQQLLYDQYAIL 276
           Y V++LLY Q+ I+
Sbjct: 739 YLVKELLYRQFQIV 752



 Score = 47.4 bits (111), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 18/34 (52%), Positives = 26/34 (76%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITN 240
           +V+L S+ D+E G+SRDLF +W S P+N I+ TN
Sbjct: 326 KVILTSSQDLETGYSRDLFIKWSSDPKNLILFTN 359


>gi|172087214|ref|XP_001913149.1| cleavage and polyadenylation factor [Oikopleura dioica]
 gi|18029276|gb|AAL56454.1| cleavage and polyadenylation factor-like protein [Oikopleura
           dioica]
          Length = 765

 Score =  102 bits (254), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 50/121 (41%), Positives = 78/121 (64%), Gaps = 2/121 (1%)

Query: 24  LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTP 81
           +  S  E AK  IEWMS+K++  F   R N F    +KL  S+ E+A++P PKV+L +  
Sbjct: 271 VATSTIEAAKGMIEWMSEKIISKFTHKRENIFDLTKMKLRSSIQEIARIPEPKVILATPM 330

Query: 82  DMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEE 141
           DM+ GFSR+LF    + P+N++I++ R++ G+L R +IE  G  ++TL++ KR+ L G E
Sbjct: 331 DMDTGFSRELFVMMAAHPKNAVIMSGRSTKGSLCRKIIENEGMSSITLEMNKRLPLVGPE 390

Query: 142 L 142
           L
Sbjct: 391 L 391



 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 39/171 (22%), Positives = 71/171 (41%), Gaps = 51/171 (29%)

Query: 117 DLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKDKEAKDKQEKEKIPP-----------H 165
           + + +G       +++ R+   G  LE   +  + +   K E + IP            H
Sbjct: 635 NFVRVGSKDIEVARIRGRVDYFGGRLELEAENGENDEPKKLEIDDIPTLQPVTNNYSSGH 694

Query: 166 DTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGFSRDLF 225
           D+ FIN+ +L++ K  L   G+  EF+ G L+ C   V+++R                  
Sbjct: 695 DSIFINDTKLTELKSNLIDCGMQAEFIGGNLV-CNNKVSIKR------------------ 735

Query: 226 FQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                S + VI +EG LS++Y+ V++++YD YAI+
Sbjct: 736 ---------------------SANGVIQVEGTLSEDYFIVRKMVYDNYAIV 765



 Score = 39.7 bits (91), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 18/53 (33%), Positives = 31/53 (58%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLS 259
           +V+L +  DM+ GFSR+LF    + P+N++I++ R+        +I  EG  S
Sbjct: 323 KVILATPMDMDTGFSRELFVMMAAHPKNAVIMSGRSTKGSLCRKIIENEGMSS 375


>gi|313232558|emb|CBY19228.1| unnamed protein product [Oikopleura dioica]
          Length = 764

 Score =  102 bits (254), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 50/121 (41%), Positives = 78/121 (64%), Gaps = 2/121 (1%)

Query: 24  LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTP 81
           +  S  E AK  IEWMS+K++  F   R N F    +KL  S+ E+A++P PKV+L +  
Sbjct: 270 VATSTIEAAKGMIEWMSEKIISKFTHKRENIFDLTKMKLRSSIQEIARIPEPKVILATPM 329

Query: 82  DMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEE 141
           DM+ GFSR+LF    + P+N++I++ R++ G+L R +IE  G  ++TL++ KR+ L G E
Sbjct: 330 DMDTGFSRELFVMMAAHPKNAVIMSGRSTKGSLCRKIIENEGMSSITLEMNKRLPLVGPE 389

Query: 142 L 142
           L
Sbjct: 390 L 390



 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 39/171 (22%), Positives = 71/171 (41%), Gaps = 51/171 (29%)

Query: 117 DLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKDKEAKDKQEKEKIPP-----------H 165
           + + +G       +++ R+   G  LE   +  + +   K E + IP            H
Sbjct: 634 NFVRVGSKDIEVARIRGRVDYFGGRLELEAENGENDEPKKLEIDDIPTLQPVTNNYSSGH 693

Query: 166 DTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGFSRDLF 225
           D+ FIN+ +L++ K  L   G+  EF+ G L+ C   V+++R                  
Sbjct: 694 DSIFINDTKLTELKSNLIDCGMHAEFIGGNLV-CNNKVSIKR------------------ 734

Query: 226 FQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                S + VI +EG LS++Y+ V++++YD YAI+
Sbjct: 735 ---------------------SANGVIQVEGTLSEDYFIVRKMVYDNYAIV 764



 Score = 39.7 bits (91), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 18/53 (33%), Positives = 31/53 (58%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLS 259
           +V+L +  DM+ GFSR+LF    + P+N++I++ R+        +I  EG  S
Sbjct: 322 KVILATPMDMDTGFSRELFVMMAAHPKNAVIMSGRSTKGSLCRKIIENEGMSS 374


>gi|256077072|ref|XP_002574832.1| cleavage and polyadenylation specificity factor [Schistosoma
           mansoni]
          Length = 1063

 Score =  100 bits (249), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 42/77 (54%), Positives = 59/77 (76%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +EWMS+K+M+SFE  R+NPFHF+H++LCH+L +L  V  PKVVL S  D+ CGFSR LF 
Sbjct: 85  VEWMSEKVMRSFEDQRSNPFHFRHMQLCHTLEQLDAVSEPKVVLSSLSDLSCGFSRQLFA 144

Query: 94  QWCSSPENSIIITNRTS 110
           +W  +  N+II+T++ S
Sbjct: 145 EWADNDLNTIILTSQES 161



 Score = 41.2 bits (95), Expect = 0.43,   Method: Compositional matrix adjust.
 Identities = 18/39 (46%), Positives = 27/39 (69%)

Query: 203 VAVRRVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 241
           V+  +VVL S  D+ CGFSR LF +W  +  N+II+T++
Sbjct: 121 VSEPKVVLSSLSDLSCGFSRQLFAEWADNDLNTIILTSQ 159



 Score = 39.3 bits (90), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 20/46 (43%), Positives = 28/46 (60%), Gaps = 1/46 (2%)

Query: 162 IPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRR 207
           I  H T F+NE +LSD KQ L   G+  EF+ G+L+     VA++R
Sbjct: 655 IGQHKTVFVNEPKLSDLKQLLLSQGLMAEFVSGILV-VDNCVAIKR 699


>gi|225464483|ref|XP_002268591.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           2 [Vitis vinifera]
 gi|302143847|emb|CBI22708.3| unnamed protein product [Vitis vinifera]
          Length = 740

 Score =  100 bits (249), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 52/134 (38%), Positives = 79/134 (58%), Gaps = 1/134 (0%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVP-SPKVVLVSTPDMECGFSRDLF 92
           +EWMSD + KSFE  R+N F  KHV L  S +EL KVP  PK+VL S   +E GFS D+F
Sbjct: 280 LEWMSDSIAKSFEHTRDNAFLLKHVTLLISKSELEKVPDGPKIVLASMASLEAGFSHDIF 339

Query: 93  FQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKDKE 152
            +W +  +N ++ + R    TLAR L      + + + + KR+ L GEEL  Y++++++ 
Sbjct: 340 VEWATDAKNLVLFSERGQFATLARMLQADPPPKAVKVTMSKRVPLVGEELAAYEEEQERI 399

Query: 153 AKDKQEKEKIPPHD 166
            K++  K  +   D
Sbjct: 400 KKEEALKASLSKED 413



 Score = 54.7 bits (130), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 36/112 (32%), Positives = 50/112 (44%), Gaps = 30/112 (26%)

Query: 165 HDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGFSRDL 224
           HDT F+ +++++DFKQ L   GI  EF  G L C    V +R+V                
Sbjct: 659 HDTVFVGDIKMADFKQFLASKGIQVEFSGGALRCGE-YVTLRKV---------------- 701

Query: 225 FFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                S      II             IV+EG L DEYY++++ LY QY +L
Sbjct: 702 --GDASQKGGGAIIQQ-----------IVMEGPLCDEYYKIREYLYSQYYLL 740


>gi|350646480|emb|CCD58879.1| cleavage and polyadenylation specificity factor,putative
           [Schistosoma mansoni]
          Length = 729

 Score =  100 bits (248), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 42/77 (54%), Positives = 59/77 (76%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +EWMS+K+M+SFE  R+NPFHF+H++LCH+L +L  V  PKVVL S  D+ CGFSR LF 
Sbjct: 85  VEWMSEKVMRSFEDQRSNPFHFRHMQLCHTLEQLDAVSEPKVVLSSLSDLSCGFSRQLFA 144

Query: 94  QWCSSPENSIIITNRTS 110
           +W  +  N+II+T++ S
Sbjct: 145 EWADNDLNTIILTSQES 161



 Score = 48.1 bits (113), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 33/115 (28%), Positives = 50/115 (43%), Gaps = 40/115 (34%)

Query: 162 IPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGFS 221
           I  H T F+NE +LSD KQ L   G+  EF+ G+L+     VA++R              
Sbjct: 655 IGQHKTVFVNEPKLSDLKQLLLSQGLMAEFVSGILV-VDNCVAIKR-------------- 699

Query: 222 RDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                    S++  ++LEG L   Y+ V+++LY Q+AIL
Sbjct: 700 -------------------------SEAGKLLLEGLLCGTYFEVRRILYQQFAIL 729



 Score = 41.2 bits (95), Expect = 0.45,   Method: Compositional matrix adjust.
 Identities = 21/55 (38%), Positives = 33/55 (60%), Gaps = 4/55 (7%)

Query: 203 VAVRRVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRNR----ADDSDSNVIV 253
           V+  +VVL S  D+ CGFSR LF +W  +  N+II+T++       D+S S  ++
Sbjct: 121 VSEPKVVLSSLSDLSCGFSRQLFAEWADNDLNTIILTSQESVITGVDESSSQSLI 175


>gi|256077070|ref|XP_002574831.1| cleavage and polyadenylation specificity factor [Schistosoma
           mansoni]
          Length = 928

 Score = 99.8 bits (247), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 42/77 (54%), Positives = 59/77 (76%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +EWMS+K+M+SFE  R+NPFHF+H++LCH+L +L  V  PKVVL S  D+ CGFSR LF 
Sbjct: 284 VEWMSEKVMRSFEDQRSNPFHFRHMQLCHTLEQLDAVSEPKVVLSSLSDLSCGFSRQLFA 343

Query: 94  QWCSSPENSIIITNRTS 110
           +W  +  N+II+T++ S
Sbjct: 344 EWADNDLNTIILTSQES 360



 Score = 47.8 bits (112), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 33/115 (28%), Positives = 50/115 (43%), Gaps = 40/115 (34%)

Query: 162 IPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGFS 221
           I  H T F+NE +LSD KQ L   G+  EF+ G+L+     VA++R              
Sbjct: 854 IGQHKTVFVNEPKLSDLKQLLLSQGLMAEFVSGILV-VDNCVAIKR-------------- 898

Query: 222 RDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                    S++  ++LEG L   Y+ V+++LY Q+AIL
Sbjct: 899 -------------------------SEAGKLLLEGLLCGTYFEVRRILYQQFAIL 928



 Score = 41.6 bits (96), Expect = 0.41,   Method: Compositional matrix adjust.
 Identities = 21/56 (37%), Positives = 33/56 (58%), Gaps = 4/56 (7%)

Query: 202 TVAVRRVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRNR----ADDSDSNVIV 253
            V+  +VVL S  D+ CGFSR LF +W  +  N+II+T++       D+S S  ++
Sbjct: 319 AVSEPKVVLSSLSDLSCGFSRQLFAEWADNDLNTIILTSQESVITGVDESSSQSLI 374


>gi|452822529|gb|EME29547.1| cleavage and polyadenylation specificity factor subunit 2
           [Galdieria sulphuraria]
          Length = 747

 Score = 99.8 bits (247), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 55/153 (35%), Positives = 87/153 (56%), Gaps = 14/153 (9%)

Query: 2   SDKLMKSFEGARNNPFHFKHVKLCHSLAELAKIEWMSDKLMKSFEGARNNPFHFKHVKLC 61
           ++KL  S+  A      F  +    S+     +EWMSD ++  F+  R NPFH KH+ LC
Sbjct: 273 TEKLGSSYAVAIIEHVSFNTIDFAKSM-----MEWMSDAVINKFDTTRENPFHLKHIHLC 327

Query: 62  HSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIEL 121
           HS +EL+ + SPKV+L S   +ECGFSR+L  +  S+ +N +I+ +R  P TLA  +  +
Sbjct: 328 HSRSELSSLLSPKVILTSVASLECGFSRELVVEMVSNKKNKLILVDRLEPNTLAHSIYNV 387

Query: 122 GGNRT---------LTLQVKKRIRLEGEELEEY 145
             + +         + L++ +R+ L+G ELEEY
Sbjct: 388 LEDESEGKTVQLPRIALRLNRRVPLQGAELEEY 420


>gi|168010331|ref|XP_001757858.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162691134|gb|EDQ77498.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 724

 Score = 99.0 bits (245), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 50/126 (39%), Positives = 75/126 (59%), Gaps = 3/126 (2%)

Query: 24  LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPS-PKVVLVST 80
           + +S  E AK  +EWMSD + +SF  +R N F  K++KLCH   E  ++PS PKVV  S 
Sbjct: 268 VSYSTVEFAKSLLEWMSDSIARSFGSSRENSFLLKYLKLCHDRKEFDELPSGPKVVFASM 327

Query: 81  PDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGE 140
             +E GF+RDLF +W +   N ++ T R   GTLA+ L      + + + + ++I L GE
Sbjct: 328 ASLEGGFARDLFVEWATDSRNLVLFTERGQMGTLAKKLQAEPPPKIVKVTMSQKIPLTGE 387

Query: 141 ELEEYQ 146
           EL+ Y+
Sbjct: 388 ELQAYE 393



 Score = 59.7 bits (143), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 38/117 (32%), Positives = 51/117 (43%), Gaps = 39/117 (33%)

Query: 160 EKIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECG 219
           E  PPH + F+ +L+L+DFKQ L   GI  EF  GVL C     AVRR            
Sbjct: 647 ETPPPHKSVFVGDLRLADFKQLLATKGIQAEFAGGVLRCGDA-FAVRR------------ 693

Query: 220 FSRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                        S  +V+EG LS+EYY+++ LLY Q+ +L
Sbjct: 694 --------------------------SGGSQQLVIEGPLSEEYYKLRDLLYSQFYML 724


>gi|255070137|ref|XP_002507150.1| predicted protein [Micromonas sp. RCC299]
 gi|255070139|ref|XP_002507151.1| predicted protein [Micromonas sp. RCC299]
 gi|226522425|gb|ACO68408.1| predicted protein [Micromonas sp. RCC299]
 gi|226522426|gb|ACO68409.1| predicted protein [Micromonas sp. RCC299]
          Length = 808

 Score = 98.2 bits (243), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 49/136 (36%), Positives = 82/136 (60%), Gaps = 8/136 (5%)

Query: 22  VKLCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVP-SPKVVLV 78
            K+ ++  E A+  +EWM + + + F+  R+N F+ +H+KLCHS+ E   +P  PKVVL 
Sbjct: 271 TKVAYNTLEFARSHLEWMGEHVGQYFDRERHNAFNTRHLKLCHSIDEFRALPQGPKVVLA 330

Query: 79  STPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIEL-----GGNRTLTLQVKK 133
           S   ++ G SR +F +W   P N I+ T+R  PG+L+R++  L     G    L + + +
Sbjct: 331 SFGSLDAGASRHIFVEWAPDPRNLIVFTDRLQPGSLSREVCRLSQLPPGARLPLRISLSQ 390

Query: 134 RIRLEGEELEEYQKKK 149
           R++L G+EL E+Q K+
Sbjct: 391 RLKLVGDELLEWQGKE 406


>gi|449446027|ref|XP_004140773.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           2-like [Cucumis sativus]
          Length = 738

 Score = 95.9 bits (237), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 66/207 (31%), Positives = 100/207 (48%), Gaps = 27/207 (13%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVP-SPKVVLVSTPDMECGFSRDLF 92
           +EWMSD + KSFE  RNN F  KHV L  + +EL   P  PKVVL S   +E G+S D+F
Sbjct: 280 LEWMSDTIAKSFEHTRNNAFLLKHVTLLINKSELDNAPDGPKVVLASMASLEAGYSHDIF 339

Query: 93  FQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKDKE 152
             W    +N ++ + R   GTLAR L      + + + V KR+ L G+EL  Y+++++++
Sbjct: 340 VDWAMDAKNLVLFSERGQFGTLARMLQADPPPKAVKVTVSKRVPLTGDELIAYEEEQNRK 399

Query: 153 AKDKQEKEKIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVS 212
              K+E  K      S + E      +Q+   +G D +  D ++I     VA        
Sbjct: 400 ---KEEALK-----ASLLKE------EQSKASHGADNDTGDPMIIDASSNVA-------- 437

Query: 213 TPDM---ECGFSRDLFFQWCSSPENSI 236
            PD+     G  RD+       P   +
Sbjct: 438 -PDVGSSHGGAYRDILIDGFVPPSTGV 463


>gi|449528453|ref|XP_004171219.1| PREDICTED: LOW QUALITY PROTEIN: cleavage and polyadenylation
           specificity factor subunit 2-like, partial [Cucumis
           sativus]
          Length = 501

 Score = 95.1 bits (235), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 66/207 (31%), Positives = 99/207 (47%), Gaps = 27/207 (13%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVP-SPKVVLVSTPDMECGFSRDLF 92
           +EWMSD + KSFE  RNN F  KHV L  + +EL   P  PKVVL S   +E G+S D F
Sbjct: 280 LEWMSDTIAKSFEHTRNNAFLLKHVTLLINKSELDNAPDGPKVVLASMASLEAGYSHDXF 339

Query: 93  FQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKDKE 152
             W    +N ++ + R   GTLAR L      + + + V KR+ L G+EL  Y+++++++
Sbjct: 340 VDWAMDAKNLVLFSERGQFGTLARMLQADPPPKAVKVTVSKRVPLTGDELIAYEEEQNRK 399

Query: 153 AKDKQEKEKIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVS 212
              K+E  K      S + E      +Q+   +G D +  D ++I     VA        
Sbjct: 400 ---KEEALK-----ASLLKE------EQSKASHGADNDTGDPMIIDASSNVA-------- 437

Query: 213 TPDM---ECGFSRDLFFQWCSSPENSI 236
            PD+     G  RD+       P   +
Sbjct: 438 -PDVGSSHGGAYRDILIDGFVPPSTGV 463


>gi|449018596|dbj|BAM81998.1| cleavage and polyadenylation specific factor 2, 100kD subunit
           [Cyanidioschyzon merolae strain 10D]
          Length = 884

 Score = 94.4 bits (233), Expect = 4e-17,   Method: Composition-based stats.
 Identities = 53/137 (38%), Positives = 85/137 (62%), Gaps = 7/137 (5%)

Query: 20  KHVKLCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVL 77
           +HV + H++ E AK  IEWMSD ++ +F+  R NPF  +HV++ H L +   +PSPKVV+
Sbjct: 270 QHVGI-HTI-EFAKSLIEWMSDAVVSAFDSRRENPFRLRHVQVVHGLDQADALPSPKVVM 327

Query: 78  VSTPDMECGFSRDLFFQWCSS-PENSIIITNRTSPGTLA-RDLIELGGNRTLT-LQVKKR 134
              P ++ GFSR LF Q  ++ P   +++++R   GT A R  +E    R    L   +R
Sbjct: 328 APLPSLDYGFSRVLFLQRIAADPRAMVLMSDRLESGTFAFRLAVEKEKLRVREPLTYAER 387

Query: 135 IRLEGEELEEYQKKKDK 151
           + L+GEELE +Q++++K
Sbjct: 388 VPLQGEELERWQREQEK 404


>gi|15237845|ref|NP_197776.1| cleavage and polyadenylation specificity factor subunit 2
           [Arabidopsis thaliana]
 gi|18203240|sp|Q9LKF9.2|CPSF2_ARATH RecName: Full=Cleavage and polyadenylation specificity factor
           subunit 2; AltName: Full=Cleavage and polyadenylation
           specificity factor 100 kDa subunit; Short=AtCPSF100;
           Short=CPSF 100 kDa subunit; AltName: Full=Protein EMBRYO
           DEFECTIVE 1265; AltName: Full=Protein ENHANCED SILENCING
           PHENOTYPE 5
 gi|10176855|dbj|BAB10061.1| cleavage and polyadenylation specificity factor [Arabidopsis
           thaliana]
 gi|14334618|gb|AAK59487.1| putative cleavage and polyadenylation specificity factor
           [Arabidopsis thaliana]
 gi|28393921|gb|AAO42368.1| putative cleavage and polyadenylation specificity factor
           [Arabidopsis thaliana]
 gi|332005845|gb|AED93228.1| cleavage and polyadenylation specificity factor subunit 2
           [Arabidopsis thaliana]
          Length = 739

 Score = 94.4 bits (233), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 46/119 (38%), Positives = 72/119 (60%), Gaps = 1/119 (0%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVP-SPKVVLVSTPDMECGFSRDLF 92
           +EWMSD + KSFE +R+N F  +HV L  +  +L   P  PKVVL S   +E GF+R++F
Sbjct: 281 LEWMSDSISKSFETSRDNAFLLRHVTLLINKTDLDNAPPGPKVVLASMASLEAGFAREIF 340

Query: 93  FQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKDK 151
            +W + P N ++ T     GTLAR L      + + + + KR+ L GEEL  Y++++++
Sbjct: 341 VEWANDPRNLVLFTETGQFGTLARMLQSAPPPKFVKVTMSKRVPLAGEELIAYEEEQNR 399



 Score = 47.8 bits (112), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 30/113 (26%), Positives = 50/113 (44%), Gaps = 29/113 (25%)

Query: 164 PHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGFSRD 223
           PH    + +L+++DFKQ L   G+  EF  G  + C   V +R+V     P  + G    
Sbjct: 656 PHKPVLVGDLKIADFKQFLSSKGVQVEFAGGGALRCGEYVTLRKV----GPTGQKG---- 707

Query: 224 LFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                 S P+                  I++EG L ++YY+++  LY Q+ +L
Sbjct: 708 ----GASGPQQ-----------------ILIEGPLCEDYYKIRDYLYSQFYLL 739



 Score = 38.5 bits (88), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 13/37 (35%), Positives = 23/37 (62%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRNR 243
           +VVL S   +E GF+R++F +W + P N ++ T   +
Sbjct: 322 KVVLASMASLEAGFAREIFVEWANDPRNLVLFTETGQ 358


>gi|297808393|ref|XP_002872080.1| CPSF100 [Arabidopsis lyrata subsp. lyrata]
 gi|297317917|gb|EFH48339.1| CPSF100 [Arabidopsis lyrata subsp. lyrata]
          Length = 739

 Score = 94.4 bits (233), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 46/119 (38%), Positives = 72/119 (60%), Gaps = 1/119 (0%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVP-SPKVVLVSTPDMECGFSRDLF 92
           +EWMSD + KSFE +R+N F  +HV L  +  +L   P  PKVVL S   +E GF+R++F
Sbjct: 281 LEWMSDSISKSFETSRDNAFLLRHVTLLINKTDLDNAPPGPKVVLASMASLEAGFAREIF 340

Query: 93  FQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKDK 151
            +W + P N ++ T     GTLAR L      + + + + KR+ L GEEL  Y++++++
Sbjct: 341 VEWANDPRNLVLFTETGQFGTLARMLQSAPPPKFVKVTMSKRVPLAGEELIAYEEEQNR 399



 Score = 48.1 bits (113), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 30/113 (26%), Positives = 50/113 (44%), Gaps = 29/113 (25%)

Query: 164 PHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGFSRD 223
           PH    + +L+++DFKQ L   G+  EF  G  + C   V +R+V     P  + G    
Sbjct: 656 PHKPVLVGDLKIADFKQFLSSKGVQVEFAGGGALRCGEYVTLRKV----GPTGQKG---- 707

Query: 224 LFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                 S P+                  I++EG L ++YY+++  LY Q+ +L
Sbjct: 708 ----GASGPQQ-----------------ILIEGPLCEDYYKIRDYLYSQFYLL 739



 Score = 38.5 bits (88), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 13/37 (35%), Positives = 23/37 (62%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRNR 243
           +VVL S   +E GF+R++F +W + P N ++ T   +
Sbjct: 322 KVVLASMASLEAGFAREIFVEWANDPRNLVLFTETGQ 358


>gi|9082326|gb|AAF82809.1|AF283277_1 polyadenylation cleavage/specificity factor 100 kDa subunit
           [Arabidopsis thaliana]
          Length = 739

 Score = 94.0 bits (232), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 46/119 (38%), Positives = 72/119 (60%), Gaps = 1/119 (0%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVP-SPKVVLVSTPDMECGFSRDLF 92
           +EWMSD + KSFE +R+N F  +HV L  +  +L   P  PKVVL S   +E GF+R++F
Sbjct: 281 LEWMSDSISKSFETSRDNAFLLRHVTLLINKTDLDNAPPGPKVVLASMASLEAGFAREIF 340

Query: 93  FQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKDK 151
            +W + P N ++ T     GTLAR L      + + + + KR+ L GEEL  Y++++++
Sbjct: 341 VEWANDPRNLVLFTETGQFGTLARMLQSAPPPKFVKVTMSKRVPLAGEELIAYEEEQNR 399



 Score = 47.8 bits (112), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 30/113 (26%), Positives = 50/113 (44%), Gaps = 29/113 (25%)

Query: 164 PHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGFSRD 223
           PH    + +L+++DFKQ L   G+  EF  G  + C   V +R+V     P  + G    
Sbjct: 656 PHKPVLVGDLKIADFKQFLSSKGVQVEFAGGGALRCGEYVTLRKV----GPTGQKG---- 707

Query: 224 LFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                 S P+                  I++EG L ++YY+++  LY Q+ +L
Sbjct: 708 ----GASGPQQ-----------------ILIEGPLCEDYYKIRDYLYSQFYLL 739



 Score = 38.5 bits (88), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 13/37 (35%), Positives = 23/37 (62%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRNR 243
           +VVL S   +E GF+R++F +W + P N ++ T   +
Sbjct: 322 KVVLASMASLEAGFAREIFVEWANDPRNLVLFTETGQ 358


>gi|255553723|ref|XP_002517902.1| cleavage and polyadenylation specificity factor, putative [Ricinus
           communis]
 gi|223542884|gb|EEF44420.1| cleavage and polyadenylation specificity factor, putative [Ricinus
           communis]
          Length = 740

 Score = 93.2 bits (230), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 48/125 (38%), Positives = 76/125 (60%), Gaps = 1/125 (0%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPS-PKVVLVSTPDMECGFSRDLF 92
           +EWMSD + KSFE +R+N F  KHV L  +  EL   P+ PKVVL S   +E GFS D+F
Sbjct: 281 LEWMSDSIAKSFETSRDNAFLLKHVTLLINKNELDNAPNVPKVVLASMASLEAGFSHDIF 340

Query: 93  FQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKDKE 152
            +W +  +N ++ T R   GTLAR L      + + + + +R+ L G+EL  Y++++ + 
Sbjct: 341 VEWAADVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRRVPLVGDELIAYEEEQKRL 400

Query: 153 AKDKQ 157
            K+++
Sbjct: 401 KKEEE 405



 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 30/114 (26%), Positives = 52/114 (45%), Gaps = 30/114 (26%)

Query: 163 PPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGFSR 222
           PPH +  + +L+++DFKQ L   G+  EF  G L C    V +R+V  ++          
Sbjct: 657 PPHKSVLVGDLKMADFKQFLASKGVQVEFAGGALRCGE-YVTLRKVGNIN---------- 705

Query: 223 DLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                               +   S +  IV+EG L ++YY++++ LY Q+ +L
Sbjct: 706 -------------------QKGGGSGTQQIVIEGPLCEDYYKIREYLYSQFYLL 740



 Score = 39.7 bits (91), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 15/39 (38%), Positives = 24/39 (61%)

Query: 205 VRRVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRNR 243
           V +VVL S   +E GFS D+F +W +  +N ++ T R +
Sbjct: 320 VPKVVLASMASLEAGFSHDIFVEWAADVKNLVLFTERGQ 358


>gi|66826811|ref|XP_646760.1| beta-lactamase domain-containing protein [Dictyostelium discoideum
           AX4]
 gi|74858209|sp|Q55BS1.1|CPSF2_DICDI RecName: Full=Cleavage and polyadenylation specificity factor
           subunit 2; AltName: Full=Cleavage and polyadenylation
           specificity factor 100 kDa subunit; Short=CPSF 100 kDa
           subunit
 gi|60474609|gb|EAL72546.1| beta-lactamase domain-containing protein [Dictyostelium discoideum
           AX4]
          Length = 784

 Score = 92.4 bits (228), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 49/132 (37%), Positives = 80/132 (60%), Gaps = 6/132 (4%)

Query: 32  AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVP-SPKVVLVSTPDMECGFSRD 90
           +++E+MS      FE    NPF FKH+K+  SL EL ++P + KV+L S+ D+E GFSR+
Sbjct: 285 SQLEFMSSTASVKFEQNIENPFSFKHIKILSSLEELQELPDTNKVILTSSQDLETGFSRE 344

Query: 91  LFFQWCSSPENSIIITNRTSPGTLARDLIEL-----GGNRTLTLQVKKRIRLEGEELEEY 145
           LF QWCS P+  I+ T +    +LA  LI+      G  + + +    R+ L G+EL +Y
Sbjct: 345 LFIQWCSDPKTLILFTQKIPKDSLADKLIKQYSTPNGRGKCIEIVQGSRVPLTGDELLQY 404

Query: 146 QKKKDKEAKDKQ 157
           + ++ K+ ++K+
Sbjct: 405 EMEQAKQREEKR 416



 Score = 51.6 bits (122), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 33/112 (29%), Positives = 50/112 (44%), Gaps = 36/112 (32%)

Query: 165 HDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGFSRDL 224
           HD SFI +++LSD KQ L   GI  +F  G+L  C G V + R                 
Sbjct: 709 HDESFIGDIKLSDLKQVLVNAGIQVQFDQGIL-NCGGLVYIWR----------------- 750

Query: 225 FFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                               D   +++I ++G +SDEYY +++LLY Q+ I+
Sbjct: 751 ------------------DEDHGGNSIINVDGIISDEYYLIKELLYKQFQIV 784



 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 18/35 (51%), Positives = 26/35 (74%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 241
           +V+L S+ D+E GFSR+LF QWCS P+  I+ T +
Sbjct: 328 KVILTSSQDLETGFSRELFIQWCSDPKTLILFTQK 362


>gi|224121102|ref|XP_002330904.1| predicted protein [Populus trichocarpa]
 gi|222872726|gb|EEF09857.1| predicted protein [Populus trichocarpa]
          Length = 740

 Score = 91.7 bits (226), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 54/162 (33%), Positives = 89/162 (54%), Gaps = 4/162 (2%)

Query: 5   LMKSFEGAR--NNPFHFKHVKLCHSLAELAK-IEWMSDKLMKSFEGARNNPFHFKHVKLC 61
           +++ F G R  N P  F       ++  +   +EWMSD + KSFE +R+N F  KHV L 
Sbjct: 248 ILEQFWGQRFLNYPIFFLSYVSSSTIDYIKSFLEWMSDSIAKSFETSRDNAFLMKHVTLL 307

Query: 62  HSLAELAKVPS-PKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIE 120
            S  EL    + PKVVL S   +E GFS D+F +W +  +N ++ T R   GTLAR L  
Sbjct: 308 ISKDELDNASTGPKVVLASVASLEAGFSHDIFAEWAADVKNLVLFTERGQFGTLARMLQA 367

Query: 121 LGGNRTLTLQVKKRIRLEGEELEEYQKKKDKEAKDKQEKEKI 162
               + + + + +R+ L G+EL  Y++++ +  ++++ K  +
Sbjct: 368 DPPPKAVKMTMSRRVPLVGDELIAYEEEQKRLKREEELKASL 409



 Score = 53.5 bits (127), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 31/114 (27%), Positives = 52/114 (45%), Gaps = 30/114 (26%)

Query: 163 PPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGFSR 222
           PPH +  + +L+++DFKQ L   G+  EF  G L C    V +R+   V  P  + G   
Sbjct: 657 PPHKSVLVGDLKMADFKQFLASKGVQVEFAGGALRCGE-YVTLRK---VGNPSQKGG--- 709

Query: 223 DLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                   S +  I++EG L ++YY++++ LY Q+ +L
Sbjct: 710 -----------------------ASGTQQIIIEGPLCEDYYKIREYLYSQFYLL 740



 Score = 38.5 bits (88), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 14/37 (37%), Positives = 23/37 (62%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRNR 243
           +VVL S   +E GFS D+F +W +  +N ++ T R +
Sbjct: 321 KVVLASVASLEAGFSHDIFAEWAADVKNLVLFTERGQ 357


>gi|145340766|ref|XP_001415490.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144575713|gb|ABO93782.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 715

 Score = 91.7 bits (226), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 49/135 (36%), Positives = 75/135 (55%), Gaps = 5/135 (3%)

Query: 32  AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAEL-AKVPSPKVVLVSTPDMECGFSRD 90
           + +EWM D +  +FE  R NPF+ K + LCHS+ EL A  P PKVVL S   +E G SR 
Sbjct: 262 SHLEWMGDHVTNAFERRRENPFNTKFLTLCHSMEELQALPPGPKVVLASFGSLEAGPSRH 321

Query: 91  LFFQWCSSPENSIIITNRTSPGTLARDLIELG----GNRTLTLQVKKRIRLEGEELEEYQ 146
           LF +W     N +I+T +   G+L   +++L       + + L + +RI LEG EL E++
Sbjct: 322 LFAEWAEDKSNLVILTGQPEHGSLTEQVVQLSAKATAKKKIKLTLSRRIPLEGSELAEHE 381

Query: 147 KKKDKEAKDKQEKEK 161
             +      + EK++
Sbjct: 382 SSRKSSTSTELEKKE 396



 Score = 65.5 bits (158), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 37/108 (34%), Positives = 51/108 (47%), Gaps = 39/108 (36%)

Query: 169 FINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGFSRDLFFQW 228
           F+ +L+LSDF+Q L + GI  EF DGVL+C  G V VR+                     
Sbjct: 647 FLGDLRLSDFRQALAQEGIIAEFADGVLVCANGRVTVRK--------------------- 685

Query: 229 CSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                            D D  ++V EG LS +Y+ V+Q+LY QY+IL
Sbjct: 686 -----------------DGDEKLVV-EGALSQDYFEVRQILYSQYSIL 715


>gi|290981012|ref|XP_002673225.1| predicted protein [Naegleria gruberi]
 gi|284086807|gb|EFC40481.1| predicted protein [Naegleria gruberi]
          Length = 808

 Score = 91.7 bits (226), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 50/153 (32%), Positives = 81/153 (52%), Gaps = 38/153 (24%)

Query: 33  KIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAEL------------------------- 67
           ++EWMSDK+MK F+  R NPF F++  +CH++ EL                         
Sbjct: 327 QLEWMSDKIMKGFDEKRENPFKFQYFSVCHNVEELMDKLQKKEQMRMMMENQMNDEDEET 386

Query: 68  --AKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLI------ 119
                 +P VVL S+  ++ G++R+LF +WC    N ++   R++P +L+R LI      
Sbjct: 387 ATTGKHTPMVVLASSNTLDYGYARELFVKWCEDQRNLVMFIERSAPNSLSRKLINKLRAK 446

Query: 120 ---ELGGNRTLTLQVKKRIRLEGEELEEYQKKK 149
               L  N +LTL   +R+ L+GEELE+Y+K++
Sbjct: 447 KSERLDENMSLTLY--RRVALKGEELEKYEKEQ 477


>gi|384251490|gb|EIE24968.1| hypothetical protein COCSUDRAFT_83661 [Coccomyxa subellipsoidea
           C-169]
          Length = 731

 Score = 90.1 bits (222), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 50/148 (33%), Positives = 86/148 (58%), Gaps = 9/148 (6%)

Query: 24  LCHSLAELA--KIEWMSDKLMKSFEGARNNPFHF---KHVKLCHSLAELAKVP-SPKVVL 77
           + +++ ELA  ++EWMS  + + FE  + NPF     K +KLC +  ELAK+P  P+VV+
Sbjct: 270 MAYNVLELASSQLEWMSHYIGQMFERTKQNPFSVRQAKKLKLCRTTEELAKLPPGPRVVM 329

Query: 78  VSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDL---IELGGNRTLTLQVKKR 134
            + P +E G SR L  +W ++P N I+   R    TLA  L   ++ G   T+ +++ KR
Sbjct: 330 ATLPSLEAGASRQLLTEWATNPANLILFPGRAPNDTLAGLLQQNMQSGQPFTVPIRLSKR 389

Query: 135 IRLEGEELEEYQKKKDKEAKDKQEKEKI 162
           + L+G EL+ +Q+ +     +++E+  I
Sbjct: 390 MPLQGAELQAWQESQTAHVLEEEEEPAI 417


>gi|356530856|ref|XP_003533995.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           2-like isoform 1 [Glycine max]
          Length = 736

 Score = 89.7 bits (221), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 49/123 (39%), Positives = 71/123 (57%), Gaps = 1/123 (0%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVP-SPKVVLVSTPDMECGFSRDLF 92
           +EWMSD + KSFE  R N F  K+V L  +  EL   P  PKVVL S   +E GFS D+F
Sbjct: 280 LEWMSDTIAKSFEKTRENIFLLKYVTLLINKTELDNAPDGPKVVLASMASLEAGFSHDIF 339

Query: 93  FQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKDKE 152
            +W +  +N ++ T R    TLAR L      + + + V KR+ L GEEL  Y++++++ 
Sbjct: 340 VEWANDVKNLVLFTERGQFATLARMLQADPPPKAVKVVVSKRVPLVGEELIAYEEEQNRI 399

Query: 153 AKD 155
            K+
Sbjct: 400 KKE 402



 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 31/114 (27%), Positives = 49/114 (42%), Gaps = 30/114 (26%)

Query: 163 PPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGFSR 222
           PPH +  + +L+L+D KQ L   G+  EF  G L C    V +R+V   S          
Sbjct: 653 PPHKSVLVGDLKLADIKQFLSSKGVQVEFAGGALRCGE-YVTLRKVGDAS---------- 701

Query: 223 DLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                               +   S +  IV+EG L ++YY+++  LY Q+ +L
Sbjct: 702 -------------------QKGGGSGAQQIVIEGPLCEDYYKIRDYLYSQFYLL 736



 Score = 38.1 bits (87), Expect = 4.1,   Method: Compositional matrix adjust.
 Identities = 14/37 (37%), Positives = 23/37 (62%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRNR 243
           +VVL S   +E GFS D+F +W +  +N ++ T R +
Sbjct: 321 KVVLASMASLEAGFSHDIFVEWANDVKNLVLFTERGQ 357


>gi|356530858|ref|XP_003533996.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           2-like isoform 2 [Glycine max]
          Length = 742

 Score = 89.7 bits (221), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 49/123 (39%), Positives = 71/123 (57%), Gaps = 1/123 (0%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVP-SPKVVLVSTPDMECGFSRDLF 92
           +EWMSD + KSFE  R N F  K+V L  +  EL   P  PKVVL S   +E GFS D+F
Sbjct: 283 LEWMSDTIAKSFEKTRENIFLLKYVTLLINKTELDNAPDGPKVVLASMASLEAGFSHDIF 342

Query: 93  FQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKDKE 152
            +W +  +N ++ T R    TLAR L      + + + V KR+ L GEEL  Y++++++ 
Sbjct: 343 VEWANDVKNLVLFTERGQFATLARMLQADPPPKAVKVVVSKRVPLVGEELIAYEEEQNRI 402

Query: 153 AKD 155
            K+
Sbjct: 403 KKE 405



 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 31/114 (27%), Positives = 49/114 (42%), Gaps = 30/114 (26%)

Query: 163 PPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGFSR 222
           PPH +  + +L+L+D KQ L   G+  EF  G L C    V +R+V   S          
Sbjct: 659 PPHKSVLVGDLKLADIKQFLSSKGVQVEFAGGALRCGE-YVTLRKVGDAS---------- 707

Query: 223 DLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                               +   S +  IV+EG L ++YY+++  LY Q+ +L
Sbjct: 708 -------------------QKGGGSGAQQIVIEGPLCEDYYKIRDYLYSQFYLL 742



 Score = 38.1 bits (87), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 14/37 (37%), Positives = 23/37 (62%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRNR 243
           +VVL S   +E GFS D+F +W +  +N ++ T R +
Sbjct: 324 KVVLASMASLEAGFSHDIFVEWANDVKNLVLFTERGQ 360


>gi|326436560|gb|EGD82130.1| hypothetical protein PTSG_02804 [Salpingoeca sp. ATCC 50818]
          Length = 630

 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 43/117 (36%), Positives = 67/117 (57%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           ++++SD L       +  PF F ++K   SL  +   P PKVV+ +   +E GFSR LF 
Sbjct: 195 VDFLSDGLQAQMNEHKAMPFRFPNIKRVQSLDAINAHPGPKVVVTAELGLEAGFSRQLFA 254

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKD 150
           QW ++P+N+II T R    TLA  +       TL L++  R+ LEGEELE ++ +++
Sbjct: 255 QWAANPDNAIIFTRRPDEDTLAHSIYHNTAPDTLQLRLGARVELEGEELEAHRAERE 311



 Score = 43.9 bits (102), Expect = 0.068,   Method: Compositional matrix adjust.
 Identities = 18/39 (46%), Positives = 26/39 (66%)

Query: 207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRNRAD 245
           +VV+ +   +E GFSR LF QW ++P+N+II T R   D
Sbjct: 235 KVVVTAELGLEAGFSRQLFAQWAANPDNAIIFTRRPDED 273


>gi|356559788|ref|XP_003548179.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           2-like isoform 1 [Glycine max]
          Length = 738

 Score = 88.6 bits (218), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 48/123 (39%), Positives = 71/123 (57%), Gaps = 1/123 (0%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVP-SPKVVLVSTPDMECGFSRDLF 92
           +EWMSD + KSFE  R N F  K+V L  +  EL   P  PKVVL S   +E GFS ++F
Sbjct: 280 LEWMSDTIAKSFEKTRENIFLLKYVTLLINKTELDNAPDGPKVVLASMASLEAGFSHEIF 339

Query: 93  FQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKDKE 152
            +W +  +N ++ T R    TLAR L      + + + V KR+ L GEEL  Y++++++ 
Sbjct: 340 VEWANDVKNLVLFTERGQFATLARMLQADPPPKAVKVVVSKRVALVGEELIAYEEEQNRI 399

Query: 153 AKD 155
            K+
Sbjct: 400 KKE 402



 Score = 50.4 bits (119), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 31/114 (27%), Positives = 49/114 (42%), Gaps = 30/114 (26%)

Query: 163 PPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGFSR 222
           PPH +  + +L+L+D KQ L   G+  EF  G L C    V +R+V   S          
Sbjct: 655 PPHKSVLVGDLKLADIKQFLSSKGVQVEFAGGALRCGE-YVTLRKVGDAS---------- 703

Query: 223 DLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                               +   S +  IV+EG L ++YY+++  LY Q+ +L
Sbjct: 704 -------------------QKGGGSGAQQIVIEGPLCEDYYKIRDYLYSQFYLL 738


>gi|356559790|ref|XP_003548180.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           2-like isoform 2 [Glycine max]
          Length = 743

 Score = 88.6 bits (218), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 48/123 (39%), Positives = 71/123 (57%), Gaps = 1/123 (0%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVP-SPKVVLVSTPDMECGFSRDLF 92
           +EWMSD + KSFE  R N F  K+V L  +  EL   P  PKVVL S   +E GFS ++F
Sbjct: 285 LEWMSDTIAKSFEKTRENIFLLKYVTLLINKTELDNAPDGPKVVLASMASLEAGFSHEIF 344

Query: 93  FQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKDKE 152
            +W +  +N ++ T R    TLAR L      + + + V KR+ L GEEL  Y++++++ 
Sbjct: 345 VEWANDVKNLVLFTERGQFATLARMLQADPPPKAVKVVVSKRVALVGEELIAYEEEQNRI 404

Query: 153 AKD 155
            K+
Sbjct: 405 KKE 407



 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 31/114 (27%), Positives = 49/114 (42%), Gaps = 30/114 (26%)

Query: 163 PPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGFSR 222
           PPH +  + +L+L+D KQ L   G+  EF  G L C    V +R+V   S          
Sbjct: 660 PPHKSVLVGDLKLADIKQFLSSKGVQVEFAGGALRCGE-YVTLRKVGDAS---------- 708

Query: 223 DLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                               +   S +  IV+EG L ++YY+++  LY Q+ +L
Sbjct: 709 -------------------QKGGGSGAQQIVIEGPLCEDYYKIRDYLYSQFYLL 743


>gi|428169733|gb|EKX38664.1| hypothetical protein GUITHDRAFT_89302 [Guillardia theta CCMP2712]
          Length = 770

 Score = 87.8 bits (216), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 46/129 (35%), Positives = 78/129 (60%), Gaps = 6/129 (4%)

Query: 31  LAKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMEC-GFSR 89
           +++ EW+S  + + F  +R+N FH ++V  C S  EL ++P P+VVL S  D+E   FS 
Sbjct: 281 MSQTEWLSKHVNQRFIQSRHNVFHLENVHRCCSREELGRLPYPQVVLASGLDLETSSFSL 340

Query: 90  DLFFQWCSSPENSIIITNRTSPGTLARDLIELGGN-----RTLTLQVKKRIRLEGEELEE 144
           DLF +W    +N +++T +  PG+ AR   +L G+       L LQ+ +R+ LEG EL E
Sbjct: 341 DLFAEWAPDSKNLVLLTQKARPGSRARQFQDLMGSGLPLPSNLMLQMHRRVPLEGRELRE 400

Query: 145 YQKKKDKEA 153
           +++++  +A
Sbjct: 401 HEEQERLKA 409


>gi|326495752|dbj|BAJ85972.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 726

 Score = 86.7 bits (213), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 50/143 (34%), Positives = 84/143 (58%), Gaps = 8/143 (5%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVP-SPKVVLVSTPDMECGFSRDLF 92
           +EWMSD + KSFE  R+N F  +HV L  +  EL K+  +PKVVL S   +E GFS D+F
Sbjct: 268 LEWMSDSISKSFEHTRDNAFLLRHVSLIINKEELEKLGDAPKVVLASMSSLEVGFSHDIF 327

Query: 93  FQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKDKE 152
            +  +  +N ++ T +   GTLAR L      + + + + KR+ L G+EL+ Y++++++ 
Sbjct: 328 VEMANEAKNLVLFTEKGQFGTLARMLQVDPPPKAVKVTMSKRVPLVGDELKAYEEEQERI 387

Query: 153 AKDK-------QEKEKIPPHDTS 168
            K++       +EKE    H+++
Sbjct: 388 KKEEVLKASLSKEKELKASHESN 410


>gi|226492345|ref|NP_001151557.1| LOC100285191 [Zea mays]
 gi|195647682|gb|ACG43309.1| cleavage and polyadenylation specificity factor, 100 kDa subunit
           [Zea mays]
          Length = 673

 Score = 86.7 bits (213), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 49/130 (37%), Positives = 75/130 (57%), Gaps = 1/130 (0%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVP-SPKVVLVSTPDMECGFSRDLF 92
           +EWM D++ KSFE +R N F  K V L  +  EL K+  +PKVVL S   +E GFS D+F
Sbjct: 216 LEWMGDQIAKSFESSRANAFLLKKVTLIINKEELEKLGDAPKVVLASMASLEVGFSHDIF 275

Query: 93  FQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKDKE 152
            +  +   N ++ T +   GTLAR L      + L + + KRI L G EL+ Y++++++ 
Sbjct: 276 VEMANEARNLVLFTEKGQFGTLARMLQVDPPPKALKVTMSKRIPLVGNELKAYEEEQERI 335

Query: 153 AKDKQEKEKI 162
            K+K  K  +
Sbjct: 336 KKEKSLKASL 345



 Score = 46.2 bits (108), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 32/108 (29%), Positives = 50/108 (46%), Gaps = 31/108 (28%)

Query: 169 FINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGFSRDLFFQW 228
            + +L+LSDFKQ L+  G   EF  G L C    + VR+V                    
Sbjct: 597 LVGDLKLSDFKQFLENKGWQVEFAGGALRCGE-YIMVRKV-------------------- 635

Query: 229 CSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                +SI+         + S  IV+EG L ++YY++++LLY Q+ +L
Sbjct: 636 ----GDSIL------KGSTGSQQIVIEGPLCEDYYKIRELLYSQFYLL 673


>gi|281208327|gb|EFA82503.1| beta-lactamase domain-containing protein [Polysphondylium pallidum
           PN500]
          Length = 738

 Score = 86.7 bits (213), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 49/127 (38%), Positives = 75/127 (59%), Gaps = 6/127 (4%)

Query: 24  LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHS---LAELAKVPSPKVVLV 78
           + +++ E AK  +E+MS      FE    N F FK++KLCHS   L  L  + S  VVL 
Sbjct: 280 VTYNVCEFAKSQLEFMSTAAGLKFEQRNENIFAFKNIKLCHSIYDLENLMGLSSNYVVLA 339

Query: 79  STPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           S  D+E G++R+LF +W +  +N I++T+    GTLA  L+      ++TL++ +R+ LE
Sbjct: 340 SGKDLESGYARELFIKWAADSKNLILMTDSVEEGTLASHLLN-DQPESVTLELGRRVELE 398

Query: 139 GEELEEY 145
           GEEL  Y
Sbjct: 399 GEELRAY 405



 Score = 53.9 bits (128), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 34/113 (30%), Positives = 53/113 (46%), Gaps = 34/113 (30%)

Query: 164 PHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGFSRD 223
           PH++SFI +++LS+FKQ L  +G   +F  G++     T                     
Sbjct: 660 PHNSSFIGDIKLSEFKQLLIDSGYQVQFDQGIIAVSMKTT-------------------- 699

Query: 224 LFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
           L + W            R   D + S  I ++G LSDEYY+V++LLY Q+ I+
Sbjct: 700 LIYIW------------REEVDGNSS--IQIDGILSDEYYQVRELLYQQFQII 738


>gi|219886123|gb|ACL53436.1| unknown [Zea mays]
 gi|414881946|tpg|DAA59077.1| TPA: cleavage and polyadenylation specificity factor, subunit
           isoform 1 [Zea mays]
 gi|414881947|tpg|DAA59078.1| TPA: cleavage and polyadenylation specificity factor, subunit
           isoform 2 [Zea mays]
 gi|414881948|tpg|DAA59079.1| TPA: cleavage and polyadenylation specificity factor, subunit
           isoform 3 [Zea mays]
          Length = 737

 Score = 86.7 bits (213), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 49/130 (37%), Positives = 75/130 (57%), Gaps = 1/130 (0%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVP-SPKVVLVSTPDMECGFSRDLF 92
           +EWM D++ KSFE +R N F  K V L  +  EL K+  +PKVVL S   +E GFS D+F
Sbjct: 280 LEWMGDQIAKSFESSRANAFLLKKVTLIINKEELEKLGDAPKVVLASMASLEVGFSHDIF 339

Query: 93  FQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKDKE 152
            +  +   N ++ T +   GTLAR L      + L + + KRI L G EL+ Y++++++ 
Sbjct: 340 VEMANEARNLVLFTEKGQFGTLARMLQVDPPPKALKVTMSKRIPLVGNELKAYEEEQERI 399

Query: 153 AKDKQEKEKI 162
            K+K  K  +
Sbjct: 400 KKEKSLKASL 409



 Score = 46.2 bits (108), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 32/108 (29%), Positives = 50/108 (46%), Gaps = 31/108 (28%)

Query: 169 FINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGFSRDLFFQW 228
            + +L+LSDFKQ L+  G   EF  G L C    + VR+V                    
Sbjct: 661 LVGDLKLSDFKQFLENKGWQVEFAGGALRCGE-YIMVRKV-------------------- 699

Query: 229 CSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                +SI+         + S  IV+EG L ++YY++++LLY Q+ +L
Sbjct: 700 ----GDSIL------KGSTGSQQIVIEGPLCEDYYKIRELLYSQFYLL 737


>gi|308799055|ref|XP_003074308.1| polyadenylation cleavage/specificity factor 100 kDa subunit (ISS)
           [Ostreococcus tauri]
 gi|116000479|emb|CAL50159.1| polyadenylation cleavage/specificity factor 100 kDa subunit (ISS)
           [Ostreococcus tauri]
          Length = 807

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 40/121 (33%), Positives = 68/121 (56%), Gaps = 5/121 (4%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAEL-AKVPSPKVVLVSTPDMECGFSRDLF 92
           +EWM D +  +FE  R NPF+ K + +CH++ EL A  P PKVVL S   +E G +R LF
Sbjct: 310 LEWMGDLVTSAFERRRENPFNTKFITICHTMDELKALPPGPKVVLASFGSLEAGPARHLF 369

Query: 93  FQWCSSPENSIIITNRTSPGTLARDLIEL----GGNRTLTLQVKKRIRLEGEELEEYQKK 148
            +W     N +++T +   G+L  +++ +       + +   + +R+ LEGEEL  ++  
Sbjct: 370 AEWAGDKSNLVVLTGQPEEGSLMEEVVRVSSKPAAKKNVKFTLSRRVPLEGEELATHEST 429

Query: 149 K 149
           +
Sbjct: 430 R 430



 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 33/108 (30%), Positives = 49/108 (45%), Gaps = 39/108 (36%)

Query: 169 FINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGFSRDLFFQW 228
           F+ +L+L DF+Q L + GI  EF  GVL+C  G V +R+                     
Sbjct: 739 FLGDLRLVDFRQALAQEGITAEFSGGVLVCADGRVTIRK--------------------- 777

Query: 229 CSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                            DSD   +V+EG LS +++ ++Q+LY QY IL
Sbjct: 778 -----------------DSDEK-LVIEGALSQDFFEIRQILYSQYQIL 807


>gi|357160194|ref|XP_003578687.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           2-like [Brachypodium distachyon]
          Length = 738

 Score = 85.1 bits (209), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 47/130 (36%), Positives = 78/130 (60%), Gaps = 1/130 (0%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVP-SPKVVLVSTPDMECGFSRDLF 92
           +EWMSD + KSFE  R+N F  ++V L  +  EL K+  +PKVVL S   +E GFS D+F
Sbjct: 280 LEWMSDSISKSFEHTRDNAFLLRYVSLIINKEELEKLGDAPKVVLASMASLEVGFSHDIF 339

Query: 93  FQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKDKE 152
            +  +  +N ++ T +   GTLAR L      + + + + KRI L G+EL+ Y++++++ 
Sbjct: 340 VEMANEAKNLVLFTEKGQFGTLARMLQVDPPPKAVKVTMGKRIPLVGDELKAYEEEQERI 399

Query: 153 AKDKQEKEKI 162
            K++  K  +
Sbjct: 400 KKEELLKASL 409


>gi|357127861|ref|XP_003565596.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           2-like [Brachypodium distachyon]
          Length = 738

 Score = 84.7 bits (208), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 47/130 (36%), Positives = 78/130 (60%), Gaps = 1/130 (0%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVP-SPKVVLVSTPDMECGFSRDLF 92
           +EWMSD + KSFE  R+N F  ++V L  +  EL K+  +PKVVL S   +E GFS D+F
Sbjct: 280 LEWMSDSISKSFEHTRDNAFLLRYVSLIINKEELEKLGDAPKVVLASMASLEVGFSHDIF 339

Query: 93  FQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKDKE 152
            +  +  +N ++ T +   GTLAR L      + + + + KRI L G+EL+ Y++++++ 
Sbjct: 340 VEMANEAKNLVLFTEKGQFGTLARMLQVDPPPKAVKVTMGKRIPLVGDELKAYEEEQERI 399

Query: 153 AKDKQEKEKI 162
            K++  K  +
Sbjct: 400 KKEELLKASL 409


>gi|405969098|gb|EKC34107.1| Cleavage and polyadenylation specificity factor subunit 2
          [Crassostrea gigas]
          Length = 135

 Score = 84.0 bits (206), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 37/51 (72%), Positives = 45/51 (88%), Gaps = 2/51 (3%)

Query: 27 SLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKV 75
          ++ E AK  +EWMSDK+M+SFE ARNNPFHFKHVKLCH+LAELA++P PKV
Sbjct: 23 NVVEFAKSQVEWMSDKIMRSFEEARNNPFHFKHVKLCHNLAELARIPEPKV 73



 Score = 67.4 bits (163), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 29/34 (85%), Positives = 33/34 (97%)

Query: 1  MSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKI 34
          MSDK+M+SFE ARNNPFHFKHVKLCH+LAELA+I
Sbjct: 35 MSDKIMRSFEEARNNPFHFKHVKLCHNLAELARI 68


>gi|242037469|ref|XP_002466129.1| hypothetical protein SORBIDRAFT_01g001930 [Sorghum bicolor]
 gi|241919983|gb|EER93127.1| hypothetical protein SORBIDRAFT_01g001930 [Sorghum bicolor]
          Length = 738

 Score = 84.0 bits (206), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 48/130 (36%), Positives = 75/130 (57%), Gaps = 1/130 (0%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVP-SPKVVLVSTPDMECGFSRDLF 92
           +EWM D++ KSFE  R N F  K V L  +  EL K+  +PKVVL S   +E GFS D+F
Sbjct: 280 LEWMRDQIAKSFESNRANAFLLKKVMLIINKEELEKLGDAPKVVLASMASLEVGFSHDIF 339

Query: 93  FQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKDKE 152
            +  +   N ++ T +   GTLAR L      + + + + KRI L G+EL+ Y++++++ 
Sbjct: 340 VEMANEARNLVLFTEKGQFGTLARMLQVDPPPKAVKVTMSKRIPLVGDELKAYEEEQERI 399

Query: 153 AKDKQEKEKI 162
            K+K  K  +
Sbjct: 400 KKEKALKASL 409



 Score = 47.4 bits (111), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 30/108 (27%), Positives = 49/108 (45%), Gaps = 31/108 (28%)

Query: 169 FINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGFSRDLFFQW 228
            + +L+LSDFKQ L+  G   EF  G L C    + VR+                     
Sbjct: 662 LVGDLKLSDFKQFLENKGWQVEFAGGALRCGE-YIMVRK--------------------- 699

Query: 229 CSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                    I + ++   + S  IV+EG L ++YY++++LLY Q+ +L
Sbjct: 700 ---------IGDSSQKGSTGSQQIVIEGPLCEDYYKIRELLYSQFYLL 738


>gi|159465769|ref|XP_001691095.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158279781|gb|EDP05541.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 389

 Score = 80.9 bits (198), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 32/86 (37%), Positives = 54/86 (62%)

Query: 33  KIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLF 92
           ++E++  +++++F   R  PF F+ + +   L +L  +P PKVVL + P ++CG +R L 
Sbjct: 304 QLEYLGSEMVQAFSHKRTIPFTFRKLAVITRLEDLGAIPGPKVVLATLPSLDCGPARQLL 363

Query: 93  FQWCSSPENSIIITNRTSPGTLARDL 118
             W ++P N+II T R +PGTLA  L
Sbjct: 364 VDWAAAPRNTIIFTERANPGTLAHAL 389



 Score = 43.5 bits (101), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 16/41 (39%), Positives = 25/41 (60%)

Query: 201 GTVAVRRVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 241
           G +   +VVL + P ++CG +R L   W ++P N+II T R
Sbjct: 339 GAIPGPKVVLATLPSLDCGPARQLLVDWAAAPRNTIIFTER 379


>gi|414881949|tpg|DAA59080.1| TPA: hypothetical protein ZEAMMB73_548570 [Zea mays]
          Length = 766

 Score = 80.9 bits (198), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 49/136 (36%), Positives = 75/136 (55%), Gaps = 7/136 (5%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVP-SPKVVLVSTPDMECGFSRDLF 92
           +EWM D++ KSFE +R N F  K V L  +  EL K+  +PKVVL S   +E GFS D+F
Sbjct: 280 LEWMGDQIAKSFESSRANAFLLKKVTLIINKEELEKLGDAPKVVLASMASLEVGFSHDIF 339

Query: 93  FQWCSSPENSIIITNRTSP------GTLARDLIELGGNRTLTLQVKKRIRLEGEELEEYQ 146
            +  +   N ++ T +         GTLAR L      + L + + KRI L G EL+ Y+
Sbjct: 340 VEMANEARNLVLFTEKGQKIFALQFGTLARMLQVDPPPKALKVTMSKRIPLVGNELKAYE 399

Query: 147 KKKDKEAKDKQEKEKI 162
           +++++  K+K  K  +
Sbjct: 400 EEQERIKKEKSLKASL 415


>gi|115480769|ref|NP_001063978.1| Os09g0569400 [Oryza sativa Japonica Group]
 gi|75253249|sp|Q652P4.1|CPSF2_ORYSJ RecName: Full=Cleavage and polyadenylation specificity factor
           subunit 2; AltName: Full=Cleavage and polyadenylation
           specificity factor 100 kDa subunit; Short=CPSF 100 kDa
           subunit
 gi|52077178|dbj|BAD46223.1| putative cleavage and polyadenylation specificity factor [Oryza
           sativa Japonica Group]
 gi|113632211|dbj|BAF25892.1| Os09g0569400 [Oryza sativa Japonica Group]
          Length = 738

 Score = 78.6 bits (192), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 46/130 (35%), Positives = 75/130 (57%), Gaps = 1/130 (0%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVP-SPKVVLVSTPDMECGFSRDLF 92
           +EWM+D + KSFE  R+N F  K V    +  EL K+  +PKVVL S   +E GFS D+F
Sbjct: 280 LEWMNDSISKSFEHTRDNAFLLKCVTQIINKDELEKLGDAPKVVLASMASLEVGFSHDIF 339

Query: 93  FQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKDKE 152
               +  +N ++ T +   GTLAR L      + + + + KRI L G+EL+ Y++++++ 
Sbjct: 340 VDMANEAKNLVLFTEKGQFGTLARMLQVDPPPKAVKVTMSKRIPLVGDELKAYEEEQERI 399

Query: 153 AKDKQEKEKI 162
            K++  K  +
Sbjct: 400 KKEEALKASL 409


>gi|218202664|gb|EEC85091.1| hypothetical protein OsI_32459 [Oryza sativa Indica Group]
          Length = 1195

 Score = 78.6 bits (192), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 46/130 (35%), Positives = 75/130 (57%), Gaps = 1/130 (0%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVP-SPKVVLVSTPDMECGFSRDLF 92
           +EWM+D + KSFE  R+N F  K V    +  EL K+  +PKVVL S   +E GFS D+F
Sbjct: 280 LEWMNDSISKSFEHTRDNAFLLKCVTQIINKDELEKLGDAPKVVLASMASLEVGFSHDIF 339

Query: 93  FQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKDKE 152
               +  +N ++ T +   GTLAR L      + + + + KRI L G+EL+ Y++++++ 
Sbjct: 340 VDMANEAKNLVLFTEKGQFGTLARMLQVDPPPKAVKVTMSKRIPLVGDELKAYEEEQERI 399

Query: 153 AKDKQEKEKI 162
            K++  K  +
Sbjct: 400 KKEEALKASL 409


>gi|222642134|gb|EEE70266.1| hypothetical protein OsJ_30409 [Oryza sativa Japonica Group]
          Length = 1073

 Score = 78.6 bits (192), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 46/130 (35%), Positives = 75/130 (57%), Gaps = 1/130 (0%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVP-SPKVVLVSTPDMECGFSRDLF 92
           +EWM+D + KSFE  R+N F  K V    +  EL K+  +PKVVL S   +E GFS D+F
Sbjct: 280 LEWMNDSISKSFEHTRDNAFLLKCVTQIINKDELEKLGDAPKVVLASMASLEVGFSHDIF 339

Query: 93  FQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKDKE 152
               +  +N ++ T +   GTLAR L      + + + + KRI L G+EL+ Y++++++ 
Sbjct: 340 VDMANEAKNLVLFTEKGQFGTLARMLQVDPPPKAVKVTMSKRIPLVGDELKAYEEEQERI 399

Query: 153 AKDKQEKEKI 162
            K++  K  +
Sbjct: 400 KKEEALKASL 409


>gi|167535876|ref|XP_001749611.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163772003|gb|EDQ85662.1| predicted protein [Monosiga brevicollis MX1]
          Length = 770

 Score = 77.0 bits (188), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 42/126 (33%), Positives = 71/126 (56%), Gaps = 1/126 (0%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           IE+MSDK++   +   +NPF   ++K   ++  + +    KVV+V    +E GF R+L F
Sbjct: 281 IEFMSDKMLVKLQTMTHNPFACPNIKCQKTVDGVMRSAGAKVVIVPHSSLEAGFGRELLF 340

Query: 94  QWCSSPENSIIITNRTSPGTL-ARDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKDKE 152
           +      N  +   R  P +L AR L + G   T+  + + R++LEGEEL+ Y++ K +E
Sbjct: 341 RLAGEARNRFLFIARPPPHSLGARLLAKSGQIHTIQFEHRFRVQLEGEELKAYRQHKAEE 400

Query: 153 AKDKQE 158
           AK ++E
Sbjct: 401 AKQQKE 406


>gi|422293869|gb|EKU21169.1| cleavage and polyadenylation specificity factor subunit 2
           [Nannochloropsis gaditana CCMP526]
          Length = 925

 Score = 75.5 bits (184), Expect = 3e-11,   Method: Composition-based stats.
 Identities = 44/134 (32%), Positives = 71/134 (52%), Gaps = 6/134 (4%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAEL----AKVPSPKVVLVSTPDMECGFSR 89
           +E+M  ++++ F+ +  NPF  KHV    S+ EL     +  +P VVL S   M+ GFSR
Sbjct: 353 VEYMHPEVIRDFDRSLRNPFSLKHVVPAQSMLELEAAMGEYRNPVVVLASDEGMDTGFSR 412

Query: 90  DLFFQWCSSPENSIIITNRTSPGTLARDLIELGG--NRTLTLQVKKRIRLEGEELEEYQK 147
            L  +W S PEN++++      G+LA    +L       L+  V    R+ GEEL   ++
Sbjct: 413 ALATRWASGPENALLLCGHLRKGSLAESFWKLRHLPKAALSFSVPVIERIVGEELAGLRE 472

Query: 148 KKDKEAKDKQEKEK 161
           K+D+E +   E E+
Sbjct: 473 KEDRERRKALEAEE 486



 Score = 38.1 bits (87), Expect = 3.7,   Method: Composition-based stats.
 Identities = 19/61 (31%), Positives = 33/61 (54%), Gaps = 11/61 (18%)

Query: 208 VVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQ 267
           VVL S   M+ GFSR L  +W S PEN++++    R           +G L++ +++++ 
Sbjct: 398 VVLASDEGMDTGFSRALATRWASGPENALLLCGHLR-----------KGSLAESFWKLRH 446

Query: 268 L 268
           L
Sbjct: 447 L 447


>gi|328866931|gb|EGG15314.1| beta-lactamase domain-containing protein [Dictyostelium
           fasciculatum]
          Length = 768

 Score = 74.7 bits (182), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 36/100 (36%), Positives = 61/100 (61%), Gaps = 5/100 (5%)

Query: 24  LCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPK---VVLV 78
           + + + E AK  +E+MS      FE    N F+F+++K+C+S  +L ++P+     VVL 
Sbjct: 278 VTYYVREFAKSQLEFMSTAASSKFEQKNENIFNFRNIKICNSFKQLEELPNLTRNYVVLA 337

Query: 79  STPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDL 118
           S+ D+E GF++DLF QW + P+N +++T+    GTL   L
Sbjct: 338 SSKDLETGFAKDLFIQWANDPKNMVMLTDNMDEGTLGDQL 377



 Score = 48.1 bits (113), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 30/112 (26%), Positives = 49/112 (43%), Gaps = 37/112 (33%)

Query: 165 HDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGFSRDL 224
           H+ SF+ +++LS+FK  L R G   +F  G+L C                         L
Sbjct: 694 HNVSFVGDVKLSEFKDVLTREGFQVQFDKGILSC-----------------------NGL 730

Query: 225 FFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
            + W              R +   ++ I ++G +S+EYY V++LLY Q+ IL
Sbjct: 731 VYLW--------------REEVDGNSCINIDGVMSEEYYLVKELLYSQFKIL 768



 Score = 46.2 bits (108), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 23/59 (38%), Positives = 35/59 (59%), Gaps = 11/59 (18%)

Query: 208 VVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQ 266
           VVL S+ D+E GF++DLF QW + P+N +++T     D+ D      EG L D+  + Q
Sbjct: 334 VVLASSKDLETGFAKDLFIQWANDPKNMVMLT-----DNMD------EGTLGDQLSKCQ 381


>gi|298708373|emb|CBJ48436.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 997

 Score = 74.7 bits (182), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 34/89 (38%), Positives = 57/89 (64%), Gaps = 1/89 (1%)

Query: 32  AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVP-SPKVVLVSTPDMECGFSRD 90
           +++EWMS+ + ++F+  R+NPF  ++V + HSL EL ++   PKVV+ +   ++ GFS+ 
Sbjct: 415 SQLEWMSEDIGRAFDLQRSNPFELRNVHIMHSLEELDELGDDPKVVMATDMSLDFGFSKA 474

Query: 91  LFFQWCSSPENSIIITNRTSPGTLARDLI 119
           L  +W S   N+I++T R    T AR LI
Sbjct: 475 LLLRWASGGANTILLTGRGHGNTTARTLI 503


>gi|302833565|ref|XP_002948346.1| hypothetical protein VOLCADRAFT_31342 [Volvox carteri f.
           nagariensis]
 gi|300266566|gb|EFJ50753.1| hypothetical protein VOLCADRAFT_31342 [Volvox carteri f.
           nagariensis]
          Length = 375

 Score = 70.1 bits (170), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 30/80 (37%), Positives = 47/80 (58%)

Query: 33  KIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLF 92
           ++E++  +L+++F   R+ PF F+ + +   L EL   P PKVVL   P +E G +R+L 
Sbjct: 296 QLEYLGSELVQAFSLKRSVPFSFRKLSVITRLEELGAFPGPKVVLAPMPSLESGPARELL 355

Query: 93  FQWCSSPENSIIITNRTSPG 112
            QW + P N+II T R   G
Sbjct: 356 VQWGALPRNTIIFTERAQVG 375


>gi|67968624|dbj|BAE00671.1| unnamed protein product [Macaca fascicularis]
          Length = 341

 Score = 70.1 bits (170), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 41/116 (35%), Positives = 57/116 (49%), Gaps = 40/116 (34%)

Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
           ++P H + F+NE +LSDFKQ L R GI  EF+ GVL+ C   VAVRR             
Sbjct: 266 EVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNQVAVRR------------- 311

Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                     +++  I LEGCL  ++YR++ LLY+QYAI+
Sbjct: 312 --------------------------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 341


>gi|301776931|ref|XP_002923884.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           2-like [Ailuropoda melanoleuca]
          Length = 146

 Score = 69.7 bits (169), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 41/116 (35%), Positives = 57/116 (49%), Gaps = 40/116 (34%)

Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
           ++P H + F+NE +LSDFKQ L R GI  EF+ GVL+ C   VAVRR             
Sbjct: 71  EVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNQVAVRR------------- 116

Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                     +++  I LEGCL  ++YR++ LLY+QYAI+
Sbjct: 117 --------------------------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 146


>gi|119601889|gb|EAW81483.1| cleavage and polyadenylation specific factor 2, 100kDa, isoform
           CRA_c [Homo sapiens]
          Length = 690

 Score = 69.7 bits (169), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 41/116 (35%), Positives = 57/116 (49%), Gaps = 40/116 (34%)

Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
           ++P H + F+NE +LSDFKQ L R GI  EF+ GVL+ C   VAVRR             
Sbjct: 615 EVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNQVAVRR------------- 660

Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                     +++  I LEGCL  ++YR++ LLY+QYAI+
Sbjct: 661 --------------------------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 690


>gi|349603401|gb|AEP99246.1| Cleavage and polyadenylation specificity factor subunit 2-like
           protein, partial [Equus caballus]
          Length = 327

 Score = 69.7 bits (169), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 41/116 (35%), Positives = 57/116 (49%), Gaps = 40/116 (34%)

Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
           ++P H + F+NE +LSDFKQ L R GI  EF+ GVL+ C   VAVRR             
Sbjct: 252 EVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNQVAVRR------------- 297

Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                     +++  I LEGCL  ++YR++ LLY+QYAI+
Sbjct: 298 --------------------------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 327


>gi|328908757|gb|AEB61046.1| cleavage and polyadenylation specificity factor subunit 2-like
           protein, partial [Equus caballus]
          Length = 256

 Score = 69.7 bits (169), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 41/116 (35%), Positives = 57/116 (49%), Gaps = 40/116 (34%)

Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
           ++P H + F+NE +LSDFKQ L R GI  EF+ GVL+ C   VAVRR             
Sbjct: 181 EVPGHQSVFMNEPRLSDFKQALLREGIQAEFVGGVLV-CNNQVAVRR------------- 226

Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                     +++  I LEGCL  ++YR++ LLY+QYAI+
Sbjct: 227 --------------------------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 256


>gi|328908891|gb|AEB61113.1| cleavage and polyadenylation specificity factor subunit 2-like
           protein, partial [Equus caballus]
          Length = 108

 Score = 69.3 bits (168), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 41/116 (35%), Positives = 57/116 (49%), Gaps = 40/116 (34%)

Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
           ++P H + F+NE +LSDFKQ L R GI  EF+ GVL+ C   VAVRR             
Sbjct: 33  EVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNQVAVRR------------- 78

Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                     +++  I LEGCL  ++YR++ LLY+QYAI+
Sbjct: 79  --------------------------TETGPIGLEGCLCQDFYRIRDLLYEQYAIV 108


>gi|350587135|ref|XP_003482353.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           2-like [Sus scrofa]
          Length = 272

 Score = 69.3 bits (168), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 41/116 (35%), Positives = 57/116 (49%), Gaps = 40/116 (34%)

Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
           ++P H + F+NE +LSDFKQ L R GI  EF+ GVL+ C   VAVRR             
Sbjct: 197 EVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNQVAVRR------------- 242

Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                     +++  I LEGCL  ++YR++ LLY+QYAI+
Sbjct: 243 --------------------------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 272


>gi|449512224|ref|XP_002198279.2| PREDICTED: cleavage and polyadenylation specificity factor subunit
           2-like [Taeniopygia guttata]
          Length = 272

 Score = 67.8 bits (164), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 43/122 (35%), Positives = 59/122 (48%), Gaps = 45/122 (36%)

Query: 160 EKIPPHD-----TSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTP 214
           E +PPH+     + F+NE +LSDFKQ L R GI  EF+ GVL+ C   VAVRR       
Sbjct: 191 EPMPPHEVLGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNLVAVRR------- 242

Query: 215 DMECGFSRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYA 274
                                           +++  I LEGCL  ++YR++ LLY+QYA
Sbjct: 243 --------------------------------TETGRIGLEGCLCQDFYRIRDLLYEQYA 270

Query: 275 IL 276
           I+
Sbjct: 271 IV 272


>gi|430813604|emb|CCJ29043.1| unnamed protein product [Pneumocystis jirovecii]
 gi|430813606|emb|CCJ29045.1| unnamed protein product [Pneumocystis jirovecii]
          Length = 772

 Score = 67.4 bits (163), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 50/169 (29%), Positives = 87/169 (51%), Gaps = 34/169 (20%)

Query: 32  AKIEWMSDKLMKSFEGARNNPFHFKHVK-LCHSLAELAKVPSPKVVLVSTPDMECGFSRD 90
           + IEWMSD ++  + G+  + F F +VK + +    L+ V  PKV+L ++ +M+CGFS+ 
Sbjct: 272 SMIEWMSDSIISEY-GSSGSVFEFTYVKVITNEFQFLSMVSGPKVILATSSNMDCGFSQK 330

Query: 91  LFFQWCSSPENSIIITNRTS---PGTLARDLIE------------------LGGNRTLTL 129
           +F    +    +++I ++ S     +L++DL++                  L  NRT+T 
Sbjct: 331 IFLDSIAKDSKNLVILSQKSIYYENSLSKDLLDRWNLAIEHSDQLIPPAVILNFNRTVT- 389

Query: 130 QVKKRIRLEGEELEEYQKKKDKEAKDKQEKEKIPPHDTSFINELQLSDF 178
            ++  + L G ELE+YQ+K+    K ++EKE         I ELQ  D 
Sbjct: 390 -IRTSVPLVGSELEKYQEKE----KLRREKEA-----AKLIMELQNRDL 428


>gi|412994069|emb|CCO14580.1| predicted protein [Bathycoccus prasinos]
          Length = 1092

 Score = 65.9 bits (159), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 40/135 (29%), Positives = 71/135 (52%), Gaps = 16/135 (11%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVP-SPKVVLVSTPDMECGFSRDLF 92
           +EWM++ +   F+  R N F  K +  CHS  +  ++P  PKVVL S   +E GF+R LF
Sbjct: 485 LEWMNEDVGLKFDSTRQNVFDVKRLFPCHSYEDFTRLPPGPKVVLASLASLEGGFARKLF 544

Query: 93  FQWCSSPENSIIITNRTSPGT-LARDLIE---LGG-----------NRTLTLQVKKRIRL 137
            +W S  +N  I  +       LAR+++E    GG           +  + +++ +R  L
Sbjct: 545 VEWASDAKNCFIWPDEIGRQVGLAREIVEKCSKGGAKTTSSKTKKKDVIMKVELARRELL 604

Query: 138 EGEELEEYQKKKDKE 152
            G+ELE ++ +++++
Sbjct: 605 SGKELEAWEHEQEEK 619



 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 32/111 (28%), Positives = 55/111 (49%), Gaps = 39/111 (35%)

Query: 167  TSFINELQLSDFKQTLQRNGIDCEFMD-GVLICCRGTVAVRRVVLVSTPDMECGFSRDLF 225
            ++F+ +++LS+F++ LQR G+  EF + G L+C  G V VRR                  
Sbjct: 1020 SAFVGDVRLSEFRRYLQRMGVPAEFGEGGALVCANGQVVVRR------------------ 1061

Query: 226  FQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                             RA+D +   +++EG +SD Y+ V+ +LY QY+I+
Sbjct: 1062 -----------------RAEDDE---LIVEGSISDAYFNVRDMLYAQYSII 1092


>gi|297808389|ref|XP_002872078.1| hypothetical protein ARALYDRAFT_910398 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297317915|gb|EFH48337.1| hypothetical protein ARALYDRAFT_910398 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 544

 Score = 65.1 bits (157), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 37/118 (31%), Positives = 61/118 (51%), Gaps = 26/118 (22%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +EWMSD + KSFE +R+N F                       L+S   +E GF+R++F 
Sbjct: 205 LEWMSDSISKSFETSRDNAF-----------------------LLS---LEAGFAREIFV 238

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKDK 151
           +W + P N ++ T     GTLAR L      + + + + KR+ L GEEL  Y++++++
Sbjct: 239 EWANDPRNLVLFTETGQFGTLARMLQSAPPPKFVKVTMSKRVPLAGEELIAYEEEQNR 296


>gi|422294077|gb|EKU21377.1| cleavage and polyadenylation specificity factor subunit 2, partial
           [Nannochloropsis gaditana CCMP526]
          Length = 429

 Score = 64.7 bits (156), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 39/115 (33%), Positives = 60/115 (52%), Gaps = 6/115 (5%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAEL----AKVPSPKVVLVSTPDMECGFSR 89
           +E+M  ++++ F+ +  NPF  KHV    S+ EL     +  +P VVL S   M+ GFSR
Sbjct: 310 VEYMHPEVIRDFDRSLRNPFSLKHVVPAQSMLELEAAMGEYRNPVVVLASDEGMDTGFSR 369

Query: 90  DLFFQWCSSPENSIIITNRTSPGTLARDLIELG--GNRTLTLQVKKRIRLEGEEL 142
            L  +W S PEN++++      G+LA    +L       L+  V    R+ GEEL
Sbjct: 370 ALATRWASGPENALLLCGHLRKGSLAESFWKLRHLPKAALSFSVPVIERIVGEEL 424



 Score = 39.3 bits (90), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 19/61 (31%), Positives = 33/61 (54%), Gaps = 11/61 (18%)

Query: 208 VVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQ 267
           VVL S   M+ GFSR L  +W S PEN++++    R           +G L++ +++++ 
Sbjct: 355 VVLASDEGMDTGFSRALATRWASGPENALLLCGHLR-----------KGSLAESFWKLRH 403

Query: 268 L 268
           L
Sbjct: 404 L 404


>gi|115456655|ref|NP_001051928.1| Os03g0852900 [Oryza sativa Japonica Group]
 gi|27573349|gb|AAO20067.1| putative cleavage and polyadenylation specifity factor protein
           [Oryza sativa Japonica Group]
 gi|29126360|gb|AAO66552.1| putative cleavage and polyadenylation specifity factor [Oryza
           sativa Japonica Group]
 gi|108712151|gb|ABF99946.1| Cleavage and polyadenylation specificity factor, 73 kDa subunit,
           putative, expressed [Oryza sativa Japonica Group]
 gi|113550399|dbj|BAF13842.1| Os03g0852900 [Oryza sativa Japonica Group]
 gi|125588676|gb|EAZ29340.1| hypothetical protein OsJ_13407 [Oryza sativa Japonica Group]
          Length = 700

 Score = 64.3 bits (155), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 32/86 (37%), Positives = 51/86 (59%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I  M++++   F  A++NPFHFKH++  +S+     V  P VV+ S   ++ G SR LF 
Sbjct: 305 INSMNERIRNQF--AQSNPFHFKHIESLNSIDNFHDV-GPSVVMASPGGLQSGLSRQLFD 361

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
           +WC+  +NS +I      GTLA+ +I
Sbjct: 362 KWCTDKKNSCVIPGYVVEGTLAKTII 387


>gi|125546484|gb|EAY92623.1| hypothetical protein OsI_14368 [Oryza sativa Indica Group]
          Length = 700

 Score = 64.3 bits (155), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 32/86 (37%), Positives = 51/86 (59%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I  M++++   F  A++NPFHFKH++  +S+     V  P VV+ S   ++ G SR LF 
Sbjct: 305 INSMNERIRNQF--AQSNPFHFKHIESLNSIDNFHDV-GPSVVMASPGGLQSGLSRQLFD 361

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
           +WC+  +NS +I      GTLA+ +I
Sbjct: 362 KWCTDKKNSCVIPGYVVEGTLAKTII 387


>gi|387219447|gb|AFJ69432.1| cleavage and polyadenylation specificity factor subunit 2, partial
           [Nannochloropsis gaditana CCMP526]
          Length = 217

 Score = 63.9 bits (154), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 39/115 (33%), Positives = 60/115 (52%), Gaps = 6/115 (5%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAEL----AKVPSPKVVLVSTPDMECGFSR 89
           +E+M  ++++ F+ +  NPF  KHV    S+ EL     +  +P VVL S   M+ GFSR
Sbjct: 98  VEYMHPEVIRDFDRSLRNPFSLKHVVPAQSMLELEAAMGEYRNPVVVLASDEGMDTGFSR 157

Query: 90  DLFFQWCSSPENSIIITNRTSPGTLARDLIELGG--NRTLTLQVKKRIRLEGEEL 142
            L  +W S PEN++++      G+LA    +L       L+  V    R+ GEEL
Sbjct: 158 ALATRWASGPENALLLCGHLRKGSLAESFWKLRHLPKAALSFSVPVIERIVGEEL 212



 Score = 40.0 bits (92), Expect = 0.96,   Method: Compositional matrix adjust.
 Identities = 19/61 (31%), Positives = 33/61 (54%), Gaps = 11/61 (18%)

Query: 208 VVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQ 267
           VVL S   M+ GFSR L  +W S PEN++++    R           +G L++ +++++ 
Sbjct: 143 VVLASDEGMDTGFSRALATRWASGPENALLLCGHLR-----------KGSLAESFWKLRH 191

Query: 268 L 268
           L
Sbjct: 192 L 192


>gi|224009389|ref|XP_002293653.1| cleavage and polyadenylation specificity factor [Thalassiosira
           pseudonana CCMP1335]
 gi|220971053|gb|EED89389.1| cleavage and polyadenylation specificity factor [Thalassiosira
           pseudonana CCMP1335]
          Length = 347

 Score = 63.9 bits (154), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 28/80 (35%), Positives = 48/80 (60%), Gaps = 5/80 (6%)

Query: 32  AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKV-----PSPKVVLVSTPDMECG 86
           +++EWM++ L   F+  R +P+  K V++C S+AEL  V      +P  VL S   ++ G
Sbjct: 268 SQLEWMAEPLGAQFDSQRGHPYALKSVRICSSVAELESVIESSNGNPTAVLASGSSLDHG 327

Query: 87  FSRDLFFQWCSSPENSIIIT 106
            +RDL  +W  +P+N ++IT
Sbjct: 328 PARDLLLKWGDNPDNLVLIT 347


>gi|145350779|ref|XP_001419775.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144580007|gb|ABO98068.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 767

 Score = 63.5 bits (153), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 40/110 (36%), Positives = 57/110 (51%), Gaps = 12/110 (10%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I  ++  +  +FE A  NPF F HVK     +EL  V  P VVL +   ++ G SR+LF 
Sbjct: 287 INVLNSDMKAAFEEA--NPFVFNHVKHVSKSSELDDV-GPCVVLATPSMLQSGLSRELFE 343

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELE 143
            WC  P+N +II +    GTLAR+++           V K I  +G EL+
Sbjct: 344 SWCEDPKNGVIIADFAVQGTLAREILS---------DVNKIIARDGRELQ 384


>gi|320163324|gb|EFW40223.1| CPSF3 protein [Capsaspora owczarzaki ATCC 30864]
          Length = 802

 Score = 63.2 bits (152), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 36/119 (30%), Positives = 64/119 (53%), Gaps = 6/119 (5%)

Query: 3   DKLMKSFEGARNNPFHFKH--VKLCHSLAELAKIEWMSDKLMKSFEGARNNPFHFKHVKL 60
           D+  +S     N P ++     + C ++ +   I  M++++ + F  A +NPF FKH+  
Sbjct: 280 DEFWESKPALHNIPIYYASSLARKCMAIYQ-TYINQMNERIRRQF--AISNPFMFKHIAS 336

Query: 61  CHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLI 119
             S +E+ +   P V++ S   ++ G SRDLF QWC    N +I+T  +  GTLA+ ++
Sbjct: 337 IKSASEIDQ-SGPMVMMASPGMLQNGLSRDLFEQWCPDSRNGVIVTGYSVEGTLAKSIL 394


>gi|226497180|ref|NP_001146407.1| uncharacterized protein LOC100279987 [Zea mays]
 gi|219887045|gb|ACL53897.1| unknown [Zea mays]
 gi|414873991|tpg|DAA52548.1| TPA: hypothetical protein ZEAMMB73_264007 [Zea mays]
          Length = 697

 Score = 63.2 bits (152), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 31/86 (36%), Positives = 51/86 (59%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I  M++++   F  A++NPFHFKH++  +S+     V  P VV+ S   ++ G SR LF 
Sbjct: 302 INSMNERIRNQF--AQSNPFHFKHIESLNSIDNFHDV-GPSVVMASPGGLQSGLSRQLFD 358

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
           +WC+  +N+ +I      GTLA+ +I
Sbjct: 359 KWCTDKKNACVIPGYVVEGTLAKTII 384


>gi|242032211|ref|XP_002463500.1| hypothetical protein SORBIDRAFT_01g000850 [Sorghum bicolor]
 gi|241917354|gb|EER90498.1| hypothetical protein SORBIDRAFT_01g000850 [Sorghum bicolor]
          Length = 695

 Score = 63.2 bits (152), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 31/86 (36%), Positives = 51/86 (59%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I  M++++   F  A++NPFHFKH++  +S+     V  P VV+ S   ++ G SR LF 
Sbjct: 302 INSMNERIRNQF--AQSNPFHFKHIESLNSIDNFHDV-GPSVVMASPGGLQSGLSRQLFD 358

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
           +WC+  +N+ +I      GTLA+ +I
Sbjct: 359 KWCTDKKNACVIPGYVVEGTLAKTII 384


>gi|357117889|ref|XP_003560694.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           3-I-like [Brachypodium distachyon]
          Length = 690

 Score = 62.8 bits (151), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 31/86 (36%), Positives = 51/86 (59%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I  M++++   F  A++NPFHFKH++  +S+     V  P VV+ S   ++ G SR LF 
Sbjct: 295 INSMNERIRNQF--AQSNPFHFKHIEPLNSIDNFHDV-GPSVVMASPGSLQSGLSRQLFD 351

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
           +WC+  +N+ +I      GTLA+ +I
Sbjct: 352 KWCTDKKNTCVIPGYVIEGTLAKTII 377


>gi|156840674|ref|XP_001643716.1| hypothetical protein Kpol_1009p4 [Vanderwaltozyma polyspora DSM
           70294]
 gi|156114339|gb|EDO15858.1| hypothetical protein Kpol_1009p4 [Vanderwaltozyma polyspora DSM
           70294]
          Length = 778

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 34/108 (31%), Positives = 56/108 (51%), Gaps = 4/108 (3%)

Query: 11  GARNNPFHFKH--VKLCHSLAELAKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELA 68
           G+   P ++     K C S+ +   +  M+D + K F  ++ NPF FKH+    +L E  
Sbjct: 267 GSGQVPIYYASNLAKKCMSVYQ-TYVNMMNDDIRKKFRDSQTNPFIFKHISYLKNLDEFQ 325

Query: 69  KVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLAR 116
               P V+L S   ++ G SRDL  +WC   +N ++IT  +  GT+A+
Sbjct: 326 DF-GPSVMLASPGMLQNGLSRDLLEKWCPEDKNMVLITGYSVEGTMAK 372


>gi|226505292|ref|NP_001151522.1| cleavage and polyadenylation specificity factor, 73 kDa subunit
           [Zea mays]
 gi|195647398|gb|ACG43167.1| cleavage and polyadenylation specificity factor, 73 kDa subunit
           [Zea mays]
 gi|224034229|gb|ACN36190.1| unknown [Zea mays]
 gi|413932397|gb|AFW66948.1| cleavage and polyadenylation specificity factor, subunit [Zea mays]
          Length = 694

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 31/86 (36%), Positives = 50/86 (58%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I  M++++   F  A++NPFHFKH++  +S+     V  P VV+ S   ++ G SR LF 
Sbjct: 302 INSMNERIRNQF--AQSNPFHFKHIESLNSIDNFHDV-GPSVVMASPSGLQSGLSRQLFD 358

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
           +WC+   N+ +I      GTLA+ +I
Sbjct: 359 KWCTDKRNACVIPGYVVEGTLAKTII 384


>gi|156379813|ref|XP_001631650.1| predicted protein [Nematostella vectensis]
 gi|156218694|gb|EDO39587.1| predicted protein [Nematostella vectensis]
          Length = 688

 Score = 62.4 bits (150), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 33/86 (38%), Positives = 49/86 (56%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+DK+ K    A +NPF FKH+    S+ +   +  P VV+ S   M+ G SR+LF 
Sbjct: 295 VNAMNDKIKKQI--AISNPFVFKHISNLKSIDQFDDI-GPSVVMASPGMMQSGLSRELFE 351

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
           QWC+   N +II      GTLA++L+
Sbjct: 352 QWCTDRRNGVIIAGYCVEGTLAKNLM 377


>gi|301787149|ref|XP_002928998.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           2-like, partial [Ailuropoda melanoleuca]
          Length = 145

 Score = 62.4 bits (150), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 23/38 (60%), Positives = 32/38 (84%)

Query: 32  AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAK 69
           +++EWMSDKLM+ FE  RNNPF F+H+ LCH L++LA+
Sbjct: 108 SQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLAR 145



 Score = 53.9 bits (128), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 21/33 (63%), Positives = 27/33 (81%)

Query: 1   MSDKLMKSFEGARNNPFHFKHVKLCHSLAELAK 33
           MSDKLM+ FE  RNNPF F+H+ LCH L++LA+
Sbjct: 113 MSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLAR 145


>gi|308807807|ref|XP_003081214.1| mRNA cleavage and polyadenylation factor II complex, BRR5 (CPSF
           subunit) (ISS) [Ostreococcus tauri]
 gi|116059676|emb|CAL55383.1| mRNA cleavage and polyadenylation factor II complex, BRR5 (CPSF
           subunit) (ISS) [Ostreococcus tauri]
          Length = 572

 Score = 62.0 bits (149), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 39/110 (35%), Positives = 57/110 (51%), Gaps = 12/110 (10%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I  ++  +  +FE A  NPF F HVK     +EL  V  P VVL +   ++ G SR+LF 
Sbjct: 275 INVLNADMKAAFEEA--NPFVFNHVKHISKASELDDV-GPCVVLATPSMLQSGLSRELFE 331

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELE 143
            WC  P+N +II +    GTLAR+++           V K    +G+EL+
Sbjct: 332 SWCEEPKNGVIIADFAVQGTLAREILS---------DVNKVTARDGQELQ 372


>gi|385305954|gb|EIF49896.1| mrna cleavage and polyadenylation specificity factor complex
           subunit ysh1 [Dekkera bruxellensis AWRI1499]
          Length = 295

 Score = 62.0 bits (149), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 37/120 (30%), Positives = 64/120 (53%), Gaps = 6/120 (5%)

Query: 3   DKLMKSFEGARNNPFHFKH--VKLCHSLAELAKIEWMSDKLMKSFEGARNNPFHFKHVK- 59
           D+  +  +  +N P ++     K C ++ E   I  M+D + K F     NPFHFK++K 
Sbjct: 169 DEYWQRHKDLQNVPIYYASSLAKKCMAVYE-RYINMMNDSIRKKFTETNENPFHFKYIKN 227

Query: 60  LCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLI 119
           + H  A+     +P V++ S   ++ G SR L  +WC  P N++I+T  +  GT+A+ L+
Sbjct: 228 VAH--ADRIDDLNPCVMIASPGMLQNGVSRQLLEKWCPDPRNTVIMTGYSVDGTMAKKLL 285


>gi|356543411|ref|XP_003540154.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           3-I-like [Glycine max]
          Length = 689

 Score = 61.6 bits (148), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 37/109 (33%), Positives = 59/109 (54%), Gaps = 8/109 (7%)

Query: 13  RNNPFHFKH--VKLCHSLAELAKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKV 70
           +N P ++     K C ++ E   +  M+D++    + A++NPF FKHV    S+ E+ K 
Sbjct: 273 QNIPIYYASPLAKKCLTVYETYTLS-MNDRI----QNAKSNPFSFKHVSALSSI-EVFKD 326

Query: 71  PSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLI 119
             P VV+ S   ++ G SR LF  WCS  +NS ++      GTLA+ +I
Sbjct: 327 VGPSVVMASPGGLQSGLSRQLFDMWCSDKKNSCVLPGYVVEGTLAKTII 375


>gi|170060909|ref|XP_001866010.1| cleavage and polyadenylation specificity factor [Culex
           quinquefasciatus]
 gi|167879247|gb|EDS42630.1| cleavage and polyadenylation specificity factor [Culex
           quinquefasciatus]
          Length = 688

 Score = 61.6 bits (148), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 32/86 (37%), Positives = 47/86 (54%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I  M+DK+ +    A NNPF F+H+     +     +  P VV+ S   M+ G SR+LF 
Sbjct: 293 INAMNDKIRRQI--AVNNPFVFRHISNLKGIDHFEDI-GPCVVMASPGMMQSGLSRELFE 349

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WCS P+N +II      GTLA+ ++
Sbjct: 350 TWCSDPKNGVIIAGYCVEGTLAKTVL 375



 Score = 40.0 bits (92), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 16/33 (48%), Positives = 21/33 (63%)

Query: 208 VVLVSTPDMECGFSRDLFFQWCSSPENSIIITN 240
           VV+ S   M+ G SR+LF  WCS P+N +II  
Sbjct: 331 VVMASPGMMQSGLSRELFETWCSDPKNGVIIAG 363


>gi|50549403|ref|XP_502172.1| YALI0C23232p [Yarrowia lipolytica]
 gi|49648039|emb|CAG82492.1| YALI0C23232p [Yarrowia lipolytica CLIB122]
          Length = 799

 Score = 61.6 bits (148), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 48/156 (30%), Positives = 76/156 (48%), Gaps = 20/156 (12%)

Query: 22  VKLCHSLAEL-----AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKV---PSP 73
           V L H  A L     + +EWMS  ++  +E    +PF  ++ K+ HS+ +  KV    + 
Sbjct: 283 VLLTHLGARLLEYASSMLEWMSPSIIAEWEKKNESPFQTRNFKIVHSMDQFDKVVKGGNG 342

Query: 74  KVVLVSTP-DMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDL------IELGGNRT 126
           + V+VS   D+E GFSR LF +  S   NS++ T R+   +LA +L       E  GN  
Sbjct: 343 QFVVVSVGEDLESGFSRLLFNRLASDERNSVLFTERSEGNSLATELQDKWEKTERDGNSA 402

Query: 127 -----LTLQVKKRIRLEGEELEEYQKKKDKEAKDKQ 157
                 TL++     L   E++EY+   + + KD Q
Sbjct: 403 KMDFQTTLKMPTYTPLSEAEMKEYRTTVESQQKDLQ 438


>gi|363750442|ref|XP_003645438.1| hypothetical protein Ecym_3113 [Eremothecium cymbalariae
           DBVPG#7215]
 gi|356889072|gb|AET38621.1| Hypothetical protein Ecym_3113 [Eremothecium cymbalariae
           DBVPG#7215]
          Length = 773

 Score = 61.6 bits (148), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 31/86 (36%), Positives = 49/86 (56%), Gaps = 1/86 (1%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+DK+ K F  ++ NPF FK++    +L E      P V+L S   ++ G SRDL  
Sbjct: 295 VNMMNDKIRKKFRDSQTNPFIFKNISYLKNLDEFQDF-GPSVMLASPGMLQNGLSRDLLE 353

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
           +WC   +N ++IT  +  GT+A+ LI
Sbjct: 354 KWCPDEKNLVLITGYSVEGTMAKFLI 379


>gi|156343760|ref|XP_001621104.1| hypothetical protein NEMVEDRAFT_v1g222359 [Nematostella vectensis]
 gi|156206741|gb|EDO29004.1| predicted protein [Nematostella vectensis]
          Length = 388

 Score = 61.2 bits (147), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 32/85 (37%), Positives = 48/85 (56%), Gaps = 3/85 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+DK+ K    A +NPF FKH+    S+ +   +  P VV+ S   M+ G SR+LF 
Sbjct: 221 VNAMNDKIKKQI--AISNPFVFKHISNLKSIDQFDDI-GPSVVMASPGMMQSGLSRELFE 277

Query: 94  QWCSSPENSIIITNRTSPGTLARDL 118
           QWC+   N +II      GTLA+++
Sbjct: 278 QWCTDRRNGVIIAGYCVEGTLAKEV 302


>gi|326487902|dbj|BAJ89790.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 704

 Score = 61.2 bits (147), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 30/86 (34%), Positives = 50/86 (58%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I  M++++   F  A++NPFHFKH+   +S+     V  P VV+ S   ++ G SR LF 
Sbjct: 310 INSMNERIRNQF--AQSNPFHFKHIDPLNSIDNFHDV-GPSVVMASPGSLQSGLSRQLFD 366

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
           +WC+  +N+ +I      G+LA+ +I
Sbjct: 367 KWCTDKKNTCVIPGYAVEGSLAKTII 392


>gi|326495416|dbj|BAJ85804.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 704

 Score = 61.2 bits (147), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 30/86 (34%), Positives = 50/86 (58%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I  M++++   F  A++NPFHFKH+   +S+     V  P VV+ S   ++ G SR LF 
Sbjct: 310 INSMNERIRNQF--AQSNPFHFKHIDPLNSIDNFHDV-GPSVVMASPGSLQSGLSRQLFD 366

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
           +WC+  +N+ +I      G+LA+ +I
Sbjct: 367 KWCTDKKNTCVIPGYAVEGSLAKTII 392


>gi|326508058|dbj|BAJ86772.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 704

 Score = 61.2 bits (147), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 30/86 (34%), Positives = 50/86 (58%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I  M++++   F  A++NPFHFKH+   +S+     V  P VV+ S   ++ G SR LF 
Sbjct: 310 INSMNERIRNQF--AQSNPFHFKHIDPLNSIDNFHDV-GPSVVMASPGSLQSGLSRQLFD 366

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
           +WC+  +N+ +I      G+LA+ +I
Sbjct: 367 KWCTDKKNTCVIPGYAVEGSLAKTII 392


>gi|213512037|ref|NP_001133354.1| cleavage and polyadenylation specificity factor subunit 3 [Salmo
           salar]
 gi|209151738|gb|ACI33081.1| Cleavage and polyadenylation specificity factor subunit 3 [Salmo
           salar]
          Length = 690

 Score = 61.2 bits (147), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 31/86 (36%), Positives = 47/86 (54%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+DK+ K+     NNPF FKH+    S+     +  P VV+ S   M+ G SR+LF 
Sbjct: 293 VNAMNDKIRKAIN--VNNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 349

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+   N +II   +  GTLA+ ++
Sbjct: 350 SWCTDKRNGVIIAGYSVEGTLAKHIM 375


>gi|157117185|ref|XP_001652976.1| cleavage and polyadenylation specificity factor [Aedes aegypti]
 gi|108876120|gb|EAT40345.1| AAEL007904-PA [Aedes aegypti]
          Length = 687

 Score = 60.8 bits (146), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 31/86 (36%), Positives = 47/86 (54%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I  M+DK+ +    A NNPF F+H+     +     +  P VV+ S   M+ G SR+LF 
Sbjct: 293 INAMNDKIRRQI--AVNNPFVFRHISNLKGIDHFEDI-GPCVVMASPGMMQSGLSRELFE 349

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+ P+N +II      GTLA+ ++
Sbjct: 350 TWCTDPKNGVIIAGYCVEGTLAKTIL 375


>gi|378733596|gb|EHY60055.1| hypothetical protein HMPREF1120_08027 [Exophiala dermatitidis
           NIH/UT8656]
          Length = 948

 Score = 60.5 bits (145), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 52/196 (26%), Positives = 80/196 (40%), Gaps = 60/196 (30%)

Query: 4   KLMKSFEGARNNPFHFKHVKLCHSLAELAKIEWMSDKLMKSFEGARNNP----------- 52
           K +K +  +R+     +H +        + +EWM D +++ FEG   NP           
Sbjct: 304 KDVKVYMASRSGNATLRHAR--------SLLEWMDDSIVREFEGEDENPTTQPYNRRGGN 355

Query: 53  -----------FHFKHVKLC---HSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSS 98
                      F FK+VK+    H L +L KV  P+V+L S   ++ GFSR L       
Sbjct: 356 KAAGTNKPSRPFEFKNVKVVERKHQLEKLLKVEGPRVILASDVTLDWGFSRSLLEHVVQK 415

Query: 99  PENSIIITNR-------TSPG------------TLARDLIELG--------GNRTLTLQV 131
           PEN +I+T R        SPG             +A +  E G        G R L L+ 
Sbjct: 416 PENLVILTERLNVRPGSESPGQAFWQWFEQRQDGVALERTEGGGQLEQVHSGGRMLKLKN 475

Query: 132 KKRIRLEGEELEEYQK 147
            ++  L  +E + YQ+
Sbjct: 476 PEKAPLSAQESQRYQQ 491


>gi|374110195|gb|AEY99100.1| FAGR279Cp [Ashbya gossypii FDAG1]
          Length = 771

 Score = 60.5 bits (145), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 30/86 (34%), Positives = 49/86 (56%), Gaps = 1/86 (1%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+DK+ K F  ++ NPF FK++    +L E      P V+L S   ++ G SRDL  
Sbjct: 295 VNMMNDKIRKKFRDSQTNPFIFKNISYLKNLDEFQDF-GPSVMLASPGMLQNGLSRDLLE 353

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
           +WC   +N ++IT  +  GT+A+ L+
Sbjct: 354 KWCPDEKNLVLITGYSVEGTMAKFLM 379


>gi|302309512|ref|NP_986945.2| AGR279Cp [Ashbya gossypii ATCC 10895]
 gi|442570103|sp|Q74ZC0.2|YSH1_ASHGO RecName: Full=Endoribonuclease YSH1; AltName: Full=mRNA
           3'-end-processing protein YSH1
 gi|299788393|gb|AAS54769.2| AGR279Cp [Ashbya gossypii ATCC 10895]
          Length = 771

 Score = 60.5 bits (145), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 30/86 (34%), Positives = 49/86 (56%), Gaps = 1/86 (1%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+DK+ K F  ++ NPF FK++    +L E      P V+L S   ++ G SRDL  
Sbjct: 295 VNMMNDKIRKKFRDSQTNPFIFKNISYLKNLDEFQDF-GPSVMLASPGMLQNGLSRDLLE 353

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
           +WC   +N ++IT  +  GT+A+ L+
Sbjct: 354 KWCPDEKNLVLITGYSVEGTMAKFLM 379


>gi|240975718|ref|XP_002402161.1| cleavage and polyadenylation specificity factor, putative [Ixodes
           scapularis]
 gi|215491113|gb|EEC00754.1| cleavage and polyadenylation specificity factor, putative [Ixodes
           scapularis]
          Length = 694

 Score = 60.5 bits (145), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 29/70 (41%), Positives = 40/70 (57%), Gaps = 1/70 (1%)

Query: 50  NNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRT 109
           NNPF FKH+    S+     V  P VV+ S   M+ G SR+LF  WC+ P+N +II    
Sbjct: 313 NNPFVFKHISNLKSIEHFEDV-GPCVVMASPGMMQSGLSRELFESWCTDPKNGVIIAGYC 371

Query: 110 SPGTLARDLI 119
             GTLA+ ++
Sbjct: 372 VEGTLAKTIL 381



 Score = 38.9 bits (89), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 15/33 (45%), Positives = 21/33 (63%)

Query: 208 VVLVSTPDMECGFSRDLFFQWCSSPENSIIITN 240
           VV+ S   M+ G SR+LF  WC+ P+N +II  
Sbjct: 337 VVMASPGMMQSGLSRELFESWCTDPKNGVIIAG 369


>gi|321461562|gb|EFX72593.1| hypothetical protein DAPPUDRAFT_308207 [Daphnia pulex]
          Length = 689

 Score = 60.5 bits (145), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 32/86 (37%), Positives = 47/86 (54%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I  M+DK+ +    A NNPF FKH+     + +   V  P V++ S   M+ G SR+LF 
Sbjct: 292 INAMNDKIRRQI--AINNPFIFKHISSLKGIDQFEDV-GPCVIMASPGMMQSGLSRELFE 348

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+ P+N  II      GTLA+ ++
Sbjct: 349 AWCTDPKNGCIIAGYCVEGTLAKHVL 374


>gi|385304166|gb|EIF48196.1| subunit of mrna cleavage and polyadenylation [Dekkera bruxellensis
           AWRI1499]
          Length = 664

 Score = 60.5 bits (145), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 46/179 (25%), Positives = 80/179 (44%), Gaps = 20/179 (11%)

Query: 13  RNNPFHFKHVKLCHSLAELAK-IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVP 71
           R  PF+        SL   A  +EWMS  + K ++     PF    ++LC ++ +L+K  
Sbjct: 41  REVPFYLVAYTGIQSLKSSASMLEWMSPDVTKDWDNENQTPFDSTRIELC-TVEDLSKHT 99

Query: 72  SPKVVLVSTPDMECG-FSRDLFFQWCSSPENSIIITNRTSPGTLARDL-------IELGG 123
            PKVV V +   + G  +R +F + CS    ++ +T R   GT+  D+       +    
Sbjct: 100 GPKVVFVESLGFDEGSLARSVFVELCSKQNTALFLTERPPDGTVLYDIYNTWEETVXXDS 159

Query: 124 N----------RTLTLQVKKRIRLEGEELEEYQKKKDKEAKDKQEKEKIPPHDTSFINE 172
           N          + LTL + K   L G+EL+ Y    D+    +Q++E     ++  +N+
Sbjct: 160 NLKDGSLIIFQKQLTLSMIKETXLRGQELKAYLSXVDQRRXKRQKEELEEXKNSHLLND 218


>gi|365990355|ref|XP_003672007.1| hypothetical protein NDAI_0I01950 [Naumovozyma dairenensis CBS 421]
 gi|343770781|emb|CCD26764.1| hypothetical protein NDAI_0I01950 [Naumovozyma dairenensis CBS 421]
          Length = 757

 Score = 60.5 bits (145), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 49/86 (56%), Gaps = 1/86 (1%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+D + K F  ++ NPF FK++    +L E   +  P V+L S   ++ G SRDL  
Sbjct: 258 VNMMNDDIRKKFRDSQTNPFIFKNISYLKNLEEFQDL-GPSVMLASPGMLQSGLSRDLLE 316

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
           +WC   +N ++IT  +  GT+A+ L+
Sbjct: 317 RWCPDEKNLVLITGYSIEGTMAKYLM 342


>gi|224140921|ref|XP_002323825.1| predicted protein [Populus trichocarpa]
 gi|222866827|gb|EEF03958.1| predicted protein [Populus trichocarpa]
          Length = 696

 Score = 60.5 bits (145), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 30/83 (36%), Positives = 48/83 (57%), Gaps = 3/83 (3%)

Query: 37  MSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWC 96
           M++++   F  A +NPF FKH+   +S+ + + V  P VV+ S   ++ G SR LF  WC
Sbjct: 302 MNERIRNQF--ANSNPFKFKHISPLNSIEDFSDV-GPSVVMASPGGLQSGLSRQLFDMWC 358

Query: 97  SSPENSIIITNRTSPGTLARDLI 119
           S  +N+ +I      GTLA+ +I
Sbjct: 359 SDKKNACVIPGYVVEGTLAKTII 381


>gi|412990885|emb|CCO18257.1| predicted protein [Bathycoccus prasinos]
          Length = 825

 Score = 60.5 bits (145), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 30/86 (34%), Positives = 50/86 (58%), Gaps = 2/86 (2%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I  ++  + + FE    NPF+FKHV+  +  ++L     P VVL +   ++ G SR+LF 
Sbjct: 338 INVLNADIKRQFE--EKNPFNFKHVQSLNRASDLDGNTGPCVVLATPSMLQSGTSRELFE 395

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC S +N ++I +    GTLAR+++
Sbjct: 396 NWCESSDNGVVICDFAVQGTLAREIL 421


>gi|328704356|ref|XP_001945120.2| PREDICTED: cleavage and polyadenylation specificity factor subunit
           3-like [Acyrthosiphon pisum]
          Length = 694

 Score = 60.1 bits (144), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 31/86 (36%), Positives = 47/86 (54%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I  M+D++ +    A NNPF FKH+    S+     +  P V++ S   ME G SR+LF 
Sbjct: 296 INAMNDRIKRQI--AVNNPFVFKHITNLKSIDHFEDI-GPCVIMASPGVMESGLSRELFE 352

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+  +N +II      GTLA+ ++
Sbjct: 353 MWCTDSKNGVIIAGYVVQGTLAKAIL 378


>gi|19112240|ref|NP_595448.1| cleavage factor two Cft2/polyadenylation factor CPSF-73 (predicted)
           [Schizosaccharomyces pombe 972h-]
 gi|74582548|sp|O74740.1|CFT2_SCHPO RecName: Full=Cleavage factor two protein 2
 gi|3738153|emb|CAA21254.1| cleavage factor two Cft2/polyadenylation factor CPSF-73 (predicted)
           [Schizosaccharomyces pombe]
          Length = 797

 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 40/145 (27%), Positives = 70/145 (48%), Gaps = 24/145 (16%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKV-PSPKVVLVSTPDMECGFSRDLF 92
           IEWM D +++ F G   N   F+++      ++++ + P PKV+L +   +ECGFS+ + 
Sbjct: 275 IEWMGDNIVRDF-GINENLLEFRNINTITDFSQISHIGPGPKVILATALTLECGFSQRIL 333

Query: 93  FQWCSSPENSIII---TNRTSPGTLARDLIE---------------LGGNRTLTLQVKKR 134
               S   N +I+    +R    +LA   I                +G      +++K +
Sbjct: 334 LDLMSENSNDLILFTQRSRCPQNSLANQFIRYWERASKKKRDIPHPVGLYAEQAVKIKTK 393

Query: 135 IRLEGEELEEYQK----KKDKEAKD 155
             LEGEEL  YQ+    K++K+A+D
Sbjct: 394 EPLEGEELRSYQELEFSKRNKDAED 418


>gi|432954006|ref|XP_004085503.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           3-like [Oryzias latipes]
          Length = 686

 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 32/86 (37%), Positives = 46/86 (53%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I  M+DK+ K+     NNPF FKH+    S+     +  P VV+ S   M+ G SR+LF 
Sbjct: 293 INAMNDKIRKAIN--VNNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 349

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+   N +II      GTLA+ ++
Sbjct: 350 SWCTDKRNGVIIAGYCVEGTLAKHIM 375


>gi|190346159|gb|EDK38177.2| hypothetical protein PGUG_02275 [Meyerozyma guilliermondii ATCC
           6260]
          Length = 770

 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 27/78 (34%), Positives = 48/78 (61%), Gaps = 1/78 (1%)

Query: 42  MKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPEN 101
           + S  G ++NPF FKH+KL  SL +   +  P VV+ +   ++ G SR+L  +W    +N
Sbjct: 307 VSSASGGKSNPFQFKHIKLIRSLDKFQDI-GPCVVVAAPGMLQNGVSRELLERWAPDAKN 365

Query: 102 SIIITNRTSPGTLARDLI 119
           ++I+T  +  GT+A++L+
Sbjct: 366 AVIMTGYSVEGTMAKELL 383


>gi|357445375|ref|XP_003592965.1| Cleavage and polyadenylation specificity factor subunit 3-I
           [Medicago truncatula]
 gi|357445453|ref|XP_003593004.1| Cleavage and polyadenylation specificity factor subunit 3-I
           [Medicago truncatula]
 gi|355482013|gb|AES63216.1| Cleavage and polyadenylation specificity factor subunit 3-I
           [Medicago truncatula]
 gi|355482052|gb|AES63255.1| Cleavage and polyadenylation specificity factor subunit 3-I
           [Medicago truncatula]
          Length = 690

 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 35/109 (32%), Positives = 59/109 (54%), Gaps = 8/109 (7%)

Query: 13  RNNPFHFKH--VKLCHSLAELAKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKV 70
           +N P ++     K C ++ E   +  M+D++    + A++NPF FKH+    S+ ++ K 
Sbjct: 274 QNIPIYYASPLAKKCLTVYETYTLS-MNDRI----QNAKSNPFAFKHISALSSI-DIFKD 327

Query: 71  PSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLI 119
             P VV+ S   ++ G SR LF  WCS  +NS +I      GTLA+ ++
Sbjct: 328 VGPSVVMASPGGLQSGLSRQLFDMWCSDKKNSCVIPGYVVEGTLAKTIL 376


>gi|410928245|ref|XP_003977511.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           3-like [Takifugu rubripes]
          Length = 696

 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 32/86 (37%), Positives = 46/86 (53%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I  M+DK+ K+     NNPF FKH+    S+     +  P VV+ S   M+ G SR+LF 
Sbjct: 293 INAMNDKIRKAIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 349

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+   N +II      GTLA+ ++
Sbjct: 350 SWCTDKRNGVIIAGYCVEGTLAKHIM 375


>gi|357114659|ref|XP_003559115.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           3-I-like [Brachypodium distachyon]
          Length = 768

 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 30/86 (34%), Positives = 51/86 (59%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I  M++++   F  A++NPFHFKH++  +S+     V  P VV+ S   ++ G SR LF 
Sbjct: 373 INSMNERIRNQF--AQSNPFHFKHIEPLNSIDNFHDV-GPSVVMASPGTLQSGLSRQLFD 429

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
           +WC+  +N+ +I      GTL++ +I
Sbjct: 430 KWCTDKKNTCVIPGFVIEGTLSKTII 455


>gi|427779921|gb|JAA55412.1| Putative cleavage and polyadenylation specificity factor cpsf
           subunit [Rhipicephalus pulchellus]
          Length = 737

 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 28/70 (40%), Positives = 40/70 (57%), Gaps = 1/70 (1%)

Query: 50  NNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRT 109
           NNPF FKH+    S+     +  P VV+ S   M+ G SR+LF  WC+ P+N +II    
Sbjct: 356 NNPFVFKHISNLKSIEHFEDI-GPCVVMASPGMMQSGLSRELFESWCTDPKNGVIIAGYC 414

Query: 110 SPGTLARDLI 119
             GTLA+ ++
Sbjct: 415 VEGTLAKTIL 424



 Score = 38.9 bits (89), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 15/33 (45%), Positives = 21/33 (63%)

Query: 208 VVLVSTPDMECGFSRDLFFQWCSSPENSIIITN 240
           VV+ S   M+ G SR+LF  WC+ P+N +II  
Sbjct: 380 VVMASPGMMQSGLSRELFESWCTDPKNGVIIAG 412


>gi|346466613|gb|AEO33151.1| hypothetical protein [Amblyomma maculatum]
          Length = 618

 Score = 59.7 bits (143), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 28/70 (40%), Positives = 40/70 (57%), Gaps = 1/70 (1%)

Query: 50  NNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRT 109
           NNPF FKH+    S+     +  P VV+ S   M+ G SR+LF  WC+ P+N +II    
Sbjct: 237 NNPFVFKHISNLKSIEHFEDI-GPCVVMASPGMMQSGLSRELFESWCTDPKNGVIIAGYC 295

Query: 110 SPGTLARDLI 119
             GTLA+ ++
Sbjct: 296 VEGTLAKTIL 305



 Score = 38.9 bits (89), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 15/33 (45%), Positives = 21/33 (63%)

Query: 208 VVLVSTPDMECGFSRDLFFQWCSSPENSIIITN 240
           VV+ S   M+ G SR+LF  WC+ P+N +II  
Sbjct: 261 VVMASPGMMQSGLSRELFESWCTDPKNGVIIAG 293


>gi|395828536|ref|XP_003787428.1| PREDICTED: LOW QUALITY PROTEIN: cleavage and polyadenylation
           specificity factor subunit 3 [Otolemur garnettii]
          Length = 634

 Score = 59.7 bits (143), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 32/86 (37%), Positives = 45/86 (52%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+DK+ K      NNPF FKH+    S+     +  P VV+ S   M+ G SR+LF 
Sbjct: 285 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 341

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+   N +II      GTLA+ L+
Sbjct: 342 SWCTDKRNGVIIAGYCVEGTLAKILV 367


>gi|384252038|gb|EIE25515.1| Metallo-hydrolase/oxidoreductase [Coccomyxa subellipsoidea C-169]
          Length = 696

 Score = 59.7 bits (143), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 41/155 (26%), Positives = 72/155 (46%), Gaps = 4/155 (2%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I  M+D +  +F G   NPF+FK++    +   L  V  P VVL +   ++ G SR+LF 
Sbjct: 285 IAMMNDDIKAAF-GQSANPFNFKYITELKTQGGLDDV-GPCVVLATPSMLQSGLSRELFD 342

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKDKEA 153
            WC    N +II +    GTLARD++    +       K  +R+  E +  +    D + 
Sbjct: 343 AWCEDKRNGVIIADFAVQGTLARDILASPSHVLTKAGAKVPLRMSVEHI-SFSAHADFDQ 401

Query: 154 KDKQEKEKIPPHDTSFINE-LQLSDFKQTLQRNGI 187
             +  +   PPH      E +++   ++ L++  +
Sbjct: 402 TSQFVELLDPPHVILVHGEAVEMGRLRKALEQQAV 436


>gi|344232758|gb|EGV64631.1| Metallo-hydrolase/oxidoreductase [Candida tenuis ATCC 10573]
          Length = 782

 Score = 59.7 bits (143), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 27/71 (38%), Positives = 42/71 (59%), Gaps = 1/71 (1%)

Query: 49  RNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 108
           R NPF FK++K   SL        P VV+ S   ++ GFSR L  +W   P+N++++T  
Sbjct: 313 RKNPFTFKYIKTLKSLDRFQDF-GPSVVIASPGMLQSGFSRQLLEKWAPDPKNTVLMTGY 371

Query: 109 TSPGTLARDLI 119
           +  GT+A+DL+
Sbjct: 372 SVEGTMAKDLL 382


>gi|195343244|ref|XP_002038208.1| GM18692 [Drosophila sechellia]
 gi|194133058|gb|EDW54626.1| GM18692 [Drosophila sechellia]
          Length = 684

 Score = 59.7 bits (143), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 47/86 (54%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I  M+D++ +    A NNPF F+H+     +     +  P V++ S   M+ G SR+LF 
Sbjct: 292 INAMNDRIRRQI--AVNNPFVFRHISNLKGIDHFEDI-GPCVIMASPGMMQSGLSRELFE 348

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+ P+N +II      GTLA+ ++
Sbjct: 349 SWCTDPKNGVIIAGYCVEGTLAKTVL 374


>gi|194743214|ref|XP_001954095.1| GF18101 [Drosophila ananassae]
 gi|190627132|gb|EDV42656.1| GF18101 [Drosophila ananassae]
          Length = 684

 Score = 59.7 bits (143), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 47/86 (54%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I  M+D++ +    A NNPF F+H+     +     +  P V++ S   M+ G SR+LF 
Sbjct: 292 INAMNDRIRRQI--AVNNPFVFRHISNLKGIDHFEDI-GPCVIMASPGMMQSGLSRELFE 348

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+ P+N +II      GTLA+ ++
Sbjct: 349 SWCTDPKNGVIIAGYCVEGTLAKTIL 374


>gi|195569857|ref|XP_002102925.1| GD20157 [Drosophila simulans]
 gi|194198852|gb|EDX12428.1| GD20157 [Drosophila simulans]
          Length = 684

 Score = 59.7 bits (143), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 47/86 (54%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I  M+D++ +    A NNPF F+H+     +     +  P V++ S   M+ G SR+LF 
Sbjct: 292 INAMNDRIRRQI--AVNNPFVFRHISNLKGIDHFEDI-GPCVIMASPGMMQSGLSRELFE 348

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+ P+N +II      GTLA+ ++
Sbjct: 349 SWCTDPKNGVIIAGYCVEGTLAKTVL 374


>gi|198451826|ref|XP_001358526.2| GA20526 [Drosophila pseudoobscura pseudoobscura]
 gi|198131664|gb|EAL27667.2| GA20526 [Drosophila pseudoobscura pseudoobscura]
          Length = 684

 Score = 59.7 bits (143), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 47/86 (54%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I  M+D++ +    A NNPF F+H+     +     +  P V++ S   M+ G SR+LF 
Sbjct: 292 INAMNDRIRRQI--AVNNPFVFRHISNLKGIDHFEDI-GPCVIMASPGMMQSGLSRELFE 348

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+ P+N +II      GTLA+ ++
Sbjct: 349 SWCTDPKNGVIIAGYCVEGTLAKTIL 374


>gi|195145744|ref|XP_002013850.1| GL23169 [Drosophila persimilis]
 gi|194102793|gb|EDW24836.1| GL23169 [Drosophila persimilis]
          Length = 684

 Score = 59.7 bits (143), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 47/86 (54%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I  M+D++ +    A NNPF F+H+     +     +  P V++ S   M+ G SR+LF 
Sbjct: 292 INAMNDRIRRQI--AVNNPFVFRHISNLKGIDHFEDI-GPCVIMASPGMMQSGLSRELFE 348

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+ P+N +II      GTLA+ ++
Sbjct: 349 SWCTDPKNGVIIAGYCVEGTLAKTIL 374


>gi|221106537|ref|XP_002161150.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           3-like [Hydra magnipapillata]
          Length = 677

 Score = 59.7 bits (143), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 31/86 (36%), Positives = 47/86 (54%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I  M++K+ +    + +NPF FKH+     +     +  P VVL S   M+ G SR+LF 
Sbjct: 284 ISAMNEKIRRQI--SISNPFVFKHISNLKGIDSFDDI-GPSVVLASPGMMQSGLSRELFE 340

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+ P N +II      GTLA++L+
Sbjct: 341 TWCTDPRNGVIIAGYCVEGTLAKELM 366



 Score = 39.3 bits (90), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 16/33 (48%), Positives = 20/33 (60%)

Query: 208 VVLVSTPDMECGFSRDLFFQWCSSPENSIIITN 240
           VVL S   M+ G SR+LF  WC+ P N +II  
Sbjct: 322 VVLASPGMMQSGLSRELFETWCTDPRNGVIIAG 354


>gi|410898094|ref|XP_003962533.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           3-like [Takifugu rubripes]
          Length = 691

 Score = 59.7 bits (143), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 31/86 (36%), Positives = 46/86 (53%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+DK+ K+     NNPF FKH+    S+     +  P VV+ S   M+ G SR+LF 
Sbjct: 293 VNAMNDKIRKAIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 349

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+   N +II      GTLA+ ++
Sbjct: 350 SWCTDKRNGVIIAGYCVEGTLAKHIM 375


>gi|348518441|ref|XP_003446740.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           3-like [Oreochromis niloticus]
          Length = 686

 Score = 59.7 bits (143), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 32/86 (37%), Positives = 46/86 (53%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I  M+DK+ K+     NNPF FKH+    S+     +  P VV+ S   M+ G SR+LF 
Sbjct: 293 INAMNDKIRKAIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 349

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+   N +II      GTLA+ ++
Sbjct: 350 SWCTDRRNGVIIAGYCVEGTLAKHIM 375


>gi|209877326|ref|XP_002140105.1| hypothetical protein [Cryptosporidium muris RN66]
 gi|209555711|gb|EEA05756.1| hypothetical protein, conserved [Cryptosporidium muris RN66]
          Length = 1097

 Score = 59.7 bits (143), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 34/86 (39%), Positives = 46/86 (53%), Gaps = 4/86 (4%)

Query: 34  IEWMSDKLMKSFEGARNNPFH-FKHVKLCHSLAELAKVPS---PKVVLVSTPDMECGFSR 89
           IEWMS  +   F   R NPFH  K++ + ++L  +   PS   PKV+  S   M+ G+SR
Sbjct: 326 IEWMSLDIRSEFCNTRFNPFHGLKNILIENTLNNVRTGPSAKLPKVIFASPASMDYGYSR 385

Query: 90  DLFFQWCSSPENSIIITNRTSPGTLA 115
           +LF    SSP NS+I T      T A
Sbjct: 386 ELFADLASSPNNSVIFTREPKLNTFA 411


>gi|24648013|ref|NP_650738.1| cleavage and polyadenylation specificity factor 73 [Drosophila
           melanogaster]
 gi|21430620|gb|AAM50988.1| RE31408p [Drosophila melanogaster]
 gi|23171662|gb|AAF55578.2| cleavage and polyadenylation specificity factor 73 [Drosophila
           melanogaster]
 gi|220948314|gb|ACL86700.1| CG7698-PA [synthetic construct]
          Length = 684

 Score = 59.7 bits (143), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 47/86 (54%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I  M+D++ +    A NNPF F+H+     +     +  P V++ S   M+ G SR+LF 
Sbjct: 292 INAMNDRIRRQI--AVNNPFVFRHISNLKGIDHFEDI-GPCVIMASPGMMQSGLSRELFE 348

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+ P+N +II      GTLA+ ++
Sbjct: 349 SWCTDPKNGVIIAGYCVEGTLAKAVL 374


>gi|348531581|ref|XP_003453287.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           3-like [Oreochromis niloticus]
          Length = 690

 Score = 59.7 bits (143), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 31/86 (36%), Positives = 46/86 (53%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+DK+ K+     NNPF FKH+    S+     +  P VV+ S   M+ G SR+LF 
Sbjct: 293 VNAMNDKIRKAIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 349

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+   N +II      GTLA+ ++
Sbjct: 350 SWCTDKRNGVIIAGYCVEGTLAKHIM 375


>gi|359486187|ref|XP_002271646.2| PREDICTED: cleavage and polyadenylation specificity factor subunit
           3-I-like [Vitis vinifera]
          Length = 693

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 48/180 (26%), Positives = 78/180 (43%), Gaps = 8/180 (4%)

Query: 13  RNNPFHFKH--VKLCHSLAELAKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKV 70
            N P ++     K C ++ +   I  M++++   F  A +NPF FKH+    S+     V
Sbjct: 275 HNVPIYYASPLAKRCMAVYQ-TYINSMNERIRNQF--ANSNPFDFKHISPLKSIENFNDV 331

Query: 71  PSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQ 130
             P VV+ S   ++ G SR LF  WCS  +N+ +I      GTLA+ +I      TL   
Sbjct: 332 -GPSVVMASPGGLQSGLSRQLFDMWCSDKKNACVIPGYVVGGTLAKTIINEPKEVTLMNG 390

Query: 131 VKKRIRLEGEELEEYQKKKDKEAKDKQEKEKIPPHDTSFINEL-QLSDFKQTLQRNGIDC 189
           +   + ++   +  +    D        KE +PP+      E  ++   KQ L     DC
Sbjct: 391 LTAPLNMQVHYI-SFSAHADFAQTSTFLKELMPPNIILVHGEANEMGRLKQKLITQFADC 449


>gi|194900154|ref|XP_001979622.1| GG16362 [Drosophila erecta]
 gi|190651325|gb|EDV48580.1| GG16362 [Drosophila erecta]
          Length = 684

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 47/86 (54%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I  M+D++ +    A NNPF F+H+     +     +  P V++ S   M+ G SR+LF 
Sbjct: 292 INAMNDRIRRQI--AVNNPFVFRHISNLKGIDHFEDI-GPCVIMASPGMMQSGLSRELFE 348

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+ P+N +II      GTLA+ ++
Sbjct: 349 SWCTDPKNGVIIAGYCVEGTLAKAVL 374


>gi|116283804|gb|AAH30988.1| CPSF3 protein [Homo sapiens]
          Length = 554

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 32/83 (38%), Positives = 44/83 (53%), Gaps = 3/83 (3%)

Query: 37  MSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWC 96
           M+DK+ K      NNPF FKH+    S+     +  P VV+ S   M+ G SR+LF  WC
Sbjct: 289 MNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFESWC 345

Query: 97  SSPENSIIITNRTSPGTLARDLI 119
           +   N +II      GTLA+ L+
Sbjct: 346 TDKRNGVIIAGYCVEGTLAKILV 368


>gi|297267035|ref|XP_001114469.2| PREDICTED: cleavage and polyadenylation specificity factor subunit
           3-like [Macaca mulatta]
          Length = 234

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 40/148 (27%), Positives = 68/148 (45%), Gaps = 4/148 (2%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+DK+ K      NNPF FKH+    S+     +  P VV+ S   M+ G SR+LF 
Sbjct: 42  VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 98

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKDKEA 153
            WC+   N +II      GTLA+ ++      T     K  +++  + +  +    D + 
Sbjct: 99  SWCTDKRNGVIIAGYCVEGTLAKHIMSEPEEITTMSGQKLPLKMSVDYI-SFSAHTDYQQ 157

Query: 154 KDKQEKEKIPPHDTSFINELQLSDFKQT 181
             +  +   PPH  S +N +  +  K++
Sbjct: 158 TSEFIRALKPPHVVSLMNLIYCTKGKRS 185


>gi|195497711|ref|XP_002096215.1| GE25184 [Drosophila yakuba]
 gi|194182316|gb|EDW95927.1| GE25184 [Drosophila yakuba]
          Length = 684

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 47/86 (54%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I  M+D++ +    A NNPF F+H+     +     +  P V++ S   M+ G SR+LF 
Sbjct: 292 INAMNDRIRRQI--AVNNPFVFRHISNLKGIDHFEDI-GPCVIMASPGMMQSGLSRELFE 348

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+ P+N +II      GTLA+ ++
Sbjct: 349 SWCTDPKNGVIIAGYCVEGTLAKAVL 374


>gi|297837375|ref|XP_002886569.1| hypothetical protein ARALYDRAFT_475225 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297332410|gb|EFH62828.1| hypothetical protein ARALYDRAFT_475225 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 693

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 31/83 (37%), Positives = 47/83 (56%), Gaps = 3/83 (3%)

Query: 37  MSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWC 96
           M+D++   F  A +NPF FKH+   +S+ +   V  P VV+ +   ++ G SR LF  WC
Sbjct: 299 MNDRIRNQF--ANSNPFVFKHISPLNSIDDFNDV-GPSVVMATPGGLQSGLSRQLFDSWC 355

Query: 97  SSPENSIIITNRTSPGTLARDLI 119
           S  +N+ II      GTLA+ +I
Sbjct: 356 SDKKNACIIPGYMVEGTLAKTII 378


>gi|195037533|ref|XP_001990215.1| GH19212 [Drosophila grimshawi]
 gi|193894411|gb|EDV93277.1| GH19212 [Drosophila grimshawi]
          Length = 686

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 47/86 (54%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I  M+D++ +    A NNPF F+H+     +     +  P V++ S   M+ G SR+LF 
Sbjct: 294 INAMNDRIRRQI--AVNNPFVFRHISNLKGIDHFDDI-GPCVIMASPGMMQSGLSRELFE 350

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+ P+N +II      GTLA+ ++
Sbjct: 351 SWCTDPKNGVIIAGYCVEGTLAKTIL 376


>gi|195395198|ref|XP_002056223.1| GJ10819 [Drosophila virilis]
 gi|194142932|gb|EDW59335.1| GJ10819 [Drosophila virilis]
          Length = 686

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 47/86 (54%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I  M+D++ +    A NNPF F+H+     +     +  P V++ S   M+ G SR+LF 
Sbjct: 294 INAMNDRIRRQI--AVNNPFVFRHISNLKGIDHFDDI-GPCVIMASPGMMQSGLSRELFE 350

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+ P+N +II      GTLA+ ++
Sbjct: 351 SWCTDPKNGVIIAGYCVEGTLAKTIL 376


>gi|15219848|ref|NP_176297.1| cleavage and polyadenylation specificity factor subunit 3-I
           [Arabidopsis thaliana]
 gi|30696512|ref|NP_849835.1| cleavage and polyadenylation specificity factor subunit 3-I
           [Arabidopsis thaliana]
 gi|79320389|ref|NP_001031215.1| cleavage and polyadenylation specificity factor subunit 3-I
           [Arabidopsis thaliana]
 gi|75262219|sp|Q9C952.1|CPSF3_ARATH RecName: Full=Cleavage and polyadenylation specificity factor
           subunit 3-I; AltName: Full=Cleavage and polyadenylation
           specificity factor 73 kDa subunit I; Short=AtCPSF73-I;
           Short=CPSF 73 kDa subunit I
 gi|12323330|gb|AAG51638.1|AC018908_4 putative cleavage and polyadenylation specificity factor;
           72745-70039 [Arabidopsis thaliana]
 gi|23297661|gb|AAN13003.1| putative cleavage and polyadenylation specificity factor
           [Arabidopsis thaliana]
 gi|24415578|gb|AAN41458.1| putative cleavage and polyadenylation specificity factor 73 kDa
           subunit [Arabidopsis thaliana]
 gi|222422865|dbj|BAH19419.1| AT1G61010 [Arabidopsis thaliana]
 gi|222423059|dbj|BAH19511.1| AT1G61010 [Arabidopsis thaliana]
 gi|332195645|gb|AEE33766.1| cleavage and polyadenylation specificity factor subunit 3-I
           [Arabidopsis thaliana]
 gi|332195646|gb|AEE33767.1| cleavage and polyadenylation specificity factor subunit 3-I
           [Arabidopsis thaliana]
 gi|332195647|gb|AEE33768.1| cleavage and polyadenylation specificity factor subunit 3-I
           [Arabidopsis thaliana]
          Length = 693

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 31/83 (37%), Positives = 47/83 (56%), Gaps = 3/83 (3%)

Query: 37  MSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWC 96
           M+D++   F  A +NPF FKH+   +S+ +   V  P VV+ +   ++ G SR LF  WC
Sbjct: 299 MNDRIRNQF--ANSNPFVFKHISPLNSIDDFNDV-GPSVVMATPGGLQSGLSRQLFDSWC 355

Query: 97  SSPENSIIITNRTSPGTLARDLI 119
           S  +N+ II      GTLA+ +I
Sbjct: 356 SDKKNACIIPGYMVEGTLAKTII 378


>gi|50287519|ref|XP_446189.1| hypothetical protein [Candida glabrata CBS 138]
 gi|74637743|sp|Q6FUA5.1|YSH1_CANGA RecName: Full=Endoribonuclease YSH1; AltName: Full=mRNA
           3'-end-processing protein YSH1
 gi|49525496|emb|CAG59113.1| unnamed protein product [Candida glabrata]
          Length = 771

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 48/86 (55%), Gaps = 1/86 (1%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+D + K F  ++ NPF FK++    +L E      P V+L S   ++ G SRDL  
Sbjct: 293 VNMMNDNIRKKFRDSQTNPFIFKNIAYIKNLDEFQDF-GPSVMLASPGMLQNGLSRDLLE 351

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
           +WC   +N ++IT  +  GT+A+ L+
Sbjct: 352 RWCPDEKNLVLITGYSVEGTMAKYLL 377


>gi|195108751|ref|XP_001998956.1| GI24246 [Drosophila mojavensis]
 gi|193915550|gb|EDW14417.1| GI24246 [Drosophila mojavensis]
          Length = 686

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 47/86 (54%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I  M+D++ +    A NNPF F+H+     +     +  P V++ S   M+ G SR+LF 
Sbjct: 294 INAMNDRIRRQI--AVNNPFVFRHISNLKGIDHFDDI-GPCVIMASPGMMQSGLSRELFE 350

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+ P+N +II      GTLA+ ++
Sbjct: 351 SWCTDPKNGVIIAGYCVEGTLAKTIL 376


>gi|326503296|dbj|BAJ99273.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 693

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 50/86 (58%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I  M++++   F  A++NPFHFKH++  +S+     V  P VV+ S   ++ G SR LF 
Sbjct: 298 INSMNERIRNQF--AQSNPFHFKHIEPLNSIDNFHDV-GPSVVMASPGSLQSGLSRQLFD 354

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
           +WC+  +N+ +I      G+L + +I
Sbjct: 355 KWCTDKKNTCVIPGFAVEGSLVKTII 380


>gi|47213099|emb|CAF89519.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 810

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 32/86 (37%), Positives = 46/86 (53%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I  M+DK+ K+     NNPF FKH+    S+     +  P VV+ S   M+ G SR+LF 
Sbjct: 342 INAMNDKIRKAIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 398

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+   N +II      GTLA+ ++
Sbjct: 399 SWCTDKRNGVIIAGYCVEGTLAKHIM 424


>gi|47230093|emb|CAG10507.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 730

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 31/86 (36%), Positives = 46/86 (53%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+DK+ K+     NNPF FKH+    S+     +  P VV+ S   M+ G SR+LF 
Sbjct: 283 VNAMNDKIRKAIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 339

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+   N +II      GTLA+ ++
Sbjct: 340 SWCTDKRNGVIIAGYCVEGTLAKHIM 365


>gi|302808975|ref|XP_002986181.1| hypothetical protein SELMODRAFT_234972 [Selaginella moellendorffii]
 gi|300146040|gb|EFJ12712.1| hypothetical protein SELMODRAFT_234972 [Selaginella moellendorffii]
          Length = 684

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 48/86 (55%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I  M+DK+   +E   +NPF+FKH+    S+ +   V  P +V+ S   ++ G SR LF 
Sbjct: 297 INSMNDKIKSQYE--NSNPFNFKHISPLKSIEQFEDV-GPSIVMASPSGLQSGLSRQLFD 353

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
           +WC   +N+ +I      GTLA+ ++
Sbjct: 354 RWCQDRKNACVIPGYVVEGTLAKTIL 379


>gi|18377654|gb|AAL66977.1| putative cleavage and polyadenylation specificity factor
           [Arabidopsis thaliana]
          Length = 693

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 31/83 (37%), Positives = 47/83 (56%), Gaps = 3/83 (3%)

Query: 37  MSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWC 96
           M+D++   F  A +NPF FKH+   +S+ +   V  P VV+ +   ++ G SR LF  WC
Sbjct: 299 MNDRIRNQF--ANSNPFVFKHISPLNSIDDFNDV-GPSVVMATPGGLQSGLSRQLFDSWC 355

Query: 97  SSPENSIIITNRTSPGTLARDLI 119
           S  +N+ II      GTLA+ +I
Sbjct: 356 SDKKNACIIPGYMVEGTLAKTII 378


>gi|302806483|ref|XP_002984991.1| hypothetical protein SELMODRAFT_234671 [Selaginella moellendorffii]
 gi|302825687|ref|XP_002994439.1| hypothetical protein SELMODRAFT_236963 [Selaginella moellendorffii]
 gi|300137630|gb|EFJ04498.1| hypothetical protein SELMODRAFT_236963 [Selaginella moellendorffii]
 gi|300147201|gb|EFJ13866.1| hypothetical protein SELMODRAFT_234671 [Selaginella moellendorffii]
          Length = 677

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 48/86 (55%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I  M+DK+   +E +  NPF+FKH+    S+ +   V  P +V+ S   ++ G SR LF 
Sbjct: 290 INSMNDKIKSQYENS--NPFNFKHISPLKSIEQFEDV-GPSIVMASPSGLQSGLSRQLFD 346

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
           +WC   +N+ +I      GTLA+ ++
Sbjct: 347 RWCQDRKNACVIPGYVVEGTLAKTIL 372


>gi|55250298|gb|AAH85402.1| Cleavage and polyadenylation specific factor 3 [Danio rerio]
 gi|182889046|gb|AAI64567.1| Cpsf3 protein [Danio rerio]
          Length = 690

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 31/86 (36%), Positives = 46/86 (53%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+DK+ K+     NNPF FKH+    S+     +  P VV+ S   M+ G SR+LF 
Sbjct: 293 VNAMNDKIRKAIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 349

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+   N +II      GTLA+ ++
Sbjct: 350 SWCTDKRNGVIIAGYCVEGTLAKHIM 375


>gi|51467896|ref|NP_001003836.1| cleavage and polyadenylation specificity factor subunit 3 [Danio
           rerio]
 gi|49619053|gb|AAT68111.1| cleavage and polyadenylation specificity factor 3 [Danio rerio]
          Length = 690

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 31/86 (36%), Positives = 46/86 (53%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+DK+ K+     NNPF FKH+    S+     +  P VV+ S   M+ G SR+LF 
Sbjct: 293 VNAMNDKIRKAIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 349

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+   N +II      GTLA+ ++
Sbjct: 350 SWCTDKRNGVIIAGYCVEGTLAKHIM 375


>gi|444731702|gb|ELW72051.1| Cleavage and polyadenylation specificity factor subunit 3 [Tupaia
           chinensis]
          Length = 587

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+DK+ K      NNPF FKH+    S+     +  P VV+ S   M+ G SR+LF 
Sbjct: 189 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 245

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+   N +II      GTLA+ ++
Sbjct: 246 SWCTDKRNGVIIAGYCVEGTLAKHIM 271


>gi|335775092|gb|AEH58456.1| cleavage and polyadenylation specificity facto subunit 3-like
           protein, partial [Equus caballus]
          Length = 499

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+DK+ K      NNPF FKH+    S+     +  P VV+ S   M+ G SR+LF 
Sbjct: 101 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 157

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+   N +II      GTLA+ ++
Sbjct: 158 SWCTDKRNGVIIAGYCVEGTLAKHIM 183


>gi|355565449|gb|EHH21878.1| hypothetical protein EGK_05038 [Macaca mulatta]
          Length = 650

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+DK+ K      NNPF FKH+    S+     +  P VV+ S   M+ G SR+LF 
Sbjct: 283 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 339

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+   N +II      GTLA+ ++
Sbjct: 340 SWCTDKRNGVIIAGYCVEGTLAKHIM 365


>gi|195452860|ref|XP_002073532.1| GK13096 [Drosophila willistoni]
 gi|194169617|gb|EDW84518.1| GK13096 [Drosophila willistoni]
          Length = 684

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 28/86 (32%), Positives = 47/86 (54%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I  M+D++ +    A NNPF F+H+     +     +  P V++ S   M+ G SR+LF 
Sbjct: 292 INAMNDRIRRQI--AVNNPFVFRHISNLKGIDHFEDI-GPCVIMASPGMMQSGLSRELFE 348

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+ P+N +I+      GTLA+ ++
Sbjct: 349 SWCTDPKNGVIVAGYCVEGTLAKTIL 374


>gi|89267474|emb|CAJ83498.1| cleavage and polyadenylation specific factor 3 [Xenopus (Silurana)
           tropicalis]
          Length = 692

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+DK+ K      NNPF FKH+    S+     +  P VV+ S   M+ G SR+LF 
Sbjct: 293 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 349

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+   N +II      GTLA+ ++
Sbjct: 350 SWCTDKRNGVIIAGYCVEGTLAKHIM 375


>gi|403216468|emb|CCK70965.1| hypothetical protein KNAG_0F03030 [Kazachstania naganishii CBS
           8797]
          Length = 820

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 28/83 (33%), Positives = 46/83 (55%), Gaps = 1/83 (1%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+D + K F  ++ NPF FK++    +L E      P V+L S   ++ G SRDL  
Sbjct: 292 VNMMNDDIRKKFRDSQTNPFIFKNISYLKNLEEFQDF-GPSVMLASPGMLQSGLSRDLLE 350

Query: 94  QWCSSPENSIIITNRTSPGTLAR 116
           +WC   +N ++IT  +  GT+A+
Sbjct: 351 KWCPEQKNLVLITGYSVEGTMAK 373


>gi|55741994|ref|NP_001006770.1| cleavage and polyadenylation specificity factor 3 [Xenopus
           (Silurana) tropicalis]
 gi|49522504|gb|AAH75564.1| cleavage and polyadenylation specific factor 3, 73kDa [Xenopus
           (Silurana) tropicalis]
          Length = 692

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+DK+ K      NNPF FKH+    S+     +  P VV+ S   M+ G SR+LF 
Sbjct: 293 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 349

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+   N +II      GTLA+ ++
Sbjct: 350 SWCTDKRNGVIIAGYCVEGTLAKHIM 375


>gi|410074967|ref|XP_003955066.1| hypothetical protein KAFR_0A04950 [Kazachstania africana CBS 2517]
 gi|372461648|emb|CCF55931.1| hypothetical protein KAFR_0A04950 [Kazachstania africana CBS 2517]
          Length = 769

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 28/83 (33%), Positives = 46/83 (55%), Gaps = 1/83 (1%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+D + K F  ++ NPF FK++    +L E      P V+L S   ++ G SRDL  
Sbjct: 291 VNMMNDDIRKKFRDSQTNPFIFKNISYLKNLEEFQDF-GPSVMLASPGMLQSGISRDLLE 349

Query: 94  QWCSSPENSIIITNRTSPGTLAR 116
           +WC   +N ++IT  +  GT+A+
Sbjct: 350 RWCPDDKNLVLITGYSVEGTMAK 372


>gi|410955844|ref|XP_003984560.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           3 [Felis catus]
          Length = 686

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+DK+ K      NNPF FKH+    S+     +  P VV+ S   M+ G SR+LF 
Sbjct: 286 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 342

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+   N +II      GTLA+ ++
Sbjct: 343 SWCTDKRNGVIIAGYCVEGTLAKHIM 368


>gi|149641381|ref|XP_001505542.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           3-like, partial [Ornithorhynchus anatinus]
          Length = 595

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+DK+ K      NNPF FKH+    S+     +  P VV+ S   M+ G SR+LF 
Sbjct: 197 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 253

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+   N +II      GTLA+ ++
Sbjct: 254 SWCTDKRNGVIIAGYCVEGTLAKHIM 279


>gi|2394306|gb|AAB70268.1| 73 kDA subunit of cleavage and polyadenylation specificity factor
           [Homo sapiens]
          Length = 379

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+DK+ K      NNPF FKH+    S+     +  P VV+ S   M+ G SR+LF 
Sbjct: 167 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 223

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+   N +II      GTLA+ ++
Sbjct: 224 SWCTDKRNGVIIAGYCVEGTLAKHIM 249


>gi|291412514|ref|XP_002722528.1| PREDICTED: cleavage and polyadenylation specific factor 3, 73kDa
           [Oryctolagus cuniculus]
          Length = 684

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+DK+ K      NNPF FKH+    S+     +  P VV+ S   M+ G SR+LF 
Sbjct: 286 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 342

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+   N +II      GTLA+ ++
Sbjct: 343 SWCTDKRNGVIIAGYCVEGTLAKHIM 368


>gi|431911821|gb|ELK13965.1| Cleavage and polyadenylation specificity factor subunit 3, partial
           [Pteropus alecto]
          Length = 667

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+DK+ K      NNPF FKH+    S+     +  P VV+ S   M+ G SR+LF 
Sbjct: 269 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 325

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+   N +II      GTLA+ ++
Sbjct: 326 SWCTDKRNGVIIAGYCVEGTLAKHIM 351


>gi|27805863|ref|NP_776709.1| cleavage and polyadenylation specificity factor subunit 3 [Bos
           taurus]
 gi|426223116|ref|XP_004005724.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           3 [Ovis aries]
 gi|18202362|sp|P79101.1|CPSF3_BOVIN RecName: Full=Cleavage and polyadenylation specificity factor
           subunit 3; AltName: Full=Cleavage and polyadenylation
           specificity factor 73 kDa subunit; Short=CPSF 73 kDa
           subunit; AltName: Full=mRNA 3'-end-processing
           endonuclease CPSF-73
 gi|1707412|emb|CAA65151.1| Cleavage and Polyadenylation Specifity Factor protein [Bos taurus]
 gi|75773721|gb|AAI04554.1| Cleavage and polyadenylation specific factor 3, 73kDa [Bos taurus]
 gi|296482248|tpg|DAA24363.1| TPA: cleavage and polyadenylation specificity factor subunit 3 [Bos
           taurus]
 gi|440897562|gb|ELR49218.1| Cleavage and polyadenylation specificity factor subunit 3 [Bos
           grunniens mutus]
          Length = 684

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+DK+ K      NNPF FKH+    S+     +  P VV+ S   M+ G SR+LF 
Sbjct: 286 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 342

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+   N +II      GTLA+ ++
Sbjct: 343 SWCTDKRNGVIIAGYCVEGTLAKHIM 368


>gi|147905468|ref|NP_001088278.1| cleavage and polyadenylation specific factor 3, 73kDa [Xenopus
           laevis]
 gi|54038587|gb|AAH84286.1| LOC495111 protein [Xenopus laevis]
          Length = 692

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+DK+ K      NNPF FKH+    S+     +  P VV+ S   M+ G SR+LF 
Sbjct: 293 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 349

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+   N +II      GTLA+ ++
Sbjct: 350 SWCTDKRNGVIIAGYCVEGTLAKHIM 375


>gi|432100623|gb|ELK29151.1| Cleavage and polyadenylation specificity factor subunit 3 [Myotis
           davidii]
          Length = 684

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+DK+ K      NNPF FKH+    S+     +  P VV+ S   M+ G SR+LF 
Sbjct: 286 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 342

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+   N +II      GTLA+ ++
Sbjct: 343 SWCTDKRNGVIIAGYCVEGTLAKHIM 368


>gi|403270697|ref|XP_003927303.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           3 [Saimiri boliviensis boliviensis]
          Length = 658

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+DK+ K      NNPF FKH+    S+     +  P VV+ S   M+ G SR+LF 
Sbjct: 290 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 346

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+   N +II      GTLA+ ++
Sbjct: 347 SWCTDKRNGVIIAGYCVEGTLAKHIM 372


>gi|350539083|ref|NP_001233296.1| cleavage and polyadenylation specificity factor subunit 3 [Pan
           troglodytes]
 gi|397513374|ref|XP_003826991.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           3 [Pan paniscus]
 gi|426334660|ref|XP_004028859.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           3 [Gorilla gorilla gorilla]
 gi|343961085|dbj|BAK62132.1| cleavage and polyadenylation specificity factor 73 kDa subunit [Pan
           troglodytes]
 gi|343961781|dbj|BAK62478.1| cleavage and polyadenylation specificity factor 73 kDa subunit [Pan
           troglodytes]
 gi|410254182|gb|JAA15058.1| cleavage and polyadenylation specific factor 3, 73kDa [Pan
           troglodytes]
 gi|410291448|gb|JAA24324.1| cleavage and polyadenylation specific factor 3, 73kDa [Pan
           troglodytes]
 gi|410339611|gb|JAA38752.1| cleavage and polyadenylation specific factor 3, 73kDa [Pan
           troglodytes]
          Length = 684

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+DK+ K      NNPF FKH+    S+     +  P VV+ S   M+ G SR+LF 
Sbjct: 286 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 342

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+   N +II      GTLA+ ++
Sbjct: 343 SWCTDKRNGVIIAGYCVEGTLAKHIM 368


>gi|194220982|ref|XP_001502516.2| PREDICTED: cleavage and polyadenylation specificity factor subunit
           3-like [Equus caballus]
 gi|301775721|ref|XP_002923277.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           3-like [Ailuropoda melanoleuca]
          Length = 684

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+DK+ K      NNPF FKH+    S+     +  P VV+ S   M+ G SR+LF 
Sbjct: 286 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 342

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+   N +II      GTLA+ ++
Sbjct: 343 SWCTDKRNGVIIAGYCVEGTLAKHIM 368


>gi|402890043|ref|XP_003908303.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           3 [Papio anubis]
          Length = 684

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+DK+ K      NNPF FKH+    S+     +  P VV+ S   M+ G SR+LF 
Sbjct: 286 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 342

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+   N +II      GTLA+ ++
Sbjct: 343 SWCTDKRNGVIIAGYCVEGTLAKHIM 368


>gi|351704796|gb|EHB07715.1| Cleavage and polyadenylation specificity factor subunit 3
           [Heterocephalus glaber]
          Length = 692

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+DK+ K      NNPF FKH+    S+     +  P VV+ S   M+ G SR+LF 
Sbjct: 286 VHAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 342

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+   N +II      GTLA+ ++
Sbjct: 343 SWCTDKRNGVIIAGYCVEGTLAKHIM 368


>gi|296224527|ref|XP_002758090.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           3 [Callithrix jacchus]
          Length = 684

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+DK+ K      NNPF FKH+    S+     +  P VV+ S   M+ G SR+LF 
Sbjct: 286 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 342

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+   N +II      GTLA+ ++
Sbjct: 343 SWCTDKRNGVIIAGYCVEGTLAKHIM 368


>gi|119621395|gb|EAX00990.1| cleavage and polyadenylation specific factor 3, 73kDa, isoform
           CRA_b [Homo sapiens]
          Length = 647

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+DK+ K      NNPF FKH+    S+     +  P VV+ S   M+ G SR+LF 
Sbjct: 249 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 305

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+   N +II      GTLA+ ++
Sbjct: 306 SWCTDKRNGVIIAGYCVEGTLAKHIM 331


>gi|332247248|ref|XP_003272765.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           3 [Nomascus leucogenys]
 gi|67969340|dbj|BAE01022.1| unnamed protein product [Macaca fascicularis]
 gi|355751093|gb|EHH55348.1| hypothetical protein EGM_04543 [Macaca fascicularis]
 gi|380813676|gb|AFE78712.1| cleavage and polyadenylation specificity factor subunit 3 [Macaca
           mulatta]
 gi|383419123|gb|AFH32775.1| cleavage and polyadenylation specificity factor subunit 3 [Macaca
           mulatta]
 gi|384940728|gb|AFI33969.1| cleavage and polyadenylation specificity factor subunit 3 [Macaca
           mulatta]
          Length = 684

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+DK+ K      NNPF FKH+    S+     +  P VV+ S   M+ G SR+LF 
Sbjct: 286 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 342

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+   N +II      GTLA+ ++
Sbjct: 343 SWCTDKRNGVIIAGYCVEGTLAKHIM 368


>gi|62898706|dbj|BAD97207.1| cleavage and polyadenylation specific factor 3, 73kDa variant [Homo
           sapiens]
          Length = 684

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+DK+ K      NNPF FKH+    S+     +  P VV+ S   M+ G SR+LF 
Sbjct: 286 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 342

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+   N +II      GTLA+ ++
Sbjct: 343 SWCTDKRNGVIIAGYCVEGTLAKHIM 368


>gi|15079675|gb|AAH11654.1| Cleavage and polyadenylation specific factor 3, 73kDa [Homo
           sapiens]
 gi|157929136|gb|ABW03853.1| cleavage and polyadenylation specific factor 3, 73kDa [synthetic
           construct]
          Length = 684

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+DK+ K      NNPF FKH+    S+     +  P VV+ S   M+ G SR+LF 
Sbjct: 286 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 342

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+   N +II      GTLA+ ++
Sbjct: 343 SWCTDKRNGVIIAGYCVEGTLAKHIM 368


>gi|7706427|ref|NP_057291.1| cleavage and polyadenylation specificity factor subunit 3 [Homo
           sapiens]
 gi|18203503|sp|Q9UKF6.1|CPSF3_HUMAN RecName: Full=Cleavage and polyadenylation specificity factor
           subunit 3; AltName: Full=Cleavage and polyadenylation
           specificity factor 73 kDa subunit; Short=CPSF 73 kDa
           subunit; AltName: Full=mRNA 3'-end-processing
           endonuclease CPSF-73
 gi|6002955|gb|AAF00224.1|AF171877_1 cleavage and polyadenylation specificity factor 73 kDa subunit
           [Homo sapiens]
 gi|18044212|gb|AAH20211.1| Cleavage and polyadenylation specific factor 3, 73kDa [Homo
           sapiens]
 gi|62822309|gb|AAY14858.1| unknown [Homo sapiens]
 gi|119621394|gb|EAX00989.1| cleavage and polyadenylation specific factor 3, 73kDa, isoform
           CRA_a [Homo sapiens]
          Length = 684

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+DK+ K      NNPF FKH+    S+     +  P VV+ S   M+ G SR+LF 
Sbjct: 286 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 342

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+   N +II      GTLA+ ++
Sbjct: 343 SWCTDKRNGVIIAGYCVEGTLAKHIM 368


>gi|363732494|ref|XP_419942.3| PREDICTED: cleavage and polyadenylation specificity factor subunit
           3 [Gallus gallus]
          Length = 672

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+DK+ K      NNPF FKH+    S+     +  P VV+ S   M+ G SR+LF 
Sbjct: 273 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 329

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+   N +II      GTLA+ ++
Sbjct: 330 SWCTDKRNGVIIAGYCVEGTLAKHIM 355


>gi|348558392|ref|XP_003465002.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           3-like [Cavia porcellus]
          Length = 684

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+DK+ K      NNPF FKH+    S+     +  P VV+ S   M+ G SR+LF 
Sbjct: 286 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 342

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+   N +II      GTLA+ ++
Sbjct: 343 SWCTDKRNGVIIAGYCVEGTLAKHIM 368


>gi|335285899|ref|XP_003354974.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           3-like [Sus scrofa]
          Length = 684

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+DK+ K      NNPF FKH+    S+     +  P VV+ S   M+ G SR+LF 
Sbjct: 286 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 342

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+   N +II      GTLA+ ++
Sbjct: 343 SWCTDKRNGVIIAGYCVEGTLAKHIM 368


>gi|294883712|ref|XP_002771037.1| cleavage and polyadenylation specificity factor, putative
           [Perkinsus marinus ATCC 50983]
 gi|239874243|gb|EER02853.1| cleavage and polyadenylation specificity factor, putative
           [Perkinsus marinus ATCC 50983]
          Length = 1050

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 30/81 (37%), Positives = 49/81 (60%), Gaps = 6/81 (7%)

Query: 33  KIEWMSDKLMKSFE---GARNNPFHFKHVKLCHSLAELAK-VPS--PKVVLVSTPDMECG 86
           ++EWMS  +   F+   G   +PF F+HV+LC S  + A+  P+  PKVVL S+  +E G
Sbjct: 339 RMEWMSRVVHNDFDTSMGFMYHPFLFQHVQLCSSFQDFAQNYPARKPKVVLASSASLEIG 398

Query: 87  FSRDLFFQWCSSPENSIIITN 107
            +R++F + C  P +++I T 
Sbjct: 399 DAREIFCRMCGDPNSTVIFTT 419


>gi|197102904|ref|NP_001127045.1| cleavage and polyadenylation specificity factor subunit 3 [Pongo
           abelii]
 gi|55733623|emb|CAH93488.1| hypothetical protein [Pongo abelii]
          Length = 647

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+DK+ K      NNPF FKH+    S+     +  P VV+ S   M+ G SR+LF 
Sbjct: 249 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 305

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+   N +II      GTLA+ ++
Sbjct: 306 SWCTDKRNGVIIAGYCVEGTLAKHIM 331


>gi|294945374|ref|XP_002784648.1| cleavage and polyadenylation specificity factor, putative
           [Perkinsus marinus ATCC 50983]
 gi|239897833|gb|EER16444.1| cleavage and polyadenylation specificity factor, putative
           [Perkinsus marinus ATCC 50983]
          Length = 1115

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 30/81 (37%), Positives = 49/81 (60%), Gaps = 6/81 (7%)

Query: 33  KIEWMSDKLMKSFE---GARNNPFHFKHVKLCHSLAELAK-VPS--PKVVLVSTPDMECG 86
           ++EWMS  +   F+   G   +PF F+HV+LC S  + A+  P+  PKVVL S+  +E G
Sbjct: 384 RMEWMSRVVHNDFDTSMGFMYHPFLFQHVQLCSSFQDFAQNYPARKPKVVLASSASLEIG 443

Query: 87  FSRDLFFQWCSSPENSIIITN 107
            +R++F + C  P +++I T 
Sbjct: 444 DAREIFCRMCGDPNSTVIFTT 464


>gi|67969643|dbj|BAE01170.1| unnamed protein product [Macaca fascicularis]
          Length = 684

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+DK+ K      NNPF FKH+    S+     +  P VV+ S   M+ G SR+LF 
Sbjct: 286 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 342

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+   N +II      GTLA+ ++
Sbjct: 343 SWCTDKRNGVIIAGYCVEGTLAKHIM 368


>gi|224140917|ref|XP_002323823.1| predicted protein [Populus trichocarpa]
 gi|222866825|gb|EEF03956.1| predicted protein [Populus trichocarpa]
          Length = 250

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 29/83 (34%), Positives = 48/83 (57%), Gaps = 3/83 (3%)

Query: 37  MSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWC 96
           M++++   F  A +NPF FKH+   +S+ + + V  P VV+ S   ++ G SR LF  WC
Sbjct: 170 MNERIRNQF--ANSNPFKFKHISPLNSIEDFSDV-GPSVVMASPGGLQSGLSRQLFDMWC 226

Query: 97  SSPENSIIITNRTSPGTLARDLI 119
           S  +N+ ++      GTLA+ +I
Sbjct: 227 SDKKNACVLPGYVVEGTLAKTII 249


>gi|210075949|ref|XP_504965.2| YALI0F03817p [Yarrowia lipolytica]
 gi|223634672|sp|Q6C2Z7.2|YSH1_YARLI RecName: Full=Endoribonuclease YSH1; AltName: Full=mRNA
           3'-end-processing protein YSH1
 gi|199424917|emb|CAG77772.2| YALI0F03817p [Yarrowia lipolytica CLIB122]
          Length = 827

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 32/109 (29%), Positives = 59/109 (54%), Gaps = 5/109 (4%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I  M+D + + F   + NPF FK++K   +L     +  P V++ S   ++ G SR L  
Sbjct: 312 INMMNDNIRRRFRDQKTNPFRFKYIKNIKNLDRFDDM-GPCVMVASPGMLQSGVSRSLLE 370

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI----ELGGNRTLTLQVKKRIRLE 138
           +W   P+N++I+T  +  GT+A+ +I    E+   +   L+V +R+ +E
Sbjct: 371 RWAPDPKNTLILTGYSVEGTMAKQIINEPNEIPSAQNPDLKVPRRLAVE 419


>gi|281351872|gb|EFB27456.1| hypothetical protein PANDA_012399 [Ailuropoda melanoleuca]
          Length = 648

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+DK+ K      NNPF FKH+    S+     +  P VV+ S   M+ G SR+LF 
Sbjct: 250 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 306

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+   N +II      GTLA+ ++
Sbjct: 307 SWCTDKRNGVIIAGYCVEGTLAKHIM 332


>gi|417412420|gb|JAA52597.1| Putative cleavage and polyadenylation specificity factor cpsf
           subunit, partial [Desmodus rotundus]
          Length = 714

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+DK+ K      NNPF FKH+    S+     +  P VV+ S   M+ G SR+LF 
Sbjct: 316 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 372

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+   N +II      GTLA+ ++
Sbjct: 373 SWCTDKRNGVIIAGYCVEGTLAKHIM 398


>gi|126303222|ref|XP_001371997.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           3-like [Monodelphis domestica]
          Length = 684

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+DK+ K      NNPF FKH+    S+     +  P VV+ S   M+ G SR+LF 
Sbjct: 286 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 342

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+   N +II      GTLA+ ++
Sbjct: 343 SWCTDKRNGVIIAGYCVEGTLAKHIM 368


>gi|297739612|emb|CBI29794.3| unnamed protein product [Vitis vinifera]
          Length = 581

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 35/109 (32%), Positives = 55/109 (50%), Gaps = 6/109 (5%)

Query: 13  RNNPFHFKH--VKLCHSLAELAKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKV 70
            N P ++     K C ++ +   I  M++++   F  A +NPF FKH+    S+     V
Sbjct: 275 HNVPIYYASPLAKRCMAVYQ-TYINSMNERIRNQF--ANSNPFDFKHISPLKSIENFNDV 331

Query: 71  PSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLI 119
             P VV+ S   ++ G SR LF  WCS  +N+ +I      GTLA+ +I
Sbjct: 332 -GPSVVMASPGGLQSGLSRQLFDMWCSDKKNACVIPGYVVGGTLAKTII 379


>gi|443725897|gb|ELU13297.1| hypothetical protein CAPTEDRAFT_184406 [Capitella teleta]
          Length = 668

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 36/110 (32%), Positives = 54/110 (49%), Gaps = 11/110 (10%)

Query: 23  KLCHSLAELAKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPD 82
           K C S+ +   I  M+DK+ +      NNPF FKH+    S+     +  P VV+ S   
Sbjct: 264 KKCMSVYQ-TYINAMNDKIKRQIN--TNNPFVFKHISNLKSMEHFDDI-GPSVVMASPGM 319

Query: 83  MECGFSRDLFFQWCSSPENSIIITNRTSPGTLAR-------DLIELGGNR 125
           M+ G SR+LF  WC+   N  II      GTLA+       +++ +GG +
Sbjct: 320 MQSGLSRELFENWCTDKRNGCIIAGYCVEGTLAKHILSEPEEIVTMGGQK 369


>gi|395507218|ref|XP_003757924.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           3 [Sarcophilus harrisii]
          Length = 684

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+DK+ K      NNPF FKH+    S+     +  P VV+ S   M+ G SR+LF 
Sbjct: 286 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 342

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+   N +II      GTLA+ ++
Sbjct: 343 SWCTDKRNGVIIAGYCVEGTLAKHIM 368


>gi|147787280|emb|CAN71414.1| hypothetical protein VITISV_029216 [Vitis vinifera]
          Length = 687

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 31/86 (36%), Positives = 46/86 (53%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I  M++++   F  A +NPF FKH+    S+     V  P VV+ S   ++ G SR LF 
Sbjct: 291 INSMNERIRNQF--ANSNPFDFKHISPLKSIENFNDV-GPSVVMASPSGLQSGLSRQLFD 347

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WCS  +N+ +I      GTLA+ +I
Sbjct: 348 MWCSDKKNACVIPGYVVEGTLAKTII 373


>gi|255718827|ref|XP_002555694.1| KLTH0G15202p [Lachancea thermotolerans]
 gi|238937078|emb|CAR25257.1| KLTH0G15202p [Lachancea thermotolerans CBS 6340]
          Length = 755

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 28/83 (33%), Positives = 47/83 (56%), Gaps = 1/83 (1%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+D + K F  +++NPF FK++    +L E      P V+L S   ++ G SRDL  
Sbjct: 296 VNMMNDDIRKKFRDSQSNPFIFKNISYLKNLDEFQDF-GPSVMLASPGMLQNGLSRDLLE 354

Query: 94  QWCSSPENSIIITNRTSPGTLAR 116
           +WC   +N ++IT  +  GT+A+
Sbjct: 355 KWCPGEKNLVLITGYSVEGTMAK 377


>gi|359486185|ref|XP_003633408.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           3-I-like [Vitis vinifera]
          Length = 694

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 31/86 (36%), Positives = 46/86 (53%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I  M++++   F  A +NPF FKH+    S+     V  P VV+ S   ++ G SR LF 
Sbjct: 298 INSMNERIRNQF--ANSNPFDFKHISPLKSIENFNDV-GPSVVMASPSGLQSGLSRQLFD 354

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WCS  +N+ +I      GTLA+ +I
Sbjct: 355 MWCSDKKNACVIPGYVVEGTLAKTII 380


>gi|326916480|ref|XP_003204535.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           3-like [Meleagris gallopavo]
          Length = 759

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+DK+ K      NNPF FKH+    S+     +  P VV+ S   M+ G SR+LF 
Sbjct: 360 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 416

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+   N +II      GTLA+ ++
Sbjct: 417 SWCTDKRNGVIIAGYCVEGTLAKHIM 442


>gi|146421308|ref|XP_001486604.1| hypothetical protein PGUG_02275 [Meyerozyma guilliermondii ATCC
           6260]
          Length = 770

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 27/78 (34%), Positives = 47/78 (60%), Gaps = 1/78 (1%)

Query: 42  MKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPEN 101
           + S  G + NPF FKH+KL  SL +   +  P VV+ +   ++ G SR+L  +W    +N
Sbjct: 307 VSSASGGKLNPFQFKHIKLIRSLDKFQDI-GPCVVVAAPGMLQNGVSRELLERWAPDAKN 365

Query: 102 SIIITNRTSPGTLARDLI 119
           ++I+T  +  GT+A++L+
Sbjct: 366 AVIMTGYSVEGTMAKELL 383


>gi|359321645|ref|XP_003639652.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           3-like [Canis lupus familiaris]
          Length = 717

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+DK+ K      NNPF FKH+    S+     +  P VV+ S   M+ G SR+LF 
Sbjct: 319 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 375

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+   N +II      GTLA+ ++
Sbjct: 376 SWCTDKRNGVIIAGYCVEGTLAKHIM 401


>gi|327261273|ref|XP_003215455.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           3-like [Anolis carolinensis]
          Length = 651

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+DK+ K      NNPF FKH+    S+     +  P VV+ S   M+ G SR+LF 
Sbjct: 252 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 308

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+   N +II      GTLA+ ++
Sbjct: 309 SWCTDKRNGVIIAGYCVEGTLAKHIM 334


>gi|355680849|gb|AER96661.1| cleavage and polyadenylation specific factor 3, 73kDa [Mustela
           putorius furo]
          Length = 600

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+DK+ K      NNPF FKH+    S+     +  P VV+ S   M+ G SR+LF 
Sbjct: 235 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 291

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+   N +II      GTLA+ ++
Sbjct: 292 SWCTDKRNGVIIAGYCVEGTLAKHIM 317


>gi|297739590|emb|CBI29772.3| unnamed protein product [Vitis vinifera]
          Length = 680

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 35/109 (32%), Positives = 55/109 (50%), Gaps = 6/109 (5%)

Query: 13  RNNPFHFKH--VKLCHSLAELAKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKV 70
            N P ++     K C ++ +   I  M++++   F  A +NPF FKH+    S+     V
Sbjct: 276 HNIPIYYASPLAKRCMAVYQ-TYINSMNERIRNQF--ANSNPFDFKHISPLKSIENFNDV 332

Query: 71  PSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLI 119
             P VV+ S   ++ G SR LF  WCS  +N+ +I      GTLA+ +I
Sbjct: 333 -GPSVVMASPSGLQSGLSRQLFDMWCSDKKNACVIPGYVVEGTLAKTII 380


>gi|300676780|gb|ADK26656.1| cleavage and polyadenylation specific factor 3, 73kDa [Zonotrichia
           albicollis]
          Length = 721

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+DK+ K      NNPF FKH+    S+     +  P VV+ S   M+ G SR+LF 
Sbjct: 323 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 379

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+   N +II      GTLA+ ++
Sbjct: 380 SWCTDKRNGVIIAGYCVEGTLAKHIM 405


>gi|302832928|ref|XP_002948028.1| hypothetical protein VOLCADRAFT_79885 [Volvox carteri f.
           nagariensis]
 gi|300266830|gb|EFJ51016.1| hypothetical protein VOLCADRAFT_79885 [Volvox carteri f.
           nagariensis]
          Length = 728

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 33/105 (31%), Positives = 57/105 (54%), Gaps = 5/105 (4%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAE-LAKVPSPKVVLVSTPDMECGFSRDLF 92
           +E +++ + K F     NPF F+HV+   + A  +A    P V++ +   ++ G SRD F
Sbjct: 299 VESLNEDIKKVFH--DRNPFKFRHVQTLKNPAHFIADYSGPCVIMATPSGLQSGASRDFF 356

Query: 93  FQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRL 137
             WC    N+ II +    GTLA+++  LGG  ++T +  +R+ L
Sbjct: 357 EAWCEDARNTCIICDFAVQGTLAKEI--LGGPSSITTREGRRVPL 399


>gi|444315239|ref|XP_004178277.1| hypothetical protein TBLA_0A09750 [Tetrapisispora blattae CBS 6284]
 gi|387511316|emb|CCH58758.1| hypothetical protein TBLA_0A09750 [Tetrapisispora blattae CBS 6284]
          Length = 781

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 33/111 (29%), Positives = 56/111 (50%), Gaps = 4/111 (3%)

Query: 11  GARNNPFHFKH--VKLCHSLAELAKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELA 68
           G    P ++     K C S+ +   +  M+D + K F  ++ NPF FK++    +L    
Sbjct: 277 GGGQVPIYYASSLAKKCMSVFQ-TYVNMMNDDIRKKFRDSQTNPFIFKNISYLRNLDNFE 335

Query: 69  KVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLI 119
               P V+L S   ++ G SRDL  +WC   +N ++IT  +  GT+A+ L+
Sbjct: 336 DF-GPSVLLASPGMLQSGISRDLLERWCPEDKNMVLITGYSVEGTMAKYLM 385


>gi|213407230|ref|XP_002174386.1| cleavage factor two Cft2/polyadenylation factor CPSF-73
           [Schizosaccharomyces japonicus yFS275]
 gi|212002433|gb|EEB08093.1| cleavage factor two Cft2/polyadenylation factor CPSF-73
           [Schizosaccharomyces japonicus yFS275]
          Length = 786

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 43/168 (25%), Positives = 83/168 (49%), Gaps = 26/168 (15%)

Query: 16  PFHFKHVKLCHSLAEL-----AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKV 70
           P  F    L +S A+      + IEWM D +++ F G   +   F+H++     ++L+++
Sbjct: 252 PLSFPIYFLSYSSAKTIGYAKSMIEWMGDNIVRDF-GMNESLLEFRHIQTITHPSQLSQI 310

Query: 71  -PSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIII---TNRTSPGTLAR---------- 116
            P PKV++ ++  +E GFS+++         N++I+    +R S  +LA+          
Sbjct: 311 SPGPKVIIATSLTLESGFSQNVLLDIMPDNSNNLILLTQKSRYSENSLAKQFYRYWERAS 370

Query: 117 -----DLIELGGNRTLTLQVKKRIRLEGEELEEYQ-KKKDKEAKDKQE 158
                +   +G     ++QVK    L+GEEL E+Q K++ K  +D ++
Sbjct: 371 RKSPENFSSVGMYFEQSIQVKHSEPLQGEELREFQEKEQSKRTRDAED 418


>gi|449498153|ref|XP_002196255.2| PREDICTED: LOW QUALITY PROTEIN: cleavage and polyadenylation
           specificity factor subunit 3 [Taeniopygia guttata]
          Length = 746

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+DK+ K      NNPF FKH+    S+     +  P VV+ S   M+ G SR+LF 
Sbjct: 348 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 404

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+   N +II      GTLA+ ++
Sbjct: 405 SWCTDKRNGVIIAGYCVEGTLAKHIM 430


>gi|126030713|pdb|2I7T|A Chain A, Structure Of Human Cpsf-73
 gi|126030714|pdb|2I7V|A Chain A, Structure Of Human Cpsf-73
          Length = 459

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 31/83 (37%), Positives = 44/83 (53%), Gaps = 3/83 (3%)

Query: 37  MSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWC 96
           M+DK+ K      NNPF FKH+    S+     +  P VV+ S   M+ G SR+LF  WC
Sbjct: 289 MNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFESWC 345

Query: 97  SSPENSIIITNRTSPGTLARDLI 119
           +   N +II      GTLA+ ++
Sbjct: 346 TDKRNGVIIAGYCVEGTLAKHIM 368


>gi|367005895|ref|XP_003687679.1| hypothetical protein TPHA_0K01110 [Tetrapisispora phaffii CBS 4417]
 gi|357525984|emb|CCE65245.1| hypothetical protein TPHA_0K01110 [Tetrapisispora phaffii CBS 4417]
          Length = 790

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 26/86 (30%), Positives = 48/86 (55%), Gaps = 1/86 (1%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+D + K F  ++ NPF FK++    ++        P V+L +   ++ G SRDL  
Sbjct: 291 VNMMNDSIRKKFRDSKTNPFIFKNISYLKNIDSFQDF-GPSVMLAAPGMLQNGLSRDLLE 349

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
           +WC  P+N ++IT  +  G++A+ L+
Sbjct: 350 KWCPEPKNMVLITGYSVEGSMAKYLM 375


>gi|320583131|gb|EFW97347.1| Putative endoribonuclease [Ogataea parapolymorpha DL-1]
          Length = 702

 Score = 58.2 bits (139), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 36/117 (30%), Positives = 65/117 (55%), Gaps = 14/117 (11%)

Query: 13  RNNPFHFKHVKLCHSLAELAK---------IEWMSDKLMKSFEGARNNPFHFKHVKLCHS 63
           +NNP   ++V + ++ ++LAK         +  M+D + K F     NPFHFK++K   +
Sbjct: 257 QNNP-ELQNVPIYYA-SDLAKKCMAVYQRYVNMMNDSIRKKFTETNQNPFHFKYIKNITN 314

Query: 64  LAELAKVPSPKVVLVSTPDM-ECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLI 119
           + ++  + S   VL+++P M + G SR +  +W   P NS I+T  +  GT+A+ L+
Sbjct: 315 IEKINDLDSS--VLIASPGMLQNGISRKILEKWSPDPRNSCILTGYSVEGTMAKILL 369


>gi|159488791|ref|XP_001702386.1| subunit of mRNA cleavage and polyadenylation specificity factor
           [Chlamydomonas reinhardtii]
 gi|158271180|gb|EDO97006.1| subunit of mRNA cleavage and polyadenylation specificity factor
           [Chlamydomonas reinhardtii]
          Length = 690

 Score = 58.2 bits (139), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 34/105 (32%), Positives = 57/105 (54%), Gaps = 5/105 (4%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPD-MECGFSRDLF 92
           +E ++D + + F     NPF F+HV+   + A      S   V+++TP  ++ G SRD F
Sbjct: 274 VESLNDDIKRVFH--ERNPFKFRHVQTLKNPAHFISDYSGPCVIMATPSGLQSGASRDFF 331

Query: 93  FQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRL 137
             WC    N+ II +    GTLA+++  LGG  ++T +  +R+ L
Sbjct: 332 EAWCEDSRNTCIICDFAVQGTLAKEI--LGGPSSITTREGRRVPL 374


>gi|367031802|ref|XP_003665184.1| hypothetical protein MYCTH_2308652 [Myceliophthora thermophila ATCC
           42464]
 gi|347012455|gb|AEO59939.1| hypothetical protein MYCTH_2308652 [Myceliophthora thermophila ATCC
           42464]
          Length = 1035

 Score = 58.2 bits (139), Expect = 4e-06,   Method: Composition-based stats.
 Identities = 48/172 (27%), Positives = 70/172 (40%), Gaps = 58/172 (33%)

Query: 34  IEWMSDKLMKSFE----GARNN----------------PFHFKHVKLCHSLAELAKV--- 70
           +EWM D +++ FE    G R                  PF FKH++L    A++ +V   
Sbjct: 331 LEWMDDSIVREFEAVAGGTRTGNSGGGAGSGAKGKEAGPFDFKHLRLLERKAQVERVLQQ 390

Query: 71  --------PSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTS--PG--TLARDL 118
                   P  +V+L +   +E GFS+D+       P N +I+T + S  PG  ++AR L
Sbjct: 391 ATATDDAEPRGRVILATDSSLEWGFSKDVMRAIAEDPRNLVILTEKPSLNPGKPSIARML 450

Query: 119 IE-----------------------LGGNRTLTLQVKKRIRLEGEELEEYQK 147
            E                        GG R L L    R  LEG EL+ YQ+
Sbjct: 451 WEWWRERKDGVAVEQTSGGDTFEQVYGGGRELELTDATRQALEGTELDVYQQ 502


>gi|323307973|gb|EGA61229.1| Ysh1p [Saccharomyces cerevisiae FostersO]
          Length = 727

 Score = 58.2 bits (139), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 27/83 (32%), Positives = 46/83 (55%), Gaps = 1/83 (1%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+D + K F  ++ NPF FK++    +L +      P V+L S   ++ G SRDL  
Sbjct: 260 VNMMNDDIXKKFRDSQTNPFIFKNISYLRNLEDFQDF-GPSVMLASPGMLQSGLSRDLLE 318

Query: 94  QWCSSPENSIIITNRTSPGTLAR 116
           +WC   +N ++IT  +  GT+A+
Sbjct: 319 RWCPEDKNLVLITGYSIEGTMAK 341


>gi|344302811|gb|EGW33085.1| hypothetical protein SPAPADRAFT_66091 [Spathaspora passalidarum
           NRRL Y-27907]
          Length = 762

 Score = 58.2 bits (139), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 50/86 (58%), Gaps = 3/86 (3%)

Query: 37  MSDK--LMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQ 94
           M+DK  L  S  G ++NPF FK++K    L +   +  P VV+ +   ++ G SR L  +
Sbjct: 298 MNDKIRLSASSSGHKSNPFDFKYIKSIRDLGKFQDM-GPSVVIAAPGMLQAGISRQLLEK 356

Query: 95  WCSSPENSIIITNRTSPGTLARDLIE 120
           W   P+N +I+T  +  GT+A++L++
Sbjct: 357 WAPDPKNLVILTGYSVEGTMAKELLK 382


>gi|449283675|gb|EMC90280.1| Cleavage and polyadenylation specificity factor subunit 3, partial
           [Columba livia]
          Length = 667

 Score = 58.2 bits (139), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 33/97 (34%), Positives = 50/97 (51%), Gaps = 4/97 (4%)

Query: 23  KLCHSLAELAKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPD 82
           K C S+ +   +  M+DK+ K      NNPF FKH+    S+     +  P +V+ S   
Sbjct: 259 KKCMSVYQ-TYVNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSIVMASPGM 314

Query: 83  MECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLI 119
           M+ G SR+LF  WC+   N +II      GTLA+ ++
Sbjct: 315 MQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHVM 351


>gi|195995883|ref|XP_002107810.1| hypothetical protein TRIADDRAFT_19764 [Trichoplax adhaerens]
 gi|190588586|gb|EDV28608.1| hypothetical protein TRIADDRAFT_19764 [Trichoplax adhaerens]
          Length = 636

 Score = 58.2 bits (139), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 36/132 (27%), Positives = 63/132 (47%), Gaps = 4/132 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+DK+      A +NPF FKH+     +     +  P VV+ S   M+ G SR+LF 
Sbjct: 297 VSAMNDKIRNQI--AISNPFIFKHISNLKGIDHFDDI-GPCVVMASPGMMQSGLSRELFE 353

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKDKEA 153
           +WC+  +N ++I      GTLA++++       L++  +K  R    +   +    D E 
Sbjct: 354 KWCTDSKNGVVIAGYCVEGTLAKEVMS-EPEEVLSMSGQKLARKLSVDYISFSAHTDYEQ 412

Query: 154 KDKQEKEKIPPH 165
             +  +  +PPH
Sbjct: 413 TSEFIRMLMPPH 424


>gi|380480161|emb|CCF42595.1| RNA-metabolising metallo-beta-lactamase [Colletotrichum
           higginsianum]
          Length = 979

 Score = 58.2 bits (139), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 49/166 (29%), Positives = 71/166 (42%), Gaps = 52/166 (31%)

Query: 34  IEWMSDKLMKSFEGA-----RNN----------PFHFKHVKLCHSLAELAKVPSP----- 73
           +EWM D +++ FE       R N          PF F+++KL    A++ K+ S      
Sbjct: 332 LEWMDDNIVREFESVADGQRRTNGAEAKSKEGVPFDFRYLKLVERRAQIEKLLSGSGDNV 391

Query: 74  ----KVVLVSTPDMECGFSRDLFFQWCSSPENSIIITN-----RTSPGTLARDL------ 118
               +V+L S   +E GFS+DL         N +I+T+     R    ++AR L      
Sbjct: 392 QAEGRVILASDDTLEWGFSKDLIRGLAKDSRNLVILTDKPAKSRAEQPSIARTLWDWWTE 451

Query: 119 ---------------IEL--GGNRTLTLQVKKRIRLEGEELEEYQK 147
                          IEL  GG R L +Q  KR  LEG+EL  YQ+
Sbjct: 452 RRDGVSVEQSSNGNSIELVYGGGRELEVQEAKRQALEGDELNVYQQ 497


>gi|50304897|ref|XP_452404.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
 gi|74636942|sp|Q6CUI5.1|YSH1_KLULA RecName: Full=Endoribonuclease YSH1; AltName: Full=mRNA
           3'-end-processing protein YSH1
 gi|49641537|emb|CAH01255.1| KLLA0C04598p [Kluyveromyces lactis]
          Length = 764

 Score = 58.2 bits (139), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 27/86 (31%), Positives = 49/86 (56%), Gaps = 1/86 (1%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+D + K F+ ++ NPF FK++    +L E      P V+L S   ++ G SRD+  
Sbjct: 298 VNMMNDDIRKKFKDSQTNPFIFKNISYLKNLDEFEDF-GPSVMLASPGMLQNGLSRDILE 356

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
           +WC   +N +++T  +  GT+A+ L+
Sbjct: 357 KWCPEEKNLVLVTGYSVEGTMAKYLL 382


>gi|365764103|gb|EHN05628.1| Ysh1p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
          Length = 699

 Score = 58.2 bits (139), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 27/83 (32%), Positives = 46/83 (55%), Gaps = 1/83 (1%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+D + K F  ++ NPF FK++    +L +      P V+L S   ++ G SRDL  
Sbjct: 214 VNMMNDDIRKKFRDSQTNPFIFKNISYLRNLEDFQDF-GPSVMLASPGMLQSGLSRDLLE 272

Query: 94  QWCSSPENSIIITNRTSPGTLAR 116
           +WC   +N ++IT  +  GT+A+
Sbjct: 273 RWCPEDKNLVLITGYSIEGTMAK 295


>gi|323336337|gb|EGA77605.1| Ysh1p [Saccharomyces cerevisiae Vin13]
          Length = 745

 Score = 57.8 bits (138), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 27/83 (32%), Positives = 46/83 (55%), Gaps = 1/83 (1%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+D + K F  ++ NPF FK++    +L +      P V+L S   ++ G SRDL  
Sbjct: 260 VNMMNDDIRKKFRDSQTNPFIFKNISYLRNLEDFQDF-GPSVMLASPGMLQSGLSRDLLE 318

Query: 94  QWCSSPENSIIITNRTSPGTLAR 116
           +WC   +N ++IT  +  GT+A+
Sbjct: 319 RWCPEDKNLVLITGYSIEGTMAK 341


>gi|168007963|ref|XP_001756677.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162692273|gb|EDQ78631.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 682

 Score = 57.8 bits (138), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 31/86 (36%), Positives = 47/86 (54%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I  M+D++ K FE   +NPF FKH++   ++     +  P VV+ S   ++ G SR LF 
Sbjct: 292 INAMNDRIQKQFE--VSNPFDFKHIQPLKNIDGFDDI-GPAVVMASPGGLQSGLSRQLFD 348

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC   +NS II      GTLA+ ++
Sbjct: 349 IWCQDKKNSCIIPGYVVEGTLAKAIM 374


>gi|356525973|ref|XP_003531594.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           3-I-like [Glycine max]
          Length = 688

 Score = 57.8 bits (138), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 34/109 (31%), Positives = 59/109 (54%), Gaps = 8/109 (7%)

Query: 13  RNNPFHFKH--VKLCHSLAELAKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKV 70
            N P ++     K C ++ E   +  M+D++    + A++NPF FKH+    S+ E+ K 
Sbjct: 272 HNIPIYYASPLAKKCLTVYETYTLS-MNDRV----QNAKSNPFSFKHISALSSI-EVFKD 325

Query: 71  PSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLI 119
             P VV+ S   ++ G SR LF +WCS  +N+ ++      GTLA+ ++
Sbjct: 326 VGPSVVMASPGGLQSGLSRQLFDKWCSDKKNTCVLPGFVVEGTLAKTIM 374


>gi|307199387|gb|EFN80012.1| Cleavage and polyadenylation specificity factor subunit 3
           [Harpegnathos saltator]
          Length = 685

 Score = 57.8 bits (138), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 30/86 (34%), Positives = 46/86 (53%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+DK+ +    A NNPF FKH+     +     +  P VV+ S   M+ G SR+LF 
Sbjct: 292 VNAMNDKIRRQI--AINNPFVFKHISNLKGIDHFEDI-GPCVVMASPGMMQSGLSRELFE 348

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+  +N +II      GTLA+ ++
Sbjct: 349 SWCTDAKNGVIIAGYCVEGTLAKGIL 374


>gi|349579985|dbj|GAA25146.1| K7_Ysh1p [Saccharomyces cerevisiae Kyokai no. 7]
          Length = 779

 Score = 57.8 bits (138), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 27/83 (32%), Positives = 46/83 (55%), Gaps = 1/83 (1%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+D + K F  ++ NPF FK++    +L +      P V+L S   ++ G SRDL  
Sbjct: 294 VNMMNDDIRKKFRDSQTNPFIFKNISYLRNLEDFQDF-GPSVMLASPGMLQSGLSRDLLE 352

Query: 94  QWCSSPENSIIITNRTSPGTLAR 116
           +WC   +N ++IT  +  GT+A+
Sbjct: 353 RWCPEDKNLVLITGYSIEGTMAK 375


>gi|323303815|gb|EGA57598.1| Ysh1p [Saccharomyces cerevisiae FostersB]
          Length = 727

 Score = 57.8 bits (138), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 27/83 (32%), Positives = 46/83 (55%), Gaps = 1/83 (1%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+D + K F  ++ NPF FK++    +L +      P V+L S   ++ G SRDL  
Sbjct: 260 VNMMNDDIRKKFRDSQTNPFIFKNISYLRNLEDFQDF-GPSVMLASPGMLQSGLSRDLLE 318

Query: 94  QWCSSPENSIIITNRTSPGTLAR 116
           +WC   +N ++IT  +  GT+A+
Sbjct: 319 RWCPEDKNLVLITGYSIEGTMAK 341


>gi|6323307|ref|NP_013379.1| Ysh1p [Saccharomyces cerevisiae S288c]
 gi|74644951|sp|Q06224.1|YSH1_YEAST RecName: Full=Endoribonuclease YSH1; AltName: Full=Yeast 73 kDa
           homolog 1; AltName: Full=mRNA 3'-end-processing protein
           YSH1
 gi|577190|gb|AAB67367.1| Ysh1p: subunit of polyadenylation factor I (PF I) [Saccharomyces
           cerevisiae]
 gi|151940984|gb|EDN59365.1| cleavage factor II (CF II) component [Saccharomyces cerevisiae
           YJM789]
 gi|190405336|gb|EDV08603.1| hypothetical protein SCRG_04228 [Saccharomyces cerevisiae RM11-1a]
 gi|256269831|gb|EEU05091.1| Ysh1p [Saccharomyces cerevisiae JAY291]
 gi|285813694|tpg|DAA09590.1| TPA: Ysh1p [Saccharomyces cerevisiae S288c]
 gi|323332373|gb|EGA73782.1| Ysh1p [Saccharomyces cerevisiae AWRI796]
          Length = 779

 Score = 57.8 bits (138), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 27/83 (32%), Positives = 46/83 (55%), Gaps = 1/83 (1%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+D + K F  ++ NPF FK++    +L +      P V+L S   ++ G SRDL  
Sbjct: 294 VNMMNDDIRKKFRDSQTNPFIFKNISYLRNLEDFQDF-GPSVMLASPGMLQSGLSRDLLE 352

Query: 94  QWCSSPENSIIITNRTSPGTLAR 116
           +WC   +N ++IT  +  GT+A+
Sbjct: 353 RWCPEDKNLVLITGYSIEGTMAK 375


>gi|259148260|emb|CAY81507.1| Ysh1p [Saccharomyces cerevisiae EC1118]
          Length = 779

 Score = 57.8 bits (138), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 27/83 (32%), Positives = 46/83 (55%), Gaps = 1/83 (1%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+D + K F  ++ NPF FK++    +L +      P V+L S   ++ G SRDL  
Sbjct: 294 VNMMNDDIRKKFRDSQTNPFIFKNISYLRNLEDFQDF-GPSVMLASPGMLQSGLSRDLLE 352

Query: 94  QWCSSPENSIIITNRTSPGTLAR 116
           +WC   +N ++IT  +  GT+A+
Sbjct: 353 RWCPEDKNLVLITGYSIEGTMAK 375


>gi|392297785|gb|EIW08884.1| Ysh1p [Saccharomyces cerevisiae CEN.PK113-7D]
          Length = 772

 Score = 57.8 bits (138), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 27/83 (32%), Positives = 46/83 (55%), Gaps = 1/83 (1%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+D + K F  ++ NPF FK++    +L +      P V+L S   ++ G SRDL  
Sbjct: 287 VNMMNDDIRKKFRDSQTNPFIFKNISYLRNLEDFQDF-GPSVMLASPGMLQSGLSRDLLE 345

Query: 94  QWCSSPENSIIITNRTSPGTLAR 116
           +WC   +N ++IT  +  GT+A+
Sbjct: 346 RWCPEDKNLVLITGYSIEGTMAK 368


>gi|322786053|gb|EFZ12664.1| hypothetical protein SINV_01905 [Solenopsis invicta]
          Length = 686

 Score = 57.8 bits (138), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 30/86 (34%), Positives = 46/86 (53%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+DK+ +    A NNPF FKH+     +     +  P VV+ S   M+ G SR+LF 
Sbjct: 293 VNAMNDKIRRQI--AINNPFVFKHISNLKGIDHFEDI-GPCVVMASPGMMQSGLSRELFE 349

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+  +N +II      GTLA+ ++
Sbjct: 350 SWCTDAKNGVIIAGYCVEGTLAKTIL 375


>gi|194386428|dbj|BAG61024.1| unnamed protein product [Homo sapiens]
          Length = 406

 Score = 57.8 bits (138), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 30/86 (34%), Positives = 44/86 (51%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+DK+ K      NNPF FKH+    S+     +  P VV+ S   M+ G SR+LF 
Sbjct: 8   VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 64

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+   N +II      GT A+ ++
Sbjct: 65  SWCTDKRNGVIIAGYCVEGTFAKHIM 90


>gi|401837471|gb|EJT41396.1| YSH1-like protein [Saccharomyces kudriavzevii IFO 1802]
          Length = 779

 Score = 57.8 bits (138), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 27/83 (32%), Positives = 46/83 (55%), Gaps = 1/83 (1%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+D + K F  ++ NPF FK++    +L +      P V+L S   ++ G SRDL  
Sbjct: 294 VNMMNDDIRKKFRDSQTNPFIFKNISYLRNLEDFQDF-GPSVMLASPGMLQSGLSRDLLE 352

Query: 94  QWCSSPENSIIITNRTSPGTLAR 116
           +WC   +N ++IT  +  GT+A+
Sbjct: 353 RWCPEDKNLVLITGYSIEGTMAK 375


>gi|344280152|ref|XP_003411849.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           3-like [Loxodonta africana]
          Length = 903

 Score = 57.8 bits (138), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+DK+ K      NNPF FKH+    S+     +  P VV+ S   M+ G SR+LF 
Sbjct: 505 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFE 561

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+   N +II      GTLA+ ++
Sbjct: 562 SWCTDKRNGVIIAGYCVEGTLAKHIM 587


>gi|380012076|ref|XP_003690115.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           3-like [Apis florea]
          Length = 686

 Score = 57.8 bits (138), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 30/86 (34%), Positives = 46/86 (53%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+DK+ +    A NNPF FKH+     +     +  P VV+ S   M+ G SR+LF 
Sbjct: 293 VNAMNDKIRRQI--AINNPFVFKHISNLKGIDHFEDI-GPCVVMASPGMMQSGLSRELFE 349

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+  +N +II      GTLA+ ++
Sbjct: 350 SWCTDAKNGVIIAGYCVEGTLAKTIL 375


>gi|310799284|gb|EFQ34177.1| RNA-metabolising metallo-beta-lactamase [Glomerella graminicola
           M1.001]
          Length = 984

 Score = 57.8 bits (138), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 47/166 (28%), Positives = 70/166 (42%), Gaps = 52/166 (31%)

Query: 34  IEWMSDKLMKSFEGARNN---------------PFHFKHVKLCHSLAELAKVPSP----- 73
           +EWM D +++ FE   +                PF F+++KL    A++ K+ S      
Sbjct: 332 LEWMDDNIVREFESVADGQRKANGTEAKSKEGVPFDFRYLKLVERRAQIEKLLSGSGDNV 391

Query: 74  ----KVVLVSTPDMECGFSRDLFFQWCSSPENSIIITN-----RTSPGTLARDL------ 118
               +V+L S   +E GFS+DL         N +I+T+     R    ++AR L      
Sbjct: 392 QAEGRVILASDDTLEWGFSKDLIRGLAKDSRNLVILTDKPAKSRAEQPSIARTLWDWWTE 451

Query: 119 ---------------IEL--GGNRTLTLQVKKRIRLEGEELEEYQK 147
                          +EL  GG R L +Q  KR  LEGEEL  YQ+
Sbjct: 452 RRDGVAVEQSSNGNNLELVYGGGRELEVQEAKRQALEGEELNVYQQ 497


>gi|347965534|ref|XP_321933.5| AGAP001224-PA [Anopheles gambiae str. PEST]
 gi|333470467|gb|EAA01794.5| AGAP001224-PA [Anopheles gambiae str. PEST]
          Length = 690

 Score = 57.8 bits (138), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 31/86 (36%), Positives = 46/86 (53%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I  M+DK+ +    A NNPF F+ +     +     V  P VV+ S   M+ G SR+LF 
Sbjct: 294 INAMNDKIRRQI--AINNPFVFRFISNLKGIDHFDDV-GPCVVMASPGMMQSGLSRELFE 350

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+ P+N +II      GTLA+ ++
Sbjct: 351 SWCTDPKNGVIIAGYCVEGTLAKTIL 376



 Score = 38.9 bits (89), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 15/33 (45%), Positives = 21/33 (63%)

Query: 208 VVLVSTPDMECGFSRDLFFQWCSSPENSIIITN 240
           VV+ S   M+ G SR+LF  WC+ P+N +II  
Sbjct: 332 VVMASPGMMQSGLSRELFESWCTDPKNGVIIAG 364


>gi|401624491|gb|EJS42547.1| ysh1p [Saccharomyces arboricola H-6]
          Length = 779

 Score = 57.8 bits (138), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 30/94 (31%), Positives = 51/94 (54%), Gaps = 2/94 (2%)

Query: 23  KLCHSLAELAKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPD 82
           K C S+ +   +  M+D + K F  ++ NPF FK++    +L +      P V+L S   
Sbjct: 284 KKCMSVFQ-TYVNMMNDDIRKKFRDSQTNPFIFKNISYLRNLEDFQDF-GPSVMLASPGM 341

Query: 83  MECGFSRDLFFQWCSSPENSIIITNRTSPGTLAR 116
           ++ G SRDL  +WC   +N ++IT  +  GT+A+
Sbjct: 342 LQSGLSRDLLERWCPEDKNLVLITGYSIEGTMAK 375


>gi|332019331|gb|EGI59837.1| Cleavage and polyadenylation specificity factor subunit 3
           [Acromyrmex echinatior]
          Length = 685

 Score = 57.8 bits (138), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 30/86 (34%), Positives = 46/86 (53%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+DK+ +    A NNPF FKH+     +     +  P VV+ S   M+ G SR+LF 
Sbjct: 293 VNAMNDKIRRQI--AINNPFVFKHISNLKGIDHFEDI-GPCVVMASPGMMQSGLSRELFE 349

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+  +N +II      GTLA+ ++
Sbjct: 350 SWCTDAKNGVIIAGYCVEGTLAKTIL 375


>gi|383861262|ref|XP_003706105.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           3-like [Megachile rotundata]
          Length = 686

 Score = 57.4 bits (137), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 30/86 (34%), Positives = 46/86 (53%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+DK+ +    A NNPF FKH+     +     +  P VV+ S   M+ G SR+LF 
Sbjct: 293 VNAMNDKIRRQI--AINNPFVFKHISNLKGIDHFEDI-GPCVVMASPGMMQSGLSRELFE 349

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+  +N +II      GTLA+ ++
Sbjct: 350 SWCTDAKNGVIIAGYCVEGTLAKTIL 375


>gi|156552097|ref|XP_001605081.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           3-like [Nasonia vitripennis]
          Length = 688

 Score = 57.4 bits (137), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 30/86 (34%), Positives = 46/86 (53%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+DK+ +    A NNPF FKH+     +     +  P VV+ S   M+ G SR+LF 
Sbjct: 296 VNAMNDKIRRQI--AINNPFVFKHISNLKGIDHFDDI-GPCVVMASPGMMQSGLSRELFE 352

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+  +N +II      GTLA+ ++
Sbjct: 353 SWCTDAKNGVIIAGYCVEGTLAKTIL 378


>gi|366992944|ref|XP_003676237.1| hypothetical protein NCAS_0D02950 [Naumovozyma castellii CBS 4309]
 gi|342302103|emb|CCC69876.1| hypothetical protein NCAS_0D02950 [Naumovozyma castellii CBS 4309]
          Length = 771

 Score = 57.4 bits (137), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 31/108 (28%), Positives = 56/108 (51%), Gaps = 4/108 (3%)

Query: 11  GARNNPFHFKH--VKLCHSLAELAKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELA 68
           G+   P ++     K C S+ +   +  M+D + + F  ++ NPF FK++    +L E  
Sbjct: 268 GSGQVPIYYASNLAKKCMSVFQ-TYVNMMNDDIRRKFRDSQTNPFIFKNISYLRNLEEFQ 326

Query: 69  KVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLAR 116
               P V+L S   ++ G SRD+  +WC   +N ++IT  +  GT+A+
Sbjct: 327 DF-GPSVMLASPGMLQSGLSRDVLEKWCPDEKNLVLITGYSIEGTMAK 373


>gi|312372474|gb|EFR20427.1| hypothetical protein AND_20124 [Anopheles darlingi]
          Length = 692

 Score = 57.4 bits (137), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 31/86 (36%), Positives = 46/86 (53%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I  M+DK+ +    A NNPF F+ +     +     V  P VV+ S   M+ G SR+LF 
Sbjct: 296 INAMNDKIRRQI--AINNPFVFRFISNLKGIDHFDDV-GPCVVMASPGMMQSGLSRELFE 352

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+ P+N +II      GTLA+ ++
Sbjct: 353 TWCTDPKNGVIIAGYCVEGTLAKTIL 378


>gi|224140919|ref|XP_002323824.1| predicted protein [Populus trichocarpa]
 gi|222866826|gb|EEF03957.1| predicted protein [Populus trichocarpa]
          Length = 699

 Score = 57.4 bits (137), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 29/83 (34%), Positives = 47/83 (56%), Gaps = 3/83 (3%)

Query: 37  MSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWC 96
           M++++   F  A +NPF FKH+   +S+ +   V  P VV+ +   ++ G SR LF  WC
Sbjct: 302 MNERIRNQF--ADSNPFKFKHISPLNSIEDFTDV-GPSVVMATPGGLQSGLSRQLFDMWC 358

Query: 97  SSPENSIIITNRTSPGTLARDLI 119
           S  +N+ +I      GTLA+ +I
Sbjct: 359 SDKKNACVIPGFLVEGTLAKTII 381


>gi|336261956|ref|XP_003345764.1| hypothetical protein SMAC_05921 [Sordaria macrospora k-hell]
 gi|380090100|emb|CCC12183.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 1003

 Score = 57.4 bits (137), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 46/167 (27%), Positives = 72/167 (43%), Gaps = 53/167 (31%)

Query: 34  IEWMSDKLMKSFEG-----ARNN------------PFHFKHVKLCHSLAELAKV------ 70
           +EWM D ++K FE       RNN            PF FK+++L    A++ K+      
Sbjct: 333 LEWMDDNIIKEFEAFADESRRNNRRDEGNHQTGPGPFDFKYLRLLERKAQIEKILKQSED 392

Query: 71  --PSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTS--PG---TLARDLIE--- 120
             P  KV+L S   ++ GFS+D+     +   N +I+T + +  P    ++AR L E   
Sbjct: 393 TEPRAKVILASDASLDWGFSKDILKSIAADARNLVILTEKPNFEPNHKPSIARTLWEWWK 452

Query: 121 --------------------LGGNRTLTLQVKKRIRLEGEELEEYQK 147
                                 GNR L ++  +R  LEG+EL  YQ+
Sbjct: 453 ERRDGVATERTSNGDTFEQVYAGNRELEVETAERKGLEGDELNVYQQ 499


>gi|342320223|gb|EGU12165.1| Cleavage and polyadenylation specificity factor subunit
           [Rhodotorula glutinis ATCC 204091]
          Length = 1010

 Score = 57.4 bits (137), Expect = 7e-06,   Method: Composition-based stats.
 Identities = 38/144 (26%), Positives = 72/144 (50%), Gaps = 28/144 (19%)

Query: 35  EWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKV--PS-PKVVLVSTPDMECGFSRDL 91
           EWM  ++     G+  +   F ++++  SL E+A    PS PK++L     +  G+SR L
Sbjct: 400 EWMGGQI----AGSGADKLKFANLRIFSSLDEIATTIPPSVPKLILTVPSTLSYGYSRAL 455

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLI---------------------ELGGNRTLTLQ 130
           F  +  +  N +++T  + PG+LAR L                      E+  ++T+ L+
Sbjct: 456 FLDFARNAANLVLLTGLSEPGSLARWLAREVWEPQQEKGCKYGEGKVGKEVKMDQTIELE 515

Query: 131 VKKRIRLEGEELEEYQKKKDKEAK 154
           +K+++ LEG+ELE +   + + A+
Sbjct: 516 IKRKVYLEGDELEAHLAAEREAAE 539


>gi|403373777|gb|EJY86813.1| Cleavage and polyadenylation specificity factor subunit 3
           [Oxytricha trifallax]
          Length = 755

 Score = 57.4 bits (137), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 33/98 (33%), Positives = 51/98 (52%), Gaps = 10/98 (10%)

Query: 37  MSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWC 96
           M D+L    E   NNPFHF+ +   +  +E      P V++ S   ++ G SRDLF +W 
Sbjct: 304 MGDQLRMELESG-NNPFHFEPITTFNDESEF-----PLVIMASPGMLQNGQSRDLFVKWA 357

Query: 97  SSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKR 134
             P+N I+ T  +  GTLA+ ++    NR   L V ++
Sbjct: 358 PDPKNGIVFTGYSVEGTLAKSVM----NRPKILTVAEQ 391


>gi|307177772|gb|EFN66769.1| Cleavage and polyadenylation specificity factor subunit 3 [Camponotus
            floridanus]
          Length = 1750

 Score = 57.0 bits (136), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 30/86 (34%), Positives = 46/86 (53%), Gaps = 3/86 (3%)

Query: 34   IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
            +  M+DK+ +    A NNPF FKH+     +     +  P VV+ S   M+ G SR+LF 
Sbjct: 1357 VNAMNDKIRRQI--AINNPFVFKHISNLKGIDHFEDI-GPCVVMASPGMMQSGLSRELFE 1413

Query: 94   QWCSSPENSIIITNRTSPGTLARDLI 119
             WC+  +N +II      GTLA+ ++
Sbjct: 1414 SWCTDAKNGVIIAGYCVEGTLAKTIL 1439


>gi|388579716|gb|EIM20037.1| hypothetical protein WALSEDRAFT_61199 [Wallemia sebi CBS 633.66]
          Length = 844

 Score = 57.0 bits (136), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 35/99 (35%), Positives = 52/99 (52%), Gaps = 18/99 (18%)

Query: 66  ELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIE----- 120
           E A   +PK+VL     M  G SR LF  + ++P N+I++T+R +PG+LARDL +     
Sbjct: 361 EAAYPGTPKLVLAVPYSMSYGGSRRLFHSFSNNPGNAIVLTSRGAPGSLARDLFDRWNGK 420

Query: 121 ---------LG----GNRTLTLQVKKRIRLEGEELEEYQ 146
                    LG    G+  + +    ++ L GEELE YQ
Sbjct: 421 QNDKWGSGKLGEAVQGDWNIPITEHSKVPLLGEELEAYQ 459


>gi|12846103|dbj|BAB27031.1| unnamed protein product [Mus musculus]
          Length = 406

 Score = 57.0 bits (136), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 30/86 (34%), Positives = 45/86 (52%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+DK+ K      NNPF FKH+    S+     +  P VV+ S   ++ G SR+LF 
Sbjct: 8   VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMIQNGLSRELFE 64

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+   N +II      GTLA+ ++
Sbjct: 65  SWCTDKRNGVIIAGYCVEGTLAKHIM 90


>gi|308492421|ref|XP_003108401.1| CRE-CPSF-3 protein [Caenorhabditis remanei]
 gi|308249249|gb|EFO93201.1| CRE-CPSF-3 protein [Caenorhabditis remanei]
          Length = 712

 Score = 57.0 bits (136), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 45/160 (28%), Positives = 68/160 (42%), Gaps = 21/160 (13%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+ ++ K    A  NPF FKHV     + +      P VVL +   ++ GFSR+LF 
Sbjct: 287 VNGMNSRIQKQI--AVKNPFIFKHVSTLRGMDQFEDA-GPCVVLATPGMLQSGFSRELFE 343

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELE--------EY 145
            WCS  +N  II      GTLAR ++          + ++ + L GE+L          +
Sbjct: 344 NWCSDSKNGCIIAGYCVEGTLARHILT---------EPEEIVSLSGEKLPMRMQVGYVSF 394

Query: 146 QKKKDKEAKDKQEKEKIPPHDTSFINEL-QLSDFKQTLQR 184
               D        K   PPH      EL ++S  K  ++R
Sbjct: 395 SAHTDFNQTSNFVKALKPPHLVLVHGELHEMSRLKAGIER 434


>gi|320581695|gb|EFW95914.1| Ca2+/calmodulin-dependent protein kinase [Ogataea parapolymorpha
           DL-1]
          Length = 1184

 Score = 57.0 bits (136), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 41/146 (28%), Positives = 70/146 (47%), Gaps = 20/146 (13%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPS-PKVVLVSTPDM-ECGFSRDL 91
           +EWMS  + K++E     PF    +KL  SL +LA +   PK++ V   D+ E   SRD 
Sbjct: 316 LEWMSPDITKNWENQNQTPFESNRLKLI-SLRDLASLDHRPKIIFVDGTDLNEGSLSRDC 374

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDL-------------IELGG----NRTLTLQVKKR 134
           F + CS    ++I+T R    T A D+             ++ G      + ++L   + 
Sbjct: 375 FIELCSKHNTALIMTERPEVNTTAYDVYKEWESKVKNDNNLKDGALTILEKQMSLSATRE 434

Query: 135 IRLEGEELEEYQKKKDKEAKDKQEKE 160
            +L G EL  Y+K  ++  + ++E+E
Sbjct: 435 EKLRGSELNAYKKSVEERRQRRKEQE 460


>gi|260942735|ref|XP_002615666.1| hypothetical protein CLUG_04548 [Clavispora lusitaniae ATCC 42720]
 gi|238850956|gb|EEQ40420.1| hypothetical protein CLUG_04548 [Clavispora lusitaniae ATCC 42720]
          Length = 797

 Score = 57.0 bits (136), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 25/71 (35%), Positives = 44/71 (61%), Gaps = 1/71 (1%)

Query: 49  RNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 108
           + NPF FKH+K   S+ ++  +  P VV+ S   ++ G SR L  +W   P+N++I+T  
Sbjct: 321 KTNPFQFKHIKSIKSIDKIQDM-GPCVVVASPGMLQSGVSRQLLERWAPDPKNAVILTGY 379

Query: 109 TSPGTLARDLI 119
           +  GT+A++L+
Sbjct: 380 SVEGTMAKELL 390


>gi|313216448|emb|CBY37756.1| unnamed protein product [Oikopleura dioica]
          Length = 690

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 27/82 (32%), Positives = 41/82 (50%), Gaps = 2/82 (2%)

Query: 37  MSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWC 96
           M+ K+ K+      NPF F+H+     +        P VVL S   M+ G SR+LF +WC
Sbjct: 287 MNSKIQKAI--TTRNPFQFRHISNLKGMEAFDDDIGPSVVLASPGMMQSGLSRELFEKWC 344

Query: 97  SSPENSIIITNRTSPGTLARDL 118
           ++  N +I+      GTLA  +
Sbjct: 345 TNKRNGVILAGYAVEGTLAHQI 366


>gi|406601461|emb|CCH46911.1| hypothetical protein BN7_6516 [Wickerhamomyces ciferrii]
          Length = 679

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 30/86 (34%), Positives = 47/86 (54%), Gaps = 1/86 (1%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I  M+D + K F    +NPF FK++K   +L +      P VV+ S   ++ G SR+L  
Sbjct: 251 INMMNDNIRKQFRDQNSNPFQFKYIKNIKNLDKFDDF-GPCVVVASPGMLQNGVSRELLE 309

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
           +W     NS+I+T  +  GTLA+ L+
Sbjct: 310 RWAPDSRNSVILTGYSVEGTLAKTLL 335


>gi|443713763|gb|ELU06463.1| hypothetical protein CAPTEDRAFT_141423, partial [Capitella teleta]
          Length = 369

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 35/110 (31%), Positives = 54/110 (49%), Gaps = 11/110 (10%)

Query: 23  KLCHSLAELAKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPD 82
           K C S+ +   I  M+DK+ +      NNPF FKH+    + +    +  P VV+ S   
Sbjct: 57  KKCMSVYQ-TYINAMNDKIKRQIN--TNNPFVFKHISNLKASSHFDDI-GPSVVMASPGM 112

Query: 83  MECGFSRDLFFQWCSSPENSIIITNRTSPGTLAR-------DLIELGGNR 125
           M+ G SR+LF  WC+   N  II      GTLA+       +++ +GG +
Sbjct: 113 MQSGLSRELFENWCTDKRNGCIIAGYCVEGTLAKHILSEPEEIVTMGGQK 162


>gi|313244184|emb|CBY15021.1| unnamed protein product [Oikopleura dioica]
          Length = 690

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 27/82 (32%), Positives = 41/82 (50%), Gaps = 2/82 (2%)

Query: 37  MSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWC 96
           M+ K+ K+      NPF F+H+     +        P VVL S   M+ G SR+LF +WC
Sbjct: 287 MNSKIQKAI--TTRNPFQFRHISNLKGMEAFDDDIGPSVVLASPGMMQSGLSRELFEKWC 344

Query: 97  SSPENSIIITNRTSPGTLARDL 118
           ++  N +I+      GTLA  +
Sbjct: 345 TNKRNGVILAGYAVEGTLAHQI 366


>gi|74178650|dbj|BAE33998.1| unnamed protein product [Mus musculus]
          Length = 684

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 30/86 (34%), Positives = 45/86 (52%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+DK+ K      NNPF FKH+    S+     +  P VV+ S   ++ G SR+LF 
Sbjct: 286 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMIQNGLSRELFE 342

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+   N +II      GTLA+ ++
Sbjct: 343 SWCTDKRNGVIIAGYCVEGTLAKHIM 368


>gi|31980904|ref|NP_061283.2| cleavage and polyadenylation specificity factor subunit 3 [Mus
           musculus]
 gi|341940395|sp|Q9QXK7.2|CPSF3_MOUSE RecName: Full=Cleavage and polyadenylation specificity factor
           subunit 3; AltName: Full=Cleavage and polyadenylation
           specificity factor 73 kDa subunit; Short=CPSF 73 kDa
           subunit; Short=mRNA 3'-end-processing endonuclease
           CPSF-73
 gi|23271024|gb|AAH23297.1| Cleavage and polyadenylation specificity factor 3 [Mus musculus]
          Length = 684

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 30/86 (34%), Positives = 45/86 (52%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+DK+ K      NNPF FKH+    S+     +  P VV+ S   ++ G SR+LF 
Sbjct: 286 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMIQNGLSRELFE 342

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+   N +II      GTLA+ ++
Sbjct: 343 SWCTDKRNGVIIAGYCVEGTLAKHIM 368


>gi|367016955|ref|XP_003682976.1| hypothetical protein TDEL_0G03980 [Torulaspora delbrueckii]
 gi|359750639|emb|CCE93765.1| hypothetical protein TDEL_0G03980 [Torulaspora delbrueckii]
          Length = 775

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 31/111 (27%), Positives = 57/111 (51%), Gaps = 4/111 (3%)

Query: 11  GARNNPFHFKH--VKLCHSLAELAKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELA 68
           G    P ++     K C S+ +   +  M+D + K F  ++ NPF FK++    ++ +  
Sbjct: 267 GGGQVPIYYASNLAKKCMSVFQ-TYVNMMNDDIRKKFRDSQTNPFVFKNISYLRNIDDFQ 325

Query: 69  KVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLI 119
               P V+L S   ++ G SRD+  +WC   +N ++IT  +  GT+A+ L+
Sbjct: 326 DF-GPSVMLASPGMLQSGLSRDVLEKWCPEDKNLVLITGYSVEGTMAKFLM 375


>gi|74221128|dbj|BAE42066.1| unnamed protein product [Mus musculus]
          Length = 684

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 30/86 (34%), Positives = 45/86 (52%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+DK+ K      NNPF FKH+    S+     +  P VV+ S   ++ G SR+LF 
Sbjct: 286 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMIQNGLSRELFE 342

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+   N +II      GTLA+ ++
Sbjct: 343 SWCTDKRNGVIIAGYCVEGTLAKHIM 368


>gi|354504216|ref|XP_003514173.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           3 [Cricetulus griseus]
          Length = 684

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 30/86 (34%), Positives = 45/86 (52%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+DK+ K      NNPF FKH+    S+     +  P VV+ S   ++ G SR+LF 
Sbjct: 286 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMIQNGLSRELFE 342

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+   N +II      GTLA+ ++
Sbjct: 343 SWCTDKRNGVIIAGYCVEGTLAKHIM 368


>gi|74211665|dbj|BAE29190.1| unnamed protein product [Mus musculus]
          Length = 684

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 30/86 (34%), Positives = 45/86 (52%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+DK+ K      NNPF FKH+    S+     +  P VV+ S   ++ G SR+LF 
Sbjct: 286 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMIQNGLSRELFE 342

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+   N +II      GTLA+ ++
Sbjct: 343 SWCTDKRNGVIIAGYCVEGTLAKHIM 368


>gi|149050991|gb|EDM03164.1| cleavage and polyadenylation specificity factor 3, isoform CRA_a
           [Rattus norvegicus]
          Length = 685

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 30/86 (34%), Positives = 45/86 (52%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+DK+ K      NNPF FKH+    S+     +  P VV+ S   ++ G SR+LF 
Sbjct: 286 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMIQNGLSRELFE 342

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+   N +II      GTLA+ ++
Sbjct: 343 SWCTDKRNGVIIAGYCVEGTLAKHIM 368


>gi|71795627|ref|NP_001025201.1| cleavage and polyadenylation specificity factor subunit 3 [Rattus
           norvegicus]
 gi|71121802|gb|AAH99817.1| Cleavage and polyadenylation specificity factor 3 [Rattus
           norvegicus]
          Length = 685

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 30/86 (34%), Positives = 45/86 (52%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+DK+ K      NNPF FKH+    S+     +  P VV+ S   ++ G SR+LF 
Sbjct: 286 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMIQNGLSRELFE 342

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+   N +II      GTLA+ ++
Sbjct: 343 SWCTDKRNGVIIAGYCVEGTLAKHIM 368


>gi|148702078|gb|EDL34025.1| cleavage and polyadenylation specificity factor 3, isoform CRA_b
           [Mus musculus]
          Length = 701

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 30/86 (34%), Positives = 45/86 (52%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+DK+ K      NNPF FKH+    S+     +  P VV+ S   ++ G SR+LF 
Sbjct: 303 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMIQNGLSRELFE 359

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+   N +II      GTLA+ ++
Sbjct: 360 SWCTDKRNGVIIAGYCVEGTLAKHIM 385


>gi|290978816|ref|XP_002672131.1| predicted protein [Naegleria gruberi]
 gi|284085705|gb|EFC39387.1| predicted protein [Naegleria gruberi]
          Length = 749

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 41/165 (24%), Positives = 78/165 (47%), Gaps = 7/165 (4%)

Query: 3   DKLMKSFEGARNNPFHFKH--VKLCHSLAELAKIEWMSDKLMKSFEGARNNPFHFKHVKL 60
           D+  ++ +  ++ P ++     K C ++ +   I  M+DK+ K F+   +NPF FKH+  
Sbjct: 281 DEFWETHQDLQHIPIYYASSLAKKCMTIFQ-TYINMMNDKIRKQFD--IHNPFVFKHISN 337

Query: 61  CHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIE 120
             S+ +      P V++ S   ++ G S++LF  WC   +N +II   +  GTLA+ ++ 
Sbjct: 338 LRSIEDFQD-NGPCVIMASPGMLQSGLSKELFELWCQDAKNGVIIAGYSVDGTLAKKIMS 396

Query: 121 LGGNRTLTLQVKKRIRLEGEELEEYQKKKDKEAKDKQEKEKIPPH 165
                TL+      +R+    +  +    DK   ++      PPH
Sbjct: 397 EPETVTLSNGNTVPLRMSVRTI-SFSAHSDKAQTEEFIGTIKPPH 440


>gi|344257704|gb|EGW13808.1| Cleavage and polyadenylation specificity factor subunit 3
           [Cricetulus griseus]
          Length = 647

 Score = 56.2 bits (134), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 30/86 (34%), Positives = 45/86 (52%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+DK+ K      NNPF FKH+    S+     +  P VV+ S   ++ G SR+LF 
Sbjct: 249 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMIQNGLSRELFE 305

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+   N +II      GTLA+ ++
Sbjct: 306 SWCTDKRNGVIIAGYCVEGTLAKHIM 331


>gi|6625904|gb|AAF19420.1|AF203969_1 cleavage and polyadenylation specificity factor 73 kDa subunit [Mus
           musculus]
          Length = 684

 Score = 56.2 bits (134), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 30/83 (36%), Positives = 44/83 (53%), Gaps = 3/83 (3%)

Query: 37  MSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWC 96
           M+DK+ K      NNPF FKH+    S+     +  P VV+ S   ++ G SR+LF  WC
Sbjct: 289 MNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMIQNGLSRELFESWC 345

Query: 97  SSPENSIIITNRTSPGTLARDLI 119
           +   N +II      GTLA+ ++
Sbjct: 346 TDKRNGVIIAGYCVEGTLAKHIM 368


>gi|328350068|emb|CCA36468.1| hypothetical protein PP7435_Chr1-0308 [Komagataella pastoris CBS
           7435]
          Length = 741

 Score = 56.2 bits (134), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 29/109 (26%), Positives = 58/109 (53%), Gaps = 5/109 (4%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I  M++ + K F     NPF F+++K   +L++      P VV+ S   ++ G SR L  
Sbjct: 238 INMMNENIRKKFRDTNKNPFQFQYIKNIKNLSKFDDF-QPSVVVASPGMLQNGVSRALLE 296

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI----ELGGNRTLTLQVKKRIRLE 138
           +W   P N++I+T  +  GT+A++++    E+   +   + + +R+ +E
Sbjct: 297 KWAPDPRNTLIMTGYSVEGTMAKEILLEPTEIPSQQNPDVLIPRRMTVE 345


>gi|302694097|ref|XP_003036727.1| hypothetical protein SCHCODRAFT_72177 [Schizophyllum commune H4-8]
 gi|300110424|gb|EFJ01825.1| hypothetical protein SCHCODRAFT_72177 [Schizophyllum commune H4-8]
          Length = 913

 Score = 56.2 bits (134), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 36/120 (30%), Positives = 62/120 (51%), Gaps = 23/120 (19%)

Query: 53  FHFKHVKLCHSLAELAKVPS---PKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRT 109
             F+H++   +   L +  S   PK++L    ++  G SR +F ++ + P+N I++T R 
Sbjct: 384 LRFQHLEFFPTPQALLQTYSSKDPKLILAVPLNLSHGPSRSIFSEFAAIPDNVILLTQRG 443

Query: 110 SPGTLARDLIE----------------LGGN----RTLTLQVKKRIRLEGEELEEYQKKK 149
            PGTLAR L E                +G N      LTL++++++ L+G+ELE Y  K+
Sbjct: 444 DPGTLARALFEKWNDSQRAEAKWDKGKVGSNVMLDDNLTLKMRRKVPLQGDELEAYLAKE 503


>gi|387594760|gb|EIJ89784.1| cleavage and polyadenylation specificity factor 3 [Nematocida
           parisii ERTm3]
 gi|387596392|gb|EIJ94013.1| cleavage and polyadenylation specificity factor 3 [Nematocida
           parisii ERTm1]
          Length = 696

 Score = 56.2 bits (134), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 29/70 (41%), Positives = 40/70 (57%), Gaps = 1/70 (1%)

Query: 50  NNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRT 109
           +NPFHFKHV+   ++ E  +   P V++ S   ++ G SRDLF  WC    N  II    
Sbjct: 298 SNPFHFKHVQSIKNI-EAYEDRGPCVMMASPGMLQNGLSRDLFEMWCGDKRNGCIIPGYC 356

Query: 110 SPGTLARDLI 119
             GTLA+DL+
Sbjct: 357 VEGTLAKDLL 366


>gi|322708414|gb|EFY99991.1| cleavage and polyadenylylation specificity factor, putative
           [Metarhizium anisopliae ARSEF 23]
          Length = 960

 Score = 56.2 bits (134), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 45/167 (26%), Positives = 69/167 (41%), Gaps = 53/167 (31%)

Query: 34  IEWMSDKLMKSFE----GAR----------NNPFHFKHVKLCHSLAELAK---------- 69
           +EWM D +++ FE    G R            PF FK+++L    A+++K          
Sbjct: 332 LEWMDDNIVQEFEAFAEGQRKANGAVEKKEGGPFDFKYLRLLERKAQVSKLLDQVASAQG 391

Query: 70  -VPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPG-----TLARDLIEL-- 121
            V   +V+L S   ME GFS+D+       P N +I+T+R +       ++AR L E   
Sbjct: 392 EVAKGRVILASDTSMEWGFSKDVLKGLAKDPNNLVILTDRPTIAKDDRPSIARTLWEWWR 451

Query: 122 ---------------------GGNRTLTLQVKKRIRLEGEELEEYQK 147
                                 G R L ++   R  LEG+EL  YQ+
Sbjct: 452 ERKDGVSVEQTSTGDNIETVQAGGRELEIREASRQALEGDELAVYQQ 498



 Score = 42.4 bits (98), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 29/102 (28%), Positives = 50/102 (49%), Gaps = 11/102 (10%)

Query: 140 EELEEYQKKKDKEAKDKQEKEKIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICC 199
           +E E + + + ++A    EK++  P D  ++         + L+R     + +D V    
Sbjct: 341 QEFEAFAEGQ-RKANGAVEKKEGGPFDFKYL---------RLLERKAQVSKLLDQV-ASA 389

Query: 200 RGTVAVRRVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 241
           +G VA  RV+L S   ME GFS+D+       P N +I+T+R
Sbjct: 390 QGEVAKGRVILASDTSMEWGFSKDVLKGLAKDPNNLVILTDR 431


>gi|150865856|ref|XP_001385241.2| hypothetical protein PICST_89936 [Scheffersomyces stipitis CBS
           6054]
 gi|149387112|gb|ABN67212.2| predicted protein [Scheffersomyces stipitis CBS 6054]
          Length = 793

 Score = 56.2 bits (134), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 27/84 (32%), Positives = 50/84 (59%), Gaps = 2/84 (2%)

Query: 37  MSDKL-MKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQW 95
           M+D + + S  G +++PF FK++K    L + + +  P VV+ S   ++ G SR L  +W
Sbjct: 301 MNDNIRLSSRIGQKSSPFDFKYIKSIKDLGKFSDM-GPSVVVASPGMLQAGVSRQLLEKW 359

Query: 96  CSSPENSIIITNRTSPGTLARDLI 119
              P+N +++T  +  GT+A+DL+
Sbjct: 360 APDPKNLVVMTGYSVEGTMAKDLL 383


>gi|149050992|gb|EDM03165.1| cleavage and polyadenylation specificity factor 3, isoform CRA_b
           [Rattus norvegicus]
          Length = 605

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 30/86 (34%), Positives = 45/86 (52%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+DK+ K      NNPF FKH+    S+     +  P VV+ S   ++ G SR+LF 
Sbjct: 286 VNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMIQNGLSRELFE 342

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+   N +II      GTLA+ ++
Sbjct: 343 SWCTDKRNGVIIAGYCVEGTLAKHIM 368


>gi|428172766|gb|EKX41673.1| hypothetical protein GUITHDRAFT_74597 [Guillardia theta CCMP2712]
          Length = 615

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 33/88 (37%), Positives = 50/88 (56%), Gaps = 2/88 (2%)

Query: 51  NPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTS 110
           NPF F+H++   ++AE      P VV+ S   ++ G SRDLF +WC    NS++IT    
Sbjct: 315 NPFEFRHIRYMTTMAEFQD-NCPCVVMASPGMLQNGPSRDLFDRWCEYRHNSVVITGYCV 373

Query: 111 PGTLARDLIELGGNRTLTLQVKKRIRLE 138
             TLA++L++     T TLQ  K + L+
Sbjct: 374 QNTLAKELLD-AQPATHTLQDGKEVPLK 400


>gi|378756364|gb|EHY66388.1| cleavage and polyadenylation specificity factor [Nematocida sp. 1
           ERTm2]
          Length = 692

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 29/70 (41%), Positives = 40/70 (57%), Gaps = 1/70 (1%)

Query: 50  NNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRT 109
           +NPFHFKHV+   ++ E  +   P V++ S   ++ G SRDLF  WC    N  II    
Sbjct: 298 SNPFHFKHVQNIKNI-EAYEDRGPCVMMASPGMLQNGLSRDLFEMWCGDKRNGCIIPGYC 356

Query: 110 SPGTLARDLI 119
             GTLA+DL+
Sbjct: 357 VEGTLAKDLL 366


>gi|392568293|gb|EIW61467.1| hypothetical protein TRAVEDRAFT_162694 [Trametes versicolor
           FP-101664 SS1]
          Length = 943

 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 40/140 (28%), Positives = 64/140 (45%), Gaps = 29/140 (20%)

Query: 53  FHFKHVKLCHSLAELAKVPS---PKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRT 109
             F+H++   S   L    S   PK++L     +  G SR LF  +   P+N +++T R+
Sbjct: 387 LRFRHLEFFSSPQALMSTYSTKDPKLILAVPATLSHGPSRSLFAHFAEIPDNVVLLTGRS 446

Query: 110 SPGTLARDLIE----------------LGGN----RTLTLQVKKRIRLEGEELEEYQKKK 149
            PGTL R L +                +G N      L L++ K++ L+G+ELEE+    
Sbjct: 447 EPGTLGRILFDKWNNSQREEAKWDRGKIGNNIMMDGVLRLEIHKKVPLQGDELEEFL--- 503

Query: 150 DKEAKDKQEKEKIPPHDTSF 169
              AK++  KEK   H  + 
Sbjct: 504 ---AKERAVKEKEAAHQAAL 520



 Score = 37.0 bits (84), Expect = 9.1,   Method: Compositional matrix adjust.
 Identities = 30/121 (24%), Positives = 52/121 (42%), Gaps = 32/121 (26%)

Query: 159 KEKIPPHDTSFINELQLSDFKQTLQRNGIDCEFM-DGVLICCRGTVAVRRVVLVSTPDME 217
           + K P   ++ I EL+L+  K  L + G+  E + +GVLIC            +      
Sbjct: 850 RPKQPLPQSTMIGELKLTSLKARLAQVGVQAELVGEGVLICGAAAKKGASADAL------ 903

Query: 218 CGFSRDLFFQWCSSPENSIII--TNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAI 275
                          E+S+ +  T R R +        LEG +SD YY+V++ +Y  +A+
Sbjct: 904 ---------------EDSVAVRKTGRGRVE--------LEGSISDIYYKVRKEIYALHAL 940

Query: 276 L 276
           +
Sbjct: 941 V 941


>gi|71682600|gb|AAI00570.1| Cpsf3 protein [Mus musculus]
          Length = 512

 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 30/83 (36%), Positives = 44/83 (53%), Gaps = 3/83 (3%)

Query: 37  MSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWC 96
           M+DK+ K      NNPF FKH+    S+     +  P VV+ S   ++ G SR+LF  WC
Sbjct: 289 MNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMIQNGLSRELFESWC 345

Query: 97  SSPENSIIITNRTSPGTLARDLI 119
           +   N +II      GTLA+ ++
Sbjct: 346 TDKRNGVIIAGYCVEGTLAKHIM 368


>gi|291233360|ref|XP_002736621.1| PREDICTED: cleavage and polyadenylation specific factor 3,
           73kDa-like [Saccoglossus kowalevskii]
          Length = 715

 Score = 55.5 bits (132), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 39/143 (27%), Positives = 65/143 (45%), Gaps = 5/143 (3%)

Query: 23  KLCHSLAELAKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPD 82
           K C S+ +   I  M+DK+ +      +NPF FKH+     +     +  P VV+ S   
Sbjct: 283 KKCMSVYQ-TYINAMNDKIKRQI--TISNPFVFKHISNLRGMDHFDDI-GPSVVMASPGM 338

Query: 83  MECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEEL 142
           M+ G SR+LF  WC+   N +II      GTLA+ ++      T     K  +++  + +
Sbjct: 339 MQSGLSRELFESWCTDRRNGVIIAGYCVEGTLAKHILSQPEEVTTMSGQKLPLKMSVDYI 398

Query: 143 EEYQKKKDKEAKDKQEKEKIPPH 165
             +    D +      ++ IPPH
Sbjct: 399 -SFSAHADYQQISDFIRKLIPPH 420


>gi|432117051|gb|ELK37610.1| Cleavage and polyadenylation specificity factor subunit 2 [Myotis
           davidii]
          Length = 121

 Score = 55.5 bits (132), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 34/114 (29%), Positives = 50/114 (43%), Gaps = 40/114 (35%)

Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGF 220
           ++P H + F+NE +L DFKQ L R  I  EF+ GVL+C                      
Sbjct: 46  EVPGHQSVFMNEPRLFDFKQVLLREWIQAEFVGGVLVC---------------------- 83

Query: 221 SRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYA 274
                              N+     +++  I LEGCL  ++YR++ LLY+QYA
Sbjct: 84  ------------------NNQISVCRTETGRIGLEGCLCQDFYRIRDLLYEQYA 119


>gi|429857613|gb|ELA32471.1| cleavage and polyadenylylation specificity [Colletotrichum
           gloeosporioides Nara gc5]
          Length = 962

 Score = 55.5 bits (132), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 45/166 (27%), Positives = 69/166 (41%), Gaps = 52/166 (31%)

Query: 34  IEWMSDKLMKSFEGARNN---------------PFHFKHVKLCHSLAELAKVPSP----- 73
           +EWM D +++ FE   +                PF F+++KL    A++ K+ S      
Sbjct: 332 LEWMDDNIVREFESVADGQRKANGADGKTKEAVPFDFRYLKLVERRAQIEKLLSSSGGNV 391

Query: 74  ----KVVLVSTPDMECGFSRDLFFQWCSSPENSIIITN-----RTSPGTLAR-------- 116
               +V+L S   +E GFS+DL         N +++T+     R    ++AR        
Sbjct: 392 QSEGRVILASDDTLEWGFSKDLIKGLAKDSRNLVVLTDKPPKSRAEQPSIARTLWDWWTE 451

Query: 117 -------------DLIEL--GGNRTLTLQVKKRIRLEGEELEEYQK 147
                        D IE   GG R L +Q  KR  LEG+EL  YQ+
Sbjct: 452 RQDGATVEQTSSGDSIEFVYGGGRELEIQEAKRQALEGDELTVYQQ 497


>gi|312080023|ref|XP_003142424.1| cpsf3-prov protein [Loa loa]
          Length = 715

 Score = 55.5 bits (132), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 32/86 (37%), Positives = 45/86 (52%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+ ++ K    A NNPF FKHV    S+     V  P VVL S   ++ G SR+LF 
Sbjct: 311 VSGMNSRIQKQI--ALNNPFVFKHVSNLKSIDHFEDV-GPCVVLASPGMLQNGLSRELFE 367

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+  +N  II      GTLA+ ++
Sbjct: 368 NWCTDSKNGCIIAGYCVEGTLAKHIL 393


>gi|360043111|emb|CCD78523.1| cleavage and polyadenylation specificity factor-related
           [Schistosoma mansoni]
          Length = 670

 Score = 55.5 bits (132), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 28/74 (37%), Positives = 39/74 (52%), Gaps = 1/74 (1%)

Query: 48  ARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITN 107
           A NNPF F+H+    S+        P VV+ S   M+ G SR+LF  WC+   N +II  
Sbjct: 302 ANNNPFCFRHISNLKSIEHFDD-SGPCVVMASPGMMQSGLSRELFENWCTDKRNGVIIAG 360

Query: 108 RTSPGTLARDLIEL 121
               GTLA+ ++ L
Sbjct: 361 YCVEGTLAKQILSL 374


>gi|242007002|ref|XP_002424331.1| Cleavage and polyadenylation specificity factor 73 kDa subunit,
           putative [Pediculus humanus corporis]
 gi|212507731|gb|EEB11593.1| Cleavage and polyadenylation specificity factor 73 kDa subunit,
           putative [Pediculus humanus corporis]
          Length = 692

 Score = 55.5 bits (132), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 46/86 (53%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+DK+ +    A NNPF F+H+     +     +  P VV+ S   M+ G SR+LF 
Sbjct: 294 VNAMNDKIRRQI--AINNPFIFRHIHNLKGIDHFDDI-GPCVVMASPGMMQSGLSRELFE 350

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+  +N +II      GTLA+ ++
Sbjct: 351 LWCTDSKNGVIIAGYCVEGTLAKQIL 376


>gi|402594378|gb|EJW88304.1| cleavage and polyadenylation specificity factor subunit 3
           [Wuchereria bancrofti]
          Length = 694

 Score = 55.5 bits (132), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 32/86 (37%), Positives = 45/86 (52%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+ ++ K    A NNPF FKHV    S+     V  P VVL S   ++ G SR+LF 
Sbjct: 284 VSGMNSRIQKQI--ALNNPFVFKHVSNLKSIDHFEDV-GPCVVLASPGMLQNGLSRELFE 340

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+  +N  II      GTLA+ ++
Sbjct: 341 NWCTDSKNGCIIAGYCVEGTLAKHIL 366


>gi|170587204|ref|XP_001898368.1| cpsf3-prov protein [Brugia malayi]
 gi|158594194|gb|EDP32780.1| cpsf3-prov protein, putative [Brugia malayi]
          Length = 700

 Score = 55.5 bits (132), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 32/86 (37%), Positives = 45/86 (52%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+ ++ K    A NNPF FKHV    S+     V  P VVL S   ++ G SR+LF 
Sbjct: 290 VSGMNSRIQKQI--ALNNPFVFKHVSNLKSIDHFEDV-GPCVVLASPGMLQNGLSRELFE 346

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+  +N  II      GTLA+ ++
Sbjct: 347 NWCTDSKNGCIIAGYCVEGTLAKHIL 372


>gi|256086716|ref|XP_002579538.1| cleavage and polyadenylation specificity factor-related
           [Schistosoma mansoni]
          Length = 670

 Score = 55.5 bits (132), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 28/74 (37%), Positives = 39/74 (52%), Gaps = 1/74 (1%)

Query: 48  ARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITN 107
           A NNPF F+H+    S+        P VV+ S   M+ G SR+LF  WC+   N +II  
Sbjct: 302 ASNNPFCFRHISNLKSIEHFDD-SGPCVVMASPGMMQSGLSRELFENWCTDKRNGVIIAG 360

Query: 108 RTSPGTLARDLIEL 121
               GTLA+ ++ L
Sbjct: 361 YCVEGTLAKQILSL 374


>gi|303275006|ref|XP_003056813.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226461165|gb|EEH58458.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 803

 Score = 55.1 bits (131), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I  ++  +  +FE A  NPF F HV+       L  V  P VVL +   ++ G SR+LF 
Sbjct: 292 INVLNSDMKAAFEEA--NPFVFNHVQHLSHAGGLDDV-GPCVVLATPSMLQSGLSRELFE 348

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC    N +II +    GTLAR+++
Sbjct: 349 MWCGDANNGVIIADFAVQGTLAREIL 374


>gi|255084461|ref|XP_002508805.1| predicted protein [Micromonas sp. RCC299]
 gi|226524082|gb|ACO70063.1| predicted protein [Micromonas sp. RCC299]
          Length = 728

 Score = 55.1 bits (131), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 31/86 (36%), Positives = 47/86 (54%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I  ++  +  +FE   +NPF F HV    + + L  V  P VVL +   ++ G SRDLF 
Sbjct: 302 INVLNSDMKAAFE--ESNPFVFNHVNHLANSSGLDDV-GPCVVLATPSMLQSGLSRDLFE 358

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC   +N +II +    GTLAR+++
Sbjct: 359 SWCGDSKNGVIICDFAVQGTLAREIL 384


>gi|268552491|ref|XP_002634228.1| Hypothetical protein CBG01798 [Caenorhabditis briggsae]
          Length = 722

 Score = 55.1 bits (131), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 44/160 (27%), Positives = 68/160 (42%), Gaps = 21/160 (13%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+ ++ K    A  NPF FKHV     + +      P VVL +   ++ GFSR+LF 
Sbjct: 287 VNGMNSRIQKQI--AIKNPFIFKHVSTLRGMDQFEDA-GPCVVLATPGMLQSGFSRELFE 343

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELE--------EY 145
            WCS  +N  II      GTLA+ ++          + ++ + L GE+L          +
Sbjct: 344 NWCSDSKNGCIIAGYCVEGTLAKHILT---------EPEEIVSLSGEKLPMRMQVGYVSF 394

Query: 146 QKKKDKEAKDKQEKEKIPPHDTSFINEL-QLSDFKQTLQR 184
               D        K   PPH      EL ++S  K  ++R
Sbjct: 395 SAHTDFNQTSNFVKTLKPPHLVLVHGELHEMSRLKAGIER 434


>gi|336466927|gb|EGO55091.1| hypothetical protein NEUTE1DRAFT_130968 [Neurospora tetrasperma
           FGSC 2508]
          Length = 1051

 Score = 55.1 bits (131), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 44/167 (26%), Positives = 72/167 (43%), Gaps = 53/167 (31%)

Query: 34  IEWMSDKLMKSFEG-----ARNN------------PFHFKHVKLCHSLAELAKV------ 70
           +EWM D +++ FE       RNN            PF FK+++L    A++ K+      
Sbjct: 382 LEWMDDSIIREFEAFADESRRNNRRDEGNHQTGPGPFDFKYLRLLERKAQIDKILQQSDD 441

Query: 71  --PSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTS--PG---TLARDLIE--- 120
             P  KV+L S   ++ GFS+D+     +   N +I+T + +  P    +++R L E   
Sbjct: 442 AEPRAKVILASDTSLDWGFSKDILKSIAADARNLVILTEKPNLEPNQKPSISRTLWEWWK 501

Query: 121 --------------------LGGNRTLTLQVKKRIRLEGEELEEYQK 147
                                 GNR L ++  +R  LEG+EL  YQ+
Sbjct: 502 ERRDGVATERTSNGDTFEQVYAGNRELEIETAERKGLEGDELNVYQQ 548


>gi|164424681|ref|XP_958078.2| hypothetical protein NCU06869 [Neurospora crassa OR74A]
 gi|157070616|gb|EAA28842.2| conserved hypothetical protein [Neurospora crassa OR74A]
          Length = 986

 Score = 55.1 bits (131), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 44/167 (26%), Positives = 72/167 (43%), Gaps = 53/167 (31%)

Query: 34  IEWMSDKLMKSFEG-----ARNN------------PFHFKHVKLCHSLAELAKV------ 70
           +EWM D +++ FE       RNN            PF FK+++L    A++ K+      
Sbjct: 333 LEWMDDSIIREFEAFADESRRNNRRDEGNHQTGPGPFDFKYLRLLERKAQIDKILQQSDD 392

Query: 71  --PSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTS--PG---TLARDLIE--- 120
             P  KV+L S   ++ GFS+D+     +   N +I+T + +  P    +++R L E   
Sbjct: 393 AEPRAKVILASDTSLDWGFSKDILKSIAADARNLVILTEKPNLEPNQKPSISRTLWEWWK 452

Query: 121 --------------------LGGNRTLTLQVKKRIRLEGEELEEYQK 147
                                 GNR L ++  +R  LEG+EL  YQ+
Sbjct: 453 ERRDGVATERTSNGDTFEQVYAGNRELEIETAERKGLEGDELNVYQQ 499


>gi|448118544|ref|XP_004203525.1| Piso0_001136 [Millerozyma farinosa CBS 7064]
 gi|448120951|ref|XP_004204108.1| Piso0_001136 [Millerozyma farinosa CBS 7064]
 gi|359384393|emb|CCE79097.1| Piso0_001136 [Millerozyma farinosa CBS 7064]
 gi|359384976|emb|CCE78511.1| Piso0_001136 [Millerozyma farinosa CBS 7064]
          Length = 809

 Score = 55.1 bits (131), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 26/73 (35%), Positives = 43/73 (58%), Gaps = 1/73 (1%)

Query: 47  GARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIIT 106
           G + NPF FK++K   SL +      P VV+ S   ++ G SR+L  +W   P N++I+T
Sbjct: 321 GEKRNPFQFKYIKNIRSLDKFQDF-GPCVVVASPGMLQNGVSRELLERWAPDPRNAVIMT 379

Query: 107 NRTSPGTLARDLI 119
             +  GT+A++L+
Sbjct: 380 GYSVEGTMAKELL 392


>gi|168026077|ref|XP_001765559.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162683197|gb|EDQ69609.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 682

 Score = 55.1 bits (131), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 47/86 (54%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I  M++++ K FE   +NPF FKH++   ++ E   +  P VV+ S   ++ G SR LF 
Sbjct: 292 INAMNERIQKQFE--VSNPFDFKHIQPLKNIDEFDDI-GPAVVMASPGGLQSGLSRQLFD 348

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC   +NS +I      GT A+ ++
Sbjct: 349 IWCQDKKNSCVIPGYVVEGTPAKAIM 374


>gi|393912282|gb|EJD76666.1| cleavage and polyadenylation specific factor 3 [Loa loa]
          Length = 475

 Score = 55.1 bits (131), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 32/86 (37%), Positives = 45/86 (52%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+ ++ K    A NNPF FKHV    S+     V  P VVL S   ++ G SR+LF 
Sbjct: 71  VSGMNSRIQKQI--ALNNPFVFKHVSNLKSIDHFEDV-GPCVVLASPGMLQNGLSRELFE 127

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+  +N  II      GTLA+ ++
Sbjct: 128 NWCTDSKNGCIIAGYCVEGTLAKHIL 153


>gi|350288464|gb|EGZ69700.1| hypothetical protein NEUTE2DRAFT_152270 [Neurospora tetrasperma
           FGSC 2509]
          Length = 1070

 Score = 55.1 bits (131), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 44/167 (26%), Positives = 72/167 (43%), Gaps = 53/167 (31%)

Query: 34  IEWMSDKLMKSFEG-----ARNN------------PFHFKHVKLCHSLAELAKV------ 70
           +EWM D +++ FE       RNN            PF FK+++L    A++ K+      
Sbjct: 401 LEWMDDSIIREFEAFADESRRNNRRDEGNHQTGPGPFDFKYLRLLERKAQIDKILQQSDD 460

Query: 71  --PSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTS--PG---TLARDLIE--- 120
             P  KV+L S   ++ GFS+D+     +   N +I+T + +  P    +++R L E   
Sbjct: 461 AEPRAKVILASDTSLDWGFSKDILKSIAADARNLVILTEKPNLEPNQKPSISRTLWEWWK 520

Query: 121 --------------------LGGNRTLTLQVKKRIRLEGEELEEYQK 147
                                 GNR L ++  +R  LEG+EL  YQ+
Sbjct: 521 ERRDGVATERTSNGDTFEQVYAGNRELEIETAERKGLEGDELNVYQQ 567


>gi|299116292|emb|CBN76100.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 752

 Score = 55.1 bits (131), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 31/105 (29%), Positives = 53/105 (50%), Gaps = 3/105 (2%)

Query: 17  FHFKHVKLCHSLAELAKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVV 76
            H   +     LA    I  M++ + +  + A  NPF F+H+    S+ +      P VV
Sbjct: 161 LHHIPIYYASRLASKTYINMMNEHIRQQMDVA--NPFKFQHITNLKSIDQFDD-SGPSVV 217

Query: 77  LVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIEL 121
           + S   ++ G SR LF +WC+  +NS++I   +  GTLA+ L+ +
Sbjct: 218 MASPGMLQSGVSRMLFDRWCTDDKNSVLIPGYSVEGTLAKKLLSM 262


>gi|440795785|gb|ELR16901.1| putative cleavage and polyadenylation specificity factor, putative
           [Acanthamoeba castellanii str. Neff]
          Length = 589

 Score = 55.1 bits (131), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 30/86 (34%), Positives = 47/86 (54%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I  M++ + K F  A +NPF FKH+     +        P VV+ S   ++ G SR+LF 
Sbjct: 166 INMMNENIRKQF--AVSNPFVFKHISNLKGMQHFDD-SGPCVVMASPGMLQSGLSRELFE 222

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
           +WCS+ +N +II      GTLA+ ++
Sbjct: 223 KWCSNAKNGVIIPGYCVEGTLAKHIM 248


>gi|324506922|gb|ADY42942.1| Cleavage and polyadenylation specificity factor subunit 3 [Ascaris
           suum]
          Length = 706

 Score = 54.7 bits (130), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 31/86 (36%), Positives = 45/86 (52%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+ ++ K    A NNPF F+HV    S+     V  P VVL S   ++ G SR+LF 
Sbjct: 299 VSGMNSRIQKQI--ALNNPFVFRHVSNLKSIEHFEDV-GPCVVLASPGMLQNGLSRELFE 355

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+  +N  II      GTLA+ ++
Sbjct: 356 NWCTDSKNGCIIAGYCVEGTLAKHIL 381


>gi|322700762|gb|EFY92515.1| cleavage and polyadenylylation specificity factor, putative
           [Metarhizium acridum CQMa 102]
          Length = 960

 Score = 54.7 bits (130), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 43/167 (25%), Positives = 69/167 (41%), Gaps = 53/167 (31%)

Query: 34  IEWMSDKLMKSFE----GAR----------NNPFHFKHVKLCHSLAELAKV--------- 70
           +EWM D +++ FE    G R            PF FK+++L    A+++K+         
Sbjct: 332 LEWMDDNIVQEFEAFAEGQRKANGTVEKKDGGPFDFKYLRLLERKAQVSKLLDQVASAQG 391

Query: 71  --PSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPG-----TLARDLIEL-- 121
                +V+L S   ME GFS+D+  +    P N +I+T+R +       ++ R L E   
Sbjct: 392 EAAKGRVILASDASMEWGFSKDVLRELAKDPNNLVILTDRPTIAKDDRPSIGRTLWEWWK 451

Query: 122 ---------------------GGNRTLTLQVKKRIRLEGEELEEYQK 147
                                 G R L ++   R  LEG+EL  YQ+
Sbjct: 452 ERKDGVSVEQTSTGDNIETVQAGGRELEIREASRQALEGDELAVYQQ 498



 Score = 42.4 bits (98), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 24/67 (35%), Positives = 36/67 (53%), Gaps = 3/67 (4%)

Query: 177 DFK--QTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGFSRDLFFQWCSSPEN 234
           DFK  + L+R     + +D V    +G  A  RV+L S   ME GFS+D+  +    P N
Sbjct: 366 DFKYLRLLERKAQVSKLLDQV-ASAQGEAAKGRVILASDASMEWGFSKDVLRELAKDPNN 424

Query: 235 SIIITNR 241
            +I+T+R
Sbjct: 425 LVILTDR 431


>gi|408391611|gb|EKJ70983.1| hypothetical protein FPSE_08842 [Fusarium pseudograminearum CS3096]
          Length = 963

 Score = 54.7 bits (130), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 48/167 (28%), Positives = 74/167 (44%), Gaps = 55/167 (32%)

Query: 34  IEWMSDKLMKSFE----------GARN----NPFHFKHVKLCHSLAELAKVPS------- 72
           +EWM D +++ FE          GA N     PF FK+++L    A++A++ S       
Sbjct: 332 LEWMDDSIVQEFEAFAEDQRRVNGANNKKEGGPFDFKYLRLLERKAQIARLLSQNVENAG 391

Query: 73  --PKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPG-------TLARDL----- 118
              +V+L S   +E GFS+DL         N +I+T++  PG       ++AR L     
Sbjct: 392 TEGRVILASDSSIEWGFSKDLIKGLAQDSRNLVILTDK--PGLSKNDNPSIARTLWDWWK 449

Query: 119 ----------------IEL--GGNRTLTLQVKKRIRLEGEELEEYQK 147
                           IEL   G R L ++  +R  LEG+EL  YQ+
Sbjct: 450 ERKDGVSVEQNCNGENIELVYAGGRELEIREPQRHALEGDELALYQQ 496


>gi|255721479|ref|XP_002545674.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
 gi|240136163|gb|EER35716.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
          Length = 870

 Score = 54.7 bits (130), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 42/147 (28%), Positives = 71/147 (48%), Gaps = 25/147 (17%)

Query: 35  EWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECG-FSRDLFF 93
           +WMS+ L K +E     PF+   V L    AEL K+  PK+V  S  D++ G  S + F 
Sbjct: 287 DWMSNTLSKQWEELSTVPFNPSKVDLLLDPAELIKLSGPKIVFCSGIDLKDGDISSEAFQ 346

Query: 94  QWCSSPENSIIITNRT-----------------------SPGTLARDLIELGGNRTLTLQ 130
             C+    ++I+T ++                       S G  A+D I +  +RT++LQ
Sbjct: 347 YLCNDTSTTVILTEKSCIDSRNGLGAELYKEWYTSASNKSTGNGAKDGIAVPIDRTISLQ 406

Query: 131 VK-KRIRLEGEELEEYQKKKDKEAKDK 156
            + K + L G++L  +Q+K  ++ K+K
Sbjct: 407 NQTKEVDLTGQDLLNFQEKVAQKRKEK 433


>gi|32566029|ref|NP_502553.2| Protein CPSF-3 [Caenorhabditis elegans]
 gi|26985920|emb|CAC44310.2| Protein CPSF-3 [Caenorhabditis elegans]
          Length = 707

 Score = 54.7 bits (130), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 49/193 (25%), Positives = 81/193 (41%), Gaps = 24/193 (12%)

Query: 3   DKLMKSFEGARNNPFHFKH--VKLCHSLAELAKIEWMSDKLMKSFEGARNNPFHFKHVKL 60
           D+  +S +   + P ++     K C S+ +   +  M+ ++ K    A  NPF FKHV  
Sbjct: 255 DEYWESHQELHDIPVYYASSLAKKCMSVYQ-TFVNGMNSRIQKQI--AVKNPFIFKHVST 311

Query: 61  CHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIE 120
              + +      P VVL +   ++ GFSR+LF  WC   +N  II      GTLA+ ++ 
Sbjct: 312 LRGMDQFEDA-GPCVVLATPGMLQSGFSRELFESWCPDTKNGCIIAGYCVEGTLAKHILS 370

Query: 121 LGGNRTLTLQVKKRIRLEGEELE--------EYQKKKDKEAKDKQEKEKIPPHDTSFINE 172
                    + ++ + L GE+L          +    D        K   PPH      E
Sbjct: 371 ---------EPEEIVSLSGEKLPMRMQVGYVSFSAHTDYHQTSNFVKALKPPHLVLVHGE 421

Query: 173 L-QLSDFKQTLQR 184
           L ++S  K  ++R
Sbjct: 422 LHEMSRLKSGIER 434


>gi|444714932|gb|ELW55806.1| Cleavage and polyadenylation specificity factor subunit 2 [Tupaia
           chinensis]
          Length = 723

 Score = 54.7 bits (130), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 26/47 (55%), Positives = 33/47 (70%), Gaps = 1/47 (2%)

Query: 161 KIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRR 207
           ++P H + F+NE +LSDFKQ L R GI  EF+ GVL+ C   VAVRR
Sbjct: 610 EVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLV-CNNQVAVRR 655


>gi|46138561|ref|XP_390971.1| hypothetical protein FG10795.1 [Gibberella zeae PH-1]
          Length = 964

 Score = 54.3 bits (129), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 48/168 (28%), Positives = 74/168 (44%), Gaps = 56/168 (33%)

Query: 34  IEWMSDKLMKSFE----------GARN-----NPFHFKHVKLCHSLAELAKVPS------ 72
           +EWM D +++ FE          GA N      PF FK+++L    A++A++ S      
Sbjct: 332 LEWMDDSIVQEFEAFAEDQRRVNGANNKKEGGGPFDFKYLRLLERKAQIARLLSQNVENA 391

Query: 73  ---PKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPG-------TLARDL---- 118
               +V+L S   +E GFS+DL         N +I+T++  PG       ++AR L    
Sbjct: 392 GTEGRVILASDSSIEWGFSKDLIKGLAQDSRNLVILTDK--PGLSKNGNPSIARTLWDWW 449

Query: 119 -----------------IEL--GGNRTLTLQVKKRIRLEGEELEEYQK 147
                            IEL   G R L ++  +R  LEG+EL  YQ+
Sbjct: 450 KERKDGVSVEQNSNGDNIELVYAGGRELEIREPQRHALEGDELALYQQ 497


>gi|346327110|gb|EGX96706.1| cleavage and polyadenylylation specificity factor, putative
           [Cordyceps militaris CM01]
          Length = 1024

 Score = 54.3 bits (129), Expect = 5e-05,   Method: Composition-based stats.
 Identities = 48/187 (25%), Positives = 82/187 (43%), Gaps = 33/187 (17%)

Query: 34  IEWMSDKLMKSFEG-----ARNN---------PFHFKHVKLCHSLAELAKVPSP------ 73
           +EWM D +++ FE       R N         P  F+ ++L    A++AK+ S       
Sbjct: 366 LEWMDDGIVQEFEAFAEGQKRTNGASDKKVGGPLDFRFMRLLDRKAQIAKLLSTAVNNGE 425

Query: 74  ---KVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR------TSPGTLARDLIELGGN 124
              +V+L S   M+ GFS+DL     S P N +I+T++      +SP ++AR L  L   
Sbjct: 426 SKGRVILASDTSMDWGFSKDLLRGLASDPNNVVILTDKPGMAQESSP-SIARTLWSLWKE 484

Query: 125 R---TLTLQVKKRIRLEGEELEEYQKKKDKEAKDKQEKEKIPPHDTSFINELQLSDFKQT 181
           R     TLQ      +E      Y  +     ++  + E++  +      + QL   +QT
Sbjct: 485 RKDGVATLQTANGDTIEYVNTSGYDLEIRDAKREPLQGEELAVYQQWLATQRQLQATQQT 544

Query: 182 LQRNGID 188
           +  +GI+
Sbjct: 545 VGADGIE 551


>gi|392512873|emb|CAD25809.2| similarity to HYPOTHETICAL PROTEIN Y162_METJA [Encephalitozoon
           cuniculi GB-M1]
          Length = 643

 Score = 54.3 bits (129), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 26/69 (37%), Positives = 40/69 (57%), Gaps = 1/69 (1%)

Query: 51  NPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTS 110
           NPF FK+VK    +        P V++ S   ++ G SRDLF +WCS  +N++II     
Sbjct: 296 NPFAFKYVKNLKGIDSFDD-EGPCVIMASPGMLQSGLSRDLFERWCSDSKNAVIIPGYCV 354

Query: 111 PGTLARDLI 119
            GTLA++++
Sbjct: 355 DGTLAKEIL 363


>gi|401882746|gb|EJT46990.1| cleavage and polyadenylation specificity factor [Trichosporon
           asahii var. asahii CBS 2479]
 gi|406700483|gb|EKD03650.1| cleavage and polyadenylation specificity factor [Trichosporon
           asahii var. asahii CBS 8904]
          Length = 738

 Score = 54.3 bits (129), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 28/71 (39%), Positives = 39/71 (54%), Gaps = 1/71 (1%)

Query: 49  RNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 108
           R+NPF FK+VK       L     P VV+ S   M  G SR+L  +W   P+N +I+T  
Sbjct: 343 RDNPFDFKYVKWLKDPKRLDH-KQPCVVMASAQFMSFGLSRELLEEWAPDPKNGVIVTGY 401

Query: 109 TSPGTLARDLI 119
           +  GT+AR L+
Sbjct: 402 SIEGTMARTLL 412


>gi|396082284|gb|AFN83894.1| putative beta-lactamase fold-containing exonuclease
           [Encephalitozoon romaleae SJ-2008]
          Length = 643

 Score = 54.3 bits (129), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 26/69 (37%), Positives = 40/69 (57%), Gaps = 1/69 (1%)

Query: 51  NPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTS 110
           NPF FK+VK    +        P V++ S   ++ G SRDLF +WCS  +N++II     
Sbjct: 296 NPFAFKYVKNLKGIDSFDD-EGPCVIMASPGMLQSGLSRDLFERWCSDSKNAVIIPGYCV 354

Query: 111 PGTLARDLI 119
            GTLA++++
Sbjct: 355 DGTLAKEIL 363


>gi|388853919|emb|CCF52417.1| uncharacterized protein [Ustilago hordei]
          Length = 1033

 Score = 54.3 bits (129), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 40/137 (29%), Positives = 67/137 (48%), Gaps = 24/137 (17%)

Query: 47  GARNNPFHFKHVKLCHSLAEL-AKVPS--PKVVLVSTPDMECGFSRDLFFQWCSSPENSI 103
            A ++P  FK+VK+  SL  +   +P    KVVL   P M  G SR L  ++  +P + +
Sbjct: 405 AAASSPLDFKYVKVYSSLQAMDEAIPQDQAKVVLAVPPSMTHGPSRRLLARFAKNPNDVV 464

Query: 104 IITNRTSPGTLARDLIE----------------LG----GNRTLTLQVKKRIRLEGEELE 143
           ++ +R  PG+L R L +                LG     N  +  ++K R+ LEGEEL 
Sbjct: 465 VLISRGEPGSLCRQLWDAWNTNQGKGFAWAQGKLGEAVTPNTRVRFELKSRVPLEGEELR 524

Query: 144 EYQKKKDKEAKDKQEKE 160
            + + +  E +D+Q ++
Sbjct: 525 AHLEAEQAE-RDRQTQQ 540


>gi|301111988|ref|XP_002905073.1| cleavage and polyadenylation specificity factor subunit 3
           [Phytophthora infestans T30-4]
 gi|262095403|gb|EEY53455.1| cleavage and polyadenylation specificity factor subunit 3
           [Phytophthora infestans T30-4]
          Length = 724

 Score = 54.3 bits (129), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 46/86 (53%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I  M+D++ K    A +NPF F+H+    S+ +      P VV+ S   ++ G SR LF 
Sbjct: 291 INMMNDRIRKQI--AISNPFQFEHISNLKSMDDFDD-SGPSVVMASPGMLQSGVSRQLFE 347

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
           +WCS   N+ +I      GTLA+ ++
Sbjct: 348 RWCSDKRNACLIPGYVVEGTLAKKIL 373


>gi|405963469|gb|EKC29039.1| Cleavage and polyadenylation specificity factor subunit 3
           [Crassostrea gigas]
          Length = 686

 Score = 54.3 bits (129), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 27/70 (38%), Positives = 37/70 (52%), Gaps = 1/70 (1%)

Query: 50  NNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRT 109
           +NPF FKH+    S+     +  P VVL S   M+ G SR+LF  WC+   N  II    
Sbjct: 305 SNPFVFKHISNLKSMEHFEDI-GPSVVLASPGMMQSGLSRELFESWCTDKRNGCIIAGYC 363

Query: 110 SPGTLARDLI 119
             GTLA+ ++
Sbjct: 364 VEGTLAKHIL 373


>gi|303391080|ref|XP_003073770.1| putative beta-lactamase fold-containing exonuclease
           [Encephalitozoon intestinalis ATCC 50506]
 gi|303302918|gb|ADM12410.1| putative beta-lactamase fold-containing exonuclease
           [Encephalitozoon intestinalis ATCC 50506]
          Length = 696

 Score = 54.3 bits (129), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 26/69 (37%), Positives = 40/69 (57%), Gaps = 1/69 (1%)

Query: 51  NPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTS 110
           NPF FK+VK    +        P V++ S   ++ G SRDLF +WCS  +N++II     
Sbjct: 296 NPFAFKYVKNLKGIDSFDD-EGPCVIMASPGMLQSGLSRDLFERWCSDSKNAVIIPGYCV 354

Query: 111 PGTLARDLI 119
            GTLA++++
Sbjct: 355 DGTLAKEIL 363


>gi|348686031|gb|EGZ25846.1| hypothetical protein PHYSODRAFT_478942 [Phytophthora sojae]
          Length = 733

 Score = 53.9 bits (128), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 46/86 (53%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I  M+D++ K    A +NPF F+H+    S+ +      P VV+ S   ++ G SR LF 
Sbjct: 291 INMMNDRIRKQI--AISNPFQFEHISNLKSMDDFDD-SGPSVVMASPGMLQSGVSRQLFE 347

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
           +WCS   N+ +I      GTLA+ ++
Sbjct: 348 RWCSDKRNACLIPGYVVEGTLAKKIL 373


>gi|198421242|ref|XP_002128016.1| PREDICTED: similar to Cleavage and polyadenylation specificity
           factor subunit 3 (Cleavage and polyadenylation
           specificity factor 73 kDa subunit) (CPSF 73 kDa subunit)
           (mRNA 3-end-processing endonuclease CPSF-73) [Ciona
           intestinalis]
          Length = 690

 Score = 53.9 bits (128), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 25/70 (35%), Positives = 37/70 (52%), Gaps = 1/70 (1%)

Query: 50  NNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRT 109
           +NPF FKH+     +     V  P VV+ S   M+ G SR+LF  WC+   N +I+    
Sbjct: 305 SNPFQFKHISNLKGMEHFDDV-GPSVVMASPGMMQSGLSRELFESWCNDRRNGVIVAGYC 363

Query: 110 SPGTLARDLI 119
             GTLA+ ++
Sbjct: 364 VEGTLAKHIL 373


>gi|401827745|ref|XP_003888165.1| putative RNA-processing beta-lactamase-fold exonuclease
           [Encephalitozoon hellem ATCC 50504]
 gi|392999365|gb|AFM99184.1| putative RNA-processing beta-lactamase-fold exonuclease
           [Encephalitozoon hellem ATCC 50504]
          Length = 643

 Score = 53.9 bits (128), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 26/69 (37%), Positives = 39/69 (56%), Gaps = 1/69 (1%)

Query: 51  NPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTS 110
           NPF FK+VK    +        P V++ S   ++ G SRDLF +WCS   N++II     
Sbjct: 296 NPFAFKYVKNLKGIDSFDD-EGPCVIMASPGMLQSGLSRDLFERWCSDSRNAVIIPGYCV 354

Query: 111 PGTLARDLI 119
            GTLA++++
Sbjct: 355 DGTLAKEIL 363


>gi|425768274|gb|EKV06801.1| Cleavage and polyadenylylation specificity factor, putative
           [Penicillium digitatum Pd1]
 gi|425770355|gb|EKV08828.1| Cleavage and polyadenylylation specificity factor, putative
           [Penicillium digitatum PHI26]
          Length = 1001

 Score = 53.9 bits (128), Expect = 7e-05,   Method: Composition-based stats.
 Identities = 29/96 (30%), Positives = 45/96 (46%), Gaps = 23/96 (23%)

Query: 34  IEWMSDKLMKSFEGARNN--------------------PFHFKHVKLCHSLAELAKV--- 70
           +EWM + +++ FE A ++                    PF FKH+K+      L K+   
Sbjct: 344 LEWMDENIVREFEAAESSDVTNGQRTGAQEKSSNKGGGPFTFKHLKIIERKKRLEKLLAE 403

Query: 71  PSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIIT 106
           P PKV+L S   M+ GFS+D   Q    P N +++T
Sbjct: 404 PGPKVILASDTSMDWGFSKDALRQVAEGPNNLLLLT 439


>gi|300706475|ref|XP_002995499.1| hypothetical protein NCER_101581 [Nosema ceranae BRL01]
 gi|239604633|gb|EEQ81828.1| hypothetical protein NCER_101581 [Nosema ceranae BRL01]
          Length = 671

 Score = 53.9 bits (128), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 30/86 (34%), Positives = 49/86 (56%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I  M+D + KS      NPF F++VK   S+ +  +  SP V++ S   ++ G SR+LF 
Sbjct: 305 INMMNDHIKKS--SLIKNPFAFQYVKNLKSI-DFFEDNSPCVIMASPGMLQSGLSRELFE 361

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
           +WC    N +II   +  GTLA++++
Sbjct: 362 KWCGDRRNGVIIPGYSVDGTLAKEIL 387


>gi|402696939|gb|AFQ90658.1| 73kDa cleavage and polyadenylation specific factor 3, partial
           [Draco beccarii]
          Length = 220

 Score = 53.9 bits (128), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 29/77 (37%), Positives = 39/77 (50%), Gaps = 3/77 (3%)

Query: 37  MSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWC 96
           M+DK+ K      NNPF FKH+    S+     +  P VV+ S   M+ G SR+LF  WC
Sbjct: 147 MNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFESWC 203

Query: 97  SSPENSIIITNRTSPGT 113
           +   N +II      GT
Sbjct: 204 TDKRNGVIIAGYCVEGT 220


>gi|19074699|ref|NP_586205.1| similarity to HYPOTHETICAL PROTEIN Y162_METJA [Encephalitozoon
           cuniculi GB-M1]
          Length = 730

 Score = 53.9 bits (128), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 26/69 (37%), Positives = 40/69 (57%), Gaps = 1/69 (1%)

Query: 51  NPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTS 110
           NPF FK+VK    +        P V++ S   ++ G SRDLF +WCS  +N++II     
Sbjct: 383 NPFAFKYVKNLKGIDSFDD-EGPCVIMASPGMLQSGLSRDLFERWCSDSKNAVIIPGYCV 441

Query: 111 PGTLARDLI 119
            GTLA++++
Sbjct: 442 DGTLAKEIL 450


>gi|66820693|ref|XP_643926.1| beta-lactamase domain-containing protein [Dictyostelium discoideum
           AX4]
 gi|74860395|sp|Q86A79.1|CPSF3_DICDI RecName: Full=Cleavage and polyadenylation specificity factor
           subunit 3; Short=Cleavage and polyadenylation
           specificity factor 3
 gi|60472339|gb|EAL70292.1| beta-lactamase domain-containing protein [Dictyostelium discoideum
           AX4]
          Length = 774

 Score = 53.9 bits (128), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 45/86 (52%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I  M+D++   F+ +  NPF FKH+K    +        P V + S   ++ G SR LF 
Sbjct: 313 INMMNDRVRAQFDVS--NPFEFKHIKNIKGIESFDD-RGPCVFMASPGMLQSGLSRQLFE 369

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
           +WCS   N I+I   +  GTLA+ ++
Sbjct: 370 RWCSDKRNGIVIPGYSVEGTLAKHIM 395


>gi|254582142|ref|XP_002497056.1| ZYRO0D14410p [Zygosaccharomyces rouxii]
 gi|238939948|emb|CAR28123.1| ZYRO0D14410p [Zygosaccharomyces rouxii]
          Length = 772

 Score = 53.9 bits (128), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 25/86 (29%), Positives = 48/86 (55%), Gaps = 1/86 (1%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+D + + F  ++ NPF FK++    ++ E      P V+L S   ++ G SR++  
Sbjct: 292 VNMMNDDIRRKFRDSQTNPFVFKNISYLKNIDEFQDF-GPSVMLASPGMLQNGLSREVLE 350

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
           +WC   +N ++IT  +  GT+A+ L+
Sbjct: 351 RWCPEGKNLVLITGYSVEGTMAKFLM 376


>gi|296418744|ref|XP_002838985.1| hypothetical protein [Tuber melanosporum Mel28]
 gi|295634979|emb|CAZ83176.1| unnamed protein product [Tuber melanosporum]
          Length = 783

 Score = 53.9 bits (128), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 28/88 (31%), Positives = 51/88 (57%), Gaps = 4/88 (4%)

Query: 34  IEWMSDKLMKS-FEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDL 91
           I  M+DK+ ++ FEG   NP+ F+ V+   ++     V     V++++P M + G SR+L
Sbjct: 297 INTMNDKIKRAMFEGEGRNPWDFRWVRSLKTIDRFEDVGG--CVMLASPGMLQNGVSREL 354

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLI 119
             +WC  P N ++IT  +  GT+A+ ++
Sbjct: 355 LERWCPDPRNGLVITGYSVEGTMAKQIM 382


>gi|340545979|gb|AEK51788.1| cleavage and polyadenylation specific factor 3 [Heteronotia binoei]
 gi|402696941|gb|AFQ90659.1| 73kDa cleavage and polyadenylation specific factor 3, partial
           [Malaclemys terrapin]
 gi|402696943|gb|AFQ90660.1| 73kDa cleavage and polyadenylation specific factor 3, partial
           [Testudo hermanni]
          Length = 220

 Score = 53.9 bits (128), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 29/77 (37%), Positives = 39/77 (50%), Gaps = 3/77 (3%)

Query: 37  MSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWC 96
           M+DK+ K      NNPF FKH+    S+     +  P VV+ S   M+ G SR+LF  WC
Sbjct: 147 MNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFESWC 203

Query: 97  SSPENSIIITNRTSPGT 113
           +   N +II      GT
Sbjct: 204 TDKRNGVIIAGYCVEGT 220


>gi|339237605|ref|XP_003380357.1| cleavage and polyadenylation specificity factor subunit 3
           [Trichinella spiralis]
 gi|316976818|gb|EFV60027.1| cleavage and polyadenylation specificity factor subunit 3
           [Trichinella spiralis]
          Length = 687

 Score = 53.5 bits (127), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 31/86 (36%), Positives = 44/86 (51%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I  M+  + K    A  NPF FKHV+   S+     +  P VVL S   ++ G SR+LF 
Sbjct: 294 ISGMNQNIQKQI--AVQNPFLFKHVRSLRSIDFFEDI-GPCVVLASPGMLQSGLSRELFE 350

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+  +N  II      GTLA+ ++
Sbjct: 351 MWCTDTKNGCIIAGYCVEGTLAKHIL 376


>gi|71017515|ref|XP_758988.1| hypothetical protein UM02841.1 [Ustilago maydis 521]
 gi|46098766|gb|EAK83999.1| hypothetical protein UM02841.1 [Ustilago maydis 521]
          Length = 979

 Score = 53.5 bits (127), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 34/130 (26%), Positives = 63/130 (48%), Gaps = 24/130 (18%)

Query: 47  GARNNPFHFKHVKLCHSLAEL-AKVPS--PKVVLVSTPDMECGFSRDLFFQWCSSPENSI 103
            A ++P  FK+VK+  +L  +   +P    KVVL   P M  G SR L  ++  +P + +
Sbjct: 423 AAASSPLDFKYVKVFPTLQAMDEAIPQDQAKVVLAVPPSMTHGPSRKLLARFAQNPNDVV 482

Query: 104 IITNRTSPGTLARDLIE---------------------LGGNRTLTLQVKKRIRLEGEEL 142
           ++ +R  PG+L R+L +                     +  N +L  ++K ++ LEG+EL
Sbjct: 483 VLISRGEPGSLCRELWDAWNTNQSKGFSWSQGKLGQAVVASNTSLRFELKSKVPLEGDEL 542

Query: 143 EEYQKKKDKE 152
             +++ +  E
Sbjct: 543 RAHREAEQAE 552


>gi|393217572|gb|EJD03061.1| Metallo-hydrolase/oxidoreductase [Fomitiporia mediterranea MF3/22]
          Length = 826

 Score = 53.5 bits (127), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 33/89 (37%), Positives = 45/89 (50%), Gaps = 4/89 (4%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAK--VPSPKVVLVSTPDM-ECGFSRD 90
           I  M+  +   F   R+NPF FKHV     +    K     P  V++ TP M + G SR+
Sbjct: 284 IHTMNSNIRSRF-AKRDNPFVFKHVSNIPQVRGWEKRIAEGPPCVILCTPGMLQPGPSRE 342

Query: 91  LFFQWCSSPENSIIITNRTSPGTLARDLI 119
           L   W   P N +IIT  +  GTLARD++
Sbjct: 343 LLELWAPDPRNGLIITGYSVEGTLARDIV 371


>gi|391348443|ref|XP_003748457.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           3-like [Metaseiulus occidentalis]
          Length = 673

 Score = 53.5 bits (127), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 26/72 (36%), Positives = 39/72 (54%), Gaps = 1/72 (1%)

Query: 48  ARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITN 107
           A +NPF FKH+    S+     V  P V++ +   M+ G SR+LF  WC   +N +II  
Sbjct: 296 AISNPFVFKHISNLKSIDHFDDV-GPCVIMATPGMMQSGLSRELFEAWCGDTKNGVIIAG 354

Query: 108 RTSPGTLARDLI 119
               GTLA+ ++
Sbjct: 355 YCVEGTLAKQIL 366


>gi|443898849|dbj|GAC76183.1| mRNA cleavage and polyadenylation factor II complex, subunit CFT2
           [Pseudozyma antarctica T-34]
          Length = 1135

 Score = 53.5 bits (127), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 36/132 (27%), Positives = 67/132 (50%), Gaps = 27/132 (20%)

Query: 50  NNPFHFKHVKLCHSLAELAK-VPS--PKVVLVSTPDMECGFSRDLFFQWCSSPENSIIIT 106
           ++P  FK+V++  SL ++ + +P    +VVL   P M  G SR L  ++  +P ++I++ 
Sbjct: 503 SSPLDFKYVRVFASLQQMDEAIPQDQARVVLAVPPSMTHGPSRRLLARFARNPNDAIVLI 562

Query: 107 NRTSPGTLARDLIE--------------------LGGNRTLTLQVKKRIRLEGEE----L 142
           +R  PG+L R L +                    + G  T+  +++ ++ LEGEE    L
Sbjct: 563 SRGEPGSLCRQLWDAWNQRQPKGFSWTKGKLGEVVSGEATVRYELQSKVPLEGEELRLHL 622

Query: 143 EEYQKKKDKEAK 154
           E  Q ++D+ A+
Sbjct: 623 ESEQAERDRLAQ 634


>gi|426197081|gb|EKV47008.1| hypothetical protein AGABI2DRAFT_203789 [Agaricus bisporus var.
           bisporus H97]
          Length = 794

 Score = 53.5 bits (127), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 31/75 (41%), Positives = 41/75 (54%), Gaps = 5/75 (6%)

Query: 49  RNNPFHFKHVKLCHSL----AELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSII 104
           R+NPF FKH+           ++A  P P VVL S   M+ G SR+LF  WC    N +I
Sbjct: 308 RDNPFVFKHISNVPQTRGWEKKIADGP-PCVVLASPGFMQVGPSRELFEHWCPDARNGLI 366

Query: 105 ITNRTSPGTLARDLI 119
           IT  +  GT ARD++
Sbjct: 367 ITGYSIEGTPARDIM 381


>gi|321264788|ref|XP_003197111.1| cleavage and polyadenylation specificity factor [Cryptococcus
           gattii WM276]
 gi|317463589|gb|ADV25324.1| Cleavage and polyadenylation specificity factor, putative
           [Cryptococcus gattii WM276]
          Length = 778

 Score = 53.5 bits (127), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 32/93 (34%), Positives = 46/93 (49%), Gaps = 8/93 (8%)

Query: 49  RNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 108
           R+NPF F+ VK      +L +   P V++ S   M  G SRDL  +W    +N +I+T  
Sbjct: 323 RDNPFDFRFVKWLKDPQKLRENKGPCVIMSSPQFMSFGLSRDLLEEWAPDSKNGVIVTGY 382

Query: 109 TSPGTLARDLIE--------LGGNRTLTLQVKK 133
           +  GT+AR L+          GGN    L VK+
Sbjct: 383 SIEGTMARTLLSEPDHIESLKGGNIPRRLTVKE 415


>gi|409080187|gb|EKM80547.1| hypothetical protein AGABI1DRAFT_70926 [Agaricus bisporus var.
           burnettii JB137-S8]
          Length = 841

 Score = 53.5 bits (127), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 31/75 (41%), Positives = 41/75 (54%), Gaps = 5/75 (6%)

Query: 49  RNNPFHFKHVKLCHSL----AELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSII 104
           R+NPF FKH+           ++A  P P VVL S   M+ G SR+LF  WC    N +I
Sbjct: 308 RDNPFVFKHISNVPQTRGWEKKIADGP-PCVVLASPGFMQVGPSRELFEHWCPDARNGLI 366

Query: 105 ITNRTSPGTLARDLI 119
           IT  +  GT ARD++
Sbjct: 367 ITGYSIEGTPARDIM 381


>gi|238882385|gb|EEQ46023.1| hypothetical protein CAWG_04366 [Candida albicans WO-1]
          Length = 783

 Score = 53.5 bits (127), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 30/86 (34%), Positives = 52/86 (60%), Gaps = 4/86 (4%)

Query: 37  MSDKL-MKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFFQ 94
           M+DK+ + S    ++NPF FK++K    L++   +  P VV V+TP M + G SR L  +
Sbjct: 300 MNDKIRLSSASSEKSNPFDFKYIKSIKDLSKFQDM-GPSVV-VATPGMLQAGVSRQLLEK 357

Query: 95  WCSSPENSIIITNRTSPGTLARDLIE 120
           W    +N +I+T  +  GT+A++L++
Sbjct: 358 WAPDGKNLVILTGYSVEGTMAKELLK 383


>gi|213409816|ref|XP_002175678.1| endoribonuclease ysh1 [Schizosaccharomyces japonicus yFS275]
 gi|212003725|gb|EEB09385.1| endoribonuclease ysh1 [Schizosaccharomyces japonicus yFS275]
          Length = 771

 Score = 53.1 bits (126), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 27/83 (32%), Positives = 47/83 (56%), Gaps = 3/83 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I  M+DK+ K+F  A  NPF F+++K   S+ +   +  P V+L S   ++ G SR L  
Sbjct: 264 INMMNDKIRKAF--AERNPFIFRYIKSLRSIDKFDDI-GPSVILASPGMLQNGVSRTLLE 320

Query: 94  QWCSSPENSIIITNRTSPGTLAR 116
           +W     N++++T  +  GT+A+
Sbjct: 321 RWAPDARNTLLLTGYSVEGTMAK 343


>gi|393245131|gb|EJD52642.1| Metallo-hydrolase/oxidoreductase [Auricularia delicata TFB-10046
           SS5]
          Length = 751

 Score = 53.1 bits (126), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 29/89 (32%), Positives = 48/89 (53%), Gaps = 4/89 (4%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAK--VPSPKVVLVSTPDM-ECGFSRD 90
           I  M+  + + F   R+NPF FKH+        L +     P  V++++P M + G SR+
Sbjct: 284 IHTMNSNIRQRF-ARRDNPFIFKHISHLPQTRGLERKIADGPPCVVLASPGMLQSGTSRE 342

Query: 91  LFFQWCSSPENSIIITNRTSPGTLARDLI 119
           L   W   P N++++T  +  GTLARD++
Sbjct: 343 LLELWAPDPRNALVVTGYSVEGTLARDIL 371


>gi|58270576|ref|XP_572444.1| hypothetical protein CNH02710 [Cryptococcus neoformans var.
           neoformans JEC21]
 gi|134118056|ref|XP_772409.1| hypothetical protein CNBL2750 [Cryptococcus neoformans var.
           neoformans B-3501A]
 gi|338819805|sp|P0CM89.1|YSH1_CRYNB RecName: Full=Endoribonuclease YSH1; AltName: Full=mRNA
           3'-end-processing protein YSH1
 gi|338819806|sp|P0CM88.1|YSH1_CRYNJ RecName: Full=Endoribonuclease YSH1; AltName: Full=mRNA
           3'-end-processing protein YSH1
 gi|50255022|gb|EAL17762.1| hypothetical protein CNBL2750 [Cryptococcus neoformans var.
           neoformans B-3501A]
 gi|57228702|gb|AAW45137.1| hypothetical protein CNH02710 [Cryptococcus neoformans var.
           neoformans JEC21]
          Length = 773

 Score = 53.1 bits (126), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 32/93 (34%), Positives = 46/93 (49%), Gaps = 8/93 (8%)

Query: 49  RNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 108
           R+NPF F+ VK      +L +   P V++ S   M  G SRDL  +W    +N +I+T  
Sbjct: 323 RDNPFDFRFVKWLKDPQKLRENKGPCVIMSSPQFMSFGLSRDLLEEWAPDSKNGVIVTGY 382

Query: 109 TSPGTLARDLIE--------LGGNRTLTLQVKK 133
           +  GT+AR L+          GGN    L VK+
Sbjct: 383 SIEGTMARTLLSEPDHIESLKGGNVPRRLTVKE 415


>gi|68489322|ref|XP_711502.1| hypothetical protein CaO19.12941 [Candida albicans SC5314]
 gi|68489371|ref|XP_711478.1| hypothetical protein CaO19.5486 [Candida albicans SC5314]
 gi|74584420|sp|Q59P50.1|YSH1_CANAL RecName: Full=Endoribonuclease YSH1; AltName: Full=mRNA
           3'-end-processing protein YSH1
 gi|46432783|gb|EAK92250.1| hypothetical protein CaO19.5486 [Candida albicans SC5314]
 gi|46432809|gb|EAK92275.1| hypothetical protein CaO19.12941 [Candida albicans SC5314]
          Length = 870

 Score = 53.1 bits (126), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 30/86 (34%), Positives = 52/86 (60%), Gaps = 4/86 (4%)

Query: 37  MSDKL-MKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFFQ 94
           M+DK+ + S    ++NPF FK++K    L++   +  P VV V+TP M + G SR L  +
Sbjct: 387 MNDKIRLSSASSEKSNPFDFKYIKSIKDLSKFQDM-GPSVV-VATPGMLQAGVSRQLLEK 444

Query: 95  WCSSPENSIIITNRTSPGTLARDLIE 120
           W    +N +I+T  +  GT+A++L++
Sbjct: 445 WAPDGKNLVILTGYSVEGTMAKELLK 470


>gi|405124298|gb|AFR99060.1| endoribonuclease YSH1 [Cryptococcus neoformans var. grubii H99]
          Length = 770

 Score = 53.1 bits (126), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 32/93 (34%), Positives = 46/93 (49%), Gaps = 8/93 (8%)

Query: 49  RNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 108
           R+NPF F+ VK      +L +   P V++ S   M  G SRDL  +W    +N +I+T  
Sbjct: 323 RDNPFDFRFVKWLKDPQKLRENKGPCVIMSSPQFMSFGLSRDLLEEWAPDSKNGVIVTGY 382

Query: 109 TSPGTLARDLIE--------LGGNRTLTLQVKK 133
           +  GT+AR L+          GGN    L VK+
Sbjct: 383 SIEGTMARTLLSEPDHIESLKGGNIPRRLTVKE 415


>gi|429963183|gb|ELA42727.1| hypothetical protein VICG_00042 [Vittaforma corneae ATCC 50505]
          Length = 642

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 33/113 (29%), Positives = 60/113 (53%), Gaps = 12/113 (10%)

Query: 15  NPFHFKHVKLCHSLAELAK--------IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAE 66
           NP H K + + ++ A   K        I  M++++ K+      NPF F++V+    +  
Sbjct: 255 NP-HLKGIPIYYASALAQKCMSVYQTYINMMNERIQKA--SLVKNPFDFRNVESIKDIQS 311

Query: 67  LAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLI 119
             K   P V++ S   ++ GFSR+LF +WCS+ +N ++I      GTLA++++
Sbjct: 312 F-KDTGPCVMMASPGMLQSGFSRELFEKWCSNEKNGVVIPGYCVEGTLAKEIL 363


>gi|170093225|ref|XP_001877834.1| predicted protein [Laccaria bicolor S238N-H82]
 gi|164647693|gb|EDR11937.1| predicted protein [Laccaria bicolor S238N-H82]
          Length = 772

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 34/86 (39%), Positives = 45/86 (52%), Gaps = 9/86 (10%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I  M++ +   F   R+NPF FK    C  +AE      P VVL S   M+ G SR+LF 
Sbjct: 284 IHTMNNNIRSRF-AKRDNPFVFK----CKKIAE----GPPCVVLASPGFMQVGPSRELFE 334

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            W     N +IIT  +  GTLARD++
Sbjct: 335 LWAPDARNGLIITGYSIEGTLARDIM 360


>gi|164663111|ref|XP_001732677.1| hypothetical protein MGL_0452 [Malassezia globosa CBS 7966]
 gi|159106580|gb|EDP45463.1| hypothetical protein MGL_0452 [Malassezia globosa CBS 7966]
          Length = 862

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 35/137 (25%), Positives = 66/137 (48%), Gaps = 28/137 (20%)

Query: 52  PFHFKHVKLCHSLAELAKVPSP---KVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 108
           P  F  ++   S+  L +  +P   KVVL + P +  G SR L  ++   P+  +I+T+R
Sbjct: 379 PLDFSGLRFYSSVEALHQALTPSQVKVVLATPPALSHGLSRQLLPEFLCDPDALLILTSR 438

Query: 109 TSPGTLARDL----------------------IELGGNRTLTLQVKKRIRLEGEELEEY- 145
            +P +L R+L                      + +GG   L+ ++++R+ L G+EL  Y 
Sbjct: 439 GTPSSLVRNLWDRWNAKQADRDAWRQGHVGVPVSVGGQ--LSYELRRRVPLAGDELRTYV 496

Query: 146 QKKKDKEAKDKQEKEKI 162
           +++K +EA     + +I
Sbjct: 497 ERQKAREAAADAPRARI 513


>gi|440632320|gb|ELR02239.1| hypothetical protein GMDG_05312 [Geomyces destructans 20631-21]
          Length = 988

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 43/171 (25%), Positives = 64/171 (37%), Gaps = 57/171 (33%)

Query: 34  IEWMSDKLMKSFEG-----------------ARNNPFHFKHVKLCHSLAELAKV------ 70
           +EWM D +++ FE                  A   PF FKH++L      ++ V      
Sbjct: 334 LEWMDDGIIREFEAIAGGIDRQPNKPSEPRQAGAGPFDFKHLRLIEKKGGVSAVLNNDAT 393

Query: 71  ----PSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSP--------------- 111
               P  KV+L S   ++ GFS+D+     +   N +I+T + S                
Sbjct: 394 KDGKPMAKVILASDRSLDWGFSKDILRNIAADSRNLVILTEKMSKLPDVGRSSGVSRTLW 453

Query: 112 --------------GTLARDLIEL-GGNRTLTLQVKKRIRLEGEELEEYQK 147
                         G+      ++ GG R LTL   KR  LEG EL  YQ+
Sbjct: 454 SWWDERKDGVALEQGSTGESFAQVYGGGRELTLADVKREALEGNELLSYQQ 504


>gi|294658126|ref|XP_460457.2| DEHA2F02134p [Debaryomyces hansenii CBS767]
 gi|218511903|sp|Q6BMW3.2|YSH1_DEBHA RecName: Full=Endoribonuclease YSH1; AltName: Full=mRNA
           3'-end-processing protein YSH1
 gi|202952895|emb|CAG88764.2| DEHA2F02134p [Debaryomyces hansenii CBS767]
          Length = 815

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 25/71 (35%), Positives = 42/71 (59%), Gaps = 1/71 (1%)

Query: 49  RNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 108
           + NPF FK +K   +L +      P VV+ S   ++ G SR+L  +W   P+N++I+T  
Sbjct: 323 KQNPFQFKFIKSIKNLDKFQDF-GPCVVVASPGMLQNGVSRELLERWAPDPKNAVIMTGY 381

Query: 109 TSPGTLARDLI 119
           +  GT+A+DL+
Sbjct: 382 SVEGTMAKDLL 392


>gi|395330425|gb|EJF62808.1| hypothetical protein DICSQDRAFT_135076 [Dichomitus squalens
           LYAD-421 SS1]
          Length = 943

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 36/116 (31%), Positives = 54/116 (46%), Gaps = 23/116 (19%)

Query: 53  FHFKHVKLCHSLAELAKVPS---PKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRT 109
             FKHV+   S   L    S   PK++L     +  G SR +F ++   P+N +++T R 
Sbjct: 386 LRFKHVEFFLSPQALMSTYSSKDPKLILAVPATLSHGPSRAIFAEFAEIPDNVVLLTGRG 445

Query: 110 SPGTLARDLIE----------------LGGN----RTLTLQVKKRIRLEGEELEEY 145
            PGTL R L +                +G N      L L++  ++ L+GEELEEY
Sbjct: 446 EPGTLGRLLFDKWNDSQREEAKWDRGKIGNNIMMDGVLRLEMHSKVPLQGEELEEY 501


>gi|448517227|ref|XP_003867743.1| endoribonuclease [Candida orthopsilosis Co 90-125]
 gi|380352082|emb|CCG22306.1| endoribonuclease [Candida orthopsilosis]
          Length = 769

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 27/71 (38%), Positives = 45/71 (63%), Gaps = 3/71 (4%)

Query: 51  NPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFFQWCSSPENSIIITNRT 109
           NPF FK++K   +L++ + +  P VV V+TP M + G SR L  +W    +N +I+T  +
Sbjct: 318 NPFDFKYIKSIRNLSKFSDL-GPSVV-VATPGMLQAGVSRQLLEKWAPEQKNLVILTGYS 375

Query: 110 SPGTLARDLIE 120
             GT+A+DL++
Sbjct: 376 VEGTMAKDLLK 386


>gi|326426580|gb|EGD72150.1| cleavage and polyadenylation specificity factor subunit 3
           [Salpingoeca sp. ATCC 50818]
          Length = 790

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 24/73 (32%), Positives = 44/73 (60%), Gaps = 2/73 (2%)

Query: 50  NNPFHFKHVKLCHSLAEL-AKVPSPKVVLVSTPDM-ECGFSRDLFFQWCSSPENSIIITN 107
           NNPF+F+HV    +L +   +      V++++P M + G SR++F +W S+  N +++  
Sbjct: 339 NNPFNFRHVSYIRNLHQFDGEYGGGPCVMLASPGMLQSGLSREIFERWASNKANCVLLAG 398

Query: 108 RTSPGTLARDLIE 120
               GTLA+DL++
Sbjct: 399 YVVNGTLAKDLLK 411


>gi|343429654|emb|CBQ73226.1| conserved hypothetical protein [Sporisorium reilianum SRZ2]
          Length = 1039

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 39/136 (28%), Positives = 64/136 (47%), Gaps = 28/136 (20%)

Query: 47  GARNNPFHFKHVKLCHSLAELAK-VP--SPKVVLVSTPDMECGFSRDLFFQWCSSPENSI 103
            A ++P  FK+V++  SL  + + +P    KVVL   P M  G SR L  ++  +P + +
Sbjct: 427 AAASSPLDFKYVRVFPSLQAMDEAIPHDQAKVVLAVPPSMTHGPSRRLLARFAQNPNDVV 486

Query: 104 IITNRTSPGTLARDL---------------------IELGGNRTLTLQVKKRIRLEGEE- 141
           ++ +R  PG+L R+L                     I       L  ++K ++ LEGEE 
Sbjct: 487 VLISRGEPGSLCRELWNAWNTHQSKGFSWAQGKLGQIVTPTKTALRFELKSKVPLEGEEL 546

Query: 142 ---LEEYQKKKDKEAK 154
              LE  Q ++DK A+
Sbjct: 547 RAHLEAEQAERDKLAQ 562


>gi|255724858|ref|XP_002547358.1| hypothetical protein CTRG_01665 [Candida tropicalis MYA-3404]
 gi|240135249|gb|EER34803.1| hypothetical protein CTRG_01665 [Candida tropicalis MYA-3404]
          Length = 783

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 37/110 (33%), Positives = 60/110 (54%), Gaps = 12/110 (10%)

Query: 37  MSDKL-MKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFFQ 94
           M+DK+ + S  G ++NPF  K +K    L++   +  P VV V+TP M + G SR L  +
Sbjct: 300 MNDKIRLSSSSGEKSNPFDLKFIKSIKDLSKFQDM-GPSVV-VATPGMLQAGVSRQLLEK 357

Query: 95  WCSSPENSIIITNRTSPGTLARDL------IELGGNRTLTLQVKKRIRLE 138
           W    +N +I+T  +  GT+A++L      I+   N  LT  + +RI +E
Sbjct: 358 WAPDNKNLVILTGYSVEGTMAKELLKEPTMIQSATNPDLT--IPRRIGIE 405


>gi|392575747|gb|EIW68879.1| hypothetical protein TREMEDRAFT_44189 [Tremella mesenterica DSM
           1558]
          Length = 738

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 27/71 (38%), Positives = 37/71 (52%)

Query: 49  RNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 108
           R+NPF FK+VK            SP VV+ S   M  G SR+L   W    +N +I+T  
Sbjct: 319 RDNPFDFKYVKPLKDGRRGDNFKSPCVVMASAQFMSFGLSRELLEDWAPGEKNGVIVTGY 378

Query: 109 TSPGTLARDLI 119
           +  GT+AR L+
Sbjct: 379 SIEGTMARTLL 389


>gi|396500483|ref|XP_003845730.1| similar to cleavage and polyadenylation specificity factor subunit
           2 [Leptosphaeria maculans JN3]
 gi|312222311|emb|CBY02251.1| similar to cleavage and polyadenylation specificity factor subunit
           2 [Leptosphaeria maculans JN3]
          Length = 954

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 30/92 (32%), Positives = 44/92 (47%), Gaps = 15/92 (16%)

Query: 34  IEWMSDKLMKSFE---GARNN---------PFHFKHVKLCHS---LAELAKVPSPKVVLV 78
           +EWM + ++K FE   GA N          PF FKH+ L      +A +     P+V+L 
Sbjct: 336 LEWMEEGIVKEFEVASGANNGKDDSKAARVPFDFKHITLLERKTRVARMLATSGPRVILA 395

Query: 79  STPDMECGFSRDLFFQWCSSPENSIIITNRTS 110
           S   +E GFS+D      S  +N I++T R  
Sbjct: 396 SDTTLEWGFSKDAIKSLASDEKNLILLTERAG 427


>gi|260815130|ref|XP_002602327.1| hypothetical protein BRAFLDRAFT_282200 [Branchiostoma floridae]
 gi|229287635|gb|EEN58339.1| hypothetical protein BRAFLDRAFT_282200 [Branchiostoma floridae]
          Length = 687

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 44/86 (51%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I  M++K+ K    + +NPF FKH+     +     +  P VV+ S   M+ G SR+LF 
Sbjct: 293 INAMNEKIRKQI--SVSNPFVFKHISNLKGMDHFDDI-GPSVVMASPGMMQSGLSRELFE 349

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
            WC+   N  II      GTLA+ ++
Sbjct: 350 SWCTDRRNGCIIAGYCVEGTLAKHIM 375


>gi|241951638|ref|XP_002418541.1| cleavage and polyadenylation factor specificity complex subunit,
           putative; endonuclease, putative [Candida dubliniensis
           CD36]
 gi|223641880|emb|CAX43843.1| cleavage and polyadenylation factor specificity complex subunit,
           putative [Candida dubliniensis CD36]
          Length = 787

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 37/110 (33%), Positives = 60/110 (54%), Gaps = 12/110 (10%)

Query: 37  MSDKL-MKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFFQ 94
           M+DK+ + S    ++NPF FK +K    L++   +  P VV V+TP M + G SR L  +
Sbjct: 301 MNDKIRLSSASSKKSNPFDFKFIKSIKDLSKFQDM-GPSVV-VATPGMLQAGVSRQLLEK 358

Query: 95  WCSSPENSIIITNRTSPGTLARDL------IELGGNRTLTLQVKKRIRLE 138
           W    +N +I+T  +  GT+A++L      I+   N  LT  + +RI +E
Sbjct: 359 WAPDGKNLVILTGYSVEGTMAKELLKEPTMIQSATNPDLT--IPRRIGIE 406


>gi|67597128|ref|XP_666125.1| cleavage and polyadenylation specifity factor protein
           [Cryptosporidium hominis TU502]
 gi|54657058|gb|EAL35898.1| cleavage and polyadenylation specifity factor protein
           [Cryptosporidium hominis]
          Length = 484

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 33/99 (33%), Positives = 46/99 (46%), Gaps = 17/99 (17%)

Query: 37  MSDKLMKSFEGARN--------------NPFHFKHVKLCHSLAELAKV---PSPKVVLVS 79
           MS K M+ FE   N              NPF F ++K  +SL E+  +   P P VV+ +
Sbjct: 1   MSIKCMRVFETYINQCGESVRRQADLGINPFQFNYIKTVNSLNEIKDIIYNPGPCVVMAA 60

Query: 80  TPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDL 118
              ++ G SRD+F  W     N II+T     GT A +L
Sbjct: 61  PGMLQNGTSRDIFEIWAPDKRNGIILTGYAVRGTPAYEL 99


>gi|269860830|ref|XP_002650133.1| cleavage and polyadenylation specificity factor, 73 kDa subunit
           [Enterocytozoon bieneusi H348]
 gi|220066453|gb|EED43934.1| cleavage and polyadenylation specificity factor, 73 kDa subunit
           [Enterocytozoon bieneusi H348]
          Length = 657

 Score = 52.0 bits (123), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 27/84 (32%), Positives = 48/84 (57%), Gaps = 2/84 (2%)

Query: 37  MSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWC 96
           M ++ M+     RN PF FK+V+     A+  +   P V++ S   ++ G SRD+F +WC
Sbjct: 286 MMNERMQKLNLTRN-PFDFKNVENIKD-AKTVRDGGPCVIMASPGMLQSGVSRDIFERWC 343

Query: 97  SSPENSIIITNRTSPGTLARDLIE 120
           S  +N ++I      GTLA+++++
Sbjct: 344 SDSKNGVVIAGYCVEGTLAKEVLK 367


>gi|429243009|ref|NP_594263.2| mRNA cleavage and polyadenylation specificity factor complex
           endoribonuclease subunit Ysh1 [Schizosaccharomyces pombe
           972h-]
 gi|384872669|sp|O13794.2|YSH1_SCHPO RecName: Full=Endoribonuclease ysh1; AltName: Full=mRNA
           3'-end-processing protein ysh1
 gi|347834169|emb|CAB16227.2| mRNA cleavage and polyadenylation specificity factor complex
           endoribonuclease subunit Ysh1 [Schizosaccharomyces
           pombe]
          Length = 757

 Score = 52.0 bits (123), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 27/85 (31%), Positives = 46/85 (54%), Gaps = 3/85 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+D + K F  A  NPF F+ VK   +L +   +  P V+L S   ++ G SR L  
Sbjct: 293 VNMMNDNIRKIF--AERNPFIFRFVKSLRNLEKFDDI-GPSVILASPGMLQNGVSRTLLE 349

Query: 94  QWCSSPENSIIITNRTSPGTLARDL 118
           +W   P N++++T  +  GT+A+ +
Sbjct: 350 RWAPDPRNTLLLTGYSVEGTMAKQI 374


>gi|328867689|gb|EGG16071.1| beta-lactamase domain-containing protein [Dictyostelium
           fasciculatum]
          Length = 786

 Score = 52.0 bits (123), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 28/86 (32%), Positives = 47/86 (54%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I+ M++++   F  A +NPF FKH+K  + +        P V + S   ++ G SR LF 
Sbjct: 306 IQMMNERIRAQF--AVSNPFIFKHIKDINGIDNFND-NGPCVFMASPGMLQSGLSRQLFE 362

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
           +WCS   N ++I   +  GTLA+ ++
Sbjct: 363 RWCSDRRNGVVIPGYSVEGTLAKHIM 388


>gi|164658265|ref|XP_001730258.1| hypothetical protein MGL_2640 [Malassezia globosa CBS 7966]
 gi|159104153|gb|EDP43044.1| hypothetical protein MGL_2640 [Malassezia globosa CBS 7966]
          Length = 741

 Score = 52.0 bits (123), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 26/71 (36%), Positives = 40/71 (56%), Gaps = 1/71 (1%)

Query: 49  RNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 108
           R+NPF FKHV    SL +      P V++ S   M+ G SR+L  +W     N +I++  
Sbjct: 221 RDNPFVFKHVSNLRSLDKFDD-KGPCVMMASPGFMQSGISRELLERWAPDKRNGVIVSGY 279

Query: 109 TSPGTLARDLI 119
           +  GT+ARD++
Sbjct: 280 SVEGTMARDIL 290


>gi|389746898|gb|EIM88077.1| hypothetical protein STEHIDRAFT_94995 [Stereum hirsutum FP-91666
           SS1]
          Length = 968

 Score = 52.0 bits (123), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 35/132 (26%), Positives = 67/132 (50%), Gaps = 24/132 (18%)

Query: 53  FHFKHVKL---CHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRT 109
             FKH++      +L +      PK++L     +  G SR LF  + S P+N +++T+R 
Sbjct: 388 LRFKHLEFFLNPQALQQTYSSKDPKLILAVPASLSHGPSRSLFADFASIPDNVVLLTSRG 447

Query: 110 SPGTLARDLIE----------------LGGN----RTLTLQVKKRIRLEGEELEEYQKKK 149
             GTL+R L +                +G N     +L+++++ ++ L+G ELE +  ++
Sbjct: 448 EEGTLSRVLFDRWNNAQRAEDKWDKGKIGSNVMMDGSLSIELRSKVALQGTELEIFL-QR 506

Query: 150 DKEAKDKQEKEK 161
           ++ AK+KQ  E+
Sbjct: 507 ERAAKEKQTAEQ 518


>gi|66356658|ref|XP_625507.1| cleavage and polyadenylation specifity factor protein, CPSF
           metallobeta-lactamase [Cryptosporidium parvum Iowa II]
 gi|46226496|gb|EAK87490.1| cleavage and polyadenylation specifity factor protein, CPSF
           metallobeta-lactamase [Cryptosporidium parvum Iowa II]
          Length = 780

 Score = 52.0 bits (123), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 33/99 (33%), Positives = 46/99 (46%), Gaps = 17/99 (17%)

Query: 37  MSDKLMKSFEGARN--------------NPFHFKHVKLCHSLAELAKV---PSPKVVLVS 79
           MS K M+ FE   N              NPF F ++K  +SL E+  +   P P VV+ +
Sbjct: 297 MSIKCMRVFETYINQCGDSVRRQADLGINPFQFNYIKTVNSLNEIKDIIYNPGPCVVMAA 356

Query: 80  TPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDL 118
              ++ G SRD+F  W     N II+T     GT A +L
Sbjct: 357 PGMLQNGTSRDIFEIWAPDKRNGIILTGYAVRGTPAYEL 395


>gi|361125691|gb|EHK97723.1| putative Cleavage factor two protein 2 [Glarea lozoyensis 74030]
          Length = 835

 Score = 52.0 bits (123), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 39/168 (23%), Positives = 66/168 (39%), Gaps = 55/168 (32%)

Query: 34  IEWMSDKLMKSFEGARN----------------NPFHFKHVKLCHSLAELAKVPS----- 72
           +EWM + +++ FE                     PF FKH++L    +++ K+       
Sbjct: 210 LEWMDESIVREFEAVAGQNKQDDDPDAKLRGIGGPFDFKHLRLLERKSQIDKIMQEVDNH 269

Query: 73  ----PKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR-------------------- 108
                KV+L S   +E GFS+++F + C    N +I T R                    
Sbjct: 270 GRSIGKVILASDTSLEWGFSKEVFRRICDDRRNLVIFTERMGQPKMENPKLGMARTLWSW 329

Query: 109 --------TSPGTLARDLIE--LGGNRTLTLQVKKRIRLEGEELEEYQ 146
                    +    + D++E   GG R L ++   R+ LEG++L  YQ
Sbjct: 330 WEDRSDGVATETAASGDVLEQVYGGGRQLEMRETTRVALEGDDLAAYQ 377


>gi|308473777|ref|XP_003099112.1| hypothetical protein CRE_27737 [Caenorhabditis remanei]
 gi|308267766|gb|EFP11719.1| hypothetical protein CRE_27737 [Caenorhabditis remanei]
          Length = 125

 Score = 51.6 bits (122), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 31/118 (26%), Positives = 48/118 (40%), Gaps = 40/118 (33%)

Query: 159 KEKIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMEC 218
           K+ IP H   F+N+ +LSDFK      G   EF+ G L+   G  ++RR         E 
Sbjct: 48  KKLIPIHQAIFVNDPKLSDFKSLFVEKGFKAEFLSGTLLIKGGKCSIRR--------GEM 99

Query: 219 GFSRDLFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
           GFS                                +EG L  +YY+++ L ++Q+ +L
Sbjct: 100 GFS--------------------------------MEGALEKDYYKLRNLFFNQFGVL 125


>gi|336371935|gb|EGO00275.1| hypothetical protein SERLA73DRAFT_73000 [Serpula lacrymans var.
           lacrymans S7.3]
 gi|336384684|gb|EGO25832.1| hypothetical protein SERLADRAFT_437559 [Serpula lacrymans var.
           lacrymans S7.9]
          Length = 748

 Score = 51.6 bits (122), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 31/78 (39%), Positives = 44/78 (56%), Gaps = 11/78 (14%)

Query: 49  RNNPFHFKHVKLCHSLAE-------LAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPEN 101
           R+NPF FKH+    +LA+       +A  P P VVL S   ++ G SR+L   W   P N
Sbjct: 298 RDNPFVFKHIS---NLAQPRGWERKIADGP-PCVVLASPGFLQSGPSRELLELWAPDPRN 353

Query: 102 SIIITNRTSPGTLARDLI 119
            +I+T  +  GTLARD++
Sbjct: 354 GLIVTGYSVEGTLARDIM 371


>gi|219123319|ref|XP_002181974.1| cleavage and polyadenylation specific factor [Phaeodactylum
           tricornutum CCAP 1055/1]
 gi|217406575|gb|EEC46514.1| cleavage and polyadenylation specific factor [Phaeodactylum
           tricornutum CCAP 1055/1]
          Length = 1001

 Score = 51.2 bits (121), Expect = 4e-04,   Method: Composition-based stats.
 Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 5/81 (6%)

Query: 32  AKIEWMSDKLMKSFEG-ARNNPFHFKHVKLCHSLAELAKV----PSPKVVLVSTPDMECG 86
           +++EWM  KL   F+  A  +P    HV +C +  EL K     P+P  V+ S   +E G
Sbjct: 325 SQLEWMGTKLGHVFDAQAGPHPLTLPHVHVCTNTRELEKFLAENPNPACVVASGLSLEGG 384

Query: 87  FSRDLFFQWCSSPENSIIITN 107
            +RDL   W  + +N+I+ T+
Sbjct: 385 PARDLLLSWADNVDNAILFTD 405


>gi|302415331|ref|XP_003005497.1| cleavage and polyadenylation specificity factor subunit 2
           [Verticillium albo-atrum VaMs.102]
 gi|261354913|gb|EEY17341.1| cleavage and polyadenylation specificity factor subunit 2
           [Verticillium albo-atrum VaMs.102]
          Length = 739

 Score = 51.2 bits (121), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 43/167 (25%), Positives = 70/167 (41%), Gaps = 53/167 (31%)

Query: 34  IEWMSDKLMKSFEGARNN----------------PFHFKHVKLCHSLAELAKVPS----- 72
           +EWM D +++ FE   +                 PF F+ ++L    A++ K+ S     
Sbjct: 299 LEWMDDNIVREFEATADGQRKANGNDGKHAKDAAPFDFRFMRLVEREAQIRKLLSQTSEN 358

Query: 73  ----PKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPG-----TLAR------- 116
                +V++ S   +E GFS +L  +      N +I+T++ S       ++AR       
Sbjct: 359 VRSDGRVIVASDNSLEWGFSHELLRELAKDSRNLLILTDKPSLAQSGQPSIARILWDWWQ 418

Query: 117 --------------DLIEL--GGNRTLTLQVKKRIRLEGEELEEYQK 147
                         D IEL  GG R LT+   +R  LEG+EL  YQ+
Sbjct: 419 ERRDGVSIDQSDSNDSIELVYGGGRALTVTDARRQGLEGDELSTYQQ 465


>gi|116203607|ref|XP_001227614.1| hypothetical protein CHGG_09687 [Chaetomium globosum CBS 148.51]
 gi|88175815|gb|EAQ83283.1| hypothetical protein CHGG_09687 [Chaetomium globosum CBS 148.51]
          Length = 956

 Score = 51.2 bits (121), Expect = 5e-04,   Method: Composition-based stats.
 Identities = 42/166 (25%), Positives = 67/166 (40%), Gaps = 52/166 (31%)

Query: 34  IEWMSDKLMKSFEGA----------------RNNPFHFKHVKLCHSLAELAKV------- 70
           +EWM D +++ FE                     PF FK+++L    A++ +V       
Sbjct: 331 LEWMDDSIVREFEAVAGGQRGNGGSGGGKGKDAGPFDFKYLRLLERKAQVERVLQQAADA 390

Query: 71  --PSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTS--PGT--LARDLIE---- 120
             P  +V++ +   +E GFS+++       P N +I+T + S  P T  +AR L E    
Sbjct: 391 SEPKGRVIVATDSSLEWGFSKEVMRAIAGDPRNLVILTEKPSLNPNTPSIARTLWEWWRE 450

Query: 121 -------------------LGGNRTLTLQVKKRIRLEGEELEEYQK 147
                               GG R L +    R  LEG EL+ YQ+
Sbjct: 451 RKDGVAVEQTSNGDTFEQVYGGGRDLEVGHATRQALEGGELDIYQQ 496


>gi|346976151|gb|EGY19603.1| cleavage and polyadenylation specificity factor subunit 2
           [Verticillium dahliae VdLs.17]
          Length = 972

 Score = 51.2 bits (121), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 43/167 (25%), Positives = 70/167 (41%), Gaps = 53/167 (31%)

Query: 34  IEWMSDKLMKSFEGARNN----------------PFHFKHVKLCHSLAELAKVPS----- 72
           +EWM D +++ FE   +                 PF F+ ++L    A++ K+ S     
Sbjct: 333 LEWMDDNIVREFEATADGQRKANGNDGKHAKDAAPFDFRFMRLVEREAQIRKLLSQTSDN 392

Query: 73  ----PKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPG-----TLAR------- 116
                +V++ S   +E GFS+ L  +      N +I+T++ S       ++AR       
Sbjct: 393 VQSEGRVIVASDNSLEWGFSQQLLRELAKDSRNLLILTDKPSLAQSGQPSIARTLWDWWQ 452

Query: 117 --------------DLIEL--GGNRTLTLQVKKRIRLEGEELEEYQK 147
                         D IEL  GG R L++   KR  LEG+EL  YQ+
Sbjct: 453 ERKDGVSIDQSDSNDSIELVYGGGRALSVTDAKRQGLEGDELSTYQQ 499


>gi|302899216|ref|XP_003048005.1| predicted protein [Nectria haematococca mpVI 77-13-4]
 gi|256728937|gb|EEU42292.1| predicted protein [Nectria haematococca mpVI 77-13-4]
          Length = 958

 Score = 50.8 bits (120), Expect = 5e-04,   Method: Composition-based stats.
 Identities = 46/167 (27%), Positives = 72/167 (43%), Gaps = 55/167 (32%)

Query: 34  IEWMSDKLMKSFE----GARN----------NPFHFKHVKLCHSLAELAKVPSP------ 73
           +EWM D +++ FE    G R            PF FK+++L    A++ ++ S       
Sbjct: 332 LEWMDDTIVQEFEAFAEGQRKVNGAGDKKEGGPFDFKYLRLLERKAQIVRLLSRGFENVE 391

Query: 74  ---KVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPG-------TLAR------- 116
              +V+L S   +E GFS+DL         N +I+T++  PG       ++AR       
Sbjct: 392 TEGRVILASDSSIEWGFSKDLIKGLARDSRNLVILTDK--PGLSKDDKPSIARTLWDWWK 449

Query: 117 --------------DLIEL--GGNRTLTLQVKKRIRLEGEELEEYQK 147
                         D IEL   G R L ++  +R  LEG+EL  YQ+
Sbjct: 450 ERKDGVSVEQTSSGDSIELVYAGGRELEIREAQRQALEGDELAVYQQ 496


>gi|189192102|ref|XP_001932390.1| cleavage and polyadenylation specificity factor subunit 2
           [Pyrenophora tritici-repentis Pt-1C-BFP]
 gi|187973996|gb|EDU41495.1| cleavage and polyadenylation specificity factor subunit 2
           [Pyrenophora tritici-repentis Pt-1C-BFP]
          Length = 954

 Score = 50.8 bits (120), Expect = 5e-04,   Method: Composition-based stats.
 Identities = 39/159 (24%), Positives = 67/159 (42%), Gaps = 46/159 (28%)

Query: 34  IEWMSDKLMKSFEGARNN-----------------PFHFKHVKLCHSLAELAKV---PSP 73
           +EWM + ++K FE +  +                 PF F+H+ L      +A++     P
Sbjct: 336 LEWMEEGIVKEFEASAADQDRRTKEGQEEERVAKVPFDFRHITLLERKTRVARMLAGAGP 395

Query: 74  KVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRT-----------------------S 110
           +V+L S   +E GFS+D      S  +N +I+T R+                       S
Sbjct: 396 RVILASDATLEWGFSKDAIRSLASDEKNLVILTERSGELGSQKKGLGRYLWDLWNQRNAS 455

Query: 111 PGTLA--RDLIELGGNRTLTLQVKKRIRLEGEELEEYQK 147
           PG  A    +I+  GN+   L   + + L+G+E+  YQ+
Sbjct: 456 PGEDAPSTTVIDASGNQA-PLDTVRTVALQGDEVPLYQQ 493


>gi|407851025|gb|EKG05159.1| cleavage and polyadenylation specificity factor, putative
           [Trypanosoma cruzi]
          Length = 762

 Score = 50.8 bits (120), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 31/88 (35%), Positives = 46/88 (52%), Gaps = 6/88 (6%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKV--PSPKVVLVSTPDMECGFSRDL 91
           +  M+D++ +     RN PF FK++   HSL E        P VVL S   ++ G S +L
Sbjct: 310 VSAMNDRVKQQHANHRN-PFVFKYI---HSLMETRSFEDTGPCVVLASPGMLQSGISLEL 365

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLI 119
           F +WC    N III      GT+A+D++
Sbjct: 366 FERWCGDRRNGIIIAGYCVDGTIAKDIL 393


>gi|323453344|gb|EGB09216.1| hypothetical protein AURANDRAFT_71470 [Aureococcus anophagefferens]
          Length = 1101

 Score = 50.8 bits (120), Expect = 6e-04,   Method: Composition-based stats.
 Identities = 26/70 (37%), Positives = 40/70 (57%), Gaps = 1/70 (1%)

Query: 50  NNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRT 109
           +NPF F HV+   S+ +L     P VVL +   ++ G SR LF +W SS  N ++I   +
Sbjct: 198 SNPFKFDHVQNLASIDDLDD-SGPVVVLAAPGMLQSGVSRQLFDRWASSERNGVVIAGYS 256

Query: 110 SPGTLARDLI 119
             GTLA+ ++
Sbjct: 257 VEGTLAKQIL 266


>gi|71005902|ref|XP_757617.1| hypothetical protein UM01470.1 [Ustilago maydis 521]
 gi|74703664|sp|Q4PEJ3.1|YSH1_USTMA RecName: Full=Endoribonuclease YSH1; AltName: Full=mRNA
           3'-end-processing protein YSH1
 gi|46097110|gb|EAK82343.1| hypothetical protein UM01470.1 [Ustilago maydis 521]
          Length = 880

 Score = 50.8 bits (120), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 28/86 (32%), Positives = 46/86 (53%), Gaps = 2/86 (2%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I  M+D +   F   R+NPF FKH+    SL +      P V++ S   M+ G SR+L  
Sbjct: 302 IHTMNDHIRTRF-NRRDNPFVFKHISNLRSLEKFEDR-GPCVMMASPGFMQSGVSRELLE 359

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
           +W     N +I++  +  GT+AR+++
Sbjct: 360 RWAPDKRNGLIVSGYSVEGTMARNIL 385


>gi|388852694|emb|CCF53612.1| related to YSH1-component of pre-mRNA polyadenylation factor PF I
           [Ustilago hordei]
          Length = 888

 Score = 50.8 bits (120), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 28/86 (32%), Positives = 46/86 (53%), Gaps = 2/86 (2%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I  M+D +   F   R+NPF FKH+    SL +      P V++ S   M+ G SR+L  
Sbjct: 302 IHTMNDHIRTRFN-RRDNPFVFKHISNLRSLEKFEDR-GPCVMMASPGFMQSGVSRELLE 359

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
           +W     N +I++  +  GT+AR+++
Sbjct: 360 RWAPDKRNGLIVSGYSVEGTMARNIL 385


>gi|50363261|gb|AAT75333.1| cleavage polyadenylation specificity factor CPSF73 [Trypanosoma
           cruzi]
          Length = 762

 Score = 50.8 bits (120), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 31/88 (35%), Positives = 46/88 (52%), Gaps = 6/88 (6%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKV--PSPKVVLVSTPDMECGFSRDL 91
           +  M+D++ +     RN PF FK++   HSL E        P VVL S   ++ G S +L
Sbjct: 310 VSAMNDRVKQQHANHRN-PFVFKYI---HSLMETRSFEDTGPCVVLASPGMLQSGISLEL 365

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLI 119
           F +WC    N III      GT+A+D++
Sbjct: 366 FERWCGDRRNGIIIAGYCVDGTIAKDIL 393


>gi|299752177|ref|XP_001830756.2| mRNA 3'-end-processing protein YSH1 [Coprinopsis cinerea
           okayama7#130]
 gi|298409712|gb|EAU91125.2| mRNA 3'-end-processing protein YSH1 [Coprinopsis cinerea
           okayama7#130]
          Length = 846

 Score = 50.8 bits (120), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 31/75 (41%), Positives = 44/75 (58%), Gaps = 5/75 (6%)

Query: 49  RNNPFHFKHVK-LCHSLA---ELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSII 104
           R+NPF FK++  L  +     ++A+ P P VVL S   M+ G SR+LF  W     N +I
Sbjct: 298 RDNPFVFKYISNLPQTRGWEKKIAEGP-PCVVLASPGFMQVGPSRELFELWAPDARNGLI 356

Query: 105 ITNRTSPGTLARDLI 119
           IT  +  GTLARD++
Sbjct: 357 ITGYSIEGTLARDIM 371


>gi|71661559|ref|XP_817799.1| cleavage and polyadenylation specificity factor [Trypanosoma cruzi
           strain CL Brener]
 gi|70883012|gb|EAN95948.1| cleavage and polyadenylation specificity factor, putative
           [Trypanosoma cruzi]
          Length = 625

 Score = 50.8 bits (120), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 31/88 (35%), Positives = 46/88 (52%), Gaps = 6/88 (6%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKV--PSPKVVLVSTPDMECGFSRDL 91
           +  M+D++ +     RN PF FK++   HSL E        P VVL S   ++ G S +L
Sbjct: 173 VSAMNDRVKQQHANHRN-PFVFKYI---HSLMETRSFEDTGPCVVLASPGMLQSGISLEL 228

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLI 119
           F +WC    N III      GT+A+D++
Sbjct: 229 FERWCGDRRNGIIIAGYCVDGTIAKDIL 256


>gi|169599735|ref|XP_001793290.1| hypothetical protein SNOG_02691 [Phaeosphaeria nodorum SN15]
 gi|160705309|gb|EAT89422.2| hypothetical protein SNOG_02691 [Phaeosphaeria nodorum SN15]
          Length = 957

 Score = 50.8 bits (120), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 45/194 (23%), Positives = 78/194 (40%), Gaps = 58/194 (29%)

Query: 4   KLMKSFEGARNNPFHFKHVKLCHSLAELAKIEWMSDKLMKSFE---------GARN---- 50
           K  K++  +R      ++V+        + +EWM + ++K FE         G RN    
Sbjct: 313 KNAKAYLASRTGGATMRYVR--------SMLEWMGEGIVKEFEAASGAAEGQGQRNVRGA 364

Query: 51  ----------NPFHFKHVKLCHSLAELAKV---PSPKVVLVSTPDMECGFSRDLFFQWCS 97
                      PF F+H+ L    A + ++     P+V+L S   +E GFS+D      S
Sbjct: 365 PGRDDGRGIRTPFDFQHITLLEKKARVTRMLNATEPRVILASDTSLEWGFSKDAIRSLAS 424

Query: 98  SPENSIIITNR-----TSPGTLARDLIELGGNRTL-------------------TLQVKK 133
             +N +I+T R     T    L R L +L   R++                   ++   +
Sbjct: 425 DEKNLVILTERVGELGTQEKGLGRYLWDLWNERSVNSGDDSLDSTMVDVSGQQASISTVR 484

Query: 134 RIRLEGEELEEYQK 147
            + LEG+E+  YQ+
Sbjct: 485 TVALEGDEVPLYQQ 498


>gi|403419016|emb|CCM05716.1| predicted protein [Fibroporia radiculosa]
          Length = 828

 Score = 50.8 bits (120), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 31/73 (42%), Positives = 39/73 (53%), Gaps = 3/73 (4%)

Query: 49  RNNPFHFKHVK-LCHSLAELAKVPS--PKVVLVSTPDMECGFSRDLFFQWCSSPENSIII 105
           R+NPF FKH+  L H+     KV    P VVL S   +  G SR+L   W     N III
Sbjct: 298 RDNPFVFKHISNLPHTRGWERKVADGPPCVVLASPGFVTVGASRELLEMWAPDSRNGIII 357

Query: 106 TNRTSPGTLARDL 118
           T  +  GT+ARD+
Sbjct: 358 TGYSIEGTMARDI 370


>gi|407411604|gb|EKF33594.1| cleavage and polyadenylation specificity factor, putative
           [Trypanosoma cruzi marinkellei]
          Length = 763

 Score = 50.8 bits (120), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 31/88 (35%), Positives = 46/88 (52%), Gaps = 6/88 (6%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKV--PSPKVVLVSTPDMECGFSRDL 91
           +  M+D++ +     RN PF FK++   HSL E        P VVL S   ++ G S +L
Sbjct: 311 VSAMNDRVKQQHANHRN-PFVFKYI---HSLMETRSFEDTGPCVVLASPGMLQSGISLEL 366

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLI 119
           F +WC    N III      GT+A+D++
Sbjct: 367 FERWCGDRRNGIIIAGYCVDGTIAKDIL 394


>gi|443899092|dbj|GAC76423.1| mRNA cleavage and polyadenylation factor II complex, BRR5
           [Pseudozyma antarctica T-34]
          Length = 884

 Score = 50.8 bits (120), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 28/86 (32%), Positives = 46/86 (53%), Gaps = 2/86 (2%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I  M+D +   F   R+NPF FKH+    SL +      P V++ S   M+ G SR+L  
Sbjct: 302 IHTMNDHIRTRFN-RRDNPFVFKHISNLRSLEKFEDR-GPCVMMASPGFMQSGVSRELLE 359

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
           +W     N +I++  +  GT+AR+++
Sbjct: 360 RWAPDKRNGLIVSGYSVEGTMARNIL 385


>gi|343428147|emb|CBQ71677.1| related to YSH1-component of pre-mRNA polyadenylation factor PF I
           [Sporisorium reilianum SRZ2]
          Length = 878

 Score = 50.8 bits (120), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 28/86 (32%), Positives = 46/86 (53%), Gaps = 2/86 (2%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I  M+D +   F   R+NPF FKH+    SL +      P V++ S   M+ G SR+L  
Sbjct: 302 IHTMNDHIRTRFN-RRDNPFVFKHISNLRSLEKFED-RGPCVMMASPGFMQSGVSRELLE 359

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
           +W     N +I++  +  GT+AR+++
Sbjct: 360 RWAPDKRNGLIVSGYSVEGTMARNIL 385


>gi|325186851|emb|CCA21396.1| cleavage and polyadenylation specific factor 3 puta [Albugo
           laibachii Nc14]
          Length = 759

 Score = 50.8 bits (120), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 45/86 (52%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I  M+D++ K    A +NPF F H+    S+ +      P VV+ S   ++ G SR LF 
Sbjct: 292 INMMNDRIRKQI--AVSNPFLFDHISNLKSMDDFDD-SGPCVVMASPGMLQSGVSRQLFE 348

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
           +WCS   N+ +I      GTLA+ ++
Sbjct: 349 RWCSDKRNACLIPGYVVEGTLAKKIL 374


>gi|402080824|gb|EJT75969.1| hypothetical protein GGTG_05894 [Gaeumannomyces graminis var.
           tritici R3-111a-1]
          Length = 974

 Score = 50.8 bits (120), Expect = 7e-04,   Method: Composition-based stats.
 Identities = 49/179 (27%), Positives = 71/179 (39%), Gaps = 57/179 (31%)

Query: 26  HSLAELAK--IEWMSDKLMKSFE------GARNN------------PFHFKHVKLCHSLA 65
           +S   LAK   EWM D +++ FE      G R N            PF FK++++    A
Sbjct: 321 NSTIRLAKSMFEWMDDSIVQEFEAVADQGGKRTNGNTDGGRGRDAGPFDFKYLRVLDRKA 380

Query: 66  ELAKVPSP---------KVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPG---- 112
           ++ KV S          KV+L S   +E GFS+D+  +      N +I+T + S      
Sbjct: 381 QVEKVLSQSSTPNELRGKVILASDTSLEWGFSKDVMARIADDSRNLVILTEKPSASSKEK 440

Query: 113 -TLARDLIE-----------------------LGGNRTLTLQVKKRIRLEGEELEEYQK 147
            ++AR L E                         G R L L   KR  L+  EL  YQ+
Sbjct: 441 PSVARQLWEWWKERRDGTADEASSGPGSAEQIYAGGRELRLMDSKRAALDESELPLYQQ 499


>gi|340383473|ref|XP_003390242.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           3-like [Amphimedon queenslandica]
          Length = 726

 Score = 50.8 bits (120), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 23/70 (32%), Positives = 37/70 (52%), Gaps = 1/70 (1%)

Query: 50  NNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRT 109
           +NPF FKH+    ++     +  P V+L S   M+ G SR LF  WC+   N +++    
Sbjct: 351 SNPFVFKHISSLKNIDNFDDI-GPCVILASPGMMQSGLSRQLFESWCTDKRNGVVVAGYC 409

Query: 110 SPGTLARDLI 119
             GTLA+ ++
Sbjct: 410 VEGTLAKHIL 419


>gi|402696937|gb|AFQ90657.1| 73kDa cleavage and polyadenylation specific factor 3, partial
           [Dibamus sp. JJF-2012]
          Length = 220

 Score = 50.8 bits (120), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 28/77 (36%), Positives = 38/77 (49%), Gaps = 3/77 (3%)

Query: 37  MSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWC 96
           M+DK+ K      NNPF  KH+    S+     +  P VV+ S   M+ G SR+LF  WC
Sbjct: 147 MNDKIRKXX--XINNPFVJKHISNLKSMDHFDDI-GPSVVMASPGMMQSGLSRELFESWC 203

Query: 97  SSPENSIIITNRTSPGT 113
           +   N +II      GT
Sbjct: 204 TDKRNGVIIAGYCVEGT 220


>gi|71654879|ref|XP_816051.1| cleavage and polyadenylation specificity factor [Trypanosoma cruzi
           strain CL Brener]
 gi|70881152|gb|EAN94200.1| cleavage and polyadenylation specificity factor, putative
           [Trypanosoma cruzi]
          Length = 430

 Score = 50.8 bits (120), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 31/88 (35%), Positives = 46/88 (52%), Gaps = 6/88 (6%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKV--PSPKVVLVSTPDMECGFSRDL 91
           +  M+D++ +     RN PF FK++   HSL E        P VVL S   ++ G S +L
Sbjct: 310 VSAMNDRVKQQHANHRN-PFVFKYI---HSLMETRSFEDTGPCVVLASPGMLQSGISLEL 365

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLI 119
           F +WC    N III      GT+A+D++
Sbjct: 366 FERWCGDRRNGIIIAGYCVDGTIAKDIL 393


>gi|330920784|ref|XP_003299151.1| hypothetical protein PTT_10086 [Pyrenophora teres f. teres 0-1]
 gi|311327303|gb|EFQ92764.1| hypothetical protein PTT_10086 [Pyrenophora teres f. teres 0-1]
          Length = 953

 Score = 50.4 bits (119), Expect = 7e-04,   Method: Composition-based stats.
 Identities = 39/159 (24%), Positives = 67/159 (42%), Gaps = 46/159 (28%)

Query: 34  IEWMSDKLMKSFEGARNN-----------------PFHFKHVKLCHSLAELAKV---PSP 73
           +EWM + ++K FE +  +                 PF F+H+ L      +A++     P
Sbjct: 336 LEWMEEGIVKEFEASAADQDRRTKGGKEDERVAKVPFDFRHITLLERKTRVARMLAGAGP 395

Query: 74  KVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRT-----------------------S 110
           +V+L S   +E GFS+D      S  +N +I+T R+                       S
Sbjct: 396 RVILASDATLEWGFSKDAIRTLASDEKNLVILTERSGELGSQKKGLGRYLWDLWNQRNAS 455

Query: 111 PGTLA--RDLIELGGNRTLTLQVKKRIRLEGEELEEYQK 147
           PG  A    +I+  GN+   L   + + L+G+E+  YQ+
Sbjct: 456 PGEDAPSTTVIDASGNQA-PLDTIRTVALQGDEVPLYQQ 493


>gi|255934198|ref|XP_002558380.1| Pc12g15810 [Penicillium chrysogenum Wisconsin 54-1255]
 gi|211582999|emb|CAP81208.1| Pc12g15810 [Penicillium chrysogenum Wisconsin 54-1255]
          Length = 893

 Score = 50.4 bits (119), Expect = 8e-04,   Method: Composition-based stats.
 Identities = 28/96 (29%), Positives = 43/96 (44%), Gaps = 23/96 (23%)

Query: 34  IEWMSDKLMKSFEGARN--------------------NPFHFKHVKLCHSLAELAKV--- 70
           +EWM + +++ FE A +                     PF FKH+K+      L K+   
Sbjct: 344 LEWMDENIVREFEAAESADVTNGQRTGGQDKSTSKGGGPFTFKHLKIIERKKRLEKLLAE 403

Query: 71  PSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIIT 106
           P PKV+L S   M+ GFS+    Q    P N +++T
Sbjct: 404 PGPKVILASDTSMDWGFSKHALRQVAEGPNNLLLMT 439


>gi|307110126|gb|EFN58363.1| hypothetical protein CHLNCDRAFT_142438 [Chlorella variabilis]
          Length = 709

 Score = 50.4 bits (119), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 29/88 (32%), Positives = 42/88 (47%), Gaps = 17/88 (19%)

Query: 32  AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
           A IE M++ + ++F  A  NPF FKH+    S A                  + G SR+L
Sbjct: 297 AYIEMMNEDIKRAFTVA--NPFEFKHISHLKSAAHFD---------------DSGMSREL 339

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLI 119
           F  WC    N ++I +    GTLARD++
Sbjct: 340 FEAWCEDARNCVVIADFAVQGTLARDIL 367


>gi|409044817|gb|EKM54298.1| hypothetical protein PHACADRAFT_146128 [Phanerochaete carnosa
           HHB-10118-sp]
          Length = 869

 Score = 50.4 bits (119), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 31/74 (41%), Positives = 43/74 (58%), Gaps = 5/74 (6%)

Query: 49  RNNPFHFKHVK-LCHSLA---ELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSII 104
           R+NPF FKH+  + HS     ++A+ PS  VVL S   ME G SR+L   W     N +I
Sbjct: 299 RDNPFVFKHISNVPHSRGWERKIAEGPS-CVVLASPGFMESGPSRELLELWAPDSRNGVI 357

Query: 105 ITNRTSPGTLARDL 118
           +T  +  GT+ARD+
Sbjct: 358 LTGYSIEGTMARDI 371


>gi|328773999|gb|EGF84036.1| hypothetical protein BATDEDRAFT_9083 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 669

 Score = 50.4 bits (119), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 24/70 (34%), Positives = 37/70 (52%), Gaps = 1/70 (1%)

Query: 50  NNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRT 109
           +NPF FKH+    S+A+   V  P V++ S   ++ G SR+L   WC    N +II    
Sbjct: 347 SNPFQFKHISNLKSIAQFDDV-GPCVMMASPGMLQSGLSRELLELWCVDKRNGVIIPGYV 405

Query: 110 SPGTLARDLI 119
             GTL + ++
Sbjct: 406 VEGTLGKQIL 415


>gi|403337788|gb|EJY68117.1| Integrator complex subunit 11 [Oxytricha trifallax]
          Length = 771

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 31/110 (28%), Positives = 54/110 (49%), Gaps = 13/110 (11%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W ++K+ K+F     N F F+HVK   + +   K   P V   S   +  G+S  +F 
Sbjct: 300 INWTNEKIKKTFT-KNQNMFQFQHVKTLDTAS--IKSDQPMVCFASPGMLHGGYSLQIFK 356

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELE 143
            W    +N++II     PGT+        GN+ L+ +  K+I ++G +++
Sbjct: 357 DWAGQEKNTLIIPGYCMPGTV--------GNKLLSGE--KKINIDGRDID 396


>gi|115479027|ref|NP_001063107.1| Os09g0397900 [Oryza sativa Japonica Group]
 gi|50252615|dbj|BAD28786.1| putative FEG protein [Oryza sativa Japonica Group]
 gi|113631340|dbj|BAF25021.1| Os09g0397900 [Oryza sativa Japonica Group]
 gi|218202115|gb|EEC84542.1| hypothetical protein OsI_31281 [Oryza sativa Indica Group]
 gi|222641522|gb|EEE69654.1| hypothetical protein OsJ_29268 [Oryza sativa Japonica Group]
          Length = 559

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 31/87 (35%), Positives = 43/87 (49%), Gaps = 6/87 (6%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLF 92
           I W S K+  S+    +NPF FKHV  CH        P P  VL +TP M   GFS ++F
Sbjct: 285 IGWTSQKIKNSY--TVHNPFDFKHV--CHFERSFINNPGP-CVLFATPGMISGGFSLEVF 339

Query: 93  FQWCSSPENSIIITNRTSPGTLARDLI 119
            +W  S +N + +      GT+   L+
Sbjct: 340 KKWAPSEKNLVTLPGYCVAGTIGHKLM 366


>gi|400602286|gb|EJP69888.1| RNA-metabolising metallo-beta-lactamase [Beauveria bassiana ARSEF
           2860]
          Length = 962

 Score = 50.1 bits (118), Expect = 0.001,   Method: Composition-based stats.
 Identities = 46/167 (27%), Positives = 71/167 (42%), Gaps = 55/167 (32%)

Query: 34  IEWMSDKLMKSFEG-----ARNN---------PFHFKHVKLCHSLAELAKVPSP------ 73
           +EWM D +++ FE       R N         PF F+ ++L    A++AK+ +       
Sbjct: 332 LEWMDDSIVQEFEAFAEGQKRTNGNSDKKVGGPFDFRFMRLLDRKAQIAKLLTTAVNNGE 391

Query: 74  ---KVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPG-------TLARDLIEL-- 121
              +V+L S   M+ GFS+DL     S   N +I+T++  PG       ++AR L  L  
Sbjct: 392 SRGRVILASDTCMDWGFSKDLLRGLASDANNVVILTDK--PGLAQENVPSIARTLWSLWK 449

Query: 122 ---------------------GGNRTLTLQVKKRIRLEGEELEEYQK 147
                                 G+  L ++  KR  LEGEEL  YQ+
Sbjct: 450 ERKDGVTTVQTTTGDDLEYVNTGSYDLEIRDAKREPLEGEELTIYQQ 496


>gi|452004821|gb|EMD97277.1| hypothetical protein COCHEDRAFT_1163978 [Cochliobolus
           heterostrophus C5]
          Length = 948

 Score = 50.1 bits (118), Expect = 0.001,   Method: Composition-based stats.
 Identities = 42/158 (26%), Positives = 68/158 (43%), Gaps = 45/158 (28%)

Query: 34  IEWMSDKLMKSFEGA------RNN----------PFHFKHVKLCHSLAELAKVPS---PK 74
           +EWM + ++K FE +      RN           PF F+HV L      +A++ +   P+
Sbjct: 335 LEWMEEGIVKEFEASAADQDRRNKGGKDEDRAKIPFDFRHVTLLERKTRVARMLAADGPR 394

Query: 75  VVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR-----------------------TSP 111
           V+L S   +E GFS+D      S  +N +I+T R                       TSP
Sbjct: 395 VILASDTTLEWGFSKDALRSLASDEKNLVILTERSGELGEKRKGLGRYLWDLWNQRNTSP 454

Query: 112 GTLA--RDLIELGGNRTLTLQVKKRIRLEGEELEEYQK 147
           G  A    +I   G++ +  +  +   LEG+E+  YQ+
Sbjct: 455 GQDAPSTTVINASGDQ-IPWKTVRAAALEGDEVPLYQQ 491


>gi|353239750|emb|CCA71648.1| related to YSH1-component of pre-mRNA polyadenylation factor PF I
           [Piriformospora indica DSM 11827]
          Length = 756

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 37/116 (31%), Positives = 57/116 (49%), Gaps = 10/116 (8%)

Query: 12  ARNNPFHFKHVKLCHSLAE------LAKIEWMSDKLMKSFEGARNNPFHFKHV-KLCHSL 64
           +RN   H   +    SLA+         I  M+ K+ + ++  + NPF FK++  L  + 
Sbjct: 255 SRNPDLHGVPIYYVSSLAKKCMAVYQTNISSMNSKIQERWK-KQENPFVFKYITNLPQTR 313

Query: 65  AELAKVPS--PKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDL 118
               KV    P VVL S   M+ G SR+L   W   P N++I+T  +  GT+ARD+
Sbjct: 314 GAEKKVAEGPPCVVLASPGFMDNGSSRELLELWAPDPRNAVIVTGYSVEGTMARDI 369


>gi|354543719|emb|CCE40441.1| hypothetical protein CPAR2_104770 [Candida parapsilosis]
          Length = 776

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 26/70 (37%), Positives = 44/70 (62%), Gaps = 3/70 (4%)

Query: 52  PFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFFQWCSSPENSIIITNRTS 110
           PF FK++K   +L++ + +  P VV V+TP M + G SR L  +W    +N +I+T  + 
Sbjct: 322 PFDFKYIKSIRNLSKFSDL-GPSVV-VATPGMLQAGVSRQLLEKWAPEQKNLVILTGYSV 379

Query: 111 PGTLARDLIE 120
            GT+A+DL++
Sbjct: 380 EGTMAKDLLK 389


>gi|451853389|gb|EMD66683.1| hypothetical protein COCSADRAFT_35187 [Cochliobolus sativus ND90Pr]
          Length = 948

 Score = 50.1 bits (118), Expect = 0.001,   Method: Composition-based stats.
 Identities = 32/111 (28%), Positives = 51/111 (45%), Gaps = 28/111 (25%)

Query: 34  IEWMSDKLMKSFEGA------RNN----------PFHFKHVKLCHSLAELAKVPS---PK 74
           +EWM + ++K FE +      RN           PF F+H+ L      +A++ +   P+
Sbjct: 335 LEWMEEGIVKEFEASAADQDRRNKGGKDEDRAKIPFDFRHITLLERKTRVARMLAADGPR 394

Query: 75  VVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNR 125
           V+L S   +E GFS+D      S  +N +I+T R+          ELG  R
Sbjct: 395 VILASDTTLEWGFSKDALRSLASDEKNLVILTERSG---------ELGAQR 436


>gi|403346510|gb|EJY72653.1| putative cleavage and polyadenylation specificity factor subunit 2
           [Oxytricha trifallax]
          Length = 853

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 28/100 (28%), Positives = 54/100 (54%), Gaps = 7/100 (7%)

Query: 36  WMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVP-SPKVVLVSTPDMECGFSRDLFFQ 94
           WM+ +  K F+    NP +F+ VK   +L E  K+  SP++++ S      G+++ L ++
Sbjct: 347 WMNFQDNKVFQDIDENPINFQFVKDIFTLDEYRKLEHSPRIIVTSLASFSQGYTKQLIYE 406

Query: 95  WCSSPENSIIITNRTSP----GTLARDLIELGGNRTLTLQ 130
           +   P+N I+    +SP     +LA +L  L G + +T++
Sbjct: 407 FSQVPKNEIVFLQTSSPQVFKDSLAYNL--LRGQKEITMK 444


>gi|145478255|ref|XP_001425150.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124392218|emb|CAK57752.1| unnamed protein product [Paramecium tetraurelia]
          Length = 690

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 23/69 (33%), Positives = 35/69 (50%)

Query: 51  NPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTS 110
           N F F H+     L ++     PKVV+ S   ++ G S+ ++  WC   +N +IIT    
Sbjct: 297 NLFKFNHINTKKHLQDILNNQKPKVVMASPGLLQSGHSKQIYEYWCKDEKNQVIITGPAV 356

Query: 111 PGTLARDLI 119
            GT+A  LI
Sbjct: 357 QGTIAHQLI 365


>gi|357158307|ref|XP_003578085.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           3-II-like [Brachypodium distachyon]
          Length = 553

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 31/87 (35%), Positives = 43/87 (49%), Gaps = 6/87 (6%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLF 92
           I W S K+  S+     NPF FKHV  CH        P P  VL +TP M   GFS ++F
Sbjct: 286 IGWTSQKIKDSY--TVQNPFDFKHV--CHFERSFINDPGP-CVLFATPGMISGGFSLEVF 340

Query: 93  FQWCSSPENSIIITNRTSPGTLARDLI 119
            +W +S +N + +      GT+   L+
Sbjct: 341 KRWATSDKNLVTLPGYCVAGTIGHKLM 367


>gi|330842661|ref|XP_003293292.1| hypothetical protein DICPUDRAFT_158104 [Dictyostelium purpureum]
 gi|325076396|gb|EGC30185.1| hypothetical protein DICPUDRAFT_158104 [Dictyostelium purpureum]
          Length = 789

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 26/86 (30%), Positives = 45/86 (52%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I  M+D++   F+ +  NPF FK++     +        P V + S   ++ G SR LF 
Sbjct: 308 INMMNDRVRAQFDVS--NPFEFKYISNIKGIESFDD-NGPCVFMASPGMLQSGLSRQLFE 364

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
           +WC+S  N ++I   +  GTLA+ ++
Sbjct: 365 RWCTSKRNGVVIPGYSVEGTLAKHIM 390


>gi|72387720|ref|XP_844284.1| cleavage and polyadenylation specificity factor subunit
           [Trypanosoma brucei brucei strain 927/4 GUTat10.1]
 gi|62359436|gb|AAX79873.1| cleavage and polyadenylation specificity factor subunit, putative
           [Trypanosoma brucei]
 gi|70800817|gb|AAZ10725.1| cleavage and polyadenylation specificity factor subunit, putative
           [Trypanosoma brucei brucei strain 927/4 GUTat10.1]
          Length = 770

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 30/88 (34%), Positives = 47/88 (53%), Gaps = 6/88 (6%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKV--PSPKVVLVSTPDMECGFSRDL 91
           +  M+D++ K  E  RN PF FK+++   SL +        P VVL S   ++ G S +L
Sbjct: 311 VSAMNDRVKKQHENHRN-PFVFKYIQ---SLLDTRSFEDTGPCVVLASPGMLQSGISLEL 366

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLI 119
           F +WC    N II+      GT+A+D++
Sbjct: 367 FERWCGDKRNGIIVAGYCVDGTIAKDIL 394


>gi|340521586|gb|EGR51820.1| predicted protein [Trichoderma reesei QM6a]
          Length = 887

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 26/87 (29%), Positives = 48/87 (55%), Gaps = 3/87 (3%)

Query: 35  EWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFF 93
           E M++         +N P+ FK+++   +L     V     V++++P M + G SR+LF 
Sbjct: 352 ERMAEAEASGDSAGKNGPWDFKYIRSLKNLDRFDDVGG--CVMLASPGMLQNGVSRELFE 409

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIE 120
           +W  S +N +IIT  +  GT+AR +++
Sbjct: 410 RWAPSEKNGVIITGYSVEGTMARQIMQ 436


>gi|261327437|emb|CBH10412.1| cleavage and polyadenylation specificity factor subunit, putative
           [Trypanosoma brucei gambiense DAL972]
          Length = 770

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 30/88 (34%), Positives = 47/88 (53%), Gaps = 6/88 (6%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKV--PSPKVVLVSTPDMECGFSRDL 91
           +  M+D++ K  E  RN PF FK+++   SL +        P VVL S   ++ G S +L
Sbjct: 311 VSAMNDRVKKQHENHRN-PFVFKYIQ---SLLDTRSFEDTGPCVVLASPGMLQSGISLEL 366

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLI 119
           F +WC    N II+      GT+A+D++
Sbjct: 367 FERWCGDKRNGIIVAGYCVDGTIAKDIL 394


>gi|32564696|ref|NP_495706.2| Protein F10B5.8 [Caenorhabditis elegans]
 gi|26985793|emb|CAB54223.2| Protein F10B5.8 [Caenorhabditis elegans]
          Length = 608

 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 48/149 (32%), Positives = 63/149 (42%), Gaps = 19/149 (12%)

Query: 12  ARNNPFHFKHVKLCHSLAELAK------IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLA 65
           A N P +F        LAE A       I W ++ + K+F     N F FKH+K      
Sbjct: 265 ALNVPIYFSQ-----GLAERANQYYRLFISWTNENIKKTF--VERNMFEFKHIKPMEKGC 317

Query: 66  ELAKVPSPKVVLVSTPDM-ECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGN 124
           E    P P+V L STP M   G S  +F +WCS P N II+      GT+   +I  G  
Sbjct: 318 E--DQPGPQV-LFSTPGMLHGGQSLKVFKKWCSDPLNMIIMPGYCVAGTVGARVIN-GEK 373

Query: 125 RTLTLQVKKRIRLEGEELEEYQKKKDKEA 153
           +    Q    IRL G E   +    D + 
Sbjct: 374 KIEIDQKMHEIRL-GVEYMSFSAHADAKG 401


>gi|395332776|gb|EJF65154.1| Metallo-hydrolase/oxidoreductase [Dichomitus squalens LYAD-421 SS1]
          Length = 809

 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 34/92 (36%), Positives = 49/92 (53%), Gaps = 9/92 (9%)

Query: 49  RNNPFHFKHVKLCHSLA----ELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSII 104
           R+NPF FKH+           ++A+ P P VVL S   M  G SR+L   W S  +N  I
Sbjct: 298 RDNPFVFKHITNVPGTRGWERKIAEGP-PCVVLASPGFMNSGPSRELLELWASDSKNGCI 356

Query: 105 ITNRTSPGTLARDLI----ELGGNRTLTLQVK 132
           +T  +  GT+ARD++    E+ G +  T+Q K
Sbjct: 357 VTGYSVEGTMARDILNEPSEIVGMKGNTIQRK 388


>gi|367047989|ref|XP_003654374.1| hypothetical protein THITE_2117338 [Thielavia terrestris NRRL 8126]
 gi|347001637|gb|AEO68038.1| hypothetical protein THITE_2117338 [Thielavia terrestris NRRL 8126]
          Length = 1015

 Score = 49.7 bits (117), Expect = 0.001,   Method: Composition-based stats.
 Identities = 40/167 (23%), Positives = 66/167 (39%), Gaps = 53/167 (31%)

Query: 34  IEWMSDKLMKSFE-----------------GARNNPFHFKHVKLCHSLAELAKV------ 70
           +EWM D +++ FE                 G    PF FK+++L    A++ ++      
Sbjct: 331 LEWMDDSIVREFEAVAGGTRGANSGAGGGKGRDAGPFDFKYLRLLERKAQVERILQQEAG 390

Query: 71  ---PSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPG----TLARDLIE--- 120
              P  KV+L +   +E GFS+++         N +++T + +      ++AR L E   
Sbjct: 391 DSEPKGKVILATDASLEWGFSKEVLKAIAGDARNLVVLTEKPNLSHGRTSIARTLWEWWK 450

Query: 121 --------------------LGGNRTLTLQVKKRIRLEGEELEEYQK 147
                                GG R L L    R  LEG+EL  YQ+
Sbjct: 451 ERKDGVAVEQTSSGDTFEQVYGGGRELELTETTRQALEGDELGLYQQ 497


>gi|336373839|gb|EGO02177.1| hypothetical protein SERLA73DRAFT_86401 [Serpula lacrymans var.
           lacrymans S7.3]
 gi|336386654|gb|EGO27800.1| hypothetical protein SERLADRAFT_447017 [Serpula lacrymans var.
           lacrymans S7.9]
          Length = 930

 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 37/128 (28%), Positives = 61/128 (47%), Gaps = 24/128 (18%)

Query: 53  FHFKHVKL---CHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRT 109
             FKH++      +L +      PK++L     +  G SR LF  +   P+N +++T+R 
Sbjct: 384 LRFKHLEFFPNPQALLQTYSSKDPKLILAVPASLSHGPSRLLFSDFAVVPDNVVLLTSRG 443

Query: 110 SPGTLARDLIE----------------LGGN----RTLTLQVKKRIRLEGEELEEYQKKK 149
             GTL R L +                +G N     T+ L++  +I L+G ELEEY   K
Sbjct: 444 EEGTLGRILFDKWNDSQRADDKWDKGKIGSNIMMDGTMKLKINSKIPLQGAELEEYL-AK 502

Query: 150 DKEAKDKQ 157
           ++ AK+K+
Sbjct: 503 ERVAKEKE 510


>gi|358378169|gb|EHK15851.1| hypothetical protein TRIVIDRAFT_65314 [Trichoderma virens Gv29-8]
          Length = 873

 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 26/87 (29%), Positives = 48/87 (55%), Gaps = 3/87 (3%)

Query: 35  EWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFF 93
           E M++         +N P+ FK+++   +L     V     V++++P M + G SR+LF 
Sbjct: 330 ERMAEAEASGDAAGKNGPWDFKYIRSLKNLDRFDDVGG--CVMLASPGMLQNGVSRELFE 387

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIE 120
           +W  S +N +IIT  +  GT+AR +++
Sbjct: 388 RWAPSEKNGVIITGYSVEGTMARQIMQ 414


>gi|409049761|gb|EKM59238.1| hypothetical protein PHACADRAFT_249539 [Phanerochaete carnosa
           HHB-10118-sp]
          Length = 951

 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 37/136 (27%), Positives = 65/136 (47%), Gaps = 27/136 (19%)

Query: 53  FHFKHVKLCHSLAELAKVPS---PKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRT 109
             F H+++  + A + +  S   PK++L     +  G SR LF ++   P+N +++T R 
Sbjct: 394 LRFPHLEIFPNPAAMMQRYSSKDPKLILAVPSSLSHGPSRALFSEFAEIPDNVVLLTGRG 453

Query: 110 SPGTLARDLIE----------------LGGN----RTLTLQVKKRIRLEGEELEEY---- 145
             GTL R L E                +G N     TL L++  ++ L+G ELEE+    
Sbjct: 454 EEGTLGRILFERWDNSQRDDTKWDRGKIGNNVMMDGTLHLKISSKVPLQGAELEEHLARE 513

Query: 146 QKKKDKEAKDKQEKEK 161
           +  K++EA  K  +++
Sbjct: 514 RAAKEREAAKKAAEDR 529


>gi|358396914|gb|EHK46289.1| hypothetical protein TRIATDRAFT_132454 [Trichoderma atroviride IMI
           206040]
          Length = 881

 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 27/79 (34%), Positives = 48/79 (60%), Gaps = 4/79 (5%)

Query: 44  SFEGA-RNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFFQWCSSPEN 101
           S +GA +N P+ FK+++   +L     V     V++++P M + G SR+LF +W  S +N
Sbjct: 334 SGDGAGKNGPWDFKYIRSLKNLDRFDDVGG--CVMLASPGMLQNGVSRELFERWAPSEKN 391

Query: 102 SIIITNRTSPGTLARDLIE 120
            +IIT  +  GT+AR +++
Sbjct: 392 GVIITGYSVEGTMARQIMQ 410


>gi|242053629|ref|XP_002455960.1| hypothetical protein SORBIDRAFT_03g028040 [Sorghum bicolor]
 gi|241927935|gb|EES01080.1| hypothetical protein SORBIDRAFT_03g028040 [Sorghum bicolor]
          Length = 558

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 33/87 (37%), Positives = 42/87 (48%), Gaps = 6/87 (6%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLF 92
           I W S K+  S   A +NPF FKHV  CH        P P  VL +TP M   GFS + F
Sbjct: 285 IGWTSQKIKDSH--AVHNPFDFKHV--CHFERSFINNPGP-CVLFATPGMISGGFSLEAF 339

Query: 93  FQWCSSPENSIIITNRTSPGTLARDLI 119
            +W  S +N I +      GT+   L+
Sbjct: 340 KKWAPSEKNLITLPGYCVSGTIGHKLM 366


>gi|357440001|ref|XP_003590278.1| Cleavage and polyadenylation specificity factor subunit [Medicago
           truncatula]
 gi|355479326|gb|AES60529.1| Cleavage and polyadenylation specificity factor subunit [Medicago
           truncatula]
          Length = 196

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 30/113 (26%), Positives = 48/113 (42%), Gaps = 30/113 (26%)

Query: 164 PHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGFSRD 223
           PH +  + +L+L+DFKQ L   G+  EF  G L C    V VR+V               
Sbjct: 114 PHKSVLVGDLKLADFKQFLSTKGVPVEFAGGALRCGE-YVTVRKVG-------------- 158

Query: 224 LFFQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                              +   S +  I++EG L ++YY+++  LY Q+ +L
Sbjct: 159 ---------------DATQKGAGSGTQQIIIEGPLCEDYYKIRDYLYSQFYLL 196


>gi|68471691|ref|XP_720152.1| hypothetical protein CaO19.7957 [Candida albicans SC5314]
 gi|68471954|ref|XP_720020.1| hypothetical protein CaO19.325 [Candida albicans SC5314]
 gi|46441870|gb|EAL01164.1| hypothetical protein CaO19.325 [Candida albicans SC5314]
 gi|46442007|gb|EAL01300.1| hypothetical protein CaO19.7957 [Candida albicans SC5314]
          Length = 931

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 39/147 (26%), Positives = 67/147 (45%), Gaps = 24/147 (16%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECG-FSRDLF 92
           ++WMS    K +E   + PF+   V L    +EL K+  PK+V  S  D+  G  S + F
Sbjct: 286 LDWMSKSFTKEWEELSSVPFNPSKVDLLLDPSELLKLSGPKIVFCSGIDLRSGDISAEAF 345

Query: 93  FQWCSSPENSIIITNRTS----------------------PGTLARDLIELGGNRTLTLQ 130
              C+    +II+T +T+                       G  + D I +  ++ ++L+
Sbjct: 346 QYLCNDEHTTIILTEKTTMNFASSLSSVLYTEWDSLAKKRGGGESEDGIAVPIDKNISLK 405

Query: 131 -VKKRIRLEGEELEEYQKKKDKEAKDK 156
              K + L G EL E+Q+K  ++ K+K
Sbjct: 406 NWTKEVELTGTELTEFQEKVAQKRKEK 432


>gi|268530366|ref|XP_002630309.1| Hypothetical protein CBG00745 [Caenorhabditis briggsae]
          Length = 637

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 43/136 (31%), Positives = 60/136 (44%), Gaps = 27/136 (19%)

Query: 12  ARNNPFHFKHVKLCHSLAELAK------IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLA 65
           A N P +F        LAE A       I W ++ + K+F     N F FKH++      
Sbjct: 291 ALNVPIYFSQ-----GLAERANQYYRLFISWTNENIKKTF--VERNMFEFKHIRPMEKGC 343

Query: 66  ELAKVPSPKVVLVSTPDM-ECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGN 124
           E    P P+V L STP M   G S  +F +WCS P N II+      GT+   +I     
Sbjct: 344 E--DQPGPQV-LFSTPGMLHGGQSLKVFKKWCSDPLNMIIMPGYCVAGTVGARVI----- 395

Query: 125 RTLTLQVKKRIRLEGE 140
                  +KRI ++G+
Sbjct: 396 -----NGEKRIEIDGK 406


>gi|238880762|gb|EEQ44400.1| conserved hypothetical protein [Candida albicans WO-1]
          Length = 931

 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 39/147 (26%), Positives = 67/147 (45%), Gaps = 24/147 (16%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECG-FSRDLF 92
           ++WMS    K +E   + PF+   V L    +EL K+  PK+V  S  D+  G  S + F
Sbjct: 286 LDWMSKSFTKEWEELSSVPFNPSKVDLLLDPSELLKLSGPKIVFCSGIDLRSGDISAEAF 345

Query: 93  FQWCSSPENSIIITNRTS----------------------PGTLARDLIELGGNRTLTLQ 130
              C+    +II+T +T+                       G  + D I +  ++ ++L+
Sbjct: 346 QYLCNDERTTIILTEKTTMNFASSLSSVLYTEWDSLAKKRGGGESEDGIAVPIDKNISLK 405

Query: 131 -VKKRIRLEGEELEEYQKKKDKEAKDK 156
              K + L G EL E+Q+K  ++ K+K
Sbjct: 406 NWTKEVELTGTELTEFQEKVAQKRKEK 432


>gi|341890123|gb|EGT46058.1| hypothetical protein CAEBREN_05882 [Caenorhabditis brenneri]
          Length = 618

 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 35/108 (32%), Positives = 52/108 (48%), Gaps = 16/108 (14%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLF 92
           I W ++ + K+F     N F FKH++      E   +P P+V L STP M   G S  +F
Sbjct: 288 ISWTNENIKKTF--VERNMFEFKHIRPMEKGCE--DMPGPQV-LFSTPGMLHGGQSLKVF 342

Query: 93  FQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGE 140
            +WCS P N II+      GT+   +I            +KRI ++G+
Sbjct: 343 KKWCSDPINMIIMPGYCVAGTVGAKVI----------NGEKRIEIDGK 380


>gi|327288530|ref|XP_003228979.1| PREDICTED: integrator complex subunit 11-like [Anolis carolinensis]
          Length = 600

 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 29/105 (27%), Positives = 51/105 (48%), Gaps = 6/105 (5%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K        A  P P VV  +   +  G S  +F 
Sbjct: 283 ITWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFK 338

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           +W  + +N +I+      GT+   +  LGG R L ++ ++ + ++
Sbjct: 339 KWAGNEKNMVIMPGYCVQGTVGHKI--LGGQRKLEMEGRQILEVK 381


>gi|149245028|ref|XP_001527048.1| hypothetical protein LELG_01877 [Lodderomyces elongisporus NRRL
           YB-4239]
 gi|146449442|gb|EDK43698.1| hypothetical protein LELG_01877 [Lodderomyces elongisporus NRRL
           YB-4239]
          Length = 812

 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 25/70 (35%), Positives = 44/70 (62%), Gaps = 3/70 (4%)

Query: 52  PFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFFQWCSSPENSIIITNRTS 110
           PF FK++K   +L++ + +  P VV V+TP M + G SR L  +W    +N +I+T  + 
Sbjct: 335 PFDFKYIKSIKNLSKFSDL-GPSVV-VATPGMLQAGVSRQLLEKWAPEQKNLVILTGYSV 392

Query: 111 PGTLARDLIE 120
            GT+A+D+++
Sbjct: 393 EGTMAKDIMK 402


>gi|409079696|gb|EKM80057.1| hypothetical protein AGABI1DRAFT_72888 [Agaricus bisporus var.
           burnettii JB137-S8]
 gi|426198540|gb|EKV48466.1| hypothetical protein AGABI2DRAFT_220282 [Agaricus bisporus var.
           bisporus H97]
          Length = 919

 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 34/128 (26%), Positives = 61/128 (47%), Gaps = 24/128 (18%)

Query: 53  FHFKHVKL---CHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRT 109
             FKH++      +L +      PK++L     +  G SR+LF  +   P+N +++T R 
Sbjct: 384 LRFKHLEFFPNPQALLQTYSSKDPKLILAVPASLSHGPSRNLFVDFAVVPDNVVLLTGRG 443

Query: 110 SPGTLARDLI----------------ELGGNRTLT----LQVKKRIRLEGEELEEYQKKK 149
             G+L R L                 ++G N  L     ++++ ++ L+G ELE Y  ++
Sbjct: 444 EEGSLGRALFNKWNDRQRVDDKWDKGKIGSNIMLDGGFRMKMRSKVPLQGAELEAYL-QQ 502

Query: 150 DKEAKDKQ 157
           +KE KDK+
Sbjct: 503 EKEKKDKE 510


>gi|123476407|ref|XP_001321376.1| hypothetical protein [Trichomonas vaginalis G3]
 gi|121904201|gb|EAY09153.1| hypothetical protein TVAG_363680 [Trichomonas vaginalis G3]
          Length = 700

 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 33/86 (38%), Positives = 45/86 (52%), Gaps = 1/86 (1%)

Query: 75  VVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKR 134
           VV+ ST  +E GFSR LF    +S +N II T R  P +LA  L     +RT    +K R
Sbjct: 320 VVISSTDTLERGFSRKLFLDKANS-DNLIIFTQREPPYSLAEALRTNNAHRTFRFIIKHR 378

Query: 135 IRLEGEELEEYQKKKDKEAKDKQEKE 160
             L GEEL ++ +K+    +   E E
Sbjct: 379 EPLTGEELVKFMEKQSALQEKANEIE 404


>gi|341903207|gb|EGT59142.1| hypothetical protein CAEBREN_31222 [Caenorhabditis brenneri]
          Length = 571

 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 35/108 (32%), Positives = 52/108 (48%), Gaps = 16/108 (14%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLF 92
           I W ++ + K+F     N F FKH++      E   +P P+V L STP M   G S  +F
Sbjct: 291 ISWTNENIKKTF--VERNMFEFKHIRPMEKGCE--DMPGPQV-LFSTPGMLHGGQSLKVF 345

Query: 93  FQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGE 140
            +WCS P N II+      GT+   +I            +KRI ++G+
Sbjct: 346 KKWCSDPINMIIMPGYCVAGTVGAKVI----------NGEKRIEIDGK 383


>gi|302679538|ref|XP_003029451.1| hypothetical protein SCHCODRAFT_59058 [Schizophyllum commune H4-8]
 gi|300103141|gb|EFI94548.1| hypothetical protein SCHCODRAFT_59058 [Schizophyllum commune H4-8]
          Length = 786

 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 37/118 (31%), Positives = 55/118 (46%), Gaps = 11/118 (9%)

Query: 3   DKLMKSFEGARNNPFHFKHVKLCHSLAEL-AKIEWMSDKLMKSFEGARNNPFHFKHVKLC 61
           D+  K      N P ++       S+A     I  M+  +   F   R+NPF FK     
Sbjct: 252 DEYWKKHPDLHNVPIYYASGLARKSMAVYQTYIHTMNSNIRSRF-AKRDNPFVFK----- 305

Query: 62  HSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLI 119
               ++A+ P P VVL +   M+ G SR+LF  W     N +I+T  +  GTLARD++
Sbjct: 306 ---CKIAEGP-PCVVLATPGFMQTGSSRELFELWAPDSRNGLIVTGYSVEGTLARDIM 359


>gi|154322621|ref|XP_001560625.1| hypothetical protein BC1G_00653 [Botryotinia fuckeliana B05.10]
 gi|347837188|emb|CCD51760.1| similar to cleavage and polyadenylation specifity factor
           [Botryotinia fuckeliana]
          Length = 828

 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 26/87 (29%), Positives = 47/87 (54%), Gaps = 3/87 (3%)

Query: 35  EWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFF 93
           E M++    S  G R  P+ FK+++   +L     V     V++++P M + G SR L  
Sbjct: 315 ERMAEAEANSTSGGRGGPWDFKYIRSLKNLDRFDDVGG--CVILASPGMLQNGISRQLLE 372

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIE 120
           +W  S +N +IIT  +  GT+A+ +++
Sbjct: 373 RWAPSDKNGVIITGYSVEGTMAKQIMQ 399


>gi|281206064|gb|EFA80253.1| beta-lactamase domain-containing protein [Polysphondylium pallidum
           PN500]
          Length = 656

 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 26/86 (30%), Positives = 44/86 (51%), Gaps = 3/86 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I  M++++   F+   +NPF FKH++    +        P V + S   ++ G SR LF 
Sbjct: 191 INMMNERIRAQFD--LSNPFSFKHIENISGIERFTD-DGPCVFMASPGMLQSGLSRQLFE 247

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
           +WCS   N ++I      GTLA+ ++
Sbjct: 248 RWCSDKMNGVVIPGYNVEGTLAKHIM 273


>gi|169861678|ref|XP_001837473.1| cleavage and polyadenylation specificity factor subunit
           [Coprinopsis cinerea okayama7#130]
 gi|116501494|gb|EAU84389.1| cleavage and polyadenylation specificity factor subunit
           [Coprinopsis cinerea okayama7#130]
          Length = 926

 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 35/132 (26%), Positives = 62/132 (46%), Gaps = 24/132 (18%)

Query: 53  FHFKHVKL---CHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRT 109
             FKH++      +L +      PK++L     +  G SR LF  + + P+N +++T R 
Sbjct: 391 LRFKHLEFFPNPQALLQRHSSKDPKLILAVPASLSHGPSRQLFADFAAVPDNVVLLTTRG 450

Query: 110 SPGTLARDLIE----------------LGGNRTLTLQVK----KRIRLEGEELEEYQKKK 149
           + GTL R L +                +G N  +   +K     ++ L+G ELEEY   K
Sbjct: 451 AEGTLGRALFDKWNNSQRGDDKWDKGRIGRNVMMDGAIKIKMYHKVPLQGAELEEYL-AK 509

Query: 150 DKEAKDKQEKEK 161
           ++ AK+K+  ++
Sbjct: 510 ERAAKEKEAAQQ 521


>gi|358333242|dbj|GAA51791.1| cleavage and polyadenylation specificity factor subunit 3
           [Clonorchis sinensis]
          Length = 697

 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 35/125 (28%), Positives = 53/125 (42%), Gaps = 25/125 (20%)

Query: 12  ARNNPFHFKHVK-----LCHSLAE-----LAKIEWMSDK-----LMKSFEGARNNPFHFK 56
           A NNPF F+H+        +S++E     LA   W+        ++ +     N P   +
Sbjct: 242 ANNNPFCFRHISNLKAMRSYSISEQTEHALASKAWLYVAYSRFPVIGTVAAGTNVPTSIE 301

Query: 57  HVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLAR 116
           H               P VV+ S   M+ G SR+LF  WC+   N +II      GTLA+
Sbjct: 302 HFDDS----------GPCVVMASPGMMQSGMSRELFENWCTDRRNGVIIAGYCVEGTLAK 351

Query: 117 DLIEL 121
            ++ L
Sbjct: 352 QILSL 356


>gi|294953157|ref|XP_002787623.1| hypothetical protein Pmar_PMAR012397 [Perkinsus marinus ATCC 50983]
 gi|239902647|gb|EER19419.1| hypothetical protein Pmar_PMAR012397 [Perkinsus marinus ATCC 50983]
          Length = 802

 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 28/77 (36%), Positives = 39/77 (50%), Gaps = 1/77 (1%)

Query: 61  CHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLI 119
           C    ++   P    V+++ P M + G SR+LF QW   P+N +IIT  +  GTLA DL 
Sbjct: 6   CKEWEDVVLAPGMPCVVMAAPGMLQSGTSRELFEQWAPDPKNGVIITGYSVSGTLAHDLQ 65

Query: 120 ELGGNRTLTLQVKKRIR 136
                 TLT   K  +R
Sbjct: 66  NDPDTLTLTDGRKLPVR 82


>gi|156064885|ref|XP_001598364.1| conserved hypothetical protein [Sclerotinia sclerotiorum 1980]
 gi|154691312|gb|EDN91050.1| conserved hypothetical protein [Sclerotinia sclerotiorum 1980
           UF-70]
          Length = 820

 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 26/87 (29%), Positives = 47/87 (54%), Gaps = 3/87 (3%)

Query: 35  EWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFF 93
           E M++    S  G R  P+ FK+++   +L     V     V++++P M + G SR L  
Sbjct: 315 ERMAEAEANSTSGGRGGPWDFKYIRSLKNLDRFDDVGG--CVILASPGMLQNGISRQLLE 372

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIE 120
           +W  S +N +IIT  +  GT+A+ +++
Sbjct: 373 RWAPSDKNGVIITGYSVEGTMAKQIMQ 399


>gi|145507230|ref|XP_001439570.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124406765|emb|CAK72173.1| unnamed protein product [Paramecium tetraurelia]
          Length = 484

 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 27/84 (32%), Positives = 43/84 (51%), Gaps = 4/84 (4%)

Query: 36  WMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQW 95
           W S+K+ +SF    +N F+FK+++       L K P   V+L +   +  G S  +F +W
Sbjct: 272 WESEKIQQSF--LDDNIFNFKYIQPFDR--NLIKSPLSMVLLATPGMLHGGLSMQVFKEW 327

Query: 96  CSSPENSIIITNRTSPGTLARDLI 119
           C    NS++I     PGTL   L+
Sbjct: 328 CGCANNSLVIPGYCVPGTLGNKLL 351


>gi|449016323|dbj|BAM79725.1| cleavage and polyadenylation specifity factor protein
           [Cyanidioschyzon merolae strain 10D]
          Length = 749

 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 26/70 (37%), Positives = 37/70 (52%), Gaps = 1/70 (1%)

Query: 50  NNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRT 109
            NPF FK+V    SL E      P V + S   ++ G SR LF +WCS   N +I+   +
Sbjct: 302 GNPFAFKYVMNIRSLDEFED-SGPCVFMASPGMLQSGMSRRLFEKWCSDRRNGVILPGYS 360

Query: 110 SPGTLARDLI 119
             GTLA+ ++
Sbjct: 361 VQGTLAKYIL 370


>gi|443694305|gb|ELT95478.1| hypothetical protein CAPTEDRAFT_151615 [Capitella teleta]
          Length = 600

 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 29/105 (27%), Positives = 50/105 (47%), Gaps = 6/105 (5%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+  +F   + N F FKH+K    +   A  P P VV  +   +  G S  +F 
Sbjct: 283 ITWTNQKIKNTF--VQRNMFDFKHIKPFDKV--YADNPGPMVVFATPGMLHGGLSLQIFK 338

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           +WC   +N +I+      GT+   +  L G R + ++ K+ I ++
Sbjct: 339 KWCGGEKNMVIMPGYCVSGTIGWKI--LNGQRKIEMENKQIIEVK 381


>gi|342882935|gb|EGU83499.1| hypothetical protein FOXB_05909 [Fusarium oxysporum Fo5176]
          Length = 950

 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 41/165 (24%), Positives = 71/165 (43%), Gaps = 51/165 (30%)

Query: 34  IEWMSDKLMKSFE----------GARN----NPFHFKHVKLCHSLAELAKVPS------- 72
           +EWM + +++ FE          GA +     PF FK+++L    A++A++ S       
Sbjct: 332 LEWMDESIVQEFEAFAEGQRKVNGANDKKEGGPFDFKYLRLLERKAQIARLLSQNPDNVS 391

Query: 73  --PKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITN-----RTSPGTLARDLIE----- 120
              +V+L S   +E GFS+DL         N +I+T+     +T   ++AR L +     
Sbjct: 392 TEGRVILASDSSIEWGFSKDLIKGLARDSRNLVILTDKPGLSKTDKPSIARTLWDWWKER 451

Query: 121 ------------------LGGNRTLTLQVKKRIRLEGEELEEYQK 147
                               G R L ++  +R  L+G+EL  YQ+
Sbjct: 452 KDGVSVEQNANGENLEFVYAGGRELEIREAQRHALDGDELALYQQ 496


>gi|389601462|ref|XP_001565522.2| putative cleavage and polyadenylation specificity factor
           [Leishmania braziliensis MHOM/BR/75/M2904]
 gi|322505052|emb|CAM39016.2| putative cleavage and polyadenylation specificity factor
           [Leishmania braziliensis MHOM/BR/75/M2904]
          Length = 829

 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 32/113 (28%), Positives = 53/113 (46%), Gaps = 13/113 (11%)

Query: 55  FKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSS----PENSIIITNRTS 110
           F +V  C S  E+  +  PK+ +     ++ G S +L   +        ++ I++T    
Sbjct: 310 FANVLTCRSAEEVLPIQGPKICVADGASLDFGPSAELLEYFVKGNRDGADHLIVLTEPPL 369

Query: 111 PGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEY---------QKKKDKEAK 154
           PGT A  +   G    L  Q+ +R RL GEELEEY         Q++++ EA+
Sbjct: 370 PGTNATVVTAAGDGERLHFQITRRSRLSGEELEEYYIDLEHDVEQRRRELEAQ 422


>gi|401885166|gb|EJT49292.1| cleavage and polyadenylation specificity factor subunit
           [Trichosporon asahii var. asahii CBS 2479]
          Length = 958

 Score = 48.5 bits (114), Expect = 0.003,   Method: Composition-based stats.
 Identities = 43/142 (30%), Positives = 63/142 (44%), Gaps = 27/142 (19%)

Query: 47  GARNNPFHFKHVKLCHSLAEL-AKVPS--PKVVLVSTPDMECGFSRDLFFQWCSSPENSI 103
           G+      F HV+   +  EL  K P+  PK+VL   P M  G SR LF    S P N +
Sbjct: 338 GSEYGALDFSHVQFFATPEELLEKYPANRPKLVLAIPPTMSHGPSRTLFASMASVPGNVV 397

Query: 104 IITNRTSPGTLARDLIELG---------------GNRT-----LTLQVKKRIRLEGEELE 143
           ++T      TLAR+L                   G+ T     L L++  +  L GEELE
Sbjct: 398 LLTGHGEDRTLARELYARWEAHQDEGAHYGHGKIGHATPMEGRLELELDAKEPLSGEELE 457

Query: 144 EYQ----KKKDKEAKDKQEKEK 161
            Y+    +K+++EA  +   E+
Sbjct: 458 AYETAEREKREREAAHQAALER 479


>gi|358394479|gb|EHK43872.1| hypothetical protein TRIATDRAFT_79096 [Trichoderma atroviride IMI
           206040]
          Length = 957

 Score = 48.1 bits (113), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 43/165 (26%), Positives = 69/165 (41%), Gaps = 51/165 (30%)

Query: 34  IEWMSDKLMKSFE----GARNN----------PFHFKHVKLCHSLAELAKVPS------- 72
           +EWM + +++ FE    G R            PF FK+++L    A++AK+ S       
Sbjct: 332 LEWMDNNIVQEFEAFAEGQRKTNGGSEKKEGAPFDFKYLRLLERKAQIAKLLSQSIENGE 391

Query: 73  --PKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR-----TSPGTLARDLIE----- 120
              +V+L S   M+ GFS+DL         N +I+T R     T   +++R + E     
Sbjct: 392 TQGRVILASDVSMDWGFSKDLIKGLAKDTRNLVILTERPSLANTDAPSISRMMWEWWKER 451

Query: 121 ------------------LGGNRTLTLQVKKRIRLEGEELEEYQK 147
                               G R L ++  +R  LEG+EL  YQ+
Sbjct: 452 RDGISTEHASNGDSLETIYSGGRELEVREARREPLEGDELAIYQQ 496


>gi|452819966|gb|EME27015.1| cleavage and polyadenylation specifity factor protein [Galdieria
           sulphuraria]
          Length = 717

 Score = 48.1 bits (113), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 37/120 (30%), Positives = 61/120 (50%), Gaps = 13/120 (10%)

Query: 23  KLCHSLAELAKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPD 82
           K C S+ +   I  M+D + K +E +  NPF FK+V    ++ +      P V + S   
Sbjct: 288 KRCMSVYQ-TYINMMNDNIRKRYEVS--NPFAFKYVLNVKNIQDFDD-SGPCVFMASPGM 343

Query: 83  MECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEEL 142
           ++ G SR+L  +WC+   N II+   +  GTLA+ +  L    T+T       RL+G+E+
Sbjct: 344 LQSGLSRELCERWCTDRRNGIILPGYSVEGTLAKHI--LSEPSTIT-------RLDGKEV 394


>gi|449549925|gb|EMD40890.1| hypothetical protein CERSUDRAFT_111471 [Ceriporiopsis subvermispora
           B]
          Length = 934

 Score = 48.1 bits (113), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 33/120 (27%), Positives = 55/120 (45%), Gaps = 23/120 (19%)

Query: 53  FHFKHVKL---CHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRT 109
             F+H++      +L +      PK++L     +  G SR LF Q+   P+N +++T R+
Sbjct: 387 LRFRHLEFFPNPQALMQTYSSKDPKLILAVPATLSHGPSRALFTQFAEMPDNVVLLTGRS 446

Query: 110 SPGTLARDLIE----------------LGGN----RTLTLQVKKRIRLEGEELEEYQKKK 149
             GTL R L +                +G N     TL L++  ++ L+G ELE Y  K+
Sbjct: 447 EEGTLGRILFDRWNAAQRDEAKWDRGKIGSNVMMDGTLRLKMNSKVPLQGAELEVYLAKE 506


>gi|294931418|ref|XP_002779869.1| hypothetical protein Pmar_PMAR016530 [Perkinsus marinus ATCC 50983]
 gi|239889578|gb|EER11664.1| hypothetical protein Pmar_PMAR016530 [Perkinsus marinus ATCC 50983]
          Length = 80

 Score = 48.1 bits (113), Expect = 0.004,   Method: Composition-based stats.
 Identities = 26/69 (37%), Positives = 36/69 (52%), Gaps = 1/69 (1%)

Query: 61  CHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLI 119
           C    ++   P    V+++ P M + G SR+LF QW   P+N +IIT  +  GTLA DL 
Sbjct: 6   CKEWEDVVLAPGMPCVVMAAPGMLQSGTSRELFEQWAPDPKNGVIITGYSVSGTLAHDLQ 65

Query: 120 ELGGNRTLT 128
                 TLT
Sbjct: 66  NDPDTLTLT 74


>gi|345563625|gb|EGX46611.1| hypothetical protein AOL_s00097g515 [Arthrobotrys oligospora ATCC
           24927]
          Length = 791

 Score = 48.1 bits (113), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 36/153 (23%), Positives = 65/153 (42%), Gaps = 20/153 (13%)

Query: 50  NNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFFQWCSSPENSIIITNR 108
            NPF  + V+   SL     V     V+++ P M + G SR+L  +WC  P+N +I+T  
Sbjct: 330 TNPFEMRWVRSLKSLDRFDDVGG--CVMLAAPGMMQNGVSRELLERWCPDPKNGVILTGY 387

Query: 109 TSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKDKEAKDKQEKEKIPPHDTS 168
           +  GTLA+ ++                  E  E++ ++K+       ++E +++      
Sbjct: 388 SVEGTLAKSILN-----------------EPTEIQAFKKEHTSRRSGREEADRVMIPRRC 430

Query: 169 FINELQLSDFKQTLQRNGIDCEFMDGVLICCRG 201
            I+EL  +      Q +    E    V+I   G
Sbjct: 431 SIDELSFAAHVDYGQNSSFIEEVGAKVIILVHG 463


>gi|242220452|ref|XP_002475992.1| predicted protein [Postia placenta Mad-698-R]
 gi|220724781|gb|EED78801.1| predicted protein [Postia placenta Mad-698-R]
          Length = 825

 Score = 48.1 bits (113), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 39/121 (32%), Positives = 57/121 (47%), Gaps = 7/121 (5%)

Query: 3   DKLMKSFEGARNNPFHFKHVKLCHSLAELAK-IEWMSDKLMKSFEGARNNPFHFKHVK-L 60
           D+  K     +N P ++       S+A     I  M+  +   F   R+NPF FKH+  L
Sbjct: 243 DEYWKKHPDLQNVPIYYASSLARKSMAVYQTYIHTMNSNVRSRF-AKRDNPFVFKHISNL 301

Query: 61  CHSLA---ELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARD 117
             S     ++A+ P P VVL S   M  G SR+L   W     N +IIT  +  GT+AR+
Sbjct: 302 PQSKGWERKIAEGP-PCVVLASPGFMTSGASRELLELWAPDSRNGVIITGYSIEGTMARE 360

Query: 118 L 118
           +
Sbjct: 361 I 361


>gi|313215108|emb|CBY42824.1| unnamed protein product [Oikopleura dioica]
          Length = 323

 Score = 47.8 bits (112), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 39/171 (22%), Positives = 71/171 (41%), Gaps = 51/171 (29%)

Query: 117 DLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKDKEAKDKQEKEKIPP-----------H 165
           + + +G       +++ R+   G  LE   +  + +   K E + IP            H
Sbjct: 193 NFVRVGSKDIEVARIRGRVDYFGGRLELEAENGENDEPKKLEIDDIPTLQPVTNNYSSGH 252

Query: 166 DTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGFSRDLF 225
           D+ FIN+ +L++ K  L   G+  EF+ G L+ C   V+++R                  
Sbjct: 253 DSIFINDTKLTELKSNLIDCGMHAEFIGGNLV-CNNKVSIKR------------------ 293

Query: 226 FQWCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
                                S + VI +EG LS++Y+ V++++YD YAI+
Sbjct: 294 ---------------------SANGVIQVEGTLSEDYFIVRKMVYDNYAIV 323


>gi|254565077|ref|XP_002489649.1| Putative endoribonuclease [Komagataella pastoris GS115]
 gi|238029445|emb|CAY67368.1| Putative endoribonuclease [Komagataella pastoris GS115]
          Length = 784

 Score = 47.8 bits (112), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 23/73 (31%), Positives = 39/73 (53%), Gaps = 1/73 (1%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I  M++ + K F     NPF F+++K   +L++      P VV+ S   ++ G SR L  
Sbjct: 291 INMMNENIRKKFRDTNKNPFQFQYIKNIKNLSKFDDF-QPSVVVASPGMLQNGVSRALLE 349

Query: 94  QWCSSPENSIIIT 106
           +W   P N++I+T
Sbjct: 350 KWAPDPRNTLIMT 362


>gi|401428833|ref|XP_003878899.1| cleavage and polyadenylation specificity factor,putative
           [Leishmania mexicana MHOM/GT/2001/U1103]
 gi|322495148|emb|CBZ30452.1| cleavage and polyadenylation specificity factor,putative
           [Leishmania mexicana MHOM/GT/2001/U1103]
          Length = 756

 Score = 47.8 bits (112), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 29/88 (32%), Positives = 46/88 (52%), Gaps = 6/88 (6%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVP--SPKVVLVSTPDMECGFSRDL 91
           +  M+D++ K      +NPF FK++   HSL +        P VVL S   ++ G S +L
Sbjct: 298 VSAMNDRV-KQQHANHHNPFVFKYI---HSLMDTKSFEDNGPCVVLASPGMLQSGISLEL 353

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLI 119
           F +WC    N II+      GT+A+D++
Sbjct: 354 FERWCGDRRNGIIMAGYCVDGTIAKDVL 381


>gi|196007172|ref|XP_002113452.1| hypothetical protein TRIADDRAFT_57642 [Trichoplax adhaerens]
 gi|190583856|gb|EDV23926.1| hypothetical protein TRIADDRAFT_57642 [Trichoplax adhaerens]
          Length = 596

 Score = 47.8 bits (112), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 30/102 (29%), Positives = 50/102 (49%), Gaps = 6/102 (5%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ ++F   ++N F FKH+K       L   P+P VV  +   +  G S  +F 
Sbjct: 283 ITWTNQKIRRTF--VQHNMFEFKHIKPFDR--ALIDNPNPMVVFATPGMLHGGLSLQIFK 338

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRI 135
           +W    +N +I+      GT+   +  L G RT+ L+ K+ I
Sbjct: 339 KWAPDDKNMVILPGYCVAGTVGNKI--LSGQRTVELENKQII 378


>gi|350585498|ref|XP_003127541.3| PREDICTED: integrator complex subunit 11-like [Sus scrofa]
          Length = 599

 Score = 47.8 bits (112), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 29/105 (27%), Positives = 50/105 (47%), Gaps = 6/105 (5%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K        A  P P VV  +   +  G S  +F 
Sbjct: 283 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADSPGPMVVFATPGMLHAGQSLQIFR 338

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           +W  + +N +I+      GT+   +  L G R L L+ ++ + ++
Sbjct: 339 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLELEGRQVLEVK 381


>gi|340966678|gb|EGS22185.1| putative cleavage and polyadenylation protein [Chaetomium
           thermophilum var. thermophilum DSM 1495]
          Length = 998

 Score = 47.8 bits (112), Expect = 0.005,   Method: Composition-based stats.
 Identities = 46/174 (26%), Positives = 71/174 (40%), Gaps = 60/174 (34%)

Query: 34  IEWMSDKLMKSFE----GARNN----------------PFHFKHVKLCHSLAELAKV--- 70
           +EWM D +++ FE    GAR                  PF FK+++L    A++ +V   
Sbjct: 334 LEWMDDSIVREFEAVAGGARTTNGGANASGGNKAKEAGPFDFKYLRLLERKAQIERVLQQ 393

Query: 71  -------PSPK--VVLVSTPDMECGFSRDLFFQWCSSPENSIIIT---NRTSPG--TLAR 116
                   SPK  V+L +   ++ GFS+D+     S   N +I+T   N  +P   ++AR
Sbjct: 394 ATSPPEGESPKGTVILATDTSLDWGFSKDVLKAIASDARNLVILTEKPNLANPDRPSIAR 453

Query: 117 DLIE-----------------------LGGNRTLTLQVKKRIRLEGEELEEYQK 147
            L +                        GG R L++    R  LEG EL  YQ+
Sbjct: 454 MLWDWWRERRDGVAVEQTASGDTFEQVYGGGRELSVPESTRHPLEGSELTVYQQ 507


>gi|241953057|ref|XP_002419250.1| subunit of mRNA cleavage and polyadenylation factor, putative
           [Candida dubliniensis CD36]
 gi|223642590|emb|CAX42840.1| subunit of mRNA cleavage and polyadenylation factor, putative
           [Candida dubliniensis CD36]
          Length = 930

 Score = 47.8 bits (112), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 41/147 (27%), Positives = 68/147 (46%), Gaps = 24/147 (16%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECG-FSRDLF 92
           ++WMS    K +E   + PF+   V L    +EL  +  PK+V  S  D+  G  S + F
Sbjct: 286 LDWMSKSFTKEWEELSSVPFNPSKVDLLLDPSELLNLSGPKIVFCSGIDLRSGDISAEAF 345

Query: 93  FQWCSSPENSIIITNRTSP--------------GTLAR--------DLIELGGNRTLTLQ 130
              C+    +II+T +T+                TLA+        D I +  ++ ++L+
Sbjct: 346 QYLCNDERTTIILTEKTTMSLESSLSSILYTEWDTLAKKRGGGESADGIAVPIDKNISLK 405

Query: 131 -VKKRIRLEGEELEEYQKKKDKEAKDK 156
              K I L G EL E+Q+K  ++ K+K
Sbjct: 406 NWTKEIELTGTELTEFQEKVAQKRKEK 432


>gi|340518710|gb|EGR48950.1| predicted protein [Trichoderma reesei QM6a]
          Length = 962

 Score = 47.8 bits (112), Expect = 0.005,   Method: Composition-based stats.
 Identities = 44/165 (26%), Positives = 68/165 (41%), Gaps = 51/165 (30%)

Query: 34  IEWMSDKLMKSFE----GARN----------NPFHFKHVKLCHSLAELAKVPSP------ 73
           +EWM + +++ FE    G R            PF FK+++L    A++ K+ S       
Sbjct: 332 LEWMDNNIVQEFEAFAEGQRKVNGGSEKKEGAPFDFKYLRLLERKAQIIKLLSQNIENGE 391

Query: 74  ---KVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPG-----TLARDLIE----- 120
              +V+L S   ME GFS+DL         N +I+T R S       +++R L E     
Sbjct: 392 THGRVILASDITMEWGFSKDLVKGLARDSRNLVILTERPSLAREDSPSISRTLWEWWKER 451

Query: 121 ------------------LGGNRTLTLQVKKRIRLEGEELEEYQK 147
                              GG R L ++   R  LEG++L  YQ+
Sbjct: 452 RDGVSTEHASTGDTLEMVYGGGRELEVREAVREPLEGDDLAIYQQ 496


>gi|328854195|gb|EGG03329.1| hypothetical protein MELLADRAFT_90299 [Melampsora larici-populina
           98AG31]
          Length = 695

 Score = 47.8 bits (112), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 41/155 (26%), Positives = 71/155 (45%), Gaps = 31/155 (20%)

Query: 35  EWMS-----DKLMKSF---EGARNNPFHFKHVKLCHSLAEL---AKVPSPKVVLVSTPDM 83
           EWMS     +  +KS    EG        +++K  +S+  L        PK +L     M
Sbjct: 108 EWMSPLTPTESQLKSRARDEGPGGIALRLRNLKFFNSIEALESQTAAIQPKCILAVPLTM 167

Query: 84  ECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIE------------------LGGNR 125
             GFSR +F +    P N +++T+     +L R L +                  +  N 
Sbjct: 168 AYGFSRRMFTRHVGKPGNLVVLTSMGEKESLTRWLADQVNEKSEAKYGSGTIPEPIDLNT 227

Query: 126 TLTLQVKKRIRLEGEELEEY--QKKKDKEAKDKQE 158
           ++++++K+++ LEGEELE+Y   K++ KE + K E
Sbjct: 228 SVSVELKRKVVLEGEELEQYLEDKQRAKERRTKHE 262


>gi|154336691|ref|XP_001564581.1| putative cleavage and polyadenylation specificity factor
           [Leishmania braziliensis MHOM/BR/75/M2904]
 gi|134061616|emb|CAM38647.1| putative cleavage and polyadenylation specificity factor
           [Leishmania braziliensis MHOM/BR/75/M2904]
          Length = 756

 Score = 47.8 bits (112), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 29/88 (32%), Positives = 46/88 (52%), Gaps = 6/88 (6%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVP--SPKVVLVSTPDMECGFSRDL 91
           +  M+D++ K      +NPF FK++   HSL +        P VVL S   ++ G S +L
Sbjct: 298 VSAMNDRV-KQQHANHHNPFVFKYI---HSLIDTKSFEDNGPCVVLASPGMLQSGISLEL 353

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLI 119
           F +WC    N II+      GT+A+D++
Sbjct: 354 FERWCGDRRNGIIMAGYCVDGTIAKDVL 381


>gi|398022636|ref|XP_003864480.1| cleavage and polyadenylation specificity factor, putative
           [Leishmania donovani]
 gi|322502715|emb|CBZ37798.1| cleavage and polyadenylation specificity factor, putative
           [Leishmania donovani]
          Length = 756

 Score = 47.4 bits (111), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 29/88 (32%), Positives = 46/88 (52%), Gaps = 6/88 (6%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVP--SPKVVLVSTPDMECGFSRDL 91
           +  M+D++ K      +NPF FK++   HSL +        P VVL S   ++ G S +L
Sbjct: 298 VSAMNDRV-KQQHANHHNPFVFKYI---HSLMDTKSFEDNGPCVVLASPGMLQSGISLEL 353

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLI 119
           F +WC    N II+      GT+A+D++
Sbjct: 354 FERWCGDRRNGIIMAGYCVDGTIAKDVL 381


>gi|223997482|ref|XP_002288414.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220975522|gb|EED93850.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 557

 Score = 47.4 bits (111), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 27/91 (29%), Positives = 43/91 (47%), Gaps = 10/91 (10%)

Query: 50  NNPFHFKHVKLCHSL-AELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 108
            NPFHF H++   S+         P VV  S   ++ G SR LF +W   P+N +++   
Sbjct: 308 GNPFHFSHIRNLKSIDVNNFDDRGPSVVFASPGMLQSGVSRQLFDRWAGDPKNGVMLAGY 367

Query: 109 TSPGTLARDLIELGGNRTLTLQVKKRIRLEG 139
               TLA++++          Q K+ + LEG
Sbjct: 368 AVEHTLAKEIMS---------QPKEVVTLEG 389


>gi|146099573|ref|XP_001468678.1| putative cleavage and polyadenylation specificity factor
           [Leishmania infantum JPCM5]
 gi|134073046|emb|CAM71766.1| putative cleavage and polyadenylation specificity factor
           [Leishmania infantum JPCM5]
          Length = 756

 Score = 47.4 bits (111), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 29/88 (32%), Positives = 46/88 (52%), Gaps = 6/88 (6%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVP--SPKVVLVSTPDMECGFSRDL 91
           +  M+D++ K      +NPF FK++   HSL +        P VVL S   ++ G S +L
Sbjct: 298 VSAMNDRV-KQQHANHHNPFVFKYI---HSLMDTKSFEDNGPCVVLASPGMLQSGISLEL 353

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLI 119
           F +WC    N II+      GT+A+D++
Sbjct: 354 FERWCGDRRNGIIMAGYCVDGTIAKDVL 381


>gi|390601510|gb|EIN10904.1| hypothetical protein PUNSTDRAFT_112695 [Punctularia strigosozonata
           HHB-11173 SS5]
          Length = 937

 Score = 47.4 bits (111), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 37/132 (28%), Positives = 61/132 (46%), Gaps = 24/132 (18%)

Query: 53  FHFKHVKLCHSLAELAKVPS---PKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRT 109
             F+H++   +   L    S   PK++L     +  G SR LF  + + P+N II+T R 
Sbjct: 388 LRFRHLEFFPNPQALLNTYSSKDPKLILAVPASLSHGPSRALFSTFAAVPDNVIILTQRG 447

Query: 110 SPGTLARDLI----------------ELGGNRTL----TLQVKKRIRLEGEELEEYQKKK 149
             GTL  DL                 ++G N  L     L++  ++ L+G+ELE +   K
Sbjct: 448 EEGTLGNDLFKKWNNSQRAEHKWDKGKIGSNVMLDGNMILKMNSKVPLQGDELEAFL-AK 506

Query: 150 DKEAKDKQEKEK 161
           ++ A +K+  EK
Sbjct: 507 ERAAMEKEAAEK 518


>gi|322710530|gb|EFZ02104.1| cleavage and polyadenylation specifity factor [Metarhizium
           anisopliae ARSEF 23]
          Length = 831

 Score = 47.4 bits (111), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 26/79 (32%), Positives = 47/79 (59%), Gaps = 4/79 (5%)

Query: 44  SFEGA-RNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFFQWCSSPEN 101
           S +GA +  P+ FK+++   +L     V     V++++P M + G SR+LF +W  S +N
Sbjct: 326 SGDGAGQGGPWDFKYIRSLKNLDRFDDVGG--CVMLASPGMLQSGVSRELFERWAPSEKN 383

Query: 102 SIIITNRTSPGTLARDLIE 120
            +IIT  +  GT+AR +++
Sbjct: 384 GVIITGYSVEGTMARQIMQ 402


>gi|324504608|gb|ADY41989.1| Integrator complex subunit 11 [Ascaris suum]
          Length = 588

 Score = 47.4 bits (111), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 38/125 (30%), Positives = 60/125 (48%), Gaps = 22/125 (17%)

Query: 26  HSLAELAK------IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVS 79
           H LAE A       I W ++K+ ++F     N F FKH++        +  P P +VL S
Sbjct: 270 HGLAEKATQYYRLFISWTNEKIKRTF--VHRNMFDFKHIRPFDQ--SFSDSPGP-MVLFS 324

Query: 80  TPDM-ECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           TP M   G S  +F +WCS  +N +I+      GT+   +I  GG         K++ ++
Sbjct: 325 TPGMLHGGQSLRVFKKWCSDEKNMVIMPGFCVAGTIGAKVI--GG--------AKKVEID 374

Query: 139 GEELE 143
           G+ L+
Sbjct: 375 GKMLD 379


>gi|400600571|gb|EJP68245.1| metallo-beta-lactamase superfamily protein [Beauveria bassiana
           ARSEF 2860]
          Length = 866

 Score = 47.4 bits (111), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 25/78 (32%), Positives = 44/78 (56%), Gaps = 3/78 (3%)

Query: 44  SFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFFQWCSSPENS 102
           S E     P+ FK+++   +L     V     V++++P M + G SR+LF +W  S +N 
Sbjct: 324 SGEAGAGGPWDFKYIRSLKNLDRFDDVGG--CVMLASPGMLQNGVSRELFERWAPSDKNG 381

Query: 103 IIITNRTSPGTLARDLIE 120
           +IIT  +  GT+AR +++
Sbjct: 382 VIITGYSVEGTMARQIMK 399


>gi|344283025|ref|XP_003413273.1| PREDICTED: integrator complex subunit 11-like [Loxodonta africana]
          Length = 719

 Score = 47.4 bits (111), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 44/193 (22%), Positives = 79/193 (40%), Gaps = 15/193 (7%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K        A  P P VV  +   +  G S  +F 
Sbjct: 402 ITWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 457

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGE-ELEEYQKKKDKE 152
           +W  + +N +I+      GT+   +  L G R L ++ ++ + ++ + E   +    D +
Sbjct: 458 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAK 515

Query: 153 AKDKQEKEKIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVS 212
              +   +  P H      E +  +F     +  I+ EF     + C        V L +
Sbjct: 516 GIMQLIGQAEPEHVLLVHGEAKKMEF----LKQKIEQEFR----VNCYMPANGETVTLPT 567

Query: 213 TPDMECGFSRDLF 225
           +P +  G S  L 
Sbjct: 568 SPSIPVGISLGLL 580


>gi|432090010|gb|ELK23618.1| Integrator complex subunit 11 [Myotis davidii]
          Length = 561

 Score = 47.4 bits (111), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 31/111 (27%), Positives = 53/111 (47%), Gaps = 8/111 (7%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K        A  P P VV  +   +  G S  +F 
Sbjct: 317 ITWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 372

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKK--RIRLEGEEL 142
           +W  + +N +I+      GT+   +  L G R L ++ ++   IR++ E +
Sbjct: 373 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEIRMQVEYM 421


>gi|426240429|ref|XP_004014105.1| PREDICTED: integrator complex subunit 11 [Ovis aries]
          Length = 515

 Score = 47.4 bits (111), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K        A  P P VV  +   +  G S  +F 
Sbjct: 283 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADSPGPMVVFATPGMLHAGQSLQIFR 338

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           +W  + +N +I+      GT+   +  L G R L ++ ++ + ++
Sbjct: 339 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 381


>gi|74220481|dbj|BAE31460.1| unnamed protein product [Mus musculus]
          Length = 600

 Score = 47.4 bits (111), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K        A  P P VV  +   +  G S  +F 
Sbjct: 283 ITWTNQKIRKTF--VQGNMFEFKHIKAFDR--TFADNPGPMVVFATPGMLHAGQSLQIFR 338

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           +W  + +N +I+      GT+   +  L G R L ++ ++ + ++
Sbjct: 339 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQMLEVK 381


>gi|343958192|dbj|BAK62951.1| protein related to CPSF subunits 68 kDa [Pan troglodytes]
          Length = 600

 Score = 47.4 bits (111), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 28/105 (26%), Positives = 51/105 (48%), Gaps = 6/105 (5%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K        A  P P VV  +   +  G S  +F 
Sbjct: 283 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 338

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           +W  + +N +I+      GT+  ++  L G R L ++ ++ + ++
Sbjct: 339 KWAGNEKNMVIMPGYCVQGTVGHEI--LSGQRKLEMEGRQVLEVK 381


>gi|389740019|gb|EIM81211.1| mRNA 3'-end-processing protein YSH1 [Stereum hirsutum FP-91666 SS1]
          Length = 841

 Score = 47.0 bits (110), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 29/75 (38%), Positives = 42/75 (56%), Gaps = 5/75 (6%)

Query: 49  RNNPFHFKHVK-LCHSLA---ELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSII 104
           R+NPF FKH+  +  S     ++A+ P P VVL S   M+ G SR L   W     N +I
Sbjct: 298 RDNPFVFKHISNMPQSSGWERKIAEGP-PCVVLASPGFMQSGPSRQLLELWAPDSRNGLI 356

Query: 105 ITNRTSPGTLARDLI 119
           +T  +  GTLAR+++
Sbjct: 357 VTGYSVEGTLAREIM 371


>gi|449546825|gb|EMD37794.1| hypothetical protein CERSUDRAFT_154677 [Ceriporiopsis subvermispora
           B]
          Length = 820

 Score = 47.0 bits (110), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 28/75 (37%), Positives = 40/75 (53%), Gaps = 5/75 (6%)

Query: 49  RNNPFHFKHVKLCHSLA----ELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSII 104
           R+NPF FKH+           ++A+ P P VVL S   +  G SR+L   W     N II
Sbjct: 298 RDNPFVFKHISNVPQARGWERKIAEGP-PCVVLASPGFVTSGPSRELLELWAPDSRNGII 356

Query: 105 ITNRTSPGTLARDLI 119
           +T  +  GT+ARD++
Sbjct: 357 VTGYSVEGTMARDIL 371


>gi|414881435|tpg|DAA58566.1| TPA: putative RNA-metabolising metallo-beta-lactamase [Zea mays]
          Length = 558

 Score = 47.0 bits (110), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 31/87 (35%), Positives = 41/87 (47%), Gaps = 6/87 (6%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLF 92
           I W S K+  S     +NPF FKHV  CH        P P  VL +TP M   GFS + F
Sbjct: 285 IGWTSQKIKDSH--TVHNPFDFKHV--CHFERSFINNPGP-CVLFATPGMITGGFSLEAF 339

Query: 93  FQWCSSPENSIIITNRTSPGTLARDLI 119
            +W  S +N + +      GT+   L+
Sbjct: 340 KKWAPSEKNLVTLPGYCVSGTIGHKLM 366


>gi|339244969|ref|XP_003378410.1| putative metallo-beta-lactamase domain protein [Trichinella
           spiralis]
 gi|316972680|gb|EFV56345.1| putative metallo-beta-lactamase domain protein [Trichinella
           spiralis]
          Length = 562

 Score = 47.0 bits (110), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 30/97 (30%), Positives = 46/97 (47%), Gaps = 6/97 (6%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           + W S+K+ K+F   + N F FKHV         A  P P VV  +   +  G S  +F 
Sbjct: 250 VTWTSEKIKKTF--VKRNMFDFKHVLPFED--SFADTPGPMVVFATPGMLHSGQSLKIFK 305

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQ 130
           +W ++ +N +I+      GT+   LI   G R L L+
Sbjct: 306 KWATNEKNMVIMPGYCVQGTVGSKLI--AGVRKLDLE 340


>gi|274326663|ref|NP_001094578.1| integrator complex subunit 11 [Bos taurus]
 gi|152941100|gb|ABS44987.1| related to CPSF subunits 68 kDa [Bos taurus]
          Length = 599

 Score = 47.0 bits (110), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K        A  P P VV  +   +  G S  +F 
Sbjct: 283 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADSPGPMVVFATPGMLHAGQSLQIFR 338

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           +W  + +N +I+      GT+   +  L G R L ++ ++ + ++
Sbjct: 339 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 381


>gi|355680857|gb|AER96662.1| cleavage and polyadenylation specific factor 3-like protein
           [Mustela putorius furo]
          Length = 440

 Score = 47.0 bits (110), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K        A  P P VV  +   +  G S  +F 
Sbjct: 292 ITWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 347

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           +W  + +N +I+      GT+   +  L G R L ++ ++ + ++
Sbjct: 348 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 390


>gi|118572556|sp|Q2YDM2.2|INT11_BOVIN RecName: Full=Integrator complex subunit 11; Short=Int11; AltName:
           Full=Cleavage and polyadenylation-specific factor 3-like
           protein; Short=CPSF3-like protein
 gi|158455110|gb|AAI10156.2| CPSF3L protein [Bos taurus]
          Length = 599

 Score = 47.0 bits (110), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K        A  P P VV  +   +  G S  +F 
Sbjct: 283 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADSPGPMVVFATPGMLHAGQSLQIFR 338

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           +W  + +N +I+      GT+   +  L G R L ++ ++ + ++
Sbjct: 339 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 381


>gi|440911726|gb|ELR61363.1| Integrator complex subunit 11 [Bos grunniens mutus]
          Length = 599

 Score = 47.0 bits (110), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K        A  P P VV  +   +  G S  +F 
Sbjct: 283 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADSPGPMVVFATPGMLHAGQSLQIFR 338

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           +W  + +N +I+      GT+   +  L G R L ++ ++ + ++
Sbjct: 339 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 381


>gi|328873132|gb|EGG21499.1| integrator complex subunit 11 [Dictyostelium fasciculatum]
          Length = 645

 Score = 47.0 bits (110), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 40/132 (30%), Positives = 60/132 (45%), Gaps = 18/132 (13%)

Query: 14  NNPFHFKHVKLCHSLAELAK------IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAEL 67
           N P +F        LAE A       I W + K+ ++F   + N F FKH+K       L
Sbjct: 261 NVPIYF-----SEGLAEKANFYYKLFITWTNQKIKQTF--VKRNMFDFKHIKPFDR--HL 311

Query: 68  AKVPSPKVVLVSTPDM-ECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRT 126
           A  P P +VL +TP M   G S ++F +W  +  N  II      GT+   L+   G   
Sbjct: 312 ADAPGP-MVLFATPGMLHAGASLEVFKKWAPNELNMTIIPGYCVVGTVGNKLLSNAGGPQ 370

Query: 127 LTLQVKKRIRLE 138
           + +++ K+  LE
Sbjct: 371 M-VEIDKKTTLE 381


>gi|296479091|tpg|DAA21206.1| TPA: cleavage and polyadenylation specific factor 3-like [Bos
           taurus]
          Length = 599

 Score = 47.0 bits (110), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K        A  P P VV  +   +  G S  +F 
Sbjct: 283 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADSPGPMVVFATPGMLHAGQSLQIFR 338

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           +W  + +N +I+      GT+   +  L G R L ++ ++ + ++
Sbjct: 339 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 381


>gi|134083194|emb|CAK42833.1| unnamed protein product [Aspergillus niger]
          Length = 865

 Score = 47.0 bits (110), Expect = 0.009,   Method: Composition-based stats.
 Identities = 24/80 (30%), Positives = 41/80 (51%), Gaps = 7/80 (8%)

Query: 34  IEWMSDKLMKSFEGARNN----PFHFKHVKLCHSLAELAKVPS---PKVVLVSTPDMECG 86
           +EWM + +++ FE A       PF FKH+++      L K+ S   PKV+L S   ++ G
Sbjct: 325 LEWMDENIVREFEAAEEGKGVGPFTFKHLRILERKKRLEKILSDQKPKVILASDTSLDWG 384

Query: 87  FSRDLFFQWCSSPENSIIIT 106
           F++D          N +++T
Sbjct: 385 FAKDSLRLVAEGANNLLLLT 404


>gi|440638117|gb|ELR08036.1| hypothetical protein GMDG_02874 [Geomyces destructans 20631-21]
          Length = 831

 Score = 47.0 bits (110), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 24/74 (32%), Positives = 41/74 (55%), Gaps = 1/74 (1%)

Query: 47  GARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIIT 106
           G    P+ FK+++   SL     V S  V+L S   M+ G SR+L  +W  S +N ++IT
Sbjct: 325 GTNGGPWDFKYIRSLKSLERFDDVGS-CVMLASPGMMQNGVSRELLERWAPSDKNGVVIT 383

Query: 107 NRTSPGTLARDLIE 120
             +  GT+A+ +++
Sbjct: 384 GYSVEGTMAKSIMQ 397


>gi|348551496|ref|XP_003461566.1| PREDICTED: integrator complex subunit 11 [Cavia porcellus]
          Length = 600

 Score = 47.0 bits (110), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 45/195 (23%), Positives = 79/195 (40%), Gaps = 15/195 (7%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K        A  P P VV  +   +  G S  +F 
Sbjct: 283 ITWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 338

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGE-ELEEYQKKKDKE 152
           +W  + +N +I+      GT+   +  L G R L ++ ++ + ++ + E   +    D +
Sbjct: 339 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAK 396

Query: 153 AKDKQEKEKIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVS 212
              +   +  P        E +  +F     R  I+ EF     + C        V L +
Sbjct: 397 GIMQLVGQAEPESVLLVHGEAKKMEF----LRQKIEQEFR----VSCYMPANGETVTLPT 448

Query: 213 TPDMECGFSRDLFFQ 227
           +P +  G S  L  Q
Sbjct: 449 SPSIPVGISLGLLKQ 463


>gi|221484558|gb|EEE22852.1| cleavage and polyadenylation specificity factor, putative
           [Toxoplasma gondii GT1]
          Length = 1100

 Score = 47.0 bits (110), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 26/72 (36%), Positives = 40/72 (55%), Gaps = 5/72 (6%)

Query: 51  NPFHFKHVKLCHSLAELAKV----PSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIIT 106
           NPF F+ VK   S+ E A+V      P VV+ +   ++ G SR++F  W    +N +I+T
Sbjct: 423 NPFAFRFVKNVKSV-EAARVYIHHDGPAVVMAAPGMLQSGASREIFEAWAPDAKNGVILT 481

Query: 107 NRTSPGTLARDL 118
             +  GTLA +L
Sbjct: 482 GYSVKGTLADEL 493


>gi|301618510|ref|XP_002938656.1| PREDICTED: integrator complex subunit 11 isoform 1 [Xenopus
           (Silurana) tropicalis]
 gi|301618512|ref|XP_002938657.1| PREDICTED: integrator complex subunit 11 isoform 2 [Xenopus
           (Silurana) tropicalis]
          Length = 600

 Score = 47.0 bits (110), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 32/108 (29%), Positives = 50/108 (46%), Gaps = 12/108 (11%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K        A  P P VV  +   +  G S  +F 
Sbjct: 283 ITWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 338

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTL------TLQVKKRI 135
           +W  + +N +I+      GT+   +  L G R L      TL+VK ++
Sbjct: 339 KWAGNDKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQTLEVKMQV 384


>gi|384499309|gb|EIE89800.1| hypothetical protein RO3G_14511 [Rhizopus delemar RA 99-880]
          Length = 654

 Score = 47.0 bits (110), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 27/83 (32%), Positives = 44/83 (53%), Gaps = 3/83 (3%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I  M+ ++ K F  A +NPF FKH+    ++ +      P V++ S   ++ G SR+LF 
Sbjct: 277 INMMNARIRKQF--AISNPFVFKHISNLKNVEQFED-SGPCVMMASPGMLQNGLSRELFE 333

Query: 94  QWCSSPENSIIITNRTSPGTLAR 116
           +W    +N ++IT      TLAR
Sbjct: 334 RWAPDKKNGLVITGYCVENTLAR 356


>gi|221504752|gb|EEE30417.1| cleavage and polyadenylation specificity factor, putative
           [Toxoplasma gondii VEG]
          Length = 1100

 Score = 47.0 bits (110), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 26/72 (36%), Positives = 40/72 (55%), Gaps = 5/72 (6%)

Query: 51  NPFHFKHVKLCHSLAELAKV----PSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIIT 106
           NPF F+ VK   S+ E A+V      P VV+ +   ++ G SR++F  W    +N +I+T
Sbjct: 423 NPFAFRFVKNVKSV-EAARVYIHHDGPAVVMAAPGMLQSGASREIFEAWAPDAKNGVILT 481

Query: 107 NRTSPGTLARDL 118
             +  GTLA +L
Sbjct: 482 GYSVKGTLADEL 493


>gi|237839761|ref|XP_002369178.1| cleavage and polyadenylation specificity factor, putative
           [Toxoplasma gondii ME49]
 gi|211966842|gb|EEB02038.1| cleavage and polyadenylation specificity factor, putative
           [Toxoplasma gondii ME49]
          Length = 1100

 Score = 47.0 bits (110), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 26/72 (36%), Positives = 40/72 (55%), Gaps = 5/72 (6%)

Query: 51  NPFHFKHVKLCHSLAELAKV----PSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIIT 106
           NPF F+ VK   S+ E A+V      P VV+ +   ++ G SR++F  W    +N +I+T
Sbjct: 423 NPFAFRFVKNVKSV-EAARVYIHHDGPAVVMAAPGMLQSGASREIFEAWAPDAKNGVILT 481

Query: 107 NRTSPGTLARDL 118
             +  GTLA +L
Sbjct: 482 GYSVKGTLADEL 493


>gi|76559911|ref|NP_001029064.1| integrator complex subunit 11 [Rattus norvegicus]
 gi|119371245|sp|Q3MHC2.1|INT11_RAT RecName: Full=Integrator complex subunit 11; Short=Int11; AltName:
           Full=Cleavage and polyadenylation-specific factor 3-like
           protein; Short=CPSF3-like protein
 gi|75867808|gb|AAI05304.1| Cleavage and polyadenylation specific factor 3-like [Rattus
           norvegicus]
          Length = 600

 Score = 47.0 bits (110), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K        A  P P VV  +   +  G S  +F 
Sbjct: 283 ITWTNQKIRKTF--VQRNMFEFKHIKAFDR--TFADNPGPMVVFATPGMLHAGQSLQIFR 338

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           +W  + +N +I+      GT+   +  L G R L ++ ++ + ++
Sbjct: 339 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQMLEVK 381


>gi|149024842|gb|EDL81339.1| similar to RIKEN cDNA 2410006F12 [Rattus norvegicus]
          Length = 601

 Score = 47.0 bits (110), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K        A  P P VV  +   +  G S  +F 
Sbjct: 284 ITWTNQKIRKTF--VQRNMFEFKHIKAFDR--TFADNPGPMVVFATPGMLHAGQSLQIFR 339

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           +W  + +N +I+      GT+   +  L G R L ++ ++ + ++
Sbjct: 340 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQMLEVK 382


>gi|21312614|ref|NP_082296.1| integrator complex subunit 11 [Mus musculus]
 gi|81904239|sp|Q9CWS4.1|INT11_MOUSE RecName: Full=Integrator complex subunit 11; Short=Int11; AltName:
           Full=Cleavage and polyadenylation-specific factor 3-like
           protein; Short=CPSF3-like protein
 gi|12845859|dbj|BAB26928.1| unnamed protein product [Mus musculus]
 gi|26355309|dbj|BAC41135.1| unnamed protein product [Mus musculus]
 gi|74192536|dbj|BAE43054.1| unnamed protein product [Mus musculus]
 gi|74219576|dbj|BAE29558.1| unnamed protein product [Mus musculus]
 gi|148683102|gb|EDL15049.1| cleavage and polyadenylation specific factor 3-like, isoform CRA_b
           [Mus musculus]
          Length = 600

 Score = 47.0 bits (110), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K        A  P P VV  +   +  G S  +F 
Sbjct: 283 ITWTNQKIRKTF--VQRNMFEFKHIKAFDR--TFADNPGPMVVFATPGMLHAGQSLQIFR 338

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           +W  + +N +I+      GT+   +  L G R L ++ ++ + ++
Sbjct: 339 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQMLEVK 381


>gi|74198351|dbj|BAE39661.1| unnamed protein product [Mus musculus]
          Length = 600

 Score = 47.0 bits (110), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K        A  P P VV  +   +  G S  +F 
Sbjct: 283 ITWTNQKIRKTF--VQRNMFEFKHIKAFDR--TFADNPGPMVVFATPGMLHAGQSLQIFR 338

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           +W  + +N +I+      GT+   +  L G R L ++ ++ + ++
Sbjct: 339 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQMLEVK 381


>gi|15029864|gb|AAH11155.1| Cleavage and polyadenylation specific factor 3-like [Mus musculus]
          Length = 600

 Score = 47.0 bits (110), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K        A  P P VV  +   +  G S  +F 
Sbjct: 283 ITWTNQKIRKTF--VQRNMFEFKHIKAFDR--TFADNPGPMVVFATPGMLHAGQSLQIFR 338

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           +W  + +N +I+      GT+   +  L G R L ++ ++ + ++
Sbjct: 339 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQMLEVK 381


>gi|345563127|gb|EGX46131.1| hypothetical protein AOL_s00110g295 [Arthrobotrys oligospora ATCC
           24927]
          Length = 982

 Score = 47.0 bits (110), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 31/111 (27%), Positives = 46/111 (41%), Gaps = 29/111 (26%)

Query: 32  AKIEWMSDKLMKSFEGARNN--------------------------PFHFKHVKLC---H 62
           A +EWM + ++K FE   +N                          PF F+H+KL    H
Sbjct: 310 ALLEWMDESMVKEFESVTHNNNPSSRRKPKSSNTGAGDKEDDKLYGPFDFRHLKLVEHKH 369

Query: 63  SLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGT 113
            L ++      KVV+ S   +E GFS ++         N I++T R S GT
Sbjct: 370 QLTKILSRKGGKVVITSDKSLEWGFSTEVVKSIADDERNLIVLTERGSEGT 420


>gi|357618297|gb|EHJ71333.1| putative cleavage and polyadenylation specificity factor [Danaus
           plexippus]
          Length = 385

 Score = 46.6 bits (109), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 19/48 (39%), Positives = 29/48 (60%)

Query: 72  SPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLI 119
            P V++ S   M+ G SR+LF  WC+ P+N +II      GTLA+ ++
Sbjct: 29  GPCVIMASPGMMQSGLSRELFESWCTDPKNGVIIAGYCVEGTLAKTIL 76


>gi|281348165|gb|EFB23749.1| hypothetical protein PANDA_020173 [Ailuropoda melanoleuca]
          Length = 591

 Score = 46.6 bits (109), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K        A  P P VV  +   +  G S  +F 
Sbjct: 274 ITWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 329

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           +W  + +N +I+      GT+   +  L G R L ++ ++ + ++
Sbjct: 330 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 372


>gi|254567914|ref|XP_002491067.1| hypothetical protein [Komagataella pastoris GS115]
 gi|238030864|emb|CAY68787.1| hypothetical protein PAS_chr2-1_0816 [Komagataella pastoris GS115]
 gi|328352406|emb|CCA38805.1| Cleavage and polyadenylation specificity factor subunit 2
           [Komagataella pastoris CBS 7435]
          Length = 854

 Score = 46.6 bits (109), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 37/156 (23%), Positives = 70/156 (44%), Gaps = 20/156 (12%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECG-FSRDLF 92
           +EWMS K++K +E     PF    V+L   + +L ++P  KVV  +  D+  G  +    
Sbjct: 284 LEWMSPKIIKDWEERNETPFDPSRVQLV-DVDDLVQLPGAKVVFTADADLTIGSTAHSTL 342

Query: 93  FQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVK---------------KRI-- 135
              C   +N+II T R +  +    + E+    TL    K                R+  
Sbjct: 343 ASICIDEKNTIIFTERPTNSSFGASIYEIWEKLTLERNGKLEDGFPVPFEKLLTFSRVTL 402

Query: 136 -RLEGEELEEYQKKKDKEAKDKQEKEKIPPHDTSFI 170
            +L G EL +Y +  ++  ++K++K ++   +T+ +
Sbjct: 403 KKLTGLELAQYTEIVNERKQEKRKKRQVEKMNTTIL 438


>gi|334321967|ref|XP_001364674.2| PREDICTED: integrator complex subunit 11-like [Monodelphis
           domestica]
          Length = 600

 Score = 46.6 bits (109), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 43/193 (22%), Positives = 81/193 (41%), Gaps = 15/193 (7%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K        A  P P VV  +   +  G S  +F 
Sbjct: 283 ITWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 338

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGE-ELEEYQKKKDKE 152
           +W  + +N +I+      GT+   +  L G R L ++ ++ + ++ + E   +    D +
Sbjct: 339 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQILEVKMQVEYMSFSAHADAK 396

Query: 153 AKDKQEKEKIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVS 212
              +  ++  P +      E +  +F     +  I+ EF     + C        V L++
Sbjct: 397 GIMQLVRQAEPDNVLLVHGEAKKMEF----LKQKIEQEFH----VNCYMPANGETVTLLT 448

Query: 213 TPDMECGFSRDLF 225
            P++  G S  L 
Sbjct: 449 HPNIPVGISLGLL 461


>gi|443926404|gb|ELU45071.1| mRNA 3'-end-processing protein YSH1 [Rhizoctonia solani AG-1 IA]
          Length = 409

 Score = 46.6 bits (109), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 28/78 (35%), Positives = 40/78 (51%), Gaps = 11/78 (14%)

Query: 49  RNNPFHFKHVKLCHS-------LAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPEN 101
           ++NPF FKH+    +       +AE      P V+L S   M  G SR+L   W    +N
Sbjct: 266 KDNPFVFKHISHLPATRGWERKIAEAG----PCVILASPGFMSSGPSRELLELWAPDAKN 321

Query: 102 SIIITNRTSPGTLARDLI 119
            +IIT  +  GT+ARD+I
Sbjct: 322 GVIITGYSIEGTMARDII 339


>gi|417403203|gb|JAA48419.1| Putative mrna cleavage and polyadenylation factor ii complex brr5
           cpsf subunit [Desmodus rotundus]
          Length = 603

 Score = 46.6 bits (109), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 30/111 (27%), Positives = 53/111 (47%), Gaps = 8/111 (7%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K        A  P P VV  +   +  G S  +F 
Sbjct: 283 ITWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 338

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKK--RIRLEGEEL 142
           +W  + +N +I+      GT+   +  L G R L ++ ++   +R++ E +
Sbjct: 339 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVRMQVEYM 387


>gi|444519369|gb|ELV12789.1| Integrator complex subunit 11 [Tupaia chinensis]
          Length = 601

 Score = 46.6 bits (109), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K        A  P P VV  +   +  G S  +F 
Sbjct: 284 ITWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 339

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           +W  + +N +I+      GT+   +  L G R L ++ ++ + ++
Sbjct: 340 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 382


>gi|359319514|ref|XP_003639102.1| PREDICTED: integrator complex subunit 11-like [Canis lupus
           familiaris]
          Length = 600

 Score = 46.6 bits (109), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K        A  P P VV  +   +  G S  +F 
Sbjct: 283 ITWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 338

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           +W  + +N +I+      GT+   +  L G R L ++ ++ + ++
Sbjct: 339 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 381


>gi|414881433|tpg|DAA58564.1| TPA: putative RNA-metabolising metallo-beta-lactamase [Zea mays]
          Length = 400

 Score = 46.6 bits (109), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 31/87 (35%), Positives = 41/87 (47%), Gaps = 6/87 (6%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLF 92
           I W S K+  S     +NPF FKHV  CH        P P  VL +TP M   GFS + F
Sbjct: 285 IGWTSQKIKDSH--TVHNPFDFKHV--CHFERSFINNPGP-CVLFATPGMITGGFSLEAF 339

Query: 93  FQWCSSPENSIIITNRTSPGTLARDLI 119
            +W  S +N + +      GT+   L+
Sbjct: 340 KKWAPSEKNLVTLPGYCVSGTIGHKLM 366


>gi|190346294|gb|EDK38344.2| hypothetical protein PGUG_02442 [Meyerozyma guilliermondii ATCC
           6260]
          Length = 821

 Score = 46.6 bits (109), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 42/156 (26%), Positives = 69/156 (44%), Gaps = 33/156 (21%)

Query: 34  IEWMSDKLMKSFEGA---------RNN-PFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM 83
           +EWMS  L+K +E A         +NN PF    V L     EL ++  PK+VL +  DM
Sbjct: 173 LEWMSTSLVKEWEAASSASMNSTNKNNFPFDPSKVDLLSDPKELIQLSGPKIVLCAGIDM 232

Query: 84  ECG-FSRDLFFQWCSSPENSIIITNRTSPGT---------------------LARDLIEL 121
             G  S ++    CS  +N++++T +T  G                       A D + +
Sbjct: 233 NSGDVSFEVLKYLCSDQKNTVLLTEKTHFGADFSINAQLFTDWVRLSREKYGNAEDGLAI 292

Query: 122 GGNRTLTLQ-VKKRIRLEGEELEEYQKKKDKEAKDK 156
           G   T+ L+ + +   L G EL  +Q++ + + K K
Sbjct: 293 GYEGTIPLRGLSREDPLSGSELTSFQERINHQRKKK 328


>gi|338722203|ref|XP_001496423.3| PREDICTED: integrator complex subunit 11 [Equus caballus]
          Length = 571

 Score = 46.6 bits (109), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K        A  P P VV  +   +  G S  +F 
Sbjct: 254 ITWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 309

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           +W  + +N +I+      GT+   +  L G R L ++ ++ + ++
Sbjct: 310 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 352


>gi|395840791|ref|XP_003793235.1| PREDICTED: integrator complex subunit 11 isoform 1 [Otolemur
           garnettii]
          Length = 600

 Score = 46.6 bits (109), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K        A  P P VV  +   +  G S  +F 
Sbjct: 283 ITWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 338

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           +W  + +N +I+      GT+   +  L G R L ++ ++ + ++
Sbjct: 339 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 381


>gi|301788922|ref|XP_002929872.1| PREDICTED: integrator complex subunit 11-like [Ailuropoda
           melanoleuca]
          Length = 600

 Score = 46.6 bits (109), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K        A  P P VV  +   +  G S  +F 
Sbjct: 283 ITWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 338

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           +W  + +N +I+      GT+   +  L G R L ++ ++ + ++
Sbjct: 339 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 381


>gi|296815164|ref|XP_002847919.1| cleavage and polyadenylation specificity factor subunit 2
           [Arthroderma otae CBS 113480]
 gi|238840944|gb|EEQ30606.1| cleavage and polyadenylation specificity factor subunit 2
           [Arthroderma otae CBS 113480]
          Length = 1000

 Score = 46.6 bits (109), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 33/125 (26%), Positives = 50/125 (40%), Gaps = 32/125 (25%)

Query: 13  RNNPFHFKHVKLCHSLAELAK--IEWMSDKLMKSFEGARN-------------------- 50
           +N P +    K  H    LA+  +EWM + +++ FEG                       
Sbjct: 309 KNAPLYLAGKK-AHGTMRLARSMLEWMDENIVREFEGNDGVEVGAGKSGGGAANQPSKSA 367

Query: 51  ------NPFHFKHVKLCHSLAELAKV---PSPKVVLVSTPDMECGFSRDLFFQWCSSPEN 101
                  PF FKH+ L    A+L  +     PKV+L     +E G SR +  Q  +  +N
Sbjct: 368 QGQKSLGPFTFKHLNLVEHKAKLDSILDSKGPKVILSPDASLEWGLSRHVLRQIAAGSDN 427

Query: 102 SIIIT 106
            II+T
Sbjct: 428 LIIMT 432


>gi|414881434|tpg|DAA58565.1| TPA: putative RNA-metabolising metallo-beta-lactamase [Zea mays]
          Length = 400

 Score = 46.6 bits (109), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 31/87 (35%), Positives = 41/87 (47%), Gaps = 6/87 (6%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLF 92
           I W S K+  S     +NPF FKHV  CH        P P  VL +TP M   GFS + F
Sbjct: 285 IGWTSQKIKDSH--TVHNPFDFKHV--CHFERSFINNPGP-CVLFATPGMITGGFSLEAF 339

Query: 93  FQWCSSPENSIIITNRTSPGTLARDLI 119
            +W  S +N + +      GT+   L+
Sbjct: 340 KKWAPSEKNLVTLPGYCVSGTIGHKLM 366


>gi|158256210|dbj|BAF84076.1| unnamed protein product [Homo sapiens]
          Length = 606

 Score = 46.6 bits (109), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K        A  P P VV  +   +  G S  +F 
Sbjct: 289 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 344

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           +W  + +N +I+      GT+   +  L G R L ++ ++ + ++
Sbjct: 345 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 387


>gi|431922648|gb|ELK19568.1| Integrator complex subunit 11 [Pteropus alecto]
          Length = 603

 Score = 46.6 bits (109), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K        A  P P VV  +   +  G S  +F 
Sbjct: 283 ITWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 338

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           +W  + +N +I+      GT+   +  L G R L ++ ++ + ++
Sbjct: 339 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 381


>gi|355744837|gb|EHH49462.1| hypothetical protein EGM_00117, partial [Macaca fascicularis]
          Length = 592

 Score = 46.6 bits (109), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K        A  P P VV  +   +  G S  +F 
Sbjct: 275 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 330

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           +W  + +N +I+      GT+   +  L G R L ++ ++ + ++
Sbjct: 331 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 373


>gi|342180524|emb|CCC90000.1| putative cleavage and polyadenylation specificity factor subunit
           [Trypanosoma congolense IL3000]
          Length = 766

 Score = 46.6 bits (109), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 28/88 (31%), Positives = 47/88 (53%), Gaps = 6/88 (6%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKV--PSPKVVLVSTPDMECGFSRDL 91
           +  M+D++ +  E  RN PF FK+++   SL +        P VVL S   ++ G S +L
Sbjct: 310 VSAMNDRVKEQHENHRN-PFVFKYIQ---SLLDTRSFEDTGPCVVLASPGMLQSGISLEL 365

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLI 119
           F +WC    N II+      GT+A++++
Sbjct: 366 FERWCGDKRNGIIVAGYCVDGTIAKEIL 393


>gi|397476278|ref|XP_003809534.1| PREDICTED: integrator complex subunit 11 isoform 2 [Pan paniscus]
          Length = 606

 Score = 46.6 bits (109), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K        A  P P VV  +   +  G S  +F 
Sbjct: 289 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 344

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           +W  + +N +I+      GT+   +  L G R L ++ ++ + ++
Sbjct: 345 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 387


>gi|193786492|dbj|BAG51775.1| unnamed protein product [Homo sapiens]
          Length = 606

 Score = 46.6 bits (109), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K        A  P P VV  +   +  G S  +F 
Sbjct: 289 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 344

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           +W  + +N +I+      GT+   +  L G R L ++ ++ + ++
Sbjct: 345 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 387


>gi|374253819|ref|NP_001243385.1| integrator complex subunit 11 isoform 1 [Homo sapiens]
 gi|119576642|gb|EAW56238.1| cleavage and polyadenylation specific factor 3-like, isoform CRA_f
           [Homo sapiens]
          Length = 606

 Score = 46.6 bits (109), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K        A  P P VV  +   +  G S  +F 
Sbjct: 289 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 344

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           +W  + +N +I+      GT+   +  L G R L ++ ++ + ++
Sbjct: 345 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 387


>gi|296206477|ref|XP_002750225.1| PREDICTED: integrator complex subunit 11 isoform 1 [Callithrix
           jacchus]
          Length = 600

 Score = 46.6 bits (109), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K        A  P P VV  +   +  G S  +F 
Sbjct: 283 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 338

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           +W  + +N +I+      GT+   +  L G R L ++ ++ + ++
Sbjct: 339 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 381


>gi|374253821|ref|NP_001243389.1| integrator complex subunit 11 isoform 3 [Homo sapiens]
 gi|194386866|dbj|BAG59799.1| unnamed protein product [Homo sapiens]
          Length = 571

 Score = 46.6 bits (109), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K        A  P P VV  +   +  G S  +F 
Sbjct: 254 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 309

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           +W  + +N +I+      GT+   +  L G R L ++ ++ + ++
Sbjct: 310 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 352


>gi|426327392|ref|XP_004024502.1| PREDICTED: integrator complex subunit 11 isoform 2 [Gorilla gorilla
           gorilla]
          Length = 606

 Score = 46.6 bits (109), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K        A  P P VV  +   +  G S  +F 
Sbjct: 289 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 344

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           +W  + +N +I+      GT+   +  L G R L ++ ++ + ++
Sbjct: 345 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 387


>gi|326932364|ref|XP_003212289.1| PREDICTED: integrator complex subunit 11-like [Meleagris gallopavo]
          Length = 600

 Score = 46.6 bits (109), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 28/100 (28%), Positives = 47/100 (47%), Gaps = 6/100 (6%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K        A  P P VV  +   +  G S  +F 
Sbjct: 283 ITWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 338

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKK 133
           +W  + +N +I+      GT+   +  L G R L ++ ++
Sbjct: 339 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQ 376


>gi|449268484|gb|EMC79348.1| Integrator complex subunit 11 [Columba livia]
          Length = 600

 Score = 46.6 bits (109), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K        A  P P VV  +   +  G S  +F 
Sbjct: 283 ITWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 338

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           +W  + +N +I+      GT+   +  L G R L ++ ++ + ++
Sbjct: 339 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQILEVK 381


>gi|430813249|emb|CCJ29377.1| unnamed protein product [Pneumocystis jirovecii]
          Length = 574

 Score = 46.6 bits (109), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 22/69 (31%), Positives = 37/69 (53%), Gaps = 1/69 (1%)

Query: 51  NPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTS 110
           NPF F+++    SL     +  P V+L S   ++ G SR L  +WC  P+N +I+     
Sbjct: 183 NPFIFRYISSLKSLDRFEDI-GPCVMLASPGMLQSGVSRALLEKWCPDPKNGLIVAGYCV 241

Query: 111 PGTLARDLI 119
            GT+A+ ++
Sbjct: 242 EGTMAKHIL 250


>gi|395840793|ref|XP_003793236.1| PREDICTED: integrator complex subunit 11 isoform 2 [Otolemur
           garnettii]
          Length = 499

 Score = 46.6 bits (109), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K        A  P P VV  +   +  G S  +F 
Sbjct: 182 ITWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 237

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           +W  + +N +I+      GT+   +  L G R L ++ ++ + ++
Sbjct: 238 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 280


>gi|61098197|ref|NP_001012854.1| integrator complex subunit 11 [Gallus gallus]
 gi|75571225|sp|Q5ZIH0.1|INT11_CHICK RecName: Full=Integrator complex subunit 11; Short=Int11; AltName:
           Full=Cleavage and polyadenylation-specific factor 3-like
           protein; Short=CPSF3-like protein
 gi|53135966|emb|CAG32473.1| hypothetical protein RCJMB04_26e19 [Gallus gallus]
          Length = 600

 Score = 46.6 bits (109), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 28/100 (28%), Positives = 47/100 (47%), Gaps = 6/100 (6%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K        A  P P VV  +   +  G S  +F 
Sbjct: 283 ITWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 338

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKK 133
           +W  + +N +I+      GT+   +  L G R L ++ ++
Sbjct: 339 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQ 376


>gi|426327394|ref|XP_004024503.1| PREDICTED: integrator complex subunit 11 isoform 3 [Gorilla gorilla
           gorilla]
          Length = 571

 Score = 46.2 bits (108), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K        A  P P VV  +   +  G S  +F 
Sbjct: 254 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 309

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           +W  + +N +I+      GT+   +  L G R L ++ ++ + ++
Sbjct: 310 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 352


>gi|403297738|ref|XP_003939709.1| PREDICTED: integrator complex subunit 11 isoform 1 [Saimiri
           boliviensis boliviensis]
          Length = 600

 Score = 46.2 bits (108), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K        A  P P VV  +   +  G S  +F 
Sbjct: 283 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 338

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           +W  + +N +I+      GT+   +  L G R L ++ ++ + ++
Sbjct: 339 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 381


>gi|402465801|gb|EJW01455.1| hypothetical protein EDEG_00447 [Edhazardia aedis USNM 41457]
          Length = 774

 Score = 46.2 bits (108), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 23/71 (32%), Positives = 39/71 (54%), Gaps = 3/71 (4%)

Query: 49  RNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 108
           +N  F FK+++  ++  +      P VV+ S   ++ G SRDLF +WC    N +II   
Sbjct: 284 KNVDFKFKYIRNINTFDDRN---LPCVVMASPGMLQSGLSRDLFEKWCEDKRNGVIIAGY 340

Query: 109 TSPGTLARDLI 119
              GTLA++++
Sbjct: 341 CVQGTLAKEIL 351


>gi|402852593|ref|XP_003891002.1| PREDICTED: integrator complex subunit 11 isoform 1 [Papio anubis]
 gi|355557446|gb|EHH14226.1| hypothetical protein EGK_00111 [Macaca mulatta]
 gi|387540112|gb|AFJ70683.1| integrator complex subunit 11 [Macaca mulatta]
          Length = 600

 Score = 46.2 bits (108), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K        A  P P VV  +   +  G S  +F 
Sbjct: 283 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 338

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           +W  + +N +I+      GT+   +  L G R L ++ ++ + ++
Sbjct: 339 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 381


>gi|296206479|ref|XP_002750226.1| PREDICTED: integrator complex subunit 11 isoform 2 [Callithrix
           jacchus]
          Length = 499

 Score = 46.2 bits (108), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K        A  P P VV  +   +  G S  +F 
Sbjct: 182 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 237

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           +W  + +N +I+      GT+   +  L G R L ++ ++ + ++
Sbjct: 238 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 280


>gi|33300633|ref|NP_060341.2| integrator complex subunit 11 isoform 2 [Homo sapiens]
 gi|118572557|sp|Q5TA45.2|INT11_HUMAN RecName: Full=Integrator complex subunit 11; Short=Int11; AltName:
           Full=Cleavage and polyadenylation-specific factor 3-like
           protein; Short=CPSF3-like protein; AltName: Full=Protein
           related to CPSF subunits of 68 kDa; Short=RC-68
 gi|14124912|gb|AAH07978.1| Cleavage and polyadenylation specific factor 3-like [Homo sapiens]
 gi|60650138|tpg|DAA05669.1| TPA_exp: beta-lactamase fold protein family member RC-68 [Homo
           sapiens]
 gi|78100161|tpg|DAA05728.1| TPA_exp: integrator complex subunit 11 [Homo sapiens]
 gi|119576636|gb|EAW56232.1| cleavage and polyadenylation specific factor 3-like, isoform CRA_a
           [Homo sapiens]
 gi|119576638|gb|EAW56234.1| cleavage and polyadenylation specific factor 3-like, isoform CRA_a
           [Homo sapiens]
          Length = 600

 Score = 46.2 bits (108), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K        A  P P VV  +   +  G S  +F 
Sbjct: 283 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 338

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           +W  + +N +I+      GT+   +  L G R L ++ ++ + ++
Sbjct: 339 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 381


>gi|12053137|emb|CAB66747.1| hypothetical protein [Homo sapiens]
 gi|49065540|emb|CAG38588.1| FLJ20542 [Homo sapiens]
 gi|117645260|emb|CAL38096.1| hypothetical protein [synthetic construct]
 gi|208966056|dbj|BAG73042.1| cleavage and polyadenylation specific factor 3-like [synthetic
           construct]
          Length = 600

 Score = 46.2 bits (108), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K        A  P P VV  +   +  G S  +F 
Sbjct: 283 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 338

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           +W  + +N +I+      GT+   +  L G R L ++ ++ + ++
Sbjct: 339 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 381


>gi|354495797|ref|XP_003510015.1| PREDICTED: integrator complex subunit 11-like [Cricetulus griseus]
 gi|344251677|gb|EGW07781.1| Integrator complex subunit 11 [Cricetulus griseus]
          Length = 600

 Score = 46.2 bits (108), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K        A  P P VV  +   +  G S  +F 
Sbjct: 283 ITWTNQKIRKTF--VQRNMFEFKHIKAFDR--TFADNPGPMVVFATPGMLHAGQSLQIFR 338

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           +W  + +N +I+      GT+   +  L G R L ++ ++ + ++
Sbjct: 339 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 381


>gi|224079882|ref|XP_002197797.1| PREDICTED: integrator complex subunit 11 [Taeniopygia guttata]
          Length = 600

 Score = 46.2 bits (108), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 28/100 (28%), Positives = 47/100 (47%), Gaps = 6/100 (6%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K        A  P P VV  +   +  G S  +F 
Sbjct: 283 ITWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 338

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKK 133
           +W  + +N +I+      GT+   +  L G R L ++ ++
Sbjct: 339 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQ 376


>gi|380798915|gb|AFE71333.1| integrator complex subunit 11 isoform 2, partial [Macaca mulatta]
          Length = 588

 Score = 46.2 bits (108), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K        A  P P VV  +   +  G S  +F 
Sbjct: 271 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 326

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           +W  + +N +I+      GT+   +  L G R L ++ ++ + ++
Sbjct: 327 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 369


>gi|34783058|gb|AAH00675.2| CPSF3L protein, partial [Homo sapiens]
          Length = 473

 Score = 46.2 bits (108), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K        A  P P VV  +   +  G S  +F 
Sbjct: 156 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 211

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           +W  + +N +I+      GT+   +  L G R L ++ ++ + ++
Sbjct: 212 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 254


>gi|426327390|ref|XP_004024501.1| PREDICTED: integrator complex subunit 11 isoform 1 [Gorilla gorilla
           gorilla]
          Length = 600

 Score = 46.2 bits (108), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K        A  P P VV  +   +  G S  +F 
Sbjct: 283 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 338

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           +W  + +N +I+      GT+   +  L G R L ++ ++ + ++
Sbjct: 339 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 381


>gi|14165488|gb|AAH08041.1| CPSF3L protein [Homo sapiens]
 gi|18044933|gb|AAH20199.1| CPSF3L protein [Homo sapiens]
          Length = 389

 Score = 46.2 bits (108), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K        A  P P VV  +   +  G S  +F 
Sbjct: 163 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 218

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           +W  + +N +I+      GT+   +  L G R L ++ ++ + ++
Sbjct: 219 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 261


>gi|402852595|ref|XP_003891003.1| PREDICTED: integrator complex subunit 11 isoform 2 [Papio anubis]
          Length = 499

 Score = 46.2 bits (108), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K        A  P P VV  +   +  G S  +F 
Sbjct: 182 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 237

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           +W  + +N +I+      GT+   +  L G R L ++ ++ + ++
Sbjct: 238 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 280


>gi|207079923|ref|NP_001128922.1| DKFZP459J1110 protein [Pongo abelii]
 gi|56403907|emb|CAI29738.1| hypothetical protein [Pongo abelii]
          Length = 600

 Score = 46.2 bits (108), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K        A  P P VV  +   +  G S  +F 
Sbjct: 283 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 338

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           +W  + +N +I+      GT+   +  L G R L ++ ++ + ++
Sbjct: 339 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 381


>gi|403418874|emb|CCM05574.1| predicted protein [Fibroporia radiculosa]
          Length = 826

 Score = 46.2 bits (108), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 31/120 (25%), Positives = 55/120 (45%), Gaps = 23/120 (19%)

Query: 53  FHFKHVKLC---HSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRT 109
             F+H++      +L        PK++L     +  G SR LF ++  +P+N +++T R 
Sbjct: 277 LRFRHLEFFPNPQALLHTYSSKDPKLILAVPASLSHGPSRVLFTEFAETPDNVVLLTGRG 336

Query: 110 SPGTLARDLIE----------------LGGN----RTLTLQVKKRIRLEGEELEEYQKKK 149
             GTL R L +                +G N      L L++  ++ L+G ELEE+ +K+
Sbjct: 337 EEGTLGRILFDRWNNAQRDDVKWDRGRIGSNVMLDGILHLEIHSKVPLQGTELEEFLQKQ 396


>gi|403297740|ref|XP_003939710.1| PREDICTED: integrator complex subunit 11 isoform 2 [Saimiri
           boliviensis boliviensis]
          Length = 499

 Score = 46.2 bits (108), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K        A  P P VV  +   +  G S  +F 
Sbjct: 182 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 237

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           +W  + +N +I+      GT+   +  L G R L ++ ++ + ++
Sbjct: 238 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 280


>gi|397476280|ref|XP_003809535.1| PREDICTED: integrator complex subunit 11 isoform 3 [Pan paniscus]
          Length = 499

 Score = 46.2 bits (108), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K        A  P P VV  +   +  G S  +F 
Sbjct: 182 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 237

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           +W  + +N +I+      GT+   +  L G R L ++ ++ + ++
Sbjct: 238 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 280


>gi|118572558|sp|Q5NVE6.2|INT11_PONAB RecName: Full=Integrator complex subunit 11; Short=Int11; AltName:
           Full=Cleavage and polyadenylation-specific factor 3-like
           protein; Short=CPSF3-like protein
          Length = 600

 Score = 46.2 bits (108), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K        A  P P VV  +   +  G S  +F 
Sbjct: 283 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 338

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           +W  + +N +I+      GT+   +  L G R L ++ ++ + ++
Sbjct: 339 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 381


>gi|389638668|ref|XP_003716967.1| hypothetical protein MGG_06570 [Magnaporthe oryzae 70-15]
 gi|351642786|gb|EHA50648.1| hypothetical protein MGG_06570 [Magnaporthe oryzae 70-15]
 gi|440474177|gb|ELQ42934.1| cleavage and polyadenylation specificity factor subunit 2
           [Magnaporthe oryzae Y34]
 gi|440484966|gb|ELQ64966.1| cleavage and polyadenylation specificity factor subunit 2
           [Magnaporthe oryzae P131]
          Length = 962

 Score = 46.2 bits (108), Expect = 0.014,   Method: Composition-based stats.
 Identities = 42/180 (23%), Positives = 72/180 (40%), Gaps = 59/180 (32%)

Query: 26  HSLAELAK--IEWMSDKLMKSFE--------------------GARNNPFHFKHVKLCHS 63
           HS  +LAK   EWM + +++ FE                    G    PF FK+++L   
Sbjct: 320 HSTIKLAKSMFEWMDNSIVQEFEAGADQGFRRTNGAGGNADAKGKDGGPFDFKYLRLLDR 379

Query: 64  LAELAKVPSP-------KVVLVSTPDMECGFSRDLFFQWCSSPENSIII------TNRTS 110
            A++ K+  P       KV+L +   +E GFS+D+     +   N +I+      ++R +
Sbjct: 380 KAQVLKLLEPSTDELRGKVILATDTSLEWGFSKDIISAIANDSRNMVILPEKPAESSRDN 439

Query: 111 PGTLARDLIE-----------------------LGGNRTLTLQVKKRIRLEGEELEEYQK 147
           P +++R L                           G R L ++  K++ L   EL  YQ+
Sbjct: 440 P-SISRQLWRWWKERRDGVADEQSSGAGSAEQVFAGGRELQIRESKKVPLADSELSIYQQ 498


>gi|10433243|dbj|BAB13943.1| unnamed protein product [Homo sapiens]
          Length = 499

 Score = 46.2 bits (108), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K        A  P P VV  +   +  G S  +F 
Sbjct: 182 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 237

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           +W  + +N +I+      GT+   +  L G R L ++ ++ + ++
Sbjct: 238 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 280


>gi|397476276|ref|XP_003809533.1| PREDICTED: integrator complex subunit 11 isoform 1 [Pan paniscus]
 gi|410206788|gb|JAA00613.1| cleavage and polyadenylation specific factor 3-like [Pan
           troglodytes]
 gi|410251172|gb|JAA13553.1| cleavage and polyadenylation specific factor 3-like [Pan
           troglodytes]
 gi|410297680|gb|JAA27440.1| cleavage and polyadenylation specific factor 3-like [Pan
           troglodytes]
 gi|410349815|gb|JAA41511.1| cleavage and polyadenylation specific factor 3-like [Pan
           troglodytes]
          Length = 600

 Score = 46.2 bits (108), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K        A  P P VV  +   +  G S  +F 
Sbjct: 283 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 338

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           +W  + +N +I+      GT+   +  L G R L ++ ++ + ++
Sbjct: 339 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 381


>gi|410989914|ref|XP_004001198.1| PREDICTED: LOW QUALITY PROTEIN: integrator complex subunit 11
           [Felis catus]
          Length = 598

 Score = 46.2 bits (108), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K        A  P P VV  +   +  G S  +F 
Sbjct: 283 ITWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 338

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           +W  + +N +I+      GT+   +  L G R L ++ ++ + ++
Sbjct: 339 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 381


>gi|343958314|dbj|BAK63012.1| protein related to CPSF subunits 68 kDa [Pan troglodytes]
          Length = 600

 Score = 46.2 bits (108), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K        A  P P VV  +   +  G S  +F 
Sbjct: 283 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 338

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           +W  + +N +I+      GT+   +  L G R L ++ ++ + ++
Sbjct: 339 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 381


>gi|66475248|ref|XP_627440.1| inactive CPSFs Cft2p metallobeta-lactamase [Cryptosporidium parvum
           Iowa II]
 gi|46228911|gb|EAK89760.1| inactive CPSFs Cft2p metallobeta-lactamase [Cryptosporidium parvum
           Iowa II]
          Length = 1196

 Score = 46.2 bits (108), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 30/88 (34%), Positives = 46/88 (52%), Gaps = 8/88 (9%)

Query: 34  IEWMSDKLMKSFEGARNNPFH-FKHVKLCHSLAE-----LAKVPSPKVVLVSTPDMECGF 87
           IEWMS ++   F  +R NPFH  K++ L  +L       L+KVP  KV+      M+ G+
Sbjct: 335 IEWMSLEIRSEFCDSRFNPFHDLKNIILETNLKNIRSENLSKVP--KVIFAFPESMDYGY 392

Query: 88  SRDLFFQWCSSPENSIIITNRTSPGTLA 115
           SR+LF +  ++  N+I+        T A
Sbjct: 393 SRELFTELATNENNTIMFVREPKENTFA 420


>gi|297279172|ref|XP_001092173.2| PREDICTED: integrator complex subunit 11 isoform 3 [Macaca mulatta]
          Length = 579

 Score = 46.2 bits (108), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K        A  P P VV  +   +  G S  +F 
Sbjct: 262 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 317

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           +W  + +N +I+      GT+   +  L G R L ++ ++ + ++
Sbjct: 318 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 360


>gi|119576648|gb|EAW56244.1| cleavage and polyadenylation specific factor 3-like, isoform CRA_j
           [Homo sapiens]
          Length = 476

 Score = 46.2 bits (108), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K        A  P P VV  +   +  G S  +F 
Sbjct: 159 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 214

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           +W  + +N +I+      GT+   +  L G R L ++ ++ + ++
Sbjct: 215 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 257


>gi|393215649|gb|EJD01140.1| cleavage and polyadenylation specificity factor subunit
           [Fomitiporia mediterranea MF3/22]
          Length = 922

 Score = 46.2 bits (108), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 30/93 (32%), Positives = 47/93 (50%), Gaps = 20/93 (21%)

Query: 73  PKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIE------------ 120
           PK++L     +  G SR +F ++ S  +N +++T+    GTLAR L +            
Sbjct: 397 PKLILAVPVSLSHGSSRSIFSEFASVADNVVLLTSPGEDGTLARTLFDMWNDEQREDDKW 456

Query: 121 ----LGGN----RTLTLQVKKRIRLEGEELEEY 145
               LG N    +TL L +K ++ L+G ELEEY
Sbjct: 457 NKGKLGRNVMLDKTLKLTMKSKVPLQGVELEEY 489


>gi|405958713|gb|EKC24813.1| Integrator complex subunit 11 [Crassostrea gigas]
          Length = 575

 Score = 46.2 bits (108), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 30/100 (30%), Positives = 45/100 (45%), Gaps = 6/100 (6%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W S K+ K+F   + N F FKH+K           P P VV  +   +  G S  +F 
Sbjct: 256 ITWTSQKIKKTF--VQRNMFEFKHIKPFDR--AFIDNPGPMVVFATPGMLHAGLSLQIFK 311

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKK 133
           +W  +  N +I+      GT+   +  L G R L L+ K+
Sbjct: 312 KWAPNELNMVIMPGYCVAGTVGHKI--LNGARKLELENKQ 349


>gi|347838796|emb|CCD53368.1| similar to cleavage and polyadenylation specificity factor subunit
           2 [Botryotinia fuckeliana]
          Length = 934

 Score = 46.2 bits (108), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 42/169 (24%), Positives = 67/169 (39%), Gaps = 56/169 (33%)

Query: 35  EWMSDKLMKSFEGA------------RN--------NPFHFKHVKLCHSLAELAKV---- 70
           EWM + +++ FE              RN         PF FKH++L     ++ ++    
Sbjct: 309 EWMDEAIIREFEAQPGHEEQRTGQQRRNAEEAKQHIGPFEFKHLRLLGRKGQIDRMLNET 368

Query: 71  -----PSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGT----LARDLIE- 120
                   KV+L S   +E GFS+++  +     +N +I+T R +P +    L R L   
Sbjct: 369 DNLGRSVGKVILASDTSIEWGFSKEVLCKIADDDKNLLILTERLNPISGAPGLGRTLWSW 428

Query: 121 ----------------------LGGNRTLTLQVKKRIRLEGEELEEYQK 147
                                  GG R L ++  KRI LEG +L  YQ+
Sbjct: 429 WEERRDGVISEPSSNGGVLEQVYGGGRDLEIKEPKRIPLEGNDLTVYQQ 477


>gi|340509014|gb|EGR34593.1| hypothetical protein IMG5_006210 [Ichthyophthirius multifiliis]
          Length = 456

 Score = 46.2 bits (108), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 27/87 (31%), Positives = 44/87 (50%), Gaps = 6/87 (6%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLF 92
           + W ++K+   +    +N F+FKH+K        A +P   +VL +TP M   G S  +F
Sbjct: 307 VNWTNEKIKSCY--LIDNMFNFKHIKPFQKSLIKANMP---MVLFATPGMLHAGLSMQVF 361

Query: 93  FQWCSSPENSIIITNRTSPGTLARDLI 119
            +WC   +N++II      GTL   L+
Sbjct: 362 KEWCYDSKNTLIIPGYCVAGTLGNKLL 388


>gi|374253828|ref|NP_001243392.1| integrator complex subunit 11 isoform 5 [Homo sapiens]
 gi|119576639|gb|EAW56235.1| cleavage and polyadenylation specific factor 3-like, isoform CRA_c
           [Homo sapiens]
 gi|119576644|gb|EAW56240.1| cleavage and polyadenylation specific factor 3-like, isoform CRA_c
           [Homo sapiens]
          Length = 499

 Score = 46.2 bits (108), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K        A  P P VV  +   +  G S  +F 
Sbjct: 182 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 237

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           +W  + +N +I+      GT+   +  L G R L ++ ++ + ++
Sbjct: 238 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 280


>gi|426327396|ref|XP_004024504.1| PREDICTED: integrator complex subunit 11 isoform 4 [Gorilla gorilla
           gorilla]
          Length = 499

 Score = 46.2 bits (108), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K        A  P P VV  +   +  G S  +F 
Sbjct: 182 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 237

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           +W  + +N +I+      GT+   +  L G R L ++ ++ + ++
Sbjct: 238 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 280


>gi|406866779|gb|EKD19818.1| metallo-beta-lactamase superfamily protein [Marssonina brunnea f.
           sp. 'multigermtubi' MB_m1]
          Length = 823

 Score = 46.2 bits (108), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 24/87 (27%), Positives = 49/87 (56%), Gaps = 3/87 (3%)

Query: 35  EWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFF 93
           E M++    S   A+  P+ FK+++   +L     V   + V++++P M + G SR+L  
Sbjct: 317 ERMAEAEASSDTAAKGGPWDFKYIRSLKNLDRFDDVG--RCVMLASPGMLQNGVSRELLE 374

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIE 120
           +W  S +N ++IT  +  GT+A+ +++
Sbjct: 375 RWAPSEKNGVVITGYSVEGTMAKQIMQ 401


>gi|154292337|ref|XP_001546744.1| hypothetical protein BC1G_14624 [Botryotinia fuckeliana B05.10]
          Length = 901

 Score = 46.2 bits (108), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 42/169 (24%), Positives = 67/169 (39%), Gaps = 56/169 (33%)

Query: 35  EWMSDKLMKSFEGA------------RN--------NPFHFKHVKLCHSLAELAKV---- 70
           EWM + +++ FE              RN         PF FKH++L     ++ ++    
Sbjct: 309 EWMDEAIIREFEAQPGHEEQRTGQQRRNAEEAKQHIGPFEFKHLRLLGRKGQIDRMLNET 368

Query: 71  -----PSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGT----LARDLIE- 120
                   KV+L S   +E GFS+++  +     +N +I+T R +P +    L R L   
Sbjct: 369 DNLGRSVGKVILASDTSIEWGFSKEVLCKIADDDKNLLILTERLNPISGAPGLGRTLWSW 428

Query: 121 ----------------------LGGNRTLTLQVKKRIRLEGEELEEYQK 147
                                  GG R L ++  KRI LEG +L  YQ+
Sbjct: 429 WEERRDGVISEPSSNGGVLEQVYGGGRDLEIKEPKRIPLEGNDLTVYQQ 477


>gi|119576641|gb|EAW56237.1| cleavage and polyadenylation specific factor 3-like, isoform CRA_e
           [Homo sapiens]
          Length = 578

 Score = 46.2 bits (108), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K        A  P P VV  +   +  G S  +F 
Sbjct: 261 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 316

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           +W  + +N +I+      GT+   +  L G R L ++ ++ + ++
Sbjct: 317 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 359


>gi|449435478|ref|XP_004135522.1| PREDICTED: LOW QUALITY PROTEIN: cleavage and polyadenylation
           specificity factor subunit 3-I-like [Cucumis sativus]
          Length = 481

 Score = 46.2 bits (108), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 29/92 (31%), Positives = 48/92 (52%), Gaps = 8/92 (8%)

Query: 14  NNPFHFKH--VKLCHSLAELAKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVP 71
           N P ++     K C ++ E   +  M+D++    + A++NPF FK++    S+ E+ K  
Sbjct: 278 NIPIYYASPLAKRCLTVYETYTLS-MNDRI----QNAKSNPFRFKYISPLKSI-EVFKDV 331

Query: 72  SPKVVLVSTPDMECGFSRDLFFQWCSSPENSI 103
            P VV+ S   ++ G SR LF  WCS    S+
Sbjct: 332 GPSVVMASPSGLQSGLSRQLFEMWCSEKHVSL 363


>gi|322699261|gb|EFY91024.1| cleavage and polyadenylation specifity factor [Metarhizium acridum
           CQMa 102]
          Length = 829

 Score = 46.2 bits (108), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 25/79 (31%), Positives = 47/79 (59%), Gaps = 4/79 (5%)

Query: 44  SFEGA-RNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFFQWCSSPEN 101
           S +GA +  P+ FK+++   +L     V     V++++P M + G SR+LF +W  + +N
Sbjct: 326 SGDGAGQGGPWDFKYIRSLKNLDRFDDVGG--CVMLASPGMLQSGVSRELFERWAPNEKN 383

Query: 102 SIIITNRTSPGTLARDLIE 120
            +IIT  +  GT+AR +++
Sbjct: 384 GVIITGYSVEGTMARQIMQ 402


>gi|441671688|ref|XP_004093259.1| PREDICTED: LOW QUALITY PROTEIN: integrator complex subunit 11
           [Nomascus leucogenys]
          Length = 585

 Score = 46.2 bits (108), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K        A  P P VV  +   +  G S  +F 
Sbjct: 283 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 338

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           +W  + +N +I+      GT+   +  L G R L ++ ++ + ++
Sbjct: 339 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 381


>gi|331212217|ref|XP_003307378.1| hypothetical protein PGTG_00328 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
 gi|309297781|gb|EFP74372.1| hypothetical protein PGTG_00328 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
          Length = 950

 Score = 46.2 bits (108), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 31/91 (34%), Positives = 46/91 (50%), Gaps = 14/91 (15%)

Query: 35  EWMSDKLMKSF-------EGARN----NPFHFKHVKLCHSLAEL-AKVP--SPKVVLVST 80
           EWMS  L +S         G R+     P   +H++  +S+  L A++P   PKV+L   
Sbjct: 356 EWMSPALARSSFDQNHHKRGNRDQNDQGPLRLRHIRFFNSVEALEAELPIRQPKVILAVP 415

Query: 81  PDMECGFSRDLFFQWCSSPENSIIITNRTSP 111
             ME GFSR +F +      N II+T+ +SP
Sbjct: 416 LSMEYGFSRAMFTRIAGVEGNLIILTSLSSP 446


>gi|291238246|ref|XP_002739041.1| PREDICTED: cleavage and polyadenylation specific factor 3-like
           [Saccoglossus kowalevskii]
          Length = 573

 Score = 45.8 bits (107), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 37/159 (23%), Positives = 71/159 (44%), Gaps = 9/159 (5%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F F+H+K           P P VV  +   +  G S  +F 
Sbjct: 256 ITWTNQKIRKTF--VQRNMFEFRHIKPFDR--SYTDNPGPMVVFATPGMLHGGLSLHVFK 311

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE-GEELEEYQKKKDKE 152
           +W S+ +N +I+      GT+   +  L G R + L+ ++ I ++   +   +    D +
Sbjct: 312 KWASNEKNMVIMPGYCVAGTVGHKI--LNGQRKIELENRQTIDVKLSVQYMSFSAHADAK 369

Query: 153 AKDKQEKEKIPPHDTSFINELQLSDF--KQTLQRNGIDC 189
              +  K+  P +      E +  DF  ++ +Q+ G+ C
Sbjct: 370 GIMQLIKQCEPKNVMLVHGEAKKMDFLKQKIVQQFGVQC 408


>gi|39645207|gb|AAH13904.2| CPSF3L protein, partial [Homo sapiens]
          Length = 429

 Score = 45.8 bits (107), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K        A  P P VV  +   +  G S  +F 
Sbjct: 112 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 167

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           +W  + +N +I+      GT+   +  L G R L ++ ++ + ++
Sbjct: 168 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 210


>gi|351697497|gb|EHB00416.1| Integrator complex subunit 11 [Heterocephalus glaber]
          Length = 672

 Score = 45.8 bits (107), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K        A  P P VV  +   +  G S  +F 
Sbjct: 356 ITWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 411

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           +W  + +N +I+      GT+   +  L G R L ++ ++ + ++
Sbjct: 412 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 454


>gi|119576643|gb|EAW56239.1| cleavage and polyadenylation specific factor 3-like, isoform CRA_g
           [Homo sapiens]
 gi|119576646|gb|EAW56242.1| cleavage and polyadenylation specific factor 3-like, isoform CRA_g
           [Homo sapiens]
          Length = 342

 Score = 45.8 bits (107), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K        A  P P VV  +   +  G S  +F 
Sbjct: 25  IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 80

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           +W  + +N +I+      GT+   +  L G R L ++ ++ + ++
Sbjct: 81  KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 123


>gi|449296201|gb|EMC92221.1| hypothetical protein BAUCODRAFT_569527 [Baudoinia compniacensis
           UAMH 10762]
          Length = 834

 Score = 45.8 bits (107), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 26/93 (27%), Positives = 49/93 (52%), Gaps = 9/93 (9%)

Query: 34  IEWMSDKLMKSFEGARNNP------FHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECG 86
           I+ M++ +   F+ A+ NP      + F+H++   SL     V     V++++P M + G
Sbjct: 310 IDAMNENIRTKFQAAQANPDGVGGPWDFQHIRSLKSLERFDDVGG--CVMLASPGMLQNG 367

Query: 87  FSRDLFFQWCSSPENSIIITNRTSPGTLARDLI 119
            SR L  +W    +N +IIT  +  GT+A+ ++
Sbjct: 368 VSRSLLERWAPDAKNGVIITGYSVEGTMAKSIM 400


>gi|397639513|gb|EJK73612.1| hypothetical protein THAOC_04754 [Thalassiosira oceanica]
          Length = 454

 Score = 45.8 bits (107), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 26/91 (28%), Positives = 44/91 (48%), Gaps = 10/91 (10%)

Query: 50  NNPFHFKHVKLCHSL-AELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 108
            NPF F+H++   S+         P VV  S   ++ G SR LF +W + P+N ++I   
Sbjct: 307 GNPFSFRHIRNLKSIDVNNFDDRGPSVVFASPGMLQSGVSRQLFDRWATDPKNGVLIAGY 366

Query: 109 TSPGTLARDLIELGGNRTLTLQVKKRIRLEG 139
               TLA++++          Q K+ + +EG
Sbjct: 367 AVEHTLAKEIMS---------QPKEVVTMEG 388


>gi|209876680|ref|XP_002139782.1| cleavage and polyadenylation specificity factor subunit 3
           [Cryptosporidium muris RN66]
 gi|209555388|gb|EEA05433.1| cleavage and polyadenylation specificity factor subunit 3, putative
           [Cryptosporidium muris RN66]
          Length = 767

 Score = 45.8 bits (107), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 33/109 (30%), Positives = 50/109 (45%), Gaps = 23/109 (21%)

Query: 37  MSDKLMKSFEGARN--------------NPFHFKHVKLCHSLAELAKV---PSPKVVLVS 79
           MS K M+ FE   N              NPF+F+ VK  +S++E+        P V++ +
Sbjct: 300 MSVKCMRVFETYINQCGDAVRKQADMGINPFNFQFVKTVNSISEIKDAIYSEGPCVIMAA 359

Query: 80  TPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLAR------DLIELG 122
              ++ G SRD+F  W     N +I+T     GT A       D+I+LG
Sbjct: 360 PGMLQNGTSRDIFEVWAPDKRNGVILTGYAIRGTPAYELRREPDIIQLG 408


>gi|410928941|ref|XP_003977858.1| PREDICTED: integrator complex subunit 11-like [Takifugu rubripes]
          Length = 601

 Score = 45.8 bits (107), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 28/97 (28%), Positives = 45/97 (46%), Gaps = 6/97 (6%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K        A  P P VV  +   +  G S  +F 
Sbjct: 283 ITWTNQKIRKTF--VQRNMFEFKHIKAFDR--SYADNPGPMVVFATPGMLHAGQSLQIFK 338

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQ 130
           +W  + +N +I+      GT+   +  L G R L ++
Sbjct: 339 KWAGNEKNMVIMPGYCVQGTIGHKI--LNGQRKLEME 373


>gi|432866809|ref|XP_004070946.1| PREDICTED: integrator complex subunit 11-like [Oryzias latipes]
          Length = 599

 Score = 45.8 bits (107), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 28/97 (28%), Positives = 45/97 (46%), Gaps = 6/97 (6%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K        A  P P VV  +   +  G S  +F 
Sbjct: 283 ITWTNQKIRKTF--VQRNMFEFKHIKAFDR--SYADNPGPMVVFATPGMLHAGQSLQIFK 338

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQ 130
           +W  + +N +I+      GT+   +  L G R L ++
Sbjct: 339 KWAGNEKNMVIMPGYCVQGTIGHKI--LNGQRKLEME 373


>gi|374253826|ref|NP_001243391.1| integrator complex subunit 11 isoform 4 [Homo sapiens]
          Length = 502

 Score = 45.4 bits (106), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K        A  P P VV  +   +  G S  +F 
Sbjct: 185 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 240

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           +W  + +N +I+      GT+   +  L G R L ++ ++ + ++
Sbjct: 241 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 283


>gi|440298403|gb|ELP91039.1| Cleavage and polyadenylation specificity factor subunit, putative
           [Entamoeba invadens IP1]
          Length = 788

 Score = 45.4 bits (106), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 23/72 (31%), Positives = 41/72 (56%), Gaps = 1/72 (1%)

Query: 49  RNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFFQWCSSPENSIIITN 107
           +   F+FK ++   S  + + + +   V++++P M + GFSR LF +WC+   N +II  
Sbjct: 304 KRQAFNFKFIRDGSSSVDDSTIDNHPCVVLASPGMLQDGFSRTLFERWCTDKNNGVIIPG 363

Query: 108 RTSPGTLARDLI 119
               GTLA+ +I
Sbjct: 364 YCVEGTLAKQII 375


>gi|426327398|ref|XP_004024505.1| PREDICTED: integrator complex subunit 11 isoform 5 [Gorilla gorilla
           gorilla]
          Length = 502

 Score = 45.4 bits (106), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K        A  P P VV  +   +  G S  +F 
Sbjct: 185 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 240

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           +W  + +N +I+      GT+   +  L G R L ++ ++ + ++
Sbjct: 241 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 283


>gi|119576647|gb|EAW56243.1| cleavage and polyadenylation specific factor 3-like, isoform CRA_i
           [Homo sapiens]
          Length = 502

 Score = 45.4 bits (106), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K        A  P P VV  +   +  G S  +F 
Sbjct: 185 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 240

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           +W  + +N +I+      GT+   +  L G R L ++ ++ + ++
Sbjct: 241 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 283


>gi|348503157|ref|XP_003439132.1| PREDICTED: integrator complex subunit 11-like [Oreochromis
           niloticus]
          Length = 601

 Score = 45.4 bits (106), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 28/97 (28%), Positives = 45/97 (46%), Gaps = 6/97 (6%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K        A  P P VV  +   +  G S  +F 
Sbjct: 283 ITWTNQKIRKTF--VQRNMFEFKHIKAFDR--SYADNPGPMVVFATPGMLHAGQSLQIFK 338

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQ 130
           +W  + +N +I+      GT+   +  L G R L ++
Sbjct: 339 KWAGNEKNMVIMPGYCVQGTIGHKI--LNGQRKLEME 373


>gi|30677952|ref|NP_178282.2| cleavage and polyadenylation specificity factor subunit 3-II
           [Arabidopsis thaliana]
 gi|332278175|sp|Q8GUU3.2|CPS3B_ARATH RecName: Full=Cleavage and polyadenylation specificity factor
           subunit 3-II; AltName: Full=Cleavage and polyadenylation
           specificity factor 73 kDa subunit II; Short=AtCPSF73-II;
           Short=CPSF 73 kDa subunit II; AltName: Full=Protein
           EMBRYO SAC DEVELOPMENT ARREST 26
 gi|62320470|dbj|BAD94982.1| putative cleavage and polyadenylation specifity factor [Arabidopsis
           thaliana]
 gi|330250395|gb|AEC05489.1| cleavage and polyadenylation specificity factor subunit 3-II
           [Arabidopsis thaliana]
          Length = 613

 Score = 45.4 bits (106), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 30/87 (34%), Positives = 42/87 (48%), Gaps = 6/87 (6%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMEC-GFSRDLF 92
           I W S  + +      +NPF FK+VK       L   P P  VL +TP M C GFS ++F
Sbjct: 282 ISWTSQNVKEKHN--THNPFDFKNVKDFDR--SLIHAPGP-CVLFATPGMLCAGFSLEVF 336

Query: 93  FQWCSSPENSIIITNRTSPGTLARDLI 119
             W  SP N + +   +  GT+   L+
Sbjct: 337 KHWAPSPLNLVALPGYSVAGTVGHKLM 363


>gi|260790823|ref|XP_002590440.1| hypothetical protein BRAFLDRAFT_289082 [Branchiostoma floridae]
 gi|229275634|gb|EEN46451.1| hypothetical protein BRAFLDRAFT_289082 [Branchiostoma floridae]
          Length = 597

 Score = 45.4 bits (106), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 41/162 (25%), Positives = 67/162 (41%), Gaps = 11/162 (6%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K           P P VV  +   +  G S  +F 
Sbjct: 276 ITWTNQKIRKTF--VKRNMFEFKHIKAFDR--SYIDNPGPMVVFATPGMLHAGLSLQIFK 331

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKK--RIRLEGEELEEYQKKKDK 151
           +W    +N +I+      GT+   +  L G R +  + KK   +R+  E +  +    D 
Sbjct: 332 KWAPDSKNMVIMPGYCVAGTVGHKI--LNGIRKIEFENKKVLEVRMTVEYM-SFSAHADA 388

Query: 152 EAKDKQEKEKIPPHDTSFINELQLSDF--KQTLQRNGIDCEF 191
           +   +  +   P +      E Q  DF  K+  Q  G+ C F
Sbjct: 389 QGIMQLIRYCEPRNVMLVHGEEQKMDFLSKKITQEFGVQCFF 430


>gi|297814408|ref|XP_002875087.1| hypothetical protein ARALYDRAFT_322516 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297320925|gb|EFH51346.1| hypothetical protein ARALYDRAFT_322516 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 819

 Score = 45.4 bits (106), Expect = 0.027,   Method: Compositional matrix adjust.
 Identities = 30/87 (34%), Positives = 42/87 (48%), Gaps = 6/87 (6%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMEC-GFSRDLF 92
           I W S  + +      +NPF FK+VK       L   P P  VL +TP M C GFS ++F
Sbjct: 282 ISWTSQNVKEKHN--THNPFDFKNVKDFDR--SLIHAPGP-CVLFATPGMLCAGFSLEVF 336

Query: 93  FQWCSSPENSIIITNRTSPGTLARDLI 119
             W  SP N + +   +  GT+   L+
Sbjct: 337 KHWAPSPLNLVALPGYSVAGTVGHKLM 363


>gi|157876175|ref|XP_001686447.1| putative cleavage and polyadenylation specificity factor
           [Leishmania major strain Friedlin]
 gi|68129521|emb|CAJ08064.1| putative cleavage and polyadenylation specificity factor
           [Leishmania major strain Friedlin]
          Length = 756

 Score = 45.4 bits (106), Expect = 0.028,   Method: Compositional matrix adjust.
 Identities = 28/88 (31%), Positives = 46/88 (52%), Gaps = 6/88 (6%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVP--SPKVVLVSTPDMECGFSRDL 91
           +  M+D++ K      +NPF FK+++   SL +        P VVL S   ++ G S +L
Sbjct: 298 VSAMNDRV-KQQHANHHNPFVFKYIR---SLMDTKSFEDNGPCVVLASPGMLQSGISLEL 353

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLI 119
           F +WC    N II+      GT+A+D++
Sbjct: 354 FERWCGDRRNGIIMAGYCVDGTIAKDVL 381


>gi|4220489|gb|AAD12712.1| putative cleavage and polyadenylation specifity factor [Arabidopsis
           thaliana]
          Length = 837

 Score = 45.4 bits (106), Expect = 0.028,   Method: Compositional matrix adjust.
 Identities = 30/87 (34%), Positives = 42/87 (48%), Gaps = 6/87 (6%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMEC-GFSRDLF 92
           I W S  + +      +NPF FK+VK       L   P P  VL +TP M C GFS ++F
Sbjct: 282 ISWTSQNVKEKHN--THNPFDFKNVKDFDR--SLIHAPGP-CVLFATPGMLCAGFSLEVF 336

Query: 93  FQWCSSPENSIIITNRTSPGTLARDLI 119
             W  SP N + +   +  GT+   L+
Sbjct: 337 KHWAPSPLNLVALPGYSVAGTVGHKLM 363


>gi|308509314|ref|XP_003116840.1| hypothetical protein CRE_01624 [Caenorhabditis remanei]
 gi|308241754|gb|EFO85706.1| hypothetical protein CRE_01624 [Caenorhabditis remanei]
          Length = 612

 Score = 45.4 bits (106), Expect = 0.028,   Method: Compositional matrix adjust.
 Identities = 30/87 (34%), Positives = 42/87 (48%), Gaps = 6/87 (6%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLF 92
           I W ++ + K+F     N F FKH++      E    P P+V L STP M   G S  +F
Sbjct: 288 ISWTNENIKKTF--VERNMFEFKHIRPMEKGCE--DQPGPQV-LFSTPGMLHGGQSLKVF 342

Query: 93  FQWCSSPENSIIITNRTSPGTLARDLI 119
            +WC  P N II+      GT+   +I
Sbjct: 343 KKWCGDPLNMIIMPGYCVAGTVGARVI 369


>gi|260942135|ref|XP_002615366.1| hypothetical protein CLUG_04248 [Clavispora lusitaniae ATCC 42720]
 gi|238850656|gb|EEQ40120.1| hypothetical protein CLUG_04248 [Clavispora lusitaniae ATCC 42720]
          Length = 940

 Score = 45.1 bits (105), Expect = 0.031,   Method: Compositional matrix adjust.
 Identities = 36/124 (29%), Positives = 57/124 (45%), Gaps = 13/124 (10%)

Query: 3   DKLMKSFEGARNNPFHFKHVKLCHSLAELAKI-EWMSDKLMKSFEGARNN-------PFH 54
           D+ + + +GA   P +F        L+  A + +WMS +L+K +EG           PF 
Sbjct: 258 DEHLANLQGAAI-PVYFLSYSGTKVLSYAANLLDWMSSQLIKEYEGIAAEDRAYSRVPFE 316

Query: 55  FKHVKLCHSLAELAKVPSPKVVLVSTPDMECG-FSRDLFFQWCSSPENSIIITNRTSPGT 113
              V L  +  EL ++P PK+V  S  D + G  S       C   + +II+T ++S   
Sbjct: 317 PSKVDLLSNPQELIQLPGPKIVFASGIDFKDGDMSTQALQLLCQDEKTTIILTEKSS--- 373

Query: 114 LARD 117
            ARD
Sbjct: 374 FARD 377


>gi|240280758|gb|EER44262.1| cleavage and polyadenylation specificity factor subunit 2
           [Ajellomyces capsulatus H143]
          Length = 1010

 Score = 45.1 bits (105), Expect = 0.032,   Method: Compositional matrix adjust.
 Identities = 29/103 (28%), Positives = 51/103 (49%), Gaps = 13/103 (12%)

Query: 51  NPFHFKHVKLCHSLAELAKV---PSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITN 107
            PF FKH+K+    A+L K+    +PKV+L S   ++ G+S+ +  +  S  EN +I+T 
Sbjct: 401 GPFTFKHLKIVERKAKLEKILGSNTPKVILTSDTSLDWGYSKHVLQKIASGSENLVILTE 460

Query: 108 RTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKD 150
             S          +  N+ +   ++ R  L  E    Y+++KD
Sbjct: 461 SFS----------VSPNKQMVDGIRSRPSLAHEIWTIYEERKD 493


>gi|198413502|ref|XP_002128796.1| PREDICTED: similar to cleavage and polyadenylation specific factor
           3-like [Ciona intestinalis]
          Length = 605

 Score = 45.1 bits (105), Expect = 0.032,   Method: Compositional matrix adjust.
 Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 6/105 (5%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+  +F     N F FKH+K  +        P P VV  +   +  G S ++F 
Sbjct: 284 ITWTNQKIKDTF--VERNMFDFKHIKEFNR--SYIDNPGPMVVFATPGMLHGGLSLEIFK 339

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           +WC++ +N II+      GT+   +  L G R + L+  + + ++
Sbjct: 340 RWCTNEKNMIIMPGYCVAGTVGNKI--LSGMRKIELKPGQVVEIK 382


>gi|124809291|ref|XP_001348538.1| cleavage and polyadenylation specificity factor protein, putative
           [Plasmodium falciparum 3D7]
 gi|23497434|gb|AAN36977.1| cleavage and polyadenylation specificity factor protein, putative
           [Plasmodium falciparum 3D7]
          Length = 876

 Score = 45.1 bits (105), Expect = 0.032,   Method: Compositional matrix adjust.
 Identities = 24/72 (33%), Positives = 41/72 (56%), Gaps = 4/72 (5%)

Query: 51  NPFHFKHVKLCHSLAELAKV----PSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIIT 106
           NPF+FK+VK   SL  ++       +P V++ S   ++ G S+++F    S  ++ +I+T
Sbjct: 347 NPFNFKYVKYAKSLESISSYLYQDNNPCVIMASPGMLQNGISKNIFNIIASDKKSGVILT 406

Query: 107 NRTSPGTLARDL 118
             T  GTLA +L
Sbjct: 407 GYTVKGTLADEL 418


>gi|67479721|ref|XP_655242.1| cleavage and polyadenylation specificity factor 73 kDa subunit
           [Entamoeba histolytica HM-1:IMSS]
 gi|56472366|gb|EAL49856.1| cleavage and polyadenylation specificity factor 73 kDa subunit,
           putative [Entamoeba histolytica HM-1:IMSS]
 gi|449703858|gb|EMD44220.1| cleavage and polyadenylation specificity factor 73 kDa subunit,
           putative [Entamoeba histolytica KU27]
          Length = 755

 Score = 45.1 bits (105), Expect = 0.034,   Method: Compositional matrix adjust.
 Identities = 27/85 (31%), Positives = 46/85 (54%), Gaps = 5/85 (5%)

Query: 36  WMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFFQ 94
           +M+ +L K  + A    F FK ++   S  +   +     V++++P M + G SR +F +
Sbjct: 295 FMNQELRKKTKQA----FDFKFIREGSSSVDDGAIDYKPCVVMASPGMLQDGISRKIFER 350

Query: 95  WCSSPENSIIITNRTSPGTLARDLI 119
           WC+  +N +II      GTLA+DLI
Sbjct: 351 WCTDKKNGVIIPGYCVEGTLAKDLI 375


>gi|167395302|ref|XP_001733549.1| Cleavage and polyadenylation specificity factor subunit [Entamoeba
           dispar SAW760]
 gi|165894214|gb|EDR22276.1| Cleavage and polyadenylation specificity factor subunit, putative
           [Entamoeba dispar SAW760]
          Length = 736

 Score = 45.1 bits (105), Expect = 0.037,   Method: Compositional matrix adjust.
 Identities = 23/68 (33%), Positives = 38/68 (55%), Gaps = 1/68 (1%)

Query: 53  FHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFFQWCSSPENSIIITNRTSP 111
           F FK ++   S  +   +     V++++P M + G SR +F +WC+  +N +II      
Sbjct: 308 FDFKFIREGSSSVDDGAIDYKPCVVMASPGMLQDGISRKIFERWCTDKKNGVIIPGYCVE 367

Query: 112 GTLARDLI 119
           GTLA+DLI
Sbjct: 368 GTLAKDLI 375


>gi|170090732|ref|XP_001876588.1| predicted protein [Laccaria bicolor S238N-H82]
 gi|164648081|gb|EDR12324.1| predicted protein [Laccaria bicolor S238N-H82]
          Length = 901

 Score = 45.1 bits (105), Expect = 0.037,   Method: Compositional matrix adjust.
 Identities = 30/126 (23%), Positives = 63/126 (50%), Gaps = 24/126 (19%)

Query: 55  FKHVKL---CHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSP 111
           F+H++      +L +      PK++L     +  G SR++F  + + P+N +++T R+  
Sbjct: 385 FRHLEFFPNPQALLQTYSSKDPKLILAVPASLSHGPSRNMFSDFAAVPDNVVLLTGRSEE 444

Query: 112 GTLARDLIE----------------LGGN----RTLTLQVKKRIRLEGEELEEYQKKKDK 151
           GTL R L +                +G N      +T+++  ++ L+G ELE +  ++++
Sbjct: 445 GTLGRALFDKWNNSQRPDDKWDKGKIGSNVMMDGAITIKMNHKVPLQGAELEAHL-QEER 503

Query: 152 EAKDKQ 157
            AK+K+
Sbjct: 504 VAKEKE 509


>gi|58266278|ref|XP_570295.1| cleavage and polyadenylation specificity factor subunit
           [Cryptococcus neoformans var. neoformans JEC21]
 gi|134111080|ref|XP_775682.1| hypothetical protein CNBD4110 [Cryptococcus neoformans var.
           neoformans B-3501A]
 gi|50258346|gb|EAL21035.1| hypothetical protein CNBD4110 [Cryptococcus neoformans var.
           neoformans B-3501A]
 gi|57226528|gb|AAW42988.1| cleavage and polyadenylation specificity factor subunit, putative
           [Cryptococcus neoformans var. neoformans JEC21]
          Length = 899

 Score = 45.1 bits (105), Expect = 0.038,   Method: Compositional matrix adjust.
 Identities = 39/137 (28%), Positives = 63/137 (45%), Gaps = 27/137 (19%)

Query: 47  GARNNPFHFKHVKLCHSLAELAK---VPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSI 103
           G+      F+HV+   +  +L +   +  PK+VL   P M  G SR LF    ++  N I
Sbjct: 362 GSEYGVLDFRHVQFFLNTTDLLQTYPLTRPKLVLAVPPTMSHGPSRFLFTAMANTEGNVI 421

Query: 104 IITNRTSPGTLARDLI------ELGGNR--------------TLTLQVKKRIRLEGEELE 143
           ++T R+   TLARDL       +  G++               L ++V  ++ L G ELE
Sbjct: 422 MLTGRSEEQTLARDLYNRWERSQTTGSKWGEGKIGHLTQLEGKLQVEVDSKVPLSGAELE 481

Query: 144 EYQK----KKDKEAKDK 156
            + +    +K+KEA  K
Sbjct: 482 AHVESERLQKEKEAAHK 498


>gi|320590943|gb|EFX03384.1| polyadenylation specificity factor [Grosmannia clavigera kw1407]
          Length = 1036

 Score = 45.1 bits (105), Expect = 0.038,   Method: Composition-based stats.
 Identities = 40/132 (30%), Positives = 59/132 (44%), Gaps = 36/132 (27%)

Query: 50  NNPFHFKHVKLCHSLAELAKV---PSP----KVVLVSTPDMECGFSRDLFFQWCSSPENS 102
           + PF  KH++L    A++ +V    SP    KV+L S   ME GFS+++  +    P N 
Sbjct: 390 SGPFDMKHLRLLERRAQVERVLNSQSPPGGGKVILASDASMEWGFSKEVLRRIADKPRNL 449

Query: 103 IIITNRTSPG-------TLARDLIEL--------------GGN------RTLTLQVKKRI 135
           +++T    PG       +LAR L E               GGN      R L  Q   ++
Sbjct: 450 VLLTE--VPGVLSANKRSLARTLWEWWKSEQGQEAASSGGGGNPVRAGGRHLQFQDGLKV 507

Query: 136 RLEGEELEEYQK 147
            L+G EL  YQ+
Sbjct: 508 ALDGAELLLYQQ 519


>gi|340053150|emb|CCC47437.1| putative cleavage and polyadenylation specificity factor, fragment
           [Trypanosoma vivax Y486]
          Length = 584

 Score = 45.1 bits (105), Expect = 0.038,   Method: Compositional matrix adjust.
 Identities = 27/86 (31%), Positives = 43/86 (50%), Gaps = 2/86 (2%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           +  M+D++ +     RN PF FK+++             P VVL S   ++ G S +LF 
Sbjct: 125 VSAMNDRVKQQHANHRN-PFVFKYIQPLVDTRSFED-NGPCVVLASPGMLQSGISLELFE 182

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
           +WC    N III      GT+A+D++
Sbjct: 183 KWCGDRRNGIIIAGYCIDGTIAKDVL 208


>gi|407041778|gb|EKE40943.1| cleavage and polyadenylation specificity factor 73 kDa subunit,
           putative [Entamoeba nuttalli P19]
          Length = 751

 Score = 44.7 bits (104), Expect = 0.039,   Method: Compositional matrix adjust.
 Identities = 27/85 (31%), Positives = 46/85 (54%), Gaps = 5/85 (5%)

Query: 36  WMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFFQ 94
           +M+ +L K  + A    F FK ++   S  +   +     V++++P M + G SR +F +
Sbjct: 295 FMNQELRKKTKQA----FDFKFIREGSSSVDDGAIDYKPCVVMASPGMLQDGISRKIFER 350

Query: 95  WCSSPENSIIITNRTSPGTLARDLI 119
           WC+  +N +II      GTLA+DLI
Sbjct: 351 WCTDKKNGVIIPGYCVEGTLAKDLI 375


>gi|443926973|gb|ELU45512.1| cleavage and polyadenylation specificity factor subunit
           [Rhizoctonia solani AG-1 IA]
          Length = 854

 Score = 44.7 bits (104), Expect = 0.040,   Method: Compositional matrix adjust.
 Identities = 36/118 (30%), Positives = 62/118 (52%), Gaps = 10/118 (8%)

Query: 53  FHFKHVKLCHSL-AELAKVPS--PKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRT 109
             FKH+++  S  A +++ PS  PK++LV    +  G SR +F ++ S P N++I++  +
Sbjct: 371 LRFKHLEIFPSPDALVSRYPSSMPKLLLVVPATLSHGNSRRIFAEFASVPGNAVILSTPS 430

Query: 110 SPGTLARDLI---ELG---GNRTLTLQVKKRIRLEGEELEEYQKKKDKEAKDKQEKEK 161
            PGTLA  L     LG     R     V + I+L       Y  +K++ AK++Q  ++
Sbjct: 431 EPGTLANTLFNEWNLGQSDNERFGHGSVGQPIQLNSTMTLTYL-EKERAAKERQATQR 487


>gi|321468347|gb|EFX79332.1| hypothetical protein DAPPUDRAFT_304859 [Daphnia pulex]
          Length = 597

 Score = 44.7 bits (104), Expect = 0.040,   Method: Compositional matrix adjust.
 Identities = 43/180 (23%), Positives = 80/180 (44%), Gaps = 16/180 (8%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH++        A  P P VV  +   +  G S  LF 
Sbjct: 282 ITWTNQKIRKTF--VQRNMFEFKHIRPFDK--SYADTPGPMVVFATPGMLHAGLSLQLFK 337

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKDKEA 153
           +W  +  N +I+      GT+   +  L G + + ++ ++ I ++   ++        +A
Sbjct: 338 KWAPNENNMLIMPGYCVSGTVGAKV--LSGAKKVEIENRQVIEVK-MSVQYMSFSAHADA 394

Query: 154 KD-KQEKEKIPPHDTSFIN--ELQLSDFKQTLQRN-GIDCEFM----DGVLICCRGTVAV 205
           K   Q  +   P +   ++   +++   KQ +Q+  GIDC +M    +   I C+  V V
Sbjct: 395 KGIMQLIQYCEPQNVLLVHGEGVKMEFLKQKIQQEYGIDC-YMPANGESAYIPCKPVVPV 453


>gi|321457255|gb|EFX68345.1| hypothetical protein DAPPUDRAFT_218302 [Daphnia pulex]
          Length = 597

 Score = 44.7 bits (104), Expect = 0.040,   Method: Compositional matrix adjust.
 Identities = 42/179 (23%), Positives = 78/179 (43%), Gaps = 14/179 (7%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH++        A  P P VV  +   +  G S  LF 
Sbjct: 282 ITWTNQKIRKTF--VQRNMFEFKHIRPFDK--SYADTPGPMVVFATPGMLHAGLSLQLFK 337

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKDKEA 153
           +W  +  N +I+      GT+   +  L G + + ++ ++ I ++   ++        +A
Sbjct: 338 KWAPNENNMLIMPGYCVSGTVGAKV--LSGAKKVEIENRQVIEVK-MSVQYMSFSAHADA 394

Query: 154 KD-KQEKEKIPPHDTSFIN--ELQLSDFKQTLQRN-GIDCEF---MDGVLICCRGTVAV 205
           K   Q  +   P +   ++   +++   KQ +Q+  GIDC      +   I C+  V V
Sbjct: 395 KGIMQLIQYCEPQNVLLVHGEGVKMEFLKQKIQQEYGIDCYMPANGESAYIPCKPVVPV 453


>gi|406694795|gb|EKC98117.1| cleavage and polyadenylation specificity factor subunit
           [Trichosporon asahii var. asahii CBS 8904]
          Length = 958

 Score = 44.7 bits (104), Expect = 0.041,   Method: Composition-based stats.
 Identities = 42/142 (29%), Positives = 62/142 (43%), Gaps = 27/142 (19%)

Query: 47  GARNNPFHFKHVKLCHSLAEL-AKVPS--PKVVLVSTPDMECGFSRDLFFQWCSSPENSI 103
           G+      F HV+   +  EL  K P+  PK+VL   P M  G SR LF    S   N +
Sbjct: 338 GSEYGALDFSHVQFFATPEELLEKYPANRPKLVLAIPPTMSHGPSRTLFASMASVTGNVV 397

Query: 104 IITNRTSPGTLARDLIELG---------------GNRT-----LTLQVKKRIRLEGEELE 143
           ++T      TLAR+L                   G+ T     L L++  +  L GEELE
Sbjct: 398 LLTGHGEDRTLARELYARWEAHQDEGAHYGHGKIGHATPMEGRLELELDAKEPLSGEELE 457

Query: 144 EYQ----KKKDKEAKDKQEKEK 161
            Y+    +K+++EA  +   E+
Sbjct: 458 AYETAEREKREREAAHQAALER 479


>gi|68076685|ref|XP_680262.1| cleavage and polyadenylation specifity factor protein, [Plasmodium
           berghei strain ANKA]
 gi|56501171|emb|CAH99756.1| cleavage and polyadenylation specifity factor protein, putative
           [Plasmodium berghei]
          Length = 888

 Score = 44.7 bits (104), Expect = 0.045,   Method: Compositional matrix adjust.
 Identities = 27/72 (37%), Positives = 39/72 (54%), Gaps = 4/72 (5%)

Query: 51  NPFHFKHVKLCHSLAE----LAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIIT 106
           NPF+FK+VK   SL      L +   P VV+ S   ++ G S+ +F    S  ++ +IIT
Sbjct: 349 NPFNFKYVKYAKSLDSILNYLYQDNYPCVVMASPGMLQSGISKSIFNIIASDKKSGVIIT 408

Query: 107 NRTSPGTLARDL 118
             T  GTLA +L
Sbjct: 409 GYTIKGTLADEL 420


>gi|388579831|gb|EIM20151.1| Metallo-hydrolase/oxidoreductase [Wallemia sebi CBS 633.66]
          Length = 626

 Score = 44.7 bits (104), Expect = 0.050,   Method: Compositional matrix adjust.
 Identities = 28/90 (31%), Positives = 45/90 (50%), Gaps = 6/90 (6%)

Query: 32  AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKV--PSPKVVLVSTPDMECGFSR 89
           A I  M+  + + F+   N PF FK +     L +L K     P V+L S   ++ G SR
Sbjct: 307 AYIRTMNKNINRRFDSGEN-PFQFKFIS---ELGDLRKWQDKGPCVMLASPGMLQSGTSR 362

Query: 90  DLFFQWCSSPENSIIITNRTSPGTLARDLI 119
           +L  +W   P+N +II   +  GT+A  ++
Sbjct: 363 ELLERWAPDPKNGLIICGYSVEGTMAHSIV 392


>gi|346323812|gb|EGX93410.1| cleavage and polyadenylation specifity factor, 73 kDa subunit,
           putative [Cordyceps militaris CM01]
          Length = 879

 Score = 44.3 bits (103), Expect = 0.053,   Method: Compositional matrix adjust.
 Identities = 26/87 (29%), Positives = 48/87 (55%), Gaps = 4/87 (4%)

Query: 35  EWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFF 93
           E M++       GA   P+ FK+++   +L     V     V++++P M + G SR+LF 
Sbjct: 325 ERMAEAETSGGAGA-GGPWDFKYIRSLKNLDRFDDVGG--CVMLASPGMLQNGVSRELFE 381

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIE 120
           +W  + +N +IIT  +  GT+AR +++
Sbjct: 382 RWAPNDKNGVIITGYSVEGTMARQIMK 408


>gi|171679503|ref|XP_001904698.1| hypothetical protein [Podospora anserina S mat+]
 gi|170939377|emb|CAP64605.1| unnamed protein product [Podospora anserina S mat+]
          Length = 967

 Score = 44.3 bits (103), Expect = 0.055,   Method: Compositional matrix adjust.
 Identities = 45/199 (22%), Positives = 74/199 (37%), Gaps = 66/199 (33%)

Query: 14  NNPFHFKHVKL---CHSLAELAK-----IEWMSDKLMKSFEGARNN-------------- 51
           NN   F++ +L    HS+    K      EWM DK+++ FE A                 
Sbjct: 304 NNNETFRNAQLYLAGHSIGSTLKHARSLFEWMDDKIVREFEAAAGGKESHSRGQRGGHHH 363

Query: 52  ------PFHFKHVKLCHSLAELAKV---------PSPKVVLVSTPDMECGFSRDLFFQWC 96
                 PF FKH++L     +++ V         P  +V+L +   +E GFS+++     
Sbjct: 364 DHKVAGPFDFKHLRLLERKGQVSWVLKQALEDLEPKGRVILATDSSLEWGFSKEVLKSIA 423

Query: 97  SSPENSIIIT-----NRTSPGTLARDLIE-----------------------LGGNRTLT 128
               N +++T     N   P ++AR L +                         G R L 
Sbjct: 424 GDARNLVLLTEKPALNENKP-SIARTLWDWWKERKDGVATEQTTSGETFEQVYAGGRELE 482

Query: 129 LQVKKRIRLEGEELEEYQK 147
           +    +  LEG +L  YQ+
Sbjct: 483 IADASKQALEGNDLSVYQQ 501


>gi|82596170|ref|XP_726151.1| cleavage and polyadenylation specificity factor, 73 kDa subunit
           [Plasmodium yoelii yoelii 17XNL]
 gi|23481438|gb|EAA17716.1| cleavage and polyadenylation specificity factor, 73 kDa subunit
           [Plasmodium yoelii yoelii]
          Length = 942

 Score = 44.3 bits (103), Expect = 0.056,   Method: Compositional matrix adjust.
 Identities = 26/72 (36%), Positives = 39/72 (54%), Gaps = 4/72 (5%)

Query: 51  NPFHFKHVKLCHSLAE----LAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIIT 106
           NPF+FK+VK   SL      L +   P V++ S   ++ G S+ +F    S  ++ +IIT
Sbjct: 348 NPFNFKYVKYAKSLDSILNYLYQDNYPCVIMASPGMLQSGISKSIFNIIASDKKSGVIIT 407

Query: 107 NRTSPGTLARDL 118
             T  GTLA +L
Sbjct: 408 GYTIKGTLADEL 419


>gi|406865774|gb|EKD18815.1| RNA-metabolising metallo-beta-lactamase [Marssonina brunnea f. sp.
           'multigermtubi' MB_m1]
          Length = 1331

 Score = 44.3 bits (103), Expect = 0.058,   Method: Compositional matrix adjust.
 Identities = 38/173 (21%), Positives = 65/173 (37%), Gaps = 59/173 (34%)

Query: 34  IEWMSDKLMKSFEG--------------------ARNNPFHFKHVKLCHSLAELAKVPS- 72
           +EWM D +++ FE                     +   PF FKH++L     ++ ++   
Sbjct: 334 LEWMDDAIIREFEANAGINQKETGSKAAGDAKGSSDGGPFDFKHLRLLERKGQIDRIMGQ 393

Query: 73  ----------PKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRT------SPGTLAR 116
                      KV+L S   +E GFSRD+         N II+T +       +  +LA+
Sbjct: 394 TDIDRHGRSIGKVILASDASLEWGFSRDILKAVADDTRNLIILTEKMAKPQGEATSSLAK 453

Query: 117 DLIEL----------------------GGNRTLTLQVKKRIRLEGEELEEYQK 147
            L                          G +++ ++  KR+ LEG +L  YQ+
Sbjct: 454 TLWSWWEDRESVVSEETKSGLVLDRVNSGAKSVAIREAKRLALEGNDLSVYQQ 506


>gi|325088985|gb|EGC42295.1| cleavage and polyadenylation specific subunit [Ajellomyces
           capsulatus H88]
          Length = 1010

 Score = 44.3 bits (103), Expect = 0.059,   Method: Compositional matrix adjust.
 Identities = 28/103 (27%), Positives = 51/103 (49%), Gaps = 13/103 (12%)

Query: 51  NPFHFKHVKLCHSLAELAKV---PSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITN 107
            PF FKH+K+    A++ K+    +PKV+L S   ++ G+S+ +  +  S  EN +I+T 
Sbjct: 401 GPFTFKHLKIVERKAKIEKILGSNTPKVILTSDTSLDWGYSKHVLQKIASGSENLVILTE 460

Query: 108 RTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKD 150
             S          +  N+ +   ++ R  L  E    Y+++KD
Sbjct: 461 SFS----------VSPNKQMVDGIRSRPSLAHEIWTIYEERKD 493


>gi|197099184|ref|NP_001124760.1| integrator complex subunit 11 [Pongo abelii]
 gi|55725797|emb|CAH89679.1| hypothetical protein [Pongo abelii]
          Length = 655

 Score = 44.3 bits (103), Expect = 0.060,   Method: Compositional matrix adjust.
 Identities = 27/105 (25%), Positives = 49/105 (46%), Gaps = 6/105 (5%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K           P P VV  +   +  G S  +F 
Sbjct: 283 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFTDNPGPMVVFATPGMLHAGQSLQIFR 338

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           +W  + +N +I+      GT+   +  L G R L ++ ++ + ++
Sbjct: 339 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 381


>gi|405120276|gb|AFR95047.1| cleavage and polyadenylation specificity factor subunit
           [Cryptococcus neoformans var. grubii H99]
          Length = 899

 Score = 44.3 bits (103), Expect = 0.063,   Method: Compositional matrix adjust.
 Identities = 39/137 (28%), Positives = 62/137 (45%), Gaps = 27/137 (19%)

Query: 47  GARNNPFHFKHVKLCHSLAELAK---VPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSI 103
           G+      F+HV    +  +L +   +  PK+VL   P M  G SR LF    ++  N I
Sbjct: 362 GSEYGVLDFRHVLFFLNTTDLLQTYPLTRPKLVLAVPPTMSHGPSRFLFTAMANTEGNVI 421

Query: 104 IITNRTSPGTLARDLI------ELGGNR--------------TLTLQVKKRIRLEGEELE 143
           ++T R+   TLARDL       +  G++               L ++V  ++ L G ELE
Sbjct: 422 MLTGRSEEQTLARDLYNRWERSQTAGSKWGEGKIGHLTRLEGKLQVEVDSKVPLSGAELE 481

Query: 144 EYQK----KKDKEAKDK 156
            + +    +K+KEA  K
Sbjct: 482 AHVESERLQKEKEAAHK 498


>gi|66472504|ref|NP_001018457.1| integrator complex subunit 11 [Danio rerio]
 gi|82192739|sp|Q503E1.1|INT11_DANRE RecName: Full=Integrator complex subunit 11; Short=Int11; AltName:
           Full=Cleavage and polyadenylation-specific factor 3-like
           protein; Short=CPSF3-like protein
 gi|63102425|gb|AAH95364.1| Zgc:110671 [Danio rerio]
          Length = 598

 Score = 44.3 bits (103), Expect = 0.063,   Method: Compositional matrix adjust.
 Identities = 28/105 (26%), Positives = 49/105 (46%), Gaps = 10/105 (9%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K        A  P P VV  +   +  G S  +F 
Sbjct: 283 ITWTNQKIRKTF--VQRNMFEFKHIKAFDR--SYADNPGPMVVFATPGMLHAGQSLQIFK 338

Query: 94  QWCSSPENSIIITNRTSPGTLARDL------IELGGNRTLTLQVK 132
           +W  + +N +I+      GT+   +      +E+ G  TL ++++
Sbjct: 339 KWAGNEKNMVIMPGYCVQGTVGHKILNGQKKLEMEGRATLDVKLQ 383


>gi|392593709|gb|EIW83034.1| Metallo-hydrolase oxidoreductase [Coniophora puteana RWD-64-598
           SS2]
          Length = 770

 Score = 44.3 bits (103), Expect = 0.065,   Method: Compositional matrix adjust.
 Identities = 27/83 (32%), Positives = 38/83 (45%), Gaps = 21/83 (25%)

Query: 49  RNNPFHFKHVKLCHSLAELAKVPSPK------------VVLVSTPDMECGFSRDLFFQWC 96
           R+NPF FKH+         + +P PK            VVL S    + G SR+L   W 
Sbjct: 298 RDNPFVFKHI---------SNLPQPKGWERKIAEGPPCVVLASPGFCQSGPSRELLELWA 348

Query: 97  SSPENSIIITNRTSPGTLARDLI 119
               N  I+T  +  GT+ARD++
Sbjct: 349 PDARNGFILTGYSVEGTMARDIL 371


>gi|224108267|ref|XP_002314781.1| predicted protein [Populus trichocarpa]
 gi|222863821|gb|EEF00952.1| predicted protein [Populus trichocarpa]
          Length = 639

 Score = 43.9 bits (102), Expect = 0.069,   Method: Compositional matrix adjust.
 Identities = 31/88 (35%), Positives = 42/88 (47%), Gaps = 8/88 (9%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSL-AELAKVPSPKVVLVSTPDM-ECGFSRDL 91
           I W S K+ +++  A  N F FKHV   H+    L   P P  VL +TP M   GFS ++
Sbjct: 282 ISWTSQKVKETY--ATRNAFDFKHV---HNFDRSLINAPGP-CVLFATPGMISGGFSLEV 335

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLI 119
           F QW     N I +      GT+   L+
Sbjct: 336 FKQWAPCEMNLITLPGYCVAGTVGHKLM 363


>gi|452845681|gb|EME47614.1| hypothetical protein DOTSEDRAFT_146416 [Dothistroma septosporum
           NZE10]
          Length = 839

 Score = 43.9 bits (102), Expect = 0.070,   Method: Compositional matrix adjust.
 Identities = 28/97 (28%), Positives = 46/97 (47%), Gaps = 7/97 (7%)

Query: 52  PFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSP 111
           P+ F+ V+    +     V    VVL S   ++ G SR L  +W   P+N ++IT  +  
Sbjct: 339 PWDFQFVRALKGVDRFEDVGG-SVVLASPGMLQNGPSRALLERWAPDPKNGVVITGYSVE 397

Query: 112 GTLAR------DLIELGGNRTLTLQVKKRIRLEGEEL 142
           GT+A+      D I    NRT  + + KR   E +++
Sbjct: 398 GTMAKQIMLEPDSIAAVTNRTANMSLGKRAGAEEQQM 434


>gi|384486005|gb|EIE78185.1| hypothetical protein RO3G_02889 [Rhizopus delemar RA 99-880]
          Length = 613

 Score = 43.9 bits (102), Expect = 0.071,   Method: Compositional matrix adjust.
 Identities = 23/86 (26%), Positives = 42/86 (48%), Gaps = 4/86 (4%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+  +F  ++ N F FKH+K  +        P PKV+  +   +  G S ++F 
Sbjct: 260 INWTNQKIKSTF--SQRNMFDFKHIKTWNR--NYIDQPGPKVLFATPGMLNAGTSLEVFK 315

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
           +W   P+N +I+      GT+   ++
Sbjct: 316 KWAPDPKNMVIMPGFCVAGTVGSKVL 341


>gi|392569726|gb|EIW62899.1| mRNA 3'-end-processing protein YSH1 [Trametes versicolor FP-101664
           SS1]
          Length = 805

 Score = 43.9 bits (102), Expect = 0.073,   Method: Compositional matrix adjust.
 Identities = 26/75 (34%), Positives = 40/75 (53%), Gaps = 5/75 (6%)

Query: 49  RNNPFHFKHVKLCHSLA----ELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSII 104
            +NPF FKH+           ++A+ P P VVL S   M+ G SR+L   W     N +I
Sbjct: 298 HDNPFVFKHITNVPGTRGWERKIAEGP-PCVVLASPGFMQTGPSRELLELWAPDGRNGLI 356

Query: 105 ITNRTSPGTLARDLI 119
           +T  +  GT+AR+++
Sbjct: 357 VTGYSIEGTMAREIL 371


>gi|225560694|gb|EEH08975.1| cleavage and polyadenylation specificity factor subunit 2
           [Ajellomyces capsulatus G186AR]
          Length = 1010

 Score = 43.9 bits (102), Expect = 0.074,   Method: Compositional matrix adjust.
 Identities = 21/59 (35%), Positives = 35/59 (59%), Gaps = 3/59 (5%)

Query: 51  NPFHFKHVKLCHSLAELAKV---PSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIIT 106
            PF FKH+K+    A+L K+    +PKV+L S   ++ G+S+ +  +  S  EN +I+T
Sbjct: 401 GPFTFKHLKIVERKAKLEKILGSNTPKVILTSDTSLDWGYSKHVLQKIASGSENLVILT 459


>gi|308457231|ref|XP_003091005.1| hypothetical protein CRE_12341 [Caenorhabditis remanei]
 gi|308258743|gb|EFP02696.1| hypothetical protein CRE_12341 [Caenorhabditis remanei]
          Length = 312

 Score = 43.9 bits (102), Expect = 0.075,   Method: Compositional matrix adjust.
 Identities = 33/100 (33%), Positives = 51/100 (51%), Gaps = 5/100 (5%)

Query: 140 EELEEYQKKKDKEAKDKQEKEKIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICC 199
           +ELE+ + K  +  K    K K+      F+N+L+LSDFK  L   G   EF+ G L+  
Sbjct: 210 KELEKMRNKIGR-VKRNTNKYKVQKQMAIFVNDLKLSDFKNWLVEKGYKAEFLSGTLLIN 268

Query: 200 RGTVAVRRVVLVSTPDMECGFSRDLFFQWCSSPENSIIIT 239
            G  ++RR  +  +  ME   S+D +    S   NSI+I+
Sbjct: 269 GGKCSIRRGEMGFS--MEGALSKDDYKLLNSI--NSIVIS 304


>gi|402084516|gb|EJT79534.1| endoribonuclease YSH1 [Gaeumannomyces graminis var. tritici
           R3-111a-1]
          Length = 868

 Score = 43.9 bits (102), Expect = 0.077,   Method: Compositional matrix adjust.
 Identities = 21/74 (28%), Positives = 40/74 (54%), Gaps = 1/74 (1%)

Query: 47  GARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIIT 106
           G    P+ FK ++   +L     +  P V+L S   ++ G SR+L  +W  S +N ++IT
Sbjct: 328 GTGGGPWDFKFIRSLKNLDRFDDL-GPCVMLASPGMLQTGVSRELLERWAPSDKNGVVIT 386

Query: 107 NRTSPGTLARDLIE 120
             +  GT+A+ +++
Sbjct: 387 GYSVEGTMAKQIMQ 400


>gi|281201684|gb|EFA75892.1| integrator complex subunit 11 [Polysphondylium pallidum PN500]
          Length = 648

 Score = 43.9 bits (102), Expect = 0.078,   Method: Compositional matrix adjust.
 Identities = 32/106 (30%), Positives = 51/106 (48%), Gaps = 7/106 (6%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLF 92
           I W + K+ ++F   + N F FKH+K    +  L   P P +VL +TP M   G S ++F
Sbjct: 309 IGWTNQKIKQTF--VKRNMFDFKHIKPFDRM--LVDAPGP-MVLFATPGMLHAGASLEVF 363

Query: 93  FQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
            +W  S  N  II      GT+   L+       + +++ K+  LE
Sbjct: 364 KKWAPSELNMTIIPGYCVVGTVGNKLLSNASGPQM-VEIDKKTTLE 408


>gi|408390480|gb|EKJ69876.1| hypothetical protein FPSE_09963 [Fusarium pseudograminearum CS3096]
          Length = 833

 Score = 43.9 bits (102), Expect = 0.080,   Method: Compositional matrix adjust.
 Identities = 24/79 (30%), Positives = 46/79 (58%), Gaps = 4/79 (5%)

Query: 44  SFEGA-RNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFFQWCSSPEN 101
           S +GA +  P+ FK+++   +L     V     V++++P M + G SR+L  +W  S +N
Sbjct: 324 SGDGAGKGGPWDFKYIRSLKNLDRFDDVGG--CVMLASPGMLQNGVSRELLERWAPSEKN 381

Query: 102 SIIITNRTSPGTLARDLIE 120
            +IIT  +  GT+A+ +++
Sbjct: 382 GVIITGYSVEGTMAKQIMQ 400


>gi|367054168|ref|XP_003657462.1| hypothetical protein THITE_2123200 [Thielavia terrestris NRRL 8126]
 gi|347004728|gb|AEO71126.1| hypothetical protein THITE_2123200 [Thielavia terrestris NRRL 8126]
          Length = 859

 Score = 43.9 bits (102), Expect = 0.080,   Method: Compositional matrix adjust.
 Identities = 24/87 (27%), Positives = 45/87 (51%), Gaps = 3/87 (3%)

Query: 35  EWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFF 93
           E M++         +  P+ FK ++   S+     V     V++++P M + G SR+L  
Sbjct: 316 ERMAEAEASGDAAGKGGPWDFKFIRSLKSIDRFDDVGG--CVMLASPGMLQNGVSRELLE 373

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIE 120
           +W  S +N +IIT  +  GT+A+ L++
Sbjct: 374 RWAPSEKNGVIITGYSVEGTMAKQLMQ 400


>gi|296424981|ref|XP_002842022.1| hypothetical protein [Tuber melanosporum Mel28]
 gi|295638279|emb|CAZ86213.1| unnamed protein product [Tuber melanosporum]
          Length = 975

 Score = 43.9 bits (102), Expect = 0.080,   Method: Compositional matrix adjust.
 Identities = 38/163 (23%), Positives = 61/163 (37%), Gaps = 50/163 (30%)

Query: 34  IEWMSDKLMKSFE----------------------GARNNPFHFKHVKLCHSLAELAKV- 70
           +EWM + +++ FE                      G +  PF F H+ L  +   L ++ 
Sbjct: 317 LEWMDEGVVREFESIAGGDRRGNRQRDDAEGKGNDGNKAGPFDFLHLNLVSTQGHLNRIL 376

Query: 71  ----PSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR------------------ 108
                  KV++ S   +  GFSR+   +  S  +N +++T R                  
Sbjct: 377 NDGNERGKVIIASDSSLGWGFSREALMRLASDEKNLVVLTERSDGKLGWAGNLWQQWKEK 436

Query: 109 TSPGTLARDL----IELGGNRTLTLQVKKRIRLEGEELEEYQK 147
           T  G  A       + L G R   L +  R  LEG+ELE Y +
Sbjct: 437 TGSGGEANATDWQEVSLDGQRA-ELDIPHRTPLEGQELEAYNR 478


>gi|70945220|ref|XP_742453.1| cleavage and polyadenylation specifity factor protein, [Plasmodium
           chabaudi chabaudi]
 gi|56521446|emb|CAH74696.1| cleavage and polyadenylation specifity factor protein, putative
           [Plasmodium chabaudi chabaudi]
          Length = 564

 Score = 43.9 bits (102), Expect = 0.082,   Method: Compositional matrix adjust.
 Identities = 27/83 (32%), Positives = 42/83 (50%), Gaps = 4/83 (4%)

Query: 40  KLMKSFEGARNNPFHFKHVKLCHSLAE----LAKVPSPKVVLVSTPDMECGFSRDLFFQW 95
           + +K       NPF+FK+VK   SL      L +   P V++ S   ++ G S+ +F   
Sbjct: 338 EFIKKIVNEGKNPFNFKYVKYAKSLDSILNYLYQDNYPCVIMASPGMLQSGISKSIFNII 397

Query: 96  CSSPENSIIITNRTSPGTLARDL 118
            S  ++ +IIT  T  GTLA +L
Sbjct: 398 ASDKKSGVIITGYTIKGTLADEL 420


>gi|442570104|sp|Q4IPN9.2|YSH1_GIBZE RecName: Full=Endoribonuclease YSH1; AltName: Full=mRNA
           3'-end-processing protein YSH1
          Length = 833

 Score = 43.9 bits (102), Expect = 0.082,   Method: Compositional matrix adjust.
 Identities = 24/79 (30%), Positives = 46/79 (58%), Gaps = 4/79 (5%)

Query: 44  SFEGA-RNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFFQWCSSPEN 101
           S +GA +  P+ FK+++   +L     V     V++++P M + G SR+L  +W  S +N
Sbjct: 324 SGDGAGKGGPWDFKYIRSLKNLDRFDDVGG--CVMLASPGMLQNGVSRELLERWAPSEKN 381

Query: 102 SIIITNRTSPGTLARDLIE 120
            +IIT  +  GT+A+ +++
Sbjct: 382 GVIITGYSVEGTMAKQIMQ 400


>gi|85079519|ref|XP_956368.1| hypothetical protein NCU03479 [Neurospora crassa OR74A]
 gi|74630409|sp|Q8WZS6.1|YSH1_NEUCR RecName: Full=Endoribonuclease ysh-1; AltName: Full=mRNA
           3'-end-processing protein ysh-1
 gi|18376069|emb|CAD21097.1| related to BRR5 (component of pre-mRNA polyadenylation factor PF I)
           [Neurospora crassa]
 gi|28917429|gb|EAA27132.1| hypothetical protein NCU03479 [Neurospora crassa OR74A]
          Length = 850

 Score = 43.5 bits (101), Expect = 0.088,   Method: Compositional matrix adjust.
 Identities = 25/79 (31%), Positives = 45/79 (56%), Gaps = 4/79 (5%)

Query: 44  SFEGA-RNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFFQWCSSPEN 101
           S +GA +  P+ F+ ++   SL     V     V++++P M + G SR+L  +W  S +N
Sbjct: 323 SGDGAGKGGPWDFRFIRSLKSLDRFEDVGG--CVMLASPGMLQNGVSRELLERWAPSEKN 380

Query: 102 SIIITNRTSPGTLARDLIE 120
            +IIT  +  GT+A+ L++
Sbjct: 381 GVIITGYSVEGTMAKQLLQ 399


>gi|146170679|ref|XP_001017643.2| metallo beta lactamase domain containing protein [Tetrahymena
           thermophila]
 gi|146145062|gb|EAR97398.2| metallo beta lactamase domain containing protein [Tetrahymena
           thermophila SB210]
          Length = 675

 Score = 43.5 bits (101), Expect = 0.089,   Method: Compositional matrix adjust.
 Identities = 29/87 (33%), Positives = 44/87 (50%), Gaps = 6/87 (6%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLF 92
           + W ++K+  S+    +N F FK++K       L K   P +VL +TP M   G S  +F
Sbjct: 315 VNWTNEKIKSSY--LTDNMFDFKYIKPFSR--SLIKTNGP-MVLFATPGMLHAGLSMQVF 369

Query: 93  FQWCSSPENSIIITNRTSPGTLARDLI 119
            +WC   +N++II      GTL   LI
Sbjct: 370 KEWCYDEKNTLIIPGYCVAGTLGCVLI 396


>gi|399216276|emb|CCF72964.1| unnamed protein product [Babesia microti strain RI]
          Length = 916

 Score = 43.5 bits (101), Expect = 0.091,   Method: Compositional matrix adjust.
 Identities = 32/93 (34%), Positives = 42/93 (45%), Gaps = 1/93 (1%)

Query: 27  SLAELAKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTP-DMEC 85
           SL     IE+MS K+   F     NP  F +V     L +L    +   V+ STP  +  
Sbjct: 280 SLIFSTSIEYMSSKVKSEFSRDLKNPLIFDNVIYLDKLEQLKPFTNVPCVIFSTPGSLNW 339

Query: 86  GFSRDLFFQWCSSPENSIIITNRTSPGTLARDL 118
           GFS  LF    S   N II++   +  TLAR L
Sbjct: 340 GFSNALFAAIGSKKGNLIILSKEPTTKTLARKL 372


>gi|221059687|ref|XP_002260489.1| cleavage and polyadenylation specifity protein [Plasmodium knowlesi
           strain H]
 gi|193810562|emb|CAQ41756.1| cleavage and polyadenylation specifity protein,putative [Plasmodium
           knowlesi strain H]
          Length = 938

 Score = 43.5 bits (101), Expect = 0.093,   Method: Compositional matrix adjust.
 Identities = 25/72 (34%), Positives = 40/72 (55%), Gaps = 4/72 (5%)

Query: 51  NPFHFKHVKLCHSLAE----LAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIIT 106
           NPF+FK+VK   SL      L +  +P VV+ S   ++ G S+++F       ++ +I+T
Sbjct: 355 NPFNFKYVKYAKSLDSILNYLYQDNNPCVVMASPGMLQNGISKNIFNIIAPDKKSGVILT 414

Query: 107 NRTSPGTLARDL 118
             T  GTLA +L
Sbjct: 415 GYTVKGTLAHEL 426


>gi|342879865|gb|EGU81098.1| hypothetical protein FOXB_08372 [Fusarium oxysporum Fo5176]
          Length = 858

 Score = 43.5 bits (101), Expect = 0.095,   Method: Compositional matrix adjust.
 Identities = 24/79 (30%), Positives = 46/79 (58%), Gaps = 4/79 (5%)

Query: 44  SFEGA-RNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFFQWCSSPEN 101
           S +GA +  P+ FK+++   +L     V     V++++P M + G SR+L  +W  S +N
Sbjct: 350 SGDGAGKGGPWDFKYIRSLKNLDRFDDVGG--CVMLASPGMLQNGVSRELLERWAPSEKN 407

Query: 102 SIIITNRTSPGTLARDLIE 120
            +IIT  +  GT+A+ +++
Sbjct: 408 GVIITGYSVEGTMAKQIMQ 426


>gi|336468884|gb|EGO57047.1| hypothetical protein NEUTE1DRAFT_84705 [Neurospora tetrasperma FGSC
           2508]
 gi|350288819|gb|EGZ70044.1| Endoribonuclease ysh-1 [Neurospora tetrasperma FGSC 2509]
          Length = 853

 Score = 43.5 bits (101), Expect = 0.095,   Method: Compositional matrix adjust.
 Identities = 25/79 (31%), Positives = 45/79 (56%), Gaps = 4/79 (5%)

Query: 44  SFEGA-RNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFFQWCSSPEN 101
           S +GA +  P+ F+ ++   SL     V     V++++P M + G SR+L  +W  S +N
Sbjct: 323 SGDGAGKGGPWDFRFIRSLKSLDRFEDVGG--CVMLASPGMLQNGVSRELLERWAPSEKN 380

Query: 102 SIIITNRTSPGTLARDLIE 120
            +IIT  +  GT+A+ L++
Sbjct: 381 GVIITGYSVEGTMAKQLLQ 399


>gi|46107872|ref|XP_380995.1| hypothetical protein FG00819.1 [Gibberella zeae PH-1]
          Length = 864

 Score = 43.5 bits (101), Expect = 0.095,   Method: Compositional matrix adjust.
 Identities = 24/79 (30%), Positives = 46/79 (58%), Gaps = 4/79 (5%)

Query: 44  SFEGA-RNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFFQWCSSPEN 101
           S +GA +  P+ FK+++   +L     V     V++++P M + G SR+L  +W  S +N
Sbjct: 355 SGDGAGKGGPWDFKYIRSLKNLDRFDDVGG--CVMLASPGMLQNGVSRELLERWAPSEKN 412

Query: 102 SIIITNRTSPGTLARDLIE 120
            +IIT  +  GT+A+ +++
Sbjct: 413 GVIITGYSVEGTMAKQIMQ 431


>gi|358058074|dbj|GAA96053.1| hypothetical protein E5Q_02714 [Mixia osmundae IAM 14324]
          Length = 896

 Score = 43.5 bits (101), Expect = 0.099,   Method: Composition-based stats.
 Identities = 35/128 (27%), Positives = 65/128 (50%), Gaps = 24/128 (18%)

Query: 57  HVKLCHSLAEL-AKVPS--PKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGT 113
           ++K+  SL  L A +P   PKVVL +   +  G SR +F +  +   N +++T+    G+
Sbjct: 326 NLKIFPSLDSLYAVIPPHVPKVVLAAPLPLSYGSSRKVFTEMAAQAGNLLLLTSPGPAGS 385

Query: 114 LARDLI----------------ELGG----NRTLTLQVKKRIRLEGEELEEYQKKKDKEA 153
           L+R L                 E+G     N  ++L ++ ++ L+GEEL+E+   + + A
Sbjct: 386 LSRSLFDKWNEAQTPAQRMGTGEIGQTITLNEAVSLPMRSKVILQGEELQEFLDNQ-RAA 444

Query: 154 KDKQEKEK 161
           K++  K+K
Sbjct: 445 KERHAKQK 452


>gi|56403864|emb|CAI29717.1| hypothetical protein [Pongo abelii]
          Length = 600

 Score = 43.5 bits (101), Expect = 0.099,   Method: Compositional matrix adjust.
 Identities = 27/105 (25%), Positives = 49/105 (46%), Gaps = 6/105 (5%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+    + N F FKH+K        A  P P VV  +   +  G S  +F 
Sbjct: 283 IPWTNQKIRKT--SVQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 338

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           +W  + +N +I+      GT+   +  L G R L ++ ++ + ++
Sbjct: 339 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 381


>gi|378730429|gb|EHY56888.1| endoribonuclease ysh1 [Exophiala dermatitidis NIH/UT8656]
          Length = 868

 Score = 43.5 bits (101), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 28/101 (27%), Positives = 47/101 (46%), Gaps = 12/101 (11%)

Query: 46  EGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFFQWCSSPENSII 104
           +G    P+ F+ V+   +L     V     V++++P M + G SR L  +W   P N +I
Sbjct: 332 KGVSAGPWDFRFVRSLRNLDRFDDVGG--CVMLASPGMLQSGMSRVLLERWAPDPRNGVI 389

Query: 105 ITNRTSPGTLARDL---------IELGGNRTLTLQVKKRIR 136
           +T     GT+AR +         I  GGN  +  Q+ ++ R
Sbjct: 390 MTGYNVEGTMARTILSEPEMIPAIMSGGNTGVGQQMGRKGR 430


>gi|302927041|ref|XP_003054415.1| predicted protein [Nectria haematococca mpVI 77-13-4]
 gi|256735356|gb|EEU48702.1| predicted protein [Nectria haematococca mpVI 77-13-4]
          Length = 827

 Score = 43.5 bits (101), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 24/79 (30%), Positives = 46/79 (58%), Gaps = 4/79 (5%)

Query: 44  SFEGA-RNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFFQWCSSPEN 101
           S +GA +  P+ FK+++   +L     V     V++++P M + G SR+L  +W  S +N
Sbjct: 324 SGDGAGKGGPWDFKYIRSLKNLDRFDDVGG--CVMLASPGMLQNGVSRELLERWAPSEKN 381

Query: 102 SIIITNRTSPGTLARDLIE 120
            +IIT  +  GT+A+ +++
Sbjct: 382 GVIITGYSVEGTMAKQIMQ 400


>gi|321257420|ref|XP_003193582.1| cleavage and polyadenylation specificity factor subunit
           [Cryptococcus gattii WM276]
 gi|317460052|gb|ADV21795.1| Cleavage and polyadenylation specificity factor subunit, putative
           [Cryptococcus gattii WM276]
          Length = 900

 Score = 43.5 bits (101), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 40/137 (29%), Positives = 64/137 (46%), Gaps = 27/137 (19%)

Query: 47  GARNNPFHFKHVKLCHSLAELAK---VPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSI 103
           G+      F+HV+   +  +L +   +  PK+VL   P M  G SR LF    ++  N I
Sbjct: 363 GSEYGVLDFRHVQFFLNPTDLLQTYPLTRPKLVLAIPPTMSHGPSRFLFTAMANTEGNVI 422

Query: 104 IITNRTSPGTLARDLIE-------LG--------GNRT-----LTLQVKKRIRLEGEELE 143
           ++T R+   TLARDL         +G        G+ T     L +++  ++ L G ELE
Sbjct: 423 MLTGRSEEQTLARDLFNRWERSQTVGSKWGEGKIGHLTQLEGKLQVEMDSKVPLSGAELE 482

Query: 144 EYQK----KKDKEAKDK 156
            + +    +K+KEA  K
Sbjct: 483 AHMESERLQKEKEAAHK 499


>gi|358385845|gb|EHK23441.1| hypothetical protein TRIVIDRAFT_37526 [Trichoderma virens Gv29-8]
          Length = 957

 Score = 43.5 bits (101), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 31/107 (28%), Positives = 50/107 (46%), Gaps = 26/107 (24%)

Query: 34  IEWMSDKLMKSFE----GAR----------NNPFHFKHVKLCHSLAELAKVPS------- 72
           +EWM + +++ FE    G R            PF FK+++L    A++ K+ S       
Sbjct: 332 LEWMDNNIVQEFEAFAEGQRKVNGGNEKKEGAPFDFKYLRLLERKAQITKLLSQNIENGE 391

Query: 73  --PKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARD 117
              +V+L S   M+ GFS+DL         N +I+T R +   LA+D
Sbjct: 392 TQGRVILASDVSMDWGFSKDLVKGLAKDSRNLVILTERPN---LAKD 435


>gi|353237084|emb|CCA69065.1| hypothetical protein PIIN_02923 [Piriformospora indica DSM 11827]
          Length = 887

 Score = 43.1 bits (100), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 31/120 (25%), Positives = 58/120 (48%), Gaps = 23/120 (19%)

Query: 53  FHFKHVKLCHSLAELAKVPS---PKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRT 109
             FK V+   +  E+  + S   PK++L     +  G SR LF  +  +  N +++T RT
Sbjct: 365 LRFKFVEFFSNPDEMINIFSSREPKLILSVPASLSHGPSRSLFADFAVNEGNMVVLTQRT 424

Query: 110 SPGTLARDLIE--------------------LGGNRTLTLQVKKRIRLEGEELEEYQKKK 149
             GTL R L++                    +  +R + ++++ ++ L+G ELEEY++K+
Sbjct: 425 GMGTLNRFLLDRWEAGQEDSQRWQDGHIGVPVSLDRPIDMELRIKVPLQGVELEEYREKE 484


>gi|297569676|ref|YP_003691020.1| RNA-metabolising metallo-beta-lactamase [Desulfurivibrio
           alkaliphilus AHT2]
 gi|296925591|gb|ADH86401.1| RNA-metabolising metallo-beta-lactamase [Desulfurivibrio
           alkaliphilus AHT2]
          Length = 479

 Score = 43.1 bits (100), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 24/87 (27%), Positives = 44/87 (50%), Gaps = 5/87 (5%)

Query: 39  DKLMKSF--EGARNNPFHFKHVKLCHSLA---ELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           D+ ++ F  +G  +NPF F  ++   S+    +L  +  P V++ ++  ME G       
Sbjct: 309 DREIRDFILKGNHDNPFGFDTLRYTQSVGDSKQLNALKKPAVIISASGMMEGGRILHHLR 368

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIE 120
              S P N+I++T   +P TL R ++E
Sbjct: 369 NRVSDPRNTILVTGWQAPNTLGRKIVE 395


>gi|116200035|ref|XP_001225829.1| hypothetical protein CHGG_08173 [Chaetomium globosum CBS 148.51]
 gi|88179452|gb|EAQ86920.1| hypothetical protein CHGG_08173 [Chaetomium globosum CBS 148.51]
          Length = 854

 Score = 43.1 bits (100), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 24/79 (30%), Positives = 45/79 (56%), Gaps = 4/79 (5%)

Query: 44  SFEGA-RNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFFQWCSSPEN 101
           S +GA +  P+ FK ++   S+     V     V++++P M + G SR+L  +W  S +N
Sbjct: 324 SGDGAGKGGPWDFKFIRSLKSIDRFEDVGG--CVMLASPGMLQNGVSRELLERWAPSEKN 381

Query: 102 SIIITNRTSPGTLARDLIE 120
            +IIT  +  GT+A+ +++
Sbjct: 382 GVIITGYSVEGTMAKQIMQ 400


>gi|156042700|ref|XP_001587907.1| hypothetical protein SS1G_11148 [Sclerotinia sclerotiorum 1980]
 gi|154695534|gb|EDN95272.1| hypothetical protein SS1G_11148 [Sclerotinia sclerotiorum 1980
           UF-70]
          Length = 936

 Score = 43.1 bits (100), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 39/171 (22%), Positives = 69/171 (40%), Gaps = 58/171 (33%)

Query: 35  EWMSDKLMKSFEG--------------------ARNN--PFHFKHVKLCHSLAELAKV-- 70
           EWM + +++ FE                     ++ N  PF FKH++L     ++ ++  
Sbjct: 309 EWMDEAIIREFEAQPGHEEQQTGQQRRHAYSDESKQNAGPFEFKHLRLLGRKGQIDRMLN 368

Query: 71  -------PSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR--------------- 108
                     KV+L S   +E GFS+++  +     +N +I+T +               
Sbjct: 369 ETDNLGRSVGKVILASDTSIEWGFSKEVLRKIADDDKNLLILTEKLNRIDGVTGLGRTLW 428

Query: 109 -----------TSPGTLARDLIEL-GGNRTLTLQVKKRIRLEGEELEEYQK 147
                      T P +   +L ++ GG R L ++  KRI LEG +L  YQ+
Sbjct: 429 SWWEERRNGVATEPSSNGGNLEQVYGGGRDLEIREPKRIPLEGNDLTVYQQ 479


>gi|344229479|gb|EGV61364.1| hypothetical protein CANTEDRAFT_98614 [Candida tenuis ATCC 10573]
          Length = 943

 Score = 43.1 bits (100), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 30/90 (33%), Positives = 42/90 (46%), Gaps = 11/90 (12%)

Query: 34  IEWMSDKLMKSFE---------GARNN-PFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM 83
           +EWMS  L K  E         G RN+ PF    V L  +  EL ++  PKVV  S  D+
Sbjct: 284 LEWMSPSLSKELENANSIVTNTGNRNHFPFDPSKVDLVSTPYELTQMAGPKVVFTSGVDL 343

Query: 84  ECG-FSRDLFFQWCSSPENSIIITNRTSPG 112
             G  S +     C+  + +II+T +T  G
Sbjct: 344 NSGELSSEALRVLCNDEKTTIILTEKTHFG 373


>gi|417403209|gb|JAA48422.1| Putative mrna cleavage and polyadenylation factor ii complex brr5
           cpsf subunit [Desmodus rotundus]
          Length = 604

 Score = 43.1 bits (100), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 29/111 (26%), Positives = 52/111 (46%), Gaps = 7/111 (6%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K     A     P   VV  +   +  G S  +F 
Sbjct: 283 ITWTNQKIRKTF--VQRNMFEFKHIK-AFDRAXXXAHPCAMVVFATPGMLHAGQSLQIFR 339

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKK--RIRLEGEEL 142
           +W  + +N +I+      GT+   +  L G R L ++ ++   +R++ E +
Sbjct: 340 KWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVRMQVEYM 388


>gi|429966185|gb|ELA48182.1| hypothetical protein VCUG_00420 [Vavraia culicis 'floridensis']
          Length = 669

 Score = 43.1 bits (100), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 21/71 (29%), Positives = 38/71 (53%), Gaps = 1/71 (1%)

Query: 49  RNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 108
           + + F+FKH++   S+     + +  VV+ S   ++ G SR+LF  WC    N  +I   
Sbjct: 307 KKDAFNFKHIRNLKSVDN-HLIKNACVVMASPGMLQSGLSRELFESWCEDANNGTVIPGY 365

Query: 109 TSPGTLARDLI 119
              GTLA++++
Sbjct: 366 CVQGTLAKEIM 376


>gi|242778797|ref|XP_002479311.1| cleavage and polyadenylation specifity factor, 73 kDa subunit,
           putative [Talaromyces stipitatus ATCC 10500]
 gi|218722930|gb|EED22348.1| cleavage and polyadenylation specifity factor, 73 kDa subunit,
           putative [Talaromyces stipitatus ATCC 10500]
          Length = 861

 Score = 43.1 bits (100), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 21/70 (30%), Positives = 40/70 (57%), Gaps = 3/70 (4%)

Query: 51  NPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFFQWCSSPENSIIITNRT 109
            P+ F++V+   SL     + S   V++++P M + G SR+L  +W  S  N +++T  +
Sbjct: 333 GPWDFRYVRSLRSLERFDDIGS--CVMLASPGMLQTGTSRELLERWAPSERNGVVMTGYS 390

Query: 110 SPGTLARDLI 119
             GT+A+ L+
Sbjct: 391 VEGTMAKQLL 400


>gi|212533753|ref|XP_002147033.1| cleavage and polyadenylation specifity factor, 73 kDa subunit,
           putative [Talaromyces marneffei ATCC 18224]
 gi|210072397|gb|EEA26486.1| cleavage and polyadenylation specifity factor, 73 kDa subunit,
           putative [Talaromyces marneffei ATCC 18224]
          Length = 866

 Score = 43.1 bits (100), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 21/70 (30%), Positives = 40/70 (57%), Gaps = 3/70 (4%)

Query: 51  NPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFFQWCSSPENSIIITNRT 109
            P+ F++V+   SL     + S   V++++P M + G SR+L  +W  S  N +++T  +
Sbjct: 333 GPWDFRYVRSLRSLERFDDIGS--CVMLASPGMLQTGTSRELLERWAPSERNGVVMTGYS 390

Query: 110 SPGTLARDLI 119
             GT+A+ L+
Sbjct: 391 VEGTMAKQLL 400


>gi|440493393|gb|ELQ75870.1| mRNA cleavage and polyadenylation factor II complex, BRR5 (CPSF
           subunit) [Trachipleistophora hominis]
          Length = 801

 Score = 43.1 bits (100), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 21/71 (29%), Positives = 38/71 (53%), Gaps = 1/71 (1%)

Query: 49  RNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 108
           + + F FKH++   S+ +   + +  VV+ S   ++ G SR+LF  WC    N  +I   
Sbjct: 361 KKDAFTFKHIRNLKSV-DNHLIKNACVVMASPGMLQSGLSRELFESWCEDANNGTVIPGY 419

Query: 109 TSPGTLARDLI 119
              GTLA++++
Sbjct: 420 CVQGTLAKEIM 430


>gi|346472285|gb|AEO35987.1| hypothetical protein [Amblyomma maculatum]
          Length = 510

 Score = 43.1 bits (100), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 37/159 (23%), Positives = 66/159 (41%), Gaps = 9/159 (5%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K           P P VV  +   +  G S  +F 
Sbjct: 254 ITWTNQKIRKTF--VQRNMFDFKHIKPFDR--AFIDNPGPMVVFATPGMLHAGLSLQIFK 309

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE-GEELEEYQKKKDKE 152
           +W     N +I+      GT+   +  L G R + L+ ++ + ++   +   +    D +
Sbjct: 310 KWAPFEANMVIMPGYCVAGTVGHKV--LSGARKVELENRQVVEVKMSVQYMSFSAHADAK 367

Query: 153 AKDKQEKEKIPPHDTSFINELQLSDF--KQTLQRNGIDC 189
              +  ++  P +      E    DF  K+ LQ   +DC
Sbjct: 368 GIMQLIQQCEPANVLLVHGEAGKMDFLRKKILQEFSVDC 406


>gi|66816359|ref|XP_642189.1| integrator complex subunit 11 [Dictyostelium discoideum AX4]
 gi|74856745|sp|Q54YL3.1|INT11_DICDI RecName: Full=Integrator complex subunit 11 homolog
 gi|60470287|gb|EAL68267.1| integrator complex subunit 11 [Dictyostelium discoideum AX4]
          Length = 744

 Score = 43.1 bits (100), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 29/90 (32%), Positives = 43/90 (47%), Gaps = 6/90 (6%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLF 92
           I W + K+ ++F   + N F FKH+K   S   L   P   +VL +TP M   G S ++F
Sbjct: 283 INWTNQKIKQTF--VKRNMFDFKHIKPFQS--HLVDAPGA-MVLFATPGMLHAGASLEVF 337

Query: 93  FQWCSSPENSIIITNRTSPGTLARDLIELG 122
            +W  +  N  II      GT+   L+  G
Sbjct: 338 KKWAPNELNMTIIPGYCVVGTVGNKLLTTG 367


>gi|13541495|ref|NP_111183.1| metal-dependent RNase [Thermoplasma volcanium GSS1]
 gi|14324879|dbj|BAB59805.1| cleavage and polyadenylation specificity factor [Thermoplasma
           volcanium GSS1]
          Length = 639

 Score = 43.1 bits (100), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 26/107 (24%), Positives = 54/107 (50%), Gaps = 10/107 (9%)

Query: 35  EWMSDKLMKSFEGARNNPFH---FKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDL 91
           E+++ +L ++    + NPF    FK V+      ++A+ P  KVVL ++  M  G   + 
Sbjct: 463 EYLNKELREAIMVRKENPFLSQIFKKVETREQREDIAEDPESKVVLATSGMMNGGPVMEY 522

Query: 92  FFQWCSSPENSIIITNRTSPGTL-------ARDLIELGGNRTLTLQV 131
           F  W  +P+++++     + GT+       A+++    G +T+T Q+
Sbjct: 523 FKAWAGNPKHTLVFVGYQADGTMGKRIQSGAKEVTISEGGKTVTYQI 569


>gi|440801023|gb|ELR22048.1| cleavage and polyadenylation specific factor 3like, putative
           [Acanthamoeba castellanii str. Neff]
          Length = 657

 Score = 43.1 bits (100), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 32/106 (30%), Positives = 50/106 (47%), Gaps = 7/106 (6%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLF 92
           I W ++K+ ++F     N F FKH+        LA  P P +VL +TP M   G S ++F
Sbjct: 282 IHWTNEKIKRTF--VHRNMFDFKHISTFER--GLADQPGP-MVLFATPGMLHAGTSLEVF 336

Query: 93  FQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
            +W  + +N +II      GT+   L   G   +  + +  R  LE
Sbjct: 337 KKWAPNEKNLVIIPGYCVVGTVGNKLAA-GRKGSFKVDLDSRTSLE 381


>gi|449299688|gb|EMC95701.1| hypothetical protein BAUCODRAFT_71003 [Baudoinia compniacensis UAMH
           10762]
          Length = 938

 Score = 42.7 bits (99), Expect = 0.15,   Method: Composition-based stats.
 Identities = 28/106 (26%), Positives = 50/106 (47%), Gaps = 14/106 (13%)

Query: 34  IEWMSDKLMKSFEG------ARNNPFHFKHVKLCH---SLAELAKVPSPKVVLVSTPDME 84
           +EW+ + +    E          NP +++HVKL     +L    +   P V L S   +E
Sbjct: 326 LEWVEESVRAEAEAHLTKTKGSTNPLNWQHVKLVERNSTLERAVQRSQPCVFLASDASLE 385

Query: 85  CGFSRDLFFQWCSSPENSIIITNRTSPG-----TLARDLIELGGNR 125
            GFSR       +  +N +I+T +++PG     +++R L +L  +R
Sbjct: 386 WGFSRLALESLATDTKNLVILTEKSAPGSPPQSSISRQLWDLWQSR 431


>gi|313238583|emb|CBY13629.1| unnamed protein product [Oikopleura dioica]
          Length = 618

 Score = 42.7 bits (99), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 29/108 (26%), Positives = 49/108 (45%), Gaps = 8/108 (7%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           + W ++K+  SF     N F FK++K       + +   P+V   +   +  G S ++F 
Sbjct: 287 VNWTNEKIKSSF--VERNLFDFKYIKAFQKEIHMNQ-SGPQVCFATPGMLHAGMSLEIFQ 343

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTL---TLQVKKRIRLE 138
            WC+  +N II+      GT+   L  L G R      + V  RI++E
Sbjct: 344 NWCTDEKNCIIMPGYCVAGTVGHRL--LHGERHFKFNGVNVTSRIKVE 389


>gi|393241063|gb|EJD48587.1| hypothetical protein AURDEDRAFT_183466 [Auricularia delicata
           TFB-10046 SS5]
          Length = 893

 Score = 42.7 bits (99), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 34/131 (25%), Positives = 62/131 (47%), Gaps = 27/131 (20%)

Query: 53  FHFKHVKLCHSLAELAKV-PS--PKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRT 109
             F+H++   + A+L    PS  PK++L    ++  G SR +F  + S   N +++T++ 
Sbjct: 376 LRFRHLEFFTTYAQLTSTYPSSKPKLILAVPQNISHGSSRAIFTDFASVVGNVVVLTSKG 435

Query: 110 SPGTLARDLI-----------ELGG---------NRTLTLQVKKRIRLEGEELEEYQKKK 149
             GTL+R L            + G          N TL L++  ++ L+G ELE + + +
Sbjct: 436 EQGTLSRMLFDKWNEAQRDGDQYGAGTVGEPVTLNETLKLRMHTKVPLQGAELETHLQAE 495

Query: 150 DKEAKDKQEKE 160
               +  QE+E
Sbjct: 496 ----RAAQERE 502


>gi|171689890|ref|XP_001909884.1| hypothetical protein [Podospora anserina S mat+]
 gi|170944907|emb|CAP71018.1| unnamed protein product [Podospora anserina S mat+]
          Length = 835

 Score = 42.7 bits (99), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 24/79 (30%), Positives = 45/79 (56%), Gaps = 4/79 (5%)

Query: 44  SFEGA-RNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFFQWCSSPEN 101
           S +GA +  P+ FK ++   S+     V     V++++P M + G SR+L  +W  S +N
Sbjct: 325 SGDGAGKGGPWDFKFIRSLKSIDRFEDVGG--CVMLASPGMLQNGVSRELLERWAPSEKN 382

Query: 102 SIIITNRTSPGTLARDLIE 120
            +IIT  +  GT+A+ +++
Sbjct: 383 GVIITGYSVEGTMAKQIMQ 401


>gi|115396064|ref|XP_001213671.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
 gi|114193240|gb|EAU34940.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
          Length = 1005

 Score = 42.7 bits (99), Expect = 0.17,   Method: Composition-based stats.
 Identities = 40/181 (22%), Positives = 73/181 (40%), Gaps = 37/181 (20%)

Query: 26  HSLAELAK--IEWMSDKLMKSFEGARN---------------------NPFHFKHVKLCH 62
           H+   LA+  +EWM + +++ FE A                        PF FK++KL  
Sbjct: 339 HTTMRLARSMLEWMDENIVREFEAAEGVDATTGQSIQRPGGQKDEKGVGPFTFKNLKLVE 398

Query: 63  SLAELAKV---PSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLI 119
              +L K+    +PKV+L S   ++ GF+++          N +++T       L  D  
Sbjct: 399 RRKKLEKILADQTPKVILASDSSLDWGFAKESLRLIAEGSNNLLLLTESLHKERLQED-- 456

Query: 120 ELGGNRTLTLQVKKRIRLEGEELEEYQKKKDKEAKDKQEKEKIPPHDTSFINELQLSDFK 179
                     +  +R  L G   + Y+++KD  A +K    ++     S   EL  +D +
Sbjct: 457 ---------QEGAQRRTLGGMIWDWYEERKDGVALEKAADGEMIEQVHSGGRELSWTDVR 507

Query: 180 Q 180
           +
Sbjct: 508 R 508


>gi|341038970|gb|EGS23962.1| hypothetical protein CTHT_0006720 [Chaetomium thermophilum var.
           thermophilum DSM 1495]
          Length = 894

 Score = 42.7 bits (99), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 23/87 (26%), Positives = 45/87 (51%), Gaps = 3/87 (3%)

Query: 35  EWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFF 93
           E M++         +  P+ FK ++   S+     V     V++++P M + G SR+L  
Sbjct: 315 ERMAEAEASGDSAGKGGPWDFKFIRSLKSIDRFEDVGG--CVMLASPGMLQNGVSRELLE 372

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIE 120
           +W  + +N +IIT  +  GT+A+ L++
Sbjct: 373 RWAPNEKNGVIITGYSVEGTMAKQLMQ 399


>gi|167525469|ref|XP_001747069.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163774364|gb|EDQ87993.1| predicted protein [Monosiga brevicollis MX1]
          Length = 730

 Score = 42.7 bits (99), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 32/111 (28%), Positives = 52/111 (46%), Gaps = 9/111 (8%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVK-LCHSLAELAKVPSPKVVLVSTPDMECGFSRDLF 92
           I W + KL ++F     N F F+HV+    S  E A    P+V+  +   +  G S   F
Sbjct: 288 ITWTNQKLKETF--VERNLFDFQHVQPFDRSYLEHA---GPQVLFATPGMLHAGTSLLAF 342

Query: 93  FQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKK-RIRLEGEEL 142
             WC  P N +I+    + GT+   +I   G R L ++ +   +R++ E L
Sbjct: 343 THWCEDPRNMVILPGYCTAGTVGAKII--AGIRELDIEGRHYTVRMDVEYL 391


>gi|367034742|ref|XP_003666653.1| hypothetical protein MYCTH_2311535 [Myceliophthora thermophila ATCC
           42464]
 gi|347013926|gb|AEO61408.1| hypothetical protein MYCTH_2311535 [Myceliophthora thermophila ATCC
           42464]
          Length = 879

 Score = 42.4 bits (98), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 24/79 (30%), Positives = 45/79 (56%), Gaps = 4/79 (5%)

Query: 44  SFEGA-RNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFFQWCSSPEN 101
           S +GA +  P+ FK ++   S+     V     V++++P M + G SR+L  +W  S +N
Sbjct: 324 SGDGAGKGGPWDFKFIRSLKSIDRFEDVGG--CVMLASPGMLQNGVSRELLERWAPSEKN 381

Query: 102 SIIITNRTSPGTLARDLIE 120
            +IIT  +  GT+A+ +++
Sbjct: 382 GVIITGYSVEGTMAKHIMQ 400


>gi|336259697|ref|XP_003344648.1| hypothetical protein SMAC_07216 [Sordaria macrospora k-hell]
 gi|380088385|emb|CCC13649.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 857

 Score = 42.4 bits (98), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 24/79 (30%), Positives = 45/79 (56%), Gaps = 4/79 (5%)

Query: 44  SFEGA-RNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFFQWCSSPEN 101
           S +GA +  P+ FK ++   S+     V     V++++P M + G SR+L  +W  S +N
Sbjct: 323 SGDGAGKGGPWDFKFIRSLKSIDRFEDVGG--CVMLASPGMLQNGVSRELLERWAPSEKN 380

Query: 102 SIIITNRTSPGTLARDLIE 120
            +IIT  +  GT+A+ +++
Sbjct: 381 GVIITGYSVEGTMAKHIMQ 399


>gi|392593024|gb|EIW82350.1| hypothetical protein CONPUDRAFT_54247 [Coniophora puteana
           RWD-64-598 SS2]
          Length = 926

 Score = 42.4 bits (98), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 30/109 (27%), Positives = 50/109 (45%), Gaps = 26/109 (23%)

Query: 73  PKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLI------------- 119
           PK++L     +  G SR LF  + + P+N +++T R   GTL + L              
Sbjct: 404 PKIILAVPASLSHGPSRSLFVDFAAVPDNVVLLTGRGEEGTLGQILFGRWNDSQRADDKW 463

Query: 120 ---ELGGN----RTLTLQVKKRIRLEGEELEEYQKKKDKEAKDKQEKEK 161
              ++G N      + L++  ++ L+G ELE Y       AK++  KEK
Sbjct: 464 DKGKIGRNVMMDGAMRLKMSSKVPLQGTELELYL------AKERATKEK 506


>gi|407929750|gb|EKG22561.1| RNA-metabolising metallo-beta-lactamase [Macrophomina phaseolina
           MS6]
          Length = 974

 Score = 42.4 bits (98), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 42/192 (21%), Positives = 72/192 (37%), Gaps = 59/192 (30%)

Query: 14  NNPFHFKHVKLCHSLAELAK--IEWMSDKLMKSFEGAR---------------------- 49
           N P +    + C +    A+  +EWM + +++ FE A                       
Sbjct: 308 NAPLYLAS-RTCAATMRYARSMLEWMDEGIVREFEAASSGQGTDDNKRSRTQQGSGRSKE 366

Query: 50  --------NNPFHFKHVKLCHSLAELAKV---PSPKVVLVSTPDMECGFSRDLFFQWCSS 98
                   N PF F+ ++L     +++++     PKV+L S   +E GFS++      + 
Sbjct: 367 GKEDAKKPNAPFDFRSLRLVERRTQVSRMLAAEGPKVILASDVSLEWGFSKEAVRALAAD 426

Query: 99  PENSIIITNRT-----SPGTLARDLIELGGNRT------------------LTLQVKKRI 135
             N +I+T R      S   L R L EL   R+                    ++V +  
Sbjct: 427 SRNLVILTERIAQPSDSKKGLGRLLYELWAERSGANESAENPSPQHAAGTEAAVEVVQAA 486

Query: 136 RLEGEELEEYQK 147
            LEG E+  YQ+
Sbjct: 487 PLEGNEVPLYQQ 498


>gi|146417489|ref|XP_001484713.1| hypothetical protein PGUG_02442 [Meyerozyma guilliermondii ATCC
           6260]
          Length = 821

 Score = 42.4 bits (98), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 41/156 (26%), Positives = 68/156 (43%), Gaps = 33/156 (21%)

Query: 34  IEWMSDKLMKSFEGA---------RNN-PFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM 83
           +EWMS  L+K +E A         +NN PF    V L     EL ++  PK+VL +  DM
Sbjct: 173 LEWMSTLLVKEWEAASSASMNSTNKNNFPFDPSKVDLLLDPKELIQLSGPKIVLCAGIDM 232

Query: 84  ECG-FSRDLFFQWCSSPENSIIITNRTSPGT---------------------LARDLIEL 121
             G  S ++    C   +N++++T +T  G                       A D + +
Sbjct: 233 NSGDVSFEVLKYLCLDQKNTVLLTEKTHFGADFSINAQLFTDWVRLSREKYGNAEDGLAI 292

Query: 122 GGNRTLTLQ-VKKRIRLEGEELEEYQKKKDKEAKDK 156
           G   T+ L+ + +   L G EL  +Q++ + + K K
Sbjct: 293 GYEGTIPLRGLSREDPLSGSELTSFQERINHQRKKK 328


>gi|212543221|ref|XP_002151765.1| cleavage and polyadenylylation specificity factor, putative
           [Talaromyces marneffei ATCC 18224]
 gi|210066672|gb|EEA20765.1| cleavage and polyadenylylation specificity factor, putative
           [Talaromyces marneffei ATCC 18224]
          Length = 1015

 Score = 42.4 bits (98), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 28/117 (23%), Positives = 49/117 (41%), Gaps = 35/117 (29%)

Query: 25  CHSLAELAK--IEWMSDKLMKSFE------------------------------GARNNP 52
            HS   LA+  +EWM + +++ FE                              G +  P
Sbjct: 342 AHSTMRLARSMLEWMDEGIVREFEAVEGGDAAAVRGHKTTDSQNRNAGVTRDKQGTKLGP 401

Query: 53  FHFKHVKLCHSLAELAKVPS---PKVVLVSTPDMECGFSRDLFFQWCSSPENSIIIT 106
           F  KH+K+     +L KV +   PKV++ S   ++ G+S++ F       +N I++T
Sbjct: 402 FTLKHLKIVEQKRKLEKVLADGIPKVIIASDTSLDWGYSKETFRTLAQGSQNLILLT 458


>gi|167526212|ref|XP_001747440.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163774275|gb|EDQ87907.1| predicted protein [Monosiga brevicollis MX1]
          Length = 668

 Score = 42.4 bits (98), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 24/72 (33%), Positives = 39/72 (54%), Gaps = 3/72 (4%)

Query: 50  NNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFFQWCSSPENSIIITNR 108
           +NPF F+ +     L E     S   V+++TP M + G SR+LF +W  +  N +I+   
Sbjct: 312 SNPFQFQFIHNLRKLDEFDDHGSS--VVLATPGMLQNGLSRELFERWAPNRHNGVILAGY 369

Query: 109 TSPGTLARDLIE 120
              GTLA +L++
Sbjct: 370 HVEGTLAHELLK 381


>gi|156083689|ref|XP_001609328.1| cleavage and polyadenylation specifity factor [Babesia bovis T2Bo]
 gi|154796579|gb|EDO05760.1| cleavage and polyadenylation specifity factor [Babesia bovis]
          Length = 709

 Score = 42.0 bits (97), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 23/71 (32%), Positives = 38/71 (53%), Gaps = 3/71 (4%)

Query: 51  NPFHFKHVKLCHSLAELA---KVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITN 107
           NPF+F  VK   S+AE++   +   P +V+ S   ++ G S  +F +      N +++T 
Sbjct: 337 NPFNFTFVKYARSVAEISQYLQADGPCIVMTSPGMLQGGPSLQIFEKIAPDSRNGVVLTG 396

Query: 108 RTSPGTLARDL 118
            T  GTLA +L
Sbjct: 397 YTVKGTLADEL 407


>gi|330923041|ref|XP_003300074.1| hypothetical protein PTT_11224 [Pyrenophora teres f. teres 0-1]
 gi|311325959|gb|EFQ91831.1| hypothetical protein PTT_11224 [Pyrenophora teres f. teres 0-1]
          Length = 705

 Score = 42.0 bits (97), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 23/72 (31%), Positives = 36/72 (50%), Gaps = 1/72 (1%)

Query: 48  ARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITN 107
            R   + FK V+   SL     +    V+L S   M+ G SR+L  +W   P N +IIT 
Sbjct: 328 GRRGAWDFKFVRSLKSLERFDDLGG-CVMLASPGMMQSGTSRELLERWAPDPRNGVIITG 386

Query: 108 RTSPGTLARDLI 119
            +  GT+A+ ++
Sbjct: 387 YSVEGTMAKQIV 398


>gi|302501173|ref|XP_003012579.1| hypothetical protein ARB_01192 [Arthroderma benhamiae CBS 112371]
 gi|291176138|gb|EFE31939.1| hypothetical protein ARB_01192 [Arthroderma benhamiae CBS 112371]
          Length = 991

 Score = 42.0 bits (97), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 35/131 (26%), Positives = 50/131 (38%), Gaps = 38/131 (29%)

Query: 13  RNNPFHFKHVKLCHSLAELAK--IEWMSDKLMKSFE--------------GARN------ 50
           +N P +    K  H    LA+  +EWM + +++ FE              GA N      
Sbjct: 295 KNTPLYLAGKK-AHDTMRLARSMLEWMDENIVREFEGNDGVEATTGKAAGGASNQPSKGV 353

Query: 51  ------------NPFHFKHVKLCHSLAELAKV---PSPKVVLVSTPDMECGFSRDLFFQW 95
                        PF FKH+ L    A+L  V     PKV+L     +E G S+ +    
Sbjct: 354 QSQKSATGQKSLGPFTFKHLNLVEHKAKLDGVLESKGPKVILSPDTSLEWGLSKHVLKHI 413

Query: 96  CSSPENSIIIT 106
               EN II+T
Sbjct: 414 AEGNENLIIMT 424


>gi|402217247|gb|EJT97328.1| Metallo-hydrolase/oxidoreductase [Dacryopinax sp. DJM-731 SS1]
          Length = 780

 Score = 42.0 bits (97), Expect = 0.28,   Method: Compositional matrix adjust.
 Identities = 24/73 (32%), Positives = 38/73 (52%), Gaps = 3/73 (4%)

Query: 50  NNPFHFKHVK-LCHSLAELAKVPS--PKVVLVSTPDMECGFSRDLFFQWCSSPENSIIIT 106
            NPF F+HV  +  S     K+    P V+L S   M+ G SR+L   W     N I++T
Sbjct: 306 QNPFVFRHVSHIPRSSGWEKKIGEGPPCVILASPGMMQSGASRELLEMWAPDRRNGIVLT 365

Query: 107 NRTSPGTLARDLI 119
             +  G++AR+++
Sbjct: 366 GYSVEGSMARNIM 378


>gi|330796066|ref|XP_003286090.1| hypothetical protein DICPUDRAFT_30371 [Dictyostelium purpureum]
 gi|325083909|gb|EGC37349.1| hypothetical protein DICPUDRAFT_30371 [Dictyostelium purpureum]
          Length = 468

 Score = 42.0 bits (97), Expect = 0.28,   Method: Compositional matrix adjust.
 Identities = 31/106 (29%), Positives = 50/106 (47%), Gaps = 7/106 (6%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLF 92
           I W + K+ ++F   + N F FKH+K   S   L   P   +VL +TP M   G S ++F
Sbjct: 283 INWTNQKIKQTF--VKRNMFDFKHIKPFQS--HLVDAPG-AMVLFATPGMLHAGASLEVF 337

Query: 93  FQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
            +W  +  N  II      GT+   L+       + +++ K+  LE
Sbjct: 338 KKWAPNELNMTIIPGYCVVGTVGNKLLANNAGSQM-VEIDKKTTLE 382


>gi|189208340|ref|XP_001940503.1| endoribonuclease YSH1 [Pyrenophora tritici-repentis Pt-1C-BFP]
 gi|187976596|gb|EDU43222.1| endoribonuclease YSH1 [Pyrenophora tritici-repentis Pt-1C-BFP]
          Length = 871

 Score = 42.0 bits (97), Expect = 0.29,   Method: Compositional matrix adjust.
 Identities = 23/72 (31%), Positives = 36/72 (50%), Gaps = 1/72 (1%)

Query: 48  ARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITN 107
            R   + FK V+   SL     +    V+L S   M+ G SR+L  +W   P N +IIT 
Sbjct: 328 GRRGAWDFKFVRSLKSLERFDDLGG-CVMLASPGMMQSGTSRELLERWAPDPRNGVIITG 386

Query: 108 RTSPGTLARDLI 119
            +  GT+A+ ++
Sbjct: 387 YSVEGTMAKQIV 398


>gi|428177137|gb|EKX46018.1| hypothetical protein GUITHDRAFT_70813 [Guillardia theta CCMP2712]
          Length = 485

 Score = 42.0 bits (97), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 27/91 (29%), Positives = 44/91 (48%), Gaps = 6/91 (6%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLF 92
           I W + K+  +F   + N F F+H++           P P +VL +TP M   G S ++F
Sbjct: 284 ISWTNQKIKDTF--VKRNVFDFQHIQPFDR--AFIDRPGP-MVLFATPGMLHGGLSMEVF 338

Query: 93  FQWCSSPENSIIITNRTSPGTLARDLIELGG 123
            +W  S +N +I+      GTL   ++  GG
Sbjct: 339 KKWAPSDKNLVIMPGYCVAGTLGHKVLSNGG 369


>gi|399216074|emb|CCF72762.1| unnamed protein product [Babesia microti strain RI]
          Length = 725

 Score = 42.0 bits (97), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 23/71 (32%), Positives = 37/71 (52%), Gaps = 3/71 (4%)

Query: 51  NPFHFKHVKLCHSLAELAK---VPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITN 107
           NP+HF H+K   ++  +       +P V++ S   ++ G SRD+F        N +I+T 
Sbjct: 308 NPYHFTHIKYAPTVDSVRSHLLRDAPCVIMTSPGMLQGGPSRDVFEIIAPDNRNGVILTG 367

Query: 108 RTSPGTLARDL 118
            T  GTLA +L
Sbjct: 368 YTVKGTLADEL 378


>gi|396488788|ref|XP_003842943.1| similar to cleavage and polyadenylation specifity factor
           [Leptosphaeria maculans JN3]
 gi|312219521|emb|CBX99464.1| similar to cleavage and polyadenylation specifity factor
           [Leptosphaeria maculans JN3]
          Length = 861

 Score = 42.0 bits (97), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 23/72 (31%), Positives = 36/72 (50%), Gaps = 1/72 (1%)

Query: 48  ARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITN 107
            R   + FK V+   SL     +    V+L S   M+ G SR+L  +W   P N +IIT 
Sbjct: 328 GRRGAWDFKFVRSLKSLERFDDLGG-CVMLASPGMMQSGTSRELLERWAPDPRNGVIITG 386

Query: 108 RTSPGTLARDLI 119
            +  GT+A+ ++
Sbjct: 387 YSVEGTMAKQIV 398


>gi|303310723|ref|XP_003065373.1| hypothetical protein CPC735_045980 [Coccidioides posadasii C735
           delta SOWgp]
 gi|240105035|gb|EER23228.1| hypothetical protein CPC735_045980 [Coccidioides posadasii C735
           delta SOWgp]
          Length = 1026

 Score = 42.0 bits (97), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 28/130 (21%), Positives = 50/130 (38%), Gaps = 42/130 (32%)

Query: 23  KLCHSLAELAK--IEWMSDKLMKSFEGARNN----------------------------- 51
           K  HS   LA+  +EWM + +++ FEG                                 
Sbjct: 341 KKVHSTMRLARSMLEWMDESIVREFEGGDGGESLGAGRSSGAASGQQSKGTPGQTSDKKS 400

Query: 52  --------PFHFKHVKLCH---SLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPE 100
                   PF F+H+K+      L  + +   PKV++ S   ++ GFS+++        E
Sbjct: 401 AGPHKGLGPFTFRHLKIIERKTKLENILRSEGPKVIIASDASLDWGFSKEILRHVAQGAE 460

Query: 101 NSIIITNRTS 110
           N +I+T  ++
Sbjct: 461 NLVILTESSA 470


>gi|389634325|ref|XP_003714815.1| endoribonuclease YSH1 [Magnaporthe oryzae 70-15]
 gi|351647148|gb|EHA55008.1| endoribonuclease YSH1 [Magnaporthe oryzae 70-15]
 gi|440467574|gb|ELQ36790.1| endoribonuclease YSH1 [Magnaporthe oryzae Y34]
 gi|440483131|gb|ELQ63565.1| endoribonuclease YSH1 [Magnaporthe oryzae P131]
          Length = 829

 Score = 42.0 bits (97), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 20/69 (28%), Positives = 40/69 (57%), Gaps = 1/69 (1%)

Query: 52  PFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSP 111
           P+ FK+++   +L     +  P V+L S   ++ G SR+L  +W  S +N ++IT  +  
Sbjct: 334 PWDFKYIRSLKNLDRFDDL-GPCVMLASPGMLQNGVSRELLERWAPSDKNGVVITGYSVE 392

Query: 112 GTLARDLIE 120
           GT+A+ +++
Sbjct: 393 GTMAKQIMQ 401


>gi|67517547|ref|XP_658594.1| hypothetical protein AN0990.2 [Aspergillus nidulans FGSC A4]
 gi|74598547|sp|Q5BEP0.1|YSH1_EMENI RecName: Full=Endoribonuclease ysh1; AltName: Full=mRNA
           3'-end-processing protein ysh1
 gi|40746402|gb|EAA65558.1| hypothetical protein AN0990.2 [Aspergillus nidulans FGSC A4]
 gi|259488717|tpe|CBF88384.1| TPA: Endoribonuclease ysh1 (EC 3.1.27.-)(mRNA 3'-end-processing
           protein ysh1) [Source:UniProtKB/Swiss-Prot;Acc:Q5BEP0]
           [Aspergillus nidulans FGSC A4]
          Length = 884

 Score = 42.0 bits (97), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 25/90 (27%), Positives = 47/90 (52%), Gaps = 9/90 (10%)

Query: 31  LAKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSR 89
           +A+ E   DK + +       P+ FK+V+   SL     V     V++++P M + G SR
Sbjct: 318 MAEAEASGDKSVSA------GPWDFKYVRSLRSLERFDDVGG--CVMLASPGMLQTGTSR 369

Query: 90  DLFFQWCSSPENSIIITNRTSPGTLARDLI 119
           +L  +W  +  N +++T  +  GT+A+ L+
Sbjct: 370 ELLERWAPNERNGVVMTGYSVEGTMAKQLL 399


>gi|451852830|gb|EMD66124.1| hypothetical protein COCSADRAFT_34708 [Cochliobolus sativus ND90Pr]
          Length = 872

 Score = 42.0 bits (97), Expect = 0.31,   Method: Compositional matrix adjust.
 Identities = 23/72 (31%), Positives = 36/72 (50%), Gaps = 1/72 (1%)

Query: 48  ARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITN 107
            R   + FK V+   SL     +    V+L S   M+ G SR+L  +W   P N +IIT 
Sbjct: 328 GRRGAWDFKFVRSLKSLERFDDLGG-CVMLASPGMMQSGTSRELLERWAPDPRNGVIITG 386

Query: 108 RTSPGTLARDLI 119
            +  GT+A+ ++
Sbjct: 387 YSVEGTMAKQIV 398


>gi|396082329|gb|AFN83939.1| putative beta-lactamase fold-containingexonuclease [Encephalitozoon
           romaleae SJ-2008]
          Length = 496

 Score = 42.0 bits (97), Expect = 0.31,   Method: Compositional matrix adjust.
 Identities = 30/93 (32%), Positives = 43/93 (46%), Gaps = 5/93 (5%)

Query: 51  NPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFFQWCSSPENSIIITNRT 109
           N F +KH+K         K P   +VL ++P M   G S  +F +WC   +N +II    
Sbjct: 293 NVFEYKHIKPFQKYYMENKGP---MVLFASPGMLHSGMSLRMFKEWCEDEKNLVIIPGYC 349

Query: 110 SPGTLARDLIELGGNRTLTLQVKKRIRLEGEEL 142
             GT+   ++  G  R   L  +K IRLE   L
Sbjct: 350 VRGTIGEKVLN-GAKRLEVLGEEKDIRLEVRNL 381


>gi|156403103|ref|XP_001639929.1| predicted protein [Nematostella vectensis]
 gi|156227060|gb|EDO47866.1| predicted protein [Nematostella vectensis]
          Length = 527

 Score = 42.0 bits (97), Expect = 0.31,   Method: Compositional matrix adjust.
 Identities = 27/102 (26%), Positives = 44/102 (43%), Gaps = 6/102 (5%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+  +F   + N F F+H+K           P P VV  +   +  G S  +F 
Sbjct: 254 ITWTNQKIKNTF--VQRNMFEFEHIKPFDR--SYIDNPGPMVVFATPGMLHAGLSLQIFK 309

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRI 135
           +W S+  N ++I      GT+   +  L G + + L  K  I
Sbjct: 310 KWASNENNMVVIPGYCVAGTVGHKV--LAGQKKIELDKKNVI 349


>gi|320034772|gb|EFW16715.1| cleavage and polyadenylylation specificity factor [Coccidioides
           posadasii str. Silveira]
          Length = 1026

 Score = 42.0 bits (97), Expect = 0.31,   Method: Compositional matrix adjust.
 Identities = 28/130 (21%), Positives = 50/130 (38%), Gaps = 42/130 (32%)

Query: 23  KLCHSLAELAK--IEWMSDKLMKSFEGARNN----------------------------- 51
           K  HS   LA+  +EWM + +++ FEG                                 
Sbjct: 341 KKVHSTMRLARSMLEWMDESIVREFEGGDGGESLGAGRSSGAASGQQSKGTPGQTSDKKS 400

Query: 52  --------PFHFKHVKLCH---SLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPE 100
                   PF F+H+K+      L  + +   PKV++ S   ++ GFS+++        E
Sbjct: 401 AGPHKGLGPFTFRHLKIIERKTKLENILRSEGPKVIIASDASLDWGFSKEILRHVAQGAE 460

Query: 101 NSIIITNRTS 110
           N +I+T  ++
Sbjct: 461 NLVILTESSA 470


>gi|402226056|gb|EJU06116.1| hypothetical protein DACRYDRAFT_73414 [Dacryopinax sp. DJM-731 SS1]
          Length = 925

 Score = 42.0 bits (97), Expect = 0.32,   Method: Compositional matrix adjust.
 Identities = 33/119 (27%), Positives = 54/119 (45%), Gaps = 23/119 (19%)

Query: 50  NNPFHFKHVKLCHS---LAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIIT 106
           N    FKH+++  S   L +     +PKV+L +   M  G SR +F  + S   N +++ 
Sbjct: 381 NMILRFKHLQIFPSPEALMDAIHPSAPKVILATPLTMSHGASRAMFESFSSMRNNLLLLV 440

Query: 107 NRTSPGTLARDLIEL-----------GGNRT---------LTLQVKKRIRLEGEELEEY 145
           N    GTLAR L ++           G  R          +++++  ++ L G ELEEY
Sbjct: 441 NIAEKGTLARSLWDIWQREQAETAKWGKGRLGAIVKAETDISVRMNAKVPLAGVELEEY 499


>gi|310796189|gb|EFQ31650.1| metallo-beta-lactamase superfamily protein [Glomerella graminicola
           M1.001]
          Length = 855

 Score = 41.6 bits (96), Expect = 0.34,   Method: Compositional matrix adjust.
 Identities = 22/87 (25%), Positives = 46/87 (52%), Gaps = 3/87 (3%)

Query: 35  EWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFF 93
           E M++         +  P+ FK+++   +L     V     V++++P M + G SR+L  
Sbjct: 316 ERMAEAEASGDGSGKGGPWDFKYIRSLKNLDRFDDVGG--CVMLASPGMLQNGVSRELLE 373

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIE 120
           +W  + +N +IIT  +  GT+A+ +++
Sbjct: 374 RWAPNDKNGVIITGYSVEGTMAKQIMQ 400


>gi|241245173|ref|XP_002402434.1| cleavage and polyadenylation specificity factor, putative [Ixodes
           scapularis]
 gi|215496345|gb|EEC05985.1| cleavage and polyadenylation specificity factor, putative [Ixodes
           scapularis]
          Length = 596

 Score = 41.6 bits (96), Expect = 0.34,   Method: Compositional matrix adjust.
 Identities = 27/105 (25%), Positives = 46/105 (43%), Gaps = 6/105 (5%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K           P P VV  +   +  G S  +F 
Sbjct: 283 ITWTNQKIRKTF--VQRNMFDFKHIKPFDR--AFIDNPGPMVVFATPGMLHAGLSLQIFK 338

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           +W     N +I+      GT+   +  L G R + L  ++ + ++
Sbjct: 339 KWAPVEGNMVIMPGYCVAGTVGHKI--LSGARKVELDNRQVVEVK 381


>gi|442751667|gb|JAA67993.1| Putative cleavage and polyadenylation specificity factor cpsf
           subunit [Ixodes ricinus]
          Length = 596

 Score = 41.6 bits (96), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 27/105 (25%), Positives = 46/105 (43%), Gaps = 6/105 (5%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K           P P VV  +   +  G S  +F 
Sbjct: 283 ITWTNQKIRKTF--VQRNMFDFKHIKPFDR--AFIDNPGPMVVFATPGMLHAGLSLQIFK 338

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           +W     N +I+      GT+   +  L G R + L  ++ + ++
Sbjct: 339 KWAPVEGNMVIMPGYCVAGTVGHKI--LSGARKVELDNRQVVEVK 381


>gi|429862463|gb|ELA37111.1| cleavage and polyadenylation specifity 73 kda [Colletotrichum
           gloeosporioides Nara gc5]
          Length = 831

 Score = 41.6 bits (96), Expect = 0.36,   Method: Compositional matrix adjust.
 Identities = 22/87 (25%), Positives = 46/87 (52%), Gaps = 3/87 (3%)

Query: 35  EWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFF 93
           E M++         +  P+ FK+++   +L     V     V++++P M + G SR+L  
Sbjct: 312 ERMAEAEASGDGSGKGGPWDFKYIRSLKNLDRFDDVGG--CVMLASPGMLQNGVSRELLE 369

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIE 120
           +W  + +N +IIT  +  GT+A+ +++
Sbjct: 370 RWAPNEKNGVIITGYSVEGTMAKQIMQ 396


>gi|242786013|ref|XP_002480717.1| cleavage and polyadenylylation specificity factor, putative
           [Talaromyces stipitatus ATCC 10500]
 gi|218720864|gb|EED20283.1| cleavage and polyadenylylation specificity factor, putative
           [Talaromyces stipitatus ATCC 10500]
          Length = 1017

 Score = 41.6 bits (96), Expect = 0.36,   Method: Compositional matrix adjust.
 Identities = 30/119 (25%), Positives = 51/119 (42%), Gaps = 35/119 (29%)

Query: 23  KLCHSLAELAK--IEWMSDKLMKSFEG------------------------ARNN----- 51
           K  HS   LA+  +EWM + +++ FE                         +R+N     
Sbjct: 340 KKAHSTMRLARSMLEWMDEGIVREFEAVEGGDAAAARGHKRTDSQSRTTGSSRDNKATKL 399

Query: 52  -PFHFKHVKLCHSLAELAKVPS---PKVVLVSTPDMECGFSRDLFFQWCSSPENSIIIT 106
            PF  KH+K+     +L K+     PKV++ S   ++ G+S++ F       +N II+T
Sbjct: 400 GPFTLKHLKIVEQKRKLEKILGDGIPKVIIASDTSLDWGYSKETFRTLAEDSQNLIILT 458


>gi|302661813|ref|XP_003022569.1| hypothetical protein TRV_03308 [Trichophyton verrucosum HKI 0517]
 gi|291186522|gb|EFE41951.1| hypothetical protein TRV_03308 [Trichophyton verrucosum HKI 0517]
          Length = 1024

 Score = 41.6 bits (96), Expect = 0.36,   Method: Compositional matrix adjust.
 Identities = 34/131 (25%), Positives = 50/131 (38%), Gaps = 38/131 (29%)

Query: 13  RNNPFHFKHVKLCHSLAELAK--IEWMSDKLMKSFE--------------GARN------ 50
           +N P +    K  H    LA+  +EWM + +++ FE              GA N      
Sbjct: 328 KNTPLYLAGKK-AHGTMRLARSMLEWMDENIVREFEGNDGVEATTGKAAGGASNQPSKGA 386

Query: 51  ------------NPFHFKHVKLCHSLAELAKV---PSPKVVLVSTPDMECGFSRDLFFQW 95
                        PF FKH+ L    A+L  +     PKV+L     +E G S+ +    
Sbjct: 387 QSQKSATGQKSLGPFTFKHLNLVEHKAKLDGILESKGPKVILSPDTSLEWGLSKHVLKHI 446

Query: 96  CSSPENSIIIT 106
               EN II+T
Sbjct: 447 AEGNENLIIMT 457


>gi|452002411|gb|EMD94869.1| hypothetical protein COCHEDRAFT_1222148 [Cochliobolus
           heterostrophus C5]
          Length = 872

 Score = 41.6 bits (96), Expect = 0.38,   Method: Compositional matrix adjust.
 Identities = 23/72 (31%), Positives = 36/72 (50%), Gaps = 1/72 (1%)

Query: 48  ARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITN 107
            R   + FK V+   SL     +    V+L S   M+ G SR+L  +W   P N +IIT 
Sbjct: 328 GRRGAWDFKFVRSLKSLERFDDLGG-CVMLASPGMMQSGTSRELLERWAPDPRNGVIITG 386

Query: 108 RTSPGTLARDLI 119
            +  GT+A+ ++
Sbjct: 387 YSVEGTMAKHIV 398


>gi|389585470|dbj|GAB68201.1| cleavage and polyadenylation specifity factor protein, partial
           [Plasmodium cynomolgi strain B]
          Length = 936

 Score = 41.6 bits (96), Expect = 0.39,   Method: Compositional matrix adjust.
 Identities = 25/72 (34%), Positives = 39/72 (54%), Gaps = 4/72 (5%)

Query: 51  NPFHFKHVKLCHSLAE----LAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIIT 106
           NPF+FK VK   SL      L +  +P VV+ S   ++ G S+++F       ++ +I+T
Sbjct: 355 NPFNFKFVKYAKSLDSILNYLYQDNNPCVVMASPGMLQNGISKNIFNIIAPDKKSGVILT 414

Query: 107 NRTSPGTLARDL 118
             T  GTLA +L
Sbjct: 415 GYTVKGTLADEL 426


>gi|427779771|gb|JAA55337.1| Putative mrna cleavage and polyadenylation factor ii complex brr5
           cpsf subunit [Rhipicephalus pulchellus]
          Length = 621

 Score = 41.6 bits (96), Expect = 0.41,   Method: Compositional matrix adjust.
 Identities = 27/105 (25%), Positives = 46/105 (43%), Gaps = 6/105 (5%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K           P P VV  +   +  G S  +F 
Sbjct: 317 ITWTNQKIRKTF--VQRNMFDFKHIKPFDR--AFIDNPGPMVVFATPGMLHAGLSLQIFK 372

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           +W     N +I+      GT+   +  L G R + L  ++ + ++
Sbjct: 373 KWAPFEANMVIMPGYCVAGTVGHKI--LSGARKVELDNRQVVEVK 415


>gi|256084683|ref|XP_002578556.1| cleavage and polyadenylation specificity factor [Schistosoma
           mansoni]
 gi|350644758|emb|CCD60512.1| cleavage and polyadenylation specificity factor,putative
           [Schistosoma mansoni]
          Length = 619

 Score = 41.6 bits (96), Expect = 0.41,   Method: Compositional matrix adjust.
 Identities = 28/95 (29%), Positives = 43/95 (45%), Gaps = 8/95 (8%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAE-LAKVPSPKVVLVSTPDMECGFSRDLF 92
           I W + K+ ++F   + N F FKH+K    L +     P P VV  +   +  G S  +F
Sbjct: 283 ISWTNQKIKETF--VKRNMFDFKHIK---PLGQGTVDNPGPMVVFATPGMLHAGQSLHIF 337

Query: 93  FQWCSSPENSIIITNRTSPGTLARDLIELGGNRTL 127
            +W S   N ++I      GT+   +  L G R L
Sbjct: 338 RKWASDERNMVVIPGYCVAGTVGYKI--LNGVRRL 370


>gi|340381556|ref|XP_003389287.1| PREDICTED: integrator complex subunit 11-like [Amphimedon
           queenslandica]
          Length = 610

 Score = 41.6 bits (96), Expect = 0.42,   Method: Compositional matrix adjust.
 Identities = 22/86 (25%), Positives = 37/86 (43%), Gaps = 4/86 (4%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+  +F     N F FKH+K           P P +V  +   +  G S  +F 
Sbjct: 284 ISWTNQKIKNTF--IHRNMFDFKHIKAFDR--SYIDQPGPMIVFATPGMLHAGLSLQIFK 339

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
           +W    +N +I+     PGT+   ++
Sbjct: 340 KWAEDEKNMLIMPGYCVPGTIGHKVL 365


>gi|427785581|gb|JAA58242.1| Putative mrna cleavage and polyadenylation factor ii complex brr5
           cpsf subunit [Rhipicephalus pulchellus]
          Length = 587

 Score = 41.6 bits (96), Expect = 0.43,   Method: Compositional matrix adjust.
 Identities = 27/105 (25%), Positives = 46/105 (43%), Gaps = 6/105 (5%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K           P P VV  +   +  G S  +F 
Sbjct: 283 ITWTNQKIRKTF--VQRNMFDFKHIKPFDR--AFIDNPGPMVVFATPGMLHAGLSLQIFK 338

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           +W     N +I+      GT+   +  L G R + L  ++ + ++
Sbjct: 339 KWAPFEANMVIMPGYCVAGTVGHKI--LSGARKVELDNRQVVEVK 381


>gi|380494427|emb|CCF33158.1| endoribonuclease YSH1 [Colletotrichum higginsianum]
          Length = 846

 Score = 41.2 bits (95), Expect = 0.43,   Method: Compositional matrix adjust.
 Identities = 22/87 (25%), Positives = 46/87 (52%), Gaps = 3/87 (3%)

Query: 35  EWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFF 93
           E M++         +  P+ FK+++   +L     V     V++++P M + G SR+L  
Sbjct: 316 ERMAEAEASGDGSGKGGPWDFKYIRSLKNLDRFDDVGG--CVMLASPGMLQNGVSRELLE 373

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIE 120
           +W  + +N +IIT  +  GT+A+ +++
Sbjct: 374 RWAPNDKNGVIITGYSVEGTMAKQIMQ 400


>gi|67525249|ref|XP_660686.1| hypothetical protein AN3082.2 [Aspergillus nidulans FGSC A4]
 gi|40744477|gb|EAA63653.1| hypothetical protein AN3082.2 [Aspergillus nidulans FGSC A4]
 gi|259485970|tpe|CBF83440.1| TPA: cleavage and polyadenylylation specificity factor, putative
           (AFU_orthologue; AFUA_3G09720) [Aspergillus nidulans
           FGSC A4]
          Length = 1005

 Score = 41.2 bits (95), Expect = 0.45,   Method: Composition-based stats.
 Identities = 26/102 (25%), Positives = 45/102 (44%), Gaps = 28/102 (27%)

Query: 34  IEWMSDKLMKSFEGA-------RNN------------------PFHFKHVKLCH---SLA 65
           +EWM + +++ FE A       +NN                  PF FKH+K       L 
Sbjct: 347 LEWMDESIVREFEAAEAADTAGQNNDGQRSDQRQGKTDNKGLGPFTFKHLKTVERKKKLE 406

Query: 66  ELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITN 107
           +L   P+PKV+L S   ++ GF+++          N +++T+
Sbjct: 407 QLLNDPTPKVILASDSSLDWGFAKESLRLLAGGENNLLLLTD 448


>gi|320170221|gb|EFW47120.1| integrator complex subunit 11 [Capsaspora owczarzaki ATCC 30864]
          Length = 661

 Score = 41.2 bits (95), Expect = 0.47,   Method: Compositional matrix adjust.
 Identities = 29/106 (27%), Positives = 50/106 (47%), Gaps = 8/106 (7%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLF 92
           I W + K+ ++F     N F FKH+K           P P +VL +TP M   G S D F
Sbjct: 282 ITWTNQKIKRTF--VERNMFEFKHIKPFDR--AFLDNPGP-MVLFATPGMLHAGMSLDAF 336

Query: 93  FQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
            +W  + +N +I+      GT+   +  L G++ + +  + R  ++
Sbjct: 337 RKWAPNDKNMVILPGYCVAGTVGNKV--LAGHKQIEMPDRARTVID 380


>gi|27372065|gb|AAN87883.1| FEG protein [Arabidopsis thaliana]
          Length = 613

 Score = 41.2 bits (95), Expect = 0.49,   Method: Compositional matrix adjust.
 Identities = 28/87 (32%), Positives = 40/87 (45%), Gaps = 6/87 (6%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMEC-GFSRDLF 92
           I W S  + +      +NPF FK+VK       L   P P  VL + P M C G S ++F
Sbjct: 282 ISWTSQNVKEKHN--THNPFDFKNVKDFDR--SLIHAPGP-CVLFAIPGMLCAGLSLEVF 336

Query: 93  FQWCSSPENSIIITNRTSPGTLARDLI 119
             W  SP N + +   +  GT+   L+
Sbjct: 337 KHWAPSPLNLVALLGYSVAGTVGHKLM 363


>gi|170595519|ref|XP_001902415.1| RNA-metabolising metallo-beta-lactamase family protein [Brugia
           malayi]
 gi|158589929|gb|EDP28737.1| RNA-metabolising metallo-beta-lactamase family protein [Brugia
           malayi]
          Length = 589

 Score = 41.2 bits (95), Expect = 0.50,   Method: Compositional matrix adjust.
 Identities = 29/88 (32%), Positives = 45/88 (51%), Gaps = 8/88 (9%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPK-VVLVSTPDM-ECGFSRDL 91
           I W ++K+ ++F     N F FKH++      E + + SP  +VL STP M   G S  +
Sbjct: 284 ISWTNEKIKRTF--VERNMFDFKHIRPF----EQSYIESPGPMVLFSTPGMLHGGQSLRV 337

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLI 119
           F +WCS  +N II+       T+   +I
Sbjct: 338 FTKWCSDEKNLIIMPGFCVANTVGAKVI 365


>gi|169767492|ref|XP_001818217.1| cleavage and polyadenylylation specificity factor [Aspergillus
           oryzae RIB40]
 gi|83766072|dbj|BAE56215.1| unnamed protein product [Aspergillus oryzae RIB40]
          Length = 1014

 Score = 41.2 bits (95), Expect = 0.51,   Method: Composition-based stats.
 Identities = 43/175 (24%), Positives = 70/175 (40%), Gaps = 42/175 (24%)

Query: 34  IEWMSDKLMKSFEGA----------RNN------------------PFHFKHVKLCHSLA 65
           +EWM + +++ FE A          R N                  PF FKH+K+     
Sbjct: 357 LEWMDENIVREFEAAEGVDAATGQSRANPGGQRSGQNQGKEEKGTGPFTFKHLKIVERKK 416

Query: 66  ELAKV---PSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELG 122
           +L K+    +PKV+L S   ++ GF+++        P N +++T       LA D  E  
Sbjct: 417 KLEKILNNQAPKVILASDTSLDWGFAKESLRLVAGGPNNLLLLTESFHKERLA-DNQEDP 475

Query: 123 GNRTLTLQVKKRIRLEGEELEEYQKKKDKEAKDKQEKEKIPPHDTSFINELQLSD 177
             +TL   +           E Y+++KD  A +K    ++     S   EL  SD
Sbjct: 476 YRKTLGNMI----------WEWYEERKDGVALEKSSDGEMIEQVHSGGRELSWSD 520


>gi|391871950|gb|EIT81099.1| mRNA cleavage and polyadenylation factor II complex, subunit CFT2
           [Aspergillus oryzae 3.042]
          Length = 1010

 Score = 41.2 bits (95), Expect = 0.51,   Method: Composition-based stats.
 Identities = 43/175 (24%), Positives = 70/175 (40%), Gaps = 42/175 (24%)

Query: 34  IEWMSDKLMKSFEGA----------RNN------------------PFHFKHVKLCHSLA 65
           +EWM + +++ FE A          R N                  PF FKH+K+     
Sbjct: 353 LEWMDENIVREFEAAEGVDAATGQSRANPGGQRSGQNQGKEEKGTGPFTFKHLKIVERKK 412

Query: 66  ELAKV---PSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELG 122
           +L K+    +PKV+L S   ++ GF+++        P N +++T       LA D  E  
Sbjct: 413 KLEKILNNQAPKVILASDTSLDWGFAKESLRLVAGGPNNLLLLTESFHKERLA-DNQEDP 471

Query: 123 GNRTLTLQVKKRIRLEGEELEEYQKKKDKEAKDKQEKEKIPPHDTSFINELQLSD 177
             +TL   +           E Y+++KD  A +K    ++     S   EL  SD
Sbjct: 472 YRKTLGNMI----------WEWYEERKDGVALEKSSDGEMIEQVHSGGRELSWSD 516


>gi|451946223|ref|YP_007466818.1| metallo-beta-lactamase and RNA-binding KH domain-containing
           putative metal-dependent RNase [Desulfocapsa sulfexigens
           DSM 10523]
 gi|451905571|gb|AGF77165.1| metallo-beta-lactamase and RNA-binding KH domain-containing
           putative metal-dependent RNase [Desulfocapsa sulfexigens
           DSM 10523]
          Length = 469

 Score = 41.2 bits (95), Expect = 0.52,   Method: Compositional matrix adjust.
 Identities = 23/79 (29%), Positives = 40/79 (50%), Gaps = 3/79 (3%)

Query: 45  FEGARNNPFHFKHVKLCHSLAE---LAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPEN 101
            E   NNPF F+ +K   S+ E   L+    P +++ ++  ME G           +P+N
Sbjct: 307 LEDKNNNPFGFETLKYTSSVDESKNLSSSDKPAIIISASGMMEGGRILHHLKSRAGNPKN 366

Query: 102 SIIITNRTSPGTLARDLIE 120
           +I+IT   +P TL R +++
Sbjct: 367 TILITGWQAPFTLGRQIVD 385


>gi|327351648|gb|EGE80505.1| cleavage and polyadenylation specificity factor [Ajellomyces
           dermatitidis ATCC 18188]
          Length = 983

 Score = 41.2 bits (95), Expect = 0.52,   Method: Composition-based stats.
 Identities = 37/161 (22%), Positives = 69/161 (42%), Gaps = 18/161 (11%)

Query: 1   MSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKIEWMSDKLMKSFEGARN--NPFHFKHV 58
           M + +++ FE    +P            ++   +  M DK     +GA     PF FK++
Sbjct: 311 MDEGIVREFEAGHGDPVAVSGKGRQDGPSQRNPLTGMPDK---RGDGAFKALGPFTFKYL 367

Query: 59  KLCHSLAELAKV---PSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLA 115
           K+    A+L K+    +PKV+L S   ++ G+S+ +     +  EN +I+T   S     
Sbjct: 368 KIVERKAKLDKILGSNTPKVILTSDTSLDWGYSKHVLQNIATGSENLVILTESFS----- 422

Query: 116 RDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKDKEAKDK 156
                +  N+ +    +    L  E    Y+++KD  A +K
Sbjct: 423 -----VSANKEMMDDGRSTTSLAHEIWSIYEERKDGVALEK 458


>gi|239610975|gb|EEQ87962.1| cleavage and polyadenylation specificity factor [Ajellomyces
           dermatitidis ER-3]
          Length = 983

 Score = 41.2 bits (95), Expect = 0.52,   Method: Composition-based stats.
 Identities = 37/161 (22%), Positives = 69/161 (42%), Gaps = 18/161 (11%)

Query: 1   MSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKIEWMSDKLMKSFEGARN--NPFHFKHV 58
           M + +++ FE    +P            ++   +  M DK     +GA     PF FK++
Sbjct: 311 MDEGIVREFEAGHGDPVAVSGKGRQDGPSQRNPLTGMPDK---RGDGAFKALGPFTFKYL 367

Query: 59  KLCHSLAELAKV---PSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLA 115
           K+    A+L K+    +PKV+L S   ++ G+S+ +     +  EN +I+T   S     
Sbjct: 368 KIVERKAKLDKILGSNTPKVILTSDTSLDWGYSKHVLQNIATGSENLVILTESFS----- 422

Query: 116 RDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKDKEAKDK 156
                +  N+ +    +    L  E    Y+++KD  A +K
Sbjct: 423 -----VSANKEMMDDGRSTTSLAHEIWSIYEERKDGVALEK 458


>gi|261206112|ref|XP_002627793.1| cleavage and polyadenylylation specificity factor [Ajellomyces
           dermatitidis SLH14081]
 gi|239592852|gb|EEQ75433.1| cleavage and polyadenylylation specificity factor [Ajellomyces
           dermatitidis SLH14081]
          Length = 983

 Score = 41.2 bits (95), Expect = 0.52,   Method: Composition-based stats.
 Identities = 37/161 (22%), Positives = 69/161 (42%), Gaps = 18/161 (11%)

Query: 1   MSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKIEWMSDKLMKSFEGARN--NPFHFKHV 58
           M + +++ FE    +P            ++   +  M DK     +GA     PF FK++
Sbjct: 311 MDEGIVREFEAGHGDPVAVSGKGRQDGPSQRNPLTGMPDK---RGDGAFKALGPFTFKYL 367

Query: 59  KLCHSLAELAKV---PSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLA 115
           K+    A+L K+    +PKV+L S   ++ G+S+ +     +  EN +I+T   S     
Sbjct: 368 KIVERKAKLDKILGSNTPKVILTSDTSLDWGYSKHVLQNIATGSENLVILTESFS----- 422

Query: 116 RDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKDKEAKDK 156
                +  N+ +    +    L  E    Y+++KD  A +K
Sbjct: 423 -----VSANKEMMDDGRSTTSLAHEIWSIYEERKDGVALEK 458


>gi|156092661|ref|XP_001612477.1| cleavage and polyadenylation specifity factor protein [Plasmodium
           vivax Sal-1]
 gi|148801279|gb|EDL42684.1| cleavage and polyadenylation specifity factor protein, putative
           [Plasmodium vivax]
          Length = 858

 Score = 41.2 bits (95), Expect = 0.52,   Method: Compositional matrix adjust.
 Identities = 25/72 (34%), Positives = 39/72 (54%), Gaps = 4/72 (5%)

Query: 51  NPFHFKHVKLCHSLAE----LAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIIT 106
           NPF+FK VK   SL      L +  +P VV+ S   ++ G S+++F       ++ +I+T
Sbjct: 355 NPFNFKFVKYAKSLDSILNYLYQDNNPCVVMASPGMLQNGISKNIFNIIAPDKKSGVILT 414

Query: 107 NRTSPGTLARDL 118
             T  GTLA +L
Sbjct: 415 GYTVKGTLADEL 426


>gi|169598186|ref|XP_001792516.1| hypothetical protein SNOG_01891 [Phaeosphaeria nodorum SN15]
 gi|111068983|gb|EAT90103.1| hypothetical protein SNOG_01891 [Phaeosphaeria nodorum SN15]
          Length = 833

 Score = 41.2 bits (95), Expect = 0.53,   Method: Compositional matrix adjust.
 Identities = 17/45 (37%), Positives = 27/45 (60%)

Query: 75  VVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLI 119
           V+L S   M+ G SR+L  +W   P N +IIT  +  GT+A+ ++
Sbjct: 355 VMLASPGMMQSGTSRELLERWAPDPRNGVIITGYSVEGTMAKQIV 399


>gi|356502382|ref|XP_003519998.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           3-II-like [Glycine max]
          Length = 516

 Score = 41.2 bits (95), Expect = 0.54,   Method: Compositional matrix adjust.
 Identities = 26/87 (29%), Positives = 42/87 (48%), Gaps = 6/87 (6%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLF 92
           I W   K+  ++  +++N F FK+V+       +   P P  VL +TP M   GFS ++F
Sbjct: 282 IRWTRQKIKDTY--SKHNAFDFKNVQKFER--SMIDAPGP-CVLFATPGMLSGGFSVEVF 336

Query: 93  FQWCSSPENSIIITNRTSPGTLARDLI 119
             W  S  N + +     PGT+   L+
Sbjct: 337 KHWAVSENNLVSLPGYCVPGTIGHKLM 363


>gi|21358523|ref|NP_651721.1| integrator 11 [Drosophila melanogaster]
 gi|7301822|gb|AAF56931.1| integrator 11 [Drosophila melanogaster]
 gi|16768852|gb|AAL28645.1| LD08814p [Drosophila melanogaster]
 gi|220943570|gb|ACL84328.1| CG1972-PA [synthetic construct]
 gi|220953494|gb|ACL89290.1| CG1972-PA [synthetic construct]
          Length = 597

 Score = 41.2 bits (95), Expect = 0.55,   Method: Compositional matrix adjust.
 Identities = 28/107 (26%), Positives = 53/107 (49%), Gaps = 10/107 (9%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLV-STPDM-ECGFSRDL 91
           I W + K+ K+F     N F FKH+K      + A + +P  ++V +TP M   G S  +
Sbjct: 283 ITWTNQKIRKTF--VHRNMFDFKHIKPF----DKAYIDNPGAMVVFATPGMLHAGLSLQI 336

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           F +W  +  N +I+      GT+   +  LGG + +  + ++ + ++
Sbjct: 337 FKKWAPNENNMVIMPGYCVQGTVGNKI--LGGAKKVEFENRQVVEVK 381


>gi|156100557|ref|XP_001616006.1| cleavage and polyadenylation specifity factor protein [Plasmodium
           vivax Sal-1]
 gi|148804880|gb|EDL46279.1| cleavage and polyadenylation specifity factor protein, putative
           [Plasmodium vivax]
          Length = 934

 Score = 41.2 bits (95), Expect = 0.55,   Method: Compositional matrix adjust.
 Identities = 25/72 (34%), Positives = 39/72 (54%), Gaps = 4/72 (5%)

Query: 51  NPFHFKHVKLCHSLAE----LAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIIT 106
           NPF+FK VK   SL      L +  +P VV+ S   ++ G S+++F       ++ +I+T
Sbjct: 355 NPFNFKFVKYAKSLDSILNYLYQDNNPCVVMASPGMLQNGISKNIFNIIAPDKKSGVILT 414

Query: 107 NRTSPGTLARDL 118
             T  GTLA +L
Sbjct: 415 GYTVKGTLADEL 426


>gi|70996586|ref|XP_753048.1| cleavage and polyadenylation specifity factor, 73 kDa subunit,
           putative [Aspergillus fumigatus Af293]
 gi|74672067|sp|Q4WRC2.1|YSH1_ASPFU RecName: Full=Endoribonuclease ysh1; AltName: Full=mRNA
           3'-end-processing protein ysh1
 gi|66850683|gb|EAL91010.1| cleavage and polyadenylation specifity factor, 73 kDa subunit,
           putative [Aspergillus fumigatus Af293]
 gi|159131784|gb|EDP56897.1| cleavage and polyadenylation specifity factor, 73 kDa subunit,
           putative [Aspergillus fumigatus A1163]
          Length = 872

 Score = 41.2 bits (95), Expect = 0.55,   Method: Compositional matrix adjust.
 Identities = 26/90 (28%), Positives = 45/90 (50%), Gaps = 9/90 (10%)

Query: 31  LAKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSR 89
           +A+ E   DK       A   P+ FK V+   SL     V     V++++P M + G SR
Sbjct: 319 MAEAEASGDK------SASAGPWDFKFVRSLRSLERFDDVGG--CVMLASPGMLQTGTSR 370

Query: 90  DLFFQWCSSPENSIIITNRTSPGTLARDLI 119
           +L  +W  +  N +++T  +  GT+A+ L+
Sbjct: 371 ELLERWAPNERNGVVMTGYSVEGTMAKQLL 400


>gi|195503187|ref|XP_002098546.1| GE23879 [Drosophila yakuba]
 gi|194184647|gb|EDW98258.1| GE23879 [Drosophila yakuba]
          Length = 597

 Score = 40.8 bits (94), Expect = 0.57,   Method: Compositional matrix adjust.
 Identities = 28/107 (26%), Positives = 53/107 (49%), Gaps = 10/107 (9%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLV-STPDM-ECGFSRDL 91
           I W + K+ K+F     N F FKH+K      + A + +P  ++V +TP M   G S  +
Sbjct: 283 ITWTNQKIRKTF--VHRNMFDFKHIKPF----DKAYIDNPGAMVVFATPGMLHAGLSLQI 336

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           F +W  +  N +I+      GT+   +  LGG + +  + ++ + ++
Sbjct: 337 FKKWAPNENNMVIMPGYCVQGTVGNKI--LGGAKKVEFENRQVVEVK 381


>gi|195445135|ref|XP_002070189.1| GK11920 [Drosophila willistoni]
 gi|194166274|gb|EDW81175.1| GK11920 [Drosophila willistoni]
          Length = 597

 Score = 40.8 bits (94), Expect = 0.58,   Method: Compositional matrix adjust.
 Identities = 28/107 (26%), Positives = 53/107 (49%), Gaps = 10/107 (9%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLV-STPDM-ECGFSRDL 91
           I W + K+ K+F     N F FKH+K      + A + +P  ++V +TP M   G S  +
Sbjct: 283 ITWTNQKIRKTF--VHRNMFDFKHIKPF----DKAYIDNPGAMVVFATPGMLHAGLSLQI 336

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           F +W  +  N +I+      GT+   +  LGG + +  + ++ + ++
Sbjct: 337 FKKWAPNENNMVIMPGYCVQGTVGNKI--LGGAKKVEFENRQVVEVK 381


>gi|195394529|ref|XP_002055895.1| GJ10637 [Drosophila virilis]
 gi|194142604|gb|EDW59007.1| GJ10637 [Drosophila virilis]
          Length = 597

 Score = 40.8 bits (94), Expect = 0.58,   Method: Compositional matrix adjust.
 Identities = 28/107 (26%), Positives = 53/107 (49%), Gaps = 10/107 (9%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLV-STPDM-ECGFSRDL 91
           I W + K+ K+F     N F FKH+K      + A + +P  ++V +TP M   G S  +
Sbjct: 283 ITWTNQKIRKTF--VHRNMFDFKHIKPF----DKAYIDNPGAMVVFATPGMLHAGLSLQI 336

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           F +W  +  N +I+      GT+   +  LGG + +  + ++ + ++
Sbjct: 337 FKKWAPNENNMVIMPGYCVQGTVGNKI--LGGAKKVEFENRQVVEVK 381


>gi|194765324|ref|XP_001964777.1| GF23370 [Drosophila ananassae]
 gi|190615049|gb|EDV30573.1| GF23370 [Drosophila ananassae]
          Length = 597

 Score = 40.8 bits (94), Expect = 0.59,   Method: Compositional matrix adjust.
 Identities = 28/107 (26%), Positives = 53/107 (49%), Gaps = 10/107 (9%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLV-STPDM-ECGFSRDL 91
           I W + K+ K+F     N F FKH+K      + A + +P  ++V +TP M   G S  +
Sbjct: 283 ITWTNQKIRKTF--VHRNMFDFKHIKPF----DKAYIDNPGAMVVFATPGMLHAGLSLQI 336

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           F +W  +  N +I+      GT+   +  LGG + +  + ++ + ++
Sbjct: 337 FKKWAPNENNMVIMPGYCVQGTVGNKI--LGGAKKVEFENRQVVEVK 381


>gi|195143691|ref|XP_002012831.1| GL23717 [Drosophila persimilis]
 gi|194101774|gb|EDW23817.1| GL23717 [Drosophila persimilis]
          Length = 597

 Score = 40.8 bits (94), Expect = 0.60,   Method: Compositional matrix adjust.
 Identities = 28/107 (26%), Positives = 53/107 (49%), Gaps = 10/107 (9%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLV-STPDM-ECGFSRDL 91
           I W + K+ K+F     N F FKH+K      + A + +P  ++V +TP M   G S  +
Sbjct: 283 ITWTNQKIRKTF--VHRNMFDFKHIKPF----DKAYIDNPGAMVVFATPGMLHAGLSLQI 336

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           F +W  +  N +I+      GT+   +  LGG + +  + ++ + ++
Sbjct: 337 FKKWAPNENNMVIMPGYCVQGTVGNKI--LGGAKKVEFENRQVVEVK 381


>gi|125773833|ref|XP_001358175.1| GA15164 [Drosophila pseudoobscura pseudoobscura]
 gi|54637910|gb|EAL27312.1| GA15164 [Drosophila pseudoobscura pseudoobscura]
          Length = 597

 Score = 40.8 bits (94), Expect = 0.60,   Method: Compositional matrix adjust.
 Identities = 28/107 (26%), Positives = 53/107 (49%), Gaps = 10/107 (9%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLV-STPDM-ECGFSRDL 91
           I W + K+ K+F     N F FKH+K      + A + +P  ++V +TP M   G S  +
Sbjct: 283 ITWTNQKIRKTF--VHRNMFDFKHIKPF----DKAYIDNPGAMVVFATPGMLHAGLSLQI 336

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           F +W  +  N +I+      GT+   +  LGG + +  + ++ + ++
Sbjct: 337 FKKWAPNENNMVIMPGYCVQGTVGNKI--LGGAKKVEFENRQVVEVK 381


>gi|406604299|emb|CCH44271.1| Cleavage and polyadenylation specificity factor subunit
           [Wickerhamomyces ciferrii]
          Length = 795

 Score = 40.8 bits (94), Expect = 0.61,   Method: Compositional matrix adjust.
 Identities = 32/115 (27%), Positives = 51/115 (44%), Gaps = 14/115 (12%)

Query: 5   LMKSFEGARNNPFHFKHVKLCHSLAELAKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSL 64
           LM S+  ARN         L ++ + L   EWM+  +++ +E     PF    V++    
Sbjct: 256 LMLSYTKARN---------LTYAGSML---EWMAPAVIREWETRGQPPFDSSRVQVIEP- 302

Query: 65  AELAKVPSPKVVLVSTPDMECG-FSRDLFFQWCSSPENSIIITNRTSPGTLARDL 118
            EL  +P  KVV  S    E G  ++      C   + +II+T +T   T+  DL
Sbjct: 303 NELLNMPGAKVVFASGAGFEDGSVAQAALTTLCDDEKTTIILTEKTVENTIGNDL 357


>gi|402590428|gb|EJW84358.1| RNA-metabolising metallo-beta-lactamase [Wuchereria bancrofti]
          Length = 579

 Score = 40.8 bits (94), Expect = 0.62,   Method: Compositional matrix adjust.
 Identities = 29/88 (32%), Positives = 45/88 (51%), Gaps = 8/88 (9%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPK-VVLVSTPDM-ECGFSRDL 91
           I W ++K+ ++F     N F FKH++      E + + SP  +VL STP M   G S  +
Sbjct: 255 ISWTNEKIKRTF--VERNMFDFKHIRPF----EQSYIDSPGPMVLFSTPGMLHGGQSLRV 308

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLI 119
           F +WCS  +N II+       T+   +I
Sbjct: 309 FTKWCSDEKNLIIMPGFCVANTVGAKVI 336


>gi|194906134|ref|XP_001981318.1| GG11690 [Drosophila erecta]
 gi|190655956|gb|EDV53188.1| GG11690 [Drosophila erecta]
          Length = 597

 Score = 40.8 bits (94), Expect = 0.62,   Method: Compositional matrix adjust.
 Identities = 28/107 (26%), Positives = 53/107 (49%), Gaps = 10/107 (9%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLV-STPDM-ECGFSRDL 91
           I W + K+ K+F     N F FKH+K      + A + +P  ++V +TP M   G S  +
Sbjct: 283 ITWTNQKIRKTF--VHRNMFDFKHIKPF----DKAYIDNPGAMVVFATPGMLHAGLSLQI 336

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           F +W  +  N +I+      GT+   +  LGG + +  + ++ + ++
Sbjct: 337 FKKWAPNENNMVIMPGYCVQGTVGNKI--LGGAKKVEFENRQVVEVK 381


>gi|378756880|gb|EHY66904.1| cleavage and polyadenylation specificity factor subunit 3
           [Nematocida sp. 1 ERTm2]
          Length = 501

 Score = 40.8 bits (94), Expect = 0.62,   Method: Compositional matrix adjust.
 Identities = 34/122 (27%), Positives = 53/122 (43%), Gaps = 18/122 (14%)

Query: 18  HFKHVKLCHSLAELAKIEWMSDKLMKSF-----EGARN-----NPFHFKHVKLCHSLAEL 67
           H++  KL   +   A +   ++ + K F     E  RN     N F F+HVK   S   L
Sbjct: 255 HWEKSKLSIPIYTSATLTHKANDIYKQFIDYTHEHIRNTMHKRNLFDFQHVKQFDS--NL 312

Query: 68  AKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDL------IEL 121
           A +  P ++  S   +  G S  +F +WC  P+N +I       GT+   +      IE+
Sbjct: 313 ASLEGPMILFSSPGMLHSGPSLSIFKKWCGDPKNMVIFPGYCVRGTIGERVLNGASQIEV 372

Query: 122 GG 123
           GG
Sbjct: 373 GG 374


>gi|91086147|ref|XP_969343.1| PREDICTED: similar to CG1972 CG1972-PA [Tribolium castaneum]
 gi|270009886|gb|EFA06334.1| hypothetical protein TcasGA2_TC009205 [Tribolium castaneum]
          Length = 595

 Score = 40.8 bits (94), Expect = 0.63,   Method: Compositional matrix adjust.
 Identities = 28/111 (25%), Positives = 46/111 (41%), Gaps = 14/111 (12%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K           P P VV  +   +  G S  +F 
Sbjct: 283 ITWTNQKIRKTF--VQRNMFDFKHIKPFDR--SYIDNPGPMVVFATPGMLHAGLSLQIFK 338

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELEE 144
           +W  +  N +I+      GT+   +          L   KR+  E +++ E
Sbjct: 339 KWAPNENNMVIMPGFCVQGTVGHKI----------LNGAKRVEFENKQIVE 379


>gi|387594235|gb|EIJ89259.1| integrator complex subunit 11 [Nematocida parisii ERTm3]
 gi|387594982|gb|EIJ92609.1| integrator complex subunit 11 [Nematocida parisii ERTm1]
          Length = 502

 Score = 40.8 bits (94), Expect = 0.63,   Method: Compositional matrix adjust.
 Identities = 29/113 (25%), Positives = 49/113 (43%), Gaps = 12/113 (10%)

Query: 19  FKHVKLCHSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVV 76
           +    L H   ++ K  I++  + +  +    + N F F+HVK   S   LA +  P ++
Sbjct: 266 YTSATLTHKANDIYKQFIDYTHEHIRSTLH--KRNLFDFRHVKQFDS--NLASLEGPMIL 321

Query: 77  LVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDL------IELGG 123
             S   +  G S  +F +WC  P N +I       GT+   +      IE+GG
Sbjct: 322 FSSPGMLHSGPSLSIFKKWCGDPNNMVIFPGYCVRGTIGERVLNGASQIEVGG 374


>gi|328766828|gb|EGF76880.1| hypothetical protein BATDEDRAFT_14507, partial [Batrachochytrium
           dendrobatidis JAM81]
          Length = 475

 Score = 40.8 bits (94), Expect = 0.66,   Method: Compositional matrix adjust.
 Identities = 27/98 (27%), Positives = 46/98 (46%), Gaps = 10/98 (10%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVK-LCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDL 91
           I W ++ +  +      N F F H++   HS A+        +VL +TP M   G S ++
Sbjct: 295 ISWTNENVKSAL--VERNMFDFAHIRSWSHSFAD----EPGAMVLFATPGMLHAGTSLEV 348

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTL 129
           F +WC  P+N II+      GT+   +  L G + + +
Sbjct: 349 FKKWCHDPKNMIIMPGYCVAGTVGAKV--LAGEKVINV 384


>gi|390602470|gb|EIN11863.1| Metallo-hydrolase/oxidoreductase, partial [Punctularia
           strigosozonata HHB-11173 SS5]
          Length = 721

 Score = 40.8 bits (94), Expect = 0.67,   Method: Compositional matrix adjust.
 Identities = 27/77 (35%), Positives = 41/77 (53%), Gaps = 9/77 (11%)

Query: 49  RNNPFHFKHVKLCHSLA------ELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENS 102
           R+NPF FKH+   H+        +LA+ P P V+L S   ++ G SR+L         N 
Sbjct: 298 RDNPFVFKHI--SHAPQNRGWERKLAEGP-PCVILASPGMLQSGPSRELLELLAPDSRNG 354

Query: 103 IIITNRTSPGTLARDLI 119
           +++T  +  GT ARD+I
Sbjct: 355 LVLTGYSVEGTPARDII 371


>gi|195062087|ref|XP_001996130.1| GH14325 [Drosophila grimshawi]
 gi|193891922|gb|EDV90788.1| GH14325 [Drosophila grimshawi]
          Length = 597

 Score = 40.8 bits (94), Expect = 0.67,   Method: Compositional matrix adjust.
 Identities = 28/107 (26%), Positives = 53/107 (49%), Gaps = 10/107 (9%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLV-STPDM-ECGFSRDL 91
           I W + K+ K+F     N F FKH+K      + A + +P  ++V +TP M   G S  +
Sbjct: 283 ITWTNQKIRKTF--VHRNMFDFKHIKPF----DKAYIDNPGAMVVFATPGMLHAGLSLQI 336

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           F +W  +  N +I+      GT+   +  LGG + +  + ++ + ++
Sbjct: 337 FKKWAPNENNMVIMPGYCVQGTVGNKI--LGGAKKVEFENRQVVEVK 381


>gi|295659367|ref|XP_002790242.1| cleavage and polyadenylation specific factor 2 [Paracoccidioides
           sp. 'lutzii' Pb01]
 gi|226281947|gb|EEH37513.1| cleavage and polyadenylation specific factor 2 [Paracoccidioides
           sp. 'lutzii' Pb01]
          Length = 999

 Score = 40.4 bits (93), Expect = 0.78,   Method: Compositional matrix adjust.
 Identities = 24/79 (30%), Positives = 42/79 (53%), Gaps = 4/79 (5%)

Query: 51  NPFHFKHVKLCHSLAELAKV---PSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITN 107
            PF F+H+K+     +L K+    +P+V+L S   +E G+S+ +  +  +  EN II+T 
Sbjct: 401 GPFTFRHLKIVERKTKLDKILGSNAPQVILTSDTSLEWGYSKHVLQKIAAGSENLIILTE 460

Query: 108 RTSPGTLARDLIELGGNRT 126
              P +      E GG+R+
Sbjct: 461 -PYPISANAQSTEDGGSRS 478


>gi|401827835|ref|XP_003888210.1| putative RNA-processing beta-lactamase-fold exonuclease
           [Encephalitozoon hellem ATCC 50504]
 gi|392999410|gb|AFM99229.1| putative RNA-processing beta-lactamase-fold exonuclease
           [Encephalitozoon hellem ATCC 50504]
          Length = 496

 Score = 40.4 bits (93), Expect = 0.78,   Method: Compositional matrix adjust.
 Identities = 29/93 (31%), Positives = 45/93 (48%), Gaps = 5/93 (5%)

Query: 51  NPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFFQWCSSPENSIIITNRT 109
           N F +KH+K         + P   +VL ++P M   G S  +F +WCS  +N +II    
Sbjct: 293 NVFEYKHIKPFQKYYMDNEGP---MVLFASPGMLHSGMSLRMFKEWCSDEKNLVIIPGYC 349

Query: 110 SPGTLARDLIELGGNRTLTLQVKKRIRLEGEEL 142
             GT+   ++  G  R   L  +K IR+E + L
Sbjct: 350 VRGTIGEKVLN-GAKRLEILGEEKDIRIEIKNL 381


>gi|219121689|ref|XP_002181194.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217407180|gb|EEC47117.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 602

 Score = 40.4 bits (93), Expect = 0.79,   Method: Compositional matrix adjust.
 Identities = 26/91 (28%), Positives = 43/91 (47%), Gaps = 10/91 (10%)

Query: 50  NNPFHFKHVKLCHSL-AELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 108
           +NPF F+ ++   S+         P VV  S   ++ G SR LF +W S  +N ++I   
Sbjct: 296 SNPFRFRFIQNLKSIDVNSFDDSGPSVVFASPGMLQSGVSRQLFDRWASDHKNGVLIAGY 355

Query: 109 TSPGTLARDLIELGGNRTLTLQVKKRIRLEG 139
               TLA++++          Q K+ + LEG
Sbjct: 356 AVEHTLAKEIMA---------QPKEVVTLEG 377


>gi|358333178|dbj|GAA51732.1| integrator complex subunit 11 [Clonorchis sinensis]
          Length = 649

 Score = 40.4 bits (93), Expect = 0.80,   Method: Compositional matrix adjust.
 Identities = 23/87 (26%), Positives = 40/87 (45%), Gaps = 6/87 (6%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAE-LAKVPSPKVVLVSTPDMECGFSRDLF 92
           I W + K+ ++F   + N F FKH+K    L + +   P P VV  +   +  G S  +F
Sbjct: 352 ISWTNQKIKETF--VKRNMFEFKHIK---PLGQGIVDNPGPMVVFATPGMLHAGQSLHIF 406

Query: 93  FQWCSSPENSIIITNRTSPGTLARDLI 119
            +W     N ++I      GT+   ++
Sbjct: 407 RKWAPDERNMVVIPGYCVAGTVGYKIL 433


>gi|119195099|ref|XP_001248153.1| hypothetical protein CIMG_01924 [Coccidioides immitis RS]
          Length = 1015

 Score = 40.4 bits (93), Expect = 0.81,   Method: Compositional matrix adjust.
 Identities = 27/130 (20%), Positives = 49/130 (37%), Gaps = 42/130 (32%)

Query: 23  KLCHSLAELAK--IEWMSDKLMKSFEGARNN----------------------------- 51
           K  H    LA+  +EWM + +++ FEG                                 
Sbjct: 330 KKVHGTMRLARSMLEWMDESIVREFEGGDGGESLGAGRSSGAASGQQSKGTPGQTSDKKS 389

Query: 52  --------PFHFKHVKLCH---SLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPE 100
                   PF F+H+K+      L  + +   PKV++ S   ++ GFS+++        E
Sbjct: 390 AGPHKGLGPFTFRHLKIIERKTKLENILRSEGPKVIIASDTSLDWGFSKEILRHVAQGAE 449

Query: 101 NSIIITNRTS 110
           N +I+T  ++
Sbjct: 450 NLVILTESSA 459


>gi|312083284|ref|XP_003143797.1| RNA-metabolising metallo-beta-lactamase [Loa loa]
 gi|307761039|gb|EFO20273.1| RNA-metabolising metallo-beta-lactamase [Loa loa]
          Length = 644

 Score = 40.4 bits (93), Expect = 0.82,   Method: Compositional matrix adjust.
 Identities = 28/87 (32%), Positives = 42/87 (48%), Gaps = 6/87 (6%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLF 92
           I W ++K+ ++F     N F FKH++           P P +VL STP M   G S  +F
Sbjct: 339 ISWTNEKIKRTF--VERNMFDFKHIRPFEQ--SYTDSPGP-MVLFSTPGMLHGGQSLRVF 393

Query: 93  FQWCSSPENSIIITNRTSPGTLARDLI 119
            +WCS  +N II+       T+   +I
Sbjct: 394 TKWCSDEKNLIIMPGFCVANTVGAKVI 420


>gi|373495498|ref|ZP_09586082.1| hypothetical protein HMPREF0380_01720 [Eubacterium infirmum F0142]
 gi|371964681|gb|EHO82189.1| hypothetical protein HMPREF0380_01720 [Eubacterium infirmum F0142]
          Length = 556

 Score = 40.4 bits (93), Expect = 0.83,   Method: Compositional matrix adjust.
 Identities = 33/99 (33%), Positives = 46/99 (46%), Gaps = 11/99 (11%)

Query: 50  NNPFHFKHVKLCHSLAELAKV---PSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIIT 106
           +NP  FK++K   S  E   +     PKV++ ++   E G  R        +P+NSI+  
Sbjct: 312 DNPLEFKNLKFTRSSKESQDLNFNKEPKVIISASGMCEAGRIRHHLKHNLWNPKNSIVFV 371

Query: 107 NRTSPGTLARDLIELGGNRTLTL-----QVKKRIR-LEG 139
                GTL R LIE  G + +TL     QV   I  LEG
Sbjct: 372 GYQGQGTLGRSLIE--GIKMVTLFGEEIQVNAEIHNLEG 408


>gi|398406895|ref|XP_003854913.1| hypothetical protein MYCGRDRAFT_55193, partial [Zymoseptoria
           tritici IPO323]
 gi|339474797|gb|EGP89889.1| hypothetical protein MYCGRDRAFT_55193 [Zymoseptoria tritici IPO323]
          Length = 855

 Score = 40.4 bits (93), Expect = 0.83,   Method: Compositional matrix adjust.
 Identities = 24/90 (26%), Positives = 42/90 (46%), Gaps = 8/90 (8%)

Query: 52  PFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSP 111
           P+ F  ++   S+     V  P VVL S   ++ G SR L  +W    +N +IIT  +  
Sbjct: 339 PWDFNFIRQLKSMDRYEDV-GPSVVLASPGMLQNGPSRTLLERWAPDAKNGVIITGYSVE 397

Query: 112 GTLARDLI-------ELGGNRTLTLQVKKR 134
           GT+A+ ++        + G +   + + KR
Sbjct: 398 GTMAKTIMTEPDSIPAVTGKKNTGMNIGKR 427


>gi|407919362|gb|EKG12612.1| Beta-lactamase-like protein [Macrophomina phaseolina MS6]
          Length = 842

 Score = 40.4 bits (93), Expect = 0.86,   Method: Compositional matrix adjust.
 Identities = 22/77 (28%), Positives = 41/77 (53%), Gaps = 1/77 (1%)

Query: 43  KSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENS 102
           ++ + ++  P+ FK+V+   SL     V S  V+L S   M+ G SR+L  +W     N 
Sbjct: 323 RNGDASKAGPWDFKYVRSLKSLERFDDVGS-CVMLASPGMMQNGVSRELLERWAPDQRNG 381

Query: 103 IIITNRTSPGTLARDLI 119
           +I+T  +  GT+ + ++
Sbjct: 382 VIMTGYSVEGTMGKMIL 398


>gi|115397403|ref|XP_001214293.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
 gi|114192484|gb|EAU34184.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
          Length = 870

 Score = 40.4 bits (93), Expect = 0.86,   Method: Compositional matrix adjust.
 Identities = 26/90 (28%), Positives = 45/90 (50%), Gaps = 9/90 (10%)

Query: 31  LAKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSR 89
           +A+ E   DK       A   P+ F+ V+   SL     V     V++++P M + G SR
Sbjct: 319 MAEAEASGDK------NASAGPWDFRFVRSLRSLERFDDVGG--CVMLASPGMLQSGTSR 370

Query: 90  DLFFQWCSSPENSIIITNRTSPGTLARDLI 119
           +L  +W  +  N +I+T  +  GT+A+ L+
Sbjct: 371 ELLERWAPNERNGVIMTGYSVEGTMAKQLL 400


>gi|255957115|ref|XP_002569310.1| Pc21g23430 [Penicillium chrysogenum Wisconsin 54-1255]
 gi|211591021|emb|CAP97240.1| Pc21g23430 [Penicillium chrysogenum Wisconsin 54-1255]
          Length = 862

 Score = 40.4 bits (93), Expect = 0.87,   Method: Compositional matrix adjust.
 Identities = 21/70 (30%), Positives = 38/70 (54%), Gaps = 3/70 (4%)

Query: 51  NPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFFQWCSSPENSIIITNRT 109
            P+ F+ V+   SL     V     V++++P M + G SR+L  +W  S  N +++T  +
Sbjct: 333 GPWDFRFVRSLRSLERFDDVGG--CVMLASPGMLQTGTSRELLERWAPSDRNGVVMTGYS 390

Query: 110 SPGTLARDLI 119
             GT+A+ L+
Sbjct: 391 VEGTMAKGLL 400


>gi|392862603|gb|EAS36741.2| cleavage and polyadenylylation specificity factor [Coccidioides
           immitis RS]
          Length = 1026

 Score = 40.4 bits (93), Expect = 0.90,   Method: Compositional matrix adjust.
 Identities = 27/130 (20%), Positives = 49/130 (37%), Gaps = 42/130 (32%)

Query: 23  KLCHSLAELAK--IEWMSDKLMKSFEGARNN----------------------------- 51
           K  H    LA+  +EWM + +++ FEG                                 
Sbjct: 341 KKVHGTMRLARSMLEWMDESIVREFEGGDGGESLGAGRSSGAASGQQSKGTPGQTSDKKS 400

Query: 52  --------PFHFKHVKLCH---SLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPE 100
                   PF F+H+K+      L  + +   PKV++ S   ++ GFS+++        E
Sbjct: 401 AGPHKGLGPFTFRHLKIIERKTKLENILRSEGPKVIIASDTSLDWGFSKEILRHVAQGAE 460

Query: 101 NSIIITNRTS 110
           N +I+T  ++
Sbjct: 461 NLVILTESSA 470


>gi|326435554|gb|EGD81124.1| integrator complex subunit 11 [Salpingoeca sp. ATCC 50818]
          Length = 620

 Score = 40.0 bits (92), Expect = 0.98,   Method: Compositional matrix adjust.
 Identities = 26/109 (23%), Positives = 47/109 (43%), Gaps = 5/109 (4%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           + + + K+  +F     N F FKH++        A  P P+V+  +   +  G + ++F 
Sbjct: 316 VMYTNQKIKDTF--VDRNLFDFKHIRAFDR--SYADQPGPQVLFATPGMLHAGVALEVFA 371

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEEL 142
           +W   P N +I+      GT+   +I  G       Q    +RL+ E L
Sbjct: 372 KWAGDPRNMVILPGYCVAGTVGAQVIA-GKKEIRVGQQMVNVRLQVEYL 419


>gi|193608339|ref|XP_001949326.1| PREDICTED: integrator complex subunit 11-like isoform 1
           [Acyrthosiphon pisum]
 gi|328710634|ref|XP_003244318.1| PREDICTED: integrator complex subunit 11-like isoform 2
           [Acyrthosiphon pisum]
          Length = 603

 Score = 40.0 bits (92), Expect = 0.98,   Method: Compositional matrix adjust.
 Identities = 26/105 (24%), Positives = 48/105 (45%), Gaps = 6/105 (5%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ ++F   + N F FKH+K           P P VV  +   +  G S ++F 
Sbjct: 286 ITWTNQKVRQTF--VQRNMFDFKHIKPFDK--TYMHNPGPMVVFATPGMLHAGLSLNIFK 341

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           +W    +N +I+      GT+   +  L G++ +  +  K I ++
Sbjct: 342 KWAPDEKNMLIVPGYCVSGTVGNKV--LSGSKKIEAEPNKFIDVK 384


>gi|195112455|ref|XP_002000788.1| GI10422 [Drosophila mojavensis]
 gi|193917382|gb|EDW16249.1| GI10422 [Drosophila mojavensis]
          Length = 597

 Score = 40.0 bits (92), Expect = 0.99,   Method: Compositional matrix adjust.
 Identities = 29/107 (27%), Positives = 53/107 (49%), Gaps = 10/107 (9%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLV-STPDM-ECGFSRDL 91
           I W + K+ K+F     N F FKH+K      + A + +P  ++V +TP M   G S  +
Sbjct: 283 ITWTNQKIRKTF--VHRNMFDFKHIKPF----DKAYIDNPGAMVVFATPGMLHAGLSLQI 336

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           F +W  +  N +I+      GT+   +  LGG + +  + ++ I ++
Sbjct: 337 FKKWAPNENNMVIMPGYCVQGTVGNKI--LGGAKKVECENRQIIEVK 381


>gi|146305000|ref|YP_001192316.1| beta-lactamase domain-containing protein [Metallosphaera sedula DSM
           5348]
 gi|145703250|gb|ABP96392.1| beta-lactamase domain protein [Metallosphaera sedula DSM 5348]
          Length = 642

 Score = 40.0 bits (92), Expect = 0.99,   Method: Compositional matrix adjust.
 Identities = 25/98 (25%), Positives = 41/98 (41%), Gaps = 4/98 (4%)

Query: 19  FKHVKLCHSLAELAKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLV 78
           F  V   H+    A  EW+  ++  S      NPF  +  K      E      P ++L 
Sbjct: 445 FDEVTAIHT----AYPEWLGKEVRDSILFRDENPFTSEFFKRIEGYREDIAEGEPSIILA 500

Query: 79  STPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLAR 116
           ++  +  G + + F Q    P+NS+I  +  + GTL R
Sbjct: 501 TSGMLNGGPAVEFFKQLAPDPKNSLIFVSYQAEGTLGR 538


>gi|425780830|gb|EKV18826.1| Endoribonuclease ysh1 [Penicillium digitatum PHI26]
 gi|425783067|gb|EKV20936.1| Endoribonuclease ysh1 [Penicillium digitatum Pd1]
          Length = 862

 Score = 40.0 bits (92), Expect = 1.00,   Method: Compositional matrix adjust.
 Identities = 21/70 (30%), Positives = 38/70 (54%), Gaps = 3/70 (4%)

Query: 51  NPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFFQWCSSPENSIIITNRT 109
            P+ F+ V+   SL     V     V++++P M + G SR+L  +W  S  N +++T  +
Sbjct: 333 GPWDFRFVRSLRSLERFDDVGG--CVMLASPGMLQTGTSRELLERWAPSDRNGVVMTGYS 390

Query: 110 SPGTLARDLI 119
             GT+A+ L+
Sbjct: 391 VEGTMAKGLL 400


>gi|303391170|ref|XP_003073815.1| putative beta-lactamase fold-containing exonuclease
           [Encephalitozoon intestinalis ATCC 50506]
 gi|303302963|gb|ADM12455.1| putative beta-lactamase fold-containing exonuclease
           [Encephalitozoon intestinalis ATCC 50506]
          Length = 496

 Score = 40.0 bits (92), Expect = 1.00,   Method: Compositional matrix adjust.
 Identities = 33/116 (28%), Positives = 52/116 (44%), Gaps = 15/116 (12%)

Query: 38  SDKLMKSFEGARN----------NPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECG 86
           ++++ K F G  N          N F +KH+K         K P   +VL ++P M   G
Sbjct: 270 ANEIYKKFIGYTNETVKRKIFERNVFEYKHIKPFQRYYMDNKGP---MVLFASPGMLHSG 326

Query: 87  FSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEEL 142
            S  +F +WC   +N +II      GT+   ++  G  R   L  +K I+LE + L
Sbjct: 327 MSLRIFKEWCEDEKNLVIIPGYCVRGTIGEKILN-GAKRLEILGEEKDIKLEIKNL 381


>gi|284174985|ref|ZP_06388954.1| mRNA 3'-end processing factor, putative [Sulfolobus solfataricus
           98/2]
          Length = 638

 Score = 40.0 bits (92), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 29/122 (23%), Positives = 50/122 (40%), Gaps = 6/122 (4%)

Query: 35  EWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQ 94
           EW+  ++ +       NPF  +H K      E      P ++L ++  +  G + + F  
Sbjct: 461 EWLGREVREEILYKDENPFTSEHFKRIEGYKEDIAKGEPSIILATSGMLNGGPAVEFFKT 520

Query: 95  WCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKR-----IRLEGEELEEYQKKK 149
               P+N+II  +  + GTL R + + G      L    R     I +E E +E +    
Sbjct: 521 MAPDPKNAIIFVSYQAEGTLGRKVRD-GAKEVQILDRDGRVESIQINMEVEAVEGFSGHS 579

Query: 150 DK 151
           DK
Sbjct: 580 DK 581


>gi|308271258|emb|CBX27867.1| hypothetical protein N47_C19250 [uncultured Desulfobacterium sp.]
          Length = 522

 Score = 40.0 bits (92), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 26/91 (28%), Positives = 45/91 (49%), Gaps = 3/91 (3%)

Query: 35  EWMSDKLMKSFEGARNNPFHFKHVKLCHSLAE---LAKVPSPKVVLVSTPDMECGFSRDL 91
           E  SD+ +K+F    +NPF FK +K   S+ E   L +  +P +VL ++   E G     
Sbjct: 339 EVYSDETVKAFLSIGDNPFAFKELKFVASVDESMVLMRDNTPHIVLAASGMCEAGRVLHH 398

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLIELG 122
                 +P+N+I++    +  TL R + + G
Sbjct: 399 LRYKMHNPKNTILLVGFMAQNTLGRRIHDQG 429


>gi|15897661|ref|NP_342266.1| mRNA 3'-end processing factor [Sulfolobus solfataricus P2]
 gi|6015715|emb|CAB57542.1| mRNA 3'-end polyadenylation factor [Sulfolobus solfataricus P2]
 gi|13813930|gb|AAK41056.1| mRNA 3'-end processing factor, putative [Sulfolobus solfataricus
           P2]
          Length = 639

 Score = 40.0 bits (92), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 29/122 (23%), Positives = 50/122 (40%), Gaps = 6/122 (4%)

Query: 35  EWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQ 94
           EW+  ++ +       NPF  +H K      E      P ++L ++  +  G + + F  
Sbjct: 462 EWLGREVREEILYKDENPFTSEHFKRIEGYKEDIAKGEPSIILATSGMLNGGPAVEFFKT 521

Query: 95  WCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKR-----IRLEGEELEEYQKKK 149
               P+N+II  +  + GTL R + + G      L    R     I +E E +E +    
Sbjct: 522 MAPDPKNAIIFVSYQAEGTLGRKVRD-GAKEVQILDRDGRVESIQINMEVEAVEGFSGHS 580

Query: 150 DK 151
           DK
Sbjct: 581 DK 582


>gi|312381513|gb|EFR27247.1| hypothetical protein AND_06171 [Anopheles darlingi]
          Length = 624

 Score = 40.0 bits (92), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 27/107 (25%), Positives = 53/107 (49%), Gaps = 10/107 (9%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLV-STPDM-ECGFSRDL 91
           I W + K+ K+F   + N F FKH+K      +   + +P  ++V +TP M   G S  +
Sbjct: 283 ITWTNQKIRKTF--VQRNMFDFKHIKPF----DKGYIDNPGAMVVFATPGMLHAGLSLQI 336

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           F +W  +  N +I+      GT+   +  LGG + +  + ++ + ++
Sbjct: 337 FKKWAPNENNMVIMPGYCVQGTVGHKI--LGGAKKVEFENRQVVEVK 381


>gi|157107341|ref|XP_001649735.1| cleavage and polyadenylation specificity factor [Aedes aegypti]
 gi|108879612|gb|EAT43837.1| AAEL004757-PA [Aedes aegypti]
          Length = 613

 Score = 40.0 bits (92), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 27/107 (25%), Positives = 53/107 (49%), Gaps = 10/107 (9%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLV-STPDM-ECGFSRDL 91
           I W + K+ K+F   + N F FKH+K      +   + +P  ++V +TP M   G S  +
Sbjct: 283 ITWTNQKIRKTF--VQRNMFDFKHIKPF----DKGYIDNPGAMVVFATPGMLHAGLSLQI 336

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           F +W  +  N +I+      GT+   +  LGG + +  + ++ + ++
Sbjct: 337 FKKWAPNENNMVIMPGYCVQGTVGHKI--LGGAKKVEFENRQVVEVK 381


>gi|384434275|ref|YP_005643633.1| KH-domain/beta-lactamase-domain-containing protein [Sulfolobus
           solfataricus 98/2]
 gi|261602429|gb|ACX92032.1| KH-domain/beta-lactamase-domain protein [Sulfolobus solfataricus
           98/2]
          Length = 635

 Score = 40.0 bits (92), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 29/122 (23%), Positives = 50/122 (40%), Gaps = 6/122 (4%)

Query: 35  EWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQ 94
           EW+  ++ +       NPF  +H K      E      P ++L ++  +  G + + F  
Sbjct: 458 EWLGREVREEILYKDENPFTSEHFKRIEGYKEDIAKGEPSIILATSGMLNGGPAVEFFKT 517

Query: 95  WCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKR-----IRLEGEELEEYQKKK 149
               P+N+II  +  + GTL R + + G      L    R     I +E E +E +    
Sbjct: 518 MAPDPKNAIIFVSYQAEGTLGRKVRD-GAKEVQILDRDGRVESIQINMEVEAVEGFSGHS 576

Query: 150 DK 151
           DK
Sbjct: 577 DK 578


>gi|121700651|ref|XP_001268590.1| cleavage and polyadenylation specifity factor, 73 kDa subunit,
           putative [Aspergillus clavatus NRRL 1]
 gi|119396733|gb|EAW07164.1| cleavage and polyadenylation specifity factor, 73 kDa subunit,
           putative [Aspergillus clavatus NRRL 1]
          Length = 878

 Score = 40.0 bits (92), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 25/90 (27%), Positives = 45/90 (50%), Gaps = 9/90 (10%)

Query: 31  LAKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSR 89
           +A+ E   DK       A   P+ F+ V+   SL     V     V++++P M + G SR
Sbjct: 318 MAEAEASGDK------SASAGPWDFRFVRSLRSLERFDDVGG--CVMLASPGMLQTGTSR 369

Query: 90  DLFFQWCSSPENSIIITNRTSPGTLARDLI 119
           +L  +W  +  N +++T  +  GT+A+ L+
Sbjct: 370 ELLERWAPNERNGVVMTGYSVEGTMAKQLL 399


>gi|16082538|ref|NP_394086.1| metal-dependent RNase [Thermoplasma acidophilum DSM 1728]
          Length = 639

 Score = 40.0 bits (92), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 22/90 (24%), Positives = 45/90 (50%), Gaps = 3/90 (3%)

Query: 32  AKIEWMSDKLMKSFEGARNNPFH---FKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFS 88
           A  E+++ +L ++    + NPF    FK V+      E+A+ P   +VL ++  M  G  
Sbjct: 460 AYPEYLNKELREAIMVKKENPFLSPIFKKVETRDQREEIAENPETTIVLATSGMMNGGPV 519

Query: 89  RDLFFQWCSSPENSIIITNRTSPGTLARDL 118
            + F  W + P+++++     + GTL + +
Sbjct: 520 MEYFKAWSADPKHTLVFVGYQADGTLGKKI 549


>gi|449686516|ref|XP_002164154.2| PREDICTED: integrator complex subunit 11-like [Hydra
           magnipapillata]
          Length = 526

 Score = 40.0 bits (92), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 26/105 (24%), Positives = 45/105 (42%), Gaps = 6/105 (5%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+  +F   + N F FKH+K           P P VV  +   +  G S  +F 
Sbjct: 194 ITWTNQKIRNTF--VQRNLFDFKHIKAFDR--SYIDNPGPMVVFATPGMLHAGLSLQIFK 249

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           +W     N++I+      GT+   +  L G + + L  K  + ++
Sbjct: 250 KWAPDENNTLIMPGYCVIGTVGHKV--LSGQKKIELDKKTTVDVK 292


>gi|307170840|gb|EFN62951.1| Integrator complex subunit 11 [Camponotus floridanus]
          Length = 595

 Score = 40.0 bits (92), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 28/107 (26%), Positives = 53/107 (49%), Gaps = 10/107 (9%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLV-STPDM-ECGFSRDL 91
           I W + K+ K+F   + N F FKH+K      + A + +P  ++V +TP M   G S  +
Sbjct: 283 ITWTNQKIKKTF--VQRNMFEFKHIKPF----DKAYIDNPGAMVVFATPGMLHAGLSLQI 336

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           F +W  +  N +I+      GT+   +  L G R +  + ++ + ++
Sbjct: 337 FKKWAPNEANMVIMPGFCVQGTVGHKV--LNGTRRIEFENRQIVEVK 381


>gi|170052069|ref|XP_001862054.1| cleavage and polyadenylation specificity factor subunit 3 [Culex
           quinquefasciatus]
 gi|167873079|gb|EDS36462.1| cleavage and polyadenylation specificity factor subunit 3 [Culex
           quinquefasciatus]
          Length = 615

 Score = 40.0 bits (92), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 27/107 (25%), Positives = 53/107 (49%), Gaps = 10/107 (9%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLV-STPDM-ECGFSRDL 91
           I W + K+ K+F   + N F FKH+K      +   + +P  ++V +TP M   G S  +
Sbjct: 283 ITWTNQKIRKTF--VQRNMFDFKHIKPF----DKGYIDNPGAMVVFATPGMLHAGLSLQI 336

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           F +W  +  N +I+      GT+   +  LGG + +  + ++ + ++
Sbjct: 337 FKKWAPNENNMVIMPGYCVQGTVGHKI--LGGAKKVEFENRQVVEVK 381


>gi|258578481|ref|XP_002543422.1| predicted protein [Uncinocarpus reesii 1704]
 gi|237903688|gb|EEP78089.1| predicted protein [Uncinocarpus reesii 1704]
          Length = 875

 Score = 40.0 bits (92), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 23/86 (26%), Positives = 43/86 (50%), Gaps = 3/86 (3%)

Query: 35  EWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFF 93
           E M++   K  +     P+ FK V+   +L     V     V++++P M + G SR+L  
Sbjct: 317 ERMAEAEAKGDKSTTAGPWDFKFVRSVRNLERFDDVGG--CVMLASPGMLQTGTSRELLE 374

Query: 94  QWCSSPENSIIITNRTSPGTLARDLI 119
           +W  S  N +I+T  +  GT+ + ++
Sbjct: 375 RWAPSERNGVIMTGYSVEGTMGKQIL 400


>gi|19074744|ref|NP_586250.1| similarity to HYPOTHETICAL PROTEIN YO47_METJA [Encephalitozoon
           cuniculi GB-M1]
 gi|19069386|emb|CAD25854.1| similarity to HYPOTHETICAL PROTEIN YO47_METJA [Encephalitozoon
           cuniculi GB-M1]
 gi|449329879|gb|AGE96147.1| hypothetical protein ECU10_1350 [Encephalitozoon cuniculi]
          Length = 496

 Score = 39.7 bits (91), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 28/93 (30%), Positives = 44/93 (47%), Gaps = 5/93 (5%)

Query: 51  NPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFFQWCSSPENSIIITNRT 109
           N F +KH+K        +K P   +VL ++P M   G S  +F +WC   +N +II    
Sbjct: 293 NMFEYKHIKPFQRHYMESKGP---MVLFASPGMLHSGMSLKIFKEWCEDEKNLVIIPGYC 349

Query: 110 SPGTLARDLIELGGNRTLTLQVKKRIRLEGEEL 142
             GT+   ++  G  +   L   + IRLE + L
Sbjct: 350 VRGTIGEKVLN-GATKLEILGESREIRLEVKNL 381


>gi|383859336|ref|XP_003705151.1| PREDICTED: integrator complex subunit 11-like isoform 1 [Megachile
           rotundata]
          Length = 595

 Score = 39.7 bits (91), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 28/107 (26%), Positives = 54/107 (50%), Gaps = 10/107 (9%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLV-STPDM-ECGFSRDL 91
           I W + K+ K+F   + N F FKH+K      + A + +P  ++V +TP M   G S  +
Sbjct: 283 ITWTNQKIKKTF--VQRNMFDFKHIKPF----DKAYIDNPGAMVVFATPGMLHAGLSLQI 336

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           F +W  +  N +I+      GT+   +  L G+R +  + ++ + ++
Sbjct: 337 FKKWAPNEANMVIMPGFCVQGTVGHKV--LNGSRRIEFENRQIVEVK 381


>gi|328776642|ref|XP_003249190.1| PREDICTED: integrator complex subunit 11-like [Apis mellifera]
          Length = 603

 Score = 39.7 bits (91), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 28/107 (26%), Positives = 54/107 (50%), Gaps = 10/107 (9%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLV-STPDM-ECGFSRDL 91
           I W + K+ K+F   + N F FKH+K      + A + +P  ++V +TP M   G S  +
Sbjct: 283 ITWTNQKIKKTF--VQRNMFDFKHIKPF----DKAYIDNPGAMVVFATPGMLHAGLSLQI 336

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           F +W  +  N +I+      GT+   +  L G+R +  + ++ + ++
Sbjct: 337 FKKWAPNEANMVIMPGFCVQGTVGHKV--LNGSRRIEFENRQIVEVK 381


>gi|294656507|ref|XP_002770276.1| DEHA2D07304p [Debaryomyces hansenii CBS767]
 gi|199431523|emb|CAR65632.1| DEHA2D07304p [Debaryomyces hansenii CBS767]
          Length = 959

 Score = 39.7 bits (91), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 42/157 (26%), Positives = 68/157 (43%), Gaps = 34/157 (21%)

Query: 34  IEWMSDKLMKSFEGARNN----------PFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM 83
           +EWMS +L+K +E A +           PF    V L    +EL ++  PK+V  S  D+
Sbjct: 286 LEWMSSQLVKEWEEASSVNNNSSNKNNFPFDPSKVDLLSDPSELVQLSGPKIVFCSGIDL 345

Query: 84  ECG-FSRDLFFQWCSSPENSIIITNRTSPG-------TLARDLIEL-----GGN--RTLT 128
           + G  S +     C   + +I++T +T  G        L  D   L     GG     + 
Sbjct: 346 KNGDMSSEALQYLCQDEKTTIVLTEKTHFGLDNTINSQLYHDWYNLTKQKQGGTVEDGVA 405

Query: 129 LQVKKRIRLE---------GEELEEYQKKKDKEAKDK 156
           + ++K I LE         G EL ++Q+K + + K K
Sbjct: 406 VPLEKVISLENWNREEPLIGAELTDFQEKINLQRKQK 442


>gi|383859338|ref|XP_003705152.1| PREDICTED: integrator complex subunit 11-like isoform 2 [Megachile
           rotundata]
          Length = 494

 Score = 39.7 bits (91), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 28/107 (26%), Positives = 54/107 (50%), Gaps = 10/107 (9%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLV-STPDM-ECGFSRDL 91
           I W + K+ K+F   + N F FKH+K      + A + +P  ++V +TP M   G S  +
Sbjct: 182 ITWTNQKIKKTF--VQRNMFDFKHIKPF----DKAYIDNPGAMVVFATPGMLHAGLSLQI 235

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           F +W  +  N +I+      GT+   +  L G+R +  + ++ + ++
Sbjct: 236 FKKWAPNEANMVIMPGFCVQGTVGHKV--LNGSRRIEFENRQIVEVK 280


>gi|385775961|ref|YP_005648529.1| beta-lactamase [Sulfolobus islandicus REY15A]
 gi|323474709|gb|ADX85315.1| beta-lactamase domain protein [Sulfolobus islandicus REY15A]
          Length = 638

 Score = 39.7 bits (91), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 20/84 (23%), Positives = 37/84 (44%)

Query: 35  EWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQ 94
           EW+  ++ +       NPF  +H K      E      P ++L ++  +  G + + F  
Sbjct: 461 EWLGREVREEILYKDENPFTSEHFKRIEGYKEDIAKGEPSIILATSGMLNGGPAVEFFKT 520

Query: 95  WCSSPENSIIITNRTSPGTLARDL 118
               P+N+II  +  + GTL R +
Sbjct: 521 MAPDPKNAIIFVSYQAEGTLGRKV 544


>gi|380011463|ref|XP_003689822.1| PREDICTED: integrator complex subunit 11-like [Apis florea]
          Length = 595

 Score = 39.7 bits (91), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 28/107 (26%), Positives = 54/107 (50%), Gaps = 10/107 (9%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLV-STPDM-ECGFSRDL 91
           I W + K+ K+F   + N F FKH+K      + A + +P  ++V +TP M   G S  +
Sbjct: 283 ITWTNQKIKKTF--VQRNMFDFKHIKPF----DKAYIDNPGAMVVFATPGMLHAGLSLQI 336

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           F +W  +  N +I+      GT+   +  L G+R +  + ++ + ++
Sbjct: 337 FKKWAPNEANMVIMPGFCVQGTVGHKV--LNGSRRIEFENRQIVEVK 381


>gi|391330858|ref|XP_003739869.1| PREDICTED: integrator complex subunit 11-like [Metaseiulus
           occidentalis]
          Length = 601

 Score = 39.7 bits (91), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 26/97 (26%), Positives = 42/97 (43%), Gaps = 6/97 (6%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+  +F    +N F FKH+K           P P VV  +   +  G S  +F 
Sbjct: 283 ITWTNQKIRNTF--VDHNMFDFKHIKPFDR--AYIDNPGPMVVFATPGMLHAGLSLQIFK 338

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQ 130
           +W    EN +I+      GT+   +  L G R + ++
Sbjct: 339 KWAPFEENMVIMPGYCVSGTVGAKI--LNGARRVEIE 373


>gi|385773323|ref|YP_005645889.1| beta-lactamase [Sulfolobus islandicus HVE10/4]
 gi|323477437|gb|ADX82675.1| beta-lactamase domain protein [Sulfolobus islandicus HVE10/4]
          Length = 638

 Score = 39.7 bits (91), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 20/84 (23%), Positives = 37/84 (44%)

Query: 35  EWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQ 94
           EW+  ++ +       NPF  +H K      E      P ++L ++  +  G + + F  
Sbjct: 461 EWLGREVREEILYKDENPFTSEHFKRIEGYKEDIAKGEPSIILATSGMLNGGPAVEFFKT 520

Query: 95  WCSSPENSIIITNRTSPGTLARDL 118
               P+N+II  +  + GTL R +
Sbjct: 521 MAPDPKNAIIFVSYQAEGTLGRKV 544


>gi|322790916|gb|EFZ15582.1| hypothetical protein SINV_14424 [Solenopsis invicta]
          Length = 277

 Score = 39.7 bits (91), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 28/107 (26%), Positives = 54/107 (50%), Gaps = 10/107 (9%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLV-STPDM-ECGFSRDL 91
           I W + K+ K+F   + N F FKH+K      + A + +P  ++V +TP M   G S  +
Sbjct: 60  ITWTNQKIKKTF--VQRNMFDFKHIKPF----DKAYIDNPGAMVVFATPGMLHAGLSLQI 113

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           F +W  +  N +I+      GT+   +  L G+R +  + ++ + ++
Sbjct: 114 FKKWAPNESNMVIMPGFCVQGTVGHKV--LNGSRRIEFENRQIVEVK 158


>gi|227827653|ref|YP_002829433.1| beta-lactamase [Sulfolobus islandicus M.14.25]
 gi|229584857|ref|YP_002843359.1| beta-lactamase domain-containing protein [Sulfolobus islandicus
           M.16.27]
 gi|238619821|ref|YP_002914647.1| beta-lactamase domain-containing protein [Sulfolobus islandicus
           M.16.4]
 gi|227459449|gb|ACP38135.1| beta-lactamase domain protein [Sulfolobus islandicus M.14.25]
 gi|228019907|gb|ACP55314.1| beta-lactamase domain protein [Sulfolobus islandicus M.16.27]
 gi|238380891|gb|ACR41979.1| beta-lactamase domain protein [Sulfolobus islandicus M.16.4]
          Length = 638

 Score = 39.7 bits (91), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 20/84 (23%), Positives = 37/84 (44%)

Query: 35  EWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQ 94
           EW+  ++ +       NPF  +H K      E      P ++L ++  +  G + + F  
Sbjct: 461 EWLGREVREEILYKDENPFTSEHFKRIEGYKEDIAKGEPSIILATSGMLNGGPAVEFFKT 520

Query: 95  WCSSPENSIIITNRTSPGTLARDL 118
               P+N+II  +  + GTL R +
Sbjct: 521 MAPDPKNAIIFVSYQAEGTLGRKV 544


>gi|401404496|ref|XP_003881737.1| hypothetical protein NCLIV_014990 [Neospora caninum Liverpool]
 gi|325116150|emb|CBZ51704.1| hypothetical protein NCLIV_014990 [Neospora caninum Liverpool]
          Length = 1033

 Score = 39.7 bits (91), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 24/72 (33%), Positives = 39/72 (54%), Gaps = 5/72 (6%)

Query: 51  NPFHFKHVKLCHSLAELAKV----PSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIIT 106
           NPF F+ VK   S+ E A+V      P V++ +   ++ G SR++F       +N +I+T
Sbjct: 420 NPFAFRFVKNLKSV-ESARVYIHHDGPAVIMAAPGMLQSGASREIFEALAPESKNGVILT 478

Query: 107 NRTSPGTLARDL 118
             +  GTLA +L
Sbjct: 479 GYSVKGTLADEL 490


>gi|307215032|gb|EFN89859.1| Integrator complex subunit 11 [Harpegnathos saltator]
          Length = 594

 Score = 39.7 bits (91), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 28/107 (26%), Positives = 53/107 (49%), Gaps = 10/107 (9%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLV-STPDM-ECGFSRDL 91
           I W + K+ K+F   + N F FKH+K      + A + +P  ++V +TP M   G S  +
Sbjct: 282 ITWTNQKIKKTF--VQRNMFDFKHIKPF----DKAYIDNPGAMVVFATPGMLHAGLSLQI 335

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           F +W  +  N +I+      GT+   +  L G R +  + ++ + ++
Sbjct: 336 FKKWAPNESNMVIMPGFCVQGTVGHKV--LNGTRRIEFENRQIVEVK 380


>gi|229579166|ref|YP_002837564.1| beta-lactamase domain-containing protein [Sulfolobus islandicus
           Y.G.57.14]
 gi|229582079|ref|YP_002840478.1| beta-lactamase domain-containing protein [Sulfolobus islandicus
           Y.N.15.51]
 gi|284997854|ref|YP_003419621.1| beta-lactamase domain-containing protein [Sulfolobus islandicus
           L.D.8.5]
 gi|228009880|gb|ACP45642.1| beta-lactamase domain protein [Sulfolobus islandicus Y.G.57.14]
 gi|228012795|gb|ACP48556.1| beta-lactamase domain protein [Sulfolobus islandicus Y.N.15.51]
 gi|284445749|gb|ADB87251.1| beta-lactamase domain protein [Sulfolobus islandicus L.D.8.5]
          Length = 638

 Score = 39.7 bits (91), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 20/84 (23%), Positives = 37/84 (44%)

Query: 35  EWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQ 94
           EW+  ++ +       NPF  +H K      E      P ++L ++  +  G + + F  
Sbjct: 461 EWLGREVREEILYKDENPFTSEHFKRIEGYKEDIAKGEPSIILATSGMLNGGPAVEFFKT 520

Query: 95  WCSSPENSIIITNRTSPGTLARDL 118
               P+N+II  +  + GTL R +
Sbjct: 521 MAPDPKNAIIFVSYQAEGTLGRKV 544


>gi|332030612|gb|EGI70300.1| Integrator complex subunit 11 [Acromyrmex echinatior]
          Length = 536

 Score = 39.7 bits (91), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 28/107 (26%), Positives = 53/107 (49%), Gaps = 10/107 (9%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLV-STPDM-ECGFSRDL 91
           I W + K+ K+F   + N F FKH+K      + A + +P  ++V +TP M   G S  +
Sbjct: 224 ITWTNQKIKKTF--VQRNMFDFKHIKPF----DKAYIDNPGAMVVFATPGMLHAGLSLQI 277

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           F +W  +  N +I+      GT+   +  L G R +  + ++ + ++
Sbjct: 278 FKKWAPNESNMVIMPGFCVQGTVGHKV--LNGTRRIEFENRQIVEVK 322


>gi|227830349|ref|YP_002832129.1| beta-lactamase domain-containing protein [Sulfolobus islandicus
           L.S.2.15]
 gi|227456797|gb|ACP35484.1| beta-lactamase domain protein [Sulfolobus islandicus L.S.2.15]
          Length = 638

 Score = 39.7 bits (91), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 20/84 (23%), Positives = 37/84 (44%)

Query: 35  EWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQ 94
           EW+  ++ +       NPF  +H K      E      P ++L ++  +  G + + F  
Sbjct: 461 EWLGREVREEILYKDENPFTSEHFKRIEGYKEDIAKGEPSIILATSGMLNGGPAVEFFKT 520

Query: 95  WCSSPENSIIITNRTSPGTLARDL 118
               P+N+II  +  + GTL R +
Sbjct: 521 MAPDPKNAIIFVSYQAEGTLGRKV 544


>gi|342319748|gb|EGU11695.1| Endoribonuclease YSH1 [Rhodotorula glutinis ATCC 204091]
          Length = 857

 Score = 39.7 bits (91), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 30/92 (32%), Positives = 44/92 (47%), Gaps = 9/92 (9%)

Query: 34  IEWMSDKLMKSFEGARN-NPFHFKHV-----KLCHSLAELAKVPSPKVVLVSTPDMECGF 87
           I  MS  +   F  AR  NPF FK        L   +++L    +P VV+ S   +  G 
Sbjct: 316 IHTMSPNVRSKF--ARGINPFDFKRKDSFIRPLDRGISKLNDR-NPCVVMASPGFLTSGV 372

Query: 88  SRDLFFQWCSSPENSIIITNRTSPGTLARDLI 119
           SR+L  +W   P N +IIT  +  G +AR ++
Sbjct: 373 SRELLEKWAPDPRNGLIITGYSVEGVMARTIM 404


>gi|340728535|ref|XP_003402577.1| PREDICTED: integrator complex subunit 11-like [Bombus terrestris]
 gi|350421011|ref|XP_003492700.1| PREDICTED: integrator complex subunit 11-like [Bombus impatiens]
          Length = 595

 Score = 39.7 bits (91), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 28/107 (26%), Positives = 54/107 (50%), Gaps = 10/107 (9%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLV-STPDM-ECGFSRDL 91
           I W + K+ K+F   + N F FKH+K      + A + +P  ++V +TP M   G S  +
Sbjct: 283 ITWTNQKIKKTF--VQRNMFDFKHIKPF----DKAYIDNPGAMVVFATPGMLHAGLSLQI 336

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           F +W  +  N +I+      GT+   +  L G+R +  + ++ + ++
Sbjct: 337 FKKWAPNEANMVIMPGFCVQGTVGHKV--LNGSRRIEFENRQIVEVK 381


>gi|320593246|gb|EFX05655.1| cleavage and polyadenylation specificity factor subunit [Grosmannia
           clavigera kw1407]
          Length = 857

 Score = 39.7 bits (91), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 19/72 (26%), Positives = 40/72 (55%), Gaps = 3/72 (4%)

Query: 50  NNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFFQWCSSPENSIIITNR 108
             P+ F+ ++   +L     +     V++++P M + G SR+L  +W  S +N +IIT  
Sbjct: 330 GGPWDFRFIRSLKNLDRFEDIGG--CVMLASPGMLQNGVSRELLERWAPSDKNGVIITGY 387

Query: 109 TSPGTLARDLIE 120
           +  GT+A+ +++
Sbjct: 388 SVEGTMAKQIMQ 399


>gi|358060736|dbj|GAA93507.1| hypothetical protein E5Q_00148 [Mixia osmundae IAM 14324]
          Length = 1378

 Score = 39.3 bits (90), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 27/73 (36%), Positives = 39/73 (53%), Gaps = 5/73 (6%)

Query: 50  NNPFHF---KHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFFQWCSSPENSIII 105
           +NP+ F   + VK   S+       SP VV V+TP M + G SR+L  +W     N +I+
Sbjct: 343 DNPYDFGKGRFVKELRSIRRGVADKSPCVV-VATPGMLQPGTSRELLERWAGDRRNGLIL 401

Query: 106 TNRTSPGTLARDL 118
              +  G+LARDL
Sbjct: 402 CGYSVEGSLARDL 414


>gi|119494361|ref|XP_001264076.1| cleavage and polyadenylation specifity factor, 73 kDa subunit,
           putative [Neosartorya fischeri NRRL 181]
 gi|119412238|gb|EAW22179.1| cleavage and polyadenylation specifity factor, 73 kDa subunit,
           putative [Neosartorya fischeri NRRL 181]
          Length = 878

 Score = 39.3 bits (90), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 25/90 (27%), Positives = 45/90 (50%), Gaps = 9/90 (10%)

Query: 31  LAKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSR 89
           +A+ E   DK       A   P+ FK V+   SL     +     V++++P M + G SR
Sbjct: 319 MAEAEASGDK------SASAGPWDFKFVRSLRSLERFDDLGG--CVMLASPGMLQTGTSR 370

Query: 90  DLFFQWCSSPENSIIITNRTSPGTLARDLI 119
           +L  +W  +  N +++T  +  GT+A+ L+
Sbjct: 371 ELLERWAPNERNGVVMTGYSVEGTMAKQLL 400


>gi|158298905|ref|XP_319042.4| AGAP009923-PA [Anopheles gambiae str. PEST]
 gi|157014111|gb|EAA13845.4| AGAP009923-PA [Anopheles gambiae str. PEST]
          Length = 608

 Score = 39.3 bits (90), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 27/104 (25%), Positives = 51/104 (49%), Gaps = 10/104 (9%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLV-STPDM-ECGFSRDL 91
           I W + K+ K+F   + N F FKH+K      +   + +P  ++V +TP M   G S  +
Sbjct: 283 ITWTNQKIRKTF--VQRNMFDFKHIKPF----DKGYIDNPGAMVVFATPGMLHAGLSLQI 336

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRI 135
           F +W  +  N +I+      GT+   +  LGG + +  + ++ +
Sbjct: 337 FKKWAPNENNMVIMPGYCVQGTVGHKI--LGGAKKVEFENRQVV 378


>gi|34762198|ref|ZP_00143205.1| hypothetical protein [Fusobacterium nucleatum subsp. vincentii ATCC
           49256]
 gi|27888159|gb|EAA25218.1| hypothetical protein [Fusobacterium nucleatum subsp. vincentii ATCC
           49256]
          Length = 209

 Score = 39.3 bits (90), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 56/205 (27%), Positives = 90/205 (43%), Gaps = 43/205 (20%)

Query: 5   LMKSFE----GARNNPFHFKHVKLCHSLAELAKIEWMSDKLMKSFEGARNNPFHFKHVKL 60
           +MK+ E    G++   F F   ++     + AKIE    KL+ SF   RNN F      +
Sbjct: 7   VMKAVEEALAGSQLRKFDFASYEISDD--DKAKIEEQEVKLLNSFRKYRNNLFD-----I 59

Query: 61  CHSLAELAKVPSPKVVLVS-------TPDMECGFSR--DLFFQWCSSPENSI-------- 103
           C SLAE+AK+  P    ++       T DM   F +  DL+  W    ++ I        
Sbjct: 60  CSSLAEVAKLLKPSGSFMAWYESAGLTKDMVSVFLKRWDLYL-WQPDYKDKIFSLSDQAI 118

Query: 104 -IITNRT-----SPGTLARDLIELGGNRTLTLQVKK----RIRLEGEELEEYQKKKDKEA 153
            II+N +       G L  D++++   + L  Q KK     +++E ++   + K K  E 
Sbjct: 119 KIISNNSLGLEVVKGILEADVLKVREIKKLLPQPKKEEKSEVKVERQKYFNFNKIKKMEK 178

Query: 154 KDK----QEKEKIPPHDTSFINELQ 174
           + K    +EK +     T +IN LQ
Sbjct: 179 RAKKLKDEEKTEYKKELTEYINSLQ 203


>gi|169767044|ref|XP_001817993.1| endoribonuclease ysh1 [Aspergillus oryzae RIB40]
 gi|83765848|dbj|BAE55991.1| unnamed protein product [Aspergillus oryzae RIB40]
 gi|391872741|gb|EIT81836.1| mRNA cleavage and polyadenylation factor II complex, BRR5
           [Aspergillus oryzae 3.042]
          Length = 870

 Score = 39.3 bits (90), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 24/90 (26%), Positives = 46/90 (51%), Gaps = 9/90 (10%)

Query: 31  LAKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSR 89
           +A+ E   DK + +       P+ F+ V+   SL     V     V++++P M + G SR
Sbjct: 319 MAEAEASGDKSISA------GPWDFRFVRSLRSLERFDDVGG--CVMLASPGMLQTGTSR 370

Query: 90  DLFFQWCSSPENSIIITNRTSPGTLARDLI 119
           +L  +W  +  N +++T  +  GT+A+ L+
Sbjct: 371 ELLERWAPNERNGVVMTGYSVEGTMAKQLL 400


>gi|238483863|ref|XP_002373170.1| cleavage and polyadenylation specifity factor, 73 kDa subunit
           [Aspergillus flavus NRRL3357]
 gi|220701220|gb|EED57558.1| cleavage and polyadenylation specifity factor, 73 kDa subunit
           [Aspergillus flavus NRRL3357]
          Length = 870

 Score = 39.3 bits (90), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 24/90 (26%), Positives = 46/90 (51%), Gaps = 9/90 (10%)

Query: 31  LAKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSR 89
           +A+ E   DK + +       P+ F+ V+   SL     V     V++++P M + G SR
Sbjct: 319 MAEAEASGDKSISA------GPWDFRFVRSLRSLERFDDVGG--CVMLASPGMLQTGTSR 370

Query: 90  DLFFQWCSSPENSIIITNRTSPGTLARDLI 119
           +L  +W  +  N +++T  +  GT+A+ L+
Sbjct: 371 ELLERWAPNERNGVVMTGYSVEGTMAKQLL 400


>gi|195341281|ref|XP_002037239.1| GM12816 [Drosophila sechellia]
 gi|195574829|ref|XP_002105386.1| GD21460 [Drosophila simulans]
 gi|194131355|gb|EDW53398.1| GM12816 [Drosophila sechellia]
 gi|194201313|gb|EDX14889.1| GD21460 [Drosophila simulans]
          Length = 597

 Score = 39.3 bits (90), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 25/105 (23%), Positives = 46/105 (43%), Gaps = 6/105 (5%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F     N F FKH+K           P   VV  +   +  G S  +F 
Sbjct: 283 ITWTNQKIRKTF--VHRNMFDFKHIKPFDK--NYIDNPGAMVVFATPGMLHAGLSLQIFK 338

Query: 94  QWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
           +W  +  N +I+      GT+   +  LGG + +  + ++ + ++
Sbjct: 339 KWAPNENNMVIMPGYCVQGTVGNKI--LGGAKKVEFENRQVVEVK 381


>gi|403158620|ref|XP_003319317.2| hypothetical protein PGTG_01491 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
 gi|375166386|gb|EFP74898.2| hypothetical protein PGTG_01491 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
          Length = 778

 Score = 39.3 bits (90), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 16/49 (32%), Positives = 30/49 (61%)

Query: 72  SPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIE 120
            P VV+ S   M+ G SR+L  +W     N++++T  +  GT+AR++++
Sbjct: 357 GPCVVMASPGFMQSGVSRELLERWAPDRRNALLVTGYSIEGTMAREMLK 405


>gi|300121266|emb|CBK21646.2| unnamed protein product [Blastocystis hominis]
          Length = 400

 Score = 38.9 bits (89), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 19/98 (19%), Positives = 45/98 (45%)

Query: 37  MSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWC 96
           +++++++    A    F  + V     +  + +  + KVV+ + P +E  F++ L  +WC
Sbjct: 291 LNERILQQSAEAERKAFDLQFVTCVSIVENVLESQASKVVIATLPGLETSFAQTLLKKWC 350

Query: 97  SSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKR 134
           +  EN ++      P TL   ++      T    V+++
Sbjct: 351 TRSENLLLFVCSPPPDTLGYRILNSPEESTFEFIVREK 388


>gi|225679068|gb|EEH17352.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb03]
          Length = 984

 Score = 38.9 bits (89), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 18/59 (30%), Positives = 33/59 (55%), Gaps = 3/59 (5%)

Query: 51  NPFHFKHVKLCHSLAELAKV---PSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIIT 106
            PF F+H+K+     +L K+    +P+V+L     +E G+S+ +  +  +  EN II+T
Sbjct: 361 GPFTFRHLKIVERKTKLDKILGSNAPQVILTPDTSLEWGYSKHVLQKIAAGSENLIILT 419


>gi|308198072|ref|XP_001387057.2| predicted protein [Scheffersomyces stipitis CBS 6054]
 gi|149389019|gb|EAZ63034.2| predicted protein [Scheffersomyces stipitis CBS 6054]
          Length = 934

 Score = 38.9 bits (89), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 26/87 (29%), Positives = 42/87 (48%), Gaps = 7/87 (8%)

Query: 34  IEWMSDKLMKSFEGA------RNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECG- 86
           I+WMS  +   +E A      +N PF    V L  S  EL ++  PK+V  S  D+  G 
Sbjct: 286 IDWMSSSVQSQWEEAESSTNYKNLPFDPSKVDLLLSPEELIQLSGPKIVFCSGIDLRNGE 345

Query: 87  FSRDLFFQWCSSPENSIIITNRTSPGT 113
            S + F   C   +++I++T ++  G 
Sbjct: 346 LSAEAFQYLCQDEKSTILLTEKSLFGV 372


>gi|448124505|ref|XP_004204939.1| Piso0_000226 [Millerozyma farinosa CBS 7064]
 gi|358249572|emb|CCE72638.1| Piso0_000226 [Millerozyma farinosa CBS 7064]
          Length = 948

 Score = 38.5 bits (88), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 40/157 (25%), Positives = 69/157 (43%), Gaps = 34/157 (21%)

Query: 34  IEWMSDKLMKSFEGARNN----------PFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM 83
           +EWMS +L+K +E A ++          PF    V L     EL ++  PK+V  S  D 
Sbjct: 286 LEWMSGQLIKEWEEASSSTNNSSNKNNFPFDPSKVDLLSDPNELIQLSGPKIVFCSGLDF 345

Query: 84  ECG-FSRDLFFQWCSSPENSIIITNRTSPGT-------LARDLIELGGNRT-------LT 128
           + G  S ++    C   + +II+T +T  G+       L R+  EL   R          
Sbjct: 346 KDGDVSFEVLSYLCQDEKTTIILTEKTHFGSDDTINSQLYREWYELTKQRNGGLVEDGTA 405

Query: 129 LQVKKRIRLE---------GEELEEYQKKKDKEAKDK 156
           + ++K I L+         G EL ++Q++  ++ K +
Sbjct: 406 VPLEKIINLQHWTKEEPLIGTELSDFQERISQQRKQR 442


>gi|226288011|gb|EEH43524.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb18]
          Length = 999

 Score = 38.5 bits (88), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 23/79 (29%), Positives = 41/79 (51%), Gaps = 4/79 (5%)

Query: 51  NPFHFKHVKLCHSLAELAKV---PSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITN 107
            PF F+H+K+     +L K+    +P+V+L     +E G+S+ +  +  +  EN II+T 
Sbjct: 401 GPFTFRHLKIVERKTKLDKILGSNAPQVILTPDTSLEWGYSKHVLQKIAAGSENLIILTE 460

Query: 108 RTSPGTLARDLIELGGNRT 126
              P +      E GG+R+
Sbjct: 461 -PYPISANAQRTEDGGSRS 478


>gi|410730217|ref|XP_003671288.2| hypothetical protein NDAI_0G02680 [Naumovozyma dairenensis CBS 421]
 gi|401780106|emb|CCD26045.2| hypothetical protein NDAI_0G02680 [Naumovozyma dairenensis CBS 421]
          Length = 846

 Score = 38.5 bits (88), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 49/179 (27%), Positives = 82/179 (45%), Gaps = 45/179 (25%)

Query: 9   FEGARNNPFHFKHVKLCHSLAELAK----------IEWMSDKLMKSFEGARNNPFHF--- 55
           FE  +N PF +KH+ +   L   +K          +EW+S  L+K++E   +NP  F   
Sbjct: 269 FENGKN-PF-YKHLPVL--LVSYSKGRALTYTKSMLEWLSSSLLKTWESRSSNPSPFDLG 324

Query: 56  KHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSP-----ENSIIITNRTS 110
              K+  S  EL+K P+ K+ LVS  D+      +     C S      + +I++T+  +
Sbjct: 325 NRFKVVTS-DELSKYPNSKICLVSNVDI---LLDETVAHLCDSKSQHQNKTTILLTSNMN 380

Query: 111 PGTLAR-------------DLIELGGNRTLTLQVKKRIRLEGEELEEY----QKKKDKE 152
            G L               DLI+   N+T+++   +   L  EEL EY    +++K+KE
Sbjct: 381 NGILQNMKECWEEQKVKEGDLIKF--NKTISVHNIQLDPLNDEELSEYKSVLEERKNKE 437


>gi|71027889|ref|XP_763588.1| cleavage and polyadenylation specificity factor protein [Theileria
           parva strain Muguga]
 gi|68350541|gb|EAN31305.1| cleavage and polyadenylation specificity factor protein, putative
           [Theileria parva]
          Length = 708

 Score = 38.5 bits (88), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 20/71 (28%), Positives = 36/71 (50%), Gaps = 3/71 (4%)

Query: 51  NPFHFKHVKLCHSLAELAKV---PSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITN 107
           NPF+FK V+   S+ ++        P +++ S   ++ G S ++F   C    N +++T 
Sbjct: 312 NPFNFKFVRYARSIKQIRNYLLRDGPCIIMTSPGMLQGGPSLEVFELLCPDNRNGVVLTG 371

Query: 108 RTSPGTLARDL 118
               GTLA +L
Sbjct: 372 YAVKGTLADEL 382


>gi|344301243|gb|EGW31555.1| hypothetical protein SPAPADRAFT_67601 [Spathaspora passalidarum
           NRRL Y-27907]
          Length = 1032

 Score = 38.5 bits (88), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 38/158 (24%), Positives = 73/158 (46%), Gaps = 35/158 (22%)

Query: 34  IEWMSDKLMKSFE------------GARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTP 81
           ++WMS   +KS++               + PF    V L    +EL ++  PK+V  S  
Sbjct: 286 LDWMSGSFIKSWDETSGDGGRGGGKALSSMPFDPSKVDLLLDPSELIQLSGPKIVFCSGI 345

Query: 82  DMECG-FSRDLFFQWCSSPENSIIITNRTS--PGTLARDLI---------ELGG------ 123
           D++ G  S + F   C++ + ++I+T ++    G L   L          +LGG      
Sbjct: 346 DIKSGDISSETFQYLCNNEKTTVILTEKSQLENGGLNSMLYKEWYELTKKKLGGKIEDGT 405

Query: 124 ----NRTLTLQVKKR-IRLEGEELEEYQKKKDKEAKDK 156
               ++T++++   R   LEG EL ++Q++  ++ K+K
Sbjct: 406 AVPLDKTVSIEDWTRETNLEGRELSDFQERITQQRKEK 443


>gi|46360445|gb|AAS80153.1| ACT11D09.9 [Cucumis melo]
          Length = 708

 Score = 38.1 bits (87), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 29/88 (32%), Positives = 43/88 (48%), Gaps = 8/88 (9%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHV-KLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDL 91
           I W S K+ +++     N F FK+V K   S+ +    P P  VL +TP M   GFS ++
Sbjct: 315 ISWTSQKVKETY--TTRNAFDFKNVQKFDRSMID---APGP-CVLFATPGMISSGFSLEV 368

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLI 119
           F +W  S  N I +      GT+   L+
Sbjct: 369 FKRWAPSKLNLITLPGYCVAGTVGHKLM 396


>gi|449460766|ref|XP_004148116.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           3-II-like [Cucumis sativus]
          Length = 649

 Score = 38.1 bits (87), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 29/88 (32%), Positives = 43/88 (48%), Gaps = 8/88 (9%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHV-KLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDL 91
           I W S K+ +++     N F FK+V K   S+ +    P P  VL +TP M   GFS ++
Sbjct: 282 ISWTSQKVKETY--TTRNAFDFKNVQKFDRSMID---APGP-CVLFATPGMISSGFSLEV 335

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLI 119
           F +W  S  N I +      GT+   L+
Sbjct: 336 FKRWAPSKLNLITLPGYCVAGTVGHKLM 363


>gi|156546030|ref|XP_001608037.1| PREDICTED: integrator complex subunit 11-like isoform 1 [Nasonia
           vitripennis]
 gi|345498393|ref|XP_003428220.1| PREDICTED: integrator complex subunit 11-like isoform 2 [Nasonia
           vitripennis]
 gi|345498395|ref|XP_003428221.1| PREDICTED: integrator complex subunit 11-like isoform 3 [Nasonia
           vitripennis]
          Length = 595

 Score = 38.1 bits (87), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 30/113 (26%), Positives = 51/113 (45%), Gaps = 18/113 (15%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLV-STPDM-ECGFSRDL 91
           I W + K+ K+F   + N F FKH+K      + + + +P  ++V +TP M   G S  +
Sbjct: 283 ITWTNQKIKKTF--VQRNMFDFKHIKPF----DKSYIDNPGAMVVFATPGMLHAGLSLQI 336

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELEE 144
           F +W  +  N +I+      GT+   +          L   KRI  E  ++ E
Sbjct: 337 FKKWAPNEANMVIMPGFCVQGTVGHKV----------LNGAKRIEFENRQIVE 379


>gi|300706889|ref|XP_002995677.1| hypothetical protein NCER_101357 [Nosema ceranae BRL01]
 gi|239604869|gb|EEQ82006.1| hypothetical protein NCER_101357 [Nosema ceranae BRL01]
          Length = 500

 Score = 38.1 bits (87), Expect = 4.1,   Method: Compositional matrix adjust.
 Identities = 28/95 (29%), Positives = 46/95 (48%), Gaps = 7/95 (7%)

Query: 50  NNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFFQWCSSPENSIIITNR 108
           +N F FKH            +P   +VL ++P M   G S  +F +WC+ P+N +II   
Sbjct: 294 HNVFDFKHTTTFQKHFLDLNIP---MVLFASPGMLHSGMSLKVFKEWCTDPKNLVIIPGY 350

Query: 109 TSPGTLARDLIELGGNRTL-TLQVKKRIRLEGEEL 142
              GT+   +  L GN+ +  L   K I+++ + L
Sbjct: 351 CVKGTVGDKV--LNGNKEIEILGELKEIKIQVKNL 383


>gi|354543512|emb|CCE40231.1| hypothetical protein CPAR2_102690 [Candida parapsilosis]
          Length = 938

 Score = 38.1 bits (87), Expect = 4.1,   Method: Compositional matrix adjust.
 Identities = 24/96 (25%), Positives = 47/96 (48%), Gaps = 5/96 (5%)

Query: 34  IEWMSDKLMKSF--EGARNN--PFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECG-FS 88
           ++WMS    K++  +G  N+  PF+   V L    +EL + P  K++  +  D++ G  S
Sbjct: 286 MDWMSSDFNKTWNTDGGNNSLLPFNPSKVDLLLDPSELTQTPGAKIIFCAGLDLKNGDLS 345

Query: 89  RDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGN 124
             +F   C+    ++I+T + S   +  +   L G+
Sbjct: 346 SKVFSYLCNDERTTVILTEKPSSANVEAEGSGLSGD 381


>gi|407847992|gb|EKG03521.1| cleavage and polyadenylation specificity factor, putative
           [Trypanosoma cruzi]
          Length = 883

 Score = 38.1 bits (87), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 26/108 (24%), Positives = 49/108 (45%), Gaps = 4/108 (3%)

Query: 55  FKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSP----ENSIIITNRTS 110
           F +V  C +  E+  V  PK+ +     ++ G S +L   +  +     EN +++T    
Sbjct: 395 FANVVTCKTAEEVLSVAGPKICIADGETLDYGVSAELLGHFLQADADERENLVVLTGAPK 454

Query: 111 PGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKDKEAKDKQE 158
           P T A  +        + L+   R  L  EELEEY  + + E +++++
Sbjct: 455 PHTNAFTMAAAKKGDAIDLRYTIRSPLGKEELEEYYLQIELEMEEQRK 502


>gi|449518964|ref|XP_004166505.1| PREDICTED: LOW QUALITY PROTEIN: cleavage and polyadenylation
           specificity factor subunit 3-II-like, partial [Cucumis
           sativus]
          Length = 472

 Score = 38.1 bits (87), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 29/88 (32%), Positives = 43/88 (48%), Gaps = 8/88 (9%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHV-KLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDL 91
           I W S K+ +++     N F FK+V K   S+ +    P P  VL +TP M   GFS ++
Sbjct: 126 ISWTSQKVKETY--TTRNAFDFKNVQKFDRSMID---APGP-CVLFATPGMISSGFSLEV 179

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLI 119
           F +W  S  N I +      GT+   L+
Sbjct: 180 FKRWAPSKLNLITLPGYCVAGTVGHKLM 207


>gi|350633583|gb|EHA21948.1| hypothetical protein ASPNIDRAFT_41125 [Aspergillus niger ATCC 1015]
          Length = 1015

 Score = 38.1 bits (87), Expect = 4.3,   Method: Composition-based stats.
 Identities = 24/103 (23%), Positives = 41/103 (39%), Gaps = 30/103 (29%)

Query: 34  IEWMSDKLMKSFEGARN---------------------------NPFHFKHVKLCHSLAE 66
           +EWM + +++ FE A                              PF FKH+++      
Sbjct: 349 LEWMDENIVREFEAAEGVDAATGQSNTEGQRAGQNQGKTEGKGVGPFTFKHLRILERKKR 408

Query: 67  LAKVPS---PKVVLVSTPDMECGFSRDLFFQWCSSPENSIIIT 106
           L K+ S   PKV+L S   ++ GF++D          N +++T
Sbjct: 409 LEKILSDQKPKVILASDTSLDWGFAKDSLRLVAEGANNLLLLT 451


>gi|317036117|ref|XP_001397647.2| cleavage and polyadenylylation specificity factor [Aspergillus
           niger CBS 513.88]
          Length = 1015

 Score = 38.1 bits (87), Expect = 4.3,   Method: Composition-based stats.
 Identities = 24/103 (23%), Positives = 41/103 (39%), Gaps = 30/103 (29%)

Query: 34  IEWMSDKLMKSFEGARN---------------------------NPFHFKHVKLCHSLAE 66
           +EWM + +++ FE A                              PF FKH+++      
Sbjct: 349 LEWMDENIVREFEAAEGVDAATGQSNTEGQRAGQNQGKTEGKGVGPFTFKHLRILERKKR 408

Query: 67  LAKVPS---PKVVLVSTPDMECGFSRDLFFQWCSSPENSIIIT 106
           L K+ S   PKV+L S   ++ GF++D          N +++T
Sbjct: 409 LEKILSDQKPKVILASDTSLDWGFAKDSLRLVAEGANNLLLLT 451


>gi|358368318|dbj|GAA84935.1| cleavage and polyadenylylation specificity factor [Aspergillus
           kawachii IFO 4308]
          Length = 1015

 Score = 38.1 bits (87), Expect = 4.5,   Method: Composition-based stats.
 Identities = 24/103 (23%), Positives = 41/103 (39%), Gaps = 30/103 (29%)

Query: 34  IEWMSDKLMKSFEGARN---------------------------NPFHFKHVKLCHSLAE 66
           +EWM + +++ FE A                              PF FKH+++      
Sbjct: 349 LEWMDENIVREFEAAEGVDAATGQSNTEGQRAGQNQGKAEGKGVGPFTFKHLRILERKKR 408

Query: 67  LAKVPS---PKVVLVSTPDMECGFSRDLFFQWCSSPENSIIIT 106
           L K+ S   PKV+L S   ++ GF++D          N +++T
Sbjct: 409 LEKILSDQKPKVILASDTSLDWGFAKDSLRLVAEGANNLLLLT 451


>gi|168034228|ref|XP_001769615.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162679157|gb|EDQ65608.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 563

 Score = 38.1 bits (87), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 28/88 (31%), Positives = 41/88 (46%), Gaps = 8/88 (9%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSP-KVVLVSTPDM-ECGFSRDL 91
           I W + K+  ++     N F FKHV       E +K+ +P   VL +TP M   G S ++
Sbjct: 276 ISWTNQKVKDTY--VTRNTFDFKHV----IPFERSKIDAPGPCVLFATPGMLSGGLSLEV 329

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLI 119
           F  W  S  N II+      GT+   L+
Sbjct: 330 FKHWAPSESNMIILPGFCVAGTVGSKLM 357


>gi|357624104|gb|EHJ75000.1| hypothetical protein KGM_18742 [Danaus plexippus]
          Length = 595

 Score = 37.7 bits (86), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 24/88 (27%), Positives = 45/88 (51%), Gaps = 8/88 (9%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLV-STPDM-ECGFSRDL 91
           I W + K+ K+F   + N F FKH+K      + + + +P  ++V +TP M   G S ++
Sbjct: 283 ITWTNQKIRKTF--VQRNMFDFKHIKPF----DKSYIDNPGAMVVFATPGMLHAGLSLNI 336

Query: 92  FFQWCSSPENSIIITNRTSPGTLARDLI 119
           F +W    +N +I+      GT+   ++
Sbjct: 337 FKKWAPYEQNMLIMPGFCVQGTVGHKIL 364


>gi|71656590|ref|XP_816840.1| cleavage and polyadenylation specificity factor [Trypanosoma cruzi
           strain CL Brener]
 gi|50363263|gb|AAT75334.1| cleavage polyadenylation specificity factor CPSF100 [Trypanosoma
           cruzi]
 gi|70881994|gb|EAN94989.1| cleavage and polyadenylation specificity factor, putative
           [Trypanosoma cruzi]
          Length = 802

 Score = 37.7 bits (86), Expect = 5.3,   Method: Compositional matrix adjust.
 Identities = 26/108 (24%), Positives = 49/108 (45%), Gaps = 4/108 (3%)

Query: 55  FKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSP----ENSIIITNRTS 110
           F +V  C +  E+  V  PK+ +     ++ G S +L   +  +     EN +++T    
Sbjct: 314 FANVVTCKTAEEVLSVAGPKICIADGETLDYGVSAELLGHFLQADADERENLVVLTGAPK 373

Query: 111 PGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKDKEAKDKQE 158
           P T A  +        + L+   R  L  EELEEY  + + E +++++
Sbjct: 374 PHTNAFTMAAAKKGDAIDLRYTIRSPLGKEELEEYYLQIELEMEEQRK 421


>gi|448122146|ref|XP_004204382.1| Piso0_000226 [Millerozyma farinosa CBS 7064]
 gi|358349921|emb|CCE73200.1| Piso0_000226 [Millerozyma farinosa CBS 7064]
          Length = 948

 Score = 37.7 bits (86), Expect = 5.7,   Method: Compositional matrix adjust.
 Identities = 27/91 (29%), Positives = 44/91 (48%), Gaps = 11/91 (12%)

Query: 34  IEWMSDKLMKSFEGARNN----------PFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM 83
           +EWMS +L+K +E A ++          PF    V L     EL ++  PK+V  S  D 
Sbjct: 286 LEWMSGQLIKEWEEASSSTNNSSNKNNFPFDPSKVDLLSDPNELIQLSGPKIVFCSGLDF 345

Query: 84  ECG-FSRDLFFQWCSSPENSIIITNRTSPGT 113
           + G  S ++    C   + +II+T +T  G+
Sbjct: 346 KDGDVSFEVLSYLCQDEKTTIILTEKTHFGS 376


>gi|14198352|gb|AAH08240.1| Cpsf3l protein [Mus musculus]
          Length = 302

 Score = 37.7 bits (86), Expect = 6.0,   Method: Compositional matrix adjust.
 Identities = 22/86 (25%), Positives = 40/86 (46%), Gaps = 4/86 (4%)

Query: 53  FHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPG 112
           F FKH+K        A  P P VV  +   +  G S  +F +W  + +N +I+      G
Sbjct: 2   FEFKHIKAFDR--TFADNPGPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIMPGYCVQG 59

Query: 113 TLARDLIELGGNRTLTLQVKKRIRLE 138
           T+   +  L G R L ++ ++ + ++
Sbjct: 60  TVGHKI--LSGQRKLEMEGRQMLEVK 83


>gi|156848581|ref|XP_001647172.1| hypothetical protein Kpol_1036p59 [Vanderwaltozyma polyspora DSM
           70294]
 gi|156117856|gb|EDO19314.1| hypothetical protein Kpol_1036p59 [Vanderwaltozyma polyspora DSM
           70294]
          Length = 821

 Score = 37.4 bits (85), Expect = 6.7,   Method: Compositional matrix adjust.
 Identities = 40/172 (23%), Positives = 75/172 (43%), Gaps = 28/172 (16%)

Query: 9   FEGARNNPFHFKHVKLCHSL------AELAKIEWMSDKLMKSFEGARN-NPFHFKHVKLC 61
           +E +++  ++  HV L             + +EW+S  LMK++E   N +PF        
Sbjct: 266 YENSKSGSYNQTHVLLIAYFRGKVLTYARSMLEWLSSSLMKTWESRDNASPFDIGSKFKV 325

Query: 62  HSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTL------- 114
              +E++  P  KV  VS  D+      ++  + CS  + ++++T+  +  T        
Sbjct: 326 IDPSEISNFPGSKVCFVSQVDI---LLNEVLTKLCSMNKTTVLMTSTNTNNTQILETMYE 382

Query: 115 ----ARDLIELGGNRTLT------LQVKKRIRLEGEELEEYQKKKDKEAKDK 156
               A+ L +L    T++      L++     L  E+LEEY  +  KE +DK
Sbjct: 383 KWEKAKTLQKLQDGSTISFTDTVLLKIASYKPLVNEQLEEYNARL-KERRDK 433


>gi|407408786|gb|EKF32094.1| cleavage and polyadenylation specificity factor, putative
           [Trypanosoma cruzi marinkellei]
          Length = 632

 Score = 37.4 bits (85), Expect = 6.9,   Method: Compositional matrix adjust.
 Identities = 26/108 (24%), Positives = 49/108 (45%), Gaps = 4/108 (3%)

Query: 55  FKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSP----ENSIIITNRTS 110
           F +V  C +  E+  V  PK+ +     ++ G S +L   +  +     EN +++T    
Sbjct: 144 FSNVVTCKTAEEVLSVAGPKICIADGETLDYGVSAELLGHFLQADADERENLVVLTGAPK 203

Query: 111 PGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEYQKKKDKEAKDKQE 158
           P T A  +        + L+   R  L  EELEEY  + + E +++++
Sbjct: 204 PHTNAFTMAAAKKGDAIDLRYTIRSPLGKEELEEYYLQLELEMEEQRK 251


>gi|291000374|ref|XP_002682754.1| predicted protein [Naegleria gruberi]
 gi|284096382|gb|EFC50010.1| predicted protein [Naegleria gruberi]
          Length = 458

 Score = 37.4 bits (85), Expect = 7.0,   Method: Compositional matrix adjust.
 Identities = 22/87 (25%), Positives = 41/87 (47%), Gaps = 4/87 (4%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLF 92
           I W ++K+  +    + N F+F +++       L   P P +VL +TP M   G S ++F
Sbjct: 283 INWTNEKIKTTLFDQKRNLFNFSNIQSFERF--LMDNPGP-MVLFATPGMLHAGMSLEVF 339

Query: 93  FQWCSSPENSIIITNRTSPGTLARDLI 119
            +W     N +I+      GT+   ++
Sbjct: 340 KKWAPGENNKVILPGYCVEGTVGNKVL 366


>gi|440299064|gb|ELP91676.1| cleavage and polyadenylation specificity factor, putative
           [Entamoeba invadens IP1]
          Length = 566

 Score = 37.4 bits (85), Expect = 7.2,   Method: Compositional matrix adjust.
 Identities = 28/122 (22%), Positives = 51/122 (41%), Gaps = 19/122 (15%)

Query: 86  GFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEELEEY 145
           GFS + F     +P  ++ IT+  +                LTL      RL+   L   
Sbjct: 432 GFSNNRFEICPITPTKTVFITDELT--------------NVLTLS-----RLKEFNLGPI 472

Query: 146 QKKKDKEAKDKQEKEKIPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAV 205
             K    A    +KE+   H + ++ EL +    + ++  GID +  +G ++C +GTV +
Sbjct: 473 DAKVSDNALVPVKKEERSRHTSIYVGELPIKALTKQIEERGIDVKITEGTVVCAKGTVTI 532

Query: 206 RR 207
            +
Sbjct: 533 SK 534


>gi|452981499|gb|EME81259.1| hypothetical protein MYCFIDRAFT_140021 [Pseudocercospora fijiensis
           CIRAD86]
          Length = 938

 Score = 37.4 bits (85), Expect = 7.5,   Method: Composition-based stats.
 Identities = 26/99 (26%), Positives = 41/99 (41%), Gaps = 13/99 (13%)

Query: 34  IEWMSDKLMKSFEGARN----------NPFHFKHVKLCHSLAELAKV---PSPKVVLVST 80
           +EWM D +++  E A N          N   +K VK      +  +     SP ++L S 
Sbjct: 327 LEWMDDTIIQDAERAMNKGQRDDDKAPNLLDWKFVKQIERQTQFDRALRRRSPCIMLASD 386

Query: 81  PDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLI 119
             ME G+SR    +  + P N +++T   S    A   I
Sbjct: 387 ASMEWGYSRQALEKLSADPRNLVVLTESISQSQAAHSSI 425


>gi|403222958|dbj|BAM41089.1| cleavage and polyadenylation specificty factor subunit [Theileria
           orientalis strain Shintoku]
          Length = 700

 Score = 37.4 bits (85), Expect = 7.5,   Method: Compositional matrix adjust.
 Identities = 21/72 (29%), Positives = 37/72 (51%), Gaps = 3/72 (4%)

Query: 50  NNPFHFKHVKLCHSLAELAKV---PSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIIT 106
           +NPF+FK VK   S+ ++        P +++ S   ++ G S ++F        N +++T
Sbjct: 329 HNPFNFKFVKYARSVRQIRNYLLREGPCIIMTSPGMLQGGPSLEVFELISPDNRNGVVLT 388

Query: 107 NRTSPGTLARDL 118
             T  GTLA +L
Sbjct: 389 GYTVKGTLADEL 400


>gi|242013971|ref|XP_002427672.1| Endoribonuclease YSH1, putative [Pediculus humanus corporis]
 gi|212512102|gb|EEB14934.1| Endoribonuclease YSH1, putative [Pediculus humanus corporis]
          Length = 572

 Score = 37.4 bits (85), Expect = 7.7,   Method: Compositional matrix adjust.
 Identities = 24/87 (27%), Positives = 40/87 (45%), Gaps = 6/87 (6%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVK-LCHSLAELAKVPSPKVVLVSTPDMECGFSRDLF 92
           I W + K+ K+F   + N F FKH+K    S  + A    P VV  +   +  G S  +F
Sbjct: 260 ITWTNQKIRKTF--VQRNMFDFKHIKPFDRSYIDQA---WPMVVFATPGMLHAGLSLQIF 314

Query: 93  FQWCSSPENSIIITNRTSPGTLARDLI 119
            +W  +  N +I+      GT+   ++
Sbjct: 315 KKWAPNENNMVIMPGFCVAGTVGHKIL 341


>gi|119576637|gb|EAW56233.1| cleavage and polyadenylation specific factor 3-like, isoform CRA_b
           [Homo sapiens]
          Length = 329

 Score = 37.4 bits (85), Expect = 7.8,   Method: Compositional matrix adjust.
 Identities = 21/70 (30%), Positives = 33/70 (47%), Gaps = 4/70 (5%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K        A  P P VV  +   +  G S  +F 
Sbjct: 261 IPWTNQKIRKTF--VQRNMFEFKHIKAFDR--AFADNPGPMVVFATPGMLHAGQSLQIFR 316

Query: 94  QWCSSPENSI 103
           +W  + +N +
Sbjct: 317 KWAGNEKNMV 326


>gi|253742053|gb|EES98907.1| Cleavage and polyadenylation specificity factor, 73 kDa subunit
           [Giardia intestinalis ATCC 50581]
          Length = 757

 Score = 37.4 bits (85), Expect = 8.1,   Method: Compositional matrix adjust.
 Identities = 36/129 (27%), Positives = 57/129 (44%), Gaps = 11/129 (8%)

Query: 69  KVPSPKVVLVSTPDM-ECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTL 127
           K P    V+  TP M + G S++++ + C SP+N +++T   +  TL   L+E G     
Sbjct: 353 KNPKKPYVMFCTPGMLQSGVSKEMYNELCGSPDNLLLVTGYATQDTLLYKLLE-GKPPGA 411

Query: 128 TLQVKKRIRLEGEELEEYQKKKDKEAKDKQEKEKIPPHDTSFIN--ELQLSDFKQTLQRN 185
               K RI    EEL         +  D   K +  P +  FI+  E  +S  K+T+  N
Sbjct: 412 YADAKMRI----EELSFSAHSDYNQTLDVLRKTR--PRNVVFIHGSEKSISSLKRTIA-N 464

Query: 186 GIDCEFMDG 194
                F +G
Sbjct: 465 ETRVSFSEG 473


>gi|47229058|emb|CAG03810.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 698

 Score = 37.0 bits (84), Expect = 8.1,   Method: Compositional matrix adjust.
 Identities = 22/75 (29%), Positives = 34/75 (45%), Gaps = 4/75 (5%)

Query: 34  IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
           I W + K+ K+F   + N F FKH+K        A  P P VV  +   +  G S  +F 
Sbjct: 351 ITWTNQKIRKTF--VQRNMFEFKHIKAFDR--SYADNPGPMVVFATPGMLHAGQSLQIFK 406

Query: 94  QWCSSPENSIIITNR 108
           +W  + +N +    R
Sbjct: 407 KWAGNEKNMVQFLRR 421


>gi|300176709|emb|CBK24374.2| unnamed protein product [Blastocystis hominis]
          Length = 711

 Score = 37.0 bits (84), Expect = 8.2,   Method: Compositional matrix adjust.
 Identities = 30/91 (32%), Positives = 44/91 (48%), Gaps = 10/91 (10%)

Query: 51  NPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTS 110
           +PF   H +    L +  K+P P V+L S   ME G SRD+  +  S   N +I+T    
Sbjct: 287 SPFFSPHRQF---LRDFNKLP-PCVILCSPAMMENGTSRDVLERMASGENNLVILTGYCM 342

Query: 111 PGTLARDL------IELGGNRTLTLQVKKRI 135
             T+AR L      I L G +  T+ VK ++
Sbjct: 343 ARTIARRLQDNDRIIPLSGKQMETIDVKCKV 373


>gi|159111399|ref|XP_001705931.1| Cleavage and polyadenylation specificity factor, 73 kDa subunit
           [Giardia lamblia ATCC 50803]
 gi|157434022|gb|EDO78257.1| Cleavage and polyadenylation specificity factor, 73 kDa subunit
           [Giardia lamblia ATCC 50803]
          Length = 757

 Score = 37.0 bits (84), Expect = 8.4,   Method: Compositional matrix adjust.
 Identities = 36/129 (27%), Positives = 58/129 (44%), Gaps = 11/129 (8%)

Query: 69  KVPSPKVVLVSTPDM-ECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTL 127
           K P    V+  TP M + G S++++ + C SP+N +++T   +  TL   L+E G     
Sbjct: 353 KNPKKPYVMFCTPGMLQSGVSKEMYNELCGSPDNLLLVTGYATQDTLLYKLLE-GKPPGG 411

Query: 128 TLQVKKRIRLEGEELEEYQKKKDKEAKDKQEKEKIPPHDTSFIN--ELQLSDFKQTLQRN 185
               K RI    EEL         +  D   K +  P + +FI+  E  +S  K+T+  N
Sbjct: 412 YADAKMRI----EELSFSAHSDYNQTLDVLRKTR--PRNVAFIHGSEKSISSLKRTIA-N 464

Query: 186 GIDCEFMDG 194
                F +G
Sbjct: 465 ETRVNFSEG 473


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.320    0.135    0.408 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,327,089,666
Number of Sequences: 23463169
Number of extensions: 174882336
Number of successful extensions: 763982
Number of sequences better than 100.0: 947
Number of HSP's better than 100.0 without gapping: 534
Number of HSP's successfully gapped in prelim test: 413
Number of HSP's that attempted gapping in prelim test: 761258
Number of HSP's gapped (non-prelim): 2616
length of query: 276
length of database: 8,064,228,071
effective HSP length: 140
effective length of query: 136
effective length of database: 9,074,351,707
effective search space: 1234111832152
effective search space used: 1234111832152
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 76 (33.9 bits)