BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= psy13880
         (256 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|383863556|ref|XP_003707246.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Megachile rotundata]
          Length = 1415

 Score =  230 bits (587), Expect = 4e-58,   Method: Compositional matrix adjust.
 Identities = 105/193 (54%), Positives = 147/193 (76%), Gaps = 4/193 (2%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  VY  E++I+++++GF+++D + N+ LFMYQPE+RES GG +LI+K DFHLGQ VNTF
Sbjct: 1223 PAEVYTIEYLIDNNNLGFLVADGESNIALFMYQPESRESLGGQKLIRKADFHLGQKVNTF 1282

Query: 128  FKIRCKPS-PISDA---PGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHT 183
            F+IRC+ S P +D     GA  R +T YASLDG+LG+ LP+PEK YRRLLMLQNV+VTH 
Sbjct: 1283 FRIRCRISDPANDKKHFSGADKRHVTMYASLDGSLGYILPVPEKTYRRLLMLQNVLVTHI 1342

Query: 184  SHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDIL 243
             H  GLNP+A+RTYK      GNP+RGIIDG LVW++L L   E++++ KKIG++  +I+
Sbjct: 1343 CHIAGLNPKAYRTYKSYIRTQGNPARGIIDGDLVWRYLYLPNNEKIDVAKKIGTRVQEII 1402

Query: 244  DELYDIEALSSHF 256
            ++L +I+  ++HF
Sbjct: 1403 EDLTEIDRQTAHF 1415



 Score = 68.6 bits (166), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 33/49 (67%), Positives = 39/49 (79%)

Query: 1   ETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGTLKLRFKKL 49
           E  V+E+L V+LG HGNRP+LLVR   EL IYQ +R+PKG LKLRFKKL
Sbjct: 825 EMQVREILMVALGHHGNRPMLLVRLDSELQIYQTYRYPKGHLKLRFKKL 873


>gi|270003792|gb|EFA00240.1| hypothetical protein TcasGA2_TC003068 [Tribolium castaneum]
          Length = 1392

 Score =  230 bits (586), Expect = 5e-58,   Method: Compositional matrix adjust.
 Identities = 103/193 (53%), Positives = 148/193 (76%), Gaps = 4/193 (2%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  V++ E++I++++MGF++SD +KN+VL+MYQPE+RES GG RL++K DFHLGQ VN+F
Sbjct: 1200 PCEVFSVEYMIDNTTMGFLVSDSEKNLVLYMYQPESRESLGGQRLLRKADFHLGQAVNSF 1259

Query: 128  FKIRCKPSPISDA----PGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHT 183
            F+I+CK   + +      GA  R +T YA+LDG LG+ +P+PEK YRRLLMLQNV+V+  
Sbjct: 1260 FRIKCKLGELGEDKKNLTGADKRHITMYATLDGGLGYIMPVPEKTYRRLLMLQNVLVSQG 1319

Query: 184  SHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDIL 243
            +H  GLNP+AFRTYK       NP+R +IDG LV+ +LQLS+ E+LE+ KKIG+K  ++L
Sbjct: 1320 AHIAGLNPKAFRTYKSWKKLQTNPARSVIDGELVYNYLQLSIPEKLEVSKKIGTKLEELL 1379

Query: 244  DELYDIEALSSHF 256
            D+L DI+ +++HF
Sbjct: 1380 DDLSDIQKITNHF 1392



 Score = 64.3 bits (155), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 27/47 (57%), Positives = 39/47 (82%)

Query: 4   VQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGTLKLRFKKLK 50
           V+E+L V+LG HG+RPLL+VR + +L IY+ FR P+G LK+RF+K+K
Sbjct: 803 VKEILVVALGNHGSRPLLMVRLERDLYIYEVFRFPRGNLKMRFRKIK 849


>gi|91078626|ref|XP_968117.1| PREDICTED: similar to cleavage and polyadenylation specificity factor
            cpsf [Tribolium castaneum]
          Length = 1413

 Score =  230 bits (586), Expect = 5e-58,   Method: Compositional matrix adjust.
 Identities = 103/193 (53%), Positives = 148/193 (76%), Gaps = 4/193 (2%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  V++ E++I++++MGF++SD +KN+VL+MYQPE+RES GG RL++K DFHLGQ VN+F
Sbjct: 1221 PCEVFSVEYMIDNTTMGFLVSDSEKNLVLYMYQPESRESLGGQRLLRKADFHLGQAVNSF 1280

Query: 128  FKIRCKPSPISDA----PGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHT 183
            F+I+CK   + +      GA  R +T YA+LDG LG+ +P+PEK YRRLLMLQNV+V+  
Sbjct: 1281 FRIKCKLGELGEDKKNLTGADKRHITMYATLDGGLGYIMPVPEKTYRRLLMLQNVLVSQG 1340

Query: 184  SHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDIL 243
            +H  GLNP+AFRTYK       NP+R +IDG LV+ +LQLS+ E+LE+ KKIG+K  ++L
Sbjct: 1341 AHIAGLNPKAFRTYKSWKKLQTNPARSVIDGELVYNYLQLSIPEKLEVSKKIGTKLEELL 1400

Query: 244  DELYDIEALSSHF 256
            D+L DI+ +++HF
Sbjct: 1401 DDLSDIQKITNHF 1413



 Score = 64.3 bits (155), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 27/47 (57%), Positives = 39/47 (82%)

Query: 4   VQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGTLKLRFKKLK 50
           V+E+L V+LG HG+RPLL+VR + +L IY+ FR P+G LK+RF+K+K
Sbjct: 824 VKEILVVALGNHGSRPLLMVRLERDLYIYEVFRFPRGNLKMRFRKIK 870


>gi|350413821|ref|XP_003490124.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Bombus impatiens]
          Length = 1417

 Score =  229 bits (584), Expect = 8e-58,   Method: Compositional matrix adjust.
 Identities = 104/193 (53%), Positives = 147/193 (76%), Gaps = 4/193 (2%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  VY  E++I+++++GF+++D + N+ LFMYQPE+RES GG +LI+K DFHLGQ VNTF
Sbjct: 1225 PAEVYTIEYLIDNTNLGFLVADGESNMALFMYQPESRESLGGQKLIRKADFHLGQKVNTF 1284

Query: 128  FKIRCKPS-PISDA---PGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHT 183
            F+I+C+ S P +D     GA  R +T YASLDG+LG+ LP+PEK YRRLLMLQNV+VTH 
Sbjct: 1285 FRIKCRVSDPANDKKHFSGADKRHVTMYASLDGSLGYILPVPEKTYRRLLMLQNVLVTHI 1344

Query: 184  SHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDIL 243
             H  GLNP+A+RTYK      GNP+RGIIDG LVW++L L   E++++ KKIG++  +I+
Sbjct: 1345 CHIAGLNPKAYRTYKSHIRTQGNPARGIIDGDLVWRYLYLPNNEKIDVAKKIGTRVQEII 1404

Query: 244  DELYDIEALSSHF 256
            ++L +I+  ++HF
Sbjct: 1405 EDLTEIDRQTAHF 1417



 Score = 70.1 bits (170), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 34/49 (69%), Positives = 40/49 (81%)

Query: 1   ETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGTLKLRFKKL 49
           E  V+E+L V+LG HGNRP+LLVR   EL IYQA+R+PKG LKLRFKKL
Sbjct: 827 EMQVREILMVALGHHGNRPMLLVRLDSELQIYQAYRYPKGHLKLRFKKL 875


>gi|340710064|ref|XP_003393618.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Bombus terrestris]
          Length = 1417

 Score =  229 bits (583), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 104/193 (53%), Positives = 146/193 (75%), Gaps = 4/193 (2%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  VY  E++I+++++GF+++D + N+ LFMYQPE+RES GG +LI+K DFHLGQ VNTF
Sbjct: 1225 PAEVYTIEYLIDNTNLGFLVADGESNMALFMYQPESRESLGGQKLIRKADFHLGQKVNTF 1284

Query: 128  FKIRCKPS-PISDA---PGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHT 183
            F+IRC+ S P +D     GA  R +T YASLDG+LG+ LP+PEK YRRLLMLQNV+VTH 
Sbjct: 1285 FRIRCRLSDPANDKKHFSGADKRHVTMYASLDGSLGYILPVPEKTYRRLLMLQNVLVTHI 1344

Query: 184  SHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDIL 243
             H  GLNP+A+RTYK      GNP+RGIIDG LVW++  L   E++++ KKIG++  +I+
Sbjct: 1345 CHIAGLNPKAYRTYKSHIRTQGNPARGIIDGDLVWRYFYLPNNEKIDVAKKIGTRVQEII 1404

Query: 244  DELYDIEALSSHF 256
            ++L +I+  ++HF
Sbjct: 1405 EDLTEIDRQTAHF 1417



 Score = 70.1 bits (170), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 34/49 (69%), Positives = 40/49 (81%)

Query: 1   ETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGTLKLRFKKL 49
           E  V+E+L V+LG HGNRP+LLVR   EL IYQA+R+PKG LKLRFKKL
Sbjct: 827 EMQVREILMVALGHHGNRPMLLVRLDSELQIYQAYRYPKGHLKLRFKKL 875


>gi|110750698|ref|XP_624382.2| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
            [Apis mellifera]
          Length = 1415

 Score =  227 bits (579), Expect = 4e-57,   Method: Compositional matrix adjust.
 Identities = 104/193 (53%), Positives = 145/193 (75%), Gaps = 4/193 (2%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  VY  E++I+++++GF+++D + N+ LFMYQPE+RES GG +LI+K DFHLGQ VNTF
Sbjct: 1223 PAEVYTIEYLIDNTNLGFLVADGESNIALFMYQPESRESLGGQKLIRKADFHLGQKVNTF 1282

Query: 128  FKIRCKPS-PISDA---PGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHT 183
            F+IRC+ S P +D      A  R +T YASLDG LG+ LP+PEK YRRLLMLQNV+VTH 
Sbjct: 1283 FRIRCRISDPANDKKHFSDADKRHVTMYASLDGNLGYILPVPEKTYRRLLMLQNVLVTHI 1342

Query: 184  SHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDIL 243
             H  GLNP+A+RTYK      GNP+RGIIDG LVW++L L   E++++ KKIG++  +I+
Sbjct: 1343 CHIAGLNPKAYRTYKSHIRTQGNPARGIIDGDLVWRYLYLPNNEKIDVAKKIGTRVQEII 1402

Query: 244  DELYDIEALSSHF 256
            ++L +I+  ++HF
Sbjct: 1403 EDLTEIDRQTAHF 1415



 Score = 69.7 bits (169), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 34/49 (69%), Positives = 40/49 (81%)

Query: 1   ETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGTLKLRFKKL 49
           E  V+E+L V+LG HGNRP+LLVR   EL IYQA+R+PKG LKLRFKKL
Sbjct: 825 EMQVREILMVALGHHGNRPMLLVRLDSELQIYQAYRYPKGHLKLRFKKL 873


>gi|307191845|gb|EFN75271.1| Cleavage and polyadenylation specificity factor subunit 1
            [Harpegnathos saltator]
          Length = 1214

 Score =  223 bits (569), Expect = 5e-56,   Method: Compositional matrix adjust.
 Identities = 105/218 (48%), Positives = 154/218 (70%), Gaps = 12/218 (5%)

Query: 44   LRFK-KLKVLFVSDRSKRANEQPGLPRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPE 102
            LRF+ K + L +  R  R       P  VY  E++I+++++GF+I+D + N+ LFMYQPE
Sbjct: 1004 LRFQEKCRTLSLVSRDFR-------PAEVYTIEYLIDNTNLGFLIADGESNLALFMYQPE 1056

Query: 103  ARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDA----PGARSRFLTWYASLDGAL 158
            +RES GG +LI+K DFHLGQ +NTFF+I+C+ + ++        A  + +T YASLDG+L
Sbjct: 1057 SRESLGGQKLIRKADFHLGQKINTFFRIKCRVTDVASDKKHFSDADKKHVTMYASLDGSL 1116

Query: 159  GFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVW 218
            G+ LP+PEK YRRLLMLQNV+VTH  H  GLNP+A+RTYK      GNP+RGIIDG LVW
Sbjct: 1117 GYVLPVPEKTYRRLLMLQNVLVTHICHIAGLNPKAYRTYKSYVRNQGNPARGIIDGDLVW 1176

Query: 219  KFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 256
            ++L L   E+ ++ KKIG++  +I++++ +I+  ++HF
Sbjct: 1177 RYLSLPNNEKADVAKKIGTRVQEIIEDITEIDRQTAHF 1214



 Score = 68.6 bits (166), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 35/68 (51%), Positives = 46/68 (67%)

Query: 1   ETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGTLKLRFKKLKVLFVSDRSKR 60
           E  V+E+L V+LG HGNRP+LLVR   EL IYQA+++PKG LKLRFKKL    +     R
Sbjct: 623 ELQVREVLMVALGHHGNRPMLLVRLDSELQIYQAYKYPKGHLKLRFKKLDHGIIPGHLSR 682

Query: 61  ANEQPGLP 68
             ++  +P
Sbjct: 683 KPKEEDVP 690


>gi|345482082|ref|XP_001607052.2| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Nasonia vitripennis]
          Length = 1415

 Score =  219 bits (559), Expect = 6e-55,   Method: Compositional matrix adjust.
 Identities = 106/218 (48%), Positives = 150/218 (68%), Gaps = 12/218 (5%)

Query: 44   LRFK-KLKVLFVSDRSKRANEQPGLPRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPE 102
            LRF+ + K L +  R  R  E       +YA E+ I+++ +GF+++D + N+ +F YQPE
Sbjct: 1205 LRFQPEYKTLSLVSRDFRTTE-------IYAIEYFIQNNELGFIVADGESNISIFSYQPE 1257

Query: 103  ARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPS----PISDAPGARSRFLTWYASLDGAL 158
            + +S GG +LI+K D HLGQ +NTFF+I+CK +    P     GA  R +T YA+LDG+L
Sbjct: 1258 SSQSLGGQKLIRKADIHLGQKINTFFRIKCKTTDSANPTKQFSGADKRHVTMYATLDGSL 1317

Query: 159  GFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVW 218
            G+ LP+PEK YRRLLMLQNV+V+H  H  GLNP+AFRTYK      GNP+RGIIDG LV 
Sbjct: 1318 GYILPVPEKTYRRLLMLQNVLVSHIYHIAGLNPKAFRTYKSCVRMQGNPARGIIDGDLVR 1377

Query: 219  KFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 256
            K+L L + E++EI KKIG+   +I+D++++I   +SHF
Sbjct: 1378 KYLDLPVNEKIEIAKKIGTGAQEIMDDMHEIYKQTSHF 1415



 Score = 68.9 bits (167), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 34/68 (50%), Positives = 46/68 (67%)

Query: 1   ETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGTLKLRFKKLKVLFVSDRSKR 60
           E  V+E+  V+LG HGNRP+LLVR   EL IYQ +R+PKG LKLRFKK+   F+   S+ 
Sbjct: 825 EVQVREIAVVALGHHGNRPMLLVRLDSELQIYQVYRYPKGHLKLRFKKIDHNFIVGFSRI 884

Query: 61  ANEQPGLP 68
             ++  +P
Sbjct: 885 GPKEEDMP 892


>gi|307190910|gb|EFN74734.1| Cleavage and polyadenylation specificity factor subunit 1 [Camponotus
            floridanus]
          Length = 1418

 Score =  219 bits (558), Expect = 8e-55,   Method: Compositional matrix adjust.
 Identities = 102/194 (52%), Positives = 145/194 (74%), Gaps = 5/194 (2%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  VY  E++I+++++GF ++D + N+ LFMYQPE+RES GG +LI+K DFHLGQ VNTF
Sbjct: 1225 PAEVYTIEYLIDNTNLGFFLADGESNLALFMYQPESRESLGGQKLIRKADFHLGQKVNTF 1284

Query: 128  FKIRCKPS-PISDA---PGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHT 183
            F+IRC+ S P +D     GA  R +T YA+LDG+LG+ LP+PEK YRRLLMLQNV+VTH 
Sbjct: 1285 FRIRCRVSDPANDKKQFSGADKRHVTMYATLDGSLGYILPVPEKTYRRLLMLQNVLVTHI 1344

Query: 184  SHTGGLNPRAFR-TYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
             H  GLNP+++R TYK      GNP+RGIIDG LVW++L L   E+ ++ KKIG++  +I
Sbjct: 1345 CHIAGLNPKSYRQTYKSYIRNQGNPARGIIDGDLVWRYLFLPNNEKTDVAKKIGTRVQEI 1404

Query: 243  LDELYDIEALSSHF 256
            ++++ +I+  ++HF
Sbjct: 1405 IEDITEIDRQTAHF 1418



 Score = 69.3 bits (168), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 33/50 (66%), Positives = 41/50 (82%)

Query: 1   ETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGTLKLRFKKLK 50
           E  V+E+L V+LG HGNRP+LLVR   EL IYQA+++PKG LKLRFKKL+
Sbjct: 826 EMQVREILMVALGHHGNRPMLLVRLDSELQIYQAYKYPKGYLKLRFKKLE 875


>gi|242021233|ref|XP_002431050.1| Cleavage and polyadenylation specificity factor 160 kDa subunit,
            putative [Pediculus humanus corporis]
 gi|212516279|gb|EEB18312.1| Cleavage and polyadenylation specificity factor 160 kDa subunit,
            putative [Pediculus humanus corporis]
          Length = 1409

 Score =  213 bits (542), Expect = 6e-53,   Method: Compositional matrix adjust.
 Identities = 95/193 (49%), Positives = 141/193 (73%), Gaps = 4/193 (2%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  VYA E +++++ MGF+ISD + N++++MY+PE R+S GG +L++K DFHLGQH+N++
Sbjct: 1217 PCEVYAIELLLDNTQMGFLISDVEMNIIMYMYKPEDRDSVGGQKLLRKADFHLGQHINSW 1276

Query: 128  FKIRCK---PSPISDAP-GARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHT 183
            F+IRC+    +   D P GA  R ++ +A+LDGALG+ LP+PEK YRRL MLQN++V H 
Sbjct: 1277 FRIRCRLGDQAENYDFPIGAEKRHISMFATLDGALGYLLPIPEKTYRRLQMLQNILVYHI 1336

Query: 184  SHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDIL 243
             H  GLNP+AFR YK      GNP + I+DG L+W +L L++ E+ ++ KK+GSK +DI+
Sbjct: 1337 PHLAGLNPKAFRIYKSGRKLLGNPCKRIVDGELIWMYLSLTVMEKQDVAKKMGSKMDDII 1396

Query: 244  DELYDIEALSSHF 256
            +++  IE LS HF
Sbjct: 1397 EDIAVIERLSGHF 1409



 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 29/59 (49%), Positives = 44/59 (74%), Gaps = 2/59 (3%)

Query: 4   VQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKG-TLKLRFKKL-KVLFVSDRSKR 60
           + ELL VSLG  G RP+LL+RT+++L+IYQAF+  KG  LK+RF++L + L + +R  +
Sbjct: 821 IHELLVVSLGHLGRRPILLLRTENDLMIYQAFKFAKGPNLKIRFRRLPQTLILKERKAK 879


>gi|380014171|ref|XP_003691113.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Apis florea]
          Length = 1583

 Score =  213 bits (541), Expect = 8e-53,   Method: Compositional matrix adjust.
 Identities = 98/173 (56%), Positives = 129/173 (74%), Gaps = 4/173 (2%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  VY  E++I+++++GF+++D + N+ LFMYQPE+RES GG +LI+K DFHLGQ VNTF
Sbjct: 1223 PAEVYTIEYLIDNTNLGFLVADGESNIALFMYQPESRESLGGQKLIRKADFHLGQKVNTF 1282

Query: 128  FKIRCKPS-PISDA---PGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHT 183
            F+IRC+ S P +D      A  R +T YASLDG LG+ LP+PEK YRRLLMLQNV+VTH 
Sbjct: 1283 FRIRCRISDPANDKKHFSDADKRHVTMYASLDGNLGYILPVPEKTYRRLLMLQNVLVTHI 1342

Query: 184  SHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 236
             H  GLNP+A+RTYK      GNP+RGIIDG LVW++L L   E++++ KKI 
Sbjct: 1343 CHIAGLNPKAYRTYKSHIRTQGNPARGIIDGDLVWRYLYLPNNEKIDVAKKIA 1395



 Score = 70.1 bits (170), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 34/49 (69%), Positives = 40/49 (81%)

Query: 1   ETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGTLKLRFKKL 49
           E  V+E+L V+LG HGNRP+LLVR   EL IYQA+R+PKG LKLRFKKL
Sbjct: 825 EMQVREILMVALGHHGNRPMLLVRLDSELQIYQAYRYPKGHLKLRFKKL 873


>gi|322792443|gb|EFZ16427.1| hypothetical protein SINV_15375 [Solenopsis invicta]
          Length = 1532

 Score =  209 bits (531), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 98/176 (55%), Positives = 133/176 (75%), Gaps = 5/176 (2%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  VY  E++I+++++GF+++D + N+ LFMYQPE+RES GG +LI+K DFHLGQ VNTF
Sbjct: 1192 PAEVYTIEYLIDNTNLGFIVADGESNLALFMYQPESRESLGGQKLIRKADFHLGQKVNTF 1251

Query: 128  FKIRCKPS-PISDA---PGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHT 183
            F+IRC+ + P +D     GA  R +T YASLDG+LG+ LP+PEK YRRLLMLQNV+VTH 
Sbjct: 1252 FRIRCRVTDPANDKKQFSGADKRHVTMYASLDGSLGYILPVPEKTYRRLLMLQNVLVTHI 1311

Query: 184  SHTGGLNPRAFR-TYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 238
             H  GLNP+++R TYK      GNP+RGIIDG LVW++L L   E+ ++ KKIG++
Sbjct: 1312 CHIAGLNPKSYRHTYKSYIRNQGNPARGIIDGDLVWRYLFLPNNEKADLAKKIGTR 1367



 Score = 69.3 bits (168), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 35/68 (51%), Positives = 45/68 (66%)

Query: 4   VQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGTLKLRFKKLKVLFVSDRSKRANE 63
           V+E+L V+LG HGNRP+LLVR   EL IYQ +R+PKG LKLRFKKL    +  R     +
Sbjct: 796 VREILMVALGHHGNRPMLLVRLDSELQIYQVYRYPKGYLKLRFKKLDHGIIPGRLSPRPK 855

Query: 64  QPGLPRGV 71
           +  +PR  
Sbjct: 856 EEDVPRNT 863


>gi|332018184|gb|EGI58789.1| Cleavage and polyadenylation specificity factor subunit 1 [Acromyrmex
            echinatior]
          Length = 1412

 Score =  209 bits (531), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 98/176 (55%), Positives = 132/176 (75%), Gaps = 5/176 (2%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  VY  E++I++S++GF+++D + N+ LFMYQPE+RES GG +LI+K DFHLGQ +NTF
Sbjct: 1219 PAEVYTIEYLIDNSNLGFIVADGESNLALFMYQPESRESLGGQKLIRKADFHLGQKINTF 1278

Query: 128  FKIRCK-PSPISDA---PGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHT 183
            F+I+C+   P +D     GA  R +T YASLDG+LG+ LP+PEK YRRLLMLQNV+VTH 
Sbjct: 1279 FRIKCRITDPANDKKQFSGADKRHVTMYASLDGSLGYILPVPEKTYRRLLMLQNVLVTHI 1338

Query: 184  SHTGGLNPRAFR-TYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 238
             H  GLNP+A+R TYK      GNP+RGIIDG LVW++L L   E+ ++ KKIG++
Sbjct: 1339 CHIAGLNPKAYRHTYKSYVRNQGNPARGIIDGDLVWRYLFLPNNEKADLAKKIGTR 1394



 Score = 69.7 bits (169), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 35/67 (52%), Positives = 46/67 (68%)

Query: 4   VQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGTLKLRFKKLKVLFVSDRSKRANE 63
           V+E+L V+LG HGNRP+LLVR   +L IYQA+R+PKG LKLRFKKL    +  R     +
Sbjct: 827 VREILMVALGHHGNRPMLLVRLDSDLQIYQAYRYPKGYLKLRFKKLDHGIIPGRLSPRPK 886

Query: 64  QPGLPRG 70
           +  +PR 
Sbjct: 887 EEDVPRN 893


>gi|427795803|gb|JAA63353.1| Putative mrna cleavage and polyadenylation factor ii complex
           subunit cft1 cpsf subunit, partial [Rhipicephalus
           pulchellus]
          Length = 726

 Score =  201 bits (512), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 92/194 (47%), Positives = 134/194 (69%), Gaps = 5/194 (2%)

Query: 68  PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
           P  VYA EF I+++ M F+++D ++N++L+MYQPE+RES GG RL+++ DFH+G  V + 
Sbjct: 533 PLEVYAVEFFIDNTQMSFLVTDAERNLLLYMYQPESRESCGGQRLLRRGDFHVGSPVVSM 592

Query: 128 FKIRCKPSPISD-----APGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
           F+I+C+   I+      A     R +T  A+LDG+L + LP+PEK YRRLLMLQNV+VT+
Sbjct: 593 FRIKCRMGDIAKYDRRAASIVDGRHITMMATLDGSLAYVLPVPEKTYRRLLMLQNVLVTN 652

Query: 183 TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
             H  GLNP+A+R Y  +  + GNP + I+DG L+WKF+ LS  ER E+ KKIG+    I
Sbjct: 653 IPHYAGLNPKAYRMYYSQRRFLGNPHKNILDGELIWKFMHLSFMERSELSKKIGTTVTQI 712

Query: 243 LDELYDIEALSSHF 256
            D+L +IE  ++HF
Sbjct: 713 TDDLLEIETYTAHF 726



 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 26/51 (50%), Positives = 34/51 (66%), Gaps = 4/51 (7%)

Query: 3   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAF----RHPKGTLKLRFKKL 49
           +V E+L V LG+  +RPLLL R   +LLIY+AF       +G LKLRFKK+
Sbjct: 131 VVHEILVVGLGIRHSRPLLLARVDEDLLIYEAFPFYETQREGHLKLRFKKM 181


>gi|427780291|gb|JAA55597.1| Putative mrna cleavage and polyadenylation factor ii complex subunit
            cft1 cpsf subunit [Rhipicephalus pulchellus]
          Length = 1237

 Score =  201 bits (512), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 92/194 (47%), Positives = 134/194 (69%), Gaps = 5/194 (2%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  VYA EF I+++ M F+++D ++N++L+MYQPE+RES GG RL+++ DFH+G  V + 
Sbjct: 1044 PLEVYAVEFFIDNTQMSFLVTDAERNLLLYMYQPESRESCGGQRLLRRGDFHVGSPVVSM 1103

Query: 128  FKIRCKPSPISD-----APGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
            F+I+C+   I+      A     R +T  A+LDG+L + LP+PEK YRRLLMLQNV+VT+
Sbjct: 1104 FRIKCRMGDIAKYDRRAASIVDGRHITMMATLDGSLAYVLPVPEKTYRRLLMLQNVLVTN 1163

Query: 183  TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
              H  GLNP+A+R Y  +  + GNP + I+DG L+WKF+ LS  ER E+ KKIG+    I
Sbjct: 1164 IPHYAGLNPKAYRMYYSQRRFLGNPHKNILDGELIWKFMHLSFMERSELSKKIGTTVTQI 1223

Query: 243  LDELYDIEALSSHF 256
             D+L +IE  ++HF
Sbjct: 1224 TDDLLEIETYTAHF 1237



 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 26/51 (50%), Positives = 34/51 (66%), Gaps = 4/51 (7%)

Query: 3   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAF----RHPKGTLKLRFKKL 49
           +V E+L V LG+  +RPLLL R   +LLIY+AF       +G LKLRFKK+
Sbjct: 642 VVHEILVVGLGIRHSRPLLLARVDEDLLIYEAFPFYETQREGHLKLRFKKM 692


>gi|405977622|gb|EKC42064.1| Cleavage and polyadenylation specificity factor subunit 1
            [Crassostrea gigas]
          Length = 1369

 Score =  198 bits (504), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 96/211 (45%), Positives = 143/211 (67%), Gaps = 11/211 (5%)

Query: 50   KVLFVSDRSKRANEQPGLPRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGG 109
            KVL +  R  R       P  VY  +F+I+++ +  ++SD+ KN+V++ YQPEARES+GG
Sbjct: 1166 KVLSIVSRDPR-------PLEVYTADFLIDNTQLCCLVSDRMKNLVVYSYQPEARESHGG 1218

Query: 110  HRLIKKTDFHLGQHVNTFFKIRCK---PSPISDAPGA-RSRFLTWYASLDGALGFFLPLP 165
             RLI+K DF+ G +V++ F++RCK   PS      GA   R +T++A+LDG+LGF LPL 
Sbjct: 1219 QRLIRKADFNAGSNVSSMFRVRCKLYDPSSDKRMTGAPEKRHITYFATLDGSLGFVLPLS 1278

Query: 166  EKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSL 225
            EK YRRL MLQN +VTH  H  GLNPR++R   G      NP + I+DG L+WK+  LS+
Sbjct: 1279 EKVYRRLFMLQNALVTHIPHVAGLNPRSYRHVIGTFPELRNPQKNILDGELLWKYTNLSI 1338

Query: 226  GERLEICKKIGSKHNDILDELYDIEALSSHF 256
             E++EI K++G+ ++ I+D+L +I+ L++HF
Sbjct: 1339 MEKIEIAKRLGTSNDQIMDDLMEIDRLTAHF 1369



 Score = 47.8 bits (112), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 24/51 (47%), Positives = 35/51 (68%), Gaps = 4/51 (7%)

Query: 4   VQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGT----LKLRFKKLK 50
           ++ELL V LG   +RP LL R + +L IY+AF +P+ +    LKLRFKK++
Sbjct: 776 LKELLMVGLGYKDSRPHLLARVEDDLYIYEAFSYPQSSIDNHLKLRFKKIQ 826


>gi|157110889|ref|XP_001651294.1| cleavage and polyadenylation specificity factor cpsf [Aedes aegypti]
 gi|108883895|gb|EAT48120.1| AAEL000832-PA [Aedes aegypti]
          Length = 1417

 Score =  197 bits (500), Expect = 5e-48,   Method: Compositional matrix adjust.
 Identities = 85/191 (44%), Positives = 134/191 (70%), Gaps = 2/191 (1%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  V+  E+V+++ ++GF++SD+  N++ +MYQPE+RES GG RL++K D+H+GQ +N+ 
Sbjct: 1227 PLNVFQIEYVVDNHNLGFLVSDEQCNIITYMYQPESRESFGGQRLLRKCDYHVGQKINSM 1286

Query: 128  FKIRCKPSPISDAPGA--RSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSH 185
            F+++C    +     +    +  T++A+LDG +G+ LPLPEK YRRL MLQNV++TH+ H
Sbjct: 1287 FRVQCDFHEMDYKRNSNYECKHTTYFATLDGGIGYVLPLPEKTYRRLFMLQNVLMTHSPH 1346

Query: 186  TGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDE 245
              GLNP+AFRT K       NP+R ++DG L+W FL L   E+LE+ KKIG++ +DI  +
Sbjct: 1347 LCGLNPKAFRTIKTVKKLPINPARCVVDGDLIWTFLTLPANEKLEVAKKIGTRIDDICAD 1406

Query: 246  LYDIEALSSHF 256
            L +IE+++  F
Sbjct: 1407 LMEIESVTHVF 1417



 Score = 59.3 bits (142), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 38/129 (29%), Positives = 66/129 (51%), Gaps = 21/129 (16%)

Query: 5   QELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGTLKLRFKKL------KVLFVSDRS 58
           +E+L V+LG HG RP+L VR +++LL+Y+ +R+ KG LKLRF+++       +  ++ R 
Sbjct: 821 KEILMVALGHHGTRPMLFVRLENDLLVYRVYRYSKGHLKLRFRRVPSGVTGPIFKIAPRQ 880

Query: 59  KRANEQPGLPRGVYACEFVIESSSM-----------GFMISDKDKNVVLFMYQPEARESN 107
               +Q G     ++ + + E+ SM           G  +  +   ++L   + E R   
Sbjct: 881 SAPTDQEGEKPDEHSTKIMYENISMIRYFNNVNGYNGVAVCGEKPYIMLLTSRGELR--- 937

Query: 108 GGHRLIKKT 116
             HRL  KT
Sbjct: 938 -AHRLYAKT 945


>gi|260835071|ref|XP_002612533.1| hypothetical protein BRAFLDRAFT_120973 [Branchiostoma floridae]
 gi|229297910|gb|EEN68542.1| hypothetical protein BRAFLDRAFT_120973 [Branchiostoma floridae]
          Length = 1003

 Score =  197 bits (500), Expect = 5e-48,   Method: Compositional matrix adjust.
 Identities = 95/194 (48%), Positives = 128/194 (65%), Gaps = 5/194 (2%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P   Y  EF ++++ +GF++SD  KN +L+ YQPEARES GG RL+++ DF++G HVNTF
Sbjct: 810  PLETYCVEFFVDNAQIGFLVSDAQKNFLLYSYQPEARESYGGQRLVRRADFNVGSHVNTF 869

Query: 128  FKIRCKPSPIS-----DAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
            F++RCK    S     DA     R +T +A+LDG LG  LP+ EK YRRLLMLQN ++TH
Sbjct: 870  FRVRCKIMDPSGERRRDADTVAKRHVTMFATLDGGLGALLPMAEKTYRRLLMLQNTLMTH 929

Query: 183  TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
                 GLNP+AFR  K       N  R I+DG L+WKFL LS+ ER E+ +KIG+    I
Sbjct: 930  MPFPAGLNPKAFRMLKHNHRSLINACRNILDGELLWKFLHLSVVERSELARKIGTSPETI 989

Query: 243  LDELYDIEALSSHF 256
             ++L DI+ LS+HF
Sbjct: 990  TEDLMDIDRLSAHF 1003



 Score = 45.4 bits (106), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 31/70 (44%), Positives = 45/70 (64%), Gaps = 9/70 (12%)

Query: 4   VQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRH---PKGT-LKLRFKKLK-VLFVSDR- 57
           V+E+L V LG  G+RP LL R   +LLIY+AF +   P  T LK+RFKK++  L + +R 
Sbjct: 436 VKEILMVGLGHKGSRPHLLARVDEDLLIYEAFPYHLSPSYTMLKIRFKKVQHNLILRERK 495

Query: 58  ---SKRANEQ 64
              +K+A +Q
Sbjct: 496 GGKTKKAGDQ 505


>gi|158287218|ref|XP_309311.4| AGAP011340-PA [Anopheles gambiae str. PEST]
 gi|157019545|gb|EAA05261.4| AGAP011340-PA [Anopheles gambiae str. PEST]
          Length = 1434

 Score =  196 bits (498), Expect = 9e-48,   Method: Compositional matrix adjust.
 Identities = 87/189 (46%), Positives = 133/189 (70%), Gaps = 3/189 (1%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  V+  E+V++++++GF++SD   N++ +MYQPE+RES GG RL++K+D+HLGQ VN  
Sbjct: 1243 PLNVFQVEYVVDNANLGFLVSDDQCNLITYMYQPESRESFGGQRLLRKSDYHLGQQVNCM 1302

Query: 128  FKIRC---KPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTS 184
            F+++C   +   +       ++  T++A+LDG +GF LPLPEK YRRL MLQNV++TH+ 
Sbjct: 1303 FRVQCDFHETDVMKRTLNYDNKHTTFFATLDGGIGFVLPLPEKTYRRLFMLQNVLLTHSP 1362

Query: 185  HTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILD 244
            HT GLNP+A+RT K       NPSR ++DG LVW FL+L   E+ E+ KKIG++  +I  
Sbjct: 1363 HTCGLNPKAYRTIKQTRKLPINPSRCVVDGDLVWSFLELPANEKHEVAKKIGTRIEEICA 1422

Query: 245  ELYDIEALS 253
            +L +IE ++
Sbjct: 1423 DLMEIEHVT 1431



 Score = 61.6 bits (148), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 27/45 (60%), Positives = 38/45 (84%)

Query: 5   QELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGTLKLRFKKL 49
           +E+L V+LG +G+RPLL +R +H+LLIY+ FR+ KG LKLRFK+L
Sbjct: 836 KEILMVALGSYGSRPLLFIRLEHDLLIYRVFRYSKGHLKLRFKRL 880


>gi|312380158|gb|EFR26239.1| hypothetical protein AND_07834 [Anopheles darlingi]
          Length = 1503

 Score =  195 bits (495), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 87/186 (46%), Positives = 129/186 (69%), Gaps = 3/186 (1%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  VY  E+V++++++GF+++D   N++ +MYQPE+RES GG RL++K D+HLGQ VN  
Sbjct: 1264 PLNVYQVEYVVDNTNLGFLVADDQANLITYMYQPESRESFGGQRLLRKGDYHLGQRVNAM 1323

Query: 128  FKIRC---KPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTS 184
            F+++C   +   +       ++  T++A+LDG  GF LPLPEK YRRL MLQNV++TH+ 
Sbjct: 1324 FRVQCDFHESDVMRRTLNYDNKHTTFFATLDGGFGFVLPLPEKTYRRLFMLQNVLLTHSP 1383

Query: 185  HTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILD 244
            HT GLNP+A+RT K       NPSR ++DG LVW FL+L   E+ E+ KKIG++  +I  
Sbjct: 1384 HTCGLNPKAYRTIKQSRALPINPSRCVVDGDLVWSFLELPANEKQEVAKKIGTRIEEICA 1443

Query: 245  ELYDIE 250
            +L +IE
Sbjct: 1444 DLMEIE 1449



 Score = 58.2 bits (139), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 25/45 (55%), Positives = 37/45 (82%)

Query: 5   QELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGTLKLRFKKL 49
           +E+L V+LG +G+RP+L +R + +LLIY+ FR+ KG LKLRFK+L
Sbjct: 845 KEILMVALGSYGSRPILFIRLEQDLLIYRVFRYAKGHLKLRFKRL 889


>gi|195056749|ref|XP_001995154.1| GH22991 [Drosophila grimshawi]
 gi|193899360|gb|EDV98226.1| GH22991 [Drosophila grimshawi]
          Length = 1426

 Score =  195 bits (495), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 90/191 (47%), Positives = 133/191 (69%), Gaps = 2/191 (1%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  V+  EF++++S++GF+++D ++N++++MYQPEARES GG +L++K D+HLGQ VNT 
Sbjct: 1236 PMEVFGIEFMVDNSNLGFLVTDAERNLIVYMYQPEARESLGGQKLLRKADYHLGQVVNTM 1295

Query: 128  FKIRCKPSPISDAPG--ARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSH 185
            F+++C    +         ++ L  Y SLDGALG+ LPLPEK YRR LMLQNV++++  H
Sbjct: 1296 FRVQCHQRGLHQRQPFLYENKHLVIYGSLDGALGYCLPLPEKVYRRFLMLQNVLLSYQDH 1355

Query: 186  TGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDE 245
              GLNP+ +RT K       NPSR IIDG L+W F  L+  ER E+ KKIG++  +IL +
Sbjct: 1356 LCGLNPKEYRTIKSVKKLGINPSRCIIDGDLIWSFRMLAHSERNEVAKKIGTRTEEILAD 1415

Query: 246  LYDIEALSSHF 256
            L +IE +S+ F
Sbjct: 1416 LLEIERISAVF 1426



 Score = 64.3 bits (155), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 32/47 (68%), Positives = 37/47 (78%)

Query: 6   ELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGTLKLRFKKLKVL 52
           EL  V LG HG RPLLLVRT+ ELLIYQ FR+ KG LK+RF+KL+ L
Sbjct: 836 ELCLVGLGQHGERPLLLVRTRLELLIYQVFRYAKGHLKIRFRKLEQL 882


>gi|45552619|ref|NP_995833.1| cleavage and polyadenylation specificity factor 160, isoform A
            [Drosophila melanogaster]
 gi|18203551|sp|Q9V726.1|CPSF1_DROME RecName: Full=Cleavage and polyadenylation specificity factor subunit
            1; AltName: Full=Cleavage and polyadenylation specificity
            factor 160 kDa subunit; Short=CPSF 160 kDa subunit;
            Short=dCPSF 160
 gi|7303176|gb|AAF58240.1| cleavage and polyadenylation specificity factor 160, isoform A
            [Drosophila melanogaster]
          Length = 1455

 Score =  193 bits (490), Expect = 7e-47,   Method: Compositional matrix adjust.
 Identities = 91/195 (46%), Positives = 133/195 (68%), Gaps = 10/195 (5%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  VY  EF++++S++GF+++D ++N++++MYQPEARES GG +L++K D+HLGQ VNT 
Sbjct: 1265 PLEVYGIEFMVDNSNLGFLVTDAERNIIVYMYQPEARESLGGQKLLRKADYHLGQVVNTM 1324

Query: 128  FKIRCKPSPISDAPGARSRFL------TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVT 181
            F+++C    +      R  FL        Y +LDGALG+ LPLPEK YRR LMLQNV+++
Sbjct: 1325 FRVQCHQKGLHQ----RQPFLYENKHFVVYGTLDGALGYCLPLPEKVYRRFLMLQNVLLS 1380

Query: 182  HTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHND 241
            +  H  GLNP+ +RT K       NPSR IIDG L+W +  ++  ER E+ KKIG++  +
Sbjct: 1381 YQEHLCGLNPKEYRTLKSSKKQGINPSRCIIDGDLIWSYRLMANSERNEVAKKIGTRTEE 1440

Query: 242  ILDELYDIEALSSHF 256
            IL +L +IE L+S F
Sbjct: 1441 ILGDLLEIERLASVF 1455



 Score = 66.6 bits (161), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 32/52 (61%), Positives = 41/52 (78%)

Query: 6   ELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGTLKLRFKKLKVLFVSDR 57
           EL  + LGL+G RPLLLVRT+ ELLIYQ FR+PKG LK+RF+K+  L + D+
Sbjct: 864 ELSVIGLGLNGERPLLLVRTRVELLIYQVFRYPKGHLKIRFRKMDQLNLLDQ 915


>gi|24653655|ref|NP_725397.1| cleavage and polyadenylation specificity factor 160, isoform B
            [Drosophila melanogaster]
 gi|15292103|gb|AAK93320.1| LD38533p [Drosophila melanogaster]
 gi|21627189|gb|AAM68553.1| cleavage and polyadenylation specificity factor 160, isoform B
            [Drosophila melanogaster]
          Length = 1420

 Score =  192 bits (489), Expect = 8e-47,   Method: Compositional matrix adjust.
 Identities = 91/195 (46%), Positives = 133/195 (68%), Gaps = 10/195 (5%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  VY  EF++++S++GF+++D ++N++++MYQPEARES GG +L++K D+HLGQ VNT 
Sbjct: 1230 PLEVYGIEFMVDNSNLGFLVTDAERNIIVYMYQPEARESLGGQKLLRKADYHLGQVVNTM 1289

Query: 128  FKIRCKPSPISDAPGARSRFL------TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVT 181
            F+++C    +      R  FL        Y +LDGALG+ LPLPEK YRR LMLQNV+++
Sbjct: 1290 FRVQCHQKGLHQ----RQPFLYENKHFVVYGTLDGALGYCLPLPEKVYRRFLMLQNVLLS 1345

Query: 182  HTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHND 241
            +  H  GLNP+ +RT K       NPSR IIDG L+W +  ++  ER E+ KKIG++  +
Sbjct: 1346 YQEHLCGLNPKEYRTLKSSKKQGINPSRCIIDGDLIWSYRLMANSERNEVAKKIGTRTEE 1405

Query: 242  ILDELYDIEALSSHF 256
            IL +L +IE L+S F
Sbjct: 1406 ILGDLLEIERLASVF 1420



 Score = 66.6 bits (161), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 32/52 (61%), Positives = 41/52 (78%)

Query: 6   ELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGTLKLRFKKLKVLFVSDR 57
           EL  + LGL+G RPLLLVRT+ ELLIYQ FR+PKG LK+RF+K+  L + D+
Sbjct: 829 ELSVIGLGLNGERPLLLVRTRVELLIYQVFRYPKGHLKIRFRKMDQLNLLDQ 880


>gi|194756960|ref|XP_001960738.1| GF11349 [Drosophila ananassae]
 gi|190622036|gb|EDV37560.1| GF11349 [Drosophila ananassae]
          Length = 1455

 Score =  192 bits (489), Expect = 9e-47,   Method: Compositional matrix adjust.
 Identities = 92/195 (47%), Positives = 133/195 (68%), Gaps = 10/195 (5%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  VY  EF++++S++GF+++D ++N++++MYQPEARES GG +L++K D+HLGQ VNT 
Sbjct: 1265 PLEVYGIEFMVDNSNLGFLVTDAERNLIVYMYQPEARESLGGQKLLRKADYHLGQVVNTM 1324

Query: 128  FKIRCKPSPISDAPGARSRFL------TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVT 181
            F+++C    +      R  FL        Y +LDGALG+ LPLPEK YRR LMLQNV+++
Sbjct: 1325 FRVQCHQKGLHQ----RQPFLYENKHFVVYGTLDGALGYCLPLPEKLYRRFLMLQNVLLS 1380

Query: 182  HTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHND 241
            +  H  GLNP+ +RT K       NPSR IIDG L+W +  L+  ER E+ KKIG++  +
Sbjct: 1381 YQEHLCGLNPKEYRTIKAVKKQGINPSRCIIDGDLIWSYRLLANSERNEVAKKIGTRTEE 1440

Query: 242  ILDELYDIEALSSHF 256
            IL +L +IE L+S F
Sbjct: 1441 ILSDLLEIERLASVF 1455



 Score = 67.4 bits (163), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 33/51 (64%), Positives = 41/51 (80%)

Query: 6   ELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGTLKLRFKKLKVLFVSD 56
           EL  + LGL+G RPLLLVRT+ ELLIYQ FR+PKG LK+RF+KL+ L + D
Sbjct: 864 ELTVLGLGLNGERPLLLVRTRVELLIYQVFRYPKGHLKIRFRKLEQLNLMD 914


>gi|195122290|ref|XP_002005645.1| GI18959 [Drosophila mojavensis]
 gi|193910713|gb|EDW09580.1| GI18959 [Drosophila mojavensis]
          Length = 1431

 Score =  192 bits (488), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 91/195 (46%), Positives = 133/195 (68%), Gaps = 10/195 (5%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  V+  EF++++S++GF+++D ++N++++MYQPEARES GG +L++K D+HLGQ VNT 
Sbjct: 1241 PLEVFGIEFMVDNSNLGFLVTDAERNIIVYMYQPEARESLGGQKLLRKADYHLGQVVNTM 1300

Query: 128  FKIRCKPSPISDAPGARSRFL------TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVT 181
            F+++C    +      R  FL        Y +LDGALG+ LPLPEK YRR LMLQNV+++
Sbjct: 1301 FRVQCHQRGLHQ----RQPFLYENKHFVIYGTLDGALGYCLPLPEKVYRRFLMLQNVLLS 1356

Query: 182  HTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHND 241
            +  H  GLNP+ +RT K       NPSR IIDG L+W +  L+  ER E+ KKIG++  +
Sbjct: 1357 YQDHLCGLNPKEYRTIKTVKKMGINPSRCIIDGDLIWSYRMLAHSERSEVAKKIGTRTEE 1416

Query: 242  ILDELYDIEALSSHF 256
            IL +L +IE LS+ F
Sbjct: 1417 ILADLLEIERLSAIF 1431



 Score = 65.9 bits (159), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 33/52 (63%), Positives = 41/52 (78%)

Query: 6   ELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGTLKLRFKKLKVLFVSDR 57
           EL  V LG HG+RPLLLVRT+ ELLIYQ FR+ KG LK+RF+KL+ L + D+
Sbjct: 840 ELSLVGLGQHGDRPLLLVRTRLELLIYQVFRYAKGHLKIRFRKLEQLHLLDQ 891


>gi|9794908|gb|AAF98388.1| cleavage and polyadenylation specificity factor [Drosophila
           melanogaster]
          Length = 813

 Score =  192 bits (488), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 88/191 (46%), Positives = 132/191 (69%), Gaps = 2/191 (1%)

Query: 68  PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
           P  VY  EF++++S++GF+++D ++N++++MYQPEARES GG +L++K D+HLGQ VNT 
Sbjct: 623 PLEVYGIEFMVDNSNLGFLVTDAERNIIVYMYQPEARESLGGQKLLRKADYHLGQVVNTM 682

Query: 128 FKIRCKPSPISDAPG--ARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSH 185
           F+++C    +         ++    Y +LDGALG+ LPLPEK YRR LMLQNV++++  H
Sbjct: 683 FRVQCHQKGLHQRQPFLYENKHFVVYGTLDGALGYCLPLPEKVYRRFLMLQNVLLSYQEH 742

Query: 186 TGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDE 245
             GLNP+ +RT K       NPSR IIDG L+W +  ++  ER E+ KKIG++  +IL +
Sbjct: 743 LCGLNPKEYRTLKSSKKQGINPSRCIIDGDLIWSYRLMANSERNEVAKKIGTRTEEILGD 802

Query: 246 LYDIEALSSHF 256
           L +IE L+S F
Sbjct: 803 LLEIERLASVF 813



 Score = 67.0 bits (162), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 33/52 (63%), Positives = 41/52 (78%)

Query: 6   ELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGTLKLRFKKLKVLFVSDR 57
           EL  + LGL+G RPLLLVRT+ ELLIYQ FR+PKG LK+RF+KL  L + D+
Sbjct: 222 ELSVIGLGLNGERPLLLVRTRVELLIYQVFRYPKGHLKIRFRKLDQLNLLDQ 273


>gi|195334368|ref|XP_002033855.1| GM20208 [Drosophila sechellia]
 gi|194125825|gb|EDW47868.1| GM20208 [Drosophila sechellia]
          Length = 1455

 Score =  192 bits (487), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 91/195 (46%), Positives = 133/195 (68%), Gaps = 10/195 (5%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  VY  EF++++S++GF+++D ++N++++MYQPEARES GG +L++K D+HLGQ VNT 
Sbjct: 1265 PLEVYGIEFMVDNSNLGFLVTDAERNLIVYMYQPEARESLGGQKLLRKADYHLGQVVNTM 1324

Query: 128  FKIRCKPSPISDAPGARSRFL------TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVT 181
            F+++C    +      R  FL        Y +LDGALG+ LPLPEK YRR LMLQNV+++
Sbjct: 1325 FRVQCHQKGLHQ----RQPFLYENKHFVVYGTLDGALGYCLPLPEKVYRRFLMLQNVLLS 1380

Query: 182  HTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHND 241
            +  H  GLNP+ +RT K       NPSR IIDG L+W +  ++  ER E+ KKIG++  +
Sbjct: 1381 YQEHLCGLNPKEYRTLKSSKKQGINPSRCIIDGDLIWSYRLMANSERNEVAKKIGTRTEE 1440

Query: 242  ILDELYDIEALSSHF 256
            IL +L +IE L+S F
Sbjct: 1441 ILGDLLEIERLASVF 1455



 Score = 67.4 bits (163), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 33/52 (63%), Positives = 41/52 (78%)

Query: 6   ELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGTLKLRFKKLKVLFVSDR 57
           EL  + LGL+G RPLLLVRT+ ELLIYQ FR+PKG LK+RF+KL  L + D+
Sbjct: 864 ELSVIGLGLNGERPLLLVRTRVELLIYQVFRYPKGHLKIRFRKLDQLNLLDQ 915


>gi|194883064|ref|XP_001975624.1| GG22421 [Drosophila erecta]
 gi|190658811|gb|EDV56024.1| GG22421 [Drosophila erecta]
          Length = 1455

 Score =  192 bits (487), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 92/195 (47%), Positives = 133/195 (68%), Gaps = 10/195 (5%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  VY  EF++++S++GF+++D ++N++++MYQPEARES GG +L++K D+HLGQ VNT 
Sbjct: 1265 PLEVYGIEFMVDNSNLGFLVTDAERNLIVYMYQPEARESLGGQKLLRKADYHLGQVVNTM 1324

Query: 128  FKIRCKPSPISDAPGARSRFL------TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVT 181
            F+++C    +      R  FL        Y +LDGALG+ LPLPEK YRR LMLQNV+V+
Sbjct: 1325 FRVQCHQKGLHQ----RQPFLYENKHFVVYGTLDGALGYCLPLPEKVYRRFLMLQNVLVS 1380

Query: 182  HTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHND 241
            +  H  GLNP+ +RT K       NPSR IIDG L+W +  ++  ER E+ KKIG++  +
Sbjct: 1381 YQEHLCGLNPKEYRTLKSFKKQGINPSRCIIDGDLIWSYRLMANSERNEVAKKIGTRTEE 1440

Query: 242  ILDELYDIEALSSHF 256
            IL +L +IE L+S F
Sbjct: 1441 ILGDLLEIERLASVF 1455



 Score = 64.7 bits (156), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 32/52 (61%), Positives = 40/52 (76%)

Query: 6   ELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGTLKLRFKKLKVLFVSDR 57
           EL    LGL+G RPLL+VRT+ ELLIYQ FR+PKG LK+RF+KL  L + D+
Sbjct: 864 ELSLTGLGLNGERPLLMVRTRVELLIYQVFRYPKGHLKIRFRKLDQLNLLDQ 915


>gi|195381337|ref|XP_002049409.1| GJ21566 [Drosophila virilis]
 gi|194144206|gb|EDW60602.1| GJ21566 [Drosophila virilis]
          Length = 1420

 Score =  192 bits (487), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 88/191 (46%), Positives = 133/191 (69%), Gaps = 2/191 (1%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  V+  EF++++S++GF+++D ++N++++MYQPEARES GG +L++K D+HLGQ VNT 
Sbjct: 1230 PLEVFGIEFMVDNSNLGFLVTDAERNLIVYMYQPEARESLGGQKLLRKADYHLGQVVNTM 1289

Query: 128  FKIRCKPSPISDAPG--ARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSH 185
            F+++C    +         ++ L  Y +LDGALG+ LPLPEK YRR LMLQNV++++  H
Sbjct: 1290 FRVQCHQRGLHHRQPFLYENKHLVIYGTLDGALGYCLPLPEKVYRRFLMLQNVLLSYQDH 1349

Query: 186  TGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDE 245
              GLNP+ +RT K       NPSR IIDG L+W +  L+  ER E+ KKIG++  +IL +
Sbjct: 1350 LCGLNPKEYRTIKTVKKMGINPSRCIIDGDLIWSYRMLAHSERSEVAKKIGTRTEEILAD 1409

Query: 246  LYDIEALSSHF 256
            + +IE LS+ F
Sbjct: 1410 MLEIERLSAVF 1420



 Score = 65.9 bits (159), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 33/52 (63%), Positives = 40/52 (76%)

Query: 6   ELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGTLKLRFKKLKVLFVSDR 57
           EL  V LG HG RPLLLVRT+ ELLIYQ FR+ KG LK+RF+KL+ L + D+
Sbjct: 830 ELCLVGLGQHGERPLLLVRTRLELLIYQVFRYAKGHLKIRFRKLEQLHLLDQ 881


>gi|241060959|ref|XP_002408050.1| cleavage and polyadenylation specificity factor, putative [Ixodes
            scapularis]
 gi|215492346|gb|EEC01987.1| cleavage and polyadenylation specificity factor, putative [Ixodes
            scapularis]
          Length = 1241

 Score =  191 bits (484), Expect = 4e-46,   Method: Compositional matrix adjust.
 Identities = 86/181 (47%), Positives = 125/181 (69%), Gaps = 5/181 (2%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  V+A EF I++S M F+++D ++N++L+MYQPE+RES GG RL+++ DFH+G  V + 
Sbjct: 1048 PLEVFAVEFFIDNSQMSFLVTDSERNMILYMYQPESRESCGGQRLLRRGDFHIGSPVVSM 1107

Query: 128  FKIRCKPSPISD-----APGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
            F+I+C+   ++      A     R +T  A+LDG+LG+ LP+PEK YRRLLMLQNV+VT+
Sbjct: 1108 FRIKCRMGEVAKHDRRLAASVDGRHITMLATLDGSLGYVLPVPEKTYRRLLMLQNVLVTN 1167

Query: 183  TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
              H  GLNP+AFR Y  +    GNP + I+DG L+WKF+ LS  ER E+ KKIG+    +
Sbjct: 1168 MPHYAGLNPKAFRMYHSQRRVLGNPHKNILDGELIWKFMHLSFMERSELSKKIGTTVTQV 1227

Query: 243  L 243
            +
Sbjct: 1228 V 1228



 Score = 50.4 bits (119), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 29/67 (43%), Positives = 39/67 (58%), Gaps = 4/67 (5%)

Query: 3   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAF----RHPKGTLKLRFKKLKVLFVSDRS 58
           +V E+L V LG+  +RPLLL R   +LLIY+AF       +G LKLRFKKL    +    
Sbjct: 645 VVHEILMVGLGVRQSRPLLLARVDEDLLIYEAFPFYETQREGHLKLRFKKLNHDIILRSR 704

Query: 59  KRANEQP 65
           K   ++P
Sbjct: 705 KYKTQKP 711


>gi|195455711|ref|XP_002074834.1| GK23274 [Drosophila willistoni]
 gi|194170919|gb|EDW85820.1| GK23274 [Drosophila willistoni]
          Length = 1463

 Score =  190 bits (483), Expect = 4e-46,   Method: Compositional matrix adjust.
 Identities = 91/195 (46%), Positives = 131/195 (67%), Gaps = 10/195 (5%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  VY  EF+++++++GF+++D + N++++MYQPEARES GG +L++K D+HLGQ VNT 
Sbjct: 1273 PLEVYGIEFMVDNTNLGFLVTDAESNLIVYMYQPEARESLGGQKLLRKADYHLGQVVNTM 1332

Query: 128  FKIRCKPSPISDAPGARSRFL------TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVT 181
            F+++C    +      R  FL        Y +LDGALG+ LPLPEK YRR LMLQNV+++
Sbjct: 1333 FRVQCHQRGLHQ----RQPFLYENKHFVVYGTLDGALGYCLPLPEKVYRRFLMLQNVLLS 1388

Query: 182  HTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHND 241
            +  H  GLNP+ +RT K       NPSR IIDG L+W +  L+  ER E+ KKIG++  +
Sbjct: 1389 YQDHLCGLNPKEYRTLKSSKRLGINPSRCIIDGDLIWSYRLLANSERNEVAKKIGTRTEE 1448

Query: 242  ILDELYDIEALSSHF 256
            IL +L +IE LS  F
Sbjct: 1449 ILADLLEIERLSGVF 1463



 Score = 66.2 bits (160), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 33/52 (63%), Positives = 41/52 (78%)

Query: 6   ELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGTLKLRFKKLKVLFVSDR 57
           EL  V LGL+G RPLLLVRT+ ELLIYQ FR+PKG LK+RF+K+  L + D+
Sbjct: 872 ELSLVGLGLNGERPLLLVRTRLELLIYQVFRYPKGHLKIRFRKMDQLNLLDQ 923


>gi|195485994|ref|XP_002091320.1| GE12310 [Drosophila yakuba]
 gi|194177421|gb|EDW91032.1| GE12310 [Drosophila yakuba]
          Length = 1455

 Score =  190 bits (483), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 90/195 (46%), Positives = 133/195 (68%), Gaps = 10/195 (5%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  VY  EF++++S++GF+++D ++N++++MYQPEARES GG +L++K D+HLGQ VNT 
Sbjct: 1265 PLEVYGIEFMVDNSNLGFLVTDAERNLIVYMYQPEARESLGGQKLLRKADYHLGQVVNTM 1324

Query: 128  FKIRCKPSPISDAPGARSRFL------TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVT 181
            F+++C    +      R  FL        Y +LDGALG+ LPLPEK YRR LMLQNV+++
Sbjct: 1325 FRVQCHQKGLHQ----RQPFLYENKHFVVYGTLDGALGYCLPLPEKVYRRFLMLQNVLLS 1380

Query: 182  HTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHND 241
            +  H  GLNP+ +RT K       NPSR +IDG L+W +  ++  ER E+ KKIG++  +
Sbjct: 1381 YQEHLCGLNPKEYRTLKSFKKQGINPSRCVIDGDLIWSYRLMANSERNEVAKKIGTRTEE 1440

Query: 242  ILDELYDIEALSSHF 256
            IL +L +IE L+S F
Sbjct: 1441 ILADLLEIERLASVF 1455



 Score = 67.4 bits (163), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 33/52 (63%), Positives = 41/52 (78%)

Query: 6   ELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGTLKLRFKKLKVLFVSDR 57
           EL  + LGL+G RPLLLVRT+ ELLIYQ FR+PKG LK+RF+KL  L + D+
Sbjct: 864 ELSVIGLGLNGERPLLLVRTRVELLIYQVFRYPKGHLKIRFRKLDQLNLLDQ 915


>gi|198457226|ref|XP_001360595.2| GA10080 [Drosophila pseudoobscura pseudoobscura]
 gi|198135905|gb|EAL25170.2| GA10080 [Drosophila pseudoobscura pseudoobscura]
          Length = 1459

 Score =  190 bits (482), Expect = 6e-46,   Method: Compositional matrix adjust.
 Identities = 92/195 (47%), Positives = 130/195 (66%), Gaps = 10/195 (5%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  VY  EF++++S++GF+++D ++N++++MYQPEARES GG +LI+K D+HLGQ VNT 
Sbjct: 1269 PLEVYGIEFMVDNSNLGFLVTDAERNLIVYMYQPEARESLGGQKLIRKADYHLGQVVNTM 1328

Query: 128  FKIRCKPSPISDAPGARSRFL------TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVT 181
            F+++C    +      R  FL        Y +LDG LG+ LPLPEK YRR LMLQNV+++
Sbjct: 1329 FRVQCHQRGVHQ----RQPFLYENKHFVVYGTLDGGLGYCLPLPEKVYRRFLMLQNVLLS 1384

Query: 182  HTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHND 241
            +  H  GLNP+ FRT K       NPSR IIDG L+W +  L   +R E+ KKIG++  +
Sbjct: 1385 YQDHLCGLNPKEFRTLKSFKKQGLNPSRCIIDGDLIWSYRLLPNSDRNEVAKKIGTRTEE 1444

Query: 242  ILDELYDIEALSSHF 256
            IL +L +IE LS  F
Sbjct: 1445 ILSDLLEIERLSGVF 1459



 Score = 66.6 bits (161), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 32/52 (61%), Positives = 42/52 (80%)

Query: 6   ELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGTLKLRFKKLKVLFVSDR 57
           EL  V LGL+G RP+L+VRT+ ELLIYQ FR+PKG LK+RF+KL+ L + D+
Sbjct: 868 ELSLVGLGLNGERPVLMVRTRVELLIYQVFRYPKGNLKIRFRKLEQLNLLDQ 919


>gi|195150431|ref|XP_002016158.1| GL10645 [Drosophila persimilis]
 gi|194110005|gb|EDW32048.1| GL10645 [Drosophila persimilis]
          Length = 1459

 Score =  189 bits (481), Expect = 7e-46,   Method: Compositional matrix adjust.
 Identities = 92/195 (47%), Positives = 130/195 (66%), Gaps = 10/195 (5%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  VY  EF++++S++GF+++D ++N++++MYQPEARES GG +LI+K D+HLGQ VNT 
Sbjct: 1269 PLEVYGIEFMVDNSNLGFLVTDAERNLIVYMYQPEARESLGGQKLIRKADYHLGQVVNTM 1328

Query: 128  FKIRCKPSPISDAPGARSRFL------TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVT 181
            F+++C    +      R  FL        Y +LDG LG+ LPLPEK YRR LMLQNV+++
Sbjct: 1329 FRVQCHQRGVHQ----RQPFLYENKHFVVYGTLDGGLGYCLPLPEKVYRRFLMLQNVLLS 1384

Query: 182  HTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHND 241
            +  H  GLNP+ FRT K       NPSR IIDG L+W +  L   +R E+ KKIG++  +
Sbjct: 1385 YQDHLCGLNPKEFRTLKSFKKQGLNPSRCIIDGDLIWSYRLLPNSDRNEVAKKIGTRTEE 1444

Query: 242  ILDELYDIEALSSHF 256
            IL +L +IE LS  F
Sbjct: 1445 ILSDLLEIERLSGVF 1459



 Score = 66.6 bits (161), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 32/52 (61%), Positives = 42/52 (80%)

Query: 6   ELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGTLKLRFKKLKVLFVSDR 57
           EL  V LGL+G RP+L+VRT+ ELLIYQ FR+PKG LK+RF+KL+ L + D+
Sbjct: 868 ELSLVGLGLNGERPVLMVRTRVELLIYQVFRYPKGNLKIRFRKLEQLNLLDQ 919


>gi|149066088|gb|EDM15961.1| cleavage and polyadenylation specific factor 1, 160kDa (predicted),
           isoform CRA_b [Rattus norvegicus]
          Length = 241

 Score =  186 bits (473), Expect = 6e-45,   Method: Compositional matrix adjust.
 Identities = 84/194 (43%), Positives = 133/194 (68%), Gaps = 7/194 (3%)

Query: 68  PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
           P  VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 50  PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 109

Query: 128 FKIRCKPSPISDAPGARS-----RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
           ++  C+ +  ++ P  +S     + +TW+A+LDG +G  LP+ EK YRRLLMLQN + T 
Sbjct: 110 WRTPCRGA--AEGPSKKSVMWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTM 167

Query: 183 TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
             H  GLNPRAFR          N  R ++DG L+ ++L LS  ER E+ KKIG+  + I
Sbjct: 168 LPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDII 227

Query: 243 LDELYDIEALSSHF 256
           LD+L + + +++HF
Sbjct: 228 LDDLLETDRVTAHF 241


>gi|334326317|ref|XP_001364707.2| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Monodelphis domestica]
          Length = 1449

 Score =  186 bits (473), Expect = 6e-45,   Method: Compositional matrix adjust.
 Identities = 86/194 (44%), Positives = 134/194 (69%), Gaps = 7/194 (3%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  VY+ +F+++S+ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 1258 PLEVYSVDFMVDSAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 1317

Query: 128  FKIRCKPSPISDAPGARS-----RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
            ++  C+ +  ++ P  +S     + +TW+A+LDG +G  LP+ EK YRRLLMLQN + T 
Sbjct: 1318 WRTPCRGA--AEGPSKKSIVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTM 1375

Query: 183  TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
              H  GLNPRAFR          N  R ++DG L+ ++L LS  ER E+ KKIG+  + I
Sbjct: 1376 LPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERGELAKKIGTTPDII 1435

Query: 243  LDELYDIEALSSHF 256
            LD+L +I+ +++HF
Sbjct: 1436 LDDLLEIDRVTAHF 1449



 Score = 46.6 bits (109), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 25/51 (49%), Positives = 33/51 (64%), Gaps = 4/51 (7%)

Query: 3   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGTLKLRFKKL 49
           +V+E+L V+LG    RP LLV    ELLIY+AF H     +  LK+RFKK+
Sbjct: 851 LVKEVLLVALGNRQTRPYLLVHVDQELLIYEAFAHDSQLGQSNLKVRFKKV 901


>gi|395512730|ref|XP_003760588.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
            [Sarcophilus harrisii]
          Length = 1449

 Score =  186 bits (471), Expect = 9e-45,   Method: Compositional matrix adjust.
 Identities = 86/194 (44%), Positives = 134/194 (69%), Gaps = 7/194 (3%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  VY+ +F+++S+ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 1258 PLEVYSVDFMVDSAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 1317

Query: 128  FKIRCKPSPISDAPGARS-----RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
            ++  C+ +  ++ P  +S     + +TW+A+LDG +G  LP+ EK YRRLLMLQN + T 
Sbjct: 1318 WRTPCRGA--AEGPTKKSIVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTM 1375

Query: 183  TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
              H  GLNPRAFR          N  R ++DG L+ ++L LS  ER E+ KKIG+  + I
Sbjct: 1376 LPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERGELAKKIGTTPDII 1435

Query: 243  LDELYDIEALSSHF 256
            LD+L +I+ +++HF
Sbjct: 1436 LDDLLEIDRVTAHF 1449



 Score = 46.6 bits (109), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 25/51 (49%), Positives = 33/51 (64%), Gaps = 4/51 (7%)

Query: 3   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGTLKLRFKKL 49
           +V+E+L V+LG    RP LLV    ELLIY+AF H     +  LK+RFKK+
Sbjct: 851 LVKEVLLVALGNRQTRPYLLVHVDQELLIYEAFAHDSQLGQSNLKVRFKKV 901


>gi|348512553|ref|XP_003443807.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Oreochromis niloticus]
          Length = 1456

 Score =  185 bits (469), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 84/192 (43%), Positives = 134/192 (69%), Gaps = 3/192 (1%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  VY+ EF+++++ +GF++SD+DKN+ ++MY PEA+ES GG RL+++ DF+ G ++NTF
Sbjct: 1265 PLEVYSIEFMVDNNQLGFLVSDRDKNLYVYMYLPEAKESFGGMRLLRRADFNAGANINTF 1324

Query: 128  FKIRCKPSPISDAPGA---RSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTS 184
            +++ C+ +  + +  A    ++ +TW+A+LDG +G  LP+ EK YRRLLMLQN + T   
Sbjct: 1325 WRMPCRGALDASSKKALTWDNKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLP 1384

Query: 185  HTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILD 244
            H  GLNP+AFR          NP + I+DG L+ K+L LS+ ER E+ KKIG+  + ILD
Sbjct: 1385 HHAGLNPKAFRMLHSDRRSLQNPVKNILDGELLNKYLYLSMMERSELAKKIGTTQDIILD 1444

Query: 245  ELYDIEALSSHF 256
            +L +I+ +++HF
Sbjct: 1445 DLLEIDRVTAHF 1456



 Score = 48.1 bits (113), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 24/51 (47%), Positives = 37/51 (72%), Gaps = 4/51 (7%)

Query: 3   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRH----PKGTLKLRFKKL 49
           +V+E+  VSLG + ++P LLV  + ELLIY+AF++    P+  LK+RFKK+
Sbjct: 857 LVKEVALVSLGNNHSKPYLLVHVEQELLIYEAFQYDQQQPQNNLKVRFKKV 907


>gi|417406474|gb|JAA49895.1| Putative mrna cleavage and polyadenylation factor ii complex subunit
            cft1 cpsf subunit [Desmodus rotundus]
          Length = 1444

 Score =  184 bits (467), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 85/194 (43%), Positives = 133/194 (68%), Gaps = 7/194 (3%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 1253 PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 1312

Query: 128  FKIRCKPSPISDAPGARS-----RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
            ++  C+ +  ++ P  +S     + +TW+A+LDG +G  LP+ EK YRRLLMLQN + T 
Sbjct: 1313 WRTPCRGA--AEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTM 1370

Query: 183  TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
              H  GLNPRAFR          N  R I+DG L+ ++L LS  ER E+ KKIG+  + I
Sbjct: 1371 LPHHAGLNPRAFRMLHVDRRTLQNAVRNILDGELLNRYLYLSTMERSELAKKIGTTPDII 1430

Query: 243  LDELYDIEALSSHF 256
            LD+L + + +++HF
Sbjct: 1431 LDDLLETDRVTAHF 1444



 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 26/51 (50%), Positives = 35/51 (68%), Gaps = 4/51 (7%)

Query: 3   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGTLKLRFKKL 49
           +V+E+L V+LG   +RP LLV    ELLIY+AF H     +G LK+RFKK+
Sbjct: 846 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFAHDSQLGQGNLKVRFKKV 896


>gi|351713968|gb|EHB16887.1| Cleavage and polyadenylation specificity factor subunit 1
            [Heterocephalus glaber]
          Length = 1440

 Score =  184 bits (467), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 85/194 (43%), Positives = 133/194 (68%), Gaps = 7/194 (3%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 1249 PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 1308

Query: 128  FKIRCKPSPISDAPGARS-----RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
            ++  C+ +  S+ P  +S     + +TW+A+LDG +G  LP+ EK YRRLLMLQN + T 
Sbjct: 1309 WRTPCRGA--SEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTM 1366

Query: 183  TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
              H  GLNPRAFR          N  R ++DG L+ ++L LS  ER E+ KKIG+  + I
Sbjct: 1367 LPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERGELAKKIGTTPDII 1426

Query: 243  LDELYDIEALSSHF 256
            LD+L + + +++HF
Sbjct: 1427 LDDLLETDRVTAHF 1440



 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 26/51 (50%), Positives = 35/51 (68%), Gaps = 4/51 (7%)

Query: 3   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGTLKLRFKKL 49
           +V+E+L V+LG   +RP LLV    ELLIY+AF H     +G LK+RFKK+
Sbjct: 843 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKV 893


>gi|354491124|ref|XP_003507706.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
            isoform 2 [Cricetulus griseus]
          Length = 1388

 Score =  184 bits (467), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 84/194 (43%), Positives = 133/194 (68%), Gaps = 7/194 (3%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 1197 PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 1256

Query: 128  FKIRCKPSPISDAPGARS-----RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
            ++  C+ +  ++ P  +S     + +TW+A+LDG +G  LP+ EK YRRLLMLQN + T 
Sbjct: 1257 WRTPCRGA--AEGPSKKSVMWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTM 1314

Query: 183  TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
              H  GLNPRAFR          N  R ++DG L+ ++L LS  ER E+ KKIG+  + I
Sbjct: 1315 LPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDII 1374

Query: 243  LDELYDIEALSSHF 256
            LD+L + + +++HF
Sbjct: 1375 LDDLLETDRVTAHF 1388



 Score = 49.3 bits (116), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 26/51 (50%), Positives = 35/51 (68%), Gaps = 4/51 (7%)

Query: 3   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGTLKLRFKKL 49
           +V+E+L V+LG   +RP LLV    ELLIY+AF H     +G LK+RFKK+
Sbjct: 841 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKV 891


>gi|354491122|ref|XP_003507705.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
            isoform 1 [Cricetulus griseus]
          Length = 1441

 Score =  184 bits (467), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 84/194 (43%), Positives = 133/194 (68%), Gaps = 7/194 (3%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 1250 PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 1309

Query: 128  FKIRCKPSPISDAPGARS-----RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
            ++  C+ +  ++ P  +S     + +TW+A+LDG +G  LP+ EK YRRLLMLQN + T 
Sbjct: 1310 WRTPCRGA--AEGPSKKSVMWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTM 1367

Query: 183  TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
              H  GLNPRAFR          N  R ++DG L+ ++L LS  ER E+ KKIG+  + I
Sbjct: 1368 LPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDII 1427

Query: 243  LDELYDIEALSSHF 256
            LD+L + + +++HF
Sbjct: 1428 LDDLLETDRVTAHF 1441



 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 26/51 (50%), Positives = 35/51 (68%), Gaps = 4/51 (7%)

Query: 3   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGTLKLRFKKL 49
           +V+E+L V+LG   +RP LLV    ELLIY+AF H     +G LK+RFKK+
Sbjct: 843 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKV 893


>gi|197245729|gb|AAI68713.1| Cpsf1 protein [Rattus norvegicus]
          Length = 1439

 Score =  184 bits (466), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 84/194 (43%), Positives = 133/194 (68%), Gaps = 7/194 (3%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 1248 PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 1307

Query: 128  FKIRCKPSPISDAPGARS-----RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
            ++  C+ +  ++ P  +S     + +TW+A+LDG +G  LP+ EK YRRLLMLQN + T 
Sbjct: 1308 WRTPCRGA--AEGPSKKSVMWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTM 1365

Query: 183  TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
              H  GLNPRAFR          N  R ++DG L+ ++L LS  ER E+ KKIG+  + I
Sbjct: 1366 LPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDII 1425

Query: 243  LDELYDIEALSSHF 256
            LD+L + + +++HF
Sbjct: 1426 LDDLLETDRVTAHF 1439



 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 26/51 (50%), Positives = 35/51 (68%), Gaps = 4/51 (7%)

Query: 3   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGTLKLRFKKL 49
           +V+E+L V+LG   +RP LLV    ELLIY+AF H     +G LK+RFKK+
Sbjct: 841 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKV 891


>gi|148697644|gb|EDL29591.1| cleavage and polyadenylation specific factor 1, isoform CRA_c [Mus
            musculus]
          Length = 1388

 Score =  184 bits (466), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 84/194 (43%), Positives = 133/194 (68%), Gaps = 7/194 (3%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 1197 PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 1256

Query: 128  FKIRCKPSPISDAPGARS-----RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
            ++  C+ +  ++ P  +S     + +TW+A+LDG +G  LP+ EK YRRLLMLQN + T 
Sbjct: 1257 WRTPCRGA--AEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTM 1314

Query: 183  TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
              H  GLNPRAFR          N  R ++DG L+ ++L LS  ER E+ KKIG+  + I
Sbjct: 1315 LPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDII 1374

Query: 243  LDELYDIEALSSHF 256
            LD+L + + +++HF
Sbjct: 1375 LDDLLETDRVTAHF 1388



 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 26/51 (50%), Positives = 35/51 (68%), Gaps = 4/51 (7%)

Query: 3   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGTLKLRFKKL 49
           +V+E+L V+LG   +RP LLV    ELLIY+AF H     +G LK+RFKK+
Sbjct: 841 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKV 891


>gi|344236599|gb|EGV92702.1| Cleavage and polyadenylation specificity factor subunit 1 [Cricetulus
            griseus]
          Length = 1419

 Score =  184 bits (466), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 84/194 (43%), Positives = 133/194 (68%), Gaps = 7/194 (3%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 1228 PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 1287

Query: 128  FKIRCKPSPISDAPGARS-----RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
            ++  C+ +  ++ P  +S     + +TW+A+LDG +G  LP+ EK YRRLLMLQN + T 
Sbjct: 1288 WRTPCRGA--AEGPSKKSVMWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTM 1345

Query: 183  TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
              H  GLNPRAFR          N  R ++DG L+ ++L LS  ER E+ KKIG+  + I
Sbjct: 1346 LPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDII 1405

Query: 243  LDELYDIEALSSHF 256
            LD+L + + +++HF
Sbjct: 1406 LDDLLETDRVTAHF 1419



 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 26/51 (50%), Positives = 35/51 (68%), Gaps = 4/51 (7%)

Query: 3   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGTLKLRFKKL 49
           +V+E+L V+LG   +RP LLV    ELLIY+AF H     +G LK+RFKK+
Sbjct: 821 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKV 871


>gi|194474008|ref|NP_001124043.1| cleavage and polyadenylation specificity factor subunit 1 [Rattus
            norvegicus]
 gi|149066087|gb|EDM15960.1| cleavage and polyadenylation specific factor 1, 160kDa (predicted),
            isoform CRA_a [Rattus norvegicus]
          Length = 1386

 Score =  184 bits (466), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 84/194 (43%), Positives = 133/194 (68%), Gaps = 7/194 (3%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 1195 PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 1254

Query: 128  FKIRCKPSPISDAPGARS-----RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
            ++  C+ +  ++ P  +S     + +TW+A+LDG +G  LP+ EK YRRLLMLQN + T 
Sbjct: 1255 WRTPCRGA--AEGPSKKSVMWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTM 1312

Query: 183  TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
              H  GLNPRAFR          N  R ++DG L+ ++L LS  ER E+ KKIG+  + I
Sbjct: 1313 LPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDII 1372

Query: 243  LDELYDIEALSSHF 256
            LD+L + + +++HF
Sbjct: 1373 LDDLLETDRVTAHF 1386



 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 26/51 (50%), Positives = 35/51 (68%), Gaps = 4/51 (7%)

Query: 3   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGTLKLRFKKL 49
           +V+E+L V+LG   +RP LLV    ELLIY+AF H     +G LK+RFKK+
Sbjct: 839 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKV 889


>gi|148697643|gb|EDL29590.1| cleavage and polyadenylation specific factor 1, isoform CRA_b [Mus
            musculus]
          Length = 1311

 Score =  184 bits (466), Expect = 5e-44,   Method: Compositional matrix adjust.
 Identities = 84/194 (43%), Positives = 133/194 (68%), Gaps = 7/194 (3%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 1120 PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 1179

Query: 128  FKIRCKPSPISDAPGARS-----RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
            ++  C+ +  ++ P  +S     + +TW+A+LDG +G  LP+ EK YRRLLMLQN + T 
Sbjct: 1180 WRTPCRGA--AEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTM 1237

Query: 183  TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
              H  GLNPRAFR          N  R ++DG L+ ++L LS  ER E+ KKIG+  + I
Sbjct: 1238 LPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDII 1297

Query: 243  LDELYDIEALSSHF 256
            LD+L + + +++HF
Sbjct: 1298 LDDLLETDRVTAHF 1311



 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 26/51 (50%), Positives = 35/51 (68%), Gaps = 4/51 (7%)

Query: 3   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGTLKLRFKKL 49
           +V+E+L V+LG   +RP LLV    ELLIY+AF H     +G LK+RFKK+
Sbjct: 764 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKV 814


>gi|16751835|ref|NP_444423.1| cleavage and polyadenylation specificity factor subunit 1 isoform 2
            [Mus musculus]
 gi|17374611|sp|Q9EPU4.1|CPSF1_MOUSE RecName: Full=Cleavage and polyadenylation specificity factor subunit
            1; AltName: Full=Cleavage and polyadenylation specificity
            factor 160 kDa subunit; Short=CPSF 160 kDa subunit
 gi|11762096|gb|AAG40326.1|AF322193_1 cleavage and polyadenylation specificity factor 1 [Mus musculus]
 gi|38614159|gb|AAH56388.1| Cleavage and polyadenylation specific factor 1 [Mus musculus]
          Length = 1441

 Score =  183 bits (465), Expect = 5e-44,   Method: Compositional matrix adjust.
 Identities = 84/194 (43%), Positives = 133/194 (68%), Gaps = 7/194 (3%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 1250 PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 1309

Query: 128  FKIRCKPSPISDAPGARS-----RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
            ++  C+ +  ++ P  +S     + +TW+A+LDG +G  LP+ EK YRRLLMLQN + T 
Sbjct: 1310 WRTPCRGA--AEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTM 1367

Query: 183  TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
              H  GLNPRAFR          N  R ++DG L+ ++L LS  ER E+ KKIG+  + I
Sbjct: 1368 LPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDII 1427

Query: 243  LDELYDIEALSSHF 256
            LD+L + + +++HF
Sbjct: 1428 LDDLLETDRVTAHF 1441



 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 26/51 (50%), Positives = 35/51 (68%), Gaps = 4/51 (7%)

Query: 3   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGTLKLRFKKL 49
           +V+E+L V+LG   +RP LLV    ELLIY+AF H     +G LK+RFKK+
Sbjct: 843 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKV 893


>gi|148697642|gb|EDL29589.1| cleavage and polyadenylation specific factor 1, isoform CRA_a [Mus
            musculus]
          Length = 1417

 Score =  183 bits (465), Expect = 5e-44,   Method: Compositional matrix adjust.
 Identities = 84/194 (43%), Positives = 133/194 (68%), Gaps = 7/194 (3%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 1226 PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 1285

Query: 128  FKIRCKPSPISDAPGARS-----RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
            ++  C+ +  ++ P  +S     + +TW+A+LDG +G  LP+ EK YRRLLMLQN + T 
Sbjct: 1286 WRTPCRGA--AEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTM 1343

Query: 183  TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
              H  GLNPRAFR          N  R ++DG L+ ++L LS  ER E+ KKIG+  + I
Sbjct: 1344 LPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDII 1403

Query: 243  LDELYDIEALSSHF 256
            LD+L + + +++HF
Sbjct: 1404 LDDLLETDRVTAHF 1417



 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 26/51 (50%), Positives = 35/51 (68%), Gaps = 4/51 (7%)

Query: 3   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGTLKLRFKKL 49
           +V+E+L V+LG   +RP LLV    ELLIY+AF H     +G LK+RFKK+
Sbjct: 870 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKV 920


>gi|348555854|ref|XP_003463738.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
            isoform 1 [Cavia porcellus]
          Length = 1440

 Score =  183 bits (465), Expect = 6e-44,   Method: Compositional matrix adjust.
 Identities = 84/194 (43%), Positives = 133/194 (68%), Gaps = 7/194 (3%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 1249 PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 1308

Query: 128  FKIRCKPSPISDAPGARS-----RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
            ++  C+ +  ++ P  +S     + +TW+A+LDG +G  LP+ EK YRRLLMLQN + T 
Sbjct: 1309 WRTPCRGA--TEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTM 1366

Query: 183  TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
              H  GLNPRAFR          N  R ++DG L+ ++L LS  ER E+ KKIG+  + I
Sbjct: 1367 LPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERGELAKKIGTTPDII 1426

Query: 243  LDELYDIEALSSHF 256
            LD+L + + +++HF
Sbjct: 1427 LDDLLETDRVTAHF 1440



 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 26/51 (50%), Positives = 35/51 (68%), Gaps = 4/51 (7%)

Query: 3   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGTLKLRFKKL 49
           +V+E+L V+LG   +RP LLV    ELLIY+AF H     +G LK+RFKK+
Sbjct: 842 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKV 892


>gi|395860104|ref|XP_003802355.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
            [Otolemur garnettii]
          Length = 1441

 Score =  183 bits (465), Expect = 6e-44,   Method: Compositional matrix adjust.
 Identities = 84/194 (43%), Positives = 133/194 (68%), Gaps = 7/194 (3%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 1250 PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 1309

Query: 128  FKIRCKPSPISDAPGARS-----RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
            ++  C+ +  ++ P  +S     + +TW+A+LDG +G  LP+ EK YRRLLMLQN + T 
Sbjct: 1310 WRTPCRGA--TEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTM 1367

Query: 183  TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
              H  GLNPRAFR          N  R ++DG L+ ++L LS  ER E+ KKIG+  + I
Sbjct: 1368 LPHHAGLNPRAFRMLHVDRRVLQNAVRNVLDGELLNRYLYLSTMERGELAKKIGTTPDII 1427

Query: 243  LDELYDIEALSSHF 256
            LD+L + + +++HF
Sbjct: 1428 LDDLLETDRVTAHF 1441



 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 26/51 (50%), Positives = 35/51 (68%), Gaps = 4/51 (7%)

Query: 3   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGTLKLRFKKL 49
           +V+E+L V+LG   +RP LLV    ELLIY+AF H     +G LK+RFKK+
Sbjct: 843 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKV 893


>gi|348555856|ref|XP_003463739.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
            isoform 2 [Cavia porcellus]
          Length = 1387

 Score =  183 bits (465), Expect = 6e-44,   Method: Compositional matrix adjust.
 Identities = 84/194 (43%), Positives = 133/194 (68%), Gaps = 7/194 (3%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 1196 PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 1255

Query: 128  FKIRCKPSPISDAPGARS-----RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
            ++  C+ +  ++ P  +S     + +TW+A+LDG +G  LP+ EK YRRLLMLQN + T 
Sbjct: 1256 WRTPCRGA--TEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTM 1313

Query: 183  TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
              H  GLNPRAFR          N  R ++DG L+ ++L LS  ER E+ KKIG+  + I
Sbjct: 1314 LPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERGELAKKIGTTPDII 1373

Query: 243  LDELYDIEALSSHF 256
            LD+L + + +++HF
Sbjct: 1374 LDDLLETDRVTAHF 1387



 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 26/51 (50%), Positives = 35/51 (68%), Gaps = 4/51 (7%)

Query: 3   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGTLKLRFKKL 49
           +V+E+L V+LG   +RP LLV    ELLIY+AF H     +G LK+RFKK+
Sbjct: 840 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKV 890


>gi|338728513|ref|XP_003365689.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Equus caballus]
          Length = 1450

 Score =  183 bits (464), Expect = 6e-44,   Method: Compositional matrix adjust.
 Identities = 84/194 (43%), Positives = 133/194 (68%), Gaps = 7/194 (3%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 1259 PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 1318

Query: 128  FKIRCKPSPISDAPGARS-----RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
            ++  C+ +  ++ P  +S     + +TW+A+LDG +G  LP+ EK YRRLLMLQN + T 
Sbjct: 1319 WRTPCRGA--AEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTM 1376

Query: 183  TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
              H  GLNPRAFR          N  R ++DG L+ ++L LS  ER E+ KKIG+  + I
Sbjct: 1377 LPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERGELAKKIGTTPDII 1436

Query: 243  LDELYDIEALSSHF 256
            LD+L + + +++HF
Sbjct: 1437 LDDLLETDRVTAHF 1450



 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 26/51 (50%), Positives = 35/51 (68%), Gaps = 4/51 (7%)

Query: 3   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGTLKLRFKKL 49
           +V+E+L V+LG   +RP LLV    ELLIY+AF H     +G LK+RFKK+
Sbjct: 852 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKV 902


>gi|27807297|ref|NP_777145.1| cleavage and polyadenylation specificity factor subunit 1 [Bos
            taurus]
 gi|1706101|sp|Q10569.1|CPSF1_BOVIN RecName: Full=Cleavage and polyadenylation specificity factor subunit
            1; AltName: Full=Cleavage and polyadenylation specificity
            factor 160 kDa subunit; Short=CPSF 160 kDa subunit
 gi|929007|emb|CAA58152.1| cleavage and polyadenylation specificity factor, 160 kDa subunit [Bos
            taurus]
 gi|296480730|tpg|DAA22845.1| TPA: cleavage and polyadenylation specificity factor subunit 1 [Bos
            taurus]
          Length = 1444

 Score =  183 bits (464), Expect = 6e-44,   Method: Compositional matrix adjust.
 Identities = 84/194 (43%), Positives = 133/194 (68%), Gaps = 7/194 (3%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 1253 PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 1312

Query: 128  FKIRCKPSPISDAPGARS-----RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
            ++  C+ +  ++ P  +S     + +TW+A+LDG +G  LP+ EK YRRLLMLQN + T 
Sbjct: 1313 WRTPCRGA--AEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTM 1370

Query: 183  TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
              H  GLNPRAFR          N  R ++DG L+ ++L LS  ER E+ KKIG+  + I
Sbjct: 1371 LPHHAGLNPRAFRMLHVDRRVLQNAVRNVLDGELLNRYLYLSTMERGELAKKIGTTPDII 1430

Query: 243  LDELYDIEALSSHF 256
            LD+L + + +++HF
Sbjct: 1431 LDDLLETDRVTAHF 1444



 Score = 48.5 bits (114), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 26/51 (50%), Positives = 34/51 (66%), Gaps = 4/51 (7%)

Query: 3   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGTLKLRFKKL 49
           +V+E+L V+LG    RP LLV    ELLIY+AF H     +G LK+RFKK+
Sbjct: 846 LVKEVLLVALGSRQRRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKV 896


>gi|391328522|ref|XP_003738737.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Metaseiulus occidentalis]
          Length = 1500

 Score =  183 bits (464), Expect = 7e-44,   Method: Compositional matrix adjust.
 Identities = 84/197 (42%), Positives = 129/197 (65%), Gaps = 11/197 (5%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  +Y+ E+++++S M F+ +D   N++++MYQPE RE+ GG RLI++ DF++G  +NT 
Sbjct: 1307 PVQIYSIEYLVDNSQMAFLATDAQCNILVYMYQPENRETFGGQRLIRRGDFNIGSRINTM 1366

Query: 128  FKIRCKPSPISDAPGARSRFL--------TWYASLDGALGFFLPLPEKNYRRLLMLQNVM 179
            F+IRC+   +++ P +  R L        T YASLDGA G+ LP+ EK YRRLLMLQNV+
Sbjct: 1367 FRIRCR---LAEVPRSERRLLSDLEARHVTLYASLDGAFGYLLPISEKTYRRLLMLQNVL 1423

Query: 180  VTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKH 239
             ++  H GGLNP+AFR  +       NP + I+DG L+  F+ L+  E+ E+ +KIG+  
Sbjct: 1424 NSYCQHVGGLNPKAFRIMQTDVRALSNPQKNIVDGDLINVFMDLNFNEKAEVARKIGTTV 1483

Query: 240  NDILDELYDIEALSSHF 256
            + I  +L +IE L+ HF
Sbjct: 1484 HQIQLDLAEIEGLTYHF 1500



 Score = 44.3 bits (103), Expect = 0.056,   Method: Compositional matrix adjust.
 Identities = 21/50 (42%), Positives = 32/50 (64%), Gaps = 3/50 (6%)

Query: 4   VQELLTVSLGLHGNRPLLLVRTQHELLIYQAF---RHPKGTLKLRFKKLK 50
           V E+   +LG+H +RPLL  R   EL IY+A+      +G LKL+F++L+
Sbjct: 911 VMEIFVCALGMHQSRPLLFARVDSELHIYEAYPFVNQKEGHLKLQFRRLQ 960


>gi|345779232|ref|XP_532356.3| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
            [Canis lupus familiaris]
          Length = 1460

 Score =  183 bits (464), Expect = 7e-44,   Method: Compositional matrix adjust.
 Identities = 84/194 (43%), Positives = 133/194 (68%), Gaps = 7/194 (3%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 1269 PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 1328

Query: 128  FKIRCKPSPISDAPGARS-----RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
            ++  C+ +  ++ P  +S     + +TW+A+LDG +G  LP+ EK YRRLLMLQN + T 
Sbjct: 1329 WRTPCRGA--AEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTM 1386

Query: 183  TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
              H  GLNPRAFR          N  R ++DG L+ ++L LS  ER E+ KKIG+  + I
Sbjct: 1387 LPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERGELAKKIGTTPDII 1446

Query: 243  LDELYDIEALSSHF 256
            LD+L + + +++HF
Sbjct: 1447 LDDLLETDRVTAHF 1460



 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 26/51 (50%), Positives = 35/51 (68%), Gaps = 4/51 (7%)

Query: 3   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGTLKLRFKKL 49
           +V+E+L V+LG   +RP LLV    ELLIY+AF H     +G LK+RFKK+
Sbjct: 862 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKV 912


>gi|338728511|ref|XP_001505047.3| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like isoform 1 [Equus caballus]
          Length = 1444

 Score =  183 bits (464), Expect = 7e-44,   Method: Compositional matrix adjust.
 Identities = 84/194 (43%), Positives = 133/194 (68%), Gaps = 7/194 (3%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 1253 PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 1312

Query: 128  FKIRCKPSPISDAPGARS-----RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
            ++  C+ +  ++ P  +S     + +TW+A+LDG +G  LP+ EK YRRLLMLQN + T 
Sbjct: 1313 WRTPCRGA--AEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTM 1370

Query: 183  TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
              H  GLNPRAFR          N  R ++DG L+ ++L LS  ER E+ KKIG+  + I
Sbjct: 1371 LPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERGELAKKIGTTPDII 1430

Query: 243  LDELYDIEALSSHF 256
            LD+L + + +++HF
Sbjct: 1431 LDDLLETDRVTAHF 1444



 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 26/51 (50%), Positives = 35/51 (68%), Gaps = 4/51 (7%)

Query: 3   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGTLKLRFKKL 49
           +V+E+L V+LG   +RP LLV    ELLIY+AF H     +G LK+RFKK+
Sbjct: 846 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKV 896


>gi|358415280|ref|XP_003583063.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
            [Bos taurus]
          Length = 1490

 Score =  182 bits (463), Expect = 8e-44,   Method: Compositional matrix adjust.
 Identities = 84/194 (43%), Positives = 133/194 (68%), Gaps = 7/194 (3%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 1299 PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 1358

Query: 128  FKIRCKPSPISDAPGARS-----RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
            ++  C+ +  ++ P  +S     + +TW+A+LDG +G  LP+ EK YRRLLMLQN + T 
Sbjct: 1359 WRTPCRGA--AEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTM 1416

Query: 183  TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
              H  GLNPRAFR          N  R ++DG L+ ++L LS  ER E+ KKIG+  + I
Sbjct: 1417 LPHHAGLNPRAFRMLHVDRRVLQNAVRNVLDGELLNRYLYLSTMERGELAKKIGTTPDII 1476

Query: 243  LDELYDIEALSSHF 256
            LD+L + + +++HF
Sbjct: 1477 LDDLLETDRVTAHF 1490



 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 26/51 (50%), Positives = 34/51 (66%), Gaps = 4/51 (7%)

Query: 3   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGTLKLRFKKL 49
           +V+E+L V+LG    RP LLV    ELLIY+AF H     +G LK+RFKK+
Sbjct: 892 LVKEVLLVALGSRQRRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKV 942


>gi|12697776|dbj|BAB21613.1| polyadenylation specificity factor [Homo sapiens]
          Length = 216

 Score =  182 bits (463), Expect = 8e-44,   Method: Compositional matrix adjust.
 Identities = 82/192 (42%), Positives = 131/192 (68%), Gaps = 3/192 (1%)

Query: 68  PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
           P  VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 25  PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 84

Query: 128 FKIRCKPSPISDAPGA---RSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTS 184
           ++  C+ +    +  +    ++ +TW+A+LDG +G  LP+ EK YRRLLMLQN + T   
Sbjct: 85  WRTPCRGATEGLSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLP 144

Query: 185 HTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILD 244
           H  GLNPRAFR          N  R ++DG L+ ++L LS  ER E+ KKIG+  + ILD
Sbjct: 145 HHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDIILD 204

Query: 245 ELYDIEALSSHF 256
           +L + + +++HF
Sbjct: 205 DLLETDRVTAHF 216


>gi|431908146|gb|ELK11749.1| Cleavage and polyadenylation specificity factor subunit 1 [Pteropus
           alecto]
          Length = 820

 Score =  182 bits (462), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 84/194 (43%), Positives = 133/194 (68%), Gaps = 7/194 (3%)

Query: 68  PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
           P  VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 629 PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 688

Query: 128 FKIRCKPSPISDAPGARS-----RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
           ++  C+ +  ++ P  +S     + +TW+A+LDG +G  LP+ EK YRRLLMLQN + T 
Sbjct: 689 WRTPCRGA--AEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTM 746

Query: 183 TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
             H  GLNPRAFR          N  R ++DG L+ ++L LS  ER E+ KKIG+  + I
Sbjct: 747 LPHHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELLNRYLYLSTMERGELAKKIGTTPDII 806

Query: 243 LDELYDIEALSSHF 256
           LD+L + + +++HF
Sbjct: 807 LDDLLETDRVTAHF 820



 Score = 48.1 bits (113), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 25/51 (49%), Positives = 35/51 (68%), Gaps = 4/51 (7%)

Query: 3   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGTLKLRFKKL 49
           +V+E+L V+LG   +RP LLV    ELL+Y+AF H     +G LK+RFKK+
Sbjct: 223 LVKEVLLVALGSRQSRPYLLVHVDQELLVYEAFPHDSQLGQGNLKVRFKKV 273


>gi|355680843|gb|AER96659.1| cleavage and polyadenylation specific factor 1, 160kDa [Mustela
            putorius furo]
          Length = 1399

 Score =  181 bits (460), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 83/193 (43%), Positives = 132/193 (68%), Gaps = 7/193 (3%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 1209 PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 1268

Query: 128  FKIRCKPSPISDAPGARS-----RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
            ++  C+ +  ++ P  +S     + +TW+A+LDG +G  LP+ EK YRRLLMLQN + T 
Sbjct: 1269 WRTPCRGA--AEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTM 1326

Query: 183  TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
              H  GLNPRAFR          N  R ++DG L+ ++L LS  ER E+ KKIG+  + I
Sbjct: 1327 LPHHAGLNPRAFRLLHADRRALQNAVRNVLDGELLNRYLYLSTMERGELAKKIGTTPDII 1386

Query: 243  LDELYDIEALSSH 255
            LD+L + + +++H
Sbjct: 1387 LDDLLETDRVTAH 1399



 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 26/51 (50%), Positives = 35/51 (68%), Gaps = 4/51 (7%)

Query: 3   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGTLKLRFKKL 49
           +V+E+L V+LG   +RP LLV    ELLIY+AF H     +G LK+RFKK+
Sbjct: 802 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKV 852


>gi|229335612|ref|NP_001108153.2| cleavage and polyadenylation specificity factor subunit 1 [Danio
            rerio]
          Length = 1449

 Score =  181 bits (460), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 85/192 (44%), Positives = 132/192 (68%), Gaps = 3/192 (1%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  VY+ EF+++++ +GF++SD+DKN++++MY PEA+ES GG RL+++ DF++G HVN F
Sbjct: 1258 PLEVYSIEFMVDNNQLGFLVSDRDKNLMVYMYLPEAKESFGGMRLLRRADFNVGSHVNAF 1317

Query: 128  FKIRCKPSPISDAPGA---RSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTS 184
            +++ C+ +  +    A    ++ +TW+A+LDG +G  LP+ EK YRRLLMLQN + T   
Sbjct: 1318 WRMPCRGTLDTANKKALTWDNKHITWFATLDGGVGLLLPMQEKTYRRLLMLQNALTTMLP 1377

Query: 185  HTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILD 244
            H  GLNP+AFR          N  + I+DG L+ K+L LS  ER E+ KKIG+  + ILD
Sbjct: 1378 HHAGLNPKAFRMLHCDRRTLQNAVKNILDGELLNKYLYLSTMERSELAKKIGTTPDIILD 1437

Query: 245  ELYDIEALSSHF 256
            +L +IE +++HF
Sbjct: 1438 DLLEIERVTAHF 1449



 Score = 45.1 bits (105), Expect = 0.027,   Method: Compositional matrix adjust.
 Identities = 23/51 (45%), Positives = 34/51 (66%), Gaps = 4/51 (7%)

Query: 3   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAF----RHPKGTLKLRFKKL 49
           +V+E+  VSLG + +RP LL   + ELLIY+AF    +  +  LK+RFKK+
Sbjct: 851 LVKEVALVSLGYNHSRPYLLAHVEQELLIYEAFPYDQQQAQSNLKVRFKKM 901


>gi|432883539|ref|XP_004074300.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Oryzias latipes]
          Length = 1456

 Score =  181 bits (460), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 83/192 (43%), Positives = 131/192 (68%), Gaps = 3/192 (1%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  VY+ EF+++++ +GF++SD+DKN+ ++MY PEA+ES GG RL+++ DF+ G H+N+ 
Sbjct: 1265 PLEVYSIEFIVDNNQLGFLVSDRDKNLFVYMYLPEAKESFGGMRLLRRADFNAGAHINSL 1324

Query: 128  FKIRCKPSPISDAPGA---RSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTS 184
            +++ C+ +  S +  A    ++ +TW+A+LDG +G  LP+ EK YRRLLMLQN + T   
Sbjct: 1325 WRMPCRGALDSGSKKALTWDNKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLP 1384

Query: 185  HTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILD 244
            H  GLNP+AFR          N  + I+DG L+ K+L LS  ER E+ KKIG+  + ILD
Sbjct: 1385 HHAGLNPKAFRMMHSNRRSLQNAVKNILDGELLAKYLYLSTMERSELAKKIGTTQDIILD 1444

Query: 245  ELYDIEALSSHF 256
            +L +I+ +++HF
Sbjct: 1445 DLLEIDRVTAHF 1456



 Score = 47.4 bits (111), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 23/51 (45%), Positives = 37/51 (72%), Gaps = 4/51 (7%)

Query: 3   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAF----RHPKGTLKLRFKKL 49
           +V+E+  V+LG + +RP LLV  ++ELL+Y+AF    + P+  LK+RFKK+
Sbjct: 857 LVKEVALVALGNNRSRPYLLVHVENELLVYEAFPYDQQQPQNNLKVRFKKV 907


>gi|193702313|ref|XP_001945086.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Acyrthosiphon pisum]
          Length = 1335

 Score =  181 bits (460), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 81/194 (41%), Positives = 137/194 (70%), Gaps = 5/194 (2%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  V+   F+I+++ +GF+ SD+D+N++L++YQP ARES GG  L+++ DF++G +VN+F
Sbjct: 1142 PLEVFDINFLIDNTELGFLASDRDQNLLLYLYQPMARESYGGQHLVRRGDFNIGSNVNSF 1201

Query: 128  FKIRCKPSPIS----DAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHT 183
            F++RCK S ++    +A G+  R +T Y +LDG++G+ +P+ EKNYRRLL LQN++V + 
Sbjct: 1202 FRLRCKQSTVAPDRREAIGSDKRHVTMYTTLDGSIGYIVPIHEKNYRRLLTLQNMLVKNI 1261

Query: 184  SHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQ-LSLGERLEICKKIGSKHNDI 242
            +H  GLNP+A+R++K       N +R +IDG LVW F+  ++  +R EI  K+G K  ++
Sbjct: 1262 THLAGLNPKAYRSFKATAPERMNQARRVIDGELVWMFVTCMNARQRNEIANKVGVKTIEL 1321

Query: 243  LDELYDIEALSSHF 256
            L ++Y+++  + HF
Sbjct: 1322 LQDIYELDRTTWHF 1335



 Score = 56.2 bits (134), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 32/82 (39%), Positives = 44/82 (53%), Gaps = 10/82 (12%)

Query: 3   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGTLKLRFKKLKVLFV-SDRSKRA 61
           I++E+L V LG    RP++ VR  +E++IY   RHP+GTLK+RF K+  L     RS   
Sbjct: 755 IIKEILIVPLGYQDKRPIMFVRLDNEVVIYGIHRHPEGTLKMRFHKMTSLLTFQSRSGNP 814

Query: 62  NEQPGLPR---------GVYAC 74
            E   L R         GV+ C
Sbjct: 815 LEGTSLLRYFSKVAGHNGVFIC 836


>gi|49619065|gb|AAT68117.1| cleavage and polyadenylation specific factor 1 [Danio rerio]
          Length = 1105

 Score =  181 bits (459), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 85/192 (44%), Positives = 132/192 (68%), Gaps = 3/192 (1%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  VY+ EF+++++ +GF++SD+DKN++++MY PEA+ES GG RL+++ DF++G HVN F
Sbjct: 914  PLEVYSIEFMVDNNQLGFLVSDRDKNLMVYMYLPEAKESFGGMRLLRRADFNVGSHVNAF 973

Query: 128  FKIRCKPSPISDAPGA---RSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTS 184
            +++ C+ +  +    A    ++ +TW+A+LDG +G  LP+ EK YRRLLMLQN + T   
Sbjct: 974  WRMPCRGTLDTANKKALTWDNKHITWFATLDGGVGLLLPMQEKTYRRLLMLQNALTTMLP 1033

Query: 185  HTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILD 244
            H  GLNP+AFR          N  + I+DG L+ K+L LS  ER E+ KKIG+  + ILD
Sbjct: 1034 HHAGLNPKAFRMLHCDRRTLQNAVKNILDGELLNKYLYLSTMERSELAKKIGTTPDIILD 1093

Query: 245  ELYDIEALSSHF 256
            +L +IE +++HF
Sbjct: 1094 DLLEIERVTAHF 1105



 Score = 44.3 bits (103), Expect = 0.045,   Method: Compositional matrix adjust.
 Identities = 23/51 (45%), Positives = 33/51 (64%), Gaps = 4/51 (7%)

Query: 3   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAF----RHPKGTLKLRFKKL 49
           +V+E+  VSLG   +RP LL   + ELLIY+AF    +  +  LK+RFKK+
Sbjct: 507 LVKEVALVSLGYSHSRPYLLAHVEQELLIYEAFPYDQQQAQSNLKVRFKKM 557


>gi|410987992|ref|XP_004000273.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
            [Felis catus]
          Length = 1432

 Score =  181 bits (459), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 83/194 (42%), Positives = 133/194 (68%), Gaps = 7/194 (3%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 1241 PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 1300

Query: 128  FKIRCKPSPISDAPGARS-----RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
            ++  C+ +  ++ P  +S     + +TW+A+LDG +G  LP+ EK YRRLLMLQN + T 
Sbjct: 1301 WRTPCRGA--AEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTM 1358

Query: 183  TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
              H  GLNPRAFR          N  R ++DG L+ ++L LS  ER E+ KKIG+  + I
Sbjct: 1359 LPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERGELAKKIGTTPDII 1418

Query: 243  LDELYDIEALSSHF 256
            L++L + + +++HF
Sbjct: 1419 LEDLLETDRVTAHF 1432



 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 26/51 (50%), Positives = 35/51 (68%), Gaps = 4/51 (7%)

Query: 3   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGTLKLRFKKL 49
           +V+E+L V+LG   +RP LLV    ELLIY+AF H     +G LK+RFKK+
Sbjct: 834 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKV 884


>gi|1045574|gb|AAC50293.1| cleavage and polyadenylation specificity factor [Homo sapiens]
          Length = 1442

 Score =  180 bits (457), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 82/191 (42%), Positives = 129/191 (67%), Gaps = 2/191 (1%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 1252 PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 1311

Query: 128  FKIRCKPSPISDAPGA--RSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSH 185
            ++  C+ +           ++ +TW+A+LDG +G  LP+ EK YRRLLMLQN + T   H
Sbjct: 1312 WRTPCRATEGLSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPH 1371

Query: 186  TGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDE 245
              GLNPRAFR          N  R ++DG L+ ++L LS  ER E+ KKIG+  + ILD+
Sbjct: 1372 HAGLNPRAFRMLHVDRRTLQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDIILDD 1431

Query: 246  LYDIEALSSHF 256
            L + + +++HF
Sbjct: 1432 LLETDRVTAHF 1442



 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 26/51 (50%), Positives = 35/51 (68%), Gaps = 4/51 (7%)

Query: 3   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGTLKLRFKKL 49
           +V+E+L V+LG   +RP LLV    ELLIY+AF H     +G LK+RFKK+
Sbjct: 845 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKV 895


>gi|444523674|gb|ELV13604.1| Cleavage and polyadenylation specificity factor subunit 1 [Tupaia
            chinensis]
          Length = 1469

 Score =  180 bits (457), Expect = 5e-43,   Method: Compositional matrix adjust.
 Identities = 84/194 (43%), Positives = 131/194 (67%), Gaps = 7/194 (3%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG  L+++ DFHLG HVNTF
Sbjct: 1278 PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGLLLLRRADFHLGAHVNTF 1337

Query: 128  FKIRCKPSPISDAPGARS-----RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
            ++  C+ +   + P  +S     + +TW+A+LDG +G  LP+ EK YRRLLMLQN + T 
Sbjct: 1338 WRTPCRGA--VEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTM 1395

Query: 183  TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
              H  GLNPRAFR          N  R ++DG L+ ++L LS  ER E+ KKIG+  + I
Sbjct: 1396 LPHHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELLSRYLYLSTMERSELAKKIGTTPDII 1455

Query: 243  LDELYDIEALSSHF 256
            LD+L + + +++HF
Sbjct: 1456 LDDLLETDRVTAHF 1469



 Score = 48.1 bits (113), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 25/51 (49%), Positives = 35/51 (68%), Gaps = 4/51 (7%)

Query: 3   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGTLKLRFKKL 49
           +V+E+L V+LG   +RP LLV    ELL+Y+AF H     +G LK+RFKK+
Sbjct: 871 LVKEVLLVALGSRQSRPYLLVHVDQELLLYEAFPHDSQLGQGNLKVRFKKV 921


>gi|392306997|ref|NP_001254722.1| cleavage and polyadenylation specificity factor subunit 1 [Macaca
            mulatta]
 gi|380812168|gb|AFE77959.1| cleavage and polyadenylation specificity factor subunit 1 [Macaca
            mulatta]
 gi|383417835|gb|AFH32131.1| cleavage and polyadenylation specificity factor subunit 1 [Macaca
            mulatta]
          Length = 1442

 Score =  180 bits (456), Expect = 5e-43,   Method: Compositional matrix adjust.
 Identities = 82/192 (42%), Positives = 131/192 (68%), Gaps = 3/192 (1%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 1251 PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 1310

Query: 128  FKIRCKPSPISDAPGA---RSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTS 184
            ++  C+ +    +  +    ++ +TW+A+LDG +G  LP+ EK YRRLLMLQN + T   
Sbjct: 1311 WRTPCRGATEGLSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLP 1370

Query: 185  HTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILD 244
            H  GLNPRAFR          N  R ++DG L+ ++L LS  ER E+ KKIG+  + ILD
Sbjct: 1371 HHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDIILD 1430

Query: 245  ELYDIEALSSHF 256
            +L + + +++HF
Sbjct: 1431 DLLETDRVTAHF 1442



 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 26/51 (50%), Positives = 35/51 (68%), Gaps = 4/51 (7%)

Query: 3   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGTLKLRFKKL 49
           +V+E+L V+LG   +RP LLV    ELLIY+AF H     +G LK+RFKK+
Sbjct: 844 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKV 894


>gi|397497327|ref|XP_003819464.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
            [Pan paniscus]
 gi|410336497|gb|JAA37195.1| cleavage and polyadenylation specific factor 1, 160kDa [Pan
            troglodytes]
          Length = 1442

 Score =  180 bits (456), Expect = 6e-43,   Method: Compositional matrix adjust.
 Identities = 82/192 (42%), Positives = 131/192 (68%), Gaps = 3/192 (1%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 1251 PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 1310

Query: 128  FKIRCKPSPISDAPGA---RSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTS 184
            ++  C+ +    +  +    ++ +TW+A+LDG +G  LP+ EK YRRLLMLQN + T   
Sbjct: 1311 WRTPCRGATEGLSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLP 1370

Query: 185  HTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILD 244
            H  GLNPRAFR          N  R ++DG L+ ++L LS  ER E+ KKIG+  + ILD
Sbjct: 1371 HHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDIILD 1430

Query: 245  ELYDIEALSSHF 256
            +L + + +++HF
Sbjct: 1431 DLLETDRVTAHF 1442



 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 26/51 (50%), Positives = 35/51 (68%), Gaps = 4/51 (7%)

Query: 3   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGTLKLRFKKL 49
           +V+E+L V+LG   +RP LLV    ELLIY+AF H     +G LK+RFKK+
Sbjct: 844 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKV 894


>gi|56676371|ref|NP_037423.2| cleavage and polyadenylation specificity factor subunit 1 [Homo
            sapiens]
 gi|23503048|sp|Q10570.2|CPSF1_HUMAN RecName: Full=Cleavage and polyadenylation specificity factor subunit
            1; AltName: Full=Cleavage and polyadenylation specificity
            factor 160 kDa subunit; Short=CPSF 160 kDa subunit
 gi|16878041|gb|AAH17232.1| Cleavage and polyadenylation specific factor 1, 160kDa [Homo sapiens]
 gi|119602516|gb|EAW82110.1| cleavage and polyadenylation specific factor 1, 160kDa, isoform CRA_c
            [Homo sapiens]
 gi|123993607|gb|ABM84405.1| cleavage and polyadenylation specific factor 1, 160kDa [synthetic
            construct]
 gi|123999626|gb|ABM87355.1| cleavage and polyadenylation specific factor 1, 160kDa [synthetic
            construct]
 gi|307684758|dbj|BAJ20419.1| cleavage and polyadenylation specific factor 1, 160kDa [synthetic
            construct]
          Length = 1443

 Score =  180 bits (456), Expect = 6e-43,   Method: Compositional matrix adjust.
 Identities = 82/192 (42%), Positives = 131/192 (68%), Gaps = 3/192 (1%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 1252 PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 1311

Query: 128  FKIRCKPSPISDAPGA---RSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTS 184
            ++  C+ +    +  +    ++ +TW+A+LDG +G  LP+ EK YRRLLMLQN + T   
Sbjct: 1312 WRTPCRGATEGLSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLP 1371

Query: 185  HTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILD 244
            H  GLNPRAFR          N  R ++DG L+ ++L LS  ER E+ KKIG+  + ILD
Sbjct: 1372 HHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDIILD 1431

Query: 245  ELYDIEALSSHF 256
            +L + + +++HF
Sbjct: 1432 DLLETDRVTAHF 1443



 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 26/51 (50%), Positives = 35/51 (68%), Gaps = 4/51 (7%)

Query: 3   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGTLKLRFKKL 49
           +V+E+L V+LG   +RP LLV    ELLIY+AF H     +G LK+RFKK+
Sbjct: 845 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKV 895


>gi|119602512|gb|EAW82106.1| cleavage and polyadenylation specific factor 1, 160kDa, isoform CRA_a
            [Homo sapiens]
 gi|119602513|gb|EAW82107.1| cleavage and polyadenylation specific factor 1, 160kDa, isoform CRA_a
            [Homo sapiens]
 gi|119602514|gb|EAW82108.1| cleavage and polyadenylation specific factor 1, 160kDa, isoform CRA_a
            [Homo sapiens]
          Length = 1365

 Score =  180 bits (456), Expect = 6e-43,   Method: Compositional matrix adjust.
 Identities = 82/192 (42%), Positives = 131/192 (68%), Gaps = 3/192 (1%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 1174 PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 1233

Query: 128  FKIRCKPSPISDAPGA---RSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTS 184
            ++  C+ +    +  +    ++ +TW+A+LDG +G  LP+ EK YRRLLMLQN + T   
Sbjct: 1234 WRTPCRGATEGLSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLP 1293

Query: 185  HTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILD 244
            H  GLNPRAFR          N  R ++DG L+ ++L LS  ER E+ KKIG+  + ILD
Sbjct: 1294 HHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDIILD 1353

Query: 245  ELYDIEALSSHF 256
            +L + + +++HF
Sbjct: 1354 DLLETDRVTAHF 1365



 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 26/51 (50%), Positives = 35/51 (68%), Gaps = 4/51 (7%)

Query: 3   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGTLKLRFKKL 49
           +V+E+L V+LG   +RP LLV    ELLIY+AF H     +G LK+RFKK+
Sbjct: 767 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKV 817


>gi|441648592|ref|XP_004093268.1| PREDICTED: LOW QUALITY PROTEIN: cleavage and polyadenylation
            specificity factor subunit 1 [Nomascus leucogenys]
          Length = 1177

 Score =  180 bits (456), Expect = 6e-43,   Method: Compositional matrix adjust.
 Identities = 82/192 (42%), Positives = 131/192 (68%), Gaps = 3/192 (1%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 986  PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 1045

Query: 128  FKIRCKPSPISDAPGA---RSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTS 184
            ++  C+ +    +  +    ++ +TW+A+LDG +G  LP+ EK YRRLLMLQN + T   
Sbjct: 1046 WRTPCRGATEGLSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLP 1105

Query: 185  HTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILD 244
            H  GLNPRAFR          N  R ++DG L+ ++L LS  ER E+ KKIG+  + ILD
Sbjct: 1106 HHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDIILD 1165

Query: 245  ELYDIEALSSHF 256
            +L + + +++HF
Sbjct: 1166 DLLETDRVTAHF 1177



 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 26/51 (50%), Positives = 35/51 (68%), Gaps = 4/51 (7%)

Query: 3   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGTLKLRFKKL 49
           +V+E+L V+LG   +RP LLV    ELLIY+AF H     +G LK+RFKK+
Sbjct: 763 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKV 813


>gi|296227035|ref|XP_002807684.1| PREDICTED: LOW QUALITY PROTEIN: cleavage and polyadenylation
            specificity factor subunit 1 [Callithrix jacchus]
          Length = 1394

 Score =  180 bits (456), Expect = 6e-43,   Method: Compositional matrix adjust.
 Identities = 82/192 (42%), Positives = 131/192 (68%), Gaps = 3/192 (1%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 1203 PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 1262

Query: 128  FKIRCKPSPISDAPGA---RSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTS 184
            ++  C+ +    +  +    ++ +TW+A+LDG +G  LP+ EK YRRLLMLQN + T   
Sbjct: 1263 WRTPCRGATEGLSKKSVMWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLP 1322

Query: 185  HTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILD 244
            H  GLNPRAFR          N  R ++DG L+ ++L LS  ER E+ KKIG+  + ILD
Sbjct: 1323 HHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDIILD 1382

Query: 245  ELYDIEALSSHF 256
            +L + + +++HF
Sbjct: 1383 DLLETDRVTAHF 1394



 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 26/51 (50%), Positives = 35/51 (68%), Gaps = 4/51 (7%)

Query: 3   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRH----PKGTLKLRFKKL 49
           +V+E+L V+LG   +RP LLV    ELLIY+AF H     +G LK+RFKK+
Sbjct: 796 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLSQGNLKVRFKKV 846


>gi|410042329|ref|XP_003954555.1| PREDICTED: LOW QUALITY PROTEIN: cleavage and polyadenylation
            specificity factor subunit 1 [Pan troglodytes]
          Length = 1296

 Score =  180 bits (456), Expect = 6e-43,   Method: Compositional matrix adjust.
 Identities = 82/192 (42%), Positives = 131/192 (68%), Gaps = 3/192 (1%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 1105 PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 1164

Query: 128  FKIRCKPSPISDAPGA---RSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTS 184
            ++  C+ +    +  +    ++ +TW+A+LDG +G  LP+ EK YRRLLMLQN + T   
Sbjct: 1165 WRTPCRGATEGLSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLP 1224

Query: 185  HTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILD 244
            H  GLNPRAFR          N  R ++DG L+ ++L LS  ER E+ KKIG+  + ILD
Sbjct: 1225 HHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDIILD 1284

Query: 245  ELYDIEALSSHF 256
            +L + + +++HF
Sbjct: 1285 DLLETDRVTAHF 1296


>gi|355698297|gb|EHH28845.1| Cleavage and polyadenylation specificity factor 160 kDa subunit
            [Macaca mulatta]
          Length = 1436

 Score =  180 bits (456), Expect = 6e-43,   Method: Compositional matrix adjust.
 Identities = 82/192 (42%), Positives = 131/192 (68%), Gaps = 3/192 (1%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 1245 PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 1304

Query: 128  FKIRCKPSPISDAPGA---RSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTS 184
            ++  C+ +    +  +    ++ +TW+A+LDG +G  LP+ EK YRRLLMLQN + T   
Sbjct: 1305 WRTPCRGATEGLSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLP 1364

Query: 185  HTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILD 244
            H  GLNPRAFR          N  R ++DG L+ ++L LS  ER E+ KKIG+  + ILD
Sbjct: 1365 HHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDIILD 1424

Query: 245  ELYDIEALSSHF 256
            +L + + +++HF
Sbjct: 1425 DLLETDRVTAHF 1436


>gi|426361048|ref|XP_004047737.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
            [Gorilla gorilla gorilla]
          Length = 1440

 Score =  180 bits (456), Expect = 7e-43,   Method: Compositional matrix adjust.
 Identities = 82/192 (42%), Positives = 131/192 (68%), Gaps = 3/192 (1%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 1249 PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 1308

Query: 128  FKIRCKPSPISDAPGA---RSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTS 184
            ++  C+ +    +  +    ++ +TW+A+LDG +G  LP+ EK YRRLLMLQN + T   
Sbjct: 1309 WRTPCRGATEGLSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLP 1368

Query: 185  HTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILD 244
            H  GLNPRAFR          N  R ++DG L+ ++L LS  ER E+ KKIG+  + ILD
Sbjct: 1369 HHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDIILD 1428

Query: 245  ELYDIEALSSHF 256
            +L + + +++HF
Sbjct: 1429 DLLETDRVTAHF 1440



 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 26/51 (50%), Positives = 35/51 (68%), Gaps = 4/51 (7%)

Query: 3   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGTLKLRFKKL 49
           +V+E+L V+LG   +RP LLV    ELLIY+AF H     +G LK+RFKK+
Sbjct: 842 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKV 892


>gi|402879380|ref|XP_003903320.1| PREDICTED: LOW QUALITY PROTEIN: cleavage and polyadenylation
            specificity factor subunit 1 [Papio anubis]
          Length = 1389

 Score =  179 bits (455), Expect = 8e-43,   Method: Compositional matrix adjust.
 Identities = 82/192 (42%), Positives = 131/192 (68%), Gaps = 3/192 (1%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 1198 PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 1257

Query: 128  FKIRCKPSPISDAPGA---RSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTS 184
            ++  C+ +    +  +    ++ +TW+A+LDG +G  LP+ EK YRRLLMLQN + T   
Sbjct: 1258 WRTPCRGATEGLSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLP 1317

Query: 185  HTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILD 244
            H  GLNPRAFR          N  R ++DG L+ ++L LS  ER E+ KKIG+  + ILD
Sbjct: 1318 HHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDIILD 1377

Query: 245  ELYDIEALSSHF 256
            +L + + +++HF
Sbjct: 1378 DLLETDRVTAHF 1389



 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 26/51 (50%), Positives = 35/51 (68%), Gaps = 4/51 (7%)

Query: 3   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGTLKLRFKKL 49
           +V+E+L V+LG   +RP LLV    ELLIY+AF H     +G LK+RFKK+
Sbjct: 791 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKV 841


>gi|403302917|ref|XP_003942095.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
            [Saimiri boliviensis boliviensis]
          Length = 1390

 Score =  179 bits (454), Expect = 9e-43,   Method: Compositional matrix adjust.
 Identities = 82/192 (42%), Positives = 131/192 (68%), Gaps = 3/192 (1%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 1199 PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 1258

Query: 128  FKIRCKPSPISDAPGA---RSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTS 184
            ++  C+ +    +  +    ++ +TW+A+LDG +G  LP+ EK YRRLLMLQN + T   
Sbjct: 1259 WRTPCRGATEGLSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLP 1318

Query: 185  HTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILD 244
            H  GLNPRAFR          N  R ++DG L+ ++L LS  ER E+ KKIG+  + ILD
Sbjct: 1319 HHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDIILD 1378

Query: 245  ELYDIEALSSHF 256
            +L + + +++HF
Sbjct: 1379 DLLETDRVTAHF 1390



 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 26/51 (50%), Positives = 35/51 (68%), Gaps = 4/51 (7%)

Query: 3   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRH----PKGTLKLRFKKL 49
           +V+E+L V+LG   +RP LLV    ELLIY+AF H     +G LK+RFKK+
Sbjct: 792 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLSQGNLKVRFKKV 842


>gi|390347522|ref|XP_003726804.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Strongylocentrotus purpuratus]
          Length = 1439

 Score =  174 bits (441), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 87/206 (42%), Positives = 128/206 (62%), Gaps = 6/206 (2%)

Query: 56   DRSKRANEQPGLPRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKK 115
            DR+     +   P  ++ CEF+++   M F++SD DKN+++F Y PEA ES+GG  L+++
Sbjct: 1235 DRTLSLVSRDARPLEIFGCEFMVDDKQMAFLVSDADKNLIVFHYHPEAPESHGGAYLLRR 1294

Query: 116  TDFHLGQHVNTFFKIRCK---PSP--ISDAPGARSRFLTWYASLDGALGFFLPLPEKNYR 170
             D ++G  VNTF ++RC+   PS   +   P  R R + ++A+LDG+LG  LP+ EK YR
Sbjct: 1295 GDMNIGSAVNTFVRVRCRLTDPSTEQVLSGPVLR-RQVVFFATLDGSLGLLLPMVEKTYR 1353

Query: 171  RLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLE 230
            RLLMLQNV+     H GGLNP+++R  K       NP R I+DG L+ K+  LS+ ER E
Sbjct: 1354 RLLMLQNVLTNGLPHVGGLNPKSYRHVKSHMRNLNNPHRNILDGDLLLKYCHLSVVERNE 1413

Query: 231  ICKKIGSKHNDILDELYDIEALSSHF 256
              KKIG+  + I+ +L   E L+ HF
Sbjct: 1414 FAKKIGTSVDQIISDLMLAENLTMHF 1439


>gi|449661926|ref|XP_002167992.2| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Hydra magnipapillata]
          Length = 1122

 Score =  174 bits (440), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 85/185 (45%), Positives = 115/185 (62%)

Query: 72   YACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIR 131
            Y  EF I+ S +GF++SD ++N+V+F YQPEA ES GGHRL++K D ++G HVNT  +I+
Sbjct: 938  YTSEFFIDGSQVGFVVSDAERNIVIFSYQPEALESFGGHRLLQKADINIGSHVNTMMRIK 997

Query: 132  CKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNP 191
                  S +  +  R L    +LDG++G   PL EK +RRL MLQN +V    H  GLNP
Sbjct: 998  LIQDEQSLSKSSEQRQLIILPTLDGSIGILFPLSEKPFRRLTMLQNKLVDCLPHKAGLNP 1057

Query: 192  RAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEA 251
            RAFR          NP R I+DG L+ K+ QLS  ER +I KK+G+    ILD++ DIE 
Sbjct: 1058 RAFRALDVPLRTLTNPHRNILDGQLLDKYAQLSFQERFDIAKKMGTTSGQILDDMMDIER 1117

Query: 252  LSSHF 256
             S+H 
Sbjct: 1118 ASNHL 1122


>gi|390358535|ref|XP_789715.3| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Strongylocentrotus purpuratus]
          Length = 1223

 Score =  174 bits (440), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 87/206 (42%), Positives = 128/206 (62%), Gaps = 6/206 (2%)

Query: 56   DRSKRANEQPGLPRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKK 115
            DR+     +   P  ++ CEF+++   M F++SD DKN+++F Y PEA ES+GG  L+++
Sbjct: 1019 DRTLSLVSRDARPLEIFGCEFMVDDKQMAFLVSDADKNLIVFHYHPEAPESHGGAYLLRR 1078

Query: 116  TDFHLGQHVNTFFKIRCK---PSP--ISDAPGARSRFLTWYASLDGALGFFLPLPEKNYR 170
             D ++G  VNTF ++RC+   PS   +   P  R R + ++A+LDG+LG  LP+ EK YR
Sbjct: 1079 GDMNIGSAVNTFVRVRCRLTDPSTEQVLSGPVLR-RQVVFFATLDGSLGLLLPMVEKTYR 1137

Query: 171  RLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLE 230
            RLLMLQNV+     H GGLNP+++R  K       NP R I+DG L+ K+  LS+ ER E
Sbjct: 1138 RLLMLQNVLTNGLPHVGGLNPKSYRHVKSHMRNLNNPHRNILDGDLLLKYCHLSVVERNE 1197

Query: 231  ICKKIGSKHNDILDELYDIEALSSHF 256
              KKIG+  + I+ +L   E L+ HF
Sbjct: 1198 FAKKIGTSVDQIISDLMLAENLTMHF 1223


>gi|395740218|ref|XP_002819588.2| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
            [Pongo abelii]
          Length = 1388

 Score =  173 bits (438), Expect = 7e-41,   Method: Compositional matrix adjust.
 Identities = 80/192 (41%), Positives = 128/192 (66%), Gaps = 3/192 (1%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 1197 PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 1256

Query: 128  FKIRCKPSPISDAPGA---RSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTS 184
            ++  C+ +    +  +    ++ +TW  S+ G +G  LP+ EK YRRLLMLQN + T   
Sbjct: 1257 WRTPCRGAAEGLSKKSVVWENKHITWLVSVRGGIGLLLPMQEKTYRRLLMLQNALTTMLP 1316

Query: 185  HTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILD 244
            H  GLNPRAFR          N  R ++DG L+ ++L LS  ER E+ KKIG+  + ILD
Sbjct: 1317 HHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDIILD 1376

Query: 245  ELYDIEALSSHF 256
            +L + + +++HF
Sbjct: 1377 DLLETDRVTAHF 1388



 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 26/51 (50%), Positives = 35/51 (68%), Gaps = 4/51 (7%)

Query: 3   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGTLKLRFKKL 49
           +V+E+L V+LG   +RP LLV    ELLIY+AF H     +G LK+RFKK+
Sbjct: 792 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKV 842


>gi|357611296|gb|EHJ67409.1| putative cleavage and polyadenylation specific factor 1 [Danaus
           plexippus]
          Length = 328

 Score =  172 bits (435), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 79/187 (42%), Positives = 126/187 (67%), Gaps = 9/187 (4%)

Query: 71  VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
           +Y  +F+I+++S+GF++S+ + N  ++M+QP+ARES GG RLI+K D+HLGQ V+  F++
Sbjct: 150 IYDMQFMIDNTSLGFLVSESEGNFAMYMHQPQARESYGGQRLIRKCDYHLGQRVHAMFRL 209

Query: 131 RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 190
                    A G R   +T + +LDG +G+ LP+ EK YRRLLMLQNV+  +  H  GLN
Sbjct: 210 A--------ARGERQTHVTMFTTLDGGVGYVLPVSEKVYRRLLMLQNVINNYCCHLAGLN 261

Query: 191 PRAFRTYK-GKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI 249
           P+A+RTYK  +    G  +RG++DG LV  +  +   E+ +I +KIG+K  +I+ +LY+I
Sbjct: 262 PKAYRTYKVSRRALCGGAARGVLDGDLVSLYTSMPRTEQQDIARKIGTKVEEIMSDLYEI 321

Query: 250 EALSSHF 256
           +  ++HF
Sbjct: 322 DRQTAHF 328


>gi|33411764|emb|CAD58787.1| cleavage and polyadenylation specificity factor 1 [Bos taurus]
          Length = 180

 Score =  171 bits (433), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 78/175 (44%), Positives = 120/175 (68%), Gaps = 7/175 (4%)

Query: 68  PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
           P  VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 4   PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 63

Query: 128 FKIRCKPSPISDAPGARS-----RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
           ++  C+ +  ++ P  +S     + +TW+A+LDG +G  LP+ EK YRRLLMLQN + T 
Sbjct: 64  WRTPCRGA--AEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTM 121

Query: 183 TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGS 237
             H  GLNPRAFR          N  R ++DG L+ ++L LS  ER E+ KKIG+
Sbjct: 122 LPHHAGLNPRAFRMLHVDRRVLQNAVRNVLDGELLNRYLYLSTMERGELAKKIGT 176


>gi|354491126|ref|XP_003507707.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
            isoform 3 [Cricetulus griseus]
          Length = 1449

 Score =  171 bits (432), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 80/193 (41%), Positives = 125/193 (64%), Gaps = 7/193 (3%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 1250 PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 1309

Query: 128  FKIRCKPSPISDAPGARS-----RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
            ++  C+ +  ++ P  +S     + +TW+A+LDG +G  LP+ EK YRRLLMLQN + T 
Sbjct: 1310 WRTPCRGA--AEGPSKKSVMWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTM 1367

Query: 183  TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
              H  GLNPRAFR          N  R ++DG L+ ++L LS  ER E+ KKIG+  +  
Sbjct: 1368 LPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDIT 1427

Query: 243  LDELYDIEALSSH 255
            +  L     +  H
Sbjct: 1428 VSLLTSNSVIPCH 1440



 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 26/51 (50%), Positives = 35/51 (68%), Gaps = 4/51 (7%)

Query: 3   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGTLKLRFKKL 49
           +V+E+L V+LG   +RP LLV    ELLIY+AF H     +G LK+RFKK+
Sbjct: 843 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKV 893


>gi|410911304|ref|XP_003969130.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Takifugu rubripes]
          Length = 1444

 Score =  170 bits (431), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 86/192 (44%), Positives = 135/192 (70%), Gaps = 3/192 (1%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  VY+ EF+++++ +GF++SD+DKN+ ++MY PEA+ES GG RL+++ DF+ G ++NTF
Sbjct: 1253 PLEVYSIEFMVDNNQLGFLVSDRDKNLYVYMYLPEAKESFGGMRLLRRADFNAGANINTF 1312

Query: 128  FKIRCKPSPISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTS 184
            +++ C+ +  + +  A +   + +TW+A+LDG +G  LP+ EK YRRLLMLQN + T  S
Sbjct: 1313 WRMPCRGALEAGSRKAMTWDNKHITWFATLDGGVGLLLPMQEKTYRRLLMLQNALTTMLS 1372

Query: 185  HTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILD 244
            H  GLNP+AFR          NP + I+DG L+ K+L LS+ ER E+ KKIG+  + ILD
Sbjct: 1373 HHAGLNPKAFRMLHCDRRSLQNPVKNILDGELLNKYLYLSMMERSELAKKIGTTQDIILD 1432

Query: 245  ELYDIEALSSHF 256
            +L DI+ +++HF
Sbjct: 1433 DLLDIDRVTAHF 1444



 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 25/51 (49%), Positives = 35/51 (68%), Gaps = 4/51 (7%)

Query: 3   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAF----RHPKGTLKLRFKKL 49
           +V+E+  VSLG + +RP LLV    ELLIY+AF    + P+  LK+RFKK+
Sbjct: 845 LVKEVTLVSLGYNHSRPYLLVHVDQELLIYEAFPYDQQQPQNNLKVRFKKV 895


>gi|255918233|ref|NP_001157645.1| cleavage and polyadenylation specificity factor subunit 1 isoform 1
            [Mus musculus]
          Length = 1450

 Score =  170 bits (430), Expect = 6e-40,   Method: Compositional matrix adjust.
 Identities = 78/175 (44%), Positives = 120/175 (68%), Gaps = 7/175 (4%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 1250 PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 1309

Query: 128  FKIRCKPSPISDAPGARS-----RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
            ++  C+ +  ++ P  +S     + +TW+A+LDG +G  LP+ EK YRRLLMLQN + T 
Sbjct: 1310 WRTPCRGA--AEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTM 1367

Query: 183  TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGS 237
              H  GLNPRAFR          N  R ++DG L+ ++L LS  ER E+ KKIG+
Sbjct: 1368 LPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERSELAKKIGT 1422



 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 26/51 (50%), Positives = 35/51 (68%), Gaps = 4/51 (7%)

Query: 3   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGTLKLRFKKL 49
           +V+E+L V+LG   +RP LLV    ELLIY+AF H     +G LK+RFKK+
Sbjct: 843 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKV 893


>gi|443684051|gb|ELT88095.1| hypothetical protein CAPTEDRAFT_161045 [Capitella teleta]
          Length = 1410

 Score =  169 bits (427), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 81/217 (37%), Positives = 136/217 (62%), Gaps = 11/217 (5%)

Query: 44   LRFKK-LKVLFVSDRSKRANEQPGLPRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPE 102
            LR+++ +KVL +  +  R       P  VY   ++++ +S+ F+++DK KN +++ YQP+
Sbjct: 1201 LRYQEDMKVLSLVSKDVR-------PLAVYGVAYLVDETSLAFIVADKLKNFLVYCYQPD 1253

Query: 103  ARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTWYASLDGALG 159
              +S GG RLI+K D ++G  VN FF+++C+ S  S +   +S   + +T+Y +LDG++G
Sbjct: 1254 LVQSQGGQRLIRKADINIGSLVNAFFRVKCRVSDPSTSKTDQSLAMKHITYYVTLDGSIG 1313

Query: 160  FFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWK 219
            + LP+ E  YRRL MLQ +++     T GLNP+A+RT + +     N  R IIDG L WK
Sbjct: 1314 YLLPISESLYRRLYMLQKMLIQQVQQTAGLNPKAYRTCQTEFRQLINIQRNIIDGDLAWK 1373

Query: 220  FLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 256
            +L L+  +R E+ K+IG+  + I D+L +I+  + HF
Sbjct: 1374 YLALTSHDRAEMAKRIGTTSHQIEDDLLEIDRCTCHF 1410



 Score = 42.0 bits (97), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 20/49 (40%), Positives = 31/49 (63%), Gaps = 4/49 (8%)

Query: 4   VQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRH----PKGTLKLRFKK 48
           V E++    G++G++PLL+ R   EL IY+ F H     KG L++RFK+
Sbjct: 826 VHEVMLHGFGVNGSQPLLMARVHDELYIYKVFSHVGSKAKGRLQVRFKR 874


>gi|440904368|gb|ELR54893.1| Cleavage and polyadenylation specificity factor subunit 1, partial
            [Bos grunniens mutus]
          Length = 1417

 Score =  169 bits (427), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 78/175 (44%), Positives = 120/175 (68%), Gaps = 7/175 (4%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 1243 PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 1302

Query: 128  FKIRCKPSPISDAPGARS-----RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
            ++  C+ +  ++ P  +S     + +TW+A+LDG +G  LP+ EK YRRLLMLQN + T 
Sbjct: 1303 WRTPCRGA--AEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTM 1360

Query: 183  TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGS 237
              H  GLNPRAFR          N  R ++DG L+ ++L LS  ER E+ KKIG+
Sbjct: 1361 LPHHAGLNPRAFRMLHVDRRVLQNAVRNVLDGELLNRYLYLSPMERGELAKKIGT 1415



 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 26/51 (50%), Positives = 34/51 (66%), Gaps = 4/51 (7%)

Query: 3   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGTLKLRFKKL 49
           +V+E+L V+LG    RP LLV    ELLIY+AF H     +G LK+RFKK+
Sbjct: 836 LVKEVLLVALGSRQRRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKV 886


>gi|156364999|ref|XP_001626630.1| predicted protein [Nematostella vectensis]
 gi|156213514|gb|EDO34530.1| predicted protein [Nematostella vectensis]
          Length = 1420

 Score =  168 bits (426), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 83/189 (43%), Positives = 120/189 (63%), Gaps = 5/189 (2%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
            VYA +F I+   +GF++SD +KN+VLF YQPEA ES GG RL+++ D ++G H+ +FF+I
Sbjct: 1234 VYAADFFIDGPQIGFLVSDVEKNLVLFTYQPEAIESQGGQRLLQRADINVGTHITSFFRI 1293

Query: 131  RCKPSPISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 187
              K      A G +S   R LT + +LDGALG  LP+ EK +RRL MLQ  +V    H  
Sbjct: 1294 AAKAH--LKASGEKSKEMRQLTCFGTLDGALGLMLPMTEKTFRRLHMLQTKLVDCIPHVA 1351

Query: 188  GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELY 247
            GLNP+AFR  + +     NP R ++D  L++K++ LS  ER E+ +KIG+    I+D++ 
Sbjct: 1352 GLNPKAFRMLQWRKRKLCNPHRNVLDWQLLFKYMHLSFMERQEVARKIGTTPAQIMDDMM 1411

Query: 248  DIEALSSHF 256
            DIE   + F
Sbjct: 1412 DIERACAQF 1420



 Score = 42.4 bits (98), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 25/64 (39%), Positives = 37/64 (57%), Gaps = 3/64 (4%)

Query: 4   VQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP--KGTLKLRFKKLKVLFVSDRSKRA 61
           V+E+L   LG    R  L+     +LLIY+AF +P  +G L LRFKKL+   +  R K+ 
Sbjct: 831 VREVLLTGLGYKNRRATLVAVMDQDLLIYEAFSYPTVEGHLNLRFKKLQ-HNIQIREKKP 889

Query: 62  NEQP 65
            ++P
Sbjct: 890 KQEP 893


>gi|195583398|ref|XP_002081509.1| GD25678 [Drosophila simulans]
 gi|194193518|gb|EDX07094.1| GD25678 [Drosophila simulans]
          Length = 1450

 Score =  166 bits (420), Expect = 8e-39,   Method: Compositional matrix adjust.
 Identities = 76/159 (47%), Positives = 109/159 (68%), Gaps = 10/159 (6%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  VY  EF++++S++GF+++D ++N++++MYQPEARES GG +L++K D+HLGQ VNT 
Sbjct: 1265 PLEVYGIEFMVDNSNLGFLVTDAERNLIVYMYQPEARESLGGQKLLRKADYHLGQVVNTM 1324

Query: 128  FKIRCKPSPISDAPGARSRFL------TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVT 181
            F+++C    +      R  FL        Y +LDGALG+ LPLPEK YRR LMLQNV+++
Sbjct: 1325 FRVQCHQKGLHQ----RQPFLYENKHFVVYGTLDGALGYCLPLPEKVYRRFLMLQNVLLS 1380

Query: 182  HTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKF 220
            +  H  GLNP+ +RT K       NPSR IIDG L+W +
Sbjct: 1381 YQEHLCGLNPKEYRTLKSSKKQGINPSRCIIDGDLIWSY 1419



 Score = 66.2 bits (160), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 34/60 (56%), Positives = 42/60 (70%), Gaps = 6/60 (10%)

Query: 6   ELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGTLKLRFKKLKVLFVSDRSKRANEQP 65
           EL  + LGL+G RPLLLVRT+ ELLIYQ FR+PKG LK+RF+KL      D     ++QP
Sbjct: 864 ELSVIGLGLNGERPLLLVRTRVELLIYQVFRYPKGHLKIRFRKL------DXXNLLDQQP 917


>gi|340371789|ref|XP_003384427.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Amphimedon queenslandica]
          Length = 1408

 Score =  165 bits (417), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 80/195 (41%), Positives = 120/195 (61%), Gaps = 6/195 (3%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  V++  F+I+  ++GF++SD D+N+ LF YQPE   S+GG  L++  D H+G  VN F
Sbjct: 1214 PMEVFSTTFMIDGKALGFLVSDSDRNITLFQYQPENPASSGGANLVRCGDIHVGSLVNVF 1273

Query: 128  FKIRCKPSPISDAPG------ARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVT 181
              IRCK S    A        A  R  T++ +LDG +G  LP+PEK YRRL MLQ  M  
Sbjct: 1274 LNIRCKTSAGLGASREMKIALADKRQCTFFGTLDGGIGCLLPIPEKVYRRLSMLQVKMTQ 1333

Query: 182  HTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHND 241
               H  GLNP+AFRT++ +  Y  N  R I+DG+L++++L L+  E+ +  K+IG+    
Sbjct: 1334 GMRHMAGLNPKAFRTFQTRHQYLHNAQRNILDGTLLYQYLSLTAKEKFDFSKQIGTTVAQ 1393

Query: 242  ILDELYDIEALSSHF 256
            I+++L +I+ + SHF
Sbjct: 1394 IMEDLKEIDKVMSHF 1408



 Score = 42.4 bits (98), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 22/52 (42%), Positives = 35/52 (67%), Gaps = 6/52 (11%)

Query: 4   VQELLTVSLGLHGNRPLLLVRTQHELLIYQAFR-----HPKGTLKLRFKKLK 50
           V+++L V +GL+G +P ++     EL+IY+AF+     HP G LKLRF K++
Sbjct: 813 VEQVLCVGMGLNGKKPHIMAFINKELVIYEAFQYTSAIHP-GHLKLRFSKVQ 863


>gi|47217773|emb|CAG05995.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 1446

 Score =  162 bits (410), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 88/217 (40%), Positives = 136/217 (62%), Gaps = 28/217 (12%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  VY+ EF++++S +GF++SD+DKN+ ++MY PEA+ES GG RL+++ DF+ G ++NTF
Sbjct: 1230 PLEVYSIEFMVDNSQLGFLVSDRDKNLYVYMYLPEAKESFGGMRLLRRADFNAGANINTF 1289

Query: 128  FKIRCKPSPISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTS 184
            +++ C+ +  + +  A +   + +TW+A+LDG +G  LP+ EK YRRLLMLQN + T  S
Sbjct: 1290 WRMPCRGALEAGSRKAMTWDNKHITWFATLDGGVGLLLPMQEKTYRRLLMLQNALTTMLS 1349

Query: 185  HTGGLNPRAFRTYKGKGYYAG-------------------------NPSRGIIDGSLVWK 219
            H  GLNP+AFR        A                          NP + I+DG L+ K
Sbjct: 1350 HHAGLNPKAFRCVGADRTSAAMLSGMLPDFATSVSRMLHCDRRSLQNPVKNILDGELLNK 1409

Query: 220  FLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 256
            +L LS+ ER E+ KKIG+  + ILD+L DI+ +++HF
Sbjct: 1410 YLYLSMMERSELAKKIGTTQDIILDDLLDIDRVTAHF 1446



 Score = 49.3 bits (116), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 24/51 (47%), Positives = 36/51 (70%), Gaps = 4/51 (7%)

Query: 3   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAF----RHPKGTLKLRFKKL 49
           +V+E+  VSLG + +RP LLV  + ELL+Y+AF    + P+  LK+RFKK+
Sbjct: 815 LVKEVTLVSLGYNHSRPYLLVHVEQELLVYEAFPYDQQQPQNNLKVRFKKV 865


>gi|291232722|ref|XP_002736302.1| PREDICTED: cleavage and polyadenylation specific factor 1-like
           [Saccoglossus kowalevskii]
          Length = 984

 Score =  154 bits (388), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 79/174 (45%), Positives = 114/174 (65%), Gaps = 11/174 (6%)

Query: 88  SDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPS-PISD----APG 142
           SD+D+++ L       RES GG RL+++ DF+ G HV +FF++R K S P ++     P 
Sbjct: 817 SDEDRSLSLV-----TRESFGGQRLLRRADFNAGSHVCSFFRMRSKLSDPATEKLLTGPM 871

Query: 143 ARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGY 202
            R R +T +A+LDG++G+ +P+ EK YRRLLMLQN + T T HT GLNP+ FR  K +  
Sbjct: 872 ER-RHVTMFATLDGSIGYLIPMTEKTYRRLLMLQNALTTQTLHTAGLNPKGFRMVKHQTK 930

Query: 203 YAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 256
              N  + I+DG L+WK+  LS+ ER E+ KKIG+    ILD+L D+E L++HF
Sbjct: 931 SLENTHKNILDGDLLWKYTFLSVNERTELAKKIGTSVEQILDDLMDVERLTAHF 984



 Score = 40.0 bits (92), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 22/53 (41%), Positives = 32/53 (60%), Gaps = 4/53 (7%)

Query: 3   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGT----LKLRFKKLKV 51
           IV+ELL + LG    +  LL R   +L IY+AF H + +    L+LRF+K+ V
Sbjct: 473 IVKELLLIGLGHKNKKTHLLARVDEDLYIYEAFTHDQSSLDNHLRLRFRKVFV 525


>gi|301773406|ref|XP_002922132.1| PREDICTED: LOW QUALITY PROTEIN: cleavage and polyadenylation
            specificity factor subunit 1-like [Ailuropoda
            melanoleuca]
          Length = 1469

 Score =  137 bits (345), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 70/194 (36%), Positives = 121/194 (62%), Gaps = 24/194 (12%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 1295 PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 1354

Query: 128  FKIRCKPSPISDAPGARS-----RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
            ++  C+ +  ++ P  +S     + +TW+A+LDG +G  LP+ EK  R    LQ      
Sbjct: 1355 WRTPCRGA--AEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTNR----LQPAXSPR 1408

Query: 183  TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
              H   ++ R  +          N  R ++DG L+ ++L LS  ER E+ KKIG+  + I
Sbjct: 1409 MLH---VDRRILQ----------NAVRNVLDGELLNRYLYLSTMERGELAKKIGTTPDII 1455

Query: 243  LDELYDIEALSSHF 256
            LD+L + + +++HF
Sbjct: 1456 LDDLLETDRVTAHF 1469



 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 26/51 (50%), Positives = 35/51 (68%), Gaps = 4/51 (7%)

Query: 3   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGTLKLRFKKL 49
           +V+E+L V+LG   +RP LLV    ELLIY+AF H     +G LK+RFKK+
Sbjct: 881 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKV 931


>gi|168021793|ref|XP_001763425.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162685218|gb|EDQ71614.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 1452

 Score =  134 bits (338), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 73/187 (39%), Positives = 111/187 (59%), Gaps = 6/187 (3%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
             YA EF+I+ S++  ++SD  KN+ +F Y P++ ES  G +L+ + +FHLG HVN F ++
Sbjct: 1270 CYATEFLIDGSTLSLLVSDSRKNLQIFSYAPKSMESWKGQKLLSRAEFHLGAHVNKFHRL 1329

Query: 131  RCKPSPISDAPGARS-RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGL 189
            +  P+P S    ARS R+   + +LDGA+ +  PL E  +RRL  LQ  +V   SH  G+
Sbjct: 1330 QMLPTPGS----ARSNRYAVLFGTLDGAIDYLAPLDELTFRRLHTLQRKLVDCVSHVAGV 1385

Query: 190  NPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI 249
            NPRAFR ++  G         I+D  L+  +  L L E+LEI ++IG+    +L  L D+
Sbjct: 1386 NPRAFRQFRCDGKAHRPGPDNIVDCELLSHYDMLPLDEQLEIARQIGTTRAHVLSNLRDL 1445

Query: 250  EALSSHF 256
             ALS+ F
Sbjct: 1446 -ALSTSF 1451


>gi|324499955|gb|ADY39993.1| Cleavage and polyadenylation specificity factor subunit 1 [Ascaris
            suum]
          Length = 1434

 Score =  133 bits (335), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 75/219 (34%), Positives = 132/219 (60%), Gaps = 14/219 (6%)

Query: 44   LRFKK-LKVLFVSDRSKRANEQPGLPRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPE 102
            LR+++  K L ++ R  RA  QP +     A +F+I++  M F++SD+  N+ +F Y PE
Sbjct: 1224 LRYQEEYKALSLASRDMRAVVQPPM-----AAQFLIDNRQMAFIMSDEAANIAVFNYLPE 1278

Query: 103  ARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSP--ISDAPGARSRFLTWYASLDGALGF 160
            A ES+GG RLI +++ ++G +VN+F +++   S   + +   + +R    + SLDG+ GF
Sbjct: 1279 ALESSGGERLILRSEINIGTNVNSFMRVKGHISSGFVENEHYSLNRQSVLFCSLDGSFGF 1338

Query: 161  FLPLPEKNYRRLLMLQNVMVTHTSHTGGLN---PRAFRTYKGKGYYAGNPSRGIIDGSLV 217
              PL EK +RRL MLQ +M +  +   GLN    RA R  +   Y     +R ++DG +V
Sbjct: 1339 VRPLSEKVFRRLHMLQQLMSSLVAQAAGLNVKGSRAARPQRPNHYLN---TRNMVDGDVV 1395

Query: 218  WKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 256
            +++L LSL ++ ++ +K+G+    I+D+L +I  L++H+
Sbjct: 1396 FQYLHLSLADKNDLARKLGTSRYHIIDDLTEISRLTTHY 1434


>gi|328773280|gb|EGF83317.1| hypothetical protein BATDEDRAFT_21894 [Batrachochytrium dendrobatidis
            JAM81]
          Length = 1673

 Score =  130 bits (326), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 67/185 (36%), Positives = 111/185 (60%), Gaps = 7/185 (3%)

Query: 72   YACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIR 131
            YA   +I+ + +  +++D DKN+   +Y P+  +S GG RLI+K + HLGQHV+ F ++R
Sbjct: 1482 YAANMLIDDNQLALLVADGDKNLHTMIYAPDNVQSLGGERLIRKGEIHLGQHVSKFIRMR 1541

Query: 132  CKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNP 191
             KP   +DA     ++L   A+LDGAL    P+ E+ ++RL  L + MVT   H  GLNP
Sbjct: 1542 RKPLLRNDAIVFSKQYLNVAATLDGALEIITPVSERIFKRLYGLYSRMVTSIEHIAGLNP 1601

Query: 192  RAFRTYKGK-------GYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILD 244
            R FR  + +       G+      RGI+DG L++++++LS  ++  + K IGSK + ++D
Sbjct: 1602 RGFRQAQHRVRPITLSGFIGPPGPRGILDGDLLYEYVRLSRTQQRGLAKAIGSKDDRLMD 1661

Query: 245  ELYDI 249
            +L ++
Sbjct: 1662 DLLEV 1666


>gi|422295485|gb|EKU22784.1| cleavage and polyadenylation specificity factor subunit 1
           [Nannochloropsis gaditana CCMP526]
          Length = 395

 Score =  128 bits (321), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 67/202 (33%), Positives = 107/202 (52%), Gaps = 8/202 (3%)

Query: 52  LFVSDRSKRANEQPGLPRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHR 111
           L   D S   + QPG         FV++  S+G ++ D   N+ L  Y P   ES GG+R
Sbjct: 199 LMGKDYSPIVSAQPG------GSHFVVDPPSLGMLVGDSRGNLQLLQYDPADVESRGGNR 252

Query: 112 LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRR 171
           L+++ DFHL   ++     R    P   A  A  R +  + S++G +G  +P+ EK YRR
Sbjct: 253 LVRRADFHLSHRLSFLQHTRMAEVPRPGAYRAGVRVMV-FGSVEGGVGALVPVEEKVYRR 311

Query: 172 LLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEI 231
           L  LQ VMV    H G  NPR FR  + +G+  G   +G +DG L+W+F  LS+G++ ++
Sbjct: 312 LYALQAVMVNALPHVGAFNPRGFRLVEARGWAQGR-KKGTLDGELLWRFAGLSVGKQEDL 370

Query: 232 CKKIGSKHNDILDELYDIEALS 253
              IG+    +L+ L +++ ++
Sbjct: 371 ASAIGTSREMVLESLLEVDMMT 392


>gi|313215162|emb|CBY42850.1| unnamed protein product [Oikopleura dioica]
          Length = 228

 Score =  127 bits (319), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 68/211 (32%), Positives = 120/211 (56%), Gaps = 26/211 (12%)

Query: 57  RSKRANEQPGLPRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKT 116
           R++RA E       V A   +++ + +G + +D  +N+ ++MY+P+ +ESNGG +L+++ 
Sbjct: 33  RARRAGE-------VTAVGILLDGNQVGLVSTDMQRNLQVYMYKPDQKESNGGKQLVRQA 85

Query: 117 DFHLGQHV-----------NTFFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLP 165
           D +LG+ V           +TF K+    +         +R +T+YA LDG++G  +P+ 
Sbjct: 86  DINLGKRVISIWNSLGRQNDTFTKVALTEND--------ARHVTFYAGLDGSIGDIVPVS 137

Query: 166 EKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSL 225
           EK +RRL MLQ ++ +H  H GGLNPR +R    +     N ++ IIDG L+ +F  LS 
Sbjct: 138 EKVFRRLEMLQTLVQSHLPHYGGLNPREYRYCTNEYRDLENAAKNIIDGDLLERFNGLSF 197

Query: 226 GERLEICKKIGSKHNDILDELYDIEALSSHF 256
            E+ ++ +KIG     +LD++ D++   + F
Sbjct: 198 TEQTDLSRKIGVTREALLDDMMDVQRTKNLF 228


>gi|393907594|gb|EJD74706.1| hypothetical protein LOAG_18016 [Loa loa]
          Length = 398

 Score =  127 bits (318), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 70/216 (32%), Positives = 125/216 (57%), Gaps = 8/216 (3%)

Query: 44  LRFKK-LKVLFVSDRSKRANEQPGLPRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPE 102
           LR+++  K L ++ R  R++ QP +     A +F+I++  MGF++SD+  N+ +F Y PE
Sbjct: 188 LRYQEEYKALSLASRDMRSDVQPPM-----AAQFIIDNKQMGFVMSDEAANIAIFNYLPE 242

Query: 103 ARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSP--ISDAPGARSRFLTWYASLDGALGF 160
             ES GG +L  + + ++G  VN+F +++   S   + +   +  R    +ASLDG+ GF
Sbjct: 243 TLESLGGEKLTLRAEINIGTVVNSFIRVKGHISSGFVENELFSLERQSVLFASLDGSFGF 302

Query: 161 FLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKF 220
             PL EK +RRL MLQ +M +      GLN +  R  +         +R ++DG +V ++
Sbjct: 303 LRPLTEKVFRRLHMLQQLMSSMVPQPAGLNAKGARAARPPRPNHYLNTRNLVDGDMVMQY 362

Query: 221 LQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 256
           L LSL E+ ++ +K+G+    I+D+L +I  +++H+
Sbjct: 363 LHLSLPEKNDLARKLGTSRYHIIDDLIEICRVTAHY 398


>gi|312069702|ref|XP_003137805.1| hypothetical protein LOAG_02219 [Loa loa]
          Length = 1065

 Score =  127 bits (318), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 70/216 (32%), Positives = 125/216 (57%), Gaps = 8/216 (3%)

Query: 44   LRFKK-LKVLFVSDRSKRANEQPGLPRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPE 102
            LR+++  K L ++ R  R++ QP +     A +F+I++  MGF++SD+  N+ +F Y PE
Sbjct: 855  LRYQEEYKALSLASRDMRSDVQPPM-----AAQFIIDNKQMGFVMSDEAANIAIFNYLPE 909

Query: 103  ARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSP--ISDAPGARSRFLTWYASLDGALGF 160
              ES GG +L  + + ++G  VN+F +++   S   + +   +  R    +ASLDG+ GF
Sbjct: 910  TLESLGGEKLTLRAEINIGTVVNSFIRVKGHISSGFVENELFSLERQSVLFASLDGSFGF 969

Query: 161  FLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKF 220
              PL EK +RRL MLQ +M +      GLN +  R  +         +R ++DG +V ++
Sbjct: 970  LRPLTEKVFRRLHMLQQLMSSMVPQPAGLNAKGARAARPPRPNHYLNTRNLVDGDMVMQY 1029

Query: 221  LQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 256
            L LSL E+ ++ +K+G+    I+D+L +I  +++H+
Sbjct: 1030 LHLSLPEKNDLARKLGTSRYHIIDDLIEICRVTAHY 1065


>gi|242075246|ref|XP_002447559.1| hypothetical protein SORBIDRAFT_06g003570 [Sorghum bicolor]
 gi|241938742|gb|EES11887.1| hypothetical protein SORBIDRAFT_06g003570 [Sorghum bicolor]
          Length = 389

 Score =  126 bits (316), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 66/177 (37%), Positives = 101/177 (57%), Gaps = 1/177 (0%)

Query: 72  YACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIR 131
           +A EF+I+ S++  ++SD DKNV +F Y P+  ES  G +L+ + +FH+G HV+ F +++
Sbjct: 206 FATEFLIDGSTLSLVVSDSDKNVQIFYYAPKMVESWKGQKLLSRAEFHVGAHVSKFLRLQ 265

Query: 132 CKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNP 191
             P+    A    +RF   + +LDG +G   P+ E  +RRL  LQ  +V    H  GLNP
Sbjct: 266 MLPTQ-GLASEKTNRFALVFGTLDGGIGCIAPVDELTFRRLQSLQRKLVDAVPHVCGLNP 324

Query: 192 RAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYD 248
           R+FR +K  G         IID  L+  +  LSL E+LEI ++IG+  + IL    D
Sbjct: 325 RSFRHFKSNGKAHRPGPDNIIDFELLSHYEMLSLEEQLEIAQQIGTTRSQILSNFSD 381


>gi|428186188|gb|EKX55039.1| hypothetical protein GUITHDRAFT_160593 [Guillardia theta CCMP2712]
          Length = 2290

 Score =  125 bits (315), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 67/186 (36%), Positives = 104/186 (55%), Gaps = 6/186 (3%)

Query: 66   GLPRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVN 125
             LP  VYA EFV+  S+ G + SD+ KNV +F++ P + E     +LI + D H+G H+N
Sbjct: 1402 ALPLSVYAAEFVVMGSNFGLLASDEQKNVQVFVFNPNSPEYRR-QQLICRADLHVGSHIN 1460

Query: 126  TFFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSH 185
             F +    P P     G R+     Y +LDG +G  +P+PE++YRRLL LQN++VT   H
Sbjct: 1461 KFIRW---PLPFRPTLGVRT--AAHYTTLDGGIGAIIPIPEQSYRRLLALQNLLVTAMPH 1515

Query: 186  TGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDE 245
              GLNPR++R YK         ++  +DG+L+ ++L L L  ++++   +      IL +
Sbjct: 1516 YAGLNPRSWRLYKPAMCMKRRYAKNFLDGNLLGRYLHLDLALQMQLSSALNQTREAILGD 1575

Query: 246  LYDIEA 251
            L    A
Sbjct: 1576 LPSTSA 1581


>gi|308459872|ref|XP_003092248.1| CRE-CPSF-1 protein [Caenorhabditis remanei]
 gi|308253976|gb|EFO97928.1| CRE-CPSF-1 protein [Caenorhabditis remanei]
          Length = 1448

 Score =  125 bits (314), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 68/220 (30%), Positives = 125/220 (56%), Gaps = 12/220 (5%)

Query: 44   LRFKKL-KVLFVSDRSKRANEQPGLPRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPE 102
            +RF++  K + ++ R  R   +P +     A +FV++ + +GF++SD++ N+ LF Y PE
Sbjct: 1234 IRFQEENKAMSIASRDDRRTAKPPM-----AAQFVVDGAHLGFLLSDENGNITLFNYSPE 1288

Query: 103  ARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPIS-----DAPGARSRFLTWYASLDGA 157
            A ESNGG RL  +   ++G +VN F +++   S ++     +      R  T + SLDG+
Sbjct: 1289 APESNGGERLTVRAAMNIGTNVNAFLRVKGHTSLLNLQSDEEKESVEQRMSTIFGSLDGS 1348

Query: 158  LGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYK-GKGYYAGNPSRGIIDGSL 216
             GF  PL EK+YRRL  LQ  + + T    GL+ +  R+ +  +    G  +R +IDG +
Sbjct: 1349 FGFVRPLSEKSYRRLHFLQTFIGSVTPQIAGLHIKGARSARPAQPIVNGRNARNLIDGDV 1408

Query: 217  VWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 256
            V ++L LSL ++ ++ +++G     I+D+L  +  ++ ++
Sbjct: 1409 VEQYLHLSLYDKTDLARRLGVGRYHIIDDLMHLRRMAYYY 1448


>gi|356530945|ref|XP_003534039.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Glycine max]
          Length = 1449

 Score =  125 bits (314), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 64/183 (34%), Positives = 102/183 (55%), Gaps = 4/183 (2%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
             +A EF+I+ S++  M+SD ++N+ +F Y P+  ES  G +L+ + +FH+G HV  F ++
Sbjct: 1260 CFATEFLIDGSTLSLMVSDDNRNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRL 1319

Query: 131  RCKPSP--ISDAPGA--RSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHT 186
            +   +       PG+   +RF   + +LDG++G   PL E  +RRL  LQ  +V    H 
Sbjct: 1320 QMLSTSDRAGSVPGSDKTNRFALLFGTLDGSIGCIAPLDEITFRRLQSLQRKLVDAVPHV 1379

Query: 187  GGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEL 246
             GLNPRAFR ++  G         I+D  L+  +  L L E+LEI  +IG+  + IL  L
Sbjct: 1380 AGLNPRAFRLFRSNGKAHRPGPDSIVDCELLCHYEMLPLEEQLEIANQIGTTRSQILSNL 1439

Query: 247  YDI 249
             D+
Sbjct: 1440 SDL 1442


>gi|313232279|emb|CBY09388.1| unnamed protein product [Oikopleura dioica]
          Length = 1451

 Score =  125 bits (314), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 68/211 (32%), Positives = 120/211 (56%), Gaps = 26/211 (12%)

Query: 57   RSKRANEQPGLPRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKT 116
            R++RA E       V A   +++ + +G + +D  +N+ ++MY+P+ +ESNGG +L+++ 
Sbjct: 1256 RARRAGE-------VTAVGILLDGNQVGLVSTDMQRNLQVYMYKPDQKESNGGKQLVRQA 1308

Query: 117  DFHLGQHV-----------NTFFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLP 165
            D +LG+ V           +TF K+    +         +R +T+YA LDG++G  +P+ 
Sbjct: 1309 DINLGKRVISIWNSLGRQNDTFTKVALTEND--------ARHVTFYAGLDGSIGDIVPVS 1360

Query: 166  EKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSL 225
            EK +RRL MLQ ++ +H  H GGLNPR +R    +     N ++ IIDG L+ +F  LS 
Sbjct: 1361 EKVFRRLEMLQTLVQSHLPHYGGLNPREYRYCTNEYRDLENAAKNIIDGDLLERFNGLSF 1420

Query: 226  GERLEICKKIGSKHNDILDELYDIEALSSHF 256
             E+ ++ +KIG     +LD++ D++   + F
Sbjct: 1421 TEQTDLSRKIGVTREALLDDMMDVQRTKNLF 1451


>gi|356559917|ref|XP_003548242.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Glycine max]
          Length = 1447

 Score =  125 bits (313), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 65/185 (35%), Positives = 103/185 (55%), Gaps = 8/185 (4%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
             +A EF+I+ S++  M+SD ++N+ +F Y P+  ES  G +L+ + +FH+G HV  F ++
Sbjct: 1258 CFATEFLIDGSTLSLMVSDDNRNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRL 1317

Query: 131  RCKPSPISDAPGA------RSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTS 184
            +   +  SD  GA       +RF   + +LDG++G   PL E  +RRL  LQ  +V    
Sbjct: 1318 QMLST--SDRAGAVPGSDKTNRFALLFGTLDGSIGCIAPLDEITFRRLQSLQRKLVDAVP 1375

Query: 185  HTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILD 244
            H  GLNPRAFR ++  G         I+D  L+  +  L L E+LEI  ++G+  + IL 
Sbjct: 1376 HVAGLNPRAFRLFRSNGKAHRPGPDSIVDCELLCHYEMLPLEEQLEIAHQVGTTRSQILS 1435

Query: 245  ELYDI 249
             L D+
Sbjct: 1436 NLSDL 1440


>gi|402590016|gb|EJW83947.1| hypothetical protein WUBG_05142 [Wuchereria bancrofti]
          Length = 374

 Score =  124 bits (312), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 69/216 (31%), Positives = 126/216 (58%), Gaps = 8/216 (3%)

Query: 44  LRFKK-LKVLFVSDRSKRANEQPGLPRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPE 102
           LR+++  K L ++ R  R++ QP +     A +F+I++  MGF++SD+  N+ +F Y PE
Sbjct: 164 LRYQEEYKALSLASRDMRSDVQPPM-----AAQFLIDNKQMGFIMSDEAANIAIFNYLPE 218

Query: 103 ARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSP--ISDAPGARSRFLTWYASLDGALGF 160
             ES GG +L  + + ++G  VN+F +++   S   + +   +  R    +ASLDG+ G+
Sbjct: 219 TLESLGGEKLTLRAEINIGTVVNSFIRVKGHISSGFVENELFSLERQSVLFASLDGSFGY 278

Query: 161 FLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKF 220
             PL EK +RRL MLQ +M +      GLN +  R  + +       +R ++DG +V ++
Sbjct: 279 LRPLTEKVFRRLHMLQQLMSSMVLQPAGLNAKGARAARPQRPNHYLNTRNLVDGDVVMQY 338

Query: 221 LQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 256
           L LSL E+ ++ +K+G+    I+D+L +I  +++H+
Sbjct: 339 LHLSLPEKNDLARKLGTSRYHIIDDLNEICRVTAHY 374


>gi|25148482|ref|NP_500157.2| Protein CPSF-1 [Caenorhabditis elegans]
 gi|22096347|sp|Q9N4C2.2|CPSF1_CAEEL RecName: Full=Probable cleavage and polyadenylation specificity
            factor subunit 1; AltName: Full=Cleavage and
            polyadenylation specificity factor 160 kDa subunit;
            Short=CPSF 160 kDa subunit
 gi|373220398|emb|CCD73182.1| Protein CPSF-1 [Caenorhabditis elegans]
          Length = 1454

 Score =  124 bits (311), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 69/220 (31%), Positives = 123/220 (55%), Gaps = 12/220 (5%)

Query: 44   LRFKKL-KVLFVSDRSKRANEQPGLPRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPE 102
            +RF++  K + ++ R  R   QP +     A + V++ + +GF++SD+  N+ +F Y PE
Sbjct: 1240 IRFQEDNKAMSIASRDDRKCAQPPM-----ASQLVVDGAHVGFLLSDETGNITMFNYAPE 1294

Query: 103  ARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPI-----SDAPGARSRFLTWYASLDGA 157
            A ESNGG RL  +   ++G ++N F ++R   S +      +      R  T +ASLDG+
Sbjct: 1295 APESNGGERLTVRAAINIGTNINAFVRLRGHTSLLQLNNEDEKEAIEQRMTTVFASLDGS 1354

Query: 158  LGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYK-GKGYYAGNPSRGIIDGSL 216
             GF  PL EK+YRRL  LQ  + + T    GL+ +  R+ K  +    G  +R +IDG +
Sbjct: 1355 FGFVRPLTEKSYRRLHFLQTFIGSVTPQIAGLHIKGSRSAKPSQPIVNGRNARNLIDGDV 1414

Query: 217  VWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 256
            V ++L LSL ++ ++ +++G     I+D+L  +  ++ ++
Sbjct: 1415 VEQYLHLSLYDKTDLARRLGVGRYHIIDDLMQLRRMAFYY 1454


>gi|302814354|ref|XP_002988861.1| hypothetical protein SELMODRAFT_184138 [Selaginella moellendorffii]
 gi|300143432|gb|EFJ10123.1| hypothetical protein SELMODRAFT_184138 [Selaginella moellendorffii]
          Length = 1413

 Score =  124 bits (310), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 62/186 (33%), Positives = 103/186 (55%), Gaps = 3/186 (1%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
             YA EF+I+ S++  ++SD  KN+ +F Y P+  ES  G +L+ + +FHLG HV  F ++
Sbjct: 1231 CYATEFLIDGSTLSLLVSDSRKNIQVFSYAPKNAESWKGQKLLPRVEFHLGSHVTKFLRL 1290

Query: 131  RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 190
            +   +P S      +RF   + +LDG +G+  PL E  +RRL  LQ  +V    H  GLN
Sbjct: 1291 QMLQTPGS---SRTNRFALCFGTLDGGIGYITPLDELTFRRLQTLQRKLVDLVPHVAGLN 1347

Query: 191  PRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIE 250
            P+A+R ++  G +  +     +D   + ++  LSL +++ I ++IG+    I   L DI 
Sbjct: 1348 PKAYRQFQANGEHHKHGPDNTVDSEQLREYESLSLDKQVAIARQIGTTRQQIFANLRDIS 1407

Query: 251  ALSSHF 256
              +S F
Sbjct: 1408 LSTSFF 1413


>gi|302761560|ref|XP_002964202.1| hypothetical protein SELMODRAFT_82277 [Selaginella moellendorffii]
 gi|300167931|gb|EFJ34535.1| hypothetical protein SELMODRAFT_82277 [Selaginella moellendorffii]
          Length = 1413

 Score =  124 bits (310), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 62/186 (33%), Positives = 103/186 (55%), Gaps = 3/186 (1%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
             YA EF+I+ S++  ++SD  KN+ +F Y P+  ES  G +L+ + +FHLG HV  F ++
Sbjct: 1231 CYATEFLIDGSTLSLLVSDSRKNIQVFSYAPKNAESWKGQKLLPRVEFHLGSHVTKFLRL 1290

Query: 131  RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 190
            +   +P S      +RF   + +LDG +G+  PL E  +RRL  LQ  +V    H  GLN
Sbjct: 1291 QMLQTPGS---SRTNRFALCFGTLDGGIGYITPLDELTFRRLQTLQRKLVDLVPHVAGLN 1347

Query: 191  PRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIE 250
            P+A+R ++  G +  +     +D   + ++  LSL +++ I ++IG+    I   L DI 
Sbjct: 1348 PKAYRQFQANGEHHKHGPDNTVDSEQLREYESLSLDKQVAIARQIGTTRQQIFANLRDIS 1407

Query: 251  ALSSHF 256
              +S F
Sbjct: 1408 LSTSFF 1413


>gi|170576536|ref|XP_001893668.1| CPSF A subunit region family protein [Brugia malayi]
 gi|158600196|gb|EDP37499.1| CPSF A subunit region family protein [Brugia malayi]
          Length = 1323

 Score =  123 bits (309), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 68/216 (31%), Positives = 125/216 (57%), Gaps = 8/216 (3%)

Query: 44   LRFKK-LKVLFVSDRSKRANEQPGLPRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPE 102
            LR+++  K L ++ R  R++ QP +     A +F+I++  MGF++SD+  N+ +F Y PE
Sbjct: 1113 LRYQEEYKALSLASRDMRSDVQPPM-----AAQFLIDNKQMGFIMSDEAANIAIFNYLPE 1167

Query: 103  ARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSP--ISDAPGARSRFLTWYASLDGALGF 160
              ES GG +L  + + ++G  VN+F +++   S   + +   +  R    +ASLDG+ G+
Sbjct: 1168 TLESLGGEKLTLRAEINIGTVVNSFIRVKGHISSGFVENELFSLERQSVLFASLDGSFGY 1227

Query: 161  FLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKF 220
              PL EK +RRL MLQ +M +      GLN +  R  + +       +R ++DG +  ++
Sbjct: 1228 LRPLTEKVFRRLHMLQQLMSSMVLQPAGLNAKGARAARPQRPNHYLNTRNLVDGDVAMQY 1287

Query: 221  LQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 256
            L LSL E+ ++ +K+G+    I+D+L +I  +++H+
Sbjct: 1288 LHLSLPEKNDLARKLGTSRYHIIDDLIEICRVTAHY 1323


>gi|255539681|ref|XP_002510905.1| cleavage and polyadenylation specificity factor cpsf, putative
            [Ricinus communis]
 gi|223550020|gb|EEF51507.1| cleavage and polyadenylation specificity factor cpsf, putative
            [Ricinus communis]
          Length = 1461

 Score =  123 bits (308), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 62/183 (33%), Positives = 100/183 (54%), Gaps = 5/183 (2%)

Query: 72   YACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIR 131
            +A EF+I+ S++  ++SD+ KN+ +F Y P+  ES  G +L+ + +FH+G H+  F ++ 
Sbjct: 1272 FATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMLESWKGQKLLSRAEFHVGAHITKFIRLS 1331

Query: 132  -----CKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHT 186
                    S  +  P   +RF   + +LDG++G   PL E  +RRL  LQ  +V    H 
Sbjct: 1332 MLSTSSDRSGAAPGPDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQRKLVDAVPHV 1391

Query: 187  GGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEL 246
             GLNPR+FR ++  G         I+D  L+  F  L L E+LEI +++G+    IL  L
Sbjct: 1392 AGLNPRSFRQFRSDGKVHRPGPESIVDCELLSHFEMLPLEEQLEIAQQVGTTRAQILSNL 1451

Query: 247  YDI 249
             D+
Sbjct: 1452 NDL 1454


>gi|341892673|gb|EGT48608.1| CBN-CPSF-1 protein [Caenorhabditis brenneri]
          Length = 1440

 Score =  123 bits (308), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 67/219 (30%), Positives = 126/219 (57%), Gaps = 11/219 (5%)

Query: 44   LRFKKL-KVLFVSDRSKRANEQPGLPRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPE 102
            +RF++  K + V+ R  R   Q  +     A +F+++ + +GF++SD++ N+ LF Y PE
Sbjct: 1227 IRFQEENKAMSVASRDDRKCAQAPM-----AAQFMVDGAHIGFLLSDENGNITLFNYAPE 1281

Query: 103  ARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPIS----DAPGARSRFLTWYASLDGAL 158
            A ESNGG RL  +   ++G ++N F +++   + ++    +   A  R  T +ASLDG+ 
Sbjct: 1282 APESNGGERLTVRAAINIGTNINAFLRVKGHTALLNLHEFEKEAAEQRMSTIFASLDGSF 1341

Query: 159  GFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYK-GKGYYAGNPSRGIIDGSLV 217
            GF  PL EK+YRRL  LQ  + + +    GL+ +  R+ K  +    G  +R +IDG +V
Sbjct: 1342 GFIRPLTEKSYRRLHFLQTFIGSVSQQIAGLHIKGARSAKPPQPIVNGRNARNLIDGDVV 1401

Query: 218  WKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 256
             ++L LS  ++ ++ +++G     I+D+L ++  ++ ++
Sbjct: 1402 EQYLNLSTYDKTDLARRLGVGKYHIIDDLMELRRMAFYY 1440


>gi|225455571|ref|XP_002268371.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Vitis vinifera]
          Length = 1442

 Score =  122 bits (307), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 64/184 (34%), Positives = 101/184 (54%), Gaps = 7/184 (3%)

Query: 72   YACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIR 131
            +A EF+I+ S++  ++SD  KN+ +F Y P+  ES  G +L+ + +FH+G HV  F +++
Sbjct: 1253 FATEFLIDGSTLSLIVSDDQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQ 1312

Query: 132  CKPSPISDAPGAR------SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSH 185
              P+  SD   A       +RF   + +LDG++G   PL E  +RRL  LQ  +V    H
Sbjct: 1313 MLPAS-SDRTSATQGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDAVPH 1371

Query: 186  TGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDE 245
              GLNPR+FR ++  G         I+D  L+  +  L   E+LEI ++IG+    IL  
Sbjct: 1372 VAGLNPRSFRQFRSNGKAHRPGPDNIVDCELLCHYEMLPFEEQLEIAQQIGTTRMQILSN 1431

Query: 246  LYDI 249
            L D+
Sbjct: 1432 LNDL 1435


>gi|296084122|emb|CBI24510.3| unnamed protein product [Vitis vinifera]
          Length = 1448

 Score =  122 bits (307), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 64/184 (34%), Positives = 101/184 (54%), Gaps = 7/184 (3%)

Query: 72   YACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIR 131
            +A EF+I+ S++  ++SD  KN+ +F Y P+  ES  G +L+ + +FH+G HV  F +++
Sbjct: 1259 FATEFLIDGSTLSLIVSDDQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQ 1318

Query: 132  CKPSPISDAPGAR------SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSH 185
              P+  SD   A       +RF   + +LDG++G   PL E  +RRL  LQ  +V    H
Sbjct: 1319 MLPAS-SDRTSATQGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDAVPH 1377

Query: 186  TGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDE 245
              GLNPR+FR ++  G         I+D  L+  +  L   E+LEI ++IG+    IL  
Sbjct: 1378 VAGLNPRSFRQFRSNGKAHRPGPDNIVDCELLCHYEMLPFEEQLEIAQQIGTTRMQILSN 1437

Query: 246  LYDI 249
            L D+
Sbjct: 1438 LNDL 1441


>gi|297722899|ref|NP_001173813.1| Os04g0252200 [Oryza sativa Japonica Group]
 gi|255675253|dbj|BAH92541.1| Os04g0252200, partial [Oryza sativa Japonica Group]
          Length = 432

 Score =  122 bits (307), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 65/180 (36%), Positives = 103/180 (57%), Gaps = 5/180 (2%)

Query: 72  YACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIR 131
           +A EF+I+ S++  + SD DKNV +F Y P+  ES  G +L+ + +FH+G H+  F +++
Sbjct: 249 FATEFLIDGSTLSLVASDSDKNVQIFYYAPKMVESWKGQKLLSRAEFHVGAHITKFLRLQ 308

Query: 132 CKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNP 191
             P+    +    +RF   + +LDG +G   P+ E  +RRL  LQ  +V    H  GLNP
Sbjct: 309 MLPTQ-GLSSEKTNRFALLFGNLDGGIGCIAPIDELTFRRLQSLQRKLVDAVPHVCGLNP 367

Query: 192 RAFRTY--KGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI 249
           R+FR +   GKG+  G     IID  L+  +  LSL E+L++ ++IG+  + IL    DI
Sbjct: 368 RSFRQFHSNGKGHRPG--PDNIIDFELLAHYEMLSLDEQLDVAQQIGTTRSQILSNFSDI 425


>gi|268580265|ref|XP_002645115.1| Hypothetical protein CBG16808 [Caenorhabditis briggsae]
 gi|296439546|sp|A8XPU7.1|CPSF1_CAEBR RecName: Full=Probable cleavage and polyadenylation specificity
            factor subunit 1; AltName: Full=Cleavage and
            polyadenylation specificity factor 160 kDa subunit;
            Short=CPSF 160 kDa subunit
          Length = 1454

 Score =  122 bits (307), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 71/220 (32%), Positives = 124/220 (56%), Gaps = 12/220 (5%)

Query: 44   LRFKKL-KVLFVSDRSKRANEQPGLPRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPE 102
            +RF++  K + ++ R  R   Q  +     A EF+++   +GF++SD+  N+ LF Y PE
Sbjct: 1240 IRFQEENKAMSIASRDDRKCAQAPM-----ASEFLVDGMHIGFLLSDEHGNITLFSYSPE 1294

Query: 103  ARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPI-SDAPGARS----RFLTWYASLDGA 157
            A ESNGG RL  K   ++G ++N F +++   S + S +P  R     R  T + SLDG+
Sbjct: 1295 APESNGGERLTVKAAINIGTNINAFLRVKGHTSLLDSSSPEERENIEQRMNTIFGSLDGS 1354

Query: 158  LGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYK-GKGYYAGNPSRGIIDGSL 216
             G+  PL EK+YRRL  LQ  + + T    GL+ +  R+ K  +    G  +R +IDG +
Sbjct: 1355 FGYIRPLTEKSYRRLHFLQTFIGSVTPQIAGLHIKGARSSKPSQPIVNGRNARNLIDGDV 1414

Query: 217  VWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 256
            V ++L LS+ ++ ++ +++G     ILD+L  +  ++ ++
Sbjct: 1415 VEQYLHLSVYDKTDLARRLGVGRYHILDDLMQLRRMAYYY 1454


>gi|75145059|sp|Q7XWP1.2|CPSF1_ORYSJ RecName: Full=Probable cleavage and polyadenylation specificity
            factor subunit 1; AltName: Full=Cleavage and
            polyadenylation specificity factor 160 kDa subunit;
            Short=CPSF 160 kDa subunit
 gi|38345987|emb|CAD39979.2| OSJNBa0032B23.5 [Oryza sativa Japonica Group]
          Length = 1441

 Score =  122 bits (306), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 65/181 (35%), Positives = 103/181 (56%), Gaps = 5/181 (2%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
             +A EF+I+ S++  + SD DKNV +F Y P+  ES  G +L+ + +FH+G H+  F ++
Sbjct: 1257 CFATEFLIDGSTLSLVASDSDKNVQIFYYAPKMVESWKGQKLLSRAEFHVGAHITKFLRL 1316

Query: 131  RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 190
            +  P+    +    +RF   + +LDG +G   P+ E  +RRL  LQ  +V    H  GLN
Sbjct: 1317 QMLPTQ-GLSSEKTNRFALLFGNLDGGIGCIAPIDELTFRRLQSLQRKLVDAVPHVCGLN 1375

Query: 191  PRAFRTY--KGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYD 248
            PR+FR +   GKG+  G     IID  L+  +  LSL E+L++ ++IG+  + IL    D
Sbjct: 1376 PRSFRQFHSNGKGHRPG--PDNIIDFELLCSYEMLSLDEQLDVAQQIGTTRSQILSNFSD 1433

Query: 249  I 249
            I
Sbjct: 1434 I 1434


>gi|222628488|gb|EEE60620.1| hypothetical protein OsJ_14038 [Oryza sativa Japonica Group]
          Length = 1441

 Score =  122 bits (305), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 65/181 (35%), Positives = 103/181 (56%), Gaps = 5/181 (2%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
             +A EF+I+ S++  + SD DKNV +F Y P+  ES  G +L+ + +FH+G H+  F ++
Sbjct: 1257 CFATEFLIDGSTLSLVASDSDKNVQIFYYAPKMVESWKGQKLLSRAEFHVGAHITKFLRL 1316

Query: 131  RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 190
            +  P+    +    +RF   + +LDG +G   P+ E  +RRL  LQ  +V    H  GLN
Sbjct: 1317 QMLPTQ-GLSSEKTNRFALLFGNLDGGIGCIAPIDELTFRRLQSLQRKLVDAVPHVCGLN 1375

Query: 191  PRAFRTY--KGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYD 248
            PR+FR +   GKG+  G     IID  L+  +  LSL E+L++ ++IG+  + IL    D
Sbjct: 1376 PRSFRQFHSNGKGHRPG--PDNIIDFELLAHYEMLSLDEQLDVAQQIGTTRSQILSNFSD 1433

Query: 249  I 249
            I
Sbjct: 1434 I 1434


>gi|218194461|gb|EEC76888.1| hypothetical protein OsI_15095 [Oryza sativa Indica Group]
          Length = 1503

 Score =  122 bits (305), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 65/181 (35%), Positives = 103/181 (56%), Gaps = 5/181 (2%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
             +A EF+I+ S++  + SD DKNV +F Y P+  ES  G +L+ + +FH+G H+  F ++
Sbjct: 1319 CFATEFLIDGSTLSLVASDSDKNVQIFYYAPKMVESWKGQKLLSRAEFHVGAHITKFLRL 1378

Query: 131  RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 190
            +  P+    +    +RF   + +LDG +G   P+ E  +RRL  LQ  +V    H  GLN
Sbjct: 1379 QMLPTQ-GLSSEKTNRFALLFGNLDGGIGCIAPIDELTFRRLQSLQRKLVDAVPHVCGLN 1437

Query: 191  PRAFRTY--KGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYD 248
            PR+FR +   GKG+  G     IID  L+  +  LSL E+L++ ++IG+  + IL    D
Sbjct: 1438 PRSFRQFHSNGKGHRPG--PDNIIDFELLAHYEMLSLDEQLDVAQQIGTTRSQILSNFSD 1495

Query: 249  I 249
            I
Sbjct: 1496 I 1496


>gi|449524573|ref|XP_004169296.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1-like, partial [Cucumis sativus]
          Length = 741

 Score =  121 bits (303), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 62/183 (33%), Positives = 97/183 (53%), Gaps = 4/183 (2%)

Query: 71  VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
            YA EF+I+ S++   +SD  KN+ +F Y P++ ES  G +L+ + +FH+G HV  F ++
Sbjct: 552 CYATEFLIDGSTLSLTVSDDQKNIQIFYYAPKSTESWKGQKLLSRAEFHVGAHVTKFLRL 611

Query: 131 R----CKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHT 186
           +          S      +RF   + +LDG++G   PL E  +RRL  LQ  +     H 
Sbjct: 612 QMLSTSSDKACSTVSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLGDAVPHV 671

Query: 187 GGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEL 246
           GGLNPR+FR +   G         I+D  L+  +  L L E+L+I  +IG+  + IL  L
Sbjct: 672 GGLNPRSFRQFHSNGKVHRRGPDSIVDCELLCHYEMLPLEEQLDIAHQIGTTRSQILSNL 731

Query: 247 YDI 249
            D+
Sbjct: 732 NDL 734


>gi|449470342|ref|XP_004152876.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Cucumis sativus]
          Length = 1504

 Score =  120 bits (302), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 62/182 (34%), Positives = 97/182 (53%), Gaps = 4/182 (2%)

Query: 72   YACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIR 131
            YA EF+I+ S++   +SD  KN+ +F Y P++ ES  G +L+ + +FH+G HV  F +++
Sbjct: 1316 YATEFLIDGSTLSLTVSDDQKNIQIFYYAPKSTESWKGQKLLSRAEFHVGAHVTKFLRLQ 1375

Query: 132  ----CKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 187
                      S      +RF   + +LDG++G   PL E  +RRL  LQ  +     H G
Sbjct: 1376 MLSTSSDKACSTVSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLGDAVPHVG 1435

Query: 188  GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELY 247
            GLNPR+FR +   G         I+D  L+  +  L L E+L+I  +IG+  + IL  L 
Sbjct: 1436 GLNPRSFRQFHSNGKVHRRGPDSIVDCELLCHYEMLPLEEQLDIAHQIGTTRSQILSNLN 1495

Query: 248  DI 249
            D+
Sbjct: 1496 DL 1497


>gi|357162146|ref|XP_003579318.1| PREDICTED: probable cleavage and polyadenylation specificity factor
            subunit 1-like [Brachypodium distachyon]
          Length = 1442

 Score =  120 bits (302), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 63/178 (35%), Positives = 101/178 (56%), Gaps = 1/178 (0%)

Query: 72   YACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIR 131
            +A EF+I+ S++  ++SD DKN+ +F Y P+  ES  G +L+ + + H+G H+  F +++
Sbjct: 1259 FATEFLIDGSTLSLVVSDSDKNLQIFYYAPKMVESWKGQKLLSRAELHVGAHMTKFLRLQ 1318

Query: 132  CKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNP 191
              P+    A    +RF   + +LDG++G   P+ E  +RRL  LQ  +V   SH  GLNP
Sbjct: 1319 MLPAQ-GLASEKTNRFALLFGTLDGSIGCIAPVDELTFRRLQSLQRKLVDAVSHVCGLNP 1377

Query: 192  RAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI 249
            R+FR +K  G         IID  L+  +  LSL E+L++ ++IG+    IL    DI
Sbjct: 1378 RSFRQFKSNGKAHRPGPDNIIDFELLTYYEILSLEEQLDMAQQIGTTRAQILSNFSDI 1435


>gi|339253000|ref|XP_003371723.1| cleavage and polyadenylation specificity factor subunit 1
            [Trichinella spiralis]
 gi|316967988|gb|EFV52332.1| cleavage and polyadenylation specificity factor subunit 1
            [Trichinella spiralis]
          Length = 1376

 Score =  120 bits (301), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 68/216 (31%), Positives = 118/216 (54%), Gaps = 14/216 (6%)

Query: 40   GTLKLRFKK-LKVLFVSDRSKRANEQPGLPRGVYACEFVIESSSMGFMISDKDKNVVLFM 98
            G   LR+++  K L +  R  R +E       V A EF+++ + + F+++    +++ ++
Sbjct: 1164 GVSLLRYQEEYKALSLVSRDPRPDE-------VLAVEFLVDRTDLSFLMTSAAGDILTYV 1216

Query: 99   YQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARSRFLT----WYASL 154
            Y PE+ +S GG RL+ + D+H G  VN F ++RC    I  A   R   L      +AS 
Sbjct: 1217 YLPESLDSFGGQRLVPQADYHFGSQVNAFVRMRCHAQEI--AGRKRQEVLQRQGLIFASS 1274

Query: 155  DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDG 214
            DG++ + LPLPE+ YR L MLQ++++       GLN   +RT +        P++ IIDG
Sbjct: 1275 DGSVNYLLPLPEREYRLLGMLQSLLIDMLPSFAGLNVDDYRTVRFPNSCLREPTKNIIDG 1334

Query: 215  SLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIE 250
            ++   +L +   ++ +I ++IGS H+ I+ EL  +E
Sbjct: 1335 NICMLYLYIDALQQEDIVRQIGSSHSQIMLELAYME 1370


>gi|30696088|ref|NP_199979.2| cleavage and polyadenylation specificity factor subunit 1
            [Arabidopsis thaliana]
 gi|290457637|sp|Q9FGR0.2|CPSF1_ARATH RecName: Full=Cleavage and polyadenylation specificity factor subunit
            1; AltName: Full=Cleavage and polyadenylation specificity
            factor 160 kDa subunit; Short=AtCPSF160; Short=CPSF 160
            kDa subunit
 gi|332008729|gb|AED96112.1| cleavage and polyadenylation specificity factor subunit 1
            [Arabidopsis thaliana]
          Length = 1442

 Score =  119 bits (298), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 62/184 (33%), Positives = 101/184 (54%), Gaps = 4/184 (2%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
             +A EF+I+ S++   +SD+ KN+ +F Y P+  ES  G +L+ + +FH+G HV+ F ++
Sbjct: 1261 CFATEFLIDGSTLSLAVSDEQKNIQVFYYAPKMIESWKGLKLLSRAEFHVGAHVSKFLRL 1320

Query: 131  RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 190
            +     +S      +RF   + +LDG+ G   PL E  +RRL  LQ  +V    H  GLN
Sbjct: 1321 QM----VSSGADKINRFALLFGTLDGSFGCIAPLDEVTFRRLQSLQKKLVDAVPHVAGLN 1376

Query: 191  PRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIE 250
            P AFR ++  G    +    I+D  L+  +  L L E+LE+  +IG+    IL +L D+ 
Sbjct: 1377 PLAFRQFRSSGKARRSGPDSIVDCELLCHYEMLPLEEQLELAHQIGTTRYSILKDLVDLS 1436

Query: 251  ALSS 254
              +S
Sbjct: 1437 VGTS 1440


>gi|10257491|dbj|BAB11613.1| cleavage and polyadenylation specificity factor subunit [Arabidopsis
            thaliana]
          Length = 1448

 Score =  119 bits (297), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 62/184 (33%), Positives = 101/184 (54%), Gaps = 4/184 (2%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
             +A EF+I+ S++   +SD+ KN+ +F Y P+  ES  G +L+ + +FH+G HV+ F ++
Sbjct: 1267 CFATEFLIDGSTLSLAVSDEQKNIQVFYYAPKMIESWKGLKLLSRAEFHVGAHVSKFLRL 1326

Query: 131  RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 190
            +     +S      +RF   + +LDG+ G   PL E  +RRL  LQ  +V    H  GLN
Sbjct: 1327 QM----VSSGADKINRFALLFGTLDGSFGCIAPLDEVTFRRLQSLQKKLVDAVPHVAGLN 1382

Query: 191  PRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIE 250
            P AFR ++  G    +    I+D  L+  +  L L E+LE+  +IG+    IL +L D+ 
Sbjct: 1383 PLAFRQFRSSGKARRSGPDSIVDCELLCHYEMLPLEEQLELAHQIGTTRYSILKDLVDLS 1442

Query: 251  ALSS 254
              +S
Sbjct: 1443 VGTS 1446


>gi|24415580|gb|AAN41460.1| putative cleavage and polyadenylation specificity factor 160 kDa
            subunit [Arabidopsis thaliana]
          Length = 1442

 Score =  119 bits (297), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 62/184 (33%), Positives = 101/184 (54%), Gaps = 4/184 (2%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
             +A EF+I+ S++   +SD+ KN+ +F Y P+  ES  G +L+ + +FH+G HV+ F ++
Sbjct: 1261 CFATEFLIDGSTLSLAVSDEQKNIQVFYYAPKMIESWKGLKLLSRAEFHVGAHVSKFLRL 1320

Query: 131  RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 190
            +     +S      +RF   + +LDG+ G   PL E  +RRL  LQ  +V    H  GLN
Sbjct: 1321 QM----VSSGADKINRFALLFGTLDGSFGCIAPLDEVTFRRLQSLQKKLVDAVPHVAGLN 1376

Query: 191  PRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIE 250
            P AFR ++  G    +    I+D  L+  +  L L E+LE+  +IG+    IL +L D+ 
Sbjct: 1377 PLAFRQFRSSGKARRSGPDSIVDCELLCHYEMLPLEEQLELAHQIGTTRYSILKDLVDLS 1436

Query: 251  ALSS 254
              +S
Sbjct: 1437 VGTS 1440


>gi|198415711|ref|XP_002123169.1| PREDICTED: similar to cleavage and polyadenylation specificity factor
            1, partial [Ciona intestinalis]
          Length = 1370

 Score =  117 bits (294), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 62/157 (39%), Positives = 92/157 (58%), Gaps = 13/157 (8%)

Query: 44   LRFK-KLKVLFVSDRSKRANEQPGLPRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPE 102
            LR++   K L V+ R  R       P  VY  + V++ + + F++SD +KN+VLF Y PE
Sbjct: 1222 LRYQTDFKTLSVTSRDVR-------PLEVYTADLVVDGTGINFLVSDHEKNLVLFAYDPE 1274

Query: 103  ARESNGGHRLIKKTDFHLGQHVNTFFKIRC----KPSPISDAPGARSRFLTWYASLDGAL 158
              ES+GG RL K+ D H+G   N  +++      + + + + P A    +T   +LDG++
Sbjct: 1275 DHESHGGSRLTKRADMHIGSRANCMWRVAACGVDRSTGLPNQPYA-GVHITMMGTLDGSI 1333

Query: 159  GFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFR 195
               LP+ EK YRRLLMLQN+M+T   H  GLNP+AFR
Sbjct: 1334 CHVLPVAEKVYRRLLMLQNIMITGLQHIAGLNPKAFR 1370


>gi|224120960|ref|XP_002318462.1| predicted protein [Populus trichocarpa]
 gi|222859135|gb|EEE96682.1| predicted protein [Populus trichocarpa]
          Length = 1455

 Score =  117 bits (292), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 60/186 (32%), Positives = 102/186 (54%), Gaps = 11/186 (5%)

Query: 72   YACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI- 130
            ++ EF+I+ S++  ++SD+ KNV +F Y P+  ES  G +L+ + +FH+G  V  F ++ 
Sbjct: 1266 FSTEFLIDGSTLSLVVSDEQKNVQIFYYAPKMSESWKGQKLLSRAEFHVGALVTKFMRLQ 1325

Query: 131  -------RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHT 183
                   R   +P+SD     +RF   + +LDG++G   PL E  +RRL  LQ  +V   
Sbjct: 1326 MLSPSLDRSGAAPVSDK---TNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDAV 1382

Query: 184  SHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDIL 243
             H  GLNP++FR ++  G         I+D  ++  +  + L E++EI ++IG+    IL
Sbjct: 1383 PHVAGLNPKSFRQFRSDGKAHRPGPESIVDCEMLSYYEMIPLEEQVEIAQQIGTTRAQIL 1442

Query: 244  DELYDI 249
              L D+
Sbjct: 1443 SNLNDL 1448


>gi|320169222|gb|EFW46121.1| cleavage and polyadenylation specificity factor 1 [Capsaspora
            owczarzaki ATCC 30864]
          Length = 1725

 Score =  115 bits (289), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 60/180 (33%), Positives = 102/180 (56%), Gaps = 2/180 (1%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
            V + +F+++ SS+GF+ +D  +N+V+F Y P  RESN G RL+++ DFH+G HV    ++
Sbjct: 1540 VASTQFLVDGSSLGFLATDSHQNLVVFAYNPLNRESNNGQRLLRQLDFHVGSHVQQVLRM 1599

Query: 131  RCKPSPISDAPGAR-SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGL 189
              +  P+S   GA   R +   A+L+G+L    P+ E  +RRL  LQ  +V       GL
Sbjct: 1600 VPRSLPVSVDRGASVKRHIDLLATLEGSLNALAPIGETTFRRLEWLQRQLV-GLQQRAGL 1658

Query: 190  NPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI 249
            NP  +R Y+         +  +IDG L+ +FL L L E+ E+ ++  +   D++D++  +
Sbjct: 1659 NPIGYRAYRFPRKMTTTRAGNVIDGELLSRFLYLGLAEQRELARQRRNTPEDLIDDILSV 1718


>gi|297792471|ref|XP_002864120.1| hypothetical protein ARALYDRAFT_495232 [Arabidopsis lyrata subsp.
            lyrata]
 gi|297309955|gb|EFH40379.1| hypothetical protein ARALYDRAFT_495232 [Arabidopsis lyrata subsp.
            lyrata]
          Length = 1444

 Score =  115 bits (288), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 59/184 (32%), Positives = 100/184 (54%), Gaps = 4/184 (2%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
             +A EF+I+ +++   +SD+ KN+ +F Y P+  ES  G +L+ + +FH+G HV  F ++
Sbjct: 1263 CFATEFLIDGNTLSLAVSDEQKNIQVFYYAPKMAESWKGQKLLSRAEFHVGSHVTKFLRL 1322

Query: 131  RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 190
            +     ++      +RF   + +LDG+ G   PL E  +RRL  LQ  +V    H  GLN
Sbjct: 1323 QM----VTSGADKTNRFALLFGTLDGSFGCIAPLDEVTFRRLQSLQKKLVDAVPHVAGLN 1378

Query: 191  PRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIE 250
            P +FR ++  G    +    IID  L+  +  L L E+LE+  +IG+  + IL  L ++ 
Sbjct: 1379 PHSFRQFRTSGKARRSGPDSIIDCELLCHYEMLPLEEQLELAHQIGTTRSVILLNLVELS 1438

Query: 251  ALSS 254
              +S
Sbjct: 1439 VGTS 1442


>gi|440793679|gb|ELR14857.1| CPSF A subunit region protein [Acanthamoeba castellanii str. Neff]
          Length = 1477

 Score =  112 bits (280), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 56/185 (30%), Positives = 103/185 (55%), Gaps = 4/185 (2%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
            V+A E+V++  ++  ++SD+ +N+ +  Y P   ES GG  L+ + DF++GQ +N F ++
Sbjct: 1295 VFATEYVVDKQNLALLMSDERQNLQVTAYAPHTAESRGGQLLVPRGDFNVGQSINKFVRL 1354

Query: 131  RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 190
               P  +     +  R   W+ +L G +G+  P+ E  +RRL MLQ+ +++   HT GL+
Sbjct: 1355 ---PMTLPSGTTSLQRHALWFGTLSGGVGYLAPMDESVFRRLGMLQSALLSAIPHTAGLH 1411

Query: 191  PRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIE 250
            P+A+R  + +     N    I+DG L+ ++L L    + +I  K+G+    IL++L  I 
Sbjct: 1412 PQAYRALQ-RERLLRNRKHTILDGLLLSRYLALDSATQQQIALKLGTSRERILNDLQGIP 1470

Query: 251  ALSSH 255
               +H
Sbjct: 1471 QSVTH 1475


>gi|298715584|emb|CBJ28137.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 255

 Score =  111 bits (278), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 56/183 (30%), Positives = 100/183 (54%), Gaps = 5/183 (2%)

Query: 71  VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
           V++ E++I+   M  +++D  +NV +  Y P A  S GG +L+ ++DF+LG  V    + 
Sbjct: 76  VFSAEYLIDEPGMAIVVADGRRNVKVLQYAPNATNSRGGTKLLCQSDFYLGSRVGKLTRR 135

Query: 131 RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 190
           R + + + D     +R+     +LDG LG  LP+ E+ +RRL  LQ +M     H G  N
Sbjct: 136 RTRGN-LRDG----ARYCLLAGTLDGGLGAVLPVDERVFRRLYALQGIMSNALGHNGAAN 190

Query: 191 PRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIE 250
           PRA+R +     +     + ++DGSL+W+F+ L    + ++ + IG+  + ++  L DI+
Sbjct: 191 PRAYRLFDHGPTFRYETKQNMLDGSLLWRFVGLDAKTQHDLTRAIGTTVDRVMANLLDID 250

Query: 251 ALS 253
             S
Sbjct: 251 LAS 253


>gi|430810872|emb|CCJ31592.1| unnamed protein product [Pneumocystis jirovecii]
 gi|430814599|emb|CCJ28188.1| unnamed protein product [Pneumocystis jirovecii]
          Length = 203

 Score =  110 bits (276), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 60/188 (31%), Positives = 102/188 (54%), Gaps = 11/188 (5%)

Query: 71  VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
           V + +F+++   + F+I D D N+ +F Y PE  +S  G +L+K+ DFH+G H+ +   +
Sbjct: 13  VSSADFLVDDEHLYFVIGDDDGNIHVFNYDPENPQSFSGQKLLKRGDFHVGSHIKSILML 72

Query: 131 --RCKPSPISDAPGARSR----FLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTS 184
                P  ++D    R+      L   AS DG++G  + LPEK YRRL  +Q  ++    
Sbjct: 73  PKEAFPQNVNDKEETRASKNQDSLCLCASQDGSMGVLISLPEKTYRRLYFIQGQLINTED 132

Query: 185 HTGGLNPRAFR--TYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
              GLNP ++R  TY  K     NP+RGI+DG L++++  L   ++ ++ +K G     I
Sbjct: 133 KVAGLNPISYRTSTYVSK---TSNPARGILDGKLLYQYNNLERNKQKDMARKSGMPVETI 189

Query: 243 LDELYDIE 250
           + +L  I+
Sbjct: 190 IYDLLKID 197


>gi|384487281|gb|EIE79461.1| hypothetical protein RO3G_04166 [Rhizopus delemar RA 99-880]
          Length = 1468

 Score =  110 bits (275), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 60/189 (31%), Positives = 103/189 (54%), Gaps = 6/189 (3%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
            V   +F+I+  S+  ++ D ++N+ L+ Y P   +S GG +L+++ DFH+G  V T  ++
Sbjct: 1283 VGCVDFIIDDKSLYLIVGDTNENIDLYQYAPFNLQSFGGQKLMRRGDFHVGSQVQTMVRL 1342

Query: 131  RCKPSPISDAPGAR--SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGG 188
               P       G     R      + +G++     + EK ++RL  L   +V +  H  G
Sbjct: 1343 ---PQIEKTEKGFEYSRRHFCLCGTFNGSIAVISSISEKTFKRLNTLYGHLVNNLQHVAG 1399

Query: 189  LNPRAFRTYKG-KGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELY 247
            LNPRAFR  KG K   + N ++ ++DG L+++F  LS+ E+ E  K+IG+    I+++L 
Sbjct: 1400 LNPRAFRLIKGPKQRMSTNRTKAVLDGDLIFEFAGLSIEEQKETTKQIGTTVTRIMEDLV 1459

Query: 248  DIEALSSHF 256
            DIE   +HF
Sbjct: 1460 DIECSINHF 1468



 Score = 38.9 bits (89), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 26/86 (30%), Positives = 46/86 (53%), Gaps = 6/86 (6%)

Query: 4   VQELLTVSLGLHGNRPLLLVRTQ-HELLIYQAFRHPKGT----LKLRFKKLKVLFVSDRS 58
           +QE+L   +G     P L+VRT  ++++IY+AF +   +    L LRF +++  +VS +S
Sbjct: 853 IQEILMTHIGKERKDPHLVVRTDTNDIIIYKAFTYLDESSPDRLALRFSRVQHEYVSRKS 912

Query: 59  KRANEQPGLPRGVYACEFVIESSSMG 84
                +P   RG+   EF I  + + 
Sbjct: 913 SSHESKPKKKRGIID-EFEIPDTDLN 937


>gi|290981010|ref|XP_002673224.1| CPSF A subunit [Naegleria gruberi]
 gi|284086806|gb|EFC40480.1| CPSF A subunit [Naegleria gruberi]
          Length = 1373

 Score =  107 bits (268), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 56/208 (26%), Positives = 104/208 (50%), Gaps = 19/208 (9%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  V + EF++ +  +G  + D  KN+ +F Y P+ +ESN G  L+   DFH+G  +N+ 
Sbjct: 1166 PLQVVSSEFLVNNDLLGLAVIDTSKNLQIFSYLPQHQESNDGRNLVPVCDFHIGTLINSL 1225

Query: 128  FKIRCKPSP---------ISDAPGARSR----------FLTWYASLDGALGFFLPLPEKN 168
             +++ +  P         +++ P    +              + S+DGA+G+  P+ E  
Sbjct: 1226 IRMKVRELPDDNTIRLGNVNEKPKQSGKKDITKTNPNHQFILFGSVDGAIGYVAPINEVT 1285

Query: 169  YRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGER 228
            +RRL  LQ  M T      GL+P++FR YK       N  + IIDG L+W +  ++   +
Sbjct: 1286 HRRLFALQLKMYTQLEQAAGLHPKSFRLYKPLERTEYNYKKNIIDGQLIWNYANINTILQ 1345

Query: 229  LEICKKIGSKHNDILDELYDIEALSSHF 256
             ++ ++IG+  ++IL  + ++   +  F
Sbjct: 1346 RDLARQIGTNSDNILRSIQELNQATFFF 1373


>gi|384946686|gb|AFI36948.1| cleavage and polyadenylation specificity factor subunit 1 [Macaca
            mulatta]
          Length = 1428

 Score =  103 bits (256), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 58/189 (30%), Positives = 98/189 (51%), Gaps = 11/189 (5%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 1251 PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 1310

Query: 128  FKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 187
            ++  C+ +      G   + + W        G  LP                  H S   
Sbjct: 1311 WRTPCRGA----TEGLSKKSVVWENKHITWFGEDLP-------AAADAAERADHHASAPR 1359

Query: 188  GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELY 247
               P           +         DG L+ ++L LS  ER E+ KKIG+  + ILD+L 
Sbjct: 1360 RPQPPCLPDAARGPPHPPECCAQRADGELLNRYLYLSTMERSELAKKIGTTPDIILDDLL 1419

Query: 248  DIEALSSHF 256
            + + +++HF
Sbjct: 1420 ETDRVTAHF 1428



 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 26/51 (50%), Positives = 35/51 (68%), Gaps = 4/51 (7%)

Query: 3   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGTLKLRFKKL 49
           +V+E+L V+LG   +RP LLV    ELLIY+AF H     +G LK+RFKK+
Sbjct: 844 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKV 894


>gi|303321596|ref|XP_003070792.1| CPSF A subunit region family protein [Coccidioides posadasii C735
            delta SOWgp]
 gi|240110489|gb|EER28647.1| CPSF A subunit region family protein [Coccidioides posadasii C735
            delta SOWgp]
          Length = 1394

 Score =  100 bits (249), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 58/192 (30%), Positives = 100/192 (52%), Gaps = 16/192 (8%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
            V A +F+ +   +  +++D D  + +  Y PE   S+ G RL+ ++ FH+G   +T   +
Sbjct: 1204 VIAADFLPDGKRLYILVADDDCTIHVLEYDPEDPTSSKGDRLLHRSSFHMGHFTSTMTLL 1263

Query: 131  -RCKPSPISDAPGARS--------RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVT 181
             +   SP +D PG            +     S +G++G   PL E +YRRL  LQ+ +VT
Sbjct: 1264 PQHSSSPSADDPGEDDMDVDYVPKSYQVLVTSQEGSIGVVTPLTEDSYRRLSALQSQLVT 1323

Query: 182  HTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHND 241
               H  GLNP+A+R  +  G+      RGI+DG+L+ ++L + +  + EI  ++G+   D
Sbjct: 1324 SMEHPCGLNPKAYRAVESDGFGG----RGIVDGNLLLRWLDMGVQRKAEIAGRVGA---D 1376

Query: 242  ILDELYDIEALS 253
            I     D+E +S
Sbjct: 1377 IESIRVDLEKIS 1388


>gi|320040273|gb|EFW22206.1| hypothetical protein CPSG_00105 [Coccidioides posadasii str.
            Silveira]
          Length = 1387

 Score =  100 bits (248), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 58/192 (30%), Positives = 100/192 (52%), Gaps = 16/192 (8%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
            V A +F+ +   +  +++D D  + +  Y PE   S+ G RL+ ++ FH+G   +T   +
Sbjct: 1197 VIAADFLPDGKRLYILVADDDCTIHVLEYDPEDPTSSKGDRLLHRSSFHMGHFTSTMTLL 1256

Query: 131  -RCKPSPISDAPGARS--------RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVT 181
             +   SP +D PG            +     S +G++G   PL E +YRRL  LQ+ +VT
Sbjct: 1257 PQHSSSPSADDPGEDDMDVDYVPKSYQVLVTSQEGSIGVVTPLTEDSYRRLSALQSQLVT 1316

Query: 182  HTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHND 241
               H  GLNP+A+R  +  G+      RGI+DG+L+ ++L + +  + EI  ++G+   D
Sbjct: 1317 SMEHPCGLNPKAYRAVESDGFGG----RGIVDGNLLLRWLDMGVQRKAEIAGRVGA---D 1369

Query: 242  ILDELYDIEALS 253
            I     D+E +S
Sbjct: 1370 IESIRVDLEKIS 1381


>gi|19112233|ref|NP_595441.1| cleavage factor one Cft1 (predicted) [Schizosaccharomyces pombe
            972h-]
 gi|74582544|sp|O74733.1|CFT1_SCHPO RecName: Full=Protein cft1; AltName: Full=Cleavage factor two protein
            1
 gi|3738146|emb|CAA21247.1| cleavage factor one Cft1 (predicted) [Schizosaccharomyces pombe]
          Length = 1441

 Score = 99.8 bits (247), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 60/188 (31%), Positives = 100/188 (53%), Gaps = 6/188 (3%)

Query: 70   GVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFK 129
             V A +F+++  ++ F+++D   N+ L  Y PE  ES+ G RL+ + DFH+G +V T   
Sbjct: 1254 NVSAADFLVQGENLYFVVADTSGNLRLLAYDPENPESHSGERLVTRGDFHIG-NVITAMT 1312

Query: 130  IRCKPSPISDAP---GARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHT 186
            I  K     +A         F     + DG L   +P+ ++ YRRL ++QN +    +  
Sbjct: 1313 ILPKEKKHQNAEYGYDTGDDFSCVMVNSDGGLQMLVPISDRVYRRLNIIQNYLANRVNTI 1372

Query: 187  GGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEL 246
            GGLNP+++R          NP+R I+DG L+  F  +S+  R E+  K G   + I+++L
Sbjct: 1373 GGLNPKSYRLITSPSNLT-NPTRRILDGMLIDYFTYMSVAHRHEMAHKCGVPVSTIMNDL 1431

Query: 247  YDI-EALS 253
             ++ EALS
Sbjct: 1432 VELDEALS 1439


>gi|301093545|ref|XP_002997618.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262110008|gb|EEY68060.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 1744

 Score = 99.4 bits (246), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 60/204 (29%), Positives = 100/204 (49%), Gaps = 15/204 (7%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  V A EF +    +  +  D D+N+ +  + P+  ES GG RL++ +DFHLG  V++ 
Sbjct: 1540 PLAVSATEFSVFEKKLALLAVDMDENLHVMQFAPQDIESRGGQRLLRVSDFHLGVQVSSM 1599

Query: 128  FKIRCKPS-PISDAPGAR-----SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVT 181
            F+ R   S  +  A   R     S ++    + +G +G  +P+ E+ +RRL  LQNVMV 
Sbjct: 1600 FRKRVDASGSVVSATNGRNAAPLSNYVNVMGTSEGGVGALVPVGERVFRRLFTLQNVMVN 1659

Query: 182  HTSHTGGLNPRAFRTYKGKGY-YAGNPS--------RGIIDGSLVWKFLQLSLGERLEIC 232
                   LNPR FR  K       G P         +  +D  ++++FLQL    + E+ 
Sbjct: 1660 TLPQNCALNPREFRMLKTNAQRRCGRPDAWSKKKWKKSFLDAFVLFRFLQLDYVAQKELA 1719

Query: 233  KKIGSKHNDILDELYDIEALSSHF 256
            + IG+    ++  L +++  +S F
Sbjct: 1720 RCIGTTPEVVMHNLLEVQHATSTF 1743


>gi|219109892|ref|XP_002176699.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217411234|gb|EEC51162.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 1678

 Score = 99.0 bits (245), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 63/209 (30%), Positives = 100/209 (47%), Gaps = 23/209 (11%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  VYA   +    +M F+  D  +N+  F Y P    + GG+RL+ + D+HLG    +F
Sbjct: 1467 PIPVYAAGVMSRGPAMTFLCHDDRQNLQFFQYAPGEAAARGGNRLVCRADYHLGTQTTSF 1526

Query: 128  FKIRCKPS-------PIS--------DAPGARS----RFLTWYASLDGALGFFLPLPEKN 168
                C+ S       P S        D+   RS    R   ++ + DG +G  +PL E  
Sbjct: 1527 ASHFCRSSLMIHSATPTSTLAALKQQDSYFGRSEEDQRLGAYFGTADGGMGAVVPLSEPV 1586

Query: 169  YRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGK----GYYAGNPSRGIIDGSLVWKFLQLS 224
            Y RL  LQ+++         L PRA+R Y+      G  + +  +G+IDG LV ++  LS
Sbjct: 1587 YWRLTALQSIVANALESDCALAPRAWRLYRRSTRRGGCRSNDRKKGVIDGDLVLQYADLS 1646

Query: 225  LGERLEICKKIGSKHNDILDELYDIEALS 253
            + ++ +I   IGS  + ILD L +++  S
Sbjct: 1647 ISKQEDIASAIGSTVDLILDNLLELQCGS 1675


>gi|167526060|ref|XP_001747364.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163774199|gb|EDQ87831.1| predicted protein [Monosiga brevicollis MX1]
          Length = 1324

 Score = 97.8 bits (242), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 52/170 (30%), Positives = 95/170 (55%), Gaps = 7/170 (4%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
             Y  +FV+  S +  +  D D N+V   Y  ++ ++ GG  L++  +FHLG  ++    +
Sbjct: 1138 CYCADFVVRESDLAMIYGDADGNLVALDYDADSPDTRGGRILVRSANFHLGTRLSAM--L 1195

Query: 131  RCKPSPISDAPG----ARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHT 186
            R + +P+  APG    A+   +     ++G  G  +PL E  YRRL MLQ  +V+H+S  
Sbjct: 1196 RLQAAPVVRAPGGLAEAQKCHVVHTFGIEGQQGVVIPLHEAEYRRLEMLQKKLVSHSS-L 1254

Query: 187  GGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 236
             GL+P  FR +K   +   + ++GI+DG+L+ ++  L   E+L++ +++G
Sbjct: 1255 AGLHPFQFRAFKSSIWRPRSFAQGILDGALLRQYFCLGRREQLDVAEQLG 1304


>gi|301103686|ref|XP_002900929.1| cleavage and polyadenylation specificity factor subunit, putative
            [Phytophthora infestans T30-4]
 gi|262101684|gb|EEY59736.1| cleavage and polyadenylation specificity factor subunit, putative
            [Phytophthora infestans T30-4]
          Length = 1561

 Score = 97.8 bits (242), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 60/204 (29%), Positives = 99/204 (48%), Gaps = 15/204 (7%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  V A EF +    +  +  D D+N+ +  + P+  ES GG RL++ +DFHLG  V++ 
Sbjct: 1357 PLAVSATEFSVFEKKLALLAVDMDENLHVMQFAPQDIESRGGQRLLRVSDFHLGVQVSSM 1416

Query: 128  FKIRCKPS-PISDAPGAR-----SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVT 181
            F+ R   S  +  A   R     S ++    + +G +G  +P+ E+ +RRL  LQNVMV 
Sbjct: 1417 FRKRVDASGSVVSATNGRNAAPLSNYVNVMGTSEGGVGALVPVGERVFRRLFTLQNVMVN 1476

Query: 182  HTSHTGGLNPRAFRTYKGKGY-YAGNPS--------RGIIDGSLVWKFLQLSLGERLEIC 232
                   LNPR FR  K       G P         +  +D  ++++FLQL    + E+ 
Sbjct: 1477 TLPQNCALNPREFRILKTNAQRRCGRPDAWSKKKWKKSFLDAFVLFRFLQLDYVAQKELA 1536

Query: 233  KKIGSKHNDILDELYDIEALSSHF 256
            + IG+     +  L +++  +S F
Sbjct: 1537 RCIGTTPEVAMHNLLEVQHATSTF 1560


>gi|225679191|gb|EEH17475.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb03]
          Length = 1377

 Score = 97.1 bits (240), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 60/192 (31%), Positives = 93/192 (48%), Gaps = 18/192 (9%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
            V A +F+     +  M++D D N+ +  Y PE   S  G RL+ ++ FH GQ  +T   +
Sbjct: 1184 VMAADFLPHGKRLFIMVADDDCNIHVLQYDPEDPGSAKGDRLLHRSTFHTGQFASTL-TL 1242

Query: 131  RCKPSPISDAPGARSRFL----------TWYASLDGALGFFLPLPEKNYRRLLMLQNVMV 180
              + S +S  P A +  +              S  G++    P+ E  YRRL  LQ+ M+
Sbjct: 1243 LPRTSVLSQGPEAEANAMDLDSSGPLHQVLVTSETGSIALITPVSEMAYRRLSALQSQMI 1302

Query: 181  THTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHN 240
                H  GLNPRAFR  +  G       RG++DG LV K+L L    + EI  ++G+   
Sbjct: 1303 NTLEHPCGLNPRAFRAVESDGIGG----RGMVDGDLVQKWLDLGTQRKAEIASRVGA--- 1355

Query: 241  DILDELYDIEAL 252
            D+ +   D+EA+
Sbjct: 1356 DVWEIRADLEAI 1367


>gi|295665178|ref|XP_002793140.1| cleavage and polyadenylation specificity factor subunit A
            [Paracoccidioides sp. 'lutzii' Pb01]
 gi|226278054|gb|EEH33620.1| cleavage and polyadenylation specificity factor subunit A
            [Paracoccidioides sp. 'lutzii' Pb01]
          Length = 1408

 Score = 96.7 bits (239), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 59/192 (30%), Positives = 93/192 (48%), Gaps = 18/192 (9%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
            V A +F+ +   +  M++D D N+ +  Y PE   S  G RL+ ++ FH GQ  +T   +
Sbjct: 1217 VMAADFLPDGKRLYIMVADDDCNIHVLQYDPEDPGSAKGDRLLHRSTFHTGQFASTL-TL 1275

Query: 131  RCKPSPISDAPGARSRFL----------TWYASLDGALGFFLPLPEKNYRRLLMLQNVMV 180
              + S +S  P   +  +              S  G++    P+ E  YRRL  LQ+ M+
Sbjct: 1276 LPRTSVLSQGPETEANAMDLDLSGPLHQVLVTSETGSIALITPVSEMAYRRLSALQSQMI 1335

Query: 181  THTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHN 240
                H  GLNPRAFR  +  G       RG++DG LV K+L L    + EI  ++G+   
Sbjct: 1336 NTLEHPCGLNPRAFRAVESDGIGG----RGMVDGDLVQKWLDLGTQRKAEIASRVGA--- 1388

Query: 241  DILDELYDIEAL 252
            D+ +   D+EA+
Sbjct: 1389 DVWEIRADLEAI 1400


>gi|213407244|ref|XP_002174393.1| cleavage factor one Cft1 [Schizosaccharomyces japonicus yFS275]
 gi|212002440|gb|EEB08100.1| cleavage factor one Cft1 [Schizosaccharomyces japonicus yFS275]
          Length = 1431

 Score = 96.7 bits (239), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 60/192 (31%), Positives = 97/192 (50%), Gaps = 10/192 (5%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  V + +F+++  S+ F ++D   N+ +  Y PE  ES+ G RL+ + D H+G H+ T 
Sbjct: 1242 PLSVSSADFLVQGQSLYFAVADMRGNLRILAYDPENPESHSGERLVTRGDIHVG-HIIT- 1299

Query: 128  FKIRCKPSPISDAPGAR-----SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
              I   P    D PG         F     + DG+L    P+ E+ YRRL ++QN +   
Sbjct: 1300 -AIHLVPKMKKDRPGEVDYDEGDEFACITTNSDGSLQALCPISERVYRRLNIIQNYLANR 1358

Query: 183  TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
                GGLNPR++R          N +  I+DG L+  F  +S+  R E+  K G   + I
Sbjct: 1359 IETVGGLNPRSYRLINTVSSL-NNATHRILDGGLIEHFSYMSVAHRQEMAYKCGVPISTI 1417

Query: 243  LDELYDI-EALS 253
            +++L ++ EAL+
Sbjct: 1418 MNDLVELDEALN 1429


>gi|119195757|ref|XP_001248482.1| hypothetical protein CIMG_02253 [Coccidioides immitis RS]
 gi|121769680|sp|Q1E5B0.1|CFT1_COCIM RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two protein
            1
 gi|392862316|gb|EAS37050.2| protein CFT1 [Coccidioides immitis RS]
          Length = 1387

 Score = 95.9 bits (237), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 57/192 (29%), Positives = 97/192 (50%), Gaps = 16/192 (8%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
            V A +F+ +   +  +++D D  + +  Y PE   S+ G RL+ ++ FH G   +T   +
Sbjct: 1197 VIAADFLPDGKRLYILVADDDCTIHVLEYDPEDPTSSKGDRLLHRSSFHTGHFTSTMTLL 1256

Query: 131  -RCKPSPISDAPGARS--------RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVT 181
                 SP +D P             +     S +G++G   PL E +YRRL  LQ+ +VT
Sbjct: 1257 PEHSSSPSADDPEEDDMDVDYVPKSYQVLVTSQEGSIGVVTPLTEDSYRRLSALQSQLVT 1316

Query: 182  HTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHND 241
               H  GLNP+A+R  +  G+      RGI+DG+L+ ++L + +  + EI  ++G+   D
Sbjct: 1317 SMEHPCGLNPKAYRAVESDGFGG----RGIVDGNLLLRWLDMGVQRKAEIAGRVGA---D 1369

Query: 242  ILDELYDIEALS 253
            I     D+E +S
Sbjct: 1370 IESIRVDLETIS 1381


>gi|121719617|ref|XP_001276507.1| cleavage and polyadenylation specificity factor subunit A, putative
            [Aspergillus clavatus NRRL 1]
 gi|148886827|sp|A1C3U1.1|CFT1_ASPCL RecName: Full=Protein cft1; AltName: Full=Cleavage factor two protein
            1
 gi|119404719|gb|EAW15081.1| cleavage and polyadenylation specificity factor subunit A, putative
            [Aspergillus clavatus NRRL 1]
          Length = 1401

 Score = 95.9 bits (237), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 57/188 (30%), Positives = 100/188 (53%), Gaps = 19/188 (10%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF--- 127
            V A +F+ +   +  +++D D N+ +  Y+PE   S+ G +L+ ++ FH+G   +T    
Sbjct: 1210 VVAADFLPDGDKLFILVADSDCNLHVLQYEPEDPMSSNGDKLLVRSKFHMGHFTSTLTLL 1269

Query: 128  ------FKIRCKPSPISDAPGARSRFL---TWYASLDGALGFFLPLPEKNYRRLLMLQNV 178
                  ++I   PS  SD+     R         S  G++G    +PE++YRRL  LQ+ 
Sbjct: 1270 PRTTASYEI---PSADSDSMEVDPRITPQQVLITSQSGSIGIVTSIPEESYRRLSALQSQ 1326

Query: 179  MVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 238
            +     H  GLNPRA+R  +  G  AG   RG++DG+L++++L +S   R+EI  ++G+ 
Sbjct: 1327 LANTVEHPCGLNPRAYRAIESDG-TAG---RGMLDGNLLYQWLSMSKQRRMEIAARVGAH 1382

Query: 239  HNDILDEL 246
              +I  +L
Sbjct: 1383 EWEIKADL 1390


>gi|196012166|ref|XP_002115946.1| hypothetical protein TRIADDRAFT_59883 [Trichoplax adhaerens]
 gi|190581722|gb|EDV21798.1| hypothetical protein TRIADDRAFT_59883 [Trichoplax adhaerens]
          Length = 1187

 Score = 95.9 bits (237), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 47/148 (31%), Positives = 82/148 (55%), Gaps = 3/148 (2%)

Query: 103  ARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFL 162
            A ES+GG  L+++ +   G + + FF+ + +     +     ++ +TW+ +LDG++G  L
Sbjct: 1039 APESHGGQFLVRRAEIQTGSNAHAFFRTKVRAL---NQRQNENKHITWFGTLDGSIGLLL 1095

Query: 163  PLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQ 222
            P+ EK YRRL  LQ  +  +     GLN +AFRT++       N  R I+DG L+ ++  
Sbjct: 1096 PVDEKEYRRLFSLQAKLSIYLEQNAGLNQKAFRTFRSHQKKLQNSMRNILDGDLLKRYFH 1155

Query: 223  LSLGERLEICKKIGSKHNDILDELYDIE 250
            L   ER ++ K+I S    I+++L  +E
Sbjct: 1156 LGFVERRDLAKQIMSTPEQIINDLTKLE 1183


>gi|148886829|sp|A2R919.1|CFT1_ASPNC RecName: Full=Protein cft1; AltName: Full=Cleavage factor two protein
            1
 gi|134083776|emb|CAK47110.1| unnamed protein product [Aspergillus niger]
          Length = 1383

 Score = 95.1 bits (235), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 56/192 (29%), Positives = 97/192 (50%), Gaps = 16/192 (8%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
            V A EF+ +   +  +++D D N+ +  Y PE  +S+ G RL+ ++ FH+G   +T   +
Sbjct: 1192 VCAAEFLPDGKRLFIVVADSDCNIHVLQYDPEDPKSSNGDRLLSRSKFHMGNFASTLTLL 1251

Query: 131  ---------RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVT 181
                         S   D             + +G+LG    +PE++YRRL  LQ+ +  
Sbjct: 1252 PRTMVSSEKMVSSSDGMDIDNQSPLHQVLMTTQNGSLGLITCIPEESYRRLSALQSQLTN 1311

Query: 182  HTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHND 241
               H  GLNPRAFR  +      G   RG++DG+L++K++ +S   + EI  ++G++  +
Sbjct: 1312 TLEHPCGLNPRAFRAVESD----GTAGRGMLDGNLLFKWIDMSKQRKTEIAGRVGAREWE 1367

Query: 242  ILDELYDIEALS 253
            I     D+EA+S
Sbjct: 1368 I---KADLEAIS 1376


>gi|348679545|gb|EGZ19361.1| putative cleavage and polyadenylation specificity factor CPSF
            [Phytophthora sojae]
          Length = 1752

 Score = 95.1 bits (235), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 59/210 (28%), Positives = 98/210 (46%), Gaps = 27/210 (12%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  V A EF +    +  +  D D+N+ +  + P+  ES GG RL++ +DFHLG  V + 
Sbjct: 1548 PLAVSATEFSVFEKKLALLAVDMDENLHVMQFAPQDIESRGGQRLLRVSDFHLGVQVASM 1607

Query: 128  FKIRCKPSPISDAPGAR------------SRFLTWYASLDGALGFFLPLPEKNYRRLLML 175
            F+ R       D PG              S ++    + +G +G  +P+ E+ +RRL  L
Sbjct: 1608 FRKRV------DGPGGHVAVNGRGPRAPPSYYVNVMGNSEGGVGALIPVGERVFRRLFTL 1661

Query: 176  QNVMVTHTSHTGGLNPRAFRTYKGKGY-YAGNPS--------RGIIDGSLVWKFLQLSLG 226
            QNVMV        LNPR FR  K       G P         +  +D  ++++FLQL   
Sbjct: 1662 QNVMVNTLPQNCALNPREFRMLKTNAQRRCGRPDAWSKKKWKKSFLDAFVLFRFLQLDYV 1721

Query: 227  ERLEICKKIGSKHNDILDELYDIEALSSHF 256
             + E+ + IG+    ++  L +++  ++ F
Sbjct: 1722 AQKELARCIGTTPEVVIHNLLEVQHATATF 1751


>gi|317036382|ref|XP_001398211.2| protein cft1 [Aspergillus niger CBS 513.88]
          Length = 1393

 Score = 95.1 bits (235), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 56/192 (29%), Positives = 97/192 (50%), Gaps = 16/192 (8%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
            V A EF+ +   +  +++D D N+ +  Y PE  +S+ G RL+ ++ FH+G   +T   +
Sbjct: 1202 VCAAEFLPDGKRLFIVVADSDCNIHVLQYDPEDPKSSNGDRLLSRSKFHMGNFASTLTLL 1261

Query: 131  ---------RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVT 181
                         S   D             + +G+LG    +PE++YRRL  LQ+ +  
Sbjct: 1262 PRTMVSSEKMVSSSDGMDIDNQSPLHQVLMTTQNGSLGLITCIPEESYRRLSALQSQLTN 1321

Query: 182  HTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHND 241
               H  GLNPRAFR  +      G   RG++DG+L++K++ +S   + EI  ++G++  +
Sbjct: 1322 TLEHPCGLNPRAFRAVESD----GTAGRGMLDGNLLFKWIDMSKQRKTEIAGRVGAREWE 1377

Query: 242  ILDELYDIEALS 253
            I     D+EA+S
Sbjct: 1378 I---KADLEAIS 1386


>gi|350633238|gb|EHA21604.1| hypothetical protein ASPNIDRAFT_51242 [Aspergillus niger ATCC 1015]
          Length = 1406

 Score = 95.1 bits (235), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 56/192 (29%), Positives = 97/192 (50%), Gaps = 16/192 (8%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
            V A EF+ +   +  +++D D N+ +  Y PE  +S+ G RL+ ++ FH+G   +T   +
Sbjct: 1215 VCAAEFLPDGKRLFIVVADSDCNIHVLQYDPEDPKSSNGDRLLSRSKFHMGNFASTLTLL 1274

Query: 131  ---------RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVT 181
                         S   D             + +G+LG    +PE++YRRL  LQ+ +  
Sbjct: 1275 PRTMVSSEKMVSSSDGMDIDNQSPLHQVLMTTQNGSLGLITCIPEESYRRLSALQSQLTN 1334

Query: 182  HTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHND 241
               H  GLNPRAFR  +      G   RG++DG+L++K++ +S   + EI  ++G++  +
Sbjct: 1335 TLEHPCGLNPRAFRAVESD----GTAGRGMLDGNLLFKWIDMSKQRKTEIAGRVGAREWE 1390

Query: 242  ILDELYDIEALS 253
            I     D+EA+S
Sbjct: 1391 I---KADLEAIS 1399


>gi|325189779|emb|CCA24259.1| conserved hypothetical protein [Albugo laibachii Nc14]
          Length = 1911

 Score = 94.7 bits (234), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 55/198 (27%), Positives = 96/198 (48%), Gaps = 15/198 (7%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  V   EF I  + +  + +D ++N+ +  Y P   ES GG RL++ +DFH+G  +++ 
Sbjct: 1707 PLAVTTAEFNILDTRLALIAADVEENLHVLQYAPHDIESRGGQRLLRTSDFHVGVQISSI 1766

Query: 128  F-KIRCKPSPISDAPGARSR-----FLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVT 181
              K+    +       A+ R     +L    S +G +   +P+PE+ +RRL  LQNVM++
Sbjct: 1767 LRKLVISNASHQQYIPAKGRCIGNMYLNVLGSSEGGIAALIPVPERVFRRLFTLQNVMIS 1826

Query: 182  HTSHTGGLNPRAFRTYKGKG---------YYAGNPSRGIIDGSLVWKFLQLSLGERLEIC 232
                   LNPR FR  K  G         +      +G +DG ++ +FL L    + E+ 
Sbjct: 1827 ALPQNCALNPREFRVMKANGRVRSGRADAWCKQKWKKGFLDGQVLCRFLHLDYVAQKELA 1886

Query: 233  KKIGSKHNDILDELYDIE 250
            + IG+    I+  L +++
Sbjct: 1887 RCIGTNPEVIIQNLSELQ 1904


>gi|325187036|emb|CCA21579.1| conserved hypothetical protein [Albugo laibachii Nc14]
          Length = 1912

 Score = 94.7 bits (234), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 55/198 (27%), Positives = 96/198 (48%), Gaps = 15/198 (7%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  V   EF I  + +  + +D ++N+ +  Y P   ES GG RL++ +DFH+G  +++ 
Sbjct: 1708 PLAVTTAEFNILDTRLALIAADVEENLHVLQYAPHDIESRGGQRLLRTSDFHVGVQISSI 1767

Query: 128  F-KIRCKPSPISDAPGARSR-----FLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVT 181
              K+    +       A+ R     +L    S +G +   +P+PE+ +RRL  LQNVM++
Sbjct: 1768 LRKLVISNASHQQYIPAKGRCIGNMYLNVLGSSEGGIAALIPVPERVFRRLFTLQNVMIS 1827

Query: 182  HTSHTGGLNPRAFRTYKGKG---------YYAGNPSRGIIDGSLVWKFLQLSLGERLEIC 232
                   LNPR FR  K  G         +      +G +DG ++ +FL L    + E+ 
Sbjct: 1828 ALPQNCALNPREFRVMKANGRVRSGRADAWCKQKWKKGFLDGQVLCRFLHLDYVAQKELA 1887

Query: 233  KKIGSKHNDILDELYDIE 250
            + IG+    I+  L +++
Sbjct: 1888 RCIGTNPEVIIQNLSELQ 1905


>gi|146324727|ref|XP_747211.2| cleavage and polyadenylation specificity factor subunit A, putative
            [Aspergillus fumigatus Af293]
 gi|148886828|sp|Q4WCL1.2|CFT1_ASPFU RecName: Full=Protein cft1; AltName: Full=Cleavage factor two protein
            1
 gi|129556124|gb|EAL85173.2| cleavage and polyadenylation specificity factor subunit A, putative
            [Aspergillus fumigatus Af293]
          Length = 1401

 Score = 93.6 bits (231), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 55/192 (28%), Positives = 96/192 (50%), Gaps = 17/192 (8%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFF-- 128
            V A EF+ +   +  +++D D N+ +  Y PE  +S+ G RL+ ++ FH+G    T    
Sbjct: 1210 VVAAEFLPDGDKLFILVADSDCNLHVLQYDPEDPKSSNGDRLLARSKFHMGHFATTMTLL 1269

Query: 129  --------KIRCKPSPIS-DAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVM 179
                    K    P  +  D+     + L    S  G++G    +PE++YRRL  LQ+ +
Sbjct: 1270 PRTMVSSEKAMANPDSMEIDSQTISQQVLI--TSQSGSVGIVTSVPEESYRRLSALQSQL 1327

Query: 180  VTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKH 239
                 H  GLNPRA+R  +      G   RG++DG+L++++L +    ++EI  ++G+  
Sbjct: 1328 ANSLEHPCGLNPRAYRAVESD----GTAGRGMLDGNLLYQWLDMGQHRKMEIAARVGAHE 1383

Query: 240  NDILDELYDIEA 251
             +I  +L  I A
Sbjct: 1384 WEIKADLEAIGA 1395


>gi|119484094|ref|XP_001261950.1| cleavage and polyadenylation specificity factor subunit A, putative
            [Neosartorya fischeri NRRL 181]
 gi|148886830|sp|A1DB13.1|CFT1_NEOFI RecName: Full=Protein cft1; AltName: Full=Cleavage factor two protein
            1
 gi|119410106|gb|EAW20053.1| cleavage and polyadenylation specificity factor subunit A, putative
            [Neosartorya fischeri NRRL 181]
          Length = 1400

 Score = 93.6 bits (231), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 55/192 (28%), Positives = 96/192 (50%), Gaps = 17/192 (8%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFF-- 128
            V A EF+ +   +  +++D D N+ +  Y PE  +S+ G RL+ ++ FH+G    T    
Sbjct: 1209 VVAAEFLPDGDKLFILVADSDCNLHVLQYDPEDPKSSNGDRLLARSKFHMGHFATTMTLL 1268

Query: 129  --------KIRCKPSPIS-DAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVM 179
                    K    P  +  D+     + L    S  G++G    +PE++YRRL  LQ+ +
Sbjct: 1269 PRTMVSSEKAMADPDSMEIDSQTISQQVLI--TSQSGSVGIVTSVPEESYRRLSALQSQL 1326

Query: 180  VTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKH 239
                 H  GLNPRA+R  +      G   RG++DG+L++++L +    ++EI  ++G+  
Sbjct: 1327 TNSLEHPCGLNPRAYRAVESD----GTAGRGMLDGNLLYQWLDMGQHRKMEIAARVGAHE 1382

Query: 240  NDILDELYDIEA 251
             +I  +L  I A
Sbjct: 1383 WEIKADLEAIGA 1394


>gi|159123784|gb|EDP48903.1| cleavage and polyadenylation specificity factor subunit A, putative
            [Aspergillus fumigatus A1163]
          Length = 1401

 Score = 93.6 bits (231), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 55/192 (28%), Positives = 96/192 (50%), Gaps = 17/192 (8%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFF-- 128
            V A EF+ +   +  +++D D N+ +  Y PE  +S+ G RL+ ++ FH+G    T    
Sbjct: 1210 VVAAEFLPDGDKLFILVADSDCNLHVLQYDPEDPKSSNGDRLLARSKFHMGHFATTMTLL 1269

Query: 129  --------KIRCKPSPIS-DAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVM 179
                    K    P  +  D+     + L    S  G++G    +PE++YRRL  LQ+ +
Sbjct: 1270 PRTMVSSEKAMANPDSMEIDSQTISQQVLI--TSQSGSVGIVTSVPEESYRRLSALQSQL 1327

Query: 180  VTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKH 239
                 H  GLNPRA+R  +      G   RG++DG+L++++L +    ++EI  ++G+  
Sbjct: 1328 ANSLEHPCGLNPRAYRAVESD----GTAGRGMLDGNLLYQWLDMGQHRKMEIAARVGAHE 1383

Query: 240  NDILDELYDIEA 251
             +I  +L  I A
Sbjct: 1384 WEIKADLEAIGA 1395


>gi|358372791|dbj|GAA89393.1| cleavage and polyadenylation specificity factor subunit A
            [Aspergillus kawachii IFO 4308]
          Length = 1372

 Score = 93.2 bits (230), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 55/192 (28%), Positives = 97/192 (50%), Gaps = 16/192 (8%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
            V A EF+ +   +  +++D D N+ +  Y PE  +S+ G +L+ ++ FH G   +T   +
Sbjct: 1181 VSAAEFLPDGRRLFIVVADSDCNIHVLQYDPEDPKSSNGDKLLSRSKFHTGNFASTLTLL 1240

Query: 131  ---------RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVT 181
                         S   D     +       + +G+LG    +PE++YRRL  LQ+ +  
Sbjct: 1241 PRTMVSSEKMISNSDDMDIDNQSALHQVLMTTQNGSLGLITCMPEESYRRLSALQSQLTN 1300

Query: 182  HTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHND 241
               H  GLNPRAFR  +      G   RG++DG+L++K++ +S   + EI  ++G++  +
Sbjct: 1301 TLEHPCGLNPRAFRAVESD----GTAGRGMLDGNLLFKWIDMSKQRKTEIAGRVGAREWE 1356

Query: 242  ILDELYDIEALS 253
            I     D+EA+S
Sbjct: 1357 I---KADLEAIS 1365


>gi|414587797|tpg|DAA38368.1| TPA: hypothetical protein ZEAMMB73_143443 [Zea mays]
          Length = 153

 Score = 92.8 bits (229), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 51/144 (35%), Positives = 78/144 (54%), Gaps = 1/144 (0%)

Query: 105 ESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPL 164
           ES  G +L+ + +FH+G HV+ F +++  P+    A    +RF   + +LDG +G   P+
Sbjct: 3   ESWKGQKLLSRAEFHVGAHVSKFLRLQMLPTQ-GLASEKTNRFALVFGTLDGGIGCIAPV 61

Query: 165 PEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLS 224
            E  +RRL  LQ  +V    H  GLNPR+FR +K  G         IID  L+  +  +S
Sbjct: 62  DELTFRRLQSLQRKLVDAIPHVCGLNPRSFRHFKSNGKAHRPGPDNIIDFELLSHYEMMS 121

Query: 225 LGERLEICKKIGSKHNDILDELYD 248
           L E+LEI ++IG+  + IL    D
Sbjct: 122 LEEQLEIAQQIGTTRSQILSNFSD 145


>gi|261201748|ref|XP_002628088.1| protein CFT1 [Ajellomyces dermatitidis SLH14081]
 gi|239590185|gb|EEQ72766.1| protein CFT1 [Ajellomyces dermatitidis SLH14081]
          Length = 1403

 Score = 92.8 bits (229), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 56/192 (29%), Positives = 91/192 (47%), Gaps = 17/192 (8%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
            V A +F+ +   +  +++D D N+ +  Y PE   S+ G RL+ ++ FH G   +T   +
Sbjct: 1211 VMAADFLPDGKRLYILVADDDCNIHVLQYDPEDPGSSKGDRLLHRSTFHTGHFASTMTLL 1270

Query: 131  RCKPSPISDAPGARSRFL----------TWYASLDGALGFFLPLPEKNYRRLLMLQNVMV 180
                 P +  P A    +              S  G++    PL E  YRRL  LQ+ ++
Sbjct: 1271 PRTIIPSAQGPDANPDMMELDSSGPLYHVLVTSETGSIALITPLSETAYRRLSALQSQLI 1330

Query: 181  THTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHN 240
                H  GLNPRAFR  +  G       RG++DG L+ ++L L    + EI  ++G+   
Sbjct: 1331 NTLEHPCGLNPRAFRAIESDGIGG----RGMVDGDLLHRWLDLGTQRKAEIAHRVGA--- 1383

Query: 241  DILDELYDIEAL 252
            DI +   D+EA+
Sbjct: 1384 DIWEIRADLEAI 1395


>gi|239611898|gb|EEQ88885.1| protein CFT1 [Ajellomyces dermatitidis ER-3]
 gi|327352847|gb|EGE81704.1| CFT1 [Ajellomyces dermatitidis ATCC 18188]
          Length = 1402

 Score = 92.4 bits (228), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 56/192 (29%), Positives = 91/192 (47%), Gaps = 17/192 (8%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
            V A +F+ +   +  +++D D N+ +  Y PE   S+ G RL+ ++ FH G   +T   +
Sbjct: 1210 VMAADFLPDGKRLYILVADDDCNIHVLQYDPEDPGSSKGDRLLHRSTFHTGHFASTMTLL 1269

Query: 131  RCKPSPISDAPGARSRFL----------TWYASLDGALGFFLPLPEKNYRRLLMLQNVMV 180
                 P +  P A    +              S  G++    PL E  YRRL  LQ+ ++
Sbjct: 1270 PRTIIPSAQGPDANPDMMELDSSGPLYHVLVTSETGSIALITPLSETAYRRLSALQSQLI 1329

Query: 181  THTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHN 240
                H  GLNPRAFR  +  G       RG++DG L+ ++L L    + EI  ++G+   
Sbjct: 1330 NTLEHPCGLNPRAFRAIESDGIGG----RGMVDGDLLHRWLDLGTQRKAEIAHRVGA--- 1382

Query: 241  DILDELYDIEAL 252
            DI +   D+EA+
Sbjct: 1383 DIWEIRADLEAI 1394


>gi|296806499|ref|XP_002844059.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
 gi|238845361|gb|EEQ35023.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
          Length = 1348

 Score = 92.0 bits (227), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 54/178 (30%), Positives = 90/178 (50%), Gaps = 13/178 (7%)

Query: 70   GVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFK 129
             V A +F+ + + +  +++D D N+ +  Y PE   S+ G RL+ +  FH+G   +T   
Sbjct: 1155 AVIAADFLPDGNRLYVLVADDDCNLHVLQYDPEDPSSSKGDRLLHRNVFHVGHFASTMTL 1214

Query: 130  I----RCKPSPI------SDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVM 179
            +        SP       +DAP   S++        G++G   PL E +YRRLL LQ+ +
Sbjct: 1215 LPQGSHTPHSPADRDAMDTDAPLPPSKYQILMTFQTGSVGIITPLNEDSYRRLLALQSQL 1274

Query: 180  VTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGS 237
            V    H  GLNPR +R  +  G       RG+IDG+L+ ++L +    + EI  ++G+
Sbjct: 1275 VNALEHPCGLNPRGYRAVESDGIGG---QRGMIDGNLLLRWLDMGAQRKAEIAGRVGA 1329


>gi|317157892|ref|XP_001826637.2| protein cft1 [Aspergillus oryzae RIB40]
 gi|391864317|gb|EIT73613.1| mRNA cleavage and polyadenylation factor II complex, subunit CFT1
            [Aspergillus oryzae 3.042]
          Length = 1389

 Score = 92.0 bits (227), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 58/192 (30%), Positives = 99/192 (51%), Gaps = 16/192 (8%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
            V A +F+ + + +  +++D D N+ +  Y PE  +S+ G RL+ ++ FH G  ++T   +
Sbjct: 1198 VLAADFLPDGNKLFILVADSDCNLHVLQYDPEDPKSSNGDRLLSRSKFHTGNFISTLTLL 1257

Query: 131  -RCKPSP---ISDAPGAR-----SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVT 181
             R   S    ISD           R      S +G++G    + E++YRRL  LQ+ +  
Sbjct: 1258 PRTSVSSEQMISDVDAMDVDIKIPRHQMLITSQNGSVGLVTCVSEESYRRLSALQSQLTN 1317

Query: 182  HTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHND 241
               H  GLNPRAFR  +      G   RG++DG L++++L +S   ++EI  ++G+   +
Sbjct: 1318 TIEHPCGLNPRAFRAVESD----GTAGRGMLDGKLLFQWLDMSKQRKVEIASRVGANEWE 1373

Query: 242  ILDELYDIEALS 253
            I     D EA+S
Sbjct: 1374 I---KADFEAIS 1382


>gi|238508528|ref|XP_002385456.1| cleavage and polyadenylation specificity factor subunit A, putative
            [Aspergillus flavus NRRL3357]
 gi|220688975|gb|EED45327.1| cleavage and polyadenylation specificity factor subunit A, putative
            [Aspergillus flavus NRRL3357]
          Length = 1204

 Score = 92.0 bits (227), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 58/192 (30%), Positives = 99/192 (51%), Gaps = 16/192 (8%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
            V A +F+ + + +  +++D D N+ +  Y PE  +S+ G RL+ ++ FH G  ++T   +
Sbjct: 1013 VLAADFLPDGNKLFILVADSDCNLHVLQYDPEDPKSSNGDRLLSRSKFHTGNFISTLTLL 1072

Query: 131  -RCKPSP---ISDAPGAR-----SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVT 181
             R   S    ISD           R      S +G++G    + E++YRRL  LQ+ +  
Sbjct: 1073 PRTSVSSEQMISDVDAMDVDIKIPRHQMLITSQNGSVGLVTCVSEESYRRLSALQSQLTN 1132

Query: 182  HTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHND 241
               H  GLNPRAFR  +      G   RG++DG L++++L +S   ++EI  ++G+   +
Sbjct: 1133 TIEHPCGLNPRAFRAVESD----GTAGRGMLDGKLLFQWLDMSKQRKVEIASRVGANEWE 1188

Query: 242  ILDELYDIEALS 253
            I     D EA+S
Sbjct: 1189 I---KADFEAIS 1197


>gi|121797760|sp|Q2TZ19.1|CFT1_ASPOR RecName: Full=Protein cft1; AltName: Full=Cleavage factor two protein
            1
 gi|83775384|dbj|BAE65504.1| unnamed protein product [Aspergillus oryzae RIB40]
          Length = 1393

 Score = 92.0 bits (227), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 58/192 (30%), Positives = 99/192 (51%), Gaps = 16/192 (8%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
            V A +F+ + + +  +++D D N+ +  Y PE  +S+ G RL+ ++ FH G  ++T   +
Sbjct: 1202 VLAADFLPDGNKLFILVADSDCNLHVLQYDPEDPKSSNGDRLLSRSKFHTGNFISTLTLL 1261

Query: 131  -RCKPSP---ISDAPGAR-----SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVT 181
             R   S    ISD           R      S +G++G    + E++YRRL  LQ+ +  
Sbjct: 1262 PRTSVSSEQMISDVDAMDVDIKIPRHQMLITSQNGSVGLVTCVSEESYRRLSALQSQLTN 1321

Query: 182  HTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHND 241
               H  GLNPRAFR  +      G   RG++DG L++++L +S   ++EI  ++G+   +
Sbjct: 1322 TIEHPCGLNPRAFRAVESD----GTAGRGMLDGKLLFQWLDMSKQRKVEIASRVGANEWE 1377

Query: 242  ILDELYDIEALS 253
            I     D EA+S
Sbjct: 1378 I---KADFEAIS 1386


>gi|145348791|ref|XP_001418827.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144579057|gb|ABO97120.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 1386

 Score = 91.7 bits (226), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 53/173 (30%), Positives = 91/173 (52%), Gaps = 8/173 (4%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
            V + EF+I+ +++  + SD   N  +F Y P++ ES  G +L+ ++ +H+G  ++   + 
Sbjct: 1195 VVSTEFLIDGATLSLLGSDMSGNARIFGYDPKSLESWKGQKLLVRSAYHVGSPISRMVRF 1254

Query: 131  RCKPSPISDAPGAR----SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHT 186
              + +    APG R    +R   ++ +LDGALG F+P  E  Y +L  LQ  + T     
Sbjct: 1255 NVEGTTAKAAPGERPKGTNRHAVFFGTLDGALGIFMPTDEPTYAKLHALQRELNTTVRSP 1314

Query: 187  GGLNPRAFRT---YKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 236
             G NPR FRT   ++GK      P   ++DG L+ KF  L+  E+  + ++ G
Sbjct: 1315 IGCNPRTFRTPKVFEGKHVQLLAP-LDVLDGGLLSKFETLTFTEQRAVAERSG 1366


>gi|212541400|ref|XP_002150855.1| cleavage and polyadenylation specificity factor subunit A, putative
            [Talaromyces marneffei ATCC 18224]
 gi|210068154|gb|EEA22246.1| cleavage and polyadenylation specificity factor subunit A, putative
            [Talaromyces marneffei ATCC 18224]
          Length = 1383

 Score = 90.5 bits (223), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 54/192 (28%), Positives = 93/192 (48%), Gaps = 16/192 (8%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFF-- 128
            V   +F+ +   +  +++D D N+ +  Y PE  +S+ G RL+ +  FH+G   +T    
Sbjct: 1192 VVTADFLPDGKKLYILVADGDCNLYVLQYDPEDPKSSNGDRLLNRCKFHMGHFASTLTLL 1251

Query: 129  -------KIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVT 181
                   ++    S   D       +     +  G++     L E++YRRL  LQ+ +  
Sbjct: 1252 PRTAVSSELAVMSSDSMDIDSYTPLYQALITTQSGSMALITSLSEESYRRLTALQSQLSN 1311

Query: 182  HTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHND 241
               H  GLNPRA+R+ +  G       RG+IDG L+ ++L LS   +LEI  ++G+   +
Sbjct: 1312 TLEHPCGLNPRAYRSVESDGVVG----RGMIDGKLLMRWLDLSRSRKLEIAGRVGADEWE 1367

Query: 242  ILDELYDIEALS 253
            I     D+EA+S
Sbjct: 1368 I---RADLEAVS 1376


>gi|315045910|ref|XP_003172330.1| serine/threonine protein kinase [Arthroderma gypseum CBS 118893]
 gi|311342716|gb|EFR01919.1| serine/threonine protein kinase [Arthroderma gypseum CBS 118893]
          Length = 1397

 Score = 90.5 bits (223), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 52/178 (29%), Positives = 89/178 (50%), Gaps = 13/178 (7%)

Query: 70   GVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFK 129
             V   +F+ + + +  +++D D N+ +  Y PE   S+ G RL+ ++ FH G   +T   
Sbjct: 1204 AVVDADFLPDGNKLYILVADDDCNLHVLQYDPEDPSSSKGDRLLHRSVFHTGHFASTMTL 1263

Query: 130  I----RCKPSPI------SDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVM 179
            +        SP+      +D P   S++        G++G   PL E +YRRLL LQ+ +
Sbjct: 1264 LPHGSHTLSSPVDEDAMDTDLPPPPSKYQVLITFQTGSIGVISPLNEDSYRRLLALQSQL 1323

Query: 180  VTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGS 237
            V    H  GLNPR +R  +  G       RG+IDG+L+ ++L +    + EI  ++G+
Sbjct: 1324 VNALEHPCGLNPRGYRAVESDGMGG---QRGMIDGNLLLRWLDMGAQRKAEIAGRVGA 1378


>gi|225558298|gb|EEH06582.1| conserved hypothetical protein [Ajellomyces capsulatus G186AR]
          Length = 1408

 Score = 90.1 bits (222), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 54/192 (28%), Positives = 91/192 (47%), Gaps = 17/192 (8%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
            V A +F+ + + +  +++D D N+ +  Y PE   S+ G RL+ ++ F  G   +T   +
Sbjct: 1216 VMAADFLPDGNRLYILVADDDCNIHVLQYDPEDPGSSKGDRLLHRSTFQTGHFASTMTLL 1275

Query: 131  RCKPSPISDAPGARSRFL----------TWYASLDGALGFFLPLPEKNYRRLLMLQNVMV 180
                +  S  P A    +              S  G++    P+ E +YRRL  LQ+ + 
Sbjct: 1276 PRTATSSSQGPDADPDMMDLDSSGPLHHVLVTSETGSIALITPVSETSYRRLSALQSQLT 1335

Query: 181  THTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHN 240
                H  GLNPRAFR  +  G       RG++DG LV ++L L    + EI  ++G+   
Sbjct: 1336 NTLEHPCGLNPRAFRAVESDGIGG----RGMVDGDLVKRWLDLGTQRKAEIANRVGA--- 1388

Query: 241  DILDELYDIEAL 252
            D+ +   D+EA+
Sbjct: 1389 DVWEIRADLEAI 1400


>gi|240277254|gb|EER40763.1| cleavage factor two protein 1 [Ajellomyces capsulatus H143]
          Length = 1408

 Score = 90.1 bits (222), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 54/192 (28%), Positives = 91/192 (47%), Gaps = 17/192 (8%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
            V A +F+ + + +  +++D D N+ +  Y PE   S+ G RL+ ++ F  G   +T   +
Sbjct: 1216 VMAADFLPDGNRLYILVADDDCNIHVLQYDPEDPGSSKGDRLLHRSTFQTGHFASTMTLL 1275

Query: 131  RCKPSPISDAPGARSRFL----------TWYASLDGALGFFLPLPEKNYRRLLMLQNVMV 180
                +  S  P A    +              S  G++    P+ E +YRRL  LQ+ + 
Sbjct: 1276 PRTATSSSQGPDADPDMMDLDSSGPLHHVLVTSETGSIALITPVSETSYRRLSALQSQLA 1335

Query: 181  THTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHN 240
                H  GLNPRAFR  +  G       RG++DG LV ++L L    + EI  ++G+   
Sbjct: 1336 NTLEHPCGLNPRAFRAVESDGIGG----RGMVDGDLVKRWLDLGTQRKAEIANRVGA--- 1388

Query: 241  DILDELYDIEAL 252
            D+ +   D+EA+
Sbjct: 1389 DVWEIRADLEAI 1400


>gi|308805673|ref|XP_003080148.1| cleavage and polyadenylation specificity factor (ISS) [Ostreococcus
            tauri]
 gi|116058608|emb|CAL54315.1| cleavage and polyadenylation specificity factor (ISS), partial
            [Ostreococcus tauri]
          Length = 1473

 Score = 89.7 bits (221), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 51/170 (30%), Positives = 87/170 (51%), Gaps = 9/170 (5%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
            V + EF+I+ +++  + SD   N   F Y P++ ES  G +L+ +  +H+G  ++   + 
Sbjct: 1023 VVSSEFLIDGTTLSLLGSDMSGNARTFGYDPKSIESWKGQKLLPRAAYHVGSPISRMVRF 1082

Query: 131  -----RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSH 185
                 + K +     P   +RF  ++ +LDGALG F+P     Y +LL +Q  + T    
Sbjct: 1083 NVEGSKSKMASTDGKPKGANRFAVFFGTLDGALGIFMPTDPVTYEKLLAIQRELTTAVRS 1142

Query: 186  TGGLNPRAFRT---YKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEIC 232
              G NPR FRT   ++GK      P   ++DG L+ KF  L+  E+++I 
Sbjct: 1143 PIGCNPRTFRTPKVFEGKHVQLRAP-LDVLDGGLLSKFETLTFSEQVKIA 1191


>gi|149512998|ref|XP_001514888.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1-like, partial [Ornithorhynchus anatinus]
          Length = 831

 Score = 89.7 bits (221), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 52/165 (31%), Positives = 91/165 (55%), Gaps = 21/165 (12%)

Query: 3   IVQELLTVSLGLHGNRPLLL-----VRTQH---------ELLIYQAFRHPKGTLKLR-FK 47
           +V+E+L V+LG    RP LL     +R            +L I+Q        L     K
Sbjct: 671 LVKEVLLVALGNRQTRPYLLRLKWAIRDSELTSITFIDMQLYIHQIISVKNFILAADVMK 730

Query: 48  KLKVLFVSDRSKRAN--EQPGLPRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARE 105
            + +L   + SK  +   +   P  VY+ +F+++++ +GF++SD+D+N++++MY PEA+E
Sbjct: 731 SISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKE 790

Query: 106 SNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARSRFLTW 150
           S GG RL+++ DFH+G HVN F++  C+ +    A G   + + W
Sbjct: 791 SFGGMRLLRRADFHVGAHVNAFWRTPCRGA----AEGPSKKSIVW 831


>gi|154285962|ref|XP_001543776.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
 gi|150407417|gb|EDN02958.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
          Length = 1283

 Score = 89.4 bits (220), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 54/192 (28%), Positives = 91/192 (47%), Gaps = 17/192 (8%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
            V A +F+ + + +  +++D D N+ +  Y PE   S+ G RL+ ++ F  G   +T   +
Sbjct: 1091 VMAADFLPDGNRLYILVADDDCNIHVLQYDPEDPGSSKGDRLLHRSTFQTGHFASTMTLL 1150

Query: 131  RCKPSPISDAPGARSRFL----------TWYASLDGALGFFLPLPEKNYRRLLMLQNVMV 180
                +  S  P A    +              S  G++    P+ E +YRRL  LQ+ + 
Sbjct: 1151 PRTATSSSQRPDADPDMMDLDSSGPLHHVLVTSETGSIALITPVSETSYRRLSALQSQLT 1210

Query: 181  THTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHN 240
                H  GLNPRAFR  +  G       RG++DG LV ++L L    + EI  ++G+   
Sbjct: 1211 NTLEHPCGLNPRAFRAVESDGIGG----RGMVDGDLVKRWLDLGTQRKAEIANRVGA--- 1263

Query: 241  DILDELYDIEAL 252
            D+ +   D+EA+
Sbjct: 1264 DVWEIRADLEAI 1275


>gi|321475208|gb|EFX86171.1| hypothetical protein DAPPUDRAFT_313209 [Daphnia pulex]
          Length = 1260

 Score = 89.4 bits (220), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 36/64 (56%), Positives = 53/64 (82%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  V A ++ I+++ +GF++SD +KN++L+MYQPEARES GGHRLI+K DFH+GQ V+T 
Sbjct: 1197 PLEVMAVDYYIDNTQLGFLVSDAEKNLILYMYQPEARESQGGHRLIRKADFHVGQVVSTM 1256

Query: 128  FKIR 131
            F+I+
Sbjct: 1257 FRIK 1260


>gi|327287424|ref|XP_003228429.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Anolis carolinensis]
          Length = 1294

 Score = 88.6 bits (218), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 34/88 (38%), Positives = 61/88 (69%), Gaps = 3/88 (3%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  VY  +F+++S  +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVN F
Sbjct: 1206 PLEVYCVDFMVDSCQLGFLVSDRDRNLLVYMYLPEAKESFGGMRLLRRADFHVGAHVNAF 1265

Query: 128  FKIRCKPS---PISDAPGARSRFLTWYA 152
            ++  C+ +   P   +    ++ +TW+ 
Sbjct: 1266 WRTPCRGAMEGPTKKSSAWENKHITWFG 1293



 Score = 47.4 bits (111), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 34/85 (40%), Positives = 47/85 (55%), Gaps = 14/85 (16%)

Query: 3   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGTLKLRFKKL--KVLFVSD 56
           +V+E+L V+LG   +RP LLV    ELLIY+AF H     +  LK+RFKK+   + F   
Sbjct: 847 LVKEVLLVALGNRQSRPYLLVHVDQELLIYEAFNHDSQLGQTNLKVRFKKVPHNINFREK 906

Query: 57  R---SKRANEQPG-----LPRGVYA 73
           +   SK+  E  G     +PRG  A
Sbjct: 907 KPRPSKKKTESAGGEEASVPRGRVA 931


>gi|301124072|ref|XP_002909688.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262107255|gb|EEY65307.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 176

 Score = 87.4 bits (215), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 52/173 (30%), Positives = 87/173 (50%), Gaps = 15/173 (8%)

Query: 99  YQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPS-PISDAPGAR-----SRFLTWYA 152
           + P+  ES GG RL++ +DFHLG  V++ F+ R   S  +  A   R     S ++    
Sbjct: 3   FAPQDIESRGGQRLLRVSDFHLGVQVSSMFRKRVDASGSVVSATNGRNAAPLSNYVNVMG 62

Query: 153 SLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGY-YAGNPS--- 208
           + +G +G  +P+ E+ +RRL  LQNVMV        LNPR FR  K       G P    
Sbjct: 63  TSEGGVGALVPVGERVFRRLFTLQNVMVNTLPQNCALNPREFRMLKTNAQRRCGRPDAWS 122

Query: 209 -----RGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 256
                +  +D  ++++FLQL+   + E+ + IG+    ++  L +++  +S F
Sbjct: 123 KKKWKKSFLDAFVLFRFLQLNYVAQKELARCIGTTPEVVMHNLLEVQHATSTF 175


>gi|328848896|gb|EGF98089.1| hypothetical protein MELLADRAFT_96156 [Melampsora larici-populina
            98AG31]
          Length = 1427

 Score = 87.0 bits (214), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 53/180 (29%), Positives = 91/180 (50%), Gaps = 5/180 (2%)

Query: 73   ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC 132
              +F++  + +  + SD    + +F Y P   ES GG +L+ +T+F     + +      
Sbjct: 1252 TIDFLVLENKLSIVTSDLGGTIRIFEYNPTNIESQGGLKLLCRTEFGTAGEMGSSLGFGK 1311

Query: 133  KPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPR 192
            + S   +A        T +A LDG++   +P+ E  ++RL ++Q  ++ H  H  GLNPR
Sbjct: 1312 RLSSKEEAKSIG----TLFAGLDGSISSLVPVKEAVFKRLQIVQTRLIRHLDHFAGLNPR 1367

Query: 193  AFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEAL 252
             FRT +     +   +RGIIDG ++ +F  L L E+  I K  GS  N IL  L +++ +
Sbjct: 1368 GFRTVRND-LVSRAMNRGIIDGEIIERFGALKLDEQDSIGKLAGSDRNTILINLNNLKGI 1426


>gi|330799483|ref|XP_003287774.1| hypothetical protein DICPUDRAFT_32967 [Dictyostelium purpureum]
 gi|325082229|gb|EGC35718.1| hypothetical protein DICPUDRAFT_32967 [Dictyostelium purpureum]
          Length = 1453

 Score = 87.0 bits (214), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 53/185 (28%), Positives = 96/185 (51%), Gaps = 9/185 (4%)

Query: 70   GVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFK 129
             +++ EF++   ++  ++SD DKN++LF + P    S  G  L+ K DFH+G ++  F +
Sbjct: 1260 NIFSTEFIVNQQTLSILVSDLDKNILLFSFDPADPTSRQGQMLLCKADFHIGSNIEKFVR 1319

Query: 130  IRCKPS-PISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGG 188
               K +   S      +  L ++ +LDG+L    PL E+ Y+    LQ+ +  +     G
Sbjct: 1320 TPMKFNIQSSSNGNNNNDQLVFFGTLDGSLNVLRPLDERMYQLFYHLQSKLY-YLPQPAG 1378

Query: 189  LNPRAFRTYKG-KGYYAGNPS------RGIIDGSLVWKFLQLSLGERLEICKKIGSKHND 241
            LN + +R +K     +  +PS      + I+DG L+ KF++L+  ER  +   +GS  ++
Sbjct: 1379 LNAKQYRAFKSFSQNFHFSPSTIHQLPKYILDGDLLSKFVKLNQKERRLLASSVGSNTDE 1438

Query: 242  ILDEL 246
            IL  L
Sbjct: 1439 ILTAL 1443


>gi|301093655|ref|XP_002997673.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262110063|gb|EEY68115.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 176

 Score = 87.0 bits (214), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 52/173 (30%), Positives = 86/173 (49%), Gaps = 15/173 (8%)

Query: 99  YQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPS-PISDAPGAR-----SRFLTWYA 152
           + P+  ES GG RL++ +DFHLG  V++ F+ R   S  +  A   R     S ++    
Sbjct: 3   FAPQDIESRGGQRLLRVSDFHLGVQVSSMFRKRVDASGSVVSATNGRNAAPLSNYVNVMG 62

Query: 153 SLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGY-YAGNPS--- 208
           + +G +G  +P+ E+ +RRL  LQNVMV        LNPR FR  K       G P    
Sbjct: 63  TSEGGVGALVPVGERVFRRLFTLQNVMVNTLPQNCALNPREFRMLKTNAQRRCGRPDAWS 122

Query: 209 -----RGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 256
                +  +D  ++++FLQL    + E+ + IG+    ++  L +++  +S F
Sbjct: 123 KKKWKKSFLDAFVLFRFLQLDYVAQKELARCIGTTPEVVMHNLLEVQHATSTF 175


>gi|302652143|ref|XP_003017931.1| hypothetical protein TRV_08063 [Trichophyton verrucosum HKI 0517]
 gi|291181517|gb|EFE37286.1| hypothetical protein TRV_08063 [Trichophyton verrucosum HKI 0517]
          Length = 429

 Score = 86.7 bits (213), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 50/179 (27%), Positives = 90/179 (50%), Gaps = 16/179 (8%)

Query: 70  GVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFK 129
            V   +F+ + + +  +++D D N+ +  Y PE   S+ G RL+ ++ FH G   +T   
Sbjct: 237 AVVDADFLPDGNKLYILVADDDCNLHVLQYDPEDPSSSKGDRLLHRSVFHTGHFASTMTL 296

Query: 130 I----RCKPSPISD-------APGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNV 178
           +    R   SP+ +        P ++ + L  + +  G++    PL E +YRRLL LQ+ 
Sbjct: 297 LPHGARTPSSPVDEDAMDTDSPPPSKYQILMTFQT--GSVAVITPLGEDSYRRLLALQSQ 354

Query: 179 MVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGS 237
           +V    H   LNPR +R  +  G       RG+IDG+L+ ++L +    + EI  ++G+
Sbjct: 355 LVNALEHPCSLNPRGYRAVESDGMGG---QRGMIDGNLLLRWLDMGAQRKAEIAGRVGA 410


>gi|328864890|gb|EGG13276.1| CPSF domain-containing protein [Dictyostelium fasciculatum]
          Length = 1627

 Score = 86.7 bits (213), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 57/213 (26%), Positives = 101/213 (47%), Gaps = 15/213 (7%)

Query: 44   LRFKKLKVLFVSDRSKRANEQPGLPRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEA 103
            + F KLKV  +  ++     +       Y+ +F+++   +  ++SD ++N+ +F + P  
Sbjct: 1410 ISFFKLKVCIIQKKNIILLGKDYEEVSTYSSDFIVDEKKLSMVLSDANRNIRMFSFDPSD 1469

Query: 104  RESNGGHRLIKKTDFHLGQHVNTFFKIRCKPS-----PISDAPGARSRFLTWYASLDGAL 158
             ES  G  L+ K+ FH+G+  N F +I  K +       S +     + L +Y +L G +
Sbjct: 1470 PESRAGQMLLAKSSFHIGELNNKFVRIPMKNTNYDNNSSSSSIIVNDKHLLFYGTLGGGI 1529

Query: 159  GFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNP-----SRGIID 213
               +P+  K +  +L      + H   T GLNPR FR     G++  N      ++ ++D
Sbjct: 1530 NLLMPI-NKRFHEILHALETKLMHRGQTAGLNPRGFR----YGHHVNNTLGHLHNQYVVD 1584

Query: 214  GSLVWKFLQLSLGERLEICKKIGSKHNDILDEL 246
            G L+ KF  LS  +  ++   IGS    ILD L
Sbjct: 1585 GDLLTKFQSLSPDDAKQLATSIGSTTPIILDLL 1617


>gi|452841862|gb|EME43798.1| hypothetical protein DOTSEDRAFT_79774 [Dothistroma septosporum NZE10]
          Length = 1347

 Score = 85.9 bits (211), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 53/189 (28%), Positives = 88/189 (46%), Gaps = 16/189 (8%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
            V A EF+    ++  +I D D ++ +  Y PE  +S  G RL+ ++ FHLG        +
Sbjct: 1151 VMAAEFLPFDGALYILIIDADMDLHVLQYDPENPKSQNGMRLLHRSTFHLGHFATNMLLL 1210

Query: 131  RCKPSPISD------------APGARSR-FLTWYASLDGALGFFLPLPEKNYRRLLMLQN 177
                +P  +            +P   S  F     SL G++G   PL E +YRRL  LQ 
Sbjct: 1211 PSSLNPFGENQPFTNGDTNGESPEESSPLFHVLTTSLTGSIGMITPLDESSYRRLSALQT 1270

Query: 178  VMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGS 237
             + T   H   LNPRA+R  + + +     +RG++DG++V +  +L    R ++  + G+
Sbjct: 1271 HLTTILEHPASLNPRAYRAIESESFGG---ARGVVDGNIVRRINELGAARRADVLARAGA 1327

Query: 238  KHNDILDEL 246
                I  +L
Sbjct: 1328 DAWSIRSDL 1336


>gi|327304811|ref|XP_003237097.1| hypothetical protein TERG_01819 [Trichophyton rubrum CBS 118892]
 gi|326460095|gb|EGD85548.1| hypothetical protein TERG_01819 [Trichophyton rubrum CBS 118892]
          Length = 1398

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 49/179 (27%), Positives = 90/179 (50%), Gaps = 16/179 (8%)

Query: 70   GVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFK 129
             V   +F+ + + +  +++D D N+ +  Y PE   S+ G RL++++ FH G   +T   
Sbjct: 1206 AVVDADFLPDGNKLYILVADDDCNLHVLQYDPEDPSSSKGDRLLRRSVFHTGHFASTVTL 1265

Query: 130  I----RCKPSPISD-------APGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNV 178
            +        SP+ +        P ++ + L  + +  G++    PL E +YRRLL LQ+ 
Sbjct: 1266 LPHGAHTTSSPVDEDAMDTDSPPPSKYQILMTFQT--GSIAVITPLSEDSYRRLLALQSQ 1323

Query: 179  MVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGS 237
            +V    H   LNPR +R  +  G       RG+IDG+L+ ++L +    + EI  ++G+
Sbjct: 1324 LVNALEHPCSLNPRGYRAVESDGMGG---QRGMIDGNLLLRWLDMGAQRKAEIAGRVGA 1379


>gi|224000243|ref|XP_002289794.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220975002|gb|EED93331.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 1820

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 55/210 (26%), Positives = 90/210 (42%), Gaps = 23/210 (10%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  V+A   +    +M F+  D  +N+    Y P    + GG++L+ + DFHLG    + 
Sbjct: 1609 PTQVFAAGMISRGGAMSFVCHDDRQNIQFLQYAPTDVAARGGNKLVCRADFHLGSQTTSL 1668

Query: 128  FKIRCKPSPISDAPGARS-------------------RFLTWYASLDGALGFFLPLPEKN 168
                 + S + ++    S                   RF   + + DG+    +PL E  
Sbjct: 1669 NSHWAQSSLLFNSCTVSSTLASLKQQDSLFGRLDDDQRFAVNFGTTDGSFVSIIPLSEPT 1728

Query: 169  YRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGK----GYYAGNPSRGIIDGSLVWKFLQLS 224
            Y RL  LQ+VM         L+ RA+R Y+      G    +  +G+ID  LV KF+ L 
Sbjct: 1729 YWRLTALQSVMSNALESNAALSHRAWRLYRRSTRRGGCRTNDRKKGVIDADLVMKFVDLP 1788

Query: 225  LGERLEICKKIGSKHNDILDELYDIEALSS 254
            L E+ ++   IGS    ++D L ++    S
Sbjct: 1789 LPEQEDLTSSIGSTVGLVMDNLLELSCAGS 1818


>gi|242798830|ref|XP_002483249.1| cleavage and polyadenylation specificity factor subunit A, putative
            [Talaromyces stipitatus ATCC 10500]
 gi|218716594|gb|EED16015.1| cleavage and polyadenylation specificity factor subunit A, putative
            [Talaromyces stipitatus ATCC 10500]
          Length = 1382

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 57/192 (29%), Positives = 93/192 (48%), Gaps = 16/192 (8%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
            V   +F+ +   +  +++D D N+ +  Y PE  +S+ G RL+ +  FH+G   +T   +
Sbjct: 1191 VVTADFLPDGKKLYILVADSDCNLHVLQYDPEDPKSSNGDRLLNRCKFHMGHFASTITLL 1250

Query: 131  -RCKPSPISDAPGARSRFLTWYASL--------DGALGFFLPLPEKNYRRLLMLQNVMVT 181
             R   S       + S  +  Y  L         G +     L E++YRRL  LQ+ +  
Sbjct: 1251 PRTAVSSELAVMNSDSMDIDSYIPLHQALITTQSGLMALVTSLSEESYRRLSALQSQLSN 1310

Query: 182  HTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHND 241
               H  GLNPRA+R  +  G       RG+IDG L+ ++L LS   +LEI  ++G+   +
Sbjct: 1311 TLEHPCGLNPRAYRAVESDGVVG----RGMIDGKLLMRWLDLSRPRKLEIAGRVGADEWE 1366

Query: 242  ILDELYDIEALS 253
            I     D+EA+S
Sbjct: 1367 I---RADLEAVS 1375


>gi|390599704|gb|EIN09100.1| hypothetical protein PUNSTDRAFT_67240 [Punctularia strigosozonata
            HHB-11173 SS5]
          Length = 1439

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 51/182 (28%), Positives = 91/182 (50%), Gaps = 9/182 (4%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDF--HLGQHVNTFF 128
            V   +F+    SM  + +D++  + L+ Y P+  +S  G +L+ +T+F  H     +  F
Sbjct: 1260 VTTADFIFTEDSMSILTNDENGVMRLYQYDPQDPDSRNGQQLMCRTEFDTHTTCQTSIVF 1319

Query: 129  KIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGG 188
              R      +  P A+        S+DG+L     + E  ++RL +LQ  +  +  H  G
Sbjct: 1320 ARRVGEGEEAALPQAK----VVAGSIDGSLAALTCMDEPAFKRLQLLQGQLTRNIQHVAG 1375

Query: 189  LNPRAFRTYKGKGYYAGNP-SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELY 247
            LNP+AFR  +    Y   P S+GI+DG+L+  +L+L +  + EI K+I ++   +L +  
Sbjct: 1376 LNPKAFRIVRND--YVSKPLSKGILDGNLLSSYLELPIPRQEEITKQIATERAAVLRDWT 1433

Query: 248  DI 249
             I
Sbjct: 1434 SI 1435


>gi|453082807|gb|EMF10854.1| CPSF_A-domain-containing protein [Mycosphaerella populorum SO2202]
          Length = 1349

 Score = 85.5 bits (210), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 59/208 (28%), Positives = 91/208 (43%), Gaps = 32/208 (15%)

Query: 48   KLKVLFVSDRSKRANEQPGLPRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESN 107
            KL V+  S +SK           V   EF+    ++  +I D D ++ +  Y PE  +S 
Sbjct: 1136 KLTVMGKSPKSKME---------VMTAEFLPFDGALYILIMDADNDLHVLQYDPENPKSV 1186

Query: 108  GGHRLIKKTDFHLGQHVNTFFKIRCKPSPISD------------------APGARSRFLT 149
            GG RL+ ++ FH+G  V     +     P                     AP +    L 
Sbjct: 1187 GGMRLLHRSTFHIGHLVTNMLLVPSSLKPFESQDRDMANGTNGNNEEATRAPPSLHHILA 1246

Query: 150  WYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSR 209
               S  G++G   PL E  YRRL  LQ  +     H  GLNPRA+R  + + +     +R
Sbjct: 1247 --TSRSGSVGLITPLDEAAYRRLSALQTHLTAILEHAAGLNPRAYRAVEAESFGG---AR 1301

Query: 210  GIIDGSLVWKFLQLSLGERLEICKKIGS 237
            G++DGSLV +  +L   +R ++  + GS
Sbjct: 1302 GVVDGSLVNRIGELGAAKRADVLGRAGS 1329


>gi|303285993|ref|XP_003062286.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226455803|gb|EEH53105.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 1469

 Score = 84.7 bits (208), Expect = 3e-14,   Method: Composition-based stats.
 Identities = 49/199 (24%), Positives = 99/199 (49%), Gaps = 17/199 (8%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
            V + EF+++ S++  + +D   N  +F Y P++ ES  G +L+ K  FH+G  V+   + 
Sbjct: 1271 VLSTEFLVDGSTLSLLAADTAGNAYVFAYDPKSSESWKGQKLLTKASFHVGSPVHRMVRF 1330

Query: 131  RCK------------PSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNV 178
            + K            P+P ++     +R   ++ +LDG+LG  +P+    + +L +LQ  
Sbjct: 1331 KLKTPTGAGNDGRAAPTP-AEIKANANRHAVFFGTLDGSLGILVPMESSTHAKLEVLQRW 1389

Query: 179  MVTHTSHTGGLNPRAFR---TYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI 235
            +  +T+   GLN R++R   T +G+   +  P   ++DG ++  F  L+  ++ E     
Sbjct: 1390 LNYNTAQNAGLNGRSYRAPKTTEGRAMRSPAP-HNLLDGEMLQGFESLAWTKQAEAADAA 1448

Query: 236  GSKHNDILDELYDIEALSS 254
            G    + L  L+ + A ++
Sbjct: 1449 GMTREEALTYLHTLSAKTA 1467


>gi|345566738|gb|EGX49680.1| hypothetical protein AOL_s00078g169 [Arthrobotrys oligospora ATCC
            24927]
          Length = 1407

 Score = 84.3 bits (207), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 52/197 (26%), Positives = 97/197 (49%), Gaps = 9/197 (4%)

Query: 68   PR--GVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVN 125
            PR   + A +F+   ++  F+++D   N+ +  Y PE  +S  G+RL+ K + + G  V 
Sbjct: 1212 PRQLSITAGDFLPAGTACYFVVADAQSNIHVLQYDPENPKSIHGNRLLPKGEIYCGHEVK 1271

Query: 126  TFFKIRCKPSPISDAPGARSR------FLTWYASLDGALGFFLPLPEKNYRRLLMLQNVM 179
            +   +  K S  ++             FL  ++++ G  G    + E  YRRL ++Q  +
Sbjct: 1272 SICILPKKKSLFTEPDEDDMDEDEDEEFLCMFSTMTGVFGTVSSITESMYRRLNVIQGQI 1331

Query: 180  VTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKH 239
                 H  GLNPRA+R  K +   +  P R I+DG L+ ++L L  G R E+  + G+  
Sbjct: 1332 TNTGEHIAGLNPRAYRAAKFRN-TSSEPMRAILDGKLLVRWLMLGAGRRKELAGRAGTSE 1390

Query: 240  NDILDELYDIEALSSHF 256
              + ++L+ ++  ++ F
Sbjct: 1391 EMLREDLWFLQDATAFF 1407


>gi|403170487|ref|XP_003329830.2| hypothetical protein PGTG_11767 [Puccinia graminis f. sp. tritici CRL
            75-36-700-3]
 gi|375168746|gb|EFP85411.2| hypothetical protein PGTG_11767 [Puccinia graminis f. sp. tritici CRL
            75-36-700-3]
          Length = 1513

 Score = 84.3 bits (207), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 50/178 (28%), Positives = 90/178 (50%), Gaps = 5/178 (2%)

Query: 75   EFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKP 134
            +F++    +  + +D +  + +F Y P   ES GG +L+ +++F+    +    +   + 
Sbjct: 1340 DFLVIDGKLAIVATDLNGVIRIFEYNPTNIESQGGQKLLCRSEFNTSSEMTCSMQFGKRL 1399

Query: 135  SPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAF 194
            S   +A        T++ASLDG++   +P  E  Y+RL ++Q  +  H  H  GLNP+  
Sbjct: 1400 SAKDEA----KVMGTFFASLDGSISSLVPAKEAVYKRLQLVQTRLTRHIQHFAGLNPKGH 1455

Query: 195  RTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEAL 252
            RT +     +   +RGI+DG L+ KF  LS+ ++ EI    GS    +L  L ++  L
Sbjct: 1456 RTVRND-LVSRAINRGILDGELLIKFHLLSVTQQAEIAGLAGSDRETVLVNLLNLRGL 1512


>gi|302506529|ref|XP_003015221.1| hypothetical protein ARB_06344 [Arthroderma benhamiae CBS 112371]
 gi|291178793|gb|EFE34581.1| hypothetical protein ARB_06344 [Arthroderma benhamiae CBS 112371]
          Length = 1370

 Score = 84.3 bits (207), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 49/179 (27%), Positives = 89/179 (49%), Gaps = 16/179 (8%)

Query: 70   GVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFK 129
             V   +F+ + + +  +++D D N+ +  Y PE   S+ G RL+ ++ FH G   +T   
Sbjct: 1148 AVVDADFLPDGNKLYILVADDDCNLHVLQYDPEDPSSSKGDRLLHRSVFHTGHFASTMTL 1207

Query: 130  I----RCKPSPISD-------APGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNV 178
            +        SP+ +        P ++ + L  + +  G++    PL E +YRRLL LQ+ 
Sbjct: 1208 LPHGGHTPSSPVDEDAMDTDSPPPSKYQILMTFQT--GSIAIITPLGEDSYRRLLALQSQ 1265

Query: 179  MVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGS 237
            +V    H   LNPR +R  +  G       RG+IDG+L+ ++L +    + EI  ++G+
Sbjct: 1266 LVNALEHPCSLNPRGYRAVESDGMGG---QRGMIDGNLLLRWLDMGAQRKAEIAGRVGA 1321


>gi|67521912|ref|XP_659017.1| hypothetical protein AN1413.2 [Aspergillus nidulans FGSC A4]
 gi|74598221|sp|Q5BDG7.1|CFT1_EMENI RecName: Full=Protein cft1; AltName: Full=Cleavage factor two protein
            1
 gi|40745387|gb|EAA64543.1| hypothetical protein AN1413.2 [Aspergillus nidulans FGSC A4]
 gi|259486722|tpe|CBF84808.1| TPA: Protein cft1 (Cleavage factor two protein 1)
            [Source:UniProtKB/Swiss-Prot;Acc:Q5BDG7] [Aspergillus
            nidulans FGSC A4]
          Length = 1339

 Score = 84.3 bits (207), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 53/192 (27%), Positives = 97/192 (50%), Gaps = 16/192 (8%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
            V A +F+ + + +  +++D D N+ +  Y PE   S+ G +L+ ++ FH G   +T   +
Sbjct: 1148 VLAADFLPDGNKLFIVVADSDCNLYVLQYDPEDPNSSNGDKLLNRSKFHTGNFASTVTLL 1207

Query: 131  -RCKPSPISDAPGARSRFLTWYASL--------DGALGFFLPLPEKNYRRLLMLQNVMVT 181
             R   S      G+    +   A L        +G++G    +PE++YRRL  LQ+ +  
Sbjct: 1208 PRTLVSSERAMSGSDKMDIDNTAPLHQVLVTSHNGSIGLVTCVPEESYRRLSALQSQLTN 1267

Query: 182  HTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHND 241
               H  GLNPRA+R  +       +  RG++D +L+ ++L +S   + EI  ++G+   +
Sbjct: 1268 TLEHPCGLNPRAYRAVESD----ASAGRGMLDSNLLLQYLDMSKQRKAEIAGRVGATEWE 1323

Query: 242  ILDELYDIEALS 253
            I     D+EA+S
Sbjct: 1324 I---RADLEAIS 1332


>gi|50552095|ref|XP_503522.1| YALI0E03982p [Yarrowia lipolytica]
 gi|74634000|sp|Q6C740.1|CFT1_YARLI RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two protein
            1
 gi|49649391|emb|CAG79101.1| YALI0E03982p [Yarrowia lipolytica CLIB122]
          Length = 1269

 Score = 84.3 bits (207), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 48/186 (25%), Positives = 91/186 (48%), Gaps = 12/186 (6%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
            V  C+F IE  ++ F+++D  K + +  Y P+  +S  G RL++++ F+ G+ +++   +
Sbjct: 1092 VTTCDFAIEGENLTFVVADLQKRLHILEYDPDDPQSYSGARLLRRSVFYSGKVIDSSAMV 1151

Query: 131  RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 190
                      P    RF+      DG++   +P PE  YRRL  +Q  +    +H  GL+
Sbjct: 1152 ----------PINEDRFMVIGVCSDGSVTDVVPCPEDAYRRLYAIQTQITDKEAHVCGLH 1201

Query: 191  PRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIE 250
            PRA+R          +P R I+DG  + +F  L   ++     ++G ++  ++    D+E
Sbjct: 1202 PRAYRYDPILPGTGNSPHRPILDGHTLIRFANLPRNKQNVYANRLGQRYQQLI--WKDLE 1259

Query: 251  ALSSHF 256
             +S  F
Sbjct: 1260 LISDLF 1265


>gi|403178252|ref|XP_003336695.2| hypothetical protein PGTG_18491 [Puccinia graminis f. sp. tritici CRL
            75-36-700-3]
 gi|375164075|gb|EFP92276.2| hypothetical protein PGTG_18491 [Puccinia graminis f. sp. tritici CRL
            75-36-700-3]
          Length = 1149

 Score = 84.0 bits (206), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 50/178 (28%), Positives = 90/178 (50%), Gaps = 5/178 (2%)

Query: 75   EFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKP 134
            +F++    +  + +D +  + +F Y P   ES GG +L+ +++F+    +    +   + 
Sbjct: 976  DFLVIDGKLAIVATDLNGVIRIFEYNPTNIESQGGQKLLCRSEFNTSSEMTCSMQFGKRL 1035

Query: 135  SPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAF 194
            S   +A        T++ASLDG++   +P  E  Y+RL ++Q  +  H  H  GLNP+  
Sbjct: 1036 SAKDEA----KVMGTFFASLDGSISSLVPAKEAVYKRLQLVQTRLTRHIQHFAGLNPKGH 1091

Query: 195  RTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEAL 252
            RT +     +   +RGI+DG L+ KF  LS+ ++ EI    GS    +L  L ++  L
Sbjct: 1092 RTVRND-LVSRAINRGILDGELLIKFHLLSVTQQAEIAGLAGSDRETVLVNLLNLRGL 1148


>gi|405121446|gb|AFR96215.1| cleavage and polyadenylation specific protein [Cryptococcus
            neoformans var. grubii H99]
          Length = 1431

 Score = 83.6 bits (205), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 50/182 (27%), Positives = 97/182 (53%), Gaps = 7/182 (3%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
            V   +F++    + F+ SD++ ++ +  + P   +S  G RL+ KT++H G    T  K+
Sbjct: 1253 VVTADFLVHDGQVTFISSDRNGDMRMLDFDPTDPDSLNGERLMLKTEYHAGSAA-TVSKV 1311

Query: 131  RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 190
              +     +    +++ +  YA+ DGAL   + + +  ++RL ++ + +V +  H  GLN
Sbjct: 1312 IARRKTAEEEFAPQTQII--YATADGALTTVVSVKDARFKRLQLVSDQLVRNAQHVAGLN 1369

Query: 191  PRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIE 250
            PRAFRT +         S+GI+DG L+ +F    +G + E+ ++IG+   D +    D++
Sbjct: 1370 PRAFRTVRND-LLPRPLSKGILDGQLLNQFALQPIGRQKEMMRQIGT---DAVTVASDLQ 1425

Query: 251  AL 252
            AL
Sbjct: 1426 AL 1427


>gi|326477251|gb|EGE01261.1| protein kinase subdomain-containing protein [Trichophyton equinum CBS
            127.97]
          Length = 1267

 Score = 83.2 bits (204), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 52/188 (27%), Positives = 91/188 (48%), Gaps = 16/188 (8%)

Query: 70   GVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFK 129
             V   +F+ + + +  +++D D N+ +  Y PE   S+ G RL+ ++ FH G   +T   
Sbjct: 1075 AVVDADFLPDGNKLYILVADDDCNLHVLQYDPEDPSSSKGDRLLHRSVFHTGHFASTMTL 1134

Query: 130  I---RCKPSPISDA--------PGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNV 178
            +      PS   D         P ++ + L  + +  G++    PL E +YRRLL LQ+ 
Sbjct: 1135 LPHGAYTPSAPVDEDAMDTDSLPPSKYQILMTFQT--GSIAVITPLSEDSYRRLLALQSQ 1192

Query: 179  MVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 238
            +V    H   LNPR +R  +  G       RG+IDG+L+ ++L +    + EI  ++G+ 
Sbjct: 1193 LVNALEHPCSLNPRGYRAVESDGMGG---QRGMIDGNLLLRWLDMGAQRKAEIAGRVGAD 1249

Query: 239  HNDILDEL 246
               I  +L
Sbjct: 1250 VGAIRTDL 1257


>gi|326471884|gb|EGD95893.1| protein kinase subdomain-containing protein [Trichophyton tonsurans
            CBS 112818]
          Length = 1398

 Score = 83.2 bits (204), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 52/188 (27%), Positives = 91/188 (48%), Gaps = 16/188 (8%)

Query: 70   GVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFK 129
             V   +F+ + + +  +++D D N+ +  Y PE   S+ G RL+ ++ FH G   +T   
Sbjct: 1206 AVVDADFLPDGNKLYILVADDDCNLHVLQYDPEDPSSSKGDRLLHRSVFHTGHFASTMTL 1265

Query: 130  I---RCKPSPISDA--------PGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNV 178
            +      PS   D         P ++ + L  + +  G++    PL E +YRRLL LQ+ 
Sbjct: 1266 LPHGAYTPSAPVDEDAMDTDSLPPSKYQILMTFQT--GSIAVITPLSEDSYRRLLALQSQ 1323

Query: 179  MVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 238
            +V    H   LNPR +R  +  G       RG+IDG+L+ ++L +    + EI  ++G+ 
Sbjct: 1324 LVNALEHPCSLNPRGYRAVESDGMGG---QRGMIDGNLLLRWLDMGAQRKAEIAGRVGAD 1380

Query: 239  HNDILDEL 246
               I  +L
Sbjct: 1381 VGAIRTDL 1388


>gi|301630307|ref|XP_002944263.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1-like [Xenopus (Silurana) tropicalis]
          Length = 92

 Score = 82.8 bits (203), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 43/93 (46%), Positives = 59/93 (63%), Gaps = 1/93 (1%)

Query: 164 LPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQL 223
           + EK YRRLLMLQN + T   H  GLNPRAFR          NP R ++DG L+ ++L L
Sbjct: 1   MQEKTYRRLLMLQNAL-TVLPHHAGLNPRAFRMLNSSRRMLQNPVRNVLDGELLNRYLYL 59

Query: 224 SLGERLEICKKIGSKHNDILDELYDIEALSSHF 256
           S  ER E+ +KIG+  + ILD+L +I+ ++S F
Sbjct: 60  SNMERSELARKIGTTTDIILDDLLEIDRVTSLF 92


>gi|321260384|ref|XP_003194912.1| cleavage and polyadenylation specific protein [Cryptococcus gattii
            WM276]
 gi|317461384|gb|ADV23125.1| cleavage and polyadenylation specific protein, putative [Cryptococcus
            gattii WM276]
          Length = 1431

 Score = 82.8 bits (203), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 49/182 (26%), Positives = 97/182 (53%), Gaps = 7/182 (3%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
            V   +F++    + F+ SD++ ++ +  + P   +S  G RL+ +T++H G    T  K+
Sbjct: 1253 VVTADFLVHDGQVTFISSDRNGDMRMLDFDPTDPDSLNGERLMLRTEYHAGSAA-TVSKV 1311

Query: 131  RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 190
              +     +    +++ +  YA+ DGAL   + + +  ++RL ++ + +V +  H  GLN
Sbjct: 1312 IARRKTTEEEFAPQTQII--YATADGALTTVVSVKDARFKRLQLVSDQLVRNAQHVAGLN 1369

Query: 191  PRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIE 250
            PRAFRT +         S+GI+DG L+ +F    +G + E+ ++IG+   D +    D++
Sbjct: 1370 PRAFRTVRND-LLPRPLSKGILDGQLLNQFALQPIGRQKEMMRQIGT---DAVTVASDLQ 1425

Query: 251  AL 252
            AL
Sbjct: 1426 AL 1427


>gi|58268668|ref|XP_571490.1| cleavage and polyadenylation specific protein [Cryptococcus
            neoformans var. neoformans JEC21]
 gi|134113364|ref|XP_774707.1| hypothetical protein CNBF3860 [Cryptococcus neoformans var.
            neoformans B-3501A]
 gi|338817789|sp|P0CM63.1|CFT1_CRYNB RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two protein
            1
 gi|338817790|sp|P0CM62.1|CFT1_CRYNJ RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two protein
            1
 gi|50257351|gb|EAL20060.1| hypothetical protein CNBF3860 [Cryptococcus neoformans var.
            neoformans B-3501A]
 gi|57227725|gb|AAW44183.1| cleavage and polyadenylation specific protein, putative [Cryptococcus
            neoformans var. neoformans JEC21]
          Length = 1431

 Score = 82.8 bits (203), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 49/182 (26%), Positives = 97/182 (53%), Gaps = 7/182 (3%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
            V   +F++    + F+ SD++ ++ +  + P   +S  G RL+ +T++H G    T  K+
Sbjct: 1253 VVTADFLVHDGQVTFISSDRNGDMRMLDFDPTDPDSLNGERLMLRTEYHAGSAA-TVSKV 1311

Query: 131  RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 190
              +     +    +++ +  YA+ DGAL   + + +  ++RL ++ + +V +  H  GLN
Sbjct: 1312 IARRKTAEEEFAPQTQII--YATADGALTTVVSVKDARFKRLQLVSDQLVRNAQHVAGLN 1369

Query: 191  PRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIE 250
            PRAFRT +         S+GI+DG L+ +F    +G + E+ ++IG+   D +    D++
Sbjct: 1370 PRAFRTVRND-LLPRPLSKGILDGQLLNQFALQPIGRQKEMMRQIGT---DAVTVASDLQ 1425

Query: 251  AL 252
            AL
Sbjct: 1426 AL 1427


>gi|403411348|emb|CCL98048.1| predicted protein [Fibroporia radiculosa]
          Length = 1437

 Score = 82.8 bits (203), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 51/176 (28%), Positives = 86/176 (48%), Gaps = 7/176 (3%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
            V   +     + +  ++ D+D  + L  Y P   ES GG  L+++T+FH      T   I
Sbjct: 1260 VTCADLFFAENRVSLLVCDEDGVIRLLEYDPHDPESRGGQHLLRRTEFHGQTEYRTSVLI 1319

Query: 131  RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 190
              +     D P A+        S DG+L  F  + E  ++ L +LQ  +  +  H  GLN
Sbjct: 1320 ARRKDKDIDIPQAK----LVCGSTDGSLVSFTFVEEAAFKGLHLLQGQLTRNVQHVAGLN 1375

Query: 191  PRAFRTYKGKGYYAGNP-SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDE 245
            PRAFR  +    Y   P S+GI+DG+L+  F +L +  + E+ ++IG++   +L +
Sbjct: 1376 PRAFRIVRND--YVSRPLSKGILDGNLLTTFEELPIARQNEMTRQIGTERATVLKD 1429


>gi|255948500|ref|XP_002565017.1| Pc22g10080 [Penicillium chrysogenum Wisconsin 54-1255]
 gi|211592034|emb|CAP98296.1| Pc22g10080 [Penicillium chrysogenum Wisconsin 54-1255]
          Length = 1392

 Score = 82.8 bits (203), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 52/188 (27%), Positives = 93/188 (49%), Gaps = 13/188 (6%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
            V A +F+ + + +  +++D D N+ +  Y PE  +S+ G RL+ ++ F+ G   ++   +
Sbjct: 1201 VVAADFLPDGNKLYMLVADSDCNLHVLQYDPEDPKSSNGDRLLSRSKFYTGNFASSVTLL 1260

Query: 131  RCKPSPIS---------DAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVT 181
                             D     +R     AS +G+L     + E++YRRL  LQ+ ++ 
Sbjct: 1261 PRTAVSSERTESSEEGMDLDETFARHQVLIASQNGSLALVTSVAEESYRRLSALQSQLIN 1320

Query: 182  HTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHND 241
               H  GLNPRAFR  +  G  AG   RG++DG+L+  +L +    + EI  ++G+   +
Sbjct: 1321 TVDHPAGLNPRAFRAIESDG-AAG---RGMVDGNLLRLWLNMGKQRQTEIAGRVGATEWE 1376

Query: 242  ILDELYDI 249
            I  +L  I
Sbjct: 1377 IKADLETI 1384


>gi|392572878|gb|EIW66021.1| hypothetical protein TREMEDRAFT_70300 [Tremella mesenterica DSM 1558]
          Length = 1408

 Score = 82.0 bits (201), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 48/180 (26%), Positives = 92/180 (51%), Gaps = 4/180 (2%)

Query: 67   LPRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNT 126
            +P  V + +F++    + F+  D+  ++ +  + P   ES  G RLI +T++H G  V T
Sbjct: 1226 MPISVTSADFLVHDGHVTFLTYDRSGDIRMVDFDPANPESINGERLIVRTEYHGGSPV-T 1284

Query: 127  FFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHT 186
               +  +   + +    +++ +  +A  DG++  F+      +RRL  + + ++ +  H 
Sbjct: 1285 VSTMIARRRGVEEEFAPQTQIICAHA--DGSISTFVSTKPARFRRLHFVSDQLIRNAQHV 1342

Query: 187  GGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEL 246
             GLNPRAFRT +     A   SRGI+DG L+ +F    +  + E+ K+IG+    +  +L
Sbjct: 1343 AGLNPRAFRTVRND-LVAKPLSRGILDGELLGRFAIQPIDRQREMLKQIGTDGGTVASDL 1401


>gi|50288865|ref|XP_446862.1| hypothetical protein [Candida glabrata CBS 138]
 gi|74609915|sp|Q6FSD2.1|CFT1_CANGA RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two protein
            1
 gi|49526171|emb|CAG59795.1| unnamed protein product [Candida glabrata]
          Length = 1361

 Score = 81.6 bits (200), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 50/181 (27%), Positives = 93/181 (51%), Gaps = 3/181 (1%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
            V + EF++ +  + F+++D+D  + +  Y P+   +  G RL+  + F+L    N    +
Sbjct: 1171 VISVEFLVNNGDIYFLVTDRDSIMHVLKYAPDQPNTLSGQRLVHCSSFNLHSLNNCTMLL 1230

Query: 131  RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 190
                    D   +RS F T  A +DG++   +P+ E+ YRRL  +Q  ++       GLN
Sbjct: 1231 PKNDEFPRDQRYSRS-FQTITAQVDGSISKIVPVKEETYRRLYFIQQQIIDKEPQLAGLN 1289

Query: 191  PRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHN-DILDELYDI 249
            PR  R    K Y+ G+  R ++D +++ +F  +S+  R  I +K+G   N ++  +L D+
Sbjct: 1290 PRMERQ-DNKYYHLGHSLRPMLDFNIIKRFKDMSMNRRSHIVQKLGKNSNLEVWRDLIDL 1348

Query: 250  E 250
            E
Sbjct: 1349 E 1349


>gi|367001853|ref|XP_003685661.1| hypothetical protein TPHA_0E01320 [Tetrapisispora phaffii CBS 4417]
 gi|357523960|emb|CCE63227.1| hypothetical protein TPHA_0E01320 [Tetrapisispora phaffii CBS 4417]
          Length = 1357

 Score = 81.6 bits (200), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 48/164 (29%), Positives = 83/164 (50%), Gaps = 1/164 (0%)

Query: 73   ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC 132
            A +FV+ +  + F++SD D  + +  Y P+   S  G RL   + F++     +   +  
Sbjct: 1170 ALDFVVRNGEIYFVVSDTDDILHILKYSPDEPNSLSGQRLAHYSSFNIHSTNTSMHLLPA 1229

Query: 133  KPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPR 192
                I +     S F T  A+LDG++   LPL E ++RRL ++Q  ++    H  GLNPR
Sbjct: 1230 NDEFIENKGNGSSIFQTIGANLDGSIFKILPLSEDSFRRLYVIQQQIIDTEVHAAGLNPR 1289

Query: 193  AFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 236
              R    + Y   N +R ++D +L+ ++  LS+ +R  I +K G
Sbjct: 1290 MER-LSNEYYQLTNVTRPLLDFNLIRRYSNLSIKKRKSIAQKAG 1332


>gi|401889164|gb|EJT53104.1| cleavage and polyadenylation specific protein [Trichosporon asahii
            var. asahii CBS 2479]
          Length = 1358

 Score = 81.6 bits (200), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 53/179 (29%), Positives = 95/179 (53%), Gaps = 8/179 (4%)

Query: 73   ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI-R 131
            A +F++    + F+ +D+  N+ L  + P   +S  G +L+ +T+F  G  V     I R
Sbjct: 1182 AADFLVHEGQVTFITNDQHGNMRLVDFDPANPDSLNGEKLLTQTEFGTGCPVTASCMIAR 1241

Query: 132  CKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNP 191
             K +    AP  +S+ +  YA+ DGA+   + + E  ++RL ++Q+ +V +  H  GLNP
Sbjct: 1242 RKTAEEEFAP--QSQLI--YATADGAITSVVAVKEARFKRLQLVQDQLVRNAQHVAGLNP 1297

Query: 192  RAFRTYKGKGYYAGNP-SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI 249
            RAFRT +        P +RG++DG L+  F    L  + E+ ++IG+    +  +LY +
Sbjct: 1298 RAFRTVRND--LVPRPLARGVLDGGLLAHFALQPLRRQREMMRQIGTDAVTVGSDLYTL 1354


>gi|406699110|gb|EKD02327.1| cleavage and polyadenylation specific protein [Trichosporon asahii
            var. asahii CBS 8904]
          Length = 1339

 Score = 81.3 bits (199), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 53/179 (29%), Positives = 95/179 (53%), Gaps = 8/179 (4%)

Query: 73   ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI-R 131
            A +F++    + F+ +D+  N+ L  + P   +S  G +L+ +T+F  G  V     I R
Sbjct: 1163 AADFLVHEGQVTFITNDQHGNMRLVDFDPANPDSLNGEKLLTQTEFGTGCPVTASCMIAR 1222

Query: 132  CKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNP 191
             K +    AP  +S+ +  YA+ DGA+   + + E  ++RL ++Q+ +V +  H  GLNP
Sbjct: 1223 RKTAEEEFAP--QSQLI--YATADGAITSVVAVKEARFKRLQLVQDQLVRNAQHVAGLNP 1278

Query: 192  RAFRTYKGKGYYAGNP-SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI 249
            RAFRT +        P +RG++DG L+  F    L  + E+ ++IG+    +  +LY +
Sbjct: 1279 RAFRTVRND--LVPRPLARGVLDGGLLAHFALQPLRRQREMMRQIGTDAVTVGSDLYTL 1335


>gi|452979579|gb|EME79341.1| hypothetical protein MYCFIDRAFT_104419, partial [Pseudocercospora
            fijiensis CIRAD86]
          Length = 1342

 Score = 81.3 bits (199), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 55/193 (28%), Positives = 90/193 (46%), Gaps = 33/193 (17%)

Query: 65   PGLPRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLG--- 121
            P +   V + EF+    ++  ++ D D ++ +  Y PE  +S  G RL+ ++ FH+G   
Sbjct: 1145 PKMHMEVMSAEFLPFDGALYIVVLDADCDMHVLQYDPENPKSLNGMRLLHRSTFHIGHFT 1204

Query: 122  ----------------QH--VNTFFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLP 163
                            QH  +N   K   KP P+          LT  +S  GA+G   P
Sbjct: 1205 TNSMLLPSTLASFAAQQHEMMNGGSKAEVKPDPLQ-------HVLT--SSTSGAIGLITP 1255

Query: 164  LPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQL 223
            L E+ YRRL  LQ  + +   H  GLNPRA+R+ + + +     +RG++DG LV +  +L
Sbjct: 1256 LDEQAYRRLSALQTHLTSILEHAAGLNPRAYRSIESESFGG---ARGVVDGLLVRRIHEL 1312

Query: 224  SLGERLEICKKIG 236
                R ++  + G
Sbjct: 1313 GAARRADVLGRAG 1325


>gi|452001482|gb|EMD93941.1| hypothetical protein COCHEDRAFT_1129958 [Cochliobolus heterostrophus
            C5]
          Length = 1385

 Score = 81.3 bits (199), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 51/160 (31%), Positives = 76/160 (47%), Gaps = 6/160 (3%)

Query: 73   ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI-- 130
            A +F+     +  +++D D N+ +  + P+  +S  G RL+ K+ FH G    T   +  
Sbjct: 1194 AIDFIPFEEQLHLLVADADMNLQVLQFDPDNPKSEAGSRLLHKSTFHTGHFPATLHVVHS 1253

Query: 131  RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 190
            R K    SD    +        S  G L    PL E  YRRL  L   +      T GLN
Sbjct: 1254 RLKMPSASDFAATQPLHQILCTSQSGTLALVTPLSEDTYRRLSNLSAYLSNTLDATAGLN 1313

Query: 191  PRAFRTYKGK--GYYAGNPSRGIIDGSLVWKFLQLSLGER 228
            PRAFR       G+ AG  +RG++DG+L+ ++ +  LGER
Sbjct: 1314 PRAFRASDTPDGGWDAGTGARGMLDGNLLMRWGE--LGER 1351


>gi|395324102|gb|EJF56549.1| hypothetical protein DICSQDRAFT_93527 [Dichomitus squalens LYAD-421
            SS1]
          Length = 1433

 Score = 81.3 bits (199), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 46/164 (28%), Positives = 85/164 (51%), Gaps = 6/164 (3%)

Query: 83   MGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPG 142
            +  +  D++  V L+ Y P   ES GG  L+++T+FH      +   +  +P    D   
Sbjct: 1267 LSIVTCDEEGVVRLYAYDPHDPESKGGQHLLRRTEFHGQTEYRSSLLVARRPK-AGDPEI 1325

Query: 143  ARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGY 202
             ++R +    S+DG+L     + E  ++RL +LQ  ++    H   LNP+AFR  + +  
Sbjct: 1326 PQARLIC--GSVDGSLTTLTYVDENAFKRLHLLQGQLIRTVQHVAALNPKAFRMVRNE-- 1381

Query: 203  YAGNP-SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDE 245
            Y   P S+G++DG+L+  F  L +G + E+ ++IG+    +L +
Sbjct: 1382 YVSRPLSKGVLDGNLLATFEDLPIGRQNEVTRQIGTDRATVLKD 1425


>gi|336388105|gb|EGO29249.1| hypothetical protein SERLADRAFT_445076 [Serpula lacrymans var.
            lacrymans S7.9]
          Length = 1424

 Score = 80.1 bits (196), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 48/180 (26%), Positives = 90/180 (50%), Gaps = 7/180 (3%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
            V   +F     ++  +  D D  + ++ Y P+  ES  G  L+ +T+FH      +   I
Sbjct: 1247 VTNADFFFTDDTLSIVTEDGDGILRMYAYDPDDPESKNGQHLLCRTEFHNHSECRSSLVI 1306

Query: 131  RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 190
              +    S  P A+   L+ ++  DG+L    P+ + +++RL +LQ  +  +  H  GLN
Sbjct: 1307 ARRTKEESVLPQAK--ILSAFS--DGSLSSLTPVDDASFKRLQLLQGQLTRNIQHVAGLN 1362

Query: 191  PRAFRTYKGKGYYAGNP-SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI 249
            PRA+R  +    +   P S+ I+DG L+  F  L +  + E+ K+IG++ N +L +  ++
Sbjct: 1363 PRAYRIVRND--FVSKPLSKDILDGQLLSAFESLPISRQNEMTKQIGTERNIVLHDWMEL 1420


>gi|425765419|gb|EKV04111.1| Cleavage and polyadenylation specificity factor subunit A, putative
            [Penicillium digitatum Pd1]
 gi|425767100|gb|EKV05682.1| Cleavage and polyadenylation specificity factor subunit A, putative
            [Penicillium digitatum PHI26]
          Length = 1271

 Score = 80.1 bits (196), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 50/188 (26%), Positives = 93/188 (49%), Gaps = 13/188 (6%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
            V A +F+ + + +  +++D D N+ +  Y PE  +S+ G RL+ ++ F+ G   ++   +
Sbjct: 1080 VVAADFLPDGNKLYMLVADSDCNLHVLQYDPEDPKSSNGDRLLSRSKFYTGNFASSVTLL 1139

Query: 131  ---------RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVT 181
                             D     +++    AS +G+L     + E++YRRL  LQ+ ++ 
Sbjct: 1140 PRTAVSSELTESSEEAMDVDETFAKYQVLIASQNGSLALVTSVAEESYRRLSGLQSQLIN 1199

Query: 182  HTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHND 241
               H  GLN RAFR  +  G  AG   RG++DG+L+  +L +    + EI  ++G+   +
Sbjct: 1200 TVDHPAGLNARAFRATESDG-AAG---RGMVDGNLLRLWLNMGKQRQAEIAGRVGATEWE 1255

Query: 242  ILDELYDI 249
            I  +L  I
Sbjct: 1256 IKADLETI 1263


>gi|336375160|gb|EGO03496.1| hypothetical protein SERLA73DRAFT_165174 [Serpula lacrymans var.
            lacrymans S7.3]
          Length = 1428

 Score = 80.1 bits (196), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 48/180 (26%), Positives = 90/180 (50%), Gaps = 7/180 (3%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
            V   +F     ++  +  D D  + ++ Y P+  ES  G  L+ +T+FH      +   I
Sbjct: 1251 VTNADFFFTDDTLSIVTEDGDGILRMYAYDPDDPESKNGQHLLCRTEFHNHSECRSSLVI 1310

Query: 131  RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 190
              +    S  P A+   L+ ++  DG+L    P+ + +++RL +LQ  +  +  H  GLN
Sbjct: 1311 ARRTKEESVLPQAK--ILSAFS--DGSLSSLTPVDDASFKRLQLLQGQLTRNIQHVAGLN 1366

Query: 191  PRAFRTYKGKGYYAGNP-SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI 249
            PRA+R  +    +   P S+ I+DG L+  F  L +  + E+ K+IG++ N +L +  ++
Sbjct: 1367 PRAYRIVRND--FVSKPLSKDILDGQLLSAFESLPISRQNEMTKQIGTERNIVLHDWMEL 1424


>gi|66812672|ref|XP_640515.1| CPSF domain-containing protein [Dictyostelium discoideum AX4]
 gi|60468551|gb|EAL66554.1| CPSF domain-containing protein [Dictyostelium discoideum AX4]
          Length = 1628

 Score = 80.1 bits (196), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 52/187 (27%), Positives = 91/187 (48%), Gaps = 25/187 (13%)

Query: 70   GVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFK 129
             +++ EF++   ++  ++SD DKN++LF ++P+   S  G          + Q +N   K
Sbjct: 1453 NIFSTEFIVNQKTLSILVSDLDKNILLFSFEPQDPSSRSG---------QINQEINGNNK 1503

Query: 130  IRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGL 189
                    +D    +   L  + +LDG L    PL EK Y     +Q+ +  +   T GL
Sbjct: 1504 --------NDNRLPKKEQLVIFGTLDGGLNVLRPLDEKIYLLFYHIQSKLY-YLPQTAGL 1554

Query: 190  NPRAFRTYKG-KGYYAGNPS------RGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
            NP+ +R++K     +  +PS      + I+DG L+ KFL LS  E+  I   I S  ++I
Sbjct: 1555 NPKQYRSFKSFSQNFHFSPSTFHQLPKFILDGDLISKFLSLSQSEKRLISNSINSTSDEI 1614

Query: 243  LDELYDI 249
            ++ L D+
Sbjct: 1615 IESLKDV 1621


>gi|302694047|ref|XP_003036702.1| hypothetical protein SCHCODRAFT_63425 [Schizophyllum commune H4-8]
 gi|300110399|gb|EFJ01800.1| hypothetical protein SCHCODRAFT_63425 [Schizophyllum commune H4-8]
          Length = 1396

 Score = 79.7 bits (195), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 48/172 (27%), Positives = 87/172 (50%), Gaps = 11/172 (6%)

Query: 75   EFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDF--HLGQHVNTFFKIRC 132
            +F     ++  + +D++  + ++ Y PEA +S+ G  L+K+T+F  H     +T    R 
Sbjct: 1223 DFFFAEEALAIVTTDENGVMSMYNYDPEAPDSHDGRLLLKQTEFNLHTDFRTSTLIARRT 1282

Query: 133  KPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPR 192
            K  PI        + +  +   DG L    P+P+   +RL  LQ  +  +  H  GLNP+
Sbjct: 1283 KDDPII------PQGILIFGGTDGTLSCLTPVPDDAAKRLQPLQLQLTRNMQHVAGLNPK 1336

Query: 193  AFRTYKGKGYYAGNP-SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDIL 243
            A R  + +  +   P S+GI+DG+L+  F  L +  + E+ ++IG++   IL
Sbjct: 1337 ALRIVRNE--HVSRPLSKGILDGNLIAYFEHLPITRQDEMTRQIGTERATIL 1386


>gi|258575565|ref|XP_002541964.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
 gi|237902230|gb|EEP76631.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
          Length = 1376

 Score = 79.3 bits (194), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 49/167 (29%), Positives = 81/167 (48%), Gaps = 17/167 (10%)

Query: 93   NVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGAR-------- 144
             VV   + P+   S+ G RL+ ++ FH G  ++T   I   P   S   GA         
Sbjct: 1210 QVVAADFLPDDPSSSKGDRLLHRSSFHTGHFISTLTLI---PQYTSSGTGASEDNMDVDY 1266

Query: 145  --SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGY 202
              + +     S  G++G   PL E+ YRRL  LQ+ +V    H  GLNP+A+R  +  G+
Sbjct: 1267 MPAGYQVVVTSQSGSVGVITPLTEETYRRLSALQSQLVMSMEHPCGLNPKAYRAVESDGF 1326

Query: 203  YAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI 249
                  RG++DG+L+ ++L + +  + EI  ++G+    I  +L  I
Sbjct: 1327 SG----RGLVDGNLLLRWLDMGVQRKAEIAGRVGADLQSIRADLERI 1369


>gi|255718033|ref|XP_002555297.1| KLTH0G05984p [Lachancea thermotolerans]
 gi|238936681|emb|CAR24860.1| KLTH0G05984p [Lachancea thermotolerans CBS 6340]
          Length = 1307

 Score = 79.0 bits (193), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 56/183 (30%), Positives = 93/183 (50%), Gaps = 8/183 (4%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
            V + EF++ + S+ F++SD+   + +  Y P+   S  G +L+  T F+L    NT  K+
Sbjct: 1123 VMSLEFLVNNGSIYFLVSDRSNILHILKYAPDEPNSLSGQKLVHCTSFNL-HSTNTCMKL 1181

Query: 131  RCKPS--PISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGG 188
              K    P    P A   F    A  DG+L   +PL E +YRRL M+Q  ++    H  G
Sbjct: 1182 LLKNDEFPTLGEPPA---FQAIGAQTDGSLFNVVPLSESSYRRLYMVQQQLIEKDVHLCG 1238

Query: 189  LNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHN-DILDELY 247
            LNP+  R  +   Y  G+  R ++D +++  F  L L +R +I  K G + + +I  +L 
Sbjct: 1239 LNPKMER-LQNDFYQLGHLMRPMLDFTVIKSFATLPLNKRKQIAAKAGRQADFEIWRDLI 1297

Query: 248  DIE 250
            ++E
Sbjct: 1298 NVE 1300


>gi|389740693|gb|EIM81883.1| hypothetical protein STEHIDRAFT_65512 [Stereum hirsutum FP-91666 SS1]
          Length = 1438

 Score = 79.0 bits (193), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 48/175 (27%), Positives = 89/175 (50%), Gaps = 8/175 (4%)

Query: 73   ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQ--HVNTFFKI 130
            + +F   +  +  +  D++  + ++ Y P   ES  G RL+   +FH GQ  + ++    
Sbjct: 1262 SADFFFANQQLSIVTCDEEGILRMYHYNPHDPESKNGQRLLCHAEFH-GQIEYRSSLTIA 1320

Query: 131  RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 190
            R    P ++ P A+        S DG+L   +P+ E  ++RL +LQ  +  +  HT  LN
Sbjct: 1321 RRTKGPDTEIPQAK----LICGSPDGSLSALVPVEEAAFKRLHLLQGQLTRNVQHTAALN 1376

Query: 191  PRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDE 245
            PRAFR  + + Y +    +G +DG L+  F  L +  ++EI ++IG++   +L +
Sbjct: 1377 PRAFRAVRNE-YVSKTLHKGFLDGLLLRSFEDLPVSRQIEITRQIGTERRLVLKD 1430


>gi|393245434|gb|EJD52944.1| hypothetical protein AURDEDRAFT_81080 [Auricularia delicata TFB-10046
            SS5]
          Length = 1422

 Score = 78.6 bits (192), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 51/176 (28%), Positives = 87/176 (49%), Gaps = 4/176 (2%)

Query: 70   GVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFK 129
             + + EF      M  + +D+   + +F Y P   E+  G +L+ +T+F+          
Sbjct: 1243 ALTSAEFFFGFGEMTIVSTDEQNVLRIFRYDPMHAEAQDGQKLLCQTEFNTQSDARGTTT 1302

Query: 130  IRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGL 189
            I  + S   D    +S+ +  Y   DG+L   LP+ E  ++RL +LQ  M  +  H  GL
Sbjct: 1303 ILRRTSD-EDILLPQSKIM--YCGTDGSLSALLPVEEHVFKRLHLLQGQMTRNIQHVAGL 1359

Query: 190  NPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDE 245
            +P+AFR  +   + A   +RGI+D +L+ KF +L L  ++E  K+IG     IL +
Sbjct: 1360 HPKAFRVVRND-FTARPLARGILDSNLLAKFEELPLSRQVEFTKQIGQSREVILGD 1414


>gi|226290902|gb|EEH46330.1| cleavage and polyadenylation specificity factor subunit A
            [Paracoccidioides brasiliensis Pb18]
          Length = 1343

 Score = 78.2 bits (191), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 50/157 (31%), Positives = 76/157 (48%), Gaps = 18/157 (11%)

Query: 106  SNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARSRFL----------TWYASLD 155
            S  G RL+ ++ FH GQ  +T   +  + S +S  P A +  +              S  
Sbjct: 1187 SAKGDRLLHRSTFHTGQFASTL-TLLPRTSVLSQGPEAEANAMDLDSSGPLHQVLVTSET 1245

Query: 156  GALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGS 215
            G++    P+ E  YRRL  LQ+ M+    H  GLNPRAFR  +  G       RG++DG 
Sbjct: 1246 GSIALITPVSEMAYRRLSALQSQMINTLEHPCGLNPRAFRAVESDGIGG----RGMVDGD 1301

Query: 216  LVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEAL 252
            LV K+L L    + EI  ++G+   D+ +   D+EA+
Sbjct: 1302 LVQKWLDLGTQRKAEIASRVGA---DVWEIRADLEAI 1335


>gi|156847699|ref|XP_001646733.1| hypothetical protein Kpol_1023p44 [Vanderwaltozyma polyspora DSM
            70294]
 gi|156117413|gb|EDO18875.1| hypothetical protein Kpol_1023p44 [Vanderwaltozyma polyspora DSM
            70294]
          Length = 1337

 Score = 77.8 bits (190), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 47/179 (26%), Positives = 95/179 (53%), Gaps = 3/179 (1%)

Query: 73   ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC 132
            A EF++ +  + F++SD++  + +  Y P+   S  G RL   + F++    NT   +  
Sbjct: 1151 ALEFLVNNGDIFFIVSDRNDILHVLKYAPDEPNSLSGQRLAHYSSFNI-HSTNTSMILLP 1209

Query: 133  KPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPR 192
              +    +P  ++ F +  + +DG++   +PL E ++RRL ++Q  ++      GGLNPR
Sbjct: 1210 SNNEFQSSPNGQATFQSVGSCVDGSIFKVIPLDEDSFRRLYVIQQQVIDTEIQAGGLNPR 1269

Query: 193  AFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK-HNDILDELYDIE 250
              R    + Y   +  R ++D +++ +F  LS+ +R +I +K G + H D+  ++ +IE
Sbjct: 1270 MER-LSNEYYQLVHLMRPMLDFNIIRRFSNLSITKRTKIAQKAGRRAHFDVWRDMINIE 1327


>gi|358056450|dbj|GAA97624.1| hypothetical protein E5Q_04302 [Mixia osmundae IAM 14324]
          Length = 1305

 Score = 77.8 bits (190), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 51/180 (28%), Positives = 89/180 (49%), Gaps = 5/180 (2%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
            VY  +F+++ + +G +   +   +    Y P   ES  G RL+ +T++HLG+ +      
Sbjct: 1127 VYCVDFLVQENRLGLLSVSRKGLLRQLEYNPGNAESRAGERLLDRTEYHLGKQIIDSLSF 1186

Query: 131  RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 190
              + S  +D    +S  +   A  DG+L +  P+ E  YRRL +L+  +     H  GLN
Sbjct: 1187 AKRLS--TDEDLRQSGVMLVGA--DGSLTWVTPVREVVYRRLALLERQLHRQLPHFAGLN 1242

Query: 191  PRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIE 250
            PRAFRT +   YY+   +RG++DG L+  +  L    +  +   I S  + +   L ++E
Sbjct: 1243 PRAFRTARND-YYSRPLARGMLDGDLLAIYANLHASRQQSLASHINSDPDTLSVNLGNLE 1301


>gi|367014525|ref|XP_003681762.1| hypothetical protein TDEL_0E03080 [Torulaspora delbrueckii]
 gi|359749423|emb|CCE92551.1| hypothetical protein TDEL_0E03080 [Torulaspora delbrueckii]
          Length = 1327

 Score = 77.4 bits (189), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 53/179 (29%), Positives = 87/179 (48%), Gaps = 3/179 (1%)

Query: 73   ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC 132
            + EF++    + F I+D D  + +F Y P+   S  G RL+  T F+L    NT   +  
Sbjct: 1143 SVEFLVNGGDIFFAITDTDDILHVFKYAPDEPNSLSGQRLLHCTSFNL-HSTNTCMALLP 1201

Query: 133  KPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPR 192
            K      A      F      +DG++   LPL E  YRRL ++Q  +       GGLNPR
Sbjct: 1202 KNEEFEPAQANMKNFQAIGGQVDGSVFKLLPLREDVYRRLYVVQQQITEKELQLGGLNPR 1261

Query: 193  AFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK-HNDILDELYDIE 250
              R    + Y   +  R ++D +++ +F +LS   R +I +K+G + H +I  +L ++E
Sbjct: 1262 MER-LSNEHYKTTHVLRPMLDFNVIQRFKRLSTDRRKQISQKVGKRAHFEIWRDLINVE 1319


>gi|407929511|gb|EKG22329.1| Cleavage/polyadenylation specificity factor A subunit [Macrophomina
            phaseolina MS6]
          Length = 1418

 Score = 76.6 bits (187), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 58/213 (27%), Positives = 88/213 (41%), Gaps = 41/213 (19%)

Query: 51   VLFVSDRSKRANEQPGLPRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGH 110
            +LF   RSK           V A +F+     +  M++D D N+    Y P+  +S  G 
Sbjct: 1199 ILFGKSRSKME---------VMAADFLPHDKQLYLMVADGDCNLHALQYDPDHPKSLSGQ 1249

Query: 111  RLIKKT---------------------------DFHLGQHVNTFFKIRCKPSPISDAPGA 143
            RL+ K+                           D H   HV+           I  AP  
Sbjct: 1250 RLLHKSTFHTGHFTTTMTLLPSSLSPTVSPSSADEHANGHVSPSPSPENDAMDIDPAPAG 1309

Query: 144  RSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYY 203
              + +    +  G+L    PL E+ YRRL  LQ  ++    H  GLNPRA+R  + +G+ 
Sbjct: 1310 TVQHI-LLTTQTGSLALLTPLSEQQYRRLGALQTYLIGALEHWCGLNPRAYRAVESEGFG 1368

Query: 204  AGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 236
                SRGI+DG+L+ ++ +L    R E   K+G
Sbjct: 1369 ----SRGIVDGALLARWCELGSQRRAEGAAKVG 1397


>gi|392585051|gb|EIW74392.1| hypothetical protein CONPUDRAFT_133073 [Coniophora puteana RWD-64-598
            SS2]
          Length = 1490

 Score = 76.6 bits (187), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 46/174 (26%), Positives = 87/174 (50%), Gaps = 7/174 (4%)

Query: 73   ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC 132
            + +F   +  +  +  D++  +  + Y P   ES  G +L+  T+FH  +  +T   I  
Sbjct: 1315 SVDFFFANGELSIVTEDEEGVLRAYEYDPNDPESRSGQQLLCHTEFHGHKECSTTLTIAR 1374

Query: 133  KPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPR 192
            +     + P  +++ ++ +   DG+L    P+ E  ++RL +LQ  +  +  H  GLNPR
Sbjct: 1375 RTKTEHEIP--QAKLISGFG--DGSLSALTPVDEAAFKRLQLLQGQLTRNVQHIAGLNPR 1430

Query: 193  AFRTYKGKGYYAGNP-SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDE 245
            AFR  + +      P S+GI+DG L+  F    +  + E+ ++IG++   IL E
Sbjct: 1431 AFRIVRNE--TVSKPLSKGILDGQLLSSFEAQGITRQGEMTRQIGTERTTILQE 1482


>gi|363750592|ref|XP_003645513.1| hypothetical protein Ecym_3197 [Eremothecium cymbalariae DBVPG#7215]
 gi|356889147|gb|AET38696.1| Hypothetical protein Ecym_3197 [Eremothecium cymbalariae DBVPG#7215]
          Length = 1318

 Score = 76.6 bits (187), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 53/183 (28%), Positives = 93/183 (50%), Gaps = 7/183 (3%)

Query: 73   ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC 132
            + EF++ +  M F+I+D+ + + +  Y P+   S  G RL+  T F+L   +NT  ++  
Sbjct: 1127 SLEFLVNNGDMYFIITDRSQVMHVLKYAPDEPNSLSGQRLVYCTSFNL-HSINTCMRLIQ 1185

Query: 133  KPSPISDA----PGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGG 188
            K +   D         S F      +DG++   +PL E +YRRL ++Q  ++       G
Sbjct: 1186 KNNEFVDLRRNYGSHMSTFQCIGCHIDGSIFKVVPLTESSYRRLYLVQQQIIDKEVQLCG 1245

Query: 189  LNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK-HNDILDELY 247
            LNPR  R  +   Y  G+  R ++D +++ KF  LS+ +R  +  K G + H ++  +L 
Sbjct: 1246 LNPRMER-LQNPYYQLGHLLRPMLDFTILKKFSTLSISKRRSMASKAGHQAHTEVWRDLI 1304

Query: 248  DIE 250
            DIE
Sbjct: 1305 DIE 1307


>gi|169864473|ref|XP_001838845.1| cleavage factor protein [Coprinopsis cinerea okayama7#130]
 gi|116500065|gb|EAU82960.1| cleavage factor protein [Coprinopsis cinerea okayama7#130]
          Length = 1458

 Score = 76.6 bits (187), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 51/165 (30%), Positives = 87/165 (52%), Gaps = 13/165 (7%)

Query: 83   MGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQ---HVNTFFKIRCKPSPISD 139
            +  ++ D++  + ++ Y P   +S  G  L+ +T++H GQ   H +T    R K  P   
Sbjct: 1293 LALIVGDEEGIMRIYEYNPNDPDSRDGRYLLLRTEYH-GQVPYHTSTTIARRDKEDP--S 1349

Query: 140  APGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKG 199
             P  +S  L    S DG+L   +P+ E  ++RL +LQ  +  +  H  GLNP+AFR  K 
Sbjct: 1350 IP--QSHLL--IGSADGSLSSLVPVDEYAFKRLQLLQGQLTRNIQHVAGLNPKAFRIVKN 1405

Query: 200  KGYYAGNP-SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDIL 243
               Y   P S+GI+DG L+ ++  L +  + E+ K+IG++   +L
Sbjct: 1406 D--YVSKPLSKGILDGQLLAQYESLPIPRQNEMTKQIGTERGVVL 1448


>gi|324518783|gb|ADY47203.1| Cleavage and polyadenylation specificity factor subunit 1 [Ascaris
           suum]
          Length = 108

 Score = 75.9 bits (185), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 40/110 (36%), Positives = 67/110 (60%), Gaps = 7/110 (6%)

Query: 60  RANEQPGLPRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFH 119
           RA  QP +     A +F+I++  M F++SD+  N+ +F Y PEA ES+GG RLI +++ +
Sbjct: 2   RAVVQPPM-----AAQFLIDNRQMAFIMSDEAANIAVFNYLPEALESSGGERLILRSEIN 56

Query: 120 LGQHVNTFFKIRCKPSP--ISDAPGARSRFLTWYASLDGALGFFLPLPEK 167
           +G +VN+F +++   S   + +   + +R    + SLDG+ GF  PL EK
Sbjct: 57  IGTNVNSFMRVKGHISSGFVENEHYSLNRQSVLFCSLDGSFGFVRPLSEK 106


>gi|398397855|ref|XP_003852385.1| hypothetical protein MYCGRDRAFT_100364 [Zymoseptoria tritici IPO323]
 gi|339472266|gb|EGP87361.1| hypothetical protein MYCGRDRAFT_100364 [Zymoseptoria tritici IPO323]
          Length = 1333

 Score = 75.9 bits (185), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 48/184 (26%), Positives = 81/184 (44%), Gaps = 19/184 (10%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
            V   +F+    ++  ++ D D ++ +  Y PE  +S  G RL+ ++ FH+G        +
Sbjct: 1130 VIEADFLPFDGALFLLVLDADADLHVLQYDPENPKSLNGQRLLHRSTFHIGHFPTGSMLL 1189

Query: 131  RCKPSPIS----DAPGARSR-----------FLTWYASLDGALGFFLPLPEKNYRRLLML 175
                +P +    D P   S            F     +  G++G   PL E  YRRL  L
Sbjct: 1190 PSTLAPFTEQARDLPNGDSEDTKQEEVNSPLFHVLTTTSSGSIGLITPLDESTYRRLSAL 1249

Query: 176  QNVMVTHTSHTGGLNPRAFRT---YKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEIC 232
            Q  +     H  GLNPR +RT    K      G  ++G++DGSL+ +  +L    R ++ 
Sbjct: 1250 QGHLTNILEHAAGLNPRMYRTDTEMKATDSEMGG-AKGVVDGSLIRRISELGAARRADVL 1308

Query: 233  KKIG 236
             ++G
Sbjct: 1309 SRVG 1312


>gi|388581811|gb|EIM22118.1| hypothetical protein WALSEDRAFT_28358 [Wallemia sebi CBS 633.66]
          Length = 1259

 Score = 75.5 bits (184), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 67/253 (26%), Positives = 108/253 (42%), Gaps = 25/253 (9%)

Query: 16   GNRPLLLVRTQHELLIYQAFRHPKGTLKLRFKKLK-VLFVSDRSKRAN-----EQP---- 65
            G + L+     +E LI  AF +  GT  +  + LK  + V D+ K        E P    
Sbjct: 1017 GQKVLIRALEINERLISVAF-YDAGTYIVSLEVLKNFILVGDQVKSITFLAFQESPYKLV 1075

Query: 66   ---GLPRGVYAC--EFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHL 120
                  R +  C   F+     + F+ +D   ++ L  Y P    + GG +LI+ T+FH 
Sbjct: 1076 QLSRDARQIETCVSNFLAHEDQISFVSNDIQGDLRLIDYNPFDPTAEGGEKLIRTTEFHK 1135

Query: 121  GQHVNTFFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMV 180
            G             S +   P  R         +DG+L    P+ E  ++ L +LQ  +V
Sbjct: 1136 GSEATC--------SLLLPKPSVRPSSELLLGCVDGSLSCLSPVDEITFKALWLLQGALV 1187

Query: 181  THTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHN 240
                H   LNPRA R  +   Y + + S+GI+DG L+  +  +    ++EI K+IG    
Sbjct: 1188 RQIPHIAALNPRAHRHVRND-YVSRSLSKGILDGLLLSAYQTIDHATQVEIAKRIGYSKA 1246

Query: 241  DILDELYDIEALS 253
            ++L  L +   LS
Sbjct: 1247 ELLGYLRNFSWLS 1259


>gi|448105510|ref|XP_004200513.1| Piso0_003103 [Millerozyma farinosa CBS 7064]
 gi|448108635|ref|XP_004201144.1| Piso0_003103 [Millerozyma farinosa CBS 7064]
 gi|359381935|emb|CCE80772.1| Piso0_003103 [Millerozyma farinosa CBS 7064]
 gi|359382700|emb|CCE80007.1| Piso0_003103 [Millerozyma farinosa CBS 7064]
          Length = 1344

 Score = 75.5 bits (184), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 46/184 (25%), Positives = 87/184 (47%), Gaps = 5/184 (2%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
            V   +F+++   +  + +D +  + +  + PE   S+ G RL+ KT F++      F   
Sbjct: 1160 VSCADFIVKDEEIFILFADNNNVLHVVKFDPEDPLSSNGQRLVHKTSFNINSAATCF--- 1216

Query: 131  RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 190
            R  P    + P   + F +  +++DG+     P+ E  YRR+ +LQ  +     H  GLN
Sbjct: 1217 RTIPKNEENYPSLTTSFQSIGSTIDGSFFTVFPINESTYRRMYILQQQLTDKEFHICGLN 1276

Query: 191  PRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK--HNDILDELYD 248
            PR  R        +   S+ +++  ++ KF+ L+   +     KIGSK  + DI  +L +
Sbjct: 1277 PRLNRFGGLNETNSDANSKPMLEYDVIKKFVNLNSDRKKNFASKIGSKNSYQDIWRDLIE 1336

Query: 249  IEAL 252
             E++
Sbjct: 1337 FESV 1340


>gi|440637976|gb|ELR07895.1| hypothetical protein GMDG_02777 [Geomyces destructans 20631-21]
          Length = 1495

 Score = 75.5 bits (184), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 53/180 (29%), Positives = 85/180 (47%), Gaps = 18/180 (10%)

Query: 85   FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIR-----------CK 133
             +++D D N+ +  + PE  +S  G  L+++T F LG H+ T   +              
Sbjct: 1316 IIVADSDCNLHVLQFDPEHPKSLHGQLLLQRTTFSLGGHMPTTMTLLPLTTTTQTPTPAV 1375

Query: 134  PSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRA 193
             S  S+     S  L   +S  G +    PL E+ YRRL  L N +     H GGLNP+A
Sbjct: 1376 TSTASEPTNPASGLLMTLSS--GVVAILTPLSEQQYRRLNALSNHLSNLLYHPGGLNPKA 1433

Query: 194  FRTYKG--KGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEA 251
             R      +    G P   I+DGS++W++L+L   +R E+  ++G     I ++L +I A
Sbjct: 1434 HRISNTAPEAVIGGRP---IVDGSVLWRWLELGSQKRAEVAGRVGVDGETIREDLQEIAA 1490


>gi|426194401|gb|EKV44332.1| hypothetical protein AGABI2DRAFT_187183 [Agaricus bisporus var.
            bisporus H97]
          Length = 1413

 Score = 74.7 bits (182), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 49/182 (26%), Positives = 93/182 (51%), Gaps = 11/182 (6%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
            V   +F+   + +  +  D++  + ++ Y P+  +S  G  L+ +T+FH  +   T   +
Sbjct: 1235 VTRADFLFSENDLRLVTGDEEGIIRIYEYNPQDPDSREGRHLLLETEFHGQREYRTSVLV 1294

Query: 131  --RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGG 188
              R K     D     SR LT   S DG+L     + E+ ++RL +LQ  ++ +  H   
Sbjct: 1295 AHRIK----EDQSIPNSRLLT--GSADGSLASLTIVEEEAFKRLGLLQGQLMRNIQHMAA 1348

Query: 189  LNPRAFRTYKGKGYYAGNP-SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELY 247
            LNP+AFR  K +  Y   P +RGI+DG+L+ ++  L +  + E  ++IG+   ++L +  
Sbjct: 1349 LNPKAFRIVKNE--YVSKPLTRGILDGNLLGQYESLPINRQSEATQQIGADRVNVLRDWI 1406

Query: 248  DI 249
            ++
Sbjct: 1407 EL 1408


>gi|409076059|gb|EKM76433.1| hypothetical protein AGABI1DRAFT_108759 [Agaricus bisporus var.
            burnettii JB137-S8]
          Length = 1413

 Score = 74.7 bits (182), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 49/182 (26%), Positives = 93/182 (51%), Gaps = 11/182 (6%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
            V   +F+   + +  +  D++  + ++ Y P+  +S  G  L+ +T+FH  +   T   +
Sbjct: 1235 VTRADFLFSENDLRLVTGDEEGIIRIYEYNPQDPDSREGRHLLLETEFHGQREYRTSVLV 1294

Query: 131  --RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGG 188
              R K     D     SR LT   S DG+L     + E+ ++RL +LQ  ++ +  H   
Sbjct: 1295 AHRIK----EDQSIPNSRLLT--GSADGSLASLTIVEEEAFKRLGLLQGQLMRNIQHMAA 1348

Query: 189  LNPRAFRTYKGKGYYAGNP-SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELY 247
            LNP+AFR  K +  Y   P +RGI+DG+L+ ++  L +  + E  ++IG+   ++L +  
Sbjct: 1349 LNPKAFRIVKNE--YVSKPLTRGILDGNLLGQYESLPINRQSEATQQIGADRVNVLRDWI 1406

Query: 248  DI 249
            ++
Sbjct: 1407 EL 1408


>gi|451849663|gb|EMD62966.1| hypothetical protein COCSADRAFT_92785 [Cochliobolus sativus ND90Pr]
          Length = 1405

 Score = 74.7 bits (182), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 55/180 (30%), Positives = 81/180 (45%), Gaps = 26/180 (14%)

Query: 73   ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI-- 130
            A +F+     +  +++D D N+ +  + P+  +S  G RL+ K+ FH G    T   +  
Sbjct: 1194 AVDFIPFEEQLHLLVADADMNLQVLQFDPDNPKSEAGSRLLHKSTFHTGHFPATLHVVHS 1253

Query: 131  RCKPSPISDAPGARSR----FLTWYASLD----------------GALGFFLPLPEKNYR 170
            R K    SD  GA +     F    +S D                G L    PL E  YR
Sbjct: 1254 RLKMPSASDFAGANNTENGDFEMDTSSPDDKATQPLHQILCTTQSGTLALVTPLSEDTYR 1313

Query: 171  RLLMLQNVMVTHTSHTGGLNPRAFRTYKGK--GYYAGNPSRGIIDGSLVWKFLQLSLGER 228
            RL  L   +      T GLNPRAFR       G+ AG  +RG++DG+L+ ++ +  LGER
Sbjct: 1314 RLSNLSAYLSNTLDATAGLNPRAFRASDTPDGGWDAGTGARGMLDGNLLMRWGE--LGER 1371


>gi|392558419|gb|EIW51607.1| hypothetical protein TRAVEDRAFT_176174 [Trametes versicolor FP-101664
            SS1]
          Length = 1431

 Score = 74.7 bits (182), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 45/164 (27%), Positives = 82/164 (50%), Gaps = 6/164 (3%)

Query: 83   MGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPG 142
            +  +  D++  V L+ Y P   ES  G  L+++T+FH      +   +  +P    D   
Sbjct: 1265 LSIVTCDEEGIVRLYAYDPHDPESKSGQHLLRRTEFHGQSEYRSSMLVARRPKN-GDPEI 1323

Query: 143  ARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGY 202
             ++R +    S+DG+L     + E   +RL +LQ  ++    H   LNP+AFR  + +  
Sbjct: 1324 PQARLVC--GSVDGSLSTLTYVDEAASKRLHLLQGQLIRTVQHVAALNPKAFRMVRNE-- 1379

Query: 203  YAGNP-SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDE 245
            Y   P S+GI+DG+L+  F  L +  + E+ ++IG+    +L +
Sbjct: 1380 YVSRPLSKGILDGNLLATFEDLPIARQNEVTRQIGTDRATVLKD 1423


>gi|310789917|gb|EFQ25450.1| CPSF A subunit region [Glomerella graminicola M1.001]
          Length = 1439

 Score = 74.7 bits (182), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 51/187 (27%), Positives = 84/187 (44%), Gaps = 7/187 (3%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLG-QHVNTFFK 129
            V   +F+I    +  ++ DKD  + +  + PE  +S  GH L+ +  F     H  T   
Sbjct: 1246 VLTADFIIAGDELHIVVCDKDGVIHVMQFDPEHPKSLQGHLLLNRASFSAAPNHPTTTLS 1305

Query: 130  IRCKPSPISDAPGARSRFLTWY-ASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGG 188
            +   P+  +     ++   T   AS  GAL    PL E+ YRRL  L N +     H   
Sbjct: 1306 LPRTPASTATTSATKNPPTTLLLASPTGALASLTPLSEQAYRRLTSLANSIAGALPHAAA 1365

Query: 189  LNPRAFRTYKGKGYYAG---NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDE 245
             NP+A R         G   +  R I+DG+L+ ++ +L  G R E+  K G  + D+L+ 
Sbjct: 1366 TNPKAHRLQPLDARTPGVDTSAGRSIVDGALLARWNELGAGRRSEVAGKGG--YGDVLEV 1423

Query: 246  LYDIEAL 252
              ++E +
Sbjct: 1424 RGELEGV 1430


>gi|367052335|ref|XP_003656546.1| hypothetical protein THITE_2121311 [Thielavia terrestris NRRL 8126]
 gi|347003811|gb|AEO70210.1| hypothetical protein THITE_2121311 [Thielavia terrestris NRRL 8126]
          Length = 1460

 Score = 73.9 bits (180), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 52/192 (27%), Positives = 83/192 (43%), Gaps = 26/192 (13%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
            V   +F+ +   + F++SD D  + +  + PE  +S  GH L+ +T F+ G H  T   +
Sbjct: 1246 VLNADFLPDGKELSFVVSDADGYIHILQFDPEHPKSLQGHLLLHRTTFNTGAHHATKSLL 1305

Query: 131  RCKPSPI------------------SD-----APGARSRFLTWYASLDGALGFFLPLPEK 167
                +P                   SD      P A+   +   AS  G L    PL E 
Sbjct: 1306 LPASTPADKEKNDGNAANAQAKAKASDNKQPREPAAQRPHVLLLASPTGVLAALRPLSES 1365

Query: 168  NYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPS---RGIIDGSLVWKFLQLS 224
             YRRL  L   +     H  GLNPR +R    +   AG  +   R I+DG+++ +F +L 
Sbjct: 1366 AYRRLSSLAAQLTNSLPHPAGLNPRGYRAAGAECPPAGVDAGLGRSIVDGTVLERFAELG 1425

Query: 225  LGERLEICKKIG 236
            +  R+E+  + G
Sbjct: 1426 MARRVELAGRAG 1437


>gi|296414526|ref|XP_002836950.1| hypothetical protein [Tuber melanosporum Mel28]
 gi|295632796|emb|CAZ81141.1| unnamed protein product [Tuber melanosporum]
          Length = 1468

 Score = 73.9 bits (180), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 49/204 (24%), Positives = 85/204 (41%), Gaps = 38/204 (18%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
            + + EF+ + + + F++ D + N+    Y PE  +S  G RLI++ DF  G  ++T   +
Sbjct: 1244 IISAEFLPDGNQLYFVVVDAESNIHTLQYDPEHPKSLAGQRLIRRADFFSGHEISTLTML 1303

Query: 131  RCKPSPISDAPGAR---------------------SRFLTWYASLDGALGFFLPLPEKNY 169
               P  +S +  +                        +     +  G+L     +PE  Y
Sbjct: 1304 PFSPYSLSASSNSHLPADATDTSPLHHHHQNQQQQQEYFVLAGTQTGSLAMIRTIPETAY 1363

Query: 170  RRLLMLQNVMVTHTSHTGGLNPRAFRTY-----------------KGKGYYAGNPSRGII 212
            RRL ++Q  +V    H  GLNPR +R                      G   G+  RG++
Sbjct: 1364 RRLNIVQGQIVNGEEHVAGLNPREYRAVVNYSGGGGGGAGGGGWGGSGGGVGGDTMRGVL 1423

Query: 213  DGSLVWKFLQLSLGERLEICKKIG 236
            DG LV +++ L+ G + E+  K G
Sbjct: 1424 DGGLVSRWIGLAEGRKGEVSAKAG 1447


>gi|402219312|gb|EJT99386.1| hypothetical protein DACRYDRAFT_17537 [Dacryopinax sp. DJM-731 SS1]
          Length = 1620

 Score = 73.6 bits (179), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 65/240 (27%), Positives = 112/240 (46%), Gaps = 21/240 (8%)

Query: 26   QHELLIYQAFRHPKGTLKLRFKKLKVLFVSDRSKRA------NEQPGLPRGVYACE--FV 77
            Q E L+  AF    G       +LK L +   +KR+       E P   R  Y  +  F+
Sbjct: 1388 QDERLVGVAFMD-AGVCVTSLTRLKNLLLIGDAKRSVSFVAFQEDPFKLRPTYVTDAAFL 1446

Query: 78   IESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFF-----KIRC 132
             +      + +D +  + LF + P    +  G+ LI +T+F+ GQ  +T       + R 
Sbjct: 1447 FDEGDFSILAADDEGTLRLFEFDPNLTGATHGNPLICETEFN-GQSEHTHILAIAGRGRE 1505

Query: 133  KPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPR 192
             P  +   P A+  F T    +DG LG   P+P++ ++RL +L   ++    H  GLNPR
Sbjct: 1506 DPEEMQ-IPEAQLIFGT----IDGTLGTISPVPDECFKRLQLLSGQLMRSVQHFAGLNPR 1560

Query: 193  AFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEAL 252
            AFRT +     +   ++G++D  L+  F +L +  +  I K+IG+    IL ++  +E +
Sbjct: 1561 AFRTVRND-LLSRPLNKGMLDYDLLHAFRELDIRRQATITKQIGTDTITILRDIRSLEEI 1619


>gi|254580509|ref|XP_002496240.1| ZYRO0C13816p [Zygosaccharomyces rouxii]
 gi|238939131|emb|CAR27307.1| ZYRO0C13816p [Zygosaccharomyces rouxii]
          Length = 1331

 Score = 73.6 bits (179), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 53/179 (29%), Positives = 86/179 (48%), Gaps = 3/179 (1%)

Query: 73   ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC 132
            A EF++    + F ++D    + +F Y P+   S  G RL+  T F+L    NT   +  
Sbjct: 1147 ALEFLVNGGDIFFAVTDTSNILHIFKYAPDEPNSLSGQRLVHCTSFNL-HSTNTCMVLLP 1205

Query: 133  KPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPR 192
            K    S    + S         DG+L   +PL E  YRRL +LQ  +       GGLNPR
Sbjct: 1206 KNEEFSVGEKSLSPVQVVGGQTDGSLFKLVPLREDTYRRLYVLQQQLTEKEVQLGGLNPR 1265

Query: 193  AFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK-HNDILDELYDIE 250
              R    + Y+  +  R +++ +++ +F  LS+ +R +  +K G + H DI  +L +IE
Sbjct: 1266 MER-LSNEYYHLTHAVRPMLEFNVIRRFNTLSVEKRKQTAQKAGRRAHFDIWRDLVNIE 1323


>gi|408396642|gb|EKJ75797.1| hypothetical protein FPSE_03977 [Fusarium pseudograminearum CS3096]
          Length = 1427

 Score = 72.8 bits (177), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 48/169 (28%), Positives = 82/169 (48%), Gaps = 4/169 (2%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
            V   +F+ +   +  + +D D ++ +  + PE  +S  GH L+ +T F +  +  +   +
Sbjct: 1237 VVVADFLPDGDDLAIVAADVDGDLHILEFNPEHPKSLQGHLLLHRTSFSVSPNPPSTTLL 1296

Query: 131  RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 190
              + +P S         +   AS  G L   +PLPE  YRRLL + N ++   +  GGLN
Sbjct: 1297 LPRTTPPSHPTPQDPPHVLLLASSSGHLSSLIPLPETAYRRLLSVTNQLLPALTPHGGLN 1356

Query: 191  PRAFRTYKGK---GYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 236
             +A R   G    G  A    R I+DG+++ ++ +LS  +R EI  K G
Sbjct: 1357 AKAHRLPVGTRTVGVEAAG-GRAIVDGAVLARWAELSAAKRAEIAGKGG 1404


>gi|406865186|gb|EKD18229.1| CPSF A subunit region [Marssonina brunnea f. sp. 'multigermtubi'
            MB_m1]
          Length = 1443

 Score = 72.8 bits (177), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 55/183 (30%), Positives = 88/183 (48%), Gaps = 23/183 (12%)

Query: 85   FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI--RCK------PSP 136
             + +D D N+ +  + PE  +S  GH L+ ++ F LG H+ T   +  R K      PSP
Sbjct: 1262 IVAADSDCNLHIMQFDPEHPKSLQGHLLLHRSTFALGGHLPTSMTLLPRTKSATLLPPSP 1321

Query: 137  -----ISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNP 191
                  +DA       L    S  G +    PL E  YRRL  L + ++    H  GLNP
Sbjct: 1322 DAMDTAADATIPEHEIL--ITSSTGCISLLTPLSEAQYRRLSTLTSHLINTLYHACGLNP 1379

Query: 192  RAFRTYKG--KGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI 249
            RA+R  K   +G      SR +IDG+++ ++++L    R E+  ++G    D+L+   D+
Sbjct: 1380 RAYRVDKDAPEGMVG---SRTVIDGNILMRWMELGSQRRAEVAGRVGV---DVLEVREDL 1433

Query: 250  EAL 252
             +L
Sbjct: 1434 ASL 1436


>gi|46120520|ref|XP_385083.1| hypothetical protein FG04907.1 [Gibberella zeae PH-1]
          Length = 1436

 Score = 72.8 bits (177), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 48/169 (28%), Positives = 82/169 (48%), Gaps = 4/169 (2%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
            V   +F+ +   +  + +D D ++ +  + PE  +S  GH L+ +T F +  +  +   +
Sbjct: 1237 VVVADFLPDGDDLAIVAADVDGDLHILEFNPEHPKSLQGHLLLHRTSFSVSPNPPSTTLL 1296

Query: 131  RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 190
              + +P S         +   AS  G L   +PLPE  YRRLL + N ++   +  GGLN
Sbjct: 1297 LPRTTPPSHPTPQDPPHVLLLASSSGHLSSLIPLPETAYRRLLSVTNQLLPALTPHGGLN 1356

Query: 191  PRAFRTYKGK---GYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 236
             +A R   G    G  A    R I+DG+++ ++ +LS  +R EI  K G
Sbjct: 1357 AKAHRLPVGTRTVGVEAAG-GRAIVDGAVLARWAELSAAKRAEIAGKGG 1404


>gi|366994686|ref|XP_003677107.1| hypothetical protein NCAS_0F02680 [Naumovozyma castellii CBS 4309]
 gi|342302975|emb|CCC70752.1| hypothetical protein NCAS_0F02680 [Naumovozyma castellii CBS 4309]
          Length = 1340

 Score = 72.8 bits (177), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 47/179 (26%), Positives = 87/179 (48%), Gaps = 3/179 (1%)

Query: 73   ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC 132
            + EF+I +  + F I+D++  + +  Y P+   +  G +L+  + F+L    NT   +  
Sbjct: 1154 SVEFLINAGEINFAITDREDILHVLKYAPDEPNTLSGQKLVHCSSFNLYSS-NTCMLMLP 1212

Query: 133  KPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPR 192
            +      +  A  +F      +DG +   +PL E  YRRL ++Q  ++      GGLNPR
Sbjct: 1213 RNDEFETSDKAPPKFQAIGGQVDGGIFKIIPLKEDTYRRLYVVQQQIIDKEVQLGGLNPR 1272

Query: 193  AFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK-HNDILDELYDIE 250
              R      Y   +  R +ID +++ +F +LS+  R    +K G + H DI  ++ ++E
Sbjct: 1273 MER-LDNDFYQLTHVMRPMIDFNIIRRFSELSIERRTHFAQKAGRRAHFDIWRDIINVE 1330


>gi|302924728|ref|XP_003053954.1| predicted protein [Nectria haematococca mpVI 77-13-4]
 gi|256734895|gb|EEU48241.1| predicted protein [Nectria haematococca mpVI 77-13-4]
          Length = 1429

 Score = 72.8 bits (177), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 48/167 (28%), Positives = 83/167 (49%), Gaps = 6/167 (3%)

Query: 74   CEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI--R 131
             +F+ +   +  + +D D ++ +  + PE  +S  GH L+ +T F +  +  T   +  R
Sbjct: 1242 ADFLPDGEDLAIVAADADGDLHILEFNPEHPKSLQGHLLLHRTTFSVSPNPPTSMLLLPR 1301

Query: 132  CKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNP 191
              P P   +P   S+ L   AS  G L   +PLPE  YRRLL + N ++   +  GGLN 
Sbjct: 1302 TTP-PAHPSPSDPSQIL-LLASPSGHLSTLVPLPEATYRRLLSVTNQLLPALTPYGGLNA 1359

Query: 192  RAFRTYKGKGYYAGNPSRG--IIDGSLVWKFLQLSLGERLEICKKIG 236
            + +R   G      + + G  I+DG+++ ++ +L   +R EI  K G
Sbjct: 1360 KGYRLPSGTRPVGVDAAAGRTIVDGAILARWAELGAAKRAEIAGKGG 1406


>gi|320591495|gb|EFX03934.1| cleavage and polyadenylation specificity factor subunit [Grosmannia
            clavigera kw1407]
          Length = 1461

 Score = 72.4 bits (176), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 47/173 (27%), Positives = 76/173 (43%), Gaps = 21/173 (12%)

Query: 85   FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF-------FKIRCKPSPI 137
             +++D D N+ +  + PE  +S  GH L+ +  F  G H +T        F    +P+  
Sbjct: 1266 IVVTDADGNLHVMQFDPEHPKSLQGHILLHRATFCTGAHFSTLSLLLPSTFTPADRPTAN 1325

Query: 138  SDAPGARSR--------FLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGL 189
             +  GA S+              S  G L   +PL E  YRRL  L   + T  + T GL
Sbjct: 1326 GETNGASSQPEAQQHQQHQLLLGSPTGLLASLVPLSESEYRRLSSLAGQLATSLTQTAGL 1385

Query: 190  NPRAFRTYKGKGYYAGNP------SRGIIDGSLVWKFLQLSLGERLEICKKIG 236
            NP+ +R   G       P       R ++DG+L+ ++ +L  G + EI  ++G
Sbjct: 1386 NPKGYRMTAGSAAATLAPGVDAAVGRSVVDGALLARWTELGSGRKGEIAGRVG 1438


>gi|393220097|gb|EJD05583.1| cleavage factor protein [Fomitiporia mediterranea MF3/22]
          Length = 1450

 Score = 72.4 bits (176), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 49/177 (27%), Positives = 93/177 (52%), Gaps = 11/177 (6%)

Query: 71   VYACEFVIESSSMGFM-ISDKDKNVVLFMYQPEARESNGGHRLIKKTDFH--LGQHVNTF 127
            V   +F+  S    ++ +SD++  + L  Y     ES+ G  L+++T++H  +  H    
Sbjct: 1271 VMTADFLFASDGDFYIAVSDEEGIIRLLEYDTSDPESHSGQYLLRRTEYHAQVESHTTVL 1330

Query: 128  FKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPL-PEKNYRRLLMLQNVMVTHTSHT 186
               R +    +D    ++R ++  A++DG++    P+  +++ +RL +LQ  +  +  H 
Sbjct: 1331 IARRSQ----NDGLVPQARLIS--AAVDGSMYALTPVDADESAKRLQLLQGQLTRNMQHV 1384

Query: 187  GGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDIL 243
             GLNPRAFR  +  G  A   ++GI+DG+L+  F QL +  + EI + IG+    +L
Sbjct: 1385 AGLNPRAFRAVRSDG-VARPLTKGILDGNLLAGFEQLPIPRQNEIARPIGTDRLAVL 1440


>gi|336276223|ref|XP_003352865.1| hypothetical protein SMAC_04980 [Sordaria macrospora k-hell]
 gi|380092984|emb|CCC09221.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 1486

 Score = 72.4 bits (176), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 51/180 (28%), Positives = 82/180 (45%), Gaps = 15/180 (8%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
            V   +F+ +   +  + SD D ++ +  + PE  +S  GH L+ +T F+ G H  T   +
Sbjct: 1246 VLNADFLPDGKELYIVASDADGHIHILQFDPEHPKSLQGHLLLHRTTFNTGAHHPTSSLL 1305

Query: 131  RCKPSPISDAPGARSRF------LTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTS 184
                 P + +P + S        +   AS  G L    PL E  YRRL  L   +     
Sbjct: 1306 LPAVYPTTTSPNSNSEVGENPPHILLLASPTGLLATLRPLQENAYRRLSSLAIQLTNALP 1365

Query: 185  HTGGLNPRAFR--------TYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 236
            H  GLNP+ +R        + +  G  AG   R I+DG ++ +F++L  G+R EI  + G
Sbjct: 1366 HPAGLNPKGYRLPSPSASASMQLPGVDAGI-GRNIVDGKILERFMELGTGKRQEIAGRAG 1424


>gi|325094074|gb|EGC47384.1| cleavage factor two protein 1 [Ajellomyces capsulatus H88]
          Length = 1377

 Score = 72.0 bits (175), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 48/192 (25%), Positives = 87/192 (45%), Gaps = 18/192 (9%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
            +++    + SS     + D+++ + + +   +   S+ G RL+ ++ F  G   +T   +
Sbjct: 1186 LFSARMTVPSSDDADFLPDENR-LYILVADDDYPGSSKGDRLLHRSTFQTGHFASTMTLL 1244

Query: 131  RCKPSPISDAPGARSRFL----------TWYASLDGALGFFLPLPEKNYRRLLMLQNVMV 180
                +  S  P A    +              S  G++    P+ E +YRRL  LQ+ + 
Sbjct: 1245 PRTATSSSQGPDADPDMMDLDSSGPLHHVLVTSETGSIALITPVSETSYRRLSALQSQLA 1304

Query: 181  THTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHN 240
                H  GLNPRAFR  +  G       RG++DG LV ++L L    + EI  ++G+   
Sbjct: 1305 NTLEHPCGLNPRAFRAVESDGIGG----RGMVDGDLVKRWLDLGTQRKAEIANRVGA--- 1357

Query: 241  DILDELYDIEAL 252
            D+ +   D+EA+
Sbjct: 1358 DVWEIRADLEAI 1369


>gi|255075065|ref|XP_002501207.1| predicted protein [Micromonas sp. RCC299]
 gi|226516471|gb|ACO62465.1| predicted protein [Micromonas sp. RCC299]
          Length = 1423

 Score = 72.0 bits (175), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 48/162 (29%), Positives = 80/162 (49%), Gaps = 18/162 (11%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
            + A EF+++ S++  + +D   N  +F Y P++ ES  G +L+ K  FH+G  V+   + 
Sbjct: 1226 ILATEFLVDGSTLSMLTTDMTGNAFIFSYDPKSLESWKGQKLLTKGAFHVGSPVHRMVRF 1285

Query: 131  RCKPSPISDAPGAR----------SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMV 180
            R K +P + APG            +R   ++ +LDG+LG  +P+ E  +  L  LQ  + 
Sbjct: 1286 RLK-APTA-APGQTISPAEQKAQANRHAVFFGTLDGSLGILVPIEEAAHASLQSLQRYLT 1343

Query: 181  THTSHT--GGLNPRAFR---TYKGKGYYAGNPSRGIIDGSLV 217
              T H    GLN R  R   T +G+      P   ++DG L+
Sbjct: 1344 YATPHAALAGLNARTHRHPKTVEGRPMRQPAP-HSLLDGGLL 1384


>gi|412986884|emb|CCO15310.1| predicted protein [Bathycoccus prasinos]
          Length = 1595

 Score = 71.6 bits (174), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 50/180 (27%), Positives = 85/180 (47%), Gaps = 14/180 (7%)

Query: 73   ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC 132
            A EF+I+  ++  + +D + N  +F Y P+  ES  G +L+ K+ +H G  +    + + 
Sbjct: 1408 ASEFMIDGGTLSLLAADANGNAHVFQYAPKLAESWKGDKLLPKSAYHAGSLIRKMVRFQ- 1466

Query: 133  KPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG----- 187
                I      ++R   ++ S DG LG F P+ E  +  L  LQ+ M ++   +      
Sbjct: 1467 ----IGVGEQKQNRHAVFFGSSDGGLGIFSPVDEHTFLNLEKLQDAMRSNIVASSNSINP 1522

Query: 188  -GLNPRAFRTYK-GKGYYA-GNPSRGIIDGSLVWKFL-QLSLGERLEICKKIGSKHNDIL 243
             GLN + +R  K  +G  A   P R I+DG L+ KF   LS+  +  +  K G   +  L
Sbjct: 1523 LGLNSKTYRALKSSEGSVARQTPPRTIVDGGLLSKFEHSLSITAQTRVAAKAGLTRDQAL 1582


>gi|294659889|ref|XP_462318.2| DEHA2G17908p [Debaryomyces hansenii CBS767]
 gi|218511978|sp|Q6BHK3.2|CFT1_DEBHA RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two protein
            1
 gi|199434312|emb|CAG90824.2| DEHA2G17908p [Debaryomyces hansenii CBS767]
          Length = 1342

 Score = 71.2 bits (173), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 48/182 (26%), Positives = 84/182 (46%), Gaps = 5/182 (2%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
            V   +F+I+   +  +I+D +  + L  Y PE   S+ G RLI K  F++     T   I
Sbjct: 1158 VNCADFIIKDEEIFILIADNNSTLHLVKYDPEDPTSSNGQRLIHKASFNINS---TPTCI 1214

Query: 131  RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 190
            R  P      P +   F +  +++DG+     P+ E +YRR+ +LQ  +     H  GLN
Sbjct: 1215 RSIPKNEEINPSSTEVFQSIGSTIDGSFYTVFPINEASYRRMYILQQQITDKEYHFCGLN 1274

Query: 191  PRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK--HNDILDELYD 248
            PR  R            ++ ++D  ++  F +L+   R  +  K+ SK  + DI  +L +
Sbjct: 1275 PRLNRFGGLSMTVNDTNTKPLLDYEVIRMFAKLNEDRRKNLSMKVSSKNVYQDIWKDLIE 1334

Query: 249  IE 250
             +
Sbjct: 1335 FD 1336


>gi|358387835|gb|EHK25429.1| hypothetical protein TRIVIDRAFT_32877 [Trichoderma virens Gv29-8]
          Length = 1440

 Score = 71.2 bits (173), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 52/192 (27%), Positives = 92/192 (47%), Gaps = 15/192 (7%)

Query: 74   CEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQH--VNTFFKIR 131
             +F+ +   +  +  D D ++ +  + PE  +S  GH L+ +T F +  +   +T    R
Sbjct: 1242 ADFLPDGEDLSMVAVDADGDIHVLEFNPEHPKSLQGHLLLHRTTFSVTPNPPTSTLLLPR 1301

Query: 132  CKPSPIS--DAPGARSR--FLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 187
              P+  S   +P + S    L   AS  G L    PLPE  YRRLL + N ++      G
Sbjct: 1302 TLPASQSATTSPDSSSSQPHLLLLASPSGCLASLTPLPESAYRRLLSVTNQLLPALVPHG 1361

Query: 188  GLNPRAFRTYKGK-------GYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHN 240
            GL+ RA RT +G        G       R I+DG+++ ++ +L   +R E+  + G  ++
Sbjct: 1362 GLHARAHRTPEGGGGMSRTVGVETAASGRAIVDGAILARWNELGAAKRAEVATRGG--YD 1419

Query: 241  DILDELYDIEAL 252
             +++   D+EA+
Sbjct: 1420 GVMEMREDLEAV 1431


>gi|396471273|ref|XP_003838832.1| similar to cleavage and polyadenylation specificity factor subunit A
            [Leptosphaeria maculans JN3]
 gi|312215401|emb|CBX95353.1| similar to cleavage and polyadenylation specificity factor subunit A
            [Leptosphaeria maculans JN3]
          Length = 1402

 Score = 70.9 bits (172), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 54/183 (29%), Positives = 80/183 (43%), Gaps = 22/183 (12%)

Query: 73   ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLG----------- 121
            A EF+  +  +  +++D D N+ +  + PE  +S G  RL+ K  FH G           
Sbjct: 1199 AVEFLPFNGELHIIVADADMNIQVLQFDPENPKSEGS-RLLHKATFHTGHFPTTTHLLQS 1257

Query: 122  -----QHVNTFFKIRC-KPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLML 175
                 +  +TF       P     AP    + L    S  G L    PL E +YRRL  L
Sbjct: 1258 HLQMPESASTFGTTDTFAPDSTPSAPLPLHQVL--ITSQSGTLALITPLSESSYRRLSNL 1315

Query: 176  QNVMVTHTSHTGGLNPRAFRTYKG--KGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICK 233
               ++       GLNP AFR  +G   G+ AG  +RG++DG L+ ++ +L    R E   
Sbjct: 1316 AAYLINTLESPCGLNPVAFRAGEGVEGGWDAGGGARGVLDGGLLMRWGELGEQRRKEGLA 1375

Query: 234  KIG 236
            K G
Sbjct: 1376 KYG 1378


>gi|449299306|gb|EMC95320.1| hypothetical protein BAUCODRAFT_25380 [Baudoinia compniacensis UAMH
            10762]
          Length = 1437

 Score = 70.5 bits (171), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 50/192 (26%), Positives = 82/192 (42%), Gaps = 31/192 (16%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
            V + EF+     +  ++ D   ++ +  Y PE  ++  G RL+ K+ FHLG        +
Sbjct: 1174 VVSAEFLPFDGQLYLLVVDGKMDLHVLQYDPENPKTVSGQRLLHKSTFHLGHWPVDMLLL 1233

Query: 131  RCKPSPIS--------DAPGARS-----------------RFLTWYASLDGALGFFLPLP 165
                +P +        D+ G  +                   LT + S  GA+G   P+ 
Sbjct: 1234 PSDLAPFAQQAPLTNGDSNGHTNGTESSAANAPAPAPSLFHVLTTFQS--GAVGLITPVD 1291

Query: 166  EKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSL 225
            E  YRRL  LQ  + +   H  GLNPRA+R  + +        RG++DG LV +  +L  
Sbjct: 1292 EATYRRLGALQTQLTSVLEHAAGLNPRAYRAVESESLGG----RGVVDGMLVQRIGELGA 1347

Query: 226  GERLEICKKIGS 237
              R E+  + G+
Sbjct: 1348 ARRAEVLGRAGA 1359


>gi|365984967|ref|XP_003669316.1| hypothetical protein NDAI_0C04130 [Naumovozyma dairenensis CBS 421]
 gi|343768084|emb|CCD24073.1| hypothetical protein NDAI_0C04130 [Naumovozyma dairenensis CBS 421]
          Length = 1388

 Score = 70.5 bits (171), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 47/179 (26%), Positives = 87/179 (48%), Gaps = 3/179 (1%)

Query: 73   ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC 132
            + EFV+    + F ++D++  + +  Y P+   S  G +L+  + F+L    N+   +  
Sbjct: 1202 SVEFVVNGGDLYFALTDRNNILHVLKYAPDELNSLSGQKLVHCSSFNLFSG-NSSLLLLP 1260

Query: 133  KPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPR 192
            K     D   A   F T    +DG++   +PL E  YRRL ++Q  M       GGLNPR
Sbjct: 1261 KNEEFEDTKNAPLTFQTIGGQVDGSIFKVIPLREDTYRRLYVIQQHMNDKEPQLGGLNPR 1320

Query: 193  AFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK-HNDILDELYDIE 250
              R    + Y   +  R ++D +++ +F +L +  R  + K+ G + H +I  ++ ++E
Sbjct: 1321 MER-LSNEYYQLCHVMRPMLDFNIIRRFSELPIDRRTRVAKRAGQRAHYEIWRDMINVE 1378


>gi|449543656|gb|EMD34631.1| hypothetical protein CERSUDRAFT_116804 [Ceriporiopsis subvermispora
            B]
          Length = 1440

 Score = 70.5 bits (171), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 51/184 (27%), Positives = 85/184 (46%), Gaps = 8/184 (4%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  V   +F      +  +  D+D  + +  Y P   ES  G  L+++T+FH GQ     
Sbjct: 1261 PLYVTKADFFFAEGRVSIISCDEDGVMRILEYDPHDPESKNGQHLLRRTEFH-GQVEYRT 1319

Query: 128  FKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 187
              I  +     D P  +SR +      DG+L     + E   +RL +LQ  +  +  H  
Sbjct: 1320 SAILARRLKGVDIP--QSRLICGLT--DGSLITMTYVEEAASKRLHLLQGQLTRNVQHVA 1375

Query: 188  GLNPRAFRTYKGKGYYAGNP-SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEL 246
            GLNPR FR  +    Y   P +RGI+DG+L+  +  L +  + E+ ++IG+    IL + 
Sbjct: 1376 GLNPRGFRIVRND--YVSRPLTRGILDGNLLMAYEDLPIVRQDEVTRQIGTDRTTILKDW 1433

Query: 247  YDIE 250
              ++
Sbjct: 1434 LSLD 1437


>gi|156040479|ref|XP_001587226.1| hypothetical protein SS1G_12256 [Sclerotinia sclerotiorum 1980]
 gi|154696312|gb|EDN96050.1| hypothetical protein SS1G_12256 [Sclerotinia sclerotiorum 1980 UF-70]
          Length = 1447

 Score = 70.5 bits (171), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 55/197 (27%), Positives = 87/197 (44%), Gaps = 23/197 (11%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLG-QHVNTFFK 129
            V   + + +   +  + +D D N+ +  Y PE  +S  GH L+ +T F LG  H  T   
Sbjct: 1249 VLCADLLPDGKDLFIVAADADGNLHIMQYDPEHPKSLQGHLLLHRTTFSLGAHHPTTMTL 1308

Query: 130  IRCKPS--PIS-------------DAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLM 174
            +   PS  P++             D+P      L    S  G      PL E  YRR   
Sbjct: 1309 LPAIPSLHPLTTASSSSLSPSPQEDSPSPSQSLL--LTSRTGTFALLSPLTESQYRRFGT 1366

Query: 175  LQNVMVTHTSHTGGLNPRAFRTYK--GKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEIC 232
            L + +     H  GLNPRA+R  K   +G   G   R IIDG ++ ++++L    R E+ 
Sbjct: 1367 LVSHLTNTLYHPCGLNPRAYRVDKDANEGIVGG---RTIIDGGVLGRWMELGSQRRGEVA 1423

Query: 233  KKIGSKHNDILDELYDI 249
             ++G    ++ DEL ++
Sbjct: 1424 GRVGVDVLELRDELSEL 1440


>gi|189203597|ref|XP_001938134.1| conserved hypothetical protein [Pyrenophora tritici-repentis
            Pt-1C-BFP]
 gi|187985233|gb|EDU50721.1| conserved hypothetical protein [Pyrenophora tritici-repentis
            Pt-1C-BFP]
          Length = 1407

 Score = 70.5 bits (171), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 48/183 (26%), Positives = 78/183 (42%), Gaps = 30/183 (16%)

Query: 73   ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF----- 127
            A +F+     +  +++D D N+ +  + P+  +S  G RL+ K  FH G    +      
Sbjct: 1194 AVDFLPFDQQLHLVVADADMNLQILQFDPDNPKSEAGSRLLHKATFHTGHLPTSLHLIHS 1253

Query: 128  --------------------FKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEK 167
                                F +   P+  +D P  +      + +  G L    PL E 
Sbjct: 1254 HLKLPSATDFAATNSNPADAFAMDTSPNTTTDTP-QQPFHQILHTTQSGTLALLTPLSED 1312

Query: 168  NYRRLLMLQNVMVTHTSHTGGLNPRAFRT--YKGKGYYAGNPSRGIIDGSLVWKFLQLSL 225
            +YRRL  L   +         LNPRAFRT      G+ AG  +RG++DG+L+ ++ +  L
Sbjct: 1313 SYRRLSNLTAYLANTLDSACSLNPRAFRTGDVAEGGWDAGTGARGVLDGNLLLRWGE--L 1370

Query: 226  GER 228
            GER
Sbjct: 1371 GER 1373


>gi|401624207|gb|EJS42273.1| cft1p [Saccharomyces arboricola H-6]
          Length = 1356

 Score = 70.1 bits (170), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 47/166 (28%), Positives = 79/166 (47%), Gaps = 8/166 (4%)

Query: 73   ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC 132
            + EF++    M F  +D D+NV +  Y P+   S  G RL+  + F L   +N+   +  
Sbjct: 1172 SLEFLVNGGDMYFSATDADRNVHVLKYAPDEPNSLSGQRLVHCSSFTL-HSINSCMLLLP 1230

Query: 133  KPSPI--SDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 190
            K      S  P     F      +DG++   +PL E+ YRRL ++Q  ++      GGLN
Sbjct: 1231 KNEEFGSSQVPS----FQNVGGQVDGSIFKIVPLSEETYRRLYVIQQQIIDREIQLGGLN 1286

Query: 191  PRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 236
            PR  R      Y  G+  R ++D +++ +F +L++  R    +K G
Sbjct: 1287 PRMER-LANDFYQMGHSMRPMLDFNVIRRFSELAIDRRKNTAQKAG 1331


>gi|349577352|dbj|GAA22521.1| K7_Cft1p [Saccharomyces cerevisiae Kyokai no. 7]
          Length = 1357

 Score = 70.1 bits (170), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 50/179 (27%), Positives = 87/179 (48%), Gaps = 5/179 (2%)

Query: 73   ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC 132
            + EF++    M F  +D D+NV +  Y P+   S  G RL+  + F L    N+   +  
Sbjct: 1173 SLEFLVNGGDMYFAATDADRNVHVLKYAPDEPNSLSGQRLVHCSSFTL-HSTNSCMMLLP 1231

Query: 133  KPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPR 192
            +      +P   S F      +DG++   +PL E+ YRRL ++Q  ++      GGLNPR
Sbjct: 1232 RNEEFG-SPQVPS-FQNVGGQVDGSVFKIVPLSEEKYRRLYVIQQQIIDRELQLGGLNPR 1289

Query: 193  AFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG-SKHNDILDELYDIE 250
              R      Y  G+  R ++D +++ +F  L++  R  I +K G + H +   ++ +IE
Sbjct: 1290 MER-LANDFYQMGHSMRPMLDFNVIRRFCGLAIDRRKSIAQKAGRNAHFEAWRDIINIE 1347


>gi|45184764|ref|NP_982482.1| AAL060Wp [Ashbya gossypii ATCC 10895]
 gi|74695871|sp|Q75EY8.1|CFT1_ASHGO RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two protein
            1
 gi|44980110|gb|AAS50306.1| AAL060Wp [Ashbya gossypii ATCC 10895]
 gi|374105681|gb|AEY94592.1| FAAL060Wp [Ashbya gossypii FDAG1]
          Length = 1305

 Score = 70.1 bits (170), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 52/187 (27%), Positives = 91/187 (48%), Gaps = 15/187 (8%)

Query: 73   ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC 132
              EF++ +  + F+++D++  + +  Y P+   S  G RL+  T F+L    NT  ++  
Sbjct: 1115 CVEFLVNNGDVYFLVTDRNNLMHVLKYAPDEPNSLSGQRLVHCTSFNL-HSTNTCMRLIK 1173

Query: 133  KPSPISDAPGARSRFLTWY--------ASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTS 184
            K    +D  G  SR    Y        +  DG +   +PL E +YR L ++Q  ++    
Sbjct: 1174 K----NDEFGKVSRGFGIYMPSFQCIGSQADGTIFKVVPLSEASYRSLYLIQQQLIDKEV 1229

Query: 185  HTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK-HNDIL 243
               GLNPR  R  +   Y  G+  R ++D +++ +F  LS+  R+ +  K G + H +I 
Sbjct: 1230 QLCGLNPRMER-LENPFYQMGHILRPMLDFTVLKRFATLSIPTRMTMASKAGRQAHAEIW 1288

Query: 244  DELYDIE 250
             +L DIE
Sbjct: 1289 RDLIDIE 1295


>gi|151942273|gb|EDN60629.1| cleavage factor II (CF II) component [Saccharomyces cerevisiae
            YJM789]
          Length = 1357

 Score = 69.7 bits (169), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 47/164 (28%), Positives = 79/164 (48%), Gaps = 4/164 (2%)

Query: 73   ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC 132
            + EF++    M F  +D D+NV +  Y P+   S  G RL+  + F L    N+   +  
Sbjct: 1173 SLEFLVNGGDMYFAATDADRNVHVLKYAPDEPNSLSGQRLVHCSSFTL-HSTNSCMMLLP 1231

Query: 133  KPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPR 192
            +      +P   S F      +DG++   +PL E+ YRRL ++Q  ++      GGLNPR
Sbjct: 1232 RNEEFG-SPQVPS-FQNVGGQVDGSVFKIVPLSEEKYRRLYVIQQQIIDRELQLGGLNPR 1289

Query: 193  AFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 236
              R      Y  G+  R ++D +++ +F  L++  R  I +K G
Sbjct: 1290 MER-LANDFYQMGHSMRPMLDFNVIRRFCGLAIDRRKSIAQKAG 1332


>gi|6320507|ref|NP_010587.1| Cft1p [Saccharomyces cerevisiae S288c]
 gi|74583567|sp|Q06632.1|CFT1_YEAST RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two protein
            1
 gi|849213|gb|AAB64737.1| Ydr301wp [Saccharomyces cerevisiae]
 gi|256271799|gb|EEU06830.1| Cft1p [Saccharomyces cerevisiae JAY291]
 gi|285811316|tpg|DAA12140.1| TPA: Cft1p [Saccharomyces cerevisiae S288c]
 gi|392300415|gb|EIW11506.1| Cft1p [Saccharomyces cerevisiae CEN.PK113-7D]
          Length = 1357

 Score = 69.7 bits (169), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 47/164 (28%), Positives = 79/164 (48%), Gaps = 4/164 (2%)

Query: 73   ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC 132
            + EF++    M F  +D D+NV +  Y P+   S  G RL+  + F L    N+   +  
Sbjct: 1173 SLEFLVNGGDMYFAATDADRNVHVLKYAPDEPNSLSGQRLVHCSSFTL-HSTNSCMMLLP 1231

Query: 133  KPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPR 192
            +      +P   S F      +DG++   +PL E+ YRRL ++Q  ++      GGLNPR
Sbjct: 1232 RNEEFG-SPQVPS-FQNVGGQVDGSVFKIVPLSEEKYRRLYVIQQQIIDRELQLGGLNPR 1289

Query: 193  AFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 236
              R      Y  G+  R ++D +++ +F  L++  R  I +K G
Sbjct: 1290 MER-LANDFYQMGHSMRPMLDFNVIRRFCGLAIDRRKSIAQKAG 1332


>gi|190404756|gb|EDV08023.1| 150 kDa protein associated with polyadenylation factor 1
            [Saccharomyces cerevisiae RM11-1a]
 gi|259145538|emb|CAY78802.1| Cft1p [Saccharomyces cerevisiae EC1118]
          Length = 1357

 Score = 69.7 bits (169), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 47/164 (28%), Positives = 79/164 (48%), Gaps = 4/164 (2%)

Query: 73   ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC 132
            + EF++    M F  +D D+NV +  Y P+   S  G RL+  + F L    N+   +  
Sbjct: 1173 SLEFLVNGGDMYFAATDADRNVHVLKYAPDEPNSLSGQRLVHCSSFTL-HSTNSCMMLLP 1231

Query: 133  KPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPR 192
            +      +P   S F      +DG++   +PL E+ YRRL ++Q  ++      GGLNPR
Sbjct: 1232 RNEEFG-SPQVPS-FQNVGGQVDGSVFKIVPLSEEKYRRLYVIQQQIIDRELQLGGLNPR 1289

Query: 193  AFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 236
              R      Y  G+  R ++D +++ +F  L++  R  I +K G
Sbjct: 1290 MER-LANDFYQMGHSMRPMLDFNVIRRFCGLAIDRRKSIAQKAG 1332


>gi|323338222|gb|EGA79455.1| Cft1p [Saccharomyces cerevisiae Vin13]
 gi|365766372|gb|EHN07870.1| Cft1p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
          Length = 1357

 Score = 69.7 bits (169), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 47/164 (28%), Positives = 79/164 (48%), Gaps = 4/164 (2%)

Query: 73   ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC 132
            + EF++    M F  +D D+NV +  Y P+   S  G RL+  + F L    N+   +  
Sbjct: 1173 SLEFLVNGGDMYFAATDADRNVHVLKYAPDEPNSLSGQRLVHCSSFTL-HSTNSCMMLLP 1231

Query: 133  KPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPR 192
            +      +P   S F      +DG++   +PL E+ YRRL ++Q  ++      GGLNPR
Sbjct: 1232 RNEEFG-SPQVPS-FQNVGGQVDGSVFKIVPLSEEKYRRLYVIQQQIIDRELQLGGLNPR 1289

Query: 193  AFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 236
              R      Y  G+  R ++D +++ +F  L++  R  I +K G
Sbjct: 1290 MER-LANDFYQMGHSMRPMLDFNVIRRFCGLAIDRRKSIAQKAG 1332


>gi|207346484|gb|EDZ72967.1| YDR301Wp-like protein [Saccharomyces cerevisiae AWRI1631]
          Length = 1357

 Score = 69.7 bits (169), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 47/164 (28%), Positives = 79/164 (48%), Gaps = 4/164 (2%)

Query: 73   ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC 132
            + EF++    M F  +D D+NV +  Y P+   S  G RL+  + F L    N+   +  
Sbjct: 1173 SLEFLVNGGDMYFAATDADRNVHVLKYAPDEPNSLSGQRLVHCSSFTL-HSTNSCMMLLP 1231

Query: 133  KPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPR 192
            +      +P   S F      +DG++   +PL E+ YRRL ++Q  ++      GGLNPR
Sbjct: 1232 RNEEFG-SPQVPS-FQNVGGQVDGSVFKIVPLSEEKYRRLYVIQQQIIDRELQLGGLNPR 1289

Query: 193  AFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 236
              R      Y  G+  R ++D +++ +F  L++  R  I +K G
Sbjct: 1290 MER-LANDFYQMGHSMRPMLDFNVIRRFCGLAIDRRKSIAQKAG 1332


>gi|358390357|gb|EHK39763.1| hypothetical protein TRIATDRAFT_48211 [Trichoderma atroviride IMI
            206040]
          Length = 1441

 Score = 69.7 bits (169), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 50/192 (26%), Positives = 89/192 (46%), Gaps = 15/192 (7%)

Query: 74   CEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQH--VNTFFKIR 131
             +F+ +   +  +  D D ++ +  + PE  +S  GH L+ +T F +  +   +T    R
Sbjct: 1243 ADFLPDGEDLSMVAVDADGDIHVLEFNPEHPKSLQGHLLLHRTTFSVTPNPPTSTLLLPR 1302

Query: 132  CKPSPISDAPGARSR----FLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 187
              P+  S      S      L   AS  G+L    PLPE  YRRLL + N ++      G
Sbjct: 1303 TLPASQSATASQDSSTPQPHLLLLASPSGSLAALTPLPESAYRRLLSVTNQLLPALVPHG 1362

Query: 188  GLNPRAFRTYKGK-------GYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHN 240
            GL+ RA R  +G        G       R I+DG+++ ++ +L   +R E+  + G  ++
Sbjct: 1363 GLHARAHRAPEGGGGMSRMVGVETAASGRAIVDGAILTRWNELGAAKRAEVASRGG--YD 1420

Query: 241  DILDELYDIEAL 252
             +++   D+EA+
Sbjct: 1421 SVMELREDLEAV 1432


>gi|170102106|ref|XP_001882269.1| predicted protein [Laccaria bicolor S238N-H82]
 gi|164642641|gb|EDR06896.1| predicted protein [Laccaria bicolor S238N-H82]
          Length = 1406

 Score = 69.7 bits (169), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 48/177 (27%), Positives = 84/177 (47%), Gaps = 13/177 (7%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQ---HVNTF 127
            V   +F      +  +  D++  + ++ Y P+  +S  G  L+ +T+FH GQ     +T 
Sbjct: 1229 VTNADFFFTDGELSLVTGDEEGIMRMYEYNPQDPDSKDGRYLLLRTEFH-GQSEYRTSTT 1287

Query: 128  FKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 187
               R K  P    P A+          DG L    P+ E  ++RL +LQ  +  +  H  
Sbjct: 1288 IARRLKDDP--SIPQAK----LIIGGTDGCLSSLTPVEEHAFKRLQLLQGQLTRNIQHVA 1341

Query: 188  GLNPRAFRTYKGKGYYAGNP-SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDIL 243
            GLNP+AFR  +    +   P S+GI+DG+L+  +  L +  + E+ ++IG+    +L
Sbjct: 1342 GLNPKAFRIVRND--FVSKPLSKGILDGNLLAHYESLPIIRQNEMTRQIGTDRVTLL 1396


>gi|323309632|gb|EGA62840.1| Cft1p [Saccharomyces cerevisiae FostersO]
          Length = 1357

 Score = 69.7 bits (169), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 47/164 (28%), Positives = 79/164 (48%), Gaps = 4/164 (2%)

Query: 73   ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC 132
            + EF++    M F  +D D+NV +  Y P+   S  G RL+  + F L    N+   +  
Sbjct: 1173 SLEFLVNGGDMYFAATDADRNVHVLKYAPDEPNSLSGQRLVHCSSFTL-HSTNSCMMLLP 1231

Query: 133  KPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPR 192
            +      +P   S F      +DG++   +PL E+ YRRL ++Q  ++      GGLNPR
Sbjct: 1232 RNEEFG-SPQVPS-FQNVGGQVDGSVFKIVPLSEEKYRRLYVIQQQIIDRELQLGGLNPR 1289

Query: 193  AFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 236
              R      Y  G+  R ++D +++ +F  L++  R  I +K G
Sbjct: 1290 MER-LANDFYQMGHSMRPMLDFNVIRRFCGLAIDRRKSIAQKAG 1332


>gi|330919204|ref|XP_003298516.1| hypothetical protein PTT_09264 [Pyrenophora teres f. teres 0-1]
 gi|311328242|gb|EFQ93393.1| hypothetical protein PTT_09264 [Pyrenophora teres f. teres 0-1]
          Length = 1388

 Score = 69.3 bits (168), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 47/183 (25%), Positives = 77/183 (42%), Gaps = 30/183 (16%)

Query: 73   ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF----- 127
            A +F+     +  +++D D N+ +  + P+  +   G RL+ K  FH G    +      
Sbjct: 1194 AVDFLPFDQQLHLVVADADMNLQILQFDPDNPKGEAGSRLLHKATFHTGHFPTSLHLIHS 1253

Query: 128  --------------------FKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEK 167
                                F +   P+  +D P  +      + +  G L    PL E 
Sbjct: 1254 HLKLPSATDFAATNNNPADAFAMDTSPNTTTDTP-QQPFHQILHTTQSGTLALLTPLSED 1312

Query: 168  NYRRLLMLQNVMVTHTSHTGGLNPRAFRT--YKGKGYYAGNPSRGIIDGSLVWKFLQLSL 225
            +YRRL  L   +         LNPRAFRT      G+ AG  +RG++DG+L+ ++ +  L
Sbjct: 1313 SYRRLSNLSAYLANTLDSACSLNPRAFRTGDVAEGGWDAGTGARGVLDGNLLLRWGE--L 1370

Query: 226  GER 228
            GER
Sbjct: 1371 GER 1373


>gi|401841121|gb|EJT43641.1| CFT1-like protein [Saccharomyces kudriavzevii IFO 1802]
          Length = 1355

 Score = 69.3 bits (168), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 47/166 (28%), Positives = 78/166 (46%), Gaps = 8/166 (4%)

Query: 73   ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC 132
            + EF++    M F  +D D+NV +  Y P+   S  G RL+  + F +   +N+   +  
Sbjct: 1171 SLEFLVNGGDMYFAATDADRNVHILKYAPDEPNSLSGQRLVHCSSFTV-HSINSCMMLLP 1229

Query: 133  KPSPI--SDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 190
            K      S  P     F      +DG++   +PL E+ YRRL ++Q  ++      GGLN
Sbjct: 1230 KNQEFGSSQVPS----FQNVGGQVDGSVFKIVPLSEETYRRLYLIQQQIIDRELQLGGLN 1285

Query: 191  PRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 236
            PR  R      Y  G+  R ++D +++ +F  LS+  R    +K G
Sbjct: 1286 PRMER-LANDFYQMGHSMRPMLDFNVIRRFSGLSIDRRKNTAQKAG 1330


>gi|406602601|emb|CCH45811.1| hypothetical protein BN7_5397 [Wickerhamomyces ciferrii]
          Length = 1287

 Score = 68.9 bits (167), Expect = 2e-09,   Method: Composition-based stats.
 Identities = 46/176 (26%), Positives = 93/176 (52%), Gaps = 14/176 (7%)

Query: 75   EFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKP 134
            +F+++   + F+++D ++++ L  YQP+  +S  G +L++K+ F     + T  K+  K 
Sbjct: 1112 DFIVDDGEIYFLVADDEESLHLLTYQPDDPKSLSGQKLLQKSTFTTNS-ITTCLKLVPKF 1170

Query: 135  SPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAF 194
            +      G+ + +     ++DG++   +P+ E +YRRL +LQ  +    +H  GLNPR+ 
Sbjct: 1171 NEFDQ--GSITSYQNIGVNVDGSIFKMIPIDEISYRRLYILQQQLSDKIAHYVGLNPRSN 1228

Query: 195  RTYKGKGYYAGNP--SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYD 248
            R       ++ N    + II+  L+  F+ L++ +R +   K+G   ND L+   D
Sbjct: 1229 R-------FSANEQGQKPIIEFGLLKWFINLNVDKRKQFSAKVG--RNDYLELFKD 1275


>gi|380494933|emb|CCF32776.1| cft-1, partial [Colletotrichum higginsianum]
          Length = 542

 Score = 68.6 bits (166), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 50/188 (26%), Positives = 84/188 (44%), Gaps = 9/188 (4%)

Query: 71  VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
           V   +FV+    +  ++ DKD  + +  + PE  +S  GH L+ +  F    +  T   +
Sbjct: 349 VLTADFVVAGDELHIVVCDKDGVIHVMQFDPEHPKSLQGHLLLNRASFSAAPNHPTI-TL 407

Query: 131 RCKPSPISDAPGARSR---FLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 187
               +PIS +  + S+        AS  GAL    PL E+ YRRL  L N +     H  
Sbjct: 408 SLPRTPISPSATSVSKNPPTTLLLASPTGALASLTPLSEQAYRRLTSLANSIAGALPHAA 467

Query: 188 GLNPRAFRTYKGKGYYAG---NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILD 244
             NP+  R         G   +  R I+DG+L+ ++ +L  G R E+  K G  + D+ +
Sbjct: 468 ATNPKGHRLQPLDARTPGVDTSAGRSIVDGALLARWNELGAGRRSEVAGKGG--YGDVHE 525

Query: 245 ELYDIEAL 252
              ++E +
Sbjct: 526 VRSELEGV 533


>gi|389641257|ref|XP_003718261.1| cft-1 [Magnaporthe oryzae 70-15]
 gi|351640814|gb|EHA48677.1| cft-1 [Magnaporthe oryzae 70-15]
          Length = 1452

 Score = 68.6 bits (166), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 44/160 (27%), Positives = 73/160 (45%), Gaps = 8/160 (5%)

Query: 85   FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQH--VNTFFKIRCKPSPISDAPG 142
             + +D D N+ +  + PE  +S  GH L+ +T F  G H    +       P P ++ P 
Sbjct: 1270 IVAADSDGNLHVLQFDPEHPKSLQGHLLLNRTTFSTGAHHPQKSLLLPTTDPRPSTNQPS 1329

Query: 143  ARS-RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFR-----T 196
            + + R     AS  G L    PL +  Y RL  L + ++    H   LNP+A+R     T
Sbjct: 1330 SDAERQHILMASPTGVLAAVQPLSQSTYTRLSALASNLMASVPHHAALNPKAYRLPPTST 1389

Query: 197  YKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 236
                     +  R ++DGSL+ ++ +L+ G R E+  + G
Sbjct: 1390 RNQVAAVDISVGRAVVDGSLLARWAELASGRRAEVAGRAG 1429


>gi|302403950|ref|XP_002999813.1| cft-1 [Verticillium albo-atrum VaMs.102]
 gi|261361315|gb|EEY23743.1| cft-1 [Verticillium albo-atrum VaMs.102]
          Length = 1349

 Score = 68.6 bits (166), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 51/194 (26%), Positives = 87/194 (44%), Gaps = 20/194 (10%)

Query: 74   CEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQH----VNTFFK 129
             +F++  + +  + SD+D  + +  + PE   S  GH L+ +  F +  +         +
Sbjct: 1149 ADFLVAGNDLSIVASDEDGVLHILQFDPEHPRSLQGHLLLNRASFSVAPNHAWVTLALPR 1208

Query: 130  IRCKP-----SPISDAPGARSRFLTWY-ASLDGALGFFLPLPEKNYRRLLMLQNVMVTHT 183
               +P      P ++A G+++R  T   AS  GA+    P+ E  YRRL  L   +    
Sbjct: 1209 TTTRPYLPQSEPATNAAGSQNRTQTLLLASASGAIASLNPITEHAYRRLTSLTTSLANAL 1268

Query: 184  SHTGGLNPRAFRTYKGKGYYAGNP-------SRGIIDGSLVWKFLQLSLGERLEICKKIG 236
             H  G+NP+A R     G  A  P        R I+DG+L+ ++ +L   +R E   K G
Sbjct: 1269 PHAAGMNPKAHRLPPQDG--AARPPAVDVSAGRTIVDGALLARWNELGARQRAEAAGKGG 1326

Query: 237  -SKHNDILDELYDI 249
             +   D+  EL D+
Sbjct: 1327 FASAADVRGELEDV 1340


>gi|440466842|gb|ELQ36086.1| hypothetical protein OOU_Y34scaffold00669g71 [Magnaporthe oryzae Y34]
 gi|440481991|gb|ELQ62520.1| hypothetical protein OOW_P131scaffold01068g7 [Magnaporthe oryzae
            P131]
          Length = 1475

 Score = 68.2 bits (165), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 44/160 (27%), Positives = 73/160 (45%), Gaps = 8/160 (5%)

Query: 85   FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQH--VNTFFKIRCKPSPISDAPG 142
             + +D D N+ +  + PE  +S  GH L+ +T F  G H    +       P P ++ P 
Sbjct: 1293 IVAADSDGNLHVLQFDPEHPKSLQGHLLLNRTTFSTGAHHPQKSLLLPTTDPRPSTNQPS 1352

Query: 143  ARS-RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFR-----T 196
            + + R     AS  G L    PL +  Y RL  L + ++    H   LNP+A+R     T
Sbjct: 1353 SDAERQHILMASPTGVLAAVQPLSQSTYTRLSALASNLMASVPHHAALNPKAYRLPPTST 1412

Query: 197  YKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 236
                     +  R ++DGSL+ ++ +L+ G R E+  + G
Sbjct: 1413 RNQVAAVDISVGRAVVDGSLLARWAELASGRRAEVAGRAG 1452


>gi|346319828|gb|EGX89429.1| protein CFT1 [Cordyceps militaris CM01]
          Length = 1452

 Score = 68.2 bits (165), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 50/191 (26%), Positives = 87/191 (45%), Gaps = 15/191 (7%)

Query: 74   CEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCK 133
             +F+ +   +  +  D D ++ +F + P+  +S  GH L+ +T F L  +  T   +  +
Sbjct: 1253 ADFLPDGEDLNMIGVDADGDLHVFEFNPDHPKSLQGHLLLHRTTFSLSPNEPTTTVLLER 1312

Query: 134  PSPISD-----APGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGG 188
              P S        GA +      +   G L    PL E  YRRLL L N ++      GG
Sbjct: 1313 TIPASQPQPQGTTGAETPHTLLLSCPTGQLAALTPLSESAYRRLLSLANQLMPAVVPYGG 1372

Query: 189  LNPRAFRTYKGKGYYA---------GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKH 239
            L+P+A R  +G+G  +             R I+DG+++ ++ +L   +R E+  K G   
Sbjct: 1373 LHPKAHRLPEGRGAQSHARAVGVETAASGRMIVDGAVLARWTELGAAKRAEMATKSGYDD 1432

Query: 240  -NDILDELYDI 249
             N++ DEL  +
Sbjct: 1433 LNEMRDELEGV 1443


>gi|254564833|ref|XP_002489527.1| RNA-binding subunit of the mRNA cleavage and polyadenylation factor
            [Komagataella pastoris GS115]
 gi|238029323|emb|CAY67246.1| RNA-binding subunit of the mRNA cleavage and polyadenylation factor
            [Komagataella pastoris GS115]
 gi|328349950|emb|CCA36350.1| Protein cft1 [Komagataella pastoris CBS 7435]
          Length = 1388

 Score = 68.2 bits (165), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 51/171 (29%), Positives = 82/171 (47%), Gaps = 9/171 (5%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
            V A +FV+   ++  +I+D+D  + L  Y PE   S  G RL++++ F   Q+  T  K+
Sbjct: 1205 VCAADFVVFEGNLFIIIADEDGMLHLIQYDPEDPASMQGQRLLRRSIFKTNQYT-TCMKM 1263

Query: 131  R-----CKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSH 185
            R      KP P +        F    A+ DG+     P+ E  YRRL ++Q  +    +H
Sbjct: 1264 RERKYVIKP-PKNQFTNFSEAFEVVAANSDGSFYKVTPISEATYRRLYVIQQQIFDQENH 1322

Query: 186  TGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 236
              GLNPR  R Y    Y   N  R I+D   + +FL+    ++ ++  K+G
Sbjct: 1323 KCGLNPRENR-YLSDQYSIPN-QRLILDFDNIRRFLEFDEIKKRDLVHKLG 1371


>gi|260941626|ref|XP_002614979.1| hypothetical protein CLUG_04994 [Clavispora lusitaniae ATCC 42720]
 gi|238851402|gb|EEQ40866.1| hypothetical protein CLUG_04994 [Clavispora lusitaniae ATCC 42720]
          Length = 1363

 Score = 67.4 bits (163), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 48/184 (26%), Positives = 93/184 (50%), Gaps = 7/184 (3%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
            V + +F+++   +  +++D ++++ +  Y PE   S+ G RL+ K+ F    ++ T  K 
Sbjct: 1177 VSSADFLVKDEEIYILVADNNRSLHVLQYNPEDPASSNGQRLLHKSSF-TTNYLTTCTKS 1235

Query: 131  RCKPSPISD--APGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGG 188
              K   +S    P A   F T  ++++GA+    P+ E  YRR+ ++Q  ++    H  G
Sbjct: 1236 VPKHEQLSTWFDPQAIP-FQTVGSTVEGAMYVVFPISEPTYRRMYIMQQQLIDKEYHHCG 1294

Query: 189  LNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKH--NDILDEL 246
            LNPR  R  + +     N  R ++D  L+ +F +L+   +  +  KI +K+   DI  +L
Sbjct: 1295 LNPRLNRIGRIESVNYAN-LRAMLDCELIRRFSKLNEDRKRTLSSKISTKNVQVDIWKDL 1353

Query: 247  YDIE 250
             + E
Sbjct: 1354 IEFE 1357


>gi|171695066|ref|XP_001912457.1| hypothetical protein [Podospora anserina S mat+]
 gi|170947775|emb|CAP59938.1| unnamed protein product [Podospora anserina S mat+]
          Length = 1441

 Score = 67.4 bits (163), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 53/178 (29%), Positives = 81/178 (45%), Gaps = 15/178 (8%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
            V   +F+     +  +  D + ++ +  + PE  +S  GH L+ +T F  G H  T  K 
Sbjct: 1244 VLNADFLPNGKELSIVACDAEGHIHILQFDPEHPKSLQGHLLLHRTSFSTGAHHVT--KS 1301

Query: 131  RCKPSPIS-------DAPGARSR-FLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
               PS +S       +  GA SR  +   AS  G L    PL E  YRRL  L   +   
Sbjct: 1302 LLLPSTLSPDNKEDNEENGATSRPHILLLASPTGVLAALRPLSETAYRRLSSLAAQLTNS 1361

Query: 183  TSHTGGLNPRAFR----TYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 236
             +H  GLNP+ +R    T    G  AG   R I+DG+++ +F +L   +R E+  + G
Sbjct: 1362 LTHAAGLNPKGYRMPSATCPPAGVDAGI-GRHIVDGTILARFSELGRAKRGEVAGRAG 1418


>gi|346971831|gb|EGY15283.1| cft-1 [Verticillium dahliae VdLs.17]
          Length = 1445

 Score = 67.0 bits (162), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 51/194 (26%), Positives = 86/194 (44%), Gaps = 20/194 (10%)

Query: 74   CEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQH----VNTFFK 129
             +F++  + +  + SD+D  + +  + PE   S  GH L+ +  F +  +         +
Sbjct: 1245 ADFLVAGNDLSIVASDEDGVLHILQFDPEHPRSLQGHLLLNRASFSVAPNHAWATLVLPR 1304

Query: 130  IRCKP-----SPISDAPGARSRFLTWY-ASLDGALGFFLPLPEKNYRRLLMLQNVMVTHT 183
               +P      P + A G+++R  T   AS  GA+    P+ E  YRRL  L   +    
Sbjct: 1305 TTTRPYLPQSEPATGAAGSQNRTQTLLLASASGAIASLNPITEHAYRRLTSLTTSLANAL 1364

Query: 184  SHTGGLNPRAFRTYKGKGYYAGNP-------SRGIIDGSLVWKFLQLSLGERLEICKKIG 236
             H  G+NP+A R     G  A  P        R I+DG+L+ ++ +L   +R E   K G
Sbjct: 1365 PHAAGMNPKAHRLPPQDG--AARPPAVDVSAGRTIVDGALLARWNELGARQRAEAAGKGG 1422

Query: 237  -SKHNDILDELYDI 249
             +   D+  EL D+
Sbjct: 1423 FASAADVRGELEDV 1436


>gi|342877552|gb|EGU79002.1| hypothetical protein FOXB_10431 [Fusarium oxysporum Fo5176]
          Length = 1399

 Score = 66.2 bits (160), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 48/169 (28%), Positives = 81/169 (47%), Gaps = 4/169 (2%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
            V   +F+ +   +  + +D D ++ +  + PE  +S  GH L+ +T F +  +  +   +
Sbjct: 1209 VLVADFLPDGEDLAIVAADADGDLHILDFNPEHPKSLQGHLLLHRTSFSVSPNPPSTTLL 1268

Query: 131  RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 190
              +  P S  P      +   AS  G L   +PLPE  YRRLL + N ++   +  GGLN
Sbjct: 1269 LPRTLPPSHPPPQDPPHILLLASSSGHLATLVPLPETTYRRLLSVTNQLLPALTPHGGLN 1328

Query: 191  PRAFRTYKG---KGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 236
             +A R   G    G  A    R I+DG+++ ++ +L   +R EI  K G
Sbjct: 1329 AKAHRLPDGIRPVGVEAAG-GRTIVDGAILARWAELGAAKRAEIAGKGG 1376


>gi|347838999|emb|CCD53571.1| similar to Cleavage and polyadenylation specificity factor subunit 1
            [Botryotinia fuckeliana]
          Length = 1447

 Score = 65.5 bits (158), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 52/194 (26%), Positives = 82/194 (42%), Gaps = 23/194 (11%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
            V   + + +   +  + +D + N+ +  Y PE  +S  GH L+ +T F LG H  T   +
Sbjct: 1249 VLCADLLPDGKDLFIVAADANGNLHIMQYDPEHPKSLQGHLLLHRTTFSLGAHHPTTMTL 1308

Query: 131  RCKPSPI----------------SDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLM 174
                 P+                 D P      L    S  G L    PL E  YRR   
Sbjct: 1309 LPTTRPLPQLTTAPSPSPDPSPQEDTPSPSQPLL--LTSRTGTLALLSPLTESQYRRFGT 1366

Query: 175  LQNVMVTHTSHTGGLNPRAFRTYK--GKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEIC 232
            L + +     H  GLNPRA+R  +   +G   G   R IIDG ++ ++++L    R E+ 
Sbjct: 1367 LVSHLTNTLYHPCGLNPRAYRIDRDANEGIVGG---RTIIDGGVLGRWMELGSQRRGEVA 1423

Query: 233  KKIGSKHNDILDEL 246
             ++G    ++ DEL
Sbjct: 1424 GRVGVDVLELRDEL 1437


>gi|400597740|gb|EJP65470.1| CPSF A subunit region [Beauveria bassiana ARSEF 2860]
          Length = 1444

 Score = 65.1 bits (157), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 48/183 (26%), Positives = 84/183 (45%), Gaps = 16/183 (8%)

Query: 74   CEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCK 133
             +F+ +   +  +  D D ++ +F + P+  +S  GH LI +T F L  +  T   +  +
Sbjct: 1245 ADFLPDGEDLSMIGVDADGDLHVFEFDPDHPKSLQGHLLIHRTTFSLSPNEPTTTVLLER 1304

Query: 134  PSPISD-----APGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGG 188
              P S        GA +      +   G L    PL E  YRRLL L N ++      GG
Sbjct: 1305 TIPASQPQPKGTTGAETPHTLLLSCPTGQLAALTPLSESAYRRLLSLTNQVLPAVVPHGG 1364

Query: 189  LNPRAFRTYKGKGYYA---------GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKH 239
            L+P+A R  +G+G  +             R I+DG+++ ++ +L   +R E+  K G  +
Sbjct: 1365 LHPKAHRLPEGRGAQSHSRAVGVETAASGRMIVDGAVLARWTELGAAKRAEMALKSG--Y 1422

Query: 240  NDI 242
            +D+
Sbjct: 1423 DDV 1425


>gi|409046890|gb|EKM56369.1| hypothetical protein PHACADRAFT_93103 [Phanerochaete carnosa
            HHB-10118-sp]
          Length = 1417

 Score = 65.1 bits (157), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 47/178 (26%), Positives = 88/178 (49%), Gaps = 5/178 (2%)

Query: 73   ACEFVIESSSMGFMISDKDKNVV-LFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIR 131
            AC  +  +     +IS  ++ V+ L  Y P   ES  G RL+ +T+FH      T   I 
Sbjct: 1240 ACADLFFADGKASLISCDEEGVLRLSEYDPHDPESRHGQRLLCRTEFHGQTEYRTSHLIA 1299

Query: 132  CKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNP 191
             +   + DA   +++ +  +   DG+L     + +   +RL +LQ  +  +  H  GLNP
Sbjct: 1300 RRGKGL-DAEIPQAKLICGHT--DGSLTSLTYVDDAVSKRLHLLQGQLARNVQHVAGLNP 1356

Query: 192  RAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI 249
            +AFR  +     A   ++GI+DG+L+  F  L +  ++E+ ++I ++   +L +  D+
Sbjct: 1357 KAFRVVRND-RVARPLTKGILDGNLLAAFEDLPVPRQVEVTRQIATERTTVLKDWLDL 1413


>gi|154320778|ref|XP_001559705.1| hypothetical protein BC1G_01861 [Botryotinia fuckeliana B05.10]
          Length = 1153

 Score = 65.1 bits (157), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 52/194 (26%), Positives = 82/194 (42%), Gaps = 23/194 (11%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
            V   + + +   +  + +D + N+ +  Y PE  +S  GH L+ +T F LG H  T   +
Sbjct: 955  VLCADLLPDGKDLFIVAADANGNLHIMQYDPEHPKSLQGHLLLHRTTFSLGAHHPTTMTL 1014

Query: 131  RCKPSPI----------------SDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLM 174
                 P+                 D P      L    S  G L    PL E  YRR   
Sbjct: 1015 LPTTRPLPQLTTAPSPSPDPSPQEDTPSPSQPLL--LTSRTGTLALLSPLTESQYRRFGT 1072

Query: 175  LQNVMVTHTSHTGGLNPRAFRTYK--GKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEIC 232
            L + +     H  GLNPRA+R  +   +G   G   R IIDG ++ ++++L    R E+ 
Sbjct: 1073 LVSHLTNTLYHPCGLNPRAYRIDRDANEGIVGG---RTIIDGGVLGRWMELGSQRRGEVA 1129

Query: 233  KKIGSKHNDILDEL 246
             ++G    ++ DEL
Sbjct: 1130 GRVGVDVLELRDEL 1143


>gi|358338426|dbj|GAA28838.2| cleavage and polyadenylation specificity factor subunit 1 [Clonorchis
            sinensis]
          Length = 1741

 Score = 65.1 bits (157), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 34/101 (33%), Positives = 53/101 (52%)

Query: 156  GALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGS 215
            GA+    PL +K Y RL + +  ++ H   T GL P+    Y+       NPS  + D  
Sbjct: 1641 GAVFLIGPLRDKMYSRLRITEKNLIHHFGPTCGLLPKLCWNYRPSAPELVNPSGQVADAD 1700

Query: 216  LVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 256
            L+W++L L   +RLEI KK G     I+D++ ++ A + HF
Sbjct: 1701 LLWRYLTLPHSQRLEIAKKSGQSLEGIMDDIAELNATTLHF 1741



 Score = 51.6 bits (122), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 32/99 (32%), Positives = 48/99 (48%), Gaps = 12/99 (12%)

Query: 27   HELLIYQAFRHPKGTLK----LRFKK-LKVLFVSDRSKRANEQPGLPRGVYACEFVIESS 81
            H LL  +        LK    LRF+  L+VL V  R         +PR VY   F ++  
Sbjct: 1457 HSLLCVKNLILAADVLKSIQLLRFQSDLRVLSVVSRD-------AIPREVYTSNFFVDGR 1509

Query: 82   SMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHL 120
             +GF+++D+  NVV++ Y P    S  G RL+++ D  L
Sbjct: 1510 RLGFLVTDERGNVVIYSYDPLEPSSRSGRRLVRRADMCL 1548


>gi|444313909|ref|XP_004177612.1| hypothetical protein TBLA_0A02930 [Tetrapisispora blattae CBS 6284]
 gi|387510651|emb|CCH58093.1| hypothetical protein TBLA_0A02930 [Tetrapisispora blattae CBS 6284]
          Length = 1459

 Score = 65.1 bits (157), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 45/185 (24%), Positives = 88/185 (47%), Gaps = 9/185 (4%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
            +   EF+  + ++ F  +D D  + +F Y P+   S  G +L+  + F+L    +    +
Sbjct: 1270 ISTVEFIANNGNIYFAATDYDNILHIFKYAPDEPNSLSGQKLVHCSSFNLHSSTSCMIML 1329

Query: 131  RCKPSPISDAPGARSRFLTWYASL----DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHT 186
               P     +   +  F+  + +L    DG++   +PL E  YRRL ++Q  +  +    
Sbjct: 1330 ---PGNDEFSENEQDNFIPSFQTLGGQVDGSIFKVIPLEESPYRRLYVIQQQITDYEVQV 1386

Query: 187  GGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK-HNDILDE 245
            GGLNP+  R    + Y   N  + ++D +++ +F  L + +R    +K G + H +I  +
Sbjct: 1387 GGLNPKMER-LSNEYYQKSNMLKPMLDFNIIRRFSMLPIDKRRRTAQKAGRRAHFEIWRD 1445

Query: 246  LYDIE 250
            L +IE
Sbjct: 1446 LINIE 1450


>gi|50305395|ref|XP_452657.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
 gi|74606921|sp|Q6CTT2.1|CFT1_KLULA RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two protein
            1
 gi|49641790|emb|CAH01508.1| KLLA0C10274p [Kluyveromyces lactis]
          Length = 1300

 Score = 64.7 bits (156), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 42/181 (23%), Positives = 86/181 (47%), Gaps = 6/181 (3%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
            V   EF++ + ++ F+++D+  ++ +  Y P+   S  G RL+    F++    N + K+
Sbjct: 1115 VMNLEFLVNNGNINFIVTDRQNHLHVLRYAPDEANSLSGQRLVHCNSFNMFT-TNNYMKL 1173

Query: 131  RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 190
              K           S ++      DG++   +PL E +YRR  ++Q  ++ H     G N
Sbjct: 1174 VRKHVEFG---SKTSNYIALGCQTDGSIFRMIPLNEASYRRFYLVQQQLLDHEIPLAGFN 1230

Query: 191  PRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGS-KHNDILDELYDI 249
             +  R    + Y+ G+  R  +D  ++ K++ L + +R  I  ++G     ++  +L DI
Sbjct: 1231 TKMER-LDNEYYHKGHSLRPTLDSQVLKKYIHLPITKRTTIENRVGRHASTELWHDLIDI 1289

Query: 250  E 250
            E
Sbjct: 1290 E 1290


>gi|150951283|ref|XP_001387581.2| pre-mRNA 3'-end processing factor CF II mRNA cleavage and
            polyadenylation factor II complex, subunit CFT1 (CPSF
            subunit) RNA processing and modification [Scheffersomyces
            stipitis CBS 6054]
 gi|149388465|gb|EAZ63558.2| pre-mRNA 3'-end processing factor CF II mRNA cleavage and
            polyadenylation factor II complex, subunit CFT1 (CPSF
            subunit) RNA processing and modification [Scheffersomyces
            stipitis CBS 6054]
          Length = 1341

 Score = 64.7 bits (156), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 46/184 (25%), Positives = 90/184 (48%), Gaps = 11/184 (5%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
            V   +F+ +   +  +I+D +  + L  Y PE   +  G RL+ K+ F     +N+F  +
Sbjct: 1159 VNCADFITKDEEVFILIADNNNVLHLVQYDPEDPTALNGQRLLSKSSFS----INSF--V 1212

Query: 131  RC-KPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGL 189
             C K  P ++       F T  +++DG+    +P+ E +YRR+ +LQ  +     H  GL
Sbjct: 1213 TCLKSLPKTEEKYDTGNFQTIGSTIDGSFFSVVPINEASYRRMYILQQQLTDKEYHYCGL 1272

Query: 190  NPRAFRTYKGKGYYAGNP-SRGIIDGSLVWKFLQLSLGERLEICKKIGSK--HNDILDEL 246
            NPR  R + G    A +  ++ I+D  ++  + +L+   +  +  K+ +K  + DI  ++
Sbjct: 1273 NPRLNR-FGGLSMTANDTNTKPILDYDVIRAYGKLNEERKKNLASKVSAKNIYQDIWKDI 1331

Query: 247  YDIE 250
             + E
Sbjct: 1332 IEFE 1335


>gi|410079681|ref|XP_003957421.1| hypothetical protein KAFR_0E01320 [Kazachstania africana CBS 2517]
 gi|372464007|emb|CCF58286.1| hypothetical protein KAFR_0E01320 [Kazachstania africana CBS 2517]
          Length = 1350

 Score = 64.7 bits (156), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 42/179 (23%), Positives = 88/179 (49%), Gaps = 3/179 (1%)

Query: 73   ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC 132
            + EF++ +  + F ++D+D  + +  Y P+   S  G +L+  + F+L    N+   +  
Sbjct: 1162 SVEFLVNNGDIYFAVTDRDNILHVLKYAPDEPNSFSGQKLVHCSSFNLYAD-NSCMVLLA 1220

Query: 133  KPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPR 192
            K    +        +       DG++   +PL E++YRRL ++Q  ++   +  GGLNPR
Sbjct: 1221 KNDEFNKVDDTNRTYQVVGGQTDGSMFKIVPLSEESYRRLYVIQQQIIDKETQLGGLNPR 1280

Query: 193  AFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG-SKHNDILDELYDIE 250
              R    +     +  R ++D +++ KF  + + +R  + +K+G + H +   +L +IE
Sbjct: 1281 MER-LSNQYLPLCHVMRPMLDFNVIRKFSAMPISKRQALAQKLGRNVHFEAWRDLINIE 1338


>gi|448530371|ref|XP_003870046.1| mRNA cleavage and polyadenylation factor [Candida orthopsilosis Co
            90-125]
 gi|380354400|emb|CCG23915.1| mRNA cleavage and polyadenylation factor [Candida orthopsilosis]
          Length = 1327

 Score = 64.3 bits (155), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 49/185 (26%), Positives = 85/185 (45%), Gaps = 16/185 (8%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
            V   +F+++   +  +++D + ++ L  + P+  +S  G  LI K  F           +
Sbjct: 1148 VECADFIVKDEEIFILVADVNNSLHLIQFDPDDPKSINGTILINKASFETNSQTTC---L 1204

Query: 131  RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 190
            R  P       G    + T  +++DGA     P+ E  YRR+ ++Q  +     H  GLN
Sbjct: 1205 RSVPK------GETGDYQTIGSTIDGAFFNVFPVNESTYRRMYIVQQQISDKEYHYCGLN 1258

Query: 191  PRAFRTYKGKGYYAGNPSRG--IIDGSLVWKFLQLSLGERLEICKKI---GSKHNDILDE 245
            PR  R + G      N +    I+D +L+ +F +L+L  +  I  KI   GS H DI  +
Sbjct: 1259 PRLNR-FGGAVQIRDNDTNAKPILDYNLIKEFAKLNLDRQKNITTKINIKGSAH-DIWKD 1316

Query: 246  LYDIE 250
            L ++E
Sbjct: 1317 LIELE 1321


>gi|353231025|emb|CCD77443.1| putative cleavage and polyadenylation specificity factor cpsf
            [Schistosoma mansoni]
          Length = 1825

 Score = 64.3 bits (155), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 34/122 (27%), Positives = 62/122 (50%)

Query: 135  SPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAF 194
            +P S     + R   ++ S +G++    P+ +K Y RL + +  ++ H     G+ P++ 
Sbjct: 1704 NPNSGVDPEKFRQSIYFGSQNGSIYRIGPIRDKMYSRLRITEKNLIHHLGPICGMPPKSC 1763

Query: 195  RTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSS 254
             +Y        NP   + DG L+W++L L   +RLEI KK G     I+D++ ++ A + 
Sbjct: 1764 WSYNRPQPELANPCGKVADGDLIWRYLTLPHCQRLEIAKKSGQSLESIMDDIAELIATTL 1823

Query: 255  HF 256
            HF
Sbjct: 1824 HF 1825



 Score = 47.4 bits (111), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 32/109 (29%), Positives = 47/109 (43%), Gaps = 12/109 (11%)

Query: 27   HELLIYQAFRHPKGTLK----LRFKK-LKVLFVSDRSKRANEQPGLPRGVYACEFVIESS 81
            H LL  +        LK    LRF+  L+VL V  R         + R VY   F ++  
Sbjct: 1534 HNLLCVKNLVLAADVLKSVQLLRFQSDLRVLSVVSRD-------NISREVYTSNFFVDGR 1586

Query: 82   SMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
             +GFM+SD+  NV ++ Y P    S  G RL++  D  L        ++
Sbjct: 1587 RLGFMVSDELGNVTIYSYDPLDPSSRSGRRLVRCADMRLPSRATCSLRV 1635


>gi|256079900|ref|XP_002576222.1| cleavage and polyadenylation specificity factor cpsf [Schistosoma
            mansoni]
          Length = 1958

 Score = 63.9 bits (154), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 31/107 (28%), Positives = 57/107 (53%)

Query: 150  WYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSR 209
            ++ S +G++    P+ +K Y RL + +  ++ H     G+ P++  +Y        NP  
Sbjct: 1852 YFGSQNGSIYRIGPIRDKMYSRLRITEKNLIHHLGPICGMPPKSCWSYNRPQPELANPCG 1911

Query: 210  GIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 256
             + DG L+W++L L   +RLEI KK G     I+D++ ++ A + HF
Sbjct: 1912 KVADGDLIWRYLTLPHCQRLEIAKKSGQSLESIMDDIAELIATTLHF 1958



 Score = 47.4 bits (111), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 32/109 (29%), Positives = 47/109 (43%), Gaps = 12/109 (11%)

Query: 27   HELLIYQAFRHPKGTLK----LRFKK-LKVLFVSDRSKRANEQPGLPRGVYACEFVIESS 81
            H LL  +        LK    LRF+  L+VL V  R         + R VY   F ++  
Sbjct: 1551 HNLLCVKNLVLAADVLKSVQLLRFQSDLRVLSVVSRD-------NISREVYTSNFFVDGR 1603

Query: 82   SMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
             +GFM+SD+  NV ++ Y P    S  G RL++  D  L        ++
Sbjct: 1604 RLGFMVSDELGNVTIYSYDPLDPSSRSGRRLVRCADMRLPSRATCSLRV 1652


>gi|346321204|gb|EGX90804.1| DNA damage-binding protein 1 [Cordyceps militaris CM01]
          Length = 1160

 Score = 63.9 bits (154), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 41/158 (25%), Positives = 79/158 (50%), Gaps = 6/158 (3%)

Query: 85   FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGAR 144
            ++ +D   N+++    PEA       RL   ++ +LG+ +N   K+   P+   +A  + 
Sbjct: 998  WLETDAQGNIMVLARNPEAPTEQDRGRLEITSEMNLGEQINKIRKLNVAPA--DNAVVSP 1055

Query: 145  SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYA 204
              FL   AS++G L  +  +  K    L+ LQ+ +  +   TG ++  A+R+++ +   A
Sbjct: 1056 KAFL---ASIEGTLYLYGDIAPKYQDLLITLQSNIEQYVKTTGDISFNAWRSFRNQTREA 1112

Query: 205  GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
              P R  +DG +V +FL L    ++E+CK +G    D+
Sbjct: 1113 DGPFR-FVDGEMVERFLDLDELTQVELCKDLGPSVEDV 1149


>gi|340515387|gb|EGR45642.1| predicted protein [Trichoderma reesei QM6a]
          Length = 1441

 Score = 63.9 bits (154), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 49/192 (25%), Positives = 90/192 (46%), Gaps = 15/192 (7%)

Query: 74   CEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQH--VNTFFKIR 131
             +F+ +   +  +  D D ++ +  + PE  +S  GH L+ +T F +  +   +T    R
Sbjct: 1243 ADFLPDGEDLSMVAVDADGDMHVLEFNPEHPKSLQGHLLLHRTTFSVTPNPPTSTLLLPR 1302

Query: 132  CKPSPISDAPGARSRF----LTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 187
              P+  S    + S      +   AS  G++    PLPE  YRRLL + N ++      G
Sbjct: 1303 TLPASQSSQDSSSSSSTQPHILLLASPSGSIAALTPLPESAYRRLLSVTNQLLPALVPHG 1362

Query: 188  GLNPRAFRTYKGK-------GYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHN 240
            GL+ RA RT +G        G       R I+DG+++ ++ +L   +R E+  + G  ++
Sbjct: 1363 GLHARAHRTPEGGGGMSRTVGVETAATGRAIVDGTVLTRWNELGAAKRAEVATRGG--YD 1420

Query: 241  DILDELYDIEAL 252
             + +   D+EA+
Sbjct: 1421 GVTEMREDLEAV 1432


>gi|344229600|gb|EGV61485.1| hypothetical protein CANTEDRAFT_109087 [Candida tenuis ATCC 10573]
          Length = 1300

 Score = 63.5 bits (153), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 41/181 (22%), Positives = 87/181 (48%), Gaps = 9/181 (4%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
            V   +F+++   +  ++ D +  + L  Y PE   S+ G RL++K  F+L   V    ++
Sbjct: 1125 VNCADFIVKDEELYIIVGDNNNILHLLKYDPEDPNSSNGQRLVEKAAFNLNAKVT---QL 1181

Query: 131  RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 190
            +  P+ + ++           ++++G+     P+ E +YRR+ +LQ  +     H  GLN
Sbjct: 1182 KQLPNLMDNSTSCIG------STIEGSFFTVFPINESSYRRMYILQQQLTDKAYHHCGLN 1235

Query: 191  PRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIE 250
            PR  R    K     + ++ I+D  ++  + +L+   R  I  K+  + ++I  ++ + E
Sbjct: 1236 PRLNRFGGLKLTANESNNKPILDYDVIKLYAKLNEDRRRNIGAKVSREGSEIWRDMLEFE 1295

Query: 251  A 251
            A
Sbjct: 1296 A 1296


>gi|336463425|gb|EGO51665.1| hypothetical protein NEUTE1DRAFT_89273 [Neurospora tetrasperma FGSC
            2508]
          Length = 1437

 Score = 63.2 bits (152), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 52/178 (29%), Positives = 81/178 (45%), Gaps = 13/178 (7%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNT---- 126
            V   +F+ +   +  + SD D ++ +  + PE  +S  GH L+ +T F+ G H  T    
Sbjct: 1198 VLNADFLPDGKELYIVASDADGHIHILQFDPEHPKSLQGHLLLHRTTFNTGAHHPTSSLL 1257

Query: 127  FFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHT 186
               +   PS +S      S  +   AS  G L    PL E  YRRL  L   +     H 
Sbjct: 1258 LPAVYPNPSSLSSNSEENSPHILLLASPTGVLATLRPLQENAYRRLSSLAVQLTNGLPHP 1317

Query: 187  GGLNPRAFR--------TYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 236
             GLNP+ +R        + +  G  AG   R I+DG ++ +FL+L  G+R E+  + G
Sbjct: 1318 AGLNPKGYRLPSPSASASMQLPGVDAGI-GRNIVDGKILERFLELGTGKRQEMAGRAG 1374


>gi|148886831|sp|Q7SEY2.2|CFT1_NEUCR RecName: Full=Protein cft-1; AltName: Full=Cleavage factor two
            protein 1
          Length = 1456

 Score = 63.2 bits (152), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 52/178 (29%), Positives = 81/178 (45%), Gaps = 13/178 (7%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNT---- 126
            V   +F+ +   +  + SD D ++ +  + PE  +S  GH L+ +T F+ G H  T    
Sbjct: 1248 VLNADFLPDGKELYIVASDADGHIHILQFDPEHPKSLQGHLLLHRTTFNTGAHHPTSSLL 1307

Query: 127  FFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHT 186
               +   PS +S      S  +   AS  G L    PL E  YRRL  L   +     H 
Sbjct: 1308 LPAVYPNPSSLSSNSEENSPHILLLASPTGVLATLRPLQENAYRRLSSLAVQLTNGLPHP 1367

Query: 187  GGLNPRAFR--------TYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 236
             GLNP+ +R        + +  G  AG   R I+DG ++ +FL+L  G+R E+  + G
Sbjct: 1368 AGLNPKGYRLPSPSASASMQLPGVDAGI-GRNIVDGKILERFLELGTGKRQEMAGRAG 1424


>gi|164429683|ref|XP_964609.2| hypothetical protein NCU02082 [Neurospora crassa OR74A]
 gi|157073577|gb|EAA35373.2| conserved hypothetical protein [Neurospora crassa OR74A]
          Length = 1437

 Score = 63.2 bits (152), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 52/178 (29%), Positives = 81/178 (45%), Gaps = 13/178 (7%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNT---- 126
            V   +F+ +   +  + SD D ++ +  + PE  +S  GH L+ +T F+ G H  T    
Sbjct: 1198 VLNADFLPDGKELYIVASDADGHIHILQFDPEHPKSLQGHLLLHRTTFNTGAHHPTSSLL 1257

Query: 127  FFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHT 186
               +   PS +S      S  +   AS  G L    PL E  YRRL  L   +     H 
Sbjct: 1258 LPAVYPNPSSLSSNSEENSPHILLLASPTGVLATLRPLQENAYRRLSSLAVQLTNGLPHP 1317

Query: 187  GGLNPRAFR--------TYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 236
             GLNP+ +R        + +  G  AG   R I+DG ++ +FL+L  G+R E+  + G
Sbjct: 1318 AGLNPKGYRLPSPSASASMQLPGVDAGI-GRNIVDGKILERFLELGTGKRQEMAGRAG 1374


>gi|350297359|gb|EGZ78336.1| protein cft-1 [Neurospora tetrasperma FGSC 2509]
          Length = 1437

 Score = 63.2 bits (152), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 52/178 (29%), Positives = 81/178 (45%), Gaps = 13/178 (7%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNT---- 126
            V   +F+ +   +  + SD D ++ +  + PE  +S  GH L+ +T F+ G H  T    
Sbjct: 1198 VLNADFLPDGKELYIVASDADGHIHILQFDPEHPKSLQGHLLLHRTTFNTGAHHPTSSLL 1257

Query: 127  FFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHT 186
               +   PS +S      S  +   AS  G L    PL E  YRRL  L   +     H 
Sbjct: 1258 LPAVYPNPSSLSSNSEENSPHILLLASPTGVLATLRPLQENAYRRLSSLAVQLTNGLPHP 1317

Query: 187  GGLNPRAFR--------TYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 236
             GLNP+ +R        + +  G  AG   R I+DG ++ +FL+L  G+R E+  + G
Sbjct: 1318 AGLNPKGYRLPSPSASASMQLPGVDAGI-GRNIVDGKILERFLELGTGKRQEMAGRAG 1374


>gi|443894082|dbj|GAC71432.1| mRNA cleavage and polyadenylation factor II complex, subunit CFT1
            [Pseudozyma antarctica T-34]
          Length = 1543

 Score = 62.8 bits (151), Expect = 1e-07,   Method: Composition-based stats.
 Identities = 51/172 (29%), Positives = 85/172 (49%), Gaps = 21/172 (12%)

Query: 94   VVLFMYQPEARESNGGHRLIKKTDFH-----LGQHVNTFFKIRCKPSPISDAP-----GA 143
            + L+ Y P    + GGHRL+ +++F      +G  V       C+   +SD+        
Sbjct: 1340 IRLYEYAPHVATTLGGHRLLLRSEFQTPAAAVGSTV-------CRGRWLSDSELRGREEG 1392

Query: 144  RSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYY 203
            RS+ +   A  +GAL     L +K  +RL +LQ  +V    HT  LNPRAFR  +   + 
Sbjct: 1393 RSKLV--LAKANGALDSLSALDDKVAKRLHLLQGQLVRSVQHTAALNPRAFRAVRND-FV 1449

Query: 204  AGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSH 255
              + ++GI+D  L+ +F+ LS  + LE  + + S   D LD++   +  S+H
Sbjct: 1450 PRSLAKGILDARLLDRFVWLSRPKMLEAVRTL-SGLFDGLDQIKKRKRDSNH 1500


>gi|164655043|ref|XP_001728653.1| hypothetical protein MGL_4214 [Malassezia globosa CBS 7966]
 gi|159102535|gb|EDP41439.1| hypothetical protein MGL_4214 [Malassezia globosa CBS 7966]
          Length = 1212

 Score = 62.8 bits (151), Expect = 1e-07,   Method: Composition-based stats.
 Identities = 48/193 (24%), Positives = 87/193 (45%), Gaps = 8/193 (4%)

Query: 67   LPRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNT 126
            +P  V    F+I+ + +  +  D +  + L  Y P    S GG RL+ + ++H    V  
Sbjct: 1025 IPTSVTCGAFLIDRARLSIVTCDMNGCLRLMDYHPSNPTSLGGQRLLARCEYHAPGEV-- 1082

Query: 127  FFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHT 186
              + R    P     G         A  +GA+   +P+ EK +  L + Q+ +V    HT
Sbjct: 1083 -VRARMLHGPYLATSGECLTSEIVLAKRNGAVDVLVPVTEKIFPTLQLFQSQLVRMVRHT 1141

Query: 187  GGLNPRAFRTYKGKGYYAGNP-SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDIL-- 243
             GLNPR FR    +  +   P ++GI+DG+L+     +S  +   + + + ++   ++  
Sbjct: 1142 AGLNPRGFRAVFNQ--HISRPLAKGILDGTLLHTAESMSRPKLTSLVRDLSTRTGGVIAD 1199

Query: 244  DELYDIEALSSHF 256
            D L  +  L SH+
Sbjct: 1200 DLLRCLVHLQSHW 1212


>gi|326432241|gb|EGD77811.1| hypothetical protein PTSG_08901 [Salpingoeca sp. ATCC 50818]
          Length = 1506

 Score = 62.4 bits (150), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 44/184 (23%), Positives = 88/184 (47%), Gaps = 2/184 (1%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLG-QHVNT 126
            P  VYA +F++  + +  +  D+  N+ +  +  +   + GG  L + + ++ G Q +  
Sbjct: 1318 PTSVYAADFIVRGAELHVLFLDQHANMTILAFDSDDPTTRGGRILKRHSVYNTGHQRIVA 1377

Query: 127  FFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHT 186
              +++  P   S      + FLT Y +L+G  G+   +PE  +RRL++LQ  ++ H    
Sbjct: 1378 LTRLQNVPPRNSRNATVDAHFLT-YQTLEGGAGYITSIPEDIFRRLMLLQLRLLPHLKFR 1436

Query: 187  GGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEL 246
             GL+P AF+ YK    +  +     I   +  +   L L  + E+ +++G+    + D+ 
Sbjct: 1437 AGLHPSAFKKYKSASLHMVHQEVRTICADVYTRLFMLDLDAQKEVARQVGTTTKQLCDDF 1496

Query: 247  YDIE 250
              IE
Sbjct: 1497 LFIE 1500


>gi|238881599|gb|EEQ45237.1| conserved hypothetical protein [Candida albicans WO-1]
          Length = 1423

 Score = 62.0 bits (149), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 45/184 (24%), Positives = 77/184 (41%), Gaps = 15/184 (8%)

Query: 70   GVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFK 129
             V   +F+I    +  +++D +  + L  Y P+  +S  G +L+ K  F L   ++    
Sbjct: 1220 SVECADFIINDDEIFVLVADNNNVLHLLNYDPDDPQSINGTKLLTKASFELNSTISCLRS 1279

Query: 130  ---IRCKPSPISDA-----------PGARSRFLTWYASL-DGALGFFLPLPEKNYRRLLM 174
               I  + S  +DA           P   S +     S  DG+     P+ E  YRR+ +
Sbjct: 1280 LPLIDIEESVQTDAFTNIVVPPTLPPNTTSNYFQVIGSTQDGSFFNVFPINEAAYRRMYI 1339

Query: 175  LQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKK 234
            LQ  ++    H  GLNPR  R    K       ++ I+D  L+ +F +LS   +  +  K
Sbjct: 1340 LQQQLIDKEFHYCGLNPRLNRIGSIKLQNNETNTKPILDYDLIRRFTKLSDDRKRNLANK 1399

Query: 235  IGSK 238
            +  K
Sbjct: 1400 VSGK 1403


>gi|344305212|gb|EGW35444.1| pre-mRNA 3'-end processing factor CF II [Spathaspora passalidarum
            NRRL Y-27907]
          Length = 1348

 Score = 62.0 bits (149), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 47/181 (25%), Positives = 82/181 (45%), Gaps = 20/181 (11%)

Query: 70   GVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVN---- 125
             V   +F+ +   +  +I+D +  + L  Y P+  +S  G RLI K+ F +   V+    
Sbjct: 1156 NVSCGDFITKDEDIYMLIADNNNILHLIQYDPDDPQSLNGQRLISKSAFEIESTVSCMRK 1215

Query: 126  ------TFFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVM 179
                  +F K   K SPI +       F    ++ DG+     P+ E +YRR+ +LQ  +
Sbjct: 1216 LPKIESSFEKSEIKFSPIDE-------FQIIGSTSDGSFFNVFPVDESSYRRMYILQQQL 1268

Query: 180  VTHTSHTGGLNPRAFRTYKGKGYYAGNP--SRGIIDGSLVWKFLQLSLGERLEICKKIGS 237
                 H  GLNPR  R + G      N   ++ I+D  L+ ++ QL+   +  +  K+ +
Sbjct: 1269 TDKEYHYCGLNPRLNR-FGGAIELRDNETNTKPILDFGLIKRYAQLNEDRKRNLASKVSA 1327

Query: 238  K 238
            K
Sbjct: 1328 K 1328


>gi|340924328|gb|EGS19231.1| hypothetical protein CTHT_0058560 [Chaetomium thermophilum var.
            thermophilum DSM 1495]
          Length = 1460

 Score = 61.6 bits (148), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 50/196 (25%), Positives = 81/196 (41%), Gaps = 32/196 (16%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
            V   +F+ +   +  +  D D ++ +  + PE  +S  GH L+ +T F+ G H  T  K 
Sbjct: 1244 VLTADFLPDGKELFIVACDADGHIHILQFDPEHPKSLQGHLLLHRTSFNTGAHNPT--KS 1301

Query: 131  RCKPSPI-SDAPGA----------------RSRFLTWYA----------SLDGALGFFLP 163
               PS + +D P                   S    + A          S  G +    P
Sbjct: 1302 LLLPSTLPTDTPSTIDGSNPNTNNTNGTPNASNLAPYDATERPHILLLCSPTGLIAALRP 1361

Query: 164  LPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAG---NPSRGIIDGSLVWKF 220
            L E +YRRL  L   +V    H  GLNP+ +R        AG   +  R I+DG+++ +F
Sbjct: 1362 LSESSYRRLSSLAAQLVNSLPHAAGLNPKGYRMPSADCPPAGVDASVGRNIVDGTVLERF 1421

Query: 221  LQLSLGERLEICKKIG 236
             +L +  R E+  + G
Sbjct: 1422 TELGMARRAELAGRAG 1437


>gi|353234640|emb|CCA66663.1| related to cleavage and polyadenylation specificity factor, 160 kDa
            subunit [Piriformospora indica DSM 11827]
          Length = 1324

 Score = 61.6 bits (148), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 51/185 (27%), Positives = 81/185 (43%), Gaps = 6/185 (3%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  +   +F    + M  +  D    + L  Y P   ++  G RL+   +F    HV   
Sbjct: 1142 PTSIPQGDFFFAHNDMELLTIDLRGVLRLHSYDPTHVDTEEGARLLCSVEFQ--THVEPV 1199

Query: 128  FKIRCK-PSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHT 186
              +R     P SD+    SR L     +DG+L    PL    ++RL +LQ  +V HT H 
Sbjct: 1200 TIVRVAMEQPSSDSASDASRLL--IPRVDGSLASLSPLDMDIFKRLYLLQAQLVRHTHHI 1257

Query: 187  GGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEL 246
              LNP+A+R  +G        SR ++D  L+  F +LS   +  I  +IG     ++ + 
Sbjct: 1258 AALNPKAYRAVQGSS-TTRTMSRRMLDFGLLVGFKKLSFDRQQGIANQIGETWETLIRDC 1316

Query: 247  YDIEA 251
              +EA
Sbjct: 1317 TQLEA 1321


>gi|241954348|ref|XP_002419895.1| subunit of the mRNA cleavage and polyadenylation factor, putative
            [Candida dubliniensis CD36]
 gi|223643236|emb|CAX42110.1| subunit of the mRNA cleavage and polyadenylation factor, putative
            [Candida dubliniensis CD36]
          Length = 1420

 Score = 61.2 bits (147), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 44/183 (24%), Positives = 77/183 (42%), Gaps = 14/183 (7%)

Query: 70   GVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFK 129
             V   +F+I    +  +++D +  + L  Y P+  +S  G +L+ K  F L   ++    
Sbjct: 1218 SVECADFIINDDEIFVLVADNNNVLHLLNYDPDDPQSINGTKLLTKASFELNSTISCLRS 1277

Query: 130  -----IRCKPSPISDAPGA---------RSRFLTWYASLDGALGFFLPLPEKNYRRLLML 175
                 I  K    +DA  A         ++ F    ++ DG+     P+ E  YRR+ +L
Sbjct: 1278 LPLKDIDEKVQNETDAAAAATIPLPNNTQNNFQVIGSTQDGSFFNVFPINEAAYRRMYIL 1337

Query: 176  QNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI 235
            Q  ++    H  GLNPR  R    K       ++ I+D  L+ +F +LS   +     K+
Sbjct: 1338 QQQLIDKEFHYCGLNPRLNRIGSIKLQNNETNTKPILDYDLIRRFTKLSDDRKRNFANKV 1397

Query: 236  GSK 238
              K
Sbjct: 1398 SGK 1400


>gi|68471006|ref|XP_720510.1| likely Cleavage and Polyadenylation Specificity Factor subunit
            [Candida albicans SC5314]
 gi|74591422|sp|Q5AFT3.1|CFT1_CANAL RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two protein
            1
 gi|46442380|gb|EAL01670.1| likely Cleavage and Polyadenylation Specificity Factor subunit
            [Candida albicans SC5314]
          Length = 1420

 Score = 61.2 bits (147), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 45/184 (24%), Positives = 76/184 (41%), Gaps = 15/184 (8%)

Query: 70   GVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFK 129
             V   +F+I    +  +++D +  + L  Y P+  +S  G +L+ K  F L   ++    
Sbjct: 1217 SVECADFIINDDEIFVLVADNNNVLHLLNYDPDDPQSINGTKLLTKASFELNSTISCLRS 1276

Query: 130  ---IRCKPSPISDA-----------PGARSRFLTWYASL-DGALGFFLPLPEKNYRRLLM 174
               I  + S  +DA           P   S +     S  DG+     P+ E  YRR+ +
Sbjct: 1277 LPLIDIEESVQTDALTNIAVPPPLPPNTTSNYFQVIGSTQDGSFFNVFPINEAAYRRMYI 1336

Query: 175  LQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKK 234
            LQ  ++    H  GLNPR  R    K       ++ I+D  L+  F +LS   +  +  K
Sbjct: 1337 LQQQLIDKEFHYCGLNPRLNRIGSIKLQNNETNTKPILDYDLIRSFTKLSDDRKRNLANK 1396

Query: 235  IGSK 238
            +  K
Sbjct: 1397 VSGK 1400


>gi|400600376|gb|EJP68050.1| CPSF A subunit region [Beauveria bassiana ARSEF 2860]
          Length = 1174

 Score = 61.2 bits (147), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 37/159 (23%), Positives = 77/159 (48%), Gaps = 8/159 (5%)

Query: 85   FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSP-ISDAPGA 143
            ++ +D   N+++    P+A   +   RL   ++ +LG+ +N   ++   P+  +  +P A
Sbjct: 1012 WLETDAQGNIIVLARNPDAPTEHDRSRLEITSEMNLGEQINKIQRLNVAPADNVVVSPKA 1071

Query: 144  RSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYY 203
                  + AS++G L  +  +  K    L+ LQ  +  +   TGG++  A+R ++ +   
Sbjct: 1072 ------FLASIEGTLYLYGDIAPKYQDLLITLQTTIEKYVKTTGGISFDAWRAFRNQARE 1125

Query: 204  AGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
            A  P R  +DG +V +FL L    +  +C+ +G    D+
Sbjct: 1126 ADGPFR-FVDGEMVERFLDLRKQTQAALCQDLGLNVEDV 1163


>gi|403218521|emb|CCK73011.1| hypothetical protein KNAG_0M01580 [Kazachstania naganishii CBS 8797]
          Length = 1345

 Score = 61.2 bits (147), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 44/181 (24%), Positives = 87/181 (48%), Gaps = 5/181 (2%)

Query: 73   ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC 132
            + EF++    + F + D++  + +  Y P+   S  G RLI  + F++     +  ++  
Sbjct: 1158 SLEFLVNGGDLYFTLIDRNDILHVLKYAPDEPNSLSGQRLIHCSSFNM-YSTTSCTRLIP 1216

Query: 133  KPSPISDAP--GARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 190
            K     D P   A   +       DG+L   +P+PE  YRRL ++Q  ++   +   G+N
Sbjct: 1217 KNELFVDGPLNPAIQSYQVIGGQADGSLFKVMPVPETVYRRLYVVQQQIIDKETPLAGIN 1276

Query: 191  PRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK-HNDILDELYDI 249
            P+  R      Y   +  R ++D ++V +F  +S+ +R  +  K+G + H DI  ++ ++
Sbjct: 1277 PKMER-LSNDYYQTSHLLRPMLDYNVVKQFCAMSIPKRTTLAHKLGKRAHFDIWRDVINL 1335

Query: 250  E 250
            E
Sbjct: 1336 E 1336


>gi|402085944|gb|EJT80842.1| cft-1 [Gaeumannomyces graminis var. tritici R3-111a-1]
          Length = 1450

 Score = 60.5 bits (145), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 45/179 (25%), Positives = 80/179 (44%), Gaps = 13/179 (7%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQH------- 123
            V   +F+ +  ++  + +D + N+ +F + PE  +S  GH L+ +T F  G H       
Sbjct: 1249 VINVDFLPDGRNLLLVAADAEGNLHIFQFDPEHPKSLQGHLLLNRTTFSTGAHHPQKSLL 1308

Query: 124  VNTFFKIRCKPSPISDAPGARSRFL-TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
            + T      +P+   DA  A +       A+  G L    PL +  Y RL  L + +   
Sbjct: 1309 MPTTSSNPSQPATNGDASAAAAGPQHILMAAPTGVLAAVQPLGQGVYTRLSALASNLAAS 1368

Query: 183  TSHTGGLNPRAFRT--YKGKGYYAG---NPSRGIIDGSLVWKFLQLSLGERLEICKKIG 236
              H   LNP+A+R      +   A    +  R ++DG+L+ ++ +L  G R E+  + G
Sbjct: 1369 VPHHAALNPKAYRMPPAPARNQVAAVDISVGRAVVDGALLARWAELGSGRRAEVAGRAG 1427


>gi|68471460|ref|XP_720278.1| likely Cleavage and Polyadenylation Specificity Factor subunit
           fragment [Candida albicans SC5314]
 gi|46442138|gb|EAL01430.1| likely Cleavage and Polyadenylation Specificity Factor subunit
           fragment [Candida albicans SC5314]
          Length = 758

 Score = 60.5 bits (145), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 45/184 (24%), Positives = 76/184 (41%), Gaps = 15/184 (8%)

Query: 70  GVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFK 129
            V   +F+I    +  +++D +  + L  Y P+  +S  G +L+ K  F L   ++    
Sbjct: 555 SVECADFIINDDEIFVLVADNNNVLHLLNYDPDDPQSINGTKLLTKASFELNSTISCLRS 614

Query: 130 ---IRCKPSPISDA-----------PGARSRFLTWYASL-DGALGFFLPLPEKNYRRLLM 174
              I  + S  +DA           P   S +     S  DG+     P+ E  YRR+ +
Sbjct: 615 LPLIDIEESVQTDALTNIAVPPPLPPNTTSNYFQVIGSTQDGSFFNVFPINEAAYRRMYI 674

Query: 175 LQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKK 234
           LQ  ++    H  GLNPR  R    K       ++ I+D  L+  F +LS   +  +  K
Sbjct: 675 LQQQLIDKEFHYCGLNPRLNRIGSIKLQNNETNTKPILDYDLIRSFTKLSDDRKRNLANK 734

Query: 235 IGSK 238
           +  K
Sbjct: 735 VSGK 738


>gi|149237256|ref|XP_001524505.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
            YB-4239]
 gi|146452040|gb|EDK46296.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
            YB-4239]
          Length = 1380

 Score = 60.1 bits (144), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 44/170 (25%), Positives = 77/170 (45%), Gaps = 12/170 (7%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
            V   +F+I+   +  +++D +  + L  Y P+  +S  G  L+ K  F L         +
Sbjct: 1201 VECADFIIKDEDIFILVADNNNCIHLIQYDPDDPKSINGTILLNKASFELNSATTC---L 1257

Query: 131  RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 190
            R  P       G +  +    ++LDGAL    P+ E  YRR+ +LQ  +     H  GLN
Sbjct: 1258 RSIPK------GEKGDYQIIGSTLDGALYNVFPVNEFTYRRMYILQQQISDKVYHFCGLN 1311

Query: 191  PRAFRTYKGKGYYAG--NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 238
            PR  R + G          ++ I+D  L+ +F +L+L  + ++  KI  +
Sbjct: 1312 PRLNR-FGGSVTLRDRETNTKPILDYGLIRRFSKLNLDRQQQLAGKISVR 1360


>gi|354547787|emb|CCE44522.1| hypothetical protein CPAR2_403250 [Candida parapsilosis]
          Length = 1334

 Score = 60.1 bits (144), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 48/185 (25%), Positives = 85/185 (45%), Gaps = 16/185 (8%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
            V   +F+++   +  +++D +  + L  + P+  +S  G  L+ K  F            
Sbjct: 1155 VECADFIVKDEEIFILVADINNGLHLIQFDPDDPKSINGTILVNKASFETNSQTT----- 1209

Query: 131  RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 190
             C  S   D  G    + T  +++DGA     P+ E  YRR+ ++Q  +     H  GLN
Sbjct: 1210 -CLRSVPKDEAG---DYQTIGSTIDGAFFNVFPVNESTYRRMYIVQQQISDKEFHHCGLN 1265

Query: 191  PRAFRTYKGKGYY--AGNPSRGIIDGSLVWKFLQLSLGERLEICKKI---GSKHNDILDE 245
            PR  R + G      +   ++ I+D +L+ +F +L+L  +  I  KI   GS H DI  +
Sbjct: 1266 PRLNR-FGGAIQIRDSDTNAKPILDYNLIREFAKLNLDRQRNIATKINIKGSAH-DIWKD 1323

Query: 246  LYDIE 250
            L ++E
Sbjct: 1324 LIELE 1328


>gi|121925707|sp|Q0UUE2.1|CFT1_PHANO RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two protein
            1
          Length = 1375

 Score = 60.1 bits (144), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 45/170 (26%), Positives = 77/170 (45%), Gaps = 7/170 (4%)

Query: 74   CEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNT--FFKIR 131
             +F+     +  +++D D N+ +  + P+  +S GG RL++K+ FH G   +T    + R
Sbjct: 1205 ADFLPFEEQLHIIVADADMNLQVLQFDPDHPKSMGGTRLLQKSTFHTGHFPSTMHLLQSR 1264

Query: 132  CKPSPISDAPGARSRFLTWY----ASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 187
                  S+   + +  L  +     S  G L    PL E +YRRL  L   +        
Sbjct: 1265 LHMPTASEFTTSTTSSLPLHQILCTSQSGTLALITPLSESSYRRLSGLATHLQQFLDSPC 1324

Query: 188  GLNPRAFRTYK-GKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 236
            GLN +AFR     +G +     R ++DG L+ ++ +L    R E   K+G
Sbjct: 1325 GLNGKAFRAADVMEGGWDAGTQRAMLDGGLLMRWGELGEQRRREGLGKVG 1374


>gi|169603229|ref|XP_001795036.1| hypothetical protein SNOG_04622 [Phaeosphaeria nodorum SN15]
 gi|160706354|gb|EAT88382.2| hypothetical protein SNOG_04622 [Phaeosphaeria nodorum SN15]
          Length = 1338

 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 45/170 (26%), Positives = 77/170 (45%), Gaps = 7/170 (4%)

Query: 74   CEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNT--FFKIR 131
             +F+     +  +++D D N+ +  + P+  +S GG RL++K+ FH G   +T    + R
Sbjct: 1168 ADFLPFEEQLHIIVADADMNLQVLQFDPDHPKSMGGTRLLQKSTFHTGHFPSTMHLLQSR 1227

Query: 132  CKPSPISDAPGARSRFLTWY----ASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 187
                  S+   + +  L  +     S  G L    PL E +YRRL  L   +        
Sbjct: 1228 LHMPTASEFTTSTTSSLPLHQILCTSQSGTLALITPLSESSYRRLSGLATHLQQFLDSPC 1287

Query: 188  GLNPRAFRTYK-GKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 236
            GLN +AFR     +G +     R ++DG L+ ++ +L    R E   K+G
Sbjct: 1288 GLNGKAFRAADVMEGGWDAGTQRAMLDGGLLMRWGELGEQRRREGLGKVG 1337


>gi|190348091|gb|EDK40482.2| hypothetical protein PGUG_04580 [Meyerozyma guilliermondii ATCC 6260]
          Length = 1320

 Score = 59.7 bits (143), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 42/185 (22%), Positives = 84/185 (45%), Gaps = 7/185 (3%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
            V   EF+ +   +  +I+  +  + L  + PE   S+ G RL+ +  F++         +
Sbjct: 1136 VSCAEFISKDEEIYILIAGNNNVMHLVQFDPEDPTSSNGQRLVHRASFNVSSSTTC---M 1192

Query: 131  RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 190
            R  P            F T  +++DG+     P+ E  YRR+ ++Q  +     H  GLN
Sbjct: 1193 RMVPKNEEINTQYSDVFQTVGSTIDGSFFTVFPVNEFTYRRMYIIQQQLTDKEYHYCGLN 1252

Query: 191  PRAFRTYKGKGYYAGNPS-RGIIDGSLVWKFLQLSLGERLEICKKIGSK--HNDILDELY 247
            PR  R + G+ +       + I+D  ++ ++ +L+   +  I +K+ SK  + +I  +L 
Sbjct: 1253 PRLNR-FGGEAFDDSQTGVKPILDHQVIKRYAKLNEDRKQTIAQKVSSKGVYQEIWKDLI 1311

Query: 248  DIEAL 252
            + E +
Sbjct: 1312 EFENV 1316


>gi|146415762|ref|XP_001483851.1| hypothetical protein PGUG_04580 [Meyerozyma guilliermondii ATCC 6260]
          Length = 1320

 Score = 59.7 bits (143), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 42/185 (22%), Positives = 84/185 (45%), Gaps = 7/185 (3%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
            V   EF+ +   +  +I+  +  + L  + PE   S+ G RL+ +  F++         +
Sbjct: 1136 VSCAEFISKDEEIYILIAGNNNVMHLVQFDPEDPTSSNGQRLVHRASFNVSSSTTC---M 1192

Query: 131  RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 190
            R  P            F T  +++DG+     P+ E  YRR+ ++Q  +     H  GLN
Sbjct: 1193 RMVPKNEEINTQYSDVFQTVGSTIDGSFFTVFPVNEFTYRRMYIIQQQLTDKEYHYCGLN 1252

Query: 191  PRAFRTYKGKGYYAGNPS-RGIIDGSLVWKFLQLSLGERLEICKKIGSK--HNDILDELY 247
            PR  R + G+ +       + I+D  ++ ++ +L+   +  I +K+ SK  + +I  +L 
Sbjct: 1253 PRLNR-FGGEAFDDSQTGVKPILDHQVIKRYAKLNEDRKQTIAQKVSSKGVYQEIWKDLI 1311

Query: 248  DIEAL 252
            + E +
Sbjct: 1312 EFENV 1316


>gi|452825139|gb|EME32137.1| cleavage and polyadenylation specificity factor subunit-like protein
            [Galdieria sulphuraria]
          Length = 1454

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 47/156 (30%), Positives = 78/156 (50%), Gaps = 12/156 (7%)

Query: 52   LFVSDRSKRAN--EQPGLPRGVYACEFVIESSSMGFMISDKDKNVVLFMYQ-PEARESNG 108
            LF  DR+K  N   +   P    + EF+I +  + F++SD   N+ L  Y  P   ES G
Sbjct: 1240 LFWRDRNKSLNFLGKDFDPVQTLSTEFLILNEFILFVVSDNFGNLHLLEYAGPHEIESRG 1299

Query: 109  GHRLIKKTDFHLGQHVNTFFKIRC--KPSPISDAPGARSRFL-TWYASLDGALGFFLPLP 165
            G +L+++   HLG   ++  ++R   K +   D  G+    L TW    DG L   LPL 
Sbjct: 1300 GEKLLRRGVLHLGTRSSSMIRLRTDWKENNSEDRAGSHIVVLGTW----DGGLACLLPLQ 1355

Query: 166  EKNY--RRLLMLQNVMVTHTSHTGGLNPRAFRTYKG 199
            ++ Y  +  L+ +  + +++ +  GLNP+ FR  +G
Sbjct: 1356 QEEYEQKNELLKKVYLHSYSLYVAGLNPQEFRIPRG 1391


>gi|242208344|ref|XP_002470023.1| predicted protein [Postia placenta Mad-698-R]
 gi|220730923|gb|EED84773.1| predicted protein [Postia placenta Mad-698-R]
          Length = 696

 Score = 58.5 bits (140), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 35/113 (30%), Positives = 54/113 (47%), Gaps = 4/113 (3%)

Query: 85  FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGAR 144
            +I D+D  + ++ Y P   ES GG  L+++T+FH          I  +    +D P AR
Sbjct: 572 LLIGDEDGVIRIYEYDPHDPESRGGQHLLRRTEFHGQMESRMSILIIRRRGKDTDIPQAR 631

Query: 145 SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTY 197
                   S +G+L  F  + E   +RL +LQ  +  +  H  GLNP+ FR Y
Sbjct: 632 ----LISGSTNGSLSMFTYVDEVASKRLHLLQGQLTRNVQHVVGLNPKVFRPY 680


>gi|255956643|ref|XP_002569074.1| Pc21g20880 [Penicillium chrysogenum Wisconsin 54-1255]
 gi|211590785|emb|CAP96985.1| Pc21g20880 [Penicillium chrysogenum Wisconsin 54-1255]
          Length = 1140

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 44/168 (26%), Positives = 81/168 (48%), Gaps = 8/168 (4%)

Query: 85   FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGAR 144
            F+ SD + N+++         +   HRL+  ++  LG+ VN     R +P  I       
Sbjct: 977  FLQSDAEGNLIVLSRNMNGVTAQDKHRLMPTSEISLGEMVN-----RIRPVNIPQLSSVM 1031

Query: 145  SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYA 204
                 + A+++G++  F  +  ++   L+ LQ  + T  +  G L+   FR+++     A
Sbjct: 1032 VTPRAFMATVEGSIFLFAVINPEHQDFLMTLQASLSTKINSLGNLSFDKFRSFRTMVRSA 1091

Query: 205  GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEAL 252
              P R  +DG L+ +FL  S   + EI ++IGS  +D+++    IEAL
Sbjct: 1092 EAPYR-FVDGELIEQFLNCSPSMQEEIVQEIGS--SDVVEVKRMIEAL 1136


>gi|426235955|ref|XP_004011942.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1 [Ovis aries]
          Length = 819

 Score = 57.8 bits (138), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 29/70 (41%), Positives = 43/70 (61%)

Query: 187 GGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEL 246
            GLNPRAFR          N  R ++DG L+ ++L LS  ER E+ KKIG+  + ILD+L
Sbjct: 750 AGLNPRAFRMLHVDRRVLQNAVRNVLDGELLNRYLYLSTMERGELAKKIGTTPDIILDDL 809

Query: 247 YDIEALSSHF 256
            + + +++HF
Sbjct: 810 LETDRVTAHF 819


>gi|255720869|ref|XP_002545369.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
 gi|240135858|gb|EER35411.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
          Length = 1351

 Score = 57.8 bits (138), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 42/171 (24%), Positives = 73/171 (42%), Gaps = 14/171 (8%)

Query: 70   GVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFK 129
             V   +F+I+   +  +++D +  + L  Y P+  +S  G +L+ K  F L   ++    
Sbjct: 1173 SVECADFIIKDDEVYILVADNNNVLHLLNYDPDDPQSINGTKLLTKASFELASPISCL-- 1230

Query: 130  IRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGL 189
               +  PI D     + F    +  DG+     P+ E  YRR+ +LQ  +     H  GL
Sbjct: 1231 ---RTLPIDD-----NNFQIIGSCQDGSFFNVFPINESTYRRMYILQQQLTEKEYHYCGL 1282

Query: 190  NPRAFRTYKGKGYYAGNPS--RGIIDGSLVWKFLQLSLGERLEICKKIGSK 238
            NPR  R   G      N +  + I+D  L+  F +L+   +     K+  K
Sbjct: 1283 NPRLNRV--GNLALKDNDTNIKPILDYGLIRIFAKLNTDRKKAFANKVSGK 1331


>gi|428164905|gb|EKX33915.1| hypothetical protein GUITHDRAFT_158867 [Guillardia theta CCMP2712]
          Length = 1092

 Score = 55.5 bits (132), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 35/131 (26%), Positives = 60/131 (45%), Gaps = 12/131 (9%)

Query: 111  RLIKKTDFHLGQHVNTFFK----IRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPE 166
            RL    ++HLG  VN F +    +R   +P +D           + +++G +G    L +
Sbjct: 953  RLEVVGEYHLGDMVNRFQRGSLVLRSSDTPTTDT--------IIFGTVNGMIGVIAVLSK 1004

Query: 167  KNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLG 226
            + Y  LL +Q+ +       GGL    +R+++ +        +G IDG L+  FL L   
Sbjct: 1005 EEYEFLLKVQDALNFVIKGVGGLRHEDWRSFENERTQGARAPKGFIDGDLIESFLDLRRE 1064

Query: 227  ERLEICKKIGS 237
            +  E+C  IGS
Sbjct: 1065 KMEEVCHAIGS 1075


>gi|300122534|emb|CBK23104.2| unnamed protein product [Blastocystis hominis]
          Length = 172

 Score = 55.1 bits (131), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 35/141 (24%), Positives = 65/141 (46%), Gaps = 10/141 (7%)

Query: 112 LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRR 171
           ++++ DFHL   + +         PIS   G   + +    + +GA+G FL +  + Y +
Sbjct: 28  VVRQADFHLASQITSIL-------PISLPDG---QCINVILTAEGAMGVFLFVTGEEYTK 77

Query: 172 LLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEI 231
           L  LQ  ++        LN   FR Y   G       +G++D  ++ K+L LS  E+ +I
Sbjct: 78  LSSLQKRLIEALPQNAALNNFNFRKYMSDGMMKYPRRKGVLDMGVIRKYLMLSTQEQEDI 137

Query: 232 CKKIGSKHNDILDELYDIEAL 252
            K +  +  +I + +Y  + L
Sbjct: 138 AKSLDLETKEITEVIYRTDKL 158


>gi|425777692|gb|EKV15851.1| UV-damaged DNA binding protein, putative [Penicillium digitatum Pd1]
 gi|425779888|gb|EKV17916.1| UV-damaged DNA binding protein, putative [Penicillium digitatum
            PHI26]
          Length = 1140

 Score = 53.9 bits (128), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 42/168 (25%), Positives = 79/168 (47%), Gaps = 8/168 (4%)

Query: 85   FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGAR 144
            F+ SD + N+++         +   HRLI  ++  LG+ VN     R +P  I       
Sbjct: 977  FLQSDAEGNLIVLSRNTNGVTAQDKHRLIPTSEISLGEMVN-----RIRPVHIPQLCSVM 1031

Query: 145  SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYA 204
                 + A+++G++  F  +  ++   L+ LQ  +    +  G L+   FR ++     A
Sbjct: 1032 VTPRAFMATVEGSIFLFAVINPEHQDFLMTLQAALSQKLNSLGNLSFDKFRGFRTMVRSA 1091

Query: 205  GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEAL 252
              P R  +DG L+ +FL+ +   + EI +++GS  +D+ +    IEAL
Sbjct: 1092 AAPYR-FVDGELIEQFLKCTPSMQEEIAQEVGS--SDVGEVKRLIEAL 1136


>gi|449019486|dbj|BAM82888.1| similar to cleavage and polyadenylation specificity factor subunit
            [Cyanidioschyzon merolae strain 10D]
          Length = 1880

 Score = 53.1 bits (126), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 45/188 (23%), Positives = 83/188 (44%), Gaps = 9/188 (4%)

Query: 69   RGVYACEFVIESSSMGFMISDKDKNVV-LFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            R + A E+++  +    + +D   N++ L +  P   ES GG R+  +  FH+G   N  
Sbjct: 1683 RELVASEWLMHGTKANVLSADMLGNIIELSIPSPVDPESAGGTRMTFEAGFHVGSRPNAV 1742

Query: 128  FKIRCK-PSPISDAPGARSRFLTWY----ASLDGALGFFLPLPEKNYRRL-LMLQNVMVT 181
             ++R   PS  +  P   S     +     ++DG +    PL     ++L L  Q++M+ 
Sbjct: 1743 RRVRIDDPSAETPPPNEPSSLWNTHVILLGTVDGMITMVSPLLRGVAKKLELAAQDLMLE 1802

Query: 182  HTSHTGGLNPRAFRTYKGKGYYAG--NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKH 239
                   L  R++R  +     AG   P R I+DG ++  +  L    R EI ++IG   
Sbjct: 1803 PELRKWCLYARSWRVMRSLTVAAGLRKPKRSILDGDVLQLYGSLDTPRRKEIARRIGMPQ 1862

Query: 240  NDILDELY 247
              + + ++
Sbjct: 1863 EALFEAIF 1870


>gi|385304556|gb|EIF48568.1| rna-binding subunit of the mrna cleavage and polyadenylation factor
           [Dekkera bruxellensis AWRI1499]
          Length = 289

 Score = 53.1 bits (126), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 35/126 (27%), Positives = 56/126 (44%), Gaps = 6/126 (4%)

Query: 73  ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC 132
           AC+F++    +  + +D+D  + L  Y P   ES  G +L+ ++ F    +         
Sbjct: 145 ACDFIVXEGGLYIIAADEDSVLHLLEYDPYDPESMKGXKLLTRSVFRFNGYTTAMRLCDR 204

Query: 133 KPS------PISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHT 186
           K S       ++  PGA   F     +++G+     P  E  YRRL  LQN +    SH 
Sbjct: 205 KNSIFSMLDTLAIPPGADLGFEVIGCNIEGSFYKVTPANEYTYRRLYALQNHISDKESHW 264

Query: 187 GGLNPR 192
            GLNP+
Sbjct: 265 LGLNPK 270


>gi|241952575|ref|XP_002419009.1| pre-mRNA-splicing factor, putative; pre-spliceosome component,
            putative [Candida dubliniensis CD36]
 gi|223642349|emb|CAX42591.1| pre-mRNA-splicing factor, putative [Candida dubliniensis CD36]
          Length = 1187

 Score = 52.8 bits (125), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 41/141 (29%), Positives = 66/141 (46%), Gaps = 17/141 (12%)

Query: 110  HRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNY 169
            ++L    +FH+G  + T   + C      +  G  S     Y  L G +G  +PL  K+ 
Sbjct: 1057 YKLQNLIEFHIGDII-TSLNLGCL-----NLAGTESVI---YTGLQGTIGLLVPLVSKSE 1107

Query: 170  RRLLM-LQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGER 228
              LL  LQ +M    ++  G +   FR+Y        NP + +IDG L+ +FL+     R
Sbjct: 1108 VELLFNLQLLMQQFQNNLVGKDHLKFRSYY-------NPIKNVIDGDLLERFLEFDTSLR 1160

Query: 229  LEICKKIGSKHNDILDELYDI 249
            +EI +K+    NDI  +L D+
Sbjct: 1161 IEISRKLNKSVNDIEKKLIDL 1181


>gi|71021721|ref|XP_761091.1| hypothetical protein UM04944.1 [Ustilago maydis 521]
 gi|46100541|gb|EAK85774.1| hypothetical protein UM04944.1 [Ustilago maydis 521]
          Length = 1597

 Score = 52.8 bits (125), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 42/161 (26%), Positives = 77/161 (47%), Gaps = 6/161 (3%)

Query: 78   IESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPI 137
            I  S +G   S +    V+ +Y+     + GG +L+ +++F          ++ C    +
Sbjct: 1368 ISGSRLGGDASMEASAGVIRLYEYAPHLAVGGKKLVLRSEFQTTSEA--VARVECSGRWL 1425

Query: 138  SDAP--GARS-RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAF 194
            SD+   G  + R    +A  +G++     + EK  +RL +LQ  +V    HT  LNPR+F
Sbjct: 1426 SDSELRGRETLRNKVVFAKANGSVESVAAVDEKVGKRLHLLQGQLVRSVMHTAALNPRSF 1485

Query: 195  RTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI 235
            R  +   Y      +G++D  L+ +F++LS  + LE  K +
Sbjct: 1486 RMVRND-YVPRALVKGVLDARLLDEFMRLSRPKMLEAVKTL 1525


>gi|298711490|emb|CBJ26578.1| n/a [Ectocarpus siliculosus]
          Length = 1135

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 39/163 (23%), Positives = 71/163 (43%), Gaps = 8/163 (4%)

Query: 85   FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFK--IRCKPSPISDAPG 142
            ++  + D N+       +A       RL  + +FHLG+ VN F +  +  + S ++   G
Sbjct: 956  YIGGEADCNIFTLRRNADAATEEERARLEIQGEFHLGEFVNKFCRGSLLMQSSEVNSPGG 1015

Query: 143  ARSRFLT----WYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYK 198
              S  +      + +++G +G  L L E N+R L  LQ  M       GG +   +R++ 
Sbjct: 1016 MDSPLVKGQPLLFGTVNGMVGTILTLTEDNHRFLAQLQTAMTKVVKGVGGFSHDEWRSFT 1075

Query: 199  GKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHND 241
                 +  PS   IDG LV  +L +    + E+ + + +   D
Sbjct: 1076 NGRRTS--PSSNFIDGDLVESYLDMPRHNQEEVLRHVDTPVGD 1116


>gi|384250802|gb|EIE24281.1| hypothetical protein COCSUDRAFT_28729 [Coccomyxa subellipsoidea
            C-169]
          Length = 1101

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 46/213 (21%), Positives = 94/213 (44%), Gaps = 35/213 (16%)

Query: 28   ELLIYQAFRHPKGTLKLRFKKLKVLFVSDRSKRANEQPGLPRGVYACEFVIESSSMGFMI 87
            +LLIY+A    +G L++R +  K  +++                 A E + + + +G   
Sbjct: 900  QLLIYRA---DEGILEVRARDYKTHWMT-----------------AVEVLDDDTYLG--- 936

Query: 88   SDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF----FKIRCKPSPISDAPGA 143
            ++   N+       +A      +RL     +HLG  VN F      ++   S  +  P  
Sbjct: 937  AENSNNIFTLRKNTDAAADEDRNRLETVGQYHLGVFVNRFRHGSLVMKLPDSEAAKIP-- 994

Query: 144  RSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYY 203
                   + +++G++G    LP++ ++ L  LQ+ +       GGL+  A+RT++ +  +
Sbjct: 995  ----TVLFVTINGSIGVIASLPQQQFQFLSRLQDCLRKVIKGVGGLSHVAWRTFQDE--H 1048

Query: 204  AGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 236
               PS+  +DG L+ +FL L       + +++G
Sbjct: 1049 TKMPSQNFVDGDLIEQFLDLKRDSMERVAREMG 1081


>gi|68476233|ref|XP_717766.1| potential spliceosomal U2 snRNP complex SF3b component [Candida
            albicans SC5314]
 gi|68476422|ref|XP_717672.1| potential spliceosomal U2 snRNP complex SF3b component [Candida
            albicans SC5314]
 gi|74586274|sp|Q5A7S5.1|RSE1_CANAL RecName: Full=Pre-mRNA-splicing factor RSE1
 gi|46439394|gb|EAK98712.1| potential spliceosomal U2 snRNP complex SF3b component [Candida
            albicans SC5314]
 gi|46439495|gb|EAK98812.1| potential spliceosomal U2 snRNP complex SF3b component [Candida
            albicans SC5314]
          Length = 1219

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 40/141 (28%), Positives = 66/141 (46%), Gaps = 17/141 (12%)

Query: 110  HRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNY 169
            ++L    +FH+G  + T F + C      +  G  S     Y  L G +G  +PL  K+ 
Sbjct: 1089 YKLQNLIEFHIGDII-TSFNLGCL-----NLAGTESVI---YTGLQGTIGLLIPLVSKSE 1139

Query: 170  RRLLM-LQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGER 228
              LL  LQ  M    ++  G +    R+Y        NP + +IDG L+ +FL+  +  +
Sbjct: 1140 VELLFNLQLYMQQSQNNLVGKDHLKLRSYY-------NPIKNVIDGDLLERFLEFDISLK 1192

Query: 229  LEICKKIGSKHNDILDELYDI 249
            +EI +K+    NDI  +L D+
Sbjct: 1193 IEISRKLNKSVNDIEKKLIDL 1213


>gi|302831157|ref|XP_002947144.1| hypothetical protein VOLCADRAFT_87503 [Volvox carteri f. nagariensis]
 gi|300267551|gb|EFJ51734.1| hypothetical protein VOLCADRAFT_87503 [Volvox carteri f. nagariensis]
          Length = 2830

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 39/132 (29%), Positives = 59/132 (44%), Gaps = 12/132 (9%)

Query: 69   RGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESN----GGHRLIKKTDFHLGQHV 124
            R V     VI    + F+ +D   N+ L  +   +R +N     G RL      H+ + +
Sbjct: 2634 RDVLTAGVVINEPKLAFLAADAAGNLALSEFY-GSRNTNPEFWAGQRLAPLGLMHVARRL 2692

Query: 125  NTFFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNY-RRLLMLQNVMVTHT 183
            +    I+    P SD    ++R      + +G L +  P+P+    +RLL LQN M    
Sbjct: 2693 SCCVSIKM---PTSDG---KNRHALLCGAAEGGLSYIAPVPDAEMTQRLLALQNHMSRRL 2746

Query: 184  SHTGGLNPRAFR 195
             H  GLNPRAFR
Sbjct: 2747 PHVAGLNPRAFR 2758


>gi|343425828|emb|CBQ69361.1| related to cleavage and polyadenylation specificity factor, 160 kDa
            subunit [Sporisorium reilianum SRZ2]
          Length = 1567

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 42/152 (27%), Positives = 71/152 (46%), Gaps = 13/152 (8%)

Query: 90   KDKNVV-LFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAP-----GA 143
            +D  VV L+ Y P    +N   RL+ +T+F           + C+   +SD+        
Sbjct: 1354 RDAGVVRLYEYAPHVASANT--RLVLRTEFQTASPATA--SVACRGRWLSDSELRGREHG 1409

Query: 144  RSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYY 203
            R++ +   A  +GAL       ++  +RL +LQ  +V    HT  LNPRAFR  +   + 
Sbjct: 1410 RNKLV--LAKANGALETLAAADDRVAKRLHVLQGQLVRSVLHTAALNPRAFRAVRND-FV 1466

Query: 204  AGNPSRGIIDGSLVWKFLQLSLGERLEICKKI 235
            +    +G++D  L+  F+ LS  + LE  K +
Sbjct: 1467 SRALGKGVLDARLLDSFVYLSRPKMLEAVKTL 1498


>gi|255080490|ref|XP_002503825.1| predicted protein [Micromonas sp. RCC299]
 gi|226519092|gb|ACO65083.1| predicted protein [Micromonas sp. RCC299]
          Length = 1114

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 38/174 (21%), Positives = 80/174 (45%), Gaps = 11/174 (6%)

Query: 85   FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFK----IRCKPSPISDA 140
            ++ ++ + N+     Q +A       RL    ++H+G+ VN F +    +R      +D 
Sbjct: 946  YLGAENNFNLFTLRRQSDAATDEERSRLEVVGEYHVGEFVNRFRRGSLVMRLPDQENADV 1005

Query: 141  PGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGK 200
            P         + ++ G +G    LP + +  L  LQ  +    S  GGL+  A+R+++ +
Sbjct: 1006 P------TLLFGTVSGVIGVLATLPREQFEFLSALQAALNKTVSGVGGLSHDAWRSFQNE 1059

Query: 201  G-YYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALS 253
              + A + +RG +DG L+  FL L   +  E+   +    +++   + D++ L+
Sbjct: 1060 HRHRAKDGARGFVDGDLIESFLDLRPEKAREVAAAVKLSVDELTRRVEDLQRLT 1113


>gi|348681092|gb|EGZ20908.1| hypothetical protein PHYSODRAFT_259403 [Phytophthora sojae]
          Length = 1137

 Score = 52.0 bits (123), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 40/165 (24%), Positives = 72/165 (43%), Gaps = 15/165 (9%)

Query: 85   FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF-----FKIRCKPSPISD 139
            ++ S+ D N+        A       RL    +FHLG+ VN F           P+ + D
Sbjct: 950  YIGSETDFNLFTVQRNSGAASDEERGRLETVGEFHLGEFVNRFRYGSLTPAAAGPTDMVD 1009

Query: 140  A-------PGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPR 192
                    P A+++ +  + ++ G +G  LPL +  Y  LL +Q  +       GG + +
Sbjct: 1010 VVEQAPIVPAAQNQSM-LFGTVSGMIGVILPLTKDQYSFLLRVQQALTQVVKGVGGFSHK 1068

Query: 193  AFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGS 237
             +R ++ +   + + +R  IDG LV  FL L   +  ++  K+ S
Sbjct: 1069 DWRMFENR--RSVSEARNFIDGDLVESFLDLPKAQMTKVVDKLNS 1111


>gi|281208174|gb|EFA82352.1| UV-damaged DNA binding protein1 [Polysphondylium pallidum PN500]
          Length = 1054

 Score = 51.6 bits (122), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 44/172 (25%), Positives = 71/172 (41%), Gaps = 8/172 (4%)

Query: 85   FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGAR 144
            F+  D   N+++     EA        L     FHLG+ +N F       SP  D+P   
Sbjct: 890  FIGGDNSYNLIVVQKNAEASSEIDNELLDTVGQFHLGETINKFKHGSLVTSPDMDSPKLP 949

Query: 145  SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYA 204
            +     + ++ GA+G  + + + +Y     LQ  +       GGL    +R++  +  + 
Sbjct: 950  T---ILFGTVSGAIGVIVSISKDDYEFFEKLQKGLNRVVHGVGGLPFENWRSFSTE--HM 1004

Query: 205  GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 256
              PS+  IDG L+  FL L   + LE  K +      I D    IE+L  H 
Sbjct: 1005 TIPSKNFIDGDLIETFLDLRHDKMLEAIKDMNIS---IEDTYRRIESLMHHI 1053


>gi|384253955|gb|EIE27429.1| hypothetical protein COCSUDRAFT_64224 [Coccomyxa subellipsoidea
            C-169]
          Length = 1137

 Score = 51.6 bits (122), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 51/201 (25%), Positives = 84/201 (41%), Gaps = 22/201 (10%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQ--PEARESNGGHRLIKKTDFHLGQHVNTFF 128
            V A EF++    +  +  D  +N+ LF Y    E + +  G +L+     H+GQ++ +  
Sbjct: 944  VEAAEFLVNGKKLFLLACDAAQNLRLFAYDGGKEQQATWQGKKLLPLGAIHVGQNICSSL 1003

Query: 129  KIRCKPSPISDAPGARSRFLTWYASLDGALGFFLP----LPEKNYRRLLMLQNVMVTHTS 184
              R  P+    A G + R    + S  G++    P    LP +    LL LQ  MV    
Sbjct: 1004 SHRITPA---SATGVQLRAAV-FGSAAGSIASLAPTWDGLPAE---ELLALQREMVLAVP 1056

Query: 185  HTGGLNPRAF-RTYK--------GKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI 235
               GLNP +F R YK        G+ + A      ++D   + +F  L L E++ +  K 
Sbjct: 1057 QVAGLNPVSFRRRYKHGVKALAGGQSFEAPVSDDRVLDLDQLNRFQWLPLTEQVALAAKC 1116

Query: 236  GSKHNDILDELYDIEALSSHF 256
                  +L  L ++    S F
Sbjct: 1117 NLSRQQVLHALREMVMAISTF 1137


>gi|258572939|ref|XP_002540651.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
 gi|237900917|gb|EEP75318.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
          Length = 1144

 Score = 51.6 bits (122), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 41/167 (24%), Positives = 74/167 (44%), Gaps = 7/167 (4%)

Query: 84   GFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGA 143
             F++SD + N+V+          +   R+   ++  LG  VN   ++  + S    +P  
Sbjct: 976  AFLVSDAEGNLVVLNRNTTGVTEDDKRRMQITSELRLGTMVNRIRRMDLQAS--QSSPVI 1033

Query: 144  RSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYY 203
               FL   A+ DG++  F  + +     L+ LQ+ + +  +  GG+    +R +K     
Sbjct: 1034 PKAFL---ATTDGSIYLFGVIAQFAQDLLMRLQSALASFVASPGGIPFSGYRAFKSATRQ 1090

Query: 204  AGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI-LDELYDI 249
            A  P R  +DG LV +FL   L  +  +  K+     D+ L +L DI
Sbjct: 1091 ADEPFR-FVDGELVEQFLDCPLEVQEAVLAKMDGGGRDVTLSQLKDI 1136


>gi|412992547|emb|CCO18527.1| predicted protein [Bathycoccus prasinos]
          Length = 1275

 Score = 51.2 bits (121), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 38/140 (27%), Positives = 66/140 (47%), Gaps = 6/140 (4%)

Query: 118  FHLGQHVNTFFK--IRCKPSPISDAPGARSRFLTW-YASLDGALGFFLPLPEKNYRRLLM 174
            FHLG  VN F +  +  +   +SD   + S   TW + ++ G LG    LP++++  L  
Sbjct: 1137 FHLGDSVNRFRRGSLVMRMPDLSDDTSSLSEISTWLFGTISGGLGVVATLPKRDFMLLNK 1196

Query: 175  LQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKK 234
            +Q  M    +  G  +   FR++           R  IDG LV  FL LS  +++ + + 
Sbjct: 1197 VQEAMQKVVTGVGNFSHSDFRSFHNVQRSV--EMRNFIDGDLVEIFLDLSKEDQVAVSEL 1254

Query: 235  IG-SKHNDILDELYDIEALS 253
             G S   D++ ++ +I  L+
Sbjct: 1255 SGVSNSEDLVKKIEEISRLT 1274


>gi|223994993|ref|XP_002287180.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220976296|gb|EED94623.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 1517

 Score = 50.8 bits (120), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 38/155 (24%), Positives = 70/155 (45%), Gaps = 6/155 (3%)

Query: 74   CEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIR-C 132
            C  V   ++  ++ S+   N+ +  +   A       RL    ++HLG+  N F      
Sbjct: 1296 CTAVEMLTNGTYLGSEGFNNLFVLRHNANASSEEARVRLDTVGEYHLGEMTNKFMGGSLI 1355

Query: 133  KPSPISDAPGARSRFL---TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGL 189
             PS      GA++ ++   T + ++DG++G  L L    +  L  LQ  +++     G +
Sbjct: 1356 MPSNSGGIMGAQNAYVGSQTLFGTVDGSIGSVLGLDGPTFAFLACLQRAILSIVKTVGDI 1415

Query: 190  NPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLS 224
            +   +R ++ +      PSRG IDG L+  FL L+
Sbjct: 1416 SHEEYRAFRAERQV--RPSRGFIDGDLIETFLDLN 1448


>gi|322704830|gb|EFY96421.1| Cleavage factor two protein 1 [Metarhizium anisopliae ARSEF 23]
          Length = 1433

 Score = 50.4 bits (119), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 42/187 (22%), Positives = 82/187 (43%), Gaps = 9/187 (4%)

Query: 73   ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHV--NTFFKI 130
            A +F+ +   +  +  D + ++ +  + PE  +S  GH L+ +T F +  +   +T    
Sbjct: 1240 AADFLPDGEDLSMVAVDAEGDLHILEFNPEHPKSLQGHLLLHRTSFAVTPNTPSSTLLLP 1299

Query: 131  RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 190
            R        A  + S  +   A   G L    PL E  YRRLL + N +       GGL+
Sbjct: 1300 RTHSPSYPQASSSSSSHMLLLACPSGQLAALSPLAESTYRRLLSVTNQLHPAIVPHGGLH 1359

Query: 191  PRAFRTYKGKGYYAG-----NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDE 245
             +A R         G     +  R ++DG+++ ++ +L   +R ++  + G  ++ + D 
Sbjct: 1360 SKAHRYPDQSSVAVGVETAASSGRALVDGTVLARWSELGAAKRTDVALRGG--YDSVADL 1417

Query: 246  LYDIEAL 252
              D+E +
Sbjct: 1418 RDDLEGV 1424


>gi|307111604|gb|EFN59838.1| hypothetical protein CHLNCDRAFT_29381 [Chlorella variabilis]
          Length = 1108

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 35/143 (24%), Positives = 59/143 (41%), Gaps = 11/143 (7%)

Query: 85   FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF----FKIRCKPSPISDA 140
            +M ++   N+       +A       RL     +HLG+ VN F      +R   S +S  
Sbjct: 940  YMGAENSYNLFTVRRNADAATDEERSRLETVGRYHLGEFVNRFQPGSLVMRLPDSELSQI 999

Query: 141  PGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGK 200
            P         + +++G +G    LP   Y+ L  LQ  M       GG +   +R +  +
Sbjct: 1000 P------TVLFGTINGVIGVVASLPHAQYQLLESLQEAMRKVVKGVGGFDHAQWRAFSNQ 1053

Query: 201  GYYAGNPSRGIIDGSLVWKFLQL 223
             +    P+R  +DG L+ +FL L
Sbjct: 1054 -HMPATPARQFVDGDLIEQFLDL 1075


>gi|398391687|ref|XP_003849303.1| hypothetical protein MYCGRDRAFT_87400 [Zymoseptoria tritici IPO323]
 gi|339469180|gb|EGP84279.1| hypothetical protein MYCGRDRAFT_87400 [Zymoseptoria tritici IPO323]
          Length = 1143

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 40/154 (25%), Positives = 71/154 (46%), Gaps = 8/154 (5%)

Query: 85   FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGAR 144
            ++ SD + N+V+  +  +        RL   ++  LG+ VN   +I   P+P  +A    
Sbjct: 979  YLESDAEGNLVVLEHDVKGFSEEDRRRLRVTSEILLGEMVNRIRRIDVSPTP--NATVIP 1036

Query: 145  SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYA 204
              FL   A+++G++  F  + E     L+ +QN M       G +    FR +K +    
Sbjct: 1037 RAFL---ATVEGSIYLFALIAEGKQDLLIRMQNKMAEMVQSPGHVPFAKFRGFKTQVRDM 1093

Query: 205  GN--PSRGIIDGSLVWKFLQLSLGERLEICKKIG 236
            G   PSR  +DG L+ +FL      + E+ K++G
Sbjct: 1094 GEEGPSR-FVDGELIERFLDCDEDVQAEVAKELG 1126


>gi|406865227|gb|EKD18269.1| CPSF A subunit region [Marssonina brunnea f. sp. 'multigermtubi'
            MB_m1]
          Length = 1146

 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 41/169 (24%), Positives = 76/169 (44%), Gaps = 8/169 (4%)

Query: 85   FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPIS-DAPGA 143
            ++ SD   N+++     +        RL    + +LG+ VN   KI   PSP +   P A
Sbjct: 984  WLESDHHGNLLVLQRNIDGITLEDKQRLRITGEMNLGEQVNMIRKIAIDPSPTAMVVPKA 1043

Query: 144  RSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYY 203
                  + A+ +G++  F  + + +   LL LQ  +       G L+ + +R++K     
Sbjct: 1044 ------FLATTEGSIYLFSTILDGSQDLLLRLQENITECVDTLGRLDFKTYRSFKSAERT 1097

Query: 204  AGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEAL 252
               P R  +DG L+ +FL  S   + +IC+ +G     I D + +++ L
Sbjct: 1098 TEEPYR-FVDGELIERFLDESEDMQQQICEGLGYTVEAIRDVVENLKRL 1145


>gi|452979181|gb|EME78944.1| hypothetical protein MYCFIDRAFT_43692 [Pseudocercospora fijiensis
            CIRAD86]
          Length = 1149

 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 41/172 (23%), Positives = 76/172 (44%), Gaps = 11/172 (6%)

Query: 73   ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC 132
            AC  V E++   ++ SD + N+++  +           RL   ++F LG+ VN     R 
Sbjct: 973  ACARVSENT---YLESDAEGNLIVLQHDINGFSQEDRRRLRVTSEFLLGEMVN-----RI 1024

Query: 133  KPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPR 192
            +P  +  +PGA      + A+ DG++  +  + +     L+ +Q +M       GG+   
Sbjct: 1025 RPITVQPSPGAVVTPQAFLATTDGSIYVYCEIGKPRQDLLMRMQTLMADMVKSPGGVRFA 1084

Query: 193  AFRTYKGKGYYAGN--PSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
             FR +K      G   P R  +DG L+ +FL +    + E+ K +     D+
Sbjct: 1085 KFRGFKTLVRDMGEEGPVR-FVDGELIERFLDMPEVLQNEVVKGLDGTGVDL 1135


>gi|18377609|gb|AAL66955.1| putative UV-damaged DNA binding factor [Arabidopsis thaliana]
          Length = 270

 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 33/123 (26%), Positives = 57/123 (46%), Gaps = 12/123 (9%)

Query: 117 DFHLGQHVNTF----FKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRL 172
           ++HLG+ VN F      +R   S I   P         + +++G +G    LP++ Y  L
Sbjct: 138 EYHLGEFVNRFRHGSLVMRLPDSEIGQIP------TVIFGTVNGVIGVIASLPQEQYTFL 191

Query: 173 LMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEIC 232
             LQ+ +       GGL+   +R++  +   A   +R  +DG L+  FL LS  +  +I 
Sbjct: 192 EKLQSSLRKVIKGVGGLSHEQWRSFNNEKRTA--EARNFLDGDLIESFLDLSRNKMEDIS 249

Query: 233 KKI 235
           K +
Sbjct: 250 KSM 252


>gi|367018592|ref|XP_003658581.1| hypothetical protein MYCTH_2294503 [Myceliophthora thermophila ATCC
            42464]
 gi|347005848|gb|AEO53336.1| hypothetical protein MYCTH_2294503 [Myceliophthora thermophila ATCC
            42464]
          Length = 1547

 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 32/104 (30%), Positives = 44/104 (42%), Gaps = 20/104 (19%)

Query: 152  ASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKG------------ 199
            A+  G L     LPE  YRRL  L   +     H  GLNPR +R   G            
Sbjct: 1421 AAPTGVLAALRALPESAYRRLSSLAAQLAGSLPHAAGLNPRGYRLPDGVASSSSPWSSSS 1480

Query: 200  -------KGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 236
                    G  AG   R I+DG+L+ +F +L +  R+E+  + G
Sbjct: 1481 SSFSAVVPGVDAGV-GRTIVDGALLQRFTELGMARRVELAGRAG 1523


>gi|388856288|emb|CCF50097.1| related to cleavage and polyadenylation specificity factor, 160 kDa
            subunit [Ustilago hordei]
          Length = 1568

 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 42/146 (28%), Positives = 67/146 (45%), Gaps = 9/146 (6%)

Query: 84   GFMISDKDKNVV-LFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPG 142
            GF    +D  V+ L+ Y P +  S GG RL+ +T++              +    S+  G
Sbjct: 1316 GFSGFGQDSGVIRLYEYNPSSPTSLGGQRLLLRTEYSTPSSTTCSLSAPGRWLSDSELRG 1375

Query: 143  ---ARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKG 199
                R++ L   +  +G+L     + EK  +RL +LQ  +V    HT  LNPRAFR  + 
Sbjct: 1376 REQLRNKLL--LSKSNGSLDSLASVEEKVAKRLHLLQGQLVRSVLHTAALNPRAFRQVRN 1433

Query: 200  KGYYAGNP-SRGIIDGSLVWKFLQLS 224
               +   P  +G++D  L+  F  LS
Sbjct: 1434 D--FVSRPLYKGVLDARLLDAFKGLS 1457


>gi|33411762|emb|CAD58786.1| cleavage and polyadenylation specificity factor 1 [Bos taurus]
          Length = 880

 Score = 48.5 bits (114), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 26/51 (50%), Positives = 34/51 (66%), Gaps = 4/51 (7%)

Query: 3   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGTLKLRFKKL 49
           +V+E+L V+LG    RP LLV    ELLIY+AF H     +G LK+RFKK+
Sbjct: 663 LVKEVLLVALGSRQRRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKV 713


>gi|322694449|gb|EFY86278.1| Cleavage factor two protein 1 [Metarhizium acridum CQMa 102]
          Length = 1431

 Score = 48.5 bits (114), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 39/185 (21%), Positives = 82/185 (44%), Gaps = 9/185 (4%)

Query: 74   CEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCK 133
             +F+ +   +  +  D + ++ +  + PE  +S  GH L+ +T F +  +  T   +  +
Sbjct: 1241 ADFLPDGEDLSMVAVDAEGDMHILEFNPEHPKSLQGHLLLHRTSFAVTPNTPTSTLLLPR 1300

Query: 134  PSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRA 193
                S    + S  +   A   G +    PL E  YRRLL + N +        GL+ +A
Sbjct: 1301 THSPSYPHASSSSHMLLLACPSGQVAALSPLAESTYRRLLSVTNQLHPAIVAHCGLHTKA 1360

Query: 194  FRTYKGKGYY------AGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELY 247
             R Y  +         A +  R ++DG+++ ++ +L   +R ++  + G  ++ + D   
Sbjct: 1361 HR-YPDQSCVAVGVETAASSGRALVDGTVLARWSELGAAKRTDVALRGG--YDSVADLRD 1417

Query: 248  DIEAL 252
            D+E +
Sbjct: 1418 DLEGV 1422


>gi|145351726|ref|XP_001420218.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144580451|gb|ABO98511.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 1120

 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 35/147 (23%), Positives = 62/147 (42%), Gaps = 10/147 (6%)

Query: 111  RLIKKTDFHLGQHVNTF----FKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPE 166
            RL    ++HLG+ VN F      +  K     + P         + + +G +G    LP+
Sbjct: 979  RLEITGEYHLGEFVNVFSPGSLVMSLKDGDSLEVP------TLLFGTGNGVIGVLASLPK 1032

Query: 167  KNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLG 226
              Y     LQ  M  H    GGL    +R+++       +PSR  +DG LV  FL L + 
Sbjct: 1033 DAYDFAERLQTSMNKHIQGVGGLKHAEWRSFRHTLRRKSDPSRNFVDGDLVESFLDLKVE 1092

Query: 227  ERLEICKKIGSKHNDILDELYDIEALS 253
            +   +   +     +I+  + +++ L+
Sbjct: 1093 QADVVAADMKCDRAEIIRRVEELQRLT 1119


>gi|297809743|ref|XP_002872755.1| UV-damaged DNA-binding protein 1A [Arabidopsis lyrata subsp. lyrata]
 gi|297318592|gb|EFH49014.1| UV-damaged DNA-binding protein 1A [Arabidopsis lyrata subsp. lyrata]
          Length = 1088

 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 38/155 (24%), Positives = 68/155 (43%), Gaps = 12/155 (7%)

Query: 85   FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF----FKIRCKPSPISDA 140
            ++ ++ + N+V      E        RL    ++HLG+ VN F      +R   S I   
Sbjct: 924  YLGAENNFNLVTVKKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSEIGQI 983

Query: 141  PGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGK 200
            P         + +++G +G    LP++ Y  L  LQ+ +       GGL+   +R++  +
Sbjct: 984  P------TVIFGTVNGVIGVIASLPQEQYTFLEKLQSSLRKVIKGVGGLSHEQWRSFNNE 1037

Query: 201  GYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI 235
               A   +R  +DG L+  FL LS  +  +I K +
Sbjct: 1038 KRTA--EARNFLDGDLIESFLDLSRNKMEDISKSM 1070


>gi|67516629|ref|XP_658200.1| hypothetical protein AN0596.2 [Aspergillus nidulans FGSC A4]
 gi|40747539|gb|EAA66695.1| hypothetical protein AN0596.2 [Aspergillus nidulans FGSC A4]
 gi|259489136|tpe|CBF89158.1| TPA: damaged DNA binding protein (Eurofung) [Aspergillus nidulans
            FGSC A4]
          Length = 1132

 Score = 48.1 bits (113), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 37/159 (23%), Positives = 69/159 (43%), Gaps = 8/159 (5%)

Query: 85   FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGAR 144
            ++ SD + N+++        E +   RL    +  L + VN     R +P  I   P A 
Sbjct: 970  YLESDAEGNLIVLRRNRSGVEEDDRRRLEVTGEICLNEMVN-----RIRPVNIQQLPSAT 1024

Query: 145  SRFLTWYASLDGALGFFLPLPEKNYRRLLM-LQNVMVTHTSHTGGLNPRAFRTYKGKGYY 203
                 + A+++G++ +   +   +Y+  LM LQ  M +     GG+    +R ++     
Sbjct: 1025 VVPRAFLATVEGSI-YLYAIINPDYQDFLMRLQATMASRADSLGGIPFTDYRAFRTMTRQ 1083

Query: 204  AGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
            A  P R  +DG L+ +FL      + EI   +GS   ++
Sbjct: 1084 ATEPYR-FVDGELIERFLTCEPAVQKEIVDIVGSSLEEV 1121


>gi|326426696|gb|EGD72266.1| hypothetical protein PTSG_00286 [Salpingoeca sp. ATCC 50818]
          Length = 1104

 Score = 48.1 bits (113), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 41/161 (25%), Positives = 68/161 (42%), Gaps = 5/161 (3%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
            V +C  V E+    F+++    +  L + Q E    +    L  K + +LG+ V +F + 
Sbjct: 917  VSSCALVNETV---FLVASHAHS--LSVCQREFEPGSTMQTLNAKFEIYLGETVTSFVRA 971

Query: 131  RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 190
                +   D+          + ++ G L   LPL       L  L+  M       GGL+
Sbjct: 972  ALGSAAAVDSSMPLRNTFFVFGTMGGGLACLLPLTPPQTELLTALECRMEEKIGGLGGLD 1031

Query: 191  PRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEI 231
             R FRT + +   A   +  ++DG LV  FLQL   E+ E+
Sbjct: 1032 HREFRTARDEQRMAQQVNPRLVDGDLVETFLQLPEEEQKEL 1072


>gi|110741229|dbj|BAF02165.1| UV-damaged DNA binding factor - like protein [Arabidopsis thaliana]
          Length = 727

 Score = 48.1 bits (113), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 35/129 (27%), Positives = 59/129 (45%), Gaps = 12/129 (9%)

Query: 111 RLIKKTDFHLGQHVNTF----FKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPE 166
           RL    ++HLG+ VN F      +R   S I   P         + +++G +G    LP+
Sbjct: 589 RLEVVGEYHLGEFVNRFRHGSLVMRLPDSEIGQIP------TVIFGTVNGVIGVIASLPQ 642

Query: 167 KNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLG 226
           + Y  L  LQ+ +       GGL+   +R++  +   A   +R  +DG L+  FL LS  
Sbjct: 643 EQYTFLEKLQSSLRKVIKGVGGLSHEQWRSFNNEKRTA--EARNFLDGDLIESFLDLSRN 700

Query: 227 ERLEICKKI 235
           +  +I K +
Sbjct: 701 KMEDISKSM 709


>gi|448528339|ref|XP_003869702.1| hypothetical protein CORT_0D07360 [Candida orthopsilosis Co 90-125]
 gi|380354055|emb|CCG23569.1| hypothetical protein CORT_0D07360 [Candida orthopsilosis]
          Length = 1170

 Score = 47.8 bits (112), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 30/104 (28%), Positives = 53/104 (50%), Gaps = 4/104 (3%)

Query: 151  YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
            Y  L G +G  LPL  K+   + +L ++ +  +++   +N       K + YY  NP++ 
Sbjct: 1070 YTGLTGTIGILLPLISKS--EIELLHDLQLEISAYNDKVNVAGKNHAKLRSYY--NPAKN 1125

Query: 211  IIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSS 254
            I DG  +  +L L L E+L+I K++     ++  +L DI   SS
Sbjct: 1126 IFDGDFLELYLNLPLDEKLKIAKRLNKSVGEVEKKLNDIRNRSS 1169


>gi|62318656|dbj|BAD95136.1| UV-damaged DNA-binding protein- like [Arabidopsis thaliana]
          Length = 1088

 Score = 47.8 bits (112), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 33/121 (27%), Positives = 56/121 (46%), Gaps = 12/121 (9%)

Query: 117  DFHLGQHVNTF----FKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRL 172
            ++H+G+ VN F      ++   S I   P         + ++ G +G    LP++ Y  L
Sbjct: 956  EYHIGEFVNRFRHGSLVMKLPDSDIGQIP------TVIFGTVSGMIGVIASLPQEQYAFL 1009

Query: 173  LMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEIC 232
              LQ  +       GGL+   +R++  +   A   ++G +DG L+  FL LS G+  EI 
Sbjct: 1010 EKLQTSLRKVIKGVGGLSHEQWRSFNNEKRTA--EAKGYLDGDLIESFLDLSRGKMEEIS 1067

Query: 233  K 233
            K
Sbjct: 1068 K 1068


>gi|15233515|ref|NP_193842.1| DNA damage-binding protein 1b [Arabidopsis thaliana]
 gi|73620956|sp|O49552.2|DDB1B_ARATH RecName: Full=DNA damage-binding protein 1b; AltName: Full=UV-damaged
            DNA-binding protein 1b; Short=DDB1b
 gi|110739453|dbj|BAF01636.1| UV-damaged DNA-binding protein- like [Arabidopsis thaliana]
 gi|332659001|gb|AEE84401.1| DNA damage-binding protein 1b [Arabidopsis thaliana]
          Length = 1088

 Score = 47.8 bits (112), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 33/121 (27%), Positives = 56/121 (46%), Gaps = 12/121 (9%)

Query: 117  DFHLGQHVNTF----FKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRL 172
            ++H+G+ VN F      ++   S I   P         + ++ G +G    LP++ Y  L
Sbjct: 956  EYHIGEFVNRFRHGSLVMKLPDSDIGQIP------TVIFGTVSGMIGVIASLPQEQYAFL 1009

Query: 173  LMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEIC 232
              LQ  +       GGL+   +R++  +   A   ++G +DG L+  FL LS G+  EI 
Sbjct: 1010 EKLQTSLRKVIKGVGGLSHEQWRSFNNEKRTA--EAKGYLDGDLIESFLDLSRGKMEEIS 1067

Query: 233  K 233
            K
Sbjct: 1068 K 1068


>gi|186511557|ref|NP_001118940.1| DNA damage-binding protein 1a [Arabidopsis thaliana]
 gi|332657118|gb|AEE82518.1| DNA damage-binding protein 1a [Arabidopsis thaliana]
          Length = 1067

 Score = 47.4 bits (111), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 36/146 (24%), Positives = 66/146 (45%), Gaps = 12/146 (8%)

Query: 111  RLIKKTDFHLGQHVNTF----FKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPE 166
            RL    ++HLG+ VN F      +R   S I   P         + +++G +G    LP+
Sbjct: 929  RLEVVGEYHLGEFVNRFRHGSLVMRLPDSEIGQIP------TVIFGTVNGVIGVIASLPQ 982

Query: 167  KNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLG 226
            + Y  L  LQ+ +       GGL+   +R++  +   A   +R  +DG L+  FL LS  
Sbjct: 983  EQYTFLEKLQSSLRKVIKGVGGLSHEQWRSFNNEKRTA--EARNFLDGDLIESFLDLSRN 1040

Query: 227  ERLEICKKIGSKHNDILDELYDIEAL 252
            +  +I K +  +  ++   + ++  L
Sbjct: 1041 KMEDISKSMNVQVEELCKRVEELTRL 1066


>gi|2911067|emb|CAA17529.1| UV-damaged DNA-binding protein-like [Arabidopsis thaliana]
 gi|7268907|emb|CAB79110.1| UV-damaged DNA-binding protein-like [Arabidopsis thaliana]
          Length = 1102

 Score = 47.4 bits (111), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 33/121 (27%), Positives = 56/121 (46%), Gaps = 12/121 (9%)

Query: 117  DFHLGQHVNTF----FKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRL 172
            ++H+G+ VN F      ++   S I   P         + ++ G +G    LP++ Y  L
Sbjct: 970  EYHIGEFVNRFRHGSLVMKLPDSDIGQIP------TVIFGTVSGMIGVIASLPQEQYAFL 1023

Query: 173  LMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEIC 232
              LQ  +       GGL+   +R++  +   A   ++G +DG L+  FL LS G+  EI 
Sbjct: 1024 EKLQTSLRKVIKGVGGLSHEQWRSFNNEKRTA--EAKGYLDGDLIESFLDLSRGKMEEIS 1081

Query: 233  K 233
            K
Sbjct: 1082 K 1082


>gi|312283457|dbj|BAJ34594.1| unnamed protein product [Thellungiella halophila]
          Length = 1088

 Score = 47.4 bits (111), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 35/129 (27%), Positives = 59/129 (45%), Gaps = 12/129 (9%)

Query: 111  RLIKKTDFHLGQHVNTF----FKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPE 166
            RL    ++HLG+ VN F      +R   S I   P         + +++G +G    LP+
Sbjct: 950  RLEVVGEYHLGEFVNRFRHGSLVMRLPDSEIGQIP------TVIFGTVNGVIGVIASLPQ 1003

Query: 167  KNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLG 226
            + Y  L  LQ+ +       GGL+   +R++  +   A   +R  +DG L+  FL LS  
Sbjct: 1004 EQYMFLEKLQSSLRKVIKGVGGLSHEQWRSFNNEKRTA--EARNFLDGDLIESFLDLSRN 1061

Query: 227  ERLEICKKI 235
            +  +I K +
Sbjct: 1062 KMEDISKSM 1070


>gi|15235577|ref|NP_192451.1| DNA damage-binding protein 1a [Arabidopsis thaliana]
 gi|55976605|sp|Q9M0V3.1|DDB1A_ARATH RecName: Full=DNA damage-binding protein 1a; AltName: Full=UV-damaged
            DNA-binding protein 1a; Short=DDB1a
 gi|7267302|emb|CAB81084.1| UV-damaged DNA binding factor-like protein [Arabidopsis thaliana]
 gi|25054828|gb|AAN71904.1| putative UV-damaged DNA binding factor [Arabidopsis thaliana]
 gi|332657117|gb|AEE82517.1| DNA damage-binding protein 1a [Arabidopsis thaliana]
          Length = 1088

 Score = 47.4 bits (111), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 35/129 (27%), Positives = 59/129 (45%), Gaps = 12/129 (9%)

Query: 111  RLIKKTDFHLGQHVNTF----FKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPE 166
            RL    ++HLG+ VN F      +R   S I   P         + +++G +G    LP+
Sbjct: 950  RLEVVGEYHLGEFVNRFRHGSLVMRLPDSEIGQIP------TVIFGTVNGVIGVIASLPQ 1003

Query: 167  KNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLG 226
            + Y  L  LQ+ +       GGL+   +R++  +   A   +R  +DG L+  FL LS  
Sbjct: 1004 EQYTFLEKLQSSLRKVIKGVGGLSHEQWRSFNNEKRTA--EARNFLDGDLIESFLDLSRN 1061

Query: 227  ERLEICKKI 235
            +  +I K +
Sbjct: 1062 KMEDISKSM 1070


>gi|389629928|ref|XP_003712617.1| hypothetical protein MGG_16867 [Magnaporthe oryzae 70-15]
 gi|351644949|gb|EHA52810.1| hypothetical protein MGG_16867 [Magnaporthe oryzae 70-15]
 gi|440464739|gb|ELQ34110.1| DNA damage-binding protein 1a [Magnaporthe oryzae Y34]
          Length = 1183

 Score = 47.4 bits (111), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 36/159 (22%), Positives = 71/159 (44%), Gaps = 7/159 (4%)

Query: 85   FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGAR 144
            ++++D D N+V+ +            R+   ++F LG+ VN   K+  + S  ++AP   
Sbjct: 1020 WIVADGDGNLVVLLRNTAGVTLEDKRRMQMTSEFGLGECVNKIQKVMVETS--ANAPIVA 1077

Query: 145  SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSH-TGGLNPRAFRTYKGKGYY 203
              FL+   + +G++  F  +  K    L+  Q  M  H S   G L    +R+++     
Sbjct: 1078 KAFLS---TTEGSIYLFGTVAPKFQSLLMDFQANMEAHVSSPLGELQFNQWRSFRNPERE 1134

Query: 204  AGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
               P R  +DG  +  FL +    +++IC+ +     D+
Sbjct: 1135 GAGPER-FLDGEFLEMFLDMEENTQIDICQGLSYTAEDM 1172


>gi|340381612|ref|XP_003389315.1| PREDICTED: DNA damage-binding protein 1-like [Amphimedon
            queenslandica]
          Length = 1142

 Score = 47.4 bits (111), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 37/143 (25%), Positives = 63/143 (44%), Gaps = 13/143 (9%)

Query: 112  LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARSRFL-----TWYASLDGALGFFLPLPE 166
            +++ +  + G +VN F     + S + D PGA +  L       + ++ GA+G    L  
Sbjct: 992  MVQPSRMYFGDNVNVF----SRGSFVMDHPGAGASSLLQGKPILFGTVHGAIGLIGTLNM 1047

Query: 167  KNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLG 226
              Y  L  LQ  M  +    G +    +R++  +  +   P  G IDG LV KFL+L   
Sbjct: 1048 DTYTLLSKLQQKMAANIKSVGNIEHEIYRSFSNE--HRSKPFAGFIDGDLVEKFLELPRP 1105

Query: 227  ERLEICKKIGSKHNDILDELYDI 249
            +  +I +  G K  D+     D+
Sbjct: 1106 QMSQIVQ--GIKTTDVTGTEVDV 1126


>gi|440487047|gb|ELQ66855.1| DNA damage-binding protein 1a [Magnaporthe oryzae P131]
          Length = 1213

 Score = 47.4 bits (111), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 36/159 (22%), Positives = 71/159 (44%), Gaps = 7/159 (4%)

Query: 85   FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGAR 144
            ++++D D N+V+ +            R+   ++F LG+ VN   K+  + S  ++AP   
Sbjct: 1050 WIVADGDGNLVVLLRNTAGVTLEDKRRMQMTSEFGLGECVNKIQKVMVETS--ANAPIVA 1107

Query: 145  SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSH-TGGLNPRAFRTYKGKGYY 203
              FL+   + +G++  F  +  K    L+  Q  M  H S   G L    +R+++     
Sbjct: 1108 KAFLS---TTEGSIYLFGTVAPKFQSLLMDFQANMEAHVSSPLGELQFNQWRSFRNPERE 1164

Query: 204  AGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
               P R  +DG  +  FL +    +++IC+ +     D+
Sbjct: 1165 GAGPER-FLDGEFLEMFLDMEENTQIDICQGLSYTAEDM 1202


>gi|407923753|gb|EKG16818.1| Cleavage/polyadenylation specificity factor A subunit [Macrophomina
            phaseolina MS6]
          Length = 1129

 Score = 46.6 bits (109), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 35/151 (23%), Positives = 68/151 (45%), Gaps = 6/151 (3%)

Query: 85   FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGAR 144
            ++ +D + N++L              RL+ + D  LG+ VN   +I    + +SDAP   
Sbjct: 965  YLQADAEGNLILLERDVSGVTEEDRKRLMLRGDMLLGEQVNRIRRIDM--ATVSDAPVIP 1022

Query: 145  SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYA 204
              F   +A+++G++  F  +       L+ LQ+ +       G      +R ++ +    
Sbjct: 1023 RAF---FATVEGSIYLFALIAPAKVDLLIRLQSQLADFVRSPGHYPFLRYRAFRNQVREE 1079

Query: 205  GNPSRGIIDGSLVWKFLQLSLGERLEICKKI 235
              P+R  +DG L+ +FL L   E+ E+ K +
Sbjct: 1080 DEPNR-FVDGDLIERFLDLKPREQEEVVKGV 1109


>gi|443707495|gb|ELU03057.1| hypothetical protein CAPTEDRAFT_148808 [Capitella teleta]
          Length = 1084

 Score = 46.6 bits (109), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 40/146 (27%), Positives = 68/146 (46%), Gaps = 16/146 (10%)

Query: 118  FHLGQHVNTFFKIRCKPSPISDAPG---ARSRFLTWYASLDGALGFFLPLPEKNYRRLLM 174
            +HLG+ VN F       S +   PG   + ++    + +++GALG    LP++ Y  LL 
Sbjct: 942  YHLGEFVNVFRH----GSLVMQHPGECTSPTQGSVLFGTVNGALGLVTQLPQEFYLFLLE 997

Query: 175  LQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKK 234
            +QN +       G +    +R++  +      P+ G IDG L+  FL LS  +  E+ + 
Sbjct: 998  VQNKLAKTIKSVGKVEHAFWRSFHTE--RKTEPATGFIDGDLIESFLDLSRDKMQEVVQG 1055

Query: 235  I------GSKHNDILDELYD-IEALS 253
            +      G K    +D+L   IE L+
Sbjct: 1056 LQMDDGSGMKREAAVDDLVKMIEELT 1081


>gi|154286506|ref|XP_001544048.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
 gi|150407689|gb|EDN03230.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
          Length = 1158

 Score = 46.6 bits (109), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 39/154 (25%), Positives = 70/154 (45%), Gaps = 10/154 (6%)

Query: 85   FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGAR 144
            ++ SD + N+V+          +   RL   ++  LG+ VN     R +P  I  + GA 
Sbjct: 988  WLESDAEGNLVMLHRNVNGVTDDDRRRLEVTSEILLGEMVN-----RIRPVNIQGSQGAE 1042

Query: 145  SRFL--TWYASLDGALGFFLPLPEKNYRRLLM-LQNVMVTHTSHTGGLNPRAFRTYKGKG 201
            +      +  +++G++ +   +    Y+ LLM LQ+ M       GG+    FR ++   
Sbjct: 1043 AAISPRAFLGTVEGSI-YLFGIINPTYQDLLMRLQSAMAGMVVTPGGMPFNKFRAFRNTI 1101

Query: 202  YYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI 235
              A  P R  +DG L+ +FL  S+  + EI  K+
Sbjct: 1102 RQAEEPYR-FVDGELIERFLSCSVELQEEIVGKV 1134


>gi|303271531|ref|XP_003055127.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226463101|gb|EEH60379.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 1223

 Score = 46.6 bits (109), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 25/86 (29%), Positives = 42/86 (48%)

Query: 151  YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
            + +++GA+G    LPE  +  L  LQ  M    S  GG +  A+R++  +       +RG
Sbjct: 1120 FGTVNGAIGVVASLPESTHTFLAALQKAMNKVVSGVGGFSHDAWRSFHNEHRSRLVEARG 1179

Query: 211  IIDGSLVWKFLQLSLGERLEICKKIG 236
             +DG L+  FL L   +  E+   +G
Sbjct: 1180 FVDGDLIESFLDLRPEKASEVASVVG 1205


>gi|297799958|ref|XP_002867863.1| hypothetical protein ARALYDRAFT_492777 [Arabidopsis lyrata subsp.
            lyrata]
 gi|297313699|gb|EFH44122.1| hypothetical protein ARALYDRAFT_492777 [Arabidopsis lyrata subsp.
            lyrata]
          Length = 1088

 Score = 46.6 bits (109), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 33/121 (27%), Positives = 55/121 (45%), Gaps = 12/121 (9%)

Query: 117  DFHLGQHVNTF----FKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRL 172
            ++H+G+ VN F      +R   S I   P         + ++ G +G    LP++ Y  L
Sbjct: 956  EYHIGEFVNRFRHGSLVMRLPDSEIGQIP------TVIFGTVSGMIGVIASLPQEQYAFL 1009

Query: 173  LMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEIC 232
              LQ  +       GGL+   +R++  +   A   ++  +DG L+  FL LS G+  EI 
Sbjct: 1010 EKLQTSLRKVIKGVGGLSHEQWRSFNNEKRTA--EAKSYLDGDLIESFLDLSRGKMEEIS 1067

Query: 233  K 233
            K
Sbjct: 1068 K 1068


>gi|402083318|gb|EJT78336.1| hypothetical protein GGTG_03437 [Gaeumannomyces graminis var. tritici
            R3-111a-1]
          Length = 1155

 Score = 46.6 bits (109), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 34/158 (21%), Positives = 73/158 (46%), Gaps = 6/158 (3%)

Query: 85   FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGAR 144
            ++ +D D NV++              R+   ++ +LG+++N   KI  +  P  +AP   
Sbjct: 993  WLEADADGNVMVLGRNTTGVTLEDRRRMEITSEINLGENINRIQKISVETGP--NAPIHP 1050

Query: 145  SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYA 204
              FL+   + +G++     +  +    LL LQ+ +  +    G +  + FR+++     A
Sbjct: 1051 KAFLS---TTEGSIYLVGAIAPQMRDLLLNLQDRLEDYVGTLGNIPFKNFRSFRNAEREA 1107

Query: 205  GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
              P R  IDG  + +FL ++   + ++C+ +G    D+
Sbjct: 1108 DGPVR-FIDGEYIERFLDMNEETQSQVCRDLGPSVEDM 1144


>gi|91087281|ref|XP_975549.1| PREDICTED: similar to conserved hypothetical protein [Tribolium
            castaneum]
 gi|270010588|gb|EFA07036.1| hypothetical protein TcasGA2_TC010010 [Tribolium castaneum]
          Length = 1149

 Score = 46.2 bits (108), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 38/138 (27%), Positives = 60/138 (43%), Gaps = 15/138 (10%)

Query: 118  FHLGQHVNTFFKIRCKPSPISDAPGARSRFLT---WYASLDGALGFFLPLPEKNYRRLLM 174
            FH+G  +N F       S +    G  S   T    + ++ GA+G    + +  Y  LL 
Sbjct: 1007 FHVGDMINVFRH----GSLVMQNLGETSTPTTGCVLFGTVSGAIGLVTQITQDFYDFLLE 1062

Query: 175  LQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLS------LGER 228
            LQN + T     G ++   +R +         PS G IDG L+  FL LS      + + 
Sbjct: 1063 LQNKLSTVIKSVGKIDHSQWRAFNTD--IKTEPSEGFIDGDLIESFLDLSHDKMKEVADG 1120

Query: 229  LEICKKIGSKHNDILDEL 246
            L+I  + G K +  +D+L
Sbjct: 1121 LQITGEGGMKQDCTVDDL 1138


>gi|300176205|emb|CBK23516.2| unnamed protein product [Blastocystis hominis]
          Length = 702

 Score = 46.2 bits (108), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 48/198 (24%), Positives = 82/198 (41%), Gaps = 23/198 (11%)

Query: 69  RGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEAR--ESNGGHRLIKKTDFHLGQHVNT 126
           R V A E + E+ S   ++ D   N+++     EA    SN    ++ K  FHL   +N 
Sbjct: 484 RDVTALERLPEAQSE-MVVGDAYGNLMVMQVVEEADLDRSNPQKIVVTKESFHLDDQINR 542

Query: 127 FFKIRCKPSPISD------APGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMV 180
           F  ++   S   D      A  +   F   +A++ G +G    L ++ +R L  ++  M 
Sbjct: 543 FVPVQLFRSGAEDKKKEKRAEESEIAFNLAFATVSGRIGMIGALNDREFRMLRAIETAME 602

Query: 181 THTSHTGGLNPRAFRTYKGKGYYAGNPSRGI------IDGSLVWKFLQLSLGERLEICKK 234
              +  GGL+ + +R          N   GI      IDG LV  FL+L    + +I   
Sbjct: 603 NVITPVGGLDHKQWR--------CSNTPFGIKNLAYCIDGDLVEMFLELDDESQAKIADS 654

Query: 235 IGSKHNDILDELYDIEAL 252
           + ++    L   + I+ L
Sbjct: 655 VSTELRSALSPQFLIDYL 672


>gi|301121252|ref|XP_002908353.1| DNA damage-binding protein, putative [Phytophthora infestans T30-4]
 gi|262103384|gb|EEY61436.1| DNA damage-binding protein, putative [Phytophthora infestans T30-4]
          Length = 1150

 Score = 46.2 bits (108), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 40/178 (22%), Positives = 74/178 (41%), Gaps = 28/178 (15%)

Query: 85   FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF----------------- 127
            ++ S+ D N+        A       RL    +FHLG+ VN F                 
Sbjct: 950  YIGSETDFNLFTVQRNSGAASDEERGRLETVGEFHLGEFVNRFRYGSLVMQNSSSTSQTP 1009

Query: 128  -FKIRCKPSPISD-------APGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVM 179
               +   P+ + D       AP  +++ +  + ++ G +G  LP+ +  Y  LL +Q  +
Sbjct: 1010 SGVVSTGPTAMVDVGESAPAAPVVQNQSM-LFGTVSGMIGVILPISKDQYSFLLRVQQAL 1068

Query: 180  VTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGS 237
                   GG + + +RT++ +   + + +R  IDG LV  FL L   +  ++  K+ S
Sbjct: 1069 THVVKGVGGFSHKDWRTFENR--RSVSEARNFIDGDLVESFLDLPKPQMTKVVDKLNS 1124


>gi|380488197|emb|CCF37544.1| hypothetical protein CH063_08850 [Colletotrichum higginsianum]
          Length = 271

 Score = 46.2 bits (108), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 42/162 (25%), Positives = 79/162 (48%), Gaps = 18/162 (11%)

Query: 85  FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIR-CKPSPISDAPGA 143
           ++ +D   N+++    P+A   +   ++   ++FHLG+ VN   KIR    +P  + P  
Sbjct: 101 WLEADAQGNLMVLRRNPDAPTEHDQKQMEVTSEFHLGEQVN---KIRPLDITPNENDPIV 157

Query: 144 RSRFLTWYASLDGALGFFLPLPEKNYRRLLM-----LQNVMVT----HTSHTGGLNPRAF 194
              FL   A+++G+L  F  + +  Y+ LL+     L +V+ T        T GL+  A+
Sbjct: 158 PKAFL---ATVEGSLYVFADI-KSEYQSLLLQFQERLADVVKTLGQAGGDSTSGLSFMAW 213

Query: 195 RTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 236
           R ++     A  P R  +DG L+ +FL L   ++  + + +G
Sbjct: 214 RGFRNAKRAADGPFR-FVDGELIERFLDLDEAKQEAVVQGLG 254


>gi|237839083|ref|XP_002368839.1| hypothetical protein TGME49_067710 [Toxoplasma gondii ME49]
 gi|211966503|gb|EEB01699.1| hypothetical protein TGME49_067710 [Toxoplasma gondii ME49]
          Length = 2136

 Score = 46.2 bits (108), Expect = 0.015,   Method: Composition-based stats.
 Identities = 27/74 (36%), Positives = 43/74 (58%), Gaps = 3/74 (4%)

Query: 151  YASLDGALGFFLPLP-EKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSR 209
            +AS +GA+G  L +P E+ + RL +LQ+ +   T   G L+  AF + K     A  PS+
Sbjct: 2051 WASSEGAIGHLLQIPDEQTFARLAVLQDAVTKVTKSIGKLSAVAFHSVKVGT--ATVPSK 2108

Query: 210  GIIDGSLVWKFLQL 223
            G IDG ++ +FL+ 
Sbjct: 2109 GFIDGDILERFLEF 2122


>gi|221502136|gb|EEE27880.1| conserved hypothetical protein [Toxoplasma gondii VEG]
          Length = 2131

 Score = 45.8 bits (107), Expect = 0.015,   Method: Composition-based stats.
 Identities = 27/74 (36%), Positives = 43/74 (58%), Gaps = 3/74 (4%)

Query: 151  YASLDGALGFFLPLP-EKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSR 209
            +AS +GA+G  L +P E+ + RL +LQ+ +   T   G L+  AF + K     A  PS+
Sbjct: 2046 WASSEGAIGHLLQIPDEQTFARLAVLQDAVTKVTKSIGKLSAVAFHSVKVGT--ATVPSK 2103

Query: 210  GIIDGSLVWKFLQL 223
            G IDG ++ +FL+ 
Sbjct: 2104 GFIDGDILERFLEF 2117


>gi|302894051|ref|XP_003045906.1| predicted protein [Nectria haematococca mpVI 77-13-4]
 gi|256726833|gb|EEU40193.1| predicted protein [Nectria haematococca mpVI 77-13-4]
          Length = 1162

 Score = 45.8 bits (107), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 33/158 (20%), Positives = 72/158 (45%), Gaps = 6/158 (3%)

Query: 85   FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGAR 144
            ++ +D   N+++     +A       RL   ++  +G+ +N   +IR    P  D     
Sbjct: 1000 WLEADSQGNLIVLQRNADAPTEQDRSRLEVTSEIGIGEQIN---RIRKLHVPAGDNSIVH 1056

Query: 145  SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYA 204
             R   + AS +G+L  +  +  +    L+  Q+ M  +    G +  + +R+++ +   +
Sbjct: 1057 PR--AFLASAEGSLYLYGDIAPQYQDLLMTFQSKMEEYIHAPGNIEFKLWRSFRNENRES 1114

Query: 205  GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
              P R  IDG +V +FL +  G++  +C+ +G    D+
Sbjct: 1115 DGPYR-FIDGEMVERFLDMDEGKQELVCEGLGPSVEDM 1151


>gi|156084934|ref|XP_001609950.1| splicing factor 3b, subunit 3, 130kD [Babesia bovis T2Bo]
 gi|154797202|gb|EDO06382.1| splicing factor 3b, subunit 3, 130kD, putative [Babesia bovis]
          Length = 1169

 Score = 45.8 bits (107), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 46/214 (21%), Positives = 94/214 (43%), Gaps = 25/214 (11%)

Query: 40   GTLKLRFKKLKVLFVSDRSKRANEQPGL--PRGVYACEFVIESSSMGFMISDKDKNVVLF 97
            G ++   + L++ F S+ +       G   PR + +C  +  S+ +       DK   +F
Sbjct: 974  GDIREGIQILRIKFYSEEAAEFEWVGGATGPRWLTSCAQLDYSTVIA-----GDKFDSIF 1028

Query: 98   MYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARSRFLTWYASLDGA 157
            + +    ES    +L     FHLG            P+ +  A  ++S  +  Y ++ G+
Sbjct: 1029 VTRVPQEESTRHIQLENVCQFHLGD----------LPTAMDKAALSQSTHVVLYGTVMGS 1078

Query: 158  LGFFLPLPEKNYRRLLM-LQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSL 216
            +G  +P   K+    L  L+ +M T      G     +R+Y     Y   P + ++DG L
Sbjct: 1079 IGALVPFQSKDELDFLQHLEMLMATEAPPLCGREHSFYRSY-----YV--PVQQVVDGDL 1131

Query: 217  VWKFLQLSLGERLEICKKIGSKHNDILDELYDIE 250
              +F  L+  ++ ++ +++ +  N++L +L DI+
Sbjct: 1132 CEQFRHLTEAQQRKVAQQLDTTVNNVLRKLDDIK 1165


>gi|345570887|gb|EGX53705.1| hypothetical protein AOL_s00006g33 [Arthrobotrys oligospora ATCC
            24927]
          Length = 1133

 Score = 45.8 bits (107), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 45/155 (29%), Positives = 66/155 (42%), Gaps = 11/155 (7%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
            V A E + E + +    +D D N+VL   +          R+   ++  LG+ VN    I
Sbjct: 952  VTALEALDEDTCIS---ADSDGNLVLLRRESTGATDEDTRRMRPLSEIRLGEMVNC---I 1005

Query: 131  RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLM-LQNVMVTHTSHTGGL 189
            R    PI+     + +   +  ++DG L F L L   +Y  +LM  Q  M       G L
Sbjct: 1006 RRVNDPITQGYVVQPK--AYLGTVDGGL-FMLGLIHPDYFDILMKCQVNMAKVIKGIGDL 1062

Query: 190  NPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLS 224
            +   +R Y  KG     P R  +DG LV KFL L 
Sbjct: 1063 DFNRYRAYNTKGIQPEEPFR-FVDGELVEKFLDLD 1096


>gi|330792580|ref|XP_003284366.1| hypothetical protein DICPUDRAFT_86223 [Dictyostelium purpureum]
 gi|325085712|gb|EGC39114.1| hypothetical protein DICPUDRAFT_86223 [Dictyostelium purpureum]
          Length = 1064

 Score = 45.8 bits (107), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 37/140 (26%), Positives = 63/140 (45%), Gaps = 10/140 (7%)

Query: 118  FHLGQHVNTFFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQN 177
            +H+G+ +N+         P SDAP   +     YAS++G++G    + +++Y     LQ 
Sbjct: 933  YHIGESINSMLCGSLVRLPDSDAPPIPT---ILYASVNGSIGVIASISKEDYEFFSKLQK 989

Query: 178  VMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGS 237
             +    +  GG    ++R +    +     SR  IDG L+  F  L    ++E   K+  
Sbjct: 990  GLNRVVNGIGGFTHESWRAFSNDHHTV--ESRNFIDGDLIEMFPDL----KIESMAKVIQ 1043

Query: 238  KHNDILDE-LYDIEALSSHF 256
              N  LDE L  IE+L  + 
Sbjct: 1044 DMNVTLDETLKRIESLMQYI 1063


>gi|193644722|ref|XP_001942922.1| PREDICTED: DNA damage-binding protein 1-like [Acyrthosiphon pisum]
          Length = 1156

 Score = 45.8 bits (107), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 39/173 (22%), Positives = 69/173 (39%), Gaps = 6/173 (3%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P+   A E + +   +G   ++ DKN+ +             H+L +   FH G  +N F
Sbjct: 962  PKWSTAIEIIDDDVFLG---AENDKNLFIIHKDSTLTSDEARHQLQEIGQFHCGDLINVF 1018

Query: 128  FKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 187
                      +D   +    +  Y +  GALG    L  K +  L  L+  + T     G
Sbjct: 1019 RHGSLVMQHFTDTYVSVQGGI-LYGTCSGALGLVTQLTPKMFDFLSDLEKSLATVVKGVG 1077

Query: 188  GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHN 240
             +N + +R+Y  +      PS   +DG L+  FL LS  E + +   +   ++
Sbjct: 1078 KINHQFWRSYHTE--IRTEPSESFVDGDLIESFLDLSKREMIAVVDALQGAYD 1128


>gi|340521192|gb|EGR51427.1| predicted protein [Trichoderma reesei QM6a]
          Length = 1161

 Score = 45.8 bits (107), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 32/158 (20%), Positives = 72/158 (45%), Gaps = 6/158 (3%)

Query: 85   FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGAR 144
            ++ +D   NV++   + EA       +L   ++ ++G+ +N   K++  P    +A    
Sbjct: 999  WLEADAQGNVIVLRQRQEAPTEQDRSQLEITSELNIGEQINRIRKLQVAPG--ENAVVVP 1056

Query: 145  SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYA 204
              FL    S++G L  +  +  K    L+  Q+ +  +    G L+   +R ++ +    
Sbjct: 1057 KAFL---GSIEGTLYLYGDIAPKYQDLLMTFQSRLQGYIQTPGNLSFDLWRAFRNQAREG 1113

Query: 205  GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
             +P R  +DG ++ +FL L   ++  +C+ +G    D+
Sbjct: 1114 ESPYR-FVDGEMIERFLDLDESQQELVCEGLGPNVEDM 1150


>gi|168066745|ref|XP_001785293.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162663100|gb|EDQ49884.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 1090

 Score = 45.8 bits (107), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 34/129 (26%), Positives = 56/129 (43%), Gaps = 12/129 (9%)

Query: 111  RLIKKTDFHLGQHVNTF----FKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPE 166
            RL    ++HLG+ VN F      +R   S  S  P         + +++G +G    LP+
Sbjct: 952  RLEVVGEYHLGEFVNRFRHGSLVMRLPDSEASQIP------TVIFGTVNGVIGVIASLPQ 1005

Query: 167  KNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLG 226
              +  L  LQ  +V      GGL+   +R++  +       +R  +DG L+  FL LS  
Sbjct: 1006 DQFLFLQKLQQALVKVIKGVGGLSHEQWRSFSNERKTV--DARNFLDGDLIESFLDLSRN 1063

Query: 227  ERLEICKKI 235
            +  EI   +
Sbjct: 1064 KMEEIATSL 1072


>gi|328874742|gb|EGG23107.1| UV-damaged DNA binding protein1 [Dictyostelium fasciculatum]
          Length = 1116

 Score = 45.4 bits (106), Expect = 0.020,   Method: Compositional matrix adjust.
 Identities = 41/190 (21%), Positives = 77/190 (40%), Gaps = 7/190 (3%)

Query: 46   FKKLKVLFVSDRSKRANEQPGLPRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARE 105
             K + +L  S +  +   +   P  +    F+ + + +G   SD   N++L     E   
Sbjct: 915  MKSMALLQASGKDVKELSRSSQPFWLTGLTFIDDDTYLG---SDNSYNLILMKKNTETAN 971

Query: 106  SNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLP 165
                  L      H G+ +N F           D+P   S     +A++ G +G    + 
Sbjct: 972  EVDSQLLDNIGHIHTGEFINRFHHGTLATLTDVDSPKPNSII---FATISGCIGVISTIS 1028

Query: 166  EKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSL 225
            +++Y     LQ  +       GG +   +R+++ + + +   SR  IDG LV +FL L  
Sbjct: 1029 KQDYDFFSKLQVGLNRVIRGIGGFSHDRWRSFQNE-HISNIESRNFIDGDLVEQFLHLRH 1087

Query: 226  GERLEICKKI 235
             + LE+ K +
Sbjct: 1088 DKMLEVTKDM 1097


>gi|169611218|ref|XP_001799027.1| hypothetical protein SNOG_08717 [Phaeosphaeria nodorum SN15]
 gi|160702249|gb|EAT83885.2| hypothetical protein SNOG_08717 [Phaeosphaeria nodorum SN15]
          Length = 1140

 Score = 45.4 bits (106), Expect = 0.020,   Method: Compositional matrix adjust.
 Identities = 38/167 (22%), Positives = 76/167 (45%), Gaps = 8/167 (4%)

Query: 77   VIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSP 136
            V E     ++ SD + N+++    P+    +   RL   ++  LG+ VN   +I      
Sbjct: 970  VAEVDENTYLESDAEGNLLVLYRDPKGVTDDDKRRLNVSSEMLLGEMVNRIRRI-----D 1024

Query: 137  ISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLM-LQNVMVTHTSHTGGLNPRAFR 195
            ++ AP A      +  +++G++ +   L  +NY  LL+ LQ+ +       G ++   FR
Sbjct: 1025 VATAPDAVVVPRAFMGTVEGSI-YLFALISQNYLDLLITLQSNLGNLVVSPGNMDFAKFR 1083

Query: 196  TYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
             +K +      P+R  +DG L+ +FL      + +  + +G +  DI
Sbjct: 1084 AFKNQVRTEEEPNR-FVDGELIERFLDCEEDVQRKAIEGLGVELEDI 1129


>gi|255571318|ref|XP_002526608.1| DNA repair protein xp-E, putative [Ricinus communis]
 gi|223534048|gb|EEF35767.1| DNA repair protein xp-E, putative [Ricinus communis]
          Length = 1033

 Score = 45.4 bits (106), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 35/130 (26%), Positives = 57/130 (43%), Gaps = 12/130 (9%)

Query: 111  RLIKKTDFHLGQHVNTF----FKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPE 166
            RL    ++HLG+ VN F      +R   S +   P         + +++G +G    LP 
Sbjct: 895  RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIP------TVIFGTVNGVIGVIASLPH 948

Query: 167  KNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLG 226
            + Y  L  LQ+ +       GGL+   +R++  +       ++  +DG L+  FL LS  
Sbjct: 949  EQYIFLEKLQSNLRRVIKGVGGLSHEQWRSFNNEKKTV--EAKNFLDGDLIESFLDLSRN 1006

Query: 227  ERLEICKKIG 236
               EI K IG
Sbjct: 1007 RMDEISKAIG 1016


>gi|358400469|gb|EHK49795.1| hypothetical protein TRIATDRAFT_146031 [Trichoderma atroviride IMI
            206040]
          Length = 1161

 Score = 45.4 bits (106), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 33/160 (20%), Positives = 72/160 (45%), Gaps = 10/160 (6%)

Query: 85   FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGAR 144
            ++ +D   NV++     EA       +L   ++ ++G+ +N   K++        APG  
Sbjct: 999  WLEADAQGNVIVLRQNLEAPTEQDQSQLQVISELNIGEQINRIRKLQV-------APGEN 1051

Query: 145  SRFL--TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGY 202
            +  +   +  S +G L  +  +  K    L+  Q+ +  + S  G L+   +R ++ +  
Sbjct: 1052 AIVVPKAFLGSTEGTLYLYGDIAPKYQDLLMTFQSRLQEYISTPGNLSFDLWRAFRNQSR 1111

Query: 203  YAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
                P R  +DG ++ +FL L  G++  +C+ +G    D+
Sbjct: 1112 EGEAPFR-FVDGEMIERFLDLDEGKQELVCEGLGPSVEDM 1150


>gi|391335522|ref|XP_003742140.1| PREDICTED: DNA damage-binding protein 1-like [Metaseiulus
            occidentalis]
          Length = 1154

 Score = 45.4 bits (106), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 36/140 (25%), Positives = 64/140 (45%), Gaps = 6/140 (4%)

Query: 85   FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGAR 144
            F+ ++ + N+ +    P A +    H + +   +HLG+ VN   K     +   D P   
Sbjct: 980  FLAAESNLNLYVCKRDPSAADDTRQH-MHEVALYHLGEMVNVIVKGSLVMAQPGDMPLPL 1038

Query: 145  SR-FLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYY 203
            ++ FL  Y SL GA+G  +P+ ++ Y  L  +Q  +       G +    +RT+  +   
Sbjct: 1039 NKSFL--YGSLHGAVGVIVPIKQELYAILNQIQTNLAKTIKSVGKIEHGFWRTFLAERKI 1096

Query: 204  AGNPSRGIIDGSLVWKFLQL 223
               P+ G IDG L+ + L L
Sbjct: 1097 --EPATGFIDGDLIEQLLDL 1114


>gi|358380497|gb|EHK18175.1| hypothetical protein TRIVIDRAFT_80808 [Trichoderma virens Gv29-8]
          Length = 1161

 Score = 45.4 bits (106), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 32/158 (20%), Positives = 71/158 (44%), Gaps = 6/158 (3%)

Query: 85   FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGAR 144
            ++ +D   N+++     EA       +L   ++ ++G+ +N   KI+  P+   +A    
Sbjct: 999  WLEADAQGNIIVLRQNQEAPTEQDRSQLEITSELNIGEQINRIRKIQVAPA--ENAIVIP 1056

Query: 145  SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYA 204
              FL    S++G L  +  +  K    L+  Q+ +  +    G L+   +R ++ +    
Sbjct: 1057 KAFL---GSIEGTLYLYGDIAPKYQDLLMTFQSRLQEYIQTPGNLSFDTWRAFRNQARDG 1113

Query: 205  GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
              P R  +DG ++ +FL L   ++  +C+ +G    D+
Sbjct: 1114 EAPFR-FVDGEMIERFLDLDEKQQELVCEGLGPSVEDM 1150


>gi|344231825|gb|EGV63707.1| hypothetical protein CANTEDRAFT_134986 [Candida tenuis ATCC 10573]
 gi|344231826|gb|EGV63708.1| hypothetical protein CANTEDRAFT_134986 [Candida tenuis ATCC 10573]
          Length = 991

 Score = 45.1 bits (105), Expect = 0.026,   Method: Compositional matrix adjust.
 Identities = 29/98 (29%), Positives = 50/98 (51%), Gaps = 9/98 (9%)

Query: 151 YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
           YA L G +G  LP+ E +++ L  L   +  +     G +   FR     GYY  N +  
Sbjct: 896 YAGLQGTIGILLPISESDFKFLSNLS--IELNKDLLLGRDHMKFR-----GYY--NSTHN 946

Query: 211 IIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYD 248
           +IDG ++ KFL+L+   R++I  K+     +I +++ D
Sbjct: 947 VIDGDIIEKFLELNASSRIKISNKLNKSVREIENKIND 984


>gi|71413583|ref|XP_808925.1| cleavage and polyadenylation specificity factor [Trypanosoma cruzi
           strain CL Brener]
 gi|70873226|gb|EAN87074.1| cleavage and polyadenylation specificity factor, putative
           [Trypanosoma cruzi]
          Length = 444

 Score = 45.1 bits (105), Expect = 0.028,   Method: Compositional matrix adjust.
 Identities = 40/185 (21%), Positives = 79/185 (42%), Gaps = 10/185 (5%)

Query: 71  VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGG--HRLIKKTDFHLGQHVNTFF 128
           V  C+ +    + G + SD ++N+++  Y P  +E+  G  +++++      G++     
Sbjct: 259 VVHCDMMYHDRAFGLLCSDDERNLLVMGYTPRVQETEAGSPNKVLESVLSLDGEY---RL 315

Query: 129 KIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGG 188
              C    +     A +  +T Y +  G +GF +P+ E+  R    L   +     H+ G
Sbjct: 316 SGGCLVKSLRFRSLAGNSSVTLYVTNYGEIGFIVPIGEQANRTASWLMRRLQIDLPHSAG 375

Query: 189 LNPRAFRTY-KGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELY 247
           L PR F    +G    A      ++  SL+ +F  L +  R    K I S     L+ + 
Sbjct: 376 LTPRMFLGLSQGSPRTAMRAKEMLVSASLLNEFFFLDIHSR----KTIASAAYTQLERVT 431

Query: 248 DIEAL 252
           ++ +L
Sbjct: 432 NVASL 436


>gi|440302955|gb|ELP95261.1| hypothetical protein EIN_430670 [Entamoeba invadens IP1]
          Length = 1175

 Score = 45.1 bits (105), Expect = 0.028,   Method: Compositional matrix adjust.
 Identities = 28/96 (29%), Positives = 48/96 (50%), Gaps = 2/96 (2%)

Query: 151  YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
            Y S  G++G+   + E++Y+ L  +   M     H  G NP  FR  KG G   G   + 
Sbjct: 1067 YLSAQGSVGYLTVVNERSYQTLRQINAKMNREPWHFAGTNPEEFRMEKGYGVGYGRRKQV 1126

Query: 211  IIDGSLVWKFLQLSLGERLEICKKIGSKHN--DILD 244
            I+DG ++ +F  L+  ++  +C +  S  +  +ILD
Sbjct: 1127 ILDGDILKEFHFLTQEQQKRVCLRNTSISDVVNILD 1162


>gi|320593036|gb|EFX05445.1| uv-damaged DNA-binding protein [Grosmannia clavigera kw1407]
          Length = 1504

 Score = 45.1 bits (105), Expect = 0.030,   Method: Compositional matrix adjust.
 Identities = 45/201 (22%), Positives = 86/201 (42%), Gaps = 14/201 (6%)

Query: 54   VSDRSKRANEQPGL---PRGVYACEFVIESSSMG--FMISDKDKNVVLFMYQPEARESNG 108
             +DRSK   E P L    RG  A          G  ++ +D   N+ +     +   ++ 
Sbjct: 1306 TADRSKTIAEGPKLVERARGYQASWATAVCHVEGDLWLEADGFGNLTMLERDVQGVTADD 1365

Query: 109  GHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKN 168
              RL    + +LG+ VN     R +P  +  +PGA      + A+++G++     +  + 
Sbjct: 1366 KRRLRTVGEMYLGEMVN-----RIRPIAVETSPGAMVHPRAFLATVEGSIYMVGTIAPEA 1420

Query: 169  YRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGER 228
               L+ LQ  +       G  +  A+R+++     +  P R  +DG L+ +FL +    +
Sbjct: 1421 QDLLMNLQTKLAAIVKGPGNTSFSAYRSFRNAERESTEPFR-FVDGELLERFLDVGEDVQ 1479

Query: 229  LEICKKIGSKHND---ILDEL 246
             E+ + +G    D   I++EL
Sbjct: 1480 KEVAQGLGPSVEDLRNIIEEL 1500


>gi|427780151|gb|JAA55527.1| Putative dna damage-binding protein 1 [Rhipicephalus pulchellus]
          Length = 1181

 Score = 45.1 bits (105), Expect = 0.030,   Method: Compositional matrix adjust.
 Identities = 46/170 (27%), Positives = 74/170 (43%), Gaps = 18/170 (10%)

Query: 96   LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
            LF+ Q ++  +    R  L +   FHLG+ VN F       S +   PG  S   +    
Sbjct: 1015 LFVCQKDSAATTDEERQHLQEVGQFHLGEFVNVFR----HGSLVMQHPGETSSPTQGSVL 1070

Query: 151  YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
            + ++ GA+G    LP   Y  L  +Q  +       G ++   +R++  +      P+ G
Sbjct: 1071 FGTIHGAIGLVSQLPADFYTFLSEVQEKLTKVIKSVGKIDHAFWRSFSTE--RKTEPAVG 1128

Query: 211  IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYD-IEALS 253
             IDG L+  FL LS  +  E+ + I      G K +  +D+L   IE LS
Sbjct: 1129 FIDGDLIESFLDLSRDKMQEVVQGIQMDDGSGMKRDASVDDLIKIIEELS 1178


>gi|427788481|gb|JAA59692.1| Putative dna damage-binding protein 1 [Rhipicephalus pulchellus]
          Length = 1156

 Score = 45.1 bits (105), Expect = 0.031,   Method: Compositional matrix adjust.
 Identities = 46/170 (27%), Positives = 74/170 (43%), Gaps = 18/170 (10%)

Query: 96   LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
            LF+ Q ++  +    R  L +   FHLG+ VN F       S +   PG  S   +    
Sbjct: 990  LFVCQKDSAATTDEERQHLQEVGQFHLGEFVNVFR----HGSLVMQHPGETSSPTQGSVL 1045

Query: 151  YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
            + ++ GA+G    LP   Y  L  +Q  +       G ++   +R++  +      P+ G
Sbjct: 1046 FGTIHGAIGLVSQLPADFYTFLSEVQEKLTKVIKSVGKIDHAFWRSFSTE--RKTEPAVG 1103

Query: 211  IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYD-IEALS 253
             IDG L+  FL LS  +  E+ + I      G K +  +D+L   IE LS
Sbjct: 1104 FIDGDLIESFLDLSRDKMQEVVQGIQMDDGSGMKRDASVDDLIKIIEELS 1153


>gi|303313681|ref|XP_003066852.1| CPSF A subunit region family protein [Coccidioides posadasii C735
            delta SOWgp]
 gi|240106514|gb|EER24707.1| CPSF A subunit region family protein [Coccidioides posadasii C735
            delta SOWgp]
 gi|320031496|gb|EFW13458.1| UV-damaged DNA binding protein [Coccidioides posadasii str. Silveira]
          Length = 1144

 Score = 45.1 bits (105), Expect = 0.033,   Method: Compositional matrix adjust.
 Identities = 32/137 (23%), Positives = 62/137 (45%), Gaps = 6/137 (4%)

Query: 85   FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGAR 144
            F+++D + N+V+          +   R+   ++  LG+ VN    +  + SP  ++P   
Sbjct: 978  FLVADAEGNLVVLNRNTTGVTEDDRRRMQVTSELRLGEMVNRIHPMDLQTSP--ESPVIP 1035

Query: 145  SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYA 204
              FL   A++DG++  F  +       L+ LQ+ +    +  G +    +R +K     A
Sbjct: 1036 KAFL---ATVDGSIYLFGLISPSAQDTLMRLQSALADFVASPGEIPFNKYRAFKSSVRQA 1092

Query: 205  GNPSRGIIDGSLVWKFL 221
              P R  +DG L+ +FL
Sbjct: 1093 EEPFR-FVDGELIEQFL 1108


>gi|392864500|gb|EAS34654.2| UV-damaged DNA binding protein [Coccidioides immitis RS]
          Length = 1144

 Score = 44.7 bits (104), Expect = 0.037,   Method: Compositional matrix adjust.
 Identities = 32/137 (23%), Positives = 62/137 (45%), Gaps = 6/137 (4%)

Query: 85   FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGAR 144
            F+++D + N+V+          +   R+   ++  LG+ VN    +  + SP  ++P   
Sbjct: 978  FLVADAEGNLVVLNRDTTGVTEDDRRRMQVTSELRLGEMVNRIHPMDLQTSP--ESPVIP 1035

Query: 145  SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYA 204
              FL   A++DG++  F  +       L+ LQ+ +    +  G +    +R +K     A
Sbjct: 1036 KAFL---ATVDGSIYLFGLISPSAQDTLMRLQSALADFVASPGEIPFNKYRAFKSSVRQA 1092

Query: 205  GNPSRGIIDGSLVWKFL 221
              P R  +DG L+ +FL
Sbjct: 1093 EEPFR-FVDGELIEQFL 1108


>gi|119191318|ref|XP_001246265.1| hypothetical protein CIMG_00036 [Coccidioides immitis RS]
          Length = 1072

 Score = 44.7 bits (104), Expect = 0.037,   Method: Compositional matrix adjust.
 Identities = 32/137 (23%), Positives = 62/137 (45%), Gaps = 6/137 (4%)

Query: 85   FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGAR 144
            F+++D + N+V+          +   R+   ++  LG+ VN    +  + SP  ++P   
Sbjct: 906  FLVADAEGNLVVLNRDTTGVTEDDRRRMQVTSELRLGEMVNRIHPMDLQTSP--ESPVIP 963

Query: 145  SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYA 204
              FL   A++DG++  F  +       L+ LQ+ +    +  G +    +R +K     A
Sbjct: 964  KAFL---ATVDGSIYLFGLISPSAQDTLMRLQSALADFVASPGEIPFNKYRAFKSSVRQA 1020

Query: 205  GNPSRGIIDGSLVWKFL 221
              P R  +DG L+ +FL
Sbjct: 1021 EEPFR-FVDGELIEQFL 1036


>gi|195108657|ref|XP_001998909.1| GI23368 [Drosophila mojavensis]
 gi|193915503|gb|EDW14370.1| GI23368 [Drosophila mojavensis]
          Length = 1140

 Score = 44.7 bits (104), Expect = 0.037,   Method: Compositional matrix adjust.
 Identities = 41/162 (25%), Positives = 68/162 (41%), Gaps = 16/162 (9%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVN 125
            P+ + A E + + + +G    D      LF+ Q ++  +    R  L +   FHLG  +N
Sbjct: 952  PKWMRAVEILDDDTFLGCETHDN-----LFVCQKDSAATTDEERQLLPELARFHLGDTIN 1006

Query: 126  TFFKIRCKPSPISDAPGARSRFL---TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
             F       S +    G R+  +     Y + +GA+G    +P+  Y  L  L+  +   
Sbjct: 1007 VFRH----GSLVMQNVGERTTPINGCVLYGTCNGAIGIVTQIPQDFYDFLHGLEERLKKI 1062

Query: 183  TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLS 224
                G ++   +R Y+        PS G IDG L+  FL LS
Sbjct: 1063 IKSVGKIDHTYYRNYQINTKV--EPSEGFIDGDLIESFLDLS 1102


>gi|407410979|gb|EKF33219.1| cleavage and polyadenylation specificity factor, putative
            [Trypanosoma cruzi marinkellei]
          Length = 1436

 Score = 44.7 bits (104), Expect = 0.039,   Method: Compositional matrix adjust.
 Identities = 41/185 (22%), Positives = 78/185 (42%), Gaps = 10/185 (5%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGG--HRLIKKTDFHLGQHVNTFF 128
            V  C+ +    + G + SD ++N+++  Y P  +E+  G  +++++      G++     
Sbjct: 1251 VVHCDMMYHDRAFGLLCSDDERNLLVMGYTPRVQETEAGSPNKVLESVLSLDGEY---RL 1307

Query: 129  KIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGG 188
               C    +     A +  +T Y +  G +GF +P+ E+  R    L   +     H  G
Sbjct: 1308 SGGCLVKSLRFRSLAGNSSVTLYVTNYGEIGFIVPIGEQANRTASWLMRRLQMDLPHNAG 1367

Query: 189  LNPRAFRTY-KGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELY 247
            L PR F    +G    A      ++  SL+ +F  L +  R    K I S     L+ + 
Sbjct: 1368 LTPRMFLGLSQGSPRTALRAKEMLVSASLLNEFFFLDIHSR----KTIASAAYTQLERVT 1423

Query: 248  DIEAL 252
            ++ AL
Sbjct: 1424 NVAAL 1428


>gi|378734083|gb|EHY60542.1| histone H2A [Exophiala dermatitidis NIH/UT8656]
          Length = 1361

 Score = 44.7 bits (104), Expect = 0.039,   Method: Compositional matrix adjust.
 Identities = 40/138 (28%), Positives = 60/138 (43%), Gaps = 13/138 (9%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
            V A EF+ +   +  + SD D  + +  Y PE  ++  G +L+ ++ FH G        +
Sbjct: 1166 VLAAEFLPDGKQLYIISSDDDGLLRVLQYDPENPKAERGTKLLLRSTFHSGAAPTKMILL 1225

Query: 131  -------RCKPSPI-----SDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNV 178
                   R +   I     S A  A  R      + +G+L    PL E  YRRL  LQ  
Sbjct: 1226 PPQVASGRGRDPEIDMDVDSGAGPAAGRHRILVTTQEGSLCMLTPLSEATYRRLSALQTT 1285

Query: 179  MVTHTS-HTGGLNPRAFR 195
            ++T    H   LNPRA+R
Sbjct: 1286 LLTTLDFHPCSLNPRAYR 1303


>gi|343962533|dbj|BAK62854.1| cleavage and polyadenylation specificity factor 160 kDa subunit
           [Pan troglodytes]
          Length = 269

 Score = 44.7 bits (104), Expect = 0.041,   Method: Compositional matrix adjust.
 Identities = 14/37 (37%), Positives = 30/37 (81%)

Query: 68  PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEAR 104
           P  VY+ +F+++++ +GF++SD+D+N++++MY PE  
Sbjct: 203 PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEGE 239


>gi|345328202|ref|XP_003431248.1| PREDICTED: DNA damage-binding protein 1-like [Ornithorhynchus
            anatinus]
          Length = 1045

 Score = 44.7 bits (104), Expect = 0.041,   Method: Compositional matrix adjust.
 Identities = 42/165 (25%), Positives = 70/165 (42%), Gaps = 17/165 (10%)

Query: 96   LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
            LF+ Q ++  +    R  L +   FHLG+ VN F    C  S +    G  S   +    
Sbjct: 879  LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGETSTPTQGSVL 934

Query: 151  YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
            + +++G +G    L E  Y  LL +QN +       G +    +R++  +      P+ G
Sbjct: 935  FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATG 992

Query: 211  IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
             IDG L+  FL +S  +  E+   +      G K    +D+L  I
Sbjct: 993  FIDGDLIESFLDISRPKMQEVVANLQIDDGSGMKREATVDDLIKI 1037


>gi|71654693|ref|XP_815961.1| cleavage and polyadenylation specificity factor [Trypanosoma cruzi
            strain CL Brener]
 gi|50363265|gb|AAT75335.1| cleavage polyadenylation specificity factor CPSF160 [Trypanosoma
            cruzi]
 gi|70881056|gb|EAN94110.1| cleavage and polyadenylation specificity factor, putative
            [Trypanosoma cruzi]
          Length = 1436

 Score = 44.7 bits (104), Expect = 0.042,   Method: Compositional matrix adjust.
 Identities = 40/185 (21%), Positives = 79/185 (42%), Gaps = 10/185 (5%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGG--HRLIKKTDFHLGQHVNTFF 128
            V  C+ +    + G + SD ++N+++  Y P  +E+  G  +++++      G++     
Sbjct: 1251 VVHCDMMYHDRAFGLLCSDDERNLLVMGYTPRVQETEAGSPNKVLESVLSLDGEY---RL 1307

Query: 129  KIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGG 188
               C    +     A +  +T Y +  G +GF +P+ E+  R    L   +     H+ G
Sbjct: 1308 SGGCLVKSLRFRSLAGNSSVTLYVTNYGEIGFIVPIGEQANRTASWLMRRLQIDLPHSAG 1367

Query: 189  LNPRAFRTY-KGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELY 247
            L PR F    +G    A      ++  SL+ +F  L +  R    K I S     L+ + 
Sbjct: 1368 LTPRMFLGLSQGSPRTAMRAKEMLVSASLLNEFFFLDIHSR----KTIASAAYTQLERVT 1423

Query: 248  DIEAL 252
            ++ +L
Sbjct: 1424 NVASL 1428


>gi|452820919|gb|EME27955.1| splicing factor 3B subunit 3 [Galdieria sulphuraria]
          Length = 1294

 Score = 44.7 bits (104), Expect = 0.043,   Method: Compositional matrix adjust.
 Identities = 47/182 (25%), Positives = 79/182 (43%), Gaps = 29/182 (15%)

Query: 87   ISDKDKNVVLFMYQPEA-----RESNGG-------HRLIKKTDFHLGQHVNTFFKIRCKP 134
            I DK  N+ +    PEA     ++  GG       H    +  +++G  +    K+    
Sbjct: 1100 IGDKMGNISILRLPPEAGTFIEQDPTGGLLSKEAPHHFQLEACYYVGSVIQCLSKVEWT- 1158

Query: 135  SPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLM-LQNVMVTHTSHTGGLNPRA 193
                D P      L +Y +LDGA+G  +PL       L   L+  +  + S   G +  A
Sbjct: 1159 --TGDVP------LLFYGTLDGAIGVMIPLRSTLDMELFQALELQLREYRSPLCGRHHLA 1210

Query: 194  FRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALS 253
            +R+Y    ++   P R +IDG L  +F +LSL ++ +I K++     D+  +L D    S
Sbjct: 1211 YRSY----FF---PVRHVIDGDLCEEFYRLSLEQQEKIVKELDRSIVDVHRKLEDYRERS 1263

Query: 254  SH 255
             H
Sbjct: 1264 PH 1265


>gi|407850337|gb|EKG04765.1| cleavage and polyadenylation specificity factor, putative
            [Trypanosoma cruzi]
          Length = 1436

 Score = 44.7 bits (104), Expect = 0.043,   Method: Compositional matrix adjust.
 Identities = 40/185 (21%), Positives = 79/185 (42%), Gaps = 10/185 (5%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGG--HRLIKKTDFHLGQHVNTFF 128
            V  C+ +    + G + SD ++N+++  Y P  +E+  G  +++++      G++     
Sbjct: 1251 VVHCDMMYHDRAFGLLCSDDERNLLVMGYTPRVQETEAGSPNKVLESVLSLDGEY---RL 1307

Query: 129  KIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGG 188
               C    +     A +  +T Y +  G +GF +P+ E+  R    L   +     H+ G
Sbjct: 1308 SGGCLVKSLRFRSLAGNSSVTLYVTNYGEIGFIVPIGEQANRTASWLMRRLQIDLPHSAG 1367

Query: 189  LNPRAFRTY-KGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELY 247
            L PR F    +G    A      ++  SL+ +F  L +  R    K I S     L+ + 
Sbjct: 1368 LTPRMFLGLSQGSPRTAMRAKEMLVSASLLNEFFFLDIHSR----KTIASAAYTQLERVT 1423

Query: 248  DIEAL 252
            ++ +L
Sbjct: 1424 NVASL 1428


>gi|124806507|ref|XP_001350742.1| splicing factor 3b, subunit 3, 130kD, putative [Plasmodium falciparum
            3D7]
 gi|23496869|gb|AAN36422.1|AE014849_41 splicing factor 3b, subunit 3, 130kD, putative [Plasmodium falciparum
            3D7]
          Length = 1329

 Score = 44.3 bits (103), Expect = 0.051,   Method: Compositional matrix adjust.
 Identities = 60/260 (23%), Positives = 105/260 (40%), Gaps = 59/260 (22%)

Query: 10   VSLGLHGNRPLLLVRTQHELLIYQAFRHP-KGTLKLRFKKLKVLFVSDRSKRANEQPGLP 68
            VS+ + GNR +     +  +LI+  F  P + TL+L         +SD          +P
Sbjct: 1104 VSIKISGNR-IFACDIRESVLIF--FYDPNQNTLRL---------ISDDI--------IP 1143

Query: 69   RGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGG------------------H 110
            R +  C  +++  ++  M +DK  +V +     EA++   G                   
Sbjct: 1144 RWI-TCSEILDHHTI--MAADKFDSVFILRVPEEAKQDEYGITNKCWYGGEIMNSSTKNR 1200

Query: 111  RLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYR 170
            +L     FH+G+ V +  K+R  P+         S     Y+++ G +G F+P   K   
Sbjct: 1201 KLEHMMSFHIGEIVTSMQKVRLSPT---------SSECIIYSTIMGTIGAFIPYDNKEEL 1251

Query: 171  RLLM-LQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERL 229
             L   L+ ++ T      G     FR+Y        +P + ++DG L  +F  LS   + 
Sbjct: 1252 ELTQHLEIILRTEKPPLCGREHIFFRSYY-------HPVQNVVDGDLCEQFSSLSYDAQK 1304

Query: 230  EICKKIGSKHNDILDELYDI 249
            +I   +     DIL +L DI
Sbjct: 1305 KIANDLERTPEDILRKLEDI 1324


>gi|195500686|ref|XP_002097479.1| GE26244 [Drosophila yakuba]
 gi|194183580|gb|EDW97191.1| GE26244 [Drosophila yakuba]
          Length = 1140

 Score = 44.3 bits (103), Expect = 0.053,   Method: Compositional matrix adjust.
 Identities = 42/162 (25%), Positives = 68/162 (41%), Gaps = 16/162 (9%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVN 125
            P+ + A E + + + +G      + N  LF+ Q ++  +    R  L +   FHLG  VN
Sbjct: 952  PKWMRAVEILDDDTFLG-----SETNGNLFVCQKDSAATTDEERQLLPELARFHLGDTVN 1006

Query: 126  TFFKIRCKPSPISDAPGARSRFL---TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
             F       S +    G R+  +     Y + +GA+G    +P+  Y  L  LQ  +   
Sbjct: 1007 VFRH----GSLVMQNVGERTTPINGCVLYGTCNGAIGIVTQIPQDFYDFLHGLQERLKKI 1062

Query: 183  TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLS 224
                G +    +R ++        PS G IDG L+  FL LS
Sbjct: 1063 IKSVGKIEHTYYRNFQINNKV--EPSEGFIDGDLIESFLDLS 1102


>gi|440639387|gb|ELR09306.1| hypothetical protein GMDG_03874 [Geomyces destructans 20631-21]
          Length = 1138

 Score = 43.9 bits (102), Expect = 0.058,   Method: Compositional matrix adjust.
 Identities = 40/165 (24%), Positives = 72/165 (43%), Gaps = 8/165 (4%)

Query: 84   GFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGA 143
            G++ +D D N+++      A       ++    +FHLG+ VN   KIR   S      GA
Sbjct: 976  GWLEADADGNLIVLRRNEAAVTFEDRKKMEVTGEFHLGEQVNRIRKIRVDAS-----EGA 1030

Query: 144  RSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYY 203
                  + A+ +G+L  +  +   +   LL LQ  +  +    G +    +R+++     
Sbjct: 1031 TVVPRAFLATTEGSLFLYGSVAPASQDLLLRLQQRLAENVETPGNIPFTTYRSFRNAERE 1090

Query: 204  AGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHN--DILDEL 246
               P R  IDG L+ +FL L    +  +CK +       D+++EL
Sbjct: 1091 TEEPYR-FIDGELIERFLDLDEERQEVVCKGLAKVEEVRDLVEEL 1134


>gi|221061705|ref|XP_002262422.1| splicing factor 3b, subunit 3, 130kd [Plasmodium knowlesi strain H]
 gi|193811572|emb|CAQ42300.1| splicing factor 3b, subunit 3, 130kd, putative [Plasmodium knowlesi
            strain H]
          Length = 1276

 Score = 43.9 bits (102), Expect = 0.059,   Method: Compositional matrix adjust.
 Identities = 46/202 (22%), Positives = 84/202 (41%), Gaps = 38/202 (18%)

Query: 67   LPRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARE------------------SNG 108
            +PR +  C  +++  ++  M +DK  +V +     EA++                  SN 
Sbjct: 1089 IPRWI-TCSEILDHHTI--MAADKFDSVFVLRVPEEAKQEEYGISNKCWYGGEMMAGSNK 1145

Query: 109  GHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKN 168
              RL    +FH+G+ V +  K++  P+         S     Y+++ G +G F+P   K 
Sbjct: 1146 NRRLEHIMNFHVGEIVTSLQKVKLSPT---------SSECIIYSTIMGTIGAFIPYDNKE 1196

Query: 169  YRRLLM-LQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGE 227
               L   L+ ++ T      G     FR+Y        +P + +IDG L  +F  L    
Sbjct: 1197 ELELTQHLEIILRTENPPLCGREHIFFRSYY-------HPVQHVIDGDLCEQFSSLPYDI 1249

Query: 228  RLEICKKIGSKHNDILDELYDI 249
            + ++   +    +DIL +L DI
Sbjct: 1250 QRKVAADLERTPDDILRKLEDI 1271


>gi|342885673|gb|EGU85655.1| hypothetical protein FOXB_03801 [Fusarium oxysporum Fo5176]
          Length = 1160

 Score = 43.9 bits (102), Expect = 0.061,   Method: Compositional matrix adjust.
 Identities = 33/158 (20%), Positives = 75/158 (47%), Gaps = 6/158 (3%)

Query: 85   FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGAR 144
            ++ +D   N+V+     +A       RL   ++ ++G+ +N   +IR    P+++     
Sbjct: 998  WLEADSKGNLVVLQRNVDAPTEQDRSRLEITSEMNIGEQIN---RIRKLHVPMAENGIVH 1054

Query: 145  SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYA 204
             R   + AS +G+L  +  +  +    L+  Q+ M  +    G +  + +R+++ +   +
Sbjct: 1055 PR--AFLASAEGSLYLYGDIAPQYQDLLMTFQSKMEEYIHVPGSVEFKLWRSFRNENRES 1112

Query: 205  GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
              P R  IDG +V +FL +  G++  +C+ +G    D+
Sbjct: 1113 EGPFR-FIDGEMVERFLDMDEGKQELVCEGLGPSIEDM 1149


>gi|156049323|ref|XP_001590628.1| hypothetical protein SS1G_08368 [Sclerotinia sclerotiorum 1980]
 gi|154692767|gb|EDN92505.1| hypothetical protein SS1G_08368 [Sclerotinia sclerotiorum 1980 UF-70]
          Length = 1153

 Score = 43.9 bits (102), Expect = 0.061,   Method: Compositional matrix adjust.
 Identities = 41/180 (22%), Positives = 76/180 (42%), Gaps = 20/180 (11%)

Query: 77   VIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSP 136
            + E  +  ++ SD D N+++     E        RL    + +LG+ VN   +I  + S 
Sbjct: 969  IAEVDTDTYLESDHDGNLIVLKRNREGVTREDKLRLEVLCEMNLGEMVNKIKRINVETS- 1027

Query: 137  ISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHT---------- 186
              DA      F+   A+ +G++  F  +P +N   L+ LQ+ + +  + +          
Sbjct: 1028 -KDALLIPRAFV---ATTEGSIYLFSLIPPQNQDLLMRLQSRLASLPARSLTDSSFSAPI 1083

Query: 187  ----GGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
                G L+   +R+Y         P R  +DG L+ +FL L    +  IC  +G +  D+
Sbjct: 1084 EFSPGNLDFDKYRSYVSAVRETNEPFR-FVDGELIERFLDLDGAIQENICDGLGVRAEDL 1142


>gi|452838792|gb|EME40732.1| hypothetical protein DOTSEDRAFT_177898 [Dothistroma septosporum
            NZE10]
          Length = 1138

 Score = 43.9 bits (102), Expect = 0.064,   Method: Compositional matrix adjust.
 Identities = 36/154 (23%), Positives = 67/154 (43%), Gaps = 8/154 (5%)

Query: 85   FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGAR 144
            ++ SD + N+++  +           RL   ++  LG+ VN   +I   P+      GA 
Sbjct: 974  YLQSDAEGNLIVLQHDTNGFSEEDRRRLRVTSELLLGEMVNRIRRIDVTPTH-----GAL 1028

Query: 145  SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKG--KGY 202
                 + A+++G++  F  +       L+ +QN M +     G +    FR +K   +  
Sbjct: 1029 VIPRAFLATVEGSIYLFALIVPGKQDLLMRMQNNMASLVKSPGHVEFATFRGFKNQVRDE 1088

Query: 203  YAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 236
             A  PSR  +DG L+ +FL      + EI + +G
Sbjct: 1089 GANGPSR-FVDGELIERFLDCGQDIQEEIIRDLG 1121


>gi|356525403|ref|XP_003531314.1| PREDICTED: DNA damage-binding protein 1-like isoform 2 [Glycine max]
          Length = 1068

 Score = 43.9 bits (102), Expect = 0.069,   Method: Compositional matrix adjust.
 Identities = 33/129 (25%), Positives = 57/129 (44%), Gaps = 12/129 (9%)

Query: 111  RLIKKTDFHLGQHVNTF----FKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPE 166
            RL    ++HLG+ VN F      +R   S +   P         + +++G +G    LP 
Sbjct: 930  RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIP------TVIFGTINGVIGVIASLPH 983

Query: 167  KNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLG 226
            + Y  L  LQ+ +       GGL+   +R++  +       +R  +DG L+  FL L+  
Sbjct: 984  EQYVFLEKLQSNLRKVIKGVGGLSHEQWRSFNNEKKTV--EARNFLDGDLIESFLDLNRS 1041

Query: 227  ERLEICKKI 235
            +  EI K +
Sbjct: 1042 KMDEISKAV 1050


>gi|327278830|ref|XP_003224163.1| PREDICTED: DNA damage-binding protein 1-like [Anolis carolinensis]
          Length = 1140

 Score = 43.9 bits (102), Expect = 0.069,   Method: Compositional matrix adjust.
 Identities = 37/141 (26%), Positives = 60/141 (42%), Gaps = 15/141 (10%)

Query: 118  FHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLM 174
            FHLG+ VN F    C  S +    G  S   +    + +++G +G    L E  Y  LL 
Sbjct: 998  FHLGEFVNVF----CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLD 1053

Query: 175  LQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKK 234
            +QN +       G +    +R++  +      P+ G IDG L+  FL +S  +  E+   
Sbjct: 1054 VQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVAN 1111

Query: 235  I------GSKHNDILDELYDI 249
            +      G K    +D+L  I
Sbjct: 1112 LQIDDGSGMKREATVDDLIKI 1132


>gi|356525401|ref|XP_003531313.1| PREDICTED: DNA damage-binding protein 1-like isoform 1 [Glycine max]
          Length = 1089

 Score = 43.9 bits (102), Expect = 0.072,   Method: Compositional matrix adjust.
 Identities = 33/129 (25%), Positives = 57/129 (44%), Gaps = 12/129 (9%)

Query: 111  RLIKKTDFHLGQHVNTF----FKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPE 166
            RL    ++HLG+ VN F      +R   S +   P         + +++G +G    LP 
Sbjct: 951  RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIP------TVIFGTINGVIGVIASLPH 1004

Query: 167  KNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLG 226
            + Y  L  LQ+ +       GGL+   +R++  +       +R  +DG L+  FL L+  
Sbjct: 1005 EQYVFLEKLQSNLRKVIKGVGGLSHEQWRSFNNEKKTV--EARNFLDGDLIESFLDLNRS 1062

Query: 227  ERLEICKKI 235
            +  EI K +
Sbjct: 1063 KMDEISKAV 1071


>gi|389586447|dbj|GAB69176.1| splicing factor 3B subunit 3 [Plasmodium cynomolgi strain B]
          Length = 1286

 Score = 43.5 bits (101), Expect = 0.076,   Method: Compositional matrix adjust.
 Identities = 46/202 (22%), Positives = 83/202 (41%), Gaps = 38/202 (18%)

Query: 67   LPRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARE------------------SNG 108
            +PR +  C  +++  ++  M +DK  +V +     EA++                  SN 
Sbjct: 1099 IPRWI-TCSEILDHHTI--MAADKFDSVFVLRVPEEAKQEEYGISNKCWYGGEIMAGSNK 1155

Query: 109  GHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKN 168
              RL     FH+G+ V +  K++  P+         S     Y+++ G +G F+P   K 
Sbjct: 1156 NRRLEHIMSFHVGEIVTSLQKVKLSPT---------SSECIIYSTIMGTIGAFIPYDNKE 1206

Query: 169  YRRLLM-LQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGE 227
               L   L+ ++ T      G     FR+Y        +P + +IDG L  +F  L    
Sbjct: 1207 ELELTQHLEIILRTENPPLCGREHIFFRSYY-------HPVQHVIDGDLCEQFSSLPYDV 1259

Query: 228  RLEICKKIGSKHNDILDELYDI 249
            + ++   +    +DIL +L DI
Sbjct: 1260 QRKVAADLERTPDDILRKLEDI 1281


>gi|301616502|ref|XP_002937687.1| PREDICTED: DNA damage-binding protein 1-like [Xenopus (Silurana)
            tropicalis]
          Length = 1140

 Score = 43.5 bits (101), Expect = 0.079,   Method: Compositional matrix adjust.
 Identities = 41/165 (24%), Positives = 70/165 (42%), Gaps = 17/165 (10%)

Query: 96   LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
            LF+ Q ++  +    R  L +   FHLG+ VN F    C  S +    G  S   +    
Sbjct: 974  LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGETSPPTQGSVL 1029

Query: 151  YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
            + +++G +G    L E  Y  LL +QN +       G +    +R++  +      P+ G
Sbjct: 1030 FGTVNGMIGLVTSLSESWYNLLLDVQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATG 1087

Query: 211  IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
             IDG L+  FL +S  +  E+   +      G K    +D+L  +
Sbjct: 1088 FIDGDLIESFLDISRPKMQEVIANLQIDDGSGMKRETTVDDLIKV 1132


>gi|156095699|ref|XP_001613884.1| Splicing factor 3B subunit 3 [Plasmodium vivax Sal-1]
 gi|148802758|gb|EDL44157.1| Splicing factor 3B subunit 3, putative [Plasmodium vivax]
          Length = 1230

 Score = 43.5 bits (101), Expect = 0.079,   Method: Compositional matrix adjust.
 Identities = 46/202 (22%), Positives = 83/202 (41%), Gaps = 38/202 (18%)

Query: 67   LPRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARE------------------SNG 108
            +PR +  C  +++  ++  M +DK  +V +     EA++                  SN 
Sbjct: 1043 IPRWI-TCSEILDHHTI--MAADKFDSVFVLRVPEEAKQEEYGISNKCWYGGEIMAGSNK 1099

Query: 109  GHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKN 168
              RL     FH+G+ V +  K++  P+         S     Y+++ G +G F+P   K 
Sbjct: 1100 NRRLEHIMSFHVGEIVTSLQKVKLSPT---------SSECIIYSTIMGTIGAFIPYDNKE 1150

Query: 169  YRRLLM-LQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGE 227
               L   L+ ++ T      G     FR+Y        +P + +IDG L  +F  L    
Sbjct: 1151 ELELTQHLEIILRTENPPLCGREHIFFRSYY-------HPVQHVIDGDLCEQFSSLPYDV 1203

Query: 228  RLEICKKIGSKHNDILDELYDI 249
            + ++   +    +DIL +L DI
Sbjct: 1204 QRKVAADLERTPDDILRKLEDI 1225


>gi|384253371|gb|EIE26846.1| hypothetical protein COCSUDRAFT_52476 [Coccomyxa subellipsoidea
            C-169]
          Length = 1205

 Score = 43.5 bits (101), Expect = 0.081,   Method: Compositional matrix adjust.
 Identities = 40/145 (27%), Positives = 63/145 (43%), Gaps = 18/145 (12%)

Query: 107  NGG-HRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLP 165
            NG  H+L    +FH+G  V +  +   +P       G R   L  YA++ GA+G  LP P
Sbjct: 1072 NGAPHKLEDVVNFHVGDLVTSLQRAVLQP-------GGREVLL--YATVMGAIGAMLPFP 1122

Query: 166  EKNYRRLLM-LQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLS 224
             +        L+  +       GG   R   +Y+G  +    P + +IDG L   F QL 
Sbjct: 1123 SREDVDFFSHLEMHLRQEHPPMGG---RDHMSYRGSYF----PVKDVIDGDLCEHFSQLP 1175

Query: 225  LGERLEICKKIGSKHNDILDELYDI 249
              ++  I  ++     +IL +L DI
Sbjct: 1176 AAKQKSIADELERTPGEILKKLEDI 1200


>gi|168047617|ref|XP_001776266.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162672361|gb|EDQ58899.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 1089

 Score = 43.5 bits (101), Expect = 0.081,   Method: Compositional matrix adjust.
 Identities = 34/126 (26%), Positives = 55/126 (43%), Gaps = 12/126 (9%)

Query: 111  RLIKKTDFHLGQHVNTF----FKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPE 166
            RL    ++HLG+ VN F      +R   S  S  P         + +++G +G    LP+
Sbjct: 951  RLEVVGEYHLGEFVNRFRHGSLVMRLPDSEASLIP------TVIFGTVNGVIGVIASLPQ 1004

Query: 167  KNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLG 226
              +  L  LQ  +V      GGL+   +R++  +       +R  +DG L+  FL LS  
Sbjct: 1005 DKFLFLQKLQQALVKVIKGVGGLSHEQWRSFSNERKTV--DARNFLDGDLIESFLDLSRN 1062

Query: 227  ERLEIC 232
            +  EI 
Sbjct: 1063 KMEEIA 1068


>gi|147906138|ref|NP_001083624.1| DNA damage-binding protein 1 [Xenopus laevis]
 gi|82186503|sp|Q6P6Z0.1|DDB1_XENLA RecName: Full=DNA damage-binding protein 1; AltName:
            Full=Damage-specific DNA-binding protein 1
 gi|38303806|gb|AAH61946.1| Ddb1 protein [Xenopus laevis]
          Length = 1140

 Score = 43.5 bits (101), Expect = 0.081,   Method: Compositional matrix adjust.
 Identities = 41/165 (24%), Positives = 70/165 (42%), Gaps = 17/165 (10%)

Query: 96   LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
            LF+ Q ++  +    R  L +   FHLG+ VN F    C  S +    G  S   +    
Sbjct: 974  LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGETSPPTQGSVL 1029

Query: 151  YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
            + +++G +G    L E  Y  LL +QN +       G +    +R++  +      P+ G
Sbjct: 1030 FGTVNGMIGLVTSLSESWYNLLLDVQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATG 1087

Query: 211  IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
             IDG L+  FL +S  +  E+   +      G K    +D+L  +
Sbjct: 1088 FIDGDLIESFLDISRPKMQEVIANLQIDDGSGMKRETTVDDLIKV 1132


>gi|195037449|ref|XP_001990173.1| GH18378 [Drosophila grimshawi]
 gi|193894369|gb|EDV93235.1| GH18378 [Drosophila grimshawi]
          Length = 1140

 Score = 43.5 bits (101), Expect = 0.083,   Method: Compositional matrix adjust.
 Identities = 40/162 (24%), Positives = 68/162 (41%), Gaps = 16/162 (9%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVN 125
            P+ + A E + + + +G    D      LF+ Q ++  +    R  L +   FHLG  +N
Sbjct: 952  PKWMRAVEILDDDTFLGCETHDN-----LFVCQKDSAATTDEERQLLPELARFHLGDTIN 1006

Query: 126  TFFKIRCKPSPISDAPGARSRFL---TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
             F       S +    G R+  +     Y + +GA+G    +P+  Y  L  L+  +   
Sbjct: 1007 VFRH----GSLVMQNVGERTTPINGCVLYGTCNGAIGIVTQIPQDFYDFLHGLEERLKKI 1062

Query: 183  TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLS 224
                G ++   +R Y+        PS G IDG L+  FL L+
Sbjct: 1063 IKSVGKIDHTYYRNYQINTKV--EPSEGFIDGDLIESFLDLN 1102


>gi|195395112|ref|XP_002056180.1| GJ10363 [Drosophila virilis]
 gi|194142889|gb|EDW59292.1| GJ10363 [Drosophila virilis]
          Length = 1140

 Score = 43.5 bits (101), Expect = 0.088,   Method: Compositional matrix adjust.
 Identities = 40/162 (24%), Positives = 68/162 (41%), Gaps = 16/162 (9%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVN 125
            P+ + A E + + + +G    D      LF+ Q ++  +    R  L +   FHLG  +N
Sbjct: 952  PKWMRAVEILDDDTFLGCETHDN-----LFVCQKDSAATTDEERQLLPELARFHLGDTIN 1006

Query: 126  TFFKIRCKPSPISDAPGARSRFL---TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
             F       S +    G R+  +     Y + +GA+G    +P+  Y  L  L+  +   
Sbjct: 1007 VFRH----GSLVMQNVGERTTPINGCVLYGTCNGAIGIVTQIPQDFYDFLHGLEERLKKI 1062

Query: 183  TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLS 224
                G ++   +R Y+        PS G IDG L+  FL L+
Sbjct: 1063 IKSVGKIDHTYYRNYQINTKV--EPSEGFIDGDLIESFLDLN 1102


>gi|194377326|dbj|BAG57611.1| unnamed protein product [Homo sapiens]
          Length = 451

 Score = 43.5 bits (101), Expect = 0.097,   Method: Compositional matrix adjust.
 Identities = 37/145 (25%), Positives = 63/145 (43%), Gaps = 11/145 (7%)

Query: 96  LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
           LF+ Q ++  +    R  L +   FHLG+ VN F    C  S +    G  S   +    
Sbjct: 285 LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGETSTPTQGSVL 340

Query: 151 YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
           + +++G +G    L E  Y  LL +QN +       G +    +R++  +      P+ G
Sbjct: 341 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATG 398

Query: 211 IIDGSLVWKFLQLSLGERLEICKKI 235
            IDG L+  FL +S  +  E+   +
Sbjct: 399 FIDGDLIESFLDISRPKMQEVVANL 423


>gi|159470709|ref|XP_001693499.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158283002|gb|EDP08753.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 279

 Score = 43.1 bits (100), Expect = 0.099,   Method: Compositional matrix adjust.
 Identities = 49/203 (24%), Positives = 84/203 (41%), Gaps = 24/203 (11%)

Query: 69  RGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESN----GGHRLIKKTDFHLGQHV 124
           R V  C  VI    + F+ +D    + +  +  + R++N     G RL      H+ + V
Sbjct: 82  RDVLTCGVVIAEPKLHFLAADAAGTLQMMEFYGK-RDTNPEFWAGQRLAPMGLLHVARRV 140

Query: 125 NTFFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHT- 183
                ++        +   R+R      S +G L F  P+P+      L      ++ T 
Sbjct: 141 GVAASVQLA------SRDGRNRHALLCGSAEGGLSFVAPVPDPQAAARLAALQAHMSATL 194

Query: 184 SHTGGLNPRAFRTY---------KGKGYYAGNPSRG---IIDGSLVWKFLQLSLGERLEI 231
            H  GLNPR+FR            G+ + A  P R    ++DG L+  F  LS  ++ E 
Sbjct: 195 PHVAGLNPRSFRHRFIRIPKALGGGEHHRAPLPPRNNSGLLDGQLLLGFPHLSRQQQAEA 254

Query: 232 CKKIGSKHNDILDELYDIEALSS 254
            + +GS    +L++L  I A ++
Sbjct: 255 AEAVGSSPQQLLEDLRAIAAAAT 277


>gi|74025892|ref|XP_829512.1| cleavage and polyadenylation specificity factor-like protein
            [Trypanosoma brucei brucei strain 927/4 GUTat10.1]
 gi|70834898|gb|EAN80400.1| cleavage and polyadenylation specificity factor-like protein,
            putative [Trypanosoma brucei brucei strain 927/4
            GUTat10.1]
          Length = 1452

 Score = 43.1 bits (100), Expect = 0.099,   Method: Compositional matrix adjust.
 Identities = 45/186 (24%), Positives = 73/186 (39%), Gaps = 12/186 (6%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGG-HRLIKKTDFHLGQHVNTFFK 129
            V  C+ +    + G + SD ++NV++  Y P  +E++ G H  + ++   L         
Sbjct: 1267 VVHCDMMYHDRAFGILCSDDERNVLIMGYTPRVQETDAGTHPKVLESVLSLDGEYRLPSG 1326

Query: 130  IRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGL 189
               K        G  S  +T Y S  G +GF +P+ E+  R  L +   +        GL
Sbjct: 1327 SLVKSLRFRSTAGNSS--VTLYVSNYGEIGFIVPIGEQANRTALWVTRRLQIDLPCEAGL 1384

Query: 190  NPRAFRTYKGKGYYAGNPSRG---IIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEL 246
             PR F +   +     N  RG   ++   L+     L L  R    K I       LD +
Sbjct: 1385 TPRMFLSLNQRS--PRNSLRGKEMLVPAPLLRGLFSLDLRSR----KAIARAAYTQLDRV 1438

Query: 247  YDIEAL 252
             +I AL
Sbjct: 1439 ANIVAL 1444


>gi|260947152|ref|XP_002617873.1| hypothetical protein CLUG_01332 [Clavispora lusitaniae ATCC 42720]
 gi|238847745|gb|EEQ37209.1| hypothetical protein CLUG_01332 [Clavispora lusitaniae ATCC 42720]
          Length = 1242

 Score = 43.1 bits (100), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 42/172 (24%), Positives = 75/172 (43%), Gaps = 21/172 (12%)

Query: 87   ISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARSR 146
            ++++ +N VL  ++ E   ++   RL K  DF+    V +  K        S   G    
Sbjct: 1077 VAEQLENNVLMKFEEETLGASSS-RLDKLCDFYTQDIVTSLHKG-------SFVVGGSES 1128

Query: 147  FLTWYASLDGALGFFLPLPEKNYRRLLM-LQNVMVTHTSHT--------GGLNPRAFRTY 197
             +  Y  L G +G  LPL       LLM L+N +  + + +         G N       
Sbjct: 1129 II--YTGLQGTVGILLPLATTQEVDLLMKLENSLRDYFNDSFDDFDNTKQGFNLVGREHL 1186

Query: 198  KGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI 249
            K +GYY  NP   +IDG  + +F +L+   ++++  ++     DI  ++YD+
Sbjct: 1187 KFRGYY--NPVENVIDGDFIERFFELNPSAQVKLAGRLDKSPRDIERKIYDL 1236


>gi|254585271|ref|XP_002498203.1| ZYRO0G04730p [Zygosaccharomyces rouxii]
 gi|238941097|emb|CAR29270.1| ZYRO0G04730p [Zygosaccharomyces rouxii]
          Length = 1302

 Score = 43.1 bits (100), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 23/55 (41%), Positives = 33/55 (60%), Gaps = 3/55 (5%)

Query: 196  TYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI-GSKHNDILDELYDI 249
            ++K + YYA  P R +IDG L   FL LSL E+ ++CK+  GS    +  +L DI
Sbjct: 1245 SFKYRSYYA--PVRNVIDGDLCETFLNLSLSEQTKLCKETSGSNPEGVCKQLNDI 1297


>gi|355752055|gb|EHH56175.1| Damage-specific DNA-binding protein 1, partial [Macaca fascicularis]
          Length = 1125

 Score = 43.1 bits (100), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 41/165 (24%), Positives = 69/165 (41%), Gaps = 17/165 (10%)

Query: 96   LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
            LF+ Q ++  +    R  L +   FHLG+ VN F    C  S +    G  S   +    
Sbjct: 959  LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGETSTPTQGSVL 1014

Query: 151  YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
            + +++G +G    L E  Y  LL +QN +       G +    +R++  +      P+ G
Sbjct: 1015 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATG 1072

Query: 211  IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
             IDG L+  FL +S  +  E+   +      G K     D+L  +
Sbjct: 1073 FIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKV 1117


>gi|119594343|gb|EAW73937.1| damage-specific DNA binding protein 1, 127kDa, isoform CRA_e [Homo
           sapiens]
          Length = 896

 Score = 43.1 bits (100), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 41/165 (24%), Positives = 69/165 (41%), Gaps = 17/165 (10%)

Query: 96  LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
           LF+ Q ++  +    R  L +   FHLG+ VN F    C  S +    G  S   +    
Sbjct: 730 LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGETSTPTQGSVL 785

Query: 151 YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
           + +++G +G    L E  Y  LL +QN +       G +    +R++  +      P+ G
Sbjct: 786 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATG 843

Query: 211 IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
            IDG L+  FL +S  +  E+   +      G K     D+L  +
Sbjct: 844 FIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKV 888


>gi|449519304|ref|XP_004166675.1| PREDICTED: DNA damage-binding protein 1a-like [Cucumis sativus]
          Length = 596

 Score = 43.1 bits (100), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 33/129 (25%), Positives = 57/129 (44%), Gaps = 12/129 (9%)

Query: 111 RLIKKTDFHLGQHVNTF----FKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPE 166
           RL    ++HLG+ VN F      +R   S +   P         + S++G +G    LP 
Sbjct: 458 RLEVVGEYHLGEFVNRFQHGSLVMRLPDSDVGQIP------TVIFGSVNGVIGVIASLPH 511

Query: 167 KNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLG 226
             Y  L  LQ+ +       GGL+   +R++  +   A   ++  +DG L+  FL L+  
Sbjct: 512 DQYVFLERLQSNLRKVIKGVGGLSHEQWRSFNNEKRTA--EAKNFLDGDLIESFLDLNRS 569

Query: 227 ERLEICKKI 235
           +  EI + +
Sbjct: 570 KMEEISRAM 578


>gi|326474493|gb|EGD98502.1| UV-damaged DNA binding protein [Trichophyton tonsurans CBS 112818]
 gi|326481558|gb|EGE05568.1| UV-damaged DNA binding protein [Trichophyton equinum CBS 127.97]
          Length = 1147

 Score = 43.1 bits (100), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 36/152 (23%), Positives = 70/152 (46%), Gaps = 8/152 (5%)

Query: 85   FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGAR 144
            ++++D + N+V+          +   RL   ++  LG+ VN    I  + +    A  AR
Sbjct: 982  YLLADAEGNLVVLQQNITGVTESDRRRLQPTSEIRLGEMVNRIHPIVVQ-THTETAVSAR 1040

Query: 145  SRFLTWYASLDGALGFFLPLPEKNYRRLLM-LQNVMVTHTSHTGGLNPRAFRTYKGKGYY 203
            +      A++DG++ +   L   NY  LL+ LQ  M +  +  G +    +R ++     
Sbjct: 1041 A----LLATVDGSI-YLFGLINPNYVDLLLRLQTAMGSIATSPGEIPFSKYRAFRTTVRQ 1095

Query: 204  AGNPSRGIIDGSLVWKFLQLSLGERLEICKKI 235
            +  P R  +DG L+ +FL  +L  + EI  ++
Sbjct: 1096 SDEPFR-FVDGELIERFLSCTLAMQQEIVGRL 1126


>gi|149725200|ref|XP_001502072.1| PREDICTED: DNA damage-binding protein 1 [Equus caballus]
          Length = 1140

 Score = 43.1 bits (100), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 36/134 (26%), Positives = 59/134 (44%), Gaps = 11/134 (8%)

Query: 96   LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
            LF+ Q ++  +    R  L +   FHLG+ VN F    C  S +    G  S   +    
Sbjct: 974  LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGETSTPTQGSVL 1029

Query: 151  YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
            + +++G +G    L E  Y  LL +QN +       G +    +R++  +      P+ G
Sbjct: 1030 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATG 1087

Query: 211  IIDGSLVWKFLQLS 224
             IDG L+  FL +S
Sbjct: 1088 FIDGDLIESFLDIS 1101


>gi|418316|sp|P33194.1|DDB1_CERAE RecName: Full=DNA damage-binding protein 1; AltName: Full=DDB p127
            subunit; AltName: Full=DDBa; AltName:
            Full=Damage-specific DNA-binding protein 1; AltName:
            Full=UV-damaged DNA-binding protein 1; Short=UV-DDB 1
 gi|304026|gb|AAA03021.1| UV-damaged DNA-binding protein [Chlorocebus aethiops]
          Length = 1140

 Score = 43.1 bits (100), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 41/165 (24%), Positives = 69/165 (41%), Gaps = 17/165 (10%)

Query: 96   LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
            LF+ Q ++  +    R  L +   FHLG+ VN F    C  S +    G  S   +    
Sbjct: 974  LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGETSTPTQGSVL 1029

Query: 151  YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
            + +++G +G    L E  Y  LL +QN +       G +    +R++  +      P+ G
Sbjct: 1030 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATG 1087

Query: 211  IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
             IDG L+  FL +S  +  E+   +      G K     D+L  +
Sbjct: 1088 FIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKV 1132


>gi|356512638|ref|XP_003525025.1| PREDICTED: DNA damage-binding protein 1a-like isoform 2 [Glycine max]
          Length = 1068

 Score = 43.1 bits (100), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 33/129 (25%), Positives = 57/129 (44%), Gaps = 12/129 (9%)

Query: 111  RLIKKTDFHLGQHVNTF----FKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPE 166
            RL    ++HLG+ VN F      +R   S +   P         + +++G +G    LP 
Sbjct: 930  RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIP------TVIFGTINGVIGVIASLPH 983

Query: 167  KNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLG 226
            + Y  L  LQ+ +       GGL+   +R++  +       +R  +DG L+  FL L+  
Sbjct: 984  EQYVFLEKLQSNLRKVIKGVGGLSHEQWRSFNNEKKTV--EARNFLDGDLIESFLDLNRS 1041

Query: 227  ERLEICKKI 235
            +  EI K +
Sbjct: 1042 KMDEISKAL 1050


>gi|356512636|ref|XP_003525024.1| PREDICTED: DNA damage-binding protein 1a-like isoform 1 [Glycine max]
          Length = 1089

 Score = 43.1 bits (100), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 33/129 (25%), Positives = 57/129 (44%), Gaps = 12/129 (9%)

Query: 111  RLIKKTDFHLGQHVNTF----FKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPE 166
            RL    ++HLG+ VN F      +R   S +   P         + +++G +G    LP 
Sbjct: 951  RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIP------TVIFGTINGVIGVIASLPH 1004

Query: 167  KNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLG 226
            + Y  L  LQ+ +       GGL+   +R++  +       +R  +DG L+  FL L+  
Sbjct: 1005 EQYVFLEKLQSNLRKVIKGVGGLSHEQWRSFNNEKKTV--EARNFLDGDLIESFLDLNRS 1062

Query: 227  ERLEICKKI 235
            +  EI K +
Sbjct: 1063 KMDEISKAL 1071


>gi|413081953|ref|NP_741992.2| DNA damage-binding protein 1 [Rattus norvegicus]
 gi|293344614|ref|XP_002725831.1| PREDICTED: DNA damage-binding protein 1 [Rattus norvegicus]
 gi|293356422|ref|XP_002728912.1| PREDICTED: DNA damage-binding protein 1 [Rattus norvegicus]
 gi|149062405|gb|EDM12828.1| damage-specific DNA binding protein 1 [Rattus norvegicus]
          Length = 1140

 Score = 43.1 bits (100), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 41/165 (24%), Positives = 69/165 (41%), Gaps = 17/165 (10%)

Query: 96   LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
            LF+ Q ++  +    R  L +   FHLG+ VN F    C  S +    G  S   +    
Sbjct: 974  LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGETSTPTQGSVL 1029

Query: 151  YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
            + +++G +G    L E  Y  LL +QN +       G +    +R++  +      P+ G
Sbjct: 1030 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATG 1087

Query: 211  IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
             IDG L+  FL +S  +  E+   +      G K     D+L  +
Sbjct: 1088 FIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKV 1132


>gi|344295432|ref|XP_003419416.1| PREDICTED: DNA damage-binding protein 1 [Loxodonta africana]
          Length = 1140

 Score = 43.1 bits (100), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 41/165 (24%), Positives = 69/165 (41%), Gaps = 17/165 (10%)

Query: 96   LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
            LF+ Q ++  +    R  L +   FHLG+ VN F    C  S +    G  S   +    
Sbjct: 974  LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGETSTPTQGSVL 1029

Query: 151  YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
            + +++G +G    L E  Y  LL +QN +       G +    +R++  +      P+ G
Sbjct: 1030 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATG 1087

Query: 211  IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
             IDG L+  FL +S  +  E+   +      G K     D+L  +
Sbjct: 1088 FIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKV 1132


>gi|354504619|ref|XP_003514371.1| PREDICTED: DNA damage-binding protein 1-like [Cricetulus griseus]
 gi|344258340|gb|EGW14444.1| DNA damage-binding protein 1 [Cricetulus griseus]
          Length = 1140

 Score = 43.1 bits (100), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 41/165 (24%), Positives = 69/165 (41%), Gaps = 17/165 (10%)

Query: 96   LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
            LF+ Q ++  +    R  L +   FHLG+ VN F    C  S +    G  S   +    
Sbjct: 974  LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGETSTPTQGSVL 1029

Query: 151  YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
            + +++G +G    L E  Y  LL +QN +       G +    +R++  +      P+ G
Sbjct: 1030 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATG 1087

Query: 211  IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
             IDG L+  FL +S  +  E+   +      G K     D+L  +
Sbjct: 1088 FIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKV 1132


>gi|73983859|ref|XP_533275.2| PREDICTED: DNA damage-binding protein 1 [Canis lupus familiaris]
 gi|291409601|ref|XP_002721069.1| PREDICTED: damage-specific DNA binding protein 1 [Oryctolagus
            cuniculus]
 gi|301781686|ref|XP_002926259.1| PREDICTED: DNA damage-binding protein 1-like [Ailuropoda melanoleuca]
          Length = 1140

 Score = 43.1 bits (100), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 41/165 (24%), Positives = 69/165 (41%), Gaps = 17/165 (10%)

Query: 96   LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
            LF+ Q ++  +    R  L +   FHLG+ VN F    C  S +    G  S   +    
Sbjct: 974  LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGETSTPTQGSVL 1029

Query: 151  YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
            + +++G +G    L E  Y  LL +QN +       G +    +R++  +      P+ G
Sbjct: 1030 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATG 1087

Query: 211  IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
             IDG L+  FL +S  +  E+   +      G K     D+L  +
Sbjct: 1088 FIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKV 1132


>gi|148529014|ref|NP_001914.3| DNA damage-binding protein 1 [Homo sapiens]
 gi|296218432|ref|XP_002807395.1| PREDICTED: LOW QUALITY PROTEIN: DNA damage-binding protein 1
            [Callithrix jacchus]
 gi|397516558|ref|XP_003828491.1| PREDICTED: DNA damage-binding protein 1 [Pan paniscus]
 gi|402893195|ref|XP_003909786.1| PREDICTED: DNA damage-binding protein 1 [Papio anubis]
 gi|426368721|ref|XP_004051351.1| PREDICTED: DNA damage-binding protein 1 [Gorilla gorilla gorilla]
 gi|12643730|sp|Q16531.1|DDB1_HUMAN RecName: Full=DNA damage-binding protein 1; AltName: Full=DDB p127
            subunit; AltName: Full=DNA damage-binding protein a;
            Short=DDBa; AltName: Full=Damage-specific DNA-binding
            protein 1; AltName: Full=HBV X-associated protein 1;
            Short=XAP-1; AltName: Full=UV-damaged DNA-binding factor;
            AltName: Full=UV-damaged DNA-binding protein 1;
            Short=UV-DDB 1; AltName: Full=XPE-binding factor;
            Short=XPE-BF; AltName: Full=Xeroderma pigmentosum group
            E-complementing protein; Short=XPCe
 gi|203282525|pdb|3E0C|A Chain A, Crystal Structure Of Dna Damage-Binding Protein 1(Ddb1)
 gi|695362|gb|AAA62838.1| X-associated protein 1, partial [Homo sapiens]
 gi|1052865|gb|AAC50349.1| DDBa p127 [Homo sapiens]
 gi|15079750|gb|AAH11686.1| Damage-specific DNA binding protein 1, 127kDa [Homo sapiens]
 gi|29792243|gb|AAH50530.1| Damage-specific DNA binding protein 1, 127kDa [Homo sapiens]
 gi|30354567|gb|AAH51764.1| Damage-specific DNA binding protein 1, 127kDa [Homo sapiens]
 gi|61354161|gb|AAX44048.1| damage-specific DNA binding protein 1, 127kDa [Homo sapiens]
 gi|119594341|gb|EAW73935.1| damage-specific DNA binding protein 1, 127kDa, isoform CRA_c [Homo
            sapiens]
 gi|168275638|dbj|BAG10539.1| DNA damage-binding protein 1 [synthetic construct]
 gi|189065506|dbj|BAG35345.1| unnamed protein product [Homo sapiens]
 gi|355566436|gb|EHH22815.1| Damage-specific DNA-binding protein 1 [Macaca mulatta]
 gi|380784123|gb|AFE63937.1| DNA damage-binding protein 1 [Macaca mulatta]
 gi|380808126|gb|AFE75938.1| DNA damage-binding protein 1 [Macaca mulatta]
 gi|380810144|gb|AFE76947.1| DNA damage-binding protein 1 [Macaca mulatta]
 gi|383408123|gb|AFH27275.1| DNA damage-binding protein 1 [Macaca mulatta]
 gi|410305600|gb|JAA31400.1| damage-specific DNA binding protein 1, 127kDa [Pan troglodytes]
 gi|410352015|gb|JAA42611.1| damage-specific DNA binding protein 1, 127kDa [Pan troglodytes]
          Length = 1140

 Score = 43.1 bits (100), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 41/165 (24%), Positives = 69/165 (41%), Gaps = 17/165 (10%)

Query: 96   LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
            LF+ Q ++  +    R  L +   FHLG+ VN F    C  S +    G  S   +    
Sbjct: 974  LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGETSTPTQGSVL 1029

Query: 151  YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
            + +++G +G    L E  Y  LL +QN +       G +    +R++  +      P+ G
Sbjct: 1030 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATG 1087

Query: 211  IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
             IDG L+  FL +S  +  E+   +      G K     D+L  +
Sbjct: 1088 FIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKV 1132


>gi|400260815|pdb|4E54|A Chain A, Damaged Dna Induced Uv-Damaged Dna-Binding Protein (Uv-Ddb)
            Dimerization And Its Roles In Chromatinized Dna Repair
 gi|401871507|pdb|4E5Z|A Chain A, Damaged Dna Induced Uv-Damaged Dna-Binding Protein (Uv-Ddb)
            Dimerization And Its Roles In Chromatinized Dna Repair
          Length = 1150

 Score = 43.1 bits (100), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 41/165 (24%), Positives = 69/165 (41%), Gaps = 17/165 (10%)

Query: 96   LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
            LF+ Q ++  +    R  L +   FHLG+ VN F    C  S +    G  S   +    
Sbjct: 984  LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGETSTPTQGSVL 1039

Query: 151  YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
            + +++G +G    L E  Y  LL +QN +       G +    +R++  +      P+ G
Sbjct: 1040 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATG 1097

Query: 211  IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
             IDG L+  FL +S  +  E+   +      G K     D+L  +
Sbjct: 1098 FIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKV 1142


>gi|395852550|ref|XP_003798801.1| PREDICTED: DNA damage-binding protein 1 [Otolemur garnettii]
          Length = 1140

 Score = 43.1 bits (100), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 41/165 (24%), Positives = 69/165 (41%), Gaps = 17/165 (10%)

Query: 96   LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
            LF+ Q ++  +    R  L +   FHLG+ VN F    C  S +    G  S   +    
Sbjct: 974  LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGETSTPTQGSVL 1029

Query: 151  YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
            + +++G +G    L E  Y  LL +QN +       G +    +R++  +      P+ G
Sbjct: 1030 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATG 1087

Query: 211  IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
             IDG L+  FL +S  +  E+   +      G K     D+L  +
Sbjct: 1088 FIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKV 1132


>gi|359546285|pdb|4A11|A Chain A, Structure Of The Hsddb1-Hscsa Complex
 gi|361132519|pdb|4A0K|C Chain C, Structure Of Ddb1-Ddb2-Cul4a-Rbx1 Bound To A 12 Bp Abasic
            Site Containing Dna-Duplex
          Length = 1159

 Score = 43.1 bits (100), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 41/165 (24%), Positives = 69/165 (41%), Gaps = 17/165 (10%)

Query: 96   LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
            LF+ Q ++  +    R  L +   FHLG+ VN F    C  S +    G  S   +    
Sbjct: 993  LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGETSTPTQGSVL 1048

Query: 151  YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
            + +++G +G    L E  Y  LL +QN +       G +    +R++  +      P+ G
Sbjct: 1049 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATG 1106

Query: 211  IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
             IDG L+  FL +S  +  E+   +      G K     D+L  +
Sbjct: 1107 FIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKV 1151


>gi|358440058|pdb|4A08|A Chain A, Structure Of Hsddb1-Drddb2 Bound To A 13 Bp Cpd-Duplex (
            Purine At D-1 Position) At 3.0 A Resolution (Cpd 1)
 gi|358440062|pdb|4A09|A Chain A, Structure Of Hsddb1-Drddb2 Bound To A 15 Bp Cpd-Duplex
            (Purine At D-1 Position) At 3.1 A Resolution (Cpd 2)
          Length = 1159

 Score = 43.1 bits (100), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 41/165 (24%), Positives = 69/165 (41%), Gaps = 17/165 (10%)

Query: 96   LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
            LF+ Q ++  +    R  L +   FHLG+ VN F    C  S +    G  S   +    
Sbjct: 993  LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGETSTPTQGSVL 1048

Query: 151  YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
            + +++G +G    L E  Y  LL +QN +       G +    +R++  +      P+ G
Sbjct: 1049 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATG 1106

Query: 211  IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
             IDG L+  FL +S  +  E+   +      G K     D+L  +
Sbjct: 1107 FIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKV 1151


>gi|311247551|ref|XP_003122699.1| PREDICTED: DNA damage-binding protein 1-like isoform 1 [Sus scrofa]
          Length = 1140

 Score = 43.1 bits (100), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 41/165 (24%), Positives = 69/165 (41%), Gaps = 17/165 (10%)

Query: 96   LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
            LF+ Q ++  +    R  L +   FHLG+ VN F    C  S +    G  S   +    
Sbjct: 974  LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGETSTPTQGSVL 1029

Query: 151  YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
            + +++G +G    L E  Y  LL +QN +       G +    +R++  +      P+ G
Sbjct: 1030 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATG 1087

Query: 211  IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
             IDG L+  FL +S  +  E+   +      G K     D+L  +
Sbjct: 1088 FIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKV 1132


>gi|2632123|emb|CAA05770.1| Xeroderma Pigmentosum Group E Complementing protein [Homo sapiens]
          Length = 1140

 Score = 43.1 bits (100), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 41/165 (24%), Positives = 69/165 (41%), Gaps = 17/165 (10%)

Query: 96   LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
            LF+ Q ++  +    R  L +   FHLG+ VN F    C  S +    G  S   +    
Sbjct: 974  LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGETSTPTQGSVL 1029

Query: 151  YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
            + +++G +G    L E  Y  LL +QN +       G +    +R++  +      P+ G
Sbjct: 1030 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATG 1087

Query: 211  IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
             IDG L+  FL +S  +  E+   +      G K     D+L  +
Sbjct: 1088 FIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKV 1132


>gi|74138855|dbj|BAE27231.1| unnamed protein product [Mus musculus]
          Length = 1140

 Score = 43.1 bits (100), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 41/165 (24%), Positives = 69/165 (41%), Gaps = 17/165 (10%)

Query: 96   LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
            LF+ Q ++  +    R  L +   FHLG+ VN F    C  S +    G  S   +    
Sbjct: 974  LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGEASTPTQGSVL 1029

Query: 151  YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
            + +++G +G    L E  Y  LL +QN +       G +    +R++  +      P+ G
Sbjct: 1030 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATG 1087

Query: 211  IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
             IDG L+  FL +S  +  E+   +      G K     D+L  +
Sbjct: 1088 FIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKV 1132


>gi|358440070|pdb|4A0B|A Chain A, Structure Of Hsddb1-Drddb2 Bound To A 16 Bp Cpd-Duplex (
            Pyrimidine At D-1 Position) At 3.8 A Resolution (Cpd 4)
 gi|358440072|pdb|4A0B|C Chain C, Structure Of Hsddb1-Drddb2 Bound To A 16 Bp Cpd-Duplex (
            Pyrimidine At D-1 Position) At 3.8 A Resolution (Cpd 4)
          Length = 1159

 Score = 43.1 bits (100), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 41/165 (24%), Positives = 69/165 (41%), Gaps = 17/165 (10%)

Query: 96   LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
            LF+ Q ++  +    R  L +   FHLG+ VN F    C  S +    G  S   +    
Sbjct: 993  LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGETSTPTQGSVL 1048

Query: 151  YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
            + +++G +G    L E  Y  LL +QN +       G +    +R++  +      P+ G
Sbjct: 1049 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATG 1106

Query: 211  IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
             IDG L+  FL +S  +  E+   +      G K     D+L  +
Sbjct: 1107 FIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKV 1151


>gi|384941436|gb|AFI34323.1| DNA damage-binding protein 1 [Macaca mulatta]
          Length = 1140

 Score = 43.1 bits (100), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 41/165 (24%), Positives = 69/165 (41%), Gaps = 17/165 (10%)

Query: 96   LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
            LF+ Q ++  +    R  L +   FHLG+ VN F    C  S +    G  S   +    
Sbjct: 974  LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGETSTPTQGSVL 1029

Query: 151  YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
            + +++G +G    L E  Y  LL +QN +       G +    +R++  +      P+ G
Sbjct: 1030 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATG 1087

Query: 211  IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
             IDG L+  FL +S  +  E+   +      G K     D+L  +
Sbjct: 1088 FIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKV 1132


>gi|358440066|pdb|4A0A|A Chain A, Structure Of Hsddb1-Drddb2 Bound To A 16 Bp Cpd-Duplex (
            Pyrimidine At D-1 Position) At 3.6 A Resolution (Cpd 3)
          Length = 1159

 Score = 43.1 bits (100), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 41/165 (24%), Positives = 69/165 (41%), Gaps = 17/165 (10%)

Query: 96   LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
            LF+ Q ++  +    R  L +   FHLG+ VN F    C  S +    G  S   +    
Sbjct: 993  LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGETSTPTQGSVL 1048

Query: 151  YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
            + +++G +G    L E  Y  LL +QN +       G +    +R++  +      P+ G
Sbjct: 1049 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATG 1106

Query: 211  IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
             IDG L+  FL +S  +  E+   +      G K     D+L  +
Sbjct: 1107 FIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKV 1151


>gi|453081643|gb|EMF09692.1| DNA damage-binding protein 1 [Mycosphaerella populorum SO2202]
          Length = 1151

 Score = 43.1 bits (100), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 37/154 (24%), Positives = 69/154 (44%), Gaps = 6/154 (3%)

Query: 85   FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGAR 144
            ++ SD + N+V+  + P    +    RL   ++  LG+ VN    I    +P ++A    
Sbjct: 983  YIQSDAEGNLVVLEHDPTGFSAEDRRRLRVTSEMCLGEMVNRIRPITTVITPSANAVVIP 1042

Query: 145  SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYA 204
              F+   A+++G++  F  + ++    L+ LQ  M       G +    FR +K +    
Sbjct: 1043 KAFI---ATVEGSVYVFGTIAQQYQDLLIRLQGSMAEMVKSPGFVRFNRFRGFKTQVRDM 1099

Query: 205  GN--PSRGIIDGSLVWKFLQLSLGERLEICKKIG 236
            G   P R  +DG ++  FL LS   +  + K +G
Sbjct: 1100 GEEGPVR-FVDGEIIEGFLGLSAEVQESVAKDLG 1132


>gi|194381178|dbj|BAG64157.1| unnamed protein product [Homo sapiens]
          Length = 826

 Score = 43.1 bits (100), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 41/165 (24%), Positives = 69/165 (41%), Gaps = 17/165 (10%)

Query: 96  LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
           LF+ Q ++  +    R  L +   FHLG+ VN F    C  S +    G  S   +    
Sbjct: 660 LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGETSTPTQGSVL 715

Query: 151 YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
           + +++G +G    L E  Y  LL +QN +       G +    +R++  +      P+ G
Sbjct: 716 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATG 773

Query: 211 IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
            IDG L+  FL +S  +  E+   +      G K     D+L  +
Sbjct: 774 FIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKV 818


>gi|195329354|ref|XP_002031376.1| GM24084 [Drosophila sechellia]
 gi|194120319|gb|EDW42362.1| GM24084 [Drosophila sechellia]
          Length = 1140

 Score = 43.1 bits (100), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 41/162 (25%), Positives = 71/162 (43%), Gaps = 16/162 (9%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVN 125
            P+ + A E + + + +G      + N  LF+ Q ++  +    R  L +   FHLG  VN
Sbjct: 952  PKWMRAVEILDDDTFLG-----SETNGNLFVCQKDSAATTDEERQLLPELARFHLGDTVN 1006

Query: 126  TFFKIRCKPSPISDAPGARSRFL---TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
             F       S +    G R+  +     Y + +GA+G    +P+  Y  L  L+  +   
Sbjct: 1007 VFRH----GSLVMQNVGERTTPINGCVLYGTCNGAIGIVTQIPQDFYDFLHGLEERLKKI 1062

Query: 183  TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLS 224
                G +  + +R ++   +    PS+G IDG L+  FL LS
Sbjct: 1063 IKLVGKIGHKFYRNFRI--HTQVEPSQGFIDGDLIESFLDLS 1102


>gi|7657011|ref|NP_056550.1| DNA damage-binding protein 1 [Mus musculus]
 gi|134034087|sp|Q3U1J4.2|DDB1_MOUSE RecName: Full=DNA damage-binding protein 1; AltName: Full=DDB p127
            subunit; AltName: Full=Damage-specific DNA-binding
            protein 1; AltName: Full=UV-damaged DNA-binding factor
 gi|5931596|dbj|BAA84699.1| XPE UV-damaged DNA binding factor [Mus musculus]
 gi|16307148|gb|AAH09661.1| Damage specific DNA binding protein 1 [Mus musculus]
 gi|74182145|dbj|BAE34102.1| unnamed protein product [Mus musculus]
 gi|74196166|dbj|BAE32993.1| unnamed protein product [Mus musculus]
          Length = 1140

 Score = 43.1 bits (100), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 41/165 (24%), Positives = 69/165 (41%), Gaps = 17/165 (10%)

Query: 96   LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
            LF+ Q ++  +    R  L +   FHLG+ VN F    C  S +    G  S   +    
Sbjct: 974  LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGEASTPTQGSVL 1029

Query: 151  YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
            + +++G +G    L E  Y  LL +QN +       G +    +R++  +      P+ G
Sbjct: 1030 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATG 1087

Query: 211  IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
             IDG L+  FL +S  +  E+   +      G K     D+L  +
Sbjct: 1088 FIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKV 1132


>gi|403255013|ref|XP_003920244.1| PREDICTED: DNA damage-binding protein 1 [Saimiri boliviensis
            boliviensis]
          Length = 1140

 Score = 43.1 bits (100), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 41/165 (24%), Positives = 69/165 (41%), Gaps = 17/165 (10%)

Query: 96   LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
            LF+ Q ++  +    R  L +   FHLG+ VN F    C  S +    G  S   +    
Sbjct: 974  LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGETSTPTQGSVL 1029

Query: 151  YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
            + +++G +G    L E  Y  LL +QN +       G +    +R++  +      P+ G
Sbjct: 1030 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATG 1087

Query: 211  IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
             IDG L+  FL +S  +  E+   +      G K     D+L  +
Sbjct: 1088 FIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKV 1132


>gi|119594340|gb|EAW73934.1| damage-specific DNA binding protein 1, 127kDa, isoform CRA_b [Homo
           sapiens]
          Length = 923

 Score = 43.1 bits (100), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 41/165 (24%), Positives = 69/165 (41%), Gaps = 17/165 (10%)

Query: 96  LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
           LF+ Q ++  +    R  L +   FHLG+ VN F    C  S +    G  S   +    
Sbjct: 757 LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGETSTPTQGSVL 812

Query: 151 YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
           + +++G +G    L E  Y  LL +QN +       G +    +R++  +      P+ G
Sbjct: 813 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATG 870

Query: 211 IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
            IDG L+  FL +S  +  E+   +      G K     D+L  +
Sbjct: 871 FIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKV 915


>gi|270346571|pdb|3I7H|A Chain A, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
            Hbx
 gi|270346573|pdb|3I7K|A Chain A, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
            Whx
 gi|270346575|pdb|3I7L|A Chain A, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
            Ddb2
 gi|270346577|pdb|3I7N|A Chain A, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
            Wdtc1
 gi|270346579|pdb|3I7O|A Chain A, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
            Iqwd1
 gi|270346581|pdb|3I7P|A Chain A, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
            Wdr40a
 gi|270346583|pdb|3I89|A Chain A, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
            Wdr22
 gi|270346585|pdb|3I8C|A Chain A, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
            Wdr21a
 gi|270346587|pdb|3I8E|A Chain A, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
            Wdr42a
 gi|270346588|pdb|3I8E|B Chain B, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
            Wdr42a
          Length = 1143

 Score = 42.7 bits (99), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 41/165 (24%), Positives = 69/165 (41%), Gaps = 17/165 (10%)

Query: 96   LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
            LF+ Q ++  +    R  L +   FHLG+ VN F    C  S +    G  S   +    
Sbjct: 977  LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGETSTPTQGSVL 1032

Query: 151  YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
            + +++G +G    L E  Y  LL +QN +       G +    +R++  +      P+ G
Sbjct: 1033 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATG 1090

Query: 211  IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
             IDG L+  FL +S  +  E+   +      G K     D+L  +
Sbjct: 1091 FIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKV 1135


>gi|221046711|pdb|3EI1|A Chain A, Structure Of Hsddb1-Drddb2 Bound To A 14 Bp 6-4 Photoproduct
            Containing Dna-Duplex
 gi|221046715|pdb|3EI2|A Chain A, Structure Of Hsddb1-Drddb2 Bound To A 16 Bp Abasic Site
            Containing Dna-Duplex
 gi|221046719|pdb|3EI3|A Chain A, Structure Of The Hsddb1-Drddb2 Complex
          Length = 1158

 Score = 42.7 bits (99), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 41/165 (24%), Positives = 69/165 (41%), Gaps = 17/165 (10%)

Query: 96   LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
            LF+ Q ++  +    R  L +   FHLG+ VN F    C  S +    G  S   +    
Sbjct: 992  LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGETSTPTQGSVL 1047

Query: 151  YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
            + +++G +G    L E  Y  LL +QN +       G +    +R++  +      P+ G
Sbjct: 1048 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATG 1105

Query: 211  IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
             IDG L+  FL +S  +  E+   +      G K     D+L  +
Sbjct: 1106 FIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKV 1150


>gi|122692537|ref|NP_001073731.1| DNA damage-binding protein 1 [Bos taurus]
 gi|426251842|ref|XP_004019630.1| PREDICTED: DNA damage-binding protein 1 [Ovis aries]
 gi|134034086|sp|A1A4K3.1|DDB1_BOVIN RecName: Full=DNA damage-binding protein 1; AltName:
            Full=Damage-specific DNA-binding protein 1
 gi|119223918|gb|AAI26630.1| Damage-specific DNA binding protein 1, 127kDa [Bos taurus]
 gi|296471644|tpg|DAA13759.1| TPA: DNA damage-binding protein 1 [Bos taurus]
          Length = 1140

 Score = 42.7 bits (99), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 41/165 (24%), Positives = 69/165 (41%), Gaps = 17/165 (10%)

Query: 96   LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
            LF+ Q ++  +    R  L +   FHLG+ VN F    C  S +    G  S   +    
Sbjct: 974  LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGETSTPTQGSVL 1029

Query: 151  YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
            + +++G +G    L E  Y  LL +QN +       G +    +R++  +      P+ G
Sbjct: 1030 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATG 1087

Query: 211  IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
             IDG L+  FL +S  +  E+   +      G K     D+L  +
Sbjct: 1088 FIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKV 1132


>gi|197097564|ref|NP_001126613.1| DNA damage-binding protein 1 [Pongo abelii]
 gi|75041202|sp|Q5R649.1|DDB1_PONAB RecName: Full=DNA damage-binding protein 1; AltName:
            Full=Damage-specific DNA-binding protein 1
 gi|55732122|emb|CAH92767.1| hypothetical protein [Pongo abelii]
          Length = 1140

 Score = 42.7 bits (99), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 41/165 (24%), Positives = 69/165 (41%), Gaps = 17/165 (10%)

Query: 96   LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
            LF+ Q ++  +    R  L +   FHLG+ VN F    C  S +    G  S   +    
Sbjct: 974  LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGETSTPTQGSVL 1029

Query: 151  YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
            + +++G +G    L E  Y  LL +QN +       G +    +R++  +      P+ G
Sbjct: 1030 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATG 1087

Query: 211  IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
             IDG L+  FL +S  +  E+   +      G K     D+L  +
Sbjct: 1088 FIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKV 1132


>gi|441604084|ref|XP_004087862.1| PREDICTED: LOW QUALITY PROTEIN: DNA damage-binding protein 1
            [Nomascus leucogenys]
          Length = 1140

 Score = 42.7 bits (99), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 41/165 (24%), Positives = 69/165 (41%), Gaps = 17/165 (10%)

Query: 96   LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
            LF+ Q ++  +    R  L +   FHLG+ VN F    C  S +    G  S   +    
Sbjct: 974  LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGETSTPTQGSVL 1029

Query: 151  YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
            + +++G +G    L E  Y  LL +QN +       G +    +R++  +      P+ G
Sbjct: 1030 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATG 1087

Query: 211  IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
             IDG L+  FL +S  +  E+   +      G K     D+L  +
Sbjct: 1088 FIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKV 1132


>gi|221046721|pdb|3EI4|A Chain A, Structure Of The Hsddb1-Hsddb2 Complex
 gi|221046723|pdb|3EI4|C Chain C, Structure Of The Hsddb1-Hsddb2 Complex
 gi|221046725|pdb|3EI4|E Chain E, Structure Of The Hsddb1-Hsddb2 Complex
          Length = 1158

 Score = 42.7 bits (99), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 41/165 (24%), Positives = 69/165 (41%), Gaps = 17/165 (10%)

Query: 96   LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
            LF+ Q ++  +    R  L +   FHLG+ VN F    C  S +    G  S   +    
Sbjct: 992  LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGETSTPTQGSVL 1047

Query: 151  YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
            + +++G +G    L E  Y  LL +QN +       G +    +R++  +      P+ G
Sbjct: 1048 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATG 1105

Query: 211  IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
             IDG L+  FL +S  +  E+   +      G K     D+L  +
Sbjct: 1106 FIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKV 1150


>gi|74178494|dbj|BAE32502.1| unnamed protein product [Mus musculus]
          Length = 1140

 Score = 42.7 bits (99), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 41/165 (24%), Positives = 69/165 (41%), Gaps = 17/165 (10%)

Query: 96   LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
            LF+ Q ++  +    R  L +   FHLG+ VN F    C  S +    G  S   +    
Sbjct: 974  LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGEASTPTQGSVL 1029

Query: 151  YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
            + +++G +G    L E  Y  LL +QN +       G +    +R++  +      P+ G
Sbjct: 1030 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATG 1087

Query: 211  IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
             IDG L+  FL +S  +  E+   +      G K     D+L  +
Sbjct: 1088 FIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKV 1132


>gi|21357503|ref|NP_650257.1| piccolo [Drosophila melanogaster]
 gi|74872881|sp|Q9XYZ5.1|DDB1_DROME RecName: Full=DNA damage-binding protein 1; Short=D-DDB1; AltName:
            Full=Damage-specific DNA-binding protein 1; AltName:
            Full=Protein piccolo
 gi|4928452|gb|AAD33592.1|AF132145_1 damage-specific DNA binding protein DDBa p127 subunit [Drosophila
            melanogaster]
 gi|7299719|gb|AAF54901.1| piccolo [Drosophila melanogaster]
 gi|220942640|gb|ACL83863.1| DDB1-PA [synthetic construct]
          Length = 1140

 Score = 42.7 bits (99), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 41/162 (25%), Positives = 68/162 (41%), Gaps = 16/162 (9%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVN 125
            P+ + A E + + + +G      + N  LF+ Q ++  +    R  L +   FHLG  VN
Sbjct: 952  PKWMRAVEILDDDTFLG-----SETNGNLFVCQKDSAATTDEERQLLPELARFHLGDTVN 1006

Query: 126  TFFKIRCKPSPISDAPGARSRFL---TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
             F       S +    G R+  +     Y + +GA+G    +P+  Y  L  L+  +   
Sbjct: 1007 VFRH----GSLVMQNVGERTTPINGCVLYGTCNGAIGIVTQIPQDFYDFLHGLEERLKKI 1062

Query: 183  TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLS 224
                G +    +R ++        PS G IDG L+  FL LS
Sbjct: 1063 IKSVGKIEHTYYRNFQINSKV--EPSEGFIDGDLIESFLDLS 1102


>gi|90108797|pdb|2B5L|A Chain A, Crystal Structure Of Ddb1 In Complex With Simian Virus 5 V
            Protein
 gi|90108798|pdb|2B5L|B Chain B, Crystal Structure Of Ddb1 In Complex With Simian Virus 5 V
            Protein
 gi|90108801|pdb|2B5M|A Chain A, Crystal Structure Of Ddb1
 gi|116667897|pdb|2HYE|A Chain A, Crystal Structure Of The Ddb1-cul4a-rbx1-sv5v Complex
 gi|1136228|gb|AAA88883.1| UV-damaged DNA binding factor [Homo sapiens]
 gi|1588524|prf||2208446A xeroderma pigmentosum group E-binding factor
          Length = 1140

 Score = 42.7 bits (99), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 41/165 (24%), Positives = 69/165 (41%), Gaps = 17/165 (10%)

Query: 96   LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
            LF+ Q ++  +    R  L +   FHLG+ VN F    C  S +    G  S   +    
Sbjct: 974  LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGETSTPTQGSVL 1029

Query: 151  YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
            + +++G +G    L E  Y  LL +QN +       G +    +R++  +      P+ G
Sbjct: 1030 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATG 1087

Query: 211  IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
             IDG L+  FL +S  +  E+   +      G K     D+L  +
Sbjct: 1088 FIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKV 1132


>gi|361132523|pdb|4A0L|A Chain A, Structure Of Ddb1-Ddb2-Cul4b-Rbx1 Bound To A 12 Bp Abasic
            Site Containing Dna-Duplex
 gi|361132525|pdb|4A0L|C Chain C, Structure Of Ddb1-Ddb2-Cul4b-Rbx1 Bound To A 12 Bp Abasic
            Site Containing Dna-Duplex
          Length = 1144

 Score = 42.7 bits (99), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 41/165 (24%), Positives = 69/165 (41%), Gaps = 17/165 (10%)

Query: 96   LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
            LF+ Q ++  +    R  L +   FHLG+ VN F    C  S +    G  S   +    
Sbjct: 978  LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGETSTPTQGSVL 1033

Query: 151  YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
            + +++G +G    L E  Y  LL +QN +       G +    +R++  +      P+ G
Sbjct: 1034 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATG 1091

Query: 211  IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
             IDG L+  FL +S  +  E+   +      G K     D+L  +
Sbjct: 1092 FIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKV 1136


>gi|322706594|gb|EFY98174.1| DNA damage-binding protein 1 [Metarhizium anisopliae ARSEF 23]
          Length = 1121

 Score = 42.7 bits (99), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 33/172 (19%), Positives = 69/172 (40%), Gaps = 15/172 (8%)

Query: 85   FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC--------KPSP 136
            ++ +D   N+++    PEA       +L   ++ ++G+ +N   K+           P  
Sbjct: 940  WLEADAQGNIIVLQRNPEAPTEQDRSKLEVTSEMNIGEQINQIRKLHVASNENAVVSPKA 999

Query: 137  ISDAPGARSRFLTWYASL------DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 190
               + G     +T +  L      +G L  F  +       LL  Q+ +  +    G ++
Sbjct: 1000 FLGSVGLSETIITCWNQLLMLVQIEGTLYLFGEIAPNYQDLLLTFQSRLQDYIYAPGNVS 1059

Query: 191  PRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
               +R ++ K      P R  +DG +V +FL L   ++  +C+ +G    D+
Sbjct: 1060 FNLWRAFRNKAREGDGPFR-FVDGEMVERFLDLDEAKQELVCEGLGPSVEDM 1110


>gi|355683071|gb|AER97036.1| damage-specific DNA binding protein 1, 127kDa [Mustela putorius furo]
          Length = 1122

 Score = 42.7 bits (99), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 36/134 (26%), Positives = 59/134 (44%), Gaps = 11/134 (8%)

Query: 96   LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
            LF+ Q ++  +    R  L +   FHLG+ VN F    C  S +    G  S   +    
Sbjct: 974  LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGETSTPTQGSVL 1029

Query: 151  YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
            + +++G +G    L E  Y  LL +QN +       G +    +R++  +      P+ G
Sbjct: 1030 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATG 1087

Query: 211  IIDGSLVWKFLQLS 224
             IDG L+  FL +S
Sbjct: 1088 FIDGDLIESFLDIS 1101


>gi|170589357|ref|XP_001899440.1| CPSF A subunit region family protein [Brugia malayi]
 gi|158593653|gb|EDP32248.1| CPSF A subunit region family protein [Brugia malayi]
          Length = 655

 Score = 42.7 bits (99), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 36/151 (23%), Positives = 62/151 (41%), Gaps = 7/151 (4%)

Query: 73  ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC 132
           ACE +    S  F+ ++   N+   M          G RL +   F+LG+ VN F     
Sbjct: 465 ACEII---DSDNFLGAENSYNLFTVMKDSFTVFKEEGTRLQELGLFYLGEMVNVFCHGSL 521

Query: 133 KPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPR 192
             + +  AP   S  L  Y + DG +G  + +P   Y  L  +Q  +  +  +   ++  
Sbjct: 522 TATQVDVAPLYHSSIL--YGTSDGGIGVIVQMPPVLYTFLQDVQKRLAEYAENCMRISHT 579

Query: 193 AFRTYKGKGYYAGNPSRGIIDGSLVWKFLQL 223
            +RT++ +         G IDG L+   L +
Sbjct: 580 QYRTFETEK--RSEAPNGFIDGDLIESLLDM 608


>gi|449435512|ref|XP_004135539.1| PREDICTED: DNA damage-binding protein 1-like [Cucumis sativus]
          Length = 1093

 Score = 42.7 bits (99), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 33/129 (25%), Positives = 57/129 (44%), Gaps = 12/129 (9%)

Query: 111  RLIKKTDFHLGQHVNTF----FKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPE 166
            RL    ++HLG+ VN F      +R   S +   P         + S++G +G    LP 
Sbjct: 955  RLEVVGEYHLGEFVNRFQHGSLVMRLPDSDVGQIP------TVIFGSVNGVIGVIASLPH 1008

Query: 167  KNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLG 226
              Y  L  LQ+ +       GGL+   +R++  +   A   ++  +DG L+  FL L+  
Sbjct: 1009 DQYVFLERLQSNLRKVIKGVGGLSHEQWRSFNNEKRTA--EAKNFLDGDLIESFLDLNRS 1066

Query: 227  ERLEICKKI 235
            +  EI + +
Sbjct: 1067 KMEEISRAM 1075


>gi|328770638|gb|EGF80679.1| hypothetical protein BATDEDRAFT_11194 [Batrachochytrium dendrobatidis
            JAM81]
          Length = 1098

 Score = 42.7 bits (99), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 38/152 (25%), Positives = 62/152 (40%), Gaps = 6/152 (3%)

Query: 73   ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC 132
            A E + + + +G   +D   N+     Q +        RL  K  FH G+ +N F K   
Sbjct: 914  AAELIDDDTFLG---ADSSMNIFALSKQGDQVSEEERQRLRPKGWFHTGELINRFRKGSL 970

Query: 133  KPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLL-MLQNVMVTHTSHTGGLNP 191
                  +     +     Y ++ GA+G    +P     ++L  LQ  + +     GGL  
Sbjct: 971  TLHATDETLALPAIPEILYCTVHGAIGVVARIPSDETAKILSTLQEALKSVVQGVGGLIH 1030

Query: 192  RAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQL 223
              +R Y+ +       S GIIDG L+  FL+L
Sbjct: 1031 SDWRRYRTE--RRSIKSAGIIDGDLIESFLEL 1060


>gi|74215029|dbj|BAE33503.1| unnamed protein product [Mus musculus]
          Length = 1140

 Score = 42.7 bits (99), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 36/141 (25%), Positives = 59/141 (41%), Gaps = 15/141 (10%)

Query: 118  FHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLM 174
            FHLG+ VN F    C  S +    G  S   +    + +++G +G    L E  Y  LL 
Sbjct: 998  FHLGEFVNVF----CHGSLVMQNLGEASTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLD 1053

Query: 175  LQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKK 234
            +QN +       G +    +R++  +      P+ G IDG L+  FL +S  +  E+   
Sbjct: 1054 MQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVAN 1111

Query: 235  I------GSKHNDILDELYDI 249
            +      G K     D+L  +
Sbjct: 1112 LQYDDGSGMKREATADDLIKV 1132


>gi|74208347|dbj|BAE26370.1| unnamed protein product [Mus musculus]
          Length = 599

 Score = 42.7 bits (99), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 41/165 (24%), Positives = 69/165 (41%), Gaps = 17/165 (10%)

Query: 96  LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
           LF+ Q ++  +    R  L +   FHLG+ VN F    C  S +    G  S   +    
Sbjct: 433 LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGEASTPTQGSVL 488

Query: 151 YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
           + +++G +G    L E  Y  LL +QN +       G +    +R++  +      P+ G
Sbjct: 489 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATG 546

Query: 211 IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
            IDG L+  FL +S  +  E+   +      G K     D+L  +
Sbjct: 547 FIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKV 591


>gi|261335516|emb|CBH18510.1| cleavage and polyadenylation specificity factor-like protein,
            putative [Trypanosoma brucei gambiense DAL972]
          Length = 1452

 Score = 42.7 bits (99), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 31/125 (24%), Positives = 52/125 (41%), Gaps = 3/125 (2%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGG-HRLIKKTDFHLGQHVNTFFK 129
            V  C+ +    + G + SD ++NV++  Y P  +E++ G H  + ++   L         
Sbjct: 1267 VVHCDMMYHDRAFGILCSDDERNVLIMGYTPRVQETDAGTHPKVLESVLSLDGEYRLPSG 1326

Query: 130  IRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGL 189
               K        G  S  +T Y S  G +GF +P+ E+  R  L +   +        GL
Sbjct: 1327 SLVKSLRFRSTAGNSS--VTLYVSNYGEIGFIVPIGEQANRTALWVTRRLQIDLPCEAGL 1384

Query: 190  NPRAF 194
             PR F
Sbjct: 1385 TPRMF 1389


>gi|410974071|ref|XP_003993471.1| PREDICTED: DNA damage-binding protein 1 [Felis catus]
          Length = 1193

 Score = 42.7 bits (99), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 41/165 (24%), Positives = 69/165 (41%), Gaps = 17/165 (10%)

Query: 96   LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
            LF+ Q ++  +    R  L +   FHLG+ VN F    C  S +    G  S   +    
Sbjct: 1027 LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGETSTPTQGSVL 1082

Query: 151  YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
            + +++G +G    L E  Y  LL +QN +       G +    +R++  +      P+ G
Sbjct: 1083 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATG 1140

Query: 211  IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
             IDG L+  FL +S  +  E+   +      G K     D+L  +
Sbjct: 1141 FIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKV 1185


>gi|194389106|dbj|BAG61570.1| unnamed protein product [Homo sapiens]
          Length = 1009

 Score = 42.7 bits (99), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 36/141 (25%), Positives = 59/141 (41%), Gaps = 15/141 (10%)

Query: 118  FHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLM 174
            FHLG+ VN F    C  S +    G  S   +    + +++G +G    L E  Y  LL 
Sbjct: 867  FHLGEFVNVF----CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLD 922

Query: 175  LQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKK 234
            +QN +       G +    +R++  +      P+ G IDG L+  FL +S  +  E+   
Sbjct: 923  MQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVAN 980

Query: 235  I------GSKHNDILDELYDI 249
            +      G K     D+L  +
Sbjct: 981  LQYDDGSGMKREATADDLIKV 1001


>gi|5353754|gb|AAD42230.1|AF159853_1 damage-specific DNA binding protein 1 [Mus musculus]
          Length = 1140

 Score = 42.7 bits (99), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 41/165 (24%), Positives = 69/165 (41%), Gaps = 17/165 (10%)

Query: 96   LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
            LF+ Q ++  +    R  L +   FHLG+ VN F    C  S +    G  S   +    
Sbjct: 974  LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGEASTPTQGSVL 1029

Query: 151  YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
            + +++G +G    L E  Y  LL +QN +       G +    +R++  +      P+ G
Sbjct: 1030 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATG 1087

Query: 211  IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
             IDG L+  FL +S  +  E+   +      G K     D+L  +
Sbjct: 1088 FIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKV 1132


>gi|347838030|emb|CCD52602.1| similar to DDB1B (Damaged DNA Binding protein 1 B); damaged DNA
            binding / protein binding [Botryotinia fuckeliana]
          Length = 1157

 Score = 42.7 bits (99), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 41/184 (22%), Positives = 76/184 (41%), Gaps = 24/184 (13%)

Query: 77   VIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSP 136
            V E  +  ++ +D D N++L     E        R+    + +LG+ VN   +I  + S 
Sbjct: 969  VAEVDTDTYLETDHDGNLILLKRNREGVTREDKTRMEVTCEMNLGEMVNRVKRINVETS- 1027

Query: 137  ISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHT---------- 186
              DA      FL    + +G++  F  +P +N   L+ LQ+ + +  S +          
Sbjct: 1028 -KDALLIPRAFL---GTTEGSIYLFSLIPPQNQDLLMRLQSRLASLPSASSIRGSSDSTS 1083

Query: 187  --------GGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 238
                    G L+   +R+Y         P R  +DG L+ +FL L +  +  + + +G K
Sbjct: 1084 PHQIELSPGNLDFNKYRSYISATRETSEPFR-FVDGELIERFLDLEVEVQEHVAEGLGVK 1142

Query: 239  HNDI 242
              D+
Sbjct: 1143 AEDL 1146


>gi|402592185|gb|EJW86114.1| CPSF A subunit region family protein [Wuchereria bancrofti]
          Length = 278

 Score = 42.7 bits (99), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 36/151 (23%), Positives = 62/151 (41%), Gaps = 7/151 (4%)

Query: 73  ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC 132
           ACE +    S  F+ ++   N+   M          G RL +   F+LG+ VN F     
Sbjct: 110 ACEII---DSDNFLGAENSYNLFTVMKDSFTVFKEEGTRLQELGLFYLGEMVNVFCHGSL 166

Query: 133 KPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPR 192
             + +  AP   S  L  Y + DG +G  + +P   Y  L  +Q  +  +  +   ++  
Sbjct: 167 TATQVDVAPLYHSSIL--YGTSDGGIGVIVQMPPVLYTFLQDVQKRLAEYAENCMRISHT 224

Query: 193 AFRTYKGKGYYAGNPSRGIIDGSLVWKFLQL 223
            +RT++ +         G IDG L+   L +
Sbjct: 225 QYRTFETEK--RSEAPNGFIDGDLIESLLDM 253


>gi|170057515|ref|XP_001864517.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167876915|gb|EDS40298.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 1138

 Score = 42.7 bits (99), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 35/157 (22%), Positives = 65/157 (41%), Gaps = 6/157 (3%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P+ + A E + + + +G   ++   N+ + +    A   +   ++ +   FHLG  VN F
Sbjct: 950  PKWMTAVEILDDDAFLG---AENSNNLFVCLKDSAATTDDERQQMPEVAQFHLGDMVNVF 1006

Query: 128  FKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 187
                     I +     S  +  + ++ GA+G    +P   Y  L  LQ  +       G
Sbjct: 1007 RHGSLVMQNIGERTTPTSGCV-LFGTVSGAIGLVTQIPPDYYEFLRKLQENLTNTIKSVG 1065

Query: 188  GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLS 224
             ++   +R++  +       S G IDG LV  FL L+
Sbjct: 1066 RIDHTYWRSFHTE--MKTENSEGFIDGDLVESFLDLT 1100


>gi|393905247|gb|EJD73911.1| CPSF A subunit region family protein [Loa loa]
          Length = 1145

 Score = 42.4 bits (98), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 38/151 (25%), Positives = 66/151 (43%), Gaps = 7/151 (4%)

Query: 73   ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC 132
            ACE +I+S S  F+ ++   N+   +          G RL +   F+LG+ VN F     
Sbjct: 956  ACE-IIDSDS--FLGAENSYNLFTVVKDSFTVFKEEGTRLQELGLFYLGEMVNVFCHGSL 1012

Query: 133  KPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPR 192
              + +  AP   S  L  Y + DG +G  + +P   Y  L  +Q  +  +T +   ++  
Sbjct: 1013 TATQVDVAPLYHSSIL--YGTSDGGIGVIVQMPPVLYTFLHDVQKRLADYTENCMRISHT 1070

Query: 193  AFRTYKGKGYYAGNPSRGIIDGSLVWKFLQL 223
             +RT++ +         G IDG L+   L +
Sbjct: 1071 QYRTFETEK--RSEVPNGFIDGDLIESLLDM 1099


>gi|357519461|ref|XP_003630019.1| DNA damage-binding protein [Medicago truncatula]
 gi|355524041|gb|AET04495.1| DNA damage-binding protein [Medicago truncatula]
          Length = 1171

 Score = 42.4 bits (98), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 32/129 (24%), Positives = 56/129 (43%), Gaps = 12/129 (9%)

Query: 111  RLIKKTDFHLGQHVNTF----FKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPE 166
            RL    ++HLG+ +N F      +R   S +   P         + +++G +G    LP 
Sbjct: 1033 RLEVAGEYHLGEFINRFRHGSLVMRLPDSDVGQIPTV------IFGTINGVIGVIASLPH 1086

Query: 167  KNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLG 226
            + Y  L  LQ+ +       GGL+   +R++  +       +R  +DG L+  FL L   
Sbjct: 1087 EQYVFLEKLQSNLRKVIKGVGGLSHEQWRSFNNEKKTV--EARNFLDGDLIESFLDLKRS 1144

Query: 227  ERLEICKKI 235
            +  EI K +
Sbjct: 1145 KMDEISKAM 1153


>gi|395544366|ref|XP_003774082.1| PREDICTED: DNA damage-binding protein 1 [Sarcophilus harrisii]
          Length = 1239

 Score = 42.4 bits (98), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 41/165 (24%), Positives = 69/165 (41%), Gaps = 17/165 (10%)

Query: 96   LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
            LF+ Q ++  +    R  L +   FHLG+ VN F    C  S +    G  S   +    
Sbjct: 1073 LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGETSTPTQGSVL 1128

Query: 151  YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
            + +++G +G    L E  Y  LL +QN +       G +    +R++  +      P+ G
Sbjct: 1129 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATG 1186

Query: 211  IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
             IDG L+  FL +S  +  E+   +      G K     D+L  +
Sbjct: 1187 FIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKV 1231


>gi|195571247|ref|XP_002103615.1| GD18880 [Drosophila simulans]
 gi|194199542|gb|EDX13118.1| GD18880 [Drosophila simulans]
          Length = 1140

 Score = 42.4 bits (98), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 41/162 (25%), Positives = 68/162 (41%), Gaps = 16/162 (9%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVN 125
            P+ + A E + + + +G      + N  LF+ Q ++  +    R  L +   FHLG  VN
Sbjct: 952  PKWMRAVEILDDDTFLG-----SETNGNLFVCQKDSAATTDEERQLLPELARFHLGDTVN 1006

Query: 126  TFFKIRCKPSPISDAPGARSRFL---TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
             F       S +    G R+  +     Y + +GA+G    +P+  Y  L  L+  +   
Sbjct: 1007 VFRH----GSLVMQNVGERTTPINGCVLYGTCNGAIGIVTQIPQDFYDFLHGLEERLKKI 1062

Query: 183  TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLS 224
                G +    +R ++        PS G IDG L+  FL LS
Sbjct: 1063 IKSVGKIEHTYYRNFQINTKV--EPSEGFIDGDLIESFLDLS 1102


>gi|195145844|ref|XP_002013900.1| GL24391 [Drosophila persimilis]
 gi|194102843|gb|EDW24886.1| GL24391 [Drosophila persimilis]
          Length = 1140

 Score = 42.4 bits (98), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 41/162 (25%), Positives = 68/162 (41%), Gaps = 16/162 (9%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVN 125
            P+ + A E + + + +G      + N  LF+ Q ++  +    R  L +   FHLG  VN
Sbjct: 952  PKWMRAVEILDDDTFLG-----SETNGNLFVCQKDSAATTDEERQLLPELARFHLGDTVN 1006

Query: 126  TFFKIRCKPSPISDAPGARSRFL---TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
             F       S +    G R+  +     Y + +GA+G    +P+  Y  L  L+  +   
Sbjct: 1007 VFRH----GSLVMQNVGERTTPINGCVLYGTCNGAIGIVTQIPQDFYDFLHGLEERLKKI 1062

Query: 183  TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLS 224
                G +    +R ++        PS G IDG L+  FL LS
Sbjct: 1063 IKSVGKIEHTYYRNFQINTKV--EPSEGFIDGDLIESFLDLS 1102


>gi|125774475|ref|XP_001358496.1| GA20574 [Drosophila pseudoobscura pseudoobscura]
 gi|54638233|gb|EAL27635.1| GA20574 [Drosophila pseudoobscura pseudoobscura]
          Length = 1140

 Score = 42.4 bits (98), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 41/162 (25%), Positives = 68/162 (41%), Gaps = 16/162 (9%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVN 125
            P+ + A E + + + +G      + N  LF+ Q ++  +    R  L +   FHLG  VN
Sbjct: 952  PKWMRAVEILDDDTFLG-----SETNGNLFVCQKDSAATTDEERQLLPELARFHLGDTVN 1006

Query: 126  TFFKIRCKPSPISDAPGARSRFL---TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
             F       S +    G R+  +     Y + +GA+G    +P+  Y  L  L+  +   
Sbjct: 1007 VFRH----GSLVMQNVGERTTPINGCVLYGTCNGAIGIVTQIPQDFYDFLHGLEERLKKI 1062

Query: 183  TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLS 224
                G +    +R ++        PS G IDG L+  FL LS
Sbjct: 1063 IKSVGKIEHTYYRNFQINTKV--EPSEGFIDGDLIESFLDLS 1102


>gi|340059653|emb|CCC54046.1| putative mitochondrial carrier protein [Trypanosoma vivax Y486]
          Length = 1481

 Score = 42.4 bits (98), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 31/124 (25%), Positives = 49/124 (39%), Gaps = 1/124 (0%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
            V  C  +    S G + SD  +NV++  Y P  +E+  G R  K  +  L          
Sbjct: 1296 VVDCNMMYHDRSFGILCSDDQRNVLVMGYTPRVQEAGAG-RPAKTLESLLTLDGEYRLPS 1354

Query: 131  RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 190
             C    +  +    +  +  Y S  G +GF +P+ E+  R  L +   + T      GL 
Sbjct: 1355 GCLAKSLRFSSDFGNSSVMLYTSNYGEVGFIVPIGEQANRTALWVTRRLQTDVPCDAGLT 1414

Query: 191  PRAF 194
            PR F
Sbjct: 1415 PRMF 1418


>gi|312076590|ref|XP_003140929.1| CPSF A subunit region family protein [Loa loa]
          Length = 655

 Score = 42.4 bits (98), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 38/151 (25%), Positives = 66/151 (43%), Gaps = 7/151 (4%)

Query: 73  ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC 132
           ACE +I+S S  F+ ++   N+   +          G RL +   F+LG+ VN F     
Sbjct: 457 ACE-IIDSDS--FLGAENSYNLFTVVKDSFTVFKEEGTRLQELGLFYLGEMVNVFCHGSL 513

Query: 133 KPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPR 192
             + +  AP   S  L  Y + DG +G  + +P   Y  L  +Q  +  +T +   ++  
Sbjct: 514 TATQVDVAPLYHSSIL--YGTSDGGIGVIVQMPPVLYTFLHDVQKRLADYTENCMRISHT 571

Query: 193 AFRTYKGKGYYAGNPSRGIIDGSLVWKFLQL 223
            +RT++ +         G IDG L+   L +
Sbjct: 572 QYRTFETEK--RSEVPNGFIDGDLIESLLDM 600


>gi|449704103|gb|EMD44407.1| DNA-repair binding protein, putative [Entamoeba histolytica KU27]
          Length = 1088

 Score = 42.4 bits (98), Expect = 0.19,   Method: Composition-based stats.
 Identities = 45/183 (24%), Positives = 79/183 (43%), Gaps = 13/183 (7%)

Query: 73   ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC 132
            A EFV E     ++ SD + N+++F       ES    RL      H+G+ +N    + C
Sbjct: 914  AIEFVDEDC---YLSSDSNSNILIFNTNSTGNESER-FRLNNCAHIHVGECIN----VMC 965

Query: 133  KPS--PISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQN-VMVTHTSHTGGL 189
            K S  P         +    +  + G +G    +P + Y  L+ +QN +++         
Sbjct: 966  KGSIAPTHSTYETVQKKCILFGGVTGYIGGICEIPNEIYDVLIKVQNQILLQMKGIVECT 1025

Query: 190  NPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI 249
             P  ++  K    +   PS  IIDGS+V  +L++S  ++ EI    G     I D + ++
Sbjct: 1026 TPDNWK--KVIDDWKRMPSSNIIDGSIVESYLEMSKEKQCEIAHLSGVNEEQISDIIENM 1083

Query: 250  EAL 252
             +L
Sbjct: 1084 ISL 1086


>gi|156841606|ref|XP_001644175.1| hypothetical protein Kpol_1059p7 [Vanderwaltozyma polyspora DSM
            70294]
 gi|156114812|gb|EDO16317.1| hypothetical protein Kpol_1059p7 [Vanderwaltozyma polyspora DSM
            70294]
          Length = 1346

 Score = 42.4 bits (98), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 23/53 (43%), Positives = 34/53 (64%), Gaps = 3/53 (5%)

Query: 198  KGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI-GSKHNDILDELYDI 249
            K + YYA  P + IIDG L  +FL L+  ERLEICK +  +K  DI+ ++ ++
Sbjct: 1291 KYRSYYA--PVKNIIDGDLCERFLYLNSNERLEICKNLKDTKPEDIIRQINEM 1341


>gi|327301962|ref|XP_003235673.1| UV-damaged DNA binding protein [Trichophyton rubrum CBS 118892]
 gi|326461015|gb|EGD86468.1| UV-damaged DNA binding protein [Trichophyton rubrum CBS 118892]
          Length = 1147

 Score = 42.4 bits (98), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 35/151 (23%), Positives = 68/151 (45%), Gaps = 6/151 (3%)

Query: 85   FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGAR 144
            ++++D + N+V+          +   RL   ++  LG+ VN    I  + +    A  AR
Sbjct: 982  YLLADAEGNLVVLQQNITGVTESDRKRLQPTSEIRLGEMVNRIHPIVIQ-TYTETAVSAR 1040

Query: 145  SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYA 204
            +      A++DG++  F  +       LL LQ  M + T   G +    +R ++   + +
Sbjct: 1041 A----LLATVDGSIYLFGLINPTYIDLLLRLQTAMGSITISPGEIPFSKYRAFRTTVHQS 1096

Query: 205  GNPSRGIIDGSLVWKFLQLSLGERLEICKKI 235
              P R  +DG L+ +FL  + G + EI  ++
Sbjct: 1097 DEPFR-FVDGELIERFLSCTPGMQEEIVSRL 1126


>gi|452003044|gb|EMD95501.1| hypothetical protein COCHEDRAFT_1165632 [Cochliobolus heterostrophus
            C5]
          Length = 1116

 Score = 42.4 bits (98), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 37/149 (24%), Positives = 69/149 (46%), Gaps = 8/149 (5%)

Query: 77   VIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSP 136
            V E  +  ++ SD + N+V+    P     +   RL   ++  LG+ VN   ++    + 
Sbjct: 946  VAEVDTNTYLESDAEGNLVVLYRDPNGVTDDDKRRLNISSEMLLGEMVNRIRRVDVLTA- 1004

Query: 137  ISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLM-LQNVMVTHTSHTGGLNPRAFR 195
             +DA      F+    +++G++ +   L   +Y+ LLM LQ+ + +  +  G +    FR
Sbjct: 1005 -ADAVIVPKAFV---GTVEGSI-YLFGLISPDYQNLLMTLQSNLGSLVAAPGDMQFAKFR 1059

Query: 196  TYKGKGYYAGNPSRGIIDGSLVWKFLQLS 224
             +K        P R  +DG LV +FL +S
Sbjct: 1060 AFKNSVREEEEPMR-FVDGELVERFLDVS 1087


>gi|219125301|ref|XP_002182922.1| damage-specific DNA binding protein 1 [Phaeodactylum tricornutum CCAP
            1055/1]
 gi|217405716|gb|EEC45658.1| damage-specific DNA binding protein 1 [Phaeodactylum tricornutum CCAP
            1055/1]
          Length = 1284

 Score = 42.4 bits (98), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 31/113 (27%), Positives = 52/113 (46%), Gaps = 3/113 (2%)

Query: 111  RLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYR 170
            RL    +FHLG+  N F        P+S      SR  T + +++G+LG  L L  +   
Sbjct: 1110 RLDNIGEFHLGEMCNKFMS-GSLVMPVSSNSTTSSRRATLFGTVEGSLGVILGLDGRTAA 1168

Query: 171  RLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQL 223
              + L+  +       GG + + +R+ + +     +P+ G +DG LV  FL L
Sbjct: 1169 FFITLERAIAKTIQPVGGFSHQLYRSCQAE--LRVHPAHGFVDGDLVETFLDL 1219


>gi|452824087|gb|EME31092.1| DNA damage-binding protein 1 isoform 1 [Galdieria sulphuraria]
          Length = 1128

 Score = 42.4 bits (98), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 41/162 (25%), Positives = 69/162 (42%), Gaps = 9/162 (5%)

Query: 77   VIESSSMGFMISDKDK-NVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPS 135
            VIE   + + IS ++  N+       +A       RL K  ++HLG+ VN     R    
Sbjct: 955  VIEMLDLDYYISAENCFNLFTLKRNSDASTEEERSRLEKVGEYHLGELVNRIRHGRL--- 1011

Query: 136  PISDAPGARSRFLT--WYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRA 193
             +   P +    L    Y + +GALG    + EK ++ L  LQ  +       GG+    
Sbjct: 1012 -VLQIPESGISILKSLLYGTANGALGVIASIDEKTFQFLHSLQTALNEVIKGVGGIQHED 1070

Query: 194  FRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI 235
            +R +  +       S+  +DG L+ +FL LS  +   + KK+
Sbjct: 1071 WRRFTSERRIG--DSKNFLDGDLIERFLDLSRDKMELVAKKV 1110


>gi|195449948|ref|XP_002072297.1| GK22405 [Drosophila willistoni]
 gi|194168382|gb|EDW83283.1| GK22405 [Drosophila willistoni]
          Length = 1140

 Score = 42.4 bits (98), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 41/162 (25%), Positives = 68/162 (41%), Gaps = 16/162 (9%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVN 125
            P+ + A E + + + +G      + N  LF+ Q ++  +    R  L +   FHLG  VN
Sbjct: 952  PKWMRAVEILDDDTFLG-----SETNGNLFVCQKDSAATTDEERQLLPELARFHLGDTVN 1006

Query: 126  TFFKIRCKPSPISDAPGARSRFL---TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
             F       S +    G R+  +     Y + +GA+G    +P+  Y  L  L+  +   
Sbjct: 1007 VFRH----GSLVMQNVGERTTPINGCVLYGTCNGAIGIVTQIPQDFYDFLHGLEERLKKI 1062

Query: 183  TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLS 224
                G +    +R ++        PS G IDG L+  FL LS
Sbjct: 1063 IKSVGKIEHTYYRNFQINTKV--EPSEGFIDGDLIESFLDLS 1102


>gi|325186344|emb|CCA20849.1| predicted protein putative [Albugo laibachii Nc14]
          Length = 1148

 Score = 42.4 bits (98), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 42/187 (22%), Positives = 83/187 (44%), Gaps = 22/187 (11%)

Query: 85   FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF----FKIRCKPSPISDA 140
            ++ S+ D N+        A       RL    ++HLG+ VN F      ++   S  ++A
Sbjct: 958  YVGSETDFNLFTVQRNSGAASDEERGRLETIGEYHLGEFVNRFRYGSLVMQHNLSIGAEA 1017

Query: 141  PGA-----RSRFLT--------WYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 187
            PG      R   L+         + ++ G +G  LP+ ++ +  L+ +Q+ +       G
Sbjct: 1018 PGISLSDDRPESLSPLSVQRSMLFGTVSGMIGVILPISKEKHEFLMRVQSALNQVIQGVG 1077

Query: 188  GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELY 247
            G +   +RT++ +   +   +   IDG L+  FL LS  E  ++  ++   + D L+   
Sbjct: 1078 GFSHSEWRTFENR--RSSIEAHNFIDGDLIESFLDLSKDEMKQVVDEL---NRDQLEGKT 1132

Query: 248  DIEALSS 254
             +EAL++
Sbjct: 1133 TLEALAA 1139


>gi|224061051|ref|XP_002300334.1| predicted protein [Populus trichocarpa]
 gi|222847592|gb|EEE85139.1| predicted protein [Populus trichocarpa]
          Length = 1088

 Score = 42.4 bits (98), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 33/129 (25%), Positives = 56/129 (43%), Gaps = 12/129 (9%)

Query: 111  RLIKKTDFHLGQHVNTF----FKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPE 166
            RL    ++HLG+ VN F      +R   S +   P         + +++G +G    LP 
Sbjct: 950  RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIP------TVIFGTVNGVIGVIASLPH 1003

Query: 167  KNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLG 226
            + Y  L  LQ+ +       GGL+   +R++  +       ++  +DG L+  FL LS  
Sbjct: 1004 EQYLFLEKLQSNLRKVIKGVGGLSHEQWRSFNNEKKTV--DAKNFLDGDLIESFLDLSRS 1061

Query: 227  ERLEICKKI 235
               EI K +
Sbjct: 1062 RMDEISKAM 1070


>gi|124505011|ref|XP_001351247.1| CPSF (cleavage and polyadenylation specific factor), subunit A,
            putative [Plasmodium falciparum 3D7]
 gi|7768292|emb|CAB11136.2| CPSF (cleavage and polyadenylation specific factor), subunit A,
            putative [Plasmodium falciparum 3D7]
          Length = 2870

 Score = 42.4 bits (98), Expect = 0.21,   Method: Composition-based stats.
 Identities = 40/194 (20%), Positives = 80/194 (41%), Gaps = 17/194 (8%)

Query: 74   CEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCK 133
            C  +   S    ++SD D N ++             ++L   + F+ G  +N    +   
Sbjct: 2679 CTSLCALSKSHIVVSDMDANFIILQKSKFKYNDEDSYKLSSVSLFNHGSIINKMLPL--S 2736

Query: 134  PSPISDAPGARSRFLT-----WYASLDGALGFFLPLPE-KNYRRLLMLQNVMVTHTSHTG 187
             + + +    +   LT       AS +G++   +P     N+++ L ++  +  + S  G
Sbjct: 2737 NTNLIEEDYDKRNILTKNDGILCASSEGSISVLIPFSSFANFKKALCIEIAITDNISSIG 2796

Query: 188  GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLE-------ICKKIGSKHN 240
             L+  A+R YK    +     +GI+DG L+  F  +S  ++ +       I KKI  K  
Sbjct: 2797 NLSHNAYREYKVN--FRSKHCKGIVDGELLKMFFHMSFEKQYKTFIYAKWIAKKINCKFG 2854

Query: 241  DILDELYDIEALSS 254
               + + D+E + S
Sbjct: 2855 SFNNFILDLENMCS 2868


>gi|154303693|ref|XP_001552253.1| hypothetical protein BC1G_08731 [Botryotinia fuckeliana B05.10]
          Length = 1087

 Score = 42.4 bits (98), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 41/184 (22%), Positives = 76/184 (41%), Gaps = 24/184 (13%)

Query: 77   VIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSP 136
            V E  +  ++ +D D N++L     E        R+    + +LG+ VN   +I  + S 
Sbjct: 899  VAEVDTDTYLETDHDGNLILLKRNREGVTREDKTRMEVTCEMNLGEMVNRVKRINVETS- 957

Query: 137  ISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHT---------- 186
              DA      FL    + +G++  F  +P +N   L+ LQ+ + +  S +          
Sbjct: 958  -KDALLIPRAFL---GTTEGSIYLFSLIPPQNQDLLMRLQSRLASLPSASSIRGSSDSTS 1013

Query: 187  --------GGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 238
                    G L+   +R+Y         P R  +DG L+ +FL L +  +  + + +G K
Sbjct: 1014 PHQIELSPGNLDFNKYRSYISATRETSEPFR-FVDGELIERFLDLEVEVQEHVAEGLGVK 1072

Query: 239  HNDI 242
              D+
Sbjct: 1073 AEDL 1076


>gi|350537001|ref|NP_001234275.1| DNA damage-binding protein 1 [Solanum lycopersicum]
 gi|350539125|ref|NP_001233864.1| UV damaged DNA binding protein 1 [Solanum lycopersicum]
 gi|55976440|sp|Q6QNU4.1|DDB1_SOLLC RecName: Full=DNA damage-binding protein 1; AltName: Full=High
            pigmentation protein 1; AltName: Full=UV-damaged
            DNA-binding protein 1
 gi|38455768|gb|AAR20885.1| UV damaged DNA binding protein 1 [Solanum lycopersicum]
 gi|42602165|gb|AAS21683.1| UV-damaged DNA binding protein 1 [Solanum lycopersicum]
          Length = 1090

 Score = 42.0 bits (97), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 50/241 (20%), Positives = 91/241 (37%), Gaps = 41/241 (17%)

Query: 8    LTVSLGLHGNRPLLLVRTQHELLIYQ---------AFRHPKGTLKLRFKKLKVLFVSDRS 58
            L    G HG+   L V+T+ + ++            F+H +G ++ R +     ++S   
Sbjct: 860  LQTECGHHGHILALYVQTRGDFIVVGDLMKSISLLIFKHEEGAIEERARDYNANWMS--- 916

Query: 59   KRANEQPGLPRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDF 118
                          A E + +   +G   ++ + N+       E        RL    ++
Sbjct: 917  --------------AVEILDDDIYLG---AENNFNLFTVRKNSEGATDEERSRLEVVGEY 959

Query: 119  HLGQHVNTF----FKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLM 174
            HLG+ VN F      +R   S +   P         + +++G +G    LP   Y  L  
Sbjct: 960  HLGEFVNRFRHGSLVMRLPDSDVGQIP------TVIFGTVNGVIGVIASLPHDQYLFLEK 1013

Query: 175  LQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKK 234
            LQ  +       GGL+   +R++  +       ++  +DG L+  FL LS     EI K 
Sbjct: 1014 LQTNLRKVIKGVGGLSHEQWRSFYNEKKTV--DAKNFLDGDLIESFLDLSRNRMEEISKA 1071

Query: 235  I 235
            +
Sbjct: 1072 M 1072


>gi|367033707|ref|XP_003666136.1| hypothetical protein MYCTH_2310607 [Myceliophthora thermophila ATCC
            42464]
 gi|347013408|gb|AEO60891.1| hypothetical protein MYCTH_2310607 [Myceliophthora thermophila ATCC
            42464]
          Length = 1043

 Score = 42.0 bits (97), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 37/170 (21%), Positives = 70/170 (41%), Gaps = 9/170 (5%)

Query: 73   ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC 132
            A  FV E S   ++ +D   N+++     E   +    R+   ++ +LG+ VN     R 
Sbjct: 872  AVGFVREGS---WLEADAQGNLMVLRRNVEGVTAEDRRRMEITSEMNLGEMVN-----RI 923

Query: 133  KPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPR 192
            +   +   PGA      +  +++G +  F  +       LL  Q  +       G +  R
Sbjct: 924  REVEVETTPGALVVPRAFLGTVEGGIYMFGTVAPHVQDLLLRFQGKLADVLKTAGDIPFR 983

Query: 193  AFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
            ++R ++        P R  +DG L+ +FL +    +  ICK +G    D+
Sbjct: 984  SYRAFRNAEREGDGPFR-FLDGELLERFLDVDEATQEIICKGLGPSVEDM 1032


>gi|156339616|ref|XP_001620212.1| hypothetical protein NEMVEDRAFT_v1g223331 [Nematostella vectensis]
 gi|156204813|gb|EDO28112.1| predicted protein [Nematostella vectensis]
          Length = 248

 Score = 42.0 bits (97), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 25/64 (39%), Positives = 37/64 (57%), Gaps = 3/64 (4%)

Query: 4   VQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP--KGTLKLRFKKLKVLFVSDRSKRA 61
           V+E+L   LG    R  L+     +LLIY+AF +P  +G L LRFKKL+   +  R K+ 
Sbjct: 150 VREVLLTGLGYKNRRATLVAVMDQDLLIYEAFSYPTVEGHLNLRFKKLQ-HNIQIREKKP 208

Query: 62  NEQP 65
            ++P
Sbjct: 209 KQEP 212


>gi|431910407|gb|ELK13480.1| DNA damage-binding protein 1 [Pteropus alecto]
          Length = 1143

 Score = 42.0 bits (97), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 44/171 (25%), Positives = 71/171 (41%), Gaps = 26/171 (15%)

Query: 96   LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
            LF+ Q ++  +    R  L +   FHLG+ VN F    C  S +    G  S   +    
Sbjct: 974  LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGETSTPTQGSVL 1029

Query: 151  YASLDGALGFFLPLPEKNYRRLLMLQNVM------VTHTSHTGGLNPRAFRTYKGKGYYA 204
            + +++G +G    L E  Y  LL +QN +      V    H+   + R+F T +      
Sbjct: 1030 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSLYPSQRSFHTERKT---- 1085

Query: 205  GNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
              P+ G IDG L+  FL +S  +  E+   +      G K     D+L  +
Sbjct: 1086 -EPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKV 1135


>gi|55976392|sp|Q6E7D1.1|DDB1_SOLCE RecName: Full=DNA damage-binding protein 1; AltName: Full=UV-damaged
            DNA-binding protein 1
 gi|49484911|gb|AAT66742.1| UV-damaged DNA binding protein 1 [Solanum cheesmaniae]
          Length = 1095

 Score = 42.0 bits (97), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 50/241 (20%), Positives = 91/241 (37%), Gaps = 41/241 (17%)

Query: 8    LTVSLGLHGNRPLLLVRTQHELLIYQ---------AFRHPKGTLKLRFKKLKVLFVSDRS 58
            L    G HG+   L V+T+ + ++            F+H +G ++ R +     ++S   
Sbjct: 865  LQTECGHHGHILALYVQTRGDFIVVGDLMKSISLLIFKHEEGAIEERARDYNANWMS--- 921

Query: 59   KRANEQPGLPRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDF 118
                          A E + +   +G   ++ + N+       E        RL    ++
Sbjct: 922  --------------AVEILDDDIYLG---AENNFNLFTVRKNSEGATDEERSRLEVVGEY 964

Query: 119  HLGQHVNTF----FKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLM 174
            HLG+ VN F      +R   S +   P         + +++G +G    LP   Y  L  
Sbjct: 965  HLGEFVNRFRHGSLVMRLPDSDVGQIP------TVIFGTVNGVIGVIASLPHDQYLFLEK 1018

Query: 175  LQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKK 234
            LQ  +       GGL+   +R++  +       ++  +DG L+  FL LS     EI K 
Sbjct: 1019 LQTNLRKVIKGVGGLSHEQWRSFYNEKKTV--DAKNFLDGDLIESFLDLSRNRMEEISKA 1076

Query: 235  I 235
            +
Sbjct: 1077 M 1077


>gi|212539802|ref|XP_002150056.1| UV-damaged DNA binding protein, putative [Talaromyces marneffei ATCC
            18224]
 gi|210067355|gb|EEA21447.1| UV-damaged DNA binding protein, putative [Talaromyces marneffei ATCC
            18224]
          Length = 1139

 Score = 42.0 bits (97), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 38/176 (21%), Positives = 75/176 (42%), Gaps = 8/176 (4%)

Query: 77   VIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSP 136
            V +  S  +++SD + N+++     +        RL   ++  LG+ VN     R +P  
Sbjct: 968  VADIGSNQWLVSDAEGNLIVLRRNVDGMTEEDRRRLEVTSELLLGEMVN-----RIRPVN 1022

Query: 137  ISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRT 196
            I            +  +++G++  F  +  ++   L+ LQ  +  +    G +    FR 
Sbjct: 1023 IPQTSTMAVTPKAFLGTVEGSIYLFALINPEHQDFLMRLQTAISAYVDSPGLMPFNKFRA 1082

Query: 197  YKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEAL 252
            ++     A  P R  +DG L+ +FL      + EI   +GS   + + ++  IEAL
Sbjct: 1083 FRSTVREAEEPFR-FVDGELIERFLDCDRAVQEEILGVVGSGDLESVQKM--IEAL 1135


>gi|440893607|gb|ELR46310.1| DNA damage-binding protein 1 [Bos grunniens mutus]
          Length = 1143

 Score = 42.0 bits (97), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 44/171 (25%), Positives = 71/171 (41%), Gaps = 26/171 (15%)

Query: 96   LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
            LF+ Q ++  +    R  L +   FHLG+ VN F    C  S +    G  S   +    
Sbjct: 974  LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGETSTPTQGSVL 1029

Query: 151  YASLDGALGFFLPLPEKNYRRLLMLQNVM------VTHTSHTGGLNPRAFRTYKGKGYYA 204
            + +++G +G    L E  Y  LL +QN +      V    H+   + R+F T +      
Sbjct: 1030 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSLYPSQRSFHTERKT---- 1085

Query: 205  GNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
              P+ G IDG L+  FL +S  +  E+   +      G K     D+L  +
Sbjct: 1086 -EPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKV 1135


>gi|407044103|gb|EKE42371.1| DNA damage-binding protein, putative [Entamoeba nuttalli P19]
          Length = 1088

 Score = 42.0 bits (97), Expect = 0.26,   Method: Composition-based stats.
 Identities = 45/183 (24%), Positives = 79/183 (43%), Gaps = 13/183 (7%)

Query: 73   ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC 132
            A EFV E     ++ SD + N+++F       ES    RL      H+G+ +N    + C
Sbjct: 914  AIEFVDEDC---YLSSDSNSNILIFNTNSTGNESER-FRLNNCAHIHVGECIN----VMC 965

Query: 133  KPS--PISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQN-VMVTHTSHTGGL 189
            K S  P         +    +  + G +G    +P + Y  L+ +QN +++         
Sbjct: 966  KGSIAPTHSTYETVQKKCILFGGVTGYIGGICEIPNEIYDILIKVQNQILLQMKGIVECT 1025

Query: 190  NPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI 249
             P  ++  K    +   PS  IIDGS+V  +L++S  ++ EI    G     I D + ++
Sbjct: 1026 TPDDWK--KVIDDWKRMPSSNIIDGSIVESYLEMSKEKQCEIAHLSGVNEEKISDIIENM 1083

Query: 250  EAL 252
             +L
Sbjct: 1084 ISL 1086


>gi|290998415|ref|XP_002681776.1| damage-specific DNA binding protein 1 [Naegleria gruberi]
 gi|284095401|gb|EFC49032.1| damage-specific DNA binding protein 1 [Naegleria gruberi]
          Length = 1103

 Score = 42.0 bits (97), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 39/190 (20%), Positives = 79/190 (41%), Gaps = 17/190 (8%)

Query: 73   ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNT----FF 128
            +C  ++  S   ++  D+  N++      EA       +L++   ++    VN     F 
Sbjct: 918  SCSLMLNESE--YVAVDQQGNMITLKKNDEAASEEERKQLVRVGKYYCSDRVNRIQPGFI 975

Query: 129  KIR--CKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHT 186
             +R     S I+  P   + F     ++ G +G    LP + +  +  +Q  M +  +  
Sbjct: 976  GMRFANSSSDINTQPVKTALF----GTISGGIGVLAQLPPETFAFVTKIQKAMSSVVTGL 1031

Query: 187  GGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHND--ILD 244
              ++   +R Y+ +       S G IDG  V  FL+     +  + +++ + H +   L+
Sbjct: 1032 ANISRETYRQYRSE--RTREDSVGFIDGDFVESFLEFDFETQQRVIEELSNNHQEQITLE 1089

Query: 245  ELY-DIEALS 253
            EL  +IE LS
Sbjct: 1090 ELVKNIEDLS 1099


>gi|315039795|ref|XP_003169275.1| hypothetical protein MGYG_08821 [Arthroderma gypseum CBS 118893]
 gi|311337696|gb|EFQ96898.1| hypothetical protein MGYG_08821 [Arthroderma gypseum CBS 118893]
          Length = 1076

 Score = 42.0 bits (97), Expect = 0.28,   Method: Compositional matrix adjust.
 Identities = 38/168 (22%), Positives = 72/168 (42%), Gaps = 12/168 (7%)

Query: 85   FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGAR 144
            ++++D + N+V+          +   RL   ++  LG+ VN    I  + +    A  AR
Sbjct: 911  YLLADAEGNLVVLQQNTTGVTESDRKRLQPTSEIRLGEMVNRIHPITVQ-THTETAVSAR 969

Query: 145  SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYA 204
            +      A++DG++  F  +       LL LQ  M + T   G +    +R ++     +
Sbjct: 970  A----LLATVDGSIYLFGLINPAYIDLLLRLQTAMASITVSPGEIPFSKYRAFRTTVRQS 1025

Query: 205  GNPSRGIIDGSLVWKFL------QLSLGERLEICKKIGSKHNDILDEL 246
              P R  +DG L+ +FL      Q  +  RL+      S   ++++EL
Sbjct: 1026 DEPFR-FVDGELIERFLSCTPTMQEEIANRLDDSNVTVSSLKEMIEEL 1072


>gi|116195210|ref|XP_001223417.1| hypothetical protein CHGG_04203 [Chaetomium globosum CBS 148.51]
 gi|88180116|gb|EAQ87584.1| hypothetical protein CHGG_04203 [Chaetomium globosum CBS 148.51]
          Length = 1127

 Score = 41.6 bits (96), Expect = 0.28,   Method: Compositional matrix adjust.
 Identities = 36/170 (21%), Positives = 69/170 (40%), Gaps = 9/170 (5%)

Query: 73   ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC 132
            A  FV E S   ++ +D   N+++     E   +    R+   ++ +L + VN     R 
Sbjct: 956  AVGFVEEGS---WLEADAQGNLMVLRRNVEGVTAEDKRRMEVTSEINLNEMVN-----RI 1007

Query: 133  KPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPR 192
            +   +   PGA      +  +++G +  F  +       LL  Q+ +       G +  R
Sbjct: 1008 RTIDVETTPGAMIVPKAFLGTVEGGIYMFGTVAPHVQDLLLRFQSRLADVLKTAGDIEFR 1067

Query: 193  AFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
             +R ++        P R  +DG L+ KFL +    +  +CK +G    D+
Sbjct: 1068 TYRAFRNAEREGDGPFR-FVDGELLEKFLDVDETTQEAVCKGLGPTVEDM 1116


>gi|357606250|gb|EHJ64976.1| putative Splicing factor 3B subunit [Danaus plexippus]
          Length = 1216

 Score = 41.6 bits (96), Expect = 0.32,   Method: Compositional matrix adjust.
 Identities = 36/135 (26%), Positives = 58/135 (42%), Gaps = 17/135 (12%)

Query: 116  TDFHLGQHVNTFFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLM- 174
             +FH+G+ V +  +           PG     L  YA++ GALG FLP   +        
Sbjct: 1092 VNFHVGETVTSLQRATL-------IPGGSEALL--YATVSGALGVFLPFTSREDHDFFQH 1142

Query: 175  LQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKK 234
            L+  M +  S   G +  +FR+Y    YY   P + +IDG L  +F  L   ++  I   
Sbjct: 1143 LEMHMRSENSPLCGRDHLSFRSY----YY---PVKNVIDGDLCEQFNSLEPAKQKAIAGD 1195

Query: 235  IGSKHNDILDELYDI 249
            +     ++  +L DI
Sbjct: 1196 LERTPAEVSKKLEDI 1210


>gi|241260143|ref|XP_002404926.1| DNA repair protein xp-E, putative [Ixodes scapularis]
 gi|215496735|gb|EEC06375.1| DNA repair protein xp-E, putative [Ixodes scapularis]
          Length = 1148

 Score = 41.6 bits (96), Expect = 0.32,   Method: Compositional matrix adjust.
 Identities = 46/176 (26%), Positives = 75/176 (42%), Gaps = 22/176 (12%)

Query: 96   LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
            LF+ Q ++  +    R  L +   FHLG+ VN F       S +   PG  S   +    
Sbjct: 974  LFVCQKDSAATTDEDRQHLQEVGQFHLGEFVNIFR----HGSLVMQHPGEASSPTQGSVL 1029

Query: 151  YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAF---RTYKGKGYYA--- 204
            + ++ GA+G    LP   Y  LL +Q  +       G ++   +   R +  + +     
Sbjct: 1030 FGTIHGAIGLVAQLPSDFYNFLLEVQGNLTKVIKSVGKIDHTLYPFVRLFTWRSFSTERK 1089

Query: 205  GNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYD-IEALS 253
               ++G IDG L+  FL LS  +  E+ + I      G K +  +D+L   IE LS
Sbjct: 1090 TEQAQGFIDGDLIESFLDLSRDKMQEVLQGIQMDDGSGMKRDATVDDLIKIIEELS 1145


>gi|194741158|ref|XP_001953056.1| GF17579 [Drosophila ananassae]
 gi|190626115|gb|EDV41639.1| GF17579 [Drosophila ananassae]
          Length = 1140

 Score = 41.6 bits (96), Expect = 0.32,   Method: Compositional matrix adjust.
 Identities = 48/197 (24%), Positives = 80/197 (40%), Gaps = 22/197 (11%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVN 125
            P+ + A E + + + +G      + N  LF+ Q ++  +    R  L +   FHLG  VN
Sbjct: 952  PKWMRAVEILDDDTFLG-----SETNGNLFVCQKDSAATTDEERQLLPELARFHLGDTVN 1006

Query: 126  TFFKIRCKPSPISDAPGARSRFL---TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
             F       S +    G R+  +     Y + +GA+G    +P+  Y  L  L+  +   
Sbjct: 1007 VFRH----GSLVMQNVGERTTPINGCVLYGTCNGAIGIVTQIPQDFYDFLHGLEERLKKI 1062

Query: 183  TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQL------SLGERLEICKKIG 236
                G +    +R ++        PS G IDG L+  FL L         + LEI     
Sbjct: 1063 IKSVGKIEHTYYRNFQINTKV--EPSEGFIDGDLIESFLDLGRDKMRDAVQGLEITLNGE 1120

Query: 237  SKHNDILDELYDIEALS 253
             K  D+ D +  +E L+
Sbjct: 1121 RKSADVEDVIKIVEDLT 1137


>gi|167517245|ref|XP_001742963.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163778062|gb|EDQ91677.1| predicted protein [Monosiga brevicollis MX1]
          Length = 1670

 Score = 41.6 bits (96), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 48/231 (20%), Positives = 89/231 (38%), Gaps = 44/231 (19%)

Query: 55   SDRSKRANEQPGLPRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIK 114
            S++  R    PG P   Y     + +    F+  D D+N+VL M  P     N    +  
Sbjct: 959  SEQRNRFVVGPGDPESRYLLTCFLPAEDR-FLFCDSDQNLVLGM-PPVDTVENDASLMHL 1016

Query: 115  KTDFHLGQHVNTFFKIRC------------------------------------KPSPIS 138
                H+G ++N++    C                                    +P+   
Sbjct: 1017 AGRIHIGDNINSYVICACIHVWTPYLLCPDSTFCFAALFVTSRFAFGSLSLSYERPAEAG 1076

Query: 139  DA--PGAR--SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAF 194
            +A   GA+  S     + ++ G +G  L + +K+   +  +Q  +    +  GGL    +
Sbjct: 1077 EAGEDGAKQQSSPPIVFTTVLGGVGMILEVQQKHLWFMHEMQRRLADMGNAVGGLTHEDY 1136

Query: 195  RTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKK--IGSKHNDIL 243
            R+ K     +  P+R  +DG+L+  FL+L+  E  E+ K+  I +  ND +
Sbjct: 1137 RSTKNGKRESVTPARCFVDGNLIESFLELTPEEMEEVMKEFHIPNSSNDFV 1187


>gi|321478515|gb|EFX89472.1| hypothetical protein DAPPUDRAFT_303245 [Daphnia pulex]
          Length = 1158

 Score = 41.6 bits (96), Expect = 0.37,   Method: Compositional matrix adjust.
 Identities = 38/166 (22%), Positives = 68/166 (40%), Gaps = 9/166 (5%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVN 125
            P  + A E + + + +G      + +  LF+ Q ++  +    R  L +   FHLG  VN
Sbjct: 968  PNWMSAVEILDDDTFLG-----AENSFNLFVCQKDSAATTEEERQQLTEVGRFHLGDMVN 1022

Query: 126  TFFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSH 185
             F          ++     ++    + ++ GA+G    LP + Y  L  +Q  M      
Sbjct: 1023 VFRHGSLVMDHAAETLTTPTQGCVLFGTVHGAIGVVTQLPSEFYHFLSEVQTRMARVIKP 1082

Query: 186  TGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEI 231
             G +    +R++  +      P  G IDG L+  FL LS  +  E+
Sbjct: 1083 VGKIEHSFWRSFATERKV--EPCEGFIDGDLIESFLDLSSDKMKEV 1126


>gi|1399512|gb|AAC47162.1| repE [Dictyostelium discoideum]
          Length = 1139

 Score = 41.6 bits (96), Expect = 0.37,   Method: Compositional matrix adjust.
 Identities = 29/124 (23%), Positives = 54/124 (43%), Gaps = 5/124 (4%)

Query: 118  FHLGQHVNTFFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQN 177
            +H+G+ +N+         P SD P   +     YAS++G++G    + E+++     LQ 
Sbjct: 1008 YHIGESINSMRHGSLVRLPDSDQPIIPT---ILYASVNGSIGVVASISEEDFIFFSKLQK 1064

Query: 178  VMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGS 237
             +       GG +   +R +    +     S+  IDG L+  FL L    +L+    +G 
Sbjct: 1065 GLNQVVRGVGGFSHETWRAFSNDHHTI--DSKNFIDGDLIETFLDLKYESQLKAVADLGI 1122

Query: 238  KHND 241
              +D
Sbjct: 1123 TPDD 1126


>gi|81868411|sp|Q9ESW0.1|DDB1_RAT RecName: Full=DNA damage-binding protein 1; AltName:
            Full=Damage-specific DNA-binding protein 1
 gi|9843869|emb|CAB89874.2| damage-specific DNA binding protein 1 [Rattus norvegicus]
          Length = 1140

 Score = 41.2 bits (95), Expect = 0.38,   Method: Compositional matrix adjust.
 Identities = 41/165 (24%), Positives = 68/165 (41%), Gaps = 17/165 (10%)

Query: 96   LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
            LF+ Q ++  +    R  L +   FHLG+ VN F    C  S +    G  S   +    
Sbjct: 974  LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGETSTPTQGSVL 1029

Query: 151  YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
              +++G +G    L E  Y  LL +QN +       G +    +R++  +      P+ G
Sbjct: 1030 LGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATG 1087

Query: 211  IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
             IDG L+  FL +S  +  E+   +      G K     D+L  +
Sbjct: 1088 FIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKV 1132


>gi|281345356|gb|EFB20940.1| hypothetical protein PANDA_015888 [Ailuropoda melanoleuca]
          Length = 1124

 Score = 41.2 bits (95), Expect = 0.38,   Method: Compositional matrix adjust.
 Identities = 41/140 (29%), Positives = 62/140 (44%), Gaps = 21/140 (15%)

Query: 96   LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
            LF+ Q ++  +    R  L +   FHLG+ VN F    C  S +    G  S   +    
Sbjct: 956  LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGETSTPTQGSVL 1011

Query: 151  YASLDGALGFFLPLPEKNYRRLLMLQNVM------VTHTSHTGGLNPRAFRTYKGKGYYA 204
            + +++G +G    L E  Y  LL +QN +      +TH S T     R+F T +      
Sbjct: 1012 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKNITH-SLTHLSTWRSFHTERKT---- 1066

Query: 205  GNPSRGIIDGSLVWKFLQLS 224
              P+ G IDG L+  FL +S
Sbjct: 1067 -EPATGFIDGDLIESFLDIS 1085


>gi|119594342|gb|EAW73936.1| damage-specific DNA binding protein 1, 127kDa, isoform CRA_d [Homo
            sapiens]
          Length = 1146

 Score = 41.2 bits (95), Expect = 0.40,   Method: Compositional matrix adjust.
 Identities = 44/174 (25%), Positives = 71/174 (40%), Gaps = 29/174 (16%)

Query: 96   LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
            LF+ Q ++  +    R  L +   FHLG+ VN F    C  S +    G  S   +    
Sbjct: 974  LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGETSTPTQGSVL 1029

Query: 151  YASLDGALGFFLPLPEKNYRRLLMLQN---------VMVTHTSHTGGLNPRAFRTYKGKG 201
            + +++G +G    L E  Y  LL +QN          +++  S T     R+F T +   
Sbjct: 1030 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKRCFLISTCSLTHPSTWRSFHTERKT- 1088

Query: 202  YYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
                 P+ G IDG L+  FL +S  +  E+   +      G K     D+L  +
Sbjct: 1089 ----EPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKV 1138


>gi|166240328|ref|XP_637896.2| UV-damaged DNA binding protein1 [Dictyostelium discoideum AX4]
 gi|238064940|sp|B0M0P5.1|DDB1_DICDI RecName: Full=DNA damage-binding protein 1; AltName: Full=DNA repair
            protein E; AltName: Full=UV-damaged DNA-binding protein 1
 gi|165988543|gb|EAL64385.2| UV-damaged DNA binding protein1 [Dictyostelium discoideum AX4]
          Length = 1181

 Score = 41.2 bits (95), Expect = 0.40,   Method: Compositional matrix adjust.
 Identities = 29/124 (23%), Positives = 54/124 (43%), Gaps = 5/124 (4%)

Query: 118  FHLGQHVNTFFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQN 177
            +H+G+ +N+         P SD P   +     YAS++G++G    + E+++     LQ 
Sbjct: 1050 YHIGESINSMRHGSLVRLPDSDQPIIPT---ILYASVNGSIGVVASISEEDFIFFSKLQK 1106

Query: 178  VMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGS 237
             +       GG +   +R +    +     S+  IDG L+  FL L    +L+    +G 
Sbjct: 1107 GLNQVVRGVGGFSHETWRAFSNDHHTI--DSKNFIDGDLIETFLDLKYESQLKAVADLGI 1164

Query: 238  KHND 241
              +D
Sbjct: 1165 TPDD 1168


>gi|67463896|ref|XP_648489.1| cleavage and polyadenylation specificity factor subunit [Entamoeba
            histolytica HM-1:IMSS]
 gi|56464653|gb|EAL43100.1| cleavage and polyadenylation specificity factor subunit, putative
            [Entamoeba histolytica HM-1:IMSS]
          Length = 1150

 Score = 41.2 bits (95), Expect = 0.40,   Method: Compositional matrix adjust.
 Identities = 36/168 (21%), Positives = 71/168 (42%), Gaps = 14/168 (8%)

Query: 76   FVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPS 135
            F+I       +  D ++N+ L  Y   A E      +     F+LG ++  F ++     
Sbjct: 980  FIITKDYFSVLEFDSEQNLSLLNYSSAATEQLSIFEI--DATFNLGMNLLKFTRLW---- 1033

Query: 136  PISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFR 195
               +  G    ++  Y +++G++G+   + EK Y+ L  +   M     H  G N   +R
Sbjct: 1034 ---NGKG----YIYMYVTVEGSVGYISVVEEKIYQVLRQINIKMNREPWHFAGTNAEEYR 1086

Query: 196  TYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDIL 243
              KG G   G      +DG ++ +F  L+  ++  +C +  +  ND+ 
Sbjct: 1087 FEKGYGMGFGTRKHVFLDGDMLKQFRLLNEEQQKRVCLR-NTSINDVF 1133


>gi|348526664|ref|XP_003450839.1| PREDICTED: DNA damage-binding protein 1-like [Oreochromis niloticus]
          Length = 1140

 Score = 41.2 bits (95), Expect = 0.42,   Method: Compositional matrix adjust.
 Identities = 42/165 (25%), Positives = 68/165 (41%), Gaps = 17/165 (10%)

Query: 96   LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
            LF+ Q ++  +    R  L +   FHLG+ VN F    C  S +    G  S   +    
Sbjct: 974  LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVLQNLGESSTPTQGSVL 1029

Query: 151  YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
            + +++G +G    L E  Y  LL LQN +       G +    +R++  +       + G
Sbjct: 1030 FGTVNGMIGLVTSLSEGWYSLLLDLQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEQATG 1087

Query: 211  IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
             IDG L+  FL L   +  E+   +      G K    +DE+  I
Sbjct: 1088 FIDGDLIESFLDLGRAKMQEVVSTLQIDDGSGMKREATVDEVIKI 1132


>gi|407035910|gb|EKE37921.1| CPSF A subunit region protein, putative [Entamoeba nuttalli P19]
          Length = 836

 Score = 41.2 bits (95), Expect = 0.45,   Method: Compositional matrix adjust.
 Identities = 36/168 (21%), Positives = 71/168 (42%), Gaps = 14/168 (8%)

Query: 76  FVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPS 135
           F+I       +  D ++N+ L  Y   A E      +     F+LG ++  F ++     
Sbjct: 666 FIITKDYFSVLEFDSEQNLSLLNYSSAATEQLSIFEI--DATFNLGMNLLKFTRLW---- 719

Query: 136 PISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFR 195
              +  G    ++  Y +++G++G+   + EK Y+ L  +   M     H  G N   +R
Sbjct: 720 ---NGKG----YIYMYVTVEGSVGYISVVEEKIYQVLRQINIKMNREPWHFAGTNAEEYR 772

Query: 196 TYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDIL 243
             KG G   G      +DG ++ +F  L+  ++  +C +  +  ND+ 
Sbjct: 773 FEKGYGMGFGTRKHVFLDGDMLKQFRLLNEEQQKRVCLR-NTSINDVF 819


>gi|150863836|ref|XP_001382447.2| hypothetical protein PICST_54680 [Scheffersomyces stipitis CBS 6054]
 gi|149385092|gb|ABN64418.2| predicted protein [Scheffersomyces stipitis CBS 6054]
          Length = 1228

 Score = 41.2 bits (95), Expect = 0.47,   Method: Compositional matrix adjust.
 Identities = 24/99 (24%), Positives = 47/99 (47%), Gaps = 5/99 (5%)

Query: 151  YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
            Y  + G +G  LPL  K+  + +   N +        G N       K + YY  NP + 
Sbjct: 1129 YTGIQGTVGLLLPLSTKSEVQFI---NSLEQSLRQQMGFNLLGMDHLKFRSYY--NPVKN 1183

Query: 211  IIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI 249
            +IDG L+ K+ +LS   +++I +++     ++  ++ D+
Sbjct: 1184 VIDGDLIEKYYELSQSLKIKIARELNRTPKEVEKKISDL 1222


>gi|167384458|ref|XP_001736962.1| hypothetical protein [Entamoeba dispar SAW760]
 gi|165900458|gb|EDR26769.1| hypothetical protein EDI_171140 [Entamoeba dispar SAW760]
          Length = 836

 Score = 41.2 bits (95), Expect = 0.47,   Method: Compositional matrix adjust.
 Identities = 36/168 (21%), Positives = 71/168 (42%), Gaps = 14/168 (8%)

Query: 76  FVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPS 135
           F+I       +  D ++N+ L  Y   A E      +     F+LG ++  F ++     
Sbjct: 666 FIITKDYFSVLEFDSEQNLSLLNYSSAATEQLSIFEI--DATFNLGMNLLKFTRLW---- 719

Query: 136 PISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFR 195
              +  G    ++  Y +++G++G+   + EK Y+ L  +   M     H  G N   +R
Sbjct: 720 ---NGKG----YIYMYVTVEGSVGYISVVEEKIYQVLRQINIKMNREPWHFAGTNAEEYR 772

Query: 196 TYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDIL 243
             KG G   G      +DG ++ +F  L+  ++  +C +  +  ND+ 
Sbjct: 773 FEKGYGMGFGTRKHVFLDGDMLKQFRLLNEEQQKRVCLR-NTSINDVF 819


>gi|449710759|gb|EMD49776.1| cleavage and polyadenylation specificity factor subunit, putative
           [Entamoeba histolytica KU27]
          Length = 836

 Score = 41.2 bits (95), Expect = 0.48,   Method: Compositional matrix adjust.
 Identities = 36/168 (21%), Positives = 71/168 (42%), Gaps = 14/168 (8%)

Query: 76  FVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPS 135
           F+I       +  D ++N+ L  Y   A E      +     F+LG ++  F ++     
Sbjct: 666 FIITKDYFSVLEFDSEQNLSLLNYSSAATEQLSIFEI--DATFNLGMNLLKFTRLW---- 719

Query: 136 PISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFR 195
              +  G    ++  Y +++G++G+   + EK Y+ L  +   M     H  G N   +R
Sbjct: 720 ---NGKG----YIYMYVTVEGSVGYISVVEEKIYQVLRQINIKMNREPWHFAGTNAEEYR 772

Query: 196 TYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDIL 243
             KG G   G      +DG ++ +F  L+  ++  +C +  +  ND+ 
Sbjct: 773 FEKGYGMGFGTRKHVFLDGDMLKQFRLLNEEQQKRVCLR-NTSINDVF 819


>gi|340367933|ref|XP_003382507.1| PREDICTED: splicing factor 3B subunit 3-like isoform 1 [Amphimedon
            queenslandica]
          Length = 1214

 Score = 40.8 bits (94), Expect = 0.51,   Method: Compositional matrix adjust.
 Identities = 34/135 (25%), Positives = 60/135 (44%), Gaps = 17/135 (12%)

Query: 116  TDFHLGQHVNTFFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLM- 174
            T +H+G+ +NT  K       +S  PG     +  Y +L G++G  +P   K        
Sbjct: 1090 TSYHVGEGINTLHK-------VSLIPGGSEVLV--YTTLSGSIGILVPFSSKEDSDFFQH 1140

Query: 175  LQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKK 234
            L+  M +  S+  G +  +FR+     YY   P + +IDG L   +  L   +R EI   
Sbjct: 1141 LEMHMRSEWSNLVGRDHLSFRS-----YYV--PVKSVIDGDLCEVYNSLDPSKRREIALD 1193

Query: 235  IGSKHNDILDELYDI 249
            +    +++  +L D+
Sbjct: 1194 LDRSPSEVAKKLEDL 1208


>gi|194901554|ref|XP_001980317.1| GG19434 [Drosophila erecta]
 gi|190652020|gb|EDV49275.1| GG19434 [Drosophila erecta]
          Length = 1140

 Score = 40.8 bits (94), Expect = 0.52,   Method: Compositional matrix adjust.
 Identities = 40/161 (24%), Positives = 67/161 (41%), Gaps = 16/161 (9%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVN 125
            P+ + A E + + + +G      + N  LF+ Q ++  +    R  L +   FHLG  VN
Sbjct: 952  PKWMRAVEILDDDTFLG-----SETNGNLFVCQKDSAATTDEERQLLPELARFHLGDTVN 1006

Query: 126  TFFKIRCKPSPISDAPGARSRFL---TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
             F       S +    G R+  +     Y + +GA+G    +P+  Y  L  L+  +   
Sbjct: 1007 VFRH----GSLVMQNVGERTTPINGCVLYGTCNGAIGIVTQIPQDFYDFLHGLEERLKKI 1062

Query: 183  TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQL 223
                G +    +R ++        PS G IDG L+  FL L
Sbjct: 1063 IKSVGKIEHTYYRNFQINTKV--EPSEGFIDGDLIESFLDL 1101


>gi|242803623|ref|XP_002484212.1| UV-damaged DNA binding protein, putative [Talaromyces stipitatus ATCC
            10500]
 gi|218717557|gb|EED16978.1| UV-damaged DNA binding protein, putative [Talaromyces stipitatus ATCC
            10500]
          Length = 1140

 Score = 40.8 bits (94), Expect = 0.57,   Method: Compositional matrix adjust.
 Identities = 33/164 (20%), Positives = 68/164 (41%), Gaps = 6/164 (3%)

Query: 77   VIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSP 136
            V +  S  +++SD + N+++     +        RL   ++  LG+ VN     R +P  
Sbjct: 968  VADIGSNQWLVSDAEGNLIVLRRNVDGVTEEDRRRLEVTSELLLGEMVN-----RIRPVN 1022

Query: 137  ISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRT 196
            I            +  +++G++  F  +  ++   L+ LQ  +  +    G +    FR 
Sbjct: 1023 ILQTSTVAVNPKAFLGTVEGSIYLFALINPEHQDFLMRLQTAITAYVDSPGYMPFSKFRA 1082

Query: 197  YKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHN 240
            ++        P R  +DG L+ +FL      + EI   +GS ++
Sbjct: 1083 FRSSVREGDEPFR-FVDGELIERFLDCDRPVQEEILGVVGSGYD 1125


>gi|183232997|ref|XP_653855.2| damaged DNA binding protein [Entamoeba histolytica HM-1:IMSS]
 gi|169801778|gb|EAL48469.2| damaged DNA binding protein, putative [Entamoeba histolytica
            HM-1:IMSS]
          Length = 1088

 Score = 40.8 bits (94), Expect = 0.59,   Method: Composition-based stats.
 Identities = 43/173 (24%), Positives = 73/173 (42%), Gaps = 13/173 (7%)

Query: 73   ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC 132
            A EFV E     ++ SD + N+++F       ES    RL      H+G+ +N    + C
Sbjct: 914  AIEFVDEDC---YLSSDSNSNILIFNTNSTGNESER-FRLNNCAHIHVGECIN----VMC 965

Query: 133  KPS--PISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQN-VMVTHTSHTGGL 189
            K S  P         +    +  + G +G    +P + Y  L+ +QN +++         
Sbjct: 966  KGSIAPTHSTYETVQKKCILFGGVTGYIGGICEIPNEIYDVLIKVQNQILLQMKGIVECT 1025

Query: 190  NPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
             P  ++  K    +   PS  IIDGS+V  +L++S  ++ EI    G     I
Sbjct: 1026 TPDDWK--KVIDDWKRMPSSNIIDGSIVESYLEMSKEKQCEIAHLSGVNEEQI 1076


>gi|444513057|gb|ELV10249.1| DNA damage-binding protein 1 [Tupaia chinensis]
          Length = 1146

 Score = 40.8 bits (94), Expect = 0.60,   Method: Compositional matrix adjust.
 Identities = 44/174 (25%), Positives = 71/174 (40%), Gaps = 29/174 (16%)

Query: 96   LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
            LF+ Q ++  +    R  L +   FHLG+ VN F    C  S +    G  S   +    
Sbjct: 974  LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGETSTPTQGSVL 1029

Query: 151  YASLDGALGFFLPLPEKNYRRLLMLQNVM---------VTHTSHTGGLNPRAFRTYKGKG 201
            + +++G +G    L E  Y  LL +QN +         ++  S T     R+F T +   
Sbjct: 1030 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKRCFQISPNSLTDMSTWRSFHTERKT- 1088

Query: 202  YYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
                 P+ G IDG L+  FL +S  +  E+   +      G K     D+L  +
Sbjct: 1089 ----EPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKV 1138


>gi|322700871|gb|EFY92623.1| DNA damage-binding protein 1 [Metarhizium acridum CQMa 102]
          Length = 1121

 Score = 40.4 bits (93), Expect = 0.63,   Method: Compositional matrix adjust.
 Identities = 32/172 (18%), Positives = 68/172 (39%), Gaps = 15/172 (8%)

Query: 85   FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC--------KPSP 136
            ++ +D   N+++    PEA       +L   ++ ++G+ +N   ++           P  
Sbjct: 940  WLEADAQGNIIVLQRNPEAPTEQDRSKLEVTSEINIGEQINQIRRLHVASNENAVVSPKA 999

Query: 137  ISDAPGARSRFLTWYASL------DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 190
               + G     +  +  L      +G L  F  +  K    LL  Q  +  +    G ++
Sbjct: 1000 FLGSVGLSETTINCWTQLLILVQIEGTLYLFGEIAPKYQDLLLTFQARLQDYIYAPGNVS 1059

Query: 191  PRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
               +R ++ K      P R  +DG +V +FL L   ++  +C+ +G    D+
Sbjct: 1060 FNLWRAFRNKAREGDGPFR-FVDGEMVERFLDLDEAKQELVCEGLGPSVEDM 1110


>gi|198432471|ref|XP_002129229.1| PREDICTED: similar to DNA damage-binding protein 1 (Damage-specific
            DNA-binding protein 1) (UV-damaged DNA-binding factor)
            (DDB p127 subunit) (DNA damage-binding protein a) (DDBa)
            (UV-damaged DNA-binding protein 1) (UV-DDB 1) (Xeroderma
            pigmentosum group E-co... isoform 2 [Ciona intestinalis]
          Length = 1142

 Score = 40.4 bits (93), Expect = 0.69,   Method: Compositional matrix adjust.
 Identities = 38/184 (20%), Positives = 77/184 (41%), Gaps = 11/184 (5%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  + A E + + + +G   ++   NV +      A       +L +   FH+G  +NTF
Sbjct: 954  PNWMTAVEILDDDNFLG---AENFYNVFICQKDSGATTDEERSKLREAALFHVGDSINTF 1010

Query: 128  FKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 187
                     + +     S+    + ++ G++G    + E  Y  L  +QN +       G
Sbjct: 1011 RHGSLVMQNVGET-AVSSKGHILFGTVHGSIGVITTVDEDLYAFLHSIQNRLAKVIKSVG 1069

Query: 188  GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI-----GSKHNDI 242
             ++  ++R++        +  RG +DG L+  FL L+  +  E+ K +     G+K    
Sbjct: 1070 NIDHESWRSFCTNEKTEAH--RGFVDGDLIECFLDLNREKMAEVAKGLMVKEHGTKREAT 1127

Query: 243  LDEL 246
            +D+L
Sbjct: 1128 VDDL 1131


>gi|299751161|ref|XP_001830098.2| pre-mRNA-splicing factor rse1 [Coprinopsis cinerea okayama7#130]
 gi|298409248|gb|EAU91763.2| pre-mRNA-splicing factor rse1 [Coprinopsis cinerea okayama7#130]
          Length = 1205

 Score = 40.4 bits (93), Expect = 0.70,   Method: Compositional matrix adjust.
 Identities = 41/152 (26%), Positives = 65/152 (42%), Gaps = 18/152 (11%)

Query: 107  NGG-HRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLP 165
            NG  H+      FH+G  + +  K+       S   G R   L  Y  L G +G  +PL 
Sbjct: 1070 NGAPHKTKMLAHFHVGDIITSIHKV-------SLVVGGREVLL--YTGLQGTIGILVPLT 1120

Query: 166  EK-NYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLS 224
             K +   L ML+  +    +  G L  R   +++G  YY   P + +IDG L   +  LS
Sbjct: 1121 SKEDIEFLTMLEQHI---RNEQGSLVGRDHLSWRG--YYV--PVKAVIDGDLCETYGGLS 1173

Query: 225  LGERLEICKKIGSKHNDILDELYDIEALSSHF 256
              ++  I  ++     D+L +L  +   SS F
Sbjct: 1174 SSKQSAIASELDRTVGDVLKKLDQMRVASSGF 1205


>gi|429850956|gb|ELA26181.1| DNA damage-binding protein 1 [Colletotrichum gloeosporioides Nara
            gc5]
          Length = 1409

 Score = 40.0 bits (92), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 34/172 (19%), Positives = 75/172 (43%), Gaps = 16/172 (9%)

Query: 85   FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGAR 144
            ++ +D   N+++    P A   +   ++   ++FHLG+ +N     + +P  +       
Sbjct: 972  WIEADAQGNLMVLRRNPNAPTEHDKKQMEVISEFHLGEQIN-----KIRPLDVVSGENDP 1026

Query: 145  SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG-------GLNPRAFRTY 197
                 + A+++G++  F  +  +    LL  Q  +       G       GL+  ++R +
Sbjct: 1027 IEPKAFLATIEGSIYVFADIKPEYQSLLLQFQERLAGVIKTLGQADEPGAGLSFMSWRGF 1086

Query: 198  KGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG---SKHNDILDEL 246
            +     A  P R  +DG L+ +FL L  G +  + + +G    +  D+++EL
Sbjct: 1087 RNAKRSADGPFR-FVDGELIERFLDLDAGRQEAVVQGLGPTVERMRDLVEEL 1137


>gi|157131296|ref|XP_001662180.1| spliceosomal protein sap [Aedes aegypti]
 gi|108881836|gb|EAT46061.1| AAEL002709-PA [Aedes aegypti]
          Length = 1215

 Score = 39.7 bits (91), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 32/133 (24%), Positives = 59/133 (44%), Gaps = 17/133 (12%)

Query: 118  FHLGQHVNTFFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEK-NYRRLLMLQ 176
            FHLG+ + +  K    P       G+ S     YA++ G +G  +P   + +Y     L+
Sbjct: 1093 FHLGETIMSLQKATLIPG------GSESLI---YATMSGTVGALVPFTSREDYDFFQHLE 1143

Query: 177  NVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 236
              M    +   G +  ++R+Y    YY   P + ++DG L  +F  +   ++  I   +G
Sbjct: 1144 MHMRNENTPLCGRDHLSYRSY----YY---PVKHVMDGDLCEQFTSMDPAKQKSIASDLG 1196

Query: 237  SKHNDILDELYDI 249
               N++  +L DI
Sbjct: 1197 RTPNEVAKKLEDI 1209


>gi|313238818|emb|CBY20011.1| unnamed protein product [Oikopleura dioica]
 gi|313245836|emb|CBY34826.1| unnamed protein product [Oikopleura dioica]
          Length = 1135

 Score = 39.7 bits (91), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 35/182 (19%), Positives = 80/182 (43%), Gaps = 13/182 (7%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  + A + + + + +G   ++  +N+ +     EA +     +L+    +H+G+H+NT 
Sbjct: 951  PNWMTAIKMIDDDNYIG---AENSENIFICTRNTEAPDEEDRQQLLPTGYYHVGEHINTI 1007

Query: 128  FKIRCKPSPISDAPGARSR-FLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHT 186
             +         ++    +R FL    S+ G +G     PEK ++ L  L+  M       
Sbjct: 1008 VEGNLVMDVHVESSITPTRTFL--MGSVSGYVGLLAIFPEKQWQFLSKLEAKMRKVIRGV 1065

Query: 187  GGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG-----SKHND 241
            G ++  ++R ++          +G +DG L+  F  L   ++ E+  ++      + H+D
Sbjct: 1066 GKIDHESWRRFESDSRM--EDCKGFVDGDLIEMFQDLRPEKQKEVISELTMDGEPATHDD 1123

Query: 242  IL 243
            ++
Sbjct: 1124 VV 1125


>gi|157141630|ref|XP_001647735.1| spliceosomal protein sap [Aedes aegypti]
 gi|108867854|gb|EAT32403.1| AAEL015441-PA, partial [Aedes aegypti]
          Length = 645

 Score = 39.7 bits (91), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 32/133 (24%), Positives = 59/133 (44%), Gaps = 17/133 (12%)

Query: 118 FHLGQHVNTFFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEK-NYRRLLMLQ 176
           FHLG+ + +  K    P       G+ S     YA++ G +G  +P   + +Y     L+
Sbjct: 523 FHLGETIMSLQKATLIPG------GSESLI---YATMSGTVGALVPFTSREDYDFFQHLE 573

Query: 177 NVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 236
             M    +   G +  ++R+Y    YY   P + ++DG L  +F  +   ++  I   +G
Sbjct: 574 MHMRNENTPLCGRDHLSYRSY----YY---PVKHVMDGDLCEQFTSMDPAKQKSIASDLG 626

Query: 237 SKHNDILDELYDI 249
              N++  +L DI
Sbjct: 627 RTPNEVAKKLEDI 639


>gi|348560393|ref|XP_003465998.1| PREDICTED: DNA damage-binding protein 1-like [Cavia porcellus]
          Length = 1140

 Score = 39.7 bits (91), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 35/134 (26%), Positives = 58/134 (43%), Gaps = 11/134 (8%)

Query: 96   LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
            LF+ Q ++  +    R  L +   FHLG+ VN F    C  S +    G  S   +    
Sbjct: 974  LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGETSTPTQGSVL 1029

Query: 151  YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
            + +++G +G    L E  Y  LL +QN +       G +    +R++  +       + G
Sbjct: 1030 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEQATG 1087

Query: 211  IIDGSLVWKFLQLS 224
             IDG L+  FL +S
Sbjct: 1088 FIDGDLIESFLDIS 1101


>gi|339259094|ref|XP_003369733.1| splicing factor 3B subunit 3 [Trichinella spiralis]
 gi|316965959|gb|EFV50595.1| splicing factor 3B subunit 3 [Trichinella spiralis]
          Length = 1241

 Score = 39.7 bits (91), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 29/110 (26%), Positives = 55/110 (50%), Gaps = 10/110 (9%)

Query: 141  PGARSRFLTWYASLDGALGFFLPLPEK-NYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKG 199
            PG+    +  Y+++ G +G  +P   K +Y     L+  + T      G +  A+R++  
Sbjct: 1135 PGSSESLV--YSTMSGMIGALVPFSSKEDYEFFQHLEMHLRTEYPPLCGRDHLAYRSF-- 1190

Query: 200  KGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI 249
               YA  P +G+IDG L  ++  L  G++ EI  ++    ++I+ +L DI
Sbjct: 1191 ---YA--PVKGVIDGDLCEQYCLLEYGKQKEISNELDRVPSEIMKKLEDI 1235


>gi|58383228|ref|XP_312466.2| AGAP002472-PA [Anopheles gambiae str. PEST]
 gi|55242305|gb|EAA08181.2| AGAP002472-PA [Anopheles gambiae str. PEST]
          Length = 1138

 Score = 39.7 bits (91), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 35/157 (22%), Positives = 62/157 (39%), Gaps = 6/157 (3%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  + A E + + + +G   +D   N+ + +    A       ++ +   FHLG  VN F
Sbjct: 950  PNWMTAVEILDDDAFLG---ADNSNNLFVCLKDSAATTDEERQQMPEVAQFHLGDMVNVF 1006

Query: 128  FKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 187
                     IS+     +  +  + ++ GA+G    +    Y  L  LQ  +       G
Sbjct: 1007 RHGSLVMQNISERSTPTTGCV-LFGTVSGAIGLVTQIQSDFYEFLRKLQENLTNTIKSVG 1065

Query: 188  GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLS 224
             ++   +R++  +         G IDG LV  FL LS
Sbjct: 1066 KIDHSYWRSFHTETKM--ERCEGFIDGDLVESFLDLS 1100


>gi|225443990|ref|XP_002280735.1| PREDICTED: DNA damage-binding protein 1 isoform 1 [Vitis vinifera]
          Length = 1089

 Score = 39.3 bits (90), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 32/130 (24%), Positives = 54/130 (41%), Gaps = 12/130 (9%)

Query: 111  RLIKKTDFHLGQHVNTF----FKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPE 166
            RL    ++HLG+ VN F      +R   S +   P         + +++G +G    LP 
Sbjct: 951  RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIP------TVIFGTVNGVIGVIASLPH 1004

Query: 167  KNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLG 226
              Y  L  LQ  +       GGL+   +R++  +       ++  +DG L+  FL L+  
Sbjct: 1005 DQYVFLEKLQANLRKVIKGVGGLSHEQWRSFNNEKKTV--DAKNFLDGDLIETFLDLNRT 1062

Query: 227  ERLEICKKIG 236
               EI K + 
Sbjct: 1063 RMDEISKAMA 1072


>gi|225443992|ref|XP_002280744.1| PREDICTED: DNA damage-binding protein 1 isoform 2 [Vitis vinifera]
          Length = 1068

 Score = 39.3 bits (90), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 32/130 (24%), Positives = 54/130 (41%), Gaps = 12/130 (9%)

Query: 111  RLIKKTDFHLGQHVNTF----FKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPE 166
            RL    ++HLG+ VN F      +R   S +   P         + +++G +G    LP 
Sbjct: 930  RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIP------TVIFGTVNGVIGVIASLPH 983

Query: 167  KNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLG 226
              Y  L  LQ  +       GGL+   +R++  +       ++  +DG L+  FL L+  
Sbjct: 984  DQYVFLEKLQANLRKVIKGVGGLSHEQWRSFNNEKKTV--DAKNFLDGDLIETFLDLNRT 1041

Query: 227  ERLEICKKIG 236
               EI K + 
Sbjct: 1042 RMDEISKAMA 1051


>gi|353236335|emb|CCA68332.1| probable splicing factor 3B subunit 3 [Piriformospora indica DSM
            11827]
          Length = 1243

 Score = 39.3 bits (90), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 36/148 (24%), Positives = 63/148 (42%), Gaps = 17/148 (11%)

Query: 110  HRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNY 169
            H+    T FH+G  + +  ++         A  A +R +  Y  L G +G  +PL  K+ 
Sbjct: 1112 HKTKMLTHFHVGDVITSIQRV---------ALVAGAREVVVYFGLHGTIGILVPLVTKDD 1162

Query: 170  RRLL-MLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGER 228
               +  L+  M T      G +  ++R     GYY   P +  +DG L   F +L   ++
Sbjct: 1163 VDFISTLEQHMRTENLSLVGRDHLSWR-----GYY--TPVKATVDGDLCEYFAKLPTQKQ 1215

Query: 229  LEICKKIGSKHNDILDELYDIEALSSHF 256
            L I  ++     D+L +L  +   +S F
Sbjct: 1216 LAIAGELDRTVGDVLKKLESLRVTASGF 1243


>gi|410912407|ref|XP_003969681.1| PREDICTED: DNA damage-binding protein 1-like [Takifugu rubripes]
          Length = 1140

 Score = 38.9 bits (89), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 41/165 (24%), Positives = 68/165 (41%), Gaps = 17/165 (10%)

Query: 96   LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
            LF+ Q ++  +    R  L +   FHLG+ VN F    C  S +    G  S   +    
Sbjct: 974  LFVCQKDSAATTDEDRQHLQEVGVFHLGEFVNVF----CHGSLVLQNLGETSTPTQGSVL 1029

Query: 151  YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
            + ++ G +G    L E  +  LL LQN +       G +    +R++  +       ++G
Sbjct: 1030 FGTVTGMIGLVTSLSEGWHSLLLDLQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEQAKG 1087

Query: 211  IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
             IDG L+  FL L   +  E+   +      G K    +DE+  I
Sbjct: 1088 FIDGDLIESFLDLGRAKMQEVVSTLQIDDGSGMKREATVDEVIKI 1132


>gi|167395195|ref|XP_001741266.1| pre-mRNA-splicing factor rse-1 [Entamoeba dispar SAW760]
 gi|165894222|gb|EDR22283.1| pre-mRNA-splicing factor rse-1, putative [Entamoeba dispar SAW760]
          Length = 1145

 Score = 38.9 bits (89), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 36/146 (24%), Positives = 67/146 (45%), Gaps = 23/146 (15%)

Query: 108  GGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEK 167
              ++++  T F +G    +F       +  S  PGA S FL  Y++  G L   +PL  +
Sbjct: 1013 SNYKMVTATQFFIGDIATSF-------AQCSLIPGAPSIFL--YSNFMGGLSALIPLQNQ 1063

Query: 168  N----YRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQL 223
            N    Y+ L M    M  H ++    N  +FR+       +  P +  +DG L   + +L
Sbjct: 1064 NDIDFYQHLEMH---MRVHWTNLTDRNHISFRS-------SMVPVKDTVDGDLCELYERL 1113

Query: 224  SLGERLEICKKIGSKHNDILDELYDI 249
                + EI +++  + N+I+ +L+D+
Sbjct: 1114 PYEIQQEIAEEMEKEVNEIIKKLHDL 1139


>gi|300707023|ref|XP_002995737.1| hypothetical protein NCER_101290 [Nosema ceranae BRL01]
 gi|239604943|gb|EEQ82066.1| hypothetical protein NCER_101290 [Nosema ceranae BRL01]
          Length = 1155

 Score = 38.9 bits (89), Expect = 2.0,   Method: Composition-based stats.
 Identities = 40/164 (24%), Positives = 71/164 (43%), Gaps = 22/164 (13%)

Query: 75   EFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKK--TDFHLGQHV-NTFFKIR 131
            +F+++  ++G + +D   N+  + Y P    S  G + +K+  T+F+LG+ V        
Sbjct: 1000 DFIVKEPALGIICTDFAGNIHTYTYSPVNILSCNGTKFVKRCETNFNLGKLVIKRAHSKL 1059

Query: 132  CKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNP 191
              P  ISD+         +   LD        L   NY   L +QN  ++    T GL P
Sbjct: 1060 LNPVFISDS--------NYIIELDS-------LSLDNYNNFLKVQNAYLSLIEDTFGLCP 1104

Query: 192  RAFRTYKGKGYYAGNPS-RGIIDGSLVWKFLQLSLGERLEICKK 234
              F   +   Y+   PS +  I   L+++FL L + ++    K+
Sbjct: 1105 ENFNNCE---YHLKPPSVKKPILKELLFRFLHLPVDKKANFLKE 1145


>gi|198432469|ref|XP_002129207.1| PREDICTED: similar to DNA damage-binding protein 1 (Damage-specific
            DNA-binding protein 1) (UV-damaged DNA-binding factor)
            (DDB p127 subunit) (DNA damage-binding protein a) (DDBa)
            (UV-damaged DNA-binding protein 1) (UV-DDB 1) (Xeroderma
            pigmentosum group E-co... isoform 1 [Ciona intestinalis]
          Length = 1150

 Score = 38.9 bits (89), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 38/188 (20%), Positives = 77/188 (40%), Gaps = 15/188 (7%)

Query: 68   PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
            P  + A E + + + +G   ++   NV +      A       +L +   FH+G  +NTF
Sbjct: 958  PNWMTAVEILDDDNFLG---AENFYNVFICQKDSGATTDEERSKLREAALFHVGDSINTF 1014

Query: 128  FKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 187
                     + +     S+    + ++ G++G    + E  Y  L  +QN +       G
Sbjct: 1015 RHGSLVMQNVGET-AVSSKGHILFGTVHGSIGVITTVDEDLYAFLHSIQNRLAKVIKSVG 1073

Query: 188  GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI---------GSK 238
             ++  ++R++        +  RG +DG L+  FL L+  +  E+ K +         G+K
Sbjct: 1074 NIDHESWRSFCTNEKTEAH--RGFVDGDLIECFLDLNREKMAEVAKGLMVKNFNDQHGTK 1131

Query: 239  HNDILDEL 246
                +D+L
Sbjct: 1132 REATVDDL 1139


>gi|443919095|gb|ELU39366.1| cleavage factor protein [Rhizoctonia solani AG-1 IA]
          Length = 788

 Score = 38.9 bits (89), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 23/94 (24%), Positives = 44/94 (46%), Gaps = 24/94 (25%)

Query: 153 SLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGII 212
           S DG +    PL E  + RL +LQ  ++                         N   G++
Sbjct: 712 SSDGTIASLTPLNESEFGRLQLLQGQLIR------------------------NVHNGVL 747

Query: 213 DGSLVWKFLQLSLGERLEICKKIGSKHNDILDEL 246
           DG+L+  F +L + +++E+ ++IG++   IL++L
Sbjct: 748 DGNLLAAFEELPVSKQVEMTQQIGAEREKILNDL 781


>gi|390366809|ref|XP_780126.3| PREDICTED: DNA damage-binding protein 1-like isoform 1
           [Strongylocentrotus purpuratus]
          Length = 630

 Score = 38.5 bits (88), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 32/118 (27%), Positives = 51/118 (43%), Gaps = 7/118 (5%)

Query: 118 FHLGQHVNTFFKIRCKPSPISDA--PGARSRFLTWYASLDGALGFFLPLPEKNYRRLLML 175
           FHLG+ VN F         I ++  P   S     + ++ G++G    L E+ YR LL +
Sbjct: 489 FHLGEFVNVFRHGSLVMQNIGESTIPTTGS---VLFGTVSGSVGLVTQLNEEFYRFLLEV 545

Query: 176 QNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICK 233
           QN +       G +    +R++  +      P    IDG L+  FL LS     E+ +
Sbjct: 546 QNKLTKVIKSVGKIKHSFWRSFYSE--RKTEPMDNFIDGDLLESFLDLSRDTMDEVAQ 601


>gi|67472417|ref|XP_652013.1| splicing factor 3B subunit 3 [Entamoeba histolytica HM-1:IMSS]
 gi|56468814|gb|EAL46625.1| splicing factor 3B subunit 3, putative [Entamoeba histolytica
            HM-1:IMSS]
 gi|449709507|gb|EMD48761.1| premRNA-splicing factor rse-1, putative [Entamoeba histolytica KU27]
          Length = 1145

 Score = 38.5 bits (88), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 36/146 (24%), Positives = 67/146 (45%), Gaps = 23/146 (15%)

Query: 108  GGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEK 167
              ++++  T F +G    +F       +  S  PGA S FL  Y++  G L   +PL  +
Sbjct: 1013 SSYKMVTATQFFVGDIATSF-------AQCSLIPGAPSIFL--YSNFMGGLSALIPLQSQ 1063

Query: 168  N----YRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQL 223
            N    Y+ L M    M  H ++    N  +FR+       +  P +  +DG L   + +L
Sbjct: 1064 NDIDFYQHLEMH---MRVHWTNLTDRNHISFRS-------SIVPVKDTVDGDLCELYERL 1113

Query: 224  SLGERLEICKKIGSKHNDILDELYDI 249
                + EI +++  + N+I+ +L+D+
Sbjct: 1114 PYEIQQEIAEEMEKEVNEIIKKLHDL 1139


>gi|407044064|gb|EKE42344.1| CPSF A subunit region protein, putative [Entamoeba nuttalli P19]
          Length = 1145

 Score = 38.5 bits (88), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 36/146 (24%), Positives = 67/146 (45%), Gaps = 23/146 (15%)

Query: 108  GGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEK 167
              ++++  T F +G    +F       +  S  PGA S FL  Y++  G L   +PL  +
Sbjct: 1013 SSYKMVTATQFFVGDIATSF-------AQCSLIPGAPSIFL--YSNFMGGLSALIPLQSQ 1063

Query: 168  N----YRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQL 223
            N    Y+ L M    M  H ++    N  +FR+       +  P +  +DG L   + +L
Sbjct: 1064 NDIDFYQHLEMH---MRVHWTNLTDRNHISFRS-------SIVPVKDTVDGDLCELYERL 1113

Query: 224  SLGERLEICKKIGSKHNDILDELYDI 249
                + EI +++  + N+I+ +L+D+
Sbjct: 1114 PYEIQQEIAEEMEKEVNEIIKKLHDL 1139


>gi|398020786|ref|XP_003863556.1| cleavage and polyadenylation specificity factor-like protein
            [Leishmania donovani]
 gi|322501789|emb|CBZ36871.1| cleavage and polyadenylation specificity factor-like protein
            [Leishmania donovani]
          Length = 1542

 Score = 37.7 bits (86), Expect = 4.4,   Method: Composition-based stats.
 Identities = 28/131 (21%), Positives = 57/131 (43%), Gaps = 15/131 (11%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKT------DFHL-GQH 123
            V + + +   +  G +++D  +N+V   Y+P   E  G    I ++      ++ L G  
Sbjct: 1357 VVSNDMLYHDTRFGLLVTDDARNLVCMSYKPRVLEEPGKPPKILESLLTVTGEYRLAGGV 1416

Query: 124  VNTFFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHT 183
            +    ++R        A   R+  +  Y +  G +G+ +PL ++  R    +   + +  
Sbjct: 1417 LLKMMRLR--------AASTRNSSVAIYVTNMGEIGYLVPLGDQTSRTGQWVGRRLQSEV 1468

Query: 184  SHTGGLNPRAF 194
            +H GGL PR F
Sbjct: 1469 AHAGGLPPRMF 1479


>gi|146096490|ref|XP_001467824.1| cleavage and polyadenylation specificity factor-like protein
            [Leishmania infantum JPCM5]
 gi|134072190|emb|CAM70891.1| cleavage and polyadenylation specificity factor-like protein
            [Leishmania infantum JPCM5]
          Length = 1542

 Score = 37.7 bits (86), Expect = 4.4,   Method: Composition-based stats.
 Identities = 28/131 (21%), Positives = 57/131 (43%), Gaps = 15/131 (11%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKT------DFHL-GQH 123
            V + + +   +  G +++D  +N+V   Y+P   E  G    I ++      ++ L G  
Sbjct: 1357 VVSNDMLYHDTRFGLLVTDDARNLVCMSYKPRVLEEPGKPPKILESLLTVTGEYRLAGGV 1416

Query: 124  VNTFFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHT 183
            +    ++R        A   R+  +  Y +  G +G+ +PL ++  R    +   + +  
Sbjct: 1417 LLKMMRLR--------AASTRNSSVAIYVTNMGEIGYLVPLGDQTSRTGQWVGRRLQSEV 1468

Query: 184  SHTGGLNPRAF 194
            +H GGL PR F
Sbjct: 1469 AHAGGLPPRMF 1479


>gi|307107849|gb|EFN56091.1| hypothetical protein CHLNCDRAFT_145620 [Chlorella variabilis]
          Length = 1626

 Score = 37.7 bits (86), Expect = 4.7,   Method: Composition-based stats.
 Identities = 20/67 (29%), Positives = 30/67 (44%)

Query: 73   ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC 132
            A +F+I  SS+     D    + L  Y P    S  G RL+    FH+G+  +   ++R 
Sbjct: 1425 ATQFLINGSSLHLASCDSAGTLRLLSYAPSHPASWKGQRLVAWGSFHVGEAASCMRRLRL 1484

Query: 133  KPSPISD 139
             PS   D
Sbjct: 1485 HPSSPED 1491


>gi|342186481|emb|CCC95967.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 1456

 Score = 37.7 bits (86), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 33/126 (26%), Positives = 52/126 (41%), Gaps = 5/126 (3%)

Query: 71   VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGH--RLIKKTDFHLGQHVNTFF 128
            V  C+ +    + G + S+  ++++L  Y P  +ES G H    + ++ F L        
Sbjct: 1271 VVHCDMMYHDRTFGILCSNDQRDLLLMGYTPRVQES-GEHTPSRVLESPFSLDGEYRLPS 1329

Query: 129  KIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGG 188
                K      A G  S  +T Y S  G +GF +PL E+  R  L +   +        G
Sbjct: 1330 GCLAKSLRFRSAAGNSS--VTVYISNYGEVGFIVPLGEQANRTALWITRRLQVDLPCDAG 1387

Query: 189  LNPRAF 194
            L PR F
Sbjct: 1388 LTPRMF 1393


>gi|302837243|ref|XP_002950181.1| UV-damaged DNA binding complex subunit 1 protein [Volvox carteri f.
            nagariensis]
 gi|300264654|gb|EFJ48849.1| UV-damaged DNA binding complex subunit 1 protein [Volvox carteri f.
            nagariensis]
          Length = 1104

 Score = 37.0 bits (84), Expect = 7.4,   Method: Compositional matrix adjust.
 Identities = 35/143 (24%), Positives = 55/143 (38%), Gaps = 11/143 (7%)

Query: 85   FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF----FKIRCKPSPISDA 140
            ++  D   N+V+     ++       RL    ++H G  VN F      +R   S     
Sbjct: 969  YLEGDNHLNLVVLRRNADSATDEERARLQVVGEYHTGTFVNRFRHGSLVMRPPDSEFVSL 1028

Query: 141  PGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGK 200
            P         +   DG LG    LP   Y  L  LQ+ +       GGL+  A+  +  +
Sbjct: 1029 P-----VPLLFGGTDGRLGVIARLPPGLYEMLTKLQSALRQVVRGVGGLSHEAWIAFSNE 1083

Query: 201  GYYAGNPSRGIIDGSLVWKFLQL 223
               A   ++G +DG L+  FL L
Sbjct: 1084 RRTAD--AKGFVDGDLIETFLDL 1104


>gi|70945139|ref|XP_742421.1| hypothetical protein [Plasmodium chabaudi chabaudi]
 gi|56521397|emb|CAH76894.1| conserved hypothetical protein [Plasmodium chabaudi chabaudi]
          Length = 435

 Score = 37.0 bits (84), Expect = 8.5,   Method: Compositional matrix adjust.
 Identities = 41/192 (21%), Positives = 76/192 (39%), Gaps = 13/192 (6%)

Query: 74  CEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCK 133
           C FV   S   F++SD + N ++F             +L +   F+ G  VN    +   
Sbjct: 244 CTFVCALSKSHFLVSDMESNFLVFQKSSIKYNDEDSFKLSRVALFNHGHVVNKMLPVSLS 303

Query: 134 PSPISDAPGA---RSRFLTWYASLDGALGFFLPLPE-KNYRRLLMLQNVMVTHTSHTGGL 189
                + P     R +     AS +G++   +P     N+++ L ++  +    S  G +
Sbjct: 304 SLIEEEEPQNEILRKKESILCASSEGSISSIIPFSNLANFKKALCIELALNDSLSSIGNI 363

Query: 190 NPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLE-------ICKKIGSKHNDI 242
           N  +  TYK     +    +G++DG +   F  +   ++ +       I KK+  K    
Sbjct: 364 NDNSNNTYKMN--LSEKSCKGVVDGEVFKMFFSMPFEKQFKTYIYAKWIAKKLNCKFGTF 421

Query: 243 LDELYDIEALSS 254
            + + DIE L S
Sbjct: 422 ENFMLDIENLCS 433


>gi|320583269|gb|EFW97484.1| RNA-binding subunit of the mRNA cleavage and polyadenylation factor
            [Ogataea parapolymorpha DL-1]
          Length = 1309

 Score = 37.0 bits (84), Expect = 8.5,   Method: Compositional matrix adjust.
 Identities = 35/168 (20%), Positives = 71/168 (42%), Gaps = 5/168 (2%)

Query: 73   ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC 132
            A +F++   ++  +++D+D    L  Y P    S  G +L++++             +  
Sbjct: 1128 AADFLVHDGNLHLLVADEDSVFHLLQYDPYDGNSMKGLKLLRRSLLRSNALTTKMISVAR 1187

Query: 133  KPSPISDAPGARSRFLTWY----ASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGG 188
              S  S            Y    +++DG+    +P+ E  YRRL  +QN +     H  G
Sbjct: 1188 DRSLFSMVSTLNHEDDLGYEIIGSNIDGSFYKVMPVNEYQYRRLYSIQNYLYDKELHWLG 1247

Query: 189  LNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 236
            LNP++     G      +  R  I+ ++  +F+  +   + +I +K+G
Sbjct: 1248 LNPKS-NAIGGLTELMPSIKRPFIELNMFHRFIGFNNDRKKQIMQKLG 1294


>gi|294654658|ref|XP_456718.2| DEHA2A08932p [Debaryomyces hansenii CBS767]
 gi|218511767|sp|Q6BYK1.2|RSE1_DEBHA RecName: Full=Pre-mRNA-splicing factor RSE1
 gi|199429049|emb|CAG84677.2| DEHA2A08932p [Debaryomyces hansenii CBS767]
          Length = 1256

 Score = 36.6 bits (83), Expect = 9.5,   Method: Compositional matrix adjust.
 Identities = 42/150 (28%), Positives = 67/150 (44%), Gaps = 21/150 (14%)

Query: 91   DKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARSRFLTW 150
            DK+  L  YQ E+  +  G RL    +F+L Q + T F    K S +    G +   +  
Sbjct: 1095 DKDWSLLRYQ-ESYLNGSGSRLKNICEFYL-QDIPTSF---TKGSLVM---GGKESII-- 1144

Query: 151  YASLDGALGFFLPLPEKNYRRLL-MLQNVMVTHTSHT--------GGLNPRAFRTYKGKG 201
            Y  + G LG  LPL  +N  + L  LQ ++  +  +          G N       K + 
Sbjct: 1145 YTGIQGTLGLLLPLSTENEVKFLGDLQLLLRKYFDYNFDDFDKDKNGYNLLGKDHLKFRS 1204

Query: 202  YYAGNPSRGIIDGSLVWKFLQLSLGERLEI 231
            YY  NP + ++DG L+ +F +LS   ++ I
Sbjct: 1205 YY--NPVKNVMDGDLIERFYELSQSMKIRI 1232


>gi|448111975|ref|XP_004201977.1| Piso0_001448 [Millerozyma farinosa CBS 7064]
 gi|359464966|emb|CCE88671.1| Piso0_001448 [Millerozyma farinosa CBS 7064]
          Length = 1249

 Score = 36.6 bits (83), Expect = 9.6,   Method: Compositional matrix adjust.
 Identities = 28/114 (24%), Positives = 52/114 (45%), Gaps = 11/114 (9%)

Query: 150  WYASLDGALGFFLPLPEKNYRRLLM-LQNVMVTHTSHT--------GGLNPRAFRTYKGK 200
            +Y  L G +G  +PL  KN  +  + L+  + ++  +          G N       K +
Sbjct: 1137 FYTGLMGTVGALIPLVTKNEVQFFIELEAELRSYFDYNFDNFDEQKNGHNLLGKDHLKHR 1196

Query: 201  GYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSS 254
             YY  NP + +IDG L+ +F ++S   ++ I  K+     DI  ++ +I   S+
Sbjct: 1197 SYY--NPVKNVIDGDLIERFSEVSYNNKIRIANKLDRTPKDIDKKISEIRNRSA 1248


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.323    0.140    0.422 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,196,213,230
Number of Sequences: 23463169
Number of extensions: 178515530
Number of successful extensions: 290714
Number of sequences better than 100.0: 583
Number of HSP's better than 100.0 without gapping: 298
Number of HSP's successfully gapped in prelim test: 285
Number of HSP's that attempted gapping in prelim test: 289715
Number of HSP's gapped (non-prelim): 728
length of query: 256
length of database: 8,064,228,071
effective HSP length: 139
effective length of query: 117
effective length of database: 9,097,814,876
effective search space: 1064444340492
effective search space used: 1064444340492
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 75 (33.5 bits)