BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy13880
(256 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|383863556|ref|XP_003707246.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Megachile rotundata]
Length = 1415
Score = 230 bits (587), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 105/193 (54%), Positives = 147/193 (76%), Gaps = 4/193 (2%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P VY E++I+++++GF+++D + N+ LFMYQPE+RES GG +LI+K DFHLGQ VNTF
Sbjct: 1223 PAEVYTIEYLIDNNNLGFLVADGESNIALFMYQPESRESLGGQKLIRKADFHLGQKVNTF 1282
Query: 128 FKIRCKPS-PISDA---PGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHT 183
F+IRC+ S P +D GA R +T YASLDG+LG+ LP+PEK YRRLLMLQNV+VTH
Sbjct: 1283 FRIRCRISDPANDKKHFSGADKRHVTMYASLDGSLGYILPVPEKTYRRLLMLQNVLVTHI 1342
Query: 184 SHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDIL 243
H GLNP+A+RTYK GNP+RGIIDG LVW++L L E++++ KKIG++ +I+
Sbjct: 1343 CHIAGLNPKAYRTYKSYIRTQGNPARGIIDGDLVWRYLYLPNNEKIDVAKKIGTRVQEII 1402
Query: 244 DELYDIEALSSHF 256
++L +I+ ++HF
Sbjct: 1403 EDLTEIDRQTAHF 1415
Score = 68.6 bits (166), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 33/49 (67%), Positives = 39/49 (79%)
Query: 1 ETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGTLKLRFKKL 49
E V+E+L V+LG HGNRP+LLVR EL IYQ +R+PKG LKLRFKKL
Sbjct: 825 EMQVREILMVALGHHGNRPMLLVRLDSELQIYQTYRYPKGHLKLRFKKL 873
>gi|270003792|gb|EFA00240.1| hypothetical protein TcasGA2_TC003068 [Tribolium castaneum]
Length = 1392
Score = 230 bits (586), Expect = 5e-58, Method: Compositional matrix adjust.
Identities = 103/193 (53%), Positives = 148/193 (76%), Gaps = 4/193 (2%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P V++ E++I++++MGF++SD +KN+VL+MYQPE+RES GG RL++K DFHLGQ VN+F
Sbjct: 1200 PCEVFSVEYMIDNTTMGFLVSDSEKNLVLYMYQPESRESLGGQRLLRKADFHLGQAVNSF 1259
Query: 128 FKIRCKPSPISDA----PGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHT 183
F+I+CK + + GA R +T YA+LDG LG+ +P+PEK YRRLLMLQNV+V+
Sbjct: 1260 FRIKCKLGELGEDKKNLTGADKRHITMYATLDGGLGYIMPVPEKTYRRLLMLQNVLVSQG 1319
Query: 184 SHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDIL 243
+H GLNP+AFRTYK NP+R +IDG LV+ +LQLS+ E+LE+ KKIG+K ++L
Sbjct: 1320 AHIAGLNPKAFRTYKSWKKLQTNPARSVIDGELVYNYLQLSIPEKLEVSKKIGTKLEELL 1379
Query: 244 DELYDIEALSSHF 256
D+L DI+ +++HF
Sbjct: 1380 DDLSDIQKITNHF 1392
Score = 64.3 bits (155), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 27/47 (57%), Positives = 39/47 (82%)
Query: 4 VQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGTLKLRFKKLK 50
V+E+L V+LG HG+RPLL+VR + +L IY+ FR P+G LK+RF+K+K
Sbjct: 803 VKEILVVALGNHGSRPLLMVRLERDLYIYEVFRFPRGNLKMRFRKIK 849
>gi|91078626|ref|XP_968117.1| PREDICTED: similar to cleavage and polyadenylation specificity factor
cpsf [Tribolium castaneum]
Length = 1413
Score = 230 bits (586), Expect = 5e-58, Method: Compositional matrix adjust.
Identities = 103/193 (53%), Positives = 148/193 (76%), Gaps = 4/193 (2%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P V++ E++I++++MGF++SD +KN+VL+MYQPE+RES GG RL++K DFHLGQ VN+F
Sbjct: 1221 PCEVFSVEYMIDNTTMGFLVSDSEKNLVLYMYQPESRESLGGQRLLRKADFHLGQAVNSF 1280
Query: 128 FKIRCKPSPISDA----PGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHT 183
F+I+CK + + GA R +T YA+LDG LG+ +P+PEK YRRLLMLQNV+V+
Sbjct: 1281 FRIKCKLGELGEDKKNLTGADKRHITMYATLDGGLGYIMPVPEKTYRRLLMLQNVLVSQG 1340
Query: 184 SHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDIL 243
+H GLNP+AFRTYK NP+R +IDG LV+ +LQLS+ E+LE+ KKIG+K ++L
Sbjct: 1341 AHIAGLNPKAFRTYKSWKKLQTNPARSVIDGELVYNYLQLSIPEKLEVSKKIGTKLEELL 1400
Query: 244 DELYDIEALSSHF 256
D+L DI+ +++HF
Sbjct: 1401 DDLSDIQKITNHF 1413
Score = 64.3 bits (155), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 27/47 (57%), Positives = 39/47 (82%)
Query: 4 VQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGTLKLRFKKLK 50
V+E+L V+LG HG+RPLL+VR + +L IY+ FR P+G LK+RF+K+K
Sbjct: 824 VKEILVVALGNHGSRPLLMVRLERDLYIYEVFRFPRGNLKMRFRKIK 870
>gi|350413821|ref|XP_003490124.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Bombus impatiens]
Length = 1417
Score = 229 bits (584), Expect = 8e-58, Method: Compositional matrix adjust.
Identities = 104/193 (53%), Positives = 147/193 (76%), Gaps = 4/193 (2%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P VY E++I+++++GF+++D + N+ LFMYQPE+RES GG +LI+K DFHLGQ VNTF
Sbjct: 1225 PAEVYTIEYLIDNTNLGFLVADGESNMALFMYQPESRESLGGQKLIRKADFHLGQKVNTF 1284
Query: 128 FKIRCKPS-PISDA---PGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHT 183
F+I+C+ S P +D GA R +T YASLDG+LG+ LP+PEK YRRLLMLQNV+VTH
Sbjct: 1285 FRIKCRVSDPANDKKHFSGADKRHVTMYASLDGSLGYILPVPEKTYRRLLMLQNVLVTHI 1344
Query: 184 SHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDIL 243
H GLNP+A+RTYK GNP+RGIIDG LVW++L L E++++ KKIG++ +I+
Sbjct: 1345 CHIAGLNPKAYRTYKSHIRTQGNPARGIIDGDLVWRYLYLPNNEKIDVAKKIGTRVQEII 1404
Query: 244 DELYDIEALSSHF 256
++L +I+ ++HF
Sbjct: 1405 EDLTEIDRQTAHF 1417
Score = 70.1 bits (170), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 34/49 (69%), Positives = 40/49 (81%)
Query: 1 ETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGTLKLRFKKL 49
E V+E+L V+LG HGNRP+LLVR EL IYQA+R+PKG LKLRFKKL
Sbjct: 827 EMQVREILMVALGHHGNRPMLLVRLDSELQIYQAYRYPKGHLKLRFKKL 875
>gi|340710064|ref|XP_003393618.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Bombus terrestris]
Length = 1417
Score = 229 bits (583), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 104/193 (53%), Positives = 146/193 (75%), Gaps = 4/193 (2%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P VY E++I+++++GF+++D + N+ LFMYQPE+RES GG +LI+K DFHLGQ VNTF
Sbjct: 1225 PAEVYTIEYLIDNTNLGFLVADGESNMALFMYQPESRESLGGQKLIRKADFHLGQKVNTF 1284
Query: 128 FKIRCKPS-PISDA---PGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHT 183
F+IRC+ S P +D GA R +T YASLDG+LG+ LP+PEK YRRLLMLQNV+VTH
Sbjct: 1285 FRIRCRLSDPANDKKHFSGADKRHVTMYASLDGSLGYILPVPEKTYRRLLMLQNVLVTHI 1344
Query: 184 SHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDIL 243
H GLNP+A+RTYK GNP+RGIIDG LVW++ L E++++ KKIG++ +I+
Sbjct: 1345 CHIAGLNPKAYRTYKSHIRTQGNPARGIIDGDLVWRYFYLPNNEKIDVAKKIGTRVQEII 1404
Query: 244 DELYDIEALSSHF 256
++L +I+ ++HF
Sbjct: 1405 EDLTEIDRQTAHF 1417
Score = 70.1 bits (170), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 34/49 (69%), Positives = 40/49 (81%)
Query: 1 ETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGTLKLRFKKL 49
E V+E+L V+LG HGNRP+LLVR EL IYQA+R+PKG LKLRFKKL
Sbjct: 827 EMQVREILMVALGHHGNRPMLLVRLDSELQIYQAYRYPKGHLKLRFKKL 875
>gi|110750698|ref|XP_624382.2| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
[Apis mellifera]
Length = 1415
Score = 227 bits (579), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 104/193 (53%), Positives = 145/193 (75%), Gaps = 4/193 (2%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P VY E++I+++++GF+++D + N+ LFMYQPE+RES GG +LI+K DFHLGQ VNTF
Sbjct: 1223 PAEVYTIEYLIDNTNLGFLVADGESNIALFMYQPESRESLGGQKLIRKADFHLGQKVNTF 1282
Query: 128 FKIRCKPS-PISDA---PGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHT 183
F+IRC+ S P +D A R +T YASLDG LG+ LP+PEK YRRLLMLQNV+VTH
Sbjct: 1283 FRIRCRISDPANDKKHFSDADKRHVTMYASLDGNLGYILPVPEKTYRRLLMLQNVLVTHI 1342
Query: 184 SHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDIL 243
H GLNP+A+RTYK GNP+RGIIDG LVW++L L E++++ KKIG++ +I+
Sbjct: 1343 CHIAGLNPKAYRTYKSHIRTQGNPARGIIDGDLVWRYLYLPNNEKIDVAKKIGTRVQEII 1402
Query: 244 DELYDIEALSSHF 256
++L +I+ ++HF
Sbjct: 1403 EDLTEIDRQTAHF 1415
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 34/49 (69%), Positives = 40/49 (81%)
Query: 1 ETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGTLKLRFKKL 49
E V+E+L V+LG HGNRP+LLVR EL IYQA+R+PKG LKLRFKKL
Sbjct: 825 EMQVREILMVALGHHGNRPMLLVRLDSELQIYQAYRYPKGHLKLRFKKL 873
>gi|307191845|gb|EFN75271.1| Cleavage and polyadenylation specificity factor subunit 1
[Harpegnathos saltator]
Length = 1214
Score = 223 bits (569), Expect = 5e-56, Method: Compositional matrix adjust.
Identities = 105/218 (48%), Positives = 154/218 (70%), Gaps = 12/218 (5%)
Query: 44 LRFK-KLKVLFVSDRSKRANEQPGLPRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPE 102
LRF+ K + L + R R P VY E++I+++++GF+I+D + N+ LFMYQPE
Sbjct: 1004 LRFQEKCRTLSLVSRDFR-------PAEVYTIEYLIDNTNLGFLIADGESNLALFMYQPE 1056
Query: 103 ARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDA----PGARSRFLTWYASLDGAL 158
+RES GG +LI+K DFHLGQ +NTFF+I+C+ + ++ A + +T YASLDG+L
Sbjct: 1057 SRESLGGQKLIRKADFHLGQKINTFFRIKCRVTDVASDKKHFSDADKKHVTMYASLDGSL 1116
Query: 159 GFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVW 218
G+ LP+PEK YRRLLMLQNV+VTH H GLNP+A+RTYK GNP+RGIIDG LVW
Sbjct: 1117 GYVLPVPEKTYRRLLMLQNVLVTHICHIAGLNPKAYRTYKSYVRNQGNPARGIIDGDLVW 1176
Query: 219 KFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 256
++L L E+ ++ KKIG++ +I++++ +I+ ++HF
Sbjct: 1177 RYLSLPNNEKADVAKKIGTRVQEIIEDITEIDRQTAHF 1214
Score = 68.6 bits (166), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 35/68 (51%), Positives = 46/68 (67%)
Query: 1 ETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGTLKLRFKKLKVLFVSDRSKR 60
E V+E+L V+LG HGNRP+LLVR EL IYQA+++PKG LKLRFKKL + R
Sbjct: 623 ELQVREVLMVALGHHGNRPMLLVRLDSELQIYQAYKYPKGHLKLRFKKLDHGIIPGHLSR 682
Query: 61 ANEQPGLP 68
++ +P
Sbjct: 683 KPKEEDVP 690
>gi|345482082|ref|XP_001607052.2| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Nasonia vitripennis]
Length = 1415
Score = 219 bits (559), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 106/218 (48%), Positives = 150/218 (68%), Gaps = 12/218 (5%)
Query: 44 LRFK-KLKVLFVSDRSKRANEQPGLPRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPE 102
LRF+ + K L + R R E +YA E+ I+++ +GF+++D + N+ +F YQPE
Sbjct: 1205 LRFQPEYKTLSLVSRDFRTTE-------IYAIEYFIQNNELGFIVADGESNISIFSYQPE 1257
Query: 103 ARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPS----PISDAPGARSRFLTWYASLDGAL 158
+ +S GG +LI+K D HLGQ +NTFF+I+CK + P GA R +T YA+LDG+L
Sbjct: 1258 SSQSLGGQKLIRKADIHLGQKINTFFRIKCKTTDSANPTKQFSGADKRHVTMYATLDGSL 1317
Query: 159 GFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVW 218
G+ LP+PEK YRRLLMLQNV+V+H H GLNP+AFRTYK GNP+RGIIDG LV
Sbjct: 1318 GYILPVPEKTYRRLLMLQNVLVSHIYHIAGLNPKAFRTYKSCVRMQGNPARGIIDGDLVR 1377
Query: 219 KFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 256
K+L L + E++EI KKIG+ +I+D++++I +SHF
Sbjct: 1378 KYLDLPVNEKIEIAKKIGTGAQEIMDDMHEIYKQTSHF 1415
Score = 68.9 bits (167), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 34/68 (50%), Positives = 46/68 (67%)
Query: 1 ETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGTLKLRFKKLKVLFVSDRSKR 60
E V+E+ V+LG HGNRP+LLVR EL IYQ +R+PKG LKLRFKK+ F+ S+
Sbjct: 825 EVQVREIAVVALGHHGNRPMLLVRLDSELQIYQVYRYPKGHLKLRFKKIDHNFIVGFSRI 884
Query: 61 ANEQPGLP 68
++ +P
Sbjct: 885 GPKEEDMP 892
>gi|307190910|gb|EFN74734.1| Cleavage and polyadenylation specificity factor subunit 1 [Camponotus
floridanus]
Length = 1418
Score = 219 bits (558), Expect = 8e-55, Method: Compositional matrix adjust.
Identities = 102/194 (52%), Positives = 145/194 (74%), Gaps = 5/194 (2%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P VY E++I+++++GF ++D + N+ LFMYQPE+RES GG +LI+K DFHLGQ VNTF
Sbjct: 1225 PAEVYTIEYLIDNTNLGFFLADGESNLALFMYQPESRESLGGQKLIRKADFHLGQKVNTF 1284
Query: 128 FKIRCKPS-PISDA---PGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHT 183
F+IRC+ S P +D GA R +T YA+LDG+LG+ LP+PEK YRRLLMLQNV+VTH
Sbjct: 1285 FRIRCRVSDPANDKKQFSGADKRHVTMYATLDGSLGYILPVPEKTYRRLLMLQNVLVTHI 1344
Query: 184 SHTGGLNPRAFR-TYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
H GLNP+++R TYK GNP+RGIIDG LVW++L L E+ ++ KKIG++ +I
Sbjct: 1345 CHIAGLNPKSYRQTYKSYIRNQGNPARGIIDGDLVWRYLFLPNNEKTDVAKKIGTRVQEI 1404
Query: 243 LDELYDIEALSSHF 256
++++ +I+ ++HF
Sbjct: 1405 IEDITEIDRQTAHF 1418
Score = 69.3 bits (168), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 33/50 (66%), Positives = 41/50 (82%)
Query: 1 ETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGTLKLRFKKLK 50
E V+E+L V+LG HGNRP+LLVR EL IYQA+++PKG LKLRFKKL+
Sbjct: 826 EMQVREILMVALGHHGNRPMLLVRLDSELQIYQAYKYPKGYLKLRFKKLE 875
>gi|242021233|ref|XP_002431050.1| Cleavage and polyadenylation specificity factor 160 kDa subunit,
putative [Pediculus humanus corporis]
gi|212516279|gb|EEB18312.1| Cleavage and polyadenylation specificity factor 160 kDa subunit,
putative [Pediculus humanus corporis]
Length = 1409
Score = 213 bits (542), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 95/193 (49%), Positives = 141/193 (73%), Gaps = 4/193 (2%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P VYA E +++++ MGF+ISD + N++++MY+PE R+S GG +L++K DFHLGQH+N++
Sbjct: 1217 PCEVYAIELLLDNTQMGFLISDVEMNIIMYMYKPEDRDSVGGQKLLRKADFHLGQHINSW 1276
Query: 128 FKIRCK---PSPISDAP-GARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHT 183
F+IRC+ + D P GA R ++ +A+LDGALG+ LP+PEK YRRL MLQN++V H
Sbjct: 1277 FRIRCRLGDQAENYDFPIGAEKRHISMFATLDGALGYLLPIPEKTYRRLQMLQNILVYHI 1336
Query: 184 SHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDIL 243
H GLNP+AFR YK GNP + I+DG L+W +L L++ E+ ++ KK+GSK +DI+
Sbjct: 1337 PHLAGLNPKAFRIYKSGRKLLGNPCKRIVDGELIWMYLSLTVMEKQDVAKKMGSKMDDII 1396
Query: 244 DELYDIEALSSHF 256
+++ IE LS HF
Sbjct: 1397 EDIAVIERLSGHF 1409
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 29/59 (49%), Positives = 44/59 (74%), Gaps = 2/59 (3%)
Query: 4 VQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKG-TLKLRFKKL-KVLFVSDRSKR 60
+ ELL VSLG G RP+LL+RT+++L+IYQAF+ KG LK+RF++L + L + +R +
Sbjct: 821 IHELLVVSLGHLGRRPILLLRTENDLMIYQAFKFAKGPNLKIRFRRLPQTLILKERKAK 879
>gi|380014171|ref|XP_003691113.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Apis florea]
Length = 1583
Score = 213 bits (541), Expect = 8e-53, Method: Compositional matrix adjust.
Identities = 98/173 (56%), Positives = 129/173 (74%), Gaps = 4/173 (2%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P VY E++I+++++GF+++D + N+ LFMYQPE+RES GG +LI+K DFHLGQ VNTF
Sbjct: 1223 PAEVYTIEYLIDNTNLGFLVADGESNIALFMYQPESRESLGGQKLIRKADFHLGQKVNTF 1282
Query: 128 FKIRCKPS-PISDA---PGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHT 183
F+IRC+ S P +D A R +T YASLDG LG+ LP+PEK YRRLLMLQNV+VTH
Sbjct: 1283 FRIRCRISDPANDKKHFSDADKRHVTMYASLDGNLGYILPVPEKTYRRLLMLQNVLVTHI 1342
Query: 184 SHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 236
H GLNP+A+RTYK GNP+RGIIDG LVW++L L E++++ KKI
Sbjct: 1343 CHIAGLNPKAYRTYKSHIRTQGNPARGIIDGDLVWRYLYLPNNEKIDVAKKIA 1395
Score = 70.1 bits (170), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 34/49 (69%), Positives = 40/49 (81%)
Query: 1 ETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGTLKLRFKKL 49
E V+E+L V+LG HGNRP+LLVR EL IYQA+R+PKG LKLRFKKL
Sbjct: 825 EMQVREILMVALGHHGNRPMLLVRLDSELQIYQAYRYPKGHLKLRFKKL 873
>gi|322792443|gb|EFZ16427.1| hypothetical protein SINV_15375 [Solenopsis invicta]
Length = 1532
Score = 209 bits (531), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 98/176 (55%), Positives = 133/176 (75%), Gaps = 5/176 (2%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P VY E++I+++++GF+++D + N+ LFMYQPE+RES GG +LI+K DFHLGQ VNTF
Sbjct: 1192 PAEVYTIEYLIDNTNLGFIVADGESNLALFMYQPESRESLGGQKLIRKADFHLGQKVNTF 1251
Query: 128 FKIRCKPS-PISDA---PGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHT 183
F+IRC+ + P +D GA R +T YASLDG+LG+ LP+PEK YRRLLMLQNV+VTH
Sbjct: 1252 FRIRCRVTDPANDKKQFSGADKRHVTMYASLDGSLGYILPVPEKTYRRLLMLQNVLVTHI 1311
Query: 184 SHTGGLNPRAFR-TYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 238
H GLNP+++R TYK GNP+RGIIDG LVW++L L E+ ++ KKIG++
Sbjct: 1312 CHIAGLNPKSYRHTYKSYIRNQGNPARGIIDGDLVWRYLFLPNNEKADLAKKIGTR 1367
Score = 69.3 bits (168), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 35/68 (51%), Positives = 45/68 (66%)
Query: 4 VQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGTLKLRFKKLKVLFVSDRSKRANE 63
V+E+L V+LG HGNRP+LLVR EL IYQ +R+PKG LKLRFKKL + R +
Sbjct: 796 VREILMVALGHHGNRPMLLVRLDSELQIYQVYRYPKGYLKLRFKKLDHGIIPGRLSPRPK 855
Query: 64 QPGLPRGV 71
+ +PR
Sbjct: 856 EEDVPRNT 863
>gi|332018184|gb|EGI58789.1| Cleavage and polyadenylation specificity factor subunit 1 [Acromyrmex
echinatior]
Length = 1412
Score = 209 bits (531), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 98/176 (55%), Positives = 132/176 (75%), Gaps = 5/176 (2%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P VY E++I++S++GF+++D + N+ LFMYQPE+RES GG +LI+K DFHLGQ +NTF
Sbjct: 1219 PAEVYTIEYLIDNSNLGFIVADGESNLALFMYQPESRESLGGQKLIRKADFHLGQKINTF 1278
Query: 128 FKIRCK-PSPISDA---PGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHT 183
F+I+C+ P +D GA R +T YASLDG+LG+ LP+PEK YRRLLMLQNV+VTH
Sbjct: 1279 FRIKCRITDPANDKKQFSGADKRHVTMYASLDGSLGYILPVPEKTYRRLLMLQNVLVTHI 1338
Query: 184 SHTGGLNPRAFR-TYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 238
H GLNP+A+R TYK GNP+RGIIDG LVW++L L E+ ++ KKIG++
Sbjct: 1339 CHIAGLNPKAYRHTYKSYVRNQGNPARGIIDGDLVWRYLFLPNNEKADLAKKIGTR 1394
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 35/67 (52%), Positives = 46/67 (68%)
Query: 4 VQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGTLKLRFKKLKVLFVSDRSKRANE 63
V+E+L V+LG HGNRP+LLVR +L IYQA+R+PKG LKLRFKKL + R +
Sbjct: 827 VREILMVALGHHGNRPMLLVRLDSDLQIYQAYRYPKGYLKLRFKKLDHGIIPGRLSPRPK 886
Query: 64 QPGLPRG 70
+ +PR
Sbjct: 887 EEDVPRN 893
>gi|427795803|gb|JAA63353.1| Putative mrna cleavage and polyadenylation factor ii complex
subunit cft1 cpsf subunit, partial [Rhipicephalus
pulchellus]
Length = 726
Score = 201 bits (512), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 92/194 (47%), Positives = 134/194 (69%), Gaps = 5/194 (2%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P VYA EF I+++ M F+++D ++N++L+MYQPE+RES GG RL+++ DFH+G V +
Sbjct: 533 PLEVYAVEFFIDNTQMSFLVTDAERNLLLYMYQPESRESCGGQRLLRRGDFHVGSPVVSM 592
Query: 128 FKIRCKPSPISD-----APGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
F+I+C+ I+ A R +T A+LDG+L + LP+PEK YRRLLMLQNV+VT+
Sbjct: 593 FRIKCRMGDIAKYDRRAASIVDGRHITMMATLDGSLAYVLPVPEKTYRRLLMLQNVLVTN 652
Query: 183 TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
H GLNP+A+R Y + + GNP + I+DG L+WKF+ LS ER E+ KKIG+ I
Sbjct: 653 IPHYAGLNPKAYRMYYSQRRFLGNPHKNILDGELIWKFMHLSFMERSELSKKIGTTVTQI 712
Query: 243 LDELYDIEALSSHF 256
D+L +IE ++HF
Sbjct: 713 TDDLLEIETYTAHF 726
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 26/51 (50%), Positives = 34/51 (66%), Gaps = 4/51 (7%)
Query: 3 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAF----RHPKGTLKLRFKKL 49
+V E+L V LG+ +RPLLL R +LLIY+AF +G LKLRFKK+
Sbjct: 131 VVHEILVVGLGIRHSRPLLLARVDEDLLIYEAFPFYETQREGHLKLRFKKM 181
>gi|427780291|gb|JAA55597.1| Putative mrna cleavage and polyadenylation factor ii complex subunit
cft1 cpsf subunit [Rhipicephalus pulchellus]
Length = 1237
Score = 201 bits (512), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 92/194 (47%), Positives = 134/194 (69%), Gaps = 5/194 (2%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P VYA EF I+++ M F+++D ++N++L+MYQPE+RES GG RL+++ DFH+G V +
Sbjct: 1044 PLEVYAVEFFIDNTQMSFLVTDAERNLLLYMYQPESRESCGGQRLLRRGDFHVGSPVVSM 1103
Query: 128 FKIRCKPSPISD-----APGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
F+I+C+ I+ A R +T A+LDG+L + LP+PEK YRRLLMLQNV+VT+
Sbjct: 1104 FRIKCRMGDIAKYDRRAASIVDGRHITMMATLDGSLAYVLPVPEKTYRRLLMLQNVLVTN 1163
Query: 183 TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
H GLNP+A+R Y + + GNP + I+DG L+WKF+ LS ER E+ KKIG+ I
Sbjct: 1164 IPHYAGLNPKAYRMYYSQRRFLGNPHKNILDGELIWKFMHLSFMERSELSKKIGTTVTQI 1223
Query: 243 LDELYDIEALSSHF 256
D+L +IE ++HF
Sbjct: 1224 TDDLLEIETYTAHF 1237
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 26/51 (50%), Positives = 34/51 (66%), Gaps = 4/51 (7%)
Query: 3 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAF----RHPKGTLKLRFKKL 49
+V E+L V LG+ +RPLLL R +LLIY+AF +G LKLRFKK+
Sbjct: 642 VVHEILVVGLGIRHSRPLLLARVDEDLLIYEAFPFYETQREGHLKLRFKKM 692
>gi|405977622|gb|EKC42064.1| Cleavage and polyadenylation specificity factor subunit 1
[Crassostrea gigas]
Length = 1369
Score = 198 bits (504), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 96/211 (45%), Positives = 143/211 (67%), Gaps = 11/211 (5%)
Query: 50 KVLFVSDRSKRANEQPGLPRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGG 109
KVL + R R P VY +F+I+++ + ++SD+ KN+V++ YQPEARES+GG
Sbjct: 1166 KVLSIVSRDPR-------PLEVYTADFLIDNTQLCCLVSDRMKNLVVYSYQPEARESHGG 1218
Query: 110 HRLIKKTDFHLGQHVNTFFKIRCK---PSPISDAPGA-RSRFLTWYASLDGALGFFLPLP 165
RLI+K DF+ G +V++ F++RCK PS GA R +T++A+LDG+LGF LPL
Sbjct: 1219 QRLIRKADFNAGSNVSSMFRVRCKLYDPSSDKRMTGAPEKRHITYFATLDGSLGFVLPLS 1278
Query: 166 EKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSL 225
EK YRRL MLQN +VTH H GLNPR++R G NP + I+DG L+WK+ LS+
Sbjct: 1279 EKVYRRLFMLQNALVTHIPHVAGLNPRSYRHVIGTFPELRNPQKNILDGELLWKYTNLSI 1338
Query: 226 GERLEICKKIGSKHNDILDELYDIEALSSHF 256
E++EI K++G+ ++ I+D+L +I+ L++HF
Sbjct: 1339 MEKIEIAKRLGTSNDQIMDDLMEIDRLTAHF 1369
Score = 47.8 bits (112), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 24/51 (47%), Positives = 35/51 (68%), Gaps = 4/51 (7%)
Query: 4 VQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGT----LKLRFKKLK 50
++ELL V LG +RP LL R + +L IY+AF +P+ + LKLRFKK++
Sbjct: 776 LKELLMVGLGYKDSRPHLLARVEDDLYIYEAFSYPQSSIDNHLKLRFKKIQ 826
>gi|157110889|ref|XP_001651294.1| cleavage and polyadenylation specificity factor cpsf [Aedes aegypti]
gi|108883895|gb|EAT48120.1| AAEL000832-PA [Aedes aegypti]
Length = 1417
Score = 197 bits (500), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 85/191 (44%), Positives = 134/191 (70%), Gaps = 2/191 (1%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P V+ E+V+++ ++GF++SD+ N++ +MYQPE+RES GG RL++K D+H+GQ +N+
Sbjct: 1227 PLNVFQIEYVVDNHNLGFLVSDEQCNIITYMYQPESRESFGGQRLLRKCDYHVGQKINSM 1286
Query: 128 FKIRCKPSPISDAPGA--RSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSH 185
F+++C + + + T++A+LDG +G+ LPLPEK YRRL MLQNV++TH+ H
Sbjct: 1287 FRVQCDFHEMDYKRNSNYECKHTTYFATLDGGIGYVLPLPEKTYRRLFMLQNVLMTHSPH 1346
Query: 186 TGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDE 245
GLNP+AFRT K NP+R ++DG L+W FL L E+LE+ KKIG++ +DI +
Sbjct: 1347 LCGLNPKAFRTIKTVKKLPINPARCVVDGDLIWTFLTLPANEKLEVAKKIGTRIDDICAD 1406
Query: 246 LYDIEALSSHF 256
L +IE+++ F
Sbjct: 1407 LMEIESVTHVF 1417
Score = 59.3 bits (142), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 38/129 (29%), Positives = 66/129 (51%), Gaps = 21/129 (16%)
Query: 5 QELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGTLKLRFKKL------KVLFVSDRS 58
+E+L V+LG HG RP+L VR +++LL+Y+ +R+ KG LKLRF+++ + ++ R
Sbjct: 821 KEILMVALGHHGTRPMLFVRLENDLLVYRVYRYSKGHLKLRFRRVPSGVTGPIFKIAPRQ 880
Query: 59 KRANEQPGLPRGVYACEFVIESSSM-----------GFMISDKDKNVVLFMYQPEARESN 107
+Q G ++ + + E+ SM G + + ++L + E R
Sbjct: 881 SAPTDQEGEKPDEHSTKIMYENISMIRYFNNVNGYNGVAVCGEKPYIMLLTSRGELR--- 937
Query: 108 GGHRLIKKT 116
HRL KT
Sbjct: 938 -AHRLYAKT 945
>gi|260835071|ref|XP_002612533.1| hypothetical protein BRAFLDRAFT_120973 [Branchiostoma floridae]
gi|229297910|gb|EEN68542.1| hypothetical protein BRAFLDRAFT_120973 [Branchiostoma floridae]
Length = 1003
Score = 197 bits (500), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 95/194 (48%), Positives = 128/194 (65%), Gaps = 5/194 (2%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P Y EF ++++ +GF++SD KN +L+ YQPEARES GG RL+++ DF++G HVNTF
Sbjct: 810 PLETYCVEFFVDNAQIGFLVSDAQKNFLLYSYQPEARESYGGQRLVRRADFNVGSHVNTF 869
Query: 128 FKIRCKPSPIS-----DAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
F++RCK S DA R +T +A+LDG LG LP+ EK YRRLLMLQN ++TH
Sbjct: 870 FRVRCKIMDPSGERRRDADTVAKRHVTMFATLDGGLGALLPMAEKTYRRLLMLQNTLMTH 929
Query: 183 TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
GLNP+AFR K N R I+DG L+WKFL LS+ ER E+ +KIG+ I
Sbjct: 930 MPFPAGLNPKAFRMLKHNHRSLINACRNILDGELLWKFLHLSVVERSELARKIGTSPETI 989
Query: 243 LDELYDIEALSSHF 256
++L DI+ LS+HF
Sbjct: 990 TEDLMDIDRLSAHF 1003
Score = 45.4 bits (106), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 31/70 (44%), Positives = 45/70 (64%), Gaps = 9/70 (12%)
Query: 4 VQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRH---PKGT-LKLRFKKLK-VLFVSDR- 57
V+E+L V LG G+RP LL R +LLIY+AF + P T LK+RFKK++ L + +R
Sbjct: 436 VKEILMVGLGHKGSRPHLLARVDEDLLIYEAFPYHLSPSYTMLKIRFKKVQHNLILRERK 495
Query: 58 ---SKRANEQ 64
+K+A +Q
Sbjct: 496 GGKTKKAGDQ 505
>gi|158287218|ref|XP_309311.4| AGAP011340-PA [Anopheles gambiae str. PEST]
gi|157019545|gb|EAA05261.4| AGAP011340-PA [Anopheles gambiae str. PEST]
Length = 1434
Score = 196 bits (498), Expect = 9e-48, Method: Compositional matrix adjust.
Identities = 87/189 (46%), Positives = 133/189 (70%), Gaps = 3/189 (1%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P V+ E+V++++++GF++SD N++ +MYQPE+RES GG RL++K+D+HLGQ VN
Sbjct: 1243 PLNVFQVEYVVDNANLGFLVSDDQCNLITYMYQPESRESFGGQRLLRKSDYHLGQQVNCM 1302
Query: 128 FKIRC---KPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTS 184
F+++C + + ++ T++A+LDG +GF LPLPEK YRRL MLQNV++TH+
Sbjct: 1303 FRVQCDFHETDVMKRTLNYDNKHTTFFATLDGGIGFVLPLPEKTYRRLFMLQNVLLTHSP 1362
Query: 185 HTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILD 244
HT GLNP+A+RT K NPSR ++DG LVW FL+L E+ E+ KKIG++ +I
Sbjct: 1363 HTCGLNPKAYRTIKQTRKLPINPSRCVVDGDLVWSFLELPANEKHEVAKKIGTRIEEICA 1422
Query: 245 ELYDIEALS 253
+L +IE ++
Sbjct: 1423 DLMEIEHVT 1431
Score = 61.6 bits (148), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 27/45 (60%), Positives = 38/45 (84%)
Query: 5 QELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGTLKLRFKKL 49
+E+L V+LG +G+RPLL +R +H+LLIY+ FR+ KG LKLRFK+L
Sbjct: 836 KEILMVALGSYGSRPLLFIRLEHDLLIYRVFRYSKGHLKLRFKRL 880
>gi|312380158|gb|EFR26239.1| hypothetical protein AND_07834 [Anopheles darlingi]
Length = 1503
Score = 195 bits (495), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 87/186 (46%), Positives = 129/186 (69%), Gaps = 3/186 (1%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P VY E+V++++++GF+++D N++ +MYQPE+RES GG RL++K D+HLGQ VN
Sbjct: 1264 PLNVYQVEYVVDNTNLGFLVADDQANLITYMYQPESRESFGGQRLLRKGDYHLGQRVNAM 1323
Query: 128 FKIRC---KPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTS 184
F+++C + + ++ T++A+LDG GF LPLPEK YRRL MLQNV++TH+
Sbjct: 1324 FRVQCDFHESDVMRRTLNYDNKHTTFFATLDGGFGFVLPLPEKTYRRLFMLQNVLLTHSP 1383
Query: 185 HTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILD 244
HT GLNP+A+RT K NPSR ++DG LVW FL+L E+ E+ KKIG++ +I
Sbjct: 1384 HTCGLNPKAYRTIKQSRALPINPSRCVVDGDLVWSFLELPANEKQEVAKKIGTRIEEICA 1443
Query: 245 ELYDIE 250
+L +IE
Sbjct: 1444 DLMEIE 1449
Score = 58.2 bits (139), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 25/45 (55%), Positives = 37/45 (82%)
Query: 5 QELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGTLKLRFKKL 49
+E+L V+LG +G+RP+L +R + +LLIY+ FR+ KG LKLRFK+L
Sbjct: 845 KEILMVALGSYGSRPILFIRLEQDLLIYRVFRYAKGHLKLRFKRL 889
>gi|195056749|ref|XP_001995154.1| GH22991 [Drosophila grimshawi]
gi|193899360|gb|EDV98226.1| GH22991 [Drosophila grimshawi]
Length = 1426
Score = 195 bits (495), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 90/191 (47%), Positives = 133/191 (69%), Gaps = 2/191 (1%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P V+ EF++++S++GF+++D ++N++++MYQPEARES GG +L++K D+HLGQ VNT
Sbjct: 1236 PMEVFGIEFMVDNSNLGFLVTDAERNLIVYMYQPEARESLGGQKLLRKADYHLGQVVNTM 1295
Query: 128 FKIRCKPSPISDAPG--ARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSH 185
F+++C + ++ L Y SLDGALG+ LPLPEK YRR LMLQNV++++ H
Sbjct: 1296 FRVQCHQRGLHQRQPFLYENKHLVIYGSLDGALGYCLPLPEKVYRRFLMLQNVLLSYQDH 1355
Query: 186 TGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDE 245
GLNP+ +RT K NPSR IIDG L+W F L+ ER E+ KKIG++ +IL +
Sbjct: 1356 LCGLNPKEYRTIKSVKKLGINPSRCIIDGDLIWSFRMLAHSERNEVAKKIGTRTEEILAD 1415
Query: 246 LYDIEALSSHF 256
L +IE +S+ F
Sbjct: 1416 LLEIERISAVF 1426
Score = 64.3 bits (155), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 32/47 (68%), Positives = 37/47 (78%)
Query: 6 ELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGTLKLRFKKLKVL 52
EL V LG HG RPLLLVRT+ ELLIYQ FR+ KG LK+RF+KL+ L
Sbjct: 836 ELCLVGLGQHGERPLLLVRTRLELLIYQVFRYAKGHLKIRFRKLEQL 882
>gi|45552619|ref|NP_995833.1| cleavage and polyadenylation specificity factor 160, isoform A
[Drosophila melanogaster]
gi|18203551|sp|Q9V726.1|CPSF1_DROME RecName: Full=Cleavage and polyadenylation specificity factor subunit
1; AltName: Full=Cleavage and polyadenylation specificity
factor 160 kDa subunit; Short=CPSF 160 kDa subunit;
Short=dCPSF 160
gi|7303176|gb|AAF58240.1| cleavage and polyadenylation specificity factor 160, isoform A
[Drosophila melanogaster]
Length = 1455
Score = 193 bits (490), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 91/195 (46%), Positives = 133/195 (68%), Gaps = 10/195 (5%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P VY EF++++S++GF+++D ++N++++MYQPEARES GG +L++K D+HLGQ VNT
Sbjct: 1265 PLEVYGIEFMVDNSNLGFLVTDAERNIIVYMYQPEARESLGGQKLLRKADYHLGQVVNTM 1324
Query: 128 FKIRCKPSPISDAPGARSRFL------TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVT 181
F+++C + R FL Y +LDGALG+ LPLPEK YRR LMLQNV+++
Sbjct: 1325 FRVQCHQKGLHQ----RQPFLYENKHFVVYGTLDGALGYCLPLPEKVYRRFLMLQNVLLS 1380
Query: 182 HTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHND 241
+ H GLNP+ +RT K NPSR IIDG L+W + ++ ER E+ KKIG++ +
Sbjct: 1381 YQEHLCGLNPKEYRTLKSSKKQGINPSRCIIDGDLIWSYRLMANSERNEVAKKIGTRTEE 1440
Query: 242 ILDELYDIEALSSHF 256
IL +L +IE L+S F
Sbjct: 1441 ILGDLLEIERLASVF 1455
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 32/52 (61%), Positives = 41/52 (78%)
Query: 6 ELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGTLKLRFKKLKVLFVSDR 57
EL + LGL+G RPLLLVRT+ ELLIYQ FR+PKG LK+RF+K+ L + D+
Sbjct: 864 ELSVIGLGLNGERPLLLVRTRVELLIYQVFRYPKGHLKIRFRKMDQLNLLDQ 915
>gi|24653655|ref|NP_725397.1| cleavage and polyadenylation specificity factor 160, isoform B
[Drosophila melanogaster]
gi|15292103|gb|AAK93320.1| LD38533p [Drosophila melanogaster]
gi|21627189|gb|AAM68553.1| cleavage and polyadenylation specificity factor 160, isoform B
[Drosophila melanogaster]
Length = 1420
Score = 192 bits (489), Expect = 8e-47, Method: Compositional matrix adjust.
Identities = 91/195 (46%), Positives = 133/195 (68%), Gaps = 10/195 (5%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P VY EF++++S++GF+++D ++N++++MYQPEARES GG +L++K D+HLGQ VNT
Sbjct: 1230 PLEVYGIEFMVDNSNLGFLVTDAERNIIVYMYQPEARESLGGQKLLRKADYHLGQVVNTM 1289
Query: 128 FKIRCKPSPISDAPGARSRFL------TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVT 181
F+++C + R FL Y +LDGALG+ LPLPEK YRR LMLQNV+++
Sbjct: 1290 FRVQCHQKGLHQ----RQPFLYENKHFVVYGTLDGALGYCLPLPEKVYRRFLMLQNVLLS 1345
Query: 182 HTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHND 241
+ H GLNP+ +RT K NPSR IIDG L+W + ++ ER E+ KKIG++ +
Sbjct: 1346 YQEHLCGLNPKEYRTLKSSKKQGINPSRCIIDGDLIWSYRLMANSERNEVAKKIGTRTEE 1405
Query: 242 ILDELYDIEALSSHF 256
IL +L +IE L+S F
Sbjct: 1406 ILGDLLEIERLASVF 1420
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 32/52 (61%), Positives = 41/52 (78%)
Query: 6 ELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGTLKLRFKKLKVLFVSDR 57
EL + LGL+G RPLLLVRT+ ELLIYQ FR+PKG LK+RF+K+ L + D+
Sbjct: 829 ELSVIGLGLNGERPLLLVRTRVELLIYQVFRYPKGHLKIRFRKMDQLNLLDQ 880
>gi|194756960|ref|XP_001960738.1| GF11349 [Drosophila ananassae]
gi|190622036|gb|EDV37560.1| GF11349 [Drosophila ananassae]
Length = 1455
Score = 192 bits (489), Expect = 9e-47, Method: Compositional matrix adjust.
Identities = 92/195 (47%), Positives = 133/195 (68%), Gaps = 10/195 (5%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P VY EF++++S++GF+++D ++N++++MYQPEARES GG +L++K D+HLGQ VNT
Sbjct: 1265 PLEVYGIEFMVDNSNLGFLVTDAERNLIVYMYQPEARESLGGQKLLRKADYHLGQVVNTM 1324
Query: 128 FKIRCKPSPISDAPGARSRFL------TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVT 181
F+++C + R FL Y +LDGALG+ LPLPEK YRR LMLQNV+++
Sbjct: 1325 FRVQCHQKGLHQ----RQPFLYENKHFVVYGTLDGALGYCLPLPEKLYRRFLMLQNVLLS 1380
Query: 182 HTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHND 241
+ H GLNP+ +RT K NPSR IIDG L+W + L+ ER E+ KKIG++ +
Sbjct: 1381 YQEHLCGLNPKEYRTIKAVKKQGINPSRCIIDGDLIWSYRLLANSERNEVAKKIGTRTEE 1440
Query: 242 ILDELYDIEALSSHF 256
IL +L +IE L+S F
Sbjct: 1441 ILSDLLEIERLASVF 1455
Score = 67.4 bits (163), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 33/51 (64%), Positives = 41/51 (80%)
Query: 6 ELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGTLKLRFKKLKVLFVSD 56
EL + LGL+G RPLLLVRT+ ELLIYQ FR+PKG LK+RF+KL+ L + D
Sbjct: 864 ELTVLGLGLNGERPLLLVRTRVELLIYQVFRYPKGHLKIRFRKLEQLNLMD 914
>gi|195122290|ref|XP_002005645.1| GI18959 [Drosophila mojavensis]
gi|193910713|gb|EDW09580.1| GI18959 [Drosophila mojavensis]
Length = 1431
Score = 192 bits (488), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 91/195 (46%), Positives = 133/195 (68%), Gaps = 10/195 (5%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P V+ EF++++S++GF+++D ++N++++MYQPEARES GG +L++K D+HLGQ VNT
Sbjct: 1241 PLEVFGIEFMVDNSNLGFLVTDAERNIIVYMYQPEARESLGGQKLLRKADYHLGQVVNTM 1300
Query: 128 FKIRCKPSPISDAPGARSRFL------TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVT 181
F+++C + R FL Y +LDGALG+ LPLPEK YRR LMLQNV+++
Sbjct: 1301 FRVQCHQRGLHQ----RQPFLYENKHFVIYGTLDGALGYCLPLPEKVYRRFLMLQNVLLS 1356
Query: 182 HTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHND 241
+ H GLNP+ +RT K NPSR IIDG L+W + L+ ER E+ KKIG++ +
Sbjct: 1357 YQDHLCGLNPKEYRTIKTVKKMGINPSRCIIDGDLIWSYRMLAHSERSEVAKKIGTRTEE 1416
Query: 242 ILDELYDIEALSSHF 256
IL +L +IE LS+ F
Sbjct: 1417 ILADLLEIERLSAIF 1431
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 33/52 (63%), Positives = 41/52 (78%)
Query: 6 ELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGTLKLRFKKLKVLFVSDR 57
EL V LG HG+RPLLLVRT+ ELLIYQ FR+ KG LK+RF+KL+ L + D+
Sbjct: 840 ELSLVGLGQHGDRPLLLVRTRLELLIYQVFRYAKGHLKIRFRKLEQLHLLDQ 891
>gi|9794908|gb|AAF98388.1| cleavage and polyadenylation specificity factor [Drosophila
melanogaster]
Length = 813
Score = 192 bits (488), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 88/191 (46%), Positives = 132/191 (69%), Gaps = 2/191 (1%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P VY EF++++S++GF+++D ++N++++MYQPEARES GG +L++K D+HLGQ VNT
Sbjct: 623 PLEVYGIEFMVDNSNLGFLVTDAERNIIVYMYQPEARESLGGQKLLRKADYHLGQVVNTM 682
Query: 128 FKIRCKPSPISDAPG--ARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSH 185
F+++C + ++ Y +LDGALG+ LPLPEK YRR LMLQNV++++ H
Sbjct: 683 FRVQCHQKGLHQRQPFLYENKHFVVYGTLDGALGYCLPLPEKVYRRFLMLQNVLLSYQEH 742
Query: 186 TGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDE 245
GLNP+ +RT K NPSR IIDG L+W + ++ ER E+ KKIG++ +IL +
Sbjct: 743 LCGLNPKEYRTLKSSKKQGINPSRCIIDGDLIWSYRLMANSERNEVAKKIGTRTEEILGD 802
Query: 246 LYDIEALSSHF 256
L +IE L+S F
Sbjct: 803 LLEIERLASVF 813
Score = 67.0 bits (162), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 33/52 (63%), Positives = 41/52 (78%)
Query: 6 ELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGTLKLRFKKLKVLFVSDR 57
EL + LGL+G RPLLLVRT+ ELLIYQ FR+PKG LK+RF+KL L + D+
Sbjct: 222 ELSVIGLGLNGERPLLLVRTRVELLIYQVFRYPKGHLKIRFRKLDQLNLLDQ 273
>gi|195334368|ref|XP_002033855.1| GM20208 [Drosophila sechellia]
gi|194125825|gb|EDW47868.1| GM20208 [Drosophila sechellia]
Length = 1455
Score = 192 bits (487), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 91/195 (46%), Positives = 133/195 (68%), Gaps = 10/195 (5%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P VY EF++++S++GF+++D ++N++++MYQPEARES GG +L++K D+HLGQ VNT
Sbjct: 1265 PLEVYGIEFMVDNSNLGFLVTDAERNLIVYMYQPEARESLGGQKLLRKADYHLGQVVNTM 1324
Query: 128 FKIRCKPSPISDAPGARSRFL------TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVT 181
F+++C + R FL Y +LDGALG+ LPLPEK YRR LMLQNV+++
Sbjct: 1325 FRVQCHQKGLHQ----RQPFLYENKHFVVYGTLDGALGYCLPLPEKVYRRFLMLQNVLLS 1380
Query: 182 HTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHND 241
+ H GLNP+ +RT K NPSR IIDG L+W + ++ ER E+ KKIG++ +
Sbjct: 1381 YQEHLCGLNPKEYRTLKSSKKQGINPSRCIIDGDLIWSYRLMANSERNEVAKKIGTRTEE 1440
Query: 242 ILDELYDIEALSSHF 256
IL +L +IE L+S F
Sbjct: 1441 ILGDLLEIERLASVF 1455
Score = 67.4 bits (163), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 33/52 (63%), Positives = 41/52 (78%)
Query: 6 ELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGTLKLRFKKLKVLFVSDR 57
EL + LGL+G RPLLLVRT+ ELLIYQ FR+PKG LK+RF+KL L + D+
Sbjct: 864 ELSVIGLGLNGERPLLLVRTRVELLIYQVFRYPKGHLKIRFRKLDQLNLLDQ 915
>gi|194883064|ref|XP_001975624.1| GG22421 [Drosophila erecta]
gi|190658811|gb|EDV56024.1| GG22421 [Drosophila erecta]
Length = 1455
Score = 192 bits (487), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 92/195 (47%), Positives = 133/195 (68%), Gaps = 10/195 (5%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P VY EF++++S++GF+++D ++N++++MYQPEARES GG +L++K D+HLGQ VNT
Sbjct: 1265 PLEVYGIEFMVDNSNLGFLVTDAERNLIVYMYQPEARESLGGQKLLRKADYHLGQVVNTM 1324
Query: 128 FKIRCKPSPISDAPGARSRFL------TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVT 181
F+++C + R FL Y +LDGALG+ LPLPEK YRR LMLQNV+V+
Sbjct: 1325 FRVQCHQKGLHQ----RQPFLYENKHFVVYGTLDGALGYCLPLPEKVYRRFLMLQNVLVS 1380
Query: 182 HTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHND 241
+ H GLNP+ +RT K NPSR IIDG L+W + ++ ER E+ KKIG++ +
Sbjct: 1381 YQEHLCGLNPKEYRTLKSFKKQGINPSRCIIDGDLIWSYRLMANSERNEVAKKIGTRTEE 1440
Query: 242 ILDELYDIEALSSHF 256
IL +L +IE L+S F
Sbjct: 1441 ILGDLLEIERLASVF 1455
Score = 64.7 bits (156), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 32/52 (61%), Positives = 40/52 (76%)
Query: 6 ELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGTLKLRFKKLKVLFVSDR 57
EL LGL+G RPLL+VRT+ ELLIYQ FR+PKG LK+RF+KL L + D+
Sbjct: 864 ELSLTGLGLNGERPLLMVRTRVELLIYQVFRYPKGHLKIRFRKLDQLNLLDQ 915
>gi|195381337|ref|XP_002049409.1| GJ21566 [Drosophila virilis]
gi|194144206|gb|EDW60602.1| GJ21566 [Drosophila virilis]
Length = 1420
Score = 192 bits (487), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 88/191 (46%), Positives = 133/191 (69%), Gaps = 2/191 (1%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P V+ EF++++S++GF+++D ++N++++MYQPEARES GG +L++K D+HLGQ VNT
Sbjct: 1230 PLEVFGIEFMVDNSNLGFLVTDAERNLIVYMYQPEARESLGGQKLLRKADYHLGQVVNTM 1289
Query: 128 FKIRCKPSPISDAPG--ARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSH 185
F+++C + ++ L Y +LDGALG+ LPLPEK YRR LMLQNV++++ H
Sbjct: 1290 FRVQCHQRGLHHRQPFLYENKHLVIYGTLDGALGYCLPLPEKVYRRFLMLQNVLLSYQDH 1349
Query: 186 TGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDE 245
GLNP+ +RT K NPSR IIDG L+W + L+ ER E+ KKIG++ +IL +
Sbjct: 1350 LCGLNPKEYRTIKTVKKMGINPSRCIIDGDLIWSYRMLAHSERSEVAKKIGTRTEEILAD 1409
Query: 246 LYDIEALSSHF 256
+ +IE LS+ F
Sbjct: 1410 MLEIERLSAVF 1420
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 33/52 (63%), Positives = 40/52 (76%)
Query: 6 ELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGTLKLRFKKLKVLFVSDR 57
EL V LG HG RPLLLVRT+ ELLIYQ FR+ KG LK+RF+KL+ L + D+
Sbjct: 830 ELCLVGLGQHGERPLLLVRTRLELLIYQVFRYAKGHLKIRFRKLEQLHLLDQ 881
>gi|241060959|ref|XP_002408050.1| cleavage and polyadenylation specificity factor, putative [Ixodes
scapularis]
gi|215492346|gb|EEC01987.1| cleavage and polyadenylation specificity factor, putative [Ixodes
scapularis]
Length = 1241
Score = 191 bits (484), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 86/181 (47%), Positives = 125/181 (69%), Gaps = 5/181 (2%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P V+A EF I++S M F+++D ++N++L+MYQPE+RES GG RL+++ DFH+G V +
Sbjct: 1048 PLEVFAVEFFIDNSQMSFLVTDSERNMILYMYQPESRESCGGQRLLRRGDFHIGSPVVSM 1107
Query: 128 FKIRCKPSPISD-----APGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
F+I+C+ ++ A R +T A+LDG+LG+ LP+PEK YRRLLMLQNV+VT+
Sbjct: 1108 FRIKCRMGEVAKHDRRLAASVDGRHITMLATLDGSLGYVLPVPEKTYRRLLMLQNVLVTN 1167
Query: 183 TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
H GLNP+AFR Y + GNP + I+DG L+WKF+ LS ER E+ KKIG+ +
Sbjct: 1168 MPHYAGLNPKAFRMYHSQRRVLGNPHKNILDGELIWKFMHLSFMERSELSKKIGTTVTQV 1227
Query: 243 L 243
+
Sbjct: 1228 V 1228
Score = 50.4 bits (119), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 29/67 (43%), Positives = 39/67 (58%), Gaps = 4/67 (5%)
Query: 3 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAF----RHPKGTLKLRFKKLKVLFVSDRS 58
+V E+L V LG+ +RPLLL R +LLIY+AF +G LKLRFKKL +
Sbjct: 645 VVHEILMVGLGVRQSRPLLLARVDEDLLIYEAFPFYETQREGHLKLRFKKLNHDIILRSR 704
Query: 59 KRANEQP 65
K ++P
Sbjct: 705 KYKTQKP 711
>gi|195455711|ref|XP_002074834.1| GK23274 [Drosophila willistoni]
gi|194170919|gb|EDW85820.1| GK23274 [Drosophila willistoni]
Length = 1463
Score = 190 bits (483), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 91/195 (46%), Positives = 131/195 (67%), Gaps = 10/195 (5%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P VY EF+++++++GF+++D + N++++MYQPEARES GG +L++K D+HLGQ VNT
Sbjct: 1273 PLEVYGIEFMVDNTNLGFLVTDAESNLIVYMYQPEARESLGGQKLLRKADYHLGQVVNTM 1332
Query: 128 FKIRCKPSPISDAPGARSRFL------TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVT 181
F+++C + R FL Y +LDGALG+ LPLPEK YRR LMLQNV+++
Sbjct: 1333 FRVQCHQRGLHQ----RQPFLYENKHFVVYGTLDGALGYCLPLPEKVYRRFLMLQNVLLS 1388
Query: 182 HTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHND 241
+ H GLNP+ +RT K NPSR IIDG L+W + L+ ER E+ KKIG++ +
Sbjct: 1389 YQDHLCGLNPKEYRTLKSSKRLGINPSRCIIDGDLIWSYRLLANSERNEVAKKIGTRTEE 1448
Query: 242 ILDELYDIEALSSHF 256
IL +L +IE LS F
Sbjct: 1449 ILADLLEIERLSGVF 1463
Score = 66.2 bits (160), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 33/52 (63%), Positives = 41/52 (78%)
Query: 6 ELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGTLKLRFKKLKVLFVSDR 57
EL V LGL+G RPLLLVRT+ ELLIYQ FR+PKG LK+RF+K+ L + D+
Sbjct: 872 ELSLVGLGLNGERPLLLVRTRLELLIYQVFRYPKGHLKIRFRKMDQLNLLDQ 923
>gi|195485994|ref|XP_002091320.1| GE12310 [Drosophila yakuba]
gi|194177421|gb|EDW91032.1| GE12310 [Drosophila yakuba]
Length = 1455
Score = 190 bits (483), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 90/195 (46%), Positives = 133/195 (68%), Gaps = 10/195 (5%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P VY EF++++S++GF+++D ++N++++MYQPEARES GG +L++K D+HLGQ VNT
Sbjct: 1265 PLEVYGIEFMVDNSNLGFLVTDAERNLIVYMYQPEARESLGGQKLLRKADYHLGQVVNTM 1324
Query: 128 FKIRCKPSPISDAPGARSRFL------TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVT 181
F+++C + R FL Y +LDGALG+ LPLPEK YRR LMLQNV+++
Sbjct: 1325 FRVQCHQKGLHQ----RQPFLYENKHFVVYGTLDGALGYCLPLPEKVYRRFLMLQNVLLS 1380
Query: 182 HTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHND 241
+ H GLNP+ +RT K NPSR +IDG L+W + ++ ER E+ KKIG++ +
Sbjct: 1381 YQEHLCGLNPKEYRTLKSFKKQGINPSRCVIDGDLIWSYRLMANSERNEVAKKIGTRTEE 1440
Query: 242 ILDELYDIEALSSHF 256
IL +L +IE L+S F
Sbjct: 1441 ILADLLEIERLASVF 1455
Score = 67.4 bits (163), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 33/52 (63%), Positives = 41/52 (78%)
Query: 6 ELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGTLKLRFKKLKVLFVSDR 57
EL + LGL+G RPLLLVRT+ ELLIYQ FR+PKG LK+RF+KL L + D+
Sbjct: 864 ELSVIGLGLNGERPLLLVRTRVELLIYQVFRYPKGHLKIRFRKLDQLNLLDQ 915
>gi|198457226|ref|XP_001360595.2| GA10080 [Drosophila pseudoobscura pseudoobscura]
gi|198135905|gb|EAL25170.2| GA10080 [Drosophila pseudoobscura pseudoobscura]
Length = 1459
Score = 190 bits (482), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 92/195 (47%), Positives = 130/195 (66%), Gaps = 10/195 (5%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P VY EF++++S++GF+++D ++N++++MYQPEARES GG +LI+K D+HLGQ VNT
Sbjct: 1269 PLEVYGIEFMVDNSNLGFLVTDAERNLIVYMYQPEARESLGGQKLIRKADYHLGQVVNTM 1328
Query: 128 FKIRCKPSPISDAPGARSRFL------TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVT 181
F+++C + R FL Y +LDG LG+ LPLPEK YRR LMLQNV+++
Sbjct: 1329 FRVQCHQRGVHQ----RQPFLYENKHFVVYGTLDGGLGYCLPLPEKVYRRFLMLQNVLLS 1384
Query: 182 HTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHND 241
+ H GLNP+ FRT K NPSR IIDG L+W + L +R E+ KKIG++ +
Sbjct: 1385 YQDHLCGLNPKEFRTLKSFKKQGLNPSRCIIDGDLIWSYRLLPNSDRNEVAKKIGTRTEE 1444
Query: 242 ILDELYDIEALSSHF 256
IL +L +IE LS F
Sbjct: 1445 ILSDLLEIERLSGVF 1459
Score = 66.6 bits (161), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 32/52 (61%), Positives = 42/52 (80%)
Query: 6 ELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGTLKLRFKKLKVLFVSDR 57
EL V LGL+G RP+L+VRT+ ELLIYQ FR+PKG LK+RF+KL+ L + D+
Sbjct: 868 ELSLVGLGLNGERPVLMVRTRVELLIYQVFRYPKGNLKIRFRKLEQLNLLDQ 919
>gi|195150431|ref|XP_002016158.1| GL10645 [Drosophila persimilis]
gi|194110005|gb|EDW32048.1| GL10645 [Drosophila persimilis]
Length = 1459
Score = 189 bits (481), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 92/195 (47%), Positives = 130/195 (66%), Gaps = 10/195 (5%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P VY EF++++S++GF+++D ++N++++MYQPEARES GG +LI+K D+HLGQ VNT
Sbjct: 1269 PLEVYGIEFMVDNSNLGFLVTDAERNLIVYMYQPEARESLGGQKLIRKADYHLGQVVNTM 1328
Query: 128 FKIRCKPSPISDAPGARSRFL------TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVT 181
F+++C + R FL Y +LDG LG+ LPLPEK YRR LMLQNV+++
Sbjct: 1329 FRVQCHQRGVHQ----RQPFLYENKHFVVYGTLDGGLGYCLPLPEKVYRRFLMLQNVLLS 1384
Query: 182 HTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHND 241
+ H GLNP+ FRT K NPSR IIDG L+W + L +R E+ KKIG++ +
Sbjct: 1385 YQDHLCGLNPKEFRTLKSFKKQGLNPSRCIIDGDLIWSYRLLPNSDRNEVAKKIGTRTEE 1444
Query: 242 ILDELYDIEALSSHF 256
IL +L +IE LS F
Sbjct: 1445 ILSDLLEIERLSGVF 1459
Score = 66.6 bits (161), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 32/52 (61%), Positives = 42/52 (80%)
Query: 6 ELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGTLKLRFKKLKVLFVSDR 57
EL V LGL+G RP+L+VRT+ ELLIYQ FR+PKG LK+RF+KL+ L + D+
Sbjct: 868 ELSLVGLGLNGERPVLMVRTRVELLIYQVFRYPKGNLKIRFRKLEQLNLLDQ 919
>gi|149066088|gb|EDM15961.1| cleavage and polyadenylation specific factor 1, 160kDa (predicted),
isoform CRA_b [Rattus norvegicus]
Length = 241
Score = 186 bits (473), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 84/194 (43%), Positives = 133/194 (68%), Gaps = 7/194 (3%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 50 PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 109
Query: 128 FKIRCKPSPISDAPGARS-----RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
++ C+ + ++ P +S + +TW+A+LDG +G LP+ EK YRRLLMLQN + T
Sbjct: 110 WRTPCRGA--AEGPSKKSVMWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTM 167
Query: 183 TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
H GLNPRAFR N R ++DG L+ ++L LS ER E+ KKIG+ + I
Sbjct: 168 LPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDII 227
Query: 243 LDELYDIEALSSHF 256
LD+L + + +++HF
Sbjct: 228 LDDLLETDRVTAHF 241
>gi|334326317|ref|XP_001364707.2| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Monodelphis domestica]
Length = 1449
Score = 186 bits (473), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 86/194 (44%), Positives = 134/194 (69%), Gaps = 7/194 (3%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P VY+ +F+++S+ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 1258 PLEVYSVDFMVDSAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 1317
Query: 128 FKIRCKPSPISDAPGARS-----RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
++ C+ + ++ P +S + +TW+A+LDG +G LP+ EK YRRLLMLQN + T
Sbjct: 1318 WRTPCRGA--AEGPSKKSIVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTM 1375
Query: 183 TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
H GLNPRAFR N R ++DG L+ ++L LS ER E+ KKIG+ + I
Sbjct: 1376 LPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERGELAKKIGTTPDII 1435
Query: 243 LDELYDIEALSSHF 256
LD+L +I+ +++HF
Sbjct: 1436 LDDLLEIDRVTAHF 1449
Score = 46.6 bits (109), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 25/51 (49%), Positives = 33/51 (64%), Gaps = 4/51 (7%)
Query: 3 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGTLKLRFKKL 49
+V+E+L V+LG RP LLV ELLIY+AF H + LK+RFKK+
Sbjct: 851 LVKEVLLVALGNRQTRPYLLVHVDQELLIYEAFAHDSQLGQSNLKVRFKKV 901
>gi|395512730|ref|XP_003760588.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
[Sarcophilus harrisii]
Length = 1449
Score = 186 bits (471), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 86/194 (44%), Positives = 134/194 (69%), Gaps = 7/194 (3%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P VY+ +F+++S+ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 1258 PLEVYSVDFMVDSAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 1317
Query: 128 FKIRCKPSPISDAPGARS-----RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
++ C+ + ++ P +S + +TW+A+LDG +G LP+ EK YRRLLMLQN + T
Sbjct: 1318 WRTPCRGA--AEGPTKKSIVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTM 1375
Query: 183 TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
H GLNPRAFR N R ++DG L+ ++L LS ER E+ KKIG+ + I
Sbjct: 1376 LPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERGELAKKIGTTPDII 1435
Query: 243 LDELYDIEALSSHF 256
LD+L +I+ +++HF
Sbjct: 1436 LDDLLEIDRVTAHF 1449
Score = 46.6 bits (109), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 25/51 (49%), Positives = 33/51 (64%), Gaps = 4/51 (7%)
Query: 3 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGTLKLRFKKL 49
+V+E+L V+LG RP LLV ELLIY+AF H + LK+RFKK+
Sbjct: 851 LVKEVLLVALGNRQTRPYLLVHVDQELLIYEAFAHDSQLGQSNLKVRFKKV 901
>gi|348512553|ref|XP_003443807.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Oreochromis niloticus]
Length = 1456
Score = 185 bits (469), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 84/192 (43%), Positives = 134/192 (69%), Gaps = 3/192 (1%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P VY+ EF+++++ +GF++SD+DKN+ ++MY PEA+ES GG RL+++ DF+ G ++NTF
Sbjct: 1265 PLEVYSIEFMVDNNQLGFLVSDRDKNLYVYMYLPEAKESFGGMRLLRRADFNAGANINTF 1324
Query: 128 FKIRCKPSPISDAPGA---RSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTS 184
+++ C+ + + + A ++ +TW+A+LDG +G LP+ EK YRRLLMLQN + T
Sbjct: 1325 WRMPCRGALDASSKKALTWDNKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLP 1384
Query: 185 HTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILD 244
H GLNP+AFR NP + I+DG L+ K+L LS+ ER E+ KKIG+ + ILD
Sbjct: 1385 HHAGLNPKAFRMLHSDRRSLQNPVKNILDGELLNKYLYLSMMERSELAKKIGTTQDIILD 1444
Query: 245 ELYDIEALSSHF 256
+L +I+ +++HF
Sbjct: 1445 DLLEIDRVTAHF 1456
Score = 48.1 bits (113), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 24/51 (47%), Positives = 37/51 (72%), Gaps = 4/51 (7%)
Query: 3 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRH----PKGTLKLRFKKL 49
+V+E+ VSLG + ++P LLV + ELLIY+AF++ P+ LK+RFKK+
Sbjct: 857 LVKEVALVSLGNNHSKPYLLVHVEQELLIYEAFQYDQQQPQNNLKVRFKKV 907
>gi|417406474|gb|JAA49895.1| Putative mrna cleavage and polyadenylation factor ii complex subunit
cft1 cpsf subunit [Desmodus rotundus]
Length = 1444
Score = 184 bits (467), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 85/194 (43%), Positives = 133/194 (68%), Gaps = 7/194 (3%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 1253 PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 1312
Query: 128 FKIRCKPSPISDAPGARS-----RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
++ C+ + ++ P +S + +TW+A+LDG +G LP+ EK YRRLLMLQN + T
Sbjct: 1313 WRTPCRGA--AEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTM 1370
Query: 183 TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
H GLNPRAFR N R I+DG L+ ++L LS ER E+ KKIG+ + I
Sbjct: 1371 LPHHAGLNPRAFRMLHVDRRTLQNAVRNILDGELLNRYLYLSTMERSELAKKIGTTPDII 1430
Query: 243 LDELYDIEALSSHF 256
LD+L + + +++HF
Sbjct: 1431 LDDLLETDRVTAHF 1444
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 26/51 (50%), Positives = 35/51 (68%), Gaps = 4/51 (7%)
Query: 3 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGTLKLRFKKL 49
+V+E+L V+LG +RP LLV ELLIY+AF H +G LK+RFKK+
Sbjct: 846 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFAHDSQLGQGNLKVRFKKV 896
>gi|351713968|gb|EHB16887.1| Cleavage and polyadenylation specificity factor subunit 1
[Heterocephalus glaber]
Length = 1440
Score = 184 bits (467), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 85/194 (43%), Positives = 133/194 (68%), Gaps = 7/194 (3%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 1249 PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 1308
Query: 128 FKIRCKPSPISDAPGARS-----RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
++ C+ + S+ P +S + +TW+A+LDG +G LP+ EK YRRLLMLQN + T
Sbjct: 1309 WRTPCRGA--SEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTM 1366
Query: 183 TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
H GLNPRAFR N R ++DG L+ ++L LS ER E+ KKIG+ + I
Sbjct: 1367 LPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERGELAKKIGTTPDII 1426
Query: 243 LDELYDIEALSSHF 256
LD+L + + +++HF
Sbjct: 1427 LDDLLETDRVTAHF 1440
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/51 (50%), Positives = 35/51 (68%), Gaps = 4/51 (7%)
Query: 3 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGTLKLRFKKL 49
+V+E+L V+LG +RP LLV ELLIY+AF H +G LK+RFKK+
Sbjct: 843 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKV 893
>gi|354491124|ref|XP_003507706.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
isoform 2 [Cricetulus griseus]
Length = 1388
Score = 184 bits (467), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 84/194 (43%), Positives = 133/194 (68%), Gaps = 7/194 (3%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 1197 PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 1256
Query: 128 FKIRCKPSPISDAPGARS-----RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
++ C+ + ++ P +S + +TW+A+LDG +G LP+ EK YRRLLMLQN + T
Sbjct: 1257 WRTPCRGA--AEGPSKKSVMWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTM 1314
Query: 183 TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
H GLNPRAFR N R ++DG L+ ++L LS ER E+ KKIG+ + I
Sbjct: 1315 LPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDII 1374
Query: 243 LDELYDIEALSSHF 256
LD+L + + +++HF
Sbjct: 1375 LDDLLETDRVTAHF 1388
Score = 49.3 bits (116), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 26/51 (50%), Positives = 35/51 (68%), Gaps = 4/51 (7%)
Query: 3 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGTLKLRFKKL 49
+V+E+L V+LG +RP LLV ELLIY+AF H +G LK+RFKK+
Sbjct: 841 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKV 891
>gi|354491122|ref|XP_003507705.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
isoform 1 [Cricetulus griseus]
Length = 1441
Score = 184 bits (467), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 84/194 (43%), Positives = 133/194 (68%), Gaps = 7/194 (3%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 1250 PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 1309
Query: 128 FKIRCKPSPISDAPGARS-----RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
++ C+ + ++ P +S + +TW+A+LDG +G LP+ EK YRRLLMLQN + T
Sbjct: 1310 WRTPCRGA--AEGPSKKSVMWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTM 1367
Query: 183 TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
H GLNPRAFR N R ++DG L+ ++L LS ER E+ KKIG+ + I
Sbjct: 1368 LPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDII 1427
Query: 243 LDELYDIEALSSHF 256
LD+L + + +++HF
Sbjct: 1428 LDDLLETDRVTAHF 1441
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/51 (50%), Positives = 35/51 (68%), Gaps = 4/51 (7%)
Query: 3 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGTLKLRFKKL 49
+V+E+L V+LG +RP LLV ELLIY+AF H +G LK+RFKK+
Sbjct: 843 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKV 893
>gi|197245729|gb|AAI68713.1| Cpsf1 protein [Rattus norvegicus]
Length = 1439
Score = 184 bits (466), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 84/194 (43%), Positives = 133/194 (68%), Gaps = 7/194 (3%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 1248 PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 1307
Query: 128 FKIRCKPSPISDAPGARS-----RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
++ C+ + ++ P +S + +TW+A+LDG +G LP+ EK YRRLLMLQN + T
Sbjct: 1308 WRTPCRGA--AEGPSKKSVMWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTM 1365
Query: 183 TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
H GLNPRAFR N R ++DG L+ ++L LS ER E+ KKIG+ + I
Sbjct: 1366 LPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDII 1425
Query: 243 LDELYDIEALSSHF 256
LD+L + + +++HF
Sbjct: 1426 LDDLLETDRVTAHF 1439
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/51 (50%), Positives = 35/51 (68%), Gaps = 4/51 (7%)
Query: 3 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGTLKLRFKKL 49
+V+E+L V+LG +RP LLV ELLIY+AF H +G LK+RFKK+
Sbjct: 841 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKV 891
>gi|148697644|gb|EDL29591.1| cleavage and polyadenylation specific factor 1, isoform CRA_c [Mus
musculus]
Length = 1388
Score = 184 bits (466), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 84/194 (43%), Positives = 133/194 (68%), Gaps = 7/194 (3%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 1197 PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 1256
Query: 128 FKIRCKPSPISDAPGARS-----RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
++ C+ + ++ P +S + +TW+A+LDG +G LP+ EK YRRLLMLQN + T
Sbjct: 1257 WRTPCRGA--AEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTM 1314
Query: 183 TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
H GLNPRAFR N R ++DG L+ ++L LS ER E+ KKIG+ + I
Sbjct: 1315 LPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDII 1374
Query: 243 LDELYDIEALSSHF 256
LD+L + + +++HF
Sbjct: 1375 LDDLLETDRVTAHF 1388
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/51 (50%), Positives = 35/51 (68%), Gaps = 4/51 (7%)
Query: 3 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGTLKLRFKKL 49
+V+E+L V+LG +RP LLV ELLIY+AF H +G LK+RFKK+
Sbjct: 841 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKV 891
>gi|344236599|gb|EGV92702.1| Cleavage and polyadenylation specificity factor subunit 1 [Cricetulus
griseus]
Length = 1419
Score = 184 bits (466), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 84/194 (43%), Positives = 133/194 (68%), Gaps = 7/194 (3%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 1228 PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 1287
Query: 128 FKIRCKPSPISDAPGARS-----RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
++ C+ + ++ P +S + +TW+A+LDG +G LP+ EK YRRLLMLQN + T
Sbjct: 1288 WRTPCRGA--AEGPSKKSVMWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTM 1345
Query: 183 TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
H GLNPRAFR N R ++DG L+ ++L LS ER E+ KKIG+ + I
Sbjct: 1346 LPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDII 1405
Query: 243 LDELYDIEALSSHF 256
LD+L + + +++HF
Sbjct: 1406 LDDLLETDRVTAHF 1419
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/51 (50%), Positives = 35/51 (68%), Gaps = 4/51 (7%)
Query: 3 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGTLKLRFKKL 49
+V+E+L V+LG +RP LLV ELLIY+AF H +G LK+RFKK+
Sbjct: 821 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKV 871
>gi|194474008|ref|NP_001124043.1| cleavage and polyadenylation specificity factor subunit 1 [Rattus
norvegicus]
gi|149066087|gb|EDM15960.1| cleavage and polyadenylation specific factor 1, 160kDa (predicted),
isoform CRA_a [Rattus norvegicus]
Length = 1386
Score = 184 bits (466), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 84/194 (43%), Positives = 133/194 (68%), Gaps = 7/194 (3%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 1195 PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 1254
Query: 128 FKIRCKPSPISDAPGARS-----RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
++ C+ + ++ P +S + +TW+A+LDG +G LP+ EK YRRLLMLQN + T
Sbjct: 1255 WRTPCRGA--AEGPSKKSVMWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTM 1312
Query: 183 TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
H GLNPRAFR N R ++DG L+ ++L LS ER E+ KKIG+ + I
Sbjct: 1313 LPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDII 1372
Query: 243 LDELYDIEALSSHF 256
LD+L + + +++HF
Sbjct: 1373 LDDLLETDRVTAHF 1386
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/51 (50%), Positives = 35/51 (68%), Gaps = 4/51 (7%)
Query: 3 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGTLKLRFKKL 49
+V+E+L V+LG +RP LLV ELLIY+AF H +G LK+RFKK+
Sbjct: 839 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKV 889
>gi|148697643|gb|EDL29590.1| cleavage and polyadenylation specific factor 1, isoform CRA_b [Mus
musculus]
Length = 1311
Score = 184 bits (466), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 84/194 (43%), Positives = 133/194 (68%), Gaps = 7/194 (3%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 1120 PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 1179
Query: 128 FKIRCKPSPISDAPGARS-----RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
++ C+ + ++ P +S + +TW+A+LDG +G LP+ EK YRRLLMLQN + T
Sbjct: 1180 WRTPCRGA--AEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTM 1237
Query: 183 TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
H GLNPRAFR N R ++DG L+ ++L LS ER E+ KKIG+ + I
Sbjct: 1238 LPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDII 1297
Query: 243 LDELYDIEALSSHF 256
LD+L + + +++HF
Sbjct: 1298 LDDLLETDRVTAHF 1311
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/51 (50%), Positives = 35/51 (68%), Gaps = 4/51 (7%)
Query: 3 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGTLKLRFKKL 49
+V+E+L V+LG +RP LLV ELLIY+AF H +G LK+RFKK+
Sbjct: 764 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKV 814
>gi|16751835|ref|NP_444423.1| cleavage and polyadenylation specificity factor subunit 1 isoform 2
[Mus musculus]
gi|17374611|sp|Q9EPU4.1|CPSF1_MOUSE RecName: Full=Cleavage and polyadenylation specificity factor subunit
1; AltName: Full=Cleavage and polyadenylation specificity
factor 160 kDa subunit; Short=CPSF 160 kDa subunit
gi|11762096|gb|AAG40326.1|AF322193_1 cleavage and polyadenylation specificity factor 1 [Mus musculus]
gi|38614159|gb|AAH56388.1| Cleavage and polyadenylation specific factor 1 [Mus musculus]
Length = 1441
Score = 183 bits (465), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 84/194 (43%), Positives = 133/194 (68%), Gaps = 7/194 (3%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 1250 PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 1309
Query: 128 FKIRCKPSPISDAPGARS-----RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
++ C+ + ++ P +S + +TW+A+LDG +G LP+ EK YRRLLMLQN + T
Sbjct: 1310 WRTPCRGA--AEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTM 1367
Query: 183 TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
H GLNPRAFR N R ++DG L+ ++L LS ER E+ KKIG+ + I
Sbjct: 1368 LPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDII 1427
Query: 243 LDELYDIEALSSHF 256
LD+L + + +++HF
Sbjct: 1428 LDDLLETDRVTAHF 1441
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/51 (50%), Positives = 35/51 (68%), Gaps = 4/51 (7%)
Query: 3 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGTLKLRFKKL 49
+V+E+L V+LG +RP LLV ELLIY+AF H +G LK+RFKK+
Sbjct: 843 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKV 893
>gi|148697642|gb|EDL29589.1| cleavage and polyadenylation specific factor 1, isoform CRA_a [Mus
musculus]
Length = 1417
Score = 183 bits (465), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 84/194 (43%), Positives = 133/194 (68%), Gaps = 7/194 (3%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 1226 PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 1285
Query: 128 FKIRCKPSPISDAPGARS-----RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
++ C+ + ++ P +S + +TW+A+LDG +G LP+ EK YRRLLMLQN + T
Sbjct: 1286 WRTPCRGA--AEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTM 1343
Query: 183 TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
H GLNPRAFR N R ++DG L+ ++L LS ER E+ KKIG+ + I
Sbjct: 1344 LPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDII 1403
Query: 243 LDELYDIEALSSHF 256
LD+L + + +++HF
Sbjct: 1404 LDDLLETDRVTAHF 1417
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/51 (50%), Positives = 35/51 (68%), Gaps = 4/51 (7%)
Query: 3 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGTLKLRFKKL 49
+V+E+L V+LG +RP LLV ELLIY+AF H +G LK+RFKK+
Sbjct: 870 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKV 920
>gi|348555854|ref|XP_003463738.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
isoform 1 [Cavia porcellus]
Length = 1440
Score = 183 bits (465), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 84/194 (43%), Positives = 133/194 (68%), Gaps = 7/194 (3%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 1249 PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 1308
Query: 128 FKIRCKPSPISDAPGARS-----RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
++ C+ + ++ P +S + +TW+A+LDG +G LP+ EK YRRLLMLQN + T
Sbjct: 1309 WRTPCRGA--TEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTM 1366
Query: 183 TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
H GLNPRAFR N R ++DG L+ ++L LS ER E+ KKIG+ + I
Sbjct: 1367 LPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERGELAKKIGTTPDII 1426
Query: 243 LDELYDIEALSSHF 256
LD+L + + +++HF
Sbjct: 1427 LDDLLETDRVTAHF 1440
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/51 (50%), Positives = 35/51 (68%), Gaps = 4/51 (7%)
Query: 3 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGTLKLRFKKL 49
+V+E+L V+LG +RP LLV ELLIY+AF H +G LK+RFKK+
Sbjct: 842 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKV 892
>gi|395860104|ref|XP_003802355.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
[Otolemur garnettii]
Length = 1441
Score = 183 bits (465), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 84/194 (43%), Positives = 133/194 (68%), Gaps = 7/194 (3%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 1250 PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 1309
Query: 128 FKIRCKPSPISDAPGARS-----RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
++ C+ + ++ P +S + +TW+A+LDG +G LP+ EK YRRLLMLQN + T
Sbjct: 1310 WRTPCRGA--TEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTM 1367
Query: 183 TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
H GLNPRAFR N R ++DG L+ ++L LS ER E+ KKIG+ + I
Sbjct: 1368 LPHHAGLNPRAFRMLHVDRRVLQNAVRNVLDGELLNRYLYLSTMERGELAKKIGTTPDII 1427
Query: 243 LDELYDIEALSSHF 256
LD+L + + +++HF
Sbjct: 1428 LDDLLETDRVTAHF 1441
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/51 (50%), Positives = 35/51 (68%), Gaps = 4/51 (7%)
Query: 3 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGTLKLRFKKL 49
+V+E+L V+LG +RP LLV ELLIY+AF H +G LK+RFKK+
Sbjct: 843 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKV 893
>gi|348555856|ref|XP_003463739.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
isoform 2 [Cavia porcellus]
Length = 1387
Score = 183 bits (465), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 84/194 (43%), Positives = 133/194 (68%), Gaps = 7/194 (3%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 1196 PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 1255
Query: 128 FKIRCKPSPISDAPGARS-----RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
++ C+ + ++ P +S + +TW+A+LDG +G LP+ EK YRRLLMLQN + T
Sbjct: 1256 WRTPCRGA--TEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTM 1313
Query: 183 TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
H GLNPRAFR N R ++DG L+ ++L LS ER E+ KKIG+ + I
Sbjct: 1314 LPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERGELAKKIGTTPDII 1373
Query: 243 LDELYDIEALSSHF 256
LD+L + + +++HF
Sbjct: 1374 LDDLLETDRVTAHF 1387
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/51 (50%), Positives = 35/51 (68%), Gaps = 4/51 (7%)
Query: 3 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGTLKLRFKKL 49
+V+E+L V+LG +RP LLV ELLIY+AF H +G LK+RFKK+
Sbjct: 840 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKV 890
>gi|338728513|ref|XP_003365689.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Equus caballus]
Length = 1450
Score = 183 bits (464), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 84/194 (43%), Positives = 133/194 (68%), Gaps = 7/194 (3%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 1259 PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 1318
Query: 128 FKIRCKPSPISDAPGARS-----RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
++ C+ + ++ P +S + +TW+A+LDG +G LP+ EK YRRLLMLQN + T
Sbjct: 1319 WRTPCRGA--AEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTM 1376
Query: 183 TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
H GLNPRAFR N R ++DG L+ ++L LS ER E+ KKIG+ + I
Sbjct: 1377 LPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERGELAKKIGTTPDII 1436
Query: 243 LDELYDIEALSSHF 256
LD+L + + +++HF
Sbjct: 1437 LDDLLETDRVTAHF 1450
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/51 (50%), Positives = 35/51 (68%), Gaps = 4/51 (7%)
Query: 3 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGTLKLRFKKL 49
+V+E+L V+LG +RP LLV ELLIY+AF H +G LK+RFKK+
Sbjct: 852 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKV 902
>gi|27807297|ref|NP_777145.1| cleavage and polyadenylation specificity factor subunit 1 [Bos
taurus]
gi|1706101|sp|Q10569.1|CPSF1_BOVIN RecName: Full=Cleavage and polyadenylation specificity factor subunit
1; AltName: Full=Cleavage and polyadenylation specificity
factor 160 kDa subunit; Short=CPSF 160 kDa subunit
gi|929007|emb|CAA58152.1| cleavage and polyadenylation specificity factor, 160 kDa subunit [Bos
taurus]
gi|296480730|tpg|DAA22845.1| TPA: cleavage and polyadenylation specificity factor subunit 1 [Bos
taurus]
Length = 1444
Score = 183 bits (464), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 84/194 (43%), Positives = 133/194 (68%), Gaps = 7/194 (3%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 1253 PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 1312
Query: 128 FKIRCKPSPISDAPGARS-----RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
++ C+ + ++ P +S + +TW+A+LDG +G LP+ EK YRRLLMLQN + T
Sbjct: 1313 WRTPCRGA--AEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTM 1370
Query: 183 TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
H GLNPRAFR N R ++DG L+ ++L LS ER E+ KKIG+ + I
Sbjct: 1371 LPHHAGLNPRAFRMLHVDRRVLQNAVRNVLDGELLNRYLYLSTMERGELAKKIGTTPDII 1430
Query: 243 LDELYDIEALSSHF 256
LD+L + + +++HF
Sbjct: 1431 LDDLLETDRVTAHF 1444
Score = 48.5 bits (114), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/51 (50%), Positives = 34/51 (66%), Gaps = 4/51 (7%)
Query: 3 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGTLKLRFKKL 49
+V+E+L V+LG RP LLV ELLIY+AF H +G LK+RFKK+
Sbjct: 846 LVKEVLLVALGSRQRRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKV 896
>gi|391328522|ref|XP_003738737.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Metaseiulus occidentalis]
Length = 1500
Score = 183 bits (464), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 84/197 (42%), Positives = 129/197 (65%), Gaps = 11/197 (5%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P +Y+ E+++++S M F+ +D N++++MYQPE RE+ GG RLI++ DF++G +NT
Sbjct: 1307 PVQIYSIEYLVDNSQMAFLATDAQCNILVYMYQPENRETFGGQRLIRRGDFNIGSRINTM 1366
Query: 128 FKIRCKPSPISDAPGARSRFL--------TWYASLDGALGFFLPLPEKNYRRLLMLQNVM 179
F+IRC+ +++ P + R L T YASLDGA G+ LP+ EK YRRLLMLQNV+
Sbjct: 1367 FRIRCR---LAEVPRSERRLLSDLEARHVTLYASLDGAFGYLLPISEKTYRRLLMLQNVL 1423
Query: 180 VTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKH 239
++ H GGLNP+AFR + NP + I+DG L+ F+ L+ E+ E+ +KIG+
Sbjct: 1424 NSYCQHVGGLNPKAFRIMQTDVRALSNPQKNIVDGDLINVFMDLNFNEKAEVARKIGTTV 1483
Query: 240 NDILDELYDIEALSSHF 256
+ I +L +IE L+ HF
Sbjct: 1484 HQIQLDLAEIEGLTYHF 1500
Score = 44.3 bits (103), Expect = 0.056, Method: Compositional matrix adjust.
Identities = 21/50 (42%), Positives = 32/50 (64%), Gaps = 3/50 (6%)
Query: 4 VQELLTVSLGLHGNRPLLLVRTQHELLIYQAF---RHPKGTLKLRFKKLK 50
V E+ +LG+H +RPLL R EL IY+A+ +G LKL+F++L+
Sbjct: 911 VMEIFVCALGMHQSRPLLFARVDSELHIYEAYPFVNQKEGHLKLQFRRLQ 960
>gi|345779232|ref|XP_532356.3| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
[Canis lupus familiaris]
Length = 1460
Score = 183 bits (464), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 84/194 (43%), Positives = 133/194 (68%), Gaps = 7/194 (3%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 1269 PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 1328
Query: 128 FKIRCKPSPISDAPGARS-----RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
++ C+ + ++ P +S + +TW+A+LDG +G LP+ EK YRRLLMLQN + T
Sbjct: 1329 WRTPCRGA--AEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTM 1386
Query: 183 TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
H GLNPRAFR N R ++DG L+ ++L LS ER E+ KKIG+ + I
Sbjct: 1387 LPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERGELAKKIGTTPDII 1446
Query: 243 LDELYDIEALSSHF 256
LD+L + + +++HF
Sbjct: 1447 LDDLLETDRVTAHF 1460
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/51 (50%), Positives = 35/51 (68%), Gaps = 4/51 (7%)
Query: 3 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGTLKLRFKKL 49
+V+E+L V+LG +RP LLV ELLIY+AF H +G LK+RFKK+
Sbjct: 862 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKV 912
>gi|338728511|ref|XP_001505047.3| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like isoform 1 [Equus caballus]
Length = 1444
Score = 183 bits (464), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 84/194 (43%), Positives = 133/194 (68%), Gaps = 7/194 (3%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 1253 PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 1312
Query: 128 FKIRCKPSPISDAPGARS-----RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
++ C+ + ++ P +S + +TW+A+LDG +G LP+ EK YRRLLMLQN + T
Sbjct: 1313 WRTPCRGA--AEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTM 1370
Query: 183 TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
H GLNPRAFR N R ++DG L+ ++L LS ER E+ KKIG+ + I
Sbjct: 1371 LPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERGELAKKIGTTPDII 1430
Query: 243 LDELYDIEALSSHF 256
LD+L + + +++HF
Sbjct: 1431 LDDLLETDRVTAHF 1444
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/51 (50%), Positives = 35/51 (68%), Gaps = 4/51 (7%)
Query: 3 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGTLKLRFKKL 49
+V+E+L V+LG +RP LLV ELLIY+AF H +G LK+RFKK+
Sbjct: 846 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKV 896
>gi|358415280|ref|XP_003583063.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
[Bos taurus]
Length = 1490
Score = 182 bits (463), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 84/194 (43%), Positives = 133/194 (68%), Gaps = 7/194 (3%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 1299 PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 1358
Query: 128 FKIRCKPSPISDAPGARS-----RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
++ C+ + ++ P +S + +TW+A+LDG +G LP+ EK YRRLLMLQN + T
Sbjct: 1359 WRTPCRGA--AEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTM 1416
Query: 183 TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
H GLNPRAFR N R ++DG L+ ++L LS ER E+ KKIG+ + I
Sbjct: 1417 LPHHAGLNPRAFRMLHVDRRVLQNAVRNVLDGELLNRYLYLSTMERGELAKKIGTTPDII 1476
Query: 243 LDELYDIEALSSHF 256
LD+L + + +++HF
Sbjct: 1477 LDDLLETDRVTAHF 1490
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 26/51 (50%), Positives = 34/51 (66%), Gaps = 4/51 (7%)
Query: 3 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGTLKLRFKKL 49
+V+E+L V+LG RP LLV ELLIY+AF H +G LK+RFKK+
Sbjct: 892 LVKEVLLVALGSRQRRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKV 942
>gi|12697776|dbj|BAB21613.1| polyadenylation specificity factor [Homo sapiens]
Length = 216
Score = 182 bits (463), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 82/192 (42%), Positives = 131/192 (68%), Gaps = 3/192 (1%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 25 PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 84
Query: 128 FKIRCKPSPISDAPGA---RSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTS 184
++ C+ + + + ++ +TW+A+LDG +G LP+ EK YRRLLMLQN + T
Sbjct: 85 WRTPCRGATEGLSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLP 144
Query: 185 HTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILD 244
H GLNPRAFR N R ++DG L+ ++L LS ER E+ KKIG+ + ILD
Sbjct: 145 HHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDIILD 204
Query: 245 ELYDIEALSSHF 256
+L + + +++HF
Sbjct: 205 DLLETDRVTAHF 216
>gi|431908146|gb|ELK11749.1| Cleavage and polyadenylation specificity factor subunit 1 [Pteropus
alecto]
Length = 820
Score = 182 bits (462), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 84/194 (43%), Positives = 133/194 (68%), Gaps = 7/194 (3%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 629 PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 688
Query: 128 FKIRCKPSPISDAPGARS-----RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
++ C+ + ++ P +S + +TW+A+LDG +G LP+ EK YRRLLMLQN + T
Sbjct: 689 WRTPCRGA--AEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTM 746
Query: 183 TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
H GLNPRAFR N R ++DG L+ ++L LS ER E+ KKIG+ + I
Sbjct: 747 LPHHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELLNRYLYLSTMERGELAKKIGTTPDII 806
Query: 243 LDELYDIEALSSHF 256
LD+L + + +++HF
Sbjct: 807 LDDLLETDRVTAHF 820
Score = 48.1 bits (113), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 25/51 (49%), Positives = 35/51 (68%), Gaps = 4/51 (7%)
Query: 3 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGTLKLRFKKL 49
+V+E+L V+LG +RP LLV ELL+Y+AF H +G LK+RFKK+
Sbjct: 223 LVKEVLLVALGSRQSRPYLLVHVDQELLVYEAFPHDSQLGQGNLKVRFKKV 273
>gi|355680843|gb|AER96659.1| cleavage and polyadenylation specific factor 1, 160kDa [Mustela
putorius furo]
Length = 1399
Score = 181 bits (460), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 83/193 (43%), Positives = 132/193 (68%), Gaps = 7/193 (3%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 1209 PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 1268
Query: 128 FKIRCKPSPISDAPGARS-----RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
++ C+ + ++ P +S + +TW+A+LDG +G LP+ EK YRRLLMLQN + T
Sbjct: 1269 WRTPCRGA--AEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTM 1326
Query: 183 TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
H GLNPRAFR N R ++DG L+ ++L LS ER E+ KKIG+ + I
Sbjct: 1327 LPHHAGLNPRAFRLLHADRRALQNAVRNVLDGELLNRYLYLSTMERGELAKKIGTTPDII 1386
Query: 243 LDELYDIEALSSH 255
LD+L + + +++H
Sbjct: 1387 LDDLLETDRVTAH 1399
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/51 (50%), Positives = 35/51 (68%), Gaps = 4/51 (7%)
Query: 3 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGTLKLRFKKL 49
+V+E+L V+LG +RP LLV ELLIY+AF H +G LK+RFKK+
Sbjct: 802 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKV 852
>gi|229335612|ref|NP_001108153.2| cleavage and polyadenylation specificity factor subunit 1 [Danio
rerio]
Length = 1449
Score = 181 bits (460), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 85/192 (44%), Positives = 132/192 (68%), Gaps = 3/192 (1%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P VY+ EF+++++ +GF++SD+DKN++++MY PEA+ES GG RL+++ DF++G HVN F
Sbjct: 1258 PLEVYSIEFMVDNNQLGFLVSDRDKNLMVYMYLPEAKESFGGMRLLRRADFNVGSHVNAF 1317
Query: 128 FKIRCKPSPISDAPGA---RSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTS 184
+++ C+ + + A ++ +TW+A+LDG +G LP+ EK YRRLLMLQN + T
Sbjct: 1318 WRMPCRGTLDTANKKALTWDNKHITWFATLDGGVGLLLPMQEKTYRRLLMLQNALTTMLP 1377
Query: 185 HTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILD 244
H GLNP+AFR N + I+DG L+ K+L LS ER E+ KKIG+ + ILD
Sbjct: 1378 HHAGLNPKAFRMLHCDRRTLQNAVKNILDGELLNKYLYLSTMERSELAKKIGTTPDIILD 1437
Query: 245 ELYDIEALSSHF 256
+L +IE +++HF
Sbjct: 1438 DLLEIERVTAHF 1449
Score = 45.1 bits (105), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 34/51 (66%), Gaps = 4/51 (7%)
Query: 3 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAF----RHPKGTLKLRFKKL 49
+V+E+ VSLG + +RP LL + ELLIY+AF + + LK+RFKK+
Sbjct: 851 LVKEVALVSLGYNHSRPYLLAHVEQELLIYEAFPYDQQQAQSNLKVRFKKM 901
>gi|432883539|ref|XP_004074300.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Oryzias latipes]
Length = 1456
Score = 181 bits (460), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 83/192 (43%), Positives = 131/192 (68%), Gaps = 3/192 (1%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P VY+ EF+++++ +GF++SD+DKN+ ++MY PEA+ES GG RL+++ DF+ G H+N+
Sbjct: 1265 PLEVYSIEFIVDNNQLGFLVSDRDKNLFVYMYLPEAKESFGGMRLLRRADFNAGAHINSL 1324
Query: 128 FKIRCKPSPISDAPGA---RSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTS 184
+++ C+ + S + A ++ +TW+A+LDG +G LP+ EK YRRLLMLQN + T
Sbjct: 1325 WRMPCRGALDSGSKKALTWDNKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLP 1384
Query: 185 HTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILD 244
H GLNP+AFR N + I+DG L+ K+L LS ER E+ KKIG+ + ILD
Sbjct: 1385 HHAGLNPKAFRMMHSNRRSLQNAVKNILDGELLAKYLYLSTMERSELAKKIGTTQDIILD 1444
Query: 245 ELYDIEALSSHF 256
+L +I+ +++HF
Sbjct: 1445 DLLEIDRVTAHF 1456
Score = 47.4 bits (111), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 37/51 (72%), Gaps = 4/51 (7%)
Query: 3 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAF----RHPKGTLKLRFKKL 49
+V+E+ V+LG + +RP LLV ++ELL+Y+AF + P+ LK+RFKK+
Sbjct: 857 LVKEVALVALGNNRSRPYLLVHVENELLVYEAFPYDQQQPQNNLKVRFKKV 907
>gi|193702313|ref|XP_001945086.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Acyrthosiphon pisum]
Length = 1335
Score = 181 bits (460), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 81/194 (41%), Positives = 137/194 (70%), Gaps = 5/194 (2%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P V+ F+I+++ +GF+ SD+D+N++L++YQP ARES GG L+++ DF++G +VN+F
Sbjct: 1142 PLEVFDINFLIDNTELGFLASDRDQNLLLYLYQPMARESYGGQHLVRRGDFNIGSNVNSF 1201
Query: 128 FKIRCKPSPIS----DAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHT 183
F++RCK S ++ +A G+ R +T Y +LDG++G+ +P+ EKNYRRLL LQN++V +
Sbjct: 1202 FRLRCKQSTVAPDRREAIGSDKRHVTMYTTLDGSIGYIVPIHEKNYRRLLTLQNMLVKNI 1261
Query: 184 SHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQ-LSLGERLEICKKIGSKHNDI 242
+H GLNP+A+R++K N +R +IDG LVW F+ ++ +R EI K+G K ++
Sbjct: 1262 THLAGLNPKAYRSFKATAPERMNQARRVIDGELVWMFVTCMNARQRNEIANKVGVKTIEL 1321
Query: 243 LDELYDIEALSSHF 256
L ++Y+++ + HF
Sbjct: 1322 LQDIYELDRTTWHF 1335
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 32/82 (39%), Positives = 44/82 (53%), Gaps = 10/82 (12%)
Query: 3 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGTLKLRFKKLKVLFV-SDRSKRA 61
I++E+L V LG RP++ VR +E++IY RHP+GTLK+RF K+ L RS
Sbjct: 755 IIKEILIVPLGYQDKRPIMFVRLDNEVVIYGIHRHPEGTLKMRFHKMTSLLTFQSRSGNP 814
Query: 62 NEQPGLPR---------GVYAC 74
E L R GV+ C
Sbjct: 815 LEGTSLLRYFSKVAGHNGVFIC 836
>gi|49619065|gb|AAT68117.1| cleavage and polyadenylation specific factor 1 [Danio rerio]
Length = 1105
Score = 181 bits (459), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 85/192 (44%), Positives = 132/192 (68%), Gaps = 3/192 (1%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P VY+ EF+++++ +GF++SD+DKN++++MY PEA+ES GG RL+++ DF++G HVN F
Sbjct: 914 PLEVYSIEFMVDNNQLGFLVSDRDKNLMVYMYLPEAKESFGGMRLLRRADFNVGSHVNAF 973
Query: 128 FKIRCKPSPISDAPGA---RSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTS 184
+++ C+ + + A ++ +TW+A+LDG +G LP+ EK YRRLLMLQN + T
Sbjct: 974 WRMPCRGTLDTANKKALTWDNKHITWFATLDGGVGLLLPMQEKTYRRLLMLQNALTTMLP 1033
Query: 185 HTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILD 244
H GLNP+AFR N + I+DG L+ K+L LS ER E+ KKIG+ + ILD
Sbjct: 1034 HHAGLNPKAFRMLHCDRRTLQNAVKNILDGELLNKYLYLSTMERSELAKKIGTTPDIILD 1093
Query: 245 ELYDIEALSSHF 256
+L +IE +++HF
Sbjct: 1094 DLLEIERVTAHF 1105
Score = 44.3 bits (103), Expect = 0.045, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 33/51 (64%), Gaps = 4/51 (7%)
Query: 3 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAF----RHPKGTLKLRFKKL 49
+V+E+ VSLG +RP LL + ELLIY+AF + + LK+RFKK+
Sbjct: 507 LVKEVALVSLGYSHSRPYLLAHVEQELLIYEAFPYDQQQAQSNLKVRFKKM 557
>gi|410987992|ref|XP_004000273.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
[Felis catus]
Length = 1432
Score = 181 bits (459), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 83/194 (42%), Positives = 133/194 (68%), Gaps = 7/194 (3%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 1241 PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 1300
Query: 128 FKIRCKPSPISDAPGARS-----RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
++ C+ + ++ P +S + +TW+A+LDG +G LP+ EK YRRLLMLQN + T
Sbjct: 1301 WRTPCRGA--AEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTM 1358
Query: 183 TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
H GLNPRAFR N R ++DG L+ ++L LS ER E+ KKIG+ + I
Sbjct: 1359 LPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERGELAKKIGTTPDII 1418
Query: 243 LDELYDIEALSSHF 256
L++L + + +++HF
Sbjct: 1419 LEDLLETDRVTAHF 1432
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/51 (50%), Positives = 35/51 (68%), Gaps = 4/51 (7%)
Query: 3 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGTLKLRFKKL 49
+V+E+L V+LG +RP LLV ELLIY+AF H +G LK+RFKK+
Sbjct: 834 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKV 884
>gi|1045574|gb|AAC50293.1| cleavage and polyadenylation specificity factor [Homo sapiens]
Length = 1442
Score = 180 bits (457), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 82/191 (42%), Positives = 129/191 (67%), Gaps = 2/191 (1%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 1252 PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 1311
Query: 128 FKIRCKPSPISDAPGA--RSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSH 185
++ C+ + ++ +TW+A+LDG +G LP+ EK YRRLLMLQN + T H
Sbjct: 1312 WRTPCRATEGLSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPH 1371
Query: 186 TGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDE 245
GLNPRAFR N R ++DG L+ ++L LS ER E+ KKIG+ + ILD+
Sbjct: 1372 HAGLNPRAFRMLHVDRRTLQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDIILDD 1431
Query: 246 LYDIEALSSHF 256
L + + +++HF
Sbjct: 1432 LLETDRVTAHF 1442
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/51 (50%), Positives = 35/51 (68%), Gaps = 4/51 (7%)
Query: 3 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGTLKLRFKKL 49
+V+E+L V+LG +RP LLV ELLIY+AF H +G LK+RFKK+
Sbjct: 845 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKV 895
>gi|444523674|gb|ELV13604.1| Cleavage and polyadenylation specificity factor subunit 1 [Tupaia
chinensis]
Length = 1469
Score = 180 bits (457), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 84/194 (43%), Positives = 131/194 (67%), Gaps = 7/194 (3%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG L+++ DFHLG HVNTF
Sbjct: 1278 PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGLLLLRRADFHLGAHVNTF 1337
Query: 128 FKIRCKPSPISDAPGARS-----RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
++ C+ + + P +S + +TW+A+LDG +G LP+ EK YRRLLMLQN + T
Sbjct: 1338 WRTPCRGA--VEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTM 1395
Query: 183 TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
H GLNPRAFR N R ++DG L+ ++L LS ER E+ KKIG+ + I
Sbjct: 1396 LPHHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELLSRYLYLSTMERSELAKKIGTTPDII 1455
Query: 243 LDELYDIEALSSHF 256
LD+L + + +++HF
Sbjct: 1456 LDDLLETDRVTAHF 1469
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 25/51 (49%), Positives = 35/51 (68%), Gaps = 4/51 (7%)
Query: 3 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGTLKLRFKKL 49
+V+E+L V+LG +RP LLV ELL+Y+AF H +G LK+RFKK+
Sbjct: 871 LVKEVLLVALGSRQSRPYLLVHVDQELLLYEAFPHDSQLGQGNLKVRFKKV 921
>gi|392306997|ref|NP_001254722.1| cleavage and polyadenylation specificity factor subunit 1 [Macaca
mulatta]
gi|380812168|gb|AFE77959.1| cleavage and polyadenylation specificity factor subunit 1 [Macaca
mulatta]
gi|383417835|gb|AFH32131.1| cleavage and polyadenylation specificity factor subunit 1 [Macaca
mulatta]
Length = 1442
Score = 180 bits (456), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 82/192 (42%), Positives = 131/192 (68%), Gaps = 3/192 (1%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 1251 PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 1310
Query: 128 FKIRCKPSPISDAPGA---RSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTS 184
++ C+ + + + ++ +TW+A+LDG +G LP+ EK YRRLLMLQN + T
Sbjct: 1311 WRTPCRGATEGLSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLP 1370
Query: 185 HTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILD 244
H GLNPRAFR N R ++DG L+ ++L LS ER E+ KKIG+ + ILD
Sbjct: 1371 HHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDIILD 1430
Query: 245 ELYDIEALSSHF 256
+L + + +++HF
Sbjct: 1431 DLLETDRVTAHF 1442
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/51 (50%), Positives = 35/51 (68%), Gaps = 4/51 (7%)
Query: 3 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGTLKLRFKKL 49
+V+E+L V+LG +RP LLV ELLIY+AF H +G LK+RFKK+
Sbjct: 844 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKV 894
>gi|397497327|ref|XP_003819464.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
[Pan paniscus]
gi|410336497|gb|JAA37195.1| cleavage and polyadenylation specific factor 1, 160kDa [Pan
troglodytes]
Length = 1442
Score = 180 bits (456), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 82/192 (42%), Positives = 131/192 (68%), Gaps = 3/192 (1%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 1251 PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 1310
Query: 128 FKIRCKPSPISDAPGA---RSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTS 184
++ C+ + + + ++ +TW+A+LDG +G LP+ EK YRRLLMLQN + T
Sbjct: 1311 WRTPCRGATEGLSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLP 1370
Query: 185 HTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILD 244
H GLNPRAFR N R ++DG L+ ++L LS ER E+ KKIG+ + ILD
Sbjct: 1371 HHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDIILD 1430
Query: 245 ELYDIEALSSHF 256
+L + + +++HF
Sbjct: 1431 DLLETDRVTAHF 1442
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/51 (50%), Positives = 35/51 (68%), Gaps = 4/51 (7%)
Query: 3 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGTLKLRFKKL 49
+V+E+L V+LG +RP LLV ELLIY+AF H +G LK+RFKK+
Sbjct: 844 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKV 894
>gi|56676371|ref|NP_037423.2| cleavage and polyadenylation specificity factor subunit 1 [Homo
sapiens]
gi|23503048|sp|Q10570.2|CPSF1_HUMAN RecName: Full=Cleavage and polyadenylation specificity factor subunit
1; AltName: Full=Cleavage and polyadenylation specificity
factor 160 kDa subunit; Short=CPSF 160 kDa subunit
gi|16878041|gb|AAH17232.1| Cleavage and polyadenylation specific factor 1, 160kDa [Homo sapiens]
gi|119602516|gb|EAW82110.1| cleavage and polyadenylation specific factor 1, 160kDa, isoform CRA_c
[Homo sapiens]
gi|123993607|gb|ABM84405.1| cleavage and polyadenylation specific factor 1, 160kDa [synthetic
construct]
gi|123999626|gb|ABM87355.1| cleavage and polyadenylation specific factor 1, 160kDa [synthetic
construct]
gi|307684758|dbj|BAJ20419.1| cleavage and polyadenylation specific factor 1, 160kDa [synthetic
construct]
Length = 1443
Score = 180 bits (456), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 82/192 (42%), Positives = 131/192 (68%), Gaps = 3/192 (1%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 1252 PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 1311
Query: 128 FKIRCKPSPISDAPGA---RSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTS 184
++ C+ + + + ++ +TW+A+LDG +G LP+ EK YRRLLMLQN + T
Sbjct: 1312 WRTPCRGATEGLSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLP 1371
Query: 185 HTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILD 244
H GLNPRAFR N R ++DG L+ ++L LS ER E+ KKIG+ + ILD
Sbjct: 1372 HHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDIILD 1431
Query: 245 ELYDIEALSSHF 256
+L + + +++HF
Sbjct: 1432 DLLETDRVTAHF 1443
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/51 (50%), Positives = 35/51 (68%), Gaps = 4/51 (7%)
Query: 3 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGTLKLRFKKL 49
+V+E+L V+LG +RP LLV ELLIY+AF H +G LK+RFKK+
Sbjct: 845 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKV 895
>gi|119602512|gb|EAW82106.1| cleavage and polyadenylation specific factor 1, 160kDa, isoform CRA_a
[Homo sapiens]
gi|119602513|gb|EAW82107.1| cleavage and polyadenylation specific factor 1, 160kDa, isoform CRA_a
[Homo sapiens]
gi|119602514|gb|EAW82108.1| cleavage and polyadenylation specific factor 1, 160kDa, isoform CRA_a
[Homo sapiens]
Length = 1365
Score = 180 bits (456), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 82/192 (42%), Positives = 131/192 (68%), Gaps = 3/192 (1%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 1174 PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 1233
Query: 128 FKIRCKPSPISDAPGA---RSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTS 184
++ C+ + + + ++ +TW+A+LDG +G LP+ EK YRRLLMLQN + T
Sbjct: 1234 WRTPCRGATEGLSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLP 1293
Query: 185 HTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILD 244
H GLNPRAFR N R ++DG L+ ++L LS ER E+ KKIG+ + ILD
Sbjct: 1294 HHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDIILD 1353
Query: 245 ELYDIEALSSHF 256
+L + + +++HF
Sbjct: 1354 DLLETDRVTAHF 1365
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/51 (50%), Positives = 35/51 (68%), Gaps = 4/51 (7%)
Query: 3 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGTLKLRFKKL 49
+V+E+L V+LG +RP LLV ELLIY+AF H +G LK+RFKK+
Sbjct: 767 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKV 817
>gi|441648592|ref|XP_004093268.1| PREDICTED: LOW QUALITY PROTEIN: cleavage and polyadenylation
specificity factor subunit 1 [Nomascus leucogenys]
Length = 1177
Score = 180 bits (456), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 82/192 (42%), Positives = 131/192 (68%), Gaps = 3/192 (1%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 986 PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 1045
Query: 128 FKIRCKPSPISDAPGA---RSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTS 184
++ C+ + + + ++ +TW+A+LDG +G LP+ EK YRRLLMLQN + T
Sbjct: 1046 WRTPCRGATEGLSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLP 1105
Query: 185 HTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILD 244
H GLNPRAFR N R ++DG L+ ++L LS ER E+ KKIG+ + ILD
Sbjct: 1106 HHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDIILD 1165
Query: 245 ELYDIEALSSHF 256
+L + + +++HF
Sbjct: 1166 DLLETDRVTAHF 1177
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/51 (50%), Positives = 35/51 (68%), Gaps = 4/51 (7%)
Query: 3 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGTLKLRFKKL 49
+V+E+L V+LG +RP LLV ELLIY+AF H +G LK+RFKK+
Sbjct: 763 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKV 813
>gi|296227035|ref|XP_002807684.1| PREDICTED: LOW QUALITY PROTEIN: cleavage and polyadenylation
specificity factor subunit 1 [Callithrix jacchus]
Length = 1394
Score = 180 bits (456), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 82/192 (42%), Positives = 131/192 (68%), Gaps = 3/192 (1%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 1203 PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 1262
Query: 128 FKIRCKPSPISDAPGA---RSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTS 184
++ C+ + + + ++ +TW+A+LDG +G LP+ EK YRRLLMLQN + T
Sbjct: 1263 WRTPCRGATEGLSKKSVMWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLP 1322
Query: 185 HTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILD 244
H GLNPRAFR N R ++DG L+ ++L LS ER E+ KKIG+ + ILD
Sbjct: 1323 HHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDIILD 1382
Query: 245 ELYDIEALSSHF 256
+L + + +++HF
Sbjct: 1383 DLLETDRVTAHF 1394
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/51 (50%), Positives = 35/51 (68%), Gaps = 4/51 (7%)
Query: 3 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRH----PKGTLKLRFKKL 49
+V+E+L V+LG +RP LLV ELLIY+AF H +G LK+RFKK+
Sbjct: 796 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLSQGNLKVRFKKV 846
>gi|410042329|ref|XP_003954555.1| PREDICTED: LOW QUALITY PROTEIN: cleavage and polyadenylation
specificity factor subunit 1 [Pan troglodytes]
Length = 1296
Score = 180 bits (456), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 82/192 (42%), Positives = 131/192 (68%), Gaps = 3/192 (1%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 1105 PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 1164
Query: 128 FKIRCKPSPISDAPGA---RSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTS 184
++ C+ + + + ++ +TW+A+LDG +G LP+ EK YRRLLMLQN + T
Sbjct: 1165 WRTPCRGATEGLSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLP 1224
Query: 185 HTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILD 244
H GLNPRAFR N R ++DG L+ ++L LS ER E+ KKIG+ + ILD
Sbjct: 1225 HHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDIILD 1284
Query: 245 ELYDIEALSSHF 256
+L + + +++HF
Sbjct: 1285 DLLETDRVTAHF 1296
>gi|355698297|gb|EHH28845.1| Cleavage and polyadenylation specificity factor 160 kDa subunit
[Macaca mulatta]
Length = 1436
Score = 180 bits (456), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 82/192 (42%), Positives = 131/192 (68%), Gaps = 3/192 (1%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 1245 PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 1304
Query: 128 FKIRCKPSPISDAPGA---RSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTS 184
++ C+ + + + ++ +TW+A+LDG +G LP+ EK YRRLLMLQN + T
Sbjct: 1305 WRTPCRGATEGLSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLP 1364
Query: 185 HTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILD 244
H GLNPRAFR N R ++DG L+ ++L LS ER E+ KKIG+ + ILD
Sbjct: 1365 HHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDIILD 1424
Query: 245 ELYDIEALSSHF 256
+L + + +++HF
Sbjct: 1425 DLLETDRVTAHF 1436
>gi|426361048|ref|XP_004047737.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
[Gorilla gorilla gorilla]
Length = 1440
Score = 180 bits (456), Expect = 7e-43, Method: Compositional matrix adjust.
Identities = 82/192 (42%), Positives = 131/192 (68%), Gaps = 3/192 (1%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 1249 PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 1308
Query: 128 FKIRCKPSPISDAPGA---RSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTS 184
++ C+ + + + ++ +TW+A+LDG +G LP+ EK YRRLLMLQN + T
Sbjct: 1309 WRTPCRGATEGLSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLP 1368
Query: 185 HTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILD 244
H GLNPRAFR N R ++DG L+ ++L LS ER E+ KKIG+ + ILD
Sbjct: 1369 HHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDIILD 1428
Query: 245 ELYDIEALSSHF 256
+L + + +++HF
Sbjct: 1429 DLLETDRVTAHF 1440
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/51 (50%), Positives = 35/51 (68%), Gaps = 4/51 (7%)
Query: 3 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGTLKLRFKKL 49
+V+E+L V+LG +RP LLV ELLIY+AF H +G LK+RFKK+
Sbjct: 842 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKV 892
>gi|402879380|ref|XP_003903320.1| PREDICTED: LOW QUALITY PROTEIN: cleavage and polyadenylation
specificity factor subunit 1 [Papio anubis]
Length = 1389
Score = 179 bits (455), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 82/192 (42%), Positives = 131/192 (68%), Gaps = 3/192 (1%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 1198 PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 1257
Query: 128 FKIRCKPSPISDAPGA---RSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTS 184
++ C+ + + + ++ +TW+A+LDG +G LP+ EK YRRLLMLQN + T
Sbjct: 1258 WRTPCRGATEGLSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLP 1317
Query: 185 HTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILD 244
H GLNPRAFR N R ++DG L+ ++L LS ER E+ KKIG+ + ILD
Sbjct: 1318 HHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDIILD 1377
Query: 245 ELYDIEALSSHF 256
+L + + +++HF
Sbjct: 1378 DLLETDRVTAHF 1389
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/51 (50%), Positives = 35/51 (68%), Gaps = 4/51 (7%)
Query: 3 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGTLKLRFKKL 49
+V+E+L V+LG +RP LLV ELLIY+AF H +G LK+RFKK+
Sbjct: 791 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKV 841
>gi|403302917|ref|XP_003942095.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
[Saimiri boliviensis boliviensis]
Length = 1390
Score = 179 bits (454), Expect = 9e-43, Method: Compositional matrix adjust.
Identities = 82/192 (42%), Positives = 131/192 (68%), Gaps = 3/192 (1%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 1199 PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 1258
Query: 128 FKIRCKPSPISDAPGA---RSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTS 184
++ C+ + + + ++ +TW+A+LDG +G LP+ EK YRRLLMLQN + T
Sbjct: 1259 WRTPCRGATEGLSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLP 1318
Query: 185 HTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILD 244
H GLNPRAFR N R ++DG L+ ++L LS ER E+ KKIG+ + ILD
Sbjct: 1319 HHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDIILD 1378
Query: 245 ELYDIEALSSHF 256
+L + + +++HF
Sbjct: 1379 DLLETDRVTAHF 1390
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/51 (50%), Positives = 35/51 (68%), Gaps = 4/51 (7%)
Query: 3 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRH----PKGTLKLRFKKL 49
+V+E+L V+LG +RP LLV ELLIY+AF H +G LK+RFKK+
Sbjct: 792 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLSQGNLKVRFKKV 842
>gi|390347522|ref|XP_003726804.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Strongylocentrotus purpuratus]
Length = 1439
Score = 174 bits (441), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 87/206 (42%), Positives = 128/206 (62%), Gaps = 6/206 (2%)
Query: 56 DRSKRANEQPGLPRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKK 115
DR+ + P ++ CEF+++ M F++SD DKN+++F Y PEA ES+GG L+++
Sbjct: 1235 DRTLSLVSRDARPLEIFGCEFMVDDKQMAFLVSDADKNLIVFHYHPEAPESHGGAYLLRR 1294
Query: 116 TDFHLGQHVNTFFKIRCK---PSP--ISDAPGARSRFLTWYASLDGALGFFLPLPEKNYR 170
D ++G VNTF ++RC+ PS + P R R + ++A+LDG+LG LP+ EK YR
Sbjct: 1295 GDMNIGSAVNTFVRVRCRLTDPSTEQVLSGPVLR-RQVVFFATLDGSLGLLLPMVEKTYR 1353
Query: 171 RLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLE 230
RLLMLQNV+ H GGLNP+++R K NP R I+DG L+ K+ LS+ ER E
Sbjct: 1354 RLLMLQNVLTNGLPHVGGLNPKSYRHVKSHMRNLNNPHRNILDGDLLLKYCHLSVVERNE 1413
Query: 231 ICKKIGSKHNDILDELYDIEALSSHF 256
KKIG+ + I+ +L E L+ HF
Sbjct: 1414 FAKKIGTSVDQIISDLMLAENLTMHF 1439
>gi|449661926|ref|XP_002167992.2| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Hydra magnipapillata]
Length = 1122
Score = 174 bits (440), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 85/185 (45%), Positives = 115/185 (62%)
Query: 72 YACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIR 131
Y EF I+ S +GF++SD ++N+V+F YQPEA ES GGHRL++K D ++G HVNT +I+
Sbjct: 938 YTSEFFIDGSQVGFVVSDAERNIVIFSYQPEALESFGGHRLLQKADINIGSHVNTMMRIK 997
Query: 132 CKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNP 191
S + + R L +LDG++G PL EK +RRL MLQN +V H GLNP
Sbjct: 998 LIQDEQSLSKSSEQRQLIILPTLDGSIGILFPLSEKPFRRLTMLQNKLVDCLPHKAGLNP 1057
Query: 192 RAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEA 251
RAFR NP R I+DG L+ K+ QLS ER +I KK+G+ ILD++ DIE
Sbjct: 1058 RAFRALDVPLRTLTNPHRNILDGQLLDKYAQLSFQERFDIAKKMGTTSGQILDDMMDIER 1117
Query: 252 LSSHF 256
S+H
Sbjct: 1118 ASNHL 1122
>gi|390358535|ref|XP_789715.3| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Strongylocentrotus purpuratus]
Length = 1223
Score = 174 bits (440), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 87/206 (42%), Positives = 128/206 (62%), Gaps = 6/206 (2%)
Query: 56 DRSKRANEQPGLPRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKK 115
DR+ + P ++ CEF+++ M F++SD DKN+++F Y PEA ES+GG L+++
Sbjct: 1019 DRTLSLVSRDARPLEIFGCEFMVDDKQMAFLVSDADKNLIVFHYHPEAPESHGGAYLLRR 1078
Query: 116 TDFHLGQHVNTFFKIRCK---PSP--ISDAPGARSRFLTWYASLDGALGFFLPLPEKNYR 170
D ++G VNTF ++RC+ PS + P R R + ++A+LDG+LG LP+ EK YR
Sbjct: 1079 GDMNIGSAVNTFVRVRCRLTDPSTEQVLSGPVLR-RQVVFFATLDGSLGLLLPMVEKTYR 1137
Query: 171 RLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLE 230
RLLMLQNV+ H GGLNP+++R K NP R I+DG L+ K+ LS+ ER E
Sbjct: 1138 RLLMLQNVLTNGLPHVGGLNPKSYRHVKSHMRNLNNPHRNILDGDLLLKYCHLSVVERNE 1197
Query: 231 ICKKIGSKHNDILDELYDIEALSSHF 256
KKIG+ + I+ +L E L+ HF
Sbjct: 1198 FAKKIGTSVDQIISDLMLAENLTMHF 1223
>gi|395740218|ref|XP_002819588.2| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
[Pongo abelii]
Length = 1388
Score = 173 bits (438), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 80/192 (41%), Positives = 128/192 (66%), Gaps = 3/192 (1%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 1197 PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 1256
Query: 128 FKIRCKPSPISDAPGA---RSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTS 184
++ C+ + + + ++ +TW S+ G +G LP+ EK YRRLLMLQN + T
Sbjct: 1257 WRTPCRGAAEGLSKKSVVWENKHITWLVSVRGGIGLLLPMQEKTYRRLLMLQNALTTMLP 1316
Query: 185 HTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILD 244
H GLNPRAFR N R ++DG L+ ++L LS ER E+ KKIG+ + ILD
Sbjct: 1317 HHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDIILD 1376
Query: 245 ELYDIEALSSHF 256
+L + + +++HF
Sbjct: 1377 DLLETDRVTAHF 1388
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/51 (50%), Positives = 35/51 (68%), Gaps = 4/51 (7%)
Query: 3 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGTLKLRFKKL 49
+V+E+L V+LG +RP LLV ELLIY+AF H +G LK+RFKK+
Sbjct: 792 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKV 842
>gi|357611296|gb|EHJ67409.1| putative cleavage and polyadenylation specific factor 1 [Danaus
plexippus]
Length = 328
Score = 172 bits (435), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 79/187 (42%), Positives = 126/187 (67%), Gaps = 9/187 (4%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
+Y +F+I+++S+GF++S+ + N ++M+QP+ARES GG RLI+K D+HLGQ V+ F++
Sbjct: 150 IYDMQFMIDNTSLGFLVSESEGNFAMYMHQPQARESYGGQRLIRKCDYHLGQRVHAMFRL 209
Query: 131 RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 190
A G R +T + +LDG +G+ LP+ EK YRRLLMLQNV+ + H GLN
Sbjct: 210 A--------ARGERQTHVTMFTTLDGGVGYVLPVSEKVYRRLLMLQNVINNYCCHLAGLN 261
Query: 191 PRAFRTYK-GKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI 249
P+A+RTYK + G +RG++DG LV + + E+ +I +KIG+K +I+ +LY+I
Sbjct: 262 PKAYRTYKVSRRALCGGAARGVLDGDLVSLYTSMPRTEQQDIARKIGTKVEEIMSDLYEI 321
Query: 250 EALSSHF 256
+ ++HF
Sbjct: 322 DRQTAHF 328
>gi|33411764|emb|CAD58787.1| cleavage and polyadenylation specificity factor 1 [Bos taurus]
Length = 180
Score = 171 bits (433), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 78/175 (44%), Positives = 120/175 (68%), Gaps = 7/175 (4%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 4 PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 63
Query: 128 FKIRCKPSPISDAPGARS-----RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
++ C+ + ++ P +S + +TW+A+LDG +G LP+ EK YRRLLMLQN + T
Sbjct: 64 WRTPCRGA--AEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTM 121
Query: 183 TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGS 237
H GLNPRAFR N R ++DG L+ ++L LS ER E+ KKIG+
Sbjct: 122 LPHHAGLNPRAFRMLHVDRRVLQNAVRNVLDGELLNRYLYLSTMERGELAKKIGT 176
>gi|354491126|ref|XP_003507707.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
isoform 3 [Cricetulus griseus]
Length = 1449
Score = 171 bits (432), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 80/193 (41%), Positives = 125/193 (64%), Gaps = 7/193 (3%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 1250 PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 1309
Query: 128 FKIRCKPSPISDAPGARS-----RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
++ C+ + ++ P +S + +TW+A+LDG +G LP+ EK YRRLLMLQN + T
Sbjct: 1310 WRTPCRGA--AEGPSKKSVMWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTM 1367
Query: 183 TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
H GLNPRAFR N R ++DG L+ ++L LS ER E+ KKIG+ +
Sbjct: 1368 LPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDIT 1427
Query: 243 LDELYDIEALSSH 255
+ L + H
Sbjct: 1428 VSLLTSNSVIPCH 1440
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/51 (50%), Positives = 35/51 (68%), Gaps = 4/51 (7%)
Query: 3 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGTLKLRFKKL 49
+V+E+L V+LG +RP LLV ELLIY+AF H +G LK+RFKK+
Sbjct: 843 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKV 893
>gi|410911304|ref|XP_003969130.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Takifugu rubripes]
Length = 1444
Score = 170 bits (431), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 86/192 (44%), Positives = 135/192 (70%), Gaps = 3/192 (1%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P VY+ EF+++++ +GF++SD+DKN+ ++MY PEA+ES GG RL+++ DF+ G ++NTF
Sbjct: 1253 PLEVYSIEFMVDNNQLGFLVSDRDKNLYVYMYLPEAKESFGGMRLLRRADFNAGANINTF 1312
Query: 128 FKIRCKPSPISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTS 184
+++ C+ + + + A + + +TW+A+LDG +G LP+ EK YRRLLMLQN + T S
Sbjct: 1313 WRMPCRGALEAGSRKAMTWDNKHITWFATLDGGVGLLLPMQEKTYRRLLMLQNALTTMLS 1372
Query: 185 HTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILD 244
H GLNP+AFR NP + I+DG L+ K+L LS+ ER E+ KKIG+ + ILD
Sbjct: 1373 HHAGLNPKAFRMLHCDRRSLQNPVKNILDGELLNKYLYLSMMERSELAKKIGTTQDIILD 1432
Query: 245 ELYDIEALSSHF 256
+L DI+ +++HF
Sbjct: 1433 DLLDIDRVTAHF 1444
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 25/51 (49%), Positives = 35/51 (68%), Gaps = 4/51 (7%)
Query: 3 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAF----RHPKGTLKLRFKKL 49
+V+E+ VSLG + +RP LLV ELLIY+AF + P+ LK+RFKK+
Sbjct: 845 LVKEVTLVSLGYNHSRPYLLVHVDQELLIYEAFPYDQQQPQNNLKVRFKKV 895
>gi|255918233|ref|NP_001157645.1| cleavage and polyadenylation specificity factor subunit 1 isoform 1
[Mus musculus]
Length = 1450
Score = 170 bits (430), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 78/175 (44%), Positives = 120/175 (68%), Gaps = 7/175 (4%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 1250 PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 1309
Query: 128 FKIRCKPSPISDAPGARS-----RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
++ C+ + ++ P +S + +TW+A+LDG +G LP+ EK YRRLLMLQN + T
Sbjct: 1310 WRTPCRGA--AEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTM 1367
Query: 183 TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGS 237
H GLNPRAFR N R ++DG L+ ++L LS ER E+ KKIG+
Sbjct: 1368 LPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERSELAKKIGT 1422
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/51 (50%), Positives = 35/51 (68%), Gaps = 4/51 (7%)
Query: 3 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGTLKLRFKKL 49
+V+E+L V+LG +RP LLV ELLIY+AF H +G LK+RFKK+
Sbjct: 843 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKV 893
>gi|443684051|gb|ELT88095.1| hypothetical protein CAPTEDRAFT_161045 [Capitella teleta]
Length = 1410
Score = 169 bits (427), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 81/217 (37%), Positives = 136/217 (62%), Gaps = 11/217 (5%)
Query: 44 LRFKK-LKVLFVSDRSKRANEQPGLPRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPE 102
LR+++ +KVL + + R P VY ++++ +S+ F+++DK KN +++ YQP+
Sbjct: 1201 LRYQEDMKVLSLVSKDVR-------PLAVYGVAYLVDETSLAFIVADKLKNFLVYCYQPD 1253
Query: 103 ARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTWYASLDGALG 159
+S GG RLI+K D ++G VN FF+++C+ S S + +S + +T+Y +LDG++G
Sbjct: 1254 LVQSQGGQRLIRKADINIGSLVNAFFRVKCRVSDPSTSKTDQSLAMKHITYYVTLDGSIG 1313
Query: 160 FFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWK 219
+ LP+ E YRRL MLQ +++ T GLNP+A+RT + + N R IIDG L WK
Sbjct: 1314 YLLPISESLYRRLYMLQKMLIQQVQQTAGLNPKAYRTCQTEFRQLINIQRNIIDGDLAWK 1373
Query: 220 FLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 256
+L L+ +R E+ K+IG+ + I D+L +I+ + HF
Sbjct: 1374 YLALTSHDRAEMAKRIGTTSHQIEDDLLEIDRCTCHF 1410
Score = 42.0 bits (97), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 20/49 (40%), Positives = 31/49 (63%), Gaps = 4/49 (8%)
Query: 4 VQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRH----PKGTLKLRFKK 48
V E++ G++G++PLL+ R EL IY+ F H KG L++RFK+
Sbjct: 826 VHEVMLHGFGVNGSQPLLMARVHDELYIYKVFSHVGSKAKGRLQVRFKR 874
>gi|440904368|gb|ELR54893.1| Cleavage and polyadenylation specificity factor subunit 1, partial
[Bos grunniens mutus]
Length = 1417
Score = 169 bits (427), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 78/175 (44%), Positives = 120/175 (68%), Gaps = 7/175 (4%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 1243 PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 1302
Query: 128 FKIRCKPSPISDAPGARS-----RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
++ C+ + ++ P +S + +TW+A+LDG +G LP+ EK YRRLLMLQN + T
Sbjct: 1303 WRTPCRGA--AEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTM 1360
Query: 183 TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGS 237
H GLNPRAFR N R ++DG L+ ++L LS ER E+ KKIG+
Sbjct: 1361 LPHHAGLNPRAFRMLHVDRRVLQNAVRNVLDGELLNRYLYLSPMERGELAKKIGT 1415
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 26/51 (50%), Positives = 34/51 (66%), Gaps = 4/51 (7%)
Query: 3 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGTLKLRFKKL 49
+V+E+L V+LG RP LLV ELLIY+AF H +G LK+RFKK+
Sbjct: 836 LVKEVLLVALGSRQRRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKV 886
>gi|156364999|ref|XP_001626630.1| predicted protein [Nematostella vectensis]
gi|156213514|gb|EDO34530.1| predicted protein [Nematostella vectensis]
Length = 1420
Score = 168 bits (426), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 83/189 (43%), Positives = 120/189 (63%), Gaps = 5/189 (2%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
VYA +F I+ +GF++SD +KN+VLF YQPEA ES GG RL+++ D ++G H+ +FF+I
Sbjct: 1234 VYAADFFIDGPQIGFLVSDVEKNLVLFTYQPEAIESQGGQRLLQRADINVGTHITSFFRI 1293
Query: 131 RCKPSPISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 187
K A G +S R LT + +LDGALG LP+ EK +RRL MLQ +V H
Sbjct: 1294 AAKAH--LKASGEKSKEMRQLTCFGTLDGALGLMLPMTEKTFRRLHMLQTKLVDCIPHVA 1351
Query: 188 GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELY 247
GLNP+AFR + + NP R ++D L++K++ LS ER E+ +KIG+ I+D++
Sbjct: 1352 GLNPKAFRMLQWRKRKLCNPHRNVLDWQLLFKYMHLSFMERQEVARKIGTTPAQIMDDMM 1411
Query: 248 DIEALSSHF 256
DIE + F
Sbjct: 1412 DIERACAQF 1420
Score = 42.4 bits (98), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 25/64 (39%), Positives = 37/64 (57%), Gaps = 3/64 (4%)
Query: 4 VQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP--KGTLKLRFKKLKVLFVSDRSKRA 61
V+E+L LG R L+ +LLIY+AF +P +G L LRFKKL+ + R K+
Sbjct: 831 VREVLLTGLGYKNRRATLVAVMDQDLLIYEAFSYPTVEGHLNLRFKKLQ-HNIQIREKKP 889
Query: 62 NEQP 65
++P
Sbjct: 890 KQEP 893
>gi|195583398|ref|XP_002081509.1| GD25678 [Drosophila simulans]
gi|194193518|gb|EDX07094.1| GD25678 [Drosophila simulans]
Length = 1450
Score = 166 bits (420), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 76/159 (47%), Positives = 109/159 (68%), Gaps = 10/159 (6%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P VY EF++++S++GF+++D ++N++++MYQPEARES GG +L++K D+HLGQ VNT
Sbjct: 1265 PLEVYGIEFMVDNSNLGFLVTDAERNLIVYMYQPEARESLGGQKLLRKADYHLGQVVNTM 1324
Query: 128 FKIRCKPSPISDAPGARSRFL------TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVT 181
F+++C + R FL Y +LDGALG+ LPLPEK YRR LMLQNV+++
Sbjct: 1325 FRVQCHQKGLHQ----RQPFLYENKHFVVYGTLDGALGYCLPLPEKVYRRFLMLQNVLLS 1380
Query: 182 HTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKF 220
+ H GLNP+ +RT K NPSR IIDG L+W +
Sbjct: 1381 YQEHLCGLNPKEYRTLKSSKKQGINPSRCIIDGDLIWSY 1419
Score = 66.2 bits (160), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 34/60 (56%), Positives = 42/60 (70%), Gaps = 6/60 (10%)
Query: 6 ELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGTLKLRFKKLKVLFVSDRSKRANEQP 65
EL + LGL+G RPLLLVRT+ ELLIYQ FR+PKG LK+RF+KL D ++QP
Sbjct: 864 ELSVIGLGLNGERPLLLVRTRVELLIYQVFRYPKGHLKIRFRKL------DXXNLLDQQP 917
>gi|340371789|ref|XP_003384427.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Amphimedon queenslandica]
Length = 1408
Score = 165 bits (417), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 80/195 (41%), Positives = 120/195 (61%), Gaps = 6/195 (3%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P V++ F+I+ ++GF++SD D+N+ LF YQPE S+GG L++ D H+G VN F
Sbjct: 1214 PMEVFSTTFMIDGKALGFLVSDSDRNITLFQYQPENPASSGGANLVRCGDIHVGSLVNVF 1273
Query: 128 FKIRCKPSPISDAPG------ARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVT 181
IRCK S A A R T++ +LDG +G LP+PEK YRRL MLQ M
Sbjct: 1274 LNIRCKTSAGLGASREMKIALADKRQCTFFGTLDGGIGCLLPIPEKVYRRLSMLQVKMTQ 1333
Query: 182 HTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHND 241
H GLNP+AFRT++ + Y N R I+DG+L++++L L+ E+ + K+IG+
Sbjct: 1334 GMRHMAGLNPKAFRTFQTRHQYLHNAQRNILDGTLLYQYLSLTAKEKFDFSKQIGTTVAQ 1393
Query: 242 ILDELYDIEALSSHF 256
I+++L +I+ + SHF
Sbjct: 1394 IMEDLKEIDKVMSHF 1408
Score = 42.4 bits (98), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 22/52 (42%), Positives = 35/52 (67%), Gaps = 6/52 (11%)
Query: 4 VQELLTVSLGLHGNRPLLLVRTQHELLIYQAFR-----HPKGTLKLRFKKLK 50
V+++L V +GL+G +P ++ EL+IY+AF+ HP G LKLRF K++
Sbjct: 813 VEQVLCVGMGLNGKKPHIMAFINKELVIYEAFQYTSAIHP-GHLKLRFSKVQ 863
>gi|47217773|emb|CAG05995.1| unnamed protein product [Tetraodon nigroviridis]
Length = 1446
Score = 162 bits (410), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 88/217 (40%), Positives = 136/217 (62%), Gaps = 28/217 (12%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P VY+ EF++++S +GF++SD+DKN+ ++MY PEA+ES GG RL+++ DF+ G ++NTF
Sbjct: 1230 PLEVYSIEFMVDNSQLGFLVSDRDKNLYVYMYLPEAKESFGGMRLLRRADFNAGANINTF 1289
Query: 128 FKIRCKPSPISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTS 184
+++ C+ + + + A + + +TW+A+LDG +G LP+ EK YRRLLMLQN + T S
Sbjct: 1290 WRMPCRGALEAGSRKAMTWDNKHITWFATLDGGVGLLLPMQEKTYRRLLMLQNALTTMLS 1349
Query: 185 HTGGLNPRAFRTYKGKGYYAG-------------------------NPSRGIIDGSLVWK 219
H GLNP+AFR A NP + I+DG L+ K
Sbjct: 1350 HHAGLNPKAFRCVGADRTSAAMLSGMLPDFATSVSRMLHCDRRSLQNPVKNILDGELLNK 1409
Query: 220 FLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 256
+L LS+ ER E+ KKIG+ + ILD+L DI+ +++HF
Sbjct: 1410 YLYLSMMERSELAKKIGTTQDIILDDLLDIDRVTAHF 1446
Score = 49.3 bits (116), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 24/51 (47%), Positives = 36/51 (70%), Gaps = 4/51 (7%)
Query: 3 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAF----RHPKGTLKLRFKKL 49
+V+E+ VSLG + +RP LLV + ELL+Y+AF + P+ LK+RFKK+
Sbjct: 815 LVKEVTLVSLGYNHSRPYLLVHVEQELLVYEAFPYDQQQPQNNLKVRFKKV 865
>gi|291232722|ref|XP_002736302.1| PREDICTED: cleavage and polyadenylation specific factor 1-like
[Saccoglossus kowalevskii]
Length = 984
Score = 154 bits (388), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 79/174 (45%), Positives = 114/174 (65%), Gaps = 11/174 (6%)
Query: 88 SDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPS-PISD----APG 142
SD+D+++ L RES GG RL+++ DF+ G HV +FF++R K S P ++ P
Sbjct: 817 SDEDRSLSLV-----TRESFGGQRLLRRADFNAGSHVCSFFRMRSKLSDPATEKLLTGPM 871
Query: 143 ARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGY 202
R R +T +A+LDG++G+ +P+ EK YRRLLMLQN + T T HT GLNP+ FR K +
Sbjct: 872 ER-RHVTMFATLDGSIGYLIPMTEKTYRRLLMLQNALTTQTLHTAGLNPKGFRMVKHQTK 930
Query: 203 YAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 256
N + I+DG L+WK+ LS+ ER E+ KKIG+ ILD+L D+E L++HF
Sbjct: 931 SLENTHKNILDGDLLWKYTFLSVNERTELAKKIGTSVEQILDDLMDVERLTAHF 984
Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 22/53 (41%), Positives = 32/53 (60%), Gaps = 4/53 (7%)
Query: 3 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGT----LKLRFKKLKV 51
IV+ELL + LG + LL R +L IY+AF H + + L+LRF+K+ V
Sbjct: 473 IVKELLLIGLGHKNKKTHLLARVDEDLYIYEAFTHDQSSLDNHLRLRFRKVFV 525
>gi|301773406|ref|XP_002922132.1| PREDICTED: LOW QUALITY PROTEIN: cleavage and polyadenylation
specificity factor subunit 1-like [Ailuropoda
melanoleuca]
Length = 1469
Score = 137 bits (345), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 70/194 (36%), Positives = 121/194 (62%), Gaps = 24/194 (12%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 1295 PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 1354
Query: 128 FKIRCKPSPISDAPGARS-----RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
++ C+ + ++ P +S + +TW+A+LDG +G LP+ EK R LQ
Sbjct: 1355 WRTPCRGA--AEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTNR----LQPAXSPR 1408
Query: 183 TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
H ++ R + N R ++DG L+ ++L LS ER E+ KKIG+ + I
Sbjct: 1409 MLH---VDRRILQ----------NAVRNVLDGELLNRYLYLSTMERGELAKKIGTTPDII 1455
Query: 243 LDELYDIEALSSHF 256
LD+L + + +++HF
Sbjct: 1456 LDDLLETDRVTAHF 1469
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/51 (50%), Positives = 35/51 (68%), Gaps = 4/51 (7%)
Query: 3 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGTLKLRFKKL 49
+V+E+L V+LG +RP LLV ELLIY+AF H +G LK+RFKK+
Sbjct: 881 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKV 931
>gi|168021793|ref|XP_001763425.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162685218|gb|EDQ71614.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1452
Score = 134 bits (338), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 73/187 (39%), Positives = 111/187 (59%), Gaps = 6/187 (3%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
YA EF+I+ S++ ++SD KN+ +F Y P++ ES G +L+ + +FHLG HVN F ++
Sbjct: 1270 CYATEFLIDGSTLSLLVSDSRKNLQIFSYAPKSMESWKGQKLLSRAEFHLGAHVNKFHRL 1329
Query: 131 RCKPSPISDAPGARS-RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGL 189
+ P+P S ARS R+ + +LDGA+ + PL E +RRL LQ +V SH G+
Sbjct: 1330 QMLPTPGS----ARSNRYAVLFGTLDGAIDYLAPLDELTFRRLHTLQRKLVDCVSHVAGV 1385
Query: 190 NPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI 249
NPRAFR ++ G I+D L+ + L L E+LEI ++IG+ +L L D+
Sbjct: 1386 NPRAFRQFRCDGKAHRPGPDNIVDCELLSHYDMLPLDEQLEIARQIGTTRAHVLSNLRDL 1445
Query: 250 EALSSHF 256
ALS+ F
Sbjct: 1446 -ALSTSF 1451
>gi|324499955|gb|ADY39993.1| Cleavage and polyadenylation specificity factor subunit 1 [Ascaris
suum]
Length = 1434
Score = 133 bits (335), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 75/219 (34%), Positives = 132/219 (60%), Gaps = 14/219 (6%)
Query: 44 LRFKK-LKVLFVSDRSKRANEQPGLPRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPE 102
LR+++ K L ++ R RA QP + A +F+I++ M F++SD+ N+ +F Y PE
Sbjct: 1224 LRYQEEYKALSLASRDMRAVVQPPM-----AAQFLIDNRQMAFIMSDEAANIAVFNYLPE 1278
Query: 103 ARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSP--ISDAPGARSRFLTWYASLDGALGF 160
A ES+GG RLI +++ ++G +VN+F +++ S + + + +R + SLDG+ GF
Sbjct: 1279 ALESSGGERLILRSEINIGTNVNSFMRVKGHISSGFVENEHYSLNRQSVLFCSLDGSFGF 1338
Query: 161 FLPLPEKNYRRLLMLQNVMVTHTSHTGGLN---PRAFRTYKGKGYYAGNPSRGIIDGSLV 217
PL EK +RRL MLQ +M + + GLN RA R + Y +R ++DG +V
Sbjct: 1339 VRPLSEKVFRRLHMLQQLMSSLVAQAAGLNVKGSRAARPQRPNHYLN---TRNMVDGDVV 1395
Query: 218 WKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 256
+++L LSL ++ ++ +K+G+ I+D+L +I L++H+
Sbjct: 1396 FQYLHLSLADKNDLARKLGTSRYHIIDDLTEISRLTTHY 1434
>gi|328773280|gb|EGF83317.1| hypothetical protein BATDEDRAFT_21894 [Batrachochytrium dendrobatidis
JAM81]
Length = 1673
Score = 130 bits (326), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 67/185 (36%), Positives = 111/185 (60%), Gaps = 7/185 (3%)
Query: 72 YACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIR 131
YA +I+ + + +++D DKN+ +Y P+ +S GG RLI+K + HLGQHV+ F ++R
Sbjct: 1482 YAANMLIDDNQLALLVADGDKNLHTMIYAPDNVQSLGGERLIRKGEIHLGQHVSKFIRMR 1541
Query: 132 CKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNP 191
KP +DA ++L A+LDGAL P+ E+ ++RL L + MVT H GLNP
Sbjct: 1542 RKPLLRNDAIVFSKQYLNVAATLDGALEIITPVSERIFKRLYGLYSRMVTSIEHIAGLNP 1601
Query: 192 RAFRTYKGK-------GYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILD 244
R FR + + G+ RGI+DG L++++++LS ++ + K IGSK + ++D
Sbjct: 1602 RGFRQAQHRVRPITLSGFIGPPGPRGILDGDLLYEYVRLSRTQQRGLAKAIGSKDDRLMD 1661
Query: 245 ELYDI 249
+L ++
Sbjct: 1662 DLLEV 1666
>gi|422295485|gb|EKU22784.1| cleavage and polyadenylation specificity factor subunit 1
[Nannochloropsis gaditana CCMP526]
Length = 395
Score = 128 bits (321), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 67/202 (33%), Positives = 107/202 (52%), Gaps = 8/202 (3%)
Query: 52 LFVSDRSKRANEQPGLPRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHR 111
L D S + QPG FV++ S+G ++ D N+ L Y P ES GG+R
Sbjct: 199 LMGKDYSPIVSAQPG------GSHFVVDPPSLGMLVGDSRGNLQLLQYDPADVESRGGNR 252
Query: 112 LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRR 171
L+++ DFHL ++ R P A A R + + S++G +G +P+ EK YRR
Sbjct: 253 LVRRADFHLSHRLSFLQHTRMAEVPRPGAYRAGVRVMV-FGSVEGGVGALVPVEEKVYRR 311
Query: 172 LLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEI 231
L LQ VMV H G NPR FR + +G+ G +G +DG L+W+F LS+G++ ++
Sbjct: 312 LYALQAVMVNALPHVGAFNPRGFRLVEARGWAQGR-KKGTLDGELLWRFAGLSVGKQEDL 370
Query: 232 CKKIGSKHNDILDELYDIEALS 253
IG+ +L+ L +++ ++
Sbjct: 371 ASAIGTSREMVLESLLEVDMMT 392
>gi|313215162|emb|CBY42850.1| unnamed protein product [Oikopleura dioica]
Length = 228
Score = 127 bits (319), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 68/211 (32%), Positives = 120/211 (56%), Gaps = 26/211 (12%)
Query: 57 RSKRANEQPGLPRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKT 116
R++RA E V A +++ + +G + +D +N+ ++MY+P+ +ESNGG +L+++
Sbjct: 33 RARRAGE-------VTAVGILLDGNQVGLVSTDMQRNLQVYMYKPDQKESNGGKQLVRQA 85
Query: 117 DFHLGQHV-----------NTFFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLP 165
D +LG+ V +TF K+ + +R +T+YA LDG++G +P+
Sbjct: 86 DINLGKRVISIWNSLGRQNDTFTKVALTEND--------ARHVTFYAGLDGSIGDIVPVS 137
Query: 166 EKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSL 225
EK +RRL MLQ ++ +H H GGLNPR +R + N ++ IIDG L+ +F LS
Sbjct: 138 EKVFRRLEMLQTLVQSHLPHYGGLNPREYRYCTNEYRDLENAAKNIIDGDLLERFNGLSF 197
Query: 226 GERLEICKKIGSKHNDILDELYDIEALSSHF 256
E+ ++ +KIG +LD++ D++ + F
Sbjct: 198 TEQTDLSRKIGVTREALLDDMMDVQRTKNLF 228
>gi|393907594|gb|EJD74706.1| hypothetical protein LOAG_18016 [Loa loa]
Length = 398
Score = 127 bits (318), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 70/216 (32%), Positives = 125/216 (57%), Gaps = 8/216 (3%)
Query: 44 LRFKK-LKVLFVSDRSKRANEQPGLPRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPE 102
LR+++ K L ++ R R++ QP + A +F+I++ MGF++SD+ N+ +F Y PE
Sbjct: 188 LRYQEEYKALSLASRDMRSDVQPPM-----AAQFIIDNKQMGFVMSDEAANIAIFNYLPE 242
Query: 103 ARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSP--ISDAPGARSRFLTWYASLDGALGF 160
ES GG +L + + ++G VN+F +++ S + + + R +ASLDG+ GF
Sbjct: 243 TLESLGGEKLTLRAEINIGTVVNSFIRVKGHISSGFVENELFSLERQSVLFASLDGSFGF 302
Query: 161 FLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKF 220
PL EK +RRL MLQ +M + GLN + R + +R ++DG +V ++
Sbjct: 303 LRPLTEKVFRRLHMLQQLMSSMVPQPAGLNAKGARAARPPRPNHYLNTRNLVDGDMVMQY 362
Query: 221 LQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 256
L LSL E+ ++ +K+G+ I+D+L +I +++H+
Sbjct: 363 LHLSLPEKNDLARKLGTSRYHIIDDLIEICRVTAHY 398
>gi|312069702|ref|XP_003137805.1| hypothetical protein LOAG_02219 [Loa loa]
Length = 1065
Score = 127 bits (318), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 70/216 (32%), Positives = 125/216 (57%), Gaps = 8/216 (3%)
Query: 44 LRFKK-LKVLFVSDRSKRANEQPGLPRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPE 102
LR+++ K L ++ R R++ QP + A +F+I++ MGF++SD+ N+ +F Y PE
Sbjct: 855 LRYQEEYKALSLASRDMRSDVQPPM-----AAQFIIDNKQMGFVMSDEAANIAIFNYLPE 909
Query: 103 ARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSP--ISDAPGARSRFLTWYASLDGALGF 160
ES GG +L + + ++G VN+F +++ S + + + R +ASLDG+ GF
Sbjct: 910 TLESLGGEKLTLRAEINIGTVVNSFIRVKGHISSGFVENELFSLERQSVLFASLDGSFGF 969
Query: 161 FLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKF 220
PL EK +RRL MLQ +M + GLN + R + +R ++DG +V ++
Sbjct: 970 LRPLTEKVFRRLHMLQQLMSSMVPQPAGLNAKGARAARPPRPNHYLNTRNLVDGDMVMQY 1029
Query: 221 LQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 256
L LSL E+ ++ +K+G+ I+D+L +I +++H+
Sbjct: 1030 LHLSLPEKNDLARKLGTSRYHIIDDLIEICRVTAHY 1065
>gi|242075246|ref|XP_002447559.1| hypothetical protein SORBIDRAFT_06g003570 [Sorghum bicolor]
gi|241938742|gb|EES11887.1| hypothetical protein SORBIDRAFT_06g003570 [Sorghum bicolor]
Length = 389
Score = 126 bits (316), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 66/177 (37%), Positives = 101/177 (57%), Gaps = 1/177 (0%)
Query: 72 YACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIR 131
+A EF+I+ S++ ++SD DKNV +F Y P+ ES G +L+ + +FH+G HV+ F +++
Sbjct: 206 FATEFLIDGSTLSLVVSDSDKNVQIFYYAPKMVESWKGQKLLSRAEFHVGAHVSKFLRLQ 265
Query: 132 CKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNP 191
P+ A +RF + +LDG +G P+ E +RRL LQ +V H GLNP
Sbjct: 266 MLPTQ-GLASEKTNRFALVFGTLDGGIGCIAPVDELTFRRLQSLQRKLVDAVPHVCGLNP 324
Query: 192 RAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYD 248
R+FR +K G IID L+ + LSL E+LEI ++IG+ + IL D
Sbjct: 325 RSFRHFKSNGKAHRPGPDNIIDFELLSHYEMLSLEEQLEIAQQIGTTRSQILSNFSD 381
>gi|428186188|gb|EKX55039.1| hypothetical protein GUITHDRAFT_160593 [Guillardia theta CCMP2712]
Length = 2290
Score = 125 bits (315), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 67/186 (36%), Positives = 104/186 (55%), Gaps = 6/186 (3%)
Query: 66 GLPRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVN 125
LP VYA EFV+ S+ G + SD+ KNV +F++ P + E +LI + D H+G H+N
Sbjct: 1402 ALPLSVYAAEFVVMGSNFGLLASDEQKNVQVFVFNPNSPEYRR-QQLICRADLHVGSHIN 1460
Query: 126 TFFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSH 185
F + P P G R+ Y +LDG +G +P+PE++YRRLL LQN++VT H
Sbjct: 1461 KFIRW---PLPFRPTLGVRT--AAHYTTLDGGIGAIIPIPEQSYRRLLALQNLLVTAMPH 1515
Query: 186 TGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDE 245
GLNPR++R YK ++ +DG+L+ ++L L L ++++ + IL +
Sbjct: 1516 YAGLNPRSWRLYKPAMCMKRRYAKNFLDGNLLGRYLHLDLALQMQLSSALNQTREAILGD 1575
Query: 246 LYDIEA 251
L A
Sbjct: 1576 LPSTSA 1581
>gi|308459872|ref|XP_003092248.1| CRE-CPSF-1 protein [Caenorhabditis remanei]
gi|308253976|gb|EFO97928.1| CRE-CPSF-1 protein [Caenorhabditis remanei]
Length = 1448
Score = 125 bits (314), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 68/220 (30%), Positives = 125/220 (56%), Gaps = 12/220 (5%)
Query: 44 LRFKKL-KVLFVSDRSKRANEQPGLPRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPE 102
+RF++ K + ++ R R +P + A +FV++ + +GF++SD++ N+ LF Y PE
Sbjct: 1234 IRFQEENKAMSIASRDDRRTAKPPM-----AAQFVVDGAHLGFLLSDENGNITLFNYSPE 1288
Query: 103 ARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPIS-----DAPGARSRFLTWYASLDGA 157
A ESNGG RL + ++G +VN F +++ S ++ + R T + SLDG+
Sbjct: 1289 APESNGGERLTVRAAMNIGTNVNAFLRVKGHTSLLNLQSDEEKESVEQRMSTIFGSLDGS 1348
Query: 158 LGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYK-GKGYYAGNPSRGIIDGSL 216
GF PL EK+YRRL LQ + + T GL+ + R+ + + G +R +IDG +
Sbjct: 1349 FGFVRPLSEKSYRRLHFLQTFIGSVTPQIAGLHIKGARSARPAQPIVNGRNARNLIDGDV 1408
Query: 217 VWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 256
V ++L LSL ++ ++ +++G I+D+L + ++ ++
Sbjct: 1409 VEQYLHLSLYDKTDLARRLGVGRYHIIDDLMHLRRMAYYY 1448
>gi|356530945|ref|XP_003534039.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Glycine max]
Length = 1449
Score = 125 bits (314), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 64/183 (34%), Positives = 102/183 (55%), Gaps = 4/183 (2%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
+A EF+I+ S++ M+SD ++N+ +F Y P+ ES G +L+ + +FH+G HV F ++
Sbjct: 1260 CFATEFLIDGSTLSLMVSDDNRNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRL 1319
Query: 131 RCKPSP--ISDAPGA--RSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHT 186
+ + PG+ +RF + +LDG++G PL E +RRL LQ +V H
Sbjct: 1320 QMLSTSDRAGSVPGSDKTNRFALLFGTLDGSIGCIAPLDEITFRRLQSLQRKLVDAVPHV 1379
Query: 187 GGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEL 246
GLNPRAFR ++ G I+D L+ + L L E+LEI +IG+ + IL L
Sbjct: 1380 AGLNPRAFRLFRSNGKAHRPGPDSIVDCELLCHYEMLPLEEQLEIANQIGTTRSQILSNL 1439
Query: 247 YDI 249
D+
Sbjct: 1440 SDL 1442
>gi|313232279|emb|CBY09388.1| unnamed protein product [Oikopleura dioica]
Length = 1451
Score = 125 bits (314), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 68/211 (32%), Positives = 120/211 (56%), Gaps = 26/211 (12%)
Query: 57 RSKRANEQPGLPRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKT 116
R++RA E V A +++ + +G + +D +N+ ++MY+P+ +ESNGG +L+++
Sbjct: 1256 RARRAGE-------VTAVGILLDGNQVGLVSTDMQRNLQVYMYKPDQKESNGGKQLVRQA 1308
Query: 117 DFHLGQHV-----------NTFFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLP 165
D +LG+ V +TF K+ + +R +T+YA LDG++G +P+
Sbjct: 1309 DINLGKRVISIWNSLGRQNDTFTKVALTEND--------ARHVTFYAGLDGSIGDIVPVS 1360
Query: 166 EKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSL 225
EK +RRL MLQ ++ +H H GGLNPR +R + N ++ IIDG L+ +F LS
Sbjct: 1361 EKVFRRLEMLQTLVQSHLPHYGGLNPREYRYCTNEYRDLENAAKNIIDGDLLERFNGLSF 1420
Query: 226 GERLEICKKIGSKHNDILDELYDIEALSSHF 256
E+ ++ +KIG +LD++ D++ + F
Sbjct: 1421 TEQTDLSRKIGVTREALLDDMMDVQRTKNLF 1451
>gi|356559917|ref|XP_003548242.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Glycine max]
Length = 1447
Score = 125 bits (313), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 65/185 (35%), Positives = 103/185 (55%), Gaps = 8/185 (4%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
+A EF+I+ S++ M+SD ++N+ +F Y P+ ES G +L+ + +FH+G HV F ++
Sbjct: 1258 CFATEFLIDGSTLSLMVSDDNRNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRL 1317
Query: 131 RCKPSPISDAPGA------RSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTS 184
+ + SD GA +RF + +LDG++G PL E +RRL LQ +V
Sbjct: 1318 QMLST--SDRAGAVPGSDKTNRFALLFGTLDGSIGCIAPLDEITFRRLQSLQRKLVDAVP 1375
Query: 185 HTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILD 244
H GLNPRAFR ++ G I+D L+ + L L E+LEI ++G+ + IL
Sbjct: 1376 HVAGLNPRAFRLFRSNGKAHRPGPDSIVDCELLCHYEMLPLEEQLEIAHQVGTTRSQILS 1435
Query: 245 ELYDI 249
L D+
Sbjct: 1436 NLSDL 1440
>gi|402590016|gb|EJW83947.1| hypothetical protein WUBG_05142 [Wuchereria bancrofti]
Length = 374
Score = 124 bits (312), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 69/216 (31%), Positives = 126/216 (58%), Gaps = 8/216 (3%)
Query: 44 LRFKK-LKVLFVSDRSKRANEQPGLPRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPE 102
LR+++ K L ++ R R++ QP + A +F+I++ MGF++SD+ N+ +F Y PE
Sbjct: 164 LRYQEEYKALSLASRDMRSDVQPPM-----AAQFLIDNKQMGFIMSDEAANIAIFNYLPE 218
Query: 103 ARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSP--ISDAPGARSRFLTWYASLDGALGF 160
ES GG +L + + ++G VN+F +++ S + + + R +ASLDG+ G+
Sbjct: 219 TLESLGGEKLTLRAEINIGTVVNSFIRVKGHISSGFVENELFSLERQSVLFASLDGSFGY 278
Query: 161 FLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKF 220
PL EK +RRL MLQ +M + GLN + R + + +R ++DG +V ++
Sbjct: 279 LRPLTEKVFRRLHMLQQLMSSMVLQPAGLNAKGARAARPQRPNHYLNTRNLVDGDVVMQY 338
Query: 221 LQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 256
L LSL E+ ++ +K+G+ I+D+L +I +++H+
Sbjct: 339 LHLSLPEKNDLARKLGTSRYHIIDDLNEICRVTAHY 374
>gi|25148482|ref|NP_500157.2| Protein CPSF-1 [Caenorhabditis elegans]
gi|22096347|sp|Q9N4C2.2|CPSF1_CAEEL RecName: Full=Probable cleavage and polyadenylation specificity
factor subunit 1; AltName: Full=Cleavage and
polyadenylation specificity factor 160 kDa subunit;
Short=CPSF 160 kDa subunit
gi|373220398|emb|CCD73182.1| Protein CPSF-1 [Caenorhabditis elegans]
Length = 1454
Score = 124 bits (311), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 69/220 (31%), Positives = 123/220 (55%), Gaps = 12/220 (5%)
Query: 44 LRFKKL-KVLFVSDRSKRANEQPGLPRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPE 102
+RF++ K + ++ R R QP + A + V++ + +GF++SD+ N+ +F Y PE
Sbjct: 1240 IRFQEDNKAMSIASRDDRKCAQPPM-----ASQLVVDGAHVGFLLSDETGNITMFNYAPE 1294
Query: 103 ARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPI-----SDAPGARSRFLTWYASLDGA 157
A ESNGG RL + ++G ++N F ++R S + + R T +ASLDG+
Sbjct: 1295 APESNGGERLTVRAAINIGTNINAFVRLRGHTSLLQLNNEDEKEAIEQRMTTVFASLDGS 1354
Query: 158 LGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYK-GKGYYAGNPSRGIIDGSL 216
GF PL EK+YRRL LQ + + T GL+ + R+ K + G +R +IDG +
Sbjct: 1355 FGFVRPLTEKSYRRLHFLQTFIGSVTPQIAGLHIKGSRSAKPSQPIVNGRNARNLIDGDV 1414
Query: 217 VWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 256
V ++L LSL ++ ++ +++G I+D+L + ++ ++
Sbjct: 1415 VEQYLHLSLYDKTDLARRLGVGRYHIIDDLMQLRRMAFYY 1454
>gi|302814354|ref|XP_002988861.1| hypothetical protein SELMODRAFT_184138 [Selaginella moellendorffii]
gi|300143432|gb|EFJ10123.1| hypothetical protein SELMODRAFT_184138 [Selaginella moellendorffii]
Length = 1413
Score = 124 bits (310), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 62/186 (33%), Positives = 103/186 (55%), Gaps = 3/186 (1%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
YA EF+I+ S++ ++SD KN+ +F Y P+ ES G +L+ + +FHLG HV F ++
Sbjct: 1231 CYATEFLIDGSTLSLLVSDSRKNIQVFSYAPKNAESWKGQKLLPRVEFHLGSHVTKFLRL 1290
Query: 131 RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 190
+ +P S +RF + +LDG +G+ PL E +RRL LQ +V H GLN
Sbjct: 1291 QMLQTPGS---SRTNRFALCFGTLDGGIGYITPLDELTFRRLQTLQRKLVDLVPHVAGLN 1347
Query: 191 PRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIE 250
P+A+R ++ G + + +D + ++ LSL +++ I ++IG+ I L DI
Sbjct: 1348 PKAYRQFQANGEHHKHGPDNTVDSEQLREYESLSLDKQVAIARQIGTTRQQIFANLRDIS 1407
Query: 251 ALSSHF 256
+S F
Sbjct: 1408 LSTSFF 1413
>gi|302761560|ref|XP_002964202.1| hypothetical protein SELMODRAFT_82277 [Selaginella moellendorffii]
gi|300167931|gb|EFJ34535.1| hypothetical protein SELMODRAFT_82277 [Selaginella moellendorffii]
Length = 1413
Score = 124 bits (310), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 62/186 (33%), Positives = 103/186 (55%), Gaps = 3/186 (1%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
YA EF+I+ S++ ++SD KN+ +F Y P+ ES G +L+ + +FHLG HV F ++
Sbjct: 1231 CYATEFLIDGSTLSLLVSDSRKNIQVFSYAPKNAESWKGQKLLPRVEFHLGSHVTKFLRL 1290
Query: 131 RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 190
+ +P S +RF + +LDG +G+ PL E +RRL LQ +V H GLN
Sbjct: 1291 QMLQTPGS---SRTNRFALCFGTLDGGIGYITPLDELTFRRLQTLQRKLVDLVPHVAGLN 1347
Query: 191 PRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIE 250
P+A+R ++ G + + +D + ++ LSL +++ I ++IG+ I L DI
Sbjct: 1348 PKAYRQFQANGEHHKHGPDNTVDSEQLREYESLSLDKQVAIARQIGTTRQQIFANLRDIS 1407
Query: 251 ALSSHF 256
+S F
Sbjct: 1408 LSTSFF 1413
>gi|170576536|ref|XP_001893668.1| CPSF A subunit region family protein [Brugia malayi]
gi|158600196|gb|EDP37499.1| CPSF A subunit region family protein [Brugia malayi]
Length = 1323
Score = 123 bits (309), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 68/216 (31%), Positives = 125/216 (57%), Gaps = 8/216 (3%)
Query: 44 LRFKK-LKVLFVSDRSKRANEQPGLPRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPE 102
LR+++ K L ++ R R++ QP + A +F+I++ MGF++SD+ N+ +F Y PE
Sbjct: 1113 LRYQEEYKALSLASRDMRSDVQPPM-----AAQFLIDNKQMGFIMSDEAANIAIFNYLPE 1167
Query: 103 ARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSP--ISDAPGARSRFLTWYASLDGALGF 160
ES GG +L + + ++G VN+F +++ S + + + R +ASLDG+ G+
Sbjct: 1168 TLESLGGEKLTLRAEINIGTVVNSFIRVKGHISSGFVENELFSLERQSVLFASLDGSFGY 1227
Query: 161 FLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKF 220
PL EK +RRL MLQ +M + GLN + R + + +R ++DG + ++
Sbjct: 1228 LRPLTEKVFRRLHMLQQLMSSMVLQPAGLNAKGARAARPQRPNHYLNTRNLVDGDVAMQY 1287
Query: 221 LQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 256
L LSL E+ ++ +K+G+ I+D+L +I +++H+
Sbjct: 1288 LHLSLPEKNDLARKLGTSRYHIIDDLIEICRVTAHY 1323
>gi|255539681|ref|XP_002510905.1| cleavage and polyadenylation specificity factor cpsf, putative
[Ricinus communis]
gi|223550020|gb|EEF51507.1| cleavage and polyadenylation specificity factor cpsf, putative
[Ricinus communis]
Length = 1461
Score = 123 bits (308), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 62/183 (33%), Positives = 100/183 (54%), Gaps = 5/183 (2%)
Query: 72 YACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIR 131
+A EF+I+ S++ ++SD+ KN+ +F Y P+ ES G +L+ + +FH+G H+ F ++
Sbjct: 1272 FATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMLESWKGQKLLSRAEFHVGAHITKFIRLS 1331
Query: 132 -----CKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHT 186
S + P +RF + +LDG++G PL E +RRL LQ +V H
Sbjct: 1332 MLSTSSDRSGAAPGPDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQRKLVDAVPHV 1391
Query: 187 GGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEL 246
GLNPR+FR ++ G I+D L+ F L L E+LEI +++G+ IL L
Sbjct: 1392 AGLNPRSFRQFRSDGKVHRPGPESIVDCELLSHFEMLPLEEQLEIAQQVGTTRAQILSNL 1451
Query: 247 YDI 249
D+
Sbjct: 1452 NDL 1454
>gi|341892673|gb|EGT48608.1| CBN-CPSF-1 protein [Caenorhabditis brenneri]
Length = 1440
Score = 123 bits (308), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 67/219 (30%), Positives = 126/219 (57%), Gaps = 11/219 (5%)
Query: 44 LRFKKL-KVLFVSDRSKRANEQPGLPRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPE 102
+RF++ K + V+ R R Q + A +F+++ + +GF++SD++ N+ LF Y PE
Sbjct: 1227 IRFQEENKAMSVASRDDRKCAQAPM-----AAQFMVDGAHIGFLLSDENGNITLFNYAPE 1281
Query: 103 ARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPIS----DAPGARSRFLTWYASLDGAL 158
A ESNGG RL + ++G ++N F +++ + ++ + A R T +ASLDG+
Sbjct: 1282 APESNGGERLTVRAAINIGTNINAFLRVKGHTALLNLHEFEKEAAEQRMSTIFASLDGSF 1341
Query: 159 GFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYK-GKGYYAGNPSRGIIDGSLV 217
GF PL EK+YRRL LQ + + + GL+ + R+ K + G +R +IDG +V
Sbjct: 1342 GFIRPLTEKSYRRLHFLQTFIGSVSQQIAGLHIKGARSAKPPQPIVNGRNARNLIDGDVV 1401
Query: 218 WKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 256
++L LS ++ ++ +++G I+D+L ++ ++ ++
Sbjct: 1402 EQYLNLSTYDKTDLARRLGVGKYHIIDDLMELRRMAFYY 1440
>gi|225455571|ref|XP_002268371.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Vitis vinifera]
Length = 1442
Score = 122 bits (307), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 64/184 (34%), Positives = 101/184 (54%), Gaps = 7/184 (3%)
Query: 72 YACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIR 131
+A EF+I+ S++ ++SD KN+ +F Y P+ ES G +L+ + +FH+G HV F +++
Sbjct: 1253 FATEFLIDGSTLSLIVSDDQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQ 1312
Query: 132 CKPSPISDAPGAR------SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSH 185
P+ SD A +RF + +LDG++G PL E +RRL LQ +V H
Sbjct: 1313 MLPAS-SDRTSATQGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDAVPH 1371
Query: 186 TGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDE 245
GLNPR+FR ++ G I+D L+ + L E+LEI ++IG+ IL
Sbjct: 1372 VAGLNPRSFRQFRSNGKAHRPGPDNIVDCELLCHYEMLPFEEQLEIAQQIGTTRMQILSN 1431
Query: 246 LYDI 249
L D+
Sbjct: 1432 LNDL 1435
>gi|296084122|emb|CBI24510.3| unnamed protein product [Vitis vinifera]
Length = 1448
Score = 122 bits (307), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 64/184 (34%), Positives = 101/184 (54%), Gaps = 7/184 (3%)
Query: 72 YACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIR 131
+A EF+I+ S++ ++SD KN+ +F Y P+ ES G +L+ + +FH+G HV F +++
Sbjct: 1259 FATEFLIDGSTLSLIVSDDQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQ 1318
Query: 132 CKPSPISDAPGAR------SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSH 185
P+ SD A +RF + +LDG++G PL E +RRL LQ +V H
Sbjct: 1319 MLPAS-SDRTSATQGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDAVPH 1377
Query: 186 TGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDE 245
GLNPR+FR ++ G I+D L+ + L E+LEI ++IG+ IL
Sbjct: 1378 VAGLNPRSFRQFRSNGKAHRPGPDNIVDCELLCHYEMLPFEEQLEIAQQIGTTRMQILSN 1437
Query: 246 LYDI 249
L D+
Sbjct: 1438 LNDL 1441
>gi|297722899|ref|NP_001173813.1| Os04g0252200 [Oryza sativa Japonica Group]
gi|255675253|dbj|BAH92541.1| Os04g0252200, partial [Oryza sativa Japonica Group]
Length = 432
Score = 122 bits (307), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 65/180 (36%), Positives = 103/180 (57%), Gaps = 5/180 (2%)
Query: 72 YACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIR 131
+A EF+I+ S++ + SD DKNV +F Y P+ ES G +L+ + +FH+G H+ F +++
Sbjct: 249 FATEFLIDGSTLSLVASDSDKNVQIFYYAPKMVESWKGQKLLSRAEFHVGAHITKFLRLQ 308
Query: 132 CKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNP 191
P+ + +RF + +LDG +G P+ E +RRL LQ +V H GLNP
Sbjct: 309 MLPTQ-GLSSEKTNRFALLFGNLDGGIGCIAPIDELTFRRLQSLQRKLVDAVPHVCGLNP 367
Query: 192 RAFRTY--KGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI 249
R+FR + GKG+ G IID L+ + LSL E+L++ ++IG+ + IL DI
Sbjct: 368 RSFRQFHSNGKGHRPG--PDNIIDFELLAHYEMLSLDEQLDVAQQIGTTRSQILSNFSDI 425
>gi|268580265|ref|XP_002645115.1| Hypothetical protein CBG16808 [Caenorhabditis briggsae]
gi|296439546|sp|A8XPU7.1|CPSF1_CAEBR RecName: Full=Probable cleavage and polyadenylation specificity
factor subunit 1; AltName: Full=Cleavage and
polyadenylation specificity factor 160 kDa subunit;
Short=CPSF 160 kDa subunit
Length = 1454
Score = 122 bits (307), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 71/220 (32%), Positives = 124/220 (56%), Gaps = 12/220 (5%)
Query: 44 LRFKKL-KVLFVSDRSKRANEQPGLPRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPE 102
+RF++ K + ++ R R Q + A EF+++ +GF++SD+ N+ LF Y PE
Sbjct: 1240 IRFQEENKAMSIASRDDRKCAQAPM-----ASEFLVDGMHIGFLLSDEHGNITLFSYSPE 1294
Query: 103 ARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPI-SDAPGARS----RFLTWYASLDGA 157
A ESNGG RL K ++G ++N F +++ S + S +P R R T + SLDG+
Sbjct: 1295 APESNGGERLTVKAAINIGTNINAFLRVKGHTSLLDSSSPEERENIEQRMNTIFGSLDGS 1354
Query: 158 LGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYK-GKGYYAGNPSRGIIDGSL 216
G+ PL EK+YRRL LQ + + T GL+ + R+ K + G +R +IDG +
Sbjct: 1355 FGYIRPLTEKSYRRLHFLQTFIGSVTPQIAGLHIKGARSSKPSQPIVNGRNARNLIDGDV 1414
Query: 217 VWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 256
V ++L LS+ ++ ++ +++G ILD+L + ++ ++
Sbjct: 1415 VEQYLHLSVYDKTDLARRLGVGRYHILDDLMQLRRMAYYY 1454
>gi|75145059|sp|Q7XWP1.2|CPSF1_ORYSJ RecName: Full=Probable cleavage and polyadenylation specificity
factor subunit 1; AltName: Full=Cleavage and
polyadenylation specificity factor 160 kDa subunit;
Short=CPSF 160 kDa subunit
gi|38345987|emb|CAD39979.2| OSJNBa0032B23.5 [Oryza sativa Japonica Group]
Length = 1441
Score = 122 bits (306), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 65/181 (35%), Positives = 103/181 (56%), Gaps = 5/181 (2%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
+A EF+I+ S++ + SD DKNV +F Y P+ ES G +L+ + +FH+G H+ F ++
Sbjct: 1257 CFATEFLIDGSTLSLVASDSDKNVQIFYYAPKMVESWKGQKLLSRAEFHVGAHITKFLRL 1316
Query: 131 RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 190
+ P+ + +RF + +LDG +G P+ E +RRL LQ +V H GLN
Sbjct: 1317 QMLPTQ-GLSSEKTNRFALLFGNLDGGIGCIAPIDELTFRRLQSLQRKLVDAVPHVCGLN 1375
Query: 191 PRAFRTY--KGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYD 248
PR+FR + GKG+ G IID L+ + LSL E+L++ ++IG+ + IL D
Sbjct: 1376 PRSFRQFHSNGKGHRPG--PDNIIDFELLCSYEMLSLDEQLDVAQQIGTTRSQILSNFSD 1433
Query: 249 I 249
I
Sbjct: 1434 I 1434
>gi|222628488|gb|EEE60620.1| hypothetical protein OsJ_14038 [Oryza sativa Japonica Group]
Length = 1441
Score = 122 bits (305), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 65/181 (35%), Positives = 103/181 (56%), Gaps = 5/181 (2%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
+A EF+I+ S++ + SD DKNV +F Y P+ ES G +L+ + +FH+G H+ F ++
Sbjct: 1257 CFATEFLIDGSTLSLVASDSDKNVQIFYYAPKMVESWKGQKLLSRAEFHVGAHITKFLRL 1316
Query: 131 RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 190
+ P+ + +RF + +LDG +G P+ E +RRL LQ +V H GLN
Sbjct: 1317 QMLPTQ-GLSSEKTNRFALLFGNLDGGIGCIAPIDELTFRRLQSLQRKLVDAVPHVCGLN 1375
Query: 191 PRAFRTY--KGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYD 248
PR+FR + GKG+ G IID L+ + LSL E+L++ ++IG+ + IL D
Sbjct: 1376 PRSFRQFHSNGKGHRPG--PDNIIDFELLAHYEMLSLDEQLDVAQQIGTTRSQILSNFSD 1433
Query: 249 I 249
I
Sbjct: 1434 I 1434
>gi|218194461|gb|EEC76888.1| hypothetical protein OsI_15095 [Oryza sativa Indica Group]
Length = 1503
Score = 122 bits (305), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 65/181 (35%), Positives = 103/181 (56%), Gaps = 5/181 (2%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
+A EF+I+ S++ + SD DKNV +F Y P+ ES G +L+ + +FH+G H+ F ++
Sbjct: 1319 CFATEFLIDGSTLSLVASDSDKNVQIFYYAPKMVESWKGQKLLSRAEFHVGAHITKFLRL 1378
Query: 131 RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 190
+ P+ + +RF + +LDG +G P+ E +RRL LQ +V H GLN
Sbjct: 1379 QMLPTQ-GLSSEKTNRFALLFGNLDGGIGCIAPIDELTFRRLQSLQRKLVDAVPHVCGLN 1437
Query: 191 PRAFRTY--KGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYD 248
PR+FR + GKG+ G IID L+ + LSL E+L++ ++IG+ + IL D
Sbjct: 1438 PRSFRQFHSNGKGHRPG--PDNIIDFELLAHYEMLSLDEQLDVAQQIGTTRSQILSNFSD 1495
Query: 249 I 249
I
Sbjct: 1496 I 1496
>gi|449524573|ref|XP_004169296.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like, partial [Cucumis sativus]
Length = 741
Score = 121 bits (303), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 62/183 (33%), Positives = 97/183 (53%), Gaps = 4/183 (2%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
YA EF+I+ S++ +SD KN+ +F Y P++ ES G +L+ + +FH+G HV F ++
Sbjct: 552 CYATEFLIDGSTLSLTVSDDQKNIQIFYYAPKSTESWKGQKLLSRAEFHVGAHVTKFLRL 611
Query: 131 R----CKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHT 186
+ S +RF + +LDG++G PL E +RRL LQ + H
Sbjct: 612 QMLSTSSDKACSTVSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLGDAVPHV 671
Query: 187 GGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEL 246
GGLNPR+FR + G I+D L+ + L L E+L+I +IG+ + IL L
Sbjct: 672 GGLNPRSFRQFHSNGKVHRRGPDSIVDCELLCHYEMLPLEEQLDIAHQIGTTRSQILSNL 731
Query: 247 YDI 249
D+
Sbjct: 732 NDL 734
>gi|449470342|ref|XP_004152876.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Cucumis sativus]
Length = 1504
Score = 120 bits (302), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 62/182 (34%), Positives = 97/182 (53%), Gaps = 4/182 (2%)
Query: 72 YACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIR 131
YA EF+I+ S++ +SD KN+ +F Y P++ ES G +L+ + +FH+G HV F +++
Sbjct: 1316 YATEFLIDGSTLSLTVSDDQKNIQIFYYAPKSTESWKGQKLLSRAEFHVGAHVTKFLRLQ 1375
Query: 132 ----CKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 187
S +RF + +LDG++G PL E +RRL LQ + H G
Sbjct: 1376 MLSTSSDKACSTVSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLGDAVPHVG 1435
Query: 188 GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELY 247
GLNPR+FR + G I+D L+ + L L E+L+I +IG+ + IL L
Sbjct: 1436 GLNPRSFRQFHSNGKVHRRGPDSIVDCELLCHYEMLPLEEQLDIAHQIGTTRSQILSNLN 1495
Query: 248 DI 249
D+
Sbjct: 1496 DL 1497
>gi|357162146|ref|XP_003579318.1| PREDICTED: probable cleavage and polyadenylation specificity factor
subunit 1-like [Brachypodium distachyon]
Length = 1442
Score = 120 bits (302), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 63/178 (35%), Positives = 101/178 (56%), Gaps = 1/178 (0%)
Query: 72 YACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIR 131
+A EF+I+ S++ ++SD DKN+ +F Y P+ ES G +L+ + + H+G H+ F +++
Sbjct: 1259 FATEFLIDGSTLSLVVSDSDKNLQIFYYAPKMVESWKGQKLLSRAELHVGAHMTKFLRLQ 1318
Query: 132 CKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNP 191
P+ A +RF + +LDG++G P+ E +RRL LQ +V SH GLNP
Sbjct: 1319 MLPAQ-GLASEKTNRFALLFGTLDGSIGCIAPVDELTFRRLQSLQRKLVDAVSHVCGLNP 1377
Query: 192 RAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI 249
R+FR +K G IID L+ + LSL E+L++ ++IG+ IL DI
Sbjct: 1378 RSFRQFKSNGKAHRPGPDNIIDFELLTYYEILSLEEQLDMAQQIGTTRAQILSNFSDI 1435
>gi|339253000|ref|XP_003371723.1| cleavage and polyadenylation specificity factor subunit 1
[Trichinella spiralis]
gi|316967988|gb|EFV52332.1| cleavage and polyadenylation specificity factor subunit 1
[Trichinella spiralis]
Length = 1376
Score = 120 bits (301), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 68/216 (31%), Positives = 118/216 (54%), Gaps = 14/216 (6%)
Query: 40 GTLKLRFKK-LKVLFVSDRSKRANEQPGLPRGVYACEFVIESSSMGFMISDKDKNVVLFM 98
G LR+++ K L + R R +E V A EF+++ + + F+++ +++ ++
Sbjct: 1164 GVSLLRYQEEYKALSLVSRDPRPDE-------VLAVEFLVDRTDLSFLMTSAAGDILTYV 1216
Query: 99 YQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARSRFLT----WYASL 154
Y PE+ +S GG RL+ + D+H G VN F ++RC I A R L +AS
Sbjct: 1217 YLPESLDSFGGQRLVPQADYHFGSQVNAFVRMRCHAQEI--AGRKRQEVLQRQGLIFASS 1274
Query: 155 DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDG 214
DG++ + LPLPE+ YR L MLQ++++ GLN +RT + P++ IIDG
Sbjct: 1275 DGSVNYLLPLPEREYRLLGMLQSLLIDMLPSFAGLNVDDYRTVRFPNSCLREPTKNIIDG 1334
Query: 215 SLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIE 250
++ +L + ++ +I ++IGS H+ I+ EL +E
Sbjct: 1335 NICMLYLYIDALQQEDIVRQIGSSHSQIMLELAYME 1370
>gi|30696088|ref|NP_199979.2| cleavage and polyadenylation specificity factor subunit 1
[Arabidopsis thaliana]
gi|290457637|sp|Q9FGR0.2|CPSF1_ARATH RecName: Full=Cleavage and polyadenylation specificity factor subunit
1; AltName: Full=Cleavage and polyadenylation specificity
factor 160 kDa subunit; Short=AtCPSF160; Short=CPSF 160
kDa subunit
gi|332008729|gb|AED96112.1| cleavage and polyadenylation specificity factor subunit 1
[Arabidopsis thaliana]
Length = 1442
Score = 119 bits (298), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 62/184 (33%), Positives = 101/184 (54%), Gaps = 4/184 (2%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
+A EF+I+ S++ +SD+ KN+ +F Y P+ ES G +L+ + +FH+G HV+ F ++
Sbjct: 1261 CFATEFLIDGSTLSLAVSDEQKNIQVFYYAPKMIESWKGLKLLSRAEFHVGAHVSKFLRL 1320
Query: 131 RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 190
+ +S +RF + +LDG+ G PL E +RRL LQ +V H GLN
Sbjct: 1321 QM----VSSGADKINRFALLFGTLDGSFGCIAPLDEVTFRRLQSLQKKLVDAVPHVAGLN 1376
Query: 191 PRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIE 250
P AFR ++ G + I+D L+ + L L E+LE+ +IG+ IL +L D+
Sbjct: 1377 PLAFRQFRSSGKARRSGPDSIVDCELLCHYEMLPLEEQLELAHQIGTTRYSILKDLVDLS 1436
Query: 251 ALSS 254
+S
Sbjct: 1437 VGTS 1440
>gi|10257491|dbj|BAB11613.1| cleavage and polyadenylation specificity factor subunit [Arabidopsis
thaliana]
Length = 1448
Score = 119 bits (297), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 62/184 (33%), Positives = 101/184 (54%), Gaps = 4/184 (2%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
+A EF+I+ S++ +SD+ KN+ +F Y P+ ES G +L+ + +FH+G HV+ F ++
Sbjct: 1267 CFATEFLIDGSTLSLAVSDEQKNIQVFYYAPKMIESWKGLKLLSRAEFHVGAHVSKFLRL 1326
Query: 131 RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 190
+ +S +RF + +LDG+ G PL E +RRL LQ +V H GLN
Sbjct: 1327 QM----VSSGADKINRFALLFGTLDGSFGCIAPLDEVTFRRLQSLQKKLVDAVPHVAGLN 1382
Query: 191 PRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIE 250
P AFR ++ G + I+D L+ + L L E+LE+ +IG+ IL +L D+
Sbjct: 1383 PLAFRQFRSSGKARRSGPDSIVDCELLCHYEMLPLEEQLELAHQIGTTRYSILKDLVDLS 1442
Query: 251 ALSS 254
+S
Sbjct: 1443 VGTS 1446
>gi|24415580|gb|AAN41460.1| putative cleavage and polyadenylation specificity factor 160 kDa
subunit [Arabidopsis thaliana]
Length = 1442
Score = 119 bits (297), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 62/184 (33%), Positives = 101/184 (54%), Gaps = 4/184 (2%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
+A EF+I+ S++ +SD+ KN+ +F Y P+ ES G +L+ + +FH+G HV+ F ++
Sbjct: 1261 CFATEFLIDGSTLSLAVSDEQKNIQVFYYAPKMIESWKGLKLLSRAEFHVGAHVSKFLRL 1320
Query: 131 RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 190
+ +S +RF + +LDG+ G PL E +RRL LQ +V H GLN
Sbjct: 1321 QM----VSSGADKINRFALLFGTLDGSFGCIAPLDEVTFRRLQSLQKKLVDAVPHVAGLN 1376
Query: 191 PRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIE 250
P AFR ++ G + I+D L+ + L L E+LE+ +IG+ IL +L D+
Sbjct: 1377 PLAFRQFRSSGKARRSGPDSIVDCELLCHYEMLPLEEQLELAHQIGTTRYSILKDLVDLS 1436
Query: 251 ALSS 254
+S
Sbjct: 1437 VGTS 1440
>gi|198415711|ref|XP_002123169.1| PREDICTED: similar to cleavage and polyadenylation specificity factor
1, partial [Ciona intestinalis]
Length = 1370
Score = 117 bits (294), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 62/157 (39%), Positives = 92/157 (58%), Gaps = 13/157 (8%)
Query: 44 LRFK-KLKVLFVSDRSKRANEQPGLPRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPE 102
LR++ K L V+ R R P VY + V++ + + F++SD +KN+VLF Y PE
Sbjct: 1222 LRYQTDFKTLSVTSRDVR-------PLEVYTADLVVDGTGINFLVSDHEKNLVLFAYDPE 1274
Query: 103 ARESNGGHRLIKKTDFHLGQHVNTFFKIRC----KPSPISDAPGARSRFLTWYASLDGAL 158
ES+GG RL K+ D H+G N +++ + + + + P A +T +LDG++
Sbjct: 1275 DHESHGGSRLTKRADMHIGSRANCMWRVAACGVDRSTGLPNQPYA-GVHITMMGTLDGSI 1333
Query: 159 GFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFR 195
LP+ EK YRRLLMLQN+M+T H GLNP+AFR
Sbjct: 1334 CHVLPVAEKVYRRLLMLQNIMITGLQHIAGLNPKAFR 1370
>gi|224120960|ref|XP_002318462.1| predicted protein [Populus trichocarpa]
gi|222859135|gb|EEE96682.1| predicted protein [Populus trichocarpa]
Length = 1455
Score = 117 bits (292), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 60/186 (32%), Positives = 102/186 (54%), Gaps = 11/186 (5%)
Query: 72 YACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI- 130
++ EF+I+ S++ ++SD+ KNV +F Y P+ ES G +L+ + +FH+G V F ++
Sbjct: 1266 FSTEFLIDGSTLSLVVSDEQKNVQIFYYAPKMSESWKGQKLLSRAEFHVGALVTKFMRLQ 1325
Query: 131 -------RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHT 183
R +P+SD +RF + +LDG++G PL E +RRL LQ +V
Sbjct: 1326 MLSPSLDRSGAAPVSDK---TNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDAV 1382
Query: 184 SHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDIL 243
H GLNP++FR ++ G I+D ++ + + L E++EI ++IG+ IL
Sbjct: 1383 PHVAGLNPKSFRQFRSDGKAHRPGPESIVDCEMLSYYEMIPLEEQVEIAQQIGTTRAQIL 1442
Query: 244 DELYDI 249
L D+
Sbjct: 1443 SNLNDL 1448
>gi|320169222|gb|EFW46121.1| cleavage and polyadenylation specificity factor 1 [Capsaspora
owczarzaki ATCC 30864]
Length = 1725
Score = 115 bits (289), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 60/180 (33%), Positives = 102/180 (56%), Gaps = 2/180 (1%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
V + +F+++ SS+GF+ +D +N+V+F Y P RESN G RL+++ DFH+G HV ++
Sbjct: 1540 VASTQFLVDGSSLGFLATDSHQNLVVFAYNPLNRESNNGQRLLRQLDFHVGSHVQQVLRM 1599
Query: 131 RCKPSPISDAPGAR-SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGL 189
+ P+S GA R + A+L+G+L P+ E +RRL LQ +V GL
Sbjct: 1600 VPRSLPVSVDRGASVKRHIDLLATLEGSLNALAPIGETTFRRLEWLQRQLV-GLQQRAGL 1658
Query: 190 NPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI 249
NP +R Y+ + +IDG L+ +FL L L E+ E+ ++ + D++D++ +
Sbjct: 1659 NPIGYRAYRFPRKMTTTRAGNVIDGELLSRFLYLGLAEQRELARQRRNTPEDLIDDILSV 1718
>gi|297792471|ref|XP_002864120.1| hypothetical protein ARALYDRAFT_495232 [Arabidopsis lyrata subsp.
lyrata]
gi|297309955|gb|EFH40379.1| hypothetical protein ARALYDRAFT_495232 [Arabidopsis lyrata subsp.
lyrata]
Length = 1444
Score = 115 bits (288), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 59/184 (32%), Positives = 100/184 (54%), Gaps = 4/184 (2%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
+A EF+I+ +++ +SD+ KN+ +F Y P+ ES G +L+ + +FH+G HV F ++
Sbjct: 1263 CFATEFLIDGNTLSLAVSDEQKNIQVFYYAPKMAESWKGQKLLSRAEFHVGSHVTKFLRL 1322
Query: 131 RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 190
+ ++ +RF + +LDG+ G PL E +RRL LQ +V H GLN
Sbjct: 1323 QM----VTSGADKTNRFALLFGTLDGSFGCIAPLDEVTFRRLQSLQKKLVDAVPHVAGLN 1378
Query: 191 PRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIE 250
P +FR ++ G + IID L+ + L L E+LE+ +IG+ + IL L ++
Sbjct: 1379 PHSFRQFRTSGKARRSGPDSIIDCELLCHYEMLPLEEQLELAHQIGTTRSVILLNLVELS 1438
Query: 251 ALSS 254
+S
Sbjct: 1439 VGTS 1442
>gi|440793679|gb|ELR14857.1| CPSF A subunit region protein [Acanthamoeba castellanii str. Neff]
Length = 1477
Score = 112 bits (280), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 56/185 (30%), Positives = 103/185 (55%), Gaps = 4/185 (2%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
V+A E+V++ ++ ++SD+ +N+ + Y P ES GG L+ + DF++GQ +N F ++
Sbjct: 1295 VFATEYVVDKQNLALLMSDERQNLQVTAYAPHTAESRGGQLLVPRGDFNVGQSINKFVRL 1354
Query: 131 RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 190
P + + R W+ +L G +G+ P+ E +RRL MLQ+ +++ HT GL+
Sbjct: 1355 ---PMTLPSGTTSLQRHALWFGTLSGGVGYLAPMDESVFRRLGMLQSALLSAIPHTAGLH 1411
Query: 191 PRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIE 250
P+A+R + + N I+DG L+ ++L L + +I K+G+ IL++L I
Sbjct: 1412 PQAYRALQ-RERLLRNRKHTILDGLLLSRYLALDSATQQQIALKLGTSRERILNDLQGIP 1470
Query: 251 ALSSH 255
+H
Sbjct: 1471 QSVTH 1475
>gi|298715584|emb|CBJ28137.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 255
Score = 111 bits (278), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 56/183 (30%), Positives = 100/183 (54%), Gaps = 5/183 (2%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
V++ E++I+ M +++D +NV + Y P A S GG +L+ ++DF+LG V +
Sbjct: 76 VFSAEYLIDEPGMAIVVADGRRNVKVLQYAPNATNSRGGTKLLCQSDFYLGSRVGKLTRR 135
Query: 131 RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 190
R + + + D +R+ +LDG LG LP+ E+ +RRL LQ +M H G N
Sbjct: 136 RTRGN-LRDG----ARYCLLAGTLDGGLGAVLPVDERVFRRLYALQGIMSNALGHNGAAN 190
Query: 191 PRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIE 250
PRA+R + + + ++DGSL+W+F+ L + ++ + IG+ + ++ L DI+
Sbjct: 191 PRAYRLFDHGPTFRYETKQNMLDGSLLWRFVGLDAKTQHDLTRAIGTTVDRVMANLLDID 250
Query: 251 ALS 253
S
Sbjct: 251 LAS 253
>gi|430810872|emb|CCJ31592.1| unnamed protein product [Pneumocystis jirovecii]
gi|430814599|emb|CCJ28188.1| unnamed protein product [Pneumocystis jirovecii]
Length = 203
Score = 110 bits (276), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 60/188 (31%), Positives = 102/188 (54%), Gaps = 11/188 (5%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
V + +F+++ + F+I D D N+ +F Y PE +S G +L+K+ DFH+G H+ + +
Sbjct: 13 VSSADFLVDDEHLYFVIGDDDGNIHVFNYDPENPQSFSGQKLLKRGDFHVGSHIKSILML 72
Query: 131 --RCKPSPISDAPGARSR----FLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTS 184
P ++D R+ L AS DG++G + LPEK YRRL +Q ++
Sbjct: 73 PKEAFPQNVNDKEETRASKNQDSLCLCASQDGSMGVLISLPEKTYRRLYFIQGQLINTED 132
Query: 185 HTGGLNPRAFR--TYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
GLNP ++R TY K NP+RGI+DG L++++ L ++ ++ +K G I
Sbjct: 133 KVAGLNPISYRTSTYVSK---TSNPARGILDGKLLYQYNNLERNKQKDMARKSGMPVETI 189
Query: 243 LDELYDIE 250
+ +L I+
Sbjct: 190 IYDLLKID 197
>gi|384487281|gb|EIE79461.1| hypothetical protein RO3G_04166 [Rhizopus delemar RA 99-880]
Length = 1468
Score = 110 bits (275), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 60/189 (31%), Positives = 103/189 (54%), Gaps = 6/189 (3%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
V +F+I+ S+ ++ D ++N+ L+ Y P +S GG +L+++ DFH+G V T ++
Sbjct: 1283 VGCVDFIIDDKSLYLIVGDTNENIDLYQYAPFNLQSFGGQKLMRRGDFHVGSQVQTMVRL 1342
Query: 131 RCKPSPISDAPGAR--SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGG 188
P G R + +G++ + EK ++RL L +V + H G
Sbjct: 1343 ---PQIEKTEKGFEYSRRHFCLCGTFNGSIAVISSISEKTFKRLNTLYGHLVNNLQHVAG 1399
Query: 189 LNPRAFRTYKG-KGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELY 247
LNPRAFR KG K + N ++ ++DG L+++F LS+ E+ E K+IG+ I+++L
Sbjct: 1400 LNPRAFRLIKGPKQRMSTNRTKAVLDGDLIFEFAGLSIEEQKETTKQIGTTVTRIMEDLV 1459
Query: 248 DIEALSSHF 256
DIE +HF
Sbjct: 1460 DIECSINHF 1468
Score = 38.9 bits (89), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 26/86 (30%), Positives = 46/86 (53%), Gaps = 6/86 (6%)
Query: 4 VQELLTVSLGLHGNRPLLLVRTQ-HELLIYQAFRHPKGT----LKLRFKKLKVLFVSDRS 58
+QE+L +G P L+VRT ++++IY+AF + + L LRF +++ +VS +S
Sbjct: 853 IQEILMTHIGKERKDPHLVVRTDTNDIIIYKAFTYLDESSPDRLALRFSRVQHEYVSRKS 912
Query: 59 KRANEQPGLPRGVYACEFVIESSSMG 84
+P RG+ EF I + +
Sbjct: 913 SSHESKPKKKRGIID-EFEIPDTDLN 937
>gi|290981010|ref|XP_002673224.1| CPSF A subunit [Naegleria gruberi]
gi|284086806|gb|EFC40480.1| CPSF A subunit [Naegleria gruberi]
Length = 1373
Score = 107 bits (268), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 56/208 (26%), Positives = 104/208 (50%), Gaps = 19/208 (9%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P V + EF++ + +G + D KN+ +F Y P+ +ESN G L+ DFH+G +N+
Sbjct: 1166 PLQVVSSEFLVNNDLLGLAVIDTSKNLQIFSYLPQHQESNDGRNLVPVCDFHIGTLINSL 1225
Query: 128 FKIRCKPSP---------ISDAPGARSR----------FLTWYASLDGALGFFLPLPEKN 168
+++ + P +++ P + + S+DGA+G+ P+ E
Sbjct: 1226 IRMKVRELPDDNTIRLGNVNEKPKQSGKKDITKTNPNHQFILFGSVDGAIGYVAPINEVT 1285
Query: 169 YRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGER 228
+RRL LQ M T GL+P++FR YK N + IIDG L+W + ++ +
Sbjct: 1286 HRRLFALQLKMYTQLEQAAGLHPKSFRLYKPLERTEYNYKKNIIDGQLIWNYANINTILQ 1345
Query: 229 LEICKKIGSKHNDILDELYDIEALSSHF 256
++ ++IG+ ++IL + ++ + F
Sbjct: 1346 RDLARQIGTNSDNILRSIQELNQATFFF 1373
>gi|384946686|gb|AFI36948.1| cleavage and polyadenylation specificity factor subunit 1 [Macaca
mulatta]
Length = 1428
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 58/189 (30%), Positives = 98/189 (51%), Gaps = 11/189 (5%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P VY+ +F+++++ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF
Sbjct: 1251 PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 1310
Query: 128 FKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 187
++ C+ + G + + W G LP H S
Sbjct: 1311 WRTPCRGA----TEGLSKKSVVWENKHITWFGEDLP-------AAADAAERADHHASAPR 1359
Query: 188 GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELY 247
P + DG L+ ++L LS ER E+ KKIG+ + ILD+L
Sbjct: 1360 RPQPPCLPDAARGPPHPPECCAQRADGELLNRYLYLSTMERSELAKKIGTTPDIILDDLL 1419
Query: 248 DIEALSSHF 256
+ + +++HF
Sbjct: 1420 ETDRVTAHF 1428
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/51 (50%), Positives = 35/51 (68%), Gaps = 4/51 (7%)
Query: 3 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGTLKLRFKKL 49
+V+E+L V+LG +RP LLV ELLIY+AF H +G LK+RFKK+
Sbjct: 844 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKV 894
>gi|303321596|ref|XP_003070792.1| CPSF A subunit region family protein [Coccidioides posadasii C735
delta SOWgp]
gi|240110489|gb|EER28647.1| CPSF A subunit region family protein [Coccidioides posadasii C735
delta SOWgp]
Length = 1394
Score = 100 bits (249), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 58/192 (30%), Positives = 100/192 (52%), Gaps = 16/192 (8%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
V A +F+ + + +++D D + + Y PE S+ G RL+ ++ FH+G +T +
Sbjct: 1204 VIAADFLPDGKRLYILVADDDCTIHVLEYDPEDPTSSKGDRLLHRSSFHMGHFTSTMTLL 1263
Query: 131 -RCKPSPISDAPGARS--------RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVT 181
+ SP +D PG + S +G++G PL E +YRRL LQ+ +VT
Sbjct: 1264 PQHSSSPSADDPGEDDMDVDYVPKSYQVLVTSQEGSIGVVTPLTEDSYRRLSALQSQLVT 1323
Query: 182 HTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHND 241
H GLNP+A+R + G+ RGI+DG+L+ ++L + + + EI ++G+ D
Sbjct: 1324 SMEHPCGLNPKAYRAVESDGFGG----RGIVDGNLLLRWLDMGVQRKAEIAGRVGA---D 1376
Query: 242 ILDELYDIEALS 253
I D+E +S
Sbjct: 1377 IESIRVDLEKIS 1388
>gi|320040273|gb|EFW22206.1| hypothetical protein CPSG_00105 [Coccidioides posadasii str.
Silveira]
Length = 1387
Score = 100 bits (248), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 58/192 (30%), Positives = 100/192 (52%), Gaps = 16/192 (8%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
V A +F+ + + +++D D + + Y PE S+ G RL+ ++ FH+G +T +
Sbjct: 1197 VIAADFLPDGKRLYILVADDDCTIHVLEYDPEDPTSSKGDRLLHRSSFHMGHFTSTMTLL 1256
Query: 131 -RCKPSPISDAPGARS--------RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVT 181
+ SP +D PG + S +G++G PL E +YRRL LQ+ +VT
Sbjct: 1257 PQHSSSPSADDPGEDDMDVDYVPKSYQVLVTSQEGSIGVVTPLTEDSYRRLSALQSQLVT 1316
Query: 182 HTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHND 241
H GLNP+A+R + G+ RGI+DG+L+ ++L + + + EI ++G+ D
Sbjct: 1317 SMEHPCGLNPKAYRAVESDGFGG----RGIVDGNLLLRWLDMGVQRKAEIAGRVGA---D 1369
Query: 242 ILDELYDIEALS 253
I D+E +S
Sbjct: 1370 IESIRVDLEKIS 1381
>gi|19112233|ref|NP_595441.1| cleavage factor one Cft1 (predicted) [Schizosaccharomyces pombe
972h-]
gi|74582544|sp|O74733.1|CFT1_SCHPO RecName: Full=Protein cft1; AltName: Full=Cleavage factor two protein
1
gi|3738146|emb|CAA21247.1| cleavage factor one Cft1 (predicted) [Schizosaccharomyces pombe]
Length = 1441
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 60/188 (31%), Positives = 100/188 (53%), Gaps = 6/188 (3%)
Query: 70 GVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFK 129
V A +F+++ ++ F+++D N+ L Y PE ES+ G RL+ + DFH+G +V T
Sbjct: 1254 NVSAADFLVQGENLYFVVADTSGNLRLLAYDPENPESHSGERLVTRGDFHIG-NVITAMT 1312
Query: 130 IRCKPSPISDAP---GARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHT 186
I K +A F + DG L +P+ ++ YRRL ++QN + +
Sbjct: 1313 ILPKEKKHQNAEYGYDTGDDFSCVMVNSDGGLQMLVPISDRVYRRLNIIQNYLANRVNTI 1372
Query: 187 GGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEL 246
GGLNP+++R NP+R I+DG L+ F +S+ R E+ K G + I+++L
Sbjct: 1373 GGLNPKSYRLITSPSNLT-NPTRRILDGMLIDYFTYMSVAHRHEMAHKCGVPVSTIMNDL 1431
Query: 247 YDI-EALS 253
++ EALS
Sbjct: 1432 VELDEALS 1439
>gi|301093545|ref|XP_002997618.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262110008|gb|EEY68060.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 1744
Score = 99.4 bits (246), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 60/204 (29%), Positives = 100/204 (49%), Gaps = 15/204 (7%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P V A EF + + + D D+N+ + + P+ ES GG RL++ +DFHLG V++
Sbjct: 1540 PLAVSATEFSVFEKKLALLAVDMDENLHVMQFAPQDIESRGGQRLLRVSDFHLGVQVSSM 1599
Query: 128 FKIRCKPS-PISDAPGAR-----SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVT 181
F+ R S + A R S ++ + +G +G +P+ E+ +RRL LQNVMV
Sbjct: 1600 FRKRVDASGSVVSATNGRNAAPLSNYVNVMGTSEGGVGALVPVGERVFRRLFTLQNVMVN 1659
Query: 182 HTSHTGGLNPRAFRTYKGKGY-YAGNPS--------RGIIDGSLVWKFLQLSLGERLEIC 232
LNPR FR K G P + +D ++++FLQL + E+
Sbjct: 1660 TLPQNCALNPREFRMLKTNAQRRCGRPDAWSKKKWKKSFLDAFVLFRFLQLDYVAQKELA 1719
Query: 233 KKIGSKHNDILDELYDIEALSSHF 256
+ IG+ ++ L +++ +S F
Sbjct: 1720 RCIGTTPEVVMHNLLEVQHATSTF 1743
>gi|219109892|ref|XP_002176699.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217411234|gb|EEC51162.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 1678
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 63/209 (30%), Positives = 100/209 (47%), Gaps = 23/209 (11%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P VYA + +M F+ D +N+ F Y P + GG+RL+ + D+HLG +F
Sbjct: 1467 PIPVYAAGVMSRGPAMTFLCHDDRQNLQFFQYAPGEAAARGGNRLVCRADYHLGTQTTSF 1526
Query: 128 FKIRCKPS-------PIS--------DAPGARS----RFLTWYASLDGALGFFLPLPEKN 168
C+ S P S D+ RS R ++ + DG +G +PL E
Sbjct: 1527 ASHFCRSSLMIHSATPTSTLAALKQQDSYFGRSEEDQRLGAYFGTADGGMGAVVPLSEPV 1586
Query: 169 YRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGK----GYYAGNPSRGIIDGSLVWKFLQLS 224
Y RL LQ+++ L PRA+R Y+ G + + +G+IDG LV ++ LS
Sbjct: 1587 YWRLTALQSIVANALESDCALAPRAWRLYRRSTRRGGCRSNDRKKGVIDGDLVLQYADLS 1646
Query: 225 LGERLEICKKIGSKHNDILDELYDIEALS 253
+ ++ +I IGS + ILD L +++ S
Sbjct: 1647 ISKQEDIASAIGSTVDLILDNLLELQCGS 1675
>gi|167526060|ref|XP_001747364.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163774199|gb|EDQ87831.1| predicted protein [Monosiga brevicollis MX1]
Length = 1324
Score = 97.8 bits (242), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 52/170 (30%), Positives = 95/170 (55%), Gaps = 7/170 (4%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
Y +FV+ S + + D D N+V Y ++ ++ GG L++ +FHLG ++ +
Sbjct: 1138 CYCADFVVRESDLAMIYGDADGNLVALDYDADSPDTRGGRILVRSANFHLGTRLSAM--L 1195
Query: 131 RCKPSPISDAPG----ARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHT 186
R + +P+ APG A+ + ++G G +PL E YRRL MLQ +V+H+S
Sbjct: 1196 RLQAAPVVRAPGGLAEAQKCHVVHTFGIEGQQGVVIPLHEAEYRRLEMLQKKLVSHSS-L 1254
Query: 187 GGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 236
GL+P FR +K + + ++GI+DG+L+ ++ L E+L++ +++G
Sbjct: 1255 AGLHPFQFRAFKSSIWRPRSFAQGILDGALLRQYFCLGRREQLDVAEQLG 1304
>gi|301103686|ref|XP_002900929.1| cleavage and polyadenylation specificity factor subunit, putative
[Phytophthora infestans T30-4]
gi|262101684|gb|EEY59736.1| cleavage and polyadenylation specificity factor subunit, putative
[Phytophthora infestans T30-4]
Length = 1561
Score = 97.8 bits (242), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 60/204 (29%), Positives = 99/204 (48%), Gaps = 15/204 (7%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P V A EF + + + D D+N+ + + P+ ES GG RL++ +DFHLG V++
Sbjct: 1357 PLAVSATEFSVFEKKLALLAVDMDENLHVMQFAPQDIESRGGQRLLRVSDFHLGVQVSSM 1416
Query: 128 FKIRCKPS-PISDAPGAR-----SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVT 181
F+ R S + A R S ++ + +G +G +P+ E+ +RRL LQNVMV
Sbjct: 1417 FRKRVDASGSVVSATNGRNAAPLSNYVNVMGTSEGGVGALVPVGERVFRRLFTLQNVMVN 1476
Query: 182 HTSHTGGLNPRAFRTYKGKGY-YAGNPS--------RGIIDGSLVWKFLQLSLGERLEIC 232
LNPR FR K G P + +D ++++FLQL + E+
Sbjct: 1477 TLPQNCALNPREFRILKTNAQRRCGRPDAWSKKKWKKSFLDAFVLFRFLQLDYVAQKELA 1536
Query: 233 KKIGSKHNDILDELYDIEALSSHF 256
+ IG+ + L +++ +S F
Sbjct: 1537 RCIGTTPEVAMHNLLEVQHATSTF 1560
>gi|225679191|gb|EEH17475.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb03]
Length = 1377
Score = 97.1 bits (240), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 60/192 (31%), Positives = 93/192 (48%), Gaps = 18/192 (9%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
V A +F+ + M++D D N+ + Y PE S G RL+ ++ FH GQ +T +
Sbjct: 1184 VMAADFLPHGKRLFIMVADDDCNIHVLQYDPEDPGSAKGDRLLHRSTFHTGQFASTL-TL 1242
Query: 131 RCKPSPISDAPGARSRFL----------TWYASLDGALGFFLPLPEKNYRRLLMLQNVMV 180
+ S +S P A + + S G++ P+ E YRRL LQ+ M+
Sbjct: 1243 LPRTSVLSQGPEAEANAMDLDSSGPLHQVLVTSETGSIALITPVSEMAYRRLSALQSQMI 1302
Query: 181 THTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHN 240
H GLNPRAFR + G RG++DG LV K+L L + EI ++G+
Sbjct: 1303 NTLEHPCGLNPRAFRAVESDGIGG----RGMVDGDLVQKWLDLGTQRKAEIASRVGA--- 1355
Query: 241 DILDELYDIEAL 252
D+ + D+EA+
Sbjct: 1356 DVWEIRADLEAI 1367
>gi|295665178|ref|XP_002793140.1| cleavage and polyadenylation specificity factor subunit A
[Paracoccidioides sp. 'lutzii' Pb01]
gi|226278054|gb|EEH33620.1| cleavage and polyadenylation specificity factor subunit A
[Paracoccidioides sp. 'lutzii' Pb01]
Length = 1408
Score = 96.7 bits (239), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 59/192 (30%), Positives = 93/192 (48%), Gaps = 18/192 (9%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
V A +F+ + + M++D D N+ + Y PE S G RL+ ++ FH GQ +T +
Sbjct: 1217 VMAADFLPDGKRLYIMVADDDCNIHVLQYDPEDPGSAKGDRLLHRSTFHTGQFASTL-TL 1275
Query: 131 RCKPSPISDAPGARSRFL----------TWYASLDGALGFFLPLPEKNYRRLLMLQNVMV 180
+ S +S P + + S G++ P+ E YRRL LQ+ M+
Sbjct: 1276 LPRTSVLSQGPETEANAMDLDLSGPLHQVLVTSETGSIALITPVSEMAYRRLSALQSQMI 1335
Query: 181 THTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHN 240
H GLNPRAFR + G RG++DG LV K+L L + EI ++G+
Sbjct: 1336 NTLEHPCGLNPRAFRAVESDGIGG----RGMVDGDLVQKWLDLGTQRKAEIASRVGA--- 1388
Query: 241 DILDELYDIEAL 252
D+ + D+EA+
Sbjct: 1389 DVWEIRADLEAI 1400
>gi|213407244|ref|XP_002174393.1| cleavage factor one Cft1 [Schizosaccharomyces japonicus yFS275]
gi|212002440|gb|EEB08100.1| cleavage factor one Cft1 [Schizosaccharomyces japonicus yFS275]
Length = 1431
Score = 96.7 bits (239), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 60/192 (31%), Positives = 97/192 (50%), Gaps = 10/192 (5%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P V + +F+++ S+ F ++D N+ + Y PE ES+ G RL+ + D H+G H+ T
Sbjct: 1242 PLSVSSADFLVQGQSLYFAVADMRGNLRILAYDPENPESHSGERLVTRGDIHVG-HIIT- 1299
Query: 128 FKIRCKPSPISDAPGAR-----SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
I P D PG F + DG+L P+ E+ YRRL ++QN +
Sbjct: 1300 -AIHLVPKMKKDRPGEVDYDEGDEFACITTNSDGSLQALCPISERVYRRLNIIQNYLANR 1358
Query: 183 TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
GGLNPR++R N + I+DG L+ F +S+ R E+ K G + I
Sbjct: 1359 IETVGGLNPRSYRLINTVSSL-NNATHRILDGGLIEHFSYMSVAHRQEMAYKCGVPISTI 1417
Query: 243 LDELYDI-EALS 253
+++L ++ EAL+
Sbjct: 1418 MNDLVELDEALN 1429
>gi|119195757|ref|XP_001248482.1| hypothetical protein CIMG_02253 [Coccidioides immitis RS]
gi|121769680|sp|Q1E5B0.1|CFT1_COCIM RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two protein
1
gi|392862316|gb|EAS37050.2| protein CFT1 [Coccidioides immitis RS]
Length = 1387
Score = 95.9 bits (237), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 57/192 (29%), Positives = 97/192 (50%), Gaps = 16/192 (8%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
V A +F+ + + +++D D + + Y PE S+ G RL+ ++ FH G +T +
Sbjct: 1197 VIAADFLPDGKRLYILVADDDCTIHVLEYDPEDPTSSKGDRLLHRSSFHTGHFTSTMTLL 1256
Query: 131 -RCKPSPISDAPGARS--------RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVT 181
SP +D P + S +G++G PL E +YRRL LQ+ +VT
Sbjct: 1257 PEHSSSPSADDPEEDDMDVDYVPKSYQVLVTSQEGSIGVVTPLTEDSYRRLSALQSQLVT 1316
Query: 182 HTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHND 241
H GLNP+A+R + G+ RGI+DG+L+ ++L + + + EI ++G+ D
Sbjct: 1317 SMEHPCGLNPKAYRAVESDGFGG----RGIVDGNLLLRWLDMGVQRKAEIAGRVGA---D 1369
Query: 242 ILDELYDIEALS 253
I D+E +S
Sbjct: 1370 IESIRVDLETIS 1381
>gi|121719617|ref|XP_001276507.1| cleavage and polyadenylation specificity factor subunit A, putative
[Aspergillus clavatus NRRL 1]
gi|148886827|sp|A1C3U1.1|CFT1_ASPCL RecName: Full=Protein cft1; AltName: Full=Cleavage factor two protein
1
gi|119404719|gb|EAW15081.1| cleavage and polyadenylation specificity factor subunit A, putative
[Aspergillus clavatus NRRL 1]
Length = 1401
Score = 95.9 bits (237), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 57/188 (30%), Positives = 100/188 (53%), Gaps = 19/188 (10%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF--- 127
V A +F+ + + +++D D N+ + Y+PE S+ G +L+ ++ FH+G +T
Sbjct: 1210 VVAADFLPDGDKLFILVADSDCNLHVLQYEPEDPMSSNGDKLLVRSKFHMGHFTSTLTLL 1269
Query: 128 ------FKIRCKPSPISDAPGARSRFL---TWYASLDGALGFFLPLPEKNYRRLLMLQNV 178
++I PS SD+ R S G++G +PE++YRRL LQ+
Sbjct: 1270 PRTTASYEI---PSADSDSMEVDPRITPQQVLITSQSGSIGIVTSIPEESYRRLSALQSQ 1326
Query: 179 MVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 238
+ H GLNPRA+R + G AG RG++DG+L++++L +S R+EI ++G+
Sbjct: 1327 LANTVEHPCGLNPRAYRAIESDG-TAG---RGMLDGNLLYQWLSMSKQRRMEIAARVGAH 1382
Query: 239 HNDILDEL 246
+I +L
Sbjct: 1383 EWEIKADL 1390
>gi|196012166|ref|XP_002115946.1| hypothetical protein TRIADDRAFT_59883 [Trichoplax adhaerens]
gi|190581722|gb|EDV21798.1| hypothetical protein TRIADDRAFT_59883 [Trichoplax adhaerens]
Length = 1187
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 47/148 (31%), Positives = 82/148 (55%), Gaps = 3/148 (2%)
Query: 103 ARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFL 162
A ES+GG L+++ + G + + FF+ + + + ++ +TW+ +LDG++G L
Sbjct: 1039 APESHGGQFLVRRAEIQTGSNAHAFFRTKVRAL---NQRQNENKHITWFGTLDGSIGLLL 1095
Query: 163 PLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQ 222
P+ EK YRRL LQ + + GLN +AFRT++ N R I+DG L+ ++
Sbjct: 1096 PVDEKEYRRLFSLQAKLSIYLEQNAGLNQKAFRTFRSHQKKLQNSMRNILDGDLLKRYFH 1155
Query: 223 LSLGERLEICKKIGSKHNDILDELYDIE 250
L ER ++ K+I S I+++L +E
Sbjct: 1156 LGFVERRDLAKQIMSTPEQIINDLTKLE 1183
>gi|148886829|sp|A2R919.1|CFT1_ASPNC RecName: Full=Protein cft1; AltName: Full=Cleavage factor two protein
1
gi|134083776|emb|CAK47110.1| unnamed protein product [Aspergillus niger]
Length = 1383
Score = 95.1 bits (235), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 56/192 (29%), Positives = 97/192 (50%), Gaps = 16/192 (8%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
V A EF+ + + +++D D N+ + Y PE +S+ G RL+ ++ FH+G +T +
Sbjct: 1192 VCAAEFLPDGKRLFIVVADSDCNIHVLQYDPEDPKSSNGDRLLSRSKFHMGNFASTLTLL 1251
Query: 131 ---------RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVT 181
S D + +G+LG +PE++YRRL LQ+ +
Sbjct: 1252 PRTMVSSEKMVSSSDGMDIDNQSPLHQVLMTTQNGSLGLITCIPEESYRRLSALQSQLTN 1311
Query: 182 HTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHND 241
H GLNPRAFR + G RG++DG+L++K++ +S + EI ++G++ +
Sbjct: 1312 TLEHPCGLNPRAFRAVESD----GTAGRGMLDGNLLFKWIDMSKQRKTEIAGRVGAREWE 1367
Query: 242 ILDELYDIEALS 253
I D+EA+S
Sbjct: 1368 I---KADLEAIS 1376
>gi|348679545|gb|EGZ19361.1| putative cleavage and polyadenylation specificity factor CPSF
[Phytophthora sojae]
Length = 1752
Score = 95.1 bits (235), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 59/210 (28%), Positives = 98/210 (46%), Gaps = 27/210 (12%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P V A EF + + + D D+N+ + + P+ ES GG RL++ +DFHLG V +
Sbjct: 1548 PLAVSATEFSVFEKKLALLAVDMDENLHVMQFAPQDIESRGGQRLLRVSDFHLGVQVASM 1607
Query: 128 FKIRCKPSPISDAPGAR------------SRFLTWYASLDGALGFFLPLPEKNYRRLLML 175
F+ R D PG S ++ + +G +G +P+ E+ +RRL L
Sbjct: 1608 FRKRV------DGPGGHVAVNGRGPRAPPSYYVNVMGNSEGGVGALIPVGERVFRRLFTL 1661
Query: 176 QNVMVTHTSHTGGLNPRAFRTYKGKGY-YAGNPS--------RGIIDGSLVWKFLQLSLG 226
QNVMV LNPR FR K G P + +D ++++FLQL
Sbjct: 1662 QNVMVNTLPQNCALNPREFRMLKTNAQRRCGRPDAWSKKKWKKSFLDAFVLFRFLQLDYV 1721
Query: 227 ERLEICKKIGSKHNDILDELYDIEALSSHF 256
+ E+ + IG+ ++ L +++ ++ F
Sbjct: 1722 AQKELARCIGTTPEVVIHNLLEVQHATATF 1751
>gi|317036382|ref|XP_001398211.2| protein cft1 [Aspergillus niger CBS 513.88]
Length = 1393
Score = 95.1 bits (235), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 56/192 (29%), Positives = 97/192 (50%), Gaps = 16/192 (8%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
V A EF+ + + +++D D N+ + Y PE +S+ G RL+ ++ FH+G +T +
Sbjct: 1202 VCAAEFLPDGKRLFIVVADSDCNIHVLQYDPEDPKSSNGDRLLSRSKFHMGNFASTLTLL 1261
Query: 131 ---------RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVT 181
S D + +G+LG +PE++YRRL LQ+ +
Sbjct: 1262 PRTMVSSEKMVSSSDGMDIDNQSPLHQVLMTTQNGSLGLITCIPEESYRRLSALQSQLTN 1321
Query: 182 HTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHND 241
H GLNPRAFR + G RG++DG+L++K++ +S + EI ++G++ +
Sbjct: 1322 TLEHPCGLNPRAFRAVESD----GTAGRGMLDGNLLFKWIDMSKQRKTEIAGRVGAREWE 1377
Query: 242 ILDELYDIEALS 253
I D+EA+S
Sbjct: 1378 I---KADLEAIS 1386
>gi|350633238|gb|EHA21604.1| hypothetical protein ASPNIDRAFT_51242 [Aspergillus niger ATCC 1015]
Length = 1406
Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 56/192 (29%), Positives = 97/192 (50%), Gaps = 16/192 (8%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
V A EF+ + + +++D D N+ + Y PE +S+ G RL+ ++ FH+G +T +
Sbjct: 1215 VCAAEFLPDGKRLFIVVADSDCNIHVLQYDPEDPKSSNGDRLLSRSKFHMGNFASTLTLL 1274
Query: 131 ---------RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVT 181
S D + +G+LG +PE++YRRL LQ+ +
Sbjct: 1275 PRTMVSSEKMVSSSDGMDIDNQSPLHQVLMTTQNGSLGLITCIPEESYRRLSALQSQLTN 1334
Query: 182 HTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHND 241
H GLNPRAFR + G RG++DG+L++K++ +S + EI ++G++ +
Sbjct: 1335 TLEHPCGLNPRAFRAVESD----GTAGRGMLDGNLLFKWIDMSKQRKTEIAGRVGAREWE 1390
Query: 242 ILDELYDIEALS 253
I D+EA+S
Sbjct: 1391 I---KADLEAIS 1399
>gi|325189779|emb|CCA24259.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 1911
Score = 94.7 bits (234), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 55/198 (27%), Positives = 96/198 (48%), Gaps = 15/198 (7%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P V EF I + + + +D ++N+ + Y P ES GG RL++ +DFH+G +++
Sbjct: 1707 PLAVTTAEFNILDTRLALIAADVEENLHVLQYAPHDIESRGGQRLLRTSDFHVGVQISSI 1766
Query: 128 F-KIRCKPSPISDAPGARSR-----FLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVT 181
K+ + A+ R +L S +G + +P+PE+ +RRL LQNVM++
Sbjct: 1767 LRKLVISNASHQQYIPAKGRCIGNMYLNVLGSSEGGIAALIPVPERVFRRLFTLQNVMIS 1826
Query: 182 HTSHTGGLNPRAFRTYKGKG---------YYAGNPSRGIIDGSLVWKFLQLSLGERLEIC 232
LNPR FR K G + +G +DG ++ +FL L + E+
Sbjct: 1827 ALPQNCALNPREFRVMKANGRVRSGRADAWCKQKWKKGFLDGQVLCRFLHLDYVAQKELA 1886
Query: 233 KKIGSKHNDILDELYDIE 250
+ IG+ I+ L +++
Sbjct: 1887 RCIGTNPEVIIQNLSELQ 1904
>gi|325187036|emb|CCA21579.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 1912
Score = 94.7 bits (234), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 55/198 (27%), Positives = 96/198 (48%), Gaps = 15/198 (7%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P V EF I + + + +D ++N+ + Y P ES GG RL++ +DFH+G +++
Sbjct: 1708 PLAVTTAEFNILDTRLALIAADVEENLHVLQYAPHDIESRGGQRLLRTSDFHVGVQISSI 1767
Query: 128 F-KIRCKPSPISDAPGARSR-----FLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVT 181
K+ + A+ R +L S +G + +P+PE+ +RRL LQNVM++
Sbjct: 1768 LRKLVISNASHQQYIPAKGRCIGNMYLNVLGSSEGGIAALIPVPERVFRRLFTLQNVMIS 1827
Query: 182 HTSHTGGLNPRAFRTYKGKG---------YYAGNPSRGIIDGSLVWKFLQLSLGERLEIC 232
LNPR FR K G + +G +DG ++ +FL L + E+
Sbjct: 1828 ALPQNCALNPREFRVMKANGRVRSGRADAWCKQKWKKGFLDGQVLCRFLHLDYVAQKELA 1887
Query: 233 KKIGSKHNDILDELYDIE 250
+ IG+ I+ L +++
Sbjct: 1888 RCIGTNPEVIIQNLSELQ 1905
>gi|146324727|ref|XP_747211.2| cleavage and polyadenylation specificity factor subunit A, putative
[Aspergillus fumigatus Af293]
gi|148886828|sp|Q4WCL1.2|CFT1_ASPFU RecName: Full=Protein cft1; AltName: Full=Cleavage factor two protein
1
gi|129556124|gb|EAL85173.2| cleavage and polyadenylation specificity factor subunit A, putative
[Aspergillus fumigatus Af293]
Length = 1401
Score = 93.6 bits (231), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 55/192 (28%), Positives = 96/192 (50%), Gaps = 17/192 (8%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFF-- 128
V A EF+ + + +++D D N+ + Y PE +S+ G RL+ ++ FH+G T
Sbjct: 1210 VVAAEFLPDGDKLFILVADSDCNLHVLQYDPEDPKSSNGDRLLARSKFHMGHFATTMTLL 1269
Query: 129 --------KIRCKPSPIS-DAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVM 179
K P + D+ + L S G++G +PE++YRRL LQ+ +
Sbjct: 1270 PRTMVSSEKAMANPDSMEIDSQTISQQVLI--TSQSGSVGIVTSVPEESYRRLSALQSQL 1327
Query: 180 VTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKH 239
H GLNPRA+R + G RG++DG+L++++L + ++EI ++G+
Sbjct: 1328 ANSLEHPCGLNPRAYRAVESD----GTAGRGMLDGNLLYQWLDMGQHRKMEIAARVGAHE 1383
Query: 240 NDILDELYDIEA 251
+I +L I A
Sbjct: 1384 WEIKADLEAIGA 1395
>gi|119484094|ref|XP_001261950.1| cleavage and polyadenylation specificity factor subunit A, putative
[Neosartorya fischeri NRRL 181]
gi|148886830|sp|A1DB13.1|CFT1_NEOFI RecName: Full=Protein cft1; AltName: Full=Cleavage factor two protein
1
gi|119410106|gb|EAW20053.1| cleavage and polyadenylation specificity factor subunit A, putative
[Neosartorya fischeri NRRL 181]
Length = 1400
Score = 93.6 bits (231), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 55/192 (28%), Positives = 96/192 (50%), Gaps = 17/192 (8%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFF-- 128
V A EF+ + + +++D D N+ + Y PE +S+ G RL+ ++ FH+G T
Sbjct: 1209 VVAAEFLPDGDKLFILVADSDCNLHVLQYDPEDPKSSNGDRLLARSKFHMGHFATTMTLL 1268
Query: 129 --------KIRCKPSPIS-DAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVM 179
K P + D+ + L S G++G +PE++YRRL LQ+ +
Sbjct: 1269 PRTMVSSEKAMADPDSMEIDSQTISQQVLI--TSQSGSVGIVTSVPEESYRRLSALQSQL 1326
Query: 180 VTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKH 239
H GLNPRA+R + G RG++DG+L++++L + ++EI ++G+
Sbjct: 1327 TNSLEHPCGLNPRAYRAVESD----GTAGRGMLDGNLLYQWLDMGQHRKMEIAARVGAHE 1382
Query: 240 NDILDELYDIEA 251
+I +L I A
Sbjct: 1383 WEIKADLEAIGA 1394
>gi|159123784|gb|EDP48903.1| cleavage and polyadenylation specificity factor subunit A, putative
[Aspergillus fumigatus A1163]
Length = 1401
Score = 93.6 bits (231), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 55/192 (28%), Positives = 96/192 (50%), Gaps = 17/192 (8%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFF-- 128
V A EF+ + + +++D D N+ + Y PE +S+ G RL+ ++ FH+G T
Sbjct: 1210 VVAAEFLPDGDKLFILVADSDCNLHVLQYDPEDPKSSNGDRLLARSKFHMGHFATTMTLL 1269
Query: 129 --------KIRCKPSPIS-DAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVM 179
K P + D+ + L S G++G +PE++YRRL LQ+ +
Sbjct: 1270 PRTMVSSEKAMANPDSMEIDSQTISQQVLI--TSQSGSVGIVTSVPEESYRRLSALQSQL 1327
Query: 180 VTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKH 239
H GLNPRA+R + G RG++DG+L++++L + ++EI ++G+
Sbjct: 1328 ANSLEHPCGLNPRAYRAVESD----GTAGRGMLDGNLLYQWLDMGQHRKMEIAARVGAHE 1383
Query: 240 NDILDELYDIEA 251
+I +L I A
Sbjct: 1384 WEIKADLEAIGA 1395
>gi|358372791|dbj|GAA89393.1| cleavage and polyadenylation specificity factor subunit A
[Aspergillus kawachii IFO 4308]
Length = 1372
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 55/192 (28%), Positives = 97/192 (50%), Gaps = 16/192 (8%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
V A EF+ + + +++D D N+ + Y PE +S+ G +L+ ++ FH G +T +
Sbjct: 1181 VSAAEFLPDGRRLFIVVADSDCNIHVLQYDPEDPKSSNGDKLLSRSKFHTGNFASTLTLL 1240
Query: 131 ---------RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVT 181
S D + + +G+LG +PE++YRRL LQ+ +
Sbjct: 1241 PRTMVSSEKMISNSDDMDIDNQSALHQVLMTTQNGSLGLITCMPEESYRRLSALQSQLTN 1300
Query: 182 HTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHND 241
H GLNPRAFR + G RG++DG+L++K++ +S + EI ++G++ +
Sbjct: 1301 TLEHPCGLNPRAFRAVESD----GTAGRGMLDGNLLFKWIDMSKQRKTEIAGRVGAREWE 1356
Query: 242 ILDELYDIEALS 253
I D+EA+S
Sbjct: 1357 I---KADLEAIS 1365
>gi|414587797|tpg|DAA38368.1| TPA: hypothetical protein ZEAMMB73_143443 [Zea mays]
Length = 153
Score = 92.8 bits (229), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 51/144 (35%), Positives = 78/144 (54%), Gaps = 1/144 (0%)
Query: 105 ESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPL 164
ES G +L+ + +FH+G HV+ F +++ P+ A +RF + +LDG +G P+
Sbjct: 3 ESWKGQKLLSRAEFHVGAHVSKFLRLQMLPTQ-GLASEKTNRFALVFGTLDGGIGCIAPV 61
Query: 165 PEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLS 224
E +RRL LQ +V H GLNPR+FR +K G IID L+ + +S
Sbjct: 62 DELTFRRLQSLQRKLVDAIPHVCGLNPRSFRHFKSNGKAHRPGPDNIIDFELLSHYEMMS 121
Query: 225 LGERLEICKKIGSKHNDILDELYD 248
L E+LEI ++IG+ + IL D
Sbjct: 122 LEEQLEIAQQIGTTRSQILSNFSD 145
>gi|261201748|ref|XP_002628088.1| protein CFT1 [Ajellomyces dermatitidis SLH14081]
gi|239590185|gb|EEQ72766.1| protein CFT1 [Ajellomyces dermatitidis SLH14081]
Length = 1403
Score = 92.8 bits (229), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 56/192 (29%), Positives = 91/192 (47%), Gaps = 17/192 (8%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
V A +F+ + + +++D D N+ + Y PE S+ G RL+ ++ FH G +T +
Sbjct: 1211 VMAADFLPDGKRLYILVADDDCNIHVLQYDPEDPGSSKGDRLLHRSTFHTGHFASTMTLL 1270
Query: 131 RCKPSPISDAPGARSRFL----------TWYASLDGALGFFLPLPEKNYRRLLMLQNVMV 180
P + P A + S G++ PL E YRRL LQ+ ++
Sbjct: 1271 PRTIIPSAQGPDANPDMMELDSSGPLYHVLVTSETGSIALITPLSETAYRRLSALQSQLI 1330
Query: 181 THTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHN 240
H GLNPRAFR + G RG++DG L+ ++L L + EI ++G+
Sbjct: 1331 NTLEHPCGLNPRAFRAIESDGIGG----RGMVDGDLLHRWLDLGTQRKAEIAHRVGA--- 1383
Query: 241 DILDELYDIEAL 252
DI + D+EA+
Sbjct: 1384 DIWEIRADLEAI 1395
>gi|239611898|gb|EEQ88885.1| protein CFT1 [Ajellomyces dermatitidis ER-3]
gi|327352847|gb|EGE81704.1| CFT1 [Ajellomyces dermatitidis ATCC 18188]
Length = 1402
Score = 92.4 bits (228), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 56/192 (29%), Positives = 91/192 (47%), Gaps = 17/192 (8%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
V A +F+ + + +++D D N+ + Y PE S+ G RL+ ++ FH G +T +
Sbjct: 1210 VMAADFLPDGKRLYILVADDDCNIHVLQYDPEDPGSSKGDRLLHRSTFHTGHFASTMTLL 1269
Query: 131 RCKPSPISDAPGARSRFL----------TWYASLDGALGFFLPLPEKNYRRLLMLQNVMV 180
P + P A + S G++ PL E YRRL LQ+ ++
Sbjct: 1270 PRTIIPSAQGPDANPDMMELDSSGPLYHVLVTSETGSIALITPLSETAYRRLSALQSQLI 1329
Query: 181 THTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHN 240
H GLNPRAFR + G RG++DG L+ ++L L + EI ++G+
Sbjct: 1330 NTLEHPCGLNPRAFRAIESDGIGG----RGMVDGDLLHRWLDLGTQRKAEIAHRVGA--- 1382
Query: 241 DILDELYDIEAL 252
DI + D+EA+
Sbjct: 1383 DIWEIRADLEAI 1394
>gi|296806499|ref|XP_002844059.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
gi|238845361|gb|EEQ35023.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
Length = 1348
Score = 92.0 bits (227), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 54/178 (30%), Positives = 90/178 (50%), Gaps = 13/178 (7%)
Query: 70 GVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFK 129
V A +F+ + + + +++D D N+ + Y PE S+ G RL+ + FH+G +T
Sbjct: 1155 AVIAADFLPDGNRLYVLVADDDCNLHVLQYDPEDPSSSKGDRLLHRNVFHVGHFASTMTL 1214
Query: 130 I----RCKPSPI------SDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVM 179
+ SP +DAP S++ G++G PL E +YRRLL LQ+ +
Sbjct: 1215 LPQGSHTPHSPADRDAMDTDAPLPPSKYQILMTFQTGSVGIITPLNEDSYRRLLALQSQL 1274
Query: 180 VTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGS 237
V H GLNPR +R + G RG+IDG+L+ ++L + + EI ++G+
Sbjct: 1275 VNALEHPCGLNPRGYRAVESDGIGG---QRGMIDGNLLLRWLDMGAQRKAEIAGRVGA 1329
>gi|317157892|ref|XP_001826637.2| protein cft1 [Aspergillus oryzae RIB40]
gi|391864317|gb|EIT73613.1| mRNA cleavage and polyadenylation factor II complex, subunit CFT1
[Aspergillus oryzae 3.042]
Length = 1389
Score = 92.0 bits (227), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 58/192 (30%), Positives = 99/192 (51%), Gaps = 16/192 (8%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
V A +F+ + + + +++D D N+ + Y PE +S+ G RL+ ++ FH G ++T +
Sbjct: 1198 VLAADFLPDGNKLFILVADSDCNLHVLQYDPEDPKSSNGDRLLSRSKFHTGNFISTLTLL 1257
Query: 131 -RCKPSP---ISDAPGAR-----SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVT 181
R S ISD R S +G++G + E++YRRL LQ+ +
Sbjct: 1258 PRTSVSSEQMISDVDAMDVDIKIPRHQMLITSQNGSVGLVTCVSEESYRRLSALQSQLTN 1317
Query: 182 HTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHND 241
H GLNPRAFR + G RG++DG L++++L +S ++EI ++G+ +
Sbjct: 1318 TIEHPCGLNPRAFRAVESD----GTAGRGMLDGKLLFQWLDMSKQRKVEIASRVGANEWE 1373
Query: 242 ILDELYDIEALS 253
I D EA+S
Sbjct: 1374 I---KADFEAIS 1382
>gi|238508528|ref|XP_002385456.1| cleavage and polyadenylation specificity factor subunit A, putative
[Aspergillus flavus NRRL3357]
gi|220688975|gb|EED45327.1| cleavage and polyadenylation specificity factor subunit A, putative
[Aspergillus flavus NRRL3357]
Length = 1204
Score = 92.0 bits (227), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 58/192 (30%), Positives = 99/192 (51%), Gaps = 16/192 (8%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
V A +F+ + + + +++D D N+ + Y PE +S+ G RL+ ++ FH G ++T +
Sbjct: 1013 VLAADFLPDGNKLFILVADSDCNLHVLQYDPEDPKSSNGDRLLSRSKFHTGNFISTLTLL 1072
Query: 131 -RCKPSP---ISDAPGAR-----SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVT 181
R S ISD R S +G++G + E++YRRL LQ+ +
Sbjct: 1073 PRTSVSSEQMISDVDAMDVDIKIPRHQMLITSQNGSVGLVTCVSEESYRRLSALQSQLTN 1132
Query: 182 HTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHND 241
H GLNPRAFR + G RG++DG L++++L +S ++EI ++G+ +
Sbjct: 1133 TIEHPCGLNPRAFRAVESD----GTAGRGMLDGKLLFQWLDMSKQRKVEIASRVGANEWE 1188
Query: 242 ILDELYDIEALS 253
I D EA+S
Sbjct: 1189 I---KADFEAIS 1197
>gi|121797760|sp|Q2TZ19.1|CFT1_ASPOR RecName: Full=Protein cft1; AltName: Full=Cleavage factor two protein
1
gi|83775384|dbj|BAE65504.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 1393
Score = 92.0 bits (227), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 58/192 (30%), Positives = 99/192 (51%), Gaps = 16/192 (8%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
V A +F+ + + + +++D D N+ + Y PE +S+ G RL+ ++ FH G ++T +
Sbjct: 1202 VLAADFLPDGNKLFILVADSDCNLHVLQYDPEDPKSSNGDRLLSRSKFHTGNFISTLTLL 1261
Query: 131 -RCKPSP---ISDAPGAR-----SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVT 181
R S ISD R S +G++G + E++YRRL LQ+ +
Sbjct: 1262 PRTSVSSEQMISDVDAMDVDIKIPRHQMLITSQNGSVGLVTCVSEESYRRLSALQSQLTN 1321
Query: 182 HTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHND 241
H GLNPRAFR + G RG++DG L++++L +S ++EI ++G+ +
Sbjct: 1322 TIEHPCGLNPRAFRAVESD----GTAGRGMLDGKLLFQWLDMSKQRKVEIASRVGANEWE 1377
Query: 242 ILDELYDIEALS 253
I D EA+S
Sbjct: 1378 I---KADFEAIS 1386
>gi|145348791|ref|XP_001418827.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144579057|gb|ABO97120.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 1386
Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 53/173 (30%), Positives = 91/173 (52%), Gaps = 8/173 (4%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
V + EF+I+ +++ + SD N +F Y P++ ES G +L+ ++ +H+G ++ +
Sbjct: 1195 VVSTEFLIDGATLSLLGSDMSGNARIFGYDPKSLESWKGQKLLVRSAYHVGSPISRMVRF 1254
Query: 131 RCKPSPISDAPGAR----SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHT 186
+ + APG R +R ++ +LDGALG F+P E Y +L LQ + T
Sbjct: 1255 NVEGTTAKAAPGERPKGTNRHAVFFGTLDGALGIFMPTDEPTYAKLHALQRELNTTVRSP 1314
Query: 187 GGLNPRAFRT---YKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 236
G NPR FRT ++GK P ++DG L+ KF L+ E+ + ++ G
Sbjct: 1315 IGCNPRTFRTPKVFEGKHVQLLAP-LDVLDGGLLSKFETLTFTEQRAVAERSG 1366
>gi|212541400|ref|XP_002150855.1| cleavage and polyadenylation specificity factor subunit A, putative
[Talaromyces marneffei ATCC 18224]
gi|210068154|gb|EEA22246.1| cleavage and polyadenylation specificity factor subunit A, putative
[Talaromyces marneffei ATCC 18224]
Length = 1383
Score = 90.5 bits (223), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 54/192 (28%), Positives = 93/192 (48%), Gaps = 16/192 (8%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFF-- 128
V +F+ + + +++D D N+ + Y PE +S+ G RL+ + FH+G +T
Sbjct: 1192 VVTADFLPDGKKLYILVADGDCNLYVLQYDPEDPKSSNGDRLLNRCKFHMGHFASTLTLL 1251
Query: 129 -------KIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVT 181
++ S D + + G++ L E++YRRL LQ+ +
Sbjct: 1252 PRTAVSSELAVMSSDSMDIDSYTPLYQALITTQSGSMALITSLSEESYRRLTALQSQLSN 1311
Query: 182 HTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHND 241
H GLNPRA+R+ + G RG+IDG L+ ++L LS +LEI ++G+ +
Sbjct: 1312 TLEHPCGLNPRAYRSVESDGVVG----RGMIDGKLLMRWLDLSRSRKLEIAGRVGADEWE 1367
Query: 242 ILDELYDIEALS 253
I D+EA+S
Sbjct: 1368 I---RADLEAVS 1376
>gi|315045910|ref|XP_003172330.1| serine/threonine protein kinase [Arthroderma gypseum CBS 118893]
gi|311342716|gb|EFR01919.1| serine/threonine protein kinase [Arthroderma gypseum CBS 118893]
Length = 1397
Score = 90.5 bits (223), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 52/178 (29%), Positives = 89/178 (50%), Gaps = 13/178 (7%)
Query: 70 GVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFK 129
V +F+ + + + +++D D N+ + Y PE S+ G RL+ ++ FH G +T
Sbjct: 1204 AVVDADFLPDGNKLYILVADDDCNLHVLQYDPEDPSSSKGDRLLHRSVFHTGHFASTMTL 1263
Query: 130 I----RCKPSPI------SDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVM 179
+ SP+ +D P S++ G++G PL E +YRRLL LQ+ +
Sbjct: 1264 LPHGSHTLSSPVDEDAMDTDLPPPPSKYQVLITFQTGSIGVISPLNEDSYRRLLALQSQL 1323
Query: 180 VTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGS 237
V H GLNPR +R + G RG+IDG+L+ ++L + + EI ++G+
Sbjct: 1324 VNALEHPCGLNPRGYRAVESDGMGG---QRGMIDGNLLLRWLDMGAQRKAEIAGRVGA 1378
>gi|225558298|gb|EEH06582.1| conserved hypothetical protein [Ajellomyces capsulatus G186AR]
Length = 1408
Score = 90.1 bits (222), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 54/192 (28%), Positives = 91/192 (47%), Gaps = 17/192 (8%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
V A +F+ + + + +++D D N+ + Y PE S+ G RL+ ++ F G +T +
Sbjct: 1216 VMAADFLPDGNRLYILVADDDCNIHVLQYDPEDPGSSKGDRLLHRSTFQTGHFASTMTLL 1275
Query: 131 RCKPSPISDAPGARSRFL----------TWYASLDGALGFFLPLPEKNYRRLLMLQNVMV 180
+ S P A + S G++ P+ E +YRRL LQ+ +
Sbjct: 1276 PRTATSSSQGPDADPDMMDLDSSGPLHHVLVTSETGSIALITPVSETSYRRLSALQSQLT 1335
Query: 181 THTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHN 240
H GLNPRAFR + G RG++DG LV ++L L + EI ++G+
Sbjct: 1336 NTLEHPCGLNPRAFRAVESDGIGG----RGMVDGDLVKRWLDLGTQRKAEIANRVGA--- 1388
Query: 241 DILDELYDIEAL 252
D+ + D+EA+
Sbjct: 1389 DVWEIRADLEAI 1400
>gi|240277254|gb|EER40763.1| cleavage factor two protein 1 [Ajellomyces capsulatus H143]
Length = 1408
Score = 90.1 bits (222), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 54/192 (28%), Positives = 91/192 (47%), Gaps = 17/192 (8%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
V A +F+ + + + +++D D N+ + Y PE S+ G RL+ ++ F G +T +
Sbjct: 1216 VMAADFLPDGNRLYILVADDDCNIHVLQYDPEDPGSSKGDRLLHRSTFQTGHFASTMTLL 1275
Query: 131 RCKPSPISDAPGARSRFL----------TWYASLDGALGFFLPLPEKNYRRLLMLQNVMV 180
+ S P A + S G++ P+ E +YRRL LQ+ +
Sbjct: 1276 PRTATSSSQGPDADPDMMDLDSSGPLHHVLVTSETGSIALITPVSETSYRRLSALQSQLA 1335
Query: 181 THTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHN 240
H GLNPRAFR + G RG++DG LV ++L L + EI ++G+
Sbjct: 1336 NTLEHPCGLNPRAFRAVESDGIGG----RGMVDGDLVKRWLDLGTQRKAEIANRVGA--- 1388
Query: 241 DILDELYDIEAL 252
D+ + D+EA+
Sbjct: 1389 DVWEIRADLEAI 1400
>gi|308805673|ref|XP_003080148.1| cleavage and polyadenylation specificity factor (ISS) [Ostreococcus
tauri]
gi|116058608|emb|CAL54315.1| cleavage and polyadenylation specificity factor (ISS), partial
[Ostreococcus tauri]
Length = 1473
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 51/170 (30%), Positives = 87/170 (51%), Gaps = 9/170 (5%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
V + EF+I+ +++ + SD N F Y P++ ES G +L+ + +H+G ++ +
Sbjct: 1023 VVSSEFLIDGTTLSLLGSDMSGNARTFGYDPKSIESWKGQKLLPRAAYHVGSPISRMVRF 1082
Query: 131 -----RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSH 185
+ K + P +RF ++ +LDGALG F+P Y +LL +Q + T
Sbjct: 1083 NVEGSKSKMASTDGKPKGANRFAVFFGTLDGALGIFMPTDPVTYEKLLAIQRELTTAVRS 1142
Query: 186 TGGLNPRAFRT---YKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEIC 232
G NPR FRT ++GK P ++DG L+ KF L+ E+++I
Sbjct: 1143 PIGCNPRTFRTPKVFEGKHVQLRAP-LDVLDGGLLSKFETLTFSEQVKIA 1191
>gi|149512998|ref|XP_001514888.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like, partial [Ornithorhynchus anatinus]
Length = 831
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 52/165 (31%), Positives = 91/165 (55%), Gaps = 21/165 (12%)
Query: 3 IVQELLTVSLGLHGNRPLLL-----VRTQH---------ELLIYQAFRHPKGTLKLR-FK 47
+V+E+L V+LG RP LL +R +L I+Q L K
Sbjct: 671 LVKEVLLVALGNRQTRPYLLRLKWAIRDSELTSITFIDMQLYIHQIISVKNFILAADVMK 730
Query: 48 KLKVLFVSDRSKRAN--EQPGLPRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARE 105
+ +L + SK + + P VY+ +F+++++ +GF++SD+D+N++++MY PEA+E
Sbjct: 731 SISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKE 790
Query: 106 SNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARSRFLTW 150
S GG RL+++ DFH+G HVN F++ C+ + A G + + W
Sbjct: 791 SFGGMRLLRRADFHVGAHVNAFWRTPCRGA----AEGPSKKSIVW 831
>gi|154285962|ref|XP_001543776.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
gi|150407417|gb|EDN02958.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
Length = 1283
Score = 89.4 bits (220), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 54/192 (28%), Positives = 91/192 (47%), Gaps = 17/192 (8%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
V A +F+ + + + +++D D N+ + Y PE S+ G RL+ ++ F G +T +
Sbjct: 1091 VMAADFLPDGNRLYILVADDDCNIHVLQYDPEDPGSSKGDRLLHRSTFQTGHFASTMTLL 1150
Query: 131 RCKPSPISDAPGARSRFL----------TWYASLDGALGFFLPLPEKNYRRLLMLQNVMV 180
+ S P A + S G++ P+ E +YRRL LQ+ +
Sbjct: 1151 PRTATSSSQRPDADPDMMDLDSSGPLHHVLVTSETGSIALITPVSETSYRRLSALQSQLT 1210
Query: 181 THTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHN 240
H GLNPRAFR + G RG++DG LV ++L L + EI ++G+
Sbjct: 1211 NTLEHPCGLNPRAFRAVESDGIGG----RGMVDGDLVKRWLDLGTQRKAEIANRVGA--- 1263
Query: 241 DILDELYDIEAL 252
D+ + D+EA+
Sbjct: 1264 DVWEIRADLEAI 1275
>gi|321475208|gb|EFX86171.1| hypothetical protein DAPPUDRAFT_313209 [Daphnia pulex]
Length = 1260
Score = 89.4 bits (220), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 36/64 (56%), Positives = 53/64 (82%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P V A ++ I+++ +GF++SD +KN++L+MYQPEARES GGHRLI+K DFH+GQ V+T
Sbjct: 1197 PLEVMAVDYYIDNTQLGFLVSDAEKNLILYMYQPEARESQGGHRLIRKADFHVGQVVSTM 1256
Query: 128 FKIR 131
F+I+
Sbjct: 1257 FRIK 1260
>gi|327287424|ref|XP_003228429.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Anolis carolinensis]
Length = 1294
Score = 88.6 bits (218), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 34/88 (38%), Positives = 61/88 (69%), Gaps = 3/88 (3%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P VY +F+++S +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVN F
Sbjct: 1206 PLEVYCVDFMVDSCQLGFLVSDRDRNLLVYMYLPEAKESFGGMRLLRRADFHVGAHVNAF 1265
Query: 128 FKIRCKPS---PISDAPGARSRFLTWYA 152
++ C+ + P + ++ +TW+
Sbjct: 1266 WRTPCRGAMEGPTKKSSAWENKHITWFG 1293
Score = 47.4 bits (111), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 34/85 (40%), Positives = 47/85 (55%), Gaps = 14/85 (16%)
Query: 3 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGTLKLRFKKL--KVLFVSD 56
+V+E+L V+LG +RP LLV ELLIY+AF H + LK+RFKK+ + F
Sbjct: 847 LVKEVLLVALGNRQSRPYLLVHVDQELLIYEAFNHDSQLGQTNLKVRFKKVPHNINFREK 906
Query: 57 R---SKRANEQPG-----LPRGVYA 73
+ SK+ E G +PRG A
Sbjct: 907 KPRPSKKKTESAGGEEASVPRGRVA 931
>gi|301124072|ref|XP_002909688.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262107255|gb|EEY65307.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 176
Score = 87.4 bits (215), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 52/173 (30%), Positives = 87/173 (50%), Gaps = 15/173 (8%)
Query: 99 YQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPS-PISDAPGAR-----SRFLTWYA 152
+ P+ ES GG RL++ +DFHLG V++ F+ R S + A R S ++
Sbjct: 3 FAPQDIESRGGQRLLRVSDFHLGVQVSSMFRKRVDASGSVVSATNGRNAAPLSNYVNVMG 62
Query: 153 SLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGY-YAGNPS--- 208
+ +G +G +P+ E+ +RRL LQNVMV LNPR FR K G P
Sbjct: 63 TSEGGVGALVPVGERVFRRLFTLQNVMVNTLPQNCALNPREFRMLKTNAQRRCGRPDAWS 122
Query: 209 -----RGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 256
+ +D ++++FLQL+ + E+ + IG+ ++ L +++ +S F
Sbjct: 123 KKKWKKSFLDAFVLFRFLQLNYVAQKELARCIGTTPEVVMHNLLEVQHATSTF 175
>gi|328848896|gb|EGF98089.1| hypothetical protein MELLADRAFT_96156 [Melampsora larici-populina
98AG31]
Length = 1427
Score = 87.0 bits (214), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 53/180 (29%), Positives = 91/180 (50%), Gaps = 5/180 (2%)
Query: 73 ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC 132
+F++ + + + SD + +F Y P ES GG +L+ +T+F + +
Sbjct: 1252 TIDFLVLENKLSIVTSDLGGTIRIFEYNPTNIESQGGLKLLCRTEFGTAGEMGSSLGFGK 1311
Query: 133 KPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPR 192
+ S +A T +A LDG++ +P+ E ++RL ++Q ++ H H GLNPR
Sbjct: 1312 RLSSKEEAKSIG----TLFAGLDGSISSLVPVKEAVFKRLQIVQTRLIRHLDHFAGLNPR 1367
Query: 193 AFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEAL 252
FRT + + +RGIIDG ++ +F L L E+ I K GS N IL L +++ +
Sbjct: 1368 GFRTVRND-LVSRAMNRGIIDGEIIERFGALKLDEQDSIGKLAGSDRNTILINLNNLKGI 1426
>gi|330799483|ref|XP_003287774.1| hypothetical protein DICPUDRAFT_32967 [Dictyostelium purpureum]
gi|325082229|gb|EGC35718.1| hypothetical protein DICPUDRAFT_32967 [Dictyostelium purpureum]
Length = 1453
Score = 87.0 bits (214), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 53/185 (28%), Positives = 96/185 (51%), Gaps = 9/185 (4%)
Query: 70 GVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFK 129
+++ EF++ ++ ++SD DKN++LF + P S G L+ K DFH+G ++ F +
Sbjct: 1260 NIFSTEFIVNQQTLSILVSDLDKNILLFSFDPADPTSRQGQMLLCKADFHIGSNIEKFVR 1319
Query: 130 IRCKPS-PISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGG 188
K + S + L ++ +LDG+L PL E+ Y+ LQ+ + + G
Sbjct: 1320 TPMKFNIQSSSNGNNNNDQLVFFGTLDGSLNVLRPLDERMYQLFYHLQSKLY-YLPQPAG 1378
Query: 189 LNPRAFRTYKG-KGYYAGNPS------RGIIDGSLVWKFLQLSLGERLEICKKIGSKHND 241
LN + +R +K + +PS + I+DG L+ KF++L+ ER + +GS ++
Sbjct: 1379 LNAKQYRAFKSFSQNFHFSPSTIHQLPKYILDGDLLSKFVKLNQKERRLLASSVGSNTDE 1438
Query: 242 ILDEL 246
IL L
Sbjct: 1439 ILTAL 1443
>gi|301093655|ref|XP_002997673.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262110063|gb|EEY68115.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 176
Score = 87.0 bits (214), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 52/173 (30%), Positives = 86/173 (49%), Gaps = 15/173 (8%)
Query: 99 YQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPS-PISDAPGAR-----SRFLTWYA 152
+ P+ ES GG RL++ +DFHLG V++ F+ R S + A R S ++
Sbjct: 3 FAPQDIESRGGQRLLRVSDFHLGVQVSSMFRKRVDASGSVVSATNGRNAAPLSNYVNVMG 62
Query: 153 SLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGY-YAGNPS--- 208
+ +G +G +P+ E+ +RRL LQNVMV LNPR FR K G P
Sbjct: 63 TSEGGVGALVPVGERVFRRLFTLQNVMVNTLPQNCALNPREFRMLKTNAQRRCGRPDAWS 122
Query: 209 -----RGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 256
+ +D ++++FLQL + E+ + IG+ ++ L +++ +S F
Sbjct: 123 KKKWKKSFLDAFVLFRFLQLDYVAQKELARCIGTTPEVVMHNLLEVQHATSTF 175
>gi|302652143|ref|XP_003017931.1| hypothetical protein TRV_08063 [Trichophyton verrucosum HKI 0517]
gi|291181517|gb|EFE37286.1| hypothetical protein TRV_08063 [Trichophyton verrucosum HKI 0517]
Length = 429
Score = 86.7 bits (213), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 50/179 (27%), Positives = 90/179 (50%), Gaps = 16/179 (8%)
Query: 70 GVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFK 129
V +F+ + + + +++D D N+ + Y PE S+ G RL+ ++ FH G +T
Sbjct: 237 AVVDADFLPDGNKLYILVADDDCNLHVLQYDPEDPSSSKGDRLLHRSVFHTGHFASTMTL 296
Query: 130 I----RCKPSPISD-------APGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNV 178
+ R SP+ + P ++ + L + + G++ PL E +YRRLL LQ+
Sbjct: 297 LPHGARTPSSPVDEDAMDTDSPPPSKYQILMTFQT--GSVAVITPLGEDSYRRLLALQSQ 354
Query: 179 MVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGS 237
+V H LNPR +R + G RG+IDG+L+ ++L + + EI ++G+
Sbjct: 355 LVNALEHPCSLNPRGYRAVESDGMGG---QRGMIDGNLLLRWLDMGAQRKAEIAGRVGA 410
>gi|328864890|gb|EGG13276.1| CPSF domain-containing protein [Dictyostelium fasciculatum]
Length = 1627
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 57/213 (26%), Positives = 101/213 (47%), Gaps = 15/213 (7%)
Query: 44 LRFKKLKVLFVSDRSKRANEQPGLPRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEA 103
+ F KLKV + ++ + Y+ +F+++ + ++SD ++N+ +F + P
Sbjct: 1410 ISFFKLKVCIIQKKNIILLGKDYEEVSTYSSDFIVDEKKLSMVLSDANRNIRMFSFDPSD 1469
Query: 104 RESNGGHRLIKKTDFHLGQHVNTFFKIRCKPS-----PISDAPGARSRFLTWYASLDGAL 158
ES G L+ K+ FH+G+ N F +I K + S + + L +Y +L G +
Sbjct: 1470 PESRAGQMLLAKSSFHIGELNNKFVRIPMKNTNYDNNSSSSSIIVNDKHLLFYGTLGGGI 1529
Query: 159 GFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNP-----SRGIID 213
+P+ K + +L + H T GLNPR FR G++ N ++ ++D
Sbjct: 1530 NLLMPI-NKRFHEILHALETKLMHRGQTAGLNPRGFR----YGHHVNNTLGHLHNQYVVD 1584
Query: 214 GSLVWKFLQLSLGERLEICKKIGSKHNDILDEL 246
G L+ KF LS + ++ IGS ILD L
Sbjct: 1585 GDLLTKFQSLSPDDAKQLATSIGSTTPIILDLL 1617
>gi|452841862|gb|EME43798.1| hypothetical protein DOTSEDRAFT_79774 [Dothistroma septosporum NZE10]
Length = 1347
Score = 85.9 bits (211), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 53/189 (28%), Positives = 88/189 (46%), Gaps = 16/189 (8%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
V A EF+ ++ +I D D ++ + Y PE +S G RL+ ++ FHLG +
Sbjct: 1151 VMAAEFLPFDGALYILIIDADMDLHVLQYDPENPKSQNGMRLLHRSTFHLGHFATNMLLL 1210
Query: 131 RCKPSPISD------------APGARSR-FLTWYASLDGALGFFLPLPEKNYRRLLMLQN 177
+P + +P S F SL G++G PL E +YRRL LQ
Sbjct: 1211 PSSLNPFGENQPFTNGDTNGESPEESSPLFHVLTTSLTGSIGMITPLDESSYRRLSALQT 1270
Query: 178 VMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGS 237
+ T H LNPRA+R + + + +RG++DG++V + +L R ++ + G+
Sbjct: 1271 HLTTILEHPASLNPRAYRAIESESFGG---ARGVVDGNIVRRINELGAARRADVLARAGA 1327
Query: 238 KHNDILDEL 246
I +L
Sbjct: 1328 DAWSIRSDL 1336
>gi|327304811|ref|XP_003237097.1| hypothetical protein TERG_01819 [Trichophyton rubrum CBS 118892]
gi|326460095|gb|EGD85548.1| hypothetical protein TERG_01819 [Trichophyton rubrum CBS 118892]
Length = 1398
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 49/179 (27%), Positives = 90/179 (50%), Gaps = 16/179 (8%)
Query: 70 GVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFK 129
V +F+ + + + +++D D N+ + Y PE S+ G RL++++ FH G +T
Sbjct: 1206 AVVDADFLPDGNKLYILVADDDCNLHVLQYDPEDPSSSKGDRLLRRSVFHTGHFASTVTL 1265
Query: 130 I----RCKPSPISD-------APGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNV 178
+ SP+ + P ++ + L + + G++ PL E +YRRLL LQ+
Sbjct: 1266 LPHGAHTTSSPVDEDAMDTDSPPPSKYQILMTFQT--GSIAVITPLSEDSYRRLLALQSQ 1323
Query: 179 MVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGS 237
+V H LNPR +R + G RG+IDG+L+ ++L + + EI ++G+
Sbjct: 1324 LVNALEHPCSLNPRGYRAVESDGMGG---QRGMIDGNLLLRWLDMGAQRKAEIAGRVGA 1379
>gi|224000243|ref|XP_002289794.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220975002|gb|EED93331.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 1820
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 55/210 (26%), Positives = 90/210 (42%), Gaps = 23/210 (10%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P V+A + +M F+ D +N+ Y P + GG++L+ + DFHLG +
Sbjct: 1609 PTQVFAAGMISRGGAMSFVCHDDRQNIQFLQYAPTDVAARGGNKLVCRADFHLGSQTTSL 1668
Query: 128 FKIRCKPSPISDAPGARS-------------------RFLTWYASLDGALGFFLPLPEKN 168
+ S + ++ S RF + + DG+ +PL E
Sbjct: 1669 NSHWAQSSLLFNSCTVSSTLASLKQQDSLFGRLDDDQRFAVNFGTTDGSFVSIIPLSEPT 1728
Query: 169 YRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGK----GYYAGNPSRGIIDGSLVWKFLQLS 224
Y RL LQ+VM L+ RA+R Y+ G + +G+ID LV KF+ L
Sbjct: 1729 YWRLTALQSVMSNALESNAALSHRAWRLYRRSTRRGGCRTNDRKKGVIDADLVMKFVDLP 1788
Query: 225 LGERLEICKKIGSKHNDILDELYDIEALSS 254
L E+ ++ IGS ++D L ++ S
Sbjct: 1789 LPEQEDLTSSIGSTVGLVMDNLLELSCAGS 1818
>gi|242798830|ref|XP_002483249.1| cleavage and polyadenylation specificity factor subunit A, putative
[Talaromyces stipitatus ATCC 10500]
gi|218716594|gb|EED16015.1| cleavage and polyadenylation specificity factor subunit A, putative
[Talaromyces stipitatus ATCC 10500]
Length = 1382
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 57/192 (29%), Positives = 93/192 (48%), Gaps = 16/192 (8%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
V +F+ + + +++D D N+ + Y PE +S+ G RL+ + FH+G +T +
Sbjct: 1191 VVTADFLPDGKKLYILVADSDCNLHVLQYDPEDPKSSNGDRLLNRCKFHMGHFASTITLL 1250
Query: 131 -RCKPSPISDAPGARSRFLTWYASL--------DGALGFFLPLPEKNYRRLLMLQNVMVT 181
R S + S + Y L G + L E++YRRL LQ+ +
Sbjct: 1251 PRTAVSSELAVMNSDSMDIDSYIPLHQALITTQSGLMALVTSLSEESYRRLSALQSQLSN 1310
Query: 182 HTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHND 241
H GLNPRA+R + G RG+IDG L+ ++L LS +LEI ++G+ +
Sbjct: 1311 TLEHPCGLNPRAYRAVESDGVVG----RGMIDGKLLMRWLDLSRPRKLEIAGRVGADEWE 1366
Query: 242 ILDELYDIEALS 253
I D+EA+S
Sbjct: 1367 I---RADLEAVS 1375
>gi|390599704|gb|EIN09100.1| hypothetical protein PUNSTDRAFT_67240 [Punctularia strigosozonata
HHB-11173 SS5]
Length = 1439
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 51/182 (28%), Positives = 91/182 (50%), Gaps = 9/182 (4%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDF--HLGQHVNTFF 128
V +F+ SM + +D++ + L+ Y P+ +S G +L+ +T+F H + F
Sbjct: 1260 VTTADFIFTEDSMSILTNDENGVMRLYQYDPQDPDSRNGQQLMCRTEFDTHTTCQTSIVF 1319
Query: 129 KIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGG 188
R + P A+ S+DG+L + E ++RL +LQ + + H G
Sbjct: 1320 ARRVGEGEEAALPQAK----VVAGSIDGSLAALTCMDEPAFKRLQLLQGQLTRNIQHVAG 1375
Query: 189 LNPRAFRTYKGKGYYAGNP-SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELY 247
LNP+AFR + Y P S+GI+DG+L+ +L+L + + EI K+I ++ +L +
Sbjct: 1376 LNPKAFRIVRND--YVSKPLSKGILDGNLLSSYLELPIPRQEEITKQIATERAAVLRDWT 1433
Query: 248 DI 249
I
Sbjct: 1434 SI 1435
>gi|453082807|gb|EMF10854.1| CPSF_A-domain-containing protein [Mycosphaerella populorum SO2202]
Length = 1349
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 59/208 (28%), Positives = 91/208 (43%), Gaps = 32/208 (15%)
Query: 48 KLKVLFVSDRSKRANEQPGLPRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESN 107
KL V+ S +SK V EF+ ++ +I D D ++ + Y PE +S
Sbjct: 1136 KLTVMGKSPKSKME---------VMTAEFLPFDGALYILIMDADNDLHVLQYDPENPKSV 1186
Query: 108 GGHRLIKKTDFHLGQHVNTFFKIRCKPSPISD------------------APGARSRFLT 149
GG RL+ ++ FH+G V + P AP + L
Sbjct: 1187 GGMRLLHRSTFHIGHLVTNMLLVPSSLKPFESQDRDMANGTNGNNEEATRAPPSLHHILA 1246
Query: 150 WYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSR 209
S G++G PL E YRRL LQ + H GLNPRA+R + + + +R
Sbjct: 1247 --TSRSGSVGLITPLDEAAYRRLSALQTHLTAILEHAAGLNPRAYRAVEAESFGG---AR 1301
Query: 210 GIIDGSLVWKFLQLSLGERLEICKKIGS 237
G++DGSLV + +L +R ++ + GS
Sbjct: 1302 GVVDGSLVNRIGELGAAKRADVLGRAGS 1329
>gi|303285993|ref|XP_003062286.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226455803|gb|EEH53105.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 1469
Score = 84.7 bits (208), Expect = 3e-14, Method: Composition-based stats.
Identities = 49/199 (24%), Positives = 99/199 (49%), Gaps = 17/199 (8%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
V + EF+++ S++ + +D N +F Y P++ ES G +L+ K FH+G V+ +
Sbjct: 1271 VLSTEFLVDGSTLSLLAADTAGNAYVFAYDPKSSESWKGQKLLTKASFHVGSPVHRMVRF 1330
Query: 131 RCK------------PSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNV 178
+ K P+P ++ +R ++ +LDG+LG +P+ + +L +LQ
Sbjct: 1331 KLKTPTGAGNDGRAAPTP-AEIKANANRHAVFFGTLDGSLGILVPMESSTHAKLEVLQRW 1389
Query: 179 MVTHTSHTGGLNPRAFR---TYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI 235
+ +T+ GLN R++R T +G+ + P ++DG ++ F L+ ++ E
Sbjct: 1390 LNYNTAQNAGLNGRSYRAPKTTEGRAMRSPAP-HNLLDGEMLQGFESLAWTKQAEAADAA 1448
Query: 236 GSKHNDILDELYDIEALSS 254
G + L L+ + A ++
Sbjct: 1449 GMTREEALTYLHTLSAKTA 1467
>gi|345566738|gb|EGX49680.1| hypothetical protein AOL_s00078g169 [Arthrobotrys oligospora ATCC
24927]
Length = 1407
Score = 84.3 bits (207), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 52/197 (26%), Positives = 97/197 (49%), Gaps = 9/197 (4%)
Query: 68 PR--GVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVN 125
PR + A +F+ ++ F+++D N+ + Y PE +S G+RL+ K + + G V
Sbjct: 1212 PRQLSITAGDFLPAGTACYFVVADAQSNIHVLQYDPENPKSIHGNRLLPKGEIYCGHEVK 1271
Query: 126 TFFKIRCKPSPISDAPGARSR------FLTWYASLDGALGFFLPLPEKNYRRLLMLQNVM 179
+ + K S ++ FL ++++ G G + E YRRL ++Q +
Sbjct: 1272 SICILPKKKSLFTEPDEDDMDEDEDEEFLCMFSTMTGVFGTVSSITESMYRRLNVIQGQI 1331
Query: 180 VTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKH 239
H GLNPRA+R K + + P R I+DG L+ ++L L G R E+ + G+
Sbjct: 1332 TNTGEHIAGLNPRAYRAAKFRN-TSSEPMRAILDGKLLVRWLMLGAGRRKELAGRAGTSE 1390
Query: 240 NDILDELYDIEALSSHF 256
+ ++L+ ++ ++ F
Sbjct: 1391 EMLREDLWFLQDATAFF 1407
>gi|403170487|ref|XP_003329830.2| hypothetical protein PGTG_11767 [Puccinia graminis f. sp. tritici CRL
75-36-700-3]
gi|375168746|gb|EFP85411.2| hypothetical protein PGTG_11767 [Puccinia graminis f. sp. tritici CRL
75-36-700-3]
Length = 1513
Score = 84.3 bits (207), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 50/178 (28%), Positives = 90/178 (50%), Gaps = 5/178 (2%)
Query: 75 EFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKP 134
+F++ + + +D + + +F Y P ES GG +L+ +++F+ + + +
Sbjct: 1340 DFLVIDGKLAIVATDLNGVIRIFEYNPTNIESQGGQKLLCRSEFNTSSEMTCSMQFGKRL 1399
Query: 135 SPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAF 194
S +A T++ASLDG++ +P E Y+RL ++Q + H H GLNP+
Sbjct: 1400 SAKDEA----KVMGTFFASLDGSISSLVPAKEAVYKRLQLVQTRLTRHIQHFAGLNPKGH 1455
Query: 195 RTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEAL 252
RT + + +RGI+DG L+ KF LS+ ++ EI GS +L L ++ L
Sbjct: 1456 RTVRND-LVSRAINRGILDGELLIKFHLLSVTQQAEIAGLAGSDRETVLVNLLNLRGL 1512
>gi|302506529|ref|XP_003015221.1| hypothetical protein ARB_06344 [Arthroderma benhamiae CBS 112371]
gi|291178793|gb|EFE34581.1| hypothetical protein ARB_06344 [Arthroderma benhamiae CBS 112371]
Length = 1370
Score = 84.3 bits (207), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 49/179 (27%), Positives = 89/179 (49%), Gaps = 16/179 (8%)
Query: 70 GVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFK 129
V +F+ + + + +++D D N+ + Y PE S+ G RL+ ++ FH G +T
Sbjct: 1148 AVVDADFLPDGNKLYILVADDDCNLHVLQYDPEDPSSSKGDRLLHRSVFHTGHFASTMTL 1207
Query: 130 I----RCKPSPISD-------APGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNV 178
+ SP+ + P ++ + L + + G++ PL E +YRRLL LQ+
Sbjct: 1208 LPHGGHTPSSPVDEDAMDTDSPPPSKYQILMTFQT--GSIAIITPLGEDSYRRLLALQSQ 1265
Query: 179 MVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGS 237
+V H LNPR +R + G RG+IDG+L+ ++L + + EI ++G+
Sbjct: 1266 LVNALEHPCSLNPRGYRAVESDGMGG---QRGMIDGNLLLRWLDMGAQRKAEIAGRVGA 1321
>gi|67521912|ref|XP_659017.1| hypothetical protein AN1413.2 [Aspergillus nidulans FGSC A4]
gi|74598221|sp|Q5BDG7.1|CFT1_EMENI RecName: Full=Protein cft1; AltName: Full=Cleavage factor two protein
1
gi|40745387|gb|EAA64543.1| hypothetical protein AN1413.2 [Aspergillus nidulans FGSC A4]
gi|259486722|tpe|CBF84808.1| TPA: Protein cft1 (Cleavage factor two protein 1)
[Source:UniProtKB/Swiss-Prot;Acc:Q5BDG7] [Aspergillus
nidulans FGSC A4]
Length = 1339
Score = 84.3 bits (207), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 53/192 (27%), Positives = 97/192 (50%), Gaps = 16/192 (8%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
V A +F+ + + + +++D D N+ + Y PE S+ G +L+ ++ FH G +T +
Sbjct: 1148 VLAADFLPDGNKLFIVVADSDCNLYVLQYDPEDPNSSNGDKLLNRSKFHTGNFASTVTLL 1207
Query: 131 -RCKPSPISDAPGARSRFLTWYASL--------DGALGFFLPLPEKNYRRLLMLQNVMVT 181
R S G+ + A L +G++G +PE++YRRL LQ+ +
Sbjct: 1208 PRTLVSSERAMSGSDKMDIDNTAPLHQVLVTSHNGSIGLVTCVPEESYRRLSALQSQLTN 1267
Query: 182 HTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHND 241
H GLNPRA+R + + RG++D +L+ ++L +S + EI ++G+ +
Sbjct: 1268 TLEHPCGLNPRAYRAVESD----ASAGRGMLDSNLLLQYLDMSKQRKAEIAGRVGATEWE 1323
Query: 242 ILDELYDIEALS 253
I D+EA+S
Sbjct: 1324 I---RADLEAIS 1332
>gi|50552095|ref|XP_503522.1| YALI0E03982p [Yarrowia lipolytica]
gi|74634000|sp|Q6C740.1|CFT1_YARLI RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two protein
1
gi|49649391|emb|CAG79101.1| YALI0E03982p [Yarrowia lipolytica CLIB122]
Length = 1269
Score = 84.3 bits (207), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 48/186 (25%), Positives = 91/186 (48%), Gaps = 12/186 (6%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
V C+F IE ++ F+++D K + + Y P+ +S G RL++++ F+ G+ +++ +
Sbjct: 1092 VTTCDFAIEGENLTFVVADLQKRLHILEYDPDDPQSYSGARLLRRSVFYSGKVIDSSAMV 1151
Query: 131 RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 190
P RF+ DG++ +P PE YRRL +Q + +H GL+
Sbjct: 1152 ----------PINEDRFMVIGVCSDGSVTDVVPCPEDAYRRLYAIQTQITDKEAHVCGLH 1201
Query: 191 PRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIE 250
PRA+R +P R I+DG + +F L ++ ++G ++ ++ D+E
Sbjct: 1202 PRAYRYDPILPGTGNSPHRPILDGHTLIRFANLPRNKQNVYANRLGQRYQQLI--WKDLE 1259
Query: 251 ALSSHF 256
+S F
Sbjct: 1260 LISDLF 1265
>gi|403178252|ref|XP_003336695.2| hypothetical protein PGTG_18491 [Puccinia graminis f. sp. tritici CRL
75-36-700-3]
gi|375164075|gb|EFP92276.2| hypothetical protein PGTG_18491 [Puccinia graminis f. sp. tritici CRL
75-36-700-3]
Length = 1149
Score = 84.0 bits (206), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 50/178 (28%), Positives = 90/178 (50%), Gaps = 5/178 (2%)
Query: 75 EFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKP 134
+F++ + + +D + + +F Y P ES GG +L+ +++F+ + + +
Sbjct: 976 DFLVIDGKLAIVATDLNGVIRIFEYNPTNIESQGGQKLLCRSEFNTSSEMTCSMQFGKRL 1035
Query: 135 SPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAF 194
S +A T++ASLDG++ +P E Y+RL ++Q + H H GLNP+
Sbjct: 1036 SAKDEA----KVMGTFFASLDGSISSLVPAKEAVYKRLQLVQTRLTRHIQHFAGLNPKGH 1091
Query: 195 RTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEAL 252
RT + + +RGI+DG L+ KF LS+ ++ EI GS +L L ++ L
Sbjct: 1092 RTVRND-LVSRAINRGILDGELLIKFHLLSVTQQAEIAGLAGSDRETVLVNLLNLRGL 1148
>gi|405121446|gb|AFR96215.1| cleavage and polyadenylation specific protein [Cryptococcus
neoformans var. grubii H99]
Length = 1431
Score = 83.6 bits (205), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 50/182 (27%), Positives = 97/182 (53%), Gaps = 7/182 (3%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
V +F++ + F+ SD++ ++ + + P +S G RL+ KT++H G T K+
Sbjct: 1253 VVTADFLVHDGQVTFISSDRNGDMRMLDFDPTDPDSLNGERLMLKTEYHAGSAA-TVSKV 1311
Query: 131 RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 190
+ + +++ + YA+ DGAL + + + ++RL ++ + +V + H GLN
Sbjct: 1312 IARRKTAEEEFAPQTQII--YATADGALTTVVSVKDARFKRLQLVSDQLVRNAQHVAGLN 1369
Query: 191 PRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIE 250
PRAFRT + S+GI+DG L+ +F +G + E+ ++IG+ D + D++
Sbjct: 1370 PRAFRTVRND-LLPRPLSKGILDGQLLNQFALQPIGRQKEMMRQIGT---DAVTVASDLQ 1425
Query: 251 AL 252
AL
Sbjct: 1426 AL 1427
>gi|326477251|gb|EGE01261.1| protein kinase subdomain-containing protein [Trichophyton equinum CBS
127.97]
Length = 1267
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 52/188 (27%), Positives = 91/188 (48%), Gaps = 16/188 (8%)
Query: 70 GVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFK 129
V +F+ + + + +++D D N+ + Y PE S+ G RL+ ++ FH G +T
Sbjct: 1075 AVVDADFLPDGNKLYILVADDDCNLHVLQYDPEDPSSSKGDRLLHRSVFHTGHFASTMTL 1134
Query: 130 I---RCKPSPISDA--------PGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNV 178
+ PS D P ++ + L + + G++ PL E +YRRLL LQ+
Sbjct: 1135 LPHGAYTPSAPVDEDAMDTDSLPPSKYQILMTFQT--GSIAVITPLSEDSYRRLLALQSQ 1192
Query: 179 MVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 238
+V H LNPR +R + G RG+IDG+L+ ++L + + EI ++G+
Sbjct: 1193 LVNALEHPCSLNPRGYRAVESDGMGG---QRGMIDGNLLLRWLDMGAQRKAEIAGRVGAD 1249
Query: 239 HNDILDEL 246
I +L
Sbjct: 1250 VGAIRTDL 1257
>gi|326471884|gb|EGD95893.1| protein kinase subdomain-containing protein [Trichophyton tonsurans
CBS 112818]
Length = 1398
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 52/188 (27%), Positives = 91/188 (48%), Gaps = 16/188 (8%)
Query: 70 GVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFK 129
V +F+ + + + +++D D N+ + Y PE S+ G RL+ ++ FH G +T
Sbjct: 1206 AVVDADFLPDGNKLYILVADDDCNLHVLQYDPEDPSSSKGDRLLHRSVFHTGHFASTMTL 1265
Query: 130 I---RCKPSPISDA--------PGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNV 178
+ PS D P ++ + L + + G++ PL E +YRRLL LQ+
Sbjct: 1266 LPHGAYTPSAPVDEDAMDTDSLPPSKYQILMTFQT--GSIAVITPLSEDSYRRLLALQSQ 1323
Query: 179 MVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 238
+V H LNPR +R + G RG+IDG+L+ ++L + + EI ++G+
Sbjct: 1324 LVNALEHPCSLNPRGYRAVESDGMGG---QRGMIDGNLLLRWLDMGAQRKAEIAGRVGAD 1380
Query: 239 HNDILDEL 246
I +L
Sbjct: 1381 VGAIRTDL 1388
>gi|301630307|ref|XP_002944263.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Xenopus (Silurana) tropicalis]
Length = 92
Score = 82.8 bits (203), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 43/93 (46%), Positives = 59/93 (63%), Gaps = 1/93 (1%)
Query: 164 LPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQL 223
+ EK YRRLLMLQN + T H GLNPRAFR NP R ++DG L+ ++L L
Sbjct: 1 MQEKTYRRLLMLQNAL-TVLPHHAGLNPRAFRMLNSSRRMLQNPVRNVLDGELLNRYLYL 59
Query: 224 SLGERLEICKKIGSKHNDILDELYDIEALSSHF 256
S ER E+ +KIG+ + ILD+L +I+ ++S F
Sbjct: 60 SNMERSELARKIGTTTDIILDDLLEIDRVTSLF 92
>gi|321260384|ref|XP_003194912.1| cleavage and polyadenylation specific protein [Cryptococcus gattii
WM276]
gi|317461384|gb|ADV23125.1| cleavage and polyadenylation specific protein, putative [Cryptococcus
gattii WM276]
Length = 1431
Score = 82.8 bits (203), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 49/182 (26%), Positives = 97/182 (53%), Gaps = 7/182 (3%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
V +F++ + F+ SD++ ++ + + P +S G RL+ +T++H G T K+
Sbjct: 1253 VVTADFLVHDGQVTFISSDRNGDMRMLDFDPTDPDSLNGERLMLRTEYHAGSAA-TVSKV 1311
Query: 131 RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 190
+ + +++ + YA+ DGAL + + + ++RL ++ + +V + H GLN
Sbjct: 1312 IARRKTTEEEFAPQTQII--YATADGALTTVVSVKDARFKRLQLVSDQLVRNAQHVAGLN 1369
Query: 191 PRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIE 250
PRAFRT + S+GI+DG L+ +F +G + E+ ++IG+ D + D++
Sbjct: 1370 PRAFRTVRND-LLPRPLSKGILDGQLLNQFALQPIGRQKEMMRQIGT---DAVTVASDLQ 1425
Query: 251 AL 252
AL
Sbjct: 1426 AL 1427
>gi|58268668|ref|XP_571490.1| cleavage and polyadenylation specific protein [Cryptococcus
neoformans var. neoformans JEC21]
gi|134113364|ref|XP_774707.1| hypothetical protein CNBF3860 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|338817789|sp|P0CM63.1|CFT1_CRYNB RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two protein
1
gi|338817790|sp|P0CM62.1|CFT1_CRYNJ RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two protein
1
gi|50257351|gb|EAL20060.1| hypothetical protein CNBF3860 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|57227725|gb|AAW44183.1| cleavage and polyadenylation specific protein, putative [Cryptococcus
neoformans var. neoformans JEC21]
Length = 1431
Score = 82.8 bits (203), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 49/182 (26%), Positives = 97/182 (53%), Gaps = 7/182 (3%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
V +F++ + F+ SD++ ++ + + P +S G RL+ +T++H G T K+
Sbjct: 1253 VVTADFLVHDGQVTFISSDRNGDMRMLDFDPTDPDSLNGERLMLRTEYHAGSAA-TVSKV 1311
Query: 131 RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 190
+ + +++ + YA+ DGAL + + + ++RL ++ + +V + H GLN
Sbjct: 1312 IARRKTAEEEFAPQTQII--YATADGALTTVVSVKDARFKRLQLVSDQLVRNAQHVAGLN 1369
Query: 191 PRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIE 250
PRAFRT + S+GI+DG L+ +F +G + E+ ++IG+ D + D++
Sbjct: 1370 PRAFRTVRND-LLPRPLSKGILDGQLLNQFALQPIGRQKEMMRQIGT---DAVTVASDLQ 1425
Query: 251 AL 252
AL
Sbjct: 1426 AL 1427
>gi|403411348|emb|CCL98048.1| predicted protein [Fibroporia radiculosa]
Length = 1437
Score = 82.8 bits (203), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 51/176 (28%), Positives = 86/176 (48%), Gaps = 7/176 (3%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
V + + + ++ D+D + L Y P ES GG L+++T+FH T I
Sbjct: 1260 VTCADLFFAENRVSLLVCDEDGVIRLLEYDPHDPESRGGQHLLRRTEFHGQTEYRTSVLI 1319
Query: 131 RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 190
+ D P A+ S DG+L F + E ++ L +LQ + + H GLN
Sbjct: 1320 ARRKDKDIDIPQAK----LVCGSTDGSLVSFTFVEEAAFKGLHLLQGQLTRNVQHVAGLN 1375
Query: 191 PRAFRTYKGKGYYAGNP-SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDE 245
PRAFR + Y P S+GI+DG+L+ F +L + + E+ ++IG++ +L +
Sbjct: 1376 PRAFRIVRND--YVSRPLSKGILDGNLLTTFEELPIARQNEMTRQIGTERATVLKD 1429
>gi|255948500|ref|XP_002565017.1| Pc22g10080 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211592034|emb|CAP98296.1| Pc22g10080 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 1392
Score = 82.8 bits (203), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 52/188 (27%), Positives = 93/188 (49%), Gaps = 13/188 (6%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
V A +F+ + + + +++D D N+ + Y PE +S+ G RL+ ++ F+ G ++ +
Sbjct: 1201 VVAADFLPDGNKLYMLVADSDCNLHVLQYDPEDPKSSNGDRLLSRSKFYTGNFASSVTLL 1260
Query: 131 RCKPSPIS---------DAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVT 181
D +R AS +G+L + E++YRRL LQ+ ++
Sbjct: 1261 PRTAVSSERTESSEEGMDLDETFARHQVLIASQNGSLALVTSVAEESYRRLSALQSQLIN 1320
Query: 182 HTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHND 241
H GLNPRAFR + G AG RG++DG+L+ +L + + EI ++G+ +
Sbjct: 1321 TVDHPAGLNPRAFRAIESDG-AAG---RGMVDGNLLRLWLNMGKQRQTEIAGRVGATEWE 1376
Query: 242 ILDELYDI 249
I +L I
Sbjct: 1377 IKADLETI 1384
>gi|392572878|gb|EIW66021.1| hypothetical protein TREMEDRAFT_70300 [Tremella mesenterica DSM 1558]
Length = 1408
Score = 82.0 bits (201), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 48/180 (26%), Positives = 92/180 (51%), Gaps = 4/180 (2%)
Query: 67 LPRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNT 126
+P V + +F++ + F+ D+ ++ + + P ES G RLI +T++H G V T
Sbjct: 1226 MPISVTSADFLVHDGHVTFLTYDRSGDIRMVDFDPANPESINGERLIVRTEYHGGSPV-T 1284
Query: 127 FFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHT 186
+ + + + +++ + +A DG++ F+ +RRL + + ++ + H
Sbjct: 1285 VSTMIARRRGVEEEFAPQTQIICAHA--DGSISTFVSTKPARFRRLHFVSDQLIRNAQHV 1342
Query: 187 GGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEL 246
GLNPRAFRT + A SRGI+DG L+ +F + + E+ K+IG+ + +L
Sbjct: 1343 AGLNPRAFRTVRND-LVAKPLSRGILDGELLGRFAIQPIDRQREMLKQIGTDGGTVASDL 1401
>gi|50288865|ref|XP_446862.1| hypothetical protein [Candida glabrata CBS 138]
gi|74609915|sp|Q6FSD2.1|CFT1_CANGA RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two protein
1
gi|49526171|emb|CAG59795.1| unnamed protein product [Candida glabrata]
Length = 1361
Score = 81.6 bits (200), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 50/181 (27%), Positives = 93/181 (51%), Gaps = 3/181 (1%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
V + EF++ + + F+++D+D + + Y P+ + G RL+ + F+L N +
Sbjct: 1171 VISVEFLVNNGDIYFLVTDRDSIMHVLKYAPDQPNTLSGQRLVHCSSFNLHSLNNCTMLL 1230
Query: 131 RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 190
D +RS F T A +DG++ +P+ E+ YRRL +Q ++ GLN
Sbjct: 1231 PKNDEFPRDQRYSRS-FQTITAQVDGSISKIVPVKEETYRRLYFIQQQIIDKEPQLAGLN 1289
Query: 191 PRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHN-DILDELYDI 249
PR R K Y+ G+ R ++D +++ +F +S+ R I +K+G N ++ +L D+
Sbjct: 1290 PRMERQ-DNKYYHLGHSLRPMLDFNIIKRFKDMSMNRRSHIVQKLGKNSNLEVWRDLIDL 1348
Query: 250 E 250
E
Sbjct: 1349 E 1349
>gi|367001853|ref|XP_003685661.1| hypothetical protein TPHA_0E01320 [Tetrapisispora phaffii CBS 4417]
gi|357523960|emb|CCE63227.1| hypothetical protein TPHA_0E01320 [Tetrapisispora phaffii CBS 4417]
Length = 1357
Score = 81.6 bits (200), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 48/164 (29%), Positives = 83/164 (50%), Gaps = 1/164 (0%)
Query: 73 ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC 132
A +FV+ + + F++SD D + + Y P+ S G RL + F++ + +
Sbjct: 1170 ALDFVVRNGEIYFVVSDTDDILHILKYSPDEPNSLSGQRLAHYSSFNIHSTNTSMHLLPA 1229
Query: 133 KPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPR 192
I + S F T A+LDG++ LPL E ++RRL ++Q ++ H GLNPR
Sbjct: 1230 NDEFIENKGNGSSIFQTIGANLDGSIFKILPLSEDSFRRLYVIQQQIIDTEVHAAGLNPR 1289
Query: 193 AFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 236
R + Y N +R ++D +L+ ++ LS+ +R I +K G
Sbjct: 1290 MER-LSNEYYQLTNVTRPLLDFNLIRRYSNLSIKKRKSIAQKAG 1332
>gi|401889164|gb|EJT53104.1| cleavage and polyadenylation specific protein [Trichosporon asahii
var. asahii CBS 2479]
Length = 1358
Score = 81.6 bits (200), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 53/179 (29%), Positives = 95/179 (53%), Gaps = 8/179 (4%)
Query: 73 ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI-R 131
A +F++ + F+ +D+ N+ L + P +S G +L+ +T+F G V I R
Sbjct: 1182 AADFLVHEGQVTFITNDQHGNMRLVDFDPANPDSLNGEKLLTQTEFGTGCPVTASCMIAR 1241
Query: 132 CKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNP 191
K + AP +S+ + YA+ DGA+ + + E ++RL ++Q+ +V + H GLNP
Sbjct: 1242 RKTAEEEFAP--QSQLI--YATADGAITSVVAVKEARFKRLQLVQDQLVRNAQHVAGLNP 1297
Query: 192 RAFRTYKGKGYYAGNP-SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI 249
RAFRT + P +RG++DG L+ F L + E+ ++IG+ + +LY +
Sbjct: 1298 RAFRTVRND--LVPRPLARGVLDGGLLAHFALQPLRRQREMMRQIGTDAVTVGSDLYTL 1354
>gi|406699110|gb|EKD02327.1| cleavage and polyadenylation specific protein [Trichosporon asahii
var. asahii CBS 8904]
Length = 1339
Score = 81.3 bits (199), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 53/179 (29%), Positives = 95/179 (53%), Gaps = 8/179 (4%)
Query: 73 ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI-R 131
A +F++ + F+ +D+ N+ L + P +S G +L+ +T+F G V I R
Sbjct: 1163 AADFLVHEGQVTFITNDQHGNMRLVDFDPANPDSLNGEKLLTQTEFGTGCPVTASCMIAR 1222
Query: 132 CKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNP 191
K + AP +S+ + YA+ DGA+ + + E ++RL ++Q+ +V + H GLNP
Sbjct: 1223 RKTAEEEFAP--QSQLI--YATADGAITSVVAVKEARFKRLQLVQDQLVRNAQHVAGLNP 1278
Query: 192 RAFRTYKGKGYYAGNP-SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI 249
RAFRT + P +RG++DG L+ F L + E+ ++IG+ + +LY +
Sbjct: 1279 RAFRTVRND--LVPRPLARGVLDGGLLAHFALQPLRRQREMMRQIGTDAVTVGSDLYTL 1335
>gi|452979579|gb|EME79341.1| hypothetical protein MYCFIDRAFT_104419, partial [Pseudocercospora
fijiensis CIRAD86]
Length = 1342
Score = 81.3 bits (199), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 55/193 (28%), Positives = 90/193 (46%), Gaps = 33/193 (17%)
Query: 65 PGLPRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLG--- 121
P + V + EF+ ++ ++ D D ++ + Y PE +S G RL+ ++ FH+G
Sbjct: 1145 PKMHMEVMSAEFLPFDGALYIVVLDADCDMHVLQYDPENPKSLNGMRLLHRSTFHIGHFT 1204
Query: 122 ----------------QH--VNTFFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLP 163
QH +N K KP P+ LT +S GA+G P
Sbjct: 1205 TNSMLLPSTLASFAAQQHEMMNGGSKAEVKPDPLQ-------HVLT--SSTSGAIGLITP 1255
Query: 164 LPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQL 223
L E+ YRRL LQ + + H GLNPRA+R+ + + + +RG++DG LV + +L
Sbjct: 1256 LDEQAYRRLSALQTHLTSILEHAAGLNPRAYRSIESESFGG---ARGVVDGLLVRRIHEL 1312
Query: 224 SLGERLEICKKIG 236
R ++ + G
Sbjct: 1313 GAARRADVLGRAG 1325
>gi|452001482|gb|EMD93941.1| hypothetical protein COCHEDRAFT_1129958 [Cochliobolus heterostrophus
C5]
Length = 1385
Score = 81.3 bits (199), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 51/160 (31%), Positives = 76/160 (47%), Gaps = 6/160 (3%)
Query: 73 ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI-- 130
A +F+ + +++D D N+ + + P+ +S G RL+ K+ FH G T +
Sbjct: 1194 AIDFIPFEEQLHLLVADADMNLQVLQFDPDNPKSEAGSRLLHKSTFHTGHFPATLHVVHS 1253
Query: 131 RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 190
R K SD + S G L PL E YRRL L + T GLN
Sbjct: 1254 RLKMPSASDFAATQPLHQILCTSQSGTLALVTPLSEDTYRRLSNLSAYLSNTLDATAGLN 1313
Query: 191 PRAFRTYKGK--GYYAGNPSRGIIDGSLVWKFLQLSLGER 228
PRAFR G+ AG +RG++DG+L+ ++ + LGER
Sbjct: 1314 PRAFRASDTPDGGWDAGTGARGMLDGNLLMRWGE--LGER 1351
>gi|395324102|gb|EJF56549.1| hypothetical protein DICSQDRAFT_93527 [Dichomitus squalens LYAD-421
SS1]
Length = 1433
Score = 81.3 bits (199), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 46/164 (28%), Positives = 85/164 (51%), Gaps = 6/164 (3%)
Query: 83 MGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPG 142
+ + D++ V L+ Y P ES GG L+++T+FH + + +P D
Sbjct: 1267 LSIVTCDEEGVVRLYAYDPHDPESKGGQHLLRRTEFHGQTEYRSSLLVARRPK-AGDPEI 1325
Query: 143 ARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGY 202
++R + S+DG+L + E ++RL +LQ ++ H LNP+AFR + +
Sbjct: 1326 PQARLIC--GSVDGSLTTLTYVDENAFKRLHLLQGQLIRTVQHVAALNPKAFRMVRNE-- 1381
Query: 203 YAGNP-SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDE 245
Y P S+G++DG+L+ F L +G + E+ ++IG+ +L +
Sbjct: 1382 YVSRPLSKGVLDGNLLATFEDLPIGRQNEVTRQIGTDRATVLKD 1425
>gi|336388105|gb|EGO29249.1| hypothetical protein SERLADRAFT_445076 [Serpula lacrymans var.
lacrymans S7.9]
Length = 1424
Score = 80.1 bits (196), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 48/180 (26%), Positives = 90/180 (50%), Gaps = 7/180 (3%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
V +F ++ + D D + ++ Y P+ ES G L+ +T+FH + I
Sbjct: 1247 VTNADFFFTDDTLSIVTEDGDGILRMYAYDPDDPESKNGQHLLCRTEFHNHSECRSSLVI 1306
Query: 131 RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 190
+ S P A+ L+ ++ DG+L P+ + +++RL +LQ + + H GLN
Sbjct: 1307 ARRTKEESVLPQAK--ILSAFS--DGSLSSLTPVDDASFKRLQLLQGQLTRNIQHVAGLN 1362
Query: 191 PRAFRTYKGKGYYAGNP-SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI 249
PRA+R + + P S+ I+DG L+ F L + + E+ K+IG++ N +L + ++
Sbjct: 1363 PRAYRIVRND--FVSKPLSKDILDGQLLSAFESLPISRQNEMTKQIGTERNIVLHDWMEL 1420
>gi|425765419|gb|EKV04111.1| Cleavage and polyadenylation specificity factor subunit A, putative
[Penicillium digitatum Pd1]
gi|425767100|gb|EKV05682.1| Cleavage and polyadenylation specificity factor subunit A, putative
[Penicillium digitatum PHI26]
Length = 1271
Score = 80.1 bits (196), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 50/188 (26%), Positives = 93/188 (49%), Gaps = 13/188 (6%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
V A +F+ + + + +++D D N+ + Y PE +S+ G RL+ ++ F+ G ++ +
Sbjct: 1080 VVAADFLPDGNKLYMLVADSDCNLHVLQYDPEDPKSSNGDRLLSRSKFYTGNFASSVTLL 1139
Query: 131 ---------RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVT 181
D +++ AS +G+L + E++YRRL LQ+ ++
Sbjct: 1140 PRTAVSSELTESSEEAMDVDETFAKYQVLIASQNGSLALVTSVAEESYRRLSGLQSQLIN 1199
Query: 182 HTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHND 241
H GLN RAFR + G AG RG++DG+L+ +L + + EI ++G+ +
Sbjct: 1200 TVDHPAGLNARAFRATESDG-AAG---RGMVDGNLLRLWLNMGKQRQAEIAGRVGATEWE 1255
Query: 242 ILDELYDI 249
I +L I
Sbjct: 1256 IKADLETI 1263
>gi|336375160|gb|EGO03496.1| hypothetical protein SERLA73DRAFT_165174 [Serpula lacrymans var.
lacrymans S7.3]
Length = 1428
Score = 80.1 bits (196), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 48/180 (26%), Positives = 90/180 (50%), Gaps = 7/180 (3%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
V +F ++ + D D + ++ Y P+ ES G L+ +T+FH + I
Sbjct: 1251 VTNADFFFTDDTLSIVTEDGDGILRMYAYDPDDPESKNGQHLLCRTEFHNHSECRSSLVI 1310
Query: 131 RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 190
+ S P A+ L+ ++ DG+L P+ + +++RL +LQ + + H GLN
Sbjct: 1311 ARRTKEESVLPQAK--ILSAFS--DGSLSSLTPVDDASFKRLQLLQGQLTRNIQHVAGLN 1366
Query: 191 PRAFRTYKGKGYYAGNP-SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI 249
PRA+R + + P S+ I+DG L+ F L + + E+ K+IG++ N +L + ++
Sbjct: 1367 PRAYRIVRND--FVSKPLSKDILDGQLLSAFESLPISRQNEMTKQIGTERNIVLHDWMEL 1424
>gi|66812672|ref|XP_640515.1| CPSF domain-containing protein [Dictyostelium discoideum AX4]
gi|60468551|gb|EAL66554.1| CPSF domain-containing protein [Dictyostelium discoideum AX4]
Length = 1628
Score = 80.1 bits (196), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 52/187 (27%), Positives = 91/187 (48%), Gaps = 25/187 (13%)
Query: 70 GVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFK 129
+++ EF++ ++ ++SD DKN++LF ++P+ S G + Q +N K
Sbjct: 1453 NIFSTEFIVNQKTLSILVSDLDKNILLFSFEPQDPSSRSG---------QINQEINGNNK 1503
Query: 130 IRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGL 189
+D + L + +LDG L PL EK Y +Q+ + + T GL
Sbjct: 1504 --------NDNRLPKKEQLVIFGTLDGGLNVLRPLDEKIYLLFYHIQSKLY-YLPQTAGL 1554
Query: 190 NPRAFRTYKG-KGYYAGNPS------RGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
NP+ +R++K + +PS + I+DG L+ KFL LS E+ I I S ++I
Sbjct: 1555 NPKQYRSFKSFSQNFHFSPSTFHQLPKFILDGDLISKFLSLSQSEKRLISNSINSTSDEI 1614
Query: 243 LDELYDI 249
++ L D+
Sbjct: 1615 IESLKDV 1621
>gi|302694047|ref|XP_003036702.1| hypothetical protein SCHCODRAFT_63425 [Schizophyllum commune H4-8]
gi|300110399|gb|EFJ01800.1| hypothetical protein SCHCODRAFT_63425 [Schizophyllum commune H4-8]
Length = 1396
Score = 79.7 bits (195), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 48/172 (27%), Positives = 87/172 (50%), Gaps = 11/172 (6%)
Query: 75 EFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDF--HLGQHVNTFFKIRC 132
+F ++ + +D++ + ++ Y PEA +S+ G L+K+T+F H +T R
Sbjct: 1223 DFFFAEEALAIVTTDENGVMSMYNYDPEAPDSHDGRLLLKQTEFNLHTDFRTSTLIARRT 1282
Query: 133 KPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPR 192
K PI + + + DG L P+P+ +RL LQ + + H GLNP+
Sbjct: 1283 KDDPII------PQGILIFGGTDGTLSCLTPVPDDAAKRLQPLQLQLTRNMQHVAGLNPK 1336
Query: 193 AFRTYKGKGYYAGNP-SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDIL 243
A R + + + P S+GI+DG+L+ F L + + E+ ++IG++ IL
Sbjct: 1337 ALRIVRNE--HVSRPLSKGILDGNLIAYFEHLPITRQDEMTRQIGTERATIL 1386
>gi|258575565|ref|XP_002541964.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
gi|237902230|gb|EEP76631.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
Length = 1376
Score = 79.3 bits (194), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 49/167 (29%), Positives = 81/167 (48%), Gaps = 17/167 (10%)
Query: 93 NVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGAR-------- 144
VV + P+ S+ G RL+ ++ FH G ++T I P S GA
Sbjct: 1210 QVVAADFLPDDPSSSKGDRLLHRSSFHTGHFISTLTLI---PQYTSSGTGASEDNMDVDY 1266
Query: 145 --SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGY 202
+ + S G++G PL E+ YRRL LQ+ +V H GLNP+A+R + G+
Sbjct: 1267 MPAGYQVVVTSQSGSVGVITPLTEETYRRLSALQSQLVMSMEHPCGLNPKAYRAVESDGF 1326
Query: 203 YAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI 249
RG++DG+L+ ++L + + + EI ++G+ I +L I
Sbjct: 1327 SG----RGLVDGNLLLRWLDMGVQRKAEIAGRVGADLQSIRADLERI 1369
>gi|255718033|ref|XP_002555297.1| KLTH0G05984p [Lachancea thermotolerans]
gi|238936681|emb|CAR24860.1| KLTH0G05984p [Lachancea thermotolerans CBS 6340]
Length = 1307
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 56/183 (30%), Positives = 93/183 (50%), Gaps = 8/183 (4%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
V + EF++ + S+ F++SD+ + + Y P+ S G +L+ T F+L NT K+
Sbjct: 1123 VMSLEFLVNNGSIYFLVSDRSNILHILKYAPDEPNSLSGQKLVHCTSFNL-HSTNTCMKL 1181
Query: 131 RCKPS--PISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGG 188
K P P A F A DG+L +PL E +YRRL M+Q ++ H G
Sbjct: 1182 LLKNDEFPTLGEPPA---FQAIGAQTDGSLFNVVPLSESSYRRLYMVQQQLIEKDVHLCG 1238
Query: 189 LNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHN-DILDELY 247
LNP+ R + Y G+ R ++D +++ F L L +R +I K G + + +I +L
Sbjct: 1239 LNPKMER-LQNDFYQLGHLMRPMLDFTVIKSFATLPLNKRKQIAAKAGRQADFEIWRDLI 1297
Query: 248 DIE 250
++E
Sbjct: 1298 NVE 1300
>gi|389740693|gb|EIM81883.1| hypothetical protein STEHIDRAFT_65512 [Stereum hirsutum FP-91666 SS1]
Length = 1438
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 48/175 (27%), Positives = 89/175 (50%), Gaps = 8/175 (4%)
Query: 73 ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQ--HVNTFFKI 130
+ +F + + + D++ + ++ Y P ES G RL+ +FH GQ + ++
Sbjct: 1262 SADFFFANQQLSIVTCDEEGILRMYHYNPHDPESKNGQRLLCHAEFH-GQIEYRSSLTIA 1320
Query: 131 RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 190
R P ++ P A+ S DG+L +P+ E ++RL +LQ + + HT LN
Sbjct: 1321 RRTKGPDTEIPQAK----LICGSPDGSLSALVPVEEAAFKRLHLLQGQLTRNVQHTAALN 1376
Query: 191 PRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDE 245
PRAFR + + Y + +G +DG L+ F L + ++EI ++IG++ +L +
Sbjct: 1377 PRAFRAVRNE-YVSKTLHKGFLDGLLLRSFEDLPVSRQIEITRQIGTERRLVLKD 1430
>gi|393245434|gb|EJD52944.1| hypothetical protein AURDEDRAFT_81080 [Auricularia delicata TFB-10046
SS5]
Length = 1422
Score = 78.6 bits (192), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 51/176 (28%), Positives = 87/176 (49%), Gaps = 4/176 (2%)
Query: 70 GVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFK 129
+ + EF M + +D+ + +F Y P E+ G +L+ +T+F+
Sbjct: 1243 ALTSAEFFFGFGEMTIVSTDEQNVLRIFRYDPMHAEAQDGQKLLCQTEFNTQSDARGTTT 1302
Query: 130 IRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGL 189
I + S D +S+ + Y DG+L LP+ E ++RL +LQ M + H GL
Sbjct: 1303 ILRRTSD-EDILLPQSKIM--YCGTDGSLSALLPVEEHVFKRLHLLQGQMTRNIQHVAGL 1359
Query: 190 NPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDE 245
+P+AFR + + A +RGI+D +L+ KF +L L ++E K+IG IL +
Sbjct: 1360 HPKAFRVVRND-FTARPLARGILDSNLLAKFEELPLSRQVEFTKQIGQSREVILGD 1414
>gi|226290902|gb|EEH46330.1| cleavage and polyadenylation specificity factor subunit A
[Paracoccidioides brasiliensis Pb18]
Length = 1343
Score = 78.2 bits (191), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 50/157 (31%), Positives = 76/157 (48%), Gaps = 18/157 (11%)
Query: 106 SNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARSRFL----------TWYASLD 155
S G RL+ ++ FH GQ +T + + S +S P A + + S
Sbjct: 1187 SAKGDRLLHRSTFHTGQFASTL-TLLPRTSVLSQGPEAEANAMDLDSSGPLHQVLVTSET 1245
Query: 156 GALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGS 215
G++ P+ E YRRL LQ+ M+ H GLNPRAFR + G RG++DG
Sbjct: 1246 GSIALITPVSEMAYRRLSALQSQMINTLEHPCGLNPRAFRAVESDGIGG----RGMVDGD 1301
Query: 216 LVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEAL 252
LV K+L L + EI ++G+ D+ + D+EA+
Sbjct: 1302 LVQKWLDLGTQRKAEIASRVGA---DVWEIRADLEAI 1335
>gi|156847699|ref|XP_001646733.1| hypothetical protein Kpol_1023p44 [Vanderwaltozyma polyspora DSM
70294]
gi|156117413|gb|EDO18875.1| hypothetical protein Kpol_1023p44 [Vanderwaltozyma polyspora DSM
70294]
Length = 1337
Score = 77.8 bits (190), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 47/179 (26%), Positives = 95/179 (53%), Gaps = 3/179 (1%)
Query: 73 ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC 132
A EF++ + + F++SD++ + + Y P+ S G RL + F++ NT +
Sbjct: 1151 ALEFLVNNGDIFFIVSDRNDILHVLKYAPDEPNSLSGQRLAHYSSFNI-HSTNTSMILLP 1209
Query: 133 KPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPR 192
+ +P ++ F + + +DG++ +PL E ++RRL ++Q ++ GGLNPR
Sbjct: 1210 SNNEFQSSPNGQATFQSVGSCVDGSIFKVIPLDEDSFRRLYVIQQQVIDTEIQAGGLNPR 1269
Query: 193 AFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK-HNDILDELYDIE 250
R + Y + R ++D +++ +F LS+ +R +I +K G + H D+ ++ +IE
Sbjct: 1270 MER-LSNEYYQLVHLMRPMLDFNIIRRFSNLSITKRTKIAQKAGRRAHFDVWRDMINIE 1327
>gi|358056450|dbj|GAA97624.1| hypothetical protein E5Q_04302 [Mixia osmundae IAM 14324]
Length = 1305
Score = 77.8 bits (190), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 51/180 (28%), Positives = 89/180 (49%), Gaps = 5/180 (2%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
VY +F+++ + +G + + + Y P ES G RL+ +T++HLG+ +
Sbjct: 1127 VYCVDFLVQENRLGLLSVSRKGLLRQLEYNPGNAESRAGERLLDRTEYHLGKQIIDSLSF 1186
Query: 131 RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 190
+ S +D +S + A DG+L + P+ E YRRL +L+ + H GLN
Sbjct: 1187 AKRLS--TDEDLRQSGVMLVGA--DGSLTWVTPVREVVYRRLALLERQLHRQLPHFAGLN 1242
Query: 191 PRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIE 250
PRAFRT + YY+ +RG++DG L+ + L + + I S + + L ++E
Sbjct: 1243 PRAFRTARND-YYSRPLARGMLDGDLLAIYANLHASRQQSLASHINSDPDTLSVNLGNLE 1301
>gi|367014525|ref|XP_003681762.1| hypothetical protein TDEL_0E03080 [Torulaspora delbrueckii]
gi|359749423|emb|CCE92551.1| hypothetical protein TDEL_0E03080 [Torulaspora delbrueckii]
Length = 1327
Score = 77.4 bits (189), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 53/179 (29%), Positives = 87/179 (48%), Gaps = 3/179 (1%)
Query: 73 ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC 132
+ EF++ + F I+D D + +F Y P+ S G RL+ T F+L NT +
Sbjct: 1143 SVEFLVNGGDIFFAITDTDDILHVFKYAPDEPNSLSGQRLLHCTSFNL-HSTNTCMALLP 1201
Query: 133 KPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPR 192
K A F +DG++ LPL E YRRL ++Q + GGLNPR
Sbjct: 1202 KNEEFEPAQANMKNFQAIGGQVDGSVFKLLPLREDVYRRLYVVQQQITEKELQLGGLNPR 1261
Query: 193 AFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK-HNDILDELYDIE 250
R + Y + R ++D +++ +F +LS R +I +K+G + H +I +L ++E
Sbjct: 1262 MER-LSNEHYKTTHVLRPMLDFNVIQRFKRLSTDRRKQISQKVGKRAHFEIWRDLINVE 1319
>gi|407929511|gb|EKG22329.1| Cleavage/polyadenylation specificity factor A subunit [Macrophomina
phaseolina MS6]
Length = 1418
Score = 76.6 bits (187), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 58/213 (27%), Positives = 88/213 (41%), Gaps = 41/213 (19%)
Query: 51 VLFVSDRSKRANEQPGLPRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGH 110
+LF RSK V A +F+ + M++D D N+ Y P+ +S G
Sbjct: 1199 ILFGKSRSKME---------VMAADFLPHDKQLYLMVADGDCNLHALQYDPDHPKSLSGQ 1249
Query: 111 RLIKKT---------------------------DFHLGQHVNTFFKIRCKPSPISDAPGA 143
RL+ K+ D H HV+ I AP
Sbjct: 1250 RLLHKSTFHTGHFTTTMTLLPSSLSPTVSPSSADEHANGHVSPSPSPENDAMDIDPAPAG 1309
Query: 144 RSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYY 203
+ + + G+L PL E+ YRRL LQ ++ H GLNPRA+R + +G+
Sbjct: 1310 TVQHI-LLTTQTGSLALLTPLSEQQYRRLGALQTYLIGALEHWCGLNPRAYRAVESEGFG 1368
Query: 204 AGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 236
SRGI+DG+L+ ++ +L R E K+G
Sbjct: 1369 ----SRGIVDGALLARWCELGSQRRAEGAAKVG 1397
>gi|392585051|gb|EIW74392.1| hypothetical protein CONPUDRAFT_133073 [Coniophora puteana RWD-64-598
SS2]
Length = 1490
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 46/174 (26%), Positives = 87/174 (50%), Gaps = 7/174 (4%)
Query: 73 ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC 132
+ +F + + + D++ + + Y P ES G +L+ T+FH + +T I
Sbjct: 1315 SVDFFFANGELSIVTEDEEGVLRAYEYDPNDPESRSGQQLLCHTEFHGHKECSTTLTIAR 1374
Query: 133 KPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPR 192
+ + P +++ ++ + DG+L P+ E ++RL +LQ + + H GLNPR
Sbjct: 1375 RTKTEHEIP--QAKLISGFG--DGSLSALTPVDEAAFKRLQLLQGQLTRNVQHIAGLNPR 1430
Query: 193 AFRTYKGKGYYAGNP-SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDE 245
AFR + + P S+GI+DG L+ F + + E+ ++IG++ IL E
Sbjct: 1431 AFRIVRNE--TVSKPLSKGILDGQLLSSFEAQGITRQGEMTRQIGTERTTILQE 1482
>gi|363750592|ref|XP_003645513.1| hypothetical protein Ecym_3197 [Eremothecium cymbalariae DBVPG#7215]
gi|356889147|gb|AET38696.1| Hypothetical protein Ecym_3197 [Eremothecium cymbalariae DBVPG#7215]
Length = 1318
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 53/183 (28%), Positives = 93/183 (50%), Gaps = 7/183 (3%)
Query: 73 ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC 132
+ EF++ + M F+I+D+ + + + Y P+ S G RL+ T F+L +NT ++
Sbjct: 1127 SLEFLVNNGDMYFIITDRSQVMHVLKYAPDEPNSLSGQRLVYCTSFNL-HSINTCMRLIQ 1185
Query: 133 KPSPISDA----PGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGG 188
K + D S F +DG++ +PL E +YRRL ++Q ++ G
Sbjct: 1186 KNNEFVDLRRNYGSHMSTFQCIGCHIDGSIFKVVPLTESSYRRLYLVQQQIIDKEVQLCG 1245
Query: 189 LNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK-HNDILDELY 247
LNPR R + Y G+ R ++D +++ KF LS+ +R + K G + H ++ +L
Sbjct: 1246 LNPRMER-LQNPYYQLGHLLRPMLDFTILKKFSTLSISKRRSMASKAGHQAHTEVWRDLI 1304
Query: 248 DIE 250
DIE
Sbjct: 1305 DIE 1307
>gi|169864473|ref|XP_001838845.1| cleavage factor protein [Coprinopsis cinerea okayama7#130]
gi|116500065|gb|EAU82960.1| cleavage factor protein [Coprinopsis cinerea okayama7#130]
Length = 1458
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 51/165 (30%), Positives = 87/165 (52%), Gaps = 13/165 (7%)
Query: 83 MGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQ---HVNTFFKIRCKPSPISD 139
+ ++ D++ + ++ Y P +S G L+ +T++H GQ H +T R K P
Sbjct: 1293 LALIVGDEEGIMRIYEYNPNDPDSRDGRYLLLRTEYH-GQVPYHTSTTIARRDKEDP--S 1349
Query: 140 APGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKG 199
P +S L S DG+L +P+ E ++RL +LQ + + H GLNP+AFR K
Sbjct: 1350 IP--QSHLL--IGSADGSLSSLVPVDEYAFKRLQLLQGQLTRNIQHVAGLNPKAFRIVKN 1405
Query: 200 KGYYAGNP-SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDIL 243
Y P S+GI+DG L+ ++ L + + E+ K+IG++ +L
Sbjct: 1406 D--YVSKPLSKGILDGQLLAQYESLPIPRQNEMTKQIGTERGVVL 1448
>gi|324518783|gb|ADY47203.1| Cleavage and polyadenylation specificity factor subunit 1 [Ascaris
suum]
Length = 108
Score = 75.9 bits (185), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 40/110 (36%), Positives = 67/110 (60%), Gaps = 7/110 (6%)
Query: 60 RANEQPGLPRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFH 119
RA QP + A +F+I++ M F++SD+ N+ +F Y PEA ES+GG RLI +++ +
Sbjct: 2 RAVVQPPM-----AAQFLIDNRQMAFIMSDEAANIAVFNYLPEALESSGGERLILRSEIN 56
Query: 120 LGQHVNTFFKIRCKPSP--ISDAPGARSRFLTWYASLDGALGFFLPLPEK 167
+G +VN+F +++ S + + + +R + SLDG+ GF PL EK
Sbjct: 57 IGTNVNSFMRVKGHISSGFVENEHYSLNRQSVLFCSLDGSFGFVRPLSEK 106
>gi|398397855|ref|XP_003852385.1| hypothetical protein MYCGRDRAFT_100364 [Zymoseptoria tritici IPO323]
gi|339472266|gb|EGP87361.1| hypothetical protein MYCGRDRAFT_100364 [Zymoseptoria tritici IPO323]
Length = 1333
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 48/184 (26%), Positives = 81/184 (44%), Gaps = 19/184 (10%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
V +F+ ++ ++ D D ++ + Y PE +S G RL+ ++ FH+G +
Sbjct: 1130 VIEADFLPFDGALFLLVLDADADLHVLQYDPENPKSLNGQRLLHRSTFHIGHFPTGSMLL 1189
Query: 131 RCKPSPIS----DAPGARSR-----------FLTWYASLDGALGFFLPLPEKNYRRLLML 175
+P + D P S F + G++G PL E YRRL L
Sbjct: 1190 PSTLAPFTEQARDLPNGDSEDTKQEEVNSPLFHVLTTTSSGSIGLITPLDESTYRRLSAL 1249
Query: 176 QNVMVTHTSHTGGLNPRAFRT---YKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEIC 232
Q + H GLNPR +RT K G ++G++DGSL+ + +L R ++
Sbjct: 1250 QGHLTNILEHAAGLNPRMYRTDTEMKATDSEMGG-AKGVVDGSLIRRISELGAARRADVL 1308
Query: 233 KKIG 236
++G
Sbjct: 1309 SRVG 1312
>gi|388581811|gb|EIM22118.1| hypothetical protein WALSEDRAFT_28358 [Wallemia sebi CBS 633.66]
Length = 1259
Score = 75.5 bits (184), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 67/253 (26%), Positives = 108/253 (42%), Gaps = 25/253 (9%)
Query: 16 GNRPLLLVRTQHELLIYQAFRHPKGTLKLRFKKLK-VLFVSDRSKRAN-----EQP---- 65
G + L+ +E LI AF + GT + + LK + V D+ K E P
Sbjct: 1017 GQKVLIRALEINERLISVAF-YDAGTYIVSLEVLKNFILVGDQVKSITFLAFQESPYKLV 1075
Query: 66 ---GLPRGVYAC--EFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHL 120
R + C F+ + F+ +D ++ L Y P + GG +LI+ T+FH
Sbjct: 1076 QLSRDARQIETCVSNFLAHEDQISFVSNDIQGDLRLIDYNPFDPTAEGGEKLIRTTEFHK 1135
Query: 121 GQHVNTFFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMV 180
G S + P R +DG+L P+ E ++ L +LQ +V
Sbjct: 1136 GSEATC--------SLLLPKPSVRPSSELLLGCVDGSLSCLSPVDEITFKALWLLQGALV 1187
Query: 181 THTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHN 240
H LNPRA R + Y + + S+GI+DG L+ + + ++EI K+IG
Sbjct: 1188 RQIPHIAALNPRAHRHVRND-YVSRSLSKGILDGLLLSAYQTIDHATQVEIAKRIGYSKA 1246
Query: 241 DILDELYDIEALS 253
++L L + LS
Sbjct: 1247 ELLGYLRNFSWLS 1259
>gi|448105510|ref|XP_004200513.1| Piso0_003103 [Millerozyma farinosa CBS 7064]
gi|448108635|ref|XP_004201144.1| Piso0_003103 [Millerozyma farinosa CBS 7064]
gi|359381935|emb|CCE80772.1| Piso0_003103 [Millerozyma farinosa CBS 7064]
gi|359382700|emb|CCE80007.1| Piso0_003103 [Millerozyma farinosa CBS 7064]
Length = 1344
Score = 75.5 bits (184), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 46/184 (25%), Positives = 87/184 (47%), Gaps = 5/184 (2%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
V +F+++ + + +D + + + + PE S+ G RL+ KT F++ F
Sbjct: 1160 VSCADFIVKDEEIFILFADNNNVLHVVKFDPEDPLSSNGQRLVHKTSFNINSAATCF--- 1216
Query: 131 RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 190
R P + P + F + +++DG+ P+ E YRR+ +LQ + H GLN
Sbjct: 1217 RTIPKNEENYPSLTTSFQSIGSTIDGSFFTVFPINESTYRRMYILQQQLTDKEFHICGLN 1276
Query: 191 PRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK--HNDILDELYD 248
PR R + S+ +++ ++ KF+ L+ + KIGSK + DI +L +
Sbjct: 1277 PRLNRFGGLNETNSDANSKPMLEYDVIKKFVNLNSDRKKNFASKIGSKNSYQDIWRDLIE 1336
Query: 249 IEAL 252
E++
Sbjct: 1337 FESV 1340
>gi|440637976|gb|ELR07895.1| hypothetical protein GMDG_02777 [Geomyces destructans 20631-21]
Length = 1495
Score = 75.5 bits (184), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 53/180 (29%), Positives = 85/180 (47%), Gaps = 18/180 (10%)
Query: 85 FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIR-----------CK 133
+++D D N+ + + PE +S G L+++T F LG H+ T +
Sbjct: 1316 IIVADSDCNLHVLQFDPEHPKSLHGQLLLQRTTFSLGGHMPTTMTLLPLTTTTQTPTPAV 1375
Query: 134 PSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRA 193
S S+ S L +S G + PL E+ YRRL L N + H GGLNP+A
Sbjct: 1376 TSTASEPTNPASGLLMTLSS--GVVAILTPLSEQQYRRLNALSNHLSNLLYHPGGLNPKA 1433
Query: 194 FRTYKG--KGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEA 251
R + G P I+DGS++W++L+L +R E+ ++G I ++L +I A
Sbjct: 1434 HRISNTAPEAVIGGRP---IVDGSVLWRWLELGSQKRAEVAGRVGVDGETIREDLQEIAA 1490
>gi|426194401|gb|EKV44332.1| hypothetical protein AGABI2DRAFT_187183 [Agaricus bisporus var.
bisporus H97]
Length = 1413
Score = 74.7 bits (182), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 49/182 (26%), Positives = 93/182 (51%), Gaps = 11/182 (6%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
V +F+ + + + D++ + ++ Y P+ +S G L+ +T+FH + T +
Sbjct: 1235 VTRADFLFSENDLRLVTGDEEGIIRIYEYNPQDPDSREGRHLLLETEFHGQREYRTSVLV 1294
Query: 131 --RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGG 188
R K D SR LT S DG+L + E+ ++RL +LQ ++ + H
Sbjct: 1295 AHRIK----EDQSIPNSRLLT--GSADGSLASLTIVEEEAFKRLGLLQGQLMRNIQHMAA 1348
Query: 189 LNPRAFRTYKGKGYYAGNP-SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELY 247
LNP+AFR K + Y P +RGI+DG+L+ ++ L + + E ++IG+ ++L +
Sbjct: 1349 LNPKAFRIVKNE--YVSKPLTRGILDGNLLGQYESLPINRQSEATQQIGADRVNVLRDWI 1406
Query: 248 DI 249
++
Sbjct: 1407 EL 1408
>gi|409076059|gb|EKM76433.1| hypothetical protein AGABI1DRAFT_108759 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 1413
Score = 74.7 bits (182), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 49/182 (26%), Positives = 93/182 (51%), Gaps = 11/182 (6%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
V +F+ + + + D++ + ++ Y P+ +S G L+ +T+FH + T +
Sbjct: 1235 VTRADFLFSENDLRLVTGDEEGIIRIYEYNPQDPDSREGRHLLLETEFHGQREYRTSVLV 1294
Query: 131 --RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGG 188
R K D SR LT S DG+L + E+ ++RL +LQ ++ + H
Sbjct: 1295 AHRIK----EDQSIPNSRLLT--GSADGSLASLTIVEEEAFKRLGLLQGQLMRNIQHMAA 1348
Query: 189 LNPRAFRTYKGKGYYAGNP-SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELY 247
LNP+AFR K + Y P +RGI+DG+L+ ++ L + + E ++IG+ ++L +
Sbjct: 1349 LNPKAFRIVKNE--YVSKPLTRGILDGNLLGQYESLPINRQSEATQQIGADRVNVLRDWI 1406
Query: 248 DI 249
++
Sbjct: 1407 EL 1408
>gi|451849663|gb|EMD62966.1| hypothetical protein COCSADRAFT_92785 [Cochliobolus sativus ND90Pr]
Length = 1405
Score = 74.7 bits (182), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 55/180 (30%), Positives = 81/180 (45%), Gaps = 26/180 (14%)
Query: 73 ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI-- 130
A +F+ + +++D D N+ + + P+ +S G RL+ K+ FH G T +
Sbjct: 1194 AVDFIPFEEQLHLLVADADMNLQVLQFDPDNPKSEAGSRLLHKSTFHTGHFPATLHVVHS 1253
Query: 131 RCKPSPISDAPGARSR----FLTWYASLD----------------GALGFFLPLPEKNYR 170
R K SD GA + F +S D G L PL E YR
Sbjct: 1254 RLKMPSASDFAGANNTENGDFEMDTSSPDDKATQPLHQILCTTQSGTLALVTPLSEDTYR 1313
Query: 171 RLLMLQNVMVTHTSHTGGLNPRAFRTYKGK--GYYAGNPSRGIIDGSLVWKFLQLSLGER 228
RL L + T GLNPRAFR G+ AG +RG++DG+L+ ++ + LGER
Sbjct: 1314 RLSNLSAYLSNTLDATAGLNPRAFRASDTPDGGWDAGTGARGMLDGNLLMRWGE--LGER 1371
>gi|392558419|gb|EIW51607.1| hypothetical protein TRAVEDRAFT_176174 [Trametes versicolor FP-101664
SS1]
Length = 1431
Score = 74.7 bits (182), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 45/164 (27%), Positives = 82/164 (50%), Gaps = 6/164 (3%)
Query: 83 MGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPG 142
+ + D++ V L+ Y P ES G L+++T+FH + + +P D
Sbjct: 1265 LSIVTCDEEGIVRLYAYDPHDPESKSGQHLLRRTEFHGQSEYRSSMLVARRPKN-GDPEI 1323
Query: 143 ARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGY 202
++R + S+DG+L + E +RL +LQ ++ H LNP+AFR + +
Sbjct: 1324 PQARLVC--GSVDGSLSTLTYVDEAASKRLHLLQGQLIRTVQHVAALNPKAFRMVRNE-- 1379
Query: 203 YAGNP-SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDE 245
Y P S+GI+DG+L+ F L + + E+ ++IG+ +L +
Sbjct: 1380 YVSRPLSKGILDGNLLATFEDLPIARQNEVTRQIGTDRATVLKD 1423
>gi|310789917|gb|EFQ25450.1| CPSF A subunit region [Glomerella graminicola M1.001]
Length = 1439
Score = 74.7 bits (182), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 51/187 (27%), Positives = 84/187 (44%), Gaps = 7/187 (3%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLG-QHVNTFFK 129
V +F+I + ++ DKD + + + PE +S GH L+ + F H T
Sbjct: 1246 VLTADFIIAGDELHIVVCDKDGVIHVMQFDPEHPKSLQGHLLLNRASFSAAPNHPTTTLS 1305
Query: 130 IRCKPSPISDAPGARSRFLTWY-ASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGG 188
+ P+ + ++ T AS GAL PL E+ YRRL L N + H
Sbjct: 1306 LPRTPASTATTSATKNPPTTLLLASPTGALASLTPLSEQAYRRLTSLANSIAGALPHAAA 1365
Query: 189 LNPRAFRTYKGKGYYAG---NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDE 245
NP+A R G + R I+DG+L+ ++ +L G R E+ K G + D+L+
Sbjct: 1366 TNPKAHRLQPLDARTPGVDTSAGRSIVDGALLARWNELGAGRRSEVAGKGG--YGDVLEV 1423
Query: 246 LYDIEAL 252
++E +
Sbjct: 1424 RGELEGV 1430
>gi|367052335|ref|XP_003656546.1| hypothetical protein THITE_2121311 [Thielavia terrestris NRRL 8126]
gi|347003811|gb|AEO70210.1| hypothetical protein THITE_2121311 [Thielavia terrestris NRRL 8126]
Length = 1460
Score = 73.9 bits (180), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 52/192 (27%), Positives = 83/192 (43%), Gaps = 26/192 (13%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
V +F+ + + F++SD D + + + PE +S GH L+ +T F+ G H T +
Sbjct: 1246 VLNADFLPDGKELSFVVSDADGYIHILQFDPEHPKSLQGHLLLHRTTFNTGAHHATKSLL 1305
Query: 131 RCKPSPI------------------SD-----APGARSRFLTWYASLDGALGFFLPLPEK 167
+P SD P A+ + AS G L PL E
Sbjct: 1306 LPASTPADKEKNDGNAANAQAKAKASDNKQPREPAAQRPHVLLLASPTGVLAALRPLSES 1365
Query: 168 NYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPS---RGIIDGSLVWKFLQLS 224
YRRL L + H GLNPR +R + AG + R I+DG+++ +F +L
Sbjct: 1366 AYRRLSSLAAQLTNSLPHPAGLNPRGYRAAGAECPPAGVDAGLGRSIVDGTVLERFAELG 1425
Query: 225 LGERLEICKKIG 236
+ R+E+ + G
Sbjct: 1426 MARRVELAGRAG 1437
>gi|296414526|ref|XP_002836950.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295632796|emb|CAZ81141.1| unnamed protein product [Tuber melanosporum]
Length = 1468
Score = 73.9 bits (180), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 49/204 (24%), Positives = 85/204 (41%), Gaps = 38/204 (18%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
+ + EF+ + + + F++ D + N+ Y PE +S G RLI++ DF G ++T +
Sbjct: 1244 IISAEFLPDGNQLYFVVVDAESNIHTLQYDPEHPKSLAGQRLIRRADFFSGHEISTLTML 1303
Query: 131 RCKPSPISDAPGAR---------------------SRFLTWYASLDGALGFFLPLPEKNY 169
P +S + + + + G+L +PE Y
Sbjct: 1304 PFSPYSLSASSNSHLPADATDTSPLHHHHQNQQQQQEYFVLAGTQTGSLAMIRTIPETAY 1363
Query: 170 RRLLMLQNVMVTHTSHTGGLNPRAFRTY-----------------KGKGYYAGNPSRGII 212
RRL ++Q +V H GLNPR +R G G+ RG++
Sbjct: 1364 RRLNIVQGQIVNGEEHVAGLNPREYRAVVNYSGGGGGGAGGGGWGGSGGGVGGDTMRGVL 1423
Query: 213 DGSLVWKFLQLSLGERLEICKKIG 236
DG LV +++ L+ G + E+ K G
Sbjct: 1424 DGGLVSRWIGLAEGRKGEVSAKAG 1447
>gi|402219312|gb|EJT99386.1| hypothetical protein DACRYDRAFT_17537 [Dacryopinax sp. DJM-731 SS1]
Length = 1620
Score = 73.6 bits (179), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 65/240 (27%), Positives = 112/240 (46%), Gaps = 21/240 (8%)
Query: 26 QHELLIYQAFRHPKGTLKLRFKKLKVLFVSDRSKRA------NEQPGLPRGVYACE--FV 77
Q E L+ AF G +LK L + +KR+ E P R Y + F+
Sbjct: 1388 QDERLVGVAFMD-AGVCVTSLTRLKNLLLIGDAKRSVSFVAFQEDPFKLRPTYVTDAAFL 1446
Query: 78 IESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFF-----KIRC 132
+ + +D + + LF + P + G+ LI +T+F+ GQ +T + R
Sbjct: 1447 FDEGDFSILAADDEGTLRLFEFDPNLTGATHGNPLICETEFN-GQSEHTHILAIAGRGRE 1505
Query: 133 KPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPR 192
P + P A+ F T +DG LG P+P++ ++RL +L ++ H GLNPR
Sbjct: 1506 DPEEMQ-IPEAQLIFGT----IDGTLGTISPVPDECFKRLQLLSGQLMRSVQHFAGLNPR 1560
Query: 193 AFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEAL 252
AFRT + + ++G++D L+ F +L + + I K+IG+ IL ++ +E +
Sbjct: 1561 AFRTVRND-LLSRPLNKGMLDYDLLHAFRELDIRRQATITKQIGTDTITILRDIRSLEEI 1619
>gi|254580509|ref|XP_002496240.1| ZYRO0C13816p [Zygosaccharomyces rouxii]
gi|238939131|emb|CAR27307.1| ZYRO0C13816p [Zygosaccharomyces rouxii]
Length = 1331
Score = 73.6 bits (179), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 53/179 (29%), Positives = 86/179 (48%), Gaps = 3/179 (1%)
Query: 73 ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC 132
A EF++ + F ++D + +F Y P+ S G RL+ T F+L NT +
Sbjct: 1147 ALEFLVNGGDIFFAVTDTSNILHIFKYAPDEPNSLSGQRLVHCTSFNL-HSTNTCMVLLP 1205
Query: 133 KPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPR 192
K S + S DG+L +PL E YRRL +LQ + GGLNPR
Sbjct: 1206 KNEEFSVGEKSLSPVQVVGGQTDGSLFKLVPLREDTYRRLYVLQQQLTEKEVQLGGLNPR 1265
Query: 193 AFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK-HNDILDELYDIE 250
R + Y+ + R +++ +++ +F LS+ +R + +K G + H DI +L +IE
Sbjct: 1266 MER-LSNEYYHLTHAVRPMLEFNVIRRFNTLSVEKRKQTAQKAGRRAHFDIWRDLVNIE 1323
>gi|408396642|gb|EKJ75797.1| hypothetical protein FPSE_03977 [Fusarium pseudograminearum CS3096]
Length = 1427
Score = 72.8 bits (177), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 48/169 (28%), Positives = 82/169 (48%), Gaps = 4/169 (2%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
V +F+ + + + +D D ++ + + PE +S GH L+ +T F + + + +
Sbjct: 1237 VVVADFLPDGDDLAIVAADVDGDLHILEFNPEHPKSLQGHLLLHRTSFSVSPNPPSTTLL 1296
Query: 131 RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 190
+ +P S + AS G L +PLPE YRRLL + N ++ + GGLN
Sbjct: 1297 LPRTTPPSHPTPQDPPHVLLLASSSGHLSSLIPLPETAYRRLLSVTNQLLPALTPHGGLN 1356
Query: 191 PRAFRTYKGK---GYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 236
+A R G G A R I+DG+++ ++ +LS +R EI K G
Sbjct: 1357 AKAHRLPVGTRTVGVEAAG-GRAIVDGAVLARWAELSAAKRAEIAGKGG 1404
>gi|406865186|gb|EKD18229.1| CPSF A subunit region [Marssonina brunnea f. sp. 'multigermtubi'
MB_m1]
Length = 1443
Score = 72.8 bits (177), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 55/183 (30%), Positives = 88/183 (48%), Gaps = 23/183 (12%)
Query: 85 FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI--RCK------PSP 136
+ +D D N+ + + PE +S GH L+ ++ F LG H+ T + R K PSP
Sbjct: 1262 IVAADSDCNLHIMQFDPEHPKSLQGHLLLHRSTFALGGHLPTSMTLLPRTKSATLLPPSP 1321
Query: 137 -----ISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNP 191
+DA L S G + PL E YRRL L + ++ H GLNP
Sbjct: 1322 DAMDTAADATIPEHEIL--ITSSTGCISLLTPLSEAQYRRLSTLTSHLINTLYHACGLNP 1379
Query: 192 RAFRTYKG--KGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI 249
RA+R K +G SR +IDG+++ ++++L R E+ ++G D+L+ D+
Sbjct: 1380 RAYRVDKDAPEGMVG---SRTVIDGNILMRWMELGSQRRAEVAGRVGV---DVLEVREDL 1433
Query: 250 EAL 252
+L
Sbjct: 1434 ASL 1436
>gi|46120520|ref|XP_385083.1| hypothetical protein FG04907.1 [Gibberella zeae PH-1]
Length = 1436
Score = 72.8 bits (177), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 48/169 (28%), Positives = 82/169 (48%), Gaps = 4/169 (2%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
V +F+ + + + +D D ++ + + PE +S GH L+ +T F + + + +
Sbjct: 1237 VVVADFLPDGDDLAIVAADVDGDLHILEFNPEHPKSLQGHLLLHRTSFSVSPNPPSTTLL 1296
Query: 131 RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 190
+ +P S + AS G L +PLPE YRRLL + N ++ + GGLN
Sbjct: 1297 LPRTTPPSHPTPQDPPHVLLLASSSGHLSSLIPLPETAYRRLLSVTNQLLPALTPHGGLN 1356
Query: 191 PRAFRTYKGK---GYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 236
+A R G G A R I+DG+++ ++ +LS +R EI K G
Sbjct: 1357 AKAHRLPVGTRTVGVEAAG-GRAIVDGAVLARWAELSAAKRAEIAGKGG 1404
>gi|366994686|ref|XP_003677107.1| hypothetical protein NCAS_0F02680 [Naumovozyma castellii CBS 4309]
gi|342302975|emb|CCC70752.1| hypothetical protein NCAS_0F02680 [Naumovozyma castellii CBS 4309]
Length = 1340
Score = 72.8 bits (177), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 47/179 (26%), Positives = 87/179 (48%), Gaps = 3/179 (1%)
Query: 73 ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC 132
+ EF+I + + F I+D++ + + Y P+ + G +L+ + F+L NT +
Sbjct: 1154 SVEFLINAGEINFAITDREDILHVLKYAPDEPNTLSGQKLVHCSSFNLYSS-NTCMLMLP 1212
Query: 133 KPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPR 192
+ + A +F +DG + +PL E YRRL ++Q ++ GGLNPR
Sbjct: 1213 RNDEFETSDKAPPKFQAIGGQVDGGIFKIIPLKEDTYRRLYVVQQQIIDKEVQLGGLNPR 1272
Query: 193 AFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK-HNDILDELYDIE 250
R Y + R +ID +++ +F +LS+ R +K G + H DI ++ ++E
Sbjct: 1273 MER-LDNDFYQLTHVMRPMIDFNIIRRFSELSIERRTHFAQKAGRRAHFDIWRDIINVE 1330
>gi|302924728|ref|XP_003053954.1| predicted protein [Nectria haematococca mpVI 77-13-4]
gi|256734895|gb|EEU48241.1| predicted protein [Nectria haematococca mpVI 77-13-4]
Length = 1429
Score = 72.8 bits (177), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 48/167 (28%), Positives = 83/167 (49%), Gaps = 6/167 (3%)
Query: 74 CEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI--R 131
+F+ + + + +D D ++ + + PE +S GH L+ +T F + + T + R
Sbjct: 1242 ADFLPDGEDLAIVAADADGDLHILEFNPEHPKSLQGHLLLHRTTFSVSPNPPTSMLLLPR 1301
Query: 132 CKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNP 191
P P +P S+ L AS G L +PLPE YRRLL + N ++ + GGLN
Sbjct: 1302 TTP-PAHPSPSDPSQIL-LLASPSGHLSTLVPLPEATYRRLLSVTNQLLPALTPYGGLNA 1359
Query: 192 RAFRTYKGKGYYAGNPSRG--IIDGSLVWKFLQLSLGERLEICKKIG 236
+ +R G + + G I+DG+++ ++ +L +R EI K G
Sbjct: 1360 KGYRLPSGTRPVGVDAAAGRTIVDGAILARWAELGAAKRAEIAGKGG 1406
>gi|320591495|gb|EFX03934.1| cleavage and polyadenylation specificity factor subunit [Grosmannia
clavigera kw1407]
Length = 1461
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 47/173 (27%), Positives = 76/173 (43%), Gaps = 21/173 (12%)
Query: 85 FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF-------FKIRCKPSPI 137
+++D D N+ + + PE +S GH L+ + F G H +T F +P+
Sbjct: 1266 IVVTDADGNLHVMQFDPEHPKSLQGHILLHRATFCTGAHFSTLSLLLPSTFTPADRPTAN 1325
Query: 138 SDAPGARSR--------FLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGL 189
+ GA S+ S G L +PL E YRRL L + T + T GL
Sbjct: 1326 GETNGASSQPEAQQHQQHQLLLGSPTGLLASLVPLSESEYRRLSSLAGQLATSLTQTAGL 1385
Query: 190 NPRAFRTYKGKGYYAGNP------SRGIIDGSLVWKFLQLSLGERLEICKKIG 236
NP+ +R G P R ++DG+L+ ++ +L G + EI ++G
Sbjct: 1386 NPKGYRMTAGSAAATLAPGVDAAVGRSVVDGALLARWTELGSGRKGEIAGRVG 1438
>gi|393220097|gb|EJD05583.1| cleavage factor protein [Fomitiporia mediterranea MF3/22]
Length = 1450
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 49/177 (27%), Positives = 93/177 (52%), Gaps = 11/177 (6%)
Query: 71 VYACEFVIESSSMGFM-ISDKDKNVVLFMYQPEARESNGGHRLIKKTDFH--LGQHVNTF 127
V +F+ S ++ +SD++ + L Y ES+ G L+++T++H + H
Sbjct: 1271 VMTADFLFASDGDFYIAVSDEEGIIRLLEYDTSDPESHSGQYLLRRTEYHAQVESHTTVL 1330
Query: 128 FKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPL-PEKNYRRLLMLQNVMVTHTSHT 186
R + +D ++R ++ A++DG++ P+ +++ +RL +LQ + + H
Sbjct: 1331 IARRSQ----NDGLVPQARLIS--AAVDGSMYALTPVDADESAKRLQLLQGQLTRNMQHV 1384
Query: 187 GGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDIL 243
GLNPRAFR + G A ++GI+DG+L+ F QL + + EI + IG+ +L
Sbjct: 1385 AGLNPRAFRAVRSDG-VARPLTKGILDGNLLAGFEQLPIPRQNEIARPIGTDRLAVL 1440
>gi|336276223|ref|XP_003352865.1| hypothetical protein SMAC_04980 [Sordaria macrospora k-hell]
gi|380092984|emb|CCC09221.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 1486
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 51/180 (28%), Positives = 82/180 (45%), Gaps = 15/180 (8%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
V +F+ + + + SD D ++ + + PE +S GH L+ +T F+ G H T +
Sbjct: 1246 VLNADFLPDGKELYIVASDADGHIHILQFDPEHPKSLQGHLLLHRTTFNTGAHHPTSSLL 1305
Query: 131 RCKPSPISDAPGARSRF------LTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTS 184
P + +P + S + AS G L PL E YRRL L +
Sbjct: 1306 LPAVYPTTTSPNSNSEVGENPPHILLLASPTGLLATLRPLQENAYRRLSSLAIQLTNALP 1365
Query: 185 HTGGLNPRAFR--------TYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 236
H GLNP+ +R + + G AG R I+DG ++ +F++L G+R EI + G
Sbjct: 1366 HPAGLNPKGYRLPSPSASASMQLPGVDAGI-GRNIVDGKILERFMELGTGKRQEIAGRAG 1424
>gi|325094074|gb|EGC47384.1| cleavage factor two protein 1 [Ajellomyces capsulatus H88]
Length = 1377
Score = 72.0 bits (175), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 48/192 (25%), Positives = 87/192 (45%), Gaps = 18/192 (9%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
+++ + SS + D+++ + + + + S+ G RL+ ++ F G +T +
Sbjct: 1186 LFSARMTVPSSDDADFLPDENR-LYILVADDDYPGSSKGDRLLHRSTFQTGHFASTMTLL 1244
Query: 131 RCKPSPISDAPGARSRFL----------TWYASLDGALGFFLPLPEKNYRRLLMLQNVMV 180
+ S P A + S G++ P+ E +YRRL LQ+ +
Sbjct: 1245 PRTATSSSQGPDADPDMMDLDSSGPLHHVLVTSETGSIALITPVSETSYRRLSALQSQLA 1304
Query: 181 THTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHN 240
H GLNPRAFR + G RG++DG LV ++L L + EI ++G+
Sbjct: 1305 NTLEHPCGLNPRAFRAVESDGIGG----RGMVDGDLVKRWLDLGTQRKAEIANRVGA--- 1357
Query: 241 DILDELYDIEAL 252
D+ + D+EA+
Sbjct: 1358 DVWEIRADLEAI 1369
>gi|255075065|ref|XP_002501207.1| predicted protein [Micromonas sp. RCC299]
gi|226516471|gb|ACO62465.1| predicted protein [Micromonas sp. RCC299]
Length = 1423
Score = 72.0 bits (175), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 48/162 (29%), Positives = 80/162 (49%), Gaps = 18/162 (11%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
+ A EF+++ S++ + +D N +F Y P++ ES G +L+ K FH+G V+ +
Sbjct: 1226 ILATEFLVDGSTLSMLTTDMTGNAFIFSYDPKSLESWKGQKLLTKGAFHVGSPVHRMVRF 1285
Query: 131 RCKPSPISDAPGAR----------SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMV 180
R K +P + APG +R ++ +LDG+LG +P+ E + L LQ +
Sbjct: 1286 RLK-APTA-APGQTISPAEQKAQANRHAVFFGTLDGSLGILVPIEEAAHASLQSLQRYLT 1343
Query: 181 THTSHT--GGLNPRAFR---TYKGKGYYAGNPSRGIIDGSLV 217
T H GLN R R T +G+ P ++DG L+
Sbjct: 1344 YATPHAALAGLNARTHRHPKTVEGRPMRQPAP-HSLLDGGLL 1384
>gi|412986884|emb|CCO15310.1| predicted protein [Bathycoccus prasinos]
Length = 1595
Score = 71.6 bits (174), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 50/180 (27%), Positives = 85/180 (47%), Gaps = 14/180 (7%)
Query: 73 ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC 132
A EF+I+ ++ + +D + N +F Y P+ ES G +L+ K+ +H G + + +
Sbjct: 1408 ASEFMIDGGTLSLLAADANGNAHVFQYAPKLAESWKGDKLLPKSAYHAGSLIRKMVRFQ- 1466
Query: 133 KPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG----- 187
I ++R ++ S DG LG F P+ E + L LQ+ M ++ +
Sbjct: 1467 ----IGVGEQKQNRHAVFFGSSDGGLGIFSPVDEHTFLNLEKLQDAMRSNIVASSNSINP 1522
Query: 188 -GLNPRAFRTYK-GKGYYA-GNPSRGIIDGSLVWKFL-QLSLGERLEICKKIGSKHNDIL 243
GLN + +R K +G A P R I+DG L+ KF LS+ + + K G + L
Sbjct: 1523 LGLNSKTYRALKSSEGSVARQTPPRTIVDGGLLSKFEHSLSITAQTRVAAKAGLTRDQAL 1582
>gi|294659889|ref|XP_462318.2| DEHA2G17908p [Debaryomyces hansenii CBS767]
gi|218511978|sp|Q6BHK3.2|CFT1_DEBHA RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two protein
1
gi|199434312|emb|CAG90824.2| DEHA2G17908p [Debaryomyces hansenii CBS767]
Length = 1342
Score = 71.2 bits (173), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 48/182 (26%), Positives = 84/182 (46%), Gaps = 5/182 (2%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
V +F+I+ + +I+D + + L Y PE S+ G RLI K F++ T I
Sbjct: 1158 VNCADFIIKDEEIFILIADNNSTLHLVKYDPEDPTSSNGQRLIHKASFNINS---TPTCI 1214
Query: 131 RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 190
R P P + F + +++DG+ P+ E +YRR+ +LQ + H GLN
Sbjct: 1215 RSIPKNEEINPSSTEVFQSIGSTIDGSFYTVFPINEASYRRMYILQQQITDKEYHFCGLN 1274
Query: 191 PRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK--HNDILDELYD 248
PR R ++ ++D ++ F +L+ R + K+ SK + DI +L +
Sbjct: 1275 PRLNRFGGLSMTVNDTNTKPLLDYEVIRMFAKLNEDRRKNLSMKVSSKNVYQDIWKDLIE 1334
Query: 249 IE 250
+
Sbjct: 1335 FD 1336
>gi|358387835|gb|EHK25429.1| hypothetical protein TRIVIDRAFT_32877 [Trichoderma virens Gv29-8]
Length = 1440
Score = 71.2 bits (173), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 52/192 (27%), Positives = 92/192 (47%), Gaps = 15/192 (7%)
Query: 74 CEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQH--VNTFFKIR 131
+F+ + + + D D ++ + + PE +S GH L+ +T F + + +T R
Sbjct: 1242 ADFLPDGEDLSMVAVDADGDIHVLEFNPEHPKSLQGHLLLHRTTFSVTPNPPTSTLLLPR 1301
Query: 132 CKPSPIS--DAPGARSR--FLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 187
P+ S +P + S L AS G L PLPE YRRLL + N ++ G
Sbjct: 1302 TLPASQSATTSPDSSSSQPHLLLLASPSGCLASLTPLPESAYRRLLSVTNQLLPALVPHG 1361
Query: 188 GLNPRAFRTYKGK-------GYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHN 240
GL+ RA RT +G G R I+DG+++ ++ +L +R E+ + G ++
Sbjct: 1362 GLHARAHRTPEGGGGMSRTVGVETAASGRAIVDGAILARWNELGAAKRAEVATRGG--YD 1419
Query: 241 DILDELYDIEAL 252
+++ D+EA+
Sbjct: 1420 GVMEMREDLEAV 1431
>gi|396471273|ref|XP_003838832.1| similar to cleavage and polyadenylation specificity factor subunit A
[Leptosphaeria maculans JN3]
gi|312215401|emb|CBX95353.1| similar to cleavage and polyadenylation specificity factor subunit A
[Leptosphaeria maculans JN3]
Length = 1402
Score = 70.9 bits (172), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 54/183 (29%), Positives = 80/183 (43%), Gaps = 22/183 (12%)
Query: 73 ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLG----------- 121
A EF+ + + +++D D N+ + + PE +S G RL+ K FH G
Sbjct: 1199 AVEFLPFNGELHIIVADADMNIQVLQFDPENPKSEGS-RLLHKATFHTGHFPTTTHLLQS 1257
Query: 122 -----QHVNTFFKIRC-KPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLML 175
+ +TF P AP + L S G L PL E +YRRL L
Sbjct: 1258 HLQMPESASTFGTTDTFAPDSTPSAPLPLHQVL--ITSQSGTLALITPLSESSYRRLSNL 1315
Query: 176 QNVMVTHTSHTGGLNPRAFRTYKG--KGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICK 233
++ GLNP AFR +G G+ AG +RG++DG L+ ++ +L R E
Sbjct: 1316 AAYLINTLESPCGLNPVAFRAGEGVEGGWDAGGGARGVLDGGLLMRWGELGEQRRKEGLA 1375
Query: 234 KIG 236
K G
Sbjct: 1376 KYG 1378
>gi|449299306|gb|EMC95320.1| hypothetical protein BAUCODRAFT_25380 [Baudoinia compniacensis UAMH
10762]
Length = 1437
Score = 70.5 bits (171), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 50/192 (26%), Positives = 82/192 (42%), Gaps = 31/192 (16%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
V + EF+ + ++ D ++ + Y PE ++ G RL+ K+ FHLG +
Sbjct: 1174 VVSAEFLPFDGQLYLLVVDGKMDLHVLQYDPENPKTVSGQRLLHKSTFHLGHWPVDMLLL 1233
Query: 131 RCKPSPIS--------DAPGARS-----------------RFLTWYASLDGALGFFLPLP 165
+P + D+ G + LT + S GA+G P+
Sbjct: 1234 PSDLAPFAQQAPLTNGDSNGHTNGTESSAANAPAPAPSLFHVLTTFQS--GAVGLITPVD 1291
Query: 166 EKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSL 225
E YRRL LQ + + H GLNPRA+R + + RG++DG LV + +L
Sbjct: 1292 EATYRRLGALQTQLTSVLEHAAGLNPRAYRAVESESLGG----RGVVDGMLVQRIGELGA 1347
Query: 226 GERLEICKKIGS 237
R E+ + G+
Sbjct: 1348 ARRAEVLGRAGA 1359
>gi|365984967|ref|XP_003669316.1| hypothetical protein NDAI_0C04130 [Naumovozyma dairenensis CBS 421]
gi|343768084|emb|CCD24073.1| hypothetical protein NDAI_0C04130 [Naumovozyma dairenensis CBS 421]
Length = 1388
Score = 70.5 bits (171), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 47/179 (26%), Positives = 87/179 (48%), Gaps = 3/179 (1%)
Query: 73 ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC 132
+ EFV+ + F ++D++ + + Y P+ S G +L+ + F+L N+ +
Sbjct: 1202 SVEFVVNGGDLYFALTDRNNILHVLKYAPDELNSLSGQKLVHCSSFNLFSG-NSSLLLLP 1260
Query: 133 KPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPR 192
K D A F T +DG++ +PL E YRRL ++Q M GGLNPR
Sbjct: 1261 KNEEFEDTKNAPLTFQTIGGQVDGSIFKVIPLREDTYRRLYVIQQHMNDKEPQLGGLNPR 1320
Query: 193 AFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK-HNDILDELYDIE 250
R + Y + R ++D +++ +F +L + R + K+ G + H +I ++ ++E
Sbjct: 1321 MER-LSNEYYQLCHVMRPMLDFNIIRRFSELPIDRRTRVAKRAGQRAHYEIWRDMINVE 1378
>gi|449543656|gb|EMD34631.1| hypothetical protein CERSUDRAFT_116804 [Ceriporiopsis subvermispora
B]
Length = 1440
Score = 70.5 bits (171), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 51/184 (27%), Positives = 85/184 (46%), Gaps = 8/184 (4%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P V +F + + D+D + + Y P ES G L+++T+FH GQ
Sbjct: 1261 PLYVTKADFFFAEGRVSIISCDEDGVMRILEYDPHDPESKNGQHLLRRTEFH-GQVEYRT 1319
Query: 128 FKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 187
I + D P +SR + DG+L + E +RL +LQ + + H
Sbjct: 1320 SAILARRLKGVDIP--QSRLICGLT--DGSLITMTYVEEAASKRLHLLQGQLTRNVQHVA 1375
Query: 188 GLNPRAFRTYKGKGYYAGNP-SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEL 246
GLNPR FR + Y P +RGI+DG+L+ + L + + E+ ++IG+ IL +
Sbjct: 1376 GLNPRGFRIVRND--YVSRPLTRGILDGNLLMAYEDLPIVRQDEVTRQIGTDRTTILKDW 1433
Query: 247 YDIE 250
++
Sbjct: 1434 LSLD 1437
>gi|156040479|ref|XP_001587226.1| hypothetical protein SS1G_12256 [Sclerotinia sclerotiorum 1980]
gi|154696312|gb|EDN96050.1| hypothetical protein SS1G_12256 [Sclerotinia sclerotiorum 1980 UF-70]
Length = 1447
Score = 70.5 bits (171), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 55/197 (27%), Positives = 87/197 (44%), Gaps = 23/197 (11%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLG-QHVNTFFK 129
V + + + + + +D D N+ + Y PE +S GH L+ +T F LG H T
Sbjct: 1249 VLCADLLPDGKDLFIVAADADGNLHIMQYDPEHPKSLQGHLLLHRTTFSLGAHHPTTMTL 1308
Query: 130 IRCKPS--PIS-------------DAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLM 174
+ PS P++ D+P L S G PL E YRR
Sbjct: 1309 LPAIPSLHPLTTASSSSLSPSPQEDSPSPSQSLL--LTSRTGTFALLSPLTESQYRRFGT 1366
Query: 175 LQNVMVTHTSHTGGLNPRAFRTYK--GKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEIC 232
L + + H GLNPRA+R K +G G R IIDG ++ ++++L R E+
Sbjct: 1367 LVSHLTNTLYHPCGLNPRAYRVDKDANEGIVGG---RTIIDGGVLGRWMELGSQRRGEVA 1423
Query: 233 KKIGSKHNDILDELYDI 249
++G ++ DEL ++
Sbjct: 1424 GRVGVDVLELRDELSEL 1440
>gi|189203597|ref|XP_001938134.1| conserved hypothetical protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187985233|gb|EDU50721.1| conserved hypothetical protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 1407
Score = 70.5 bits (171), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 48/183 (26%), Positives = 78/183 (42%), Gaps = 30/183 (16%)
Query: 73 ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF----- 127
A +F+ + +++D D N+ + + P+ +S G RL+ K FH G +
Sbjct: 1194 AVDFLPFDQQLHLVVADADMNLQILQFDPDNPKSEAGSRLLHKATFHTGHLPTSLHLIHS 1253
Query: 128 --------------------FKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEK 167
F + P+ +D P + + + G L PL E
Sbjct: 1254 HLKLPSATDFAATNSNPADAFAMDTSPNTTTDTP-QQPFHQILHTTQSGTLALLTPLSED 1312
Query: 168 NYRRLLMLQNVMVTHTSHTGGLNPRAFRT--YKGKGYYAGNPSRGIIDGSLVWKFLQLSL 225
+YRRL L + LNPRAFRT G+ AG +RG++DG+L+ ++ + L
Sbjct: 1313 SYRRLSNLTAYLANTLDSACSLNPRAFRTGDVAEGGWDAGTGARGVLDGNLLLRWGE--L 1370
Query: 226 GER 228
GER
Sbjct: 1371 GER 1373
>gi|401624207|gb|EJS42273.1| cft1p [Saccharomyces arboricola H-6]
Length = 1356
Score = 70.1 bits (170), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 47/166 (28%), Positives = 79/166 (47%), Gaps = 8/166 (4%)
Query: 73 ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC 132
+ EF++ M F +D D+NV + Y P+ S G RL+ + F L +N+ +
Sbjct: 1172 SLEFLVNGGDMYFSATDADRNVHVLKYAPDEPNSLSGQRLVHCSSFTL-HSINSCMLLLP 1230
Query: 133 KPSPI--SDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 190
K S P F +DG++ +PL E+ YRRL ++Q ++ GGLN
Sbjct: 1231 KNEEFGSSQVPS----FQNVGGQVDGSIFKIVPLSEETYRRLYVIQQQIIDREIQLGGLN 1286
Query: 191 PRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 236
PR R Y G+ R ++D +++ +F +L++ R +K G
Sbjct: 1287 PRMER-LANDFYQMGHSMRPMLDFNVIRRFSELAIDRRKNTAQKAG 1331
>gi|349577352|dbj|GAA22521.1| K7_Cft1p [Saccharomyces cerevisiae Kyokai no. 7]
Length = 1357
Score = 70.1 bits (170), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 50/179 (27%), Positives = 87/179 (48%), Gaps = 5/179 (2%)
Query: 73 ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC 132
+ EF++ M F +D D+NV + Y P+ S G RL+ + F L N+ +
Sbjct: 1173 SLEFLVNGGDMYFAATDADRNVHVLKYAPDEPNSLSGQRLVHCSSFTL-HSTNSCMMLLP 1231
Query: 133 KPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPR 192
+ +P S F +DG++ +PL E+ YRRL ++Q ++ GGLNPR
Sbjct: 1232 RNEEFG-SPQVPS-FQNVGGQVDGSVFKIVPLSEEKYRRLYVIQQQIIDRELQLGGLNPR 1289
Query: 193 AFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG-SKHNDILDELYDIE 250
R Y G+ R ++D +++ +F L++ R I +K G + H + ++ +IE
Sbjct: 1290 MER-LANDFYQMGHSMRPMLDFNVIRRFCGLAIDRRKSIAQKAGRNAHFEAWRDIINIE 1347
>gi|45184764|ref|NP_982482.1| AAL060Wp [Ashbya gossypii ATCC 10895]
gi|74695871|sp|Q75EY8.1|CFT1_ASHGO RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two protein
1
gi|44980110|gb|AAS50306.1| AAL060Wp [Ashbya gossypii ATCC 10895]
gi|374105681|gb|AEY94592.1| FAAL060Wp [Ashbya gossypii FDAG1]
Length = 1305
Score = 70.1 bits (170), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 52/187 (27%), Positives = 91/187 (48%), Gaps = 15/187 (8%)
Query: 73 ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC 132
EF++ + + F+++D++ + + Y P+ S G RL+ T F+L NT ++
Sbjct: 1115 CVEFLVNNGDVYFLVTDRNNLMHVLKYAPDEPNSLSGQRLVHCTSFNL-HSTNTCMRLIK 1173
Query: 133 KPSPISDAPGARSRFLTWY--------ASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTS 184
K +D G SR Y + DG + +PL E +YR L ++Q ++
Sbjct: 1174 K----NDEFGKVSRGFGIYMPSFQCIGSQADGTIFKVVPLSEASYRSLYLIQQQLIDKEV 1229
Query: 185 HTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK-HNDIL 243
GLNPR R + Y G+ R ++D +++ +F LS+ R+ + K G + H +I
Sbjct: 1230 QLCGLNPRMER-LENPFYQMGHILRPMLDFTVLKRFATLSIPTRMTMASKAGRQAHAEIW 1288
Query: 244 DELYDIE 250
+L DIE
Sbjct: 1289 RDLIDIE 1295
>gi|151942273|gb|EDN60629.1| cleavage factor II (CF II) component [Saccharomyces cerevisiae
YJM789]
Length = 1357
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 47/164 (28%), Positives = 79/164 (48%), Gaps = 4/164 (2%)
Query: 73 ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC 132
+ EF++ M F +D D+NV + Y P+ S G RL+ + F L N+ +
Sbjct: 1173 SLEFLVNGGDMYFAATDADRNVHVLKYAPDEPNSLSGQRLVHCSSFTL-HSTNSCMMLLP 1231
Query: 133 KPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPR 192
+ +P S F +DG++ +PL E+ YRRL ++Q ++ GGLNPR
Sbjct: 1232 RNEEFG-SPQVPS-FQNVGGQVDGSVFKIVPLSEEKYRRLYVIQQQIIDRELQLGGLNPR 1289
Query: 193 AFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 236
R Y G+ R ++D +++ +F L++ R I +K G
Sbjct: 1290 MER-LANDFYQMGHSMRPMLDFNVIRRFCGLAIDRRKSIAQKAG 1332
>gi|6320507|ref|NP_010587.1| Cft1p [Saccharomyces cerevisiae S288c]
gi|74583567|sp|Q06632.1|CFT1_YEAST RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two protein
1
gi|849213|gb|AAB64737.1| Ydr301wp [Saccharomyces cerevisiae]
gi|256271799|gb|EEU06830.1| Cft1p [Saccharomyces cerevisiae JAY291]
gi|285811316|tpg|DAA12140.1| TPA: Cft1p [Saccharomyces cerevisiae S288c]
gi|392300415|gb|EIW11506.1| Cft1p [Saccharomyces cerevisiae CEN.PK113-7D]
Length = 1357
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 47/164 (28%), Positives = 79/164 (48%), Gaps = 4/164 (2%)
Query: 73 ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC 132
+ EF++ M F +D D+NV + Y P+ S G RL+ + F L N+ +
Sbjct: 1173 SLEFLVNGGDMYFAATDADRNVHVLKYAPDEPNSLSGQRLVHCSSFTL-HSTNSCMMLLP 1231
Query: 133 KPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPR 192
+ +P S F +DG++ +PL E+ YRRL ++Q ++ GGLNPR
Sbjct: 1232 RNEEFG-SPQVPS-FQNVGGQVDGSVFKIVPLSEEKYRRLYVIQQQIIDRELQLGGLNPR 1289
Query: 193 AFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 236
R Y G+ R ++D +++ +F L++ R I +K G
Sbjct: 1290 MER-LANDFYQMGHSMRPMLDFNVIRRFCGLAIDRRKSIAQKAG 1332
>gi|190404756|gb|EDV08023.1| 150 kDa protein associated with polyadenylation factor 1
[Saccharomyces cerevisiae RM11-1a]
gi|259145538|emb|CAY78802.1| Cft1p [Saccharomyces cerevisiae EC1118]
Length = 1357
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 47/164 (28%), Positives = 79/164 (48%), Gaps = 4/164 (2%)
Query: 73 ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC 132
+ EF++ M F +D D+NV + Y P+ S G RL+ + F L N+ +
Sbjct: 1173 SLEFLVNGGDMYFAATDADRNVHVLKYAPDEPNSLSGQRLVHCSSFTL-HSTNSCMMLLP 1231
Query: 133 KPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPR 192
+ +P S F +DG++ +PL E+ YRRL ++Q ++ GGLNPR
Sbjct: 1232 RNEEFG-SPQVPS-FQNVGGQVDGSVFKIVPLSEEKYRRLYVIQQQIIDRELQLGGLNPR 1289
Query: 193 AFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 236
R Y G+ R ++D +++ +F L++ R I +K G
Sbjct: 1290 MER-LANDFYQMGHSMRPMLDFNVIRRFCGLAIDRRKSIAQKAG 1332
>gi|323338222|gb|EGA79455.1| Cft1p [Saccharomyces cerevisiae Vin13]
gi|365766372|gb|EHN07870.1| Cft1p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
Length = 1357
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 47/164 (28%), Positives = 79/164 (48%), Gaps = 4/164 (2%)
Query: 73 ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC 132
+ EF++ M F +D D+NV + Y P+ S G RL+ + F L N+ +
Sbjct: 1173 SLEFLVNGGDMYFAATDADRNVHVLKYAPDEPNSLSGQRLVHCSSFTL-HSTNSCMMLLP 1231
Query: 133 KPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPR 192
+ +P S F +DG++ +PL E+ YRRL ++Q ++ GGLNPR
Sbjct: 1232 RNEEFG-SPQVPS-FQNVGGQVDGSVFKIVPLSEEKYRRLYVIQQQIIDRELQLGGLNPR 1289
Query: 193 AFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 236
R Y G+ R ++D +++ +F L++ R I +K G
Sbjct: 1290 MER-LANDFYQMGHSMRPMLDFNVIRRFCGLAIDRRKSIAQKAG 1332
>gi|207346484|gb|EDZ72967.1| YDR301Wp-like protein [Saccharomyces cerevisiae AWRI1631]
Length = 1357
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 47/164 (28%), Positives = 79/164 (48%), Gaps = 4/164 (2%)
Query: 73 ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC 132
+ EF++ M F +D D+NV + Y P+ S G RL+ + F L N+ +
Sbjct: 1173 SLEFLVNGGDMYFAATDADRNVHVLKYAPDEPNSLSGQRLVHCSSFTL-HSTNSCMMLLP 1231
Query: 133 KPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPR 192
+ +P S F +DG++ +PL E+ YRRL ++Q ++ GGLNPR
Sbjct: 1232 RNEEFG-SPQVPS-FQNVGGQVDGSVFKIVPLSEEKYRRLYVIQQQIIDRELQLGGLNPR 1289
Query: 193 AFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 236
R Y G+ R ++D +++ +F L++ R I +K G
Sbjct: 1290 MER-LANDFYQMGHSMRPMLDFNVIRRFCGLAIDRRKSIAQKAG 1332
>gi|358390357|gb|EHK39763.1| hypothetical protein TRIATDRAFT_48211 [Trichoderma atroviride IMI
206040]
Length = 1441
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 50/192 (26%), Positives = 89/192 (46%), Gaps = 15/192 (7%)
Query: 74 CEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQH--VNTFFKIR 131
+F+ + + + D D ++ + + PE +S GH L+ +T F + + +T R
Sbjct: 1243 ADFLPDGEDLSMVAVDADGDIHVLEFNPEHPKSLQGHLLLHRTTFSVTPNPPTSTLLLPR 1302
Query: 132 CKPSPISDAPGARSR----FLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 187
P+ S S L AS G+L PLPE YRRLL + N ++ G
Sbjct: 1303 TLPASQSATASQDSSTPQPHLLLLASPSGSLAALTPLPESAYRRLLSVTNQLLPALVPHG 1362
Query: 188 GLNPRAFRTYKGK-------GYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHN 240
GL+ RA R +G G R I+DG+++ ++ +L +R E+ + G ++
Sbjct: 1363 GLHARAHRAPEGGGGMSRMVGVETAASGRAIVDGAILTRWNELGAAKRAEVASRGG--YD 1420
Query: 241 DILDELYDIEAL 252
+++ D+EA+
Sbjct: 1421 SVMELREDLEAV 1432
>gi|170102106|ref|XP_001882269.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164642641|gb|EDR06896.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 1406
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 48/177 (27%), Positives = 84/177 (47%), Gaps = 13/177 (7%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQ---HVNTF 127
V +F + + D++ + ++ Y P+ +S G L+ +T+FH GQ +T
Sbjct: 1229 VTNADFFFTDGELSLVTGDEEGIMRMYEYNPQDPDSKDGRYLLLRTEFH-GQSEYRTSTT 1287
Query: 128 FKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 187
R K P P A+ DG L P+ E ++RL +LQ + + H
Sbjct: 1288 IARRLKDDP--SIPQAK----LIIGGTDGCLSSLTPVEEHAFKRLQLLQGQLTRNIQHVA 1341
Query: 188 GLNPRAFRTYKGKGYYAGNP-SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDIL 243
GLNP+AFR + + P S+GI+DG+L+ + L + + E+ ++IG+ +L
Sbjct: 1342 GLNPKAFRIVRND--FVSKPLSKGILDGNLLAHYESLPIIRQNEMTRQIGTDRVTLL 1396
>gi|323309632|gb|EGA62840.1| Cft1p [Saccharomyces cerevisiae FostersO]
Length = 1357
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 47/164 (28%), Positives = 79/164 (48%), Gaps = 4/164 (2%)
Query: 73 ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC 132
+ EF++ M F +D D+NV + Y P+ S G RL+ + F L N+ +
Sbjct: 1173 SLEFLVNGGDMYFAATDADRNVHVLKYAPDEPNSLSGQRLVHCSSFTL-HSTNSCMMLLP 1231
Query: 133 KPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPR 192
+ +P S F +DG++ +PL E+ YRRL ++Q ++ GGLNPR
Sbjct: 1232 RNEEFG-SPQVPS-FQNVGGQVDGSVFKIVPLSEEKYRRLYVIQQQIIDRELQLGGLNPR 1289
Query: 193 AFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 236
R Y G+ R ++D +++ +F L++ R I +K G
Sbjct: 1290 MER-LANDFYQMGHSMRPMLDFNVIRRFCGLAIDRRKSIAQKAG 1332
>gi|330919204|ref|XP_003298516.1| hypothetical protein PTT_09264 [Pyrenophora teres f. teres 0-1]
gi|311328242|gb|EFQ93393.1| hypothetical protein PTT_09264 [Pyrenophora teres f. teres 0-1]
Length = 1388
Score = 69.3 bits (168), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 47/183 (25%), Positives = 77/183 (42%), Gaps = 30/183 (16%)
Query: 73 ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF----- 127
A +F+ + +++D D N+ + + P+ + G RL+ K FH G +
Sbjct: 1194 AVDFLPFDQQLHLVVADADMNLQILQFDPDNPKGEAGSRLLHKATFHTGHFPTSLHLIHS 1253
Query: 128 --------------------FKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEK 167
F + P+ +D P + + + G L PL E
Sbjct: 1254 HLKLPSATDFAATNNNPADAFAMDTSPNTTTDTP-QQPFHQILHTTQSGTLALLTPLSED 1312
Query: 168 NYRRLLMLQNVMVTHTSHTGGLNPRAFRT--YKGKGYYAGNPSRGIIDGSLVWKFLQLSL 225
+YRRL L + LNPRAFRT G+ AG +RG++DG+L+ ++ + L
Sbjct: 1313 SYRRLSNLSAYLANTLDSACSLNPRAFRTGDVAEGGWDAGTGARGVLDGNLLLRWGE--L 1370
Query: 226 GER 228
GER
Sbjct: 1371 GER 1373
>gi|401841121|gb|EJT43641.1| CFT1-like protein [Saccharomyces kudriavzevii IFO 1802]
Length = 1355
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 47/166 (28%), Positives = 78/166 (46%), Gaps = 8/166 (4%)
Query: 73 ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC 132
+ EF++ M F +D D+NV + Y P+ S G RL+ + F + +N+ +
Sbjct: 1171 SLEFLVNGGDMYFAATDADRNVHILKYAPDEPNSLSGQRLVHCSSFTV-HSINSCMMLLP 1229
Query: 133 KPSPI--SDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 190
K S P F +DG++ +PL E+ YRRL ++Q ++ GGLN
Sbjct: 1230 KNQEFGSSQVPS----FQNVGGQVDGSVFKIVPLSEETYRRLYLIQQQIIDRELQLGGLN 1285
Query: 191 PRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 236
PR R Y G+ R ++D +++ +F LS+ R +K G
Sbjct: 1286 PRMER-LANDFYQMGHSMRPMLDFNVIRRFSGLSIDRRKNTAQKAG 1330
>gi|406602601|emb|CCH45811.1| hypothetical protein BN7_5397 [Wickerhamomyces ciferrii]
Length = 1287
Score = 68.9 bits (167), Expect = 2e-09, Method: Composition-based stats.
Identities = 46/176 (26%), Positives = 93/176 (52%), Gaps = 14/176 (7%)
Query: 75 EFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKP 134
+F+++ + F+++D ++++ L YQP+ +S G +L++K+ F + T K+ K
Sbjct: 1112 DFIVDDGEIYFLVADDEESLHLLTYQPDDPKSLSGQKLLQKSTFTTNS-ITTCLKLVPKF 1170
Query: 135 SPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAF 194
+ G+ + + ++DG++ +P+ E +YRRL +LQ + +H GLNPR+
Sbjct: 1171 NEFDQ--GSITSYQNIGVNVDGSIFKMIPIDEISYRRLYILQQQLSDKIAHYVGLNPRSN 1228
Query: 195 RTYKGKGYYAGNP--SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYD 248
R ++ N + II+ L+ F+ L++ +R + K+G ND L+ D
Sbjct: 1229 R-------FSANEQGQKPIIEFGLLKWFINLNVDKRKQFSAKVG--RNDYLELFKD 1275
>gi|380494933|emb|CCF32776.1| cft-1, partial [Colletotrichum higginsianum]
Length = 542
Score = 68.6 bits (166), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 50/188 (26%), Positives = 84/188 (44%), Gaps = 9/188 (4%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
V +FV+ + ++ DKD + + + PE +S GH L+ + F + T +
Sbjct: 349 VLTADFVVAGDELHIVVCDKDGVIHVMQFDPEHPKSLQGHLLLNRASFSAAPNHPTI-TL 407
Query: 131 RCKPSPISDAPGARSR---FLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 187
+PIS + + S+ AS GAL PL E+ YRRL L N + H
Sbjct: 408 SLPRTPISPSATSVSKNPPTTLLLASPTGALASLTPLSEQAYRRLTSLANSIAGALPHAA 467
Query: 188 GLNPRAFRTYKGKGYYAG---NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILD 244
NP+ R G + R I+DG+L+ ++ +L G R E+ K G + D+ +
Sbjct: 468 ATNPKGHRLQPLDARTPGVDTSAGRSIVDGALLARWNELGAGRRSEVAGKGG--YGDVHE 525
Query: 245 ELYDIEAL 252
++E +
Sbjct: 526 VRSELEGV 533
>gi|389641257|ref|XP_003718261.1| cft-1 [Magnaporthe oryzae 70-15]
gi|351640814|gb|EHA48677.1| cft-1 [Magnaporthe oryzae 70-15]
Length = 1452
Score = 68.6 bits (166), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 44/160 (27%), Positives = 73/160 (45%), Gaps = 8/160 (5%)
Query: 85 FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQH--VNTFFKIRCKPSPISDAPG 142
+ +D D N+ + + PE +S GH L+ +T F G H + P P ++ P
Sbjct: 1270 IVAADSDGNLHVLQFDPEHPKSLQGHLLLNRTTFSTGAHHPQKSLLLPTTDPRPSTNQPS 1329
Query: 143 ARS-RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFR-----T 196
+ + R AS G L PL + Y RL L + ++ H LNP+A+R T
Sbjct: 1330 SDAERQHILMASPTGVLAAVQPLSQSTYTRLSALASNLMASVPHHAALNPKAYRLPPTST 1389
Query: 197 YKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 236
+ R ++DGSL+ ++ +L+ G R E+ + G
Sbjct: 1390 RNQVAAVDISVGRAVVDGSLLARWAELASGRRAEVAGRAG 1429
>gi|302403950|ref|XP_002999813.1| cft-1 [Verticillium albo-atrum VaMs.102]
gi|261361315|gb|EEY23743.1| cft-1 [Verticillium albo-atrum VaMs.102]
Length = 1349
Score = 68.6 bits (166), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 51/194 (26%), Positives = 87/194 (44%), Gaps = 20/194 (10%)
Query: 74 CEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQH----VNTFFK 129
+F++ + + + SD+D + + + PE S GH L+ + F + + +
Sbjct: 1149 ADFLVAGNDLSIVASDEDGVLHILQFDPEHPRSLQGHLLLNRASFSVAPNHAWVTLALPR 1208
Query: 130 IRCKP-----SPISDAPGARSRFLTWY-ASLDGALGFFLPLPEKNYRRLLMLQNVMVTHT 183
+P P ++A G+++R T AS GA+ P+ E YRRL L +
Sbjct: 1209 TTTRPYLPQSEPATNAAGSQNRTQTLLLASASGAIASLNPITEHAYRRLTSLTTSLANAL 1268
Query: 184 SHTGGLNPRAFRTYKGKGYYAGNP-------SRGIIDGSLVWKFLQLSLGERLEICKKIG 236
H G+NP+A R G A P R I+DG+L+ ++ +L +R E K G
Sbjct: 1269 PHAAGMNPKAHRLPPQDG--AARPPAVDVSAGRTIVDGALLARWNELGARQRAEAAGKGG 1326
Query: 237 -SKHNDILDELYDI 249
+ D+ EL D+
Sbjct: 1327 FASAADVRGELEDV 1340
>gi|440466842|gb|ELQ36086.1| hypothetical protein OOU_Y34scaffold00669g71 [Magnaporthe oryzae Y34]
gi|440481991|gb|ELQ62520.1| hypothetical protein OOW_P131scaffold01068g7 [Magnaporthe oryzae
P131]
Length = 1475
Score = 68.2 bits (165), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 44/160 (27%), Positives = 73/160 (45%), Gaps = 8/160 (5%)
Query: 85 FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQH--VNTFFKIRCKPSPISDAPG 142
+ +D D N+ + + PE +S GH L+ +T F G H + P P ++ P
Sbjct: 1293 IVAADSDGNLHVLQFDPEHPKSLQGHLLLNRTTFSTGAHHPQKSLLLPTTDPRPSTNQPS 1352
Query: 143 ARS-RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFR-----T 196
+ + R AS G L PL + Y RL L + ++ H LNP+A+R T
Sbjct: 1353 SDAERQHILMASPTGVLAAVQPLSQSTYTRLSALASNLMASVPHHAALNPKAYRLPPTST 1412
Query: 197 YKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 236
+ R ++DGSL+ ++ +L+ G R E+ + G
Sbjct: 1413 RNQVAAVDISVGRAVVDGSLLARWAELASGRRAEVAGRAG 1452
>gi|346319828|gb|EGX89429.1| protein CFT1 [Cordyceps militaris CM01]
Length = 1452
Score = 68.2 bits (165), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 50/191 (26%), Positives = 87/191 (45%), Gaps = 15/191 (7%)
Query: 74 CEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCK 133
+F+ + + + D D ++ +F + P+ +S GH L+ +T F L + T + +
Sbjct: 1253 ADFLPDGEDLNMIGVDADGDLHVFEFNPDHPKSLQGHLLLHRTTFSLSPNEPTTTVLLER 1312
Query: 134 PSPISD-----APGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGG 188
P S GA + + G L PL E YRRLL L N ++ GG
Sbjct: 1313 TIPASQPQPQGTTGAETPHTLLLSCPTGQLAALTPLSESAYRRLLSLANQLMPAVVPYGG 1372
Query: 189 LNPRAFRTYKGKGYYA---------GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKH 239
L+P+A R +G+G + R I+DG+++ ++ +L +R E+ K G
Sbjct: 1373 LHPKAHRLPEGRGAQSHARAVGVETAASGRMIVDGAVLARWTELGAAKRAEMATKSGYDD 1432
Query: 240 -NDILDELYDI 249
N++ DEL +
Sbjct: 1433 LNEMRDELEGV 1443
>gi|254564833|ref|XP_002489527.1| RNA-binding subunit of the mRNA cleavage and polyadenylation factor
[Komagataella pastoris GS115]
gi|238029323|emb|CAY67246.1| RNA-binding subunit of the mRNA cleavage and polyadenylation factor
[Komagataella pastoris GS115]
gi|328349950|emb|CCA36350.1| Protein cft1 [Komagataella pastoris CBS 7435]
Length = 1388
Score = 68.2 bits (165), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 51/171 (29%), Positives = 82/171 (47%), Gaps = 9/171 (5%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
V A +FV+ ++ +I+D+D + L Y PE S G RL++++ F Q+ T K+
Sbjct: 1205 VCAADFVVFEGNLFIIIADEDGMLHLIQYDPEDPASMQGQRLLRRSIFKTNQYT-TCMKM 1263
Query: 131 R-----CKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSH 185
R KP P + F A+ DG+ P+ E YRRL ++Q + +H
Sbjct: 1264 RERKYVIKP-PKNQFTNFSEAFEVVAANSDGSFYKVTPISEATYRRLYVIQQQIFDQENH 1322
Query: 186 TGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 236
GLNPR R Y Y N R I+D + +FL+ ++ ++ K+G
Sbjct: 1323 KCGLNPRENR-YLSDQYSIPN-QRLILDFDNIRRFLEFDEIKKRDLVHKLG 1371
>gi|260941626|ref|XP_002614979.1| hypothetical protein CLUG_04994 [Clavispora lusitaniae ATCC 42720]
gi|238851402|gb|EEQ40866.1| hypothetical protein CLUG_04994 [Clavispora lusitaniae ATCC 42720]
Length = 1363
Score = 67.4 bits (163), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 48/184 (26%), Positives = 93/184 (50%), Gaps = 7/184 (3%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
V + +F+++ + +++D ++++ + Y PE S+ G RL+ K+ F ++ T K
Sbjct: 1177 VSSADFLVKDEEIYILVADNNRSLHVLQYNPEDPASSNGQRLLHKSSF-TTNYLTTCTKS 1235
Query: 131 RCKPSPISD--APGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGG 188
K +S P A F T ++++GA+ P+ E YRR+ ++Q ++ H G
Sbjct: 1236 VPKHEQLSTWFDPQAIP-FQTVGSTVEGAMYVVFPISEPTYRRMYIMQQQLIDKEYHHCG 1294
Query: 189 LNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKH--NDILDEL 246
LNPR R + + N R ++D L+ +F +L+ + + KI +K+ DI +L
Sbjct: 1295 LNPRLNRIGRIESVNYAN-LRAMLDCELIRRFSKLNEDRKRTLSSKISTKNVQVDIWKDL 1353
Query: 247 YDIE 250
+ E
Sbjct: 1354 IEFE 1357
>gi|171695066|ref|XP_001912457.1| hypothetical protein [Podospora anserina S mat+]
gi|170947775|emb|CAP59938.1| unnamed protein product [Podospora anserina S mat+]
Length = 1441
Score = 67.4 bits (163), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 53/178 (29%), Positives = 81/178 (45%), Gaps = 15/178 (8%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
V +F+ + + D + ++ + + PE +S GH L+ +T F G H T K
Sbjct: 1244 VLNADFLPNGKELSIVACDAEGHIHILQFDPEHPKSLQGHLLLHRTSFSTGAHHVT--KS 1301
Query: 131 RCKPSPIS-------DAPGARSR-FLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
PS +S + GA SR + AS G L PL E YRRL L +
Sbjct: 1302 LLLPSTLSPDNKEDNEENGATSRPHILLLASPTGVLAALRPLSETAYRRLSSLAAQLTNS 1361
Query: 183 TSHTGGLNPRAFR----TYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 236
+H GLNP+ +R T G AG R I+DG+++ +F +L +R E+ + G
Sbjct: 1362 LTHAAGLNPKGYRMPSATCPPAGVDAGI-GRHIVDGTILARFSELGRAKRGEVAGRAG 1418
>gi|346971831|gb|EGY15283.1| cft-1 [Verticillium dahliae VdLs.17]
Length = 1445
Score = 67.0 bits (162), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 51/194 (26%), Positives = 86/194 (44%), Gaps = 20/194 (10%)
Query: 74 CEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQH----VNTFFK 129
+F++ + + + SD+D + + + PE S GH L+ + F + + +
Sbjct: 1245 ADFLVAGNDLSIVASDEDGVLHILQFDPEHPRSLQGHLLLNRASFSVAPNHAWATLVLPR 1304
Query: 130 IRCKP-----SPISDAPGARSRFLTWY-ASLDGALGFFLPLPEKNYRRLLMLQNVMVTHT 183
+P P + A G+++R T AS GA+ P+ E YRRL L +
Sbjct: 1305 TTTRPYLPQSEPATGAAGSQNRTQTLLLASASGAIASLNPITEHAYRRLTSLTTSLANAL 1364
Query: 184 SHTGGLNPRAFRTYKGKGYYAGNP-------SRGIIDGSLVWKFLQLSLGERLEICKKIG 236
H G+NP+A R G A P R I+DG+L+ ++ +L +R E K G
Sbjct: 1365 PHAAGMNPKAHRLPPQDG--AARPPAVDVSAGRTIVDGALLARWNELGARQRAEAAGKGG 1422
Query: 237 -SKHNDILDELYDI 249
+ D+ EL D+
Sbjct: 1423 FASAADVRGELEDV 1436
>gi|342877552|gb|EGU79002.1| hypothetical protein FOXB_10431 [Fusarium oxysporum Fo5176]
Length = 1399
Score = 66.2 bits (160), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 48/169 (28%), Positives = 81/169 (47%), Gaps = 4/169 (2%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
V +F+ + + + +D D ++ + + PE +S GH L+ +T F + + + +
Sbjct: 1209 VLVADFLPDGEDLAIVAADADGDLHILDFNPEHPKSLQGHLLLHRTSFSVSPNPPSTTLL 1268
Query: 131 RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 190
+ P S P + AS G L +PLPE YRRLL + N ++ + GGLN
Sbjct: 1269 LPRTLPPSHPPPQDPPHILLLASSSGHLATLVPLPETTYRRLLSVTNQLLPALTPHGGLN 1328
Query: 191 PRAFRTYKG---KGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 236
+A R G G A R I+DG+++ ++ +L +R EI K G
Sbjct: 1329 AKAHRLPDGIRPVGVEAAG-GRTIVDGAILARWAELGAAKRAEIAGKGG 1376
>gi|347838999|emb|CCD53571.1| similar to Cleavage and polyadenylation specificity factor subunit 1
[Botryotinia fuckeliana]
Length = 1447
Score = 65.5 bits (158), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 52/194 (26%), Positives = 82/194 (42%), Gaps = 23/194 (11%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
V + + + + + +D + N+ + Y PE +S GH L+ +T F LG H T +
Sbjct: 1249 VLCADLLPDGKDLFIVAADANGNLHIMQYDPEHPKSLQGHLLLHRTTFSLGAHHPTTMTL 1308
Query: 131 RCKPSPI----------------SDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLM 174
P+ D P L S G L PL E YRR
Sbjct: 1309 LPTTRPLPQLTTAPSPSPDPSPQEDTPSPSQPLL--LTSRTGTLALLSPLTESQYRRFGT 1366
Query: 175 LQNVMVTHTSHTGGLNPRAFRTYK--GKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEIC 232
L + + H GLNPRA+R + +G G R IIDG ++ ++++L R E+
Sbjct: 1367 LVSHLTNTLYHPCGLNPRAYRIDRDANEGIVGG---RTIIDGGVLGRWMELGSQRRGEVA 1423
Query: 233 KKIGSKHNDILDEL 246
++G ++ DEL
Sbjct: 1424 GRVGVDVLELRDEL 1437
>gi|400597740|gb|EJP65470.1| CPSF A subunit region [Beauveria bassiana ARSEF 2860]
Length = 1444
Score = 65.1 bits (157), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 48/183 (26%), Positives = 84/183 (45%), Gaps = 16/183 (8%)
Query: 74 CEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCK 133
+F+ + + + D D ++ +F + P+ +S GH LI +T F L + T + +
Sbjct: 1245 ADFLPDGEDLSMIGVDADGDLHVFEFDPDHPKSLQGHLLIHRTTFSLSPNEPTTTVLLER 1304
Query: 134 PSPISD-----APGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGG 188
P S GA + + G L PL E YRRLL L N ++ GG
Sbjct: 1305 TIPASQPQPKGTTGAETPHTLLLSCPTGQLAALTPLSESAYRRLLSLTNQVLPAVVPHGG 1364
Query: 189 LNPRAFRTYKGKGYYA---------GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKH 239
L+P+A R +G+G + R I+DG+++ ++ +L +R E+ K G +
Sbjct: 1365 LHPKAHRLPEGRGAQSHSRAVGVETAASGRMIVDGAVLARWTELGAAKRAEMALKSG--Y 1422
Query: 240 NDI 242
+D+
Sbjct: 1423 DDV 1425
>gi|409046890|gb|EKM56369.1| hypothetical protein PHACADRAFT_93103 [Phanerochaete carnosa
HHB-10118-sp]
Length = 1417
Score = 65.1 bits (157), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 47/178 (26%), Positives = 88/178 (49%), Gaps = 5/178 (2%)
Query: 73 ACEFVIESSSMGFMISDKDKNVV-LFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIR 131
AC + + +IS ++ V+ L Y P ES G RL+ +T+FH T I
Sbjct: 1240 ACADLFFADGKASLISCDEEGVLRLSEYDPHDPESRHGQRLLCRTEFHGQTEYRTSHLIA 1299
Query: 132 CKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNP 191
+ + DA +++ + + DG+L + + +RL +LQ + + H GLNP
Sbjct: 1300 RRGKGL-DAEIPQAKLICGHT--DGSLTSLTYVDDAVSKRLHLLQGQLARNVQHVAGLNP 1356
Query: 192 RAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI 249
+AFR + A ++GI+DG+L+ F L + ++E+ ++I ++ +L + D+
Sbjct: 1357 KAFRVVRND-RVARPLTKGILDGNLLAAFEDLPVPRQVEVTRQIATERTTVLKDWLDL 1413
>gi|154320778|ref|XP_001559705.1| hypothetical protein BC1G_01861 [Botryotinia fuckeliana B05.10]
Length = 1153
Score = 65.1 bits (157), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 52/194 (26%), Positives = 82/194 (42%), Gaps = 23/194 (11%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
V + + + + + +D + N+ + Y PE +S GH L+ +T F LG H T +
Sbjct: 955 VLCADLLPDGKDLFIVAADANGNLHIMQYDPEHPKSLQGHLLLHRTTFSLGAHHPTTMTL 1014
Query: 131 RCKPSPI----------------SDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLM 174
P+ D P L S G L PL E YRR
Sbjct: 1015 LPTTRPLPQLTTAPSPSPDPSPQEDTPSPSQPLL--LTSRTGTLALLSPLTESQYRRFGT 1072
Query: 175 LQNVMVTHTSHTGGLNPRAFRTYK--GKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEIC 232
L + + H GLNPRA+R + +G G R IIDG ++ ++++L R E+
Sbjct: 1073 LVSHLTNTLYHPCGLNPRAYRIDRDANEGIVGG---RTIIDGGVLGRWMELGSQRRGEVA 1129
Query: 233 KKIGSKHNDILDEL 246
++G ++ DEL
Sbjct: 1130 GRVGVDVLELRDEL 1143
>gi|358338426|dbj|GAA28838.2| cleavage and polyadenylation specificity factor subunit 1 [Clonorchis
sinensis]
Length = 1741
Score = 65.1 bits (157), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 34/101 (33%), Positives = 53/101 (52%)
Query: 156 GALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGS 215
GA+ PL +K Y RL + + ++ H T GL P+ Y+ NPS + D
Sbjct: 1641 GAVFLIGPLRDKMYSRLRITEKNLIHHFGPTCGLLPKLCWNYRPSAPELVNPSGQVADAD 1700
Query: 216 LVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 256
L+W++L L +RLEI KK G I+D++ ++ A + HF
Sbjct: 1701 LLWRYLTLPHSQRLEIAKKSGQSLEGIMDDIAELNATTLHF 1741
Score = 51.6 bits (122), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 32/99 (32%), Positives = 48/99 (48%), Gaps = 12/99 (12%)
Query: 27 HELLIYQAFRHPKGTLK----LRFKK-LKVLFVSDRSKRANEQPGLPRGVYACEFVIESS 81
H LL + LK LRF+ L+VL V R +PR VY F ++
Sbjct: 1457 HSLLCVKNLILAADVLKSIQLLRFQSDLRVLSVVSRD-------AIPREVYTSNFFVDGR 1509
Query: 82 SMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHL 120
+GF+++D+ NVV++ Y P S G RL+++ D L
Sbjct: 1510 RLGFLVTDERGNVVIYSYDPLEPSSRSGRRLVRRADMCL 1548
>gi|444313909|ref|XP_004177612.1| hypothetical protein TBLA_0A02930 [Tetrapisispora blattae CBS 6284]
gi|387510651|emb|CCH58093.1| hypothetical protein TBLA_0A02930 [Tetrapisispora blattae CBS 6284]
Length = 1459
Score = 65.1 bits (157), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 45/185 (24%), Positives = 88/185 (47%), Gaps = 9/185 (4%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
+ EF+ + ++ F +D D + +F Y P+ S G +L+ + F+L + +
Sbjct: 1270 ISTVEFIANNGNIYFAATDYDNILHIFKYAPDEPNSLSGQKLVHCSSFNLHSSTSCMIML 1329
Query: 131 RCKPSPISDAPGARSRFLTWYASL----DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHT 186
P + + F+ + +L DG++ +PL E YRRL ++Q + +
Sbjct: 1330 ---PGNDEFSENEQDNFIPSFQTLGGQVDGSIFKVIPLEESPYRRLYVIQQQITDYEVQV 1386
Query: 187 GGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK-HNDILDE 245
GGLNP+ R + Y N + ++D +++ +F L + +R +K G + H +I +
Sbjct: 1387 GGLNPKMER-LSNEYYQKSNMLKPMLDFNIIRRFSMLPIDKRRRTAQKAGRRAHFEIWRD 1445
Query: 246 LYDIE 250
L +IE
Sbjct: 1446 LINIE 1450
>gi|50305395|ref|XP_452657.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
gi|74606921|sp|Q6CTT2.1|CFT1_KLULA RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two protein
1
gi|49641790|emb|CAH01508.1| KLLA0C10274p [Kluyveromyces lactis]
Length = 1300
Score = 64.7 bits (156), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 42/181 (23%), Positives = 86/181 (47%), Gaps = 6/181 (3%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
V EF++ + ++ F+++D+ ++ + Y P+ S G RL+ F++ N + K+
Sbjct: 1115 VMNLEFLVNNGNINFIVTDRQNHLHVLRYAPDEANSLSGQRLVHCNSFNMFT-TNNYMKL 1173
Query: 131 RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 190
K S ++ DG++ +PL E +YRR ++Q ++ H G N
Sbjct: 1174 VRKHVEFG---SKTSNYIALGCQTDGSIFRMIPLNEASYRRFYLVQQQLLDHEIPLAGFN 1230
Query: 191 PRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGS-KHNDILDELYDI 249
+ R + Y+ G+ R +D ++ K++ L + +R I ++G ++ +L DI
Sbjct: 1231 TKMER-LDNEYYHKGHSLRPTLDSQVLKKYIHLPITKRTTIENRVGRHASTELWHDLIDI 1289
Query: 250 E 250
E
Sbjct: 1290 E 1290
>gi|150951283|ref|XP_001387581.2| pre-mRNA 3'-end processing factor CF II mRNA cleavage and
polyadenylation factor II complex, subunit CFT1 (CPSF
subunit) RNA processing and modification [Scheffersomyces
stipitis CBS 6054]
gi|149388465|gb|EAZ63558.2| pre-mRNA 3'-end processing factor CF II mRNA cleavage and
polyadenylation factor II complex, subunit CFT1 (CPSF
subunit) RNA processing and modification [Scheffersomyces
stipitis CBS 6054]
Length = 1341
Score = 64.7 bits (156), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 46/184 (25%), Positives = 90/184 (48%), Gaps = 11/184 (5%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
V +F+ + + +I+D + + L Y PE + G RL+ K+ F +N+F +
Sbjct: 1159 VNCADFITKDEEVFILIADNNNVLHLVQYDPEDPTALNGQRLLSKSSFS----INSF--V 1212
Query: 131 RC-KPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGL 189
C K P ++ F T +++DG+ +P+ E +YRR+ +LQ + H GL
Sbjct: 1213 TCLKSLPKTEEKYDTGNFQTIGSTIDGSFFSVVPINEASYRRMYILQQQLTDKEYHYCGL 1272
Query: 190 NPRAFRTYKGKGYYAGNP-SRGIIDGSLVWKFLQLSLGERLEICKKIGSK--HNDILDEL 246
NPR R + G A + ++ I+D ++ + +L+ + + K+ +K + DI ++
Sbjct: 1273 NPRLNR-FGGLSMTANDTNTKPILDYDVIRAYGKLNEERKKNLASKVSAKNIYQDIWKDI 1331
Query: 247 YDIE 250
+ E
Sbjct: 1332 IEFE 1335
>gi|410079681|ref|XP_003957421.1| hypothetical protein KAFR_0E01320 [Kazachstania africana CBS 2517]
gi|372464007|emb|CCF58286.1| hypothetical protein KAFR_0E01320 [Kazachstania africana CBS 2517]
Length = 1350
Score = 64.7 bits (156), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 42/179 (23%), Positives = 88/179 (49%), Gaps = 3/179 (1%)
Query: 73 ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC 132
+ EF++ + + F ++D+D + + Y P+ S G +L+ + F+L N+ +
Sbjct: 1162 SVEFLVNNGDIYFAVTDRDNILHVLKYAPDEPNSFSGQKLVHCSSFNLYAD-NSCMVLLA 1220
Query: 133 KPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPR 192
K + + DG++ +PL E++YRRL ++Q ++ + GGLNPR
Sbjct: 1221 KNDEFNKVDDTNRTYQVVGGQTDGSMFKIVPLSEESYRRLYVIQQQIIDKETQLGGLNPR 1280
Query: 193 AFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG-SKHNDILDELYDIE 250
R + + R ++D +++ KF + + +R + +K+G + H + +L +IE
Sbjct: 1281 MER-LSNQYLPLCHVMRPMLDFNVIRKFSAMPISKRQALAQKLGRNVHFEAWRDLINIE 1338
>gi|448530371|ref|XP_003870046.1| mRNA cleavage and polyadenylation factor [Candida orthopsilosis Co
90-125]
gi|380354400|emb|CCG23915.1| mRNA cleavage and polyadenylation factor [Candida orthopsilosis]
Length = 1327
Score = 64.3 bits (155), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 49/185 (26%), Positives = 85/185 (45%), Gaps = 16/185 (8%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
V +F+++ + +++D + ++ L + P+ +S G LI K F +
Sbjct: 1148 VECADFIVKDEEIFILVADVNNSLHLIQFDPDDPKSINGTILINKASFETNSQTTC---L 1204
Query: 131 RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 190
R P G + T +++DGA P+ E YRR+ ++Q + H GLN
Sbjct: 1205 RSVPK------GETGDYQTIGSTIDGAFFNVFPVNESTYRRMYIVQQQISDKEYHYCGLN 1258
Query: 191 PRAFRTYKGKGYYAGNPSRG--IIDGSLVWKFLQLSLGERLEICKKI---GSKHNDILDE 245
PR R + G N + I+D +L+ +F +L+L + I KI GS H DI +
Sbjct: 1259 PRLNR-FGGAVQIRDNDTNAKPILDYNLIKEFAKLNLDRQKNITTKINIKGSAH-DIWKD 1316
Query: 246 LYDIE 250
L ++E
Sbjct: 1317 LIELE 1321
>gi|353231025|emb|CCD77443.1| putative cleavage and polyadenylation specificity factor cpsf
[Schistosoma mansoni]
Length = 1825
Score = 64.3 bits (155), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 34/122 (27%), Positives = 62/122 (50%)
Query: 135 SPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAF 194
+P S + R ++ S +G++ P+ +K Y RL + + ++ H G+ P++
Sbjct: 1704 NPNSGVDPEKFRQSIYFGSQNGSIYRIGPIRDKMYSRLRITEKNLIHHLGPICGMPPKSC 1763
Query: 195 RTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSS 254
+Y NP + DG L+W++L L +RLEI KK G I+D++ ++ A +
Sbjct: 1764 WSYNRPQPELANPCGKVADGDLIWRYLTLPHCQRLEIAKKSGQSLESIMDDIAELIATTL 1823
Query: 255 HF 256
HF
Sbjct: 1824 HF 1825
Score = 47.4 bits (111), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 32/109 (29%), Positives = 47/109 (43%), Gaps = 12/109 (11%)
Query: 27 HELLIYQAFRHPKGTLK----LRFKK-LKVLFVSDRSKRANEQPGLPRGVYACEFVIESS 81
H LL + LK LRF+ L+VL V R + R VY F ++
Sbjct: 1534 HNLLCVKNLVLAADVLKSVQLLRFQSDLRVLSVVSRD-------NISREVYTSNFFVDGR 1586
Query: 82 SMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
+GFM+SD+ NV ++ Y P S G RL++ D L ++
Sbjct: 1587 RLGFMVSDELGNVTIYSYDPLDPSSRSGRRLVRCADMRLPSRATCSLRV 1635
>gi|256079900|ref|XP_002576222.1| cleavage and polyadenylation specificity factor cpsf [Schistosoma
mansoni]
Length = 1958
Score = 63.9 bits (154), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 31/107 (28%), Positives = 57/107 (53%)
Query: 150 WYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSR 209
++ S +G++ P+ +K Y RL + + ++ H G+ P++ +Y NP
Sbjct: 1852 YFGSQNGSIYRIGPIRDKMYSRLRITEKNLIHHLGPICGMPPKSCWSYNRPQPELANPCG 1911
Query: 210 GIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 256
+ DG L+W++L L +RLEI KK G I+D++ ++ A + HF
Sbjct: 1912 KVADGDLIWRYLTLPHCQRLEIAKKSGQSLESIMDDIAELIATTLHF 1958
Score = 47.4 bits (111), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 32/109 (29%), Positives = 47/109 (43%), Gaps = 12/109 (11%)
Query: 27 HELLIYQAFRHPKGTLK----LRFKK-LKVLFVSDRSKRANEQPGLPRGVYACEFVIESS 81
H LL + LK LRF+ L+VL V R + R VY F ++
Sbjct: 1551 HNLLCVKNLVLAADVLKSVQLLRFQSDLRVLSVVSRD-------NISREVYTSNFFVDGR 1603
Query: 82 SMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
+GFM+SD+ NV ++ Y P S G RL++ D L ++
Sbjct: 1604 RLGFMVSDELGNVTIYSYDPLDPSSRSGRRLVRCADMRLPSRATCSLRV 1652
>gi|346321204|gb|EGX90804.1| DNA damage-binding protein 1 [Cordyceps militaris CM01]
Length = 1160
Score = 63.9 bits (154), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 41/158 (25%), Positives = 79/158 (50%), Gaps = 6/158 (3%)
Query: 85 FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGAR 144
++ +D N+++ PEA RL ++ +LG+ +N K+ P+ +A +
Sbjct: 998 WLETDAQGNIMVLARNPEAPTEQDRGRLEITSEMNLGEQINKIRKLNVAPA--DNAVVSP 1055
Query: 145 SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYA 204
FL AS++G L + + K L+ LQ+ + + TG ++ A+R+++ + A
Sbjct: 1056 KAFL---ASIEGTLYLYGDIAPKYQDLLITLQSNIEQYVKTTGDISFNAWRSFRNQTREA 1112
Query: 205 GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
P R +DG +V +FL L ++E+CK +G D+
Sbjct: 1113 DGPFR-FVDGEMVERFLDLDELTQVELCKDLGPSVEDV 1149
>gi|340515387|gb|EGR45642.1| predicted protein [Trichoderma reesei QM6a]
Length = 1441
Score = 63.9 bits (154), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 49/192 (25%), Positives = 90/192 (46%), Gaps = 15/192 (7%)
Query: 74 CEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQH--VNTFFKIR 131
+F+ + + + D D ++ + + PE +S GH L+ +T F + + +T R
Sbjct: 1243 ADFLPDGEDLSMVAVDADGDMHVLEFNPEHPKSLQGHLLLHRTTFSVTPNPPTSTLLLPR 1302
Query: 132 CKPSPISDAPGARSRF----LTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 187
P+ S + S + AS G++ PLPE YRRLL + N ++ G
Sbjct: 1303 TLPASQSSQDSSSSSSTQPHILLLASPSGSIAALTPLPESAYRRLLSVTNQLLPALVPHG 1362
Query: 188 GLNPRAFRTYKGK-------GYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHN 240
GL+ RA RT +G G R I+DG+++ ++ +L +R E+ + G ++
Sbjct: 1363 GLHARAHRTPEGGGGMSRTVGVETAATGRAIVDGTVLTRWNELGAAKRAEVATRGG--YD 1420
Query: 241 DILDELYDIEAL 252
+ + D+EA+
Sbjct: 1421 GVTEMREDLEAV 1432
>gi|344229600|gb|EGV61485.1| hypothetical protein CANTEDRAFT_109087 [Candida tenuis ATCC 10573]
Length = 1300
Score = 63.5 bits (153), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 41/181 (22%), Positives = 87/181 (48%), Gaps = 9/181 (4%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
V +F+++ + ++ D + + L Y PE S+ G RL++K F+L V ++
Sbjct: 1125 VNCADFIVKDEELYIIVGDNNNILHLLKYDPEDPNSSNGQRLVEKAAFNLNAKVT---QL 1181
Query: 131 RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 190
+ P+ + ++ ++++G+ P+ E +YRR+ +LQ + H GLN
Sbjct: 1182 KQLPNLMDNSTSCIG------STIEGSFFTVFPINESSYRRMYILQQQLTDKAYHHCGLN 1235
Query: 191 PRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIE 250
PR R K + ++ I+D ++ + +L+ R I K+ + ++I ++ + E
Sbjct: 1236 PRLNRFGGLKLTANESNNKPILDYDVIKLYAKLNEDRRRNIGAKVSREGSEIWRDMLEFE 1295
Query: 251 A 251
A
Sbjct: 1296 A 1296
>gi|336463425|gb|EGO51665.1| hypothetical protein NEUTE1DRAFT_89273 [Neurospora tetrasperma FGSC
2508]
Length = 1437
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 52/178 (29%), Positives = 81/178 (45%), Gaps = 13/178 (7%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNT---- 126
V +F+ + + + SD D ++ + + PE +S GH L+ +T F+ G H T
Sbjct: 1198 VLNADFLPDGKELYIVASDADGHIHILQFDPEHPKSLQGHLLLHRTTFNTGAHHPTSSLL 1257
Query: 127 FFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHT 186
+ PS +S S + AS G L PL E YRRL L + H
Sbjct: 1258 LPAVYPNPSSLSSNSEENSPHILLLASPTGVLATLRPLQENAYRRLSSLAVQLTNGLPHP 1317
Query: 187 GGLNPRAFR--------TYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 236
GLNP+ +R + + G AG R I+DG ++ +FL+L G+R E+ + G
Sbjct: 1318 AGLNPKGYRLPSPSASASMQLPGVDAGI-GRNIVDGKILERFLELGTGKRQEMAGRAG 1374
>gi|148886831|sp|Q7SEY2.2|CFT1_NEUCR RecName: Full=Protein cft-1; AltName: Full=Cleavage factor two
protein 1
Length = 1456
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 52/178 (29%), Positives = 81/178 (45%), Gaps = 13/178 (7%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNT---- 126
V +F+ + + + SD D ++ + + PE +S GH L+ +T F+ G H T
Sbjct: 1248 VLNADFLPDGKELYIVASDADGHIHILQFDPEHPKSLQGHLLLHRTTFNTGAHHPTSSLL 1307
Query: 127 FFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHT 186
+ PS +S S + AS G L PL E YRRL L + H
Sbjct: 1308 LPAVYPNPSSLSSNSEENSPHILLLASPTGVLATLRPLQENAYRRLSSLAVQLTNGLPHP 1367
Query: 187 GGLNPRAFR--------TYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 236
GLNP+ +R + + G AG R I+DG ++ +FL+L G+R E+ + G
Sbjct: 1368 AGLNPKGYRLPSPSASASMQLPGVDAGI-GRNIVDGKILERFLELGTGKRQEMAGRAG 1424
>gi|164429683|ref|XP_964609.2| hypothetical protein NCU02082 [Neurospora crassa OR74A]
gi|157073577|gb|EAA35373.2| conserved hypothetical protein [Neurospora crassa OR74A]
Length = 1437
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 52/178 (29%), Positives = 81/178 (45%), Gaps = 13/178 (7%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNT---- 126
V +F+ + + + SD D ++ + + PE +S GH L+ +T F+ G H T
Sbjct: 1198 VLNADFLPDGKELYIVASDADGHIHILQFDPEHPKSLQGHLLLHRTTFNTGAHHPTSSLL 1257
Query: 127 FFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHT 186
+ PS +S S + AS G L PL E YRRL L + H
Sbjct: 1258 LPAVYPNPSSLSSNSEENSPHILLLASPTGVLATLRPLQENAYRRLSSLAVQLTNGLPHP 1317
Query: 187 GGLNPRAFR--------TYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 236
GLNP+ +R + + G AG R I+DG ++ +FL+L G+R E+ + G
Sbjct: 1318 AGLNPKGYRLPSPSASASMQLPGVDAGI-GRNIVDGKILERFLELGTGKRQEMAGRAG 1374
>gi|350297359|gb|EGZ78336.1| protein cft-1 [Neurospora tetrasperma FGSC 2509]
Length = 1437
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 52/178 (29%), Positives = 81/178 (45%), Gaps = 13/178 (7%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNT---- 126
V +F+ + + + SD D ++ + + PE +S GH L+ +T F+ G H T
Sbjct: 1198 VLNADFLPDGKELYIVASDADGHIHILQFDPEHPKSLQGHLLLHRTTFNTGAHHPTSSLL 1257
Query: 127 FFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHT 186
+ PS +S S + AS G L PL E YRRL L + H
Sbjct: 1258 LPAVYPNPSSLSSNSEENSPHILLLASPTGVLATLRPLQENAYRRLSSLAVQLTNGLPHP 1317
Query: 187 GGLNPRAFR--------TYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 236
GLNP+ +R + + G AG R I+DG ++ +FL+L G+R E+ + G
Sbjct: 1318 AGLNPKGYRLPSPSASASMQLPGVDAGI-GRNIVDGKILERFLELGTGKRQEMAGRAG 1374
>gi|443894082|dbj|GAC71432.1| mRNA cleavage and polyadenylation factor II complex, subunit CFT1
[Pseudozyma antarctica T-34]
Length = 1543
Score = 62.8 bits (151), Expect = 1e-07, Method: Composition-based stats.
Identities = 51/172 (29%), Positives = 85/172 (49%), Gaps = 21/172 (12%)
Query: 94 VVLFMYQPEARESNGGHRLIKKTDFH-----LGQHVNTFFKIRCKPSPISDAP-----GA 143
+ L+ Y P + GGHRL+ +++F +G V C+ +SD+
Sbjct: 1340 IRLYEYAPHVATTLGGHRLLLRSEFQTPAAAVGSTV-------CRGRWLSDSELRGREEG 1392
Query: 144 RSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYY 203
RS+ + A +GAL L +K +RL +LQ +V HT LNPRAFR + +
Sbjct: 1393 RSKLV--LAKANGALDSLSALDDKVAKRLHLLQGQLVRSVQHTAALNPRAFRAVRND-FV 1449
Query: 204 AGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSH 255
+ ++GI+D L+ +F+ LS + LE + + S D LD++ + S+H
Sbjct: 1450 PRSLAKGILDARLLDRFVWLSRPKMLEAVRTL-SGLFDGLDQIKKRKRDSNH 1500
>gi|164655043|ref|XP_001728653.1| hypothetical protein MGL_4214 [Malassezia globosa CBS 7966]
gi|159102535|gb|EDP41439.1| hypothetical protein MGL_4214 [Malassezia globosa CBS 7966]
Length = 1212
Score = 62.8 bits (151), Expect = 1e-07, Method: Composition-based stats.
Identities = 48/193 (24%), Positives = 87/193 (45%), Gaps = 8/193 (4%)
Query: 67 LPRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNT 126
+P V F+I+ + + + D + + L Y P S GG RL+ + ++H V
Sbjct: 1025 IPTSVTCGAFLIDRARLSIVTCDMNGCLRLMDYHPSNPTSLGGQRLLARCEYHAPGEV-- 1082
Query: 127 FFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHT 186
+ R P G A +GA+ +P+ EK + L + Q+ +V HT
Sbjct: 1083 -VRARMLHGPYLATSGECLTSEIVLAKRNGAVDVLVPVTEKIFPTLQLFQSQLVRMVRHT 1141
Query: 187 GGLNPRAFRTYKGKGYYAGNP-SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDIL-- 243
GLNPR FR + + P ++GI+DG+L+ +S + + + + ++ ++
Sbjct: 1142 AGLNPRGFRAVFNQ--HISRPLAKGILDGTLLHTAESMSRPKLTSLVRDLSTRTGGVIAD 1199
Query: 244 DELYDIEALSSHF 256
D L + L SH+
Sbjct: 1200 DLLRCLVHLQSHW 1212
>gi|326432241|gb|EGD77811.1| hypothetical protein PTSG_08901 [Salpingoeca sp. ATCC 50818]
Length = 1506
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 44/184 (23%), Positives = 88/184 (47%), Gaps = 2/184 (1%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLG-QHVNT 126
P VYA +F++ + + + D+ N+ + + + + GG L + + ++ G Q +
Sbjct: 1318 PTSVYAADFIVRGAELHVLFLDQHANMTILAFDSDDPTTRGGRILKRHSVYNTGHQRIVA 1377
Query: 127 FFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHT 186
+++ P S + FLT Y +L+G G+ +PE +RRL++LQ ++ H
Sbjct: 1378 LTRLQNVPPRNSRNATVDAHFLT-YQTLEGGAGYITSIPEDIFRRLMLLQLRLLPHLKFR 1436
Query: 187 GGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEL 246
GL+P AF+ YK + + I + + L L + E+ +++G+ + D+
Sbjct: 1437 AGLHPSAFKKYKSASLHMVHQEVRTICADVYTRLFMLDLDAQKEVARQVGTTTKQLCDDF 1496
Query: 247 YDIE 250
IE
Sbjct: 1497 LFIE 1500
>gi|238881599|gb|EEQ45237.1| conserved hypothetical protein [Candida albicans WO-1]
Length = 1423
Score = 62.0 bits (149), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 45/184 (24%), Positives = 77/184 (41%), Gaps = 15/184 (8%)
Query: 70 GVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFK 129
V +F+I + +++D + + L Y P+ +S G +L+ K F L ++
Sbjct: 1220 SVECADFIINDDEIFVLVADNNNVLHLLNYDPDDPQSINGTKLLTKASFELNSTISCLRS 1279
Query: 130 ---IRCKPSPISDA-----------PGARSRFLTWYASL-DGALGFFLPLPEKNYRRLLM 174
I + S +DA P S + S DG+ P+ E YRR+ +
Sbjct: 1280 LPLIDIEESVQTDAFTNIVVPPTLPPNTTSNYFQVIGSTQDGSFFNVFPINEAAYRRMYI 1339
Query: 175 LQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKK 234
LQ ++ H GLNPR R K ++ I+D L+ +F +LS + + K
Sbjct: 1340 LQQQLIDKEFHYCGLNPRLNRIGSIKLQNNETNTKPILDYDLIRRFTKLSDDRKRNLANK 1399
Query: 235 IGSK 238
+ K
Sbjct: 1400 VSGK 1403
>gi|344305212|gb|EGW35444.1| pre-mRNA 3'-end processing factor CF II [Spathaspora passalidarum
NRRL Y-27907]
Length = 1348
Score = 62.0 bits (149), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 47/181 (25%), Positives = 82/181 (45%), Gaps = 20/181 (11%)
Query: 70 GVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVN---- 125
V +F+ + + +I+D + + L Y P+ +S G RLI K+ F + V+
Sbjct: 1156 NVSCGDFITKDEDIYMLIADNNNILHLIQYDPDDPQSLNGQRLISKSAFEIESTVSCMRK 1215
Query: 126 ------TFFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVM 179
+F K K SPI + F ++ DG+ P+ E +YRR+ +LQ +
Sbjct: 1216 LPKIESSFEKSEIKFSPIDE-------FQIIGSTSDGSFFNVFPVDESSYRRMYILQQQL 1268
Query: 180 VTHTSHTGGLNPRAFRTYKGKGYYAGNP--SRGIIDGSLVWKFLQLSLGERLEICKKIGS 237
H GLNPR R + G N ++ I+D L+ ++ QL+ + + K+ +
Sbjct: 1269 TDKEYHYCGLNPRLNR-FGGAIELRDNETNTKPILDFGLIKRYAQLNEDRKRNLASKVSA 1327
Query: 238 K 238
K
Sbjct: 1328 K 1328
>gi|340924328|gb|EGS19231.1| hypothetical protein CTHT_0058560 [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 1460
Score = 61.6 bits (148), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 50/196 (25%), Positives = 81/196 (41%), Gaps = 32/196 (16%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
V +F+ + + + D D ++ + + PE +S GH L+ +T F+ G H T K
Sbjct: 1244 VLTADFLPDGKELFIVACDADGHIHILQFDPEHPKSLQGHLLLHRTSFNTGAHNPT--KS 1301
Query: 131 RCKPSPI-SDAPGA----------------RSRFLTWYA----------SLDGALGFFLP 163
PS + +D P S + A S G + P
Sbjct: 1302 LLLPSTLPTDTPSTIDGSNPNTNNTNGTPNASNLAPYDATERPHILLLCSPTGLIAALRP 1361
Query: 164 LPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAG---NPSRGIIDGSLVWKF 220
L E +YRRL L +V H GLNP+ +R AG + R I+DG+++ +F
Sbjct: 1362 LSESSYRRLSSLAAQLVNSLPHAAGLNPKGYRMPSADCPPAGVDASVGRNIVDGTVLERF 1421
Query: 221 LQLSLGERLEICKKIG 236
+L + R E+ + G
Sbjct: 1422 TELGMARRAELAGRAG 1437
>gi|353234640|emb|CCA66663.1| related to cleavage and polyadenylation specificity factor, 160 kDa
subunit [Piriformospora indica DSM 11827]
Length = 1324
Score = 61.6 bits (148), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 51/185 (27%), Positives = 81/185 (43%), Gaps = 6/185 (3%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P + +F + M + D + L Y P ++ G RL+ +F HV
Sbjct: 1142 PTSIPQGDFFFAHNDMELLTIDLRGVLRLHSYDPTHVDTEEGARLLCSVEFQ--THVEPV 1199
Query: 128 FKIRCK-PSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHT 186
+R P SD+ SR L +DG+L PL ++RL +LQ +V HT H
Sbjct: 1200 TIVRVAMEQPSSDSASDASRLL--IPRVDGSLASLSPLDMDIFKRLYLLQAQLVRHTHHI 1257
Query: 187 GGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEL 246
LNP+A+R +G SR ++D L+ F +LS + I +IG ++ +
Sbjct: 1258 AALNPKAYRAVQGSS-TTRTMSRRMLDFGLLVGFKKLSFDRQQGIANQIGETWETLIRDC 1316
Query: 247 YDIEA 251
+EA
Sbjct: 1317 TQLEA 1321
>gi|241954348|ref|XP_002419895.1| subunit of the mRNA cleavage and polyadenylation factor, putative
[Candida dubliniensis CD36]
gi|223643236|emb|CAX42110.1| subunit of the mRNA cleavage and polyadenylation factor, putative
[Candida dubliniensis CD36]
Length = 1420
Score = 61.2 bits (147), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 44/183 (24%), Positives = 77/183 (42%), Gaps = 14/183 (7%)
Query: 70 GVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFK 129
V +F+I + +++D + + L Y P+ +S G +L+ K F L ++
Sbjct: 1218 SVECADFIINDDEIFVLVADNNNVLHLLNYDPDDPQSINGTKLLTKASFELNSTISCLRS 1277
Query: 130 -----IRCKPSPISDAPGA---------RSRFLTWYASLDGALGFFLPLPEKNYRRLLML 175
I K +DA A ++ F ++ DG+ P+ E YRR+ +L
Sbjct: 1278 LPLKDIDEKVQNETDAAAAATIPLPNNTQNNFQVIGSTQDGSFFNVFPINEAAYRRMYIL 1337
Query: 176 QNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI 235
Q ++ H GLNPR R K ++ I+D L+ +F +LS + K+
Sbjct: 1338 QQQLIDKEFHYCGLNPRLNRIGSIKLQNNETNTKPILDYDLIRRFTKLSDDRKRNFANKV 1397
Query: 236 GSK 238
K
Sbjct: 1398 SGK 1400
>gi|68471006|ref|XP_720510.1| likely Cleavage and Polyadenylation Specificity Factor subunit
[Candida albicans SC5314]
gi|74591422|sp|Q5AFT3.1|CFT1_CANAL RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two protein
1
gi|46442380|gb|EAL01670.1| likely Cleavage and Polyadenylation Specificity Factor subunit
[Candida albicans SC5314]
Length = 1420
Score = 61.2 bits (147), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 45/184 (24%), Positives = 76/184 (41%), Gaps = 15/184 (8%)
Query: 70 GVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFK 129
V +F+I + +++D + + L Y P+ +S G +L+ K F L ++
Sbjct: 1217 SVECADFIINDDEIFVLVADNNNVLHLLNYDPDDPQSINGTKLLTKASFELNSTISCLRS 1276
Query: 130 ---IRCKPSPISDA-----------PGARSRFLTWYASL-DGALGFFLPLPEKNYRRLLM 174
I + S +DA P S + S DG+ P+ E YRR+ +
Sbjct: 1277 LPLIDIEESVQTDALTNIAVPPPLPPNTTSNYFQVIGSTQDGSFFNVFPINEAAYRRMYI 1336
Query: 175 LQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKK 234
LQ ++ H GLNPR R K ++ I+D L+ F +LS + + K
Sbjct: 1337 LQQQLIDKEFHYCGLNPRLNRIGSIKLQNNETNTKPILDYDLIRSFTKLSDDRKRNLANK 1396
Query: 235 IGSK 238
+ K
Sbjct: 1397 VSGK 1400
>gi|400600376|gb|EJP68050.1| CPSF A subunit region [Beauveria bassiana ARSEF 2860]
Length = 1174
Score = 61.2 bits (147), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 37/159 (23%), Positives = 77/159 (48%), Gaps = 8/159 (5%)
Query: 85 FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSP-ISDAPGA 143
++ +D N+++ P+A + RL ++ +LG+ +N ++ P+ + +P A
Sbjct: 1012 WLETDAQGNIIVLARNPDAPTEHDRSRLEITSEMNLGEQINKIQRLNVAPADNVVVSPKA 1071
Query: 144 RSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYY 203
+ AS++G L + + K L+ LQ + + TGG++ A+R ++ +
Sbjct: 1072 ------FLASIEGTLYLYGDIAPKYQDLLITLQTTIEKYVKTTGGISFDAWRAFRNQARE 1125
Query: 204 AGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
A P R +DG +V +FL L + +C+ +G D+
Sbjct: 1126 ADGPFR-FVDGEMVERFLDLRKQTQAALCQDLGLNVEDV 1163
>gi|403218521|emb|CCK73011.1| hypothetical protein KNAG_0M01580 [Kazachstania naganishii CBS 8797]
Length = 1345
Score = 61.2 bits (147), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 44/181 (24%), Positives = 87/181 (48%), Gaps = 5/181 (2%)
Query: 73 ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC 132
+ EF++ + F + D++ + + Y P+ S G RLI + F++ + ++
Sbjct: 1158 SLEFLVNGGDLYFTLIDRNDILHVLKYAPDEPNSLSGQRLIHCSSFNM-YSTTSCTRLIP 1216
Query: 133 KPSPISDAP--GARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 190
K D P A + DG+L +P+PE YRRL ++Q ++ + G+N
Sbjct: 1217 KNELFVDGPLNPAIQSYQVIGGQADGSLFKVMPVPETVYRRLYVVQQQIIDKETPLAGIN 1276
Query: 191 PRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK-HNDILDELYDI 249
P+ R Y + R ++D ++V +F +S+ +R + K+G + H DI ++ ++
Sbjct: 1277 PKMER-LSNDYYQTSHLLRPMLDYNVVKQFCAMSIPKRTTLAHKLGKRAHFDIWRDVINL 1335
Query: 250 E 250
E
Sbjct: 1336 E 1336
>gi|402085944|gb|EJT80842.1| cft-1 [Gaeumannomyces graminis var. tritici R3-111a-1]
Length = 1450
Score = 60.5 bits (145), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 45/179 (25%), Positives = 80/179 (44%), Gaps = 13/179 (7%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQH------- 123
V +F+ + ++ + +D + N+ +F + PE +S GH L+ +T F G H
Sbjct: 1249 VINVDFLPDGRNLLLVAADAEGNLHIFQFDPEHPKSLQGHLLLNRTTFSTGAHHPQKSLL 1308
Query: 124 VNTFFKIRCKPSPISDAPGARSRFL-TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
+ T +P+ DA A + A+ G L PL + Y RL L + +
Sbjct: 1309 MPTTSSNPSQPATNGDASAAAAGPQHILMAAPTGVLAAVQPLGQGVYTRLSALASNLAAS 1368
Query: 183 TSHTGGLNPRAFRT--YKGKGYYAG---NPSRGIIDGSLVWKFLQLSLGERLEICKKIG 236
H LNP+A+R + A + R ++DG+L+ ++ +L G R E+ + G
Sbjct: 1369 VPHHAALNPKAYRMPPAPARNQVAAVDISVGRAVVDGALLARWAELGSGRRAEVAGRAG 1427
>gi|68471460|ref|XP_720278.1| likely Cleavage and Polyadenylation Specificity Factor subunit
fragment [Candida albicans SC5314]
gi|46442138|gb|EAL01430.1| likely Cleavage and Polyadenylation Specificity Factor subunit
fragment [Candida albicans SC5314]
Length = 758
Score = 60.5 bits (145), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 45/184 (24%), Positives = 76/184 (41%), Gaps = 15/184 (8%)
Query: 70 GVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFK 129
V +F+I + +++D + + L Y P+ +S G +L+ K F L ++
Sbjct: 555 SVECADFIINDDEIFVLVADNNNVLHLLNYDPDDPQSINGTKLLTKASFELNSTISCLRS 614
Query: 130 ---IRCKPSPISDA-----------PGARSRFLTWYASL-DGALGFFLPLPEKNYRRLLM 174
I + S +DA P S + S DG+ P+ E YRR+ +
Sbjct: 615 LPLIDIEESVQTDALTNIAVPPPLPPNTTSNYFQVIGSTQDGSFFNVFPINEAAYRRMYI 674
Query: 175 LQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKK 234
LQ ++ H GLNPR R K ++ I+D L+ F +LS + + K
Sbjct: 675 LQQQLIDKEFHYCGLNPRLNRIGSIKLQNNETNTKPILDYDLIRSFTKLSDDRKRNLANK 734
Query: 235 IGSK 238
+ K
Sbjct: 735 VSGK 738
>gi|149237256|ref|XP_001524505.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
gi|146452040|gb|EDK46296.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
Length = 1380
Score = 60.1 bits (144), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 44/170 (25%), Positives = 77/170 (45%), Gaps = 12/170 (7%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
V +F+I+ + +++D + + L Y P+ +S G L+ K F L +
Sbjct: 1201 VECADFIIKDEDIFILVADNNNCIHLIQYDPDDPKSINGTILLNKASFELNSATTC---L 1257
Query: 131 RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 190
R P G + + ++LDGAL P+ E YRR+ +LQ + H GLN
Sbjct: 1258 RSIPK------GEKGDYQIIGSTLDGALYNVFPVNEFTYRRMYILQQQISDKVYHFCGLN 1311
Query: 191 PRAFRTYKGKGYYAG--NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 238
PR R + G ++ I+D L+ +F +L+L + ++ KI +
Sbjct: 1312 PRLNR-FGGSVTLRDRETNTKPILDYGLIRRFSKLNLDRQQQLAGKISVR 1360
>gi|354547787|emb|CCE44522.1| hypothetical protein CPAR2_403250 [Candida parapsilosis]
Length = 1334
Score = 60.1 bits (144), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 48/185 (25%), Positives = 85/185 (45%), Gaps = 16/185 (8%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
V +F+++ + +++D + + L + P+ +S G L+ K F
Sbjct: 1155 VECADFIVKDEEIFILVADINNGLHLIQFDPDDPKSINGTILVNKASFETNSQTT----- 1209
Query: 131 RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 190
C S D G + T +++DGA P+ E YRR+ ++Q + H GLN
Sbjct: 1210 -CLRSVPKDEAG---DYQTIGSTIDGAFFNVFPVNESTYRRMYIVQQQISDKEFHHCGLN 1265
Query: 191 PRAFRTYKGKGYY--AGNPSRGIIDGSLVWKFLQLSLGERLEICKKI---GSKHNDILDE 245
PR R + G + ++ I+D +L+ +F +L+L + I KI GS H DI +
Sbjct: 1266 PRLNR-FGGAIQIRDSDTNAKPILDYNLIREFAKLNLDRQRNIATKINIKGSAH-DIWKD 1323
Query: 246 LYDIE 250
L ++E
Sbjct: 1324 LIELE 1328
>gi|121925707|sp|Q0UUE2.1|CFT1_PHANO RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two protein
1
Length = 1375
Score = 60.1 bits (144), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 45/170 (26%), Positives = 77/170 (45%), Gaps = 7/170 (4%)
Query: 74 CEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNT--FFKIR 131
+F+ + +++D D N+ + + P+ +S GG RL++K+ FH G +T + R
Sbjct: 1205 ADFLPFEEQLHIIVADADMNLQVLQFDPDHPKSMGGTRLLQKSTFHTGHFPSTMHLLQSR 1264
Query: 132 CKPSPISDAPGARSRFLTWY----ASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 187
S+ + + L + S G L PL E +YRRL L +
Sbjct: 1265 LHMPTASEFTTSTTSSLPLHQILCTSQSGTLALITPLSESSYRRLSGLATHLQQFLDSPC 1324
Query: 188 GLNPRAFRTYK-GKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 236
GLN +AFR +G + R ++DG L+ ++ +L R E K+G
Sbjct: 1325 GLNGKAFRAADVMEGGWDAGTQRAMLDGGLLMRWGELGEQRRREGLGKVG 1374
>gi|169603229|ref|XP_001795036.1| hypothetical protein SNOG_04622 [Phaeosphaeria nodorum SN15]
gi|160706354|gb|EAT88382.2| hypothetical protein SNOG_04622 [Phaeosphaeria nodorum SN15]
Length = 1338
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 45/170 (26%), Positives = 77/170 (45%), Gaps = 7/170 (4%)
Query: 74 CEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNT--FFKIR 131
+F+ + +++D D N+ + + P+ +S GG RL++K+ FH G +T + R
Sbjct: 1168 ADFLPFEEQLHIIVADADMNLQVLQFDPDHPKSMGGTRLLQKSTFHTGHFPSTMHLLQSR 1227
Query: 132 CKPSPISDAPGARSRFLTWY----ASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 187
S+ + + L + S G L PL E +YRRL L +
Sbjct: 1228 LHMPTASEFTTSTTSSLPLHQILCTSQSGTLALITPLSESSYRRLSGLATHLQQFLDSPC 1287
Query: 188 GLNPRAFRTYK-GKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 236
GLN +AFR +G + R ++DG L+ ++ +L R E K+G
Sbjct: 1288 GLNGKAFRAADVMEGGWDAGTQRAMLDGGLLMRWGELGEQRRREGLGKVG 1337
>gi|190348091|gb|EDK40482.2| hypothetical protein PGUG_04580 [Meyerozyma guilliermondii ATCC 6260]
Length = 1320
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 42/185 (22%), Positives = 84/185 (45%), Gaps = 7/185 (3%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
V EF+ + + +I+ + + L + PE S+ G RL+ + F++ +
Sbjct: 1136 VSCAEFISKDEEIYILIAGNNNVMHLVQFDPEDPTSSNGQRLVHRASFNVSSSTTC---M 1192
Query: 131 RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 190
R P F T +++DG+ P+ E YRR+ ++Q + H GLN
Sbjct: 1193 RMVPKNEEINTQYSDVFQTVGSTIDGSFFTVFPVNEFTYRRMYIIQQQLTDKEYHYCGLN 1252
Query: 191 PRAFRTYKGKGYYAGNPS-RGIIDGSLVWKFLQLSLGERLEICKKIGSK--HNDILDELY 247
PR R + G+ + + I+D ++ ++ +L+ + I +K+ SK + +I +L
Sbjct: 1253 PRLNR-FGGEAFDDSQTGVKPILDHQVIKRYAKLNEDRKQTIAQKVSSKGVYQEIWKDLI 1311
Query: 248 DIEAL 252
+ E +
Sbjct: 1312 EFENV 1316
>gi|146415762|ref|XP_001483851.1| hypothetical protein PGUG_04580 [Meyerozyma guilliermondii ATCC 6260]
Length = 1320
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 42/185 (22%), Positives = 84/185 (45%), Gaps = 7/185 (3%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
V EF+ + + +I+ + + L + PE S+ G RL+ + F++ +
Sbjct: 1136 VSCAEFISKDEEIYILIAGNNNVMHLVQFDPEDPTSSNGQRLVHRASFNVSSSTTC---M 1192
Query: 131 RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 190
R P F T +++DG+ P+ E YRR+ ++Q + H GLN
Sbjct: 1193 RMVPKNEEINTQYSDVFQTVGSTIDGSFFTVFPVNEFTYRRMYIIQQQLTDKEYHYCGLN 1252
Query: 191 PRAFRTYKGKGYYAGNPS-RGIIDGSLVWKFLQLSLGERLEICKKIGSK--HNDILDELY 247
PR R + G+ + + I+D ++ ++ +L+ + I +K+ SK + +I +L
Sbjct: 1253 PRLNR-FGGEAFDDSQTGVKPILDHQVIKRYAKLNEDRKQTIAQKVSSKGVYQEIWKDLI 1311
Query: 248 DIEAL 252
+ E +
Sbjct: 1312 EFENV 1316
>gi|452825139|gb|EME32137.1| cleavage and polyadenylation specificity factor subunit-like protein
[Galdieria sulphuraria]
Length = 1454
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 47/156 (30%), Positives = 78/156 (50%), Gaps = 12/156 (7%)
Query: 52 LFVSDRSKRAN--EQPGLPRGVYACEFVIESSSMGFMISDKDKNVVLFMYQ-PEARESNG 108
LF DR+K N + P + EF+I + + F++SD N+ L Y P ES G
Sbjct: 1240 LFWRDRNKSLNFLGKDFDPVQTLSTEFLILNEFILFVVSDNFGNLHLLEYAGPHEIESRG 1299
Query: 109 GHRLIKKTDFHLGQHVNTFFKIRC--KPSPISDAPGARSRFL-TWYASLDGALGFFLPLP 165
G +L+++ HLG ++ ++R K + D G+ L TW DG L LPL
Sbjct: 1300 GEKLLRRGVLHLGTRSSSMIRLRTDWKENNSEDRAGSHIVVLGTW----DGGLACLLPLQ 1355
Query: 166 EKNY--RRLLMLQNVMVTHTSHTGGLNPRAFRTYKG 199
++ Y + L+ + + +++ + GLNP+ FR +G
Sbjct: 1356 QEEYEQKNELLKKVYLHSYSLYVAGLNPQEFRIPRG 1391
>gi|242208344|ref|XP_002470023.1| predicted protein [Postia placenta Mad-698-R]
gi|220730923|gb|EED84773.1| predicted protein [Postia placenta Mad-698-R]
Length = 696
Score = 58.5 bits (140), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 35/113 (30%), Positives = 54/113 (47%), Gaps = 4/113 (3%)
Query: 85 FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGAR 144
+I D+D + ++ Y P ES GG L+++T+FH I + +D P AR
Sbjct: 572 LLIGDEDGVIRIYEYDPHDPESRGGQHLLRRTEFHGQMESRMSILIIRRRGKDTDIPQAR 631
Query: 145 SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTY 197
S +G+L F + E +RL +LQ + + H GLNP+ FR Y
Sbjct: 632 ----LISGSTNGSLSMFTYVDEVASKRLHLLQGQLTRNVQHVVGLNPKVFRPY 680
>gi|255956643|ref|XP_002569074.1| Pc21g20880 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211590785|emb|CAP96985.1| Pc21g20880 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 1140
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 44/168 (26%), Positives = 81/168 (48%), Gaps = 8/168 (4%)
Query: 85 FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGAR 144
F+ SD + N+++ + HRL+ ++ LG+ VN R +P I
Sbjct: 977 FLQSDAEGNLIVLSRNMNGVTAQDKHRLMPTSEISLGEMVN-----RIRPVNIPQLSSVM 1031
Query: 145 SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYA 204
+ A+++G++ F + ++ L+ LQ + T + G L+ FR+++ A
Sbjct: 1032 VTPRAFMATVEGSIFLFAVINPEHQDFLMTLQASLSTKINSLGNLSFDKFRSFRTMVRSA 1091
Query: 205 GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEAL 252
P R +DG L+ +FL S + EI ++IGS +D+++ IEAL
Sbjct: 1092 EAPYR-FVDGELIEQFLNCSPSMQEEIVQEIGS--SDVVEVKRMIEAL 1136
>gi|426235955|ref|XP_004011942.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1 [Ovis aries]
Length = 819
Score = 57.8 bits (138), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 29/70 (41%), Positives = 43/70 (61%)
Query: 187 GGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEL 246
GLNPRAFR N R ++DG L+ ++L LS ER E+ KKIG+ + ILD+L
Sbjct: 750 AGLNPRAFRMLHVDRRVLQNAVRNVLDGELLNRYLYLSTMERGELAKKIGTTPDIILDDL 809
Query: 247 YDIEALSSHF 256
+ + +++HF
Sbjct: 810 LETDRVTAHF 819
>gi|255720869|ref|XP_002545369.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
gi|240135858|gb|EER35411.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
Length = 1351
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 42/171 (24%), Positives = 73/171 (42%), Gaps = 14/171 (8%)
Query: 70 GVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFK 129
V +F+I+ + +++D + + L Y P+ +S G +L+ K F L ++
Sbjct: 1173 SVECADFIIKDDEVYILVADNNNVLHLLNYDPDDPQSINGTKLLTKASFELASPISCL-- 1230
Query: 130 IRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGL 189
+ PI D + F + DG+ P+ E YRR+ +LQ + H GL
Sbjct: 1231 ---RTLPIDD-----NNFQIIGSCQDGSFFNVFPINESTYRRMYILQQQLTEKEYHYCGL 1282
Query: 190 NPRAFRTYKGKGYYAGNPS--RGIIDGSLVWKFLQLSLGERLEICKKIGSK 238
NPR R G N + + I+D L+ F +L+ + K+ K
Sbjct: 1283 NPRLNRV--GNLALKDNDTNIKPILDYGLIRIFAKLNTDRKKAFANKVSGK 1331
>gi|428164905|gb|EKX33915.1| hypothetical protein GUITHDRAFT_158867 [Guillardia theta CCMP2712]
Length = 1092
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 35/131 (26%), Positives = 60/131 (45%), Gaps = 12/131 (9%)
Query: 111 RLIKKTDFHLGQHVNTFFK----IRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPE 166
RL ++HLG VN F + +R +P +D + +++G +G L +
Sbjct: 953 RLEVVGEYHLGDMVNRFQRGSLVLRSSDTPTTDT--------IIFGTVNGMIGVIAVLSK 1004
Query: 167 KNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLG 226
+ Y LL +Q+ + GGL +R+++ + +G IDG L+ FL L
Sbjct: 1005 EEYEFLLKVQDALNFVIKGVGGLRHEDWRSFENERTQGARAPKGFIDGDLIESFLDLRRE 1064
Query: 227 ERLEICKKIGS 237
+ E+C IGS
Sbjct: 1065 KMEEVCHAIGS 1075
>gi|300122534|emb|CBK23104.2| unnamed protein product [Blastocystis hominis]
Length = 172
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 35/141 (24%), Positives = 65/141 (46%), Gaps = 10/141 (7%)
Query: 112 LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRR 171
++++ DFHL + + PIS G + + + +GA+G FL + + Y +
Sbjct: 28 VVRQADFHLASQITSIL-------PISLPDG---QCINVILTAEGAMGVFLFVTGEEYTK 77
Query: 172 LLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEI 231
L LQ ++ LN FR Y G +G++D ++ K+L LS E+ +I
Sbjct: 78 LSSLQKRLIEALPQNAALNNFNFRKYMSDGMMKYPRRKGVLDMGVIRKYLMLSTQEQEDI 137
Query: 232 CKKIGSKHNDILDELYDIEAL 252
K + + +I + +Y + L
Sbjct: 138 AKSLDLETKEITEVIYRTDKL 158
>gi|425777692|gb|EKV15851.1| UV-damaged DNA binding protein, putative [Penicillium digitatum Pd1]
gi|425779888|gb|EKV17916.1| UV-damaged DNA binding protein, putative [Penicillium digitatum
PHI26]
Length = 1140
Score = 53.9 bits (128), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 42/168 (25%), Positives = 79/168 (47%), Gaps = 8/168 (4%)
Query: 85 FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGAR 144
F+ SD + N+++ + HRLI ++ LG+ VN R +P I
Sbjct: 977 FLQSDAEGNLIVLSRNTNGVTAQDKHRLIPTSEISLGEMVN-----RIRPVHIPQLCSVM 1031
Query: 145 SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYA 204
+ A+++G++ F + ++ L+ LQ + + G L+ FR ++ A
Sbjct: 1032 VTPRAFMATVEGSIFLFAVINPEHQDFLMTLQAALSQKLNSLGNLSFDKFRGFRTMVRSA 1091
Query: 205 GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEAL 252
P R +DG L+ +FL+ + + EI +++GS +D+ + IEAL
Sbjct: 1092 AAPYR-FVDGELIEQFLKCTPSMQEEIAQEVGS--SDVGEVKRLIEAL 1136
>gi|449019486|dbj|BAM82888.1| similar to cleavage and polyadenylation specificity factor subunit
[Cyanidioschyzon merolae strain 10D]
Length = 1880
Score = 53.1 bits (126), Expect = 1e-04, Method: Composition-based stats.
Identities = 45/188 (23%), Positives = 83/188 (44%), Gaps = 9/188 (4%)
Query: 69 RGVYACEFVIESSSMGFMISDKDKNVV-LFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
R + A E+++ + + +D N++ L + P ES GG R+ + FH+G N
Sbjct: 1683 RELVASEWLMHGTKANVLSADMLGNIIELSIPSPVDPESAGGTRMTFEAGFHVGSRPNAV 1742
Query: 128 FKIRCK-PSPISDAPGARSRFLTWY----ASLDGALGFFLPLPEKNYRRL-LMLQNVMVT 181
++R PS + P S + ++DG + PL ++L L Q++M+
Sbjct: 1743 RRVRIDDPSAETPPPNEPSSLWNTHVILLGTVDGMITMVSPLLRGVAKKLELAAQDLMLE 1802
Query: 182 HTSHTGGLNPRAFRTYKGKGYYAG--NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKH 239
L R++R + AG P R I+DG ++ + L R EI ++IG
Sbjct: 1803 PELRKWCLYARSWRVMRSLTVAAGLRKPKRSILDGDVLQLYGSLDTPRRKEIARRIGMPQ 1862
Query: 240 NDILDELY 247
+ + ++
Sbjct: 1863 EALFEAIF 1870
>gi|385304556|gb|EIF48568.1| rna-binding subunit of the mrna cleavage and polyadenylation factor
[Dekkera bruxellensis AWRI1499]
Length = 289
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 35/126 (27%), Positives = 56/126 (44%), Gaps = 6/126 (4%)
Query: 73 ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC 132
AC+F++ + + +D+D + L Y P ES G +L+ ++ F +
Sbjct: 145 ACDFIVXEGGLYIIAADEDSVLHLLEYDPYDPESMKGXKLLTRSVFRFNGYTTAMRLCDR 204
Query: 133 KPS------PISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHT 186
K S ++ PGA F +++G+ P E YRRL LQN + SH
Sbjct: 205 KNSIFSMLDTLAIPPGADLGFEVIGCNIEGSFYKVTPANEYTYRRLYALQNHISDKESHW 264
Query: 187 GGLNPR 192
GLNP+
Sbjct: 265 LGLNPK 270
>gi|241952575|ref|XP_002419009.1| pre-mRNA-splicing factor, putative; pre-spliceosome component,
putative [Candida dubliniensis CD36]
gi|223642349|emb|CAX42591.1| pre-mRNA-splicing factor, putative [Candida dubliniensis CD36]
Length = 1187
Score = 52.8 bits (125), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 41/141 (29%), Positives = 66/141 (46%), Gaps = 17/141 (12%)
Query: 110 HRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNY 169
++L +FH+G + T + C + G S Y L G +G +PL K+
Sbjct: 1057 YKLQNLIEFHIGDII-TSLNLGCL-----NLAGTESVI---YTGLQGTIGLLVPLVSKSE 1107
Query: 170 RRLLM-LQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGER 228
LL LQ +M ++ G + FR+Y NP + +IDG L+ +FL+ R
Sbjct: 1108 VELLFNLQLLMQQFQNNLVGKDHLKFRSYY-------NPIKNVIDGDLLERFLEFDTSLR 1160
Query: 229 LEICKKIGSKHNDILDELYDI 249
+EI +K+ NDI +L D+
Sbjct: 1161 IEISRKLNKSVNDIEKKLIDL 1181
>gi|71021721|ref|XP_761091.1| hypothetical protein UM04944.1 [Ustilago maydis 521]
gi|46100541|gb|EAK85774.1| hypothetical protein UM04944.1 [Ustilago maydis 521]
Length = 1597
Score = 52.8 bits (125), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 42/161 (26%), Positives = 77/161 (47%), Gaps = 6/161 (3%)
Query: 78 IESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPI 137
I S +G S + V+ +Y+ + GG +L+ +++F ++ C +
Sbjct: 1368 ISGSRLGGDASMEASAGVIRLYEYAPHLAVGGKKLVLRSEFQTTSEA--VARVECSGRWL 1425
Query: 138 SDAP--GARS-RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAF 194
SD+ G + R +A +G++ + EK +RL +LQ +V HT LNPR+F
Sbjct: 1426 SDSELRGRETLRNKVVFAKANGSVESVAAVDEKVGKRLHLLQGQLVRSVMHTAALNPRSF 1485
Query: 195 RTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI 235
R + Y +G++D L+ +F++LS + LE K +
Sbjct: 1486 RMVRND-YVPRALVKGVLDARLLDEFMRLSRPKMLEAVKTL 1525
>gi|298711490|emb|CBJ26578.1| n/a [Ectocarpus siliculosus]
Length = 1135
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 39/163 (23%), Positives = 71/163 (43%), Gaps = 8/163 (4%)
Query: 85 FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFK--IRCKPSPISDAPG 142
++ + D N+ +A RL + +FHLG+ VN F + + + S ++ G
Sbjct: 956 YIGGEADCNIFTLRRNADAATEEERARLEIQGEFHLGEFVNKFCRGSLLMQSSEVNSPGG 1015
Query: 143 ARSRFLT----WYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYK 198
S + + +++G +G L L E N+R L LQ M GG + +R++
Sbjct: 1016 MDSPLVKGQPLLFGTVNGMVGTILTLTEDNHRFLAQLQTAMTKVVKGVGGFSHDEWRSFT 1075
Query: 199 GKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHND 241
+ PS IDG LV +L + + E+ + + + D
Sbjct: 1076 NGRRTS--PSSNFIDGDLVESYLDMPRHNQEEVLRHVDTPVGD 1116
>gi|384250802|gb|EIE24281.1| hypothetical protein COCSUDRAFT_28729 [Coccomyxa subellipsoidea
C-169]
Length = 1101
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 46/213 (21%), Positives = 94/213 (44%), Gaps = 35/213 (16%)
Query: 28 ELLIYQAFRHPKGTLKLRFKKLKVLFVSDRSKRANEQPGLPRGVYACEFVIESSSMGFMI 87
+LLIY+A +G L++R + K +++ A E + + + +G
Sbjct: 900 QLLIYRA---DEGILEVRARDYKTHWMT-----------------AVEVLDDDTYLG--- 936
Query: 88 SDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF----FKIRCKPSPISDAPGA 143
++ N+ +A +RL +HLG VN F ++ S + P
Sbjct: 937 AENSNNIFTLRKNTDAAADEDRNRLETVGQYHLGVFVNRFRHGSLVMKLPDSEAAKIP-- 994
Query: 144 RSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYY 203
+ +++G++G LP++ ++ L LQ+ + GGL+ A+RT++ + +
Sbjct: 995 ----TVLFVTINGSIGVIASLPQQQFQFLSRLQDCLRKVIKGVGGLSHVAWRTFQDE--H 1048
Query: 204 AGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 236
PS+ +DG L+ +FL L + +++G
Sbjct: 1049 TKMPSQNFVDGDLIEQFLDLKRDSMERVAREMG 1081
>gi|68476233|ref|XP_717766.1| potential spliceosomal U2 snRNP complex SF3b component [Candida
albicans SC5314]
gi|68476422|ref|XP_717672.1| potential spliceosomal U2 snRNP complex SF3b component [Candida
albicans SC5314]
gi|74586274|sp|Q5A7S5.1|RSE1_CANAL RecName: Full=Pre-mRNA-splicing factor RSE1
gi|46439394|gb|EAK98712.1| potential spliceosomal U2 snRNP complex SF3b component [Candida
albicans SC5314]
gi|46439495|gb|EAK98812.1| potential spliceosomal U2 snRNP complex SF3b component [Candida
albicans SC5314]
Length = 1219
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 40/141 (28%), Positives = 66/141 (46%), Gaps = 17/141 (12%)
Query: 110 HRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNY 169
++L +FH+G + T F + C + G S Y L G +G +PL K+
Sbjct: 1089 YKLQNLIEFHIGDII-TSFNLGCL-----NLAGTESVI---YTGLQGTIGLLIPLVSKSE 1139
Query: 170 RRLLM-LQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGER 228
LL LQ M ++ G + R+Y NP + +IDG L+ +FL+ + +
Sbjct: 1140 VELLFNLQLYMQQSQNNLVGKDHLKLRSYY-------NPIKNVIDGDLLERFLEFDISLK 1192
Query: 229 LEICKKIGSKHNDILDELYDI 249
+EI +K+ NDI +L D+
Sbjct: 1193 IEISRKLNKSVNDIEKKLIDL 1213
>gi|302831157|ref|XP_002947144.1| hypothetical protein VOLCADRAFT_87503 [Volvox carteri f. nagariensis]
gi|300267551|gb|EFJ51734.1| hypothetical protein VOLCADRAFT_87503 [Volvox carteri f. nagariensis]
Length = 2830
Score = 52.4 bits (124), Expect = 2e-04, Method: Composition-based stats.
Identities = 39/132 (29%), Positives = 59/132 (44%), Gaps = 12/132 (9%)
Query: 69 RGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESN----GGHRLIKKTDFHLGQHV 124
R V VI + F+ +D N+ L + +R +N G RL H+ + +
Sbjct: 2634 RDVLTAGVVINEPKLAFLAADAAGNLALSEFY-GSRNTNPEFWAGQRLAPLGLMHVARRL 2692
Query: 125 NTFFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNY-RRLLMLQNVMVTHT 183
+ I+ P SD ++R + +G L + P+P+ +RLL LQN M
Sbjct: 2693 SCCVSIKM---PTSDG---KNRHALLCGAAEGGLSYIAPVPDAEMTQRLLALQNHMSRRL 2746
Query: 184 SHTGGLNPRAFR 195
H GLNPRAFR
Sbjct: 2747 PHVAGLNPRAFR 2758
>gi|343425828|emb|CBQ69361.1| related to cleavage and polyadenylation specificity factor, 160 kDa
subunit [Sporisorium reilianum SRZ2]
Length = 1567
Score = 52.4 bits (124), Expect = 2e-04, Method: Composition-based stats.
Identities = 42/152 (27%), Positives = 71/152 (46%), Gaps = 13/152 (8%)
Query: 90 KDKNVV-LFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAP-----GA 143
+D VV L+ Y P +N RL+ +T+F + C+ +SD+
Sbjct: 1354 RDAGVVRLYEYAPHVASANT--RLVLRTEFQTASPATA--SVACRGRWLSDSELRGREHG 1409
Query: 144 RSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYY 203
R++ + A +GAL ++ +RL +LQ +V HT LNPRAFR + +
Sbjct: 1410 RNKLV--LAKANGALETLAAADDRVAKRLHVLQGQLVRSVLHTAALNPRAFRAVRND-FV 1466
Query: 204 AGNPSRGIIDGSLVWKFLQLSLGERLEICKKI 235
+ +G++D L+ F+ LS + LE K +
Sbjct: 1467 SRALGKGVLDARLLDSFVYLSRPKMLEAVKTL 1498
>gi|255080490|ref|XP_002503825.1| predicted protein [Micromonas sp. RCC299]
gi|226519092|gb|ACO65083.1| predicted protein [Micromonas sp. RCC299]
Length = 1114
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 38/174 (21%), Positives = 80/174 (45%), Gaps = 11/174 (6%)
Query: 85 FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFK----IRCKPSPISDA 140
++ ++ + N+ Q +A RL ++H+G+ VN F + +R +D
Sbjct: 946 YLGAENNFNLFTLRRQSDAATDEERSRLEVVGEYHVGEFVNRFRRGSLVMRLPDQENADV 1005
Query: 141 PGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGK 200
P + ++ G +G LP + + L LQ + S GGL+ A+R+++ +
Sbjct: 1006 P------TLLFGTVSGVIGVLATLPREQFEFLSALQAALNKTVSGVGGLSHDAWRSFQNE 1059
Query: 201 G-YYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALS 253
+ A + +RG +DG L+ FL L + E+ + +++ + D++ L+
Sbjct: 1060 HRHRAKDGARGFVDGDLIESFLDLRPEKAREVAAAVKLSVDELTRRVEDLQRLT 1113
>gi|348681092|gb|EGZ20908.1| hypothetical protein PHYSODRAFT_259403 [Phytophthora sojae]
Length = 1137
Score = 52.0 bits (123), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 40/165 (24%), Positives = 72/165 (43%), Gaps = 15/165 (9%)
Query: 85 FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF-----FKIRCKPSPISD 139
++ S+ D N+ A RL +FHLG+ VN F P+ + D
Sbjct: 950 YIGSETDFNLFTVQRNSGAASDEERGRLETVGEFHLGEFVNRFRYGSLTPAAAGPTDMVD 1009
Query: 140 A-------PGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPR 192
P A+++ + + ++ G +G LPL + Y LL +Q + GG + +
Sbjct: 1010 VVEQAPIVPAAQNQSM-LFGTVSGMIGVILPLTKDQYSFLLRVQQALTQVVKGVGGFSHK 1068
Query: 193 AFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGS 237
+R ++ + + + +R IDG LV FL L + ++ K+ S
Sbjct: 1069 DWRMFENR--RSVSEARNFIDGDLVESFLDLPKAQMTKVVDKLNS 1111
>gi|281208174|gb|EFA82352.1| UV-damaged DNA binding protein1 [Polysphondylium pallidum PN500]
Length = 1054
Score = 51.6 bits (122), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 44/172 (25%), Positives = 71/172 (41%), Gaps = 8/172 (4%)
Query: 85 FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGAR 144
F+ D N+++ EA L FHLG+ +N F SP D+P
Sbjct: 890 FIGGDNSYNLIVVQKNAEASSEIDNELLDTVGQFHLGETINKFKHGSLVTSPDMDSPKLP 949
Query: 145 SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYA 204
+ + ++ GA+G + + + +Y LQ + GGL +R++ + +
Sbjct: 950 T---ILFGTVSGAIGVIVSISKDDYEFFEKLQKGLNRVVHGVGGLPFENWRSFSTE--HM 1004
Query: 205 GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 256
PS+ IDG L+ FL L + LE K + I D IE+L H
Sbjct: 1005 TIPSKNFIDGDLIETFLDLRHDKMLEAIKDMNIS---IEDTYRRIESLMHHI 1053
>gi|384253955|gb|EIE27429.1| hypothetical protein COCSUDRAFT_64224 [Coccomyxa subellipsoidea
C-169]
Length = 1137
Score = 51.6 bits (122), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 51/201 (25%), Positives = 84/201 (41%), Gaps = 22/201 (10%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQ--PEARESNGGHRLIKKTDFHLGQHVNTFF 128
V A EF++ + + D +N+ LF Y E + + G +L+ H+GQ++ +
Sbjct: 944 VEAAEFLVNGKKLFLLACDAAQNLRLFAYDGGKEQQATWQGKKLLPLGAIHVGQNICSSL 1003
Query: 129 KIRCKPSPISDAPGARSRFLTWYASLDGALGFFLP----LPEKNYRRLLMLQNVMVTHTS 184
R P+ A G + R + S G++ P LP + LL LQ MV
Sbjct: 1004 SHRITPA---SATGVQLRAAV-FGSAAGSIASLAPTWDGLPAE---ELLALQREMVLAVP 1056
Query: 185 HTGGLNPRAF-RTYK--------GKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI 235
GLNP +F R YK G+ + A ++D + +F L L E++ + K
Sbjct: 1057 QVAGLNPVSFRRRYKHGVKALAGGQSFEAPVSDDRVLDLDQLNRFQWLPLTEQVALAAKC 1116
Query: 236 GSKHNDILDELYDIEALSSHF 256
+L L ++ S F
Sbjct: 1117 NLSRQQVLHALREMVMAISTF 1137
>gi|258572939|ref|XP_002540651.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
gi|237900917|gb|EEP75318.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
Length = 1144
Score = 51.6 bits (122), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 41/167 (24%), Positives = 74/167 (44%), Gaps = 7/167 (4%)
Query: 84 GFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGA 143
F++SD + N+V+ + R+ ++ LG VN ++ + S +P
Sbjct: 976 AFLVSDAEGNLVVLNRNTTGVTEDDKRRMQITSELRLGTMVNRIRRMDLQAS--QSSPVI 1033
Query: 144 RSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYY 203
FL A+ DG++ F + + L+ LQ+ + + + GG+ +R +K
Sbjct: 1034 PKAFL---ATTDGSIYLFGVIAQFAQDLLMRLQSALASFVASPGGIPFSGYRAFKSATRQ 1090
Query: 204 AGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI-LDELYDI 249
A P R +DG LV +FL L + + K+ D+ L +L DI
Sbjct: 1091 ADEPFR-FVDGELVEQFLDCPLEVQEAVLAKMDGGGRDVTLSQLKDI 1136
>gi|412992547|emb|CCO18527.1| predicted protein [Bathycoccus prasinos]
Length = 1275
Score = 51.2 bits (121), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 38/140 (27%), Positives = 66/140 (47%), Gaps = 6/140 (4%)
Query: 118 FHLGQHVNTFFK--IRCKPSPISDAPGARSRFLTW-YASLDGALGFFLPLPEKNYRRLLM 174
FHLG VN F + + + +SD + S TW + ++ G LG LP++++ L
Sbjct: 1137 FHLGDSVNRFRRGSLVMRMPDLSDDTSSLSEISTWLFGTISGGLGVVATLPKRDFMLLNK 1196
Query: 175 LQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKK 234
+Q M + G + FR++ R IDG LV FL LS +++ + +
Sbjct: 1197 VQEAMQKVVTGVGNFSHSDFRSFHNVQRSV--EMRNFIDGDLVEIFLDLSKEDQVAVSEL 1254
Query: 235 IG-SKHNDILDELYDIEALS 253
G S D++ ++ +I L+
Sbjct: 1255 SGVSNSEDLVKKIEEISRLT 1274
>gi|223994993|ref|XP_002287180.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220976296|gb|EED94623.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 1517
Score = 50.8 bits (120), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 38/155 (24%), Positives = 70/155 (45%), Gaps = 6/155 (3%)
Query: 74 CEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIR-C 132
C V ++ ++ S+ N+ + + A RL ++HLG+ N F
Sbjct: 1296 CTAVEMLTNGTYLGSEGFNNLFVLRHNANASSEEARVRLDTVGEYHLGEMTNKFMGGSLI 1355
Query: 133 KPSPISDAPGARSRFL---TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGL 189
PS GA++ ++ T + ++DG++G L L + L LQ +++ G +
Sbjct: 1356 MPSNSGGIMGAQNAYVGSQTLFGTVDGSIGSVLGLDGPTFAFLACLQRAILSIVKTVGDI 1415
Query: 190 NPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLS 224
+ +R ++ + PSRG IDG L+ FL L+
Sbjct: 1416 SHEEYRAFRAERQV--RPSRGFIDGDLIETFLDLN 1448
>gi|322704830|gb|EFY96421.1| Cleavage factor two protein 1 [Metarhizium anisopliae ARSEF 23]
Length = 1433
Score = 50.4 bits (119), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 42/187 (22%), Positives = 82/187 (43%), Gaps = 9/187 (4%)
Query: 73 ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHV--NTFFKI 130
A +F+ + + + D + ++ + + PE +S GH L+ +T F + + +T
Sbjct: 1240 AADFLPDGEDLSMVAVDAEGDLHILEFNPEHPKSLQGHLLLHRTSFAVTPNTPSSTLLLP 1299
Query: 131 RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 190
R A + S + A G L PL E YRRLL + N + GGL+
Sbjct: 1300 RTHSPSYPQASSSSSSHMLLLACPSGQLAALSPLAESTYRRLLSVTNQLHPAIVPHGGLH 1359
Query: 191 PRAFRTYKGKGYYAG-----NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDE 245
+A R G + R ++DG+++ ++ +L +R ++ + G ++ + D
Sbjct: 1360 SKAHRYPDQSSVAVGVETAASSGRALVDGTVLARWSELGAAKRTDVALRGG--YDSVADL 1417
Query: 246 LYDIEAL 252
D+E +
Sbjct: 1418 RDDLEGV 1424
>gi|307111604|gb|EFN59838.1| hypothetical protein CHLNCDRAFT_29381 [Chlorella variabilis]
Length = 1108
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 35/143 (24%), Positives = 59/143 (41%), Gaps = 11/143 (7%)
Query: 85 FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF----FKIRCKPSPISDA 140
+M ++ N+ +A RL +HLG+ VN F +R S +S
Sbjct: 940 YMGAENSYNLFTVRRNADAATDEERSRLETVGRYHLGEFVNRFQPGSLVMRLPDSELSQI 999
Query: 141 PGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGK 200
P + +++G +G LP Y+ L LQ M GG + +R + +
Sbjct: 1000 P------TVLFGTINGVIGVVASLPHAQYQLLESLQEAMRKVVKGVGGFDHAQWRAFSNQ 1053
Query: 201 GYYAGNPSRGIIDGSLVWKFLQL 223
+ P+R +DG L+ +FL L
Sbjct: 1054 -HMPATPARQFVDGDLIEQFLDL 1075
>gi|398391687|ref|XP_003849303.1| hypothetical protein MYCGRDRAFT_87400 [Zymoseptoria tritici IPO323]
gi|339469180|gb|EGP84279.1| hypothetical protein MYCGRDRAFT_87400 [Zymoseptoria tritici IPO323]
Length = 1143
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 40/154 (25%), Positives = 71/154 (46%), Gaps = 8/154 (5%)
Query: 85 FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGAR 144
++ SD + N+V+ + + RL ++ LG+ VN +I P+P +A
Sbjct: 979 YLESDAEGNLVVLEHDVKGFSEEDRRRLRVTSEILLGEMVNRIRRIDVSPTP--NATVIP 1036
Query: 145 SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYA 204
FL A+++G++ F + E L+ +QN M G + FR +K +
Sbjct: 1037 RAFL---ATVEGSIYLFALIAEGKQDLLIRMQNKMAEMVQSPGHVPFAKFRGFKTQVRDM 1093
Query: 205 GN--PSRGIIDGSLVWKFLQLSLGERLEICKKIG 236
G PSR +DG L+ +FL + E+ K++G
Sbjct: 1094 GEEGPSR-FVDGELIERFLDCDEDVQAEVAKELG 1126
>gi|406865227|gb|EKD18269.1| CPSF A subunit region [Marssonina brunnea f. sp. 'multigermtubi'
MB_m1]
Length = 1146
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 41/169 (24%), Positives = 76/169 (44%), Gaps = 8/169 (4%)
Query: 85 FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPIS-DAPGA 143
++ SD N+++ + RL + +LG+ VN KI PSP + P A
Sbjct: 984 WLESDHHGNLLVLQRNIDGITLEDKQRLRITGEMNLGEQVNMIRKIAIDPSPTAMVVPKA 1043
Query: 144 RSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYY 203
+ A+ +G++ F + + + LL LQ + G L+ + +R++K
Sbjct: 1044 ------FLATTEGSIYLFSTILDGSQDLLLRLQENITECVDTLGRLDFKTYRSFKSAERT 1097
Query: 204 AGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEAL 252
P R +DG L+ +FL S + +IC+ +G I D + +++ L
Sbjct: 1098 TEEPYR-FVDGELIERFLDESEDMQQQICEGLGYTVEAIRDVVENLKRL 1145
>gi|452979181|gb|EME78944.1| hypothetical protein MYCFIDRAFT_43692 [Pseudocercospora fijiensis
CIRAD86]
Length = 1149
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 41/172 (23%), Positives = 76/172 (44%), Gaps = 11/172 (6%)
Query: 73 ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC 132
AC V E++ ++ SD + N+++ + RL ++F LG+ VN R
Sbjct: 973 ACARVSENT---YLESDAEGNLIVLQHDINGFSQEDRRRLRVTSEFLLGEMVN-----RI 1024
Query: 133 KPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPR 192
+P + +PGA + A+ DG++ + + + L+ +Q +M GG+
Sbjct: 1025 RPITVQPSPGAVVTPQAFLATTDGSIYVYCEIGKPRQDLLMRMQTLMADMVKSPGGVRFA 1084
Query: 193 AFRTYKGKGYYAGN--PSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
FR +K G P R +DG L+ +FL + + E+ K + D+
Sbjct: 1085 KFRGFKTLVRDMGEEGPVR-FVDGELIERFLDMPEVLQNEVVKGLDGTGVDL 1135
>gi|18377609|gb|AAL66955.1| putative UV-damaged DNA binding factor [Arabidopsis thaliana]
Length = 270
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 33/123 (26%), Positives = 57/123 (46%), Gaps = 12/123 (9%)
Query: 117 DFHLGQHVNTF----FKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRL 172
++HLG+ VN F +R S I P + +++G +G LP++ Y L
Sbjct: 138 EYHLGEFVNRFRHGSLVMRLPDSEIGQIP------TVIFGTVNGVIGVIASLPQEQYTFL 191
Query: 173 LMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEIC 232
LQ+ + GGL+ +R++ + A +R +DG L+ FL LS + +I
Sbjct: 192 EKLQSSLRKVIKGVGGLSHEQWRSFNNEKRTA--EARNFLDGDLIESFLDLSRNKMEDIS 249
Query: 233 KKI 235
K +
Sbjct: 250 KSM 252
>gi|367018592|ref|XP_003658581.1| hypothetical protein MYCTH_2294503 [Myceliophthora thermophila ATCC
42464]
gi|347005848|gb|AEO53336.1| hypothetical protein MYCTH_2294503 [Myceliophthora thermophila ATCC
42464]
Length = 1547
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 32/104 (30%), Positives = 44/104 (42%), Gaps = 20/104 (19%)
Query: 152 ASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKG------------ 199
A+ G L LPE YRRL L + H GLNPR +R G
Sbjct: 1421 AAPTGVLAALRALPESAYRRLSSLAAQLAGSLPHAAGLNPRGYRLPDGVASSSSPWSSSS 1480
Query: 200 -------KGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 236
G AG R I+DG+L+ +F +L + R+E+ + G
Sbjct: 1481 SSFSAVVPGVDAGV-GRTIVDGALLQRFTELGMARRVELAGRAG 1523
>gi|388856288|emb|CCF50097.1| related to cleavage and polyadenylation specificity factor, 160 kDa
subunit [Ustilago hordei]
Length = 1568
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 42/146 (28%), Positives = 67/146 (45%), Gaps = 9/146 (6%)
Query: 84 GFMISDKDKNVV-LFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPG 142
GF +D V+ L+ Y P + S GG RL+ +T++ + S+ G
Sbjct: 1316 GFSGFGQDSGVIRLYEYNPSSPTSLGGQRLLLRTEYSTPSSTTCSLSAPGRWLSDSELRG 1375
Query: 143 ---ARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKG 199
R++ L + +G+L + EK +RL +LQ +V HT LNPRAFR +
Sbjct: 1376 REQLRNKLL--LSKSNGSLDSLASVEEKVAKRLHLLQGQLVRSVLHTAALNPRAFRQVRN 1433
Query: 200 KGYYAGNP-SRGIIDGSLVWKFLQLS 224
+ P +G++D L+ F LS
Sbjct: 1434 D--FVSRPLYKGVLDARLLDAFKGLS 1457
>gi|33411762|emb|CAD58786.1| cleavage and polyadenylation specificity factor 1 [Bos taurus]
Length = 880
Score = 48.5 bits (114), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/51 (50%), Positives = 34/51 (66%), Gaps = 4/51 (7%)
Query: 3 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGTLKLRFKKL 49
+V+E+L V+LG RP LLV ELLIY+AF H +G LK+RFKK+
Sbjct: 663 LVKEVLLVALGSRQRRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKV 713
>gi|322694449|gb|EFY86278.1| Cleavage factor two protein 1 [Metarhizium acridum CQMa 102]
Length = 1431
Score = 48.5 bits (114), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 39/185 (21%), Positives = 82/185 (44%), Gaps = 9/185 (4%)
Query: 74 CEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCK 133
+F+ + + + D + ++ + + PE +S GH L+ +T F + + T + +
Sbjct: 1241 ADFLPDGEDLSMVAVDAEGDMHILEFNPEHPKSLQGHLLLHRTSFAVTPNTPTSTLLLPR 1300
Query: 134 PSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRA 193
S + S + A G + PL E YRRLL + N + GL+ +A
Sbjct: 1301 THSPSYPHASSSSHMLLLACPSGQVAALSPLAESTYRRLLSVTNQLHPAIVAHCGLHTKA 1360
Query: 194 FRTYKGKGYY------AGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELY 247
R Y + A + R ++DG+++ ++ +L +R ++ + G ++ + D
Sbjct: 1361 HR-YPDQSCVAVGVETAASSGRALVDGTVLARWSELGAAKRTDVALRGG--YDSVADLRD 1417
Query: 248 DIEAL 252
D+E +
Sbjct: 1418 DLEGV 1422
>gi|145351726|ref|XP_001420218.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144580451|gb|ABO98511.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 1120
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 35/147 (23%), Positives = 62/147 (42%), Gaps = 10/147 (6%)
Query: 111 RLIKKTDFHLGQHVNTF----FKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPE 166
RL ++HLG+ VN F + K + P + + +G +G LP+
Sbjct: 979 RLEITGEYHLGEFVNVFSPGSLVMSLKDGDSLEVP------TLLFGTGNGVIGVLASLPK 1032
Query: 167 KNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLG 226
Y LQ M H GGL +R+++ +PSR +DG LV FL L +
Sbjct: 1033 DAYDFAERLQTSMNKHIQGVGGLKHAEWRSFRHTLRRKSDPSRNFVDGDLVESFLDLKVE 1092
Query: 227 ERLEICKKIGSKHNDILDELYDIEALS 253
+ + + +I+ + +++ L+
Sbjct: 1093 QADVVAADMKCDRAEIIRRVEELQRLT 1119
>gi|297809743|ref|XP_002872755.1| UV-damaged DNA-binding protein 1A [Arabidopsis lyrata subsp. lyrata]
gi|297318592|gb|EFH49014.1| UV-damaged DNA-binding protein 1A [Arabidopsis lyrata subsp. lyrata]
Length = 1088
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 38/155 (24%), Positives = 68/155 (43%), Gaps = 12/155 (7%)
Query: 85 FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF----FKIRCKPSPISDA 140
++ ++ + N+V E RL ++HLG+ VN F +R S I
Sbjct: 924 YLGAENNFNLVTVKKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSEIGQI 983
Query: 141 PGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGK 200
P + +++G +G LP++ Y L LQ+ + GGL+ +R++ +
Sbjct: 984 P------TVIFGTVNGVIGVIASLPQEQYTFLEKLQSSLRKVIKGVGGLSHEQWRSFNNE 1037
Query: 201 GYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI 235
A +R +DG L+ FL LS + +I K +
Sbjct: 1038 KRTA--EARNFLDGDLIESFLDLSRNKMEDISKSM 1070
>gi|67516629|ref|XP_658200.1| hypothetical protein AN0596.2 [Aspergillus nidulans FGSC A4]
gi|40747539|gb|EAA66695.1| hypothetical protein AN0596.2 [Aspergillus nidulans FGSC A4]
gi|259489136|tpe|CBF89158.1| TPA: damaged DNA binding protein (Eurofung) [Aspergillus nidulans
FGSC A4]
Length = 1132
Score = 48.1 bits (113), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 37/159 (23%), Positives = 69/159 (43%), Gaps = 8/159 (5%)
Query: 85 FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGAR 144
++ SD + N+++ E + RL + L + VN R +P I P A
Sbjct: 970 YLESDAEGNLIVLRRNRSGVEEDDRRRLEVTGEICLNEMVN-----RIRPVNIQQLPSAT 1024
Query: 145 SRFLTWYASLDGALGFFLPLPEKNYRRLLM-LQNVMVTHTSHTGGLNPRAFRTYKGKGYY 203
+ A+++G++ + + +Y+ LM LQ M + GG+ +R ++
Sbjct: 1025 VVPRAFLATVEGSI-YLYAIINPDYQDFLMRLQATMASRADSLGGIPFTDYRAFRTMTRQ 1083
Query: 204 AGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
A P R +DG L+ +FL + EI +GS ++
Sbjct: 1084 ATEPYR-FVDGELIERFLTCEPAVQKEIVDIVGSSLEEV 1121
>gi|326426696|gb|EGD72266.1| hypothetical protein PTSG_00286 [Salpingoeca sp. ATCC 50818]
Length = 1104
Score = 48.1 bits (113), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 41/161 (25%), Positives = 68/161 (42%), Gaps = 5/161 (3%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
V +C V E+ F+++ + L + Q E + L K + +LG+ V +F +
Sbjct: 917 VSSCALVNETV---FLVASHAHS--LSVCQREFEPGSTMQTLNAKFEIYLGETVTSFVRA 971
Query: 131 RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 190
+ D+ + ++ G L LPL L L+ M GGL+
Sbjct: 972 ALGSAAAVDSSMPLRNTFFVFGTMGGGLACLLPLTPPQTELLTALECRMEEKIGGLGGLD 1031
Query: 191 PRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEI 231
R FRT + + A + ++DG LV FLQL E+ E+
Sbjct: 1032 HREFRTARDEQRMAQQVNPRLVDGDLVETFLQLPEEEQKEL 1072
>gi|110741229|dbj|BAF02165.1| UV-damaged DNA binding factor - like protein [Arabidopsis thaliana]
Length = 727
Score = 48.1 bits (113), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 35/129 (27%), Positives = 59/129 (45%), Gaps = 12/129 (9%)
Query: 111 RLIKKTDFHLGQHVNTF----FKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPE 166
RL ++HLG+ VN F +R S I P + +++G +G LP+
Sbjct: 589 RLEVVGEYHLGEFVNRFRHGSLVMRLPDSEIGQIP------TVIFGTVNGVIGVIASLPQ 642
Query: 167 KNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLG 226
+ Y L LQ+ + GGL+ +R++ + A +R +DG L+ FL LS
Sbjct: 643 EQYTFLEKLQSSLRKVIKGVGGLSHEQWRSFNNEKRTA--EARNFLDGDLIESFLDLSRN 700
Query: 227 ERLEICKKI 235
+ +I K +
Sbjct: 701 KMEDISKSM 709
>gi|448528339|ref|XP_003869702.1| hypothetical protein CORT_0D07360 [Candida orthopsilosis Co 90-125]
gi|380354055|emb|CCG23569.1| hypothetical protein CORT_0D07360 [Candida orthopsilosis]
Length = 1170
Score = 47.8 bits (112), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 30/104 (28%), Positives = 53/104 (50%), Gaps = 4/104 (3%)
Query: 151 YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
Y L G +G LPL K+ + +L ++ + +++ +N K + YY NP++
Sbjct: 1070 YTGLTGTIGILLPLISKS--EIELLHDLQLEISAYNDKVNVAGKNHAKLRSYY--NPAKN 1125
Query: 211 IIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSS 254
I DG + +L L L E+L+I K++ ++ +L DI SS
Sbjct: 1126 IFDGDFLELYLNLPLDEKLKIAKRLNKSVGEVEKKLNDIRNRSS 1169
>gi|62318656|dbj|BAD95136.1| UV-damaged DNA-binding protein- like [Arabidopsis thaliana]
Length = 1088
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 33/121 (27%), Positives = 56/121 (46%), Gaps = 12/121 (9%)
Query: 117 DFHLGQHVNTF----FKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRL 172
++H+G+ VN F ++ S I P + ++ G +G LP++ Y L
Sbjct: 956 EYHIGEFVNRFRHGSLVMKLPDSDIGQIP------TVIFGTVSGMIGVIASLPQEQYAFL 1009
Query: 173 LMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEIC 232
LQ + GGL+ +R++ + A ++G +DG L+ FL LS G+ EI
Sbjct: 1010 EKLQTSLRKVIKGVGGLSHEQWRSFNNEKRTA--EAKGYLDGDLIESFLDLSRGKMEEIS 1067
Query: 233 K 233
K
Sbjct: 1068 K 1068
>gi|15233515|ref|NP_193842.1| DNA damage-binding protein 1b [Arabidopsis thaliana]
gi|73620956|sp|O49552.2|DDB1B_ARATH RecName: Full=DNA damage-binding protein 1b; AltName: Full=UV-damaged
DNA-binding protein 1b; Short=DDB1b
gi|110739453|dbj|BAF01636.1| UV-damaged DNA-binding protein- like [Arabidopsis thaliana]
gi|332659001|gb|AEE84401.1| DNA damage-binding protein 1b [Arabidopsis thaliana]
Length = 1088
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 33/121 (27%), Positives = 56/121 (46%), Gaps = 12/121 (9%)
Query: 117 DFHLGQHVNTF----FKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRL 172
++H+G+ VN F ++ S I P + ++ G +G LP++ Y L
Sbjct: 956 EYHIGEFVNRFRHGSLVMKLPDSDIGQIP------TVIFGTVSGMIGVIASLPQEQYAFL 1009
Query: 173 LMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEIC 232
LQ + GGL+ +R++ + A ++G +DG L+ FL LS G+ EI
Sbjct: 1010 EKLQTSLRKVIKGVGGLSHEQWRSFNNEKRTA--EAKGYLDGDLIESFLDLSRGKMEEIS 1067
Query: 233 K 233
K
Sbjct: 1068 K 1068
>gi|186511557|ref|NP_001118940.1| DNA damage-binding protein 1a [Arabidopsis thaliana]
gi|332657118|gb|AEE82518.1| DNA damage-binding protein 1a [Arabidopsis thaliana]
Length = 1067
Score = 47.4 bits (111), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 36/146 (24%), Positives = 66/146 (45%), Gaps = 12/146 (8%)
Query: 111 RLIKKTDFHLGQHVNTF----FKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPE 166
RL ++HLG+ VN F +R S I P + +++G +G LP+
Sbjct: 929 RLEVVGEYHLGEFVNRFRHGSLVMRLPDSEIGQIP------TVIFGTVNGVIGVIASLPQ 982
Query: 167 KNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLG 226
+ Y L LQ+ + GGL+ +R++ + A +R +DG L+ FL LS
Sbjct: 983 EQYTFLEKLQSSLRKVIKGVGGLSHEQWRSFNNEKRTA--EARNFLDGDLIESFLDLSRN 1040
Query: 227 ERLEICKKIGSKHNDILDELYDIEAL 252
+ +I K + + ++ + ++ L
Sbjct: 1041 KMEDISKSMNVQVEELCKRVEELTRL 1066
>gi|2911067|emb|CAA17529.1| UV-damaged DNA-binding protein-like [Arabidopsis thaliana]
gi|7268907|emb|CAB79110.1| UV-damaged DNA-binding protein-like [Arabidopsis thaliana]
Length = 1102
Score = 47.4 bits (111), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 33/121 (27%), Positives = 56/121 (46%), Gaps = 12/121 (9%)
Query: 117 DFHLGQHVNTF----FKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRL 172
++H+G+ VN F ++ S I P + ++ G +G LP++ Y L
Sbjct: 970 EYHIGEFVNRFRHGSLVMKLPDSDIGQIP------TVIFGTVSGMIGVIASLPQEQYAFL 1023
Query: 173 LMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEIC 232
LQ + GGL+ +R++ + A ++G +DG L+ FL LS G+ EI
Sbjct: 1024 EKLQTSLRKVIKGVGGLSHEQWRSFNNEKRTA--EAKGYLDGDLIESFLDLSRGKMEEIS 1081
Query: 233 K 233
K
Sbjct: 1082 K 1082
>gi|312283457|dbj|BAJ34594.1| unnamed protein product [Thellungiella halophila]
Length = 1088
Score = 47.4 bits (111), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 35/129 (27%), Positives = 59/129 (45%), Gaps = 12/129 (9%)
Query: 111 RLIKKTDFHLGQHVNTF----FKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPE 166
RL ++HLG+ VN F +R S I P + +++G +G LP+
Sbjct: 950 RLEVVGEYHLGEFVNRFRHGSLVMRLPDSEIGQIP------TVIFGTVNGVIGVIASLPQ 1003
Query: 167 KNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLG 226
+ Y L LQ+ + GGL+ +R++ + A +R +DG L+ FL LS
Sbjct: 1004 EQYMFLEKLQSSLRKVIKGVGGLSHEQWRSFNNEKRTA--EARNFLDGDLIESFLDLSRN 1061
Query: 227 ERLEICKKI 235
+ +I K +
Sbjct: 1062 KMEDISKSM 1070
>gi|15235577|ref|NP_192451.1| DNA damage-binding protein 1a [Arabidopsis thaliana]
gi|55976605|sp|Q9M0V3.1|DDB1A_ARATH RecName: Full=DNA damage-binding protein 1a; AltName: Full=UV-damaged
DNA-binding protein 1a; Short=DDB1a
gi|7267302|emb|CAB81084.1| UV-damaged DNA binding factor-like protein [Arabidopsis thaliana]
gi|25054828|gb|AAN71904.1| putative UV-damaged DNA binding factor [Arabidopsis thaliana]
gi|332657117|gb|AEE82517.1| DNA damage-binding protein 1a [Arabidopsis thaliana]
Length = 1088
Score = 47.4 bits (111), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 35/129 (27%), Positives = 59/129 (45%), Gaps = 12/129 (9%)
Query: 111 RLIKKTDFHLGQHVNTF----FKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPE 166
RL ++HLG+ VN F +R S I P + +++G +G LP+
Sbjct: 950 RLEVVGEYHLGEFVNRFRHGSLVMRLPDSEIGQIP------TVIFGTVNGVIGVIASLPQ 1003
Query: 167 KNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLG 226
+ Y L LQ+ + GGL+ +R++ + A +R +DG L+ FL LS
Sbjct: 1004 EQYTFLEKLQSSLRKVIKGVGGLSHEQWRSFNNEKRTA--EARNFLDGDLIESFLDLSRN 1061
Query: 227 ERLEICKKI 235
+ +I K +
Sbjct: 1062 KMEDISKSM 1070
>gi|389629928|ref|XP_003712617.1| hypothetical protein MGG_16867 [Magnaporthe oryzae 70-15]
gi|351644949|gb|EHA52810.1| hypothetical protein MGG_16867 [Magnaporthe oryzae 70-15]
gi|440464739|gb|ELQ34110.1| DNA damage-binding protein 1a [Magnaporthe oryzae Y34]
Length = 1183
Score = 47.4 bits (111), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 36/159 (22%), Positives = 71/159 (44%), Gaps = 7/159 (4%)
Query: 85 FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGAR 144
++++D D N+V+ + R+ ++F LG+ VN K+ + S ++AP
Sbjct: 1020 WIVADGDGNLVVLLRNTAGVTLEDKRRMQMTSEFGLGECVNKIQKVMVETS--ANAPIVA 1077
Query: 145 SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSH-TGGLNPRAFRTYKGKGYY 203
FL+ + +G++ F + K L+ Q M H S G L +R+++
Sbjct: 1078 KAFLS---TTEGSIYLFGTVAPKFQSLLMDFQANMEAHVSSPLGELQFNQWRSFRNPERE 1134
Query: 204 AGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
P R +DG + FL + +++IC+ + D+
Sbjct: 1135 GAGPER-FLDGEFLEMFLDMEENTQIDICQGLSYTAEDM 1172
>gi|340381612|ref|XP_003389315.1| PREDICTED: DNA damage-binding protein 1-like [Amphimedon
queenslandica]
Length = 1142
Score = 47.4 bits (111), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 37/143 (25%), Positives = 63/143 (44%), Gaps = 13/143 (9%)
Query: 112 LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARSRFL-----TWYASLDGALGFFLPLPE 166
+++ + + G +VN F + S + D PGA + L + ++ GA+G L
Sbjct: 992 MVQPSRMYFGDNVNVF----SRGSFVMDHPGAGASSLLQGKPILFGTVHGAIGLIGTLNM 1047
Query: 167 KNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLG 226
Y L LQ M + G + +R++ + + P G IDG LV KFL+L
Sbjct: 1048 DTYTLLSKLQQKMAANIKSVGNIEHEIYRSFSNE--HRSKPFAGFIDGDLVEKFLELPRP 1105
Query: 227 ERLEICKKIGSKHNDILDELYDI 249
+ +I + G K D+ D+
Sbjct: 1106 QMSQIVQ--GIKTTDVTGTEVDV 1126
>gi|440487047|gb|ELQ66855.1| DNA damage-binding protein 1a [Magnaporthe oryzae P131]
Length = 1213
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 36/159 (22%), Positives = 71/159 (44%), Gaps = 7/159 (4%)
Query: 85 FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGAR 144
++++D D N+V+ + R+ ++F LG+ VN K+ + S ++AP
Sbjct: 1050 WIVADGDGNLVVLLRNTAGVTLEDKRRMQMTSEFGLGECVNKIQKVMVETS--ANAPIVA 1107
Query: 145 SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSH-TGGLNPRAFRTYKGKGYY 203
FL+ + +G++ F + K L+ Q M H S G L +R+++
Sbjct: 1108 KAFLS---TTEGSIYLFGTVAPKFQSLLMDFQANMEAHVSSPLGELQFNQWRSFRNPERE 1164
Query: 204 AGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
P R +DG + FL + +++IC+ + D+
Sbjct: 1165 GAGPER-FLDGEFLEMFLDMEENTQIDICQGLSYTAEDM 1202
>gi|407923753|gb|EKG16818.1| Cleavage/polyadenylation specificity factor A subunit [Macrophomina
phaseolina MS6]
Length = 1129
Score = 46.6 bits (109), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 35/151 (23%), Positives = 68/151 (45%), Gaps = 6/151 (3%)
Query: 85 FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGAR 144
++ +D + N++L RL+ + D LG+ VN +I + +SDAP
Sbjct: 965 YLQADAEGNLILLERDVSGVTEEDRKRLMLRGDMLLGEQVNRIRRIDM--ATVSDAPVIP 1022
Query: 145 SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYA 204
F +A+++G++ F + L+ LQ+ + G +R ++ +
Sbjct: 1023 RAF---FATVEGSIYLFALIAPAKVDLLIRLQSQLADFVRSPGHYPFLRYRAFRNQVREE 1079
Query: 205 GNPSRGIIDGSLVWKFLQLSLGERLEICKKI 235
P+R +DG L+ +FL L E+ E+ K +
Sbjct: 1080 DEPNR-FVDGDLIERFLDLKPREQEEVVKGV 1109
>gi|443707495|gb|ELU03057.1| hypothetical protein CAPTEDRAFT_148808 [Capitella teleta]
Length = 1084
Score = 46.6 bits (109), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 40/146 (27%), Positives = 68/146 (46%), Gaps = 16/146 (10%)
Query: 118 FHLGQHVNTFFKIRCKPSPISDAPG---ARSRFLTWYASLDGALGFFLPLPEKNYRRLLM 174
+HLG+ VN F S + PG + ++ + +++GALG LP++ Y LL
Sbjct: 942 YHLGEFVNVFRH----GSLVMQHPGECTSPTQGSVLFGTVNGALGLVTQLPQEFYLFLLE 997
Query: 175 LQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKK 234
+QN + G + +R++ + P+ G IDG L+ FL LS + E+ +
Sbjct: 998 VQNKLAKTIKSVGKVEHAFWRSFHTE--RKTEPATGFIDGDLIESFLDLSRDKMQEVVQG 1055
Query: 235 I------GSKHNDILDELYD-IEALS 253
+ G K +D+L IE L+
Sbjct: 1056 LQMDDGSGMKREAAVDDLVKMIEELT 1081
>gi|154286506|ref|XP_001544048.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
gi|150407689|gb|EDN03230.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
Length = 1158
Score = 46.6 bits (109), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 39/154 (25%), Positives = 70/154 (45%), Gaps = 10/154 (6%)
Query: 85 FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGAR 144
++ SD + N+V+ + RL ++ LG+ VN R +P I + GA
Sbjct: 988 WLESDAEGNLVMLHRNVNGVTDDDRRRLEVTSEILLGEMVN-----RIRPVNIQGSQGAE 1042
Query: 145 SRFL--TWYASLDGALGFFLPLPEKNYRRLLM-LQNVMVTHTSHTGGLNPRAFRTYKGKG 201
+ + +++G++ + + Y+ LLM LQ+ M GG+ FR ++
Sbjct: 1043 AAISPRAFLGTVEGSI-YLFGIINPTYQDLLMRLQSAMAGMVVTPGGMPFNKFRAFRNTI 1101
Query: 202 YYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI 235
A P R +DG L+ +FL S+ + EI K+
Sbjct: 1102 RQAEEPYR-FVDGELIERFLSCSVELQEEIVGKV 1134
>gi|303271531|ref|XP_003055127.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226463101|gb|EEH60379.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 1223
Score = 46.6 bits (109), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 25/86 (29%), Positives = 42/86 (48%)
Query: 151 YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
+ +++GA+G LPE + L LQ M S GG + A+R++ + +RG
Sbjct: 1120 FGTVNGAIGVVASLPESTHTFLAALQKAMNKVVSGVGGFSHDAWRSFHNEHRSRLVEARG 1179
Query: 211 IIDGSLVWKFLQLSLGERLEICKKIG 236
+DG L+ FL L + E+ +G
Sbjct: 1180 FVDGDLIESFLDLRPEKASEVASVVG 1205
>gi|297799958|ref|XP_002867863.1| hypothetical protein ARALYDRAFT_492777 [Arabidopsis lyrata subsp.
lyrata]
gi|297313699|gb|EFH44122.1| hypothetical protein ARALYDRAFT_492777 [Arabidopsis lyrata subsp.
lyrata]
Length = 1088
Score = 46.6 bits (109), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 33/121 (27%), Positives = 55/121 (45%), Gaps = 12/121 (9%)
Query: 117 DFHLGQHVNTF----FKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRL 172
++H+G+ VN F +R S I P + ++ G +G LP++ Y L
Sbjct: 956 EYHIGEFVNRFRHGSLVMRLPDSEIGQIP------TVIFGTVSGMIGVIASLPQEQYAFL 1009
Query: 173 LMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEIC 232
LQ + GGL+ +R++ + A ++ +DG L+ FL LS G+ EI
Sbjct: 1010 EKLQTSLRKVIKGVGGLSHEQWRSFNNEKRTA--EAKSYLDGDLIESFLDLSRGKMEEIS 1067
Query: 233 K 233
K
Sbjct: 1068 K 1068
>gi|402083318|gb|EJT78336.1| hypothetical protein GGTG_03437 [Gaeumannomyces graminis var. tritici
R3-111a-1]
Length = 1155
Score = 46.6 bits (109), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 34/158 (21%), Positives = 73/158 (46%), Gaps = 6/158 (3%)
Query: 85 FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGAR 144
++ +D D NV++ R+ ++ +LG+++N KI + P +AP
Sbjct: 993 WLEADADGNVMVLGRNTTGVTLEDRRRMEITSEINLGENINRIQKISVETGP--NAPIHP 1050
Query: 145 SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYA 204
FL+ + +G++ + + LL LQ+ + + G + + FR+++ A
Sbjct: 1051 KAFLS---TTEGSIYLVGAIAPQMRDLLLNLQDRLEDYVGTLGNIPFKNFRSFRNAEREA 1107
Query: 205 GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
P R IDG + +FL ++ + ++C+ +G D+
Sbjct: 1108 DGPVR-FIDGEYIERFLDMNEETQSQVCRDLGPSVEDM 1144
>gi|91087281|ref|XP_975549.1| PREDICTED: similar to conserved hypothetical protein [Tribolium
castaneum]
gi|270010588|gb|EFA07036.1| hypothetical protein TcasGA2_TC010010 [Tribolium castaneum]
Length = 1149
Score = 46.2 bits (108), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 38/138 (27%), Positives = 60/138 (43%), Gaps = 15/138 (10%)
Query: 118 FHLGQHVNTFFKIRCKPSPISDAPGARSRFLT---WYASLDGALGFFLPLPEKNYRRLLM 174
FH+G +N F S + G S T + ++ GA+G + + Y LL
Sbjct: 1007 FHVGDMINVFRH----GSLVMQNLGETSTPTTGCVLFGTVSGAIGLVTQITQDFYDFLLE 1062
Query: 175 LQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLS------LGER 228
LQN + T G ++ +R + PS G IDG L+ FL LS + +
Sbjct: 1063 LQNKLSTVIKSVGKIDHSQWRAFNTD--IKTEPSEGFIDGDLIESFLDLSHDKMKEVADG 1120
Query: 229 LEICKKIGSKHNDILDEL 246
L+I + G K + +D+L
Sbjct: 1121 LQITGEGGMKQDCTVDDL 1138
>gi|300176205|emb|CBK23516.2| unnamed protein product [Blastocystis hominis]
Length = 702
Score = 46.2 bits (108), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 48/198 (24%), Positives = 82/198 (41%), Gaps = 23/198 (11%)
Query: 69 RGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEAR--ESNGGHRLIKKTDFHLGQHVNT 126
R V A E + E+ S ++ D N+++ EA SN ++ K FHL +N
Sbjct: 484 RDVTALERLPEAQSE-MVVGDAYGNLMVMQVVEEADLDRSNPQKIVVTKESFHLDDQINR 542
Query: 127 FFKIRCKPSPISD------APGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMV 180
F ++ S D A + F +A++ G +G L ++ +R L ++ M
Sbjct: 543 FVPVQLFRSGAEDKKKEKRAEESEIAFNLAFATVSGRIGMIGALNDREFRMLRAIETAME 602
Query: 181 THTSHTGGLNPRAFRTYKGKGYYAGNPSRGI------IDGSLVWKFLQLSLGERLEICKK 234
+ GGL+ + +R N GI IDG LV FL+L + +I
Sbjct: 603 NVITPVGGLDHKQWR--------CSNTPFGIKNLAYCIDGDLVEMFLELDDESQAKIADS 654
Query: 235 IGSKHNDILDELYDIEAL 252
+ ++ L + I+ L
Sbjct: 655 VSTELRSALSPQFLIDYL 672
>gi|301121252|ref|XP_002908353.1| DNA damage-binding protein, putative [Phytophthora infestans T30-4]
gi|262103384|gb|EEY61436.1| DNA damage-binding protein, putative [Phytophthora infestans T30-4]
Length = 1150
Score = 46.2 bits (108), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 40/178 (22%), Positives = 74/178 (41%), Gaps = 28/178 (15%)
Query: 85 FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF----------------- 127
++ S+ D N+ A RL +FHLG+ VN F
Sbjct: 950 YIGSETDFNLFTVQRNSGAASDEERGRLETVGEFHLGEFVNRFRYGSLVMQNSSSTSQTP 1009
Query: 128 -FKIRCKPSPISD-------APGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVM 179
+ P+ + D AP +++ + + ++ G +G LP+ + Y LL +Q +
Sbjct: 1010 SGVVSTGPTAMVDVGESAPAAPVVQNQSM-LFGTVSGMIGVILPISKDQYSFLLRVQQAL 1068
Query: 180 VTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGS 237
GG + + +RT++ + + + +R IDG LV FL L + ++ K+ S
Sbjct: 1069 THVVKGVGGFSHKDWRTFENR--RSVSEARNFIDGDLVESFLDLPKPQMTKVVDKLNS 1124
>gi|380488197|emb|CCF37544.1| hypothetical protein CH063_08850 [Colletotrichum higginsianum]
Length = 271
Score = 46.2 bits (108), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 42/162 (25%), Positives = 79/162 (48%), Gaps = 18/162 (11%)
Query: 85 FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIR-CKPSPISDAPGA 143
++ +D N+++ P+A + ++ ++FHLG+ VN KIR +P + P
Sbjct: 101 WLEADAQGNLMVLRRNPDAPTEHDQKQMEVTSEFHLGEQVN---KIRPLDITPNENDPIV 157
Query: 144 RSRFLTWYASLDGALGFFLPLPEKNYRRLLM-----LQNVMVT----HTSHTGGLNPRAF 194
FL A+++G+L F + + Y+ LL+ L +V+ T T GL+ A+
Sbjct: 158 PKAFL---ATVEGSLYVFADI-KSEYQSLLLQFQERLADVVKTLGQAGGDSTSGLSFMAW 213
Query: 195 RTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 236
R ++ A P R +DG L+ +FL L ++ + + +G
Sbjct: 214 RGFRNAKRAADGPFR-FVDGELIERFLDLDEAKQEAVVQGLG 254
>gi|237839083|ref|XP_002368839.1| hypothetical protein TGME49_067710 [Toxoplasma gondii ME49]
gi|211966503|gb|EEB01699.1| hypothetical protein TGME49_067710 [Toxoplasma gondii ME49]
Length = 2136
Score = 46.2 bits (108), Expect = 0.015, Method: Composition-based stats.
Identities = 27/74 (36%), Positives = 43/74 (58%), Gaps = 3/74 (4%)
Query: 151 YASLDGALGFFLPLP-EKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSR 209
+AS +GA+G L +P E+ + RL +LQ+ + T G L+ AF + K A PS+
Sbjct: 2051 WASSEGAIGHLLQIPDEQTFARLAVLQDAVTKVTKSIGKLSAVAFHSVKVGT--ATVPSK 2108
Query: 210 GIIDGSLVWKFLQL 223
G IDG ++ +FL+
Sbjct: 2109 GFIDGDILERFLEF 2122
>gi|221502136|gb|EEE27880.1| conserved hypothetical protein [Toxoplasma gondii VEG]
Length = 2131
Score = 45.8 bits (107), Expect = 0.015, Method: Composition-based stats.
Identities = 27/74 (36%), Positives = 43/74 (58%), Gaps = 3/74 (4%)
Query: 151 YASLDGALGFFLPLP-EKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSR 209
+AS +GA+G L +P E+ + RL +LQ+ + T G L+ AF + K A PS+
Sbjct: 2046 WASSEGAIGHLLQIPDEQTFARLAVLQDAVTKVTKSIGKLSAVAFHSVKVGT--ATVPSK 2103
Query: 210 GIIDGSLVWKFLQL 223
G IDG ++ +FL+
Sbjct: 2104 GFIDGDILERFLEF 2117
>gi|302894051|ref|XP_003045906.1| predicted protein [Nectria haematococca mpVI 77-13-4]
gi|256726833|gb|EEU40193.1| predicted protein [Nectria haematococca mpVI 77-13-4]
Length = 1162
Score = 45.8 bits (107), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 33/158 (20%), Positives = 72/158 (45%), Gaps = 6/158 (3%)
Query: 85 FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGAR 144
++ +D N+++ +A RL ++ +G+ +N +IR P D
Sbjct: 1000 WLEADSQGNLIVLQRNADAPTEQDRSRLEVTSEIGIGEQIN---RIRKLHVPAGDNSIVH 1056
Query: 145 SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYA 204
R + AS +G+L + + + L+ Q+ M + G + + +R+++ + +
Sbjct: 1057 PR--AFLASAEGSLYLYGDIAPQYQDLLMTFQSKMEEYIHAPGNIEFKLWRSFRNENRES 1114
Query: 205 GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
P R IDG +V +FL + G++ +C+ +G D+
Sbjct: 1115 DGPYR-FIDGEMVERFLDMDEGKQELVCEGLGPSVEDM 1151
>gi|156084934|ref|XP_001609950.1| splicing factor 3b, subunit 3, 130kD [Babesia bovis T2Bo]
gi|154797202|gb|EDO06382.1| splicing factor 3b, subunit 3, 130kD, putative [Babesia bovis]
Length = 1169
Score = 45.8 bits (107), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 46/214 (21%), Positives = 94/214 (43%), Gaps = 25/214 (11%)
Query: 40 GTLKLRFKKLKVLFVSDRSKRANEQPGL--PRGVYACEFVIESSSMGFMISDKDKNVVLF 97
G ++ + L++ F S+ + G PR + +C + S+ + DK +F
Sbjct: 974 GDIREGIQILRIKFYSEEAAEFEWVGGATGPRWLTSCAQLDYSTVIA-----GDKFDSIF 1028
Query: 98 MYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARSRFLTWYASLDGA 157
+ + ES +L FHLG P+ + A ++S + Y ++ G+
Sbjct: 1029 VTRVPQEESTRHIQLENVCQFHLGD----------LPTAMDKAALSQSTHVVLYGTVMGS 1078
Query: 158 LGFFLPLPEKNYRRLLM-LQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSL 216
+G +P K+ L L+ +M T G +R+Y Y P + ++DG L
Sbjct: 1079 IGALVPFQSKDELDFLQHLEMLMATEAPPLCGREHSFYRSY-----YV--PVQQVVDGDL 1131
Query: 217 VWKFLQLSLGERLEICKKIGSKHNDILDELYDIE 250
+F L+ ++ ++ +++ + N++L +L DI+
Sbjct: 1132 CEQFRHLTEAQQRKVAQQLDTTVNNVLRKLDDIK 1165
>gi|345570887|gb|EGX53705.1| hypothetical protein AOL_s00006g33 [Arthrobotrys oligospora ATCC
24927]
Length = 1133
Score = 45.8 bits (107), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 45/155 (29%), Positives = 66/155 (42%), Gaps = 11/155 (7%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
V A E + E + + +D D N+VL + R+ ++ LG+ VN I
Sbjct: 952 VTALEALDEDTCIS---ADSDGNLVLLRRESTGATDEDTRRMRPLSEIRLGEMVNC---I 1005
Query: 131 RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLM-LQNVMVTHTSHTGGL 189
R PI+ + + + ++DG L F L L +Y +LM Q M G L
Sbjct: 1006 RRVNDPITQGYVVQPK--AYLGTVDGGL-FMLGLIHPDYFDILMKCQVNMAKVIKGIGDL 1062
Query: 190 NPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLS 224
+ +R Y KG P R +DG LV KFL L
Sbjct: 1063 DFNRYRAYNTKGIQPEEPFR-FVDGELVEKFLDLD 1096
>gi|330792580|ref|XP_003284366.1| hypothetical protein DICPUDRAFT_86223 [Dictyostelium purpureum]
gi|325085712|gb|EGC39114.1| hypothetical protein DICPUDRAFT_86223 [Dictyostelium purpureum]
Length = 1064
Score = 45.8 bits (107), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 37/140 (26%), Positives = 63/140 (45%), Gaps = 10/140 (7%)
Query: 118 FHLGQHVNTFFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQN 177
+H+G+ +N+ P SDAP + YAS++G++G + +++Y LQ
Sbjct: 933 YHIGESINSMLCGSLVRLPDSDAPPIPT---ILYASVNGSIGVIASISKEDYEFFSKLQK 989
Query: 178 VMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGS 237
+ + GG ++R + + SR IDG L+ F L ++E K+
Sbjct: 990 GLNRVVNGIGGFTHESWRAFSNDHHTV--ESRNFIDGDLIEMFPDL----KIESMAKVIQ 1043
Query: 238 KHNDILDE-LYDIEALSSHF 256
N LDE L IE+L +
Sbjct: 1044 DMNVTLDETLKRIESLMQYI 1063
>gi|193644722|ref|XP_001942922.1| PREDICTED: DNA damage-binding protein 1-like [Acyrthosiphon pisum]
Length = 1156
Score = 45.8 bits (107), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 39/173 (22%), Positives = 69/173 (39%), Gaps = 6/173 (3%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P+ A E + + +G ++ DKN+ + H+L + FH G +N F
Sbjct: 962 PKWSTAIEIIDDDVFLG---AENDKNLFIIHKDSTLTSDEARHQLQEIGQFHCGDLINVF 1018
Query: 128 FKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 187
+D + + Y + GALG L K + L L+ + T G
Sbjct: 1019 RHGSLVMQHFTDTYVSVQGGI-LYGTCSGALGLVTQLTPKMFDFLSDLEKSLATVVKGVG 1077
Query: 188 GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHN 240
+N + +R+Y + PS +DG L+ FL LS E + + + ++
Sbjct: 1078 KINHQFWRSYHTE--IRTEPSESFVDGDLIESFLDLSKREMIAVVDALQGAYD 1128
>gi|340521192|gb|EGR51427.1| predicted protein [Trichoderma reesei QM6a]
Length = 1161
Score = 45.8 bits (107), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 32/158 (20%), Positives = 72/158 (45%), Gaps = 6/158 (3%)
Query: 85 FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGAR 144
++ +D NV++ + EA +L ++ ++G+ +N K++ P +A
Sbjct: 999 WLEADAQGNVIVLRQRQEAPTEQDRSQLEITSELNIGEQINRIRKLQVAPG--ENAVVVP 1056
Query: 145 SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYA 204
FL S++G L + + K L+ Q+ + + G L+ +R ++ +
Sbjct: 1057 KAFL---GSIEGTLYLYGDIAPKYQDLLMTFQSRLQGYIQTPGNLSFDLWRAFRNQAREG 1113
Query: 205 GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
+P R +DG ++ +FL L ++ +C+ +G D+
Sbjct: 1114 ESPYR-FVDGEMIERFLDLDESQQELVCEGLGPNVEDM 1150
>gi|168066745|ref|XP_001785293.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162663100|gb|EDQ49884.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1090
Score = 45.8 bits (107), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 34/129 (26%), Positives = 56/129 (43%), Gaps = 12/129 (9%)
Query: 111 RLIKKTDFHLGQHVNTF----FKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPE 166
RL ++HLG+ VN F +R S S P + +++G +G LP+
Sbjct: 952 RLEVVGEYHLGEFVNRFRHGSLVMRLPDSEASQIP------TVIFGTVNGVIGVIASLPQ 1005
Query: 167 KNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLG 226
+ L LQ +V GGL+ +R++ + +R +DG L+ FL LS
Sbjct: 1006 DQFLFLQKLQQALVKVIKGVGGLSHEQWRSFSNERKTV--DARNFLDGDLIESFLDLSRN 1063
Query: 227 ERLEICKKI 235
+ EI +
Sbjct: 1064 KMEEIATSL 1072
>gi|328874742|gb|EGG23107.1| UV-damaged DNA binding protein1 [Dictyostelium fasciculatum]
Length = 1116
Score = 45.4 bits (106), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 41/190 (21%), Positives = 77/190 (40%), Gaps = 7/190 (3%)
Query: 46 FKKLKVLFVSDRSKRANEQPGLPRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARE 105
K + +L S + + + P + F+ + + +G SD N++L E
Sbjct: 915 MKSMALLQASGKDVKELSRSSQPFWLTGLTFIDDDTYLG---SDNSYNLILMKKNTETAN 971
Query: 106 SNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLP 165
L H G+ +N F D+P S +A++ G +G +
Sbjct: 972 EVDSQLLDNIGHIHTGEFINRFHHGTLATLTDVDSPKPNSII---FATISGCIGVISTIS 1028
Query: 166 EKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSL 225
+++Y LQ + GG + +R+++ + + + SR IDG LV +FL L
Sbjct: 1029 KQDYDFFSKLQVGLNRVIRGIGGFSHDRWRSFQNE-HISNIESRNFIDGDLVEQFLHLRH 1087
Query: 226 GERLEICKKI 235
+ LE+ K +
Sbjct: 1088 DKMLEVTKDM 1097
>gi|169611218|ref|XP_001799027.1| hypothetical protein SNOG_08717 [Phaeosphaeria nodorum SN15]
gi|160702249|gb|EAT83885.2| hypothetical protein SNOG_08717 [Phaeosphaeria nodorum SN15]
Length = 1140
Score = 45.4 bits (106), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 38/167 (22%), Positives = 76/167 (45%), Gaps = 8/167 (4%)
Query: 77 VIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSP 136
V E ++ SD + N+++ P+ + RL ++ LG+ VN +I
Sbjct: 970 VAEVDENTYLESDAEGNLLVLYRDPKGVTDDDKRRLNVSSEMLLGEMVNRIRRI-----D 1024
Query: 137 ISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLM-LQNVMVTHTSHTGGLNPRAFR 195
++ AP A + +++G++ + L +NY LL+ LQ+ + G ++ FR
Sbjct: 1025 VATAPDAVVVPRAFMGTVEGSI-YLFALISQNYLDLLITLQSNLGNLVVSPGNMDFAKFR 1083
Query: 196 TYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
+K + P+R +DG L+ +FL + + + +G + DI
Sbjct: 1084 AFKNQVRTEEEPNR-FVDGELIERFLDCEEDVQRKAIEGLGVELEDI 1129
>gi|255571318|ref|XP_002526608.1| DNA repair protein xp-E, putative [Ricinus communis]
gi|223534048|gb|EEF35767.1| DNA repair protein xp-E, putative [Ricinus communis]
Length = 1033
Score = 45.4 bits (106), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 35/130 (26%), Positives = 57/130 (43%), Gaps = 12/130 (9%)
Query: 111 RLIKKTDFHLGQHVNTF----FKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPE 166
RL ++HLG+ VN F +R S + P + +++G +G LP
Sbjct: 895 RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIP------TVIFGTVNGVIGVIASLPH 948
Query: 167 KNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLG 226
+ Y L LQ+ + GGL+ +R++ + ++ +DG L+ FL LS
Sbjct: 949 EQYIFLEKLQSNLRRVIKGVGGLSHEQWRSFNNEKKTV--EAKNFLDGDLIESFLDLSRN 1006
Query: 227 ERLEICKKIG 236
EI K IG
Sbjct: 1007 RMDEISKAIG 1016
>gi|358400469|gb|EHK49795.1| hypothetical protein TRIATDRAFT_146031 [Trichoderma atroviride IMI
206040]
Length = 1161
Score = 45.4 bits (106), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 33/160 (20%), Positives = 72/160 (45%), Gaps = 10/160 (6%)
Query: 85 FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGAR 144
++ +D NV++ EA +L ++ ++G+ +N K++ APG
Sbjct: 999 WLEADAQGNVIVLRQNLEAPTEQDQSQLQVISELNIGEQINRIRKLQV-------APGEN 1051
Query: 145 SRFL--TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGY 202
+ + + S +G L + + K L+ Q+ + + S G L+ +R ++ +
Sbjct: 1052 AIVVPKAFLGSTEGTLYLYGDIAPKYQDLLMTFQSRLQEYISTPGNLSFDLWRAFRNQSR 1111
Query: 203 YAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
P R +DG ++ +FL L G++ +C+ +G D+
Sbjct: 1112 EGEAPFR-FVDGEMIERFLDLDEGKQELVCEGLGPSVEDM 1150
>gi|391335522|ref|XP_003742140.1| PREDICTED: DNA damage-binding protein 1-like [Metaseiulus
occidentalis]
Length = 1154
Score = 45.4 bits (106), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 36/140 (25%), Positives = 64/140 (45%), Gaps = 6/140 (4%)
Query: 85 FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGAR 144
F+ ++ + N+ + P A + H + + +HLG+ VN K + D P
Sbjct: 980 FLAAESNLNLYVCKRDPSAADDTRQH-MHEVALYHLGEMVNVIVKGSLVMAQPGDMPLPL 1038
Query: 145 SR-FLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYY 203
++ FL Y SL GA+G +P+ ++ Y L +Q + G + +RT+ +
Sbjct: 1039 NKSFL--YGSLHGAVGVIVPIKQELYAILNQIQTNLAKTIKSVGKIEHGFWRTFLAERKI 1096
Query: 204 AGNPSRGIIDGSLVWKFLQL 223
P+ G IDG L+ + L L
Sbjct: 1097 --EPATGFIDGDLIEQLLDL 1114
>gi|358380497|gb|EHK18175.1| hypothetical protein TRIVIDRAFT_80808 [Trichoderma virens Gv29-8]
Length = 1161
Score = 45.4 bits (106), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 32/158 (20%), Positives = 71/158 (44%), Gaps = 6/158 (3%)
Query: 85 FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGAR 144
++ +D N+++ EA +L ++ ++G+ +N KI+ P+ +A
Sbjct: 999 WLEADAQGNIIVLRQNQEAPTEQDRSQLEITSELNIGEQINRIRKIQVAPA--ENAIVIP 1056
Query: 145 SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYA 204
FL S++G L + + K L+ Q+ + + G L+ +R ++ +
Sbjct: 1057 KAFL---GSIEGTLYLYGDIAPKYQDLLMTFQSRLQEYIQTPGNLSFDTWRAFRNQARDG 1113
Query: 205 GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
P R +DG ++ +FL L ++ +C+ +G D+
Sbjct: 1114 EAPFR-FVDGEMIERFLDLDEKQQELVCEGLGPSVEDM 1150
>gi|344231825|gb|EGV63707.1| hypothetical protein CANTEDRAFT_134986 [Candida tenuis ATCC 10573]
gi|344231826|gb|EGV63708.1| hypothetical protein CANTEDRAFT_134986 [Candida tenuis ATCC 10573]
Length = 991
Score = 45.1 bits (105), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 29/98 (29%), Positives = 50/98 (51%), Gaps = 9/98 (9%)
Query: 151 YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
YA L G +G LP+ E +++ L L + + G + FR GYY N +
Sbjct: 896 YAGLQGTIGILLPISESDFKFLSNLS--IELNKDLLLGRDHMKFR-----GYY--NSTHN 946
Query: 211 IIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYD 248
+IDG ++ KFL+L+ R++I K+ +I +++ D
Sbjct: 947 VIDGDIIEKFLELNASSRIKISNKLNKSVREIENKIND 984
>gi|71413583|ref|XP_808925.1| cleavage and polyadenylation specificity factor [Trypanosoma cruzi
strain CL Brener]
gi|70873226|gb|EAN87074.1| cleavage and polyadenylation specificity factor, putative
[Trypanosoma cruzi]
Length = 444
Score = 45.1 bits (105), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 40/185 (21%), Positives = 79/185 (42%), Gaps = 10/185 (5%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGG--HRLIKKTDFHLGQHVNTFF 128
V C+ + + G + SD ++N+++ Y P +E+ G +++++ G++
Sbjct: 259 VVHCDMMYHDRAFGLLCSDDERNLLVMGYTPRVQETEAGSPNKVLESVLSLDGEY---RL 315
Query: 129 KIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGG 188
C + A + +T Y + G +GF +P+ E+ R L + H+ G
Sbjct: 316 SGGCLVKSLRFRSLAGNSSVTLYVTNYGEIGFIVPIGEQANRTASWLMRRLQIDLPHSAG 375
Query: 189 LNPRAFRTY-KGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELY 247
L PR F +G A ++ SL+ +F L + R K I S L+ +
Sbjct: 376 LTPRMFLGLSQGSPRTAMRAKEMLVSASLLNEFFFLDIHSR----KTIASAAYTQLERVT 431
Query: 248 DIEAL 252
++ +L
Sbjct: 432 NVASL 436
>gi|440302955|gb|ELP95261.1| hypothetical protein EIN_430670 [Entamoeba invadens IP1]
Length = 1175
Score = 45.1 bits (105), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 28/96 (29%), Positives = 48/96 (50%), Gaps = 2/96 (2%)
Query: 151 YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
Y S G++G+ + E++Y+ L + M H G NP FR KG G G +
Sbjct: 1067 YLSAQGSVGYLTVVNERSYQTLRQINAKMNREPWHFAGTNPEEFRMEKGYGVGYGRRKQV 1126
Query: 211 IIDGSLVWKFLQLSLGERLEICKKIGSKHN--DILD 244
I+DG ++ +F L+ ++ +C + S + +ILD
Sbjct: 1127 ILDGDILKEFHFLTQEQQKRVCLRNTSISDVVNILD 1162
>gi|320593036|gb|EFX05445.1| uv-damaged DNA-binding protein [Grosmannia clavigera kw1407]
Length = 1504
Score = 45.1 bits (105), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 45/201 (22%), Positives = 86/201 (42%), Gaps = 14/201 (6%)
Query: 54 VSDRSKRANEQPGL---PRGVYACEFVIESSSMG--FMISDKDKNVVLFMYQPEARESNG 108
+DRSK E P L RG A G ++ +D N+ + + ++
Sbjct: 1306 TADRSKTIAEGPKLVERARGYQASWATAVCHVEGDLWLEADGFGNLTMLERDVQGVTADD 1365
Query: 109 GHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKN 168
RL + +LG+ VN R +P + +PGA + A+++G++ + +
Sbjct: 1366 KRRLRTVGEMYLGEMVN-----RIRPIAVETSPGAMVHPRAFLATVEGSIYMVGTIAPEA 1420
Query: 169 YRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGER 228
L+ LQ + G + A+R+++ + P R +DG L+ +FL + +
Sbjct: 1421 QDLLMNLQTKLAAIVKGPGNTSFSAYRSFRNAERESTEPFR-FVDGELLERFLDVGEDVQ 1479
Query: 229 LEICKKIGSKHND---ILDEL 246
E+ + +G D I++EL
Sbjct: 1480 KEVAQGLGPSVEDLRNIIEEL 1500
>gi|427780151|gb|JAA55527.1| Putative dna damage-binding protein 1 [Rhipicephalus pulchellus]
Length = 1181
Score = 45.1 bits (105), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 46/170 (27%), Positives = 74/170 (43%), Gaps = 18/170 (10%)
Query: 96 LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
LF+ Q ++ + R L + FHLG+ VN F S + PG S +
Sbjct: 1015 LFVCQKDSAATTDEERQHLQEVGQFHLGEFVNVFR----HGSLVMQHPGETSSPTQGSVL 1070
Query: 151 YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
+ ++ GA+G LP Y L +Q + G ++ +R++ + P+ G
Sbjct: 1071 FGTIHGAIGLVSQLPADFYTFLSEVQEKLTKVIKSVGKIDHAFWRSFSTE--RKTEPAVG 1128
Query: 211 IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYD-IEALS 253
IDG L+ FL LS + E+ + I G K + +D+L IE LS
Sbjct: 1129 FIDGDLIESFLDLSRDKMQEVVQGIQMDDGSGMKRDASVDDLIKIIEELS 1178
>gi|427788481|gb|JAA59692.1| Putative dna damage-binding protein 1 [Rhipicephalus pulchellus]
Length = 1156
Score = 45.1 bits (105), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 46/170 (27%), Positives = 74/170 (43%), Gaps = 18/170 (10%)
Query: 96 LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
LF+ Q ++ + R L + FHLG+ VN F S + PG S +
Sbjct: 990 LFVCQKDSAATTDEERQHLQEVGQFHLGEFVNVFR----HGSLVMQHPGETSSPTQGSVL 1045
Query: 151 YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
+ ++ GA+G LP Y L +Q + G ++ +R++ + P+ G
Sbjct: 1046 FGTIHGAIGLVSQLPADFYTFLSEVQEKLTKVIKSVGKIDHAFWRSFSTE--RKTEPAVG 1103
Query: 211 IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYD-IEALS 253
IDG L+ FL LS + E+ + I G K + +D+L IE LS
Sbjct: 1104 FIDGDLIESFLDLSRDKMQEVVQGIQMDDGSGMKRDASVDDLIKIIEELS 1153
>gi|303313681|ref|XP_003066852.1| CPSF A subunit region family protein [Coccidioides posadasii C735
delta SOWgp]
gi|240106514|gb|EER24707.1| CPSF A subunit region family protein [Coccidioides posadasii C735
delta SOWgp]
gi|320031496|gb|EFW13458.1| UV-damaged DNA binding protein [Coccidioides posadasii str. Silveira]
Length = 1144
Score = 45.1 bits (105), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 32/137 (23%), Positives = 62/137 (45%), Gaps = 6/137 (4%)
Query: 85 FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGAR 144
F+++D + N+V+ + R+ ++ LG+ VN + + SP ++P
Sbjct: 978 FLVADAEGNLVVLNRNTTGVTEDDRRRMQVTSELRLGEMVNRIHPMDLQTSP--ESPVIP 1035
Query: 145 SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYA 204
FL A++DG++ F + L+ LQ+ + + G + +R +K A
Sbjct: 1036 KAFL---ATVDGSIYLFGLISPSAQDTLMRLQSALADFVASPGEIPFNKYRAFKSSVRQA 1092
Query: 205 GNPSRGIIDGSLVWKFL 221
P R +DG L+ +FL
Sbjct: 1093 EEPFR-FVDGELIEQFL 1108
>gi|392864500|gb|EAS34654.2| UV-damaged DNA binding protein [Coccidioides immitis RS]
Length = 1144
Score = 44.7 bits (104), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 32/137 (23%), Positives = 62/137 (45%), Gaps = 6/137 (4%)
Query: 85 FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGAR 144
F+++D + N+V+ + R+ ++ LG+ VN + + SP ++P
Sbjct: 978 FLVADAEGNLVVLNRDTTGVTEDDRRRMQVTSELRLGEMVNRIHPMDLQTSP--ESPVIP 1035
Query: 145 SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYA 204
FL A++DG++ F + L+ LQ+ + + G + +R +K A
Sbjct: 1036 KAFL---ATVDGSIYLFGLISPSAQDTLMRLQSALADFVASPGEIPFNKYRAFKSSVRQA 1092
Query: 205 GNPSRGIIDGSLVWKFL 221
P R +DG L+ +FL
Sbjct: 1093 EEPFR-FVDGELIEQFL 1108
>gi|119191318|ref|XP_001246265.1| hypothetical protein CIMG_00036 [Coccidioides immitis RS]
Length = 1072
Score = 44.7 bits (104), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 32/137 (23%), Positives = 62/137 (45%), Gaps = 6/137 (4%)
Query: 85 FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGAR 144
F+++D + N+V+ + R+ ++ LG+ VN + + SP ++P
Sbjct: 906 FLVADAEGNLVVLNRDTTGVTEDDRRRMQVTSELRLGEMVNRIHPMDLQTSP--ESPVIP 963
Query: 145 SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYA 204
FL A++DG++ F + L+ LQ+ + + G + +R +K A
Sbjct: 964 KAFL---ATVDGSIYLFGLISPSAQDTLMRLQSALADFVASPGEIPFNKYRAFKSSVRQA 1020
Query: 205 GNPSRGIIDGSLVWKFL 221
P R +DG L+ +FL
Sbjct: 1021 EEPFR-FVDGELIEQFL 1036
>gi|195108657|ref|XP_001998909.1| GI23368 [Drosophila mojavensis]
gi|193915503|gb|EDW14370.1| GI23368 [Drosophila mojavensis]
Length = 1140
Score = 44.7 bits (104), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 41/162 (25%), Positives = 68/162 (41%), Gaps = 16/162 (9%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVN 125
P+ + A E + + + +G D LF+ Q ++ + R L + FHLG +N
Sbjct: 952 PKWMRAVEILDDDTFLGCETHDN-----LFVCQKDSAATTDEERQLLPELARFHLGDTIN 1006
Query: 126 TFFKIRCKPSPISDAPGARSRFL---TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
F S + G R+ + Y + +GA+G +P+ Y L L+ +
Sbjct: 1007 VFRH----GSLVMQNVGERTTPINGCVLYGTCNGAIGIVTQIPQDFYDFLHGLEERLKKI 1062
Query: 183 TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLS 224
G ++ +R Y+ PS G IDG L+ FL LS
Sbjct: 1063 IKSVGKIDHTYYRNYQINTKV--EPSEGFIDGDLIESFLDLS 1102
>gi|407410979|gb|EKF33219.1| cleavage and polyadenylation specificity factor, putative
[Trypanosoma cruzi marinkellei]
Length = 1436
Score = 44.7 bits (104), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 41/185 (22%), Positives = 78/185 (42%), Gaps = 10/185 (5%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGG--HRLIKKTDFHLGQHVNTFF 128
V C+ + + G + SD ++N+++ Y P +E+ G +++++ G++
Sbjct: 1251 VVHCDMMYHDRAFGLLCSDDERNLLVMGYTPRVQETEAGSPNKVLESVLSLDGEY---RL 1307
Query: 129 KIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGG 188
C + A + +T Y + G +GF +P+ E+ R L + H G
Sbjct: 1308 SGGCLVKSLRFRSLAGNSSVTLYVTNYGEIGFIVPIGEQANRTASWLMRRLQMDLPHNAG 1367
Query: 189 LNPRAFRTY-KGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELY 247
L PR F +G A ++ SL+ +F L + R K I S L+ +
Sbjct: 1368 LTPRMFLGLSQGSPRTALRAKEMLVSASLLNEFFFLDIHSR----KTIASAAYTQLERVT 1423
Query: 248 DIEAL 252
++ AL
Sbjct: 1424 NVAAL 1428
>gi|378734083|gb|EHY60542.1| histone H2A [Exophiala dermatitidis NIH/UT8656]
Length = 1361
Score = 44.7 bits (104), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 40/138 (28%), Positives = 60/138 (43%), Gaps = 13/138 (9%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
V A EF+ + + + SD D + + Y PE ++ G +L+ ++ FH G +
Sbjct: 1166 VLAAEFLPDGKQLYIISSDDDGLLRVLQYDPENPKAERGTKLLLRSTFHSGAAPTKMILL 1225
Query: 131 -------RCKPSPI-----SDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNV 178
R + I S A A R + +G+L PL E YRRL LQ
Sbjct: 1226 PPQVASGRGRDPEIDMDVDSGAGPAAGRHRILVTTQEGSLCMLTPLSEATYRRLSALQTT 1285
Query: 179 MVTHTS-HTGGLNPRAFR 195
++T H LNPRA+R
Sbjct: 1286 LLTTLDFHPCSLNPRAYR 1303
>gi|343962533|dbj|BAK62854.1| cleavage and polyadenylation specificity factor 160 kDa subunit
[Pan troglodytes]
Length = 269
Score = 44.7 bits (104), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 14/37 (37%), Positives = 30/37 (81%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEAR 104
P VY+ +F+++++ +GF++SD+D+N++++MY PE
Sbjct: 203 PLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEGE 239
>gi|345328202|ref|XP_003431248.1| PREDICTED: DNA damage-binding protein 1-like [Ornithorhynchus
anatinus]
Length = 1045
Score = 44.7 bits (104), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 42/165 (25%), Positives = 70/165 (42%), Gaps = 17/165 (10%)
Query: 96 LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
LF+ Q ++ + R L + FHLG+ VN F C S + G S +
Sbjct: 879 LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGETSTPTQGSVL 934
Query: 151 YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
+ +++G +G L E Y LL +QN + G + +R++ + P+ G
Sbjct: 935 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATG 992
Query: 211 IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
IDG L+ FL +S + E+ + G K +D+L I
Sbjct: 993 FIDGDLIESFLDISRPKMQEVVANLQIDDGSGMKREATVDDLIKI 1037
>gi|71654693|ref|XP_815961.1| cleavage and polyadenylation specificity factor [Trypanosoma cruzi
strain CL Brener]
gi|50363265|gb|AAT75335.1| cleavage polyadenylation specificity factor CPSF160 [Trypanosoma
cruzi]
gi|70881056|gb|EAN94110.1| cleavage and polyadenylation specificity factor, putative
[Trypanosoma cruzi]
Length = 1436
Score = 44.7 bits (104), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 40/185 (21%), Positives = 79/185 (42%), Gaps = 10/185 (5%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGG--HRLIKKTDFHLGQHVNTFF 128
V C+ + + G + SD ++N+++ Y P +E+ G +++++ G++
Sbjct: 1251 VVHCDMMYHDRAFGLLCSDDERNLLVMGYTPRVQETEAGSPNKVLESVLSLDGEY---RL 1307
Query: 129 KIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGG 188
C + A + +T Y + G +GF +P+ E+ R L + H+ G
Sbjct: 1308 SGGCLVKSLRFRSLAGNSSVTLYVTNYGEIGFIVPIGEQANRTASWLMRRLQIDLPHSAG 1367
Query: 189 LNPRAFRTY-KGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELY 247
L PR F +G A ++ SL+ +F L + R K I S L+ +
Sbjct: 1368 LTPRMFLGLSQGSPRTAMRAKEMLVSASLLNEFFFLDIHSR----KTIASAAYTQLERVT 1423
Query: 248 DIEAL 252
++ +L
Sbjct: 1424 NVASL 1428
>gi|452820919|gb|EME27955.1| splicing factor 3B subunit 3 [Galdieria sulphuraria]
Length = 1294
Score = 44.7 bits (104), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 47/182 (25%), Positives = 79/182 (43%), Gaps = 29/182 (15%)
Query: 87 ISDKDKNVVLFMYQPEA-----RESNGG-------HRLIKKTDFHLGQHVNTFFKIRCKP 134
I DK N+ + PEA ++ GG H + +++G + K+
Sbjct: 1100 IGDKMGNISILRLPPEAGTFIEQDPTGGLLSKEAPHHFQLEACYYVGSVIQCLSKVEWT- 1158
Query: 135 SPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLM-LQNVMVTHTSHTGGLNPRA 193
D P L +Y +LDGA+G +PL L L+ + + S G + A
Sbjct: 1159 --TGDVP------LLFYGTLDGAIGVMIPLRSTLDMELFQALELQLREYRSPLCGRHHLA 1210
Query: 194 FRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALS 253
+R+Y ++ P R +IDG L +F +LSL ++ +I K++ D+ +L D S
Sbjct: 1211 YRSY----FF---PVRHVIDGDLCEEFYRLSLEQQEKIVKELDRSIVDVHRKLEDYRERS 1263
Query: 254 SH 255
H
Sbjct: 1264 PH 1265
>gi|407850337|gb|EKG04765.1| cleavage and polyadenylation specificity factor, putative
[Trypanosoma cruzi]
Length = 1436
Score = 44.7 bits (104), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 40/185 (21%), Positives = 79/185 (42%), Gaps = 10/185 (5%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGG--HRLIKKTDFHLGQHVNTFF 128
V C+ + + G + SD ++N+++ Y P +E+ G +++++ G++
Sbjct: 1251 VVHCDMMYHDRAFGLLCSDDERNLLVMGYTPRVQETEAGSPNKVLESVLSLDGEY---RL 1307
Query: 129 KIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGG 188
C + A + +T Y + G +GF +P+ E+ R L + H+ G
Sbjct: 1308 SGGCLVKSLRFRSLAGNSSVTLYVTNYGEIGFIVPIGEQANRTASWLMRRLQIDLPHSAG 1367
Query: 189 LNPRAFRTY-KGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELY 247
L PR F +G A ++ SL+ +F L + R K I S L+ +
Sbjct: 1368 LTPRMFLGLSQGSPRTAMRAKEMLVSASLLNEFFFLDIHSR----KTIASAAYTQLERVT 1423
Query: 248 DIEAL 252
++ +L
Sbjct: 1424 NVASL 1428
>gi|124806507|ref|XP_001350742.1| splicing factor 3b, subunit 3, 130kD, putative [Plasmodium falciparum
3D7]
gi|23496869|gb|AAN36422.1|AE014849_41 splicing factor 3b, subunit 3, 130kD, putative [Plasmodium falciparum
3D7]
Length = 1329
Score = 44.3 bits (103), Expect = 0.051, Method: Compositional matrix adjust.
Identities = 60/260 (23%), Positives = 105/260 (40%), Gaps = 59/260 (22%)
Query: 10 VSLGLHGNRPLLLVRTQHELLIYQAFRHP-KGTLKLRFKKLKVLFVSDRSKRANEQPGLP 68
VS+ + GNR + + +LI+ F P + TL+L +SD +P
Sbjct: 1104 VSIKISGNR-IFACDIRESVLIF--FYDPNQNTLRL---------ISDDI--------IP 1143
Query: 69 RGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGG------------------H 110
R + C +++ ++ M +DK +V + EA++ G
Sbjct: 1144 RWI-TCSEILDHHTI--MAADKFDSVFILRVPEEAKQDEYGITNKCWYGGEIMNSSTKNR 1200
Query: 111 RLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYR 170
+L FH+G+ V + K+R P+ S Y+++ G +G F+P K
Sbjct: 1201 KLEHMMSFHIGEIVTSMQKVRLSPT---------SSECIIYSTIMGTIGAFIPYDNKEEL 1251
Query: 171 RLLM-LQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERL 229
L L+ ++ T G FR+Y +P + ++DG L +F LS +
Sbjct: 1252 ELTQHLEIILRTEKPPLCGREHIFFRSYY-------HPVQNVVDGDLCEQFSSLSYDAQK 1304
Query: 230 EICKKIGSKHNDILDELYDI 249
+I + DIL +L DI
Sbjct: 1305 KIANDLERTPEDILRKLEDI 1324
>gi|195500686|ref|XP_002097479.1| GE26244 [Drosophila yakuba]
gi|194183580|gb|EDW97191.1| GE26244 [Drosophila yakuba]
Length = 1140
Score = 44.3 bits (103), Expect = 0.053, Method: Compositional matrix adjust.
Identities = 42/162 (25%), Positives = 68/162 (41%), Gaps = 16/162 (9%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVN 125
P+ + A E + + + +G + N LF+ Q ++ + R L + FHLG VN
Sbjct: 952 PKWMRAVEILDDDTFLG-----SETNGNLFVCQKDSAATTDEERQLLPELARFHLGDTVN 1006
Query: 126 TFFKIRCKPSPISDAPGARSRFL---TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
F S + G R+ + Y + +GA+G +P+ Y L LQ +
Sbjct: 1007 VFRH----GSLVMQNVGERTTPINGCVLYGTCNGAIGIVTQIPQDFYDFLHGLQERLKKI 1062
Query: 183 TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLS 224
G + +R ++ PS G IDG L+ FL LS
Sbjct: 1063 IKSVGKIEHTYYRNFQINNKV--EPSEGFIDGDLIESFLDLS 1102
>gi|440639387|gb|ELR09306.1| hypothetical protein GMDG_03874 [Geomyces destructans 20631-21]
Length = 1138
Score = 43.9 bits (102), Expect = 0.058, Method: Compositional matrix adjust.
Identities = 40/165 (24%), Positives = 72/165 (43%), Gaps = 8/165 (4%)
Query: 84 GFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGA 143
G++ +D D N+++ A ++ +FHLG+ VN KIR S GA
Sbjct: 976 GWLEADADGNLIVLRRNEAAVTFEDRKKMEVTGEFHLGEQVNRIRKIRVDAS-----EGA 1030
Query: 144 RSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYY 203
+ A+ +G+L + + + LL LQ + + G + +R+++
Sbjct: 1031 TVVPRAFLATTEGSLFLYGSVAPASQDLLLRLQQRLAENVETPGNIPFTTYRSFRNAERE 1090
Query: 204 AGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHN--DILDEL 246
P R IDG L+ +FL L + +CK + D+++EL
Sbjct: 1091 TEEPYR-FIDGELIERFLDLDEERQEVVCKGLAKVEEVRDLVEEL 1134
>gi|221061705|ref|XP_002262422.1| splicing factor 3b, subunit 3, 130kd [Plasmodium knowlesi strain H]
gi|193811572|emb|CAQ42300.1| splicing factor 3b, subunit 3, 130kd, putative [Plasmodium knowlesi
strain H]
Length = 1276
Score = 43.9 bits (102), Expect = 0.059, Method: Compositional matrix adjust.
Identities = 46/202 (22%), Positives = 84/202 (41%), Gaps = 38/202 (18%)
Query: 67 LPRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARE------------------SNG 108
+PR + C +++ ++ M +DK +V + EA++ SN
Sbjct: 1089 IPRWI-TCSEILDHHTI--MAADKFDSVFVLRVPEEAKQEEYGISNKCWYGGEMMAGSNK 1145
Query: 109 GHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKN 168
RL +FH+G+ V + K++ P+ S Y+++ G +G F+P K
Sbjct: 1146 NRRLEHIMNFHVGEIVTSLQKVKLSPT---------SSECIIYSTIMGTIGAFIPYDNKE 1196
Query: 169 YRRLLM-LQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGE 227
L L+ ++ T G FR+Y +P + +IDG L +F L
Sbjct: 1197 ELELTQHLEIILRTENPPLCGREHIFFRSYY-------HPVQHVIDGDLCEQFSSLPYDI 1249
Query: 228 RLEICKKIGSKHNDILDELYDI 249
+ ++ + +DIL +L DI
Sbjct: 1250 QRKVAADLERTPDDILRKLEDI 1271
>gi|342885673|gb|EGU85655.1| hypothetical protein FOXB_03801 [Fusarium oxysporum Fo5176]
Length = 1160
Score = 43.9 bits (102), Expect = 0.061, Method: Compositional matrix adjust.
Identities = 33/158 (20%), Positives = 75/158 (47%), Gaps = 6/158 (3%)
Query: 85 FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGAR 144
++ +D N+V+ +A RL ++ ++G+ +N +IR P+++
Sbjct: 998 WLEADSKGNLVVLQRNVDAPTEQDRSRLEITSEMNIGEQIN---RIRKLHVPMAENGIVH 1054
Query: 145 SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYA 204
R + AS +G+L + + + L+ Q+ M + G + + +R+++ + +
Sbjct: 1055 PR--AFLASAEGSLYLYGDIAPQYQDLLMTFQSKMEEYIHVPGSVEFKLWRSFRNENRES 1112
Query: 205 GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
P R IDG +V +FL + G++ +C+ +G D+
Sbjct: 1113 EGPFR-FIDGEMVERFLDMDEGKQELVCEGLGPSIEDM 1149
>gi|156049323|ref|XP_001590628.1| hypothetical protein SS1G_08368 [Sclerotinia sclerotiorum 1980]
gi|154692767|gb|EDN92505.1| hypothetical protein SS1G_08368 [Sclerotinia sclerotiorum 1980 UF-70]
Length = 1153
Score = 43.9 bits (102), Expect = 0.061, Method: Compositional matrix adjust.
Identities = 41/180 (22%), Positives = 76/180 (42%), Gaps = 20/180 (11%)
Query: 77 VIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSP 136
+ E + ++ SD D N+++ E RL + +LG+ VN +I + S
Sbjct: 969 IAEVDTDTYLESDHDGNLIVLKRNREGVTREDKLRLEVLCEMNLGEMVNKIKRINVETS- 1027
Query: 137 ISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHT---------- 186
DA F+ A+ +G++ F +P +N L+ LQ+ + + + +
Sbjct: 1028 -KDALLIPRAFV---ATTEGSIYLFSLIPPQNQDLLMRLQSRLASLPARSLTDSSFSAPI 1083
Query: 187 ----GGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
G L+ +R+Y P R +DG L+ +FL L + IC +G + D+
Sbjct: 1084 EFSPGNLDFDKYRSYVSAVRETNEPFR-FVDGELIERFLDLDGAIQENICDGLGVRAEDL 1142
>gi|452838792|gb|EME40732.1| hypothetical protein DOTSEDRAFT_177898 [Dothistroma septosporum
NZE10]
Length = 1138
Score = 43.9 bits (102), Expect = 0.064, Method: Compositional matrix adjust.
Identities = 36/154 (23%), Positives = 67/154 (43%), Gaps = 8/154 (5%)
Query: 85 FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGAR 144
++ SD + N+++ + RL ++ LG+ VN +I P+ GA
Sbjct: 974 YLQSDAEGNLIVLQHDTNGFSEEDRRRLRVTSELLLGEMVNRIRRIDVTPTH-----GAL 1028
Query: 145 SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKG--KGY 202
+ A+++G++ F + L+ +QN M + G + FR +K +
Sbjct: 1029 VIPRAFLATVEGSIYLFALIVPGKQDLLMRMQNNMASLVKSPGHVEFATFRGFKNQVRDE 1088
Query: 203 YAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 236
A PSR +DG L+ +FL + EI + +G
Sbjct: 1089 GANGPSR-FVDGELIERFLDCGQDIQEEIIRDLG 1121
>gi|356525403|ref|XP_003531314.1| PREDICTED: DNA damage-binding protein 1-like isoform 2 [Glycine max]
Length = 1068
Score = 43.9 bits (102), Expect = 0.069, Method: Compositional matrix adjust.
Identities = 33/129 (25%), Positives = 57/129 (44%), Gaps = 12/129 (9%)
Query: 111 RLIKKTDFHLGQHVNTF----FKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPE 166
RL ++HLG+ VN F +R S + P + +++G +G LP
Sbjct: 930 RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIP------TVIFGTINGVIGVIASLPH 983
Query: 167 KNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLG 226
+ Y L LQ+ + GGL+ +R++ + +R +DG L+ FL L+
Sbjct: 984 EQYVFLEKLQSNLRKVIKGVGGLSHEQWRSFNNEKKTV--EARNFLDGDLIESFLDLNRS 1041
Query: 227 ERLEICKKI 235
+ EI K +
Sbjct: 1042 KMDEISKAV 1050
>gi|327278830|ref|XP_003224163.1| PREDICTED: DNA damage-binding protein 1-like [Anolis carolinensis]
Length = 1140
Score = 43.9 bits (102), Expect = 0.069, Method: Compositional matrix adjust.
Identities = 37/141 (26%), Positives = 60/141 (42%), Gaps = 15/141 (10%)
Query: 118 FHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLM 174
FHLG+ VN F C S + G S + + +++G +G L E Y LL
Sbjct: 998 FHLGEFVNVF----CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLD 1053
Query: 175 LQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKK 234
+QN + G + +R++ + P+ G IDG L+ FL +S + E+
Sbjct: 1054 VQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVAN 1111
Query: 235 I------GSKHNDILDELYDI 249
+ G K +D+L I
Sbjct: 1112 LQIDDGSGMKREATVDDLIKI 1132
>gi|356525401|ref|XP_003531313.1| PREDICTED: DNA damage-binding protein 1-like isoform 1 [Glycine max]
Length = 1089
Score = 43.9 bits (102), Expect = 0.072, Method: Compositional matrix adjust.
Identities = 33/129 (25%), Positives = 57/129 (44%), Gaps = 12/129 (9%)
Query: 111 RLIKKTDFHLGQHVNTF----FKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPE 166
RL ++HLG+ VN F +R S + P + +++G +G LP
Sbjct: 951 RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIP------TVIFGTINGVIGVIASLPH 1004
Query: 167 KNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLG 226
+ Y L LQ+ + GGL+ +R++ + +R +DG L+ FL L+
Sbjct: 1005 EQYVFLEKLQSNLRKVIKGVGGLSHEQWRSFNNEKKTV--EARNFLDGDLIESFLDLNRS 1062
Query: 227 ERLEICKKI 235
+ EI K +
Sbjct: 1063 KMDEISKAV 1071
>gi|389586447|dbj|GAB69176.1| splicing factor 3B subunit 3 [Plasmodium cynomolgi strain B]
Length = 1286
Score = 43.5 bits (101), Expect = 0.076, Method: Compositional matrix adjust.
Identities = 46/202 (22%), Positives = 83/202 (41%), Gaps = 38/202 (18%)
Query: 67 LPRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARE------------------SNG 108
+PR + C +++ ++ M +DK +V + EA++ SN
Sbjct: 1099 IPRWI-TCSEILDHHTI--MAADKFDSVFVLRVPEEAKQEEYGISNKCWYGGEIMAGSNK 1155
Query: 109 GHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKN 168
RL FH+G+ V + K++ P+ S Y+++ G +G F+P K
Sbjct: 1156 NRRLEHIMSFHVGEIVTSLQKVKLSPT---------SSECIIYSTIMGTIGAFIPYDNKE 1206
Query: 169 YRRLLM-LQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGE 227
L L+ ++ T G FR+Y +P + +IDG L +F L
Sbjct: 1207 ELELTQHLEIILRTENPPLCGREHIFFRSYY-------HPVQHVIDGDLCEQFSSLPYDV 1259
Query: 228 RLEICKKIGSKHNDILDELYDI 249
+ ++ + +DIL +L DI
Sbjct: 1260 QRKVAADLERTPDDILRKLEDI 1281
>gi|301616502|ref|XP_002937687.1| PREDICTED: DNA damage-binding protein 1-like [Xenopus (Silurana)
tropicalis]
Length = 1140
Score = 43.5 bits (101), Expect = 0.079, Method: Compositional matrix adjust.
Identities = 41/165 (24%), Positives = 70/165 (42%), Gaps = 17/165 (10%)
Query: 96 LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
LF+ Q ++ + R L + FHLG+ VN F C S + G S +
Sbjct: 974 LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGETSPPTQGSVL 1029
Query: 151 YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
+ +++G +G L E Y LL +QN + G + +R++ + P+ G
Sbjct: 1030 FGTVNGMIGLVTSLSESWYNLLLDVQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATG 1087
Query: 211 IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
IDG L+ FL +S + E+ + G K +D+L +
Sbjct: 1088 FIDGDLIESFLDISRPKMQEVIANLQIDDGSGMKRETTVDDLIKV 1132
>gi|156095699|ref|XP_001613884.1| Splicing factor 3B subunit 3 [Plasmodium vivax Sal-1]
gi|148802758|gb|EDL44157.1| Splicing factor 3B subunit 3, putative [Plasmodium vivax]
Length = 1230
Score = 43.5 bits (101), Expect = 0.079, Method: Compositional matrix adjust.
Identities = 46/202 (22%), Positives = 83/202 (41%), Gaps = 38/202 (18%)
Query: 67 LPRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARE------------------SNG 108
+PR + C +++ ++ M +DK +V + EA++ SN
Sbjct: 1043 IPRWI-TCSEILDHHTI--MAADKFDSVFVLRVPEEAKQEEYGISNKCWYGGEIMAGSNK 1099
Query: 109 GHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKN 168
RL FH+G+ V + K++ P+ S Y+++ G +G F+P K
Sbjct: 1100 NRRLEHIMSFHVGEIVTSLQKVKLSPT---------SSECIIYSTIMGTIGAFIPYDNKE 1150
Query: 169 YRRLLM-LQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGE 227
L L+ ++ T G FR+Y +P + +IDG L +F L
Sbjct: 1151 ELELTQHLEIILRTENPPLCGREHIFFRSYY-------HPVQHVIDGDLCEQFSSLPYDV 1203
Query: 228 RLEICKKIGSKHNDILDELYDI 249
+ ++ + +DIL +L DI
Sbjct: 1204 QRKVAADLERTPDDILRKLEDI 1225
>gi|384253371|gb|EIE26846.1| hypothetical protein COCSUDRAFT_52476 [Coccomyxa subellipsoidea
C-169]
Length = 1205
Score = 43.5 bits (101), Expect = 0.081, Method: Compositional matrix adjust.
Identities = 40/145 (27%), Positives = 63/145 (43%), Gaps = 18/145 (12%)
Query: 107 NGG-HRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLP 165
NG H+L +FH+G V + + +P G R L YA++ GA+G LP P
Sbjct: 1072 NGAPHKLEDVVNFHVGDLVTSLQRAVLQP-------GGREVLL--YATVMGAIGAMLPFP 1122
Query: 166 EKNYRRLLM-LQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLS 224
+ L+ + GG R +Y+G + P + +IDG L F QL
Sbjct: 1123 SREDVDFFSHLEMHLRQEHPPMGG---RDHMSYRGSYF----PVKDVIDGDLCEHFSQLP 1175
Query: 225 LGERLEICKKIGSKHNDILDELYDI 249
++ I ++ +IL +L DI
Sbjct: 1176 AAKQKSIADELERTPGEILKKLEDI 1200
>gi|168047617|ref|XP_001776266.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162672361|gb|EDQ58899.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1089
Score = 43.5 bits (101), Expect = 0.081, Method: Compositional matrix adjust.
Identities = 34/126 (26%), Positives = 55/126 (43%), Gaps = 12/126 (9%)
Query: 111 RLIKKTDFHLGQHVNTF----FKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPE 166
RL ++HLG+ VN F +R S S P + +++G +G LP+
Sbjct: 951 RLEVVGEYHLGEFVNRFRHGSLVMRLPDSEASLIP------TVIFGTVNGVIGVIASLPQ 1004
Query: 167 KNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLG 226
+ L LQ +V GGL+ +R++ + +R +DG L+ FL LS
Sbjct: 1005 DKFLFLQKLQQALVKVIKGVGGLSHEQWRSFSNERKTV--DARNFLDGDLIESFLDLSRN 1062
Query: 227 ERLEIC 232
+ EI
Sbjct: 1063 KMEEIA 1068
>gi|147906138|ref|NP_001083624.1| DNA damage-binding protein 1 [Xenopus laevis]
gi|82186503|sp|Q6P6Z0.1|DDB1_XENLA RecName: Full=DNA damage-binding protein 1; AltName:
Full=Damage-specific DNA-binding protein 1
gi|38303806|gb|AAH61946.1| Ddb1 protein [Xenopus laevis]
Length = 1140
Score = 43.5 bits (101), Expect = 0.081, Method: Compositional matrix adjust.
Identities = 41/165 (24%), Positives = 70/165 (42%), Gaps = 17/165 (10%)
Query: 96 LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
LF+ Q ++ + R L + FHLG+ VN F C S + G S +
Sbjct: 974 LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGETSPPTQGSVL 1029
Query: 151 YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
+ +++G +G L E Y LL +QN + G + +R++ + P+ G
Sbjct: 1030 FGTVNGMIGLVTSLSESWYNLLLDVQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATG 1087
Query: 211 IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
IDG L+ FL +S + E+ + G K +D+L +
Sbjct: 1088 FIDGDLIESFLDISRPKMQEVIANLQIDDGSGMKRETTVDDLIKV 1132
>gi|195037449|ref|XP_001990173.1| GH18378 [Drosophila grimshawi]
gi|193894369|gb|EDV93235.1| GH18378 [Drosophila grimshawi]
Length = 1140
Score = 43.5 bits (101), Expect = 0.083, Method: Compositional matrix adjust.
Identities = 40/162 (24%), Positives = 68/162 (41%), Gaps = 16/162 (9%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVN 125
P+ + A E + + + +G D LF+ Q ++ + R L + FHLG +N
Sbjct: 952 PKWMRAVEILDDDTFLGCETHDN-----LFVCQKDSAATTDEERQLLPELARFHLGDTIN 1006
Query: 126 TFFKIRCKPSPISDAPGARSRFL---TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
F S + G R+ + Y + +GA+G +P+ Y L L+ +
Sbjct: 1007 VFRH----GSLVMQNVGERTTPINGCVLYGTCNGAIGIVTQIPQDFYDFLHGLEERLKKI 1062
Query: 183 TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLS 224
G ++ +R Y+ PS G IDG L+ FL L+
Sbjct: 1063 IKSVGKIDHTYYRNYQINTKV--EPSEGFIDGDLIESFLDLN 1102
>gi|195395112|ref|XP_002056180.1| GJ10363 [Drosophila virilis]
gi|194142889|gb|EDW59292.1| GJ10363 [Drosophila virilis]
Length = 1140
Score = 43.5 bits (101), Expect = 0.088, Method: Compositional matrix adjust.
Identities = 40/162 (24%), Positives = 68/162 (41%), Gaps = 16/162 (9%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVN 125
P+ + A E + + + +G D LF+ Q ++ + R L + FHLG +N
Sbjct: 952 PKWMRAVEILDDDTFLGCETHDN-----LFVCQKDSAATTDEERQLLPELARFHLGDTIN 1006
Query: 126 TFFKIRCKPSPISDAPGARSRFL---TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
F S + G R+ + Y + +GA+G +P+ Y L L+ +
Sbjct: 1007 VFRH----GSLVMQNVGERTTPINGCVLYGTCNGAIGIVTQIPQDFYDFLHGLEERLKKI 1062
Query: 183 TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLS 224
G ++ +R Y+ PS G IDG L+ FL L+
Sbjct: 1063 IKSVGKIDHTYYRNYQINTKV--EPSEGFIDGDLIESFLDLN 1102
>gi|194377326|dbj|BAG57611.1| unnamed protein product [Homo sapiens]
Length = 451
Score = 43.5 bits (101), Expect = 0.097, Method: Compositional matrix adjust.
Identities = 37/145 (25%), Positives = 63/145 (43%), Gaps = 11/145 (7%)
Query: 96 LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
LF+ Q ++ + R L + FHLG+ VN F C S + G S +
Sbjct: 285 LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGETSTPTQGSVL 340
Query: 151 YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
+ +++G +G L E Y LL +QN + G + +R++ + P+ G
Sbjct: 341 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATG 398
Query: 211 IIDGSLVWKFLQLSLGERLEICKKI 235
IDG L+ FL +S + E+ +
Sbjct: 399 FIDGDLIESFLDISRPKMQEVVANL 423
>gi|159470709|ref|XP_001693499.1| predicted protein [Chlamydomonas reinhardtii]
gi|158283002|gb|EDP08753.1| predicted protein [Chlamydomonas reinhardtii]
Length = 279
Score = 43.1 bits (100), Expect = 0.099, Method: Compositional matrix adjust.
Identities = 49/203 (24%), Positives = 84/203 (41%), Gaps = 24/203 (11%)
Query: 69 RGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESN----GGHRLIKKTDFHLGQHV 124
R V C VI + F+ +D + + + + R++N G RL H+ + V
Sbjct: 82 RDVLTCGVVIAEPKLHFLAADAAGTLQMMEFYGK-RDTNPEFWAGQRLAPMGLLHVARRV 140
Query: 125 NTFFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHT- 183
++ + R+R S +G L F P+P+ L ++ T
Sbjct: 141 GVAASVQLA------SRDGRNRHALLCGSAEGGLSFVAPVPDPQAAARLAALQAHMSATL 194
Query: 184 SHTGGLNPRAFRTY---------KGKGYYAGNPSRG---IIDGSLVWKFLQLSLGERLEI 231
H GLNPR+FR G+ + A P R ++DG L+ F LS ++ E
Sbjct: 195 PHVAGLNPRSFRHRFIRIPKALGGGEHHRAPLPPRNNSGLLDGQLLLGFPHLSRQQQAEA 254
Query: 232 CKKIGSKHNDILDELYDIEALSS 254
+ +GS +L++L I A ++
Sbjct: 255 AEAVGSSPQQLLEDLRAIAAAAT 277
>gi|74025892|ref|XP_829512.1| cleavage and polyadenylation specificity factor-like protein
[Trypanosoma brucei brucei strain 927/4 GUTat10.1]
gi|70834898|gb|EAN80400.1| cleavage and polyadenylation specificity factor-like protein,
putative [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
Length = 1452
Score = 43.1 bits (100), Expect = 0.099, Method: Compositional matrix adjust.
Identities = 45/186 (24%), Positives = 73/186 (39%), Gaps = 12/186 (6%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGG-HRLIKKTDFHLGQHVNTFFK 129
V C+ + + G + SD ++NV++ Y P +E++ G H + ++ L
Sbjct: 1267 VVHCDMMYHDRAFGILCSDDERNVLIMGYTPRVQETDAGTHPKVLESVLSLDGEYRLPSG 1326
Query: 130 IRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGL 189
K G S +T Y S G +GF +P+ E+ R L + + GL
Sbjct: 1327 SLVKSLRFRSTAGNSS--VTLYVSNYGEIGFIVPIGEQANRTALWVTRRLQIDLPCEAGL 1384
Query: 190 NPRAFRTYKGKGYYAGNPSRG---IIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEL 246
PR F + + N RG ++ L+ L L R K I LD +
Sbjct: 1385 TPRMFLSLNQRS--PRNSLRGKEMLVPAPLLRGLFSLDLRSR----KAIARAAYTQLDRV 1438
Query: 247 YDIEAL 252
+I AL
Sbjct: 1439 ANIVAL 1444
>gi|260947152|ref|XP_002617873.1| hypothetical protein CLUG_01332 [Clavispora lusitaniae ATCC 42720]
gi|238847745|gb|EEQ37209.1| hypothetical protein CLUG_01332 [Clavispora lusitaniae ATCC 42720]
Length = 1242
Score = 43.1 bits (100), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 42/172 (24%), Positives = 75/172 (43%), Gaps = 21/172 (12%)
Query: 87 ISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARSR 146
++++ +N VL ++ E ++ RL K DF+ V + K S G
Sbjct: 1077 VAEQLENNVLMKFEEETLGASSS-RLDKLCDFYTQDIVTSLHKG-------SFVVGGSES 1128
Query: 147 FLTWYASLDGALGFFLPLPEKNYRRLLM-LQNVMVTHTSHT--------GGLNPRAFRTY 197
+ Y L G +G LPL LLM L+N + + + + G N
Sbjct: 1129 II--YTGLQGTVGILLPLATTQEVDLLMKLENSLRDYFNDSFDDFDNTKQGFNLVGREHL 1186
Query: 198 KGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI 249
K +GYY NP +IDG + +F +L+ ++++ ++ DI ++YD+
Sbjct: 1187 KFRGYY--NPVENVIDGDFIERFFELNPSAQVKLAGRLDKSPRDIERKIYDL 1236
>gi|254585271|ref|XP_002498203.1| ZYRO0G04730p [Zygosaccharomyces rouxii]
gi|238941097|emb|CAR29270.1| ZYRO0G04730p [Zygosaccharomyces rouxii]
Length = 1302
Score = 43.1 bits (100), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 23/55 (41%), Positives = 33/55 (60%), Gaps = 3/55 (5%)
Query: 196 TYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI-GSKHNDILDELYDI 249
++K + YYA P R +IDG L FL LSL E+ ++CK+ GS + +L DI
Sbjct: 1245 SFKYRSYYA--PVRNVIDGDLCETFLNLSLSEQTKLCKETSGSNPEGVCKQLNDI 1297
>gi|355752055|gb|EHH56175.1| Damage-specific DNA-binding protein 1, partial [Macaca fascicularis]
Length = 1125
Score = 43.1 bits (100), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 41/165 (24%), Positives = 69/165 (41%), Gaps = 17/165 (10%)
Query: 96 LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
LF+ Q ++ + R L + FHLG+ VN F C S + G S +
Sbjct: 959 LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGETSTPTQGSVL 1014
Query: 151 YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
+ +++G +G L E Y LL +QN + G + +R++ + P+ G
Sbjct: 1015 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATG 1072
Query: 211 IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
IDG L+ FL +S + E+ + G K D+L +
Sbjct: 1073 FIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKV 1117
>gi|119594343|gb|EAW73937.1| damage-specific DNA binding protein 1, 127kDa, isoform CRA_e [Homo
sapiens]
Length = 896
Score = 43.1 bits (100), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 41/165 (24%), Positives = 69/165 (41%), Gaps = 17/165 (10%)
Query: 96 LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
LF+ Q ++ + R L + FHLG+ VN F C S + G S +
Sbjct: 730 LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGETSTPTQGSVL 785
Query: 151 YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
+ +++G +G L E Y LL +QN + G + +R++ + P+ G
Sbjct: 786 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATG 843
Query: 211 IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
IDG L+ FL +S + E+ + G K D+L +
Sbjct: 844 FIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKV 888
>gi|449519304|ref|XP_004166675.1| PREDICTED: DNA damage-binding protein 1a-like [Cucumis sativus]
Length = 596
Score = 43.1 bits (100), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 33/129 (25%), Positives = 57/129 (44%), Gaps = 12/129 (9%)
Query: 111 RLIKKTDFHLGQHVNTF----FKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPE 166
RL ++HLG+ VN F +R S + P + S++G +G LP
Sbjct: 458 RLEVVGEYHLGEFVNRFQHGSLVMRLPDSDVGQIP------TVIFGSVNGVIGVIASLPH 511
Query: 167 KNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLG 226
Y L LQ+ + GGL+ +R++ + A ++ +DG L+ FL L+
Sbjct: 512 DQYVFLERLQSNLRKVIKGVGGLSHEQWRSFNNEKRTA--EAKNFLDGDLIESFLDLNRS 569
Query: 227 ERLEICKKI 235
+ EI + +
Sbjct: 570 KMEEISRAM 578
>gi|326474493|gb|EGD98502.1| UV-damaged DNA binding protein [Trichophyton tonsurans CBS 112818]
gi|326481558|gb|EGE05568.1| UV-damaged DNA binding protein [Trichophyton equinum CBS 127.97]
Length = 1147
Score = 43.1 bits (100), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 36/152 (23%), Positives = 70/152 (46%), Gaps = 8/152 (5%)
Query: 85 FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGAR 144
++++D + N+V+ + RL ++ LG+ VN I + + A AR
Sbjct: 982 YLLADAEGNLVVLQQNITGVTESDRRRLQPTSEIRLGEMVNRIHPIVVQ-THTETAVSAR 1040
Query: 145 SRFLTWYASLDGALGFFLPLPEKNYRRLLM-LQNVMVTHTSHTGGLNPRAFRTYKGKGYY 203
+ A++DG++ + L NY LL+ LQ M + + G + +R ++
Sbjct: 1041 A----LLATVDGSI-YLFGLINPNYVDLLLRLQTAMGSIATSPGEIPFSKYRAFRTTVRQ 1095
Query: 204 AGNPSRGIIDGSLVWKFLQLSLGERLEICKKI 235
+ P R +DG L+ +FL +L + EI ++
Sbjct: 1096 SDEPFR-FVDGELIERFLSCTLAMQQEIVGRL 1126
>gi|149725200|ref|XP_001502072.1| PREDICTED: DNA damage-binding protein 1 [Equus caballus]
Length = 1140
Score = 43.1 bits (100), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 36/134 (26%), Positives = 59/134 (44%), Gaps = 11/134 (8%)
Query: 96 LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
LF+ Q ++ + R L + FHLG+ VN F C S + G S +
Sbjct: 974 LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGETSTPTQGSVL 1029
Query: 151 YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
+ +++G +G L E Y LL +QN + G + +R++ + P+ G
Sbjct: 1030 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATG 1087
Query: 211 IIDGSLVWKFLQLS 224
IDG L+ FL +S
Sbjct: 1088 FIDGDLIESFLDIS 1101
>gi|418316|sp|P33194.1|DDB1_CERAE RecName: Full=DNA damage-binding protein 1; AltName: Full=DDB p127
subunit; AltName: Full=DDBa; AltName:
Full=Damage-specific DNA-binding protein 1; AltName:
Full=UV-damaged DNA-binding protein 1; Short=UV-DDB 1
gi|304026|gb|AAA03021.1| UV-damaged DNA-binding protein [Chlorocebus aethiops]
Length = 1140
Score = 43.1 bits (100), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 41/165 (24%), Positives = 69/165 (41%), Gaps = 17/165 (10%)
Query: 96 LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
LF+ Q ++ + R L + FHLG+ VN F C S + G S +
Sbjct: 974 LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGETSTPTQGSVL 1029
Query: 151 YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
+ +++G +G L E Y LL +QN + G + +R++ + P+ G
Sbjct: 1030 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATG 1087
Query: 211 IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
IDG L+ FL +S + E+ + G K D+L +
Sbjct: 1088 FIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKV 1132
>gi|356512638|ref|XP_003525025.1| PREDICTED: DNA damage-binding protein 1a-like isoform 2 [Glycine max]
Length = 1068
Score = 43.1 bits (100), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 33/129 (25%), Positives = 57/129 (44%), Gaps = 12/129 (9%)
Query: 111 RLIKKTDFHLGQHVNTF----FKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPE 166
RL ++HLG+ VN F +R S + P + +++G +G LP
Sbjct: 930 RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIP------TVIFGTINGVIGVIASLPH 983
Query: 167 KNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLG 226
+ Y L LQ+ + GGL+ +R++ + +R +DG L+ FL L+
Sbjct: 984 EQYVFLEKLQSNLRKVIKGVGGLSHEQWRSFNNEKKTV--EARNFLDGDLIESFLDLNRS 1041
Query: 227 ERLEICKKI 235
+ EI K +
Sbjct: 1042 KMDEISKAL 1050
>gi|356512636|ref|XP_003525024.1| PREDICTED: DNA damage-binding protein 1a-like isoform 1 [Glycine max]
Length = 1089
Score = 43.1 bits (100), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 33/129 (25%), Positives = 57/129 (44%), Gaps = 12/129 (9%)
Query: 111 RLIKKTDFHLGQHVNTF----FKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPE 166
RL ++HLG+ VN F +R S + P + +++G +G LP
Sbjct: 951 RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIP------TVIFGTINGVIGVIASLPH 1004
Query: 167 KNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLG 226
+ Y L LQ+ + GGL+ +R++ + +R +DG L+ FL L+
Sbjct: 1005 EQYVFLEKLQSNLRKVIKGVGGLSHEQWRSFNNEKKTV--EARNFLDGDLIESFLDLNRS 1062
Query: 227 ERLEICKKI 235
+ EI K +
Sbjct: 1063 KMDEISKAL 1071
>gi|413081953|ref|NP_741992.2| DNA damage-binding protein 1 [Rattus norvegicus]
gi|293344614|ref|XP_002725831.1| PREDICTED: DNA damage-binding protein 1 [Rattus norvegicus]
gi|293356422|ref|XP_002728912.1| PREDICTED: DNA damage-binding protein 1 [Rattus norvegicus]
gi|149062405|gb|EDM12828.1| damage-specific DNA binding protein 1 [Rattus norvegicus]
Length = 1140
Score = 43.1 bits (100), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 41/165 (24%), Positives = 69/165 (41%), Gaps = 17/165 (10%)
Query: 96 LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
LF+ Q ++ + R L + FHLG+ VN F C S + G S +
Sbjct: 974 LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGETSTPTQGSVL 1029
Query: 151 YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
+ +++G +G L E Y LL +QN + G + +R++ + P+ G
Sbjct: 1030 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATG 1087
Query: 211 IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
IDG L+ FL +S + E+ + G K D+L +
Sbjct: 1088 FIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKV 1132
>gi|344295432|ref|XP_003419416.1| PREDICTED: DNA damage-binding protein 1 [Loxodonta africana]
Length = 1140
Score = 43.1 bits (100), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 41/165 (24%), Positives = 69/165 (41%), Gaps = 17/165 (10%)
Query: 96 LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
LF+ Q ++ + R L + FHLG+ VN F C S + G S +
Sbjct: 974 LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGETSTPTQGSVL 1029
Query: 151 YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
+ +++G +G L E Y LL +QN + G + +R++ + P+ G
Sbjct: 1030 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATG 1087
Query: 211 IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
IDG L+ FL +S + E+ + G K D+L +
Sbjct: 1088 FIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKV 1132
>gi|354504619|ref|XP_003514371.1| PREDICTED: DNA damage-binding protein 1-like [Cricetulus griseus]
gi|344258340|gb|EGW14444.1| DNA damage-binding protein 1 [Cricetulus griseus]
Length = 1140
Score = 43.1 bits (100), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 41/165 (24%), Positives = 69/165 (41%), Gaps = 17/165 (10%)
Query: 96 LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
LF+ Q ++ + R L + FHLG+ VN F C S + G S +
Sbjct: 974 LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGETSTPTQGSVL 1029
Query: 151 YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
+ +++G +G L E Y LL +QN + G + +R++ + P+ G
Sbjct: 1030 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATG 1087
Query: 211 IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
IDG L+ FL +S + E+ + G K D+L +
Sbjct: 1088 FIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKV 1132
>gi|73983859|ref|XP_533275.2| PREDICTED: DNA damage-binding protein 1 [Canis lupus familiaris]
gi|291409601|ref|XP_002721069.1| PREDICTED: damage-specific DNA binding protein 1 [Oryctolagus
cuniculus]
gi|301781686|ref|XP_002926259.1| PREDICTED: DNA damage-binding protein 1-like [Ailuropoda melanoleuca]
Length = 1140
Score = 43.1 bits (100), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 41/165 (24%), Positives = 69/165 (41%), Gaps = 17/165 (10%)
Query: 96 LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
LF+ Q ++ + R L + FHLG+ VN F C S + G S +
Sbjct: 974 LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGETSTPTQGSVL 1029
Query: 151 YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
+ +++G +G L E Y LL +QN + G + +R++ + P+ G
Sbjct: 1030 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATG 1087
Query: 211 IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
IDG L+ FL +S + E+ + G K D+L +
Sbjct: 1088 FIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKV 1132
>gi|148529014|ref|NP_001914.3| DNA damage-binding protein 1 [Homo sapiens]
gi|296218432|ref|XP_002807395.1| PREDICTED: LOW QUALITY PROTEIN: DNA damage-binding protein 1
[Callithrix jacchus]
gi|397516558|ref|XP_003828491.1| PREDICTED: DNA damage-binding protein 1 [Pan paniscus]
gi|402893195|ref|XP_003909786.1| PREDICTED: DNA damage-binding protein 1 [Papio anubis]
gi|426368721|ref|XP_004051351.1| PREDICTED: DNA damage-binding protein 1 [Gorilla gorilla gorilla]
gi|12643730|sp|Q16531.1|DDB1_HUMAN RecName: Full=DNA damage-binding protein 1; AltName: Full=DDB p127
subunit; AltName: Full=DNA damage-binding protein a;
Short=DDBa; AltName: Full=Damage-specific DNA-binding
protein 1; AltName: Full=HBV X-associated protein 1;
Short=XAP-1; AltName: Full=UV-damaged DNA-binding factor;
AltName: Full=UV-damaged DNA-binding protein 1;
Short=UV-DDB 1; AltName: Full=XPE-binding factor;
Short=XPE-BF; AltName: Full=Xeroderma pigmentosum group
E-complementing protein; Short=XPCe
gi|203282525|pdb|3E0C|A Chain A, Crystal Structure Of Dna Damage-Binding Protein 1(Ddb1)
gi|695362|gb|AAA62838.1| X-associated protein 1, partial [Homo sapiens]
gi|1052865|gb|AAC50349.1| DDBa p127 [Homo sapiens]
gi|15079750|gb|AAH11686.1| Damage-specific DNA binding protein 1, 127kDa [Homo sapiens]
gi|29792243|gb|AAH50530.1| Damage-specific DNA binding protein 1, 127kDa [Homo sapiens]
gi|30354567|gb|AAH51764.1| Damage-specific DNA binding protein 1, 127kDa [Homo sapiens]
gi|61354161|gb|AAX44048.1| damage-specific DNA binding protein 1, 127kDa [Homo sapiens]
gi|119594341|gb|EAW73935.1| damage-specific DNA binding protein 1, 127kDa, isoform CRA_c [Homo
sapiens]
gi|168275638|dbj|BAG10539.1| DNA damage-binding protein 1 [synthetic construct]
gi|189065506|dbj|BAG35345.1| unnamed protein product [Homo sapiens]
gi|355566436|gb|EHH22815.1| Damage-specific DNA-binding protein 1 [Macaca mulatta]
gi|380784123|gb|AFE63937.1| DNA damage-binding protein 1 [Macaca mulatta]
gi|380808126|gb|AFE75938.1| DNA damage-binding protein 1 [Macaca mulatta]
gi|380810144|gb|AFE76947.1| DNA damage-binding protein 1 [Macaca mulatta]
gi|383408123|gb|AFH27275.1| DNA damage-binding protein 1 [Macaca mulatta]
gi|410305600|gb|JAA31400.1| damage-specific DNA binding protein 1, 127kDa [Pan troglodytes]
gi|410352015|gb|JAA42611.1| damage-specific DNA binding protein 1, 127kDa [Pan troglodytes]
Length = 1140
Score = 43.1 bits (100), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 41/165 (24%), Positives = 69/165 (41%), Gaps = 17/165 (10%)
Query: 96 LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
LF+ Q ++ + R L + FHLG+ VN F C S + G S +
Sbjct: 974 LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGETSTPTQGSVL 1029
Query: 151 YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
+ +++G +G L E Y LL +QN + G + +R++ + P+ G
Sbjct: 1030 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATG 1087
Query: 211 IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
IDG L+ FL +S + E+ + G K D+L +
Sbjct: 1088 FIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKV 1132
>gi|400260815|pdb|4E54|A Chain A, Damaged Dna Induced Uv-Damaged Dna-Binding Protein (Uv-Ddb)
Dimerization And Its Roles In Chromatinized Dna Repair
gi|401871507|pdb|4E5Z|A Chain A, Damaged Dna Induced Uv-Damaged Dna-Binding Protein (Uv-Ddb)
Dimerization And Its Roles In Chromatinized Dna Repair
Length = 1150
Score = 43.1 bits (100), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 41/165 (24%), Positives = 69/165 (41%), Gaps = 17/165 (10%)
Query: 96 LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
LF+ Q ++ + R L + FHLG+ VN F C S + G S +
Sbjct: 984 LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGETSTPTQGSVL 1039
Query: 151 YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
+ +++G +G L E Y LL +QN + G + +R++ + P+ G
Sbjct: 1040 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATG 1097
Query: 211 IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
IDG L+ FL +S + E+ + G K D+L +
Sbjct: 1098 FIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKV 1142
>gi|395852550|ref|XP_003798801.1| PREDICTED: DNA damage-binding protein 1 [Otolemur garnettii]
Length = 1140
Score = 43.1 bits (100), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 41/165 (24%), Positives = 69/165 (41%), Gaps = 17/165 (10%)
Query: 96 LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
LF+ Q ++ + R L + FHLG+ VN F C S + G S +
Sbjct: 974 LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGETSTPTQGSVL 1029
Query: 151 YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
+ +++G +G L E Y LL +QN + G + +R++ + P+ G
Sbjct: 1030 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATG 1087
Query: 211 IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
IDG L+ FL +S + E+ + G K D+L +
Sbjct: 1088 FIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKV 1132
>gi|359546285|pdb|4A11|A Chain A, Structure Of The Hsddb1-Hscsa Complex
gi|361132519|pdb|4A0K|C Chain C, Structure Of Ddb1-Ddb2-Cul4a-Rbx1 Bound To A 12 Bp Abasic
Site Containing Dna-Duplex
Length = 1159
Score = 43.1 bits (100), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 41/165 (24%), Positives = 69/165 (41%), Gaps = 17/165 (10%)
Query: 96 LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
LF+ Q ++ + R L + FHLG+ VN F C S + G S +
Sbjct: 993 LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGETSTPTQGSVL 1048
Query: 151 YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
+ +++G +G L E Y LL +QN + G + +R++ + P+ G
Sbjct: 1049 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATG 1106
Query: 211 IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
IDG L+ FL +S + E+ + G K D+L +
Sbjct: 1107 FIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKV 1151
>gi|358440058|pdb|4A08|A Chain A, Structure Of Hsddb1-Drddb2 Bound To A 13 Bp Cpd-Duplex (
Purine At D-1 Position) At 3.0 A Resolution (Cpd 1)
gi|358440062|pdb|4A09|A Chain A, Structure Of Hsddb1-Drddb2 Bound To A 15 Bp Cpd-Duplex
(Purine At D-1 Position) At 3.1 A Resolution (Cpd 2)
Length = 1159
Score = 43.1 bits (100), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 41/165 (24%), Positives = 69/165 (41%), Gaps = 17/165 (10%)
Query: 96 LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
LF+ Q ++ + R L + FHLG+ VN F C S + G S +
Sbjct: 993 LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGETSTPTQGSVL 1048
Query: 151 YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
+ +++G +G L E Y LL +QN + G + +R++ + P+ G
Sbjct: 1049 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATG 1106
Query: 211 IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
IDG L+ FL +S + E+ + G K D+L +
Sbjct: 1107 FIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKV 1151
>gi|311247551|ref|XP_003122699.1| PREDICTED: DNA damage-binding protein 1-like isoform 1 [Sus scrofa]
Length = 1140
Score = 43.1 bits (100), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 41/165 (24%), Positives = 69/165 (41%), Gaps = 17/165 (10%)
Query: 96 LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
LF+ Q ++ + R L + FHLG+ VN F C S + G S +
Sbjct: 974 LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGETSTPTQGSVL 1029
Query: 151 YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
+ +++G +G L E Y LL +QN + G + +R++ + P+ G
Sbjct: 1030 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATG 1087
Query: 211 IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
IDG L+ FL +S + E+ + G K D+L +
Sbjct: 1088 FIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKV 1132
>gi|2632123|emb|CAA05770.1| Xeroderma Pigmentosum Group E Complementing protein [Homo sapiens]
Length = 1140
Score = 43.1 bits (100), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 41/165 (24%), Positives = 69/165 (41%), Gaps = 17/165 (10%)
Query: 96 LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
LF+ Q ++ + R L + FHLG+ VN F C S + G S +
Sbjct: 974 LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGETSTPTQGSVL 1029
Query: 151 YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
+ +++G +G L E Y LL +QN + G + +R++ + P+ G
Sbjct: 1030 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATG 1087
Query: 211 IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
IDG L+ FL +S + E+ + G K D+L +
Sbjct: 1088 FIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKV 1132
>gi|74138855|dbj|BAE27231.1| unnamed protein product [Mus musculus]
Length = 1140
Score = 43.1 bits (100), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 41/165 (24%), Positives = 69/165 (41%), Gaps = 17/165 (10%)
Query: 96 LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
LF+ Q ++ + R L + FHLG+ VN F C S + G S +
Sbjct: 974 LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGEASTPTQGSVL 1029
Query: 151 YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
+ +++G +G L E Y LL +QN + G + +R++ + P+ G
Sbjct: 1030 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATG 1087
Query: 211 IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
IDG L+ FL +S + E+ + G K D+L +
Sbjct: 1088 FIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKV 1132
>gi|358440070|pdb|4A0B|A Chain A, Structure Of Hsddb1-Drddb2 Bound To A 16 Bp Cpd-Duplex (
Pyrimidine At D-1 Position) At 3.8 A Resolution (Cpd 4)
gi|358440072|pdb|4A0B|C Chain C, Structure Of Hsddb1-Drddb2 Bound To A 16 Bp Cpd-Duplex (
Pyrimidine At D-1 Position) At 3.8 A Resolution (Cpd 4)
Length = 1159
Score = 43.1 bits (100), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 41/165 (24%), Positives = 69/165 (41%), Gaps = 17/165 (10%)
Query: 96 LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
LF+ Q ++ + R L + FHLG+ VN F C S + G S +
Sbjct: 993 LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGETSTPTQGSVL 1048
Query: 151 YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
+ +++G +G L E Y LL +QN + G + +R++ + P+ G
Sbjct: 1049 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATG 1106
Query: 211 IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
IDG L+ FL +S + E+ + G K D+L +
Sbjct: 1107 FIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKV 1151
>gi|384941436|gb|AFI34323.1| DNA damage-binding protein 1 [Macaca mulatta]
Length = 1140
Score = 43.1 bits (100), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 41/165 (24%), Positives = 69/165 (41%), Gaps = 17/165 (10%)
Query: 96 LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
LF+ Q ++ + R L + FHLG+ VN F C S + G S +
Sbjct: 974 LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGETSTPTQGSVL 1029
Query: 151 YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
+ +++G +G L E Y LL +QN + G + +R++ + P+ G
Sbjct: 1030 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATG 1087
Query: 211 IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
IDG L+ FL +S + E+ + G K D+L +
Sbjct: 1088 FIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKV 1132
>gi|358440066|pdb|4A0A|A Chain A, Structure Of Hsddb1-Drddb2 Bound To A 16 Bp Cpd-Duplex (
Pyrimidine At D-1 Position) At 3.6 A Resolution (Cpd 3)
Length = 1159
Score = 43.1 bits (100), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 41/165 (24%), Positives = 69/165 (41%), Gaps = 17/165 (10%)
Query: 96 LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
LF+ Q ++ + R L + FHLG+ VN F C S + G S +
Sbjct: 993 LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGETSTPTQGSVL 1048
Query: 151 YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
+ +++G +G L E Y LL +QN + G + +R++ + P+ G
Sbjct: 1049 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATG 1106
Query: 211 IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
IDG L+ FL +S + E+ + G K D+L +
Sbjct: 1107 FIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKV 1151
>gi|453081643|gb|EMF09692.1| DNA damage-binding protein 1 [Mycosphaerella populorum SO2202]
Length = 1151
Score = 43.1 bits (100), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 37/154 (24%), Positives = 69/154 (44%), Gaps = 6/154 (3%)
Query: 85 FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGAR 144
++ SD + N+V+ + P + RL ++ LG+ VN I +P ++A
Sbjct: 983 YIQSDAEGNLVVLEHDPTGFSAEDRRRLRVTSEMCLGEMVNRIRPITTVITPSANAVVIP 1042
Query: 145 SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYA 204
F+ A+++G++ F + ++ L+ LQ M G + FR +K +
Sbjct: 1043 KAFI---ATVEGSVYVFGTIAQQYQDLLIRLQGSMAEMVKSPGFVRFNRFRGFKTQVRDM 1099
Query: 205 GN--PSRGIIDGSLVWKFLQLSLGERLEICKKIG 236
G P R +DG ++ FL LS + + K +G
Sbjct: 1100 GEEGPVR-FVDGEIIEGFLGLSAEVQESVAKDLG 1132
>gi|194381178|dbj|BAG64157.1| unnamed protein product [Homo sapiens]
Length = 826
Score = 43.1 bits (100), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 41/165 (24%), Positives = 69/165 (41%), Gaps = 17/165 (10%)
Query: 96 LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
LF+ Q ++ + R L + FHLG+ VN F C S + G S +
Sbjct: 660 LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGETSTPTQGSVL 715
Query: 151 YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
+ +++G +G L E Y LL +QN + G + +R++ + P+ G
Sbjct: 716 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATG 773
Query: 211 IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
IDG L+ FL +S + E+ + G K D+L +
Sbjct: 774 FIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKV 818
>gi|195329354|ref|XP_002031376.1| GM24084 [Drosophila sechellia]
gi|194120319|gb|EDW42362.1| GM24084 [Drosophila sechellia]
Length = 1140
Score = 43.1 bits (100), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 41/162 (25%), Positives = 71/162 (43%), Gaps = 16/162 (9%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVN 125
P+ + A E + + + +G + N LF+ Q ++ + R L + FHLG VN
Sbjct: 952 PKWMRAVEILDDDTFLG-----SETNGNLFVCQKDSAATTDEERQLLPELARFHLGDTVN 1006
Query: 126 TFFKIRCKPSPISDAPGARSRFL---TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
F S + G R+ + Y + +GA+G +P+ Y L L+ +
Sbjct: 1007 VFRH----GSLVMQNVGERTTPINGCVLYGTCNGAIGIVTQIPQDFYDFLHGLEERLKKI 1062
Query: 183 TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLS 224
G + + +R ++ + PS+G IDG L+ FL LS
Sbjct: 1063 IKLVGKIGHKFYRNFRI--HTQVEPSQGFIDGDLIESFLDLS 1102
>gi|7657011|ref|NP_056550.1| DNA damage-binding protein 1 [Mus musculus]
gi|134034087|sp|Q3U1J4.2|DDB1_MOUSE RecName: Full=DNA damage-binding protein 1; AltName: Full=DDB p127
subunit; AltName: Full=Damage-specific DNA-binding
protein 1; AltName: Full=UV-damaged DNA-binding factor
gi|5931596|dbj|BAA84699.1| XPE UV-damaged DNA binding factor [Mus musculus]
gi|16307148|gb|AAH09661.1| Damage specific DNA binding protein 1 [Mus musculus]
gi|74182145|dbj|BAE34102.1| unnamed protein product [Mus musculus]
gi|74196166|dbj|BAE32993.1| unnamed protein product [Mus musculus]
Length = 1140
Score = 43.1 bits (100), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 41/165 (24%), Positives = 69/165 (41%), Gaps = 17/165 (10%)
Query: 96 LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
LF+ Q ++ + R L + FHLG+ VN F C S + G S +
Sbjct: 974 LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGEASTPTQGSVL 1029
Query: 151 YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
+ +++G +G L E Y LL +QN + G + +R++ + P+ G
Sbjct: 1030 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATG 1087
Query: 211 IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
IDG L+ FL +S + E+ + G K D+L +
Sbjct: 1088 FIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKV 1132
>gi|403255013|ref|XP_003920244.1| PREDICTED: DNA damage-binding protein 1 [Saimiri boliviensis
boliviensis]
Length = 1140
Score = 43.1 bits (100), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 41/165 (24%), Positives = 69/165 (41%), Gaps = 17/165 (10%)
Query: 96 LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
LF+ Q ++ + R L + FHLG+ VN F C S + G S +
Sbjct: 974 LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGETSTPTQGSVL 1029
Query: 151 YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
+ +++G +G L E Y LL +QN + G + +R++ + P+ G
Sbjct: 1030 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATG 1087
Query: 211 IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
IDG L+ FL +S + E+ + G K D+L +
Sbjct: 1088 FIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKV 1132
>gi|119594340|gb|EAW73934.1| damage-specific DNA binding protein 1, 127kDa, isoform CRA_b [Homo
sapiens]
Length = 923
Score = 43.1 bits (100), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 41/165 (24%), Positives = 69/165 (41%), Gaps = 17/165 (10%)
Query: 96 LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
LF+ Q ++ + R L + FHLG+ VN F C S + G S +
Sbjct: 757 LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGETSTPTQGSVL 812
Query: 151 YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
+ +++G +G L E Y LL +QN + G + +R++ + P+ G
Sbjct: 813 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATG 870
Query: 211 IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
IDG L+ FL +S + E+ + G K D+L +
Sbjct: 871 FIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKV 915
>gi|270346571|pdb|3I7H|A Chain A, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
Hbx
gi|270346573|pdb|3I7K|A Chain A, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
Whx
gi|270346575|pdb|3I7L|A Chain A, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
Ddb2
gi|270346577|pdb|3I7N|A Chain A, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
Wdtc1
gi|270346579|pdb|3I7O|A Chain A, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
Iqwd1
gi|270346581|pdb|3I7P|A Chain A, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
Wdr40a
gi|270346583|pdb|3I89|A Chain A, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
Wdr22
gi|270346585|pdb|3I8C|A Chain A, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
Wdr21a
gi|270346587|pdb|3I8E|A Chain A, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
Wdr42a
gi|270346588|pdb|3I8E|B Chain B, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
Wdr42a
Length = 1143
Score = 42.7 bits (99), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 41/165 (24%), Positives = 69/165 (41%), Gaps = 17/165 (10%)
Query: 96 LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
LF+ Q ++ + R L + FHLG+ VN F C S + G S +
Sbjct: 977 LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGETSTPTQGSVL 1032
Query: 151 YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
+ +++G +G L E Y LL +QN + G + +R++ + P+ G
Sbjct: 1033 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATG 1090
Query: 211 IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
IDG L+ FL +S + E+ + G K D+L +
Sbjct: 1091 FIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKV 1135
>gi|221046711|pdb|3EI1|A Chain A, Structure Of Hsddb1-Drddb2 Bound To A 14 Bp 6-4 Photoproduct
Containing Dna-Duplex
gi|221046715|pdb|3EI2|A Chain A, Structure Of Hsddb1-Drddb2 Bound To A 16 Bp Abasic Site
Containing Dna-Duplex
gi|221046719|pdb|3EI3|A Chain A, Structure Of The Hsddb1-Drddb2 Complex
Length = 1158
Score = 42.7 bits (99), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 41/165 (24%), Positives = 69/165 (41%), Gaps = 17/165 (10%)
Query: 96 LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
LF+ Q ++ + R L + FHLG+ VN F C S + G S +
Sbjct: 992 LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGETSTPTQGSVL 1047
Query: 151 YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
+ +++G +G L E Y LL +QN + G + +R++ + P+ G
Sbjct: 1048 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATG 1105
Query: 211 IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
IDG L+ FL +S + E+ + G K D+L +
Sbjct: 1106 FIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKV 1150
>gi|122692537|ref|NP_001073731.1| DNA damage-binding protein 1 [Bos taurus]
gi|426251842|ref|XP_004019630.1| PREDICTED: DNA damage-binding protein 1 [Ovis aries]
gi|134034086|sp|A1A4K3.1|DDB1_BOVIN RecName: Full=DNA damage-binding protein 1; AltName:
Full=Damage-specific DNA-binding protein 1
gi|119223918|gb|AAI26630.1| Damage-specific DNA binding protein 1, 127kDa [Bos taurus]
gi|296471644|tpg|DAA13759.1| TPA: DNA damage-binding protein 1 [Bos taurus]
Length = 1140
Score = 42.7 bits (99), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 41/165 (24%), Positives = 69/165 (41%), Gaps = 17/165 (10%)
Query: 96 LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
LF+ Q ++ + R L + FHLG+ VN F C S + G S +
Sbjct: 974 LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGETSTPTQGSVL 1029
Query: 151 YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
+ +++G +G L E Y LL +QN + G + +R++ + P+ G
Sbjct: 1030 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATG 1087
Query: 211 IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
IDG L+ FL +S + E+ + G K D+L +
Sbjct: 1088 FIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKV 1132
>gi|197097564|ref|NP_001126613.1| DNA damage-binding protein 1 [Pongo abelii]
gi|75041202|sp|Q5R649.1|DDB1_PONAB RecName: Full=DNA damage-binding protein 1; AltName:
Full=Damage-specific DNA-binding protein 1
gi|55732122|emb|CAH92767.1| hypothetical protein [Pongo abelii]
Length = 1140
Score = 42.7 bits (99), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 41/165 (24%), Positives = 69/165 (41%), Gaps = 17/165 (10%)
Query: 96 LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
LF+ Q ++ + R L + FHLG+ VN F C S + G S +
Sbjct: 974 LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGETSTPTQGSVL 1029
Query: 151 YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
+ +++G +G L E Y LL +QN + G + +R++ + P+ G
Sbjct: 1030 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATG 1087
Query: 211 IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
IDG L+ FL +S + E+ + G K D+L +
Sbjct: 1088 FIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKV 1132
>gi|441604084|ref|XP_004087862.1| PREDICTED: LOW QUALITY PROTEIN: DNA damage-binding protein 1
[Nomascus leucogenys]
Length = 1140
Score = 42.7 bits (99), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 41/165 (24%), Positives = 69/165 (41%), Gaps = 17/165 (10%)
Query: 96 LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
LF+ Q ++ + R L + FHLG+ VN F C S + G S +
Sbjct: 974 LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGETSTPTQGSVL 1029
Query: 151 YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
+ +++G +G L E Y LL +QN + G + +R++ + P+ G
Sbjct: 1030 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATG 1087
Query: 211 IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
IDG L+ FL +S + E+ + G K D+L +
Sbjct: 1088 FIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKV 1132
>gi|221046721|pdb|3EI4|A Chain A, Structure Of The Hsddb1-Hsddb2 Complex
gi|221046723|pdb|3EI4|C Chain C, Structure Of The Hsddb1-Hsddb2 Complex
gi|221046725|pdb|3EI4|E Chain E, Structure Of The Hsddb1-Hsddb2 Complex
Length = 1158
Score = 42.7 bits (99), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 41/165 (24%), Positives = 69/165 (41%), Gaps = 17/165 (10%)
Query: 96 LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
LF+ Q ++ + R L + FHLG+ VN F C S + G S +
Sbjct: 992 LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGETSTPTQGSVL 1047
Query: 151 YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
+ +++G +G L E Y LL +QN + G + +R++ + P+ G
Sbjct: 1048 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATG 1105
Query: 211 IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
IDG L+ FL +S + E+ + G K D+L +
Sbjct: 1106 FIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKV 1150
>gi|74178494|dbj|BAE32502.1| unnamed protein product [Mus musculus]
Length = 1140
Score = 42.7 bits (99), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 41/165 (24%), Positives = 69/165 (41%), Gaps = 17/165 (10%)
Query: 96 LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
LF+ Q ++ + R L + FHLG+ VN F C S + G S +
Sbjct: 974 LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGEASTPTQGSVL 1029
Query: 151 YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
+ +++G +G L E Y LL +QN + G + +R++ + P+ G
Sbjct: 1030 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATG 1087
Query: 211 IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
IDG L+ FL +S + E+ + G K D+L +
Sbjct: 1088 FIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKV 1132
>gi|21357503|ref|NP_650257.1| piccolo [Drosophila melanogaster]
gi|74872881|sp|Q9XYZ5.1|DDB1_DROME RecName: Full=DNA damage-binding protein 1; Short=D-DDB1; AltName:
Full=Damage-specific DNA-binding protein 1; AltName:
Full=Protein piccolo
gi|4928452|gb|AAD33592.1|AF132145_1 damage-specific DNA binding protein DDBa p127 subunit [Drosophila
melanogaster]
gi|7299719|gb|AAF54901.1| piccolo [Drosophila melanogaster]
gi|220942640|gb|ACL83863.1| DDB1-PA [synthetic construct]
Length = 1140
Score = 42.7 bits (99), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 41/162 (25%), Positives = 68/162 (41%), Gaps = 16/162 (9%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVN 125
P+ + A E + + + +G + N LF+ Q ++ + R L + FHLG VN
Sbjct: 952 PKWMRAVEILDDDTFLG-----SETNGNLFVCQKDSAATTDEERQLLPELARFHLGDTVN 1006
Query: 126 TFFKIRCKPSPISDAPGARSRFL---TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
F S + G R+ + Y + +GA+G +P+ Y L L+ +
Sbjct: 1007 VFRH----GSLVMQNVGERTTPINGCVLYGTCNGAIGIVTQIPQDFYDFLHGLEERLKKI 1062
Query: 183 TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLS 224
G + +R ++ PS G IDG L+ FL LS
Sbjct: 1063 IKSVGKIEHTYYRNFQINSKV--EPSEGFIDGDLIESFLDLS 1102
>gi|90108797|pdb|2B5L|A Chain A, Crystal Structure Of Ddb1 In Complex With Simian Virus 5 V
Protein
gi|90108798|pdb|2B5L|B Chain B, Crystal Structure Of Ddb1 In Complex With Simian Virus 5 V
Protein
gi|90108801|pdb|2B5M|A Chain A, Crystal Structure Of Ddb1
gi|116667897|pdb|2HYE|A Chain A, Crystal Structure Of The Ddb1-cul4a-rbx1-sv5v Complex
gi|1136228|gb|AAA88883.1| UV-damaged DNA binding factor [Homo sapiens]
gi|1588524|prf||2208446A xeroderma pigmentosum group E-binding factor
Length = 1140
Score = 42.7 bits (99), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 41/165 (24%), Positives = 69/165 (41%), Gaps = 17/165 (10%)
Query: 96 LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
LF+ Q ++ + R L + FHLG+ VN F C S + G S +
Sbjct: 974 LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGETSTPTQGSVL 1029
Query: 151 YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
+ +++G +G L E Y LL +QN + G + +R++ + P+ G
Sbjct: 1030 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATG 1087
Query: 211 IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
IDG L+ FL +S + E+ + G K D+L +
Sbjct: 1088 FIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKV 1132
>gi|361132523|pdb|4A0L|A Chain A, Structure Of Ddb1-Ddb2-Cul4b-Rbx1 Bound To A 12 Bp Abasic
Site Containing Dna-Duplex
gi|361132525|pdb|4A0L|C Chain C, Structure Of Ddb1-Ddb2-Cul4b-Rbx1 Bound To A 12 Bp Abasic
Site Containing Dna-Duplex
Length = 1144
Score = 42.7 bits (99), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 41/165 (24%), Positives = 69/165 (41%), Gaps = 17/165 (10%)
Query: 96 LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
LF+ Q ++ + R L + FHLG+ VN F C S + G S +
Sbjct: 978 LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGETSTPTQGSVL 1033
Query: 151 YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
+ +++G +G L E Y LL +QN + G + +R++ + P+ G
Sbjct: 1034 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATG 1091
Query: 211 IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
IDG L+ FL +S + E+ + G K D+L +
Sbjct: 1092 FIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKV 1136
>gi|322706594|gb|EFY98174.1| DNA damage-binding protein 1 [Metarhizium anisopliae ARSEF 23]
Length = 1121
Score = 42.7 bits (99), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 33/172 (19%), Positives = 69/172 (40%), Gaps = 15/172 (8%)
Query: 85 FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC--------KPSP 136
++ +D N+++ PEA +L ++ ++G+ +N K+ P
Sbjct: 940 WLEADAQGNIIVLQRNPEAPTEQDRSKLEVTSEMNIGEQINQIRKLHVASNENAVVSPKA 999
Query: 137 ISDAPGARSRFLTWYASL------DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 190
+ G +T + L +G L F + LL Q+ + + G ++
Sbjct: 1000 FLGSVGLSETIITCWNQLLMLVQIEGTLYLFGEIAPNYQDLLLTFQSRLQDYIYAPGNVS 1059
Query: 191 PRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
+R ++ K P R +DG +V +FL L ++ +C+ +G D+
Sbjct: 1060 FNLWRAFRNKAREGDGPFR-FVDGEMVERFLDLDEAKQELVCEGLGPSVEDM 1110
>gi|355683071|gb|AER97036.1| damage-specific DNA binding protein 1, 127kDa [Mustela putorius furo]
Length = 1122
Score = 42.7 bits (99), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 36/134 (26%), Positives = 59/134 (44%), Gaps = 11/134 (8%)
Query: 96 LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
LF+ Q ++ + R L + FHLG+ VN F C S + G S +
Sbjct: 974 LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGETSTPTQGSVL 1029
Query: 151 YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
+ +++G +G L E Y LL +QN + G + +R++ + P+ G
Sbjct: 1030 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATG 1087
Query: 211 IIDGSLVWKFLQLS 224
IDG L+ FL +S
Sbjct: 1088 FIDGDLIESFLDIS 1101
>gi|170589357|ref|XP_001899440.1| CPSF A subunit region family protein [Brugia malayi]
gi|158593653|gb|EDP32248.1| CPSF A subunit region family protein [Brugia malayi]
Length = 655
Score = 42.7 bits (99), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 36/151 (23%), Positives = 62/151 (41%), Gaps = 7/151 (4%)
Query: 73 ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC 132
ACE + S F+ ++ N+ M G RL + F+LG+ VN F
Sbjct: 465 ACEII---DSDNFLGAENSYNLFTVMKDSFTVFKEEGTRLQELGLFYLGEMVNVFCHGSL 521
Query: 133 KPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPR 192
+ + AP S L Y + DG +G + +P Y L +Q + + + ++
Sbjct: 522 TATQVDVAPLYHSSIL--YGTSDGGIGVIVQMPPVLYTFLQDVQKRLAEYAENCMRISHT 579
Query: 193 AFRTYKGKGYYAGNPSRGIIDGSLVWKFLQL 223
+RT++ + G IDG L+ L +
Sbjct: 580 QYRTFETEK--RSEAPNGFIDGDLIESLLDM 608
>gi|449435512|ref|XP_004135539.1| PREDICTED: DNA damage-binding protein 1-like [Cucumis sativus]
Length = 1093
Score = 42.7 bits (99), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 33/129 (25%), Positives = 57/129 (44%), Gaps = 12/129 (9%)
Query: 111 RLIKKTDFHLGQHVNTF----FKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPE 166
RL ++HLG+ VN F +R S + P + S++G +G LP
Sbjct: 955 RLEVVGEYHLGEFVNRFQHGSLVMRLPDSDVGQIP------TVIFGSVNGVIGVIASLPH 1008
Query: 167 KNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLG 226
Y L LQ+ + GGL+ +R++ + A ++ +DG L+ FL L+
Sbjct: 1009 DQYVFLERLQSNLRKVIKGVGGLSHEQWRSFNNEKRTA--EAKNFLDGDLIESFLDLNRS 1066
Query: 227 ERLEICKKI 235
+ EI + +
Sbjct: 1067 KMEEISRAM 1075
>gi|328770638|gb|EGF80679.1| hypothetical protein BATDEDRAFT_11194 [Batrachochytrium dendrobatidis
JAM81]
Length = 1098
Score = 42.7 bits (99), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 38/152 (25%), Positives = 62/152 (40%), Gaps = 6/152 (3%)
Query: 73 ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC 132
A E + + + +G +D N+ Q + RL K FH G+ +N F K
Sbjct: 914 AAELIDDDTFLG---ADSSMNIFALSKQGDQVSEEERQRLRPKGWFHTGELINRFRKGSL 970
Query: 133 KPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLL-MLQNVMVTHTSHTGGLNP 191
+ + Y ++ GA+G +P ++L LQ + + GGL
Sbjct: 971 TLHATDETLALPAIPEILYCTVHGAIGVVARIPSDETAKILSTLQEALKSVVQGVGGLIH 1030
Query: 192 RAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQL 223
+R Y+ + S GIIDG L+ FL+L
Sbjct: 1031 SDWRRYRTE--RRSIKSAGIIDGDLIESFLEL 1060
>gi|74215029|dbj|BAE33503.1| unnamed protein product [Mus musculus]
Length = 1140
Score = 42.7 bits (99), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 36/141 (25%), Positives = 59/141 (41%), Gaps = 15/141 (10%)
Query: 118 FHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLM 174
FHLG+ VN F C S + G S + + +++G +G L E Y LL
Sbjct: 998 FHLGEFVNVF----CHGSLVMQNLGEASTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLD 1053
Query: 175 LQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKK 234
+QN + G + +R++ + P+ G IDG L+ FL +S + E+
Sbjct: 1054 MQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVAN 1111
Query: 235 I------GSKHNDILDELYDI 249
+ G K D+L +
Sbjct: 1112 LQYDDGSGMKREATADDLIKV 1132
>gi|74208347|dbj|BAE26370.1| unnamed protein product [Mus musculus]
Length = 599
Score = 42.7 bits (99), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 41/165 (24%), Positives = 69/165 (41%), Gaps = 17/165 (10%)
Query: 96 LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
LF+ Q ++ + R L + FHLG+ VN F C S + G S +
Sbjct: 433 LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGEASTPTQGSVL 488
Query: 151 YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
+ +++G +G L E Y LL +QN + G + +R++ + P+ G
Sbjct: 489 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATG 546
Query: 211 IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
IDG L+ FL +S + E+ + G K D+L +
Sbjct: 547 FIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKV 591
>gi|261335516|emb|CBH18510.1| cleavage and polyadenylation specificity factor-like protein,
putative [Trypanosoma brucei gambiense DAL972]
Length = 1452
Score = 42.7 bits (99), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 31/125 (24%), Positives = 52/125 (41%), Gaps = 3/125 (2%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGG-HRLIKKTDFHLGQHVNTFFK 129
V C+ + + G + SD ++NV++ Y P +E++ G H + ++ L
Sbjct: 1267 VVHCDMMYHDRAFGILCSDDERNVLIMGYTPRVQETDAGTHPKVLESVLSLDGEYRLPSG 1326
Query: 130 IRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGL 189
K G S +T Y S G +GF +P+ E+ R L + + GL
Sbjct: 1327 SLVKSLRFRSTAGNSS--VTLYVSNYGEIGFIVPIGEQANRTALWVTRRLQIDLPCEAGL 1384
Query: 190 NPRAF 194
PR F
Sbjct: 1385 TPRMF 1389
>gi|410974071|ref|XP_003993471.1| PREDICTED: DNA damage-binding protein 1 [Felis catus]
Length = 1193
Score = 42.7 bits (99), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 41/165 (24%), Positives = 69/165 (41%), Gaps = 17/165 (10%)
Query: 96 LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
LF+ Q ++ + R L + FHLG+ VN F C S + G S +
Sbjct: 1027 LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGETSTPTQGSVL 1082
Query: 151 YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
+ +++G +G L E Y LL +QN + G + +R++ + P+ G
Sbjct: 1083 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATG 1140
Query: 211 IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
IDG L+ FL +S + E+ + G K D+L +
Sbjct: 1141 FIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKV 1185
>gi|194389106|dbj|BAG61570.1| unnamed protein product [Homo sapiens]
Length = 1009
Score = 42.7 bits (99), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 36/141 (25%), Positives = 59/141 (41%), Gaps = 15/141 (10%)
Query: 118 FHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLM 174
FHLG+ VN F C S + G S + + +++G +G L E Y LL
Sbjct: 867 FHLGEFVNVF----CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLD 922
Query: 175 LQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKK 234
+QN + G + +R++ + P+ G IDG L+ FL +S + E+
Sbjct: 923 MQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVAN 980
Query: 235 I------GSKHNDILDELYDI 249
+ G K D+L +
Sbjct: 981 LQYDDGSGMKREATADDLIKV 1001
>gi|5353754|gb|AAD42230.1|AF159853_1 damage-specific DNA binding protein 1 [Mus musculus]
Length = 1140
Score = 42.7 bits (99), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 41/165 (24%), Positives = 69/165 (41%), Gaps = 17/165 (10%)
Query: 96 LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
LF+ Q ++ + R L + FHLG+ VN F C S + G S +
Sbjct: 974 LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGEASTPTQGSVL 1029
Query: 151 YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
+ +++G +G L E Y LL +QN + G + +R++ + P+ G
Sbjct: 1030 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATG 1087
Query: 211 IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
IDG L+ FL +S + E+ + G K D+L +
Sbjct: 1088 FIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKV 1132
>gi|347838030|emb|CCD52602.1| similar to DDB1B (Damaged DNA Binding protein 1 B); damaged DNA
binding / protein binding [Botryotinia fuckeliana]
Length = 1157
Score = 42.7 bits (99), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 41/184 (22%), Positives = 76/184 (41%), Gaps = 24/184 (13%)
Query: 77 VIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSP 136
V E + ++ +D D N++L E R+ + +LG+ VN +I + S
Sbjct: 969 VAEVDTDTYLETDHDGNLILLKRNREGVTREDKTRMEVTCEMNLGEMVNRVKRINVETS- 1027
Query: 137 ISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHT---------- 186
DA FL + +G++ F +P +N L+ LQ+ + + S +
Sbjct: 1028 -KDALLIPRAFL---GTTEGSIYLFSLIPPQNQDLLMRLQSRLASLPSASSIRGSSDSTS 1083
Query: 187 --------GGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 238
G L+ +R+Y P R +DG L+ +FL L + + + + +G K
Sbjct: 1084 PHQIELSPGNLDFNKYRSYISATRETSEPFR-FVDGELIERFLDLEVEVQEHVAEGLGVK 1142
Query: 239 HNDI 242
D+
Sbjct: 1143 AEDL 1146
>gi|402592185|gb|EJW86114.1| CPSF A subunit region family protein [Wuchereria bancrofti]
Length = 278
Score = 42.7 bits (99), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 36/151 (23%), Positives = 62/151 (41%), Gaps = 7/151 (4%)
Query: 73 ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC 132
ACE + S F+ ++ N+ M G RL + F+LG+ VN F
Sbjct: 110 ACEII---DSDNFLGAENSYNLFTVMKDSFTVFKEEGTRLQELGLFYLGEMVNVFCHGSL 166
Query: 133 KPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPR 192
+ + AP S L Y + DG +G + +P Y L +Q + + + ++
Sbjct: 167 TATQVDVAPLYHSSIL--YGTSDGGIGVIVQMPPVLYTFLQDVQKRLAEYAENCMRISHT 224
Query: 193 AFRTYKGKGYYAGNPSRGIIDGSLVWKFLQL 223
+RT++ + G IDG L+ L +
Sbjct: 225 QYRTFETEK--RSEAPNGFIDGDLIESLLDM 253
>gi|170057515|ref|XP_001864517.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167876915|gb|EDS40298.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 1138
Score = 42.7 bits (99), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 35/157 (22%), Positives = 65/157 (41%), Gaps = 6/157 (3%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P+ + A E + + + +G ++ N+ + + A + ++ + FHLG VN F
Sbjct: 950 PKWMTAVEILDDDAFLG---AENSNNLFVCLKDSAATTDDERQQMPEVAQFHLGDMVNVF 1006
Query: 128 FKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 187
I + S + + ++ GA+G +P Y L LQ + G
Sbjct: 1007 RHGSLVMQNIGERTTPTSGCV-LFGTVSGAIGLVTQIPPDYYEFLRKLQENLTNTIKSVG 1065
Query: 188 GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLS 224
++ +R++ + S G IDG LV FL L+
Sbjct: 1066 RIDHTYWRSFHTE--MKTENSEGFIDGDLVESFLDLT 1100
>gi|393905247|gb|EJD73911.1| CPSF A subunit region family protein [Loa loa]
Length = 1145
Score = 42.4 bits (98), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 38/151 (25%), Positives = 66/151 (43%), Gaps = 7/151 (4%)
Query: 73 ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC 132
ACE +I+S S F+ ++ N+ + G RL + F+LG+ VN F
Sbjct: 956 ACE-IIDSDS--FLGAENSYNLFTVVKDSFTVFKEEGTRLQELGLFYLGEMVNVFCHGSL 1012
Query: 133 KPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPR 192
+ + AP S L Y + DG +G + +P Y L +Q + +T + ++
Sbjct: 1013 TATQVDVAPLYHSSIL--YGTSDGGIGVIVQMPPVLYTFLHDVQKRLADYTENCMRISHT 1070
Query: 193 AFRTYKGKGYYAGNPSRGIIDGSLVWKFLQL 223
+RT++ + G IDG L+ L +
Sbjct: 1071 QYRTFETEK--RSEVPNGFIDGDLIESLLDM 1099
>gi|357519461|ref|XP_003630019.1| DNA damage-binding protein [Medicago truncatula]
gi|355524041|gb|AET04495.1| DNA damage-binding protein [Medicago truncatula]
Length = 1171
Score = 42.4 bits (98), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 32/129 (24%), Positives = 56/129 (43%), Gaps = 12/129 (9%)
Query: 111 RLIKKTDFHLGQHVNTF----FKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPE 166
RL ++HLG+ +N F +R S + P + +++G +G LP
Sbjct: 1033 RLEVAGEYHLGEFINRFRHGSLVMRLPDSDVGQIPTV------IFGTINGVIGVIASLPH 1086
Query: 167 KNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLG 226
+ Y L LQ+ + GGL+ +R++ + +R +DG L+ FL L
Sbjct: 1087 EQYVFLEKLQSNLRKVIKGVGGLSHEQWRSFNNEKKTV--EARNFLDGDLIESFLDLKRS 1144
Query: 227 ERLEICKKI 235
+ EI K +
Sbjct: 1145 KMDEISKAM 1153
>gi|395544366|ref|XP_003774082.1| PREDICTED: DNA damage-binding protein 1 [Sarcophilus harrisii]
Length = 1239
Score = 42.4 bits (98), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 41/165 (24%), Positives = 69/165 (41%), Gaps = 17/165 (10%)
Query: 96 LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
LF+ Q ++ + R L + FHLG+ VN F C S + G S +
Sbjct: 1073 LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGETSTPTQGSVL 1128
Query: 151 YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
+ +++G +G L E Y LL +QN + G + +R++ + P+ G
Sbjct: 1129 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATG 1186
Query: 211 IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
IDG L+ FL +S + E+ + G K D+L +
Sbjct: 1187 FIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKV 1231
>gi|195571247|ref|XP_002103615.1| GD18880 [Drosophila simulans]
gi|194199542|gb|EDX13118.1| GD18880 [Drosophila simulans]
Length = 1140
Score = 42.4 bits (98), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 41/162 (25%), Positives = 68/162 (41%), Gaps = 16/162 (9%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVN 125
P+ + A E + + + +G + N LF+ Q ++ + R L + FHLG VN
Sbjct: 952 PKWMRAVEILDDDTFLG-----SETNGNLFVCQKDSAATTDEERQLLPELARFHLGDTVN 1006
Query: 126 TFFKIRCKPSPISDAPGARSRFL---TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
F S + G R+ + Y + +GA+G +P+ Y L L+ +
Sbjct: 1007 VFRH----GSLVMQNVGERTTPINGCVLYGTCNGAIGIVTQIPQDFYDFLHGLEERLKKI 1062
Query: 183 TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLS 224
G + +R ++ PS G IDG L+ FL LS
Sbjct: 1063 IKSVGKIEHTYYRNFQINTKV--EPSEGFIDGDLIESFLDLS 1102
>gi|195145844|ref|XP_002013900.1| GL24391 [Drosophila persimilis]
gi|194102843|gb|EDW24886.1| GL24391 [Drosophila persimilis]
Length = 1140
Score = 42.4 bits (98), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 41/162 (25%), Positives = 68/162 (41%), Gaps = 16/162 (9%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVN 125
P+ + A E + + + +G + N LF+ Q ++ + R L + FHLG VN
Sbjct: 952 PKWMRAVEILDDDTFLG-----SETNGNLFVCQKDSAATTDEERQLLPELARFHLGDTVN 1006
Query: 126 TFFKIRCKPSPISDAPGARSRFL---TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
F S + G R+ + Y + +GA+G +P+ Y L L+ +
Sbjct: 1007 VFRH----GSLVMQNVGERTTPINGCVLYGTCNGAIGIVTQIPQDFYDFLHGLEERLKKI 1062
Query: 183 TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLS 224
G + +R ++ PS G IDG L+ FL LS
Sbjct: 1063 IKSVGKIEHTYYRNFQINTKV--EPSEGFIDGDLIESFLDLS 1102
>gi|125774475|ref|XP_001358496.1| GA20574 [Drosophila pseudoobscura pseudoobscura]
gi|54638233|gb|EAL27635.1| GA20574 [Drosophila pseudoobscura pseudoobscura]
Length = 1140
Score = 42.4 bits (98), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 41/162 (25%), Positives = 68/162 (41%), Gaps = 16/162 (9%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVN 125
P+ + A E + + + +G + N LF+ Q ++ + R L + FHLG VN
Sbjct: 952 PKWMRAVEILDDDTFLG-----SETNGNLFVCQKDSAATTDEERQLLPELARFHLGDTVN 1006
Query: 126 TFFKIRCKPSPISDAPGARSRFL---TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
F S + G R+ + Y + +GA+G +P+ Y L L+ +
Sbjct: 1007 VFRH----GSLVMQNVGERTTPINGCVLYGTCNGAIGIVTQIPQDFYDFLHGLEERLKKI 1062
Query: 183 TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLS 224
G + +R ++ PS G IDG L+ FL LS
Sbjct: 1063 IKSVGKIEHTYYRNFQINTKV--EPSEGFIDGDLIESFLDLS 1102
>gi|340059653|emb|CCC54046.1| putative mitochondrial carrier protein [Trypanosoma vivax Y486]
Length = 1481
Score = 42.4 bits (98), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 31/124 (25%), Positives = 49/124 (39%), Gaps = 1/124 (0%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 130
V C + S G + SD +NV++ Y P +E+ G R K + L
Sbjct: 1296 VVDCNMMYHDRSFGILCSDDQRNVLVMGYTPRVQEAGAG-RPAKTLESLLTLDGEYRLPS 1354
Query: 131 RCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 190
C + + + + Y S G +GF +P+ E+ R L + + T GL
Sbjct: 1355 GCLAKSLRFSSDFGNSSVMLYTSNYGEVGFIVPIGEQANRTALWVTRRLQTDVPCDAGLT 1414
Query: 191 PRAF 194
PR F
Sbjct: 1415 PRMF 1418
>gi|312076590|ref|XP_003140929.1| CPSF A subunit region family protein [Loa loa]
Length = 655
Score = 42.4 bits (98), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 38/151 (25%), Positives = 66/151 (43%), Gaps = 7/151 (4%)
Query: 73 ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC 132
ACE +I+S S F+ ++ N+ + G RL + F+LG+ VN F
Sbjct: 457 ACE-IIDSDS--FLGAENSYNLFTVVKDSFTVFKEEGTRLQELGLFYLGEMVNVFCHGSL 513
Query: 133 KPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPR 192
+ + AP S L Y + DG +G + +P Y L +Q + +T + ++
Sbjct: 514 TATQVDVAPLYHSSIL--YGTSDGGIGVIVQMPPVLYTFLHDVQKRLADYTENCMRISHT 571
Query: 193 AFRTYKGKGYYAGNPSRGIIDGSLVWKFLQL 223
+RT++ + G IDG L+ L +
Sbjct: 572 QYRTFETEK--RSEVPNGFIDGDLIESLLDM 600
>gi|449704103|gb|EMD44407.1| DNA-repair binding protein, putative [Entamoeba histolytica KU27]
Length = 1088
Score = 42.4 bits (98), Expect = 0.19, Method: Composition-based stats.
Identities = 45/183 (24%), Positives = 79/183 (43%), Gaps = 13/183 (7%)
Query: 73 ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC 132
A EFV E ++ SD + N+++F ES RL H+G+ +N + C
Sbjct: 914 AIEFVDEDC---YLSSDSNSNILIFNTNSTGNESER-FRLNNCAHIHVGECIN----VMC 965
Query: 133 KPS--PISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQN-VMVTHTSHTGGL 189
K S P + + + G +G +P + Y L+ +QN +++
Sbjct: 966 KGSIAPTHSTYETVQKKCILFGGVTGYIGGICEIPNEIYDVLIKVQNQILLQMKGIVECT 1025
Query: 190 NPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI 249
P ++ K + PS IIDGS+V +L++S ++ EI G I D + ++
Sbjct: 1026 TPDNWK--KVIDDWKRMPSSNIIDGSIVESYLEMSKEKQCEIAHLSGVNEEQISDIIENM 1083
Query: 250 EAL 252
+L
Sbjct: 1084 ISL 1086
>gi|156841606|ref|XP_001644175.1| hypothetical protein Kpol_1059p7 [Vanderwaltozyma polyspora DSM
70294]
gi|156114812|gb|EDO16317.1| hypothetical protein Kpol_1059p7 [Vanderwaltozyma polyspora DSM
70294]
Length = 1346
Score = 42.4 bits (98), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 23/53 (43%), Positives = 34/53 (64%), Gaps = 3/53 (5%)
Query: 198 KGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI-GSKHNDILDELYDI 249
K + YYA P + IIDG L +FL L+ ERLEICK + +K DI+ ++ ++
Sbjct: 1291 KYRSYYA--PVKNIIDGDLCERFLYLNSNERLEICKNLKDTKPEDIIRQINEM 1341
>gi|327301962|ref|XP_003235673.1| UV-damaged DNA binding protein [Trichophyton rubrum CBS 118892]
gi|326461015|gb|EGD86468.1| UV-damaged DNA binding protein [Trichophyton rubrum CBS 118892]
Length = 1147
Score = 42.4 bits (98), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 35/151 (23%), Positives = 68/151 (45%), Gaps = 6/151 (3%)
Query: 85 FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGAR 144
++++D + N+V+ + RL ++ LG+ VN I + + A AR
Sbjct: 982 YLLADAEGNLVVLQQNITGVTESDRKRLQPTSEIRLGEMVNRIHPIVIQ-TYTETAVSAR 1040
Query: 145 SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYA 204
+ A++DG++ F + LL LQ M + T G + +R ++ + +
Sbjct: 1041 A----LLATVDGSIYLFGLINPTYIDLLLRLQTAMGSITISPGEIPFSKYRAFRTTVHQS 1096
Query: 205 GNPSRGIIDGSLVWKFLQLSLGERLEICKKI 235
P R +DG L+ +FL + G + EI ++
Sbjct: 1097 DEPFR-FVDGELIERFLSCTPGMQEEIVSRL 1126
>gi|452003044|gb|EMD95501.1| hypothetical protein COCHEDRAFT_1165632 [Cochliobolus heterostrophus
C5]
Length = 1116
Score = 42.4 bits (98), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 37/149 (24%), Positives = 69/149 (46%), Gaps = 8/149 (5%)
Query: 77 VIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSP 136
V E + ++ SD + N+V+ P + RL ++ LG+ VN ++ +
Sbjct: 946 VAEVDTNTYLESDAEGNLVVLYRDPNGVTDDDKRRLNISSEMLLGEMVNRIRRVDVLTA- 1004
Query: 137 ISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLM-LQNVMVTHTSHTGGLNPRAFR 195
+DA F+ +++G++ + L +Y+ LLM LQ+ + + + G + FR
Sbjct: 1005 -ADAVIVPKAFV---GTVEGSI-YLFGLISPDYQNLLMTLQSNLGSLVAAPGDMQFAKFR 1059
Query: 196 TYKGKGYYAGNPSRGIIDGSLVWKFLQLS 224
+K P R +DG LV +FL +S
Sbjct: 1060 AFKNSVREEEEPMR-FVDGELVERFLDVS 1087
>gi|219125301|ref|XP_002182922.1| damage-specific DNA binding protein 1 [Phaeodactylum tricornutum CCAP
1055/1]
gi|217405716|gb|EEC45658.1| damage-specific DNA binding protein 1 [Phaeodactylum tricornutum CCAP
1055/1]
Length = 1284
Score = 42.4 bits (98), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 31/113 (27%), Positives = 52/113 (46%), Gaps = 3/113 (2%)
Query: 111 RLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYR 170
RL +FHLG+ N F P+S SR T + +++G+LG L L +
Sbjct: 1110 RLDNIGEFHLGEMCNKFMS-GSLVMPVSSNSTTSSRRATLFGTVEGSLGVILGLDGRTAA 1168
Query: 171 RLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQL 223
+ L+ + GG + + +R+ + + +P+ G +DG LV FL L
Sbjct: 1169 FFITLERAIAKTIQPVGGFSHQLYRSCQAE--LRVHPAHGFVDGDLVETFLDL 1219
>gi|452824087|gb|EME31092.1| DNA damage-binding protein 1 isoform 1 [Galdieria sulphuraria]
Length = 1128
Score = 42.4 bits (98), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 41/162 (25%), Positives = 69/162 (42%), Gaps = 9/162 (5%)
Query: 77 VIESSSMGFMISDKDK-NVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPS 135
VIE + + IS ++ N+ +A RL K ++HLG+ VN R
Sbjct: 955 VIEMLDLDYYISAENCFNLFTLKRNSDASTEEERSRLEKVGEYHLGELVNRIRHGRL--- 1011
Query: 136 PISDAPGARSRFLT--WYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRA 193
+ P + L Y + +GALG + EK ++ L LQ + GG+
Sbjct: 1012 -VLQIPESGISILKSLLYGTANGALGVIASIDEKTFQFLHSLQTALNEVIKGVGGIQHED 1070
Query: 194 FRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI 235
+R + + S+ +DG L+ +FL LS + + KK+
Sbjct: 1071 WRRFTSERRIG--DSKNFLDGDLIERFLDLSRDKMELVAKKV 1110
>gi|195449948|ref|XP_002072297.1| GK22405 [Drosophila willistoni]
gi|194168382|gb|EDW83283.1| GK22405 [Drosophila willistoni]
Length = 1140
Score = 42.4 bits (98), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 41/162 (25%), Positives = 68/162 (41%), Gaps = 16/162 (9%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVN 125
P+ + A E + + + +G + N LF+ Q ++ + R L + FHLG VN
Sbjct: 952 PKWMRAVEILDDDTFLG-----SETNGNLFVCQKDSAATTDEERQLLPELARFHLGDTVN 1006
Query: 126 TFFKIRCKPSPISDAPGARSRFL---TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
F S + G R+ + Y + +GA+G +P+ Y L L+ +
Sbjct: 1007 VFRH----GSLVMQNVGERTTPINGCVLYGTCNGAIGIVTQIPQDFYDFLHGLEERLKKI 1062
Query: 183 TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLS 224
G + +R ++ PS G IDG L+ FL LS
Sbjct: 1063 IKSVGKIEHTYYRNFQINTKV--EPSEGFIDGDLIESFLDLS 1102
>gi|325186344|emb|CCA20849.1| predicted protein putative [Albugo laibachii Nc14]
Length = 1148
Score = 42.4 bits (98), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 42/187 (22%), Positives = 83/187 (44%), Gaps = 22/187 (11%)
Query: 85 FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF----FKIRCKPSPISDA 140
++ S+ D N+ A RL ++HLG+ VN F ++ S ++A
Sbjct: 958 YVGSETDFNLFTVQRNSGAASDEERGRLETIGEYHLGEFVNRFRYGSLVMQHNLSIGAEA 1017
Query: 141 PGA-----RSRFLT--------WYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 187
PG R L+ + ++ G +G LP+ ++ + L+ +Q+ + G
Sbjct: 1018 PGISLSDDRPESLSPLSVQRSMLFGTVSGMIGVILPISKEKHEFLMRVQSALNQVIQGVG 1077
Query: 188 GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELY 247
G + +RT++ + + + IDG L+ FL LS E ++ ++ + D L+
Sbjct: 1078 GFSHSEWRTFENR--RSSIEAHNFIDGDLIESFLDLSKDEMKQVVDEL---NRDQLEGKT 1132
Query: 248 DIEALSS 254
+EAL++
Sbjct: 1133 TLEALAA 1139
>gi|224061051|ref|XP_002300334.1| predicted protein [Populus trichocarpa]
gi|222847592|gb|EEE85139.1| predicted protein [Populus trichocarpa]
Length = 1088
Score = 42.4 bits (98), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 33/129 (25%), Positives = 56/129 (43%), Gaps = 12/129 (9%)
Query: 111 RLIKKTDFHLGQHVNTF----FKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPE 166
RL ++HLG+ VN F +R S + P + +++G +G LP
Sbjct: 950 RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIP------TVIFGTVNGVIGVIASLPH 1003
Query: 167 KNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLG 226
+ Y L LQ+ + GGL+ +R++ + ++ +DG L+ FL LS
Sbjct: 1004 EQYLFLEKLQSNLRKVIKGVGGLSHEQWRSFNNEKKTV--DAKNFLDGDLIESFLDLSRS 1061
Query: 227 ERLEICKKI 235
EI K +
Sbjct: 1062 RMDEISKAM 1070
>gi|124505011|ref|XP_001351247.1| CPSF (cleavage and polyadenylation specific factor), subunit A,
putative [Plasmodium falciparum 3D7]
gi|7768292|emb|CAB11136.2| CPSF (cleavage and polyadenylation specific factor), subunit A,
putative [Plasmodium falciparum 3D7]
Length = 2870
Score = 42.4 bits (98), Expect = 0.21, Method: Composition-based stats.
Identities = 40/194 (20%), Positives = 80/194 (41%), Gaps = 17/194 (8%)
Query: 74 CEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCK 133
C + S ++SD D N ++ ++L + F+ G +N +
Sbjct: 2679 CTSLCALSKSHIVVSDMDANFIILQKSKFKYNDEDSYKLSSVSLFNHGSIINKMLPL--S 2736
Query: 134 PSPISDAPGARSRFLT-----WYASLDGALGFFLPLPE-KNYRRLLMLQNVMVTHTSHTG 187
+ + + + LT AS +G++ +P N+++ L ++ + + S G
Sbjct: 2737 NTNLIEEDYDKRNILTKNDGILCASSEGSISVLIPFSSFANFKKALCIEIAITDNISSIG 2796
Query: 188 GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLE-------ICKKIGSKHN 240
L+ A+R YK + +GI+DG L+ F +S ++ + I KKI K
Sbjct: 2797 NLSHNAYREYKVN--FRSKHCKGIVDGELLKMFFHMSFEKQYKTFIYAKWIAKKINCKFG 2854
Query: 241 DILDELYDIEALSS 254
+ + D+E + S
Sbjct: 2855 SFNNFILDLENMCS 2868
>gi|154303693|ref|XP_001552253.1| hypothetical protein BC1G_08731 [Botryotinia fuckeliana B05.10]
Length = 1087
Score = 42.4 bits (98), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 41/184 (22%), Positives = 76/184 (41%), Gaps = 24/184 (13%)
Query: 77 VIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSP 136
V E + ++ +D D N++L E R+ + +LG+ VN +I + S
Sbjct: 899 VAEVDTDTYLETDHDGNLILLKRNREGVTREDKTRMEVTCEMNLGEMVNRVKRINVETS- 957
Query: 137 ISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHT---------- 186
DA FL + +G++ F +P +N L+ LQ+ + + S +
Sbjct: 958 -KDALLIPRAFL---GTTEGSIYLFSLIPPQNQDLLMRLQSRLASLPSASSIRGSSDSTS 1013
Query: 187 --------GGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 238
G L+ +R+Y P R +DG L+ +FL L + + + + +G K
Sbjct: 1014 PHQIELSPGNLDFNKYRSYISATRETSEPFR-FVDGELIERFLDLEVEVQEHVAEGLGVK 1072
Query: 239 HNDI 242
D+
Sbjct: 1073 AEDL 1076
>gi|350537001|ref|NP_001234275.1| DNA damage-binding protein 1 [Solanum lycopersicum]
gi|350539125|ref|NP_001233864.1| UV damaged DNA binding protein 1 [Solanum lycopersicum]
gi|55976440|sp|Q6QNU4.1|DDB1_SOLLC RecName: Full=DNA damage-binding protein 1; AltName: Full=High
pigmentation protein 1; AltName: Full=UV-damaged
DNA-binding protein 1
gi|38455768|gb|AAR20885.1| UV damaged DNA binding protein 1 [Solanum lycopersicum]
gi|42602165|gb|AAS21683.1| UV-damaged DNA binding protein 1 [Solanum lycopersicum]
Length = 1090
Score = 42.0 bits (97), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 50/241 (20%), Positives = 91/241 (37%), Gaps = 41/241 (17%)
Query: 8 LTVSLGLHGNRPLLLVRTQHELLIYQ---------AFRHPKGTLKLRFKKLKVLFVSDRS 58
L G HG+ L V+T+ + ++ F+H +G ++ R + ++S
Sbjct: 860 LQTECGHHGHILALYVQTRGDFIVVGDLMKSISLLIFKHEEGAIEERARDYNANWMS--- 916
Query: 59 KRANEQPGLPRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDF 118
A E + + +G ++ + N+ E RL ++
Sbjct: 917 --------------AVEILDDDIYLG---AENNFNLFTVRKNSEGATDEERSRLEVVGEY 959
Query: 119 HLGQHVNTF----FKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLM 174
HLG+ VN F +R S + P + +++G +G LP Y L
Sbjct: 960 HLGEFVNRFRHGSLVMRLPDSDVGQIP------TVIFGTVNGVIGVIASLPHDQYLFLEK 1013
Query: 175 LQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKK 234
LQ + GGL+ +R++ + ++ +DG L+ FL LS EI K
Sbjct: 1014 LQTNLRKVIKGVGGLSHEQWRSFYNEKKTV--DAKNFLDGDLIESFLDLSRNRMEEISKA 1071
Query: 235 I 235
+
Sbjct: 1072 M 1072
>gi|367033707|ref|XP_003666136.1| hypothetical protein MYCTH_2310607 [Myceliophthora thermophila ATCC
42464]
gi|347013408|gb|AEO60891.1| hypothetical protein MYCTH_2310607 [Myceliophthora thermophila ATCC
42464]
Length = 1043
Score = 42.0 bits (97), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 37/170 (21%), Positives = 70/170 (41%), Gaps = 9/170 (5%)
Query: 73 ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC 132
A FV E S ++ +D N+++ E + R+ ++ +LG+ VN R
Sbjct: 872 AVGFVREGS---WLEADAQGNLMVLRRNVEGVTAEDRRRMEITSEMNLGEMVN-----RI 923
Query: 133 KPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPR 192
+ + PGA + +++G + F + LL Q + G + R
Sbjct: 924 REVEVETTPGALVVPRAFLGTVEGGIYMFGTVAPHVQDLLLRFQGKLADVLKTAGDIPFR 983
Query: 193 AFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
++R ++ P R +DG L+ +FL + + ICK +G D+
Sbjct: 984 SYRAFRNAEREGDGPFR-FLDGELLERFLDVDEATQEIICKGLGPSVEDM 1032
>gi|156339616|ref|XP_001620212.1| hypothetical protein NEMVEDRAFT_v1g223331 [Nematostella vectensis]
gi|156204813|gb|EDO28112.1| predicted protein [Nematostella vectensis]
Length = 248
Score = 42.0 bits (97), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 25/64 (39%), Positives = 37/64 (57%), Gaps = 3/64 (4%)
Query: 4 VQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP--KGTLKLRFKKLKVLFVSDRSKRA 61
V+E+L LG R L+ +LLIY+AF +P +G L LRFKKL+ + R K+
Sbjct: 150 VREVLLTGLGYKNRRATLVAVMDQDLLIYEAFSYPTVEGHLNLRFKKLQ-HNIQIREKKP 208
Query: 62 NEQP 65
++P
Sbjct: 209 KQEP 212
>gi|431910407|gb|ELK13480.1| DNA damage-binding protein 1 [Pteropus alecto]
Length = 1143
Score = 42.0 bits (97), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 44/171 (25%), Positives = 71/171 (41%), Gaps = 26/171 (15%)
Query: 96 LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
LF+ Q ++ + R L + FHLG+ VN F C S + G S +
Sbjct: 974 LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGETSTPTQGSVL 1029
Query: 151 YASLDGALGFFLPLPEKNYRRLLMLQNVM------VTHTSHTGGLNPRAFRTYKGKGYYA 204
+ +++G +G L E Y LL +QN + V H+ + R+F T +
Sbjct: 1030 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSLYPSQRSFHTERKT---- 1085
Query: 205 GNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
P+ G IDG L+ FL +S + E+ + G K D+L +
Sbjct: 1086 -EPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKV 1135
>gi|55976392|sp|Q6E7D1.1|DDB1_SOLCE RecName: Full=DNA damage-binding protein 1; AltName: Full=UV-damaged
DNA-binding protein 1
gi|49484911|gb|AAT66742.1| UV-damaged DNA binding protein 1 [Solanum cheesmaniae]
Length = 1095
Score = 42.0 bits (97), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 50/241 (20%), Positives = 91/241 (37%), Gaps = 41/241 (17%)
Query: 8 LTVSLGLHGNRPLLLVRTQHELLIYQ---------AFRHPKGTLKLRFKKLKVLFVSDRS 58
L G HG+ L V+T+ + ++ F+H +G ++ R + ++S
Sbjct: 865 LQTECGHHGHILALYVQTRGDFIVVGDLMKSISLLIFKHEEGAIEERARDYNANWMS--- 921
Query: 59 KRANEQPGLPRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDF 118
A E + + +G ++ + N+ E RL ++
Sbjct: 922 --------------AVEILDDDIYLG---AENNFNLFTVRKNSEGATDEERSRLEVVGEY 964
Query: 119 HLGQHVNTF----FKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLM 174
HLG+ VN F +R S + P + +++G +G LP Y L
Sbjct: 965 HLGEFVNRFRHGSLVMRLPDSDVGQIP------TVIFGTVNGVIGVIASLPHDQYLFLEK 1018
Query: 175 LQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKK 234
LQ + GGL+ +R++ + ++ +DG L+ FL LS EI K
Sbjct: 1019 LQTNLRKVIKGVGGLSHEQWRSFYNEKKTV--DAKNFLDGDLIESFLDLSRNRMEEISKA 1076
Query: 235 I 235
+
Sbjct: 1077 M 1077
>gi|212539802|ref|XP_002150056.1| UV-damaged DNA binding protein, putative [Talaromyces marneffei ATCC
18224]
gi|210067355|gb|EEA21447.1| UV-damaged DNA binding protein, putative [Talaromyces marneffei ATCC
18224]
Length = 1139
Score = 42.0 bits (97), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 38/176 (21%), Positives = 75/176 (42%), Gaps = 8/176 (4%)
Query: 77 VIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSP 136
V + S +++SD + N+++ + RL ++ LG+ VN R +P
Sbjct: 968 VADIGSNQWLVSDAEGNLIVLRRNVDGMTEEDRRRLEVTSELLLGEMVN-----RIRPVN 1022
Query: 137 ISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRT 196
I + +++G++ F + ++ L+ LQ + + G + FR
Sbjct: 1023 IPQTSTMAVTPKAFLGTVEGSIYLFALINPEHQDFLMRLQTAISAYVDSPGLMPFNKFRA 1082
Query: 197 YKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEAL 252
++ A P R +DG L+ +FL + EI +GS + + ++ IEAL
Sbjct: 1083 FRSTVREAEEPFR-FVDGELIERFLDCDRAVQEEILGVVGSGDLESVQKM--IEAL 1135
>gi|440893607|gb|ELR46310.1| DNA damage-binding protein 1 [Bos grunniens mutus]
Length = 1143
Score = 42.0 bits (97), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 44/171 (25%), Positives = 71/171 (41%), Gaps = 26/171 (15%)
Query: 96 LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
LF+ Q ++ + R L + FHLG+ VN F C S + G S +
Sbjct: 974 LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGETSTPTQGSVL 1029
Query: 151 YASLDGALGFFLPLPEKNYRRLLMLQNVM------VTHTSHTGGLNPRAFRTYKGKGYYA 204
+ +++G +G L E Y LL +QN + V H+ + R+F T +
Sbjct: 1030 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSLYPSQRSFHTERKT---- 1085
Query: 205 GNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
P+ G IDG L+ FL +S + E+ + G K D+L +
Sbjct: 1086 -EPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKV 1135
>gi|407044103|gb|EKE42371.1| DNA damage-binding protein, putative [Entamoeba nuttalli P19]
Length = 1088
Score = 42.0 bits (97), Expect = 0.26, Method: Composition-based stats.
Identities = 45/183 (24%), Positives = 79/183 (43%), Gaps = 13/183 (7%)
Query: 73 ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC 132
A EFV E ++ SD + N+++F ES RL H+G+ +N + C
Sbjct: 914 AIEFVDEDC---YLSSDSNSNILIFNTNSTGNESER-FRLNNCAHIHVGECIN----VMC 965
Query: 133 KPS--PISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQN-VMVTHTSHTGGL 189
K S P + + + G +G +P + Y L+ +QN +++
Sbjct: 966 KGSIAPTHSTYETVQKKCILFGGVTGYIGGICEIPNEIYDILIKVQNQILLQMKGIVECT 1025
Query: 190 NPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI 249
P ++ K + PS IIDGS+V +L++S ++ EI G I D + ++
Sbjct: 1026 TPDDWK--KVIDDWKRMPSSNIIDGSIVESYLEMSKEKQCEIAHLSGVNEEKISDIIENM 1083
Query: 250 EAL 252
+L
Sbjct: 1084 ISL 1086
>gi|290998415|ref|XP_002681776.1| damage-specific DNA binding protein 1 [Naegleria gruberi]
gi|284095401|gb|EFC49032.1| damage-specific DNA binding protein 1 [Naegleria gruberi]
Length = 1103
Score = 42.0 bits (97), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 39/190 (20%), Positives = 79/190 (41%), Gaps = 17/190 (8%)
Query: 73 ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNT----FF 128
+C ++ S ++ D+ N++ EA +L++ ++ VN F
Sbjct: 918 SCSLMLNESE--YVAVDQQGNMITLKKNDEAASEEERKQLVRVGKYYCSDRVNRIQPGFI 975
Query: 129 KIR--CKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHT 186
+R S I+ P + F ++ G +G LP + + + +Q M + +
Sbjct: 976 GMRFANSSSDINTQPVKTALF----GTISGGIGVLAQLPPETFAFVTKIQKAMSSVVTGL 1031
Query: 187 GGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHND--ILD 244
++ +R Y+ + S G IDG V FL+ + + +++ + H + L+
Sbjct: 1032 ANISRETYRQYRSE--RTREDSVGFIDGDFVESFLEFDFETQQRVIEELSNNHQEQITLE 1089
Query: 245 ELY-DIEALS 253
EL +IE LS
Sbjct: 1090 ELVKNIEDLS 1099
>gi|315039795|ref|XP_003169275.1| hypothetical protein MGYG_08821 [Arthroderma gypseum CBS 118893]
gi|311337696|gb|EFQ96898.1| hypothetical protein MGYG_08821 [Arthroderma gypseum CBS 118893]
Length = 1076
Score = 42.0 bits (97), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 38/168 (22%), Positives = 72/168 (42%), Gaps = 12/168 (7%)
Query: 85 FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGAR 144
++++D + N+V+ + RL ++ LG+ VN I + + A AR
Sbjct: 911 YLLADAEGNLVVLQQNTTGVTESDRKRLQPTSEIRLGEMVNRIHPITVQ-THTETAVSAR 969
Query: 145 SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYA 204
+ A++DG++ F + LL LQ M + T G + +R ++ +
Sbjct: 970 A----LLATVDGSIYLFGLINPAYIDLLLRLQTAMASITVSPGEIPFSKYRAFRTTVRQS 1025
Query: 205 GNPSRGIIDGSLVWKFL------QLSLGERLEICKKIGSKHNDILDEL 246
P R +DG L+ +FL Q + RL+ S ++++EL
Sbjct: 1026 DEPFR-FVDGELIERFLSCTPTMQEEIANRLDDSNVTVSSLKEMIEEL 1072
>gi|116195210|ref|XP_001223417.1| hypothetical protein CHGG_04203 [Chaetomium globosum CBS 148.51]
gi|88180116|gb|EAQ87584.1| hypothetical protein CHGG_04203 [Chaetomium globosum CBS 148.51]
Length = 1127
Score = 41.6 bits (96), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 36/170 (21%), Positives = 69/170 (40%), Gaps = 9/170 (5%)
Query: 73 ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC 132
A FV E S ++ +D N+++ E + R+ ++ +L + VN R
Sbjct: 956 AVGFVEEGS---WLEADAQGNLMVLRRNVEGVTAEDKRRMEVTSEINLNEMVN-----RI 1007
Query: 133 KPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPR 192
+ + PGA + +++G + F + LL Q+ + G + R
Sbjct: 1008 RTIDVETTPGAMIVPKAFLGTVEGGIYMFGTVAPHVQDLLLRFQSRLADVLKTAGDIEFR 1067
Query: 193 AFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
+R ++ P R +DG L+ KFL + + +CK +G D+
Sbjct: 1068 TYRAFRNAEREGDGPFR-FVDGELLEKFLDVDETTQEAVCKGLGPTVEDM 1116
>gi|357606250|gb|EHJ64976.1| putative Splicing factor 3B subunit [Danaus plexippus]
Length = 1216
Score = 41.6 bits (96), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 36/135 (26%), Positives = 58/135 (42%), Gaps = 17/135 (12%)
Query: 116 TDFHLGQHVNTFFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLM- 174
+FH+G+ V + + PG L YA++ GALG FLP +
Sbjct: 1092 VNFHVGETVTSLQRATL-------IPGGSEALL--YATVSGALGVFLPFTSREDHDFFQH 1142
Query: 175 LQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKK 234
L+ M + S G + +FR+Y YY P + +IDG L +F L ++ I
Sbjct: 1143 LEMHMRSENSPLCGRDHLSFRSY----YY---PVKNVIDGDLCEQFNSLEPAKQKAIAGD 1195
Query: 235 IGSKHNDILDELYDI 249
+ ++ +L DI
Sbjct: 1196 LERTPAEVSKKLEDI 1210
>gi|241260143|ref|XP_002404926.1| DNA repair protein xp-E, putative [Ixodes scapularis]
gi|215496735|gb|EEC06375.1| DNA repair protein xp-E, putative [Ixodes scapularis]
Length = 1148
Score = 41.6 bits (96), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 46/176 (26%), Positives = 75/176 (42%), Gaps = 22/176 (12%)
Query: 96 LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
LF+ Q ++ + R L + FHLG+ VN F S + PG S +
Sbjct: 974 LFVCQKDSAATTDEDRQHLQEVGQFHLGEFVNIFR----HGSLVMQHPGEASSPTQGSVL 1029
Query: 151 YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAF---RTYKGKGYYA--- 204
+ ++ GA+G LP Y LL +Q + G ++ + R + + +
Sbjct: 1030 FGTIHGAIGLVAQLPSDFYNFLLEVQGNLTKVIKSVGKIDHTLYPFVRLFTWRSFSTERK 1089
Query: 205 GNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYD-IEALS 253
++G IDG L+ FL LS + E+ + I G K + +D+L IE LS
Sbjct: 1090 TEQAQGFIDGDLIESFLDLSRDKMQEVLQGIQMDDGSGMKRDATVDDLIKIIEELS 1145
>gi|194741158|ref|XP_001953056.1| GF17579 [Drosophila ananassae]
gi|190626115|gb|EDV41639.1| GF17579 [Drosophila ananassae]
Length = 1140
Score = 41.6 bits (96), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 48/197 (24%), Positives = 80/197 (40%), Gaps = 22/197 (11%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVN 125
P+ + A E + + + +G + N LF+ Q ++ + R L + FHLG VN
Sbjct: 952 PKWMRAVEILDDDTFLG-----SETNGNLFVCQKDSAATTDEERQLLPELARFHLGDTVN 1006
Query: 126 TFFKIRCKPSPISDAPGARSRFL---TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
F S + G R+ + Y + +GA+G +P+ Y L L+ +
Sbjct: 1007 VFRH----GSLVMQNVGERTTPINGCVLYGTCNGAIGIVTQIPQDFYDFLHGLEERLKKI 1062
Query: 183 TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQL------SLGERLEICKKIG 236
G + +R ++ PS G IDG L+ FL L + LEI
Sbjct: 1063 IKSVGKIEHTYYRNFQINTKV--EPSEGFIDGDLIESFLDLGRDKMRDAVQGLEITLNGE 1120
Query: 237 SKHNDILDELYDIEALS 253
K D+ D + +E L+
Sbjct: 1121 RKSADVEDVIKIVEDLT 1137
>gi|167517245|ref|XP_001742963.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163778062|gb|EDQ91677.1| predicted protein [Monosiga brevicollis MX1]
Length = 1670
Score = 41.6 bits (96), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 48/231 (20%), Positives = 89/231 (38%), Gaps = 44/231 (19%)
Query: 55 SDRSKRANEQPGLPRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIK 114
S++ R PG P Y + + F+ D D+N+VL M P N +
Sbjct: 959 SEQRNRFVVGPGDPESRYLLTCFLPAEDR-FLFCDSDQNLVLGM-PPVDTVENDASLMHL 1016
Query: 115 KTDFHLGQHVNTFFKIRC------------------------------------KPSPIS 138
H+G ++N++ C +P+
Sbjct: 1017 AGRIHIGDNINSYVICACIHVWTPYLLCPDSTFCFAALFVTSRFAFGSLSLSYERPAEAG 1076
Query: 139 DA--PGAR--SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAF 194
+A GA+ S + ++ G +G L + +K+ + +Q + + GGL +
Sbjct: 1077 EAGEDGAKQQSSPPIVFTTVLGGVGMILEVQQKHLWFMHEMQRRLADMGNAVGGLTHEDY 1136
Query: 195 RTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKK--IGSKHNDIL 243
R+ K + P+R +DG+L+ FL+L+ E E+ K+ I + ND +
Sbjct: 1137 RSTKNGKRESVTPARCFVDGNLIESFLELTPEEMEEVMKEFHIPNSSNDFV 1187
>gi|321478515|gb|EFX89472.1| hypothetical protein DAPPUDRAFT_303245 [Daphnia pulex]
Length = 1158
Score = 41.6 bits (96), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 38/166 (22%), Positives = 68/166 (40%), Gaps = 9/166 (5%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVN 125
P + A E + + + +G + + LF+ Q ++ + R L + FHLG VN
Sbjct: 968 PNWMSAVEILDDDTFLG-----AENSFNLFVCQKDSAATTEEERQQLTEVGRFHLGDMVN 1022
Query: 126 TFFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSH 185
F ++ ++ + ++ GA+G LP + Y L +Q M
Sbjct: 1023 VFRHGSLVMDHAAETLTTPTQGCVLFGTVHGAIGVVTQLPSEFYHFLSEVQTRMARVIKP 1082
Query: 186 TGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEI 231
G + +R++ + P G IDG L+ FL LS + E+
Sbjct: 1083 VGKIEHSFWRSFATERKV--EPCEGFIDGDLIESFLDLSSDKMKEV 1126
>gi|1399512|gb|AAC47162.1| repE [Dictyostelium discoideum]
Length = 1139
Score = 41.6 bits (96), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 29/124 (23%), Positives = 54/124 (43%), Gaps = 5/124 (4%)
Query: 118 FHLGQHVNTFFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQN 177
+H+G+ +N+ P SD P + YAS++G++G + E+++ LQ
Sbjct: 1008 YHIGESINSMRHGSLVRLPDSDQPIIPT---ILYASVNGSIGVVASISEEDFIFFSKLQK 1064
Query: 178 VMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGS 237
+ GG + +R + + S+ IDG L+ FL L +L+ +G
Sbjct: 1065 GLNQVVRGVGGFSHETWRAFSNDHHTI--DSKNFIDGDLIETFLDLKYESQLKAVADLGI 1122
Query: 238 KHND 241
+D
Sbjct: 1123 TPDD 1126
>gi|81868411|sp|Q9ESW0.1|DDB1_RAT RecName: Full=DNA damage-binding protein 1; AltName:
Full=Damage-specific DNA-binding protein 1
gi|9843869|emb|CAB89874.2| damage-specific DNA binding protein 1 [Rattus norvegicus]
Length = 1140
Score = 41.2 bits (95), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 41/165 (24%), Positives = 68/165 (41%), Gaps = 17/165 (10%)
Query: 96 LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
LF+ Q ++ + R L + FHLG+ VN F C S + G S +
Sbjct: 974 LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGETSTPTQGSVL 1029
Query: 151 YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
+++G +G L E Y LL +QN + G + +R++ + P+ G
Sbjct: 1030 LGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEPATG 1087
Query: 211 IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
IDG L+ FL +S + E+ + G K D+L +
Sbjct: 1088 FIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKV 1132
>gi|281345356|gb|EFB20940.1| hypothetical protein PANDA_015888 [Ailuropoda melanoleuca]
Length = 1124
Score = 41.2 bits (95), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 41/140 (29%), Positives = 62/140 (44%), Gaps = 21/140 (15%)
Query: 96 LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
LF+ Q ++ + R L + FHLG+ VN F C S + G S +
Sbjct: 956 LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGETSTPTQGSVL 1011
Query: 151 YASLDGALGFFLPLPEKNYRRLLMLQNVM------VTHTSHTGGLNPRAFRTYKGKGYYA 204
+ +++G +G L E Y LL +QN + +TH S T R+F T +
Sbjct: 1012 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKNITH-SLTHLSTWRSFHTERKT---- 1066
Query: 205 GNPSRGIIDGSLVWKFLQLS 224
P+ G IDG L+ FL +S
Sbjct: 1067 -EPATGFIDGDLIESFLDIS 1085
>gi|119594342|gb|EAW73936.1| damage-specific DNA binding protein 1, 127kDa, isoform CRA_d [Homo
sapiens]
Length = 1146
Score = 41.2 bits (95), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 44/174 (25%), Positives = 71/174 (40%), Gaps = 29/174 (16%)
Query: 96 LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
LF+ Q ++ + R L + FHLG+ VN F C S + G S +
Sbjct: 974 LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGETSTPTQGSVL 1029
Query: 151 YASLDGALGFFLPLPEKNYRRLLMLQN---------VMVTHTSHTGGLNPRAFRTYKGKG 201
+ +++G +G L E Y LL +QN +++ S T R+F T +
Sbjct: 1030 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKRCFLISTCSLTHPSTWRSFHTERKT- 1088
Query: 202 YYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
P+ G IDG L+ FL +S + E+ + G K D+L +
Sbjct: 1089 ----EPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKV 1138
>gi|166240328|ref|XP_637896.2| UV-damaged DNA binding protein1 [Dictyostelium discoideum AX4]
gi|238064940|sp|B0M0P5.1|DDB1_DICDI RecName: Full=DNA damage-binding protein 1; AltName: Full=DNA repair
protein E; AltName: Full=UV-damaged DNA-binding protein 1
gi|165988543|gb|EAL64385.2| UV-damaged DNA binding protein1 [Dictyostelium discoideum AX4]
Length = 1181
Score = 41.2 bits (95), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 29/124 (23%), Positives = 54/124 (43%), Gaps = 5/124 (4%)
Query: 118 FHLGQHVNTFFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQN 177
+H+G+ +N+ P SD P + YAS++G++G + E+++ LQ
Sbjct: 1050 YHIGESINSMRHGSLVRLPDSDQPIIPT---ILYASVNGSIGVVASISEEDFIFFSKLQK 1106
Query: 178 VMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGS 237
+ GG + +R + + S+ IDG L+ FL L +L+ +G
Sbjct: 1107 GLNQVVRGVGGFSHETWRAFSNDHHTI--DSKNFIDGDLIETFLDLKYESQLKAVADLGI 1164
Query: 238 KHND 241
+D
Sbjct: 1165 TPDD 1168
>gi|67463896|ref|XP_648489.1| cleavage and polyadenylation specificity factor subunit [Entamoeba
histolytica HM-1:IMSS]
gi|56464653|gb|EAL43100.1| cleavage and polyadenylation specificity factor subunit, putative
[Entamoeba histolytica HM-1:IMSS]
Length = 1150
Score = 41.2 bits (95), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 36/168 (21%), Positives = 71/168 (42%), Gaps = 14/168 (8%)
Query: 76 FVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPS 135
F+I + D ++N+ L Y A E + F+LG ++ F ++
Sbjct: 980 FIITKDYFSVLEFDSEQNLSLLNYSSAATEQLSIFEI--DATFNLGMNLLKFTRLW---- 1033
Query: 136 PISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFR 195
+ G ++ Y +++G++G+ + EK Y+ L + M H G N +R
Sbjct: 1034 ---NGKG----YIYMYVTVEGSVGYISVVEEKIYQVLRQINIKMNREPWHFAGTNAEEYR 1086
Query: 196 TYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDIL 243
KG G G +DG ++ +F L+ ++ +C + + ND+
Sbjct: 1087 FEKGYGMGFGTRKHVFLDGDMLKQFRLLNEEQQKRVCLR-NTSINDVF 1133
>gi|348526664|ref|XP_003450839.1| PREDICTED: DNA damage-binding protein 1-like [Oreochromis niloticus]
Length = 1140
Score = 41.2 bits (95), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 42/165 (25%), Positives = 68/165 (41%), Gaps = 17/165 (10%)
Query: 96 LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
LF+ Q ++ + R L + FHLG+ VN F C S + G S +
Sbjct: 974 LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVLQNLGESSTPTQGSVL 1029
Query: 151 YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
+ +++G +G L E Y LL LQN + G + +R++ + + G
Sbjct: 1030 FGTVNGMIGLVTSLSEGWYSLLLDLQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEQATG 1087
Query: 211 IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
IDG L+ FL L + E+ + G K +DE+ I
Sbjct: 1088 FIDGDLIESFLDLGRAKMQEVVSTLQIDDGSGMKREATVDEVIKI 1132
>gi|407035910|gb|EKE37921.1| CPSF A subunit region protein, putative [Entamoeba nuttalli P19]
Length = 836
Score = 41.2 bits (95), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 36/168 (21%), Positives = 71/168 (42%), Gaps = 14/168 (8%)
Query: 76 FVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPS 135
F+I + D ++N+ L Y A E + F+LG ++ F ++
Sbjct: 666 FIITKDYFSVLEFDSEQNLSLLNYSSAATEQLSIFEI--DATFNLGMNLLKFTRLW---- 719
Query: 136 PISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFR 195
+ G ++ Y +++G++G+ + EK Y+ L + M H G N +R
Sbjct: 720 ---NGKG----YIYMYVTVEGSVGYISVVEEKIYQVLRQINIKMNREPWHFAGTNAEEYR 772
Query: 196 TYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDIL 243
KG G G +DG ++ +F L+ ++ +C + + ND+
Sbjct: 773 FEKGYGMGFGTRKHVFLDGDMLKQFRLLNEEQQKRVCLR-NTSINDVF 819
>gi|150863836|ref|XP_001382447.2| hypothetical protein PICST_54680 [Scheffersomyces stipitis CBS 6054]
gi|149385092|gb|ABN64418.2| predicted protein [Scheffersomyces stipitis CBS 6054]
Length = 1228
Score = 41.2 bits (95), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 24/99 (24%), Positives = 47/99 (47%), Gaps = 5/99 (5%)
Query: 151 YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
Y + G +G LPL K+ + + N + G N K + YY NP +
Sbjct: 1129 YTGIQGTVGLLLPLSTKSEVQFI---NSLEQSLRQQMGFNLLGMDHLKFRSYY--NPVKN 1183
Query: 211 IIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI 249
+IDG L+ K+ +LS +++I +++ ++ ++ D+
Sbjct: 1184 VIDGDLIEKYYELSQSLKIKIARELNRTPKEVEKKISDL 1222
>gi|167384458|ref|XP_001736962.1| hypothetical protein [Entamoeba dispar SAW760]
gi|165900458|gb|EDR26769.1| hypothetical protein EDI_171140 [Entamoeba dispar SAW760]
Length = 836
Score = 41.2 bits (95), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 36/168 (21%), Positives = 71/168 (42%), Gaps = 14/168 (8%)
Query: 76 FVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPS 135
F+I + D ++N+ L Y A E + F+LG ++ F ++
Sbjct: 666 FIITKDYFSVLEFDSEQNLSLLNYSSAATEQLSIFEI--DATFNLGMNLLKFTRLW---- 719
Query: 136 PISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFR 195
+ G ++ Y +++G++G+ + EK Y+ L + M H G N +R
Sbjct: 720 ---NGKG----YIYMYVTVEGSVGYISVVEEKIYQVLRQINIKMNREPWHFAGTNAEEYR 772
Query: 196 TYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDIL 243
KG G G +DG ++ +F L+ ++ +C + + ND+
Sbjct: 773 FEKGYGMGFGTRKHVFLDGDMLKQFRLLNEEQQKRVCLR-NTSINDVF 819
>gi|449710759|gb|EMD49776.1| cleavage and polyadenylation specificity factor subunit, putative
[Entamoeba histolytica KU27]
Length = 836
Score = 41.2 bits (95), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 36/168 (21%), Positives = 71/168 (42%), Gaps = 14/168 (8%)
Query: 76 FVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPS 135
F+I + D ++N+ L Y A E + F+LG ++ F ++
Sbjct: 666 FIITKDYFSVLEFDSEQNLSLLNYSSAATEQLSIFEI--DATFNLGMNLLKFTRLW---- 719
Query: 136 PISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFR 195
+ G ++ Y +++G++G+ + EK Y+ L + M H G N +R
Sbjct: 720 ---NGKG----YIYMYVTVEGSVGYISVVEEKIYQVLRQINIKMNREPWHFAGTNAEEYR 772
Query: 196 TYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDIL 243
KG G G +DG ++ +F L+ ++ +C + + ND+
Sbjct: 773 FEKGYGMGFGTRKHVFLDGDMLKQFRLLNEEQQKRVCLR-NTSINDVF 819
>gi|340367933|ref|XP_003382507.1| PREDICTED: splicing factor 3B subunit 3-like isoform 1 [Amphimedon
queenslandica]
Length = 1214
Score = 40.8 bits (94), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 34/135 (25%), Positives = 60/135 (44%), Gaps = 17/135 (12%)
Query: 116 TDFHLGQHVNTFFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLM- 174
T +H+G+ +NT K +S PG + Y +L G++G +P K
Sbjct: 1090 TSYHVGEGINTLHK-------VSLIPGGSEVLV--YTTLSGSIGILVPFSSKEDSDFFQH 1140
Query: 175 LQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKK 234
L+ M + S+ G + +FR+ YY P + +IDG L + L +R EI
Sbjct: 1141 LEMHMRSEWSNLVGRDHLSFRS-----YYV--PVKSVIDGDLCEVYNSLDPSKRREIALD 1193
Query: 235 IGSKHNDILDELYDI 249
+ +++ +L D+
Sbjct: 1194 LDRSPSEVAKKLEDL 1208
>gi|194901554|ref|XP_001980317.1| GG19434 [Drosophila erecta]
gi|190652020|gb|EDV49275.1| GG19434 [Drosophila erecta]
Length = 1140
Score = 40.8 bits (94), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 40/161 (24%), Positives = 67/161 (41%), Gaps = 16/161 (9%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVN 125
P+ + A E + + + +G + N LF+ Q ++ + R L + FHLG VN
Sbjct: 952 PKWMRAVEILDDDTFLG-----SETNGNLFVCQKDSAATTDEERQLLPELARFHLGDTVN 1006
Query: 126 TFFKIRCKPSPISDAPGARSRFL---TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 182
F S + G R+ + Y + +GA+G +P+ Y L L+ +
Sbjct: 1007 VFRH----GSLVMQNVGERTTPINGCVLYGTCNGAIGIVTQIPQDFYDFLHGLEERLKKI 1062
Query: 183 TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQL 223
G + +R ++ PS G IDG L+ FL L
Sbjct: 1063 IKSVGKIEHTYYRNFQINTKV--EPSEGFIDGDLIESFLDL 1101
>gi|242803623|ref|XP_002484212.1| UV-damaged DNA binding protein, putative [Talaromyces stipitatus ATCC
10500]
gi|218717557|gb|EED16978.1| UV-damaged DNA binding protein, putative [Talaromyces stipitatus ATCC
10500]
Length = 1140
Score = 40.8 bits (94), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 33/164 (20%), Positives = 68/164 (41%), Gaps = 6/164 (3%)
Query: 77 VIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSP 136
V + S +++SD + N+++ + RL ++ LG+ VN R +P
Sbjct: 968 VADIGSNQWLVSDAEGNLIVLRRNVDGVTEEDRRRLEVTSELLLGEMVN-----RIRPVN 1022
Query: 137 ISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRT 196
I + +++G++ F + ++ L+ LQ + + G + FR
Sbjct: 1023 ILQTSTVAVNPKAFLGTVEGSIYLFALINPEHQDFLMRLQTAITAYVDSPGYMPFSKFRA 1082
Query: 197 YKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHN 240
++ P R +DG L+ +FL + EI +GS ++
Sbjct: 1083 FRSSVREGDEPFR-FVDGELIERFLDCDRPVQEEILGVVGSGYD 1125
>gi|183232997|ref|XP_653855.2| damaged DNA binding protein [Entamoeba histolytica HM-1:IMSS]
gi|169801778|gb|EAL48469.2| damaged DNA binding protein, putative [Entamoeba histolytica
HM-1:IMSS]
Length = 1088
Score = 40.8 bits (94), Expect = 0.59, Method: Composition-based stats.
Identities = 43/173 (24%), Positives = 73/173 (42%), Gaps = 13/173 (7%)
Query: 73 ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC 132
A EFV E ++ SD + N+++F ES RL H+G+ +N + C
Sbjct: 914 AIEFVDEDC---YLSSDSNSNILIFNTNSTGNESER-FRLNNCAHIHVGECIN----VMC 965
Query: 133 KPS--PISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQN-VMVTHTSHTGGL 189
K S P + + + G +G +P + Y L+ +QN +++
Sbjct: 966 KGSIAPTHSTYETVQKKCILFGGVTGYIGGICEIPNEIYDVLIKVQNQILLQMKGIVECT 1025
Query: 190 NPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
P ++ K + PS IIDGS+V +L++S ++ EI G I
Sbjct: 1026 TPDDWK--KVIDDWKRMPSSNIIDGSIVESYLEMSKEKQCEIAHLSGVNEEQI 1076
>gi|444513057|gb|ELV10249.1| DNA damage-binding protein 1 [Tupaia chinensis]
Length = 1146
Score = 40.8 bits (94), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 44/174 (25%), Positives = 71/174 (40%), Gaps = 29/174 (16%)
Query: 96 LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
LF+ Q ++ + R L + FHLG+ VN F C S + G S +
Sbjct: 974 LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGETSTPTQGSVL 1029
Query: 151 YASLDGALGFFLPLPEKNYRRLLMLQNVM---------VTHTSHTGGLNPRAFRTYKGKG 201
+ +++G +G L E Y LL +QN + ++ S T R+F T +
Sbjct: 1030 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKRCFQISPNSLTDMSTWRSFHTERKT- 1088
Query: 202 YYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
P+ G IDG L+ FL +S + E+ + G K D+L +
Sbjct: 1089 ----EPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKV 1138
>gi|322700871|gb|EFY92623.1| DNA damage-binding protein 1 [Metarhizium acridum CQMa 102]
Length = 1121
Score = 40.4 bits (93), Expect = 0.63, Method: Compositional matrix adjust.
Identities = 32/172 (18%), Positives = 68/172 (39%), Gaps = 15/172 (8%)
Query: 85 FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC--------KPSP 136
++ +D N+++ PEA +L ++ ++G+ +N ++ P
Sbjct: 940 WLEADAQGNIIVLQRNPEAPTEQDRSKLEVTSEINIGEQINQIRRLHVASNENAVVSPKA 999
Query: 137 ISDAPGARSRFLTWYASL------DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 190
+ G + + L +G L F + K LL Q + + G ++
Sbjct: 1000 FLGSVGLSETTINCWTQLLILVQIEGTLYLFGEIAPKYQDLLLTFQARLQDYIYAPGNVS 1059
Query: 191 PRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 242
+R ++ K P R +DG +V +FL L ++ +C+ +G D+
Sbjct: 1060 FNLWRAFRNKAREGDGPFR-FVDGEMVERFLDLDEAKQELVCEGLGPSVEDM 1110
>gi|198432471|ref|XP_002129229.1| PREDICTED: similar to DNA damage-binding protein 1 (Damage-specific
DNA-binding protein 1) (UV-damaged DNA-binding factor)
(DDB p127 subunit) (DNA damage-binding protein a) (DDBa)
(UV-damaged DNA-binding protein 1) (UV-DDB 1) (Xeroderma
pigmentosum group E-co... isoform 2 [Ciona intestinalis]
Length = 1142
Score = 40.4 bits (93), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 38/184 (20%), Positives = 77/184 (41%), Gaps = 11/184 (5%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P + A E + + + +G ++ NV + A +L + FH+G +NTF
Sbjct: 954 PNWMTAVEILDDDNFLG---AENFYNVFICQKDSGATTDEERSKLREAALFHVGDSINTF 1010
Query: 128 FKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 187
+ + S+ + ++ G++G + E Y L +QN + G
Sbjct: 1011 RHGSLVMQNVGET-AVSSKGHILFGTVHGSIGVITTVDEDLYAFLHSIQNRLAKVIKSVG 1069
Query: 188 GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI-----GSKHNDI 242
++ ++R++ + RG +DG L+ FL L+ + E+ K + G+K
Sbjct: 1070 NIDHESWRSFCTNEKTEAH--RGFVDGDLIECFLDLNREKMAEVAKGLMVKEHGTKREAT 1127
Query: 243 LDEL 246
+D+L
Sbjct: 1128 VDDL 1131
>gi|299751161|ref|XP_001830098.2| pre-mRNA-splicing factor rse1 [Coprinopsis cinerea okayama7#130]
gi|298409248|gb|EAU91763.2| pre-mRNA-splicing factor rse1 [Coprinopsis cinerea okayama7#130]
Length = 1205
Score = 40.4 bits (93), Expect = 0.70, Method: Compositional matrix adjust.
Identities = 41/152 (26%), Positives = 65/152 (42%), Gaps = 18/152 (11%)
Query: 107 NGG-HRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLP 165
NG H+ FH+G + + K+ S G R L Y L G +G +PL
Sbjct: 1070 NGAPHKTKMLAHFHVGDIITSIHKV-------SLVVGGREVLL--YTGLQGTIGILVPLT 1120
Query: 166 EK-NYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLS 224
K + L ML+ + + G L R +++G YY P + +IDG L + LS
Sbjct: 1121 SKEDIEFLTMLEQHI---RNEQGSLVGRDHLSWRG--YYV--PVKAVIDGDLCETYGGLS 1173
Query: 225 LGERLEICKKIGSKHNDILDELYDIEALSSHF 256
++ I ++ D+L +L + SS F
Sbjct: 1174 SSKQSAIASELDRTVGDVLKKLDQMRVASSGF 1205
>gi|429850956|gb|ELA26181.1| DNA damage-binding protein 1 [Colletotrichum gloeosporioides Nara
gc5]
Length = 1409
Score = 40.0 bits (92), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 34/172 (19%), Positives = 75/172 (43%), Gaps = 16/172 (9%)
Query: 85 FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGAR 144
++ +D N+++ P A + ++ ++FHLG+ +N + +P +
Sbjct: 972 WIEADAQGNLMVLRRNPNAPTEHDKKQMEVISEFHLGEQIN-----KIRPLDVVSGENDP 1026
Query: 145 SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG-------GLNPRAFRTY 197
+ A+++G++ F + + LL Q + G GL+ ++R +
Sbjct: 1027 IEPKAFLATIEGSIYVFADIKPEYQSLLLQFQERLAGVIKTLGQADEPGAGLSFMSWRGF 1086
Query: 198 KGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG---SKHNDILDEL 246
+ A P R +DG L+ +FL L G + + + +G + D+++EL
Sbjct: 1087 RNAKRSADGPFR-FVDGELIERFLDLDAGRQEAVVQGLGPTVERMRDLVEEL 1137
>gi|157131296|ref|XP_001662180.1| spliceosomal protein sap [Aedes aegypti]
gi|108881836|gb|EAT46061.1| AAEL002709-PA [Aedes aegypti]
Length = 1215
Score = 39.7 bits (91), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 32/133 (24%), Positives = 59/133 (44%), Gaps = 17/133 (12%)
Query: 118 FHLGQHVNTFFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEK-NYRRLLMLQ 176
FHLG+ + + K P G+ S YA++ G +G +P + +Y L+
Sbjct: 1093 FHLGETIMSLQKATLIPG------GSESLI---YATMSGTVGALVPFTSREDYDFFQHLE 1143
Query: 177 NVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 236
M + G + ++R+Y YY P + ++DG L +F + ++ I +G
Sbjct: 1144 MHMRNENTPLCGRDHLSYRSY----YY---PVKHVMDGDLCEQFTSMDPAKQKSIASDLG 1196
Query: 237 SKHNDILDELYDI 249
N++ +L DI
Sbjct: 1197 RTPNEVAKKLEDI 1209
>gi|313238818|emb|CBY20011.1| unnamed protein product [Oikopleura dioica]
gi|313245836|emb|CBY34826.1| unnamed protein product [Oikopleura dioica]
Length = 1135
Score = 39.7 bits (91), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 35/182 (19%), Positives = 80/182 (43%), Gaps = 13/182 (7%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P + A + + + + +G ++ +N+ + EA + +L+ +H+G+H+NT
Sbjct: 951 PNWMTAIKMIDDDNYIG---AENSENIFICTRNTEAPDEEDRQQLLPTGYYHVGEHINTI 1007
Query: 128 FKIRCKPSPISDAPGARSR-FLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHT 186
+ ++ +R FL S+ G +G PEK ++ L L+ M
Sbjct: 1008 VEGNLVMDVHVESSITPTRTFL--MGSVSGYVGLLAIFPEKQWQFLSKLEAKMRKVIRGV 1065
Query: 187 GGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG-----SKHND 241
G ++ ++R ++ +G +DG L+ F L ++ E+ ++ + H+D
Sbjct: 1066 GKIDHESWRRFESDSRM--EDCKGFVDGDLIEMFQDLRPEKQKEVISELTMDGEPATHDD 1123
Query: 242 IL 243
++
Sbjct: 1124 VV 1125
>gi|157141630|ref|XP_001647735.1| spliceosomal protein sap [Aedes aegypti]
gi|108867854|gb|EAT32403.1| AAEL015441-PA, partial [Aedes aegypti]
Length = 645
Score = 39.7 bits (91), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 32/133 (24%), Positives = 59/133 (44%), Gaps = 17/133 (12%)
Query: 118 FHLGQHVNTFFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEK-NYRRLLMLQ 176
FHLG+ + + K P G+ S YA++ G +G +P + +Y L+
Sbjct: 523 FHLGETIMSLQKATLIPG------GSESLI---YATMSGTVGALVPFTSREDYDFFQHLE 573
Query: 177 NVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 236
M + G + ++R+Y YY P + ++DG L +F + ++ I +G
Sbjct: 574 MHMRNENTPLCGRDHLSYRSY----YY---PVKHVMDGDLCEQFTSMDPAKQKSIASDLG 626
Query: 237 SKHNDILDELYDI 249
N++ +L DI
Sbjct: 627 RTPNEVAKKLEDI 639
>gi|348560393|ref|XP_003465998.1| PREDICTED: DNA damage-binding protein 1-like [Cavia porcellus]
Length = 1140
Score = 39.7 bits (91), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 35/134 (26%), Positives = 58/134 (43%), Gaps = 11/134 (8%)
Query: 96 LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
LF+ Q ++ + R L + FHLG+ VN F C S + G S +
Sbjct: 974 LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQNLGETSTPTQGSVL 1029
Query: 151 YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
+ +++G +G L E Y LL +QN + G + +R++ + + G
Sbjct: 1030 FGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEQATG 1087
Query: 211 IIDGSLVWKFLQLS 224
IDG L+ FL +S
Sbjct: 1088 FIDGDLIESFLDIS 1101
>gi|339259094|ref|XP_003369733.1| splicing factor 3B subunit 3 [Trichinella spiralis]
gi|316965959|gb|EFV50595.1| splicing factor 3B subunit 3 [Trichinella spiralis]
Length = 1241
Score = 39.7 bits (91), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 29/110 (26%), Positives = 55/110 (50%), Gaps = 10/110 (9%)
Query: 141 PGARSRFLTWYASLDGALGFFLPLPEK-NYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKG 199
PG+ + Y+++ G +G +P K +Y L+ + T G + A+R++
Sbjct: 1135 PGSSESLV--YSTMSGMIGALVPFSSKEDYEFFQHLEMHLRTEYPPLCGRDHLAYRSF-- 1190
Query: 200 KGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI 249
YA P +G+IDG L ++ L G++ EI ++ ++I+ +L DI
Sbjct: 1191 ---YA--PVKGVIDGDLCEQYCLLEYGKQKEISNELDRVPSEIMKKLEDI 1235
>gi|58383228|ref|XP_312466.2| AGAP002472-PA [Anopheles gambiae str. PEST]
gi|55242305|gb|EAA08181.2| AGAP002472-PA [Anopheles gambiae str. PEST]
Length = 1138
Score = 39.7 bits (91), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 35/157 (22%), Positives = 62/157 (39%), Gaps = 6/157 (3%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P + A E + + + +G +D N+ + + A ++ + FHLG VN F
Sbjct: 950 PNWMTAVEILDDDAFLG---ADNSNNLFVCLKDSAATTDEERQQMPEVAQFHLGDMVNVF 1006
Query: 128 FKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 187
IS+ + + + ++ GA+G + Y L LQ + G
Sbjct: 1007 RHGSLVMQNISERSTPTTGCV-LFGTVSGAIGLVTQIQSDFYEFLRKLQENLTNTIKSVG 1065
Query: 188 GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLS 224
++ +R++ + G IDG LV FL LS
Sbjct: 1066 KIDHSYWRSFHTETKM--ERCEGFIDGDLVESFLDLS 1100
>gi|225443990|ref|XP_002280735.1| PREDICTED: DNA damage-binding protein 1 isoform 1 [Vitis vinifera]
Length = 1089
Score = 39.3 bits (90), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 32/130 (24%), Positives = 54/130 (41%), Gaps = 12/130 (9%)
Query: 111 RLIKKTDFHLGQHVNTF----FKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPE 166
RL ++HLG+ VN F +R S + P + +++G +G LP
Sbjct: 951 RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIP------TVIFGTVNGVIGVIASLPH 1004
Query: 167 KNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLG 226
Y L LQ + GGL+ +R++ + ++ +DG L+ FL L+
Sbjct: 1005 DQYVFLEKLQANLRKVIKGVGGLSHEQWRSFNNEKKTV--DAKNFLDGDLIETFLDLNRT 1062
Query: 227 ERLEICKKIG 236
EI K +
Sbjct: 1063 RMDEISKAMA 1072
>gi|225443992|ref|XP_002280744.1| PREDICTED: DNA damage-binding protein 1 isoform 2 [Vitis vinifera]
Length = 1068
Score = 39.3 bits (90), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 32/130 (24%), Positives = 54/130 (41%), Gaps = 12/130 (9%)
Query: 111 RLIKKTDFHLGQHVNTF----FKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPE 166
RL ++HLG+ VN F +R S + P + +++G +G LP
Sbjct: 930 RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIP------TVIFGTVNGVIGVIASLPH 983
Query: 167 KNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLG 226
Y L LQ + GGL+ +R++ + ++ +DG L+ FL L+
Sbjct: 984 DQYVFLEKLQANLRKVIKGVGGLSHEQWRSFNNEKKTV--DAKNFLDGDLIETFLDLNRT 1041
Query: 227 ERLEICKKIG 236
EI K +
Sbjct: 1042 RMDEISKAMA 1051
>gi|353236335|emb|CCA68332.1| probable splicing factor 3B subunit 3 [Piriformospora indica DSM
11827]
Length = 1243
Score = 39.3 bits (90), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 36/148 (24%), Positives = 63/148 (42%), Gaps = 17/148 (11%)
Query: 110 HRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNY 169
H+ T FH+G + + ++ A A +R + Y L G +G +PL K+
Sbjct: 1112 HKTKMLTHFHVGDVITSIQRV---------ALVAGAREVVVYFGLHGTIGILVPLVTKDD 1162
Query: 170 RRLL-MLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGER 228
+ L+ M T G + ++R GYY P + +DG L F +L ++
Sbjct: 1163 VDFISTLEQHMRTENLSLVGRDHLSWR-----GYY--TPVKATVDGDLCEYFAKLPTQKQ 1215
Query: 229 LEICKKIGSKHNDILDELYDIEALSSHF 256
L I ++ D+L +L + +S F
Sbjct: 1216 LAIAGELDRTVGDVLKKLESLRVTASGF 1243
>gi|410912407|ref|XP_003969681.1| PREDICTED: DNA damage-binding protein 1-like [Takifugu rubripes]
Length = 1140
Score = 38.9 bits (89), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 41/165 (24%), Positives = 68/165 (41%), Gaps = 17/165 (10%)
Query: 96 LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARS---RFLTW 150
LF+ Q ++ + R L + FHLG+ VN F C S + G S +
Sbjct: 974 LFVCQKDSAATTDEDRQHLQEVGVFHLGEFVNVF----CHGSLVLQNLGETSTPTQGSVL 1029
Query: 151 YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 210
+ ++ G +G L E + LL LQN + G + +R++ + ++G
Sbjct: 1030 FGTVTGMIGLVTSLSEGWHSLLLDLQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEQAKG 1087
Query: 211 IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 249
IDG L+ FL L + E+ + G K +DE+ I
Sbjct: 1088 FIDGDLIESFLDLGRAKMQEVVSTLQIDDGSGMKREATVDEVIKI 1132
>gi|167395195|ref|XP_001741266.1| pre-mRNA-splicing factor rse-1 [Entamoeba dispar SAW760]
gi|165894222|gb|EDR22283.1| pre-mRNA-splicing factor rse-1, putative [Entamoeba dispar SAW760]
Length = 1145
Score = 38.9 bits (89), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 36/146 (24%), Positives = 67/146 (45%), Gaps = 23/146 (15%)
Query: 108 GGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEK 167
++++ T F +G +F + S PGA S FL Y++ G L +PL +
Sbjct: 1013 SNYKMVTATQFFIGDIATSF-------AQCSLIPGAPSIFL--YSNFMGGLSALIPLQNQ 1063
Query: 168 N----YRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQL 223
N Y+ L M M H ++ N +FR+ + P + +DG L + +L
Sbjct: 1064 NDIDFYQHLEMH---MRVHWTNLTDRNHISFRS-------SMVPVKDTVDGDLCELYERL 1113
Query: 224 SLGERLEICKKIGSKHNDILDELYDI 249
+ EI +++ + N+I+ +L+D+
Sbjct: 1114 PYEIQQEIAEEMEKEVNEIIKKLHDL 1139
>gi|300707023|ref|XP_002995737.1| hypothetical protein NCER_101290 [Nosema ceranae BRL01]
gi|239604943|gb|EEQ82066.1| hypothetical protein NCER_101290 [Nosema ceranae BRL01]
Length = 1155
Score = 38.9 bits (89), Expect = 2.0, Method: Composition-based stats.
Identities = 40/164 (24%), Positives = 71/164 (43%), Gaps = 22/164 (13%)
Query: 75 EFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKK--TDFHLGQHV-NTFFKIR 131
+F+++ ++G + +D N+ + Y P S G + +K+ T+F+LG+ V
Sbjct: 1000 DFIVKEPALGIICTDFAGNIHTYTYSPVNILSCNGTKFVKRCETNFNLGKLVIKRAHSKL 1059
Query: 132 CKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNP 191
P ISD+ + LD L NY L +QN ++ T GL P
Sbjct: 1060 LNPVFISDS--------NYIIELDS-------LSLDNYNNFLKVQNAYLSLIEDTFGLCP 1104
Query: 192 RAFRTYKGKGYYAGNPS-RGIIDGSLVWKFLQLSLGERLEICKK 234
F + Y+ PS + I L+++FL L + ++ K+
Sbjct: 1105 ENFNNCE---YHLKPPSVKKPILKELLFRFLHLPVDKKANFLKE 1145
>gi|198432469|ref|XP_002129207.1| PREDICTED: similar to DNA damage-binding protein 1 (Damage-specific
DNA-binding protein 1) (UV-damaged DNA-binding factor)
(DDB p127 subunit) (DNA damage-binding protein a) (DDBa)
(UV-damaged DNA-binding protein 1) (UV-DDB 1) (Xeroderma
pigmentosum group E-co... isoform 1 [Ciona intestinalis]
Length = 1150
Score = 38.9 bits (89), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 38/188 (20%), Positives = 77/188 (40%), Gaps = 15/188 (7%)
Query: 68 PRGVYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 127
P + A E + + + +G ++ NV + A +L + FH+G +NTF
Sbjct: 958 PNWMTAVEILDDDNFLG---AENFYNVFICQKDSGATTDEERSKLREAALFHVGDSINTF 1014
Query: 128 FKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 187
+ + S+ + ++ G++G + E Y L +QN + G
Sbjct: 1015 RHGSLVMQNVGET-AVSSKGHILFGTVHGSIGVITTVDEDLYAFLHSIQNRLAKVIKSVG 1073
Query: 188 GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI---------GSK 238
++ ++R++ + RG +DG L+ FL L+ + E+ K + G+K
Sbjct: 1074 NIDHESWRSFCTNEKTEAH--RGFVDGDLIECFLDLNREKMAEVAKGLMVKNFNDQHGTK 1131
Query: 239 HNDILDEL 246
+D+L
Sbjct: 1132 REATVDDL 1139
>gi|443919095|gb|ELU39366.1| cleavage factor protein [Rhizoctonia solani AG-1 IA]
Length = 788
Score = 38.9 bits (89), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 23/94 (24%), Positives = 44/94 (46%), Gaps = 24/94 (25%)
Query: 153 SLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGII 212
S DG + PL E + RL +LQ ++ N G++
Sbjct: 712 SSDGTIASLTPLNESEFGRLQLLQGQLIR------------------------NVHNGVL 747
Query: 213 DGSLVWKFLQLSLGERLEICKKIGSKHNDILDEL 246
DG+L+ F +L + +++E+ ++IG++ IL++L
Sbjct: 748 DGNLLAAFEELPVSKQVEMTQQIGAEREKILNDL 781
>gi|390366809|ref|XP_780126.3| PREDICTED: DNA damage-binding protein 1-like isoform 1
[Strongylocentrotus purpuratus]
Length = 630
Score = 38.5 bits (88), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 32/118 (27%), Positives = 51/118 (43%), Gaps = 7/118 (5%)
Query: 118 FHLGQHVNTFFKIRCKPSPISDA--PGARSRFLTWYASLDGALGFFLPLPEKNYRRLLML 175
FHLG+ VN F I ++ P S + ++ G++G L E+ YR LL +
Sbjct: 489 FHLGEFVNVFRHGSLVMQNIGESTIPTTGS---VLFGTVSGSVGLVTQLNEEFYRFLLEV 545
Query: 176 QNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICK 233
QN + G + +R++ + P IDG L+ FL LS E+ +
Sbjct: 546 QNKLTKVIKSVGKIKHSFWRSFYSE--RKTEPMDNFIDGDLLESFLDLSRDTMDEVAQ 601
>gi|67472417|ref|XP_652013.1| splicing factor 3B subunit 3 [Entamoeba histolytica HM-1:IMSS]
gi|56468814|gb|EAL46625.1| splicing factor 3B subunit 3, putative [Entamoeba histolytica
HM-1:IMSS]
gi|449709507|gb|EMD48761.1| premRNA-splicing factor rse-1, putative [Entamoeba histolytica KU27]
Length = 1145
Score = 38.5 bits (88), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 36/146 (24%), Positives = 67/146 (45%), Gaps = 23/146 (15%)
Query: 108 GGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEK 167
++++ T F +G +F + S PGA S FL Y++ G L +PL +
Sbjct: 1013 SSYKMVTATQFFVGDIATSF-------AQCSLIPGAPSIFL--YSNFMGGLSALIPLQSQ 1063
Query: 168 N----YRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQL 223
N Y+ L M M H ++ N +FR+ + P + +DG L + +L
Sbjct: 1064 NDIDFYQHLEMH---MRVHWTNLTDRNHISFRS-------SIVPVKDTVDGDLCELYERL 1113
Query: 224 SLGERLEICKKIGSKHNDILDELYDI 249
+ EI +++ + N+I+ +L+D+
Sbjct: 1114 PYEIQQEIAEEMEKEVNEIIKKLHDL 1139
>gi|407044064|gb|EKE42344.1| CPSF A subunit region protein, putative [Entamoeba nuttalli P19]
Length = 1145
Score = 38.5 bits (88), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 36/146 (24%), Positives = 67/146 (45%), Gaps = 23/146 (15%)
Query: 108 GGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEK 167
++++ T F +G +F + S PGA S FL Y++ G L +PL +
Sbjct: 1013 SSYKMVTATQFFVGDIATSF-------AQCSLIPGAPSIFL--YSNFMGGLSALIPLQSQ 1063
Query: 168 N----YRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQL 223
N Y+ L M M H ++ N +FR+ + P + +DG L + +L
Sbjct: 1064 NDIDFYQHLEMH---MRVHWTNLTDRNHISFRS-------SIVPVKDTVDGDLCELYERL 1113
Query: 224 SLGERLEICKKIGSKHNDILDELYDI 249
+ EI +++ + N+I+ +L+D+
Sbjct: 1114 PYEIQQEIAEEMEKEVNEIIKKLHDL 1139
>gi|398020786|ref|XP_003863556.1| cleavage and polyadenylation specificity factor-like protein
[Leishmania donovani]
gi|322501789|emb|CBZ36871.1| cleavage and polyadenylation specificity factor-like protein
[Leishmania donovani]
Length = 1542
Score = 37.7 bits (86), Expect = 4.4, Method: Composition-based stats.
Identities = 28/131 (21%), Positives = 57/131 (43%), Gaps = 15/131 (11%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKT------DFHL-GQH 123
V + + + + G +++D +N+V Y+P E G I ++ ++ L G
Sbjct: 1357 VVSNDMLYHDTRFGLLVTDDARNLVCMSYKPRVLEEPGKPPKILESLLTVTGEYRLAGGV 1416
Query: 124 VNTFFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHT 183
+ ++R A R+ + Y + G +G+ +PL ++ R + + +
Sbjct: 1417 LLKMMRLR--------AASTRNSSVAIYVTNMGEIGYLVPLGDQTSRTGQWVGRRLQSEV 1468
Query: 184 SHTGGLNPRAF 194
+H GGL PR F
Sbjct: 1469 AHAGGLPPRMF 1479
>gi|146096490|ref|XP_001467824.1| cleavage and polyadenylation specificity factor-like protein
[Leishmania infantum JPCM5]
gi|134072190|emb|CAM70891.1| cleavage and polyadenylation specificity factor-like protein
[Leishmania infantum JPCM5]
Length = 1542
Score = 37.7 bits (86), Expect = 4.4, Method: Composition-based stats.
Identities = 28/131 (21%), Positives = 57/131 (43%), Gaps = 15/131 (11%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKT------DFHL-GQH 123
V + + + + G +++D +N+V Y+P E G I ++ ++ L G
Sbjct: 1357 VVSNDMLYHDTRFGLLVTDDARNLVCMSYKPRVLEEPGKPPKILESLLTVTGEYRLAGGV 1416
Query: 124 VNTFFKIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHT 183
+ ++R A R+ + Y + G +G+ +PL ++ R + + +
Sbjct: 1417 LLKMMRLR--------AASTRNSSVAIYVTNMGEIGYLVPLGDQTSRTGQWVGRRLQSEV 1468
Query: 184 SHTGGLNPRAF 194
+H GGL PR F
Sbjct: 1469 AHAGGLPPRMF 1479
>gi|307107849|gb|EFN56091.1| hypothetical protein CHLNCDRAFT_145620 [Chlorella variabilis]
Length = 1626
Score = 37.7 bits (86), Expect = 4.7, Method: Composition-based stats.
Identities = 20/67 (29%), Positives = 30/67 (44%)
Query: 73 ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC 132
A +F+I SS+ D + L Y P S G RL+ FH+G+ + ++R
Sbjct: 1425 ATQFLINGSSLHLASCDSAGTLRLLSYAPSHPASWKGQRLVAWGSFHVGEAASCMRRLRL 1484
Query: 133 KPSPISD 139
PS D
Sbjct: 1485 HPSSPED 1491
>gi|342186481|emb|CCC95967.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 1456
Score = 37.7 bits (86), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 33/126 (26%), Positives = 52/126 (41%), Gaps = 5/126 (3%)
Query: 71 VYACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGH--RLIKKTDFHLGQHVNTFF 128
V C+ + + G + S+ ++++L Y P +ES G H + ++ F L
Sbjct: 1271 VVHCDMMYHDRTFGILCSNDQRDLLLMGYTPRVQES-GEHTPSRVLESPFSLDGEYRLPS 1329
Query: 129 KIRCKPSPISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGG 188
K A G S +T Y S G +GF +PL E+ R L + + G
Sbjct: 1330 GCLAKSLRFRSAAGNSS--VTVYISNYGEVGFIVPLGEQANRTALWITRRLQVDLPCDAG 1387
Query: 189 LNPRAF 194
L PR F
Sbjct: 1388 LTPRMF 1393
>gi|302837243|ref|XP_002950181.1| UV-damaged DNA binding complex subunit 1 protein [Volvox carteri f.
nagariensis]
gi|300264654|gb|EFJ48849.1| UV-damaged DNA binding complex subunit 1 protein [Volvox carteri f.
nagariensis]
Length = 1104
Score = 37.0 bits (84), Expect = 7.4, Method: Compositional matrix adjust.
Identities = 35/143 (24%), Positives = 55/143 (38%), Gaps = 11/143 (7%)
Query: 85 FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF----FKIRCKPSPISDA 140
++ D N+V+ ++ RL ++H G VN F +R S
Sbjct: 969 YLEGDNHLNLVVLRRNADSATDEERARLQVVGEYHTGTFVNRFRHGSLVMRPPDSEFVSL 1028
Query: 141 PGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGK 200
P + DG LG LP Y L LQ+ + GGL+ A+ + +
Sbjct: 1029 P-----VPLLFGGTDGRLGVIARLPPGLYEMLTKLQSALRQVVRGVGGLSHEAWIAFSNE 1083
Query: 201 GYYAGNPSRGIIDGSLVWKFLQL 223
A ++G +DG L+ FL L
Sbjct: 1084 RRTAD--AKGFVDGDLIETFLDL 1104
>gi|70945139|ref|XP_742421.1| hypothetical protein [Plasmodium chabaudi chabaudi]
gi|56521397|emb|CAH76894.1| conserved hypothetical protein [Plasmodium chabaudi chabaudi]
Length = 435
Score = 37.0 bits (84), Expect = 8.5, Method: Compositional matrix adjust.
Identities = 41/192 (21%), Positives = 76/192 (39%), Gaps = 13/192 (6%)
Query: 74 CEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCK 133
C FV S F++SD + N ++F +L + F+ G VN +
Sbjct: 244 CTFVCALSKSHFLVSDMESNFLVFQKSSIKYNDEDSFKLSRVALFNHGHVVNKMLPVSLS 303
Query: 134 PSPISDAPGA---RSRFLTWYASLDGALGFFLPLPE-KNYRRLLMLQNVMVTHTSHTGGL 189
+ P R + AS +G++ +P N+++ L ++ + S G +
Sbjct: 304 SLIEEEEPQNEILRKKESILCASSEGSISSIIPFSNLANFKKALCIELALNDSLSSIGNI 363
Query: 190 NPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLE-------ICKKIGSKHNDI 242
N + TYK + +G++DG + F + ++ + I KK+ K
Sbjct: 364 NDNSNNTYKMN--LSEKSCKGVVDGEVFKMFFSMPFEKQFKTYIYAKWIAKKLNCKFGTF 421
Query: 243 LDELYDIEALSS 254
+ + DIE L S
Sbjct: 422 ENFMLDIENLCS 433
>gi|320583269|gb|EFW97484.1| RNA-binding subunit of the mRNA cleavage and polyadenylation factor
[Ogataea parapolymorpha DL-1]
Length = 1309
Score = 37.0 bits (84), Expect = 8.5, Method: Compositional matrix adjust.
Identities = 35/168 (20%), Positives = 71/168 (42%), Gaps = 5/168 (2%)
Query: 73 ACEFVIESSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC 132
A +F++ ++ +++D+D L Y P S G +L++++ +
Sbjct: 1128 AADFLVHDGNLHLLVADEDSVFHLLQYDPYDGNSMKGLKLLRRSLLRSNALTTKMISVAR 1187
Query: 133 KPSPISDAPGARSRFLTWY----ASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGG 188
S S Y +++DG+ +P+ E YRRL +QN + H G
Sbjct: 1188 DRSLFSMVSTLNHEDDLGYEIIGSNIDGSFYKVMPVNEYQYRRLYSIQNYLYDKELHWLG 1247
Query: 189 LNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 236
LNP++ G + R I+ ++ +F+ + + +I +K+G
Sbjct: 1248 LNPKS-NAIGGLTELMPSIKRPFIELNMFHRFIGFNNDRKKQIMQKLG 1294
>gi|294654658|ref|XP_456718.2| DEHA2A08932p [Debaryomyces hansenii CBS767]
gi|218511767|sp|Q6BYK1.2|RSE1_DEBHA RecName: Full=Pre-mRNA-splicing factor RSE1
gi|199429049|emb|CAG84677.2| DEHA2A08932p [Debaryomyces hansenii CBS767]
Length = 1256
Score = 36.6 bits (83), Expect = 9.5, Method: Compositional matrix adjust.
Identities = 42/150 (28%), Positives = 67/150 (44%), Gaps = 21/150 (14%)
Query: 91 DKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSPISDAPGARSRFLTW 150
DK+ L YQ E+ + G RL +F+L Q + T F K S + G + +
Sbjct: 1095 DKDWSLLRYQ-ESYLNGSGSRLKNICEFYL-QDIPTSF---TKGSLVM---GGKESII-- 1144
Query: 151 YASLDGALGFFLPLPEKNYRRLL-MLQNVMVTHTSHT--------GGLNPRAFRTYKGKG 201
Y + G LG LPL +N + L LQ ++ + + G N K +
Sbjct: 1145 YTGIQGTLGLLLPLSTENEVKFLGDLQLLLRKYFDYNFDDFDKDKNGYNLLGKDHLKFRS 1204
Query: 202 YYAGNPSRGIIDGSLVWKFLQLSLGERLEI 231
YY NP + ++DG L+ +F +LS ++ I
Sbjct: 1205 YY--NPVKNVMDGDLIERFYELSQSMKIRI 1232
>gi|448111975|ref|XP_004201977.1| Piso0_001448 [Millerozyma farinosa CBS 7064]
gi|359464966|emb|CCE88671.1| Piso0_001448 [Millerozyma farinosa CBS 7064]
Length = 1249
Score = 36.6 bits (83), Expect = 9.6, Method: Compositional matrix adjust.
Identities = 28/114 (24%), Positives = 52/114 (45%), Gaps = 11/114 (9%)
Query: 150 WYASLDGALGFFLPLPEKNYRRLLM-LQNVMVTHTSHT--------GGLNPRAFRTYKGK 200
+Y L G +G +PL KN + + L+ + ++ + G N K +
Sbjct: 1137 FYTGLMGTVGALIPLVTKNEVQFFIELEAELRSYFDYNFDNFDEQKNGHNLLGKDHLKHR 1196
Query: 201 GYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSS 254
YY NP + +IDG L+ +F ++S ++ I K+ DI ++ +I S+
Sbjct: 1197 SYY--NPVKNVIDGDLIERFSEVSYNNKIRIANKLDRTPKDIDKKISEIRNRSA 1248
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.323 0.140 0.422
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,196,213,230
Number of Sequences: 23463169
Number of extensions: 178515530
Number of successful extensions: 290714
Number of sequences better than 100.0: 583
Number of HSP's better than 100.0 without gapping: 298
Number of HSP's successfully gapped in prelim test: 285
Number of HSP's that attempted gapping in prelim test: 289715
Number of HSP's gapped (non-prelim): 728
length of query: 256
length of database: 8,064,228,071
effective HSP length: 139
effective length of query: 117
effective length of database: 9,097,814,876
effective search space: 1064444340492
effective search space used: 1064444340492
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 75 (33.5 bits)