BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy92
(638 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|350413821|ref|XP_003490124.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Bombus impatiens]
Length = 1417
Score = 773 bits (1995), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 369/631 (58%), Positives = 459/631 (72%), Gaps = 46/631 (7%)
Query: 14 ETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGALKLRFKKLK--VLFVSDRS 71
E V+E+L V+LG HGNRP+LLVR EL IYQA+R+PKG LKLRFKKL ++ R
Sbjct: 827 EMQVREILMVALGHHGNRPMLLVRLDSELQIYQAYRYPKGHLKLRFKKLDHGIIPGQLRP 886
Query: 72 KRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVS 131
K +E + R MRYFSNIAGY GVF+C +P W+FLT RGELR HPM IDGPV+
Sbjct: 887 KPRDEDIPMMNETRHCMMRYFSNIAGYNGVFICSDYPHWIFLTGRGELRTHPMGIDGPVT 946
Query: 132 TLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKT 191
+ APF+N+NCP+GFLYFN K ELRI VLPTHLSYDAPWPVRKVPL+CTPHF+ YHLE+KT
Sbjct: 947 SFAPFNNINCPQGFLYFNRKEELRICVLPTHLSYDAPWPVRKVPLRCTPHFVTYHLESKT 1006
Query: 192 YCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFP 251
YC++TS AEP YY+FNGEDKE + R RFI P QF + LFSP SWE IP T
Sbjct: 1007 YCVITSIAEPLKSYYRFNGEDKEFTEEERPERFIYPSQEQFSIVLFSPVSWETIPNTKIE 1066
Query: 252 LHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQ 311
L +WEHV CLKNVS+ YEGT SGL+GYI LGTNYNY ED+T RGRIL+FDIIEVVPEPGQ
Sbjct: 1067 LDQWEHVTCLKNVSLAYEGTRSGLKGYIVLGTNYNYGEDITSRGRILIFDIIEVVPEPGQ 1126
Query: 312 PLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIA 371
PLTKN+ K IYAKEQKGP+TAI V+GFLV+AVGQKIYIWQLKDNDL G+AFIDT++YI
Sbjct: 1127 PLTKNRFKQIYAKEQKGPITAITQVSGFLVSAVGQKIYIWQLKDNDLVGVAFIDTQIYIH 1186
Query: 372 SMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGS 431
M+S+K+LIL+ D +SI+LLR+Q EYRTLSLV+RD++P + + Y N
Sbjct: 1187 QMLSIKSLILIADVYKSISLLRFQEEYRTLSLVSRDFRPAEVYTIEYLIDN--------- 1237
Query: 432 LVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGG 491
+++GF+++D + N+ LFMYQPE+RES GG
Sbjct: 1238 -------------------------------TNLGFLVADGESNMALFMYQPESRESLGG 1266
Query: 492 HRLIKKTDFHLGQHVNTFFKIRCKPSSISDA----PGARSRFLTWYASLDGALGFFLPLP 547
+LI+K DFHLGQ VNTFF+I+C+ S ++ GA R +T YASLDG+LG+ LP+P
Sbjct: 1267 QKLIRKADFHLGQKVNTFFRIKCRVSDPANDKKHFSGADKRHVTMYASLDGSLGYILPVP 1326
Query: 548 EKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSL 607
EK YRRLLMLQNV+VTH H GLNP+A+RTYK GNP+RGIIDG LVW++L L
Sbjct: 1327 EKTYRRLLMLQNVLVTHICHIAGLNPKAYRTYKSHIRTQGNPARGIIDGDLVWRYLYLPN 1386
Query: 608 GERLEICKKIGSKHNDILDELYDIEALSSHF 638
E++++ KKIG++ +I+++L +I+ ++HF
Sbjct: 1387 NEKIDVAKKIGTRVQEIIEDLTEIDRQTAHF 1417
>gi|340710064|ref|XP_003393618.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Bombus terrestris]
Length = 1417
Score = 771 bits (1991), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 368/631 (58%), Positives = 458/631 (72%), Gaps = 46/631 (7%)
Query: 14 ETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGALKLRFKKLK--VLFVSDRS 71
E V+E+L V+LG HGNRP+LLVR EL IYQA+R+PKG LKLRFKKL ++ +
Sbjct: 827 EMQVREILMVALGHHGNRPMLLVRLDSELQIYQAYRYPKGHLKLRFKKLDHGIIPGQLKP 886
Query: 72 KRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVS 131
K +E + R MRYFSNIAGY GVF+C +P W+FLT RGELR HPM IDGPV+
Sbjct: 887 KLRDEDIPMMNETRHCMMRYFSNIAGYNGVFICSDYPHWIFLTGRGELRTHPMGIDGPVT 946
Query: 132 TLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKT 191
+ APF+N+NCP+GFLYFN K ELRI VLPTHLSYDAPWPVRKVPL+CTPHF+ YHLE+KT
Sbjct: 947 SFAPFNNINCPQGFLYFNRKEELRICVLPTHLSYDAPWPVRKVPLRCTPHFVTYHLESKT 1006
Query: 192 YCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFP 251
YC++TS AEP YY+FNGEDKE + R RFI P QF + LFSP SWE IP T
Sbjct: 1007 YCVITSIAEPLKSYYRFNGEDKEFTEEERPERFIYPSQEQFSIVLFSPVSWETIPNTKIE 1066
Query: 252 LHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQ 311
L +WEHV CLKNVS+ YEGT SGL+GYI LGTNYNY ED+T RGRIL+FDIIEVVPEPGQ
Sbjct: 1067 LDQWEHVTCLKNVSLAYEGTRSGLKGYIVLGTNYNYGEDITSRGRILIFDIIEVVPEPGQ 1126
Query: 312 PLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIA 371
PLTKN+ K IYAKEQKGP+TAI V+GFLV+AVGQKIYIWQLKDNDL G+AFIDT++YI
Sbjct: 1127 PLTKNRFKQIYAKEQKGPITAITQVSGFLVSAVGQKIYIWQLKDNDLVGVAFIDTQIYIH 1186
Query: 372 SMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGS 431
M+S+K+LIL+ D +SI+LLR+Q EYRTLSLV+RD++P + + Y N
Sbjct: 1187 QMLSIKSLILIADVYKSISLLRFQEEYRTLSLVSRDFRPAEVYTIEYLIDN--------- 1237
Query: 432 LVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGG 491
+++GF+++D + N+ LFMYQPE+RES GG
Sbjct: 1238 -------------------------------TNLGFLVADGESNMALFMYQPESRESLGG 1266
Query: 492 HRLIKKTDFHLGQHVNTFFKIRCKPSSISDA----PGARSRFLTWYASLDGALGFFLPLP 547
+LI+K DFHLGQ VNTFF+IRC+ S ++ GA R +T YASLDG+LG+ LP+P
Sbjct: 1267 QKLIRKADFHLGQKVNTFFRIRCRLSDPANDKKHFSGADKRHVTMYASLDGSLGYILPVP 1326
Query: 548 EKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSL 607
EK YRRLLMLQNV+VTH H GLNP+A+RTYK GNP+RGIIDG LVW++ L
Sbjct: 1327 EKTYRRLLMLQNVLVTHICHIAGLNPKAYRTYKSHIRTQGNPARGIIDGDLVWRYFYLPN 1386
Query: 608 GERLEICKKIGSKHNDILDELYDIEALSSHF 638
E++++ KKIG++ +I+++L +I+ ++HF
Sbjct: 1387 NEKIDVAKKIGTRVQEIIEDLTEIDRQTAHF 1417
>gi|383863556|ref|XP_003707246.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Megachile rotundata]
Length = 1415
Score = 771 bits (1990), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 368/631 (58%), Positives = 457/631 (72%), Gaps = 46/631 (7%)
Query: 14 ETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGALKLRFKKLK--VLFVSDRS 71
E V+E+L V+LG HGNRP+LLVR EL IYQ +R+PKG LKLRFKKL ++ + R
Sbjct: 825 EMQVREILMVALGHHGNRPMLLVRLDSELQIYQTYRYPKGHLKLRFKKLDHGIIPGNLRP 884
Query: 72 KRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVS 131
K E R MRYFSNIAGY GVF+C +P W+FLT RGELR HPM IDGP++
Sbjct: 885 KPKEEDMSAMNETRHCMMRYFSNIAGYNGVFICSDYPHWIFLTGRGELRTHPMGIDGPIT 944
Query: 132 TLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKT 191
+ APF+N+NCP+GFLYFN K ELRI VLPTHLSYDAPWPVRKVPL+CTPHF+ YHLE+KT
Sbjct: 945 SFAPFNNINCPQGFLYFNRKEELRICVLPTHLSYDAPWPVRKVPLRCTPHFVTYHLESKT 1004
Query: 192 YCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFP 251
YC++TS AEP YY+FNGEDKE + R RFI P QF + LFSP SWE IP T
Sbjct: 1005 YCVITSIAEPLKSYYRFNGEDKEFTEEDRPDRFIFPSQEQFSIVLFSPVSWETIPNTKIE 1064
Query: 252 LHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQ 311
L +WEHV CLKNVS+ YEGT SGL+GYI LGTNYNY ED+T RGRIL+FDIIEVVPEPGQ
Sbjct: 1065 LDQWEHVTCLKNVSLAYEGTRSGLKGYIVLGTNYNYGEDITSRGRILIFDIIEVVPEPGQ 1124
Query: 312 PLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIA 371
PLTKN+ K IYAKEQKGP+TAI V+GFLV+AVGQKIYIWQLKDNDL G+AFIDT++YI
Sbjct: 1125 PLTKNRFKQIYAKEQKGPITAITQVSGFLVSAVGQKIYIWQLKDNDLVGVAFIDTQIYIH 1184
Query: 372 SMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGS 431
M+S+K+LIL+ D +SI+LLR+Q EYRTLSLV+RD++P + + Y N
Sbjct: 1185 QMLSIKSLILIADVYKSISLLRFQEEYRTLSLVSRDFRPAEVYTIEYLIDN--------- 1235
Query: 432 LVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGG 491
+++GF+++D + N+ LFMYQPE+RES GG
Sbjct: 1236 -------------------------------NNLGFLVADGESNIALFMYQPESRESLGG 1264
Query: 492 HRLIKKTDFHLGQHVNTFFKIRCKPSSISDA----PGARSRFLTWYASLDGALGFFLPLP 547
+LI+K DFHLGQ VNTFF+IRC+ S ++ GA R +T YASLDG+LG+ LP+P
Sbjct: 1265 QKLIRKADFHLGQKVNTFFRIRCRISDPANDKKHFSGADKRHVTMYASLDGSLGYILPVP 1324
Query: 548 EKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSL 607
EK YRRLLMLQNV+VTH H GLNP+A+RTYK GNP+RGIIDG LVW++L L
Sbjct: 1325 EKTYRRLLMLQNVLVTHICHIAGLNPKAYRTYKSYIRTQGNPARGIIDGDLVWRYLYLPN 1384
Query: 608 GERLEICKKIGSKHNDILDELYDIEALSSHF 638
E++++ KKIG++ +I+++L +I+ ++HF
Sbjct: 1385 NEKIDVAKKIGTRVQEIIEDLTEIDRQTAHF 1415
>gi|110750698|ref|XP_624382.2| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
[Apis mellifera]
Length = 1415
Score = 768 bits (1984), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 368/632 (58%), Positives = 456/632 (72%), Gaps = 48/632 (7%)
Query: 14 ETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGALKLRFKKLKVLFVSDRSK- 72
E V+E+L V+LG HGNRP+LLVR EL IYQA+R+PKG LKLRFKKL + +
Sbjct: 825 EMQVREILMVALGHHGNRPMLLVRLDSELQIYQAYRYPKGHLKLRFKKLDHGIIPGHLRP 884
Query: 73 --RANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPV 130
R + P + R MRYFSNIAGY GVF+C +P W+FLT RGELR HPM IDGPV
Sbjct: 885 RPRDEDMPAM-NDTRHCMMRYFSNIAGYNGVFICSDYPHWIFLTGRGELRTHPMGIDGPV 943
Query: 131 STLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETK 190
++ APF+N+NCP+GFLYFN K ELRI VLPTHLSYDAPWPVRKVPL+CTPHF+ YHLE+K
Sbjct: 944 TSFAPFNNINCPQGFLYFNRKEELRICVLPTHLSYDAPWPVRKVPLRCTPHFVTYHLESK 1003
Query: 191 TYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNF 250
TYC++TS AEP YY+FNGEDKE + R RFI P QF + LFSP SWE IP T
Sbjct: 1004 TYCVITSIAEPLKSYYRFNGEDKEFTEEERPDRFIFPSQEQFSIVLFSPVSWETIPNTKI 1063
Query: 251 PLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPG 310
L +WEHV CLKNVS+ YEGT SGL+GYI LGTNYNY ED+T RGRIL+FDIIEVVPEPG
Sbjct: 1064 ELDQWEHVTCLKNVSLAYEGTRSGLKGYIVLGTNYNYGEDITSRGRILIFDIIEVVPEPG 1123
Query: 311 QPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYI 370
QPLTKN+ K IYAKEQKGP+TAI V+GFLV+AVGQKIYIWQLKDNDL G+AFIDT++YI
Sbjct: 1124 QPLTKNRFKQIYAKEQKGPITAITQVSGFLVSAVGQKIYIWQLKDNDLVGVAFIDTQIYI 1183
Query: 371 ASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDG 430
M+S+K+LIL+ D +SI+LLR+Q EYRTLSLV+RD++P + + Y N
Sbjct: 1184 HQMLSIKSLILIADVYKSISLLRFQEEYRTLSLVSRDFRPAEVYTIEYLIDN-------- 1235
Query: 431 SLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNG 490
+++GF+++D + N+ LFMYQPE+RES G
Sbjct: 1236 --------------------------------TNLGFLVADGESNIALFMYQPESRESLG 1263
Query: 491 GHRLIKKTDFHLGQHVNTFFKIRCKPSSISDA----PGARSRFLTWYASLDGALGFFLPL 546
G +LI+K DFHLGQ VNTFF+IRC+ S ++ A R +T YASLDG LG+ LP+
Sbjct: 1264 GQKLIRKADFHLGQKVNTFFRIRCRISDPANDKKHFSDADKRHVTMYASLDGNLGYILPV 1323
Query: 547 PEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLS 606
PEK YRRLLMLQNV+VTH H GLNP+A+RTYK GNP+RGIIDG LVW++L L
Sbjct: 1324 PEKTYRRLLMLQNVLVTHICHIAGLNPKAYRTYKSHIRTQGNPARGIIDGDLVWRYLYLP 1383
Query: 607 LGERLEICKKIGSKHNDILDELYDIEALSSHF 638
E++++ KKIG++ +I+++L +I+ ++HF
Sbjct: 1384 NNEKIDVAKKIGTRVQEIIEDLTEIDRQTAHF 1415
>gi|345482082|ref|XP_001607052.2| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Nasonia vitripennis]
Length = 1415
Score = 766 bits (1977), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 361/631 (57%), Positives = 457/631 (72%), Gaps = 46/631 (7%)
Query: 14 ETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGALKLRFKKLKVLFVSDRSKR 73
E V+E+ V+LG HGNRP+LLVR EL IYQ +R+PKG LKLRFKK+ F+ S+
Sbjct: 825 EVQVREIAVVALGHHGNRPMLLVRLDSELQIYQVYRYPKGHLKLRFKKIDHNFIVGFSRI 884
Query: 74 ANEQPGLP--RGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVS 131
++ +P R+ MRYFSNIAGY GVF+ G +P W+FLT RGELRAHPM IDGPV
Sbjct: 885 GPKEEDMPSMNDTRLCMMRYFSNIAGYNGVFIGGDYPHWIFLTGRGELRAHPMNIDGPVK 944
Query: 132 TLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKT 191
+ APF+NVNCP+GFLYFN K ELRI VLPTHLSYDAPWPVRKVPL+CTPHF+ YHLE+KT
Sbjct: 945 SFAPFNNVNCPQGFLYFNRKDELRICVLPTHLSYDAPWPVRKVPLRCTPHFVTYHLESKT 1004
Query: 192 YCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFP 251
YC+VTSTAEP YY+FNGEDKE + R+ RF+ P QF + LFSP SW+ IP T
Sbjct: 1005 YCVVTSTAEPLKSYYRFNGEDKEFTEEERNERFLYPTQEQFSIVLFSPVSWDTIPNTKID 1064
Query: 252 LHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQ 311
L +WEHV CLKNVS+ YEGT SGL+GYI +GTNYNY ED+T RGRI +FDIIEVVPEPGQ
Sbjct: 1065 LDQWEHVTCLKNVSLAYEGTRSGLKGYIVIGTNYNYGEDITSRGRIFIFDIIEVVPEPGQ 1124
Query: 312 PLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIA 371
PLTKN+ K IYAKEQKGPVTAI V+GFLV+A+GQKIYIWQLKDNDL G+AFIDT++Y+
Sbjct: 1125 PLTKNRFKQIYAKEQKGPVTAITQVSGFLVSAIGQKIYIWQLKDNDLVGVAFIDTQIYVC 1184
Query: 372 SMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGS 431
M+S+K+LILV D +S++LLR+QPEY+TLSLV+RD++ T+ + Y+ N
Sbjct: 1185 QMLSIKSLILVADVYKSVSLLRFQPEYKTLSLVSRDFRTTEIYAIEYFIQN--------- 1235
Query: 432 LVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGG 491
+ +GF+++D + N+ +F YQPE+ +S GG
Sbjct: 1236 -------------------------------NELGFIVADGESNISIFSYQPESSQSLGG 1264
Query: 492 HRLIKKTDFHLGQHVNTFFKIRCKPSSISDA----PGARSRFLTWYASLDGALGFFLPLP 547
+LI+K D HLGQ +NTFF+I+CK + ++ GA R +T YA+LDG+LG+ LP+P
Sbjct: 1265 QKLIRKADIHLGQKINTFFRIKCKTTDSANPTKQFSGADKRHVTMYATLDGSLGYILPVP 1324
Query: 548 EKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSL 607
EK YRRLLMLQNV+V+H H GLNP+AFRTYK GNP+RGIIDG LV K+L L +
Sbjct: 1325 EKTYRRLLMLQNVLVSHIYHIAGLNPKAFRTYKSCVRMQGNPARGIIDGDLVRKYLDLPV 1384
Query: 608 GERLEICKKIGSKHNDILDELYDIEALSSHF 638
E++EI KKIG+ +I+D++++I +SHF
Sbjct: 1385 NEKIEIAKKIGTGAQEIMDDMHEIYKQTSHF 1415
>gi|270003792|gb|EFA00240.1| hypothetical protein TcasGA2_TC003068 [Tribolium castaneum]
Length = 1392
Score = 761 bits (1965), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 364/641 (56%), Positives = 468/641 (73%), Gaps = 48/641 (7%)
Query: 6 SHSPSAMDETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGALKLRFKKLK-- 63
+H + + V+E+L V+LG HG+RPLL+VR + +L IY+ FR P+G LK+RF+K+K
Sbjct: 792 AHEANIQRQFDVKEILVVALGNHGSRPLLMVRLERDLYIYEVFRFPRGNLKMRFRKIKHS 851
Query: 64 VLFVSDRSKRANEQPGLPRGV--RISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRA 121
+++ + S R + + + RI +MRYF+NIAGY GVF+CG +P W+F+++RGELR
Sbjct: 852 LIYSPNVSGRIDTEDSDFFAIQERIIKMRYFTNIAGYNGVFVCGANPHWIFMSARGELRT 911
Query: 122 HPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPH 181
HPMTIDG V + A F+NVNCP+GFLYFN KSELRI VLPTHLSYDA WPVRKVPL+CTPH
Sbjct: 912 HPMTIDGEVLSFAAFNNVNCPQGFLYFNRKSELRIGVLPTHLSYDAAWPVRKVPLRCTPH 971
Query: 182 FLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFS 241
F+ YHLE+KTYC+VTS AEPS YYKFNGEDKEL + R RF PL +F + LFSP S
Sbjct: 972 FVTYHLESKTYCLVTSIAEPSNKYYKFNGEDKELSVEDRGDRFPYPLQEKFSLMLFSPVS 1031
Query: 242 WEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFD 301
W+ IP T L EWEHV CLKNVS+ YEGT SGL+GYIA+GTNYNY EDVT RGRIL+FD
Sbjct: 1032 WDVIPNTKIDLDEWEHVNCLKNVSLAYEGTRSGLKGYIAVGTNYNYGEDVTSRGRILIFD 1091
Query: 302 IIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGI 361
IIEVVPEPGQPLTKN+ K IYAK+QKGPVTA+ V GFLV+AVGQKIYIWQLKDNDL G+
Sbjct: 1092 IIEVVPEPGQPLTKNRFKEIYAKDQKGPVTALSQVKGFLVSAVGQKIYIWQLKDNDLVGV 1151
Query: 362 AFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAG 421
AFIDT++Y ++++K+L+LV D +SI+LLR+Q EYRTLSLV+RD++P + S Y
Sbjct: 1152 AFIDTQIYTHQILTIKSLLLVADVYKSISLLRFQEEYRTLSLVSRDFRPCEVFSVEYMID 1211
Query: 422 NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMY 481
N ++MGF++SD +KN+VL+MY
Sbjct: 1212 N----------------------------------------TTMGFLVSDSEKNLVLYMY 1231
Query: 482 QPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDA----PGARSRFLTWYASLD 537
QPE+RES GG RL++K DFHLGQ VN+FF+I+CK + + GA R +T YA+LD
Sbjct: 1232 QPESRESLGGQRLLRKADFHLGQAVNSFFRIKCKLGELGEDKKNLTGADKRHITMYATLD 1291
Query: 538 GALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGS 597
G LG+ +P+PEK YRRLLMLQNV+V+ +H GLNP+AFRTYK NP+R +IDG
Sbjct: 1292 GGLGYIMPVPEKTYRRLLMLQNVLVSQGAHIAGLNPKAFRTYKSWKKLQTNPARSVIDGE 1351
Query: 598 LVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
LV+ +LQLS+ E+LE+ KKIG+K ++LD+L DI+ +++HF
Sbjct: 1352 LVYNYLQLSIPEKLEVSKKIGTKLEELLDDLSDIQKITNHF 1392
>gi|91078626|ref|XP_968117.1| PREDICTED: similar to cleavage and polyadenylation specificity factor
cpsf [Tribolium castaneum]
Length = 1413
Score = 761 bits (1965), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 364/641 (56%), Positives = 468/641 (73%), Gaps = 48/641 (7%)
Query: 6 SHSPSAMDETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGALKLRFKKLK-- 63
+H + + V+E+L V+LG HG+RPLL+VR + +L IY+ FR P+G LK+RF+K+K
Sbjct: 813 AHEANIQRQFDVKEILVVALGNHGSRPLLMVRLERDLYIYEVFRFPRGNLKMRFRKIKHS 872
Query: 64 VLFVSDRSKRANEQPGLPRGV--RISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRA 121
+++ + S R + + + RI +MRYF+NIAGY GVF+CG +P W+F+++RGELR
Sbjct: 873 LIYSPNVSGRIDTEDSDFFAIQERIIKMRYFTNIAGYNGVFVCGANPHWIFMSARGELRT 932
Query: 122 HPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPH 181
HPMTIDG V + A F+NVNCP+GFLYFN KSELRI VLPTHLSYDA WPVRKVPL+CTPH
Sbjct: 933 HPMTIDGEVLSFAAFNNVNCPQGFLYFNRKSELRIGVLPTHLSYDAAWPVRKVPLRCTPH 992
Query: 182 FLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFS 241
F+ YHLE+KTYC+VTS AEPS YYKFNGEDKEL + R RF PL +F + LFSP S
Sbjct: 993 FVTYHLESKTYCLVTSIAEPSNKYYKFNGEDKELSVEDRGDRFPYPLQEKFSLMLFSPVS 1052
Query: 242 WEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFD 301
W+ IP T L EWEHV CLKNVS+ YEGT SGL+GYIA+GTNYNY EDVT RGRIL+FD
Sbjct: 1053 WDVIPNTKIDLDEWEHVNCLKNVSLAYEGTRSGLKGYIAVGTNYNYGEDVTSRGRILIFD 1112
Query: 302 IIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGI 361
IIEVVPEPGQPLTKN+ K IYAK+QKGPVTA+ V GFLV+AVGQKIYIWQLKDNDL G+
Sbjct: 1113 IIEVVPEPGQPLTKNRFKEIYAKDQKGPVTALSQVKGFLVSAVGQKIYIWQLKDNDLVGV 1172
Query: 362 AFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAG 421
AFIDT++Y ++++K+L+LV D +SI+LLR+Q EYRTLSLV+RD++P + S Y
Sbjct: 1173 AFIDTQIYTHQILTIKSLLLVADVYKSISLLRFQEEYRTLSLVSRDFRPCEVFSVEYMID 1232
Query: 422 NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMY 481
N ++MGF++SD +KN+VL+MY
Sbjct: 1233 N----------------------------------------TTMGFLVSDSEKNLVLYMY 1252
Query: 482 QPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDA----PGARSRFLTWYASLD 537
QPE+RES GG RL++K DFHLGQ VN+FF+I+CK + + GA R +T YA+LD
Sbjct: 1253 QPESRESLGGQRLLRKADFHLGQAVNSFFRIKCKLGELGEDKKNLTGADKRHITMYATLD 1312
Query: 538 GALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGS 597
G LG+ +P+PEK YRRLLMLQNV+V+ +H GLNP+AFRTYK NP+R +IDG
Sbjct: 1313 GGLGYIMPVPEKTYRRLLMLQNVLVSQGAHIAGLNPKAFRTYKSWKKLQTNPARSVIDGE 1372
Query: 598 LVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
LV+ +LQLS+ E+LE+ KKIG+K ++LD+L DI+ +++HF
Sbjct: 1373 LVYNYLQLSIPEKLEVSKKIGTKLEELLDDLSDIQKITNHF 1413
>gi|307190910|gb|EFN74734.1| Cleavage and polyadenylation specificity factor subunit 1 [Camponotus
floridanus]
Length = 1418
Score = 761 bits (1964), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 364/633 (57%), Positives = 457/633 (72%), Gaps = 48/633 (7%)
Query: 14 ETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGALKLRFKKLKVLFVSDRSKR 73
E V+E+L V+LG HGNRP+LLVR EL IYQA+++PKG LKLRFKKL+ + R
Sbjct: 826 EMQVREILMVALGHHGNRPMLLVRLDSELQIYQAYKYPKGYLKLRFKKLEHGIIPGRLSP 885
Query: 74 ANEQPGLPRGV---RISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPV 130
++ +P RI MRYFSNIAGY GVF+C +P W+FLT RGELR HPM IDGP+
Sbjct: 886 KPKEEDMPMNASETRICMMRYFSNIAGYNGVFICCDYPHWIFLTGRGELRTHPMGIDGPI 945
Query: 131 STLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETK 190
++ A F+NVNCP+GFLYFN K ELRI VLPTHLSYDAPWPVRKVPL+CTPHF+ YHLE+K
Sbjct: 946 TSFAAFNNVNCPQGFLYFNRKEELRICVLPTHLSYDAPWPVRKVPLRCTPHFVTYHLESK 1005
Query: 191 TYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNF 250
TYC++TS AEP YY+FNGEDKE + R RF+ P QF + LFSP SWE IP T
Sbjct: 1006 TYCVITSIAEPLKSYYRFNGEDKEFTEEERPERFLYPSQEQFSIVLFSPVSWETIPNTKI 1065
Query: 251 PLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPG 310
L +WEHV CLKNVS+ YEGT SGL+GYI LGTNYNY ED+T RGRIL+FDIIEVVPEPG
Sbjct: 1066 ELEQWEHVTCLKNVSLAYEGTRSGLKGYIVLGTNYNYGEDITSRGRILIFDIIEVVPEPG 1125
Query: 311 QPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYI 370
QPLTKN+ K IYAKEQKGP+TAI V+GFLV+AVGQKIYIWQLKDNDL G+AFIDT++YI
Sbjct: 1126 QPLTKNRFKQIYAKEQKGPITAITQVSGFLVSAVGQKIYIWQLKDNDLVGVAFIDTQIYI 1185
Query: 371 ASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDG 430
M+S+K+LIL+ D +SI+LLR+Q EYRTLSLV+RD++P + + Y N
Sbjct: 1186 HQMLSIKSLILIADVYKSISLLRFQEEYRTLSLVSRDFRPAEVYTIEYLIDN-------- 1237
Query: 431 SLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNG 490
+++GF ++D + N+ LFMYQPE+RES G
Sbjct: 1238 --------------------------------TNLGFFLADGESNLALFMYQPESRESLG 1265
Query: 491 GHRLIKKTDFHLGQHVNTFFKIRCKPSSISDA----PGARSRFLTWYASLDGALGFFLPL 546
G +LI+K DFHLGQ VNTFF+IRC+ S ++ GA R +T YA+LDG+LG+ LP+
Sbjct: 1266 GQKLIRKADFHLGQKVNTFFRIRCRVSDPANDKKQFSGADKRHVTMYATLDGSLGYILPV 1325
Query: 547 PEKNYRRLLMLQNVMVTHTSHTGGLNPRAFR-TYKGKGYYAGNPSRGIIDGSLVWKFLQL 605
PEK YRRLLMLQNV+VTH H GLNP+++R TYK GNP+RGIIDG LVW++L L
Sbjct: 1326 PEKTYRRLLMLQNVLVTHICHIAGLNPKSYRQTYKSYIRNQGNPARGIIDGDLVWRYLFL 1385
Query: 606 SLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
E+ ++ KKIG++ +I++++ +I+ ++HF
Sbjct: 1386 PNNEKTDVAKKIGTRVQEIIEDITEIDRQTAHF 1418
>gi|307191845|gb|EFN75271.1| Cleavage and polyadenylation specificity factor subunit 1
[Harpegnathos saltator]
Length = 1214
Score = 759 bits (1960), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 361/632 (57%), Positives = 455/632 (71%), Gaps = 47/632 (7%)
Query: 14 ETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGALKLRFKKLKVLFVSDRSKR 73
E V+E+L V+LG HGNRP+LLVR EL IYQA+++PKG LKLRFKKL + R
Sbjct: 623 ELQVREVLMVALGHHGNRPMLLVRLDSELQIYQAYKYPKGHLKLRFKKLDHGIIPGHLSR 682
Query: 74 ANEQPGLP---RGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPV 130
++ +P RI MRYFSNIAGY GVF+C +P W+FLT RGELR HPM IDG V
Sbjct: 683 KPKEEDVPVNANETRICMMRYFSNIAGYNGVFICSDYPHWIFLTGRGELRTHPMGIDGSV 742
Query: 131 STLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETK 190
++ A F+N+NCP+GFLYFN K ELRI VLPTHLSYDAPWPVRKVPL+CTPHF+ YHLE+K
Sbjct: 743 TSFAAFNNINCPQGFLYFNRKEELRICVLPTHLSYDAPWPVRKVPLRCTPHFVTYHLESK 802
Query: 191 TYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNF 250
TYC++TST+EP YY+FNGEDKE + R RF+ P QF + LFSP SWE IP T
Sbjct: 803 TYCVITSTSEPLKSYYRFNGEDKEFTEEDRPERFLYPSQEQFCIVLFSPVSWETIPNTKI 862
Query: 251 PLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPG 310
L +WEHV CLKNVS+ YEGT SGL+GYI LGTNYNY ED+T RGRIL+FDIIEVVPEPG
Sbjct: 863 ELDQWEHVTCLKNVSLAYEGTRSGLKGYIVLGTNYNYGEDITSRGRILIFDIIEVVPEPG 922
Query: 311 QPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYI 370
QPLTKN+ K IYAKEQKGP+TAI V+GFLVTAVGQKIYIWQLKDNDL GIAFIDT++YI
Sbjct: 923 QPLTKNRFKQIYAKEQKGPITAITQVSGFLVTAVGQKIYIWQLKDNDLVGIAFIDTQIYI 982
Query: 371 ASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDG 430
M+S+K+LIL+ D +SI+LLR+Q + RTLSLV+RD++P + + Y N
Sbjct: 983 HQMLSIKSLILIADVYKSISLLRFQEKCRTLSLVSRDFRPAEVYTIEYLIDN-------- 1034
Query: 431 SLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNG 490
+++GF+I+D + N+ LFMYQPE+RES G
Sbjct: 1035 --------------------------------TNLGFLIADGESNLALFMYQPESRESLG 1062
Query: 491 GHRLIKKTDFHLGQHVNTFFKIRCKPSSISDA----PGARSRFLTWYASLDGALGFFLPL 546
G +LI+K DFHLGQ +NTFF+I+C+ + ++ A + +T YASLDG+LG+ LP+
Sbjct: 1063 GQKLIRKADFHLGQKINTFFRIKCRVTDVASDKKHFSDADKKHVTMYASLDGSLGYVLPV 1122
Query: 547 PEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLS 606
PEK YRRLLMLQNV+VTH H GLNP+A+RTYK GNP+RGIIDG LVW++L L
Sbjct: 1123 PEKTYRRLLMLQNVLVTHICHIAGLNPKAYRTYKSYVRNQGNPARGIIDGDLVWRYLSLP 1182
Query: 607 LGERLEICKKIGSKHNDILDELYDIEALSSHF 638
E+ ++ KKIG++ +I++++ +I+ ++HF
Sbjct: 1183 NNEKADVAKKIGTRVQEIIEDITEIDRQTAHF 1214
>gi|380014171|ref|XP_003691113.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Apis florea]
Length = 1583
Score = 754 bits (1948), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 362/612 (59%), Positives = 440/612 (71%), Gaps = 48/612 (7%)
Query: 14 ETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGALKLRFKKLKVLFVSDRSK- 72
E V+E+L V+LG HGNRP+LLVR EL IYQA+R+PKG LKLRFKKL + +
Sbjct: 825 EMQVREILMVALGHHGNRPMLLVRLDSELQIYQAYRYPKGHLKLRFKKLDHGIIPGHLRP 884
Query: 73 --RANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPV 130
R + P + R MRYFSNIAGY GVF+C +P W+FLT RGELR HPM IDGPV
Sbjct: 885 RPRDEDMPAM-NDTRHCMMRYFSNIAGYNGVFICSDYPHWIFLTGRGELRTHPMGIDGPV 943
Query: 131 STLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETK 190
++ APF+N+NCP+GFLYFN K ELRI VLPTHLSYDAPWPVRKVPL+CTPHF+ YHLE+K
Sbjct: 944 TSFAPFNNINCPQGFLYFNRKEELRICVLPTHLSYDAPWPVRKVPLRCTPHFVTYHLESK 1003
Query: 191 TYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNF 250
TYC++TS AEP YY+FNGEDKE + R RFI P QF + LFSP SWE IP T
Sbjct: 1004 TYCVITSIAEPLKSYYRFNGEDKEFTEEERPDRFIYPSQEQFSIVLFSPVSWETIPNTKI 1063
Query: 251 PLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPG 310
L +WEHV CLKNVS+ YEGT SGL+GYI LGTNYNY ED+T RGRIL+FDIIEVVPEPG
Sbjct: 1064 ELDQWEHVTCLKNVSLAYEGTRSGLKGYIVLGTNYNYGEDITSRGRILIFDIIEVVPEPG 1123
Query: 311 QPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYI 370
QPLTKN+ K IYAKEQKGP+TAI V+GFLV+AVGQKIYIWQLKDNDL G+AFIDT++YI
Sbjct: 1124 QPLTKNRFKQIYAKEQKGPITAITQVSGFLVSAVGQKIYIWQLKDNDLVGVAFIDTQIYI 1183
Query: 371 ASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDG 430
M+S+K+LIL+ D +SI+LLR+Q EYRTLSLV+RD++P + + Y N
Sbjct: 1184 HQMLSIKSLILIADVYKSISLLRFQEEYRTLSLVSRDFRPAEVYTIEYLIDN-------- 1235
Query: 431 SLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNG 490
+++GF+++D + N+ LFMYQPE+RES G
Sbjct: 1236 --------------------------------TNLGFLVADGESNIALFMYQPESRESLG 1263
Query: 491 GHRLIKKTDFHLGQHVNTFFKIRCKPSSISDA----PGARSRFLTWYASLDGALGFFLPL 546
G +LI+K DFHLGQ VNTFF+IRC+ S ++ A R +T YASLDG LG+ LP+
Sbjct: 1264 GQKLIRKADFHLGQKVNTFFRIRCRISDPANDKKHFSDADKRHVTMYASLDGNLGYILPV 1323
Query: 547 PEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLS 606
PEK YRRLLMLQNV+VTH H GLNP+A+RTYK GNP+RGIIDG LVW++L L
Sbjct: 1324 PEKTYRRLLMLQNVLVTHICHIAGLNPKAYRTYKSHIRTQGNPARGIIDGDLVWRYLYLP 1383
Query: 607 LGERLEICKKIG 618
E++++ KKI
Sbjct: 1384 NNEKIDVAKKIA 1395
>gi|322792443|gb|EFZ16427.1| hypothetical protein SINV_15375 [Solenopsis invicta]
Length = 1532
Score = 751 bits (1938), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 360/612 (58%), Positives = 443/612 (72%), Gaps = 48/612 (7%)
Query: 17 VQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGALKLRFKKLKVLFVSDRSKRANE 76
V+E+L V+LG HGNRP+LLVR EL IYQ +R+PKG LKLRFKKL + R +
Sbjct: 796 VREILMVALGHHGNRPMLLVRLDSELQIYQVYRYPKGYLKLRFKKLDHGIIPGRLSPRPK 855
Query: 77 QPGLPRGV---RISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTL 133
+ +PR RI MRYFSNIAGY GVF+C +P W+FLT RGELR HPM IDG V++
Sbjct: 856 EEDVPRNTSDTRICVMRYFSNIAGYNGVFICSDYPHWIFLTGRGELRTHPMGIDGSVTSF 915
Query: 134 APFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYC 193
A F+N+NCP+GFLYFN K ELRI VLPTHLSYDAPWPVRKVPL+CTPHF+ YHLE+KTYC
Sbjct: 916 AAFNNINCPQGFLYFNRKEELRICVLPTHLSYDAPWPVRKVPLRCTPHFVTYHLESKTYC 975
Query: 194 IVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLH 253
++TSTAEP YY+FNGEDKE + R RF+ P QF + LFSP SWE IP T L
Sbjct: 976 VITSTAEPLKSYYRFNGEDKEFTEEERPDRFLYPSQEQFSIVLFSPVSWETIPNTKIELD 1035
Query: 254 EWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPL 313
+WEHV CLKNVS+ YEGT SGL+GYI LGTNYNY ED+T RGRIL+FDIIEVVPEPGQPL
Sbjct: 1036 QWEHVTCLKNVSLAYEGTRSGLKGYIVLGTNYNYGEDITSRGRILIFDIIEVVPEPGQPL 1095
Query: 314 TKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASM 373
TKN+ K IYAKEQKGP+TAI V+GFLV+AVGQKIYIWQLKDNDL G+AFIDT++YI M
Sbjct: 1096 TKNRFKQIYAKEQKGPITAITQVSGFLVSAVGQKIYIWQLKDNDLVGVAFIDTQIYIHQM 1155
Query: 374 VSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLV 433
+S+K+LIL+ D +SI+LLR+Q EYRTLSLV+RD++P + + Y N
Sbjct: 1156 LSIKSLILIADVYKSISLLRFQEEYRTLSLVSRDFRPAEVYTIEYLIDN----------- 1204
Query: 434 WKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR 493
+++GF+++D + N+ LFMYQPE+RES GG +
Sbjct: 1205 -----------------------------TNLGFIVADGESNLALFMYQPESRESLGGQK 1235
Query: 494 LIKKTDFHLGQHVNTFFKIRCKPSSISDA----PGARSRFLTWYASLDGALGFFLPLPEK 549
LI+K DFHLGQ VNTFF+IRC+ + ++ GA R +T YASLDG+LG+ LP+PEK
Sbjct: 1236 LIRKADFHLGQKVNTFFRIRCRVTDPANDKKQFSGADKRHVTMYASLDGSLGYILPVPEK 1295
Query: 550 NYRRLLMLQNVMVTHTSHTGGLNPRAFR-TYKGKGYYAGNPSRGIIDGSLVWKFLQLSLG 608
YRRLLMLQNV+VTH H GLNP+++R TYK GNP+RGIIDG LVW++L L
Sbjct: 1296 TYRRLLMLQNVLVTHICHIAGLNPKSYRHTYKSYIRNQGNPARGIIDGDLVWRYLFLPNN 1355
Query: 609 ERLEICKKIGSK 620
E+ ++ KKIG++
Sbjct: 1356 EKADLAKKIGTR 1367
>gi|242021233|ref|XP_002431050.1| Cleavage and polyadenylation specificity factor 160 kDa subunit,
putative [Pediculus humanus corporis]
gi|212516279|gb|EEB18312.1| Cleavage and polyadenylation specificity factor 160 kDa subunit,
putative [Pediculus humanus corporis]
Length = 1409
Score = 744 bits (1921), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 354/633 (55%), Positives = 466/633 (73%), Gaps = 47/633 (7%)
Query: 13 DETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGA-LKLRFKKL-KVLFVSDR 70
D+ + ELL VSLG G RP+LL+RT+++L+IYQAF+ KG LK+RF++L + L + +R
Sbjct: 817 DDPEIHELLVVSLGHLGRRPILLLRTENDLMIYQAFKFAKGPNLKIRFRRLPQTLILKER 876
Query: 71 -SKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGP 129
+K + R +++RYFSNI+GY GVF+CGP+P WLFLT+RGELR+HPM IDG
Sbjct: 877 KAKFKVKYENEVESERATRLRYFSNISGYNGVFVCGPNPHWLFLTARGELRSHPMLIDGR 936
Query: 130 VSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLET 189
V++ A FHNVNCP GFLYF +K ELRI +LPTHLSYDAPWPVRKVPL+CTPH + YHLE+
Sbjct: 937 VTSFASFHNVNCPLGFLYFTSKCELRICILPTHLSYDAPWPVRKVPLRCTPHMVTYHLES 996
Query: 190 KTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTN 249
KTYC++TS++EPS +Y++FNGEDKE + RD RF PL +F + LFSP SWE IP T
Sbjct: 997 KTYCLITSSSEPSNEYFRFNGEDKEHSVEDRDDRFPLPLQDKFSIVLFSPVSWEVIPNTK 1056
Query: 250 FPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEP 309
L EWEHV C+K V++ YEGT SGL+GY+A+GTNYNYSED+T +GRIL++DIIEVVPEP
Sbjct: 1057 MELDEWEHVTCVKTVNLSYEGTRSGLKGYVAVGTNYNYSEDITSKGRILIYDIIEVVPEP 1116
Query: 310 GQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVY 369
GQPLTKN+ K +YAKEQKGPVTA+CHV GFLVTA+GQKIYIWQLKDNDL GIAFIDT++Y
Sbjct: 1117 GQPLTKNRFKTVYAKEQKGPVTALCHVLGFLVTAMGQKIYIWQLKDNDLVGIAFIDTQIY 1176
Query: 370 IASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIID 429
I M+SVK+LILV D +SI+LLR+Q EYRTLSLV+RD++P + YA
Sbjct: 1177 IHQMISVKSLILVADVYKSISLLRFQEEYRTLSLVSRDFRPCE-----VYA--------- 1222
Query: 430 GSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESN 489
++L + + MGF+ISD + N++++MY+PE R+S
Sbjct: 1223 --------------------------IELLLDNTQMGFLISDVEMNIIMYMYKPEDRDSV 1256
Query: 490 GGHRLIKKTDFHLGQHVNTFFKIRCKPSSIS---DAP-GARSRFLTWYASLDGALGFFLP 545
GG +L++K DFHLGQH+N++F+IRC+ + D P GA R ++ +A+LDGALG+ LP
Sbjct: 1257 GGQKLLRKADFHLGQHINSWFRIRCRLGDQAENYDFPIGAEKRHISMFATLDGALGYLLP 1316
Query: 546 LPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQL 605
+PEK YRRL MLQN++V H H GLNP+AFR YK GNP + I+DG L+W +L L
Sbjct: 1317 IPEKTYRRLQMLQNILVYHIPHLAGLNPKAFRIYKSGRKLLGNPCKRIVDGELIWMYLSL 1376
Query: 606 SLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
++ E+ ++ KK+GSK +DI++++ IE LS HF
Sbjct: 1377 TVMEKQDVAKKMGSKMDDIIEDIAVIERLSGHF 1409
>gi|332018184|gb|EGI58789.1| Cleavage and polyadenylation specificity factor subunit 1 [Acromyrmex
echinatior]
Length = 1412
Score = 733 bits (1892), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 357/612 (58%), Positives = 440/612 (71%), Gaps = 52/612 (8%)
Query: 17 VQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGALKLRFKKLKVLFVSDRSKRANE 76
V+E+L V+LG HGNRP+LLVR +L IYQA+R+PKG LKLRFKKL + R +
Sbjct: 827 VREILMVALGHHGNRPMLLVRLDSDLQIYQAYRYPKGYLKLRFKKLDHGIIPGRLSPRPK 886
Query: 77 QPGLPRG---VRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTL 133
+ +PR RI MRYFSNIAGY GVF+C +P W+FLT RGELR HPM IDGPV++
Sbjct: 887 EEDVPRNRNITRICVMRYFSNIAGYNGVFICSDYPHWIFLTGRGELRTHPMGIDGPVTSF 946
Query: 134 APFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYC 193
APF+N+NCP+GFLYFN K ELRI VLPTHLSYDAPWPVRKVPL+CTPHF+ YHLE+KTYC
Sbjct: 947 APFNNINCPQGFLYFNRKEELRICVLPTHLSYDAPWPVRKVPLRCTPHFVTYHLESKTYC 1006
Query: 194 IVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLH 253
++TSTAEP YY+FNGEDK L ++ F LFSP SWE IP T L
Sbjct: 1007 VITSTAEPLKSYYRFNGEDKVLTK----LYYLFQFSRIFMNLLFSPVSWETIPNTKIELD 1062
Query: 254 EWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPL 313
+WEHV CLKNVS+ YEGT SGL+GYI LGTNYNY ED+T RGRIL+FDIIEVVPEPGQPL
Sbjct: 1063 QWEHVTCLKNVSLAYEGTRSGLKGYIVLGTNYNYGEDITSRGRILIFDIIEVVPEPGQPL 1122
Query: 314 TKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASM 373
TKN+ K IYAKEQKGP+TAI V+GFLV+AVGQKIYIWQLKDNDL G+AFIDT++YI M
Sbjct: 1123 TKNRFKQIYAKEQKGPITAITQVSGFLVSAVGQKIYIWQLKDNDLVGVAFIDTQIYIHQM 1182
Query: 374 VSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLV 433
+S+K+LIL+ D +SI+LLR+Q EYRTLSLV+RD++P + + Y N
Sbjct: 1183 LSIKSLILIADVYKSISLLRFQEEYRTLSLVSRDFRPAEVYTIEYLIDN----------- 1231
Query: 434 WKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR 493
S++GF+++D + N+ LFMYQPE+RES GG +
Sbjct: 1232 -----------------------------SNLGFIVADGESNLALFMYQPESRESLGGQK 1262
Query: 494 LIKKTDFHLGQHVNTFFKIRCKPSSISDA----PGARSRFLTWYASLDGALGFFLPLPEK 549
LI+K DFHLGQ +NTFF+I+C+ + ++ GA R +T YASLDG+LG+ LP+PEK
Sbjct: 1263 LIRKADFHLGQKINTFFRIKCRITDPANDKKQFSGADKRHVTMYASLDGSLGYILPVPEK 1322
Query: 550 NYRRLLMLQNVMVTHTSHTGGLNPRAFR-TYKGKGYYAGNPSRGIIDGSLVWKFLQLSLG 608
YRRLLMLQNV+VTH H GLNP+A+R TYK GNP+RGIIDG LVW++L L
Sbjct: 1323 TYRRLLMLQNVLVTHICHIAGLNPKAYRHTYKSYVRNQGNPARGIIDGDLVWRYLFLPNN 1382
Query: 609 ERLEICKKIGSK 620
E+ ++ KKIG++
Sbjct: 1383 EKADLAKKIGTR 1394
>gi|193702313|ref|XP_001945086.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Acyrthosiphon pisum]
Length = 1335
Score = 686 bits (1770), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 332/628 (52%), Positives = 440/628 (70%), Gaps = 52/628 (8%)
Query: 16 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGALKLRFKKLKVLFVSDRSKRAN 75
I++E+L V LG RP++ VR +E++IY RHP+G LK+RF K+ L ++ +S+ N
Sbjct: 755 IIKEILIVPLGYQDKRPIMFVRLDNEVVIYGIHRHPEGTLKMRFHKMTSL-LTFQSRSGN 813
Query: 76 EQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAP 135
G S +RYFS +AG+ GVF+CG +P + LT RGELR HP+ IDGP+ AP
Sbjct: 814 PLEG------TSLLRYFSKVAGHNGVFICGQNPHLILLTVRGELRCHPLHIDGPIMCFAP 867
Query: 136 FHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIV 195
FHNVNC +GFLYFN+ +LRIS+LPTHLSYD PWP+RKVPL+ TPHF+AYHLETKTYC+V
Sbjct: 868 FHNVNCSQGFLYFNSDHKLRISILPTHLSYDEPWPLRKVPLRKTPHFIAYHLETKTYCVV 927
Query: 196 TSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEW 255
TS++E S YY+FNGEDKEL T+ RD F P F + LFSP SWE IP T+ +W
Sbjct: 928 TSSSELSASYYRFNGEDKELTTEERDPLFPLPSHEVFTLELFSPASWEPIPDTSIETEDW 987
Query: 256 EHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTK 315
EH+ CLKNV++ YEG SGL+GYIA+GTNY+YSED+T RGRI LFDII+VVPEPG+PLTK
Sbjct: 988 EHITCLKNVALAYEGARSGLKGYIAMGTNYSYSEDITSRGRIFLFDIIDVVPEPGKPLTK 1047
Query: 316 NKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVS 375
NKIKMIYAKEQKGPVTAI HV GFLVTAVGQKIYIWQLKDNDL GIAFIDTEVY+ M+S
Sbjct: 1048 NKIKMIYAKEQKGPVTAITHVVGFLVTAVGQKIYIWQLKDNDLIGIAFIDTEVYVHQMLS 1107
Query: 376 VKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWK 435
+K+LILV D +SI LLR+Q EYRTLSLV RD KP + + N
Sbjct: 1108 IKSLILVADLFKSITLLRFQEEYRTLSLVCRDSKPLEVFDINFLIDN------------- 1154
Query: 436 FLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLI 495
+ +GF+ SD+D+N++L++YQP ARES GG L+
Sbjct: 1155 ---------------------------TELGFLASDRDQNLLLYLYQPMARESYGGQHLV 1187
Query: 496 KKTDFHLGQHVNTFFKIRCKPSSIS----DAPGARSRFLTWYASLDGALGFFLPLPEKNY 551
++ DF++G +VN+FF++RCK S+++ +A G+ R +T Y +LDG++G+ +P+ EKNY
Sbjct: 1188 RRGDFNIGSNVNSFFRLRCKQSTVAPDRREAIGSDKRHVTMYTTLDGSIGYIVPIHEKNY 1247
Query: 552 RRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQ-LSLGER 610
RRLL LQN++V + +H GLNP+A+R++K N +R +IDG LVW F+ ++ +R
Sbjct: 1248 RRLLTLQNMLVKNITHLAGLNPKAYRSFKATAPERMNQARRVIDGELVWMFVTCMNARQR 1307
Query: 611 LEICKKIGSKHNDILDELYDIEALSSHF 638
EI K+G K ++L ++Y+++ + HF
Sbjct: 1308 NEIANKVGVKTIELLQDIYELDRTTWHF 1335
>gi|427795803|gb|JAA63353.1| Putative mrna cleavage and polyadenylation factor ii complex
subunit cft1 cpsf subunit, partial [Rhipicephalus
pulchellus]
Length = 726
Score = 659 bits (1700), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 328/637 (51%), Positives = 436/637 (68%), Gaps = 55/637 (8%)
Query: 16 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAF----RHPKGALKLRFKKLK-VLFVSDR 70
+V E+L V LG+ +RPLLL R +LLIY+AF +G LKLRFKK+ +F+ +R
Sbjct: 131 VVHEILVVGLGIRHSRPLLLARVDEDLLIYEAFPFYETQREGHLKLRFKKMSHDIFLRER 190
Query: 71 SKRANEQPGLPRGVRISQMRY----FSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTI 126
K ++P + Q R FS+I+GY GVFLCG P WLF++SRGELR HPM +
Sbjct: 191 -KYKTQKPENEEEEKAFQSRQWLHPFSDISGYSGVFLCGYRPYWLFMSSRGELRCHPMFV 249
Query: 127 DGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYH 186
DGP+ APFHNVNCP+GFL+FN + ELRIS LPTHL+YDAPWPVRKVPL+CTPHF+ YH
Sbjct: 250 DGPIHCFAPFHNVNCPKGFLHFNKQGELRISTLPTHLTYDAPWPVRKVPLRCTPHFVNYH 309
Query: 187 LETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIP 246
+++KTYC+VTS +P +F GE+KE RDSR+I P + +F + L SP SWE IP
Sbjct: 310 VDSKTYCVVTSQPDPCNHLVRFTGEEKEYELLERDSRYIFPTMDKFSLQLLSPVSWETIP 369
Query: 247 QTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVV 306
T L EWEH+ CLKNV + EGT +G++GY+ALGTNY Y EDVT RGRI++ DII+VV
Sbjct: 370 NTRVDLDEWEHLTCLKNVMLSSEGTTTGMKGYLALGTNYCYGEDVTSRGRIIILDIIDVV 429
Query: 307 PEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDT 366
PEPGQPLTKNKIK++Y+KEQKGPVTA+ V GFL++A+GQKIYIWQLKDN+L G+AFIDT
Sbjct: 430 PEPGQPLTKNKIKIVYSKEQKGPVTALSQVVGFLLSAIGQKIYIWQLKDNELVGVAFIDT 489
Query: 367 EVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRG 426
++YI S+V+VKNLILVGD +S++LLRYQ RTLSLV+RD +P + + ++ N
Sbjct: 490 QIYIHSVVTVKNLILVGDVFKSVSLLRYQEASRTLSLVSRDVRPLEVYAVEFFIDN---- 545
Query: 427 IIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEAR 486
+ M F+++D ++N++L+MYQPE+R
Sbjct: 546 ------------------------------------TQMSFLVTDAERNLLLYMYQPESR 569
Query: 487 ESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISD-----APGARSRFLTWYASLDGALG 541
ES GG RL+++ DFH+G V + F+I+C+ I+ A R +T A+LDG+L
Sbjct: 570 ESCGGQRLLRRGDFHVGSPVVSMFRIKCRMGDIAKYDRRAASIVDGRHITMMATLDGSLA 629
Query: 542 FFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWK 601
+ LP+PEK YRRLLMLQNV+VT+ H GLNP+A+R Y + + GNP + I+DG L+WK
Sbjct: 630 YVLPVPEKTYRRLLMLQNVLVTNIPHYAGLNPKAYRMYYSQRRFLGNPHKNILDGELIWK 689
Query: 602 FLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
F+ LS ER E+ KKIG+ I D+L +IE ++HF
Sbjct: 690 FMHLSFMERSELSKKIGTTVTQITDDLLEIETYTAHF 726
>gi|427780291|gb|JAA55597.1| Putative mrna cleavage and polyadenylation factor ii complex subunit
cft1 cpsf subunit [Rhipicephalus pulchellus]
Length = 1237
Score = 659 bits (1699), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 328/637 (51%), Positives = 436/637 (68%), Gaps = 55/637 (8%)
Query: 16 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAF----RHPKGALKLRFKKLK-VLFVSDR 70
+V E+L V LG+ +RPLLL R +LLIY+AF +G LKLRFKK+ +F+ +R
Sbjct: 642 VVHEILVVGLGIRHSRPLLLARVDEDLLIYEAFPFYETQREGHLKLRFKKMSHDIFLRER 701
Query: 71 SKRANEQPGLPRGVRISQMRY----FSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTI 126
K ++P + Q R FS+I+GY GVFLCG P WLF++SRGELR HPM +
Sbjct: 702 -KYKTQKPENEEEEKAFQSRQWLHPFSDISGYSGVFLCGYRPYWLFMSSRGELRCHPMFV 760
Query: 127 DGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYH 186
DGP+ APFHNVNCP+GFL+FN + ELRIS LPTHL+YDAPWPVRKVPL+CTPHF+ YH
Sbjct: 761 DGPIHCFAPFHNVNCPKGFLHFNKQGELRISTLPTHLTYDAPWPVRKVPLRCTPHFVNYH 820
Query: 187 LETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIP 246
+++KTYC+VTS +P +F GE+KE RDSR+I P + +F + L SP SWE IP
Sbjct: 821 VDSKTYCVVTSQPDPCNHLVRFTGEEKEYELLERDSRYIFPTMDKFSLQLLSPVSWETIP 880
Query: 247 QTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVV 306
T L EWEH+ CLKNV + EGT +G++GY+ALGTNY Y EDVT RGRI++ DII+VV
Sbjct: 881 NTRVDLDEWEHLTCLKNVMLSSEGTTTGMKGYLALGTNYCYGEDVTSRGRIIILDIIDVV 940
Query: 307 PEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDT 366
PEPGQPLTKNKIK++Y+KEQKGPVTA+ V GFL++A+GQKIYIWQLKDN+L G+AFIDT
Sbjct: 941 PEPGQPLTKNKIKIVYSKEQKGPVTALSQVVGFLLSAIGQKIYIWQLKDNELVGVAFIDT 1000
Query: 367 EVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRG 426
++YI S+V+VKNLILVGD +S++LLRYQ RTLSLV+RD +P + + ++ N
Sbjct: 1001 QIYIHSVVTVKNLILVGDVFKSVSLLRYQEASRTLSLVSRDVRPLEVYAVEFFIDN---- 1056
Query: 427 IIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEAR 486
+ M F+++D ++N++L+MYQPE+R
Sbjct: 1057 ------------------------------------TQMSFLVTDAERNLLLYMYQPESR 1080
Query: 487 ESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISD-----APGARSRFLTWYASLDGALG 541
ES GG RL+++ DFH+G V + F+I+C+ I+ A R +T A+LDG+L
Sbjct: 1081 ESCGGQRLLRRGDFHVGSPVVSMFRIKCRMGDIAKYDRRAASIVDGRHITMMATLDGSLA 1140
Query: 542 FFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWK 601
+ LP+PEK YRRLLMLQNV+VT+ H GLNP+A+R Y + + GNP + I+DG L+WK
Sbjct: 1141 YVLPVPEKTYRRLLMLQNVLVTNIPHYAGLNPKAYRMYYSQRRFLGNPHKNILDGELIWK 1200
Query: 602 FLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
F+ LS ER E+ KKIG+ I D+L +IE ++HF
Sbjct: 1201 FMHLSFMERSELSKKIGTTVTQITDDLLEIETYTAHF 1237
>gi|432883539|ref|XP_004074300.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Oryzias latipes]
Length = 1456
Score = 640 bits (1652), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 309/640 (48%), Positives = 424/640 (66%), Gaps = 57/640 (8%)
Query: 16 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAF----RHPKGALKLRFKKL--------- 62
+V+E+ V+LG + +RP LLV ++ELL+Y+AF + P+ LK+RFKK+
Sbjct: 857 LVKEVALVALGNNRSRPYLLVHVENELLVYEAFPYDQQQPQNNLKVRFKKVPHSINFREK 916
Query: 63 -KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRA 121
L +++ + + RIS+ RYF +I+GY GVF+CGP P W+ +TSRG LR
Sbjct: 917 KPKLKKDKKAEGGGPEENVAVKSRISRFRYFEDISGYSGVFICGPSPHWMLITSRGGLRL 976
Query: 122 HPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPH 181
HPMTIDGP+ + +PFHN+NCP+GFLYFN + ELRISVLPT+LSYDAPWPVRK+PL+CT H
Sbjct: 977 HPMTIDGPIESFSPFHNINCPKGFLYFNKQGELRISVLPTYLSYDAPWPVRKIPLRCTVH 1036
Query: 182 FLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFS 241
F++YH+E+K Y + TS E T + GE+KE T RD R+I PL +F + L SP S
Sbjct: 1037 FVSYHVESKVYAVCTSVKELCTRIPRMTGEEKEFETIERDERYINPLQEKFSIQLISPVS 1096
Query: 242 WEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFD 301
WE IP T L EWEHV C+K V++ + T+SGL+GYIA GT E+VTCRGRIL+ D
Sbjct: 1097 WETIPNTRIDLEEWEHVTCMKTVALRSQETVSGLKGYIAAGTCVLQGEEVTCRGRILILD 1156
Query: 302 IIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGI 361
+IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH G+LV+A+GQKI++W LKDNDLTG+
Sbjct: 1157 VIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCHGYLVSAIGQKIFLWALKDNDLTGM 1216
Query: 362 AFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAG 421
AFIDT++YI M+S+KN IL D +SI+LLRYQ E +TLSLV+RD KP + S +
Sbjct: 1217 AFIDTQLYIHQMISIKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSIEFIVD 1276
Query: 422 NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMY 481
N + +GF++SD+DKN+ ++MY
Sbjct: 1277 N----------------------------------------NQLGFLVSDRDKNLFVYMY 1296
Query: 482 QPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGA---RSRFLTWYASLDG 538
PEA+ES GG RL+++ DF+ G H+N+ +++ C+ + S + A ++ +TW+A+LDG
Sbjct: 1297 LPEAKESFGGMRLLRRADFNAGAHINSLWRMPCRGALDSGSKKALTWDNKHITWFATLDG 1356
Query: 539 ALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSL 598
+G LP+ EK YRRLLMLQN + T H GLNP+AFR N + I+DG L
Sbjct: 1357 GIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPKAFRMMHSNRRSLQNAVKNILDGEL 1416
Query: 599 VWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
+ K+L LS ER E+ KKIG+ + ILD+L +I+ +++HF
Sbjct: 1417 LAKYLYLSTMERSELAKKIGTTQDIILDDLLEIDRVTAHF 1456
>gi|229335612|ref|NP_001108153.2| cleavage and polyadenylation specificity factor subunit 1 [Danio
rerio]
Length = 1449
Score = 640 bits (1652), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 311/642 (48%), Positives = 423/642 (65%), Gaps = 56/642 (8%)
Query: 13 DETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAF----RHPKGALKLRFKKLKVLFVS 68
D +V+E+ VSLG + +RP LL + ELLIY+AF + + LK+RFKK+
Sbjct: 848 DIPLVKEVALVSLGYNHSRPYLLAHVEQELLIYEAFPYDQQQAQSNLKVRFKKMPHNINY 907
Query: 69 DRSKRANEQPGLPRGV---------RISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGEL 119
K + P G R+++ RYF +I+GY GVF+CGP P W+ +TSRG +
Sbjct: 908 REKKVKVRKDKKPEGQGEDTLGVKGRVARFRYFQDISGYSGVFICGPSPHWMLVTSRGAM 967
Query: 120 RAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCT 179
R HPMTIDG + + +PFHN+NCP+GFLYFN + ELRISVLPT+LSYDAPWPVRK+PL+CT
Sbjct: 968 RLHPMTIDGAIESFSPFHNINCPKGFLYFNKQGELRISVLPTYLSYDAPWPVRKIPLRCT 1027
Query: 180 PHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSP 239
H+++YH+E+K Y + TS EP T + GE+KE T RD R+I P +F + L SP
Sbjct: 1028 VHYVSYHVESKVYAVCTSVKEPCTRIPRMTGEEKEFETIERDERYIHPQQDKFSIQLISP 1087
Query: 240 FSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILL 299
SWE IP T L EWEHV C+K V+++ + T+SGL+GY+ALGT E+VTCRGRIL+
Sbjct: 1088 VSWEAIPNTRVDLEEWEHVTCMKTVALKSQETVSGLKGYVALGTCLMQGEEVTCRGRILI 1147
Query: 300 FDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLT 359
D+IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH +GFLV+A+GQKI++W LKDNDLT
Sbjct: 1148 LDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCSGFLVSAIGQKIFLWSLKDNDLT 1207
Query: 360 GIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYY 419
G+AFIDT++YI M S+KN IL D +SI+LLRYQPE +TLSLV+RD KP + S +
Sbjct: 1208 GMAFIDTQLYIHQMYSIKNFILAADVMKSISLLRYQPESKTLSLVSRDAKPLEVYSIEFM 1267
Query: 420 AGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLF 479
N + +GF++SD+DKN++++
Sbjct: 1268 VDN----------------------------------------NQLGFLVSDRDKNLMVY 1287
Query: 480 MYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCK---PSSISDAPGARSRFLTWYASL 536
MY PEA+ES GG RL+++ DF++G HVN F+++ C+ ++ A ++ +TW+A+L
Sbjct: 1288 MYLPEAKESFGGMRLLRRADFNVGSHVNAFWRMPCRGTLDTANKKALTWDNKHITWFATL 1347
Query: 537 DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDG 596
DG +G LP+ EK YRRLLMLQN + T H GLNP+AFR N + I+DG
Sbjct: 1348 DGGVGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPKAFRMLHCDRRTLQNAVKNILDG 1407
Query: 597 SLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
L+ K+L LS ER E+ KKIG+ + ILD+L +IE +++HF
Sbjct: 1408 ELLNKYLYLSTMERSELAKKIGTTPDIILDDLLEIERVTAHF 1449
>gi|348512553|ref|XP_003443807.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Oreochromis niloticus]
Length = 1456
Score = 639 bits (1648), Expect = e-180, Method: Compositional matrix adjust.
Identities = 310/641 (48%), Positives = 424/641 (66%), Gaps = 59/641 (9%)
Query: 16 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRH----PKGALKLRFKKL--------- 62
+V+E+ VSLG + ++P LLV + ELLIY+AF++ P+ LK+RFKK+
Sbjct: 857 LVKEVALVSLGNNHSKPYLLVHVEQELLIYEAFQYDQQQPQNNLKVRFKKVPHNINFREK 916
Query: 63 --KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELR 120
K+ A E+ +G RI++ R+F +I+GY GVF+CGP P W+ +TSRG LR
Sbjct: 917 KSKLKKDKKAESSATEESSGVKG-RIARFRFFEDISGYSGVFICGPSPHWMLVTSRGALR 975
Query: 121 AHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTP 180
HPMTIDG + + +PFHN+NCP+GFLYFN + ELRISVLPT+LSYDAPWPVRK+PL+CT
Sbjct: 976 LHPMTIDGSIESFSPFHNINCPKGFLYFNKQGELRISVLPTYLSYDAPWPVRKIPLRCTV 1035
Query: 181 HFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPF 240
H+++YH+E+K Y + TS EP T + GE+KE RD R+I P +F + L SP
Sbjct: 1036 HYVSYHVESKVYAVCTSVKEPCTRIPRMTGEEKEYEVIERDERYIHPQQEKFSIQLISPV 1095
Query: 241 SWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLF 300
SWE IP T L EWEHV C+K V++ + T+SGL+GYIA GT E+VTCRGRIL+
Sbjct: 1096 SWEAIPNTRIDLEEWEHVTCMKTVALRSQETVSGLKGYIAAGTCLMQGEEVTCRGRILIL 1155
Query: 301 DIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTG 360
D+IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH G+LV+A+GQKI++W LKDNDLTG
Sbjct: 1156 DVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGYLVSAIGQKIFLWVLKDNDLTG 1215
Query: 361 IAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYA 420
+AFIDT++YI M S+KN IL D +SI+LLRYQ E +TLSLV+RD KP + S +
Sbjct: 1216 MAFIDTQLYIHQMFSIKNFILAADLMKSISLLRYQEESKTLSLVSRDAKPLEVYSIEFMV 1275
Query: 421 GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFM 480
N + +GF++SD+DKN+ ++M
Sbjct: 1276 DN----------------------------------------NQLGFLVSDRDKNLYVYM 1295
Query: 481 YQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCK---PSSISDAPGARSRFLTWYASLD 537
Y PEA+ES GG RL+++ DF+ G ++NTF+++ C+ +S A ++ +TW+A+LD
Sbjct: 1296 YLPEAKESFGGMRLLRRADFNAGANINTFWRMPCRGALDASSKKALTWDNKHITWFATLD 1355
Query: 538 GALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGS 597
G +G LP+ EK YRRLLMLQN + T H GLNP+AFR NP + I+DG
Sbjct: 1356 GGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPKAFRMLHSDRRSLQNPVKNILDGE 1415
Query: 598 LVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
L+ K+L LS+ ER E+ KKIG+ + ILD+L +I+ +++HF
Sbjct: 1416 LLNKYLYLSMMERSELAKKIGTTQDIILDDLLEIDRVTAHF 1456
>gi|49619065|gb|AAT68117.1| cleavage and polyadenylation specific factor 1 [Danio rerio]
Length = 1105
Score = 637 bits (1642), Expect = e-180, Method: Compositional matrix adjust.
Identities = 310/642 (48%), Positives = 421/642 (65%), Gaps = 56/642 (8%)
Query: 13 DETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAF----RHPKGALKLRFKKLKVLFVS 68
D +V+E+ VSLG +RP LL + ELLIY+AF + + LK+RFKK+
Sbjct: 504 DIPLVKEVALVSLGYSHSRPYLLAHVEQELLIYEAFPYDQQQAQSNLKVRFKKMPHNINY 563
Query: 69 DRSKRANEQPGLPRGV---------RISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGEL 119
K + P G R+++ RYF +I+GY GVF+CGP P W+ +TSRG +
Sbjct: 564 REKKVKVRKDKKPEGQGEDSLGVKGRVARFRYFQDISGYSGVFICGPSPHWMLVTSRGAM 623
Query: 120 RAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCT 179
R HPMTIDG + + +PFHN+NCP+GFLYFN + ELRISVLPT+LSYDAPWPVRK+PL+CT
Sbjct: 624 RLHPMTIDGAIESFSPFHNINCPKGFLYFNKQGELRISVLPTYLSYDAPWPVRKIPLRCT 683
Query: 180 PHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSP 239
H+++YH+E+K Y + TS EP T + GE+KE T RD R+I P +F + L SP
Sbjct: 684 VHYVSYHVESKVYAVCTSVKEPCTRIPRMTGEEKEFETIERDERYIHPQQDKFSIQLISP 743
Query: 240 FSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILL 299
SWE IP T L EWEHV C+K V+++ + T+SGL+GY+ALGT E+VTCRGRIL+
Sbjct: 744 VSWEAIPNTRVDLEEWEHVTCMKTVALKSQETVSGLKGYVALGTCLMQGEEVTCRGRILI 803
Query: 300 FDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLT 359
D+IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH +GFLV+A+GQKI++W LK NDLT
Sbjct: 804 LDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCSGFLVSAIGQKIFLWSLKYNDLT 863
Query: 360 GIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYY 419
G+AFIDT++YI M S+KN IL D +SI+LLRYQPE +TLSLV+RD KP + S +
Sbjct: 864 GMAFIDTQLYIHQMYSIKNFILAADVMKSISLLRYQPESKTLSLVSRDAKPLEVYSIEFM 923
Query: 420 AGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLF 479
N + +GF++SD+DKN++++
Sbjct: 924 VDN----------------------------------------NQLGFLVSDRDKNLMVY 943
Query: 480 MYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCK---PSSISDAPGARSRFLTWYASL 536
MY PEA+ES GG RL+++ DF++G HVN F+++ C+ ++ A ++ +TW+A+L
Sbjct: 944 MYLPEAKESFGGMRLLRRADFNVGSHVNAFWRMPCRGTLDTANKKALTWDNKHITWFATL 1003
Query: 537 DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDG 596
DG +G LP+ EK YRRLLMLQN + T H GLNP+AFR N + I+DG
Sbjct: 1004 DGGVGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPKAFRMLHCDRRTLQNAVKNILDG 1063
Query: 597 SLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
L+ K+L LS ER E+ KKIG+ + ILD+L +IE +++HF
Sbjct: 1064 ELLNKYLYLSTMERSELAKKIGTTPDIILDDLLEIERVTAHF 1105
>gi|27807297|ref|NP_777145.1| cleavage and polyadenylation specificity factor subunit 1 [Bos
taurus]
gi|1706101|sp|Q10569.1|CPSF1_BOVIN RecName: Full=Cleavage and polyadenylation specificity factor subunit
1; AltName: Full=Cleavage and polyadenylation specificity
factor 160 kDa subunit; Short=CPSF 160 kDa subunit
gi|929007|emb|CAA58152.1| cleavage and polyadenylation specificity factor, 160 kDa subunit [Bos
taurus]
gi|296480730|tpg|DAA22845.1| TPA: cleavage and polyadenylation specificity factor subunit 1 [Bos
taurus]
Length = 1444
Score = 634 bits (1635), Expect = e-179, Method: Compositional matrix adjust.
Identities = 311/642 (48%), Positives = 415/642 (64%), Gaps = 62/642 (9%)
Query: 16 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGALKLRFKKL--------- 62
+V+E+L V+LG RP LLV ELLIY+AF H +G LK+RFKK+
Sbjct: 846 LVKEVLLVALGSRQRRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKVPHNINFREK 905
Query: 63 -KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRA 121
+ E+ PRG R+++ RYF +I GY GVF+CGP P WL +T RG LR
Sbjct: 906 KPKPSKKKAEGGSTEEGTGPRG-RVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRL 964
Query: 122 HPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPH 181
HPM IDGP+ + APFHN+NCPRGFLYFN + ELRISVLP +LSYDAPWPVRK+PL+CT H
Sbjct: 965 HPMGIDGPIDSFAPFHNINCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAH 1024
Query: 182 FLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFS 241
++AYH+E+K Y + TST+ P T + GE+KE T RD R++ P F + L SP S
Sbjct: 1025 YVAYHVESKVYAVATSTSTPCTRVPRMTGEEKEFETIERDERYVHPQQEAFCIQLISPVS 1084
Query: 242 WEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFD 301
WE IP L EWEHV C+K VS+ E T+SGL+GY+A GT E+VTCRGRIL+ D
Sbjct: 1085 WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMD 1144
Query: 302 IIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGI 361
+IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH G LV+A+GQKI++W L+ ++LTG+
Sbjct: 1145 VIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGM 1204
Query: 362 AFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAG 421
AFIDT++YI M+SVKN IL D +SI+LLRYQ E +TLSLV+RD KP + S +
Sbjct: 1205 AFIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVD 1264
Query: 422 NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMY 481
N + +GF++SD+D+N++++MY
Sbjct: 1265 N----------------------------------------AQLGFLVSDRDRNLMVYMY 1284
Query: 482 QPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARS-----RFLTWYASL 536
PEA+ES GG RL+++ DFH+G HVNTF++ C+ ++ P +S + +TW+A+L
Sbjct: 1285 LPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCR--GAAEGPSKKSVVWENKHITWFATL 1342
Query: 537 DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDG 596
DG +G LP+ EK YRRLLMLQN + T H GLNPRAFR N R ++DG
Sbjct: 1343 DGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRVLQNAVRNVLDG 1402
Query: 597 SLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
L+ ++L LS ER E+ KKIG+ + ILD+L + + +++HF
Sbjct: 1403 ELLNRYLYLSTMERGELAKKIGTTPDIILDDLLETDRVTAHF 1444
>gi|431908146|gb|ELK11749.1| Cleavage and polyadenylation specificity factor subunit 1 [Pteropus
alecto]
Length = 820
Score = 634 bits (1635), Expect = e-179, Method: Compositional matrix adjust.
Identities = 317/642 (49%), Positives = 421/642 (65%), Gaps = 63/642 (9%)
Query: 16 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGALKLRFKKL--KVLFVSD 69
+V+E+L V+LG +RP LLV ELL+Y+AF H +G LK+RFKK+ + F
Sbjct: 223 LVKEVLLVALGSRQSRPYLLVHVDQELLVYEAFPHDSQLGQGNLKVRFKKVPHNINFREK 282
Query: 70 R---SKR-----ANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRA 121
+ SK+ A E PG RG R+++ RYF +I GY GVF+CGP P WL +T RG LR
Sbjct: 283 KPRPSKKKAEGGAEEGPG-ARG-RVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRL 340
Query: 122 HPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPH 181
HPM IDGP+ + APFHNVNCPRGFLYFN + ELRISVLP +LSYDAPWPVRK+PL+CT H
Sbjct: 341 HPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAH 400
Query: 182 FLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFS 241
++AYH+E+K Y + TST P T + GE+KE T RD R+I P F + L SP S
Sbjct: 401 YVAYHVESKVYAVATSTNTPCTRIPRMTGEEKEFETIERDDRYIHPQQEAFSIQLISPVS 460
Query: 242 WEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFD 301
WE IP L EWEHV C+K VS+ E T+SGL+GY+A GT E+VTCRGRIL+ D
Sbjct: 461 WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMD 520
Query: 302 IIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGI 361
+IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH G LV+A+GQKI++W L+ ++LTG+
Sbjct: 521 VIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGM 580
Query: 362 AFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAG 421
AFIDT++YI M+SVKN IL D +SI+LLRYQ E +TLSLV+RD KP + S +
Sbjct: 581 AFIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVD 640
Query: 422 NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMY 481
N + +GF++SD+D+N++++MY
Sbjct: 641 N----------------------------------------AQLGFLVSDRDRNLMVYMY 660
Query: 482 QPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARS-----RFLTWYASL 536
PEA+ES GG RL+++ DFH+G HVNTF++ C+ ++ P +S + +TW+A+L
Sbjct: 661 LPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCR--GAAEGPSKKSVVWENKHITWFATL 718
Query: 537 DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDG 596
DG +G LP+ EK YRRLLMLQN + T H GLNPRAFR N R ++DG
Sbjct: 719 DGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRTLQNAVRNVLDG 778
Query: 597 SLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
L+ ++L LS ER E+ KKIG+ + ILD+L + + +++HF
Sbjct: 779 ELLNRYLYLSTMERGELAKKIGTTPDIILDDLLETDRVTAHF 820
>gi|158287218|ref|XP_309311.4| AGAP011340-PA [Anopheles gambiae str. PEST]
gi|157019545|gb|EAA05261.4| AGAP011340-PA [Anopheles gambiae str. PEST]
Length = 1434
Score = 634 bits (1634), Expect = e-179, Method: Compositional matrix adjust.
Identities = 306/637 (48%), Positives = 424/637 (66%), Gaps = 60/637 (9%)
Query: 18 QELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGALKLRFKKLKVLFV---------- 67
+E+L V+LG +G+RPLL +R +H+LLIY+ FR+ KG LKLRFK+L
Sbjct: 836 KEILMVALGSYGSRPLLFIRLEHDLLIYRVFRYSKGHLKLRFKRLSTSVTCPVFRTPEPS 895
Query: 68 -SDRSKRANEQPGLPRGVR-----ISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRA 121
+ ++ ANEQ R + IS +RYF+N++GY GV +CG P +LFLT+ GELR+
Sbjct: 896 GAGATEAANEQQQ-ARATKVLYENISMIRYFANVSGYAGVAVCGEKPYFLFLTAHGELRS 954
Query: 122 HPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPH 181
H + + APF+NVNCP GFLYF+ + EL+IS+ PT+LSYD+ WPVRK+PL+ +P
Sbjct: 955 HRLYARTVMKAFAPFNNVNCPNGFLYFDEQYELKISIFPTYLSYDSVWPVRKIPLRSSPK 1014
Query: 182 FLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFS 241
+ YH E K YC+V E YY+FNGEDKEL + + RF+ P+ +F V L +P +
Sbjct: 1015 QIVYHRENKVYCVVMDAEEICNKYYRFNGEDKELTEENKGERFLYPMGHRFSVVLVTPAA 1074
Query: 242 WEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFD 301
WE +P+T+ L EWEHV+ LKNVS+ YEG SGL+ YIA+GTN+NYSED+T RGR+LL+D
Sbjct: 1075 WEVVPETSINLEEWEHVIALKNVSLTYEGARSGLKEYIAVGTNFNYSEDITSRGRLLLYD 1134
Query: 302 IIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGI 361
IIEVVPEPG+PLTK+K K + K+QKGPV+AI HV GFLV AVGQK+Y+WQ+KD+DL G+
Sbjct: 1135 IIEVVPEPGKPLTKHKFKEVIVKDQKGPVSAISHVCGFLVGAVGQKVYLWQMKDDDLVGV 1194
Query: 362 AFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAG 421
AFIDT +++ MVS+K+LILV D +S++LLR+Q EYRTLS+V+RDY P Y
Sbjct: 1195 AFIDTNIFVHQMVSIKSLILVADVYKSVSLLRFQEEYRTLSVVSRDYHPLNVFQVEYVVD 1254
Query: 422 NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMY 481
N +++GF++SD N++ +MY
Sbjct: 1255 N----------------------------------------ANLGFLVSDDQCNLITYMY 1274
Query: 482 QPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC---KPSSISDAPGARSRFLTWYASLDG 538
QPE+RES GG RL++K+D+HLGQ VN F+++C + + ++ T++A+LDG
Sbjct: 1275 QPESRESFGGQRLLRKSDYHLGQQVNCMFRVQCDFHETDVMKRTLNYDNKHTTFFATLDG 1334
Query: 539 ALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSL 598
+GF LPLPEK YRRL MLQNV++TH+ HT GLNP+A+RT K NPSR ++DG L
Sbjct: 1335 GIGFVLPLPEKTYRRLFMLQNVLLTHSPHTCGLNPKAYRTIKQTRKLPINPSRCVVDGDL 1394
Query: 599 VWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALS 635
VW FL+L E+ E+ KKIG++ +I +L +IE ++
Sbjct: 1395 VWSFLELPANEKHEVAKKIGTRIEEICADLMEIEHVT 1431
>gi|358415280|ref|XP_003583063.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
[Bos taurus]
Length = 1490
Score = 633 bits (1633), Expect = e-179, Method: Compositional matrix adjust.
Identities = 311/642 (48%), Positives = 415/642 (64%), Gaps = 62/642 (9%)
Query: 16 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGALKLRFKKL--------- 62
+V+E+L V+LG RP LLV ELLIY+AF H +G LK+RFKK+
Sbjct: 892 LVKEVLLVALGSRQRRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKVPHNINFREK 951
Query: 63 -KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRA 121
+ E+ PRG R+++ RYF +I GY GVF+CGP P WL +T RG LR
Sbjct: 952 KPKPSKKKAEGGSTEEGTGPRG-RVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRL 1010
Query: 122 HPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPH 181
HPM IDGP+ + APFHN+NCPRGFLYFN + ELRISVLP +LSYDAPWPVRK+PL+CT H
Sbjct: 1011 HPMGIDGPIDSFAPFHNINCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAH 1070
Query: 182 FLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFS 241
++AYH+E+K Y + TST+ P T + GE+KE T RD R++ P F + L SP S
Sbjct: 1071 YVAYHVESKVYAVATSTSTPCTRVPRMTGEEKEFETIERDERYVHPQQEAFCIQLISPVS 1130
Query: 242 WEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFD 301
WE IP L EWEHV C+K VS+ E T+SGL+GY+A GT E+VTCRGRIL+ D
Sbjct: 1131 WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMD 1190
Query: 302 IIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGI 361
+IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH G LV+A+GQKI++W L+ ++LTG+
Sbjct: 1191 VIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGM 1250
Query: 362 AFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAG 421
AFIDT++YI M+SVKN IL D +SI+LLRYQ E +TLSLV+RD KP + S +
Sbjct: 1251 AFIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVD 1310
Query: 422 NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMY 481
N + +GF++SD+D+N++++MY
Sbjct: 1311 N----------------------------------------AQLGFLVSDRDRNLMVYMY 1330
Query: 482 QPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARS-----RFLTWYASL 536
PEA+ES GG RL+++ DFH+G HVNTF++ C+ ++ P +S + +TW+A+L
Sbjct: 1331 LPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCR--GAAEGPSKKSVVWENKHITWFATL 1388
Query: 537 DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDG 596
DG +G LP+ EK YRRLLMLQN + T H GLNPRAFR N R ++DG
Sbjct: 1389 DGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRVLQNAVRNVLDG 1448
Query: 597 SLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
L+ ++L LS ER E+ KKIG+ + ILD+L + + +++HF
Sbjct: 1449 ELLNRYLYLSTMERGELAKKIGTTPDIILDDLLETDRVTAHF 1490
>gi|395860104|ref|XP_003802355.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
[Otolemur garnettii]
Length = 1441
Score = 633 bits (1632), Expect = e-178, Method: Compositional matrix adjust.
Identities = 311/641 (48%), Positives = 417/641 (65%), Gaps = 60/641 (9%)
Query: 16 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGALKLRFKKL--------- 62
+V+E+L V+LG +RP LLV ELLIY+AF H +G LK+RFKK+
Sbjct: 843 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKVPHNINFREK 902
Query: 63 KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAH 122
K +++ + + G R+++ RYF +I GY GVF+CGP P WL +T RG LR H
Sbjct: 903 KPKPSKKKAEGGSTEEGAGVRGRVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRLH 962
Query: 123 PMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHF 182
PM IDGP+ + APFHNVNCPRGFLYFN + ELRISVLP +LSYDAPWPVRK+PL+CT H+
Sbjct: 963 PMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHY 1022
Query: 183 LAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSW 242
+AYH+E+K Y + TST P T + GE+KE T RD R+I P F + L SP SW
Sbjct: 1023 VAYHVESKVYAVATSTNTPCTRIPRMTGEEKEFETIERDDRYIHPQQEAFSIQLISPVSW 1082
Query: 243 EEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDI 302
E IP L EWEHV C+K VS+ E T+SGL+GY+A GT E+VTCRGRIL+ D+
Sbjct: 1083 EAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDV 1142
Query: 303 IEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIA 362
IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH G LV+A+GQKI++W L+ ++LTG+A
Sbjct: 1143 IEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMA 1202
Query: 363 FIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGN 422
FIDT++YI M+SVKN IL D +SI+LLRYQ E +TLSLV+RD KP + S + N
Sbjct: 1203 FIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDN 1262
Query: 423 PSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQ 482
+ +GF++SD+D+N++++MY
Sbjct: 1263 ----------------------------------------AQLGFLVSDRDRNLMVYMYL 1282
Query: 483 PEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARS-----RFLTWYASLD 537
PEA+ES GG RL+++ DFH+G HVNTF++ C+ + ++ P +S + +TW+A+LD
Sbjct: 1283 PEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGA--TEGPSKKSVVWENKHITWFATLD 1340
Query: 538 GALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGS 597
G +G LP+ EK YRRLLMLQN + T H GLNPRAFR N R ++DG
Sbjct: 1341 GGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRVLQNAVRNVLDGE 1400
Query: 598 LVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
L+ ++L LS ER E+ KKIG+ + ILD+L + + +++HF
Sbjct: 1401 LLNRYLYLSTMERGELAKKIGTTPDIILDDLLETDRVTAHF 1441
>gi|348555854|ref|XP_003463738.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
isoform 1 [Cavia porcellus]
Length = 1440
Score = 632 bits (1630), Expect = e-178, Method: Compositional matrix adjust.
Identities = 316/658 (48%), Positives = 420/658 (63%), Gaps = 65/658 (9%)
Query: 2 GNFRSHSPSAMDET-IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGALK 56
G R + E +V+E+L V+LG +RP LLV ELLIY+AF H +G LK
Sbjct: 827 GEVRKEEATRQGELPLVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLK 886
Query: 57 LRFKKL-----------KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCG 105
+RFKK+ K +E G+ RG R+++ RYF +I GY GVF+CG
Sbjct: 887 VRFKKVPHNINFREKKPKPSKKKAEGGSTDEGSGV-RG-RVARFRYFEDIYGYSGVFICG 944
Query: 106 PHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSY 165
P P WL +T RG LR HPM IDGP+ + APFHNVNCPRGFLYFN + ELRISVLP +LSY
Sbjct: 945 PSPHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSY 1004
Query: 166 DAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFI 225
DAPWPVRK+PL+CT H++AYH+E+K Y + TST+ P T + GE+KE RD R+I
Sbjct: 1005 DAPWPVRKIPLRCTAHYVAYHVESKVYAVATSTSTPCTRIPRMTGEEKEFEAIERDDRYI 1064
Query: 226 PPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNY 285
P F + L SP SWE IP L EWEHV C+K VS+ E T+SGL+GY+A GT
Sbjct: 1065 HPQQEAFSIQLISPVSWEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCL 1124
Query: 286 NYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVG 345
E+VTCRGRIL+ D+IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH G LV+A+G
Sbjct: 1125 MQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIG 1184
Query: 346 QKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVA 405
QKI++W L+ ++LTG+AFIDT++YI M+SVKN IL D +SI+LLRYQ E +TLSLV+
Sbjct: 1185 QKIFLWSLRASELTGMAFIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVS 1244
Query: 406 RDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSM 465
RD KP + S + N + +
Sbjct: 1245 RDAKPLEVYSVDFMVDN----------------------------------------AQL 1264
Query: 466 GFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGA 525
GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF++ C+ ++ P
Sbjct: 1265 GFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCR--GATEGPSK 1322
Query: 526 RS-----RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYK 580
+S + +TW+A+LDG +G LP+ EK YRRLLMLQN + T H GLNPRAFR
Sbjct: 1323 KSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLH 1382
Query: 581 GKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
N R ++DG L+ ++L LS ER E+ KKIG+ + ILD+L + + +++HF
Sbjct: 1383 VDRRILQNAVRNVLDGELLNRYLYLSTMERGELAKKIGTTPDIILDDLLETDRVTAHF 1440
>gi|344236599|gb|EGV92702.1| Cleavage and polyadenylation specificity factor subunit 1 [Cricetulus
griseus]
Length = 1419
Score = 631 bits (1627), Expect = e-178, Method: Compositional matrix adjust.
Identities = 310/641 (48%), Positives = 416/641 (64%), Gaps = 60/641 (9%)
Query: 16 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGALKLRFKKL--------- 62
+V+E+L V+LG +RP LLV ELLIY+AF H +G LK+RFKK+
Sbjct: 821 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKVPHNINFREK 880
Query: 63 KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAH 122
K +++ + + G R+++ RYF +I GY GVF+CGP P WL +T RG LR H
Sbjct: 881 KPKPSKKKAEGCSTEEGSGVRGRVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRLH 940
Query: 123 PMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHF 182
PM IDGP+ + APFHNVNCPRGFLYFN + ELRISVLP +LSYDAPWPVRK+PL+CT H+
Sbjct: 941 PMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHY 1000
Query: 183 LAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSW 242
+AYH+E+K Y + TST P T + GE+KE RD R+I P F + L SP SW
Sbjct: 1001 VAYHVESKVYAVATSTNTPCTRIPRMTGEEKEFEAIERDDRYIHPQQEAFSIQLISPVSW 1060
Query: 243 EEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDI 302
E IP L EWEHV C+K VS+ E T+SGL+GY+A GT E+VTCRGRIL+ D+
Sbjct: 1061 EAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDV 1120
Query: 303 IEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIA 362
IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH G LV+A+GQKI++W L+ ++LTG+A
Sbjct: 1121 IEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMA 1180
Query: 363 FIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGN 422
FIDT++YI M+SVKN IL D +SI+LLRYQ E +TLSLV+RD KP + S + N
Sbjct: 1181 FIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDN 1240
Query: 423 PSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQ 482
+ +GF++SD+D+N++++MY
Sbjct: 1241 ----------------------------------------AQLGFLVSDRDRNLMVYMYL 1260
Query: 483 PEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARS-----RFLTWYASLD 537
PEA+ES GG RL+++ DFH+G HVNTF++ C+ + ++ P +S + +TW+A+LD
Sbjct: 1261 PEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGA--AEGPSKKSVMWENKHITWFATLD 1318
Query: 538 GALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGS 597
G +G LP+ EK YRRLLMLQN + T H GLNPRAFR N R ++DG
Sbjct: 1319 GGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGE 1378
Query: 598 LVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
L+ ++L LS ER E+ KKIG+ + ILD+L + + +++HF
Sbjct: 1379 LLNRYLYLSTMERSELAKKIGTTPDIILDDLLETDRVTAHF 1419
>gi|354491122|ref|XP_003507705.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
isoform 1 [Cricetulus griseus]
Length = 1441
Score = 631 bits (1627), Expect = e-178, Method: Compositional matrix adjust.
Identities = 310/641 (48%), Positives = 416/641 (64%), Gaps = 60/641 (9%)
Query: 16 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGALKLRFKKL--------- 62
+V+E+L V+LG +RP LLV ELLIY+AF H +G LK+RFKK+
Sbjct: 843 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKVPHNINFREK 902
Query: 63 KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAH 122
K +++ + + G R+++ RYF +I GY GVF+CGP P WL +T RG LR H
Sbjct: 903 KPKPSKKKAEGCSTEEGSGVRGRVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRLH 962
Query: 123 PMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHF 182
PM IDGP+ + APFHNVNCPRGFLYFN + ELRISVLP +LSYDAPWPVRK+PL+CT H+
Sbjct: 963 PMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHY 1022
Query: 183 LAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSW 242
+AYH+E+K Y + TST P T + GE+KE RD R+I P F + L SP SW
Sbjct: 1023 VAYHVESKVYAVATSTNTPCTRIPRMTGEEKEFEAIERDDRYIHPQQEAFSIQLISPVSW 1082
Query: 243 EEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDI 302
E IP L EWEHV C+K VS+ E T+SGL+GY+A GT E+VTCRGRIL+ D+
Sbjct: 1083 EAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDV 1142
Query: 303 IEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIA 362
IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH G LV+A+GQKI++W L+ ++LTG+A
Sbjct: 1143 IEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMA 1202
Query: 363 FIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGN 422
FIDT++YI M+SVKN IL D +SI+LLRYQ E +TLSLV+RD KP + S + N
Sbjct: 1203 FIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDN 1262
Query: 423 PSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQ 482
+ +GF++SD+D+N++++MY
Sbjct: 1263 ----------------------------------------AQLGFLVSDRDRNLMVYMYL 1282
Query: 483 PEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARS-----RFLTWYASLD 537
PEA+ES GG RL+++ DFH+G HVNTF++ C+ + ++ P +S + +TW+A+LD
Sbjct: 1283 PEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGA--AEGPSKKSVMWENKHITWFATLD 1340
Query: 538 GALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGS 597
G +G LP+ EK YRRLLMLQN + T H GLNPRAFR N R ++DG
Sbjct: 1341 GGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGE 1400
Query: 598 LVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
L+ ++L LS ER E+ KKIG+ + ILD+L + + +++HF
Sbjct: 1401 LLNRYLYLSTMERSELAKKIGTTPDIILDDLLETDRVTAHF 1441
>gi|403302917|ref|XP_003942095.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
[Saimiri boliviensis boliviensis]
Length = 1390
Score = 630 bits (1626), Expect = e-178, Method: Compositional matrix adjust.
Identities = 309/639 (48%), Positives = 416/639 (65%), Gaps = 56/639 (8%)
Query: 16 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRH----PKGALKLRFKKL--------- 62
+V+E+L V+LG +RP LLV ELLIY+AF H +G LK+RFKK+
Sbjct: 792 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLSQGNLKVRFKKVPHNINFREK 851
Query: 63 KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAH 122
K +++ + + G R+++ RYF +I GY GVF+CGP P WL +T RG LR H
Sbjct: 852 KPKPSKKKAEGGSAEEGAGARGRVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRLH 911
Query: 123 PMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHF 182
PM IDGPV + APFHN+NCPRGFLYFN + ELRISVLP +LSYDAPWPVRK+PL+CT H+
Sbjct: 912 PMGIDGPVDSFAPFHNINCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHY 971
Query: 183 LAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSW 242
+AYH+E+K Y + TST P T + GE+KE T RD R+I P F + L SP SW
Sbjct: 972 VAYHVESKVYAVATSTNTPCTRIPRMTGEEKEFETIERDERYIHPQQEAFSIQLISPVSW 1031
Query: 243 EEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDI 302
E IP L EWEHV C+K VS+ E T+SGL+GY+A GT E+VTCRGRIL+ D+
Sbjct: 1032 EAIPNARIELQEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDV 1091
Query: 303 IEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIA 362
IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH G LV+A+GQKI++W L+ ++LTG+A
Sbjct: 1092 IEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMA 1151
Query: 363 FIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGN 422
FIDT++YI M+SVKN IL D +SI+LLRYQ E +TLSLV+RD KP + S + N
Sbjct: 1152 FIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDN 1211
Query: 423 PSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQ 482
+ +GF++SD+D+N++++MY
Sbjct: 1212 ----------------------------------------AQLGFLVSDRDRNLMVYMYL 1231
Query: 483 PEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGA---RSRFLTWYASLDGA 539
PEA+ES GG RL+++ DFH+G HVNTF++ C+ ++ + + ++ +TW+A+LDG
Sbjct: 1232 PEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGATEGLSKKSVVWENKHITWFATLDGG 1291
Query: 540 LGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLV 599
+G LP+ EK YRRLLMLQN + T H GLNPRAFR N R ++DG L+
Sbjct: 1292 IGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELL 1351
Query: 600 WKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
++L LS ER E+ KKIG+ + ILD+L + + +++HF
Sbjct: 1352 NRYLYLSTMERSELAKKIGTTPDIILDDLLETDRVTAHF 1390
>gi|197245729|gb|AAI68713.1| Cpsf1 protein [Rattus norvegicus]
Length = 1439
Score = 630 bits (1626), Expect = e-178, Method: Compositional matrix adjust.
Identities = 310/641 (48%), Positives = 416/641 (64%), Gaps = 60/641 (9%)
Query: 16 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGALKLRFKKL--------- 62
+V+E+L V+LG +RP LLV ELLIY+AF H +G LK+RFKK+
Sbjct: 841 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKVPHNINFREK 900
Query: 63 KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAH 122
K +++ + + G R+++ RYF +I GY GVF+CGP P WL +T RG LR H
Sbjct: 901 KPKPSKKKAEGCSTEEGSGVRGRVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRLH 960
Query: 123 PMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHF 182
PM IDGP+ + APFHNVNCPRGFLYFN + ELRISVLP +LSYDAPWPVRK+PL+CT H+
Sbjct: 961 PMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHY 1020
Query: 183 LAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSW 242
+AYH+E+K Y + TST P T + GE+KE RD R+I P F + L SP SW
Sbjct: 1021 VAYHVESKVYAVATSTNTPCTRIPRMTGEEKEFEAIERDDRYIHPQQEAFSIQLISPVSW 1080
Query: 243 EEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDI 302
E IP L EWEHV C+K VS+ E T+SGL+GY+A GT E+VTCRGRIL+ D+
Sbjct: 1081 EAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDV 1140
Query: 303 IEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIA 362
IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH G LV+A+GQKI++W L+ ++LTG+A
Sbjct: 1141 IEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMA 1200
Query: 363 FIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGN 422
FIDT++YI M+SVKN IL D +SI+LLRYQ E +TLSLV+RD KP + S + N
Sbjct: 1201 FIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDN 1260
Query: 423 PSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQ 482
+ +GF++SD+D+N++++MY
Sbjct: 1261 ----------------------------------------AQLGFLVSDRDRNLMVYMYL 1280
Query: 483 PEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARS-----RFLTWYASLD 537
PEA+ES GG RL+++ DFH+G HVNTF++ C+ + ++ P +S + +TW+A+LD
Sbjct: 1281 PEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGA--AEGPSKKSVMWENKHITWFATLD 1338
Query: 538 GALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGS 597
G +G LP+ EK YRRLLMLQN + T H GLNPRAFR N R ++DG
Sbjct: 1339 GGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGE 1398
Query: 598 LVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
L+ ++L LS ER E+ KKIG+ + ILD+L + + +++HF
Sbjct: 1399 LLNRYLYLSTMERSELAKKIGTTPDIILDDLLETDRVTAHF 1439
>gi|16751835|ref|NP_444423.1| cleavage and polyadenylation specificity factor subunit 1 isoform 2
[Mus musculus]
gi|17374611|sp|Q9EPU4.1|CPSF1_MOUSE RecName: Full=Cleavage and polyadenylation specificity factor subunit
1; AltName: Full=Cleavage and polyadenylation specificity
factor 160 kDa subunit; Short=CPSF 160 kDa subunit
gi|11762096|gb|AAG40326.1|AF322193_1 cleavage and polyadenylation specificity factor 1 [Mus musculus]
gi|38614159|gb|AAH56388.1| Cleavage and polyadenylation specific factor 1 [Mus musculus]
Length = 1441
Score = 630 bits (1625), Expect = e-178, Method: Compositional matrix adjust.
Identities = 313/656 (47%), Positives = 420/656 (64%), Gaps = 61/656 (9%)
Query: 2 GNFRSHSPSAMDET-IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGALK 56
G R + E +V+E+L V+LG +RP LLV ELLIY+AF H +G LK
Sbjct: 828 GEVRKEEATRQGELPLVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLK 887
Query: 57 LRFKKL---------KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPH 107
+RFKK+ K +++ + + G R+++ RYF +I GY GVF+CGP
Sbjct: 888 VRFKKVPHNINFREKKPKPSKKKAEGCSTEEGSGGRGRVARFRYFEDIYGYSGVFICGPS 947
Query: 108 PAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDA 167
P WL +T RG LR HPM IDGP+ + APFHNVNCPRGFLYFN + ELRISVLP +LSYDA
Sbjct: 948 PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1007
Query: 168 PWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPP 227
PWPVRK+PL+CT H++AYH+E+K Y + TST P T + GE+KE RD R+I P
Sbjct: 1008 PWPVRKIPLRCTAHYVAYHVESKVYAVATSTNTPCTRIPRMTGEEKEFEAIERDDRYIHP 1067
Query: 228 LVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNY 287
F + L SP SWE IP L EWEHV C+K VS+ E T+SGL+GY+A GT
Sbjct: 1068 QQEAFSIQLISPVSWEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQ 1127
Query: 288 SEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQK 347
E+VTCRGRIL+ D+IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH G LV+A+GQK
Sbjct: 1128 GEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQK 1187
Query: 348 IYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARD 407
I++W L+ ++LTG+AFIDT++YI M+SVKN IL D +SI+LLRYQ E +TLSLV+RD
Sbjct: 1188 IFLWSLRASELTGMAFIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRD 1247
Query: 408 YKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGF 467
KP + S + N + +GF
Sbjct: 1248 AKPLEVYSVDFMVDN----------------------------------------AQLGF 1267
Query: 468 MISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARS 527
++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF++ C+ + ++ P +S
Sbjct: 1268 LVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGA--AEGPSKKS 1325
Query: 528 -----RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGK 582
+ +TW+A+LDG +G LP+ EK YRRLLMLQN + T H GLNPRAFR
Sbjct: 1326 VVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVD 1385
Query: 583 GYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
N R ++DG L+ ++L LS ER E+ KKIG+ + ILD+L + + +++HF
Sbjct: 1386 RRILQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDIILDDLLETDRVTAHF 1441
>gi|410987992|ref|XP_004000273.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
[Felis catus]
Length = 1432
Score = 630 bits (1624), Expect = e-178, Method: Compositional matrix adjust.
Identities = 314/643 (48%), Positives = 416/643 (64%), Gaps = 64/643 (9%)
Query: 16 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGALKLRFKKL--------- 62
+V+E+L V+LG +RP LLV ELLIY+AF H +G LK+RFKK+
Sbjct: 834 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKVPHNINFREK 893
Query: 63 --KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELR 120
K A E G RG R+++ RYF +I GY GVF+CGP P WL +T RG LR
Sbjct: 894 KPKPSKKKVEGGSAEEGAG-ARG-RVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALR 951
Query: 121 AHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTP 180
HPM IDGP+ + APFHNVNCPRGFLYFN + ELRISVLP +LSYDAPWPVRK+PL+CT
Sbjct: 952 LHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTA 1011
Query: 181 HFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPF 240
H++AYH+E+K Y + TST P T + GE+KE T RD R+I P F + L SP
Sbjct: 1012 HYVAYHVESKVYAVATSTNMPCTRIPRMTGEEKEFETIERDDRYIHPQQEAFSIQLISPV 1071
Query: 241 SWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLF 300
SWE IP L EWEHV C+K VS+ E T+SGL+GY+A GT E+VTCRGRIL+
Sbjct: 1072 SWEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIM 1131
Query: 301 DIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTG 360
D+IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH G LV+A+GQKI++W L+ ++LTG
Sbjct: 1132 DVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTG 1191
Query: 361 IAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYA 420
+AFIDT++YI M+SVKN IL D +SI+LLRYQ E +TLSLV+RD KP + S +
Sbjct: 1192 MAFIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMV 1251
Query: 421 GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFM 480
N + +GF++SD+D+N++++M
Sbjct: 1252 DN----------------------------------------AQLGFLVSDRDRNLMVYM 1271
Query: 481 YQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARS-----RFLTWYAS 535
Y PEA+ES GG RL+++ DFH+G HVNTF++ C+ + ++ P +S + +TW+A+
Sbjct: 1272 YLPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGA--AEGPSKKSVVWENKHITWFAT 1329
Query: 536 LDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIID 595
LDG +G LP+ EK YRRLLMLQN + T H GLNPRAFR N R ++D
Sbjct: 1330 LDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLD 1389
Query: 596 GSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
G L+ ++L LS ER E+ KKIG+ + IL++L + + +++HF
Sbjct: 1390 GELLNRYLYLSTMERGELAKKIGTTPDIILEDLLETDRVTAHF 1432
>gi|444523674|gb|ELV13604.1| Cleavage and polyadenylation specificity factor subunit 1 [Tupaia
chinensis]
Length = 1469
Score = 629 bits (1623), Expect = e-177, Method: Compositional matrix adjust.
Identities = 311/642 (48%), Positives = 413/642 (64%), Gaps = 62/642 (9%)
Query: 16 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGALKLRFKKL--------- 62
+V+E+L V+LG +RP LLV ELL+Y+AF H +G LK+RFKK+
Sbjct: 871 LVKEVLLVALGSRQSRPYLLVHVDQELLLYEAFPHDSQLGQGNLKVRFKKVPHNINFREK 930
Query: 63 -KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRA 121
+ E+ RG R+++ RYF +I GY GVF+CGP P WL +T RG LR
Sbjct: 931 KLKPSKKKAEGGSTEEGAGARG-RVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRL 989
Query: 122 HPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPH 181
HPM IDGP+ + APFHNVNCPRGFLYFN + ELRISVLP +LSYDAPWPVRK+PL+CT H
Sbjct: 990 HPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAH 1049
Query: 182 FLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFS 241
++AYH+E+K Y + TST P T + GE+KE T RD R+I P F + L SP S
Sbjct: 1050 YVAYHVESKVYAVATSTNAPCTRIPRMTGEEKEFETIERDDRYIHPQQEAFSIQLISPVS 1109
Query: 242 WEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFD 301
WE IP L EWEHV C+K VS+ E T+SGL+GY+A GT E+VTCRGRIL+ D
Sbjct: 1110 WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMD 1169
Query: 302 IIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGI 361
+IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH G LV+A+GQKI++W L+ ++LTG+
Sbjct: 1170 VIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGM 1229
Query: 362 AFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAG 421
AFIDT++YI M+SVKN IL D +SI+LLRYQ E +TLSLV+RD KP + S +
Sbjct: 1230 AFIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVD 1289
Query: 422 NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMY 481
N + +GF++SD+D+N++++MY
Sbjct: 1290 N----------------------------------------AQLGFLVSDRDRNLMVYMY 1309
Query: 482 QPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARS-----RFLTWYASL 536
PEA+ES GG L+++ DFHLG HVNTF++ C+ + + P +S + +TW+A+L
Sbjct: 1310 LPEAKESFGGLLLLRRADFHLGAHVNTFWRTPCRGA--VEGPSKKSVVWENKHITWFATL 1367
Query: 537 DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDG 596
DG +G LP+ EK YRRLLMLQN + T H GLNPRAFR N R ++DG
Sbjct: 1368 DGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRTLQNAVRNVLDG 1427
Query: 597 SLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
L+ ++L LS ER E+ KKIG+ + ILD+L + + +++HF
Sbjct: 1428 ELLSRYLYLSTMERSELAKKIGTTPDIILDDLLETDRVTAHF 1469
>gi|338728513|ref|XP_003365689.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Equus caballus]
Length = 1450
Score = 629 bits (1622), Expect = e-177, Method: Compositional matrix adjust.
Identities = 312/642 (48%), Positives = 413/642 (64%), Gaps = 62/642 (9%)
Query: 16 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGALKLRFKKL--------- 62
+V+E+L V+LG +RP LLV ELLIY+AF H +G LK+RFKK+
Sbjct: 852 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKVPHNINFREK 911
Query: 63 -KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRA 121
E+ RG R+++ RYF +I GY GVF+CGP P WL +T RG LR
Sbjct: 912 KPKPSKKKAEGGGAEEGVGARG-RVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRL 970
Query: 122 HPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPH 181
HPM IDGP+ + APFHNVNCPRGFLYFN + ELRISVLP +LSYDAPWPVRK+PL+CT H
Sbjct: 971 HPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAH 1030
Query: 182 FLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFS 241
++AYH+E+K Y + TST P T + GE+KE T RD R+I P F + L SP S
Sbjct: 1031 YVAYHVESKVYAVATSTNTPCTRIPRMTGEEKEFETIERDDRYIHPQQEAFSIQLISPVS 1090
Query: 242 WEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFD 301
WE IP L EWEHV C+K VS+ E T+SGL+GY+A GT E+VTCRGRIL+ D
Sbjct: 1091 WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMD 1150
Query: 302 IIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGI 361
+IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH G LV+A+GQKI++W L+ ++LTG+
Sbjct: 1151 VIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGM 1210
Query: 362 AFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAG 421
AFIDT++YI M+SVKN IL D +SI+LLRYQ E +TLSLV+RD KP + S +
Sbjct: 1211 AFIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVD 1270
Query: 422 NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMY 481
N + +GF++SD+D+N++++MY
Sbjct: 1271 N----------------------------------------AQLGFLVSDRDRNLMVYMY 1290
Query: 482 QPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARS-----RFLTWYASL 536
PEA+ES GG RL+++ DFH+G HVNTF++ C+ ++ P +S + +TW+A+L
Sbjct: 1291 LPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCR--GAAEGPSKKSVVWENKHITWFATL 1348
Query: 537 DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDG 596
DG +G LP+ EK YRRLLMLQN + T H GLNPRAFR N R ++DG
Sbjct: 1349 DGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDG 1408
Query: 597 SLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
L+ ++L LS ER E+ KKIG+ + ILD+L + + +++HF
Sbjct: 1409 ELLNRYLYLSTMERGELAKKIGTTPDIILDDLLETDRVTAHF 1450
>gi|338728511|ref|XP_001505047.3| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like isoform 1 [Equus caballus]
Length = 1444
Score = 629 bits (1622), Expect = e-177, Method: Compositional matrix adjust.
Identities = 312/642 (48%), Positives = 414/642 (64%), Gaps = 62/642 (9%)
Query: 16 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGALKLRFKKL--------- 62
+V+E+L V+LG +RP LLV ELLIY+AF H +G LK+RFKK+
Sbjct: 846 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKVPHNINFREK 905
Query: 63 -KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRA 121
E+ RG R+++ RYF +I GY GVF+CGP P WL +T RG LR
Sbjct: 906 KPKPSKKKAEGGGAEEGVGARG-RVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRL 964
Query: 122 HPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPH 181
HPM IDGP+ + APFHNVNCPRGFLYFN + ELRISVLP +LSYDAPWPVRK+PL+CT H
Sbjct: 965 HPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAH 1024
Query: 182 FLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFS 241
++AYH+E+K Y + TST P T + GE+KE T RD R+I P F + L SP S
Sbjct: 1025 YVAYHVESKVYAVATSTNTPCTRIPRMTGEEKEFETIERDDRYIHPQQEAFSIQLISPVS 1084
Query: 242 WEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFD 301
WE IP L EWEHV C+K VS+ E T+SGL+GY+A GT E+VTCRGRIL+ D
Sbjct: 1085 WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMD 1144
Query: 302 IIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGI 361
+IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH G LV+A+GQKI++W L+ ++LTG+
Sbjct: 1145 VIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGM 1204
Query: 362 AFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAG 421
AFIDT++YI M+SVKN IL D +SI+LLRYQ E +TLSLV+RD KP + S +
Sbjct: 1205 AFIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVD 1264
Query: 422 NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMY 481
N + +GF++SD+D+N++++MY
Sbjct: 1265 N----------------------------------------AQLGFLVSDRDRNLMVYMY 1284
Query: 482 QPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARS-----RFLTWYASL 536
PEA+ES GG RL+++ DFH+G HVNTF++ C+ + ++ P +S + +TW+A+L
Sbjct: 1285 LPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGA--AEGPSKKSVVWENKHITWFATL 1342
Query: 537 DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDG 596
DG +G LP+ EK YRRLLMLQN + T H GLNPRAFR N R ++DG
Sbjct: 1343 DGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDG 1402
Query: 597 SLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
L+ ++L LS ER E+ KKIG+ + ILD+L + + +++HF
Sbjct: 1403 ELLNRYLYLSTMERGELAKKIGTTPDIILDDLLETDRVTAHF 1444
>gi|417406474|gb|JAA49895.1| Putative mrna cleavage and polyadenylation factor ii complex subunit
cft1 cpsf subunit [Desmodus rotundus]
Length = 1444
Score = 627 bits (1618), Expect = e-177, Method: Compositional matrix adjust.
Identities = 311/641 (48%), Positives = 415/641 (64%), Gaps = 60/641 (9%)
Query: 16 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGALKLRFKKL--------- 62
+V+E+L V+LG +RP LLV ELLIY+AF H +G LK+RFKK+
Sbjct: 846 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFAHDSQLGQGNLKVRFKKVPHNINFREK 905
Query: 63 KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAH 122
K ++ + G R+++ RYF +I GY GVF+CGP P WL +T RG LR H
Sbjct: 906 KPKPSKKKADGGGAEEGAGARGRVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRLH 965
Query: 123 PMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHF 182
PM IDGP+ + APFHNVNCPRGFLYFN + ELRISVLP +LSYDAPWPVRK+PL+CT H+
Sbjct: 966 PMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHY 1025
Query: 183 LAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSW 242
+AYH+E+K Y + TST P T + GE+KE T RD R+I P F + L SP SW
Sbjct: 1026 VAYHVESKVYAVATSTNTPCTRIPRMTGEEKEFETIERDDRYIHPQQEAFSIQLISPVSW 1085
Query: 243 EEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDI 302
E IP L EWEHV C+K VS+ E T+SGL+GY+A GT E+VTCRGRIL+ D+
Sbjct: 1086 EAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDV 1145
Query: 303 IEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIA 362
IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH G LV+A+GQKI++W L+ ++LTG+A
Sbjct: 1146 IEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMA 1205
Query: 363 FIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGN 422
FIDT++YI M+SVKN IL D +SI+LLRYQ + +TLSLV+RD KP + S + N
Sbjct: 1206 FIDTQLYIHQMISVKNFILAADVMKSISLLRYQEDSKTLSLVSRDAKPLEVYSVDFMVDN 1265
Query: 423 PSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQ 482
+ +GF++SD+D+N++++MY
Sbjct: 1266 ----------------------------------------AQLGFLVSDRDRNLMVYMYL 1285
Query: 483 PEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARS-----RFLTWYASLD 537
PEA+ES GG RL+++ DFH+G HVNTF++ C+ + ++ P +S + +TW+A+LD
Sbjct: 1286 PEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGA--AEGPSKKSVVWENKHITWFATLD 1343
Query: 538 GALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGS 597
G +G LP+ EK YRRLLMLQN + T H GLNPRAFR N R I+DG
Sbjct: 1344 GGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRTLQNAVRNILDGE 1403
Query: 598 LVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
L+ ++L LS ER E+ KKIG+ + ILD+L + + +++HF
Sbjct: 1404 LLNRYLYLSTMERSELAKKIGTTPDIILDDLLETDRVTAHF 1444
>gi|392306997|ref|NP_001254722.1| cleavage and polyadenylation specificity factor subunit 1 [Macaca
mulatta]
gi|380812168|gb|AFE77959.1| cleavage and polyadenylation specificity factor subunit 1 [Macaca
mulatta]
gi|383417835|gb|AFH32131.1| cleavage and polyadenylation specificity factor subunit 1 [Macaca
mulatta]
Length = 1442
Score = 627 bits (1617), Expect = e-177, Method: Compositional matrix adjust.
Identities = 310/640 (48%), Positives = 412/640 (64%), Gaps = 58/640 (9%)
Query: 16 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGALKLRFKKL--------- 62
+V+E+L V+LG +RP LLV ELLIY+AF H +G LK+RFKK+
Sbjct: 844 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKVPHNINFREK 903
Query: 63 -KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRA 121
E+ RG R+++ RYF +I GY GVF+CGP P WL +T RG LR
Sbjct: 904 KPKPSKKKAEGGGTEEGAGARG-RVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRL 962
Query: 122 HPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPH 181
HPM IDGPV + APFHNVNCPRGFLYFN + ELRISVLP +LSYDAPWPVRK+PL+CT H
Sbjct: 963 HPMAIDGPVDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAH 1022
Query: 182 FLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFS 241
++AYH+E+K Y + TST P + GE+KE T RD R+I P F + L SP S
Sbjct: 1023 YVAYHVESKVYAVATSTNTPCARIPRMTGEEKEFETIERDERYIHPQQEAFSIQLISPVS 1082
Query: 242 WEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFD 301
WE IP L EWEHV C+K VS+ E T+SGL+GY+A GT E+VTCRGRIL+ D
Sbjct: 1083 WEAIPNARIELQEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMD 1142
Query: 302 IIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGI 361
+IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH G LV+A+GQKI++W L+ ++LTG+
Sbjct: 1143 VIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGM 1202
Query: 362 AFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAG 421
AFIDT++YI M+SVKN IL D +SI+LLRYQ E +TLSLV+RD KP + S +
Sbjct: 1203 AFIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVD 1262
Query: 422 NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMY 481
N + +GF++SD+D+N++++MY
Sbjct: 1263 N----------------------------------------AQLGFLVSDRDRNLMVYMY 1282
Query: 482 QPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGA---RSRFLTWYASLDG 538
PEA+ES GG RL+++ DFH+G HVNTF++ C+ ++ + + ++ +TW+A+LDG
Sbjct: 1283 LPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGATEGLSKKSVVWENKHITWFATLDG 1342
Query: 539 ALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSL 598
+G LP+ EK YRRLLMLQN + T H GLNPRAFR N R ++DG L
Sbjct: 1343 GIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRTLQNAVRNVLDGEL 1402
Query: 599 VWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
+ ++L LS ER E+ KKIG+ + ILD+L + + +++HF
Sbjct: 1403 LNRYLYLSTMERSELAKKIGTTPDIILDDLLETDRVTAHF 1442
>gi|402879380|ref|XP_003903320.1| PREDICTED: LOW QUALITY PROTEIN: cleavage and polyadenylation
specificity factor subunit 1 [Papio anubis]
Length = 1389
Score = 626 bits (1615), Expect = e-176, Method: Compositional matrix adjust.
Identities = 309/639 (48%), Positives = 414/639 (64%), Gaps = 56/639 (8%)
Query: 16 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGALKLRFKKL--------- 62
+V+E+L V+LG +RP LLV ELLIY+AF H +G LK+RFKK+
Sbjct: 791 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKVPHNINFREK 850
Query: 63 KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAH 122
K +++ + G R+++ RYF +I GY GVF+CGP P WL +T RG LR H
Sbjct: 851 KPKPSKKKAEGGGTEEGAGXRGRVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRLH 910
Query: 123 PMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHF 182
PM IDGPV + APFHNVNCPRGFLYFN + ELRISVLP +LSYDAPWPVRK+PL+CT H+
Sbjct: 911 PMAIDGPVDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHY 970
Query: 183 LAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSW 242
+AYH+E+K Y + TST P + GE+KE T RD R+I P F + L SP SW
Sbjct: 971 VAYHVESKVYAVATSTNTPCARIPRMTGEEKEFETIERDERYIHPQQEAFSIQLISPVSW 1030
Query: 243 EEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDI 302
E IP L EWEHV C+K VS+ E T+SGL+GY+A GT E+VTCRGRIL+ D+
Sbjct: 1031 EAIPNARIELQEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDV 1090
Query: 303 IEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIA 362
IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH G LV+A+GQKI++W L+ ++LTG+A
Sbjct: 1091 IEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMA 1150
Query: 363 FIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGN 422
FIDT++YI M+SVKN IL D +SI+LLRYQ E +TLSLV+RD KP + S + N
Sbjct: 1151 FIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDN 1210
Query: 423 PSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQ 482
+ +GF++SD+D+N++++MY
Sbjct: 1211 ----------------------------------------AQLGFLVSDRDRNLMVYMYL 1230
Query: 483 PEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGA---RSRFLTWYASLDGA 539
PEA+ES GG RL+++ DFH+G HVNTF++ C+ ++ + + ++ +TW+A+LDG
Sbjct: 1231 PEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGATEGLSKKSVVWENKHITWFATLDGG 1290
Query: 540 LGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLV 599
+G LP+ EK YRRLLMLQN + T H GLNPRAFR N R ++DG L+
Sbjct: 1291 IGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELL 1350
Query: 600 WKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
++L LS ER E+ KKIG+ + ILD+L + + +++HF
Sbjct: 1351 NRYLYLSTMERSELAKKIGTTPDIILDDLLETDRVTAHF 1389
>gi|334326317|ref|XP_001364707.2| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Monodelphis domestica]
Length = 1449
Score = 626 bits (1615), Expect = e-176, Method: Compositional matrix adjust.
Identities = 312/642 (48%), Positives = 414/642 (64%), Gaps = 62/642 (9%)
Query: 16 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGALKLRFKKL--------- 62
+V+E+L V+LG RP LLV ELLIY+AF H + LK+RFKK+
Sbjct: 851 LVKEVLLVALGNRQTRPYLLVHVDQELLIYEAFAHDSQLGQSNLKVRFKKVPHNINFREK 910
Query: 63 -KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRA 121
E+ RG R+++ RYF +I GY GVF+CGP P WL +T RG LR
Sbjct: 911 KPKPSKKKPEGGGTEEGAGARG-RVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRL 969
Query: 122 HPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPH 181
HPM IDGP+ + APFHNVNCPRGFLYFN + ELRISVLP +LSYDAPWPVRK+PL+CT H
Sbjct: 970 HPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAH 1029
Query: 182 FLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFS 241
++AYH+E+K Y + TST T + GE+KE T RD R+I PL F + L SP S
Sbjct: 1030 YVAYHVESKVYAVATSTNALCTRIPRMTGEEKEFETIERDERYIHPLQEAFSIQLISPVS 1089
Query: 242 WEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFD 301
WE IP L EWEHV C+K VS+ E T+SGL+GY+A GT E+VTCRGRIL+ D
Sbjct: 1090 WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMD 1149
Query: 302 IIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGI 361
+IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH G LV+A+GQKI++W L+ ++LTG+
Sbjct: 1150 VIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGM 1209
Query: 362 AFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAG 421
AFIDT++YI M+SVKN IL D +SI+LLRYQ E +TLSLV+RD KP + S
Sbjct: 1210 AFIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSV----- 1264
Query: 422 NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMY 481
D + + + +GF++SD+D+N++++MY
Sbjct: 1265 -----------------------------------DFMVDSAQLGFLVSDRDRNLMVYMY 1289
Query: 482 QPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARS-----RFLTWYASL 536
PEA+ES GG RL+++ DFH+G HVNTF++ C+ + ++ P +S + +TW+A+L
Sbjct: 1290 LPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGA--AEGPSKKSIVWENKHITWFATL 1347
Query: 537 DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDG 596
DG +G LP+ EK YRRLLMLQN + T H GLNPRAFR N R ++DG
Sbjct: 1348 DGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDG 1407
Query: 597 SLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
L+ ++L LS ER E+ KKIG+ + ILD+L +I+ +++HF
Sbjct: 1408 ELLNRYLYLSTMERGELAKKIGTTPDIILDDLLEIDRVTAHF 1449
>gi|405977622|gb|EKC42064.1| Cleavage and polyadenylation specificity factor subunit 1
[Crassostrea gigas]
Length = 1369
Score = 626 bits (1614), Expect = e-176, Method: Compositional matrix adjust.
Identities = 303/634 (47%), Positives = 422/634 (66%), Gaps = 52/634 (8%)
Query: 17 VQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGA----LKLRFKKLK----VLFVS 68
++ELL V LG +RP LL R + +L IY+AF +P+ + LKLRFKK++ +
Sbjct: 776 LKELLMVGLGYKDSRPHLLARVEDDLYIYEAFSYPQSSIDNHLKLRFKKIQHDLILREKR 835
Query: 69 DRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDG 128
+SK+ + + ++ +MRYF ++AGY GVF+CG +P W+F+TSRG LR HPM IDG
Sbjct: 836 SKSKKKDPEEFQKEEKKVGKMRYFKDVAGYSGVFVCGAYPHWIFVTSRGSLRIHPMGIDG 895
Query: 129 PVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLE 188
PV + FHN+NCP GFLYFN ELRISVLPTHL+YDAPWPVRKVPL+CTPHF+AYH E
Sbjct: 896 PVWCFSEFHNINCPHGFLYFNKMGELRISVLPTHLTYDAPWPVRKVPLRCTPHFVAYHFE 955
Query: 189 TKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQT 248
K Y +VTST E K ED+E T +D RFI P + +F + L+SP SWE +P T
Sbjct: 956 NKIYAVVTSTPEICNKLPKTTTEDREWDTIEKDERFIYPTIPRFTLQLYSPTSWEVVPNT 1015
Query: 249 NFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPE 308
EWEHV+ +K + + E TLSG + YI +GTN + E+VT RGR+++ DIIEVVPE
Sbjct: 1016 KIECEEWEHVVSMKTIRLRSEETLSGFKSYIVMGTNLSLGEEVTSRGRVIIADIIEVVPE 1075
Query: 309 PGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEV 368
PG PLTK+KIK +Y KEQKGPVTA+ + G L+TA+GQK+YIWQLKDNDL G+AFIDT +
Sbjct: 1076 PGMPLTKHKIKTLYEKEQKGPVTALADINGLLITAIGQKLYIWQLKDNDLMGVAFIDTHI 1135
Query: 369 YIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGII 428
YI ++V++K++IL GD +S+++ +YQ E++ LS+V+RD +P + + + N
Sbjct: 1136 YIHTLVTIKHIILAGDILKSVSVYQYQEEHKVLSIVSRDPRPLEVYTADFLIDN------ 1189
Query: 429 DGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARES 488
+ + ++SD+ KN+V++ YQPEARES
Sbjct: 1190 ----------------------------------TQLCCLVSDRMKNLVVYSYQPEARES 1215
Query: 489 NGGHRLIKKTDFHLGQHVNTFFKIRCK---PSSISDAPGA-RSRFLTWYASLDGALGFFL 544
+GG RLI+K DF+ G +V++ F++RCK PSS GA R +T++A+LDG+LGF L
Sbjct: 1216 HGGQRLIRKADFNAGSNVSSMFRVRCKLYDPSSDKRMTGAPEKRHITYFATLDGSLGFVL 1275
Query: 545 PLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQ 604
PL EK YRRL MLQN +VTH H GLNPR++R G NP + I+DG L+WK+
Sbjct: 1276 PLSEKVYRRLFMLQNALVTHIPHVAGLNPRSYRHVIGTFPELRNPQKNILDGELLWKYTN 1335
Query: 605 LSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
LS+ E++EI K++G+ ++ I+D+L +I+ L++HF
Sbjct: 1336 LSIMEKIEIAKRLGTSNDQIMDDLMEIDRLTAHF 1369
>gi|157110889|ref|XP_001651294.1| cleavage and polyadenylation specificity factor cpsf [Aedes aegypti]
gi|108883895|gb|EAT48120.1| AAEL000832-PA [Aedes aegypti]
Length = 1417
Score = 625 bits (1613), Expect = e-176, Method: Compositional matrix adjust.
Identities = 300/637 (47%), Positives = 420/637 (65%), Gaps = 56/637 (8%)
Query: 18 QELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGALKLRFKKLK------VLFVSDRS 71
+E+L V+LG HG RP+L VR +++LL+Y+ +R+ KG LKLRF+++ + ++ R
Sbjct: 821 KEILMVALGHHGTRPMLFVRLENDLLVYRVYRYSKGHLKLRFRRVPSGVTGPIFKIAPRQ 880
Query: 72 KRANEQPGLPRGV--------RISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHP 123
+Q G IS +RYF+N+ GY GV +CG P + LTSRGELRAH
Sbjct: 881 SAPTDQEGEKPDEHSTKIMYENISMIRYFNNVNGYNGVAVCGEKPYIMLLTSRGELRAHR 940
Query: 124 MTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFL 183
+ + APF+NVNCP GFLYF+ + EL+I+V P +LSYD+ WPVRK+PL+ +P +
Sbjct: 941 LYAKTIMKGFAPFNNVNCPNGFLYFDEQYELKIAVFPGYLSYDSIWPVRKIPLRSSPKQI 1000
Query: 184 AYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWE 243
YH E K YC+V E YY+FNGEDKEL + + RF+ P+ +F V L +P +WE
Sbjct: 1001 VYHKENKVYCVVMDAEEVCNKYYRFNGEDKELTEENKGERFLYPMAHKFSVVLVTPSAWE 1060
Query: 244 EIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDII 303
IP+T+ L EWEHV+ LKNVS+ YEG SG + YIA+GTN+NYSED+T RGR+LL+DII
Sbjct: 1061 IIPETSINLDEWEHVIALKNVSLSYEGARSGFKEYIAVGTNFNYSEDITSRGRLLLYDII 1120
Query: 304 EVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAF 363
EVVPEPG+PLT+ K K + KEQKGPV+AI HV+GFLV AVGQK+Y+WQLKD+DL G+AF
Sbjct: 1121 EVVPEPGKPLTRYKFKEVIVKEQKGPVSAITHVSGFLVGAVGQKVYLWQLKDDDLVGVAF 1180
Query: 364 IDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNP 423
IDT +++ +VS+K+LILV D +S++LLR+Q +YRTLSLV+RDY+P Y N
Sbjct: 1181 IDTNIFVHQLVSIKSLILVADVYKSVSLLRFQEDYRTLSLVSRDYQPLNVFQIEYVVDN- 1239
Query: 424 SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQP 483
HN +GF++SD+ N++ +MYQP
Sbjct: 1240 -------------------------------HN--------LGFLVSDEQCNIITYMYQP 1260
Query: 484 EARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGA--RSRFLTWYASLDGALG 541
E+RES GG RL++K D+H+GQ +N+ F+++C + + + T++A+LDG +G
Sbjct: 1261 ESRESFGGQRLLRKCDYHVGQKINSMFRVQCDFHEMDYKRNSNYECKHTTYFATLDGGIG 1320
Query: 542 FFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWK 601
+ LPLPEK YRRL MLQNV++TH+ H GLNP+AFRT K NP+R ++DG L+W
Sbjct: 1321 YVLPLPEKTYRRLFMLQNVLMTHSPHLCGLNPKAFRTIKTVKKLPINPARCVVDGDLIWT 1380
Query: 602 FLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
FL L E+LE+ KKIG++ +DI +L +IE+++ F
Sbjct: 1381 FLTLPANEKLEVAKKIGTRIDDICADLMEIESVTHVF 1417
>gi|345779232|ref|XP_532356.3| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
[Canis lupus familiaris]
Length = 1460
Score = 625 bits (1612), Expect = e-176, Method: Compositional matrix adjust.
Identities = 311/641 (48%), Positives = 415/641 (64%), Gaps = 60/641 (9%)
Query: 16 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGALKLRFKKL--------- 62
+V+E+L V+LG +RP LLV ELLIY+AF H +G LK+RFKK+
Sbjct: 862 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKVPHNINFREK 921
Query: 63 KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAH 122
K +++ + G R+++ RYF +I GY GVF+CGP P WL +T RG LR H
Sbjct: 922 KPKPSKKKAEGGGAEEGAGARGRVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRLH 981
Query: 123 PMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHF 182
PM IDGP+ + APFHNVNCPRGFLYFN + ELRISVLP +LSYDAPWPVRK+PL+CT H+
Sbjct: 982 PMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHY 1041
Query: 183 LAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSW 242
+AYH+E+K Y + TST P T + GE+KE T RD R+I P F + L SP SW
Sbjct: 1042 VAYHVESKVYAVATSTNMPCTRIPRMTGEEKEFETIERDDRYIHPQQEAFSIQLISPVSW 1101
Query: 243 EEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDI 302
E IP L EWEHV C+K VS+ E T+SGL+GY+A GT E+VTCRGRIL+ D+
Sbjct: 1102 EAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDV 1161
Query: 303 IEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIA 362
IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH G LV+A+GQKI++W L+ ++LTG+A
Sbjct: 1162 IEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMA 1221
Query: 363 FIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGN 422
FIDT++YI M+SVKN IL D +SI+LLRYQ E +TLSLV+RD KP + S + N
Sbjct: 1222 FIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDN 1281
Query: 423 PSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQ 482
+ +GF++SD+D+N++++MY
Sbjct: 1282 ----------------------------------------AQLGFLVSDRDRNLMVYMYL 1301
Query: 483 PEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARS-----RFLTWYASLD 537
PEA+ES GG RL+++ DFH+G HVNTF++ C+ ++ P +S + +TW+A+LD
Sbjct: 1302 PEAKESFGGMRLLRRADFHVGAHVNTFWRTPCR--GAAEGPSKKSVVWENKHITWFATLD 1359
Query: 538 GALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGS 597
G +G LP+ EK YRRLLMLQN + T H GLNPRAFR N R ++DG
Sbjct: 1360 GGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGE 1419
Query: 598 LVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
L+ ++L LS ER E+ KKIG+ + ILD+L + + +++HF
Sbjct: 1420 LLNRYLYLSTMERGELAKKIGTTPDIILDDLLETDRVTAHF 1460
>gi|119602512|gb|EAW82106.1| cleavage and polyadenylation specific factor 1, 160kDa, isoform CRA_a
[Homo sapiens]
gi|119602513|gb|EAW82107.1| cleavage and polyadenylation specific factor 1, 160kDa, isoform CRA_a
[Homo sapiens]
gi|119602514|gb|EAW82108.1| cleavage and polyadenylation specific factor 1, 160kDa, isoform CRA_a
[Homo sapiens]
Length = 1365
Score = 624 bits (1609), Expect = e-176, Method: Compositional matrix adjust.
Identities = 309/639 (48%), Positives = 414/639 (64%), Gaps = 56/639 (8%)
Query: 16 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGALKLRFKKL--------- 62
+V+E+L V+LG +RP LLV ELLIY+AF H +G LK+RFKK+
Sbjct: 767 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKVPHNINFREK 826
Query: 63 KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAH 122
K +++ + G R+++ RYF +I GY GVF+CGP P WL +T RG LR H
Sbjct: 827 KPKPSKKKAEGGGAEEGAGARGRVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRLH 886
Query: 123 PMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHF 182
PM IDGPV + APFHNVNCPRGFLYFN + ELRISVLP +LSYDAPWPVRK+PL+CT H+
Sbjct: 887 PMAIDGPVDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHY 946
Query: 183 LAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSW 242
+AYH+E+K Y + TST P + GE+KE T RD R+I P F + L SP SW
Sbjct: 947 VAYHVESKVYAVATSTNTPCARIPRMTGEEKEFETIERDERYIHPQQEAFSIQLISPVSW 1006
Query: 243 EEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDI 302
E IP L EWEHV C+K VS+ E T+SGL+GY+A GT E+VTCRGRIL+ D+
Sbjct: 1007 EAIPNARIELQEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDV 1066
Query: 303 IEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIA 362
IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH G LV+A+GQKI++W L+ ++LTG+A
Sbjct: 1067 IEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMA 1126
Query: 363 FIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGN 422
FIDT++YI M+SVKN IL D +SI+LLRYQ E +TLSLV+RD KP + S + N
Sbjct: 1127 FIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDN 1186
Query: 423 PSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQ 482
+ +GF++SD+D+N++++MY
Sbjct: 1187 ----------------------------------------AQLGFLVSDRDRNLMVYMYL 1206
Query: 483 PEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGA---RSRFLTWYASLDGA 539
PEA+ES GG RL+++ DFH+G HVNTF++ C+ ++ + + ++ +TW+A+LDG
Sbjct: 1207 PEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGATEGLSKKSVVWENKHITWFATLDGG 1266
Query: 540 LGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLV 599
+G LP+ EK YRRLLMLQN + T H GLNPRAFR N R ++DG L+
Sbjct: 1267 IGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELL 1326
Query: 600 WKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
++L LS ER E+ KKIG+ + ILD+L + + +++HF
Sbjct: 1327 NRYLYLSTMERSELAKKIGTTPDIILDDLLETDRVTAHF 1365
>gi|397497327|ref|XP_003819464.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
[Pan paniscus]
gi|410336497|gb|JAA37195.1| cleavage and polyadenylation specific factor 1, 160kDa [Pan
troglodytes]
Length = 1442
Score = 624 bits (1609), Expect = e-176, Method: Compositional matrix adjust.
Identities = 309/639 (48%), Positives = 414/639 (64%), Gaps = 56/639 (8%)
Query: 16 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGALKLRFKKL--------- 62
+V+E+L V+LG +RP LLV ELLIY+AF H +G LK+RFKK+
Sbjct: 844 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKVPHNINFREK 903
Query: 63 KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAH 122
K +++ + G R+++ RYF +I GY GVF+CGP P WL +T RG LR H
Sbjct: 904 KPKPSKKKAEGGGAEEGAGARGRVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRLH 963
Query: 123 PMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHF 182
PM IDGPV + APFHNVNCPRGFLYFN + ELRISVLP +LSYDAPWPVRK+PL+CT H+
Sbjct: 964 PMAIDGPVDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHY 1023
Query: 183 LAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSW 242
+AYH+E+K Y + TST P + GE+KE T RD R+I P F + L SP SW
Sbjct: 1024 VAYHVESKVYAVATSTNTPCARIPRMTGEEKEFETIERDERYIHPQQEAFSIQLISPVSW 1083
Query: 243 EEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDI 302
E IP L EWEHV C+K VS+ E T+SGL+GY+A GT E+VTCRGRIL+ D+
Sbjct: 1084 EAIPNARIELQEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDV 1143
Query: 303 IEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIA 362
IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH G LV+A+GQKI++W L+ ++LTG+A
Sbjct: 1144 IEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMA 1203
Query: 363 FIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGN 422
FIDT++YI M+SVKN IL D +SI+LLRYQ E +TLSLV+RD KP + S + N
Sbjct: 1204 FIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDN 1263
Query: 423 PSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQ 482
+ +GF++SD+D+N++++MY
Sbjct: 1264 ----------------------------------------AQLGFLVSDRDRNLMVYMYL 1283
Query: 483 PEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGA---RSRFLTWYASLDGA 539
PEA+ES GG RL+++ DFH+G HVNTF++ C+ ++ + + ++ +TW+A+LDG
Sbjct: 1284 PEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGATEGLSKKSVVWENKHITWFATLDGG 1343
Query: 540 LGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLV 599
+G LP+ EK YRRLLMLQN + T H GLNPRAFR N R ++DG L+
Sbjct: 1344 IGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELL 1403
Query: 600 WKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
++L LS ER E+ KKIG+ + ILD+L + + +++HF
Sbjct: 1404 NRYLYLSTMERSELAKKIGTTPDIILDDLLETDRVTAHF 1442
>gi|1045574|gb|AAC50293.1| cleavage and polyadenylation specificity factor [Homo sapiens]
Length = 1442
Score = 624 bits (1608), Expect = e-176, Method: Compositional matrix adjust.
Identities = 309/638 (48%), Positives = 411/638 (64%), Gaps = 55/638 (8%)
Query: 16 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGALKLRFKKL--------- 62
+V+E+L V+LG +RP LLV ELLIY+AF H +G LK+RFKK+
Sbjct: 845 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKVPHNINFREK 904
Query: 63 KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAH 122
K +++ + G R+++ RYF +I GY GVF+CGP P WL +T RG LR H
Sbjct: 905 KPKPSKKKAEGGGAEEGAGARGRVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRLH 964
Query: 123 PMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHF 182
PM IDGPV + APFHNVNCPRGFLYFN + ELRISVLP +LSYDAPWPVRK+PL+CT H+
Sbjct: 965 PMAIDGPVDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHY 1024
Query: 183 LAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSW 242
+AYH+E+K Y + TST P + GE+KE T RD R+I P F + L SP SW
Sbjct: 1025 VAYHVESKVYAVATSTNTPCARIPRMTGEEKEFETIERDERYIHPQQEAFSIQLISPVSW 1084
Query: 243 EEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDI 302
E IP L EWEHV C+K VS+ E T+SGL+GY+A GT E+VTCRGRIL+ D+
Sbjct: 1085 EAIPNARIELQEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDV 1144
Query: 303 IEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIA 362
IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH G LV+A+GQKI++W L+ ++LTG+A
Sbjct: 1145 IEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMA 1204
Query: 363 FIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGN 422
FIDT++YI M+SVKN IL D +SI+LLRYQ E +TLSLV+RD KP + S + N
Sbjct: 1205 FIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDN 1264
Query: 423 PSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQ 482
+ +GF++SD+D+N++++MY
Sbjct: 1265 ----------------------------------------AQLGFLVSDRDRNLMVYMYL 1284
Query: 483 PEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGA--RSRFLTWYASLDGAL 540
PEA+ES GG RL+++ DFH+G HVNTF++ C+ + ++ +TW+A+LDG +
Sbjct: 1285 PEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRATEGLSKKSVVWENKHITWFATLDGGI 1344
Query: 541 GFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVW 600
G LP+ EK YRRLLMLQN + T H GLNPRAFR N R ++DG L+
Sbjct: 1345 GLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELLN 1404
Query: 601 KFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
++L LS ER E+ KKIG+ + ILD+L + + +++HF
Sbjct: 1405 RYLYLSTMERSELAKKIGTTPDIILDDLLETDRVTAHF 1442
>gi|56676371|ref|NP_037423.2| cleavage and polyadenylation specificity factor subunit 1 [Homo
sapiens]
gi|23503048|sp|Q10570.2|CPSF1_HUMAN RecName: Full=Cleavage and polyadenylation specificity factor subunit
1; AltName: Full=Cleavage and polyadenylation specificity
factor 160 kDa subunit; Short=CPSF 160 kDa subunit
gi|16878041|gb|AAH17232.1| Cleavage and polyadenylation specific factor 1, 160kDa [Homo sapiens]
gi|119602516|gb|EAW82110.1| cleavage and polyadenylation specific factor 1, 160kDa, isoform CRA_c
[Homo sapiens]
gi|123993607|gb|ABM84405.1| cleavage and polyadenylation specific factor 1, 160kDa [synthetic
construct]
gi|123999626|gb|ABM87355.1| cleavage and polyadenylation specific factor 1, 160kDa [synthetic
construct]
gi|307684758|dbj|BAJ20419.1| cleavage and polyadenylation specific factor 1, 160kDa [synthetic
construct]
Length = 1443
Score = 623 bits (1607), Expect = e-176, Method: Compositional matrix adjust.
Identities = 309/639 (48%), Positives = 414/639 (64%), Gaps = 56/639 (8%)
Query: 16 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGALKLRFKKL--------- 62
+V+E+L V+LG +RP LLV ELLIY+AF H +G LK+RFKK+
Sbjct: 845 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKVPHNINFREK 904
Query: 63 KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAH 122
K +++ + G R+++ RYF +I GY GVF+CGP P WL +T RG LR H
Sbjct: 905 KPKPSKKKAEGGGAEEGAGARGRVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRLH 964
Query: 123 PMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHF 182
PM IDGPV + APFHNVNCPRGFLYFN + ELRISVLP +LSYDAPWPVRK+PL+CT H+
Sbjct: 965 PMAIDGPVDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHY 1024
Query: 183 LAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSW 242
+AYH+E+K Y + TST P + GE+KE T RD R+I P F + L SP SW
Sbjct: 1025 VAYHVESKVYAVATSTNTPCARIPRMTGEEKEFETIERDERYIHPQQEAFSIQLISPVSW 1084
Query: 243 EEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDI 302
E IP L EWEHV C+K VS+ E T+SGL+GY+A GT E+VTCRGRIL+ D+
Sbjct: 1085 EAIPNARIELQEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDV 1144
Query: 303 IEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIA 362
IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH G LV+A+GQKI++W L+ ++LTG+A
Sbjct: 1145 IEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMA 1204
Query: 363 FIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGN 422
FIDT++YI M+SVKN IL D +SI+LLRYQ E +TLSLV+RD KP + S + N
Sbjct: 1205 FIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDN 1264
Query: 423 PSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQ 482
+ +GF++SD+D+N++++MY
Sbjct: 1265 ----------------------------------------AQLGFLVSDRDRNLMVYMYL 1284
Query: 483 PEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGA---RSRFLTWYASLDGA 539
PEA+ES GG RL+++ DFH+G HVNTF++ C+ ++ + + ++ +TW+A+LDG
Sbjct: 1285 PEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGATEGLSKKSVVWENKHITWFATLDGG 1344
Query: 540 LGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLV 599
+G LP+ EK YRRLLMLQN + T H GLNPRAFR N R ++DG L+
Sbjct: 1345 IGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELL 1404
Query: 600 WKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
++L LS ER E+ KKIG+ + ILD+L + + +++HF
Sbjct: 1405 NRYLYLSTMERSELAKKIGTTPDIILDDLLETDRVTAHF 1443
>gi|410911304|ref|XP_003969130.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Takifugu rubripes]
Length = 1444
Score = 623 bits (1607), Expect = e-176, Method: Compositional matrix adjust.
Identities = 312/641 (48%), Positives = 425/641 (66%), Gaps = 59/641 (9%)
Query: 16 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAF----RHPKGALKLRFKKL--------- 62
+V+E+ VSLG + +RP LLV ELLIY+AF + P+ LK+RFKK+
Sbjct: 845 LVKEVTLVSLGYNHSRPYLLVHVDQELLIYEAFPYDQQQPQNNLKVRFKKVPHNINFREK 904
Query: 63 --KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELR 120
K+ A E RG RIS+ RYF +I+GY GVF+CGP P W+ +TSRG LR
Sbjct: 905 KSKLRKDKKAEGTAAEDSVAARG-RISRFRYFEDISGYSGVFICGPSPHWMLVTSRGALR 963
Query: 121 AHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTP 180
HPM+IDGP+ + +PFHN+NCP+GFLYFN + ELRISVLPT+LSYDAPWPVRK+PL+CT
Sbjct: 964 LHPMSIDGPIESFSPFHNINCPKGFLYFNKQGELRISVLPTYLSYDAPWPVRKIPLRCTV 1023
Query: 181 HFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPF 240
H+++YH+E+K Y + TS E T + GE+KE T RD R+I P +F + L SP
Sbjct: 1024 HYVSYHVESKVYAVCTSLKELCTRIPRMTGEEKEYETIERDERYINPQQDKFSIQLISPV 1083
Query: 241 SWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLF 300
SWE IP T L EWE+V C+K V++ + T+SGL+GYIA GT E+VTCRGRIL+
Sbjct: 1084 SWEAIPNTRIDLEEWEYVTCMKTVALRSQETVSGLKGYIAAGTCLMQGEEVTCRGRILIL 1143
Query: 301 DIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTG 360
D+IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH G+LV+A+GQKI++W LKDNDLTG
Sbjct: 1144 DVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGYLVSAIGQKIFLWVLKDNDLTG 1203
Query: 361 IAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYA 420
+AFIDT+++I M+S+KN IL D +S++LLRYQ E +TLSLV+RD KP + S +
Sbjct: 1204 MAFIDTQLHIHQMMSIKNFILAADLMKSVSLLRYQEESKTLSLVSRDAKPLEVYSIEFMV 1263
Query: 421 GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFM 480
N + +GF++SD+DKN+ ++M
Sbjct: 1264 DN----------------------------------------NQLGFLVSDRDKNLYVYM 1283
Query: 481 YQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARS---RFLTWYASLD 537
Y PEA+ES GG RL+++ DF+ G ++NTF+++ C+ + + + A + + +TW+A+LD
Sbjct: 1284 YLPEAKESFGGMRLLRRADFNAGANINTFWRMPCRGALEAGSRKAMTWDNKHITWFATLD 1343
Query: 538 GALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGS 597
G +G LP+ EK YRRLLMLQN + T SH GLNP+AFR NP + I+DG
Sbjct: 1344 GGVGLLLPMQEKTYRRLLMLQNALTTMLSHHAGLNPKAFRMLHCDRRSLQNPVKNILDGE 1403
Query: 598 LVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
L+ K+L LS+ ER E+ KKIG+ + ILD+L DI+ +++HF
Sbjct: 1404 LLNKYLYLSMMERSELAKKIGTTQDIILDDLLDIDRVTAHF 1444
>gi|426361048|ref|XP_004047737.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
[Gorilla gorilla gorilla]
Length = 1440
Score = 623 bits (1606), Expect = e-175, Method: Compositional matrix adjust.
Identities = 309/639 (48%), Positives = 414/639 (64%), Gaps = 56/639 (8%)
Query: 16 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGALKLRFKKL--------- 62
+V+E+L V+LG +RP LLV ELLIY+AF H +G LK+RFKK+
Sbjct: 842 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKVPHNINFREK 901
Query: 63 KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAH 122
K +++ + G R+++ RYF +I GY GVF+CGP P WL +T RG LR H
Sbjct: 902 KPKPSKKKAEGGGAEEGAGARGRVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRLH 961
Query: 123 PMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHF 182
PM IDGPV + APFHNVNCPRGFLYFN + ELRISVLP +LSYDAPWPVRK+PL+CT H+
Sbjct: 962 PMAIDGPVDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHY 1021
Query: 183 LAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSW 242
+AYH+E+K Y + TST P + GE+KE T RD R+I P F + L SP SW
Sbjct: 1022 VAYHVESKVYAVATSTNTPCARIPRMTGEEKEFETIERDERYIHPQQEAFSIQLISPVSW 1081
Query: 243 EEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDI 302
E IP L EWEHV C+K VS+ E T+SGL+GY+A GT E+VTCRGRIL+ D+
Sbjct: 1082 EAIPNARIELQEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDV 1141
Query: 303 IEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIA 362
IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH G LV+A+GQKI++W L+ ++LTG+A
Sbjct: 1142 IEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMA 1201
Query: 363 FIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGN 422
FIDT++YI M+SVKN IL D +SI+LLRYQ E +TLSLV+RD KP + S + N
Sbjct: 1202 FIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDN 1261
Query: 423 PSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQ 482
+ +GF++SD+D+N++++MY
Sbjct: 1262 ----------------------------------------AQLGFLVSDRDRNLMVYMYL 1281
Query: 483 PEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGA---RSRFLTWYASLDGA 539
PEA+ES GG RL+++ DFH+G HVNTF++ C+ ++ + + ++ +TW+A+LDG
Sbjct: 1282 PEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGATEGLSKKSVVWENKHITWFATLDGG 1341
Query: 540 LGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLV 599
+G LP+ EK YRRLLMLQN + T H GLNPRAFR N R ++DG L+
Sbjct: 1342 IGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELL 1401
Query: 600 WKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
++L LS ER E+ KKIG+ + ILD+L + + +++HF
Sbjct: 1402 NRYLYLSTMERSELAKKIGTTPDIILDDLLETDRVTAHF 1440
>gi|195056749|ref|XP_001995154.1| GH22991 [Drosophila grimshawi]
gi|193899360|gb|EDV98226.1| GH22991 [Drosophila grimshawi]
Length = 1426
Score = 623 bits (1606), Expect = e-175, Method: Compositional matrix adjust.
Identities = 304/631 (48%), Positives = 412/631 (65%), Gaps = 51/631 (8%)
Query: 19 ELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGALKLRFKKLKVLFVSDRSKRANEQP 78
EL V LG HG RPLLLVRT+ ELLIYQ FR+ KG LK+RF+KL+ L + ++ E
Sbjct: 836 ELCLVGLGQHGERPLLLVRTRLELLIYQVFRYAKGHLKIRFRKLEQLHLLEQQPTHIELD 895
Query: 79 GLP---------RGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGP 129
G + + ++RYF+N+ G G+ +CG +P ++FLTSRGELR H + +G
Sbjct: 896 GEDVEEAESYNMQAKYVQKLRYFANVGGLAGIMVCGVNPCFVFLTSRGELRIHRLLGNGD 955
Query: 130 VSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLET 189
V + A F+NVN P GFLYF+ EL+ISVLP++LSYDA WPVRKVPL+CTP L YH E
Sbjct: 956 VRSFAAFNNVNIPHGFLYFDTTYELKISVLPSYLSYDAAWPVRKVPLRCTPRQLVYHREN 1015
Query: 190 KTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTN 249
+ YC++T EP T YY+FNGEDKEL + R RFI P+ S F + L SP +WE +P +
Sbjct: 1016 RVYCLITQKEEPMTKYYRFNGEDKELSEECRGERFIYPIGSLFEMVLISPETWEIVPDAS 1075
Query: 250 FPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEP 309
WEHV K V + YEGT SGL+ Y+ +GTN+NYSED+T RG I ++DIIEVVPEP
Sbjct: 1076 IQFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNYSEDITSRGNIHIYDIIEVVPEP 1135
Query: 310 GQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVY 369
G+P+TK K+K ++ KEQKGPV+AI V GFLVT +GQKIYIWQL+D DL G+AFIDT +Y
Sbjct: 1136 GKPMTKFKLKEVFKKEQKGPVSAISDVVGFLVTGLGQKIYIWQLRDGDLIGVAFIDTNIY 1195
Query: 370 IASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIID 429
+ +++VK+LI + D +SI+LLR+Q E+RTLSL +RD+ P + + N
Sbjct: 1196 VHQIITVKSLIFIADVYKSISLLRFQEEHRTLSLASRDFNPMEVFGIEFMVDN------- 1248
Query: 430 GSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESN 489
S++GF+++D ++N++++MYQPEARES
Sbjct: 1249 ---------------------------------SNLGFLVTDAERNLIVYMYQPEARESL 1275
Query: 490 GGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPG--ARSRFLTWYASLDGALGFFLPLP 547
GG +L++K D+HLGQ VNT F+++C + ++ L Y SLDGALG+ LPLP
Sbjct: 1276 GGQKLLRKADYHLGQVVNTMFRVQCHQRGLHQRQPFLYENKHLVIYGSLDGALGYCLPLP 1335
Query: 548 EKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSL 607
EK YRR LMLQNV++++ H GLNP+ +RT K NPSR IIDG L+W F L+
Sbjct: 1336 EKVYRRFLMLQNVLLSYQDHLCGLNPKEYRTIKSVKKLGINPSRCIIDGDLIWSFRMLAH 1395
Query: 608 GERLEICKKIGSKHNDILDELYDIEALSSHF 638
ER E+ KKIG++ +IL +L +IE +S+ F
Sbjct: 1396 SERNEVAKKIGTRTEEILADLLEIERISAVF 1426
>gi|395512730|ref|XP_003760588.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
[Sarcophilus harrisii]
Length = 1449
Score = 622 bits (1605), Expect = e-175, Method: Compositional matrix adjust.
Identities = 311/641 (48%), Positives = 415/641 (64%), Gaps = 60/641 (9%)
Query: 16 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGALKLRFKKL--------- 62
+V+E+L V+LG RP LLV ELLIY+AF H + LK+RFKK+
Sbjct: 851 LVKEVLLVALGNRQTRPYLLVHVDQELLIYEAFAHDSQLGQSNLKVRFKKVPHNINFREK 910
Query: 63 KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAH 122
K + + + G R+++ RYF +I GY GVF+CGP P WL +T RG LR H
Sbjct: 911 KPKPSKKKPEGGGAEEGAGARGRVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRLH 970
Query: 123 PMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHF 182
PM IDGP+ + APFHNVNCPRGFLYFN + ELRISVLP +LSYDAPWPVRK+PL+CT H+
Sbjct: 971 PMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHY 1030
Query: 183 LAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSW 242
+AYH+E+K Y + TST T + GE+KE T RD R+I PL F + L SP SW
Sbjct: 1031 VAYHVESKVYAVATSTNALCTRIPRMTGEEKEFETIERDDRYIHPLQEAFSIQLISPVSW 1090
Query: 243 EEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDI 302
E IP L EWEHV C+K VS+ E T+SGL+GY+A GT E+VTCRGRIL+ D+
Sbjct: 1091 EAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDV 1150
Query: 303 IEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIA 362
IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH G LV+A+GQKI++W L+ ++LTG+A
Sbjct: 1151 IEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMA 1210
Query: 363 FIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGN 422
FIDT++YI M+SVKN IL D +SI+LLRYQ E +TLSLV+RD KP + S
Sbjct: 1211 FIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSV------ 1264
Query: 423 PSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQ 482
D + + + +GF++SD+D+N++++MY
Sbjct: 1265 ----------------------------------DFMVDSAQLGFLVSDRDRNLMVYMYL 1290
Query: 483 PEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARS-----RFLTWYASLD 537
PEA+ES GG RL+++ DFH+G HVNTF++ C+ + ++ P +S + +TW+A+LD
Sbjct: 1291 PEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGA--AEGPTKKSIVWENKHITWFATLD 1348
Query: 538 GALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGS 597
G +G LP+ EK YRRLLMLQN + T H GLNPRAFR N R ++DG
Sbjct: 1349 GGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGE 1408
Query: 598 LVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
L+ ++L LS ER E+ KKIG+ + ILD+L +I+ +++HF
Sbjct: 1409 LLNRYLYLSTMERGELAKKIGTTPDIILDDLLEIDRVTAHF 1449
>gi|195122290|ref|XP_002005645.1| GI18959 [Drosophila mojavensis]
gi|193910713|gb|EDW09580.1| GI18959 [Drosophila mojavensis]
Length = 1431
Score = 622 bits (1604), Expect = e-175, Method: Compositional matrix adjust.
Identities = 304/637 (47%), Positives = 413/637 (64%), Gaps = 62/637 (9%)
Query: 19 ELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGALKLRFKKLKVLFV----------- 67
EL V LG HG+RPLLLVRT+ ELLIYQ FR+ KG LK+RF+KL+ L +
Sbjct: 840 ELSLVGLGQHGDRPLLLVRTRLELLIYQVFRYAKGHLKIRFRKLEQLHLLDQQPTHIELI 899
Query: 68 ----SDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHP 123
+D ++ N QP + ++RYF+N+ G G+ +CG +P ++FLT+RGELR H
Sbjct: 900 NEEETDEAESYNMQPKY-----VQKLRYFNNVGGLAGIMVCGVNPCFIFLTARGELRIHR 954
Query: 124 MTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFL 183
+ + V + A F+NVN P GFLYF+ EL+ISVLPT+LSYDA WPVRKVPL+CTP L
Sbjct: 955 LLGNAEVRSFAAFNNVNIPHGFLYFDTTYELKISVLPTYLSYDAAWPVRKVPLRCTPRQL 1014
Query: 184 AYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWE 243
YH E + YC++T EP T YY+FNGEDKEL + R RFI P+ S F + L SP +WE
Sbjct: 1015 VYHRENRVYCLITQKEEPMTKYYRFNGEDKELSEESRGERFIYPIGSLFEMVLISPETWE 1074
Query: 244 EIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDII 303
+P + WEHV K V + YEGT SGL+ Y+ +GTN+NYSED+T RG I ++DII
Sbjct: 1075 IVPDASIQFEPWEHVTAFKLVKLSYEGTRSGLKEYLCIGTNFNYSEDITSRGNIHIYDII 1134
Query: 304 EVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAF 363
EVVPEPG+P+TK K+K ++ KEQKGPV+AI V GFLVT +GQKIYIWQL+D DL G+AF
Sbjct: 1135 EVVPEPGKPMTKFKLKEVFKKEQKGPVSAISDVVGFLVTGLGQKIYIWQLRDGDLIGVAF 1194
Query: 364 IDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNP 423
IDT +Y+ +++VK+LI + D +SI+LLR+Q EYRTLSL +RD+ P + + N
Sbjct: 1195 IDTNIYVHQIITVKSLIFIADVYKSISLLRFQEEYRTLSLASRDFNPLEVFGIEFMVDN- 1253
Query: 424 SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQP 483
S++GF+++D ++N++++MYQP
Sbjct: 1254 ---------------------------------------SNLGFLVTDAERNIIVYMYQP 1274
Query: 484 EARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPG--ARSRFLTWYASLDGALG 541
EARES GG +L++K D+HLGQ VNT F+++C + ++ Y +LDGALG
Sbjct: 1275 EARESLGGQKLLRKADYHLGQVVNTMFRVQCHQRGLHQRQPFLYENKHFVIYGTLDGALG 1334
Query: 542 FFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWK 601
+ LPLPEK YRR LMLQNV++++ H GLNP+ +RT K NPSR IIDG L+W
Sbjct: 1335 YCLPLPEKVYRRFLMLQNVLLSYQDHLCGLNPKEYRTIKTVKKMGINPSRCIIDGDLIWS 1394
Query: 602 FLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
+ L+ ER E+ KKIG++ +IL +L +IE LS+ F
Sbjct: 1395 YRMLAHSERSEVAKKIGTRTEEILADLLEIERLSAIF 1431
>gi|355680843|gb|AER96659.1| cleavage and polyadenylation specific factor 1, 160kDa [Mustela
putorius furo]
Length = 1399
Score = 622 bits (1603), Expect = e-175, Method: Compositional matrix adjust.
Identities = 308/640 (48%), Positives = 414/640 (64%), Gaps = 60/640 (9%)
Query: 16 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGALKLRFKKL--------- 62
+V+E+L V+LG +RP LLV ELLIY+AF H +G LK+RFKK+
Sbjct: 802 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKVPHNINFREK 861
Query: 63 KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAH 122
K +++ + G R+++ RYF +I GY GVF+CGP P WL +T RG LR H
Sbjct: 862 KPKPSKKKAEGGGAEEGAAARGRVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRLH 921
Query: 123 PMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHF 182
PM IDGP+ + APFHNVNCPRGFLYFN + ELRISVLP +LSYDAPWPVRK+PL+CT H+
Sbjct: 922 PMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHY 981
Query: 183 LAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSW 242
+AYH+E+K Y + TST P T + GE+KE RD R++ P F + L SP SW
Sbjct: 982 VAYHVESKVYAVATSTNMPCTRIPRMTGEEKEFEAIERDDRYVHPQQEAFSIQLISPVSW 1041
Query: 243 EEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDI 302
E IP L EWEHV C+K VS+ E T+SGL+GY+A GT E+VTCRGRIL+ D+
Sbjct: 1042 EAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDV 1101
Query: 303 IEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIA 362
IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH G LV+A+GQKI++W L+ ++LTG+A
Sbjct: 1102 IEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMA 1161
Query: 363 FIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGN 422
FIDT++YI M+SVKN IL D +SI+LLRYQ E +TLSLV+RD KP + S + N
Sbjct: 1162 FIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDN 1221
Query: 423 PSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQ 482
+ +GF++SD+D+N++++MY
Sbjct: 1222 ----------------------------------------AQLGFLVSDRDRNLMVYMYL 1241
Query: 483 PEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARS-----RFLTWYASLD 537
PEA+ES GG RL+++ DFH+G HVNTF++ C+ + ++ P +S + +TW+A+LD
Sbjct: 1242 PEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGA--AEGPSKKSVVWENKHITWFATLD 1299
Query: 538 GALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGS 597
G +G LP+ EK YRRLLMLQN + T H GLNPRAFR N R ++DG
Sbjct: 1300 GGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRLLHADRRALQNAVRNVLDGE 1359
Query: 598 LVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSH 637
L+ ++L LS ER E+ KKIG+ + ILD+L + + +++H
Sbjct: 1360 LLNRYLYLSTMERGELAKKIGTTPDIILDDLLETDRVTAH 1399
>gi|195381337|ref|XP_002049409.1| GJ21566 [Drosophila virilis]
gi|194144206|gb|EDW60602.1| GJ21566 [Drosophila virilis]
Length = 1420
Score = 621 bits (1602), Expect = e-175, Method: Compositional matrix adjust.
Identities = 304/633 (48%), Positives = 413/633 (65%), Gaps = 55/633 (8%)
Query: 19 ELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGALKLRFKKLKVLFV----------- 67
EL V LG HG RPLLLVRT+ ELLIYQ FR+ KG LK+RF+KL+ L +
Sbjct: 830 ELCLVGLGQHGERPLLLVRTRLELLIYQVFRYAKGHLKIRFRKLEQLHLLDQQPTHIELD 889
Query: 68 SDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTID 127
D ++ A P+ V+ ++RYFSN+ G G+ +CG +P ++FLT+RGELR H + +
Sbjct: 890 GDEAEEAESYNMQPKYVQ--KLRYFSNVGGLAGIMVCGMNPVFVFLTARGELRIHRLLGN 947
Query: 128 GPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHL 187
V + A F+NVN P GFLYF+ EL+ISVLP++LSYDA WPVRKVPL+CTP L YH
Sbjct: 948 ADVRSFAAFNNVNIPHGFLYFDTTYELKISVLPSYLSYDAAWPVRKVPLRCTPRQLVYHR 1007
Query: 188 ETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQ 247
E + YC++T EP T YY+FNGEDKEL + R RFI P+ S F + L SP +WE +P
Sbjct: 1008 ENRVYCLITQKEEPMTKYYRFNGEDKELSEESRGERFIYPIGSLFEMVLISPETWEIVPD 1067
Query: 248 TNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVP 307
+ WEHV K V + YEGT SGL+ Y+ +GTN+NYSED+T RG I ++DIIEVVP
Sbjct: 1068 ASIQFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNYSEDITSRGNIHIYDIIEVVP 1127
Query: 308 EPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTE 367
EPG+P+TK K+K ++ KEQKGPV+AI V GFLVT +GQKIYIWQL+D DL G+AFIDT
Sbjct: 1128 EPGKPMTKFKLKEVFKKEQKGPVSAISDVVGFLVTGLGQKIYIWQLRDGDLIGVAFIDTN 1187
Query: 368 VYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGI 427
+Y+ +++VK+LI + D +SI+LLR+Q EYRTLSL +RD+ P + + N
Sbjct: 1188 IYVHQIITVKSLIFIADVYKSISLLRFQEEYRTLSLASRDFNPLEVFGIEFMVDN----- 1242
Query: 428 IDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARE 487
S++GF+++D ++N++++MYQPEARE
Sbjct: 1243 -----------------------------------SNLGFLVTDAERNLIVYMYQPEARE 1267
Query: 488 SNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPG--ARSRFLTWYASLDGALGFFLP 545
S GG +L++K D+HLGQ VNT F+++C + ++ L Y +LDGALG+ LP
Sbjct: 1268 SLGGQKLLRKADYHLGQVVNTMFRVQCHQRGLHHRQPFLYENKHLVIYGTLDGALGYCLP 1327
Query: 546 LPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQL 605
LPEK YRR LMLQNV++++ H GLNP+ +RT K NPSR IIDG L+W + L
Sbjct: 1328 LPEKVYRRFLMLQNVLLSYQDHLCGLNPKEYRTIKTVKKMGINPSRCIIDGDLIWSYRML 1387
Query: 606 SLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
+ ER E+ KKIG++ +IL ++ +IE LS+ F
Sbjct: 1388 AHSERSEVAKKIGTRTEEILADMLEIERLSAVF 1420
>gi|351713968|gb|EHB16887.1| Cleavage and polyadenylation specificity factor subunit 1
[Heterocephalus glaber]
Length = 1440
Score = 620 bits (1598), Expect = e-175, Method: Compositional matrix adjust.
Identities = 309/642 (48%), Positives = 413/642 (64%), Gaps = 63/642 (9%)
Query: 16 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGALKLRFKKL--------- 62
+V+E+L V+LG +RP LLV ELLIY+AF H +G LK+RFKK+
Sbjct: 843 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKVPHNINFREK 902
Query: 63 -KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRA 121
+ E+ RG R+++ RYF +I GY GVF+CGP P WL +T RG LR
Sbjct: 903 KPKPSKKKAEGGSTEEGSGVRG-RVARFRYFEDIYGYSGVFICGPSPHWLLVTGRG-LRL 960
Query: 122 HPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPH 181
HPM IDGP+ + APFHNVNCPRGFLYFN + ELRISVLP +LSYDAPWPVRK+PL+CT H
Sbjct: 961 HPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAH 1020
Query: 182 FLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFS 241
++AYH+E+K Y + TST+ P T + GE+KE RD R+I P F + L SP S
Sbjct: 1021 YVAYHVESKVYAVATSTSTPCTRIPRMTGEEKEFEAIERDDRYIHPQQEAFSIQLISPVS 1080
Query: 242 WEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFD 301
WE IP L EWEHV C+K VS+ E T+SGL+GY+A GT E+VTCRGR+ ++
Sbjct: 1081 WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRVRDWE 1140
Query: 302 IIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGI 361
IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH G LV+A+GQKI++W L+ ++LTG+
Sbjct: 1141 RIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGM 1200
Query: 362 AFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAG 421
AFIDT++YI M+SVKN IL D +SI+LLRYQ E +TLSLV+RD KP + S +
Sbjct: 1201 AFIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVD 1260
Query: 422 NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMY 481
N + +GF++SD+D+N++++MY
Sbjct: 1261 N----------------------------------------AQLGFLVSDRDRNLMVYMY 1280
Query: 482 QPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARS-----RFLTWYASL 536
PEA+ES GG RL+++ DFH+G HVNTF++ C+ + S+ P +S + +TW+A+L
Sbjct: 1281 LPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGA--SEGPSKKSVVWENKHITWFATL 1338
Query: 537 DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDG 596
DG +G LP+ EK YRRLLMLQN + T H GLNPRAFR N R ++DG
Sbjct: 1339 DGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDG 1398
Query: 597 SLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
L+ ++L LS ER E+ KKIG+ + ILD+L + + +++HF
Sbjct: 1399 ELLNRYLYLSTMERGELAKKIGTTPDIILDDLLETDRVTAHF 1440
>gi|440904368|gb|ELR54893.1| Cleavage and polyadenylation specificity factor subunit 1, partial
[Bos grunniens mutus]
Length = 1417
Score = 619 bits (1595), Expect = e-174, Method: Compositional matrix adjust.
Identities = 305/623 (48%), Positives = 402/623 (64%), Gaps = 62/623 (9%)
Query: 16 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGALKLRFKKL--------- 62
+V+E+L V+LG RP LLV ELLIY+AF H +G LK+RFKK+
Sbjct: 836 LVKEVLLVALGSRQRRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKVPHNINFREK 895
Query: 63 -KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRA 121
+ E+ PRG R+++ RYF +I GY GVF+CGP P WL +T RG LR
Sbjct: 896 KPKPSKKKAEGGSTEEGTGPRG-RVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRL 954
Query: 122 HPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPH 181
HPM IDGP+ + APFHN+NCPRGFLYFN + ELRISVLP +LSYDAPWPVRK+PL+CT H
Sbjct: 955 HPMGIDGPIDSFAPFHNINCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAH 1014
Query: 182 FLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFS 241
++AYH+E+K Y + TST+ P T + GE+KE T RD R++ P F + L SP S
Sbjct: 1015 YVAYHVESKVYAVATSTSTPCTRVPRMTGEEKEFETIERDERYVHPQQEAFCIQLISPVS 1074
Query: 242 WEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFD 301
WE IP L EWEHV C+K VS+ E T+SGL+GY+A GT E+VTCRGRIL+ D
Sbjct: 1075 WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMD 1134
Query: 302 IIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGI 361
+IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH G LV+A+GQKI++W L+ ++LTG+
Sbjct: 1135 VIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGM 1194
Query: 362 AFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAG 421
AFIDT++YI M+SVKN IL D +SI+LLRYQ E +TLSLV+RD KP + S +
Sbjct: 1195 AFIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVD 1254
Query: 422 NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMY 481
N + +GF++SD+D+N++++MY
Sbjct: 1255 N----------------------------------------AQLGFLVSDRDRNLMVYMY 1274
Query: 482 QPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARS-----RFLTWYASL 536
PEA+ES GG RL+++ DFH+G HVNTF++ C+ ++ P +S + +TW+A+L
Sbjct: 1275 LPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCR--GAAEGPSKKSVVWENKHITWFATL 1332
Query: 537 DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDG 596
DG +G LP+ EK YRRLLMLQN + T H GLNPRAFR N R ++DG
Sbjct: 1333 DGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRVLQNAVRNVLDG 1392
Query: 597 SLVWKFLQLSLGERLEICKKIGS 619
L+ ++L LS ER E+ KKIG+
Sbjct: 1393 ELLNRYLYLSPMERGELAKKIGT 1415
>gi|312380158|gb|EFR26239.1| hypothetical protein AND_07834 [Anopheles darlingi]
Length = 1503
Score = 618 bits (1594), Expect = e-174, Method: Compositional matrix adjust.
Identities = 304/645 (47%), Positives = 419/645 (64%), Gaps = 70/645 (10%)
Query: 18 QELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGALKLRFKKLKV-----LFVSDRSK 72
+E+L V+LG +G+RP+L +R + +LLIY+ FR+ KG LKLRFK+L F + ++
Sbjct: 845 KEILMVALGSYGSRPILFIRLEQDLLIYRVFRYAKGHLKLRFKRLTSSVTCPAFRTVPAR 904
Query: 73 RAN--EQPGL--------PRGV------------RISQMRYFSNIAGYQGVFLCGPHPAW 110
AN ++P P G IS +RYF N++GY GV +CG P +
Sbjct: 905 LANLPDKPATGATTDATEPNGKDTQEHATKVQYENISMIRYFGNVSGYAGVAVCGEKPYF 964
Query: 111 LFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWP 170
LFLT+ GELR+H + + APF+NVNCP GFLYF+ + +L+IS+LPT+LSYD+ WP
Sbjct: 965 LFLTAHGELRSHRLYARTVMKAFAPFNNVNCPNGFLYFDEQYQLKISILPTYLSYDSVWP 1024
Query: 171 VRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVS 230
VRK+PL+ +P + YH E + YC+V E YY+FNGEDKEL + + RF+ P+
Sbjct: 1025 VRKIPLRSSPKQIVYHRENRVYCVVMDAEEICNKYYRFNGEDKELTEENKGERFLYPMGH 1084
Query: 231 QFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSED 290
QF V L +P +WE +P T L EWEHV+ LKNVS+ YEG SGL+ YIA+GTN+NYSED
Sbjct: 1085 QFSVVLVNPAAWEIVPDTAIALEEWEHVVSLKNVSLAYEGARSGLKEYIAVGTNFNYSED 1144
Query: 291 VTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYI 350
+T RGR+LL+DIIEVVPEPG+PLTK+K K + K+QKGPV+AI HV GFLV AVGQK+Y+
Sbjct: 1145 ITSRGRLLLYDIIEVVPEPGKPLTKHKFKEVIVKDQKGPVSAISHVCGFLVGAVGQKVYL 1204
Query: 351 WQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKP 410
WQ+KD+DL G+AFIDT +++ MVS+K+LILV D +S++LLR+Q E+RTLSLV+RDY P
Sbjct: 1205 WQMKDDDLVGVAFIDTNIFVHQMVSIKSLILVADVYKSVSLLRFQDEFRTLSLVSRDYHP 1264
Query: 411 TQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMIS 470
Y N +++GF+++
Sbjct: 1265 LNVYQVEYVVDN----------------------------------------TNLGFLVA 1284
Query: 471 DKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC---KPSSISDAPGARS 527
D N++ +MYQPE+RES GG RL++K D+HLGQ VN F+++C + + +
Sbjct: 1285 DDQANLITYMYQPESRESFGGQRLLRKGDYHLGQRVNAMFRVQCDFHESDVMRRTLNYDN 1344
Query: 528 RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAG 587
+ T++A+LDG GF LPLPEK YRRL MLQNV++TH+ HT GLNP+A+RT K
Sbjct: 1345 KHTTFFATLDGGFGFVLPLPEKTYRRLFMLQNVLLTHSPHTCGLNPKAYRTIKQSRALPI 1404
Query: 588 NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIE 632
NPSR ++DG LVW FL+L E+ E+ KKIG++ +I +L +IE
Sbjct: 1405 NPSRCVVDGDLVWSFLELPANEKQEVAKKIGTRIEEICADLMEIE 1449
>gi|194756960|ref|XP_001960738.1| GF11349 [Drosophila ananassae]
gi|190622036|gb|EDV37560.1| GF11349 [Drosophila ananassae]
Length = 1455
Score = 618 bits (1594), Expect = e-174, Method: Compositional matrix adjust.
Identities = 302/649 (46%), Positives = 419/649 (64%), Gaps = 52/649 (8%)
Query: 2 GNFRSHSPSAMDETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGALKLRFKK 61
G ++ P + + EL + LGL+G RPLLLVRT+ ELLIYQ FR+PKG LK+RF+K
Sbjct: 847 GIVQACMPQHANSPLPLELTVLGLGLNGERPLLLVRTRVELLIYQVFRYPKGHLKIRFRK 906
Query: 62 LKVLFVSD----------RSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWL 111
L+ L + D +R + + + ++R F+N+ G G+ +CG +P ++
Sbjct: 907 LEQLNLMDHQPSHIELDENDEREEMESYQMQPKYVQKLRPFANVGGLSGIMVCGVNPCFV 966
Query: 112 FLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPV 171
FLTSRGELR H + +G V + A F+NVN P GFLYF+ EL+ISVLP++LSYD+ WP+
Sbjct: 967 FLTSRGELRIHRLLGNGDVRSFAAFNNVNIPNGFLYFDTTFELKISVLPSYLSYDSTWPI 1026
Query: 172 RKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQ 231
RKVPL+CTP L YH E + YC++T EP T +Y+FNGEDKEL + R RFI P+ SQ
Sbjct: 1027 RKVPLRCTPRQLVYHRENRVYCLITQNEEPMTKFYRFNGEDKELSEESRGERFIYPIGSQ 1086
Query: 232 FHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDV 291
F + L SP +WE +P + WEHV K V + YEGT SGL+ Y+ +GTN+NYSED+
Sbjct: 1087 FEMVLISPETWEIVPDASIRFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNYSEDI 1146
Query: 292 TCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIW 351
T RG I ++DIIEVVPEPG+P+TK K+K ++ KEQKGPV+AI V GFLVT +GQKIYIW
Sbjct: 1147 TSRGNIHIYDIIEVVPEPGKPMTKFKLKEVFKKEQKGPVSAISDVLGFLVTGLGQKIYIW 1206
Query: 352 QLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPT 411
QL+D DL G+AFIDT +Y+ +++VK+LI + D +SI+LLR+Q EYRTLSL +RD+ P
Sbjct: 1207 QLRDGDLIGVAFIDTNIYVHQIITVKSLIFIADVYKSISLLRFQEEYRTLSLASRDFNPL 1266
Query: 412 QPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISD 471
+ + N S++GF+++D
Sbjct: 1267 EVYGIEFMVDN----------------------------------------SNLGFLVTD 1286
Query: 472 KDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPG--ARSRF 529
++N++++MYQPEARES GG +L++K D+HLGQ VNT F+++C + ++
Sbjct: 1287 AERNLIVYMYQPEARESLGGQKLLRKADYHLGQVVNTMFRVQCHQKGLHQRQPFLYENKH 1346
Query: 530 LTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNP 589
Y +LDGALG+ LPLPEK YRR LMLQNV++++ H GLNP+ +RT K NP
Sbjct: 1347 FVVYGTLDGALGYCLPLPEKLYRRFLMLQNVLLSYQEHLCGLNPKEYRTIKAVKKQGINP 1406
Query: 590 SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
SR IIDG L+W + L+ ER E+ KKIG++ +IL +L +IE L+S F
Sbjct: 1407 SRCIIDGDLIWSYRLLANSERNEVAKKIGTRTEEILSDLLEIERLASVF 1455
>gi|354491126|ref|XP_003507707.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
isoform 3 [Cricetulus griseus]
Length = 1449
Score = 618 bits (1594), Expect = e-174, Method: Compositional matrix adjust.
Identities = 304/622 (48%), Positives = 403/622 (64%), Gaps = 60/622 (9%)
Query: 16 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGALKLRFKKL--------- 62
+V+E+L V+LG +RP LLV ELLIY+AF H +G LK+RFKK+
Sbjct: 843 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKVPHNINFREK 902
Query: 63 KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAH 122
K +++ + + G R+++ RYF +I GY GVF+CGP P WL +T RG LR H
Sbjct: 903 KPKPSKKKAEGCSTEEGSGVRGRVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRLH 962
Query: 123 PMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHF 182
PM IDGP+ + APFHNVNCPRGFLYFN + ELRISVLP +LSYDAPWPVRK+PL+CT H+
Sbjct: 963 PMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHY 1022
Query: 183 LAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSW 242
+AYH+E+K Y + TST P T + GE+KE RD R+I P F + L SP SW
Sbjct: 1023 VAYHVESKVYAVATSTNTPCTRIPRMTGEEKEFEAIERDDRYIHPQQEAFSIQLISPVSW 1082
Query: 243 EEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDI 302
E IP L EWEHV C+K VS+ E T+SGL+GY+A GT E+VTCRGRIL+ D+
Sbjct: 1083 EAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDV 1142
Query: 303 IEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIA 362
IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH G LV+A+GQKI++W L+ ++LTG+A
Sbjct: 1143 IEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMA 1202
Query: 363 FIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGN 422
FIDT++YI M+SVKN IL D +SI+LLRYQ E +TLSLV+RD KP + S + N
Sbjct: 1203 FIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDN 1262
Query: 423 PSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQ 482
+ +GF++SD+D+N++++MY
Sbjct: 1263 ----------------------------------------AQLGFLVSDRDRNLMVYMYL 1282
Query: 483 PEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARS-----RFLTWYASLD 537
PEA+ES GG RL+++ DFH+G HVNTF++ C+ + ++ P +S + +TW+A+LD
Sbjct: 1283 PEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGA--AEGPSKKSVMWENKHITWFATLD 1340
Query: 538 GALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGS 597
G +G LP+ EK YRRLLMLQN + T H GLNPRAFR N R ++DG
Sbjct: 1341 GGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGE 1400
Query: 598 LVWKFLQLSLGERLEICKKIGS 619
L+ ++L LS ER E+ KKIG+
Sbjct: 1401 LLNRYLYLSTMERSELAKKIGT 1422
>gi|195455711|ref|XP_002074834.1| GK23274 [Drosophila willistoni]
gi|194170919|gb|EDW85820.1| GK23274 [Drosophila willistoni]
Length = 1463
Score = 617 bits (1592), Expect = e-174, Method: Compositional matrix adjust.
Identities = 306/654 (46%), Positives = 418/654 (63%), Gaps = 62/654 (9%)
Query: 2 GNFRSHSPSAMDETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGALKLRFKK 61
G +S P + + EL V LGL+G RPLLLVRT+ ELLIYQ FR+PKG LK+RF+K
Sbjct: 855 GIVQSCMPQHANSPLPLELSLVGLGLNGERPLLLVRTRLELLIYQVFRYPKGHLKIRFRK 914
Query: 62 LKVLFVSDRS---------------KRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGP 106
+ L + D+ + N QP + ++R F+N+ G GV +CG
Sbjct: 915 MDQLNLLDQQPTHVNLDDNEENEELESYNMQPKY-----VQKLRPFNNVGGMSGVMICGV 969
Query: 107 HPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYD 166
+P +LFLTSRGELR H + +G V + A F+N+N P GFL+F+ EL+ISVLP++LSYD
Sbjct: 970 NPCFLFLTSRGELRIHRLLGNGEVRSFAAFNNINIPNGFLFFDTTFELKISVLPSYLSYD 1029
Query: 167 APWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIP 226
+ WPVRKVPL+CTP L YH E + YC++T T EP T +Y+FNGEDKEL + R RFI
Sbjct: 1030 STWPVRKVPLRCTPRQLVYHRENRVYCLITQTEEPMTKFYRFNGEDKELSEESRGERFIY 1089
Query: 227 PLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYN 286
P+ SQF + L SP +WE +P + WEHV K V + YEGT SGL+ Y+ +GTN+N
Sbjct: 1090 PIGSQFDMVLISPETWEIVPDASIRFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFN 1149
Query: 287 YSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQ 346
YSED+T RG I ++DIIEVVPEPG+P+TK K+K ++ KEQKGPV+AI V GFLVT +GQ
Sbjct: 1150 YSEDITSRGNIHIYDIIEVVPEPGKPMTKFKLKEVFKKEQKGPVSAISDVLGFLVTGLGQ 1209
Query: 347 KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVAR 406
KIYIWQL+D DL G+AFIDT +Y+ +++VK+LI + D +SI+LLR+Q EYRTLSL +R
Sbjct: 1210 KIYIWQLRDGDLIGVAFIDTNIYVHQIITVKSLIFIADVYKSISLLRFQEEYRTLSLASR 1269
Query: 407 DYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMG 466
D+ P + + N +++G
Sbjct: 1270 DFNPLEVYGIEFMVDN----------------------------------------TNLG 1289
Query: 467 FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPG-- 524
F+++D + N++++MYQPEARES GG +L++K D+HLGQ VNT F+++C +
Sbjct: 1290 FLVTDAESNLIVYMYQPEARESLGGQKLLRKADYHLGQVVNTMFRVQCHQRGLHQRQPFL 1349
Query: 525 ARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGY 584
++ Y +LDGALG+ LPLPEK YRR LMLQNV++++ H GLNP+ +RT K
Sbjct: 1350 YENKHFVVYGTLDGALGYCLPLPEKVYRRFLMLQNVLLSYQDHLCGLNPKEYRTLKSSKR 1409
Query: 585 YAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
NPSR IIDG L+W + L+ ER E+ KKIG++ +IL +L +IE LS F
Sbjct: 1410 LGINPSRCIIDGDLIWSYRLLANSERNEVAKKIGTRTEEILADLLEIERLSGVF 1463
>gi|296227035|ref|XP_002807684.1| PREDICTED: LOW QUALITY PROTEIN: cleavage and polyadenylation
specificity factor subunit 1 [Callithrix jacchus]
Length = 1394
Score = 617 bits (1591), Expect = e-174, Method: Compositional matrix adjust.
Identities = 305/640 (47%), Positives = 409/640 (63%), Gaps = 58/640 (9%)
Query: 16 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRH----PKGALKLRFKKL--------- 62
+V+E+L V+LG +RP LLV ELLIY+AF H +G LK+RFKK+
Sbjct: 796 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLSQGNLKVRFKKVPHNINFREK 855
Query: 63 -KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRA 121
+ E+ RG R+++ RYF +I GY GVF+CGP P WL +T RG LR
Sbjct: 856 KPKPSKKKAEGGSTEEGAGARG-RVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRL 914
Query: 122 HPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPH 181
HPM IDGPV + APFHN+NCPRGFLYFN + ELRISVLP +LSYDAPWPVRK+PL+CT H
Sbjct: 915 HPMGIDGPVDSFAPFHNINCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAH 974
Query: 182 FLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFS 241
++AYH+E+K Y + TST P T + GE+KE T RD R+I P F + L SP S
Sbjct: 975 YVAYHVESKVYAVATSTNTPCTRIPRMTGEEKEFETIERDERYIHPQQEAFSIQLISPVS 1034
Query: 242 WEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFD 301
WE IP L EWEHV C+K VS+ E T+SGL+GY+A GT E+VTCRGRIL+ D
Sbjct: 1035 WEAIPNARIELQEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMD 1094
Query: 302 IIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGI 361
+IEVV EP Q LT K K++Y KEQKGPVTA+CH G LV+A+GQKI++W L+ ++LTG+
Sbjct: 1095 VIEVVTEPRQTLTXXKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGM 1154
Query: 362 AFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAG 421
AFIDT++YI M+SVKN IL D +SI+LLRYQ E +TLSLV+RD KP + S +
Sbjct: 1155 AFIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVD 1214
Query: 422 NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMY 481
N + +GF++SD+D+N++++MY
Sbjct: 1215 N----------------------------------------AQLGFLVSDRDRNLMVYMY 1234
Query: 482 QPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGA---RSRFLTWYASLDG 538
PEA+ES GG RL+++ DFH+G HVNTF++ C+ ++ + + ++ +TW+A+LDG
Sbjct: 1235 LPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGATEGLSKKSVMWENKHITWFATLDG 1294
Query: 539 ALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSL 598
+G LP+ EK YRRLLMLQN + T H GLNPRAFR N R ++DG L
Sbjct: 1295 GIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRTLQNAVRNVLDGEL 1354
Query: 599 VWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
+ ++L LS ER E+ KKIG+ + ILD+L + + +++HF
Sbjct: 1355 LNRYLYLSTMERSELAKKIGTTPDIILDDLLETDRVTAHF 1394
>gi|255918233|ref|NP_001157645.1| cleavage and polyadenylation specificity factor subunit 1 isoform 1
[Mus musculus]
Length = 1450
Score = 617 bits (1590), Expect = e-174, Method: Compositional matrix adjust.
Identities = 307/637 (48%), Positives = 407/637 (63%), Gaps = 61/637 (9%)
Query: 2 GNFRSHSPSAMDET-IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGALK 56
G R + E +V+E+L V+LG +RP LLV ELLIY+AF H +G LK
Sbjct: 828 GEVRKEEATRQGELPLVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLK 887
Query: 57 LRFKKL---------KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPH 107
+RFKK+ K +++ + + G R+++ RYF +I GY GVF+CGP
Sbjct: 888 VRFKKVPHNINFREKKPKPSKKKAEGCSTEEGSGGRGRVARFRYFEDIYGYSGVFICGPS 947
Query: 108 PAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDA 167
P WL +T RG LR HPM IDGP+ + APFHNVNCPRGFLYFN + ELRISVLP +LSYDA
Sbjct: 948 PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1007
Query: 168 PWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPP 227
PWPVRK+PL+CT H++AYH+E+K Y + TST P T + GE+KE RD R+I P
Sbjct: 1008 PWPVRKIPLRCTAHYVAYHVESKVYAVATSTNTPCTRIPRMTGEEKEFEAIERDDRYIHP 1067
Query: 228 LVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNY 287
F + L SP SWE IP L EWEHV C+K VS+ E T+SGL+GY+A GT
Sbjct: 1068 QQEAFSIQLISPVSWEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQ 1127
Query: 288 SEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQK 347
E+VTCRGRIL+ D+IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH G LV+A+GQK
Sbjct: 1128 GEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQK 1187
Query: 348 IYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARD 407
I++W L+ ++LTG+AFIDT++YI M+SVKN IL D +SI+LLRYQ E +TLSLV+RD
Sbjct: 1188 IFLWSLRASELTGMAFIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRD 1247
Query: 408 YKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGF 467
KP + S + N + +GF
Sbjct: 1248 AKPLEVYSVDFMVDN----------------------------------------AQLGF 1267
Query: 468 MISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARS 527
++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF++ C+ + ++ P +S
Sbjct: 1268 LVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGA--AEGPSKKS 1325
Query: 528 -----RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGK 582
+ +TW+A+LDG +G LP+ EK YRRLLMLQN + T H GLNPRAFR
Sbjct: 1326 VVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVD 1385
Query: 583 GYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGS 619
N R ++DG L+ ++L LS ER E+ KKIG+
Sbjct: 1386 RRILQNAVRNVLDGELLNRYLYLSTMERSELAKKIGT 1422
>gi|47217773|emb|CAG05995.1| unnamed protein product [Tetraodon nigroviridis]
Length = 1446
Score = 613 bits (1582), Expect = e-173, Method: Compositional matrix adjust.
Identities = 313/672 (46%), Positives = 429/672 (63%), Gaps = 89/672 (13%)
Query: 16 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAF----RHPKGALKLRFKKL--------- 62
+V+E+ VSLG + +RP LLV + ELL+Y+AF + P+ LK+RFKK+
Sbjct: 815 LVKEVTLVSLGYNHSRPYLLVHVEQELLVYEAFPYDQQQPQNNLKVRFKKVPHNINFREK 874
Query: 63 -KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRA 121
L +++ A + G+ RIS+ RYF +I+GY GVF+CGP P W+ +TSRG LR
Sbjct: 875 KSKLRKDKKAEGAAAEDGVAARGRISRFRYFEDISGYSGVFICGPSPHWMLVTSRGALRL 934
Query: 122 HPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPH 181
HPMTIDGP+ + +PFHN+NCP+GFLYFN + ELRISVLPT+LSYDAPWPVRK+PL+CT H
Sbjct: 935 HPMTIDGPIESFSPFHNINCPKGFLYFNKQGELRISVLPTYLSYDAPWPVRKIPLRCTVH 994
Query: 182 FLAYHLETKT-------YCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHV 234
+++YH+E+K Y + TS E T + GE+KE T RD R+I P +F +
Sbjct: 995 YVSYHVESKASLSHCCVYAVCTSVKELCTRIPRMTGEEKEYETIERDERYINPQQDKFSI 1054
Query: 235 SLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCR 294
L SP SWE IP T L EWE+V C+K V++ + T+SGL+GYIA GT E+VTCR
Sbjct: 1055 QLISPVSWEAIPNTRIDLEEWEYVTCMKTVALRSQETVSGLKGYIAAGTCLMQGEEVTCR 1114
Query: 295 GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLK 354
GRIL+ D+IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH G+LV+A+GQKI++W LK
Sbjct: 1115 GRILILDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGYLVSAIGQKIFLWVLK 1174
Query: 355 DNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPN 414
DNDLTG+AFIDT++YI M+S+KN IL D +S++LLRYQ E +TLSLV+RD KP +
Sbjct: 1175 DNDLTGMAFIDTQLYIHQMMSIKNFILAADLMKSVSLLRYQEESKTLSLVSRDAKPLEVY 1234
Query: 415 SKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDK 474
S + N S +GF++SD+DK
Sbjct: 1235 SIEFMVDN----------------------------------------SQLGFLVSDRDK 1254
Query: 475 NVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARS---RFLT 531
N+ ++MY PEA+ES GG RL+++ DF+ G ++NTF+++ C+ + + + A + + +T
Sbjct: 1255 NLYVYMYLPEAKESFGGMRLLRRADFNAGANINTFWRMPCRGALEAGSRKAMTWDNKHIT 1314
Query: 532 WYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAG---- 587
W+A+LDG +G LP+ EK YRRLLMLQN + T SH GLNP+AFR A
Sbjct: 1315 WFATLDGGVGLLLPMQEKTYRRLLMLQNALTTMLSHHAGLNPKAFRCVGADRTSAAMLSG 1374
Query: 588 ---------------------NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILD 626
NP + I+DG L+ K+L LS+ ER E+ KKIG+ + ILD
Sbjct: 1375 MLPDFATSVSRMLHCDRRSLQNPVKNILDGELLNKYLYLSMMERSELAKKIGTTQDIILD 1434
Query: 627 ELYDIEALSSHF 638
+L DI+ +++HF
Sbjct: 1435 DLLDIDRVTAHF 1446
>gi|198457226|ref|XP_001360595.2| GA10080 [Drosophila pseudoobscura pseudoobscura]
gi|198135905|gb|EAL25170.2| GA10080 [Drosophila pseudoobscura pseudoobscura]
Length = 1459
Score = 613 bits (1581), Expect = e-173, Method: Compositional matrix adjust.
Identities = 306/655 (46%), Positives = 418/655 (63%), Gaps = 62/655 (9%)
Query: 1 MGNFRSHSPSAMDETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGALKLRFK 60
+G +S P + + EL V LGL+G RP+L+VRT+ ELLIYQ FR+PKG LK+RF+
Sbjct: 850 VGIVQSCMPQHANSPLPLELSLVGLGLNGERPVLMVRTRVELLIYQVFRYPKGNLKIRFR 909
Query: 61 KLKVLFVSDRS---------------KRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCG 105
KL+ L + D+ + N QP + ++R FSN+ G G+ +CG
Sbjct: 910 KLEQLNLLDQQPSHIELEENDEEEELESYNMQPKY-----VQKLRPFSNVGGLAGIMVCG 964
Query: 106 PHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSY 165
+P ++FLT+RGELR H + +G V + A F+NVN P GFLYF+ EL+ISVLP++LSY
Sbjct: 965 VNPCFVFLTARGELRIHRLQGNGDVRSFAAFNNVNIPNGFLYFDTTFELKISVLPSYLSY 1024
Query: 166 DAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFI 225
D+ WPVRKVPL+CTP L YH E + YC++T T EP T YY+FNGEDKEL + R RFI
Sbjct: 1025 DSVWPVRKVPLRCTPRQLVYHRENRVYCLITQTEEPMTKYYRFNGEDKELSEESRGERFI 1084
Query: 226 PPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNY 285
P SQF + L SP +WE +P + WEHV K V + YEGT SGL+ Y+ +GTN+
Sbjct: 1085 YPNGSQFEMVLISPETWEIVPDASIRFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNF 1144
Query: 286 NYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVG 345
NYSED+T RG I ++DIIEVVPEPG+P+TK K+K ++ KEQKGPV+AI V GFLVT +G
Sbjct: 1145 NYSEDITSRGNIHIYDIIEVVPEPGKPMTKFKLKEVFKKEQKGPVSAISDVLGFLVTGLG 1204
Query: 346 QKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVA 405
QKIYIWQL+D DL G+AFIDT +Y+ +++VK+LI + D +SI+LLR+Q E+RTLSL +
Sbjct: 1205 QKIYIWQLRDGDLIGVAFIDTNIYVHQIITVKSLIFIADVYKSISLLRFQEEHRTLSLAS 1264
Query: 406 RDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSM 465
RD+ P + + N S++
Sbjct: 1265 RDFNPLEVYGIEFMVDN----------------------------------------SNL 1284
Query: 466 GFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPG- 524
GF+++D ++N++++MYQPEARES GG +LI+K D+HLGQ VNT F+++C +
Sbjct: 1285 GFLVTDAERNLIVYMYQPEARESLGGQKLIRKADYHLGQVVNTMFRVQCHQRGVHQRQPF 1344
Query: 525 -ARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKG 583
++ Y +LDG LG+ LPLPEK YRR LMLQNV++++ H GLNP+ FRT K
Sbjct: 1345 LYENKHFVVYGTLDGGLGYCLPLPEKVYRRFLMLQNVLLSYQDHLCGLNPKEFRTLKSFK 1404
Query: 584 YYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
NPSR IIDG L+W + L +R E+ KKIG++ +IL +L +IE LS F
Sbjct: 1405 KQGLNPSRCIIDGDLIWSYRLLPNSDRNEVAKKIGTRTEEILSDLLEIERLSGVF 1459
>gi|195150431|ref|XP_002016158.1| GL10645 [Drosophila persimilis]
gi|194110005|gb|EDW32048.1| GL10645 [Drosophila persimilis]
Length = 1459
Score = 613 bits (1581), Expect = e-173, Method: Compositional matrix adjust.
Identities = 306/655 (46%), Positives = 418/655 (63%), Gaps = 62/655 (9%)
Query: 1 MGNFRSHSPSAMDETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGALKLRFK 60
+G +S P + + EL V LGL+G RP+L+VRT+ ELLIYQ FR+PKG LK+RF+
Sbjct: 850 VGIVQSCMPQHANSPLPLELSLVGLGLNGERPVLMVRTRVELLIYQVFRYPKGNLKIRFR 909
Query: 61 KLKVLFVSDRS---------------KRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCG 105
KL+ L + D+ + N QP + ++R FSN+ G G+ +CG
Sbjct: 910 KLEQLNLLDQQPSHIELEENDEEEELESYNMQPKY-----VQKLRPFSNVGGLAGIMVCG 964
Query: 106 PHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSY 165
+P ++FLT+RGELR H + +G V + A F+NVN P GFLYF+ EL+ISVLP++LSY
Sbjct: 965 VNPCFVFLTARGELRIHRLQGNGDVRSFAAFNNVNIPNGFLYFDTTFELKISVLPSYLSY 1024
Query: 166 DAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFI 225
D+ WPVRKVPL+CTP L YH E + YC++T T EP T YY+FNGEDKEL + R RFI
Sbjct: 1025 DSVWPVRKVPLRCTPRQLVYHRENRVYCLITQTEEPMTKYYRFNGEDKELSEESRGERFI 1084
Query: 226 PPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNY 285
P SQF + L SP +WE +P + WEHV K V + YEGT SGL+ Y+ +GTN+
Sbjct: 1085 YPNGSQFEMVLISPETWEIVPDASIRFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNF 1144
Query: 286 NYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVG 345
NYSED+T RG I ++DIIEVVPEPG+P+TK K+K ++ KEQKGPV+AI V GFLVT +G
Sbjct: 1145 NYSEDITSRGNIHIYDIIEVVPEPGKPMTKFKLKEVFKKEQKGPVSAISDVLGFLVTGLG 1204
Query: 346 QKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVA 405
QKIYIWQL+D DL G+AFIDT +Y+ +++VK+LI + D +SI+LLR+Q E+RTLSL +
Sbjct: 1205 QKIYIWQLRDGDLIGVAFIDTNIYVHQIITVKSLIFIADVYKSISLLRFQEEHRTLSLAS 1264
Query: 406 RDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSM 465
RD+ P + + N S++
Sbjct: 1265 RDFNPLEVYGIEFMVDN----------------------------------------SNL 1284
Query: 466 GFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPG- 524
GF+++D ++N++++MYQPEARES GG +LI+K D+HLGQ VNT F+++C +
Sbjct: 1285 GFLVTDAERNLIVYMYQPEARESLGGQKLIRKADYHLGQVVNTMFRVQCHQRGVHQRQPF 1344
Query: 525 -ARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKG 583
++ Y +LDG LG+ LPLPEK YRR LMLQNV++++ H GLNP+ FRT K
Sbjct: 1345 LYENKHFVVYGTLDGGLGYCLPLPEKVYRRFLMLQNVLLSYQDHLCGLNPKEFRTLKSFK 1404
Query: 584 YYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
NPSR IIDG L+W + L +R E+ KKIG++ +IL +L +IE LS F
Sbjct: 1405 KQGLNPSRCIIDGDLIWSYRLLPNSDRNEVAKKIGTRTEEILSDLLEIERLSGVF 1459
>gi|443684051|gb|ELT88095.1| hypothetical protein CAPTEDRAFT_161045 [Capitella teleta]
Length = 1410
Score = 613 bits (1581), Expect = e-173, Method: Compositional matrix adjust.
Identities = 294/648 (45%), Positives = 424/648 (65%), Gaps = 59/648 (9%)
Query: 2 GNFRSHSPSAMDETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRH----PKGALKL 57
+F + S + V E++ G++G++PLL+ R EL IY+ F H KG L++
Sbjct: 811 ASFVAPERSTQEVPFVHEVMLHGFGVNGSQPLLMARVHDELYIYKVFSHVGSKAKGRLQV 870
Query: 58 RFKKLK---VLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLT 114
RFK+ ++ DR ++ E +R F++I+GY GVF+CG +P WL +T
Sbjct: 871 RFKRRSHGLIIRPRDREEKIPENK--------KWLRPFTDISGYSGVFICGSYPHWLIMT 922
Query: 115 SRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKV 174
RG LR HPM IDG + FHNVNCP+GFLYF++ ELRI VLPTHLSYDAPWPVRKV
Sbjct: 923 QRGTLRGHPMAIDGTIPCFTAFHNVNCPKGFLYFSSNEELRICVLPTHLSYDAPWPVRKV 982
Query: 175 PLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGE-DKELVTDPRDSRFIPPLVSQFH 233
PL+CTPHF+ YH ++KTY +V+S P T + G+ +KE+ +D RF+ P++++F+
Sbjct: 983 PLRCTPHFVVYHPDSKTYSVVSSQQVPCTQLVRVAGDGEKEIEAVQKDDRFVFPIMNKFN 1042
Query: 234 VSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTC 293
+ LFSP SWE IP T F L EWEHV+C+K ++++ EGTLSGL+GY+ +GTN NY+EDV+
Sbjct: 1043 IQLFSPVSWEPIPNTRFDLEEWEHVMCIKTINLKSEGTLSGLKGYVVVGTNLNYNEDVSS 1102
Query: 294 RGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQL 353
RG++ ++D+I+VVPEPGQPLTKNKIK++Y KEQKGPVTA+ V GFLVTA+GQK+YIWQL
Sbjct: 1103 RGKLTIYDVIDVVPEPGQPLTKNKIKVVYNKEQKGPVTALDGVQGFLVTAIGQKVYIWQL 1162
Query: 354 KDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQP 413
KDNDL GIAFIDT++YI M ++KNLI++GD +SI++LRYQ + + LSLV++D +P
Sbjct: 1163 KDNDLAGIAFIDTQIYIHKMEALKNLIIIGDVCKSISVLRYQEDMKVLSLVSKDVRPLAV 1222
Query: 414 NSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKD 473
Y ++DE +S+ F+++DK
Sbjct: 1223 YGVAY---------------------------------------LVDE-TSLAFIVADKL 1242
Query: 474 KNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARS---RFL 530
KN +++ YQP+ +S GG RLI+K D ++G VN FF+++C+ S S + +S + +
Sbjct: 1243 KNFLVYCYQPDLVQSQGGQRLIRKADINIGSLVNAFFRVKCRVSDPSTSKTDQSLAMKHI 1302
Query: 531 TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPS 590
T+Y +LDG++G+ LP+ E YRRL MLQ +++ T GLNP+A+RT + + N
Sbjct: 1303 TYYVTLDGSIGYLLPISESLYRRLYMLQKMLIQQVQQTAGLNPKAYRTCQTEFRQLINIQ 1362
Query: 591 RGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
R IIDG L WK+L L+ +R E+ K+IG+ + I D+L +I+ + HF
Sbjct: 1363 RNIIDGDLAWKYLALTSHDRAEMAKRIGTTSHQIEDDLLEIDRCTCHF 1410
>gi|9794908|gb|AAF98388.1| cleavage and polyadenylation specificity factor [Drosophila
melanogaster]
Length = 813
Score = 608 bits (1568), Expect = e-171, Method: Compositional matrix adjust.
Identities = 310/651 (47%), Positives = 423/651 (64%), Gaps = 56/651 (8%)
Query: 2 GNFRSHSPSAMDETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGALKLRFKK 61
G ++ P + + EL + LGL+G RPLLLVRT+ ELLIYQ FR+PKG LK+RF+K
Sbjct: 205 GIVQACMPQHANSPLPLELSVIGLGLNGERPLLLVRTRVELLIYQVFRYPKGHLKIRFRK 264
Query: 62 LKVLFVSDRS------KRANEQPGL------PRGVRISQMRYFSNIAGYQGVFLCGPHPA 109
L L + D+ +EQ + P+ V+ ++R F+N+ G GV +CG +P
Sbjct: 265 LDQLNLLDQQPTHIELDENDEQEEIESYQMQPKYVQ--KLRPFANVGGLSGVMVCGVNPC 322
Query: 110 WLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPW 169
++FLT RGELR H + +G V + A F+NVN P GFLYF+ EL+ISVLP++LSYD+ W
Sbjct: 323 FVFLTFRGELRIHRLLGNGDVRSFAAFNNVNIPNGFLYFDTTYELKISVLPSYLSYDSVW 382
Query: 170 PVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLV 229
PVRKVPL+CTP L YH E + YC++T T EP T YY+FNGEDKEL + RD RFI P+
Sbjct: 383 PVRKVPLRCTPRQLVYHRENRVYCLITQTEEPMTKYYRFNGEDKELSEESRDERFIYPIG 442
Query: 230 SQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSE 289
SQF + L SP +WE +P + WEHV K V + YEGT SGL+ Y+ +GTN+NYSE
Sbjct: 443 SQFEMVLISPETWEIVPDASITFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNYSE 502
Query: 290 DVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIY 349
D+T RG I ++DIIEVVPEPG+P+TK KIK I+ KEQKGPV+AI V GFLVT +GQKIY
Sbjct: 503 DITSRGNIHIYDIIEVVPEPGKPMTKFKIKEIFKKEQKGPVSAISDVLGFLVTGLGQKIY 562
Query: 350 IWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYK 409
IWQL+D DL G+AFIDT +Y+ +++VK+LI + D +SI+LLR+Q EYRTLSL +RD+
Sbjct: 563 IWQLRDGDLIGVAFIDTNIYVHQIITVKSLIFIADVYKSISLLRFQEEYRTLSLASRDFN 622
Query: 410 PTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMI 469
P + + N S++GF++
Sbjct: 623 PLEVYGIEFMVDN----------------------------------------SNLGFLV 642
Query: 470 SDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPG--ARS 527
+D ++N++++MYQPEARES GG +L++K D+HLGQ VNT F+++C + +
Sbjct: 643 TDAERNIIVYMYQPEARESLGGQKLLRKADYHLGQVVNTMFRVQCHQKGLHQRQPFLYEN 702
Query: 528 RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAG 587
+ Y +LDGALG+ LPLPEK YRR LMLQNV++++ H GLNP+ +RT K
Sbjct: 703 KHFVVYGTLDGALGYCLPLPEKVYRRFLMLQNVLLSYQEHLCGLNPKEYRTLKSSKKQGI 762
Query: 588 NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
NPSR IIDG L+W + ++ ER E+ KKIG++ +IL +L +IE L+S F
Sbjct: 763 NPSRCIIDGDLIWSYRLMANSERNEVAKKIGTRTEEILGDLLEIERLASVF 813
>gi|24653655|ref|NP_725397.1| cleavage and polyadenylation specificity factor 160, isoform B
[Drosophila melanogaster]
gi|15292103|gb|AAK93320.1| LD38533p [Drosophila melanogaster]
gi|21627189|gb|AAM68553.1| cleavage and polyadenylation specificity factor 160, isoform B
[Drosophila melanogaster]
Length = 1420
Score = 607 bits (1564), Expect = e-171, Method: Compositional matrix adjust.
Identities = 308/651 (47%), Positives = 422/651 (64%), Gaps = 56/651 (8%)
Query: 2 GNFRSHSPSAMDETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGALKLRFKK 61
G ++ P + + EL + LGL+G RPLLLVRT+ ELLIYQ FR+PKG LK+RF+K
Sbjct: 812 GIVQACMPQHANSPLPLELSVIGLGLNGERPLLLVRTRVELLIYQVFRYPKGHLKIRFRK 871
Query: 62 LKVLFVSDRS------KRANEQPGL------PRGVRISQMRYFSNIAGYQGVFLCGPHPA 109
+ L + D+ +EQ + P+ V+ ++R F+N+ G GV +CG +P
Sbjct: 872 MDQLNLLDQQPTHIDLDENDEQEEIESYQMQPKYVQ--KLRPFANVGGLSGVMVCGVNPC 929
Query: 110 WLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPW 169
++FLT RGELR H + +G V + A F+NVN P GFLYF+ EL+ISVLP++LSYD+ W
Sbjct: 930 FVFLTFRGELRIHRLLGNGDVRSFAAFNNVNIPNGFLYFDTTYELKISVLPSYLSYDSVW 989
Query: 170 PVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLV 229
PVRKVPL+CTP L YH E + YC++T T EP T YY+FNGEDKEL + R RFI P+
Sbjct: 990 PVRKVPLRCTPRQLVYHRENRVYCLITQTEEPMTKYYRFNGEDKELSEESRGERFIYPIG 1049
Query: 230 SQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSE 289
SQF + L SP +WE +P + WEHV K V + YEGT SGL+ Y+ +GTN+NYSE
Sbjct: 1050 SQFEMVLISPETWEIVPDASITFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNYSE 1109
Query: 290 DVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIY 349
D+T RG I ++DIIEVVPEPG+P+TK KIK I+ KEQKGPV+AI V GFLVT +GQKIY
Sbjct: 1110 DITSRGNIHIYDIIEVVPEPGKPMTKFKIKEIFKKEQKGPVSAISDVLGFLVTGLGQKIY 1169
Query: 350 IWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYK 409
IWQL+D DL G+AFIDT +Y+ +++VK+LI + D +SI+LLR+Q EYRTLSL +RD+
Sbjct: 1170 IWQLRDGDLIGVAFIDTNIYVHQIITVKSLIFIADVYKSISLLRFQEEYRTLSLASRDFN 1229
Query: 410 PTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMI 469
P + + N S++GF++
Sbjct: 1230 PLEVYGIEFMVDN----------------------------------------SNLGFLV 1249
Query: 470 SDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPG--ARS 527
+D ++N++++MYQPEARES GG +L++K D+HLGQ VNT F+++C + +
Sbjct: 1250 TDAERNIIVYMYQPEARESLGGQKLLRKADYHLGQVVNTMFRVQCHQKGLHQRQPFLYEN 1309
Query: 528 RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAG 587
+ Y +LDGALG+ LPLPEK YRR LMLQNV++++ H GLNP+ +RT K
Sbjct: 1310 KHFVVYGTLDGALGYCLPLPEKVYRRFLMLQNVLLSYQEHLCGLNPKEYRTLKSSKKQGI 1369
Query: 588 NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
NPSR IIDG L+W + ++ ER E+ KKIG++ +IL +L +IE L+S F
Sbjct: 1370 NPSRCIIDGDLIWSYRLMANSERNEVAKKIGTRTEEILGDLLEIERLASVF 1420
>gi|195334368|ref|XP_002033855.1| GM20208 [Drosophila sechellia]
gi|194125825|gb|EDW47868.1| GM20208 [Drosophila sechellia]
Length = 1455
Score = 606 bits (1563), Expect = e-170, Method: Compositional matrix adjust.
Identities = 309/651 (47%), Positives = 422/651 (64%), Gaps = 56/651 (8%)
Query: 2 GNFRSHSPSAMDETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGALKLRFKK 61
G ++ P + + EL + LGL+G RPLLLVRT+ ELLIYQ FR+PKG LK+RF+K
Sbjct: 847 GIVQACMPQHANSPLPLELSVIGLGLNGERPLLLVRTRVELLIYQVFRYPKGHLKIRFRK 906
Query: 62 LKVLFVSDRS------KRANEQPGL------PRGVRISQMRYFSNIAGYQGVFLCGPHPA 109
L L + D+ +EQ + P+ V+ ++R F+N+ G GV +CG +P
Sbjct: 907 LDQLNLLDQQPTHIELDENDEQEEIESYQMQPKYVQ--KLRPFANVGGLSGVMVCGVNPC 964
Query: 110 WLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPW 169
++FLT RGELR H + +G V + A F+NVN P GFLYF+ EL+ISVLP++LSYD+ W
Sbjct: 965 FVFLTFRGELRIHRLLGNGDVRSFAAFNNVNIPNGFLYFDTTYELKISVLPSYLSYDSIW 1024
Query: 170 PVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLV 229
PVRKVPL+CTP L YH E + YC++T T EP T YY+FNGEDKEL + R RFI P+
Sbjct: 1025 PVRKVPLRCTPRQLVYHRENRVYCLITQTEEPMTKYYRFNGEDKELSEESRGERFIYPIG 1084
Query: 230 SQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSE 289
SQF + L SP +WE +P + WEHV K V + YEGT SGL+ Y+ +GTN+NYSE
Sbjct: 1085 SQFEMVLISPETWEIVPDASITFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNYSE 1144
Query: 290 DVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIY 349
D+T RG I ++DIIEVVPEPG+P+TK KIK I+ KEQKGPV+AI V GFLVT +GQKIY
Sbjct: 1145 DITSRGNIHIYDIIEVVPEPGKPMTKFKIKEIFKKEQKGPVSAISDVLGFLVTGLGQKIY 1204
Query: 350 IWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYK 409
IWQL+D DL G+AFIDT +Y+ +++VK+LI + D +SI+LLR+Q EYRTLSL +RD+
Sbjct: 1205 IWQLRDGDLIGVAFIDTNIYVHQIITVKSLIFIADVYKSISLLRFQEEYRTLSLASRDFN 1264
Query: 410 PTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMI 469
P + + N S++GF++
Sbjct: 1265 PLEVYGIEFMVDN----------------------------------------SNLGFLV 1284
Query: 470 SDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPG--ARS 527
+D ++N++++MYQPEARES GG +L++K D+HLGQ VNT F+++C + +
Sbjct: 1285 TDAERNLIVYMYQPEARESLGGQKLLRKADYHLGQVVNTMFRVQCHQKGLHQRQPFLYEN 1344
Query: 528 RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAG 587
+ Y +LDGALG+ LPLPEK YRR LMLQNV++++ H GLNP+ +RT K
Sbjct: 1345 KHFVVYGTLDGALGYCLPLPEKVYRRFLMLQNVLLSYQEHLCGLNPKEYRTLKSSKKQGI 1404
Query: 588 NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
NPSR IIDG L+W + ++ ER E+ KKIG++ +IL +L +IE L+S F
Sbjct: 1405 NPSRCIIDGDLIWSYRLMANSERNEVAKKIGTRTEEILGDLLEIERLASVF 1455
>gi|45552619|ref|NP_995833.1| cleavage and polyadenylation specificity factor 160, isoform A
[Drosophila melanogaster]
gi|18203551|sp|Q9V726.1|CPSF1_DROME RecName: Full=Cleavage and polyadenylation specificity factor subunit
1; AltName: Full=Cleavage and polyadenylation specificity
factor 160 kDa subunit; Short=CPSF 160 kDa subunit;
Short=dCPSF 160
gi|7303176|gb|AAF58240.1| cleavage and polyadenylation specificity factor 160, isoform A
[Drosophila melanogaster]
Length = 1455
Score = 606 bits (1563), Expect = e-170, Method: Compositional matrix adjust.
Identities = 308/651 (47%), Positives = 422/651 (64%), Gaps = 56/651 (8%)
Query: 2 GNFRSHSPSAMDETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGALKLRFKK 61
G ++ P + + EL + LGL+G RPLLLVRT+ ELLIYQ FR+PKG LK+RF+K
Sbjct: 847 GIVQACMPQHANSPLPLELSVIGLGLNGERPLLLVRTRVELLIYQVFRYPKGHLKIRFRK 906
Query: 62 LKVLFVSDRS------KRANEQPGL------PRGVRISQMRYFSNIAGYQGVFLCGPHPA 109
+ L + D+ +EQ + P+ V+ ++R F+N+ G GV +CG +P
Sbjct: 907 MDQLNLLDQQPTHIDLDENDEQEEIESYQMQPKYVQ--KLRPFANVGGLSGVMVCGVNPC 964
Query: 110 WLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPW 169
++FLT RGELR H + +G V + A F+NVN P GFLYF+ EL+ISVLP++LSYD+ W
Sbjct: 965 FVFLTFRGELRIHRLLGNGDVRSFAAFNNVNIPNGFLYFDTTYELKISVLPSYLSYDSVW 1024
Query: 170 PVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLV 229
PVRKVPL+CTP L YH E + YC++T T EP T YY+FNGEDKEL + R RFI P+
Sbjct: 1025 PVRKVPLRCTPRQLVYHRENRVYCLITQTEEPMTKYYRFNGEDKELSEESRGERFIYPIG 1084
Query: 230 SQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSE 289
SQF + L SP +WE +P + WEHV K V + YEGT SGL+ Y+ +GTN+NYSE
Sbjct: 1085 SQFEMVLISPETWEIVPDASITFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNYSE 1144
Query: 290 DVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIY 349
D+T RG I ++DIIEVVPEPG+P+TK KIK I+ KEQKGPV+AI V GFLVT +GQKIY
Sbjct: 1145 DITSRGNIHIYDIIEVVPEPGKPMTKFKIKEIFKKEQKGPVSAISDVLGFLVTGLGQKIY 1204
Query: 350 IWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYK 409
IWQL+D DL G+AFIDT +Y+ +++VK+LI + D +SI+LLR+Q EYRTLSL +RD+
Sbjct: 1205 IWQLRDGDLIGVAFIDTNIYVHQIITVKSLIFIADVYKSISLLRFQEEYRTLSLASRDFN 1264
Query: 410 PTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMI 469
P + + N S++GF++
Sbjct: 1265 PLEVYGIEFMVDN----------------------------------------SNLGFLV 1284
Query: 470 SDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPG--ARS 527
+D ++N++++MYQPEARES GG +L++K D+HLGQ VNT F+++C + +
Sbjct: 1285 TDAERNIIVYMYQPEARESLGGQKLLRKADYHLGQVVNTMFRVQCHQKGLHQRQPFLYEN 1344
Query: 528 RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAG 587
+ Y +LDGALG+ LPLPEK YRR LMLQNV++++ H GLNP+ +RT K
Sbjct: 1345 KHFVVYGTLDGALGYCLPLPEKVYRRFLMLQNVLLSYQEHLCGLNPKEYRTLKSSKKQGI 1404
Query: 588 NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
NPSR IIDG L+W + ++ ER E+ KKIG++ +IL +L +IE L+S F
Sbjct: 1405 NPSRCIIDGDLIWSYRLMANSERNEVAKKIGTRTEEILGDLLEIERLASVF 1455
>gi|194883064|ref|XP_001975624.1| GG22421 [Drosophila erecta]
gi|190658811|gb|EDV56024.1| GG22421 [Drosophila erecta]
Length = 1455
Score = 606 bits (1562), Expect = e-170, Method: Compositional matrix adjust.
Identities = 309/651 (47%), Positives = 421/651 (64%), Gaps = 56/651 (8%)
Query: 2 GNFRSHSPSAMDETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGALKLRFKK 61
G ++ P + + EL LGL+G RPLL+VRT+ ELLIYQ FR+PKG LK+RF+K
Sbjct: 847 GIVQACMPQHANSPLPLELSLTGLGLNGERPLLMVRTRVELLIYQVFRYPKGHLKIRFRK 906
Query: 62 LKVLFVSDRS------KRANEQPGL------PRGVRISQMRYFSNIAGYQGVFLCGPHPA 109
L L + D+ +EQ + P+ V+ ++R F+N+ G GV +CG +P
Sbjct: 907 LDQLNLLDQQPTHIELDENDEQEDIESYQMQPKYVQ--KLRPFANVGGLSGVMVCGVNPC 964
Query: 110 WLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPW 169
++FLT RGELR H + +G V + A F+NVN P GFLYF+ EL+ISVLP++LSYD+ W
Sbjct: 965 FVFLTFRGELRIHRLLGNGDVRSFAAFNNVNIPNGFLYFDTTYELKISVLPSYLSYDSTW 1024
Query: 170 PVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLV 229
PVRKVPL+CTP L YH E + YC++T T EP T YY+FNGEDKEL + R RFI P+
Sbjct: 1025 PVRKVPLRCTPRQLVYHRENRVYCLITQTEEPMTKYYRFNGEDKELSEESRGERFIYPIG 1084
Query: 230 SQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSE 289
SQF + L SP +WE +P + WEHV K V + YEGT SGL+ Y+ +GTN+NYSE
Sbjct: 1085 SQFEMVLISPETWEIVPDASISFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNYSE 1144
Query: 290 DVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIY 349
D+T RG I ++DIIEVVPEPG+P+TK KIK I+ KEQKGPV+AI V GFLVT +GQKIY
Sbjct: 1145 DITSRGNIHIYDIIEVVPEPGKPMTKFKIKEIFKKEQKGPVSAISDVLGFLVTGLGQKIY 1204
Query: 350 IWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYK 409
IWQL+D DL G+AFIDT +Y+ +++VK+LI + D +SI+LLR+Q EYRTLSL +RD+
Sbjct: 1205 IWQLRDGDLIGVAFIDTNIYVHQIITVKSLIFIADVYKSISLLRFQEEYRTLSLASRDFN 1264
Query: 410 PTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMI 469
P + + N S++GF++
Sbjct: 1265 PLEVYGIEFMVDN----------------------------------------SNLGFLV 1284
Query: 470 SDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPG--ARS 527
+D ++N++++MYQPEARES GG +L++K D+HLGQ VNT F+++C + +
Sbjct: 1285 TDAERNLIVYMYQPEARESLGGQKLLRKADYHLGQVVNTMFRVQCHQKGLHQRQPFLYEN 1344
Query: 528 RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAG 587
+ Y +LDGALG+ LPLPEK YRR LMLQNV+V++ H GLNP+ +RT K
Sbjct: 1345 KHFVVYGTLDGALGYCLPLPEKVYRRFLMLQNVLVSYQEHLCGLNPKEYRTLKSFKKQGI 1404
Query: 588 NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
NPSR IIDG L+W + ++ ER E+ KKIG++ +IL +L +IE L+S F
Sbjct: 1405 NPSRCIIDGDLIWSYRLMANSERNEVAKKIGTRTEEILGDLLEIERLASVF 1455
>gi|195485994|ref|XP_002091320.1| GE12310 [Drosophila yakuba]
gi|194177421|gb|EDW91032.1| GE12310 [Drosophila yakuba]
Length = 1455
Score = 603 bits (1556), Expect = e-170, Method: Compositional matrix adjust.
Identities = 308/651 (47%), Positives = 421/651 (64%), Gaps = 56/651 (8%)
Query: 2 GNFRSHSPSAMDETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGALKLRFKK 61
G ++ P + + EL + LGL+G RPLLLVRT+ ELLIYQ FR+PKG LK+RF+K
Sbjct: 847 GIVQACMPQHANSPLPLELSVIGLGLNGERPLLLVRTRVELLIYQVFRYPKGHLKIRFRK 906
Query: 62 LKVLFVSDRS--------KRANEQ----PGLPRGVRISQMRYFSNIAGYQGVFLCGPHPA 109
L L + D+ A E+ P+ V+ ++R F+N+ G GV +CG +P
Sbjct: 907 LDQLNLLDQQPTHIELDENDAQEEIESYQMQPKYVQ--KLRPFANVGGLSGVMVCGVNPC 964
Query: 110 WLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPW 169
++FLT RGELR H + +G V + A F+NVN P GFLYF+ EL+ISVLP++LSYD+ W
Sbjct: 965 FVFLTFRGELRIHRLLGNGDVRSFAAFNNVNIPNGFLYFDTTYELKISVLPSYLSYDSTW 1024
Query: 170 PVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLV 229
PVRKVPL+CTP L YH E + YC++T T EP T YY+FNGEDKEL + R RFI P+
Sbjct: 1025 PVRKVPLRCTPRQLVYHRENRVYCLITQTEEPMTKYYRFNGEDKELSEESRGERFIYPIG 1084
Query: 230 SQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSE 289
SQF + L SP +WE +P + WEHV K V + YEGT SGL+ Y+ +GTN+NYSE
Sbjct: 1085 SQFEMVLISPETWEIVPDASISFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNYSE 1144
Query: 290 DVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIY 349
D+T RG I ++DIIEVVPEPG+P+TK KIK I+ KEQKGPV+AI V GFLVT +GQKIY
Sbjct: 1145 DITSRGNIHIYDIIEVVPEPGKPMTKFKIKEIFKKEQKGPVSAISDVLGFLVTGLGQKIY 1204
Query: 350 IWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYK 409
IWQL+D DL G+AFIDT +Y+ +++VK+LI + D +SI+LLR+Q EYRTLSL +RD+
Sbjct: 1205 IWQLRDGDLIGVAFIDTNIYVHQIITVKSLIFIADVYKSISLLRFQEEYRTLSLASRDFN 1264
Query: 410 PTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMI 469
P + + N S++GF++
Sbjct: 1265 PLEVYGIEFMVDN----------------------------------------SNLGFLV 1284
Query: 470 SDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPG--ARS 527
+D ++N++++MYQPEARES GG +L++K D+HLGQ VNT F+++C + +
Sbjct: 1285 TDAERNLIVYMYQPEARESLGGQKLLRKADYHLGQVVNTMFRVQCHQKGLHQRQPFLYEN 1344
Query: 528 RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAG 587
+ Y +LDGALG+ LPLPEK YRR LMLQNV++++ H GLNP+ +RT K
Sbjct: 1345 KHFVVYGTLDGALGYCLPLPEKVYRRFLMLQNVLLSYQEHLCGLNPKEYRTLKSFKKQGI 1404
Query: 588 NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
NPSR +IDG L+W + ++ ER E+ KKIG++ +IL +L +IE L+S F
Sbjct: 1405 NPSRCVIDGDLIWSYRLMANSERNEVAKKIGTRTEEILADLLEIERLASVF 1455
>gi|355698297|gb|EHH28845.1| Cleavage and polyadenylation specificity factor 160 kDa subunit
[Macaca mulatta]
Length = 1436
Score = 599 bits (1544), Expect = e-168, Method: Compositional matrix adjust.
Identities = 295/626 (47%), Positives = 399/626 (63%), Gaps = 60/626 (9%)
Query: 16 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGALKLRFKKLKVLFVSDRSKRAN 75
+V+E+L V+LG +RP LLV + F++ K +++
Sbjct: 868 LVKEVLLVALGSRQSRPYLLV-----------------PHNINFREKKPKPSKKKAEGGG 910
Query: 76 EQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAP 135
+ G R+++ RYF +I GY GVF+CGP P WL +T RG LR HPM IDGPV + AP
Sbjct: 911 TEEGAGARGRVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRLHPMAIDGPVDSFAP 970
Query: 136 FHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIV 195
FHNVNCPRGFLYFN + ELRISVLP +LSYDAPWPVRK+PL+CT H++AYH+E+K Y +
Sbjct: 971 FHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVESKVYAVA 1030
Query: 196 TSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEW 255
TST P + GE+KE T RD R+I P F + L SP SWE IP L EW
Sbjct: 1031 TSTNTPCARIPRMTGEEKEFETIERDERYIHPQQEAFSIQLISPVSWEAIPNARIELQEW 1090
Query: 256 EHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTK 315
EHV C+K VS+ E T+SGL+GY+A GT E+VTCRGRIL+ D+IEVVPEPGQPLTK
Sbjct: 1091 EHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTK 1150
Query: 316 NKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVS 375
NK K++Y KEQKGPVTA+CH G LV+A+GQKI++W L+ ++LTG+AFIDT++YI M+S
Sbjct: 1151 NKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMAFIDTQLYIHQMIS 1210
Query: 376 VKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWK 435
VKN IL D +SI+LLRYQ E +TLSLV+RD KP + S + N
Sbjct: 1211 VKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDN------------- 1257
Query: 436 FLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLI 495
+ +GF++SD+D+N++++MY PEA+ES GG RL+
Sbjct: 1258 ---------------------------AQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLL 1290
Query: 496 KKTDFHLGQHVNTFFKIRCKPSSISDAPGA---RSRFLTWYASLDGALGFFLPLPEKNYR 552
++ DFH+G HVNTF++ C+ ++ + + ++ +TW+A+LDG +G LP+ EK YR
Sbjct: 1291 RRADFHVGAHVNTFWRTPCRGATEGLSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYR 1350
Query: 553 RLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLE 612
RLLMLQN + T H GLNPRAFR N R ++DG L+ ++L LS ER E
Sbjct: 1351 RLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELLNRYLYLSTMERSE 1410
Query: 613 ICKKIGSKHNDILDELYDIEALSSHF 638
+ KKIG+ + ILD+L + + +++HF
Sbjct: 1411 LAKKIGTTPDIILDDLLETDRVTAHF 1436
>gi|391328522|ref|XP_003738737.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Metaseiulus occidentalis]
Length = 1500
Score = 593 bits (1529), Expect = e-167, Method: Compositional matrix adjust.
Identities = 293/649 (45%), Positives = 410/649 (63%), Gaps = 56/649 (8%)
Query: 2 GNFRSHSPSAMDETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAF---RHPKGALKLR 58
G S S S V E+ +LG+H +RPLL R EL IY+A+ +G LKL+
Sbjct: 896 GQTTSASTSEAQLPKVMEIFVCALGMHQSRPLLFARVDSELHIYEAYPFVNQKEGHLKLQ 955
Query: 59 FKKLKVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGE 118
F++L+ + + ++ G P + + +R F ++ GY GVF+CG P W+FLT+RGE
Sbjct: 956 FRRLQHAVTMEPRRVYKQKEGDPT-LSLRWIRAFQDVCGYNGVFVCGRRPHWIFLTARGE 1014
Query: 119 LRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKC 178
LRAHPM DG + + A FHNVNC +GFL+FN ELRI LP++L+YDAPWP+RK+P+
Sbjct: 1015 LRAHPMLNDGRIYSFATFHNVNCEKGFLFFNKYGELRICALPSYLNYDAPWPMRKIPIYE 1074
Query: 179 TPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDS-RFIPPLVSQFHVSLF 237
TPH + YH++++TYC+ TS E +T K EDKE R+S RFIPP V +F + L+
Sbjct: 1075 TPHSVNYHVDSRTYCVATSKEETATCVPKLANEDKEFEPIERESSRFIPPTVDKFALELW 1134
Query: 238 SPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRI 297
SP SWE IP T P+ +WE + C+KNV + EGT SG +G IA+GT +N+ ED+T +GRI
Sbjct: 1135 SPVSWEAIPNTRMPMEDWEKITCVKNVMIASEGTTSGEKGLIAVGTIHNFGEDITAKGRI 1194
Query: 298 LLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDND 357
LL DIIEVVPEPGQPLT++K+K I +K Q PVTA+C V G L+ AVGQK++++QLKDND
Sbjct: 1195 LLIDIIEVVPEPGQPLTRSKVKTILSKPQNAPVTALCSVKGHLMAAVGQKLFLFQLKDND 1254
Query: 358 LTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKG 417
L G+AF+DT++YI S +S+K+ IL+GD +SI LLRYQ E +TL++V++D KP Q S
Sbjct: 1255 LVGMAFLDTQIYILSAISIKSFILIGDVHKSITLLRYQEESKTLAVVSKDTKPVQIYSIE 1314
Query: 418 YYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVV 477
Y N S M F+ +D N++
Sbjct: 1315 YLVDN----------------------------------------SQMAFLATDAQCNIL 1334
Query: 478 LFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFL------- 530
++MYQPE RE+ GG RLI++ DF++G +NT F+IRC+ +++ P + R L
Sbjct: 1335 VYMYQPENRETFGGQRLIRRGDFNIGSRINTMFRIRCR---LAEVPRSERRLLSDLEARH 1391
Query: 531 -TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNP 589
T YASLDGA G+ LP+ EK YRRLLMLQNV+ ++ H GGLNP+AFR + NP
Sbjct: 1392 VTLYASLDGAFGYLLPISEKTYRRLLMLQNVLNSYCQHVGGLNPKAFRIMQTDVRALSNP 1451
Query: 590 SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
+ I+DG L+ F+ L+ E+ E+ +KIG+ + I +L +IE L+ HF
Sbjct: 1452 QKNIVDGDLINVFMDLNFNEKAEVARKIGTTVHQIQLDLAEIEGLTYHF 1500
>gi|301773406|ref|XP_002922132.1| PREDICTED: LOW QUALITY PROTEIN: cleavage and polyadenylation
specificity factor subunit 1-like [Ailuropoda
melanoleuca]
Length = 1469
Score = 578 bits (1490), Expect = e-162, Method: Compositional matrix adjust.
Identities = 300/649 (46%), Positives = 406/649 (62%), Gaps = 86/649 (13%)
Query: 16 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGALKLRFKKL--KVLFVSD 69
+V+E+L V+LG +RP LLV ELLIY+AF H +G LK+RFKK+ + F
Sbjct: 881 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKVPHNINFREK 940
Query: 70 RSKR--------ANEQPGLPRGVRISQMRYFSNIAGYQG-------VFLCGPHPAWLFLT 114
+ K + E+ RG R+++ RYF +I GY G VF+CGP P WL +T
Sbjct: 941 KPKPSKKKVEGGSAEEGAGARG-RVARFRYFEDIYGYSGGGGACPQVFICGPSPHWLLVT 999
Query: 115 SRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKV 174
RG LR HPM IDGP+ + APFHNVNCPRGFLYFN + ELRISVLP +LSYDAPWPVRK+
Sbjct: 1000 GRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKI 1059
Query: 175 PLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHV 234
PL+CT H++AYH+E+K Y + TST P T + GE+KE T RD R+I P F +
Sbjct: 1060 PLRCTAHYVAYHVESKVYAVATSTNMPCTRIPRMTGEEKEFETIERDDRYIHPQQEAFSI 1119
Query: 235 SLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCR 294
L SP SWE IP L EWEHV C+K VS+ E T+SGL+GY+A GT E+VTCR
Sbjct: 1120 QLISPVSWEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCR 1179
Query: 295 GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLK 354
GRIL+ D+IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH G LV+A+GQKI++W L+
Sbjct: 1180 GRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLR 1239
Query: 355 DNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPN 414
++LTG+AFIDT++YI M+SVKN IL D +SI+LLRYQ E +TLSLV+RD KP +
Sbjct: 1240 ASELTGMAFIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVY 1299
Query: 415 SKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDK 474
S + N + +GF++SD+D+
Sbjct: 1300 SVDFMVDN----------------------------------------AQLGFLVSDRDR 1319
Query: 475 NVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARS-----RF 529
N++++MY PEA+ES GG RL+++ DFH+G HVNTF++ C+ ++ P +S +
Sbjct: 1320 NLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCR--GAAEGPSKKSVVWENKH 1377
Query: 530 LTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNP 589
+TW+A+LDG +G LP+ EK R LQ H ++ R + N
Sbjct: 1378 ITWFATLDGGIGLLLPMQEKTNR----LQPAXSPRMLH---VDRRILQ----------NA 1420
Query: 590 SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
R ++DG L+ ++L LS ER E+ KKIG+ + ILD+L + + +++HF
Sbjct: 1421 VRNVLDGELLNRYLYLSTMERGELAKKIGTTPDIILDDLLETDRVTAHF 1469
>gi|410042329|ref|XP_003954555.1| PREDICTED: LOW QUALITY PROTEIN: cleavage and polyadenylation
specificity factor subunit 1 [Pan troglodytes]
Length = 1296
Score = 578 bits (1489), Expect = e-162, Method: Compositional matrix adjust.
Identities = 272/540 (50%), Positives = 360/540 (66%), Gaps = 43/540 (7%)
Query: 102 FLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPT 161
F+CGP P WL +T RG LR HPM IDGPV + APFHNVNCPRGFLYFN + ELRISVLP
Sbjct: 797 FICGPSPPWLLVTGRGALRLHPMAIDGPVDSFAPFHNVNCPRGFLYFNRQGELRISVLPA 856
Query: 162 HLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRD 221
+LSYDAPWPVRK+PL+CT H++AYH+E+K Y + TST P + GE+KE T RD
Sbjct: 857 YLSYDAPWPVRKIPLRCTAHYVAYHVESKVYAVATSTNTPCARIPRMTGEEKEFETIERD 916
Query: 222 SRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIAL 281
R+I P F + L SP SWE IP L EWEHV C+K VS+ E T+SGL+GY+A
Sbjct: 917 ERYIHPQQEAFSIQLISPVSWEAIPNARIELQEWEHVTCMKTVSLRSEETVSGLKGYVAA 976
Query: 282 GTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLV 341
GT E+VTCRGRIL+ D+IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH G LV
Sbjct: 977 GTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLV 1036
Query: 342 TAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTL 401
+A+GQKI++W L+ ++LTG+AFIDT++YI M+SVKN IL D +SI+LLRYQ E +TL
Sbjct: 1037 SAIGQKIFLWSLRASELTGMAFIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTL 1096
Query: 402 SLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDE 461
SLV+RD KP + S + N
Sbjct: 1097 SLVSRDAKPLEVYSVDFMVDN--------------------------------------- 1117
Query: 462 FSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISD 521
+ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF++ C+ ++
Sbjct: 1118 -AQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGATEGL 1176
Query: 522 APGA---RSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRT 578
+ + ++ +TW+A+LDG +G LP+ EK YRRLLMLQN + T H GLNPRAFR
Sbjct: 1177 SKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRM 1236
Query: 579 YKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
N R ++DG L+ ++L LS ER E+ KKIG+ + ILD+L + + +++HF
Sbjct: 1237 LHVDRRTLQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDIILDDLLETDRVTAHF 1296
>gi|195583398|ref|XP_002081509.1| GD25678 [Drosophila simulans]
gi|194193518|gb|EDX07094.1| GD25678 [Drosophila simulans]
Length = 1450
Score = 577 bits (1488), Expect = e-162, Method: Compositional matrix adjust.
Identities = 293/615 (47%), Positives = 397/615 (64%), Gaps = 56/615 (9%)
Query: 2 GNFRSHSPSAMDETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGALKLRFKK 61
G ++ P + + EL + LGL+G RPLLLVRT+ ELLIYQ FR+PKG LK+RF+K
Sbjct: 847 GIVQACMPQHANSPLPLELSVIGLGLNGERPLLLVRTRVELLIYQVFRYPKGHLKIRFRK 906
Query: 62 LKVLFVSDRS------KRANEQPGL------PRGVRISQMRYFSNIAGYQGVFLCGPHPA 109
L + D+ +EQ + P+ V+ ++R F+N+ G GV +CG +P
Sbjct: 907 LDXXNLLDQQPTHIELDENDEQEEIESYQMQPKYVQ--KLRPFANVGGLSGVMVCGVNPC 964
Query: 110 WLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPW 169
++FLT RGELR H + +G V + A F+NVN P GFLYF+ EL+ISVLP++LSYD+ W
Sbjct: 965 FVFLTFRGELRIHRLLGNGDVRSFAAFNNVNIPNGFLYFDTTYELKISVLPSYLSYDSIW 1024
Query: 170 PVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLV 229
PVRKVPL+CTP L YH E + YC++T T EP T YY+FNGEDKEL + R RFI P+
Sbjct: 1025 PVRKVPLRCTPRQLVYHRENRVYCLITQTEEPMTKYYRFNGEDKELSEESRGERFIYPIG 1084
Query: 230 SQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSE 289
SQF + L SP +WE +P + WEHV K V + YEGT SGL+ Y+ +GTN+NYSE
Sbjct: 1085 SQFEMVLISPETWEIVPDASITFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNYSE 1144
Query: 290 DVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIY 349
D+T RG I ++DIIEVVPEPG+P+TK KIK I+ KEQKGPV+AI V GFLVT +GQKIY
Sbjct: 1145 DITSRGNIHIYDIIEVVPEPGKPMTKFKIKEIFKKEQKGPVSAISDVLGFLVTGLGQKIY 1204
Query: 350 IWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYK 409
IWQL+D DL G+AFIDT +Y+ +++VK+LI + D +SI+LLR+Q EYRTLSL +RD+
Sbjct: 1205 IWQLRDGDLIGVAFIDTNIYVHQIITVKSLIFIADVYKSISLLRFQEEYRTLSLASRDFN 1264
Query: 410 PTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMI 469
P + + N S++GF++
Sbjct: 1265 PLEVYGIEFMVDN----------------------------------------SNLGFLV 1284
Query: 470 SDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPG--ARS 527
+D ++N++++MYQPEARES GG +L++K D+HLGQ VNT F+++C + +
Sbjct: 1285 TDAERNLIVYMYQPEARESLGGQKLLRKADYHLGQVVNTMFRVQCHQKGLHQRQPFLYEN 1344
Query: 528 RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAG 587
+ Y +LDGALG+ LPLPEK YRR LMLQNV++++ H GLNP+ +RT K
Sbjct: 1345 KHFVVYGTLDGALGYCLPLPEKVYRRFLMLQNVLLSYQEHLCGLNPKEYRTLKSSKKQGI 1404
Query: 588 NPSRGIIDGSLVWKF 602
NPSR IIDG L+W +
Sbjct: 1405 NPSRCIIDGDLIWSY 1419
>gi|156364999|ref|XP_001626630.1| predicted protein [Nematostella vectensis]
gi|156213514|gb|EDO34530.1| predicted protein [Nematostella vectensis]
Length = 1420
Score = 577 bits (1488), Expect = e-162, Method: Compositional matrix adjust.
Identities = 294/645 (45%), Positives = 404/645 (62%), Gaps = 55/645 (8%)
Query: 6 SHSPSAMDETI-VQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP--KGALKLRFKKL 62
+ S + +E++ V+E+L LG R L+ +LLIY+AF +P +G L LRFKKL
Sbjct: 819 TQSSVSEEESLNVREVLLTGLGYKNRRATLVAVMDQDLLIYEAFSYPTVEGHLNLRFKKL 878
Query: 63 KVLFVSDRSKRANEQP--------GLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLT 114
+ + R K+ ++P GL +++ +R F++I+ Y G+F+CG +P W+F+T
Sbjct: 879 Q-HNIQIREKKPKQEPKNDSETKSGL--DPKVAMLRVFNDISSYSGIFVCGSYPFWIFVT 935
Query: 115 SRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKV 174
+RG HPM+IDGPV+ A FHNVNCP+GFLYFN + ELRISVLPTHLSYD+PWPVRKV
Sbjct: 936 NRGAFHWHPMSIDGPVTCFAAFHNVNCPKGFLYFNTRGELRISVLPTHLSYDSPWPVRKV 995
Query: 175 PLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHV 234
PL+ TPH ++Y+ E+KTY IVTS EP + EDKE V RD+RFI P +F +
Sbjct: 996 PLRYTPHMVSYNRESKTYAIVTSEQEPCKKIPRVTAEDKEFVDTIRDARFIYPSTERFVL 1055
Query: 235 SLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCR 294
L SP SWE IP T L EWEHV +KN+ + E T +G +G+I +GT Y E++ R
Sbjct: 1056 QLISPISWEVIPNTRHDLDEWEHVTTMKNLLLHSEETHTGRKGFICVGTTQLYGEEIAVR 1115
Query: 295 GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLK 354
GRIL+FDIIEVVPEPGQPLTKNK K++Y KEQKGPVTA+ V G+LV+ +GQKIYIW
Sbjct: 1116 GRILIFDIIEVVPEPGQPLTKNKFKLLYEKEQKGPVTALNQVNGYLVSGIGQKIYIWNFT 1175
Query: 355 DNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPN 414
DNDL G+AFIDT++YI S+V+++N ++ D +SI LLR Q E +TL+ V++D P
Sbjct: 1176 DNDLVGMAFIDTQLYIHSLVTIRNFVIAADVCKSITLLRLQEETKTLAFVSKD-----PK 1230
Query: 415 SKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDK 474
+ YA + IDG +GF++SD +K
Sbjct: 1231 NLEVYAAD---FFIDG--------------------------------PQIGFLVSDVEK 1255
Query: 475 NVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPS-SISDAPGARSRFLTWY 533
N+VLF YQPEA ES GG RL+++ D ++G H+ +FF+I K S R LT +
Sbjct: 1256 NLVLFTYQPEAIESQGGQRLLQRADINVGTHITSFFRIAAKAHLKASGEKSKEMRQLTCF 1315
Query: 534 ASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGI 593
+LDGALG LP+ EK +RRL MLQ +V H GLNP+AFR + + NP R +
Sbjct: 1316 GTLDGALGLMLPMTEKTFRRLHMLQTKLVDCIPHVAGLNPKAFRMLQWRKRKLCNPHRNV 1375
Query: 594 IDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
+D L++K++ LS ER E+ +KIG+ I+D++ DIE + F
Sbjct: 1376 LDWQLLFKYMHLSFMERQEVARKIGTTPAQIMDDMMDIERACAQF 1420
>gi|390358535|ref|XP_789715.3| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Strongylocentrotus purpuratus]
Length = 1223
Score = 568 bits (1465), Expect = e-159, Method: Compositional matrix adjust.
Identities = 293/653 (44%), Positives = 402/653 (61%), Gaps = 76/653 (11%)
Query: 17 VQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRH----PKGALKLRFKKL-KVLFVSDRS 71
VQE+L V LG + +L + +++IY+AF + + L++RF+K+ + + +
Sbjct: 616 VQEVLLVGLGHDRKKIYMLALVEDDIMIYEAFPYNTVTQEHHLRVRFRKIPHKILMKPKK 675
Query: 72 KRANEQPGLPRGV-----------------RISQMRYFSNIAGYQGVFLCGPHPAWLFLT 114
R +++P G R++++R F N+ Y GVF+ G HP WLF+T
Sbjct: 676 TRTSKKPTAEGGTKPETETEAESDTKTTSRRVNRLREFHNVQTYSGVFISGSHPYWLFVT 735
Query: 115 SRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKV 174
SRG LR HPM +DG +S A FHNVNCP GFLYFN K ELRI VLP+HLSYDAPWPVRKV
Sbjct: 736 SRGALRTHPMPVDGAISCFASFHNVNCPNGFLYFNRKEELRICVLPSHLSYDAPWPVRKV 795
Query: 175 PLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDP--RDSRFIPPLVSQF 232
PL+CTPHF+AYH+ETKTY +VTS E T +K GE E+ +P RD RF+P F
Sbjct: 796 PLRCTPHFVAYHVETKTYAVVTSVQETKTHVWKVTGE--EIGEEPVERDDRFVPTTKVVF 853
Query: 233 HVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVT 292
+ LFSP SW+ IP T E+V CLK V++ EGT++G +GY+ + T + YSED+
Sbjct: 854 SIQLFSPVSWDAIPNTRIEYEAAENVTCLKVVNLSCEGTMTGKKGYVVVATTHVYSEDLQ 913
Query: 293 CRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQ 352
RG + ++D IEVVPEPGQPLTKNK+K +Y K QKGPV+A+C V GFL+T +GQK+Y+WQ
Sbjct: 914 TRGSVYIYDCIEVVPEPGQPLTKNKLKPLYEKRQKGPVSALCEVMGFLLTCIGQKVYMWQ 973
Query: 353 LKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQ 412
KDNDL G+AFIDT++YI + VSVK IL+ D + L+YQ + RTLSLV+RD +P
Sbjct: 974 FKDNDLIGLAFIDTQIYIHNAVSVKQFILITDVMKGAYFLQYQAQDRTLSLVSRDARP-- 1031
Query: 413 PNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEI--CKKIGSKHNDILDEFSSMGFMIS 470
LEI C + + + M F++S
Sbjct: 1032 --------------------------------LEIFGC--------EFMVDDKQMAFLVS 1051
Query: 471 DKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCK---PSS--ISDAPGA 525
D DKN+++F Y PEA ES+GG L+++ D ++G VNTF ++RC+ PS+ + P
Sbjct: 1052 DADKNLIVFHYHPEAPESHGGAYLLRRGDMNIGSAVNTFVRVRCRLTDPSTEQVLSGPVL 1111
Query: 526 RSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYY 585
R R + ++A+LDG+LG LP+ EK YRRLLMLQNV+ H GGLNP+++R K
Sbjct: 1112 R-RQVVFFATLDGSLGLLLPMVEKTYRRLLMLQNVLTNGLPHVGGLNPKSYRHVKSHMRN 1170
Query: 586 AGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
NP R I+DG L+ K+ LS+ ER E KKIG+ + I+ +L E L+ HF
Sbjct: 1171 LNNPHRNILDGDLLLKYCHLSVVERNEFAKKIGTSVDQIISDLMLAENLTMHF 1223
>gi|390347522|ref|XP_003726804.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Strongylocentrotus purpuratus]
Length = 1439
Score = 565 bits (1456), Expect = e-158, Method: Compositional matrix adjust.
Identities = 292/653 (44%), Positives = 402/653 (61%), Gaps = 76/653 (11%)
Query: 17 VQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRH----PKGALKLRFKKL-KVLFVSDRS 71
VQE+L V LG + +L + +++IY+AF + + L++RF+K+ + + +
Sbjct: 832 VQEVLLVGLGHDRKKIYMLALVEDDIMIYEAFPYNTVTQEHHLRVRFRKIPHKILMKPKK 891
Query: 72 KRANEQPGL-----------------PRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLT 114
R +++P + R++++R F N+ Y GVF+ G HP WLF+T
Sbjct: 892 TRTSKKPTAEGGTKTETETEAESDTKTQTRRVNRLREFHNVQTYSGVFISGSHPYWLFVT 951
Query: 115 SRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKV 174
SRG LR HPM +DG +S A FHNVNCP GFLYFN K ELRI VLP+HLSYDAPWPVRKV
Sbjct: 952 SRGALRTHPMPVDGAISCFASFHNVNCPNGFLYFNRKEELRICVLPSHLSYDAPWPVRKV 1011
Query: 175 PLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDP--RDSRFIPPLVSQF 232
PL+CTPHF+AYH+ETKTY +VTS E T +K GE E+ +P RD RF+P F
Sbjct: 1012 PLRCTPHFVAYHVETKTYAVVTSVQETKTHVWKVTGE--EIGEEPVERDDRFVPTTKVVF 1069
Query: 233 HVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVT 292
+ LFSP SW+ IP T E+V CLK V++ EGT++G +GY+ + T + YSED+
Sbjct: 1070 SIQLFSPVSWDAIPNTRIEYEAAENVTCLKVVNLSCEGTMTGKKGYVVVATTHVYSEDLQ 1129
Query: 293 CRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQ 352
RG + ++D IEVVPEPGQPLTKNK+K +Y K QKGPV+A+C V GFL+T +GQK+Y+WQ
Sbjct: 1130 TRGSVYIYDCIEVVPEPGQPLTKNKLKPLYEKRQKGPVSALCEVMGFLLTCIGQKVYMWQ 1189
Query: 353 LKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQ 412
KDNDL G+AFIDT++YI + VSVK IL+ D + L+YQ + RTLSLV+RD +P
Sbjct: 1190 FKDNDLIGLAFIDTQIYIHNAVSVKQFILITDVMKGAYFLQYQAQDRTLSLVSRDARP-- 1247
Query: 413 PNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEI--CKKIGSKHNDILDEFSSMGFMIS 470
LEI C + + + M F++S
Sbjct: 1248 --------------------------------LEIFGC--------EFMVDDKQMAFLVS 1267
Query: 471 DKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCK---PSS--ISDAPGA 525
D DKN+++F Y PEA ES+GG L+++ D ++G VNTF ++RC+ PS+ + P
Sbjct: 1268 DADKNLIVFHYHPEAPESHGGAYLLRRGDMNIGSAVNTFVRVRCRLTDPSTEQVLSGPVL 1327
Query: 526 RSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYY 585
R R + ++A+LDG+LG LP+ EK YRRLLMLQNV+ H GGLNP+++R K
Sbjct: 1328 R-RQVVFFATLDGSLGLLLPMVEKTYRRLLMLQNVLTNGLPHVGGLNPKSYRHVKSHMRN 1386
Query: 586 AGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
NP R I+DG L+ K+ LS+ ER E KKIG+ + I+ +L E L+ HF
Sbjct: 1387 LNNPHRNILDGDLLLKYCHLSVVERNEFAKKIGTSVDQIISDLMLAENLTMHF 1439
>gi|395740218|ref|XP_002819588.2| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
[Pongo abelii]
Length = 1388
Score = 555 bits (1429), Expect = e-155, Method: Compositional matrix adjust.
Identities = 286/640 (44%), Positives = 386/640 (60%), Gaps = 60/640 (9%)
Query: 16 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGALKLRFKKL--------- 62
+V+E+L V+LG +RP LLV ELLIY+AF H +G LK+RFKK+
Sbjct: 792 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKVPHNINFREK 851
Query: 63 -KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRA 121
+ E+ RG R+++ RYF +I GY GVF+CGP P WL +T RG LR
Sbjct: 852 KPKPSKKKAEGGSTEEGAGARG-RVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRL 910
Query: 122 HPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPH 181
HPM IDGPV + APFHNVNCPRGFLYFN + R+S P+ P P + L H
Sbjct: 911 HPMAIDGPVDSFAPFHNVNCPRGFLYFNRQEPQRLSGSPSRTXXXXPTPPGLLGLPG--H 968
Query: 182 FLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFS 241
+ + Y + TST P + GE+KE T RD R+I P F + L SP S
Sbjct: 969 WCVTPTNPQVYAVATSTNTPCARIPRMTGEEKEFETIERDERYIHPQQEAFSIQLISPVS 1028
Query: 242 WEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFD 301
WE IP L EWEHV C+K VS+ E T+SGL+GY+A GT E+VTCRGRIL+ D
Sbjct: 1029 WEAIPNARIELQEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMD 1088
Query: 302 IIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGI 361
+IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH G LV+A+GQKI++W L+ ++LTG+
Sbjct: 1089 VIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGM 1148
Query: 362 AFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAG 421
AFIDT++YI M+SVKN IL D +SI+LLRYQ E +TLSLV+RD KP + S +
Sbjct: 1149 AFIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVD 1208
Query: 422 NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMY 481
N + +GF++SD+D+N++++MY
Sbjct: 1209 N----------------------------------------AQLGFLVSDRDRNLMVYMY 1228
Query: 482 QPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGA---RSRFLTWYASLDG 538
PEA+ES GG RL+++ DFH+G HVNTF++ C+ ++ + + ++ +TW S+ G
Sbjct: 1229 LPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGAAEGLSKKSVVWENKHITWLVSVRG 1288
Query: 539 ALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSL 598
+G LP+ EK YRRLLMLQN + T H GLNPRAFR N R ++DG L
Sbjct: 1289 GIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRTLQNAVRNVLDGEL 1348
Query: 599 VWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
+ ++L LS ER E+ KKIG+ + ILD+L + + +++HF
Sbjct: 1349 LNRYLYLSTMERSELAKKIGTTPDIILDDLLETDRVTAHF 1388
>gi|241060959|ref|XP_002408050.1| cleavage and polyadenylation specificity factor, putative [Ixodes
scapularis]
gi|215492346|gb|EEC01987.1| cleavage and polyadenylation specificity factor, putative [Ixodes
scapularis]
Length = 1241
Score = 553 bits (1425), Expect = e-154, Method: Compositional matrix adjust.
Identities = 289/638 (45%), Positives = 390/638 (61%), Gaps = 82/638 (12%)
Query: 16 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAF----RHPKGALKLRFKKLKVLFVSDRS 71
+V E+L V LG+ +RPLLL R +LLIY+AF +G LKLRFKKL +
Sbjct: 645 VVHEILMVGLGVRQSRPLLLARVDEDLLIYEAFPFYETQREGHLKLRFKKLNHDIILRSR 704
Query: 72 KRANEQPGLPRGVRISQMRY----FSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTID 127
K ++P + Q R FS+I+GY GVFLCG P WLF++SRGELR HPM +D
Sbjct: 705 KYKTQKPENEEEEKAFQSRLWLQPFSDISGYSGVFLCGHRPHWLFMSSRGELRYHPMFVD 764
Query: 128 GPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVR---KVPLKCTPHFLA 184
GPV APFHNVNCP+GFL+FN +S+ +L ++ P P R ++ C H
Sbjct: 765 GPVYCFAPFHNVNCPKGFLHFNKQSDSYALLLHSYWLSQLPSPKRHGERLLFNCPSH--- 821
Query: 185 YHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDP------------RDSRFIPPLVSQF 232
K CI ++ + + + P DSR+I P + +F
Sbjct: 822 -----KKICI------HRCHFFALQQKAADFLWPPPFVTTVSPLPFVADSRYIFPTMDKF 870
Query: 233 HVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVT 292
+ L SP SWE IP T L EWEH+ C+KNV + EGT +G++GY+ALGTNY Y EDVT
Sbjct: 871 SLQLLSPVSWETIPNTRVDLDEWEHLTCIKNVMLSSEGTSTGMKGYLALGTNYCYGEDVT 930
Query: 293 CRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQ 352
RGRI + DII+VVPEPGQPLTKNKIK++Y+KEQKGPVTA+ V GFL++A+GQK+YIWQ
Sbjct: 931 SRGRITILDIIDVVPEPGQPLTKNKIKIVYSKEQKGPVTALSQVVGFLLSAIGQKMYIWQ 990
Query: 353 LKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQ 412
LKDN L G+AFIDT++YI S+V+VKNLILVGD +S++LLRYQ RTLSLV+RD +P +
Sbjct: 991 LKDNGLVGVAFIDTQIYIHSVVTVKNLILVGDVFKSVSLLRYQEASRTLSLVSRDVRPLE 1050
Query: 413 PNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDK 472
+ ++ N S M F+++D
Sbjct: 1051 VFAVEFFIDN----------------------------------------SQMSFLVTDS 1070
Query: 473 DKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISD-----APGARS 527
++N++L+MYQPE+RES GG RL+++ DFH+G V + F+I+C+ ++ A
Sbjct: 1071 ERNMILYMYQPESRESCGGQRLLRRGDFHIGSPVVSMFRIKCRMGEVAKHDRRLAASVDG 1130
Query: 528 RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAG 587
R +T A+LDG+LG+ LP+PEK YRRLLMLQNV+VT+ H GLNP+AFR Y + G
Sbjct: 1131 RHITMLATLDGSLGYVLPVPEKTYRRLLMLQNVLVTNMPHYAGLNPKAFRMYHSQRRVLG 1190
Query: 588 NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDIL 625
NP + I+DG L+WKF+ LS ER E+ KKIG+ ++
Sbjct: 1191 NPHKNILDGELIWKFMHLSFMERSELSKKIGTTVTQVV 1228
>gi|260835071|ref|XP_002612533.1| hypothetical protein BRAFLDRAFT_120973 [Branchiostoma floridae]
gi|229297910|gb|EEN68542.1| hypothetical protein BRAFLDRAFT_120973 [Branchiostoma floridae]
Length = 1003
Score = 550 bits (1416), Expect = e-153, Method: Compositional matrix adjust.
Identities = 286/638 (44%), Positives = 389/638 (60%), Gaps = 86/638 (13%)
Query: 17 VQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRH----PKGALKLRFKKLK-VLFVSDR- 70
V+E+L V LG G+RP LL R +LLIY+AF + LK+RFKK++ L + +R
Sbjct: 436 VKEILMVGLGHKGSRPHLLARVDEDLLIYEAFPYHLSPSYTMLKIRFKKVQHNLILRERK 495
Query: 71 ---SKRA--NEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMT 125
+K+A E+ G RI R F++I+GY G+F+CG P WLF+TSRG LR HPM+
Sbjct: 496 GGKTKKAGDQEESDGQTGSRIQHFRTFTDISGYSGLFICGSSPHWLFMTSRGALRIHPMS 555
Query: 126 IDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAY 185
IDG V+ +PFHNVNCP+GFLYFN ELRISVLPTHLSYDAPWPVRKVPL+CTPHF+AY
Sbjct: 556 IDGAVTCFSPFHNVNCPKGFLYFNRGGELRISVLPTHLSYDAPWPVRKVPLRCTPHFVAY 615
Query: 186 HLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEI 245
H+E K Y + ST E + G++KE +D R+I P++ +F++ L SP SWE I
Sbjct: 616 HMECKVYAVAASTFEMCNRIPRMAGDEKEYDAVEKDDRYIYPMLDKFNIQLMSPVSWEII 675
Query: 246 PQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEV 305
P T M+ E + +G N+ + G+I++ D+IEV
Sbjct: 676 PNTR---------------GMQLEENYAECTCSFLVGINFV----LFVAGQIVILDVIEV 716
Query: 306 VPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFID 365
VPEPGQPLTKNKIK +Y KEQKGPV+A+C G+L++A+GQKI++W+ ++NDL G+AFID
Sbjct: 717 VPEPGQPLTKNKIKELYGKEQKGPVSALCGCNGYLLSAIGQKIFLWEFRNNDLIGVAFID 776
Query: 366 TEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSR 425
T+VYI + +S+KN +++ D +SI+LLRYQ D +P + ++ N
Sbjct: 777 TQVYIHTAISIKNYVILADVFKSISLLRYQ-----------DMRPLETYCVEFFVDN--- 822
Query: 426 GIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEA 485
+ +GF++SD KN +L+ YQPEA
Sbjct: 823 -------------------------------------AQIGFLVSDAQKNFLLYSYQPEA 845
Query: 486 RESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSIS-----DAPGARSRFLTWYASLDGAL 540
RES GG RL+++ DF++G HVNTFF++RCK S DA R +T +A+LDG L
Sbjct: 846 RESYGGQRLVRRADFNVGSHVNTFFRVRCKIMDPSGERRRDADTVAKRHVTMFATLDGGL 905
Query: 541 GFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVW 600
G LP+ EK YRRLLMLQN ++TH GLNP+AFR K N R I+DG L+W
Sbjct: 906 GALLPMAEKTYRRLLMLQNTLMTHMPFPAGLNPKAFRMLKHNHRSLINACRNILDGELLW 965
Query: 601 KFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
KFL LS+ ER E+ +KIG+ I ++L DI+ LS+HF
Sbjct: 966 KFLHLSVVERSELARKIGTSPETITEDLMDIDRLSAHF 1003
>gi|384946686|gb|AFI36948.1| cleavage and polyadenylation specificity factor subunit 1 [Macaca
mulatta]
Length = 1428
Score = 546 bits (1407), Expect = e-152, Method: Compositional matrix adjust.
Identities = 286/637 (44%), Positives = 379/637 (59%), Gaps = 66/637 (10%)
Query: 16 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGALKLRFKKL--------- 62
+V+E+L V+LG +RP LLV ELLIY+AF H +G LK+RFKK+
Sbjct: 844 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKVPHNINFREK 903
Query: 63 -KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRA 121
E+ RG R+++ RYF +I GY GVF+CGP P WL +T RG LR
Sbjct: 904 KPKPSKKKAEGGGTEEGAGARG-RVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRL 962
Query: 122 HPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPH 181
HPM IDGPV + APFHNVNCPRGFLYFN + ELRISVLP +LSYDAPWPVRK+PL+CT H
Sbjct: 963 HPMAIDGPVDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAH 1022
Query: 182 FLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFS 241
++AYH+E+K Y + TST P + GE+KE T RD R+I P F + L SP S
Sbjct: 1023 YVAYHVESKVYAVATSTNTPCARIPRMTGEEKEFETIERDERYIHPQQEAFSIQLISPVS 1082
Query: 242 WEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFD 301
WE IP L EWEHV C+K VS+ E T+SGL+GY+A GT E+VTCRGRIL+ D
Sbjct: 1083 WEAIPNARIELQEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMD 1142
Query: 302 IIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGI 361
+IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH G LV+A+GQKI++W L+ ++LTG+
Sbjct: 1143 VIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGM 1202
Query: 362 AFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAG 421
AFIDT++YI M+SVKN IL D +SI+LLRYQ E +TLSLV+RD KP + S +
Sbjct: 1203 AFIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVD 1262
Query: 422 NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMY 481
N + +GF++SD+D+N++++MY
Sbjct: 1263 N----------------------------------------AQLGFLVSDRDRNLMVYMY 1282
Query: 482 QPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALG 541
PEA+ES GG RL+++ DFH+G HVNTF++ C+ ++ G + + W G
Sbjct: 1283 LPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGAT----EGLSKKSVVWENKHITWFG 1338
Query: 542 FFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWK 601
LP H S P + DG L+ +
Sbjct: 1339 EDLPA-------AADAAERADHHASAPRRPQPPCLPDAARGPPHPPECCAQRADGELLNR 1391
Query: 602 FLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
+L LS ER E+ KKIG+ + ILD+L + + +++HF
Sbjct: 1392 YLYLSTMERSELAKKIGTTPDIILDDLLETDRVTAHF 1428
>gi|321475208|gb|EFX86171.1| hypothetical protein DAPPUDRAFT_313209 [Daphnia pulex]
Length = 1260
Score = 541 bits (1393), Expect = e-151, Method: Compositional matrix adjust.
Identities = 265/518 (51%), Positives = 344/518 (66%), Gaps = 54/518 (10%)
Query: 9 PSAMDETIVQELLTVSLGLHGNRPLLLVRTQH-ELLIYQAF-------RHPKGALKLRFK 60
PS+ IV E+ LG RPLL++RT +L+Y+A K LK+RF+
Sbjct: 784 PSSTHCNIV-EMGIFGLGHLHRRPLLMIRTSDFGVLLYEAIPALPVYDSKQKNELKIRFR 842
Query: 61 KLKVLFVSDRSKRANEQPGL-----PRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTS 115
KL + +K + G P + +Q +YFSNIAGY GVF+ GP+P WLF+TS
Sbjct: 843 KLNHSLLLRETKTYVRKGGQSVVLEPYAWKTNQFKYFSNIAGYTGVFIGGPYPHWLFMTS 902
Query: 116 RGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVP 175
RGELR HPM+IDG + A FHNVNC +GF+Y N K ELRI +LPT +YDAPWPVRKVP
Sbjct: 903 RGELRLHPMSIDGSIKCFACFHNVNCAQGFIYLNRKDELRICLLPTLFNYDAPWPVRKVP 962
Query: 176 LKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVS 235
L+CTPH+L YH+ETKTY + TS AEP+ Y+FNG+DKEL + RD RF P V +F +
Sbjct: 963 LRCTPHYLIYHVETKTYILATSLAEPTNRIYRFNGDDKELSLEERDDRFPYPHVEKFAIQ 1022
Query: 236 LFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRG 295
L SP +WE +P T L +WEHV CLK VS+EYEG SGL+ Y+A+ TNYNY ED+ RG
Sbjct: 1023 LISPVTWEAVPNTRMDLDDWEHVTCLKTVSLEYEGHASGLKDYLAVSTNYNYGEDIISRG 1082
Query: 296 RILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKD 355
RI + D+IEVVPEPGQPLTKNKIK +YAK+QKGPV AI V G+LV A+GQKIY+WQLK+
Sbjct: 1083 RIFILDLIEVVPEPGQPLTKNKIKTLYAKDQKGPVAAISSVCGYLVAAIGQKIYLWQLKN 1142
Query: 356 NDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNS 415
+DL GIAFIDTE+YI ++++K+ IL D +S+++LR+Q EYRTL +VARDY+P + +
Sbjct: 1143 DDLVGIAFIDTEIYIHQLLNIKSFILAADVYKSVSILRFQEEYRTLCIVARDYQPLEVMA 1202
Query: 416 KGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKN 475
YY N + +GF++SD +KN
Sbjct: 1203 VDYYIDN----------------------------------------TQLGFLVSDAEKN 1222
Query: 476 VVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIR 513
++L+MYQPEARES GGHRLI+K DFH+GQ V+T F+I+
Sbjct: 1223 LILYMYQPEARESQGGHRLIRKADFHVGQVVSTMFRIK 1260
>gi|348555856|ref|XP_003463739.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
isoform 2 [Cavia porcellus]
Length = 1387
Score = 525 bits (1351), Expect = e-146, Method: Compositional matrix adjust.
Identities = 276/643 (42%), Positives = 370/643 (57%), Gaps = 115/643 (17%)
Query: 16 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGALKLRFKKL--------- 62
+V+E+L V+LG +RP LLV ELLIY+AF H +G LK+RFKK+
Sbjct: 840 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKVPHNINFREK 899
Query: 63 --KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELR 120
K +E G+ RG R+++ RYF +I GY GVF+CGP P WL +T RG LR
Sbjct: 900 KPKPSKKKAEGGSTDEGSGV-RG-RVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALR 957
Query: 121 AHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTP 180
HPM IDGP+ + APFHNVNCPRGFLYFN + ELRISVLP +LSYDAPWPVRK+PL+CT
Sbjct: 958 LHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTA 1017
Query: 181 HFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPF 240
H++AYH+E+K Y + TST+ P T + GE+KE RD R+I P F + L SP
Sbjct: 1018 HYVAYHVESKVYAVATSTSTPCTRIPRMTGEEKEFEAIERDDRYIHPQQEAFSIQLISPV 1077
Query: 241 SWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLF 300
SWE IP L EWEHV C+K VS+ E T+SGL+GY+A GT E+VTCRGRI L
Sbjct: 1078 SWEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRIFL- 1136
Query: 301 DIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTG 360
W L+ ++LTG
Sbjct: 1137 --------------------------------------------------WSLRASELTG 1146
Query: 361 IAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYA 420
+AFIDT++YI M+SVKN IL D +SI+LLRYQ E +TLSLV+RD KP + S +
Sbjct: 1147 MAFIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMV 1206
Query: 421 GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFM 480
N + +GF++SD+D+N++++M
Sbjct: 1207 DN----------------------------------------AQLGFLVSDRDRNLMVYM 1226
Query: 481 YQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARS-----RFLTWYAS 535
Y PEA+ES GG RL+++ DFH+G HVNTF++ C+ ++ P +S + +TW+A+
Sbjct: 1227 YLPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCR--GATEGPSKKSVVWENKHITWFAT 1284
Query: 536 LDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIID 595
LDG +G LP+ EK YRRLLMLQN + T H GLNPRAFR N R ++D
Sbjct: 1285 LDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLD 1344
Query: 596 GSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
G L+ ++L LS ER E+ KKIG+ + ILD+L + + +++HF
Sbjct: 1345 GELLNRYLYLSTMERGELAKKIGTTPDIILDDLLETDRVTAHF 1387
>gi|354491124|ref|XP_003507706.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
isoform 2 [Cricetulus griseus]
Length = 1388
Score = 524 bits (1350), Expect = e-146, Method: Compositional matrix adjust.
Identities = 273/641 (42%), Positives = 370/641 (57%), Gaps = 111/641 (17%)
Query: 16 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGALKLRFKKL--------- 62
+V+E+L V+LG +RP LLV ELLIY+AF H +G LK+RFKK+
Sbjct: 841 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKVPHNINFREK 900
Query: 63 KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAH 122
K +++ + + G R+++ RYF +I GY GVF+CGP P WL +T RG LR H
Sbjct: 901 KPKPSKKKAEGCSTEEGSGVRGRVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRLH 960
Query: 123 PMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHF 182
PM IDGP+ + APFHNVNCPRGFLYFN + ELRISVLP +LSYDAPWPVRK+PL+CT H+
Sbjct: 961 PMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHY 1020
Query: 183 LAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSW 242
+AYH+E+K Y + TST P T + GE+KE RD R+I P F + L SP SW
Sbjct: 1021 VAYHVESKVYAVATSTNTPCTRIPRMTGEEKEFEAIERDDRYIHPQQEAFSIQLISPVSW 1080
Query: 243 EEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDI 302
E IP L EWEHV C+K VS+ E T+SGL+GY+A GT E+VTCRGRI L
Sbjct: 1081 EAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRIFL--- 1137
Query: 303 IEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIA 362
W L+ ++LTG+A
Sbjct: 1138 ------------------------------------------------WSLRASELTGMA 1149
Query: 363 FIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGN 422
FIDT++YI M+SVKN IL D +SI+LLRYQ E +TLSLV+RD KP + S + N
Sbjct: 1150 FIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDN 1209
Query: 423 PSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQ 482
+ +GF++SD+D+N++++MY
Sbjct: 1210 ----------------------------------------AQLGFLVSDRDRNLMVYMYL 1229
Query: 483 PEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARS-----RFLTWYASLD 537
PEA+ES GG RL+++ DFH+G HVNTF++ C+ + ++ P +S + +TW+A+LD
Sbjct: 1230 PEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGA--AEGPSKKSVMWENKHITWFATLD 1287
Query: 538 GALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGS 597
G +G LP+ EK YRRLLMLQN + T H GLNPRAFR N R ++DG
Sbjct: 1288 GGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGE 1347
Query: 598 LVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
L+ ++L LS ER E+ KKIG+ + ILD+L + + +++HF
Sbjct: 1348 LLNRYLYLSTMERSELAKKIGTTPDIILDDLLETDRVTAHF 1388
>gi|148697644|gb|EDL29591.1| cleavage and polyadenylation specific factor 1, isoform CRA_c [Mus
musculus]
Length = 1388
Score = 524 bits (1349), Expect = e-146, Method: Compositional matrix adjust.
Identities = 273/641 (42%), Positives = 370/641 (57%), Gaps = 111/641 (17%)
Query: 16 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGALKLRFKKL--------- 62
+V+E+L V+LG +RP LLV ELLIY+AF H +G LK+RFKK+
Sbjct: 841 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKVPHNINFREK 900
Query: 63 KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAH 122
K +++ + + G R+++ RYF +I GY GVF+CGP P WL +T RG LR H
Sbjct: 901 KPKPSKKKAEGCSTEEGSGGRGRVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRLH 960
Query: 123 PMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHF 182
PM IDGP+ + APFHNVNCPRGFLYFN + ELRISVLP +LSYDAPWPVRK+PL+CT H+
Sbjct: 961 PMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHY 1020
Query: 183 LAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSW 242
+AYH+E+K Y + TST P T + GE+KE RD R+I P F + L SP SW
Sbjct: 1021 VAYHVESKVYAVATSTNTPCTRIPRMTGEEKEFEAIERDDRYIHPQQEAFSIQLISPVSW 1080
Query: 243 EEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDI 302
E IP L EWEHV C+K VS+ E T+SGL+GY+A GT E+VTCRGRI L
Sbjct: 1081 EAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRIFL--- 1137
Query: 303 IEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIA 362
W L+ ++LTG+A
Sbjct: 1138 ------------------------------------------------WSLRASELTGMA 1149
Query: 363 FIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGN 422
FIDT++YI M+SVKN IL D +SI+LLRYQ E +TLSLV+RD KP + S + N
Sbjct: 1150 FIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDN 1209
Query: 423 PSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQ 482
+ +GF++SD+D+N++++MY
Sbjct: 1210 ----------------------------------------AQLGFLVSDRDRNLMVYMYL 1229
Query: 483 PEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARS-----RFLTWYASLD 537
PEA+ES GG RL+++ DFH+G HVNTF++ C+ + ++ P +S + +TW+A+LD
Sbjct: 1230 PEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGA--AEGPSKKSVVWENKHITWFATLD 1287
Query: 538 GALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGS 597
G +G LP+ EK YRRLLMLQN + T H GLNPRAFR N R ++DG
Sbjct: 1288 GGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGE 1347
Query: 598 LVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
L+ ++L LS ER E+ KKIG+ + ILD+L + + +++HF
Sbjct: 1348 LLNRYLYLSTMERSELAKKIGTTPDIILDDLLETDRVTAHF 1388
>gi|194474008|ref|NP_001124043.1| cleavage and polyadenylation specificity factor subunit 1 [Rattus
norvegicus]
gi|149066087|gb|EDM15960.1| cleavage and polyadenylation specific factor 1, 160kDa (predicted),
isoform CRA_a [Rattus norvegicus]
Length = 1386
Score = 524 bits (1349), Expect = e-146, Method: Compositional matrix adjust.
Identities = 273/641 (42%), Positives = 370/641 (57%), Gaps = 111/641 (17%)
Query: 16 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGALKLRFKKL--------- 62
+V+E+L V+LG +RP LLV ELLIY+AF H +G LK+RFKK+
Sbjct: 839 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKVPHNINFREK 898
Query: 63 KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAH 122
K +++ + + G R+++ RYF +I GY GVF+CGP P WL +T RG LR H
Sbjct: 899 KPKPSKKKAEGCSTEEGSGVRGRVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRLH 958
Query: 123 PMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHF 182
PM IDGP+ + APFHNVNCPRGFLYFN + ELRISVLP +LSYDAPWPVRK+PL+CT H+
Sbjct: 959 PMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHY 1018
Query: 183 LAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSW 242
+AYH+E+K Y + TST P T + GE+KE RD R+I P F + L SP SW
Sbjct: 1019 VAYHVESKVYAVATSTNTPCTRIPRMTGEEKEFEAIERDDRYIHPQQEAFSIQLISPVSW 1078
Query: 243 EEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDI 302
E IP L EWEHV C+K VS+ E T+SGL+GY+A GT E+VTCRGRI L
Sbjct: 1079 EAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRIFL--- 1135
Query: 303 IEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIA 362
W L+ ++LTG+A
Sbjct: 1136 ------------------------------------------------WSLRASELTGMA 1147
Query: 363 FIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGN 422
FIDT++YI M+SVKN IL D +SI+LLRYQ E +TLSLV+RD KP + S + N
Sbjct: 1148 FIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDN 1207
Query: 423 PSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQ 482
+ +GF++SD+D+N++++MY
Sbjct: 1208 ----------------------------------------AQLGFLVSDRDRNLMVYMYL 1227
Query: 483 PEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARS-----RFLTWYASLD 537
PEA+ES GG RL+++ DFH+G HVNTF++ C+ + ++ P +S + +TW+A+LD
Sbjct: 1228 PEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGA--AEGPSKKSVMWENKHITWFATLD 1285
Query: 538 GALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGS 597
G +G LP+ EK YRRLLMLQN + T H GLNPRAFR N R ++DG
Sbjct: 1286 GGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGE 1345
Query: 598 LVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
L+ ++L LS ER E+ KKIG+ + ILD+L + + +++HF
Sbjct: 1346 LLNRYLYLSTMERSELAKKIGTTPDIILDDLLETDRVTAHF 1386
>gi|148697642|gb|EDL29589.1| cleavage and polyadenylation specific factor 1, isoform CRA_a [Mus
musculus]
Length = 1417
Score = 523 bits (1348), Expect = e-145, Method: Compositional matrix adjust.
Identities = 273/641 (42%), Positives = 369/641 (57%), Gaps = 111/641 (17%)
Query: 16 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGALKLRFKKL--------- 62
+V+E+L V+LG +RP LLV ELLIY+AF H +G LK+RFKK+
Sbjct: 870 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKVPHNINFREK 929
Query: 63 KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAH 122
K +++ + + G R+++ RYF +I GY GVF+CGP P WL +T RG LR H
Sbjct: 930 KPKPSKKKAEGCSTEEGSGGRGRVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRLH 989
Query: 123 PMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHF 182
PM IDGP+ + APFHNVNCPRGFLYFN + ELRISVLP +LSYDAPWPVRK+PL+CT H+
Sbjct: 990 PMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHY 1049
Query: 183 LAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSW 242
+AYH+E+K Y + TST P T + GE+KE RD R+I P F + L SP SW
Sbjct: 1050 VAYHVESKVYAVATSTNTPCTRIPRMTGEEKEFEAIERDDRYIHPQQEAFSIQLISPVSW 1109
Query: 243 EEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDI 302
E IP L EWEHV C+K VS+ E T+SGL+GY+A GT E+VTCRGRI L
Sbjct: 1110 EAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRIFL--- 1166
Query: 303 IEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIA 362
W L+ ++LTG+A
Sbjct: 1167 ------------------------------------------------WSLRASELTGMA 1178
Query: 363 FIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGN 422
FIDT++YI M+SVKN IL D +SI+LLRYQ E +TLSLV+RD KP + S + N
Sbjct: 1179 FIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDN 1238
Query: 423 PSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQ 482
+ +GF++SD+D+N++++MY
Sbjct: 1239 ----------------------------------------AQLGFLVSDRDRNLMVYMYL 1258
Query: 483 PEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARS-----RFLTWYASLD 537
PEA+ES GG RL+++ DFH+G HVNTF++ C+ ++ P +S + +TW+A+LD
Sbjct: 1259 PEAKESFGGMRLLRRADFHVGAHVNTFWRTPCR--GAAEGPSKKSVVWENKHITWFATLD 1316
Query: 538 GALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGS 597
G +G LP+ EK YRRLLMLQN + T H GLNPRAFR N R ++DG
Sbjct: 1317 GGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGE 1376
Query: 598 LVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
L+ ++L LS ER E+ KKIG+ + ILD+L + + +++HF
Sbjct: 1377 LLNRYLYLSTMERSELAKKIGTTPDIILDDLLETDRVTAHF 1417
>gi|148697643|gb|EDL29590.1| cleavage and polyadenylation specific factor 1, isoform CRA_b [Mus
musculus]
Length = 1311
Score = 523 bits (1348), Expect = e-145, Method: Compositional matrix adjust.
Identities = 273/641 (42%), Positives = 370/641 (57%), Gaps = 111/641 (17%)
Query: 16 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGALKLRFKKL--------- 62
+V+E+L V+LG +RP LLV ELLIY+AF H +G LK+RFKK+
Sbjct: 764 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKVPHNINFREK 823
Query: 63 KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAH 122
K +++ + + G R+++ RYF +I GY GVF+CGP P WL +T RG LR H
Sbjct: 824 KPKPSKKKAEGCSTEEGSGGRGRVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRLH 883
Query: 123 PMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHF 182
PM IDGP+ + APFHNVNCPRGFLYFN + ELRISVLP +LSYDAPWPVRK+PL+CT H+
Sbjct: 884 PMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHY 943
Query: 183 LAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSW 242
+AYH+E+K Y + TST P T + GE+KE RD R+I P F + L SP SW
Sbjct: 944 VAYHVESKVYAVATSTNTPCTRIPRMTGEEKEFEAIERDDRYIHPQQEAFSIQLISPVSW 1003
Query: 243 EEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDI 302
E IP L EWEHV C+K VS+ E T+SGL+GY+A GT E+VTCRGRI L
Sbjct: 1004 EAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRIFL--- 1060
Query: 303 IEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIA 362
W L+ ++LTG+A
Sbjct: 1061 ------------------------------------------------WSLRASELTGMA 1072
Query: 363 FIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGN 422
FIDT++YI M+SVKN IL D +SI+LLRYQ E +TLSLV+RD KP + S + N
Sbjct: 1073 FIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDN 1132
Query: 423 PSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQ 482
+ +GF++SD+D+N++++MY
Sbjct: 1133 ----------------------------------------AQLGFLVSDRDRNLMVYMYL 1152
Query: 483 PEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARS-----RFLTWYASLD 537
PEA+ES GG RL+++ DFH+G HVNTF++ C+ + ++ P +S + +TW+A+LD
Sbjct: 1153 PEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGA--AEGPSKKSVVWENKHITWFATLD 1210
Query: 538 GALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGS 597
G +G LP+ EK YRRLLMLQN + T H GLNPRAFR N R ++DG
Sbjct: 1211 GGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGE 1270
Query: 598 LVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
L+ ++L LS ER E+ KKIG+ + ILD+L + + +++HF
Sbjct: 1271 LLNRYLYLSTMERSELAKKIGTTPDIILDDLLETDRVTAHF 1311
>gi|340371789|ref|XP_003384427.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Amphimedon queenslandica]
Length = 1408
Score = 516 bits (1328), Expect = e-143, Method: Compositional matrix adjust.
Identities = 269/637 (42%), Positives = 386/637 (60%), Gaps = 56/637 (8%)
Query: 17 VQELLTVSLGLHGNRPLLLVRTQHELLIYQAFR-----HPKGALKLRFKKLK--VLFVSD 69
V+++L V +GL+G +P ++ EL+IY+AF+ HP G LKLRF K++ V+
Sbjct: 813 VEQVLCVGMGLNGKKPHIMAFINKELVIYEAFQYTSAIHP-GHLKLRFSKVQHNVILQDK 871
Query: 70 RSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGP 129
R + + +R FSNIAGY GVF+CGP+P W+F+ +RG L HPM IDGP
Sbjct: 872 RVGKLAKHFQQQEFSFPPHLRKFSNIAGYSGVFVCGPYPHWIFMAARGHLSIHPMYIDGP 931
Query: 130 VSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLET 189
V + APF NVNCP GFLYFN +SELRISVLPT LSYD+ WPVRKVPLK TPHF+ YH+E+
Sbjct: 932 VQSFAPFDNVNCPSGFLYFNKESELRISVLPTQLSYDSYWPVRKVPLKATPHFVGYHMES 991
Query: 190 KTYCIVTSTAEPSTDYYKFNGEDKE-LVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQT 248
K + I+ ST +P T NGE ++ L T RD RF+ +++ L SP SWE IP +
Sbjct: 992 KVHVIIASTPQPVTVIPDPNGETEDALETVERDGRFVYSQEETYYLQLLSPTSWETIPHS 1051
Query: 249 NFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPE 308
+ + HV +K + + + TLSG + YI +GT + E+++ +G++L+FD+ V+PE
Sbjct: 1052 KYEMEAHYHVTDMKVMRLRSQETLSGRKEYIVVGTMATFGEELSAKGKVLIFDVSVVIPE 1111
Query: 309 PGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDN-DLTGIAFIDTE 367
PG+P ++ ++K +Y +EQK PVT + V G ++TA+GQKI++WQ KDN DL +AFID E
Sbjct: 1112 PGKPFSQYRLKNLYDQEQKWPVTGLECVNGLILTAMGQKIFMWQFKDNKDLLAVAFIDAE 1171
Query: 368 VYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGI 427
YI + S+K IL GD RSI LL Y + R+LSL+++D P + S + +
Sbjct: 1172 TYIHTAQSIKGFILTGDVTRSIQLLHYNEDRRSLSLISQDPNPMEVFSTTF--------M 1223
Query: 428 IDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARE 487
IDG ++GF++SD D+N+ LF YQPE
Sbjct: 1224 IDG--------------------------------KALGFLVSDSDRNITLFQYQPENPA 1251
Query: 488 SNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPG------ARSRFLTWYASLDGALG 541
S+GG L++ D H+G VN F IRCK S+ A A R T++ +LDG +G
Sbjct: 1252 SSGGANLVRCGDIHVGSLVNVFLNIRCKTSAGLGASREMKIALADKRQCTFFGTLDGGIG 1311
Query: 542 FFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWK 601
LP+PEK YRRL MLQ M H GLNP+AFRT++ + Y N R I+DG+L+++
Sbjct: 1312 CLLPIPEKVYRRLSMLQVKMTQGMRHMAGLNPKAFRTFQTRHQYLHNAQRNILDGTLLYQ 1371
Query: 602 FLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
+L L+ E+ + K+IG+ I+++L +I+ + SHF
Sbjct: 1372 YLSLTAKEKFDFSKQIGTTVAQIMEDLKEIDKVMSHF 1408
>gi|198415711|ref|XP_002123169.1| PREDICTED: similar to cleavage and polyadenylation specificity factor
1, partial [Ciona intestinalis]
Length = 1370
Score = 473 bits (1218), Expect = e-130, Method: Compositional matrix adjust.
Identities = 246/587 (41%), Positives = 353/587 (60%), Gaps = 57/587 (9%)
Query: 5 RSHSPSAMDETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPK-------GALKL 57
+S S D+ + E+L V LG + P L+ R + E+LIY+ F+ +L++
Sbjct: 827 KSTSTRYSDKPRIFEILLVGLGYKNSSPHLIARIEEEILIYEVFKFSAPEKFKKYNSLQI 886
Query: 58 RFKKLKVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRG 117
RFKK+ + R+ +E R + +R FSNI GY GVFLCGP+P W+F+T RG
Sbjct: 887 RFKKVNHSMMIRRAPVTHETKTDQLEHR-NCLRTFSNIGGYSGVFLCGPYPYWIFVTIRG 945
Query: 118 ELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLK 177
L HPM++DG VS PFHNVNCP GFLYFN++ ELRI +LP H+ YD WP+RK+ L+
Sbjct: 946 ALCCHPMSVDGSVSCFVPFHNVNCPNGFLYFNSQGELRICMLPPHMKYDTAWPMRKITLR 1005
Query: 178 CTPHFLAYHLETKTYCIVTSTAEPSTD--YYKFNGEDKELVTDPRDSRFIPPLVSQFHVS 235
C+ HFLAY +E K Y +VTS +EP T Y F E +E + RFI P + +F V
Sbjct: 1006 CSVHFLAYSIEHKVYALVTSVSEPCTRLPYLTFENE-REFEDLEKGDRFIYPHIDKFSVQ 1064
Query: 236 LFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRG 295
L SP SW+ +P + E+EH+ C+KNV + S + ++ LGT + E+++ RG
Sbjct: 1065 LISPASWDLVPNARLDMGEFEHITCMKNVWLSCGQDSSARQNFLVLGTVNVFGEEMSSRG 1124
Query: 296 RILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKD 355
+I++ ++IEVVPEPGQPLTKNK+K IY++EQKGPVTA+C + G L+TA+GQKI+IW+ +
Sbjct: 1125 KIIILEVIEVVPEPGQPLTKNKLKQIYSEEQKGPVTAVCGLEGNLLTAIGQKIFIWRFDE 1184
Query: 356 ND-LTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPN 414
N L G+AF+DT VYI +S ++ LVGD RSI LLRYQ +++TLS+ +RD +P +
Sbjct: 1185 NQSLRGLAFVDTNVYIHHALSFRSFALVGDIQRSITLLRYQTDFKTLSVTSRDVRPLE-- 1242
Query: 415 SKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDK 474
Y + ++DG + + F++SD +K
Sbjct: 1243 ---VYTADL---VVDG--------------------------------TGINFLVSDHEK 1264
Query: 475 NVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC----KPSSISDAPGARSRFL 530
N+VLF Y PE ES+GG RL K+ D H+G N +++ + + + + P A +
Sbjct: 1265 NLVLFAYDPEDHESHGGSRLTKRADMHIGSRANCMWRVAACGVDRSTGLPNQPYA-GVHI 1323
Query: 531 TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFR 577
T +LDG++ LP+ EK YRRLLMLQN+M+T H GLNP+AFR
Sbjct: 1324 TMMGTLDGSICHVLPVAEKVYRRLLMLQNIMITGLQHIAGLNPKAFR 1370
>gi|291232722|ref|XP_002736302.1| PREDICTED: cleavage and polyadenylation specific factor 1-like
[Saccoglossus kowalevskii]
Length = 984
Score = 441 bits (1135), Expect = e-121, Method: Compositional matrix adjust.
Identities = 205/396 (51%), Positives = 267/396 (67%), Gaps = 43/396 (10%)
Query: 16 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGAL----KLRFKKLKVLFVSDRS 71
IV+ELL + LG + LL R +L IY+AF H + +L +LRF+K
Sbjct: 473 IVKELLLIGLGHKNKKTHLLARVDEDLYIYEAFTHDQSSLDNHLRLRFRK---------- 522
Query: 72 KRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVS 131
VF+CGP+P WLF+TSRG LR+HPM IDG V+
Sbjct: 523 -----------------------------VFVCGPYPHWLFMTSRGALRSHPMHIDGSVT 553
Query: 132 TLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKT 191
APFHN+NCP+GFLYFN ELRI VLPTHLSYDA WPVRKVPL+CTPHF++YH+E+KT
Sbjct: 554 CFAPFHNINCPKGFLYFNKHGELRICVLPTHLSYDALWPVRKVPLRCTPHFISYHIESKT 613
Query: 192 YCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFP 251
Y +VTS +EP K G+DKE RD RFI P + +F + LFSP SWE IP T
Sbjct: 614 YAVVTSVSEPCLRICKMTGDDKEFEDVERDDRFIFPTIEKFSLQLFSPLSWEAIPNTKID 673
Query: 252 LHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQ 311
+WEH+ LK V ++ EGT+SGL+G+IA+ T Y E+VTCRGRIL+FD+IEVVPEPGQ
Sbjct: 674 TEDWEHITGLKTVFLKSEGTVSGLKGFIAVSTTIVYGEEVTCRGRILIFDVIEVVPEPGQ 733
Query: 312 PLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIA 371
PLTKNK+K++Y KEQKGPVT +C + G L A+GQKI++W ++NDL G+AFIDT+++I
Sbjct: 734 PLTKNKLKLLYDKEQKGPVTTLCDIEGLLAAAIGQKIFLWAFRNNDLIGVAFIDTQIHIH 793
Query: 372 SMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARD 407
++ ++KN IL D +S++LLR+ E R+LSLV R+
Sbjct: 794 TLCTIKNFILAADIRKSVSLLRFSDEDRSLSLVTRE 829
Score = 155 bits (392), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 82/187 (43%), Positives = 117/187 (62%), Gaps = 13/187 (6%)
Query: 457 DILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKP 516
DI S + F SD+D+++ L RES GG RL+++ DF+ G HV +FF++R K
Sbjct: 806 DIRKSVSLLRF--SDEDRSLSLV-----TRESFGGQRLLRRADFNAGSHVCSFFRMRSKL 858
Query: 517 SS-----ISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGL 571
S + P R R +T +A+LDG++G+ +P+ EK YRRLLMLQN + T T HT GL
Sbjct: 859 SDPATEKLLTGPMER-RHVTMFATLDGSIGYLIPMTEKTYRRLLMLQNALTTQTLHTAGL 917
Query: 572 NPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI 631
NP+ FR K + N + I+DG L+WK+ LS+ ER E+ KKIG+ ILD+L D+
Sbjct: 918 NPKGFRMVKHQTKSLENTHKNILDGDLLWKYTFLSVNERTELAKKIGTSVEQILDDLMDV 977
Query: 632 EALSSHF 638
E L++HF
Sbjct: 978 ERLTAHF 984
Score = 44.3 bits (103), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 19/41 (46%), Positives = 27/41 (65%)
Query: 422 NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEF 462
N + I+DG L+WK+ LS+ ER E+ KKIG+ ILD+
Sbjct: 934 NTHKNILDGDLLWKYTFLSVNERTELAKKIGTSVEQILDDL 974
>gi|339253000|ref|XP_003371723.1| cleavage and polyadenylation specificity factor subunit 1
[Trichinella spiralis]
gi|316967988|gb|EFV52332.1| cleavage and polyadenylation specificity factor subunit 1
[Trichinella spiralis]
Length = 1376
Score = 403 bits (1035), Expect = e-109, Method: Compositional matrix adjust.
Identities = 226/659 (34%), Positives = 348/659 (52%), Gaps = 98/659 (14%)
Query: 30 NRPLLLVRTQHELLIYQAFRHP----KGALKLRFKKLKVLFV------------------ 67
+RP L + +LLIY+AF +P + L +RFKK++ +
Sbjct: 754 DRPFLFAVVEEQLLIYEAFHYPYPQQRYRLSVRFKKVRHTAILQRFRRIGRDDFKLLADD 813
Query: 68 -----------------SDRSKRANEQPG------------LPRGVRISQMRYFSNIAGY 98
S+RS+R + G L Q+ F N+AGY
Sbjct: 814 FQFSEQYRRRRKRSKHDSNRSRRGDRHSGRRQEAHEHEPYRLTYEAPARQLSPFENVAGY 873
Query: 99 QGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISV 158
G+F+ G +P + FL+ +G+LR HPM IDGPV AP+ + R F YF A +R+S
Sbjct: 874 AGLFIGGGYPYFCFLSKQGDLRLHPMHIDGPVVAFAPYCSPKQLRAFAYFTADGMMRVSS 933
Query: 159 LPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTD 218
LP+ +D P KV L HF+ Y +E+ TY + TS P G+DK+ T
Sbjct: 934 LPSKFDFDRSIPSMKVELGRAAHFVVYLMESHTYALTTSEQMPCHKVVTLIGDDKQFETF 993
Query: 219 PRDS-RFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRG 277
R++ FI P + QF + L+S +W +P E+EHV + V ++ EG+ SGL+
Sbjct: 994 DREAPHFIYPTMEQFKLQLYSADTWLPVPGAELDFDEFEHVTACQEVQLKSEGSASGLQS 1053
Query: 278 YIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA 337
Y+A+GT NY E+V RGR+L+ D++EVVPEP +P+TK K+K++Y+KEQKGPVT++C +
Sbjct: 1054 YLAIGTVLNYGEEVLIRGRLLIIDVVEVVPEPDRPMTKFKLKVVYSKEQKGPVTSLCSLR 1113
Query: 338 GFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPE 397
G+L+T +GQK+YIWQ KDN L GI+F+D +VY+ M S++ L L D ++LLRYQ E
Sbjct: 1114 GYLLTGMGQKVYIWQYKDNALVGISFLDLQVYVHQMASIRYLALTADAFFGVSLLRYQEE 1173
Query: 398 YRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHND 457
Y+ LSLV+RD +P D L +FL
Sbjct: 1174 YKALSLVSRDPRP------------------DEVLAVEFLV------------------- 1196
Query: 458 ILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPS 517
+ + + F+++ +++ ++Y PE+ +S GG RL+ + D+H G VN F ++RC
Sbjct: 1197 ---DRTDLSFLMTSAAGDILTYVYLPESLDSFGGQRLVPQADYHFGSQVNAFVRMRCHAQ 1253
Query: 518 SISDAPGARSRFLT----WYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNP 573
I A R L +AS DG++ + LPLPE+ YR L MLQ++++ GLN
Sbjct: 1254 EI--AGRKRQEVLQRQGLIFASSDGSVNYLLPLPEREYRLLGMLQSLLIDMLPSFAGLNV 1311
Query: 574 RAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIE 632
+RT + P++ IIDG++ +L + ++ +I ++IGS H+ I+ EL +E
Sbjct: 1312 DDYRTVRFPNSCLREPTKNIIDGNICMLYLYIDALQQEDIVRQIGSSHSQIMLELAYME 1370
>gi|324499955|gb|ADY39993.1| Cleavage and polyadenylation specificity factor subunit 1 [Ascaris
suum]
Length = 1434
Score = 394 bits (1013), Expect = e-107, Method: Compositional matrix adjust.
Identities = 220/641 (34%), Positives = 358/641 (55%), Gaps = 52/641 (8%)
Query: 11 AMDETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGA---LKLRFKKLKVLFV 67
A E I+ E+L +G++ RP+L V + +Y+ F + G L +RFK+L V
Sbjct: 833 AKPEEIIVEVLLTGMGMNQGRPMLFVVVDDMVSVYEMFMYDNGVVEHLAVRFKRLPYTTV 892
Query: 68 SDRSKRANEQPGLPRGVRISQMRYFSNIAGYQ-------GVFLCGPHPAWLFLTSRGELR 120
+ + P +RY + + ++ GVF+C +P +FL G LR
Sbjct: 893 TRSCRFQGNDGRAPVEAARDTVRYRTALHPFERIGNILNGVFICSSYPC-VFLMDSGILR 951
Query: 121 AHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKS-ELRISVLPTHLSYDAPWPVRKVPLKCT 179
HP+ ++GP+ + F+NV CP GF+Y + +RI+ LPT + D+ PVRK+ T
Sbjct: 952 MHPLNLEGPILSFTAFNNVLCPNGFIYLTEREWAMRIAKLPTDVELDSSLPVRKIRTGRT 1011
Query: 180 PHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSP 239
H + Y L++ TY +V S +P+ EDK + F+ P + + V L+SP
Sbjct: 1012 IHNIVYLLQSNTYAVVGSEKKPNNRLCVLVNEDKSFDEHEKADSFVLPELEVYDVKLYSP 1071
Query: 240 FSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILL 299
W+ +P + ++E + C + V + EGT+SG++ Y+A+GT NY E+V RGRI++
Sbjct: 1072 EDWKPVPNAEIKMEDFEVLTCCEEVVLRSEGTVSGVQNYLAVGTACNYGEEVLVRGRIII 1131
Query: 300 FDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLT 359
+IIEVVPEPGQP +K++IK +Y KEQKGPVT++C G+L+ +GQK++IW +DN+L
Sbjct: 1132 SEIIEVVPEPGQPTSKHRIKTLYDKEQKGPVTSLCSCNGYLLAGMGQKVFIWLFRDNNLQ 1191
Query: 360 GIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYY 419
GI+F+D YI +V V+NL L D RS+ALLRYQ EY+ LSL +RD
Sbjct: 1192 GISFLDMHFYIHQLVGVRNLALACDIYRSVALLRYQEEYKALSLASRDM----------- 1240
Query: 420 AGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLF 479
R ++ + +FL I ++ M F++SD+ N+ +F
Sbjct: 1241 -----RAVVQPPMAAQFL-------------IDNRQ---------MAFIMSDEAANIAVF 1273
Query: 480 MYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSS--ISDAPGARSRFLTWYASLD 537
Y PEA ES+GG RLI +++ ++G +VN+F +++ SS + + + +R + SLD
Sbjct: 1274 NYLPEALESSGGERLILRSEINIGTNVNSFMRVKGHISSGFVENEHYSLNRQSVLFCSLD 1333
Query: 538 GALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGS 597
G+ GF PL EK +RRL MLQ +M + + GLN + R + + +R ++DG
Sbjct: 1334 GSFGFVRPLSEKVFRRLHMLQQLMSSLVAQAAGLNVKGSRAARPQRPNHYLNTRNMVDGD 1393
Query: 598 LVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
+V+++L LSL ++ ++ +K+G+ I+D+L +I L++H+
Sbjct: 1394 VVFQYLHLSLADKNDLARKLGTSRYHIIDDLTEISRLTTHY 1434
>gi|449661926|ref|XP_002167992.2| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Hydra magnipapillata]
Length = 1122
Score = 387 bits (994), Expect = e-105, Method: Compositional matrix adjust.
Identities = 190/449 (42%), Positives = 277/449 (61%), Gaps = 40/449 (8%)
Query: 190 KTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTN 249
+ Y + +S E +F+ E++E T R+ R+I P + +F VSL SP SWE +P +
Sbjct: 714 QVYAVASSYTENQKKLPRFHTEEREFDTVEREPRYIYPQIERFVVSLISPTSWETVPNSR 773
Query: 250 FPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEP 309
L E+EHV C+K + + E GL+ Y+ +GT +NY ED+ C+GRIL+FD++EVVPEP
Sbjct: 774 TVLQEFEHVTCMKVLLLHSELVDIGLKQYLVVGTTFNYGEDLACKGRILIFDVLEVVPEP 833
Query: 310 GQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVY 369
GQPLTK K K +Y KEQKGPVTAIC +G+++ AVGQKIY ++ KDNDL G+AF+D++V+
Sbjct: 834 GQPLTKTKCKCVYDKEQKGPVTAICATSGYIIAAVGQKIYAFKYKDNDLVGVAFVDSQVF 893
Query: 370 IASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIID 429
++++++N+I+ D +RSI+L+R+Q E+++L+LV+RD K + + ++ ID
Sbjct: 894 TVNLMAIRNVIVAADISRSISLVRFQVEHKSLALVSRDTKTLEAYTSEFF--------ID 945
Query: 430 GSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESN 489
GS V GF++SD ++N+V+F YQPEA ES
Sbjct: 946 GSQV--------------------------------GFVVSDAERNIVIFSYQPEALESF 973
Query: 490 GGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEK 549
GGHRL++K D ++G HVNT +I+ S + + R L +LDG++G PL EK
Sbjct: 974 GGHRLLQKADINIGSHVNTMMRIKLIQDEQSLSKSSEQRQLIILPTLDGSIGILFPLSEK 1033
Query: 550 NYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGE 609
+RRL MLQN +V H GLNPRAFR NP R I+DG L+ K+ QLS E
Sbjct: 1034 PFRRLTMLQNKLVDCLPHKAGLNPRAFRALDVPLRTLTNPHRNILDGQLLDKYAQLSFQE 1093
Query: 610 RLEICKKIGSKHNDILDELYDIEALSSHF 638
R +I KK+G+ ILD++ DIE S+H
Sbjct: 1094 RFDIAKKMGTTSGQILDDMMDIERASNHL 1122
>gi|327287424|ref|XP_003228429.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Anolis carolinensis]
Length = 1294
Score = 386 bits (991), Expect = e-104, Method: Compositional matrix adjust.
Identities = 226/540 (41%), Positives = 304/540 (56%), Gaps = 114/540 (21%)
Query: 16 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGALKLRFKKL--KVLFVSD 69
+V+E+L V+LG +RP LLV ELLIY+AF H + LK+RFKK+ + F
Sbjct: 847 LVKEVLLVALGNRQSRPYLLVHVDQELLIYEAFNHDSQLGQTNLKVRFKKVPHNINFREK 906
Query: 70 R---SKRANEQPG-----LPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRA 121
+ SK+ E G +PRG R+++ RYF +I GY GVF+CGP P WL +TSRG LR
Sbjct: 907 KPRPSKKKTESAGGEEASVPRG-RVARFRYFEDIYGYSGVFICGPSPHWLLVTSRGALRL 965
Query: 122 HPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPH 181
HPMTIDGP+ + APFHN + C
Sbjct: 966 HPMTIDGPIESFAPFHN-------------------------------------VNCPKG 988
Query: 182 FLAYHLETKTYCIVTSTAEPSTDYYKFNGEDK-ELVTDPRDSRFIPPLVSQFHVSLFSPF 240
FL ++ + I + + + GED E T R P H L F
Sbjct: 989 FLYFNRQGTGGGIHNACSR----IPRMTGEDDMEFETIERGVLKCVPGEGFGHPDLILSF 1044
Query: 241 SWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLF 300
+ L EWEHV C+K VS++ E T+SGL+GYIA+GT E+VTCRGRIL+
Sbjct: 1045 KID--------LEEWEHVTCMKTVSLKSEETVSGLKGYIAVGTCLMQGEEVTCRGRILIM 1096
Query: 301 DIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTG 360
DIIEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH G+LV+A+GQKI++W LKDNDLTG
Sbjct: 1097 DIIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGYLVSAIGQKIFLWSLKDNDLTG 1156
Query: 361 IAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYA 420
+AFIDT++YI M+SVKN IL D +SI+LLRYQ E +TLSLV+RD KP
Sbjct: 1157 MAFIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKP---------- 1206
Query: 421 GNPSRGIIDGSLVWKFLQLSLGERLEI-CKKIGSKHNDILDEFSSMGFMISDKDKNVVLF 479
LE+ C D + + +GF++SD+D+N++++
Sbjct: 1207 ------------------------LEVYCV-------DFMVDSCQLGFLVSDRDRNLLVY 1235
Query: 480 MYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCK-----PSSISDAPGARSRFLTWYA 534
MY PEA+ES GG RL+++ DFH+G HVN F++ C+ P+ S A ++ +TW+
Sbjct: 1236 MYLPEAKESFGGMRLLRRADFHVGAHVNAFWRTPCRGAMEGPTKKSSA--WENKHITWFG 1293
>gi|268580265|ref|XP_002645115.1| Hypothetical protein CBG16808 [Caenorhabditis briggsae]
gi|296439546|sp|A8XPU7.1|CPSF1_CAEBR RecName: Full=Probable cleavage and polyadenylation specificity
factor subunit 1; AltName: Full=Cleavage and
polyadenylation specificity factor 160 kDa subunit;
Short=CPSF 160 kDa subunit
Length = 1454
Score = 370 bits (951), Expect = 1e-99, Method: Compositional matrix adjust.
Identities = 213/640 (33%), Positives = 344/640 (53%), Gaps = 56/640 (8%)
Query: 17 VQELLTVSLGLHGNRPLLLVRTQHELLIYQAFR--HPK-GALKLRFKKLKVL-------F 66
V E V +G++ P+L+ E+++Y+ F +P+ G L + F+KL L +
Sbjct: 853 VVEAQIVGMGINQAHPVLIAIIDEEVVLYEMFASYNPQPGHLGVAFRKLPHLIGLRTSPY 912
Query: 67 VSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQ-GVFLCGPHPAWLFLTSRGELRAHPMT 125
V+ KRA + + G R + + F I+ GV + G P L + G ++ H MT
Sbjct: 913 VNIDGKRAPFEMEMEHGKRYTLIHPFERISSINNGVMIGGAVPTLLVYGAWGGMQTHQMT 972
Query: 126 IDGPVSTLAPFHNVNCPRGFLYF-NAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLA 184
IDG + PF+N N GF+Y KSELRI+ + YD P+PV+K+ + T H +
Sbjct: 973 IDGSIKAFTPFNNENVLHGFVYMTQQKSELRIARMHPDFDYDMPYPVKKIEVGKTVHNVR 1032
Query: 185 YHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEE 244
Y + + Y +V+S +PS + +DK+ +D F+ P ++ ++LFS W
Sbjct: 1033 YLMNSDIYAVVSSVPKPSNKIWVVMNDDKQEEIHEKDENFVLPAPPKYTLNLFSSQDWAA 1092
Query: 245 IPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIE 304
+P T F + E V +++V ++ E GL Y+AL T NY E+V RGRI+L ++IE
Sbjct: 1093 VPNTEFEFEDMEAVTAMEDVPLKSESRYGGLDTYLALATVNNYGEEVLVRGRIILCEVIE 1152
Query: 305 VVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFI 364
VVPEPGQP + KIK++Y KEQKGPVT +C + G L++ +GQK++IWQ KDNDL GI+F+
Sbjct: 1153 VVPEPGQPTSNRKIKVLYDKEQKGPVTGLCAINGLLLSGMGQKVFIWQFKDNDLMGISFL 1212
Query: 365 DTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPS 424
D Y+ + S++ + L D S++L+R+Q E + +S+ +RD + K A S
Sbjct: 1213 DMHYYVYQLHSIRTIALALDARESMSLIRFQEENKAMSIASRD------DRKCAQAPMAS 1266
Query: 425 RGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPE 484
++DG +GF++SD+ N+ LF Y PE
Sbjct: 1267 EFLVDG--------------------------------MHIGFLLSDEHGNITLFSYSPE 1294
Query: 485 ARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSI-SDAPGARS----RFLTWYASLDGA 539
A ESNGG RL K ++G ++N F +++ S + S +P R R T + SLDG+
Sbjct: 1295 APESNGGERLTVKAAINIGTNINAFLRVKGHTSLLDSSSPEERENIEQRMNTIFGSLDGS 1354
Query: 540 LGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYK-GKGYYAGNPSRGIIDGSL 598
G+ PL EK+YRRL LQ + + T GL+ + R+ K + G +R +IDG +
Sbjct: 1355 FGYIRPLTEKSYRRLHFLQTFIGSVTPQIAGLHIKGARSSKPSQPIVNGRNARNLIDGDV 1414
Query: 599 VWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
V ++L LS+ ++ ++ +++G ILD+L + ++ ++
Sbjct: 1415 VEQYLHLSVYDKTDLARRLGVGRYHILDDLMQLRRMAYYY 1454
>gi|308459872|ref|XP_003092248.1| CRE-CPSF-1 protein [Caenorhabditis remanei]
gi|308253976|gb|EFO97928.1| CRE-CPSF-1 protein [Caenorhabditis remanei]
Length = 1448
Score = 369 bits (947), Expect = 3e-99, Method: Compositional matrix adjust.
Identities = 203/641 (31%), Positives = 345/641 (53%), Gaps = 58/641 (9%)
Query: 17 VQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRH---PKGALKLRFKKLKVLF------- 66
V E V +G++ + P+L+ ++++Y+ F H G L + F+KL
Sbjct: 847 VMEAQIVGMGINQSHPVLMAIVDEQVVMYEMFSHYNPQAGHLGIAFRKLPHFICLRTSSH 906
Query: 67 VSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQ-GVFLCGPHPAWLFLTSRGELRAHPMT 125
++ KRA + + G R + + F I+ GV + G P + + G ++ H MT
Sbjct: 907 LNSDGKRAPFEMEVENGKRYTLIHPFERISSINNGVMIGGAVPTLVVYGAWGGMQTHQMT 966
Query: 126 IDGPVSTLAPFHNVNCPRGFLYF-NAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLA 184
IDGP+ PF+N N GF+Y KSELRI+ + Y+ P+P++K+ + T H +
Sbjct: 967 IDGPIKAFTPFNNENVLHGFVYMTQQKSELRIARMHPDFDYEMPYPMKKIEVGRTIHNVR 1026
Query: 185 YHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEE 244
Y + + Y +V+S +PS + +DK+ +D F+ P ++ ++LFS W+
Sbjct: 1027 YLMNSDVYVVVSSIPKPSNKIWVVMNDDKQEEIHEKDENFVLPAPPKYTLNLFSSQDWKA 1086
Query: 245 IPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIE 304
+P T + E V ++VS++ E T+SG+ Y+A+GT NY E+V RGRI+L ++IE
Sbjct: 1087 VPNTEIEFEDMEAVTACEDVSLKSESTISGVETYLAVGTVNNYGEEVLVRGRIILCEVIE 1146
Query: 305 VVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFI 364
VVPEP QP + KIK+++ KEQKGPVT +C + G L++ +GQK++IWQ KDNDL G++F+
Sbjct: 1147 VVPEPDQPTSNRKIKVLFDKEQKGPVTGLCAINGLLLSGMGQKVFIWQFKDNDLMGLSFL 1206
Query: 365 DTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPT-QPNSKGYYAGNP 423
D Y+ + S++ + L D S++L+R+Q E + +S+ +RD + T +P +
Sbjct: 1207 DMHYYVYQLHSLRTIALACDARESMSLIRFQEENKAMSIASRDDRRTAKPPMAAQF---- 1262
Query: 424 SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQP 483
++DG + +GF++SD++ N+ LF Y P
Sbjct: 1263 ---VVDG--------------------------------AHLGFLLSDENGNITLFNYSP 1287
Query: 484 EARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPS-----SISDAPGARSRFLTWYASLDG 538
EA ESNGG RL + ++G +VN F +++ S S + R T + SLDG
Sbjct: 1288 EAPESNGGERLTVRAAMNIGTNVNAFLRVKGHTSLLNLQSDEEKESVEQRMSTIFGSLDG 1347
Query: 539 ALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYK-GKGYYAGNPSRGIIDGS 597
+ GF PL EK+YRRL LQ + + T GL+ + R+ + + G +R +IDG
Sbjct: 1348 SFGFVRPLSEKSYRRLHFLQTFIGSVTPQIAGLHIKGARSARPAQPIVNGRNARNLIDGD 1407
Query: 598 LVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
+V ++L LSL ++ ++ +++G I+D+L + ++ ++
Sbjct: 1408 VVEQYLHLSLYDKTDLARRLGVGRYHIIDDLMHLRRMAYYY 1448
>gi|25148482|ref|NP_500157.2| Protein CPSF-1 [Caenorhabditis elegans]
gi|22096347|sp|Q9N4C2.2|CPSF1_CAEEL RecName: Full=Probable cleavage and polyadenylation specificity
factor subunit 1; AltName: Full=Cleavage and
polyadenylation specificity factor 160 kDa subunit;
Short=CPSF 160 kDa subunit
gi|373220398|emb|CCD73182.1| Protein CPSF-1 [Caenorhabditis elegans]
Length = 1454
Score = 363 bits (933), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 203/641 (31%), Positives = 339/641 (52%), Gaps = 58/641 (9%)
Query: 17 VQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPK---GALKLRFKKLKVLF------- 66
V E V +G++ P+L+ ++++Y+ F G L + F+KL
Sbjct: 853 VLEAQIVGMGINQAHPILMAIVDEQVVLYEMFSSSNPIPGHLGISFRKLPHFICLRTSSH 912
Query: 67 VSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQ-GVFLCGPHPAWLFLTSRGELRAHPMT 125
++ KRA + + G R S + F ++ GV + G P L + G ++ H MT
Sbjct: 913 LNSDGKRAPFEMKINNGKRFSLIHPFERVSSVNNGVMIVGAVPTLLVYGAWGGMQTHQMT 972
Query: 126 IDGPVSTLAPFHNVNCPRGFLYFNA-KSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLA 184
+DGP+ PF+N N G +Y KSELRI+ + Y+ P+PV+K+ + T H +
Sbjct: 973 VDGPIKAFTPFNNENVLHGIVYMTQHKSELRIARMHPDFDYEMPYPVKKIEVGRTIHHVR 1032
Query: 185 YHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEE 244
Y + + Y +V+S +PS + +DK+ +D F+ P ++ ++LFS W
Sbjct: 1033 YLMNSDVYAVVSSIPKPSNKIWVVMNDDKQEEIHEKDENFVLPAPPKYTLNLFSSQDWAA 1092
Query: 245 IPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIE 304
+P T + E V ++V+++ E T+SGL +A+GT NY E+V RGRI+L ++IE
Sbjct: 1093 VPNTEISFEDMEAVTACEDVALKSESTISGLETLLAMGTVNNYGEEVLVRGRIILCEVIE 1152
Query: 305 VVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFI 364
VVPEP QP + KIK+++ KEQKGPVT +C + G L+ +GQK++IWQ KDNDL GI+F+
Sbjct: 1153 VVPEPDQPTSNRKIKVLFDKEQKGPVTGLCAINGLLLCGMGQKVFIWQFKDNDLMGISFL 1212
Query: 365 DTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVAR-DYKPTQPNSKGYYAGNP 423
D Y+ + S++ + + D S++L+R+Q + + +S+ +R D K QP
Sbjct: 1213 DMHYYVYQLHSLRTIAIACDARESMSLIRFQEDNKAMSIASRDDRKCAQPPMA------- 1265
Query: 424 SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQP 483
S+ ++DG+ V GF++SD+ N+ +F Y P
Sbjct: 1266 SQLVVDGAHV--------------------------------GFLLSDETGNITMFNYAP 1293
Query: 484 EARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSI-----SDAPGARSRFLTWYASLDG 538
EA ESNGG RL + ++G ++N F ++R S + + R T +ASLDG
Sbjct: 1294 EAPESNGGERLTVRAAINIGTNINAFVRLRGHTSLLQLNNEDEKEAIEQRMTTVFASLDG 1353
Query: 539 ALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYK-GKGYYAGNPSRGIIDGS 597
+ GF PL EK+YRRL LQ + + T GL+ + R+ K + G +R +IDG
Sbjct: 1354 SFGFVRPLTEKSYRRLHFLQTFIGSVTPQIAGLHIKGSRSAKPSQPIVNGRNARNLIDGD 1413
Query: 598 LVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
+V ++L LSL ++ ++ +++G I+D+L + ++ ++
Sbjct: 1414 VVEQYLHLSLYDKTDLARRLGVGRYHIIDDLMQLRRMAFYY 1454
>gi|357611296|gb|EHJ67409.1| putative cleavage and polyadenylation specific factor 1 [Danaus
plexippus]
Length = 328
Score = 362 bits (928), Expect = 5e-97, Method: Compositional matrix adjust.
Identities = 179/368 (48%), Positives = 244/368 (66%), Gaps = 49/368 (13%)
Query: 272 LSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVT 331
LSGLRGYIA+GTNYNY ED+T RGRIL++DII+VVPEPGQPLTKN+ K IYAKEQKGPVT
Sbjct: 9 LSGLRGYIAIGTNYNYGEDITSRGRILIYDIIDVVPEPGQPLTKNRFKEIYAKEQKGPVT 68
Query: 332 AICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIAL 391
A+ V GFL++AVGQKIY+WQLKDNDL G+AFIDT++Y+ M++VKNLILV D +SI+L
Sbjct: 69 ALTQVLGFLISAVGQKIYLWQLKDNDLVGVAFIDTQIYVHRMLAVKNLILVADVYKSISL 128
Query: 392 LRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI 451
LRYQ ++RTLSLV+RD + Q + N
Sbjct: 129 LRYQHQHRTLSLVSRDLRTAQIYDMQFMIDN----------------------------- 159
Query: 452 GSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFK 511
+S+GF++S+ + N ++M+QP+ARES GG RLI+K D+HLGQ V+ F+
Sbjct: 160 -----------TSLGFLVSESEGNFAMYMHQPQARESYGGQRLIRKCDYHLGQRVHAMFR 208
Query: 512 IRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGL 571
+ A G R +T + +LDG +G+ LP+ EK YRRLLMLQNV+ + H GL
Sbjct: 209 L--------AARGERQTHVTMFTTLDGGVGYVLPVSEKVYRRLLMLQNVINNYCCHLAGL 260
Query: 572 NPRAFRTYK-GKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYD 630
NP+A+RTYK + G +RG++DG LV + + E+ +I +KIG+K +I+ +LY+
Sbjct: 261 NPKAYRTYKVSRRALCGGAARGVLDGDLVSLYTSMPRTEQQDIARKIGTKVEEIMSDLYE 320
Query: 631 IEALSSHF 638
I+ ++HF
Sbjct: 321 IDRQTAHF 328
>gi|341892673|gb|EGT48608.1| CBN-CPSF-1 protein [Caenorhabditis brenneri]
Length = 1440
Score = 360 bits (924), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 204/640 (31%), Positives = 349/640 (54%), Gaps = 57/640 (8%)
Query: 17 VQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPK---GALKLRFKKLKVLFVSDRS-- 71
+ E V +G++ + P+L+ ++++Y+ F +P G L + F+KL F+ RS
Sbjct: 840 IMEAQIVGMGINQSHPILMAIVDEQVIMYEMFANPNSQPGHLGIAFRKLP-HFICLRSSP 898
Query: 72 ------KRANEQPGLPRGVRISQMRYFSNIAGYQ-GVFLCGPHPAWLFLTSRGELRAHPM 124
KRA Q G R + F ++ GV + G P L + G ++ HPM
Sbjct: 899 YLKSDGKRAAFQIVEEDGKRYPLIHSFERVSTVNNGVIIGGAVPTLLVYGAWGGMQTHPM 958
Query: 125 TIDGPVSTLAPFHNVNCPRGFLYF-NAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFL 183
TIDG + PF+ N P GF+Y KSELRI+ + Y+ P+PV+K+ + T H +
Sbjct: 959 TIDGSIKAFTPFNIDNVPYGFVYMTQKKSELRIAKMHADFDYEMPYPVKKIEVGRTIHSV 1018
Query: 184 AYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWE 243
Y + + Y +V+S +PS + +DK+ +D F+ P ++ ++LFS W+
Sbjct: 1019 RYLMNSDVYVVVSSVPKPSNKIWVVMNDDKQEEIHEKDENFVLPAPPKYTLNLFSSQDWK 1078
Query: 244 EIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDII 303
+P T + E V ++V+++ E T +G Y+A+GT NY E+V RGRI+L ++I
Sbjct: 1079 AVPNTEISFEDMEAVTACEDVALKSESTHTGFETYLAIGTVNNYGEEVLVRGRIILAEVI 1138
Query: 304 EVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAF 363
EVVPEPGQP + KIK+++ KEQKGPVT +C + G L++ +GQK++IWQ KDNDL G++F
Sbjct: 1139 EVVPEPGQPTSNRKIKVLFDKEQKGPVTGLCAMEGLLLSGMGQKVFIWQFKDNDLMGLSF 1198
Query: 364 IDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNP 423
+D Y+ + S++++ L D S++L+R+Q E + +S+ +RD + K A
Sbjct: 1199 LDMHYYVYQLHSLRSIALACDARESMSLIRFQEENKAMSVASRD------DRKCAQAPMA 1252
Query: 424 SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQP 483
++ ++DG + +GF++SD++ N+ LF Y P
Sbjct: 1253 AQFMVDG--------------------------------AHIGFLLSDENGNITLFNYAP 1280
Query: 484 EARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSIS----DAPGARSRFLTWYASLDGA 539
EA ESNGG RL + ++G ++N F +++ + ++ + A R T +ASLDG+
Sbjct: 1281 EAPESNGGERLTVRAAINIGTNINAFLRVKGHTALLNLHEFEKEAAEQRMSTIFASLDGS 1340
Query: 540 LGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYK-GKGYYAGNPSRGIIDGSL 598
GF PL EK+YRRL LQ + + + GL+ + R+ K + G +R +IDG +
Sbjct: 1341 FGFIRPLTEKSYRRLHFLQTFIGSVSQQIAGLHIKGARSAKPPQPIVNGRNARNLIDGDV 1400
Query: 599 VWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
V ++L LS ++ ++ +++G I+D+L ++ ++ ++
Sbjct: 1401 VEQYLNLSTYDKTDLARRLGVGKYHIIDDLMELRRMAFYY 1440
>gi|313232279|emb|CBY09388.1| unnamed protein product [Oikopleura dioica]
Length = 1451
Score = 360 bits (923), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 216/662 (32%), Positives = 341/662 (51%), Gaps = 83/662 (12%)
Query: 4 FRSHSPSAMDETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQ------AFRHPKGALKL 57
F +D VQE+ ++G + P ++V +L+IY+ F+ L
Sbjct: 846 FEGSEGRRVDVLDVQEMNVFNMG-PSSLPYIVVMIGDQLMIYRFRATLNRFQTESPVLSG 904
Query: 58 RFKKLKVLFVSDRSKRANEQPGL--------PRGVRISQMRYFSNIAGYQGVFLCGPHPA 109
RF KL+ D++K PG+ R +I MR F NI+ + G+FL G +P
Sbjct: 905 RFIKLQ-----DKTKLLRRIPGVHDESSKTKNRNNKI--MRQFMNISDHNGIFLGGAYPT 957
Query: 110 WLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYF-NAKSELRISVLPTHLSYDAP 168
W+F G L H M +G V+ PF N C GFLYF ++ L ++ L L YDA
Sbjct: 958 WIFCGQNGRLNIHSMWQEGFVNAFTPFDNEKCADGFLYFRHSTKTLTVANLQPFLKYDAD 1017
Query: 169 WPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGED-KELVTDPRDSRFIPP 227
WP +K+ L TP F +Y LE K + S +E K N E KE P
Sbjct: 1018 WPFKKIKLNYTPCFSSYDLEQKVLTVCGSRSEKIEMLPKINAEGHKEYEDLPEVQNVETQ 1077
Query: 228 LVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNY 287
L QF V +FSP SWE IP + + EH+LC ++V ++ E ++SG + YIA+GT+
Sbjct: 1078 LFPQFFVEMFSPASWEVIPNSRIEMDAHEHILCCRSVYLKSEASMSGRKQYIAIGTSNIC 1137
Query: 288 SEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQK 347
ED RGR++L ++I+VVPEPG+PLT+ K K ++ Q+GPV+A+ + G L+ A+GQK
Sbjct: 1138 GEDFQSRGRLILLEVIDVVPEPGKPLTRYKYKTVFDASQRGPVSAVDSLDGALIAAIGQK 1197
Query: 348 IYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARD 407
++I +D++L F+DT++Y + KN LVGD + I LLR+Q E +S ++R
Sbjct: 1198 VFIHAFQDDNLRATGFVDTQLYTHATHCFKNYALVGDIQQGITLLRHQGERNCISQISRA 1257
Query: 408 YKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGF 467
+ + + G ++DG+ V G
Sbjct: 1258 RRAGEVTAVGI--------LLDGNQV--------------------------------GL 1277
Query: 468 MISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHV-----------NTFFKIRCKP 516
+ +D +N+ ++MY+P+ +ESNGG +L+++ D +LG+ V +TF K+
Sbjct: 1278 VSTDMQRNLQVYMYKPDQKESNGGKQLVRQADINLGKRVISIWNSLGRQNDTFTKVALTE 1337
Query: 517 SSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAF 576
+ +R +T+YA LDG++G +P+ EK +RRL MLQ ++ +H H GGLNPR +
Sbjct: 1338 ND--------ARHVTFYAGLDGSIGDIVPVSEKVFRRLEMLQTLVQSHLPHYGGLNPREY 1389
Query: 577 RTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSS 636
R + N ++ IIDG L+ +F LS E+ ++ +KIG +LD++ D++ +
Sbjct: 1390 RYCTNEYRDLENAAKNIIDGDLLERFNGLSFTEQTDLSRKIGVTREALLDDMMDVQRTKN 1449
Query: 637 HF 638
F
Sbjct: 1450 LF 1451
>gi|320169222|gb|EFW46121.1| cleavage and polyadenylation specificity factor 1 [Capsaspora
owczarzaki ATCC 30864]
Length = 1725
Score = 348 bits (894), Expect = 4e-93, Method: Compositional matrix adjust.
Identities = 195/549 (35%), Positives = 295/549 (53%), Gaps = 52/549 (9%)
Query: 91 YFSNIAGYQ---GVFLCGPHPAWLFLT-SRGELRAHPMTIDGPVSTLAPFHNVNCPRGFL 146
Y + G+Q GVF+CG P WL ++ +R LRAH M DG VS + F+N CP GF+
Sbjct: 1214 YTGVLGGHQLCSGVFVCGRRPLWLLMSPTRKALRAHLMLTDGSVSAFSAFNNNACPGGFV 1273
Query: 147 YFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYY 206
YF + LR L ++D PWPVR+VPL+ T H++ YH +TY +VTS +P +
Sbjct: 1274 YFTTQGTLRFCQLAPTTNHDNPWPVRRVPLRATAHYIGYHEVFRTYVLVTSHPKPYFNLP 1333
Query: 207 KF-NGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVS 265
+ N E V R IP F + L SP +WE I +F L +E V + +
Sbjct: 1334 RLTNDETYTPVPYTPKPRAIPATFDTFSLQLISPVTWESI--HSFDLPAFERVTSVDIAA 1391
Query: 266 MEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKE 325
+ + T++GL+ Y+ +GT EDVTC GRI++F+II+VVPE +P T K+K + +E
Sbjct: 1392 ITSQETVTGLKDYVVIGTTVIEGEDVTCHGRIIVFEIIDVVPEVNRPQTNRKLKYLMERE 1451
Query: 326 QKGPVTAICHVAGFLVTAVGQKIYIWQLKDND-LTGIAFIDTEVYIASMVSVKNLILVGD 384
QKG +TA+ HV G LV+ +GQKI IWQ +D + G+AFIDT+ ++ S+ ++KN ILVGD
Sbjct: 1452 QKGAITALSHVCGHLVSCIGQKIIIWQFASDDTMDGVAFIDTQTFVVSVSAIKNFILVGD 1511
Query: 385 YARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGER 444
S+ LLR+ + L +ARD+ S + ++DG
Sbjct: 1512 LNNSVFLLRFNETTKHLGFIARDFDHMSVASTQF--------LVDG-------------- 1549
Query: 445 LEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQ 504
SS+GF+ +D +N+V+F Y P RESN G RL+++ DFH+G
Sbjct: 1550 ------------------SSLGFLATDSHQNLVVFAYNPLNRESNNGQRLLRQLDFHVGS 1591
Query: 505 HVNTFFKI--RCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMV 562
HV ++ R P S+ D + R + A+L+G+L P+ E +RRL LQ +V
Sbjct: 1592 HVQQVLRMVPRSLPVSV-DRGASVKRHIDLLATLEGSLNALAPIGETTFRRLEWLQRQLV 1650
Query: 563 THTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHN 622
GLNP +R Y+ + +IDG L+ +FL L L E+ E+ ++ +
Sbjct: 1651 G-LQQRAGLNPIGYRAYRFPRKMTTTRAGNVIDGELLSRFLYLGLAEQRELARQRRNTPE 1709
Query: 623 DILDELYDI 631
D++D++ +
Sbjct: 1710 DLIDDILSV 1718
>gi|358338426|dbj|GAA28838.2| cleavage and polyadenylation specificity factor subunit 1 [Clonorchis
sinensis]
Length = 1741
Score = 346 bits (888), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 210/679 (30%), Positives = 353/679 (51%), Gaps = 50/679 (7%)
Query: 7 HSPSAMDETI---VQELLTVSLGLHGNRPLLLVRTQHELLIYQAF------RHPKGALK- 56
+ P+A ++ I V E+ +G + +RP+LLVRT E+ ++A HP +
Sbjct: 1066 NCPAAEEDNIPPTVLEITVFPIGRNRDRPVLLVRTSQEIAFFEALCPSHNEAHPFASESW 1125
Query: 57 ----LRFKKLKVL--FVSDRSKRANEQPGLPRGVRISQ---MRYFSNIAGYQGVFLCGPH 107
LR+++L + V+ R R + + + +++ +R F +I G+ GVF+CG
Sbjct: 1126 SQEGLRWRRLPIPCPLVAPRRVRTDPKIADVQSTMLTRKNLLRPFEDIDGHCGVFVCGAT 1185
Query: 108 PAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDA 167
P WLF + G +R +IDG + + AP + CP GF+YF +E+R++ L S+
Sbjct: 1186 PIWLFSSDTGHIRVFNHSIDGIMGSFAPLNTDICPSGFVYFTYSNEMRLATLLPGYSFKE 1245
Query: 168 PWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGE-DKELVTDPRDSRFIP 226
+R VPL+ TP+FL YH+E+KTY +V + + + Y N E +KE R +
Sbjct: 1246 HLGMRWVPLELTPYFLQYHIESKTYALVGTRVKSCSSVYHLNAEGNKEEEVLLRPPTCVL 1305
Query: 227 PLVSQFHVSLFSPFS-------WEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYI 279
P + + + +++P + W+ IP WE V C+ + E T G + Y+
Sbjct: 1306 PSLDYYVLQMYAPSTSLAEATPWQAIPHACIDFEPWEVVTCMITAQLSSEQTFHGTKDYL 1365
Query: 280 ALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGF 339
ALG N +Y E++ RGRI++ D+I+VVPEPGQPLT++K+K IY EQKGPVTA+ G
Sbjct: 1366 ALGANLSYGEEIPVRGRIIILDVIDVVPEPGQPLTRHKLKTIYDGEQKGPVTALSSCQGH 1425
Query: 340 LVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYR 399
LV+A+GQK+YIW LK+ DL G+AF+D+E+YI S++ VKNLIL D +SI LLR+Q + R
Sbjct: 1426 LVSAIGQKVYIWTLKNADLVGVAFVDSELYIHSLLCVKNLILAADVLKSIQLLRFQSDLR 1485
Query: 400 TLSLVARDYKPTQPNSKGYYAGNPSRGII-----DGSLVWKFLQLSLGERLEICKKIGSK 454
LS+V+RD P + + ++ G + +++ + L R +++ +
Sbjct: 1486 VLSVVSRDAIPREVYTSNFFVDGRRLGFLVTDERGNVVIYSYDPLEPSSRSG--RRLVRR 1543
Query: 455 HNDILDEFSSMGFMISDKDKNVVLFMYQ---------PEARESNGGHRLIKKT------D 499
+ L + ++++ ++ +L + P A GG ++++T
Sbjct: 1544 ADMCLPTRAISSLRVANRLRHALLSVKSAGTGTQTTVPSA-AGVGGSEVLERTGKTGVSS 1602
Query: 500 FHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQN 559
F N+ + S ++ + + + + GA+ PL +K Y RL + +
Sbjct: 1603 FVAPGRANSASAMTLSTPSATNIDPEKLKHSVYLGTQTGAVFLIGPLRDKMYSRLRITEK 1662
Query: 560 VMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGS 619
++ H T GL P+ Y+ NPS + D L+W++L L +RLEI KK G
Sbjct: 1663 NLIHHFGPTCGLLPKLCWNYRPSAPELVNPSGQVADADLLWRYLTLPHSQRLEIAKKSGQ 1722
Query: 620 KHNDILDELYDIEALSSHF 638
I+D++ ++ A + HF
Sbjct: 1723 SLEGIMDDIAELNATTLHF 1741
>gi|312069702|ref|XP_003137805.1| hypothetical protein LOAG_02219 [Loa loa]
Length = 1065
Score = 339 bits (869), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 212/647 (32%), Positives = 333/647 (51%), Gaps = 103/647 (15%)
Query: 10 SAMDETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP---KGALKLRFKKLKVLF 66
+A E ++ ELL V +G++ RP+L + + +Y+ F + +G L +RFK+L
Sbjct: 504 AAKPEEVIMELLMVGMGMNQGRPMLFLLIDDTVSVYEMFTYNNGIQGHLAVRFKRLPYTV 563
Query: 67 VSDRSKRANEQPGLPRGVRISQMR----------YFSNIAGY-QGVFLCGPHPAWLFLTS 115
V+ RS R GL + +R +F I GVF+C +P FL +
Sbjct: 564 VT-RSCRFQ---GLDGRAAVESVRDAVRHKTVLHFFERIGNVLNGVFICSSYPCIFFLET 619
Query: 116 RGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSEL-RISVLPTHLSYDAPWPVRKV 174
G R HP+ +DGP+ + F+N CP GF+Y + L R+ A PV K+
Sbjct: 620 -GVPRLHPVNLDGPILSFTTFNNAACPNGFIYLTERERLMRV----------AKLPVTKM 668
Query: 175 PLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHV 234
C++ + +DK + F+ P + Q+ +
Sbjct: 669 ------------------CVLIN-------------DDKTFEEHEKPDTFVYPEMDQYKL 697
Query: 235 SLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCR 294
L+SP W+ + E+E V C + V + EGT+SG++ Y+A+GT NY E+V R
Sbjct: 698 QLYSPEDWKPVQNVEVLFEEFEVVTCCEEVVLRSEGTVSGVQNYLAVGTACNYGEEVLVR 757
Query: 295 GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLK 354
GRI++ +IIEVVPEPGQP +K++IK +Y KEQKGPVT++C G+L+T +GQK++IW K
Sbjct: 758 GRIIISEIIEVVPEPGQPTSKHRIKTLYDKEQKGPVTSLCSCNGYLLTGMGQKVFIWLFK 817
Query: 355 DNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKP-TQP 413
DN+L GI+F+D Y+ ++ V+NL L D RS+ALLRYQ EY+ LSL +RD + QP
Sbjct: 818 DNNLQGISFLDMHFYVHQLIGVRNLALACDMYRSVALLRYQEEYKALSLASRDMRSDVQP 877
Query: 414 NSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKD 473
+ IID MGF++SD+
Sbjct: 878 PMAAQF-------IIDN--------------------------------KQMGFVMSDEA 898
Query: 474 KNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSS--ISDAPGARSRFLT 531
N+ +F Y PE ES GG +L + + ++G VN+F +++ SS + + + R
Sbjct: 899 ANIAIFNYLPETLESLGGEKLTLRAEINIGTVVNSFIRVKGHISSGFVENELFSLERQSV 958
Query: 532 WYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSR 591
+ASLDG+ GF PL EK +RRL MLQ +M + GLN + R + +R
Sbjct: 959 LFASLDGSFGFLRPLTEKVFRRLHMLQQLMSSMVPQPAGLNAKGARAARPPRPNHYLNTR 1018
Query: 592 GIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
++DG +V ++L LSL E+ ++ +K+G+ I+D+L +I +++H+
Sbjct: 1019 NLVDGDMVMQYLHLSLPEKNDLARKLGTSRYHIIDDLIEICRVTAHY 1065
>gi|384487281|gb|EIE79461.1| hypothetical protein RO3G_04166 [Rhizopus delemar RA 99-880]
Length = 1468
Score = 338 bits (866), Expect = 7e-90, Method: Compositional matrix adjust.
Identities = 204/663 (30%), Positives = 342/663 (51%), Gaps = 88/663 (13%)
Query: 17 VQELLTVSLGLHGNRPLLLVRTQ-HELLIYQAFRHPKGA----LKLRFKKLKVLFVSDRS 71
+QE+L +G P L+VRT ++++IY+AF + + L LRF +++ +VS +S
Sbjct: 853 IQEILMTHIGKERKDPHLVVRTDTNDIIIYKAFTYLDESSPDRLALRFSRVQHEYVSRKS 912
Query: 72 KRANEQPGLPRGV------------------RISQMRY-----------FSNIAGYQGVF 102
+P RG+ ++S + F+++AGY GVF
Sbjct: 913 SSHESKPKKKRGIIDEFEIPDTDLNEEEEDLKLSTKKMDKKIQRKLLIPFTDVAGYAGVF 972
Query: 103 LCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTH 162
+ G PAWL + + +R HPM + + FHNVNC GF+ ++KS +++S L T
Sbjct: 973 VAGAQPAWLMCSCKSFVRVHPMKTEHEIVGFTQFHNVNCQHGFITVDSKSTIQLSRLRTE 1032
Query: 163 -LSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELV----T 217
++YD W ++KV L T H + YH + Y ++ S++ P+ + +D + + T
Sbjct: 1033 GINYDLDWVIQKVLLGQTVHKIQYHPVMRVYAVLVSSSVPT----RMKNDDNQYIDGKET 1088
Query: 218 DPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRG 277
D R P + QF + L SP +WE + + F E+E L+ ++ + T +G +
Sbjct: 1089 DERGPGEFLPEMEQFSMILVSPVTWEIVDKVEF--EEFEQCFSLECALLDSKQTSTGRKY 1146
Query: 278 YIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA 337
Y+ +GT ED T +G I ++DIIEVVPEP P T +K K + ++ KG VTA+C V+
Sbjct: 1147 YMIIGTGTLKGEDTTMKGSIRMYDIIEVVPEPDNPQTNHKFKPVLTEDVKGAVTAMCTVS 1206
Query: 338 GFLVTAVGQKIYIWQLKDND-LTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQP 396
G L +G K+ +W L+D++ L G+AFID ++Y+ SM S+KN IL+GD +SI L +Q
Sbjct: 1207 GHLAACIGSKVIVWSLEDDERLVGVAFIDVQIYVTSMSSIKNFILIGDAQKSIWFLGFQL 1266
Query: 397 EYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHN 456
E L+L+ +DY+ + G +D
Sbjct: 1267 EPAKLTLLGKDYQ------------SFDVGCVDF-------------------------- 1288
Query: 457 DILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKP 516
I+D+ S+ ++ D ++N+ L+ Y P +S GG +L+++ DFH+G V T ++
Sbjct: 1289 -IIDD-KSLYLIVGDTNENIDLYQYAPFNLQSFGGQKLMRRGDFHVGSQVQTMVRLPQIE 1346
Query: 517 SSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAF 576
+ +R F + +G++ + EK ++RL L +V + H GLNPRAF
Sbjct: 1347 KTEKGFEYSRRHFCLC-GTFNGSIAVISSISEKTFKRLNTLYGHLVNNLQHVAGLNPRAF 1405
Query: 577 RTYKG-KGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALS 635
R KG K + N ++ ++DG L+++F LS+ E+ E K+IG+ I+++L DIE
Sbjct: 1406 RLIKGPKQRMSTNRTKAVLDGDLIFEFAGLSIEEQKETTKQIGTTVTRIMEDLVDIECSI 1465
Query: 636 SHF 638
+HF
Sbjct: 1466 NHF 1468
>gi|353231025|emb|CCD77443.1| putative cleavage and polyadenylation specificity factor cpsf
[Schistosoma mansoni]
Length = 1825
Score = 335 bits (860), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 201/682 (29%), Positives = 339/682 (49%), Gaps = 69/682 (10%)
Query: 17 VQELLTVSLGLHGNRPLLLVRTQHELLIYQAF------RHP-------KGALKLRFKKLK 63
+ E+L +G+ +RP+L+VRT E+ ++A +P +G L+ R L
Sbjct: 1153 ILEILVYPIGIDKDRPVLMVRTSQEIAFFEALCPSPDESYPLISGTFYEGRLRWRRLPLP 1212
Query: 64 VLFVSDRSKRANEQPGLPRGV---RISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELR 120
V+ R R + + + R +R F NI ++GVF+CG +P WLF T G+LR
Sbjct: 1213 CPLVAPRRVRTDPKIMDVQSTLLTRTHMLRSFENIGDHRGVFVCGGNPIWLFATDSGQLR 1272
Query: 121 AHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTP 180
P +IDG + + AP + C GF+YF +E+R++ LP S++ ++ + L P
Sbjct: 1273 VFPHSIDGIMGSFAPLNAKICHSGFVYFTFSNEMRLATLPPGYSFNEHLGIKWITLDPVP 1332
Query: 181 HFLAYHLETKTYCIVTSTAEPSTDYYKFNGE-DKELVTDPRDSRFIPPLVSQFHVSLFSP 239
+++ YH+E+KTY +V +EP ++ N E +KE R + P + + + +++P
Sbjct: 1333 YYVQYHVESKTYAVVGIHSEPCKSVFRLNAEGNKEEDVLVRPKTCVLPTLDYYSLQMYAP 1392
Query: 240 F----------SWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSE 289
W IP T WE V CL + E T G + Y+ALG N Y E
Sbjct: 1393 NLNANHRNKQPPWLLIPNTLIEFEPWEVVTCLITAQLASEETFHGTKDYLALGANLTYGE 1452
Query: 290 DVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIY 349
++ RGRIL+ D+I+VVPEPGQPLT++K+K+I+ EQKGPVTA+ G L++A+GQKIY
Sbjct: 1453 EIPVRGRILILDVIDVVPEPGQPLTRHKLKIIHDGEQKGPVTALTSCQGHLISAIGQKIY 1512
Query: 350 IWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYK 409
IW LK+ DL G+AF+D+E+YI +++ VKNL+L D +S+ LLR+Q + R LS+V+RD
Sbjct: 1513 IWTLKNTDLVGVAFVDSELYIHNLLCVKNLVLAADVLKSVQLLRFQSDLRVLSVVSRDNI 1572
Query: 410 PTQPNSKGYYAGNPSRGI-----IDGSLVWKFLQLS----LGERLEICKKIGSKHNDILD 460
+ + ++ G + ++ + L G RL C + L
Sbjct: 1573 SREVYTSNFFVDGRRLGFMVSDELGNVTIYSYDPLDPSSRSGRRLVRCADMR------LP 1626
Query: 461 EFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSIS 520
++ ++++ ++ +L + + + + T + NT + S++
Sbjct: 1627 SRATCSLRVANRLRHALLSV---KPSSTTTASAMTAGTSATIQDSTNTVLDNLSRVDSVN 1683
Query: 521 DAPGARS------------------------RFLTWYASLDGALGFFLPLPEKNYRRLLM 556
R R ++ S +G++ P+ +K Y RL +
Sbjct: 1684 QMNNLRQSQQQSTAAQQGTTNPNSGVDPEKFRQSIYFGSQNGSIYRIGPIRDKMYSRLRI 1743
Query: 557 LQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKK 616
+ ++ H G+ P++ +Y NP + DG L+W++L L +RLEI KK
Sbjct: 1744 TEKNLIHHLGPICGMPPKSCWSYNRPQPELANPCGKVADGDLIWRYLTLPHCQRLEIAKK 1803
Query: 617 IGSKHNDILDELYDIEALSSHF 638
G I+D++ ++ A + HF
Sbjct: 1804 SGQSLESIMDDIAELIATTLHF 1825
>gi|328773280|gb|EGF83317.1| hypothetical protein BATDEDRAFT_21894 [Batrachochytrium dendrobatidis
JAM81]
Length = 1673
Score = 331 bits (849), Expect = 6e-88, Method: Compositional matrix adjust.
Identities = 199/590 (33%), Positives = 306/590 (51%), Gaps = 98/590 (16%)
Query: 98 YQGVFLCGPHPAWLF--LTSRGE---------------------------LRAHPMTIDG 128
Y GV + G P W+ L SR + LR HPM +DG
Sbjct: 1119 YSGVVVTGSRPCWIMVALQSRQQDLDVISFDNSVACSTKLPPVPLLGTNMLRFHPMPVDG 1178
Query: 129 PVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLE 188
P+ AP HNVN GFLY N K RI LP ++D WPV KVP+ T H +AYH
Sbjct: 1179 PMKCFAPLHNVNVAHGFLYINWKGLFRICQLPPQFNFDHDWPVCKVPIHKTVHKVAYHYS 1238
Query: 189 TKTYCIVTSTAE---------PSTDYYKFNGEDKEL------VTDPRDSRFIPP-----L 228
++TY I TST E S E E+ VT R+ I P
Sbjct: 1239 SQTYAIATSTPERFDIPHAQYASAVAAAVIDEGDEMPDAERKVTGIRELSEIKPGMYEAT 1298
Query: 229 VSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYS 288
V ++ + L S +WE + + L E E V+ L+ V + + T+SG + Y+A+GT Y+
Sbjct: 1299 VDRYKIELVSSVTWETV--DSIELSEAETVMALEAVDLSSKETISGKKLYLAIGTGYSRG 1356
Query: 289 EDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKI 348
ED++ RG++ L+D+IEVVP+P P T K K + +++ + P +AIC V +L+ A+G KI
Sbjct: 1357 EDLSSRGKLHLYDVIEVVPDPNNPQTNRKFKHVDSEDDRSPFSAICTVNDYLLAAIGPKI 1416
Query: 349 YIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDY 408
++QL+D ++TG+AF+D V++ S+ SVKNLI + D +S+ + +Q E L+++ RD
Sbjct: 1417 IMYQLEDGEITGVAFLDVNVFVTSLSSVKNLIQICDIQKSVWFVAFQEEPAKLAVLGRDV 1476
Query: 409 KPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFM 468
P Q GY A N ++D+ + + +
Sbjct: 1477 HPLQ----GYAA-----------------------------------NMLIDD-NQLALL 1496
Query: 469 ISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSR 528
++D DKN+ +Y P+ +S GG RLI+K + HLGQHV+ F ++R KP +DA +
Sbjct: 1497 VADGDKNLHTMIYAPDNVQSLGGERLIRKGEIHLGQHVSKFIRMRRKPLLRNDAIVFSKQ 1556
Query: 529 FLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGK------ 582
+L A+LDGAL P+ E+ ++RL L + MVT H GLNPR FR + +
Sbjct: 1557 YLNVAATLDGALEIITPVSERIFKRLYGLYSRMVTSIEHIAGLNPRGFRQAQHRVRPITL 1616
Query: 583 -GYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI 631
G+ RGI+DG L++++++LS ++ + K IGSK + ++D+L ++
Sbjct: 1617 SGFIGPPGPRGILDGDLLYEYVRLSRTQQRGLAKAIGSKDDRLMDDLLEV 1666
>gi|256079900|ref|XP_002576222.1| cleavage and polyadenylation specificity factor cpsf [Schistosoma
mansoni]
Length = 1958
Score = 313 bits (803), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 181/542 (33%), Positives = 279/542 (51%), Gaps = 76/542 (14%)
Query: 17 VQELLTVSLGLHGNRPLLLVRTQHELLIYQAF------RHP-------KGALKLRFKKLK 63
+ E+L +G+ +RP+L+VRT E+ ++A +P +G L+ R L
Sbjct: 1170 ILEILVYPIGIDKDRPVLMVRTSQEIAFFEALCPSPDESYPLISGTFYEGRLRWRRLPLP 1229
Query: 64 VLFVSDRSKRANEQPGLPRGV---RISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELR 120
V+ R R + + + R +R F NI ++GVF+CG +P WLF T G+LR
Sbjct: 1230 CPLVAPRRVRTDPKIMDVQSTLLTRTHMLRSFENIGDHRGVFVCGGNPIWLFATDSGQLR 1289
Query: 121 AHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTP 180
P +IDG + + AP + C GF+YF +E+R++ LP S++ ++ + L P
Sbjct: 1290 VFPHSIDGIMGSFAPLNAKICHSGFVYFTFSNEMRLATLPPGYSFNEHLGIKWITLDPVP 1349
Query: 181 HFLAYHLETKTYCIVTSTAEPSTDYYKFNGE-DKELVTDPRDSRFIPPLVSQFHVSLFSP 239
+++ YH+E+KTY +V +EP ++ N E +KE R + P + + + +++P
Sbjct: 1350 YYVQYHVESKTYAVVGIHSEPCKSVFRLNAEGNKEEDVLVRPKTCVLPTLDYYSLQMYAP 1409
Query: 240 F----------SWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSE 289
W IP T WE V CL + E T G + Y+ALG N Y E
Sbjct: 1410 NLNANHRNKQPPWLLIPNTLIEFEPWEVVTCLITAQLASEETFHGTKDYLALGANLTYGE 1469
Query: 290 DVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIY 349
++ RGRIL+ D+I+VVPEPGQPLT++K+K+I+ EQKGPVTA+ G L++A+GQKIY
Sbjct: 1470 EIPVRGRILILDVIDVVPEPGQPLTRHKLKIIHDGEQKGPVTALTSCQGHLISAIGQKIY 1529
Query: 350 IWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYK 409
IW LK+ DL G+AF+D+E+YI +++ VKNL+L D +S+ LLR+Q + R LS+V+RD
Sbjct: 1530 IWTLKNTDLVGVAFVDSELYIHNLLCVKNLVLAADVLKSVQLLRFQSDLRVLSVVSRDNI 1589
Query: 410 PTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMI 469
S+ Y N +DG +GFM+
Sbjct: 1590 -----SREVYTSN---FFVDG--------------------------------RRLGFMV 1609
Query: 470 SDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI---------RCKPSSIS 520
SD+ NV ++ Y P S G RL++ D L ++ KPSS +
Sbjct: 1610 SDELGNVTIYSYDPLDPSSRSGRRLVRCADMRLPSRATCSLRVANRLRHALLSVKPSSTT 1669
Query: 521 DA 522
A
Sbjct: 1670 TA 1671
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 31/107 (28%), Positives = 57/107 (53%)
Query: 532 WYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSR 591
++ S +G++ P+ +K Y RL + + ++ H G+ P++ +Y NP
Sbjct: 1852 YFGSQNGSIYRIGPIRDKMYSRLRITEKNLIHHLGPICGMPPKSCWSYNRPQPELANPCG 1911
Query: 592 GIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
+ DG L+W++L L +RLEI KK G I+D++ ++ A + HF
Sbjct: 1912 KVADGDLIWRYLTLPHCQRLEIAKKSGQSLESIMDDIAELIATTLHF 1958
Score = 42.0 bits (97), Expect = 0.90, Method: Compositional matrix adjust.
Identities = 17/45 (37%), Positives = 26/45 (57%)
Query: 421 GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSM 465
NP + DG L+W++L L +RLEI KK G I+D+ + +
Sbjct: 1907 ANPCGKVADGDLIWRYLTLPHCQRLEIAKKSGQSLESIMDDIAEL 1951
>gi|426235955|ref|XP_004011942.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1 [Ovis aries]
Length = 819
Score = 308 bits (790), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 163/365 (44%), Positives = 207/365 (56%), Gaps = 61/365 (16%)
Query: 101 VFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLP 160
VF+CGP P WL +T RG LR HPM IDGP+ + APFHN+NCPRGFLYFN + ELRISVLP
Sbjct: 503 VFICGPSPHWLLVTGRGALRLHPMGIDGPIDSFAPFHNINCPRGFLYFNRQGELRISVLP 562
Query: 161 THLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPR 220
+LSYDAPWPVRK+PL+CT H++AYH+E+K Y + TST+ P T + GE+KE T R
Sbjct: 563 AYLSYDAPWPVRKIPLRCTAHYVAYHVESKVYAVATSTSTPCTRVPRMTGEEKEFETIER 622
Query: 221 DSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGY-- 278
D R++ P F + L SP SWE IP L E G RG+
Sbjct: 623 DERYVHPQQEAFCIQLISPVSWEAIPNARIELEE------------XXXXXXXGSRGHVY 670
Query: 279 -IALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA 337
+ G+ E+VTCRGRIL+ D+IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH
Sbjct: 671 SVPAGSCLKEGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCN 730
Query: 338 GFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPE 397
G LV+A+GQK L + G R+ +L +
Sbjct: 731 GHLVSAIGQKXXXXXLPPH-------------------------AGLNPRAFRMLHV--D 763
Query: 398 YRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHND 457
R L N R ++DG L+ ++L LS ER E+ KKIG+ +
Sbjct: 764 RRVLQ-------------------NAVRNVLDGELLNRYLYLSTMERGELAKKIGTTPDI 804
Query: 458 ILDEF 462
ILD+
Sbjct: 805 ILDDL 809
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 29/70 (41%), Positives = 43/70 (61%)
Query: 569 GGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEL 628
GLNPRAFR N R ++DG L+ ++L LS ER E+ KKIG+ + ILD+L
Sbjct: 750 AGLNPRAFRMLHVDRRVLQNAVRNVLDGELLNRYLYLSTMERGELAKKIGTTPDIILDDL 809
Query: 629 YDIEALSSHF 638
+ + +++HF
Sbjct: 810 LETDRVTAHF 819
>gi|168021793|ref|XP_001763425.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162685218|gb|EDQ71614.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1452
Score = 291 bits (745), Expect = 7e-76, Method: Compositional matrix adjust.
Identities = 203/655 (30%), Positives = 321/655 (49%), Gaps = 97/655 (14%)
Query: 17 VQELLTVSLGLHGNRPLLLVR-TQHELLIYQAF-------------RHPKGALK------ 56
V ++ S G RP LL + +L Y AF R +LK
Sbjct: 861 VSQICFESWGEKFGRPFLLATLSDGTMLCYHAFSYDANESSDALEFRETATSLKDLSRLT 920
Query: 57 -LRFKKLKVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTS 115
LRF ++ + +VS + A + + ++ F N+ + GVF+ G P WL +
Sbjct: 921 HLRFARIPIDWVSGQEDGA-------KVLYETKFCSFKNVGSFPGVFVTGLRPTWL-MVC 972
Query: 116 RGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVP 175
RG LR HP DG + P HNVNC GF+Y A+ +L+I LP+ L YD WPV+K+P
Sbjct: 973 RGRLRPHPQFCDGAILGFTPLHNVNCAHGFIYITAQGQLKICQLPSLLFYDNDWPVQKIP 1032
Query: 176 LKCTPHFLAYHLETKTYCIVTST--AEPSTDYYKFNG----EDKELVTDPRDSRFIPPLV 229
L+ TPH + YH + Y ++ ST + P++ +G + +E R +
Sbjct: 1033 LRGTPHQITYHSDVNLYALIISTPVSRPTSQVLMGDGHPFDQQQENSIGEDGQRLVTS-- 1090
Query: 230 SQFHVSLFSPF----SWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNY 285
+ V + P +WE + +H E+ L ++ VS++ T + +A+GT+Y
Sbjct: 1091 EDYEVRIIEPAQPGGNWE--AKAAIKMHLTENALTVRIVSIK-NITTDQTQTLLAIGTSY 1147
Query: 286 NYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVG 345
EDV +GRI+L + + +PG + + +Y+KE KG ++AI + G L+ A+G
Sbjct: 1148 VQGEDVAAKGRIILVSVGKDPQDPG-----SWAREVYSKELKGSISAIASLQGHLLIAIG 1202
Query: 346 QKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVA 405
KI + ++L G AF D +Y+ S+ VKN IL GD +SI L ++ + L+L+A
Sbjct: 1203 PKIILHSWNGSELNGAAFFDAPLYVVSLNIVKNFILFGDIHKSIYFLCWKEDGAQLTLLA 1262
Query: 406 RDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSM 465
+D+ S YA + +IDG S++
Sbjct: 1263 KDF-----GSLDCYA---TEFLIDG--------------------------------STL 1282
Query: 466 GFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPG- 524
++SD KN+ +F Y P++ ES G +L+ + +FHLG HVN F +++ P+ PG
Sbjct: 1283 SLLVSDSRKNLQIFSYAPKSMESWKGQKLLSRAEFHLGAHVNKFHRLQMLPT-----PGS 1337
Query: 525 ARS-RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKG 583
ARS R+ + +LDGA+ + PL E +RRL LQ +V SH G+NPRAFR ++ G
Sbjct: 1338 ARSNRYAVLFGTLDGAIDYLAPLDELTFRRLHTLQRKLVDCVSHVAGVNPRAFRQFRCDG 1397
Query: 584 YYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
I+D L+ + L L E+LEI ++IG+ +L L D+ ALS+ F
Sbjct: 1398 KAHRPGPDNIVDCELLSHYDMLPLDEQLEIARQIGTTRAHVLSNLRDL-ALSTSF 1451
>gi|196012166|ref|XP_002115946.1| hypothetical protein TRIADDRAFT_59883 [Trichoplax adhaerens]
gi|190581722|gb|EDV21798.1| hypothetical protein TRIADDRAFT_59883 [Trichoplax adhaerens]
Length = 1187
Score = 283 bits (723), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 135/249 (54%), Positives = 172/249 (69%), Gaps = 1/249 (0%)
Query: 126 IDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAY 185
+DG V APF+ NCP GFLYFN++ +LRI VL +YD PWPV KVPL+ T HF+ +
Sbjct: 768 VDGYVKCFAPFNIANCPNGFLYFNSEEDLRICVLDQRFTYDCPWPVHKVPLRNTLHFITH 827
Query: 186 HLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEI 245
H TKTY I++ST EDKE + + RFI V +F + L + +WE I
Sbjct: 828 HFVTKTYVIISSTMTVCEKMPHITTEDKEFIPVEKGDRFIHAPVEKFCLQLITSETWEII 887
Query: 246 PQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEV 305
P + EWEHV CLK+V ++ E T+SGL+ +IA+GT E+V CRGRI++FD+IEV
Sbjct: 888 PDAEIQMAEWEHVTCLKSVKLKSEETVSGLKEFIAVGTTNVCGEEVACRGRIVIFDVIEV 947
Query: 306 VPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDN-DLTGIAFI 364
VPEPG+PLTKNKIK Y KEQKGPVTAI V GFLVT++GQKIYIW+ +DN DL G+AFI
Sbjct: 948 VPEPGKPLTKNKIKTYYDKEQKGPVTAITCVEGFLVTSIGQKIYIWEFRDNKDLIGMAFI 1007
Query: 365 DTEVYIASM 373
DT +YI S+
Sbjct: 1008 DTLIYIHSL 1016
Score = 98.6 bits (244), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 47/148 (31%), Positives = 82/148 (55%), Gaps = 3/148 (2%)
Query: 485 ARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFL 544
A ES+GG L+++ + G + + FF+ + + + ++ +TW+ +LDG++G L
Sbjct: 1039 APESHGGQFLVRRAEIQTGSNAHAFFRTKVRAL---NQRQNENKHITWFGTLDGSIGLLL 1095
Query: 545 PLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQ 604
P+ EK YRRL LQ + + GLN +AFRT++ N R I+DG L+ ++
Sbjct: 1096 PVDEKEYRRLFSLQAKLSIYLEQNAGLNQKAFRTFRSHQKKLQNSMRNILDGDLLKRYFH 1155
Query: 605 LSLGERLEICKKIGSKHNDILDELYDIE 632
L ER ++ K+I S I+++L +E
Sbjct: 1156 LGFVERRDLAKQIMSTPEQIINDLTKLE 1183
>gi|356530945|ref|XP_003534039.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Glycine max]
Length = 1449
Score = 282 bits (722), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 185/592 (31%), Positives = 289/592 (48%), Gaps = 73/592 (12%)
Query: 58 RFKKLKVLFVS-DRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSR 116
R + L+ + V D R + G P Q+ F NI YQG FL G PAW+ + R
Sbjct: 906 RLRNLRFVRVPLDAYPREDTSNGSP----CQQITIFKNIGSYQGFFLSGSRPAWVMVL-R 960
Query: 117 GELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPL 176
LR HP DG + HNVNC G +Y ++ L+I LP+ +YD+ WPV+K+PL
Sbjct: 961 ERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGVLKICQLPSGSNYDSYWPVQKIPL 1020
Query: 177 KCTPHFLAYHLETKTYCIVTS--TAEP-----STDYYKFNGEDKELVTDPRD-SRFIPPL 228
K TPH + Y E Y ++ S +P S FN +++ +P + +RF P
Sbjct: 1021 KATPHQVTYFAEKNLYPLIVSFPVLKPLNQVISLVDQDFNHQNESQNMNPDEQNRFYP-- 1078
Query: 229 VSQFHVSLFSPFS----WEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTN 284
+ +F V + P W+ + P+ E+ L ++ V++ T +A+GT
Sbjct: 1079 IDEFEVRIMEPEKSGGPWQ--TKATIPMQSSENALTVRMVTL-LNTTSKENETLLAIGTA 1135
Query: 285 YNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAV 344
Y EDV RGRILLF + ++ P + + +Y+KE KG ++A+ + G L+ A
Sbjct: 1136 YVQGEDVAARGRILLFSLGKITDNP-----QTLVSEVYSKELKGAISALASLQGHLLIAS 1190
Query: 345 GQKIYIWQLKDNDLTGIAFIDT-EVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSL 403
G KI + + +L GIAF D +++ S+ VKN IL+GD +SI L ++ + LSL
Sbjct: 1191 GPKIILHKWNGTELNGIAFFDAPPLHVVSLNIVKNFILIGDIHKSIYFLSWKEQGAQLSL 1250
Query: 404 VARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFS 463
+A+D+ G + +IDG S
Sbjct: 1251 LAKDF--------GSLDCFATEFLIDG--------------------------------S 1270
Query: 464 SMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSS--ISD 521
++ M+SD ++N+ +F Y P+ ES G +L+ + +FH+G HV F +++ +S
Sbjct: 1271 TLSLMVSDDNRNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLSTSDRAGS 1330
Query: 522 APGA--RSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTY 579
PG+ +RF + +LDG++G PL E +RRL LQ +V H GLNPRAFR +
Sbjct: 1331 VPGSDKTNRFALLFGTLDGSIGCIAPLDEITFRRLQSLQRKLVDAVPHVAGLNPRAFRLF 1390
Query: 580 KGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI 631
+ G I+D L+ + L L E+LEI +IG+ + IL L D+
Sbjct: 1391 RSNGKAHRPGPDSIVDCELLCHYEMLPLEEQLEIANQIGTTRSQILSNLSDL 1442
>gi|449524573|ref|XP_004169296.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like, partial [Cucumis sativus]
Length = 741
Score = 281 bits (719), Expect = 7e-73, Method: Compositional matrix adjust.
Identities = 180/570 (31%), Positives = 275/570 (48%), Gaps = 70/570 (12%)
Query: 80 LPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNV 139
+P G ++ F NI+GYQG+FLCG PAW F+ R LR HP DGP+ A HNV
Sbjct: 217 MPNGTLSRRLSIFKNISGYQGLFLCGSRPAW-FMVFRERLRVHPQLCDGPIVAFAVLHNV 275
Query: 140 NCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTST- 198
NC G +Y ++ L+I LP+ +YD WPV+KVPLK TPH + Y E Y ++ S
Sbjct: 276 NCNHGLIYVTSQGVLKICQLPSTSNYDNYWPVQKVPLKGTPHQVTYFHEKNLYPVIISAP 335
Query: 199 --------AEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFS----WEEIP 246
D + E+ L D + V +F + + P W+
Sbjct: 336 VQKPLNQVLSSMVDQDVGHVENHNLSADELQQTYS---VEEFEIRILEPEKSGGPWQ--T 390
Query: 247 QTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVV 306
+ +H E+ L ++ V++ T +A+GT Y EDV RGR+LLF + +
Sbjct: 391 RATIAMHSSENALTIRVVTL-LNTTTKENETLLAVGTAYVQGEDVAARGRVLLFSVGKDA 449
Query: 307 PEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDT 366
++ + +Y+KE KG ++A+ + G L+ A G KI + + +L GIAF D
Sbjct: 450 DN-----SQTLVSEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGAELNGIAFYDV 504
Query: 367 -EVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSR 425
+Y+ S+ VKN IL+GD +SI L ++ + LSL+A+D+ S YA +
Sbjct: 505 PPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLSLLAKDF-----GSLDCYA---TE 556
Query: 426 GIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEA 485
+IDG S++ +SD KN+ +F Y P++
Sbjct: 557 FLIDG--------------------------------STLSLTVSDDQKNIQIFYYAPKS 584
Query: 486 RESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARS----RFLTWYASLDGALG 541
ES G +L+ + +FH+G HV F +++ +S A S RF + +LDG++G
Sbjct: 585 TESWKGQKLLSRAEFHVGAHVTKFLRLQMLSTSSDKACSTVSDKTNRFALLFGTLDGSIG 644
Query: 542 FFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWK 601
PL E +RRL LQ + H GGLNPR+FR + G I+D L+
Sbjct: 645 CIAPLDELTFRRLQSLQKKLGDAVPHVGGLNPRSFRQFHSNGKVHRRGPDSIVDCELLCH 704
Query: 602 FLQLSLGERLEICKKIGSKHNDILDELYDI 631
+ L L E+L+I +IG+ + IL L D+
Sbjct: 705 YEMLPLEEQLDIAHQIGTTRSQILSNLNDL 734
>gi|393907594|gb|EJD74706.1| hypothetical protein LOAG_18016 [Loa loa]
Length = 398
Score = 281 bits (719), Expect = 7e-73, Method: Compositional matrix adjust.
Identities = 157/431 (36%), Positives = 244/431 (56%), Gaps = 42/431 (9%)
Query: 211 EDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEG 270
+DK + F+ P + Q+ + L+SP W+ + E+E V C + V + EG
Sbjct: 7 DDKTFEEHEKPDTFVYPEMDQYKLQLYSPEDWKPVQNVEVLFEEFEVVTCCEEVVLRSEG 66
Query: 271 TLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPV 330
T+SG++ Y+A+GT NY E+V RGRI++ +IIEVVPEPGQP +K++IK +Y KEQKGPV
Sbjct: 67 TVSGVQNYLAVGTACNYGEEVLVRGRIIISEIIEVVPEPGQPTSKHRIKTLYDKEQKGPV 126
Query: 331 TAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIA 390
T++C G+L+T +GQK++IW KDN+L GI+F+D Y+ ++ V+NL L D RS+A
Sbjct: 127 TSLCSCNGYLLTGMGQKVFIWLFKDNNLQGISFLDMHFYVHQLIGVRNLALACDMYRSVA 186
Query: 391 LLRYQPEYRTLSLVARDYKP-TQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICK 449
LLRYQ EY+ LSL +RD + QP + IID
Sbjct: 187 LLRYQEEYKALSLASRDMRSDVQPPMAAQF-------IIDN------------------- 220
Query: 450 KIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 509
MGF++SD+ N+ +F Y PE ES GG +L + + ++G VN+F
Sbjct: 221 -------------KQMGFVMSDEAANIAIFNYLPETLESLGGEKLTLRAEINIGTVVNSF 267
Query: 510 FKIRCKPSS--ISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSH 567
+++ SS + + + R +ASLDG+ GF PL EK +RRL MLQ +M +
Sbjct: 268 IRVKGHISSGFVENELFSLERQSVLFASLDGSFGFLRPLTEKVFRRLHMLQQLMSSMVPQ 327
Query: 568 TGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDE 627
GLN + R + +R ++DG +V ++L LSL E+ ++ +K+G+ I+D+
Sbjct: 328 PAGLNAKGARAARPPRPNHYLNTRNLVDGDMVMQYLHLSLPEKNDLARKLGTSRYHIIDD 387
Query: 628 LYDIEALSSHF 638
L +I +++H+
Sbjct: 388 LIEICRVTAHY 398
>gi|449470342|ref|XP_004152876.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Cucumis sativus]
Length = 1504
Score = 281 bits (718), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 180/570 (31%), Positives = 275/570 (48%), Gaps = 70/570 (12%)
Query: 80 LPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNV 139
+P G ++ F NI+GYQG+FLCG PAW F+ R LR HP DGP+ A HNV
Sbjct: 980 MPNGTLSCRLSIFKNISGYQGLFLCGSRPAW-FMVFRERLRVHPQLCDGPIVAFAVLHNV 1038
Query: 140 NCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTST- 198
NC G +Y ++ L+I LP+ +YD WPV+KVPLK TPH + Y E Y ++ S
Sbjct: 1039 NCNHGLIYVTSQGVLKICQLPSTSNYDNYWPVQKVPLKGTPHQVTYFHEKNLYPVIISAP 1098
Query: 199 --------AEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFS----WEEIP 246
D + E+ L D + V +F + + P W+
Sbjct: 1099 VQKPLNQVLSSMVDQDVGHVENHNLSADELQQTYS---VEEFEIRILEPEKSGGPWQ--T 1153
Query: 247 QTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVV 306
+ +H E+ L ++ V++ T +A+GT Y EDV RGR+LLF + +
Sbjct: 1154 RATIAMHSSENALTIRVVTL-LNTTTKENETLLAVGTAYVQGEDVAARGRVLLFSVGKDA 1212
Query: 307 PEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDT 366
++ + +Y+KE KG ++A+ + G L+ A G KI + + +L GIAF D
Sbjct: 1213 DN-----SQTLVSEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGAELNGIAFYDV 1267
Query: 367 -EVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSR 425
+Y+ S+ VKN IL+GD +SI L ++ + LSL+A+D+ S YA +
Sbjct: 1268 PPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLSLLAKDF-----GSLDCYA---TE 1319
Query: 426 GIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEA 485
+IDG S++ +SD KN+ +F Y P++
Sbjct: 1320 FLIDG--------------------------------STLSLTVSDDQKNIQIFYYAPKS 1347
Query: 486 RESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARS----RFLTWYASLDGALG 541
ES G +L+ + +FH+G HV F +++ +S A S RF + +LDG++G
Sbjct: 1348 TESWKGQKLLSRAEFHVGAHVTKFLRLQMLSTSSDKACSTVSDKTNRFALLFGTLDGSIG 1407
Query: 542 FFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWK 601
PL E +RRL LQ + H GGLNPR+FR + G I+D L+
Sbjct: 1408 CIAPLDELTFRRLQSLQKKLGDAVPHVGGLNPRSFRQFHSNGKVHRRGPDSIVDCELLCH 1467
Query: 602 FLQLSLGERLEICKKIGSKHNDILDELYDI 631
+ L L E+L+I +IG+ + IL L D+
Sbjct: 1468 YEMLPLEEQLDIAHQIGTTRSQILSNLNDL 1497
>gi|218194461|gb|EEC76888.1| hypothetical protein OsI_15095 [Oryza sativa Indica Group]
Length = 1503
Score = 278 bits (712), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 197/664 (29%), Positives = 315/664 (47%), Gaps = 86/664 (12%)
Query: 30 NRPLLL-VRTQHELLIYQAFRH-------------PKG------ALKLRFKKLKVLFVSD 69
+RP L + LL Y AF + P+G A R + L+ VS
Sbjct: 857 SRPFLFGLLNDGTLLCYHAFSYEASESNVKRVPLSPQGSADHHNASDSRLRNLRFHRVSI 916
Query: 70 RSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGP 129
+ P L R ++ F+N+ GY+G+FL G PAW+ + R LR HP DGP
Sbjct: 917 DITSREDIPTLGR----PRITTFNNVGGYEGLFLSGTRPAWV-MVCRQRLRVHPQLCDGP 971
Query: 130 VSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLET 189
+ HNVNC GF+Y ++ L+I LP+ +YD WPV+KVPL TPH + Y+ E
Sbjct: 972 IEAFTVLHNVNCSHGFIYVTSQGFLKICQLPSAYNYDNYWPVQKVPLHGTPHQVTYYAEQ 1031
Query: 190 KTYCIVTSTA---------EPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFS-- 238
Y ++ S D + D ++ + D+ V +F V +
Sbjct: 1032 SLYPLIVSVPVVRPLNQVLSSMADQESVHHMDNDVTS--TDALHKTYTVDEFEVRILELE 1089
Query: 239 --PFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGR 296
WE ++ P+ +E+ L ++ V++ + T +A+GT Y EDV RGR
Sbjct: 1090 KPGGHWE--TKSTIPMQLFENALTVRIVTL-HNTTTKENETLLAIGTAYVLGEDVAARGR 1146
Query: 297 ILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDN 356
+LLF ++ ++N + +Y+KE KG V+A+ + G L+ A G KI + +
Sbjct: 1147 VLLFSFMK------SENSQNLVTEVYSKESKGAVSAVASLQGHLLIASGPKITLNKWTGA 1200
Query: 357 DLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSK 416
+LT +AF D +++ S+ VKN +L GD +SI L ++ + LSL+A+D+ +
Sbjct: 1201 ELTAVAFYDAPLHVVSLNIVKNFVLFGDIHKSIYFLSWKEQGSQLSLLAKDFGSLDCFAT 1260
Query: 417 GYYAGNPSRGIIDGS----------------------LVWKFL--QLSLGERLEICKKIG 452
+ + ++ L WK QLSL + K G
Sbjct: 1261 EFLIDGSTLSLVASDSDKNVQVKNFVLFGDIHKSIYFLSWKEQGSQLSL-----LAKDFG 1315
Query: 453 SKH---NDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 509
S + L + S++ + SD DKNV +F Y P+ ES G +L+ + +FH+G H+ F
Sbjct: 1316 SLDCFATEFLIDGSTLSLVASDSDKNVQIFYYAPKMVESWKGQKLLSRAEFHVGAHITKF 1375
Query: 510 FKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
+++ P+ + +RF + +LDG +G P+ E +RRL LQ +V H
Sbjct: 1376 LRLQMLPTQ-GLSSEKTNRFALLFGNLDGGIGCIAPIDELTFRRLQSLQRKLVDAVPHVC 1434
Query: 570 GLNPRAFRTY--KGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDE 627
GLNPR+FR + GKG+ G + IID L+ + LSL E+L++ ++IG+ + IL
Sbjct: 1435 GLNPRSFRQFHSNGKGHRPGPDN--IIDFELLAHYEMLSLDEQLDVAQQIGTTRSQILSN 1492
Query: 628 LYDI 631
DI
Sbjct: 1493 FSDI 1496
>gi|356559917|ref|XP_003548242.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Glycine max]
Length = 1447
Score = 277 bits (709), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 183/594 (30%), Positives = 284/594 (47%), Gaps = 77/594 (12%)
Query: 58 RFKKLKVLFVS-DRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSR 116
R + L+ + V D R + G P Q+ F NI Y+G FL G PAW+ + R
Sbjct: 904 RLRNLRFVRVPLDAYAREDTSNGPP----CQQITIFKNIGSYEGFFLSGSRPAWVMVL-R 958
Query: 117 GELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPL 176
LR HP DG + HNVNC +G +Y ++ L+I LP+ +YD+ WPV+K+PL
Sbjct: 959 ERLRVHPQLCDGSIVAFTVLHNVNCNQGLIYVTSQGVLKICQLPSGSNYDSYWPVQKIPL 1018
Query: 177 KCTPHFLAYHLETKTYCIVTS--TAEPSTDYYKFNGED------KELVTDPRDSRFIPPL 228
K TPH + Y E Y ++ S +P +D + + +RF P
Sbjct: 1019 KATPHQVTYFAEKNLYPLIVSFPVLKPLNQVISLVDQDINHQNESQNMNPDEQNRFYP-- 1076
Query: 229 VSQFHVSLFSPFS----WEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTN 284
+ +F V + P W+ + P+ E+ L ++ V++ T +A+GT
Sbjct: 1077 IDEFEVRIMEPEKSGGPWQ--TKATIPMQSSENALTVRMVTL-VNTTSKENETLLAIGTA 1133
Query: 285 YNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAV 344
Y EDV RGRILLF + + P + + +Y+KE KG ++A+ + G L+ A
Sbjct: 1134 YVQGEDVAARGRILLFSLGKNTDNP-----QTLVSEVYSKELKGAISALASLQGHLLIAS 1188
Query: 345 GQKIYIWQLKDNDLTGIAFIDT-EVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSL 403
G KI + + +L GIAF D +++ S+ VKN IL+GD +SI L ++ + LSL
Sbjct: 1189 GPKIILHKWNGTELNGIAFFDAPPLHVVSLNIVKNFILIGDIHKSIYFLSWKEQGAQLSL 1248
Query: 404 VARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFS 463
+A+D+ G + +IDG S
Sbjct: 1249 LAKDF--------GSLDCFATEFLIDG--------------------------------S 1268
Query: 464 SMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAP 523
++ M+SD ++N+ +F Y P+ ES G +L+ + +FH+G HV F +R + S SD
Sbjct: 1269 TLSLMVSDDNRNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKF--LRLQMLSTSDRA 1326
Query: 524 GA------RSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFR 577
GA +RF + +LDG++G PL E +RRL LQ +V H GLNPRAFR
Sbjct: 1327 GAVPGSDKTNRFALLFGTLDGSIGCIAPLDEITFRRLQSLQRKLVDAVPHVAGLNPRAFR 1386
Query: 578 TYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI 631
++ G I+D L+ + L L E+LEI ++G+ + IL L D+
Sbjct: 1387 LFRSNGKAHRPGPDSIVDCELLCHYEMLPLEEQLEIAHQVGTTRSQILSNLSDL 1440
>gi|302761560|ref|XP_002964202.1| hypothetical protein SELMODRAFT_82277 [Selaginella moellendorffii]
gi|300167931|gb|EFJ34535.1| hypothetical protein SELMODRAFT_82277 [Selaginella moellendorffii]
Length = 1413
Score = 276 bits (707), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 201/656 (30%), Positives = 317/656 (48%), Gaps = 91/656 (13%)
Query: 12 MDETIVQELLTVSLGLHGNRPLLLVR-TQHELLIYQAFRHP---KGA--------LKLRF 59
M + V ++ + G RP + V + LL Y+AF + GA LRF
Sbjct: 820 MSKIKVVDICVDTWGEKYGRPFVFVLLSDGTLLSYRAFIYEGQDSGAHASDGTSFRNLRF 879
Query: 60 KKLKV-LFVSDRSKRANEQPGLPRGVR-ISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRG 117
+L++ L + + A+E VR + ++ F ++ G QG+FL G P WL + R
Sbjct: 880 LRLQLDLELGEEDSNADE-------VRSVQKIIPFKDVGGLQGLFLAGGKPTWLMIF-RE 931
Query: 118 ELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLK 177
++R HP DGP+ HNVNC G +Y ++ L+I L L+YD WPV+K+PLK
Sbjct: 932 QIRLHPQASDGPIVAFTSLHNVNCQHGLIYVTNEASLKICRLSNILNYDNDWPVQKIPLK 991
Query: 178 CTPHFLAYHLETKTYCIV--------TSTAEPS-TDYYKFNGEDKELVTDPRDSRFIPPL 228
TPH +A+H + Y +V TS PS D + D+ +D D + +
Sbjct: 992 GTPHQMAHHPDLNIYVLVLSFSVSVPTSLVLPSAADGPPGHQIDQSEASDGLDPQKMVQ- 1050
Query: 229 VSQFHVSLFSPFS----WEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTN 284
V F V L P + WE F E+VL ++ VS++ T + +A+GT
Sbjct: 1051 VDDFEVRLLEPMAQGVPWETKDTIKF--QPAENVLTVRIVSIKNAAT-EQVENLLAIGTG 1107
Query: 285 YNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAV 344
Y EDV RGRI+L + E +P P K K +Y+KE KG ++A+ + G L+ A+
Sbjct: 1108 YLQGEDVASRGRIILVSLGE---DPSDP--KVWAKELYSKELKGAISALAALQGHLLLAI 1162
Query: 345 GQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLV 404
G KI + ++L G AF D +Y+ S+ VKN +L GD+ +SI L ++ E L L+
Sbjct: 1163 GPKIILHTWNGSELIGTAFFDAPLYVVSLNIVKNFVLFGDFHKSIYFLCWKEEGAQLVLL 1222
Query: 405 ARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSS 464
A+D+ S YA + +IDG S+
Sbjct: 1223 AKDF-----GSLDCYA---TEFLIDG--------------------------------ST 1242
Query: 465 MGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPG 524
+ ++SD KN+ +F Y P+ ES G +L+ + +FHLG HV F +++ + PG
Sbjct: 1243 LSLLVSDSRKNIQVFSYAPKNAESWKGQKLLPRVEFHLGSHVTKFLRLQ-----MLQTPG 1297
Query: 525 AR--SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGK 582
+ +RF + +LDG +G+ PL E +RRL LQ +V H GLNP+A+R ++
Sbjct: 1298 SSRTNRFALCFGTLDGGIGYITPLDELTFRRLQTLQRKLVDLVPHVAGLNPKAYRQFQAN 1357
Query: 583 GYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
G + + +D + ++ LSL +++ I ++IG+ I L DI +S F
Sbjct: 1358 GEHHKHGPDNTVDSEQLREYESLSLDKQVAIARQIGTTRQQIFANLRDISLSTSFF 1413
>gi|302814354|ref|XP_002988861.1| hypothetical protein SELMODRAFT_184138 [Selaginella moellendorffii]
gi|300143432|gb|EFJ10123.1| hypothetical protein SELMODRAFT_184138 [Selaginella moellendorffii]
Length = 1413
Score = 276 bits (707), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 201/656 (30%), Positives = 317/656 (48%), Gaps = 91/656 (13%)
Query: 12 MDETIVQELLTVSLGLHGNRPLLLVR-TQHELLIYQAFRHP---KGA--------LKLRF 59
M + V ++ + G RP + V + LL Y+AF + GA LRF
Sbjct: 820 MSKIKVVDICVDTWGEKYGRPFVFVLLSDGTLLSYRAFIYEGQDSGAHASDGTSFRNLRF 879
Query: 60 KKLKV-LFVSDRSKRANEQPGLPRGVR-ISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRG 117
+L++ L + + A+E VR + ++ F ++ G QG+FL G P WL + R
Sbjct: 880 LRLQLDLELGEEDSNADE-------VRSVQKIIPFKDVGGLQGLFLAGGKPTWLMIF-RE 931
Query: 118 ELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLK 177
++R HP DGP+ HNVNC G +Y ++ L+I L L+YD WPV+K+PLK
Sbjct: 932 QIRLHPQASDGPIVAFTSLHNVNCQHGLIYVTNEASLKICRLSNILNYDNDWPVQKIPLK 991
Query: 178 CTPHFLAYHLETKTYCIV--------TSTAEPS-TDYYKFNGEDKELVTDPRDSRFIPPL 228
TPH +A+H + Y +V TS PS D + D+ +D D + +
Sbjct: 992 GTPHQMAHHPDLNIYVLVLSFSVSVPTSLVLPSAADGPPGHQIDQSEASDGLDPQKMVQ- 1050
Query: 229 VSQFHVSLFSPFS----WEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTN 284
V F V L P + WE F E+VL ++ VS++ T + +A+GT
Sbjct: 1051 VDDFEVRLLEPMAQGVPWETKDTIKF--QPAENVLTVRIVSIKNAAT-EQVENLLAIGTG 1107
Query: 285 YNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAV 344
Y EDV RGRI+L + E +P P K K +Y+KE KG ++A+ + G L+ A+
Sbjct: 1108 YLQGEDVASRGRIILVSLGE---DPSDP--KVWAKELYSKELKGAISALAALQGHLLLAI 1162
Query: 345 GQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLV 404
G KI + ++L G AF D +Y+ S+ VKN +L GD+ +SI L ++ E L L+
Sbjct: 1163 GPKIILHTWNGSELIGTAFFDAPLYVVSLNIVKNFVLFGDFHKSIYFLCWKEEGAQLVLL 1222
Query: 405 ARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSS 464
A+D+ S YA + +IDG S+
Sbjct: 1223 AKDF-----GSLDCYA---TEFLIDG--------------------------------ST 1242
Query: 465 MGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPG 524
+ ++SD KN+ +F Y P+ ES G +L+ + +FHLG HV F +++ + PG
Sbjct: 1243 LSLLVSDSRKNIQVFSYAPKNAESWKGQKLLPRVEFHLGSHVTKFLRLQ-----MLQTPG 1297
Query: 525 AR--SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGK 582
+ +RF + +LDG +G+ PL E +RRL LQ +V H GLNP+A+R ++
Sbjct: 1298 SSRTNRFALCFGTLDGGIGYITPLDELTFRRLQTLQRKLVDLVPHVAGLNPKAYRQFQAN 1357
Query: 583 GYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
G + + +D + ++ LSL +++ I ++IG+ I L DI +S F
Sbjct: 1358 GEHHKHGPDNTVDSEQLREYESLSLDKQVAIARQIGTTRQQIFANLRDISLSTSFF 1413
>gi|343962533|dbj|BAK62854.1| cleavage and polyadenylation specificity factor 160 kDa subunit
[Pan troglodytes]
Length = 269
Score = 276 bits (707), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 134/279 (48%), Positives = 175/279 (62%), Gaps = 40/279 (14%)
Query: 208 FNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSME 267
GE+KE T RD R+I P F + L SP SWE IP L EWEHV C+K VS+
Sbjct: 1 MTGEEKEFETIERDERYIHPQQEAFSIQLISPVSWEAIPNARIELQEWEHVTCMKTVSLR 60
Query: 268 YEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQK 327
E T+SGL+GY+A GT E+VTCRGRIL+ D+IEVVPEPGQPLTKNK K++Y KEQK
Sbjct: 61 SEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQK 120
Query: 328 GPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYAR 387
GPVTA+CH G LV+A+GQKI++W L+ ++LTG+AFIDT++YI M+SVKN IL D +
Sbjct: 121 GPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMAFIDTQLYIHQMISVKNFILAADVMK 180
Query: 388 SIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEI 447
SI+LLRYQ E +TLSLV+RD KP + S + N
Sbjct: 181 SISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDN------------------------- 215
Query: 448 CKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEAR 486
+ +GF++SD+D+N++++MY PE
Sbjct: 216 ---------------AQLGFLVSDRDRNLMVYMYLPEGE 239
>gi|75145059|sp|Q7XWP1.2|CPSF1_ORYSJ RecName: Full=Probable cleavage and polyadenylation specificity
factor subunit 1; AltName: Full=Cleavage and
polyadenylation specificity factor 160 kDa subunit;
Short=CPSF 160 kDa subunit
gi|38345987|emb|CAD39979.2| OSJNBa0032B23.5 [Oryza sativa Japonica Group]
Length = 1441
Score = 276 bits (706), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 190/637 (29%), Positives = 301/637 (47%), Gaps = 94/637 (14%)
Query: 30 NRPLLL-VRTQHELLIYQAFRH-------------PKG------ALKLRFKKLKVLFVSD 69
+RP L + LL Y AF + P+G A R + L+ VS
Sbjct: 857 SRPFLFGLLNDGTLLCYHAFSYEASESNVKRVPLSPQGSADHHNASDSRLRNLRFHRVSI 916
Query: 70 RSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGP 129
+ P L R ++ F+N+ GY+G+FL G PAW+ + R LR HP DGP
Sbjct: 917 DITSREDIPTLGR----PRITTFNNVGGYEGLFLSGTRPAWV-MVCRQRLRVHPQLCDGP 971
Query: 130 VSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLET 189
+ HNVNC GF+Y ++ L+I LP+ +YD+ WPV+KVPL TPH + Y+ E
Sbjct: 972 IEAFTVLHNVNCSHGFIYVTSQGFLKICQLPSAYNYDSYWPVQKVPLHGTPHQVTYYAEQ 1031
Query: 190 KTYCIVTSTA---------EPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFS-- 238
Y ++ S D + D ++ + D+ V +F V +
Sbjct: 1032 SLYPLIVSVPVVRPLNQVLSSMADQESVHHMDNDVTS--TDALHKTYTVDEFEVRILELE 1089
Query: 239 --PFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGR 296
WE ++ P+ +E+ L ++ V++ + T +A+GT Y EDV RGR
Sbjct: 1090 KPGGHWE--TKSTIPMQLFENALTVRIVTL-HNTTTKENETLLAIGTAYVLGEDVAARGR 1146
Query: 297 ILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDN 356
+LLF + ++N + +Y+KE KG V+A+ + G L+ A G KI + +
Sbjct: 1147 VLLFSFTK------SENSQNLVTEVYSKESKGAVSAVASLQGHLLIASGPKITLNKWTGA 1200
Query: 357 DLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSK 416
+LT +AF D +++ S+ VKN +L GD +SI L ++ + LSL+A+D+
Sbjct: 1201 ELTAVAFYDAPLHVVSLNIVKNFVLFGDIHKSIYFLSWKEQGSQLSLLAKDF-------- 1252
Query: 417 GYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNV 476
G + +IDG S++ + SD DKNV
Sbjct: 1253 GSLDCFATEFLIDG--------------------------------STLSLVASDSDKNV 1280
Query: 477 VLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASL 536
+F Y P+ ES G +L+ + +FH+G H+ F +++ P+ + +RF + +L
Sbjct: 1281 QIFYYAPKMVESWKGQKLLSRAEFHVGAHITKFLRLQMLPTQ-GLSSEKTNRFALLFGNL 1339
Query: 537 DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTY--KGKGYYAGNPSRGII 594
DG +G P+ E +RRL LQ +V H GLNPR+FR + GKG+ G II
Sbjct: 1340 DGGIGCIAPIDELTFRRLQSLQRKLVDAVPHVCGLNPRSFRQFHSNGKGHRPG--PDNII 1397
Query: 595 DGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI 631
D L+ + LSL E+L++ ++IG+ + IL DI
Sbjct: 1398 DFELLCSYEMLSLDEQLDVAQQIGTTRSQILSNFSDI 1434
>gi|222628488|gb|EEE60620.1| hypothetical protein OsJ_14038 [Oryza sativa Japonica Group]
Length = 1441
Score = 275 bits (704), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 190/637 (29%), Positives = 301/637 (47%), Gaps = 94/637 (14%)
Query: 30 NRPLLL-VRTQHELLIYQAFRH-------------PKG------ALKLRFKKLKVLFVSD 69
+RP L + LL Y AF + P+G A R + L+ VS
Sbjct: 857 SRPFLFGLLNDGTLLCYHAFSYEASESNVKRVPLSPQGSADHHNASDSRLRNLRFHRVSI 916
Query: 70 RSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGP 129
+ P L R ++ F+N+ GY+G+FL G PAW+ + R LR HP DGP
Sbjct: 917 DITSREDIPTLGR----PRITTFNNVGGYEGLFLSGTRPAWV-MVCRQRLRVHPQLCDGP 971
Query: 130 VSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLET 189
+ HNVNC GF+Y ++ L+I LP+ +YD+ WPV+KVPL TPH + Y+ E
Sbjct: 972 IEAFTVLHNVNCSHGFIYVTSQGFLKICQLPSAYNYDSYWPVQKVPLHGTPHQVTYYAEQ 1031
Query: 190 KTYCIVTSTA---------EPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFS-- 238
Y ++ S D + D ++ + D+ V +F V +
Sbjct: 1032 SLYPLIVSVPVVRPLNQVLSSMADQESVHHMDNDVTS--TDALHKTYTVDEFEVRILELE 1089
Query: 239 --PFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGR 296
WE ++ P+ +E+ L ++ V++ + T +A+GT Y EDV RGR
Sbjct: 1090 KPGGHWE--TKSTIPMQLFENALTVRIVTL-HNTTTKENETLLAIGTAYVLGEDVAARGR 1146
Query: 297 ILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDN 356
+LLF + ++N + +Y+KE KG V+A+ + G L+ A G KI + +
Sbjct: 1147 VLLFSFTK------SENSQNLVTEVYSKESKGAVSAVASLQGHLLIASGPKITLNKWTGA 1200
Query: 357 DLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSK 416
+LT +AF D +++ S+ VKN +L GD +SI L ++ + LSL+A+D+
Sbjct: 1201 ELTAVAFYDAPLHVVSLNIVKNFVLFGDIHKSIYFLSWKEQGSQLSLLAKDF-------- 1252
Query: 417 GYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNV 476
G + +IDG S++ + SD DKNV
Sbjct: 1253 GSLDCFATEFLIDG--------------------------------STLSLVASDSDKNV 1280
Query: 477 VLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASL 536
+F Y P+ ES G +L+ + +FH+G H+ F +++ P+ + +RF + +L
Sbjct: 1281 QIFYYAPKMVESWKGQKLLSRAEFHVGAHITKFLRLQMLPTQ-GLSSEKTNRFALLFGNL 1339
Query: 537 DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTY--KGKGYYAGNPSRGII 594
DG +G P+ E +RRL LQ +V H GLNPR+FR + GKG+ G II
Sbjct: 1340 DGGIGCIAPIDELTFRRLQSLQRKLVDAVPHVCGLNPRSFRQFHSNGKGHRPG--PDNII 1397
Query: 595 DGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI 631
D L+ + LSL E+L++ ++IG+ + IL DI
Sbjct: 1398 DFELLAHYEMLSLDEQLDVAQQIGTTRSQILSNFSDI 1434
>gi|402590016|gb|EJW83947.1| hypothetical protein WUBG_05142 [Wuchereria bancrofti]
Length = 374
Score = 274 bits (701), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 153/412 (37%), Positives = 240/412 (58%), Gaps = 40/412 (9%)
Query: 229 VSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYS 288
+ Q+ + L+SP W+ + E+E V C + V + EGT+SG++ Y+A+GT NY
Sbjct: 1 MDQYKLQLYSPEDWKPVQHVEILFEEFEVVTCCEEVVLRSEGTVSGVQNYLAVGTACNYG 60
Query: 289 EDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKI 348
E+V RGRI++ +IIEVVPEPGQP +K++IK +Y KEQKGPVT++C G+L+T +GQK+
Sbjct: 61 EEVLVRGRIIISEIIEVVPEPGQPTSKHRIKTLYDKEQKGPVTSLCSCNGYLLTGMGQKV 120
Query: 349 YIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDY 408
+IW KDN+L GI+F+D YI ++ V+NL L D RS+ALLRYQ EY+ LSL +RD
Sbjct: 121 FIWLFKDNNLQGISFLDMHFYIHQLIGVRNLALACDMYRSLALLRYQEEYKALSLASRDM 180
Query: 409 KPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFM 468
R + + +FL I +K MGF+
Sbjct: 181 ----------------RSDVQPPMAAQFL-------------IDNKQ---------MGFI 202
Query: 469 ISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSS--ISDAPGAR 526
+SD+ N+ +F Y PE ES GG +L + + ++G VN+F +++ SS + + +
Sbjct: 203 MSDEAANIAIFNYLPETLESLGGEKLTLRAEINIGTVVNSFIRVKGHISSGFVENELFSL 262
Query: 527 SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYA 586
R +ASLDG+ G+ PL EK +RRL MLQ +M + GLN + R + +
Sbjct: 263 ERQSVLFASLDGSFGYLRPLTEKVFRRLHMLQQLMSSMVLQPAGLNAKGARAARPQRPNH 322
Query: 587 GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
+R ++DG +V ++L LSL E+ ++ +K+G+ I+D+L +I +++H+
Sbjct: 323 YLNTRNLVDGDVVMQYLHLSLPEKNDLARKLGTSRYHIIDDLNEICRVTAHY 374
>gi|255539681|ref|XP_002510905.1| cleavage and polyadenylation specificity factor cpsf, putative
[Ricinus communis]
gi|223550020|gb|EEF51507.1| cleavage and polyadenylation specificity factor cpsf, putative
[Ricinus communis]
Length = 1461
Score = 273 bits (697), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 178/562 (31%), Positives = 276/562 (49%), Gaps = 68/562 (12%)
Query: 88 QMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLY 147
++ F+NI+G+QG FL G PAW F+ R LR HP DG + HNVNC G +Y
Sbjct: 943 RITIFNNISGHQGFFLLGSRPAW-FMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIY 1001
Query: 148 FNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTA--EP---- 201
++ L+I LP+ +YD WPV+K+PLK TPH + Y E Y ++ S +P
Sbjct: 1002 VTSQGNLKICQLPSFSNYDNYWPVQKIPLKGTPHQVTYFPEKNLYPLIVSVPVHKPVNQV 1061
Query: 202 -STDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFS----WEEIPQTNFPLHEWE 256
S+ + G E D V +F V + + W+ + P+ E
Sbjct: 1062 LSSLVDQEVGHQIENHNLSSDELLQTYSVEEFEVRILESENGGGPWQ--TKATIPMQSSE 1119
Query: 257 HVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKN 316
+ L ++ V++ + T +A+GT Y EDV RGR+LLF +++ E Q L
Sbjct: 1120 NALTVRVVTL-FNATTKENETLLAIGTAYVQGEDVAARGRVLLFSVVKST-ENSQVL--- 1174
Query: 317 KIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDT-EVYIASMVS 375
+ +Y+KE KG ++A+ + G L+ A G KI + + +L G+AF D +Y+ASM
Sbjct: 1175 -VSEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGVAFYDAPPLYVASMNI 1233
Query: 376 VKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWK 435
VKN IL+GD +SI L ++ + LSL+A+D+ G + +IDG
Sbjct: 1234 VKNFILLGDIHKSIYFLSWKEQGAQLSLLAKDF--------GSLDCFATEFLIDG----- 1280
Query: 436 FLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLI 495
S++ ++SD+ KN+ +F Y P+ ES G +L+
Sbjct: 1281 ---------------------------STLSLVVSDEQKNIQIFYYAPKMLESWKGQKLL 1313
Query: 496 KKTDFHLGQHVNTFFKIRCKPSSISDAPGA------RSRFLTWYASLDGALGFFLPLPEK 549
+ +FH+G H+ F ++ +S SD GA +RF + +LDG++G PL E
Sbjct: 1314 SRAEFHVGAHITKFIRLSMLSTS-SDRSGAAPGPDKTNRFALLFGTLDGSIGCIAPLDEL 1372
Query: 550 NYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGE 609
+RRL LQ +V H GLNPR+FR ++ G I+D L+ F L L E
Sbjct: 1373 TFRRLQSLQRKLVDAVPHVAGLNPRSFRQFRSDGKVHRPGPESIVDCELLSHFEMLPLEE 1432
Query: 610 RLEICKKIGSKHNDILDELYDI 631
+LEI +++G+ IL L D+
Sbjct: 1433 QLEIAQQVGTTRAQILSNLNDL 1454
>gi|357162146|ref|XP_003579318.1| PREDICTED: probable cleavage and polyadenylation specificity factor
subunit 1-like [Brachypodium distachyon]
Length = 1442
Score = 272 bits (695), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 186/655 (28%), Positives = 304/655 (46%), Gaps = 87/655 (13%)
Query: 11 AMDETIVQELLTVSLGLHG-----NRPLLL-VRTQHELLIYQAFRH-------------P 51
++ + + + V L +H +RP L + LL YQA+ + P
Sbjct: 834 SLKKEVANNIRIVELAMHRWSGQFSRPFLFGLLNDGTLLCYQAYCYEGLESNIKGTSLSP 893
Query: 52 KGALKL------RFKKLKVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCG 105
G++ L R K L+ VS + L R ++ F+N+ GY+G+FL G
Sbjct: 894 DGSVDLGNASDSRLKNLRFHRVSVDITSREDISSLAR----PRITIFNNVGGYEGLFLSG 949
Query: 106 PHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSY 165
P W+ + R R HP DGP+ HNVNC G +Y ++ L+I LP+ +Y
Sbjct: 950 TRPVWV-MVCRQRFRVHPQLCDGPIEAFTVLHNVNCSHGLIYVTSQGFLKICQLPSAYNY 1008
Query: 166 DAPWPVRKVPLKCTPHFLAYHLETKTYCIVTS--TAEPSTDYYKFNGEDKELVTDPRDSR 223
D WPV+K+PL TPH + Y+ E Y ++ S P + + + D+
Sbjct: 1009 DNYWPVQKIPLHGTPHQVTYYAEQSLYPLIVSVPVVRPLNQVLSIMADQEMIHHMDNDAS 1068
Query: 224 FIPPLVSQFHVSLFSPFSWE-EIP------QTNFPLHEWEHVLCLKNVSMEYEGTLSGLR 276
L + V F E E P ++ P+ +E+ L ++ V++ + T
Sbjct: 1069 SADDLQKTYTVEEFEVRVLELEKPGGRWETRSTIPMQSFENALTVRIVTL-HNTTTKENE 1127
Query: 277 GYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
+A+GT Y EDV RGR+LLF + ++N + +Y+KE KG V+A+ +
Sbjct: 1128 TLMAIGTAYVQGEDVAARGRVLLFSFTK------SENSQNLVTEVYSKESKGAVSAVASL 1181
Query: 337 AGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQP 396
G LV A G KI + + ++LT +AF D +++ S+ VKN +L GD +S+ L ++
Sbjct: 1182 QGHLVIASGPKITLNKWNGSELTAVAFYDAPLHVVSLNIVKNFVLFGDIHKSVYFLSWKE 1241
Query: 397 EYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHN 456
+ L+L+A+D+ G + +IDG
Sbjct: 1242 QGSQLTLLAKDF--------GSLDCFATEFLIDG-------------------------- 1267
Query: 457 DILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKP 516
S++ ++SD DKN+ +F Y P+ ES G +L+ + + H+G H+ F +++ P
Sbjct: 1268 ------STLSLVVSDSDKNLQIFYYAPKMVESWKGQKLLSRAELHVGAHMTKFLRLQMLP 1321
Query: 517 SSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAF 576
+ A +RF + +LDG++G P+ E +RRL LQ +V SH GLNPR+F
Sbjct: 1322 AQ-GLASEKTNRFALLFGTLDGSIGCIAPVDELTFRRLQSLQRKLVDAVSHVCGLNPRSF 1380
Query: 577 RTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI 631
R +K G IID L+ + LSL E+L++ ++IG+ IL DI
Sbjct: 1381 RQFKSNGKAHRPGPDNIIDFELLTYYEILSLEEQLDMAQQIGTTRAQILSNFSDI 1435
>gi|296084122|emb|CBI24510.3| unnamed protein product [Vitis vinifera]
Length = 1448
Score = 269 bits (687), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 176/567 (31%), Positives = 270/567 (47%), Gaps = 68/567 (11%)
Query: 83 GVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCP 142
G +M F NI G QG+FL G P W F+ R +R HP DG + HN+NC
Sbjct: 925 GTTSPRMTVFKNIGGCQGLFLSGSRPLW-FMVFRERIRVHPQLCDGSIVAFTVLHNINCN 983
Query: 143 RGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTA--E 200
G +Y ++ L+I LP SYD WPV+K+PLK TPH + Y E Y ++ S +
Sbjct: 984 HGLIYVTSQGFLKICQLPAVSSYDNYWPVQKIPLKGTPHQVTYFAEKNLYPLIVSVPVLK 1043
Query: 201 P-----STDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFS----WEEIPQTNFP 251
P S+ + G E D V +F V + P W+ + P
Sbjct: 1044 PLNHVLSSLVDQEAGHQLENDNLSSDELHRSYSVDEFEVRVLEPEKSGAPWQ--TRATIP 1101
Query: 252 LHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQ 311
+ E+ L ++ V++ + T +A+GT Y EDV RGR+LLF + +
Sbjct: 1102 MQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSVGKNTDN--- 1157
Query: 312 PLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDT-EVYI 370
++N + IY+KE KG ++A+ + G L+ A G KI + + +L G+AF D +Y+
Sbjct: 1158 --SQNLVSEIYSKELKGAISAVASLQGHLLIASGPKIILHKWTGTELNGVAFFDAPPLYV 1215
Query: 371 ASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDG 430
S+ VKN IL+GD RSI L ++ + L+L+A+D+ G + +IDG
Sbjct: 1216 VSLNIVKNFILLGDIHRSIYFLSWKEQGAQLNLLAKDF--------GSLDCFATEFLIDG 1267
Query: 431 SLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNG 490
S++ ++SD KN+ +F Y P+ ES
Sbjct: 1268 --------------------------------STLSLIVSDDQKNIQIFYYAPKMSESWK 1295
Query: 491 GHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGA------RSRFLTWYASLDGALGFFL 544
G +L+ + +FH+G HV F +++ P+S SD A +RF + +LDG++G
Sbjct: 1296 GQKLLSRAEFHVGAHVTKFLRLQMLPAS-SDRTSATQGSDKTNRFALLFGTLDGSIGCIA 1354
Query: 545 PLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQ 604
PL E +RRL LQ +V H GLNPR+FR ++ G I+D L+ +
Sbjct: 1355 PLDELTFRRLQSLQKKLVDAVPHVAGLNPRSFRQFRSNGKAHRPGPDNIVDCELLCHYEM 1414
Query: 605 LSLGERLEICKKIGSKHNDILDELYDI 631
L E+LEI ++IG+ IL L D+
Sbjct: 1415 LPFEEQLEIAQQIGTTRMQILSNLNDL 1441
>gi|225455571|ref|XP_002268371.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Vitis vinifera]
Length = 1442
Score = 269 bits (687), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 176/567 (31%), Positives = 270/567 (47%), Gaps = 68/567 (11%)
Query: 83 GVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCP 142
G +M F NI G QG+FL G P W F+ R +R HP DG + HN+NC
Sbjct: 919 GTTSPRMTVFKNIGGCQGLFLSGSRPLW-FMVFRERIRVHPQLCDGSIVAFTVLHNINCN 977
Query: 143 RGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTA--E 200
G +Y ++ L+I LP SYD WPV+K+PLK TPH + Y E Y ++ S +
Sbjct: 978 HGLIYVTSQGFLKICQLPAVSSYDNYWPVQKIPLKGTPHQVTYFAEKNLYPLIVSVPVLK 1037
Query: 201 P-----STDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFS----WEEIPQTNFP 251
P S+ + G E D V +F V + P W+ + P
Sbjct: 1038 PLNHVLSSLVDQEAGHQLENDNLSSDELHRSYSVDEFEVRVLEPEKSGAPWQ--TRATIP 1095
Query: 252 LHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQ 311
+ E+ L ++ V++ + T +A+GT Y EDV RGR+LLF + +
Sbjct: 1096 MQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSVGKNTDN--- 1151
Query: 312 PLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDT-EVYI 370
++N + IY+KE KG ++A+ + G L+ A G KI + + +L G+AF D +Y+
Sbjct: 1152 --SQNLVSEIYSKELKGAISAVASLQGHLLIASGPKIILHKWTGTELNGVAFFDAPPLYV 1209
Query: 371 ASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDG 430
S+ VKN IL+GD RSI L ++ + L+L+A+D+ G + +IDG
Sbjct: 1210 VSLNIVKNFILLGDIHRSIYFLSWKEQGAQLNLLAKDF--------GSLDCFATEFLIDG 1261
Query: 431 SLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNG 490
S++ ++SD KN+ +F Y P+ ES
Sbjct: 1262 --------------------------------STLSLIVSDDQKNIQIFYYAPKMSESWK 1289
Query: 491 GHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGA------RSRFLTWYASLDGALGFFL 544
G +L+ + +FH+G HV F +++ P+S SD A +RF + +LDG++G
Sbjct: 1290 GQKLLSRAEFHVGAHVTKFLRLQMLPAS-SDRTSATQGSDKTNRFALLFGTLDGSIGCIA 1348
Query: 545 PLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQ 604
PL E +RRL LQ +V H GLNPR+FR ++ G I+D L+ +
Sbjct: 1349 PLDELTFRRLQSLQKKLVDAVPHVAGLNPRSFRQFRSNGKAHRPGPDNIVDCELLCHYEM 1408
Query: 605 LSLGERLEICKKIGSKHNDILDELYDI 631
L E+LEI ++IG+ IL L D+
Sbjct: 1409 LPFEEQLEIAQQIGTTRMQILSNLNDL 1435
>gi|10257491|dbj|BAB11613.1| cleavage and polyadenylation specificity factor subunit [Arabidopsis
thaliana]
Length = 1448
Score = 268 bits (685), Expect = 6e-69, Method: Compositional matrix adjust.
Identities = 183/603 (30%), Positives = 287/603 (47%), Gaps = 72/603 (11%)
Query: 47 AFRHPKGALKLRFKKLKVLFVS-DRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCG 105
A + G+ KLR LK L + D S R G GV ++ F NI+G+QG FL G
Sbjct: 903 AALNSSGSSKLR--NLKFLRIPLDTSTR----EGTSDGVASQRITMFKNISGHQGFFLSG 956
Query: 106 PHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSY 165
P W L R LR H DG ++ HNVNC GF+Y A+ L+I LP+ Y
Sbjct: 957 SRPGWCMLF-RERLRFHSQLCDGSIAAFTVLHNVNCNHGFIYVTAQGVLKICQLPSASIY 1015
Query: 166 DAPWPVRKVPLKCTPHFLAYHLETKTYCIVTS--TAEP-----STDYYKFNGEDKELVTD 218
D WPV+K+PLK TPH + Y+ E Y ++ S ++P S+ + G+ +
Sbjct: 1016 DNYWPVQKIPLKATPHQVTYYAEKNLYPLIVSYPVSKPLNQVLSSLVDQEAGQQLDNHNM 1075
Query: 219 PRDSRFIPPLVSQFHVSLFSPFS----WEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSG 274
D V +F + + P WE + P+ EH L ++ V++ T
Sbjct: 1076 SSDDLQRTYTVEEFEIQILEPERSGGPWE--TKAKIPMQTSEHALTVRVVTLLNASTGEN 1133
Query: 275 LRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAIC 334
+A+GT Y EDV RGR+LLF ++N + +Y++E KG ++A+
Sbjct: 1134 -ETLLAVGTAYVQGEDVAARGRVLLFSF-----GKNGDNSQNVVTEVYSRELKGAISAVA 1187
Query: 335 HVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDT-EVYIASMVSVKNLILVGDYARSIALLR 393
+ G L+ + G KI + + +L G+AF D +Y+ SM VK+ IL+GD +SI L
Sbjct: 1188 SIQGHLLISSGPKIILHKWNGTELNGVAFFDAPPLYVVSMNVVKSFILLGDVHKSIYFLS 1247
Query: 394 YQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGS 453
++ + LSL+A+D++ + + +IDG
Sbjct: 1248 WKEQGSQLSLLAKDFESLDCFATEF--------LIDG----------------------- 1276
Query: 454 KHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIR 513
S++ +SD+ KN+ +F Y P+ ES G +L+ + +FH+G HV+ F +++
Sbjct: 1277 ---------STLSLAVSDEQKNIQVFYYAPKMIESWKGLKLLSRAEFHVGAHVSKFLRLQ 1327
Query: 514 CKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNP 573
+S +RF + +LDG+ G PL E +RRL LQ +V H GLNP
Sbjct: 1328 M----VSSGADKINRFALLFGTLDGSFGCIAPLDEVTFRRLQSLQKKLVDAVPHVAGLNP 1383
Query: 574 RAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEA 633
AFR ++ G + I+D L+ + L L E+LE+ +IG+ IL +L D+
Sbjct: 1384 LAFRQFRSSGKARRSGPDSIVDCELLCHYEMLPLEEQLELAHQIGTTRYSILKDLVDLSV 1443
Query: 634 LSS 636
+S
Sbjct: 1444 GTS 1446
>gi|30696088|ref|NP_199979.2| cleavage and polyadenylation specificity factor subunit 1
[Arabidopsis thaliana]
gi|290457637|sp|Q9FGR0.2|CPSF1_ARATH RecName: Full=Cleavage and polyadenylation specificity factor subunit
1; AltName: Full=Cleavage and polyadenylation specificity
factor 160 kDa subunit; Short=AtCPSF160; Short=CPSF 160
kDa subunit
gi|332008729|gb|AED96112.1| cleavage and polyadenylation specificity factor subunit 1
[Arabidopsis thaliana]
Length = 1442
Score = 268 bits (684), Expect = 8e-69, Method: Compositional matrix adjust.
Identities = 183/603 (30%), Positives = 287/603 (47%), Gaps = 72/603 (11%)
Query: 47 AFRHPKGALKLRFKKLKVLFVS-DRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCG 105
A + G+ KLR LK L + D S R G GV ++ F NI+G+QG FL G
Sbjct: 897 AALNSSGSSKLR--NLKFLRIPLDTSTR----EGTSDGVASQRITMFKNISGHQGFFLSG 950
Query: 106 PHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSY 165
P W L R LR H DG ++ HNVNC GF+Y A+ L+I LP+ Y
Sbjct: 951 SRPGWCMLF-RERLRFHSQLCDGSIAAFTVLHNVNCNHGFIYVTAQGVLKICQLPSASIY 1009
Query: 166 DAPWPVRKVPLKCTPHFLAYHLETKTYCIVTS--TAEP-----STDYYKFNGEDKELVTD 218
D WPV+K+PLK TPH + Y+ E Y ++ S ++P S+ + G+ +
Sbjct: 1010 DNYWPVQKIPLKATPHQVTYYAEKNLYPLIVSYPVSKPLNQVLSSLVDQEAGQQLDNHNM 1069
Query: 219 PRDSRFIPPLVSQFHVSLFSPFS----WEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSG 274
D V +F + + P WE + P+ EH L ++ V++ T
Sbjct: 1070 SSDDLQRTYTVEEFEIQILEPERSGGPWE--TKAKIPMQTSEHALTVRVVTLLNASTGEN 1127
Query: 275 LRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAIC 334
+A+GT Y EDV RGR+LLF ++N + +Y++E KG ++A+
Sbjct: 1128 -ETLLAVGTAYVQGEDVAARGRVLLFSF-----GKNGDNSQNVVTEVYSRELKGAISAVA 1181
Query: 335 HVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDT-EVYIASMVSVKNLILVGDYARSIALLR 393
+ G L+ + G KI + + +L G+AF D +Y+ SM VK+ IL+GD +SI L
Sbjct: 1182 SIQGHLLISSGPKIILHKWNGTELNGVAFFDAPPLYVVSMNVVKSFILLGDVHKSIYFLS 1241
Query: 394 YQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGS 453
++ + LSL+A+D++ + + +IDG
Sbjct: 1242 WKEQGSQLSLLAKDFESLDCFATEF--------LIDG----------------------- 1270
Query: 454 KHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIR 513
S++ +SD+ KN+ +F Y P+ ES G +L+ + +FH+G HV+ F +++
Sbjct: 1271 ---------STLSLAVSDEQKNIQVFYYAPKMIESWKGLKLLSRAEFHVGAHVSKFLRLQ 1321
Query: 514 CKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNP 573
+S +RF + +LDG+ G PL E +RRL LQ +V H GLNP
Sbjct: 1322 M----VSSGADKINRFALLFGTLDGSFGCIAPLDEVTFRRLQSLQKKLVDAVPHVAGLNP 1377
Query: 574 RAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEA 633
AFR ++ G + I+D L+ + L L E+LE+ +IG+ IL +L D+
Sbjct: 1378 LAFRQFRSSGKARRSGPDSIVDCELLCHYEMLPLEEQLELAHQIGTTRYSILKDLVDLSV 1437
Query: 634 LSS 636
+S
Sbjct: 1438 GTS 1440
>gi|24415580|gb|AAN41460.1| putative cleavage and polyadenylation specificity factor 160 kDa
subunit [Arabidopsis thaliana]
Length = 1442
Score = 268 bits (684), Expect = 9e-69, Method: Compositional matrix adjust.
Identities = 183/603 (30%), Positives = 287/603 (47%), Gaps = 72/603 (11%)
Query: 47 AFRHPKGALKLRFKKLKVLFVS-DRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCG 105
A + G+ KLR LK L + D S R G GV ++ F NI+G+QG FL G
Sbjct: 897 AALNSSGSSKLR--NLKFLRIPLDTSTR----EGTSDGVASQRITMFKNISGHQGFFLSG 950
Query: 106 PHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSY 165
P W L R LR H DG ++ HNVNC GF+Y A+ L+I LP+ Y
Sbjct: 951 SRPGWCMLF-RERLRFHSQLCDGSIAAFTVLHNVNCNHGFIYVTAQGVLKICQLPSASIY 1009
Query: 166 DAPWPVRKVPLKCTPHFLAYHLETKTYCIVTS--TAEP-----STDYYKFNGEDKELVTD 218
D WPV+K+PLK TPH + Y+ E Y ++ S ++P S+ + G+ +
Sbjct: 1010 DNYWPVQKIPLKATPHQVTYYAEKNLYPLIVSYPVSKPLNQVLSSLVDQEAGQQLDNHNM 1069
Query: 219 PRDSRFIPPLVSQFHVSLFSPFS----WEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSG 274
D V +F + + P WE + P+ EH L ++ V++ T
Sbjct: 1070 SSDDLQRTYTVEEFEIQILEPERSGGPWE--TKAKIPMQTSEHALTVRVVTLLNASTGEN 1127
Query: 275 LRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAIC 334
+A+GT Y EDV RGR+LLF ++N + +Y++E KG ++A+
Sbjct: 1128 -ETLLAVGTAYVQGEDVAARGRVLLFSF-----GKNGDNSQNVVTEVYSRELKGAISAVA 1181
Query: 335 HVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDT-EVYIASMVSVKNLILVGDYARSIALLR 393
+ G L+ + G KI + + +L G+AF D +Y+ SM VK+ IL+GD +SI L
Sbjct: 1182 SIQGHLLISSGPKIILHKWNGTELNGVAFFDAPPLYVVSMNVVKSFILLGDVHKSIYFLS 1241
Query: 394 YQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGS 453
++ + LSL+A+D++ + + +IDG
Sbjct: 1242 WKEQGSQLSLLAKDFESLDCFATEF--------LIDG----------------------- 1270
Query: 454 KHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIR 513
S++ +SD+ KN+ +F Y P+ ES G +L+ + +FH+G HV+ F +++
Sbjct: 1271 ---------STLSLAVSDEQKNIQVFYYAPKMIESWKGLKLLSRAEFHVGAHVSKFLRLQ 1321
Query: 514 CKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNP 573
+S +RF + +LDG+ G PL E +RRL LQ +V H GLNP
Sbjct: 1322 M----VSSGADKINRFALLFGTLDGSFGCIAPLDEVTFRRLQSLQKKLVDAVPHVAGLNP 1377
Query: 574 RAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEA 633
AFR ++ G + I+D L+ + L L E+LE+ +IG+ IL +L D+
Sbjct: 1378 LAFRQFRSSGKARRSGPDSIVDCELLCHYEMLPLEEQLELAHQIGTTRYSILKDLVDLSV 1437
Query: 634 LSS 636
+S
Sbjct: 1438 GTS 1440
>gi|224120960|ref|XP_002318462.1| predicted protein [Populus trichocarpa]
gi|222859135|gb|EEE96682.1| predicted protein [Populus trichocarpa]
Length = 1455
Score = 268 bits (684), Expect = 9e-69, Method: Compositional matrix adjust.
Identities = 189/636 (29%), Positives = 295/636 (46%), Gaps = 84/636 (13%)
Query: 30 NRPLLL-VRTQHELLIYQA--FRHPKGALKL------------------RFKKLKVLFVS 68
+RP L + T +L Y A F P G KL R + L+ + V
Sbjct: 863 SRPFLFGILTDGTILCYHAYLFEGPDGTSKLEDSVSAQNSVGASTISASRLRNLRFVRVP 922
Query: 69 DRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDG 128
+ E RI+ F NI+GYQG FL G PAW F+ R LR HP DG
Sbjct: 923 LDTYTREETSSETSCQRITT---FKNISGYQGFFLSGSRPAW-FMVFRERLRVHPQLCDG 978
Query: 129 PVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLE 188
+ H VNC G +Y ++ L+I L + SYD WPV+K+PLK TPH + Y E
Sbjct: 979 SIVAFTVLHTVNCNHGLIYVTSQGNLKICHLSSVSSYDNYWPVQKIPLKGTPHQVTYFAE 1038
Query: 189 TKTY-CIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPP------LVSQFHVSLFSPFS 241
Y IV+ + + + D+E+ + V +F V + P +
Sbjct: 1039 RNLYPLIVSVPVQKPVNQVLSSLVDQEVGHQIENHNLSSEEIHRTYSVDEFEVRILEPSN 1098
Query: 242 WEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFD 301
+ P+ E+ L ++ VS+ + + +A+GT Y EDV RGRILLF
Sbjct: 1099 GPWQVKATIPMQTSENALTVRMVSL-FNTSTKENETLLAVGTAYVQGEDVAARGRILLFS 1157
Query: 302 IIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGI 361
+++ PE Q L + +Y+KE KG ++A+ + G L+ A G KI + + +LTG+
Sbjct: 1158 VVK-NPENSQIL----VSEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELTGV 1212
Query: 362 AFIDT-EVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYA 420
AF D +Y+ S+ VKN IL+GD +SI L ++ + LSL+A+D+ S +
Sbjct: 1213 AFSDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLSLLAKDFASLDCFSTEF-- 1270
Query: 421 GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFM 480
+IDG S++ ++SD+ KNV +F
Sbjct: 1271 ------LIDG--------------------------------STLSLVVSDEQKNVQIFY 1292
Query: 481 YQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGA-----RSRFLTWYAS 535
Y P+ ES G +L+ + +FH+G V F +++ S+ + A +RF + +
Sbjct: 1293 YAPKMSESWKGQKLLSRAEFHVGALVTKFMRLQMLSPSLDRSGAAPVSDKTNRFALLFGT 1352
Query: 536 LDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIID 595
LDG++G PL E +RRL LQ +V H GLNP++FR ++ G I+D
Sbjct: 1353 LDGSIGCIAPLDELTFRRLQSLQKKLVDAVPHVAGLNPKSFRQFRSDGKAHRPGPESIVD 1412
Query: 596 GSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI 631
++ + + L E++EI ++IG+ IL L D+
Sbjct: 1413 CEMLSYYEMIPLEEQVEIAQQIGTTRAQILSNLNDL 1448
>gi|297792471|ref|XP_002864120.1| hypothetical protein ARALYDRAFT_495232 [Arabidopsis lyrata subsp.
lyrata]
gi|297309955|gb|EFH40379.1| hypothetical protein ARALYDRAFT_495232 [Arabidopsis lyrata subsp.
lyrata]
Length = 1444
Score = 264 bits (674), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 180/603 (29%), Positives = 286/603 (47%), Gaps = 72/603 (11%)
Query: 47 AFRHPKGALKLR-FKKLKVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCG 105
A + G+ KLR K L++ F D S R G GV ++ F NI+G+QG FL G
Sbjct: 899 AALNSSGSSKLRNLKFLRIPF--DTSTR----EGTSDGVASQRITMFKNISGHQGFFLSG 952
Query: 106 PHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSY 165
P W L R LR H DG ++ HNVNC GF+Y ++ L+I LP+ Y
Sbjct: 953 SRPGWCMLF-RERLRFHSQLCDGSIAAFTVLHNVNCNHGFIYVTSQVVLKICQLPSASIY 1011
Query: 166 DAPWPVRKVPLKCTPHFLAYHLETKTYCIVTS--TAEP-----STDYYKFNGEDKELVTD 218
D WPV+K+PLK TPH + Y+ E Y ++ S ++P S+ + G+ +
Sbjct: 1012 DNYWPVQKIPLKATPHQVTYYAEKNLYPLIVSYPVSKPINQVLSSLVDQEAGQQIDNHNL 1071
Query: 219 PRDSRFIPPLVSQFHVSLFSPFS----WEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSG 274
D V +F + + P WE + P+ EH L ++ V++ T
Sbjct: 1072 SSDDLQRTYTVEEFEIQILEPERSGGPWE--TKATIPMQSSEHALTVRVVTLLNASTGEN 1129
Query: 275 LRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAIC 334
+A+GT Y EDV RGR+LLF + ++N + +Y++E KG ++A+
Sbjct: 1130 -ETLLAVGTAYVQGEDVAARGRVLLFSFGK-----NGDNSQNVVTEVYSRELKGAISAVA 1183
Query: 335 HVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDT-EVYIASMVSVKNLILVGDYARSIALLR 393
+ G L+ + G KI + + +L G+AF D +Y+ SM VK IL+GD +SI L
Sbjct: 1184 SIQGHLLISSGPKIILHKWNGTELNGVAFFDAPPLYVVSMNVVKTFILLGDVHKSIYFLS 1243
Query: 394 YQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGS 453
++ + LSL+A+D+ G + +IDG
Sbjct: 1244 WKEQGSQLSLLAKDF--------GSLDCFATEFLIDG----------------------- 1272
Query: 454 KHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIR 513
+++ +SD+ KN+ +F Y P+ ES G +L+ + +FH+G HV F +++
Sbjct: 1273 ---------NTLSLAVSDEQKNIQVFYYAPKMAESWKGQKLLSRAEFHVGSHVTKFLRLQ 1323
Query: 514 CKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNP 573
++ +RF + +LDG+ G PL E +RRL LQ +V H GLNP
Sbjct: 1324 M----VTSGADKTNRFALLFGTLDGSFGCIAPLDEVTFRRLQSLQKKLVDAVPHVAGLNP 1379
Query: 574 RAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEA 633
+FR ++ G + IID L+ + L L E+LE+ +IG+ + IL L ++
Sbjct: 1380 HSFRQFRTSGKARRSGPDSIIDCELLCHYEMLPLEEQLELAHQIGTTRSVILLNLVELSV 1439
Query: 634 LSS 636
+S
Sbjct: 1440 GTS 1442
>gi|290981010|ref|XP_002673224.1| CPSF A subunit [Naegleria gruberi]
gi|284086806|gb|EFC40480.1| CPSF A subunit [Naegleria gruberi]
Length = 1373
Score = 258 bits (660), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 175/598 (29%), Positives = 283/598 (47%), Gaps = 105/598 (17%)
Query: 87 SQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFL 146
SQ+ F NI GY G+F G P WLF T LR HP PV+T P+H+ NCP GF+
Sbjct: 835 SQLIPFKNIGGYGGLFKTGEKPFWLF-TEHSNLRVHPTQSRDPVTTFTPYHHENCPHGFI 893
Query: 147 YFNAK-------SELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTA 199
Y K S+L IS L ++ ++A WP RK+ LK TP+ + +H +T T TS
Sbjct: 894 YLTDKEQDNKKQSKLHISSLNANVKFNAYWPQRKILLKSTPNVITFHQDTNTCLAFTSVP 953
Query: 200 EPSTDYYKFNGEDKELVTD--PRDSRFIPPLVSQFH-VSLFSPFSWEEIPQTNFPLHE-- 254
K ++ D P PP Q H V LFS +W+E+ + F LHE
Sbjct: 954 V------------KAILPDSIPFPEGKCPPPAEQKHTVKLFSGHNWQEMDKFEFDLHESA 1001
Query: 255 -WEHVLCLK------NVSMEYEGTLSG----LRGYIALGTNYNYSEDVTCRGRILLFDII 303
V+ L + + +E L+ L +A+GT Y SE CRGR+LLFD+
Sbjct: 1002 VAAKVVYLSKEEYNDDTDISFEEPLNSRKQDLVSVVAVGTAYVQSERELCRGRLLLFDLD 1061
Query: 304 EVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYI----WQLKDNDLT 359
++ + K+ +I + KGP+T + V +++ +VG +IY W+ K +T
Sbjct: 1062 PILGRENE----YKLNLISSTSVKGPITTLEQVDRYIICSVGNRIYTYYFDWEEKRMHIT 1117
Query: 360 GIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYY 419
+F DT+ Y AS+ +V+N I+ GD +S++ LR++ + L L+A+D +P Q S +
Sbjct: 1118 --SFYDTQFYTASLNTVRNFIMFGDIYKSVSFLRWKEKGHRLILLAKDNRPLQVVSSEFL 1175
Query: 420 AGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLF 479
+ND+L G + D KN+ +F
Sbjct: 1176 V----------------------------------NNDLL------GLAVIDTSKNLQIF 1195
Query: 480 MYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKP---------SSISDAPGARSR-- 528
Y P+ +ESN G L+ DFH+G +N+ +++ + ++++ P +
Sbjct: 1196 SYLPQHQESNDGRNLVPVCDFHIGTLINSLIRMKVRELPDDNTIRLGNVNEKPKQSGKKD 1255
Query: 529 --------FLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYK 580
+ S+DGA+G+ P+ E +RRL LQ M T GL+P++FR YK
Sbjct: 1256 ITKTNPNHQFILFGSVDGAIGYVAPINEVTHRRLFALQLKMYTQLEQAAGLHPKSFRLYK 1315
Query: 581 GKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
N + IIDG L+W + ++ + ++ ++IG+ ++IL + ++ + F
Sbjct: 1316 PLERTEYNYKKNIIDGQLIWNYANINTILQRDLARQIGTNSDNILRSIQELNQATFFF 1373
>gi|330799483|ref|XP_003287774.1| hypothetical protein DICPUDRAFT_32967 [Dictyostelium purpureum]
gi|325082229|gb|EGC35718.1| hypothetical protein DICPUDRAFT_32967 [Dictyostelium purpureum]
Length = 1453
Score = 254 bits (649), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 173/631 (27%), Positives = 311/631 (49%), Gaps = 87/631 (13%)
Query: 19 ELLTVSLG-LHGNRPLLLVRTQ-HELLIYQAFRHPKGALKLRFKKLKVLFV----SDRSK 72
E++ +SL L+ ++P LL++ + +L++Y++F+ G LRFKK F+ S+ SK
Sbjct: 879 EIVEISLEILNNSQPYLLLKNRIGDLIVYKSFKKENG--DLRFKKYNHNFILRDLSNNSK 936
Query: 73 RANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVST 132
N G R + + GVF+ G P W+F +G +R H M DG + +
Sbjct: 937 SINSD-----GYRKKSIVNIKLSSKNNGVFIGGQKPVWIF-NEKGYIRLHSMDFDGAIVS 990
Query: 133 LAPFHNVNCPRGFLYFNA-KSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKT 191
L PFHN +CP GFLY+ K ++I L ++++ + +R+VP+K + H +AYH E K
Sbjct: 991 LKPFHNADCPNGFLYYTEDKQHIKIGYLNGLMNFENEYAIRRVPIKLSAHKIAYHNELKC 1050
Query: 192 YCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSP---FSWEEIPQT 248
Y +V S + + + + K ++TD + F + + P +SW I
Sbjct: 1051 YVVVVSFPQVTQELEE--DSKKPILTDEK-----------FQIKIIDPTIDWSWRFID-- 1095
Query: 249 NFPLHEWEHVLCLKNVSMEYEGTLSGLRG--YIALGTNYNYSEDVTCRGRILLFDIIEVV 306
+F L + E VL +K VS++++ + ++ ++ +GT + + ED C+GR+L+F+I+
Sbjct: 1096 SFSLQDRETVLAMKIVSLKFKESDETIKSKPFLVIGTAFTFGEDTQCKGRVLVFEIVSHK 1155
Query: 307 PE-PGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFID 365
+ L ++ ++Y KEQKGPVTA+ V+G L+ +G K+ + Q L ++F D
Sbjct: 1156 TQFESDDLGTKRLNLLYEKEQKGPVTALSSVSGLLLMTIGPKLTVNQFLTGQLVTLSFHD 1215
Query: 366 TEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSR 425
++YI S+ ++K I++GD +S+ L++ + L +++DY+ S +
Sbjct: 1216 AQIYICSISTIKTYIVIGDMYKSVYFLQWNG--KQLVPLSKDYQSLNIFSTEFIVNQ--- 1270
Query: 426 GIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEA 485
++ ++SD DKN++LF + P
Sbjct: 1271 -------------------------------------QTLSILVSDLDKNILLFSFDPAD 1293
Query: 486 RESNGGHRLIKKTDFHLGQHVNTFFKIRCKPS-SISDAPGARSRFLTWYASLDGALGFFL 544
S G L+ K DFH+G ++ F + K + S + L ++ +LDG+L
Sbjct: 1294 PTSRQGQMLLCKADFHIGSNIEKFVRTPMKFNIQSSSNGNNNNDQLVFFGTLDGSLNVLR 1353
Query: 545 PLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKG-KGYYAGNPS------RGIIDGS 597
PL E+ Y+ LQ+ + + GLN + +R +K + +PS + I+DG
Sbjct: 1354 PLDERMYQLFYHLQSKLY-YLPQPAGLNAKQYRAFKSFSQNFHFSPSTIHQLPKYILDGD 1412
Query: 598 LVWKFLQLSLGERLEICKKIGSKHNDILDEL 628
L+ KF++L+ ER + +GS ++IL L
Sbjct: 1413 LLSKFVKLNQKERRLLASSVGSNTDEILTAL 1443
>gi|19112233|ref|NP_595441.1| cleavage factor one Cft1 (predicted) [Schizosaccharomyces pombe
972h-]
gi|74582544|sp|O74733.1|CFT1_SCHPO RecName: Full=Protein cft1; AltName: Full=Cleavage factor two protein
1
gi|3738146|emb|CAA21247.1| cleavage factor one Cft1 (predicted) [Schizosaccharomyces pombe]
Length = 1441
Score = 249 bits (637), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 186/641 (29%), Positives = 303/641 (47%), Gaps = 78/641 (12%)
Query: 19 ELLTVSLGLHGNRPLLLVRTQ-HELLIYQAFRHP---KGALKLRFKKLKVLFVSDRSKRA 74
ELL LG P L +R++ +E+ +Y+AF + K L F K+ ++ R +A
Sbjct: 853 ELLVADLGDDFKEPHLFLRSRLNEITVYKAFLYSNTDKHKNLLAFAKVPQETMT-REFQA 911
Query: 75 NEQPGLPRGVRIS------------QMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAH 122
N G PR + +M + + VF+ G P + T +
Sbjct: 912 N--VGTPRDAESTMEKKASSSVDHLKMTALEVVGNHSAVFVTGRKPFLILSTLHSNAKFF 969
Query: 123 PMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHF 182
P++ + P+ ++APFH + P+G++Y + S +RI YD WP +KV L +
Sbjct: 970 PISSNIPILSVAPFHAHHAPQGYIYVDENSFIRICKFQEDFEYDNKWPYKKVSLGKQING 1029
Query: 183 LAYHLETKTYCIVTSTAEPSTDYYKFNGEDKE---LVTDPRDSRFIPPLVSQFHVSLFSP 239
+AYH Y + +A P +K ED +TD D P+ + + L SP
Sbjct: 1030 IAYHPTKMVYAV--GSAVPIE--FKVTDEDGNEPYAITDDNDYL---PMANTGSLDLVSP 1082
Query: 240 FSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILL 299
+W I F ++E L + V++E T + YIA+GT+ ED+ RG L
Sbjct: 1083 LTWTVIDSYEF--QQFEIPLSVALVNLEVSETTKLRKPYIAVGTSITKGEDIAVRGSTYL 1140
Query: 300 FDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDND-L 358
F+II+VVP+PG+P T++K+K++ +E KG V +C V G+L++ GQK+ + L+D D L
Sbjct: 1141 FEIIDVVPQPGRPETRHKLKLVTREEIKGTVAVVCEVDGYLLSGQGQKVIVRALEDEDHL 1200
Query: 359 TGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGY 418
G++FID Y S ++NL+L GD +++ + + E ++L SKG
Sbjct: 1201 VGVSFIDLGSYTLSAKCLRNLLLFGDVRQNVTFVGFAEEPYRMTLF----------SKGQ 1250
Query: 419 YAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVL 478
A N S D L + ++ F+++D N+ L
Sbjct: 1251 EALNVSAA------------------------------DFLVQGENLYFVVADTSGNLRL 1280
Query: 479 FMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAP---GARSRFLTWYAS 535
Y PE ES+ G RL+ + DFH+G +V T I K +A F +
Sbjct: 1281 LAYDPENPESHSGERLVTRGDFHIG-NVITAMTILPKEKKHQNAEYGYDTGDDFSCVMVN 1339
Query: 536 LDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIID 595
DG L +P+ ++ YRRL ++QN + + GGLNP+++R NP+R I+D
Sbjct: 1340 SDGGLQMLVPISDRVYRRLNIIQNYLANRVNTIGGLNPKSYRLITSPSNLT-NPTRRILD 1398
Query: 596 GSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI-EALS 635
G L+ F +S+ R E+ K G + I+++L ++ EALS
Sbjct: 1399 GMLIDYFTYMSVAHRHEMAHKCGVPVSTIMNDLVELDEALS 1439
>gi|213407244|ref|XP_002174393.1| cleavage factor one Cft1 [Schizosaccharomyces japonicus yFS275]
gi|212002440|gb|EEB08100.1| cleavage factor one Cft1 [Schizosaccharomyces japonicus yFS275]
Length = 1431
Score = 248 bits (634), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 178/648 (27%), Positives = 300/648 (46%), Gaps = 81/648 (12%)
Query: 19 ELLTVSLGLHGNRPLLLVRTQ-HELLIYQAF--RHP-KGALKLRFKKLKVLFVSDRSKRA 74
E+L LG LL+R++ +E+ +Y+ F +P +LRF K+ ++ S
Sbjct: 832 EVLATDLGDEAKEAHLLIRSRMNEITVYKPFVCSNPVTHKTELRFSKIPQEGMTRESTEC 891
Query: 75 N--------EQPGLPRG------------VRISQMRYFSNIAGYQGVFLCGPHPAWLFLT 114
+ EQ P+ V +M I + VF+ G P +L T
Sbjct: 892 SLQDLVAETEQENAPKDASEQKPQKSSSTVDKPRMVALQRIGNHSAVFITGAKPFFLLKT 951
Query: 115 SRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKV 174
+ + HP+ + + +LA FH + P+G+++ + ++ I ++YD W +KV
Sbjct: 952 AHSVAKFHPLLSECRILSLASFHTEHAPKGYIFVDENYDINICRFQDDINYDHRWGYKKV 1011
Query: 175 PLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHV 234
+ + H +AYH Y I TST P Y+ E+ +V ++ P + +
Sbjct: 1012 NVGRSVHGIAYHPTKMVYAIATSTLTP----YEVTDEEGNVVYPLKNEGEYLPRTNSGML 1067
Query: 235 SLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCR 294
L SP +W I + F ++E LC++ V++E + +IA+GT+ ED+ R
Sbjct: 1068 ELVSPLTWTVIDRYKF--LDYEIPLCVRLVNLEISDVTKLRKPFIAVGTSITKGEDIAVR 1125
Query: 295 GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLK 354
G LF+II+VVP+PG P T++K+K++ +E KG V + + G+L++ GQK+ + L+
Sbjct: 1126 GSTYLFEIIDVVPQPGHPETRHKLKLVTREEIKGTVAVVSEINGYLLSGQGQKVIVRALE 1185
Query: 355 DND-LTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQP 413
D D L G+AFID Y S++NL++ GD +SI+ + + E ++L A+ P
Sbjct: 1186 DEDHLVGVAFIDLGSYTVVAKSLRNLLIFGDIRQSISFVGFAEEPYRMTLFAKGQDPLSV 1245
Query: 414 NSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKD 473
+S D L + S+ F ++D
Sbjct: 1246 SSA----------------------------------------DFLVQGQSLYFAVADMR 1265
Query: 474 KNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGAR-----SR 528
N+ + Y PE ES+ G RL+ + D H+G H+ T I P D PG
Sbjct: 1266 GNLRILAYDPENPESHSGERLVTRGDIHVG-HIIT--AIHLVPKMKKDRPGEVDYDEGDE 1322
Query: 529 FLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGN 588
F + DG+L P+ E+ YRRL ++QN + GGLNPR++R N
Sbjct: 1323 FACITTNSDGSLQALCPISERVYRRLNIIQNYLANRIETVGGLNPRSYRLINTVSSL-NN 1381
Query: 589 PSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI-EALS 635
+ I+DG L+ F +S+ R E+ K G + I+++L ++ EAL+
Sbjct: 1382 ATHRILDGGLIEHFSYMSVAHRQEMAYKCGVPISTIMNDLVELDEALN 1429
>gi|308805673|ref|XP_003080148.1| cleavage and polyadenylation specificity factor (ISS) [Ostreococcus
tauri]
gi|116058608|emb|CAL54315.1| cleavage and polyadenylation specificity factor (ISS), partial
[Ostreococcus tauri]
Length = 1473
Score = 246 bits (629), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 179/616 (29%), Positives = 286/616 (46%), Gaps = 77/616 (12%)
Query: 30 NRPLLL-VRTQHELLIYQAFRHPKGAL-----------KLRFKKLKV------LFVSDRS 71
RPLL VR LL+Y+ F P G +LRF ++ + L V+
Sbjct: 622 ERPLLTAVRGDGTLLLYRGFIVPAGTTCEGSEEPLARGELRFSRVNIDVEGSGLNVAGVG 681
Query: 72 KRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVS 131
+ L G R++++ G QG+F+ GP+P WL + R + A P +G +
Sbjct: 682 VAGQVRDSLA-GTRLTRISNVGEGQGLQGIFVAGPNPLWLIV-RRSRVLALPTRGEGEIV 739
Query: 132 TLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKT 191
FHNVNCP GF+ A +RI +P+ + Y+A WPVRK+ LKCTPH +AY + K
Sbjct: 740 AFTDFHNVNCPYGFILGTAVGGVRICQMPSKMHYEAAWPVRKIALKCTPHAVAYLPDFKL 799
Query: 192 YCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPP----LVSQFHVSLFSPFSWEEIPQ 247
Y +VTS P D + +GE+ ++ + R + Q+ V L P S + + Q
Sbjct: 800 YALVTSANVPWVD-REIDGENVHGLSLSKARRERAKAHDDMELQYSVRLLVPGSLDCVWQ 858
Query: 248 TNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVP 307
L EHV C++NV ++ T L Y+A+GT ED CRGR+ LF+++
Sbjct: 859 HT--LEPGEHVQCVRNVQLKDINTGHSL-SYLAVGTAMPGGEDTPCRGRVYLFNMVWERD 915
Query: 308 EPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTE 367
+ K ++ +E K TA+ + G L+ AVG K+ + +L +AF DT
Sbjct: 916 SESADGYRWKGQVCCVREAKMACTALEGLGGHLIVAVGTKLTVHTWDGRELNSVAFFDTP 975
Query: 368 VYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGI 427
++ S+ VKN ILVGD + + R++
Sbjct: 976 IHTVSINVVKNFILVGDLEKGLHFFRWK-------------------------------- 1003
Query: 428 IDGSLVWKFLQLSLG-ERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEAR 486
D +QLS ER+++ ++ L + +++ + SD N F Y P++
Sbjct: 1004 -DTGFEKSLIQLSKDFERMDVVS------SEFLIDGTTLSLLGSDMSGNARTFGYDPKSI 1056
Query: 487 ESNGGHRLIKKTDFHLGQHVNTFFKI-----RCKPSSISDAPGARSRFLTWYASLDGALG 541
ES G +L+ + +H+G ++ + + K +S P +RF ++ +LDGALG
Sbjct: 1057 ESWKGQKLLPRAAYHVGSPISRMVRFNVEGSKSKMASTDGKPKGANRFAVFFGTLDGALG 1116
Query: 542 FFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRT---YKGKGYYAGNPSRGIIDGSL 598
F+P Y +LL +Q + T G NPR FRT ++GK P ++DG L
Sbjct: 1117 IFMPTDPVTYEKLLAIQRELTTAVRSPIGCNPRTFRTPKVFEGKHVQLRAP-LDVLDGGL 1175
Query: 599 VWKFLQLSLGERLEIC 614
+ KF L+ E+++I
Sbjct: 1176 LSKFETLTFSEQVKIA 1191
>gi|428186188|gb|EKX55039.1| hypothetical protein GUITHDRAFT_160593 [Guillardia theta CCMP2712]
Length = 2290
Score = 241 bits (616), Expect = 7e-61, Method: Compositional matrix adjust.
Identities = 170/569 (29%), Positives = 271/569 (47%), Gaps = 80/569 (14%)
Query: 84 VRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTID--GPVSTLAPFHNVNC 141
+R S++ G +GV + PA + L RG R HP +D V + A F+N+ C
Sbjct: 1064 LRTSRLMPLGGAGGLEGVLIAARQPA-VVLFGRGLPRIHPWKLDRGEGVRSAARFNNLQC 1122
Query: 142 PRGFLYF------NAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIV 195
G + AK L+I +P +S D PWP+R + T H +A+H T + +V
Sbjct: 1123 KDGIVCIADKGRDRAKGVLKICNIPEGISGDTPWPLRTKHVGMTVHHVAFHAATGCHVLV 1182
Query: 196 TSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQ-FHVSLFSPFSWEEIPQTNFPLHE 254
S+ + D K G + IPPL + + V L +P+S E + F
Sbjct: 1183 VSSQQEIEDERKPEGTLEGA---------IPPLTEEKYEVQLRAPYSMELLDSYEFDFAN 1233
Query: 255 WEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCR--GRILLFDIIEVVPEPG-Q 311
E LCL+ V ++ L ++A+GT + E T R GRI +F++ VV E G +
Sbjct: 1234 GEKALCLQVVHLKNTRVKDSLLPFVAVGTGFQNGESETSRATGRIYVFEVTTVVGEEGYE 1293
Query: 312 PLTKNKIKMIYA----KEQKGPVTAICHVAGFLVTAVG--------QKIYIWQLKDNDLT 359
T KIK I+ ++ K PV+A+C + G+L+ A G K+Y+++ D L
Sbjct: 1294 GRTSFKIKKIFTSADIQDIKAPVSALCQLEGYLLVAQGPNPGMIGGSKLYVYEWVDEKLV 1353
Query: 360 GIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYY 419
G AF D +YI ++ +VK I+ GD S+ LLR++ + R L L+A+D P Y
Sbjct: 1354 GRAFFDAHLYITTLKTVKFFIVFGDIRHSVHLLRWREDIRMLQLLAKDALPLS-----VY 1408
Query: 420 AGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLF 479
A +F+ + S+ G + SD+ KNV +F
Sbjct: 1409 AA-------------EFVVMG----------------------SNFGLLASDEQKNVQVF 1433
Query: 480 MYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGA 539
++ P + E +LI + D H+G H+N F + P G R+ Y +LDG
Sbjct: 1434 VFNPNSPEYR-RQQLICRADLHVGSHINKFIRW---PLPFRPTLGVRT--AAHYTTLDGG 1487
Query: 540 LGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLV 599
+G +P+PE++YRRLL LQN++VT H GLNPR++R YK ++ +DG+L+
Sbjct: 1488 IGAIIPIPEQSYRRLLALQNLLVTAMPHYAGLNPRSWRLYKPAMCMKRRYAKNFLDGNLL 1547
Query: 600 WKFLQLSLGERLEICKKIGSKHNDILDEL 628
++L L L ++++ + IL +L
Sbjct: 1548 GRYLHLDLALQMQLSSALNQTREAILGDL 1576
>gi|145348791|ref|XP_001418827.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144579057|gb|ABO97120.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 1386
Score = 240 bits (612), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 176/619 (28%), Positives = 286/619 (46%), Gaps = 76/619 (12%)
Query: 30 NRPLLL-VRTQHELLIYQAFRHPKGAL-----------KLRFKKLKV------LFVSDRS 71
RPLL VR LL+Y+ F P G +LRF ++ V L V+
Sbjct: 794 ERPLLTAVRGDGTLLLYKGFIVPAGTTYEGQDEPLEKNELRFSRVNVDVEGSGLNVAGIG 853
Query: 72 KRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVS 131
+ L G R++++ G QG+F+ GP+P WL + R + A P +G V
Sbjct: 854 AAGQLRDSLA-GARLTRIGNVGEGQGVQGIFVAGPNPLWLIV-RRSRVLALPTRGEGEVV 911
Query: 132 TLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKT 191
FHNVNCP GF+ A +RI +P+ + Y+A WPVRKV LKCTPH + Y + K
Sbjct: 912 AFTVFHNVNCPHGFILGTALGGVRICQMPSKMHYEAAWPVRKVALKCTPHTITYLPDFKL 971
Query: 192 YCIVTSTAEP--STDYYKFNGEDKELVTDPRD-SRFIPPLVSQFHVSLFSPFSWEEIPQT 248
Y +VTS P + + N L R+ ++ + Q+ V L P S + Q
Sbjct: 972 YALVTSAPVPWVEREIEQDNVHGIALAKVRRERAKANDDMELQYSVRLLVPGSLDSAWQ- 1030
Query: 249 NFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPE 308
L EHV C++NV + T L +A+GT ED CRGR++LF ++
Sbjct: 1031 -HALEPGEHVQCVRNVQLRDINT-GALLSLLAVGTAMPGGEDTPCRGRVILFQMVWERDA 1088
Query: 309 PGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEV 368
+ K ++ +E K TA+ + G L+ AVG K+ + +L +AF DT +
Sbjct: 1089 ESMDGYRWKGQVCCVREAKMACTALSALDGHLIVAVGTKLTVHTWDGVELNSVAFFDTPI 1148
Query: 369 YIASMVSVKNLILVGDYARSIALLRYQPE--YRTLSLVARDYKPTQPNSKGYYAGNPSRG 426
+ S+ VKN ILVGD + + R++ +++ +++D+
Sbjct: 1149 HTVSINVVKNFILVGDLEKGLHFFRWKANGFEKSIIQLSKDF------------------ 1190
Query: 427 IIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEAR 486
+R+++ + L + +++ + SD N +F Y P++
Sbjct: 1191 ----------------DRMDVVS------TEFLIDGATLSLLGSDMSGNARIFGYDPKSL 1228
Query: 487 ESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGAR----SRFLTWYASLDGALGF 542
ES G +L+ ++ +H+G ++ + + ++ APG R +R ++ +LDGALG
Sbjct: 1229 ESWKGQKLLVRSAYHVGSPISRMVRFNVEGTTAKAAPGERPKGTNRHAVFFGTLDGALGI 1288
Query: 543 FLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRT---YKGKGYYAGNPSRGIIDGSLV 599
F+P E Y +L LQ + T G NPR FRT ++GK P ++DG L+
Sbjct: 1289 FMPTDEPTYAKLHALQRELNTTVRSPIGCNPRTFRTPKVFEGKHVQLLAP-LDVLDGGLL 1347
Query: 600 WKFLQLSLGERLEICKKIG 618
KF L+ E+ + ++ G
Sbjct: 1348 SKFETLTFTEQRAVAERSG 1366
>gi|440793679|gb|ELR14857.1| CPSF A subunit region protein [Acanthamoeba castellanii str. Neff]
Length = 1477
Score = 234 bits (598), Expect = 7e-59, Method: Compositional matrix adjust.
Identities = 168/578 (29%), Positives = 269/578 (46%), Gaps = 90/578 (15%)
Query: 84 VRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPR 143
+R ++ YF + GVF+ G PAW+F RG R +PM +D V A FHN NCP
Sbjct: 964 LRYRRIHYFGTVGKSNGVFISGSAPAWVF-AQRGYARLYPMKLDTFVRAFAEFHNANCPH 1022
Query: 144 GFLYFNAKSELRISVLPTH---LSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAE 200
GF+YFN + L+I LP + ++ P VRKVPL TP +AYH ++TY + +T
Sbjct: 1023 GFIYFNHEGTLKICQLPAAEGAIHWELPGVVRKVPLGRTPREIAYHPPSRTYVVALATPV 1082
Query: 201 P-------STDYYK--------------FNGEDKELVTDPRDSRFIPPLVSQFHVSLFSP 239
TD + E+K+ PR+ I + + + L SP
Sbjct: 1083 TTVVPTPPETDMERQEREREEEESREMGIEPEEKQRDMGPRE---IAMMEERHELHLISP 1139
Query: 240 FSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILL 299
+W+ + L EHVL TLS L+ LG NY+ +L+
Sbjct: 1140 RTWQILHHVE--LEPKEHVL-----------TLSVLK----LGDNYSQVNRELRPPHLLI 1182
Query: 300 FDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLT 359
++I +V E LT K + K KGPV+A + G+L+ AVG KI+++
Sbjct: 1183 YEI-DVTGEEQCKLTMAYQKPMKEKPMKGPVSAAASLQGYLIIAVGPKIWVFNFDGGSTE 1241
Query: 360 GIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYY 419
+AF D YI S+ ++KN +L GD +SI LR++ L+L+A+D + Y
Sbjct: 1242 AVAFYDAPHYIVSIKTLKNFVLCGDIYKSIFFLRWKDSASQLALLAKDVGRVSVFATEY- 1300
Query: 420 AGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLF 479
++D K N + ++SD+ +N+ +
Sbjct: 1301 -------VVD------------------------KQN--------LALLMSDERQNLQVT 1321
Query: 480 MYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGA 539
Y P ES GG L+ + DF++GQ +N F ++ P ++ + R W+ +L G
Sbjct: 1322 AYAPHTAESRGGQLLVPRGDFNVGQSINKFVRL---PMTLPSGTTSLQRHALWFGTLSGG 1378
Query: 540 LGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLV 599
+G+ P+ E +RRL MLQ+ +++ HT GL+P+A+R + + N I+DG L+
Sbjct: 1379 VGYLAPMDESVFRRLGMLQSALLSAIPHTAGLHPQAYRALQ-RERLLRNRKHTILDGLLL 1437
Query: 600 WKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSH 637
++L L + +I K+G+ IL++L I +H
Sbjct: 1438 SRYLALDSATQQQIALKLGTSRERILNDLQGIPQSVTH 1475
>gi|33411762|emb|CAD58786.1| cleavage and polyadenylation specificity factor 1 [Bos taurus]
Length = 880
Score = 234 bits (598), Expect = 7e-59, Method: Compositional matrix adjust.
Identities = 114/219 (52%), Positives = 145/219 (66%), Gaps = 15/219 (6%)
Query: 16 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGALKLRFKKL--------- 62
+V+E+L V+LG RP LLV ELLIY+AF H +G LK+RFKK+
Sbjct: 663 LVKEVLLVALGSRQRRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKVPHNINFREK 722
Query: 63 -KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRA 121
+ E+ PRG R+++ RYF +I GY GVF+CGP P WL +T RG LR
Sbjct: 723 KPKPSKKKAEGGSTEEGTGPRG-RVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRL 781
Query: 122 HPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPH 181
HPM IDGP+ + APFHN+NCPRGFLYFN + ELRISVLP +LSYDAPWPVRK+PL+CT H
Sbjct: 782 HPMGIDGPIDSFAPFHNINCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAH 841
Query: 182 FLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPR 220
++AYH+E+K Y + TST+ P T + GE+KE T R
Sbjct: 842 YVAYHVESKVYAVATSTSTPCTRVPRMTGEEKEFETIER 880
>gi|320040273|gb|EFW22206.1| hypothetical protein CPSG_00105 [Coccidioides posadasii str.
Silveira]
Length = 1387
Score = 234 bits (596), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 187/646 (28%), Positives = 314/646 (48%), Gaps = 95/646 (14%)
Query: 17 VQELLTVSLGLHGNR-PLLLVRT-QHELLIYQAFRHPKGAL---KLRFKKLKVLFVS--D 69
+ E+L LG +R P +++RT ++L++YQ + HPK +L +LRF K+ F+ D
Sbjct: 804 LSEVLIADLGDSISRQPYIILRTANNDLILYQPY-HPKTSLDKQELRFVKIIDHFLPRFD 862
Query: 70 RSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDG- 128
S +A +PR +R +S+I GY+ VF+ G +P ++ +S H + + G
Sbjct: 863 PSPKAY----MPRS---KFLRAYSDICGYKTVFMSGSNPCFVMKSSTSS--PHVLRLRGE 913
Query: 129 PVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLE 188
VS+L+ FH C +GF Y +A + +R+ LP + +D W RKV + + Y
Sbjct: 914 AVSSLSSFHIPACEKGFAYVDASNMVRMCRLPGNTRFDNSWVTRKVHVGDQIDCVEYFAH 973
Query: 189 TKTYCIVTSTAEPSTDYYKFN---GEDKELVTDPRDS--RFIPPLVSQFHVSLFSPFSWE 243
++ Y + +S +K + ED E+ + R F+P L + + L SP +W
Sbjct: 974 SEIYALGSS--------HKVDFKLPEDDEIHPEWRSEVISFMPQL-ERGCIKLLSPRTWS 1024
Query: 244 EIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDII 303
+ ++ L + E V+C+K ++ME ++ + +GT ED+T RG I +F+II
Sbjct: 1025 VV--DSYELGDAERVMCMKTINMEISEITHEMKDMLVVGTATVRGEDITPRGSIYVFEII 1082
Query: 304 EVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA--GFLVTAVGQKIYIWQLK-DNDLTG 360
EV P+P +P T K+K+ + KG VTA+ + GFL+ A GQK + LK D L
Sbjct: 1083 EVAPDPDRPETNRKLKIFAKDDVKGAVTAVSGIGGQGFLIMAQGQKCMVRGLKEDGSLLP 1142
Query: 361 IAFIDTEVYIASMVSVK--NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGY 418
+AF+D + Y+ + ++ L ++GD + I Y E L+L +D + Q + +
Sbjct: 1143 VAFMDMQCYVKVLKELQGTGLCIMGDALKGIWFAGYSEEPYRLTLFGKDNEYLQVIAADF 1202
Query: 419 YAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVL 478
L G+RL I +++D D + +
Sbjct: 1203 --------------------LPDGKRLYI--------------------LVADDDCTIHV 1222
Query: 479 FMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSIS-DAPGARS--------RF 529
Y PE S+ G RL+ ++ FH+G +T + SS S D PG +
Sbjct: 1223 LEYDPEDPTSSKGDRLLHRSSFHMGHFTSTMTLLPQHSSSPSADDPGEDDMDVDYVPKSY 1282
Query: 530 LTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNP 589
S +G++G PL E +YRRL LQ+ +VT H GLNP+A+R + G+
Sbjct: 1283 QVLVTSQEGSIGVVTPLTEDSYRRLSALQSQLVTSMEHPCGLNPKAYRAVESDGFGG--- 1339
Query: 590 SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALS 635
RGI+DG+L+ ++L + + + EI ++G+ DI D+E +S
Sbjct: 1340 -RGIVDGNLLLRWLDMGVQRKAEIAGRVGA---DIESIRVDLEKIS 1381
>gi|412986884|emb|CCO15310.1| predicted protein [Bathycoccus prasinos]
Length = 1595
Score = 233 bits (595), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 177/620 (28%), Positives = 280/620 (45%), Gaps = 86/620 (13%)
Query: 31 RPLLLV-RTQHELLIYQAFRHPKGALKLRFKKLKVLFVSDRSKRANEQPGLPRGVRISQM 89
RPLL R +L YQAF+ P + +LRF ++ + + S+ N + G R++++
Sbjct: 1024 RPLLTCFRADGSVLAYQAFKSPS-SNELRFARVPIEIETAGSELTNNDVSVQGGSRLTRI 1082
Query: 90 RYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVS-TLAPFHNVNCPRGFLYF 148
+ G GVF+ G +P WL + RG + A P +G APFHNVNCP+GF+
Sbjct: 1083 ENIGDGRGIAGVFVSGLNPIWLIV-RRGRVLALPTRGEGGARIAFAPFHNVNCPKGFILA 1141
Query: 149 NAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDY--- 205
+ +R+ LP + +A WPVRK+ L+CTP + Y + K Y +VTS + P D+
Sbjct: 1142 TNEGGIRVCRLPGKMHIEAQWPVRKLALRCTPRAITYMNDFKLYALVTSASVPWKDFEID 1201
Query: 206 ---------YKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWE 256
Y+F E ++ +V QF + L P + E Q + E
Sbjct: 1202 ETDSHARALYRFRKE---------KAKSEGNVVQQFAIRLLVPGTLETAWQK--AVEPGE 1250
Query: 257 HVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKN 316
H+LC+KNV + + T L +A+GT ED CRGRILLF I+ G +
Sbjct: 1251 HILCVKNVQIRDQST-GALLSMLAIGTAMPGGEDTPCRGRILLFAIMWERARDGGVRWRG 1309
Query: 317 KIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSV 376
++K K K +AI V G + A+G K+ L IAF DT +Y ++ V
Sbjct: 1310 ELKC--EKPSKMACSAIESVDGTFMVAIGTKLTAHSWDGKHLNPIAFYDTPLYTTTLCCV 1367
Query: 377 KNLILVGDYARSIALLRYQPEY--RTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVW 434
KN +L GD +SI +R++ +TLS + +DY+ + + +IDG
Sbjct: 1368 KNFLLCGDLHKSIRFVRWKDSQGEKTLSQLGKDYEVLDCIASEF--------MIDG---- 1415
Query: 435 KFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRL 494
++ + +D + N +F Y P+ ES G +L
Sbjct: 1416 ----------------------------GTLSLLAADANGNAHVFQYAPKLAESWKGDKL 1447
Query: 495 IKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRL 554
+ K+ +H G + + + I ++R ++ S DG LG F P+ E + L
Sbjct: 1448 LPKSAYHAGSLIRKMVRFQ-----IGVGEQKQNRHAVFFGSSDGGLGIFSPVDEHTFLNL 1502
Query: 555 LMLQNVMVTHTSHTG------GLNPRAFRTYK-GKGYYA-GNPSRGIIDGSLVWKFL-QL 605
LQ+ M ++ + GLN + +R K +G A P R I+DG L+ KF L
Sbjct: 1503 EKLQDAMRSNIVASSNSINPLGLNSKTYRALKSSEGSVARQTPPRTIVDGGLLSKFEHSL 1562
Query: 606 SLGERLEICKKIGSKHNDIL 625
S+ + + K G + L
Sbjct: 1563 SITAQTRVAAKAGLTRDQAL 1582
>gi|315045910|ref|XP_003172330.1| serine/threonine protein kinase [Arthroderma gypseum CBS 118893]
gi|311342716|gb|EFR01919.1| serine/threonine protein kinase [Arthroderma gypseum CBS 118893]
Length = 1397
Score = 229 bits (585), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 177/645 (27%), Positives = 302/645 (46%), Gaps = 88/645 (13%)
Query: 4 FRSHSPSAMDETIVQELLTVSLG--LHGNRPLLLVRTQHE-LLIYQAFR--HPKGALKLR 58
+ S S ++ + ELL LG +H P +++RT+H+ L++Y+ +R G KL+
Sbjct: 793 YESSSRRPVNRETLTELLVADLGDAIH-KSPYMILRTKHDDLVLYEPYRITGENGRSKLQ 851
Query: 59 F-KKLKVLFVSDRS-----KRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLF 112
F K + + + R+ K N P + +R S++ GY+ VF+ G +P ++
Sbjct: 852 FIKAVNHVVMGPRTNQPMNKDINRSPSPSK-----LLRALSDVCGYKTVFMSGQNPCFIL 906
Query: 113 LTSRGELRAHPMTIDG-PVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPV 171
++ R + + + G V +L FH C RGF Y + + +R+S LP++ +D+ W
Sbjct: 907 KSAIA--RPNVLRLRGKAVQSLTGFHIAACERGFAYVDEDNVIRMSRLPSNTRFDSAWAT 964
Query: 172 RKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDS--RFIPPLV 229
RK+PL + Y +++Y I TST E +K ED E T+ R+ F+P L
Sbjct: 965 RKIPLGEQVDCIVYSSASESYVIGTSTKED----FKLP-EDDESHTEWRNEFITFLPQL- 1018
Query: 230 SQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSE 289
+ V L P +W I + + E + C+K + +E T + + +G+ E
Sbjct: 1019 DRGTVKLLEPKNWSAI--DIYEVEPAERITCIKIIRLEISETTHERKDMVVVGSAVAKGE 1076
Query: 290 DVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA--GFLVTAVGQK 347
D+ +G I +F+II+VVP+P P K+K+ +E KG VTA+ + GFL+ A GQK
Sbjct: 1077 DIVPKGCIRVFEIIDVVPDPDHPEKNKKLKLFAREEVKGAVTAVSGIGGQGFLIVAQGQK 1136
Query: 348 IYIWQLK-DNDLTGIAFIDTEVYIASMVSVK--NLILVGDYARSIALLRYQPEYRTLSLV 404
+ LK D L IAF DT+ Y+ + +K + ++GD + + Y E L L
Sbjct: 1137 CMVRGLKEDGSLLPIAFKDTQCYVNVLKELKGTGMCIIGDAFKGLWFTGYSEEPYKLDLF 1196
Query: 405 ARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSS 464
++ N + ++D D L + +
Sbjct: 1197 GKE--------------NENLAVVDA--------------------------DFLPDGNK 1216
Query: 465 MGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSIS---- 520
+ +++D D N+ + Y PE S+ G RL+ ++ FH G +T + ++S
Sbjct: 1217 LYILVADDDCNLHVLQYDPEDPSSSKGDRLLHRSVFHTGHFASTMTLLPHGSHTLSSPVD 1276
Query: 521 ------DAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPR 574
D P S++ G++G PL E +YRRLL LQ+ +V H GLNPR
Sbjct: 1277 EDAMDTDLPPPPSKYQVLITFQTGSIGVISPLNEDSYRRLLALQSQLVNALEHPCGLNPR 1336
Query: 575 AFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGS 619
+R + G RG+IDG+L+ ++L + + EI ++G+
Sbjct: 1337 GYRAVESDGMGG---QRGMIDGNLLLRWLDMGAQRKAEIAGRVGA 1378
>gi|119195757|ref|XP_001248482.1| hypothetical protein CIMG_02253 [Coccidioides immitis RS]
gi|121769680|sp|Q1E5B0.1|CFT1_COCIM RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two protein
1
gi|392862316|gb|EAS37050.2| protein CFT1 [Coccidioides immitis RS]
Length = 1387
Score = 229 bits (583), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 184/646 (28%), Positives = 311/646 (48%), Gaps = 95/646 (14%)
Query: 17 VQELLTVSLGLHGNR-PLLLVRTQH-ELLIYQAFRHPKGAL---KLRFKKLKVLFVS--D 69
+ E+L LG +R P +++RT + +L++YQ + HPK +L +LRF K+ F+ D
Sbjct: 804 LSEVLIADLGDSISRQPYMILRTANDDLILYQPY-HPKTSLDKPELRFVKIIDHFLPRFD 862
Query: 70 RSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDG- 128
S +A +P +R +S+I GY+ VF+ G +P ++ +S H + + G
Sbjct: 863 PSPKAY----MPHS---KFLRAYSDICGYKTVFMSGSNPCFVMKSSTSS--PHVLRLRGE 913
Query: 129 PVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLE 188
VS+L+ FH C +GF Y +A + +R+ LP++ +D W RKV + + Y
Sbjct: 914 AVSSLSSFHIPACEKGFAYVDASNMVRMCRLPSNTRFDNSWVTRKVHVGDQIDCVEYFAH 973
Query: 189 TKTYCIVTSTAEPSTDYYKFN---GEDKELVTDPRDS--RFIPPLVSQFHVSLFSPFSWE 243
++ Y + +S +K + ED E+ + R F+P L + + L SP +W
Sbjct: 974 SEIYALGSS--------HKVDFKLPEDDEIHPEWRSEVISFMPQL-ERGCIKLLSPRTWS 1024
Query: 244 EIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDII 303
+ ++ L + E V+C+K ++ME ++ + +GT ED+T RG I +F+II
Sbjct: 1025 VV--DSYELGDAERVMCMKTINMEISEITHEMKDMLVVGTATVRGEDITPRGSIYVFEII 1082
Query: 304 EVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA--GFLVTAVGQKIYIWQLK-DNDLTG 360
EV P+P +P T K+K+ + KG VTA+ + GFL+ A GQK + LK D L
Sbjct: 1083 EVAPDPDRPETNRKLKIFAKDDVKGAVTAVSGIGGQGFLIMAQGQKCMVRGLKEDGSLLP 1142
Query: 361 IAFIDTEVYIASMVSVK--NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGY 418
+AF+D + Y+ + ++ L ++GD + I Y E L+L +D + Q + +
Sbjct: 1143 VAFMDMQCYVKVLKELQGTGLCIMGDALKGIWFAGYSEEPYRLTLFGKDNEYLQVIAADF 1202
Query: 419 YAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVL 478
L G+RL I +++D D + +
Sbjct: 1203 --------------------LPDGKRLYI--------------------LVADDDCTIHV 1222
Query: 479 FMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSIS---------DAPGARSRF 529
Y PE S+ G RL+ ++ FH G +T + SS S D +
Sbjct: 1223 LEYDPEDPTSSKGDRLLHRSSFHTGHFTSTMTLLPEHSSSPSADDPEEDDMDVDYVPKSY 1282
Query: 530 LTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNP 589
S +G++G PL E +YRRL LQ+ +VT H GLNP+A+R + G+
Sbjct: 1283 QVLVTSQEGSIGVVTPLTEDSYRRLSALQSQLVTSMEHPCGLNPKAYRAVESDGFGG--- 1339
Query: 590 SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALS 635
RGI+DG+L+ ++L + + + EI ++G+ DI D+E +S
Sbjct: 1340 -RGIVDGNLLLRWLDMGVQRKAEIAGRVGA---DIESIRVDLETIS 1381
>gi|66812672|ref|XP_640515.1| CPSF domain-containing protein [Dictyostelium discoideum AX4]
gi|60468551|gb|EAL66554.1| CPSF domain-containing protein [Dictyostelium discoideum AX4]
Length = 1628
Score = 227 bits (578), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 172/677 (25%), Positives = 314/677 (46%), Gaps = 141/677 (20%)
Query: 17 VQELLTVSLGLHGNRP--LLLVRTQHELLIYQAFRHPKGALKLRFKKLKVLF-----VSD 69
+ +++ +SL N L + +L+IY++F+ K +LRFKK F V++
Sbjct: 1024 ILDIVEISLHNFNNSDPYLFMFNKIGDLIIYKSFKREKNG-ELRFKKYNHSFILRDSVTE 1082
Query: 70 RSKRANEQP-------------------------GLPRGVRISQMRYFSNIAGYQGVFLC 104
++ E+ L R RI + FS+I+G +G+F+
Sbjct: 1083 FYQKQQEKELLNGMDDDDDMDDEKKKKKEEEEEENLNRQKRIFE---FSSISGKRGLFIG 1139
Query: 105 GPHPAWLFLTSRGELRAHPMTIDG----------------PVSTLAPFHNVNCPRGFLYF 148
G P W F +G LR H M V T F+N++C GF+YF
Sbjct: 1140 GKKPIWAF-CEKGYLRLHSMDSSDNSNSNNSNNNNNNNSNTVETFTSFNNISCQDGFIYF 1198
Query: 149 NAKSE-LRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYK 207
+ + + ++I L T ++++ +R++P K + H +AYH E K Y ++ S + + + +
Sbjct: 1199 SKEKDVIKICTLSTLMNFENDIAIRRIPTKNSCHKIAYHSEAKCYVVIVSFPQVTQELQE 1258
Query: 208 FNGEDKELVTDPRDSRFIPPLVSQFHVSLFSP---FSWEEIPQTNFPLHEWEHVLCLKNV 264
K ++TD + F + L P ++W+ I +F L + E VL +K V
Sbjct: 1259 --DSKKPILTDDK-----------FQIKLIDPTIDWNWKFID--SFSLQDRETVLAMKIV 1303
Query: 265 SMEYE--GTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPE-PGQPLTKNKIKMI 321
S+++ ++ R ++ +GT + + ED C+GR+L+F+I+ + + L + ++ ++
Sbjct: 1304 SLKFTEPDGITRARPFLVIGTAFTFGEDTQCKGRVLVFEIVSHKTQFESEELGEKRLNLL 1363
Query: 322 YAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLIL 381
Y KEQKGPVTA+ V G L+ +G K+ + Q L ++F D ++YI S+ ++KN I+
Sbjct: 1364 YEKEQKGPVTALSSVNGLLLMTIGPKLTVNQFYTGSLVTLSFYDAQIYICSICTIKNYIV 1423
Query: 382 VGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSL 441
+GD +S+ L+++ + +TL+L+++DY+ S + + I
Sbjct: 1424 IGDMYKSVYFLQWK-DNKTLNLLSKDYQALNIFSTEFIVNQKTLSI-------------- 1468
Query: 442 GERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFH 501
++SD DKN++LF ++P+ S G
Sbjct: 1469 --------------------------LVSDLDKNILLFSFEPQDPSSRSG---------Q 1493
Query: 502 LGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVM 561
+ Q +N ++ +D + L + +LDG L PL EK Y +Q+ +
Sbjct: 1494 INQEIN--------GNNKNDNRLPKKEQLVIFGTLDGGLNVLRPLDEKIYLLFYHIQSKL 1545
Query: 562 VTHTSHTGGLNPRAFRTYKG-KGYYAGNPS------RGIIDGSLVWKFLQLSLGERLEIC 614
+ T GLNP+ +R++K + +PS + I+DG L+ KFL LS E+ I
Sbjct: 1546 Y-YLPQTAGLNPKQYRSFKSFSQNFHFSPSTFHQLPKFILDGDLISKFLSLSQSEKRLIS 1604
Query: 615 KKIGSKHNDILDELYDI 631
I S ++I++ L D+
Sbjct: 1605 NSINSTSDEIIESLKDV 1621
>gi|170576536|ref|XP_001893668.1| CPSF A subunit region family protein [Brugia malayi]
gi|158600196|gb|EDP37499.1| CPSF A subunit region family protein [Brugia malayi]
Length = 1323
Score = 226 bits (575), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 122/338 (36%), Positives = 193/338 (57%), Gaps = 40/338 (11%)
Query: 303 IEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIA 362
++VVPEPGQP +K++IK +Y KEQKGPVT++C G+L+T +GQK++IW KDN+L GI+
Sbjct: 1024 LQVVPEPGQPTSKHRIKTLYDKEQKGPVTSLCSCNGYLLTGMGQKVFIWLFKDNNLQGIS 1083
Query: 363 FIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGN 422
F+D YI ++ V+NL L D RS+ALLRYQ EY+ LSL +RD
Sbjct: 1084 FLDMHFYIHQLIGVRNLALACDMYRSLALLRYQEEYKALSLASRDM-------------- 1129
Query: 423 PSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQ 482
R + + +FL I +K MGF++SD+ N+ +F Y
Sbjct: 1130 --RSDVQPPMAAQFL-------------IDNKQ---------MGFIMSDEAANIAIFNYL 1165
Query: 483 PEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSS--ISDAPGARSRFLTWYASLDGAL 540
PE ES GG +L + + ++G VN+F +++ SS + + + R +ASLDG+
Sbjct: 1166 PETLESLGGEKLTLRAEINIGTVVNSFIRVKGHISSGFVENELFSLERQSVLFASLDGSF 1225
Query: 541 GFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVW 600
G+ PL EK +RRL MLQ +M + GLN + R + + +R ++DG +
Sbjct: 1226 GYLRPLTEKVFRRLHMLQQLMSSMVLQPAGLNAKGARAARPQRPNHYLNTRNLVDGDVAM 1285
Query: 601 KFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
++L LSL E+ ++ +K+G+ I+D+L +I +++H+
Sbjct: 1286 QYLHLSLPEKNDLARKLGTSRYHIIDDLIEICRVTAHY 1323
Score = 112 bits (280), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 70/235 (29%), Positives = 116/235 (49%), Gaps = 14/235 (5%)
Query: 14 ETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP---KGALKLRFKKLKVLFVSDR 70
E ++ ELL V +G++ RPLL + + Y+ F + +G L +RFK+L V+ R
Sbjct: 794 EEVIMELLLVGMGMNQGRPLLFLLIDDTVSAYEMFTYNNGIQGHLAIRFKRLPYTTVT-R 852
Query: 71 SKRANEQPG------LPRGVRISQMRYFSNIAG--YQGVFLCGPHPAWLFLTSRGELRAH 122
S R G + VR + +F G GVF+C +P FL S G R H
Sbjct: 853 SCRFQGTDGRAAVESVRDAVRHKTVLHFFERIGNVLNGVFICSSYPCIFFLES-GVPRLH 911
Query: 123 PMTIDGPVSTLAPFHNVNCPRGFLYFNAKSE-LRISVLPTHLSYDAPWPVRKVPLKCTPH 181
P+ +DGP+ + F+N CP GF+Y + +R++ LP+ + DA +PV+++ + T H
Sbjct: 912 PVNLDGPILSFTTFNNAVCPNGFIYLTERDRFMRVAKLPSDMILDASYPVKRINVGATVH 971
Query: 182 FLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSL 236
+ Y L + TY ++TS T +DK + F+ P + Q+ + +
Sbjct: 972 SVVYLLHSNTYAVLTSEKRKVTKMCVLINDDKTFEEHEKPDTFVYPEMDQYKLQV 1026
>gi|242798830|ref|XP_002483249.1| cleavage and polyadenylation specificity factor subunit A, putative
[Talaromyces stipitatus ATCC 10500]
gi|218716594|gb|EED16015.1| cleavage and polyadenylation specificity factor subunit A, putative
[Talaromyces stipitatus ATCC 10500]
Length = 1382
Score = 225 bits (574), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 172/642 (26%), Positives = 293/642 (45%), Gaps = 78/642 (12%)
Query: 14 ETIVQELLTVSLG-LHGNRPLLLVRTQ-HELLIYQAFRH----PKGALKLRFKKLKVLFV 67
ETI ELL LG + P L++R+ +L+IY+ R K + L++ K F+
Sbjct: 792 ETIA-ELLVADLGEISTASPYLIIRSATDDLIIYKPVRENSKDEKTGVTLKYIKESNHFL 850
Query: 68 SDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTID 127
K E R+ +R ++I GY V + G P+ + TS+ R + D
Sbjct: 851 P---KVPIEAAATDTQQRMPGLRRLADIGGYAAVLMSGASPSLVVRTSKSLPRVFSIQSD 907
Query: 128 GPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHL 187
+ ++ F + C +G +Y + + +R L + D WP+RK+PL +LAY
Sbjct: 908 S-IRGISGFDSAGCEKGLIYVDNEHVVRTCRLHDNTQLDFSWPIRKIPLNEEVDYLAYST 966
Query: 188 ETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQ 247
+ TY + T+ + +K D+ + + P V+Q + L +P +W+ I
Sbjct: 967 VSGTYVVGTTHEQD----FKLPDNDELHPEWANEDISLRPKVAQGSIKLLNPKTWKVIDS 1022
Query: 248 TNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVP 307
F + E + ++N+++E S + I +GT + ED+ RG + +FD+I VVP
Sbjct: 1023 YTF--NAAERITAIENINLEISEKTSERKDMIVVGTTFAKGEDIAARGNVYVFDVINVVP 1080
Query: 308 EPGQPLTKNKIKMIYAKEQKGPVTAICHVA--GFLVTAVGQKIYIWQLKDN-DLTGIAFI 364
+P +P T K+K+I + +G +TA+ + GFL+ A GQK + LKD+ L +AFI
Sbjct: 1081 DPDEPGTNLKLKLIGEESVRGALTAVSGIGGQGFLIVAQGQKCMVRGLKDDGSLLPVAFI 1140
Query: 365 DTEVYIASMVSVK--NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGN 422
D + Y++ + +K + L+GD + + Y E ++L +D
Sbjct: 1141 DVQCYVSVIKELKGTGMCLIGDALKGLWFTGYSEEPYKMTLFGKDL-------------- 1186
Query: 423 PSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQ 482
+ LE+ D L + + +++D D N+ + Y
Sbjct: 1187 --------------------DELEVVTA------DFLPDGKKLYILVADSDCNLHVLQYD 1220
Query: 483 PEARESNGGHRLIKKTDFHLGQHVNTFFKI-RCKPSSISDAPGARSRFLTWYASL----- 536
PE +S+ G RL+ + FH+G +T + R SS + S + Y L
Sbjct: 1221 PEDPKSSNGDRLLNRCKFHMGHFASTITLLPRTAVSSELAVMNSDSMDIDSYIPLHQALI 1280
Query: 537 ---DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGI 593
G + L E++YRRL LQ+ + H GLNPRA+R + G RG+
Sbjct: 1281 TTQSGLMALVTSLSEESYRRLSALQSQLSNTLEHPCGLNPRAYRAVESDGVVG----RGM 1336
Query: 594 IDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALS 635
IDG L+ ++L LS +LEI ++G+ +I D+EA+S
Sbjct: 1337 IDGKLLMRWLDLSRPRKLEIAGRVGADEWEI---RADLEAVS 1375
>gi|149066088|gb|EDM15961.1| cleavage and polyadenylation specific factor 1, 160kDa (predicted),
isoform CRA_b [Rattus norvegicus]
Length = 241
Score = 222 bits (565), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 114/283 (40%), Positives = 167/283 (59%), Gaps = 47/283 (16%)
Query: 361 IAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYA 420
+AFIDT++YI M+SVKN IL D +SI+LLRYQ E +TLSLV+RD KP + S +
Sbjct: 1 MAFIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMV 60
Query: 421 GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFM 480
N + +GF++SD+D+N++++M
Sbjct: 61 DN----------------------------------------AQLGFLVSDRDRNLMVYM 80
Query: 481 YQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARS-----RFLTWYAS 535
Y PEA+ES GG RL+++ DFH+G HVNTF++ C+ ++ + P +S + +TW+A+
Sbjct: 81 YLPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGAA--EGPSKKSVMWENKHITWFAT 138
Query: 536 LDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIID 595
LDG +G LP+ EK YRRLLMLQN + T H GLNPRAFR N R ++D
Sbjct: 139 LDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLD 198
Query: 596 GSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
G L+ ++L LS ER E+ KKIG+ + ILD+L + + +++HF
Sbjct: 199 GELLNRYLYLSTMERSELAKKIGTTPDIILDDLLETDRVTAHF 241
>gi|296414526|ref|XP_002836950.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295632796|emb|CAZ81141.1| unnamed protein product [Tuber melanosporum]
Length = 1468
Score = 221 bits (564), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 153/574 (26%), Positives = 257/574 (44%), Gaps = 89/574 (15%)
Query: 94 NIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSE 153
N+AGY VFL G P+++ T++ R H + G V +L+ FH+ RGF+Y ++
Sbjct: 934 NLAGYSAVFLPGADPSFVIKTAKSSPRIHKLAGTG-VRSLSSFHSAGADRGFVYVDSLGI 992
Query: 154 LRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDK 213
+R++++P ++D W +KV LAY Y I TS +P + ED
Sbjct: 993 VRVALMPAEFTFDGNWGYKKVTPGEHVQSLAYFPPMNVYVISTSKRQP----FDLAEEDG 1048
Query: 214 ELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLS 273
+ +D + P + + L SP +W + + F +E L +K +S+E
Sbjct: 1049 NIA---KDDTTLQPEIDSGTLKLLSPQTWTAVDEYKFAHNEI--ALVVKTISLEVSEHTK 1103
Query: 274 GLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAI 333
+ +++GT ED + RG I +F++IEVVPEP +P T K+K++ +E KG V+AI
Sbjct: 1104 ERKQLVSVGTAIFRGEDHSARGGIYVFEVIEVVPEPNRPETNRKLKLVTREEVKGTVSAI 1163
Query: 334 CHVAGFLVTAVGQKIYIWQLK-DNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALL 392
C V G+L+ A GQKI + LK D L +AF+D +Y++ ++ +IL GD+ +S+
Sbjct: 1164 CGVNGYLLAAQGQKIMVRGLKEDQSLLPVAFLDMCLYVSVAKNLDGMILFGDFMKSVWFA 1223
Query: 393 RYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 452
+ E ++L +D ++LEI
Sbjct: 1224 GFSEEPYKMTLFGKDT----------------------------------QKLEIISA-- 1247
Query: 453 SKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 512
+ L + + + F++ D + N+ Y PE +S G RLI++ DF G ++T +
Sbjct: 1248 ----EFLPDGNQLYFVVVDAESNIHTLQYDPEHPKSLAGQRLIRRADFFSGHEISTLTML 1303
Query: 513 RCKPSSISDAPGAR---------------------SRFLTWYASLDGALGFFLPLPEKNY 551
P S+S + + + + G+L +PE Y
Sbjct: 1304 PFSPYSLSASSNSHLPADATDTSPLHHHHQNQQQQQEYFVLAGTQTGSLAMIRTIPETAY 1363
Query: 552 RRLLMLQNVMVTHTSHTGGLNPRAFRTY-----------------KGKGYYAGNPSRGII 594
RRL ++Q +V H GLNPR +R G G+ RG++
Sbjct: 1364 RRLNIVQGQIVNGEEHVAGLNPREYRAVVNYSGGGGGGAGGGGWGGSGGGVGGDTMRGVL 1423
Query: 595 DGSLVWKFLQLSLGERLEICKKIGSKHNDILDEL 628
DG LV +++ L+ G + E+ K G I +L
Sbjct: 1424 DGGLVSRWIGLAEGRKGEVSAKAGCGVQGIRGDL 1457
>gi|119484094|ref|XP_001261950.1| cleavage and polyadenylation specificity factor subunit A, putative
[Neosartorya fischeri NRRL 181]
gi|148886830|sp|A1DB13.1|CFT1_NEOFI RecName: Full=Protein cft1; AltName: Full=Cleavage factor two protein
1
gi|119410106|gb|EAW20053.1| cleavage and polyadenylation specificity factor subunit A, putative
[Neosartorya fischeri NRRL 181]
Length = 1400
Score = 221 bits (562), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 180/649 (27%), Positives = 304/649 (46%), Gaps = 92/649 (14%)
Query: 16 IVQELLTVSLGLHGN-RPLLLVRTQ-HELLIYQAFRHP-KGA--LKLRFKK-----LKVL 65
++ E + LG N P L++RT+ +L+IY+AF KG L F K L +
Sbjct: 807 VLSEAVIADLGESWNPSPHLILRTESDDLVIYKAFASSIKGESHTHLSFVKETNHTLPRV 866
Query: 66 FVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMT 125
SD+ ++NE+ R +RI NI+ VF+ GP +++ T++ H
Sbjct: 867 TTSDKEMQSNEELSRSRSLRI-----LPNISDLSAVFMPGPSASFILKTAKS--CPHVFR 919
Query: 126 IDGPVSTLAPFHNVNCP---RGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHF 182
+ G ++ P +GF+Y ++K LRI P+ +D W +RK+ +
Sbjct: 920 LRGEFVRGLSIFDLASPSLDKGFIYVDSKDVLRICRFPSETLFDYTWALRKIGIGEQVDH 979
Query: 183 LAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDS--RFIPPLVSQFHVSLFSPF 240
LAY ++TY + TS S D+ +D EL D R+ F+P L Q + + SP
Sbjct: 980 LAYATSSETYVLGTSH---SADFKL--PDDDELHPDWRNEVISFLPEL-RQCSLKVVSPR 1033
Query: 241 SWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLF 300
+W I ++ L E+V+ +KN+ +E R I +GT + + ED+ RG I +F
Sbjct: 1034 TWTVI--DSYSLGPAEYVMAVKNMDLEVSENTHERRNMIVVGTAFAWGEDIPSRGCIYVF 1091
Query: 301 DIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA--GFLVTAVGQKIYIWQLK-DND 357
++I+VVP+P +P T K+K+I + KG VTA+ + GFL+ A GQK + LK D
Sbjct: 1092 EVIKVVPDPEKPETDRKLKLIGKELVKGAVTALSQIGGQGFLIAAQGQKCMVRGLKEDGS 1151
Query: 358 LTGIAFIDTEVYIASMVSVK--NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNS 415
L +AF+D + Y+ + +K + ++GD + + Y E +SL +D
Sbjct: 1152 LLPVAFMDMQCYVNVVKELKGTGMCIMGDAVKGLWFAGYSEEPYKMSLFGKD-------- 1203
Query: 416 KGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKN 475
+GY + + DG ++ +++D D N
Sbjct: 1204 QGYLEVVAAEFLPDGDKLF--------------------------------ILVADSDCN 1231
Query: 476 VVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFF----------KIRCKPSSIS-DAPG 524
+ + Y PE +S+ G RL+ ++ FH+G T K P S+ D+
Sbjct: 1232 LHVLQYDPEDPKSSNGDRLLARSKFHMGHFATTMTLLPRTMVSSEKAMADPDSMEIDSQT 1291
Query: 525 ARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGY 584
+ L S G++G +PE++YRRL LQ+ + H GLNPRA+R +
Sbjct: 1292 ISQQVLI--TSQSGSVGIVTSVPEESYRRLSALQSQLTNSLEHPCGLNPRAYRAVESD-- 1347
Query: 585 YAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEA 633
G RG++DG+L++++L + ++EI ++G+ +I +L I A
Sbjct: 1348 --GTAGRGMLDGNLLYQWLDMGQHRKMEIAARVGAHEWEIKADLEAIGA 1394
>gi|212541400|ref|XP_002150855.1| cleavage and polyadenylation specificity factor subunit A, putative
[Talaromyces marneffei ATCC 18224]
gi|210068154|gb|EEA22246.1| cleavage and polyadenylation specificity factor subunit A, putative
[Talaromyces marneffei ATCC 18224]
Length = 1383
Score = 219 bits (558), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 172/642 (26%), Positives = 291/642 (45%), Gaps = 78/642 (12%)
Query: 14 ETIVQELLTVSLG-LHGNRPLLLVRTQ-HELLIYQAFRHPKGALK----LRFKKLKVLFV 67
ETI ELL LG L P L++RT +L+IY+ F A K L++ K F+
Sbjct: 793 ETIA-ELLIADLGELPTVSPYLIIRTATDDLIIYKPFWENSNAEKSGGSLKYIKETNHFL 851
Query: 68 SDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTID 127
S A R +R S++ GY V + G P + TS+ + + D
Sbjct: 852 PKVSLEAASSASQQR---TPGLRRLSDLGGYAAVVMSGASPNLIVRTSKSLPHVYSIQSD 908
Query: 128 GPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHL 187
+ ++ F+ C +G +Y + + +R L + D WP+R++PL LAY
Sbjct: 909 F-IRGISGFNGAGCKKGLVYVDNERLVRTCQLYNNAQLDFSWPIRRIPLNEQVDHLAYST 967
Query: 188 ETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQ 247
+ TY + T+ + +K +D+ + + P V+ + L +P +W+ I
Sbjct: 968 ASGTYVVGTTHEQD----FKLPDDDELHPEWATEEISLLPKVAYGSIKLINPKTWKVIDS 1023
Query: 248 TNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVP 307
F E + ++N+++E + I +GT Y ED+ RG + +FD+I+VVP
Sbjct: 1024 YTF--SPAERITAVENINLEISEKTGKRKDMIVVGTTYAKGEDIAARGNVYVFDVIDVVP 1081
Query: 308 EPGQPLTKNKIKMIYAKEQKGPVTAICHVA--GFLVTAVGQKIYIWQLKDN-DLTGIAFI 364
+P +P T K+K+I + +G VTA+ + GF++ A GQK + LKD+ L +AFI
Sbjct: 1082 DPDEPGTNLKLKLIGEESIRGAVTAVSGIGGQGFMIVAQGQKCMVRGLKDDGSLLPVAFI 1141
Query: 365 DTEVYIASMVSVK--NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGN 422
D + Y++ + +K + L+GD + + Y E ++L +D
Sbjct: 1142 DVQCYVSVIKELKGTGMCLIGDAFKGLWFTGYSEEPYKMTLFGKDL-------------- 1187
Query: 423 PSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQ 482
+ LE+ D L + + +++D D N+ + Y
Sbjct: 1188 --------------------DELEVVTA------DFLPDGKKLYILVADGDCNLYVLQYD 1221
Query: 483 PEARESNGGHRLIKKTDFHLGQHVNTFFKI-RCKPSSISDAPGARSRFLTWYASL----- 536
PE +S+ G RL+ + FH+G +T + R SS + S + Y L
Sbjct: 1222 PEDPKSSNGDRLLNRCKFHMGHFASTLTLLPRTAVSSELAVMSSDSMDIDSYTPLYQALI 1281
Query: 537 ---DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGI 593
G++ L E++YRRL LQ+ + H GLNPRA+R+ + G RG+
Sbjct: 1282 TTQSGSMALITSLSEESYRRLTALQSQLSNTLEHPCGLNPRAYRSVESDGVVG----RGM 1337
Query: 594 IDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALS 635
IDG L+ ++L LS +LEI ++G+ +I D+EA+S
Sbjct: 1338 IDGKLLMRWLDLSRSRKLEIAGRVGADEWEI---RADLEAVS 1376
>gi|225679191|gb|EEH17475.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb03]
Length = 1377
Score = 219 bits (558), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 181/647 (27%), Positives = 294/647 (45%), Gaps = 101/647 (15%)
Query: 17 VQELLTVSLGLHGNR-PLLLVRTQ-HELLIYQAFRHPKGALK----LRFKKL------KV 64
+ E+L LG +R P L++R+ +EL++Y+ + + K LRF K+ K
Sbjct: 793 LTEILVADLGDSVSRTPYLILRSNSNELILYEPYHIVQSTEKRLSDLRFLKIANHHFPKF 852
Query: 65 LFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPM 124
L S+ ++ L R +R ++ GY+ VF+ G P F+ H M
Sbjct: 853 LPESNLGNLSDSDRQLAR-----PLRALGDVCGYRTVFMPGNSPC--FIIKSATSIPHVM 905
Query: 125 TIDG-PVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFL 183
+ G V +L+ F+ C +GF+Y + + +R+ P + +D W RK+ L +
Sbjct: 906 NLRGKTVHSLSSFNIPACEKGFVYVDTDNVVRMCRFPRNTHFDGSWAARKIGLGEQVDSV 965
Query: 184 AYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWE 243
Y ++TY + TS D+ P D P ++ V L +P +W
Sbjct: 966 EYSSSSETYVLGTSQ---KVDFKL-----------PEDDEIHPEWRNEESVKLLNPRTWS 1011
Query: 244 EIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDII 303
I ++ L E V+C+K +++E + IA+GT ED+ RG I +F++I
Sbjct: 1012 II--DSYQLRTAERVMCVKCLNLEASEITHERKEMIAVGTALTRGEDIAARGCIYVFEVI 1069
Query: 304 EVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA--GFLVTAVGQKIYIWQLK-DNDLTG 360
+VVPE +P T K+K+I +E KG +T++ + GFL+ A GQK + LK D L
Sbjct: 1070 KVVPEVDRPETNRKLKLIAKEEVKGAITSLSGIGGQGFLIAAQGQKCIVRGLKEDGSLLP 1129
Query: 361 IAFIDTEVYIASMVSVK--NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGY 418
+AF+D + Y++ + +K + ++GD + + Y E LSL ++D
Sbjct: 1130 VAFMDMQCYVSVLKELKGTGMCIMGDALKGLWFAGYSEEPYKLSLFSKD----------- 1178
Query: 419 YAGNPSRGIIDGSL-VWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVV 477
DGSL V L G+RL I M++D D N+
Sbjct: 1179 ----------DGSLQVMAADFLPHGKRLFI--------------------MVADDDCNIH 1208
Query: 478 LFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFL------- 530
+ Y PE S G RL+ ++ FH GQ +T + + S +S P A + +
Sbjct: 1209 VLQYDPEDPGSAKGDRLLHRSTFHTGQFAST-LTLLPRTSVLSQGPEAEANAMDLDSSGP 1267
Query: 531 ---TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAG 587
S G++ P+ E YRRL LQ+ M+ H GLNPRAFR + G
Sbjct: 1268 LHQVLVTSETGSIALITPVSEMAYRRLSALQSQMINTLEHPCGLNPRAFRAVESDGIGG- 1326
Query: 588 NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEAL 634
RG++DG LV K+L L + EI ++G+ D+ + D+EA+
Sbjct: 1327 ---RGMVDGDLVQKWLDLGTQRKAEIASRVGA---DVWEIRADLEAI 1367
>gi|255948500|ref|XP_002565017.1| Pc22g10080 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211592034|emb|CAP98296.1| Pc22g10080 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 1392
Score = 219 bits (557), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 177/650 (27%), Positives = 308/650 (47%), Gaps = 81/650 (12%)
Query: 5 RSHSPSAMDETIVQELLTVSLGLHGNRPLLLVRTQHE-LLIYQAFRHPK----GALKLR- 58
RS++ M E +V +L S G P L+VRT+++ L+ Y+ P G+ +L+
Sbjct: 793 RSNTRETMTEFVVADLGDSS----GLSPYLIVRTENDDLVFYKPSLIPANDGHGSSRLQL 848
Query: 59 FKKLKVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGE 118
F+ + S A+ Q + + R+ +R NI+G+ +F+ G +++F T++
Sbjct: 849 FRDSNHVLPKSPSGEASSQ--IQKQQRLRPLRILPNISGFSTIFMPGASSSFVFRTAKSS 906
Query: 119 LRAHPMTIDGPVST-LAPFHNVNCPR--GFLYFNAKSELRISVLPTHLSYDAPWPVRKVP 175
H + + G + L+ F +V+ R GF+Y ++++ +R LP+ +D PW +RKVP
Sbjct: 907 --PHIIRLRGGFTRWLSSFDSVDTGRDNGFIYVDSQNCVRACQLPSQTQFDYPWTLRKVP 964
Query: 176 LKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVS 235
++ FLAY ++TY + TS D+ G+D + F P + + +
Sbjct: 965 IEEQVDFLAYSTSSETYVLGTSR---EGDFKLPEGDDLHPEWRNEELSFCPK-IPESSIK 1020
Query: 236 LFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRG 295
+ SP +W I ++PL E V +KNV++E R I +GT ED+ RG
Sbjct: 1021 VVSPKTWTII--DSYPLDPDEQVTAVKNVNIEVSENTHERRDLIVVGTAIVKGEDMPARG 1078
Query: 296 RILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA--GFLVTAVGQKIYIWQL 353
I +FD+I+V P+P +P T +K+K+I + KG VTA+ + GF++ A GQK + L
Sbjct: 1079 TIYVFDVIKVAPDPEKPETGHKLKLIGKESVKGAVTALSGIGGQGFVIVAQGQKCMVRGL 1138
Query: 354 K-DNDLTGIAFIDTEVYIASMVSVK--NLILVGDYARSIALLRYQPEYRTLSLVARDYKP 410
K D L +AF+D + Y+ +K L+++GD + + Y E ++L +D
Sbjct: 1139 KEDGSLLPVAFMDMQCYVTVAKELKGTGLVILGDAVKGLWFAGYSEEPYRMTLFGKD--- 1195
Query: 411 TQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMIS 470
E LE+ D L + + + +++
Sbjct: 1196 -------------------------------PEYLEVVAA------DFLPDGNKLYMLVA 1218
Query: 471 DKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSIS---------D 521
D D N+ + Y PE +S+ G RL+ ++ F+ G ++ + S D
Sbjct: 1219 DSDCNLHVLQYDPEDPKSSNGDRLLSRSKFYTGNFASSVTLLPRTAVSSERTESSEEGMD 1278
Query: 522 APGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKG 581
+R AS +G+L + E++YRRL LQ+ ++ H GLNPRAFR +
Sbjct: 1279 LDETFARHQVLIASQNGSLALVTSVAEESYRRLSALQSQLINTVDHPAGLNPRAFRAIES 1338
Query: 582 KGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI 631
G AG RG++DG+L+ +L + + EI ++G+ +I +L I
Sbjct: 1339 DG-AAG---RGMVDGNLLRLWLNMGKQRQTEIAGRVGATEWEIKADLETI 1384
>gi|452001482|gb|EMD93941.1| hypothetical protein COCHEDRAFT_1129958 [Cochliobolus heterostrophus
C5]
Length = 1385
Score = 219 bits (557), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 179/623 (28%), Positives = 280/623 (44%), Gaps = 63/623 (10%)
Query: 10 SAMDETIVQELLTVSLGLHGNR-PLLLVRTQHE-LLIYQAFRHP-KGALKLRFKKLKVLF 66
SA+ TI E+L LG + P L+VRT + L+IY+AF P + A L K L+ +
Sbjct: 788 SAIKATIT-EILAADLGDATTKSPHLIVRTSSDNLVIYKAFHSPSRSAADLWTKNLRWVK 846
Query: 67 VSDRS-KRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMT 125
+S + R E G S + S+I GY VF G PA++F S R ++
Sbjct: 847 LSQQHIPRYTEDGGAEDSGFESTLLTLSDIGGYSTVFQRGTTPAFIFKESSSAPRVIGLS 906
Query: 126 IDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYD-APWPVRKVPLKCTPHFLA 184
PV +L FH +C RGF Y ++ LRIS LP Y W R++P+ H LA
Sbjct: 907 -GKPVKSLTSFHTSSCQRGFAYLDSTDTLRISQLPPQTHYGHLGWATRRMPMDAEIHALA 965
Query: 185 YHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEE 244
YH + + T +P + Y+ + + P++ P + + + L +W
Sbjct: 966 YH---SSGLYIVGTGQP--EEYQLDPSETYHYELPKEDMSFKPTIERGIIKLLDEKTWTI 1020
Query: 245 IPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIE 304
I L E VL +K +++E + +A+GT + ED+ +G I +F++I
Sbjct: 1021 I--DTHVLDPQEVVLSIKTLNLEVSENTHQRKDLVAVGTAILHGEDLATKGCIRIFEVIT 1078
Query: 305 VVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA--GFLVTAVGQKIYIWQLK-DNDLTGI 361
VVPEP +P T ++K+I E KG V+AI + GF++ A GQK + LK D L +
Sbjct: 1079 VVPEPDRPETNKRLKLIVKDEVKGAVSAISELGTQGFMIMAQGQKCMVRGLKEDGTLLPV 1138
Query: 362 AFIDTEVYIASMVSV--KNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYY 419
AF+D + Y++ + ++ ++ + D R + Y E +SL AR +
Sbjct: 1139 AFMDMQCYVSDLKNLPGTGMLAMSDAYRGVWFTGYTEEPYRMSLFARSKHSLE------- 1191
Query: 420 AGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLF 479
++ F+ E+L + +++D D N+ +
Sbjct: 1192 -----------AIAIDFIPFE--EQLHL--------------------LVADADMNLQVL 1218
Query: 480 MYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI--RCKPSSISDAPGARSRFLTWYASLD 537
+ P+ +S G RL+ K+ FH G T + R K S SD + S
Sbjct: 1219 QFDPDNPKSEAGSRLLHKSTFHTGHFPATLHVVHSRLKMPSASDFAATQPLHQILCTSQS 1278
Query: 538 GALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYK--GKGYYAGNPSRGIID 595
G L PL E YRRL L + T GLNPRAFR G+ AG +RG++D
Sbjct: 1279 GTLALVTPLSEDTYRRLSNLSAYLSNTLDATAGLNPRAFRASDTPDGGWDAGTGARGMLD 1338
Query: 596 GSLVWKFLQLSLGERLEICKKIG 618
G+L+ ++ +L R E K G
Sbjct: 1339 GNLLMRWGELGERGRREGLAKYG 1361
>gi|146324727|ref|XP_747211.2| cleavage and polyadenylation specificity factor subunit A, putative
[Aspergillus fumigatus Af293]
gi|148886828|sp|Q4WCL1.2|CFT1_ASPFU RecName: Full=Protein cft1; AltName: Full=Cleavage factor two protein
1
gi|129556124|gb|EAL85173.2| cleavage and polyadenylation specificity factor subunit A, putative
[Aspergillus fumigatus Af293]
Length = 1401
Score = 218 bits (555), Expect = 8e-54, Method: Compositional matrix adjust.
Identities = 181/651 (27%), Positives = 309/651 (47%), Gaps = 94/651 (14%)
Query: 16 IVQELLTVSLGLHGN-RPLLLVRTQ-HELLIYQAF-RHPKGA--LKLRFKK-----LKVL 65
++ E + LG N P L++RT+ +L+IY+AF + KG +L F K L +
Sbjct: 806 VLSEAVIADLGESWNPSPHLILRTESDDLVIYKAFASYIKGESHTRLSFVKESNHTLPRV 865
Query: 66 FVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMT 125
S++ ++NE+ PR +RI NI+ + VF+ G +++ T++ H
Sbjct: 866 TTSEKEMQSNEKLSRPRSLRI-----LPNISNFSAVFMPGRPASFILKTAKS--CPHVFR 918
Query: 126 IDGP-VSTLAPFH--NVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHF 182
+ G V +L+ F + + GF+Y ++K LRI P+ +D W +RK+ +
Sbjct: 919 LRGEFVRSLSIFDLASPSLDTGFIYVDSKDVLRICRFPSETLFDYTWALRKISIGEQVDH 978
Query: 183 LAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDS----RFIPPLVSQFHVSLFS 238
LAY ++TY + TS S D+ +D EL D R+ F+P L Q + + S
Sbjct: 979 LAYATSSETYVLGTSH---SADFKL--PDDDELHPDWRNEGLVISFLPEL-RQCSLKVVS 1032
Query: 239 PFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRIL 298
P +W I ++ L E+V+ +KN+ +E R I +GT + ED+ RG I
Sbjct: 1033 PRTWTVI--DSYSLGPDEYVMAVKNMDLEVSENTHERRNMIVVGTAFARGEDIPSRGCIY 1090
Query: 299 LFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA--GFLVTAVGQKIYIWQLK-D 355
+F++I+VVP+P +P T K+K+I + KG VTA+ + GFL+ A GQK + LK D
Sbjct: 1091 VFEVIKVVPDPEKPETDRKLKLIGKELVKGAVTALSQIGGQGFLIAAQGQKCMVRGLKED 1150
Query: 356 NDLTGIAFIDTEVYIASMVSVK--NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQP 413
L +AF+D + Y+ + +K + ++GD + + Y E +SL +D
Sbjct: 1151 GSLLPVAFMDMQCYVNVLKELKGTGMCIMGDAVKGLWFAGYSEEPYKMSLFGKD------ 1204
Query: 414 NSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKD 473
+GY + + DG ++ +++D D
Sbjct: 1205 --QGYLEVVAAEFLPDGDKLF--------------------------------ILVADSD 1230
Query: 474 KNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFF----------KIRCKPSSIS-DA 522
N+ + Y PE +S+ G RL+ ++ FH+G T K P S+ D+
Sbjct: 1231 CNLHVLQYDPEDPKSSNGDRLLARSKFHMGHFATTMTLLPRTMVSSEKAMANPDSMEIDS 1290
Query: 523 PGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGK 582
+ L S G++G +PE++YRRL LQ+ + H GLNPRA+R +
Sbjct: 1291 QTISQQVLI--TSQSGSVGIVTSVPEESYRRLSALQSQLANSLEHPCGLNPRAYRAVESD 1348
Query: 583 GYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEA 633
G RG++DG+L++++L + ++EI ++G+ +I +L I A
Sbjct: 1349 ----GTAGRGMLDGNLLYQWLDMGQHRKMEIAARVGAHEWEIKADLEAIGA 1395
>gi|295665178|ref|XP_002793140.1| cleavage and polyadenylation specificity factor subunit A
[Paracoccidioides sp. 'lutzii' Pb01]
gi|226278054|gb|EEH33620.1| cleavage and polyadenylation specificity factor subunit A
[Paracoccidioides sp. 'lutzii' Pb01]
Length = 1408
Score = 218 bits (555), Expect = 9e-54, Method: Compositional matrix adjust.
Identities = 180/646 (27%), Positives = 298/646 (46%), Gaps = 89/646 (13%)
Query: 17 VQELLTVSLGLHGNR-PLLLVRTQ-HELLIYQAFRHPKGALKLRFKKLKVLFVSD----- 69
+ E+L LG +R P L +R+ +EL++Y+ + H + + R L+ + +++
Sbjct: 816 LTEILVADLGDSVSRTPYLTLRSNSNELILYEPY-HTVQSTEKRLSDLRFVKIANHHFPK 874
Query: 70 ---RSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTI 126
S N G + VR +R ++ GY+ VF+ G P F+ H M +
Sbjct: 875 FLPESNLGNLSDGDRQLVR--PLRALGDVCGYRTVFMPGNSPC--FIIKSATSIPHVMNL 930
Query: 127 DG-PVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAY 185
G V +L+ F+ C +GF+Y + + +R+ P + +D W RK+ L + Y
Sbjct: 931 RGKTVHSLSSFNIPACEKGFVYVDTDNVVRMCRFPRNTHFDGSWAARKIGLGEQVDSVEY 990
Query: 186 HLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRF-IPPLVSQFHVSLFSPFSWEE 244
++TY + TS D+ ED E+ + R+ P + + V L +P +W
Sbjct: 991 SSSSETYVLGTSQ---KVDFKL--PEDDEIHPEWRNEVISFFPQIDKGSVKLLNPRTWSI 1045
Query: 245 IPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIE 304
I ++ L E V+C+K +++E + IA+GT ED+ RG I +F++I+
Sbjct: 1046 I--DSYQLRTSERVMCVKCLNLEASEITHERKEMIAVGTALTRGEDIAARGCIYVFEVIK 1103
Query: 305 VVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA--GFLVTAVGQKIYIWQLK-DNDLTGI 361
VVPE +P T K+K+I +E KG +T++ + GFL+ A GQK + LK D L +
Sbjct: 1104 VVPEVDRPETNRKLKLIAKEEVKGAITSLSGIGGQGFLIAAQGQKCIVRGLKEDGSLLPV 1163
Query: 362 AFIDTEVYIASMVSVK--NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYY 419
AF+D + Y++ + +K + ++GD + + Y E LSL ++D
Sbjct: 1164 AFMDMQCYVSVLKELKGTGMCIMGDALKGLWFAGYSEEPYKLSLFSKD------------ 1211
Query: 420 AGNPSRGIIDGSL-VWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVL 478
DGSL V L G+RL I M++D D N+ +
Sbjct: 1212 ---------DGSLQVMAADFLPDGKRLYI--------------------MVADDDCNIHV 1242
Query: 479 FMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFL-------- 530
Y PE S G RL+ ++ FH GQ +T + + S +S P + +
Sbjct: 1243 LQYDPEDPGSAKGDRLLHRSTFHTGQFAST-LTLLPRTSVLSQGPETEANAMDLDLSGPL 1301
Query: 531 --TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGN 588
S G++ P+ E YRRL LQ+ M+ H GLNPRAFR + G
Sbjct: 1302 HQVLVTSETGSIALITPVSEMAYRRLSALQSQMINTLEHPCGLNPRAFRAVESDGIGG-- 1359
Query: 589 PSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEAL 634
RG++DG LV K+L L + EI ++G+ D+ + D+EA+
Sbjct: 1360 --RGMVDGDLVQKWLDLGTQRKAEIASRVGA---DVWEIRADLEAI 1400
>gi|258575565|ref|XP_002541964.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
gi|237902230|gb|EEP76631.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
Length = 1376
Score = 218 bits (554), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 176/638 (27%), Positives = 295/638 (46%), Gaps = 94/638 (14%)
Query: 17 VQELLTVSLGLHGNR-PLLLVRTQHE-LLIYQAFRHPKGALK---LRFKKLKVLFVSDRS 71
+ E+L LG +R P +++RT H+ L+IYQ + + K +L+ LRF K+ F+
Sbjct: 803 LSEVLMADLGDSISRQPYMILRTTHDDLVIYQPY-YTKPSLEQPELRFLKITDYFLPKVD 861
Query: 72 KRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDG-PV 130
+N +++R ++ GY+ +F+ G +P ++ +S H + + G PV
Sbjct: 862 PASNMDN--TNRTSFARLRAIPDLCGYKTMFMPGSNPCFIMKSSTSS--PHVLRLKGEPV 917
Query: 131 STLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETK 190
S+L+ FH C +GF Y +AK+ +R+ LP + +D W RK+ + + Y ++
Sbjct: 918 SSLSSFHMPACEKGFAYVDAKNMVRMCRLPGNTRFDNAWAARKIHIGEQVDCVEYFARSE 977
Query: 191 TYCIVTSTAEPSTDYYKFNGEDKELVTDPRDS--RFIPPLVSQFHVSLFSPFSWEEIPQT 248
TY + TS E +K ED E+ T+ R F+P L + V L SP +W I
Sbjct: 978 TYVLGTSYHED----FKLP-EDDEVHTEWRSEVISFMPQL-DRGRVKLLSPRTWSII--D 1029
Query: 249 NFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPE 308
+ L E +LCLK ++ME + + +GT ED+T RG I +F+II+V P+
Sbjct: 1030 CYDLGATERILCLKTINMEVSEITHERQDMVVVGTAIVRGEDITPRGSIYVFEIIDVAPD 1089
Query: 309 PGQPLTKNKIKMIYAKEQKGPVTAICHVA--GFLVTAVGQKIYIWQLK-DNDLTGIAFID 365
P +P T K K+ ++ KG VTAI + GFL+ A GQK + LK D L +AF+D
Sbjct: 1090 PDRPETNQKFKLFAKEDVKGAVTAISGIGGQGFLIAAQGQKCLVRGLKEDGSLLPVAFMD 1149
Query: 366 TEVYIASMVSVK--NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNP 423
+ Y++ + ++ L ++GD + + Y
Sbjct: 1150 MQCYVSVLKELQGTGLCIMGDALKGLWFTGYS---------------------------- 1181
Query: 424 SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQP 483
+QLS +E C+ + + F + VV + P
Sbjct: 1182 -------------VQLSSAVDVETCE----------EPYKLTLFGKDSEYLQVVAADFLP 1218
Query: 484 EARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGAR----------SRFLTWY 533
+ S+ G RL+ ++ FH G ++T I P S GA + +
Sbjct: 1219 DDPSSSKGDRLLHRSSFHTGHFISTLTLI---PQYTSSGTGASEDNMDVDYMPAGYQVVV 1275
Query: 534 ASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGI 593
S G++G PL E+ YRRL LQ+ +V H GLNP+A+R + G+ RG+
Sbjct: 1276 TSQSGSVGVITPLTEETYRRLSALQSQLVMSMEHPCGLNPKAYRAVESDGFSG----RGL 1331
Query: 594 IDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI 631
+DG+L+ ++L + + + EI ++G+ I +L I
Sbjct: 1332 VDGNLLLRWLDMGVQRKAEIAGRVGADLQSIRADLERI 1369
>gi|159123784|gb|EDP48903.1| cleavage and polyadenylation specificity factor subunit A, putative
[Aspergillus fumigatus A1163]
Length = 1401
Score = 218 bits (554), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 181/651 (27%), Positives = 309/651 (47%), Gaps = 94/651 (14%)
Query: 16 IVQELLTVSLGLHGN-RPLLLVRTQ-HELLIYQAF-RHPKGA--LKLRFKK-----LKVL 65
++ E + LG N P L++RT+ +L+IY+AF + KG +L F K L +
Sbjct: 806 VLSEAVIADLGESWNPSPHLILRTESDDLVIYKAFASYIKGESHTRLSFVKESNHTLPRV 865
Query: 66 FVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMT 125
S++ ++NE+ PR +RI NI+ + VF+ G +++ T++ H
Sbjct: 866 TTSEKEMQSNEKLSRPRSLRI-----LPNISNFSAVFMPGRPASFILKTAKS--CPHVFR 918
Query: 126 IDGP-VSTLAPFH--NVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHF 182
+ G V +L+ F + + GF+Y ++K LRI P+ +D W +RK+ +
Sbjct: 919 LRGEFVRSLSIFDLASPSLDTGFIYVDSKDVLRICRFPSDTLFDYTWALRKISIGEQVDH 978
Query: 183 LAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDS----RFIPPLVSQFHVSLFS 238
LAY ++TY + TS S D+ +D EL D R+ F+P L Q + + S
Sbjct: 979 LAYATSSETYVLGTSH---SADFKL--PDDDELHPDWRNEGLVISFLPEL-RQCSLKVVS 1032
Query: 239 PFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRIL 298
P +W I ++ L E+V+ +KN+ +E R I +GT + ED+ RG I
Sbjct: 1033 PRTWTVI--DSYSLGPDEYVMAVKNMDLEVSENTHERRNMIVVGTAFARGEDIPSRGCIY 1090
Query: 299 LFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA--GFLVTAVGQKIYIWQLK-D 355
+F++I+VVP+P +P T K+K+I + KG VTA+ + GFL+ A GQK + LK D
Sbjct: 1091 VFEVIKVVPDPEKPETDRKLKLIGKELVKGAVTALSQIGGQGFLIAAQGQKCMVRGLKED 1150
Query: 356 NDLTGIAFIDTEVYIASMVSVK--NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQP 413
L +AF+D + Y+ + +K + ++GD + + Y E +SL +D
Sbjct: 1151 GSLLPVAFMDMQCYVNVLKELKGTGMCIMGDAVKGLWFAGYSEEPYKMSLFGKD------ 1204
Query: 414 NSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKD 473
+GY + + DG ++ +++D D
Sbjct: 1205 --QGYLEVVAAEFLPDGDKLF--------------------------------ILVADSD 1230
Query: 474 KNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFF----------KIRCKPSSIS-DA 522
N+ + Y PE +S+ G RL+ ++ FH+G T K P S+ D+
Sbjct: 1231 CNLHVLQYDPEDPKSSNGDRLLARSKFHMGHFATTMTLLPRTMVSSEKAMANPDSMEIDS 1290
Query: 523 PGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGK 582
+ L S G++G +PE++YRRL LQ+ + H GLNPRA+R +
Sbjct: 1291 QTISQQVLI--TSQSGSVGIVTSVPEESYRRLSALQSQLANSLEHPCGLNPRAYRAVESD 1348
Query: 583 GYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEA 633
G RG++DG+L++++L + ++EI ++G+ +I +L I A
Sbjct: 1349 ----GTAGRGMLDGNLLYQWLDMGQHRKMEIAARVGAHEWEIKADLEAIGA 1395
>gi|239611898|gb|EEQ88885.1| protein CFT1 [Ajellomyces dermatitidis ER-3]
gi|327352847|gb|EGE81704.1| CFT1 [Ajellomyces dermatitidis ATCC 18188]
Length = 1402
Score = 217 bits (553), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 181/644 (28%), Positives = 292/644 (45%), Gaps = 85/644 (13%)
Query: 17 VQELLTVSLGLHGNR-PLLLVRT-QHELLIYQAFRH----PKGALKLRFKKLKVLFVSDR 70
+ E+L +G +R P L++R+ ++L++Y+ + K + LRF K
Sbjct: 810 LTEILVADIGDSVSRTPYLILRSSNNDLILYEPYHTTHSTEKKSSDLRFLKTINHHFPKF 869
Query: 71 SKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDG-P 129
+N + G +R ++ GY+ VF+ G P ++ +S H + + G
Sbjct: 870 HAGSNVEDSSHIGALPKPLRVLGDVCGYRTVFMPGNSPCFVIKSSTS--IPHVLNLRGKT 927
Query: 130 VSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLET 189
V +L+ F+ C RGF+Y +A + +R+ P + +D W RK+ L + Y +
Sbjct: 928 VHSLSSFNIPACERGFVYVDADNVVRMCRFPRNTHFDGSWATRKIGLGEQVDIVEYSSSS 987
Query: 190 KTYCIVTSTAEPSTDYYKFN-GEDKELVTDPRDS--RFIPPLVSQFHVSLFSPFSWEEIP 246
+TY I TS FN ED E+ + R+ F+P + Q V L SP +W I
Sbjct: 988 ETYVIGTSQK------VDFNLPEDDEIHPEWRNEVISFLPQ-IDQGSVKLLSPRTWSII- 1039
Query: 247 QTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVV 306
+ L E ++C+K + +E R IA+GT ED+ RG I +F++IEVV
Sbjct: 1040 -DSHTLRTAERIMCVKCLDLEVSEITHERRDMIAVGTAVTRGEDIAARGCIYIFEVIEVV 1098
Query: 307 PEPGQPLTKNKIKMIYAKEQKGPVTAICHVA--GFLVTAVGQKIYIWQLK-DNDLTGIAF 363
PE +P T K+K+I +E KG VT++ + GFL+ A GQK + LK D L +AF
Sbjct: 1099 PEVDRPETNRKLKLIAKEEVKGAVTSLSGIGGQGFLIAAQGQKCIVRGLKEDGSLLPVAF 1158
Query: 364 IDTEVYIASMVSVK--NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAG 421
+D + Y+ + +K + ++GD + I Y E LSL ++D
Sbjct: 1159 MDMQCYVNVLKELKGTGMCIMGDALKGIWFAGYSEEPYKLSLFSKD-------------- 1204
Query: 422 NPSRGIIDGSL-VWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFM 480
DG+L V L G+RL I +++D D N+ +
Sbjct: 1205 -------DGTLQVMAADFLPDGKRLYI--------------------LVADDDCNIHVLQ 1237
Query: 481 YQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFL---------- 530
Y PE S+ G RL+ ++ FH G +T + + P A +
Sbjct: 1238 YDPEDPGSSKGDRLLHRSTFHTGHFASTMTLLPRTIIPSAQGPDANPDMMELDSSGPLYH 1297
Query: 531 TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPS 590
S G++ PL E YRRL LQ+ ++ H GLNPRAFR + G
Sbjct: 1298 VLVTSETGSIALITPLSETAYRRLSALQSQLINTLEHPCGLNPRAFRAIESDGIGG---- 1353
Query: 591 RGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEAL 634
RG++DG L+ ++L L + EI ++G+ DI + D+EA+
Sbjct: 1354 RGMVDGDLLHRWLDLGTQRKAEIAHRVGA---DIWEIRADLEAI 1394
>gi|261201748|ref|XP_002628088.1| protein CFT1 [Ajellomyces dermatitidis SLH14081]
gi|239590185|gb|EEQ72766.1| protein CFT1 [Ajellomyces dermatitidis SLH14081]
Length = 1403
Score = 217 bits (552), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 181/644 (28%), Positives = 292/644 (45%), Gaps = 85/644 (13%)
Query: 17 VQELLTVSLGLHGNR-PLLLVRT-QHELLIYQAFRH----PKGALKLRFKKLKVLFVSDR 70
+ E+L +G +R P L++R+ ++L++Y+ + K + LRF K
Sbjct: 811 LTEILVADIGDSVSRTPYLILRSSNNDLILYEPYHTTHSTEKKSSDLRFLKTINHHFPKF 870
Query: 71 SKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDG-P 129
+N + G +R ++ GY+ VF+ G P ++ +S H + + G
Sbjct: 871 HAGSNVEDSSHIGALPKPLRVLGDVCGYRTVFMPGNSPCFVIKSSTS--IPHVLNLRGKT 928
Query: 130 VSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLET 189
V +L+ F+ C RGF+Y +A + +R+ P + +D W RK+ L + Y +
Sbjct: 929 VHSLSSFNIPACERGFVYVDADNVVRMCRFPRNTHFDGSWATRKIGLGEQVDIVEYSSSS 988
Query: 190 KTYCIVTSTAEPSTDYYKFN-GEDKELVTDPRDS--RFIPPLVSQFHVSLFSPFSWEEIP 246
+TY I TS FN ED E+ + R+ F+P + Q V L SP +W I
Sbjct: 989 ETYVIGTSQK------VDFNLPEDDEIHPEWRNEVISFLPQ-IDQGSVKLLSPRTWSII- 1040
Query: 247 QTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVV 306
+ L E ++C+K + +E R IA+GT ED+ RG I +F++IEVV
Sbjct: 1041 -DSHTLRTAERIMCVKCLDLEVSEITHERRDMIAVGTAVTRGEDIAARGCIYIFEVIEVV 1099
Query: 307 PEPGQPLTKNKIKMIYAKEQKGPVTAICHVA--GFLVTAVGQKIYIWQLK-DNDLTGIAF 363
PE +P T K+K+I +E KG VT++ + GFL+ A GQK + LK D L +AF
Sbjct: 1100 PEVDRPETNRKLKLIAKEEVKGAVTSLSGIGGQGFLIAAQGQKCIVRGLKEDGSLLPVAF 1159
Query: 364 IDTEVYIASMVSVK--NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAG 421
+D + Y+ + +K + ++GD + I Y E LSL ++D
Sbjct: 1160 MDMQCYVNVLKELKGTGMCIMGDALKGIWFAGYSEEPYKLSLFSKD-------------- 1205
Query: 422 NPSRGIIDGSL-VWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFM 480
DG+L V L G+RL I +++D D N+ +
Sbjct: 1206 -------DGTLQVMAADFLPDGKRLYI--------------------LVADDDCNIHVLQ 1238
Query: 481 YQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFL---------- 530
Y PE S+ G RL+ ++ FH G +T + + P A +
Sbjct: 1239 YDPEDPGSSKGDRLLHRSTFHTGHFASTMTLLPRTIIPSAQGPDANPDMMELDSSGPLYH 1298
Query: 531 TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPS 590
S G++ PL E YRRL LQ+ ++ H GLNPRAFR + G
Sbjct: 1299 VLVTSETGSIALITPLSETAYRRLSALQSQLINTLEHPCGLNPRAFRAIESDGIGG---- 1354
Query: 591 RGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEAL 634
RG++DG L+ ++L L + EI ++G+ DI + D+EA+
Sbjct: 1355 RGMVDGDLLHRWLDLGTQRKAEIAHRVGA---DIWEIRADLEAI 1395
>gi|296806499|ref|XP_002844059.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
gi|238845361|gb|EEQ35023.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
Length = 1348
Score = 217 bits (552), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 177/640 (27%), Positives = 295/640 (46%), Gaps = 86/640 (13%)
Query: 8 SPSAMDETIVQELLTVSLGLHGNRPLLLVRTQHE-LLIYQAFR--HPKGALKLRFKKL-- 62
S S + + +L L +GN L +RT+H+ L++Y+ +R G +LRF K
Sbjct: 748 SESGTENIVGNNVLLFLLDGNGN---LSLRTKHDDLILYEPYRVTGENGESRLRFLKAVN 804
Query: 63 KVLFVSDRSKRANEQPG---LPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGEL 119
V+ S K AN G PR +R S+I GY+ VF+ G +P F+
Sbjct: 805 HVVMRSHSEKAANVVEGKHPFPR----KPLRALSDICGYKTVFMPGQNPC--FILKSAIT 858
Query: 120 RAHPMTIDG-PVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKC 178
+ H + + G V +L+ FH C RGF Y + + +R+S LP++ +D+ W RK+PL
Sbjct: 859 QPHVLRLRGKAVQSLSGFHIAACERGFAYVDEDNIIRMSRLPSNTRFDSTWATRKIPLGE 918
Query: 179 TPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDS--RFIPPLVSQFHVSL 236
+ Y +++Y I TS E +K ED E T+ ++ F+P L + V L
Sbjct: 919 QVDCIVYSSASESYVIGTSVKED----FKLP-EDDESHTEWQNEFITFLPQL-ERGTVKL 972
Query: 237 FSPFSWE--EIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCR 294
P +W +I ++ L E + C++ + +E + + +G+ ED+ +
Sbjct: 973 LDPKNWSIADIAPSSHELEPAERITCIEVIRLEISEITHERKDMVVVGSAIVKGEDIVPK 1032
Query: 295 GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA--GFLVTAVGQKIYIWQ 352
G I +F+II+VVP+P ++K+ +E KG VTA+ + GFL+ A GQK +
Sbjct: 1033 GCIRVFEIIDVVPDPDHSEMNKRLKLFAREEVKGAVTALSGIGSQGFLIVAQGQKCMVRG 1092
Query: 353 LK-DNDLTGIAFIDTEVYIASMVSVK--NLILVGDYARSIALLRYQPEYRTLSLVARDYK 409
LK D L +AF D + Y++ + +K + +VGD + + Y E L L ++
Sbjct: 1093 LKEDGSLLPVAFKDAQCYVSVLKELKGTGMCIVGDAIKGLWFTGYSEEPYKLDLFGKE-- 1150
Query: 410 PTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMI 469
N + +I L G RL + ++
Sbjct: 1151 ------------NENIAVIAADF------LPDGNRLYV--------------------LV 1172
Query: 470 SDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI----------RCKPSSI 519
+D D N+ + Y PE S+ G RL+ + FH+G +T + + +
Sbjct: 1173 ADDDCNLHVLQYDPEDPSSSKGDRLLHRNVFHVGHFASTMTLLPQGSHTPHSPADRDAMD 1232
Query: 520 SDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTY 579
+DAP S++ G++G PL E +YRRLL LQ+ +V H GLNPR +R
Sbjct: 1233 TDAPLPPSKYQILMTFQTGSVGIITPLNEDSYRRLLALQSQLVNALEHPCGLNPRGYRAV 1292
Query: 580 KGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGS 619
+ G RG+IDG+L+ ++L + + EI ++G+
Sbjct: 1293 ESDGIGG---QRGMIDGNLLLRWLDMGAQRKAEIAGRVGA 1329
>gi|326471884|gb|EGD95893.1| protein kinase subdomain-containing protein [Trichophyton tonsurans
CBS 112818]
Length = 1398
Score = 216 bits (550), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 172/650 (26%), Positives = 301/650 (46%), Gaps = 81/650 (12%)
Query: 4 FRSHSPSAMDETIVQELLTVSLG--LHGNRPLLLVRTQHE-LLIYQAFR--HPKGALKLR 58
+ S S ++ + ELL LG +H P +++RT+H+ L++Y+ +R G LR
Sbjct: 795 YESSSRRPVNRETLTELLIADLGDAIH-KSPYMILRTKHDDLVLYEPYRIAGESGHSGLR 853
Query: 59 F-KKLKVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRG 117
F K + + + R+ + +R ++ GY+ VF+ G +P ++ ++
Sbjct: 854 FLKAVNHVVMGPRTDQGVNHDINRSPSSCKLLRALPDVCGYKTVFMSGHNPCFILKSAIA 913
Query: 118 ELRAHPMTIDG-PVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPL 176
R H + + G V +L+ FH C RGF Y + + +R+S LP++ +D+ W RK+ L
Sbjct: 914 --RPHVLRLRGKAVQSLSGFHIAACERGFAYVDEDNVIRMSRLPSNTRFDSGWATRKIAL 971
Query: 177 KCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDS--RFIPPLVSQFHV 234
+ Y ++ Y I TS E +K ED E T+ R+ F+P L + V
Sbjct: 972 GEQVDSIVYSSASECYVIGTSAKED----FKLP-EDDESHTEWRNEFITFLPQL-ERGTV 1025
Query: 235 SLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCR 294
L P +W I + L E + C++ + +E + + +G++ ED+ +
Sbjct: 1026 KLLEPKNWSTI--DSHELKPAERITCIEVIRLEISELTHERKDMVVVGSSIVKGEDIVPK 1083
Query: 295 GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA--GFLVTAVGQKIYIWQ 352
G I +F++I+VVPEP QP K+K+ +E KG VTA+ + GFL+ A GQK +
Sbjct: 1084 GFIRVFEVIDVVPEPDQPEKSKKLKLFAKEEVKGAVTALSGIGGQGFLIVAQGQKCMVRG 1143
Query: 353 LK-DNDLTGIAFIDTEVYIASMVSVK--NLILVGDYARSIALLRYQPEYRTLSLVARDYK 409
LK D L +AF DT+ Y+ + +K + ++GD + + + Y E L L ++
Sbjct: 1144 LKEDGSLLPVAFKDTQCYVNVLKELKGTGMCIIGDAFKGLWFIGYSEEPYKLDLFGKE-- 1201
Query: 410 PTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMI 469
N + ++D D L + + + ++
Sbjct: 1202 ------------NENLAVVDA--------------------------DFLPDGNKLYILV 1223
Query: 470 SDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI---RCKPSSISDA---- 522
+D D N+ + Y PE S+ G RL+ ++ FH G +T + PS+ D
Sbjct: 1224 ADDDCNLHVLQYDPEDPSSSKGDRLLHRSVFHTGHFASTMTLLPHGAYTPSAPVDEDAMD 1283
Query: 523 ----PGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRT 578
P ++ + L + + G++ PL E +YRRLL LQ+ +V H LNPR +R
Sbjct: 1284 TDSLPPSKYQILMTFQT--GSIAVITPLSEDSYRRLLALQSQLVNALEHPCSLNPRGYRA 1341
Query: 579 YKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEL 628
+ G RG+IDG+L+ ++L + + EI ++G+ I +L
Sbjct: 1342 VESDGMGG---QRGMIDGNLLLRWLDMGAQRKAEIAGRVGADVGAIRTDL 1388
>gi|317036382|ref|XP_001398211.2| protein cft1 [Aspergillus niger CBS 513.88]
Length = 1393
Score = 216 bits (550), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 175/637 (27%), Positives = 298/637 (46%), Gaps = 101/637 (15%)
Query: 32 PLLLVRTQ-HELLIYQAFRHPKGALK----LRFKKLKVLFVSDRSKRANEQ-PGLPRGVR 85
P L++R++ +L+IY+ F G ++ L+F SK N P +P GV
Sbjct: 818 PYLILRSETDDLIIYKPFVVSTGPVEGIHSLKF-----------SKETNSVLPRIPPGVS 866
Query: 86 ISQ----------MRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVS-TLA 134
+Q +R +I+G VF+ G ++ TS H + + G S +++
Sbjct: 867 STQPSGSDYRARPLRILPDISGLSAVFMPGASAGFIIRTSASA--PHFLRLRGENSRSVS 924
Query: 135 PFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCI 194
C +GF+Y +++S +R LP +D W +++V L LAY + Y +
Sbjct: 925 SLDTPECSKGFIYLDSQSTVRFCKLPPMTRFDYQWTLKRVHLGEQVDHLAYSTSSGMYVL 984
Query: 195 VTSTAEPSTDYYKFNGEDKELVTDPRDS--RFIPPLVSQFHVSLFSPFSWEEIPQTNFPL 252
T A TD+ ED EL + R+ F P F + L SP +W I +F L
Sbjct: 985 GTCHA---TDFKL--PEDDELHPEWRNEAISFFPSARGSF-IKLVSPNTWSII--DSFSL 1036
Query: 253 HEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQP 312
E+V+ +KN+S+E + I +GT + ED+ RG I +F++++VVP+P P
Sbjct: 1037 GADEYVMAIKNISLEVSENTHERKDMIVVGTAFARGEDIPSRGCIYVFEVVQVVPDPDHP 1096
Query: 313 LTKNKIKMIYAKEQKGPVTAICHVA--GFLVTAVGQKIYIWQLK-DNDLTGIAFIDTEVY 369
T K+K+I + KG VTA+ + GF++ A GQK + LK D L +AF+D + Y
Sbjct: 1097 ETDRKLKLIGKEPVKGAVTALSEIGGQGFVLVAQGQKCMVRGLKEDGSLLPVAFMDMQCY 1156
Query: 370 IASMVSVK--NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGI 427
++ + +K + ++GD + + Y E +SL A+D
Sbjct: 1157 VSVVKELKGTGMCILGDAVKGVWFAGYSEEPYKMSLFAKDL------------------- 1197
Query: 428 IDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARE 487
+ LE+C + L + + +++D D N+ + Y PE +
Sbjct: 1198 ---------------DYLEVCAA------EFLPDGKRLFIVVADSDCNIHVLQYDPEDPK 1236
Query: 488 SNGGHRLIKKTDFHLGQHVNTFFKI-RCKPSS---ISDAPG-----ARSRFLTWYASLDG 538
S+ G RL+ ++ FH+G +T + R SS +S + G + +G
Sbjct: 1237 SSNGDRLLSRSKFHMGNFASTLTLLPRTMVSSEKMVSSSDGMDIDNQSPLHQVLMTTQNG 1296
Query: 539 ALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSL 598
+LG +PE++YRRL LQ+ + H GLNPRAFR + G RG++DG+L
Sbjct: 1297 SLGLITCIPEESYRRLSALQSQLTNTLEHPCGLNPRAFRAVESD----GTAGRGMLDGNL 1352
Query: 599 VWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALS 635
++K++ +S + EI ++G++ +I D+EA+S
Sbjct: 1353 LFKWIDMSKQRKTEIAGRVGAREWEI---KADLEAIS 1386
>gi|225558298|gb|EEH06582.1| conserved hypothetical protein [Ajellomyces capsulatus G186AR]
Length = 1408
Score = 216 bits (550), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 180/648 (27%), Positives = 296/648 (45%), Gaps = 87/648 (13%)
Query: 14 ETIVQELLTVSLGLHGNR-PLLLVRTQH-ELLIYQAFRHPKGALK----LRFKKLKVLFV 67
ETI ELL LG +R P L++R+ + +L++Y+ + + K LRF K+
Sbjct: 813 ETIT-ELLVADLGDSVSRSPYLILRSSNSDLILYEPYHYTSSTEKQFSDLRFVKIANHHF 871
Query: 68 SDRSKRANEQPGLPRGVRISQ-MRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTI 126
+N + +S+ +R ++ GY+ VF+ G P ++ +S H M +
Sbjct: 872 PKFHSESNVEKHPANCTTLSKPLRVLGDVCGYRTVFMPGNSPCFIIKSSTS--IPHVMNL 929
Query: 127 DG-PVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAY 185
G V +L+ F+ C +GF+Y + + +R+ P + +D W RK+ L + Y
Sbjct: 930 RGKTVHSLSSFNIPACEKGFVYVDTDNVVRMCRFPRNTHFDGSWAARKIGLGEQVDAVEY 989
Query: 186 HLETKTYCIVTSTAEPSTDYYKFN-GEDKELVTDPRDS--RFIPPLVSQFHVSLFSPFSW 242
++TY I T+ FN ED E+ + R+ F+P + + V L +P +W
Sbjct: 990 SSSSETYVIGTNQK------VDFNLPEDDEIHPEWRNEVISFLPQ-IDKGSVKLLTPRTW 1042
Query: 243 EEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDI 302
I N L E ++C+K +++E + I +GT ED+ RG I +F++
Sbjct: 1043 SIIDSYN--LRNAERIMCVKCLNLEVSEITHERKDTIVVGTALTKGEDIAARGCIYIFEV 1100
Query: 303 IEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA--GFLVTAVGQKIYIWQLK-DNDLT 359
IEVVPE +P T K+K+I +E KG VT++ + GFL+ A GQK + LK D L
Sbjct: 1101 IEVVPEVDRPETNRKLKLIAKEEVKGAVTSLSGIGGQGFLIAAQGQKCIVRGLKEDGSLL 1160
Query: 360 GIAFIDTEVYIASMVSVK--NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKG 417
+AF+D + Y+ + +K + ++GD + + Y E LSL ++D
Sbjct: 1161 PVAFMDMQCYVNVLKELKGTGMCIMGDALKGLWFAGYSEEPYKLSLFSKD---------- 1210
Query: 418 YYAGNPSRGIIDGSL-VWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNV 476
DG+L V L G RL I +++D D N+
Sbjct: 1211 -----------DGTLQVMAADFLPDGNRLYI--------------------LVADDDCNI 1239
Query: 477 VLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFL------ 530
+ Y PE S+ G RL+ ++ F G +T + +S S P A +
Sbjct: 1240 HVLQYDPEDPGSSKGDRLLHRSTFQTGHFASTMTLLPRTATSSSQGPDADPDMMDLDSSG 1299
Query: 531 ----TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYA 586
S G++ P+ E +YRRL LQ+ + H GLNPRAFR + G
Sbjct: 1300 PLHHVLVTSETGSIALITPVSETSYRRLSALQSQLTNTLEHPCGLNPRAFRAVESDGIGG 1359
Query: 587 GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEAL 634
RG++DG LV ++L L + EI ++G+ D+ + D+EA+
Sbjct: 1360 ----RGMVDGDLVKRWLDLGTQRKAEIANRVGA---DVWEIRADLEAI 1400
>gi|327304811|ref|XP_003237097.1| hypothetical protein TERG_01819 [Trichophyton rubrum CBS 118892]
gi|326460095|gb|EGD85548.1| hypothetical protein TERG_01819 [Trichophyton rubrum CBS 118892]
Length = 1398
Score = 215 bits (547), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 171/645 (26%), Positives = 300/645 (46%), Gaps = 89/645 (13%)
Query: 4 FRSHSPSAMDETIVQELLTVSLG--LHGNRPLLLVRTQHE-LLIYQAFRHPK--GALKLR 58
+ S S ++ + ELL LG +H P +++RT+H+ L++Y+ +R G LR
Sbjct: 795 YESSSRRPVNRVTLAELLIADLGDSIH-KSPYMILRTKHDDLVLYEPYRVAGECGQSGLR 853
Query: 59 FKKLKVLFVSDRSKRANEQPGLPRGVR-----ISQMRYFSNIAGYQGVFLCGPHPAWLFL 113
F K V+ PG+ + + ++R ++ GY+ VF+ G +P ++
Sbjct: 854 FLKA----VNHVVMGPLTDPGVNQDINRCPSSCKRLRALPDVCGYKTVFMSGHNPCFILK 909
Query: 114 TSRGELRAHPMTIDG-PVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVR 172
++ R H + + G V +L+ FH C RGF Y + + +R+S LP++ +D+ W R
Sbjct: 910 SAIA--RPHVLRLRGKAVQSLSGFHIAACERGFAYVDEDNVIRMSRLPSNTRFDSGWATR 967
Query: 173 KVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDS--RFIPPLVS 230
K+ + Y ++ Y I TS E +K ED E T+ R+ F+P L
Sbjct: 968 KIAFGEQVDSIVYSSASECYVIGTSAKED----FKLP-EDDESHTEWRNEFITFLPQL-E 1021
Query: 231 QFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSED 290
+ V L P +W I + L E ++C++ + +E + + +G++ ED
Sbjct: 1022 RGTVKLLEPRNWSTI--DSHELEPAERIMCIEVIRLEISELTHERKDMVVVGSSIVKGED 1079
Query: 291 VTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA--GFLVTAVGQKI 348
+ +G I +F++I+VVPEP QP K+K+ +E KG VTA+ + GFL+ A GQK
Sbjct: 1080 IVPKGFIRVFEVIDVVPEPDQPEKSKKLKLFAKEEVKGAVTALSGIGGQGFLIVAQGQKC 1139
Query: 349 YIWQLK-DNDLTGIAFIDTEVYIASMVSVK--NLILVGDYARSIALLRYQPEYRTLSLVA 405
+ LK D L +AF DT+ Y+ + +K + ++GD + + Y E L L
Sbjct: 1140 MVRGLKEDGSLLPVAFKDTQCYVNVLKELKGTGMCIIGDAFKGLWFTGYSEEPYKLDLFG 1199
Query: 406 RDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSM 465
++ N + ++D D L + + +
Sbjct: 1200 KE--------------NENLAVVDA--------------------------DFLPDGNKL 1219
Query: 466 GFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSIS----- 520
+++D D N+ + Y PE S+ G RL++++ FH G +T + + S
Sbjct: 1220 YILVADDDCNLHVLQYDPEDPSSSKGDRLLRRSVFHTGHFASTVTLLPHGAHTTSSPVDE 1279
Query: 521 DA------PGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPR 574
DA P ++ + L + + G++ PL E +YRRLL LQ+ +V H LNPR
Sbjct: 1280 DAMDTDSPPPSKYQILMTFQT--GSIAVITPLSEDSYRRLLALQSQLVNALEHPCSLNPR 1337
Query: 575 AFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGS 619
+R + G RG+IDG+L+ ++L + + EI ++G+
Sbjct: 1338 GYRAVESDGMGG---QRGMIDGNLLLRWLDMGAQRKAEIAGRVGA 1379
>gi|317157892|ref|XP_001826637.2| protein cft1 [Aspergillus oryzae RIB40]
gi|391864317|gb|EIT73613.1| mRNA cleavage and polyadenylation factor II complex, subunit CFT1
[Aspergillus oryzae 3.042]
Length = 1389
Score = 214 bits (546), Expect = 9e-53, Method: Compositional matrix adjust.
Identities = 168/644 (26%), Positives = 300/644 (46%), Gaps = 81/644 (12%)
Query: 16 IVQELLTVSLGLH-GNRPLLLVRTQHE-LLIYQAFRHPKGAL-----KLRFKKLKVLFVS 68
++ E++ LG + P L++R++H+ L +Y+ F ++ L F K L +
Sbjct: 796 VLTEIVVADLGDSWSSFPYLIIRSRHDDLAVYRPFISITKSVGEPHADLNFLKETNLVLP 855
Query: 69 DRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDG 128
+ +Q ++ +R SNI+G+ +F G P ++ TS H + + G
Sbjct: 856 RITSGVEDQSSTEEVIKSVPLRIVSNISGFSAIFRPGVSPGFIVRTSTSS--PHFLGLKG 913
Query: 129 P-VSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHL 187
+L+ F C GF+ ++K + + +P + D PW ++++P+ LAY
Sbjct: 914 GYAQSLSKFQTSECGEGFILLDSKGVIHVCQMPLGVQLDYPWTIQQIPIGEQVDHLAYSS 973
Query: 188 ETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRD--SRFIPPLVSQFHVSLFSPFSWEEI 245
+ Y I TS +K ED EL + R+ + F P V + + + SP +W I
Sbjct: 974 SSGMYVIGTS----HRTEFKLP-EDDELHPEWRNEMTSFFPE-VQRSSLKVVSPKTWTVI 1027
Query: 246 PQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEV 305
++ L EHV+ +KN+S+E + I +GT + ED+ RG + +F++I+V
Sbjct: 1028 --DSYLLSPAEHVMAVKNMSLEISENTHERKDMIVVGTAFARGEDIASRGCVYVFEVIKV 1085
Query: 306 VPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA--GFLVTAVGQKIYIWQLK-DNDLTGIA 362
VP+P +P K++++ + KG VTA+ + GFL+ A GQK + LK D L +A
Sbjct: 1086 VPDPKRPEMDRKLRLVGKEPVKGAVTALSEIGGQGFLIVAQGQKCIVRGLKEDGSLLPVA 1145
Query: 363 FIDTEVYIASMVSVK--NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYA 420
F+D + +++ + +K + ++ D + + Y E +SL A+D
Sbjct: 1146 FMDVQCHVSVVKELKGTGMCIIADAVKGLWFAGYSEEPYKMSLFAKDL------------ 1193
Query: 421 GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFM 480
+ LE+ D L + + + +++D D N+ +
Sbjct: 1194 ----------------------DYLEVLAA------DFLPDGNKLFILVADSDCNLHVLQ 1225
Query: 481 YQPEARESNGGHRLIKKTDFHLGQHVNTFFKI-RCKPSS---ISDAPGAR-----SRFLT 531
Y PE +S+ G RL+ ++ FH G ++T + R SS ISD R
Sbjct: 1226 YDPEDPKSSNGDRLLSRSKFHTGNFISTLTLLPRTSVSSEQMISDVDAMDVDIKIPRHQM 1285
Query: 532 WYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSR 591
S +G++G + E++YRRL LQ+ + H GLNPRAFR + G R
Sbjct: 1286 LITSQNGSVGLVTCVSEESYRRLSALQSQLTNTIEHPCGLNPRAFRAVESD----GTAGR 1341
Query: 592 GIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALS 635
G++DG L++++L +S ++EI ++G+ +I D EA+S
Sbjct: 1342 GMLDGKLLFQWLDMSKQRKVEIASRVGANEWEI---KADFEAIS 1382
>gi|345566738|gb|EGX49680.1| hypothetical protein AOL_s00078g169 [Arthrobotrys oligospora ATCC
24927]
Length = 1407
Score = 214 bits (544), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 164/639 (25%), Positives = 296/639 (46%), Gaps = 74/639 (11%)
Query: 10 SAMDETIVQELLTVSLGLHGNR-PLLLVRTQHE-LLIYQAFRHPKGALKLRFKKLKVLFV 67
+A DE ++E++ LG + ++ P L+V+T+ + ++IY+ F + + FKK+ +
Sbjct: 833 TARDE--IEEIIVADLGDNISKAPYLIVKTKRDDIIIYEPFI----SNGICFKKIYNTVL 886
Query: 68 SDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTID 127
S + P P + ++ GY F+ G P ++ +S+ + + +
Sbjct: 887 PTVSLSEQKSPSGP-------LVKIDDLGGYSVAFMAGDTPTFITKSSKTLPKLYKLQ-G 938
Query: 128 GPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHL 187
G V +L+PF+ RGFLY ++K R+ P +S + W +++PL+ TP L Y+
Sbjct: 939 GMVRSLSPFNTKETERGFLYIDSKGTARVCHFP-EVSMEHTWLSQRIPLERTPTSLTYYD 997
Query: 188 ETKTYCI-VTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIP 246
Y + V ST++P D F E+ + D +P L + H+ + SP +W
Sbjct: 998 PKNVYVVSVLSTSKPEVDDEDFQMEEGLV-----DETLLPELETG-HLVMISPVTWTTTD 1051
Query: 247 QTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVV 306
+ FP+HE V+ K V +E + IA+GT E+ RG + +FD+I+VV
Sbjct: 1052 RYEFPVHEVPFVV--KAVELEISEVTKERKVLIAVGTGLLRGENSPARGAVYVFDVIDVV 1109
Query: 307 PEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLK-DNDLTGIAFID 365
PE G+P T K K+I +E KG V+ + + G+L+ GQK I LK D L +AF+D
Sbjct: 1110 PEIGKPETGKKFKLISREEVKGVVSTLAGMDGYLLITHGQKCMIRGLKEDGSLLPVAFMD 1169
Query: 366 TEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSR 425
+ +++ +++ GD + ++ + + E + L +D +
Sbjct: 1170 MNTHTTVAKTLEKMVMFGDVLKGVSFVGFSEEPYKMILFGKDPR---------------- 1213
Query: 426 GIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEA 485
QLS+ D L ++ F+++D N+ + Y PE
Sbjct: 1214 ------------QLSI------------TAGDFLPAGTACYFVVADAQSNIHVLQYDPEN 1249
Query: 486 RESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSIS------DAPGARSRFLTWYASLDGA 539
+S G+RL+ K + + G V + + K S + FL ++++ G
Sbjct: 1250 PKSIHGNRLLPKGEIYCGHEVKSICILPKKKSLFTEPDEDDMDEDEDEEFLCMFSTMTGV 1309
Query: 540 LGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLV 599
G + E YRRL ++Q + H GLNPRA+R K + + P R I+DG L+
Sbjct: 1310 FGTVSSITESMYRRLNVIQGQITNTGEHIAGLNPRAYRAAKFRN-TSSEPMRAILDGKLL 1368
Query: 600 WKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
++L L G R E+ + G+ + ++L+ ++ ++ F
Sbjct: 1369 VRWLMLGAGRRKELAGRAGTSEEMLREDLWFLQDATAFF 1407
>gi|238508528|ref|XP_002385456.1| cleavage and polyadenylation specificity factor subunit A, putative
[Aspergillus flavus NRRL3357]
gi|220688975|gb|EED45327.1| cleavage and polyadenylation specificity factor subunit A, putative
[Aspergillus flavus NRRL3357]
Length = 1204
Score = 214 bits (544), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 171/651 (26%), Positives = 301/651 (46%), Gaps = 82/651 (12%)
Query: 8 SPSAMDETIVQELLTVSLGLHGNRPLLLVRTQHE-LLIYQAFRHPKGAL-----KLRFKK 61
S SA + + Q +T+ L R L +R++H+ L +Y+ F ++ L F K
Sbjct: 606 SISATSDELAQNSMTLFLMTQDCR--LFIRSRHDDLAVYRPFISITKSVGEPHADLNFLK 663
Query: 62 LKVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRA 121
L + + +Q ++ +R SNI+G+ +F G P ++ TS
Sbjct: 664 ETNLVLPRITSGVEDQSSTEEVIKSVPLRIVSNISGFSAIFRPGVSPGFIVRTSTSS--P 721
Query: 122 HPMTIDGP-VSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTP 180
H + + G +L+ F C GF+ ++K + + +P + D PW ++++P+
Sbjct: 722 HFLGLKGGYAQSLSKFQTSECGEGFILLDSKGVIHVCQMPLGVQLDYPWTIQQIPIGEQV 781
Query: 181 HFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRD--SRFIPPLVSQFHVSLFS 238
LAY + Y I TS +K ED EL + R+ + F P V + + + S
Sbjct: 782 DHLAYSSSSGMYVIGTS----HRTEFKLP-EDDELHPEWRNEMTSFFPE-VQRSSLKVVS 835
Query: 239 PFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRIL 298
P +W I ++ L EHV+ +KN+S+E + I +GT + ED+ RG +
Sbjct: 836 PKTWTVI--DSYLLSPAEHVMAVKNMSLEISENTHERKDMIVVGTAFARGEDIASRGCVY 893
Query: 299 LFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA--GFLVTAVGQKIYIWQLK-D 355
+F++I+VVP+P +P K++++ + KG VTA+ + GFL+ A GQK + LK D
Sbjct: 894 VFEVIKVVPDPKRPEMDRKLRLVGKEPVKGAVTALSEIGGQGFLIVAQGQKCIVRGLKED 953
Query: 356 NDLTGIAFIDTEVYIASMVSVK--NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQP 413
L +AF+D + +++ + +K + ++ D + + Y E +SL A+D
Sbjct: 954 GSLLPVAFMDVQCHVSVVKELKGTGMCIIADAVKGLWFAGYSEEPYKMSLFAKDL----- 1008
Query: 414 NSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKD 473
+ LE+ D L + + + +++D D
Sbjct: 1009 -----------------------------DYLEVLAA------DFLPDGNKLFILVADSD 1033
Query: 474 KNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI-RCKPSS---ISDAPGAR--- 526
N+ + Y PE +S+ G RL+ ++ FH G ++T + R SS ISD
Sbjct: 1034 CNLHVLQYDPEDPKSSNGDRLLSRSKFHTGNFISTLTLLPRTSVSSEQMISDVDAMDVDI 1093
Query: 527 --SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGY 584
R S +G++G + E++YRRL LQ+ + H GLNPRAFR +
Sbjct: 1094 KIPRHQMLITSQNGSVGLVTCVSEESYRRLSALQSQLTNTIEHPCGLNPRAFRAVESD-- 1151
Query: 585 YAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALS 635
G RG++DG L++++L +S ++EI ++G+ +I D EA+S
Sbjct: 1152 --GTAGRGMLDGKLLFQWLDMSKQRKVEIASRVGANEWEI---KADFEAIS 1197
>gi|240277254|gb|EER40763.1| cleavage factor two protein 1 [Ajellomyces capsulatus H143]
Length = 1408
Score = 213 bits (542), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 179/648 (27%), Positives = 295/648 (45%), Gaps = 87/648 (13%)
Query: 14 ETIVQELLTVSLGLHGNR-PLLLVRTQH-ELLIYQAFRHPKGALK----LRFKKLKVLFV 67
ETI ELL LG +R P L++R+ + +L +Y+ + + K LRF K+
Sbjct: 813 ETIT-ELLVADLGDSVSRSPYLILRSSNSDLTLYEPYHYTSSTEKQFSDLRFVKIANHHF 871
Query: 68 SDRSKRANEQPGLPRGVRISQ-MRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTI 126
+N + +S+ +R ++ GY+ VF+ G P ++ +S H M +
Sbjct: 872 PKFHSESNVEKHPANCTALSKPLRVLGDVCGYRTVFMPGNSPCFIIKSSTS--IPHVMNL 929
Query: 127 DG-PVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAY 185
G V +L+ F+ C +GF+Y + + +R+ P + +D W RK+ L + Y
Sbjct: 930 RGKTVHSLSSFNIPACEKGFVYVDTDNVVRMCRFPRNTHFDGSWAARKIGLGEQVDAVEY 989
Query: 186 HLETKTYCIVTSTAEPSTDYYKFN-GEDKELVTDPRDS--RFIPPLVSQFHVSLFSPFSW 242
++TY I T+ FN ED E+ + R+ F+P + + V L +P +W
Sbjct: 990 SSSSETYVIGTNQK------VDFNLPEDDEIHPEWRNEVISFLPQ-IDKGSVKLLTPRTW 1042
Query: 243 EEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDI 302
I N L E ++C+K +++E + I +GT ED+ RG I +F++
Sbjct: 1043 SIIDSYN--LRNAERIMCVKCLNLEVSEITHERKDTIVVGTALTKGEDIAARGCIYIFEV 1100
Query: 303 IEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA--GFLVTAVGQKIYIWQLK-DNDLT 359
I+VVPE +P T K+K+I +E KG VT++ + GFL+ A GQK + LK D L
Sbjct: 1101 IKVVPEVDRPETNRKLKLIAKEEVKGAVTSLSGIGGQGFLIAAQGQKCIVRGLKEDGSLL 1160
Query: 360 GIAFIDTEVYIASMVSVK--NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKG 417
+AF+D + Y+ + +K + ++GD + + Y E LSL ++D
Sbjct: 1161 PVAFMDMQCYVNVLKELKGTGMCIMGDALKGLWFAGYSEEPYKLSLFSKD---------- 1210
Query: 418 YYAGNPSRGIIDGSL-VWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNV 476
DG+L V L G RL I +++D D N+
Sbjct: 1211 -----------DGTLQVMAADFLPDGNRLYI--------------------LVADDDCNI 1239
Query: 477 VLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFL------ 530
+ Y PE S+ G RL+ ++ F G +T + +S S P A +
Sbjct: 1240 HVLQYDPEDPGSSKGDRLLHRSTFQTGHFASTMTLLPRTATSSSQGPDADPDMMDLDSSG 1299
Query: 531 ----TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYA 586
S G++ P+ E +YRRL LQ+ + H GLNPRAFR + G
Sbjct: 1300 PLHHVLVTSETGSIALITPVSETSYRRLSALQSQLANTLEHPCGLNPRAFRAVESDGIGG 1359
Query: 587 GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEAL 634
RG++DG LV ++L L + EI ++G+ D+ + D+EA+
Sbjct: 1360 ----RGMVDGDLVKRWLDLGTQRKAEIANRVGA---DVWEIRADLEAI 1400
>gi|154285962|ref|XP_001543776.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
gi|150407417|gb|EDN02958.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
Length = 1283
Score = 212 bits (540), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 179/648 (27%), Positives = 294/648 (45%), Gaps = 87/648 (13%)
Query: 14 ETIVQELLTVSLGLHGNR-PLLLVRTQH-ELLIYQAFRHPKGALK----LRFKKLKVLFV 67
ETI ELL LG +R P L++R+ + +L++Y+ + + + LRF K+
Sbjct: 688 ETIT-ELLVADLGDSVSRSPYLILRSSNSDLILYEPYHYTSSTERQFSGLRFVKIANHHF 746
Query: 68 SDRSKRANEQPGLPRGVRISQ-MRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTI 126
+N IS+ +R ++ GY+ VF+ G P ++ +S H M +
Sbjct: 747 PKSHSESNAGKHPANCTAISKPLRVLGDVCGYRTVFMPGNSPCFIIKSSTS--IPHVMNL 804
Query: 127 DG-PVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAY 185
G V +L+ F+ C +GF+Y + + +R+ P + +D W RK+ L + Y
Sbjct: 805 RGKTVHSLSSFNIPACEKGFVYVDTDNVVRMCRFPRNTHFDGSWAARKIGLGEQVDAVEY 864
Query: 186 HLETKTYCIVTSTAEPSTDYYKFN-GEDKELVTDPRDS--RFIPPLVSQFHVSLFSPFSW 242
++TY I T+ FN ED E+ + R+ F+P + + V L +P +W
Sbjct: 865 SSSSETYVIGTNQK------VDFNLPEDDEIHPEWRNEVISFLPQ-IDKGSVKLLTPRTW 917
Query: 243 EEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDI 302
I N L E ++C+K +++E + I +GT ED+ RG I +F++
Sbjct: 918 SIIDSYN--LRTAERIMCVKCLNLEVSEITHERKDTIVVGTALTKGEDIAARGCIYIFEV 975
Query: 303 IEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA--GFLVTAVGQKIYIWQLK-DNDLT 359
IEVVPE +P T K+K+I +E KG VT++ + G L+ A GQK + LK D L
Sbjct: 976 IEVVPEVDRPETNRKLKLIAKEEVKGAVTSLSGIGGQGSLIAAQGQKCIVRGLKEDGSLL 1035
Query: 360 GIAFIDTEVYIASMVSVK--NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKG 417
+AF+D + Y+ + +K + ++GD + + Y E LSL ++D
Sbjct: 1036 PVAFMDMQCYVNVLKELKGTGMCIMGDALKGLWFAGYSEEPYKLSLFSKD---------- 1085
Query: 418 YYAGNPSRGIIDGSL-VWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNV 476
DG+L V L G RL I +++D D N+
Sbjct: 1086 -----------DGTLQVMAADFLPDGNRLYI--------------------LVADDDCNI 1114
Query: 477 VLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFL------ 530
+ Y PE S+ G RL+ ++ F G +T + +S S P A +
Sbjct: 1115 HVLQYDPEDPGSSKGDRLLHRSTFQTGHFASTMTLLPRTATSSSQRPDADPDMMDLDSSG 1174
Query: 531 ----TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYA 586
S G++ P+ E +YRRL LQ+ + H GLNPRAFR + G
Sbjct: 1175 PLHHVLVTSETGSIALITPVSETSYRRLSALQSQLTNTLEHPCGLNPRAFRAVESDGIGG 1234
Query: 587 GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEAL 634
RG++DG LV ++L L + EI ++G+ D+ + D+EA+
Sbjct: 1235 ----RGMVDGDLVKRWLDLGTQRKAEIANRVGA---DVWEIRADLEAI 1275
>gi|303321596|ref|XP_003070792.1| CPSF A subunit region family protein [Coccidioides posadasii C735
delta SOWgp]
gi|240110489|gb|EER28647.1| CPSF A subunit region family protein [Coccidioides posadasii C735
delta SOWgp]
Length = 1394
Score = 212 bits (540), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 176/630 (27%), Positives = 293/630 (46%), Gaps = 111/630 (17%)
Query: 50 HPKGAL---KLRFKKLKVLFVS--DRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLC 104
HPK +L +LRF K+ F+ D S +A +PR +R +S+I GY+ VF+
Sbjct: 826 HPKTSLDKQELRFVKIIDHFLPRFDPSPKAY----MPRS---KFLRAYSDICGYKTVFMS 878
Query: 105 GPHPAWLFLTSRGELRAHPMTIDG-PVSTLAPFHNVNCPRGFLYFNA------------- 150
G +P ++ +S H + + G VS+L+ FH C +GF Y +A
Sbjct: 879 GSNPCFVMKSSTSS--PHVLRLRGEAVSSLSSFHIPACEKGFAYVDASVCVPKQYFVPWN 936
Query: 151 ------KSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTD 204
++ +R+ LP + +D W RKV + + Y ++ Y + +S
Sbjct: 937 KLILVIQNMVRMCRLPGNTRFDNSWVTRKVHVGDQIDCVEYFAHSEIYALGSS------- 989
Query: 205 YYKFN---GEDKELVTDPRDS--RFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVL 259
+K + ED E+ + R F+P L + + L SP +W + ++ L + E V+
Sbjct: 990 -HKVDFKLPEDDEIHPEWRSEVISFMPQL-ERGCIKLLSPRTWSVV--DSYELGDAERVM 1045
Query: 260 CLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIK 319
C+K ++ME ++ + +GT ED+T RG I +F+IIEV P+P +P T K+K
Sbjct: 1046 CMKTINMEISEITHEMKDMLVVGTATVRGEDITPRGSIYVFEIIEVAPDPDRPETNRKLK 1105
Query: 320 MIYAKEQKGPVTAICHVA--GFLVTAVGQKIYIWQLK-DNDLTGIAFIDTEVYIASMVSV 376
+ + KG VTA+ + GFL+ A GQK + LK D L +AF+D + Y+ + +
Sbjct: 1106 IFAKDDVKGAVTAVSGIGGQGFLIMAQGQKCMVRGLKEDGSLLPVAFMDMQCYVKVLKEL 1165
Query: 377 K--NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVW 434
+ L ++GD + I Y E L+L +D + Q + +
Sbjct: 1166 QGTGLCIMGDALKGIWFAGYSEEPYRLTLFGKDNEYLQVIAADF---------------- 1209
Query: 435 KFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRL 494
L G+RL I +++D D + + Y PE S+ G RL
Sbjct: 1210 ----LPDGKRLYI--------------------LVADDDCTIHVLEYDPEDPTSSKGDRL 1245
Query: 495 IKKTDFHLGQHVNTFFKIRCKPSSIS-DAPGARS--------RFLTWYASLDGALGFFLP 545
+ ++ FH+G +T + SS S D PG + S +G++G P
Sbjct: 1246 LHRSSFHMGHFTSTMTLLPQHSSSPSADDPGEDDMDVDYVPKSYQVLVTSQEGSIGVVTP 1305
Query: 546 LPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQL 605
L E +YRRL LQ+ +VT H GLNP+A+R + G+ RGI+DG+L+ ++L +
Sbjct: 1306 LTEDSYRRLSALQSQLVTSMEHPCGLNPKAYRAVESDGFGG----RGIVDGNLLLRWLDM 1361
Query: 606 SLGERLEICKKIGSKHNDILDELYDIEALS 635
+ + EI ++G+ DI D+E +S
Sbjct: 1362 GVQRKAEIAGRVGA---DIESIRVDLEKIS 1388
>gi|326477251|gb|EGE01261.1| protein kinase subdomain-containing protein [Trichophyton equinum CBS
127.97]
Length = 1267
Score = 212 bits (540), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 164/620 (26%), Positives = 288/620 (46%), Gaps = 78/620 (12%)
Query: 32 PLLLVRTQHE-LLIYQAFR--HPKGALKLRF-KKLKVLFVSDRSKRANEQPGLPRGVRIS 87
P +++RT+H+ L++Y+ +R G LRF K + + + R+ +
Sbjct: 693 PYMILRTKHDDLVLYEPYRIAGESGHSGLRFLKAVNHVVMGPRTDQGVNHDINRSPSSCK 752
Query: 88 QMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDG-PVSTLAPFHNVNCPRGFL 146
+R ++ GY+ VF+ G +P ++ ++ R H + + G V +L+ FH C RGF
Sbjct: 753 LLRALPDVCGYKTVFMSGHNPCFILKSAIA--RPHVLRLRGKAVQSLSGFHIAACERGFA 810
Query: 147 YFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYY 206
Y + + +R+S LP++ +D+ W RK+ L + Y ++ Y I TS E +
Sbjct: 811 YVDEDNVIRMSRLPSNTRFDSGWATRKIALGEQVDSIVYSSASECYVIGTSAKED----F 866
Query: 207 KFNGEDKELVTDPRDS--RFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNV 264
K ED E T+ R+ F+P L + V L P +W I + L E + C++ +
Sbjct: 867 KLP-EDDESHTEWRNEFITFLPQL-ERGTVKLLEPKNWSTI--DSHELKPAERITCIEVI 922
Query: 265 SMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAK 324
+E + + +G++ ED+ +G I +F++I+VVPEP QP K+K+ +
Sbjct: 923 RLEISELTHERKDMVVVGSSIVKGEDIVPKGFIRVFEVIDVVPEPDQPEKSKKLKLFAKE 982
Query: 325 EQKGPVTAICHVA--GFLVTAVGQKIYIWQLK-DNDLTGIAFIDTEVYIASMVSVK--NL 379
E KG VTA+ + GFL+ A GQK + LK D L +AF DT+ Y+ + +K +
Sbjct: 983 EVKGAVTALSGIGGQGFLIVAQGQKCMVRGLKEDGSLLPVAFKDTQCYVNVLKELKGTGM 1042
Query: 380 ILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQL 439
++GD + + + Y E L L ++ N + ++D
Sbjct: 1043 CIIGDAFKGLWFIGYSEEPYKLDLFGKE--------------NENLAVVDA--------- 1079
Query: 440 SLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTD 499
D L + + + +++D D N+ + Y PE S+ G RL+ ++
Sbjct: 1080 -----------------DFLPDGNKLYILVADDDCNLHVLQYDPEDPSSSKGDRLLHRSV 1122
Query: 500 FHLGQHVNTFFKI---RCKPSSISDA--------PGARSRFLTWYASLDGALGFFLPLPE 548
FH G +T + PS+ D P ++ + L + + G++ PL E
Sbjct: 1123 FHTGHFASTMTLLPHGAYTPSAPVDEDAMDTDSLPPSKYQILMTFQT--GSIAVITPLSE 1180
Query: 549 KNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLG 608
+YRRLL LQ+ +V H LNPR +R + G RG+IDG+L+ ++L +
Sbjct: 1181 DSYRRLLALQSQLVNALEHPCSLNPRGYRAVESDGMGG---QRGMIDGNLLLRWLDMGAQ 1237
Query: 609 ERLEICKKIGSKHNDILDEL 628
+ EI ++G+ I +L
Sbjct: 1238 RKAEIAGRVGADVGAIRTDL 1257
>gi|281205270|gb|EFA79463.1| CPSF domain-containing protein [Polysphondylium pallidum PN500]
Length = 1395
Score = 212 bits (539), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 129/398 (32%), Positives = 221/398 (55%), Gaps = 35/398 (8%)
Query: 28 HGNRPLLLVRTQHELLIYQAFRHPKGALKLRFKKLKVLFVSDRSKRANEQ-------PGL 80
H + L+++ ++LIY+A ++ K ++ + ++ + +D++ + ++ P
Sbjct: 886 HSSPYLMILNEFGDILIYKAIKY-KDSMDNTKELIRFIKHTDQNLHSKQREYSYGIDPSS 944
Query: 81 PRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVN 140
I ++ F NI G++GVF+CG W F + LRAHPM PV++ FHN+N
Sbjct: 945 ESSFYIRKIVAFDNIGGHKGVFMCGKRSLWFF-CEKNYLRAHPMNFKDPVTSFTCFHNIN 1003
Query: 141 CPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAE 200
C GF+YF K LRI+ L ++++ W +RK+PL+ T H +++H E K Y +V S +
Sbjct: 1004 CSYGFIYFTEKGVLRINQLSNMMNFENEWAIRKIPLRMTCHKISFHQEFKCYVLVISYPQ 1063
Query: 201 -PSTDYYKFNGEDKELVTDPRDSRFIPPLV--SQFHVSLFSP-FSWEEIPQTNFPLHEWE 256
P +D + + PL+ +F V L P +W + +F + E E
Sbjct: 1064 APQSD-----------EEEEEKEKSKKPLILEEKFQVKLIDPSMNWSIVD--SFSMSEKE 1110
Query: 257 HVLCLKNVSMEYEGTLSG--LRGYIALGTNYNYSEDVTCRGRILLFDII---EVVPEPGQ 311
VLC K V ++Y + G L+ Y+ +GT Y + ED C+GRIL+F+II EV + G+
Sbjct: 1111 TVLCAKIVHLKY-ADVDGIKLKPYLCVGTAYTHGEDTVCKGRILVFEIISHREVQDDTGE 1169
Query: 312 PLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIA 371
K ++ ++Y K+QKGPVTA+ + G L+ ++G K+ + L GIAF DT+++I
Sbjct: 1170 E--KKRLNLLYEKDQKGPVTALAGLNGLLLMSIGPKLIVNNFSSGSLVGIAFYDTQIFIV 1227
Query: 372 SMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYK 409
S+ +VKN ILVGD +S++ + + + + L L+ +DY+
Sbjct: 1228 SLSTVKNYILVGDMYKSVSFFKLKDQ-KQLILLGKDYE 1264
>gi|350633238|gb|EHA21604.1| hypothetical protein ASPNIDRAFT_51242 [Aspergillus niger ATCC 1015]
Length = 1406
Score = 211 bits (537), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 165/596 (27%), Positives = 278/596 (46%), Gaps = 88/596 (14%)
Query: 71 SKRANEQ-PGLPRGVRISQ----------MRYFSNIAGYQGVFLCGPHPAWLFLTSRGEL 119
SK N P +P GV +Q +R +I+G VF+ G ++ TS
Sbjct: 861 SKETNSVLPRIPPGVSSTQPSGSDYRARPLRILPDISGLSAVFMPGASAGFIIRTSASA- 919
Query: 120 RAHPMTIDGPVS-TLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKC 178
H + + G S +++ C +GF+Y +++S +R LP +D W +++V L
Sbjct: 920 -PHFLRLRGENSRSVSSLDTPECSKGFIYLDSQSTVRFCKLPPMTRFDYQWTLKRVHLGE 978
Query: 179 TPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDS-----RFIPPLVSQFH 233
LAY + Y + T A TD+ ED EL + R+ F P F
Sbjct: 979 QVDHLAYSTSSGMYVLGTCHA---TDFKL--PEDDELHPEWRNEDCLAISFFPSARGSF- 1032
Query: 234 VSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTC 293
+ L SP +W I +F L E+V+ +KN+S+E + I +GT + ED+
Sbjct: 1033 IKLVSPNTWSII--DSFSLGADEYVMAIKNISLEVSENTHERKDMIVVGTAFARGEDIPS 1090
Query: 294 RGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA--GFLVTAVGQKIYIW 351
RG I +F++++VVP+P P T K+K+I + KG VTA+ + GF++ A GQK +
Sbjct: 1091 RGCIYVFEVVQVVPDPDHPETDRKLKLIGKEPVKGAVTALSEIGGQGFVLVAQGQKCMVR 1150
Query: 352 QLK-DNDLTGIAFIDTEVYIASMVSVK--NLILVGDYARSIALLRYQPEYRTLSLVARDY 408
LK D L +AF+D + Y++ + +K + ++GD + + Y E +SL A+D
Sbjct: 1151 GLKEDGSLLPVAFMDMQCYVSVVKELKGTGMCILGDAVKGVWFAGYSEEPYKMSLFAKDL 1210
Query: 409 KPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFM 468
+ LE+C + L + + +
Sbjct: 1211 ----------------------------------DYLEVCAA------EFLPDGKRLFIV 1230
Query: 469 ISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI-RCKPSS---ISDAPG 524
++D D N+ + Y PE +S+ G RL+ ++ FH+G +T + R SS +S + G
Sbjct: 1231 VADSDCNIHVLQYDPEDPKSSNGDRLLSRSKFHMGNFASTLTLLPRTMVSSEKMVSSSDG 1290
Query: 525 -----ARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTY 579
+ +G+LG +PE++YRRL LQ+ + H GLNPRAFR
Sbjct: 1291 MDIDNQSPLHQVLMTTQNGSLGLITCIPEESYRRLSALQSQLTNTLEHPCGLNPRAFRAV 1350
Query: 580 KGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALS 635
+ G RG++DG+L++K++ +S + EI ++G++ +I D+EA+S
Sbjct: 1351 ESD----GTAGRGMLDGNLLFKWIDMSKQRKTEIAGRVGAREWEI---KADLEAIS 1399
>gi|121719617|ref|XP_001276507.1| cleavage and polyadenylation specificity factor subunit A, putative
[Aspergillus clavatus NRRL 1]
gi|148886827|sp|A1C3U1.1|CFT1_ASPCL RecName: Full=Protein cft1; AltName: Full=Cleavage factor two protein
1
gi|119404719|gb|EAW15081.1| cleavage and polyadenylation specificity factor subunit A, putative
[Aspergillus clavatus NRRL 1]
Length = 1401
Score = 210 bits (535), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 182/650 (28%), Positives = 306/650 (47%), Gaps = 103/650 (15%)
Query: 16 IVQELLTVSLGLHGN-RPLLLVRTQHE-LLIYQAF----RHPKGALKLRFKK-----LKV 64
I+ E + +LG N P L++RT ++ L+IY+ F LRF K L
Sbjct: 807 ILSEAIVANLGDSWNPLPHLILRTDNDDLVIYKPFISSVEEDGDPHCLRFVKETNHVLPR 866
Query: 65 LFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRG-----EL 119
+ + ++++P R + I +I+GY VF+ G +++F TSR L
Sbjct: 867 IPPDSDTNISDKEPSNHRPLCI-----LPDISGYSAVFMPGTSASFIFKTSRSCPHILRL 921
Query: 120 RAHPMTIDGPVSTLAPFH--NVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLK 177
R G V +L+ F + + RGF+Y ++K +RI LP YD W ++KV +
Sbjct: 922 RG------GVVRSLSDFDFTDPSLGRGFIYVDSKDVVRICQLPPETIYDYSWTLKKVAIG 975
Query: 178 CTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDS--RFIPPLVSQFHVS 235
LAY + ++TY + TS S D+ ED EL + R+ F+P L Q +
Sbjct: 976 EHVDHLAYSISSETYVLGTSH---SADFKL--PEDDELHPEWRNEAISFLPEL-RQCCLK 1029
Query: 236 LFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRG 295
+ P +W I ++ L E ++ +KN+++E + I +GT ED+ RG
Sbjct: 1030 VVHPKTWTVI--DSYTLGPDEEIMAVKNMNLEVSENTHERKNMIVVGTALARGEDIPARG 1087
Query: 296 RILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA--GFLVTAVGQKIYIWQL 353
I +F++I+VVP+P +P T K+K+I + KG VTA+ + GFL+ A GQK + L
Sbjct: 1088 CIYVFEVIKVVPDPEKPETDRKLKLIGKELVKGAVTALSEIGGQGFLIAAQGQKCMVRGL 1147
Query: 354 K-DNDLTGIAFIDTEVYIASMVSVK--NLILVGDYARSIALLRYQPEYRTLSLVARDYKP 410
K D L +AF+D + Y+ + +K + +VGD + I Y E +SL +D
Sbjct: 1148 KEDGSLLPVAFMDVQCYVNVLKELKGTGMCIVGDAFKGIWFAGYSEEPYKMSLFGKD--- 1204
Query: 411 TQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMIS 470
LE + + + D L + + +++
Sbjct: 1205 ----------------------------------LEYPEVVAA---DFLPDGDKLFILVA 1227
Query: 471 DKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF---------FKIRCKPSSISD 521
D D N+ + Y+PE S+ G +L+ ++ FH+G +T ++I PS+ SD
Sbjct: 1228 DSDCNLHVLQYEPEDPMSSNGDKLLVRSKFHMGHFTSTLTLLPRTTASYEI---PSADSD 1284
Query: 522 APGARSRFL---TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRT 578
+ R S G++G +PE++YRRL LQ+ + H GLNPRA+R
Sbjct: 1285 SMEVDPRITPQQVLITSQSGSIGIVTSIPEESYRRLSALQSQLANTVEHPCGLNPRAYRA 1344
Query: 579 YKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEL 628
+ G RG++DG+L++++L +S R+EI ++G+ +I +L
Sbjct: 1345 IESD----GTAGRGMLDGNLLYQWLSMSKQRRMEIAARVGAHEWEIKADL 1390
Score = 42.0 bits (97), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 27/98 (27%), Positives = 50/98 (51%), Gaps = 11/98 (11%)
Query: 380 ILVGDYARSIALLRYQPE--YRTLSLVARDYK-----PTQPNSKGYYA----GNPSRGII 428
+L+ + SI ++ PE YR LS + P N + Y A G RG++
Sbjct: 1297 VLITSQSGSIGIVTSIPEESYRRLSALQSQLANTVEHPCGLNPRAYRAIESDGTAGRGML 1356
Query: 429 DGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMG 466
DG+L++++L +S R+EI ++G+ +I + ++G
Sbjct: 1357 DGNLLYQWLSMSKQRRMEIAARVGAHEWEIKADLEAVG 1394
>gi|441648592|ref|XP_004093268.1| PREDICTED: LOW QUALITY PROTEIN: cleavage and polyadenylation
specificity factor subunit 1 [Nomascus leucogenys]
Length = 1177
Score = 210 bits (535), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 110/225 (48%), Positives = 141/225 (62%), Gaps = 19/225 (8%)
Query: 16 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGALKLRFKKL--------- 62
+V+E+L V+LG +RP LLV ELLIY+AF H +G LK+RFKK+
Sbjct: 763 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKVPHNINFREK 822
Query: 63 -KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRA 121
E+ RG R+++ RYF +I GY GVF+CGP P WL +T RG LR
Sbjct: 823 KPKPSKKKAEGGGTEEGAGARG-RVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRL 881
Query: 122 HPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPH 181
HPM IDGPV + APFHNVNCPRGFLYFN + ELRISVLP +LSYDAPWPV K+PL+CT H
Sbjct: 882 HPMAIDGPVDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVXKIPLRCTAH 941
Query: 182 FLAYHLETKTYC---IVTSTAEPSTDYYKFNGEDKELVTDPRDSR 223
++AYH+E+K C I+ + S ++ E K L RD++
Sbjct: 942 YVAYHVESKV-CPNFILAADVMKSISLLRYQEESKTLSLVSRDAK 985
Score = 194 bits (494), Expect = 8e-47, Method: Compositional matrix adjust.
Identities = 106/279 (37%), Positives = 157/279 (56%), Gaps = 46/279 (16%)
Query: 366 TEVYIASMVSVK---NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGN 422
T Y+A V K N IL D +SI+LLRYQ E +TLSLV+RD KP + S + N
Sbjct: 939 TAHYVAYHVESKVCPNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDN 998
Query: 423 PSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQ 482
+ +GF++SD+D+N++++MY
Sbjct: 999 ----------------------------------------AQLGFLVSDRDRNLMVYMYL 1018
Query: 483 PEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGA---RSRFLTWYASLDGA 539
PEA+ES GG RL+++ DFH+G HVNTF++ C+ ++ + + ++ +TW+A+LDG
Sbjct: 1019 PEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGATEGLSKKSVVWENKHITWFATLDGG 1078
Query: 540 LGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLV 599
+G LP+ EK YRRLLMLQN + T H GLNPRAFR N R ++DG L+
Sbjct: 1079 IGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELL 1138
Query: 600 WKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
++L LS ER E+ KKIG+ + ILD+L + + +++HF
Sbjct: 1139 NRYLYLSTMERSELAKKIGTTPDIILDDLLETDRVTAHF 1177
>gi|396471273|ref|XP_003838832.1| similar to cleavage and polyadenylation specificity factor subunit A
[Leptosphaeria maculans JN3]
gi|312215401|emb|CBX95353.1| similar to cleavage and polyadenylation specificity factor subunit A
[Leptosphaeria maculans JN3]
Length = 1402
Score = 210 bits (535), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 185/648 (28%), Positives = 282/648 (43%), Gaps = 101/648 (15%)
Query: 10 SAMDETIVQELLTVSLGLHGNR-PLLLVRTQ-HELLIYQAFRHP-KGALKLRFKKLK-VL 65
SA TI E+L LG R P L++RT +L+IY+AF P + A L K L+ +
Sbjct: 793 SAAKATIT-EILAADLGDVTTRSPHLIIRTSSDDLVIYKAFHFPSRSAADLWTKNLRWIK 851
Query: 66 FVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMT 125
R E G S + ++ GY VF G P+++F + R ++
Sbjct: 852 LAQQHVPRYVEDAGSEDAGVESTLLALDDVCGYSTVFQRGASPSFIFKEASSSPRVIGLS 911
Query: 126 IDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLP--THLSYDAPWPVRKVPLKCTPHFL 183
PV L FH +C RGF Y ++ LRIS LP TH + W R++P+ + L
Sbjct: 912 -GKPVKGLTTFHTSSCERGFAYVDSTDTLRISQLPSRTHFGHLG-WATRRLPMDAEVYAL 969
Query: 184 AYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRF---------IPPLVSQFHV 234
AYH P+ Y G+ ++ V DP ++ P V + +
Sbjct: 970 AYH--------------PAGLYVVGTGQPEDFVLDPSETYHYELPKEDISFKPSVERGVI 1015
Query: 235 SLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCR 294
L +W I F E VLC+K +++E T + IA+GT+ + ED+ +
Sbjct: 1016 KLIDEGTWSIIDTHVFDPQEV--VLCIKALNLEVSETTHQRKDLIAVGTSIVHGEDLATK 1073
Query: 295 GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA--GFLVTAVGQKIYIWQ 352
G I +F++I VVPEP +P T ++K+I E KG V+AI + GFL+ A GQK +
Sbjct: 1074 GCIRIFEVITVVPEPDRPETNKRLKLIVKDEVKGAVSAISELGTQGFLIMAQGQKCMVRG 1133
Query: 353 LK-DNDLTGIAFIDTEVYIASMVSVKN--LILVGDYARSIALLRYQPEYRTLSLVARDYK 409
LK D L +AF+D + Y+ ++ ++ N ++L+GD R + Y E +SL R
Sbjct: 1134 LKEDGTLLPVAFMDMQCYVTTLKTLPNTGMLLMGDAYRGVWFTGYTEEPYKMSLFGRSKH 1193
Query: 410 PTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMI 469
+ ++ +FL + G H ++
Sbjct: 1194 NLE------------------AMAVEFLPFN-----------GELH-----------IIV 1213
Query: 470 SDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLG----------------QHVNTFFKIR 513
+D D N+ + + PE +S G RL+ K FH G + +TF
Sbjct: 1214 ADADMNIQVLQFDPENPKSEGS-RLLHKATFHTGHFPTTTHLLQSHLQMPESASTFGTTD 1272
Query: 514 C-KPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 572
P S AP + L S G L PL E +YRRL L ++ GLN
Sbjct: 1273 TFAPDSTPSAPLPLHQVL--ITSQSGTLALITPLSESSYRRLSNLAAYLINTLESPCGLN 1330
Query: 573 PRAFRTYKG--KGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 618
P AFR +G G+ AG +RG++DG L+ ++ +L R E K G
Sbjct: 1331 PVAFRAGEGVEGGWDAGGGARGVLDGGLLMRWGELGEQRRKEGLAKYG 1378
>gi|407929511|gb|EKG22329.1| Cleavage/polyadenylation specificity factor A subunit [Macrophomina
phaseolina MS6]
Length = 1418
Score = 210 bits (534), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 184/663 (27%), Positives = 283/663 (42%), Gaps = 115/663 (17%)
Query: 5 RSHSPSAMDETIVQELLTVSLGLHGNRPLLLVRTQ-HELLIYQAFRHPK-GALKLRFKKL 62
RS S +A+ E IV EL + P L+VRT ++L+IYQ + P +K F+ L
Sbjct: 801 RSSSKAALTEVIVAELGDSTY----KTPYLIVRTSSNDLVIYQPYHFPAHEVVKPFFENL 856
Query: 63 KVLFVSD-RSKRANEQPGLPR---GV-RISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRG 117
+ L + R +E+P L G+ + S + +N+ GY VF+ G P+++ S
Sbjct: 857 RWLKIPQPRLPEFSEEPALESEDTGIGKESILTTIANVGGYSAVFMAGTSPSFILKESSS 916
Query: 118 ELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSY-DAPWPVRKVPL 176
R M V L+ FH C RGF Y NA LR+ LP Y DA W V+K+ +
Sbjct: 917 LPRVIKMRTKS-VKNLSSFHRAECDRGFAYINADGNLRVCQLPRGYRYGDAGWAVKKISI 975
Query: 177 KCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRF---------IPP 227
+ YH P D DK+ T P D P
Sbjct: 976 NQDVQAMCYH--------------PPKDVLVLGVGDKKPFTLPEDEHHHEWLEENITFKP 1021
Query: 228 LVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNY 287
+V Q + + S I + L +E VL +K +++E + +A+GT +
Sbjct: 1022 MVEQGMIKVLDTQSLAVI--DTYELEAFEVVLTIKVLNLEVSENTHERKQLVAVGTGFIR 1079
Query: 288 SEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA--GFLVTAVG 345
ED+ RG I +F++I VVPEPG+P T ++K+I +E +G VTAI V GFL+ A G
Sbjct: 1080 GEDLPSRGCIYVFEVINVVPEPGRPETNRRLKLIAKEEVRGSVTAITDVGSQGFLLMAQG 1139
Query: 346 QKIYIWQLK-DNDLTGIAFIDTEVY--IASMVSVKNLILVGDYARSIALLRYQPEYRTLS 402
QK + LK D L +AF+D + Y +A ++ ++L+GD A+ + Y + +
Sbjct: 1140 QKCMVRGLKEDGTLLPVAFMDMQCYVTVAKELNGSGMLLMGDAAKGAWFVGYTEDPYKMI 1199
Query: 403 LVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEF 462
L + SR ++E+ D L
Sbjct: 1200 LFGK-----------------SR-----------------SKMEVMAA------DFLPHD 1219
Query: 463 SSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKT------------------------ 498
+ M++D D N+ Y P+ +S G RL+ K+
Sbjct: 1220 KQLYLMVADGDCNLHALQYDPDHPKSLSGQRLLHKSTFHTGHFTTTMTLLPSSLSPTVSP 1279
Query: 499 ---DFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLL 555
D H HV+ I AP + + + G+L PL E+ YRRL
Sbjct: 1280 SSADEHANGHVSPSPSPENDAMDIDPAPAGTVQHI-LLTTQTGSLALLTPLSEQQYRRLG 1338
Query: 556 MLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICK 615
LQ ++ H GLNPRA+R + +G+ SRGI+DG+L+ ++ +L R E
Sbjct: 1339 ALQTYLIGALEHWCGLNPRAYRAVESEGF----GSRGIVDGALLARWCELGSQRRAEGAA 1394
Query: 616 KIG 618
K+G
Sbjct: 1395 KVG 1397
>gi|451849663|gb|EMD62966.1| hypothetical protein COCSADRAFT_92785 [Cochliobolus sativus ND90Pr]
Length = 1405
Score = 209 bits (532), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 181/643 (28%), Positives = 284/643 (44%), Gaps = 83/643 (12%)
Query: 10 SAMDETIVQELLTVSLGLHGNR-PLLLVRTQHE-LLIYQAFRHP-KGALKLRFKKLKVLF 66
SA+ TI E+L LG + P L++RT + ++IY+AF P + A L K L+ +
Sbjct: 788 SAIKATIT-EILAADLGDATTKSPHLIIRTSSDNIVIYKAFHSPSRSAADLWTKNLRWVK 846
Query: 67 VSDRS-KRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMT 125
+S + R E G S + S+I GY VF G PA++F S R ++
Sbjct: 847 LSQQHIPRYTEDGGAEDSGFESTLLALSDIGGYSTVFQRGTTPAFIFKESSSAPRVIGLS 906
Query: 126 IDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYD-APWPVRKVPLKCTPHFLA 184
PV +L FH +C RGF Y ++ LRIS LP Y W R++P+ H LA
Sbjct: 907 -GKPVKSLTSFHTSSCQRGFAYLDSTDTLRISQLPPQTHYGHLGWATRRMPMDAEIHALA 965
Query: 185 YHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEE 244
YH + + +P + Y+ + + P++ P + + + L +W
Sbjct: 966 YH---SSGLYIIGAGQP--EEYQLDPSETYHYELPKEDMSFKPTIERGIIQLLDEKTWAI 1020
Query: 245 IPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIE 304
I L E VL +K +++E + IA+GT + ED+ +G I +F++I
Sbjct: 1021 I--DTHVLDPQEVVLSIKTLNLEVSENTHQRKDLIAVGTAILHGEDLATKGCIRIFEVIT 1078
Query: 305 VVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA--GFLVTAVGQKIYIWQLK-DNDLTGI 361
VVPEP +P T ++K+I E KG V+AI + GF++ A GQK + LK D L +
Sbjct: 1079 VVPEPDRPETNKRLKLIVKDEVKGAVSAISELGTQGFMIMAQGQKCMVRGLKEDGTLLPV 1138
Query: 362 AFIDTEVYIASMVSV--KNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYY 419
AF+D + Y++ + ++ ++ + D R + Y E +SL AR +
Sbjct: 1139 AFMDMQCYVSDLKNLPGTGMLAMSDAYRGVWFTGYTEEPYRMSLFARSKHSLE------- 1191
Query: 420 AGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLF 479
++ F+ E+L + +++D D N+ +
Sbjct: 1192 -----------AIAVDFIPFE--EQLHL--------------------LVADADMNLQVL 1218
Query: 480 MYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI--RCKPSSISDAPGARSR----FLTWY 533
+ P+ +S G RL+ K+ FH G T + R K S SD GA + F
Sbjct: 1219 QFDPDNPKSEAGSRLLHKSTFHTGHFPATLHVVHSRLKMPSASDFAGANNTENGDFEMDT 1278
Query: 534 ASLD----------------GALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFR 577
+S D G L PL E YRRL L + T GLNPRAFR
Sbjct: 1279 SSPDDKATQPLHQILCTTQSGTLALVTPLSEDTYRRLSNLSAYLSNTLDATAGLNPRAFR 1338
Query: 578 TYK--GKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 618
G+ AG +RG++DG+L+ ++ +L R E K G
Sbjct: 1339 ASDTPDGGWDAGTGARGMLDGNLLMRWGELGERGRREGLAKYG 1381
>gi|390599704|gb|EIN09100.1| hypothetical protein PUNSTDRAFT_67240 [Punctularia strigosozonata
HHB-11173 SS5]
Length = 1439
Score = 209 bits (532), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 164/645 (25%), Positives = 290/645 (44%), Gaps = 74/645 (11%)
Query: 7 HSPSAMDETIVQELLTVSLGLHGNRP-LLLVRTQHELLIYQAF---------RHPKGALK 56
SP E V++ + LG +P LLL +L IYQA + +L
Sbjct: 845 ESPRRPQELDVEQAVIAPLGETAPQPHLLLFLRSGQLAIYQAIPMQASSVDESLSRPSLG 904
Query: 57 LRFKKLKVLFVSDRSKRANEQPGLPRGVRISQ-----MRYFSNIAGYQGVFLCGPHPAWL 111
+RF K+ + + +E+ L +IS+ + S + GVF G HP W+
Sbjct: 905 VRFAKVATRVFEIQRQDDSEKSILAEQKKISRVLIPFLTSPSPTTTFSGVFFTGDHPCWI 964
Query: 112 FLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPV 171
R +R HP + V FL ++ + + +P + P
Sbjct: 965 LKPDRSGIRIHP-SGHSVVHAFTSCSLWESKGDFLLYSDEGPSLLEWMP-DTDVETELPS 1022
Query: 172 RKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQ 231
R +P + + + T ++ + A ++ ++ ED +V +P + P S
Sbjct: 1023 RSIPQPRSYSKVTFDASTG---LIVAAAHLEAEFATYD-EDNNIVWEPDSANVSFPRSSC 1078
Query: 232 FHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDV 291
+ L SP W I F E V +++V +E T SG + +IA+GT + ED+
Sbjct: 1079 STLELISPDEW--ITMDGFEFANNEFVTSVESVPLETSSTESGSKDFIAVGTTIDRGEDL 1136
Query: 292 TCRGRILLFDIIEVVPEPGQPLTKN-KIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYI 350
RG +F+I+EVVP L++ K+++ + KGPVTA+C + G+LV+++GQKI++
Sbjct: 1137 AVRGTTYVFEIVEVVPPENSSLSRWWKLRLRCRDDAKGPVTALCAMDGYLVSSMGQKIFV 1196
Query: 351 WQLK-DNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYK 409
D L G+AF+D VY+ ++ +VKNL+++GD A+S+ + +Q + L ++A+D++
Sbjct: 1197 RAFDMDERLVGVAFLDVGVYVTTLRAVKNLLVIGDAAKSVWFVGFQEDPYKLVILAKDFQ 1256
Query: 410 PTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMI 469
+C D + SM +
Sbjct: 1257 ------------------------------------TVCVTTA----DFIFTEDSMSILT 1276
Query: 470 SDKDKNVVLFMYQPEARESNGGHRLIKKTDF--HLGQHVNTFFKIRCKPSSISDAPGARS 527
+D++ + L+ Y P+ +S G +L+ +T+F H + F R + P A+
Sbjct: 1277 NDENGVMRLYQYDPQDPDSRNGQQLMCRTEFDTHTTCQTSIVFARRVGEGEEAALPQAK- 1335
Query: 528 RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAG 587
S+DG+L + E ++RL +LQ + + H GLNP+AFR + Y
Sbjct: 1336 ---VVAGSIDGSLAALTCMDEPAFKRLQLLQGQLTRNIQHVAGLNPKAFRIVRND--YVS 1390
Query: 588 NP-SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI 631
P S+GI+DG+L+ +L+L + + EI K+I ++ +L + I
Sbjct: 1391 KPLSKGILDGNLLSSYLELPIPRQEEITKQIATERAAVLRDWTSI 1435
>gi|326432241|gb|EGD77811.1| hypothetical protein PTSG_08901 [Salpingoeca sp. ATCC 50818]
Length = 1506
Score = 209 bits (531), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 163/639 (25%), Positives = 302/639 (47%), Gaps = 57/639 (8%)
Query: 14 ETIVQELLTVSLGLHGNRPLLLVR--TQHELLIYQAF-----RHPK--GALKLRFKKLKV 64
E + ELL + LG G+RP L +R TQH +++Y+ F RH K G L++R +K
Sbjct: 899 EMTIVELLAIGLG-RGSRPHLFLRNETQH-VIVYEIFTSSYKRHEKYEGRLQIRLRKRHQ 956
Query: 65 --LFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLT-SRGELRA 121
++ +R +++ P + R F++I+G GVF+C P+W + +R
Sbjct: 957 HPTWIDERLAQSSSIPP-------AAFRPFADISGCDGVFVCARRPSWFMCDHTHKVVRH 1009
Query: 122 HPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPH 181
H M DG V + FLYF K +R++ P P R+ P+K +
Sbjct: 1010 HAMRFDGAVQCFTQLKHAMHTSCFLYFTGKGVMRMATTAAGQVLSTPLPSRRTPIKASAC 1069
Query: 182 FLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKEL-VTDPRDSRFIP-PLVSQFHVSLFSP 239
++ + E+ Y +V EP KF +E D + + P P ++ + LFS
Sbjct: 1070 YVDFDPESGVYVVVLKHKEPCAHLPKFGPPMEEAPAVDMKFASDEPLPQRERYSICLFSC 1129
Query: 240 FSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILL 299
W+ +P + + HV K +++ E L+G + +A+GT E RG + L
Sbjct: 1130 EDWQLVPNSPVEIPADHHVTAFKVINISSERHLTGKKPCVAVGTTPVLGERNLERGLLQL 1189
Query: 300 FDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAV----GQKIYIWQLKD 355
+D++EVVPEPG+P TKN++K++ + ++ G VTA+ + G+++ A+ G KI++W+++D
Sbjct: 1190 YDVLEVVPEPGKPTTKNRLKLMLSSDETGAVTALNSIEGYVIGALARRDGPKIFVWRVED 1249
Query: 356 ND-LTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPN 414
++ L IAF++ ++ ++ N +++GDY + L R + TL ++
Sbjct: 1250 DEKLQPIAFLEGSMFTVTLKVALNFVIIGDYMGRVMLARLIKD-ETLKIL---------- 1298
Query: 415 SKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDK 474
N S+G +L L +G + + D + + + + D+
Sbjct: 1299 -------NLSKGTTSQAL------LQVGRDVAPTSVYAA---DFIVRGAELHVLFLDQHA 1342
Query: 475 NVVLFMYQPEARESNGGHRLIKKTDFHLG-QHVNTFFKIRCKPSSISDAPGARSRFLTWY 533
N+ + + + + GG L + + ++ G Q + +++ P S + FLT Y
Sbjct: 1343 NMTILAFDSDDPTTRGGRILKRHSVYNTGHQRIVALTRLQNVPPRNSRNATVDAHFLT-Y 1401
Query: 534 ASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGI 593
+L+G G+ +PE +RRL++LQ ++ H GL+P AF+ YK + +
Sbjct: 1402 QTLEGGAGYITSIPEDIFRRLMLLQLRLLPHLKFRAGLHPSAFKKYKSASLHMVHQEVRT 1461
Query: 594 IDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIE 632
I + + L L + E+ +++G+ + D+ IE
Sbjct: 1462 ICADVYTRLFMLDLDAQKEVARQVGTTTKQLCDDFLFIE 1500
>gi|403411348|emb|CCL98048.1| predicted protein [Fibroporia radiculosa]
Length = 1437
Score = 208 bits (529), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 172/639 (26%), Positives = 288/639 (45%), Gaps = 76/639 (11%)
Query: 9 PSAMDETIVQELLTVSLGLHGNRP-LLLVRTQHELLIYQAFRHPKGALKL---RFKKLKV 64
P E +++LL LG RP L+L +L +Y+ P A L R L V
Sbjct: 847 PRKPQELDIEQLLVAPLGESSPRPHLMLFLRSGQLAVYEVHSTPVPAEPLPAARSSTLLV 906
Query: 65 LFVSDRSKRAN-------EQPGLPRGVRISQMRY-FSNIAG----YQGVFLCGPHPAWLF 112
FV S+ N E+ L RIS + F+ + GVFL G P+WL
Sbjct: 907 KFVKVLSRAFNIQHSDEVEKSVLAEQKRISHLLIPFATSPSPGQTFSGVFLTGDRPSWLL 966
Query: 113 LTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVR 172
T +G ++ P + V FL ++ + + LP + D P R
Sbjct: 967 CTDKGGVKVLP-SGHSVVHAFTASSVWESKNDFLLYSEEGPSLMEWLP-DVQLDGHLPSR 1024
Query: 173 KVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQF 232
VP + + Y + T IV ++++ S + ED +V +P D+ P
Sbjct: 1025 SVPRPRSYSNVVY--DPSTSLIVAASSQQSK--FASYDEDGNIVWEP-DTNISFPSCECS 1079
Query: 233 HVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVT 292
+ L SP W + + + E V CL +++E T +G + +IA+GT N ED+
Sbjct: 1080 ALELISPEGW--VTMDGYEFAQNEFVNCLDCITLETMSTETGTKDFIAVGTTINRGEDLA 1137
Query: 293 CRGRILLFDIIEVVPEPGQPLTK-NKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIW 351
+G + +F+I+EVVP+ L + ++K+ + KGPVTA+C + +LV+++GQKI++
Sbjct: 1138 VKGAVYIFEIVEVVPDTNSGLKRLYRLKLQCRDDAKGPVTALCGMDNYLVSSMGQKIFVR 1197
Query: 352 QLK-DNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARD-YK 409
D L G+AF+D V++ S+ SVKNL+++GD +S+ + +Q + L ++ +D Y
Sbjct: 1198 AFDLDERLVGVAFLDVGVFVTSLRSVKNLLVIGDAVKSVWFVAFQEDPYKLVILGKDPYH 1257
Query: 410 PTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMI 469
+ ++A N + ++
Sbjct: 1258 TCVTCADLFFAEN-----------------------------------------RVSLLV 1276
Query: 470 SDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRF 529
D+D + L Y P ES GG L+++T+FH T I + D P A+
Sbjct: 1277 CDEDGVIRLLEYDPHDPESRGGQHLLRRTEFHGQTEYRTSVLIARRKDKDIDIPQAK--- 1333
Query: 530 LTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNP 589
S DG+L F + E ++ L +LQ + + H GLNPRAFR + Y P
Sbjct: 1334 -LVCGSTDGSLVSFTFVEEAAFKGLHLLQGQLTRNVQHVAGLNPRAFRIVRND--YVSRP 1390
Query: 590 -SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDE 627
S+GI+DG+L+ F +L + + E+ ++IG++ +L +
Sbjct: 1391 LSKGILDGNLLTTFEELPIARQNEMTRQIGTERATVLKD 1429
>gi|255075065|ref|XP_002501207.1| predicted protein [Micromonas sp. RCC299]
gi|226516471|gb|ACO62465.1| predicted protein [Micromonas sp. RCC299]
Length = 1423
Score = 205 bits (522), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 164/602 (27%), Positives = 278/602 (46%), Gaps = 86/602 (14%)
Query: 30 NRPLLL-VRTQHELLIYQAFRHPKGA--------LKLRFKKLKVLFVSDRSKRANEQPGL 80
RP+L +R +L+Y+AF P GA +LRF ++ + + +
Sbjct: 837 ERPMLTALRGDGSVLVYRAFLCPPGAGNVGHEAKPQLRFCRVPIELEGGGGGMVDTKA-- 894
Query: 81 PRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVS-TLAPFHNV 139
G R+++ + G +GVF+ GP P WL L R + A P+ + + + PFHNV
Sbjct: 895 LSGSRLTRFERVGDRGGIRGVFVSGPRPLWL-LVRRSRVLALPIRGEAQRTVSFTPFHNV 953
Query: 140 NCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTA 199
NC GF+ A +RI +P + Y+A WPVRK+ L+CTPH + Y + + Y + TS
Sbjct: 954 NCLNGFMLGTAAGGVRICQIPGRMHYEAAWPVRKLALRCTPHHVQYLPDFRLYALSTSAP 1013
Query: 200 EPSTDYYKFNGED---KELVTDPRDSRFIPPLVSQ-FHVSLFSPFSWEEIPQTNFPLHEW 255
D ++ N +D L+ + + V Q F + L P + E Q + +
Sbjct: 1014 VKWKD-HEVNEDDIHLSTLIKVRKANAMAKGGVEQVFSLRLLVPGTLECAWQ--YTVDPG 1070
Query: 256 EHVLCLKNVSMEYEGTLSG-LRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLT 314
EHV ++NV + T++G L+ + +GT ED CRGR+L+F+++ + + G T
Sbjct: 1071 EHVQSIRNVQL--RNTMTGALQSMLVVGTALPGGEDAPCRGRVLIFEVVWQMTDRG---T 1125
Query: 315 KNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMV 374
K + +++ ++ K TA+ V G L A+G K+ + + L +AF DT ++ +M
Sbjct: 1126 KWQGQLVCVRDAKMACTALEGVGGHLAVAIGTKLIVHSWDGHSLMPVAFFDTPLHTVTMN 1185
Query: 375 SVKNLILVGDYARSIALLRYQ--PEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSL 432
VKN IL+GD + R++ P+ + L +A+D+
Sbjct: 1186 VVKNFILLGDIQKGAFFFRWKDTPDEKLLVQMAKDF------------------------ 1221
Query: 433 VWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGH 492
E ++I + L + S++ + +D N +F Y P++ ES G
Sbjct: 1222 ----------EGMDILA------TEFLVDGSTLSMLTTDMTGNAFIFSYDPKSLESWKGQ 1265
Query: 493 RLIKKTDFHLGQHVNTFFKIRCKPSSISDAPG----------ARSRFLTWYASLDGALGF 542
+L+ K FH+G V+ + R K + + APG +R ++ +LDG+LG
Sbjct: 1266 KLLTKGAFHVGSPVHRMVRFRLK--APTAAPGQTISPAEQKAQANRHAVFFGTLDGSLGI 1323
Query: 543 FLPLPEKNYRRLLMLQNVMVTHTSHT--GGLNPRAFR---TYKGKGYYAGNPSRGIIDGS 597
+P+ E + L LQ + T H GLN R R T +G+ P ++DG
Sbjct: 1324 LVPIEEAAHASLQSLQRYLTYATPHAALAGLNARTHRHPKTVEGRPMRQPAP-HSLLDGG 1382
Query: 598 LV 599
L+
Sbjct: 1383 LL 1384
>gi|336276223|ref|XP_003352865.1| hypothetical protein SMAC_04980 [Sordaria macrospora k-hell]
gi|380092984|emb|CCC09221.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 1486
Score = 205 bits (521), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 171/633 (27%), Positives = 280/633 (44%), Gaps = 81/633 (12%)
Query: 17 VQELLTVSLG--LHGNRPLLLVRTQHELLIYQAFRHPKGALK-----LRFKKL-KVLFVS 68
V E+L LG H + L+L +L +YQ +R A + L F+K+ F
Sbjct: 842 VAEILVADLGDTTHKSPYLILRHANDDLTLYQPYRVKATAGQPFSKSLFFQKVPNSTFAK 901
Query: 69 DRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDG 128
++ E L R MR +NI+GY VFL G P+++ T++ R + G
Sbjct: 902 APEEKPVEDDELHNAQRFLPMRRCTNISGYSTVFLPGSSPSFILKTAKSSPRVLGLQGSG 961
Query: 129 PVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSY-DAPWPVRKVPLKCTPHFLAYHL 187
V ++ FH C GF+Y + R++ +PT S+ + V+K+P+ + YH
Sbjct: 962 -VQAMSSFHTEGCEHGFIYADTNGIARVTQIPTDSSFAELGLSVKKIPVGVDTQSVVYHP 1020
Query: 188 ETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQ 247
T+ Y + + AEP ++ +D R++ P+V + + L S +W I
Sbjct: 1021 PTQAYVVGCNNAEP----FELPKDDDYHKEWARENITFKPMVDRGMLKLLSGITWTVI-- 1074
Query: 248 TNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVP 307
+ E VLC++ +++E + + + IA+GT ED+ RGR+ +FDI +V+P
Sbjct: 1075 DTVEMEPCETVLCVETLNLEVSESTNERKQLIAVGTALTKGEDLPTRGRVYVFDIADVIP 1134
Query: 308 EPGQPLTKNKIKMIYAKEQ--KGPVTAICHVA--GFLVTAVGQKIYIWQLK-DNDLTGIA 362
EPG+P T K+K++ AKE +G VTA+ V G ++ A GQK + LK D L +A
Sbjct: 1135 EPGKPETSKKLKLV-AKEDIPRGAVTALSEVGTQGLMLVAQGQKCMVRGLKEDGTLLPVA 1193
Query: 363 FIDTEVYIASMVSV--KNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYA 420
F+D Y+ S+ + L L+ D + + Y E + L +
Sbjct: 1194 FMDMNCYVTSVKELPGTGLCLMADAFKGVWFTGYTEEPYKMMLFGKSST----------- 1242
Query: 421 GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFM 480
R+E+ + D L + + + SD D ++ +
Sbjct: 1243 -----------------------RMEVL------NADFLPDGKELYIVASDADGHIHILQ 1273
Query: 481 YQPEARESNGGHRLIKKTDFHLGQHVNT-------FFKIRCKPSSISDAPGARSRFLTWY 533
+ PE +S GH L+ +T F+ G H T + P+S S+ G +
Sbjct: 1274 FDPEHPKSLQGHLLLHRTTFNTGAHHPTSSLLLPAVYPTTTSPNSNSEV-GENPPHILLL 1332
Query: 534 ASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFR--------TYKGKGYY 585
AS G L PL E YRRL L + H GLNP+ +R + + G
Sbjct: 1333 ASPTGLLATLRPLQENAYRRLSSLAIQLTNALPHPAGLNPKGYRLPSPSASASMQLPGVD 1392
Query: 586 AGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 618
AG R I+DG ++ +F++L G+R EI + G
Sbjct: 1393 AGI-GRNIVDGKILERFMELGTGKRQEIAGRAG 1424
>gi|367052335|ref|XP_003656546.1| hypothetical protein THITE_2121311 [Thielavia terrestris NRRL 8126]
gi|347003811|gb|AEO70210.1| hypothetical protein THITE_2121311 [Thielavia terrestris NRRL 8126]
Length = 1460
Score = 204 bits (519), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 171/650 (26%), Positives = 283/650 (43%), Gaps = 104/650 (16%)
Query: 17 VQELLTVSLG--LHGNRPLLLVRTQHELLIYQAFRHPKGALK-----LRFKKLKVLFVSD 69
+ E+L LG H + L+L +L +YQ FR K + L F+K+ ++
Sbjct: 844 LAEILVADLGDSTHKSPYLILRHANDDLTLYQPFRSRKATEQAFSETLFFQKVPNTALAK 903
Query: 70 RSKRANE-----QPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPM 124
+ A+E QP R MR N+ GY VF+ G P+++ +S+ R P+
Sbjct: 904 SPQEADEDEASHQP------RFLSMRRCDNVGGYSTVFVPGASPSFIIASSKSMPRVMPL 957
Query: 125 TIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSY-DAPWPVRKVPLKCTPHFL 183
G V ++PFH C GF+Y +++ R+ P Y + VRK+P+ +
Sbjct: 958 QGSG-VIAMSPFHTEGCEHGFIYADSRRIARVCQFPDGCIYAETGVAVRKIPIGEDIAAV 1016
Query: 184 AYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWE 243
AYH ++Y + +T+EP ++ +D R++ P V + + L SP +W
Sbjct: 1017 AYHPPMQSYVVGCNTSEP----FELPKDDDYHKEWARENLSFKPTVDRGILKLLSPITWT 1072
Query: 244 EIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDII 303
+ + E +LC++ +++E + + IA+GT ED+ RGR+ ++DI
Sbjct: 1073 VVDAVQ--MEPCETILCVETLNLEVSEFTNERKQLIAVGTALTKGEDLPTRGRVYVYDIA 1130
Query: 304 EVVPEPGQPLTKNKIKMIYAKEQ--KGPVTAICHVA--GFLVTAVGQKIYIWQLK-DNDL 358
+V+PEPG+P T K+K+I AKE +G VTA+ + G ++ A GQK + LK D L
Sbjct: 1131 DVIPEPGRPETGKKLKLI-AKEDIPRGAVTALSEIGTQGLMLVAQGQKCMVRGLKEDGSL 1189
Query: 359 TGIAFIDTEVYIASMVSV--KNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSK 416
+AF+D Y+ + + L L+ D + + Y E + L +
Sbjct: 1190 LPVAFMDMNCYVTAAKELPGTGLCLLADAFKGVWFTGYTEEPYKMMLFGKSST------- 1242
Query: 417 GYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNV 476
+LE+ + D L + + F++SD D +
Sbjct: 1243 ---------------------------KLEVL------NADFLPDGKELSFVVSDADGYI 1269
Query: 477 VLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSI----------------- 519
+ + PE +S GH L+ +T F+ G H T K P+S
Sbjct: 1270 HILQFDPEHPKSLQGHLLLHRTTFNTGAHHAT--KSLLLPASTPADKEKNDGNAANAQAK 1327
Query: 520 ---SD-----APGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGL 571
SD P A+ + AS G L PL E YRRL L + H GL
Sbjct: 1328 AKASDNKQPREPAAQRPHVLLLASPTGVLAALRPLSESAYRRLSSLAAQLTNSLPHPAGL 1387
Query: 572 NPRAFRTYKGKGYYAGNPS---RGIIDGSLVWKFLQLSLGERLEICKKIG 618
NPR +R + AG + R I+DG+++ +F +L + R+E+ + G
Sbjct: 1388 NPRGYRAAGAECPPAGVDAGLGRSIVDGTVLERFAELGMARRVELAGRAG 1437
>gi|189203597|ref|XP_001938134.1| conserved hypothetical protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187985233|gb|EDU50721.1| conserved hypothetical protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 1407
Score = 203 bits (516), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 178/657 (27%), Positives = 280/657 (42%), Gaps = 109/657 (16%)
Query: 10 SAMDETIVQELLTVSLGLHGNR-PLLLVRTQHE-LLIYQAFRHP-KGALKLRFKKLKVLF 66
SA TI E+L LG + P L++RT + L+IY+AF P + A L K L+ +
Sbjct: 788 SAAKATIT-EILAADLGDATTKSPHLIIRTSSDNLVIYKAFHAPSRSASDLWTKNLRWVK 846
Query: 67 VSDR------SKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELR 120
+S + +E PG S + ++ GY VF G PA++F + R
Sbjct: 847 LSQQHVPRYIEDNGSEDPGFE-----STLVALDDVCGYSTVFQRGTTPAFIFKEASSAPR 901
Query: 121 AHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYD-APWPVRKVPLKCT 179
++ PV +L FH C RGF Y ++ LRI LP Y W R++P+
Sbjct: 902 VIGLS-GKPVKSLTSFHTSKCQRGFAYLDSTDTLRICQLPPQTHYGHLGWATRRMPMDSE 960
Query: 180 PHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSP 239
H L YH + Y + T T+ Y+ + + P++ P + + V L
Sbjct: 961 VHALTYH-PSGLYIVGTG----QTEDYQLDPTETYHYDLPKEDLTFKPSIERGVVKLLDE 1015
Query: 240 FSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILL 299
SW I T+ L E VL +K +++E + IA+GT+ + ED+ +G I +
Sbjct: 1016 KSWT-IIDTHI-LDPQEIVLSIKTLNLEVSEITHQRKDLIAVGTSVVHGEDLATKGCIRI 1073
Query: 300 FDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA--GFLVTAVGQKIYIWQLK-DN 356
F++I VVP+P +P T ++K+I E KG V+AI + GFL+ A GQK + LK D
Sbjct: 1074 FEVITVVPQPDRPETNKRLKLIVKDEVKGAVSAISELGTQGFLIMAQGQKCMVRGLKEDG 1133
Query: 357 DLTGIAFIDTEVYIASMVSV--KNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPN 414
L +AF+D + Y++ + ++ ++ +GD R + Y E +SL AR
Sbjct: 1134 TLLPVAFMDMQCYVSDLKNLPGTGMLAMGDAYRGVWFTGYTEEPYKMSLFAR-------- 1185
Query: 415 SKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHN------DILDEFSSMGFM 468
SKHN D L + +
Sbjct: 1186 --------------------------------------SKHNLETIAVDFLPFDQQLHLV 1207
Query: 469 ISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF------------------- 509
++D D N+ + + P+ +S G RL+ K FH G +
Sbjct: 1208 VADADMNLQILQFDPDNPKSEAGSRLLHKATFHTGHLPTSLHLIHSHLKLPSATDFAATN 1267
Query: 510 ------FKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVT 563
F + P++ +D P + + + G L PL E +YRRL L +
Sbjct: 1268 SNPADAFAMDTSPNTTTDTP-QQPFHQILHTTQSGTLALLTPLSEDSYRRLSNLTAYLAN 1326
Query: 564 HTSHTGGLNPRAFRT--YKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 618
LNPRAFRT G+ AG +RG++DG+L+ ++ +L R E K G
Sbjct: 1327 TLDSACSLNPRAFRTGDVAEGGWDAGTGARGVLDGNLLLRWGELGERGRREGLAKYG 1383
>gi|340924328|gb|EGS19231.1| hypothetical protein CTHT_0058560 [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 1460
Score = 203 bits (516), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 172/647 (26%), Positives = 287/647 (44%), Gaps = 96/647 (14%)
Query: 17 VQELLTVSLG--LHGNRPLLLVRTQHELLIYQAFRHPKG-----ALKLRFKKLKVLFVSD 69
+ E++ LG H + L+L + +L IYQ +R+ G + L F+KL +
Sbjct: 842 ITEIMVADLGDTTHKSPYLILRHSNDDLTIYQPYRYKLGTGQVFSKTLFFQKLPNPSFA- 900
Query: 70 RSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGP 129
R+ EQ +P R+ MR +NIAGY VFL G P+++ +++ R P+ G
Sbjct: 901 RAPEETEQDDVPPQPRLLSMRRCNNIAGYSTVFLPGHSPSFILKSAKSMPRVVPLQGAG- 959
Query: 130 VSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSY-DAPWPVRKVPLKCTPHFLAYHLE 188
V ++PFH C GF+Y ++ + R++ +P SY + V+KVP+ +AYH
Sbjct: 960 VIAMSPFHTEGCDHGFIYADSHNIARVTQIPEDWSYAELGLAVKKVPIGEDIAAVAYHPP 1019
Query: 189 TKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQT 248
+ Y + + +EP ++ +D R++ P + + + L SP +W I
Sbjct: 1020 QQCYVVGCNASEP----FELPKDDDYHKEWARENLVFKPTLDRGLLKLISPITWTVIDTV 1075
Query: 249 NFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPE 308
L E VLC++ +++E + + R IA+GT ED+ RGR+ ++DI +V+PE
Sbjct: 1076 Q--LEPCETVLCVETLNLEVSESTNERRQLIAVGTALTKGEDLPTRGRVHVYDIADVIPE 1133
Query: 309 PGQPLTKNKIKMIYAKEQ--KGPVTAICHVA--GFLVTAVGQKIYIWQLK-DNDLTGIAF 363
PG+P T K+K+I AKE +G VTA+ + G ++ A GQK + LK D L +AF
Sbjct: 1134 PGKPETSKKLKLI-AKEDIPRGAVTALSEIGTQGLMLVAQGQKCMVRGLKEDGTLLPVAF 1192
Query: 364 IDTEVYIASMVSV--KNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAG 421
+D Y+ + + L L+ D + + + Y E + L +
Sbjct: 1193 MDMSCYVTAAKELPGTGLCLMADAFKGVWFVGYTEEPYKMMLFGKS-------------- 1238
Query: 422 NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMY 481
+LE+ D L + + + D D ++ + +
Sbjct: 1239 --------------------STKLEVLTA------DFLPDGKELFIVACDADGHIHILQF 1272
Query: 482 QPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSI-SDAPG---------------- 524
PE +S GH L+ +T F+ G H T K PS++ +D P
Sbjct: 1273 DPEHPKSLQGHLLLHRTSFNTGAHNPT--KSLLLPSTLPTDTPSTIDGSNPNTNNTNGTP 1330
Query: 525 ---------ARSR-FLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPR 574
A R + S G + PL E +YRRL L +V H GLNP+
Sbjct: 1331 NASNLAPYDATERPHILLLCSPTGLIAALRPLSESSYRRLSSLAAQLVNSLPHAAGLNPK 1390
Query: 575 AFRTYKGKGYYAG---NPSRGIIDGSLVWKFLQLSLGERLEICKKIG 618
+R AG + R I+DG+++ +F +L + R E+ + G
Sbjct: 1391 GYRMPSADCPPAGVDASVGRNIVDGTVLERFTELGMARRAELAGRAG 1437
>gi|336388105|gb|EGO29249.1| hypothetical protein SERLADRAFT_445076 [Serpula lacrymans var.
lacrymans S7.9]
Length = 1424
Score = 203 bits (516), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 166/644 (25%), Positives = 296/644 (45%), Gaps = 77/644 (11%)
Query: 9 PSAMDETIVQELLTVSLGLHGNRPLLLVRTQH-ELLIYQAFRHPKGALKL---RFKKLKV 64
P ++ V++++ LG P LLV + +++IY+A P A + R LKV
Sbjct: 833 PRKSNDLDVEQIILAPLGETAPLPYLLVFLRSGQIVIYEAVPTPAPADSIPPSRVSVLKV 892
Query: 65 LFVSDRSK-------RANEQPGLPRGVRISQMRYF-----SNIAG--YQGVFLCGPHPAW 110
F+ +K E+ L RIS R F S G GVF G P+W
Sbjct: 893 KFIKTATKIFELPKHEETEKSILAEQKRIS--RQFVPFVTSPTPGSVLSGVFFTGDRPSW 950
Query: 111 LFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWP 170
+ T++G +R + + V + FL ++ + + +P L D+ P
Sbjct: 951 IVATNKGGIRIYS-SGHHIVHSFTSCSLWESKGDFLVYSDEGPSLLEWMP-DLCLDSVLP 1008
Query: 171 VRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVS 230
R +P + Y + ++ + + ++ F+ ED ++ +P S P
Sbjct: 1009 SRNIPRSRAYANVVYD---PSAMLIVAASSMQANFASFD-EDGNIIWEPEASNVSLPKCD 1064
Query: 231 QFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSED 290
+ L +P +W I + E+V L+ V++E T +G + +IA+GT+ + ED
Sbjct: 1065 CSTLELIAPEAW--ITMDGYEFAPNEYVNALECVTLETLSTETGSKDFIAVGTSIDRGED 1122
Query: 291 VTCRGRILLFDIIEVVPEPGQPLTK-NKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIY 349
+ +G LF+I+EVVP+ Q L + K+K++ + KGPVTA+C + G+LV+++GQKI+
Sbjct: 1123 LAVKGATYLFEIVEVVPDYSQNLKRWYKLKLLARDDAKGPVTALCGINGYLVSSMGQKIF 1182
Query: 350 IWQLK-DNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDY 408
I D L G+AF+D VY+ S+ VKN +L+GD +SI + +Q + L ++A+D
Sbjct: 1183 IRAFDMDERLVGVAFLDVGVYVTSLRVVKNFLLIGDAVKSIWFVAFQEDPYKLVVLAKDV 1242
Query: 409 KPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFM 468
T + ++ + + I+
Sbjct: 1243 HRTHVTNADFFFTDDTLSIV---------------------------------------- 1262
Query: 469 ISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSR 528
D D + ++ Y P+ ES G L+ +T+FH + I + S P A +
Sbjct: 1263 TEDGDGILRMYAYDPDDPESKNGQHLLCRTEFHNHSECRSSLVIARRTKEESVLPQA--K 1320
Query: 529 FLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGN 588
L+ ++ DG+L P+ + +++RL +LQ + + H GLNPRA+R + +
Sbjct: 1321 ILSAFS--DGSLSSLTPVDDASFKRLQLLQGQLTRNIQHVAGLNPRAYRIVRND--FVSK 1376
Query: 589 P-SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI 631
P S+ I+DG L+ F L + + E+ K+IG++ N +L + ++
Sbjct: 1377 PLSKDILDGQLLSAFESLPISRQNEMTKQIGTERNIVLHDWMEL 1420
>gi|336375160|gb|EGO03496.1| hypothetical protein SERLA73DRAFT_165174 [Serpula lacrymans var.
lacrymans S7.3]
Length = 1428
Score = 202 bits (515), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 166/644 (25%), Positives = 296/644 (45%), Gaps = 77/644 (11%)
Query: 9 PSAMDETIVQELLTVSLGLHGNRPLLLVRTQH-ELLIYQAFRHPKGALKL---RFKKLKV 64
P ++ V++++ LG P LLV + +++IY+A P A + R LKV
Sbjct: 837 PRKSNDLDVEQIILAPLGETAPLPYLLVFLRSGQIVIYEAVPTPAPADSIPPSRVSVLKV 896
Query: 65 LFVSDRSK-------RANEQPGLPRGVRISQMRYF-----SNIAG--YQGVFLCGPHPAW 110
F+ +K E+ L RIS R F S G GVF G P+W
Sbjct: 897 KFIKTATKIFELPKHEETEKSILAEQKRIS--RQFVPFVTSPTPGSVLSGVFFTGDRPSW 954
Query: 111 LFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWP 170
+ T++G +R + + V + FL ++ + + +P L D+ P
Sbjct: 955 IVATNKGGIRIYS-SGHHIVHSFTSCSLWESKGDFLVYSDEGPSLLEWMP-DLCLDSVLP 1012
Query: 171 VRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVS 230
R +P + Y + ++ + + ++ F+ ED ++ +P S P
Sbjct: 1013 SRNIPRSRAYANVVYD---PSAMLIVAASSMQANFASFD-EDGNIIWEPEASNVSLPKCD 1068
Query: 231 QFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSED 290
+ L +P +W I + E+V L+ V++E T +G + +IA+GT+ + ED
Sbjct: 1069 CSTLELIAPEAW--ITMDGYEFAPNEYVNALECVTLETLSTETGSKDFIAVGTSIDRGED 1126
Query: 291 VTCRGRILLFDIIEVVPEPGQPLTK-NKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIY 349
+ +G LF+I+EVVP+ Q L + K+K++ + KGPVTA+C + G+LV+++GQKI+
Sbjct: 1127 LAVKGATYLFEIVEVVPDYSQNLKRWYKLKLLARDDAKGPVTALCGINGYLVSSMGQKIF 1186
Query: 350 IWQLK-DNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDY 408
I D L G+AF+D VY+ S+ VKN +L+GD +SI + +Q + L ++A+D
Sbjct: 1187 IRAFDMDERLVGVAFLDVGVYVTSLRVVKNFLLIGDAVKSIWFVAFQEDPYKLVVLAKDV 1246
Query: 409 KPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFM 468
T + ++ + + I+
Sbjct: 1247 HRTHVTNADFFFTDDTLSIV---------------------------------------- 1266
Query: 469 ISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSR 528
D D + ++ Y P+ ES G L+ +T+FH + I + S P A +
Sbjct: 1267 TEDGDGILRMYAYDPDDPESKNGQHLLCRTEFHNHSECRSSLVIARRTKEESVLPQA--K 1324
Query: 529 FLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGN 588
L+ ++ DG+L P+ + +++RL +LQ + + H GLNPRA+R + +
Sbjct: 1325 ILSAFS--DGSLSSLTPVDDASFKRLQLLQGQLTRNIQHVAGLNPRAYRIVRND--FVSK 1380
Query: 589 P-SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI 631
P S+ I+DG L+ F L + + E+ K+IG++ N +L + ++
Sbjct: 1381 PLSKDILDGQLLSAFESLPISRQNEMTKQIGTERNIVLHDWMEL 1424
>gi|320591495|gb|EFX03934.1| cleavage and polyadenylation specificity factor subunit [Grosmannia
clavigera kw1407]
Length = 1461
Score = 201 bits (512), Expect = 8e-49, Method: Compositional matrix adjust.
Identities = 177/652 (27%), Positives = 289/652 (44%), Gaps = 104/652 (15%)
Query: 12 MDETIVQELLTVSLGLHGNR-PLLLVR-TQHELLIYQAFRHPKGALKL----RFKKL-KV 64
M + + E+L LG ++ P L+VR +L IYQ R P L RF K+
Sbjct: 846 MAKEPLTEILVADLGDAVSKAPYLIVRHANDDLTIYQPLRTPSSLGSLSESLRFLKVPNP 905
Query: 65 LF----VSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELR 120
+F VS S A+ Q +R +R NI GY VFL G +++ +++ + R
Sbjct: 906 VFAKSPVSISSDDASSQ------LRAMPLRVCENIGGYSTVFLPGSSASFVLKSAKSQPR 959
Query: 121 AHPMTIDG-PVSTLAPFHNVNCPRGFLYFNAKSELRISVLP-----THLSYDAPWPVRKV 174
+++ G V +L+PFH + R F+Y + + R+ +P T L A RKV
Sbjct: 960 V--VSLQGTAVRSLSPFHTESSERSFIYVDVEGSGRVCSMPAGWNLTELGVCA----RKV 1013
Query: 175 PLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHV 234
L + LAYH T TY + TS E ++ +D ++S PL + +
Sbjct: 1014 ALDTDANALAYHPPTGTYAVGTSALE----AFELPKDDPHRADWNKESTAFRPLAERGRL 1069
Query: 235 SLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCR 294
L SP SW I + +E V+C+K +++E + + +A+GT + ED+ R
Sbjct: 1070 LLMSPGSWSTIDTVE--MEPYEVVMCVKTLNLEVSEATNERKQLVAVGTAISRGEDLAIR 1127
Query: 295 GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQ--KGPVTAICHVA--GFLVTAVGQKIYI 350
GR+ +FD++ V+PEPG+P T K+K+I AKE +G VTA+ + G ++ A GQK +
Sbjct: 1128 GRVYVFDVVSVIPEPGRPETNRKLKLI-AKEDIPRGAVTAVSEIGTQGLMLVAQGQKCLV 1186
Query: 351 WQLK-DNDLTGIAFIDTEVYIASMVSV--KNLILVGDYARSIALLRYQPEYRTLSLVARD 407
LK D L +AF+D Y+ S + L ++ D + + Y E
Sbjct: 1187 RGLKEDGTLLPVAFMDMNCYVTSAKELPGTGLCVMSDAFKGVWFTGYTEE---------- 1236
Query: 408 YKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGF 467
P + I+ G + L++ D+L + +
Sbjct: 1237 ---------------PYKMILFGKSNTRLHALNV---------------DLLPDGKELFI 1266
Query: 468 MISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF-------FKIRCKPSSIS 520
+++D D N+ + + PE +S GH L+ + F G H +T F +P++
Sbjct: 1267 VVTDADGNLHVMQFDPEHPKSLQGHILLHRATFCTGAHFSTLSLLLPSTFTPADRPTANG 1326
Query: 521 DAPGARSR--------FLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 572
+ GA S+ S G L +PL E YRRL L + T + T GLN
Sbjct: 1327 ETNGASSQPEAQQHQQHQLLLGSPTGLLASLVPLSESEYRRLSSLAGQLATSLTQTAGLN 1386
Query: 573 PRAFRTYKGKGYYAGNP------SRGIIDGSLVWKFLQLSLGERLEICKKIG 618
P+ +R G P R ++DG+L+ ++ +L G + EI ++G
Sbjct: 1387 PKGYRMTAGSAAATLAPGVDAAVGRSVVDGALLARWTELGSGRKGEIAGRVG 1438
>gi|303285993|ref|XP_003062286.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226455803|gb|EEH53105.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 1469
Score = 201 bits (510), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 172/645 (26%), Positives = 286/645 (44%), Gaps = 105/645 (16%)
Query: 30 NRPLLL-VRTQHELLIYQAFR----HPKG-AL-KLRFKKLKVLFVSDRSKRANEQPGLPR 82
RPLL +R +L+Y+AF P G AL +LRF ++ V A + LP
Sbjct: 884 ERPLLTALRADGAVLVYRAFTCAVAGPGGRALTQLRFARVPVEL-EGGGGGAVDLSALP- 941
Query: 83 GVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGP-VSTLAPFHNVNC 141
G R+++ + G +GVF+ GP P WL L R + A P+ + V + FHNVNC
Sbjct: 942 GSRLTRFERVGDRGGIRGVFVSGPQPLWL-LARRSRVLALPVRGEAQRVVSFTAFHNVNC 1000
Query: 142 PRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTST--- 198
GF+ A +RI +P + Y+A WPVRK+ L+CTPH + Y + K Y + TS
Sbjct: 1001 HAGFILGTAAGGVRICQIPGRMHYEAAWPVRKLALRCTPHHVQYLPDFKLYALSTSAPAK 1060
Query: 199 -AEPSTDYYKFNGEDKELVTDPRDSRFIP--PLVSQFHVSLFSPFSWEEIPQTNFPLHEW 255
EP + V R ++ + + QF V L P S E + +
Sbjct: 1061 WVEPEVAEEDIHA---ATVVKTRRAKAMARGGVEEQFAVKLLVPGSLET--AWSRTMDPG 1115
Query: 256 EHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTK 315
EHV +KNV + T L +A+GT ED CRGR++LF+I
Sbjct: 1116 EHVQAVKNVQVRNLRT-GALHSMLAVGTAMPGGEDTPCRGRVILFEI------------- 1161
Query: 316 NKIKMIYAKEQKGP---------VTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDT 366
+M+ + ++ P + A+ + G LV A+G K+ + +L +AF DT
Sbjct: 1162 -SWQMVDGETRRVPLLLLFFDDALAALSGLEGHLVVAIGTKLIVHAWDGAELIPVAFFDT 1220
Query: 367 EVYIASMVSVKNLILVGDYARSIALLRYQPEYRT----LSLVARDYKPTQPNSKGYYAGN 422
V+ ++ VKN + +GD + R++ + RT L +A+D++ S +
Sbjct: 1221 PVHTVTINVVKNFVCIGDVQKGAYFFRWKDDPRTGEKNLIQLAKDFESMDVLSTEF---- 1276
Query: 423 PSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQ 482
++DG S++ + +D N +F Y
Sbjct: 1277 ----LVDG--------------------------------STLSLLAADTAGNAYVFAYD 1300
Query: 483 PEARESNGGHRLIKKTDFHLGQHVNTF--FKIRCKPSSISDAPGA---------RSRFLT 531
P++ ES G +L+ K FH+G V+ FK++ + +D A +R
Sbjct: 1301 PKSSESWKGQKLLTKASFHVGSPVHRMVRFKLKTPTGAGNDGRAAPTPAEIKANANRHAV 1360
Query: 532 WYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFR---TYKGKGYYAGN 588
++ +LDG+LG +P+ + +L +LQ + +T+ GLN R++R T +G+ +
Sbjct: 1361 FFGTLDGSLGILVPMESSTHAKLEVLQRWLNYNTAQNAGLNGRSYRAPKTTEGRAMRSPA 1420
Query: 589 PSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEA 633
P ++DG ++ F L+ ++ E G + L L+ + A
Sbjct: 1421 P-HNLLDGEMLQGFESLAWTKQAEAADAAGMTREEALTYLHTLSA 1464
>gi|121797760|sp|Q2TZ19.1|CFT1_ASPOR RecName: Full=Protein cft1; AltName: Full=Cleavage factor two protein
1
gi|83775384|dbj|BAE65504.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 1393
Score = 198 bits (504), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 169/652 (25%), Positives = 298/652 (45%), Gaps = 93/652 (14%)
Query: 16 IVQELLTVSLGLH-GNRPLLLVRTQHE-LLIYQAFRHPKGAL-----KLRFKKLKVLFVS 68
++ E++ LG + P L++R++H+ L +Y+ F ++ L F K L +
Sbjct: 796 VLTEIVVADLGDSWSSFPYLIIRSRHDDLAVYRPFISITKSVGEPHADLNFLKETNLVLP 855
Query: 69 DRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDG 128
+ +Q ++ +R SNI+G+ +F G P ++ TS H + + G
Sbjct: 856 RITSGVEDQSSTEEVIKSVPLRIVSNISGFSAIFRPGVSPGFIVRTSTSS--PHFLGLKG 913
Query: 129 P-VSTLAPFHNVNCPRGFLYFNAKSELRISVLPTH--LSYDA------PWPVRKVPLKCT 179
+L+ F C GF+ ++K I + T+ LS+ PW ++++P+
Sbjct: 914 GYAQSLSKFQTSECGEGFILLDSKVLCFILLCLTYCILSFHTGCHSYYPWTIQQIPIGEQ 973
Query: 180 PHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRD--SRFIPPLVSQFHVSLF 237
LAY + Y I TS +K ED EL + R+ + F P V + + +
Sbjct: 974 VDHLAYSSSSGMYVIGTS----HRTEFKLP-EDDELHPEWRNEMTSFFPE-VQRSSLKVV 1027
Query: 238 SPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRI 297
SP +W I EHV+ +KN+S+E + I +GT + ED+ RG +
Sbjct: 1028 SPKTWTVIDSPA------EHVMAVKNMSLEISENTHERKDMIVVGTAFARGEDIASRGCV 1081
Query: 298 LLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA--GFLVTAVGQKIYIWQLK- 354
+F++I+VVP+P +P K++++ + KG VTA+ + GFL+ A GQK + LK
Sbjct: 1082 YVFEVIKVVPDPKRPEMDRKLRLVGKEPVKGAVTALSEIGGQGFLIVAQGQKCIVRGLKE 1141
Query: 355 DNDLTGIAFIDTEVYIASMVSVK--NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQ 412
D L +AF+D + +++ + +K + ++ D + + Y E +SL A+D
Sbjct: 1142 DGSLLPVAFMDVQCHVSVVKELKGTGMCIIADAVKGLWFAGYSEEPYKMSLFAKDL---- 1197
Query: 413 PNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDK 472
+ LE+ D L + + + +++D
Sbjct: 1198 ------------------------------DYLEVLAA------DFLPDGNKLFILVADS 1221
Query: 473 DKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI-RCKPSS---ISDAPGAR-- 526
D N+ + Y PE +S+ G RL+ ++ FH G ++T + R SS ISD
Sbjct: 1222 DCNLHVLQYDPEDPKSSNGDRLLSRSKFHTGNFISTLTLLPRTSVSSEQMISDVDAMDVD 1281
Query: 527 ---SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKG 583
R S +G++G + E++YRRL LQ+ + H GLNPRAFR +
Sbjct: 1282 IKIPRHQMLITSQNGSVGLVTCVSEESYRRLSALQSQLTNTIEHPCGLNPRAFRAVESD- 1340
Query: 584 YYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALS 635
G RG++DG L++++L +S ++EI ++G+ +I D EA+S
Sbjct: 1341 ---GTAGRGMLDGKLLFQWLDMSKQRKVEIASRVGANEWEI---KADFEAIS 1386
>gi|294659889|ref|XP_462318.2| DEHA2G17908p [Debaryomyces hansenii CBS767]
gi|218511978|sp|Q6BHK3.2|CFT1_DEBHA RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two protein
1
gi|199434312|emb|CAG90824.2| DEHA2G17908p [Debaryomyces hansenii CBS767]
Length = 1342
Score = 197 bits (502), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 139/552 (25%), Positives = 252/552 (45%), Gaps = 60/552 (10%)
Query: 88 QMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLY 147
++ YF N+ G+ +F+ G P ++ T+ R T P + AP+ + G +Y
Sbjct: 838 RLVYFPNVNGFTSIFVTGITPYYISKTTHSVPRIFKFT-KLPAVSFAPYSDDKIKNGLIY 896
Query: 148 FNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYK 207
+ RI +P +Y+ WP++K+P+K + + YH + T+ I T P Y
Sbjct: 897 LDNSKNARICEIPVDFNYENNWPIKKIPIKESIKSVTYHELSNTFVISTYEEIP---YDC 953
Query: 208 FNGEDKELVTDPRDSRFIPPLVSQF--HVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVS 265
+ E K +V + P + + ++ L SP++W I L + E + ++++
Sbjct: 954 LDEEGKPIVGVDKSK----PSANSYKGYIKLISPYNWSVIDT--IELVDGEIGMNVQSMV 1007
Query: 266 MEYEGTLSGLRG---YIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIY 322
++ + + I +GT ED++ G +F+II+++PEPG+P T +K K I+
Sbjct: 1008 LDVGSSTKKFKNKKELIVIGTGKYRMEDLSANGSFKIFEIIDIIPEPGKPETNHKFKEIH 1067
Query: 323 AKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILV 382
++ KG VT+IC ++G + + GQKI I L+D+ + +AF+DT VY++ S NL+++
Sbjct: 1068 QEDTKGAVTSICEISGRFLVSQGQKIIIRDLQDDGVVPVAFLDTSVYVSEAKSFGNLLIL 1127
Query: 383 GDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLG 442
GD +SI L + E + ++ +D + N
Sbjct: 1128 GDSLKSIWLAGFDAEPFRMVMLGKDLQSLDVN---------------------------- 1159
Query: 443 ERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHL 502
C K +I +I+D + + L Y PE S+ G RLI K F++
Sbjct: 1160 -----CADFIIKDEEIF-------ILIADNNSTLHLVKYDPEDPTSSNGQRLIHKASFNI 1207
Query: 503 GQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMV 562
T IR P + P + F + +++DG+ P+ E +YRR+ +LQ +
Sbjct: 1208 NS---TPTCIRSIPKNEEINPSSTEVFQSIGSTIDGSFYTVFPINEASYRRMYILQQQIT 1264
Query: 563 THTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK-- 620
H GLNPR R ++ ++D ++ F +L+ R + K+ SK
Sbjct: 1265 DKEYHFCGLNPRLNRFGGLSMTVNDTNTKPLLDYEVIRMFAKLNEDRRKNLSMKVSSKNV 1324
Query: 621 HNDILDELYDIE 632
+ DI +L + +
Sbjct: 1325 YQDIWKDLIEFD 1336
>gi|440637976|gb|ELR07895.1| hypothetical protein GMDG_02777 [Geomyces destructans 20631-21]
Length = 1495
Score = 197 bits (502), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 179/657 (27%), Positives = 289/657 (43%), Gaps = 91/657 (13%)
Query: 10 SAMDETIVQELLTVSLGLHGNRPLLLVRTQHE-LLIYQAFRHPKGALK-----LRFKKLK 63
S + ET+ + LL P L+ R ++ L IY+ F+ P A + L F+K+
Sbjct: 892 STVAETLTEVLLADLGDATSKSPYLIFRASNDDLTIYEPFQVPSEAPRPLSKSLHFQKIH 951
Query: 64 VLFVSDRSKRANEQPGLPRGV-RISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAH 122
V+ + E R S MR +N+ G VFL G P+++ +S+ R
Sbjct: 952 NPHVAKTANPETEVAADAESAKRGSPMRAIANVGGLSSVFLPGDSPSFVVKSSKSTPRVV 1011
Query: 123 PMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVL-PTHLSYDAPWPVRKVPLKCTPH 181
+ G V +L+ FH C RGF+Y ++K R+S L P D +RKV +
Sbjct: 1012 GLRGHG-VRSLSGFHTEGCDRGFIYVDSKGIARVSQLEPETNVTDIGLTLRKVKIGEEVQ 1070
Query: 182 FLAYHLETKTYCIVTSTAEP-----STDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSL 236
+ YH Y I T EP DY++ KE +T PL + + L
Sbjct: 1071 AVTYHPPKDVYVIGTVVKEPFELPKDDDYHREWA--KEDIT-------FKPLTGRGFLKL 1121
Query: 237 FSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGR 296
+P +W I + HE ++C+K +++E + I +GT + ED+ RGR
Sbjct: 1122 LNPSNWSVIDKVELDSHEI--IMCIKTLNLEVSENTHERKQLITVGTAISKGEDLAIRGR 1179
Query: 297 ILLFDIIEVVPEPGQPLTKNKIKMIYAKEQ--KGPVTAICHVA--GFLVTAVGQKIYIWQ 352
+ ++++I VVP P +P T K+K+I AKE+ +G +T I + GF++ A GQK +
Sbjct: 1180 VYVYEVITVVPFPDRPETNKKLKLI-AKEEIPRGAITGISEIGTQGFMIVAQGQKSMVRG 1238
Query: 353 LK-DNDLTGIAFIDTEVYIASMVSV--KNLILVGDYARSIALLRYQPEYRTLSLVARDYK 409
LK D L +AFID Y+ ++ S+ + L D + + Y E +++ +
Sbjct: 1239 LKEDGTLLPVAFIDMNTYVTTVKSLPGTGMCLFADAIKGVWFAGYSEEPYKMTIFGK--- 1295
Query: 410 PTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMI 469
S+G +I L L +G+ L I ++
Sbjct: 1296 ----QSQGME-------VITADL------LPIGDELYI--------------------IV 1318
Query: 470 SDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQH-----------VNTFFKIRCKPSS 518
+D D N+ + + PE +S G L+++T F LG H T S+
Sbjct: 1319 ADSDCNLHVLQFDPEHPKSLHGQLLLQRTTFSLGGHMPTTMTLLPLTTTTQTPTPAVTST 1378
Query: 519 ISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRT 578
S+ S L +S G + PL E+ YRRL L N + H GGLNP+A R
Sbjct: 1379 ASEPTNPASGLLMTLSS--GVVAILTPLSEQQYRRLNALSNHLSNLLYHPGGLNPKAHRI 1436
Query: 579 YKG--KGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEA 633
+ G P I+DGS++W++L+L +R E+ ++G I ++L +I A
Sbjct: 1437 SNTAPEAVIGGRP---IVDGSVLWRWLELGSQKRAEVAGRVGVDGETIREDLQEIAA 1490
>gi|395324102|gb|EJF56549.1| hypothetical protein DICSQDRAFT_93527 [Dichomitus squalens LYAD-421
SS1]
Length = 1433
Score = 197 bits (502), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 168/648 (25%), Positives = 292/648 (45%), Gaps = 92/648 (14%)
Query: 9 PSAMDETIVQELLTVSLGLHGNRPLLLVRTQH-ELLIYQAFRHPKGALKL---RFKKLKV 64
P E V +L+ LG RP L+V + +L IY+A A L R L V
Sbjct: 841 PRKPQELDVDQLVIAPLGESHPRPHLIVLLRSGQLAIYEAVAASPPADPLPPTRSLTLLV 900
Query: 65 LFVSDRSK-------RANEQPGLPRGVRISQMR--YFSNIA---GYQGVFLCGPHPAWLF 112
V +SK ++ L RIS++ + ++ A Y GVF G P+W+
Sbjct: 901 NLVKVKSKAFDIQHTEEEQKSVLAEQKRISRLLLPFVTSPAPGQTYSGVFFTGDRPSWIV 960
Query: 113 LTSRGELRAHPMTIDGPVSTLAPFHNV-----NCP----RG-FLYFNAKSELRISVLPTH 162
T +G +R P HNV C RG FL ++ + + +P
Sbjct: 961 STDKGGVRVFPSG-----------HNVVHAFTTCSLWESRGDFLLYSEEGPSLVEWMP-D 1008
Query: 163 LSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDS 222
+ DA P R VP + P+ H+ + A + + ED +V +P
Sbjct: 1009 IILDAHLPARSVP-RSRPY---SHVVFDASSSLIVAASSFMNRFASYDEDGNIVWEPDSP 1064
Query: 223 RFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALG 282
P + L SP W + F +E+ V C+ +V +E T SG++ +IA+G
Sbjct: 1065 NISFPHCETSTLELISPDGWITMDGYEFAANEF--VSCVVSVPLETVSTESGMKDFIAVG 1122
Query: 283 TNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTK-NKIKMIYAKEQKGPVTAICHVAGFLV 341
T N ED+ +G + +F+I+EVVP+ + + ++K++ + KGPV+ +C + G+LV
Sbjct: 1123 TTINRGEDLAVKGAVYIFEIVEVVPDASLNIKRWWRLKLLCRDDAKGPVSFLCGMNGYLV 1182
Query: 342 TAVGQKIYIWQLK-DNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRT 400
+++GQKI++ D L G+AF+D VY+ S+ +VKNL+++GD +S+ + +Q +
Sbjct: 1183 SSMGQKIFVRAFDLDERLVGVAFLDVGVYVTSLRAVKNLLVIGDAVKSVWFVAFQEDPYK 1242
Query: 401 LSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILD 460
L ++ +D P + D+
Sbjct: 1243 LVILGKD---------------PHHCCV-------------------------TRADLFF 1262
Query: 461 EFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSIS 520
+ + D++ V L+ Y P ES GG L+++T+FH + + +P +
Sbjct: 1263 ADGHLSIVTCDEEGVVRLYAYDPHDPESKGGQHLLRRTEFHGQTEYRSSLLVARRPKA-G 1321
Query: 521 DAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYK 580
D ++R + S+DG+L + E ++RL +LQ ++ H LNP+AFR +
Sbjct: 1322 DPEIPQARLIC--GSVDGSLTTLTYVDENAFKRLHLLQGQLIRTVQHVAALNPKAFRMVR 1379
Query: 581 GKGYYAGNP-SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDE 627
+ Y P S+G++DG+L+ F L +G + E+ ++IG+ +L +
Sbjct: 1380 NE--YVSRPLSKGVLDGNLLATFEDLPIGRQNEVTRQIGTDRATVLKD 1425
>gi|67521912|ref|XP_659017.1| hypothetical protein AN1413.2 [Aspergillus nidulans FGSC A4]
gi|74598221|sp|Q5BDG7.1|CFT1_EMENI RecName: Full=Protein cft1; AltName: Full=Cleavage factor two protein
1
gi|40745387|gb|EAA64543.1| hypothetical protein AN1413.2 [Aspergillus nidulans FGSC A4]
gi|259486722|tpe|CBF84808.1| TPA: Protein cft1 (Cleavage factor two protein 1)
[Source:UniProtKB/Swiss-Prot;Acc:Q5BDG7] [Aspergillus
nidulans FGSC A4]
Length = 1339
Score = 197 bits (501), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 172/640 (26%), Positives = 295/640 (46%), Gaps = 87/640 (13%)
Query: 17 VQELLTVSLG-LHGNRPLLLVRTQHE-LLIYQAF-RHPKGALKLRFKKLKVLFVSDRSKR 73
V ++ V LG + + P L++RT+++ L++Y+ F + K LRF K +
Sbjct: 759 VLQIAVVELGDSYSSLPFLILRTENDDLVVYKPFFTNSKELTGLRFLKEANHTLPKTPNT 818
Query: 74 ANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGP-VST 132
+E + +R NIAG +F+ GP ++F S H + + G +
Sbjct: 819 TDELQS-----EMKPLRILPNIAGCSSIFMPGPSAGFIFRAST--TSPHFIRLRGGFIKG 871
Query: 133 LAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTY 192
L F + + +GF Y ++ L ++ LP PW +R VP+ L Y + TY
Sbjct: 872 LGCFDSPD--KGFAYLDSHG-LHLAKLPEGTQLGYPWIMRTVPIGQQIDKLTYVSASDTY 928
Query: 193 CIVTSTAEPSTDYYKFN-GEDKELVTDPRDSR--FIPPLVSQFHVSLFSPFSWEEIPQTN 249
+ T +F ED EL + R+ F+P V+Q + + SP +W I +
Sbjct: 929 VLGT------CQRCEFRLPEDDELHPEWRNEEISFLPE-VNQSSLKVVSPKTWSVI--DS 979
Query: 250 FPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEP 309
+PL EH++ +K +S+E R I +GT+ ED+ RG I +F++IEVVP+P
Sbjct: 980 YPLEPAEHIMVMKTMSLEVSENTHERRDMIVVGTSLARGEDIPSRGCIYVFEVIEVVPDP 1039
Query: 310 GQPLTKNKIKMIYAKEQKGPVTAICHVA--GFLVTAVGQKIYIWQLK-DNDLTGIAFIDT 366
QP T ++K+I + KG VTA+ + GFL+ A GQK + LK D L +AF+D
Sbjct: 1040 EQPETNRRLKLIGKEPVKGAVTALSEIGGQGFLIAAQGQKSMVRGLKEDGSLLPVAFMDM 1099
Query: 367 EVYIASMVSVK--NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPS 424
+ +++ + +K + + GD + + Y E +SL A+D
Sbjct: 1100 QCFVSVIKELKGTGMCIFGDAVKGLWFAGYSEEPYKMSLFAKDL---------------- 1143
Query: 425 RGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPE 484
+ LE+ D L + + + +++D D N+ + Y PE
Sbjct: 1144 ------------------DYLEVLAA------DFLPDGNKLFIVVADSDCNLYVLQYDPE 1179
Query: 485 ARESNGGHRLIKKTDFHLGQHVNTFFKI-RCKPSSISDAPGARSRFLTWYASL------- 536
S+ G +L+ ++ FH G +T + R SS G+ + A L
Sbjct: 1180 DPNSSNGDKLLNRSKFHTGNFASTVTLLPRTLVSSERAMSGSDKMDIDNTAPLHQVLVTS 1239
Query: 537 -DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIID 595
+G++G +PE++YRRL LQ+ + H GLNPRA+R + + RG++D
Sbjct: 1240 HNGSIGLVTCVPEESYRRLSALQSQLTNTLEHPCGLNPRAYRAVESD----ASAGRGMLD 1295
Query: 596 GSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALS 635
+L+ ++L +S + EI ++G+ +I D+EA+S
Sbjct: 1296 SNLLLQYLDMSKQRKAEIAGRVGATEWEI---RADLEAIS 1332
>gi|346971831|gb|EGY15283.1| cft-1 [Verticillium dahliae VdLs.17]
Length = 1445
Score = 197 bits (501), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 171/660 (25%), Positives = 289/660 (43%), Gaps = 88/660 (13%)
Query: 5 RSHSPSAMDETIVQEL-LTVSLGLHGNRPLLLVRTQHELLIYQAFR------HPKGALKL 57
R SP + E +V +L + S H L+L ++ IY+ FR A L
Sbjct: 832 RGTSPETLTEILVADLGDSTSASAH----LILRHANDDMTIYEPFRIGGQEEKEDLANSL 887
Query: 58 RFKKLKVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRG 117
FKK+ ++ A E + R+ +R NI GY VFL G P+++ +S+
Sbjct: 888 FFKKVSNSHLAKSPVEAAEDEAVQEN-RVIPLRACDNIGGYSTVFLPGASPSFILKSSKS 946
Query: 118 ELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSY-DAPWPVRKVPL 176
+ + G V+ ++ FH C RGF+Y ++K R++ P + + VRKVP+
Sbjct: 947 TPKVIGLQGLG-VNGMSSFHTEGCERGFIYADSKGCARVTQFPDAANVAELGVSVRKVPI 1005
Query: 177 KCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSL 236
+A+H + Y + +S EP ++ +D ++ +PP+ + L
Sbjct: 1006 DTAVSHVAWHPNMEVYAVASSKLEP----FELPKDDDYHKEWAKEECPMPPMKEHGSIKL 1061
Query: 237 FSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGR 296
+SP +W I + F L ++E +C+K + +E R A+GT ED+ RGR
Sbjct: 1062 YSPITWNVIDE--FELEQYEVAMCMKTLLLEVSEETKERRMLFAVGTAILRGEDLPVRGR 1119
Query: 297 ILLFDIIEVVPEPGQPLTKNKIKMIYAKEQ--KGPVTAICHVA--GFLVTAVGQKIYIWQ 352
IL+FD++ V+P+P +P T K+K+I AKE+ +G VT++C V G ++ A GQK +
Sbjct: 1120 ILVFDVVHVIPQPDRPETDRKLKLI-AKEEIPRGAVTSLCEVGTQGLMLVAQGQKCMVRG 1178
Query: 353 LK-DNDLTGIAFIDTEVYIASMVSVKNL--ILVGDYARSIALLRYQPEYRTLSLVARDYK 409
LK D L +AF+D Y+ ++ ++N L+ D + + Y E ++L +
Sbjct: 1179 LKEDGTLLPVAFLDMSTYVVAVHELRNTGYCLMADANMGVWFVGYSEEPYRMTLFGKS-- 1236
Query: 410 PTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMI 469
G +L+ D L + + +
Sbjct: 1237 --------------------------------GTQLKCLTA------DFLVAGNDLSIVA 1258
Query: 470 SDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQH----VNTFFKIRCKP-----SSIS 520
SD+D + + + PE S GH L+ + F + + + +P +
Sbjct: 1259 SDEDGVLHILQFDPEHPRSLQGHLLLNRASFSVAPNHAWATLVLPRTTTRPYLPQSEPAT 1318
Query: 521 DAPGARSRFLT-WYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTY 579
A G+++R T AS GA+ P+ E YRRL L + H G+NP+A R
Sbjct: 1319 GAAGSQNRTQTLLLASASGAIASLNPITEHAYRRLTSLTTSLANALPHAAGMNPKAHRLP 1378
Query: 580 KGKGYYAGNP-------SRGIIDGSLVWKFLQLSLGERLEICKKIG-SKHNDILDELYDI 631
G A P R I+DG+L+ ++ +L +R E K G + D+ EL D+
Sbjct: 1379 PQDG--AARPPAVDVSAGRTIVDGALLARWNELGARQRAEAAGKGGFASAADVRGELEDV 1436
>gi|170102106|ref|XP_001882269.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164642641|gb|EDR06896.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 1406
Score = 197 bits (501), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 163/639 (25%), Positives = 275/639 (43%), Gaps = 80/639 (12%)
Query: 9 PSAMDETIVQELLTVSLGLHGNRPLLLVRTQH-ELLIYQAF---RHPKGALKLRFKKLKV 64
P E V+++L +G RP L V + +L IY+ R + K+R +K+
Sbjct: 816 PRKPQEFDVEQILVAPIGESSPRPHLCVFLRSGQLTIYEVLPLGRTTEALPKVRPAHVKI 875
Query: 65 LFVSDRS-----KRANEQPGLPRGVRISQMRYF----------SNIAGYQGVFLCGPHPA 109
FV S +R E +G+ Q R + S + GVF G P
Sbjct: 876 KFVKISSMAFEIQRPEEGE---KGIIAEQKRIYRMFVPFVTSASPGVTFSGVFFTGDRPN 932
Query: 110 WLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPW 169
W+F T +G ++ +P + V+ P FL + E +S YD P
Sbjct: 933 WIFGTDKGGVQIYP-SGHAVVNAFTPCSLFESKGDFLMYT--EEASVSKWLPDFHYDGPL 989
Query: 170 PVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLV 229
P+R VP L + T +S Y +D + +P P+
Sbjct: 990 PLRSVPRGRAYSSLVFDPSTSLLVAASSLQAKFASY----DDDDNKIWEPETPNIGNPMC 1045
Query: 230 SQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSE 289
+ L SP W I F E++ + V++E GT G + +IA+GT + E
Sbjct: 1046 DTSTLELISPDMW--ITMDGFEFATNEYINDVACVTLETAGTEVGSKDFIAVGTTIDRGE 1103
Query: 290 DVTCRGRILLFDIIEVVPEPG-QPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKI 348
D+ RG +++I+EVVP+P P K+++ + KGPVTA+C G+LV+++GQKI
Sbjct: 1104 DLAARGATYIYEIVEVVPDPAISPKRWYKLRLRCRDDAKGPVTAVCGFHGYLVSSMGQKI 1163
Query: 349 YIWQL-KDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARD 407
++ D L G+AF+D VY+ S+ ++KNL+LVGD +S++ + +Q + L L+ +D
Sbjct: 1164 FVRAFDSDERLVGVAFMDVGVYVTSLRTLKNLLLVGDAVKSLSFIAFQEDPYKLVLLGKD 1223
Query: 408 YKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGF 467
+ + ++ DG L
Sbjct: 1224 TQHVCVTNADFF-------FTDGEL---------------------------------SL 1243
Query: 468 MISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARS 527
+ D++ + ++ Y P+ +S G L+ +T+FH T I + P A+
Sbjct: 1244 VTGDEEGIMRMYEYNPQDPDSKDGRYLLLRTEFHGQSEYRTSTTIARRLKDDPSIPQAK- 1302
Query: 528 RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAG 587
DG L P+ E ++RL +LQ + + H GLNP+AFR + +
Sbjct: 1303 ---LIIGGTDGCLSSLTPVEEHAFKRLQLLQGQLTRNIQHVAGLNPKAFRIVRND--FVS 1357
Query: 588 NP-SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDIL 625
P S+GI+DG+L+ + L + + E+ ++IG+ +L
Sbjct: 1358 KPLSKGILDGNLLAHYESLPIIRQNEMTRQIGTDRVTLL 1396
>gi|408396642|gb|EKJ75797.1| hypothetical protein FPSE_03977 [Fusarium pseudograminearum CS3096]
Length = 1427
Score = 197 bits (500), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 173/627 (27%), Positives = 283/627 (45%), Gaps = 82/627 (13%)
Query: 17 VQELLTVSLG-LHGNRPLLLVRTQ-HELLIYQAFRH--PKG----ALKLRFKKL-KVLFV 67
++E+L LG P L++R Q +L IY+ RH P G + L FKK V
Sbjct: 835 LREILVADLGDTISQSPYLILRNQTDDLTIYEPIRHVRPGGESNLSAALSFKKTSNVTLA 894
Query: 68 SDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTID 127
+ ++ +++ PR MR +NI GY VFL G P+++ +S+ R +
Sbjct: 895 TTPAQTEDDEVEQPR---FMPMRRCANINGYSTVFLPGSSPSFVLKSSKSIPRVIGLQGL 951
Query: 128 GPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSY-DAPWPVRKVPLKCTPHFLAYH 186
G + ++ FH C RGF+Y + K R++ P+ ++ + V+KVPL +AYH
Sbjct: 952 G-IRGMSSFHTEGCDRGFIYADDKGIARVTQFPSDTNFTELGISVKKVPLGSDVRGIAYH 1010
Query: 187 LETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIP 246
T Y T+EP ++ +D +++ PP + + + L SP +W I
Sbjct: 1011 QPTGAYIAGCMTSEP----FELPKDDDYHKEWAKETLSFPPTMPRGILKLISPITWTVI- 1065
Query: 247 QTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVV 306
+ L E + C+K + +E R +A+GT + ED+ RGR+ ++DI+ V+
Sbjct: 1066 -HDIELESCESIECMKTLHLEVSEDTKERRFLVAVGTAVSKGEDLPIRGRVHVYDIVTVI 1124
Query: 307 PEPGQPLTKNKIKMIYAKEQ--KGPVTAICHVA--GFLVTAVGQKIYIWQLK-DNDLTGI 361
PEPG+P T ++K I A+E +G VTAI + G ++ A GQK + LK D L +
Sbjct: 1125 PEPGKPETNRRLKAI-AREDIPRGGVTAISEIGTQGLMLVAQGQKCMVRGLKEDGSLLPV 1183
Query: 362 AFIDTEVYIASM--VSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYY 419
AF+D +++S +S L L+ D + + Y E T ++ + +
Sbjct: 1184 AFLDMSCHVSSARELSRTGLCLMADAFKGVWFAGYTEEPYTFKVLGKSHG---------- 1233
Query: 420 AGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLF 479
RL + D L + + + +D D ++ +
Sbjct: 1234 ------------------------RLPVVVA------DFLPDGDDLAIVAADVDGDLHIL 1263
Query: 480 MYQPEARESNGGHRLIKKTDFHLGQH--VNTFFKIRCKPSS---ISDAPGARSRFLTWYA 534
+ PE +S GH L+ +T F + + T R P S D P + A
Sbjct: 1264 EFNPEHPKSLQGHLLLHRTSFSVSPNPPSTTLLLPRTTPPSHPTPQDPP-----HVLLLA 1318
Query: 535 SLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGK---GYYAGNPSR 591
S G L +PLPE YRRLL + N ++ + GGLN +A R G G A R
Sbjct: 1319 SSSGHLSSLIPLPETAYRRLLSVTNQLLPALTPHGGLNAKAHRLPVGTRTVGVEAAG-GR 1377
Query: 592 GIIDGSLVWKFLQLSLGERLEICKKIG 618
I+DG+++ ++ +LS +R EI K G
Sbjct: 1378 AIVDGAVLARWAELSAAKRAEIAGKGG 1404
>gi|330919204|ref|XP_003298516.1| hypothetical protein PTT_09264 [Pyrenophora teres f. teres 0-1]
gi|311328242|gb|EFQ93393.1| hypothetical protein PTT_09264 [Pyrenophora teres f. teres 0-1]
Length = 1388
Score = 197 bits (500), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 174/644 (27%), Positives = 272/644 (42%), Gaps = 101/644 (15%)
Query: 10 SAMDETIVQELLTVSLG-LHGNRPLLLVRTQHE-LLIYQAFRHP-KGALKLRFKKLKVLF 66
SA TI E+L LG P L++RT + L+IY+AF P + A K L+ +
Sbjct: 788 SAAKATIT-EILAADLGDATAKSPHLIIRTSSDNLVIYKAFHAPSRSASDQWTKNLRWVK 846
Query: 67 VSDRS-KRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMT 125
+S + R E G S + +I GY VF G PA++ + R ++
Sbjct: 847 LSQQHVPRYIEDSGSEDSGFDSTLVALDDICGYSTVFQRGTTPAFILKEASSAPRVIGLS 906
Query: 126 IDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYD-APWPVRKVPLKCTPHFLA 184
PV +L FH +C RGF Y ++ LRI LP Y W R++P+ H L
Sbjct: 907 -GKPVKSLTSFHTSSCQRGFAYLDSTDTLRICQLPPQTHYGHLGWATRRMPMDSEVHTLT 965
Query: 185 YHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEE 244
YH Y + T AE Y+ + + P++ P + + + L SW
Sbjct: 966 YH-PPGLYIVGTGQAED----YQLDPTETYHYDLPKEDLTFKPSIERGVIKLLDEKSWTI 1020
Query: 245 IPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIE 304
I L E VL +K +++E + IA+GT+ + ED+ +G I +F++I
Sbjct: 1021 I--DTHVLDPQEVVLSIKTLNLEVSEITHQRKDLIAVGTSVVHGEDLATKGCIRIFEVIT 1078
Query: 305 VVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA--GFLVTAVGQKIYIWQLK-DNDLTGI 361
VVP+P +P T ++K+I E KG V+AI + GFL+ A GQK + LK D L +
Sbjct: 1079 VVPQPDRPETNRRLKLIVKDEVKGAVSAISELGTQGFLIMAQGQKCMVRGLKEDGTLLPV 1138
Query: 362 AFIDTEVYIASMVSV--KNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYY 419
AF+D + Y++ + ++ ++ +GD R + Y E +SL AR
Sbjct: 1139 AFMDMQCYVSDLKNLPGTGMLAMGDAYRGVWFTGYTEEPYKMSLFAR------------- 1185
Query: 420 AGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHN------DILDEFSSMGFMISDKD 473
SKHN D L + +++D D
Sbjct: 1186 ---------------------------------SKHNLETIAVDFLPFDQQLHLVVADAD 1212
Query: 474 KNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF------------------------ 509
N+ + + P+ + G RL+ K FH G +
Sbjct: 1213 MNLQILQFDPDNPKGEAGSRLLHKATFHTGHFPTSLHLIHSHLKLPSATDFAATNNNPAD 1272
Query: 510 -FKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHT 568
F + P++ +D P + + + G L PL E +YRRL L +
Sbjct: 1273 AFAMDTSPNTTTDTP-QQPFHQILHTTQSGTLALLTPLSEDSYRRLSNLSAYLANTLDSA 1331
Query: 569 GGLNPRAFRT--YKGKGYYAGNPSRGIIDGSLVWKFLQLSLGER 610
LNPRAFRT G+ AG +RG++DG+L+ ++ + LGER
Sbjct: 1332 CSLNPRAFRTGDVAEGGWDAGTGARGVLDGNLLLRWGE--LGER 1373
>gi|121925707|sp|Q0UUE2.1|CFT1_PHANO RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two protein
1
Length = 1375
Score = 196 bits (499), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 175/628 (27%), Positives = 274/628 (43%), Gaps = 79/628 (12%)
Query: 17 VQELLTVSLGLHGNR-PLLLVRTQHE-LLIYQAFRHPKGAL------KLRFKKLKVLFV- 67
+ E+L LG +R P L+VRT ++ L+IY+A P + LR+ KL V
Sbjct: 800 ITEILAADLGDATSRSPHLIVRTSNDDLVIYKAIHSPSRSSSDLWTHNLRWVKLSQQHVP 859
Query: 68 ----SDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHP 123
+ A ++PG S + NI GY V G PA++ S R
Sbjct: 860 RYMEDGAQEEAADEPGFE-----STLLALDNINGYSTVIQRGRSPAFILKESSSAPRVIG 914
Query: 124 MTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYD-APWPVRKVPLKCTPHF 182
++ PV +L FH +C RGF Y ++ LRIS LP Y W R++P+ H
Sbjct: 915 LS-GNPVKSLTRFHTSSCQRGFAYLDSTDTLRISQLPPSTHYGHLGWAARRMPMDAEVHA 973
Query: 183 LAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSW 242
LAYH + V T +P + Y + D P++ P V + + +W
Sbjct: 974 LAYH---PSGLYVIGTGQP--EEYTLDPNDTFHYELPKEETSFKPKVEHGIIKVMDEKTW 1028
Query: 243 EEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDI 302
I L E +LC+K +++E T + IA+GT ED+ +G I +F++
Sbjct: 1029 TVI--DTHVLDPQEVILCIKTLNLEVSETTHQRKDVIAVGTAIVLGEDLATKGNIRIFEV 1086
Query: 303 IEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA--GFLVTAVGQKIYIWQLK-DNDLT 359
I VVPEP P T ++K+I E KG V+AI + GFL+ A GQK + LK D L
Sbjct: 1087 ITVVPEPDHPETNKRLKLIVKDEVKGTVSAISDLGTQGFLIMAQGQKSMVRGLKEDGTLL 1146
Query: 360 GIAFIDTEVYIASMVSVKN--LILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKG 417
+AF+D + Y+ ++ ++ N ++L+GD + Y E + L R SK
Sbjct: 1147 PVAFMDMQCYVTTLKTLPNTGMLLMGDAYKGAWFTGYTEEPYKMMLFGR--------SKH 1198
Query: 418 YYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVV 477
+ + FL E+L I +++D D N+
Sbjct: 1199 HLE----------CITADFLPFE--EQLHI--------------------IVADADMNLQ 1226
Query: 478 LFMYQPEARESNGGHRLIKKTDFHLGQHVNT--FFKIRCKPSSISDAPGARSRFLTWY-- 533
+ + P+ +S GG RL++K+ FH G +T + R + S+ + + L +
Sbjct: 1227 VLQFDPDHPKSMGGTRLLQKSTFHTGHFPSTMHLLQSRLHMPTASEFTTSTTSSLPLHQI 1286
Query: 534 --ASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYK-GKGYYAGNPS 590
S G L PL E +YRRL L + GLN +AFR +G +
Sbjct: 1287 LCTSQSGTLALITPLSESSYRRLSGLATHLQQFLDSPCGLNGKAFRAADVMEGGWDAGTQ 1346
Query: 591 RGIIDGSLVWKFLQLSLGERLEICKKIG 618
R ++DG L+ ++ +L R E K+G
Sbjct: 1347 RAMLDGGLLMRWGELGEQRRREGLGKVG 1374
>gi|340515387|gb|EGR45642.1| predicted protein [Trichoderma reesei QM6a]
Length = 1441
Score = 196 bits (498), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 181/662 (27%), Positives = 299/662 (45%), Gaps = 99/662 (14%)
Query: 11 AMDETIVQELLTVSLG--LHGNRPLLLVRTQHELLIYQAFRHPKGALK------LRFKK- 61
A ET+ E++ LG +H + L+L + ++L IY+ R P L FKK
Sbjct: 832 ATRETLT-EIVVADLGDAVHASPYLILRHSTNDLTIYEPIRLPANETAHTLSDTLFFKKS 890
Query: 62 ----LKVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRG 117
L V D S + P R +R +N+ GY VFL GP PA++ +SR
Sbjct: 891 PNAVLAKSAVEDPSDDTAQPP------RYVPLRICANVGGYSSVFLPGPSPAFVIKSSRS 944
Query: 118 ELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSY-DAPWPVRKVPL 176
R + G V ++ FH C RGF+Y +++ R++ LP+ ++ + V+KVPL
Sbjct: 945 VPRVVGLQGHG-VRGMSTFHTEGCDRGFIYADSEGIARVTQLPSKTNFTELGISVKKVPL 1003
Query: 177 KCTPHFLAYHLETKTY---CIVTSTAE-PSTDYYKFNGEDKELVTDPRDSRFIPPLVSQF 232
+AYH T+TY C VT E P D Y KE R+S +PP +
Sbjct: 1004 GFDVRHVAYHHPTETYIAGCAVTENFELPKDDDYH-----KEWA---RESVPLPPTAVRG 1055
Query: 233 HVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVT 292
+ L +P +W I + + E + C+K + +E R +A+GT + ED+
Sbjct: 1056 ALKLINPITWTVIHSID--MEAGESIECMKTLHLEVSEETKERRMLLAVGTALSRGEDLP 1113
Query: 293 CRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQ--KGPVTAICHVA--GFLVTAVGQKI 348
RGR+ ++DI+ V+PEPG+P T ++K++ AKE +G VTA+ + G ++ A GQK
Sbjct: 1114 TRGRVQVYDIVTVIPEPGKPETNKRLKLL-AKEDIPRGGVTALSEIGTQGLMLVAQGQKC 1172
Query: 349 YIWQLK-DNDLTGIAFIDTEVYIASMVSV--KNLILVGDYARSIALLRYQPEYRTLSLVA 405
+ LK D L +AF+D +++S+ + L L+ D + + Y E T ++
Sbjct: 1173 MVRGLKEDGSLLPVAFLDMSCHVSSVRELPGTGLCLIADAFKGLWFAGYTEEPYTFKVLG 1232
Query: 406 RDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSM 465
+ GSL D L + +
Sbjct: 1233 KS---------------------SGSLPLLV-------------------ADFLPDGEDL 1252
Query: 466 GFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQH--VNTFFKIRCKPSSISDAP 523
+ D D ++ + + PE +S GH L+ +T F + + +T R P+S S
Sbjct: 1253 SMVAVDADGDMHVLEFNPEHPKSLQGHLLLHRTTFSVTPNPPTSTLLLPRTLPASQSSQD 1312
Query: 524 GARSRF----LTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTY 579
+ S + AS G++ PLPE YRRLL + N ++ GGL+ RA RT
Sbjct: 1313 SSSSSSTQPHILLLASPSGSIAALTPLPESAYRRLLSVTNQLLPALVPHGGLHARAHRTP 1372
Query: 580 KGKGYYA-------GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIE 632
+G G + R I+DG+++ ++ +L +R E+ + G ++ + + D+E
Sbjct: 1373 EGGGGMSRTVGVETAATGRAIVDGTVLTRWNELGAAKRAEVATRGG--YDGVTEMREDLE 1430
Query: 633 AL 634
A+
Sbjct: 1431 AV 1432
>gi|358387835|gb|EHK25429.1| hypothetical protein TRIVIDRAFT_32877 [Trichoderma virens Gv29-8]
Length = 1440
Score = 196 bits (498), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 180/662 (27%), Positives = 299/662 (45%), Gaps = 99/662 (14%)
Query: 11 AMDETIVQELLTVSLG--LHGNRPLLLVRTQHELLIYQAFRHPKGALK------LRFKK- 61
A ET+ E++ LG +H + L+L + +L IY+ R P + L FKK
Sbjct: 831 ATRETLT-EIVVADLGDSVHSSPYLILRHSTDDLTIYEPIRLPTASATHALSDTLFFKKS 889
Query: 62 ----LKVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRG 117
L V D S + P R +R +N+ GY VFL GP PA++ +S+
Sbjct: 890 ANSSLAKSAVEDPSDDTAQPP------RYVPLRTCANVGGYSAVFLPGPSPAFIIKSSKS 943
Query: 118 ELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSY-DAPWPVRKVPL 176
R + G V ++ FH C RGF+Y +++ R++ LP+ + + V+KVPL
Sbjct: 944 IPRVVGLQGLG-VRGMSTFHTEGCDRGFIYADSEGIARVTQLPSKTNLTELGVSVKKVPL 1002
Query: 177 KCTPHFLAYHLETKTY---CIVTSTAE-PSTDYYKFNGEDKELVTDPRDSRFIPPLVSQF 232
+AYH T+TY C +T E P D Y KE R+S P +++
Sbjct: 1003 GHDIRHVAYHHPTETYIAGCTITENFELPKDDDYH-----KEWA---RESLSFLPSMARG 1054
Query: 233 HVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVT 292
+ L +P +W I + + E + C+K + +E R +A+GT ED+
Sbjct: 1055 ALKLINPITWTVIHSID--MEPGESIECMKTLHLEVSEETKERRMLLAVGTALTRGEDLP 1112
Query: 293 CRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQ--KGPVTAICHVA--GFLVTAVGQKI 348
RGR+ ++DI+ V+PEPG+P T ++K++ AKE+ +G VTA+ + G ++ A GQK
Sbjct: 1113 TRGRVQVYDIVTVIPEPGKPETNKRLKLL-AKEEIPRGGVTALSEIGTQGLMLVAQGQKC 1171
Query: 349 YIWQLK-DNDLTGIAFIDTEVYIASMVSV--KNLILVGDYARSIALLRYQPEYRTLSLVA 405
+ LK D L +AF+D ++++ + L L+ D + + Y E T ++
Sbjct: 1172 MVRGLKEDGSLLPVAFLDMSCHVSTARELPGTGLCLIADAFKGLWFAGYTEEPYTFKVLG 1231
Query: 406 RDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSM 465
+ GSL D L + +
Sbjct: 1232 KS---------------------SGSLPLLV-------------------ADFLPDGEDL 1251
Query: 466 GFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQH--VNTFFKIRCKPSSIS--D 521
+ D D ++ + + PE +S GH L+ +T F + + +T R P+S S
Sbjct: 1252 SMVAVDADGDIHVLEFNPEHPKSLQGHLLLHRTTFSVTPNPPTSTLLLPRTLPASQSATT 1311
Query: 522 APGARSR--FLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTY 579
+P + S L AS G L PLPE YRRLL + N ++ GGL+ RA RT
Sbjct: 1312 SPDSSSSQPHLLLLASPSGCLASLTPLPESAYRRLLSVTNQLLPALVPHGGLHARAHRTP 1371
Query: 580 KGKGYYA-------GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIE 632
+G G + R I+DG+++ ++ +L +R E+ + G ++ +++ D+E
Sbjct: 1372 EGGGGMSRTVGVETAASGRAIVDGAILARWNELGAAKRAEVATRGG--YDGVMEMREDLE 1429
Query: 633 AL 634
A+
Sbjct: 1430 AV 1431
>gi|348679545|gb|EGZ19361.1| putative cleavage and polyadenylation specificity factor CPSF
[Phytophthora sojae]
Length = 1752
Score = 196 bits (497), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 166/649 (25%), Positives = 279/649 (42%), Gaps = 137/649 (21%)
Query: 80 LPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMT-----IDGPVSTLA 134
L G R + F N+ G F G HP W+ L RG+ PM + PV +
Sbjct: 1150 LRAGFRYPMLTTFYNVNNMSGAFFRGAHPMWI-LGDRGQPTFIPMCSAAPKVSVPVLSFT 1208
Query: 135 PFHNVNCPRGFLYFNAKSELRISVLP-----THLSYDAPWPVRKVPLKCTPHFLAY---- 185
PFH+ NCP GF+YF+++ LR+ LP T L + ++K T H + Y
Sbjct: 1209 PFHHWNCPNGFIYFHSRGALRVCELPSSKTSTILPSSGGFVLQKAEFGATLHHMLYLGNH 1268
Query: 186 -------HLETKTYCIVTSTAEPSTDYYKFNG-EDKELVTDPRD---------SRFIPPL 228
LE TY +V S TD + ED + +P + S + P
Sbjct: 1269 GPGGVSEALEAPTYAVVCSVKMKPTDAERATEVEDADEEKEPENLDANGNPVGSNVMAPT 1328
Query: 229 VSQFHVSLFSPFSWEE-------IPQTN----------FPLH--EWEHVLCLK-----NV 264
F + E + QTN F +H +E VL +K +
Sbjct: 1329 AEMFPDFEIDQMAHTEEEVYELRLVQTNEFGEWGRRGVFRVHFERYEVVLSVKLMYLYDS 1388
Query: 265 SMEYEGTLSGL-------RGYIALGTNY--NYSEDVTCRGRILLF--DIIEVVPEPGQPL 313
S+ E S R Y+ +GT + + ED + RGR+LL+ D + V E G
Sbjct: 1389 SLMKEEVASTSAEWNKKKRPYLVIGTGWVGPHGEDESGRGRLLLYELDYAQYVDEEGGST 1448
Query: 314 TKN--KIKMIYAKE-QKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYI 370
+ K+++++ KE ++G ++++ + +++ AVG K+ +++ K L G AF D +++I
Sbjct: 1449 SSKLPKLRLVFIKEHRQGAISSVVQLGPYVLAAVGSKLIVYEFKSEQLIGCAFYDAQMFI 1508
Query: 371 ASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDG 430
++ VK+ ++ GD +S+ LR++ R L L+A+DY+P
Sbjct: 1509 VTLNVVKDFVMYGDVYKSVHFLRWREMQRQLVLLAKDYEP-------------------- 1548
Query: 431 SLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNG 490
L + S+ E+ + + D D+N+ + + P+ ES G
Sbjct: 1549 -LAVSATEFSVFEK-------------------KLALLAVDMDENLHVMQFAPQDIESRG 1588
Query: 491 GHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGAR------------SRFLTWYASLDG 538
G RL++ +DFHLG V + F+ R D PG S ++ + +G
Sbjct: 1589 GQRLLRVSDFHLGVQVASMFRKRV------DGPGGHVAVNGRGPRAPPSYYVNVMGNSEG 1642
Query: 539 ALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGY-YAGNPS------- 590
+G +P+ E+ +RRL LQNVMV LNPR FR K G P
Sbjct: 1643 GVGALIPVGERVFRRLFTLQNVMVNTLPQNCALNPREFRMLKTNAQRRCGRPDAWSKKKW 1702
Query: 591 -RGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
+ +D ++++FLQL + E+ + IG+ ++ L +++ ++ F
Sbjct: 1703 KKSFLDAFVLFRFLQLDYVAQKELARCIGTTPEVVIHNLLEVQHATATF 1751
>gi|310789917|gb|EFQ25450.1| CPSF A subunit region [Glomerella graminicola M1.001]
Length = 1439
Score = 195 bits (496), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 176/651 (27%), Positives = 281/651 (43%), Gaps = 90/651 (13%)
Query: 14 ETIVQELLTVSLG-LHGNRPLLLVR-TQHELLIYQAFRHPKG------ALKLRFKK---- 61
+ + ELL LG P L++R +L IY+ R + L F+K
Sbjct: 840 QETLTELLVADLGDTTTTSPYLILRHANDDLTIYEPIRLESQDKTVGLSKTLHFQKITNP 899
Query: 62 -LKVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELR 120
L V ANEQP R +R NI GY VFL G P+++ +S+ +
Sbjct: 900 ALAKSPVEVADDEANEQP------RFVPLRPCPNINGYSTVFLPGASPSFIIKSSKSSPK 953
Query: 121 AHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSY-DAPWPVRKVPLKCT 179
+ G V ++ FH C RGF+Y +++ + R++ LP ++ + VRK+P+
Sbjct: 954 VIGLQGIG-VRGMSSFHTEGCERGFIYADSEGQTRVTQLPADTNFTELGVAVRKIPIGDN 1012
Query: 180 PHFLAYHLETKTYCIVTSTAE----PSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVS 235
+AYH +TY + S E P D Y + + P+ R I +
Sbjct: 1013 VGLIAYHPPMETYAVACSVLERFELPKDDDYHKEWAKEATTSYPQTERGI--------IK 1064
Query: 236 LFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRG 295
L SP +W I HE +C+K + +E R I +GT N ED+ RG
Sbjct: 1065 LMSPTTWSVIDTVELEPHEV--AMCMKTLHLEVSEETKERRMLITIGTAINRGEDLPIRG 1122
Query: 296 RILLFDIIEVVPEPGQPLTKNKIKMIYAKEQ--KGPVTAICHVA--GFLVTAVGQKIYIW 351
RIL++D++ VVP+PG+P T K+K++ AKE+ +G VT +C V G ++ A GQK +
Sbjct: 1123 RILVYDVVPVVPQPGRPETNKKLKLV-AKEEIPRGAVTGLCEVGSQGLMLVAQGQKCMVR 1181
Query: 352 QLK-DNDLTGIAFIDTEVYIASMVSVKNL--ILVGDYARSIALLRYQPEYRTLSLVARDY 408
LK D L +AF+D Y+ ++ V+ L+ D + + + Y E
Sbjct: 1182 GLKEDGTLLPVAFMDMNCYVTAVREVRGTGYCLMTDAFKGVWFVGYAEE----------- 1230
Query: 409 KPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFM 468
P + ++ G KF L+ D + + +
Sbjct: 1231 --------------PYKMMLFGKSTGKFEVLTA---------------DFIIAGDELHIV 1261
Query: 469 ISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLG-QHVNTFFKIRCKPSSISDAPGARS 527
+ DKD + + + PE +S GH L+ + F H T + P+S + ++
Sbjct: 1262 VCDKDGVIHVMQFDPEHPKSLQGHLLLNRASFSAAPNHPTTTLSLPRTPASTATTSATKN 1321
Query: 528 RFLT-WYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYA 586
T AS GAL PL E+ YRRL L N + H NP+A R
Sbjct: 1322 PPTTLLLASPTGALASLTPLSEQAYRRLTSLANSIAGALPHAAATNPKAHRLQPLDARTP 1381
Query: 587 G---NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEAL 634
G + R I+DG+L+ ++ +L G R E+ K G + D+L+ ++E +
Sbjct: 1382 GVDTSAGRSIVDGALLARWNELGAGRRSEVAGKGG--YGDVLEVRGELEGV 1430
>gi|46120520|ref|XP_385083.1| hypothetical protein FG04907.1 [Gibberella zeae PH-1]
Length = 1436
Score = 195 bits (496), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 173/630 (27%), Positives = 282/630 (44%), Gaps = 88/630 (13%)
Query: 17 VQELLTVSLG-LHGNRPLLLVRTQ-HELLIYQAFRH--PKG----ALKLRFKKLKVLFVS 68
++E+L LG P L++R Q +L IY+ H P G + L FKK+ + ++
Sbjct: 835 LREILVADLGDTISQSPYLILRNQTDDLTIYEPIHHVRPGGESNLSAALSFKKMSNVTLA 894
Query: 69 DRSKRAN----EQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPM 124
+ EQP R MR +NI GY VFL G P+++ +S+ R +
Sbjct: 895 TTPAQTEDDDVEQP------RFMPMRRCANINGYSTVFLPGSSPSFVLKSSKSIPRVIGL 948
Query: 125 TIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSY-DAPWPVRKVPLKCTPHFL 183
G + ++ FH C RGF+Y + K R++ P+ ++ + V+KVPL +
Sbjct: 949 QGLG-IRGMSSFHTEGCDRGFIYADDKGIARVTQFPSDTNFTELGISVKKVPLGSDVRGI 1007
Query: 184 AYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWE 243
AYH T Y T+EP ++ +D +++ PP + + + L SP +W
Sbjct: 1008 AYHQPTGAYIAGCMTSEP----FELPKDDDYHKEWAKETLSFPPTMPRGVLKLISPITWT 1063
Query: 244 EIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDII 303
I + L E + C+K + +E R +A+GT + ED+ RGR+ ++DI+
Sbjct: 1064 VI--HDIELESCESIECMKTLHLEVSEDTKERRFLVAVGTAVSKGEDLPIRGRVHVYDIV 1121
Query: 304 EVVPEPGQPLTKNKIKMIYAKEQ--KGPVTAICHVA--GFLVTAVGQKIYIWQLK-DNDL 358
V+PEPG+P T ++K I A+E +G VTAI + G ++ A GQK + LK D L
Sbjct: 1122 TVIPEPGKPETNRRLKAI-AREDIPRGGVTAISEIGTQGLMLVAQGQKCMVRGLKEDGSL 1180
Query: 359 TGIAFIDTEVYIASM--VSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSK 416
+AF+D +++S +S L L+ D + + Y E T ++ + +
Sbjct: 1181 LPVAFLDMSCHVSSARELSRTGLCLMADAFKGVWFAGYTEEPYTFKVLGKSHG------- 1233
Query: 417 GYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNV 476
RL + D L + + + +D D ++
Sbjct: 1234 ---------------------------RLPVVVA------DFLPDGDDLAIVAADVDGDL 1260
Query: 477 VLFMYQPEARESNGGHRLIKKTDFHLGQH--VNTFFKIRCKPSS---ISDAPGARSRFLT 531
+ + PE +S GH L+ +T F + + T R P S D P +
Sbjct: 1261 HILEFNPEHPKSLQGHLLLHRTSFSVSPNPPSTTLLLPRTTPPSHPTPQDPP-----HVL 1315
Query: 532 WYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGK---GYYAGN 588
AS G L +PLPE YRRLL + N ++ + GGLN +A R G G A
Sbjct: 1316 LLASSSGHLSSLIPLPETAYRRLLSVTNQLLPALTPHGGLNAKAHRLPVGTRTVGVEAAG 1375
Query: 589 PSRGIIDGSLVWKFLQLSLGERLEICKKIG 618
R I+DG+++ ++ +LS +R EI K G
Sbjct: 1376 -GRAIVDGAVLARWAELSAAKRAEIAGKGG 1404
>gi|448105510|ref|XP_004200513.1| Piso0_003103 [Millerozyma farinosa CBS 7064]
gi|448108635|ref|XP_004201144.1| Piso0_003103 [Millerozyma farinosa CBS 7064]
gi|359381935|emb|CCE80772.1| Piso0_003103 [Millerozyma farinosa CBS 7064]
gi|359382700|emb|CCE80007.1| Piso0_003103 [Millerozyma farinosa CBS 7064]
Length = 1344
Score = 195 bits (496), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 151/599 (25%), Positives = 271/599 (45%), Gaps = 63/599 (10%)
Query: 41 ELLIYQAFRHPKGALKLRFKKLKVLFVSDRSKRANEQPGLPRGVRISQ-MRYFSNIAGYQ 99
E++IY+ F ++ K LK+ D + P G + + + Y N+ GY
Sbjct: 800 EVIIYKLFFDGDNFKFIKEKDLKITGAPDNA--------YPLGTTLERRLVYVPNVNGYS 851
Query: 100 GVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVL 159
+F+ G P ++ T R T P + + + + N GF+Y + R+ +
Sbjct: 852 SIFVTGIIPYFITKTVHSVPRIFRFT-KLPAVSFSSYSDSNIKNGFIYLDNSKNARMCEI 910
Query: 160 PTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDP 219
P +Y+ WP++K+ + T +AYH + T+ + + P Y + E K +V
Sbjct: 911 PLDFNYENNWPIKKIQMPETVKAIAYHELSNTFVVSSYEEIP---YDCLDEEGKPIVGID 967
Query: 220 RDSRFIPPLVS-QFHVSLFSPFSWEEIPQTNFPLHEW-EHVLCLKNVSMEYEGTLSGLRG 277
+ PP S + ++ L SP++W I +E +VL + +
Sbjct: 968 KSK---PPAESYKGYLRLISPYNWSVIDTIVLADNEIGMNVLSMVLDVGSSTKKFKSKKE 1024
Query: 278 YIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA 337
I LG+ ED++ G +F+II+++PEPG+P T +K K ++ ++ +G VT+IC V+
Sbjct: 1025 LIVLGSGKYRIEDLSSNGSFKIFEIIDIIPEPGKPETNHKFKEVHIEDTRGAVTSICEVS 1084
Query: 338 GFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPE 397
G L+ GQKI I L+D+ + +AF+DT VY++ S NLIL+GD +S+ L + E
Sbjct: 1085 GRLLVTQGQKIIIRDLQDDGVVPVAFLDTAVYVSEAKSFGNLILLGDSLKSVWLAGFDAE 1144
Query: 398 YRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHND 457
+ L+++D + L +S C K +
Sbjct: 1145 PFRMILLSKDIQT--------------------------LDVS-------CADFIVKDEE 1171
Query: 458 ILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPS 517
I + +D + + + + PE S+ G RL+ KT F++ F R P
Sbjct: 1172 IF-------ILFADNNNVLHVVKFDPEDPLSSNGQRLVHKTSFNINSAATCF---RTIPK 1221
Query: 518 SISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFR 577
+ + P + F + +++DG+ P+ E YRR+ +LQ + H GLNPR R
Sbjct: 1222 NEENYPSLTTSFQSIGSTIDGSFFTVFPINESTYRRMYILQQQLTDKEFHICGLNPRLNR 1281
Query: 578 TYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHN--DILDELYDIEAL 634
+ S+ +++ ++ KF+ L+ + KIGSK++ DI +L + E++
Sbjct: 1282 FGGLNETNSDANSKPMLEYDVIKKFVNLNSDRKKNFASKIGSKNSYQDIWRDLIEFESV 1340
>gi|148886831|sp|Q7SEY2.2|CFT1_NEUCR RecName: Full=Protein cft-1; AltName: Full=Cleavage factor two
protein 1
Length = 1456
Score = 195 bits (496), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 175/630 (27%), Positives = 279/630 (44%), Gaps = 77/630 (12%)
Query: 17 VQELLTVSLG--LHGNRPLLLVRTQHELLIYQAFRHPKGALKLRFKKLKVLFV--SDRSK 72
V E+L LG H + L+L +L +YQ +R A + K L V S +K
Sbjct: 844 VAEILVADLGDTTHKSPYLILRHANDDLTLYQPYRLKATAGQPFSKSLFFQKVPNSTFAK 903
Query: 73 RANEQPGLP----RGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDG 128
E+P R MR SNI+GY VFL G P+++ T++ R + G
Sbjct: 904 APEEKPADDDEPHNAQRFLPMRRCSNISGYSTVFLPGSSPSFILKTAKSSPRVLSLQGSG 963
Query: 129 PVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSY-DAPWPVRKVPLKCTPHFLAYHL 187
V ++ FH C GF+Y + R++ +PT SY + V+K+P+ +AYH
Sbjct: 964 -VQAMSSFHTEGCEHGFIYADTNGIARVTQIPTDSSYAELGLSVKKIPIGVDTQSVAYHP 1022
Query: 188 ETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQ 247
T+ Y + + EP ++ +D R++ P+V + + L S +W I
Sbjct: 1023 PTQAYVVGCNDVEP----FELPKDDDYHKEWARENITFKPMVDRGVLKLLSGITWTVIDT 1078
Query: 248 TNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVP 307
+ E VLC++ +++E + + + IA+GT ED+ RGR+ +FDI +V+P
Sbjct: 1079 VE--MEPCETVLCVETLNLEVSESTNERKQLIAVGTALIKGEDLPTRGRVYVFDIADVIP 1136
Query: 308 EPGQPLTKNKIKMIYAKEQ--KGPVTAICHVA--GFLVTAVGQKIYIWQLK-DNDLTGIA 362
EPG+P T K+K++ AKE +G VTA+ V G ++ A GQK + LK D L +A
Sbjct: 1137 EPGKPETSKKLKLV-AKEDIPRGAVTALSEVGTQGLMLVAQGQKCMVRGLKEDGTLLPVA 1195
Query: 363 FIDTEVYIASMVSV--KNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYA 420
F+D Y+ S+ + L L+ D + + Y E + L +
Sbjct: 1196 FMDMNCYVTSVKELPGTGLCLMADAFKGVWFTGYTEEPYKMMLFGKS------------- 1242
Query: 421 GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFM 480
R+E+ + D L + + + SD D ++ +
Sbjct: 1243 ---------------------STRMEVL------NADFLPDGKELYIVASDADGHIHILQ 1275
Query: 481 YQPEARESNGGHRLIKKTDFHLGQHVNT----FFKIRCKPSSISDAPGARSRFLTWYASL 536
+ PE +S GH L+ +T F+ G H T + PSS+S S + AS
Sbjct: 1276 FDPEHPKSLQGHLLLHRTTFNTGAHHPTSSLLLPAVYPNPSSLSSNSEENSPHILLLASP 1335
Query: 537 DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFR--------TYKGKGYYAGN 588
G L PL E YRRL L + H GLNP+ +R + + G AG
Sbjct: 1336 TGVLATLRPLQENAYRRLSSLAVQLTNGLPHPAGLNPKGYRLPSPSASASMQLPGVDAGI 1395
Query: 589 PSRGIIDGSLVWKFLQLSLGERLEICKKIG 618
R I+DG ++ +FL+L G+R E+ + G
Sbjct: 1396 -GRNIVDGKILERFLELGTGKRQEMAGRAG 1424
>gi|164429683|ref|XP_964609.2| hypothetical protein NCU02082 [Neurospora crassa OR74A]
gi|157073577|gb|EAA35373.2| conserved hypothetical protein [Neurospora crassa OR74A]
Length = 1437
Score = 195 bits (495), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 175/630 (27%), Positives = 279/630 (44%), Gaps = 77/630 (12%)
Query: 17 VQELLTVSLG--LHGNRPLLLVRTQHELLIYQAFRHPKGALKLRFKKLKVLFV--SDRSK 72
V E+L LG H + L+L +L +YQ +R A + K L V S +K
Sbjct: 794 VAEILVADLGDTTHKSPYLILRHANDDLTLYQPYRLKATAGQPFSKSLFFQKVPNSTFAK 853
Query: 73 RANEQPGLP----RGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDG 128
E+P R MR SNI+GY VFL G P+++ T++ R + G
Sbjct: 854 APEEKPADDDEPHNAQRFLPMRRCSNISGYSTVFLPGSSPSFILKTAKSSPRVLSLQGSG 913
Query: 129 PVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSY-DAPWPVRKVPLKCTPHFLAYHL 187
V ++ FH C GF+Y + R++ +PT SY + V+K+P+ +AYH
Sbjct: 914 -VQAMSSFHTEGCEHGFIYADTNGIARVTQIPTDSSYAELGLSVKKIPIGVDTQSVAYHP 972
Query: 188 ETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQ 247
T+ Y + + EP ++ +D R++ P+V + + L S +W I
Sbjct: 973 PTQAYVVGCNDVEP----FELPKDDDYHKEWARENITFKPMVDRGVLKLLSGITWTVI-- 1026
Query: 248 TNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVP 307
+ E VLC++ +++E + + + IA+GT ED+ RGR+ +FDI +V+P
Sbjct: 1027 DTVEMEPCETVLCVETLNLEVSESTNERKQLIAVGTALIKGEDLPTRGRVYVFDIADVIP 1086
Query: 308 EPGQPLTKNKIKMIYAKEQ--KGPVTAICHVA--GFLVTAVGQKIYIWQLK-DNDLTGIA 362
EPG+P T K+K++ AKE +G VTA+ V G ++ A GQK + LK D L +A
Sbjct: 1087 EPGKPETSKKLKLV-AKEDIPRGAVTALSEVGTQGLMLVAQGQKCMVRGLKEDGTLLPVA 1145
Query: 363 FIDTEVYIASMVSV--KNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYA 420
F+D Y+ S+ + L L+ D + + Y E + L +
Sbjct: 1146 FMDMNCYVTSVKELPGTGLCLMADAFKGVWFTGYTEEPYKMMLFGKS------------- 1192
Query: 421 GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFM 480
R+E+ + D L + + + SD D ++ +
Sbjct: 1193 ---------------------STRMEVL------NADFLPDGKELYIVASDADGHIHILQ 1225
Query: 481 YQPEARESNGGHRLIKKTDFHLGQHVNT----FFKIRCKPSSISDAPGARSRFLTWYASL 536
+ PE +S GH L+ +T F+ G H T + PSS+S S + AS
Sbjct: 1226 FDPEHPKSLQGHLLLHRTTFNTGAHHPTSSLLLPAVYPNPSSLSSNSEENSPHILLLASP 1285
Query: 537 DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFR--------TYKGKGYYAGN 588
G L PL E YRRL L + H GLNP+ +R + + G AG
Sbjct: 1286 TGVLATLRPLQENAYRRLSSLAVQLTNGLPHPAGLNPKGYRLPSPSASASMQLPGVDAGI 1345
Query: 589 PSRGIIDGSLVWKFLQLSLGERLEICKKIG 618
R I+DG ++ +FL+L G+R E+ + G
Sbjct: 1346 -GRNIVDGKILERFLELGTGKRQEMAGRAG 1374
>gi|406865186|gb|EKD18229.1| CPSF A subunit region [Marssonina brunnea f. sp. 'multigermtubi'
MB_m1]
Length = 1443
Score = 195 bits (495), Expect = 8e-47, Method: Compositional matrix adjust.
Identities = 172/653 (26%), Positives = 290/653 (44%), Gaps = 89/653 (13%)
Query: 14 ETIVQELLTVSLGLHGNR-PLLLVR-TQHELLIYQAFR---HPKGALKLRFKKLKVL--- 65
ETI EL+ LG R P L++R + +L IY+ F G L + LK+
Sbjct: 841 ETIT-ELVVADLGDETARSPYLILRPSTDDLTIYEPFHTSSESSGGLASTLQFLKIHNPH 899
Query: 66 FVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMT 125
+ A E + R MR SN+ GY VFL G P+++ +++ + +
Sbjct: 900 LARNPDVSAAETADGIQETRDEPMRVISNLGGYCTVFLPGGSPSFIMKSAKSTPKVISLQ 959
Query: 126 IDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSY-DAPWPVRKVPLKCTPHFLA 184
G V ++ FH C RGF+Y + R+S LP ++ + ++K+ L H +A
Sbjct: 960 GLG-VRGMSSFHTEGCDRGFIYTDVDGLARVSQLPKDTTFAELGVSLQKIELGQEIHGVA 1018
Query: 185 YHLETKTYCIVTST-AE---PSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPF 240
YH T+ Y TST AE P D KE +T P + Q + L +P
Sbjct: 1019 YHPPTECYVAATSTEAEFELPKEDDNHHPQWAKEQIT-------FKPTMEQGRLRLINPV 1071
Query: 241 SWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLF 300
+W + + L +E ++C+K + +E + + IA+GT + ED+ +GRI ++
Sbjct: 1072 NWTVVDEVE--LDPFEVIMCIKTLILETSEITNERKQLIAVGTGISKGEDLAIKGRIHVY 1129
Query: 301 DIIEVVPEPGQPLTKNKIKMIYAKE-QKGPVTAICHVA--GFLVTAVGQKIYIWQLK-DN 356
D+I VVPEP +P T ++K+I ++ +G +T I + GF++ + GQK + LK D
Sbjct: 1130 DVINVVPEPDRPETNKRLKLIATEDIARGAITCISEIGTQGFMIVSQGQKCMVRGLKEDG 1189
Query: 357 DLTGIAFIDTEVYIASMVSVK--NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPN 414
L +AF+D YI S+ +K + + D + + + Y E + L + K
Sbjct: 1190 TLLPVAFMDMNCYITSIKELKGTGICVFSDAVKGVWVAGYTEEPYKMMLFGKSAK----- 1244
Query: 415 SKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDK 474
+EI + D+L + + + +D D
Sbjct: 1245 -----------------------------NMEIMQA------DLLPDGKELYIVAADSDC 1269
Query: 475 NVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLT--- 531
N+ + + PE +S GH L+ ++ F LG H+ T + + S + P + T
Sbjct: 1270 NLHIMQFDPEHPKSLQGHLLLHRSTFALGGHLPTSMTLLPRTKSATLLPPSPDAMDTAAD 1329
Query: 532 --------WYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKG-- 581
S G + PL E YRRL L + ++ H GLNPRA+R K
Sbjct: 1330 ATIPEHEILITSSTGCISLLTPLSEAQYRRLSTLTSHLINTLYHACGLNPRAYRVDKDAP 1389
Query: 582 KGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEAL 634
+G SR +IDG+++ ++++L R E+ ++G D+L+ D+ +L
Sbjct: 1390 EGMVG---SRTVIDGNILMRWMELGSQRRAEVAGRVGV---DVLEVREDLASL 1436
>gi|336463425|gb|EGO51665.1| hypothetical protein NEUTE1DRAFT_89273 [Neurospora tetrasperma FGSC
2508]
Length = 1437
Score = 194 bits (494), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 176/630 (27%), Positives = 280/630 (44%), Gaps = 77/630 (12%)
Query: 17 VQELLTVSLG--LHGNRPLLLVRTQHELLIYQAFRHPKGALKLRFKKLKVLFV--SDRSK 72
V E+L LG H + L+L +L +YQ +R A + K L V S +K
Sbjct: 794 VAEILVADLGDTTHKSPYLILRHANDDLTLYQPYRLKATAGQPFSKSLFFQKVPNSTFAK 853
Query: 73 RANEQP---GLPRGV-RISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDG 128
E+P P R MR SNI+GY VFL G P+++ T++ R + G
Sbjct: 854 APEEKPVDDDEPHNAQRFLPMRRCSNISGYSTVFLPGSSPSFILKTAKSSPRVLSLQGSG 913
Query: 129 PVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSY-DAPWPVRKVPLKCTPHFLAYHL 187
V ++ FH C GF+Y + R++ +PT SY + V+K+P+ +AYH
Sbjct: 914 -VQAMSSFHTEGCEHGFIYADTNGIARVTQIPTDSSYAELGLSVKKIPVGVDTQSVAYHP 972
Query: 188 ETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQ 247
T+ Y + + EP ++ +D R++ P+V + + L S +W I
Sbjct: 973 PTQAYVVGCNDVEP----FELPKDDDYHKEWARENITFKPMVDRGVLKLLSGITWTVI-- 1026
Query: 248 TNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVP 307
+ E VLC++ +++E + + + IA+GT ED+ RGR+ +FDI +V+P
Sbjct: 1027 DTVEMEPCETVLCVETLNLEVSESTNERKQLIAVGTALIKGEDLPTRGRVYVFDIADVIP 1086
Query: 308 EPGQPLTKNKIKMIYAKEQ--KGPVTAICHVA--GFLVTAVGQKIYIWQLK-DNDLTGIA 362
EPG+P T K+K++ AKE +G VTA+ V G ++ A GQK + LK D L +A
Sbjct: 1087 EPGKPETSKKLKLV-AKEDIPRGAVTALSEVGTQGLMLVAQGQKCMVRGLKEDGTLLPVA 1145
Query: 363 FIDTEVYIASMVSV--KNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYA 420
F+D Y+ S+ + L L+ D + + Y E + L +
Sbjct: 1146 FMDMNCYVTSVKELPGTGLCLMADAFKGVWFTGYTEEPYKMMLFGKS------------- 1192
Query: 421 GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFM 480
R+E+ + D L + + + SD D ++ +
Sbjct: 1193 ---------------------STRMEVL------NADFLPDGKELYIVASDADGHIHILQ 1225
Query: 481 YQPEARESNGGHRLIKKTDFHLGQHVNT----FFKIRCKPSSISDAPGARSRFLTWYASL 536
+ PE +S GH L+ +T F+ G H T + PSS+S S + AS
Sbjct: 1226 FDPEHPKSLQGHLLLHRTTFNTGAHHPTSSLLLPAVYPNPSSLSSNSEENSPHILLLASP 1285
Query: 537 DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFR--------TYKGKGYYAGN 588
G L PL E YRRL L + H GLNP+ +R + + G AG
Sbjct: 1286 TGVLATLRPLQENAYRRLSSLAVQLTNGLPHPAGLNPKGYRLPSPSASASMQLPGVDAGI 1345
Query: 589 PSRGIIDGSLVWKFLQLSLGERLEICKKIG 618
R I+DG ++ +FL+L G+R E+ + G
Sbjct: 1346 -GRNIVDGKILERFLELGTGKRQEMAGRAG 1374
>gi|350297359|gb|EGZ78336.1| protein cft-1 [Neurospora tetrasperma FGSC 2509]
Length = 1437
Score = 194 bits (493), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 176/630 (27%), Positives = 280/630 (44%), Gaps = 77/630 (12%)
Query: 17 VQELLTVSLG--LHGNRPLLLVRTQHELLIYQAFRHPKGALKLRFKKLKVLFV--SDRSK 72
V E+L LG H + L+L +L +YQ +R A + K L V S +K
Sbjct: 794 VAEILVADLGDTTHKSPYLILRHANDDLTLYQPYRLKATAGQPFSKSLFFQKVPNSTFAK 853
Query: 73 RANEQP---GLPRGV-RISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDG 128
E+P P R MR SNI+GY VFL G P+++ T++ R + G
Sbjct: 854 APEEKPVDDDEPHNAQRFLPMRRCSNISGYSTVFLPGSSPSFILKTAKSSPRVLSLQGSG 913
Query: 129 PVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSY-DAPWPVRKVPLKCTPHFLAYHL 187
V ++ FH C GF+Y + R++ +PT SY + V+K+P+ +AYH
Sbjct: 914 -VQAMSSFHTEGCEHGFIYADTNGIARVTQIPTDSSYAELGLSVKKIPVGVDTQSVAYHP 972
Query: 188 ETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQ 247
T+ Y + + EP ++ +D R++ P+V + + L S +W I
Sbjct: 973 PTQAYVVGCNDVEP----FELPKDDDYHKEWARENITFKPMVDRGVLKLLSGITWTVI-- 1026
Query: 248 TNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVP 307
+ E VLC++ +++E + + + IA+GT ED+ RGR+ +FDI +V+P
Sbjct: 1027 DTVEMEPCETVLCVETLNLEVSESTNERKQLIAVGTALIKGEDLPTRGRVYVFDIADVIP 1086
Query: 308 EPGQPLTKNKIKMIYAKEQ--KGPVTAICHVA--GFLVTAVGQKIYIWQLK-DNDLTGIA 362
EPG+P T K+K++ AKE +G VTA+ V G ++ A GQK + LK D L +A
Sbjct: 1087 EPGKPETSKKLKLV-AKEDIPRGAVTALSEVGTQGLMLVAQGQKCMVRGLKEDGTLLPVA 1145
Query: 363 FIDTEVYIASMVSV--KNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYA 420
F+D Y+ S+ + L L+ D + + Y E + L +
Sbjct: 1146 FMDMNCYVTSVKELPGTGLCLMADAFKGVWFTGYTEEPYKMMLFGKS------------- 1192
Query: 421 GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFM 480
R+E+ + D L + + + SD D ++ +
Sbjct: 1193 ---------------------STRMEVL------NADFLPDGKELYIVASDADGHIHILQ 1225
Query: 481 YQPEARESNGGHRLIKKTDFHLGQHVNT----FFKIRCKPSSISDAPGARSRFLTWYASL 536
+ PE +S GH L+ +T F+ G H T + PSS+S S + AS
Sbjct: 1226 FDPEHPKSLQGHLLLHRTTFNTGAHHPTSSLLLPAVYPNPSSLSSNSEENSPHILLLASP 1285
Query: 537 DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFR--------TYKGKGYYAGN 588
G L PL E YRRL L + H GLNP+ +R + + G AG
Sbjct: 1286 TGVLATLRPLQENAYRRLSSLAVQLTNGLPHPAGLNPKGYRLPSPSASASMQLPGVDAGI 1345
Query: 589 PSRGIIDGSLVWKFLQLSLGERLEICKKIG 618
R I+DG ++ +FL+L G+R E+ + G
Sbjct: 1346 -GRNIVDGKILERFLELGTGKRQEMAGRAG 1374
>gi|392558419|gb|EIW51607.1| hypothetical protein TRAVEDRAFT_176174 [Trametes versicolor FP-101664
SS1]
Length = 1431
Score = 194 bits (493), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 160/641 (24%), Positives = 295/641 (46%), Gaps = 78/641 (12%)
Query: 9 PSAMDETIVQELLTVSLGLHGNRPLLLVRTQH-ELLIYQAF---RHPKGALKLRFKKLKV 64
P E + +++ LG RP L+V + +L +Y+A P+ R L V
Sbjct: 839 PRKPQELDIDQIVIAPLGESRPRPHLIVLLRSGQLAVYEAVAIPPPPEPLPSTRSSTLLV 898
Query: 65 LFVSDRSK-------RANEQPGLPRGVRISQMR--YFSNIA---GYQGVFLCGPHPAWLF 112
FV SK ++ L RIS++ + ++ A + GVF G P+W+
Sbjct: 899 KFVKVASKAFDIQHPEEEQKSVLAEQKRISRLLVPFVTSPAPGQTFSGVFFTGDRPSWIL 958
Query: 113 LTSRGELRAHPM--TIDGPVSTLAPFHNVNCPRG-FLYFNAKSELRISVLPTHLSYDAPW 169
T +G ++ P ++ +T + + + RG FL ++ + + +P + D
Sbjct: 959 STDKGGVKVFPSGHSVVQAFTTSSLWES----RGDFLLYSEEGPSLVEWMP-DVQLDGHL 1013
Query: 170 PVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLV 229
P R VP + P+ + + + S+ + + + ED +V +P PL
Sbjct: 1014 PARSVP-RSRPYSNVVFDASTSLIVAASSFQ---NRFASYDEDGNVVWEPDSPNISSPLC 1069
Query: 230 SQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSE 289
+ L SP W I + E V C+ ++ +E T SG++ +IA+GT N E
Sbjct: 1070 ECSTLELISPDGW--ITMDGYEFAPNEFVNCIVSIPLETMSTESGMKDFIAVGTTINRGE 1127
Query: 290 DVTCRGRILLFDIIEVVPEPGQPLTK-NKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKI 348
D+ +G + +F+I+EVVP+P + + ++K++ + KGPV+ +C + G+LV+++GQKI
Sbjct: 1128 DLAVKGAVYIFEIVEVVPDPSTHVKRWWRLKLLCRDDAKGPVSFLCGINGYLVSSMGQKI 1187
Query: 349 YIWQLK-DNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARD 407
++ D L G+AF+D VY+ S+ +VKNL+++GD +S+ + +Q + L ++ +D
Sbjct: 1188 FVRAFDLDERLVGVAFLDVGVYVTSLRAVKNLLVIGDAVKSVWFVAFQEDPYKLVVLGKD 1247
Query: 408 YKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGF 467
QL R ++ G +
Sbjct: 1248 P-----------------------------QLCCITRADLFFADG-----------QLSI 1267
Query: 468 MISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARS 527
+ D++ V L+ Y P ES G L+++T+FH + + +P + D ++
Sbjct: 1268 VTCDEEGIVRLYAYDPHDPESKSGQHLLRRTEFHGQSEYRSSMLVARRPKN-GDPEIPQA 1326
Query: 528 RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAG 587
R + S+DG+L + E +RL +LQ ++ H LNP+AFR + + Y
Sbjct: 1327 RLVC--GSVDGSLSTLTYVDEAASKRLHLLQGQLIRTVQHVAALNPKAFRMVRNE--YVS 1382
Query: 588 NP-SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDE 627
P S+GI+DG+L+ F L + + E+ ++IG+ +L +
Sbjct: 1383 RPLSKGILDGNLLATFEDLPIARQNEVTRQIGTDRATVLKD 1423
>gi|392585051|gb|EIW74392.1| hypothetical protein CONPUDRAFT_133073 [Coniophora puteana RWD-64-598
SS2]
Length = 1490
Score = 194 bits (493), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 146/537 (27%), Positives = 253/537 (47%), Gaps = 69/537 (12%)
Query: 100 GVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNC-----PRG-FLYFNAKSE 153
G FL G P W+ T G +R +P S A H + RG FL ++ +
Sbjct: 1006 GAFLTGDKPHWIIRTDAGGVRLYP-------SGHALVHAFSACSLWESRGDFLVYSDEGP 1058
Query: 154 LRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDK 213
+ P L P P R VP T + Y + +V ++A+ + ++ ED
Sbjct: 1059 TLLEWAP-DLEVHGPLPSRSVPKGRTYGKVVYEHGSG---LVIASADGWASFASYD-EDG 1113
Query: 214 ELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLS 273
+V +P P + L SP W + F +E+ V ++ V++E T +
Sbjct: 1114 AIVWEPDAPGVAFPKADCSTLELISPELWITLDGYEFAPNEF--VNAVEVVTLETLSTET 1171
Query: 274 GLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTK-NKIKMIYAKEQKGPVTA 332
G + ++A+GT N ED+ RG +F+++EVVP+P L + K+KM + KGPVTA
Sbjct: 1172 GSKEFVAVGTTINRGEDLAVRGATYIFEVVEVVPDPSSKLDRWYKLKMRVRDDAKGPVTA 1231
Query: 333 ICHVAGFLVTAVGQKIYIWQLK-DNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIAL 391
+C + G+LV+++GQKI+I D L G+AF+D VY+ S+ ++KNL+L+GD +S+ L
Sbjct: 1232 LCGINGYLVSSMGQKIFIRAFDLDERLVGVAFLDAGVYVTSLKALKNLLLIGDAVKSVWL 1291
Query: 392 LRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI 451
+ +Q + L ++++D + S ++ N GE
Sbjct: 1292 VAFQEDPYKLVILSKDIRRQYAASVDFFFAN-------------------GE-------- 1324
Query: 452 GSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFK 511
+ + D++ + + Y P ES G +L+ T+FH + +T
Sbjct: 1325 -------------LSIVTEDEEGVLRAYEYDPNDPESRSGQQLLCHTEFHGHKECSTTLT 1371
Query: 512 IRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGL 571
I + + + P +++ ++ + DG+L P+ E ++RL +LQ + + H GL
Sbjct: 1372 IARRTKTEHEIP--QAKLISGFG--DGSLSALTPVDEAAFKRLQLLQGQLTRNVQHIAGL 1427
Query: 572 NPRAFRTYKGKGYYAGNP-SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDE 627
NPRAFR + + P S+GI+DG L+ F + + E+ ++IG++ IL E
Sbjct: 1428 NPRAFRIVRNE--TVSKPLSKGILDGQLLSSFEAQGITRQGEMTRQIGTERTTILQE 1482
>gi|301093545|ref|XP_002997618.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262110008|gb|EEY68060.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 1744
Score = 194 bits (492), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 168/643 (26%), Positives = 282/643 (43%), Gaps = 125/643 (19%)
Query: 80 LPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMT-----IDGPVSTLA 134
L G R + F N+ G F G HP W+ L RG PM + PV +
Sbjct: 1142 LRAGFRYPMLTCFHNVNNMSGAFFRGAHPMWI-LGDRGHASFVPMCNAAPRVSVPVLSFT 1200
Query: 135 PFHNVNCPRGFLYFNAKSELRISVLP-----THLSYDAPWPVRKVPLKCTPHFLAY---- 185
FH+ NCP GF+YF+++ LR+ LP T L + ++K T H + Y
Sbjct: 1201 SFHHWNCPNGFIYFHSRGALRVCELPSSKTSTILPSSGGFVLQKAEFGATLHHMLYLGSH 1260
Query: 186 -------HLETKTYCIVTST------AEPSTDYYKFNGEDKELVTDPRD----SRFIPPL 228
LE TY +V S A+ +T+ E + DP S + P
Sbjct: 1261 GPGGVAEALEAPTYAVVCSARLKPADADRATEVEGAEEELEPENLDPNGNPLGSNVMAPT 1320
Query: 229 VSQF------HVSLFSPFSWE-EIPQTN----------FPLH--EWEHVLCLK-----NV 264
F H++ +E + QT+ F +H +E VL +K +
Sbjct: 1321 AEMFADYETDHMAHTEEDVYELRLVQTDEFGEWGRRGVFRVHFERYEVVLSVKLMYLYDS 1380
Query: 265 SMEYEGTLSGL-------RGYIALGTNY--NYSEDVTCRGRILLF--DIIEVVPEPGQPL 313
S+ E S R Y+ +GT + + ED + RGR+LL+ D + V E G
Sbjct: 1381 SLMKEEVASTSPEWNKKKRPYLVVGTGWVGPHGEDESGRGRLLLYELDYAQYVNEEGGAT 1440
Query: 314 TKN--KIKMIYAKE-QKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYI 370
+ K+++++ KE ++G ++ + + +++ AVG K+ +++ K L G AF D ++YI
Sbjct: 1441 SGKLPKLRLVFIKEHRQGAISMVSQLGPYVLAAVGSKLIVYEFKSEQLIGCAFYDAQMYI 1500
Query: 371 ASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDG 430
++ VK+ ++ GD +S+ LR++ R L L+A+DY+P
Sbjct: 1501 VTLSVVKDFVMYGDVYKSVHFLRWREMQRQLVLLAKDYEP-------------------- 1540
Query: 431 SLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNG 490
L + S+ E+ + + D D+N+ + + P+ ES G
Sbjct: 1541 -LAVSATEFSVFEK-------------------KLALLAVDMDENLHVMQFAPQDIESRG 1580
Query: 491 GHRLIKKTDFHLGQHVNTFFKIRCKPS-SISDAPGAR-----SRFLTWYASLDGALGFFL 544
G RL++ +DFHLG V++ F+ R S S+ A R S ++ + +G +G +
Sbjct: 1581 GQRLLRVSDFHLGVQVSSMFRKRVDASGSVVSATNGRNAAPLSNYVNVMGTSEGGVGALV 1640
Query: 545 PLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGY-YAGNPS--------RGIID 595
P+ E+ +RRL LQNVMV LNPR FR K G P + +D
Sbjct: 1641 PVGERVFRRLFTLQNVMVNTLPQNCALNPREFRMLKTNAQRRCGRPDAWSKKKWKKSFLD 1700
Query: 596 GSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
++++FLQL + E+ + IG+ ++ L +++ +S F
Sbjct: 1701 AFVLFRFLQLDYVAQKELARCIGTTPEVVMHNLLEVQHATSTF 1743
>gi|358372791|dbj|GAA89393.1| cleavage and polyadenylation specificity factor subunit A
[Aspergillus kawachii IFO 4308]
Length = 1372
Score = 194 bits (492), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 168/625 (26%), Positives = 283/625 (45%), Gaps = 96/625 (15%)
Query: 32 PLLLVRTQHE-LLIYQAFRHPKGAL----KLRFKKLKVLFVSDRSKRANEQPGLPRGVRI 86
P L++R++++ L+IY+ F P G L+F K + S + R+
Sbjct: 816 PYLILRSENDDLIIYKPFVIPTGPTGEIHTLKFSKENNSVLPMISPDVDSTQPSGSDYRV 875
Query: 87 SQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFL 146
+R +I+G VF+ G ++ TS H + + G PR
Sbjct: 876 RPLRILPDISGLSAVFMPGASAGFVLRTSASA--PHFLRLRG-----------ESPRC-- 920
Query: 147 YFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYY 206
S +R LP +D W ++KV L LAY + Y + T A TD+
Sbjct: 921 -----STVRFCQLPPMTRFDYQWTLKKVHLGEQVDHLAYSTSSGMYVLGTCHA---TDFK 972
Query: 207 KFNGEDKELVTDPRDS--RFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNV 264
+D EL + R+ F P F + L SP +W I ++ L E+V+ +KN+
Sbjct: 973 L--PDDDELHPEWRNEAISFFPSARGSF-IKLVSPNTWSII--DSYSLGTDEYVMAIKNI 1027
Query: 265 SMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAK 324
S+E + I +GT + ED+ RG I +F++++VVP+P P T K+K+I +
Sbjct: 1028 SLEISENTHERKDLIVVGTAFARGEDIPSRGCIYVFEVVQVVPDPDDPETDRKLKLIGKE 1087
Query: 325 EQKGPVTAICHVA--GFLVTAVGQKIYIWQLK-DNDLTGIAFIDTEVYIASMVSVK--NL 379
KG VTA+ + GF++ A GQK + LK D L +AF+D + Y++ + +K +
Sbjct: 1088 SVKGAVTALSEIGGQGFVLVAQGQKCMVRGLKEDGSLLPVAFMDMQCYVSVVKELKGTGM 1147
Query: 380 ILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQL 439
++GD + I Y E +SL A+D +L++
Sbjct: 1148 CILGDAVKGIWFAGYSEEPYKMSLFAKDL--------------------------DYLEV 1181
Query: 440 SLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTD 499
S E L ++ + +++D D N+ + Y PE +S+ G +L+ ++
Sbjct: 1182 SAAEFLPDGRR--------------LFIVVADSDCNIHVLQYDPEDPKSSNGDKLLSRSK 1227
Query: 500 FHLGQHVNTFFKI-RCKPSS---IS-----DAPGARSRFLTWYASLDGALGFFLPLPEKN 550
FH G +T + R SS IS D + + +G+LG +PE++
Sbjct: 1228 FHTGNFASTLTLLPRTMVSSEKMISNSDDMDIDNQSALHQVLMTTQNGSLGLITCMPEES 1287
Query: 551 YRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGER 610
YRRL LQ+ + H GLNPRAFR + G RG++DG+L++K++ +S +
Sbjct: 1288 YRRLSALQSQLTNTLEHPCGLNPRAFRAVESD----GTAGRGMLDGNLLFKWIDMSKQRK 1343
Query: 611 LEICKKIGSKHNDILDELYDIEALS 635
EI ++G++ +I D+EA+S
Sbjct: 1344 TEIAGRVGAREWEI---KADLEAIS 1365
>gi|358390357|gb|EHK39763.1| hypothetical protein TRIATDRAFT_48211 [Trichoderma atroviride IMI
206040]
Length = 1441
Score = 193 bits (490), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 179/656 (27%), Positives = 293/656 (44%), Gaps = 98/656 (14%)
Query: 17 VQELLTVSLG--LHGNRPLLLVRTQHELLIYQAFRHPKGALK------LRFKK-----LK 63
+ EL+ LG +H + L+L + +L IY+ R P + L FKK L
Sbjct: 837 LTELVVADLGDTVHYSPYLILRHSTDDLTIYEPIRLPTDSPTRNLSDTLFFKKSANSILA 896
Query: 64 VLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHP 123
V D + +QP R +R +N+ GY VFL GP PA++ +S+ R
Sbjct: 897 KSTVEDPLEDTAQQP------RYVPLRICANVGGYSTVFLPGPSPAFILKSSKSVPRVVG 950
Query: 124 MTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSY-DAPWPVRKVPLKCTPHF 182
+ G V ++ F+ C RGF+Y +++ R++ LP+ ++ + V+KVPL
Sbjct: 951 VQGLG-VRGMSTFNTEGCDRGFIYSDSEGIARVTQLPSKTNFTELGVSVKKVPLGNDVRH 1009
Query: 183 LAYHLETKTY---CIVTSTAE-PSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFS 238
+AYH T+TY C VT E P D Y KE ++S P + + L S
Sbjct: 1010 VAYHHPTETYIAGCAVTEGFELPKDDDYH-----KEWA---KESLSFHPSTVRGSLKLIS 1061
Query: 239 PFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRIL 298
P +W I + + E + C+K + +E R +A+GT ED+ RGR+
Sbjct: 1062 PVTWTVIHSID--MEPGESIECMKTLHLEVSEETKERRMLLAVGTALTRGEDLPTRGRVQ 1119
Query: 299 LFDIIEVVPEPGQPLTKNKIKMIYAKEQ--KGPVTAICHVA--GFLVTAVGQKIYIWQLK 354
++DI+ V+PEPG+P T K+K++ AKE+ +G VTA+ + G ++ A GQK + LK
Sbjct: 1120 VYDIVTVIPEPGKPETNKKLKLL-AKEEIPRGGVTALSEIGTQGLMLMAQGQKCMVRGLK 1178
Query: 355 -DNDLTGIAFIDTEVYIASMVSV--KNLILVGDYARSIALLRYQPEYRTLSLVARDYKPT 411
D L +AF+D ++AS + L L+ D + + Y E T ++ +
Sbjct: 1179 EDGSLLPVAFLDMSCHVASARELPGTGLCLIADAFKGLWFAGYTEEPYTFKVLGKS---- 1234
Query: 412 QPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISD 471
GSL D L + + + D
Sbjct: 1235 -----------------SGSLPLLV-------------------ADFLPDGEDLSMVAVD 1258
Query: 472 KDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQH--VNTFFKIRCKPSSISDAPGARSR- 528
D ++ + + PE +S GH L+ +T F + + +T R P+S S S
Sbjct: 1259 ADGDIHVLEFNPEHPKSLQGHLLLHRTTFSVTPNPPTSTLLLPRTLPASQSATASQDSST 1318
Query: 529 ---FLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYY 585
L AS G+L PLPE YRRLL + N ++ GGL+ RA R +G G
Sbjct: 1319 PQPHLLLLASPSGSLAALTPLPESAYRRLLSVTNQLLPALVPHGGLHARAHRAPEGGGGM 1378
Query: 586 A-------GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEAL 634
+ R I+DG+++ ++ +L +R E+ + G ++ +++ D+EA+
Sbjct: 1379 SRMVGVETAASGRAIVDGAILTRWNELGAAKRAEVASRGG--YDSVMELREDLEAV 1432
>gi|302924728|ref|XP_003053954.1| predicted protein [Nectria haematococca mpVI 77-13-4]
gi|256734895|gb|EEU48241.1| predicted protein [Nectria haematococca mpVI 77-13-4]
Length = 1429
Score = 192 bits (487), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 176/632 (27%), Positives = 284/632 (44%), Gaps = 91/632 (14%)
Query: 17 VQELLTVSLG-LHGNRPLLLVRTQ-HELLIYQAFRH-PKGA-----LKLRFKK-----LK 63
++ELL LG P L++R Q +L IY+ R+ P+GA L FKK L
Sbjct: 836 LRELLVADLGDTVSQSPYLILRNQTDDLTIYEPLRYQPEGAEPTLSATLTFKKTSNAALA 895
Query: 64 VLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHP 123
V + A +QP R +R +N+ GY VFL GP P+++ +S+ R
Sbjct: 896 TSPVETSQEDAVQQP------RFVPLRTCANVNGYSTVFLPGPSPSFILKSSKSIPRVIG 949
Query: 124 MTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSY-DAPWPVRKVPLKCTPHF 182
+ G + ++ FH C RGF+Y + + R++ LP+ ++ D V+KVPL
Sbjct: 950 LQGLG-IRGMSTFHTEGCDRGFIYADDEGIARVTQLPSETNFTDLGISVKKVPLDSDVCG 1008
Query: 183 LAYHLETKTYCIVTSTAEP-----STDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLF 237
+AYH T TY +T EP DY+K KE +T P + + + L
Sbjct: 1009 IAYHQPTGTYIAGCTTNEPFELPRDDDYHKEWA--KETLT-------FAPTMPRGVLKLI 1059
Query: 238 SPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRI 297
SP S I L E + C+K + +E R + +GT + ED+ RGR+
Sbjct: 1060 SPVSLTVIHDQE--LESCESIECMKTLQLEVSEETKERRFLLTVGTALSKGEDLPIRGRV 1117
Query: 298 LLFDIIEVVPEPGQPLTKNKIKMIYAKEQ--KGPVTAICHVA--GFLVTAVGQKIYIWQL 353
+FDI+ V+PEPG+P T ++K I A+E +G VTAI + G ++ A GQK + L
Sbjct: 1118 HVFDIVTVIPEPGKPETNKRLKAI-AREDIPRGGVTAISEIGTQGLMLVAQGQKCMVRGL 1176
Query: 354 K-DNDLTGIAFIDTEVYIASMVSV--KNLILVGDYARSIALLRYQPEYRTLSLVARDYKP 410
K D L +AF+D +++S + L ++ D + + Y E T ++ + +
Sbjct: 1177 KEDGSLLPVAFLDMSCHVSSARELPRTGLCVMADAFKGVWFAGYTEEPYTFKILGKSHG- 1235
Query: 411 TQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMIS 470
RL + D L + + + +
Sbjct: 1236 ---------------------------------RLPLLVA------DFLPDGEDLAIVAA 1256
Query: 471 DKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI--RCKPSSISDAPGARSR 528
D D ++ + + PE +S GH L+ +T F + + T + R P + +P S+
Sbjct: 1257 DADGDLHILEFNPEHPKSLQGHLLLHRTTFSVSPNPPTSMLLLPRTTPPA-HPSPSDPSQ 1315
Query: 529 FLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGN 588
L AS G L +PLPE YRRLL + N ++ + GGLN + +R G +
Sbjct: 1316 IL-LLASPSGHLSTLVPLPEATYRRLLSVTNQLLPALTPYGGLNAKGYRLPSGTRPVGVD 1374
Query: 589 PSRG--IIDGSLVWKFLQLSLGERLEICKKIG 618
+ G I+DG+++ ++ +L +R EI K G
Sbjct: 1375 AAAGRTIVDGAILARWAELGAAKRAEIAGKGG 1406
>gi|156040479|ref|XP_001587226.1| hypothetical protein SS1G_12256 [Sclerotinia sclerotiorum 1980]
gi|154696312|gb|EDN96050.1| hypothetical protein SS1G_12256 [Sclerotinia sclerotiorum 1980 UF-70]
Length = 1447
Score = 192 bits (487), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 172/659 (26%), Positives = 282/659 (42%), Gaps = 91/659 (13%)
Query: 10 SAMDETIVQELLTVSLGLH-GNRPLLLVR-TQHELLIYQAFRHPKGALKLRFKKLKVLFV 67
SA+ ET+ E+L LG P L++R + +L IY+ FR + L L+ L +
Sbjct: 836 SAIRETLT-EILVADLGDSVSQSPYLILRPSNDDLTIYEPFRIASTSPNLLSSTLQFLKI 894
Query: 68 SDR------SKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRA 121
+ A EQ + MR SN+ GY VF+ G P+++ +S+ +
Sbjct: 895 HNTHLAQAPDVSAEEQADETQQTSDKPMRAVSNLGGYSVVFMPGGSPSFIVKSSKTLPKV 954
Query: 122 HPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSY-DAPWPVRKVPLKCTP 180
+ G V L+ FH C RGF+Y + + +R++ P ++ D +RKV +
Sbjct: 955 LSLQGTG-VRGLSSFHTEGCDRGFIYADTEGIVRVAQFPPTTTFADIGMALRKVEIGEDV 1013
Query: 181 HFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPF 240
H +AYH +TY I TST TD+ +D D F P + + + L SP
Sbjct: 1014 HAVAYHSPLQTYVIGTSTF---TDFELPKDDDHRRSWQEEDIAFKPS-IEKSSLKLISPV 1069
Query: 241 SWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLF 300
+W I L E + C+K +++ + + + +GT ED+ GR+ ++
Sbjct: 1070 NWSVI--DTIELEPCEVITCIKTMNLVVSEVTNERKPLLVVGTAITKGEDLATTGRLYVY 1127
Query: 301 DIIEVVPEPGQPLTKNKIKMIYAKE----QKGPVTAICHVA--GFLVTAVGQKIYIWQLK 354
D++ VVPEP +P T K+K+I A+ GPVT + + GF++ A GQK + LK
Sbjct: 1128 DVVIVVPEPDRPETNKKLKLISAETITRGAGGPVTGLSEIGTQGFMLVAQGQKCMVRGLK 1187
Query: 355 DNDLT-GIAFIDTEVYIASMVSV--KNLILVGDYARSIALLRYQPEYRTLSLVARDYKPT 411
++ +AF+DT Y+ S+ + L ++ D + + Y E + L +
Sbjct: 1188 EDGTNLPVAFMDTNCYVTSIKELPGTGLCVIADALKGVWFAGYTEEPYKMLLFGKS---- 1243
Query: 412 QPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEI-CKKIGSKHNDILDEFSSMGFMIS 470
R+E+ C D+L + + + +
Sbjct: 1244 ------------------------------ATRMEVLCA-------DLLPDGKDLFIVAA 1266
Query: 471 DKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSI----------- 519
D D N+ + Y PE +S GH L+ +T F LG H T + S+
Sbjct: 1267 DADGNLHIMQYDPEHPKSLQGHLLLHRTTFSLGAHHPTTMTLLPAIPSLHPLTTASSSSL 1326
Query: 520 -----SDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPR 574
D+P L S G PL E YRR L + + H GLNPR
Sbjct: 1327 SPSPQEDSPSPSQSLL--LTSRTGTFALLSPLTESQYRRFGTLVSHLTNTLYHPCGLNPR 1384
Query: 575 AFRTYK--GKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI 631
A+R K +G G R IIDG ++ ++++L R E+ ++G ++ DEL ++
Sbjct: 1385 AYRVDKDANEGIVGG---RTIIDGGVLGRWMELGSQRRGEVAGRVGVDVLELRDELSEL 1440
>gi|169864473|ref|XP_001838845.1| cleavage factor protein [Coprinopsis cinerea okayama7#130]
gi|116500065|gb|EAU82960.1| cleavage factor protein [Coprinopsis cinerea okayama7#130]
Length = 1458
Score = 191 bits (486), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 147/544 (27%), Positives = 246/544 (45%), Gaps = 82/544 (15%)
Query: 98 YQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNV-----NCP----RG-FLY 147
Y GVF G P W+ T +G ++ +P HNV C RG FL
Sbjct: 971 YSGVFFTGDKPNWIIGTDKGGVQIYPSG-----------HNVVHSFSACSLWEERGEFLV 1019
Query: 148 FNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYK 207
+ I LP +Y P P R +P + + T C++ + + +
Sbjct: 1020 YTEDGPCLIEWLP-DFTYSHPLPARSIPRGRGYSNVVFDPST---CLIVAASSMQARFAS 1075
Query: 208 FNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSME 267
++ ED V + P+ + L SP SW I F E++ + V++E
Sbjct: 1076 YD-EDGVRVWEKDGPGVDDPITDTSALELISPNSW--ITMDGFEFATNEYINDISIVTLE 1132
Query: 268 YEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPG-QPLTKNKIKMIYAKEQ 326
T +G + +IA+GT + ED+ +G +F+I+EVVP+P P K+++ +
Sbjct: 1133 TAATETGSKDFIAVGTTIDRGEDLAAKGAAYIFEIVEVVPDPAISPTRWYKLRLRCRDDA 1192
Query: 327 KGPVTAICHVAGFLVTAVGQKIYIWQL-KDNDLTGIAFIDTEVYIASMVSVKNLILVGDY 385
KGPVTA+C G+LV+++GQKI++ D L G+AF+D +Y+ S+ +KNL+L+GD
Sbjct: 1193 KGPVTAVCGFQGYLVSSMGQKIFVRAFDSDERLVGVAFMDVGIYVTSLRVLKNLLLIGDA 1252
Query: 386 ARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERL 445
+S+ + +Q + L L+A+D ++ DG L
Sbjct: 1253 VKSVMFVAFQEDPYKLVLLAKDVHLHSVTRADFFFNA------DGDL------------- 1293
Query: 446 EICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQ- 504
++ D++ + ++ Y P +S G L+ +T++H GQ
Sbjct: 1294 --------------------ALIVGDEEGIMRIYEYNPNDPDSRDGRYLLLRTEYH-GQV 1332
Query: 505 --HVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMV 562
H +T R K D +S L S DG+L +P+ E ++RL +LQ +
Sbjct: 1333 PYHTSTTIARRDK----EDPSIPQSHLL--IGSADGSLSSLVPVDEYAFKRLQLLQGQLT 1386
Query: 563 THTSHTGGLNPRAFRTYKGKGYYAGNP-SRGIIDGSLVWKFLQLSLGERLEICKKIGSKH 621
+ H GLNP+AFR K Y P S+GI+DG L+ ++ L + + E+ K+IG++
Sbjct: 1387 RNIQHVAGLNPKAFRIVKND--YVSKPLSKGILDGQLLAQYESLPIPRQNEMTKQIGTER 1444
Query: 622 NDIL 625
+L
Sbjct: 1445 GVVL 1448
>gi|389740693|gb|EIM81883.1| hypothetical protein STEHIDRAFT_65512 [Stereum hirsutum FP-91666 SS1]
Length = 1438
Score = 191 bits (485), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 165/650 (25%), Positives = 294/650 (45%), Gaps = 77/650 (11%)
Query: 1 MGNFRSHSPSAMDETIVQELLTVSLGLHGNRPLLLVRTQH-ELLIYQA--FRHPKGALK- 56
+ + P E + ++L LG RP L V + +L IY+A F P G +
Sbjct: 835 LSSLPQDQPRKPQELDIDQILVAPLGETSPRPHLFVLLRSGQLAIYEAVSFELPTGDPEP 894
Query: 57 ------LRFKKLKVLFVSDRSKRANEQPG----LPRGVRISQM--RYFSNIA---GYQGV 101
L K +KVL + + +EQP L +I ++ + ++ A + GV
Sbjct: 895 ASRPSILPVKLVKVLSRAFDIQHPDEQPQEKSVLAELKKIQRLFIPFVTSPAPEKTFTGV 954
Query: 102 FLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPT 161
F G P W+ T +G +R H + V + P + FL + + + +P
Sbjct: 955 FFTGDRPCWILGTDKGGIRVH-SSGHAVVHSFTPCSLWDSKGDFLLYTDEGPCLLEWMP- 1012
Query: 162 HLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRD 221
+ P R +P + + T C++ A + F+ ED +P
Sbjct: 1013 DVQLHTELPSRFMPRSRAYTNVVFDPFT---CLIVGAASLKAQFTSFD-EDGNQTWEPDA 1068
Query: 222 SRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIAL 281
P + L +P +W + F +E V ++ V +E + T SG + +IA+
Sbjct: 1069 PNISYPTTDCSTLELITPDAWLTMDGYEFASNEI--VNAVECVMLETQSTDSGQKSFIAV 1126
Query: 282 GTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTK-NKIKMIYAKEQKGPVTAICHVAGFL 340
GT N ED+ +G +F+I+EVVP+P + + K++M + KGPVTA+C + G+L
Sbjct: 1127 GTTINRGEDLAVKGATYIFEIVEVVPDPSFGVKRWFKLRMRCRDDAKGPVTALCGMDGYL 1186
Query: 341 VTAVGQKIYIWQLK-DNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYR 399
V+++GQKI++ L D L G+AF+D VY+ S+ ++KNL+++ D +S+ + +Q +
Sbjct: 1187 VSSMGQKIFVRALDLDERLVGVAFLDVGVYVTSLRALKNLLIISDAVKSVWFVAFQEDPY 1246
Query: 400 TLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDIL 459
L+++A+D + S ++ N QLS+
Sbjct: 1247 KLTVLAKDAQQVCFTSADFFFANQ--------------QLSI------------------ 1274
Query: 460 DEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQ--HVNTFFKIRCKPS 517
+ D++ + ++ Y P ES G RL+ +FH GQ + ++ R
Sbjct: 1275 --------VTCDEEGILRMYHYNPHDPESKNGQRLLCHAEFH-GQIEYRSSLTIARRTKG 1325
Query: 518 SISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFR 577
++ P A+ S DG+L +P+ E ++RL +LQ + + HT LNPRAFR
Sbjct: 1326 PDTEIPQAK----LICGSPDGSLSALVPVEEAAFKRLHLLQGQLTRNVQHTAALNPRAFR 1381
Query: 578 TYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDE 627
+ + Y + +G +DG L+ F L + ++EI ++IG++ +L +
Sbjct: 1382 AVRNE-YVSKTLHKGFLDGLLLRSFEDLPVSRQIEITRQIGTERRLVLKD 1430
>gi|380494933|emb|CCF32776.1| cft-1, partial [Colletotrichum higginsianum]
Length = 542
Score = 191 bits (484), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 158/581 (27%), Positives = 256/581 (44%), Gaps = 81/581 (13%)
Query: 73 RANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVST 132
ANEQP R +R +NI GY VFL G P+ + +++ + + G V
Sbjct: 15 EANEQP------RFVPLRPCANINGYSTVFLPGASPSLIVKSAKSSPKVVGLQGIG-VRG 67
Query: 133 LAPFHNVNCPRGFLYFNAKSELRISVLPTHLSY-DAPWPVRKVPLKCTPHFLAYHLETKT 191
++ FH C RGF+Y +++ + R++ LP ++ + VRK+P+ +AYH +T
Sbjct: 68 MSSFHTEGCERGFIYADSEGQTRVTQLPADSNFAELGVSVRKIPIGDAVGLIAYHPPMET 127
Query: 192 YCIVTSTAE----PSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQ 247
Y + S +E P D Y KE + S PL + V L SP +W I
Sbjct: 128 YAVACSISEHFELPKDDDYH-----KEWAKETTTSY---PLTERGIVKLMSPTTWSVIDT 179
Query: 248 TNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVP 307
HE +C+K + +E R I +GT N ED+ RGRIL++D++ VVP
Sbjct: 180 VELEPHEV--AMCMKTLHLEVSEETKERRMLITIGTAINRGEDLPIRGRILVYDVVPVVP 237
Query: 308 EPGQPLTKNKIKMIYAKEQ--KGPVTAICHVA--GFLVTAVGQKIYIWQLK-DNDLTGIA 362
+PG+P T K+K++ AKE+ +G VT +C V G ++ A GQK + LK D L +A
Sbjct: 238 QPGRPETNKKLKLV-AKEEIPRGAVTGLCEVGSQGLMLVAQGQKCMVRGLKEDGTLLPVA 296
Query: 363 FIDTEVYIASMVSVKNL--ILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYA 420
F+D Y+ ++ V+ L+ D + + + Y E
Sbjct: 297 FMDMNCYVTAVREVRGTGYCLMTDAFKGVWFVGYAEE----------------------- 333
Query: 421 GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFM 480
P + ++ G + F L+ D + + ++ DKD + +
Sbjct: 334 --PYKMMLFGKSMGNFEVLTA---------------DFVVAGDELHIVVCDKDGVIHVMQ 376
Query: 481 YQPEARESNGGHRLIKKTDFHLGQHVNTFF----KIRCKPSSISDAPGARSRFLTWYASL 536
+ PE +S GH L+ + F + T + PS+ S + + L AS
Sbjct: 377 FDPEHPKSLQGHLLLNRASFSAAPNHPTITLSLPRTPISPSATSVSKNPPTTLL--LASP 434
Query: 537 DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAG---NPSRGI 593
GAL PL E+ YRRL L N + H NP+ R G + R I
Sbjct: 435 TGALASLTPLSEQAYRRLTSLANSIAGALPHAAATNPKGHRLQPLDARTPGVDTSAGRSI 494
Query: 594 IDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEAL 634
+DG+L+ ++ +L G R E+ K G + D+ + ++E +
Sbjct: 495 VDGALLARWNELGAGRRSEVAGKGG--YGDVHEVRSELEGV 533
>gi|409046890|gb|EKM56369.1| hypothetical protein PHACADRAFT_93103 [Phanerochaete carnosa
HHB-10118-sp]
Length = 1417
Score = 190 bits (483), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 133/537 (24%), Positives = 250/537 (46%), Gaps = 54/537 (10%)
Query: 97 GYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRI 156
+ GVFL G P W+ T +G ++ P + V FL ++ + I
Sbjct: 929 AFSGVFLTGDRPCWILSTDKGGVKIMP-SGHQVVHAFTACSLWESKGDFLLYSDEGPSLI 987
Query: 157 SVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELV 216
+P + ++ P R +P + P+ T T + S+ + + Y ED+ +V
Sbjct: 988 EWVP-EIQFEGHLPSRSIP-RPRPYSHVVFEPTTTLLVAASSLQSTFTSYD---EDRNVV 1042
Query: 217 TDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLR 276
+P + P+ + L SP +W + F +E+ V C++ +++E T +G +
Sbjct: 1043 WEPDEPNMSLPVCETSALELISPDTWTTMDGYEFAQNEF--VTCMECITLETLSTETGTK 1100
Query: 277 GYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTK-NKIKMIYAKEQKGPVTAICH 335
++A+ T N ED+ +G + +F+++EVVP+P + ++K+ E KGPVTA+C
Sbjct: 1101 DFVAVSTTINRGEDLAVKGAVYIFEVVEVVPDPAMGQKRWYRLKLHCRDEAKGPVTALCG 1160
Query: 336 VAGFLVTAVGQKIYIWQLK-DNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
+ +LV+++GQKI++ L D L G+AF+D VY+ S+ +VKNL+++GD + + L+ +
Sbjct: 1161 MDNYLVSSMGQKIFVRALDLDERLVGVAFLDVSVYVTSLRAVKNLLVIGDALKGVWLVAF 1220
Query: 395 QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
Q + L ++A+DY P + + +I
Sbjct: 1221 QEDPYKLVVLAKDYYPIPVACADLFFADGKASLIS------------------------- 1255
Query: 455 HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC 514
D++ + L Y P ES G RL+ +T+FH T I
Sbjct: 1256 ---------------CDEEGVLRLSEYDPHDPESRHGQRLLCRTEFHGQTEYRTSHLIAR 1300
Query: 515 KPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPR 574
+ + DA +++ + + DG+L + + +RL +LQ + + H GLNP+
Sbjct: 1301 RGKGL-DAEIPQAKLICGHT--DGSLTSLTYVDDAVSKRLHLLQGQLARNVQHVAGLNPK 1357
Query: 575 AFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI 631
AFR + A ++GI+DG+L+ F L + ++E+ ++I ++ +L + D+
Sbjct: 1358 AFRVVRND-RVARPLTKGILDGNLLAAFEDLPVPRQVEVTRQIATERTTVLKDWLDL 1413
>gi|171695066|ref|XP_001912457.1| hypothetical protein [Podospora anserina S mat+]
gi|170947775|emb|CAP59938.1| unnamed protein product [Podospora anserina S mat+]
Length = 1441
Score = 190 bits (483), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 161/629 (25%), Positives = 279/629 (44%), Gaps = 78/629 (12%)
Query: 17 VQELLTVSLG-LHGNRPLLLVR-TQHELLIYQAFRHPKGA-----LKLRFKKLKVLFVSD 69
V E+L LG P L++R +L +Y+ +R+ GA L F+K+ ++
Sbjct: 841 VSEILVADLGDTTAKSPYLILRHANDDLTMYEPYRYQLGAGLEFPKTLFFQKIPNSVLAK 900
Query: 70 RSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGP 129
+ + + +R +NI GY VFL GP P+++ +S+ + P+
Sbjct: 901 SPAEETDDEEVTHQAKCLALRRCNNIGGYSTVFLPGPSPSFIIKSSKSMPKVLPLQ-GAA 959
Query: 130 VSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSY-DAPWPVRKVPLKCTPHFLAYHLE 188
V+ ++ FH C GF+Y ++ + +R+S LP S+ + V+K+P+ +AYH
Sbjct: 960 VTAISSFHTEGCEHGFIYADSHNIVRVSQLPKDWSFAETGLAVKKIPIGEDIVAVAYHPP 1019
Query: 189 TKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQT 248
+++Y + +T EP ++ +D R+ P + + + L P +W +
Sbjct: 1020 SQSYVVACNTPEP----FELPRDDDYHKEWAREVLPFKPTLERGTLKLIGPITWTVV--D 1073
Query: 249 NFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPE 308
+ E+VLC++ +++E + + I +GT ED+ RG + ++++ +V+PE
Sbjct: 1074 TIVMEPCENVLCVETLNLEVSEATNERKLLIGVGTAITKGEDLPTRGAVYVYNVADVIPE 1133
Query: 309 PGQPLTKNKIKMIYAKEQ--KGPVTAICHVA--GFLVTAVGQKIYIWQLK-DNDLTGIAF 363
PG+P T K+K+I AKE +G VTA+ + G ++ A G K + LK D L +AF
Sbjct: 1134 PGKPETGKKLKLI-AKEDIPRGAVTALSEIGTQGLMLVAQGPKCMVRGLKEDGTLLPVAF 1192
Query: 364 IDTEVYIASMVSV--KNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAG 421
+D Y+ S + L L+ D + + Y E + L +
Sbjct: 1193 MDMNCYVTSAKELPGTGLCLMADAFKGVWFTGYTEEPYKMMLFGKS-------------- 1238
Query: 422 NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMY 481
RLE+ + D L + + D + ++ + +
Sbjct: 1239 --------------------NTRLEVL------NADFLPNGKELSIVACDAEGHIHILQF 1272
Query: 482 QPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSIS-------DAPGARSR-FLTWY 533
PE +S GH L+ +T F G H T K PS++S + GA SR +
Sbjct: 1273 DPEHPKSLQGHLLLHRTSFSTGAHHVT--KSLLLPSTLSPDNKEDNEENGATSRPHILLL 1330
Query: 534 ASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFR----TYKGKGYYAGNP 589
AS G L PL E YRRL L + +H GLNP+ +R T G AG
Sbjct: 1331 ASPTGVLAALRPLSETAYRRLSSLAAQLTNSLTHAAGLNPKGYRMPSATCPPAGVDAGI- 1389
Query: 590 SRGIIDGSLVWKFLQLSLGERLEICKKIG 618
R I+DG+++ +F +L +R E+ + G
Sbjct: 1390 GRHIVDGTILARFSELGRAKRGEVAGRAG 1418
>gi|302694047|ref|XP_003036702.1| hypothetical protein SCHCODRAFT_63425 [Schizophyllum commune H4-8]
gi|300110399|gb|EFJ01800.1| hypothetical protein SCHCODRAFT_63425 [Schizophyllum commune H4-8]
Length = 1396
Score = 190 bits (482), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 167/644 (25%), Positives = 282/644 (43%), Gaps = 91/644 (14%)
Query: 9 PSAMDETIVQELLTVSLGLHGNRP--LLLVRTQHELLIYQAFRHPKGA-----LKLRFKK 61
P E ++++L +LG +P L+L+R+ H L IY+AF + LK R
Sbjct: 807 PRKPQELDIEQILLTNLGQSDPKPHLLVLLRSGH-LAIYEAFATNQAPIVEPPLKPRASS 865
Query: 62 LKVLFVSDRSK-----RANEQPGLPRGVRISQMRY------FSNIAGYQGVFLCGPHPAW 110
L++ FV SK R +E +G+ Q + F+ GVF G P W
Sbjct: 866 LQIQFVKIASKAFEMQRTDETE---KGILAEQKKALRTFVPFACAGAPAGVFFTGDRPHW 922
Query: 111 LFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRG--FLYFNAKSELRISVLPTHLSYDAP 168
+ T +G ++ +P G + A R FL ++ + + + T P
Sbjct: 923 IVATDKGGVQMYP---SGHAAVYAFSACTLWERSTEFLIYSEEGQTLCEWI-TEYEIGRP 978
Query: 169 WPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPL 228
P+R +P + Y ++ + A + F+ ED + P P
Sbjct: 979 LPMRHIPRGRAYSNIVYE---PASSMIVAAASLRARFASFD-EDGNQIWAPDGPGITEPT 1034
Query: 229 VSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYS 288
V + L SP W + F +E+ V ++ V +E T +G++ +IA+GT+
Sbjct: 1035 VECSTLELISPEVWATVDGYEFATNEF--VNTMECVPLETVSTEAGVKHFIAVGTSIVRG 1092
Query: 289 EDVTCRGRILLFDIIEVVPEPGQ-PLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQK 347
ED+ +G +F+++EVVP+ P ++K+ + KGPVTA+C + +LV+++GQK
Sbjct: 1093 EDLAVKGATYIFEVVEVVPDQSNGPKRWYRLKLRCRDDAKGPVTALCGINNYLVSSMGQK 1152
Query: 348 IYIWQLK-DNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVAR 406
I++ D L G+AF+D VY+ S+ ++KNL+L+GD R I + +Q + L + R
Sbjct: 1153 IFVRAFDLDERLVGVAFMDVGVYVTSLRALKNLLLIGDVVRGIQFVAFQEDPYKLVTLGR 1212
Query: 407 DYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMG 466
D + ++ F+
Sbjct: 1213 DVSRMCATTVDFF------------------------------------------FAEEA 1230
Query: 467 FMISDKDKNVVLFM--YQPEARESNGGHRLIKKTDF--HLGQHVNTFFKIRCKPSSISDA 522
I D+N V+ M Y PEA +S+ G L+K+T+F H +T R K D
Sbjct: 1231 LAIVTTDENGVMSMYNYDPEAPDSHDGRLLLKQTEFNLHTDFRTSTLIARRTK-----DD 1285
Query: 523 PGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGK 582
P L + DG L P+P+ +RL LQ + + H GLNP+A R + +
Sbjct: 1286 PIIPQGILI-FGGTDGTLSCLTPVPDDAAKRLQPLQLQLTRNMQHVAGLNPKALRIVRNE 1344
Query: 583 GYYAGNP-SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDIL 625
+ P S+GI+DG+L+ F L + + E+ ++IG++ IL
Sbjct: 1345 --HVSRPLSKGILDGNLIAYFEHLPITRQDEMTRQIGTERATIL 1386
>gi|342877552|gb|EGU79002.1| hypothetical protein FOXB_10431 [Fusarium oxysporum Fo5176]
Length = 1399
Score = 190 bits (482), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 172/630 (27%), Positives = 278/630 (44%), Gaps = 88/630 (13%)
Query: 17 VQELLTVSLG-LHGNRPLLLVRTQ-HELLIYQAFRHPKG------ALKLRFKKLKVLFVS 68
++E+L LG P L++R Q +L IY+ RH + + L FKK ++
Sbjct: 807 LREILVADLGDTTSQSPYLILRNQTDDLTIYEPLRHVRDGGETSLSATLTFKKTSNTTLA 866
Query: 69 ----DRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPM 124
+ + EQP R +R +NI GY VFL GP P+++ +S+ R +
Sbjct: 867 TIPVETEQDDVEQP------RFVPLRPCANINGYSTVFLPGPSPSFVIKSSKSIPRVIGL 920
Query: 125 TIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSY-DAPWPVRKVPLKCTPHFL 183
G V ++ FH C RGF+Y + K R++ LP ++ + V+KVPL +
Sbjct: 921 QGLG-VRGMSTFHTEGCDRGFIYADDKGIARVTQLPPDTNFTELGISVKKVPLGADVRGI 979
Query: 184 AYHLETKTYCIVTSTAEP-----STDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFS 238
AYH T Y +EP DY+K KE +T PP + + + L S
Sbjct: 980 AYHQPTGAYIAGCMISEPFELPKDDDYHKEWA--KETLT-------FPPTMPRGVLKLIS 1030
Query: 239 PFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRIL 298
P SW I + L E + C+K + +E R + +GT + ED+ RGR+
Sbjct: 1031 PVSWTVIHEVE--LESCESIECMKTLHLEVSEDTKERRFLVTVGTAVSKGEDLPIRGRVH 1088
Query: 299 LFDIIEVVPEPGQPLTKNKIKMIYAKEQ--KGPVTAICHVA--GFLVTAVGQKIYIWQLK 354
+FDI+ V+PEPG+P T ++K I A+E +G VTAI + G ++ A GQK + LK
Sbjct: 1089 VFDIVTVIPEPGRPETNKRLKAI-AREDIPRGGVTAISEIGTQGLMLVAQGQKCMVRGLK 1147
Query: 355 -DNDLTGIAFIDTEVYIASMVSV--KNLILVGDYARSIALLRYQPEYRTLSLVARDYKPT 411
D L +AF+D ++++ + L L+ D + + Y E T ++ + +
Sbjct: 1148 EDGSLLPVAFLDMSCHVSTARELPRTGLCLMADAFKGVWFAGYTEEPYTFKVLGKSHG-- 1205
Query: 412 QPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISD 471
RL + D L + + + +D
Sbjct: 1206 --------------------------------RLPVLVA------DFLPDGEDLAIVAAD 1227
Query: 472 KDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLT 531
D ++ + + PE +S GH L+ +T F + + + + + S P +
Sbjct: 1228 ADGDLHILDFNPEHPKSLQGHLLLHRTSFSVSPNPPSTTLLLPRTLPPSHPPPQDPPHIL 1287
Query: 532 WYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKG---KGYYAGN 588
AS G L +PLPE YRRLL + N ++ + GGLN +A R G G A
Sbjct: 1288 LLASSSGHLATLVPLPETTYRRLLSVTNQLLPALTPHGGLNAKAHRLPDGIRPVGVEAAG 1347
Query: 589 PSRGIIDGSLVWKFLQLSLGERLEICKKIG 618
R I+DG+++ ++ +L +R EI K G
Sbjct: 1348 -GRTIVDGAILARWAELGAAKRAEIAGKGG 1376
>gi|409076059|gb|EKM76433.1| hypothetical protein AGABI1DRAFT_108759 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 1413
Score = 189 bits (481), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 156/631 (24%), Positives = 291/631 (46%), Gaps = 79/631 (12%)
Query: 17 VQELLTVSLGLHGNRPLLLVRTQH-ELLIYQAF---RHPKGALKLRFKKLKVLFVSDRSK 72
++++L +G P L V + +L IY+A ++P+ R L++ FV +K
Sbjct: 829 IEQILLAPIGESSPTPHLCVFLRSGQLAIYEAVVLGQNPEVPDTPRATSLQIQFVKIAAK 888
Query: 73 -------RANEQPGLPRGVRISQMRYFSNIAG------YQGVFLCGPHPAWLFLTSRGEL 119
NE+ L +I++M + + Y GVF G P W+ T R +
Sbjct: 889 SFEIQRPEENEKGILAEHKKINRM-FIPFVTSPRPSVTYSGVFFTGDRPHWILSTDRSGV 947
Query: 120 RAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCT 179
+ +P + V P FL + + + +P +D P P+R +P
Sbjct: 948 QVYP-SGHNVVHAFTPCSLWESKGEFLMYTEDGPILVEWVP-DFQFDGPLPMRSIPRGRA 1005
Query: 180 PHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSP 239
+ + T IV +++ ST + F+ ED + +P P V + L +P
Sbjct: 1006 --YSNVLFDPSTSLIVAASSLQST-FTSFD-EDGNNIWEPDAPNISSPSVDCSALELIAP 1061
Query: 240 FSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILL 299
W + F +E+ + + + V++E T +G + +IA+GT + ED+ +G +
Sbjct: 1062 DIWATMDGFEFATNEYINDMTI--VTLETAATETGTKDFIAVGTTIDRGEDLAVKGATYI 1119
Query: 300 FDIIEVVPEPGQPLTKN---KIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQL-KD 355
F+I EVVP+ Q +++ K+++ + KGPVTA+C ++ +LV+++GQKI++ D
Sbjct: 1120 FEIAEVVPD--QAVSQRRWYKLRLRCRDDAKGPVTAVCGLSDYLVSSMGQKIFVRAFDSD 1177
Query: 356 NDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNS 415
L G+AF+D VY+ S+ ++KNL+L+GD +S+ + +Q + L L+++D +
Sbjct: 1178 ERLVGVAFMDVGVYVTSLQTLKNLLLIGDAVKSVQFVAFQEDPYKLVLLSKDIQ------ 1231
Query: 416 KGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKN 475
+C D L + + + D++
Sbjct: 1232 ------------------------------SVC----VTRADFLFSENDLRLVTGDEEGI 1257
Query: 476 VVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYAS 535
+ ++ Y P+ +S G L+ +T+FH + T + + P SR LT S
Sbjct: 1258 IRIYEYNPQDPDSREGRHLLLETEFHGQREYRTSVLVAHRIKEDQSIPN--SRLLT--GS 1313
Query: 536 LDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNP-SRGII 594
DG+L + E+ ++RL +LQ ++ + H LNP+AFR K + Y P +RGI+
Sbjct: 1314 ADGSLASLTIVEEEAFKRLGLLQGQLMRNIQHMAALNPKAFRIVKNE--YVSKPLTRGIL 1371
Query: 595 DGSLVWKFLQLSLGERLEICKKIGSKHNDIL 625
DG+L+ ++ L + + E ++IG+ ++L
Sbjct: 1372 DGNLLGQYESLPINRQSEATQQIGADRVNVL 1402
>gi|426194401|gb|EKV44332.1| hypothetical protein AGABI2DRAFT_187183 [Agaricus bisporus var.
bisporus H97]
Length = 1413
Score = 189 bits (481), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 156/631 (24%), Positives = 291/631 (46%), Gaps = 79/631 (12%)
Query: 17 VQELLTVSLGLHGNRPLLLVRTQH-ELLIYQAF---RHPKGALKLRFKKLKVLFVSDRSK 72
++++L +G P L V + +L IY+A ++P+ R L++ FV +K
Sbjct: 829 IEQILLAPIGESSPTPHLCVFLRSGQLAIYEAVVLGQNPEVPDTPRATSLQIQFVKIAAK 888
Query: 73 -------RANEQPGLPRGVRISQMRYFSNIAG------YQGVFLCGPHPAWLFLTSRGEL 119
NE+ L +I++M + + Y GVF G P W+ T R +
Sbjct: 889 SFEIQRPEENEKGILAEHKKINRM-FIPFVTSPRPSVTYSGVFFTGDRPHWILSTDRSGV 947
Query: 120 RAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCT 179
+ +P + V P FL + + + +P +D P P+R +P
Sbjct: 948 QVYP-SGHNVVHAFTPCSLWESKGEFLMYTEDGPILVEWVP-DFQFDGPLPMRSIPRGRA 1005
Query: 180 PHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSP 239
+ + T IV +++ ST + F+ ED + +P P V + L +P
Sbjct: 1006 --YSNVLFDPSTSLIVAASSLQST-FTSFD-EDGNNIWEPDAPNISSPSVDCSALELIAP 1061
Query: 240 FSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILL 299
W + F +E+ + + + V++E T +G + +IA+GT + ED+ +G +
Sbjct: 1062 DIWATMDGFEFATNEYINDMTI--VTLETAATETGTKDFIAVGTTIDRGEDLAVKGATYI 1119
Query: 300 FDIIEVVPEPGQPLTKN---KIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQL-KD 355
F+I EVVP+ Q +++ K+++ + KGPVTA+C ++ +LV+++GQKI++ D
Sbjct: 1120 FEIAEVVPD--QAVSQRRWYKLRLRCRDDAKGPVTAVCGLSDYLVSSMGQKIFVRAFDSD 1177
Query: 356 NDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNS 415
L G+AF+D VY+ S+ ++KNL+L+GD +S+ + +Q + L L+++D +
Sbjct: 1178 ERLVGVAFMDVGVYVTSLQTLKNLLLIGDAVKSVQFVAFQEDPYKLVLLSKDIQ------ 1231
Query: 416 KGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKN 475
+C D L + + + D++
Sbjct: 1232 ------------------------------SVC----VTRADFLFSENDLRLVTGDEEGI 1257
Query: 476 VVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYAS 535
+ ++ Y P+ +S G L+ +T+FH + T + + P SR LT S
Sbjct: 1258 IRIYEYNPQDPDSREGRHLLLETEFHGQREYRTSVLVAHRIKEDQSIPN--SRLLT--GS 1313
Query: 536 LDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNP-SRGII 594
DG+L + E+ ++RL +LQ ++ + H LNP+AFR K + Y P +RGI+
Sbjct: 1314 ADGSLASLTIVEEEAFKRLGLLQGQLMRNIQHMAALNPKAFRIVKNE--YVSKPLTRGIL 1371
Query: 595 DGSLVWKFLQLSLGERLEICKKIGSKHNDIL 625
DG+L+ ++ L + + E ++IG+ ++L
Sbjct: 1372 DGNLLGQYESLPINRQSEATQQIGADRVNVL 1402
>gi|393245434|gb|EJD52944.1| hypothetical protein AURDEDRAFT_81080 [Auricularia delicata TFB-10046
SS5]
Length = 1422
Score = 189 bits (480), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 140/530 (26%), Positives = 247/530 (46%), Gaps = 56/530 (10%)
Query: 100 GVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVL 159
GVFL G P W+ T + +R P ++ V + FL + + +
Sbjct: 939 GVFLTGGKPGWILGTDKTAVRLVP-AVNQVVHSFTACSLWGNRGEFLMNTDEGPCLVEWM 997
Query: 160 PTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDP 219
P L D P +P +AY T +V + + + + F+ ED V P
Sbjct: 998 P-DLRLDEELPSFFMPRGRPYTSIAYE---ATTGMVIAASSLRSRFVLFD-EDGNTVWKP 1052
Query: 220 RDSRFIP-PLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGY 278
D+ FI P + L P +W + F +E + ++ V++E T +G + +
Sbjct: 1053 -DAEFISDPTTDTSSLELIDPETWTTVDGFEFAFNEM--INTVRTVNLETVSTEAGSKDF 1109
Query: 279 IALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAG 338
IA+GT ED+ +G +F++IEVVP+ Q ++K+K+ E KGPV+A+C + G
Sbjct: 1110 IAVGTTVFRGEDLAVKGATYIFEVIEVVPDDTQQ-RRHKLKLWCRDEAKGPVSALCGING 1168
Query: 339 FLVTAVGQKIYIWQLKDND-LTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPE 397
+LV+++GQK+++ N+ L G+AF+D +Y+ S+ ++KNL+L+GD +S+ + +Q +
Sbjct: 1169 YLVSSMGQKVFVRAFDLNERLVGVAFMDVGIYVTSLRTLKNLLLIGDAVKSVWFVAFQED 1228
Query: 398 YRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHND 457
L L+ +D++ S ++ G
Sbjct: 1229 PFKLQLLGKDFQRAALTSAEFFFG------------------------------------ 1252
Query: 458 ILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPS 517
F M + +D+ + +F Y P E+ G +L+ +T+F+ I + +
Sbjct: 1253 ----FGEMTIVSTDEQNVLRIFRYDPMHAEAQDGQKLLCQTEFNTQSDARGTTTI-LRRT 1307
Query: 518 SISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFR 577
S D +S+ + Y DG+L LP+ E ++RL +LQ M + H GL+P+AFR
Sbjct: 1308 SDEDILLPQSKIM--YCGTDGSLSALLPVEEHVFKRLHLLQGQMTRNIQHVAGLHPKAFR 1365
Query: 578 TYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDE 627
+ + A +RGI+D +L+ KF +L L ++E K+IG IL +
Sbjct: 1366 VVRND-FTARPLARGILDSNLLAKFEELPLSRQVEFTKQIGQSREVILGD 1414
>gi|449543656|gb|EMD34631.1| hypothetical protein CERSUDRAFT_116804 [Ceriporiopsis subvermispora
B]
Length = 1440
Score = 189 bits (480), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 165/645 (25%), Positives = 280/645 (43%), Gaps = 75/645 (11%)
Query: 9 PSAMDETIVQELLTVSLGLHGNRPLLLVRTQH-ELLIYQAFRHPKGA----------LKL 57
P E +++++ LG RP L V + +L +Y+ A + +
Sbjct: 849 PRKPQELDIEQIVVAPLGESSPRPYLTVFLRSGQLAVYETIPVAPPADPLPNSRSCTILV 908
Query: 58 RFKK-LKVLFVSDRSKRANEQPGLPRGVRISQMR--YFSNIAGYQ---GVFLCGPHPAWL 111
RF+K L F + E+ L RIS++ + ++ Q GVF G P W+
Sbjct: 909 RFRKVLSKAFDIQQQNEEVEKSVLAEQKRISRLLIPFVTSPNPGQTLSGVFFTGDRPCWI 968
Query: 112 FLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPV 171
T +G ++ P + V FL ++ + + +P + D P
Sbjct: 969 LSTDKGGVKVFP-SGHSVVHAFTASSVWESKSDFLLYSEEGPSLLEWIP-GVQLDGHLPS 1026
Query: 172 RKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQ 231
R VP + Y + T IV +++ S + ED +V +P S P
Sbjct: 1027 RTVPRNKAYSNVVY--DPSTSLIVAASS--SQSRFASYDEDGNIVWEPDASNISLPFCET 1082
Query: 232 FHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDV 291
+ L SP W + F +E+ V CL V++E T SG + YI +GT N ED+
Sbjct: 1083 STLELLSPDGWVTLDGYEFAPNEF--VNCLDCVTLETSSTESGTKDYIVVGTTINRGEDL 1140
Query: 292 TCRGRILLFDIIEVVPEPGQPLTK-NKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYI 350
+G +F+IIEVVP+P + + +++K+ + KGPVTA+C + G+LV+++GQKI++
Sbjct: 1141 AVKGAAYVFEIIEVVPDPTAQMKRWHRLKLHCRDDAKGPVTAMCGMNGYLVSSMGQKIFV 1200
Query: 351 WQLK-DNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYK 409
D L G+AF+D VY+ S+ +VKNL+++ D +S+ + +Q + L ++ +D
Sbjct: 1201 RAFDLDERLVGVAFLDVGVYVTSLCAVKNLLVISDAVKSVWFVAFQEDPYKLVILGKDPY 1260
Query: 410 PTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMI 469
P ++ II
Sbjct: 1261 PLYVTKADFFFAEGRVSIIS---------------------------------------- 1280
Query: 470 SDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRF 529
D+D + + Y P ES G L+++T+FH GQ I + D P +SR
Sbjct: 1281 CDEDGVMRILEYDPHDPESKNGQHLLRRTEFH-GQVEYRTSAILARRLKGVDIP--QSRL 1337
Query: 530 LTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNP 589
+ DG+L + E +RL +LQ + + H GLNPR FR + Y P
Sbjct: 1338 ICGLT--DGSLITMTYVEEAASKRLHLLQGQLTRNVQHVAGLNPRGFRIVRND--YVSRP 1393
Query: 590 -SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEA 633
+RGI+DG+L+ + L + + E+ ++IG+ IL + ++
Sbjct: 1394 LTRGILDGNLLMAYEDLPIVRQDEVTRQIGTDRTTILKDWLSLDG 1438
>gi|150951283|ref|XP_001387581.2| pre-mRNA 3'-end processing factor CF II mRNA cleavage and
polyadenylation factor II complex, subunit CFT1 (CPSF
subunit) RNA processing and modification [Scheffersomyces
stipitis CBS 6054]
gi|149388465|gb|EAZ63558.2| pre-mRNA 3'-end processing factor CF II mRNA cleavage and
polyadenylation factor II complex, subunit CFT1 (CPSF
subunit) RNA processing and modification [Scheffersomyces
stipitis CBS 6054]
Length = 1341
Score = 189 bits (479), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 145/612 (23%), Positives = 279/612 (45%), Gaps = 69/612 (11%)
Query: 28 HGNRPLLLVRTQHELLIYQAFRHPKGALKLRFKKLKVLFVSDRSKRANEQPGLPRGVRIS 87
H L ++ E+L+Y+ + + FKK K L ++ + A P G +
Sbjct: 786 HKEEYLTILTIGGEVLLYKLYFDGEN---YEFKKEKDLAITGAPENA-----YPIGTAVE 837
Query: 88 Q-MRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFL 146
+ + YF N+ GY +F+ G P +L L S + P +++PFH+ G +
Sbjct: 838 RRLAYFPNLNGYTCIFVTGVTP-YLILKSLHSIPRIYQFSKIPAVSISPFHDSKVANGLI 896
Query: 147 YFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYY 206
+ + + RI LP +Y+ WP++ + + + + YH + TY + T DY
Sbjct: 897 FLDNQQNARICQLPLDFNYENTWPMKLIHIGESIRAITYHESSHTYVVSTF---KDIDYE 953
Query: 207 KFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSM 266
F+ E K +V +D P + + L SPF+W I L + E + +K++ +
Sbjct: 954 CFDEEGKPIVGLHKDKP--PSSAYKGSIKLISPFNWSVID--TIELADNELGMTVKSMIL 1009
Query: 267 EYEGTLSGLR---GYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYA 323
+ + + +I +G+ ED++ G +++II+++PEP +P T +K K ++
Sbjct: 1010 DVGSSTKKFKHKKEFIVIGSGKYRMEDLSANGSFRIYEIIDIIPEPDRPETNHKFKEVFK 1069
Query: 324 KEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVG 383
++ KG VT++C V+G + + GQK+ + L+D+ + +AF+DT VY++ S N++++G
Sbjct: 1070 EDTKGAVTSVCEVSGRFLVSQGQKVIVRDLQDDGVVPVAFLDTAVYVSEAKSFGNMMILG 1129
Query: 384 DYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGE 443
D +S+ L+ + E + ++ +D + N
Sbjct: 1130 DSLKSVWLVGFDAEPFRMIMLGKDLQGLDVN----------------------------- 1160
Query: 444 RLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLG 503
C +K ++ +I+D + + L Y PE + G RL+ K+ F +
Sbjct: 1161 ----CADFITKDEEVF-------ILIADNNNVLHLVQYDPEDPTALNGQRLLSKSSFSIN 1209
Query: 504 QHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVT 563
V T K K D F T +++DG+ +P+ E +YRR+ +LQ +
Sbjct: 1210 SFV-TCLKSLPKTEEKYDT----GNFQTIGSTIDGSFFSVVPINEASYRRMYILQQQLTD 1264
Query: 564 HTSHTGGLNPRAFRTYKGKGYYAGNP-SRGIIDGSLVWKFLQLSLGERLEICKKIGSK-- 620
H GLNPR R + G A + ++ I+D ++ + +L+ + + K+ +K
Sbjct: 1265 KEYHYCGLNPRLNR-FGGLSMTANDTNTKPILDYDVIRAYGKLNEERKKNLASKVSAKNI 1323
Query: 621 HNDILDELYDIE 632
+ DI ++ + E
Sbjct: 1324 YQDIWKDIIEFE 1335
>gi|260941626|ref|XP_002614979.1| hypothetical protein CLUG_04994 [Clavispora lusitaniae ATCC 42720]
gi|238851402|gb|EEQ40866.1| hypothetical protein CLUG_04994 [Clavispora lusitaniae ATCC 42720]
Length = 1363
Score = 187 bits (476), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 143/553 (25%), Positives = 256/553 (46%), Gaps = 60/553 (10%)
Query: 88 QMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLY 147
+M YF +++G + + G P + + +++ + P+ + PF G +Y
Sbjct: 857 RMIYFPDVSGTTCIMVTGVIPYMITRSRHSQVKVFKFS-KIPIVSFVPFSTDKIKNGLIY 915
Query: 148 FNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYK 207
+ K RI LP+ SYD WP+RKV + T +A+H + T + T P Y
Sbjct: 916 LDTKKNARIVELPSEFSYDYNWPIRKVSIGETVKSVAFHEGSNTLVVSTLKEIP----YN 971
Query: 208 FNGEDKELVTDPRDSRFIPPLVS-QFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSM 266
E+ + + ++ P +S + + L SP +W I N L + E L +K++ +
Sbjct: 972 CIDEEGNPIVGIKPNK--PSAISYKGSIKLISPVNWSVI--DNIELADNEVGLHVKSMPL 1027
Query: 267 EYEGTLSGLRG---YIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYA 323
+ + ++ +GT ED+ C G L +II+++PEPG+P T +K K
Sbjct: 1028 DVGSETKRFKSKKEFVLVGTGKYRLEDLACNGSYKLLEIIDIIPEPGKPETNHKFKEFTQ 1087
Query: 324 KEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVG 383
++ +G VT+IC V+G + A GQKI + +KDN +AF+DT V+++ S NL+++G
Sbjct: 1088 EDTRGAVTSICEVSGRFLVAQGQKIIVRDIKDNSAVSVAFLDTSVFVSESKSFGNLVVLG 1147
Query: 384 DYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGE 443
D +S+ L + E P R I+ LG+
Sbjct: 1148 DTLKSVWLAGFDAE-------------------------PFRMIM------------LGK 1170
Query: 444 RLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLG 503
L+ + D L + + +++D ++++ + Y PE S+ G RL+ K+ F
Sbjct: 1171 DLQ---GLDVSSADFLVKDEEIYILVADNNRSLHVLQYNPEDPASSNGQRLLHKSSF-TT 1226
Query: 504 QHVNTFFKIRCKPSSISD--APGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVM 561
++ T K K +S P A F T ++++GA+ P+ E YRR+ ++Q +
Sbjct: 1227 NYLTTCTKSVPKHEQLSTWFDPQAIP-FQTVGSTVEGAMYVVFPISEPTYRRMYIMQQQL 1285
Query: 562 VTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKH 621
+ H GLNPR R + + N R ++D L+ +F +L+ + + KI +K+
Sbjct: 1286 IDKEYHHCGLNPRLNRIGRIESVNYANL-RAMLDCELIRRFSKLNEDRKRTLSSKISTKN 1344
Query: 622 --NDILDELYDIE 632
DI +L + E
Sbjct: 1345 VQVDIWKDLIEFE 1357
>gi|301103686|ref|XP_002900929.1| cleavage and polyadenylation specificity factor subunit, putative
[Phytophthora infestans T30-4]
gi|262101684|gb|EEY59736.1| cleavage and polyadenylation specificity factor subunit, putative
[Phytophthora infestans T30-4]
Length = 1561
Score = 187 bits (476), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 168/657 (25%), Positives = 282/657 (42%), Gaps = 139/657 (21%)
Query: 80 LPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDG----------- 128
L G R + F N+ G F G HP W+ L RG PM +
Sbjct: 945 LRAGFRYPMLTCFYNVNNMSGAFFRGAHPMWI-LGDRGHASFVPMCVPSSAPPKANGTSK 1003
Query: 129 --------PVSTLAPFHNVNCPRGFLYFNAKSELRISVLP-----THLSYDAPWPVRKVP 175
PV + PFH+ +CP GF+YF+++ LR+ LP T L + ++K
Sbjct: 1004 NAAPRVSVPVLSFTPFHHWSCPNGFIYFHSRGALRVCELPSSKTSTILPSSGGFVLQKAE 1063
Query: 176 LKCTPHFLAY-----------HLETKTYCIVTST------AEPSTDYYKFNGEDKELVTD 218
T H + Y LE TY +V S A+ +T+ E + D
Sbjct: 1064 FGATLHHMLYLGSHGPGGVAEALEAPTYAVVCSARLKPADADRATEVEGAEEELEPENLD 1123
Query: 219 PRD----SRFIPPLVSQF------HVSLFSPFSWE-EIPQTN----------FPLH--EW 255
P S + P F H++ +E + QT+ F +H +
Sbjct: 1124 PNGNPLGSNVMAPTAEMFADYETDHMAHTEEDVYELRLVQTDEFGEWGRRGVFRVHFERY 1183
Query: 256 EHVLCLK-----NVSMEYEGTLSGL-------RGYIALGTNY--NYSEDVTCRGRILLF- 300
E VL +K + S+ E S R Y+ +GT + + ED + RGR+LL+
Sbjct: 1184 EVVLSVKLMYLYDSSLMKEEVASTSPEWNKKKRPYLVVGTGWVGPHGEDESGRGRLLLYE 1243
Query: 301 -DIIEVVPEPGQPLTKN--KIKMIYAKE-QKGPVTAICHVAGFLVTAVGQKIYIWQLKDN 356
D + V E G + K+++++ KE ++G ++ + + +++ AVG K+ +++ K
Sbjct: 1244 LDYAQYVNEEGGATSGKLPKLRLVFIKEHRQGAISMVSQLGPYVLAAVGSKLIVYEFKSE 1303
Query: 357 DLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSK 416
L G AF D ++YI ++ VK+ ++ GD +S+ LR++ R L L+A+DY+P
Sbjct: 1304 QLIGCAFYDAQMYIVTLSVVKDFVMYGDVYKSVHFLRWREMQRQLVLLAKDYEP------ 1357
Query: 417 GYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNV 476
L + S+ E+ + + D D+N+
Sbjct: 1358 ---------------LAVSATEFSVFEK-------------------KLALLAVDMDENL 1383
Query: 477 VLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPS-SISDAPGAR-----SRFL 530
+ + P+ ES GG RL++ +DFHLG V++ F+ R S S+ A R S ++
Sbjct: 1384 HVMQFAPQDIESRGGQRLLRVSDFHLGVQVSSMFRKRVDASGSVVSATNGRNAAPLSNYV 1443
Query: 531 TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGY-YAGNP 589
+ +G +G +P+ E+ +RRL LQNVMV LNPR FR K G P
Sbjct: 1444 NVMGTSEGGVGALVPVGERVFRRLFTLQNVMVNTLPQNCALNPREFRILKTNAQRRCGRP 1503
Query: 590 S--------RGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
+ +D ++++FLQL + E+ + IG+ + L +++ +S F
Sbjct: 1504 DAWSKKKWKKSFLDAFVLFRFLQLDYVAQKELARCIGTTPEVAMHNLLEVQHATSTF 1560
>gi|12697776|dbj|BAB21613.1| polyadenylation specificity factor [Homo sapiens]
Length = 216
Score = 187 bits (475), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 97/255 (38%), Positives = 147/255 (57%), Gaps = 43/255 (16%)
Query: 387 RSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLE 446
+SI+LLRYQ E +TLSLV+RD KP + S + N
Sbjct: 2 KSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDN------------------------ 37
Query: 447 ICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHV 506
+ +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HV
Sbjct: 38 ----------------AQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHV 81
Query: 507 NTFFKIRCKPSSISDAPGA---RSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVT 563
NTF++ C+ ++ + + ++ +TW+A+LDG +G LP+ EK YRRLLMLQN + T
Sbjct: 82 NTFWRTPCRGATEGLSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTT 141
Query: 564 HTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHND 623
H GLNPRAFR N R ++DG L+ ++L LS ER E+ KKIG+ +
Sbjct: 142 MLPHHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDI 201
Query: 624 ILDELYDIEALSSHF 638
ILD+L + + +++HF
Sbjct: 202 ILDDLLETDRVTAHF 216
>gi|392572878|gb|EIW66021.1| hypothetical protein TREMEDRAFT_70300 [Tremella mesenterica DSM 1558]
Length = 1408
Score = 187 bits (475), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 136/544 (25%), Positives = 246/544 (45%), Gaps = 65/544 (11%)
Query: 92 FSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTID----GPVSTLAPFHNVNCPRGFLY 147
+ N+ G G F+ G P W+ + + LR + + GP + L G +
Sbjct: 916 YDNLEGQSGAFITGEKPYWIMSSEKHPLRLYGLKQGAMAFGPTTHLGSM-------GEYF 968
Query: 148 FNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYK 207
I P L+ D P + ++ T + + + Y T+ + P Y
Sbjct: 969 MKIDDGCFICYFPQSLNTDLTMPCDRYEMQRTYTNVVFDPPSGHYLGATAISVPFQAY-- 1026
Query: 208 FNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFS--WEEIPQTNFPLHEWEHVLCLKNVS 265
E+ E+ P +PPL + + LFS S W I +F + E+VL +++V
Sbjct: 1027 --DEEGEIQLGPEGENLVPPLNERSSLELFSRGSDPWRVIDGYDF--DQNENVLSMQSVL 1082
Query: 266 MEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKE 325
+E G R ++A+GT +++ ED RG + +F+++EVVPEPGQ + +K+
Sbjct: 1083 LESSSVPGGYRDFVAVGTGFDFGEDRATRGNVYIFEVVEVVPEPGQK-SAWALKLRCKDP 1141
Query: 326 QKGPVTAICHVAGFLVTAVGQKIYIWQLK-DNDLTGIAFIDTEVYIASMVSVKNLILVGD 384
+ PV+A+ ++ G+L+ + G K+Y+ L D L G+AF+D +Y+ S+ KN IL+ D
Sbjct: 1142 CRNPVSALGNINGYLLHSNGPKMYVKGLDFDERLMGLAFVDVMIYLTSIKVFKNFILISD 1201
Query: 385 YARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGER 444
+SI L +Q + +++++D P S + + DG +
Sbjct: 1202 MVKSIWFLSFQEDPYKFTVISKDLMPISVTSADFL-------VHDGHVT----------- 1243
Query: 445 LEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQ 504
F+ D+ ++ + + P ES G RLI +T++H G
Sbjct: 1244 ----------------------FLTYDRSGDIRMVDFDPANPESINGERLIVRTEYHGGS 1281
Query: 505 HVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 564
V T + + + + +++ + +A DG++ F+ +RRL + + ++ +
Sbjct: 1282 PV-TVSTMIARRRGVEEEFAPQTQIICAHA--DGSISTFVSTKPARFRRLHFVSDQLIRN 1338
Query: 565 TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 624
H GLNPRAFRT + A SRGI+DG L+ +F + + E+ K+IG+ +
Sbjct: 1339 AQHVAGLNPRAFRTVRND-LVAKPLSRGILDGELLGRFAIQPIDRQREMLKQIGTDGGTV 1397
Query: 625 LDEL 628
+L
Sbjct: 1398 ASDL 1401
>gi|440466842|gb|ELQ36086.1| hypothetical protein OOU_Y34scaffold00669g71 [Magnaporthe oryzae Y34]
gi|440481991|gb|ELQ62520.1| hypothetical protein OOW_P131scaffold01068g7 [Magnaporthe oryzae
P131]
Length = 1475
Score = 187 bits (474), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 162/628 (25%), Positives = 280/628 (44%), Gaps = 69/628 (10%)
Query: 12 MDETIVQELLTVSLG--LHGNRPLLLVRTQHELLIYQAFRHPKGALKL----RFKKLKVL 65
M + E+L LG + + ++L + H+L IY+ +R + + L R +KL
Sbjct: 873 MARETISEILVTDLGDTVFKSPHVILRHSNHDLTIYEPYRIAEDSQSLTKILRLRKLPNP 932
Query: 66 FVSDRSKRAN-EQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPM 124
V+ + N E P P R +R +NIAGY VF+ G P++L +++ + +
Sbjct: 933 AVAKAPEATNSEDP--PLMSRNMPLRACANIAGYSAVFMPGHSPSFLIKSAKATPKVIGL 990
Query: 125 TIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSY-DAPWPVRKVPLKCTPHFL 183
G V ++ FH C RGF+Y ++ R++ +P S+ + V+KVPL +
Sbjct: 991 RGSG-VRAMSSFHTEGCERGFIYADSAGVARVAQIPKDTSFSELGLSVKKVPLGIDADGI 1049
Query: 184 AYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWE 243
AYH T Y + S EP ++ +D +++ P+V + + + +P +W
Sbjct: 1050 AYHSPTGVYVLTCSYWEP----FELPKDDDYHCEWAKENISFKPMVERSVLKVINPINWS 1105
Query: 244 EIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDII 303
+I F HE +C++++++E + + R I +GT ED+ RG I +FD+
Sbjct: 1106 DIWTEEFEQHEV--AMCIRSLNLEVSQSTNERRQLITVGTAMCKGEDLPVRGGIYVFDLA 1163
Query: 304 EVVPEPGQPLTKNKIKMIYAKEQ--KGPVTAICHVA--GFLVTAVGQKIYIWQLK-DNDL 358
VVP+ G+P T K+K + AKE+ +G VT++ + G ++ A GQK + L+ D L
Sbjct: 1164 SVVPQKGRPETDKKLKQV-AKEEIPRGAVTSLSEIGTQGLMMVAQGQKTLVRGLQEDGKL 1222
Query: 359 TGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGY 418
+AF+D Y+ VK L G L ++A +K GY
Sbjct: 1223 PPVAFMDMNCYV---TCVKELAGTG-----------------LCVMADAFKGVW--FCGY 1260
Query: 419 YAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVL 478
G +K + C + D+L + + + +D D N+ +
Sbjct: 1261 TEGP-----------YKMMLFGKSSTNLECMNV-----DLLPDGKDLLIVAADSDGNLHV 1304
Query: 479 FMYQPEARESNGGHRLIKKTDFHLGQH--VNTFFKIRCKPSSISDAPGARS-RFLTWYAS 535
+ PE +S GH L+ +T F G H + P ++ P + + R AS
Sbjct: 1305 LQFDPEHPKSLQGHLLLNRTTFSTGAHHPQKSLLLPTTDPRPSTNQPSSDAERQHILMAS 1364
Query: 536 LDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFR-----TYKGKGYYAGNPS 590
G L PL + Y RL L + ++ H LNP+A+R T +
Sbjct: 1365 PTGVLAAVQPLSQSTYTRLSALASNLMASVPHHAALNPKAYRLPPTSTRNQVAAVDISVG 1424
Query: 591 RGIIDGSLVWKFLQLSLGERLEICKKIG 618
R ++DGSL+ ++ +L+ G R E+ + G
Sbjct: 1425 RAVVDGSLLARWAELASGRRAEVAGRAG 1452
>gi|389641257|ref|XP_003718261.1| cft-1 [Magnaporthe oryzae 70-15]
gi|351640814|gb|EHA48677.1| cft-1 [Magnaporthe oryzae 70-15]
Length = 1452
Score = 186 bits (473), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 162/628 (25%), Positives = 280/628 (44%), Gaps = 69/628 (10%)
Query: 12 MDETIVQELLTVSLG--LHGNRPLLLVRTQHELLIYQAFRHPKGALKL----RFKKLKVL 65
M + E+L LG + + ++L + H+L IY+ +R + + L R +KL
Sbjct: 850 MARETISEILVTDLGDTVFKSPHVILRHSNHDLTIYEPYRIAEDSQSLTKILRLRKLPNP 909
Query: 66 FVSDRSKRAN-EQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPM 124
V+ + N E P P R +R +NIAGY VF+ G P++L +++ + +
Sbjct: 910 AVAKAPEATNSEDP--PLMSRNMPLRACANIAGYSAVFMPGHSPSFLIKSAKATPKVIGL 967
Query: 125 TIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSY-DAPWPVRKVPLKCTPHFL 183
G V ++ FH C RGF+Y ++ R++ +P S+ + V+KVPL +
Sbjct: 968 RGSG-VRAMSSFHTEGCERGFIYADSAGVARVAQIPKDTSFSELGLSVKKVPLGIDADGI 1026
Query: 184 AYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWE 243
AYH T Y + S EP ++ +D +++ P+V + + + +P +W
Sbjct: 1027 AYHSPTGVYVLTCSYWEP----FELPKDDDYHCEWAKENISFKPMVERSVLKVINPINWS 1082
Query: 244 EIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDII 303
+I F HE +C++++++E + + R I +GT ED+ RG I +FD+
Sbjct: 1083 DIWTEEFEQHEV--AMCIRSLNLEVSQSTNERRQLITVGTAMCKGEDLPVRGGIYVFDLA 1140
Query: 304 EVVPEPGQPLTKNKIKMIYAKEQ--KGPVTAICHVA--GFLVTAVGQKIYIWQLK-DNDL 358
VVP+ G+P T K+K + AKE+ +G VT++ + G ++ A GQK + L+ D L
Sbjct: 1141 SVVPQKGRPETDKKLKQV-AKEEIPRGAVTSLSEIGTQGLMMVAQGQKTLVRGLQEDGKL 1199
Query: 359 TGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGY 418
+AF+D Y+ VK L G L ++A +K GY
Sbjct: 1200 PPVAFMDMNCYV---TCVKELAGTG-----------------LCVMADAFKGVW--FCGY 1237
Query: 419 YAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVL 478
G +K + C + D+L + + + +D D N+ +
Sbjct: 1238 TEGP-----------YKMMLFGKSSTNLECMNV-----DLLPDGKDLLIVAADSDGNLHV 1281
Query: 479 FMYQPEARESNGGHRLIKKTDFHLGQH--VNTFFKIRCKPSSISDAPGARS-RFLTWYAS 535
+ PE +S GH L+ +T F G H + P ++ P + + R AS
Sbjct: 1282 LQFDPEHPKSLQGHLLLNRTTFSTGAHHPQKSLLLPTTDPRPSTNQPSSDAERQHILMAS 1341
Query: 536 LDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFR-----TYKGKGYYAGNPS 590
G L PL + Y RL L + ++ H LNP+A+R T +
Sbjct: 1342 PTGVLAAVQPLSQSTYTRLSALASNLMASVPHHAALNPKAYRLPPTSTRNQVAAVDISVG 1401
Query: 591 RGIIDGSLVWKFLQLSLGERLEICKKIG 618
R ++DGSL+ ++ +L+ G R E+ + G
Sbjct: 1402 RAVVDGSLLARWAELASGRRAEVAGRAG 1429
>gi|297722899|ref|NP_001173813.1| Os04g0252200 [Oryza sativa Japonica Group]
gi|255675253|dbj|BAH92541.1| Os04g0252200, partial [Oryza sativa Japonica Group]
Length = 432
Score = 186 bits (472), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 119/392 (30%), Positives = 195/392 (49%), Gaps = 54/392 (13%)
Query: 242 WEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFD 301
WE ++ P+ +E+ L ++ V++ + T +A+GT Y EDV RGR+LLF
Sbjct: 86 WE--TKSTIPMQLFENALTVRIVTL-HNTTTKENETLLAIGTAYVLGEDVAARGRVLLFS 142
Query: 302 IIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGI 361
+ ++N + +Y+KE KG V+A+ + G L+ A G KI + + +LT +
Sbjct: 143 FTK------SENSQNLVTEVYSKESKGAVSAVASLQGHLLIASGPKITLNKWTGAELTAV 196
Query: 362 AFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAG 421
AF D +++ S+ VKN +L GD +SI L ++ + LSL+A+D+ G
Sbjct: 197 AFYDAPLHVVSLNIVKNFVLFGDIHKSIYFLSWKEQGSQLSLLAKDF--------GSLDC 248
Query: 422 NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMY 481
+ +IDGS ++ + SD DKNV +F Y
Sbjct: 249 FATEFLIDGS--------------------------------TLSLVASDSDKNVQIFYY 276
Query: 482 QPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALG 541
P+ ES G +L+ + +FH+G H+ F +++ P+ + +RF + +LDG +G
Sbjct: 277 APKMVESWKGQKLLSRAEFHVGAHITKFLRLQMLPTQ-GLSSEKTNRFALLFGNLDGGIG 335
Query: 542 FFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTY--KGKGYYAGNPSRGIIDGSLV 599
P+ E +RRL LQ +V H GLNPR+FR + GKG+ G IID L+
Sbjct: 336 CIAPIDELTFRRLQSLQRKLVDAVPHVCGLNPRSFRQFHSNGKGHRPG--PDNIIDFELL 393
Query: 600 WKFLQLSLGERLEICKKIGSKHNDILDELYDI 631
+ LSL E+L++ ++IG+ + IL DI
Sbjct: 394 AHYEMLSLDEQLDVAQQIGTTRSQILSNFSDI 425
>gi|242075246|ref|XP_002447559.1| hypothetical protein SORBIDRAFT_06g003570 [Sorghum bicolor]
gi|241938742|gb|EES11887.1| hypothetical protein SORBIDRAFT_06g003570 [Sorghum bicolor]
Length = 389
Score = 186 bits (472), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 118/389 (30%), Positives = 192/389 (49%), Gaps = 50/389 (12%)
Query: 242 WEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFD 301
WE + P+ +E+ L ++ V+++ T +A+GT Y EDV RGR+LL+
Sbjct: 43 WE--TRFTIPMQSFENALTVRIVTLQNTSTKEN-ETLMAIGTAYVQGEDVAARGRVLLYS 99
Query: 302 IIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGI 361
++N + +Y+KE KG V+A+ + G L+ A G KI + + ++LT +
Sbjct: 100 F------SRSENSQNLVTEVYSKESKGAVSAVASLQGHLLIASGPKITLNKWTGSELTAV 153
Query: 362 AFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAG 421
AF D +++ S+ VKN +L GD +SI L ++ + L+L+A+D+ G
Sbjct: 154 AFYDAPLHVVSLNIVKNFVLFGDIHKSIYFLSWKEQGSQLNLLAKDF--------GSLDC 205
Query: 422 NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMY 481
+ +IDGS ++ ++SD DKNV +F Y
Sbjct: 206 FATEFLIDGS--------------------------------TLSLVVSDSDKNVQIFYY 233
Query: 482 QPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALG 541
P+ ES G +L+ + +FH+G HV+ F +++ P+ A +RF + +LDG +G
Sbjct: 234 APKMVESWKGQKLLSRAEFHVGAHVSKFLRLQMLPTQ-GLASEKTNRFALVFGTLDGGIG 292
Query: 542 FFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWK 601
P+ E +RRL LQ +V H GLNPR+FR +K G IID L+
Sbjct: 293 CIAPVDELTFRRLQSLQRKLVDAVPHVCGLNPRSFRHFKSNGKAHRPGPDNIIDFELLSH 352
Query: 602 FLQLSLGERLEICKKIGSKHNDILDELYD 630
+ LSL E+LEI ++IG+ + IL D
Sbjct: 353 YEMLSLEEQLEIAQQIGTTRSQILSNFSD 381
>gi|347838999|emb|CCD53571.1| similar to Cleavage and polyadenylation specificity factor subunit 1
[Botryotinia fuckeliana]
Length = 1447
Score = 186 bits (471), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 171/655 (26%), Positives = 283/655 (43%), Gaps = 89/655 (13%)
Query: 10 SAMDETIVQELLTVSLGLH-GNRPLLLVR-TQHELLIYQAFRHPKGALKLRFKKLKVLFV 67
SA ET+ E+L +LG P L++R + +L IY+ FR + L L+ L +
Sbjct: 836 SAARETLT-EILVANLGDSVSQSPYLILRPSNDDLTIYEPFRVKSASPDLLSSTLQFLKI 894
Query: 68 SDR------SKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRA 121
+ A EQ + MR SN+ GY VF+ G P+++ +S+ +
Sbjct: 895 QNTHLTQAPDVSAEEQVDGAQQTSDKPMRAISNLGGYSTVFMPGGSPSFIIKSSKTAPKV 954
Query: 122 HPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSY-DAPWPVRKVPLKCTP 180
+ G V +L+ FH C RGF+Y + + R++ P + ++ D +RK+ +
Sbjct: 955 LSLQGTG-VRSLSSFHTEGCDRGFIYASTEGIARVAQFPPNTTFADIGMALRKIEIGEDV 1013
Query: 181 HFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPF 240
H +AYH +TY I TST TD+ + +D T ++ + P + + + L SP
Sbjct: 1014 HAVAYHPPLQTYVIGTSTF---TDF-ELPKDDDHRKTWQEENIALKPSIEKSFLKLVSPV 1069
Query: 241 SWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLF 300
+W I L E + C+K +++ + + I +GT ED+ GR+ ++
Sbjct: 1070 NWSVIDA--IELEPCELITCIKTMNLVISEVTNERKHLIVVGTAITKGEDLATTGRLYVY 1127
Query: 301 DIIEVVPEPGQPLTKNKIKMIYA----KEQKGPVTAICHVA--GFLVTAVGQKIYIWQLK 354
D++ VVPEP +P T K+K+I + + GPVT + + GF++ A GQK + LK
Sbjct: 1128 DVVTVVPEPDRPETNKKLKLISSEIITRGAGGPVTGLSEIGTQGFMLVAQGQKCMVRGLK 1187
Query: 355 DNDLT-GIAFIDTEVYIASMVSV--KNLILVGDYARSIALLRYQPEYRTLSLVARDYKPT 411
++ +AF+D Y+ S+ + L ++ D + + Y E
Sbjct: 1188 EDGTNLPVAFMDMNCYVTSVKELPGTGLCVMADALKGVWFAGYTEE-------------- 1233
Query: 412 QPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISD 471
P R ++ G K L C D+L + + + +D
Sbjct: 1234 -----------PYRMLLFGKSAAKMEVL--------CA-------DLLPDGKDLFIVAAD 1267
Query: 472 KDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQH----------------VNTFFKIRCK 515
+ N+ + Y PE +S GH L+ +T F LG H + T
Sbjct: 1268 ANGNLHIMQYDPEHPKSLQGHLLLHRTTFSLGAHHPTTMTLLPTTRPLPQLTTAPSPSPD 1327
Query: 516 PSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRA 575
PS D P L S G L PL E YRR L + + H GLNPRA
Sbjct: 1328 PSPQEDTPSPSQPLL--LTSRTGTLALLSPLTESQYRRFGTLVSHLTNTLYHPCGLNPRA 1385
Query: 576 FRTYK--GKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEL 628
+R + +G G R IIDG ++ ++++L R E+ ++G ++ DEL
Sbjct: 1386 YRIDRDANEGIVGG---RTIIDGGVLGRWMELGSQRRGEVAGRVGVDVLELRDEL 1437
>gi|346319828|gb|EGX89429.1| protein CFT1 [Cordyceps militaris CM01]
Length = 1452
Score = 185 bits (470), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 172/652 (26%), Positives = 297/652 (45%), Gaps = 77/652 (11%)
Query: 10 SAMDETIVQELLTVSLG-LHGNRPLLLVR-TQHELLIYQAFRH--PKGALK-----LRFK 60
+A ET+ E+L LG + P L++R +L +Y+ R+ P + L FK
Sbjct: 839 AAAKETLT-EILVADLGDVVAKSPYLILRHDTDDLTLYEPVRYHEPNSSSAPLSDTLFFK 897
Query: 61 KLKVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELR 120
K ++ + ++++ + R ++ +N+ GY VFL G P+++ +++ R
Sbjct: 898 KSTNSTIAKSAPASDKEDDETQQKRFVPLQLCANVGGYSAVFLSGDSPSFILKSAKSIPR 957
Query: 121 AHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSY-DAPWPVRKVPLKCT 179
+ G V ++ FH C RGF+Y + K R+S LPT +Y + V+K+PL C
Sbjct: 958 IVGLQGQG-VQGMSTFHTEGCDRGFIYADTKGIARVSQLPTDTNYAELGISVKKIPLDCD 1016
Query: 180 PHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSP 239
+ +++H T TY ST EP ++ +D R++ P + + + L SP
Sbjct: 1017 VNRVSFHSHTATYIAACSTREP----FELPKDDDYHKEWARETVNFAPTMPRGILKLISP 1072
Query: 240 FSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILL 299
+W I + L E + + + +E R +A+G+ ED+ RGR+ +
Sbjct: 1073 AAWTVI--HSLDLESCETIESMMALHLEISEETKERRMVVAVGSAICKGEDLPTRGRVQV 1130
Query: 300 FDIIEVVPEPGQPLTKNKIKMIYAKEQ--KGPVTAICHV--AGFLVTAVGQKIYIWQLK- 354
FDI+ V+PEPG+P T ++K++ AKE+ +G VT++ + +G L+ A GQK + L+
Sbjct: 1131 FDIVTVIPEPGRPETNKRLKLL-AKEELPRGGVTSLSEIGTSGLLLIAQGQKCMVRGLRE 1189
Query: 355 DNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPN 414
D L +AF+D +I + V+ L G L L+A +K
Sbjct: 1190 DGGLLPVAFLDMNCHI---LGVRELRGTG-----------------LCLMADAFKGM--- 1226
Query: 415 SKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDK 474
++A G + +K L S G+ I D L + + + D D
Sbjct: 1227 ---WFA-----GYTEEPYTFKVLGKSGGQ-------IPMLVADFLPDGEDLNMIGVDADG 1271
Query: 475 NVVLFMYQPEARESNGGHRLIKKTDFHLG--QHVNTFFKIRCKPSSI---SDAPGARSRF 529
++ +F + P+ +S GH L+ +T F L + T R P+S GA +
Sbjct: 1272 DLHVFEFNPDHPKSLQGHLLLHRTTFSLSPNEPTTTVLLERTIPASQPQPQGTTGAETPH 1331
Query: 530 LTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYA--- 586
+ G L PL E YRRLL L N ++ GGL+P+A R +G+G +
Sbjct: 1332 TLLLSCPTGQLAALTPLSESAYRRLLSLANQLMPAVVPYGGLHPKAHRLPEGRGAQSHAR 1391
Query: 587 ------GNPSRGIIDGSLVWKFLQLSLGERLEICKKIG-SKHNDILDELYDI 631
R I+DG+++ ++ +L +R E+ K G N++ DEL +
Sbjct: 1392 AVGVETAASGRMIVDGAVLARWTELGAAKRAEMATKSGYDDLNEMRDELEGV 1443
>gi|154320778|ref|XP_001559705.1| hypothetical protein BC1G_01861 [Botryotinia fuckeliana B05.10]
Length = 1153
Score = 185 bits (469), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 171/655 (26%), Positives = 283/655 (43%), Gaps = 89/655 (13%)
Query: 10 SAMDETIVQELLTVSLGLH-GNRPLLLVR-TQHELLIYQAFRHPKGALKLRFKKLKVLFV 67
SA ET+ E+L +LG P L++R + +L IY+ FR + L L+ L +
Sbjct: 542 SAARETLT-EILVANLGDSVSQSPYLILRPSNDDLTIYEPFRVKSASPDLLSSTLQFLKI 600
Query: 68 SDR------SKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRA 121
+ A EQ + MR SN+ GY VF+ G P+++ +S+ +
Sbjct: 601 QNTHLTQAPDVSAEEQVDGAQQTSDKPMRAISNLGGYSTVFMPGGSPSFIIKSSKTAPKV 660
Query: 122 HPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSY-DAPWPVRKVPLKCTP 180
+ G V +L+ FH C RGF+Y + + R++ P + ++ D +RK+ +
Sbjct: 661 LSLQGTG-VRSLSSFHTEGCDRGFIYASTEGIARVAQFPPNTTFADIGMALRKIEIGEDV 719
Query: 181 HFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPF 240
H +AYH +TY I TST TD+ + +D T ++ + P + + + L SP
Sbjct: 720 HAVAYHPPLQTYVIGTSTF---TDF-ELPKDDDHRKTWQEENIALKPSIEKSFLKLVSPV 775
Query: 241 SWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLF 300
+W I L E + C+K +++ + + I +GT ED+ GR+ ++
Sbjct: 776 NWSVIDA--IELEPCELITCIKTMNLVISEVTNERKHLIVVGTAITKGEDLATTGRLYVY 833
Query: 301 DIIEVVPEPGQPLTKNKIKMIYA----KEQKGPVTAICHVA--GFLVTAVGQKIYIWQLK 354
D++ VVPEP +P T K+K+I + + GPVT + + GF++ A GQK + LK
Sbjct: 834 DVVTVVPEPDRPETNKKLKLISSEIITRGAGGPVTGLSEIGTQGFMLVAQGQKCMVRGLK 893
Query: 355 DNDLT-GIAFIDTEVYIASMVSV--KNLILVGDYARSIALLRYQPEYRTLSLVARDYKPT 411
++ +AF+D Y+ S+ + L ++ D + + Y E
Sbjct: 894 EDGTNLPVAFMDMNCYVTSVKELPGTGLCVMADALKGVWFAGYTEE-------------- 939
Query: 412 QPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISD 471
P R ++ G K L C D+L + + + +D
Sbjct: 940 -----------PYRMLLFGKSAAKMEVL--------CA-------DLLPDGKDLFIVAAD 973
Query: 472 KDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQH----------------VNTFFKIRCK 515
+ N+ + Y PE +S GH L+ +T F LG H + T
Sbjct: 974 ANGNLHIMQYDPEHPKSLQGHLLLHRTTFSLGAHHPTTMTLLPTTRPLPQLTTAPSPSPD 1033
Query: 516 PSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRA 575
PS D P L S G L PL E YRR L + + H GLNPRA
Sbjct: 1034 PSPQEDTPSPSQPLL--LTSRTGTLALLSPLTESQYRRFGTLVSHLTNTLYHPCGLNPRA 1091
Query: 576 FRTYK--GKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEL 628
+R + +G G R IIDG ++ ++++L R E+ ++G ++ DEL
Sbjct: 1092 YRIDRDANEGIVGG---RTIIDGGVLGRWMELGSQRRGEVAGRVGVDVLELRDEL 1143
>gi|169603229|ref|XP_001795036.1| hypothetical protein SNOG_04622 [Phaeosphaeria nodorum SN15]
gi|160706354|gb|EAT88382.2| hypothetical protein SNOG_04622 [Phaeosphaeria nodorum SN15]
Length = 1338
Score = 185 bits (469), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 155/560 (27%), Positives = 244/560 (43%), Gaps = 66/560 (11%)
Query: 72 KRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVS 131
+ A ++PG S + NI GY V G PA++ S R ++ PV
Sbjct: 831 EEAADEPGFE-----STLLALDNINGYSTVIQRGRSPAFILKESSSAPRVIGLS-GNPVK 884
Query: 132 TLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYD-APWPVRKVPLKCTPHFLAYHLETK 190
+L FH +C RGF Y ++ LRIS LP Y W R++P+ H LAYH
Sbjct: 885 SLTRFHTSSCQRGFAYLDSTDTLRISQLPPSTHYGHLGWAARRMPMDAEVHALAYH---P 941
Query: 191 TYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNF 250
+ V T +P + Y + D P++ P V + + +W I
Sbjct: 942 SGLYVIGTGQP--EEYTLDPNDTFHYELPKEETSFKPKVEHGIIKVMDEKTWTVI--DTH 997
Query: 251 PLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPG 310
L E +LC+K +++E T + IA+GT ED+ +G I +F++I VVPEP
Sbjct: 998 VLDPQEVILCIKTLNLEVSETTHQRKDVIAVGTAIVLGEDLATKGNIRIFEVITVVPEPD 1057
Query: 311 QPLTKNKIKMIYAKEQKGPVTAICHVA--GFLVTAVGQKIYIWQLK-DNDLTGIAFIDTE 367
P T ++K+I E KG V+AI + GFL+ A GQK + LK D L +AF+D +
Sbjct: 1058 HPETNKRLKLIVKDEVKGTVSAISDLGTQGFLIMAQGQKSMVRGLKEDGTLLPVAFMDMQ 1117
Query: 368 VYIASMVSVKN--LILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSR 425
Y+ ++ ++ N ++L+GD + Y E + L R SK +
Sbjct: 1118 CYVTTLKTLPNTGMLLMGDAYKGAWFTGYTEEPYKMMLFGR--------SKHHLE----- 1164
Query: 426 GIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEA 485
+ FL E+L I +++D D N+ + + P+
Sbjct: 1165 -----CITADFLPFE--EQLHI--------------------IVADADMNLQVLQFDPDH 1197
Query: 486 RESNGGHRLIKKTDFHLGQHVNT--FFKIRCKPSSISDAPGARSRFLTWY----ASLDGA 539
+S GG RL++K+ FH G +T + R + S+ + + L + S G
Sbjct: 1198 PKSMGGTRLLQKSTFHTGHFPSTMHLLQSRLHMPTASEFTTSTTSSLPLHQILCTSQSGT 1257
Query: 540 LGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYK-GKGYYAGNPSRGIIDGSL 598
L PL E +YRRL L + GLN +AFR +G + R ++DG L
Sbjct: 1258 LALITPLSESSYRRLSGLATHLQQFLDSPCGLNGKAFRAADVMEGGWDAGTQRAMLDGGL 1317
Query: 599 VWKFLQLSLGERLEICKKIG 618
+ ++ +L R E K+G
Sbjct: 1318 LMRWGELGEQRRREGLGKVG 1337
>gi|302506529|ref|XP_003015221.1| hypothetical protein ARB_06344 [Arthroderma benhamiae CBS 112371]
gi|291178793|gb|EFE34581.1| hypothetical protein ARB_06344 [Arthroderma benhamiae CBS 112371]
Length = 1370
Score = 184 bits (466), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 155/611 (25%), Positives = 270/611 (44%), Gaps = 100/611 (16%)
Query: 32 PLLLVRTQHE-LLIYQAFRHP--KGALKLRF-KKLKVLFVSDRSKRANEQPGLPRGVRIS 87
P +++RT+H+ L++Y+ +R G LRF K + + + R+ + Q
Sbjct: 788 PYMILRTKHDDLVLYEPYRTAGESGQSGLRFLKAVNHVVMGPRTDQGVNQDINRSSSSCK 847
Query: 88 QMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDG-PVSTLAPFHNVNCPRGFL 146
+R ++ GY+ VF+ G P ++ ++ R H + + G V +L+ FH C RGF
Sbjct: 848 LLRALPDVCGYRTVFMSGHSPCFILKSAIA--RPHVLRLRGKAVQSLSGFHIAACERGFA 905
Query: 147 YFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYY 206
Y + + L + Y ++ Y I TS E +
Sbjct: 906 YVD----------------------EDITLGEQVDSIVYSSASECYVIGTSAKED----F 939
Query: 207 KFNGEDKELVTDPRDS--RFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNV 264
K ED E T+ R+ F+P L + + L P +W I + L E + C++ +
Sbjct: 940 KLP-EDDESHTEWRNEFITFLPQL-ERGTIKLLEPRNWSTI--DSHELEPAERITCIEVI 995
Query: 265 SMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAK 324
+E + + +G++ ED+ +G I +F++I+VVPEP QP K+K+ +
Sbjct: 996 RLEISELTHERKDMVVVGSSIVKGEDIVPKGFIRVFEVIDVVPEPDQPEKNKKLKLFAKE 1055
Query: 325 EQKGPVTAICHVA--GFLVTAVGQKIYIWQLK-DNDLTGIAFIDTEVYIASMVSVK--NL 379
E KG VTA+ + GFL+ A GQK + LK D L +AF DT+ Y+ + +K +
Sbjct: 1056 EVKGAVTALSGIGGQGFLIVAQGQKCMVRGLKEDGSLLPVAFKDTQCYVNVLKELKGTGM 1115
Query: 380 ILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQL 439
++GD + + + Y E L L ++ N + ++D
Sbjct: 1116 CIIGDAFKGLWFIGYSEEPYKLDLFGKE--------------NENLAVVDA--------- 1152
Query: 440 SLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTD 499
D L + + + +++D D N+ + Y PE S+ G RL+ ++
Sbjct: 1153 -----------------DFLPDGNKLYILVADDDCNLHVLQYDPEDPSSSKGDRLLHRSV 1195
Query: 500 FHLGQHVNTFFKI---RCKPSSISDA--------PGARSRFLTWYASLDGALGFFLPLPE 548
FH G +T + PSS D P ++ + L + + G++ PL E
Sbjct: 1196 FHTGHFASTMTLLPHGGHTPSSPVDEDAMDTDSPPPSKYQILMTFQT--GSIAIITPLGE 1253
Query: 549 KNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLG 608
+YRRLL LQ+ +V H LNPR +R + G RG+IDG+L+ ++L +
Sbjct: 1254 DSYRRLLALQSQLVNALEHPCSLNPRGYRAVESDGMGG---QRGMIDGNLLLRWLDMGAQ 1310
Query: 609 ERLEICKKIGS 619
+ EI ++G+
Sbjct: 1311 RKAEIAGRVGA 1321
>gi|167526060|ref|XP_001747364.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163774199|gb|EDQ87831.1| predicted protein [Monosiga brevicollis MX1]
Length = 1324
Score = 182 bits (463), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 167/632 (26%), Positives = 275/632 (43%), Gaps = 101/632 (15%)
Query: 13 DETIVQELLTVSLGLHGNRPLLLVRT-QHELLIYQAFRHPKGALKLRFKKLKVLFVSDRS 71
+E IV+ LL + LG G RP LL RT H LL+Y+ F V V++ S
Sbjct: 748 EEYIVETLL-IGLG-QGQRPHLLARTSDHHLLMYEVFP-------------VVPSVTEAS 792
Query: 72 KRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRA---HPMTIDG 128
R +++ F NIAG GV + GP P L + +L+A P+ ++
Sbjct: 793 VR--------------RLKPFQNIAGCDGVCVTGPRP--LLVACGHQLKAITIVPLALED 836
Query: 129 PVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLE 188
V T P H + GF+YF L + P L + R+ L T +A+ L+
Sbjct: 837 AVKTFHPLHMDDVENGFIYFTKAGTLCCATAPDGLMLNRGVLARRAVLGRTIQKIAFDLD 896
Query: 189 TKTYCIVTSTAEPSTDYYKFNG-----EDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWE 243
++ ++ P + N E + P + + + P F + L SP S +
Sbjct: 897 SRLAALLLMEPRPELKPSRGNNDPPSNELPNISYRPDEPKALTPF---FQLQLLSPKSMK 953
Query: 244 EIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDII 303
+P T HV V + +G + YIA+G + T G D
Sbjct: 954 LLPDTRIEYDLHHHVTSFAAVRLSSSLNSTGKQNYIAVGVTLLEGQRATTTG---FVDFY 1010
Query: 304 EVVPEPGQPLTKNKIKMIYAKEQKGPVTAI-CHVAGFLVTAVGQ----KIYIWQLKD-ND 357
V G+ + +++ + +Q G V+A+ C GFLV AVGQ KIY+W +D +
Sbjct: 1011 TVDVHDGK---ETRLEKRASCKQPGCVSAMDCTEDGFLVAAVGQRLGSKIYVWNFQDGQE 1067
Query: 358 LTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKG 417
L +A+ + +Y + + +KNL +VGDY + LLR+ KG
Sbjct: 1068 LQPLAYFEAGIYTSCIRVIKNLAIVGDYESGVQLLRFS------------------RQKG 1109
Query: 418 YYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGS---KHNDILDEF----SSMGFMIS 470
RG R K+G+ K N +F S + +
Sbjct: 1110 LQQMPVFRGT--------------KHRFYSLVKVGADPHKSNCYCADFVVRESDLAMIYG 1155
Query: 471 DKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPG----AR 526
D D N+V Y ++ ++ GG L++ +FHLG ++ +++ P + APG A+
Sbjct: 1156 DADGNLVALDYDADSPDTRGGRILVRSANFHLGTRLSAMLRLQAAP--VVRAPGGLAEAQ 1213
Query: 527 SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYA 586
+ ++G G +PL E YRRL MLQ +V+H+S GL+P FR +K +
Sbjct: 1214 KCHVVHTFGIEGQQGVVIPLHEAEYRRLEMLQKKLVSHSS-LAGLHPFQFRAFKSSIWRP 1272
Query: 587 GNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 618
+ ++GI+DG+L+ ++ L E+L++ +++G
Sbjct: 1273 RSFAQGILDGALLRQYFCLGRREQLDVAEQLG 1304
>gi|148886829|sp|A2R919.1|CFT1_ASPNC RecName: Full=Protein cft1; AltName: Full=Cleavage factor two protein
1
gi|134083776|emb|CAK47110.1| unnamed protein product [Aspergillus niger]
Length = 1383
Score = 181 bits (459), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 170/647 (26%), Positives = 285/647 (44%), Gaps = 131/647 (20%)
Query: 32 PLLLVRTQ-HELLIYQAFRHPKGALK----LRFKKLKVLFVSDRSKRANEQ-PGLPRGVR 85
P L++R++ +L+IY+ F G ++ L+F SK N P +P GV
Sbjct: 818 PYLILRSETDDLIIYKPFVVSTGPVEGIHSLKF-----------SKETNSVLPRIPPGVS 866
Query: 86 ISQ----------MRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAP 135
+Q +R +I+G VF+ G ++ TS H + + G S
Sbjct: 867 STQPSGSDYRARPLRILPDISGLSAVFMPGASAGFIIRTSASA--PHFLRLRGENSR--- 921
Query: 136 FHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIV 195
S +R LP +D W +++V L LAY + Y +
Sbjct: 922 ---------------SSTVRFCKLPPMTRFDYQWTLKRVHLGEQVDHLAYSTSSGMYVLG 966
Query: 196 TSTAEPSTDYYKFNGEDKELVTDPRDS--RFIPPLVSQFHVSLFSPFSWEEIPQTN---- 249
T A TD+ ED EL + R+ F P F + L W+ Q
Sbjct: 967 TCHA---TDFKL--PEDDELHPEWRNEAISFFPSARGSF-IKLV----WDHHLQRQDSVI 1016
Query: 250 --FPLHEW-----EHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDI 302
F LH + E+V+ +KN+S+E + I +GT + ED+ RG I +F++
Sbjct: 1017 LIFHLHSFSLGADEYVMAIKNISLEVSENTHERKDMIVVGTAFARGEDIPSRGCIYVFEV 1076
Query: 303 IEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA--GFLVTAVGQKIYIWQLK-DNDLT 359
++VVP+P P T K+K+I + KG VTA+ + GF++ A GQK + LK D L
Sbjct: 1077 VQVVPDPDHPETDRKLKLIGKEPVKGAVTALSEIGGQGFVLVAQGQKCMVRGLKEDGSLL 1136
Query: 360 GIAFIDTEVYIASMVSVK--NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKG 417
+AF+D + Y++ + +K + ++GD + + Y E +SL A+D
Sbjct: 1137 PVAFMDMQCYVSVVKELKGTGMCILGDAVKGVWFAGYSEEPYKMSLFAKDL--------- 1187
Query: 418 YYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVV 477
+ LE+C + L + + +++D D N+
Sbjct: 1188 -------------------------DYLEVCAA------EFLPDGKRLFIVVADSDCNIH 1216
Query: 478 LFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI-RCKPSS---ISDAPG-----ARSR 528
+ Y PE +S+ G RL+ ++ FH+G +T + R SS +S + G
Sbjct: 1217 VLQYDPEDPKSSNGDRLLSRSKFHMGNFASTLTLLPRTMVSSEKMVSSSDGMDIDNQSPL 1276
Query: 529 FLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGN 588
+ +G+LG +PE++YRRL LQ+ + H GLNPRAFR + G
Sbjct: 1277 HQVLMTTQNGSLGLITCIPEESYRRLSALQSQLTNTLEHPCGLNPRAFRAVESD----GT 1332
Query: 589 PSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALS 635
RG++DG+L++K++ +S + EI ++G++ +I D+EA+S
Sbjct: 1333 AGRGMLDGNLLFKWIDMSKQRKTEIAGRVGAREWEI---KADLEAIS 1376
>gi|344305212|gb|EGW35444.1| pre-mRNA 3'-end processing factor CF II [Spathaspora passalidarum
NRRL Y-27907]
Length = 1348
Score = 181 bits (458), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 137/576 (23%), Positives = 260/576 (45%), Gaps = 59/576 (10%)
Query: 56 KLRFKKLKVLFVSDRSKRANEQP--GLPRGVRISQ-MRYFSNIAGYQGVFLCGPHPAWLF 112
KL F +F ++ R P P G I + + YF N+ GY +F+ G P +
Sbjct: 801 KLYFDGENYIFKKEKDLRITGAPENAYPLGTTIERRLVYFPNLNGYTSIFVTGIIPYLIM 860
Query: 113 LTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVR 172
R + P +++ F + G ++ + RI L +Y+ WP+R
Sbjct: 861 KPMHSIPRIFQFS-KIPALSISAFSDSKIKNGLIFLDNSKNARICELSLDFTYEFNWPMR 919
Query: 173 KVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQF 232
++ + + + YH + TY + T P Y ED +L+ + P+ +
Sbjct: 920 QIHIGDSIKSITYHETSNTYVVSTFREIP----YDGLDEDGKLIVGTLPDKTPRPVAYKG 975
Query: 233 HVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRG---YIALGTNYNYSE 289
+ + SP +W I L + E + ++++ ++ ++ + +I +G+ +E
Sbjct: 976 SIKMISPLNWTVI--DTIELDDTEVAMNVQSMMLDVGSSMKKFKNKKEFIVIGSGKYRNE 1033
Query: 290 DVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIY 349
D+ G +F+I+++VPEPG+P T +K K ++ ++ +G VT+IC ++G L+ A GQK+
Sbjct: 1034 DLVANGSFKIFEIVDIVPEPGKPETNHKFKEVFQEDTRGAVTSICGLSGRLLIAQGQKVI 1093
Query: 350 IWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYK 409
+ ++D+ + +AF+DT VY++ S+ NL+++GD +S L+ + E + ++ +D
Sbjct: 1094 VRDVQDDGVVPVAFLDTAVYVSESKSLGNLLMLGDPLKSCWLVGFDAEPFRMIMLGKD-- 1151
Query: 410 PTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMI 469
L +S G+ + +K DI +I
Sbjct: 1152 ------------------------LHHLNVSCGDFI-------TKDEDIY-------MLI 1173
Query: 470 SDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARS-- 527
+D + + L Y P+ +S G RLI K+ F + V+ K+ SS + S
Sbjct: 1174 ADNNNILHLIQYDPDDPQSLNGQRLISKSAFEIESTVSCMRKLPKIESSFEKSEIKFSPI 1233
Query: 528 -RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYA 586
F ++ DG+ P+ E +YRR+ +LQ + H GLNPR R + G
Sbjct: 1234 DEFQIIGSTSDGSFFNVFPVDESSYRRMYILQQQLTDKEYHYCGLNPRLNR-FGGAIELR 1292
Query: 587 GNP--SRGIIDGSLVWKFLQLSLGERLEICKKIGSK 620
N ++ I+D L+ ++ QL+ + + K+ +K
Sbjct: 1293 DNETNTKPILDFGLIKRYAQLNEDRKRNLASKVSAK 1328
>gi|448530371|ref|XP_003870046.1| mRNA cleavage and polyadenylation factor [Candida orthopsilosis Co
90-125]
gi|380354400|emb|CCG23915.1| mRNA cleavage and polyadenylation factor [Candida orthopsilosis]
Length = 1327
Score = 180 bits (457), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 153/615 (24%), Positives = 275/615 (44%), Gaps = 78/615 (12%)
Query: 28 HGNRPLLLVRTQHELLIYQAFRHPKGALKLRFKKLKVLFVSDRSKRANEQPGLPRGVRIS 87
H L ++ E+L+Y+ F + + FKK K L ++ + A G +
Sbjct: 775 HKEEYLTILTISGEVLMYKLFYDGENYM---FKKEKDLKITGAPENA-----FNLGTMVE 826
Query: 88 Q-MRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFL 146
+ + YF N+ GY +F+ G P + + R + P +++ F + G +
Sbjct: 827 RRLVYFPNLNGYTSIFVAGVIPFLIIKSCHSIPRIFQFS-KIPAVSISAFSDSKIKNGLI 885
Query: 147 YFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYY 206
+ + RI L +Y+ P+R+V + + +AYH ++ T I T P Y
Sbjct: 886 FLDNNQNARICELSLDYNYEFNLPIRRVHIGESIRSVAYHEQSDTVVISTFKEIP---YN 942
Query: 207 KFNGEDKELVTDPRDSRFIPPLVS-QFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVS 265
+ E K + +D PP S + + L SPF+W+ I L + E + +K++
Sbjct: 943 CVDEEGKPIAGVLKDK---PPATSFKGSIKLVSPFNWKVI--DTIELQDNEVGMAIKSMV 997
Query: 266 MEYEGTLSGL---RGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIY 322
++ ++ R YI +GT ED+ G ++DII+++PEPG+P T +K K I+
Sbjct: 998 LDVGSSMKKFKTKREYIVVGTGKLRMEDLAANGSFKIYDIIDIIPEPGKPETNHKFKEIF 1057
Query: 323 AKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILV 382
++ +G VT++C ++G + GQK+ + L+D+ + +AF+DT VY++ S NL L+
Sbjct: 1058 QEDTRGAVTSVCDLSGRFLVGQGQKVIVRDLEDDGVVPVAFLDTPVYVSEAKSFGNLFLL 1117
Query: 383 GDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLG 442
GD +SI L+ ++ + + ++ +D +
Sbjct: 1118 GDPLKSIWLVGFEADPFRMVMLGKDRQHL------------------------------- 1146
Query: 443 ERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHL 502
R+E C K +I +++D + ++ L + P+ +S G LI K F
Sbjct: 1147 -RVE-CADFIVKDEEIF-------ILVADVNNSLHLIQFDPDDPKSINGTILINKASFET 1197
Query: 503 GQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMV 562
+R P G + T +++DGA P+ E YRR+ ++Q +
Sbjct: 1198 NSQTTC---LRSVPK------GETGDYQTIGSTIDGAFFNVFPVNESTYRRMYIVQQQIS 1248
Query: 563 THTSHTGGLNPRAFRTYKGKGYYAGNPSRG--IIDGSLVWKFLQLSLGERLEICKKI--- 617
H GLNPR R + G N + I+D +L+ +F +L+L + I KI
Sbjct: 1249 DKEYHYCGLNPRLNR-FGGAVQIRDNDTNAKPILDYNLIKEFAKLNLDRQKNITTKINIK 1307
Query: 618 GSKHNDILDELYDIE 632
GS H DI +L ++E
Sbjct: 1308 GSAH-DIWKDLIELE 1321
>gi|400597740|gb|EJP65470.1| CPSF A subunit region [Beauveria bassiana ARSEF 2860]
Length = 1444
Score = 179 bits (453), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 173/667 (25%), Positives = 297/667 (44%), Gaps = 89/667 (13%)
Query: 2 GNFRSHSPSAMDETIVQELLTVSLG-LHGNRPLLLVR-TQHELLIYQAFRH-------PK 52
NF +A + + E+L LG + P L++R +L +Y+ R+ P
Sbjct: 824 ANFTGRKAAAKER--LTEILVADLGDVVSKSPFLILRHDTDDLTLYEPVRYQEPNSSSPP 881
Query: 53 GALKLRFKKLKVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLF 112
L FKK ++ + +++ + R ++ N+ GY VFL G P+++
Sbjct: 882 LTDTLFFKKSANATIAKSASAFDKEEDETQQRRFVPLQPCGNVGGYSTVFLSGDSPSFVL 941
Query: 113 LTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSY-DAPWPV 171
+++ R + G V ++ FH C RGF+Y + K R+ LPT +Y + V
Sbjct: 942 KSAKSIPRIVGLQGQG-VQGMSTFHTAGCDRGFIYADTKGIARVCQLPTDTNYAELGISV 1000
Query: 172 RKVPLKCTPHFLAYHLETKTYCIVTSTAEP-----STDYYKFNGEDKELVTDPRDSRFIP 226
+K+PL C + +++H T TY ST EP DY+K +E+V+
Sbjct: 1001 KKIPLDCDVNRVSFHSHTATYIAACSTREPFELPKDDDYHKEWA--REVVS-------FA 1051
Query: 227 PLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYN 286
P + + + L SP +W I + L E + + + +E R +A+G+
Sbjct: 1052 PTMPRGMLKLISPAAWTVI--HSLDLESCETIESMMALHLEISEETKERRMLVAVGSAIC 1109
Query: 287 YSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQ--KGPVTAICHV--AGFLVT 342
ED+ RGR+ +FDI+ V+PEPG+P T ++K+ AKE+ +G VT++ + +G L+
Sbjct: 1110 KGEDLPTRGRVQVFDIVTVIPEPGRPETNKRLKL-QAKEELPRGGVTSLSEIGTSGLLLI 1168
Query: 343 AVGQKIYIWQLK-DNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTL 401
A GQK + L+ D L +AF+D +I + V+ L G L
Sbjct: 1169 AQGQKCMVRGLREDGGLLPVAFLDMNCHI---LGVRELRGTG-----------------L 1208
Query: 402 SLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDE 461
L+A +K ++A G + +K L S G+ I D L +
Sbjct: 1209 CLMADAFKGM------WFA-----GYTEEPYTFKVLGKSGGQ-------IPMLVADFLPD 1250
Query: 462 FSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLG--QHVNTFFKIRCKPSSI 519
+ + D D ++ +F + P+ +S GH LI +T F L + T R P+S
Sbjct: 1251 GEDLSMIGVDADGDLHVFEFDPDHPKSLQGHLLIHRTTFSLSPNEPTTTVLLERTIPASQ 1310
Query: 520 ---SDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAF 576
GA + + G L PL E YRRLL L N ++ GGL+P+A
Sbjct: 1311 PQPKGTTGAETPHTLLLSCPTGQLAALTPLSESAYRRLLSLTNQVLPAVVPHGGLHPKAH 1370
Query: 577 RTYKGKGYYA---------GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDE 627
R +G+G + R I+DG+++ ++ +L +R E+ K G ++D+ +
Sbjct: 1371 RLPEGRGAQSHSRAVGVETAASGRMIVDGAVLARWTELGAAKRAEMALKSG--YDDVHEM 1428
Query: 628 LYDIEAL 634
++E +
Sbjct: 1429 RGELEGV 1435
>gi|254564833|ref|XP_002489527.1| RNA-binding subunit of the mRNA cleavage and polyadenylation factor
[Komagataella pastoris GS115]
gi|238029323|emb|CAY67246.1| RNA-binding subunit of the mRNA cleavage and polyadenylation factor
[Komagataella pastoris GS115]
gi|328349950|emb|CCA36350.1| Protein cft1 [Komagataella pastoris CBS 7435]
Length = 1388
Score = 179 bits (453), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 143/551 (25%), Positives = 246/551 (44%), Gaps = 66/551 (11%)
Query: 81 PRGVRISQ-MRYFSNIAGYQ--GVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFH 137
P+G ++ + + +NI + +F+ G W+ + H T +S A F+
Sbjct: 874 PQGTKLERRLIKLNNIGDSKLSTLFVVGVKSFWITKRHSSSINIHQFTKLSTISC-ARFN 932
Query: 138 NVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTS 197
C G + + R+ +P++L P+R+VP+ CT +A+H ++T+ + T
Sbjct: 933 TSRCKNGLMIIDTNKAARMVEIPSNLELSQRLPIRRVPVGCTIKCVAFHKASRTFVVSTV 992
Query: 198 TAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEH 257
P Y E+ + ++ P + + L SP SW I +F L E EH
Sbjct: 993 EETP----YNCVDEEGNPIVGVDNTINKPASSFKSSIKLISPISWTVID--SFDL-EDEH 1045
Query: 258 VLCLKNVSMEYEGT----LSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPL 313
V C+ SM + L+ Y+ LG + ED+ G+I + D+++++PEPG+P
Sbjct: 1046 V-CMSLKSMTLNTSRIPMFKNLKEYLVLGISNYRMEDLASNGQIRIVDVVDIIPEPGKPE 1104
Query: 314 TKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLK-DNDLTGIAFIDTEVYIAS 372
T +K K I+ KG VT++ ++G V GQKI + L+ DN + F+DT Y++
Sbjct: 1105 TNHKFKDIFQDATKGAVTSVSDISGRFVIGQGQKIIVRDLQEDNTALPVGFVDTPFYVSE 1164
Query: 373 MVSVKNLILVGDYARSIALLRYQPE-YRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGS 431
S +NL+LVGD S+ L+ + E YR +SL +D
Sbjct: 1165 TKSFQNLLLVGDSMHSVILVGFDAEPYRMISL-GKDVA---------------------- 1201
Query: 432 LVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGG 491
+++C D + ++ +I+D+D + L Y PE S G
Sbjct: 1202 ------------HVDVCAA------DFVVFEGNLFIIIADEDGMLHLIQYDPEDPASMQG 1243
Query: 492 HRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWY----ASLDGALGFFLPLP 547
RL++++ F Q+ T K+R + I + F + A+ DG+ P+
Sbjct: 1244 QRLLRRSIFKTNQYT-TCMKMRERKYVIKPPKNQFTNFSEAFEVVAANSDGSFYKVTPIS 1302
Query: 548 EKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSL 607
E YRRL ++Q + +H GLNPR R Y Y N R I+D + +FL+
Sbjct: 1303 EATYRRLYVIQQQIFDQENHKCGLNPRENR-YLSDQYSIPN-QRLILDFDNIRRFLEFDE 1360
Query: 608 GERLEICKKIG 618
++ ++ K+G
Sbjct: 1361 IKKRDLVHKLG 1371
>gi|425765419|gb|EKV04111.1| Cleavage and polyadenylation specificity factor subunit A, putative
[Penicillium digitatum Pd1]
gi|425767100|gb|EKV05682.1| Cleavage and polyadenylation specificity factor subunit A, putative
[Penicillium digitatum PHI26]
Length = 1271
Score = 176 bits (445), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 132/493 (26%), Positives = 228/493 (46%), Gaps = 66/493 (13%)
Query: 154 LRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDK 213
+R LP+ +D W +RKVP++ +FLAY ++TY + TS D+ G+
Sbjct: 822 IRACQLPSQTQFDYSWTLRKVPIEEQVNFLAYSTSSETYVLGTSR---QGDFKLPEGD-- 876
Query: 214 ELVTDPRDSRF-IPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTL 272
EL + R+ P + + + + SP +W I ++PL E V +KNV++E
Sbjct: 877 ELHPEWRNEELSFCPKIPESSIKVVSPKTWTII--DSYPLDPDEQVTAVKNVNIEVSENT 934
Query: 273 SGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTA 332
I +GT ED+ RG I +FD+I+V P+P +P T K+K+I + KG VTA
Sbjct: 935 HERMDLIVVGTAIAKGEDMPARGTIYVFDVIKVAPDPERPETGRKLKLIGKETVKGAVTA 994
Query: 333 ICHVA--GFLVTAVGQKIYIWQLK-DNDLTGIAFIDTEVYIASMVSVK--NLILVGDYAR 387
+ + GF++ A GQK + LK D L +AF+D + Y+ + +K ++++GD +
Sbjct: 995 LSGIGGQGFIIVAQGQKCMVRGLKEDGSLLPVAFMDMQCYVNVVKELKGTGMVILGDAVK 1054
Query: 388 SIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEI 447
+ Y E ++L +D E LE+
Sbjct: 1055 GLWFAGYSEEPYRMTLFGKD----------------------------------PEYLEV 1080
Query: 448 CKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVN 507
D L + + + +++D D N+ + Y PE +S+ G RL+ ++ F+ G +
Sbjct: 1081 VAA------DFLPDGNKLYMLVADSDCNLHVLQYDPEDPKSSNGDRLLSRSKFYTGNFAS 1134
Query: 508 TFFKI---------RCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQ 558
+ + D +++ AS +G+L + E++YRRL LQ
Sbjct: 1135 SVTLLPRTAVSSELTESSEEAMDVDETFAKYQVLIASQNGSLALVTSVAEESYRRLSGLQ 1194
Query: 559 NVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 618
+ ++ H GLN RAFR + G AG RG++DG+L+ +L + + EI ++G
Sbjct: 1195 SQLINTVDHPAGLNARAFRATESDG-AAG---RGMVDGNLLRLWLNMGKQRQAEIAGRVG 1250
Query: 619 SKHNDILDELYDI 631
+ +I +L I
Sbjct: 1251 ATEWEIKADLETI 1263
>gi|393220097|gb|EJD05583.1| cleavage factor protein [Fomitiporia mediterranea MF3/22]
Length = 1450
Score = 175 bits (444), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 156/617 (25%), Positives = 276/617 (44%), Gaps = 95/617 (15%)
Query: 41 ELLIYQAFR-----HPKGALKLRFKKLKVLFVSDRSKRANE-QPGLPRGVRISQMRYFSN 94
+L IYQA P+ ++ K+K + + RS + +P V Q R +
Sbjct: 887 QLAIYQAVAVDKDDFPESTVRTSTLKIKFIKMGTRSFEPRQLEPAEKSSVIAEQRRALRS 946
Query: 95 IAGY----------QGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRG 144
+ + GVF+ G P W+ T + L+ H + F VN
Sbjct: 947 LVPFIVSPNSEKRVSGVFVTGDEPCWIVATDKDGLKIHSCS----------FQTVNSFTS 996
Query: 145 FLYFNAKSELRI----SVLPTHLSY------DAPWPVRKVPLKCTPHFLAYHLETKTYCI 194
+++K + + + P L + P + V + T + + + +
Sbjct: 997 CSVWDSKCDFLMHTDEAFGPCLLGWIPEFNLGTDMPSKTVTVGRT--YTNVTFDAASGLM 1054
Query: 195 VTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHE 254
V S+ P+ + F+ E +L +P P + LF S +
Sbjct: 1055 VASSVVPNP-FTIFDEEGNKL-WEPDAPNINYPHSVMSALELF--HSDLSCVMDGYEFQP 1110
Query: 255 WEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLT 314
E V L V +E + T SG + +I +GT N ED+ +G +F+I+E+VP+P L
Sbjct: 1111 NEFVTALDCVQLETQSTESGTKEFIVVGTTVNRGEDLAVKGVTYVFEIVEIVPDPEGGLA 1170
Query: 315 KN-KIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLK-DNDLTGIAFIDTEVYIAS 372
+ K++++ E KGPVTA+C + G+LV+++GQKI++ L D L G+AF+D VY+ S
Sbjct: 1171 RQFKLRLLCKDEAKGPVTALCGMNGYLVSSMGQKIFVRALDLDERLVGVAFLDVGVYVTS 1230
Query: 373 MVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSL 432
+ ++KNL+++GD +S+ L+ +Q + L +VA++
Sbjct: 1231 LRTIKNLLIIGDAVKSVWLVAFQEDPFKLVIVAKEV------------------------ 1266
Query: 433 VWKFLQLSLGERLEICKKIGSKHNDILDEFSSMG---FMISDKDKNVVLFMYQPEARESN 489
+RL++ D L F+S G +SD++ + L Y ES+
Sbjct: 1267 ----------QRLDVMTA------DFL--FASDGDFYIAVSDEEGIIRLLEYDTSDPESH 1308
Query: 490 GGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPL-PE 548
G L+++T++H +T I + + P AR A++DG++ P+ +
Sbjct: 1309 SGQYLLRRTEYHAQVESHTTVLIARRSQNDGLVPQAR----LISAAVDGSMYALTPVDAD 1364
Query: 549 KNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLG 608
++ +RL +LQ + + H GLNPRAFR + G A ++GI+DG+L+ F QL +
Sbjct: 1365 ESAKRLQLLQGQLTRNMQHVAGLNPRAFRAVRSDG-VARPLTKGILDGNLLAGFEQLPIP 1423
Query: 609 ERLEICKKIGSKHNDIL 625
+ EI + IG+ +L
Sbjct: 1424 RQNEIARPIGTDRLAVL 1440
>gi|402085944|gb|EJT80842.1| cft-1 [Gaeumannomyces graminis var. tritici R3-111a-1]
Length = 1450
Score = 175 bits (443), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 163/627 (25%), Positives = 283/627 (45%), Gaps = 72/627 (11%)
Query: 17 VQELLTVSLGLHG-NRPLLLVR-TQHELLIYQAFRHPKGALKL----RFKKLKVLFVSDR 70
+ EL+ LG P L++R + +L IY+ F+ + + L RF+KL V+ +
Sbjct: 848 LSELMVTDLGDSTFKSPHLILRHSNDDLTIYEPFKIAESSQSLSGTLRFRKLPNPAVA-K 906
Query: 71 SKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGP- 129
S+ P +R +R NIAGY VFL G P++L +S+ R + + GP
Sbjct: 907 SQDTKVSDDAPAPMRRMPLRACGNIAGYSCVFLPGHSPSFLIKSSKSTPRV--IGLQGPG 964
Query: 130 VSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSY-DAPWPVRKVPLKCTPHFLAYHLE 188
V ++PFH C RGF+Y + + R++ +P S+ + V+KVPL +AYH
Sbjct: 965 VRAMSPFHTKGCDRGFIYADYEGVARVAQIPNDCSFAELGLSVKKVPLNMDADGIAYHTP 1024
Query: 189 TKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQT 248
+ Y + S EP ++ +D+ +++ P + + +P +W EI
Sbjct: 1025 SGVYVVTCSFWEP----FELPSDDESHREWAKENITFKPQTEHSVLKVINPVNWSEIWTE 1080
Query: 249 NFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPE 308
F +E +C+K++++E + + R I +GT ED+ RG + ++D+ VVP+
Sbjct: 1081 EFDKNEV--AMCIKSLNLEVSQSTNERRHLITVGTAICKGEDLPVRGCVYVYDLASVVPQ 1138
Query: 309 PGQPLTKNKIKMIYAKE-QKGPVTAICHVA--GFLVTAVGQKIYIWQL-KDNDLTGIAFI 364
+P T K+K++ E +G VTA+ + G ++ A GQK + L +D L +AF+
Sbjct: 1139 KDRPETDKKLKLMAKDEVPRGAVTALSEIGTQGLMLVAQGQKCLVRGLGEDGRLLPVAFM 1198
Query: 365 DTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPS 424
D Y++ K L G A + A ++ + T GY G P
Sbjct: 1199 DMNCYVS---CAKELPGTGFCAMADA---FKGVWFT----------------GYTEG-PY 1235
Query: 425 RGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPE 484
+ +I G LE+ + D L + ++ + +D + N+ +F + PE
Sbjct: 1236 KMMIFG---------KSSTNLEVI------NVDFLPDGRNLLLVAADAEGNLHIFQFDPE 1280
Query: 485 ARESNGGHRLIKKTDFHLGQH-------VNTFFKIRCKPSSISDAPGARSRFL-TWYASL 536
+S GH L+ +T F G H + T +P++ DA A + A+
Sbjct: 1281 HPKSLQGHLLLNRTTFSTGAHHPQKSLLMPTTSSNPSQPATNGDASAAAAGPQHILMAAP 1340
Query: 537 DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRT--YKGKGYYAG---NPSR 591
G L PL + Y RL L + + H LNP+A+R + A + R
Sbjct: 1341 TGVLAAVQPLGQGVYTRLSALASNLAASVPHHAALNPKAYRMPPAPARNQVAAVDISVGR 1400
Query: 592 GIIDGSLVWKFLQLSLGERLEICKKIG 618
++DG+L+ ++ +L G R E+ + G
Sbjct: 1401 AVVDGALLARWAELGSGRRAEVAGRAG 1427
>gi|367018592|ref|XP_003658581.1| hypothetical protein MYCTH_2294503 [Myceliophthora thermophila ATCC
42464]
gi|347005848|gb|AEO53336.1| hypothetical protein MYCTH_2294503 [Myceliophthora thermophila ATCC
42464]
Length = 1547
Score = 175 bits (443), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 139/510 (27%), Positives = 234/510 (45%), Gaps = 65/510 (12%)
Query: 14 ETIVQELLTVSLG--LHGNRPLLLVRTQHELLIYQAFRHPKGA-----LKLRFKKLKVLF 66
ETI E+L LG H + L+L T +L +YQ FR+ GA L F+KL
Sbjct: 873 ETIA-EILVADLGDMTHKSPHLILRHTNDDLTLYQPFRYNTGAGLEFSKTLFFQKLPNTV 931
Query: 67 VSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTI 126
+ + A++ + R MR +N+ GY VFL G P+++ +S+ + P+
Sbjct: 932 FAKSPEEADDDEATHQ-PRFLSMRRCANVGGYSTVFLPGASPSFIIKSSKSVPKVLPLQG 990
Query: 127 DGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSY-DAPWPVRKVPLKCTPHFLAY 185
G V ++PFH C GF+Y +++ R++ LP SY + VRK+P+ AY
Sbjct: 991 TG-VIAMSPFHTEGCEHGFIYADSRDMARVAQLPQDWSYAELGLAVRKIPIGEDIAAAAY 1049
Query: 186 HLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEI 245
H ++Y + +T EP ++ +D R++ P V + ++ L SP +W +
Sbjct: 1050 HPPMQSYVVGCNTPEP----FELPKDDDYHKEWARENLAFKPTVDRGNLKLVSPITWTVV 1105
Query: 246 PQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEV 305
+ + E VLC++ + +E + + IA+GT ED+ RGR+ ++DI +V
Sbjct: 1106 --DSIQMEPCETVLCVECLGLEVSEFTNERKQLIAVGTAITKGEDLPTRGRVYVYDIADV 1163
Query: 306 VPEPGQPLTKNKIKMIYAKEQ--KGPVTAICHVA--GFLVTAVGQKIYIWQLK-DNDLTG 360
+P+PG+P T K+K+I AKE +G VTA+ + G ++ A GQK + LK D L
Sbjct: 1164 IPQPGRPETSKKLKLI-AKEDIPRGAVTALSEIGTQGLMLVAQGQKCMVRGLKEDGSLLP 1222
Query: 361 IAFIDTEVYIASMVSV--KNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGY 418
+AF+D Y+ + + L L+ D + + Y E + L +
Sbjct: 1223 VAFMDMSCYVTAAKELPGTGLCLMADAFKGVWFTGYTEEPYKMMLFGKS----------- 1271
Query: 419 YAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVL 478
RLE+ + D L + + ++SD D ++ +
Sbjct: 1272 -----------------------ATRLEVL------NADFLPDGKELFIVVSDADGHIHI 1302
Query: 479 FMYQPEARESNGGHRLIKKTDFHLGQHVNT 508
+ PE +S GH L+ +T F+ G H T
Sbjct: 1303 LQFDPEHPKSLQGHLLLHRTTFNTGAHQPT 1332
Score = 49.7 bits (117), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 32/104 (30%), Positives = 44/104 (42%), Gaps = 20/104 (19%)
Query: 534 ASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKG------------ 581
A+ G L LPE YRRL L + H GLNPR +R G
Sbjct: 1421 AAPTGVLAALRALPESAYRRLSSLAAQLAGSLPHAAGLNPRGYRLPDGVASSSSPWSSSS 1480
Query: 582 -------KGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 618
G AG R I+DG+L+ +F +L + R+E+ + G
Sbjct: 1481 SSFSAVVPGVDAGV-GRTIVDGALLQRFTELGMARRVELAGRAG 1523
>gi|384253955|gb|EIE27429.1| hypothetical protein COCSUDRAFT_64224 [Coccomyxa subellipsoidea
C-169]
Length = 1137
Score = 175 bits (443), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 161/635 (25%), Positives = 264/635 (41%), Gaps = 101/635 (15%)
Query: 43 LIYQAFRHPKGALKLRFKKLKVLFVSD------RSKRANEQPGLPRGVRISQMRYFSNIA 96
L Y+AF P+G ++ FK+L + + RSK + R + + + N
Sbjct: 565 LAYRAFHTPRG--RVCFKRLSLPAHAHCPPQDRRSKTTAPSSSMTRFDGLGESKEHVN-- 620
Query: 97 GYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYF----NAKS 152
G+F+ G P WL + SRG L AH M ++G VS + PFHN+NCP GF+ N
Sbjct: 621 --SGMFVSGERPLWL-VASRGTLVAHAMDVEGRVSGMTPFHNINCPLGFITACMAENDGE 677
Query: 153 ELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGED 212
L+I LP D PWP++K+ ++ TPH LAY+ E + Y ++ S P Y + E
Sbjct: 678 TLKICQLPMRTRLDTPWPLQKIAVRATPHRLAYYAEARLYVLLVSRPVP---YREHQEEA 734
Query: 213 KELVTDPRDS-RFIPPLVSQ--------FHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKN 263
+ DP S +I + V L P ++ + + E C
Sbjct: 735 SD--GDPHASYSYICADAAAKASGTELGGEVRLLEPGRYQTVARHALDPGEEP---CSVA 789
Query: 264 VSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVV---PEPGQPLTKNKIKM 320
L YI +GT NY ED C GRILLF E P ++ +
Sbjct: 790 ADWLRNAQTGALEPYITVGTALNYGEDYPCSGRILLFKATRTSTSGAEQADPTISWQLTL 849
Query: 321 IYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLI 380
++A PV + + G LV AVG + + +L+ + L I+F +++I S+ ++K I
Sbjct: 850 VHASGFSRPVQGLAVMDGRLVAAVGNNMQVMELRGSSLHMISFFHAQLFITSVATIKTFI 909
Query: 381 LVGDYARSIALL--RYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQ 438
L+GD + + + + Y L+ +++DY + + +++G ++
Sbjct: 910 LLGDVHKGLTFVYADKKANYTALTQLSKDYNDVDVEAAEF--------LVNGKKLF---- 957
Query: 439 LSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQ--PEARESNGGHRLIK 496
+ D +N+ LF Y E + + G +L+
Sbjct: 958 ----------------------------LLACDAAQNLRLFAYDGGKEQQATWQGKKLLP 989
Query: 497 KTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLP----LPEKNYR 552
H+GQ++ + R P+S A G + R + S G++ P LP +
Sbjct: 990 LGAIHVGQNICSSLSHRITPAS---ATGVQLRAAV-FGSAAGSIASLAPTWDGLPAE--- 1042
Query: 553 RLLMLQNVMVTHTSHTGGLNPRAF-RTYK--------GKGYYAGNPSRGIIDGSLVWKFL 603
LL LQ MV GLNP +F R YK G+ + A ++D + +F
Sbjct: 1043 ELLALQREMVLAVPQVAGLNPVSFRRRYKHGVKALAGGQSFEAPVSDDRVLDLDQLNRFQ 1102
Query: 604 QLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
L L E++ + K +L L ++ S F
Sbjct: 1103 WLPLTEQVALAAKCNLSRQQVLHALREMVMAISTF 1137
>gi|344229600|gb|EGV61485.1| hypothetical protein CANTEDRAFT_109087 [Candida tenuis ATCC 10573]
Length = 1300
Score = 175 bits (443), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 135/550 (24%), Positives = 245/550 (44%), Gaps = 66/550 (12%)
Query: 88 QMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLY 147
Q+ + N++G G+F+ G P ++ T+ R P+ + F N ++
Sbjct: 809 QLFHIENLSGLTGIFVSGDVPYYIVKTNHSIPRIFKFA-RIPIMSFGKFAN----NQLIF 863
Query: 148 FNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYK 207
+ K RI +P+ +Y+ WP R++ + T +AYH + T+ I T P Y
Sbjct: 864 LDDKKNTRICEIPSEFNYENNWPARQINIGETIKDVAYHETSNTFVISTYKEIP---YNC 920
Query: 208 FNGEDKELVTDPRDSRFIPPLVS-QFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSM 266
+ E+ +V D P +S + + L SP SW I + F L + E + ++ +
Sbjct: 921 LDEENVPIVGIMEDK---PSALSYKGSIKLVSPISWTVIDE--FELDDNEVGTKVSSMVL 975
Query: 267 EYEGT---LSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYA 323
+ + R ++ +GT ED+ G + +II+V+PEPG P T +K K Y
Sbjct: 976 DVGSSTRRFKSKREFVVIGTGKLRMEDLAANGSFKVLEIIDVIPEPGHPETNHKFKEFYK 1035
Query: 324 KEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVG 383
+E KG VTA+ V+G + + GQKI + L+D+ + +AF+D VY++ S N +L+G
Sbjct: 1036 EETKGAVTAVSDVSGRFLVSQGQKIIVRDLQDDGVVPVAFLDCSVYVSESKSYGNFVLLG 1095
Query: 384 DYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGE 443
D +S+ L + E + ++ +D K N + + II G
Sbjct: 1096 DTLKSVWLAGFDAEPYRMIMLGKDLKSIDVNCADFIVKDEELYIIVGD------------ 1143
Query: 444 RLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLG 503
+N+IL L Y PE S+ G RL++K F+L
Sbjct: 1144 -----------NNNILH-----------------LLKYDPEDPNSSNGQRLVEKAAFNLN 1175
Query: 504 QHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVT 563
V +++ P+ + ++ ++++G+ P+ E +YRR+ +LQ +
Sbjct: 1176 AKVT---QLKQLPNLMDNSTSCIG------STIEGSFFTVFPINESSYRRMYILQQQLTD 1226
Query: 564 HTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHND 623
H GLNPR R K + ++ I+D ++ + +L+ R I K+ + ++
Sbjct: 1227 KAYHHCGLNPRLNRFGGLKLTANESNNKPILDYDVIKLYAKLNEDRRRNIGAKVSREGSE 1286
Query: 624 ILDELYDIEA 633
I ++ + EA
Sbjct: 1287 IWRDMLEFEA 1296
>gi|403178252|ref|XP_003336695.2| hypothetical protein PGTG_18491 [Puccinia graminis f. sp. tritici CRL
75-36-700-3]
gi|375164075|gb|EFP92276.2| hypothetical protein PGTG_18491 [Puccinia graminis f. sp. tritici CRL
75-36-700-3]
Length = 1149
Score = 174 bits (442), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 140/563 (24%), Positives = 244/563 (43%), Gaps = 70/563 (12%)
Query: 88 QMRYFSNI---AGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRG 144
Q R F++I ++GV+L G P WL T G R + + + +A P G
Sbjct: 640 QSRSFTSIQMDGKFKGVYLAGQPPVWLLSTDHGPCRIYDSPDEKTIHGIAQL-----PDG 694
Query: 145 FLYFNAKSELRISVLPTHLSYDAPWPV---------RKVP---LKCTPHFLAYHLETKTY 192
FL +++ ++ + W R++P +K F ++ +
Sbjct: 695 FLMSLSEASVQDEEPSQACDPASLWETYISEYVCLDREIPSTLVKTGRPFNKVFYDSASE 754
Query: 193 CIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPL 252
+V ++ T + F+ E+ L+ P D I + + L P W I F
Sbjct: 755 TVVGASY-LETAFANFD-EEGNLMWQPDDDSLIRATTFRSSLELILPGKWVTIDGYEFQQ 812
Query: 253 HEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQP 312
+EW V + NV ++ T+SG R ++ +GT N +ED+ RG I +F+I+ V P
Sbjct: 813 NEW--VTSMANVELDSRSTVSGRRQFVGVGTTCNRAEDLAARGGIYVFEIVVVNPAQNHR 870
Query: 313 LTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQL-KDNDLTGIAFIDTEVYIA 371
+++ Y +E K VTA+ + G+ + +GQK+Y +D L + F+D + Y
Sbjct: 871 TYNRALRLRYYEETKACVTAVDAINGYFLHTMGQKLYAKCFEQDERLLAVGFLDIKPYTT 930
Query: 372 SMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGS 431
M KN IL+GD + I L+ +Q E L + Y + ++ + +IDG
Sbjct: 931 CMRIFKNFILLGDAVKGITLVAFQEEPYKLIELGHTYVDLKCSTIDFL-------VIDGK 983
Query: 432 LVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGG 491
L + +D + + +F Y P ES GG
Sbjct: 984 L---------------------------------AIVATDLNGVIRIFEYNPTNIESQGG 1010
Query: 492 HRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNY 551
+L+ +++F+ + + + S+ +A T++ASLDG++ +P E Y
Sbjct: 1011 QKLLCRSEFNTSSEMTCSMQFGKRLSAKDEA----KVMGTFFASLDGSISSLVPAKEAVY 1066
Query: 552 RRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERL 611
+RL ++Q + H H GLNP+ RT + + +RGI+DG L+ KF LS+ ++
Sbjct: 1067 KRLQLVQTRLTRHIQHFAGLNPKGHRTVRND-LVSRAINRGILDGELLIKFHLLSVTQQA 1125
Query: 612 EICKKIGSKHNDILDELYDIEAL 634
EI GS +L L ++ L
Sbjct: 1126 EIAGLAGSDRETVLVNLLNLRGL 1148
>gi|354547787|emb|CCE44522.1| hypothetical protein CPAR2_403250 [Candida parapsilosis]
Length = 1334
Score = 174 bits (442), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 140/554 (25%), Positives = 250/554 (45%), Gaps = 69/554 (12%)
Query: 88 QMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLY 147
++ YF N+ GY +F+ G P + + R + + P +++ F + G ++
Sbjct: 835 RLVYFPNLNGYTTIFVTGVIPFLIIKSCHSIPRIYQFS-KIPAVSVSAFSDSKIKNGLIF 893
Query: 148 FNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYK 207
+ RI L SY+ P+RKV + + +AYH ++ T I T P Y
Sbjct: 894 LDNNQNARICELSWDYSYEFNLPIRKVHIGESIKSVAYHEQSDTVVISTFKEIP---YDC 950
Query: 208 FNGEDKELVTDPRDSRFIPPLVS-QFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSM 266
+ E K + +D PP S + + L SP++W+ I L + E + +K++ +
Sbjct: 951 VDEEGKPIAGALKDK---PPATSFKGSIKLVSPYNWKVIDTVE--LSDNEVGMSIKSMVL 1005
Query: 267 EYEGTLSGL---RGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYA 323
+ +L R YI +GT+ ED+ G ++DII+++PEPG+P T +K K I+
Sbjct: 1006 DVGSSLKKFKTKREYIVIGTSKLRMEDLAANGSFKIYDIIDIIPEPGKPETNHKFKEIFQ 1065
Query: 324 KEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVG 383
++ KG VT+IC ++G + GQK+ + L+D+ + +AF+DT VY++ S N+ L+G
Sbjct: 1066 EDTKGAVTSICDLSGRFLVGQGQKVIVRDLEDDGVVPVAFLDTPVYVSEAKSFGNIFLLG 1125
Query: 384 DYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGE 443
D +SI L+ ++ + + ++ +D +
Sbjct: 1126 DALKSIWLVGFEADPFRMVMLGKDRQHLHVE----------------------------- 1156
Query: 444 RLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLG 503
C K +I +++D + + L + P+ +S G L+ K F
Sbjct: 1157 ----CADFIVKDEEIF-------ILVADINNGLHLIQFDPDDPKSINGTILVNKASFETN 1205
Query: 504 QHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVT 563
C S D G + T +++DGA P+ E YRR+ ++Q +
Sbjct: 1206 SQTT------CLRSVPKDEAG---DYQTIGSTIDGAFFNVFPVNESTYRRMYIVQQQISD 1256
Query: 564 HTSHTGGLNPRAFRTYKGKGYY--AGNPSRGIIDGSLVWKFLQLSLGERLEICKKI---G 618
H GLNPR R + G + ++ I+D +L+ +F +L+L + I KI G
Sbjct: 1257 KEFHHCGLNPRLNR-FGGAIQIRDSDTNAKPILDYNLIREFAKLNLDRQRNIATKINIKG 1315
Query: 619 SKHNDILDELYDIE 632
S H DI +L ++E
Sbjct: 1316 SAH-DIWKDLIELE 1328
>gi|403170487|ref|XP_003329830.2| hypothetical protein PGTG_11767 [Puccinia graminis f. sp. tritici CRL
75-36-700-3]
gi|375168746|gb|EFP85411.2| hypothetical protein PGTG_11767 [Puccinia graminis f. sp. tritici CRL
75-36-700-3]
Length = 1513
Score = 174 bits (442), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 140/563 (24%), Positives = 244/563 (43%), Gaps = 70/563 (12%)
Query: 88 QMRYFSNI---AGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRG 144
Q R F++I ++GV+L G P WL T G R + + + +A P G
Sbjct: 1004 QSRSFTSIQMDGKFKGVYLAGQPPVWLLSTDHGPCRIYDSPDEKTIHGIAQL-----PDG 1058
Query: 145 FLYFNAKSELRISVLPTHLSYDAPWPV---------RKVP---LKCTPHFLAYHLETKTY 192
FL +++ ++ + W R++P +K F ++ +
Sbjct: 1059 FLMSLSEASVQDEEPSQACDPASLWETYISEYVCLDREIPSTLVKTGRPFNKVFYDSASE 1118
Query: 193 CIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPL 252
+V ++ T + F+ E+ L+ P D I + + L P W I F
Sbjct: 1119 TVVGASY-LETAFANFD-EEGNLMWQPDDDSLIRATTFRSSLELILPGKWVTIDGYEFQQ 1176
Query: 253 HEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQP 312
+EW V + NV ++ T+SG R ++ +GT N +ED+ RG I +F+I+ V P
Sbjct: 1177 NEW--VTSMANVELDSRSTVSGRRQFVGVGTTCNRAEDLAARGGIYVFEIVVVNPAQNHR 1234
Query: 313 LTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQL-KDNDLTGIAFIDTEVYIA 371
+++ Y +E K VTA+ + G+ + +GQK+Y +D L + F+D + Y
Sbjct: 1235 TYNRALRLRYYEETKACVTAVDAINGYFLHTMGQKLYAKCFEQDERLLAVGFLDIKPYTT 1294
Query: 372 SMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGS 431
M KN IL+GD + I L+ +Q E L + Y + ++ + +IDG
Sbjct: 1295 CMRIFKNFILLGDAVKGITLVAFQEEPYKLIELGHTYVDLKCSTIDFL-------VIDGK 1347
Query: 432 LVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGG 491
L + +D + + +F Y P ES GG
Sbjct: 1348 L---------------------------------AIVATDLNGVIRIFEYNPTNIESQGG 1374
Query: 492 HRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNY 551
+L+ +++F+ + + + S+ +A T++ASLDG++ +P E Y
Sbjct: 1375 QKLLCRSEFNTSSEMTCSMQFGKRLSAKDEA----KVMGTFFASLDGSISSLVPAKEAVY 1430
Query: 552 RRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERL 611
+RL ++Q + H H GLNP+ RT + + +RGI+DG L+ KF LS+ ++
Sbjct: 1431 KRLQLVQTRLTRHIQHFAGLNPKGHRTVRND-LVSRAINRGILDGELLIKFHLLSVTQQA 1489
Query: 612 EICKKIGSKHNDILDELYDIEAL 634
EI GS +L L ++ L
Sbjct: 1490 EIAGLAGSDRETVLVNLLNLRGL 1512
>gi|50552095|ref|XP_503522.1| YALI0E03982p [Yarrowia lipolytica]
gi|74634000|sp|Q6C740.1|CFT1_YARLI RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two protein
1
gi|49649391|emb|CAG79101.1| YALI0E03982p [Yarrowia lipolytica CLIB122]
Length = 1269
Score = 173 bits (439), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 155/631 (24%), Positives = 276/631 (43%), Gaps = 95/631 (15%)
Query: 19 ELLTVSLGLHGN----RPLLLVRTQHELLIYQAFRHPKGALKLRFKKLKVLFVSDRSKRA 74
EL+ ++L G+ R L++ T +L++Y+ + + KLRF+K+ +
Sbjct: 719 ELVDIALSPLGDDHILRDYLVLLTPQQLVVYEPYHYND---KLRFRKIFL---------- 765
Query: 75 NEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTID-GPVSTL 133
P + R++Q+ I G + + G A++ + + L P I+ G
Sbjct: 766 ERTPTINSDRRLTQVPL---INGKHTLGVTG-ETAYILVKT---LHTSPRLIEFGETKGA 818
Query: 134 APFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKC--TPHFLAYHLETKT 191
F + + F Y E+ S + WPV+ V L C T + YH
Sbjct: 819 VAFTSWDGK--FAYLTQAGEVAECRFDPSFSLETNWPVKHVQL-CGETISKVTYHETMDV 875
Query: 192 YCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFP 251
Y I T P + ED E++ + +P Q + + +P+SW I F
Sbjct: 876 YVIATHKTVP----HVVRDEDDEVI-ESLTPDIMPATTYQGAIRIVNPYSWTVIDSYEFE 930
Query: 252 LHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQ 311
+ E LC ++V + S R +A+GT+ ED+ RG + LFD+IE+VPE +
Sbjct: 931 M-PAEAALCCESVKLSISDRKSQKREVVAVGTSILRGEDLAARGALYLFDVIEIVPEKER 989
Query: 312 PLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDN-DLTGIAFIDTEVYI 370
P T ++K + +G TA+C V+G L+ GQK+ + L+D+ L +AF+D + Y+
Sbjct: 990 PETNRRLKKLVQDRVRGAFTAVCEVSGRLLAVQGQKLLVQALQDDLTLVPVAFLDMQTYV 1049
Query: 371 ASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDG 430
A S+ +++L+GD RS+ + + + + ARD
Sbjct: 1050 AVAKSLNSMLLLGDATRSVQFVGFSMDPYQMIPFARD----------------------- 1086
Query: 431 SLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNG 490
LQ L + C D E ++ F+++D K + + Y P+ +S
Sbjct: 1087 ------LQRVL---VTTC--------DFAIEGENLTFVVADLQKRLHILEYDPDDPQSYS 1129
Query: 491 GHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKN 550
G RL++++ F+ G+ +++ + P RF+ DG++ +P PE
Sbjct: 1130 GARLLRRSVFYSGKVIDSSAMV----------PINEDRFMVIGVCSDGSVTDVVPCPEDA 1179
Query: 551 YRRLLMLQNVMVTHTSHTGGLNPRAFR---TYKGKGYYAGNPSRGIIDGSLVWKFLQLSL 607
YRRL +Q + +H GL+PRA+R G G +P R I+DG + +F L
Sbjct: 1180 YRRLYAIQTQITDKEAHVCGLHPRAYRYDPILPGTG---NSPHRPILDGHTLIRFANLPR 1236
Query: 608 GERLEICKKIGSKHNDILDELYDIEALSSHF 638
++ ++G ++ ++ D+E +S F
Sbjct: 1237 NKQNVYANRLGQRYQQLI--WKDLELISDLF 1265
>gi|325189779|emb|CCA24259.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 1911
Score = 172 bits (437), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 160/675 (23%), Positives = 286/675 (42%), Gaps = 162/675 (24%)
Query: 85 RISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDG-------------PVS 131
R + F N+ G+F G +P W+ L ++G+ P+ I PV
Sbjct: 1277 RYPMLTRFFNVNNNSGMFFRGAYPVWI-LPNQGQPVFVPLNIAAAPSDPTRRTTFKVPVL 1335
Query: 132 TLAPFHNVNCPRGFLYFNAKSELRISVLP-----THLSYDAPWPVRKVPLKCTPHFLAY- 185
+ PFH+ NCP GF+YF++ LR+ LP T L + ++KV T H L Y
Sbjct: 1336 SFTPFHHWNCPNGFVYFHSSGSLRVCELPSSQNSTLLPSGNGFVLQKVRFGATIHHLLYL 1395
Query: 186 ----------HLETKTYCIVTST----AEPSTDYYKFN--------------GEDKELVT 217
L++ T+ +V S +E Y+ N G++ E
Sbjct: 1396 GRHGPGGVAEALKSPTFAMVLSRKVTPSEAEQAYWSENNDENADDTMYQNGVGKEAEEGD 1455
Query: 218 DPR----DSRFIPPLVSQF---------------------HVSLFSPFSWEEIPQTNFPL 252
DP +S + P +F + + ++ + + + F
Sbjct: 1456 DPNAEDLNSNVMAPTAEKFPDLDVNDMPLIGEDAYELRVVQLDEYGDWAGQGVFRAYFER 1515
Query: 253 HE----------WEHVLCLKNV----------SMEYEGTLSG-------LRGYIALGTNY 285
HE + L KNV +ME + T + R YI +GT Y
Sbjct: 1516 HEVVLSVKVLYLHDASLLKKNVDSATDEYHRRNMETDSTANEEAEWNRRKRPYIVIGTGY 1575
Query: 286 --NYSEDVTCRGRILLF--DIIEVVPEPGQPLTK-NKIKMIYAKEQ-KGPVTAICHVAGF 339
ED + +GR+LL+ D + V + G +K K+++ + KE +G +T++ + +
Sbjct: 1576 VGPNGEDASGKGRLLLYEVDYAQYVDKDGTTSSKLPKLRLTFIKEHHQGAITSVIQLGMY 1635
Query: 340 LVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASM-VSVKNLILVGDYARSIALLRYQPEY 398
++ +VG K+ +++ K + L G AF D +++I S+ V K ++ D +S++ LR++ +
Sbjct: 1636 VLASVGSKMIVYEFKSDQLIGCAFYDAQMFITSLSVLRKEYVMYSDVYKSVSFLRWRQKD 1695
Query: 399 RTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 458
R L L+A+DY+P + + +I
Sbjct: 1696 RQLILLAKDYEPLAVTTAEF--------------------------------------NI 1717
Query: 459 LDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFF-KIRCKPS 517
LD + + + +D ++N+ + Y P ES GG RL++ +DFH+G +++ K+ +
Sbjct: 1718 LD--TRLALIAADVEENLHVLQYAPHDIESRGGQRLLRTSDFHVGVQISSILRKLVISNA 1775
Query: 518 SISDAPGARSR-----FLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 572
S A+ R +L S +G + +P+PE+ +RRL LQNVM++ LN
Sbjct: 1776 SHQQYIPAKGRCIGNMYLNVLGSSEGGIAALIPVPERVFRRLFTLQNVMISALPQNCALN 1835
Query: 573 PRAFRTYKGKGYYAGNPS---------RGIIDGSLVWKFLQLSLGERLEICKKIGSKHND 623
PR FR K G + +G +DG ++ +FL L + E+ + IG+
Sbjct: 1836 PREFRVMKANGRVRSGRADAWCKQKWKKGFLDGQVLCRFLHLDYVAQKELARCIGTNPEV 1895
Query: 624 ILDELYDIEALSSHF 638
I+ L +++ + F
Sbjct: 1896 IIQNLSELQRNTMSF 1910
>gi|325187036|emb|CCA21579.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 1912
Score = 172 bits (436), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 160/675 (23%), Positives = 286/675 (42%), Gaps = 162/675 (24%)
Query: 85 RISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDG-------------PVS 131
R + F N+ G+F G +P W+ L ++G+ P+ I PV
Sbjct: 1278 RYPMLTRFFNVNNNSGMFFRGAYPVWI-LPNQGQPVFVPLNIAAAPSDPTRRTTFKVPVL 1336
Query: 132 TLAPFHNVNCPRGFLYFNAKSELRISVLP-----THLSYDAPWPVRKVPLKCTPHFLAY- 185
+ PFH+ NCP GF+YF++ LR+ LP T L + ++KV T H L Y
Sbjct: 1337 SFTPFHHWNCPNGFVYFHSSGSLRVCELPSSQNSTLLPSGNGFVLQKVRFGATIHHLLYL 1396
Query: 186 ----------HLETKTYCIVTST----AEPSTDYYKFN--------------GEDKELVT 217
L++ T+ +V S +E Y+ N G++ E
Sbjct: 1397 GRHGPGGVAEALKSPTFAMVLSRKVTPSEAEQAYWSENNDENADDTMYQNGVGKEAEEGD 1456
Query: 218 DPR----DSRFIPPLVSQF---------------------HVSLFSPFSWEEIPQTNFPL 252
DP +S + P +F + + ++ + + + F
Sbjct: 1457 DPNAEDLNSNVMAPTAEKFPDLDVNDMPLIGEDAYELRVVQLDEYGDWAGQGVFRAYFER 1516
Query: 253 HE----------WEHVLCLKNV----------SMEYEGTLSG-------LRGYIALGTNY 285
HE + L KNV +ME + T + R YI +GT Y
Sbjct: 1517 HEVVLSVKVLYLHDASLLKKNVDSATDEYHRRNMETDSTANEEAEWNRRKRPYIVIGTGY 1576
Query: 286 --NYSEDVTCRGRILLF--DIIEVVPEPGQPLTK-NKIKMIYAKEQ-KGPVTAICHVAGF 339
ED + +GR+LL+ D + V + G +K K+++ + KE +G +T++ + +
Sbjct: 1577 VGPNGEDASGKGRLLLYEVDYAQYVDKDGTTSSKLPKLRLTFIKEHHQGAITSVIQLGMY 1636
Query: 340 LVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASM-VSVKNLILVGDYARSIALLRYQPEY 398
++ +VG K+ +++ K + L G AF D +++I S+ V K ++ D +S++ LR++ +
Sbjct: 1637 VLASVGSKMIVYEFKSDQLIGCAFYDAQMFITSLSVLRKEYVMYSDVYKSVSFLRWRQKD 1696
Query: 399 RTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 458
R L L+A+DY+P + + +I
Sbjct: 1697 RQLILLAKDYEPLAVTTAEF--------------------------------------NI 1718
Query: 459 LDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFF-KIRCKPS 517
LD + + + +D ++N+ + Y P ES GG RL++ +DFH+G +++ K+ +
Sbjct: 1719 LD--TRLALIAADVEENLHVLQYAPHDIESRGGQRLLRTSDFHVGVQISSILRKLVISNA 1776
Query: 518 SISDAPGARSR-----FLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 572
S A+ R +L S +G + +P+PE+ +RRL LQNVM++ LN
Sbjct: 1777 SHQQYIPAKGRCIGNMYLNVLGSSEGGIAALIPVPERVFRRLFTLQNVMISALPQNCALN 1836
Query: 573 PRAFRTYKGKGYYAGNPS---------RGIIDGSLVWKFLQLSLGERLEICKKIGSKHND 623
PR FR K G + +G +DG ++ +FL L + E+ + IG+
Sbjct: 1837 PREFRVMKANGRVRSGRADAWCKQKWKKGFLDGQVLCRFLHLDYVAQKELARCIGTNPEV 1896
Query: 624 ILDELYDIEALSSHF 638
I+ L +++ + F
Sbjct: 1897 IIQNLSELQRNTMSF 1911
>gi|146415762|ref|XP_001483851.1| hypothetical protein PGUG_04580 [Meyerozyma guilliermondii ATCC 6260]
Length = 1320
Score = 171 bits (434), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 143/609 (23%), Positives = 266/609 (43%), Gaps = 71/609 (11%)
Query: 33 LLLVRTQHELLIYQAFRHPKGALKLRFKKLKVLFVSDRSKRANEQPGLPRGVRISQ-MRY 91
L ++ E+ +Y+ F L + KK K L ++ A P G I + + Y
Sbjct: 768 LTILTVGGEIYMYKLFF---DGLNFKLKKEKDLLITGAPDNA-----YPAGTSIERRLVY 819
Query: 92 FSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAK 151
++G+ +F+ G P ++ T R T + A F + G ++ +
Sbjct: 820 IPLVSGFSSIFVTGVVPYFITRTRHSIPRIFKFT-KIAAQSFASFSDSKVSNGLIFLDNA 878
Query: 152 SELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGE 211
RI LP +YD PV+KVP+ T + YH + TY + T P Y + E
Sbjct: 879 KNARICELPRDFNYDNNLPVKKVPIGETVKSVTYHELSNTYVVSTYREIP---YNALDEE 935
Query: 212 DKELVTDPRDSRFIPPLVSQFHVSL--FSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYE 269
+ +D P + + SL SP++W I L + E + +K++ ++
Sbjct: 936 GNPIAGLKKDK----PSANSYKGSLKLISPYNWTVIETVE--LRDNEIAMTVKSMVLDIG 989
Query: 270 GTLSGLR---GYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQ 326
+ + + +GT ED+ G +++II+++PEPG+P T +K K ++
Sbjct: 990 SSTKRFKHRKELLVVGTGRYRMEDLGANGAFKIYEIIDIIPEPGKPETNHKFKEYNTEDT 1049
Query: 327 KGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYA 386
KG VT++C V+G + A GQKI + ++D+ + +AF+DT VY++ S NL+++GD
Sbjct: 1050 KGAVTSMCEVSGRFLVAQGQKIIVRDVQDDGVVPVAFLDTSVYVSEAKSFGNLVILGDTL 1109
Query: 387 RSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLE 446
+S+ L + E + ++ +D + +D S
Sbjct: 1110 KSVWLAGFDAEPFRMIMLGKDLQS-----------------VDVS--------------- 1137
Query: 447 ICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHV 506
C + SK +I +I+ + + L + PE S+ G RL+ + F++
Sbjct: 1138 -CAEFISKDEEIY-------ILIAGNNNVMHLVQFDPEDPTSSNGQRLVHRASFNVSSST 1189
Query: 507 NTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTS 566
+R P + F T +++DG+ P+ E YRR+ ++Q +
Sbjct: 1190 TC---MRMVPKNEEINTQYSDVFQTVGSTIDGSFFTVFPVNEFTYRRMYIIQQQLTDKEY 1246
Query: 567 HTGGLNPRAFRTYKGKGYYAGNPS-RGIIDGSLVWKFLQLSLGERLEICKKIGSK--HND 623
H GLNPR R + G+ + + I+D ++ ++ +L+ + I +K+ SK + +
Sbjct: 1247 HYCGLNPRLNR-FGGEAFDDSQTGVKPILDHQVIKRYAKLNEDRKQTIAQKVSSKGVYQE 1305
Query: 624 ILDELYDIE 632
I +L + E
Sbjct: 1306 IWKDLIEFE 1314
>gi|429851266|gb|ELA26469.1| protein cft1 [Colletotrichum gloeosporioides Nara gc5]
Length = 1411
Score = 171 bits (432), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 168/651 (25%), Positives = 279/651 (42%), Gaps = 117/651 (17%)
Query: 14 ETIVQELLTVSLG-LHGNRPLLLVR-TQHELLIYQAFR-----HPKGALK-LRFKK---- 61
+ + E+L LG + P L++R ++ IY+ R +G K L F+K
Sbjct: 839 QETLTEVLVAKLGDATESSPYLILRHANDDITIYEPIRLESQDKSEGLAKTLHFQKITNP 898
Query: 62 -LKVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELR 120
L V ANEQP R +R +NI GY VFL G P+++ +++ +
Sbjct: 899 ALAKSPVEVADDDANEQP------RFVPLRPCANINGYSTVFLPGASPSFIIKSAKSAPK 952
Query: 121 AHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTP 180
+ G V ++ FH C RGF+Y +++ R++ LP S++ +RK+P+
Sbjct: 953 VLGLQGIG-VRGMSSFHTEGCERGFIYADSEGHTRVTQLPADTSFELGVSIRKIPVGDAI 1011
Query: 181 HFLAYHLETKTYCIVTSTAEP-----STDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVS 235
+AYH +TY + S +EP DY+K ++ + T P+ R I +
Sbjct: 1012 GLIAYHPPMETYAVACSVSEPFELPKDDDYHKEWAKET-ITTFPQMERGI--------IK 1062
Query: 236 LFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRG 295
L SP +W I HE +C+K + +E R IA+GT N ED+ RG
Sbjct: 1063 LLSPATWSVIDTVELDPHEV--AMCMKTLHLEVSEETKERRMLIAIGTAINRGEDLPIRG 1120
Query: 296 RILLFDIIEVVPEPGQPLTKNKIKMIYAKEQ--KGPVTAICHVA--GFLVTAVGQKIYIW 351
RIL++D++ VVP+PG+P T K+K++ AKE+ +G VTA+C V G ++ A GQK +
Sbjct: 1121 RILVYDVVPVVPQPGRPETNKKLKLV-AKEEIPRGAVTALCEVGSQGLMLVAQGQKCMVR 1179
Query: 352 QLK-DNDLTGIAFIDTEVYIASMVSVKNL--ILVGDYARSIALLRYQPEYRTLSLVARDY 408
LK D L +AF+D Y+ S+ V+ L+ D + + + Y E
Sbjct: 1180 GLKEDGTLLPVAFMDMSCYVTSVREVRGTGYCLMADAFKGVWFVGYAEE----------- 1228
Query: 409 KPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFM 468
P + ++ G KF L+ D L + + +
Sbjct: 1229 --------------PYKIMLFGKSTGKFEVLTA---------------DFLVDGDELHIV 1259
Query: 469 ISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSR 528
+ DKD + + + PE +S GH L+ + F + P++ P +
Sbjct: 1260 VCDKDGVIHVMQFDPEHPKSLQGHLLLNRASFSAAPN---------HPTATLSLPRTTTT 1310
Query: 529 FLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRA-----FRTYKGKG 583
+ AS KN LML + T H R+ G
Sbjct: 1311 AQSASAS-------------KNPPSTLML----ASPTGHCRSAPSRSRHEPKGPPAPAPG 1353
Query: 584 YYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEAL 634
+ R I+DG+L+ ++ +L G R E+ K G + ++L+ ++E +
Sbjct: 1354 SPPTSAGRTIVDGALLSRWNELGAGRRSEVAGKGG--YGNVLEVRGELEGI 1402
>gi|322694449|gb|EFY86278.1| Cleavage factor two protein 1 [Metarhizium acridum CQMa 102]
Length = 1431
Score = 171 bits (432), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 158/639 (24%), Positives = 283/639 (44%), Gaps = 74/639 (11%)
Query: 17 VQELLTVSLG-LHGNRPLLLVR-TQHELLIYQAFR-HPKGALKLR----FKKLKVLFVSD 69
+ E+L LG P L+VR +L IY+ R +G +L FKK ++
Sbjct: 837 ITEILVADLGDAISQTPYLIVRHASDDLTIYEPVRCQEEGDAELSASLLFKKCVNTSLAK 896
Query: 70 RSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGP 129
+ +E P R +R +N+ GY VFL G P+++ +S E R + G
Sbjct: 897 TAPEVSEDDAEPP--RFVPLRRCANVNGYGAVFLPGASPSFVLKSSHSEPRVIGLQGLG- 953
Query: 130 VSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSY-DAPWPVRKVPLKCTPHFLAYHLE 188
V ++ FH C RGF+Y + + R++ LP++ S+ D V+K+ L ++YH
Sbjct: 954 VRGMSTFHTEGCDRGFIYVDVEGIARVTQLPSNASFTDLGVSVKKIALDGDVGMISYHHP 1013
Query: 189 TKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQT 248
T TY + + EP ++ +D +++ PP ++ + L +P +W I +
Sbjct: 1014 TGTYVVACTKLEP----FELPRDDDYHKEWAKETIKFPPTTARGILKLINPVTWTVIHE- 1068
Query: 249 NFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPE 308
L E + +K + +E + +A+GT + ED+ RGR+ +FDI+ V+PE
Sbjct: 1069 -LELEPCESIESMKTLHLEVSEETKERKMLVAVGTALSKGEDLPTRGRVQVFDIVTVIPE 1127
Query: 309 PGQPLTKNKIKMIYAKEQ--KGPVTAICHVA--GFLVTAVGQKIYIWQLK-DNDLTGIAF 363
PG+P T ++K+I AKE+ +G VTA+ V G ++ A GQK + LK D L +AF
Sbjct: 1128 PGRPETNKRLKLI-AKEEIPRGGVTALSEVGAQGLMLVAQGQKCMVRGLKEDGSLLPVAF 1186
Query: 364 IDTEVYIASMVSV--KNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAG 421
+D ++AS+ + L ++ D + + Y E T ++ + S G
Sbjct: 1187 LDMNCHVASVKELPGTGLCVMADVFKGLWFAGYTEEPYTFKILGK--------SSG---- 1234
Query: 422 NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMY 481
K+ D L + + + D + ++ + +
Sbjct: 1235 ----------------------------KLPLLVADFLPDGEDLSMVAVDAEGDMHILEF 1266
Query: 482 QPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALG 541
PE +S GH L+ +T F + + T + + S S + S + A G +
Sbjct: 1267 NPEHPKSLQGHLLLHRTSFAVTPNTPTSTLLLPRTHSPSYPHASSSSHMLLLACPSGQVA 1326
Query: 542 FFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYY------AGNPSRGIID 595
PL E YRRLL + N + GL+ +A R Y + A + R ++D
Sbjct: 1327 ALSPLAESTYRRLLSVTNQLHPAIVAHCGLHTKAHR-YPDQSCVAVGVETAASSGRALVD 1385
Query: 596 GSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEAL 634
G+++ ++ +L +R ++ + G ++ + D D+E +
Sbjct: 1386 GTVLARWSELGAAKRTDVALRGG--YDSVADLRDDLEGV 1422
>gi|388581811|gb|EIM22118.1| hypothetical protein WALSEDRAFT_28358 [Wallemia sebi CBS 633.66]
Length = 1259
Score = 170 bits (431), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 158/616 (25%), Positives = 258/616 (41%), Gaps = 84/616 (13%)
Query: 23 VSLGLHGNRP--LLLVRTQHELLIYQAFRHPKGALKLRFKKLKVLFVSDRSKRANEQPGL 80
V+LG G + L+++ EL+IY + G + F K+ + V
Sbjct: 725 VNLGKGGVKRAHLIILYQSGELVIYDTYNSSSG---IAFSKVSAVSVQ------------ 769
Query: 81 PRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVN 140
IS++ F + G + G P + R HP+ + APF
Sbjct: 770 LSATVISRILTFCDF----GALITGRTPVLISCEDTSIPRIHPLD-QKYIHHAAPFD--- 821
Query: 141 CPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAE 200
G L++ A E+ ++ + + Y P+R++ +AY ++ Y + TSTA
Sbjct: 822 ---GGLFYYANDEVILATIGDDVEYSENLPLRRLANGRNFDKVAYDPTSQMY-VATSTA- 876
Query: 201 PSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLC 260
S + F+ L P ++ P + + L S I F +E+ V+C
Sbjct: 877 -SVPFRLFDNAGNYLWKPPTEN-LSPATSYRSAIELLSNDCRSSIFGYEFEQNEF--VIC 932
Query: 261 LKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKM 320
+ VS+ + +I +GT N EDV +G + LF+I E++P K+KM
Sbjct: 933 CETVSLLSPSADGTYKDFIGVGTCINRGEDVAVKGAMYLFEIAELIPSSKDSGNNYKLKM 992
Query: 321 IYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDND-LTGIAFIDTEVYIASMVSVKNL 379
+ +E KG V+AI +G+ V AVGQK+ I L+ N+ L +AF D YI S+ +KN
Sbjct: 993 LMREETKGAVSAITSCSGYFVVAVGQKVLIRALEINERLISVAFYDAGTYIVSLEVLKNF 1052
Query: 380 ILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQL 439
ILVGD +SI L +Q L ++RD +
Sbjct: 1053 ILVGDQVKSITFLAFQESPYKLVQLSRDAR------------------------------ 1082
Query: 440 SLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTD 499
++E C H D + F+ +D ++ L Y P + GG +LI+ T+
Sbjct: 1083 ----QIETCVSNFLAHED------QISFVSNDIQGDLRLIDYNPFDPTAEGGEKLIRTTE 1132
Query: 500 FHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQN 559
FH G S + P R +DG+L P+ E ++ L +LQ
Sbjct: 1133 FHKGSEATC--------SLLLPKPSVRPSSELLLGCVDGSLSCLSPVDEITFKALWLLQG 1184
Query: 560 VMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGS 619
+V H LNPRA R + Y + + S+GI+DG L+ + + ++EI K+IG
Sbjct: 1185 ALVRQIPHIAALNPRAHRHVRND-YVSRSLSKGILDGLLLSAYQTIDHATQVEIAKRIGY 1243
Query: 620 KHNDILDELYDIEALS 635
++L L + LS
Sbjct: 1244 SKAELLGYLRNFSWLS 1259
>gi|453082807|gb|EMF10854.1| CPSF_A-domain-containing protein [Mycosphaerella populorum SO2202]
Length = 1349
Score = 170 bits (431), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 161/643 (25%), Positives = 271/643 (42%), Gaps = 100/643 (15%)
Query: 14 ETIVQELLTVSLGLH-GNRPLLLVRT-QHELLIYQAFRHP------KGALKLRFKKLKVL 65
+ + ELL LG P L+VRT ++++Y+ F +P L F+K+
Sbjct: 750 KATLTELLVADLGQDSATEPYLIVRTAMDDIVLYEPFHYPLRPNEDSWHSSLHFRKVPFS 809
Query: 66 FVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGEL----RA 121
++ + NEQ + + +++ + GY V + G P L + L
Sbjct: 810 YI----PKYNEQLSDAQTPPLKRIQ----VGGYHAVNIPG-GPTNLLMKESSSLLKVLEV 860
Query: 122 HPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPH 181
+ ++P H C GFL NA E++ + LP Y W +++V +
Sbjct: 861 RDTQSSQRATVMSPVHRPGCEHGFLTINADEEVQENQLPEKTWYGTGWSIQQVDIGEDVR 920
Query: 182 FLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFS 241
+AYH E + Y + T D+Y F GED +D + P V Q+ + L S S
Sbjct: 921 HIAYHAEREVYVVATCR---DIDFY-FAGEDGR--HPEQDDIELRPQVPQYTIHLVSAKS 974
Query: 242 WEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFD 301
+ + + L E V LK +S+E + + + T ED+ +G ++L+D
Sbjct: 975 HQRL--QSVELGYLETVTALKVMSLEVSENTHEQKDLVVVSTAAQRGEDMPAKGAVILYD 1032
Query: 302 IIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICH--VAGFLVTAVGQKIYIWQLK-DNDL 358
II+VVP+P P + ++ + ++ +G +T+I GFL TA G K+ + LK D
Sbjct: 1033 IIDVVPDPDVPESGFQLHQLAREQARGAITSIAGPLPGGFLGTAQGLKLMVRGLKEDGTC 1092
Query: 359 TGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGY 418
+AF+D + Y TL ++ P +
Sbjct: 1093 LPVAFLDAQSYT----------------------------HTLKVL--------PGRGMW 1116
Query: 419 YAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEF----SSMGFMISDKDK 474
AG+ +G+ G + +L++ + K SK + EF ++ +I D D
Sbjct: 1117 LAGDAWKGLWFGGFTEEPYKLTV-----MGKSPKSKMEVMTAEFLPFDGALYILIMDADN 1171
Query: 475 NVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI--RCKPSSISD----------- 521
++ + Y PE +S GG RL+ ++ FH+G V + KP D
Sbjct: 1172 DLHVLQYDPENPKSVGGMRLLHRSTFHIGHLVTNMLLVPSSLKPFESQDRDMANGTNGNN 1231
Query: 522 -----APGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAF 576
AP + L S G++G PL E YRRL LQ + H GLNPRA+
Sbjct: 1232 EEATRAPPSLHHILA--TSRSGSVGLITPLDEAAYRRLSALQTHLTAILEHAAGLNPRAY 1289
Query: 577 RTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGS 619
R + + + +RG++DGSLV + +L +R ++ + GS
Sbjct: 1290 RAVEAESFGG---ARGVVDGSLVNRIGELGAAKRADVLGRAGS 1329
>gi|190348091|gb|EDK40482.2| hypothetical protein PGUG_04580 [Meyerozyma guilliermondii ATCC 6260]
Length = 1320
Score = 169 bits (429), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 142/611 (23%), Positives = 266/611 (43%), Gaps = 71/611 (11%)
Query: 33 LLLVRTQHELLIYQAFRHPKGALKLRFKKLKVLFVSDRSKRANEQPGLPRGVRISQ-MRY 91
L ++ E+ +Y+ F + KK K L ++ A P G I + + Y
Sbjct: 768 LTILTVGGEIYMYKLFFDGSN---FKLKKEKDLLITGAPDNA-----YPAGTSIERRLVY 819
Query: 92 FSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAK 151
++G+ +F+ G P ++ T R T + A F + G ++ +
Sbjct: 820 IPLVSGFSSIFVTGVVPYFITRTRHSIPRIFKFT-KIAAQSFASFSDSKVSNGLIFLDNA 878
Query: 152 SELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGE 211
RI LP +YD PV+KVP+ T + YH + TY + T P Y + E
Sbjct: 879 KNARICELPRDFNYDNNLPVKKVPIGETVKSVTYHELSNTYVVSTYREIP---YNALDEE 935
Query: 212 DKELVTDPRDSRFIPPLVSQFHVSL--FSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYE 269
+ +D P + + SL SP++W I L + E + +K++ ++
Sbjct: 936 GNPIAGLKKDK----PSANSYKGSLKLISPYNWTVIETVE--LRDNEIAMTVKSMVLDIG 989
Query: 270 GTLSGLR---GYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQ 326
+ + + +GT ED+ G +++II+++PEPG+P T +K K ++
Sbjct: 990 SSTKRFKHRKELLVVGTGRYRMEDLGANGAFKIYEIIDIIPEPGKPETNHKFKEYNTEDT 1049
Query: 327 KGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYA 386
KG VT++C V+G + A GQKI + ++D+ + +AF+DT VY++ S NL+++GD
Sbjct: 1050 KGAVTSMCEVSGRFLVAQGQKIIVRDVQDDGVVPVAFLDTSVYVSEAKSFGNLVILGDTL 1109
Query: 387 RSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLE 446
+S+ L + E + ++ +D + +D S
Sbjct: 1110 KSVWLAGFDAEPFRMIMLGKDLQS-----------------VDVS--------------- 1137
Query: 447 ICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHV 506
C + SK +I +I+ + + L + PE S+ G RL+ + F++
Sbjct: 1138 -CAEFISKDEEIY-------ILIAGNNNVMHLVQFDPEDPTSSNGQRLVHRASFNVSSST 1189
Query: 507 NTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTS 566
+R P + F T +++DG+ P+ E YRR+ ++Q +
Sbjct: 1190 TC---MRMVPKNEEINTQYSDVFQTVGSTIDGSFFTVFPVNEFTYRRMYIIQQQLTDKEY 1246
Query: 567 HTGGLNPRAFRTYKGKGYYAGNPS-RGIIDGSLVWKFLQLSLGERLEICKKIGSK--HND 623
H GLNPR R + G+ + + I+D ++ ++ +L+ + I +K+ SK + +
Sbjct: 1247 HYCGLNPRLNR-FGGEAFDDSQTGVKPILDHQVIKRYAKLNEDRKQTIAQKVSSKGVYQE 1305
Query: 624 ILDELYDIEAL 634
I +L + E +
Sbjct: 1306 IWKDLIEFENV 1316
>gi|449299306|gb|EMC95320.1| hypothetical protein BAUCODRAFT_25380 [Baudoinia compniacensis UAMH
10762]
Length = 1437
Score = 168 bits (426), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 160/645 (24%), Positives = 270/645 (41%), Gaps = 104/645 (16%)
Query: 17 VQELLTVSLGLHG-NRPLLLVRT-QHELLIYQAFRHPKGALK--------LRFKKLKVLF 66
+ E+L V LG G RP L+VRT +L++Y+ F + L LRF+K+ +
Sbjct: 777 LTEVLVVDLGAEGVTRPYLIVRTAMDDLILYEPFHYSATTLDARATGFTDLRFRKVPFTY 836
Query: 67 VSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTI 126
+ + + G P +Q++ + I G ++L G P++L + + +
Sbjct: 837 LPKYDEGLDTADGRP-----AQLQP-AVIGGRNALYLPGGTPSFLVKEATSLPKVLGLRA 890
Query: 127 DGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFL--- 183
G V + +P H C +GF + +L+ LP H+S+ W VR + L P +
Sbjct: 891 RG-VRSFSPLHRAGCQQGFALVDGDGKLKEYQLPGHVSFATGWSVRTLTLGEPPQEVRQV 949
Query: 184 AYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWE 243
AYH + Y + T D+ + ++++ +P + P V Q+ + L S S +
Sbjct: 950 AYHEQRGIYVVATCR---DVDFTLHDLDERQRDDEPN----LKPQVPQYTLHLLSATSHK 1002
Query: 244 EIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDII 303
I P E V LK + +E + + +G ED +G + +FDII
Sbjct: 1003 VIQSLEMPYAEI--VTSLKIMPLEVSEHTHEQKLMLVVGAAAQRGEDAPAKGLLTVFDII 1060
Query: 304 EVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLV-TAVGQKIYIWQLK-DNDLTGI 361
+VVPEP P + ++ + +E KG +TA+ +G LV TA GQKI + LK D +
Sbjct: 1061 DVVPEPDDPESGIRLHIAAREETKGAITALESFSGGLVGTAQGQKIMVRGLKEDGTCLPV 1120
Query: 362 AFIDTEVYIASMVSV--KNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYY 419
AF+D + Y+ S+ ++ L L GD + + + E L+L+ + + S +
Sbjct: 1121 AFLDAQTYMVSLKTMGRSGLSLAGDAWKGLWFGGWTEEPYRLTLLGKSRTKMEVVSAEFL 1180
Query: 420 AGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLF 479
DG L ++ D ++ +
Sbjct: 1181 P-------FDGQLY---------------------------------LLVVDGKMDLHVL 1200
Query: 480 MYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI--------RCKPSSISDAPGARS---- 527
Y PE ++ G RL+ K+ FHLG + + P + D+ G +
Sbjct: 1201 QYDPENPKTVSGQRLLHKSTFHLGHWPVDMLLLPSDLAPFAQQAPLTNGDSNGHTNGTES 1260
Query: 528 -------------RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPR 574
LT + S GA+G P+ E YRRL LQ + + H GLNPR
Sbjct: 1261 SAANAPAPAPSLFHVLTTFQS--GAVGLITPVDEATYRRLGALQTQLTSVLEHAAGLNPR 1318
Query: 575 AFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGS 619
A+R + + RG++DG LV + +L R E+ + G+
Sbjct: 1319 AYRAVESESLGG----RGVVDGMLVQRIGELGAARRAEVLGRAGA 1359
>gi|116182170|ref|XP_001220934.1| hypothetical protein CHGG_01713 [Chaetomium globosum CBS 148.51]
gi|88186010|gb|EAQ93478.1| hypothetical protein CHGG_01713 [Chaetomium globosum CBS 148.51]
Length = 1394
Score = 168 bits (425), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 140/502 (27%), Positives = 226/502 (45%), Gaps = 66/502 (13%)
Query: 17 VQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGA-----LKLRFKKLKVLFVSDRS 71
V E+L L NR L +L IYQ FR+ A L F+KL +
Sbjct: 856 VAEILVADLA---NRSQLR-HANDDLTIYQPFRYSTSAGADFSKTLFFQKLPNAAFAKSP 911
Query: 72 KRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVS 131
+ A+E + R+ MR SNIAGY VFL G P+++ +S+ R P+ G V
Sbjct: 912 EEADEDEATHQ-PRMLSMRRCSNIAGYSTVFLPGASPSFIIKSSKSAPRVLPLQGAG-VI 969
Query: 132 TLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSY-DAPWPVRKVPLKCTPHFLAYHLETK 190
++PFH C GF+Y +++ R++ LP +Y + VRK+P+ +AYH +
Sbjct: 970 AMSPFHTEGCENGFIYADSQHMARVTQLPQDWNYAETGLAVRKIPIGEDIAAVAYHPPMQ 1029
Query: 191 TYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNF 250
+Y + +T EP ++ +D R++ P V + + L SP +W +
Sbjct: 1030 SYVVGCNTLEP----FELPKDDDYHKEWARENLSFKPTVDRGILKLVSPITWTVVDSVQ- 1084
Query: 251 PLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPG 310
+ E VLC+ +S+E + + IA+GT ED+ RGR+ ++DI EV+PEPG
Sbjct: 1085 -MEPCETVLCVATLSLEVSEFTNERKQLIAVGTALIKGEDLPTRGRVYVYDITEVIPEPG 1143
Query: 311 QPLTKNKIKMIYAKEQ--KGPVTAICHVA--GFLVTAVGQKIYIWQLK-DNDLTGIAFID 365
+P T K+K+I AKE+ +G VTA+ + G ++ A GQK + LK D L +AF+D
Sbjct: 1144 RPETSKKLKLI-AKEEIPRGAVTALSEIGTQGLMLVAQGQKCMVRGLKEDGTLLPVAFMD 1202
Query: 366 TEVYI--ASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNP 423
Y+ A + L L+ D + + Y E + L +
Sbjct: 1203 MNCYVTNAKELPGTGLCLLADAFKGVWFTGYTEEPYKMMLFGKS---------------- 1246
Query: 424 SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQP 483
+LE+ + D L + + + D D N+ + + P
Sbjct: 1247 ------------------STKLEVL------NADFLPDGKDLFIVACDADGNIHILEFDP 1282
Query: 484 EARESNGGHRLIKKTDFHLGQH 505
E +S GH L+ +T F+ G +
Sbjct: 1283 EHPKSLQGHLLLHRTTFNTGAN 1304
>gi|405121446|gb|AFR96215.1| cleavage and polyadenylation specific protein [Cryptococcus
neoformans var. grubii H99]
Length = 1431
Score = 167 bits (424), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 149/621 (23%), Positives = 264/621 (42%), Gaps = 75/621 (12%)
Query: 25 LGLHGNRPLLLVRTQHELLIYQAFRHPKGALKLRFKKL--KVLFVSDRSKRANEQPGLPR 82
L LH + L Q + A H + +L +RF+K+ ++L +S N LP
Sbjct: 871 LALHHSGRLNAYEAQPRFTV-DASSHSRRSLAVRFRKVHTQLLPISGGVGTTNGNARLPY 929
Query: 83 GVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVN-- 140
+ F+NI G G F+ G P W+ + AHP+ F
Sbjct: 930 TIV-----PFNNIEGLTGAFITGEKPHWIISS-----EAHPLRAFALKQAAMAFGKTTHL 979
Query: 141 CPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAE 200
+G + + I LP L+ D P + ++ + + + Y S
Sbjct: 980 GGKGEYFIRIEDGSFICYLPPTLNTDFAIPCDRYQMERAYTNITFDPTSAHYVGAASIEV 1039
Query: 201 PSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFS--WEEIPQTNFPLHEWEHV 258
P Y E+ E+ P IPP + + LFS S W I + + E V
Sbjct: 1040 PFQAY----DEEGEIQLGPDGPDLIPPTNQRSTLELFSQGSDPWRVI--DGYEFDQNEEV 1093
Query: 259 LCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKI 318
+ +++V++E G G R +IA+GT +N+ ED RG +F+I++ V G +
Sbjct: 1094 MSMESVNLESPGAPGGYRDFIAVGTGFNFGEDRATRGNTYIFEILQTVGPQGGGGPGSVP 1153
Query: 319 KMIYAKEQKG----PVTAICHVAGFLVTAVGQKIYIWQLK-DNDLTGIAFIDTEVYIASM 373
K K PV A+ H+ G+L+ G K+Y+ L D L G+AF+D ++Y ++
Sbjct: 1154 GWKLVKRTKDPARHPVNAVNHINGYLLNTNGPKLYVKGLDYDAQLMGLAFLDIQLYATTV 1213
Query: 374 VSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLV 433
KN +L+GD +S + Q + + +++D
Sbjct: 1214 KVFKNFMLIGDLCKSFWFVSLQEDPYKFTTISKD-------------------------- 1247
Query: 434 WKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR 493
+ + D L + F+ SD++ ++ + + P +S G R
Sbjct: 1248 --------------LQHVSVVTADFLVHDGQVTFISSDRNGDMRMLDFDPTDPDSLNGER 1293
Query: 494 LIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRR 553
L+ KT++H G T K+ + + + +++ + YA+ DGAL + + + ++R
Sbjct: 1294 LMLKTEYHAGSAA-TVSKVIARRKTAEEEFAPQTQII--YATADGALTTVVSVKDARFKR 1350
Query: 554 LLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEI 613
L ++ + +V + H GLNPRAFRT + S+GI+DG L+ +F +G + E+
Sbjct: 1351 LQLVSDQLVRNAQHVAGLNPRAFRTVRND-LLPRPLSKGILDGQLLNQFALQPIGRQKEM 1409
Query: 614 CKKIGSKHNDILDELYDIEAL 634
++IG+ D + D++AL
Sbjct: 1410 MRQIGT---DAVTVASDLQAL 1427
>gi|58268668|ref|XP_571490.1| cleavage and polyadenylation specific protein [Cryptococcus
neoformans var. neoformans JEC21]
gi|134113364|ref|XP_774707.1| hypothetical protein CNBF3860 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|338817789|sp|P0CM63.1|CFT1_CRYNB RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two protein
1
gi|338817790|sp|P0CM62.1|CFT1_CRYNJ RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two protein
1
gi|50257351|gb|EAL20060.1| hypothetical protein CNBF3860 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|57227725|gb|AAW44183.1| cleavage and polyadenylation specific protein, putative [Cryptococcus
neoformans var. neoformans JEC21]
Length = 1431
Score = 167 bits (424), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 143/599 (23%), Positives = 259/599 (43%), Gaps = 74/599 (12%)
Query: 47 AFRHPKGALKLRFKKL--KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLC 104
A H + +L +RF+K+ ++L +S N LP + F+NI G G F+
Sbjct: 892 ASSHSRRSLAVRFRKVHTQLLPISGGVGTTNGNARLPYTIV-----PFNNIEGLTGAFIT 946
Query: 105 GPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVN--CPRGFLYFNAKSELRISVLPTH 162
G P W+ + AHP+ F +G + + I LP
Sbjct: 947 GEKPHWIISS-----EAHPLRAFALKQAAMAFGKTTHLGGKGEYFIRIEDGSFICYLPPT 1001
Query: 163 LSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDS 222
L+ D P + ++ + + + Y S P Y E+ E+ P
Sbjct: 1002 LNTDFAIPCDRYQMERAYTNITFDPTSAHYVGAASIEVPFQAY----DEEGEIQLGPDGP 1057
Query: 223 RFIPPLVSQFHVSLFSPFS--WEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIA 280
IPP + + LFS S W+ I + + E V+ +++V++E G G R +IA
Sbjct: 1058 DLIPPTNQRSTLELFSQGSDPWKVI--DGYEFDQNEEVMSMESVNLESPGAPGGYRDFIA 1115
Query: 281 LGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKG----PVTAICHV 336
+GT +N+ ED RG +F+I++ V G + K K PV A+ H+
Sbjct: 1116 VGTGFNFGEDRATRGNTYIFEILQTVGPQGGGGPGSVPGWKLVKRTKDPARHPVNAVNHI 1175
Query: 337 AGFLVTAVGQKIYIWQLK-DNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQ 395
G+L+ G K+Y+ L D+ L G+AF+D ++Y ++ KN +L+GD +S + Q
Sbjct: 1176 NGYLLNTNGPKLYVKGLDYDSQLMGLAFLDIQLYATTVKVFKNFMLIGDLCKSFWFVSLQ 1235
Query: 396 PEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKH 455
+ + +++D + +
Sbjct: 1236 EDPYKFTTISKD----------------------------------------LQHVSVVT 1255
Query: 456 NDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCK 515
D L + F+ SD++ ++ + + P +S G RL+ +T++H G T K+ +
Sbjct: 1256 ADFLVHDGQVTFISSDRNGDMRMLDFDPTDPDSLNGERLMLRTEYHAGSAA-TVSKVIAR 1314
Query: 516 PSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRA 575
+ + +++ + YA+ DGAL + + + ++RL ++ + +V + H GLNPRA
Sbjct: 1315 RKTAEEEFAPQTQII--YATADGALTTVVSVKDARFKRLQLVSDQLVRNAQHVAGLNPRA 1372
Query: 576 FRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEAL 634
FRT + S+GI+DG L+ +F +G + E+ ++IG+ D + D++AL
Sbjct: 1373 FRTVRND-LLPRPLSKGILDGQLLNQFALQPIGRQKEMMRQIGT---DAVTVASDLQAL 1427
>gi|452841862|gb|EME43798.1| hypothetical protein DOTSEDRAFT_79774 [Dothistroma septosporum NZE10]
Length = 1347
Score = 167 bits (424), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 163/643 (25%), Positives = 263/643 (40%), Gaps = 85/643 (13%)
Query: 19 ELLTVSLGLHG-NRPLLLVRTQ-HELLIYQAFRHPKGA------LKLRFKKLKVLFVSDR 70
ELL LG G + P L+ RT +L++Y+ FRHP+ A LRF+K+ V ++
Sbjct: 746 ELLVAELGASGVDTPYLVARTALDDLVLYEPFRHPEPAPSDQWYTNLRFRKVPVTYI--- 802
Query: 71 SKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGP- 129
+ NE R +R ++ Y V + G P L + R + I
Sbjct: 803 -PKYNEAIAQEESTRPLPLRSI-HVGDYDAVTIPGSPPLLLVKEASSLPRVLEVRISNES 860
Query: 130 --VSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTP---HFLA 184
V+TL P H +C +GF NA L LP Y W V++V L LA
Sbjct: 861 NRVATLLPIHLDHCKKGFAAVNADGLLEEYHLPLSAWYGTGWSVQQVDLGSEDLEVRHLA 920
Query: 185 YHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDP--RDSRFIPPLVSQFHVSLFSPFSW 242
YH Y + T D+Y + + L +D + P V Q+ + L S +
Sbjct: 921 YHETRGVYVVATCK---DVDFYFAEDDHRHLGQSGGGQDDITLRPQVKQYSIHLVSSKTH 977
Query: 243 EEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDI 302
I P E + L+ + +E I + T ED+ RG I++F+I
Sbjct: 978 RVIDSRAMPY--LEAITALQVMPLEVSELTHEQDLRILVSTAAMRGEDMPARGAIIVFNI 1035
Query: 303 IEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV-AGFLVTAVGQKIYIWQLK-DNDLTG 360
I+VVP P P + K+ + +E KG +TA+ GF+ + GQKI I LK D
Sbjct: 1036 IDVVPAPDVPESGIKLHVNAREETKGAITALAPFPGGFVGSGQGQKIMIRGLKEDGSCLP 1095
Query: 361 IAFIDTEVY--IASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGY 418
+AF+D + + + + + L GD + + + E L+++ +
Sbjct: 1096 VAFLDAQCHTTVIKTLGTSGMWLAGDAWKGLWFGGFTEEPYKLTVLGK------------ 1143
Query: 419 YAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVL 478
P R + + +FL ++ +I D D ++ +
Sbjct: 1144 ---APERQM--EVMAAEFLPFD----------------------GALYILIIDADMDLHV 1176
Query: 479 FMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI--------RCKPSSISDAPGARSR-- 528
Y PE +S G RL+ ++ FHLG + +P + D G
Sbjct: 1177 LQYDPENPKSQNGMRLLHRSTFHLGHFATNMLLLPSSLNPFGENQPFTNGDTNGESPEES 1236
Query: 529 ---FLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYY 585
F SL G++G PL E +YRRL LQ + T H LNPRA+R + + +
Sbjct: 1237 SPLFHVLTTSLTGSIGMITPLDESSYRRLSALQTHLTTILEHPASLNPRAYRAIESESFG 1296
Query: 586 AGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEL 628
+RG++DG++V + +L R ++ + G+ I +L
Sbjct: 1297 G---ARGVVDGNIVRRINELGAARRADVLARAGADAWSIRSDL 1336
>gi|149237256|ref|XP_001524505.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
gi|146452040|gb|EDK46296.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
Length = 1380
Score = 167 bits (423), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 132/542 (24%), Positives = 239/542 (44%), Gaps = 67/542 (12%)
Query: 88 QMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLY 147
++ YF N+ GY +F+ G P + + R + P +++ F + G +
Sbjct: 877 RLVYFPNLNGYTCIFVTGVIPFIIIKSLHSIPRIFQFS-KIPAVSISAFSDSKIKNGLIC 935
Query: 148 FNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYK 207
+ RI L +Y+ P+++V L LAYH ++ T T P Y
Sbjct: 936 LDNNKNARICELSLDYTYEFNLPIKRVDLGELVRSLAYHEQSDTVVASTFKEVP----YN 991
Query: 208 FNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSME 267
E+ ++ + L + + L SP +W I +F L + E + +K++ ++
Sbjct: 992 CVDEEGNIIPGVYKEKLPHALTFKSSIKLISPHNWTVID--SFDLEDNEVGMTVKSMILD 1049
Query: 268 YEGTLSGL------RGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMI 321
+ L R YI +GT ED+ G +++II+++PEPG+P T +K K +
Sbjct: 1050 RGSGAASLKKFKSKREYIVIGTGKLRMEDLAANGSFKIYEIIDIIPEPGKPETNHKFKEV 1109
Query: 322 YAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLIL 381
+ ++ +G VTAIC ++G L+ GQKI + ++D+ + +AF+DT VYI+ S NL++
Sbjct: 1110 FQEDARGAVTAICDLSGRLMVGQGQKIIVRDIEDDGVVPVAFLDTSVYISEAKSFGNLLI 1169
Query: 382 VGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSL 441
+GD +S+ L+ ++ E + ++ +D
Sbjct: 1170 LGDPLKSVWLVGFEAEPYRMVMLGKDR--------------------------------- 1196
Query: 442 GERLEI-CKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDF 500
+ L++ C K DI +++D + + L Y P+ +S G L+ K F
Sbjct: 1197 -QHLDVECADFIIKDEDIF-------ILVADNNNCIHLIQYDPDDPKSINGTILLNKASF 1248
Query: 501 HLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNV 560
L +R P G + + ++LDGAL P+ E YRR+ +LQ
Sbjct: 1249 ELNSATTC---LRSIPK------GEKGDYQIIGSTLDGALYNVFPVNEFTYRRMYILQQQ 1299
Query: 561 MVTHTSHTGGLNPRAFRTYKGKGYYAGNP--SRGIIDGSLVWKFLQLSLGERLEICKKIG 618
+ H GLNPR R + G ++ I+D L+ +F +L+L + ++ KI
Sbjct: 1300 ISDKVYHFCGLNPRLNR-FGGSVTLRDRETNTKPILDYGLIRRFSKLNLDRQQQLAGKIS 1358
Query: 619 SK 620
+
Sbjct: 1359 VR 1360
>gi|321260384|ref|XP_003194912.1| cleavage and polyadenylation specific protein [Cryptococcus gattii
WM276]
gi|317461384|gb|ADV23125.1| cleavage and polyadenylation specific protein, putative [Cryptococcus
gattii WM276]
Length = 1431
Score = 167 bits (422), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 147/621 (23%), Positives = 265/621 (42%), Gaps = 75/621 (12%)
Query: 25 LGLHGNRPLLLVRTQHELLIYQAFRHPKGALKLRFKKL--KVLFVSDRSKRANEQPGLPR 82
L LH + L Q + A H + +L +RF+K+ ++L +S N LP
Sbjct: 871 LALHHSGRLNAYEAQPRFTV-DASSHSRRSLAVRFRKVHTQLLPISGGVGTTNGSARLPY 929
Query: 83 GVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVN-- 140
+ F+NI G G F+ G P W+ + AHP+ F
Sbjct: 930 TIV-----PFNNIEGLTGAFITGEKPHWIISS-----EAHPLRAFALKQAAMAFGKTTHL 979
Query: 141 CPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAE 200
+G + + I LP L+ D P + ++ T + + + Y S
Sbjct: 980 GGKGEYFIRIEDGSFICYLPPTLNTDFAIPCDRYQMERTYTNITFDPTSAHYVGAASIEV 1039
Query: 201 PSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFS--WEEIPQTNFPLHEWEHV 258
P Y E+ E+ P IPP + + LFS S W+ I + + E V
Sbjct: 1040 PFQAY----DEEGEIQLGPDGPDLIPPTNQRSTLELFSQGSDPWKVI--DGYEFDQNEEV 1093
Query: 259 LCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKI 318
+ +++V++E G G R +IA+GT +N+ ED RG +F+I++ V G +
Sbjct: 1094 MSMESVNLESPGAPGGYRDFIAVGTGFNFGEDRATRGNTYIFEILQTVGPQGGGGPGSVP 1153
Query: 319 KMIYAKEQKG----PVTAICHVAGFLVTAVGQKIYIWQLK-DNDLTGIAFIDTEVYIASM 373
+ K PV A+ H+ G+L+ G K+Y+ D L G+AF+D ++Y ++
Sbjct: 1154 GWKLVRRTKDPARHPVNAVNHINGYLLNTNGPKLYVKGFDYDAQLMGLAFLDIQLYATTV 1213
Query: 374 VSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLV 433
KN +L+GD +S + Q + + +++D
Sbjct: 1214 KVFKNFMLIGDLCKSFWFVSLQEDPYKFTTISKD-------------------------- 1247
Query: 434 WKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR 493
+ + D L + F+ SD++ ++ + + P +S G R
Sbjct: 1248 --------------LQHVSVVTADFLVHDGQVTFISSDRNGDMRMLDFDPTDPDSLNGER 1293
Query: 494 LIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRR 553
L+ +T++H G T K+ + + + +++ + YA+ DGAL + + + ++R
Sbjct: 1294 LMLRTEYHAGSAA-TVSKVIARRKTTEEEFAPQTQII--YATADGALTTVVSVKDARFKR 1350
Query: 554 LLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEI 613
L ++ + +V + H GLNPRAFRT + S+GI+DG L+ +F +G + E+
Sbjct: 1351 LQLVSDQLVRNAQHVAGLNPRAFRTVRND-LLPRPLSKGILDGQLLNQFALQPIGRQKEM 1409
Query: 614 CKKIGSKHNDILDELYDIEAL 634
++IG+ D + D++AL
Sbjct: 1410 MRQIGT---DAVTVASDLQAL 1427
>gi|328848896|gb|EGF98089.1| hypothetical protein MELLADRAFT_96156 [Melampsora larici-populina
98AG31]
Length = 1427
Score = 166 bits (420), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 139/568 (24%), Positives = 244/568 (42%), Gaps = 87/568 (15%)
Query: 96 AGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELR 155
A + G+ L G P W+ T G ++ + + ++ L + FL + + E+
Sbjct: 917 ATFSGIHLSGLEPIWIVSTDHGPVQIYKAKTNQTITYL------DQSDKFLVSDHQVEIW 970
Query: 156 ISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKEL 215
S + + D PVR V + + Y + + P ++ E+ +
Sbjct: 971 ESEVGEGVCLDGRIPVRLVKDGRSFSKIVYEPKMDVVIGASYLVTPFANFT----EEGVM 1026
Query: 216 VTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGL 275
+ + D + P + + L P SW+ I F +EW V +K VS++ + SG
Sbjct: 1027 MWEQDDESKVRPNGFRSSLELILPGSWDTIDGHEFQQNEW--VTSMKLVSLDSKSKRSGR 1084
Query: 276 RGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICH 335
R +I GT N +ED+ RG + +F++IE+VP+P P +++ Y + K VTA+
Sbjct: 1085 RDFIGAGTTCNRAEDLAARGGVYVFEVIEIVPDPKHPERNRGLRLRYHETTKACVTAVDG 1144
Query: 336 VAGFLVTAVGQKI----------------------YIWQL------KDNDLTGIAFIDTE 367
+ G+ + +GQK+ + +L +D L + F+D
Sbjct: 1145 LNGYFIHTMGQKVDPGYPRSPTRKYSDILADQIIAFYSKLYAKCFEQDERLLAVGFLDIR 1204
Query: 368 VYIASMVSVKNLILVGDYARSIALLRYQPE-YRTLSLVARDYKPTQPNSKGYYAGNPSRG 426
Y ++ +KN I++GD + I L+ +Q E Y+ + L G+
Sbjct: 1205 PYTTTLKVLKNFIVLGDAVKGITLVAFQEEPYKLIEL-------------GH-------- 1243
Query: 427 IIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEAR 486
F+ L C I D L + + + SD + +F Y P
Sbjct: 1244 --------TFVDLR-------CSTI-----DFLVLENKLSIVTSDLGGTIRIFEYNPTNI 1283
Query: 487 ESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPL 546
ES GG +L+ +T+F + + + SS +A T +A LDG++ +P+
Sbjct: 1284 ESQGGLKLLCRTEFGTAGEMGSSLGFGKRLSSKEEAKSIG----TLFAGLDGSISSLVPV 1339
Query: 547 PEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLS 606
E ++RL ++Q ++ H H GLNPR FRT + + +RGIIDG ++ +F L
Sbjct: 1340 KEAVFKRLQIVQTRLIRHLDHFAGLNPRGFRTVRND-LVSRAMNRGIIDGEIIERFGALK 1398
Query: 607 LGERLEICKKIGSKHNDILDELYDIEAL 634
L E+ I K GS N IL L +++ +
Sbjct: 1399 LDEQDSIGKLAGSDRNTILINLNNLKGI 1426
>gi|68471460|ref|XP_720278.1| likely Cleavage and Polyadenylation Specificity Factor subunit
fragment [Candida albicans SC5314]
gi|46442138|gb|EAL01430.1| likely Cleavage and Polyadenylation Specificity Factor subunit
fragment [Candida albicans SC5314]
Length = 758
Score = 165 bits (417), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 138/595 (23%), Positives = 250/595 (42%), Gaps = 85/595 (14%)
Query: 59 FKKLKVLFVSDRSKRANEQPGLPRGVRISQ-MRYFSNIAGYQGVFLCGPHPAWLFLTSRG 117
FKK K L ++ A P G I + + YF N+ G+ +F+ G P + T
Sbjct: 196 FKKEKDLTITGAPDNA-----FPYGTSIERRLVYFPNLNGFTSIFVTGVIPYLILKTVHS 250
Query: 118 ELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLK 177
R + +S ++ F + G ++ + + RI LP +Y+ P++ V +
Sbjct: 251 IPRIFQFSKIAAMS-ISAFSDSKIKNGLIFLDNQQNARICELPLDFNYEFNLPMKHVDIG 309
Query: 178 CTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLF 237
+ +AYH + T + T P Y + E K + +D + P + + + L
Sbjct: 310 ESIKSIAYHETSDTVVLSTFKQIP---YDCLDEEGKPIAGIIKDIKDTPAMSFKGSIKLV 366
Query: 238 SPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSG-----------------LRGYIA 280
SP++W I L + E + LK++ ++ G+ SG R YI
Sbjct: 367 SPYNWTVIET--IELEDNEVGMTLKSMILDV-GSESGSTLGSDPNSLIKKYNKKKREYIP 423
Query: 281 LGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFL 340
+G ED+ G +++II+++PEPG+P T +K K I+ +E +G +T+IC ++G
Sbjct: 424 IGIGKYRMEDLAANGIFKIYEIIDIIPEPGKPETNHKFKEIFKEETRGAITSICELSGRF 483
Query: 341 VTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRT 400
+ + GQK+ + L+D+ +AF+DT VY++ S NL+++GD + L+ + E
Sbjct: 484 LVSQGQKVIVRDLQDDGTVPVAFLDTPVYVSESKSFGNLLILGDLLKGCWLVGFDAEPFR 543
Query: 401 LSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILD 460
+ ++ +D + I + D +
Sbjct: 544 MIMLGKD----------------------------------------TQHISVECADFII 563
Query: 461 EFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFK---IRCKPS 517
+ +++D + + L Y P+ +S G +L+ K F L ++ I + S
Sbjct: 564 NDDEIFVLVADNNNVLHLLNYDPDDPQSINGTKLLTKASFELNSTISCLRSLPLIDIEES 623
Query: 518 SISDA-----------PGARSRFLTWYASL-DGALGFFLPLPEKNYRRLLMLQNVMVTHT 565
+DA P S + S DG+ P+ E YRR+ +LQ ++
Sbjct: 624 VQTDALTNIAVPPPLPPNTTSNYFQVIGSTQDGSFFNVFPINEAAYRRMYILQQQLIDKE 683
Query: 566 SHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 620
H GLNPR R K ++ I+D L+ F +LS + + K+ K
Sbjct: 684 FHYCGLNPRLNRIGSIKLQNNETNTKPILDYDLIRSFTKLSDDRKRNLANKVSGK 738
>gi|255720869|ref|XP_002545369.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
gi|240135858|gb|EER35411.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
Length = 1351
Score = 164 bits (415), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 137/572 (23%), Positives = 247/572 (43%), Gaps = 77/572 (13%)
Query: 59 FKKLKVLFVSDRSKRANEQPGLPRGVRISQ-MRYFSNIAGYQGVFLCGPHPAWLFLTSRG 117
F+K K L ++ + A P G I + + YF N+ G+ +FL G P + T
Sbjct: 827 FRKEKDLTITGAPENA-----FPYGTSIERRLVYFPNLNGFTCIFLTGVIPYLILKTIHA 881
Query: 118 ELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLK 177
R T P +++ F + G ++ + + RI LP +Y+ P++ VP+
Sbjct: 882 IPRIFQFT-KIPAVSISAFSDSKIKNGLIFLDNEQNARICELPLDYNYEFNLPMKHVPIG 940
Query: 178 CTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFH--VS 235
+ +AYH + C+V ST + Y E+ +L+ + + P + F +
Sbjct: 941 ESIKAMAYHEASD--CVVVSTFKEIP--YNCVDEEGKLIVGVMEDK---PAATSFKGSIK 993
Query: 236 LFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGT-----LSGLRGYIALGTNYNYSED 290
L SP++W I L + E + LK++ ++ + R YI +GT ED
Sbjct: 994 LISPYNWSVI--DTIELDDNEVGMSLKSMVLDIGSSSLIKKFKNKREYIVVGTGKYRMED 1051
Query: 291 VTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYI 350
+ G +F+II+++PEPG+P T +K K + + KG VT++C ++G + + GQK+ +
Sbjct: 1052 LAANGAFKIFEIIDIIPEPGKPETNHKFKETFQENIKGAVTSVCELSGRFLVSQGQKVIV 1111
Query: 351 WQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKP 410
L+D+ +AF+DT VY++ S NL+++GD + L+ + E + ++ +D +
Sbjct: 1112 RDLQDDGTVPVAFLDTPVYVSESKSFGNLLILGDPLKGCWLIGFDAEPFRMIMLGKDTQ- 1170
Query: 411 TQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMIS 470
LS+ C K +++ +++
Sbjct: 1171 ---------------------------HLSVE-----CADFIIKDDEVY-------ILVA 1191
Query: 471 DKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFL 530
D + + L Y P+ +S G +L+ K F L + S + P + F
Sbjct: 1192 DNNNVLHLLNYDPDDPQSINGTKLLTKASFELASPI----------SCLRTLPIDDNNFQ 1241
Query: 531 TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPS 590
+ DG+ P+ E YRR+ +LQ + H GLNPR R G N +
Sbjct: 1242 IIGSCQDGSFFNVFPINESTYRRMYILQQQLTEKEYHYCGLNPRLNRV--GNLALKDNDT 1299
Query: 591 --RGIIDGSLVWKFLQLSLGERLEICKKIGSK 620
+ I+D L+ F +L+ + K+ K
Sbjct: 1300 NIKPILDYGLIRIFAKLNTDRKKAFANKVSGK 1331
>gi|238881599|gb|EEQ45237.1| conserved hypothetical protein [Candida albicans WO-1]
Length = 1423
Score = 164 bits (414), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 138/595 (23%), Positives = 251/595 (42%), Gaps = 85/595 (14%)
Query: 59 FKKLKVLFVSDRSKRANEQPGLPRGVRISQ-MRYFSNIAGYQGVFLCGPHPAWLFLTSRG 117
FKK K L ++ A P G I + + YF N+ G+ +F+ G P + T
Sbjct: 861 FKKEKDLTITGAPDNA-----FPYGTSIERRLVYFPNLNGFTSIFVTGVIPYLILKTVHS 915
Query: 118 ELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLK 177
R + +S ++ F + G ++ + + RI LP +Y+ P++ V +
Sbjct: 916 IPRIFQFSKIAAMS-ISAFSDSKIKNGLIFLDNQQNARICELPLDFNYEFNLPMKHVDIG 974
Query: 178 CTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLF 237
+ +AYH + T + T P Y + E K + +D + P + + + L
Sbjct: 975 ESIKSIAYHETSDTVVLSTFKQIP---YDCLDEEGKPIAGIIKDIKDTPAMSFKGSIKLV 1031
Query: 238 SPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGL-----------------RGYIA 280
SP++W I L + E + LK++ ++ G+ SG R YI
Sbjct: 1032 SPYNWTVIET--IELEDNEVGMTLKSMILDV-GSESGSTLGSDPNSLIKKYNKKKREYIV 1088
Query: 281 LGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFL 340
+G ED+ G +++II+++PEPG+P T +K K I+ +E +G +T+IC ++G
Sbjct: 1089 IGIGKYRMEDLAANGIFKIYEIIDIIPEPGKPETNHKFKEIFKEETRGAITSICELSGRF 1148
Query: 341 VTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRT 400
+ + GQK+ + L+D+ +AF+DT VY++ S NL+++GD + L+ + E
Sbjct: 1149 LVSQGQKVIVRDLQDDGTVPVAFLDTPVYVSESKSFGNLLILGDPLKGCWLVGFDAEPFR 1208
Query: 401 LSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILD 460
+ ++ +D + I + D +
Sbjct: 1209 MIMLGKD----------------------------------------TQHISVECADFII 1228
Query: 461 EFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFK---IRCKPS 517
+ +++D + + L Y P+ +S G +L+ K F L ++ I + S
Sbjct: 1229 NDDEIFVLVADNNNVLHLLNYDPDDPQSINGTKLLTKASFELNSTISCLRSLPLIDIEES 1288
Query: 518 SISDA-----------PGARSRFLTWYASL-DGALGFFLPLPEKNYRRLLMLQNVMVTHT 565
+DA P S + S DG+ P+ E YRR+ +LQ ++
Sbjct: 1289 VQTDAFTNIVVPPTLPPNTTSNYFQVIGSTQDGSFFNVFPINEAAYRRMYILQQQLIDKE 1348
Query: 566 SHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 620
H GLNPR R K ++ I+D L+ +F +LS + + K+ K
Sbjct: 1349 FHYCGLNPRLNRIGSIKLQNNETNTKPILDYDLIRRFTKLSDDRKRNLANKVSGK 1403
>gi|68471006|ref|XP_720510.1| likely Cleavage and Polyadenylation Specificity Factor subunit
[Candida albicans SC5314]
gi|74591422|sp|Q5AFT3.1|CFT1_CANAL RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two protein
1
gi|46442380|gb|EAL01670.1| likely Cleavage and Polyadenylation Specificity Factor subunit
[Candida albicans SC5314]
Length = 1420
Score = 163 bits (413), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 138/595 (23%), Positives = 250/595 (42%), Gaps = 85/595 (14%)
Query: 59 FKKLKVLFVSDRSKRANEQPGLPRGVRISQ-MRYFSNIAGYQGVFLCGPHPAWLFLTSRG 117
FKK K L ++ A P G I + + YF N+ G+ +F+ G P + T
Sbjct: 858 FKKEKDLTITGAPDNA-----FPYGTSIERRLVYFPNLNGFTSIFVTGVIPYLILKTVHS 912
Query: 118 ELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLK 177
R + +S ++ F + G ++ + + RI LP +Y+ P++ V +
Sbjct: 913 IPRIFQFSKIAAMS-ISAFSDSKIKNGLIFLDNQQNARICELPLDFNYEFNLPMKHVDIG 971
Query: 178 CTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLF 237
+ +AYH + T + T P Y + E K + +D + P + + + L
Sbjct: 972 ESIKSIAYHETSDTVVLSTFKQIP---YDCLDEEGKPIAGIIKDIKDTPAMSFKGSIKLV 1028
Query: 238 SPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSG-----------------LRGYIA 280
SP++W I L + E + LK++ ++ G+ SG R YI
Sbjct: 1029 SPYNWTVIET--IELGDNEVGMTLKSMILDV-GSESGSTLGSDPNSLIKKYNKKKREYIV 1085
Query: 281 LGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFL 340
+G ED+ G +++II+++PEPG+P T +K K I+ +E +G +T+IC ++G
Sbjct: 1086 IGIGKYRMEDLAANGIFKIYEIIDIIPEPGKPETNHKFKEIFKEETRGAITSICELSGRF 1145
Query: 341 VTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRT 400
+ + GQK+ + L+D+ +AF+DT VY++ S NL+++GD + L+ + E
Sbjct: 1146 LVSQGQKVIVRDLQDDGTVPVAFLDTPVYVSESKSFGNLLILGDLLKGCWLVGFDAEPFR 1205
Query: 401 LSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILD 460
+ ++ +D + I + D +
Sbjct: 1206 MIMLGKD----------------------------------------TQHISVECADFII 1225
Query: 461 EFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFK---IRCKPS 517
+ +++D + + L Y P+ +S G +L+ K F L ++ I + S
Sbjct: 1226 NDDEIFVLVADNNNVLHLLNYDPDDPQSINGTKLLTKASFELNSTISCLRSLPLIDIEES 1285
Query: 518 SISDA-----------PGARSRFLTWYASL-DGALGFFLPLPEKNYRRLLMLQNVMVTHT 565
+DA P S + S DG+ P+ E YRR+ +LQ ++
Sbjct: 1286 VQTDALTNIAVPPPLPPNTTSNYFQVIGSTQDGSFFNVFPINEAAYRRMYILQQQLIDKE 1345
Query: 566 SHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 620
H GLNPR R K ++ I+D L+ F +LS + + K+ K
Sbjct: 1346 FHYCGLNPRLNRIGSIKLQNNETNTKPILDYDLIRSFTKLSDDRKRNLANKVSGK 1400
>gi|322704830|gb|EFY96421.1| Cleavage factor two protein 1 [Metarhizium anisopliae ARSEF 23]
Length = 1433
Score = 163 bits (412), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 161/645 (24%), Positives = 282/645 (43%), Gaps = 84/645 (13%)
Query: 17 VQELLTVSLG-LHGNRPLLLVR------TQHELLIYQAFRHPKGALKLRFKKLKVLFVSD 69
+ E+L LG P L+VR T +E + YQA + + L FKK ++
Sbjct: 837 ITEILVADLGDAISQTPYLIVRHASDDLTIYEPVRYQAEGDAELSASLLFKKCVNTSLAK 896
Query: 70 RSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDG- 128
+ +E P R +R +N+ GY VFL P+++ +S E R M + G
Sbjct: 897 TAPEVSEDDAEPP--RFVPLRRCANVNGYGAVFLPNASPSFVLKSSHSEPRV--MGLQGL 952
Query: 129 PVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSY-DAPWPVRKVPLKCTPHFLAYHL 187
V ++ FH C RGF+Y + + R++ LP++ + + V+K+ L ++YH
Sbjct: 953 GVRGMSTFHTEGCDRGFIYVDMEGIARVTQLPSNANLTELGVSVKKIALDGDVGMISYHH 1012
Query: 188 ETKTYCIVTSTAE----PSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWE 243
T TY + + E P D Y KE +++ PP +++ + L +P +W
Sbjct: 1013 PTGTYVVGCTKLEQFELPRDDDYH-----KEWA---KETSNFPPTMARGILKLINPVTWT 1064
Query: 244 EIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDII 303
I + L E + +K + +E + +A+GT + ED+ RGR+ +FDI+
Sbjct: 1065 VIHE--LELEPCESIESMKTLHLEVSEETKERKMLVAVGTALSKGEDLPTRGRVQVFDIV 1122
Query: 304 EVVPEPGQPLTKNKIKMIYAKEQ--KGPVTAICHVA--GFLVTAVGQKIYIWQLK-DNDL 358
V+PEPG+P T ++K+I AKE+ +G VTA+ V G ++ A GQK + LK D L
Sbjct: 1123 TVIPEPGRPETNKRLKLI-AKEEIPRGGVTALSEVGAQGLMLVAQGQKCMVRGLKEDGSL 1181
Query: 359 TGIAFIDTEVYIASMVSV--KNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSK 416
+AF+D ++AS+ + L ++ D + + Y E T ++ + S
Sbjct: 1182 LPVAFLDMSCHVASVKELPGTGLCVMADVFKGLWFAGYTEEPYTFKILGK--------SS 1233
Query: 417 GYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNV 476
G K+ D L + + + D + ++
Sbjct: 1234 G--------------------------------KLPLLAADFLPDGEDLSMVAVDAEGDL 1261
Query: 477 VLFMYQPEARESNGGHRLIKKTDFHLGQHV--NTFFKIRCKPSSISDAPGARSRFLTWYA 534
+ + PE +S GH L+ +T F + + +T R S A + S + A
Sbjct: 1262 HILEFNPEHPKSLQGHLLLHRTSFAVTPNTPSSTLLLPRTHSPSYPQASSSSSSHMLLLA 1321
Query: 535 SLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAG-----NP 589
G L PL E YRRLL + N + GGL+ +A R G +
Sbjct: 1322 CPSGQLAALSPLAESTYRRLLSVTNQLHPAIVPHGGLHSKAHRYPDQSSVAVGVETAASS 1381
Query: 590 SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEAL 634
R ++DG+++ ++ +L +R ++ + G ++ + D D+E +
Sbjct: 1382 GRALVDGTVLARWSELGAAKRTDVALRGG--YDSVADLRDDLEGV 1424
>gi|50288865|ref|XP_446862.1| hypothetical protein [Candida glabrata CBS 138]
gi|74609915|sp|Q6FSD2.1|CFT1_CANGA RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two protein
1
gi|49526171|emb|CAG59795.1| unnamed protein product [Candida glabrata]
Length = 1361
Score = 162 bits (411), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 151/574 (26%), Positives = 256/574 (44%), Gaps = 87/574 (15%)
Query: 81 PRGV----RISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPF 136
P+GV RI M Y N GY +F+ G P + M D + + PF
Sbjct: 841 PKGVSGIERI--MHYIPNFDGYSVIFVTGNTPYII------------MKEDDSLPRIFPF 886
Query: 137 HN---VNCPR----GFLYFNAKSELRI-SVLPTHLSYDAPWPVRKVPLKC------TPHF 182
N V+ R + + RI S+ ++ Y P+RK+ + T +
Sbjct: 887 GNIPIVSMSRWGEGSVICIDDIKNARIYSLNQDNIYYGNKLPIRKIKIGSMLQNYKTLNS 946
Query: 183 LAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFH--VSLFSPF 240
+ YH T+ Y +V+ T E S Y+ ED L+ + P F V L +P
Sbjct: 947 IVYHERTQLY-LVSYTKEIS---YEAKAEDGSLLIGYKPEL---PNAKAFKSGVLLINPK 999
Query: 241 SWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLF 300
SWE I + + P + V +K+ ++ + R YI +G Y EDV G ++
Sbjct: 1000 SWEVIDELDLPDNSL--VNDMKSSFIQIDTRTKRKREYIIVGIGYATMEDVPPTGEFHIY 1057
Query: 301 DIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLK-DNDLT 359
DI EVVPEPG+P T K+K I+ ++ +G V+ + ++G + + QKI + ++ DN +
Sbjct: 1058 DITEVVPEPGKPNTNFKLKEIFKEDIRGIVSVVNGISGRFLISQSQKIMVRDVQQDNSVI 1117
Query: 360 GIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYY 419
+AF+D V++ S+ + NLI++GD + I + + E
Sbjct: 1118 PVAFLDVPVFVTSLKTFGNLIVIGDAMQGIQFVGFDAE---------------------- 1155
Query: 420 AGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLF 479
P R I GS + KF +S+ + L + F+++D+D + +
Sbjct: 1156 ---PYRMITLGSSITKFEVISV---------------EFLVNNGDIYFLVTDRDSIMHVL 1197
Query: 480 MYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGA 539
Y P+ + G RL+ + F+L N + D +RS F T A +DG+
Sbjct: 1198 KYAPDQPNTLSGQRLVHCSSFNLHSLNNCTMLLPKNDEFPRDQRYSRS-FQTITAQVDGS 1256
Query: 540 LGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLV 599
+ +P+ E+ YRRL +Q ++ GLNPR R K Y+ G+ R ++D +++
Sbjct: 1257 ISKIVPVKEETYRRLYFIQQQIIDKEPQLAGLNPRMERQ-DNKYYHLGHSLRPMLDFNII 1315
Query: 600 WKFLQLSLGERLEICKKIGSKHN-DILDELYDIE 632
+F +S+ R I +K+G N ++ +L D+E
Sbjct: 1316 KRFKDMSMNRRSHIVQKLGKNSNLEVWRDLIDLE 1349
>gi|241954348|ref|XP_002419895.1| subunit of the mRNA cleavage and polyadenylation factor, putative
[Candida dubliniensis CD36]
gi|223643236|emb|CAX42110.1| subunit of the mRNA cleavage and polyadenylation factor, putative
[Candida dubliniensis CD36]
Length = 1420
Score = 162 bits (410), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 136/593 (22%), Positives = 251/593 (42%), Gaps = 82/593 (13%)
Query: 59 FKKLKVLFVSDRSKRANEQPGLPRGVRISQ-MRYFSNIAGYQGVFLCGPHPAWLFLTSRG 117
FKK K L ++ A P G I + + YF N+ G+ +F+ G P + T
Sbjct: 859 FKKEKDLTITGAPDNA-----FPYGTSIERRLVYFPNLNGFTSIFVTGVIPYLILKTIHS 913
Query: 118 ELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLK 177
R + V +++ F + G ++ + + RI LP +Y+ P++ V +
Sbjct: 914 IPRIFQFS-KIAVMSISAFSDSKIKNGLIFLDNQQNARICELPLDFNYEFNLPMKHVDIG 972
Query: 178 CTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLF 237
+ +AYH + T + T P Y + E K + ++ + P + + V L
Sbjct: 973 ESIKSIAYHETSDTVVLSTFKQIP---YECLDEEGKPIAGIIKNIKDTPAISFKGSVKLV 1029
Query: 238 SPFSWEEIPQTNFPLHEWEHVLCLK----NVSMEYEGTL------------SGLRGYIAL 281
SP++W I N L + E + +K +V E + T+ R YI +
Sbjct: 1030 SPYNWTVIE--NIELGDNEVGMTIKSMILDVGSESKSTVGTDPNSLIKKYNKKKREYIVI 1087
Query: 282 GTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLV 341
G ED+ G +++II+++PEPG+P T +K K I+ ++ +G +T+IC ++G +
Sbjct: 1088 GIGKYRMEDLAANGIFKIYEIIDIIPEPGKPETNHKFKEIFKEDTRGAITSICELSGRFL 1147
Query: 342 TAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTL 401
+ GQK+ + L+D+ +AF+DT VY++ S NL+++GD + L+ + E +
Sbjct: 1148 VSQGQKVIVRDLQDDGTVPVAFLDTPVYVSESKSFGNLVILGDPLKGCWLVGFDAEPFRM 1207
Query: 402 SLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDE 461
++ +D + I + D +
Sbjct: 1208 IMLGKD----------------------------------------TQHISVECADFIIN 1227
Query: 462 FSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFF-----KIRCKP 516
+ +++D + + L Y P+ +S G +L+ K F L ++ I K
Sbjct: 1228 DDEIFVLVADNNNVLHLLNYDPDDPQSINGTKLLTKASFELNSTISCLRSLPLKDIDEKV 1287
Query: 517 SSISDAPGA---------RSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSH 567
+ +DA A ++ F ++ DG+ P+ E YRR+ +LQ ++ H
Sbjct: 1288 QNETDAAAAATIPLPNNTQNNFQVIGSTQDGSFFNVFPINEAAYRRMYILQQQLIDKEFH 1347
Query: 568 TGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 620
GLNPR R K ++ I+D L+ +F +LS + K+ K
Sbjct: 1348 YCGLNPRLNRIGSIKLQNNETNTKPILDYDLIRRFTKLSDDRKRNFANKVSGK 1400
>gi|406699110|gb|EKD02327.1| cleavage and polyadenylation specific protein [Trichosporon asahii
var. asahii CBS 8904]
Length = 1339
Score = 161 bits (408), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 156/630 (24%), Positives = 275/630 (43%), Gaps = 82/630 (13%)
Query: 17 VQELLTVSLGLHGNRP-LLLVRTQHELLIYQA--------FRHPKGALKLRFKKLKVLFV 67
V ++L +G RP ++++ L IY+A + +L +RF+K+ +
Sbjct: 773 VSQMLFCPIGTRTLRPHVIVLHRSGRLNIYEAQPRFTVDARDQSRRSLAVRFRKVHTQLL 832
Query: 68 SDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTID 127
S + +P F++I G G F+ G P W+ + +HP+
Sbjct: 833 SVTPSSTVKPAAIP----------FTDIEGLTGAFITGERPHWIISSD-----SHPIRAF 877
Query: 128 GPVSTLAPFHNV--NCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAY 185
G F G + + I +P L+ D P + ++ T +A+
Sbjct: 878 GLKQAAYAFCKTTHQGGHGEYFLRIEDGSFICYMPPTLNTDFAMPCDRYKMERTYTHVAF 937
Query: 186 HLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEI 245
+ Y + + P Y E+ E++ P +PP + + LFS S
Sbjct: 938 DPPSCHYVAAAAMSVPFQAY----DEEGEILLGPEGPDLLPPKNERSSIELFSAGSEPFR 993
Query: 246 PQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEV 305
+ + E VLC+++V++E + +G R +IA+GT N+ ED G + +F+++EV
Sbjct: 994 VLDGYDFDQNEEVLCVESVTLESSSSPTGFRDFIAVGTGKNFGEDRATSGAVYVFEVVEV 1053
Query: 306 V-PEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLK-DNDLTGIAF 363
V +PG ++ ++K + PV+AI ++ G++V + G KI L D+ L G+AF
Sbjct: 1054 VGTKPG--VSNWRLKYRCKDPTRNPVSAIANINGYIVHSNGPKILAKGLDYDDRLMGLAF 1111
Query: 364 IDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNP 423
+D +Y+ S+ KNLILVGD+ +S+ Q + RD
Sbjct: 1112 LDVSMYVTSIRVFKNLILVGDFVKSLIFASLQENPYKFVTIGRD---------------- 1155
Query: 424 SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQP 483
LSL D L + F+ +D+ N+ L + P
Sbjct: 1156 ------------LADLSL------------TAADFLVHEGQVTFITNDQHGNMRLVDFDP 1191
Query: 484 EARESNGGHRLIKKTDFHLGQHVNTFFKI-RCKPSSISDAPGARSRFLTWYASLDGALGF 542
+S G +L+ +T+F G V I R K + AP +S+ + YA+ DGA+
Sbjct: 1192 ANPDSLNGEKLLTQTEFGTGCPVTASCMIARRKTAEEEFAP--QSQLI--YATADGAITS 1247
Query: 543 FLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNP-SRGIIDGSLVWK 601
+ + E ++RL ++Q+ +V + H GLNPRAFRT + P +RG++DG L+
Sbjct: 1248 VVAVKEARFKRLQLVQDQLVRNAQHVAGLNPRAFRTVRND--LVPRPLARGVLDGGLLAH 1305
Query: 602 FLQLSLGERLEICKKIGSKHNDILDELYDI 631
F L + E+ ++IG+ + +LY +
Sbjct: 1306 FALQPLRRQREMMRQIGTDAVTVGSDLYTL 1335
>gi|401889164|gb|EJT53104.1| cleavage and polyadenylation specific protein [Trichosporon asahii
var. asahii CBS 2479]
Length = 1358
Score = 161 bits (408), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 156/630 (24%), Positives = 275/630 (43%), Gaps = 82/630 (13%)
Query: 17 VQELLTVSLGLHGNRP-LLLVRTQHELLIYQA--------FRHPKGALKLRFKKLKVLFV 67
V ++L +G RP ++++ L IY+A + +L +RF+K+ +
Sbjct: 792 VSQMLFCPIGTRTLRPHVIVLHRSGRLNIYEAQPRFTVDARDQSRRSLAVRFRKVHTQLL 851
Query: 68 SDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTID 127
S + +P F++I G G F+ G P W+ + +HP+
Sbjct: 852 SVTPSSTVKPAAIP----------FTDIEGLTGAFITGERPHWIISSD-----SHPIRAF 896
Query: 128 GPVSTLAPFHNV--NCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAY 185
G F G + + I +P L+ D P + ++ T +A+
Sbjct: 897 GLKQAAYAFCKTTHQGGHGEYFLRIEDGSFICYMPPTLNTDFAMPCDRYKMERTYTHVAF 956
Query: 186 HLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEI 245
+ Y + + P Y E+ E++ P +PP + + LFS S
Sbjct: 957 DPPSCHYVAAAAMSVPFQAY----DEEGEILLGPEGPDLLPPKNERSSIELFSAGSEPFR 1012
Query: 246 PQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEV 305
+ + E VLC+++V++E + +G R +IA+GT N+ ED G + +F+++EV
Sbjct: 1013 VLDGYDFDQNEEVLCVESVTLESSSSPTGFRDFIAVGTGKNFGEDRATSGAVYVFEVVEV 1072
Query: 306 V-PEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLK-DNDLTGIAF 363
V +PG ++ ++K + PV+AI ++ G++V + G KI L D+ L G+AF
Sbjct: 1073 VGTKPG--VSNWRLKYRCKDPTRNPVSAIANINGYIVHSNGPKILAKGLDYDDRLMGLAF 1130
Query: 364 IDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNP 423
+D +Y+ S+ KNLILVGD+ +S+ Q + RD
Sbjct: 1131 LDVSMYVTSIRVFKNLILVGDFVKSLIFASLQENPYKFVTIGRD---------------- 1174
Query: 424 SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQP 483
LSL D L + F+ +D+ N+ L + P
Sbjct: 1175 ------------LADLSL------------TAADFLVHEGQVTFITNDQHGNMRLVDFDP 1210
Query: 484 EARESNGGHRLIKKTDFHLGQHVNTFFKI-RCKPSSISDAPGARSRFLTWYASLDGALGF 542
+S G +L+ +T+F G V I R K + AP +S+ + YA+ DGA+
Sbjct: 1211 ANPDSLNGEKLLTQTEFGTGCPVTASCMIARRKTAEEEFAP--QSQLI--YATADGAITS 1266
Query: 543 FLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNP-SRGIIDGSLVWK 601
+ + E ++RL ++Q+ +V + H GLNPRAFRT + P +RG++DG L+
Sbjct: 1267 VVAVKEARFKRLQLVQDQLVRNAQHVAGLNPRAFRTVRND--LVPRPLARGVLDGGLLAH 1324
Query: 602 FLQLSLGERLEICKKIGSKHNDILDELYDI 631
F L + E+ ++IG+ + +LY +
Sbjct: 1325 FALQPLRRQREMMRQIGTDAVTVGSDLYTL 1354
>gi|33411764|emb|CAD58787.1| cleavage and polyadenylation specificity factor 1 [Bos taurus]
Length = 180
Score = 161 bits (407), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 75/168 (44%), Positives = 113/168 (67%), Gaps = 7/168 (4%)
Query: 457 DILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKP 516
D + + + +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF++ C+
Sbjct: 11 DFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRG 70
Query: 517 SSISDAPGARS-----RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGL 571
++ + P +S + +TW+A+LDG +G LP+ EK YRRLLMLQN + T H GL
Sbjct: 71 AA--EGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGL 128
Query: 572 NPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGS 619
NPRAFR N R ++DG L+ ++L LS ER E+ KKIG+
Sbjct: 129 NPRAFRMLHVDRRVLQNAVRNVLDGELLNRYLYLSTMERGELAKKIGT 176
>gi|402219312|gb|EJT99386.1| hypothetical protein DACRYDRAFT_17537 [Dacryopinax sp. DJM-731 SS1]
Length = 1620
Score = 160 bits (404), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 152/630 (24%), Positives = 278/630 (44%), Gaps = 102/630 (16%)
Query: 33 LLLVRTQHELLIYQA----FRHPKGALKLRFKKLKVLFVSDRSKRANEQPGLPRGVRI-- 86
LL+ T+ L +Y+A + RF K+ F +D + A + LP R+
Sbjct: 1064 LLVYYTEGRLAVYEATPRTATEADSTFQYRFTKVATHF-ADAEQHAAIRQMLPEARRLQL 1122
Query: 87 ----SQMRYFSNI-AGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGP-VSTLAPFHNVN 140
+ + + S+I G VF G P W+ + + L+ + P V + P
Sbjct: 1123 PSRRALIPFISDIKGGTSAVFQRGEEPCWIMASRQNGLQI--IAYSSPLVYSFTPTSLFG 1180
Query: 141 CPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETK-TYC------ 193
F+ + + P + +D P L C H+ +K +Y
Sbjct: 1181 NSGDFILYGEEG-------PVLMEFDEE-PDTGRELSC------RHIHSKRSYTSMAVDP 1226
Query: 194 ---IVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNF 250
+V + + + + ++ E++ L P + P+ + L SP + + + +F
Sbjct: 1227 GSNLVAAASSLKSFFLLYDDEEQPLWV-PESTALFGPMAECSSLELVSPDTCQTLDGYDF 1285
Query: 251 PLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPG 310
+E+ +V+ K+V++E + SG + YIA+GT+ ED+ RG +F++IEVV P
Sbjct: 1286 APNEFINVV--KSVNLETLSSESGFKDYIAVGTSTFRGEDLAVRGATYIFEVIEVVSYPD 1343
Query: 311 QPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQL-KDNDLTGIAFIDTEVY 369
PL ++K++ E K PV AIC + G+LV++ G K+++ +D L G+AF+D V
Sbjct: 1344 DPLPPYRLKLLCRDEAKAPVNAICGLNGYLVSSQGFKVFVRAFEQDERLVGVAFMDAGVC 1403
Query: 370 IASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIID 429
+ S+ +KNL+L+GD RS++ + +Q + L +PT +
Sbjct: 1404 VTSLTRLKNLLLIGDAKRSVSFVAFQEDPFKL-------RPTYVTDAAF----------- 1445
Query: 430 GSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESN 489
+ DE + +D + + LF + P +
Sbjct: 1446 ----------------------------LFDE-GDFSILAADDEGTLRLFEFDPNLTGAT 1476
Query: 490 GGHRLIKKTDFHLGQHVNTFF-----KIRCKPSSISDAPGARSRFLTWYASLDGALGFFL 544
G+ LI +T+F+ GQ +T + R P + P A+ F ++DG LG
Sbjct: 1477 HGNPLICETEFN-GQSEHTHILAIAGRGREDPEEMQ-IPEAQLIF----GTIDGTLGTIS 1530
Query: 545 PLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQ 604
P+P++ ++RL +L ++ H GLNPRAFRT + + ++G++D L+ F +
Sbjct: 1531 PVPDECFKRLQLLSGQLMRSVQHFAGLNPRAFRTVRND-LLSRPLNKGMLDYDLLHAFRE 1589
Query: 605 LSLGERLEICKKIGSKHNDILDELYDIEAL 634
L + + I K+IG+ IL ++ +E +
Sbjct: 1590 LDIRRQATITKQIGTDTITILRDIRSLEEI 1619
>gi|255718033|ref|XP_002555297.1| KLTH0G05984p [Lachancea thermotolerans]
gi|238936681|emb|CAR24860.1| KLTH0G05984p [Lachancea thermotolerans CBS 6340]
Length = 1307
Score = 159 bits (402), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 138/513 (26%), Positives = 231/513 (45%), Gaps = 64/513 (12%)
Query: 129 PVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHF------ 182
P+ +LAP+ L + RI L T SY PV++VP++ ++
Sbjct: 843 PLVSLAPWGT----DSVLCVDDIKNARIVTLDTTFSYGNRLPVKRVPIEDPLNYYGCLNN 898
Query: 183 LAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVS-QFHVSLFSPFS 241
+AYH + Y IV+ T E Y+ E+ E DS +P Q V L +P S
Sbjct: 899 VAYHERSGMY-IVSYTKEIE---YEAISEEGEKTVGSDDS--VPHARGFQSGVLLLNPKS 952
Query: 242 WEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFD 301
W I + ++ + + +K + ++ R Y+ +G + ED+ G L+D
Sbjct: 953 WNIIDKADYEKNSL--INDMKTMLIQTNSRTRRKREYLVVGNTFVRDEDIGTMGSFCLYD 1010
Query: 302 IIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLK-DNDLTG 360
I EVVPEPG+P T K+K I+ +E +G V+++C ++G + + QK+ + ++ DN +
Sbjct: 1011 ITEVVPEPGKPDTNYKLKQIFYEEFRGAVSSVCEISGRFLISQSQKVLVRDVQEDNSVVP 1070
Query: 361 IAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYA 420
+AF+D V++ S NL+++GD + + + E
Sbjct: 1071 VAFLDVPVFVTDSKSCGNLLIIGDAMQGFQFVGFDAE----------------------- 1107
Query: 421 GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFM 480
P R I G V KF +SL + L S+ F++SD+ + +
Sbjct: 1108 --PYRMIPLGKSVSKFEVMSL---------------EFLVNNGSIYFLVSDRSNILHILK 1150
Query: 481 YQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGAL 540
Y P+ S G +L+ T F+L NT K+ K G F A DG+L
Sbjct: 1151 YAPDEPNSLSGQKLVHCTSFNL-HSTNTCMKLLLKNDEFP-TLGEPPAFQAIGAQTDGSL 1208
Query: 541 GFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVW 600
+PL E +YRRL M+Q ++ H GLNP+ R + Y G+ R ++D +++
Sbjct: 1209 FNVVPLSESSYRRLYMVQQQLIEKDVHLCGLNPKMER-LQNDFYQLGHLMRPMLDFTVIK 1267
Query: 601 KFLQLSLGERLEICKKIGSKHN-DILDELYDIE 632
F L L +R +I K G + + +I +L ++E
Sbjct: 1268 SFATLPLNKRKQIAAKAGRQADFEIWRDLINVE 1300
>gi|422295485|gb|EKU22784.1| cleavage and polyadenylation specificity factor subunit 1
[Nannochloropsis gaditana CCMP526]
Length = 395
Score = 158 bits (399), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 102/363 (28%), Positives = 170/363 (46%), Gaps = 45/363 (12%)
Query: 278 YIALGTNYNYS--EDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICH 335
Y+A+GT + EDV +GR+L++ I + P G + ++ GP TAI
Sbjct: 70 YLAVGTCTVRAKGEDVPSKGRLLMYRI-SLDPYAGLTSPPTLTLVDQYSQRSGPPTAIAQ 128
Query: 336 VAGFLVTAVGQKIYIWQLKDND-LTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
+ ++ A G ++++ + L IAF D + Y+ S+ VK L+ V D S+ LLR+
Sbjct: 129 LGPHIIIAAGPTLWVYAFSAREKLKPIAFYDADFYVVSLRVVKTLVAVTDAYHSVHLLRW 188
Query: 395 QPE--YRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 452
TL L+ +DY P + + G
Sbjct: 189 HEHDPAHTLELMGKDYSPI-----------------------------------VSAQPG 213
Query: 453 SKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 512
H + + S+G ++ D N+ L Y P ES GG+RL+++ DFHL ++
Sbjct: 214 GSH--FVVDPPSLGMLVGDSRGNLQLLQYDPADVESRGGNRLVRRADFHLSHRLSFLQHT 271
Query: 513 RCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 572
R A A R + + S++G +G +P+ EK YRRL LQ VMV H G N
Sbjct: 272 RMAEVPRPGAYRAGVRVMV-FGSVEGGVGALVPVEEKVYRRLYALQAVMVNALPHVGAFN 330
Query: 573 PRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIE 632
PR FR + +G+ G +G +DG L+W+F LS+G++ ++ IG+ +L+ L +++
Sbjct: 331 PRGFRLVEARGWAQGR-KKGTLDGELLWRFAGLSVGKQEDLASAIGTSREMVLESLLEVD 389
Query: 633 ALS 635
++
Sbjct: 390 MMT 392
>gi|353234640|emb|CCA66663.1| related to cleavage and polyadenylation specificity factor, 160 kDa
subunit [Piriformospora indica DSM 11827]
Length = 1324
Score = 156 bits (395), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 151/644 (23%), Positives = 278/644 (43%), Gaps = 81/644 (12%)
Query: 15 TIVQELLTVSLGLHGNRPLLLVRTQHELLIY--------------QAFRHPKGALKLRFK 60
T +Q+++ LG P L+V LLI Q R +L++ F
Sbjct: 734 TTLQQVIITDLGEIEPSPHLIVLYDSNLLIVYQMVPLEPDKAGLPQLDRRSVPSLRISFV 793
Query: 61 KLKVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQG-----VFLCGPHPAWLFLTS 115
K V +++ + N+ G R+ + ++ ++G F+ G +PAW+ +
Sbjct: 794 KRMVHHLANPTPDENQTSGGSNEKRLPKTIVPFSVLDWEGNSIYGAFVTGDNPAWILSKN 853
Query: 116 RGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVP 175
L P + V + P + FL + + P +++ +P K
Sbjct: 854 HSGLLHLPCGYEA-VHSFTPCSMWDFSPTFLMSTEEGSCLVQWTP-GITFHGQYPCSKTR 911
Query: 176 LKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVS 235
T +AY + T ++ + + D+ F+ E +P P + +
Sbjct: 912 KGRTQTNIAY---SNTTGLLVAASSNDRDFLLFDEEGTN-SWEPDGVNVSLPKLGASALE 967
Query: 236 LFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRG 295
L P +W I F +E +++ ++V +E T +G + +IA+GT+ + ED+ RG
Sbjct: 968 LLDPETWVTIDGYEFAANEVVNIV--ESVKLETLSTQTGNKEFIAVGTSIHRGEDLAVRG 1025
Query: 296 RILLFDIIEVVPEPGQ-PLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLK 354
+F+I EV+ + + ++++K++ E KGPVTA+C + G+LV+++GQKI++
Sbjct: 1026 GTYIFEIAEVIQDTEERGRRRHRLKLLCKDEAKGPVTAVCGMNGYLVSSMGQKIFVRAFD 1085
Query: 355 -DNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQ- 412
D L G+AF+D VY+ S+ +KNL+++ D + + + +Q + L +++++ +PT
Sbjct: 1086 LDERLVGVAFLDAGVYVTSIRCLKNLLVITDAIKGVWFVAFQEDPFKLVILSKEVRPTSI 1145
Query: 413 PNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDK 472
P ++A HND M + D
Sbjct: 1146 PQGDFFFA----------------------------------HND-------MELLTIDL 1164
Query: 473 DKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC---KPSSISDAPGARSRF 529
+ L Y P ++ G RL+ +F HV +R +PSS S + +R
Sbjct: 1165 RGVLRLHSYDPTHVDTEEGARLLCSVEFQ--THVEPVTIVRVAMEQPSSDSASDASR--- 1219
Query: 530 LTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNP 589
+DG+L PL ++RL +LQ +V HT H LNP+A+R +G
Sbjct: 1220 -LLIPRVDGSLASLSPLDMDIFKRLYLLQAQLVRHTHHIAALNPKAYRAVQGSS-TTRTM 1277
Query: 590 SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEA 633
SR ++D L+ F +LS + I +IG ++ + +EA
Sbjct: 1278 SRRMLDFGLLVGFKKLSFDRQQGIANQIGETWETLIRDCTQLEA 1321
>gi|452979579|gb|EME79341.1| hypothetical protein MYCFIDRAFT_104419, partial [Pseudocercospora
fijiensis CIRAD86]
Length = 1342
Score = 154 bits (390), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 154/642 (23%), Positives = 265/642 (41%), Gaps = 106/642 (16%)
Query: 19 ELLTVSLGLHG-NRPLLLVRT-QHELLIYQAFRHPKGALK------LRFKKLKVLFVSDR 70
E+L LG HG +P L++RT ++++Y+ F +P+ + + LRF+K+ +
Sbjct: 748 EVLVSDLGQHGVTQPYLVLRTAMDDVVLYEPFHYPQTSGRKSWHQDLRFRKVPFSHIPKY 807
Query: 71 SKR-ANEQPGLPRGVRISQMRYFSNIA---GYQGVFLCGPH--PAWLFLTSRGELRAHPM 124
S+ A Q P ++ ++ +S IA + L P P L + EL
Sbjct: 808 SESIAESQSARPPPLKSVKIDTYSAIAIPGAPPCLLLKEPSTLPKVLEIRQSAELNR--- 864
Query: 125 TIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHF-- 182
+S L P + V C GF NA EL LP + Y W V +VP+
Sbjct: 865 -----LSMLCPINRVGCENGFFMINADEELEEQQLPLNTWYGTGWSVHQVPIGHPNQIED 919
Query: 183 ---LAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSP 239
+AYH E Y + T D+Y F ED +D + P V Q++V L S
Sbjct: 920 VRRIAYHEERGLYVVATCR---EVDFY-FAEEDGR--HPEQDDITLRPKVPQYNVHLISA 973
Query: 240 FSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILL 299
S I + P + L+ + +E + + + ED+ +G + +
Sbjct: 974 ISHHIIDTVHMPY--LAAITDLQVMMLEASENTHEQKPLVVVSAAAQRGEDMPAKGTLYV 1031
Query: 300 FDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAIC--HVAGFLVTAVGQKIYIWQLK-DN 356
+DII+VVP+P + K+ + +E +G +TA+ GF+ TA G K+ I +K D
Sbjct: 1032 YDIIDVVPDPDIAESGVKLHQLAREENRGAITALAGPFPGGFIGTAQGLKVMIRGMKEDG 1091
Query: 357 DLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSK 416
+AF+D + Y + T P
Sbjct: 1092 SCLPVAFLDAQSYTHVL------------------------------------KTLPGRG 1115
Query: 417 GYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILD-EF----SSMGFMISD 471
+ AG+ +G+ G + R+ + K H +++ EF ++ ++ D
Sbjct: 1116 MWLAGDAWKGLWFGGFTEEPY------RVTVLGKAPKMHMEVMSAEFLPFDGALYIVVLD 1169
Query: 472 KDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFL- 530
D ++ + Y PE +S G RL+ ++ FH+G + PS+++ + +
Sbjct: 1170 ADCDMHVLQYDPENPKSLNGMRLLHRSTFHIGHFTTNSMLL---PSTLASFAAQQHEMMN 1226
Query: 531 --------------TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAF 576
+S GA+G PL E+ YRRL LQ + + H GLNPRA+
Sbjct: 1227 GGSKAEVKPDPLQHVLTSSTSGAIGLITPLDEQAYRRLSALQTHLTSILEHAAGLNPRAY 1286
Query: 577 RTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 618
R+ + + + +RG++DG LV + +L R ++ + G
Sbjct: 1287 RSIESESFGG---ARGVVDGLLVRRIHELGAARRADVLGRAG 1325
>gi|378734083|gb|EHY60542.1| histone H2A [Exophiala dermatitidis NIH/UT8656]
Length = 1361
Score = 153 bits (387), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 157/586 (26%), Positives = 261/586 (44%), Gaps = 84/586 (14%)
Query: 19 ELLTVSLGLHGNR-PLLLVRT-QHELLIYQAFRHPKGALKLRFKKLKVLFVSDRSKRANE 76
E+L LG +R P L+VR +++IY++F P RFKK V + E
Sbjct: 775 EVLLADLGNSTDRQPYLVVRNLVGDVIIYESFAMPDVLGSFRFKK--VFTKAAGELEDGE 832
Query: 77 QPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPF 136
+ G P ++ M+ +N+AG+ VF+ G P + + R + + + ++
Sbjct: 833 EVGQPSTLQ--PMQAVTNVAGHASVFIPGRQPLLIMREASTMPRVYELN-PTKLKSMNSV 889
Query: 137 HNVNCPRGFLYFNAKSELRISVLPTHLSYD-APWPVRKVPLKCTPHFLAYHLETKTYCIV 195
H C +G + +A E++ +P + W +R+VPL +AY T +Y +
Sbjct: 890 HTGTCRQGLVLVDADDEIKFCNIPDSTVLGLSDWVIRRVPLGQDITSVAYFAPTDSYILA 949
Query: 196 TS-TAE---PSTDYY--KFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTN 249
T+ T E P D + ++ GE ++F+P + Q + L S + I Q +
Sbjct: 950 TNHTTEFQLPQDDEWHPEWQGEA---------TKFLPSSI-QSSLKLLSAKTHSIISQYS 999
Query: 250 FPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEP 309
F E VLCL+++++E + I +GT E+VT RG + +FD+++VVPEP
Sbjct: 1000 F--DACERVLCLESLNLEVSEETHERKDLIVVGTAIVKGENVTTRGNLYIFDVVDVVPEP 1057
Query: 310 GQPLTKNKIKMIYAKEQKGPVTAICHVA--GFLVTAVGQKIYIWQLK-DNDLTGIAFIDT 366
+P + KIK+I ++ +G V+A+C + GFL+ A GQK + LK D + +AF+D
Sbjct: 1058 DRPESDLKIKLITKEDVRGAVSALCDIGSQGFLLAAQGQKSMVRGLKEDMSILPVAFLDM 1117
Query: 367 E--VYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPS 424
V++A + L ++GD + L+ Y E L ++ RD + +P
Sbjct: 1118 RYYVHVARELPGTGLCILGDAFSGLWLVGYSEEPYKLQILGRDLE------------DPP 1165
Query: 425 RGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPE 484
L +F L G++L I SD D + + Y PE
Sbjct: 1166 ------VLAAEF--LPDGKQLYIIS--------------------SDDDGLLRVLQYDPE 1197
Query: 485 ARESNGGHRLIKKTDFHLGQHVNTFFKI-------RCKPSSI-----SDAPGARSRFLTW 532
++ G +L+ ++ FH G + R + I S A A R
Sbjct: 1198 NPKAERGTKLLLRSTFHSGAAPTKMILLPPQVASGRGRDPEIDMDVDSGAGPAAGRHRIL 1257
Query: 533 YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTS-HTGGLNPRAFR 577
+ +G+L PL E YRRL LQ ++T H LNPRA+R
Sbjct: 1258 VTTQEGSLCMLTPLSEATYRRLSALQTTLLTTLDFHPCSLNPRAYR 1303
>gi|302652143|ref|XP_003017931.1| hypothetical protein TRV_08063 [Trichophyton verrucosum HKI 0517]
gi|291181517|gb|EFE37286.1| hypothetical protein TRV_08063 [Trichophyton verrucosum HKI 0517]
Length = 429
Score = 152 bits (384), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 122/456 (26%), Positives = 206/456 (45%), Gaps = 70/456 (15%)
Query: 182 FLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDS--RFIPPLVSQFHVSLFSP 239
F Y L KT + A+ +K ED E T+ R+ F+P L + V L P
Sbjct: 7 FCLYDLPNKTDNTLDRIAKED---FKLP-EDDESHTEWRNEFITFLPQL-ERGTVKLLEP 61
Query: 240 FSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILL 299
+W I + L E + C++ + +E + + +G++ ED+ +G I +
Sbjct: 62 RNWSTI--DSHELEPAERITCIEVIRLEISELTHERKDMVVVGSSIVKGEDIVPKGFIRV 119
Query: 300 FDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA--GFLVTAVGQKIYIWQLK-DN 356
F++I+VVPEP QP K+K+ +E KG VTA+ + GFL+ A GQK + LK D
Sbjct: 120 FEVIDVVPEPDQPEKSKKLKLFAKEEVKGAVTALSGIGGQGFLIVAQGQKCMVRGLKEDG 179
Query: 357 DLTGIAFIDTEVYIASMVSVK--NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPN 414
L +AF DT+ Y+ + +K + ++GD + + + Y E L L ++
Sbjct: 180 SLLPVAFKDTQCYVNVLKELKGTGMCIIGDAFKGLWFIGYSEEPYKLDLFGKE------- 232
Query: 415 SKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDK 474
N + ++D D L + + + +++D D
Sbjct: 233 -------NENLAVVDA--------------------------DFLPDGNKLYILVADDDC 259
Query: 475 NVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI---RCKPSSISDA--------P 523
N+ + Y PE S+ G RL+ ++ FH G +T + PSS D P
Sbjct: 260 NLHVLQYDPEDPSSSKGDRLLHRSVFHTGHFASTMTLLPHGARTPSSPVDEDAMDTDSPP 319
Query: 524 GARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKG 583
++ + L + + G++ PL E +YRRLL LQ+ +V H LNPR +R + G
Sbjct: 320 PSKYQILMTFQT--GSVAVITPLGEDSYRRLLALQSQLVNALEHPCSLNPRGYRAVESDG 377
Query: 584 YYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGS 619
RG+IDG+L+ ++L + + EI ++G+
Sbjct: 378 MGG---QRGMIDGNLLLRWLDMGAQRKAEIAGRVGA 410
>gi|302403950|ref|XP_002999813.1| cft-1 [Verticillium albo-atrum VaMs.102]
gi|261361315|gb|EEY23743.1| cft-1 [Verticillium albo-atrum VaMs.102]
Length = 1349
Score = 152 bits (383), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 158/656 (24%), Positives = 268/656 (40%), Gaps = 131/656 (19%)
Query: 5 RSHSPSAMDETIVQEL-LTVSLGLHGNRPLLLVRTQHELLIYQAFR-----HPKG-ALKL 57
R SP + E +V +L + S H L+L ++ IY+ FR KG A L
Sbjct: 787 RGTSPETLTEILVADLGDSTSASAH----LILRHANDDMTIYEPFRIGGQEERKGLATSL 842
Query: 58 RFKKLKVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRG 117
FKK+ ++ A E + R+ +R NI GY VF+ G P+++ +S+
Sbjct: 843 FFKKVSNSHLAKSPVEAAEDEAVQEN-RVIPLRACDNIGGYSTVFVPGASPSFILKSSKS 901
Query: 118 ELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLK 177
+ + G V+ ++ FH C RGF+Y ++K R++ P DA
Sbjct: 902 TPKVIGLQGLG-VNGMSSFHTEGCERGFIYADSKGCARVTQFP-----DA---------- 945
Query: 178 CTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLF 237
+ AE D + KE ++ +PP+ + L+
Sbjct: 946 ------------------ANVAELGVD----DDYHKEWA---KEECPMPPMKEHGSIKLY 980
Query: 238 SPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRI 297
SP +W I + F L ++E +C+K + +E R A+GT ED+ RGRI
Sbjct: 981 SPITWNVIDE--FELEQYEVAMCMKTLLLEVSEETKERRMLFAVGTAILRGEDLPVRGRI 1038
Query: 298 LLFDIIEVVPEPGQPLTKNKIKMIYAKEQ--KGPVTAICHVAGFLVTAVGQKIYIWQLKD 355
L+FD++ V+P+P +P T K+K+I AKE+ +G VT++C +K + L+
Sbjct: 1039 LVFDVVHVIPQPDRPETDRKLKLI-AKEEIPRGAVTSLC-----------EKCMVRGLRR 1086
Query: 356 NDLTGIAFIDTEVYIASMVSVKNL--ILVGDYARSIALLRYQPEYRTLSLVARDYKPTQP 413
+A D Y+ ++ ++N L+ D + + Y E ++L +
Sbjct: 1087 WHAAAVALPDLSTYVVAVHELRNTGYCLMADANMGVWFVGYSEEPYRMTLFGKS------ 1140
Query: 414 NSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKD 473
G +L+ D L + + + SD+D
Sbjct: 1141 ----------------------------GTQLKCLTA------DFLVAGNDLSIVASDED 1166
Query: 474 KNVVLFMYQPEARESNGGHRLIKKTDFHLGQH----VNTFFKIRCKP-----SSISDAPG 524
+ + + PE S GH L+ + F + + + +P ++A G
Sbjct: 1167 GVLHILQFDPEHPRSLQGHLLLNRASFSVAPNHAWVTLALPRTTTRPYLPQSEPATNAAG 1226
Query: 525 ARSRFLT-WYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKG 583
+++R T AS GA+ P+ E YRRL L + H G+NP+A R G
Sbjct: 1227 SQNRTQTLLLASASGAIASLNPITEHAYRRLTSLTTSLANALPHAAGMNPKAHRLPPQDG 1286
Query: 584 YYAGNP-------SRGIIDGSLVWKFLQLSLGERLEICKKIG-SKHNDILDELYDI 631
A P R I+DG+L+ ++ +L +R E K G + D+ EL D+
Sbjct: 1287 --AARPPAVDVSAGRTIVDGALLARWNELGARQRAEAAGKGGFASAADVRGELEDV 1340
>gi|71654693|ref|XP_815961.1| cleavage and polyadenylation specificity factor [Trypanosoma cruzi
strain CL Brener]
gi|50363265|gb|AAT75335.1| cleavage polyadenylation specificity factor CPSF160 [Trypanosoma
cruzi]
gi|70881056|gb|EAN94110.1| cleavage and polyadenylation specificity factor, putative
[Trypanosoma cruzi]
Length = 1436
Score = 151 bits (382), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 151/637 (23%), Positives = 260/637 (40%), Gaps = 103/637 (16%)
Query: 32 PLLLVRTQHELLIYQAFRHPKGALKLRFKKLKVLFVSDRSKRANEQPGLPRGVRISQMRY 91
PL L ++ H L ++A R +++++ K+L+ +R N+ + + R ++
Sbjct: 861 PLRLKKSMHHFLDHKAEREVIESIEMKRKRLQ----RERGVVENDTQLMRQYSR--RIVP 914
Query: 92 FSNIAGYQGVFLCGPHPAWLFLTSRG-ELRAHPMTIDGPVSTLAPFHNVN-----CPRGF 145
F I G G ++CG HP +LF R EL A+ GPV PF +N C GF
Sbjct: 915 FDAIGGNTGAYVCGQHPLFLFWDRRTRELEAYRHQTLGPVRGFVPFRIINSGYIYCCEGF 974
Query: 146 LYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEP---- 201
+ F + + PT W R++ L TPHF+ YH ++ +VTS EP
Sbjct: 975 VDF---ASMDTYCRPT----GQGWLTRRIHLGVTPHFVVYHPPARSCFVVTSKKEPFRPQ 1027
Query: 202 --------STDYYKFNGEDKELVTDPRDSRFIP---------PLVSQFHVSLFSPFSWEE 244
+ Y + +G + + T+ S P P+ +F + L S W
Sbjct: 1028 RAPFDVQLNIVYDEESGGVQSITTEAPVSNMPPIAPNAGIRVPMADRFEIRLMSTTDWA- 1086
Query: 245 IPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRG--YIALGTNYNYSEDVTCRGRILLFDI 302
L E E VL + + ++ E GL + T + ED+TCRGRILL
Sbjct: 1087 -CTDTLLLEENERVLGAQMMEIQCERDAEGLHTAPVCVVSTAFPLGEDITCRGRILLLAT 1145
Query: 303 IEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQL--KDNDLTG 360
I K KI + +++ GP TA+ + + AVG I +++ + L
Sbjct: 1146 I-------CTKKKRKIVLFHSEPLNGPATAVVGIRHHIAVAVGGTIKLFRFDWSNRKLVV 1198
Query: 361 IAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYA 420
A + Y+ M S +N ++ GD +RS A+ R+ E TLS++ +D
Sbjct: 1199 GALLYAGTYVTRMSSFRNYLIYGDLSRSCAIARFNEENHTLSVLGKDR------------ 1246
Query: 421 GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFM 480
+ H D++ + G + SD ++N+++
Sbjct: 1247 ----------------------------NAVSVVHCDMMYHDRAFGLLCSDDERNLLVMG 1278
Query: 481 YQPEARESNGG--HRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDG 538
Y P +E+ G +++++ G++ C S+ A + +T Y + G
Sbjct: 1279 YTPRVQETEAGSPNKVLESVLSLDGEY---RLSGGCLVKSLRFRSLAGNSSVTLYVTNYG 1335
Query: 539 ALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTY-KGKGYYAGNPSRGIIDGS 597
+GF +P+ E+ R L + H+ GL PR F +G A ++ S
Sbjct: 1336 EIGFIVPIGEQANRTASWLMRRLQIDLPHSAGLTPRMFLGLSQGSPRTAMRAKEMLVSAS 1395
Query: 598 LVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEAL 634
L+ +F L + R K I S L+ + ++ +L
Sbjct: 1396 LLNEFFFLDIHSR----KTIASAAYTQLERVTNVASL 1428
>gi|401841121|gb|EJT43641.1| CFT1-like protein [Saccharomyces kudriavzevii IFO 1802]
Length = 1355
Score = 150 bits (379), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 131/542 (24%), Positives = 231/542 (42%), Gaps = 72/542 (13%)
Query: 89 MRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYF 148
M YF + GY +F+ G P + + + P+ ++ P++ R +
Sbjct: 849 MHYFPDYNGYSVIFVTGSVPYIIIKEDDTTPKIFKFA-NIPLVSVTPWN----ERSVMCV 903
Query: 149 NAKSELRISVLP-THLSYDAPWPVRKVPLKC------TPHFLAYHLETKTYCIVTSTAEP 201
+ R+ L ++ Y +P++++ + T + YH + + + + P
Sbjct: 904 DDIKNARVYTLTINNMYYGNKFPLKQIKISNVLDDYKTLQKIVYHEKAQLFLVSYCKRIP 963
Query: 202 STDYYKFNGEDKELVTDPRDSRFIPPLVSQFH--VSLFSPFSWEEIPQTNFPLHEWEHVL 259
Y+ GED E VT + P F + L +P SW+ I + +FP + V
Sbjct: 964 ----YEALGEDGEKVTGYDEK---APHAEGFQGGILLINPKSWKVIDKIDFPKNSV--VN 1014
Query: 260 CLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIK 319
+++ ++ R YI G +ED G ++D+IEVVPEPG+P T K+K
Sbjct: 1015 EMRSSMIQINSKTKRKREYIVAGVANATTEDTPPTGSFYIYDVIEVVPEPGKPDTNYKLK 1074
Query: 320 MIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLK-DNDLTGIAFIDTEVYIASMVSVKN 378
I+ +E G V+ +C ++G + + QK+ + ++ DN + +AF+D V++ S N
Sbjct: 1075 EIFQEEVNGTVSTVCEISGRFMISQSQKVLVRDIQEDNSVIPVAFLDIPVFVTDSKSFGN 1134
Query: 379 LILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQ 438
L+++GD + + + E P R I+ G V KF
Sbjct: 1135 LLIIGDAMQGFQFIGFDAE-------------------------PYRMILLGRSVSKFQT 1169
Query: 439 LSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKT 498
+SL + L M F +D D+NV + Y P+ S G RL+ +
Sbjct: 1170 MSL---------------EFLVNGGDMYFAATDADRNVHILKYAPDEPNSLSGQRLVHCS 1214
Query: 499 DFHLGQHVNTFFKIRCKPSSI--SDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLM 556
F + +N+ + K S P F +DG++ +PL E+ YRRL +
Sbjct: 1215 SFTV-HSINSCMMLLPKNQEFGSSQVPS----FQNVGGQVDGSVFKIVPLSEETYRRLYL 1269
Query: 557 LQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKK 616
+Q ++ GGLNPR R Y G+ R ++D +++ +F LS+ R +K
Sbjct: 1270 IQQQIIDRELQLGGLNPRMER-LANDFYQMGHSMRPMLDFNVIRRFSGLSIDRRKNTAQK 1328
Query: 617 IG 618
G
Sbjct: 1329 AG 1330
>gi|407850337|gb|EKG04765.1| cleavage and polyadenylation specificity factor, putative
[Trypanosoma cruzi]
Length = 1436
Score = 149 bits (376), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 150/637 (23%), Positives = 259/637 (40%), Gaps = 103/637 (16%)
Query: 32 PLLLVRTQHELLIYQAFRHPKGALKLRFKKLKVLFVSDRSKRANEQPGLPRGVRISQMRY 91
PL L ++ H L ++A R +++++ K+L+ +R N+ + + R ++
Sbjct: 861 PLRLKKSMHHFLDHKAEREVIESIEMKRKRLQ----RERGVVENDTQLMRQYSR--RIVP 914
Query: 92 FSNIAGYQGVFLCGPHPAWLFLTSRG-ELRAHPMTIDGPVSTLAPFHNVN-----CPRGF 145
F I G G ++CG HP +LF R EL A+ GPV PF +N C GF
Sbjct: 915 FDAIGGNAGAYVCGQHPLFLFWDRRTRELEAYRHQTLGPVRGFVPFRIINSGYIYCCEGF 974
Query: 146 LYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEP---- 201
+ F + + PT W R++ L TPHF+ YH ++ +VTS EP
Sbjct: 975 VDF---ASMDTYCRPT----GQGWLTRRIHLGVTPHFVVYHPPARSCFVVTSKKEPFRPQ 1027
Query: 202 --------STDYYKFNGEDKELVTDPRDSRFIP---------PLVSQFHVSLFSPFSWEE 244
Y + +G + + T+ P P+ +F + L S W
Sbjct: 1028 RSPFDVQLKIVYDEESGGVQSITTEAPVCNMPPIAPNAGIRVPMADRFEIRLMSTTDWA- 1086
Query: 245 IPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRG--YIALGTNYNYSEDVTCRGRILLFDI 302
L E E VL + + ++ E GL + T + ED+TCRGRILL
Sbjct: 1087 -CTDTLLLEENERVLGAQMMEIQCEKDAEGLHTAPVCVVSTAFPLGEDITCRGRILLLAT 1145
Query: 303 IEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDND--LTG 360
+ K KI + +++ GP TA+ + + AVG I +++ N+ L
Sbjct: 1146 M-------CTKKKRKIVLFHSEPLNGPATAVVGIRHHIAVAVGGTIKLFRFDWNNRKLVV 1198
Query: 361 IAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYA 420
A + Y+ M S +N ++ GD +RS A+ R+ E TLS++ +D
Sbjct: 1199 GALLYAGTYVTRMSSFRNYLIYGDLSRSCAIARFNEENHTLSVLGKDR------------ 1246
Query: 421 GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFM 480
+ H D++ + G + SD ++N+++
Sbjct: 1247 ----------------------------NAVSVVHCDMMYHDRAFGLLCSDDERNLLVMG 1278
Query: 481 YQPEARESNGG--HRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDG 538
Y P +E+ G +++++ G++ C S+ A + +T Y + G
Sbjct: 1279 YTPRVQETEAGSPNKVLESVLSLDGEY---RLSGGCLVKSLRFRSLAGNSSVTLYVTNYG 1335
Query: 539 ALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTY-KGKGYYAGNPSRGIIDGS 597
+GF +P+ E+ R L + H+ GL PR F +G A ++ S
Sbjct: 1336 EIGFIVPIGEQANRTASWLMRRLQIDLPHSAGLTPRMFLGLSQGSPRTAMRAKEMLVSAS 1395
Query: 598 LVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEAL 634
L+ +F L + R K I S L+ + ++ +L
Sbjct: 1396 LLNEFFFLDIHSR----KTIASAAYTQLERVTNVASL 1428
>gi|398397855|ref|XP_003852385.1| hypothetical protein MYCGRDRAFT_100364 [Zymoseptoria tritici IPO323]
gi|339472266|gb|EGP87361.1| hypothetical protein MYCGRDRAFT_100364 [Zymoseptoria tritici IPO323]
Length = 1333
Score = 149 bits (376), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 156/653 (23%), Positives = 264/653 (40%), Gaps = 98/653 (15%)
Query: 6 SHSPSAMDETIVQELLTVSLGLHGN-RPLLLVRT-QHELLIYQAFRHPKGAL------KL 57
SH + ET+ ELL LG G +P L VRT ++++Y+ F A L
Sbjct: 718 SHRRMGVKETLT-ELLVADLGNDGVLQPYLTVRTAMDDVVLYEPFHSSPSASTGPWHSNL 776
Query: 58 RFKKLKVLFVSDRSKRANEQPGL-PRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSR 116
RF+K+ V ++ + E P P +R Q I GY V + G L +
Sbjct: 777 RFRKVPVPYIPKYNDSPLEDPNARPPALRRMQ------IGGYNTVSIPGAPSCLLLKEAS 830
Query: 117 GE---LRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRK 173
G L + + L P + + C GF + L LP + W +R+
Sbjct: 831 GPPKILEVNEPKRSNATTILTPLNRIGCENGFATVDVNGALHECQLPPDAWFSTGWSIRQ 890
Query: 174 VPLKCTPH---FLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVS 230
+ L LAYH + T T + D+Y F ED +D I P V
Sbjct: 891 IDLGDDAREVRHLAYHEARGIFVAATCT---TVDFY-FAEEDGR--HPEQDDISIRPQVP 944
Query: 231 QFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSED 290
Q+ V L S + + I P E V LK + E ++ + + T ED
Sbjct: 945 QYSVHLISAKTHKIIHTHKLPY--LETVTALKVMPAEVSELSHEVKPVVVVSTGAQRGED 1002
Query: 291 VTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLV-TAVGQKIY 349
+ +G +++FD+I+VVP+P + + ++ +E +G +TA+ G ++ TA G K+
Sbjct: 1003 MPAKGALIVFDVIDVVPDPDVEESGLHLHVLAREESRGAITALASFPGGMIGTAQGLKLM 1062
Query: 350 IWQLK-DNDLTGIAFIDTEVYIASMVSV--KNLILVGDYARSIALLRYQPEYRTLSLVAR 406
I ++ D +AF+D + Y + + ++ + L L GD + + + E L+L+ +
Sbjct: 1063 IRGMREDGSCLPVAFLDAQCYTSLLKTLDSRGLWLAGDAWKGLWFGGFTQEPYKLTLLGK 1122
Query: 407 DYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMG 466
+ +E+ + D L ++
Sbjct: 1123 SPR---------------------------------TEMEVIEA------DFLPFDGALF 1143
Query: 467 FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSIS------ 520
++ D D ++ + Y PE +S G RL+ ++ FH+G + PS+++
Sbjct: 1144 LLVLDADADLHVLQYDPENPKSLNGQRLLHRSTFHIGHFPTGSMLL---PSTLAPFTEQA 1200
Query: 521 -DAPGARSR-----------FLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHT 568
D P S F + G++G PL E YRRL LQ + H
Sbjct: 1201 RDLPNGDSEDTKQEEVNSPLFHVLTTTSSGSIGLITPLDESTYRRLSALQGHLTNILEHA 1260
Query: 569 GGLNPRAFRT---YKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 618
GLNPR +RT K G ++G++DGSL+ + +L R ++ ++G
Sbjct: 1261 AGLNPRMYRTDTEMKATDSEMGG-AKGVVDGSLIRRISELGAARRADVLSRVG 1312
>gi|349577352|dbj|GAA22521.1| K7_Cft1p [Saccharomyces cerevisiae Kyokai no. 7]
Length = 1357
Score = 147 bits (372), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 137/556 (24%), Positives = 239/556 (42%), Gaps = 71/556 (12%)
Query: 89 MRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYF 148
M YF + GY +F+ G P L + + P+ ++ P+ R +
Sbjct: 851 MHYFPDYNGYSVIFVTGSVPYILIKEDDSTPKIFKFG-NIPLVSVTPW----SERSVMCV 905
Query: 149 NAKSELRISVLPT-HLSYDAPWPVRKVPLKC------TPHFLAYHLETKTYCIVTSTAEP 201
+ R+ L T ++ Y P++++ + T L YH + + + P
Sbjct: 906 DDIKNARVYTLTTDNMYYGNKLPLKQIKISNVLDDYKTLQKLVYHERAQLFLVSYCKRVP 965
Query: 202 STDYYKFNGEDKELVTDPRDSRFIPPLVSQFH--VSLFSPFSWEEIPQTNFPLHEWEHVL 259
Y+ GED E V ++ P F + L +P SW+ I + +FP + V
Sbjct: 966 ----YEALGEDGEKVIGYDEN---VPHAEGFQSGILLINPKSWKVIDKIDFPKNSV--VN 1016
Query: 260 CLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIK 319
+++ ++ R YI G +ED G ++D+IEVVPEPG+P T K+K
Sbjct: 1017 EMRSSMIQINSKTKRKREYIIAGVANATTEDTPPTGAFHIYDVIEVVPEPGKPDTNYKLK 1076
Query: 320 MIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLK-DNDLTGIAFIDTEVYIASMVSVKN 378
I+ +E G V+ +C V+G + + QK+ + ++ DN + +AF+D V++ S N
Sbjct: 1077 EIFQEEVSGTVSTVCEVSGRFMISQSQKVLVRDIQEDNSVIPVAFLDIPVFVTDSKSFGN 1136
Query: 379 LILVGDYARSIALLRYQPE-YRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFL 437
L+++GD + + + E YR +SL G + KF
Sbjct: 1137 LLIIGDAMQGFQFIGFDAEPYRMISL--------------------------GRSMSKFQ 1170
Query: 438 QLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKK 497
+SL + L M F +D D+NV + Y P+ S G RL+
Sbjct: 1171 TMSL---------------EFLVNGGDMYFAATDADRNVHVLKYAPDEPNSLSGQRLVHC 1215
Query: 498 TDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLML 557
+ F L N+ + + +P S F +DG++ +PL E+ YRRL ++
Sbjct: 1216 SSFTL-HSTNSCMMLLPRNEEFG-SPQVPS-FQNVGGQVDGSVFKIVPLSEEKYRRLYVI 1272
Query: 558 QNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI 617
Q ++ GGLNPR R Y G+ R ++D +++ +F L++ R I +K
Sbjct: 1273 QQQIIDRELQLGGLNPRMER-LANDFYQMGHSMRPMLDFNVIRRFCGLAIDRRKSIAQKA 1331
Query: 618 G-SKHNDILDELYDIE 632
G + H + ++ +IE
Sbjct: 1332 GRNAHFEAWRDIINIE 1347
>gi|323309632|gb|EGA62840.1| Cft1p [Saccharomyces cerevisiae FostersO]
Length = 1357
Score = 147 bits (372), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 134/541 (24%), Positives = 231/541 (42%), Gaps = 70/541 (12%)
Query: 89 MRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYF 148
M YF + GY +F+ G P L + + P+ ++ P+ R +
Sbjct: 851 MHYFPDYNGYSVIFVTGSVPYILIKEDDSTPKIFKFG-NIPLVSVTPW----SERSVMCV 905
Query: 149 NAKSELRISVLPT-HLSYDAPWPVRKVPLKC------TPHFLAYHLETKTYCIVTSTAEP 201
+ R+ L T ++ Y P++++ + T L YH + + + P
Sbjct: 906 DDIKNARVYTLTTDNMYYGNKLPLKQIKISNVLDDYKTLQKLVYHERAQLFLVSYCKRVP 965
Query: 202 STDYYKFNGEDKELVTDPRDSRFIPPLVSQFH--VSLFSPFSWEEIPQTNFPLHEWEHVL 259
Y+ GED E V ++ P F + L +P SW+ I + +FP + V
Sbjct: 966 ----YEALGEDGEKVIGYDEN---VPHAEGFQSGILLINPKSWKVIDKIDFPKNSV--VN 1016
Query: 260 CLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIK 319
+++ ++ R YI G +ED G ++D+IEVVPEPG+P T K+K
Sbjct: 1017 EMRSSMIQINSKTKRKREYIIAGVANATTEDTPPTGAFHIYDVIEVVPEPGKPDTNYKLK 1076
Query: 320 MIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLK-DNDLTGIAFIDTEVYIASMVSVKN 378
I+ +E G V+ +C V+G + + QK+ + ++ DN + +AF+D V++ S N
Sbjct: 1077 EIFQEEVSGTVSTVCEVSGRFMISQSQKVLVRDIQEDNSVIPVAFLDIPVFVTDSKSFGN 1136
Query: 379 LILVGDYARSIALLRYQPE-YRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFL 437
L+++GD + + + E YR +SL G + KF
Sbjct: 1137 LLIIGDAMQGFQFIGFDAEPYRMISL--------------------------GRSMSKFQ 1170
Query: 438 QLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKK 497
+SL + L M F +D D+NV + Y P+ S G RL+
Sbjct: 1171 TMSL---------------EFLVNGGDMYFAATDADRNVHVLKYAPDEPNSLSGQRLVHC 1215
Query: 498 TDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLML 557
+ F L N+ + + +P S F +DG++ +PL E+ YRRL ++
Sbjct: 1216 SSFTL-HSTNSCMMLLPRNEEFG-SPQVPS-FQNVGGQVDGSVFKIVPLSEEKYRRLYVI 1272
Query: 558 QNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI 617
Q ++ GGLNPR R Y G+ R ++D +++ +F L++ R I +K
Sbjct: 1273 QQQIIDRELQLGGLNPRMER-LANDFYQMGHSMRPMLDFNVIRRFCGLAIDRRKSIAQKA 1331
Query: 618 G 618
G
Sbjct: 1332 G 1332
>gi|207346484|gb|EDZ72967.1| YDR301Wp-like protein [Saccharomyces cerevisiae AWRI1631]
Length = 1357
Score = 147 bits (372), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 134/541 (24%), Positives = 231/541 (42%), Gaps = 70/541 (12%)
Query: 89 MRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYF 148
M YF + GY +F+ G P L + + P+ ++ P+ R +
Sbjct: 851 MHYFPDYNGYSVIFVTGSVPYILIKEDDSTPKIFKFG-NIPLVSVTPW----SERSVMCV 905
Query: 149 NAKSELRISVLPT-HLSYDAPWPVRKVPLKC------TPHFLAYHLETKTYCIVTSTAEP 201
+ R+ L T ++ Y P++++ + T L YH + + + P
Sbjct: 906 DDIKNARVYTLTTDNMYYGNKLPLKQIKISNVLDDYKTLQKLVYHERAQLFLVSYCKRVP 965
Query: 202 STDYYKFNGEDKELVTDPRDSRFIPPLVSQFH--VSLFSPFSWEEIPQTNFPLHEWEHVL 259
Y+ GED E V ++ P F + L +P SW+ I + +FP + V
Sbjct: 966 ----YEALGEDGEKVIGYDEN---VPHAEGFQSGILLINPKSWKVIDKIDFPKNSV--VN 1016
Query: 260 CLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIK 319
+++ ++ R YI G +ED G ++D+IEVVPEPG+P T K+K
Sbjct: 1017 EMRSSMIQINSKTKRKREYIIAGVANATTEDTPPTGAFHIYDVIEVVPEPGKPDTNYKLK 1076
Query: 320 MIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLK-DNDLTGIAFIDTEVYIASMVSVKN 378
I+ +E G V+ +C V+G + + QK+ + ++ DN + +AF+D V++ S N
Sbjct: 1077 EIFQEEVSGTVSTVCEVSGRFMISQSQKVLVRDIQEDNSVIPVAFLDIPVFVTDSKSFGN 1136
Query: 379 LILVGDYARSIALLRYQPE-YRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFL 437
L+++GD + + + E YR +SL G + KF
Sbjct: 1137 LLIIGDAMQGFQFIGFDAEPYRMISL--------------------------GRSMSKFQ 1170
Query: 438 QLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKK 497
+SL + L M F +D D+NV + Y P+ S G RL+
Sbjct: 1171 TMSL---------------EFLVNGGDMYFAATDADRNVHVLKYAPDEPNSLSGQRLVHC 1215
Query: 498 TDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLML 557
+ F L N+ + + +P S F +DG++ +PL E+ YRRL ++
Sbjct: 1216 SSFTL-HSTNSCMMLLPRNEEFG-SPQVPS-FQNVGGQVDGSVFKIVPLSEEKYRRLYVI 1272
Query: 558 QNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI 617
Q ++ GGLNPR R Y G+ R ++D +++ +F L++ R I +K
Sbjct: 1273 QQQIIDRELQLGGLNPRMER-LANDFYQMGHSMRPMLDFNVIRRFCGLAIDRRKSIAQKA 1331
Query: 618 G 618
G
Sbjct: 1332 G 1332
>gi|323338222|gb|EGA79455.1| Cft1p [Saccharomyces cerevisiae Vin13]
gi|365766372|gb|EHN07870.1| Cft1p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
Length = 1357
Score = 147 bits (372), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 134/541 (24%), Positives = 231/541 (42%), Gaps = 70/541 (12%)
Query: 89 MRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYF 148
M YF + GY +F+ G P L + + P+ ++ P+ R +
Sbjct: 851 MHYFPDYNGYSVIFVTGSVPYILIKEDDSTPKIFKFG-NIPLVSVTPW----SERSVMCV 905
Query: 149 NAKSELRISVLPT-HLSYDAPWPVRKVPLKC------TPHFLAYHLETKTYCIVTSTAEP 201
+ R+ L T ++ Y P++++ + T L YH + + + P
Sbjct: 906 DDIKNARVYTLTTDNMYYGNKLPLKQIKISNVLDDYKTLQKLVYHERAQLFLVSYCKRVP 965
Query: 202 STDYYKFNGEDKELVTDPRDSRFIPPLVSQFH--VSLFSPFSWEEIPQTNFPLHEWEHVL 259
Y+ GED E V ++ P F + L +P SW+ I + +FP + V
Sbjct: 966 ----YEALGEDGEKVIGYDEN---VPHAEGFQSGILLINPKSWKVIDKIDFPKNSV--VN 1016
Query: 260 CLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIK 319
+++ ++ R YI G +ED G ++D+IEVVPEPG+P T K+K
Sbjct: 1017 EMRSSMIQINSKTKRKREYIIAGVANATTEDTPPTGAFHIYDVIEVVPEPGKPDTNYKLK 1076
Query: 320 MIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLK-DNDLTGIAFIDTEVYIASMVSVKN 378
I+ +E G V+ +C V+G + + QK+ + ++ DN + +AF+D V++ S N
Sbjct: 1077 EIFQEEVSGTVSTVCEVSGRFMISQSQKVLVRDIQEDNSVIPVAFLDIPVFVTDSKSFGN 1136
Query: 379 LILVGDYARSIALLRYQPE-YRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFL 437
L+++GD + + + E YR +SL G + KF
Sbjct: 1137 LLIIGDAMQGFQFIGFDAEPYRMISL--------------------------GRSMSKFQ 1170
Query: 438 QLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKK 497
+SL + L M F +D D+NV + Y P+ S G RL+
Sbjct: 1171 TMSL---------------EFLVNGGDMYFAATDADRNVHVLKYAPDEPNSLSGQRLVHC 1215
Query: 498 TDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLML 557
+ F L N+ + + +P S F +DG++ +PL E+ YRRL ++
Sbjct: 1216 SSFTL-HSTNSCMMLLPRNEEFG-SPQVPS-FQNVGGQVDGSVFKIVPLSEEKYRRLYVI 1272
Query: 558 QNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI 617
Q ++ GGLNPR R Y G+ R ++D +++ +F L++ R I +K
Sbjct: 1273 QQQIIDRELQLGGLNPRMER-LANDFYQMGHSMRPMLDFNVIRRFCGLAIDRRKSIAQKA 1331
Query: 618 G 618
G
Sbjct: 1332 G 1332
>gi|6320507|ref|NP_010587.1| Cft1p [Saccharomyces cerevisiae S288c]
gi|74583567|sp|Q06632.1|CFT1_YEAST RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two protein
1
gi|849213|gb|AAB64737.1| Ydr301wp [Saccharomyces cerevisiae]
gi|256271799|gb|EEU06830.1| Cft1p [Saccharomyces cerevisiae JAY291]
gi|285811316|tpg|DAA12140.1| TPA: Cft1p [Saccharomyces cerevisiae S288c]
gi|392300415|gb|EIW11506.1| Cft1p [Saccharomyces cerevisiae CEN.PK113-7D]
Length = 1357
Score = 147 bits (372), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 134/541 (24%), Positives = 231/541 (42%), Gaps = 70/541 (12%)
Query: 89 MRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYF 148
M YF + GY +F+ G P L + + P+ ++ P+ R +
Sbjct: 851 MHYFPDYNGYSVIFVTGSVPYILIKEDDSTPKIFKFG-NIPLVSVTPW----SERSVMCV 905
Query: 149 NAKSELRISVLPT-HLSYDAPWPVRKVPLKC------TPHFLAYHLETKTYCIVTSTAEP 201
+ R+ L T ++ Y P++++ + T L YH + + + P
Sbjct: 906 DDIKNARVYTLTTDNMYYGNKLPLKQIKISNVLDDYKTLQKLVYHERAQLFLVSYCKRVP 965
Query: 202 STDYYKFNGEDKELVTDPRDSRFIPPLVSQFH--VSLFSPFSWEEIPQTNFPLHEWEHVL 259
Y+ GED E V ++ P F + L +P SW+ I + +FP + V
Sbjct: 966 ----YEALGEDGEKVIGYDEN---VPHAEGFQSGILLINPKSWKVIDKIDFPKNSV--VN 1016
Query: 260 CLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIK 319
+++ ++ R YI G +ED G ++D+IEVVPEPG+P T K+K
Sbjct: 1017 EMRSSMIQINSKTKRKREYIIAGVANATTEDTPPTGAFHIYDVIEVVPEPGKPDTNYKLK 1076
Query: 320 MIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLK-DNDLTGIAFIDTEVYIASMVSVKN 378
I+ +E G V+ +C V+G + + QK+ + ++ DN + +AF+D V++ S N
Sbjct: 1077 EIFQEEVSGTVSTVCEVSGRFMISQSQKVLVRDIQEDNSVIPVAFLDIPVFVTDSKSFGN 1136
Query: 379 LILVGDYARSIALLRYQPE-YRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFL 437
L+++GD + + + E YR +SL G + KF
Sbjct: 1137 LLIIGDAMQGFQFIGFDAEPYRMISL--------------------------GRSMSKFQ 1170
Query: 438 QLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKK 497
+SL + L M F +D D+NV + Y P+ S G RL+
Sbjct: 1171 TMSL---------------EFLVNGGDMYFAATDADRNVHVLKYAPDEPNSLSGQRLVHC 1215
Query: 498 TDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLML 557
+ F L N+ + + +P S F +DG++ +PL E+ YRRL ++
Sbjct: 1216 SSFTL-HSTNSCMMLLPRNEEFG-SPQVPS-FQNVGGQVDGSVFKIVPLSEEKYRRLYVI 1272
Query: 558 QNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI 617
Q ++ GGLNPR R Y G+ R ++D +++ +F L++ R I +K
Sbjct: 1273 QQQIIDRELQLGGLNPRMER-LANDFYQMGHSMRPMLDFNVIRRFCGLAIDRRKSIAQKA 1331
Query: 618 G 618
G
Sbjct: 1332 G 1332
>gi|190404756|gb|EDV08023.1| 150 kDa protein associated with polyadenylation factor 1
[Saccharomyces cerevisiae RM11-1a]
gi|259145538|emb|CAY78802.1| Cft1p [Saccharomyces cerevisiae EC1118]
Length = 1357
Score = 147 bits (372), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 134/541 (24%), Positives = 231/541 (42%), Gaps = 70/541 (12%)
Query: 89 MRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYF 148
M YF + GY +F+ G P L + + P+ ++ P+ R +
Sbjct: 851 MHYFPDYNGYSVIFVTGSVPYILIKEDDSTPKIFKFG-NIPLVSVTPW----SERSVMCV 905
Query: 149 NAKSELRISVLPT-HLSYDAPWPVRKVPLKC------TPHFLAYHLETKTYCIVTSTAEP 201
+ R+ L T ++ Y P++++ + T L YH + + + P
Sbjct: 906 DDIKNARVYTLTTDNMYYGNKLPLKQIKISNVLDDYKTLQKLVYHERAQLFLVSYCKRVP 965
Query: 202 STDYYKFNGEDKELVTDPRDSRFIPPLVSQFH--VSLFSPFSWEEIPQTNFPLHEWEHVL 259
Y+ GED E V ++ P F + L +P SW+ I + +FP + V
Sbjct: 966 ----YEALGEDGEKVIGYDEN---VPHAEGFQSGILLINPKSWKVIDKIDFPKNSV--VN 1016
Query: 260 CLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIK 319
+++ ++ R YI G +ED G ++D+IEVVPEPG+P T K+K
Sbjct: 1017 EMRSSMIQINSKTKRKREYIIAGVANATTEDTPPTGAFHIYDVIEVVPEPGKPDTNYKLK 1076
Query: 320 MIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLK-DNDLTGIAFIDTEVYIASMVSVKN 378
I+ +E G V+ +C V+G + + QK+ + ++ DN + +AF+D V++ S N
Sbjct: 1077 EIFQEEVSGTVSTVCEVSGRFMISQSQKVLVRDIQEDNSVIPVAFLDIPVFVTDSKSFGN 1136
Query: 379 LILVGDYARSIALLRYQPE-YRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFL 437
L+++GD + + + E YR +SL G + KF
Sbjct: 1137 LLIIGDAMQGFQFIGFDAEPYRMISL--------------------------GRSMSKFQ 1170
Query: 438 QLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKK 497
+SL + L M F +D D+NV + Y P+ S G RL+
Sbjct: 1171 TMSL---------------EFLVNGGDMYFAATDADRNVHVLKYAPDEPNSLSGQRLVHC 1215
Query: 498 TDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLML 557
+ F L N+ + + +P S F +DG++ +PL E+ YRRL ++
Sbjct: 1216 SSFTL-HSTNSCMMLLPRNEEFG-SPQVPS-FQNVGGQVDGSVFKIVPLSEEKYRRLYVI 1272
Query: 558 QNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI 617
Q ++ GGLNPR R Y G+ R ++D +++ +F L++ R I +K
Sbjct: 1273 QQQIIDRELQLGGLNPRMER-LANDFYQMGHSMRPMLDFNVIRRFCGLAIDRRKSIAQKA 1331
Query: 618 G 618
G
Sbjct: 1332 G 1332
>gi|151942273|gb|EDN60629.1| cleavage factor II (CF II) component [Saccharomyces cerevisiae
YJM789]
Length = 1357
Score = 147 bits (371), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 134/541 (24%), Positives = 231/541 (42%), Gaps = 70/541 (12%)
Query: 89 MRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYF 148
M YF + GY +F+ G P L + + P+ ++ P+ R +
Sbjct: 851 MHYFPDYNGYSVIFVTGSVPYILIKEDDSTPKIFKFG-NIPLVSVTPW----SERSVMCV 905
Query: 149 NAKSELRISVLPT-HLSYDAPWPVRKVPLKC------TPHFLAYHLETKTYCIVTSTAEP 201
+ R+ L T ++ Y P++++ + T L YH + + + P
Sbjct: 906 DDIKNARVYTLTTDNMYYGNKLPLKQIKISNVHDDYKTLQKLVYHERAQLFLVSYCKRVP 965
Query: 202 STDYYKFNGEDKELVTDPRDSRFIPPLVSQFH--VSLFSPFSWEEIPQTNFPLHEWEHVL 259
Y+ GED E V ++ P F + L +P SW+ I + +FP + V
Sbjct: 966 ----YEALGEDGEKVIGYDEN---VPHAEGFQSGILLINPKSWKVIDKIDFPKNSV--VN 1016
Query: 260 CLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIK 319
+++ ++ R YI G +ED G ++D+IEVVPEPG+P T K+K
Sbjct: 1017 EMRSSMIQINSKTKRKREYIIAGVANATTEDTPPTGAFHIYDVIEVVPEPGKPDTNYKLK 1076
Query: 320 MIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLK-DNDLTGIAFIDTEVYIASMVSVKN 378
I+ +E G V+ +C V+G + + QK+ + ++ DN + +AF+D V++ S N
Sbjct: 1077 EIFQEEVSGTVSTVCEVSGRFMISQSQKVLVRDIQEDNSVIPVAFLDIPVFVTDSKSFGN 1136
Query: 379 LILVGDYARSIALLRYQPE-YRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFL 437
L+++GD + + + E YR +SL G + KF
Sbjct: 1137 LLIIGDAMQGFQFIGFDAEPYRMISL--------------------------GRSMSKFQ 1170
Query: 438 QLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKK 497
+SL + L M F +D D+NV + Y P+ S G RL+
Sbjct: 1171 TMSL---------------EFLVNGGDMYFAATDADRNVHVLKYAPDEPNSLSGQRLVHC 1215
Query: 498 TDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLML 557
+ F L N+ + + +P S F +DG++ +PL E+ YRRL ++
Sbjct: 1216 SSFTL-HSTNSCMMLLPRNEEFG-SPQVPS-FQNVGGQVDGSVFKIVPLSEEKYRRLYVI 1272
Query: 558 QNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI 617
Q ++ GGLNPR R Y G+ R ++D +++ +F L++ R I +K
Sbjct: 1273 QQQIIDRELQLGGLNPRMER-LANDFYQMGHSMRPMLDFNVIRRFCGLAIDRRKSIAQKA 1331
Query: 618 G 618
G
Sbjct: 1332 G 1332
>gi|407410979|gb|EKF33219.1| cleavage and polyadenylation specificity factor, putative
[Trypanosoma cruzi marinkellei]
Length = 1436
Score = 146 bits (369), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 149/637 (23%), Positives = 260/637 (40%), Gaps = 103/637 (16%)
Query: 32 PLLLVRTQHELLIYQAFRHPKGALKLRFKKLKVLFVSDRSKRANEQPGLPRGVRISQMRY 91
PL L ++ H L ++A R +++++ K+L+ +R N+ + + R ++
Sbjct: 861 PLRLKKSMHHFLDHKAEREVIESIEMKRKRLQ----RERGVVENDTQLMRQYSR--RIVP 914
Query: 92 FSNIAGYQGVFLCGPHPAWLFLTSRG-ELRAHPMTIDGPVSTLAPFHNVN-----CPRGF 145
F +I G G ++CG HP +LF R EL A+ GPV PF +N C GF
Sbjct: 915 FDSIGGNAGAYVCGQHPLFLFWDRRTRELEAYRHQTLGPVRGFVPFRIINSGYIYCCEGF 974
Query: 146 LYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEP---- 201
+ F + + PT W R++ L TPHF+ YH ++ +VTS EP
Sbjct: 975 VDF---ASMDTYCRPTGQG----WLTRRIHLGVTPHFVVYHPPARSCFVVTSKKEPFRPQ 1027
Query: 202 --------STDYYKFNGEDKELVTD---------PRDSRFIPPLVSQFHVSLFSPFSWEE 244
+ Y + +G + + T+ P ++ P+ +F + L S W
Sbjct: 1028 RAPFDVQLNIVYDEESGGVQSITTEAPVCNMPPIPPNAGIRVPMADRFEICLMSTTDWA- 1086
Query: 245 IPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRG--YIALGTNYNYSEDVTCRGRILLFDI 302
L E E VL + + + E GL + T + ED+T RGRILL
Sbjct: 1087 -CTDTLLLEENERVLGAQMMEIHCEKDAEGLHTAPVCVVSTAFPLGEDITSRGRILLLST 1145
Query: 303 IEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQL--KDNDLTG 360
+ K KI + +++ GP TA+ + + AVG I +++ ++ L
Sbjct: 1146 M-------CTKKKRKILLFHSEPLNGPATAVVGIRHHIAVAVGGTIKLFRFDWENRKLVV 1198
Query: 361 IAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYA 420
A + Y+ M S +N ++ GD +RS A+ R+ E TLS++ +D
Sbjct: 1199 GALLYAGTYVTRMSSFRNYLIYGDLSRSCAIARFNEENHTLSVLGKDR------------ 1246
Query: 421 GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFM 480
+ H D++ + G + SD ++N+++
Sbjct: 1247 ----------------------------NAVSVVHCDMMYHDRAFGLLCSDDERNLLVMG 1278
Query: 481 YQPEARESNGG--HRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDG 538
Y P +E+ G +++++ G++ C S+ A + +T Y + G
Sbjct: 1279 YTPRVQETEAGSPNKVLESVLSLDGEY---RLSGGCLVKSLRFRSLAGNSSVTLYVTNYG 1335
Query: 539 ALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTY-KGKGYYAGNPSRGIIDGS 597
+GF +P+ E+ R L + H GL PR F +G A ++ S
Sbjct: 1336 EIGFIVPIGEQANRTASWLMRRLQMDLPHNAGLTPRMFLGLSQGSPRTALRAKEMLVSAS 1395
Query: 598 LVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEAL 634
L+ +F L + R K I S L+ + ++ AL
Sbjct: 1396 LLNEFFFLDIHSR----KTIASAAYTQLERVTNVAAL 1428
>gi|71021721|ref|XP_761091.1| hypothetical protein UM04944.1 [Ustilago maydis 521]
gi|46100541|gb|EAK85774.1| hypothetical protein UM04944.1 [Ustilago maydis 521]
Length = 1597
Score = 146 bits (369), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 127/506 (25%), Positives = 225/506 (44%), Gaps = 54/506 (10%)
Query: 126 IDGPVSTLAPFHNVNCPRG--------FLYFNAKSELRISVLPTHLSYDAPWPVRKVPLK 177
+D P L+ +++ P F++ + L + LP L WP V
Sbjct: 1060 LDWPDRDLSSLASISAPLASTGSVNADFVFCDRAGRLYLGRLPAGLDSSTAWPSSVVRTG 1119
Query: 178 CTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLF 237
+ H T T ++ ++ P + ED E + D + + + P VSQ
Sbjct: 1120 REYTNVVAHDPTST--VIAASVSPC--RFMLFDEDGEAIHDEQPNSTLYPSVSQRGSLEL 1175
Query: 238 SPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRI 297
P + E V L+ V+++ T+SG + ++A GT + ED T +G +
Sbjct: 1176 FISQHGSTPVDGYEFEANETVTSLEIVTLDSPSTVSGRKQFVAAGTTTFHGEDRTAKGSV 1235
Query: 298 LLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDND 357
LF+II VV + + ++K++ + + PVTAI H+ G+L++ GQK+Y+ L+ +
Sbjct: 1236 YLFEIISVVSAASELGSDLRLKLVCRDDSRAPVTAISHINGYLISTCGQKLYVRALEKQE 1295
Query: 358 -LTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPE-YRTLSLVARDYKPTQPNS 415
L IAF+D YI S+ VKNL+L+GD R + L +Q + Y+ + L +
Sbjct: 1296 WLISIAFLDCPFYITSIEVVKNLVLLGDCKRGLGLWAFQEDPYKFVELAKAE-------- 1347
Query: 416 KGYYAGNPSRGIIDGSL-VWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDK 474
DG + V FL E++ + GS+ +G S +
Sbjct: 1348 -------------DGCVGVGAFLVRD--EKVSMLSISGSR----------LGGDASMEAS 1382
Query: 475 NVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAP--GARS-RFLT 531
V+ +Y+ + GG +L+ +++F ++ C +SD+ G + R
Sbjct: 1383 AGVIRLYEYAPHLAVGGKKLVLRSEFQTTSEA--VARVECSGRWLSDSELRGRETLRNKV 1440
Query: 532 WYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSR 591
+A +G++ + EK +RL +LQ +V HT LNPR+FR + Y +
Sbjct: 1441 VFAKANGSVESVAAVDEKVGKRLHLLQGQLVRSVMHTAALNPRSFRMVRND-YVPRALVK 1499
Query: 592 GIIDGSLVWKFLQLSLGERLEICKKI 617
G++D L+ +F++LS + LE K +
Sbjct: 1500 GVLDARLLDEFMRLSRPKMLEAVKTL 1525
>gi|401624207|gb|EJS42273.1| cft1p [Saccharomyces arboricola H-6]
Length = 1356
Score = 145 bits (367), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 130/540 (24%), Positives = 228/540 (42%), Gaps = 68/540 (12%)
Query: 89 MRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYF 148
M YF + GY +F+ G P L + + P+ ++ P+ R +
Sbjct: 850 MHYFPDYNGYSVIFVTGSVPYILIKEDDSTPKIFKFA-NIPLVSVTPW----SERSVMCV 904
Query: 149 NAKSELRISVLPT-HLSYDAPWPVRKVPLKC------TPHFLAYHLETKTYCIVTSTAEP 201
+ R+ L ++ Y P++++ + T + YH + + + + P
Sbjct: 905 DDIKNARVYTLTIDNMYYGNKMPLKQIKISNVLDDYKTLQKVVYHEKAELFLVSYCKRVP 964
Query: 202 STDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCL 261
Y+ GED E + D + Q + L +P SW+ I + +FP + V +
Sbjct: 965 ----YEALGEDGERIIG-YDEKVPHAEGFQGGILLINPKSWKVIDKIDFPNNSV--VNEM 1017
Query: 262 KNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMI 321
++ ++ R YI G +ED G ++D+ EVVPEPG+P T K+K I
Sbjct: 1018 RSSMIQVNSKTKKKREYIIAGVANATTEDTPPTGAFHIYDVTEVVPEPGKPDTNYKLKEI 1077
Query: 322 YAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLK-DNDLTGIAFIDTEVYIASMVSVKNLI 380
+ +E G V+ +C V+G + + QK+ + ++ DN + +AF+D V++ S NL+
Sbjct: 1078 FQEEVSGTVSTVCEVSGRFMISQSQKVLVRDIQEDNSVIPVAFLDIPVFVTDSKSFGNLL 1137
Query: 381 LVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLS 440
L+GD + + + E P R I+ G + KF +S
Sbjct: 1138 LIGDAMQGFQFIGFDAE-------------------------PYRMILLGRSISKFQTMS 1172
Query: 441 LGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDF 500
L + L M F +D D+NV + Y P+ S G RL+ + F
Sbjct: 1173 L---------------EFLVNGGDMYFSATDADRNVHVLKYAPDEPNSLSGQRLVHCSSF 1217
Query: 501 HLGQHVNTFFKIRCKPSSI--SDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQ 558
L +N+ + K S P F +DG++ +PL E+ YRRL ++Q
Sbjct: 1218 TL-HSINSCMLLLPKNEEFGSSQVPS----FQNVGGQVDGSIFKIVPLSEETYRRLYVIQ 1272
Query: 559 NVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 618
++ GGLNPR R Y G+ R ++D +++ +F +L++ R +K G
Sbjct: 1273 QQIIDREIQLGGLNPRMER-LANDFYQMGHSMRPMLDFNVIRRFSELAIDRRKNTAQKAG 1331
>gi|343425828|emb|CBQ69361.1| related to cleavage and polyadenylation specificity factor, 160 kDa
subunit [Sporisorium reilianum SRZ2]
Length = 1567
Score = 145 bits (366), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 131/503 (26%), Positives = 220/503 (43%), Gaps = 50/503 (9%)
Query: 126 IDGPVSTLAPFHNVNCPRG----FLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPH 181
+D P L ++ PR F Y + +L ++ P L + W V +
Sbjct: 1035 LDWPEGDLCCIASIYTPRANDADFAYCDRAGQLWLARAPHGLYAETSWMSSVV--RTGRE 1092
Query: 182 FLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFS 241
+ T+ +V ++ +P + F+ ED E + DP +P +Q
Sbjct: 1093 YTRVVAHDATHTVVAASIQPCR-FVLFD-EDGEPIADPGADEALPSTTAQRGALELFISE 1150
Query: 242 WEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFD 301
+ E V L+ V+++ T SG + ++A GT + ED T +G + LF+
Sbjct: 1151 DRTTAADGYEFEANETVTALEIVTLDAPSTASGRKQFVAAGTTTFHGEDRTAKGCVYLFE 1210
Query: 302 IIEVVPEPGQPLTKN-KIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDND-LT 359
+IEVV + ++ ++K++ + +GPVTAI + GFLV+ GQK+Y+ L+ + L
Sbjct: 1211 VIEVVASARYQVGRDLRLKLVCRDDSRGPVTAIAQLNGFLVSTCGQKLYVRALEKEEWLI 1270
Query: 360 GIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPE-YRTLSLVARDYKPTQPNSKGY 418
IAF+D +Y+ + VKN +L+ D +S+ LL +Q E YR + L RD Y
Sbjct: 1271 SIAFLDCPLYVTGIRVVKNFVLLSDARKSLWLLAFQEEPYRFVDL-GRDIHDHHATLGQY 1329
Query: 419 YAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVV- 477
N ERL + G+ L ++ G +D VV
Sbjct: 1330 LVYN--------------------ERLALVSTSGAA----LGGSTAFG-----RDAGVVR 1360
Query: 478 LFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPG---ARSRFLTWYA 534
L+ Y P +N RL+ +T+F R + S S+ G R++ + A
Sbjct: 1361 LYEYAPHVASAN--TRLVLRTEFQTASPATASVACRGRWLSDSELRGREHGRNKLV--LA 1416
Query: 535 SLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGII 594
+GAL ++ +RL +LQ +V HT LNPRAFR + + + +G++
Sbjct: 1417 KANGALETLAAADDRVAKRLHVLQGQLVRSVLHTAALNPRAFRAVRND-FVSRALGKGVL 1475
Query: 595 DGSLVWKFLQLSLGERLEICKKI 617
D L+ F+ LS + LE K +
Sbjct: 1476 DARLLDSFVYLSRPKMLEAVKTL 1498
>gi|45184764|ref|NP_982482.1| AAL060Wp [Ashbya gossypii ATCC 10895]
gi|74695871|sp|Q75EY8.1|CFT1_ASHGO RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two protein
1
gi|44980110|gb|AAS50306.1| AAL060Wp [Ashbya gossypii ATCC 10895]
gi|374105681|gb|AEY94592.1| FAAL060Wp [Ashbya gossypii FDAG1]
Length = 1305
Score = 145 bits (365), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 122/465 (26%), Positives = 209/465 (44%), Gaps = 65/465 (13%)
Query: 179 TPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFS 238
T + + YH T+T+ + + S DY + ED+ LV D I + Q + L S
Sbjct: 885 TLNNITYHERTQTFIV---SYAKSIDYVALSEEDEPLVGYNPDK--IHAMGFQSGIILLS 939
Query: 239 PFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRIL 298
P SWE I + + + + ++ + ++ R Y+ +G Y ED+ G
Sbjct: 940 PKSWEIIDKIEYGKNSL--INDMRTMMIQLNSNTKRRREYLVVGNTYVRDEDIGGTGSFY 997
Query: 299 LFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLK-DND 357
L+DI EVVPEPG+P T K K I+ ++ +G V+ +C ++G + + K + ++ DN
Sbjct: 998 LYDITEVVPEPGKPDTNYKFKDIFQEDIRGTVSTVCEISGRFMISQSSKAMVRDIQEDNS 1057
Query: 358 LTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPE-YRTLSLVARDYKPTQPNSK 416
+ +AF+D V+I S NL+++GD + + L + E YR L+L
Sbjct: 1058 VVPVAFLDMPVFITDAKSFGNLMIIGDSMQGFSFLGFDAEPYRMLTL------------- 1104
Query: 417 GYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNV 476
G V K + C + + D+ F+++D++ +
Sbjct: 1105 -------------GKSVSKLETM--------CVEFLVNNGDVY-------FLVTDRNNLM 1136
Query: 477 VLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWY--- 533
+ Y P+ S G RL+ T F+L NT ++ K +D G SR Y
Sbjct: 1137 HVLKYAPDEPNSLSGQRLVHCTSFNL-HSTNTCMRLIKK----NDEFGKVSRGFGIYMPS 1191
Query: 534 -----ASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGN 588
+ DG + +PL E +YR L ++Q ++ GLNPR R + Y G+
Sbjct: 1192 FQCIGSQADGTIFKVVPLSEASYRSLYLIQQQLIDKEVQLCGLNPRMER-LENPFYQMGH 1250
Query: 589 PSRGIIDGSLVWKFLQLSLGERLEICKKIGSK-HNDILDELYDIE 632
R ++D +++ +F LS+ R+ + K G + H +I +L DIE
Sbjct: 1251 ILRPMLDFTVLKRFATLSIPTRMTMASKAGRQAHAEIWRDLIDIE 1295
>gi|363750592|ref|XP_003645513.1| hypothetical protein Ecym_3197 [Eremothecium cymbalariae DBVPG#7215]
gi|356889147|gb|AET38696.1| Hypothetical protein Ecym_3197 [Eremothecium cymbalariae DBVPG#7215]
Length = 1318
Score = 144 bits (362), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 125/490 (25%), Positives = 213/490 (43%), Gaps = 69/490 (14%)
Query: 159 LPTHLSYDAPWPVRKVPLKC------TPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGED 212
L H Y P+RK+ L+ T + + YH T+ + + S S DY + E
Sbjct: 871 LDNHRYYGNKMPLRKIFLEDVLEDFETFNNITYHERTQNFIVSFS---KSIDYDALSEEG 927
Query: 213 KELV----TDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEY 268
+ +V + P F Q + L +P +W I + L + ++ + ++
Sbjct: 928 ERIVGYEASKPHAKGF------QSGILLINPKTWNIIDR--IELGPNSLISDMRTMMIQL 979
Query: 269 EGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKG 328
R Y+ +G Y ED++ G L+DI EVVPEPG+P T K K I+ ++ +G
Sbjct: 980 NSNTKRKREYLVVGNTYVRDEDISGTGSFYLYDITEVVPEPGKPDTNYKFKEIFQEDIRG 1039
Query: 329 PVTAICHVAGFLVTAVGQKIYIWQLK-DNDLTGIAFIDTEVYIASMVSVKNLILVGDYAR 387
V+ +C ++G + + K + ++ DN + +AF+D V+I S NL+++GD
Sbjct: 1040 TVSTVCEISGRFMISQSSKAMVRDIQEDNSVVPVAFLDMPVFITDAKSFGNLMIIGDAMH 1099
Query: 388 SIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEI 447
+ + E P R I G V K +SL
Sbjct: 1100 GFTFVGFDAE-------------------------PYRMITLGKSVTKLETMSL------ 1128
Query: 448 CKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVN 507
+ L M F+I+D+ + + + Y P+ S G RL+ T F+L +N
Sbjct: 1129 ---------EFLVNNGDMYFIITDRSQVMHVLKYAPDEPNSLSGQRLVYCTSFNL-HSIN 1178
Query: 508 TFFKIRCKPSSISDA----PGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVT 563
T ++ K + D S F +DG++ +PL E +YRRL ++Q ++
Sbjct: 1179 TCMRLIQKNNEFVDLRRNYGSHMSTFQCIGCHIDGSIFKVVPLTESSYRRLYLVQQQIID 1238
Query: 564 HTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK-HN 622
GLNPR R + Y G+ R ++D +++ KF LS+ +R + K G + H
Sbjct: 1239 KEVQLCGLNPRMER-LQNPYYQLGHLLRPMLDFTILKKFSTLSISKRRSMASKAGHQAHT 1297
Query: 623 DILDELYDIE 632
++ +L DIE
Sbjct: 1298 EVWRDLIDIE 1307
>gi|403218521|emb|CCK73011.1| hypothetical protein KNAG_0M01580 [Kazachstania naganishii CBS 8797]
Length = 1345
Score = 141 bits (355), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 137/560 (24%), Positives = 245/560 (43%), Gaps = 74/560 (13%)
Query: 89 MRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYF 148
M Y + GY +F+ G P L R + P+ ++ + N + +
Sbjct: 835 MHYVPDYNGYSVIFVTGKVPYLLIKEDDSVPRVFQFA-NIPLVSMTTWGN----KSIMCV 889
Query: 149 NAKSELRISVLP-THLSYDAPWPVRKVPLKC------TPHFLAYHLETKTYCIVTSTAEP 201
+ R+ L + + Y P+++V + T +AYH TKTY IV+ + E
Sbjct: 890 DDIKNARVYTLDCSDVYYGNKIPLKRVTINSVMENYMTLTNVAYHERTKTY-IVSYSREI 948
Query: 202 STDYYKFNGEDKE-----LVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWE 256
+ GED E +V D ++ + Q + L +P +W I + +F
Sbjct: 949 D---FVAKGEDGEVVPVGIVDDAPHAKSV-----QSGLLLINPTTWSVIDKIDFEPDSL- 999
Query: 257 HVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKN 316
V +K++ ++ Y+ +GT++ +ED+ G ++DI EVVPEPG+P T
Sbjct: 1000 -VNDIKSMFIQLNSRTKRKIEYVVVGTSFVGTEDLPATGSFQMYDIAEVVPEPGKPDTNY 1058
Query: 317 KIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLK-DNDLTGIAFIDTEVYIASMVS 375
KIK + +E + VT++C ++G V + QK+ + + DN + +AF+D ++ A M S
Sbjct: 1059 KIKQFFKEELRSAVTSVCDISGRFVISQSQKLMVRDAQEDNSVVPVAFLDIPLFTADMKS 1118
Query: 376 VKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWK 435
NL+++GD + I L+ + E P R I G V K
Sbjct: 1119 FGNLLIIGDAMQGIQLVGFDAE-------------------------PYRMIPLGRSVLK 1153
Query: 436 FLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLI 495
F LSL + L + F + D++ + + Y P+ S G RLI
Sbjct: 1154 FETLSL---------------EFLVNGGDLYFTLIDRNDILHVLKYAPDEPNSLSGQRLI 1198
Query: 496 KKTDFHLGQHVNTFFKIRCKPSSISDAP--GARSRFLTWYASLDGALGFFLPLPEKNYRR 553
+ F++ + ++ K D P A + DG+L +P+PE YRR
Sbjct: 1199 HCSSFNM-YSTTSCTRLIPKNELFVDGPLNPAIQSYQVIGGQADGSLFKVMPVPETVYRR 1257
Query: 554 LLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEI 613
L ++Q ++ + G+NP+ R Y + R ++D ++V +F +S+ +R +
Sbjct: 1258 LYVVQQQIIDKETPLAGINPKMER-LSNDYYQTSHLLRPMLDYNVVKQFCAMSIPKRTTL 1316
Query: 614 CKKIGSK-HNDILDELYDIE 632
K+G + H DI ++ ++E
Sbjct: 1317 AHKLGKRAHFDIWRDVINLE 1336
>gi|325094074|gb|EGC47384.1| cleavage factor two protein 1 [Ajellomyces capsulatus H88]
Length = 1377
Score = 141 bits (355), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 110/378 (29%), Positives = 186/378 (49%), Gaps = 26/378 (6%)
Query: 14 ETIVQELLTVSLGLHGNR-PLLLVRTQH-ELLIYQAFRHPKGALK----LRFKKLKVLFV 67
ETI ELL LG +R P L++R+ + +L +Y+ + + K LRF K+
Sbjct: 813 ETIT-ELLVADLGDSVSRSPYLILRSSNSDLTLYEPYHYTSSTEKQFSDLRFVKIANHHF 871
Query: 68 SDRSKRANEQPGLPRGVRISQ-MRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTI 126
+N + +S+ +R ++ GY+ VF+ G P ++ +S H M +
Sbjct: 872 PKFHSESNVEKHPANCTALSKPLRVLGDVCGYRTVFMPGNSPCFIIKSSTS--IPHVMNL 929
Query: 127 DG-PVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAY 185
G V +L+ F+ C +GF+Y + + +R+ P + +D W RK+ L + Y
Sbjct: 930 RGKTVHSLSSFNIPACEKGFVYVDTDNVVRMCRFPRNTHFDGSWAARKIGLGEQVDAVEY 989
Query: 186 HLETKTYCIVTSTAEPSTDYYKFN-GEDKELVTDPRDS--RFIPPLVSQFHVSLFSPFSW 242
++TY I T+ FN ED E+ + R+ F+P + + V L +P +W
Sbjct: 990 SSSSETYVIGTNQK------VDFNLPEDDEIHPEWRNEVISFLPQ-IDKGSVKLLTPRTW 1042
Query: 243 EEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDI 302
I N L E ++C+K +++E + I +GT ED+ RG I +F++
Sbjct: 1043 SIIDSYN--LRNAERIMCVKCLNLEVSEITHERKDTIVVGTALTKGEDIAARGCIYIFEV 1100
Query: 303 IEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA--GFLVTAVGQKIYIWQLK-DNDLT 359
I+VVPE +P T K+K+I +E KG VT++ + GFL+ A GQK + LK D L
Sbjct: 1101 IKVVPEVDRPETNRKLKLIAKEEVKGAVTSLSGIGGQGFLIAAQGQKCIVRGLKEDGSLL 1160
Query: 360 GIAFIDTEVYIASMVSVK 377
+AF+D + Y+ + +K
Sbjct: 1161 PVAFMDMQCYVNVLKELK 1178
Score = 73.2 bits (178), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 46/157 (29%), Positives = 73/157 (46%), Gaps = 17/157 (10%)
Query: 488 SNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFL----------TWYASLD 537
S+ G RL+ ++ F G +T + +S S P A + S
Sbjct: 1220 SSKGDRLLHRSTFQTGHFASTMTLLPRTATSSSQGPDADPDMMDLDSSGPLHHVLVTSET 1279
Query: 538 GALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGS 597
G++ P+ E +YRRL LQ+ + H GLNPRAFR + G RG++DG
Sbjct: 1280 GSIALITPVSETSYRRLSALQSQLANTLEHPCGLNPRAFRAVESDGIGG----RGMVDGD 1335
Query: 598 LVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEAL 634
LV ++L L + EI ++G+ D+ + D+EA+
Sbjct: 1336 LVKRWLDLGTQRKAEIANRVGA---DVWEIRADLEAI 1369
>gi|226290902|gb|EEH46330.1| cleavage and polyadenylation specificity factor subunit A
[Paracoccidioides brasiliensis Pb18]
Length = 1343
Score = 139 bits (350), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 108/378 (28%), Positives = 186/378 (49%), Gaps = 31/378 (8%)
Query: 17 VQELLTVSLGLHGNR-PLLLVRTQ-HELLIYQAFRHPKGALK----LRFKKL------KV 64
+ E+L LG +R P L++R+ +EL++Y+ + + K LRF K+ K
Sbjct: 818 LTEILVADLGDSVSRTPYLILRSNSNELILYEPYHTVQSTEKRLSDLRFLKIANHHFPKF 877
Query: 65 LFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPM 124
L S+ ++ L R +R ++ GY+ VF+ G P F+ H M
Sbjct: 878 LPESNLGNLSDSDRQLAR-----PLRALGDVCGYRTVFMPGNSPC--FIIKSATSIPHVM 930
Query: 125 TIDG-PVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFL 183
+ G V +L+ F+ C +GF+Y + + +R+ P + +D W RK+ L +
Sbjct: 931 NLRGKTVHSLSSFNIPACEKGFVYVDTDNVVRMCRFPRNTHFDGSWAARKIGLGEQVDSV 990
Query: 184 AYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRF-IPPLVSQFHVSLFSPFSW 242
Y ++TY + TS +K ED E+ + R+ P + + V L +P +W
Sbjct: 991 EYSSSSETYVLGTSQKAD----FKLP-EDDEIHPEWRNEVISFFPQIDKGSVKLLNPRTW 1045
Query: 243 EEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDI 302
I ++ L E V+C+K +++E + IA+GT ED+ RG I +F++
Sbjct: 1046 SII--DSYQLRTAERVMCVKCLNLEASEITHERKEMIAVGTALTRGEDIAARGCIYVFEV 1103
Query: 303 IEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA--GFLVTAVGQKIYIWQLK-DNDLT 359
I+VVPE +P T K+K+I +E KG +T++ + GFL+ A GQK + LK D L
Sbjct: 1104 IKVVPEVDRPETNRKLKLIAKEEVKGAITSLSGIGGQGFLIAAQGQKCIVRGLKEDGSLL 1163
Query: 360 GIAFIDTEVYIASMVSVK 377
+AF+D + Y++ + +K
Sbjct: 1164 PVAFMDMQCYVSVLKELK 1181
Score = 79.0 bits (193), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 50/157 (31%), Positives = 76/157 (48%), Gaps = 18/157 (11%)
Query: 488 SNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFL----------TWYASLD 537
S G RL+ ++ FH GQ +T + + S +S P A + + S
Sbjct: 1187 SAKGDRLLHRSTFHTGQFASTL-TLLPRTSVLSQGPEAEANAMDLDSSGPLHQVLVTSET 1245
Query: 538 GALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGS 597
G++ P+ E YRRL LQ+ M+ H GLNPRAFR + G RG++DG
Sbjct: 1246 GSIALITPVSEMAYRRLSALQSQMINTLEHPCGLNPRAFRAVESDGIGG----RGMVDGD 1301
Query: 598 LVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEAL 634
LV K+L L + EI ++G+ D+ + D+EA+
Sbjct: 1302 LVQKWLDLGTQRKAEIASRVGA---DVWEIRADLEAI 1335
>gi|307107849|gb|EFN56091.1| hypothetical protein CHLNCDRAFT_145620 [Chlorella variabilis]
Length = 1626
Score = 139 bits (349), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 135/536 (25%), Positives = 216/536 (40%), Gaps = 122/536 (22%)
Query: 32 PLLLVRT-QHELLIYQAFRHPKGALKL---------RFKKLKVLFVSDRSKRANEQPGLP 81
PLLL T H+LL YQAF G+ RF++L++ Q
Sbjct: 1032 PLLLALTADHQLLAYQAFSASPGSGGTRGSSGSGTPRFRRLRLDLPPLLPPAGGPQ---- 1087
Query: 82 RGVRISQMRYFSNI---AGYQGVFLCGPHPAWLFLTSRGELRAHP---------MTIDGP 129
+R+ ++ F + A Y GVF+ G HP WL + SRG L HP
Sbjct: 1088 --LRLRRLHCFEGLGEEAPYSGVFVAGQHPHWL-VASRGGLLPHPHFLPQPAGPGAAAVG 1144
Query: 130 VSTLAPFHNVNCPRGFLYFN--AKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHL 187
+ PFHNVNCP GF+ A+S ++IS LP DAPWP ++V +K TP +A++
Sbjct: 1145 AAGFTPFHNVNCPHGFIVATSGARSGIQISQLPPRTRLDAPWPRQRVSIKGTPLKVAHYA 1204
Query: 188 ETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQ 247
E + +++S + G + V + W+ + +
Sbjct: 1205 EADMFAVLSSRQGRARGRGVMEGHEVRWV--------------------WPGGGWQGVGR 1244
Query: 248 TNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVP 307
E L + V ++ T + + +A+G ED C GR+LLF++
Sbjct: 1245 HQR--RPGERALSVGAVRLKDHATGATVP-LLAVGAALPAGEDYPCGGRLLLFEVTRGD- 1300
Query: 308 EPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKI------------------- 348
G + ++IY +E KGPVT++ + G+L+ A G +I
Sbjct: 1301 GGGGGGGQWAGRLIYTREFKGPVTSVSGLEGYLLLASGNRIETCSLSSTTITSTADDGTV 1360
Query: 349 ---YIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVA 405
W+++ + AF D V + S+ VKN +L+GD S+ +RY+ E R LSL++
Sbjct: 1361 AATTTWKVQRS-----AFYDGPVLLTSLNVVKNFVLLGDCQHSVQFVRYKDEGRQLSLLS 1415
Query: 406 RDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSM 465
+D+ + + +I+G SS+
Sbjct: 1416 KDFNRADTAATQF--------LING--------------------------------SSL 1435
Query: 466 GFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISD 521
D + L Y P S G RL+ FH+G+ + ++R PSS D
Sbjct: 1436 HLASCDSAGTLRLLSYAPSHPASWKGQRLVAWGSFHVGEAASCMRRLRLHPSSPED 1491
>gi|443894082|dbj|GAC71432.1| mRNA cleavage and polyadenylation factor II complex, subunit CFT1
[Pseudozyma antarctica T-34]
Length = 1543
Score = 138 bits (347), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 116/394 (29%), Positives = 190/394 (48%), Gaps = 40/394 (10%)
Query: 250 FPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEP 309
F E V L+ VS++ + +G R +IA GT+ + ED T +G + LF++IEVV
Sbjct: 1141 FEFEANEIVTALELVSLDASSSPTGRRQFIAAGTSTFHGEDRTSKGSVYLFEVIEVVSGK 1200
Query: 310 GQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDND-LTGIAFIDTEV 368
Q ++K++ + + PVTAI + GFL++ GQK+Y+ L+ + L +AF+D
Sbjct: 1201 YQLGRDLRLKLVCRDDARAPVTAIAELNGFLLSTCGQKLYVRALEKEEWLISVAFLDGPF 1260
Query: 369 YIASMVSVKNLILVGDYARSIALLRYQPE-YRTLSLVARDYKPTQPNSKGYYAGNPSRGI 427
Y+ S+ +KN +LV D +S+ LL +Q E YR + L R I
Sbjct: 1261 YMTSLRVLKNFVLVSDAKKSLCLLAFQEEPYRFVDL--------------------GREI 1300
Query: 428 ID-GSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEAR 486
D + + +FL + +RL + I +S G + L+ Y P
Sbjct: 1301 NDHNASMAQFLVYN--DRLSLVSTSDVPLGGISGFGASAGV--------IRLYEYAPHVA 1350
Query: 487 ESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPG---ARSRFLTWYASLDGALGFF 543
+ GGHRL+ +++F R + S S+ G RS+ + A +GAL
Sbjct: 1351 TTLGGHRLLLRSEFQTPAAAVGSTVCRGRWLSDSELRGREEGRSKLV--LAKANGALDSL 1408
Query: 544 LPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFL 603
L +K +RL +LQ +V HT LNPRAFR + + + ++GI+D L+ +F+
Sbjct: 1409 SALDDKVAKRLHLLQGQLVRSVQHTAALNPRAFRAVRND-FVPRSLAKGILDARLLDRFV 1467
Query: 604 QLSLGERLEICKKIGSKHNDILDELYDIEALSSH 637
LS + LE + + S D LD++ + S+H
Sbjct: 1468 WLSRPKMLEAVRTL-SGLFDGLDQIKKRKRDSNH 1500
>gi|366994686|ref|XP_003677107.1| hypothetical protein NCAS_0F02680 [Naumovozyma castellii CBS 4309]
gi|342302975|emb|CCC70752.1| hypothetical protein NCAS_0F02680 [Naumovozyma castellii CBS 4309]
Length = 1340
Score = 137 bits (345), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 125/561 (22%), Positives = 237/561 (42%), Gaps = 79/561 (14%)
Query: 89 MRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNC------- 141
M Y + +GY +FL G P + M D + F N++
Sbjct: 832 MHYIPDYSGYSVIFLTGSVPYII------------MREDDSSPKIFRFANLSIVSLAQWG 879
Query: 142 PRGFLYFNAKSELRISVLPTHLSY-DAPWPVRKVPLK------CTPHFLAYHLETKTYCI 194
+ + R+ L SY P++K+ + T + YH +++ + +
Sbjct: 880 KNSVMAVDDIKNARVYSLDNKDSYYGNSLPLKKIKISDSLEDFMTLTKITYHEKSQLFLV 939
Query: 195 VTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVS-QFHVSLFSPFSWEEIPQTNFPLH 253
+ Y+ GED E++ D +P S Q + L +P +W I + +F ++
Sbjct: 940 SYAKERE----YEALGEDGEIIVGSNDQ--VPHAKSFQSGILLINPRTWNVIDRVDFEVN 993
Query: 254 EWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPL 313
+ ++++ ++ + R YI G + +ED+ G ++D+ EV+PEPG+P
Sbjct: 994 SI--ISDMRSMLIQLDSKSRKKREYIVAGITFIGTEDLPSTGAFHIYDLTEVIPEPGKPD 1051
Query: 314 TKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLK-DNDLTGIAFIDTEVYIAS 372
T K+K I+ ++ +G V ++C ++G + QKI + ++ DN + +AF DT ++++
Sbjct: 1052 TNFKLKEIFKEDIRGSVNSVCDISGRFLINQSQKIMVRDVQEDNSVVPVAFYDTPIFVSD 1111
Query: 373 MVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSL 432
S N +++GD + L + E P R I G
Sbjct: 1112 AKSFGNFLILGDSMQGFQFLGFDAE-------------------------PYRMIPLGRS 1146
Query: 433 VWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGH 492
V F +S+ + L + F I+D++ + + Y P+ + G
Sbjct: 1147 VSSFETVSV---------------EFLINAGEINFAITDREDILHVLKYAPDEPNTLSGQ 1191
Query: 493 RLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYR 552
+L+ + F+L NT + + + A +F +DG + +PL E YR
Sbjct: 1192 KLVHCSSFNLYSS-NTCMLMLPRNDEFETSDKAPPKFQAIGGQVDGGIFKIIPLKEDTYR 1250
Query: 553 RLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLE 612
RL ++Q ++ GGLNPR R Y + R +ID +++ +F +LS+ R
Sbjct: 1251 RLYVVQQQIIDKEVQLGGLNPRMER-LDNDFYQLTHVMRPMIDFNIIRRFSELSIERRTH 1309
Query: 613 ICKKIGSK-HNDILDELYDIE 632
+K G + H DI ++ ++E
Sbjct: 1310 FAQKAGRRAHFDIWRDIINVE 1330
>gi|328864890|gb|EGG13276.1| CPSF domain-containing protein [Dictyostelium fasciculatum]
Length = 1627
Score = 137 bits (344), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 91/338 (26%), Positives = 159/338 (47%), Gaps = 58/338 (17%)
Query: 304 EVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAF 363
++ E QP + ++ ++Y K+QKGPVT+I + G L+ ++G K+ + L G+AF
Sbjct: 1325 KIETEELQPQLQKRLNLLYEKDQKGPVTSIAGLNGLLIMSIGPKMIVNNFSSGSLIGLAF 1384
Query: 364 IDTEVYIASMVSVKNLILVGDYARSIALLRYQP---EYRTLSLVARDYKPTQPNSKGYYA 420
DT+++I S+ +VKN ILVGD +SI+ + + + + + L+ +DY+ S +
Sbjct: 1385 YDTQIFIVSLNTVKNYILVGDMFKSISFFKLKVCIIQKKNIILLGKDYEEVSTYSSDF-- 1442
Query: 421 GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFM 480
I+DE + ++SD ++N+ +F
Sbjct: 1443 -------------------------------------IVDE-KKLSMVLSDANRNIRMFS 1464
Query: 481 YQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSS-----ISDAPGARSRFLTWYAS 535
+ P ES G L+ K+ FH+G+ N F +I K ++ S + + L +Y +
Sbjct: 1465 FDPSDPESRAGQMLLAKSSFHIGELNNKFVRIPMKNTNYDNNSSSSSIIVNDKHLLFYGT 1524
Query: 536 LDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNP-----S 590
L G + +P+ K + +L + H T GLNPR FR G++ N +
Sbjct: 1525 LGGGINLLMPI-NKRFHEILHALETKLMHRGQTAGLNPRGFRY----GHHVNNTLGHLHN 1579
Query: 591 RGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEL 628
+ ++DG L+ KF LS + ++ IGS ILD L
Sbjct: 1580 QYVVDGDLLTKFQSLSPDDAKQLATSIGSTTPIILDLL 1617
Score = 110 bits (275), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 73/233 (31%), Positives = 119/233 (51%), Gaps = 30/233 (12%)
Query: 92 FSNIAGYQGVFLCG-PHPAWLFLTSRGELRAHPM---------------TIDGPVSTLAP 135
FSNI +G+F+ G P W+F + + R HPM + P++T
Sbjct: 1023 FSNIGNKRGIFVSGVSTPIWIF-SEKNFPRIHPMKQQQQTTSSSSSSSSSSKRPITTFTT 1081
Query: 136 FHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIV 195
FHN+NC GF+YF+ L I LP +Y+ WP+RK+ ++ T H ++YH K Y +V
Sbjct: 1082 FHNINCKHGFIYFDHTGMLCICRLPDGTNYENEWPIRKLAIRMTCHKISYHPVQKCYVLV 1141
Query: 196 TSTAE-PSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPF-SWEEIPQTNFPLH 253
S + P +D + +++EL+ P L ++ + L P +W I +F L
Sbjct: 1142 LSYPQAPQSDEDEQEEQERELLKKPL------VLEEKYQLKLIDPANNWNII--DSFSLA 1193
Query: 254 EWEHVLCLKNVSMEY---EGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDII 303
E E VLC K + + + + L+ ++ +GT Y + ED C+GRIL+F+I+
Sbjct: 1194 EKETVLCSKIIYLRHADESDIIPKLKPFVIVGTAYTHGEDTVCKGRILIFEIV 1246
>gi|358056450|dbj|GAA97624.1| hypothetical protein E5Q_04302 [Mixia osmundae IAM 14324]
Length = 1305
Score = 136 bits (342), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 130/546 (23%), Positives = 220/546 (40%), Gaps = 60/546 (10%)
Query: 92 FSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGF-LYFNA 150
F + G GVF+ G P +L G R + P + F + P L A
Sbjct: 811 FISTTGRSGVFITGSAPFYLLTDRAGIARLY----RAPYGRASAFGAFDPPSSTPLLVLA 866
Query: 151 KSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNG 210
+ L S PV V AYH + T P + F+
Sbjct: 867 DGAMHTYDLSDQASLARELPVTHVATSKCFTSTAYHDSSHTLVAARVVNAP---FELFDD 923
Query: 211 EDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEG 270
E + P + I P V + + L P SW+ I FP + E +L L ++
Sbjct: 924 EGAPVYRAPSED-MISPTVFRSCLELLVPGSWDCIDGHEFP--QNESILQLICATLPSAT 980
Query: 271 TLSGLRGYIALGTNYNYSEDVTCRGRILLFDI--IEVVPEPGQPLTKN-KIKMIYAKEQK 327
SG ++ T N ED+ RG + +F I E Q ++ K+ +++A + +
Sbjct: 981 DPSGRARFVIASTCNNRGEDLQTRGGLYVFRISTTESTAASDQAQARSAKLSLVHADDLR 1040
Query: 328 GPVTAICHVAGFLVTAVGQKIYIWQL-KDNDLTGIAFIDTEVYIASMVSVKNLILVGDYA 386
PV AIC V G ++ ++GQK++I D L + F+D + +++M S+KNL+++GD
Sbjct: 1041 HPVGAICEVNGHIIHSLGQKVFIKAFDSDQRLITVGFLDVGLDVSAMRSIKNLLIIGDSL 1100
Query: 387 RSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLE 446
+ +Q + L L+ ++ + T V+
Sbjct: 1101 TGTYFVAFQEDPFKLVLLGKEARKTD--------------------VYCV---------- 1130
Query: 447 ICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHV 506
D L + + +G + + + Y P ES G RL+ +T++HLG+ +
Sbjct: 1131 ----------DFLVQENRLGLLSVSRKGLLRQLEYNPGNAESRAGERLLDRTEYHLGKQI 1180
Query: 507 NTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTS 566
+ S+ D + DG+L + P+ E YRRL +L+ +
Sbjct: 1181 IDSLSFAKRLSTDEDLRQSG----VMLVGADGSLTWVTPVREVVYRRLALLERQLHRQLP 1236
Query: 567 HTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILD 626
H GLNPRAFRT + YY+ +RG++DG L+ + L + + I S + +
Sbjct: 1237 HFAGLNPRAFRTARND-YYSRPLARGMLDGDLLAIYANLHASRQQSLASHINSDPDTLSV 1295
Query: 627 ELYDIE 632
L ++E
Sbjct: 1296 NLGNLE 1301
>gi|320583269|gb|EFW97484.1| RNA-binding subunit of the mRNA cleavage and polyadenylation factor
[Ogataea parapolymorpha DL-1]
Length = 1309
Score = 135 bits (341), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 139/614 (22%), Positives = 257/614 (41%), Gaps = 65/614 (10%)
Query: 16 IVQELLTVSLGLHGNRPLLLVRTQH--ELLIYQAFRHPKGALKLRFKKLKVLFVSDRSKR 73
I+++++ LG + LV E+LIY+ F P ++ +K +K+ +
Sbjct: 735 IIKQIMFTKLGNSSSSKDYLVALTFGGEVLIYETFFDP---IERTYKLMKINEMCQFPIV 791
Query: 74 ANEQPGLPRGVRISQMRYF---SNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPV 130
+I RY N GY+ V + G A++ L +
Sbjct: 792 GAPDNSYAHATKIE--RYLISVDNFQGYKAVLVTGA-SAFVILKEYNSIPRMLQFTKRSS 848
Query: 131 STLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPL-KCTPHFLAYHLET 189
A ++ CP G + + RI L + +Y P+ K + T + + YH +
Sbjct: 849 LYFAEYNTDRCPNGVISIDETKACRICQLDSSYTYSNRLPIAKYKIGDKTINKIRYHSLS 908
Query: 190 KTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTN 249
TY I T P Y ED E + RD R + + V L SP +W I
Sbjct: 909 NTYIISTLEEGP----YNPVDEDGEPLPGLRDDRKLKSTSLKGTVHLVSPANWTIID--T 962
Query: 250 FPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEP 309
L + E+V ++ + ++ T++ + + +GT +ED+ G ++++I++VPEP
Sbjct: 963 IELEDNEYVTSIEVIELKVSETIAT-KTVVLIGTARCRNEDLATHGSWKIYEVIDIVPEP 1021
Query: 310 GQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQL-KDNDLTGIAFIDTEV 368
G+P KN++KMI ++ +GPV +IC+V+G GQ++ + L KD+++ +AF DT +
Sbjct: 1022 GRPEAKNRLKMITSETARGPVLSICNVSGRFAIVQGQRMLVRTLQKDDNVAPVAFTDTSI 1081
Query: 369 YIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGII 428
Y + + KNL+L+GD ++++SL D P
Sbjct: 1082 YSKEVKTFKNLVLIGD------------SFQSVSLYGFDAAP------------------ 1111
Query: 429 DGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARES 488
++ L E+ + + D L ++ +++D+D L Y P S
Sbjct: 1112 -----YRMLHFGKDEQ-----NVELRAADFLVHDGNLHLLVADEDSVFHLLQYDPYDGNS 1161
Query: 489 NGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWY----ASLDGALGFFL 544
G +L++++ + S S Y +++DG+ +
Sbjct: 1162 MKGLKLLRRSLLRSNALTTKMISVARDRSLFSMVSTLNHEDDLGYEIIGSNIDGSFYKVM 1221
Query: 545 PLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQ 604
P+ E YRRL +QN + H GLNP++ G + R I+ ++ +F+
Sbjct: 1222 PVNEYQYRRLYSIQNYLYDKELHWLGLNPKS-NAIGGLTELMPSIKRPFIELNMFHRFIG 1280
Query: 605 LSLGERLEICKKIG 618
+ + +I +K+G
Sbjct: 1281 FNNDRKKQIMQKLG 1294
>gi|367001853|ref|XP_003685661.1| hypothetical protein TPHA_0E01320 [Tetrapisispora phaffii CBS 4417]
gi|357523960|emb|CCE63227.1| hypothetical protein TPHA_0E01320 [Tetrapisispora phaffii CBS 4417]
Length = 1357
Score = 134 bits (336), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 125/546 (22%), Positives = 227/546 (41%), Gaps = 77/546 (14%)
Query: 89 MRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNV----NCPRG 144
M Y N GY +F+ G P L + D V + F N+ CP G
Sbjct: 848 MHYIPNYNGYSSIFITGNDPYIL------------LKEDDSVPRIFKFANIPLVSMCPWG 895
Query: 145 ---FLYFNAKSELRISVLP-THLSYDAPWPVRKVPLKCTPHF------LAYHLETKTYCI 194
+ + R+ L ++ Y P+ KV L T + YH + Y +
Sbjct: 896 KTSVMCVDDIKNARVYTLEVNNMYYGNKLPLLKVTLSDTIEDYMTLTKITYHEGSNMYIV 955
Query: 195 VTSTAEPSTDYYKFNGEDKELVTDPRDSRFIP-PLVSQFHVSLFSPFSWEEIPQTNFPLH 253
A Y GED E + + +P + +Q + L +P +W I + ++ +
Sbjct: 956 ----AYAKDIEYTAIGEDGERLVGSNEE--LPHSMSTQSGILLINPKTWNVIDRKDYEAN 1009
Query: 254 EWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPL 313
+ ++ + ++ + + I +G + +ED+ G +++ EVVP+P +P
Sbjct: 1010 TI--INDIRTMIIQLNSKTNFKKELIVVGISNVGTEDLPPTGSFYIYNTNEVVPDPSKPD 1067
Query: 314 TKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLK-DNDLTGIAFIDTEVYIAS 372
T + K ++ ++ KG + +C ++G + QK+ + ++ D + +AF D V++A
Sbjct: 1068 TNYRFKDVFHEQVKGTINNVCEISGRFMVNQSQKLLVRDIQEDESVVPVAFHDVPVFVAD 1127
Query: 373 MVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSL 432
+ S NL +VGD + + + E P R I+ G
Sbjct: 1128 IKSFGNLFIVGDSMQGFQFVGFDAE-------------------------PYRMIMLGRS 1162
Query: 433 VWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGH 492
V KF ++L D + + F++SD D + + Y P+ S G
Sbjct: 1163 VSKFKTMAL---------------DFVVRNGEIYFVVSDTDDILHILKYSPDEPNSLSGQ 1207
Query: 493 RLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYR 552
RL + F++ + + I + S F T A+LDG++ LPL E ++R
Sbjct: 1208 RLAHYSSFNIHSTNTSMHLLPANDEFIENKGNGSSIFQTIGANLDGSIFKILPLSEDSFR 1267
Query: 553 RLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLE 612
RL ++Q ++ H GLNPR R + Y N +R ++D +L+ ++ LS+ +R
Sbjct: 1268 RLYVIQQQIIDTEVHAAGLNPRMER-LSNEYYQLTNVTRPLLDFNLIRRYSNLSIKKRKS 1326
Query: 613 ICKKIG 618
I +K G
Sbjct: 1327 IAQKAG 1332
>gi|242208344|ref|XP_002470023.1| predicted protein [Postia placenta Mad-698-R]
gi|220730923|gb|EED84773.1| predicted protein [Postia placenta Mad-698-R]
Length = 696
Score = 134 bits (336), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 100/370 (27%), Positives = 160/370 (43%), Gaps = 63/370 (17%)
Query: 211 EDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEG 270
ED V +P P + L SP + F + E V CL V++E
Sbjct: 373 EDGNTVWEPDAPNISFPNCECLMLELISPEPEGWVTMDGFESAQKEFVTCLDCVTLETTS 432
Query: 271 TLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPV 330
T SG+ +I +GT N ED+ +G + +F I+EVVP+ + + KGPV
Sbjct: 433 TGSGMMDFIIVGTTINCREDLAVKGAVYIFSIVEVVPD-----------LQCRDDAKGPV 481
Query: 331 TAICHVAGFLVTAVGQKIYIWQLKDND-LTGIAFIDTEVYIASMVSVKNLILVGDYARSI 389
A+C + LV+++GQKI++ N+ L G+AF+D VYI S+ +VKNL+++ D +S
Sbjct: 482 AALCGLNNSLVSSMGQKIFVRAFDLNERLVGVAFLDVGVYITSLRAVKNLLVISDAVKS- 540
Query: 390 ALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICK 449
+ Y+ + L Y+ + ++A DG +
Sbjct: 541 -----KDPYKLVILGKDPYQVCVTTADLFFA--------DGQVF---------------- 571
Query: 450 KIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 509
+I D+D + ++ Y P ES GG L+++T+FH
Sbjct: 572 -----------------LLIGDEDGVIRIYEYDPHDPESRGGQHLLRRTEFHGQMESRMS 614
Query: 510 FKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
I + +D P AR S +G+L F + E +RL +LQ + + H
Sbjct: 615 ILIIRRRGKDTDIPQAR----LISGSTNGSLSMFTYVDEVASKRLHLLQGQLTRNVQHVV 670
Query: 570 GLNPRAFRTY 579
GLNP+ FR Y
Sbjct: 671 GLNPKVFRPY 680
>gi|406602601|emb|CCH45811.1| hypothetical protein BN7_5397 [Wickerhamomyces ciferrii]
Length = 1287
Score = 133 bits (334), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 131/546 (23%), Positives = 239/546 (43%), Gaps = 70/546 (12%)
Query: 94 NIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVS--TLAPFHNVNCPRGFLYFNAK 151
NI + +F+ G P ++ T+ + T +S + ++ + F+Y +
Sbjct: 791 NIKNQKFIFVTGKQPYIIWKTNHSIPKIFKFTSKTAISICKIKDSNDKDDDSKFMYIDID 850
Query: 152 SELRISVLPT--HLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFN 209
RI LP + +Y P+ V L TP+ + YH ET IV++ E S Y
Sbjct: 851 KTARICSLPIGENFNYSQNLPIEIVSLGQTPNKVTYH-ETSGLFIVSTFEEIS---YNAI 906
Query: 210 GEDKELVTDPRDSRFIPPLVSQFH--VSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSME 267
ED + + P F + L +P +W I + +E + ++++++
Sbjct: 907 DEDGVPIVGSESEK---PKAKNFKGFLKLINPINWTIIDEIEMEENEIIN--DVRSINLT 961
Query: 268 YEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQK 327
+ +I G ED++ G + DII +VP+P +P K K I+ + K
Sbjct: 962 ISSRSKKKKEFIIFGIGKYRLEDLSVFGEFKIIDIISIVPDPTKPEAIYKFKEIFQEVVK 1021
Query: 328 GPVTAICHVAGFLVTAVGQKIYIWQL-KDNDLTGIAFIDTEVYIASMVSVKNLILVGDYA 386
G VT I ++G +T+ GQKI I L +DN +AF+D Y++ S NL+L+ D
Sbjct: 1022 GAVTTINEISGRFLTSQGQKIIIRDLQQDNSTVPVAFMDCATYLSDSKSFGNLLLISDSM 1081
Query: 387 RSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLE 446
+SI L + E L L+ +D + + + I+D ++
Sbjct: 1082 KSIWFLGFDAEPYRLLLLGKD--------QQRFNAITTDFIVDDGEIY------------ 1121
Query: 447 ICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHV 506
F+++D ++++ L YQP+ +S G +L++K+ F +
Sbjct: 1122 --------------------FLVADDEESLHLLTYQPDDPKSLSGQKLLQKSTFTTN-SI 1160
Query: 507 NTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTS 566
T K+ K + G+ + + ++DG++ +P+ E +YRRL +LQ + +
Sbjct: 1161 TTCLKLVPKFNEFD--QGSITSYQNIGVNVDGSIFKMIPIDEISYRRLYILQQQLSDKIA 1218
Query: 567 HTGGLNPRAFRTYKGKGYYAGNP--SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 624
H GLNPR+ R ++ N + II+ L+ F+ L++ +R + K+G ND
Sbjct: 1219 HYVGLNPRSNR-------FSANEQGQKPIIEFGLLKWFINLNVDKRKQFSAKVG--RNDY 1269
Query: 625 LDELYD 630
L+ D
Sbjct: 1270 LELFKD 1275
>gi|50305395|ref|XP_452657.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
gi|74606921|sp|Q6CTT2.1|CFT1_KLULA RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two protein
1
gi|49641790|emb|CAH01508.1| KLLA0C10274p [Kluyveromyces lactis]
Length = 1300
Score = 132 bits (333), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 132/568 (23%), Positives = 242/568 (42%), Gaps = 79/568 (13%)
Query: 81 PRGV-RISQM-RYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPF-- 136
P+GV +I ++ YF N GY VF+ G P + R MT + P+ T+A +
Sbjct: 786 PQGVNKIERVAHYFPNYNGYSVVFITGQVPYIIIKEDNSVCRIFRMT-NIPIVTMARWGK 844
Query: 137 HNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLK-CTPHF-----LAYHLETK 190
++V C N K+ R+ L Y +RK+ ++ F +AYH T
Sbjct: 845 NSVMCVD-----NIKNA-RVMKLDPECYYGNTQILRKIIIEDVVEEFETLGNIAYHERTG 898
Query: 191 TYCIVTSTAEPSTDYYKFNGEDKELV----TDPRDSRFIPPLVSQFHVSLFSPFSWEEIP 246
Y I + +Y + + + LV + P + + L+ L +P +W I
Sbjct: 899 MYII---SYTKFIEYQALSEDGEPLVGYDPSKPNSTGYKSGLL------LINPLTWNIID 949
Query: 247 QTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVV 306
+ + L E V +K + ++ R + +G+++ ED G +L+ DI EVV
Sbjct: 950 RLD--LSENSMVNDIKTMLIQLNSKTRRKRELVIIGSSFVKEEDQPSTGCLLVLDITEVV 1007
Query: 307 PEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLK-DNDLTGIAFID 365
EPG+P + K K ++ +E +G V A+C ++G + K + ++ DN +AF+D
Sbjct: 1008 AEPGKPDSNFKFKQLFEEEIRGSVNAVCEISGRFMIGQSSKALVRDMQEDNSAVPVAFLD 1067
Query: 366 TEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSR 425
V+I S NL+++GD + + + E P R
Sbjct: 1068 MPVFITDAKSFSNLMIIGDSMQGFTFVGFDAE-------------------------PYR 1102
Query: 426 GIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEA 485
I+ G KF ++L + L ++ F+++D+ ++ + Y P+
Sbjct: 1103 MIVLGKSTSKFQVMNL---------------EFLVNNGNINFIVTDRQNHLHVLRYAPDE 1147
Query: 486 RESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLP 545
S G RL+ F++ N + K+ K S ++ DG++ +P
Sbjct: 1148 ANSLSGQRLVHCNSFNMFT-TNNYMKLVRKHVEFG---SKTSNYIALGCQTDGSIFRMIP 1203
Query: 546 LPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQL 605
L E +YRR ++Q ++ H G N + R + Y+ G+ R +D ++ K++ L
Sbjct: 1204 LNEASYRRFYLVQQQLLDHEIPLAGFNTKMER-LDNEYYHKGHSLRPTLDSQVLKKYIHL 1262
Query: 606 SLGERLEICKKIGS-KHNDILDELYDIE 632
+ +R I ++G ++ +L DIE
Sbjct: 1263 PITKRTTIENRVGRHASTELWHDLIDIE 1290
>gi|367014525|ref|XP_003681762.1| hypothetical protein TDEL_0E03080 [Torulaspora delbrueckii]
gi|359749423|emb|CCE92551.1| hypothetical protein TDEL_0E03080 [Torulaspora delbrueckii]
Length = 1327
Score = 132 bits (332), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 101/404 (25%), Positives = 179/404 (44%), Gaps = 46/404 (11%)
Query: 231 QFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSED 290
Q + L +P +W I + P + + K++ ++ + + Y+ +G +ED
Sbjct: 960 QSGILLVNPKTWNIIDKKELPANTL--INDAKSMLIQLDSRTRRKKEYVIVGVAVVGTED 1017
Query: 291 VTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYI 350
+ G +FDI EVVPEPG+P T K+ ++ +E +G V+ +C ++G + QK+ +
Sbjct: 1018 LPPSGSFFVFDITEVVPEPGKPDTNFKLSEVFQEEIRGTVSTVCEISGRFLINQSQKVLV 1077
Query: 351 WQLK-DNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYK 409
++ DN + +AF+D V++ S N +++GD + + + E
Sbjct: 1078 RDVQDDNSVVPVAFLDIPVFVTDAKSFGNFMIIGDAMQGFQFVGFDAE------------ 1125
Query: 410 PTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMI 469
P R I G + K +S+ + L + F I
Sbjct: 1126 -------------PYRMIPLGRSIAKMETVSV---------------EFLVNGGDIFFAI 1157
Query: 470 SDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRF 529
+D D + +F Y P+ S G RL+ T F+L NT + K A F
Sbjct: 1158 TDTDDILHVFKYAPDEPNSLSGQRLLHCTSFNL-HSTNTCMALLPKNEEFEPAQANMKNF 1216
Query: 530 LTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNP 589
+DG++ LPL E YRRL ++Q + GGLNPR R + Y +
Sbjct: 1217 QAIGGQVDGSVFKLLPLREDVYRRLYVVQQQITEKELQLGGLNPRMER-LSNEHYKTTHV 1275
Query: 590 SRGIIDGSLVWKFLQLSLGERLEICKKIGSK-HNDILDELYDIE 632
R ++D +++ +F +LS R +I +K+G + H +I +L ++E
Sbjct: 1276 LRPMLDFNVIQRFKRLSTDRRKQISQKVGKRAHFEIWRDLINVE 1319
>gi|365984967|ref|XP_003669316.1| hypothetical protein NDAI_0C04130 [Naumovozyma dairenensis CBS 421]
gi|343768084|emb|CCD24073.1| hypothetical protein NDAI_0C04130 [Naumovozyma dairenensis CBS 421]
Length = 1388
Score = 131 bits (330), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 130/566 (22%), Positives = 236/566 (41%), Gaps = 89/566 (15%)
Query: 89 MRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNC------- 141
M Y S+ GY +F+ G P L D V + F N++
Sbjct: 880 MHYISDYNGYSVIFITGSVPYMLIRE------------DDSVPRIFQFANLSIVSMARWG 927
Query: 142 PRGFLYFNAKSELRISVLP-THLSYDAPWPVRKVPLK------CTPHFLAYHLETKTYCI 194
+ + RI L ++ Y +RK+ + T + YH +T+ + +
Sbjct: 928 KNSIMCVDNLKNARIYGLDHANIYYGNKLSIRKIKISDSLEDYMTLTKITYHEKTQMFLV 987
Query: 195 VTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVS-QFHVSLFSPFSWEEIPQTNFPLH 253
+ T+Y +D+ +V D +P S Q V L +P +W I F +
Sbjct: 988 ---SYAKETEYDALGEDDERIVGYDED---VPHAKSFQSGVLLINPLTWNVIDSKTFGKN 1041
Query: 254 EWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPL 313
V ++++ ++ R YI G + +ED+ G ++DI EVVPEPG+P
Sbjct: 1042 TL--VNDMRSMLIQVNSKARRKREYIIAGVTHIGTEDLPPTGAFHIYDITEVVPEPGKPD 1099
Query: 314 TKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLK-DNDLTGIAFIDTEVYIAS 372
T ++K ++ +E +G V+ +C ++G + QK+ + + DN + +AF+D V+I
Sbjct: 1100 TNYRLKEVFKEEVRGIVSTVCEISGRFLVNQSQKVMVRDAQEDNSVVPVAFLDIPVFIND 1159
Query: 373 MVSVKNLILVGDYARSIALLRYQPE-YRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGS 431
S + +++GD + + + + E YR ++L G
Sbjct: 1160 AKSFGDFLILGDAMQGLHFIGFDAEPYRMINL--------------------------GK 1193
Query: 432 LVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMG----FMISDKDKNVVLFMYQPEARE 487
V KF +S+ EF G F ++D++ + + Y P+
Sbjct: 1194 SVTKFETVSV-------------------EFVVNGGDLYFALTDRNNILHVLKYAPDELN 1234
Query: 488 SNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLP 547
S G +L+ + F+L N+ + K D A F T +DG++ +PL
Sbjct: 1235 SLSGQKLVHCSSFNLFSG-NSSLLLLPKNEEFEDTKNAPLTFQTIGGQVDGSIFKVIPLR 1293
Query: 548 EKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSL 607
E YRRL ++Q M GGLNPR R + Y + R ++D +++ +F +L +
Sbjct: 1294 EDTYRRLYVIQQHMNDKEPQLGGLNPRMER-LSNEYYQLCHVMRPMLDFNIIRRFSELPI 1352
Query: 608 GERLEICKKIGSK-HNDILDELYDIE 632
R + K+ G + H +I ++ ++E
Sbjct: 1353 DRRTRVAKRAGQRAHYEIWRDMINVE 1378
>gi|313215162|emb|CBY42850.1| unnamed protein product [Oikopleura dioica]
Length = 228
Score = 130 bits (328), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 80/273 (29%), Positives = 136/273 (49%), Gaps = 59/273 (21%)
Query: 377 KNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKF 436
KN LVGD + I LLR+Q E +S ++R + + + G ++DG+ V
Sbjct: 4 KNYALVGDIQQGITLLRHQGERNCISQISRARRAGEVTAVGI--------LLDGNQV--- 52
Query: 437 LQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIK 496
G + +D +N+ ++MY+P+ +ESNGG +L++
Sbjct: 53 -----------------------------GLVSTDMQRNLQVYMYKPDQKESNGGKQLVR 83
Query: 497 KTDFHLGQHV-----------NTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLP 545
+ D +LG+ V +TF K+ + +R +T+YA LDG++G +P
Sbjct: 84 QADINLGKRVISIWNSLGRQNDTFTKVALTEND--------ARHVTFYAGLDGSIGDIVP 135
Query: 546 LPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQL 605
+ EK +RRL MLQ ++ +H H GGLNPR +R + N ++ IIDG L+ +F L
Sbjct: 136 VSEKVFRRLEMLQTLVQSHLPHYGGLNPREYRYCTNEYRDLENAAKNIIDGDLLERFNGL 195
Query: 606 SLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
S E+ ++ +KIG +LD++ D++ + F
Sbjct: 196 SFTEQTDLSRKIGVTREALLDDMMDVQRTKNLF 228
>gi|156847699|ref|XP_001646733.1| hypothetical protein Kpol_1023p44 [Vanderwaltozyma polyspora DSM
70294]
gi|156117413|gb|EDO18875.1| hypothetical protein Kpol_1023p44 [Vanderwaltozyma polyspora DSM
70294]
Length = 1337
Score = 129 bits (324), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 97/404 (24%), Positives = 185/404 (45%), Gaps = 46/404 (11%)
Query: 231 QFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSED 290
Q + L +P +W I + F + + +++++++ R + +G +ED
Sbjct: 968 QAGILLVNPKTWNVIDKIEFERNSL--INDMRSMTIQVNSKTKKKRELLVVGVASIGTED 1025
Query: 291 VTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYI 350
+ G + DI EVVPEPG+P T K K I+ + +G V ++C ++G + QK+ +
Sbjct: 1026 LPSAGSFHVIDINEVVPEPGKPDTNYKFKEIFQETVRGNVNSVCEISGRFMINQSQKLLV 1085
Query: 351 WQLK-DNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYK 409
++ D + +AF+D VY+ S NL++VGD + + + E
Sbjct: 1086 RDIQEDESVVPVAFLDVPVYVTDTKSFSNLMIVGDSMQGFQFVGFDAE------------ 1133
Query: 410 PTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMI 469
P R I G V KF ++L + L + F++
Sbjct: 1134 -------------PYRMIPLGRSVSKFKTVAL---------------EFLVNNGDIFFIV 1165
Query: 470 SDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRF 529
SD++ + + Y P+ S G RL + F++ NT + + +P ++ F
Sbjct: 1166 SDRNDILHVLKYAPDEPNSLSGQRLAHYSSFNI-HSTNTSMILLPSNNEFQSSPNGQATF 1224
Query: 530 LTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNP 589
+ + +DG++ +PL E ++RRL ++Q ++ GGLNPR R + Y +
Sbjct: 1225 QSVGSCVDGSIFKVIPLDEDSFRRLYVIQQQVIDTEIQAGGLNPRMER-LSNEYYQLVHL 1283
Query: 590 SRGIIDGSLVWKFLQLSLGERLEICKKIGSK-HNDILDELYDIE 632
R ++D +++ +F LS+ +R +I +K G + H D+ ++ +IE
Sbjct: 1284 MRPMLDFNIIRRFSNLSITKRTKIAQKAGRRAHFDVWRDMINIE 1327
>gi|149512998|ref|XP_001514888.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like, partial [Ornithorhynchus anatinus]
Length = 831
Score = 129 bits (323), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 65/182 (35%), Positives = 100/182 (54%), Gaps = 44/182 (24%)
Query: 351 WQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKP 410
W ++D++LT I FID ++YI ++SVKN IL D +SI+LLRYQ E +TLSLV+RD KP
Sbjct: 694 WAIRDSELTSITFIDMQLYIHQIISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKP 753
Query: 411 TQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMIS 470
+ S + N + +GF++S
Sbjct: 754 LEVYSVDFMVDN----------------------------------------AQLGFLVS 773
Query: 471 DKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFL 530
D+D+N++++MY PEA+ES GG RL+++ DFH+G HVN F++ C+ + A G + +
Sbjct: 774 DRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNAFWRTPCRGA----AEGPSKKSI 829
Query: 531 TW 532
W
Sbjct: 830 VW 831
>gi|342186481|emb|CCC95967.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 1456
Score = 128 bits (321), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 166/696 (23%), Positives = 258/696 (37%), Gaps = 133/696 (19%)
Query: 6 SHSPSAMDETIVQ----ELLTVSLG--LHGN-----RPLLLVRTQHELLIYQAFRHPKGA 54
+ PSA ETI E+L +S G G L +V + EL +Y + P
Sbjct: 819 TKEPSAATETIPHVTHVEVLKLSEGPATEGTDTVVATALAVVLSSGELAVYHVMK-PDTF 877
Query: 55 LKLRFKKLKVLFVSDRSKRANEQPGLPRGVRISQMRY------------------FSNIA 96
LRF K F+ R+ R + R R+ + R F+ +A
Sbjct: 878 GSLRFIKAVHHFLDTRAVREVIESIEARKCRLQRERTMIENDTQSTRHCARRIIPFACVA 937
Query: 97 GYQGVFLCGPHPAWLFLTSRGE-LRAHPMTIDGPVSTLAPFHN-----VNCPRGFLYFNA 150
G G ++CG HP +L R + A+ G V F + C GF+ F
Sbjct: 938 GQSGAYVCGQHPVFLLWDKRKRRIAAYRHQSPGAVRGFVSFPQMAGGFIYCCEGFVDFAR 997
Query: 151 KSELRISVLPTHLSYDAP----WPVRKVPLKCTPHFLAYHLETKTYCIVTS---TAEPST 203
+ +Y AP W R++ + TPHFL Y K+ +VTS T P
Sbjct: 998 MN-----------TYCAPNGQGWLTRRIAIGATPHFLVYDPPGKSCFVVTSEKKTFRPQR 1046
Query: 204 DYYKFNGE---DKELVT------DPRDSRFIP---------PLVSQFHVSLFSPFSWEEI 245
++ + D+EL T +P P P+V QF V L S +
Sbjct: 1047 AFFDVQLKIHYDEELNTVQSVTAEPPVCHMPPINPGAGVRVPMVEQFEVRLLSTTGEQWE 1106
Query: 246 PQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRG--YIALGTNYNYSEDVTCRGRILLFDII 303
F L E E VL + V + + ++G L T + EDVTCRGRI+L
Sbjct: 1107 CTHKFALEENEKVLGAQAVELRQDEAIAGAPSAPVCVLCTAFPLGEDVTCRGRIILLASK 1166
Query: 304 EVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKI----YIWQLKDNDLT 359
V K I ++++ GP TA+ + + AVG I Y W+ K L
Sbjct: 1167 TV-------KKKRAIVQLHSEPLNGPATAVTGICSQIAVAVGGTIKIFRYDWETKK--LV 1217
Query: 360 GIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYY 419
AF+ VY + + +N I+ GD RS A+ R+ + TL+++ +D+
Sbjct: 1218 VSAFLYAGVYATRLSAFRNYIIYGDLCRSCAMARFNEQNHTLTVLGKDH----------- 1266
Query: 420 AGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLF 479
+ H D++ + G + S+ ++++L
Sbjct: 1267 -----------------------------NAVSVVHCDMMYHDRTFGILCSNDQRDLLLM 1297
Query: 480 MYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGA 539
Y P +ES G H + + C S+ A + +T Y S G
Sbjct: 1298 GYTPRVQES-GEHTPSRVLESPFSLDGEYRLPSGCLAKSLRFRSAAGNSSVTVYISNYGE 1356
Query: 540 LGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTY-KGKGYYAGNPSRGIIDGSL 598
+GF +PL E+ R L + + GL PR F + +G ++ L
Sbjct: 1357 VGFIVPLGEQANRTALWITRRLQVDLPCDAGLTPRMFLSLSQGTPRTTLRGKEMLVSAPL 1416
Query: 599 VWKFLQLSLGERLEICKKIGSKHNDILDELYDIEAL 634
V L + R K I LD + ++ AL
Sbjct: 1417 VQGLFFLDVHSR----KAIARAAYTQLDRVINVAAL 1448
>gi|164655043|ref|XP_001728653.1| hypothetical protein MGL_4214 [Malassezia globosa CBS 7966]
gi|159102535|gb|EDP41439.1| hypothetical protein MGL_4214 [Malassezia globosa CBS 7966]
Length = 1212
Score = 128 bits (321), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 125/483 (25%), Positives = 215/483 (44%), Gaps = 59/483 (12%)
Query: 162 HLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRD 221
L++DA P + T +A H E C V S+ +P T + +N E++ + RD
Sbjct: 783 ELAFDASVPYLRWTTGRTYTHVAVHEELA--CFVASSEQP-TQFVLYNDEEQPV----RD 835
Query: 222 SRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIAL 281
+ P +L E P + E V L M+ SG R ++ +
Sbjct: 836 PKQDPTRTYAACGALELLVRVGEPPVHGYEFSACETVSALHMAPMDCLDRGSGRRTFVVV 895
Query: 282 GTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKN-KIKMIYAKEQKGPVTAICHVAGFL 340
GT Y ED + +G + +FD++E VP G + +++++ +E + PVTA+ + GFL
Sbjct: 896 GTTVTYGEDRSSKGHMYVFDVVECVPSEGMAASDALRLQLLCTEEMRAPVTALHDLNGFL 955
Query: 341 VTAVGQKIYI--WQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEY 398
V AVGQK+ I W+ + L +AF+D +Y S+ VKN +L+ DY +S + +Q +
Sbjct: 956 VAAVGQKLLIRSWEYCEW-LVTVAFLDMGMYTTSIQRVKNFLLLTDYYQSAYFVAFQEDP 1014
Query: 399 RTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 458
L L+ RDY PT + +ID + RL I
Sbjct: 1015 ARLVLLGRDYIPTSVTCGAF--------LIDRA------------RLSI----------- 1043
Query: 459 LDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHL-GQHVNTFFKIRCKPS 517
+ D + + L Y P S GG RL+ + ++H G+ V ++ P
Sbjct: 1044 ---------VTCDMNGCLRLMDYHPSNPTSLGGQRLLARCEYHAPGEVVRA--RMLHGPY 1092
Query: 518 SISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFR 577
+ S + A +GA+ +P+ EK + L + Q+ +V HT GLNPR FR
Sbjct: 1093 LATSGECLTSEIV--LAKRNGAVDVLVPVTEKIFPTLQLFQSQLVRMVRHTAGLNPRGFR 1150
Query: 578 TYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDIL--DELYDIEALS 635
+ + + ++GI+DG+L+ +S + + + + ++ ++ D L + L
Sbjct: 1151 AVFNQ-HISRPLAKGILDGTLLHTAESMSRPKLTSLVRDLSTRTGGVIADDLLRCLVHLQ 1209
Query: 636 SHF 638
SH+
Sbjct: 1210 SHW 1212
>gi|302831157|ref|XP_002947144.1| hypothetical protein VOLCADRAFT_87503 [Volvox carteri f. nagariensis]
gi|300267551|gb|EFJ51734.1| hypothetical protein VOLCADRAFT_87503 [Volvox carteri f. nagariensis]
Length = 2830
Score = 127 bits (320), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 131/555 (23%), Positives = 210/555 (37%), Gaps = 130/555 (23%)
Query: 98 YQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFL-YFNAKSELRI 156
Y GVF+ G P WL + SRG L HPM +G V+ + PFHN NCP GF+ +++ L++
Sbjct: 2259 YSGVFVAGSRPLWL-VASRGGLVPHPMFAEGAVAAMTPFHNANCPLGFISACSSRGLLKV 2317
Query: 157 SVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTA---------EPSTDYYK 207
LP H D PW R+VPL+ TPH LA+ + +TS EP D +
Sbjct: 2318 CQLPPHTRLDTPWVTRRVPLRVTPHKLAWFRDAGLMAAITSRVVVSRPRPPEEPGGDAHA 2377
Query: 208 FNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSME 267
R + + L P + + L E LCLK + ++
Sbjct: 2378 AAAYAAAAAAAAGRGR-----EEAWELRLLEPNGCGRLWLSPL-LPPGEQALCLKVIYLQ 2431
Query: 268 YEGTLSGLRGYIALGT----------NYNY----------------------SEDVTCRG 295
T +A+GT N+ + CRG
Sbjct: 2432 -NATTGDTDALLAVGTGSPMGQLGGGNWRFRLPRGRVAGSGGLVVHRQCEREGAGRGCRG 2490
Query: 296 ------------RILLFDI-IEVVPEPGQPLTKN-KIKMIYAKEQKGPVTAICHVAGFLV 341
RILL+ I EVV G LT+ ++ ++ VT++ L+
Sbjct: 2491 ERPPGEDYPCLGRILLYTISAEVVDLGGGNLTRRWSAVLVATRDMASAVTSVQEFKSQLL 2550
Query: 342 TAVGQKIYIWQLKDND--------------LTGIAFIDTEVYIASMVSVKNLILVGDYAR 387
G +I +++ + L AF D + S+V+VK+ +L D ++
Sbjct: 2551 VTCGSRIEMYEWRGPAAGASGGGGGGPGGRLEKRAFFDLPSLVTSLVAVKDYLLAADASQ 2610
Query: 388 SIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEI 447
+ +RY R L +++D+ R ++ +V +L+
Sbjct: 2611 GLYFVRYSDSARVLEFMSKDFD--------------HRDVLTAGVVINEPKLA------- 2649
Query: 448 CKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESN----GGHRLIKKTDFHLG 503
F+ +D N+ L + +R +N G RL H+
Sbjct: 2650 -------------------FLAADAAGNLALSEFY-GSRNTNPEFWAGQRLAPLGLMHVA 2689
Query: 504 QHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNY-RRLLMLQNVMV 562
+ ++ I+ S ++R + +G L + P+P+ +RLL LQN M
Sbjct: 2690 RRLSCCVSIKMPTSD------GKNRHALLCGAAEGGLSYIAPVPDAEMTQRLLALQNHMS 2743
Query: 563 THTSHTGGLNPRAFR 577
H GLNPRAFR
Sbjct: 2744 RRLPHVAGLNPRAFR 2758
>gi|119580419|gb|EAW60015.1| hCG2010549, isoform CRA_a [Homo sapiens]
Length = 323
Score = 127 bits (319), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 76/182 (41%), Positives = 99/182 (54%), Gaps = 40/182 (21%)
Query: 13 DETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRH----PKGALKLRFKKLKVLFVS 68
+E + QE L +S + P LLV +LLIY+AF H +G LK+ FKK+
Sbjct: 110 EEAMCQEELPLSSRSRQSTPYLLVHVDQKLLIYKAFPHDSRLSQGNLKVHFKKV------ 163
Query: 69 DRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDG 128
NI+ + A G LR HP+ I+G
Sbjct: 164 -----------------------LHNISFREKKPKPSKKKA-------GVLRLHPVGING 193
Query: 129 PVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLE 188
PV++ A FHNVNCPRGFLYFN + +LRISVLP +LSYD+PWPVRK+PL CT H +AYH+E
Sbjct: 194 PVNSFALFHNVNCPRGFLYFNRQGKLRISVLPAYLSYDSPWPVRKIPLCCTVHCVAYHVE 253
Query: 189 TK 190
+K
Sbjct: 254 SK 255
Score = 43.1 bits (100), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 26/68 (38%), Positives = 40/68 (58%), Gaps = 8/68 (11%)
Query: 326 QKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAF-IDTEVYIASMVSVKNLILVGD 384
+K P+ H + V + KI L+ ++LTG+AF +D ++YI M+SV+N IL D
Sbjct: 237 RKIPLCCTVHCVAYHVES---KI----LQASELTGMAFMVDRQLYIHQMISVRNFILAAD 289
Query: 385 YARSIALL 392
+SI LL
Sbjct: 290 LMKSIWLL 297
>gi|414587798|tpg|DAA38369.1| TPA: hypothetical protein ZEAMMB73_163106, partial [Zea mays]
Length = 483
Score = 127 bits (318), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 77/225 (34%), Positives = 118/225 (52%), Gaps = 17/225 (7%)
Query: 88 QMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLY 147
++ F+N+ GY+G+FL GP P W+F+ R R HP DGP+ HNVNC RG +Y
Sbjct: 257 RITIFNNVGGYEGLFLGGPRPTWVFVC-RQRFRVHPQLCDGPIVAFTVLHNVNCCRGLIY 315
Query: 148 FNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPS-TDYY 206
++ L+I LP+ +YD WPV+KVPL TPH + Y+ E Y ++ S + +
Sbjct: 316 VTSQGFLKICQLPSAYNYDNYWPVQKVPLHGTPHQVTYYGEQSLYPLIVSVPQVRPLNQV 375
Query: 207 KFNGEDKEL-------VTDPRDSRFIPPLVSQFHVSLF----SPFSWEEIPQTNFPLHEW 255
+ D+EL VT D + + V +F V + S WE ++ P+ +
Sbjct: 376 LSSMADQELGLHMENDVTSGGDLQEV-YTVDEFEVRIMELGKSNGRWET--RSTIPMQSF 432
Query: 256 EHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLF 300
E+ L ++ V+++ T +A+GT Y EDV RGR+LLF
Sbjct: 433 ENALTVRIVTLQNTSTKEN-ETLMAIGTAYVQGEDVAARGRVLLF 476
>gi|410079681|ref|XP_003957421.1| hypothetical protein KAFR_0E01320 [Kazachstania africana CBS 2517]
gi|372464007|emb|CCF58286.1| hypothetical protein KAFR_0E01320 [Kazachstania africana CBS 2517]
Length = 1350
Score = 125 bits (314), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 112/478 (23%), Positives = 206/478 (43%), Gaps = 61/478 (12%)
Query: 165 YDAPWPVRKVPLKC------TPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTD 218
Y +P++ + + T + + YH +T+TY I + DY ED EL+
Sbjct: 912 YGNKFPLKSIKINTELEDYMTFNKITYHEKTQTYVIAYN---KEIDYVA-KAEDGELLVG 967
Query: 219 PRDSRFIPPLVSQFH--VSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLR 276
+ + P F + L +P SW I + F E V ++++ ++ + R
Sbjct: 968 YKQN---VPHAKGFQSGLLLINPKSWNVIDKVEF--EENSLVNDIRSMIIQIDSRTKRKR 1022
Query: 277 GYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
YI G + +ED+ G L+DI VVPEPG+P T K + + +E +G VT++C +
Sbjct: 1023 EYIVAGFSAVGTEDLPPSGSFHLYDITAVVPEPGKPDTNYKFERFFKEEVRGSVTSVCEI 1082
Query: 337 AGFLVTAVGQKIYIWQLK-DNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQ 395
+G + QKI + + D + +AF+D +++ M S NL+++ D + +
Sbjct: 1083 SGRFAISQSQKIMVRDAQEDGSVVPVAFLDIPIFVTDMKSFGNLMIISDAMHGFQFVGFD 1142
Query: 396 PEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKH 455
E P R I G V KF +S+
Sbjct: 1143 AE-------------------------PYRMIQLGKSVSKFKTMSV-------------- 1163
Query: 456 NDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCK 515
+ L + F ++D+D + + Y P+ S G +L+ + F+L N+ + K
Sbjct: 1164 -EFLVNNGDIYFAVTDRDNILHVLKYAPDEPNSFSGQKLVHCSSFNLYAD-NSCMVLLAK 1221
Query: 516 PSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRA 575
+ + DG++ +PL E++YRRL ++Q ++ + GGLNPR
Sbjct: 1222 NDEFNKVDDTNRTYQVVGGQTDGSMFKIVPLSEESYRRLYVIQQQIIDKETQLGGLNPRM 1281
Query: 576 FRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK-HNDILDELYDIE 632
R + + R ++D +++ KF + + +R + +K+G H + +L +IE
Sbjct: 1282 ER-LSNQYLPLCHVMRPMLDFNVIRKFSAMPISKRQALAQKLGRNVHFEAWRDLINIE 1338
>gi|261335516|emb|CBH18510.1| cleavage and polyadenylation specificity factor-like protein,
putative [Trypanosoma brucei gambiense DAL972]
Length = 1452
Score = 125 bits (314), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 153/643 (23%), Positives = 247/643 (38%), Gaps = 113/643 (17%)
Query: 32 PLLLVRTQHELLIYQAFRHPKGALKLRFKKLKVLFVSDRSKRANEQPGLPRGVRISQMRY 91
PL L++ H L +A R +++ + +L+ S+R+ N+ + VR R
Sbjct: 875 PLRLIKKFHHFLDTKAVREVIESIEAKKMRLQ----SERTMIENDT----QSVRHCSRRI 926
Query: 92 --FSNIAGYQGVFLCGPHPAWLFLTSRG-ELRAHPMTIDGPVSTLAPFHNVN-----CPR 143
F+ +AG G ++CG HP +L +R +L A+ GPV PF +++ C
Sbjct: 927 IPFAAVAGQSGAYVCGQHPLFLMWDNRTRQLVAYRHQAPGPVRGFVPFTSMSGGFIYCCE 986
Query: 144 GFLYFNAKSELRISVLPTHLSYDAP-WPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEP- 201
GF+ F +V+ T+ S W R++ + TPHF+ Y ++ +VTS P
Sbjct: 987 GFVDF--------AVMNTYCSPGGNGWLRRRIHIGATPHFIVYDPPGRSCFVVTSKKVPF 1038
Query: 202 -----STDY-----YKFNGEDKELVTDPRDSRFIP----------PLVSQFHVSLFSPFS 241
S D Y + + VT +P PL +F V L S F
Sbjct: 1039 RPQRASFDVQLKIQYDEDSNTVQSVTTEAPVCNMPAIKPGTGVRVPLTERFEVRLHSTFK 1098
Query: 242 WEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSG--LRGYIALGTNYNYSEDVTCRGRILL 299
L E E VL + V + + G + T + EDVTCRGRI+L
Sbjct: 1099 KGWDCTDKLMLDENEKVLGAQMVEIHQDANADGSATAPVCVVCTAFPLGEDVTCRGRIIL 1158
Query: 300 FDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKI----YIWQLKD 355
+ + I ++++ GP TA+ + + AVG I Y W+ K
Sbjct: 1159 LASRNIK-------GRRSIVQLHSEPLNGPATAVAGICSQIAVAVGGTIKIFRYDWETKK 1211
Query: 356 NDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNS 415
L AF+ +Y + +N I+ GD RS ++ R+ E TL+++ RD
Sbjct: 1212 --LVVSAFLYAGMYATRLSVFRNYIIYGDLCRSCSMARFNEENHTLTVLGRDR------- 1262
Query: 416 KGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKN 475
+ H D++ + G + SD ++N
Sbjct: 1263 ---------------------------------SAVSVVHCDMMYHDRAFGILCSDDERN 1289
Query: 476 VVLFMYQPEARESNGG-HRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYA 534
V++ Y P +E++ G H + ++ L K G S +T Y
Sbjct: 1290 VLIMGYTPRVQETDAGTHPKVLESVLSLDGEYRLPSGSLVKSLRFRSTAGNSS--VTLYV 1347
Query: 535 SLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG-- 592
S G +GF +P+ E+ R L + + GL PR F + N RG
Sbjct: 1348 SNYGEIGFIVPIGEQANRTALWVTRRLQIDLPCEAGLTPRMFLSLNQSSPR--NSLRGKE 1405
Query: 593 -IIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEAL 634
++ L+ L L R K I LD + +I AL
Sbjct: 1406 MLVPAPLLRGLFSLDLRSR----KAIARAAYTQLDRVANIVAL 1444
>gi|254580509|ref|XP_002496240.1| ZYRO0C13816p [Zygosaccharomyces rouxii]
gi|238939131|emb|CAR27307.1| ZYRO0C13816p [Zygosaccharomyces rouxii]
Length = 1331
Score = 125 bits (314), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 98/399 (24%), Positives = 178/399 (44%), Gaps = 46/399 (11%)
Query: 236 LFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRG 295
L +P +W I + F + + + + ++ + + Y+ +G + +ED+ G
Sbjct: 969 LINPKTWNVIDKREFDDNSL--INDARTMLIQLDSRTRRRKEYVIVGVAHVETEDLPPSG 1026
Query: 296 RILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLK- 354
+ +FDI EVVPEPG+P T K+ ++ + +G V+++C ++G + QK+ + ++
Sbjct: 1027 SLSVFDITEVVPEPGKPDTNFKLGEVFKENIRGTVSSVCDISGRFLINQSQKVIVRDVQE 1086
Query: 355 DNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPN 414
DN + +AF+D V++ + S N +++GD + + + E
Sbjct: 1087 DNSVVPVAFLDVPVFVTDVKSFGNFLIIGDSMQGFQFIGFDAE----------------- 1129
Query: 415 SKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDK 474
P R I G V K ++L + L + F ++D
Sbjct: 1130 --------PYRMIPLGRSVSKLETVAL---------------EFLVNGGDIFFAVTDTSN 1166
Query: 475 NVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYA 534
+ +F Y P+ S G RL+ T F+L NT + K S + S
Sbjct: 1167 ILHIFKYAPDEPNSLSGQRLVHCTSFNL-HSTNTCMVLLPKNEEFSVGEKSLSPVQVVGG 1225
Query: 535 SLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGII 594
DG+L +PL E YRRL +LQ + GGLNPR R + Y+ + R ++
Sbjct: 1226 QTDGSLFKLVPLREDTYRRLYVLQQQLTEKEVQLGGLNPRMER-LSNEYYHLTHAVRPML 1284
Query: 595 DGSLVWKFLQLSLGERLEICKKIGSK-HNDILDELYDIE 632
+ +++ +F LS+ +R + +K G + H DI +L +IE
Sbjct: 1285 EFNVIRRFNTLSVEKRKQTAQKAGRRAHFDIWRDLVNIE 1323
>gi|340059653|emb|CCC54046.1| putative mitochondrial carrier protein [Trypanosoma vivax Y486]
Length = 1481
Score = 125 bits (313), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 124/518 (23%), Positives = 207/518 (39%), Gaps = 90/518 (17%)
Query: 92 FSNIAGYQGVFLCGPHPAWLFLTSR-GELRAHPMTIDGPVSTLAPF-----HNVNCPRGF 145
F ++AG G ++CG HP +L R G L + I GPV APF V C GF
Sbjct: 958 FDSLAGNVGAYVCGRHPLFLLWDRRTGLLSGYRHQIQGPVRGFAPFPLMEGGFVYCGEGF 1017
Query: 146 LYFNAKSELRISVLPTHLS-YDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEP--- 201
F +V+ T+ W R++ + TPHF++Y++ + +VTS +P
Sbjct: 1018 TDF--------AVMNTYCRPIGHGWLGRRIDVGATPHFISYNMPGRGCFVVTSHKQPFRP 1069
Query: 202 ---------STDYYKFNGEDKELVTDPRDSRFIP---------PLVSQFHVSLFSPFSWE 243
Y + G + + T+P P P+ F V S +
Sbjct: 1070 QRAPFDVQLKISYNEETGAIQSIATEPLTCSMPPIASSAGVRVPMADWFEVRFMSTAHVD 1129
Query: 244 EIPQTNFPLHEWEHVLCLKNVSMEYEG--TLSGLRGYIALGTNYNYSEDVTCRGRILLFD 301
+ F L E E VL ++ V ++ + ++G + T + +DVTCRGRI L
Sbjct: 1130 WPCEDTFKLEENERVLSIQMVQIDGDRGMKINGTVPVCVVSTAFPLGDDVTCRGRIHLL- 1188
Query: 302 IIEVVPEPGQPLTK-NKIKMIYAKEQKGPVTAICHVAGFLVTAVGQ--KIYIWQLKDNDL 358
+ L + +KI ++A+ GP TA+ + + AVG KIY + + L
Sbjct: 1189 -------ATKSLRRGHKIVHLHAEALNGPATAVAEIRHHIAVAVGGTIKIYRYDWQSGKL 1241
Query: 359 TGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGY 418
+ +Y + ++N I+ GD S A+ R+ E TL+++ R+
Sbjct: 1242 VVSVLLYAGIYATKLSVIRNYIVYGDLIHSCAMARFNEENHTLTVLGRNRN--------- 1292
Query: 419 YAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVL 478
S ++D ++++ H+ S G + SD +NV++
Sbjct: 1293 -----SISVVDCNMMY--------------------HD------RSFGILCSDDQRNVLV 1321
Query: 479 FMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDG 538
Y P +E+ G R K + L C S+ + + + Y S G
Sbjct: 1322 MGYTPRVQEAGAG-RPAKTLESLLTLDGEYRLPSGCLAKSLRFSSDFGNSSVMLYTSNYG 1380
Query: 539 ALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAF 576
+GF +P+ E+ R L + + T GL PR F
Sbjct: 1381 EVGFIVPIGEQANRTALWVTRRLQTDVPCDAGLTPRMF 1418
>gi|74025892|ref|XP_829512.1| cleavage and polyadenylation specificity factor-like protein
[Trypanosoma brucei brucei strain 927/4 GUTat10.1]
gi|70834898|gb|EAN80400.1| cleavage and polyadenylation specificity factor-like protein,
putative [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
Length = 1452
Score = 124 bits (311), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 152/638 (23%), Positives = 248/638 (38%), Gaps = 103/638 (16%)
Query: 32 PLLLVRTQHELLIYQAFRHPKGALKLRFKKLKVLFVSDRSKRANEQPGLPRGVRISQMRY 91
PL L++ H L +A R +++ + +L+ S+R+ N+ + VR R
Sbjct: 875 PLRLIKKFHHFLDTKAVREVIESIEAKKMRLQ----SERTMIENDT----QSVRHCSRRI 926
Query: 92 --FSNIAGYQGVFLCGPHPAWLFLTSRG-ELRAHPMTIDGPVSTLAPFHNVNCPRGFLYF 148
F+ +AG G ++CG HP +L +R +L A+ G V PF ++ P GF+Y
Sbjct: 927 IPFAAVAGQSGAYVCGQHPLFLMWDNRTRQLVAYRHQAPGLVRGFVPFTSM--PGGFIYC 984
Query: 149 NAKSELRISVLPTHLSYDA-PWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEP------ 201
+ + +V+ T+ S W R++ + TPHF+ Y ++ +VTS P
Sbjct: 985 -CEGFVDFAVMNTYCSPGGNGWLRRRIHIGATPHFIVYDPPGRSCFVVTSKKVPFRPQRA 1043
Query: 202 STDY-----YKFNGEDKELVTDPRDSRFIP----------PLVSQFHVSLFSPFSWEEIP 246
S D Y + + VT +P PL +F V L S F
Sbjct: 1044 SFDVQLKIQYDEDSNTVQSVTTEAPVCNMPAIKPGTGVRVPLTERFEVRLHSTFKKGWDC 1103
Query: 247 QTNFPLHEWEHVLCLKNVSMEYEGTLSG--LRGYIALGTNYNYSEDVTCRGRILLFDIIE 304
L E E VL + V + + G + T + EDVTCRGRI+L
Sbjct: 1104 TDKLMLDENEKVLGAQMVEIHQDANADGSATAPVCVVCTAFPLGEDVTCRGRIILLASRN 1163
Query: 305 VVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKI----YIWQLKDNDLTG 360
+ + I ++++ GP TA+ + + AVG I Y W+ K L
Sbjct: 1164 IK-------GRRSIVQLHSEPLNGPATAVAGICSQIAVAVGGTIKIFRYDWETKK--LVV 1214
Query: 361 IAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYA 420
AF+ +Y + +N I+ GD RS ++ R+ E TL+++ RD
Sbjct: 1215 SAFLYAGMYATRLSVFRNYIIYGDLCRSCSMARFNEENHTLTVLGRDR------------ 1262
Query: 421 GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFM 480
+ H D++ + G + SD ++NV++
Sbjct: 1263 ----------------------------SAVSVVHCDMMYHDRAFGILCSDDERNVLIMG 1294
Query: 481 YQPEARESNGG-HRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGA 539
Y P +E++ G H + ++ L K G S +T Y S G
Sbjct: 1295 YTPRVQETDAGTHPKVLESVLSLDGEYRLPSGSLVKSLRFRSTAGNSS--VTLYVSNYGE 1352
Query: 540 LGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG---IIDG 596
+GF +P+ E+ R L + + GL PR F + + N RG ++
Sbjct: 1353 IGFIVPIGEQANRTALWVTRRLQIDLPCEAGLTPRMFLSLNQRSPR--NSLRGKEMLVPA 1410
Query: 597 SLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEAL 634
L+ L L R K I LD + +I AL
Sbjct: 1411 PLLRGLFSLDLRSR----KAIARAAYTQLDRVANIVAL 1444
>gi|219109892|ref|XP_002176699.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217411234|gb|EEC51162.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 1678
Score = 122 bits (307), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 103/432 (23%), Positives = 184/432 (42%), Gaps = 86/432 (19%)
Query: 249 NFPLHEWEHVLCLKNVSM-------------EYEGTLSGLRGYIALGTNY--NYSEDVTC 293
+F L E+EH + L + + + G R ++A+GT + EDV
Sbjct: 1285 SFKLDEYEHGMTLSIMELTEFPEEPGSSNDTDVSGDELSKRMFVAVGTGVLDHNGEDVAS 1344
Query: 294 RGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQK---GPVTAICHVAG----FLVTAVGQ 346
RGR +L ++ + + + +++ + E++ G VT++ ++ L+ G
Sbjct: 1345 RGRAILLEL-KRTNSSAKAAGRQVVELSFCYEKEIFHGAVTSLVCLSSEGKNRLLIGAGA 1403
Query: 347 KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVAR 406
I + Q + LT + F + + + K+ +L+ D S+ L ++ ++L+L+A+
Sbjct: 1404 DINVEQWGNAKLTQVGFFRATMQVLHTIPFKSFLLLSDAYDSLYFLIWRESDKSLTLLAK 1463
Query: 407 DYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMG 466
DY P Y AG SRG +M
Sbjct: 1464 DYDPIPV----YAAGVMSRG------------------------------------PAMT 1483
Query: 467 FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSI------- 519
F+ D +N+ F Y P + GG+RL+ + D+HLG +F C+ S +
Sbjct: 1484 FLCHDDRQNLQFFQYAPGEAAARGGNRLVCRADYHLGTQTTSFASHFCRSSLMIHSATPT 1543
Query: 520 --------SDAPGARS----RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSH 567
D+ RS R ++ + DG +G +PL E Y RL LQ+++
Sbjct: 1544 STLAALKQQDSYFGRSEEDQRLGAYFGTADGGMGAVVPLSEPVYWRLTALQSIVANALES 1603
Query: 568 TGGLNPRAFRTYKGK----GYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHND 623
L PRA+R Y+ G + + +G+IDG LV ++ LS+ ++ +I IGS +
Sbjct: 1604 DCALAPRAWRLYRRSTRRGGCRSNDRKKGVIDGDLVLQYADLSISKQEDIASAIGSTVDL 1663
Query: 624 ILDELYDIEALS 635
ILD L +++ S
Sbjct: 1664 ILDNLLELQCGS 1675
>gi|443919095|gb|ELU39366.1| cleavage factor protein [Rhizoctonia solani AG-1 IA]
Length = 788
Score = 122 bits (306), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 153/644 (23%), Positives = 278/644 (43%), Gaps = 114/644 (17%)
Query: 5 RSHSPSAMDETIVQELLTVSLGLHGNRPLLLVRTQHELL-IYQAFR------HPKGALKL 57
++H+ ET ++ ++ +G+ +P L+V T+ L IY+ + + +
Sbjct: 232 QAHTVCTDGETDIEHVIIAPIGITRPKPHLVVITKSRTLAIYEPVPAPPPPDSSENSAPV 291
Query: 58 RFKKLKVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQ-----GVFLCGPHPAWLF 112
R +L V FV S+ LP + ++ ++ ++ G+F+ G HP WL
Sbjct: 292 R-DQLTVQFVKVFSR------ALPLDMHDTKRVAGRSLVPFKSPNLSGIFVTGDHPFWLL 344
Query: 113 LTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLP-THLSYDAPWPV 171
T LR +P Y N+ + LP +S++ P
Sbjct: 345 RTDASALRIYPHAAQ-------------------YVNSFGTTVVEWLPDVDISHEIPCRS 385
Query: 172 RKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQ 231
+AY + T+ + + S + YY ED + P D+ P +
Sbjct: 386 YASDDGRVYTSVAYDVSTR-HILAASALRTTFAYYD---EDSNELYTP-DATHPNPEIHC 440
Query: 232 FHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDV 291
+ L +P +W + F +E+ V ++++ +E T GL+ Y+ +GT + ED+
Sbjct: 441 SALELITPDTWTTVDGYEFAQNEF--VNAVESIPLETLSTERGLKDYVVVGTTISRGEDL 498
Query: 292 TCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIW 351
+G +F+++EVVPEPG + +++++ ++ KG VTA+C + G+LV+++GQKI++
Sbjct: 499 AVKGATYVFEVVEVVPEPGSKTRQYRLRLLCREDSKGAVTALCGMNGYLVSSMGQKIFVR 558
Query: 352 QLK-DNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKP 410
D LTGIAF+D V + S+ +KNL+LVGD +S+ + +Q E L + +D +
Sbjct: 559 AFDLDEKLTGIAFMDVGVCVTSLRPLKNLLLVGDMVKSVWFVAFQEEPFKLVPLGKDRQQ 618
Query: 411 TQPNSKGYYAGNPSR---GIIDGSLVWKFLQLSLGERLEICKK---IGSKHNDILDEFSS 464
++ G+ ++ ++D V+ G RL IC H +L
Sbjct: 619 LSVTHADFFFGSQAQLSFAVLDDFGVF-------GLRL-ICSSEFHTHVTHRGVLSVSRK 670
Query: 465 MGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPG 524
F D D + Q ES+ K FH QH N+ + C SS
Sbjct: 671 ADF---DSD----VMSIQSLGTESSLIFGETKPYPFH--QH-NSILTM-CGGSS------ 713
Query: 525 ARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGY 584
DG + PL E + RL +LQ ++
Sbjct: 714 ------------DGTIASLTPLNESEFGRLQLLQGQLIR--------------------- 740
Query: 585 YAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEL 628
N G++DG+L+ F +L + +++E+ ++IG++ IL++L
Sbjct: 741 ---NVHNGVLDGNLLAAFEELPVSKQVEMTQQIGAEREKILNDL 781
>gi|298715584|emb|CBJ28137.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 255
Score = 119 bits (299), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 79/289 (27%), Positives = 134/289 (46%), Gaps = 47/289 (16%)
Query: 351 WQLKDNDLTGIAFIDTEVYIASMVSVKN-LILVGDYARSIALLRYQPEYRTLSLVARDYK 409
W K L I F D VY+ S+ +K+ ILVGD S+ L+ ++ E +L+ +++D++
Sbjct: 13 WDPKTCTLELIGFHDPRVYVMSLSVIKHKFILVGDAYGSVQLVVWREEDHSLTALSKDHE 72
Query: 410 PTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMI 469
Q S Y P M ++
Sbjct: 73 DCQVFSAEYLIDEPG----------------------------------------MAIVV 92
Query: 470 SDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRF 529
+D +NV + Y P A S GG +L+ ++DF+LG V + R + ++ D +R+
Sbjct: 93 ADGRRNVKVLQYAPNATNSRGGTKLLCQSDFYLGSRVGKLTRRRTR-GNLRDG----ARY 147
Query: 530 LTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNP 589
+LDG LG LP+ E+ +RRL LQ +M H G NPRA+R + +
Sbjct: 148 CLLAGTLDGGLGAVLPVDERVFRRLYALQGIMSNALGHNGAANPRAYRLFDHGPTFRYET 207
Query: 590 SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
+ ++DGSL+W+F+ L + ++ + IG+ + ++ L DI+ L+S F
Sbjct: 208 KQNMLDGSLLWRFVGLDAKTQHDLTRAIGTTVDRVMANLLDID-LASLF 255
>gi|444313909|ref|XP_004177612.1| hypothetical protein TBLA_0A02930 [Tetrapisispora blattae CBS 6284]
gi|387510651|emb|CCH58093.1| hypothetical protein TBLA_0A02930 [Tetrapisispora blattae CBS 6284]
Length = 1459
Score = 119 bits (297), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 121/558 (21%), Positives = 228/558 (40%), Gaps = 71/558 (12%)
Query: 89 MRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYF 148
M Y GY +F+ G P + R+ VS + N +
Sbjct: 950 MHYIPEYNGYSVIFVTGKSPYIIIKEDDSSPRSFKFANIPLVSMIRWGKN-----SVMCV 1004
Query: 149 NAKSELRISVLPT-HLSYDAPWPVRKVPLK------CTPHFLAYHLETKTYCIVTSTAEP 201
+ R+ L ++ Y P++++ + T +AYH +K Y + +
Sbjct: 1005 DPLKNARVYTLDCKNIYYGNKLPIKRIDISDEMDNYMTFTKIAYHESSKLYVV---SYCK 1061
Query: 202 STDYYKFNGEDKELVTDPRDSRFIPPLVS-QFHVSLFSPFSWEEIPQTNFPLHEWEHVLC 260
DY + E + LV D +P S + + L +P +W I Q F E +
Sbjct: 1062 DIDYNALDEEAERLVGYNSD---VPHAKSYKSGILLINPKTWNVIDQREF--GENSLIND 1116
Query: 261 LKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKM 320
++++ ++ R YI G SED+ G ++DI V+PE G+P T K K
Sbjct: 1117 IRSMVIQLNSRTRAKREYIVAGLANIGSEDLPPTGSFYIYDISPVLPETGKPDTNYKFKE 1176
Query: 321 IYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLK-DNDLTGIAFIDTEVYIASMVSVKNL 379
I+ ++ +G VT++C ++G QKI + ++ DN + +AF+D VY+ S N
Sbjct: 1177 IFTEDVRGLVTSVCEISGRFTINQSQKIMVRDVQEDNSVVPVAFLDIPVYVTDTKSFGNF 1236
Query: 380 ILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQL 439
+L+ D + + + + E + L+ + + ++ + A N
Sbjct: 1237 LLISDSMQGLQFVGFDAEPFRMILLGKSIPDLKISTVEFIANN----------------- 1279
Query: 440 SLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTD 499
++ F +D D + +F Y P+ S G +L+ +
Sbjct: 1280 -----------------------GNIYFAATDYDNILHIFKYAPDEPNSLSGQKLVHCSS 1316
Query: 500 FHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASL----DGALGFFLPLPEKNYRRLL 555
F+L + + P + + + F+ + +L DG++ +PL E YRRL
Sbjct: 1317 FNLHSSTSCMIML---PGNDEFSENEQDNFIPSFQTLGGQVDGSIFKVIPLEESPYRRLY 1373
Query: 556 MLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICK 615
++Q + + GGLNP+ R + Y N + ++D +++ +F L + +R +
Sbjct: 1374 VIQQQITDYEVQVGGLNPKMER-LSNEYYQKSNMLKPMLDFNIIRRFSMLPIDKRRRTAQ 1432
Query: 616 KIGSK-HNDILDELYDIE 632
K G + H +I +L +IE
Sbjct: 1433 KAGRRAHFEIWRDLINIE 1450
>gi|388856288|emb|CCF50097.1| related to cleavage and polyadenylation specificity factor, 160 kDa
subunit [Ustilago hordei]
Length = 1568
Score = 117 bits (294), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 100/344 (29%), Positives = 162/344 (47%), Gaps = 46/344 (13%)
Query: 273 SGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKN-KIKMIYAKEQKGPVT 331
+G + +IA+GT + ED TC+G + LF+II+VV + ++ ++K+I PVT
Sbjct: 1150 TGRKQFIAVGTTTYHGEDRTCKGSVYLFEIIQVVSSRRFQVGRDLRLKLICRDGSNAPVT 1209
Query: 332 AICHVAGFLVTAVGQKIYIWQLKDND-LTGIAFIDTEVYIASMVSVKNLILVGDYARSIA 390
A+ + GFL++ GQK+Y+ L+ + L +AF+D YI S+ VKN +L+ D + +
Sbjct: 1210 ALAELHGFLLSTSGQKLYVRALEKEEWLISVAFLDCPFYITSIRVVKNFVLLSDAKKGLW 1269
Query: 391 LLRYQPE-YRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICK 449
L +Q + YR + L + +DG +LGE L
Sbjct: 1270 FLAFQEDPYRFVDLGS---------------------ALDGHCA------NLGEFLVYND 1302
Query: 450 KIG--SKHNDILDEFSSMGFMISDKDKNVV-LFMYQPEARESNGGHRLIKKTDFHLGQHV 506
K+ S L FS G +D V+ L+ Y P + S GG RL+ +T++
Sbjct: 1303 KLSLVSTSGVALGGFSGFG-----QDSGVIRLYEYNPSSPTSLGGQRLLLRTEYSTPSST 1357
Query: 507 NTFFKIRCKPSSISDAPGA---RSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVT 563
+ S S+ G R++ L + +G+L + EK +RL +LQ +V
Sbjct: 1358 TCSLSAPGRWLSDSELRGREQLRNKLL--LSKSNGSLDSLASVEEKVAKRLHLLQGQLVR 1415
Query: 564 HTSHTGGLNPRAFRTYKGKGYYAGNP-SRGIIDGSLVWKFLQLS 606
HT LNPRAFR + + P +G++D L+ F LS
Sbjct: 1416 SVLHTAALNPRAFRQVRND--FVSRPLYKGVLDARLLDAFKGLS 1457
>gi|402591342|gb|EJW85272.1| hypothetical protein WUBG_03818, partial [Wuchereria bancrofti]
Length = 1025
Score = 113 bits (282), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 71/233 (30%), Positives = 115/233 (49%), Gaps = 14/233 (6%)
Query: 14 ETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP---KGALKLRFKKLKVLFVSDR 70
E ++ ELL V +G++ RPLL + + Y+ F + +G L +RFK+L V+ R
Sbjct: 794 EEVIMELLLVGMGMNQGRPLLFLLIDDTVSAYEMFTYNNGIQGHLAIRFKRLPYTTVT-R 852
Query: 71 SKRANEQPG------LPRGVRISQMRYFSNIAG--YQGVFLCGPHPAWLFLTSRGELRAH 122
S R G + VR + +F G GVF+C +P FL S G R H
Sbjct: 853 SCRFQGTDGRAAVESVRDAVRHKTVLHFFERIGNVLNGVFICSSYPCIFFLES-GVPRLH 911
Query: 123 PMTIDGPVSTLAPFHNVNCPRGFLYFNAKSEL-RISVLPTHLSYDAPWPVRKVPLKCTPH 181
P+ +DGP+ + F+N CP GF+Y + L R++ LP+ + DA +PV+++ + T H
Sbjct: 912 PVNLDGPILSFTTFNNAACPNGFIYLTERDRLMRVAKLPSDMILDASYPVKRINVGATVH 971
Query: 182 FLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHV 234
+ Y L + TY ++TS T +DK + F+ P + Q+ +
Sbjct: 972 SVVYLLHSNTYAVLTSEKRKVTKMCVLINDDKTFEEHEKPDTFVYPEMDQYKL 1024
>gi|71413583|ref|XP_808925.1| cleavage and polyadenylation specificity factor [Trypanosoma cruzi
strain CL Brener]
gi|70873226|gb|EAN87074.1| cleavage and polyadenylation specificity factor, putative
[Trypanosoma cruzi]
Length = 444
Score = 110 bits (276), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 110/491 (22%), Positives = 194/491 (39%), Gaps = 84/491 (17%)
Query: 172 RKVPLKCTPHFLAYHLETKTYCIVTSTAEP------------STDYYKFNGEDKELVTDP 219
R++ L TPHF+ YH ++ +VTS EP + Y + +G + + T+
Sbjct: 2 RRIHLGVTPHFVVYHPPARSCFVVTSKKEPFRPQRAPFDFQLNIVYDEESGGVQSITTEA 61
Query: 220 RDSRFIP---------PLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEG 270
P P+ +F + L S W L E E VL + + + E
Sbjct: 62 PVCNMPPIAPNAGIRVPMADRFEIRLMSTTDWA--CTDTLLLEENERVLGAQMMEIHCEK 119
Query: 271 TLSGLRG--YIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKG 328
GL + T + ED+TCRGRILL + K KI + +++ G
Sbjct: 120 DAEGLHTAPVCVVSTAFPLGEDITCRGRILLLATMCTK-------KKRKILLFHSEPLNG 172
Query: 329 PVTAICHVAGFLVTAVGQKIYIWQL--KDNDLTGIAFIDTEVYIASMVSVKNLILVGDYA 386
P TA+ + + AVG I +++ + L A + Y+ M S +N ++ GD +
Sbjct: 173 PATAVVGIRHHIAVAVGGTIKLFRFDWEKRKLVVGALLYAGTYVTRMSSFRNYLIYGDLS 232
Query: 387 RSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLE 446
RS A+ R+ E TLS++ +D
Sbjct: 233 RSCAIARFNEENHTLSVLGKDR-------------------------------------- 254
Query: 447 ICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGG--HRLIKKTDFHLGQ 504
+ H D++ + G + SD ++N+++ Y P +E+ G +++++ G+
Sbjct: 255 --NAVSVVHCDMMYHDRAFGLLCSDDERNLLVMGYTPRVQETEAGSPNKVLESVLSLDGE 312
Query: 505 HVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 564
+ C S+ A + +T Y + G +GF +P+ E+ R L +
Sbjct: 313 Y---RLSGGCLVKSLRFRSLAGNSSVTLYVTNYGEIGFIVPIGEQANRTASWLMRRLQID 369
Query: 565 TSHTGGLNPRAFRTY-KGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHND 623
H+ GL PR F +G A ++ SL+ +F L + R K I S
Sbjct: 370 LPHSAGLTPRMFLGLSQGSPRTAMRAKEMLVSASLLNEFFFLDIHSR----KTIASAAYT 425
Query: 624 ILDELYDIEAL 634
L+ + ++ +L
Sbjct: 426 QLERVTNVASL 436
>gi|159470707|ref|XP_001693498.1| predicted protein [Chlamydomonas reinhardtii]
gi|158283001|gb|EDP08752.1| predicted protein [Chlamydomonas reinhardtii]
Length = 366
Score = 108 bits (270), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 80/259 (30%), Positives = 111/259 (42%), Gaps = 53/259 (20%)
Query: 81 PRGVRISQMRYFSNI-----AGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAP 135
PR +R + Y + A + GVF+ G P WL + RG L AH M +GPV+ L P
Sbjct: 144 PRLIRFDHIAYTDPLTRARGANHSGVFVAGARPLWL-VAGRGGLAAHAMWSEGPVAALTP 202
Query: 136 FHNVNCPRGFL-YFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCI 194
FHNVNCP GF+ +A+ +L++ LP H D W R+VPLK TPH LA+ E
Sbjct: 203 FHNVNCPLGFITACSARGQLKVCCLPPHTRLDGAWATRRVPLKVTPHRLAWFREAGIVAA 262
Query: 195 VTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHE 254
+TS PS PR + E P
Sbjct: 263 ITSRPAPS---------------RPRPA---------------------EEPGG------ 280
Query: 255 WEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLT 314
E LCLK V + T +A+GT ED C GR+LL+ + V + G+
Sbjct: 281 -EQALCLKFVYLR-NATTGDTDTLLAVGTGTPLGEDYPCLGRLLLYSVAAEVVDQGRGNM 338
Query: 315 KNK--IKMIYAKEQKGPVT 331
+ ++ A++ VT
Sbjct: 339 SRRWSATLVAARDTASAVT 357
>gi|224000243|ref|XP_002289794.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220975002|gb|EED93331.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 1820
Score = 107 bits (267), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 92/389 (23%), Positives = 162/389 (41%), Gaps = 70/389 (17%)
Query: 278 YIALGTNY--NYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKE-QKGPVTAIC 334
++A+GT ED+ +GRILLF++ + + + ++ + + K+ GPVT++
Sbjct: 1470 FVAVGTGRIERDGEDIASKGRILLFNLKKKKHQKDKRSMTLELHLKHEKDITIGPVTSLS 1529
Query: 335 HVAG----FLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIA 390
+ + G ++ + Q L + F + + ++ K L+ D ++
Sbjct: 1530 SLRSEDIFRVAVGAGAEVTVEQWGSGKLVQVGFYHAHMQVQNISLFKTFFLLSDAYDALH 1589
Query: 391 LLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKK 450
L ++ ++L+L+A+DY+PTQ + AG SRG
Sbjct: 1590 FLVWRESDKSLTLLAKDYEPTQV----FAAGMISRG------------------------ 1621
Query: 451 IGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHV---- 506
+M F+ D +N+ Y P + GG++L+ + DFHLG
Sbjct: 1622 ------------GAMSFVCHDDRQNIQFLQYAPTDVAARGGNKLVCRADFHLGSQTTSLN 1669
Query: 507 -----NTFFKIRCKPSSI------SDAPGAR----SRFLTWYASLDGALGFFLPLPEKNY 551
++ C SS D+ R RF + + DG+ +PL E Y
Sbjct: 1670 SHWAQSSLLFNSCTVSSTLASLKQQDSLFGRLDDDQRFAVNFGTTDGSFVSIIPLSEPTY 1729
Query: 552 RRLLMLQNVMVTHTSHTGGLNPRAFRTYKGK----GYYAGNPSRGIIDGSLVWKFLQLSL 607
RL LQ+VM L+ RA+R Y+ G + +G+ID LV KF+ L L
Sbjct: 1730 WRLTALQSVMSNALESNAALSHRAWRLYRRSTRRGGCRTNDRKKGVIDADLVMKFVDLPL 1789
Query: 608 GERLEICKKIGSKHNDILDELYDIEALSS 636
E+ ++ IGS ++D L ++ S
Sbjct: 1790 PEQEDLTSSIGSTVGLVMDNLLELSCAGS 1818
>gi|414587799|tpg|DAA38370.1| TPA: hypothetical protein ZEAMMB73_163106 [Zea mays]
Length = 461
Score = 107 bits (266), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 47/113 (41%), Positives = 68/113 (60%), Gaps = 1/113 (0%)
Query: 88 QMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLY 147
++ F+N+ GY+G+FL GP P W+F+ R R HP DGP+ HNVNC RG +Y
Sbjct: 257 RITIFNNVGGYEGLFLGGPRPTWVFVC-RQRFRVHPQLCDGPIVAFTVLHNVNCCRGLIY 315
Query: 148 FNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAE 200
++ L+I LP+ +YD WPV+KVPL TPH + Y+ E Y ++ S +
Sbjct: 316 VTSQGFLKICQLPSAYNYDNYWPVQKVPLHGTPHQVTYYGEQSLYPLIVSVPQ 368
>gi|452825139|gb|EME32137.1| cleavage and polyadenylation specificity factor subunit-like protein
[Galdieria sulphuraria]
Length = 1454
Score = 106 bits (265), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 126/565 (22%), Positives = 236/565 (41%), Gaps = 118/565 (20%)
Query: 86 ISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTID-----GPVSTLAPFHNVN 140
I +R F N++ + GVFL G P+ + L S+G + H + ID G + ++ +
Sbjct: 876 IPHLRPFYNLSSHFGVFLTGSVPSIIVL-SKGYPQKHEIMIDSGVEYGDILSITNMGDPE 934
Query: 141 CPRGFLYFNAKSELRI-SVLPTHL-SYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTST 198
R ++ + + T L S + WPV + + YH T T+ +V S+
Sbjct: 935 NNRKLWILDSNGRIHFGEIRETQLESINWAWPVEVFRMNGCVKNVVYHATTGTFGVVVSS 994
Query: 199 A------EPSTDYYKFNGEDKELV-----------------TDPRDSRFIPPLVSQFHVS 235
E ++ D+ + +P+++ +P V + +
Sbjct: 995 IVSMSRLERKRQIFERQKRDERAILGSQAPPEEENNTEFEENEPKNA--LPIEVEAYELQ 1052
Query: 236 LFSPFSWEEIPQTNFPLHEWEHVLCL---------------------KNVSMEYEGTLSG 274
++ +WE + + F E E VL + + E +S
Sbjct: 1053 IYRADTWELVDK--FAFKEEEAVLSATFMQVDAYKITEEENNDDKSSRATQQQAEAAISQ 1110
Query: 275 L--------RGYIALGTNYNYSEDVTCRGRILLFDII--EVVPEPGQPLTKNKIKMIYAK 324
+ I +GT + ED RGR++LF++ E E + ++ +I K
Sbjct: 1111 SSRSIKFKPKECIVIGTGFIKGEDAGTRGRLMLFEVARQEAYTEESGAFSAIQLMLIAEK 1170
Query: 325 EQKGPVTAICHVAGFLVTAVGQKIYIWQL-KDNDLTGIAFIDT-EVYIASMVSVKNLILV 382
E K V++I + G++ AVG K+ I++L +++L +F +++ S+ +VK + V
Sbjct: 1171 ELKSVVSSIARLEGYICCAVGPKVEIYKLVNESELVCCSFYSGFQLFSTSINTVKQYVFV 1230
Query: 383 GDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLG 442
GD + L ++ ++L+ + +D+ P Q +L +FL
Sbjct: 1231 GDMYKGGYFLFWRDRNKSLNFLGKDFDPVQ------------------TLSTEFL----- 1267
Query: 443 ERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQ-PEARESNGGHRLIKKTDFH 501
IL+EF + F++SD N+ L Y P ES GG +L+++ H
Sbjct: 1268 ---------------ILNEF--ILFVVSDNFGNLHLLEYAGPHEIESRGGEKLLRRGVLH 1310
Query: 502 LGQHVNTFFKIRC--KPSSISDAPGARSRFL-TWYASLDGALGFFLPLPEKNY--RRLLM 556
LG ++ ++R K ++ D G+ L TW DG L LPL ++ Y + L+
Sbjct: 1311 LGTRSSSMIRLRTDWKENNSEDRAGSHIVVLGTW----DGGLACLLPLQQEEYEQKNELL 1366
Query: 557 LQNVMVTHTSHTGGLNPRAFRTYKG 581
+ + +++ + GLNP+ FR +G
Sbjct: 1367 KKVYLHSYSLYVAGLNPQEFRIPRG 1391
>gi|430810872|emb|CCJ31592.1| unnamed protein product [Pneumocystis jirovecii]
gi|430814599|emb|CCJ28188.1| unnamed protein product [Pneumocystis jirovecii]
Length = 203
Score = 105 bits (263), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 60/184 (32%), Positives = 99/184 (53%), Gaps = 11/184 (5%)
Query: 457 DILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI--RC 514
D L + + F+I D D N+ +F Y PE +S G +L+K+ DFH+G H+ + +
Sbjct: 17 DFLVDDEHLYFVIGDDDGNIHVFNYDPENPQSFSGQKLLKRGDFHVGSHIKSILMLPKEA 76
Query: 515 KPSSISDAPGARSR----FLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGG 570
P +++D R+ L AS DG++G + LPEK YRRL +Q ++ G
Sbjct: 77 FPQNVNDKEETRASKNQDSLCLCASQDGSMGVLISLPEKTYRRLYFIQGQLINTEDKVAG 136
Query: 571 LNPRAFR--TYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEL 628
LNP ++R TY K NP+RGI+DG L++++ L ++ ++ +K G I+ +L
Sbjct: 137 LNPISYRTSTYVSK---TSNPARGILDGKLLYQYNNLERNKQKDMARKSGMPVETIIYDL 193
Query: 629 YDIE 632
I+
Sbjct: 194 LKID 197
>gi|398020786|ref|XP_003863556.1| cleavage and polyadenylation specificity factor-like protein
[Leishmania donovani]
gi|322501789|emb|CBZ36871.1| cleavage and polyadenylation specificity factor-like protein
[Leishmania donovani]
Length = 1542
Score = 105 bits (262), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 131/596 (21%), Positives = 226/596 (37%), Gaps = 106/596 (17%)
Query: 33 LLLVRTQHELLIYQAF----RHPKGALKLRFKKLKVL-------FVSDRSKRANEQPGLP 81
L+++ + EL+ Y+ P+ +K+ + L V + R KR E+
Sbjct: 938 LVMILSSGELVTYRVVPADANGPRRCVKVIYHILDVAPEVDVVESIEARKKRLQEERAHL 997
Query: 82 RGVRISQMRYFSN----IAGYQ----GVFLCGPHPAWL-FLTSRGELRAHPMTIDGPVST 132
V QMR+ S G Q G+++CG P +L + + +L V
Sbjct: 998 ASV-TQQMRHCSERLVPFRGLQDRHKGIYVCGQTPVFLVYHAATNQLVCTRHHATNAVRG 1056
Query: 133 LAPFHN-------VNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAY 185
APFH+ V C GF++F L + W + +V L CTPH + Y
Sbjct: 1057 FAPFHSRHVHGGFVYCGEGFVHFATMQPF------GELLGSSGWWLERVRLGCTPHQVIY 1110
Query: 186 HLETKTYCIVTSTAEP-STDYYKFNGEDKELVTDPRDSRF--------IPPLVS------ 230
+V S +P S F+ + + +V D +R +PPL +
Sbjct: 1111 SPAAHGCFVVASRPQPFSPKRAPFDVQLR-MVEDEEGNRVPHVIEPVSLPPLSATSGSPV 1169
Query: 231 ----QFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEY---EGTLSGLRGYIALGT 283
++ V FS W+ + + ++E L V+ + S AL T
Sbjct: 1170 PTNERYEVQFFSTLDWQCMGRLVLDVNEKVLSATLMQVTRDTTMDAANRSTTAPVCALAT 1229
Query: 284 NYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA-GFLVT 342
Y EDVT RGRILL +++ ++ + KGPVTAI V +
Sbjct: 1230 AYPLGEDVTTRGRILLLTT-----SQQGGQGMQQLRTLHEEPMKGPVTAITRVGEDCVAV 1284
Query: 343 AVGQKIYIWQLKDNDLT--GIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRT 400
AVG + +++ N T +A + Y+ + + +N +++GD S+ RY E T
Sbjct: 1285 AVGGTVRVYRYDTNKSTMETMAILYAGAYVTCLQAFRNYLVIGDLFNSVLFARYSEEIHT 1344
Query: 401 LSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILD 460
++++ RD I ND+L
Sbjct: 1345 ITILGRD----------------------------------------TNAISVVSNDMLY 1364
Query: 461 EFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSIS 520
+ G +++D +N+V Y+P E G I ++ + + K +
Sbjct: 1365 HDTRFGLLVTDDARNLVCMSYKPRVLEEPGKPPKILESLLTVTGEYRLAGGVLLKMMRLR 1424
Query: 521 DAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAF 576
A R+ + Y + G +G+ +PL ++ R + + + +H GGL PR F
Sbjct: 1425 -AASTRNSSVAIYVTNMGEIGYLVPLGDQTSRTGQWVGRRLQSEVAHAGGLPPRMF 1479
>gi|146096490|ref|XP_001467824.1| cleavage and polyadenylation specificity factor-like protein
[Leishmania infantum JPCM5]
gi|134072190|emb|CAM70891.1| cleavage and polyadenylation specificity factor-like protein
[Leishmania infantum JPCM5]
Length = 1542
Score = 105 bits (262), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 131/596 (21%), Positives = 226/596 (37%), Gaps = 106/596 (17%)
Query: 33 LLLVRTQHELLIYQAF----RHPKGALKLRFKKLKVL-------FVSDRSKRANEQPGLP 81
L+++ + EL+ Y+ P+ +K+ + L V + R KR E+
Sbjct: 938 LVMILSSGELVTYRVVPADANGPRRCVKVIYHILDVAPEVDVVESIEARKKRLQEERAHL 997
Query: 82 RGVRISQMRYFSN----IAGYQ----GVFLCGPHPAWL-FLTSRGELRAHPMTIDGPVST 132
V QMR+ S G Q G+++CG P +L + + +L V
Sbjct: 998 ASV-TQQMRHCSERLVPFRGLQDRHKGIYVCGQTPVFLVYHAATNQLVCTRHHATNAVRG 1056
Query: 133 LAPFHN-------VNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAY 185
APFH+ V C GF++F L + W + +V L CTPH + Y
Sbjct: 1057 FAPFHSRHVHGGFVYCGEGFVHFATMQPF------GELLGSSGWWLERVRLGCTPHQVIY 1110
Query: 186 HLETKTYCIVTSTAEP-STDYYKFNGEDKELVTDPRDSRF--------IPPLVS------ 230
+V S +P S F+ + + +V D +R +PPL +
Sbjct: 1111 SPAAHGCFVVASRPQPFSPKRAPFDVQLR-MVEDEEGNRVPHVIEPVSLPPLSATSGSPV 1169
Query: 231 ----QFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEY---EGTLSGLRGYIALGT 283
++ V FS W+ + + ++E L V+ + S AL T
Sbjct: 1170 PTNERYEVQFFSTLDWQCMGRLVLDVNEKVLSATLMQVTRDTTMDAANRSTTAPVCALAT 1229
Query: 284 NYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA-GFLVT 342
Y EDVT RGRILL +++ ++ + KGPVTAI V +
Sbjct: 1230 AYPLGEDVTTRGRILLLTT-----SQQGGQGMQQLRTLHEEPMKGPVTAITRVGEDCVAV 1284
Query: 343 AVGQKIYIWQLKDNDLT--GIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRT 400
AVG + +++ N T +A + Y+ + + +N +++GD S+ RY E T
Sbjct: 1285 AVGGTVRVYRYDTNKSTMETMAILYAGAYVTCLQAFRNYLVIGDLFNSVLFARYSEEIHT 1344
Query: 401 LSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILD 460
++++ RD I ND+L
Sbjct: 1345 ITILGRD----------------------------------------TNAISVVSNDMLY 1364
Query: 461 EFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSIS 520
+ G +++D +N+V Y+P E G I ++ + + K +
Sbjct: 1365 HDTRFGLLVTDDARNLVCMSYKPRVLEEPGKPPKILESLLTVTGEYRLAGGVLLKMMRLR 1424
Query: 521 DAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAF 576
A R+ + Y + G +G+ +PL ++ R + + + +H GGL PR F
Sbjct: 1425 -AASTRNSSVAIYVTNMGEIGYLVPLGDQTSRTGQWVGRRLQSEVAHAGGLPPRMF 1479
>gi|393907593|gb|EJD74705.1| CPSF A subunit region family protein [Loa loa]
Length = 990
Score = 104 bits (259), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 62/200 (31%), Positives = 104/200 (52%), Gaps = 14/200 (7%)
Query: 10 SAMDETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP---KGALKLRFKKLKVLF 66
+A E ++ ELL V +G++ RP+L + + +Y+ F + +G L +RFK+L
Sbjct: 789 AAKPEEVIMELLMVGMGMNQGRPMLFLLIDDTVSVYEMFTYNNGIQGHLAVRFKRLPYTV 848
Query: 67 VSDRSKRANEQPG------LPRGVRISQMRYFSNIAG--YQGVFLCGPHPAWLFLTSRGE 118
V+ RS R G + VR + +F G GVF+C +P FL + G
Sbjct: 849 VT-RSCRFQGLDGRAAVESVRDAVRHKTVLHFFERIGNVLNGVFICSSYPCIFFLET-GV 906
Query: 119 LRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSEL-RISVLPTHLSYDAPWPVRKVPLK 177
R HP+ +DGP+ + F+N CP GF+Y + L R++ LP + D +PV+++ +
Sbjct: 907 PRLHPVNLDGPILSFTTFNNAACPNGFIYLTERERLMRVAKLPNDMILDTSYPVKRIDVG 966
Query: 178 CTPHFLAYHLETKTYCIVTS 197
+ H + Y L + TY ++TS
Sbjct: 967 ASVHSVTYLLHSNTYAVLTS 986
>gi|389602597|ref|XP_001567507.2| cleavage and polyadenylation specificity factor-like protein
[Leishmania braziliensis MHOM/BR/75/M2904]
gi|322505515|emb|CAM42945.2| cleavage and polyadenylation specificity factor-like protein
[Leishmania braziliensis MHOM/BR/75/M2904]
Length = 1536
Score = 102 bits (255), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 142/660 (21%), Positives = 256/660 (38%), Gaps = 117/660 (17%)
Query: 33 LLLVRTQHELLIYQAF----RHPKGALKLRFKKLKVLFVSD-------RSKRANEQPGLP 81
L+++ + EL+ Y+ P+ +KL + L V D R KR E+
Sbjct: 932 LVMILSTGELVTYRVVPADASAPRRCVKLVYHILDVAPEVDVVESIEVRKKRLQEERAHL 991
Query: 82 RGVRISQMRY-------FSNIAG-YQGVFLCGPHPAWL---FLTSRGELRAHPMTIDGPV 130
V QMR F + G ++G+++CG P +L + T++ H T V
Sbjct: 992 ASV-TQQMRRCSERLVPFCALQGRHKGIYVCGQTPVFLVYHYATNQLVCTRHHAT--SAV 1048
Query: 131 STLAPFHN-------VNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFL 183
APFH+ V C GF++F L + W + +V L CTPH +
Sbjct: 1049 RGFAPFHSRHVHGGFVYCGEGFVHFATMQPF------GELLGSSGWWLERVRLGCTPHQV 1102
Query: 184 AYHLETKTYCIVTSTAEP-STDYYKFNGEDKELVTDPRDSRF--------IPPLVS---- 230
Y +V S +P S F+ + + +V D +R +PPL +
Sbjct: 1103 IYSPAAHGCFVVVSRPQPFSPKRAPFDVQLR-MVEDEEGNRVPHVVEPVSLPPLSATSGS 1161
Query: 231 ------QFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTL---SGLRGYIAL 281
++ V LFS W+ + ++E L VS + + S AL
Sbjct: 1162 PVPTNGRYEVQLFSTLDWQRVDCLALDVNEKVLSATLMQVSRDTTMDVAYRSATAPVCAL 1221
Query: 282 GTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV-AGFL 340
T Y EDVT RGR+LL + + GQ + K+++++ + KGPVTAI + +
Sbjct: 1222 ATAYPLGEDVTTRGRVLLLATSQ---QGGQGM--QKLRILHEEPMKGPVTAITRIDEDCI 1276
Query: 341 VTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEY 398
AVG ++Y + + A + Y+ + ++++ +++GD S+ RY E
Sbjct: 1277 AVAVGGTVRVYRYDASKGVMETTAILYAGAYVTCLQALRDYLVIGDLFHSVLFARYSEEI 1336
Query: 399 RTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 458
T++++ RD I +D+
Sbjct: 1337 HTITILGRD----------------------------------------TNAISVVSSDM 1356
Query: 459 LDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSS 518
L + G +++D +N++ Y+P E G + ++ + + K
Sbjct: 1357 LYHDTRFGLLVADDARNLMCMSYKPRLLEEPGKPPKVLESLLSVTGEYRLAGGVLLKMMR 1416
Query: 519 ISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRT 578
+ + S + ++ G +G+ +PL ++ R + + + +H GGL PR F
Sbjct: 1417 LRASAARSSSVAIYVTNM-GEIGYLVPLGDQTSRTGQWVVRRLQSEVAHAGGLPPRMFL- 1474
Query: 579 YKGKGYYAGNPSRGIIDGSLVWKF--LQLSLGERLEICKKIGSKHNDILDELYDIEALSS 636
G+ +P R + + F L+ + L K I S L+ + ++ A S
Sbjct: 1475 ----GFPQDDPLRSLKGDEWMLHFPLLEQLYRQDLRTRKLIASAAQTQLERVMNVGATVS 1530
>gi|387593561|gb|EIJ88585.1| hypothetical protein NEQG_01275 [Nematocida parisii ERTm3]
gi|387597215|gb|EIJ94835.1| hypothetical protein NEPG_00359 [Nematocida parisii ERTm1]
Length = 1261
Score = 97.1 bits (240), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 113/505 (22%), Positives = 200/505 (39%), Gaps = 68/505 (13%)
Query: 137 HNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVT 196
N N R F+ AK + LP + SYD RK + ++Y K T
Sbjct: 798 ENSNSTRSFIVM-AKGSIAKGNLPVY-SYDKSVLYRKTKVDSICEKISYSKAKKVIVAAT 855
Query: 197 STAEPST-DYYKFNGE-----DKELVTDPRDSRF-IPPLVSQFHVSLFSPFSWEEIPQTN 249
P T D F + D ELV P + + PL + + ++S + +T
Sbjct: 856 YKNNPYTEDMIPFTVQATTELDAELVPAPVIPKISVNPLTRAYSLKIYSHEEMKVCSRTE 915
Query: 250 --------FPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFD 301
+ L E++ K V++ + G ++ + T Y ED+ RGR+++ +
Sbjct: 916 GVLMAVDEYRLENNEYIAYHKIVTLPDKQNTEGFSEFVIVCTTYITDEDLMARGRLIVLE 975
Query: 302 IIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDND-LTG 360
I VVP+ + T++K+K + A++ KG T V G +V VG K+ I+ N+ L
Sbjct: 976 IASVVPQRDRIETRHKLKALAAEKTKGATTCCDIVKGNIVVCVGTKLMIYMFDRNEGLRA 1035
Query: 361 IAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYA 420
+AF D V++ S + ++N+I+ GD + LL YQ + L ++++ +S G Y
Sbjct: 1036 VAFHDIHVFLTSCMVMRNIIVCGDAYKGTFLLFYQSDPPLLHMLSQ-------SSGGVY- 1087
Query: 421 GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFM 480
+ K IG +D +++ + D K V ++
Sbjct: 1088 --------------------------LLKGIGMTLHD-----TALSLISYDSLKTVCIYT 1116
Query: 481 YQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGAL 540
Y P+ S G RLI + + L + F I + + T + G +
Sbjct: 1117 YSPQHILSQDGSRLISRGECKLPDDIAGSFLIE-----------KKGVYRTALYTKHGYV 1165
Query: 541 GFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVW 600
+ + Y LL LQ+ + + T G NPR+ + + I+ L+
Sbjct: 1166 YSHKTVVQTKYIALLDLQHAVESAHWMTLGTNPRSHWVTERSAEMKDITLKEILQTGLME 1225
Query: 601 KFLQLSLGERLEICKKIGSKHNDIL 625
+F + + I G D++
Sbjct: 1226 EFFNMCTVQSDRIVADTGRASADVV 1250
>gi|378755148|gb|EHY65175.1| hypothetical protein NERG_01621 [Nematocida sp. 1 ERTm2]
Length = 822
Score = 96.7 bits (239), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 117/513 (22%), Positives = 212/513 (41%), Gaps = 82/513 (15%)
Query: 18 QELLTVSLGLHGNRPLLLVRT-QHELLIYQAFRHPKGALKLRFKKLKVL---FVSDRSKR 73
Q LL + + + + LL RT +E+++YQ R K KV F ++++
Sbjct: 257 QSLLDIEVIEYRSAVYLLARTISNEIVLYQERDG-------RLYKEKVTNNAFYYEKAEV 309
Query: 74 ANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTL 133
P S+MR ++ VF+ G + + + + ++ H I + ++
Sbjct: 310 GQSSP--------SRMRVCGSL-----VFIPGTYKTRVLVFTPYQVIVHAANIR--IDSI 354
Query: 134 APFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYC 193
+ P AK + +LP + +YD P +K + +AY K
Sbjct: 355 EEITEDSNPIKSFIIMAKGSIARGMLPLY-AYDKPVLYKKTKVDSICQRVAYSAVKKVIV 413
Query: 194 IVT-STAEPSTDYYKFNGE-DKELVTDPRDSRFIP-----PLVSQFHVSLFSPFSWEEIP 246
VT E + D F + EL +P IP PL + + ++S ++
Sbjct: 414 AVTYKDKEYTKDMIPFTVQATTELDAEPLPPPVIPEIKVNPLTRAYSLKIYSHEEMKQYS 473
Query: 247 QT--------NFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRIL 298
QT +PL + E++ K V++ + G+ ++ + T Y ED+ RGR++
Sbjct: 474 QTGEVLMAVDEYPLEDNEYIAHHKIVTLPDKQNTEGVSEFVIVCTTYITDEDLMARGRLI 533
Query: 299 LFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDND- 357
+ +I VVP+ + T++K+K + A++ KG T V G +V VG K+ I+ N+
Sbjct: 534 VLEIASVVPQRDRIETRHKLKALAAEKTKGATTCCDIVKGNIVVCVGTKLMIYMFDRNEG 593
Query: 358 LTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKG 417
L +AF D V++ S + ++N+I+ GD + LL YQ E P+ +
Sbjct: 594 LRAVAFHDIHVFLTSCMVMRNIIVCGDAYKGTFLLFYQSE------------PSLLHLLS 641
Query: 418 YYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVV 477
+G G + K + ++L S + + D K V
Sbjct: 642 QSSG--------GVYLLKGIGMTL-------------------YGSVLSLLSYDSAKTVC 674
Query: 478 LFMYQPEARESNGGHRLIKKTDFHLGQHVNTFF 510
++ Y P+ S GG RLI + + L ++ F
Sbjct: 675 IYSYSPQHILSQGGTRLISRGECKLPDDISGSF 707
>gi|385304556|gb|EIF48568.1| rna-binding subunit of the mrna cleavage and polyadenylation factor
[Dekkera bruxellensis AWRI1499]
Length = 289
Score = 96.3 bits (238), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 75/311 (24%), Positives = 136/311 (43%), Gaps = 49/311 (15%)
Query: 272 LSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVT 331
++ + Y+ +G+ ED+ +G ++++II+VVP+P P KN++K+I ++ +G +
Sbjct: 1 MNDTKNYVIVGSGKYRVEDLATKGSWMVYEIIDVVPDPNHPEAKNRLKLIKSESSRGSIL 60
Query: 332 AICHVAGFLVTAVGQKIYIWQL-KDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIA 390
C+++G Q++ + + KD + +AF DT +Y + S ++++++GD ++
Sbjct: 61 GSCNISGRFSLVQAQRMLVRTIKKDGNAVPVAFXDTSLYTKDVKSFEDMMIIGDAFDGLS 120
Query: 391 LLRYQPE-YRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICK 449
L + E YR L L K TQ S L C
Sbjct: 121 LYGFDAEPYRMLKLG----KETQNLS-----------------------------LTAC- 146
Query: 450 KIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 509
D + + + +D+D + L Y P ES G +L+ ++ F +
Sbjct: 147 -------DFIVXEGGLYIIAADEDSVLHLLEYDPYDPESMKGXKLLTRSVFRFNGYTTAM 199
Query: 510 FKIRCKPS------SISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVT 563
K S +++ PGA F +++G+ P E YRRL LQN +
Sbjct: 200 RLCDRKNSIFSMLDTLAIPPGADLGFEVIGCNIEGSFYKVTPANEYTYRRLYALQNHISD 259
Query: 564 HTSHTGGLNPR 574
SH GLNP+
Sbjct: 260 KESHWLGLNPK 270
>gi|401426989|ref|XP_003877978.1| cleavage and polyadenylation specificity factor-like protein
[Leishmania mexicana MHOM/GT/2001/U1103]
gi|322494225|emb|CBZ29522.1| cleavage and polyadenylation specificity factor-like protein
[Leishmania mexicana MHOM/GT/2001/U1103]
Length = 1542
Score = 95.9 bits (237), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 129/598 (21%), Positives = 230/598 (38%), Gaps = 110/598 (18%)
Query: 33 LLLVRTQHELLIYQAF----RHPKGALKLRFKKLKVL-------FVSDRSKRANEQPGLP 81
L+++ + EL+ Y+ P+ +K+ + L V + R KR E+
Sbjct: 938 LVMILSSGELVTYRVVPADAHGPRRCVKVIYHILDVAPEVDVVESIEARKKRLQEERAHL 997
Query: 82 RGVRISQMRYFSN----IAGYQ----GVFLCGPHPAWL-FLTSRGELRAHPMTIDGPVST 132
V QMR+ S G Q G+++CG P +L + + +L V
Sbjct: 998 ATV-TQQMRHCSERLVPFRGLQDRHKGMYVCGQTPVFLVYHAATNQLVCTRHHATNAVRG 1056
Query: 133 LAPFHN-------VNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAY 185
APFH+ V C GF++F L + W + +V L CTPH + Y
Sbjct: 1057 FAPFHSRHVHGGFVYCGEGFVHFATMQPF------GELLGSSGWWLERVRLGCTPHQIIY 1110
Query: 186 HLETKTYCIVTSTAEP-STDYYKFNGEDKELVTDPRDSRF--------IPPLVS------ 230
+V S +P S F+ + + +V D +R +PPL +
Sbjct: 1111 SPAAHGCFVVASRPQPFSPKRAPFDVQLR-MVEDEEGNRVPHVIEAVSLPPLSAASGSPV 1169
Query: 231 ----QFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTL-----SGLRGYIAL 281
++ V FS W+ + + L E VL + + + T+ S AL
Sbjct: 1170 PTNERYEVQFFSTLDWQCMGR--LVLDANEKVLSATLMQVTRDTTMDAANRSTTAPVCAL 1227
Query: 282 GTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA-GFL 340
T Y EDVT RGRILL + + G + ++ ++ + KGPVTAI V +
Sbjct: 1228 ATAYPLGEDVTTRGRILLLTTTQ---QGGHGM--QHLRTLHEEPMKGPVTAITRVGEDCV 1282
Query: 341 VTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEY 398
AVG ++Y + + + +A + Y+ + + ++ +++GD S+ RY E
Sbjct: 1283 AAAVGGTVRVYRYDTYKSTMETMAILYAGAYVTCLQAFRDYLVIGDLFNSVLFARYSEEI 1342
Query: 399 RTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 458
T++++ RD I ND+
Sbjct: 1343 HTITILGRD----------------------------------------TNAISVVSNDM 1362
Query: 459 LDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSS 518
L + G +++D +N++ Y+P E G + ++ + + K
Sbjct: 1363 LYHDTRFGLLVTDDARNLMCMSYKPRVLEEPGKPPKVLESLLTVTGEYRLAGGVLLKMMR 1422
Query: 519 ISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAF 576
+ A S + ++ G +G+ +PL ++ R + + + +H GGL PR F
Sbjct: 1423 LRAASAHSSSVAIYVTNM-GEIGYLVPLGDQTSRTGQWVVRRLQSEVAHAGGLPPRMF 1479
>gi|168066745|ref|XP_001785293.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162663100|gb|EDQ49884.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1090
Score = 95.1 bits (235), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 116/498 (23%), Positives = 196/498 (39%), Gaps = 105/498 (21%)
Query: 130 VSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLET 189
V+ + PF++ + P L + EL I + +R VPL P +A+ ++
Sbjct: 670 VNHMCPFNSASFPDS-LAIGKEGELTIGTIDEI----QKLHIRTVPLGEHPRRIAHQEQS 724
Query: 190 KTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTN 249
+T+ I ++ P + NGED E +V L ++E I +
Sbjct: 725 RTFAICSAKYAPGS-----NGEDME----------------THYVRLIEDQTFEII--SG 761
Query: 250 FPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVT-CRGRILLFDIIEVVPE 308
FPL +E+ + S + + Y +GT Y E+ +GRIL+F + E
Sbjct: 762 FPLDPYENGCSIITCSFTDDSNV-----YYCVGTAYALPEESEPSKGRILVFSV-----E 811
Query: 309 PGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKI--YIWQLKDNDLTGIAFIDT 366
G KI+++ KE KG V + G L+ + QKI Y W L+D+ + + +
Sbjct: 812 DG------KIQLVAEKEVKGAVYNLNAFNGKLLAGINQKIALYKWTLRDDGTRELQYESS 865
Query: 367 E---VYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNP 423
+ + S + I+VGD +SI+LL Y+PE + ARDY
Sbjct: 866 HHGHILALYVQSRGDFIVVGDLMKSISLLIYKPEEGAIEERARDYNAN------------ 913
Query: 424 SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQP 483
W +ILD+ + +G ++ N+
Sbjct: 914 ----------WM------------------TAVEILDDDTYLG---AENSFNLFTVRKNN 942
Query: 484 EARESNGGHRLIKKTDFHLGQHVNTF----FKIRCKPSSISDAPGARSRFLTWYASLDGA 539
+A RL ++HLG+ VN F +R S S P + +++G
Sbjct: 943 DAATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSEASQIP------TVIFGTVNGV 996
Query: 540 LGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLV 599
+G LP+ + L LQ +V GGL+ +R++ + +R +DG L+
Sbjct: 997 IGVIASLPQDQFLFLQKLQQALVKVIKGVGGLSHEQWRSFSNERKTV--DARNFLDGDLI 1054
Query: 600 WKFLQLSLGERLEICKKI 617
FL LS + EI +
Sbjct: 1055 ESFLDLSRNKMEEIATSL 1072
>gi|157873900|ref|XP_001685450.1| cleavage and polyadenylation specificity factor-like protein
[Leishmania major strain Friedlin]
gi|68128522|emb|CAJ08654.1| cleavage and polyadenylation specificity factor-like protein
[Leishmania major strain Friedlin]
Length = 1541
Score = 95.1 bits (235), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 127/598 (21%), Positives = 229/598 (38%), Gaps = 110/598 (18%)
Query: 33 LLLVRTQHELLIYQAF----RHPKGALKLRFKKLKVL-------FVSDRSKRANEQPGLP 81
L+++ + EL+ Y+ P+ +K+ + L V + R KR E+
Sbjct: 937 LVMILSSGELVTYRVVPADANGPRRCVKVIYHILDVAPEVDVMESIKARKKRLQEERAHL 996
Query: 82 RGVRISQMRYFSN--------IAGYQGVFLCGPHPAWL-FLTSRGELRAHPMTIDGPVST 132
V QMR+ S Y+G+++CG P +L + + +L V
Sbjct: 997 ASV-TQQMRHCSERLVPFRGLQDRYKGIYVCGQTPVFLVYHAATNQLVCTRHHATNAVRG 1055
Query: 133 LAPFHN-------VNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAY 185
APFH+ V C GF++F L + W + +V L CTPH + Y
Sbjct: 1056 FAPFHSRHVHGGFVYCGEGFVHFATMQPF------GELLGCSGWWLERVRLGCTPHQVIY 1109
Query: 186 HLETKTYCIVTSTAEP-STDYYKFNGEDKELVTDPRDSRF--------IPPLVS------ 230
+V S +P S F+ + + +V D +R +PPL +
Sbjct: 1110 SPAAHGCFVVASRPQPFSPKRAPFDVQLR-MVEDEEGNRVPHVIEPVSLPPLSATSGSPV 1168
Query: 231 ----QFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTL-----SGLRGYIAL 281
++ V FS +W+ + + L E VL + + + T+ S AL
Sbjct: 1169 PTNERYEVQFFSTLNWQCMGR--LVLDGNEKVLSATLMQVTRDTTMDAANRSTTAPVCAL 1226
Query: 282 GTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA-GFL 340
T Y EDVT RGRILL + +++ ++ + +GPVTAI V +
Sbjct: 1227 ATAYPLGEDVTTRGRILLLTTSQQ-----SGQGMQQLRTLHEEPMEGPVTAITRVGEDCV 1281
Query: 341 VTAVGQKIYIWQLKDNDLT--GIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEY 398
AVG + +++ N T +A + Y+ + + + +++GD S+ RY E
Sbjct: 1282 AVAVGGTVRVYRYDANKSTMETMAILYAGAYVTCLQAFREYLVIGDLFNSVLFARYSEEI 1341
Query: 399 RTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 458
T++++ RD I ND+
Sbjct: 1342 HTITILGRD----------------------------------------TSAISVVSNDM 1361
Query: 459 LDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSS 518
L + G +++D +N++ Y+P E +G + ++ + + K
Sbjct: 1362 LYHDTRFGLLVTDDARNLMCMSYKPRVLEEHGKPPKVLESLLTVTGEYRLAGGVLLKMMR 1421
Query: 519 ISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAF 576
+ A S + ++ G +G+ +PL ++ R + + + +H GGL PR F
Sbjct: 1422 LRAASARSSSVAIYVTNM-GEIGYLVPLGDQTSRTGQWVVRRLQSEVAHAGGLPPRMF 1478
>gi|414587797|tpg|DAA38368.1| TPA: hypothetical protein ZEAMMB73_143443 [Zea mays]
Length = 153
Score = 93.2 bits (230), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 51/144 (35%), Positives = 78/144 (54%), Gaps = 1/144 (0%)
Query: 487 ESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPL 546
ES G +L+ + +FH+G HV+ F +++ P+ A +RF + +LDG +G P+
Sbjct: 3 ESWKGQKLLSRAEFHVGAHVSKFLRLQMLPTQ-GLASEKTNRFALVFGTLDGGIGCIAPV 61
Query: 547 PEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLS 606
E +RRL LQ +V H GLNPR+FR +K G IID L+ + +S
Sbjct: 62 DELTFRRLQSLQRKLVDAIPHVCGLNPRSFRHFKSNGKAHRPGPDNIIDFELLSHYEMMS 121
Query: 607 LGERLEICKKIGSKHNDILDELYD 630
L E+LEI ++IG+ + IL D
Sbjct: 122 LEEQLEIAQQIGTTRSQILSNFSD 145
>gi|428164905|gb|EKX33915.1| hypothetical protein GUITHDRAFT_158867 [Guillardia theta CCMP2712]
Length = 1092
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 116/529 (21%), Positives = 210/529 (39%), Gaps = 99/529 (18%)
Query: 97 GYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRI 156
G VF P ++ +R L ++ + V+ +APF++ P L ++ LRI
Sbjct: 640 GATHVFAASDRPTVIYSNNRKLLFSNVNLKE--VTQMAPFNSEGFPDS-LAIATETSLRI 696
Query: 157 SVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELV 216
V+ +R V L+ P + + +KT+C+ T + + D GE+ E
Sbjct: 697 GVIDDI----QKLHIRTVYLREQPRRICHQESSKTFCVATLSIRINRD-----GEEVE-- 745
Query: 217 TDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLR 276
QF + LF ++E + + L E+E+ ++ S + TL
Sbjct: 746 -------------EQF-IKLFDDQTFEILD--TYQLQEFENTCSVECASFSDDPTL---- 785
Query: 277 GYIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICH 335
Y +GT ++ + GR+L+F++I+ K+ + +KE KG I
Sbjct: 786 -YYIVGTATAVPQESEPKEGRLLVFEVID-----------RKLHLKASKEIKGAPYQIKP 833
Query: 336 VAGFLVTAVGQKIYIWQLKDNDLTGIAFIDT-----EVYIASMVSVKNLILVGDYARSIA 390
G L+ + KI +++L D+D + + + + + + + I+ GD RSI+
Sbjct: 834 FNGKLLAGINSKIELFRLSDSDTGHMELVSECCHRGHILVLYLQTRGDFIVAGDLMRSIS 893
Query: 391 LLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKK 450
LL Y+ + +ARD+ W
Sbjct: 894 LLTYKQVDGQIEEIARDFNAN----------------------WM--------------- 916
Query: 451 IGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFF 510
DILD+ + +G ++ N+ +A RL ++HLG VN F
Sbjct: 917 ---TAVDILDDDTFLG---AEGYFNLFTVRKNTDATSDEERARLEVVGEYHLGDMVNRFQ 970
Query: 511 KIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGG 570
+ S SD P + + +++G +G L ++ Y LL +Q+ + GG
Sbjct: 971 RGSLVLRS-SDTPTTDT---IIFGTVNGMIGVIAVLSKEEYEFLLKVQDALNFVIKGVGG 1026
Query: 571 LNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGS 619
L +R+++ + +G IDG L+ FL L + E+C IGS
Sbjct: 1027 LRHEDWRSFENERTQGARAPKGFIDGDLIESFLDLRREKMEEVCHAIGS 1075
>gi|301124072|ref|XP_002909688.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262107255|gb|EEY65307.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 176
Score = 87.8 bits (216), Expect = 2e-14, Method: Composition-based stats.
Identities = 53/173 (30%), Positives = 88/173 (50%), Gaps = 15/173 (8%)
Query: 481 YQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPS-SISDAPGAR-----SRFLTWYA 534
+ P+ ES GG RL++ +DFHLG V++ F+ R S S+ A R S ++
Sbjct: 3 FAPQDIESRGGQRLLRVSDFHLGVQVSSMFRKRVDASGSVVSATNGRNAAPLSNYVNVMG 62
Query: 535 SLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGY-YAGNPS--- 590
+ +G +G +P+ E+ +RRL LQNVMV LNPR FR K G P
Sbjct: 63 TSEGGVGALVPVGERVFRRLFTLQNVMVNTLPQNCALNPREFRMLKTNAQRRCGRPDAWS 122
Query: 591 -----RGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
+ +D ++++FLQL+ + E+ + IG+ ++ L +++ +S F
Sbjct: 123 KKKWKKSFLDAFVLFRFLQLNYVAQKELARCIGTTPEVVMHNLLEVQHATSTF 175
>gi|198432471|ref|XP_002129229.1| PREDICTED: similar to DNA damage-binding protein 1 (Damage-specific
DNA-binding protein 1) (UV-damaged DNA-binding factor)
(DDB p127 subunit) (DNA damage-binding protein a) (DDBa)
(UV-damaged DNA-binding protein 1) (UV-DDB 1) (Xeroderma
pigmentosum group E-co... isoform 2 [Ciona intestinalis]
Length = 1142
Score = 87.4 bits (215), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 127/585 (21%), Positives = 226/585 (38%), Gaps = 104/585 (17%)
Query: 66 FVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMT 125
++SDR K +P G + + + F++ G + VF C P ++ +S +L +
Sbjct: 629 YISDRKK-------VPLGTQPTSLSVFTS-GGSRTVFACSDRPTVVY-SSNKKLVFSNVN 679
Query: 126 IDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAY 185
+ VS + P + P N + +L + +R VPL +P +AY
Sbjct: 680 LK-EVSHMCPLDSDGYPDSLALANDNT-----LLIGTIDEIQKLHIRTVPLYESPRRIAY 733
Query: 186 HLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIP-----PLVSQFHVSLFSPF 240
E++ + +VT TD G DK +T P S P V V FS
Sbjct: 734 QEESQCFGLVT----LRTDSVDATG-DKMKITRPSASTQASVCTKSPPVDGRSVEGFSAT 788
Query: 241 ----SWEEIPQTNFPLHEW------EHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSED 290
S I Q F +H E L + + + S Y +GT + Y E+
Sbjct: 789 ADIGSLLIIDQHTFEVHHAYQLDTNEEPLSIMSCKLG-----SDPNSYFVVGTAFVYMEE 843
Query: 291 VTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIY 349
+ GRIL+F I+ NK+ ++ KE KG V +C G ++ A+ +
Sbjct: 844 TEPKHGRILVFHYID-----------NKLTLVAEKEVKGAVFCLCQFNGHVLAAINTSVS 892
Query: 350 IWQ-LKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDY 408
I+Q + +L + + + + +LVGD RS+++L Y+ L +A+DY
Sbjct: 893 IYQWTTEKELRAECSNQSNILALYLKCKGDFVLVGDLMRSMSILNYKHVEGNLDEIAKDY 952
Query: 409 KPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFM 468
P W +ILD+ + +G
Sbjct: 953 SPN----------------------WM------------------TAVEILDDDNFLG-- 970
Query: 469 ISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSR 528
++ NV + A +L + FH+G +NTF ++ + S+
Sbjct: 971 -AENFYNVFICQKDSGATTDEERSKLREAALFHVGDSINTFRHGSLVMQNVGET-AVSSK 1028
Query: 529 FLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGN 588
+ ++ G++G + E Y L +QN + G ++ ++R++ +
Sbjct: 1029 GHILFGTVHGSIGVITTVDEDLYAFLHSIQNRLAKVIKSVGNIDHESWRSFCTNEKTEAH 1088
Query: 589 PSRGIIDGSLVWKFLQLSLGERLEICKKI-----GSKHNDILDEL 628
RG +DG L+ FL L+ + E+ K + G+K +D+L
Sbjct: 1089 --RGFVDGDLIECFLDLNREKMAEVAKGLMVKEHGTKREATVDDL 1131
>gi|301093655|ref|XP_002997673.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262110063|gb|EEY68115.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 176
Score = 87.4 bits (215), Expect = 2e-14, Method: Composition-based stats.
Identities = 53/173 (30%), Positives = 87/173 (50%), Gaps = 15/173 (8%)
Query: 481 YQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPS-SISDAPGAR-----SRFLTWYA 534
+ P+ ES GG RL++ +DFHLG V++ F+ R S S+ A R S ++
Sbjct: 3 FAPQDIESRGGQRLLRVSDFHLGVQVSSMFRKRVDASGSVVSATNGRNAAPLSNYVNVMG 62
Query: 535 SLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGY-YAGNPS--- 590
+ +G +G +P+ E+ +RRL LQNVMV LNPR FR K G P
Sbjct: 63 TSEGGVGALVPVGERVFRRLFTLQNVMVNTLPQNCALNPREFRMLKTNAQRRCGRPDAWS 122
Query: 591 -----RGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
+ +D ++++FLQL + E+ + IG+ ++ L +++ +S F
Sbjct: 123 KKKWKKSFLDAFVLFRFLQLDYVAQKELARCIGTTPEVVMHNLLEVQHATSTF 175
>gi|198432469|ref|XP_002129207.1| PREDICTED: similar to DNA damage-binding protein 1 (Damage-specific
DNA-binding protein 1) (UV-damaged DNA-binding factor)
(DDB p127 subunit) (DNA damage-binding protein a) (DDBa)
(UV-damaged DNA-binding protein 1) (UV-DDB 1) (Xeroderma
pigmentosum group E-co... isoform 1 [Ciona intestinalis]
Length = 1150
Score = 86.3 bits (212), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 127/589 (21%), Positives = 226/589 (38%), Gaps = 108/589 (18%)
Query: 66 FVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMT 125
++SDR K +P G + + + F++ G + VF C P ++ +S +L +
Sbjct: 633 YISDRKK-------VPLGTQPTSLSVFTS-GGSRTVFACSDRPTVVY-SSNKKLVFSNVN 683
Query: 126 IDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAY 185
+ VS + P + P N + +L + +R VPL +P +AY
Sbjct: 684 LK-EVSHMCPLDSDGYPDSLALANDNT-----LLIGTIDEIQKLHIRTVPLYESPRRIAY 737
Query: 186 HLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIP-----PLVSQFHVSLFSPF 240
E++ + +VT TD G DK +T P S P V V FS
Sbjct: 738 QEESQCFGLVTL----RTDSVDATG-DKMKITRPSASTQASVCTKSPPVDGRSVEGFSAT 792
Query: 241 ----SWEEIPQTNFPLHEW------EHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSED 290
S I Q F +H E L + + + S Y +GT + Y E+
Sbjct: 793 ADIGSLLIIDQHTFEVHHAYQLDTNEEPLSIMSCKLG-----SDPNSYFVVGTAFVYMEE 847
Query: 291 VTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIY 349
+ GRIL+F I+ NK+ ++ KE KG V +C G ++ A+ +
Sbjct: 848 TEPKHGRILVFHYID-----------NKLTLVAEKEVKGAVFCLCQFNGHVLAAINTSVS 896
Query: 350 IWQ-LKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDY 408
I+Q + +L + + + + +LVGD RS+++L Y+ L +A+DY
Sbjct: 897 IYQWTTEKELRAECSNQSNILALYLKCKGDFVLVGDLMRSMSILNYKHVEGNLDEIAKDY 956
Query: 409 KPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFM 468
P W +ILD+ + +G
Sbjct: 957 SPN----------------------WM------------------TAVEILDDDNFLG-- 974
Query: 469 ISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSR 528
++ NV + A +L + FH+G +NTF ++ + S+
Sbjct: 975 -AENFYNVFICQKDSGATTDEERSKLREAALFHVGDSINTFRHGSLVMQNVGET-AVSSK 1032
Query: 529 FLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGN 588
+ ++ G++G + E Y L +QN + G ++ ++R++ +
Sbjct: 1033 GHILFGTVHGSIGVITTVDEDLYAFLHSIQNRLAKVIKSVGNIDHESWRSFCTNEKTEAH 1092
Query: 589 PSRGIIDGSLVWKFLQLSLGERLEICKKI---------GSKHNDILDEL 628
RG +DG L+ FL L+ + E+ K + G+K +D+L
Sbjct: 1093 --RGFVDGDLIECFLDLNREKMAEVAKGLMVKNFNDQHGTKREATVDDL 1139
>gi|255080490|ref|XP_002503825.1| predicted protein [Micromonas sp. RCC299]
gi|226519092|gb|ACO65083.1| predicted protein [Micromonas sp. RCC299]
Length = 1114
Score = 84.3 bits (207), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 106/484 (21%), Positives = 189/484 (39%), Gaps = 116/484 (23%)
Query: 171 VRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVS 230
+R VPL P +A+ ET+TY +T + + NG
Sbjct: 727 IRTVPLGEQPRRIAHQPETRTYAALT-------ENFDENG-------------------- 759
Query: 231 QFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSED 290
+ V LF ++E + + E + + +S + R Y +GT Y+ E+
Sbjct: 760 -YFVRLFDDVTFETLCKFRLEPDEQDSSV----ISCAFA---DDPRVYYVVGTGYSLPEE 811
Query: 291 VT-CRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIY 349
RGRIL+F E G K++++ KE KG V + G L+ + K+
Sbjct: 812 PEPTRGRILVFR-----AEDG------KLQLVAEKEVKGAVYNLNAFNGKLLAGINSKVE 860
Query: 350 IWQLKDNDLTGIAFIDTEVY------------IASMVSVKN-LILVGDYARSIALLRYQP 396
++ + D G Y +A V+V+ I+VGD +S++LL Y+P
Sbjct: 861 LF--RGGDPVGADGAGGSTYELAKECSHHGHIVALYVAVRGEFIVVGDLMKSVSLLAYKP 918
Query: 397 EYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHN 456
E + ARDY W
Sbjct: 919 EESVIEERARDYNAN----------------------WM------------------TAV 938
Query: 457 DILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFK----I 512
DILD+ + +G ++ + N+ Q +A RL ++H+G+ VN F + +
Sbjct: 939 DILDDDTYLG---AENNFNLFTLRRQSDAATDEERSRLEVVGEYHVGEFVNRFRRGSLVM 995
Query: 513 RCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 572
R +D P + ++ G +G LP + + L LQ + S GGL+
Sbjct: 996 RLPDQENADVP------TLLFGTVSGVIGVLATLPREQFEFLSALQAALNKTVSGVGGLS 1049
Query: 573 PRAFRTYKGK-GYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI 631
A+R+++ + + A + +RG +DG L+ FL L + E+ + +++ + D+
Sbjct: 1050 HDAWRSFQNEHRHRAKDGARGFVDGDLIESFLDLRPEKAREVAAAVKLSVDELTRRVEDL 1109
Query: 632 EALS 635
+ L+
Sbjct: 1110 QRLT 1113
>gi|412992547|emb|CCO18527.1| predicted protein [Bathycoccus prasinos]
Length = 1275
Score = 82.4 bits (202), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 107/494 (21%), Positives = 190/494 (38%), Gaps = 104/494 (21%)
Query: 171 VRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVS 230
+R VPL+ P +A+ ETKT ++T +P
Sbjct: 856 IRTVPLREQPRRIAHQPETKTLAVLTMKESD-----------------------VPGQEE 892
Query: 231 QFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYN--YS 288
+F V LF ++E + + +PL E+ + + S + + + Y +GT + +S
Sbjct: 893 EFFVRLFDNKTFETLAK--YPLEPNENDASIISCSFDGDDDI-----YFVVGTAFADPHS 945
Query: 289 EDVTCRGRILLFDIIEVVPEPG------------------QPLTKNKIKMIYAKEQKGPV 330
E + RGRIL+F + G + + + ++ KE +G V
Sbjct: 946 EPESSRGRILVFKVSNTSSSGGGNAVVNGNDHGDGRASASSSVLQKSLTLVCEKETRGAV 1005
Query: 331 TAICHVAGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEV--YIASMVSVK-NLILVGDY 385
+ G L+ + K++ W + + + + + IA V K NLI+VGD
Sbjct: 1006 YNLNAFCGKLLAGINSLVKLFNWGVSKENKRELVHECSHMGHIIALKVETKDNLIVVGDL 1065
Query: 386 ARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERL 445
+SI LL+YQ E + VA D+ W
Sbjct: 1066 MKSITLLQYQRESGRIEEVAHDFSSN----------------------WMTAV------- 1096
Query: 446 EICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQH 505
+ILD+ + +G ++ N+ +A + L FHLG
Sbjct: 1097 -----------EILDDNTYLG---AESSYNLFTVQRNADADTEDKRGTLELCGAFHLGDS 1142
Query: 506 VNTFFK--IRCKPSSISDAPGARSRFLTW-YASLDGALGFFLPLPEKNYRRLLMLQNVMV 562
VN F + + + +SD + S TW + ++ G LG LP++++ L +Q M
Sbjct: 1143 VNRFRRGSLVMRMPDLSDDTSSLSEISTWLFGTISGGLGVVATLPKRDFMLLNKVQEAMQ 1202
Query: 563 THTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG-SKH 621
+ G + FR++ R IDG LV FL LS +++ + + G S
Sbjct: 1203 KVVTGVGNFSHSDFRSFHNVQRSV--EMRNFIDGDLVEIFLDLSKEDQVAVSELSGVSNS 1260
Query: 622 NDILDELYDIEALS 635
D++ ++ +I L+
Sbjct: 1261 EDLVKKIEEISRLT 1274
>gi|301630307|ref|XP_002944263.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Xenopus (Silurana) tropicalis]
Length = 92
Score = 81.3 bits (199), Expect = 2e-12, Method: Composition-based stats.
Identities = 43/93 (46%), Positives = 59/93 (63%), Gaps = 1/93 (1%)
Query: 546 LPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQL 605
+ EK YRRLLMLQN + T H GLNPRAFR NP R ++DG L+ ++L L
Sbjct: 1 MQEKTYRRLLMLQNAL-TVLPHHAGLNPRAFRMLNSSRRMLQNPVRNVLDGELLNRYLYL 59
Query: 606 SLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
S ER E+ +KIG+ + ILD+L +I+ ++S F
Sbjct: 60 SNMERSELARKIGTTTDIILDDLLEIDRVTSLF 92
Score = 40.0 bits (92), Expect = 4.2, Method: Composition-based stats.
Identities = 20/49 (40%), Positives = 30/49 (61%)
Query: 414 NSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEF 462
NS NP R ++DG L+ ++L LS ER E+ +KIG+ + ILD+
Sbjct: 34 NSSRRMLQNPVRNVLDGELLNRYLYLSNMERSELARKIGTTTDIILDDL 82
>gi|401828022|ref|XP_003888303.1| pre-mRNA cleavage and polyadenylation specificity factor
[Encephalitozoon hellem ATCC 50504]
gi|392999575|gb|AFM99322.1| pre-mRNA cleavage and polyadenylation specificity factor
[Encephalitozoon hellem ATCC 50504]
Length = 1155
Score = 81.3 bits (199), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 80/367 (21%), Positives = 154/367 (41%), Gaps = 56/367 (15%)
Query: 211 EDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEG 270
E+ E+ ++ IP +F+V L+S + E I + L E E+V +K + ++
Sbjct: 788 EEVEVSSNNEKDCGIPVNTYRFYVDLYSE-NHEHI--DTYELEENEYVFDVKYLILDDMQ 844
Query: 271 TLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPV 330
G ++ + T + ED +GR+ + +II VVP P P K+K++ ++ KG +
Sbjct: 845 GNYGKSPFLLICTTFIEGEDKPAKGRLHVLEIISVVPSPESPFKDCKLKVLGIEKTKGSI 904
Query: 331 TAICHVAGFLVTAVGQKIYIWQL-KDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSI 389
+ G + +G KI I+++ + N + I F D ++ +S+ VKN IL D R +
Sbjct: 905 VQCSEIRGKIALCLGTKIMIYKIDRSNGIIPIGFYDLHIFTSSISVVKNYILASDIYRGL 964
Query: 390 ALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICK 449
+ +Q + L L++ + P + + L+ +LS + C
Sbjct: 965 SFFFFQSKPIRLHLIS--------------SSEPLKNVTSTELLIAGNELS----MVCCD 1006
Query: 450 KIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 509
G+ H + Y P S G +L+K+ + + ++
Sbjct: 1007 SKGTIH----------------------AYTYSPNNIISMDGAKLVKRAE--MKTNLGRL 1042
Query: 510 FKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
F S G R + +Y+ + ++ + + NY +LL +Q ++ H
Sbjct: 1043 F---------SSGIGFRKNSIMFYSKTNLSI-HLAGIDDLNYPKLLEIQTSIMVHLKSVL 1092
Query: 570 GLNPRAF 576
GLN R +
Sbjct: 1093 GLNQRDY 1099
>gi|62318656|dbj|BAD95136.1| UV-damaged DNA-binding protein- like [Arabidopsis thaliana]
Length = 1088
Score = 81.3 bits (199), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 124/546 (22%), Positives = 218/546 (39%), Gaps = 116/546 (21%)
Query: 83 GVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCP 142
G R +R FS+ + VF PA ++ ++ L ++ + VS + PF++ P
Sbjct: 626 GTRPITLRTFSSKSATH-VFAASDRPAVIYSNNKKLLYSNVSLKE--VSHMCPFNSAAFP 682
Query: 143 RGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPS 202
L + EL I + +R +P+ + + +T+T+ I EPS
Sbjct: 683 DS-LAIAREGELTIGTIDDI----QKLHIRTIPIGEHARRICHQEQTRTFAISCLRNEPS 737
Query: 203 TDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLK 262
+ +S F+ L +Q S+E + +++PL +E +
Sbjct: 738 AE--------------ESESHFVRLLDAQ---------SFEFL--SSYPLDAFECGCSIL 772
Query: 263 NVSMEYEGTLSGLRGYIALGTNYNY-SEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMI 321
+ S + + Y +GT Y E+ +GRIL+F I+E + ++++I
Sbjct: 773 SCSFTDDKNV-----YYCVGTAYVLPEENEPTKGRILVF-IVE----------EGRLQLI 816
Query: 322 YAKEQKGPVTAICHVAGFLVTAVGQKI--YIWQLKDNDLTGIAFIDTEV-----YIASMV 374
KE KG V ++ G L+ ++ QKI Y W L+D+ G + +E +A V
Sbjct: 817 TEKETKGAVYSLNAFNGKLLASINQKIQLYKWMLRDD---GTRELQSECGHHGHILALYV 873
Query: 375 SVK-NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLV 433
+ + I VGD +SI+LL Y+ E + ARDY
Sbjct: 874 QTRGDFIAVGDLMKSISLLIYKHEEGAIEERARDYNAN---------------------- 911
Query: 434 WKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR 493
W +EI ++DI ++ +D N+ E R
Sbjct: 912 WM-------AAVEIL------NDDI--------YLGTDNCFNIFTVKKNNEGATDEERAR 950
Query: 494 LIKKTDFHLGQHVNTF----FKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEK 549
+ ++H+G+ VN F ++ S I P + ++ G +G LP++
Sbjct: 951 MEVVGEYHIGEFVNRFRHGSLVMKLPDSDIGQIP------TVIFGTVSGMIGVIASLPQE 1004
Query: 550 NYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGE 609
Y L LQ + GGL+ +R++ + A ++G +DG L+ FL LS G+
Sbjct: 1005 QYAFLEKLQTSLRKVIKGVGGLSHEQWRSFNNEKRTA--EAKGYLDGDLIESFLDLSRGK 1062
Query: 610 RLEICK 615
EI K
Sbjct: 1063 MEEISK 1068
>gi|297799958|ref|XP_002867863.1| hypothetical protein ARALYDRAFT_492777 [Arabidopsis lyrata subsp.
lyrata]
gi|297313699|gb|EFH44122.1| hypothetical protein ARALYDRAFT_492777 [Arabidopsis lyrata subsp.
lyrata]
Length = 1088
Score = 80.9 bits (198), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 124/559 (22%), Positives = 219/559 (39%), Gaps = 118/559 (21%)
Query: 71 SKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPV 130
S + ++ + G + +R FS+ + VF PA ++ ++ L ++ + V
Sbjct: 614 SGKLRDRKKVSLGTQPITLRTFSSKSATH-VFAASDRPAVIYSNNKKLLYSNVNLKE--V 670
Query: 131 STLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETK 190
S + PF++ P L + EL I + +R +P+ + + +T+
Sbjct: 671 SHMCPFNSAAFPDS-LAIAREGELTIGTIDDI----QKLHIRTIPIGEHARRICHQEQTR 725
Query: 191 TYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFH-VSLFSPFSWEEIPQTN 249
T+ I +PS + S+ H V L S+E + +
Sbjct: 726 TFAICCLRNQPSAEE------------------------SEMHFVRLLDAQSFEFL--ST 759
Query: 250 FPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNY-SEDVTCRGRILLFDIIEVVPE 308
+PL +E+ + + S + + Y +GT Y E+ +GRIL+F I+E
Sbjct: 760 YPLDAFEYGCSILSCSFTDDKNV-----YYCVGTAYVLPEENEPTKGRILVF-IVE---- 809
Query: 309 PGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKI--YIWQLKDNDLTGIAFIDT 366
+ ++++I KE KG V ++ G L+ A+ QKI Y W L+D+ G + +
Sbjct: 810 ------EGRLQLITEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDD---GTRELQS 860
Query: 367 EV-----YIASMVSVK-NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYA 420
E +A V + + I+VGD +SI+LL Y+ E + ARDY
Sbjct: 861 ECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNAN--------- 911
Query: 421 GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFM 480
W +ILD+ +G +D N+
Sbjct: 912 -------------WM------------------AAVEILDDDIYLG---ADNCFNLFTVK 937
Query: 481 YQPEARESNGGHRLIKKTDFHLGQHVNTF----FKIRCKPSSISDAPGARSRFLTWYASL 536
E R+ ++H+G+ VN F +R S I P + ++
Sbjct: 938 KNNEGATDEERARMEVVGEYHIGEFVNRFRHGSLVMRLPDSEIGQIP------TVIFGTV 991
Query: 537 DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDG 596
G +G LP++ Y L LQ + GGL+ +R++ + A ++ +DG
Sbjct: 992 SGMIGVIASLPQEQYAFLEKLQTSLRKVIKGVGGLSHEQWRSFNNEKRTA--EAKSYLDG 1049
Query: 597 SLVWKFLQLSLGERLEICK 615
L+ FL LS G+ EI K
Sbjct: 1050 DLIESFLDLSRGKMEEISK 1068
>gi|15233515|ref|NP_193842.1| DNA damage-binding protein 1b [Arabidopsis thaliana]
gi|73620956|sp|O49552.2|DDB1B_ARATH RecName: Full=DNA damage-binding protein 1b; AltName: Full=UV-damaged
DNA-binding protein 1b; Short=DDB1b
gi|110739453|dbj|BAF01636.1| UV-damaged DNA-binding protein- like [Arabidopsis thaliana]
gi|332659001|gb|AEE84401.1| DNA damage-binding protein 1b [Arabidopsis thaliana]
Length = 1088
Score = 80.9 bits (198), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 124/546 (22%), Positives = 218/546 (39%), Gaps = 116/546 (21%)
Query: 83 GVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCP 142
G R +R FS+ + VF PA ++ ++ L ++ + VS + PF++ P
Sbjct: 626 GTRPITLRTFSSKSATH-VFAASDRPAVIYSNNKKLLYSNVNLKE--VSHMCPFNSAAFP 682
Query: 143 RGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPS 202
L + EL I + +R +P+ + + +T+T+ I EPS
Sbjct: 683 DS-LAIAREGELTIGTIDDI----QKLHIRTIPIGEHARRICHQEQTRTFAISCLRNEPS 737
Query: 203 TDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLK 262
+ +S F+ L +Q S+E + +++PL +E +
Sbjct: 738 AE--------------ESESHFVRLLDAQ---------SFEFL--SSYPLDAFECGCSIL 772
Query: 263 NVSMEYEGTLSGLRGYIALGTNYNY-SEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMI 321
+ S + + Y +GT Y E+ +GRIL+F I+E + ++++I
Sbjct: 773 SCSFTDDKNV-----YYCVGTAYVLPEENEPTKGRILVF-IVE----------EGRLQLI 816
Query: 322 YAKEQKGPVTAICHVAGFLVTAVGQKI--YIWQLKDNDLTGIAFIDTEV-----YIASMV 374
KE KG V ++ G L+ ++ QKI Y W L+D+ G + +E +A V
Sbjct: 817 TEKETKGAVYSLNAFNGKLLASINQKIQLYKWMLRDD---GTRELQSECGHHGHILALYV 873
Query: 375 SVK-NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLV 433
+ + I VGD +SI+LL Y+ E + ARDY
Sbjct: 874 QTRGDFIAVGDLMKSISLLIYKHEEGAIEERARDYNAN---------------------- 911
Query: 434 WKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR 493
W +EI ++DI ++ +D N+ E R
Sbjct: 912 WM-------TAVEIL------NDDI--------YLGTDNCFNIFTVKKNNEGATDEERAR 950
Query: 494 LIKKTDFHLGQHVNTF----FKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEK 549
+ ++H+G+ VN F ++ S I P + ++ G +G LP++
Sbjct: 951 MEVVGEYHIGEFVNRFRHGSLVMKLPDSDIGQIP------TVIFGTVSGMIGVIASLPQE 1004
Query: 550 NYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGE 609
Y L LQ + GGL+ +R++ + A ++G +DG L+ FL LS G+
Sbjct: 1005 QYAFLEKLQTSLRKVIKGVGGLSHEQWRSFNNEKRTA--EAKGYLDGDLIESFLDLSRGK 1062
Query: 610 RLEICK 615
EI K
Sbjct: 1063 MEEISK 1068
>gi|449019486|dbj|BAM82888.1| similar to cleavage and polyadenylation specificity factor subunit
[Cyanidioschyzon merolae strain 10D]
Length = 1880
Score = 80.5 bits (197), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 86/429 (20%), Positives = 174/429 (40%), Gaps = 56/429 (13%)
Query: 218 DPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNV-----SMEYEGTL 272
D R +P L+ Q V L + S E + + + E +C + + + E
Sbjct: 1481 DSGRERDLPLLIDQHAVVLLARNSLEVLARYDLEQTEVGLAMCATRIRHFQRTGDDEAPR 1540
Query: 273 SGLRGYIALGTNYNYSEDVTCRGRILLFDII--EVVPEPGQPLTKNKIKMIYAKEQKGPV 330
R + +GT + ED + RGR+L+F+I E T +++ + A E KG V
Sbjct: 1541 FTERDVLVVGTCFLRGEDTSIRGRLLVFEISRQEGRQHHQHQRTLYQMQTLAATEVKGAV 1600
Query: 331 TAICHV-AGFLVTAVGQKIYIWQLKDNDLTGIAFI-DTEVYIASMVSVKNLILVGDYARS 388
+A+ V GF+ + G ++ +++L +++++ I+F ++ + + ++K IL D
Sbjct: 1601 SAVAPVKGGFVCCSAGPRLEVYKLIEDEMSCISFYPGINLFFSHVGTLKQYILASDMRYG 1660
Query: 389 IALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEIC 448
++ L ++ + + + RD + + + ++ ++ ++LS+
Sbjct: 1661 VSFLFWRSRNVSQNFLCRDEAQRELVASEWLMHGTKANVLSADMLGNIIELSI------- 1713
Query: 449 KKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNT 508
P ES GG R+ + FH+G N
Sbjct: 1714 --------------------------------PSPVDPESAGGTRMTFEAGFHVGSRPNA 1741
Query: 509 FFKIRC-KPSSISDAPGARSRF----LTWYASLDGALGFFLPLPEKNYRRL-LMLQNVMV 562
++R PS+ + P S + ++DG + PL ++L L Q++M+
Sbjct: 1742 VRRVRIDDPSAETPPPNEPSSLWNTHVILLGTVDGMITMVSPLLRGVAKKLELAAQDLML 1801
Query: 563 THTSHTGGLNPRAFRTYKGKGYYAG--NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 620
L R++R + AG P R I+DG ++ + L R EI ++IG
Sbjct: 1802 EPELRKWCLYARSWRVMRSLTVAAGLRKPKRSILDGDVLQLYGSLDTPRRKEIARRIGMP 1861
Query: 621 HNDILDELY 629
+ + ++
Sbjct: 1862 QEALFEAIF 1870
>gi|297809743|ref|XP_002872755.1| UV-damaged DNA-binding protein 1A [Arabidopsis lyrata subsp. lyrata]
gi|297318592|gb|EFH49014.1| UV-damaged DNA-binding protein 1A [Arabidopsis lyrata subsp. lyrata]
Length = 1088
Score = 80.5 bits (197), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 122/559 (21%), Positives = 223/559 (39%), Gaps = 116/559 (20%)
Query: 89 MRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYF 148
+R FS+ + VF P ++ +++ L ++ + VS + PF++ P L
Sbjct: 632 LRTFSSKSATH-VFAASDRPTVIYSSNKKLLYSNVNLKE--VSHMCPFNSAAFPDS-LAI 687
Query: 149 NAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKF 208
+ EL I + +R +PL + + +T+T+ I + +
Sbjct: 688 AREGELTIGTIDDI----QKLHIRTIPLGEHARRICHQEQTRTFGICSLGNQS------- 736
Query: 209 NGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEY 268
N E+ E+ F+ L Q F + + +PL +E+ + + S
Sbjct: 737 NAEESEM-------HFVRLLDDQ-------TFEF----MSTYPLDSFEYGCSILSCSFTD 778
Query: 269 EGTLSGLRGYIALGTNYNY-SEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQK 327
+ + Y +GT Y E+ +GRIL+F + E G ++++I KE K
Sbjct: 779 DKNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVEDG------RLQLIAEKETK 822
Query: 328 GPVTAICHVAGFLVTAVGQKI--YIWQLKDNDLTGIAFIDTEV-----YIASMVSVK-NL 379
G V ++ G L+ A+ QKI Y W L+D+ G + +E +A V + +
Sbjct: 823 GAVYSLNAFNGKLLAAINQKIQLYKWMLRDD---GTRELQSECGHHGHILALYVQTRGDF 879
Query: 380 ILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQL 439
I+VGD +SI+LL Y+ E + ARDY ++
Sbjct: 880 IVVGDLMKSISLLLYKHEEGAIEERARDYNANWMSAV----------------------- 916
Query: 440 SLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTD 499
+ILD+ +G ++ + N+V E RL +
Sbjct: 917 -----------------EILDDDIYLG---AENNFNLVTVKKNSEGATDEERGRLEVVGE 956
Query: 500 FHLGQHVNTF----FKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLL 555
+HLG+ VN F +R S I P + +++G +G LP++ Y L
Sbjct: 957 YHLGEFVNRFRHGSLVMRLPDSEIGQIP------TVIFGTVNGVIGVIASLPQEQYTFLE 1010
Query: 556 MLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICK 615
LQ+ + GGL+ +R++ + A +R +DG L+ FL LS + +I K
Sbjct: 1011 KLQSSLRKVIKGVGGLSHEQWRSFNNEKRTA--EARNFLDGDLIESFLDLSRNKMEDISK 1068
Query: 616 KIGSKHNDILDELYDIEAL 634
+ + ++ + ++ L
Sbjct: 1069 SMNVQVEELCKRVEELTRL 1087
>gi|2911067|emb|CAA17529.1| UV-damaged DNA-binding protein-like [Arabidopsis thaliana]
gi|7268907|emb|CAB79110.1| UV-damaged DNA-binding protein-like [Arabidopsis thaliana]
Length = 1102
Score = 80.5 bits (197), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 124/546 (22%), Positives = 218/546 (39%), Gaps = 116/546 (21%)
Query: 83 GVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCP 142
G R +R FS+ + VF PA ++ ++ L ++ + VS + PF++ P
Sbjct: 640 GTRPITLRTFSSKSATH-VFAASDRPAVIYSNNKKLLYSNVNLKE--VSHMCPFNSAAFP 696
Query: 143 RGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPS 202
L + EL I + +R +P+ + + +T+T+ I EPS
Sbjct: 697 DS-LAIAREGELTIGTIDDI----QKLHIRTIPIGEHARRICHQEQTRTFAISCLRNEPS 751
Query: 203 TDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLK 262
+ +S F+ L +Q S+E + +++PL +E +
Sbjct: 752 AE--------------ESESHFVRLLDAQ---------SFEFL--SSYPLDAFECGCSIL 786
Query: 263 NVSMEYEGTLSGLRGYIALGTNYNY-SEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMI 321
+ S + + Y +GT Y E+ +GRIL+F I+E + ++++I
Sbjct: 787 SCSFTDDKNV-----YYCVGTAYVLPEENEPTKGRILVF-IVE----------EGRLQLI 830
Query: 322 YAKEQKGPVTAICHVAGFLVTAVGQKI--YIWQLKDNDLTGIAFIDTEV-----YIASMV 374
KE KG V ++ G L+ ++ QKI Y W L+D+ G + +E +A V
Sbjct: 831 TEKETKGAVYSLNAFNGKLLASINQKIQLYKWMLRDD---GTRELQSECGHHGHILALYV 887
Query: 375 SVK-NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLV 433
+ + I VGD +SI+LL Y+ E + ARDY
Sbjct: 888 QTRGDFIAVGDLMKSISLLIYKHEEGAIEERARDYNAN---------------------- 925
Query: 434 WKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR 493
W +EI ++DI ++ +D N+ E R
Sbjct: 926 WM-------TAVEIL------NDDI--------YLGTDNCFNIFTVKKNNEGATDEERAR 964
Query: 494 LIKKTDFHLGQHVNTF----FKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEK 549
+ ++H+G+ VN F ++ S I P + ++ G +G LP++
Sbjct: 965 MEVVGEYHIGEFVNRFRHGSLVMKLPDSDIGQIP------TVIFGTVSGMIGVIASLPQE 1018
Query: 550 NYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGE 609
Y L LQ + GGL+ +R++ + A ++G +DG L+ FL LS G+
Sbjct: 1019 QYAFLEKLQTSLRKVIKGVGGLSHEQWRSFNNEKRTA--EAKGYLDGDLIESFLDLSRGK 1076
Query: 610 RLEICK 615
EI K
Sbjct: 1077 MEEISK 1082
>gi|312283457|dbj|BAJ34594.1| unnamed protein product [Thellungiella halophila]
Length = 1088
Score = 80.5 bits (197), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 115/518 (22%), Positives = 205/518 (39%), Gaps = 113/518 (21%)
Query: 130 VSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLET 189
VS + PF++ P L + EL I + +R +PL + + +T
Sbjct: 670 VSHMCPFNSAAFPDS-LAIAREGELTIGTIDDI----QKLHIRTIPLGEHARRICHQEQT 724
Query: 190 KTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTN 249
+T+ I + + N E+ E+ V L S+E + +
Sbjct: 725 RTFGICSLGNQT-------NAEESEM----------------HFVRLLDDQSFEFV--ST 759
Query: 250 FPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNY-SEDVTCRGRILLFDIIEVVPE 308
+PL +E+ + + S + + Y +GT Y E+ +GRIL+F + E
Sbjct: 760 YPLDAFEYGCSILSCSFADDKNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 809
Query: 309 PGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKI--YIWQLKDNDLTGIAFIDT 366
G K+++I KE KG V ++ G L+ A+ QKI Y W L+D+ G + +
Sbjct: 810 DG------KLQLIAEKETKGSVYSLNAFNGKLLAAINQKIQLYKWMLRDD---GTRELQS 860
Query: 367 EV-----YIASMVSVK-NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYA 420
E +A V + + I+VGD +SI+LL Y+ E + ARDY ++
Sbjct: 861 ECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAV---- 916
Query: 421 GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFM 480
+ILD+ +G ++ + N++
Sbjct: 917 ------------------------------------EILDDDIYLG---AENNFNLLTVK 937
Query: 481 YQPEARESNGGHRLIKKTDFHLGQHVNTF----FKIRCKPSSISDAPGARSRFLTWYASL 536
E RL ++HLG+ VN F +R S I P + ++
Sbjct: 938 KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSEIGQIP------TVIFGTV 991
Query: 537 DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDG 596
+G +G LP++ Y L LQ+ + GGL+ +R++ + A +R +DG
Sbjct: 992 NGVIGVIASLPQEQYMFLEKLQSSLRKVIKGVGGLSHEQWRSFNNEKRTA--EARNFLDG 1049
Query: 597 SLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEAL 634
L+ FL LS + +I K + + ++ + ++ L
Sbjct: 1050 DLIESFLDLSRNKMEDISKSMNVQVEELCKRVEELTRL 1087
>gi|110741229|dbj|BAF02165.1| UV-damaged DNA binding factor - like protein [Arabidopsis thaliana]
Length = 727
Score = 80.1 bits (196), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 121/559 (21%), Positives = 223/559 (39%), Gaps = 116/559 (20%)
Query: 89 MRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYF 148
+R FS+ + VF P ++ +++ L ++ + VS + PF++ P L
Sbjct: 271 LRTFSSKSATH-VFAASDRPTVIYSSNKKLLYSNVNLKE--VSHMCPFNSAAFPDS-LAI 326
Query: 149 NAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKF 208
+ EL I + +R +PL + + +T+T+ I + +
Sbjct: 327 AREGELTIGTIDDI----QKLHIRTIPLGEHARRICHQEQTRTFGICSLGNQS------- 375
Query: 209 NGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEY 268
N E+ E+ F+ L Q F + + +PL +E+ + + S
Sbjct: 376 NSEESEM-------HFVRLLDDQ-------TFEF----MSTYPLDSFEYGCSILSCSFTE 417
Query: 269 EGTLSGLRGYIALGTNYNY-SEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQK 327
+ + Y +GT Y E+ +GRIL+F + E G ++++I KE K
Sbjct: 418 DKNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVEDG------RLQLIAEKETK 461
Query: 328 GPVTAICHVAGFLVTAVGQKI--YIWQLKDNDLTGIAFIDTEV-----YIASMVSVK-NL 379
G V ++ G L+ A+ QKI Y W L+D+ G + +E +A V + +
Sbjct: 462 GAVYSLNAFNGKLLAAINQKIQLYKWMLRDD---GTRELQSECGHHGHILALYVQTRGDF 518
Query: 380 ILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQL 439
I+VGD +SI+LL Y+ E + ARDY ++
Sbjct: 519 IVVGDLMKSISLLLYKHEEGAIEERARDYNANWMSAV----------------------- 555
Query: 440 SLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTD 499
+ILD+ +G ++ + N++ E RL +
Sbjct: 556 -----------------EILDDDIYLG---AENNFNLLTVKKNSEGATDEERGRLEVVGE 595
Query: 500 FHLGQHVNTF----FKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLL 555
+HLG+ VN F +R S I P + +++G +G LP++ Y L
Sbjct: 596 YHLGEFVNRFRHGSLVMRLPDSEIGQIP------TVIFGTVNGVIGVIASLPQEQYTFLE 649
Query: 556 MLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICK 615
LQ+ + GGL+ +R++ + A +R +DG L+ FL LS + +I K
Sbjct: 650 KLQSSLRKVIKGVGGLSHEQWRSFNNEKRTA--EARNFLDGDLIESFLDLSRNKMEDISK 707
Query: 616 KIGSKHNDILDELYDIEAL 634
+ + ++ + ++ L
Sbjct: 708 SMNVQVEELCKRVEELTRL 726
>gi|300707023|ref|XP_002995737.1| hypothetical protein NCER_101290 [Nosema ceranae BRL01]
gi|239604943|gb|EEQ82066.1| hypothetical protein NCER_101290 [Nosema ceranae BRL01]
Length = 1155
Score = 80.1 bits (196), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 87/379 (22%), Positives = 158/379 (41%), Gaps = 73/379 (19%)
Query: 248 TNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVP 307
+ F L E+VL +K +S+ ++G +I + ED RGRI++F++I+++
Sbjct: 830 STFDLESDEYVLDIKELSLNDSIGINGKNNFIVICVTKVEGEDKHSRGRIIVFELIDIIV 889
Query: 308 EPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDND-LTGIAFIDT 366
+ K+K++ ++ KG +T + G L+ A+G K I+++ ++ L I D
Sbjct: 890 DKANVHKDKKLKVLASENIKGCITKCDEIKGNLIVALGIKTMIYKIDRSEGLIPIGIHDL 949
Query: 367 EVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRG 426
SM+++KN +L D R ++ YQ + L+LV T + K
Sbjct: 950 YTLTTSMITIKNFVLFSDIYRGLSFFYYQNKPVRLNLVC-----TSESIK---------- 994
Query: 427 IIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEAR 486
+ H D + + ++G + +D N+ + Y P
Sbjct: 995 -------------------------NAVHVDFIVKEPALGIICTDFAGNIHTYTYSPVNI 1029
Query: 487 ESNGGHRLIKK--TDFHLGQHVNTFFKIR------CKPSSISDAPGARSRFLTWYASLDG 538
S G + +K+ T+F+LG+ V I+ P ISD+ + LD
Sbjct: 1030 LSCNGTKFVKRCETNFNLGKLV-----IKRAHSKLLNPVFISDS--------NYIIELDS 1076
Query: 539 ALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPS-RGIIDGS 597
L NY L +QN ++ T GL P F + Y+ PS + I
Sbjct: 1077 -------LSLDNYNNFLKVQNAYLSLIEDTFGLCPENFNNCE---YHLKPPSVKKPILKE 1126
Query: 598 LVWKFLQLSLGERLEICKK 616
L+++FL L + ++ K+
Sbjct: 1127 LLFRFLHLPVDKKANFLKE 1145
>gi|449328561|gb|AGE94838.1| cleavage and polyadenylation specific factor [Encephalitozoon
cuniculi]
Length = 1156
Score = 80.1 bits (196), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 86/355 (24%), Positives = 144/355 (40%), Gaps = 60/355 (16%)
Query: 225 IPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTN 284
IP +F+V L+S +E I + L E E+V +K + ++ G ++ + T
Sbjct: 803 IPVDTYRFYVDLYSE-KYEHID--TYELDENEYVFHIKYLILDDMQGNYGKSPFLLVCTT 859
Query: 285 YNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAV 344
+ ED RGR+ + +II VVP P K+K++ ++ KG + V G + +
Sbjct: 860 FIEGEDRPARGRLHVLEIISVVPSLESPFKDCKLKVLGIEKTKGSIVRCEEVRGKIALCL 919
Query: 345 GQKIYIWQL-KDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSL 403
G KI I+++ + N + I F D ++ +S+ VKN IL D R ++ +Q + L L
Sbjct: 920 GTKIMIYKIDRSNGIIPIGFYDLHIFTSSISVVKNYILASDIYRGLSFFFFQSKPIRLHL 979
Query: 404 VARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEI--CKKIGSKHNDILDE 461
++ + P R L LS G L + C G+ H
Sbjct: 980 IS--------------SSEPLRNATSTEL------LSTGNELSMLCCDAKGTIHG----- 1014
Query: 462 FSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISD 521
+ Y P S G RL+K+ + + F K +SI
Sbjct: 1015 -----------------YTYSPNNIISMDGARLVKRAEIKTNLGRLSSFGAGFKKNSI-- 1055
Query: 522 APGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAF 576
+RS L + +D A +Y +LL +Q ++ H GLN R +
Sbjct: 1056 MFYSRSNMLIHVSGIDDA----------HYLKLLGVQTAIMAHLKSVFGLNQRDY 1100
>gi|15235577|ref|NP_192451.1| DNA damage-binding protein 1a [Arabidopsis thaliana]
gi|55976605|sp|Q9M0V3.1|DDB1A_ARATH RecName: Full=DNA damage-binding protein 1a; AltName: Full=UV-damaged
DNA-binding protein 1a; Short=DDB1a
gi|7267302|emb|CAB81084.1| UV-damaged DNA binding factor-like protein [Arabidopsis thaliana]
gi|25054828|gb|AAN71904.1| putative UV-damaged DNA binding factor [Arabidopsis thaliana]
gi|332657117|gb|AEE82517.1| DNA damage-binding protein 1a [Arabidopsis thaliana]
Length = 1088
Score = 79.7 bits (195), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 121/559 (21%), Positives = 223/559 (39%), Gaps = 116/559 (20%)
Query: 89 MRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYF 148
+R FS+ + VF P ++ +++ L ++ + VS + PF++ P L
Sbjct: 632 LRTFSSKSATH-VFAASDRPTVIYSSNKKLLYSNVNLKE--VSHMCPFNSAAFPDS-LAI 687
Query: 149 NAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKF 208
+ EL I + +R +PL + + +T+T+ I + +
Sbjct: 688 AREGELTIGTIDDI----QKLHIRTIPLGEHARRICHQEQTRTFGICSLGNQS------- 736
Query: 209 NGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEY 268
N E+ E+ F+ L Q F + + +PL +E+ + + S
Sbjct: 737 NSEESEM-------HFVRLLDDQ-------TFEF----MSTYPLDSFEYGCSILSCSFTE 778
Query: 269 EGTLSGLRGYIALGTNYNY-SEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQK 327
+ + Y +GT Y E+ +GRIL+F I+E ++++I KE K
Sbjct: 779 DKNV-----YYCVGTAYVLPEENEPTKGRILVF-IVE----------DGRLQLIAEKETK 822
Query: 328 GPVTAICHVAGFLVTAVGQKI--YIWQLKDNDLTGIAFIDTEV-----YIASMVSVK-NL 379
G V ++ G L+ A+ QKI Y W L+D+ G + +E +A V + +
Sbjct: 823 GAVYSLNAFNGKLLAAINQKIQLYKWMLRDD---GTRELQSECGHHGHILALYVQTRGDF 879
Query: 380 ILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQL 439
I+VGD +SI+LL Y+ E + ARDY ++
Sbjct: 880 IVVGDLMKSISLLLYKHEEGAIEERARDYNANWMSAV----------------------- 916
Query: 440 SLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTD 499
+ILD+ +G ++ + N++ E RL +
Sbjct: 917 -----------------EILDDDIYLG---AENNFNLLTVKKNSEGATDEERGRLEVVGE 956
Query: 500 FHLGQHVNTF----FKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLL 555
+HLG+ VN F +R S I P + +++G +G LP++ Y L
Sbjct: 957 YHLGEFVNRFRHGSLVMRLPDSEIGQIP------TVIFGTVNGVIGVIASLPQEQYTFLE 1010
Query: 556 MLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICK 615
LQ+ + GGL+ +R++ + A +R +DG L+ FL LS + +I K
Sbjct: 1011 KLQSSLRKVIKGVGGLSHEQWRSFNNEKRTA--EARNFLDGDLIESFLDLSRNKMEDISK 1068
Query: 616 KIGSKHNDILDELYDIEAL 634
+ + ++ + ++ L
Sbjct: 1069 SMNVQVEELCKRVEELTRL 1087
>gi|186511557|ref|NP_001118940.1| DNA damage-binding protein 1a [Arabidopsis thaliana]
gi|332657118|gb|AEE82518.1| DNA damage-binding protein 1a [Arabidopsis thaliana]
Length = 1067
Score = 79.3 bits (194), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 121/559 (21%), Positives = 223/559 (39%), Gaps = 116/559 (20%)
Query: 89 MRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYF 148
+R FS+ + VF P ++ +++ L ++ + VS + PF++ P L
Sbjct: 611 LRTFSSKSATH-VFAASDRPTVIYSSNKKLLYSNVNLKE--VSHMCPFNSAAFPDS-LAI 666
Query: 149 NAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKF 208
+ EL I + +R +PL + + +T+T+ I + +
Sbjct: 667 AREGELTIGTIDDI----QKLHIRTIPLGEHARRICHQEQTRTFGICSLGNQS------- 715
Query: 209 NGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEY 268
N E+ E+ F+ L Q F + + +PL +E+ + + S
Sbjct: 716 NSEESEM-------HFVRLLDDQ-------TFEF----MSTYPLDSFEYGCSILSCSFTE 757
Query: 269 EGTLSGLRGYIALGTNYNY-SEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQK 327
+ + Y +GT Y E+ +GRIL+F + E G ++++I KE K
Sbjct: 758 DKNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVEDG------RLQLIAEKETK 801
Query: 328 GPVTAICHVAGFLVTAVGQKI--YIWQLKDNDLTGIAFIDTEV-----YIASMVSVK-NL 379
G V ++ G L+ A+ QKI Y W L+D+ G + +E +A V + +
Sbjct: 802 GAVYSLNAFNGKLLAAINQKIQLYKWMLRDD---GTRELQSECGHHGHILALYVQTRGDF 858
Query: 380 ILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQL 439
I+VGD +SI+LL Y+ E + ARDY ++
Sbjct: 859 IVVGDLMKSISLLLYKHEEGAIEERARDYNANWMSAV----------------------- 895
Query: 440 SLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTD 499
+ILD+ +G ++ + N++ E RL +
Sbjct: 896 -----------------EILDDDIYLG---AENNFNLLTVKKNSEGATDEERGRLEVVGE 935
Query: 500 FHLGQHVNTF----FKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLL 555
+HLG+ VN F +R S I P + +++G +G LP++ Y L
Sbjct: 936 YHLGEFVNRFRHGSLVMRLPDSEIGQIP------TVIFGTVNGVIGVIASLPQEQYTFLE 989
Query: 556 MLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICK 615
LQ+ + GGL+ +R++ + A +R +DG L+ FL LS + +I K
Sbjct: 990 KLQSSLRKVIKGVGGLSHEQWRSFNNEKRTA--EARNFLDGDLIESFLDLSRNKMEDISK 1047
Query: 616 KIGSKHNDILDELYDIEAL 634
+ + ++ + ++ L
Sbjct: 1048 SMNVQVEELCKRVEELTRL 1066
>gi|396082420|gb|AFN84029.1| pre-mRNA cleavage and polyadenylation [Encephalitozoon romaleae
SJ-2008]
Length = 1156
Score = 79.3 bits (194), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 89/408 (21%), Positives = 165/408 (40%), Gaps = 75/408 (18%)
Query: 172 RKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQ 231
RK+P+ P + Y Y +V S +D E + IP +
Sbjct: 765 RKIPILRIPKHIEY---ADRYMVVASC------------KDVEFSSKDEKDCGIPVNTYR 809
Query: 232 FHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDV 291
F+V L+S +E I + + L E E++ +K + ++ G ++ + T + ED
Sbjct: 810 FYVDLYSE-RYEHI--STYELDENEYIFDVKYLVLDDMQGNYGKSPFLLVCTTFIEGEDR 866
Query: 292 TCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIW 351
RGR+ + +II VVP P K+K++ ++ KG + V G + +G KI I+
Sbjct: 867 PARGRLHVLEIISVVPSLESPFRDCKLKVLGIEKTKGSIVQCSEVRGKIALCLGTKIMIY 926
Query: 352 QL-KDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKP 410
++ + + I F D ++ +S+ +KN IL D R ++ +Q + L L++
Sbjct: 927 KIDRSTGIIPIGFYDLHIFTSSISVMKNYILASDIYRGLSFFFFQSKPIRLHLIS----- 981
Query: 411 TQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMIS 470
+ P + + L+ +LS + C G+ H
Sbjct: 982 ---------SSEPLKNVTSTELLTAGNELS----MVCCDAKGTIH--------------- 1013
Query: 471 DKDKNVVLFMYQPEARESNGGHRLIKKTDF--HLGQHVNTFFKIRCKPSSISDAPGARSR 528
+ Y P S G +L+K+++ +LG R S I G R
Sbjct: 1014 -------AYTYSPNNIISMDGAKLVKRSEMKTNLG---------RLSSSGI----GFRKN 1053
Query: 529 FLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAF 576
+ +Y+ + L + + + + Y +LL +Q ++ H GLN R +
Sbjct: 1054 SIMFYSKTN-LLIYLVGMDDSYYLKLLKIQTSIMVHLKSVLGLNQRDY 1100
>gi|145351726|ref|XP_001420218.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144580451|gb|ABO98511.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 1120
Score = 78.2 bits (191), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 98/474 (20%), Positives = 181/474 (38%), Gaps = 99/474 (20%)
Query: 171 VRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVS 230
+R +PL P +A+ ++T T+ + + + D+EL
Sbjct: 736 IRTIPLGGHPRRIAHQVDTNTFAVAVE--------HLMSKGDQELF-------------- 773
Query: 231 QFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYS-E 289
+ L S++ + Q F L E E L + S + R Y +GT + Y E
Sbjct: 774 ---IRLIDDGSFDTLHQ--FRLEEHELASSLMSCSFAGDS-----REYYVVGTGFAYEQE 823
Query: 290 DVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKI- 348
D RGRIL+ + + ++++ KE +G V + G L+ + K+
Sbjct: 824 DEPSRGRILVLRV-----------EADALELVSEKEVRGAVYNLNAFKGKLLAGINSKLE 872
Query: 349 -YIWQLKDND---LTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLV 404
+ W +++D L ++ S+ + + ILVGD +S++LL+Y+PE + +
Sbjct: 873 LFKWTPREDDAHELVSECSHHGQIITFSVKTRGDWILVGDLLKSMSLLQYKPEEGAIDEI 932
Query: 405 ARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSS 464
ARD+ + +LD+ +
Sbjct: 933 ARDFNANWMTAVA----------------------------------------MLDDDET 952
Query: 465 MGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSI--SDA 522
++ ++ N+ A RL ++HLG+ VN F P S+ S
Sbjct: 953 --YLGAENSLNLFTVARNMNAMTDEERSRLEITGEYHLGEFVNVF-----SPGSLVMSLK 1005
Query: 523 PGARSRFLT-WYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKG 581
G T + + +G +G LP+ Y LQ M H GGL +R+++
Sbjct: 1006 DGDSLEVPTLLFGTGNGVIGVLASLPKDAYDFAERLQTSMNKHIQGVGGLKHAEWRSFRH 1065
Query: 582 KGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALS 635
+PSR +DG LV FL L + + + + +I+ + +++ L+
Sbjct: 1066 TLRRKSDPSRNFVDGDLVESFLDLKVEQADVVAADMKCDRAEIIRRVEELQRLT 1119
>gi|19074861|ref|NP_586367.1| CLEAVAGE AND POLYADENYLATION SPECIFIC FACTOR [Encephalitozoon
cuniculi GB-M1]
gi|19069586|emb|CAD25971.1| CLEAVAGE AND POLYADENYLATION SPECIFIC FACTOR [Encephalitozoon
cuniculi GB-M1]
Length = 1156
Score = 77.8 bits (190), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 85/355 (23%), Positives = 144/355 (40%), Gaps = 60/355 (16%)
Query: 225 IPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTN 284
IP +F+V L+S +E I + L E E+V +K + ++ G ++ + T
Sbjct: 803 IPVDTYRFYVDLYSE-KYEHID--TYELDENEYVFHIKYLILDDMQGNYGKSPFLLVCTT 859
Query: 285 YNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAV 344
+ ED RGR+ + +II VVP P K+K++ ++ KG + V G + +
Sbjct: 860 FIEGEDRPARGRLHVLEIISVVPSLESPFKDCKLKVLGIEKTKGSIVRCEEVRGKIALCL 919
Query: 345 GQKIYIWQL-KDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSL 403
G KI I+++ + + + I F D ++ +S+ VKN IL D R ++ +Q + L L
Sbjct: 920 GTKIMIYKIDRSSGIIPIGFYDLHIFTSSISVVKNYILASDIYRGLSFFFFQSKPIRLHL 979
Query: 404 VARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEI--CKKIGSKHNDILDE 461
++ + P R L LS G L + C G+ H
Sbjct: 980 IS--------------SSEPLRNATSTEL------LSTGNELSMLCCDAKGTIHG----- 1014
Query: 462 FSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISD 521
+ Y P S G RL+K+ + + F K +SI
Sbjct: 1015 -----------------YTYSPNNIISMDGARLVKRAEIKTNLGRLSSFGAGFKKNSI-- 1055
Query: 522 APGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAF 576
+RS L + +D A +Y +LL +Q ++ H GLN R +
Sbjct: 1056 MFYSRSNMLIHVSGIDDA----------HYLKLLGVQTAIMAHLKSVFGLNQRDY 1100
>gi|218197365|gb|EEC79792.1| hypothetical protein OsI_21216 [Oryza sativa Indica Group]
Length = 1089
Score = 77.0 bits (188), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 90/382 (23%), Positives = 157/382 (41%), Gaps = 83/382 (21%)
Query: 250 FPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNY-SEDVTCRGRILLFDIIEVVPE 308
+ L ++EH + + S + + Y +GT Y E+ +GRIL+F + E
Sbjct: 761 YQLDQYEHGCSIISCSFSDDNNV-----YYCVGTAYVLPEENEPSKGRILVFAV-----E 810
Query: 309 PGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKI--YIWQLKDNDLTGIAFIDT 366
G ++++I KE KG V ++ G L+ A+ QKI Y W L+++ G + +
Sbjct: 811 DG------RLQLIVEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRED---GSHELQS 861
Query: 367 EV----YIASMVSVK--NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYA 420
E +I ++ + + I+VGD +SI+LL Y+ E + +ARDY
Sbjct: 862 ECGHHGHILALYTQTRGDFIVVGDLMKSISLLVYKHEESAIEELARDYNAN--------- 912
Query: 421 GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFM 480
W +S E L+ IG+++N N+
Sbjct: 913 -------------W----MSAVEMLDDEIYIGAENN-----------------YNIFTVR 938
Query: 481 YQPEARESNGGHRLIKKTDFHLGQHVNTF----FKIRCKPSSISDAPGARSRFLTWYASL 536
+A RL ++HLG+ VN +R S + P + ++
Sbjct: 939 KNSDAATDEERGRLEVVGEYHLGEFVNRLRHGSLVMRLPDSEMGQIP------TVIFGTI 992
Query: 537 DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDG 596
+G +G LP + Y L LQ+ +V G L+ +R++ + +R +DG
Sbjct: 993 NGVIGIIASLPHEQYVFLEKLQSTLVKFIKGVGNLSHEQWRSFHNDKKTS--EARNFLDG 1050
Query: 597 SLVWKFLQLSLGERLEICKKIG 618
L+ FL LS + E+ K +G
Sbjct: 1051 DLIESFLDLSRNKMEEVAKGMG 1072
>gi|115465791|ref|NP_001056495.1| Os05g0592400 [Oryza sativa Japonica Group]
gi|48475231|gb|AAT44300.1| putative DNA damage binding protein 1 [Oryza sativa Japonica Group]
gi|113580046|dbj|BAF18409.1| Os05g0592400 [Oryza sativa Japonica Group]
gi|215694552|dbj|BAG89545.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222632766|gb|EEE64898.1| hypothetical protein OsJ_19757 [Oryza sativa Japonica Group]
Length = 1090
Score = 77.0 bits (188), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 90/382 (23%), Positives = 157/382 (41%), Gaps = 83/382 (21%)
Query: 250 FPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNY-SEDVTCRGRILLFDIIEVVPE 308
+ L ++EH + + S + + Y +GT Y E+ +GRIL+F + E
Sbjct: 762 YQLDQYEHGCSIISCSFSDDNNV-----YYCVGTAYVLPEENEPSKGRILVFAV-----E 811
Query: 309 PGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKI--YIWQLKDNDLTGIAFIDT 366
G ++++I KE KG V ++ G L+ A+ QKI Y W L+++ G + +
Sbjct: 812 DG------RLQLIVEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRED---GSHELQS 862
Query: 367 EV----YIASMVSVK--NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYA 420
E +I ++ + + I+VGD +SI+LL Y+ E + +ARDY
Sbjct: 863 ECGHHGHILALYTQTRGDFIVVGDLMKSISLLVYKHEESAIEELARDYNAN--------- 913
Query: 421 GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFM 480
W +S E L+ IG+++N N+
Sbjct: 914 -------------W----MSAVEMLDDEIYIGAENN-----------------YNIFTVR 939
Query: 481 YQPEARESNGGHRLIKKTDFHLGQHVNTF----FKIRCKPSSISDAPGARSRFLTWYASL 536
+A RL ++HLG+ VN +R S + P + ++
Sbjct: 940 KNSDAATDEERGRLEVVGEYHLGEFVNRLRHGSLVMRLPDSEMGQIP------TVIFGTI 993
Query: 537 DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDG 596
+G +G LP + Y L LQ+ +V G L+ +R++ + +R +DG
Sbjct: 994 NGVIGIIASLPHEQYVFLEKLQSTLVKFIKGVGNLSHEQWRSFHNDKKTS--EARNFLDG 1051
Query: 597 SLVWKFLQLSLGERLEICKKIG 618
L+ FL LS + E+ K +G
Sbjct: 1052 DLIESFLDLSRNKMEEVAKGMG 1073
>gi|223994993|ref|XP_002287180.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220976296|gb|EED94623.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 1517
Score = 77.0 bits (188), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 101/484 (20%), Positives = 183/484 (37%), Gaps = 113/484 (23%)
Query: 171 VRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVS 230
V L TP +AYH + YC+ D G + ++ +
Sbjct: 1030 VTSYKLGMTPRRIAYHEAGRVYCV------GCIDGNAKGGNNNQVGAEINMGNC------ 1077
Query: 231 QFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSM----------EYEGTLSGLRGYIA 280
V F ++EEI Q + L +E +L L +VS+ + S + YI
Sbjct: 1078 ---VRFFDDSTFEEINQID--LEPFETILSLVSVSLCTSSQTLTQSNSKQDTSEYKPYIL 1132
Query: 281 LGTNYNY-SEDVTCRGRILLFDIIEVVPEPGQPLTKNK----------IKMIYAKEQKGP 329
+GT Y Y ED +GRIL ++E +P K+ ++ + +G
Sbjct: 1133 IGTAYAYPDEDEPTQGRIL---VVECNSGEAEPHLKSDDDMEDTYSRYVRHVTQMPTRGG 1189
Query: 330 VTAIC-HVAGFLVTAVGQKIYIWQLK--DNDLTGIAFIDT-------EVYIASMVS---- 375
V +I G ++ V K ++ +L + + + F+ +++ S+
Sbjct: 1190 VYSISPFYGGTVLATVNSKTHLCRLSIGCDQIGELKFVGAGHHGHMLSLFVKSLAGSESE 1249
Query: 376 ---------VKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRG 426
K L +VGD RSI+L+ YQP++ + +ARDY
Sbjct: 1250 SESSGTNRQAKQLAIVGDLMRSISLVEYQPKHNVIEELARDYNAN--------------- 1294
Query: 427 IIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEAR 486
C + + ++ ++ S+ N+ + + A
Sbjct: 1295 --------------------FCTAV--------EMLTNGTYLGSEGFNNLFVLRHNANAS 1326
Query: 487 ESNGGHRLIKKTDFHLGQHVNTFFKIR-CKPSSISDAPGARSRFL---TWYASLDGALGF 542
RL ++HLG+ N F PS+ GA++ ++ T + ++DG++G
Sbjct: 1327 SEEARVRLDTVGEYHLGEMTNKFMGGSLIMPSNSGGIMGAQNAYVGSQTLFGTVDGSIGS 1386
Query: 543 FLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKF 602
L L + L LQ +++ G ++ +R ++ + PSRG IDG L+ F
Sbjct: 1387 VLGLDGPTFAFLACLQRAILSIVKTVGDISHEEYRAFRAERQV--RPSRGFIDGDLIETF 1444
Query: 603 LQLS 606
L L+
Sbjct: 1445 LDLN 1448
>gi|255956643|ref|XP_002569074.1| Pc21g20880 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211590785|emb|CAP96985.1| Pc21g20880 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 1140
Score = 76.6 bits (187), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 81/365 (22%), Positives = 151/365 (41%), Gaps = 61/365 (16%)
Query: 275 LRGYIALGTNY-NYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAI 333
++ +GT + + +D + RGRIL+ ++ + G+ L++ + + +
Sbjct: 828 MKDRFVVGTAFADEEQDESIRGRILILEV-----DHGRKLSQVAELPVMGACRALAMMGD 882
Query: 334 CHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLR 393
C VA + T V ++ I + L +A T ++ V +LI V D +S+ L+R
Sbjct: 883 CVVAALVKTVVVYRVKINNVGPMKLEKLAAYRTSTAPVDVIVVDDLIAVADLMKSLCLVR 942
Query: 394 YQP----EYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICK 449
Y P E L+ V R Y+ VW +G+
Sbjct: 943 YTPGHAGEPAKLTEVGRHYQT----------------------VWSTAIACVGDET---- 976
Query: 450 KIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 509
F+ SD + N+++ + HRL+ ++ LG+ VN
Sbjct: 977 -----------------FLQSDAEGNLIVLSRNMNGVTAQDKHRLMPTSEISLGEMVN-- 1017
Query: 510 FKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
R +P +I + A+++G++ F + ++ L+ LQ + T + G
Sbjct: 1018 ---RIRPVNIPQLSSVMVTPRAFMATVEGSIFLFAVINPEHQDFLMTLQASLSTKINSLG 1074
Query: 570 GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELY 629
L+ FR+++ A P R +DG L+ +FL S + EI ++IGS +D+++
Sbjct: 1075 NLSFDKFRSFRTMVRSAEAPYR-FVDGELIEQFLNCSPSMQEEIVQEIGS--SDVVEVKR 1131
Query: 630 DIEAL 634
IEAL
Sbjct: 1132 MIEAL 1136
>gi|12082087|dbj|BAB20761.1| UV-damaged DNA binding protein [Oryza sativa Japonica Group]
Length = 1090
Score = 76.3 bits (186), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 90/382 (23%), Positives = 157/382 (41%), Gaps = 83/382 (21%)
Query: 250 FPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNY-SEDVTCRGRILLFDIIEVVPE 308
+ L ++EH + + S + + Y +GT Y E+ +GRIL+F + E
Sbjct: 762 YQLDQYEHGCSIISCSFSDDNNV-----YYCVGTAYVLPEENEPSKGRILVFAV-----E 811
Query: 309 PGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKI--YIWQLKDNDLTGIAFIDT 366
G ++++I KE KG V ++ G L+ A+ QKI Y W L+++ G + +
Sbjct: 812 DG------RLQLIVEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRED---GSHELQS 862
Query: 367 EV----YIASMVSVK--NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYA 420
E +I ++ + + I+VGD +SI+LL Y+ E + +ARDY
Sbjct: 863 ECGHHGHILALYTQTRGDFIVVGDLMKSISLLVYKHEESAIEELARDYNAN--------- 913
Query: 421 GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFM 480
W +S E L+ IG+++N N+
Sbjct: 914 -------------W----MSAVEMLDDEIYIGAENN-----------------YNIFTVR 939
Query: 481 YQPEARESNGGHRLIKKTDFHLGQHVNTF----FKIRCKPSSISDAPGARSRFLTWYASL 536
+A RL ++HLG+ N F +R S + P + ++
Sbjct: 940 KNSDAATDEERGRLEVVGEYHLGEFGNRFRHGSLVMRLPDSEMGQIP------TVIFGTI 993
Query: 537 DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDG 596
+G +G LP + Y L LQ+ +V G L+ +R++ + +R +DG
Sbjct: 994 NGVIGIIASLPHEQYVFLEKLQSTLVKFIKGVGNLSHEQWRSFHNDKKTS--EARNFLDG 1051
Query: 597 SLVWKFLQLSLGERLEICKKIG 618
L+ FL LS + E+ K +G
Sbjct: 1052 DLIESFLDLSRNKMEEVAKGMG 1073
>gi|298711490|emb|CBJ26578.1| n/a [Ectocarpus siliculosus]
Length = 1135
Score = 76.3 bits (186), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 118/540 (21%), Positives = 209/540 (38%), Gaps = 100/540 (18%)
Query: 97 GYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRI 156
G VF+ PA ++ S G+L + + G V+++ F + P L +++ L I
Sbjct: 664 GMVCVFVASDRPAVIY-CSGGKLLYANVNM-GEVNSVCSFDSSELPH-CLALASENSLTI 720
Query: 157 SVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELV 216
+ ++KV L P + +H + + I+T T Y D+E
Sbjct: 721 GTIDDI----QKLHIQKVSLGEAPQRITHHDSGRMFGIIT------TSYRAVENSDEE-- 768
Query: 217 TDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLR 276
+ F V ++EE+ PL +E+ SM + +
Sbjct: 769 ---EEHNF---------VKFLDDTNFEEL--YCHPLDAFEN-----GSSMVSCVFANDKK 809
Query: 277 GYIALGTNYNYSEDVT-CRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICH 335
Y+ +GT Y ++ GR+L+F + GQ + K+ + E +G V +
Sbjct: 810 EYLVVGTGYVREDECEPAVGRLLVFSV------EGQG-AERKVDLAAEVETRGAVYVLNG 862
Query: 336 VAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTE------VYIASMVSVKNLILVGDYARSI 389
G L+ + K+ +++ + D GI + TE + M S + I+VGD RS+
Sbjct: 863 FNGKLLACINSKVQLFRWIEKD-DGIQELQTECGYHGHILALHMQSRGDFIIVGDLMRSV 921
Query: 390 ALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICK 449
+LL Y+ + VARDY W ++ E L
Sbjct: 922 SLLVYKAVDGAIEEVARDYHAN----------------------W----MTAVEML---- 951
Query: 450 KIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 509
++D+ ++ + D N+ +A RL + +FHLG+ VN F
Sbjct: 952 -----NDDV--------YIGGEADCNIFTLRRNADAATEEERARLEIQGEFHLGEFVNKF 998
Query: 510 FKIRC-KPSSISDAPGARSRFLT-----WYASLDGALGFFLPLPEKNYRRLLMLQNVMVT 563
+ SS ++PG L + +++G +G L L E N+R L LQ M
Sbjct: 999 CRGSLLMQSSEVNSPGGMDSPLVKGQPLLFGTVNGMVGTILTLTEDNHRFLAQLQTAMTK 1058
Query: 564 HTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHND 623
GG + +R++ + PS IDG LV +L + + E+ + + + D
Sbjct: 1059 VVKGVGGFSHDEWRSFTNGRRTS--PSSNFIDGDLVESYLDMPRHNQEEVLRHVDTPVGD 1116
>gi|350537001|ref|NP_001234275.1| DNA damage-binding protein 1 [Solanum lycopersicum]
gi|350539125|ref|NP_001233864.1| UV damaged DNA binding protein 1 [Solanum lycopersicum]
gi|55976440|sp|Q6QNU4.1|DDB1_SOLLC RecName: Full=DNA damage-binding protein 1; AltName: Full=High
pigmentation protein 1; AltName: Full=UV-damaged
DNA-binding protein 1
gi|38455768|gb|AAR20885.1| UV damaged DNA binding protein 1 [Solanum lycopersicum]
gi|42602165|gb|AAS21683.1| UV-damaged DNA binding protein 1 [Solanum lycopersicum]
Length = 1090
Score = 76.3 bits (186), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 109/499 (21%), Positives = 192/499 (38%), Gaps = 107/499 (21%)
Query: 130 VSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLET 189
VS + PF+ P L + EL I + +R +PL +++ +T
Sbjct: 670 VSHMCPFNVAAFPDS-LAIAKEGELTIGTIDEI----QKLHIRSIPLGEHARRISHQEQT 724
Query: 190 KTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTN 249
+T+ + S Y + N +D E+ V L ++E I +
Sbjct: 725 RTFALC------SVKYTQSNADDPEM----------------HFVRLLDDQTFEFI--ST 760
Query: 250 FPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNY-SEDVTCRGRILLFDIIEVVPE 308
+PL ++E+ + + S + + Y +GT Y E+ +GRIL+F I+E
Sbjct: 761 YPLDQFEYGCSILSCSFSDDSNV-----YYCIGTAYVMPEENEPTKGRILVF-IVE---- 810
Query: 309 PGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEV 368
K+++I KE KG V ++ G L+ A+ QKI +++ + G + TE
Sbjct: 811 ------DGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWASREDGGSRELQTEC 864
Query: 369 -----YIASMVSVK-NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGN 422
+A V + + I+VGD +SI+LL ++ E + ARDY ++
Sbjct: 865 GHHGHILALYVQTRGDFIVVGDLMKSISLLIFKHEEGAIEERARDYNANWMSAV------ 918
Query: 423 PSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQ 482
+ILD+ +G ++ + N+
Sbjct: 919 ----------------------------------EILDDDIYLG---AENNFNLFTVRKN 941
Query: 483 PEARESNGGHRLIKKTDFHLGQHVNTF----FKIRCKPSSISDAPGARSRFLTWYASLDG 538
E RL ++HLG+ VN F +R S + P + +++G
Sbjct: 942 SEGATDEERSRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIP------TVIFGTVNG 995
Query: 539 ALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSL 598
+G LP Y L LQ + GGL+ +R++ + ++ +DG L
Sbjct: 996 VIGVIASLPHDQYLFLEKLQTNLRKVIKGVGGLSHEQWRSFYNEKKTV--DAKNFLDGDL 1053
Query: 599 VWKFLQLSLGERLEICKKI 617
+ FL LS EI K +
Sbjct: 1054 IESFLDLSRNRMEEISKAM 1072
>gi|55976392|sp|Q6E7D1.1|DDB1_SOLCE RecName: Full=DNA damage-binding protein 1; AltName: Full=UV-damaged
DNA-binding protein 1
gi|49484911|gb|AAT66742.1| UV-damaged DNA binding protein 1 [Solanum cheesmaniae]
Length = 1095
Score = 75.9 bits (185), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 109/499 (21%), Positives = 192/499 (38%), Gaps = 107/499 (21%)
Query: 130 VSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLET 189
VS + PF+ P L + EL I + +R +PL +++ +T
Sbjct: 675 VSHMCPFNVAAFPDS-LAIAKEGELTIGTIDEI----QKLHIRSIPLGEHARRISHQEQT 729
Query: 190 KTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTN 249
+T+ + S Y + N +D E+ V L ++E I +
Sbjct: 730 RTFALC------SVKYTQSNADDPEM----------------HFVRLLDDQTFEFI--ST 765
Query: 250 FPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNY-SEDVTCRGRILLFDIIEVVPE 308
+PL ++E+ + + S + + Y +GT Y E+ +GRIL+F I+E
Sbjct: 766 YPLDQFEYGCSILSCSFSDDSNV-----YYCIGTAYVMPEENEPTKGRILVF-IVE---- 815
Query: 309 PGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEV 368
K+++I KE KG V ++ G L+ A+ QKI +++ + G + TE
Sbjct: 816 ------DGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWASREDGGSRELQTEC 869
Query: 369 -----YIASMVSVK-NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGN 422
+A V + + I+VGD +SI+LL ++ E + ARDY ++
Sbjct: 870 GHHGHILALYVQTRGDFIVVGDLMKSISLLIFKHEEGAIEERARDYNANWMSAV------ 923
Query: 423 PSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQ 482
+ILD+ +G ++ + N+
Sbjct: 924 ----------------------------------EILDDDIYLG---AENNFNLFTVRKN 946
Query: 483 PEARESNGGHRLIKKTDFHLGQHVNTF----FKIRCKPSSISDAPGARSRFLTWYASLDG 538
E RL ++HLG+ VN F +R S + P + +++G
Sbjct: 947 SEGATDEERSRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIP------TVIFGTVNG 1000
Query: 539 ALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSL 598
+G LP Y L LQ + GGL+ +R++ + ++ +DG L
Sbjct: 1001 VIGVIASLPHDQYLFLEKLQTNLRKVIKGVGGLSHEQWRSFYNEKKTV--DAKNFLDGDL 1058
Query: 599 VWKFLQLSLGERLEICKKI 617
+ FL LS EI K +
Sbjct: 1059 IESFLDLSRNRMEEISKAM 1077
>gi|224061051|ref|XP_002300334.1| predicted protein [Populus trichocarpa]
gi|222847592|gb|EEE85139.1| predicted protein [Populus trichocarpa]
Length = 1088
Score = 75.9 bits (185), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 113/501 (22%), Positives = 196/501 (39%), Gaps = 113/501 (22%)
Query: 130 VSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLET 189
VS + PF++ P L + EL I + +R +PL + + ++
Sbjct: 670 VSHMCPFNSAAFPDS-LAIAKEGELSIGTIDDI----QKLHIRSIPLGEHARRICHQEQS 724
Query: 190 KTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTN 249
+T+ I + + N E+ E+ FI L Q ++E I +
Sbjct: 725 RTFSICSMKNQS-------NAEESEM-------HFIRLLDDQ---------TFEFI--ST 759
Query: 250 FPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNY-SEDVTCRGRILLFDIIEVVPE 308
+PL +E+ + + S + + Y +GT Y E+ +GRIL+F + E
Sbjct: 760 YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 809
Query: 309 PGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKI--YIWQLKDNDLTGIAFIDT 366
G K+++I KE KG V ++ G L+ A+ QKI Y W L+D+ G + +
Sbjct: 810 DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDD---GTRELQS 860
Query: 367 EV-----YIASMVSVK-NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYA 420
E +A V + + I+VGD +SI+LL Y+ E + ARDY ++
Sbjct: 861 ECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAV---- 916
Query: 421 GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFM 480
+ILD+ +G ++ + N+
Sbjct: 917 ------------------------------------EILDDDIYLG---AENNFNLFTVR 937
Query: 481 YQPEARESNGGHRLIKKTDFHLGQHVNTF----FKIRCKPSSISDAPGARSRFLTWYASL 536
E RL ++HLG+ VN F +R S + P + ++
Sbjct: 938 KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIP------TVIFGTV 991
Query: 537 DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDG 596
+G +G LP + Y L LQ+ + GGL+ +R++ + ++ +DG
Sbjct: 992 NGVIGVIASLPHEQYLFLEKLQSNLRKVIKGVGGLSHEQWRSFNNEKKTV--DAKNFLDG 1049
Query: 597 SLVWKFLQLSLGERLEICKKI 617
L+ FL LS EI K +
Sbjct: 1050 DLIESFLDLSRSRMDEISKAM 1070
>gi|168047617|ref|XP_001776266.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162672361|gb|EDQ58899.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1089
Score = 75.5 bits (184), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 112/496 (22%), Positives = 190/496 (38%), Gaps = 108/496 (21%)
Query: 130 VSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLET 189
V+ + PF++ + P L + EL I + +R VPL P +A+ ++
Sbjct: 670 VNHMCPFNSASFPDS-LAIGKEGELTIGTIDDI----QKLHIRTVPLGERPCRIAHQEQS 724
Query: 190 KTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTN 249
+++ I ++ N ED E +V L ++E +
Sbjct: 725 RSFAICSAKYSQGP-----NNEDIE----------------THYVRLIEDQTFE--ITSG 761
Query: 250 FPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVT-CRGRILLFDIIEVVPE 308
F L +E + S + + Y +GT Y E+ +GRIL+F + E
Sbjct: 762 FALDLYEIGCSIITCSFTDDSNV-----YYCVGTAYALPEESEPTKGRILVF-----LVE 811
Query: 309 PGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKI--YIWQLKDNDLTGIAFIDT 366
G K++++ KE KG V + G L+ + QKI Y W L+D T + I++
Sbjct: 812 DG------KLQLVAEKEMKGAVYNLNAFNGKLLAGINQKIALYKWTLRDG--TRVLEIES 863
Query: 367 E----VYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGN 422
+ + S + I+VGD +SI+LL Y+PE + ARDY
Sbjct: 864 SHHGHILALYVQSRGDFIVVGDLMKSISLLIYKPEEGAIEERARDYNAN----------- 912
Query: 423 PSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQ 482
W +ILD+ + +G ++ N+
Sbjct: 913 -----------WM------------------TAVEILDDDTYLG---AENSFNLFTVRKN 940
Query: 483 PEARESNGGHRLIKKTDFHLGQHVNTF----FKIRCKPSSISDAPGARSRFLTWYASLDG 538
+A RL ++HLG+ VN F +R S S P + +++G
Sbjct: 941 NDAATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSEASLIP------TVIFGTVNG 994
Query: 539 ALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSL 598
+G LP+ + L LQ +V GGL+ +R++ + +R +DG L
Sbjct: 995 VIGVIASLPQDKFLFLQKLQQALVKVIKGVGGLSHEQWRSFSNERKTV--DARNFLDGDL 1052
Query: 599 VWKFLQLSLGERLEIC 614
+ FL LS + EI
Sbjct: 1053 IESFLDLSRNKMEEIA 1068
>gi|449519304|ref|XP_004166675.1| PREDICTED: DNA damage-binding protein 1a-like [Cucumis sativus]
Length = 596
Score = 75.5 bits (184), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 115/501 (22%), Positives = 195/501 (38%), Gaps = 112/501 (22%)
Query: 130 VSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLET 189
VS + PF++ P L + EL I + +R +PL + + ++
Sbjct: 177 VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHARRICHQEQS 231
Query: 190 KTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTN 249
+T+ I S Y + ED E+ FI L Q ++E I +
Sbjct: 232 RTFAIC------SLRYNQSGTEDTEM-------HFIRLLDDQ---------TFESI--ST 267
Query: 250 FPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNY-SEDVTCRGRILLFDIIEVVPE 308
+ L +E+ + + S + + Y +GT Y E+ +GRIL+F V E
Sbjct: 268 YALDTYEYGCSILSCSFSDDNNV-----YYCVGTAYVMPEENEPTKGRILVF-----VVE 317
Query: 309 PGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKI--YIWQLKDNDLTGIAFIDT 366
G K+++I KE KG V ++ G L+ A+ QKI Y W L+D+ G + +
Sbjct: 318 EG------KLQLIAEKETKGSVYSLNAFNGKLLAAINQKIQLYKWTLRDD---GTRELQS 368
Query: 367 EV-----YIASMVSVK-NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYA 420
E +A V + + I+VGD +SI+LL Y+ E + ARDY ++
Sbjct: 369 ECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAV---- 424
Query: 421 GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFM 480
+ILD+ +G ++ N+
Sbjct: 425 ------------------------------------EILDDDIYLG---AENYFNLFTVR 445
Query: 481 YQPEARESNGGHRLIKKTDFHLGQHVNTF----FKIRCKPSSISDAPGARSRFLTWYASL 536
E RL ++HLG+ VN F +R S + P + S+
Sbjct: 446 KNSEGATDEERSRLEVVGEYHLGEFVNRFQHGSLVMRLPDSDVGQIP------TVIFGSV 499
Query: 537 DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDG 596
+G +G LP Y L LQ+ + GGL+ +R++ + A ++ +DG
Sbjct: 500 NGVIGVIASLPHDQYVFLERLQSNLRKVIKGVGGLSHEQWRSFNNEKRTA--EAKNFLDG 557
Query: 597 SLVWKFLQLSLGERLEICKKI 617
L+ FL L+ + EI + +
Sbjct: 558 DLIESFLDLNRSKMEEISRAM 578
>gi|449435512|ref|XP_004135539.1| PREDICTED: DNA damage-binding protein 1-like [Cucumis sativus]
Length = 1093
Score = 75.1 bits (183), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 116/518 (22%), Positives = 201/518 (38%), Gaps = 112/518 (21%)
Query: 130 VSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLET 189
VS + PF++ P L + EL I + +R +PL + + ++
Sbjct: 674 VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHARRICHQEQS 728
Query: 190 KTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTN 249
+T+ I S Y + ED E+ FI L Q ++E I +
Sbjct: 729 RTFAIC------SLRYNQSGTEDTEM-------HFIRLLDDQ---------TFESI--ST 764
Query: 250 FPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNY-SEDVTCRGRILLFDIIEVVPE 308
+ L +E+ + + S + + Y +GT Y E+ +GRIL+F V E
Sbjct: 765 YALDTYEYGCSILSCSFSDDNNV-----YYCVGTAYVMPEENEPTKGRILVF-----VVE 814
Query: 309 PGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKI--YIWQLKDNDLTGIAFIDT 366
G K+++I KE KG V ++ G L+ A+ QKI Y W L+D+ G + +
Sbjct: 815 EG------KLQLIAEKETKGSVYSLNAFNGKLLAAINQKIQLYKWTLRDD---GTRELQS 865
Query: 367 EV-----YIASMVSVK-NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYA 420
E +A V + + I+VGD +SI+LL Y+ E + ARDY ++
Sbjct: 866 ECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAV---- 921
Query: 421 GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFM 480
+ILD+ +G ++ N+
Sbjct: 922 ------------------------------------EILDDDIYLG---AENYFNLFTVR 942
Query: 481 YQPEARESNGGHRLIKKTDFHLGQHVNTF----FKIRCKPSSISDAPGARSRFLTWYASL 536
E RL ++HLG+ VN F +R S + P + S+
Sbjct: 943 KNSEGATDEERSRLEVVGEYHLGEFVNRFQHGSLVMRLPDSDVGQIP------TVIFGSV 996
Query: 537 DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDG 596
+G +G LP Y L LQ+ + GGL+ +R++ + A ++ +DG
Sbjct: 997 NGVIGVIASLPHDQYVFLERLQSNLRKVIKGVGGLSHEQWRSFNNEKRTA--EAKNFLDG 1054
Query: 597 SLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEAL 634
L+ FL L+ + EI + + ++ + ++ L
Sbjct: 1055 DLIESFLDLNRSKMEEISRAMSVSAEELCKRVEELTRL 1092
>gi|356512636|ref|XP_003525024.1| PREDICTED: DNA damage-binding protein 1a-like isoform 1 [Glycine max]
Length = 1089
Score = 74.7 bits (182), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 111/501 (22%), Positives = 194/501 (38%), Gaps = 112/501 (22%)
Query: 130 VSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLET 189
VS + PF++ P L + EL I + +R +PL + + ++
Sbjct: 670 VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHARRICHQEQS 724
Query: 190 KTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTN 249
+T+ I + P++ GED E+ V L ++E I +
Sbjct: 725 RTFAICSLKYNPAS------GEDSEM----------------HFVRLLDDQTFEFI--ST 760
Query: 250 FPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNY-SEDVTCRGRILLFDIIEVVPE 308
+ L +E+ + + S + + Y +GT Y E+ +GRIL+F + E
Sbjct: 761 YSLDTYEYGCFIISCSFSDDNNV-----YYCVGTAYVLPEENEPTKGRILVFAV-----E 810
Query: 309 PGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKI--YIWQLKDNDLTGIAFIDT 366
G K+++I KE KG V + G L+ A+ QKI Y W L+D+ G + +
Sbjct: 811 DG------KLQLIAEKETKGAVYCLNAFNGKLLAAINQKIQLYKWVLRDD---GTHELQS 861
Query: 367 EV-----YIASMVSVK-NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYA 420
E +A V + + I+VGD +SI+LL Y+ E + ARDY ++
Sbjct: 862 ECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAV---- 917
Query: 421 GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFM 480
+I+D+ +G ++ N+
Sbjct: 918 ------------------------------------EIVDDDIYLG---AENSFNLFTVR 938
Query: 481 YQPEARESNGGHRLIKKTDFHLGQHVNTF----FKIRCKPSSISDAPGARSRFLTWYASL 536
E RL ++HLG+ VN F +R S + P + ++
Sbjct: 939 KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIP------TVIFGTI 992
Query: 537 DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDG 596
+G +G LP + Y L LQ+ + GGL+ +R++ + +R +DG
Sbjct: 993 NGVIGVIASLPHEQYVFLEKLQSNLRKVIKGVGGLSHEQWRSFNNEKKTV--EARNFLDG 1050
Query: 597 SLVWKFLQLSLGERLEICKKI 617
L+ FL L+ + EI K +
Sbjct: 1051 DLIESFLDLNRSKMDEISKAL 1071
>gi|356512638|ref|XP_003525025.1| PREDICTED: DNA damage-binding protein 1a-like isoform 2 [Glycine max]
Length = 1068
Score = 74.7 bits (182), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 111/501 (22%), Positives = 194/501 (38%), Gaps = 112/501 (22%)
Query: 130 VSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLET 189
VS + PF++ P L + EL I + +R +PL + + ++
Sbjct: 649 VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHARRICHQEQS 703
Query: 190 KTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTN 249
+T+ I + P++ GED E+ V L ++E I +
Sbjct: 704 RTFAICSLKYNPAS------GEDSEM----------------HFVRLLDDQTFEFI--ST 739
Query: 250 FPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNY-SEDVTCRGRILLFDIIEVVPE 308
+ L +E+ + + S + + Y +GT Y E+ +GRIL+F + E
Sbjct: 740 YSLDTYEYGCFIISCSFSDDNNV-----YYCVGTAYVLPEENEPTKGRILVFAV-----E 789
Query: 309 PGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKI--YIWQLKDNDLTGIAFIDT 366
G K+++I KE KG V + G L+ A+ QKI Y W L+D+ G + +
Sbjct: 790 DG------KLQLIAEKETKGAVYCLNAFNGKLLAAINQKIQLYKWVLRDD---GTHELQS 840
Query: 367 EV-----YIASMVSVK-NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYA 420
E +A V + + I+VGD +SI+LL Y+ E + ARDY ++
Sbjct: 841 ECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAV---- 896
Query: 421 GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFM 480
+I+D+ +G ++ N+
Sbjct: 897 ------------------------------------EIVDDDIYLG---AENSFNLFTVR 917
Query: 481 YQPEARESNGGHRLIKKTDFHLGQHVNTF----FKIRCKPSSISDAPGARSRFLTWYASL 536
E RL ++HLG+ VN F +R S + P + ++
Sbjct: 918 KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIP------TVIFGTI 971
Query: 537 DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDG 596
+G +G LP + Y L LQ+ + GGL+ +R++ + +R +DG
Sbjct: 972 NGVIGVIASLPHEQYVFLEKLQSNLRKVIKGVGGLSHEQWRSFNNEKKTV--EARNFLDG 1029
Query: 597 SLVWKFLQLSLGERLEICKKI 617
L+ FL L+ + EI K +
Sbjct: 1030 DLIESFLDLNRSKMDEISKAL 1050
>gi|356525401|ref|XP_003531313.1| PREDICTED: DNA damage-binding protein 1-like isoform 1 [Glycine max]
Length = 1089
Score = 74.3 bits (181), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 110/501 (21%), Positives = 194/501 (38%), Gaps = 112/501 (22%)
Query: 130 VSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLET 189
VS + PF++ P L + EL I + +R +PL + + ++
Sbjct: 670 VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHARRICHQEQS 724
Query: 190 KTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTN 249
+T+ I + P++ GED E+ V L ++E I +
Sbjct: 725 RTFAICSLKYNPAS------GEDSEM----------------HFVRLLDDQTFEFI--ST 760
Query: 250 FPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNY-SEDVTCRGRILLFDIIEVVPE 308
+ L +E+ + + S + + Y +GT Y E+ +GRI++F + E
Sbjct: 761 YSLDTYEYGCFIISCSFSDDNNV-----YYCVGTAYVLPEENEPTKGRIIVFAV-----E 810
Query: 309 PGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKI--YIWQLKDNDLTGIAFIDT 366
G K+++I KE KG V + G L+ A+ QKI Y W L+D+ G + +
Sbjct: 811 DG------KLQLIAEKETKGAVYCLNAFNGKLLAAINQKIQLYKWVLRDD---GTHELQS 861
Query: 367 EV-----YIASMVSVK-NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYA 420
E +A V + + I+VGD +SI+LL Y+ E + ARDY ++
Sbjct: 862 ECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAV---- 917
Query: 421 GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFM 480
+I+D+ +G ++ N+
Sbjct: 918 ------------------------------------EIVDDDIYLG---AENSFNLFTVR 938
Query: 481 YQPEARESNGGHRLIKKTDFHLGQHVNTF----FKIRCKPSSISDAPGARSRFLTWYASL 536
E RL ++HLG+ VN F +R S + P + ++
Sbjct: 939 KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIP------TVIFGTI 992
Query: 537 DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDG 596
+G +G LP + Y L LQ+ + GGL+ +R++ + +R +DG
Sbjct: 993 NGVIGVIASLPHEQYVFLEKLQSNLRKVIKGVGGLSHEQWRSFNNEKKTV--EARNFLDG 1050
Query: 597 SLVWKFLQLSLGERLEICKKI 617
L+ FL L+ + EI K +
Sbjct: 1051 DLIESFLDLNRSKMDEISKAV 1071
>gi|356525403|ref|XP_003531314.1| PREDICTED: DNA damage-binding protein 1-like isoform 2 [Glycine max]
Length = 1068
Score = 74.3 bits (181), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 110/501 (21%), Positives = 194/501 (38%), Gaps = 112/501 (22%)
Query: 130 VSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLET 189
VS + PF++ P L + EL I + +R +PL + + ++
Sbjct: 649 VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHARRICHQEQS 703
Query: 190 KTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTN 249
+T+ I + P++ GED E+ V L ++E I +
Sbjct: 704 RTFAICSLKYNPAS------GEDSEM----------------HFVRLLDDQTFEFI--ST 739
Query: 250 FPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNY-SEDVTCRGRILLFDIIEVVPE 308
+ L +E+ + + S + + Y +GT Y E+ +GRI++F + E
Sbjct: 740 YSLDTYEYGCFIISCSFSDDNNV-----YYCVGTAYVLPEENEPTKGRIIVFAV-----E 789
Query: 309 PGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKI--YIWQLKDNDLTGIAFIDT 366
G K+++I KE KG V + G L+ A+ QKI Y W L+D+ G + +
Sbjct: 790 DG------KLQLIAEKETKGAVYCLNAFNGKLLAAINQKIQLYKWVLRDD---GTHELQS 840
Query: 367 EV-----YIASMVSVK-NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYA 420
E +A V + + I+VGD +SI+LL Y+ E + ARDY ++
Sbjct: 841 ECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAV---- 896
Query: 421 GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFM 480
+I+D+ +G ++ N+
Sbjct: 897 ------------------------------------EIVDDDIYLG---AENSFNLFTVR 917
Query: 481 YQPEARESNGGHRLIKKTDFHLGQHVNTF----FKIRCKPSSISDAPGARSRFLTWYASL 536
E RL ++HLG+ VN F +R S + P + ++
Sbjct: 918 KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIP------TVIFGTI 971
Query: 537 DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDG 596
+G +G LP + Y L LQ+ + GGL+ +R++ + +R +DG
Sbjct: 972 NGVIGVIASLPHEQYVFLEKLQSNLRKVIKGVGGLSHEQWRSFNNEKKTV--EARNFLDG 1029
Query: 597 SLVWKFLQLSLGERLEICKKI 617
L+ FL L+ + EI K +
Sbjct: 1030 DLIESFLDLNRSKMDEISKAV 1050
>gi|255571318|ref|XP_002526608.1| DNA repair protein xp-E, putative [Ricinus communis]
gi|223534048|gb|EEF35767.1| DNA repair protein xp-E, putative [Ricinus communis]
Length = 1033
Score = 73.9 bits (180), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 88/354 (24%), Positives = 143/354 (40%), Gaps = 78/354 (22%)
Query: 278 YIALGTNYNY-SEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
Y +GT Y E+ +GRIL+F + E G K+++I KE KG V ++
Sbjct: 728 YYCVGTAYVMPEENEPTKGRILVF-----LVEDG------KLQVITEKETKGAVYSLNSF 776
Query: 337 AGFLVTAVGQKI--YIWQLKDNDLTGIAFIDTEV-----YIASMVSVK-NLILVGDYARS 388
G L+ A+ QKI Y W L+D+ G + +E +A V + + I+VGD +S
Sbjct: 777 NGKLLAAINQKIQLYKWMLRDD---GSRELQSECGHHGHILALYVQTRGDFIVVGDLMKS 833
Query: 389 IALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEIC 448
I+LL Y+ E + ARDY ++
Sbjct: 834 ISLLIYKHEEGAIEERARDYNANWMSAV-------------------------------- 861
Query: 449 KKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNT 508
+ILD+ +G ++ + N+ E RL ++HLG+ VN
Sbjct: 862 --------EILDDDIYLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 910
Query: 509 F----FKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 564
F +R S + P + +++G +G LP + Y L LQ+ +
Sbjct: 911 FRHGSLVMRLPDSDVGQIP------TVIFGTVNGVIGVIASLPHEQYIFLEKLQSNLRRV 964
Query: 565 TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 618
GGL+ +R++ + ++ +DG L+ FL LS EI K IG
Sbjct: 965 IKGVGGLSHEQWRSFNNEKKTV--EAKNFLDGDLIESFLDLSRNRMDEISKAIG 1016
>gi|225443990|ref|XP_002280735.1| PREDICTED: DNA damage-binding protein 1 isoform 1 [Vitis vinifera]
Length = 1089
Score = 73.6 bits (179), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 113/502 (22%), Positives = 195/502 (38%), Gaps = 112/502 (22%)
Query: 130 VSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLET 189
VS + PF++ P L + +L I + +R +PL + + ++
Sbjct: 670 VSHMCPFNSAAFPDS-LAIAKEGDLTIGTIDDI----QKLHIRSIPLGEHARRICHQEQS 724
Query: 190 KTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTN 249
+T+ I S Y + + ED E+ FI L Q ++E I +
Sbjct: 725 RTFAIC------SLKYNQSSTEDSEM-------HFIRLLDDQ---------TFEFI--ST 760
Query: 250 FPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNY-SEDVTCRGRILLFDIIEVVPE 308
+PL +E+ + + S + + Y +GT Y E+ +GRIL+F + E
Sbjct: 761 YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 810
Query: 309 PGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKI--YIWQLKDNDLTGIAFIDT 366
G K+++I KE KG V ++ G L+ A+ QKI Y W L+D+ G + +
Sbjct: 811 DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDD---GTRELQS 861
Query: 367 EV-----YIASMVSVK-NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYA 420
E +A V + + I+VGD +SI+LL Y+ E + ARDY ++
Sbjct: 862 ESGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAV---- 917
Query: 421 GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFM 480
+ILD+ +G ++ + N+
Sbjct: 918 ------------------------------------EILDDDIYLG---AENNFNIFTVR 938
Query: 481 YQPEARESNGGHRLIKKTDFHLGQHVNTF----FKIRCKPSSISDAPGARSRFLTWYASL 536
E RL ++HLG+ VN F +R S + P + ++
Sbjct: 939 KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIP------TVIFGTV 992
Query: 537 DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDG 596
+G +G LP Y L LQ + GGL+ +R++ + ++ +DG
Sbjct: 993 NGVIGVIASLPHDQYVFLEKLQANLRKVIKGVGGLSHEQWRSFNNEKKTV--DAKNFLDG 1050
Query: 597 SLVWKFLQLSLGERLEICKKIG 618
L+ FL L+ EI K +
Sbjct: 1051 DLIETFLDLNRTRMDEISKAMA 1072
>gi|226510488|ref|NP_001145925.1| uncharacterized protein LOC100279448 [Zea mays]
gi|219884971|gb|ACL52860.1| unknown [Zea mays]
Length = 416
Score = 73.6 bits (179), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 88/382 (23%), Positives = 154/382 (40%), Gaps = 83/382 (21%)
Query: 250 FPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNY-SEDVTCRGRILLFDIIEVVPE 308
+PL ++E + + S + + Y +GT Y E+ +GRIL+F + E
Sbjct: 88 YPLDQYECGCSIISCSFADDSNV-----YYCVGTAYVIPEENEPTKGRILVFAV-----E 137
Query: 309 PGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKI--YIWQLKDNDLTGIAFIDT 366
G +++I KE KG V ++ G L+ A+ QKI Y W +++ G + +
Sbjct: 138 DG------SLQLIVEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMSRED---GSHELQS 188
Query: 367 EV----YIASMVSVK--NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYA 420
E +I ++ + + I+VGD +SI+LL Y+ E + ARDY +
Sbjct: 189 ECGHHGHILALYTQTRGDFIVVGDLMKSISLLVYKHEESAIEERARDYNANWMTAV---- 244
Query: 421 GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFM 480
++LD+ +G ++ N+
Sbjct: 245 ------------------------------------EMLDDEVYVG---AENSYNLFTVR 265
Query: 481 YQPEARESNGGHRLIKKTDFHLGQHVNTF----FKIRCKPSSISDAPGARSRFLTWYASL 536
+A + RL ++HLG+ VN F +R S I P + ++
Sbjct: 266 KNSDAATDDERARLEVVGEYHLGEFVNRFRHGSLVMRLPDSDIGQIP------TVIFGTI 319
Query: 537 DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDG 596
+G +G LP Y L LQ+ +V + G L+ +R++ A +R +DG
Sbjct: 320 NGVIGIIASLPHDQYIFLEKLQSTLVKYIKGVGNLSHEQWRSFHNDKKTA--EARNFLDG 377
Query: 597 SLVWKFLQLSLGERLEICKKIG 618
L+ FL LS + E+ K +G
Sbjct: 378 DLIESFLDLSRSKMEEVSKAMG 399
>gi|225443992|ref|XP_002280744.1| PREDICTED: DNA damage-binding protein 1 isoform 2 [Vitis vinifera]
Length = 1068
Score = 73.6 bits (179), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 113/502 (22%), Positives = 195/502 (38%), Gaps = 112/502 (22%)
Query: 130 VSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLET 189
VS + PF++ P L + +L I + +R +PL + + ++
Sbjct: 649 VSHMCPFNSAAFPDS-LAIAKEGDLTIGTIDDI----QKLHIRSIPLGEHARRICHQEQS 703
Query: 190 KTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTN 249
+T+ I S Y + + ED E+ FI L Q ++E I +
Sbjct: 704 RTFAIC------SLKYNQSSTEDSEM-------HFIRLLDDQ---------TFEFI--ST 739
Query: 250 FPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNY-SEDVTCRGRILLFDIIEVVPE 308
+PL +E+ + + S + + Y +GT Y E+ +GRIL+F + E
Sbjct: 740 YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 789
Query: 309 PGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKI--YIWQLKDNDLTGIAFIDT 366
G K+++I KE KG V ++ G L+ A+ QKI Y W L+D+ G + +
Sbjct: 790 DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDD---GTRELQS 840
Query: 367 EV-----YIASMVSVK-NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYA 420
E +A V + + I+VGD +SI+LL Y+ E + ARDY ++
Sbjct: 841 ESGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAV---- 896
Query: 421 GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFM 480
+ILD+ +G ++ + N+
Sbjct: 897 ------------------------------------EILDDDIYLG---AENNFNIFTVR 917
Query: 481 YQPEARESNGGHRLIKKTDFHLGQHVNTF----FKIRCKPSSISDAPGARSRFLTWYASL 536
E RL ++HLG+ VN F +R S + P + ++
Sbjct: 918 KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIP------TVIFGTV 971
Query: 537 DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDG 596
+G +G LP Y L LQ + GGL+ +R++ + ++ +DG
Sbjct: 972 NGVIGVIASLPHDQYVFLEKLQANLRKVIKGVGGLSHEQWRSFNNEKKTV--DAKNFLDG 1029
Query: 597 SLVWKFLQLSLGERLEICKKIG 618
L+ FL L+ EI K +
Sbjct: 1030 DLIETFLDLNRTRMDEISKAMA 1051
>gi|303391353|ref|XP_003073906.1| pre-mRNA cleavage and polyadenylation specificity factor
[Encephalitozoon intestinalis ATCC 50506]
gi|303303055|gb|ADM12546.1| pre-mRNA cleavage and polyadenylation specificity factor
[Encephalitozoon intestinalis ATCC 50506]
Length = 601
Score = 73.6 bits (179), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 60/235 (25%), Positives = 112/235 (47%), Gaps = 19/235 (8%)
Query: 172 RKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQ 231
+K+P+ TP + Y Y +V S E ++ NG+D +P +
Sbjct: 210 KKIPVLRTPKHIEY---ADRYMVVASCEE--VEFSPKNGKDCG----------VPVNTYR 254
Query: 232 FHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDV 291
F+V L+S +E I + + L E E+V ++ + ++ G ++ + T + ED
Sbjct: 255 FYVDLYSE-KYEHI--STYELEENEYVFDIQYLVLDDMQGNYGKSPFLLVCTTFIEGEDR 311
Query: 292 TCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIW 351
+GR+ + +II VVP P K+K++ ++ KG + V G +V +G KI I+
Sbjct: 312 PAKGRLHVLEIISVVPSLESPFKDCKLKVLGIEKTKGSIVQCSEVRGKIVLCLGTKIMIY 371
Query: 352 QL-KDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVA 405
++ + + + I F D + +S+ VKN IL D R ++ +Q + L L++
Sbjct: 372 KIDRGSGIIPIGFHDLHTFTSSISVVKNYILASDIYRGLSFFFFQSKPIRLHLIS 426
>gi|413946716|gb|AFW79365.1| hypothetical protein ZEAMMB73_562969 [Zea mays]
Length = 1089
Score = 73.6 bits (179), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 90/382 (23%), Positives = 156/382 (40%), Gaps = 83/382 (21%)
Query: 250 FPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNY-SEDVTCRGRILLFDIIEVVPE 308
+PL ++E + + S + + Y +GT Y E+ +GRIL+F + E
Sbjct: 761 YPLDQYECGCSIISCSFADDSNV-----YYCVGTAYVIPEENEPTKGRILVFAV-----E 810
Query: 309 PGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKI--YIWQLKDNDLTGIAFIDT 366
G +++I KE KG V ++ G L+ A+ QKI Y W +++ G + +
Sbjct: 811 DG------SLQLIVEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMSRED---GSHELQS 861
Query: 367 EV----YIASMVSVK--NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYA 420
E +I ++ + + I+VGD +SI+LL Y+ E + ARDY
Sbjct: 862 ECGHHGHILALYTQTRGDFIVVGDLMKSISLLVYKHEESAIEERARDYNAN--------- 912
Query: 421 GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFM 480
W ++ E L+ DE ++ ++ N+
Sbjct: 913 -------------W----MTAVEMLD-------------DEV----YVGAENSYNLFTVR 938
Query: 481 YQPEARESNGGHRLIKKTDFHLGQHVNTF----FKIRCKPSSISDAPGARSRFLTWYASL 536
+A + RL ++HLG+ VN F +R S I P + ++
Sbjct: 939 KNSDAATDDERARLEVVGEYHLGEFVNRFRHGSLVMRLPDSDIGQIP------TVIFGTI 992
Query: 537 DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDG 596
+G +G LP Y L LQ+ +V + G L+ +R++ A +R +DG
Sbjct: 993 NGVIGIIASLPHDQYIFLEKLQSTLVKYIKGVGNLSHEQWRSFHNDKKTA--EARNFLDG 1050
Query: 597 SLVWKFLQLSLGERLEICKKIG 618
L+ FL LS + E+ K +G
Sbjct: 1051 DLIESFLDLSRSKMEEVSKAMG 1072
>gi|440492924|gb|ELQ75450.1| mRNA cleavage and polyadenylation factor II complex, subunit CFT1
(CPSF subunit) [Trachipleistophora hominis]
Length = 1254
Score = 73.2 bits (178), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 59/264 (22%), Positives = 108/264 (40%), Gaps = 59/264 (22%)
Query: 294 RGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQL 353
RGRIL+F++I+V+ + TK +K++ ++ KGP++ V G + ++ ++ +++
Sbjct: 976 RGRILVFEVIDVISDTADRKTKKALKLLGSERTKGPISCCAAVRGRIAVSLATRLMVYEF 1035
Query: 354 KDND-LTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQ 412
N + IAF D +Y S+ +KN I+VGD + + +Q E L L+++ +
Sbjct: 1036 DRNTGIVAIAFYDLYMYAVSLAVIKNYIVVGDIMMGLHFVYFQSEPVKLHLLSKSGRVAN 1095
Query: 413 PNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDK 472
S ++ G+RL I DK
Sbjct: 1096 LGSLDFFNA--------------------GDRLFITG--------------------IDK 1115
Query: 473 DKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTW 532
V +F + P SN G +L+K+ F H + R+
Sbjct: 1116 TGEVQIFSFSPGNLYSNEGEKLVKRQQFETYAHFQSI----------------RTNTYRS 1159
Query: 533 YASLDGALGFFLPLP--EKNYRRL 554
YAS + FF+ L +K+Y ++
Sbjct: 1160 YASFFSSQNFFVTLSYTQKDYGKI 1183
>gi|357519461|ref|XP_003630019.1| DNA damage-binding protein [Medicago truncatula]
gi|355524041|gb|AET04495.1| DNA damage-binding protein [Medicago truncatula]
Length = 1171
Score = 72.8 bits (177), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 109/501 (21%), Positives = 191/501 (38%), Gaps = 112/501 (22%)
Query: 130 VSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLET 189
VS + PF++ P L + EL I + +R +PL + + +T
Sbjct: 752 VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRTIPLGEHARRICHQEQT 806
Query: 190 KTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTN 249
+T+ I S Y + E+ E+ F+ L Q F + +
Sbjct: 807 RTFAIC------SLKYNSASAEESEM-------HFVRLLDDQ-------TFDFISV---- 842
Query: 250 FPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNY-SEDVTCRGRILLFDIIEVVPE 308
+PL +E+ + + S + + Y +GT Y E+ +GRIL+F + E
Sbjct: 843 YPLDTYEYGCFIISCSFSDDNNV-----YYCVGTAYVLPEENEPTKGRILVFSVEE---- 893
Query: 309 PGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKI--YIWQLKDNDLTGIAFIDT 366
K++++ KE KG V + G L+ A+ QKI Y W L+++ G + +
Sbjct: 894 -------GKLQLVAEKETKGAVYCLNAFNGKLLAAINQKIQLYKWVLRED---GTRELQS 943
Query: 367 EV-----YIASMVSVK-NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYA 420
E +A V + + I+VGD +SI+LL Y+ E + ARDY ++
Sbjct: 944 ECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAV---- 999
Query: 421 GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFM 480
+ILD+ +G ++ N+
Sbjct: 1000 ------------------------------------EILDDDVYLG---AENSFNLFTVR 1020
Query: 481 YQPEARESNGGHRLIKKTDFHLGQHVNTF----FKIRCKPSSISDAPGARSRFLTWYASL 536
E RL ++HLG+ +N F +R S + P + ++
Sbjct: 1021 KNSEGATDEERGRLEVAGEYHLGEFINRFRHGSLVMRLPDSDVGQIP------TVIFGTI 1074
Query: 537 DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDG 596
+G +G LP + Y L LQ+ + GGL+ +R++ + +R +DG
Sbjct: 1075 NGVIGVIASLPHEQYVFLEKLQSNLRKVIKGVGGLSHEQWRSFNNEKKTV--EARNFLDG 1132
Query: 597 SLVWKFLQLSLGERLEICKKI 617
L+ FL L + EI K +
Sbjct: 1133 DLIESFLDLKRSKMDEISKAM 1153
>gi|407923753|gb|EKG16818.1| Cleavage/polyadenylation specificity factor A subunit [Macrophomina
phaseolina MS6]
Length = 1129
Score = 72.4 bits (176), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 109/544 (20%), Positives = 203/544 (37%), Gaps = 99/544 (18%)
Query: 83 GVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCP 142
G + + R G VF HP+ ++ S G L +T + + + PF+ P
Sbjct: 656 GTQQANFRALPRGNGLYNVFATCEHPSLIY-GSEGRLVFSAVTAE-KATCVCPFNAEAYP 713
Query: 143 RGFLYFNAKSELRISVLP----THLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTST 198
R + A EL ++V+ TH V+ + + T +AY + K + +
Sbjct: 714 RS-IAIAASGELHLAVVDEERRTH--------VQTLHVNETVRRIAYSPQLKAFGL---- 760
Query: 199 AEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHV 258
G K ++ D + V Q H L ++E+ NF ++E+E V
Sbjct: 761 -----------GTIKRVLRDREE-------VVQGHFRLADEVIFKELD--NFEMNEYEIV 800
Query: 259 LCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKI 318
C ++ + R + E + RGRIL+F++ E ++
Sbjct: 801 ECAIRAELDDGDGETAERFIVGTSHLVEEEEQGSTRGRILVFEVTE----------DRRL 850
Query: 319 KMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDND-----LTGIAFIDTEVYIASM 373
K+I KG + V +V + + + I+ + + L A T +
Sbjct: 851 KVIAEISTKGACRCLAMVDNKIVAGLIKTVVIYSFEYSTPSTPFLVKKASFRTSTAPIDI 910
Query: 374 VSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLV 433
N I V D +S+++L Y+P AG+ S
Sbjct: 911 TVTGNQIAVADLIKSVSVLEYKPG----------------------AGDQS--------- 939
Query: 434 WKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR 493
E E+ + + + L E ++ +D + N++L R
Sbjct: 940 --------DELKEVARHVQVSWSMALAEVDENTYLQADAEGNLILLERDVSGVTEEDRKR 991
Query: 494 LIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRR 553
L+ + D LG+ VN +I +++SDAP F +A+++G++ F +
Sbjct: 992 LMLRGDMLLGEQVNRIRRIDM--ATVSDAPVIPRAF---FATVEGSIYLFALIAPAKVDL 1046
Query: 554 LLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEI 613
L+ LQ+ + G +R ++ + P+R +DG L+ +FL L E+ E+
Sbjct: 1047 LIRLQSQLADFVRSPGHYPFLRYRAFRNQVREEDEPNR-FVDGDLIERFLDLKPREQEEV 1105
Query: 614 CKKI 617
K +
Sbjct: 1106 VKGV 1109
>gi|348681092|gb|EGZ20908.1| hypothetical protein PHYSODRAFT_259403 [Phytophthora sojae]
Length = 1137
Score = 72.4 bits (176), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 77/360 (21%), Positives = 143/360 (39%), Gaps = 71/360 (19%)
Query: 278 YIALGTNYNYSEDVT-CRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
Y +GT Y + ++ +GRIL+F + + E K++++ KE KG V +
Sbjct: 805 YFVVGTAYIHEDEAEPHQGRILVFAVTGIHGE-------RKLQLVTEKEVKGAVYCLNAF 857
Query: 337 AGFLVTAVGQKIYIWQLKDNDLTGIAFIDT-----EVYIASMVSVKNLILVGDYARSIAL 391
G ++ V K +++ +N + + M S + I+VGD +S++L
Sbjct: 858 NGKVLAGVNSKAQLYKWSENTDNEKELVSECGHYGHTLVLYMESRGDFIVVGDLMKSVSL 917
Query: 392 LRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI 451
L Y+ T+ +A+D ++ G
Sbjct: 918 LSYKQLDGTIEEIAKDLNSNWMSALG---------------------------------- 943
Query: 452 GSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF-- 509
I+D+ + +G S+ D N+ A RL +FHLG+ VN F
Sbjct: 944 ------IVDDDTYIG---SETDFNLFTVQRNSGAASDEERGRLETVGEFHLGEFVNRFRY 994
Query: 510 ---FKIRCKPSSISDA-------PGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQN 559
P+ + D P A+++ + + ++ G +G LPL + Y LL +Q
Sbjct: 995 GSLTPAAAGPTDMVDVVEQAPIVPAAQNQSM-LFGTVSGMIGVILPLTKDQYSFLLRVQQ 1053
Query: 560 VMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGS 619
+ GG + + +R ++ + + + +R IDG LV FL L + ++ K+ S
Sbjct: 1054 ALTQVVKGVGGFSHKDWRMFENR--RSVSEARNFIDGDLVESFLDLPKAQMTKVVDKLNS 1111
>gi|380488833|emb|CCF37111.1| CPSF A subunit region, partial [Colletotrichum higginsianum]
Length = 1062
Score = 72.0 bits (175), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 61/211 (28%), Positives = 94/211 (44%), Gaps = 25/211 (11%)
Query: 14 ETIVQELLTVSLG-LHGNRPLLLVR-TQHELLIYQAFRHPKG------ALKLRFKK---- 61
+ + ELL LG P L+VR +L IY+ R A L F+K
Sbjct: 840 QETLTELLVADLGDTTATSPYLIVRHANDDLTIYEPIRLESQDKTLGLAKTLHFQKITNP 899
Query: 62 -LKVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELR 120
L V ANEQP R +R +NI GY VFL G P+ + +++ +
Sbjct: 900 ALAKSPVEVADDEANEQP------RFVPLRPCANINGYSTVFLPGASPSLIVKSAKSSPK 953
Query: 121 AHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSY-DAPWPVRKVPLKCT 179
+ G V ++ FH C RGF+Y +++ + R++ LP ++ + VRK+P+
Sbjct: 954 VVGLQGIG-VRGMSSFHTEGCERGFIYADSEGQTRVTQLPADSNFAELGVSVRKIPIGDA 1012
Query: 180 PHFLAYHLETKTYCIVTSTAE----PSTDYY 206
+AYH +TY + S +E P D Y
Sbjct: 1013 VGLIAYHPPMETYAVACSISEHFELPKDDDY 1043
>gi|170589357|ref|XP_001899440.1| CPSF A subunit region family protein [Brugia malayi]
gi|158593653|gb|EDP32248.1| CPSF A subunit region family protein [Brugia malayi]
Length = 655
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 114/539 (21%), Positives = 201/539 (37%), Gaps = 92/539 (17%)
Query: 90 RYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFN 149
++ S + +F+C PA ++ +++ L ++ VST+ P + P + +
Sbjct: 139 KFRSRCSPVHNIFVCSDRPAVIYSSNQKLLFSNVNL--RMVSTMTPLYAEAYPDALVLTD 196
Query: 150 AKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFN 209
S ++ + +R VPL +P +AY ET T ++ E + +
Sbjct: 197 GHS-----LVIGRIDDIQKLHIRTVPLGESPSRIAYQPETNTIAVIVERLEVILFLFFY- 250
Query: 210 GEDKELVTDPRDSRFIPPLVSQFHVS----LFSPFSWEEIPQTNFPLHEWEHVLCLKNV- 264
+ D S+ + S E P+ E VL L +
Sbjct: 251 -----VFVDAMGKHHFGQCASKNAMETSSSRLSSMRREPTPECLAEEMEVSSVLLLDSNT 305
Query: 265 -----SMEYEGTLSGL-----------RGYIALGTNYNYSEDVTCR-GRILLFDIIEVVP 307
S E EG+ + + Y +GT S++ + GRI++F E P
Sbjct: 306 FEILHSHELEGSEMAMSLASCQLGDDSQPYFVVGTAVIMSDETESKMGRIMMFQASEG-P 364
Query: 308 EPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTE 367
E +++++Y KE KG +I + G LV AV + +++ + + D +
Sbjct: 365 E--------RMRLVYEKEIKGAAYSIQSMDGKLVVAVNSCVRLFEWTADKELRLECSDFD 416
Query: 368 VYIASMVSVKN-LILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRG 426
A + KN LILVGD RS++LL Y+ T VARD+
Sbjct: 417 NVTALYLKTKNDLILVGDLMRSLSLLSYKSMESTFEKVARDFMTN--------------- 461
Query: 427 IIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEAR 486
W + C+ I S + F+ ++ N+ M
Sbjct: 462 -------W----------MSACEIIDSDN-----------FLGAENSYNLFTVMKDSFTV 493
Query: 487 ESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPL 546
G RL + F+LG+ VN F + + AP S L Y + DG +G + +
Sbjct: 494 FKEEGTRLQELGLFYLGEMVNVFCHGSLTATQVDVAPLYHSSIL--YGTSDGGIGVIVQM 551
Query: 547 PEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQL 605
P Y L +Q + + + ++ +RT++ + G IDG L+ L +
Sbjct: 552 PPVLYTFLQDVQKRLAEYAENCMRISHTQYRTFETEK--RSEAPNGFIDGDLIESLLDM 608
>gi|307111604|gb|EFN59838.1| hypothetical protein CHLNCDRAFT_29381 [Chlorella variabilis]
Length = 1108
Score = 70.5 bits (171), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 89/445 (20%), Positives = 158/445 (35%), Gaps = 103/445 (23%)
Query: 171 VRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVS 230
VR VPL P +A+ ++T+ + + A +GE +
Sbjct: 724 VRTVPLGEQPRRIAHQETSRTFAVTCTQA-------TISGEGGD---------------- 760
Query: 231 QFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNY-SE 289
V L ++E + + HE LC + + Y +GT + +E
Sbjct: 761 --SVRLVDEQTFELLDRLQLQQHELACSLCSTQLGDDPAT-------YYVVGTAFAPPNE 811
Query: 290 DVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIY 349
+GRI + K+ ++ KE +G V ++ G L+ + ++
Sbjct: 812 PEPTKGRIFVL-----------AAAGGKLCVVCEKETRGAVYSLAEFQGRLLAGINSRVQ 860
Query: 350 IWQLKDNDLTGIAFIDT-----EVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLV 404
+++ + G A + V + + +L++VGD +SI LL + E L L
Sbjct: 861 MYKWLEQGEGGRALVPECSHAGHVLALYLATRGDLVVVGDLMKSIQLLAWGEEEGALELR 920
Query: 405 ARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSS 464
ARD+ P W +LD+ +
Sbjct: 921 ARDFHPN----------------------WM------------------SAVTVLDDDTY 940
Query: 465 MGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF----FKIRCKPSSIS 520
MG ++ N+ +A RL +HLG+ VN F +R S +S
Sbjct: 941 MG---AENSYNLFTVRRNADAATDEERSRLETVGRYHLGEFVNRFQPGSLVMRLPDSELS 997
Query: 521 DAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYK 580
P + +++G +G LP Y+ L LQ M GG + +R +
Sbjct: 998 QIP------TVLFGTINGVIGVVASLPHAQYQLLESLQEAMRKVVKGVGGFDHAQWRAFS 1051
Query: 581 GKGYYAGNPSRGIIDGSLVWKFLQL 605
+ + P+R +DG L+ +FL L
Sbjct: 1052 NQ-HMPATPARQFVDGDLIEQFLDL 1075
>gi|384250802|gb|EIE24281.1| hypothetical protein COCSUDRAFT_28729 [Coccomyxa subellipsoidea
C-169]
Length = 1101
Score = 70.5 bits (171), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 90/458 (19%), Positives = 174/458 (37%), Gaps = 100/458 (21%)
Query: 171 VRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVS 230
+R VPL P LA+ ++++ ++TS +T + L+ D
Sbjct: 714 IRTVPLGEQPRRLAHQEASRSFLVLTSPNNGATGMDDAGPDSVRLLDDQ----------- 762
Query: 231 QFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSED 290
++E + + +E +C SM + Y +GT +E+
Sbjct: 763 ----------TFETLDRFGLETNE----VCCAAASMSFSDDPCP---YYVVGTAITVAEE 805
Query: 291 VT-CRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIY 349
+GRIL+F K+ ++ KE KG + G L+ + ++
Sbjct: 806 PEPTKGRILVFGA-----------KGGKLSLVCEKEVKGAAYNLHPFQGKLIAGINSRVQ 854
Query: 350 I--WQLKDN---DLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLV 404
+ W ++ +LT V +V+ + ++VGD RS+ LL Y+ + L +
Sbjct: 855 LFKWTQSEDGSRELTNECSHVGHVLALYIVTRGDFVIVGDLMRSLQLLIYRADEGILEVR 914
Query: 405 ARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSS 464
ARDYK W ++LD+ +
Sbjct: 915 ARDYKTH----------------------WM------------------TAVEVLDDDTY 934
Query: 465 MGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF----FKIRCKPSSIS 520
+G ++ N+ +A +RL +HLG VN F ++ S +
Sbjct: 935 LG---AENSNNIFTLRKNTDAAADEDRNRLETVGQYHLGVFVNRFRHGSLVMKLPDSEAA 991
Query: 521 DAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYK 580
P + +++G++G LP++ ++ L LQ+ + GGL+ A+RT++
Sbjct: 992 KIP------TVLFVTINGSIGVIASLPQQQFQFLSRLQDCLRKVIKGVGGLSHVAWRTFQ 1045
Query: 581 GKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 618
+ + PS+ +DG L+ +FL L + +++G
Sbjct: 1046 DE--HTKMPSQNFVDGDLIEQFLDLKRDSMERVAREMG 1081
>gi|324518783|gb|ADY47203.1| Cleavage and polyadenylation specificity factor subunit 1 [Ascaris
suum]
Length = 108
Score = 70.5 bits (171), Expect = 3e-09, Method: Composition-based stats.
Identities = 34/87 (39%), Positives = 56/87 (64%), Gaps = 2/87 (2%)
Query: 465 MGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSS--ISDA 522
M F++SD+ N+ +F Y PEA ES+GG RLI +++ ++G +VN+F +++ SS + +
Sbjct: 20 MAFIMSDEAANIAVFNYLPEALESSGGERLILRSEINIGTNVNSFMRVKGHISSGFVENE 79
Query: 523 PGARSRFLTWYASLDGALGFFLPLPEK 549
+ +R + SLDG+ GF PL EK
Sbjct: 80 HYSLNRQSVLFCSLDGSFGFVRPLSEK 106
>gi|312076590|ref|XP_003140929.1| CPSF A subunit region family protein [Loa loa]
Length = 655
Score = 70.1 bits (170), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 117/531 (22%), Positives = 206/531 (38%), Gaps = 85/531 (16%)
Query: 90 RYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFN 149
++ S + +F+C PA ++ +++ L ++ VST+ P + P + +
Sbjct: 140 KFRSRCSSVHNIFVCSDRPAVIYSSNQKLLFSNVNL--RMVSTMTPLYAEAYPDALVLTD 197
Query: 150 AKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAE--PSTDYYK 207
S ++ + +R VPL +P +AY ET T + E + +
Sbjct: 198 GNS-----LVIGRIDDIQKLHIRTVPLGESPSRIAYQPETNTIAVTVERLEFVDAMGKHH 252
Query: 208 F----NGEDKELVTDPRDSRFIPP----LVSQFHVS---LFSPFSWEEIPQTNFPLHEWE 256
F + E + S P L + VS L ++E + + L E
Sbjct: 253 FGQCASKNAMETSSSRLSSMRREPTPECLAEEMEVSSILLLDSNTFEILH--SHELEGSE 310
Query: 257 HVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTK 315
+ L + + + + Y +GT S++ + GRI++F E PE
Sbjct: 311 MAMSLASCQLGNDS-----QPYFVVGTAVIMSDETESKMGRIMMFQASEG-PE------- 357
Query: 316 NKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVS 375
+++++Y KE KG +I + G LV AV + +++ + + D + A +
Sbjct: 358 -RMRLVYEKEIKGAAYSIQSMDGKLVVAVNSCVRLFEWTADKELRLECSDFDNVTALYLK 416
Query: 376 VKN-LILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVW 434
KN LILVGD RS++LL Y+ T VARD+ W
Sbjct: 417 TKNDLILVGDLMRSLSLLSYKSVESTFEKVARDFMTN----------------------W 454
Query: 435 KFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRL 494
+ C+ I S + L +S KD V ++ E G RL
Sbjct: 455 ----------MSACEIIDS--DSFLGAENSYNLFTVVKDSFTV---FKEE------GTRL 493
Query: 495 IKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRL 554
+ F+LG+ VN F + + AP S L Y + DG +G + +P Y L
Sbjct: 494 QELGLFYLGEMVNVFCHGSLTATQVDVAPLYHSSIL--YGTSDGGIGVIVQMPPVLYTFL 551
Query: 555 LMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQL 605
+Q + +T + ++ +RT++ + G IDG L+ L +
Sbjct: 552 HDVQKRLADYTENCMRISHTQYRTFETEK--RSEVPNGFIDGDLIESLLDM 600
>gi|393905247|gb|EJD73911.1| CPSF A subunit region family protein [Loa loa]
Length = 1145
Score = 70.1 bits (170), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 117/531 (22%), Positives = 206/531 (38%), Gaps = 85/531 (16%)
Query: 90 RYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFN 149
++ S + +F+C PA ++ +++ L ++ VST+ P + P + +
Sbjct: 639 KFRSRCSSVHNIFVCSDRPAVIYSSNQKLLFSNVNL--RMVSTMTPLYAEAYPDALVLTD 696
Query: 150 AKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAE--PSTDYYK 207
S ++ + +R VPL +P +AY ET T + E + +
Sbjct: 697 GNS-----LVIGRIDDIQKLHIRTVPLGESPSRIAYQPETNTIAVTVERLEFVDAMGKHH 751
Query: 208 F----NGEDKELVTDPRDSRFIPP----LVSQFHVS---LFSPFSWEEIPQTNFPLHEWE 256
F + E + S P L + VS L ++E + + L E
Sbjct: 752 FGQCASKNAMETSSSRLSSMRREPTPECLAEEMEVSSILLLDSNTFEILH--SHELEGSE 809
Query: 257 HVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTK 315
+ L + + + + Y +GT S++ + GRI++F E PE
Sbjct: 810 MAMSLASCQLGNDS-----QPYFVVGTAVIMSDETESKMGRIMMFQASEG-PE------- 856
Query: 316 NKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVS 375
+++++Y KE KG +I + G LV AV + +++ + + D + A +
Sbjct: 857 -RMRLVYEKEIKGAAYSIQSMDGKLVVAVNSCVRLFEWTADKELRLECSDFDNVTALYLK 915
Query: 376 VKN-LILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVW 434
KN LILVGD RS++LL Y+ T VARD+ W
Sbjct: 916 TKNDLILVGDLMRSLSLLSYKSVESTFEKVARDFMTN----------------------W 953
Query: 435 KFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRL 494
+ C+ I S + L +S KD V ++ E G RL
Sbjct: 954 ----------MSACEIIDS--DSFLGAENSYNLFTVVKDSFTV---FKEE------GTRL 992
Query: 495 IKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRL 554
+ F+LG+ VN F + + AP S L Y + DG +G + +P Y L
Sbjct: 993 QELGLFYLGEMVNVFCHGSLTATQVDVAPLYHSSIL--YGTSDGGIGVIVQMPPVLYTFL 1050
Query: 555 LMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQL 605
+Q + +T + ++ +RT++ + G IDG L+ L +
Sbjct: 1051 HDVQKRLADYTENCMRISHTQYRTFETEK--RSEVPNGFIDGDLIESLLDM 1099
>gi|258572939|ref|XP_002540651.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
gi|237900917|gb|EEP75318.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
Length = 1144
Score = 70.1 bits (170), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 78/348 (22%), Positives = 134/348 (38%), Gaps = 69/348 (19%)
Query: 294 RGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQL 353
+GRIL+FD+ +++M+ +G A+ V G +V A+ + + I +
Sbjct: 848 KGRILIFDV----------GVNRELRMVSEFPVRGACRALAMVNGKIVAALMKSVVILSM 897
Query: 354 KDNDLTGIAFIDTEVYIASMVSVK-----NLILVGDYARSIALLRYQP----EYRTLSLV 404
K + I Y S V N+I+V D +SI+LL YQ + +L V
Sbjct: 898 KKGNSYSIDIGKESSYRTSTAPVDLSVTDNIIVVADLMKSISLLEYQAGEAGQPDSLKEV 957
Query: 405 ARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSS 464
AR Y+ +W + E
Sbjct: 958 ARHYQT----------------------LWTTTAAPIAEN-------------------- 975
Query: 465 MGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPG 524
F++SD + N+V+ + R+ ++ LG VN ++ + S S P
Sbjct: 976 -AFLVSDAEGNLVVLNRNTTGVTEDDKRRMQITSELRLGTMVNRIRRMDLQASQSS--PV 1032
Query: 525 ARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGY 584
FL A+ DG++ F + + L+ LQ+ + + + GG+ +R +K
Sbjct: 1033 IPKAFL---ATTDGSIYLFGVIAQFAQDLLMRLQSALASFVASPGGIPFSGYRAFKSATR 1089
Query: 585 YAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI-LDELYDI 631
A P R +DG LV +FL L + + K+ D+ L +L DI
Sbjct: 1090 QADEPFR-FVDGELVEQFLDCPLEVQEAVLAKMDGGGRDVTLSQLKDI 1136
>gi|443707495|gb|ELU03057.1| hypothetical protein CAPTEDRAFT_148808 [Capitella teleta]
Length = 1084
Score = 69.7 bits (169), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 114/499 (22%), Positives = 191/499 (38%), Gaps = 107/499 (21%)
Query: 171 VRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLV- 229
+R VPL TP +AY ++T+ ++T +D NG T R S L
Sbjct: 656 IRNVPLGETPRRIAYQEASQTFGVIT----LRSDLQDSNGS-----TPARPSASTQALST 706
Query: 230 ---SQFHVSLFSPFSWEEIPQTNFPLHEW----EHVLCL--KNVSMEYE---GTLSGLRG 277
S V S + E +H +H + + M+YE +SG G
Sbjct: 707 SSSSNVKVMAASNANTEHTFGDEVEVHSLLVLDQHTFEVLHSHQLMQYEFATALMSGRFG 766
Query: 278 -----YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVT 331
Y +GT Y E+ + GRI++F K+ + KE KG
Sbjct: 767 EDPTTYYVVGTAMVYPEEAEPKQGRIIVF-----------RFHDGKLTQVAEKEIKGAAY 815
Query: 332 AICHVAGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVK-NLILVGDYARS 388
+ G L+ ++ +++ W + ++ + IA + K + ILVGD RS
Sbjct: 816 TLTEFNGKLLASINSTVRLFEWTAEKELRVECSYFNN--IIALYLKTKGDFILVGDLMRS 873
Query: 389 IALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEIC 448
+ LL Y+P +ARDY P S
Sbjct: 874 VTLLSYKPMEGCFEEIARDYNPNWMTSI-------------------------------- 901
Query: 449 KKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHV 506
D+LD+ + +G + + +F Q ++ + R L + +HLG+ V
Sbjct: 902 --------DVLDDDTFLG-----AENSFNIFTCQKDSAATTDEERQHLQEVGLYHLGEFV 948
Query: 507 NTFFKIRCKPSSISDAPG---ARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVT 563
N F S + PG + ++ + +++GALG LP++ Y LL +QN +
Sbjct: 949 NVFRH----GSLVMQHPGECTSPTQGSVLFGTVNGALGLVTQLPQEFYLFLLEVQNKLAK 1004
Query: 564 HTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------ 617
G + +R++ + P+ G IDG L+ FL LS + E+ + +
Sbjct: 1005 TIKSVGKVEHAFWRSFHTE--RKTEPATGFIDGDLIESFLDLSRDKMQEVVQGLQMDDGS 1062
Query: 618 GSKHNDILDELYD-IEALS 635
G K +D+L IE L+
Sbjct: 1063 GMKREAAVDDLVKMIEELT 1081
>gi|302769568|ref|XP_002968203.1| hypothetical protein SELMODRAFT_145521 [Selaginella moellendorffii]
gi|300163847|gb|EFJ30457.1| hypothetical protein SELMODRAFT_145521 [Selaginella moellendorffii]
Length = 1089
Score = 69.7 bits (169), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 109/493 (22%), Positives = 186/493 (37%), Gaps = 104/493 (21%)
Query: 130 VSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLET 189
V+ + PF++ + P L + EL I + +R V L P + + +T
Sbjct: 670 VNHMCPFNSASFPDS-LAIGKEGELTIGTIDDI----QKLHIRTVALGEHPRRICHQEQT 724
Query: 190 KTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTN 249
+T+ + T+ Y NGED E S F+ L Q L S
Sbjct: 725 RTFGLCTARF-----YSNPNGEDHE-------SHFVKLLDDQTFEVLGS----------- 761
Query: 250 FPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNY-SEDVTCRGRILLFDIIEVVPE 308
+ L +E+ + S + Y +GT Y E+ +GRIL+F + E
Sbjct: 762 YNLDTFENGCTIITCSFTDDPAT-----YYCVGTAYALPEENEPSKGRILIFTV-----E 811
Query: 309 PGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKI--YIWQLKDN--DLTGIAFI 364
G K +++ KE KG V + G L+ + QKI Y W +D+ +L
Sbjct: 812 DG------KFQLVTEKETKGAVYNLNAFNGKLLAGINQKIQLYKWTQRDSTRELQSECGH 865
Query: 365 DTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPS 424
+ + S + I+VGD +SI+LL Y+PE + ARDY
Sbjct: 866 HGHILALYVQSRGDFIVVGDLMKSISLLLYKPEEGAIEERARDYNAN------------- 912
Query: 425 RGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPE 484
W +ILD+ +G ++ N+ +
Sbjct: 913 ---------WM------------------TAVEILDDDIYLG---AENSFNLFTVRKNSD 942
Query: 485 ARESNGGHRLIKKTDFHLGQHVNTF----FKIRCKPSSISDAPGARSRFLTWYASLDGAL 540
A RL ++HLG+ VN F +R + S P + +++G +
Sbjct: 943 AATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDNETSQIP------TVIFGTVNGVI 996
Query: 541 GFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVW 600
G L ++ + L LQ+ + GGL+ +R++ + A ++ +DG L+
Sbjct: 997 GVVASLQQEQFNFLQRLQHCLAKVIKGVGGLSHEQWRSFSSERKNA--DAKNFLDGDLIE 1054
Query: 601 KFLQLSLGERLEI 613
FL L+ + E+
Sbjct: 1055 SFLDLNRAKMDEV 1067
>gi|302788810|ref|XP_002976174.1| hypothetical protein SELMODRAFT_151061 [Selaginella moellendorffii]
gi|300156450|gb|EFJ23079.1| hypothetical protein SELMODRAFT_151061 [Selaginella moellendorffii]
Length = 1089
Score = 69.7 bits (169), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 109/493 (22%), Positives = 186/493 (37%), Gaps = 104/493 (21%)
Query: 130 VSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLET 189
V+ + PF++ + P L + EL I + +R V L P + + +T
Sbjct: 670 VNHMCPFNSASFPDS-LAIGKEGELTIGTIDDI----QKLHIRTVALGEHPRRICHQEQT 724
Query: 190 KTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTN 249
+T+ + T+ Y NGED E S F+ L Q L S
Sbjct: 725 RTFGLCTARF-----YSNPNGEDHE-------SHFVKLLDDQTFEVLGS----------- 761
Query: 250 FPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNY-SEDVTCRGRILLFDIIEVVPE 308
+ L +E+ + S + Y +GT Y E+ +GRIL+F + E
Sbjct: 762 YNLDTFENGCTIITCSFTDDPAT-----YYCVGTAYALPEENEPSKGRILIFTV-----E 811
Query: 309 PGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKI--YIWQLKDN--DLTGIAFI 364
G K +++ KE KG V + G L+ + QKI Y W +D+ +L
Sbjct: 812 DG------KFQLVTEKETKGAVYNLNAFNGKLLAGINQKIQLYKWTQRDSTRELQSECGH 865
Query: 365 DTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPS 424
+ + S + I+VGD +SI+LL Y+PE + ARDY
Sbjct: 866 HGHILALYVQSRGDFIVVGDLMKSISLLLYKPEEGAIEERARDYNAN------------- 912
Query: 425 RGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPE 484
W +ILD+ +G ++ N+ +
Sbjct: 913 ---------WM------------------TAVEILDDDIYLG---AENSFNLFTVRKNSD 942
Query: 485 ARESNGGHRLIKKTDFHLGQHVNTF----FKIRCKPSSISDAPGARSRFLTWYASLDGAL 540
A RL ++HLG+ VN F +R + S P + +++G +
Sbjct: 943 AATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDNETSQIP------TVIFGTVNGVI 996
Query: 541 GFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVW 600
G L ++ + L LQ+ + GGL+ +R++ + A ++ +DG L+
Sbjct: 997 GVVASLQQEQFNFLQRLQHCLAKVIKGVGGLSHEQWRSFSSERKNA--DAKNFLDGDLIE 1054
Query: 601 KFLQLSLGERLEI 613
FL L+ + E+
Sbjct: 1055 SFLDLNRAKMDEV 1067
>gi|219125301|ref|XP_002182922.1| damage-specific DNA binding protein 1 [Phaeodactylum tricornutum CCAP
1055/1]
gi|217405716|gb|EEC45658.1| damage-specific DNA binding protein 1 [Phaeodactylum tricornutum CCAP
1055/1]
Length = 1284
Score = 69.7 bits (169), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 78/338 (23%), Positives = 132/338 (39%), Gaps = 55/338 (16%)
Query: 276 RGYIALGTNYNY-SEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAIC 334
R ++ +GT Y ED RGRIL++ + G P + ++ I +G V +IC
Sbjct: 929 RPFLLVGTAYAMPDEDEPSRGRILVYSC-QADEASGTPTSTRAVRQITEMSTQGGVYSIC 987
Query: 335 HV-AGFLVTAVGQKIYIWQLKDN------DLTGIAFIDTEVYIASMVSVKNLILVGDYAR 387
G + V K ++ Q+ + + GI V + K L +VGD R
Sbjct: 988 QFYDGNFLCTVNSKTHVVQIVADCGVLRLEYVGIGHHGHIVSLFVKSRAKPLAIVGDLMR 1047
Query: 388 SIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEI 447
S++L++Y P++ TL VARD+ P + + + G+ W L
Sbjct: 1048 SVSLMQYYPQHETLEEVARDFNPNWTTAVEMLTDD----VYIGAENWNNL---------F 1094
Query: 448 CKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVN 507
C + +N + R N G +FHLG+ N
Sbjct: 1095 CLR-----------------------RNKAATSEEIRCRLDNIG-------EFHLGEMCN 1124
Query: 508 TFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSH 567
F +S SR T + +++G+LG L L + + L+ +
Sbjct: 1125 KFMS-GSLVMPVSSNSTTSSRRATLFGTVEGSLGVILGLDGRTAAFFITLERAIAKTIQP 1183
Query: 568 TGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQL 605
GG + + +R+ + + +P+ G +DG LV FL L
Sbjct: 1184 VGGFSHQLYRSCQAE--LRVHPAHGFVDGDLVETFLDL 1219
>gi|452824087|gb|EME31092.1| DNA damage-binding protein 1 isoform 1 [Galdieria sulphuraria]
Length = 1128
Score = 68.9 bits (167), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 98/456 (21%), Positives = 182/456 (39%), Gaps = 86/456 (18%)
Query: 171 VRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRD-SRFIPPLV 229
+R +PL P +A HL+T V +T K++VT D + +
Sbjct: 732 IRTIPLGEQPRRIA-HLDTHHVFAVLTT--------------KQVVTISEDGNEALSETT 776
Query: 230 SQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSE 289
+ +V L E + ++ L ++E + V+ + + Y +GT Y+Y++
Sbjct: 777 EEGYVRLIDDTMMEIVH--SYKLEQFETPCSVITVNFGDDAAAKDNQDYFVVGTAYSYAD 834
Query: 290 DVT-CRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQ-- 346
+ RGR+L+F + E ++ ++ + KG + ++ G ++ +V
Sbjct: 835 EPEPSRGRMLVFAVRE-----------QRLTLVAERTFKGALYSMDAFNGKILASVNSML 883
Query: 347 KIYIWQLKDN---DLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSL 403
K+ W ++ LT ++I + + + IL+GD RS++LL Y+P T+
Sbjct: 884 KLVRWSETESGARTLTEECTYHGSIFILQIKCLGDFILIGDLVRSVSLLAYKPMNGTIED 943
Query: 404 VARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFS 463
VARD P+ W +++ E L+ LD +
Sbjct: 944 VARDIDPS----------------------W----ITVIEMLD------------LDYYI 965
Query: 464 SMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAP 523
S ++ N+ +A RL K ++HLG+ VN R + P
Sbjct: 966 S-----AENCFNLFTLKRNSDASTEEERSRLEKVGEYHLGELVNRIRHGRL----VLQIP 1016
Query: 524 GARSRFLT--WYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKG 581
+ L Y + +GALG + EK ++ L LQ + GG+ +R +
Sbjct: 1017 ESGISILKSLLYGTANGALGVIASIDEKTFQFLHSLQTALNEVIKGVGGIQHEDWRRFTS 1076
Query: 582 KGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI 617
+ S+ +DG L+ +FL LS + + KK+
Sbjct: 1077 ERRIG--DSKNFLDGDLIERFLDLSRDKMELVAKKV 1110
>gi|301121252|ref|XP_002908353.1| DNA damage-binding protein, putative [Phytophthora infestans T30-4]
gi|262103384|gb|EEY61436.1| DNA damage-binding protein, putative [Phytophthora infestans T30-4]
Length = 1150
Score = 68.9 bits (167), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 79/373 (21%), Positives = 146/373 (39%), Gaps = 84/373 (22%)
Query: 278 YIALGTNYNYSEDVT-CRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
Y +GT Y + E+ +GRIL+F + + E K++++ KE KG V +
Sbjct: 805 YFVVGTAYIHEEEAEPHQGRILVFAVTGIHGE-------RKLQLVTEKEVKGAVYCLNSF 857
Query: 337 AGFLVTAVGQKIYIWQLKDNDLTGIAFIDT-----EVYIASMVSVKNLILVGDYARSIAL 391
G ++ V K +++ +N + + M S + I+VGD +SI+L
Sbjct: 858 NGKVLAGVNSKAQLYKWSENTDNEKELVSECGHYGHTLVLYMESRGDFIVVGDLMKSISL 917
Query: 392 LRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI 451
L Y+ T+ +A+D ++ G
Sbjct: 918 LSYKQLDGTIEEIAKDLNSNWMSAVG---------------------------------- 943
Query: 452 GSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF-- 509
I+D+ + +G S+ D N+ A RL +FHLG+ VN F
Sbjct: 944 ------IVDDDTYIG---SETDFNLFTVQRNSGAASDEERGRLETVGEFHLGEFVNRFRY 994
Query: 510 ----------------FKIRCKPSSISD-------APGARSRFLTWYASLDGALGFFLPL 546
+ P+++ D AP +++ + + ++ G +G LP+
Sbjct: 995 GSLVMQNSSSTSQTPSGVVSTGPTAMVDVGESAPAAPVVQNQSM-LFGTVSGMIGVILPI 1053
Query: 547 PEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLS 606
+ Y LL +Q + GG + + +RT++ + + + +R IDG LV FL L
Sbjct: 1054 SKDQYSFLLRVQQALTHVVKGVGGFSHKDWRTFENR--RSVSEARNFIDGDLVESFLDLP 1111
Query: 607 LGERLEICKKIGS 619
+ ++ K+ S
Sbjct: 1112 KPQMTKVVDKLNS 1124
>gi|346321204|gb|EGX90804.1| DNA damage-binding protein 1 [Cordyceps militaris CM01]
Length = 1160
Score = 68.9 bits (167), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 72/348 (20%), Positives = 143/348 (41%), Gaps = 62/348 (17%)
Query: 283 TNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVT 342
T+ + E RGRIL+ + E + ++ I KG + + ++V
Sbjct: 858 TDADVGEASETRGRILVLGVDE----------ERQLYTIVTHNLKGACRCLSVLDEYIVA 907
Query: 343 AVGQKIYIWQLKDNDLTGIAFIDTEVY------IASMVSVKNLILVGDYARSIALLRYQP 396
+ + + +++ + T + Y +A VS N+I VGD +S++L+ + P
Sbjct: 908 GLSKTVVVYRYTEETSTEGSLQKLAAYRPASFPVALDVS-GNMIGVGDLMQSLSLVEFTP 966
Query: 397 EYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHN 456
+D +P + K + + W +C G +
Sbjct: 967 --------PKDGEPAKLQEKARHFQS----------AWA---------TSVCHLDGER-- 997
Query: 457 DILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKP 516
++ +D N+++ PEA RL ++ +LG+ +N K+ P
Sbjct: 998 ----------WLETDAQGNIMVLARNPEAPTEQDRGRLEITSEMNLGEQINKIRKLNVAP 1047
Query: 517 SSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAF 576
+ +A + FL AS++G L + + K L+ LQ+ + + TG ++ A+
Sbjct: 1048 AD--NAVVSPKAFL---ASIEGTLYLYGDIAPKYQDLLITLQSNIEQYVKTTGDISFNAW 1102
Query: 577 RTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 624
R+++ + A P R +DG +V +FL L ++E+CK +G D+
Sbjct: 1103 RSFRNQTREADGPFR-FVDGEMVERFLDLDELTQVELCKDLGPSVEDV 1149
>gi|255316764|gb|ACU01763.1| putative DNA damage binding protein [Brachypodium distachyon]
Length = 384
Score = 68.9 bits (167), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 81/354 (22%), Positives = 147/354 (41%), Gaps = 78/354 (22%)
Query: 278 YIALGTNYNY-SEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
Y +GT Y E+ +GRIL+F + E G ++++I KE KG V ++
Sbjct: 79 YYCVGTAYVLPEENEPTKGRILVFAV-----EDG------RLQLIVEKETKGAVYSLNAF 127
Query: 337 AGFLVTAVGQKI--YIWQLKDNDLTGIAFIDTEV----YIASMVSVK--NLILVGDYARS 388
G L+ A+ QKI Y W +++ G + +E +I ++ + + I+VGD +S
Sbjct: 128 NGKLLAAINQKIQLYKWMTRED---GSHELQSECGHHGHILALFTQTRGDFIVVGDLMKS 184
Query: 389 IALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEIC 448
I+LL Y+ E + +ARDY W ++ E ++
Sbjct: 185 ISLLVYKHEESAIEELARDYNAN----------------------W----MTAVEMID-- 216
Query: 449 KKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNT 508
+DI ++ ++ N+ +A RL ++HLG+ VN
Sbjct: 217 -------DDI--------YVGAENSYNLFTVRKNSDAATDEERGRLEVVGEYHLGEFVNR 261
Query: 509 F----FKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 564
F +R + + P + +++G +G LP Y L LQ+++
Sbjct: 262 FRHGSLVMRLPDTEMGQIP------TVIFGTINGVIGIIASLPHDQYVFLEKLQSILGKF 315
Query: 565 TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 618
G L+ +R++ + A +R +DG L+ FL L+ + E+ K +G
Sbjct: 316 IKGVGSLSHDQWRSFHNEKKTA--EARNFLDGDLIESFLDLNRSKMEEVSKGMG 367
>gi|413948669|gb|AFW81318.1| hypothetical protein ZEAMMB73_456332 [Zea mays]
Length = 674
Score = 68.2 bits (165), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 87/367 (23%), Positives = 152/367 (41%), Gaps = 81/367 (22%)
Query: 278 YIALGTNYNY-SEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
Y +GT Y E+ +GRIL+F + E G +++I KE KG V ++
Sbjct: 369 YYCVGTAYVIPEENEPTKGRILVFAV-----EDG------SLQLIVEKETKGAVYSLNAF 417
Query: 337 AGFLVTAVGQKI--YIWQLKDNDLTGIAFIDTEV----YIASMVSVK--NLILVGDYARS 388
G L+ A+ QKI Y W +++ G + +E +I ++ + + I+VGD +S
Sbjct: 418 NGKLLAAINQKIQLYKWMSRED---GSHELQSECGHHGHILALYTQTRGDFIVVGDLMKS 474
Query: 389 IALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEIC 448
I+LL Y+ E + ARDY W ++ E L+
Sbjct: 475 ISLLVYKHEESAIEERARDYNAN----------------------W----MTAVEMLDDE 508
Query: 449 KKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNT 508
+G+++ G+ + KN +A + +L ++HLG+ VN
Sbjct: 509 VYVGAEN----------GYNLFTVRKN-------SDAATDDERAKLEVVGEYHLGEFVNR 551
Query: 509 F----FKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 564
F +R S I P + +++G +G LP +Y L Q+ +V +
Sbjct: 552 FRHGSLVMRLPDSEIGKIP------TVIFGTINGVIGIIASLPHDHYTFLEKFQSTLVKY 605
Query: 565 TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHND- 623
G ++ +R++ A +R +DG L+ FL LS + + K +G D
Sbjct: 606 IKGVGNMSHEQWRSFHNDKKTA--EARNFLDGDLIESFLDLSRSKMEVVSKAMGVSVEDL 663
Query: 624 --ILDEL 628
I++EL
Sbjct: 664 SKIVEEL 670
>gi|340381612|ref|XP_003389315.1| PREDICTED: DNA damage-binding protein 1-like [Amphimedon
queenslandica]
Length = 1142
Score = 68.2 bits (165), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 101/483 (20%), Positives = 180/483 (37%), Gaps = 93/483 (19%)
Query: 171 VRKVPLKCTPHFLAYHLETKTYCIVTSTAEPS--TDYYKFNGEDKELVTDPRDSRFIPPL 228
+ +PL +P +AY ++T+ + + S + Y + + T +PP
Sbjct: 715 IETIPLGESPRCIAYQESSQTFLVGGYRTDKSGPDNTYTPSRQSVSTRTSNVSVAVVPPQ 774
Query: 229 --VSQFHVSLFSPFSWEEIPQTNFP------LHEWEHVLCLKNVSMEYEGTLSGLRGYIA 280
+ +F S QT F L EH+LC+ + ++ T R
Sbjct: 775 LNIEEFKCPQVEMHSLILFDQTTFDVSHVYQLCPQEHILCVTSCNLT---TNDEERSVYV 831
Query: 281 LGTNYNYSED-VTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGF 339
+GT E+ + GRIL+F + K+++++ K + G V + G
Sbjct: 832 VGTALVKPEEKESSTGRILVFAV-----------NSGKLELLHEKLENGAVFQVLGFNGK 880
Query: 340 LVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYR 399
++ +V +++ L D L + + + + ILVGD RS+ LL Y+ E
Sbjct: 881 ILNSVNSGVFVNALVDGALKEECAYKNNILALYLKTKGDFILVGDILRSLKLLVYKEE-- 938
Query: 400 TLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDIL 459
LG ++IG HN I
Sbjct: 939 -----------------------------------------LG-----LEEIGVDHN-IS 951
Query: 460 DEFSSMGFMISDKD------KNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIR 513
F + MI D++ +++ + EA +++ + + G +VN F
Sbjct: 952 PCFCTAIEMIDDENYLGADGRHIFICQKNTEATSEADLLYMVQPSRMYFGDNVNVF---- 1007
Query: 514 CKPSSISDAPGARSRFL-----TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHT 568
+ S + D PGA + L + ++ GA+G L Y L LQ M +
Sbjct: 1008 SRGSFVMDHPGAGASSLLQGKPILFGTVHGAIGLIGTLNMDTYTLLSKLQQKMAANIKSV 1067
Query: 569 GGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEL 628
G + +R++ + + P G IDG LV KFL+L + +I + G K D+
Sbjct: 1068 GNIEHEIYRSFSNE--HRSKPFAGFIDGDLVEKFLELPRPQMSQIVQ--GIKTTDVTGTE 1123
Query: 629 YDI 631
D+
Sbjct: 1124 VDV 1126
>gi|18377609|gb|AAL66955.1| putative UV-damaged DNA binding factor [Arabidopsis thaliana]
Length = 270
Score = 68.2 bits (165), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 74/306 (24%), Positives = 124/306 (40%), Gaps = 66/306 (21%)
Query: 324 KEQKGPVTAICHVAGFLVTAVGQKI--YIWQLKDNDLTGIAFIDTEV-----YIASMVSV 376
KE KG V ++ G L+ A+ QKI Y W L+D+ G + +E +A V
Sbjct: 1 KETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDD---GTRELQSECGHHGHILALYVQT 57
Query: 377 K-NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWK 435
+ + I+VGD +SI+LL Y+ E + ARDY ++
Sbjct: 58 RGDFIVVGDLMKSISLLLYKHEEGAIEERARDYNANWMSAV------------------- 98
Query: 436 FLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLI 495
+ILD+ +G ++ + N++ E RL
Sbjct: 99 ---------------------EILDDDIYLG---AENNFNLLTVKKNSEGATDEERGRLE 134
Query: 496 KKTDFHLGQHVNTF----FKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNY 551
++HLG+ VN F +R S I P + +++G +G LP++ Y
Sbjct: 135 VVGEYHLGEFVNRFRHGSLVMRLPDSEIGQIP------TVIFGTVNGVIGVIASLPQEQY 188
Query: 552 RRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERL 611
L LQ+ + GGL+ +R++ + A +R +DG L+ FL LS +
Sbjct: 189 TFLEKLQSSLRKVIKGVGGLSHEQWRSFNNEKRTA--EARNFLDGDLIESFLDLSRNKME 246
Query: 612 EICKKI 617
+I K +
Sbjct: 247 DISKSM 252
>gi|357132340|ref|XP_003567788.1| PREDICTED: DNA damage-binding protein 1a-like [Brachypodium
distachyon]
Length = 1090
Score = 68.2 bits (165), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 81/354 (22%), Positives = 147/354 (41%), Gaps = 78/354 (22%)
Query: 278 YIALGTNYNY-SEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
Y +GT Y E+ +GRIL+F + E G ++++I KE KG V ++
Sbjct: 785 YYCVGTAYVLPEENEPTKGRILVFAV-----EDG------RLQLIVEKETKGAVYSLNAF 833
Query: 337 AGFLVTAVGQKI--YIWQLKDNDLTGIAFIDTEV----YIASMVSVK--NLILVGDYARS 388
G L+ A+ QKI Y W +++ G + +E +I ++ + + I+VGD +S
Sbjct: 834 NGKLLAAINQKIQLYKWMTRED---GSHELQSECGHHGHILALFTQTRGDFIVVGDLMKS 890
Query: 389 IALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEIC 448
I+LL Y+ E + +ARDY W ++ E ++
Sbjct: 891 ISLLVYKHEESAIEELARDYNAN----------------------W----MTAVEMID-- 922
Query: 449 KKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNT 508
+DI ++ ++ N+ +A RL ++HLG+ VN
Sbjct: 923 -------DDI--------YVGAENSYNLFTVRKNSDAATDEERGRLEVVGEYHLGEFVNR 967
Query: 509 F----FKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 564
F +R + + P + +++G +G LP Y L LQ+++
Sbjct: 968 FRHGSLVMRLPDTEMGQIP------TVIFGTINGVIGIIASLPHDQYVFLEKLQSILGKF 1021
Query: 565 TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 618
G L+ +R++ + A +R +DG L+ FL L+ + E+ K +G
Sbjct: 1022 IKGVGSLSHDQWRSFHNEKKTA--EARNFLDGDLIESFLDLNRSKMEEVSKGMG 1073
>gi|325186344|emb|CCA20849.1| predicted protein putative [Albugo laibachii Nc14]
Length = 1148
Score = 67.8 bits (164), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 90/431 (20%), Positives = 180/431 (41%), Gaps = 83/431 (19%)
Query: 229 VSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYS 288
V Q ++ LF ++E + +F L +E + ++ + G SG Y+ +GT + +
Sbjct: 769 VEQGYIRLFDDQTFECLK--SFRLDPFESPCSI--ITCIFTGDSSGGTYYV-VGTAFVHE 823
Query: 289 EDVT-CRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQK 347
E+ +GRIL+F + + + +++++ KE KG V + G L+ V K
Sbjct: 824 EEAEPHQGRILVFTVSGIHGD-------RRLQLVTEKEVKGSVYCLNAFNGKLLAGVNSK 876
Query: 348 IYIWQLKDNDLTGIAFIDT-----EVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLS 402
+Y+++ +++ G + + M S + I+VGD +SI+LL ++ ++
Sbjct: 877 VYLFKWSESEENGEELVSECGHHGHTLVLYMESRGDFIVVGDLMKSISLLNHKQLDGSIE 936
Query: 403 LVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEF 462
+ARD + G I+D+
Sbjct: 937 EIARDLNSNWMTAVG----------------------------------------IIDDD 956
Query: 463 SSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF----FKIRCKPSS 518
+ +G S+ D N+ A RL ++HLG+ VN F ++ S
Sbjct: 957 NYVG---SETDFNLFTVQRNSGAASDEERGRLETIGEYHLGEFVNRFRYGSLVMQHNLSI 1013
Query: 519 ISDAPGA-----RSRFLT--------WYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHT 565
++APG R L+ + ++ G +G LP+ ++ + L+ +Q+ +
Sbjct: 1014 GAEAPGISLSDDRPESLSPLSVQRSMLFGTVSGMIGVILPISKEKHEFLMRVQSALNQVI 1073
Query: 566 SHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDIL 625
GG + +RT++ + + + IDG L+ FL LS E ++ ++ + D L
Sbjct: 1074 QGVGGFSHSEWRTFENR--RSSIEAHNFIDGDLIESFLDLSKDEMKQVVDEL---NRDQL 1128
Query: 626 DELYDIEALSS 636
+ +EAL++
Sbjct: 1129 EGKTTLEALAA 1139
>gi|320163506|gb|EFW40405.1| UV-damaged DNA binding protein [Capsaspora owczarzaki ATCC 30864]
Length = 1123
Score = 67.0 bits (162), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 102/426 (23%), Positives = 167/426 (39%), Gaps = 88/426 (20%)
Query: 171 VRKVPLKCTPHFLAYHLETKTYCIVTST-AEP------STDYYKFNGEDKELVTD--PRD 221
VR +PL P +AYH T+TY + T T AEP S + + + D PR
Sbjct: 765 VRAIPLGEMPRRIAYHEPTRTYGVATVTLAEPLPVGSNSGNVAARAQNVRPMAFDDGPRS 824
Query: 222 SRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIAL 281
+ L V LF ++E + +F L E ++ + S + + S + Y+ +
Sbjct: 825 PSDV--LEDTSFVRLFDGQTFE--IRDSFQLPSTETIMSFISCSFANDSSDSTV--YLVV 878
Query: 282 GTNYNY-SEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFL 340
GT + SED RGRIL+FD+ + ++ AK+ KG V ++ G L
Sbjct: 879 GTAFVIPSEDEPKRGRILVFDV-----------AGGALHLVTAKDVKGCVYSLNAFNGKL 927
Query: 341 VTAVGQKI--YIWQLKDNDLTGIAFIDTE------VYIASMVSVKNLILVGDYARSIALL 392
+ + K+ + W L + GI + +E + + S + I+VGD RSI+LL
Sbjct: 928 LAGINSKVNLFKWNLTGD---GIRELVSECSHHGHILTLYLKSRGDFIIVGDLMRSISLL 984
Query: 393 RYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 452
Y+ ++ +A+D T PN W
Sbjct: 985 MYKSGTSSIEEIAQD---TCPN-------------------W------------------ 1004
Query: 453 SKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 512
D+LD+ +G + N+ EA RL +FH+G+ +N F
Sbjct: 1005 VTAVDMLDDDVFIG---GESSFNIFTCRRNLEASTDEERKRLEVVGEFHVGEFINQFR-- 1059
Query: 513 RCKPSSISDAPGARSRFL---TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
S + P + + + T + + +G +G L Y L ++Q M G
Sbjct: 1060 --AGSLVMKLPDEQEQPIQPSTLFGTGNGVIGVIARLTRSQYEFLQLVQAAMAKVIKGVG 1117
Query: 570 GLNPRA 575
GLN A
Sbjct: 1118 GLNHSA 1123
>gi|425777692|gb|EKV15851.1| UV-damaged DNA binding protein, putative [Penicillium digitatum Pd1]
gi|425779888|gb|EKV17916.1| UV-damaged DNA binding protein, putative [Penicillium digitatum
PHI26]
Length = 1140
Score = 65.9 bits (159), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 78/365 (21%), Positives = 146/365 (40%), Gaps = 64/365 (17%)
Query: 275 LRGYIALGTNY-NYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAI 333
++ +GT + + ++ + RGRIL+ ++ + G+ L++ + + +
Sbjct: 831 VKDRFVVGTAFADEDQEESIRGRILILEV-----DHGRKLSQVAELPVMGACRALAMMGD 885
Query: 334 CHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLR 393
C VA +V ++ I + L +A T + V NLI V D +S+ L+R
Sbjct: 886 CIVAALVVV---YRVKINNVGPMKLEKLAAYRTSTAPVDVTVVDNLIAVADLMKSLCLIR 942
Query: 394 YQP----EYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICK 449
Y P E L+ V R Y+ VW +G+
Sbjct: 943 YTPGHTGEPAKLTEVGRHYQT----------------------VWSTAIACVGDET---- 976
Query: 450 KIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 509
F+ SD + N+++ + HRLI ++ LG+ VN
Sbjct: 977 -----------------FLQSDAEGNLIVLSRNTNGVTAQDKHRLIPTSEISLGEMVN-- 1017
Query: 510 FKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
R +P I + A+++G++ F + ++ L+ LQ + + G
Sbjct: 1018 ---RIRPVHIPQLCSVMVTPRAFMATVEGSIFLFAVINPEHQDFLMTLQAALSQKLNSLG 1074
Query: 570 GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELY 629
L+ FR ++ A P R +DG L+ +FL+ + + EI +++GS +D+ +
Sbjct: 1075 NLSFDKFRGFRTMVRSAAAPYR-FVDGELIEQFLKCTPSMQEEIAQEVGS--SDVGEVKR 1131
Query: 630 DIEAL 634
IEAL
Sbjct: 1132 LIEAL 1136
>gi|402592185|gb|EJW86114.1| CPSF A subunit region family protein [Wuchereria bancrofti]
Length = 278
Score = 65.5 bits (158), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 67/290 (23%), Positives = 114/290 (39%), Gaps = 48/290 (16%)
Query: 317 KIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSV 376
+++++Y KE KG +I + G LV AV + +++ + + D + A +
Sbjct: 11 RMRLVYEKEIKGAAYSIQSMDGKLVVAVNSCVRLFEWTADKELRLECSDFDNVTALYLKT 70
Query: 377 KN-LILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWK 435
KN LILVGD RS++LL Y+ T VARD+ W
Sbjct: 71 KNDLILVGDLMRSLSLLSYKSMESTFEKVARDFMTN----------------------W- 107
Query: 436 FLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLI 495
+ C+ I S + F+ ++ N+ M G RL
Sbjct: 108 ---------MSACEIIDSDN-----------FLGAENSYNLFTVMKDSFTVFKEEGTRLQ 147
Query: 496 KKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLL 555
+ F+LG+ VN F + + AP S L Y + DG +G + +P Y L
Sbjct: 148 ELGLFYLGEMVNVFCHGSLTATQVDVAPLYHSSIL--YGTSDGGIGVIVQMPPVLYTFLQ 205
Query: 556 MLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQL 605
+Q + + + ++ +RT++ + G IDG L+ L +
Sbjct: 206 DVQKRLAEYAENCMRISHTQYRTFETEK--RSEAPNGFIDGDLIESLLDM 253
>gi|402223178|gb|EJU03243.1| hypothetical protein DACRYDRAFT_115454 [Dacryopinax sp. DJM-731
SS1]
Length = 1175
Score = 64.7 bits (156), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 77/336 (22%), Positives = 143/336 (42%), Gaps = 48/336 (14%)
Query: 65 LFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPM 124
+ + + S RA ++ G + + +++ +F CG PA LFL + L A P+
Sbjct: 682 IVLGEPSVRATDKKIFSLGTKPIMLNACTDLGRESNIFACGDRPALLFLKN-DRLTASPI 740
Query: 125 TIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKC-TPHFL 183
+ + + H P F++ +A S L I + D VR + L TP L
Sbjct: 741 KLRD-IHAGSVLHIPQFPSSFIFASA-STLLIGQIRESQKID----VRTISLGLDTPIRL 794
Query: 184 AYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWE 243
YH + Y +V E + + +D+E+ + S F LF ++E
Sbjct: 795 TYHRGLRAYGVVCQRKELNRE------DDREIYS------------SSF--KLFDDITFE 834
Query: 244 EIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGT-NYNYSEDVTCRGRILLFDI 302
+ NF E ++C+ + + T ++ +GT +E+ +GRIL+F
Sbjct: 835 YL--NNFTARPDEQMMCVTTIP---DSTGEEDSDFLVVGTYEATGAEEDVSKGRILIF-- 887
Query: 303 IEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQL----KDNDL 358
E VP K+K++ + + G V A+ +V L A+ + ++ L D +
Sbjct: 888 -EEVP-------NRKLKLVVSHDVGGCVYAVTNVGANLAAAINGTLQVFSLHRSHDDIRI 939
Query: 359 TGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
+A + +S++ N +LVGD R++ +LR+
Sbjct: 940 ESVAKWSSAYVASSLICRGNTLLVGDAMRAVCILRW 975
>gi|345328202|ref|XP_003431248.1| PREDICTED: DNA damage-binding protein 1-like [Ornithorhynchus
anatinus]
Length = 1045
Score = 64.7 bits (156), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 81/368 (22%), Positives = 142/368 (38%), Gaps = 77/368 (20%)
Query: 278 YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
Y +GT Y E+ + GRI++F + K++ + KE KG V ++
Sbjct: 733 YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 781
Query: 337 AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
G L+ ++ ++Y W + T + + + + + ILVGD RS+ LL Y
Sbjct: 782 NGKLLASINSTVRLYEWTAEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 840
Query: 395 QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
+P +ARD+ P ++
Sbjct: 841 KPMEGNFEEIARDFNPNWMSAV-------------------------------------- 862
Query: 455 HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
+ILD+ + +G + LF+ Q ++ + R L + FHLG+ VN F
Sbjct: 863 --EILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 912
Query: 513 RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
C S + G S + + +++G +G L E Y LL +QN + G
Sbjct: 913 -CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 971
Query: 570 GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
+ +R++ + P+ G IDG L+ FL +S + E+ + G K
Sbjct: 972 KIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQIDDGSGMKREA 1029
Query: 624 ILDELYDI 631
+D+L I
Sbjct: 1030 TVDDLIKI 1037
>gi|90108797|pdb|2B5L|A Chain A, Crystal Structure Of Ddb1 In Complex With Simian Virus 5 V
Protein
gi|90108798|pdb|2B5L|B Chain B, Crystal Structure Of Ddb1 In Complex With Simian Virus 5 V
Protein
gi|90108801|pdb|2B5M|A Chain A, Crystal Structure Of Ddb1
gi|116667897|pdb|2HYE|A Chain A, Crystal Structure Of The Ddb1-cul4a-rbx1-sv5v Complex
gi|1136228|gb|AAA88883.1| UV-damaged DNA binding factor [Homo sapiens]
gi|1588524|prf||2208446A xeroderma pigmentosum group E-binding factor
Length = 1140
Score = 64.3 bits (155), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 81/368 (22%), Positives = 141/368 (38%), Gaps = 77/368 (20%)
Query: 278 YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
Y +GT Y E+ + GRI++F + K++ + KE KG V ++
Sbjct: 828 YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 876
Query: 337 AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
G L+ ++ ++Y W + + T + + + + + ILVGD RS+ LL Y
Sbjct: 877 NGKLLASINSTVRLYEWTTEKDVRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 935
Query: 395 QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
+P +ARD+ P W
Sbjct: 936 KPMEGNFEEIARDFNPN----------------------WM------------------S 955
Query: 455 HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
+ILD+ + +G + LF+ Q ++ + R L + FHLG+ VN F
Sbjct: 956 AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 1007
Query: 513 RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
C S + G S + + +++G +G L E Y LL +QN + G
Sbjct: 1008 -CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066
Query: 570 GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
+ +R++ + P+ G IDG L+ FL +S + E+ + G K
Sbjct: 1067 KIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREA 1124
Query: 624 ILDELYDI 631
D+L +
Sbjct: 1125 TADDLIKV 1132
>gi|270346571|pdb|3I7H|A Chain A, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
Hbx
gi|270346573|pdb|3I7K|A Chain A, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
Whx
gi|270346575|pdb|3I7L|A Chain A, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
Ddb2
gi|270346577|pdb|3I7N|A Chain A, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
Wdtc1
gi|270346579|pdb|3I7O|A Chain A, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
Iqwd1
gi|270346581|pdb|3I7P|A Chain A, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
Wdr40a
gi|270346583|pdb|3I89|A Chain A, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
Wdr22
gi|270346585|pdb|3I8C|A Chain A, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
Wdr21a
gi|270346587|pdb|3I8E|A Chain A, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
Wdr42a
gi|270346588|pdb|3I8E|B Chain B, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
Wdr42a
Length = 1143
Score = 64.3 bits (155), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 81/368 (22%), Positives = 141/368 (38%), Gaps = 77/368 (20%)
Query: 278 YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
Y +GT Y E+ + GRI++F + K++ + KE KG V ++
Sbjct: 831 YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 879
Query: 337 AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
G L+ ++ ++Y W + + T + + + + + ILVGD RS+ LL Y
Sbjct: 880 NGKLLASINSTVRLYEWTTEKDVRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 938
Query: 395 QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
+P +ARD+ P W
Sbjct: 939 KPMEGNFEEIARDFNPN----------------------WM------------------S 958
Query: 455 HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
+ILD+ + +G + LF+ Q ++ + R L + FHLG+ VN F
Sbjct: 959 AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 1010
Query: 513 RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
C S + G S + + +++G +G L E Y LL +QN + G
Sbjct: 1011 -CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1069
Query: 570 GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
+ +R++ + P+ G IDG L+ FL +S + E+ + G K
Sbjct: 1070 KIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREA 1127
Query: 624 ILDELYDI 631
D+L +
Sbjct: 1128 TADDLIKV 1135
>gi|221046721|pdb|3EI4|A Chain A, Structure Of The Hsddb1-Hsddb2 Complex
gi|221046723|pdb|3EI4|C Chain C, Structure Of The Hsddb1-Hsddb2 Complex
gi|221046725|pdb|3EI4|E Chain E, Structure Of The Hsddb1-Hsddb2 Complex
Length = 1158
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 81/368 (22%), Positives = 141/368 (38%), Gaps = 77/368 (20%)
Query: 278 YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
Y +GT Y E+ + GRI++F + K++ + KE KG V ++
Sbjct: 846 YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 894
Query: 337 AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
G L+ ++ ++Y W + + T + + + + + ILVGD RS+ LL Y
Sbjct: 895 NGKLLASINSTVRLYEWTTEKDVRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 953
Query: 395 QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
+P +ARD+ P W
Sbjct: 954 KPMEGNFEEIARDFNPN----------------------WM------------------S 973
Query: 455 HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
+ILD+ + +G + LF+ Q ++ + R L + FHLG+ VN F
Sbjct: 974 AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 1025
Query: 513 RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
C S + G S + + +++G +G L E Y LL +QN + G
Sbjct: 1026 -CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1084
Query: 570 GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
+ +R++ + P+ G IDG L+ FL +S + E+ + G K
Sbjct: 1085 KIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREA 1142
Query: 624 ILDELYDI 631
D+L +
Sbjct: 1143 TADDLIKV 1150
>gi|2632123|emb|CAA05770.1| Xeroderma Pigmentosum Group E Complementing protein [Homo sapiens]
Length = 1140
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 81/368 (22%), Positives = 141/368 (38%), Gaps = 77/368 (20%)
Query: 278 YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
Y +GT Y E+ + GRI++F + K++ + KE KG V ++
Sbjct: 828 YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGDVYSMVEF 876
Query: 337 AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
G L+ ++ ++Y W + + T + + + + + ILVGD RS+ LL Y
Sbjct: 877 NGKLLASINSTVRLYEWTTEKDVRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 935
Query: 395 QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
+P +ARD+ P W
Sbjct: 936 KPMEGNFEEIARDFNPN----------------------WM------------------S 955
Query: 455 HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
+ILD+ + +G + LF+ Q ++ + R L + FHLG+ VN F
Sbjct: 956 AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 1007
Query: 513 RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
C S + G S + + +++G +G L E Y LL +QN + G
Sbjct: 1008 -CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066
Query: 570 GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
+ +R++ + P+ G IDG L+ FL +S + E+ + G K
Sbjct: 1067 KIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREA 1124
Query: 624 ILDELYDI 631
D+L +
Sbjct: 1125 TADDLIKV 1132
>gi|395544366|ref|XP_003774082.1| PREDICTED: DNA damage-binding protein 1 [Sarcophilus harrisii]
Length = 1239
Score = 63.9 bits (154), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 81/368 (22%), Positives = 140/368 (38%), Gaps = 77/368 (20%)
Query: 278 YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
Y +GT Y E+ + GRI++F + K++ + KE KG V ++
Sbjct: 927 YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 975
Query: 337 AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
G L+ ++ ++Y W + T + + + + + ILVGD RS+ LL Y
Sbjct: 976 NGKLLASINSTVRLYEWTAEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 1034
Query: 395 QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
+P +ARD+ P W
Sbjct: 1035 KPMEGNFEEIARDFNPN----------------------WM------------------S 1054
Query: 455 HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
+ILD+ + +G + LF+ Q ++ + R L + FHLG+ VN F
Sbjct: 1055 AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 1106
Query: 513 RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
C S + G S + + +++G +G L E Y LL +QN + G
Sbjct: 1107 -CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1165
Query: 570 GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
+ +R++ + P+ G IDG L+ FL +S + E+ + G K
Sbjct: 1166 KIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREA 1223
Query: 624 ILDELYDI 631
D+L +
Sbjct: 1224 TADDLIKV 1231
>gi|429965418|gb|ELA47415.1| hypothetical protein VCUG_01066 [Vavraia culicis 'floridensis']
Length = 1176
Score = 63.5 bits (153), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 51/208 (24%), Positives = 90/208 (43%), Gaps = 41/208 (19%)
Query: 294 RGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQL 353
RGRIL+F++I V+ + TK +K++ ++ KGP++ V G + ++ K+ +++
Sbjct: 898 RGRILVFEVINVIGDMVAKKTKKALKLLGSERTKGPISCCAAVRGKIAVSLATKLMVYEC 957
Query: 354 KDND-LTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQ 412
N + IAF D +Y S+ +KN I+VGD + + +Q E L L+++ +
Sbjct: 958 DRNSGIVAIAFYDLYMYAVSLAVIKNYIIVGDIMMGLHFVYFQSEPVKLHLLSKSDRIAN 1017
Query: 413 PNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDK 472
S ++ GE L I DK
Sbjct: 1018 LGSLDFFNA--------------------GESLFITG--------------------IDK 1037
Query: 473 DKNVVLFMYQPEARESNGGHRLIKKTDF 500
V +F + P SNGG +L+K+ +F
Sbjct: 1038 TGKVQIFSFSPSNLYSNGGEKLVKRQEF 1065
>gi|400600376|gb|EJP68050.1| CPSF A subunit region [Beauveria bassiana ARSEF 2860]
Length = 1174
Score = 63.5 bits (153), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 101/546 (18%), Positives = 202/546 (36%), Gaps = 111/546 (20%)
Query: 97 GYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRI 156
G +F H + ++ +S G L T D + + PF + P L K+
Sbjct: 711 GTSSIFATTEHSSLIY-SSEGRLVYSATTADN-ATCVVPFDSYGFPHCILVSTDKNVRIC 768
Query: 157 SVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELV 216
V L++ V+ +P+ T +AY P + K+L+
Sbjct: 769 RVDKERLTH-----VKSLPVHETVRRVAY--------------APGAKAFALGCIKKDLI 809
Query: 217 TDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNV-SMEYEGTLSGL 275
+ V V L ++E+ T PL + +++V E L
Sbjct: 810 QNAE--------VITSSVKLVDEIMFQEL-GTPLPLAASSTLEMVESVIRAELPDPTGAL 860
Query: 276 RGYIALGTNYNYSEDV----TCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVT 331
+GT++ +V RGRIL+ + E K ++ I + KG
Sbjct: 861 VERFVVGTSFVNDAEVGEAGETRGRILVLGVDE----------KRQLYTIVSHNLKGACR 910
Query: 332 AICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVK------------NL 379
+ + ++V + + + ++ + + TE Y+ + + + N+
Sbjct: 911 CLGILDEYIVACLAKTVVVYSYTEEN-------STEGYLQKLAAYRPASFPVALDISGNM 963
Query: 380 ILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQL 439
I V D +S++L+ + P +D +P G L K
Sbjct: 964 IGVADIMQSLSLVEFTP--------PKDGEP-------------------GKLEEKARHF 996
Query: 440 SLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTD 499
+C G + ++ +D N+++ P+A + RL ++
Sbjct: 997 QSAWATSVCHLGGER------------WLETDAQGNIIVLARNPDAPTEHDRSRLEITSE 1044
Query: 500 FHLGQHVNTFFKIRCKPS-SISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQ 558
+LG+ +N ++ P+ ++ +P A + AS++G L + + K L+ LQ
Sbjct: 1045 MNLGEQINKIQRLNVAPADNVVVSPKA------FLASIEGTLYLYGDIAPKYQDLLITLQ 1098
Query: 559 NVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 618
+ + TGG++ A+R ++ + A P R +DG +V +FL L + +C+ +G
Sbjct: 1099 TTIEKYVKTTGGISFDAWRAFRNQAREADGPFR-FVDGEMVERFLDLRKQTQAALCQDLG 1157
Query: 619 SKHNDI 624
D+
Sbjct: 1158 LNVEDV 1163
>gi|194377326|dbj|BAG57611.1| unnamed protein product [Homo sapiens]
Length = 451
Score = 63.5 bits (153), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 82/366 (22%), Positives = 139/366 (37%), Gaps = 73/366 (19%)
Query: 278 YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
Y +GT Y E+ + GRI++F + K++ + KE KG V ++
Sbjct: 139 YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 187
Query: 337 AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
G L+ ++ ++Y W + T + + + + + ILVGD RS+ LL Y
Sbjct: 188 NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 246
Query: 395 QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
+P +ARD+ P + + I+D FL L +C+K +
Sbjct: 247 KPMEGNFEEIARDFNPNWMS---------AVEILDDD---NFLGAENAFNLFVCQKDSAA 294
Query: 455 HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC 514
D E R+ L + FHLG+ VN F C
Sbjct: 295 TTD--------------------------EERQ-----HLQEVGLFHLGEFVNVF----C 319
Query: 515 KPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGL 571
S + G S + + +++G +G L E Y LL +QN + G +
Sbjct: 320 HGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKI 379
Query: 572 NPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHNDIL 625
+R++ + P+ G IDG L+ FL +S + E+ + G K
Sbjct: 380 EHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATA 437
Query: 626 DELYDI 631
D+L +
Sbjct: 438 DDLIKV 443
>gi|74138855|dbj|BAE27231.1| unnamed protein product [Mus musculus]
Length = 1140
Score = 63.5 bits (153), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 81/368 (22%), Positives = 140/368 (38%), Gaps = 77/368 (20%)
Query: 278 YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
Y +GT Y E+ + GRI++F + K++ + KE KG V ++
Sbjct: 828 YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 876
Query: 337 AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
G L+ ++ ++Y W + T + + + + + ILVGD RS+ LL Y
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 935
Query: 395 QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
+P +ARD+ P W
Sbjct: 936 KPMEGNFEEIARDFNPN----------------------WM------------------S 955
Query: 455 HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
+ILD+ + +G + LF+ Q ++ + R L + FHLG+ VN F
Sbjct: 956 AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 1007
Query: 513 RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
C S + G S + + +++G +G L E Y LL +QN + G
Sbjct: 1008 -CHGSLVMQNLGEASTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066
Query: 570 GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
+ +R++ + P+ G IDG L+ FL +S + E+ + G K
Sbjct: 1067 KIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREA 1124
Query: 624 ILDELYDI 631
D+L +
Sbjct: 1125 TADDLIKV 1132
>gi|410974071|ref|XP_003993471.1| PREDICTED: DNA damage-binding protein 1 [Felis catus]
Length = 1193
Score = 63.5 bits (153), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 81/368 (22%), Positives = 140/368 (38%), Gaps = 77/368 (20%)
Query: 278 YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
Y +GT Y E+ + GRI++F + K++ + KE KG V ++
Sbjct: 881 YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 929
Query: 337 AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
G L+ ++ ++Y W + T + + + + + ILVGD RS+ LL Y
Sbjct: 930 NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 988
Query: 395 QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
+P +ARD+ P W
Sbjct: 989 KPMEGNFEEIARDFNPN----------------------WM------------------S 1008
Query: 455 HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
+ILD+ + +G + LF+ Q ++ + R L + FHLG+ VN F
Sbjct: 1009 AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 1060
Query: 513 RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
C S + G S + + +++G +G L E Y LL +QN + G
Sbjct: 1061 -CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1119
Query: 570 GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
+ +R++ + P+ G IDG L+ FL +S + E+ + G K
Sbjct: 1120 KIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREA 1177
Query: 624 ILDELYDI 631
D+L +
Sbjct: 1178 TADDLIKV 1185
>gi|194381178|dbj|BAG64157.1| unnamed protein product [Homo sapiens]
Length = 826
Score = 63.5 bits (153), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 81/368 (22%), Positives = 140/368 (38%), Gaps = 77/368 (20%)
Query: 278 YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
Y +GT Y E+ + GRI++F + K++ + KE KG V ++
Sbjct: 514 YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 562
Query: 337 AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
G L+ ++ ++Y W + T + + + + + ILVGD RS+ LL Y
Sbjct: 563 NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 621
Query: 395 QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
+P +ARD+ P W
Sbjct: 622 KPMEGNFEEIARDFNPN----------------------WM------------------S 641
Query: 455 HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
+ILD+ + +G + LF+ Q ++ + R L + FHLG+ VN F
Sbjct: 642 AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 693
Query: 513 RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
C S + G S + + +++G +G L E Y LL +QN + G
Sbjct: 694 -CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 752
Query: 570 GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
+ +R++ + P+ G IDG L+ FL +S + E+ + G K
Sbjct: 753 KIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREA 810
Query: 624 ILDELYDI 631
D+L +
Sbjct: 811 TADDLIKV 818
>gi|148529014|ref|NP_001914.3| DNA damage-binding protein 1 [Homo sapiens]
gi|296218432|ref|XP_002807395.1| PREDICTED: LOW QUALITY PROTEIN: DNA damage-binding protein 1
[Callithrix jacchus]
gi|397516558|ref|XP_003828491.1| PREDICTED: DNA damage-binding protein 1 [Pan paniscus]
gi|402893195|ref|XP_003909786.1| PREDICTED: DNA damage-binding protein 1 [Papio anubis]
gi|426368721|ref|XP_004051351.1| PREDICTED: DNA damage-binding protein 1 [Gorilla gorilla gorilla]
gi|12643730|sp|Q16531.1|DDB1_HUMAN RecName: Full=DNA damage-binding protein 1; AltName: Full=DDB p127
subunit; AltName: Full=DNA damage-binding protein a;
Short=DDBa; AltName: Full=Damage-specific DNA-binding
protein 1; AltName: Full=HBV X-associated protein 1;
Short=XAP-1; AltName: Full=UV-damaged DNA-binding factor;
AltName: Full=UV-damaged DNA-binding protein 1;
Short=UV-DDB 1; AltName: Full=XPE-binding factor;
Short=XPE-BF; AltName: Full=Xeroderma pigmentosum group
E-complementing protein; Short=XPCe
gi|203282525|pdb|3E0C|A Chain A, Crystal Structure Of Dna Damage-Binding Protein 1(Ddb1)
gi|695362|gb|AAA62838.1| X-associated protein 1, partial [Homo sapiens]
gi|1052865|gb|AAC50349.1| DDBa p127 [Homo sapiens]
gi|15079750|gb|AAH11686.1| Damage-specific DNA binding protein 1, 127kDa [Homo sapiens]
gi|29792243|gb|AAH50530.1| Damage-specific DNA binding protein 1, 127kDa [Homo sapiens]
gi|30354567|gb|AAH51764.1| Damage-specific DNA binding protein 1, 127kDa [Homo sapiens]
gi|61354161|gb|AAX44048.1| damage-specific DNA binding protein 1, 127kDa [Homo sapiens]
gi|119594341|gb|EAW73935.1| damage-specific DNA binding protein 1, 127kDa, isoform CRA_c [Homo
sapiens]
gi|168275638|dbj|BAG10539.1| DNA damage-binding protein 1 [synthetic construct]
gi|189065506|dbj|BAG35345.1| unnamed protein product [Homo sapiens]
gi|355566436|gb|EHH22815.1| Damage-specific DNA-binding protein 1 [Macaca mulatta]
gi|380784123|gb|AFE63937.1| DNA damage-binding protein 1 [Macaca mulatta]
gi|380808126|gb|AFE75938.1| DNA damage-binding protein 1 [Macaca mulatta]
gi|380810144|gb|AFE76947.1| DNA damage-binding protein 1 [Macaca mulatta]
gi|383408123|gb|AFH27275.1| DNA damage-binding protein 1 [Macaca mulatta]
gi|410305600|gb|JAA31400.1| damage-specific DNA binding protein 1, 127kDa [Pan troglodytes]
gi|410352015|gb|JAA42611.1| damage-specific DNA binding protein 1, 127kDa [Pan troglodytes]
Length = 1140
Score = 63.2 bits (152), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 81/368 (22%), Positives = 140/368 (38%), Gaps = 77/368 (20%)
Query: 278 YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
Y +GT Y E+ + GRI++F + K++ + KE KG V ++
Sbjct: 828 YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 876
Query: 337 AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
G L+ ++ ++Y W + T + + + + + ILVGD RS+ LL Y
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 935
Query: 395 QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
+P +ARD+ P W
Sbjct: 936 KPMEGNFEEIARDFNPN----------------------WM------------------S 955
Query: 455 HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
+ILD+ + +G + LF+ Q ++ + R L + FHLG+ VN F
Sbjct: 956 AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 1007
Query: 513 RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
C S + G S + + +++G +G L E Y LL +QN + G
Sbjct: 1008 -CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066
Query: 570 GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
+ +R++ + P+ G IDG L+ FL +S + E+ + G K
Sbjct: 1067 KIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREA 1124
Query: 624 ILDELYDI 631
D+L +
Sbjct: 1125 TADDLIKV 1132
>gi|400260815|pdb|4E54|A Chain A, Damaged Dna Induced Uv-Damaged Dna-Binding Protein (Uv-Ddb)
Dimerization And Its Roles In Chromatinized Dna Repair
gi|401871507|pdb|4E5Z|A Chain A, Damaged Dna Induced Uv-Damaged Dna-Binding Protein (Uv-Ddb)
Dimerization And Its Roles In Chromatinized Dna Repair
Length = 1150
Score = 63.2 bits (152), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 81/368 (22%), Positives = 140/368 (38%), Gaps = 77/368 (20%)
Query: 278 YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
Y +GT Y E+ + GRI++F + K++ + KE KG V ++
Sbjct: 838 YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 886
Query: 337 AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
G L+ ++ ++Y W + T + + + + + ILVGD RS+ LL Y
Sbjct: 887 NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 945
Query: 395 QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
+P +ARD+ P W
Sbjct: 946 KPMEGNFEEIARDFNPN----------------------WM------------------S 965
Query: 455 HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
+ILD+ + +G + LF+ Q ++ + R L + FHLG+ VN F
Sbjct: 966 AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 1017
Query: 513 RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
C S + G S + + +++G +G L E Y LL +QN + G
Sbjct: 1018 -CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1076
Query: 570 GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
+ +R++ + P+ G IDG L+ FL +S + E+ + G K
Sbjct: 1077 KIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREA 1134
Query: 624 ILDELYDI 631
D+L +
Sbjct: 1135 TADDLIKV 1142
>gi|361132523|pdb|4A0L|A Chain A, Structure Of Ddb1-Ddb2-Cul4b-Rbx1 Bound To A 12 Bp Abasic
Site Containing Dna-Duplex
gi|361132525|pdb|4A0L|C Chain C, Structure Of Ddb1-Ddb2-Cul4b-Rbx1 Bound To A 12 Bp Abasic
Site Containing Dna-Duplex
Length = 1144
Score = 63.2 bits (152), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 81/368 (22%), Positives = 140/368 (38%), Gaps = 77/368 (20%)
Query: 278 YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
Y +GT Y E+ + GRI++F + K++ + KE KG V ++
Sbjct: 832 YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 880
Query: 337 AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
G L+ ++ ++Y W + T + + + + + ILVGD RS+ LL Y
Sbjct: 881 NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 939
Query: 395 QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
+P +ARD+ P W
Sbjct: 940 KPMEGNFEEIARDFNPN----------------------WM------------------S 959
Query: 455 HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
+ILD+ + +G + LF+ Q ++ + R L + FHLG+ VN F
Sbjct: 960 AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 1011
Query: 513 RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
C S + G S + + +++G +G L E Y LL +QN + G
Sbjct: 1012 -CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1070
Query: 570 GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
+ +R++ + P+ G IDG L+ FL +S + E+ + G K
Sbjct: 1071 KIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREA 1128
Query: 624 ILDELYDI 631
D+L +
Sbjct: 1129 TADDLIKV 1136
>gi|348526664|ref|XP_003450839.1| PREDICTED: DNA damage-binding protein 1-like [Oreochromis niloticus]
Length = 1140
Score = 63.2 bits (152), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 121/585 (20%), Positives = 219/585 (37%), Gaps = 104/585 (17%)
Query: 75 NEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLA 134
+E+ + G + + +R F +++ VF C P ++ +S +L + + V+ +
Sbjct: 624 SERKKVTLGTQPTVLRTFRSLS-TSNVFACSDRPTVIY-SSNHKLVFSNVNLK-EVNYMC 680
Query: 135 PFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCI 194
P ++ P N S L I + +R VPL +P + Y ++ + +
Sbjct: 681 PLNSEGYPDSLALAN-NSTLTIGTIDEI----QKLHIRTVPLYESPRRICYQEVSQCFGV 735
Query: 195 VTSTAE-----PSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTN 249
++S E +T + + + L + S+ P S S EE+ N
Sbjct: 736 LSSRVEIQDVSGTTSAVRPSASTQALSSSVSSSKLFPSSTSPHETSF-----GEEVEVHN 790
Query: 250 FPL---HEWEHVLCLKNVSMEYEGTLSGLR------GYIALGTNYNYSEDVTCR-GRILL 299
+ H +E + + + EY +L R Y +GT Y E+ + GRI++
Sbjct: 791 LLVVDQHTFEVLHAHQFLPSEYALSLVSCRLGKDPSVYFIVGTAMVYPEEAEPKQGRIIV 850
Query: 300 FDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQ--KIYIWQLKDND 357
F T K++ + KE KG V ++ G + ++ ++Y W +
Sbjct: 851 FH-----------YTDGKLQTVAEKEVKGAVYSMVEFNGKFLASINSTVRLYEWTAEKEL 899
Query: 358 LTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKG 417
T + + + + + ILVGD RS+ LL Y+ +ARD+ P
Sbjct: 900 RTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAYKSMEGNFEEIARDFNPN------ 952
Query: 418 YYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVV 477
W +ILD+ + +G +
Sbjct: 953 ----------------WM------------------SAVEILDDDNFLG-----AENAFN 973
Query: 478 LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARS---RFLTW 532
LF+ Q ++ + R L + FHLG+ VN F C S + G S +
Sbjct: 974 LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVLQNLGESSTPTQGSVL 1029
Query: 533 YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 592
+ +++G +G L E Y LL LQN + G + +R++ + + G
Sbjct: 1030 FGTVNGMIGLVTSLSEGWYSLLLDLQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEQATG 1087
Query: 593 IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 631
IDG L+ FL L + E+ + G K +DE+ I
Sbjct: 1088 FIDGDLIESFLDLGRAKMQEVVSTLQIDDGSGMKREATVDEVIKI 1132
>gi|119594340|gb|EAW73934.1| damage-specific DNA binding protein 1, 127kDa, isoform CRA_b [Homo
sapiens]
Length = 923
Score = 63.2 bits (152), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 81/368 (22%), Positives = 140/368 (38%), Gaps = 77/368 (20%)
Query: 278 YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
Y +GT Y E+ + GRI++F + K++ + KE KG V ++
Sbjct: 611 YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 659
Query: 337 AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
G L+ ++ ++Y W + T + + + + + ILVGD RS+ LL Y
Sbjct: 660 NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 718
Query: 395 QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
+P +ARD+ P W
Sbjct: 719 KPMEGNFEEIARDFNPN----------------------WM------------------S 738
Query: 455 HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
+ILD+ + +G + LF+ Q ++ + R L + FHLG+ VN F
Sbjct: 739 AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 790
Query: 513 RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
C S + G S + + +++G +G L E Y LL +QN + G
Sbjct: 791 -CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 849
Query: 570 GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
+ +R++ + P+ G IDG L+ FL +S + E+ + G K
Sbjct: 850 KIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREA 907
Query: 624 ILDELYDI 631
D+L +
Sbjct: 908 TADDLIKV 915
>gi|418316|sp|P33194.1|DDB1_CERAE RecName: Full=DNA damage-binding protein 1; AltName: Full=DDB p127
subunit; AltName: Full=DDBa; AltName:
Full=Damage-specific DNA-binding protein 1; AltName:
Full=UV-damaged DNA-binding protein 1; Short=UV-DDB 1
gi|304026|gb|AAA03021.1| UV-damaged DNA-binding protein [Chlorocebus aethiops]
Length = 1140
Score = 63.2 bits (152), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 81/368 (22%), Positives = 140/368 (38%), Gaps = 77/368 (20%)
Query: 278 YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
Y +GT Y E+ + GRI++F + K++ + KE KG V ++
Sbjct: 828 YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 876
Query: 337 AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
G L+ ++ ++Y W + T + + + + + ILVGD RS+ LL Y
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 935
Query: 395 QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
+P +ARD+ P W
Sbjct: 936 KPMEGNFEEIARDFNPN----------------------WM------------------S 955
Query: 455 HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
+ILD+ + +G + LF+ Q ++ + R L + FHLG+ VN F
Sbjct: 956 AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 1007
Query: 513 RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
C S + G S + + +++G +G L E Y LL +QN + G
Sbjct: 1008 -CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066
Query: 570 GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
+ +R++ + P+ G IDG L+ FL +S + E+ + G K
Sbjct: 1067 KIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREA 1124
Query: 624 ILDELYDI 631
D+L +
Sbjct: 1125 TADDLIKV 1132
>gi|403255013|ref|XP_003920244.1| PREDICTED: DNA damage-binding protein 1 [Saimiri boliviensis
boliviensis]
Length = 1140
Score = 63.2 bits (152), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 81/368 (22%), Positives = 140/368 (38%), Gaps = 77/368 (20%)
Query: 278 YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
Y +GT Y E+ + GRI++F + K++ + KE KG V ++
Sbjct: 828 YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 876
Query: 337 AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
G L+ ++ ++Y W + T + + + + + ILVGD RS+ LL Y
Sbjct: 877 NGKLLASINSTVRLYEWTAEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 935
Query: 395 QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
+P +ARD+ P W
Sbjct: 936 KPMEGNFEEIARDFNPN----------------------WM------------------S 955
Query: 455 HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
+ILD+ + +G + LF+ Q ++ + R L + FHLG+ VN F
Sbjct: 956 AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 1007
Query: 513 RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
C S + G S + + +++G +G L E Y LL +QN + G
Sbjct: 1008 -CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066
Query: 570 GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
+ +R++ + P+ G IDG L+ FL +S + E+ + G K
Sbjct: 1067 KIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREA 1124
Query: 624 ILDELYDI 631
D+L +
Sbjct: 1125 TADDLIKV 1132
>gi|355752055|gb|EHH56175.1| Damage-specific DNA-binding protein 1, partial [Macaca fascicularis]
Length = 1125
Score = 63.2 bits (152), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 81/368 (22%), Positives = 140/368 (38%), Gaps = 77/368 (20%)
Query: 278 YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
Y +GT Y E+ + GRI++F + K++ + KE KG V ++
Sbjct: 813 YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 861
Query: 337 AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
G L+ ++ ++Y W + T + + + + + ILVGD RS+ LL Y
Sbjct: 862 NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 920
Query: 395 QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
+P +ARD+ P W
Sbjct: 921 KPMEGNFEEIARDFNPN----------------------WM------------------S 940
Query: 455 HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
+ILD+ + +G + LF+ Q ++ + R L + FHLG+ VN F
Sbjct: 941 AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 992
Query: 513 RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
C S + G S + + +++G +G L E Y LL +QN + G
Sbjct: 993 -CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1051
Query: 570 GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
+ +R++ + P+ G IDG L+ FL +S + E+ + G K
Sbjct: 1052 KIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREA 1109
Query: 624 ILDELYDI 631
D+L +
Sbjct: 1110 TADDLIKV 1117
>gi|73983859|ref|XP_533275.2| PREDICTED: DNA damage-binding protein 1 [Canis lupus familiaris]
gi|291409601|ref|XP_002721069.1| PREDICTED: damage-specific DNA binding protein 1 [Oryctolagus
cuniculus]
gi|301781686|ref|XP_002926259.1| PREDICTED: DNA damage-binding protein 1-like [Ailuropoda melanoleuca]
Length = 1140
Score = 63.2 bits (152), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 81/368 (22%), Positives = 140/368 (38%), Gaps = 77/368 (20%)
Query: 278 YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
Y +GT Y E+ + GRI++F + K++ + KE KG V ++
Sbjct: 828 YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 876
Query: 337 AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
G L+ ++ ++Y W + T + + + + + ILVGD RS+ LL Y
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 935
Query: 395 QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
+P +ARD+ P W
Sbjct: 936 KPMEGNFEEIARDFNPN----------------------WM------------------S 955
Query: 455 HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
+ILD+ + +G + LF+ Q ++ + R L + FHLG+ VN F
Sbjct: 956 AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 1007
Query: 513 RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
C S + G S + + +++G +G L E Y LL +QN + G
Sbjct: 1008 -CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066
Query: 570 GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
+ +R++ + P+ G IDG L+ FL +S + E+ + G K
Sbjct: 1067 KIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREA 1124
Query: 624 ILDELYDI 631
D+L +
Sbjct: 1125 TADDLIKV 1132
>gi|7657011|ref|NP_056550.1| DNA damage-binding protein 1 [Mus musculus]
gi|134034087|sp|Q3U1J4.2|DDB1_MOUSE RecName: Full=DNA damage-binding protein 1; AltName: Full=DDB p127
subunit; AltName: Full=Damage-specific DNA-binding
protein 1; AltName: Full=UV-damaged DNA-binding factor
gi|5931596|dbj|BAA84699.1| XPE UV-damaged DNA binding factor [Mus musculus]
gi|16307148|gb|AAH09661.1| Damage specific DNA binding protein 1 [Mus musculus]
gi|74182145|dbj|BAE34102.1| unnamed protein product [Mus musculus]
gi|74196166|dbj|BAE32993.1| unnamed protein product [Mus musculus]
Length = 1140
Score = 63.2 bits (152), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 81/368 (22%), Positives = 140/368 (38%), Gaps = 77/368 (20%)
Query: 278 YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
Y +GT Y E+ + GRI++F + K++ + KE KG V ++
Sbjct: 828 YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 876
Query: 337 AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
G L+ ++ ++Y W + T + + + + + ILVGD RS+ LL Y
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 935
Query: 395 QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
+P +ARD+ P W
Sbjct: 936 KPMEGNFEEIARDFNPN----------------------WM------------------S 955
Query: 455 HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
+ILD+ + +G + LF+ Q ++ + R L + FHLG+ VN F
Sbjct: 956 AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 1007
Query: 513 RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
C S + G S + + +++G +G L E Y LL +QN + G
Sbjct: 1008 -CHGSLVMQNLGEASTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066
Query: 570 GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
+ +R++ + P+ G IDG L+ FL +S + E+ + G K
Sbjct: 1067 KIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREA 1124
Query: 624 ILDELYDI 631
D+L +
Sbjct: 1125 TADDLIKV 1132
>gi|5353754|gb|AAD42230.1|AF159853_1 damage-specific DNA binding protein 1 [Mus musculus]
Length = 1140
Score = 63.2 bits (152), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 81/368 (22%), Positives = 140/368 (38%), Gaps = 77/368 (20%)
Query: 278 YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
Y +GT Y E+ + GRI++F + K++ + KE KG V ++
Sbjct: 828 YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 876
Query: 337 AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
G L+ ++ ++Y W + T + + + + + ILVGD RS+ LL Y
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 935
Query: 395 QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
+P +ARD+ P W
Sbjct: 936 KPMEGNFEEIARDFNPN----------------------WM------------------S 955
Query: 455 HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
+ILD+ + +G + LF+ Q ++ + R L + FHLG+ VN F
Sbjct: 956 AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 1007
Query: 513 RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
C S + G S + + +++G +G L E Y LL +QN + G
Sbjct: 1008 -CHGSLVMQNLGEASTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066
Query: 570 GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
+ +R++ + P+ G IDG L+ FL +S + E+ + G K
Sbjct: 1067 KIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREA 1124
Query: 624 ILDELYDI 631
D+L +
Sbjct: 1125 TADDLIKV 1132
>gi|221046711|pdb|3EI1|A Chain A, Structure Of Hsddb1-Drddb2 Bound To A 14 Bp 6-4 Photoproduct
Containing Dna-Duplex
gi|221046715|pdb|3EI2|A Chain A, Structure Of Hsddb1-Drddb2 Bound To A 16 Bp Abasic Site
Containing Dna-Duplex
gi|221046719|pdb|3EI3|A Chain A, Structure Of The Hsddb1-Drddb2 Complex
Length = 1158
Score = 63.2 bits (152), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 81/368 (22%), Positives = 140/368 (38%), Gaps = 77/368 (20%)
Query: 278 YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
Y +GT Y E+ + GRI++F + K++ + KE KG V ++
Sbjct: 846 YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 894
Query: 337 AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
G L+ ++ ++Y W + T + + + + + ILVGD RS+ LL Y
Sbjct: 895 NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 953
Query: 395 QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
+P +ARD+ P W
Sbjct: 954 KPMEGNFEEIARDFNPN----------------------WM------------------S 973
Query: 455 HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
+ILD+ + +G + LF+ Q ++ + R L + FHLG+ VN F
Sbjct: 974 AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 1025
Query: 513 RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
C S + G S + + +++G +G L E Y LL +QN + G
Sbjct: 1026 -CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1084
Query: 570 GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
+ +R++ + P+ G IDG L+ FL +S + E+ + G K
Sbjct: 1085 KIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREA 1142
Query: 624 ILDELYDI 631
D+L +
Sbjct: 1143 TADDLIKV 1150
>gi|413081953|ref|NP_741992.2| DNA damage-binding protein 1 [Rattus norvegicus]
gi|293344614|ref|XP_002725831.1| PREDICTED: DNA damage-binding protein 1 [Rattus norvegicus]
gi|293356422|ref|XP_002728912.1| PREDICTED: DNA damage-binding protein 1 [Rattus norvegicus]
gi|149062405|gb|EDM12828.1| damage-specific DNA binding protein 1 [Rattus norvegicus]
Length = 1140
Score = 63.2 bits (152), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 81/368 (22%), Positives = 140/368 (38%), Gaps = 77/368 (20%)
Query: 278 YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
Y +GT Y E+ + GRI++F + K++ + KE KG V ++
Sbjct: 828 YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 876
Query: 337 AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
G L+ ++ ++Y W + T + + + + + ILVGD RS+ LL Y
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 935
Query: 395 QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
+P +ARD+ P W
Sbjct: 936 KPMEGNFEEIARDFNPN----------------------WM------------------S 955
Query: 455 HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
+ILD+ + +G + LF+ Q ++ + R L + FHLG+ VN F
Sbjct: 956 AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 1007
Query: 513 RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
C S + G S + + +++G +G L E Y LL +QN + G
Sbjct: 1008 -CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066
Query: 570 GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
+ +R++ + P+ G IDG L+ FL +S + E+ + G K
Sbjct: 1067 KIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREA 1124
Query: 624 ILDELYDI 631
D+L +
Sbjct: 1125 TADDLIKV 1132
>gi|441604084|ref|XP_004087862.1| PREDICTED: LOW QUALITY PROTEIN: DNA damage-binding protein 1
[Nomascus leucogenys]
Length = 1140
Score = 63.2 bits (152), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 81/368 (22%), Positives = 140/368 (38%), Gaps = 77/368 (20%)
Query: 278 YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
Y +GT Y E+ + GRI++F + K++ + KE KG V ++
Sbjct: 828 YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 876
Query: 337 AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
G L+ ++ ++Y W + T + + + + + ILVGD RS+ LL Y
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 935
Query: 395 QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
+P +ARD+ P W
Sbjct: 936 KPMEGNFEEIARDFNPN----------------------WM------------------S 955
Query: 455 HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
+ILD+ + +G + LF+ Q ++ + R L + FHLG+ VN F
Sbjct: 956 AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 1007
Query: 513 RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
C S + G S + + +++G +G L E Y LL +QN + G
Sbjct: 1008 -CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066
Query: 570 GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
+ +R++ + P+ G IDG L+ FL +S + E+ + G K
Sbjct: 1067 KIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREA 1124
Query: 624 ILDELYDI 631
D+L +
Sbjct: 1125 TADDLIKV 1132
>gi|358440070|pdb|4A0B|A Chain A, Structure Of Hsddb1-Drddb2 Bound To A 16 Bp Cpd-Duplex (
Pyrimidine At D-1 Position) At 3.8 A Resolution (Cpd 4)
gi|358440072|pdb|4A0B|C Chain C, Structure Of Hsddb1-Drddb2 Bound To A 16 Bp Cpd-Duplex (
Pyrimidine At D-1 Position) At 3.8 A Resolution (Cpd 4)
Length = 1159
Score = 63.2 bits (152), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 81/368 (22%), Positives = 140/368 (38%), Gaps = 77/368 (20%)
Query: 278 YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
Y +GT Y E+ + GRI++F + K++ + KE KG V ++
Sbjct: 847 YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 895
Query: 337 AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
G L+ ++ ++Y W + T + + + + + ILVGD RS+ LL Y
Sbjct: 896 NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 954
Query: 395 QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
+P +ARD+ P W
Sbjct: 955 KPMEGNFEEIARDFNPN----------------------WM------------------S 974
Query: 455 HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
+ILD+ + +G + LF+ Q ++ + R L + FHLG+ VN F
Sbjct: 975 AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 1026
Query: 513 RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
C S + G S + + +++G +G L E Y LL +QN + G
Sbjct: 1027 -CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1085
Query: 570 GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
+ +R++ + P+ G IDG L+ FL +S + E+ + G K
Sbjct: 1086 KIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREA 1143
Query: 624 ILDELYDI 631
D+L +
Sbjct: 1144 TADDLIKV 1151
>gi|354504619|ref|XP_003514371.1| PREDICTED: DNA damage-binding protein 1-like [Cricetulus griseus]
gi|344258340|gb|EGW14444.1| DNA damage-binding protein 1 [Cricetulus griseus]
Length = 1140
Score = 63.2 bits (152), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 81/368 (22%), Positives = 140/368 (38%), Gaps = 77/368 (20%)
Query: 278 YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
Y +GT Y E+ + GRI++F + K++ + KE KG V ++
Sbjct: 828 YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 876
Query: 337 AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
G L+ ++ ++Y W + T + + + + + ILVGD RS+ LL Y
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 935
Query: 395 QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
+P +ARD+ P W
Sbjct: 936 KPMEGNFEEIARDFNPN----------------------WM------------------S 955
Query: 455 HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
+ILD+ + +G + LF+ Q ++ + R L + FHLG+ VN F
Sbjct: 956 AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 1007
Query: 513 RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
C S + G S + + +++G +G L E Y LL +QN + G
Sbjct: 1008 -CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066
Query: 570 GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
+ +R++ + P+ G IDG L+ FL +S + E+ + G K
Sbjct: 1067 KIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREA 1124
Query: 624 ILDELYDI 631
D+L +
Sbjct: 1125 TADDLIKV 1132
>gi|119594343|gb|EAW73937.1| damage-specific DNA binding protein 1, 127kDa, isoform CRA_e [Homo
sapiens]
Length = 896
Score = 63.2 bits (152), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 81/368 (22%), Positives = 140/368 (38%), Gaps = 77/368 (20%)
Query: 278 YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
Y +GT Y E+ + GRI++F + K++ + KE KG V ++
Sbjct: 584 YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 632
Query: 337 AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
G L+ ++ ++Y W + T + + + + + ILVGD RS+ LL Y
Sbjct: 633 NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 691
Query: 395 QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
+P +ARD+ P W
Sbjct: 692 KPMEGNFEEIARDFNPN----------------------WM------------------S 711
Query: 455 HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
+ILD+ + +G + LF+ Q ++ + R L + FHLG+ VN F
Sbjct: 712 AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 763
Query: 513 RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
C S + G S + + +++G +G L E Y LL +QN + G
Sbjct: 764 -CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 822
Query: 570 GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
+ +R++ + P+ G IDG L+ FL +S + E+ + G K
Sbjct: 823 KIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREA 880
Query: 624 ILDELYDI 631
D+L +
Sbjct: 881 TADDLIKV 888
>gi|359546285|pdb|4A11|A Chain A, Structure Of The Hsddb1-Hscsa Complex
gi|361132519|pdb|4A0K|C Chain C, Structure Of Ddb1-Ddb2-Cul4a-Rbx1 Bound To A 12 Bp Abasic
Site Containing Dna-Duplex
Length = 1159
Score = 63.2 bits (152), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 81/368 (22%), Positives = 140/368 (38%), Gaps = 77/368 (20%)
Query: 278 YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
Y +GT Y E+ + GRI++F + K++ + KE KG V ++
Sbjct: 847 YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 895
Query: 337 AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
G L+ ++ ++Y W + T + + + + + ILVGD RS+ LL Y
Sbjct: 896 NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 954
Query: 395 QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
+P +ARD+ P W
Sbjct: 955 KPMEGNFEEIARDFNPN----------------------WM------------------S 974
Query: 455 HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
+ILD+ + +G + LF+ Q ++ + R L + FHLG+ VN F
Sbjct: 975 AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 1026
Query: 513 RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
C S + G S + + +++G +G L E Y LL +QN + G
Sbjct: 1027 -CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1085
Query: 570 GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
+ +R++ + P+ G IDG L+ FL +S + E+ + G K
Sbjct: 1086 KIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREA 1143
Query: 624 ILDELYDI 631
D+L +
Sbjct: 1144 TADDLIKV 1151
>gi|149725200|ref|XP_001502072.1| PREDICTED: DNA damage-binding protein 1 [Equus caballus]
Length = 1140
Score = 63.2 bits (152), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 81/368 (22%), Positives = 140/368 (38%), Gaps = 77/368 (20%)
Query: 278 YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
Y +GT Y E+ + GRI++F + K++ + KE KG V ++
Sbjct: 828 YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 876
Query: 337 AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
G L+ ++ ++Y W + T + + + + + ILVGD RS+ LL Y
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 935
Query: 395 QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
+P +ARD+ P W
Sbjct: 936 KPMEGNFEEIARDFNPN----------------------WM------------------S 955
Query: 455 HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
+ILD+ + +G + LF+ Q ++ + R L + FHLG+ VN F
Sbjct: 956 AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 1007
Query: 513 RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
C S + G S + + +++G +G L E Y LL +QN + G
Sbjct: 1008 -CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066
Query: 570 GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
+ +R++ + P+ G IDG L+ FL +S + E+ + G K
Sbjct: 1067 KIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREA 1124
Query: 624 ILDELYDI 631
D+L +
Sbjct: 1125 TADDLIKV 1132
>gi|384941436|gb|AFI34323.1| DNA damage-binding protein 1 [Macaca mulatta]
Length = 1140
Score = 63.2 bits (152), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 81/368 (22%), Positives = 140/368 (38%), Gaps = 77/368 (20%)
Query: 278 YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
Y +GT Y E+ + GRI++F + K++ + KE KG V ++
Sbjct: 828 YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 876
Query: 337 AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
G L+ ++ ++Y W + T + + + + + ILVGD RS+ LL Y
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 935
Query: 395 QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
+P +ARD+ P W
Sbjct: 936 KPMEGNFEEIARDFNPN----------------------WM------------------S 955
Query: 455 HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
+ILD+ + +G + LF+ Q ++ + R L + FHLG+ VN F
Sbjct: 956 AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 1007
Query: 513 RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
C S + G S + + +++G +G L E Y LL +QN + G
Sbjct: 1008 -CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066
Query: 570 GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
+ +R++ + P+ G IDG L+ FL +S + E+ + G K
Sbjct: 1067 KIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREA 1124
Query: 624 ILDELYDI 631
D+L +
Sbjct: 1125 TADDLIKV 1132
>gi|74178494|dbj|BAE32502.1| unnamed protein product [Mus musculus]
Length = 1140
Score = 63.2 bits (152), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 81/368 (22%), Positives = 140/368 (38%), Gaps = 77/368 (20%)
Query: 278 YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
Y +GT Y E+ + GRI++F + K++ + KE KG V ++
Sbjct: 828 YFIVGTAMVYPEEAEPKLGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 876
Query: 337 AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
G L+ ++ ++Y W + T + + + + + ILVGD RS+ LL Y
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 935
Query: 395 QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
+P +ARD+ P W
Sbjct: 936 KPMEGNFEEIARDFNPN----------------------WM------------------S 955
Query: 455 HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
+ILD+ + +G + LF+ Q ++ + R L + FHLG+ VN F
Sbjct: 956 AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 1007
Query: 513 RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
C S + G S + + +++G +G L E Y LL +QN + G
Sbjct: 1008 -CHGSLVMQNLGEASTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066
Query: 570 GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
+ +R++ + P+ G IDG L+ FL +S + E+ + G K
Sbjct: 1067 KIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREA 1124
Query: 624 ILDELYDI 631
D+L +
Sbjct: 1125 TADDLIKV 1132
>gi|395852550|ref|XP_003798801.1| PREDICTED: DNA damage-binding protein 1 [Otolemur garnettii]
Length = 1140
Score = 63.2 bits (152), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 81/368 (22%), Positives = 140/368 (38%), Gaps = 77/368 (20%)
Query: 278 YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
Y +GT Y E+ + GRI++F + K++ + KE KG V ++
Sbjct: 828 YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 876
Query: 337 AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
G L+ ++ ++Y W + T + + + + + ILVGD RS+ LL Y
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 935
Query: 395 QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
+P +ARD+ P W
Sbjct: 936 KPMEGNFEEIARDFNPN----------------------WM------------------S 955
Query: 455 HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
+ILD+ + +G + LF+ Q ++ + R L + FHLG+ VN F
Sbjct: 956 AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 1007
Query: 513 RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
C S + G S + + +++G +G L E Y LL +QN + G
Sbjct: 1008 -CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066
Query: 570 GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
+ +R++ + P+ G IDG L+ FL +S + E+ + G K
Sbjct: 1067 KIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREA 1124
Query: 624 ILDELYDI 631
D+L +
Sbjct: 1125 TADDLIKV 1132
>gi|358440058|pdb|4A08|A Chain A, Structure Of Hsddb1-Drddb2 Bound To A 13 Bp Cpd-Duplex (
Purine At D-1 Position) At 3.0 A Resolution (Cpd 1)
gi|358440062|pdb|4A09|A Chain A, Structure Of Hsddb1-Drddb2 Bound To A 15 Bp Cpd-Duplex
(Purine At D-1 Position) At 3.1 A Resolution (Cpd 2)
Length = 1159
Score = 63.2 bits (152), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 81/368 (22%), Positives = 140/368 (38%), Gaps = 77/368 (20%)
Query: 278 YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
Y +GT Y E+ + GRI++F + K++ + KE KG V ++
Sbjct: 847 YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 895
Query: 337 AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
G L+ ++ ++Y W + T + + + + + ILVGD RS+ LL Y
Sbjct: 896 NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 954
Query: 395 QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
+P +ARD+ P W
Sbjct: 955 KPMEGNFEEIARDFNPN----------------------WM------------------S 974
Query: 455 HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
+ILD+ + +G + LF+ Q ++ + R L + FHLG+ VN F
Sbjct: 975 AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 1026
Query: 513 RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
C S + G S + + +++G +G L E Y LL +QN + G
Sbjct: 1027 -CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1085
Query: 570 GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
+ +R++ + P+ G IDG L+ FL +S + E+ + G K
Sbjct: 1086 KIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREA 1143
Query: 624 ILDELYDI 631
D+L +
Sbjct: 1144 TADDLIKV 1151
>gi|344295432|ref|XP_003419416.1| PREDICTED: DNA damage-binding protein 1 [Loxodonta africana]
Length = 1140
Score = 63.2 bits (152), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 81/368 (22%), Positives = 140/368 (38%), Gaps = 77/368 (20%)
Query: 278 YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
Y +GT Y E+ + GRI++F + K++ + KE KG V ++
Sbjct: 828 YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 876
Query: 337 AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
G L+ ++ ++Y W + T + + + + + ILVGD RS+ LL Y
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 935
Query: 395 QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
+P +ARD+ P W
Sbjct: 936 KPMEGNFEEIARDFNPN----------------------WM------------------S 955
Query: 455 HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
+ILD+ + +G + LF+ Q ++ + R L + FHLG+ VN F
Sbjct: 956 AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 1007
Query: 513 RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
C S + G S + + +++G +G L E Y LL +QN + G
Sbjct: 1008 -CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066
Query: 570 GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
+ +R++ + P+ G IDG L+ FL +S + E+ + G K
Sbjct: 1067 KIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREA 1124
Query: 624 ILDELYDI 631
D+L +
Sbjct: 1125 TADDLIKV 1132
>gi|358440066|pdb|4A0A|A Chain A, Structure Of Hsddb1-Drddb2 Bound To A 16 Bp Cpd-Duplex (
Pyrimidine At D-1 Position) At 3.6 A Resolution (Cpd 3)
Length = 1159
Score = 63.2 bits (152), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 81/368 (22%), Positives = 140/368 (38%), Gaps = 77/368 (20%)
Query: 278 YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
Y +GT Y E+ + GRI++F + K++ + KE KG V ++
Sbjct: 847 YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 895
Query: 337 AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
G L+ ++ ++Y W + T + + + + + ILVGD RS+ LL Y
Sbjct: 896 NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 954
Query: 395 QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
+P +ARD+ P W
Sbjct: 955 KPMEGNFEEIARDFNPN----------------------WM------------------S 974
Query: 455 HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
+ILD+ + +G + LF+ Q ++ + R L + FHLG+ VN F
Sbjct: 975 AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 1026
Query: 513 RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
C S + G S + + +++G +G L E Y LL +QN + G
Sbjct: 1027 -CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1085
Query: 570 GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
+ +R++ + P+ G IDG L+ FL +S + E+ + G K
Sbjct: 1086 KIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREA 1143
Query: 624 ILDELYDI 631
D+L +
Sbjct: 1144 TADDLIKV 1151
>gi|311247551|ref|XP_003122699.1| PREDICTED: DNA damage-binding protein 1-like isoform 1 [Sus scrofa]
Length = 1140
Score = 63.2 bits (152), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 81/368 (22%), Positives = 140/368 (38%), Gaps = 77/368 (20%)
Query: 278 YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
Y +GT Y E+ + GRI++F + K++ + KE KG V ++
Sbjct: 828 YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 876
Query: 337 AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
G L+ ++ ++Y W + T + + + + + ILVGD RS+ LL Y
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 935
Query: 395 QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
+P +ARD+ P W
Sbjct: 936 KPMEGNFEEIARDFNPN----------------------WM------------------S 955
Query: 455 HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
+ILD+ + +G + LF+ Q ++ + R L + FHLG+ VN F
Sbjct: 956 AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 1007
Query: 513 RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
C S + G S + + +++G +G L E Y LL +QN + G
Sbjct: 1008 -CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066
Query: 570 GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
+ +R++ + P+ G IDG L+ FL +S + E+ + G K
Sbjct: 1067 KIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREA 1124
Query: 624 ILDELYDI 631
D+L +
Sbjct: 1125 TADDLIKV 1132
>gi|122692537|ref|NP_001073731.1| DNA damage-binding protein 1 [Bos taurus]
gi|426251842|ref|XP_004019630.1| PREDICTED: DNA damage-binding protein 1 [Ovis aries]
gi|134034086|sp|A1A4K3.1|DDB1_BOVIN RecName: Full=DNA damage-binding protein 1; AltName:
Full=Damage-specific DNA-binding protein 1
gi|119223918|gb|AAI26630.1| Damage-specific DNA binding protein 1, 127kDa [Bos taurus]
gi|296471644|tpg|DAA13759.1| TPA: DNA damage-binding protein 1 [Bos taurus]
Length = 1140
Score = 63.2 bits (152), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 81/368 (22%), Positives = 140/368 (38%), Gaps = 77/368 (20%)
Query: 278 YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
Y +GT Y E+ + GRI++F + K++ + KE KG V ++
Sbjct: 828 YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 876
Query: 337 AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
G L+ ++ ++Y W + T + + + + + ILVGD RS+ LL Y
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 935
Query: 395 QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
+P +ARD+ P W
Sbjct: 936 KPMEGNFEEIARDFNPN----------------------WM------------------S 955
Query: 455 HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
+ILD+ + +G + LF+ Q ++ + R L + FHLG+ VN F
Sbjct: 956 AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 1007
Query: 513 RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
C S + G S + + +++G +G L E Y LL +QN + G
Sbjct: 1008 -CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066
Query: 570 GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
+ +R++ + P+ G IDG L+ FL +S + E+ + G K
Sbjct: 1067 KIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREA 1124
Query: 624 ILDELYDI 631
D+L +
Sbjct: 1125 TADDLIKV 1132
>gi|355683071|gb|AER97036.1| damage-specific DNA binding protein 1, 127kDa [Mustela putorius furo]
Length = 1122
Score = 62.8 bits (151), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 76/337 (22%), Positives = 130/337 (38%), Gaps = 71/337 (21%)
Query: 278 YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
Y +GT Y E+ + GRI++F + K++ + KE KG V ++
Sbjct: 828 YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 876
Query: 337 AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
G L+ ++ ++Y W + T + + + + + ILVGD RS+ LL Y
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 935
Query: 395 QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
+P +ARD+ P W
Sbjct: 936 KPMEGNFEEIARDFNPN----------------------WM------------------S 955
Query: 455 HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
+ILD+ + +G + LF+ Q ++ + R L + FHLG+ VN F
Sbjct: 956 AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 1007
Query: 513 RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
C S + G S + + +++G +G L E Y LL +QN + G
Sbjct: 1008 -CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066
Query: 570 GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLS 606
+ +R++ + P+ G IDG L+ FL +S
Sbjct: 1067 KIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDIS 1101
>gi|58383228|ref|XP_312466.2| AGAP002472-PA [Anopheles gambiae str. PEST]
gi|55242305|gb|EAA08181.2| AGAP002472-PA [Anopheles gambiae str. PEST]
Length = 1138
Score = 62.8 bits (151), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 92/454 (20%), Positives = 170/454 (37%), Gaps = 84/454 (18%)
Query: 171 VRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPP--L 228
+R VPL +P +AY ++T+ ++T ++ + +D +T R S +
Sbjct: 713 IRTVPLGESPRRIAYQEASQTFGVIT---------FRMDVQDSSGLTPARQSASTQTNNI 763
Query: 229 VSQFHVSLFSPFS-----WEEIPQTNFPL---HEWEHVLCLKNVSMEYEGTLSGLR---- 276
+ L P + +E+ N + + +E + + + EY +L +
Sbjct: 764 TQSSGMGLLKPGASNTEFGQEVEVHNLLIIDQNTFEVLHAHQFMQTEYALSLMSAKLGND 823
Query: 277 --GYIALGTN-YNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAI 333
Y +GT N E GRI+++ N++KM+ KE KG ++
Sbjct: 824 PNTYFIVGTGLVNPEEPEPKTGRIIIY-----------RYADNELKMVSDKEVKGACYSL 872
Query: 334 CHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVK-NLILVGDYARSIALL 392
G ++ + + +++ D+ + +A K + ILVGD RSI LL
Sbjct: 873 VEFNGRVLACINSTVRLYEWTDDKDLRLECSHFNNVLALYCKTKGDFILVGDLMRSITLL 932
Query: 393 RYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 452
+Y+ + +ARDY+P W
Sbjct: 933 QYKQMEGSFEEIARDYQPN----------------------WM----------------- 953
Query: 453 SKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 512
+ILD+ + +G +D N+ + + A ++ + FHLG VN F
Sbjct: 954 -TAVEILDDDAFLG---ADNSNNLFVCLKDSAATTDEERQQMPEVAQFHLGDMVNVFRHG 1009
Query: 513 RCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 572
+IS+ + + + ++ GA+G + Y L LQ + G ++
Sbjct: 1010 SLVMQNISERSTPTTGCV-LFGTVSGAIGLVTQIQSDFYEFLRKLQENLTNTIKSVGKID 1068
Query: 573 PRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLS 606
+R++ + G IDG LV FL LS
Sbjct: 1069 HSYWRSFHTETKM--ERCEGFIDGDLVESFLDLS 1100
>gi|410912407|ref|XP_003969681.1| PREDICTED: DNA damage-binding protein 1-like [Takifugu rubripes]
Length = 1140
Score = 62.8 bits (151), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 82/368 (22%), Positives = 140/368 (38%), Gaps = 77/368 (20%)
Query: 278 YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
Y +GT Y E+ + GRI++F T K++ + KE KG V ++
Sbjct: 828 YFVVGTAMVYPEEAEPKQGRIIVFH-----------YTDGKLQTVAEKEVKGAVYSMVEF 876
Query: 337 AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
G L+ ++ ++Y W + T + + + + + + ILVGD RS+ LL Y
Sbjct: 877 NGKLLASINSTVRLYEWTAEKELRTECSHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 935
Query: 395 QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
+P +ARD+ P W
Sbjct: 936 KPMEGNFEEIARDFNPN----------------------WM------------------S 955
Query: 455 HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
+ILD+ + +G + LF+ Q ++ + R L + FHLG+ VN F
Sbjct: 956 AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEDRQHLQEVGVFHLGEFVNVF--- 1007
Query: 513 RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
C S + G S + + ++ G +G L E + LL LQN + G
Sbjct: 1008 -CHGSLVLQNLGETSTPTQGSVLFGTVTGMIGLVTSLSEGWHSLLLDLQNRLNKVIKSVG 1066
Query: 570 GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
+ +R++ + ++G IDG L+ FL L + E+ + G K
Sbjct: 1067 KIEHSFWRSFHTE--RKTEQAKGFIDGDLIESFLDLGRAKMQEVVSTLQIDDGSGMKREA 1124
Query: 624 ILDELYDI 631
+DE+ I
Sbjct: 1125 TVDEVIKI 1132
>gi|74215029|dbj|BAE33503.1| unnamed protein product [Mus musculus]
Length = 1140
Score = 62.8 bits (151), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 81/368 (22%), Positives = 140/368 (38%), Gaps = 77/368 (20%)
Query: 278 YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
Y +GT Y E+ + GRI++F + K++ + KE KG V ++
Sbjct: 828 YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 876
Query: 337 AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
G L+ ++ ++Y W + T + + + + + ILVGD RS+ LL Y
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 935
Query: 395 QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
+P +ARD+ P W
Sbjct: 936 KPMEGNFEEIARDFNPN----------------------WM------------------S 955
Query: 455 HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
+ILD+ + +G + LF+ Q ++ + R L + FHLG+ VN F
Sbjct: 956 AVEILDDDNFLG-----AENAFNLFVCQRDSAATTDEERQHLQEVGLFHLGEFVNVF--- 1007
Query: 513 RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
C S + G S + + +++G +G L E Y LL +QN + G
Sbjct: 1008 -CHGSLVMQNLGEASTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066
Query: 570 GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
+ +R++ + P+ G IDG L+ FL +S + E+ + G K
Sbjct: 1067 KIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREA 1124
Query: 624 ILDELYDI 631
D+L +
Sbjct: 1125 TADDLIKV 1132
>gi|223647932|gb|ACN10724.1| DNA damage-binding protein 1 [Salmo salar]
Length = 1139
Score = 62.8 bits (151), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 121/579 (20%), Positives = 216/579 (37%), Gaps = 92/579 (15%)
Query: 75 NEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLA 134
+E+ + G + + +R F +++ VF C P ++ +S +L + + V+ +
Sbjct: 623 SERKKVTLGTQPTVLRTFRSLS-TSNVFACSDRPTVIY-SSNHKLVFSNVNLK-EVNYMC 679
Query: 135 PFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCI 194
P ++ P N S L I + +R VPL +P + Y ++ + +
Sbjct: 680 PLNSEGYPDSLALAN-NSTLTIGTIDEI----QKLHIRTVPLYESPRRICYQEVSQCFGV 734
Query: 195 VTSTAE-----PSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSL---FSPFSWEEIP 246
++S E +T + + + L + S+ P S S S +
Sbjct: 735 LSSRVEMQDASGTTAAVRPSASTQALSSSVSSSKLFPSSTSPHETSFGEEVEVHSLLVVD 794
Query: 247 QTNFP-LHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCR-GRILLFDIIE 304
Q F LH + + +SM L Y +GT Y E+ + GRI++F
Sbjct: 795 QHTFEVLHAHQFLQSEYALSMVSCRLGRDLSVYFIVGTAMVYPEEAEPKQGRIIVFH--- 851
Query: 305 VVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQ--KIYIWQLKDNDLTGIA 362
T K++ + KE KG V ++ G L+ ++ ++Y W + T
Sbjct: 852 --------YTDGKLQTVAEKEVKGAVYSMMEFNGKLLASINSTVRLYEWTAEKELRTECN 903
Query: 363 FIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGN 422
+ + + + + ILVGD RS+ LL Y+P +ARD+ P
Sbjct: 904 HYNN-IMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIARDFNPN----------- 951
Query: 423 PSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQ 482
W +ILD+ + +G + LF+ Q
Sbjct: 952 -----------WM------------------SAVEILDDDNFLG-----AENAFNLFVCQ 977
Query: 483 PEARESNGGHR--LIKKTDFHLGQHVNTFF--KIRCKPSSISDAPGARSRFLTWYASLDG 538
++ + R L + FHLG+ VN F + + S P S + +++G
Sbjct: 978 KDSAATTDEERQHLQEVGVFHLGEFVNVFSHGSLVLQNLGESSTPTQGS---VLFGTVNG 1034
Query: 539 ALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSL 598
+G L E Y LL LQN + G + +R++ + + G IDG L
Sbjct: 1035 MIGLVTSLSEGWYSLLLDLQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEQATGFIDGDL 1092
Query: 599 VWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 631
+ FL L + E+ + G K +DE+ I
Sbjct: 1093 IESFLDLGRAKMQEVVSTLQIDDGSGMKREATVDEVIKI 1131
>gi|167384458|ref|XP_001736962.1| hypothetical protein [Entamoeba dispar SAW760]
gi|165900458|gb|EDR26769.1| hypothetical protein EDI_171140 [Entamoeba dispar SAW760]
Length = 836
Score = 62.8 bits (151), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 84/371 (22%), Positives = 150/371 (40%), Gaps = 86/371 (23%)
Query: 275 LRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYA-KEQKGPVTAI 333
L+ Y+ +G N +ED +G+ +F+I +N+I++I + K V A+
Sbjct: 521 LKNYLVVGVNKQTTEDNPVKGKTYIFNI------------ENQIQLINKIGDGKKSVHAV 568
Query: 334 CHVAGFLVTAVGQKI-YIWQLKDNDLTGIAFIDTEVYIASM------VSVKN------LI 380
+ GFL A G ++ I ++ + F D + I S+ V K LI
Sbjct: 569 NEIGGFLAVASGNELELIERVDETRWIKKCFSDISILINSIEYLPLKVMEKGNEKECYLI 628
Query: 381 LVGDYARSIALLRYQP-EYRTLSLVARDYKPTQPNSKGYYAGNPSRGI--IDGSLVWKFL 437
L+ D+ RS+ LL ++P +Y + L G +R I ID + +
Sbjct: 629 LLSDFYRSVVLLLFKPYDYTVIPL-----------------GKDARNIHCIDSTFI---- 667
Query: 438 QLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKK 497
I K D FS + F D ++N+ L Y A E +
Sbjct: 668 ---------ITK----------DYFSVLEF---DSEQNLSLLNYSSAATEQLSIFEI--D 703
Query: 498 TDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLML 557
F+LG ++ F + + + G ++ Y +++G++G+ + EK Y+ L +
Sbjct: 704 ATFNLGMNLLKF-------TRLWNGKG----YIYMYVTVEGSVGYISVVEEKIYQVLRQI 752
Query: 558 QNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI 617
M H G N +R KG G G +DG ++ +F L+ ++ +C +
Sbjct: 753 NIKMNREPWHFAGTNAEEYRFEKGYGMGFGTRKHVFLDGDMLKQFRLLNEEQQKRVCLR- 811
Query: 618 GSKHNDILDEL 628
+ ND+ L
Sbjct: 812 NTSINDVFKLL 822
>gi|301616502|ref|XP_002937687.1| PREDICTED: DNA damage-binding protein 1-like [Xenopus (Silurana)
tropicalis]
Length = 1140
Score = 62.8 bits (151), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 126/598 (21%), Positives = 225/598 (37%), Gaps = 119/598 (19%)
Query: 66 FVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMT 125
+SDR K + G + + +R F +++ VF C P ++ +S +L +
Sbjct: 622 LLSDRKK-------VTLGTQPTVLRTFRSLS-TTNVFACSDRPTVIY-SSNHKLVFSNVN 672
Query: 126 IDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAY 185
+ V+ + P ++ P N S L I + +R VPL +P + Y
Sbjct: 673 LK-EVNYMCPLNSEGYPDSLALAN-NSTLTIGTIDEI----QKLHIRTVPLYESPRKICY 726
Query: 186 HLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDP-RDSRFIPPLVSQFHVS-LFSPFS-- 241
++ + +++S E +D + P R S L S S LFS +
Sbjct: 727 QEVSQCFGVLSSRIEV---------QDASGGSSPLRPSASTQALSSSVSCSKLFSGSTSP 777
Query: 242 -----WEEIPQTNFPL---HEWEHVLCLKNVSMEYEGTLSGLR------GYIALGTNYNY 287
EE+ N + H +E + + + EY +L + Y +GT Y
Sbjct: 778 HETSFGEEVEVHNLLIIDQHTFEVLHTHQFLQNEYTLSLVSCKLGKDPTTYFVVGTAMVY 837
Query: 288 SEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQ 346
++ + GRI++F K++ + KE KG V ++ G L+ ++
Sbjct: 838 PDEAEPKQGRIVVFQ-----------YNDGKLQTVAEKEVKGAVYSMVEFNGKLLASINS 886
Query: 347 --KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLV 404
++Y W + T + + + + + ILVGD RS+ LL Y+P +
Sbjct: 887 TVRLYEWTAEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEI 945
Query: 405 ARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSS 464
ARD+ P W +ILD+ +
Sbjct: 946 ARDFNPN----------------------WM------------------SAVEILDDDNF 965
Query: 465 MGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSSISDA 522
+G + LF+ Q ++ + R L + FHLG+ VN F C S +
Sbjct: 966 LG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQN 1016
Query: 523 PGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTY 579
G S + + +++G +G L E Y LL +QN + G + +R++
Sbjct: 1017 LGETSPPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDVQNRLNKVIKSVGKIEHSFWRSF 1076
Query: 580 KGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 631
+ P+ G IDG L+ FL +S + E+ + G K +D+L +
Sbjct: 1077 HTE--RKTEPATGFIDGDLIESFLDISRPKMQEVIANLQIDDGSGMKRETTVDDLIKV 1132
>gi|259155222|ref|NP_001158852.1| DNA damage-binding protein 1 [Salmo salar]
gi|223647700|gb|ACN10608.1| DNA damage-binding protein 1 [Salmo salar]
Length = 1139
Score = 62.8 bits (151), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 120/584 (20%), Positives = 220/584 (37%), Gaps = 102/584 (17%)
Query: 75 NEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLA 134
+E+ + G + + +R F +++ VF C P ++ +S +L + + V+ +
Sbjct: 623 SERKKVTLGTQPTVLRTFRSLS-TSNVFACSDRPTVIY-SSNHKLVFSNVNLK-EVNYMC 679
Query: 135 PFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCI 194
P ++ P N S L I + +R VPL +P + Y ++ + +
Sbjct: 680 PLNSEGYPDSLALAN-NSTLTIGTIDEI----QKLHIRTVPLYESPRRICYQEVSQCFGV 734
Query: 195 VTSTAE-----PSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTN 249
++S E +T + + + L + S+ P S S EE+ +
Sbjct: 735 LSSRVEMQDASGTTAAVRPSASTQALSSSVSSSKLFPSSTSPHETSF-----GEEVEVHS 789
Query: 250 FPL---HEWEHVLCLKNVSMEYEGTLSGLR------GYIALGTNYNYSEDVTCR-GRILL 299
+ H +E + + + EY ++ R Y +GT Y E+ + GRI++
Sbjct: 790 LLVVDQHTFEVLHAHQFLQSEYALSMVSCRLGRDPAVYFIVGTAMVYPEEAEPKQGRIIV 849
Query: 300 FDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQ--KIYIWQLKDND 357
F T K++ + KE KG V ++ G L+ ++ ++Y W +
Sbjct: 850 FH-----------YTDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRLYEWTAEKEL 898
Query: 358 LTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKG 417
T + + + + + ILVGD RS+ LL Y+P +ARD+ P
Sbjct: 899 RTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIARDFNPN------ 951
Query: 418 YYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVV 477
W +ILD+ + +G +
Sbjct: 952 ----------------WM------------------SAVEILDDDNFLG-----AENAFN 972
Query: 478 LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFF--KIRCKPSSISDAPGARSRFLTWY 533
LF+ Q ++ + R L + FHLG+ VN F + + S P S +
Sbjct: 973 LFVCQKDSAATTDEERQHLQEVGVFHLGEFVNVFSHGSLVLQNLGESSTPTQGS---VLF 1029
Query: 534 ASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGI 593
+++G +G L E Y LL LQN + G + +R++ + + G
Sbjct: 1030 GTVNGMIGLVTSLSEGWYSLLLDLQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEQATGF 1087
Query: 594 IDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 631
IDG L+ FL L + E+ + G K +DE+ I
Sbjct: 1088 IDGDLIESFLDLGRAKMQEVVSTLQIDDGSGMKREATVDEVIKI 1131
>gi|327278830|ref|XP_003224163.1| PREDICTED: DNA damage-binding protein 1-like [Anolis carolinensis]
Length = 1140
Score = 62.4 bits (150), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 84/369 (22%), Positives = 142/369 (38%), Gaps = 79/369 (21%)
Query: 278 YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
Y +GT Y ++ + GRI++F + K++ + KE KG V ++
Sbjct: 828 YFIVGTAMVYPDEAEPKQGRIVVFH-----------YSDGKLQTVAEKEVKGAVYSMVEF 876
Query: 337 AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVK-NLILVGDYARSIALLR 393
G L+ ++ ++Y W + T + +A V K + ILVGD RS+ LL
Sbjct: 877 NGKLLASINSTVRLYEWTAEKELRTECNHYNN--IMALYVKTKGDFILVGDLMRSVLLLA 934
Query: 394 YQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGS 453
Y+P +ARD+ P W
Sbjct: 935 YKPMEGNFEEIARDFNPN----------------------WM------------------ 954
Query: 454 KHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFK 511
+ILD+ + +G + LF+ Q ++ + R L + FHLG+ VN F
Sbjct: 955 SAVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEFGLFHLGEFVNVF-- 1007
Query: 512 IRCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHT 568
C S + G S + + +++G +G L E Y LL +QN +
Sbjct: 1008 --CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDVQNRLNKVIKSV 1065
Query: 569 GGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHN 622
G + +R++ + P+ G IDG L+ FL +S + E+ + G K
Sbjct: 1066 GKIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQIDDGSGMKRE 1123
Query: 623 DILDELYDI 631
+D+L I
Sbjct: 1124 ATVDDLIKI 1132
>gi|328770638|gb|EGF80679.1| hypothetical protein BATDEDRAFT_11194 [Batrachochytrium dendrobatidis
JAM81]
Length = 1098
Score = 62.4 bits (150), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 118/573 (20%), Positives = 215/573 (37%), Gaps = 112/573 (19%)
Query: 51 PKGALKLRFKKLKVLFVS-----------DRSKRANEQPGLPRGVRISQMRYFSNIAGYQ 99
P+ L + F L L VS +S + ++ + + +R F + G
Sbjct: 582 PRSILLVEFDNLPYLLVSLGDGQLFNFRIGKSLKLADRKKITLATQPITLRTFQS-HGRT 640
Query: 100 GVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVL 159
VF P +F+ S G+L + + +S ++PF N + G L F + L+I +
Sbjct: 641 HVFAASDRPTVIFVKS-GQLLYSNVNVR-EISHVSPF-NSHMAEGALAFASDGALKIGTI 697
Query: 160 PTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVT--STAEPSTDYYKFNGEDKELVT 217
T ++ + L TP +AYH + T+ ++T S P+ D +
Sbjct: 698 ETV----QKLHIKTIKLGETPRRIAYHDVSHTFGVLTVFSRNLPNGDLADISC------- 746
Query: 218 DPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRG 277
+ L +E + + L +E L + + TL
Sbjct: 747 ----------------LRLLDGQGYEVLD--SIELQPFEIASSLITIRFTDDDTL----- 783
Query: 278 YIALGTNYNY-SEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
Y +GT + + ED RGRIL+F + ++ +++++ + +G + V
Sbjct: 784 YYTVGTGFAFPHEDEPVRGRILVFKVNDM----------RLLQLVHEYDIRGSAYSFVSV 833
Query: 337 AGFLVTAVGQKIYI--WQLKDNDLTGIAFIDTEVYIASMVSVK-NLILVGDYARSIALLR 393
G LV V + + W D L + ++ +A ++V+ + ILV D +SI LL+
Sbjct: 834 HGRLVAGVNSNVMVLRWN-SDTSLLELQSMNHGHVLALSLAVRGDFILVADLIKSITLLQ 892
Query: 394 YQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGS 453
+ +L +A D +S A
Sbjct: 893 FDLATDSLKELAYD-----ADSNWMTAA-------------------------------- 915
Query: 454 KHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIR 513
+++D+ + +G +D N+ Q + RL K FH G+ +N F K
Sbjct: 916 ---ELIDDDTFLG---ADSSMNIFALSKQGDQVSEEERQRLRPKGWFHTGELINRFRKGS 969
Query: 514 CKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLL-MLQNVMVTHTSHTGGLN 572
+ + + Y ++ GA+G +P ++L LQ + + GGL
Sbjct: 970 LTLHATDETLALPAIPEILYCTVHGAIGVVARIPSDETAKILSTLQEALKSVVQGVGGLI 1029
Query: 573 PRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQL 605
+R Y+ + S GIIDG L+ FL+L
Sbjct: 1030 HSDWRRYRTE--RRSIKSAGIIDGDLIESFLEL 1060
>gi|91087281|ref|XP_975549.1| PREDICTED: similar to conserved hypothetical protein [Tribolium
castaneum]
gi|270010588|gb|EFA07036.1| hypothetical protein TcasGA2_TC010010 [Tribolium castaneum]
Length = 1149
Score = 62.4 bits (150), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 80/360 (22%), Positives = 140/360 (38%), Gaps = 68/360 (18%)
Query: 278 YIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA 337
YI N E +GRIL+F NK+ + KE KG ++
Sbjct: 838 YIVGTATVNPEESEPKQGRILIFQ-----------WNDNKLTQVSEKEIKGACYSLAEFN 886
Query: 338 GFLVTAVGQ--KIYIWQL-KDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
G L+ ++ +++ W + K+ L F + + + + IL+GD RS+ LL+Y
Sbjct: 887 GKLLASINSTVRLFEWTVEKELRLECSHF--NNILTLFLKTKGDFILLGDLMRSMTLLQY 944
Query: 395 QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
+ + +ARDY P + I+D + FL + +C+K +
Sbjct: 945 KTMEGSFEEIARDYNPNWMTAVE---------ILDDDI---FLGAENSFNIFVCQKDSAA 992
Query: 455 HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC 514
D +E S M H + + FH+G +N F
Sbjct: 993 TTD--EERSQM--------------------------HEVGR---FHVGDMINVFRHGSL 1021
Query: 515 KPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPR 574
++ + + + + ++ GA+G + + Y LL LQN + T G ++
Sbjct: 1022 VMQNLGETSTPTTGCV-LFGTVSGAIGLVTQITQDFYDFLLELQNKLSTVIKSVGKIDHS 1080
Query: 575 AFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLS------LGERLEICKKIGSKHNDILDEL 628
+R + PS G IDG L+ FL LS + + L+I + G K + +D+L
Sbjct: 1081 QWRAFNTD--IKTEPSEGFIDGDLIESFLDLSHDKMKEVADGLQITGEGGMKQDCTVDDL 1138
>gi|452824086|gb|EME31091.1| DNA damage-binding protein 1 isoform 2 [Galdieria sulphuraria]
Length = 1150
Score = 62.4 bits (150), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 99/474 (20%), Positives = 185/474 (39%), Gaps = 100/474 (21%)
Query: 171 VRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRD-SRFIPPLV 229
+R +PL P +A HL+T V +T K++VT D + +
Sbjct: 732 IRTIPLGEQPRRIA-HLDTHHVFAVLTT--------------KQVVTISEDGNEALSETT 776
Query: 230 SQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSE 289
+ +V L E + ++ L ++E + V+ + + Y +GT Y+Y++
Sbjct: 777 EEGYVRLIDDTMMEIVH--SYKLEQFETPCSVITVNFGDDAAAKDNQDYFVVGTAYSYAD 834
Query: 290 DVT-CRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQ-- 346
+ RGR+L+F + E ++ ++ + KG + ++ G ++ +V
Sbjct: 835 EPEPSRGRMLVFAVRE-----------QRLTLVAERTFKGALYSMDAFNGKILASVNSML 883
Query: 347 KIYIWQLKDN---DLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSL 403
K+ W ++ LT ++I + + + IL+GD RS++LL Y+P T+
Sbjct: 884 KLVRWSETESGARTLTEECTYHGSIFILQIKCLGDFILIGDLVRSVSLLAYKPMNGTIED 943
Query: 404 VARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFS 463
VARD P+ W +++ E L+ LD +
Sbjct: 944 VARDIDPS----------------------W----ITVIEMLD------------LDYYI 965
Query: 464 SMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF----FKIRCKPSSI 519
S ++ N+ +A RL K ++HLG+ VN ++ S I
Sbjct: 966 S-----AENCFNLFTLKRNSDASTEEERSRLEKVGEYHLGELVNRIRHGRLVLQIPESGI 1020
Query: 520 SD------------APGARSRFLTWY----ASLDGALGFFLPLPEKNYRRLLMLQNVMVT 563
S + F+ Y + +GALG + EK ++ L LQ +
Sbjct: 1021 SILKSLLYGMYICFDDNLKELFMHKYRFNLGTANGALGVIASIDEKTFQFLHSLQTALNE 1080
Query: 564 HTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI 617
GG+ +R + + S+ +DG L+ +FL LS + + KK+
Sbjct: 1081 VIKGVGGIQHEDWRRFTSERRIG--DSKNFLDGDLIERFLDLSRDKMELVAKKV 1132
>gi|357623954|gb|EHJ74904.1| putative DNA repair protein xp-e [Danaus plexippus]
Length = 1128
Score = 62.4 bits (150), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 104/448 (23%), Positives = 173/448 (38%), Gaps = 77/448 (17%)
Query: 171 VRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVS 230
+R VPL TP +AY ++T+ ++T D ++ G LV + +
Sbjct: 707 IRTVPLGETPRRIAYQEASQTFGVITM----RVDKVEWTGGCGSLVRPSASTAAASASAA 762
Query: 231 QFHVSLF-SPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLR------GYIALGT 283
+P E H +E + + ++ E+ +L + Y A+GT
Sbjct: 763 APPSKHAPAPLDLELHNLLILDHHTFEVLHAHQLLANEFAMSLVSCKLADDPNHYYAVGT 822
Query: 284 N-YNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVT 342
N E +GRILLF E K+ + KE KG + G L+
Sbjct: 823 AILNPEESEPKQGRILLFHWCE-----------GKLTQVAEKEIKGGCYTLVEFNGKLLA 871
Query: 343 AVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVK-NLILVGDYARSIALLRYQPEYRTL 401
++ + +++ + +A + VK + ILVGD RS++LL+Y+ +
Sbjct: 872 SINSTVRLFEWTSEKELRLECSHFNNIVALYLKVKGDFILVGDLMRSMSLLQYKQMEGSF 931
Query: 402 SLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDE 461
+ARDY P + I+D FL L +C+K + D +E
Sbjct: 932 EEIARDYSPNWMTAV---------EILDDD---TFLGAENSFNLFVCQKDSAATTD--EE 977
Query: 462 FSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISD 521
MG+M FH+G VN + + ++D
Sbjct: 978 RQQMGYM-----------------------------GQFHVGDMVNVMRR-GALVAQLAD 1007
Query: 522 --APGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAF-RT 578
AP AR L A++ GA+ + L ++ + L L+ + THT + G P +F R+
Sbjct: 1008 TAAPVARPVLL---ATVSGAICLVVQLSQELFDFLHQLEERL-THTIKSVGKIPHSFWRS 1063
Query: 579 YKGKGYYAGNPSRGIIDGSLVWKFLQLS 606
+ P+ G IDG L+ FL LS
Sbjct: 1064 FNTD--IKTEPAEGFIDGDLIESFLDLS 1089
>gi|74208347|dbj|BAE26370.1| unnamed protein product [Mus musculus]
Length = 599
Score = 62.4 bits (150), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 81/368 (22%), Positives = 139/368 (37%), Gaps = 77/368 (20%)
Query: 278 YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
Y +GT Y E+ + GRI +F + K++ + KE KG V ++
Sbjct: 287 YFIVGTAMVYPEEAEPKQGRIAVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 335
Query: 337 AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
G L+ ++ ++Y W + T + + + + + ILVGD RS+ LL Y
Sbjct: 336 NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 394
Query: 395 QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
+P +ARD+ P W
Sbjct: 395 KPMEGNFEEIARDFNPN----------------------WM------------------S 414
Query: 455 HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
+ILD+ + +G + LF+ Q ++ + R L + FHLG+ VN F
Sbjct: 415 AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 466
Query: 513 RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
C S + G S + + +++G +G L E Y LL +QN + G
Sbjct: 467 -CHGSLVMQNLGEASTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 525
Query: 570 GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
+ +R++ + P+ G IDG L+ FL +S + E+ + G K
Sbjct: 526 KIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREA 583
Query: 624 ILDELYDI 631
D+L +
Sbjct: 584 TADDLIKV 591
>gi|197097564|ref|NP_001126613.1| DNA damage-binding protein 1 [Pongo abelii]
gi|75041202|sp|Q5R649.1|DDB1_PONAB RecName: Full=DNA damage-binding protein 1; AltName:
Full=Damage-specific DNA-binding protein 1
gi|55732122|emb|CAH92767.1| hypothetical protein [Pongo abelii]
Length = 1140
Score = 62.4 bits (150), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 81/368 (22%), Positives = 139/368 (37%), Gaps = 77/368 (20%)
Query: 278 YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
Y +GT Y E+ + GRI++F + K++ + KE KG V +
Sbjct: 828 YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYPMVEF 876
Query: 337 AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
G L+ ++ ++Y W + T + + + + + ILVGD RS+ LL Y
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 935
Query: 395 QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
+P +ARD+ P W
Sbjct: 936 KPMEGNFEEIARDFNPN----------------------WM------------------S 955
Query: 455 HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
+ILD+ + +G + LF+ Q ++ + R L + FHLG+ VN F
Sbjct: 956 AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 1007
Query: 513 RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
C S + G S + + +++G +G L E Y LL +QN + G
Sbjct: 1008 -CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066
Query: 570 GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
+ +R++ + P+ G IDG L+ FL +S + E+ + G K
Sbjct: 1067 KIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREA 1124
Query: 624 ILDELYDI 631
D+L +
Sbjct: 1125 TADDLIKV 1132
>gi|449710759|gb|EMD49776.1| cleavage and polyadenylation specificity factor subunit, putative
[Entamoeba histolytica KU27]
Length = 836
Score = 62.0 bits (149), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 83/371 (22%), Positives = 150/371 (40%), Gaps = 86/371 (23%)
Query: 275 LRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYA-KEQKGPVTAI 333
L+ Y+ +G N +ED +G+ +F+I +N+I++I + K V A+
Sbjct: 521 LKNYLVVGVNKQTTEDNPVKGKTYIFNI------------ENQIQLINKIGDGKKSVHAV 568
Query: 334 CHVAGFLVTAVGQKI-YIWQLKDNDLTGIAFIDTEVYIASM----VSVKN--------LI 380
+ GFL A G ++ I ++ + F D + I S+ + V LI
Sbjct: 569 NEIGGFLAVASGNELELIERVDETRWIKKCFSDISILINSIEYLPLKVMERGNEKECYLI 628
Query: 381 LVGDYARSIALLRYQP-EYRTLSLVARDYKPTQPNSKGYYAGNPSRGI--IDGSLVWKFL 437
L+ D+ RS+ LL ++P +Y + L G +R I ID + +
Sbjct: 629 LLSDFYRSVVLLLFKPYDYTVIPL-----------------GKDARNIHCIDSTFI---- 667
Query: 438 QLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKK 497
I K D FS + F D ++N+ L Y A E +
Sbjct: 668 ---------ITK----------DYFSVLEF---DSEQNLSLLNYSSAATEQLSIFEI--D 703
Query: 498 TDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLML 557
F+LG ++ F + + + G ++ Y +++G++G+ + EK Y+ L +
Sbjct: 704 ATFNLGMNLLKF-------TRLWNGKG----YIYMYVTVEGSVGYISVVEEKIYQVLRQI 752
Query: 558 QNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI 617
M H G N +R KG G G +DG ++ +F L+ ++ +C +
Sbjct: 753 NIKMNREPWHFAGTNAEEYRFEKGYGMGFGTRKHVFLDGDMLKQFRLLNEEQQKRVCLR- 811
Query: 618 GSKHNDILDEL 628
+ ND+ L
Sbjct: 812 NTSINDVFKLL 822
>gi|147906138|ref|NP_001083624.1| DNA damage-binding protein 1 [Xenopus laevis]
gi|82186503|sp|Q6P6Z0.1|DDB1_XENLA RecName: Full=DNA damage-binding protein 1; AltName:
Full=Damage-specific DNA-binding protein 1
gi|38303806|gb|AAH61946.1| Ddb1 protein [Xenopus laevis]
Length = 1140
Score = 62.0 bits (149), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 80/368 (21%), Positives = 140/368 (38%), Gaps = 77/368 (20%)
Query: 278 YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
Y +GT Y ++ + GRI++F K++ + KE KG V ++
Sbjct: 828 YFVVGTAMVYPDEAEPKQGRIVVFQ-----------YNDGKLQTVAEKEVKGAVYSMVEF 876
Query: 337 AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
G L+ ++ ++Y W + T + + + + + ILVGD RS+ LL Y
Sbjct: 877 NGKLLASINSTVRLYEWTAEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 935
Query: 395 QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
+P +ARD+ P W
Sbjct: 936 KPMEGNFEEIARDFNPN----------------------WM------------------S 955
Query: 455 HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
+ILD+ + +G + LF+ Q ++ + R L + FHLG+ VN F
Sbjct: 956 AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 1007
Query: 513 RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
C S + G S + + +++G +G L E Y LL +QN + G
Sbjct: 1008 -CHGSLVMQNLGETSPPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDVQNRLNKVIKSVG 1066
Query: 570 GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
+ +R++ + P+ G IDG L+ FL +S + E+ + G K
Sbjct: 1067 KIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVIANLQIDDGSGMKRET 1124
Query: 624 ILDELYDI 631
+D+L +
Sbjct: 1125 TVDDLIKV 1132
>gi|45383688|ref|NP_989547.1| DNA damage-binding protein 1 [Gallus gallus]
gi|82098863|sp|Q805F9.1|DDB1_CHICK RecName: Full=DNA damage-binding protein 1; AltName: Full=DDB p127
subunit; AltName: Full=Damage-specific DNA-binding
protein 1; AltName: Full=UV-damaged DNA-binding factor
gi|28375613|dbj|BAC56999.1| damaged-DNA binding protein DDB p127 subunit [Gallus gallus]
gi|53130071|emb|CAG31438.1| hypothetical protein RCJMB04_6h2 [Gallus gallus]
Length = 1140
Score = 62.0 bits (149), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 125/607 (20%), Positives = 227/607 (37%), Gaps = 111/607 (18%)
Query: 53 GALKLRFKKLKVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLF 112
GAL L+ +SDR K + G + + +R F +++ VF C P ++
Sbjct: 609 GALFYFGLSLETGLLSDRKK-------VTLGTQPTVLRTFRSLS-TTNVFACSDRPTVIY 660
Query: 113 LTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVR 172
+S +L + + V+ + P ++ P N S L I + +R
Sbjct: 661 -SSNHKLVFSNVNLK-EVNYMCPLNSDGYPDSLALAN-NSTLTIGTIDEI----QKLHIR 713
Query: 173 KVPLKCTPHFLAYHLETKTYCIVTS-----TAEPSTDYYKFNGEDKELVTDPRDSRFIPP 227
VPL +P + Y ++ + +++S A T + + + L + S+
Sbjct: 714 TVPLYESPRKICYQEVSQCFGVLSSRIEVQDASGGTTALRPSASTQALSSSVSTSKLFSS 773
Query: 228 LVSQFHVSLFSPFSWEEIPQTNFPL---HEWEHVLCLKNVSMEYEGTLSGLR------GY 278
+ S EE+ N + H +E + + + EY +L + Y
Sbjct: 774 STAPHETSF-----GEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTY 828
Query: 279 IALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA 337
+GT Y E+ + GRI++F + K++ + KE KG V ++
Sbjct: 829 FIVGTAMVYPEEAEPKQGRIVVFH-----------YSDGKLQSLAEKEVKGAVYSMVEFN 877
Query: 338 GFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQ 395
G L+ ++ ++Y W + T + + + + + ILVGD RS+ LL Y+
Sbjct: 878 GKLLASINSTVRLYEWTAEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAYK 936
Query: 396 PEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKH 455
P +ARD+ P W
Sbjct: 937 PMEGNFEEIARDFNPN----------------------WM------------------SA 956
Query: 456 NDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIR 513
+ILD+ + +G + LF+ Q ++ + R L + HLG+ VN F
Sbjct: 957 VEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLSHLGEFVNVF---- 1007
Query: 514 CKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGG 570
C S + G S + + +++G +G L E Y LL +QN + G
Sbjct: 1008 CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGK 1067
Query: 571 LNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHNDI 624
+ +R++ + P+ G IDG L+ FL +S + E+ + G K
Sbjct: 1068 IEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQIDDGSGMKREAT 1125
Query: 625 LDELYDI 631
+D+L I
Sbjct: 1126 VDDLIKI 1132
>gi|407035910|gb|EKE37921.1| CPSF A subunit region protein, putative [Entamoeba nuttalli P19]
Length = 836
Score = 62.0 bits (149), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 83/371 (22%), Positives = 150/371 (40%), Gaps = 86/371 (23%)
Query: 275 LRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYA-KEQKGPVTAI 333
L+ Y+ +G N +ED +G+ +F+I +N+I++I + K V A+
Sbjct: 521 LKNYLVVGVNKQTTEDNPVKGKTYIFNI------------ENQIQLINKIGDGKKSVHAV 568
Query: 334 CHVAGFLVTAVGQKI-YIWQLKDNDLTGIAFIDTEVYIASM----VSVKN--------LI 380
+ GFL A G ++ I ++ + F D + I S+ + V LI
Sbjct: 569 NEIGGFLAVASGNELELIERVDETRWIKKCFSDISILINSIEYLPLKVMERGNEKECYLI 628
Query: 381 LVGDYARSIALLRYQP-EYRTLSLVARDYKPTQPNSKGYYAGNPSRGI--IDGSLVWKFL 437
L+ D+ RS+ LL ++P +Y + L G +R I ID + +
Sbjct: 629 LLSDFYRSVVLLLFKPYDYTVIPL-----------------GKDARNIHCIDSTFI---- 667
Query: 438 QLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKK 497
I K D FS + F D ++N+ L Y A E +
Sbjct: 668 ---------ITK----------DYFSVLEF---DSEQNLSLLNYSSAATEQLSIFEI--D 703
Query: 498 TDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLML 557
F+LG ++ F + + + G ++ Y +++G++G+ + EK Y+ L +
Sbjct: 704 ATFNLGMNLLKF-------TRLWNGKG----YIYMYVTVEGSVGYISVVEEKIYQVLRQI 752
Query: 558 QNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI 617
M H G N +R KG G G +DG ++ +F L+ ++ +C +
Sbjct: 753 NIKMNREPWHFAGTNAEEYRFEKGYGMGFGTRKHVFLDGDMLKQFRLLNEEQQKRVCLR- 811
Query: 618 GSKHNDILDEL 628
+ ND+ L
Sbjct: 812 NTSINDVFKLL 822
>gi|224050582|ref|XP_002191856.1| PREDICTED: DNA damage-binding protein 1 [Taeniopygia guttata]
Length = 1140
Score = 62.0 bits (149), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 125/607 (20%), Positives = 227/607 (37%), Gaps = 111/607 (18%)
Query: 53 GALKLRFKKLKVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLF 112
GAL L+ +SDR K + G + + +R F +++ VF C P ++
Sbjct: 609 GALFYFGLSLETGLLSDRKK-------VTLGTQPTVLRTFRSLS-TTNVFACSDRPTVIY 660
Query: 113 LTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVR 172
+S +L + + V+ + P ++ P N S L I + +R
Sbjct: 661 -SSNHKLVFSNVNLK-EVNYMCPLNSDGYPDSLALAN-NSTLTIGTIDEI----QKLHIR 713
Query: 173 KVPLKCTPHFLAYHLETKTYCIVTS-----TAEPSTDYYKFNGEDKELVTDPRDSRFIPP 227
VPL +P + Y ++ + +++S A T + + + L + S+
Sbjct: 714 TVPLYESPRKICYQEVSQCFGVLSSRIEVQDASGGTTALRPSASTQALSSSVSTSKLFSS 773
Query: 228 LVSQFHVSLFSPFSWEEIPQTNFPL---HEWEHVLCLKNVSMEYEGTLSGLR------GY 278
+ S EE+ N + H +E + + + EY +L + Y
Sbjct: 774 STAPHETSF-----GEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTY 828
Query: 279 IALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA 337
+GT Y E+ + GRI++F + K++ + KE KG V ++
Sbjct: 829 FIVGTAMVYPEEAEPKQGRIVVFH-----------YSDGKLQSLAEKEVKGAVYSMVEFN 877
Query: 338 GFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQ 395
G L+ ++ ++Y W + T + + + + + ILVGD RS+ LL Y+
Sbjct: 878 GKLLASINSTVRLYEWTAEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAYK 936
Query: 396 PEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKH 455
P +ARD+ P W
Sbjct: 937 PMEGNFEEIARDFNPN----------------------WM------------------SA 956
Query: 456 NDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIR 513
+ILD+ + +G + LF+ Q ++ + R L + HLG+ VN F
Sbjct: 957 VEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLSHLGEFVNVF---- 1007
Query: 514 CKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGG 570
C S + G S + + +++G +G L E Y LL +QN + G
Sbjct: 1008 CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGK 1067
Query: 571 LNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHNDI 624
+ +R++ + P+ G IDG L+ FL +S + E+ + G K
Sbjct: 1068 IEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQIDDGSGMKREAT 1125
Query: 625 LDELYDI 631
+D+L I
Sbjct: 1126 VDDLIKI 1132
>gi|431910407|gb|ELK13480.1| DNA damage-binding protein 1 [Pteropus alecto]
Length = 1143
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 84/374 (22%), Positives = 142/374 (37%), Gaps = 86/374 (22%)
Query: 278 YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
Y +GT Y E+ + GRI++F + K++ + KE KG V ++
Sbjct: 828 YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 876
Query: 337 AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
G L+ ++ ++Y W + T + + + + + ILVGD RS+ LL Y
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 935
Query: 395 QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
+P +ARD+ P W
Sbjct: 936 KPMEGNFEEIARDFNPN----------------------WM------------------S 955
Query: 455 HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
+ILD+ + +G + LF+ Q ++ + R L + FHLG+ VN F
Sbjct: 956 AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 1007
Query: 513 RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVM------VT 563
C S + G S + + +++G +G L E Y LL +QN + V
Sbjct: 1008 -CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066
Query: 564 HTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------ 617
H+ + R+F T + P+ G IDG L+ FL +S + E+ +
Sbjct: 1067 KIEHSLYPSQRSFHTERKT-----EPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGS 1121
Query: 618 GSKHNDILDELYDI 631
G K D+L +
Sbjct: 1122 GMKREATADDLIKV 1135
>gi|389629928|ref|XP_003712617.1| hypothetical protein MGG_16867 [Magnaporthe oryzae 70-15]
gi|351644949|gb|EHA52810.1| hypothetical protein MGG_16867 [Magnaporthe oryzae 70-15]
gi|440464739|gb|ELQ34110.1| DNA damage-binding protein 1a [Magnaporthe oryzae Y34]
Length = 1183
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 74/351 (21%), Positives = 137/351 (39%), Gaps = 42/351 (11%)
Query: 281 LGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAG-F 339
+GT Y GR+L+F V E P +I+A K I +
Sbjct: 857 VGTRYLSGTGSGHGGRVLVFG----VDESRSPY------LIHAHSTKSGCRRIATMDDDL 906
Query: 340 LVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVK-----NLILVGDYARSIALLRY 394
LV A+ + + + + + T F+ + S +V LI V D +SI LL Y
Sbjct: 907 LVIALTKTVVLVRYSETSTTSAKFLKVAAFQTSSYAVDVTVHGKLIAVADIMKSITLLEY 966
Query: 395 QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
P + K T+ + + ++GS K + E+C+ +
Sbjct: 967 IPGVGKSAKTGGKDKATRSDKE-----------VEGSKQAKLV--------EVCRDYQAM 1007
Query: 455 HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC 514
+ + ++++D D N+V+ + R+ ++F LG+ VN K+
Sbjct: 1008 WSTAVSHLEGDSWIVADGDGNLVVLLRNTAGVTLEDKRRMQMTSEFGLGECVNKIQKVMV 1067
Query: 515 KPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSH-TGGLNP 573
+ S ++AP FL+ + +G++ F + K L+ Q M H S G L
Sbjct: 1068 ETS--ANAPIVAKAFLS---TTEGSIYLFGTVAPKFQSLLMDFQANMEAHVSSPLGELQF 1122
Query: 574 RAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 624
+R+++ P R +DG + FL + +++IC+ + D+
Sbjct: 1123 NQWRSFRNPEREGAGPER-FLDGEFLEMFLDMEENTQIDICQGLSYTAEDM 1172
>gi|326919947|ref|XP_003206238.1| PREDICTED: DNA damage-binding protein 1-like [Meleagris gallopavo]
Length = 1079
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 125/607 (20%), Positives = 227/607 (37%), Gaps = 111/607 (18%)
Query: 53 GALKLRFKKLKVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLF 112
GAL L+ +SDR K + G + + +R F +++ VF C P ++
Sbjct: 548 GALFYFGLSLETGLLSDRKK-------VTLGTQPTVLRTFRSLS-TTNVFACSDRPTVIY 599
Query: 113 LTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVR 172
+S +L + + V+ + P ++ P N S L I + +R
Sbjct: 600 -SSNHKLVFSNVNLK-EVNYMCPLNSDGYPDSLALAN-NSTLTIGTIDEI----QKLHIR 652
Query: 173 KVPLKCTPHFLAYHLETKTYCIVTS-----TAEPSTDYYKFNGEDKELVTDPRDSRFIPP 227
VPL +P + Y ++ + +++S A T + + + L + S+
Sbjct: 653 TVPLYESPRKICYQEVSQCFGVLSSRIEVQDASGGTTALRPSASTQALSSSVSTSKLFSS 712
Query: 228 LVSQFHVSLFSPFSWEEIPQTNFPL---HEWEHVLCLKNVSMEYEGTLSGLR------GY 278
+ S EE+ N + H +E + + + EY +L + Y
Sbjct: 713 STAPHETSF-----GEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTY 767
Query: 279 IALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA 337
+GT Y E+ + GRI++F + K++ + KE KG V ++
Sbjct: 768 FIVGTAMVYPEEAEPKQGRIVVFH-----------YSDGKLQSLAEKEVKGAVYSMVEFN 816
Query: 338 GFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQ 395
G L+ ++ ++Y W + T + + + + + ILVGD RS+ LL Y+
Sbjct: 817 GKLLASINSTVRLYEWTAEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAYK 875
Query: 396 PEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKH 455
P +ARD+ P W
Sbjct: 876 PMEGNFEEIARDFNPN----------------------WM------------------SA 895
Query: 456 NDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIR 513
+ILD+ + +G + LF+ Q ++ + R L + HLG+ VN F
Sbjct: 896 VEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLSHLGEFVNVF---- 946
Query: 514 CKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGG 570
C S + G S + + +++G +G L E Y LL +QN + G
Sbjct: 947 CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGK 1006
Query: 571 LNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHNDI 624
+ +R++ + P+ G IDG L+ FL +S + E+ + G K
Sbjct: 1007 IEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQIDDGSGMKREAT 1064
Query: 625 LDELYDI 631
+D+L I
Sbjct: 1065 VDDLIKI 1071
>gi|440893607|gb|ELR46310.1| DNA damage-binding protein 1 [Bos grunniens mutus]
Length = 1143
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 84/374 (22%), Positives = 142/374 (37%), Gaps = 86/374 (22%)
Query: 278 YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
Y +GT Y E+ + GRI++F + K++ + KE KG V ++
Sbjct: 828 YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 876
Query: 337 AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
G L+ ++ ++Y W + T + + + + + ILVGD RS+ LL Y
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 935
Query: 395 QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
+P +ARD+ P W
Sbjct: 936 KPMEGNFEEIARDFNPN----------------------WM------------------S 955
Query: 455 HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
+ILD+ + +G + LF+ Q ++ + R L + FHLG+ VN F
Sbjct: 956 AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 1007
Query: 513 RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVM------VT 563
C S + G S + + +++G +G L E Y LL +QN + V
Sbjct: 1008 -CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066
Query: 564 HTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------ 617
H+ + R+F T + P+ G IDG L+ FL +S + E+ +
Sbjct: 1067 KIEHSLYPSQRSFHTERKT-----EPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGS 1121
Query: 618 GSKHNDILDELYDI 631
G K D+L +
Sbjct: 1122 GMKREATADDLIKV 1135
>gi|440302955|gb|ELP95261.1| hypothetical protein EIN_430670 [Entamoeba invadens IP1]
Length = 1175
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 80/367 (21%), Positives = 144/367 (39%), Gaps = 77/367 (20%)
Query: 276 RGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAK-EQKGPVTAIC 334
R + G N +ED +G + LF + + ++ I+ I + K V AI
Sbjct: 857 RVLVGCGVNTQTTEDDPVKGNVFLFSL--------ESTSEGTIRHISTVCDGKKAVHAIN 908
Query: 335 HVAGFLVTAVGQKIYIWQLKDNDL------TGIAFIDTEVYIASMVSVKN-------LIL 381
+ G+L A G ++ I + K L + I+ + + M KN LIL
Sbjct: 909 SIGGYLAVAEGNELQILKGKTESLWVKKCFSDISILINTITFLPMTLSKNKVDEMCYLIL 968
Query: 382 VGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSL 441
+ D RS+ LL +QP+ +++ + +D + ID + V L
Sbjct: 969 LNDMYRSVILLLFQPQKKSVIPLGKDGRDIHA--------------IDAAFV---LDKDY 1011
Query: 442 GERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFH 501
LEI D S M ++ ++ ++ + + A N G +++ T
Sbjct: 1012 FHVLEI---------DYERNLSVMNYLRTETERISIFEV----AATFNVGVDILRLTRLR 1058
Query: 502 LGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVM 561
LG + ++ Y S G++G+ + E++Y+ L + M
Sbjct: 1059 LG-----------------------NGYVFVYLSAQGSVGYLTVVNERSYQTLRQINAKM 1095
Query: 562 VTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKH 621
H G NP FR KG G G + I+DG ++ +F L+ ++ +C + S
Sbjct: 1096 NREPWHFAGTNPEEFRMEKGYGVGYGRRKQVILDGDILKEFHFLTQEQQKRVCLRNTSIS 1155
Query: 622 N--DILD 626
+ +ILD
Sbjct: 1156 DVVNILD 1162
>gi|440487047|gb|ELQ66855.1| DNA damage-binding protein 1a [Magnaporthe oryzae P131]
Length = 1213
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 74/351 (21%), Positives = 137/351 (39%), Gaps = 42/351 (11%)
Query: 281 LGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAG-F 339
+GT Y GR+L+F V E P +I+A K I +
Sbjct: 887 VGTRYLSGTGSGHGGRVLVFG----VDESRSPY------LIHAHSTKSGCRRIATMDDDL 936
Query: 340 LVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVK-----NLILVGDYARSIALLRY 394
LV A+ + + + + + T F+ + S +V LI V D +SI LL Y
Sbjct: 937 LVIALTKTVVLVRYSETSTTSAKFLKVAAFQTSSYAVDVTVHGKLIAVADIMKSITLLEY 996
Query: 395 QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
P + K T+ + + ++GS K + E+C+ +
Sbjct: 997 IPGVGKSAKTGGKDKATRSDKE-----------VEGSKQAKLV--------EVCRDYQAM 1037
Query: 455 HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC 514
+ + ++++D D N+V+ + R+ ++F LG+ VN K+
Sbjct: 1038 WSTAVSHLEGDSWIVADGDGNLVVLLRNTAGVTLEDKRRMQMTSEFGLGECVNKIQKVMV 1097
Query: 515 KPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSH-TGGLNP 573
+ S ++AP FL+ + +G++ F + K L+ Q M H S G L
Sbjct: 1098 ETS--ANAPIVAKAFLS---TTEGSIYLFGTVAPKFQSLLMDFQANMEAHVSSPLGELQF 1152
Query: 574 RAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 624
+R+++ P R +DG + FL + +++IC+ + D+
Sbjct: 1153 NQWRSFRNPEREGAGPER-FLDGEFLEMFLDMEENTQIDICQGLSYTAEDM 1202
>gi|391335522|ref|XP_003742140.1| PREDICTED: DNA damage-binding protein 1-like [Metaseiulus
occidentalis]
Length = 1154
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 112/489 (22%), Positives = 180/489 (36%), Gaps = 101/489 (20%)
Query: 171 VRKVPLKCTPHFLAYHLETKTYCIVTS-------------------TAEPSTDYYKFNGE 211
+R VPL +P +AY ET T+ ++ S A P + F+
Sbjct: 713 IRTVPLGESPRRIAYQEETGTFGVIVSRSDMACSTRCASLDAPNKSNASPYAWHKDFSSF 772
Query: 212 DKELVTDPRDSRFIPPLVSQFHVSLFSP-------FSWEEIPQTNFP-LHEWEHVLCLKN 263
D DS IP S SL P FS I Q F LH +
Sbjct: 773 GHTQCADRVDSG-IPSCSS---TSLQRPPSGCDETFSLLIIDQNTFEVLHAMQFCPNEYG 828
Query: 264 VSMEYEGTLSGLRGYIALGTNY-NYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIY 322
VS+ S Y +GT + N E GRI + K++ I
Sbjct: 829 VSICSAKLGSDPNPYYIVGTAFINQEESEPKVGRIFVL-----------RWHDGKLETIA 877
Query: 323 AKEQKGPVTAICHVAGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLI 380
KE G +I L A+ ++Y W + DL + I + + + I
Sbjct: 878 EKEAAGAPYSIREFHQKLAIAINSTVRLYSWN-AEKDLQSECTPFFNIVILHLKCLGDYI 936
Query: 381 LVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLS 440
LVGD RS+ LL Y + +L + RDY+ W
Sbjct: 937 LVGDLMRSMTLLNYNADITSLEEIGRDYQTN----------------------W------ 968
Query: 441 LGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDF 500
+ +ILDE + F+ ++ + N+ + P A + H + + +
Sbjct: 969 ------------TTAVEILDEDT---FLAAESNLNLYVCKRDPSAADDTRQH-MHEVALY 1012
Query: 501 HLGQHVNTFFKIRCKPSSISDAPGARSR-FLTWYASLDGALGFFLPLPEKNYRRLLMLQN 559
HLG+ VN K + D P ++ FL Y SL GA+G +P+ ++ Y L +Q
Sbjct: 1013 HLGEMVNVIVKGSLVMAQPGDMPLPLNKSFL--YGSLHGAVGVIVPIKQELYAILNQIQT 1070
Query: 560 VMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQL------SLGERLEI 613
+ G + +RT+ + P+ G IDG L+ + L L S+ + +++
Sbjct: 1071 NLAKTIKSVGKIEHGFWRTFLAERKI--EPATGFIDGDLIEQLLDLPKEALESVSQSIKV 1128
Query: 614 CKKIGSKHN 622
++ G + N
Sbjct: 1129 DEEGGHQRN 1137
>gi|336369683|gb|EGN98024.1| hypothetical protein SERLA73DRAFT_109335 [Serpula lacrymans var.
lacrymans S7.3]
gi|336382464|gb|EGO23614.1| hypothetical protein SERLADRAFT_449959 [Serpula lacrymans var.
lacrymans S7.9]
Length = 1257
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 89/361 (24%), Positives = 149/361 (41%), Gaps = 74/361 (20%)
Query: 256 EHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVT-CRGRILLFDIIEVVPEPGQPLT 314
E + L VS+ E ++ YI GT Y Y ++V +GR+L+FD E G L
Sbjct: 922 EEITALGVVSVTLERSIGT---YICAGT-YKYVDEVEPSQGRLLVFD-----AEDGS-LL 971
Query: 315 KNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFI------DTEV 368
+ KI M + E +G V A+ V G ++ A+ + +++ + + T + + +
Sbjct: 972 REKITMAVSLEVRGCVYAVGSVNGMIIAAINSSVVVYRPEIDASTQLLALHKITEWNHNY 1031
Query: 369 YIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGII 428
+ ++V + ILVGD SI+ LR + +ARDY
Sbjct: 1032 LVTNLVCRGDKILVGDAINSISFLRMVES--QIQCLARDY-------------------- 1069
Query: 429 DGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQ-PEARE 487
GSL W +C ++LD+ S +G ++ D N+ F Q E R+
Sbjct: 1070 -GSL-WP-----------VCV-------EMLDQSSIIG---ANSDYNLFTFALQETELRK 1106
Query: 488 SNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSIS-DAPGARSRFLTWYASLDGALGFFLPL 546
S L + +++G VN F +S D P + + + G +G + +
Sbjct: 1107 S-----LERDGSYYIGDMVNKFIPGALTAHDVSVDMPLEPKQL---FFTSTGCIGVIVDM 1158
Query: 547 PEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGK--GYYAGNPSRGIIDGSLVWKFLQ 604
++ + LQ M T+ S T G+ FR K A S G +DG + KF+Q
Sbjct: 1159 GDELSLHMTALQRNMSTYLSQTKGVTHTKFRAPKNAYGRSDAEATSFGFLDGDFLEKFMQ 1218
Query: 605 L 605
Sbjct: 1219 F 1219
>gi|67463896|ref|XP_648489.1| cleavage and polyadenylation specificity factor subunit [Entamoeba
histolytica HM-1:IMSS]
gi|56464653|gb|EAL43100.1| cleavage and polyadenylation specificity factor subunit, putative
[Entamoeba histolytica HM-1:IMSS]
Length = 1150
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 83/371 (22%), Positives = 150/371 (40%), Gaps = 86/371 (23%)
Query: 275 LRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYA-KEQKGPVTAI 333
L+ Y+ +G N +ED +G+ +F+I +N+I++I + K V A+
Sbjct: 835 LKNYLVVGVNKQTTEDNPVKGKTYIFNI------------ENQIQLINKIGDGKKSVHAV 882
Query: 334 CHVAGFLVTAVGQKI-YIWQLKDNDLTGIAFIDTEVYIASM----VSVKN--------LI 380
+ GFL A G ++ I ++ + F D + I S+ + V LI
Sbjct: 883 NEIGGFLAVASGNELELIERVDETRWIKKCFSDISILINSIEYLPLKVMERGNEKECYLI 942
Query: 381 LVGDYARSIALLRYQP-EYRTLSLVARDYKPTQPNSKGYYAGNPSRGI--IDGSLVWKFL 437
L+ D+ RS+ LL ++P +Y + L G +R I ID + +
Sbjct: 943 LLSDFYRSVVLLLFKPYDYTVIPL-----------------GKDARNIHCIDSTFI---- 981
Query: 438 QLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKK 497
I K D FS + F D ++N+ L Y A E +
Sbjct: 982 ---------ITK----------DYFSVLEF---DSEQNLSLLNYSSAATEQLSIFEI--D 1017
Query: 498 TDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLML 557
F+LG ++ F + + + G ++ Y +++G++G+ + EK Y+ L +
Sbjct: 1018 ATFNLGMNLLKF-------TRLWNGKG----YIYMYVTVEGSVGYISVVEEKIYQVLRQI 1066
Query: 558 QNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI 617
M H G N +R KG G G +DG ++ +F L+ ++ +C +
Sbjct: 1067 NIKMNREPWHFAGTNAEEYRFEKGYGMGFGTRKHVFLDGDMLKQFRLLNEEQQKRVCLR- 1125
Query: 618 GSKHNDILDEL 628
+ ND+ L
Sbjct: 1126 NTSINDVFKLL 1136
>gi|281345356|gb|EFB20940.1| hypothetical protein PANDA_015888 [Ailuropoda melanoleuca]
Length = 1124
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 86/374 (22%), Positives = 143/374 (38%), Gaps = 87/374 (23%)
Query: 278 YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
Y +GT Y E+ + GRI++F + K++ + KE KG V ++
Sbjct: 810 YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 858
Query: 337 AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
G L+ ++ ++Y W + T + + + + + ILVGD RS+ LL Y
Sbjct: 859 NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 917
Query: 395 QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
+P +ARD+ P W
Sbjct: 918 KPMEGNFEEIARDFNPN----------------------WM------------------S 937
Query: 455 HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
+ILD+ + +G + LF+ Q ++ + R L + FHLG+ VN F
Sbjct: 938 AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 989
Query: 513 RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVM------VT 563
C S + G S + + +++G +G L E Y LL +QN + +T
Sbjct: 990 -CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKNIT 1048
Query: 564 HTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------ 617
H S T R+F T + P+ G IDG L+ FL +S + E+ +
Sbjct: 1049 H-SLTHLSTWRSFHTERKT-----EPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGS 1102
Query: 618 GSKHNDILDELYDI 631
G K D+L +
Sbjct: 1103 GMKREATADDLIKV 1116
>gi|392591958|gb|EIW81285.1| hypothetical protein CONPUDRAFT_56293 [Coniophora puteana RWD-64-598
SS2]
Length = 1245
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 97/419 (23%), Positives = 170/419 (40%), Gaps = 100/419 (23%)
Query: 223 RFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALG 282
R P +S+ V L++ + +++ Q E + +V + E + G + +
Sbjct: 875 RVGEPEISRGSVQLYNDTTLDKLGQVVLDHDEEPMAIKALSVRVAEEAKDCFVVGTVIID 934
Query: 283 TNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVT 342
+ N S GR+LL EP ++ + + +++ KG V A+ V G +V
Sbjct: 935 SLENES----SSGRLLLV-------EPDYSRGESFVAVSASEKVKGCVYAVAAVDGLVVA 983
Query: 343 AVGQKIYIWQLKDNDLT-GIAFI-----DTEVYIASMVSVKNLILVGDYARSIALLRYQP 396
AV + I+ ++ +D T ++F+ + +A++VS NL+LVGD S+ LL+Y
Sbjct: 984 AVNSAVVIYSIEADDHTRALSFVKKVEWNHNYVVANLVSRGNLLLVGDAISSVTLLQY-- 1041
Query: 397 EYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHN 456
E L VARDY P P S
Sbjct: 1042 ERGALQNVARDYSPLWPTSV---------------------------------------- 1061
Query: 457 DILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRC 514
++LDE + +G +D D N+ +F Q +G R L + ++ G VN F
Sbjct: 1062 EMLDERNVIG---ADNDCNLFMFTLQ------DGAERKVLERNGHYYFGDMVNKFI---- 1108
Query: 515 KPSSISDAPGARSRFLTWYASLD-------------GALGFFLPLPEKNYRRLLMLQNVM 561
PG R L+ + + D G++G + + ++ + LQ +
Sbjct: 1109 --------PGEIYRALSSFEASDIEVEPKQLFFTTTGSIGVVIDMSDELSLHMSSLQRNL 1160
Query: 562 VTH-TSHTGGLNPRAFRTYK-GKGYY-AGNPSRGIIDGSLVWKFLQLSLGERLEICKKI 617
T+ + GG + +R K +G A N S G +DG L+ +FL G+ E +K+
Sbjct: 1161 STYFAAQPGGASHTKYRAPKNARGRSDADNSSFGFLDGDLLERFLL--FGDDEEAVRKV 1217
>gi|385865228|gb|AFI92852.1| DNA damage-binding protein 1 [Danio rerio]
Length = 1140
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 119/584 (20%), Positives = 221/584 (37%), Gaps = 102/584 (17%)
Query: 75 NEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLA 134
+E+ + G + + +R F +++ VF C P ++ +S +L + + V+ +
Sbjct: 624 SERKKVTLGTQPTVLRTFRSLS-TSNVFACSDRPTVIY-SSNHKLVFSNVNLK-EVNYMC 680
Query: 135 PFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCI 194
P ++ P N S L I + +R VPL +P + Y ++ + +
Sbjct: 681 PLNSEGYPDSLALAN-NSTLTIGTIDEI----QKLHIRTVPLYESPKRICYQEVSQCFGV 735
Query: 195 VTSTAE-----PSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTN 249
++S E +T + + + L + S+ P S S EE+ +
Sbjct: 736 LSSRVEMQDASGTTAAVRPSASTQALSSSVSSSKLFPSSTSPHETSF-----GEEVEVHS 790
Query: 250 FPL---HEWEHVLCLKNVSMEYEGTLSGLR------GYIALGTNYNYSEDVTCR-GRILL 299
+ H +E + + + EY ++ + Y +GT Y E+ + GRI++
Sbjct: 791 LLVVDQHTFEVLHAHQFLQNEYALSMVSCKLGRDPAVYFIVGTAMVYPEEAEPKQGRIIV 850
Query: 300 FDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQ--KIYIWQLKDND 357
F T K++ + KE KG V ++ G L+ ++ ++Y W +
Sbjct: 851 FH-----------YTDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRLYEWTAEKEL 899
Query: 358 LTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKG 417
T + + + + + ILVGD RS+ LL Y+P + +ARD+ P
Sbjct: 900 RTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAYKPMEGSFEEIARDFNPN------ 952
Query: 418 YYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVV 477
W +ILD+ + +G +
Sbjct: 953 ----------------WM------------------SAVEILDDDNFLG-----AENAFN 973
Query: 478 LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFF--KIRCKPSSISDAPGARSRFLTWY 533
LF+ Q ++ + R L + FHLG+ VN F + + S P S +
Sbjct: 974 LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVFSHGSLVLQNLGESSTPTQGS---VLF 1030
Query: 534 ASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGI 593
+++G +G L E Y LL LQN + G + +R++ + + G
Sbjct: 1031 GTVNGMIGLVTSLSEGWYSLLLDLQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEQATGF 1088
Query: 594 IDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 631
IDG L+ FL L + E+ + G K +DE+ I
Sbjct: 1089 IDGDLIESFLDLGRAKMQEVVSTLQIDDGSGMKREATVDEVIKI 1132
>gi|81868411|sp|Q9ESW0.1|DDB1_RAT RecName: Full=DNA damage-binding protein 1; AltName:
Full=Damage-specific DNA-binding protein 1
gi|9843869|emb|CAB89874.2| damage-specific DNA binding protein 1 [Rattus norvegicus]
Length = 1140
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 81/368 (22%), Positives = 139/368 (37%), Gaps = 77/368 (20%)
Query: 278 YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
Y +GT Y E+ + GRI++F + K++ + KE KG V ++
Sbjct: 828 YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSGGKLQTVAEKEVKGAVYSMVEF 876
Query: 337 AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
G L+ ++ ++Y W + T + + + + + ILVGD RS+ LL Y
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 935
Query: 395 QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
+P +ARD+ P W
Sbjct: 936 KPMEGNFEEIARDFNPN----------------------WM------------------S 955
Query: 455 HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
+ILD+ + +G + LF+ Q ++ + R L + FHLG+ VN F
Sbjct: 956 AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 1007
Query: 513 RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
C S + G S + +++G +G L E Y LL +QN + G
Sbjct: 1008 -CHGSLVMQNLGETSTPTQGSVLLGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066
Query: 570 GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
+ +R++ + P+ G IDG L+ FL +S + E+ + G K
Sbjct: 1067 KIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREA 1124
Query: 624 ILDELYDI 631
D+L +
Sbjct: 1125 TADDLIKV 1132
>gi|194389106|dbj|BAG61570.1| unnamed protein product [Homo sapiens]
Length = 1009
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 82/368 (22%), Positives = 143/368 (38%), Gaps = 77/368 (20%)
Query: 278 YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
Y +GT Y E+ + GRI++F + K++ + KE KG V ++
Sbjct: 697 YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 745
Query: 337 AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
G L+ ++ ++Y W + T + + + + + ILVGD RS+ LL Y
Sbjct: 746 NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 804
Query: 395 QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
+P +ARD+ P W +S E L+ +G++
Sbjct: 805 KPMEGNFEEIARDFNPN----------------------W----MSAVEILDDDNFLGAE 838
Query: 455 HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
+ F+S F+ Q ++ + R L + FHLG+ VN F
Sbjct: 839 -----NAFNS--------------FVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 876
Query: 513 RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
C S + G S + + +++G +G L E Y LL +QN + G
Sbjct: 877 -CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 935
Query: 570 GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
+ +R++ + P+ G IDG L+ FL +S + E+ + G K
Sbjct: 936 KIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREA 993
Query: 624 ILDELYDI 631
D+L +
Sbjct: 994 TADDLIKV 1001
>gi|224587439|gb|ACN58665.1| DNA damage-binding protein 1 [Salmo salar]
Length = 444
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 103/483 (21%), Positives = 180/483 (37%), Gaps = 84/483 (17%)
Query: 171 VRKVPLKCTPHFLAYHLETKTYCIVTSTAE-----PSTDYYKFNGEDKELVTDPRDSRFI 225
+R VPL +P + Y ++ + +++S E +T + + + L + S+
Sbjct: 16 IRTVPLYESPRRICYQEVSQCFGVLSSRVEMQDASGTTAAVRPSASTQALSSSVSSSKLF 75
Query: 226 PPLVSQFHVSL---FSPFSWEEIPQTNFP-LHEWEHVLCLKNVSMEYEGTLSGLRGYIAL 281
P S S S + Q F LH + + +SM L Y +
Sbjct: 76 PSSTSPHETSFGEEVEVHSLLVVDQHTFEVLHAHQFLQSEYALSMVSCRLGRDLSVYFIV 135
Query: 282 GTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFL 340
GT Y E+ + GRI++F T K++ + KE KG V ++ G L
Sbjct: 136 GTAMVYPEEAEPKQGRIIVFH-----------YTDGKLQTVAEKEVKGAVYSMMEFNGKL 184
Query: 341 VTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEY 398
+ ++ ++Y W + T + + + + + ILVGD RS+ LL Y+P
Sbjct: 185 LASINSTVRLYEWTAEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAYKPME 243
Query: 399 RTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 458
+ARD+ P ++ +I
Sbjct: 244 GNFEEIARDFNPNWMSAV----------------------------------------EI 263
Query: 459 LDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFF--KIRC 514
LD+ + +G + LF+ Q ++ + R L + FHLG+ VN F +
Sbjct: 264 LDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGVFHLGEFVNVFSHGSLVL 318
Query: 515 KPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPR 574
+ S P S + +++G +G L E Y LL LQN + G +
Sbjct: 319 QNLGESSTPTQGS---VLFGTVNGMIGLVTSLSEGWYSLLLDLQNRLNKVIKSVGKIEHS 375
Query: 575 AFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDEL 628
+R++ + + G IDG L+ FL L + E+ + G K +DE+
Sbjct: 376 FWRSFHTE--RKTEQATGFIDGDLIESFLDLGRAKMQEVVSTLQIDDGSGMKREATVDEV 433
Query: 629 YDI 631
I
Sbjct: 434 IKI 436
>gi|193644722|ref|XP_001942922.1| PREDICTED: DNA damage-binding protein 1-like [Acyrthosiphon pisum]
Length = 1156
Score = 60.8 bits (146), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 77/347 (22%), Positives = 128/347 (36%), Gaps = 57/347 (16%)
Query: 278 YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
Y LGT ED + GRIL+F + + +K+ I KE KG +
Sbjct: 837 YYILGTAVVNPEDQDPKLGRILIFHWDD---------SSSKLTPITEKEVKGACYGMAEF 887
Query: 337 AGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVK-NLILVGDYARSIALLRYQ 395
G L+ AV + +++ + +A V K + I+ GD RS+ LL+Y+
Sbjct: 888 NGKLLAAVNCTVRLFEWTAEKELRLECSHFNNIVALFVKTKGDFIVCGDLMRSLTLLQYK 947
Query: 396 PEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKH 455
+ +ARDY P W S
Sbjct: 948 TMEGSFEEIARDYNPK----------------------W------------------STA 967
Query: 456 NDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCK 515
+I+D+ +G ++ DKN+ + H+L + FH G +N F
Sbjct: 968 IEIIDDDVFLG---AENDKNLFIIHKDSTLTSDEARHQLQEIGQFHCGDLINVFRHGSLV 1024
Query: 516 PSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRA 575
+D + + Y + GALG L K + L L+ + T G +N +
Sbjct: 1025 MQHFTDTYVSVQGGI-LYGTCSGALGLVTQLTPKMFDFLSDLEKSLATVVKGVGKINHQF 1083
Query: 576 FRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHN 622
+R+Y + PS +DG L+ FL LS E + + + ++
Sbjct: 1084 WRSYHTE--IRTEPSESFVDGDLIESFLDLSKREMIAVVDALQGAYD 1128
>gi|449283451|gb|EMC90093.1| DNA damage-binding protein 1 [Columba livia]
Length = 1140
Score = 60.8 bits (146), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 126/616 (20%), Positives = 229/616 (37%), Gaps = 118/616 (19%)
Query: 53 GALKLRFKKLKVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLF 112
GAL L+ +SDR K + G + + +R F +++ VF C P ++
Sbjct: 598 GALFYFGLSLETGLLSDRKK-------VTLGTQPTVLRTFRSLS-TTNVFACSDRPTVIY 649
Query: 113 LTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVR 172
+S +L + + V+ + P ++ P N S L I + +R
Sbjct: 650 -SSNHKLVFSNVNLK-EVNYMCPLNSDGYPDSLALAN-NSTLTIGTIDEI----QKLHIR 702
Query: 173 KVPLKCTPHFLAYHLETKTYCIVTS-----TAEPSTDYYKFNGEDKELVTDPRDSRFIPP 227
VPL +P + Y ++ + +++S A T + + + L + S+
Sbjct: 703 TVPLYESPRKICYQEVSQCFGVLSSRIEVQDASGGTTALRPSASTQALSSSVSTSKLFSS 762
Query: 228 LVSQFHVSLFSPFSWEEIPQTNFPL---HEWEHVLCLKNVSMEYEGTLSGLR------GY 278
+ S EE+ N + H +E + + + EY +L + Y
Sbjct: 763 STAPHETSF-----GEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTY 817
Query: 279 IALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA 337
+GT Y E+ + GRI++F + K++ + KE KG V ++
Sbjct: 818 FIVGTAMVYPEEAEPKQGRIVVFH-----------YSDGKLQSLAEKEVKGAVYSMVEFN 866
Query: 338 GFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQ 395
G L+ ++ ++Y W + T + + + + + ILVGD RS+ LL Y+
Sbjct: 867 GKLLASINSTVRLYEWTAEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAYK 925
Query: 396 PEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKH 455
P +ARD+ P W
Sbjct: 926 PMEGNFEEIARDFNPN----------------------WM------------------SA 945
Query: 456 NDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIR 513
+ILD+ + +G + LF+ Q ++ + R L + HLG+ VN F
Sbjct: 946 VEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLSHLGEFVNVF---- 996
Query: 514 CKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVM--------- 561
C S + G S + + +++G +G L E Y LL +QN +
Sbjct: 997 CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGK 1056
Query: 562 VTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI---- 617
+ H+ + + RA+ + P+ G IDG L+ FL +S + E+ +
Sbjct: 1057 IEHSLYPSLVQLRAWASQSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQIDD 1116
Query: 618 --GSKHNDILDELYDI 631
G K +D+L I
Sbjct: 1117 GSGMKREATVDDLIKI 1132
>gi|119594342|gb|EAW73936.1| damage-specific DNA binding protein 1, 127kDa, isoform CRA_d [Homo
sapiens]
Length = 1146
Score = 60.8 bits (146), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 84/377 (22%), Positives = 142/377 (37%), Gaps = 89/377 (23%)
Query: 278 YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
Y +GT Y E+ + GRI++F + K++ + KE KG V ++
Sbjct: 828 YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 876
Query: 337 AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
G L+ ++ ++Y W + T + + + + + ILVGD RS+ LL Y
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 935
Query: 395 QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
+P +ARD+ P W
Sbjct: 936 KPMEGNFEEIARDFNPN----------------------WM------------------S 955
Query: 455 HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
+ILD+ + +G + LF+ Q ++ + R L + FHLG+ VN F
Sbjct: 956 AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 1007
Query: 513 RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQN---------V 560
C S + G S + + +++G +G L E Y LL +QN
Sbjct: 1008 -CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKRCF 1066
Query: 561 MVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI--- 617
+++ S T R+F T + P+ G IDG L+ FL +S + E+ +
Sbjct: 1067 LISTCSLTHPSTWRSFHTERKT-----EPATGFIDGDLIESFLDISRPKMQEVVANLQYD 1121
Query: 618 ---GSKHNDILDELYDI 631
G K D+L +
Sbjct: 1122 DGSGMKREATADDLIKV 1138
>gi|452979181|gb|EME78944.1| hypothetical protein MYCFIDRAFT_43692 [Pseudocercospora fijiensis
CIRAD86]
Length = 1149
Score = 60.5 bits (145), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 108/548 (19%), Positives = 210/548 (38%), Gaps = 114/548 (20%)
Query: 97 GYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYF-----NAK 151
G Q VF HP+ ++ S G L +T + + +A F + G + N +
Sbjct: 682 GLQNVFATCEHPSLIY-GSDGRLVYSAVTAEN-ATCIASFDSFGDYAGAIAIATTDENGE 739
Query: 152 SELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTY---CIVTSTAEPSTDYYKF 208
+EL+++V+ + V+ + + T +AY E K + CI +
Sbjct: 740 NELKLAVVDEERTTH----VQDLFIHETVRRIAYSAELKAFGLGCIKRT----------L 785
Query: 209 NGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEY 268
+ ++E+ + H L +++E+ + L+E E V C+ ++
Sbjct: 786 SAGNEEVAS---------------HFKLVDEVAFKELD--TWALNEDELVECVIRCYLD- 827
Query: 269 EGTLSGLRGYIALGTNYNYSEDVT-CRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQK 327
+G+ ++ +GT Y +D +GRIL+ +I E +IK++ +
Sbjct: 828 DGSGEEAERFV-VGTAYLDDQDANNAKGRILVLEITE----------DRRIKLVTELAVR 876
Query: 328 GPVTAICHVAGFLVTAVGQKIYIWQLKDND-----LTGIAFIDTEVYIASMVSVKNLILV 382
G + G +V A+ + I ++ + LT A T + N I V
Sbjct: 877 GACRCLAVCQGRIVAALVKTIVVYDFEYQTPSTPALTKKASYRTATAPIDICVTNNTIAV 936
Query: 383 GDYARSIALLRY----QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQ 438
D +S++LL + Q + TL +AR ++ +W
Sbjct: 937 TDLMKSLSLLEFKAGRQGQPDTLIEIARHFET----------------------LWG--- 971
Query: 439 LSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKT 498
C ++ S ++ SD + N+++ + RL +
Sbjct: 972 -------TACARV-----------SENTYLESDAEGNLIVLQHDINGFSQEDRRRLRVTS 1013
Query: 499 DFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQ 558
+F LG+ VN R +P ++ +PGA + A+ DG++ + + + L+ +Q
Sbjct: 1014 EFLLGEMVN-----RIRPITVQPSPGAVVTPQAFLATTDGSIYVYCEIGKPRQDLLMRMQ 1068
Query: 559 NVMVTHTSHTGGLNPRAFRTYKGKGYYAGN--PSRGIIDGSLVWKFLQLSLGERLEICKK 616
+M GG+ FR +K G P R +DG L+ +FL + + E+ K
Sbjct: 1069 TLMADMVKSPGGVRFAKFRGFKTLVRDMGEEGPVR-FVDGELIERFLDMPEVLQNEVVKG 1127
Query: 617 IGSKHNDI 624
+ D+
Sbjct: 1128 LDGTGVDL 1135
>gi|427788481|gb|JAA59692.1| Putative dna damage-binding protein 1 [Rhipicephalus pulchellus]
Length = 1156
Score = 60.5 bits (145), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 108/499 (21%), Positives = 191/499 (38%), Gaps = 105/499 (21%)
Query: 171 VRKVPLKCTPHFLAYHLETKTYCIVT-----------STAEPSTDYYKFNGEDKELVTDP 219
+R VPL P +AY T+T+ ++T + PS N ++
Sbjct: 726 IRTVPLGELPRRIAYQEATQTFGVITIRNDILGSSGLTPVRPSASTQAQNVTHSAQMS-- 783
Query: 220 RDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPL---HEWEHVLCLKNVSMEYEGTLSGLR 276
S F P VS + L +E+ N + H +E + + + EY ++ R
Sbjct: 784 --SIFKPGSVSTGNDQL-----GQEVEIHNLLIIDQHTFEVLHAHQFMQTEYAMSIVSTR 836
Query: 277 ------GYIALGT-NYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGP 329
Y +GT N E +GRI++F ++ K++ + +E KG
Sbjct: 837 LGNDPNTYYIVGTANVLPDESDPKQGRIVVFHWVD-----------GKLEHVAEQEIKGA 885
Query: 330 VTAICHVAGFLVTAVGQKIYIWQLK-DNDLTGIAFIDTEVYIASMVSVKNLILVGDYARS 388
++ G L+ A+ + +++ + +L + + + + +LVGD RS
Sbjct: 886 PYSMLEFNGKLLAAINSTVRLFEWNAERELRNECSHFNNILALYLRAKGDFVLVGDLMRS 945
Query: 389 IALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEIC 448
++LL Y+P +ARDY+ +S
Sbjct: 946 MSLLAYKPLEGNFEEIARDYQTNWMSSV-------------------------------- 973
Query: 449 KKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHV 506
+ILD+ + +G + LF+ Q ++ + R L + FHLG+ V
Sbjct: 974 --------EILDDDTFLG-----AESTTNLFVCQKDSAATTDEERQHLQEVGQFHLGEFV 1020
Query: 507 NTFFKIRCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVT 563
N F S + PG S + + ++ GA+G LP Y L +Q +
Sbjct: 1021 NVFR----HGSLVMQHPGETSSPTQGSVLFGTIHGAIGLVSQLPADFYTFLSEVQEKLTK 1076
Query: 564 HTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------ 617
G ++ +R++ + P+ G IDG L+ FL LS + E+ + I
Sbjct: 1077 VIKSVGKIDHAFWRSFSTE--RKTEPAVGFIDGDLIESFLDLSRDKMQEVVQGIQMDDGS 1134
Query: 618 GSKHNDILDELYD-IEALS 635
G K + +D+L IE LS
Sbjct: 1135 GMKRDASVDDLIKIIEELS 1153
>gi|444513057|gb|ELV10249.1| DNA damage-binding protein 1 [Tupaia chinensis]
Length = 1146
Score = 60.1 bits (144), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 84/377 (22%), Positives = 142/377 (37%), Gaps = 89/377 (23%)
Query: 278 YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
Y +GT Y E+ + GRI++F + K++ + KE KG V ++
Sbjct: 828 YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 876
Query: 337 AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
G L+ ++ ++Y W + T + + + + + ILVGD RS+ LL Y
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 935
Query: 395 QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
+P +ARD+ P W
Sbjct: 936 KPMEGNFEEIARDFNPN----------------------WM------------------S 955
Query: 455 HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
+ILD+ + +G + LF+ Q ++ + R L + FHLG+ VN F
Sbjct: 956 AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 1007
Query: 513 RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVM-------- 561
C S + G S + + +++G +G L E Y LL +QN +
Sbjct: 1008 -CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKRCF 1066
Query: 562 -VTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI--- 617
++ S T R+F T + P+ G IDG L+ FL +S + E+ +
Sbjct: 1067 QISPNSLTDMSTWRSFHTERKT-----EPATGFIDGDLIESFLDISRPKMQEVVANLQYD 1121
Query: 618 ---GSKHNDILDELYDI 631
G K D+L +
Sbjct: 1122 DGSGMKREATADDLIKV 1138
>gi|452838792|gb|EME40732.1| hypothetical protein DOTSEDRAFT_177898 [Dothistroma septosporum
NZE10]
Length = 1138
Score = 60.1 bits (144), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 110/539 (20%), Positives = 203/539 (37%), Gaps = 115/539 (21%)
Query: 97 GYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRI 156
G Q VF HP+ ++ S G + +T + S + N N + + ELRI
Sbjct: 681 GLQNVFATCEHPSLIY-GSEGRMVYSAVTAESATSICS--FNSNSYGNAIAIASNDELRI 737
Query: 157 SVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTY---CIV-TSTAEPSTDYYKFNGED 212
+ + + V+ + + T AY E K + CI T TA G++
Sbjct: 738 AAVDEERTTH----VQDLFIHETVRRTAYSAELKAFGLGCIQRTLTA----------GQE 783
Query: 213 KELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTL 272
+ + H L +++E+ ++ L+E E V + ++ +G+
Sbjct: 784 E----------------VKSHFKLVDEVAFKELD--SYELNEDELVESVIRCKLD-DGSG 824
Query: 273 SGLRGYIALGTNYNYSEDV-TCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVT 331
G + A+GT Y +D T RGRIL+ ++ E ++K++ KG
Sbjct: 825 DGAERF-AVGTAYLDDQDSNTARGRILILEVTE----------DRRLKLVTELSVKGACR 873
Query: 332 AICHVAGFLVTAVGQKIYIWQLK----DNDLTGIAFIDTEVYIASMVSVKNLILVGDYAR 387
+ G +V A+ + + I+ + LT A T + N+I V D +
Sbjct: 874 CLAVCEGKIVAALIKTVIIYDFEFAASKATLTKKASYRTATAPIDVCVTGNVIAVTDLMK 933
Query: 388 SIALLRYQ------PEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSL 441
S++L+ Y+ P+ TL+ +AR ++ + A N
Sbjct: 934 SMSLVEYKKGRTGMPD--TLTEIARHFETLWGTAVANVADNT------------------ 973
Query: 442 GERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFH 501
++ SD + N+++ + RL ++
Sbjct: 974 -------------------------YLQSDAEGNLIVLQHDTNGFSEEDRRRLRVTSELL 1008
Query: 502 LGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVM 561
LG+ VN +I P+ GA + A+++G++ F + L+ +QN M
Sbjct: 1009 LGEMVNRIRRIDVTPTH-----GALVIPRAFLATVEGSIYLFALIVPGKQDLLMRMQNNM 1063
Query: 562 VTHTSHTGGLNPRAFRTYKG--KGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 618
+ G + FR +K + A PSR +DG L+ +FL + EI + +G
Sbjct: 1064 ASLVKSPGHVEFATFRGFKNQVRDEGANGPSR-FVDGELIERFLDCGQDIQEEIIRDLG 1121
>gi|348560393|ref|XP_003465998.1| PREDICTED: DNA damage-binding protein 1-like [Cavia porcellus]
Length = 1140
Score = 60.1 bits (144), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 80/368 (21%), Positives = 139/368 (37%), Gaps = 77/368 (20%)
Query: 278 YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
Y +GT Y E+ + GRI++F + K++ + KE KG V ++
Sbjct: 828 YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 876
Query: 337 AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
G L+ ++ ++Y W + T + + + + + ILVGD RS+ LL Y
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 935
Query: 395 QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
+P +ARD+ P W
Sbjct: 936 KPMEGNFEEIARDFNPN----------------------WM------------------S 955
Query: 455 HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
+ILD+ + +G + LF+ Q ++ + R L + FHLG+ VN F
Sbjct: 956 AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 1007
Query: 513 RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
C S + G S + + +++G +G L E Y LL +QN + G
Sbjct: 1008 -CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066
Query: 570 GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
+ +R++ + + G IDG L+ FL +S + E+ + G K
Sbjct: 1067 KIEHSFWRSFHTE--RKTEQATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREA 1124
Query: 624 ILDELYDI 631
D+L +
Sbjct: 1125 TADDLIKV 1132
>gi|432851195|ref|XP_004066902.1| PREDICTED: DNA damage-binding protein 1-like [Oryzias latipes]
Length = 1140
Score = 60.1 bits (144), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 82/368 (22%), Positives = 139/368 (37%), Gaps = 77/368 (20%)
Query: 278 YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
Y +GT Y E+ + GRI++F T K++ + KE KG V ++
Sbjct: 828 YFIVGTAMVYPEEPEPKQGRIIVFH-----------YTDGKLQTVAEKEVKGAVYSMVEF 876
Query: 337 AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
G L+ ++ ++Y W + T + + + + + ILVGD RS+ LL Y
Sbjct: 877 NGKLLASINSTVRLYEWTAEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 935
Query: 395 QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
+P +ARD+ P W
Sbjct: 936 KPMEGNFEEIARDFNPN----------------------WM------------------S 955
Query: 455 HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
+ILD+ + +G + LF+ Q ++ + R L + FHLG+ VN F
Sbjct: 956 AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGVFHLGEFVNVF--- 1007
Query: 513 RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
C S + G S + + +++G +G L E + LL LQN + G
Sbjct: 1008 -CHGSLVLQNLGESSTPTQGSVLFGTVNGMIGLVTSLSEGWHSLLLDLQNRLNKVIKSVG 1066
Query: 570 GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
+ +R++ + + G IDG L+ FL L + E+ + G K
Sbjct: 1067 KIEHSFWRSFYTE--RKTEQATGFIDGDLIESFLDLGRAKMQEVVSTLQIDDGGGMKREA 1124
Query: 624 ILDELYDI 631
+DE+ I
Sbjct: 1125 TVDEVIKI 1132
>gi|427780151|gb|JAA55527.1| Putative dna damage-binding protein 1 [Rhipicephalus pulchellus]
Length = 1181
Score = 59.7 bits (143), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 108/499 (21%), Positives = 191/499 (38%), Gaps = 105/499 (21%)
Query: 171 VRKVPLKCTPHFLAYHLETKTYCIVT-----------STAEPSTDYYKFNGEDKELVTDP 219
+R VPL P +AY T+T+ ++T + PS N ++
Sbjct: 751 IRTVPLGELPRRIAYQEATQTFGVITIRNDILGSSGLTPVRPSASTQAQNVTHSAQMS-- 808
Query: 220 RDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPL---HEWEHVLCLKNVSMEYEGTLSGLR 276
S F P VS + L +E+ N + H +E + + + EY ++ R
Sbjct: 809 --SIFKPGSVSTGNDQL-----GQEVEIHNLLIIDQHTFEVLHAHQFMQTEYAMSIVSTR 861
Query: 277 ------GYIALGT-NYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGP 329
Y +GT N E +GRI++F ++ K++ + +E KG
Sbjct: 862 LGNDPNTYYIVGTANVLPDESDPKQGRIVVFHWVD-----------GKLEHVAEQEIKGA 910
Query: 330 VTAICHVAGFLVTAVGQKIYIWQLK-DNDLTGIAFIDTEVYIASMVSVKNLILVGDYARS 388
++ G L+ A+ + +++ + +L + + + + +LVGD RS
Sbjct: 911 PYSMLEFNGKLLAAINSTVRLFEWNAERELRNECSHFNNILALYLRAKGDFVLVGDLMRS 970
Query: 389 IALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEIC 448
++LL Y+P +ARDY+ +S
Sbjct: 971 MSLLAYKPLEGNFEEIARDYQTNWMSSV-------------------------------- 998
Query: 449 KKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHV 506
+ILD+ + +G + LF+ Q ++ + R L + FHLG+ V
Sbjct: 999 --------EILDDDTFLG-----AESTTNLFVCQKDSAATTDEERQHLQEVGQFHLGEFV 1045
Query: 507 NTFFKIRCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVT 563
N F S + PG S + + ++ GA+G LP Y L +Q +
Sbjct: 1046 NVFR----HGSLVMQHPGETSSPTQGSVLFGTIHGAIGLVSQLPADFYTFLSEVQEKLTK 1101
Query: 564 HTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------ 617
G ++ +R++ + P+ G IDG L+ FL LS + E+ + I
Sbjct: 1102 VIKSVGKIDHAFWRSFSTE--RKTEPAVGFIDGDLIESFLDLSRDKMQEVVQGIQMDDGS 1159
Query: 618 GSKHNDILDELYD-IEALS 635
G K + +D+L IE LS
Sbjct: 1160 GMKRDASVDDLIKIIEELS 1178
>gi|260790329|ref|XP_002590195.1| hypothetical protein BRAFLDRAFT_128289 [Branchiostoma floridae]
gi|229275385|gb|EEN46206.1| hypothetical protein BRAFLDRAFT_128289 [Branchiostoma floridae]
Length = 1152
Score = 59.7 bits (143), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 76/345 (22%), Positives = 132/345 (38%), Gaps = 65/345 (18%)
Query: 278 YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
Y +GT Y E+ + GRI++F T K++ + KE KG V ++
Sbjct: 835 YFIIGTAMVYPEESEPKSGRIIVFQ-----------YTDGKLQQVAEKEVKGAVYSLVQF 883
Query: 337 AGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVK-NLILVGDYARSIALLRYQ 395
L+ ++ + +++ + +A + K + ILVGD RS+ LL Y+
Sbjct: 884 NNKLLASINSTVRLFEWTAEKELRVECNHYNNILALYLKTKGDFILVGDLMRSVTLLAYK 943
Query: 396 PEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKH 455
P +ARD+ P W +S E L+ +G+++
Sbjct: 944 PMEGCFEEIARDFNPN----------------------W----MSAVEILDDDNFLGAEN 977
Query: 456 NDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCK 515
S F KD + +E GH FHLG+ VN F
Sbjct: 978 --------SFNFFTCQKDSAATTDEERQHLQEV--GH-------FHLGEFVNVFR----H 1016
Query: 516 PSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 572
S + PG S + + +++GA+G LP + L +Q+ + G +
Sbjct: 1017 GSLVMQHPGETSTPTQGSVLFGTVNGAVGLVTQLPADFFNFLQEVQSKLTRVIKSVGKIE 1076
Query: 573 PRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI 617
+R++ + +G IDG L+ FL LS + E+ + +
Sbjct: 1077 HSFWRSFNTE--RKTEACQGFIDGDLIESFLDLSRDKMQEVVQGL 1119
>gi|324502823|gb|ADY41238.1| DNA damage-binding protein 1, partial [Ascaris suum]
Length = 1129
Score = 59.7 bits (143), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 100/454 (22%), Positives = 177/454 (38%), Gaps = 82/454 (18%)
Query: 171 VRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGE------DKELVTDPRDSRF 224
+R VPL + +AY ET T I+ E + +G+ ++ + S
Sbjct: 700 IRTVPLGESVSRIAYQPETGTIAILVQRNE----FVDADGKHHCGHCASKMAVNASSSH- 754
Query: 225 IPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWE----HVLCLKNVSMEYEGTLSGL--RGY 278
P +V+ P E F + E H L ++M + + G + Y
Sbjct: 755 -PSVVTSATTPPIEPEEIEVSSVVVFDANTLEILHSHELGKNELAMSIKSCVLGDDPQPY 813
Query: 279 IALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA 337
A+GT +++ + GR+L+F +V P ++++++ KE KG +I +
Sbjct: 814 YAVGTAVVLTDETESKSGRLLIF---QVAPSS----EGGRMRLVHDKEIKGAAYSIQVLM 866
Query: 338 GFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKN-LILVGDYARSIALLRYQP 396
G LV A+ + +++ + D + A + KN ++LVGD RS+++L Y+P
Sbjct: 867 GKLVVAINSCVRLFEWTAEKELRLECSDFDNVTALYLRTKNDVVLVGDLMRSLSVLAYKP 926
Query: 397 EYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHN 456
+ +ARD+ W + C+ I
Sbjct: 927 MESSFEKIARDFVTN----------------------W----------MTACEIID---- 950
Query: 457 DILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRL-IKKTD-FHLGQHVNTFFKIRC 514
++ F M + LF + + G RL +++T ++LG+ VN F C
Sbjct: 951 --METFLGAEIMFN-------LFTVVKDCSSKDEGIRLQLQETGMYYLGESVNAF----C 997
Query: 515 KPSSISDAPGARSRFLT--WYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 572
S I+ F T Y + DG LG + L + Y + L+ + T + +
Sbjct: 998 HGSLIATHIDLTPSFTTPILYGTSDGGLGVIVQLTPQFYDFVHELETRIAAVTKNCMRIE 1057
Query: 573 PRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLS 606
+RT++ G S G IDG LV L +S
Sbjct: 1058 HGQYRTFESDGRT--EQSVGFIDGDLVEGLLDMS 1089
>gi|297740793|emb|CBI30975.3| unnamed protein product [Vitis vinifera]
Length = 1043
Score = 59.3 bits (142), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 100/442 (22%), Positives = 170/442 (38%), Gaps = 110/442 (24%)
Query: 130 VSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLET 189
VS + PF++ P L + +L I + +R +PL + + ++
Sbjct: 670 VSHMCPFNSAAFPDS-LAIAKEGDLTIGTIDDI----QKLHIRSIPLGEHARRICHQEQS 724
Query: 190 KTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTN 249
+T+ I S Y + + ED E+ FI L Q ++E I +
Sbjct: 725 RTFAIC------SLKYNQSSTEDSEM-------HFIRLLDDQ---------TFEFI--ST 760
Query: 250 FPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNY-SEDVTCRGRILLFDIIEVVPE 308
+PL +E+ + + S + + Y +GT Y E+ +GRIL+F I+E
Sbjct: 761 YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-IVE---- 810
Query: 309 PGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKI--YIWQLKDNDLTGIAFIDT 366
K+++I KE KG V ++ G L+ A+ QKI Y W L+D+ G + +
Sbjct: 811 ------DGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDD---GTRELQS 861
Query: 367 EV-----YIASMVSVK-NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYA 420
E +A V + + I+VGD +SI+LL Y+ E + ARDY ++
Sbjct: 862 ESGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAV---- 917
Query: 421 GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFM 480
+ILD+ +G ++ + N+
Sbjct: 918 ------------------------------------EILDDDIYLG---AENNFNIFTVR 938
Query: 481 YQPEARESNGGHRLIKKTDFHLGQHVNTF----FKIRCKPSSISDAPGARSRFLTWYASL 536
E RL ++HLG+ VN F +R S + P + ++
Sbjct: 939 KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIP------TVIFGTV 992
Query: 537 DGALGFFLPLPEKNYRRLLMLQ 558
+G +G LP Y L LQ
Sbjct: 993 NGVIGVIASLPHDQYVFLEKLQ 1014
>gi|170057515|ref|XP_001864517.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167876915|gb|EDS40298.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 1138
Score = 59.3 bits (142), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 114/573 (19%), Positives = 209/573 (36%), Gaps = 119/573 (20%)
Query: 66 FVSDRSK-RANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPM 124
FV D++ R +Q + G + + ++ F +++ VF C P ++ ++ H +
Sbjct: 615 FVVDKTTHRLTDQKKVTLGTQPTILKTFRSLS-TTNVFACSDRPTVIYSSN------HKL 667
Query: 125 TIDGPVSTLAPFHNVNCPR--GFLYFNAKS-------ELRISVLPTHLSYDAPWPVRKVP 175
F NVN NA+S + SV+ + +R VP
Sbjct: 668 V----------FSNVNLKEVNHMCSLNAESYQDSLALATKNSVILGTIDEIQKLHIRTVP 717
Query: 176 LKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPP--LVSQFH 233
L +P +AY ++T+ ++T + + +D +T R S + S +
Sbjct: 718 LGESPRRIAYQEASQTFGVIT---------VRMDIQDSSGLTPSRQSASTQTSNVTSSSN 768
Query: 234 VSLFSP------FSWEE-------IPQTNFPL---HEW---EHVLCLKNVSMEYEGTLSG 274
+ L P F E I Q F + H++ E+VL L + + +
Sbjct: 769 MGLLKPGASNTEFGQEVEVHNLLIIDQNTFEVLHAHQFMQTEYVLSLISAKLGNDPATY- 827
Query: 275 LRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAIC 334
YI N E GRI+++ + + + KE KG ++
Sbjct: 828 ---YIVGTAMVNPEEREPKVGRIIIYHYAD-----------GALTQVSEKEIKGACYSLV 873
Query: 335 HVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVK-NLILVGDYARSIALLR 393
G ++ + + +++ D+ + +A K + ILVGD RSI LL+
Sbjct: 874 EFNGRVLATINSTVRLYEWTDDKDLRLECSHFNNVLALYCKTKGDFILVGDLMRSITLLQ 933
Query: 394 YQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGS 453
Y+ + +ARDY+P W
Sbjct: 934 YKQMEGSFEEIARDYQPK----------------------WM------------------ 953
Query: 454 KHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIR 513
+ILD+ + +G ++ N+ + + A + ++ + FHLG VN F
Sbjct: 954 TAVEILDDDAFLG---AENSNNLFVCLKDSAATTDDERQQMPEVAQFHLGDMVNVFRHGS 1010
Query: 514 CKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNP 573
+I + S + + ++ GA+G +P Y L LQ + G ++
Sbjct: 1011 LVMQNIGERTTPTSGCV-LFGTVSGAIGLVTQIPPDYYEFLRKLQENLTNTIKSVGRIDH 1069
Query: 574 RAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLS 606
+R++ + S G IDG LV FL L+
Sbjct: 1070 TYWRSFHTE--MKTENSEGFIDGDLVESFLDLT 1100
>gi|357135348|ref|XP_003569272.1| PREDICTED: DNA damage-binding protein 1a-like [Brachypodium
distachyon]
Length = 1074
Score = 58.9 bits (141), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 86/359 (23%), Positives = 145/359 (40%), Gaps = 76/359 (21%)
Query: 278 YIALGTNYNYSEDVT-CRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
Y +GT Y ++ +GRIL+F + E K++++ +E KG V ++ +
Sbjct: 780 YYCVGTAYILPYEIEPTKGRILIFLVEE-----------RKLRLVAERETKGAVYSLNAL 828
Query: 337 AGFLVTAVGQKI--YIWQLKDN--DLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALL 392
G L+ AV QKI Y W +DN L V + + I+VGD RS++LL
Sbjct: 829 TGKLLAAVNQKIIVYKWVRRDNRHQLQSECSYRGCVLALHTQTHGHFIVVGDMVRSVSLL 888
Query: 393 RYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 452
RY+ E + +V RD+ N+K A ++D + IG
Sbjct: 889 RYKYEEGLIEVVTRDF-----NTKWITA----VAMLDDDIY-----------------IG 922
Query: 453 SKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 512
+ D ++ + S + V + + G ++ TD +GQ F
Sbjct: 923 A------DNCCNLFTLHSGRPGVVGEYHLGDLVNRMHHGSLVMHHTDSEIGQIPTVIF-- 974
Query: 513 RCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 572
+IS A G + F P Y L LQ+V+V G L+
Sbjct: 975 ----GTISGAIGVIASF-----------------PYDQYVFLEKLQSVLVKFIKSVGNLS 1013
Query: 573 PRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHND---ILDEL 628
+R++ A +R +DG L+ FL LS + E+ + +G + ++ I++EL
Sbjct: 1014 HVEWRSFYNVSRTA--EARNFVDGDLIESFLSLSPSKMEEVSQVMGLRADELCKIVEEL 1070
>gi|166158025|ref|NP_001107422.1| damage-specific DNA binding protein 1, 127kDa [Xenopus (Silurana)
tropicalis]
gi|157422734|gb|AAI53474.1| Zgc:63840 protein [Danio rerio]
gi|163916541|gb|AAI57552.1| LOC100135265 protein [Xenopus (Silurana) tropicalis]
Length = 306
Score = 58.9 bits (141), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 76/348 (21%), Positives = 131/348 (37%), Gaps = 72/348 (20%)
Query: 305 VVPEPGQP---------LTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQ--KIYIWQL 353
V PE +P T K++ + KE KG V ++ G L+ ++ ++Y W
Sbjct: 2 VCPEEAEPKQGRIIVFHYTDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRLYEWTA 61
Query: 354 KDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQP 413
+ T + + + + + ILVGD RS+ LL Y+P + +ARD+ P
Sbjct: 62 EKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAYKPMEGSFEEIARDFNPNWM 120
Query: 414 NSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKD 473
++ +ILD+ + +G +
Sbjct: 121 SAV----------------------------------------EILDDDNFLG-----AE 135
Query: 474 KNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFF--KIRCKPSSISDAPGARSRF 529
LF+ Q ++ + R L + FHLG+ VN F + + S P S
Sbjct: 136 NAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVFSHGSLVLQNLGESSTPTQGS-- 193
Query: 530 LTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNP 589
+ +++G +G L E Y LL LQN + G + +R++ +
Sbjct: 194 -VLFGTVNGMIGLVTSLSEGWYSLLLDLQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEQ 250
Query: 590 SRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 631
+ G IDG L+ FL L + E+ + G K +DE+ I
Sbjct: 251 ATGFIDGDLIESFLDLGQAKMQEVVSTLQIDDGSGMKREATVDEVIKI 298
>gi|402083318|gb|EJT78336.1| hypothetical protein GGTG_03437 [Gaeumannomyces graminis var. tritici
R3-111a-1]
Length = 1155
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 101/508 (19%), Positives = 194/508 (38%), Gaps = 98/508 (19%)
Query: 133 LAPFHNVNCPRGFLYFNAKSELRIS-VLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKT 191
+ PF P G L SEL+IS + P S+ V+ +P+ +AY T+
Sbjct: 719 VCPFDTAVFP-GSLAVATDSELKISKIDPQRQSH-----VQSLPMGENVRSIAYSAPTRV 772
Query: 192 Y---CI---VTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEI 245
+ CI ++ E ++ ++ E+V P L +PF
Sbjct: 773 FGLGCIRREISKGVEKASSTFRLV---DEVVLQP----------------LGNPFE---- 809
Query: 246 PQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVT----CRGRILLFD 301
L+E E V + + + T L +GT + E++ +GR+L+F
Sbjct: 810 ------LNEGEVVETV--IRAQLRDTFGRLAERFIVGTRFLVDENLVPGSNSKGRVLVFG 861
Query: 302 IIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKD-----N 356
V E P I + K + + +V A+ + + + + ++
Sbjct: 862 ----VDEERSPF------QIVSHPLKSGCRRLAVMEEMIVVALTKTVVVARYEELTSTSG 911
Query: 357 DLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSK 416
L +A T Y + LI VGD +S++L+ + P + VA D K +
Sbjct: 912 KLIKVASYQTTSYAIDVAVEGRLIAVGDIMKSMSLVEFVPP----TTVAGDGKAGETK-- 965
Query: 417 GYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNV 476
K QL +E+C+ S + + F ++ +D D NV
Sbjct: 966 ------------------KPAQL-----IEVCRHYQSSWSTAVAHFEGESWLEADADGNV 1002
Query: 477 VLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASL 536
++ R+ ++ +LG+++N KI S+ P A + ++
Sbjct: 1003 MVLGRNTTGVTLEDRRRMEITSEINLGENINRIQKI-----SVETGPNAPIHPKAFLSTT 1057
Query: 537 DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDG 596
+G++ + + LL LQ+ + + G + + FR+++ A P R IDG
Sbjct: 1058 EGSIYLVGAIAPQMRDLLLNLQDRLEDYVGTLGNIPFKNFRSFRNAEREADGPVR-FIDG 1116
Query: 597 SLVWKFLQLSLGERLEICKKIGSKHNDI 624
+ +FL ++ + ++C+ +G D+
Sbjct: 1117 EYIERFLDMNEETQSQVCRDLGPSVEDM 1144
>gi|281208174|gb|EFA82352.1| UV-damaged DNA binding protein1 [Polysphondylium pallidum PN500]
Length = 1054
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 85/368 (23%), Positives = 139/368 (37%), Gaps = 74/368 (20%)
Query: 278 YIALGTNY-NYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
Y+ +GT + N E +GRIL+F I ++I E P C +
Sbjct: 753 YVVVGTAFHNEVESQQSKGRILVFRI-------------EDNRLILLDEVALPACVYCLL 799
Query: 337 --AGFLVTAVGQKI--YIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALL 392
G L+ + +++ + W + N LT SMVS + +LV D +S+ LL
Sbjct: 800 PFNGRLLAGINKRVQAFNWGVDTNKLTKAESYSGHTLSHSMVSRGHFVLVADLMKSMTLL 859
Query: 393 RYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 452
+ + + +AR+ P +W R+E+
Sbjct: 860 -VEDQQGAIKELARNPLP----------------------IWL-------SRIEM----- 884
Query: 453 SKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 512
I DE F+ D N+++ EA L FHLG+ +N F
Sbjct: 885 -----IDDE----TFIGGDNSYNLIVVQKNAEASSEIDNELLDTVGQFHLGETINKF--- 932
Query: 513 RCKPSSISDAPGARSRFL--TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGG 570
K S+ +P S L + ++ GA+G + + + +Y LQ + GG
Sbjct: 933 --KHGSLVTSPDMDSPKLPTILFGTVSGAIGVIVSISKDDYEFFEKLQKGLNRVVHGVGG 990
Query: 571 LNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYD 630
L +R++ + + PS+ IDG L+ FL L + LE K + I D
Sbjct: 991 LPFENWRSFSTE--HMTIPSKNFIDGDLIETFLDLRHDKMLEAIKDMNIS---IEDTYRR 1045
Query: 631 IEALSSHF 638
IE+L H
Sbjct: 1046 IESLMHHI 1053
>gi|145348011|ref|XP_001418451.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144578680|gb|ABO96744.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 1196
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 110/554 (19%), Positives = 213/554 (38%), Gaps = 97/554 (17%)
Query: 110 WLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPW 169
WL + +G P++ P+ + F++ CP G + + ++ LRI+ + +
Sbjct: 711 WLGYSEKGTFVLAPISYV-PLEEVCSFNSEQCPEGVVAISNQT-LRIASIE---RLGENF 765
Query: 170 PVRKVPLKCTPHFLAYHLETKTYCIVTST-------------AEPSTDYYKFNGEDKELV 216
V L+ TP ++ + +TK ++ S A P+ + + N ED+E
Sbjct: 766 NQTTVKLRYTPRAMSANPDTKMVALIESDQCTVPVGEREGPEATPADEAPETNDEDEE-- 823
Query: 217 TDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFP----------LHEWEHVLCLKNVSM 266
+++ +P V QF SP +W + P LH+ E L L +V +
Sbjct: 824 ----EAKMLP--VEQFGAPKSSPGTWAACVRIVDPKEAKSTFVLELHKSEAALSLCHVFL 877
Query: 267 EYEGTLSGLRGYIALGTNYNYS-EDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKE 325
L +A+GT N + C G F + G+ L ++++
Sbjct: 878 TGPNEL-----LLAVGTAVNLTFAPRNCDGG---FIHLYRYGNDGRTL-----NLVHSTP 924
Query: 326 QKGPVTAICHVAGFLVTAVGQKIYIWQL-KDNDLTGIAFIDTEVYIASMVSVKNLILVGD 384
GPV A+C G L+ V + I+ K L + + +I ++ + + I VGD
Sbjct: 925 TDGPVGALCGYKGHLLAGVNNSLRIYDYGKKKLLRKVENRNFPNFITTLHAAGDRIYVGD 984
Query: 385 YARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGER 444
SI ++Y+ + ++ + A D KP I +L + L+ ++
Sbjct: 985 VQESIHYVKYKADEGSIYIFADDTKPR---------------YITATLPLDYDTLAGADK 1029
Query: 445 LE--ICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHL 502
++ ++ +D+ + G I + + P E++ ++
Sbjct: 1030 FGNIFVNRLPKDVSEDMDDDPTGGKNIYSQG----VLNGAPNKSETSA--------QTYI 1077
Query: 503 GQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLM-LQNVM 561
G+ V K +P I + Y + G +G LP ++ L+ M
Sbjct: 1078 GETVCALTKGALQPGGIE---------IIMYGTFMGGIGCLLPFSSRSEIEFFTHLEMHM 1128
Query: 562 VTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKH 621
G + AFR+ YYA P + +IDG L +F L + I +++
Sbjct: 1129 RQEAPSIVGRDHMAFRS-----YYA--PVKNVIDGDLCEQFGALPADVQRRIAEEMDRTP 1181
Query: 622 NDILDELYDIEALS 635
+IL +L + +++
Sbjct: 1182 GEILKKLEQVRSVA 1195
>gi|313238818|emb|CBY20011.1| unnamed protein product [Oikopleura dioica]
gi|313245836|emb|CBY34826.1| unnamed protein product [Oikopleura dioica]
Length = 1135
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 73/390 (18%), Positives = 151/390 (38%), Gaps = 71/390 (18%)
Query: 250 FPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEP 309
F + E +C+ + + E +I +GT E GRI +F +
Sbjct: 805 FDVGEISSCMCIAKLGKKDEQ-------FIVVGTAITADEQECKNGRICVFSYSK----- 852
Query: 310 GQPLTKNKIKMIYAKEQKGPVTAICHVAGF-LVTAVGQKIYIWQLKDND-LTGIAFIDTE 367
+ K+ ++ K+ G V ++ + G ++ A+ Q++ ++++ + L A I
Sbjct: 853 -----EEKLTLVSTKQVNGAVYSVKALNGNKIICAINQQLKVFEMNEQTTLQSEAPIANH 907
Query: 368 VY-IASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRG 426
+ +A VS IL D RSI++ Y+P L +ARDY P
Sbjct: 908 ITCVAVDVSKNGFILSADLMRSISVFSYKPLEGALEEIARDYHPN--------------- 952
Query: 427 IIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEAR 486
W + K I + ++ ++ +N+ + EA
Sbjct: 953 -------W----------MTAIKMIDDDN-----------YIGAENSENIFICTRNTEAP 984
Query: 487 ESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPL 546
+ +L+ +H+G+H+NT + ++ +R S+ G +G
Sbjct: 985 DEEDRQQLLPTGYYHVGEHINTIVEGNLVMDVHVESSITPTRTF-LMGSVSGYVGLLAIF 1043
Query: 547 PEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLS 606
PEK ++ L L+ M G ++ ++R ++ +G +DG L+ F L
Sbjct: 1044 PEKQWQFLSKLEAKMRKVIRGVGKIDHESWRRFESDSRM--EDCKGFVDGDLIEMFQDLR 1101
Query: 607 LGERLEICKKIG-----SKHNDILDELYDI 631
++ E+ ++ + H+D++ + D+
Sbjct: 1102 PEKQKEVISELTMDGEPATHDDVVRLVDDL 1131
>gi|405970039|gb|EKC34976.1| DNA damage-binding protein 1 [Crassostrea gigas]
Length = 1160
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 83/371 (22%), Positives = 142/371 (38%), Gaps = 74/371 (19%)
Query: 278 YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
Y +GT + E+ + GRI++F E K+ I KE KG +
Sbjct: 848 YYIVGTALVHPEEAEPKQGRIVIFHFHE-----------GKLNQIAEKEIKGAAYTLVEF 896
Query: 337 AGFLVTAVGQKIYIWQ-LKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQ 395
G L+ ++ + +++ D +L + + + + ILVGD RSI LL Y+
Sbjct: 897 NGKLLASINSTVRLFEWTTDKELRLECNYFNSIVALYLKTKGDFILVGDLMRSITLLLYK 956
Query: 396 PEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKH 455
P T +ARD P W +
Sbjct: 957 PMEGTFEEIARDCNPN----------------------W------------------TTA 976
Query: 456 NDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFF--K 511
+ILD+ + +G + + LF Q ++ + R L + FHLG+ VN F
Sbjct: 977 VEILDDDNFLG-----AENSFNLFTCQKDSASTTDEDRQNLQEVGMFHLGEFVNVFRHGS 1031
Query: 512 IRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGL 571
+ + S + P S Y +++GA+G +P++ Y L +Q+ + G +
Sbjct: 1032 LVMQHSGETSTPTQGS---VLYGTVNGAVGLVTQVPQEFYSFLQDIQSRLAKVIKSVGKI 1088
Query: 572 NPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHNDIL 625
+R++ + G IDG L+ FL L+ + E K + G K +
Sbjct: 1089 EHSFWRSFHTE--RKTEACEGFIDGDLIESFLDLNRDKMQETVKGLQIDDGSGMKREATV 1146
Query: 626 DELYD-IEALS 635
D+L IE L+
Sbjct: 1147 DDLVKTIEELT 1157
>gi|154286506|ref|XP_001544048.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
gi|150407689|gb|EDN03230.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
Length = 1158
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 80/355 (22%), Positives = 135/355 (38%), Gaps = 83/355 (23%)
Query: 281 LGTNY--NYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICH--- 335
+GT+Y ++ E + RGRIL F++ T N+ AK + PV C
Sbjct: 845 VGTSYLDDFGEG-SIRGRILAFEV-----------TANRQ---LAKVAEMPVKGACRALA 889
Query: 336 ------VAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSI 389
VA + T V I Q D L+ A T + NLI V D +S+
Sbjct: 890 IVQDKIVAALMKTVVVYTISKGQFADYTLSKTASYRTSTAPIDIAVTGNLIAVADLMKSV 949
Query: 390 ALLRYQPEYR----TLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERL 445
+++ YQ +L+ VAR ++ + + A + W
Sbjct: 950 SIVEYQQGSNGLPDSLTEVARHFQTLWSTAVAHVAED----------TW----------- 988
Query: 446 EICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQH 505
+ SD + N+V+ + RL ++ LG+
Sbjct: 989 ----------------------LESDAEGNLVMLHRNVNGVTDDDRRRLEVTSEILLGEM 1026
Query: 506 VNTFFKIRCKPSSISDAPGARSRF--LTWYASLDGALGFFLPLPEKNYRRLLM-LQNVMV 562
VN R +P +I + GA + + +++G++ + + Y+ LLM LQ+ M
Sbjct: 1027 VN-----RIRPVNIQGSQGAEAAISPRAFLGTVEGSI-YLFGIINPTYQDLLMRLQSAMA 1080
Query: 563 THTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI 617
GG+ FR ++ A P R +DG L+ +FL S+ + EI K+
Sbjct: 1081 GMVVTPGGMPFNKFRAFRNTIRQAEEPYR-FVDGELIERFLSCSVELQEEIVGKV 1134
>gi|169611218|ref|XP_001799027.1| hypothetical protein SNOG_08717 [Phaeosphaeria nodorum SN15]
gi|160702249|gb|EAT83885.2| hypothetical protein SNOG_08717 [Phaeosphaeria nodorum SN15]
Length = 1140
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 107/550 (19%), Positives = 209/550 (38%), Gaps = 98/550 (17%)
Query: 83 GVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCP 142
G R + R G VF HP+ ++ S G L +T + +T+ PF + P
Sbjct: 670 GTREATFRALPRGNGLFNVFATCEHPSLIY-ASEGRLVYSAVTAENA-TTVCPFDSEAYP 727
Query: 143 RGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPS 202
G + +LRI+++ T + V+ + + T +AY K + +
Sbjct: 728 -GSVAIATSDDLRIALVDTERTTH----VQTLKVDETVRRIAYSPGLKAFGL-------- 774
Query: 203 TDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLK 262
G K ++ + + H L ++E+ + L+E E V C+
Sbjct: 775 -------GTVKRILKAGEE-------IMLSHFKLVDEIQFKELD--TYALNEEELVECVM 818
Query: 263 NVSMEYEGTLSGLRGYIALGTNYNYSEDVTC-RGRILLFDIIEVVPEPGQPLTKNKIKMI 321
+ +G+ G +GT Y ++ T RGRIL I+EV PE +K++
Sbjct: 819 RCDL-ADGS-GGTAERFVIGTAYLDDQNSTVERGRIL---ILEVTPE-------RVLKLV 866
Query: 322 YAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVK---- 377
KG + G +V A+ + I ++ ++ + + + S +
Sbjct: 867 TEIAVKGGCRCLAMCEGKIVAALIKTIVVYDIEYRTQSKPDLVKAATFRCSTAPIDITVN 926
Query: 378 -NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKF 436
I + D +S+ ++ YQ RG
Sbjct: 927 GTQIAIADLMKSMVVVEYQ-----------------------------RG---------- 947
Query: 437 LQLSLGERL-EICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLI 495
+ L ++L E+ + + E ++ SD + N+++ P+ + RL
Sbjct: 948 -ETGLPDKLVEVARHFQVTWATAVAEVDENTYLESDAEGNLLVLYRDPKGVTDDDKRRLN 1006
Query: 496 KKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLL 555
++ LG+ VN R + ++ AP A + +++G++ + L +NY LL
Sbjct: 1007 VSSEMLLGEMVN-----RIRRIDVATAPDAVVVPRAFMGTVEGSI-YLFALISQNYLDLL 1060
Query: 556 M-LQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEIC 614
+ LQ+ + G ++ FR +K + P+R +DG L+ +FL + +
Sbjct: 1061 ITLQSNLGNLVVSPGNMDFAKFRAFKNQVRTEEEPNR-FVDGELIERFLDCEEDVQRKAI 1119
Query: 615 KKIGSKHNDI 624
+ +G + DI
Sbjct: 1120 EGLGVELEDI 1129
>gi|320593036|gb|EFX05445.1| uv-damaged DNA-binding protein [Grosmannia clavigera kw1407]
Length = 1504
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 59/321 (18%), Positives = 132/321 (41%), Gaps = 24/321 (7%)
Query: 320 MIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDN-----DLTGIAFIDTEVYIASMV 374
++ + +GP + V +V + + + + + + +L +A T Y+ +
Sbjct: 1201 IVSSHRVRGPCRCLAMVDDLIVAGLSKTVVLSRYTETSSMSGELKKVASYRTATYVVDLA 1260
Query: 375 SVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGY-YAGNPSRGIIDGSLV 433
++I VGD +S AL+ Y P + + N KG + S+ I +G
Sbjct: 1261 VDGHMIAVGDMMKSTALVEYIPAT-SGDGEDEEDDGAGDNKKGKGKTADRSKTIAEGP-- 1317
Query: 434 WKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR 493
K ++ + G + + D+ E G N+ + + ++ R
Sbjct: 1318 -KLVERARGYQASWATAVCHVEGDLWLEADGFG--------NLTMLERDVQGVTADDKRR 1368
Query: 494 LIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRR 553
L + +LG+ VN R +P ++ +PGA + A+++G++ + +
Sbjct: 1369 LRTVGEMYLGEMVN-----RIRPIAVETSPGAMVHPRAFLATVEGSIYMVGTIAPEAQDL 1423
Query: 554 LLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEI 613
L+ LQ + G + A+R+++ + P R +DG L+ +FL + + E+
Sbjct: 1424 LMNLQTKLAAIVKGPGNTSFSAYRSFRNAERESTEPFR-FVDGELLERFLDVGEDVQKEV 1482
Query: 614 CKKIGSKHNDILDELYDIEAL 634
+ +G D+ + + +++ L
Sbjct: 1483 AQGLGPSVEDLRNIIEELKRL 1503
>gi|156097003|ref|XP_001614535.1| hypothetical protein [Plasmodium vivax Sal-1]
gi|148803409|gb|EDL44808.1| hypothetical protein, conserved [Plasmodium vivax]
Length = 2558
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 129/600 (21%), Positives = 221/600 (36%), Gaps = 129/600 (21%)
Query: 101 VFLCGPHPAWLFLTSRGELRAHPMTIDG--------PVSTLAPFHNV------NCPRGFL 146
+F+C P ++ T + +L ++I + L PFHN N +
Sbjct: 2018 LFVCCDSPIIIYSTLKKKLSISKLSIRNVHLVDMFSDFNYLNPFHNFLLFKKKNQNNSYF 2077
Query: 147 YFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYY 206
F ++L IS +L+ + ++P T +AYH +T + TA P + +
Sbjct: 2078 IFFDGNQLCIS----YLNEMKKTFMERIPFHRTVEKIAYHADTG----LLITACPVEEKH 2129
Query: 207 KFNGEDKELVT--DPRDSRFIPPLV--SQFHVSLFSPF---------SWEEIPQTNF--P 251
K N K++V DP + F + S+F VS + S E+ QT+
Sbjct: 2130 KTNQMMKQIVCFFDPFQNSFKYTYIIPSKFSVSSICIYELAPSSGGASMGEMEQTSQMGQ 2189
Query: 252 LHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVT--CRGRILLFDIIEVVPEP 309
+ + LK E +R I +GT N +E +T G I +F
Sbjct: 2190 MEQTNQTNELKPSHPEERTDAPPVRTLICVGT-ANNNERITEPSSGHIYVF-------VA 2241
Query: 310 GQPLTKNKIKMIYAKEQK-GPVTAICHVAGFLVTAVGQKI--------------YIW--- 351
+ + +IK +Y G +T + +V AV + YI+
Sbjct: 2242 KKQTNQFEIKHVYTYNVSCGGITHLKQFRDKIVAAVNNTVVILDIGNFLANLGAYIYNSS 2301
Query: 352 ---QLKDND-LTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARD 407
+++ ND +A +I S+ V+N I+VGD S+ LL Y E L+ V RD
Sbjct: 2302 KAIKIESNDAFLEVASFTPSSWIMSLDVVENYIVVGDIMTSVTLLSYDFENAILNEVCRD 2361
Query: 408 YKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGF 467
Y + +W C + + S F
Sbjct: 2362 Y----------------------ANIW-------------CTSVSA--------LSENHF 2378
Query: 468 MISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARS 527
++SD + N ++ +L + F+ G VN F + ++ D R+
Sbjct: 2379 LVSDMESNFLVLQKSNIKFNDEESFKLSLVSQFNHGSVVNKMFSTSLR--NLVDDEERRN 2436
Query: 528 RFLT-----WYASLDGALGFFLPLPE-KNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKG 581
L AS +G++ +P ++R L ++ + + S G L+ ++R YK
Sbjct: 2437 EILQKEQSILCASSEGSISALIPFSNFLQFKRALCIEIAINDNISSLGNLSHSSYREYKV 2496
Query: 582 KGYYAGNPSRGIIDGSLVWKFLQLSLGERLE-------ICKKIGSKHNDILDELYDIEAL 634
A +G++DG L F L +L+ I KK+ K + DIE L
Sbjct: 2497 S--LASKNCKGVVDGELFKMFFYLPFERQLKTYIYAKWIAKKLNCKLGSFEHFMLDIENL 2554
>gi|330792580|ref|XP_003284366.1| hypothetical protein DICPUDRAFT_86223 [Dictyostelium purpureum]
gi|325085712|gb|EGC39114.1| hypothetical protein DICPUDRAFT_86223 [Dictyostelium purpureum]
Length = 1064
Score = 57.0 bits (136), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 100/456 (21%), Positives = 178/456 (39%), Gaps = 87/456 (19%)
Query: 186 HLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEI 245
HLE + C T + + D NGE+ + + + VS +V LF ++E
Sbjct: 684 HLEEYS-CYAVITIKTNEDIISGNGENATTIDEVEEE------VS--YVRLFDDQTFE-- 732
Query: 246 PQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEV 305
P ++F L +E L + + + Y+A+GT+ N D GR+LLF+I E
Sbjct: 733 PLSSFRLEHYEMGWSLTSTKFDDDPCT-----YLAVGTSINIP-DRQTSGRVLLFNINEA 786
Query: 306 VPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFID 365
K+ ++ + V + G L+ AV +++Y + + I
Sbjct: 787 ----------KKLVLLEEISFRSGVLYLHQFNGRLIAAVLKRLYSIRYSYSKEKNCKVIS 836
Query: 366 TE------VYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYY 419
+E I + S + +LVGD +S++LL Q E +L +A++ +P
Sbjct: 837 SENVHKGHTMILKLASRGHFMLVGDMMKSMSLLG-QSENGSLVQIAKNPQP--------- 886
Query: 420 AGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLF 479
+W + I ++D F+ S+ N V+
Sbjct: 887 -------------IW-------------IRSIAMINDDY--------FIGSETSNNFVVV 912
Query: 480 MYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGA 539
++ L +H+G+ +N+ SDAP + YAS++G+
Sbjct: 913 KKNNDSTNELERELLDSVGHYHIGESINSMLCGSLVRLPDSDAPPIPT---ILYASVNGS 969
Query: 540 LGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLV 599
+G + +++Y LQ + + GG ++R + + SR IDG L+
Sbjct: 970 IGVIASISKEDYEFFSKLQKGLNRVVNGIGGFTHESWRAFSNDHHTV--ESRNFIDGDLI 1027
Query: 600 WKFLQLSLGERLEICKKIGSKHNDILDE-LYDIEAL 634
F L ++E K+ N LDE L IE+L
Sbjct: 1028 EMFPDL----KIESMAKVIQDMNVTLDETLKRIESL 1059
>gi|241260143|ref|XP_002404926.1| DNA repair protein xp-E, putative [Ixodes scapularis]
gi|215496735|gb|EEC06375.1| DNA repair protein xp-E, putative [Ixodes scapularis]
Length = 1148
Score = 57.0 bits (136), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 79/361 (21%), Positives = 143/361 (39%), Gaps = 79/361 (21%)
Query: 294 RGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQL 353
+GRI++F ++ K++ + KE KG ++ G L+ ++ + +++
Sbjct: 845 QGRIIIFHWVD-----------GKLQQVAEKEIKGAPYSLLEFNGKLLASINSTVRLFEW 893
Query: 354 K-DNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQ 412
+ +L + + + + ILVGD RS++LL Y+P + +ARDY+
Sbjct: 894 NAERELHNECSHFNNILALYLKTKGDFILVGDLMRSMSLLAYKPLEGSFEEIARDYQTN- 952
Query: 413 PNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDK 472
W +C +ILD+ + +G
Sbjct: 953 ---------------------W------------MCAV------EILDDDTFLG-----A 968
Query: 473 DKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARS--- 527
+ LF+ Q ++ + R L + FHLG+ VN F S + PG S
Sbjct: 969 ESTTNLFVCQKDSAATTDEDRQHLQEVGQFHLGEFVNIFR----HGSLVMQHPGEASSPT 1024
Query: 528 RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAF---RTYKGKGY 584
+ + ++ GA+G LP Y LL +Q + G ++ + R + + +
Sbjct: 1025 QGSVLFGTIHGAIGLVAQLPSDFYNFLLEVQGNLTKVIKSVGKIDHTLYPFVRLFTWRSF 1084
Query: 585 YA---GNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYD-IEAL 634
++G IDG L+ FL LS + E+ + I G K + +D+L IE L
Sbjct: 1085 STERKTEQAQGFIDGDLIESFLDLSRDKMQEVLQGIQMDDGSGMKRDATVDDLIKIIEEL 1144
Query: 635 S 635
S
Sbjct: 1145 S 1145
>gi|453081643|gb|EMF09692.1| DNA damage-binding protein 1 [Mycosphaerella populorum SO2202]
Length = 1151
Score = 56.6 bits (135), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 114/547 (20%), Positives = 204/547 (37%), Gaps = 123/547 (22%)
Query: 97 GYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRI 156
G Q VF HP+ ++ S G + +T D S A F + + SEL++
Sbjct: 684 GLQNVFATCEHPSLIY-GSEGRMVYSAVTADSATSICA-FDSFGDYANSIAIATGSELKL 741
Query: 157 SVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTY---CI---VTSTAEPSTDYYKFNG 210
S + + V+ +P+ T +AY E K + CI + + E ++K
Sbjct: 742 SSVDEERTTH----VQDLPVYETVRRIAYSSELKAFGLGCIKRTLAAGVEEVRSHFKLVD 797
Query: 211 EDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEG 270
E V+ + SW L+E E V + ++
Sbjct: 798 E----------------------VAFKALDSW--------ALNEDELVESVIRCPLDDGT 827
Query: 271 TLSGLRGYIALGTNYNYSEDV-TCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGP 329
L R +GT Y +D T RGR+L+F++ E +IK++ KG
Sbjct: 828 GLDAER--FVVGTAYLDDQDANTARGRVLVFEVTE----------DRRIKLVTEMAVKGA 875
Query: 330 VTAICHVAGFLVTAVGQKIYIW-----------QL--KDNDLTGIAFIDTEVYIASMVSV 376
+ G +V A+ + + I QL K + T A ID ++ +S+
Sbjct: 876 CRCLAVCKGRIVAALVKTVVILAYEFSPPKSSPQLIKKASYRTSTAPID--IFASSL--- 930
Query: 377 KNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKF 436
LI + D +S+ L++Y P + QP+S
Sbjct: 931 DGLIAISDLMKSLTLVKYTPG-----------RTGQPDS--------------------- 958
Query: 437 LQLSLGERLEICKKIGSKHNDILDEF-SSMGFMISDKDKNVVLFMYQPEARESNGGHRLI 495
+EI + + + + ++ SD + N+V+ + P + RL
Sbjct: 959 -------LVEIARHFDTLWGTAVAPIPGTHSYIQSDAEGNLVVLEHDPTGFSAEDRRRLR 1011
Query: 496 KKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFL--TWYASLDGALGFFLPLPEKNYRR 553
++ LG+ VN R +P + P A + + + A+++G++ F + ++
Sbjct: 1012 VTSEMCLGEMVN-----RIRPITTVITPSANAVVIPKAFIATVEGSVYVFGTIAQQYQDL 1066
Query: 554 LLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGN--PSRGIIDGSLVWKFLQLSLGERL 611
L+ LQ M G + FR +K + G P R +DG ++ FL LS +
Sbjct: 1067 LIRLQGSMAEMVKSPGFVRFNRFRGFKTQVRDMGEEGPVR-FVDGEIIEGFLGLSAEVQE 1125
Query: 612 EICKKIG 618
+ K +G
Sbjct: 1126 SVAKDLG 1132
>gi|393217872|gb|EJD03361.1| hypothetical protein FOMMEDRAFT_108572 [Fomitiporia mediterranea
MF3/22]
Length = 1213
Score = 56.2 bits (134), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 69/327 (21%), Positives = 136/327 (41%), Gaps = 51/327 (15%)
Query: 318 IKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVK 377
+++++ E A+ G L VG+ + I+++ L + ++T+ Y +++V++
Sbjct: 932 LELVHKTEADDVPMALMAFQGRLCAGVGKSLRIYEIGKKKL--LRKVETKTYGSAIVTLN 989
Query: 378 ---NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVW 434
+ I+VGD SI ++P L + A D +P S +++
Sbjct: 990 TQGSRIIVGDMQESIVYAVFKPPENRLLIFADDSQPRWTTS---------------AVMV 1034
Query: 435 KFLQLSLGERLE--ICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGH 492
+ ++ G++ ++ SK +D +DE + ++ +K L M P H
Sbjct: 1035 DYTTIAAGDKFGNVFINRLDSKISDQVDEDPTGAGILHEKG----LLMGAP--------H 1082
Query: 493 RLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEK-NY 551
+ FH+G V + K IS G R L Y L G +G +P K +
Sbjct: 1083 KTGMIAHFHVGDIVTSIHK-------ISLVAGGREVLL--YTCLHGTIGILVPFVSKEDV 1133
Query: 552 RRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERL 611
+ L+ M + G + A+R GYY P + ++DG L +F +L ++
Sbjct: 1134 DFISTLEQHMRSEKLSLVGRDHLAWR-----GYYV--PVKAVVDGDLCEQFARLPANKQS 1186
Query: 612 EICKKIGSKHNDILDELYDIEALSSHF 638
I ++ ++L +L + +S F
Sbjct: 1187 AIAVELDRTVGEVLKKLEQLRVTASGF 1213
>gi|390366809|ref|XP_780126.3| PREDICTED: DNA damage-binding protein 1-like isoform 1
[Strongylocentrotus purpuratus]
Length = 630
Score = 56.2 bits (134), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 80/342 (23%), Positives = 136/342 (39%), Gaps = 64/342 (18%)
Query: 304 EVVPEPGQPL----TKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLT 359
E P+ G+ + + K++ I KE KG ++ G L+ +V + +++
Sbjct: 331 EAEPKSGRIVVFQYSDGKLQEIAEKEIKGAPYSLVEFNGKLLASVNSVVRLFEWTPEHSL 390
Query: 360 GIAFIDTEVYIASMVSVK-NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGY 418
+ +A + K + I+VGD RSI LL Y+P L +ARDY P
Sbjct: 391 RVECSHYNNVLALYLKTKGDFIVVGDLMRSITLLAYKPMEGCLEEIARDYSPN------- 443
Query: 419 YAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVL 478
W +S E ILD+ + +G ++ N L
Sbjct: 444 ---------------W----MSAVE--------------ILDDDTFLG---AENSSN--L 465
Query: 479 FMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSSISDA--PGARSRFLTWYA 534
F Q ++ + R L + FHLG+ VN F +I ++ P S +
Sbjct: 466 FTCQKDSAATTDEERRHLQEVGLFHLGEFVNVFRHGSLVMQNIGESTIPTTGS---VLFG 522
Query: 535 SLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGII 594
++ G++G L E+ YR LL +QN + G + +R++ + P I
Sbjct: 523 TVSGSVGLVTQLNEEFYRFLLEVQNKLTKVIKSVGKIKHSFWRSFYSE--RKTEPMDNFI 580
Query: 595 DGSLVWKFLQLSLGERLEICKKI-----GSKHNDILDELYDI 631
DG L+ FL LS E+ + + G K + + ++L I
Sbjct: 581 DGDLLESFLDLSRDTMDEVAQGLQIDDGGMKRDCMANDLIKI 622
>gi|449549048|gb|EMD40014.1| hypothetical protein CERSUDRAFT_63520 [Ceriporiopsis subvermispora B]
Length = 1265
Score = 55.5 bits (132), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 45/199 (22%), Positives = 91/199 (45%), Gaps = 11/199 (5%)
Query: 285 YNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAV 344
+ E GR+LLF I + +++++ ++ KG V I V F+ A+
Sbjct: 959 FEVEETEPTSGRLLLFAIGS---DGATSSADGELRLVTTQDVKGCVFQITSVNNFIAAAI 1015
Query: 345 GQKIYIWQLKDND----LTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRT 400
+ ++ L+D + L +A + ++ ++ S + ++VGD S++LLR
Sbjct: 1016 NSNVVLFALRDTNKQYALQQVADWNHNYFVTNLASHGDRLIVGDAISSVSLLRVS--VAR 1073
Query: 401 LSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKH-NDIL 459
+ ++RDY P P + A N G ++ F + R ++ ++ GS H +DI+
Sbjct: 1074 IECLSRDYSPLWPVAVEATAENQIIGANSDCNLFSFALQHIDGR-KVLERDGSYHLDDIV 1132
Query: 460 DEFSSMGFMISDKDKNVVL 478
++F+ G + +D L
Sbjct: 1133 NKFAPGGLVAADSSTGYTL 1151
>gi|303313681|ref|XP_003066852.1| CPSF A subunit region family protein [Coccidioides posadasii C735
delta SOWgp]
gi|240106514|gb|EER24707.1| CPSF A subunit region family protein [Coccidioides posadasii C735
delta SOWgp]
gi|320031496|gb|EFW13458.1| UV-damaged DNA binding protein [Coccidioides posadasii str. Silveira]
Length = 1144
Score = 55.5 bits (132), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 77/373 (20%), Positives = 140/373 (37%), Gaps = 82/373 (21%)
Query: 250 FPLHEWEHVLCLKNVSMEYEGTLSGL-----RGYIALGTNY-NYSEDVTCR--GRILLFD 301
F L+ E V C+ + E+ G+ + + R +GT+ + E+ R GRIL+FD
Sbjct: 799 FDLNPNELVECV--IRTEHPGSNAQMGSSRPRDIFIVGTSVLDTPEEAEARTKGRILIFD 856
Query: 302 IIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGI 361
+ T +++ I +G A+ + +V A+ + + + +K +L
Sbjct: 857 VD----------TNRELRKICDFPVRGACRALAMINNKIVAALMKTVVVLNIKKGNLYNF 906
Query: 362 AFIDTEVYIASMVSVK-----NLILVGDYARSIALLRY------QPEYRTLSLVARDYKP 410
Y S V N+I V D +SI+L+ Y QP+ TL VAR Y+
Sbjct: 907 EIEKEASYRTSTAPVDISVTGNIIAVADLMKSISLVEYHAGEGGQPD--TLKEVARHYQT 964
Query: 411 TQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMIS 470
+ A N F+++
Sbjct: 965 LWTTAAAPVAENE-------------------------------------------FLVA 981
Query: 471 DKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFL 530
D + N+V+ + R+ ++ LG+ VN R P + +P +
Sbjct: 982 DAEGNLVVLNRNTTGVTEDDRRRMQVTSELRLGEMVN-----RIHPMDLQTSPESPVIPK 1036
Query: 531 TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPS 590
+ A++DG++ F + L+ LQ+ + + G + +R +K A P
Sbjct: 1037 AFLATVDGSIYLFGLISPSAQDTLMRLQSALADFVASPGEIPFNKYRAFKSSVRQAEEPF 1096
Query: 591 RGIIDGSLVWKFL 603
R +DG L+ +FL
Sbjct: 1097 R-FVDGELIEQFL 1108
>gi|157128864|ref|XP_001655231.1| DNA repair protein xp-e [Aedes aegypti]
gi|108882186|gb|EAT46411.1| AAEL002407-PB [Aedes aegypti]
Length = 1138
Score = 55.5 bits (132), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 118/570 (20%), Positives = 204/570 (35%), Gaps = 113/570 (19%)
Query: 66 FVSDR-SKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPM 124
FV D+ + R +Q + G + + ++ F +++ VF C P ++ ++ H +
Sbjct: 615 FVLDKNTNRLTDQKKVTLGTQPTILKTFRSLS-TTNVFACSDRPTVIYSSN------HKL 667
Query: 125 TIDGPVSTLAPFHNVNCPR--GFLYFNAKS-------ELRISVLPTHLSYDAPWPVRKVP 175
F NVN NA++ + SV+ + +R VP
Sbjct: 668 V----------FSNVNLKEVNHMCSLNAEAYQDSLALATKNSVILGTIDEIQKLHIRTVP 717
Query: 176 LKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPL-----VS 230
L +P +AY ++T+ ++T + TD +DS + P
Sbjct: 718 LGESPRRIAYQEASQTFGVIT------------------VRTDIQDSSGLTPSRQSASTQ 759
Query: 231 QFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSED 290
+V+L + + +N E+ + + N+ + + T L + + T Y S
Sbjct: 760 TTNVTLSTNMGLLKAGASN---AEFGQEVEVHNLLIIDQNTFEVLHAHQFMQTEYAMSLI 816
Query: 291 VTCRGR----ILLFDIIEVVPEPGQPLTKNKIKMIYA---------KEQKGPVTAICHVA 337
G + V PE +P I YA KE KG ++
Sbjct: 817 SAKLGNDPNTYYIVGTALVNPEEPEPKVGRIIIYHYADGNLTQVSEKEIKGSCYSLVEFN 876
Query: 338 GFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVK-NLILVGDYARSIALLRYQP 396
G ++ ++ + +++ D+ + +A K + ILVGD RSI LL+Y+
Sbjct: 877 GRVLASINSTVRLYEWTDDKDLRLECSHFNNVLALYCKTKGDFILVGDLMRSITLLQYKQ 936
Query: 397 EYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHN 456
+ +ARDY+P + I+D FL L +C K G+
Sbjct: 937 MEGSFEEIARDYQPNWMTAVE---------ILDDD---AFLGADNSNNLFVCLKDGAATT 984
Query: 457 DILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKP 516
D DE M PE + HLG VN F
Sbjct: 985 D--DERQQM-----------------PEVAQ------------VHLGDMVNVFRHGSLVM 1013
Query: 517 SSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAF 576
+I + S + + ++ GA+G +P Y L LQ + G ++ +
Sbjct: 1014 ENIGERTTPTSGCV-LFGTVSGAIGLVTQIPADYYEFLRKLQENLTDTIKSVGKIDHAYW 1072
Query: 577 RTYKGKGYYAGNPSRGIIDGSLVWKFLQLS 606
R++ + G IDG LV FL LS
Sbjct: 1073 RSFHTE--MKTERCEGFIDGDLVESFLDLS 1100
>gi|300122534|emb|CBK23104.2| unnamed protein product [Blastocystis hominis]
Length = 172
Score = 55.5 bits (132), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 34/141 (24%), Positives = 65/141 (46%), Gaps = 10/141 (7%)
Query: 494 LIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRR 553
++++ DFHL + + P S+ D + + + +GA+G FL + + Y +
Sbjct: 28 VVRQADFHLASQITSIL-----PISLPDG-----QCINVILTAEGAMGVFLFVTGEEYTK 77
Query: 554 LLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEI 613
L LQ ++ LN FR Y G +G++D ++ K+L LS E+ +I
Sbjct: 78 LSSLQKRLIEALPQNAALNNFNFRKYMSDGMMKYPRRKGVLDMGVIRKYLMLSTQEQEDI 137
Query: 614 CKKIGSKHNDILDELYDIEAL 634
K + + +I + +Y + L
Sbjct: 138 AKSLDLETKEITEVIYRTDKL 158
>gi|449540702|gb|EMD31691.1| hypothetical protein CERSUDRAFT_109269 [Ceriporiopsis subvermispora
B]
Length = 1265
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 48/204 (23%), Positives = 96/204 (47%), Gaps = 12/204 (5%)
Query: 281 LGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGF 339
+GT + E+ R GR+LLF I + +++++ ++ KG V I V F
Sbjct: 954 VGTAFLEVEETEPRSGRLLLFAIGS---DGATSSADGELRLVATQDVKGCVFQITSVNSF 1010
Query: 340 LVTAVGQKIYIWQLKDND----LTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQ 395
+ A+ + ++ L++ + L +A + ++ ++ S +L++VGD S++LLR
Sbjct: 1011 IAAAISSNVVLFALRNTNKQYALQQVADWNHNYFVTNLASHGDLLIVGDAISSVSLLRVS 1070
Query: 396 PEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKH 455
+ ++RDY P +P + A N G ++ F + R ++ ++ GS H
Sbjct: 1071 DSR--IECLSRDYGPLRPVAVEATAENQIIGANSYCNLFSFALQHIDGR-KVLERDGSYH 1127
Query: 456 -NDILDEFSSMGFMISDKDKNVVL 478
+DI+ +F G + +D L
Sbjct: 1128 LDDIVKKFVPGGLVAADSSTGYTL 1151
>gi|429961863|gb|ELA41407.1| hypothetical protein VICG_01512 [Vittaforma corneae ATCC 50505]
Length = 1153
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 73/364 (20%), Positives = 145/364 (39%), Gaps = 64/364 (17%)
Query: 275 LRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAIC 334
++ + T++ ED +G+++++ ++ +VP+P K+K+I ++ K P
Sbjct: 835 FNNFLVVCTSFPEGEDKMTKGKLIVYSLVNIVPDPDNLHITKKLKLICSETLKNPCLFCE 894
Query: 335 HVAGFLVTAVGQKIYIWQLKDNDLTGIAFI---DTEVYIASMVSVKNLILVGDYARSIAL 391
V + VG K+ I++ +N TG+A + + + S+ KNLI V D I
Sbjct: 895 EVRSLISVCVGTKLMIYEFNEN--TGLAAVGRHELSLLCTSLFVTKNLIAVSDIMNGIYF 952
Query: 392 LRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDG-----SLVWKFLQLSLGERLE 446
+P RD P + + G P+ + G S LQ S+ +
Sbjct: 953 FFLRP---------RD--PLKLHLLGRSCLVPNCRFLGGIDFCPSFETDALQFSI---VS 998
Query: 447 ICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHV 506
+CK G V +F Y P S G++L+K+ + + +
Sbjct: 999 VCK---------------YGI--------VRIFTYSPYDPVSKNGNQLVKRAEI-VTKLA 1034
Query: 507 NTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTS 566
N +K+ G + F S+ + + L N+ +L +Q+ + S
Sbjct: 1035 NPLYKV---------VFGQINEF----ESILLSSNVMVLLRAINFPKLQAIQHCISIFIS 1081
Query: 567 HTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILD 626
+ G+N R Y + + +I ++ +F + +ICK +G + +I++
Sbjct: 1082 NRCGIN---VRNYLETEEFVNPECKSVICEKILLEFFYFKPLVQEKICKLVGLDYFNIVE 1138
Query: 627 ELYD 630
+ D
Sbjct: 1139 LIED 1142
>gi|195500686|ref|XP_002097479.1| GE26244 [Drosophila yakuba]
gi|194183580|gb|EDW97191.1| GE26244 [Drosophila yakuba]
Length = 1140
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 71/318 (22%), Positives = 119/318 (37%), Gaps = 68/318 (21%)
Query: 305 VVPEPGQPLT---------KNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQ--KIYIWQL 353
V+PE +P +NK+ + + G A+ G ++ +G ++Y W
Sbjct: 837 VIPEEPEPKVGRIIIFHYHENKLTQVAETKVDGTCYALVEFNGKVLAGIGSFVRLYEWT- 895
Query: 354 KDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQP 413
+ +L I + + + + ILVGD RSI LL+++ +ARD +P
Sbjct: 896 NEKELRMECNIQNMIAALYLKAKGDFILVGDLMRSITLLQHKQMEGIFVEIARDCEPK-- 953
Query: 414 NSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKD 473
W + +ILD+ + +G +
Sbjct: 954 --------------------WM------------------RAVEILDDDTFLG-----SE 970
Query: 474 KNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFL- 530
N LF+ Q ++ + R L + FHLG VN F S + G R+ +
Sbjct: 971 TNGNLFVCQKDSAATTDEERQLLPELARFHLGDTVNVFRH----GSLVMQNVGERTTPIN 1026
Query: 531 --TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGN 588
Y + +GA+G +P+ Y L LQ + G + +R ++
Sbjct: 1027 GCVLYGTCNGAIGIVTQIPQDFYDFLHGLQERLKKIIKSVGKIEHTYYRNFQINNKV--E 1084
Query: 589 PSRGIIDGSLVWKFLQLS 606
PS G IDG L+ FL LS
Sbjct: 1085 PSEGFIDGDLIESFLDLS 1102
>gi|290998415|ref|XP_002681776.1| damage-specific DNA binding protein 1 [Naegleria gruberi]
gi|284095401|gb|EFC49032.1| damage-specific DNA binding protein 1 [Naegleria gruberi]
Length = 1103
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 78/379 (20%), Positives = 143/379 (37%), Gaps = 81/379 (21%)
Query: 278 YIALGTNYNY-SEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
Y +GT E+ +GRIL+ + +K+ + K+ KG V +
Sbjct: 781 YFIVGTAITEGDEEEPSKGRILVLQV-----------QDDKLVLKAEKDVKGAVMVLHSF 829
Query: 337 AGFLVTAVGQKIYI--WQLKDN----DLTGIAFIDTEVYIASMVSVKNLILVGDYARSIA 390
G L+ V ++ + W D+ DL +YI + S + IL+GD +S+
Sbjct: 830 NGKLLAGVSGRLMLFKWAESDDGDNKDLVQECSCSGGIYILDIDSHGDFILIGDMMKSVH 889
Query: 391 LLRYQ-PEYR----TLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERL 445
L Y+ PE + L L+++DY+ + W L L E
Sbjct: 890 LFVYENPEEQHVSGNLRLISKDYQYS----------------------WLSCSLMLNES- 926
Query: 446 EICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQH 505
++ D+ N++ EA +L++ ++
Sbjct: 927 --------------------EYVAVDQQGNMITLKKNDEAASEEERKQLVRVGKYYCSDR 966
Query: 506 VNT----FFKIRCKPSS--ISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQN 559
VN F +R SS I+ P + F ++ G +G LP + + + +Q
Sbjct: 967 VNRIQPGFIGMRFANSSSDINTQPVKTALF----GTISGGIGVLAQLPPETFAFVTKIQK 1022
Query: 560 VMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGS 619
M + + ++ +R Y+ + S G IDG V FL+ + + +++ +
Sbjct: 1023 AMSSVVTGLANISRETYRQYRSE--RTREDSVGFIDGDFVESFLEFDFETQQRVIEELSN 1080
Query: 620 KHND--ILDELY-DIEALS 635
H + L+EL +IE LS
Sbjct: 1081 NHQEQITLEELVKNIEDLS 1099
>gi|156049323|ref|XP_001590628.1| hypothetical protein SS1G_08368 [Sclerotinia sclerotiorum 1980]
gi|154692767|gb|EDN92505.1| hypothetical protein SS1G_08368 [Sclerotinia sclerotiorum 1980 UF-70]
Length = 1153
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 75/364 (20%), Positives = 138/364 (37%), Gaps = 76/364 (20%)
Query: 281 LGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQ-KGPVTAICHVAGF 339
+GT++ + +V RGR+L+F + ++ I A KG I + G
Sbjct: 835 VGTSFLHDGEVNIRGRLLIFGV-----------NSDRTPYIIASHTLKGSCRCIGVLNGK 883
Query: 340 LVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVK-----NLILVGDYARSIALLRY 394
+V A+ + + ++ ++ T Y + + N+I V D +S+AL+ Y
Sbjct: 884 IVAALNKTVVMYDYEETSRTTANLRKVATYRCATCPIDIDIRGNIIAVADIMKSVALVEY 943
Query: 395 QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
P D P + G +A + S+ E
Sbjct: 944 TP--------GVDGLPDKLEEVGRHA-------------QQVFATSIAE----------- 971
Query: 455 HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC 514
+ ++ SD D N+++ E RL + +LG+ VN +I
Sbjct: 972 -------VDTDTYLESDHDGNLIVLKRNREGVTREDKLRLEVLCEMNLGEMVNKIKRINV 1024
Query: 515 KPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVM-------VTHTSH 567
+ S DA F+ A+ +G++ F +P +N L+ LQ+ + +T +S
Sbjct: 1025 ETSK--DALLIPRAFV---ATTEGSIYLFSLIPPQNQDLLMRLQSRLASLPARSLTDSSF 1079
Query: 568 T-------GGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 620
+ G L+ +R+Y P R +DG L+ +FL L + IC +G +
Sbjct: 1080 SAPIEFSPGNLDFDKYRSYVSAVRETNEPFR-FVDGELIERFLDLDGAIQENICDGLGVR 1138
Query: 621 HNDI 624
D+
Sbjct: 1139 AEDL 1142
>gi|392864500|gb|EAS34654.2| UV-damaged DNA binding protein [Coccidioides immitis RS]
Length = 1144
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 65/321 (20%), Positives = 118/321 (36%), Gaps = 72/321 (22%)
Query: 294 RGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQL 353
+GRIL+FD+ T +++ I +G A+ + +V A+ + + + +
Sbjct: 849 KGRILVFDVD----------TNRELRKICDFPVRGACRALAMINNKIVAALMKTVVVLNI 898
Query: 354 KDNDLTGIAFIDTEVYIASMVSVK-----NLILVGDYARSIALLRY------QPEYRTLS 402
K +L Y S V N+I V D +SI+L+ Y QP+ TL
Sbjct: 899 KKGNLYNFEIEKEASYRTSTAPVDISVTGNIIAVADLMKSISLVEYHAGEGGQPD--TLK 956
Query: 403 LVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEF 462
VAR Y+ + A N
Sbjct: 957 EVARHYQTLWTTAAAPVAENE--------------------------------------- 977
Query: 463 SSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDA 522
F+++D + N+V+ + R+ ++ LG+ VN R P + +
Sbjct: 978 ----FLVADAEGNLVVLNRDTTGVTEDDRRRMQVTSELRLGEMVN-----RIHPMDLQTS 1028
Query: 523 PGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGK 582
P + + A++DG++ F + L+ LQ+ + + G + +R +K
Sbjct: 1029 PESPVIPKAFLATVDGSIYLFGLISPSAQDTLMRLQSALADFVASPGEIPFNKYRAFKSS 1088
Query: 583 GYYAGNPSRGIIDGSLVWKFL 603
A P R +DG L+ +FL
Sbjct: 1089 VRQAEEPFR-FVDGELIEQFL 1108
>gi|307186138|gb|EFN71863.1| DNA damage-binding protein 1 [Camponotus floridanus]
Length = 1136
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 76/364 (20%), Positives = 136/364 (37%), Gaps = 69/364 (18%)
Query: 278 YIALGTNY-NYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
Y +GT + N E GRILLF + K+ + KE KG ++
Sbjct: 824 YYVVGTAFINPDETEPKMGRILLFH-----------WSDGKLSQVAEKEIKGSCYSLVEF 872
Query: 337 AGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVK-NLILVGDYARSIALLRYQ 395
G L+ ++ + +++ + IA + K + +LVGD RS+ LL+Y+
Sbjct: 873 NGKLLASINSTVRLFEWTAEKELRLECSHFNNIIALYLKTKSDFVLVGDLMRSLTLLQYK 932
Query: 396 PEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKH 455
+ +ARDY P S
Sbjct: 933 TMEGSFEEIARDYNPNWMTSI--------------------------------------- 953
Query: 456 NDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIR 513
+ILD+ + +G + LF+ Q ++ ++ R + + FHLG VN F
Sbjct: 954 -EILDDDTFLG-----AENCFNLFICQKDSAATSEDERQQMQEVGQFHLGDMVNVFRHGS 1007
Query: 514 CKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNP 573
++ ++ ++ + ++ GA+G +P Y L L++ + + G +
Sbjct: 1008 LVMQNLGES-STPTQGCVLFGTVSGAIGLVTQIPFGFYEFLRNLEDKLTSVIKSVGKIEH 1066
Query: 574 RAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDE 627
+R++K G IDG L+ FL LS + E+ + G K +D+
Sbjct: 1067 NFWRSFKTD--LKIEQCEGFIDGDLIESFLDLSHDKMAEVAMGLMMDDGSGMKKEATVDD 1124
Query: 628 LYDI 631
L I
Sbjct: 1125 LVKI 1128
>gi|328874742|gb|EGG23107.1| UV-damaged DNA binding protein1 [Dictyostelium fasciculatum]
Length = 1116
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 70/347 (20%), Positives = 133/347 (38%), Gaps = 71/347 (20%)
Query: 278 YIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA 337
YI +GT Y+ + C GRIL+F +I+ +++ ++ +G + +
Sbjct: 815 YIVVGTTYHCHDRKEC-GRILVFKMID-----------SRLILLDETTVRGSIFCMIAFN 862
Query: 338 GFLVTAVGQKI--YIWQLKDNDLTGIAFIDTEVYIASMVSV-----KNLILVGDYARSIA 390
G L+ A+ + + Y W D + E+Y S+ + +LVGD +S+A
Sbjct: 863 GQLLVAINKSVHRYTWS---GDSSSGKLTGEEIYGGHTASLYLAGRGDFVLVGDMMKSMA 919
Query: 391 LLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKK 450
LL+ +D K +S+ ++ +
Sbjct: 920 LLQAS---------GKDVKELSRSSQPFWLTGLT-------------------------- 944
Query: 451 IGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFF 510
+D+ + +G SD N++L E L H G+ +N F
Sbjct: 945 -------FIDDDTYLG---SDNSYNLILMKKNTETANEVDSQLLDNIGHIHTGEFINRFH 994
Query: 511 KIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGG 570
+ D+P S +A++ G +G + +++Y LQ + GG
Sbjct: 995 HGTLATLTDVDSPKPNS---IIFATISGCIGVISTISKQDYDFFSKLQVGLNRVIRGIGG 1051
Query: 571 LNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI 617
+ +R+++ + + + SR IDG LV +FL L + LE+ K +
Sbjct: 1052 FSHDRWRSFQNE-HISNIESRNFIDGDLVEQFLHLRHDKMLEVTKDM 1097
>gi|241952575|ref|XP_002419009.1| pre-mRNA-splicing factor, putative; pre-spliceosome component,
putative [Candida dubliniensis CD36]
gi|223642349|emb|CAX42591.1| pre-mRNA-splicing factor, putative [Candida dubliniensis CD36]
Length = 1187
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 42/146 (28%), Positives = 68/146 (46%), Gaps = 17/146 (11%)
Query: 492 HRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNY 551
++L +FH+G + T + C + G S Y L G +G +PL K+
Sbjct: 1057 YKLQNLIEFHIGDII-TSLNLGCL-----NLAGTES---VIYTGLQGTIGLLVPLVSKSE 1107
Query: 552 RRLLM-LQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGER 610
LL LQ +M ++ G + FR+Y NP + +IDG L+ +FL+ R
Sbjct: 1108 VELLFNLQLLMQQFQNNLVGKDHLKFRSYY-------NPIKNVIDGDLLERFLEFDTSLR 1160
Query: 611 LEICKKIGSKHNDILDELYDIEALSS 636
+EI +K+ NDI +L D+ S+
Sbjct: 1161 IEISRKLNKSVNDIEKKLIDLRNRSA 1186
>gi|303271531|ref|XP_003055127.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226463101|gb|EEH60379.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 1223
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 59/265 (22%), Positives = 101/265 (38%), Gaps = 65/265 (24%)
Query: 370 IASMVSVK-NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGII 428
+A V V+ + I+VGD +SI+LL Y+P+ + ARD+ P
Sbjct: 990 VALYVDVRGDFIVVGDLMKSISLLVYKPDEGVIEERARDFNPN----------------- 1032
Query: 429 DGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARES 488
W +C LD+ + +G ++ N+ +A
Sbjct: 1033 -----WM---------TAVCA---------LDDETYLG---AENSFNLFTVRKNSDAAAD 1066
Query: 489 NGGHRLIKKTDFHLGQHVNTF---------------FKIRCKPSSISDAPGARSRFLTWY 533
RL ++HLG+ VN F + + ++AP +
Sbjct: 1067 EERSRLDVIGEYHLGEFVNRFRAGSLVMRLPGDGDGAGLGLGLDASNEAP------TQLF 1120
Query: 534 ASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGI 593
+++GA+G LPE + L LQ M S GG + A+R++ + +RG
Sbjct: 1121 GTVNGAIGVVASLPESTHTFLAALQKAMNKVVSGVGGFSHDAWRSFHNEHRSRLVEARGF 1180
Query: 594 IDGSLVWKFLQLSLGERLEICKKIG 618
+DG L+ FL L + E+ +G
Sbjct: 1181 VDGDLIESFLDLRPEKASEVASVVG 1205
>gi|345498295|ref|XP_001607743.2| PREDICTED: DNA damage-binding protein 1-like [Nasonia vitripennis]
Length = 1140
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 100/486 (20%), Positives = 177/486 (36%), Gaps = 91/486 (18%)
Query: 171 VRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVS 230
+R VPL +P +AY T+T+ ++T D + NG + + P S + +
Sbjct: 713 IRTVPLYESPRRIAYQESTQTFGVITM----RVDIQESNGVN---IARPSASTQAASISN 765
Query: 231 QFHVSLFSPFS------WEEIPQTNFPL---HEWEHVLCLKNVSMEYEGTLSGLR----- 276
H+ + S +E+ N + H +E + V EY +L +
Sbjct: 766 SNHIPTHNKPSNTASEIGQEVEIHNLLIVDQHTFEVLHAHTLVPTEYAMSLISTKLGEDP 825
Query: 277 -GYIALGTNY-NYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAIC 334
Y +GT N E GRILL+ K+ + KE KG ++
Sbjct: 826 TPYYIVGTAMINPDESEPKSGRILLYH-----------WNDGKLTQVAEKEIKGSCYSLV 874
Query: 335 HVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVK-NLILVGDYARSIALLR 393
G L+ ++ + +++ + IA + K + +LVGD RS+ LL+
Sbjct: 875 EFNGKLLASINSTVRLFEWTAEKELRLECSHFNNIIALYLKTKGDFVLVGDLMRSVTLLQ 934
Query: 394 YQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGS 453
Y+ + +ARDY P S
Sbjct: 935 YKTMEGSFEEIARDYNPNWMTSI------------------------------------- 957
Query: 454 KHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFK 511
+ILD+ + +G + LF+ Q ++ ++ R + + FHLG VN F
Sbjct: 958 ---EILDDDTFLG-----AENCFNLFVCQKDSAATSEEERQQMQEVGQFHLGDMVNVFRH 1009
Query: 512 IRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGL 571
+ ++ + + ++ GA+G +P Y L L++ + + G +
Sbjct: 1010 GSLVMQHLGES-STPTHGCVLFGTVCGAIGLVTQIPSTFYEFLRNLEDRLTSVIKSVGKI 1068
Query: 572 NPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHNDIL 625
+R++ G IDG L+ FL LS + E+ I G K +
Sbjct: 1069 EHNFWRSFNTD--LKIEQCEGFIDGDLIESFLDLSHEKMAEVAMGIVIDDGSGMKKEATV 1126
Query: 626 DELYDI 631
D+L I
Sbjct: 1127 DDLVKI 1132
>gi|195329354|ref|XP_002031376.1| GM24084 [Drosophila sechellia]
gi|194120319|gb|EDW42362.1| GM24084 [Drosophila sechellia]
Length = 1140
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 77/353 (21%), Positives = 135/353 (38%), Gaps = 74/353 (20%)
Query: 305 VVPEPGQP---------LTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQ--KIYIWQL 353
V+PE +P +NK+ + + G A+ G ++ +G ++Y W
Sbjct: 837 VIPEEPEPKVGRIIIFHYNENKLTQVAETKVDGTCYALVEFNGKVLAGIGSFVRLYEWT- 895
Query: 354 KDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQP 413
+ +L I + + + + ILVGD RSI LL+++ +ARD +P
Sbjct: 896 NEKELRMECNIQNMIAALYLKAKGDFILVGDLMRSITLLQHKQMEGIFVEIARDCEPK-- 953
Query: 414 NSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKD 473
W + +ILD+ + +G +
Sbjct: 954 --------------------WM------------------RAVEILDDDTFLG-----SE 970
Query: 474 KNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFL- 530
N LF+ Q ++ + R L + FHLG VN F S + G R+ +
Sbjct: 971 TNGNLFVCQKDSAATTDEERQLLPELARFHLGDTVNVFRH----GSLVMQNVGERTTPIN 1026
Query: 531 --TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGN 588
Y + +GA+G +P+ Y L L+ + G + + +R ++ +
Sbjct: 1027 GCVLYGTCNGAIGIVTQIPQDFYDFLHGLEERLKKIIKLVGKIGHKFYRNFRI--HTQVE 1084
Query: 589 PSRGIIDGSLVWKFLQLSLG------ERLEICKKIGSKHNDILDELYDIEALS 635
PS+G IDG L+ FL LS + LE+ K D+ D + +E L+
Sbjct: 1085 PSQGFIDGDLIESFLDLSRDKMRDAVQGLELTLNGERKSADVEDVIKIVEDLT 1137
>gi|195108657|ref|XP_001998909.1| GI23368 [Drosophila mojavensis]
gi|193915503|gb|EDW14370.1| GI23368 [Drosophila mojavensis]
Length = 1140
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 66/299 (22%), Positives = 113/299 (37%), Gaps = 59/299 (19%)
Query: 315 KNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIAS 372
+NK+ + + G A+ G ++ +G ++Y W + +L I +
Sbjct: 856 ENKLTQVAETKVDGTCYALVEFNGKVLAGIGSFVRLYEWT-NEKELRMECNIQNMIAALF 914
Query: 373 MVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSL 432
+ + + ILVGD RSI LL+++ +ARD +P
Sbjct: 915 LKAKGDFILVGDLMRSITLLQHKQMEGIFVEIARDCEPK--------------------- 953
Query: 433 VWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGH 492
W + +ILD+ + +G D LF+ Q ++ +
Sbjct: 954 -WM------------------RAVEILDDDTFLGCETHDN-----LFVCQKDSAATTDEE 989
Query: 493 R--LIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFL---TWYASLDGALGFFLPLP 547
R L + FHLG +N F S + G R+ + Y + +GA+G +P
Sbjct: 990 RQLLPELARFHLGDTINVFRH----GSLVMQNVGERTTPINGCVLYGTCNGAIGIVTQIP 1045
Query: 548 EKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLS 606
+ Y L L+ + G ++ +R Y+ PS G IDG L+ FL LS
Sbjct: 1046 QDFYDFLHGLEERLKKIIKSVGKIDHTYYRNYQINTKV--EPSEGFIDGDLIESFLDLS 1102
>gi|68476233|ref|XP_717766.1| potential spliceosomal U2 snRNP complex SF3b component [Candida
albicans SC5314]
gi|68476422|ref|XP_717672.1| potential spliceosomal U2 snRNP complex SF3b component [Candida
albicans SC5314]
gi|74586274|sp|Q5A7S5.1|RSE1_CANAL RecName: Full=Pre-mRNA-splicing factor RSE1
gi|46439394|gb|EAK98712.1| potential spliceosomal U2 snRNP complex SF3b component [Candida
albicans SC5314]
gi|46439495|gb|EAK98812.1| potential spliceosomal U2 snRNP complex SF3b component [Candida
albicans SC5314]
Length = 1219
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 41/146 (28%), Positives = 68/146 (46%), Gaps = 17/146 (11%)
Query: 492 HRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNY 551
++L +FH+G + T F + C + G S Y L G +G +PL K+
Sbjct: 1089 YKLQNLIEFHIGDII-TSFNLGCL-----NLAGTES---VIYTGLQGTIGLLIPLVSKSE 1139
Query: 552 RRLLM-LQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGER 610
LL LQ M ++ G + R+Y NP + +IDG L+ +FL+ + +
Sbjct: 1140 VELLFNLQLYMQQSQNNLVGKDHLKLRSYY-------NPIKNVIDGDLLERFLEFDISLK 1192
Query: 611 LEICKKIGSKHNDILDELYDIEALSS 636
+EI +K+ NDI +L D+ S+
Sbjct: 1193 IEISRKLNKSVNDIEKKLIDLRNRSA 1218
>gi|406865227|gb|EKD18269.1| CPSF A subunit region [Marssonina brunnea f. sp. 'multigermtubi'
MB_m1]
Length = 1146
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 74/365 (20%), Positives = 141/365 (38%), Gaps = 70/365 (19%)
Query: 281 LGTNY--NYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAG 338
+GT++ S D +GRIL+F I +P K ++ + K + + G
Sbjct: 840 VGTSFLDEESADPNIKGRILVFGI-----DP-----KKNPYLVASLNLKCACRRVAMLDG 889
Query: 339 FLVTAVGQKIYIWQL-----KDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLR 393
+V + + + +++ K + +A + + +N+I + D +S+++++
Sbjct: 890 KIVAVLNKTVAMFKYVEITEKAGEFKKLATFRSSTVPIDIAITENIIAITDMMQSVSIVQ 949
Query: 394 YQPEYR----TLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICK 449
Y P L VARDY+ W
Sbjct: 950 YTPGKEGMPDKLEQVARDYQT----------------------CW--------------- 972
Query: 450 KIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 509
G+ DI D ++ SD N+++ + RL + +LG+ VN
Sbjct: 973 --GTAVTDIGDN----SWLESDHHGNLLVLQRNIDGITLEDKQRLRITGEMNLGEQVNMI 1026
Query: 510 FKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
KI PS P A + A+ +G++ F + + + LL LQ + G
Sbjct: 1027 RKIAIDPS-----PTAMVVPKAFLATTEGSIYLFSTILDGSQDLLLRLQENITECVDTLG 1081
Query: 570 GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELY 629
L+ + +R++K P R +DG L+ +FL S + +IC+ +G I D +
Sbjct: 1082 RLDFKTYRSFKSAERTTEEPYR-FVDGELIERFLDESEDMQQQICEGLGYTVEAIRDVVE 1140
Query: 630 DIEAL 634
+++ L
Sbjct: 1141 NLKRL 1145
>gi|221508103|gb|EEE33690.1| conserved hypothetical protein [Toxoplasma gondii VEG]
Length = 1878
Score = 53.5 bits (127), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 45/183 (24%), Positives = 82/183 (44%), Gaps = 27/183 (14%)
Query: 232 FHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDV 291
+ V L+ F P + L E VL L V L G+ ++A G SE+V
Sbjct: 1433 YEVRLYHEFDLHR-PVGTYTLRTCEEVLSLSFV------VLDGVE-HLAAGVGVPLSENV 1484
Query: 292 TCRGRILLFDIIE----VVP---------EPGQPLTKNKIKMIYAKEQKGPVTAICHV-- 336
C GR+ LF + E VVP E + T ++++ GPVT +
Sbjct: 1485 ECGGRVYLFKLPESSLRVVPAGNAGDAPTEEAEFGTPERLELFADIVLNGPVTVVGSFFS 1544
Query: 337 ----AGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALL 392
++V +VG ++++ +++ + AF D V + S+ +++N L+GD + + L+
Sbjct: 1545 SPAERSYVVHSVGPRLFVHEMEGSKFLRGAFSDASVCVTSVANIRNFFLLGDALKGLNLV 1604
Query: 393 RYQ 395
++
Sbjct: 1605 SWE 1607
>gi|301124447|ref|XP_002909707.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262106897|gb|EEY64949.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 328
Score = 53.5 bits (127), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 79/322 (24%), Positives = 127/322 (39%), Gaps = 84/322 (26%)
Query: 99 QGVFLCGPHPAWLFLTSRGELRAHPMTIDG-------------------PVSTLAPFHNV 139
G F G HP W+ L RG PM + PV + PFH+
Sbjct: 2 SGAFFRGAHPMWI-LGDRGHASFVPMCVPSSAPPKANGTSKNAAPRVSVPVLSFTPFHHW 60
Query: 140 NCPRGFLYFNAKSELRISVLP-----THLSYDAPWPVRKVPLKCTPHFLAY--------- 185
+CP GF+YF+++ LR+ LP T L + ++K T H + Y
Sbjct: 61 SCPNGFIYFHSRGALRVCELPSSKTSTILPSSGGFVLQKAEFGATLHHMLYLGSHGPGGV 120
Query: 186 --HLETKTYCIVTST------AEPSTDYYKFNGEDKELVTDPR----DSRFIPPLVSQF- 232
LE TY +V S A+ +T+ E + DP S + P F
Sbjct: 121 AEALEAPTYAVVCSARLKPADADRATEVEGAEEELEPENLDPNGNPLGSNVMAPTAEMFA 180
Query: 233 -----HVSLFSPFSWE-EIPQTN----------FPLH--EWEHVLCLK-----NVSMEYE 269
H++ +E + QT+ F +H +E VL +K + S+ E
Sbjct: 181 DYETDHMAHTEEDVYELRLVQTDEFGEWGRRGVFRVHFERYEVVLSVKLMYLYDSSLMKE 240
Query: 270 GTLSGL-------RGYIALGTNY--NYSEDVTCRGRILLF--DIIEVVPEPGQPLTKN-- 316
S R Y+ +GT + + ED + RGR+LL+ D + V E G +
Sbjct: 241 EVASTSPEWNKKKRPYLVVGTGWVGPHGEDESGRGRLLLYELDYAQYVNEEGGATSGKLP 300
Query: 317 KIKMIYAKEQK-GPVTAICHVA 337
K+++++ KE + G V+ + +
Sbjct: 301 KLRLVFIKEHRQGAVSMVSQLG 322
>gi|195996153|ref|XP_002107945.1| hypothetical protein TRIADDRAFT_18324 [Trichoplax adhaerens]
gi|190588721|gb|EDV28743.1| hypothetical protein TRIADDRAFT_18324 [Trichoplax adhaerens]
Length = 1134
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 66/329 (20%), Positives = 125/329 (37%), Gaps = 56/329 (17%)
Query: 315 KNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDN-DLTGIAFIDTEVYIASM 373
+ KI+ +++KE G V + G L+ +V + +++ N +L V +
Sbjct: 851 EGKIQQVHSKEVSGAVYCMVAFNGRLLASVNSTVSVYEWTSNKELVEETSFHNNVLALYL 910
Query: 374 VSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLV 433
+ + IL+GD RSI+L Y+P + L+ ++ P
Sbjct: 911 KTKGDFILIGDLMRSISLCAYRPMNNEIELICKNNDPN---------------------- 948
Query: 434 WKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR 493
W +I+D+ S +G + N LF Q + S +
Sbjct: 949 WM------------------TAVEIIDDDSYLG---GENSHN--LFTCQKNSSSSEEEQK 985
Query: 494 LIKKTD-FHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYR 552
+ +H+G+ VN F + + D P + + + ++ GA+G + L +
Sbjct: 986 HLPTVGVYHVGEFVNVFRQGSLVMQNTVDIPDSVQGSI-LFGTVSGAVGVVVTLAPAMFE 1044
Query: 553 RLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLS------ 606
+ + N + T G + + +R++ P + +DG LV FL LS
Sbjct: 1045 FVSAIANKLSTVVKGVGKIEHQFWRSFSND--RKTEPCQSFVDGDLVESFLDLSPEDMQR 1102
Query: 607 LGERLEICKKIGSKHNDILDELYDIEALS 635
+ L I G++ + D L +E LS
Sbjct: 1103 VANGLTIQTADGTRPAMVEDVLKTVEELS 1131
>gi|308808936|ref|XP_003081778.1| putative UV-damaged DNA binding factor (ISS) [Ostreococcus tauri]
gi|116060244|emb|CAL56303.1| putative UV-damaged DNA binding factor (ISS) [Ostreococcus tauri]
Length = 1282
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 85/427 (19%), Positives = 160/427 (37%), Gaps = 117/427 (27%)
Query: 171 VRKVPLKCTPHFLAYHLETKTYCIVTST--AEPSTDYYKFNGEDKELVTDPRDSRFIPPL 228
+R +PL P +A+ ET T+ +V ++ S D +
Sbjct: 897 IRTIPLGGQPRRIAHQPETNTFAVVVEHLWSKSSQDCF---------------------- 934
Query: 229 VSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNY- 287
V L S+E + Q F L + E L + + + T Y +GT
Sbjct: 935 -----VRLVDDGSFETLSQ--FQLEDQELTSSLTSCTFAGDSTT-----YYVVGTGIALE 982
Query: 288 SEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQK 347
+ED RGRIL+F + + +++ ++ KE +G V + G L+ + K
Sbjct: 983 TEDEPSRGRILVFKVDD-----------DQLVLVSEKEVRGAVYNLNAFKGKLLAGINSK 1031
Query: 348 I--YIWQLKDND---LTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLS 402
+ + W ++++ L ++ ++ + + ILVGD +S++LL Y+PE +
Sbjct: 1032 LELFKWTPREDEVHELVSECSHHGQIVTFAVKTRGDWILVGDLMKSMSLLLYKPEEGAID 1091
Query: 403 LVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEF 462
VARD+ + ++D + +G++++ L
Sbjct: 1092 EVARDFNANWMTAV---------AMLDDDETY----------------LGAENSLNLFTV 1126
Query: 463 SSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDA 522
S ++D++++ RL ++HLG+ VN F A
Sbjct: 1127 SRNVNAVTDEERS-----------------RLEITGEYHLGELVNAF------------A 1157
Query: 523 PGARSRFLT----------WYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 572
PG+ L + + +G +G LP+ Y LQ + H GGL
Sbjct: 1158 PGSLVMSLRDGESLSVPTLLFGTANGVIGVLASLPKDVYEFTERLQASINKHIQGVGGLK 1217
Query: 573 PRAFRTY 579
+R++
Sbjct: 1218 HADWRSF 1224
>gi|398391687|ref|XP_003849303.1| hypothetical protein MYCGRDRAFT_87400 [Zymoseptoria tritici IPO323]
gi|339469180|gb|EGP84279.1| hypothetical protein MYCGRDRAFT_87400 [Zymoseptoria tritici IPO323]
Length = 1143
Score = 53.1 bits (126), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 69/352 (19%), Positives = 136/352 (38%), Gaps = 75/352 (21%)
Query: 281 LGTNYNYSEDVT-CRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGF 339
+GT Y +D + +GRIL+ ++ E ++K++ +G + G
Sbjct: 836 IGTAYLDDQDASNAKGRILVLEVTE----------DRRLKLVTEISVRGACRCLAVSHGR 885
Query: 340 LVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIAS-----MVSVKNLILVGDYARSIALLRY 394
+V A+ + + I+ + + A + Y S M ++I V D +S++L+++
Sbjct: 886 IVAALIKTVIIYSFEYETPSSPAMVKKAAYRTSTAPIDMCVTGDIIAVTDLMKSMSLVQH 945
Query: 395 Q------PEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEIC 448
P+ L+ VAR + +W ++ E +
Sbjct: 946 TLGQAGGPD--NLTEVARHFDT----------------------LWGTAVANVDENI--- 978
Query: 449 KKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNT 508
++ SD + N+V+ + + RL ++ LG+ VN
Sbjct: 979 ------------------YLESDAEGNLVVLEHDVKGFSEEDRRRLRVTSEILLGEMVNR 1020
Query: 509 FFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHT 568
+I P+ P A + A+++G++ F + E L+ +QN M
Sbjct: 1021 IRRIDVSPT-----PNATVIPRAFLATVEGSIYLFALIAEGKQDLLIRMQNKMAEMVQSP 1075
Query: 569 GGLNPRAFRTYKGKGYYAGN--PSRGIIDGSLVWKFLQLSLGERLEICKKIG 618
G + FR +K + G PSR +DG L+ +FL + E+ K++G
Sbjct: 1076 GHVPFAKFRGFKTQVRDMGEEGPSR-FVDGELIERFLDCDEDVQAEVAKELG 1126
>gi|195571247|ref|XP_002103615.1| GD18880 [Drosophila simulans]
gi|194199542|gb|EDX13118.1| GD18880 [Drosophila simulans]
Length = 1140
Score = 53.1 bits (126), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 70/318 (22%), Positives = 119/318 (37%), Gaps = 68/318 (21%)
Query: 305 VVPEPGQP---------LTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQ--KIYIWQL 353
V+PE +P +NK+ + + G A+ G ++ +G ++Y W
Sbjct: 837 VIPEEPEPKVGRIIIFHYNENKLTQVAETKVDGTCYALVEFNGKVLAGIGSFVRLYEWT- 895
Query: 354 KDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQP 413
+ +L I + + + + ILVGD RSI LL+++ +ARD +P
Sbjct: 896 NEKELRMECNIQNMIAALYLKAKGDFILVGDLMRSITLLQHKQMEGIFVEIARDCEPK-- 953
Query: 414 NSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKD 473
W + +ILD+ + +G +
Sbjct: 954 --------------------WM------------------RAVEILDDDTFLG-----SE 970
Query: 474 KNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFL- 530
N LF+ Q ++ + R L + FHLG VN F S + G R+ +
Sbjct: 971 TNGNLFVCQKDSAATTDEERQLLPELARFHLGDTVNVFRH----GSLVMQNVGERTTPIN 1026
Query: 531 --TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGN 588
Y + +GA+G +P+ Y L L+ + G + +R ++
Sbjct: 1027 GCVLYGTCNGAIGIVTQIPQDFYDFLHGLEERLKKIIKSVGKIEHTYYRNFQINTKV--E 1084
Query: 589 PSRGIIDGSLVWKFLQLS 606
PS G IDG L+ FL LS
Sbjct: 1085 PSEGFIDGDLIESFLDLS 1102
>gi|21357503|ref|NP_650257.1| piccolo [Drosophila melanogaster]
gi|74872881|sp|Q9XYZ5.1|DDB1_DROME RecName: Full=DNA damage-binding protein 1; Short=D-DDB1; AltName:
Full=Damage-specific DNA-binding protein 1; AltName:
Full=Protein piccolo
gi|4928452|gb|AAD33592.1|AF132145_1 damage-specific DNA binding protein DDBa p127 subunit [Drosophila
melanogaster]
gi|7299719|gb|AAF54901.1| piccolo [Drosophila melanogaster]
gi|220942640|gb|ACL83863.1| DDB1-PA [synthetic construct]
Length = 1140
Score = 53.1 bits (126), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 70/318 (22%), Positives = 119/318 (37%), Gaps = 68/318 (21%)
Query: 305 VVPEPGQPLT---------KNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQ--KIYIWQL 353
V+PE +P +NK+ + + G A+ G ++ +G ++Y W
Sbjct: 837 VIPEEPEPKVGRIIIFHYHENKLTQVAETKVDGTCYALVEFNGKVLAGIGSFVRLYEWT- 895
Query: 354 KDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQP 413
+ +L I + + + + ILVGD RSI LL+++ +ARD +P
Sbjct: 896 NEKELRMECNIQNMIAALFLKAKGDFILVGDLMRSITLLQHKQMEGIFVEIARDCEPK-- 953
Query: 414 NSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKD 473
W + +ILD+ + +G +
Sbjct: 954 --------------------WM------------------RAVEILDDDTFLG-----SE 970
Query: 474 KNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFL- 530
N LF+ Q ++ + R L + FHLG VN F S + G R+ +
Sbjct: 971 TNGNLFVCQKDSAATTDEERQLLPELARFHLGDTVNVFRH----GSLVMQNVGERTTPIN 1026
Query: 531 --TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGN 588
Y + +GA+G +P+ Y L L+ + G + +R ++
Sbjct: 1027 GCVLYGTCNGAIGIVTQIPQDFYDFLHGLEERLKKIIKSVGKIEHTYYRNFQINSKV--E 1084
Query: 589 PSRGIIDGSLVWKFLQLS 606
PS G IDG L+ FL LS
Sbjct: 1085 PSEGFIDGDLIESFLDLS 1102
>gi|195395112|ref|XP_002056180.1| GJ10363 [Drosophila virilis]
gi|194142889|gb|EDW59292.1| GJ10363 [Drosophila virilis]
Length = 1140
Score = 52.8 bits (125), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 70/319 (21%), Positives = 119/319 (37%), Gaps = 70/319 (21%)
Query: 295 GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQ--KIYIWQ 352
GRI++F E NK+ + + G A+ G ++ +G ++Y W
Sbjct: 847 GRIIIFHYNE-----------NKLTQVAETKVDGTCYALVEFNGKVLAGIGSFVRLYEWT 895
Query: 353 LKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQ 412
+ +L I + + + + ILVGD RSI LL+++ +ARD +P
Sbjct: 896 -NEKELRMECNIQNMIAALFLKAKGDFILVGDLMRSITLLQHKQMEGIFVEIARDCEPK- 953
Query: 413 PNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDK 472
W + +ILD+ + +G D
Sbjct: 954 ---------------------WM------------------RAVEILDDDTFLGCETHDN 974
Query: 473 DKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFL 530
LF+ Q ++ + R L + FHLG +N F S + G R+ +
Sbjct: 975 -----LFVCQKDSAATTDEERQLLPELARFHLGDTINVFRH----GSLVMQNVGERTTPI 1025
Query: 531 ---TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAG 587
Y + +GA+G +P+ Y L L+ + G ++ +R Y+
Sbjct: 1026 NGCVLYGTCNGAIGIVTQIPQDFYDFLHGLEERLKKIIKSVGKIDHTYYRNYQINTKV-- 1083
Query: 588 NPSRGIIDGSLVWKFLQLS 606
PS G IDG L+ FL L+
Sbjct: 1084 EPSEGFIDGDLIESFLDLN 1102
>gi|195037449|ref|XP_001990173.1| GH18378 [Drosophila grimshawi]
gi|193894369|gb|EDV93235.1| GH18378 [Drosophila grimshawi]
Length = 1140
Score = 52.8 bits (125), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 73/337 (21%), Positives = 126/337 (37%), Gaps = 71/337 (21%)
Query: 278 YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
Y + T+ Y E+ + GRI++F NK+ + + G A+
Sbjct: 829 YYVVATSLVYPEEPEPKVGRIIIFH-----------YNDNKLTQVAETKVDGTCYALVEF 877
Query: 337 AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
G ++ +G ++Y W + +L I + + + + ILVGD RSI LL++
Sbjct: 878 NGKVLAGIGSFVRLYEWT-NEKELRMECNIQNMIAALFLKAKGDFILVGDLMRSITLLQH 936
Query: 395 QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
+ +ARD +P W +
Sbjct: 937 KQMEGIFVEIARDCEPK----------------------WM------------------R 956
Query: 455 HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
+ILD+ + +G D LF+ Q ++ + R L + FHLG +N F
Sbjct: 957 AVEILDDDTFLGCETHDN-----LFVCQKDSAATTDEERQLLPELARFHLGDTINVFRH- 1010
Query: 513 RCKPSSISDAPGARSRFL---TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
S + G R+ + Y + +GA+G +P+ Y L L+ + G
Sbjct: 1011 ---GSLVMQNVGERTTPINGCVLYGTCNGAIGIVTQIPQDFYDFLHGLEERLKKIIKSVG 1067
Query: 570 GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLS 606
++ +R Y+ PS G IDG L+ FL L+
Sbjct: 1068 KIDHTYYRNYQINTKV--EPSEGFIDGDLIESFLDLN 1102
>gi|169848339|ref|XP_001830877.1| pre-mRNA-splicing factor RSE1 [Coprinopsis cinerea okayama7#130]
gi|116508046|gb|EAU90941.1| pre-mRNA-splicing factor RSE1 [Coprinopsis cinerea okayama7#130]
Length = 1213
Score = 52.4 bits (124), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 69/326 (21%), Positives = 131/326 (40%), Gaps = 49/326 (15%)
Query: 318 IKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQL-KDNDLTGIAFIDTEVYIASMVSV 376
+++++ E A+ G L VG+ + I+ + K L + I ++ +
Sbjct: 932 LELLHKTETDDVPMALLAFQGRLAAGVGKALRIYDIGKKKLLRKVENKSFTTAIVTLTTQ 991
Query: 377 KNLILVGDYARSIALLRY-QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWK 435
+ ILVGD S+ + Y QPE R L+ A D +P R + ++V
Sbjct: 992 GSRILVGDMQESVQYVVYKQPENRLLTF-ADDTQP--------------RWVTAITMV-D 1035
Query: 436 FLQLSLGERLE--ICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR 493
+ + G+R ++ SK +D +DE + ++ +K + M P H+
Sbjct: 1036 YNTIVAGDRFGNIFVNRLDSKVSDQVDEDPTGAGILHEKP----ILMGAP--------HK 1083
Query: 494 LIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLP-LPEKNYR 552
FH+G + + K+ A R + Y L G +G +P + +++
Sbjct: 1084 TKMIAHFHVGDIITSLHKVSLV---------AGGREVIVYTGLHGTIGILMPFISKEDVD 1134
Query: 553 RLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLE 612
+ L+ M T G + A+R GYY P + ++DG L + L ++
Sbjct: 1135 FISTLEQHMRTEQPSLVGRDQLAYR-----GYYV--PVKAVVDGDLCETYAHLPASKQSS 1187
Query: 613 ICKKIGSKHNDILDELYDIEALSSHF 638
I ++ ++L +L + SS F
Sbjct: 1188 IANELDRTVGEVLKKLEQMRVTSSGF 1213
>gi|321478515|gb|EFX89472.1| hypothetical protein DAPPUDRAFT_303245 [Daphnia pulex]
Length = 1158
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 68/321 (21%), Positives = 117/321 (36%), Gaps = 59/321 (18%)
Query: 305 VVPEPGQP---------LTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKD 355
VVPE +P K+ + KE KG ++ ++ A+ + +++
Sbjct: 853 VVPEESEPKQGRIVLFQWADGKLTTVAEKEVKGACYSLVDFNSKILAAINNVVRLYEWTA 912
Query: 356 NDLTGIAFIDTEVYIASMVSVK-NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPN 414
+ + IA + K + ILVGD RSI LL+Y+ + +ARD P
Sbjct: 913 EKELRLECSNFNHIIALYLKRKGDFILVGDLMRSITLLQYKTMEGSFEEMARDSNPN--- 969
Query: 415 SKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDK 474
W +ILD+ + +G +
Sbjct: 970 -------------------WM------------------SAVEILDDDTFLG-----AEN 987
Query: 475 NVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTW 532
+ LF+ Q ++ + R L + FHLG VN F ++ ++
Sbjct: 988 SFNLFVCQKDSAATTEEERQQLTEVGRFHLGDMVNVFRHGSLVMDHAAETLTTPTQGCVL 1047
Query: 533 YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 592
+ ++ GA+G LP + Y L +Q M G + +R++ + P G
Sbjct: 1048 FGTVHGAIGVVTQLPSEFYHFLSEVQTRMARVIKPVGKIEHSFWRSFATERKV--EPCEG 1105
Query: 593 IIDGSLVWKFLQLSLGERLEI 613
IDG L+ FL LS + E+
Sbjct: 1106 FIDGDLIESFLDLSSDKMKEV 1126
>gi|449684814|ref|XP_004210722.1| PREDICTED: DNA damage-binding protein 1-like, partial [Hydra
magnipapillata]
Length = 725
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 86/368 (23%), Positives = 138/368 (37%), Gaps = 70/368 (19%)
Query: 278 YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
Y +GT+ Y E+ + G+I+LF + E K+ I +K G V +
Sbjct: 415 YYCVGTSMVYPEESEPKEGKIILFQLFE-----------GKLVQIGSKTVNGAVYVLQGF 463
Query: 337 AGFLVTAVGQKIYIWQ-LKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQ 395
G L+ V + +++ D +L + + S + ILVGD RS+ LL Y+
Sbjct: 464 NGKLLAGVNSLVSVYEWTSDKELKQECCYHNTILALYLKSKGDFILVGDLMRSMTLLAYK 523
Query: 396 PEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKH 455
P R L +A D+ P + IID FL L IC+K S
Sbjct: 524 PLGR-LEEIAHDFSPNWM---------TAVEIIDDD---TFLGAENSFNLFICQKDNSSV 570
Query: 456 NDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF--FKIR 513
ND E R H L +HLG VN F +
Sbjct: 571 ND--------------------------EER-----HHLQTIGKYHLGDFVNVFKHGSLV 599
Query: 514 CKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNP 573
S+ P + S Y ++ GA+G LP+ + L +Q + G +
Sbjct: 600 MHHSTEQLTPISSS---ILYGTVRGAIGLVAGLPKNTFDFLSQVQEKLSKTIKSVGKIEH 656
Query: 574 RAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEIC-----KKIGSKHNDILDEL 628
+R++ + + G +DG L+ L L+ + E+ ++ G K +D+L
Sbjct: 657 EFWRSFYNDK--KTDLAVGCVDGDLIESCLDLTRTQLHEVVSGLEIEEAGIKRECTVDDL 714
Query: 629 YD-IEALS 635
+E LS
Sbjct: 715 IKVVEELS 722
>gi|195449948|ref|XP_002072297.1| GK22405 [Drosophila willistoni]
gi|194168382|gb|EDW83283.1| GK22405 [Drosophila willistoni]
Length = 1140
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 66/298 (22%), Positives = 112/298 (37%), Gaps = 59/298 (19%)
Query: 316 NKIKMIYAKEQKGPVTAICHVAGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASM 373
NK+ + + G A+ G ++ +G ++Y W + +L I + +
Sbjct: 857 NKLTQVAETKVDGTCYALVEFNGKVLAGIGSFVRLYEWT-NEKELRMECNIQNMIAALYL 915
Query: 374 VSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLV 433
+ + ILVGD RSI LL+++ +ARD +P
Sbjct: 916 KAKGDFILVGDLMRSITLLQHKQMEGIFVEIARDCEPK---------------------- 953
Query: 434 WKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR 493
W + +ILD+ + +G + N LF+ Q ++ + R
Sbjct: 954 WM------------------RAVEILDDDTFLG-----SETNGNLFVCQKDSAATTDEER 990
Query: 494 --LIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFL---TWYASLDGALGFFLPLPE 548
L + FHLG VN F S + G R+ + Y + +GA+G +P+
Sbjct: 991 QLLPELARFHLGDTVNVFRH----GSLVMQNVGERTTPINGCVLYGTCNGAIGIVTQIPQ 1046
Query: 549 KNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLS 606
Y L L+ + G + +R ++ PS G IDG L+ FL LS
Sbjct: 1047 DFYDFLHGLEERLKKIIKSVGKIEHTYYRNFQINTKV--EPSEGFIDGDLIESFLDLS 1102
>gi|125774475|ref|XP_001358496.1| GA20574 [Drosophila pseudoobscura pseudoobscura]
gi|54638233|gb|EAL27635.1| GA20574 [Drosophila pseudoobscura pseudoobscura]
Length = 1140
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 66/298 (22%), Positives = 112/298 (37%), Gaps = 59/298 (19%)
Query: 316 NKIKMIYAKEQKGPVTAICHVAGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASM 373
NK+ + + G A+ G ++ +G ++Y W + +L I + +
Sbjct: 857 NKLTQVAETKVDGTCYALVEFNGKVLAGIGSFVRLYEWT-NEKELRMECNIQNMIAALYL 915
Query: 374 VSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLV 433
+ + ILVGD RSI LL+++ +ARD +P
Sbjct: 916 KAKGDFILVGDLMRSITLLQHKQMEGIFVEIARDCEPK---------------------- 953
Query: 434 WKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR 493
W + +ILD+ + +G + N LF+ Q ++ + R
Sbjct: 954 WM------------------RAVEILDDDTFLG-----SETNGNLFVCQKDSAATTDEER 990
Query: 494 --LIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFL---TWYASLDGALGFFLPLPE 548
L + FHLG VN F S + G R+ + Y + +GA+G +P+
Sbjct: 991 QLLPELARFHLGDTVNVFRH----GSLVMQNVGERTTPINGCVLYGTCNGAIGIVTQIPQ 1046
Query: 549 KNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLS 606
Y L L+ + G + +R ++ PS G IDG L+ FL LS
Sbjct: 1047 DFYDFLHGLEERLKKIIKSVGKIEHTYYRNFQINTKV--EPSEGFIDGDLIESFLDLS 1102
>gi|339235331|ref|XP_003379220.1| DNA damage-binding protein 1 [Trichinella spiralis]
gi|316978142|gb|EFV61158.1| DNA damage-binding protein 1 [Trichinella spiralis]
Length = 1329
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 70/308 (22%), Positives = 130/308 (42%), Gaps = 62/308 (20%)
Query: 315 KNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDT-EVYIASM 373
+ + +++ KE G V A+ L+ A+ + +++ +D+TG+ + + +++ +M
Sbjct: 1040 NSSLNLVHEKEVNGCVYAMASFKSKLLVAMNSSVLLFEW--SDVTGLQLVSSCSLFVTAM 1097
Query: 374 -VSVKN-LILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGS 431
+ V++ +ILVGD RSIA+LRY P + ARDY P
Sbjct: 1098 HLKVRDEVILVGDIQRSIAVLRYVPSESSFVEEARDYHPN-------------------- 1137
Query: 432 LVWKFLQLSLGERLEICKKIGSKHND-ILDEFSSMGFMISDKDKNVVLFMYQP--EARES 488
W +S E ++ ND + +S+ +S KD QP E++
Sbjct: 1138 --W----ISAIEVID---------NDYFMAAENSLNITVSQKD-----LQQQPVSESQVV 1177
Query: 489 NGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLT--WYASLDGALGFFLPL 546
RL HLG+++N F + S+ G S + +G++ + +
Sbjct: 1178 KSAGRL------HLGEYINVF---KHGALSMYSYAGISSLVSNPIMIGTAEGSILIYCQI 1228
Query: 547 PEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGN-PSRGIIDGSLVWKFLQL 605
+ ++R L LQ G A+ +Y+ Y N P+ G IDG L+ + L++
Sbjct: 1229 HDSHFRVLNDLQRCFSDIVPDNVGC--IAYDSYRRYVVYEKNAPAFGFIDGDLIEQLLEM 1286
Query: 606 SLGERLEI 613
E + +
Sbjct: 1287 PRQEAIRL 1294
>gi|124806507|ref|XP_001350742.1| splicing factor 3b, subunit 3, 130kD, putative [Plasmodium falciparum
3D7]
gi|23496869|gb|AAN36422.1|AE014849_41 splicing factor 3b, subunit 3, 130kD, putative [Plasmodium falciparum
3D7]
Length = 1329
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 70/318 (22%), Positives = 111/318 (34%), Gaps = 79/318 (24%)
Query: 334 CHVAGFLVTAVGQKIYIWQL-KDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALL 392
C G L+ ++G K+ I+ L K L + D I S+ N I D S+ +
Sbjct: 1066 CSYNGKLIASIGNKLRIYALGKKKLLKKCEYKDIPEAIVSIKISGNRIFACDIRESVLIF 1125
Query: 393 RYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 452
Y P TL L++ D P R C +I
Sbjct: 1126 FYDPNQNTLRLISDDIIP---------------------------------RWITCSEIL 1152
Query: 453 SKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARE------------------SNGGHRL 494
H M +DK +V + EA++ S +L
Sbjct: 1153 DHHT----------IMAADKFDSVFILRVPEEAKQDEYGITNKCWYGGEIMNSSTKNRKL 1202
Query: 495 IKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRL 554
FH+G+ V + K+R P+S Y+++ G +G F+P K L
Sbjct: 1203 EHMMSFHIGEIVTSMQKVRLSPTSSE---------CIIYSTIMGTIGAFIPYDNKEELEL 1253
Query: 555 LM-LQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEI 613
L+ ++ T G FR+Y +P + ++DG L +F LS + +I
Sbjct: 1254 TQHLEIILRTEKPPLCGREHIFFRSYY-------HPVQNVVDGDLCEQFSSLSYDAQKKI 1306
Query: 614 CKKIGSKHNDILDELYDI 631
+ DIL +L DI
Sbjct: 1307 ANDLERTPEDILRKLEDI 1324
>gi|221486318|gb|EEE24579.1| conserved hypothetical protein [Toxoplasma gondii GT1]
Length = 2804
Score = 51.2 bits (121), Expect = 0.002, Method: Composition-based stats.
Identities = 45/183 (24%), Positives = 82/183 (44%), Gaps = 27/183 (14%)
Query: 232 FHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDV 291
+ V L+ F P + L E VL L V L G+ ++A G SE+V
Sbjct: 2359 YEVRLYHEFDLHR-PVGTYTLRTCEEVLSLSFV------VLDGVE-HLAAGVGVPLSENV 2410
Query: 292 TCRGRILLFDIIE----VVP---------EPGQPLTKNKIKMIYAKEQKGPVTAICHV-- 336
C GR+ LF + E VVP E + T ++++ GPVT +
Sbjct: 2411 ECGGRVYLFKLPESSLRVVPAGNAGDAPTEEAEFGTPERLELFADIVLNGPVTVVGSFFS 2470
Query: 337 ----AGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALL 392
++V +VG ++++ +++ + AF D V + S+ +++N L+GD + + L+
Sbjct: 2471 SPAERSYVVHSVGPRLFVHEMEGSKFLRGAFSDASVCVTSVANIRNFFLLGDALKGLNLV 2530
Query: 393 RYQ 395
++
Sbjct: 2531 SWE 2533
>gi|237833631|ref|XP_002366113.1| hypothetical protein TGME49_024280 [Toxoplasma gondii ME49]
gi|211963777|gb|EEA98972.1| hypothetical protein TGME49_024280 [Toxoplasma gondii ME49]
Length = 2804
Score = 51.2 bits (121), Expect = 0.002, Method: Composition-based stats.
Identities = 45/183 (24%), Positives = 82/183 (44%), Gaps = 27/183 (14%)
Query: 232 FHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDV 291
+ V L+ F P + L E VL L V L G+ ++A G SE+V
Sbjct: 2359 YEVRLYHEFDLHR-PVGTYTLRTCEEVLSLSFV------VLDGVE-HLAAGVGVPLSENV 2410
Query: 292 TCRGRILLFDIIE----VVP---------EPGQPLTKNKIKMIYAKEQKGPVTAICHV-- 336
C GR+ LF + E VVP E + T ++++ GPVT +
Sbjct: 2411 ECGGRVYLFKLPESSLRVVPAGNAGDAPTEEAEFGTPERLELFADIVLNGPVTVVGSFFS 2470
Query: 337 ----AGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALL 392
++V +VG ++++ +++ + AF D V + S+ +++N L+GD + + L+
Sbjct: 2471 SPAERSYVVHSVGPRLFVHEMEGSKFLRGAFSDASVCVTSVANIRNFFLLGDALKGLNLV 2530
Query: 393 RYQ 395
++
Sbjct: 2531 SWE 2533
>gi|389740093|gb|EIM81285.1| hypothetical protein STEHIDRAFT_86633 [Stereum hirsutum FP-91666 SS1]
Length = 1213
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 67/327 (20%), Positives = 138/327 (42%), Gaps = 51/327 (15%)
Query: 318 IKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVK 377
+++++ E ++ G LV +G+ + I+ + L A +++ + ++++S+
Sbjct: 932 LELLHKTETDDIPMSLLAFQGRLVAGIGKALRIYDIGKKKLLRKA--ESKTFASAIISLN 989
Query: 378 ---NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVW 434
+ I+VGD SIA Y+ L + A D + +R + ++V
Sbjct: 990 TQGSRIIVGDMQESIAYAVYKAPENKLLVFADDTQ--------------ARWVTCSTMV- 1034
Query: 435 KFLQLSLGERLE--ICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGH 492
+ ++ G+R ++ SK +D +D+ + ++ +K + M P H
Sbjct: 1035 DYTTVAAGDRFGNIFINRLDSKVSDQVDDDPTGAGILHEKG----ILMGAP--------H 1082
Query: 493 RLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEK-NY 551
+ FH+G V + K +S G R L Y L G +G +PL K +
Sbjct: 1083 KTAMLAHFHVGDLVTSIHK-------VSLVAGGREVLL--YTGLHGTIGMLVPLVSKEDV 1133
Query: 552 RRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERL 611
+ L+ + T + G + A+R GYY P + ++DG L F +L ++
Sbjct: 1134 DFISTLEQHIRTEQTSLVGRDHLAWR-----GYYV--PVKAVVDGDLCETFARLPAAKQS 1186
Query: 612 EICKKIGSKHNDILDELYDIEALSSHF 638
I ++ +++L +L + +S F
Sbjct: 1187 MIAGELDRTVSEVLKKLDQLRVTASGF 1213
>gi|212539802|ref|XP_002150056.1| UV-damaged DNA binding protein, putative [Talaromyces marneffei ATCC
18224]
gi|210067355|gb|EEA21447.1| UV-damaged DNA binding protein, putative [Talaromyces marneffei ATCC
18224]
Length = 1139
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 67/361 (18%), Positives = 136/361 (37%), Gaps = 65/361 (18%)
Query: 281 LGTNYNYSEDV-TCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGF 339
+GT Y E + RGRILLF++ + K+ + KG A+ + +
Sbjct: 833 VGTAYLDDETAESIRGRILLFEVD----------SNRKLSLFLEHPVKGACRALAMMGDY 882
Query: 340 LVTAVGQKIYIWQLKDNDLTG------IAFIDTEVYIASMVSVKNLILVGDYARSIALLR 393
+V A+ + + I+++ TG A T + I+V D +SI+++
Sbjct: 883 IVAALVKTVVIFEVTGQPQTGKYSLQKAAVYRTSTAPVDIAVTDKTIVVADLMKSISIVE 942
Query: 394 YQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGS 453
+ L++ A+ E+ + +
Sbjct: 943 SN-KTDALTMEAK---------------------------------------EVARHFAT 962
Query: 454 KHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIR 513
+ + S +++SD + N+++ + RL ++ LG+ VN R
Sbjct: 963 VWTTAVADIGSNQWLVSDAEGNLIVLRRNVDGMTEEDRRRLEVTSELLLGEMVN-----R 1017
Query: 514 CKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNP 573
+P +I + +++G++ F + ++ L+ LQ + + G +
Sbjct: 1018 IRPVNIPQTSTMAVTPKAFLGTVEGSIYLFALINPEHQDFLMRLQTAISAYVDSPGLMPF 1077
Query: 574 RAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEA 633
FR ++ A P R +DG L+ +FL + EI +GS + + ++ IEA
Sbjct: 1078 NKFRAFRSTVREAEEPFR-FVDGELIERFLDCDRAVQEEILGVVGSGDLESVQKM--IEA 1134
Query: 634 L 634
L
Sbjct: 1135 L 1135
>gi|154421858|ref|XP_001583942.1| CPSF A subunit region family protein [Trichomonas vaginalis G3]
gi|121918186|gb|EAY22956.1| CPSF A subunit region family protein [Trichomonas vaginalis G3]
Length = 1297
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 78/388 (20%), Positives = 151/388 (38%), Gaps = 78/388 (20%)
Query: 269 EGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPG------QPLTKNKIKMIY 322
E ++ L Y+A+G+ + + RG + ++ I + + G +PL N+ IY
Sbjct: 954 EDGITLLNTYLAVGSGFLSQPEKMMRGVLYIYQIRYMQNDEGFNEITLRPLY-NETNKIY 1012
Query: 323 AKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGI-AFIDTEVYIASMVSVKNLIL 381
K P+ I +G++ G +Y+ + + + I AF+ + +S+VS+KN +L
Sbjct: 1013 ----KNPIIEITDNSGYMAIFCGNLLYLMRFFNENTVKIEAFLVGRFFASSIVSLKNYLL 1068
Query: 382 VGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSL 441
D + R++ + L +ARD P S FLQ
Sbjct: 1069 YADSYEGFEVARWRKYGKKLISMARDTMTKLPLSAA------------------FLQ--- 1107
Query: 442 GERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFH 501
E C +G ++ D D N +F A ++ +++K+ F+
Sbjct: 1108 ---YEDC----------------LGGVVFDDDGNAHIFDVDEYAIPADA---VVRKSIFY 1145
Query: 502 LGQHVNTF--FKIRCKPSSISDAPGAR------------SRFLTWYASLDGALGFFLPLP 547
+G + F I+ + P + WY + G +G F P+
Sbjct: 1146 IGGRAISSGQFPIKAVTQATQQNPNEEIDEELLQLQTKIGGHIAWYVTTHGKIGAFTPID 1205
Query: 548 EKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGN----PSRGIIDGSLVWKFL 603
E + +L+ +Q+ + GL+ +R+ K K + + +ID ++ +
Sbjct: 1206 ENDRHKLVGVQS---AYEKSLCGLSHLEYRSGKFKNMIEQDIFNQSPKNVIDCDMLIDLI 1262
Query: 604 QLSLGERLEICKKIGSKHNDILDELYDI 631
+ + + L+ K G + D L EL I
Sbjct: 1263 E-DMPDHLKFATK-GLRTQDFLSELRKI 1288
>gi|156339616|ref|XP_001620212.1| hypothetical protein NEMVEDRAFT_v1g223331 [Nematostella vectensis]
gi|156204813|gb|EDO28112.1| predicted protein [Nematostella vectensis]
Length = 248
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 33/97 (34%), Positives = 53/97 (54%), Gaps = 13/97 (13%)
Query: 17 VQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP--KGALKLRFKKLKVLFVSDRSKRA 74
V+E+L LG R L+ +LLIY+AF +P +G L LRFKKL+ + R K+
Sbjct: 150 VREVLLTGLGYKNRRATLVAVMDQDLLIYEAFSYPTVEGHLNLRFKKLQ-HNIQIREKKP 208
Query: 75 NEQ--------PGLPRGVRISQMRYFSNIAGYQGVFL 103
++ PGL +++ +R F++I+ Y GV +
Sbjct: 209 KQEPKNDSETKPGL--DPKVAMLRVFNDISSYSGVCM 243
>gi|124505011|ref|XP_001351247.1| CPSF (cleavage and polyadenylation specific factor), subunit A,
putative [Plasmodium falciparum 3D7]
gi|7768292|emb|CAB11136.2| CPSF (cleavage and polyadenylation specific factor), subunit A,
putative [Plasmodium falciparum 3D7]
Length = 2870
Score = 50.8 bits (120), Expect = 0.002, Method: Composition-based stats.
Identities = 119/604 (19%), Positives = 223/604 (36%), Gaps = 155/604 (25%)
Query: 95 IAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDG--------PVSTLAPFHNVNCPRGFL 146
I Y +F+C P ++ + ++ +++ + L PFHN FL
Sbjct: 2358 IKKYNFLFVCCESPIIIYSDLKKKINVSKLSLKNIYIVDIFNDFNYLNPFHN------FL 2411
Query: 147 YFNAKSE----------LRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVT 196
F K++ I + P L+ ++K+P T +AYH +T +
Sbjct: 2412 SFKKKNQNNFYFIFYDGSNIHISP--LNQIKKTFLKKIPFHRTVEKIAYHSDTG----LL 2465
Query: 197 STAEPSTDYYKFNGEDKELVT--DP-RDS-RFIPPLVSQFHVSLFSPFSWEEIPQTNFPL 252
A PS + +K N K+++ DP DS ++ + S++ VS + E++ ++NF +
Sbjct: 2466 IAACPSEEKHKTNEMMKQIICFFDPYHDSIKYTYIIPSKYTVSTIIIYDNEKLMKSNFDV 2525
Query: 253 HEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQP 312
S + GT + +N Y+E + G I +F
Sbjct: 2526 -----------TSFIFVGTCN---------SNEKYTEPTS--GHIHIF------------ 2551
Query: 313 LTKNK-----IKMIYAKE-QKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDT 366
+ K K IK IY G VT + +V + + I + + + AF+D
Sbjct: 2552 IAKKKANIFEIKHIYTHNINYGGVTNLVPYDDKIVATINNMVVILDINNLIIKYEAFMDP 2611
Query: 367 E---------------------VYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVA 405
+ +I ++ + I+VGD S+ +L+Y E L V
Sbjct: 2612 QNLQPKIEGNNAIVELVSFTPSSWIMTVDVYGDYIVVGDIMTSVTILQYDYENSQLFEVC 2671
Query: 406 RDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSM 465
RDY S +W C + + S
Sbjct: 2672 RDY----------------------SNIW-------------CTSLCA--------LSKS 2688
Query: 466 GFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGA 525
++SD D N ++ ++L + F+ G +N + +++ +
Sbjct: 2689 HIVVSDMDANFIILQKSKFKYNDEDSYKLSSVSLFNHGSIINKMLPL--SNTNLIEEDYD 2746
Query: 526 RSRFLT-----WYASLDGALGFFLPLPE-KNYRRLLMLQNVMVTHTSHTGGLNPRAFRTY 579
+ LT AS +G++ +P N+++ L ++ + + S G L+ A+R Y
Sbjct: 2747 KRNILTKNDGILCASSEGSISVLIPFSSFANFKKALCIEIAITDNISSIGNLSHNAYREY 2806
Query: 580 KGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLE-------ICKKIGSKHNDILDELYDIE 632
K + +GI+DG L+ F +S ++ + I KKI K + + D+E
Sbjct: 2807 KVN--FRSKHCKGIVDGELLKMFFHMSFEKQYKTFIYAKWIAKKINCKFGSFNNFILDLE 2864
Query: 633 ALSS 636
+ S
Sbjct: 2865 NMCS 2868
>gi|194741158|ref|XP_001953056.1| GF17579 [Drosophila ananassae]
gi|190626115|gb|EDV41639.1| GF17579 [Drosophila ananassae]
Length = 1140
Score = 50.4 bits (119), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 73/333 (21%), Positives = 124/333 (37%), Gaps = 65/333 (19%)
Query: 316 NKIKMIYAKEQKGPVTAICHVAGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASM 373
NK+ + + G A+ G ++ +G ++Y W + +L I + +
Sbjct: 857 NKLTQVAETKVDGTCYALVEFNGKVLAGIGSFVRLYEWT-NEKELRMECNIQNMIAALFL 915
Query: 374 VSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLV 433
+ + ILVGD RSI LL+++ +ARD +P
Sbjct: 916 KAKGDFILVGDLMRSITLLQHKQMEGIFVEIARDCEPK---------------------- 953
Query: 434 WKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR 493
W + +ILD+ + +G + N LF+ Q ++ + R
Sbjct: 954 WM------------------RAVEILDDDTFLG-----SETNGNLFVCQKDSAATTDEER 990
Query: 494 --LIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFL---TWYASLDGALGFFLPLPE 548
L + FHLG VN F S + G R+ + Y + +GA+G +P+
Sbjct: 991 QLLPELARFHLGDTVNVFRH----GSLVMQNVGERTTPINGCVLYGTCNGAIGIVTQIPQ 1046
Query: 549 KNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLG 608
Y L L+ + G + +R ++ PS G IDG L+ FL L
Sbjct: 1047 DFYDFLHGLEERLKKIIKSVGKIEHTYYRNFQINTKV--EPSEGFIDGDLIESFLDLGRD 1104
Query: 609 ------ERLEICKKIGSKHNDILDELYDIEALS 635
+ LEI K D+ D + +E L+
Sbjct: 1105 KMRDAVQGLEITLNGERKSADVEDVIKIVEDLT 1137
>gi|345570887|gb|EGX53705.1| hypothetical protein AOL_s00006g33 [Arthrobotrys oligospora ATCC
24927]
Length = 1133
Score = 50.4 bits (119), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 64/247 (25%), Positives = 99/247 (40%), Gaps = 25/247 (10%)
Query: 372 SMVSVKNLILVGDYARSIALLRYQPEYRTLSL----VARDYKPTQPNSKGYYAGNPSRGI 427
S+ VK IL G ++SI L R+ +L ++ T P S Y + +
Sbjct: 861 SLAIVKGYILAG-LSKSIDLYRFSYTRGSLGASIQQISSIRAATLPVSLSVYG----KRV 915
Query: 428 IDGSLVWKFLQLSLGER--------LEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLF 479
G LV + L + E +E+C++ G L+ + +D D N+VL
Sbjct: 916 FVGDLVKGVMVLEVVEGGGEGNDKLVEVCRQYGVSWVTALEALDEDTCISADSDGNLVLL 975
Query: 480 MYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGA 539
+ R+ ++ LG+ VN IR I+ + + + ++DG
Sbjct: 976 RRESTGATDEDTRRMRPLSEIRLGEMVNC---IRRVNDPITQGYVVQPK--AYLGTVDGG 1030
Query: 540 LGFFLPLPEKNYRRLLM-LQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSL 598
L F L L +Y +LM Q M G L+ +R Y KG P R +DG L
Sbjct: 1031 L-FMLGLIHPDYFDILMKCQVNMAKVIKGIGDLDFNRYRAYNTKGIQPEEPFR-FVDGEL 1088
Query: 599 VWKFLQL 605
V KFL L
Sbjct: 1089 VEKFLDL 1095
>gi|297267724|ref|XP_001082958.2| PREDICTED: DNA damage-binding protein 1 [Macaca mulatta]
Length = 1092
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 70/315 (22%), Positives = 119/315 (37%), Gaps = 78/315 (24%)
Query: 278 YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
Y +GT Y E+ + GRI++F + K++ + KE KG V ++
Sbjct: 828 YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 876
Query: 337 AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
G L+ ++ ++Y W + T + + + + + ILVGD RS+ LL Y
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 935
Query: 395 QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
+P +ARD+ P W
Sbjct: 936 KPMEGNFEEIARDFNPN----------------------WM------------------S 955
Query: 455 HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
+ILD+ + +G + LF+ Q ++ + R L + FHLG+ VN F
Sbjct: 956 AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 1007
Query: 513 RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVM-------- 561
C S + G S + + +++G +G L E Y LL +QN +
Sbjct: 1008 -CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066
Query: 562 -VTHTSHTGGLNPRA 575
+ H+ H L+ RA
Sbjct: 1067 KIEHSFHLEILSHRA 1081
>gi|443918546|gb|ELU38987.1| CPSF A subunit region domain-containing protein [Rhizoctonia solani
AG-1 IA]
Length = 1037
Score = 50.1 bits (118), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 75/347 (21%), Positives = 139/347 (40%), Gaps = 59/347 (17%)
Query: 274 GLRGYIALGTNY-NYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTA 332
G YI GT N E+ GRI+LF GQ +N IK +K+ +G V++
Sbjct: 714 GGNSYILAGTAIINPGENEPLAGRIILF---------GQD-EENMIKFKASKDVEGGVSS 763
Query: 333 ICHVAGFLVTAVGQKIYIWQLKDNDLT---GIAFIDTEVYIASMVSVKNLILVGDYARSI 389
I + ++ A+G IY++ L ++T +A + + ++ N+I+V D RS+
Sbjct: 764 IKQLGARIIAAIGHGIYLYNLGRGEVTISDPVARWERGYIVHDIIVRPNMIVVSDRLRSV 823
Query: 390 ALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICK 449
++LR+ + R P S + +F +++
Sbjct: 824 SVLRF---------IERTSTPESHEEIETEE---------DSTILQFETVAMD-----MH 860
Query: 450 KIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 509
+ ++L + ++ + S D N++ + E + N L + FH G+ ++ F
Sbjct: 861 AVWPTSVEVLPDNKTI--IASQTDGNILTW----ELEDGN----LEPRAAFHTGEIIHKF 910
Query: 510 FKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
K S A R + + + G +G + + + +L L+ + G
Sbjct: 911 IASTAKSS-------AGPRTVAIFVTNTGRIGTLSTVDDADALQLTRLEMKLGDAIKGLG 963
Query: 570 GLNPRAFRTYKGKGYYAGN---PSRGIIDGSLVWKFLQLSLGERLEI 613
+ +R K + G P RG+ DG + KFL+LS E I
Sbjct: 964 NIKHPEWRAP--KLLHTGTKPPPRRGVTDGDFIKKFLELSSEEAKRI 1008
>gi|221040048|dbj|BAH11787.1| unnamed protein product [Homo sapiens]
Length = 1092
Score = 50.1 bits (118), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 70/315 (22%), Positives = 119/315 (37%), Gaps = 78/315 (24%)
Query: 278 YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
Y +GT Y E+ + GRI++F + K++ + KE KG V ++
Sbjct: 828 YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 876
Query: 337 AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
G L+ ++ ++Y W + T + + + + + ILVGD RS+ LL Y
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 935
Query: 395 QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
+P +ARD+ P W
Sbjct: 936 KPMEGNFEEIARDFNPN----------------------WM------------------S 955
Query: 455 HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
+ILD+ + +G + LF+ Q ++ + R L + FHLG+ VN F
Sbjct: 956 AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 1007
Query: 513 RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVM-------- 561
C S + G S + + +++G +G L E Y LL +QN +
Sbjct: 1008 -CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066
Query: 562 -VTHTSHTGGLNPRA 575
+ H+ H L+ RA
Sbjct: 1067 KIEHSFHLEILSHRA 1081
>gi|119594339|gb|EAW73933.1| damage-specific DNA binding protein 1, 127kDa, isoform CRA_a [Homo
sapiens]
Length = 1094
Score = 50.1 bits (118), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 65/290 (22%), Positives = 110/290 (37%), Gaps = 69/290 (23%)
Query: 278 YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
Y +GT Y E+ + GRI++F + K++ + KE KG V ++
Sbjct: 828 YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 876
Query: 337 AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
G L+ ++ ++Y W + T + + + + + ILVGD RS+ LL Y
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 935
Query: 395 QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
+P +ARD+ P W
Sbjct: 936 KPMEGNFEEIARDFNPN----------------------WM------------------S 955
Query: 455 HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
+ILD+ + +G + LF+ Q ++ + R L + FHLG+ VN F
Sbjct: 956 AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 1007
Query: 513 RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQN 559
C S + G S + + +++G +G L E Y LL +QN
Sbjct: 1008 -CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQN 1056
>gi|299751161|ref|XP_001830098.2| pre-mRNA-splicing factor rse1 [Coprinopsis cinerea okayama7#130]
gi|298409248|gb|EAU91763.2| pre-mRNA-splicing factor rse1 [Coprinopsis cinerea okayama7#130]
Length = 1205
Score = 50.1 bits (118), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 66/310 (21%), Positives = 121/310 (39%), Gaps = 46/310 (14%)
Query: 332 AICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIAL 391
A+ G L+ VG+ + I+ L L A + I S+ + + I++GD S
Sbjct: 939 ALLAFQGRLLAGVGKALRIYDLGKKKLLRKAETKSPTAIVSLATQGSRIVIGDMQESTLF 998
Query: 392 LRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLE--ICK 449
Y+ L + D +P R + ++V + +++G++
Sbjct: 999 AVYKEAENRLLIFGDDTQP--------------RWVSAMTMV-DYNTVAVGDKFGNIFVN 1043
Query: 450 KIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 509
++ S +D +DE + ++ +K A + H+ FH+G + +
Sbjct: 1044 RLDSTISDQVDEDPTGAGILHEK------------ATLNGAPHKTKMLAHFHVGDIITSI 1091
Query: 510 FKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEK-NYRRLLMLQNVMVTHTSHT 568
K+ S G R L Y L G +G +PL K + L ML+ +
Sbjct: 1092 HKV-------SLVVGGREVLL--YTGLQGTIGILVPLTSKEDIEFLTMLEQHIRNEQGSL 1142
Query: 569 GGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEL 628
G + ++R GYY P + +IDG L + LS ++ I ++ D+L +L
Sbjct: 1143 VGRDHLSWR-----GYYV--PVKAVIDGDLCETYGGLSSSKQSAIASELDRTVGDVLKKL 1195
Query: 629 YDIEALSSHF 638
+ SS F
Sbjct: 1196 DQMRVASSGF 1205
>gi|326426696|gb|EGD72266.1| hypothetical protein PTSG_00286 [Salpingoeca sp. ATCC 50818]
Length = 1104
Score = 50.1 bits (118), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 36/136 (26%), Positives = 58/136 (42%)
Query: 478 LFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLD 537
L + Q E + L K + +LG+ V +F + ++ D+ + ++
Sbjct: 937 LSVCQREFEPGSTMQTLNAKFEIYLGETVTSFVRAALGSAAAVDSSMPLRNTFFVFGTMG 996
Query: 538 GALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGS 597
G L LPL L L+ M GGL+ R FRT + + A + ++DG
Sbjct: 997 GGLACLLPLTPPQTELLTALECRMEEKIGGLGGLDHREFRTARDEQRMAQQVNPRLVDGD 1056
Query: 598 LVWKFLQLSLGERLEI 613
LV FLQL E+ E+
Sbjct: 1057 LVETFLQLPEEEQKEL 1072
>gi|302837243|ref|XP_002950181.1| UV-damaged DNA binding complex subunit 1 protein [Volvox carteri f.
nagariensis]
gi|300264654|gb|EFJ48849.1| UV-damaged DNA binding complex subunit 1 protein [Volvox carteri f.
nagariensis]
Length = 1104
Score = 49.7 bits (117), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 106/489 (21%), Positives = 169/489 (34%), Gaps = 111/489 (22%)
Query: 130 VSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLET 189
V+ LA FH+ PR L ++ L I VR VPL P +A+H
Sbjct: 714 VAFLASFHSAAFPRS-LAVASEGALTIGTADEIQKLH----VRAVPLGENPRRIAHHEGA 768
Query: 190 KTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTN 249
+ ++T + F L+ D + F V + E+P +
Sbjct: 769 RMLGVLTMRLDSDGSERSF----LRLLDD-----------TTFDVVASYALAPGEMPCS- 812
Query: 250 FPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVT-CRGRILLFDIIEVVPE 308
L W S + L +GT + E+ +GRIL+ + + +V E
Sbjct: 813 --LAAWPG-------SSNGTAAVGALNACFLVGTAFIVPEEPEPTKGRILVLEHVRLVTE 863
Query: 309 PGQ--------PLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTG 360
P K+KI + + K P + C + G V + Y+ +
Sbjct: 864 KEVKGAAYNVLPFVKDKI--LASVNSKVPASG-CDLGGVRVELASECSYLGNI------- 913
Query: 361 IAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYA 420
+Y+A+ NL++VGD RS++LL Y E L A DY NS
Sbjct: 914 -----LALYLATR---GNLVVVGDLMRSVSLLSYNVEQGVLEHRAADY-----NSG---- 956
Query: 421 GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFM 480
W + + LD+ + ++ D N+V+
Sbjct: 957 -------------W------------------TTSVEALDDDT---YLEGDNHLNLVVLR 982
Query: 481 YQPEARESNGGHRLIKKTDFHLGQHVNTF----FKIRCKPSSISDAPGARSRFLTWYASL 536
++ RL ++H G VN F +R S P +
Sbjct: 983 RNADSATDEERARLQVVGEYHTGTFVNRFRHGSLVMRPPDSEFVSLP-----VPLLFGGT 1037
Query: 537 DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDG 596
DG LG LP Y L LQ+ + GGL+ A+ + + A ++G +DG
Sbjct: 1038 DGRLGVIARLPPGLYEMLTKLQSALRQVVRGVGGLSHEAWIAFSNERRTA--DAKGFVDG 1095
Query: 597 SLVWKFLQL 605
L+ FL L
Sbjct: 1096 DLIETFLDL 1104
>gi|347838030|emb|CCD52602.1| similar to DDB1B (Damaged DNA Binding protein 1 B); damaged DNA
binding / protein binding [Botryotinia fuckeliana]
Length = 1157
Score = 49.7 bits (117), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 75/369 (20%), Positives = 138/369 (37%), Gaps = 82/369 (22%)
Query: 281 LGTNYNYSEDVTCRGRILLFDI-IEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGF 339
+GT++ + E+ RGR+L+F + + P MI + KG I + G
Sbjct: 835 VGTSFLHEEEANVRGRLLIFGVNADRAP-----------YMIASHNLKGSCRCIGVLDGK 883
Query: 340 LVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMV-----SVKNLILVGDYARSIALLRY 394
+V A+ + + ++ ++ T Y S N+I V D +SIAL+ Y
Sbjct: 884 IVAALNKTVVMYDYEETSSTSATLKKLATYRCSTCPIDIDITDNIIAVADIMKSIALVEY 943
Query: 395 QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERL-EICKKIGS 453
P DG L ++L E+ +
Sbjct: 944 TPG------------------------------ADG----------LPDKLEEVARHAQQ 963
Query: 454 KHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIR 513
+ + E + ++ +D D N++L E R+ + +LG+ VN +I
Sbjct: 964 VFSTSVAEVDTDTYLETDHDGNLILLKRNREGVTREDKTRMEVTCEMNLGEMVNRVKRIN 1023
Query: 514 CKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHT----- 568
+ S DA FL + +G++ F +P +N L+ LQ+ + + S +
Sbjct: 1024 VETS--KDALLIPRAFL---GTTEGSIYLFSLIPPQNQDLLMRLQSRLASLPSASSIRGS 1078
Query: 569 -------------GGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICK 615
G L+ +R+Y P R +DG L+ +FL L + + + +
Sbjct: 1079 SDSTSPHQIELSPGNLDFNKYRSYISATRETSEPFR-FVDGELIERFLDLEVEVQEHVAE 1137
Query: 616 KIGSKHNDI 624
+G K D+
Sbjct: 1138 GLGVKAEDL 1146
>gi|195145844|ref|XP_002013900.1| GL24391 [Drosophila persimilis]
gi|194102843|gb|EDW24886.1| GL24391 [Drosophila persimilis]
Length = 1140
Score = 49.3 bits (116), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 65/298 (21%), Positives = 112/298 (37%), Gaps = 59/298 (19%)
Query: 316 NKIKMIYAKEQKGPVTAICHVAGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASM 373
+K+ + + G A+ G ++ +G ++Y W + +L I + +
Sbjct: 857 SKLTQVAETKVDGTCYALVEFNGKVLAGIGSFVRLYEWT-NEKELRMECNIQNMIAALYL 915
Query: 374 VSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLV 433
+ + ILVGD RSI LL+++ +ARD +P
Sbjct: 916 KAKGDFILVGDLMRSITLLQHKQMEGIFVEIARDCEPK---------------------- 953
Query: 434 WKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR 493
W + +ILD+ + +G + N LF+ Q ++ + R
Sbjct: 954 WM------------------RAVEILDDDTFLG-----SETNGNLFVCQKDSAATTDEER 990
Query: 494 --LIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFL---TWYASLDGALGFFLPLPE 548
L + FHLG VN F S + G R+ + Y + +GA+G +P+
Sbjct: 991 QLLPELARFHLGDTVNVFRH----GSLVMQNVGERTTPINGCVLYGTCNGAIGIVTQIPQ 1046
Query: 549 KNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLS 606
Y L L+ + G + +R ++ PS G IDG L+ FL LS
Sbjct: 1047 DFYDFLHGLEERLKKIIKSVGKIEHTYYRNFQINTKV--EPSEGFIDGDLIESFLDLS 1102
>gi|66811906|ref|XP_640132.1| CPSF domain-containing protein [Dictyostelium discoideum AX4]
gi|74854972|sp|Q54SA7.1|SF3B3_DICDI RecName: Full=Probable splicing factor 3B subunit 3
gi|60468134|gb|EAL66144.1| CPSF domain-containing protein [Dictyostelium discoideum AX4]
Length = 1256
Score = 49.3 bits (116), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 70/323 (21%), Positives = 135/323 (41%), Gaps = 51/323 (15%)
Query: 317 KIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSV 376
K++++Y E + PV A+ G LV VG+ I I+ + L + +T+ ++V++
Sbjct: 975 KLELLYKTEVEEPVYAMAQFQGKLVCGVGKSIRIYDMGKKKL--LRKCETKNLPNTIVNI 1032
Query: 377 KNL---ILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLV 433
+L ++VGD SI ++Y+ L + A D P S + G
Sbjct: 1033 HSLGDRLVVGDIQESIHFIKYKRSENMLYVFADDLAPRWMTSSVMLDYDTVAG------- 1085
Query: 434 WKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDK-DKNVVLFMYQPEARESNGG- 491
K +I + +ISD+ +++ + E+ NG
Sbjct: 1086 ------------------ADKFGNIF--VLRLPLLISDEVEEDPTGTKLKFESGTLNGAP 1125
Query: 492 HRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEK-N 550
H+L +F +G V T K S + P + Y ++ GA+G +P + +
Sbjct: 1126 HKLDHIANFFVGDTVTTL----NKTSLVVGGPE-----VILYTTISGAIGALIPFTSRED 1176
Query: 551 YRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGER 610
L+ M + G + A+R+Y Y+ P + IIDG L +F L+ ++
Sbjct: 1177 VDFFSTLEMNMRSDCLPLCGRDHLAYRSY----YF---PVKNIIDGDLCEQFSTLNYQKQ 1229
Query: 611 LEICKKIGSKHNDILDELYDIEA 633
L I +++ ++++ +L +I +
Sbjct: 1230 LSISEELSRSPSEVIKKLEEIRS 1252
>gi|392566425|gb|EIW59601.1| hypothetical protein TRAVEDRAFT_167065 [Trametes versicolor FP-101664
SS1]
Length = 1263
Score = 49.3 bits (116), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 69/328 (21%), Positives = 126/328 (38%), Gaps = 68/328 (20%)
Query: 295 GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA-GFLVTAVGQKIYIWQL 353
GRILLF + E G + + + + +G V A+ HV+ G + A+ + ++++
Sbjct: 965 GRILLFSLSS---ENG----VRSLTTVASHKVRGCVYALQHVSEGVIAAAINTSVLLYKI 1017
Query: 354 KDNDLTGIAF---IDTEV------YIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLV 404
++ +L G F +D ++ S+V +LVGD S+++LR + L V
Sbjct: 1018 REGNL-GEGFDRVLDKAAEWNHNHFVTSLVWDGQFLLVGDAISSVSVLRVADDATKLESV 1076
Query: 405 ARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSS 464
ARDY P P + ++ +
Sbjct: 1077 ARDYAPLWPVA-------------------------------------------IESTGN 1093
Query: 465 MGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPG 524
G + ++ D N+ F Q + NG L K +H+ VN K + +S
Sbjct: 1094 GGVIGANSDCNLFSFALQ-RGPQRNG---LEKNGVYHIDDVVNKLIKGALSSADVSQDQA 1149
Query: 525 ARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRT-YKGKG 583
++ + + ++ G +G L + + + LQ M GG+N R +G
Sbjct: 1150 VKAGHVFFTST--GRIGAILDMNDTMSLHMTALQRNMAKSLIGPGGVNHTKRRAPATPRG 1207
Query: 584 YYAGNPSRGIIDGSLVWKFLQLSLGERL 611
+ S G +DG + FL + E+L
Sbjct: 1208 HTDAEASYGFLDGDFLETFLSHAHPEQL 1235
>gi|242803623|ref|XP_002484212.1| UV-damaged DNA binding protein, putative [Talaromyces stipitatus ATCC
10500]
gi|218717557|gb|EED16978.1| UV-damaged DNA binding protein, putative [Talaromyces stipitatus ATCC
10500]
Length = 1140
Score = 49.3 bits (116), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 64/349 (18%), Positives = 130/349 (37%), Gaps = 63/349 (18%)
Query: 281 LGTNYNYSEDV-TCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGF 339
+GT Y E + RGRILLF++ + K+ + KG A+ +
Sbjct: 833 VGTAYLDDETAESIRGRILLFEVD----------SNRKLSLFLEHPVKGACRALAMMGNK 882
Query: 340 LVTAVGQKIYIW------QLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLR 393
+V A+ + + I+ QL + L +A T + + I+V D +SI+++
Sbjct: 883 IVAALVKTVVIFDVERKSQLGKHALKKVAAYRTSTAPVDIAVTDSTIVVADLMKSISIVE 942
Query: 394 YQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGS 453
++T +L E E+ + +
Sbjct: 943 ---SHKTDALTV-------------------------------------EAKEVARHFAT 962
Query: 454 KHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIR 513
+ + S +++SD + N+++ + RL ++ LG+ VN R
Sbjct: 963 VWTTAVADIGSNQWLVSDAEGNLIVLRRNVDGVTEEDRRRLEVTSELLLGEMVN-----R 1017
Query: 514 CKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNP 573
+P +I + +++G++ F + ++ L+ LQ + + G +
Sbjct: 1018 IRPVNILQTSTVAVNPKAFLGTVEGSIYLFALINPEHQDFLMRLQTAITAYVDSPGYMPF 1077
Query: 574 RAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHN 622
FR ++ P R +DG L+ +FL + EI +GS ++
Sbjct: 1078 SKFRAFRSSVREGDEPFR-FVDGELIERFLDCDRPVQEEILGVVGSGYD 1125
>gi|340521192|gb|EGR51427.1| predicted protein [Trichoderma reesei QM6a]
Length = 1161
Score = 49.3 bits (116), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 101/546 (18%), Positives = 196/546 (35%), Gaps = 111/546 (20%)
Query: 97 GYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRI 156
G VF H A L +S G + T D + +APF + P + + +RI
Sbjct: 698 GICNVFATTEH-ASLIYSSEGRIVYSATTADD-ATFVAPFDSEAFPDSIV-LSTDEHIRI 754
Query: 157 SVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTY---CIVTSTAEPSTDYYKFNGEDK 213
H+ + V+ +P+ T +AY K + CI E ++
Sbjct: 755 C----HVDSERLTHVKSLPMHETVRRVAYSPGLKAFGLGCIKKELVE-----------NE 799
Query: 214 ELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLS 273
E+VT R + ++ Q L PF E V C+ + E +
Sbjct: 800 EVVTST--VRLVDEIIFQ---ELGQPFELNASAS-------LELVECV--IRAELPDSNG 845
Query: 274 GLRGYIALGTNY----NYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGP 329
+ +GT++ E RGRI++ + E ++ I + KG
Sbjct: 846 NMTERFLVGTSFVADPGTDEAGETRGRIVVLGVDE----------SRQLYQIASHNLKGV 895
Query: 330 VTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVK-----NLILVGD 384
+ + ++V + + + ++ T + Y + V N+I VGD
Sbjct: 896 CRCLAMLDDYIVAGLSKTVVVYSYAQETSTAASLTKVASYRPASFPVDLDVSGNMIGVGD 955
Query: 385 YARSIALLRYQP----EYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLS 440
+S+ L+ + P + L AR Y+ S
Sbjct: 956 LMQSLTLIEFTPPQDGKMAKLEEKARHYQQAWTTS------------------------- 990
Query: 441 LGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDF 500
+C LDE ++ +D NV++ + EA +L ++
Sbjct: 991 ------VCA---------LDETR---WLEADAQGNVIVLRQRQEAPTEQDRSQLEITSEL 1032
Query: 501 HLGQHVNTFFKIRCKPSSISDAPGARSRFL--TWYASLDGALGFFLPLPEKNYRRLLMLQ 558
++G+ +N K++ APG + + + S++G L + + K L+ Q
Sbjct: 1033 NIGEQINRIRKLQV-------APGENAVVVPKAFLGSIEGTLYLYGDIAPKYQDLLMTFQ 1085
Query: 559 NVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 618
+ + + G L+ +R ++ + +P R +DG ++ +FL L ++ +C+ +G
Sbjct: 1086 SRLQGYIQTPGNLSFDLWRAFRNQAREGESPYR-FVDGEMIERFLDLDESQQELVCEGLG 1144
Query: 619 SKHNDI 624
D+
Sbjct: 1145 PNVEDM 1150
>gi|154303693|ref|XP_001552253.1| hypothetical protein BC1G_08731 [Botryotinia fuckeliana B05.10]
Length = 1087
Score = 49.3 bits (116), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 73/368 (19%), Positives = 135/368 (36%), Gaps = 80/368 (21%)
Query: 281 LGTNYNYSEDVTCRGRILLFDI-IEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGF 339
+GT++ + E+ RGR+L+F + + P MI + KG I + G
Sbjct: 765 VGTSFLHEEEANVRGRLLIFGVNADRAP-----------YMIASHNLKGSCRCIGVLDGK 813
Query: 340 LVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMV-----SVKNLILVGDYARSIALLRY 394
+V A+ + + ++ ++ T Y S N+I V D +SIAL+ Y
Sbjct: 814 IVAALNKTVVMYDYEETSSTSATLKKLATYRCSTCPIDIDITDNIIAVADIMKSIALVEY 873
Query: 395 QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
P D P + E+ +
Sbjct: 874 TP--------GADGLPDKLE-------------------------------EVARHAQQV 894
Query: 455 HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC 514
+ + E + ++ +D D N++L E R+ + +LG+ VN +I
Sbjct: 895 FSTSVAEVDTDTYLETDHDGNLILLKRNREGVTREDKTRMEVTCEMNLGEMVNRVKRINV 954
Query: 515 KPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHT------ 568
+ S DA FL + +G++ F +P +N L+ LQ+ + + S +
Sbjct: 955 ETS--KDALLIPRAFL---GTTEGSIYLFSLIPPQNQDLLMRLQSRLASLPSASSIRGSS 1009
Query: 569 ------------GGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKK 616
G L+ +R+Y P R +DG L+ +FL L + + + +
Sbjct: 1010 DSTSPHQIELSPGNLDFNKYRSYISATRETSEPFR-FVDGELIERFLDLEVEVQEHVAEG 1068
Query: 617 IGSKHNDI 624
+G K D+
Sbjct: 1069 LGVKAEDL 1076
>gi|67516629|ref|XP_658200.1| hypothetical protein AN0596.2 [Aspergillus nidulans FGSC A4]
gi|40747539|gb|EAA66695.1| hypothetical protein AN0596.2 [Aspergillus nidulans FGSC A4]
gi|259489136|tpe|CBF89158.1| TPA: damaged DNA binding protein (Eurofung) [Aspergillus nidulans
FGSC A4]
Length = 1132
Score = 48.9 bits (115), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 37/159 (23%), Positives = 70/159 (44%), Gaps = 8/159 (5%)
Query: 467 FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGAR 526
++ SD + N+++ E + RL + L + VN R +P +I P A
Sbjct: 970 YLESDAEGNLIVLRRNRSGVEEDDRRRLEVTGEICLNEMVN-----RIRPVNIQQLPSAT 1024
Query: 527 SRFLTWYASLDGALGFFLPLPEKNYRRLLM-LQNVMVTHTSHTGGLNPRAFRTYKGKGYY 585
+ A+++G++ + + +Y+ LM LQ M + GG+ +R ++
Sbjct: 1025 VVPRAFLATVEGSI-YLYAIINPDYQDFLMRLQATMASRADSLGGIPFTDYRAFRTMTRQ 1083
Query: 586 AGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 624
A P R +DG L+ +FL + EI +GS ++
Sbjct: 1084 ATEPYR-FVDGELIERFLTCEPAVQKEIVDIVGSSLEEV 1121
>gi|68071595|ref|XP_677711.1| hypothetical protein [Plasmodium berghei strain ANKA]
gi|56497932|emb|CAI04454.1| conserved hypothetical protein [Plasmodium berghei]
Length = 493
Score = 48.9 bits (115), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 113/549 (20%), Positives = 198/549 (36%), Gaps = 128/549 (23%)
Query: 133 LAPFHNVNCPRG-------FLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAY 185
L PFHN N + F++F+ +S+ +HL+ ++K+P T +A+
Sbjct: 26 LNPFHNFNSFKKKNQNNLYFIFFDG-----LSLYISHLNEINETYIQKIPFYRTVEKIAF 80
Query: 186 HLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEI 245
H E+ ++TS P + +K N K+++ F P + F S P +
Sbjct: 81 HKESGL--LITSC--PPEEKHKTNKNLKQIIC------FFNPYQNSFKYSYIIPSKYNV- 129
Query: 246 PQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGT-NYNYSEDVTCRGRILLFDIIE 304
+C+ ++ + S + I +GT N N G I +F
Sbjct: 130 -----------SSICIYQINKDIYPNKSNINTLICVGTANINDRVSEPSSGNIYIF---- 174
Query: 305 VVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGF---LVTAVGQKIYIWQLKD------ 355
+ L + IK IY V I H+ F L++ + + I + D
Sbjct: 175 -FAKKKDNLFE--IKHIYT--HNVNVGGITHLKQFYDKLISTINNTVVILDISDFLINLD 229
Query: 356 -------------ND--LTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRT 400
ND + +A +I S+ ++N I+VGD S+ +L Y T
Sbjct: 230 KYVDNTNKPIKLENDGTIVDVASFTPSSWIMSLDVIENYIVVGDIMTSVTILSYDFNNST 289
Query: 401 LSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILD 460
L+ V RDY S VW +L
Sbjct: 290 LTEVCRDY----------------------SNVWCTFVCAL------------------- 308
Query: 461 EFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSIS 520
S F++SD + N ++F +L + F+ G VN + SS+
Sbjct: 309 --SKSHFLVSDMESNFLVFQKSSIRYNDEDSFKLSRVAFFNHGHVVNKMLPVSL--SSLI 364
Query: 521 DAPGARSRFLTWYASLDGA-----LGFFLPLPE-KNYRRLLMLQNVMVTHTSHTGGLNPR 574
+ A++ L S+ A + +P N+++ L ++ + S G +N
Sbjct: 365 EEEEAQNEILRKKESILCASSEGSISSIIPFSNLTNFKKALCIEIALNDSLSFIGNINNN 424
Query: 575 AFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLE-------ICKKIGSKHNDILDE 627
+ TYK + +G++DG L F + ++ + I KK+ K +
Sbjct: 425 SNNTYKMN--LSEKSCKGVVDGELFKMFFSMPFEKQFKTYIYAKWIGKKLNCKFGTFENF 482
Query: 628 LYDIEALSS 636
+ DIE L S
Sbjct: 483 ILDIENLCS 491
>gi|440639387|gb|ELR09306.1| hypothetical protein GMDG_03874 [Geomyces destructans 20631-21]
Length = 1138
Score = 48.9 bits (115), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 76/346 (21%), Positives = 129/346 (37%), Gaps = 71/346 (20%)
Query: 294 RGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQL 353
RGRIL+F G ++N K+ K KG + + G +V A+ + I +++
Sbjct: 849 RGRILVF---------GVDSSRNPYKIAEYK-VKGACRCLGVIDGKIVAALVKTIVVFEY 898
Query: 354 KDNDLTGIAFIDTEVYIASMVSVK-----NLILVGDYARSIALLRYQP----EYRTLSLV 404
+ T Y S V N I V D +S++L+ Y+ E TL V
Sbjct: 899 TELSGTSARIEKVASYRTSTCPVDLAIEGNTIAVADLMKSVSLVEYRAGTSGEAPTLVEV 958
Query: 405 ARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSS 464
AR ++ VW + E
Sbjct: 959 ARHFQS----------------------VWATAVAHVDE--------------------- 975
Query: 465 MGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPG 524
G++ +D D N+++ A ++ +FHLG+ VN KIR S G
Sbjct: 976 -GWLEADADGNLIVLRRNEAAVTFEDRKKMEVTGEFHLGEQVNRIRKIRVDASE-----G 1029
Query: 525 ARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGY 584
A + A+ +G+L + + + LL LQ + + G + +R+++
Sbjct: 1030 ATVVPRAFLATTEGSLFLYGSVAPASQDLLLRLQQRLAENVETPGNIPFTTYRSFRNAER 1089
Query: 585 YAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHN--DILDEL 628
P R IDG L+ +FL L + +CK + D+++EL
Sbjct: 1090 ETEEPYR-FIDGELIERFLDLDEERQEVVCKGLAKVEEVRDLVEEL 1134
>gi|448528339|ref|XP_003869702.1| hypothetical protein CORT_0D07360 [Candida orthopsilosis Co 90-125]
gi|380354055|emb|CCG23569.1| hypothetical protein CORT_0D07360 [Candida orthopsilosis]
Length = 1170
Score = 48.9 bits (115), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 30/104 (28%), Positives = 53/104 (50%), Gaps = 4/104 (3%)
Query: 533 YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 592
Y L G +G LPL K+ + +L ++ + +++ +N K + YY NP++
Sbjct: 1070 YTGLTGTIGILLPLISKS--EIELLHDLQLEISAYNDKVNVAGKNHAKLRSYY--NPAKN 1125
Query: 593 IIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSS 636
I DG + +L L L E+L+I K++ ++ +L DI SS
Sbjct: 1126 IFDGDFLELYLNLPLDEKLKIAKRLNKSVGEVEKKLNDIRNRSS 1169
>gi|383863765|ref|XP_003707350.1| PREDICTED: DNA damage-binding protein 1-like [Megachile rotundata]
Length = 1138
Score = 48.5 bits (114), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 68/341 (19%), Positives = 127/341 (37%), Gaps = 61/341 (17%)
Query: 304 EVVPEPGQPL----TKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLT 359
E P+ G+ L K+ + KE KG ++ G L+ ++ + +++
Sbjct: 838 ETEPKMGRILLYHWNDGKLTQVAEKEIKGSCYSLVEFNGKLLASINSTVRLFEWTAEKEL 897
Query: 360 GIAFIDTEVYIASMVSVK-NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGY 418
+ IA + K + +LVGD RS+ LL+Y+ + +ARDY P
Sbjct: 898 RLECSHFNNIIALYLKTKGDFVLVGDLMRSLTLLQYKTMEGSFEEIARDYNPN------- 950
Query: 419 YAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVL 478
W +ILD+ + +G + L
Sbjct: 951 ---------------WM------------------TAVEILDDDTFLG-----AENCFNL 972
Query: 479 FMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASL 536
F+ Q ++ ++ R + + FHLG VN F ++ ++ ++ + ++
Sbjct: 973 FVCQKDSAATSEDERQQMQEIGQFHLGDMVNVFRHGSLVMQNLGES-STPTQGCVLFGTV 1031
Query: 537 DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDG 596
GA+G +P Y L L+ + G + R +R++ + G IDG
Sbjct: 1032 SGAIGLVTQIPFTFYEFLRHLEYRLTEVIKSVGKIEHRFWRSFNTE--LKVENCEGFIDG 1089
Query: 597 SLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 631
L+ FL LS + E+ + G + +D+L I
Sbjct: 1090 DLIESFLDLSPDKMAEVAVDLMMDDSSGMRKEATVDDLVKI 1130
>gi|429850956|gb|ELA26181.1| DNA damage-binding protein 1 [Colletotrichum gloeosporioides Nara
gc5]
Length = 1409
Score = 48.1 bits (113), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 110/552 (19%), Positives = 213/552 (38%), Gaps = 105/552 (19%)
Query: 97 GYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRI 156
G VF H + ++ +S G + T + V+ +APF + P + K+ +RI
Sbjct: 671 GISNVFATTEHSSLIY-SSEGRIIYSAATAED-VTYIAPFDSEAFPDAIVLATDKN-VRI 727
Query: 157 SVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELV 216
+ H+ + V +PL+ T +AY K + I T E FN E E+V
Sbjct: 728 A----HIDVERRTHVNPLPLRQTVRRVAYSPALKAFGIGTIRRE------LFNNE--EMV 775
Query: 217 TDPRDSRFIPPLVSQFHVSLFS-PFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGL 275
T S F LV + + + PF + T L + E +
Sbjct: 776 T----SSF--QLVDEIVLGVVGKPFHLDGAATTE---------LVESVIRAELPDSSGQP 820
Query: 276 RGYIALGTNY----NYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVT 331
+GT+Y E+ +GRIL+ G KN +++ + E KG
Sbjct: 821 AERFIVGTSYLADPEMDENSEVKGRILVL---------GVDSDKNPYQIV-SHELKGACR 870
Query: 332 AICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVK-----NLILVGDYA 386
++ + LV + + + ++ + T + + + S V N+I V D
Sbjct: 871 SLAVMGDKLVAGLSKTVVVYDYAEESSTSGSLLKLATFRPSTFPVDLDVNGNMIGVADLM 930
Query: 387 RSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLE 446
+S+ L+ + P A+D GN +R +++ + ++++ + LE
Sbjct: 931 QSMTLIEFIP--------AQD-------------GNKAR-LVERARHFQYIWATAVCHLE 968
Query: 447 ICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHV 506
D+ E + G N+++ P A + ++ ++FHLG+ +
Sbjct: 969 ---------QDLWIEADAQG--------NLMVLRRNPNAPTEHDKKQMEVISEFHLGEQI 1011
Query: 507 NTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTS 566
N + +P + + A+++G++ F + + LL Q +
Sbjct: 1012 N-----KIRPLDVVSGENDPIEPKAFLATIEGSIYVFADIKPEYQSLLLQFQERLAGVIK 1066
Query: 567 HTG-------GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG- 618
G GL+ ++R ++ A P R +DG L+ +FL L G + + + +G
Sbjct: 1067 TLGQADEPGAGLSFMSWRGFRNAKRSADGPFR-FVDGELIERFLDLDAGRQEAVVQGLGP 1125
Query: 619 --SKHNDILDEL 628
+ D+++EL
Sbjct: 1126 TVERMRDLVEEL 1137
>gi|332030156|gb|EGI69950.1| DNA damage-binding protein 1 [Acromyrmex echinatior]
Length = 1138
Score = 48.1 bits (113), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 72/364 (19%), Positives = 135/364 (37%), Gaps = 69/364 (18%)
Query: 278 YIALGTNY-NYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
Y +GT + N E GRILL+ ++ K + KE KG ++
Sbjct: 826 YFVVGTAFINPDETEPKMGRILLYH-----------WSEGKFTQVAEKEIKGSCYSLVEF 874
Query: 337 AGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVK-NLILVGDYARSIALLRYQ 395
G L+ ++ + +++ + IA + K + +LVGD RS+ LL+Y+
Sbjct: 875 NGKLLASINSTVRLFEWTAEKELRLECSHFNNIIALYLKTKGDFVLVGDLMRSLTLLQYK 934
Query: 396 PEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKH 455
+ +ARDY P S
Sbjct: 935 TMEGSFEEIARDYNPNWMTSI--------------------------------------- 955
Query: 456 NDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIR 513
+ILD+ + +G + LF+ Q ++ ++ R + + FHLG VN F
Sbjct: 956 -EILDDDTFLG-----AENCFNLFVCQKDSAATSEDERQQMQEIGQFHLGDMVNVFRHGS 1009
Query: 514 CKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNP 573
++ ++ + + ++ GA+G +P Y L +++ + + G +
Sbjct: 1010 LVMQNLGES-STPTLGCVLFGTVSGAIGLVTQIPVTFYEFLRNMEDRLNSVIKSVGKIEH 1068
Query: 574 RAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDE 627
+R++ + G IDG L+ FL L+ + E+ + G K +D+
Sbjct: 1069 NFWRSFNTE--LKIEQCEGFIDGDLIESFLDLNHDKMAEVAMGLMIDDGSGMKKEATVDD 1126
Query: 628 LYDI 631
L I
Sbjct: 1127 LVKI 1130
>gi|302894051|ref|XP_003045906.1| predicted protein [Nectria haematococca mpVI 77-13-4]
gi|256726833|gb|EEU40193.1| predicted protein [Nectria haematococca mpVI 77-13-4]
Length = 1162
Score = 47.8 bits (112), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 61/339 (17%), Positives = 131/339 (38%), Gaps = 68/339 (20%)
Query: 295 GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLK 354
GRIL+ + E ++ I + KGP + + ++V + + + ++
Sbjct: 872 GRILVLGVDE----------HRQVYQIVSHNLKGPCRCLGMMDDYIVAGLSKTVVVYNYS 921
Query: 355 DNDLTGIAFIDTEVYIASMVSVK-----NLILVGDYARSIALLRYQPEYR----TLSLVA 405
+ + + Y + + V N+I VGD +S++L+ + P L A
Sbjct: 922 QDTSSSGSLEKLAAYRPAALPVDLDISGNMIGVGDLMQSLSLVEFIPAQDGRKAKLEERA 981
Query: 406 RDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSM 465
R Y+P +W +C LDE
Sbjct: 982 RHYEP----------------------IWT---------TSLCH---------LDEER-- 999
Query: 466 GFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGA 525
++ +D N+++ +A RL ++ +G+ +N K+ P+ +
Sbjct: 1000 -WLEADSQGNLIVLQRNADAPTEQDRSRLEVTSEIGIGEQINRIRKLHV-PAGDNSIVHP 1057
Query: 526 RSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYY 585
R+ + AS +G+L + + + L+ Q+ M + G + + +R+++ +
Sbjct: 1058 RA----FLASAEGSLYLYGDIAPQYQDLLMTFQSKMEEYIHAPGNIEFKLWRSFRNENRE 1113
Query: 586 AGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 624
+ P R IDG +V +FL + G++ +C+ +G D+
Sbjct: 1114 SDGPYR-FIDGEMVERFLDMDEGKQELVCEGLGPSVEDM 1151
>gi|83314897|ref|XP_730560.1| multisubunit cleavage/polyadenylation specificity factor subunit A
[Plasmodium yoelii yoelii 17XNL]
gi|23490318|gb|EAA22125.1| CPSF A subunit region, putative [Plasmodium yoelii yoelii]
Length = 863
Score = 47.4 bits (111), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 112/594 (18%), Positives = 214/594 (36%), Gaps = 135/594 (22%)
Query: 101 VFLCGPHPAWLFLTSRGELRAHPMTIDG--------PVSTLAPFHNVNCPRG-------F 145
+F+C +P ++ + ++ ++I L PFHN N + F
Sbjct: 345 LFICSDNPIIIYSDIKKKISLSKVSIKNIFLVDIFNDFDYLNPFHNFNSFKKKNQNNLYF 404
Query: 146 LYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDY 205
++F+ +S+ +HL+ ++K+P T +AYH E+ ++TS P+ +
Sbjct: 405 IFFDG-----LSLYISHLNEINETYIQKIPFYRTVEKIAYHNESGL--LITSC--PTEEK 455
Query: 206 YKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVS 265
+K N K+++ F P + F S P + +C+ ++
Sbjct: 456 HKTNKNLKQIIC------FFNPHQNSFKYSYIIPSKYNVSS------------ICIYQIN 497
Query: 266 MEYEGTLSGLRGYIALGTNYNYSEDVT-----------CRGRILLFDIIEVVPEP----- 309
+ S + I +GT N ++ V+ + + LF+I +
Sbjct: 498 KDIYPNKSNINTLICVGT-ANINDRVSEPSSGHIYIFFAKKKANLFEIKHIYTHNINVGG 556
Query: 310 --------GQPLTKNKIKMIYAKEQKGPVTAICHVAGFL------VTAVGQKIYIWQLKD 355
+ ++ +IY K + I ++ FL V + I + D
Sbjct: 557 ITHLKQFYDKLISTINNTVIYKCVNKKLIVVILDISDFLINLDKYVDNTNKPIKLEN--D 614
Query: 356 NDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNS 415
+ +A +I S+ ++N I+VGD S+ +L Y TL+ V RDY
Sbjct: 615 GTIVDVASFTPSSWIMSLDVIENYIVVGDIMTSVTILSYDFNNSTLTEVCRDY------- 667
Query: 416 KGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKN 475
S VW +L S F++SD + N
Sbjct: 668 ---------------SNVWCTFVCAL---------------------SKSHFLVSDMESN 691
Query: 476 VVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYAS 535
++F +L + F+ G VN + SS+ + A++ L S
Sbjct: 692 FLVFQKSSIRYNDEDSFKLSRVALFNHGHVVNKMLPVSL--SSLIEEEEAQNEILRKKES 749
Query: 536 LDGA-----LGFFLPLPE-KNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNP 589
+ A + +P N+++ L ++ + S +N + TYK +
Sbjct: 750 ILCASSEGSISSIIPFSNLTNFKKALCIEIALNDSLSFIXNINNNSNNTYKMN--LSEKS 807
Query: 590 SRGIIDGSLVWKFLQLSLGERLE-------ICKKIGSKHNDILDELYDIEALSS 636
S+G++DG + F + ++ I KK+ K + + DIE L S
Sbjct: 808 SKGVVDGEVFKMFFSMPFEKQFXTYIYAKWIAKKLNCKFGXFENFMLDIENLCS 861
>gi|269861065|ref|XP_002650248.1| pre-mRNA cleavage and polyadenylation specificity factor
[Enterocytozoon bieneusi H348]
gi|220066338|gb|EED43824.1| pre-mRNA cleavage and polyadenylation specificity factor
[Enterocytozoon bieneusi H348]
Length = 1022
Score = 47.4 bits (111), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 38/139 (27%), Positives = 65/139 (46%), Gaps = 19/139 (13%)
Query: 275 LRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAIC 334
L Y+ + + SED + RI+L+ I+ +V + P K +K+ Y E+K AI
Sbjct: 703 LDDYVVISLSTVDSEDKCTKSRIILYSIVPIVIDNTCP--KKNLKLKYLGEEKIKY-AIH 759
Query: 335 HVAGF----------------LVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKN 378
F L+ VG ++ I++L N+ T I ++ V + ++ V+N
Sbjct: 760 SFDVFYKKKLQNHKYVLSDILLIVGVGTRLMIYELNYNEFTPIGRLEISVGVIAVTVVRN 819
Query: 379 LILVGDYARSIALLRYQPE 397
LIL+GD + L +PE
Sbjct: 820 LILLGDLFTGMELFYLRPE 838
>gi|124506183|ref|XP_001351689.1| conserved Plasmodium protein, unknown function [Plasmodium falciparum
3D7]
gi|23504617|emb|CAD51496.1| conserved Plasmodium protein, unknown function [Plasmodium falciparum
3D7]
Length = 2763
Score = 47.4 bits (111), Expect = 0.026, Method: Composition-based stats.
Identities = 25/85 (29%), Positives = 46/85 (54%), Gaps = 10/85 (11%)
Query: 340 LVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARS--IALLRYQPE 397
++ + KIYI ++ DND T AF+D YI+ + +KN I++ D + I + Y+ +
Sbjct: 2502 ILHCINSKIYIHEVNDNDFTKGAFLDNNFYISDIKIMKNFIIIADLFKGIFINMYNYEEQ 2561
Query: 398 YRTLSLVARDYKPTQPNSKGYYAGN 422
Y + S+++ SK +Y+ N
Sbjct: 2562 YDSRSIISI--------SKNFYSNN 2578
>gi|307205760|gb|EFN83990.1| DNA damage-binding protein 1 [Harpegnathos saltator]
Length = 1138
Score = 47.0 bits (110), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 72/364 (19%), Positives = 134/364 (36%), Gaps = 69/364 (18%)
Query: 278 YIALGTNY-NYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
Y +GT N E GRILL+ + K+ + KE KG ++
Sbjct: 826 YFVVGTALINPDETEPKMGRILLYH-----------WSDGKLTQVAEKEIKGSCYSLVEF 874
Query: 337 AGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVK-NLILVGDYARSIALLRYQ 395
G L+ ++ + +++ + IA + K + +LVGD RS+ LL+Y+
Sbjct: 875 NGKLLASINSTVRLFEWTAEKELRLECSHFNNIIALYLKTKGDFVLVGDLMRSLTLLQYK 934
Query: 396 PEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKH 455
+ +ARDY P S
Sbjct: 935 TMEGSFEEIARDYNPNWMTSI--------------------------------------- 955
Query: 456 NDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIR 513
+ILD+ + +G + LF+ Q ++ ++ R + + FHLG VN F
Sbjct: 956 -EILDDDTFLG-----AENCFNLFVCQKDSAATSEDERQQMQEVGQFHLGDMVNVFRHGS 1009
Query: 514 CKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNP 573
++ ++ + + ++ GA+G +P Y L L++ + + G +
Sbjct: 1010 LVMQNLGES-STPTLGCVLFGTVSGAIGLVTQIPFAFYEFLRNLEDRLNSVIKSVGKIEH 1068
Query: 574 RAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDE 627
+R++ + G IDG L+ FL L+ + E+ + G K +D+
Sbjct: 1069 NFWRSFNTE--LKIEQCEGFIDGDLIESFLDLNHDKMAEVAMGLMIDDGSGMKKEATVDD 1126
Query: 628 LYDI 631
L +
Sbjct: 1127 LVKV 1130
>gi|395330962|gb|EJF63344.1| hypothetical protein DICSQDRAFT_153890 [Dichomitus squalens LYAD-421
SS1]
Length = 1263
Score = 47.0 bits (110), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 67/344 (19%), Positives = 127/344 (36%), Gaps = 67/344 (19%)
Query: 273 SGLRGYIALGTNYNYSEDVT-CRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVT 331
SG ALGT Y E+ +GRILLF + E + + + + G V
Sbjct: 938 SGASPAFALGTVYIRPEEREPSKGRILLFSVSST--EGARGANVRSLHTLASVNVGGCVY 995
Query: 332 AICHVA-GFLVTAVGQKIYIWQLKDND--------LTGIAFIDTEVYIASMVSVKNLILV 382
A+ +++ +V A+ + +++ +N+ L + + ++ ++V ILV
Sbjct: 996 ALANLSENLIVAAINTSVVLFKSTENEAGESTPLSLEKVTEWNHNHFVTNVVVDGERILV 1055
Query: 383 GDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLG 442
GD S+++L++ L +ARDY P P
Sbjct: 1056 GDAISSVSVLKWNERLERLESIARDYGPLWP----------------------------- 1086
Query: 443 ERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTD--F 500
I E + G + ++ D N+ F Q HR + D +
Sbjct: 1087 ---------------IAIEGTGNGLIGANADCNLFSFSLQSVP------HRTYLEKDGVY 1125
Query: 501 HLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNV 560
HL N F + + +++ ++ + + ++ G +G L + + + LQ
Sbjct: 1126 HLNDVTNKFVRGALTSTDVAEDQVVKASHVFFTST--GCIGAILDMNDVTSLHMTALQRN 1183
Query: 561 MVTHTSHTGGLNPRAFRT-YKGKGYYAGNPSRGIIDGSLVWKFL 603
M + GG N R +G+ S G +DG + ++L
Sbjct: 1184 MAKTLTGPGGDNHTKLRAPSTPRGHTDAEASYGFLDGDFLEQYL 1227
>gi|242010743|ref|XP_002426118.1| DNA damage-binding protein, putative [Pediculus humanus corporis]
gi|212510165|gb|EEB13380.1| DNA damage-binding protein, putative [Pediculus humanus corporis]
Length = 1148
Score = 47.0 bits (110), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 75/358 (20%), Positives = 126/358 (35%), Gaps = 64/358 (17%)
Query: 278 YIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA 337
YI N E + +GRIL+F E K+ + KE KG ++
Sbjct: 837 YIVGTAMVNPDESESKQGRILIFQFQE-----------GKLYQVAEKEIKGAAYSLVEFN 885
Query: 338 GFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVK-NLILVGDYARSIALLRYQP 396
G L+ ++ + +++ + I+ + K + ILVGD RS+ LL+Y+
Sbjct: 886 GKLLASINSTVRLFEWTAEQELRLECSHFNNIISLYLKTKGDFILVGDLIRSMTLLQYKT 945
Query: 397 EYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHN 456
+ARD+ P + IID FL L +C+K +
Sbjct: 946 MEGCFEEMARDHNPNWMTAV---------EIIDDD---TFLGAENSFNLFVCQKDSAAAT 993
Query: 457 DILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKP 516
D E R+ + FHLG VN F
Sbjct: 994 D--------------------------EERQQMHAVGM-----FHLGDMVNVFRHGSLVM 1022
Query: 517 SSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAF 576
++ + + + + ++ GA+G + Y L L+ + G + +
Sbjct: 1023 QNVGETSTPTTGCI-LFGTVSGAIGLVTQISANFYNFLHELECKLTEVIKSVGKIKHSFW 1081
Query: 577 RTYKGKGYYAGNPSRGIIDGSLVWKFLQLS------LGERLEICKKIGSKHNDILDEL 628
R++ + P G IDG L+ FL LS + L+I G K +D+L
Sbjct: 1082 RSFTTE--IKTEPCDGFIDGDLIESFLDLSHEKMKEVAAGLQIDNGSGMKQEATVDDL 1137
>gi|328788389|ref|XP_396048.3| PREDICTED: DNA damage-binding protein 1-like isoform 1 [Apis
mellifera]
Length = 1141
Score = 47.0 bits (110), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 96/487 (19%), Positives = 175/487 (35%), Gaps = 92/487 (18%)
Query: 171 VRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVS 230
+R VPL +P +AY ++T+ ++T + D +DS + +
Sbjct: 713 IRTVPLGESPRRIAYQESSQTFGVIT------------------MRVDIQDSSGVSIVRH 754
Query: 231 QFHVSLFSPFSWEEIPQTNFPLHEWEHVLC----LKNVSMEYEGTLSGLRGYIALGTNYN 286
S S I N P +C + N+ + + T L ++ + T Y
Sbjct: 755 SASTQAASTSSSSHIASYNKPTGHTASDICQEIEVHNLLIIDQHTFEVLHAHMLMPTEYA 814
Query: 287 YSEDVTCRGR---------ILLFDIIEVVPEPGQPL----TKNKIKMIYAKEQKGPVTAI 333
S T G L E P+ G+ L + K+ + KE KG ++
Sbjct: 815 LSLISTKLGEDPTSYYIVGTALVHPDETEPKMGRILLYHWSDGKLTQVAEKEIKGSCYSL 874
Query: 334 CHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVK-NLILVGDYARSIALL 392
G L+ ++ + +++ + IA + K + ILVGD RS+ LL
Sbjct: 875 TEFNGKLLASINSTVRLFEWTAEKELRLECSHFNNIIALYLKSKGDFILVGDLMRSLTLL 934
Query: 393 RYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 452
+Y+ +ARDY P W
Sbjct: 935 QYKTMEGCFEEIARDYNPN----------------------WM----------------- 955
Query: 453 SKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFF 510
+ILD+ + +G + LF+ Q ++ ++ R + + FHLG VN F
Sbjct: 956 -TAIEILDDDTFLG-----AENCFNLFVCQKDSAATSEDERQQMQEVGQFHLGDMVNVFR 1009
Query: 511 KIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGG 570
++ ++ ++ + ++ GA+G +P Y L L++ + + G
Sbjct: 1010 HGSLVMQNLGES-STPTQGCVLFGTVSGAIGLVTQIPFIFYEFLRNLEDRLTSVIKSVGK 1068
Query: 571 LNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHNDI 624
+ +R++ + G IDG L+ FL LS + E+ + G K
Sbjct: 1069 IEHNFWRSFNTE--LKIEQCEGFIDGDLIESFLDLSPDKMAEVASGLMIDDPSGMKKEAT 1126
Query: 625 LDELYDI 631
+D+L I
Sbjct: 1127 VDDLVKI 1133
>gi|432089478|gb|ELK23419.1| DNA damage-binding protein 1 [Myotis davidii]
Length = 1047
Score = 46.6 bits (109), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 45/193 (23%), Positives = 76/193 (39%), Gaps = 42/193 (21%)
Query: 278 YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
Y +GT Y E+ + GRI++F + K++ + KE KG V ++
Sbjct: 828 YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 876
Query: 337 AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
G L+ ++ ++Y W + T + + + + + ILVGD RS+ LL Y
Sbjct: 877 NGKLLASINSTVRLYEWTAEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 935
Query: 395 QPEYRTLSLVARDYKPT---------------QPNSKGYYA------------GNPSRGI 427
+P +ARD+ P N+ + P+ G
Sbjct: 936 KPMEGNFEEIARDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDRSFHTERKTEPATGF 995
Query: 428 IDGSLVWKFLQLS 440
IDG L+ FL +S
Sbjct: 996 IDGDLIESFLDIS 1008
>gi|70929162|ref|XP_736684.1| hypothetical protein [Plasmodium chabaudi chabaudi]
gi|56511427|emb|CAH86674.1| hypothetical protein PC302114.00.0 [Plasmodium chabaudi chabaudi]
Length = 276
Score = 46.6 bits (109), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 23/85 (27%), Positives = 45/85 (52%), Gaps = 10/85 (11%)
Query: 340 LVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARS--IALLRYQPE 397
++ + K+YI ++K+ D T AFID Y++ + V+N I++ D + I + Y+ +
Sbjct: 105 VLHCINSKMYIHEIKNKDFTKGAFIDNNFYVSDIKIVRNFIIISDLYKGIFINMYNYEEQ 164
Query: 398 YRTLSLVARDYKPTQPNSKGYYAGN 422
Y + S+++ SK +Y N
Sbjct: 165 YDSRSIISI--------SKNFYNNN 181
>gi|344231825|gb|EGV63707.1| hypothetical protein CANTEDRAFT_134986 [Candida tenuis ATCC 10573]
gi|344231826|gb|EGV63708.1| hypothetical protein CANTEDRAFT_134986 [Candida tenuis ATCC 10573]
Length = 991
Score = 46.2 bits (108), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 29/98 (29%), Positives = 50/98 (51%), Gaps = 9/98 (9%)
Query: 533 YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 592
YA L G +G LP+ E +++ L L + + G + FR GYY N +
Sbjct: 896 YAGLQGTIGILLPISESDFKFLSNLS--IELNKDLLLGRDHMKFR-----GYY--NSTHN 946
Query: 593 IIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYD 630
+IDG ++ KFL+L+ R++I K+ +I +++ D
Sbjct: 947 VIDGDIIEKFLELNASSRIKISNKLNKSVREIENKIND 984
>gi|237839083|ref|XP_002368839.1| hypothetical protein TGME49_067710 [Toxoplasma gondii ME49]
gi|211966503|gb|EEB01699.1| hypothetical protein TGME49_067710 [Toxoplasma gondii ME49]
Length = 2136
Score = 46.2 bits (108), Expect = 0.052, Method: Composition-based stats.
Identities = 27/74 (36%), Positives = 43/74 (58%), Gaps = 3/74 (4%)
Query: 533 YASLDGALGFFLPLP-EKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSR 591
+AS +GA+G L +P E+ + RL +LQ+ + T G L+ AF + K A PS+
Sbjct: 2051 WASSEGAIGHLLQIPDEQTFARLAVLQDAVTKVTKSIGKLSAVAFHSVKVGT--ATVPSK 2108
Query: 592 GIIDGSLVWKFLQL 605
G IDG ++ +FL+
Sbjct: 2109 GFIDGDILERFLEF 2122
>gi|221502136|gb|EEE27880.1| conserved hypothetical protein [Toxoplasma gondii VEG]
Length = 2131
Score = 46.2 bits (108), Expect = 0.052, Method: Composition-based stats.
Identities = 27/74 (36%), Positives = 43/74 (58%), Gaps = 3/74 (4%)
Query: 533 YASLDGALGFFLPLP-EKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSR 591
+AS +GA+G L +P E+ + RL +LQ+ + T G L+ AF + K A PS+
Sbjct: 2046 WASSEGAIGHLLQIPDEQTFARLAVLQDAVTKVTKSIGKLSAVAFHSVKVGT--ATVPSK 2103
Query: 592 GIIDGSLVWKFLQL 605
G IDG ++ +FL+
Sbjct: 2104 GFIDGDILERFLEF 2117
>gi|322706594|gb|EFY98174.1| DNA damage-binding protein 1 [Metarhizium anisopliae ARSEF 23]
Length = 1121
Score = 46.2 bits (108), Expect = 0.054, Method: Compositional matrix adjust.
Identities = 105/558 (18%), Positives = 199/558 (35%), Gaps = 116/558 (20%)
Query: 97 GYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRI 156
G VF H A L ++ G + T D + +APF + P + + S +R+
Sbjct: 639 GTCNVFATTEH-ASLIYSAEGRIIYSATTAD-DATYVAPFDSEAFPNSIV-LSTDSHIRL 695
Query: 157 SVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTY---CIVTSTAEPSTDYYKFNGEDK 213
S H+ + V+ + +K T +AY K + CI K +++
Sbjct: 696 S----HIDKERLTHVKTLSVKETVRRVAYSPTLKVFGLGCI-----------KKELIQNE 740
Query: 214 ELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLS 273
E++T R + ++ Q L PF I T+ L E V + E ++
Sbjct: 741 EVITSS--FRIVDEIIFQ---ELGKPF----IFNTSTSLEMVETV-----IRAELPDSMG 786
Query: 274 GLRGYIALGTNYNYSEDVT----CRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGP 329
L +GT++ +D RGRIL+ + E ++ I + KG
Sbjct: 787 NLAERFIIGTSFITDDDAIEENDTRGRILVLGVDE----------NRQVYQIVSHNLKGA 836
Query: 330 VTAICHVAGFLVTAVGQKIYIWQLKDN-----DLTGIAFIDTEVYIASMVSVKNLILVGD 384
+ + +V + + + ++ + L +A + S+ N+I V D
Sbjct: 837 CRCLGTLGEHIVAGLSKTVVVYHYVEETTVFGSLQKLAAYRPASFPLSLDISGNIIGVVD 896
Query: 385 YARSIALLRYQPEYR----TLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLS 440
+S+ L+ + P L AR Y+P S + G
Sbjct: 897 LMQSLTLVEFIPSEDGSRAKLEETARHYQPGWATSVAHLDG------------------- 937
Query: 441 LGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDF 500
ER ++ +D N+++ PEA +L ++
Sbjct: 938 --ER----------------------WLEADAQGNIIVLQRNPEAPTEQDRSKLEVTSEM 973
Query: 501 HLGQHVNTFFKIRC--------KPSSISDAPGARSRFLTWYASL------DGALGFFLPL 546
++G+ +N K+ P + + G +T + L +G L F +
Sbjct: 974 NIGEQINQIRKLHVASNENAVVSPKAFLGSVGLSETIITCWNQLLMLVQIEGTLYLFGEI 1033
Query: 547 PEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLS 606
LL Q+ + + G ++ +R ++ K P R +DG +V +FL L
Sbjct: 1034 APNYQDLLLTFQSRLQDYIYAPGNVSFNLWRAFRNKAREGDGPFR-FVDGEMVERFLDLD 1092
Query: 607 LGERLEICKKIGSKHNDI 624
++ +C+ +G D+
Sbjct: 1093 EAKQELVCEGLGPSVEDM 1110
>gi|452820919|gb|EME27955.1| splicing factor 3B subunit 3 [Galdieria sulphuraria]
Length = 1294
Score = 46.2 bits (108), Expect = 0.056, Method: Compositional matrix adjust.
Identities = 47/182 (25%), Positives = 80/182 (43%), Gaps = 29/182 (15%)
Query: 469 ISDKDKNVVLFMYQPEA-----RESNGG-------HRLIKKTDFHLGQHVNTFFKIRCKP 516
I DK N+ + PEA ++ GG H + +++G + K+
Sbjct: 1100 IGDKMGNISILRLPPEAGTFIEQDPTGGLLSKEAPHHFQLEACYYVGSVIQCLSKVEW-- 1157
Query: 517 SSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLM-LQNVMVTHTSHTGGLNPRA 575
+ D P L +Y +LDGA+G +PL L L+ + + S G + A
Sbjct: 1158 -TTGDVP------LLFYGTLDGAIGVMIPLRSTLDMELFQALELQLREYRSPLCGRHHLA 1210
Query: 576 FRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALS 635
+R+Y ++ P R +IDG L +F +LSL ++ +I K++ D+ +L D S
Sbjct: 1211 YRSY----FF---PVRHVIDGDLCEEFYRLSLEQQEKIVKELDRSIVDVHRKLEDYRERS 1263
Query: 636 SH 637
H
Sbjct: 1264 PH 1265
>gi|380025901|ref|XP_003696702.1| PREDICTED: LOW QUALITY PROTEIN: DNA damage-binding protein 1-like
[Apis florea]
Length = 1141
Score = 45.8 bits (107), Expect = 0.062, Method: Compositional matrix adjust.
Identities = 96/487 (19%), Positives = 174/487 (35%), Gaps = 92/487 (18%)
Query: 171 VRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVS 230
+R VPL +P +AY ++T+ ++T + D +DS + +
Sbjct: 713 IRTVPLGESPRRIAYQESSQTFGVIT------------------MRVDIQDSSGVSIVRH 754
Query: 231 QFHVSLFSPFSWEEIPQTNFPLHEWEHVLC----LKNVSMEYEGTLSGLRGYIALGTNYN 286
S S I N P +C + N+ + + T L ++ + T Y
Sbjct: 755 SASTQAASTSSSSHIASYNKPTGHTASDICQEIEVHNLLIIDQHTFEVLHAHMLMPTEYA 814
Query: 287 YSEDVTCRGR---------ILLFDIIEVVPEPGQPL----TKNKIKMIYAKEQKGPVTAI 333
S T G L E P+ G+ L + K+ + KE KG ++
Sbjct: 815 LSLISTKLGEDPTSYYIVGTALVHPDETEPKMGRILLYHWSDGKLTQVAEKEXKGSCYSL 874
Query: 334 CHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVK-NLILVGDYARSIALL 392
G L+ ++ + +++ + IA + K + ILVGD RS+ LL
Sbjct: 875 TEFNGKLLASINSTVRLFEWTAEKELRLECSHFNNIIALYLKSKGDFILVGDLMRSLTLL 934
Query: 393 RYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 452
+Y+ +ARDY P W
Sbjct: 935 QYKTMEGCFEEIARDYNPN----------------------WM----------------- 955
Query: 453 SKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFF 510
+ILD+ + +G + LF+ Q ++ ++ R + + FHLG VN F
Sbjct: 956 -TAIEILDDDTFLG-----AENCFNLFVCQKDSAATSEDERQQMQEVGQFHLGDMVNVFR 1009
Query: 511 KIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGG 570
++ ++ ++ ++ GA+G +P Y L L++ + + G
Sbjct: 1010 HGSLVMQNLGES-STPTQGCVLXGTVSGAIGLVTQIPFIFYEFLRNLEDRLTSVIKSVGK 1068
Query: 571 LNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHNDI 624
+ +R++ + G IDG L+ FL LS + E+ + G K
Sbjct: 1069 IEHNFWRSFNTE--LKIEQCEGFIDGDLIESFLDLSPDKMAEVASGLMIDDPSGMKKEAT 1126
Query: 625 LDELYDI 631
+D+L I
Sbjct: 1127 VDDLVKI 1133
>gi|300176205|emb|CBK23516.2| unnamed protein product [Blastocystis hominis]
Length = 702
Score = 45.8 bits (107), Expect = 0.064, Method: Compositional matrix adjust.
Identities = 81/406 (19%), Positives = 152/406 (37%), Gaps = 75/406 (18%)
Query: 249 NFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNY-SEDVTCRGRILLFDIIEVVP 307
PL E LC+ + S+ +GT + E+ +GR+L+ +E
Sbjct: 322 ELPLKPSEIALCVASGSIFPLSNAPERNEVFVVGTAFVLPEENEPSQGRLLVLRAVE--- 378
Query: 308 EPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTE 367
++++++ G +IC G +V V ++ ++ + D + I+ + +E
Sbjct: 379 --------HRLELVAETMLSGGCLSICLFKGKVVCGVNSELQVFDV-DEKTSTISKLASE 429
Query: 368 VYIASMVSVK-----NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGN 422
V S+ S+ I +GD S+ + Y+ + V R + Q
Sbjct: 430 VACISVTSLSPNEADETIALGDILYSVVV------YKLVLEVVRGRQLAQ---------- 473
Query: 423 PSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQ 482
L+ ER ++ + L E S ++ D N+++
Sbjct: 474 --------------LECIASER----RRRDVTALERLPEAQSE-MVVGDAYGNLMVMQVV 514
Query: 483 PEAR--ESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISD------APGARSRFLTWYA 534
EA SN ++ K FHL +N F ++ S D A + F +A
Sbjct: 515 EEADLDRSNPQKIVVTKESFHLDDQINRFVPVQLFRSGAEDKKKEKRAEESEIAFNLAFA 574
Query: 535 SLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGI- 593
++ G +G L ++ +R L ++ M + GGL+ + +R N GI
Sbjct: 575 TVSGRIGMIGALNDREFRMLRAIETAMENVITPVGGLDHKQWR--------CSNTPFGIK 626
Query: 594 -----IDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEAL 634
IDG LV FL+L + +I + ++ L + I+ L
Sbjct: 627 NLAYCIDGDLVEMFLELDDESQAKIADSVSTELRSALSPQFLIDYL 672
>gi|358400469|gb|EHK49795.1| hypothetical protein TRIATDRAFT_146031 [Trichoderma atroviride IMI
206040]
Length = 1161
Score = 45.8 bits (107), Expect = 0.068, Method: Compositional matrix adjust.
Identities = 33/160 (20%), Positives = 72/160 (45%), Gaps = 10/160 (6%)
Query: 467 FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGAR 526
++ +D NV++ EA +L ++ ++G+ +N K++ APG
Sbjct: 999 WLEADAQGNVIVLRQNLEAPTEQDQSQLQVISELNIGEQINRIRKLQV-------APGEN 1051
Query: 527 SRFL--TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGY 584
+ + + S +G L + + K L+ Q+ + + S G L+ +R ++ +
Sbjct: 1052 AIVVPKAFLGSTEGTLYLYGDIAPKYQDLLMTFQSRLQEYISTPGNLSFDLWRAFRNQSR 1111
Query: 585 YAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 624
P R +DG ++ +FL L G++ +C+ +G D+
Sbjct: 1112 EGEAPFR-FVDGEMIERFLDLDEGKQELVCEGLGPSVEDM 1150
>gi|401413996|ref|XP_003886445.1| conserved hypothetical protein [Neospora caninum Liverpool]
gi|325120865|emb|CBZ56420.1| conserved hypothetical protein [Neospora caninum Liverpool]
Length = 2869
Score = 45.8 bits (107), Expect = 0.068, Method: Composition-based stats.
Identities = 41/200 (20%), Positives = 88/200 (44%), Gaps = 31/200 (15%)
Query: 232 FHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDV 291
+ V L+ F ++ P ++ L E VL L V L G+ ++A G SE +
Sbjct: 2418 YEVRLYHEFDLQK-PIGSYTLRTCEEVLSLSFV------VLDGVE-HLAAGVGVPLSETI 2469
Query: 292 TCRGRILLFDIIEVVPEPGQPL-------------TKNKIKMIYAKEQKGPVTAICHV-- 336
C GR+ LF + E P T ++++ GPVT +
Sbjct: 2470 ECSGRLYLFKLPESAMRLASPPRSADTPGDQAEYGTPERLELFADIVLNGPVTVVGSFFS 2529
Query: 337 ----AGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALL 392
++V +VG ++++ +++ + AF D+ V + ++ +++N L+ D + + L+
Sbjct: 2530 SPAERSYVVHSVGPRLFVHEMESSKFLRGAFSDSSVCVTAVANLRNFFLLADALKGLNLV 2589
Query: 393 RY----QPEYRTLSLVARDY 408
+ + + R ++ ++R +
Sbjct: 2590 AWEYHAEADSRKVTRISRTF 2609
>gi|156084934|ref|XP_001609950.1| splicing factor 3b, subunit 3, 130kD [Babesia bovis T2Bo]
gi|154797202|gb|EDO06382.1| splicing factor 3b, subunit 3, 130kD, putative [Babesia bovis]
Length = 1169
Score = 45.8 bits (107), Expect = 0.078, Method: Compositional matrix adjust.
Identities = 37/160 (23%), Positives = 75/160 (46%), Gaps = 16/160 (10%)
Query: 473 DKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTW 532
DK +F+ + ES +L FHLG P+++ A ++S +
Sbjct: 1022 DKFDSIFVTRVPQEESTRHIQLENVCQFHLGD----------LPTAMDKAALSQSTHVVL 1071
Query: 533 YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 592
Y ++ G++G +P K+ L LQ++ + + L R Y+ YY P +
Sbjct: 1072 YGTVMGSIGALVPFQSKD--ELDFLQHLEMLMATEAPPLCGREHSFYRS--YYV--PVQQ 1125
Query: 593 IIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIE 632
++DG L +F L+ ++ ++ +++ + N++L +L DI+
Sbjct: 1126 VVDGDLCEQFRHLTEAQQRKVAQQLDTTVNNVLRKLDDIK 1165
>gi|340714589|ref|XP_003395809.1| PREDICTED: DNA damage-binding protein 1-like [Bombus terrestris]
Length = 1141
Score = 45.4 bits (106), Expect = 0.082, Method: Compositional matrix adjust.
Identities = 95/487 (19%), Positives = 173/487 (35%), Gaps = 92/487 (18%)
Query: 171 VRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVS 230
+R VPL +P +AY ++T+ ++T + D +DS + +
Sbjct: 713 IRTVPLGESPRRIAYQESSQTFGVIT------------------MRVDIQDSSGVSIVRH 754
Query: 231 QFHVSLFSPFSWEEIPQTNFPLHEWEHVLC----LKNVSMEYEGTLSGLRGYIALGTNYN 286
S S I N P +C + N+ + + T L ++ + T Y
Sbjct: 755 SASTQAASTSSSSHIASYNKPTGHTASDICQEIEVHNLLIIDQHTFEVLHAHMLMPTEYA 814
Query: 287 YSEDVTCRGR---------ILLFDIIEVVPEPGQPL----TKNKIKMIYAKEQKGPVTAI 333
S T G L E P+ G+ L + K+ + KE KG ++
Sbjct: 815 LSLISTKLGEDPTSYYIVGTALVHPDETEPKMGRILLYHWSDGKLTQVAEKEIKGSCYSL 874
Query: 334 CHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVK-NLILVGDYARSIALL 392
G L+ ++ + +++ + IA + K + ILVGD RS+ LL
Sbjct: 875 TEFNGKLLASINSTVRLFEWTAEKELRLECSHFNNIIALYLKTKGDFILVGDLMRSLTLL 934
Query: 393 RYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 452
+Y+ +ARDY P W
Sbjct: 935 QYKTMEGCFEEIARDYNPN----------------------WM----------------- 955
Query: 453 SKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFF 510
+ILD+ + +G + LF+ Q ++ ++ R + + FHLG VN F
Sbjct: 956 -TAIEILDDDTFLG-----AENCFNLFVCQKDSAATSEDERQQMQEVGQFHLGDMVNVFR 1009
Query: 511 KIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGG 570
++ ++ ++ + ++ GA+G +P Y L L+ + G
Sbjct: 1010 HGSLVMQNLGES-STPTQGCVLFGTVSGAIGLVTQIPFTFYEFLRNLEERLTGVIKSVGK 1068
Query: 571 LNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHNDI 624
+ +R++ + G IDG L+ FL LS + ++ + G K
Sbjct: 1069 IEHNFWRSFNTE--LKIEQCEGFIDGDLIESFLDLSPNKMADVASGLMIDDPSGMKKEAT 1126
Query: 625 LDELYDI 631
+D+L I
Sbjct: 1127 VDDLVKI 1133
>gi|168031491|ref|XP_001768254.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162680432|gb|EDQ66868.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1391
Score = 45.4 bits (106), Expect = 0.082, Method: Compositional matrix adjust.
Identities = 37/110 (33%), Positives = 53/110 (48%), Gaps = 16/110 (14%)
Query: 110 WLFLTSRGELRAHPMTIDGPVST-LAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAP 168
WL T+R R +I P S+ AP ++V+CP G L F A L + + +
Sbjct: 864 WLLQTARHSQRIAHTSISFPSSSHAAPVNSVDCPNGIL-FVADCSLHL----VEMEHLKR 918
Query: 169 WPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTD 218
V+K+PL TP + YH E+KT ++ TDY G D LV+D
Sbjct: 919 LNVQKLPLGRTPRRVLYHTESKTLIVM------RTDY----GPDGGLVSD 958
>gi|156095699|ref|XP_001613884.1| Splicing factor 3B subunit 3 [Plasmodium vivax Sal-1]
gi|148802758|gb|EDL44157.1| Splicing factor 3B subunit 3, putative [Plasmodium vivax]
Length = 1230
Score = 45.4 bits (106), Expect = 0.085, Method: Compositional matrix adjust.
Identities = 68/318 (21%), Positives = 111/318 (34%), Gaps = 79/318 (24%)
Query: 334 CHVAGFLVTAVGQKIYIWQL-KDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALL 392
C G L+ ++G K+ I+ L K L + D I S+ + I D S+ +
Sbjct: 967 CPFNGRLLASIGNKLRIYALGKKKLLKKCEYKDIPEAIISIKVSGDRIFASDIRESVLIF 1026
Query: 393 RYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 452
Y TL L++ D P R C +I
Sbjct: 1027 FYDANMNTLRLISDDIIP---------------------------------RWITCSEIL 1053
Query: 453 SKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARE------------------SNGGHRL 494
H M +DK +V + EA++ SN RL
Sbjct: 1054 DHHT----------IMAADKFDSVFVLRVPEEAKQEEYGISNKCWYGGEIMAGSNKNRRL 1103
Query: 495 IKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRL 554
FH+G+ V + K++ P+S Y+++ G +G F+P K L
Sbjct: 1104 EHIMSFHVGEIVTSLQKVKLSPTSSE---------CIIYSTIMGTIGAFIPYDNKEELEL 1154
Query: 555 LM-LQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEI 613
L+ ++ T G FR+Y +P + +IDG L +F L + ++
Sbjct: 1155 TQHLEIILRTENPPLCGREHIFFRSYY-------HPVQHVIDGDLCEQFSSLPYDVQRKV 1207
Query: 614 CKKIGSKHNDILDELYDI 631
+ +DIL +L DI
Sbjct: 1208 AADLERTPDDILRKLEDI 1225
>gi|298715583|emb|CBJ28136.1| cleavage and polyadenylation specificity factor CG10110-PA
[Ectocarpus siliculosus]
Length = 1906
Score = 45.4 bits (106), Expect = 0.089, Method: Compositional matrix adjust.
Identities = 28/88 (31%), Positives = 47/88 (53%), Gaps = 7/88 (7%)
Query: 249 NFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYN--YSEDVTCRGRILLFDI--IE 304
+ P+ E+ +C+ V +E G R Y+A+GT N ED RGR++L ++
Sbjct: 1795 SHPMDSDENGVCMTLVRLEQGGAP---RMYVAVGTGMNEPQGEDKAARGRLILLEVDYAY 1851
Query: 305 VVPEPGQPLTKNKIKMIYAKEQKGPVTA 332
+ E G+ K++ ++AKEQ GPV+
Sbjct: 1852 LAREDGKHEHAVKLRQVFAKEQLGPVSG 1879
>gi|350410909|ref|XP_003489174.1| PREDICTED: DNA damage-binding protein 1-like [Bombus impatiens]
Length = 1141
Score = 45.4 bits (106), Expect = 0.092, Method: Compositional matrix adjust.
Identities = 95/487 (19%), Positives = 173/487 (35%), Gaps = 92/487 (18%)
Query: 171 VRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVS 230
+R VPL +P +AY ++T+ ++T + D +DS + +
Sbjct: 713 IRTVPLGESPRRIAYQESSQTFGVIT------------------MRVDIQDSSGVSIVRH 754
Query: 231 QFHVSLFSPFSWEEIPQTNFPLHEWEHVLC----LKNVSMEYEGTLSGLRGYIALGTNYN 286
S S I N P +C + N+ + + T L ++ + T Y
Sbjct: 755 SASTQAASTSSSSHIASYNKPTGHTASDICQEIEVHNLLIIDQHTFEVLHAHMLMPTEYA 814
Query: 287 YSEDVTCRGR---------ILLFDIIEVVPEPGQPL----TKNKIKMIYAKEQKGPVTAI 333
S T G L E P+ G+ L + K+ + KE KG ++
Sbjct: 815 LSLISTKLGEDPTSYYIVGTALVHPDETEPKMGRILLYHWSDGKLTQVAEKEIKGSCYSL 874
Query: 334 CHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVK-NLILVGDYARSIALL 392
G L+ ++ + +++ + IA + K + ILVGD RS+ LL
Sbjct: 875 TEFNGKLLASINSTVRLFEWTAEKELRLECSHFNNIIALYLKTKGDFILVGDLMRSLTLL 934
Query: 393 RYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 452
+Y+ +ARDY P W
Sbjct: 935 QYKTMEGCFEEIARDYNPN----------------------WM----------------- 955
Query: 453 SKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFF 510
+ILD+ + +G + LF+ Q ++ ++ R + + FHLG VN F
Sbjct: 956 -TAIEILDDDTFLG-----AENCFNLFVCQKDSAATSEDERQQMQEVGQFHLGDMVNVFR 1009
Query: 511 KIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGG 570
++ ++ ++ + ++ GA+G +P Y L L+ + G
Sbjct: 1010 HGSLVMQNLGES-STPTQGCVLFGTVSGAIGLVTQIPFTFYEFLRNLEERLTGVIKSVGK 1068
Query: 571 LNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHNDI 624
+ +R++ + G IDG L+ FL LS + ++ + G K
Sbjct: 1069 IEHNFWRSFNTE--LKIEQCEGFIDGDLIESFLDLSPNKMADVASGLMIDDPSGMKKEAT 1126
Query: 625 LDELYDI 631
+D+L I
Sbjct: 1127 VDDLVKI 1133
>gi|389586447|dbj|GAB69176.1| splicing factor 3B subunit 3 [Plasmodium cynomolgi strain B]
Length = 1286
Score = 45.4 bits (106), Expect = 0.095, Method: Compositional matrix adjust.
Identities = 68/318 (21%), Positives = 111/318 (34%), Gaps = 79/318 (24%)
Query: 334 CHVAGFLVTAVGQKIYIWQL-KDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALL 392
C G L+ ++G K+ I+ L K L + D I S+ + I D S+ +
Sbjct: 1023 CPFNGRLLASIGNKLRIYALGKKKLLKKCEYKDIPEAIISIKVSGDRIFASDIRESVLIF 1082
Query: 393 RYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 452
Y TL L++ D P R C +I
Sbjct: 1083 FYDSNMNTLRLISDDIIP---------------------------------RWITCSEIL 1109
Query: 453 SKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARE------------------SNGGHRL 494
H M +DK +V + EA++ SN RL
Sbjct: 1110 DHHT----------IMAADKFDSVFVLRVPEEAKQEEYGISNKCWYGGEIMAGSNKNRRL 1159
Query: 495 IKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRL 554
FH+G+ V + K++ P+S Y+++ G +G F+P K L
Sbjct: 1160 EHIMSFHVGEIVTSLQKVKLSPTSSE---------CIIYSTIMGTIGAFIPYDNKEELEL 1210
Query: 555 LM-LQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEI 613
L+ ++ T G FR+Y +P + +IDG L +F L + ++
Sbjct: 1211 TQHLEIILRTENPPLCGREHIFFRSYY-------HPVQHVIDGDLCEQFSSLPYDVQRKV 1263
Query: 614 CKKIGSKHNDILDELYDI 631
+ +DIL +L DI
Sbjct: 1264 AADLERTPDDILRKLEDI 1281
>gi|410045300|ref|XP_508472.4| PREDICTED: DNA damage-binding protein 1 [Pan troglodytes]
Length = 1107
Score = 45.4 bits (106), Expect = 0.098, Method: Compositional matrix adjust.
Identities = 35/136 (25%), Positives = 61/136 (44%), Gaps = 15/136 (11%)
Query: 278 YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
Y +GT Y E+ + GRI++F + K++ + KE KG V ++
Sbjct: 835 YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 883
Query: 337 AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
G L+ ++ ++Y W + T + + + + + ILVGD RS+ LL Y
Sbjct: 884 NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 942
Query: 395 QPEYRTLSLVARDYKP 410
+P +ARD+ P
Sbjct: 943 KPMEGNFEEIARDFNP 958
>gi|358380497|gb|EHK18175.1| hypothetical protein TRIVIDRAFT_80808 [Trichoderma virens Gv29-8]
Length = 1161
Score = 45.4 bits (106), Expect = 0.098, Method: Compositional matrix adjust.
Identities = 32/158 (20%), Positives = 71/158 (44%), Gaps = 6/158 (3%)
Query: 467 FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGAR 526
++ +D N+++ EA +L ++ ++G+ +N KI+ P+ +A
Sbjct: 999 WLEADAQGNIIVLRQNQEAPTEQDRSQLEITSELNIGEQINRIRKIQVAPAE--NAIVIP 1056
Query: 527 SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYA 586
FL S++G L + + K L+ Q+ + + G L+ +R ++ +
Sbjct: 1057 KAFL---GSIEGTLYLYGDIAPKYQDLLMTFQSRLQEYIQTPGNLSFDTWRAFRNQARDG 1113
Query: 587 GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 624
P R +DG ++ +FL L ++ +C+ +G D+
Sbjct: 1114 EAPFR-FVDGEMIERFLDLDEKQQELVCEGLGPSVEDM 1150
>gi|443894313|dbj|GAC71661.1| hypothetical protein PANT_5d00006 [Pseudozyma antarctica T-34]
Length = 1625
Score = 45.1 bits (105), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 37/147 (25%), Positives = 69/147 (46%), Gaps = 20/147 (13%)
Query: 279 IALGTNYNYSEDV-TCRGRILLFDIIEVVPEPGQPLTKN---KIKMIYAKEQKGPVTAIC 334
+ +GT Y S+ T GR++ FD+ PG TK +++ ++ ++ G V ++
Sbjct: 1254 LVIGTGYIDSQSQETVSGRLVGFDV-----SPGSSRTKEERGRLRRLFEHDENGNVYSVQ 1308
Query: 335 HVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEV---------YIASMVSV--KNLILVG 383
+ L AV ++ I+ + D + ++ +IA +SV + I+VG
Sbjct: 1309 SIGNRLAAAVNSEVKIYSVIDPRRGDASSPKIKIKQRGSWASSFIACSLSVVEPDRIVVG 1368
Query: 384 DYARSIALLRYQPEYRTLSLVARDYKP 410
D RS+ +L P+ +S +ARD P
Sbjct: 1369 DALRSMNVLHVHPQTARVSEIARDCDP 1395
>gi|384080885|dbj|BAM11105.1| damage-specific DNA binding protein 1, 127kDa, partial
[Siebenrockiella crassicollis]
Length = 364
Score = 45.1 bits (105), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 38/137 (27%), Positives = 62/137 (45%), Gaps = 17/137 (12%)
Query: 278 YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
Y +GT Y E+ + GRI++F + K++ + KE KG V ++
Sbjct: 157 YFIVGTAMVYPEEAEPKQGRIVVFH-----------YSDGKLQSLAEKEVKGAVYSMVEF 205
Query: 337 AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVK-NLILVGDYARSIALLR 393
G L+ ++ ++Y W + T + +A V K + ILVGD RS+ LL
Sbjct: 206 NGKLLASINSTVRLYEWTAEKELRTECNHYNN--IMALYVKTKGDFILVGDLMRSVLLLA 263
Query: 394 YQPEYRTLSLVARDYKP 410
Y+P +ARD+ P
Sbjct: 264 YKPMEGNFEEIARDFNP 280
>gi|380488197|emb|CCF37544.1| hypothetical protein CH063_08850 [Colletotrichum higginsianum]
Length = 271
Score = 45.1 bits (105), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 38/157 (24%), Positives = 71/157 (45%), Gaps = 14/157 (8%)
Query: 470 SDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRF 529
+D N+++ P+A + ++ ++FHLG+ VN + P+ + P F
Sbjct: 104 ADAQGNLMVLRRNPDAPTEHDQKQMEVTSEFHLGEQVNKIRPLDITPN--ENDPIVPKAF 161
Query: 530 LTWYASLDGALGFFLPLPEKNYRRLLMLQNVM--VTHT------SHTGGLNPRAFRTYKG 581
L A+++G+L F + + LL Q + V T T GL+ A+R ++
Sbjct: 162 L---ATVEGSLYVFADIKSEYQSLLLQFQERLADVVKTLGQAGGDSTSGLSFMAWRGFRN 218
Query: 582 KGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 618
A P R +DG L+ +FL L ++ + + +G
Sbjct: 219 AKRAADGPFR-FVDGELIERFLDLDEAKQEAVVQGLG 254
>gi|148709424|gb|EDL41370.1| damage specific DNA binding protein 1 [Mus musculus]
Length = 968
Score = 45.1 bits (105), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 35/136 (25%), Positives = 61/136 (44%), Gaps = 15/136 (11%)
Query: 278 YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
Y +GT Y E+ + GRI++F + K++ + KE KG V ++
Sbjct: 828 YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 876
Query: 337 AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
G L+ ++ ++Y W + T + + + + + ILVGD RS+ LL Y
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 935
Query: 395 QPEYRTLSLVARDYKP 410
+P +ARD+ P
Sbjct: 936 KPMEGNFEEIARDFNP 951
>gi|16197726|emb|CAC94909.1| damaged-DNA recognition protein 1 [Mus musculus]
Length = 994
Score = 45.1 bits (105), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 35/136 (25%), Positives = 61/136 (44%), Gaps = 15/136 (11%)
Query: 278 YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
Y +GT Y E+ + GRI++F + K++ + KE KG V ++
Sbjct: 828 YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 876
Query: 337 AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
G L+ ++ ++Y W + T + + + + + ILVGD RS+ LL Y
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 935
Query: 395 QPEYRTLSLVARDYKP 410
+P +ARD+ P
Sbjct: 936 KPMEGNFEEIARDFNP 951
>gi|156389050|ref|XP_001634805.1| predicted protein [Nematostella vectensis]
gi|156221892|gb|EDO42742.1| predicted protein [Nematostella vectensis]
Length = 1157
Score = 44.7 bits (104), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 67/339 (19%), Positives = 122/339 (35%), Gaps = 72/339 (21%)
Query: 278 YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
Y +GT Y + E+ + GR+LLF L++ K+ + KE KG V ++
Sbjct: 841 YYCVGTAYVFPEEPEPKAGRLLLFH-----------LSEGKLVQVAEKEVKGAVYSLVEF 889
Query: 337 AGFLVTAVGQKIYIWQ-LKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQ 395
G ++ + + I++ D + + + + + ILVGD RS+ LL Y
Sbjct: 890 NGKVLAGINSTVSIFEWTADKEFRYECSYYDNILALYLKTKGDFILVGDLMRSMTLLVYL 949
Query: 396 PEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKH 455
P + +A D+ P W
Sbjct: 950 PLEGSFQEIAHDFSPK----------------------WM------------------TA 969
Query: 456 NDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCK 515
+ILD+ + +G ++ N+ A + L +HLG+ VN F
Sbjct: 970 IEILDDDTFLG---AENSYNLFTCTKDSGATTDEERYHLQDAGQYHLGEFVNVFR----H 1022
Query: 516 PSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 572
S + + PG S + + +++G +G + + + L+ +Q + G ++
Sbjct: 1023 GSLVMEHPGDASTPFQGCVLFGTVNGRIGIVAQIAQDLFNFLIQVQKKLNKVIKSVGKID 1082
Query: 573 ------PRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQL 605
P + P+ G IDG L+ FL L
Sbjct: 1083 HSLYPFPHCSNLSHSRKM---EPAHGFIDGDLIESFLDL 1118
>gi|156095578|ref|XP_001613824.1| hypothetical protein [Plasmodium vivax Sal-1]
gi|148802698|gb|EDL44097.1| hypothetical protein, conserved [Plasmodium vivax]
Length = 2213
Score = 44.7 bits (104), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 23/102 (22%), Positives = 54/102 (52%), Gaps = 12/102 (11%)
Query: 323 AKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILV 382
A ++ PV H ++ + K++I ++++ND T AF+D+ ++I+ + +KN ++V
Sbjct: 1937 ANQKSSPVDQNVHCN--ILHCINSKLFIHEVRENDFTKGAFLDSNLFISDIKVMKNFLIV 1994
Query: 383 GDYARS--IALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGN 422
D + I + Y+ ++ + S++ P +K ++ N
Sbjct: 1995 ADLYKGIFINMFNYEQQHDSRSII--------PIAKPFFCAN 2028
>gi|402222132|gb|EJU02199.1| hypothetical protein DACRYDRAFT_21931 [Dacryopinax sp. DJM-731 SS1]
Length = 1209
Score = 44.7 bits (104), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 63/313 (20%), Positives = 127/313 (40%), Gaps = 51/313 (16%)
Query: 332 AICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVK---NLILVGDYARS 388
A+ G LV +G+ + I+ + L + + + + ++V++ + I+VGD A S
Sbjct: 942 ALLSFQGRLVAGIGKALRIFDMGKKRL--LRKCENKSFATAIVTLSTQGSRIIVGDMAES 999
Query: 389 IALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLE-- 446
I Y+P L + A D +P I S + + + G++
Sbjct: 1000 IYFATYKPPENRLLIFADDSQPRW---------------ITASAMVDYDTVCAGDKFGNV 1044
Query: 447 ICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHV 506
++ K + +DE + ++ +K LFM P H+ +++G +
Sbjct: 1045 FVNRLPPKVGEQVDEDPTGAGVLHEKG----LFMGAP--------HKTNMLAHYYVGDII 1092
Query: 507 NTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLP-LPEKNYRRLLMLQNVMVTHT 565
+ K+ A R + Y L G +G +P + +++ + L+ M T
Sbjct: 1093 TSMHKV---------ALVTGGRDIVLYTGLHGTIGVLIPFISKEDVDFIRTLEQHMRTEA 1143
Query: 566 SHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDIL 625
G R TY+G YY P +G++DG L F L ++ I ++ ++++L
Sbjct: 1144 PSLVG---RDHLTYRG--YYV--PVKGVVDGDLCELFSLLPTQKQQSIAGELDRTYSEVL 1196
Query: 626 DELYDIEALSSHF 638
+L + ++ F
Sbjct: 1197 KKLEQLRVTTTGF 1209
>gi|260947152|ref|XP_002617873.1| hypothetical protein CLUG_01332 [Clavispora lusitaniae ATCC 42720]
gi|238847745|gb|EEQ37209.1| hypothetical protein CLUG_01332 [Clavispora lusitaniae ATCC 42720]
Length = 1242
Score = 44.7 bits (104), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 42/172 (24%), Positives = 75/172 (43%), Gaps = 21/172 (12%)
Query: 469 ISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSR 528
++++ +N VL ++ E ++ RL K DF+ V + K S G
Sbjct: 1077 VAEQLENNVLMKFEEETLGASSS-RLDKLCDFYTQDIVTSLHKG-------SFVVGGSES 1128
Query: 529 FLTWYASLDGALGFFLPLPEKNYRRLLM-LQNVMVTHTSHT--------GGLNPRAFRTY 579
+ Y L G +G LPL LLM L+N + + + + G N
Sbjct: 1129 II--YTGLQGTVGILLPLATTQEVDLLMKLENSLRDYFNDSFDDFDNTKQGFNLVGREHL 1186
Query: 580 KGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI 631
K +GYY NP +IDG + +F +L+ ++++ ++ DI ++YD+
Sbjct: 1187 KFRGYY--NPVENVIDGDFIERFFELNPSAQVKLAGRLDKSPRDIERKIYDL 1236
>gi|159470709|ref|XP_001693499.1| predicted protein [Chlamydomonas reinhardtii]
gi|158283002|gb|EDP08753.1| predicted protein [Chlamydomonas reinhardtii]
Length = 279
Score = 44.7 bits (104), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 63/288 (21%), Positives = 104/288 (36%), Gaps = 56/288 (19%)
Query: 362 AFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAG 421
AF D +V+VK+ +L D + + LRY R L +++D+ + G
Sbjct: 33 AFFDLPSLATGLVTVKDYLLASDVHQGLFFLRYSDASRVLEFMSKDFDGRDVLTCGVVIA 92
Query: 422 NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMY 481
P FL L++ + G + D EF +
Sbjct: 93 EPK---------LHFLAADAAGTLQMMEFYGKR--DTNPEFWA----------------- 124
Query: 482 QPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALG 541
G RL H+ + V ++ R+R S +G L
Sbjct: 125 ---------GQRLAPMGLLHVARRVGVAASVQLASRD------GRNRHALLCGSAEGGLS 169
Query: 542 FFLPLPEKNYRRLLMLQNVMVTHT-SHTGGLNPRAFRTY---------KGKGYYAGNPSR 591
F P+P+ L ++ T H GLNPR+FR G+ + A P R
Sbjct: 170 FVAPVPDPQAAARLAALQAHMSATLPHVAGLNPRSFRHRFIRIPKALGGGEHHRAPLPPR 229
Query: 592 G---IIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSS 636
++DG L+ F LS ++ E + +GS +L++L I A ++
Sbjct: 230 NNSGLLDGQLLLGFPHLSRQQQAEAAEAVGSSPQQLLEDLRAIAAAAT 277
>gi|388853409|emb|CCF53029.1| related to UV-damaged DNA-binding protein [Ustilago hordei]
Length = 1508
Score = 44.3 bits (103), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 40/145 (27%), Positives = 68/145 (46%), Gaps = 15/145 (10%)
Query: 279 IALGTNY-NYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA 337
+ +GT Y + E GR+L FD+ + + +++ ++ KEQ G V ++ +
Sbjct: 1135 LVVGTGYISDGEHEVISGRLLGFDVSAGSIRGKE--ERGRLRKLFVKEQAGNVYSVQSIN 1192
Query: 338 GFLVTAVGQKIYIWQLKD---NDLTGIAFIDTE-------VYIASMVSV--KNLILVGDY 385
L TAV ++ I+ + D +D I+ +IA +SV + I+VGD
Sbjct: 1193 NRLATAVNSEVKIYSVVDPRASDEVSAPRINVVQRGSWACSFIACNLSVVEPDQIVVGDA 1252
Query: 386 ARSIALLRYQPEYRTLSLVARDYKP 410
RSI +L P L+ +ARD P
Sbjct: 1253 LRSINVLHVHPYTARLTEIARDCDP 1277
>gi|221061705|ref|XP_002262422.1| splicing factor 3b, subunit 3, 130kd [Plasmodium knowlesi strain H]
gi|193811572|emb|CAQ42300.1| splicing factor 3b, subunit 3, 130kd, putative [Plasmodium knowlesi
strain H]
Length = 1276
Score = 43.9 bits (102), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 37/145 (25%), Positives = 63/145 (43%), Gaps = 17/145 (11%)
Query: 488 SNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLP 547
SN RL +FH+G+ V + K++ P+S Y+++ G +G F+P
Sbjct: 1143 SNKNRRLEHIMNFHVGEIVTSLQKVKLSPTSSE---------CIIYSTIMGTIGAFIPYD 1193
Query: 548 EKNYRRLLM-LQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLS 606
K L L+ ++ T G FR+Y +P + +IDG L +F L
Sbjct: 1194 NKEELELTQHLEIILRTENPPLCGREHIFFRSYY-------HPVQHVIDGDLCEQFSSLP 1246
Query: 607 LGERLEICKKIGSKHNDILDELYDI 631
+ ++ + +DIL +L DI
Sbjct: 1247 YDIQRKVAADLERTPDDILRKLEDI 1271
>gi|449704103|gb|EMD44407.1| DNA-repair binding protein, putative [Entamoeba histolytica KU27]
Length = 1088
Score = 43.9 bits (102), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 61/298 (20%), Positives = 114/298 (38%), Gaps = 15/298 (5%)
Query: 347 KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVAR 406
+I I Q+KD L I D + SM ++ L + + + YQ +
Sbjct: 794 RILIVQIKDGRLEIIFEKDVNGAVYSMKTLLKKYLAMSIEKKLVVFEYQRVITNGEFEVK 853
Query: 407 DYKPTQPNSK--GYYAGNPSRGIIDGSLVWKFLQLSLGER-------LEICKKIGSKHND 457
+ N K G Y I+ G L+ S E+ + + +
Sbjct: 854 LQEKGSCNVKLIGLYVKTLGNKILVGDLMKSISVYSFDNNGNNKNCLTEVSRDFYASYTT 913
Query: 458 ILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPS 517
++ ++ SD + N+++F ES RL H+G+ +N K P+
Sbjct: 914 AIEFVDEDCYLSSDSNSNILIFNTNSTGNESER-FRLNNCAHIHVGECINVMCKGSIAPT 972
Query: 518 SISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQN-VMVTHTSHTGGLNPRAF 576
+ + L + + G +G +P + Y L+ +QN +++ P +
Sbjct: 973 HSTYETVQKKCIL--FGGVTGYIGGICEIPNEIYDVLIKVQNQILLQMKGIVECTTPDNW 1030
Query: 577 RTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEAL 634
+ K + PS IIDGS+V +L++S ++ EI G I D + ++ +L
Sbjct: 1031 K--KVIDDWKRMPSSNIIDGSIVESYLEMSKEKQCEIAHLSGVNEEQISDIIENMISL 1086
>gi|384253371|gb|EIE26846.1| hypothetical protein COCSUDRAFT_52476 [Coccomyxa subellipsoidea
C-169]
Length = 1205
Score = 43.9 bits (102), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 40/145 (27%), Positives = 63/145 (43%), Gaps = 18/145 (12%)
Query: 489 NGG-HRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLP 547
NG H+L +FH+G V + + +P G R L YA++ GA+G LP P
Sbjct: 1072 NGAPHKLEDVVNFHVGDLVTSLQRAVLQP-------GGREVLL--YATVMGAIGAMLPFP 1122
Query: 548 EKNYRRLLM-LQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLS 606
+ L+ + GG R +Y+G + P + +IDG L F QL
Sbjct: 1123 SREDVDFFSHLEMHLRQEHPPMGG---RDHMSYRGSYF----PVKDVIDGDLCEHFSQLP 1175
Query: 607 LGERLEICKKIGSKHNDILDELYDI 631
++ I ++ +IL +L DI
Sbjct: 1176 AAKQKSIADELERTPGEILKKLEDI 1200
>gi|241560031|ref|XP_002400960.1| spliceosomal protein sap, putative [Ixodes scapularis]
gi|215501812|gb|EEC11306.1| spliceosomal protein sap, putative [Ixodes scapularis]
Length = 1019
Score = 43.9 bits (102), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 71/350 (20%), Positives = 143/350 (40%), Gaps = 59/350 (16%)
Query: 292 TCRGRILLFDIIEVVPEPGQPLT-KNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYI 350
CRG LL + P P +P+ ++++++A + TA+C G L+ VG+ + +
Sbjct: 713 VCRGGGLLL-TYRLAPNPEEPMAGPTQLELVHATPVEEAPTALCPFQGRLLAGVGKCLRL 771
Query: 351 WQLKDNDLTGIAFIDTEVYIASMVSVK---NLILVGDYARSIALLRYQPEYRTLSLVARD 407
+ L L + + + ++VS++ N ++V D S LRY+ + L + A D
Sbjct: 772 YDLGRKKL--LRKCENKYIPNAIVSIQAMGNRVVVSDVQESFFFLRYKRQENQLVIFADD 829
Query: 408 YKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGER---LEICKKIGSKHNDILDEFSS 464
P I S + + ++ ++ + I + S +D+
Sbjct: 830 SVPRW---------------ITASCMLDYETVAGADKFGNVSIIRLPSSISDDV------ 868
Query: 465 MGFMISDKDKNVVLFMYQPEARESNGG--HRLIKKTDFHLGQHVNTFFKIRCKPSSISDA 522
D+D + ++ R GG + ++FH+G+ V + K P
Sbjct: 869 ------DEDPTGIKSLWD---RGWLGGSSQKADVISNFHIGETVLSLQKATLIPG----- 914
Query: 523 PGARSRFLTWYASLDGALGFFLPL-PEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKG 581
G+ S Y +L G +G +P +++ L+ M G + +FR+
Sbjct: 915 -GSESLV---YVTLSGTVGVLVPFTAHEDHDFFQHLEMHMRYENPPLCGRDHLSFRS--- 967
Query: 582 KGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI 631
Y+ P + +IDG L +F L ++ I +++ +++ +L DI
Sbjct: 968 -SYF---PVKNVIDGDLCEQFNSLDPSKQKSIAEELDRNPSEVSKKLEDI 1013
>gi|70945139|ref|XP_742421.1| hypothetical protein [Plasmodium chabaudi chabaudi]
gi|56521397|emb|CAH76894.1| conserved hypothetical protein [Plasmodium chabaudi chabaudi]
Length = 435
Score = 43.5 bits (101), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 98/501 (19%), Positives = 173/501 (34%), Gaps = 111/501 (22%)
Query: 171 VRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVS 230
++K+P T +AYH E+ ++TS P + +K N K+++ F P +
Sbjct: 9 IQKIPFYRTVEKIAYHKESGL--LITSC--PPEEKHKTNKNLKQIIC------FFNPHQN 58
Query: 231 QFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGT-NYNYSE 289
F S P + +C+ ++ + S + I +GT N N
Sbjct: 59 SFKYSYIIPSKYNV------------SSICVYQINKDIYPNKSSINTLICVGTANINDRV 106
Query: 290 DVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGF---LVTAVGQ 346
G I +F + +IK IY V I H+ F L+T +
Sbjct: 107 SEPSSGHIYIF-------FAKKKANLFEIKHIYT--HNVNVGGITHLKQFYDKLITTINN 157
Query: 347 KIYIWQL--------------------KDNDLTGIAFIDTEVYIASMVSVKNLILVGDYA 386
+ I + D + +A +I S+ ++N I+VGD
Sbjct: 158 TVVILDISEFLINLDKYVDNTNKPKLENDGTIVDVASFTPSSWIMSLDVIENYIVVGDIM 217
Query: 387 RSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLE 446
S+ +L Y L+ V RDY S VW +L
Sbjct: 218 TSVTILSYDFNNSILTEVCRDY----------------------SNVWCTFVCAL----- 250
Query: 447 ICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHV 506
S F++SD + N ++F +L + F+ G V
Sbjct: 251 ----------------SKSHFLVSDMESNFLVFQKSSIKYNDEDSFKLSRVALFNHGHVV 294
Query: 507 NTFFKIRCKPSSISDAPGA---RSRFLTWYASLDGALGFFLPLPE-KNYRRLLMLQNVMV 562
N + + P R + AS +G++ +P N+++ L ++ +
Sbjct: 295 NKMLPVSLSSLIEEEEPQNEILRKKESILCASSEGSISSIIPFSNLANFKKALCIELALN 354
Query: 563 THTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLE-------ICK 615
S G +N + TYK + +G++DG + F + ++ + I K
Sbjct: 355 DSLSSIGNINDNSNNTYKMN--LSEKSCKGVVDGEVFKMFFSMPFEKQFKTYIYAKWIAK 412
Query: 616 KIGSKHNDILDELYDIEALSS 636
K+ K + + DIE L S
Sbjct: 413 KLNCKFGTFENFMLDIENLCS 433
>gi|328858656|gb|EGG07768.1| hypothetical protein MELLADRAFT_105631 [Melampsora larici-populina
98AG31]
Length = 1216
Score = 43.5 bits (101), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 52/214 (24%), Positives = 89/214 (41%), Gaps = 13/214 (6%)
Query: 372 SMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGS 431
++++ KN I+VGD +SI +L + + +L ++ RDY G + R +
Sbjct: 964 TVLTEKNWIIVGDLYKSIVVLEFDLKKFSLKVLGRDYSAMSVRPIGMIS---DRVFVAAD 1020
Query: 432 LVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGG 491
+ + + ER + G K D +E S+ D D+ + N
Sbjct: 1021 TEFNLFTVEMRER-----QKGLKEEDEDEEGLSVEEEKGDDDEWEEEERRMRVEKVFNDD 1075
Query: 492 HRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRF--LTWYASLDGALGFFLPLPE- 548
H L FHLG++VN FK S+ G ++ + S G +G + L +
Sbjct: 1076 H-LDTVGGFHLGENVN-HFKAGSLVKSLKHFYGQDLKYGGKLIFVSSTGGIGVIIKLEDL 1133
Query: 549 KNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGK 582
K Y+ L L++ + GGL+ FR +K K
Sbjct: 1134 KIYKHLKALEDRLKKEILSIGGLDSTEFRKFKNK 1167
>gi|323447810|gb|EGB03719.1| hypothetical protein AURANDRAFT_72671 [Aureococcus anophagefferens]
Length = 760
Score = 43.5 bits (101), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 49/181 (27%), Positives = 75/181 (41%), Gaps = 20/181 (11%)
Query: 240 FSWEEIPQTNF---PLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSED--VTCR 294
F +E P + L E LC +S++ T R + +GT + E+ C
Sbjct: 413 FLRDEAPYNDVHREALEPLEIPLCCSIISLDSISTYKDQRAHFVVGTAFAAQENDFEPCS 472
Query: 295 GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV-AGFLVTAVGQKIYIWQ- 352
GR+++F GQ + ++ E G V + + A LV AV I+I+
Sbjct: 473 GRMIIF-------RSGQANVAPSV--LFFVEANGAVYDVAAMRASLLVCAVNHAIHIYDP 523
Query: 353 -LKDN---DLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDY 408
++DN L A D V + NLI+VGD RS+ LL + + VA DY
Sbjct: 524 VVRDNRRGHLKPRASYDGLVVALKVQCYGNLIVVGDMMRSVTLLNLIRQKMIIVEVACDY 583
Query: 409 K 409
Sbjct: 584 N 584
>gi|119191318|ref|XP_001246265.1| hypothetical protein CIMG_00036 [Coccidioides immitis RS]
Length = 1072
Score = 43.5 bits (101), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 30/137 (21%), Positives = 60/137 (43%), Gaps = 6/137 (4%)
Query: 467 FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGAR 526
F+++D + N+V+ + R+ ++ LG+ VN R P + +P +
Sbjct: 906 FLVADAEGNLVVLNRDTTGVTEDDRRRMQVTSELRLGEMVN-----RIHPMDLQTSPESP 960
Query: 527 SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYA 586
+ A++DG++ F + L+ LQ+ + + G + +R +K A
Sbjct: 961 VIPKAFLATVDGSIYLFGLISPSAQDTLMRLQSALADFVASPGEIPFNKYRAFKSSVRQA 1020
Query: 587 GNPSRGIIDGSLVWKFL 603
P R +DG L+ +FL
Sbjct: 1021 EEPFR-FVDGELIEQFL 1036
>gi|358338734|dbj|GAA31211.2| DNA damage-binding protein 1, partial [Clonorchis sinensis]
Length = 1515
Score = 43.5 bits (101), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 60/273 (21%), Positives = 99/273 (36%), Gaps = 59/273 (21%)
Query: 171 VRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVS 230
VR VPL+ TP LA ET + ++T E + F P+ S
Sbjct: 769 VRTVPLEETPKRLALQDETGSLGVITYRQEVFQEGSGFK-----------------PVRS 811
Query: 231 QFHVSLFSPFSWEEIPQT----------NFPLHEWEHVLCLKNVSME--------YEGTL 272
+S P S +P+T F E +L +ME + TL
Sbjct: 812 SISLSQKVPKSTSRLPKTAPSSVSATERKFREVEVSSLLIFNKSTMELMFAHSFYFSQTL 871
Query: 273 SGLRGYIA--------------LGTNYNYSEDVT-CRGRILLFDIIEVVPEPGQPLTKNK 317
+ IA +GT + E+V +GRI LF PE +
Sbjct: 872 VEVAVSIASIEPTDGSKSMLYAVGTAFLVEEEVEPSKGRIHLF---HWDPETA------R 922
Query: 318 IKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVK 377
++ + + G V + G L+ A+ + ++ +K++ L + + +
Sbjct: 923 LETVLVHDVNGAVYRLLDFNGRLLAAINSSVRLFDIKEDSLRLACSFNENIIALFLRRKG 982
Query: 378 NLILVGDYARSIALLRYQPEYRTLSLVARDYKP 410
+ +LVGD RS+ LL Y+P + R P
Sbjct: 983 DFVLVGDLMRSLTLLLYRPNVNNFEAIGRHRNP 1015
>gi|156841606|ref|XP_001644175.1| hypothetical protein Kpol_1059p7 [Vanderwaltozyma polyspora DSM
70294]
gi|156114812|gb|EDO16317.1| hypothetical protein Kpol_1059p7 [Vanderwaltozyma polyspora DSM
70294]
Length = 1346
Score = 43.1 bits (100), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 32/51 (62%), Gaps = 3/51 (5%)
Query: 416 KGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI-GSKHNDILDEFSSM 465
+ YYA P + IIDG L +FL L+ ERLEICK + +K DI+ + + M
Sbjct: 1293 RSYYA--PVKNIIDGDLCERFLYLNSNERLEICKNLKDTKPEDIIRQINEM 1341
Score = 42.0 bits (97), Expect = 0.95, Method: Compositional matrix adjust.
Identities = 23/53 (43%), Positives = 34/53 (64%), Gaps = 3/53 (5%)
Query: 580 KGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI-GSKHNDILDELYDI 631
K + YYA P + IIDG L +FL L+ ERLEICK + +K DI+ ++ ++
Sbjct: 1291 KYRSYYA--PVKNIIDGDLCERFLYLNSNERLEICKNLKDTKPEDIIRQINEM 1341
>gi|150865083|ref|XP_001384154.2| hypothetical protein PICST_58642 [Scheffersomyces stipitis CBS
6054]
gi|149386339|gb|ABN66125.2| DNA-repair [Scheffersomyces stipitis CBS 6054]
Length = 541
Score = 42.7 bits (99), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 36/113 (31%), Positives = 55/113 (48%), Gaps = 15/113 (13%)
Query: 445 LEICKKIGSKHNDILDEFSSMGFMISDKDKNVV-LFMYQPEARESNGGHRLIKKTDFHLG 503
++ CK++ D+ +FSS+G +IS + K V L Y SNG + KT
Sbjct: 1 MDECKQLLDSGADVFSKFSSLGRLISLQGKMVTDLIDY------SNGNQSQLSKTTLRPI 54
Query: 504 QHVNTF-------FKIRCKPSSISDAPGARSRFL-TWYASLDGALGFFLPLPE 548
+ V+ F FK R KP S++D + S+F+ T +LD LG +P E
Sbjct: 55 REVDGFLMELSTAFKKRNKPKSVTDMIKSPSKFISTGLHTLDSDLGGGIPTGE 107
>gi|254585271|ref|XP_002498203.1| ZYRO0G04730p [Zygosaccharomyces rouxii]
gi|238941097|emb|CAR29270.1| ZYRO0G04730p [Zygosaccharomyces rouxii]
Length = 1302
Score = 42.7 bits (99), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 23/55 (41%), Positives = 33/55 (60%), Gaps = 3/55 (5%)
Query: 578 TYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI-GSKHNDILDELYDI 631
++K + YYA P R +IDG L FL LSL E+ ++CK+ GS + +L DI
Sbjct: 1245 SFKYRSYYA--PVRNVIDGDLCETFLNLSLSEQTKLCKETSGSNPEGVCKQLNDI 1297
>gi|407044103|gb|EKE42371.1| DNA damage-binding protein, putative [Entamoeba nuttalli P19]
Length = 1088
Score = 42.7 bits (99), Expect = 0.61, Method: Compositional matrix adjust.
Identities = 39/169 (23%), Positives = 74/169 (43%), Gaps = 6/169 (3%)
Query: 467 FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGAR 526
++ SD + N+++F ES RL H+G+ +N K P+ + +
Sbjct: 923 YLSSDSNSNILIFNTNSTGNESER-FRLNNCAHIHVGECINVMCKGSIAPTHSTYETVQK 981
Query: 527 SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQN-VMVTHTSHTGGLNPRAFRTYKGKGYY 585
L + + G +G +P + Y L+ +QN +++ P ++ K +
Sbjct: 982 KCIL--FGGVTGYIGGICEIPNEIYDILIKVQNQILLQMKGIVECTTPDDWK--KVIDDW 1037
Query: 586 AGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEAL 634
PS IIDGS+V +L++S ++ EI G I D + ++ +L
Sbjct: 1038 KRMPSSNIIDGSIVESYLEMSKEKQCEIAHLSGVNEEKISDIIENMISL 1086
>gi|393243160|gb|EJD50676.1| hypothetical protein AURDEDRAFT_112250 [Auricularia delicata
TFB-10046 SS5]
Length = 1140
Score = 42.7 bits (99), Expect = 0.61, Method: Compositional matrix adjust.
Identities = 40/145 (27%), Positives = 64/145 (44%), Gaps = 21/145 (14%)
Query: 281 LGTNY-NYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGF 339
+GT Y SE RGRIL+F +E G LT + G V ++ V G
Sbjct: 824 VGTAYIKDSEMEPSRGRILVFGSLEDSGTGGSWLTA-------FLQVTGAVLSLTSVDGL 876
Query: 340 LVTAVGQKIYIWQLKDNDLTGIAFI-----------DTEVYIASMVSVKNLILVGDYARS 388
+V V + +++L+ N L+ + + S+ + + I +GD S
Sbjct: 877 IVAGVNTAVILYELRRNTLSEAERASHLTLRQKKEWNHNYVVTSLAARGDTIYIGDSVAS 936
Query: 389 IALLRYQPEYRTLSLVARDYKPTQP 413
IA+LR++ E TL +AR + P P
Sbjct: 937 IAILRWKHE--TLHTIARHFGPIFP 959
>gi|183232997|ref|XP_653855.2| damaged DNA binding protein [Entamoeba histolytica HM-1:IMSS]
gi|169801778|gb|EAL48469.2| damaged DNA binding protein, putative [Entamoeba histolytica
HM-1:IMSS]
Length = 1088
Score = 42.7 bits (99), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 57/277 (20%), Positives = 106/277 (38%), Gaps = 15/277 (5%)
Query: 347 KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVAR 406
+I I Q+KD L I D + SM ++ L + + + YQ +
Sbjct: 794 RILIVQIKDGRLEIIFEKDVNGAVYSMKTLLKKYLAMSIEKKLVVFEYQRVITNGEFEVK 853
Query: 407 DYKPTQPNSK--GYYAGNPSRGIIDGSLVWKFLQLSLGER-------LEICKKIGSKHND 457
+ N K G Y I+ G L+ S E+ + + +
Sbjct: 854 LQEKGSCNVKLIGLYVKTLGNKILVGDLMKSISVYSFDNNGNNKNCLTEVSRDFYASYTT 913
Query: 458 ILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPS 517
++ ++ SD + N+++F ES RL H+G+ +N K P+
Sbjct: 914 AIEFVDEDCYLSSDSNSNILIFNTNSTGNESER-FRLNNCAHIHVGECINVMCKGSIAPT 972
Query: 518 SISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQN-VMVTHTSHTGGLNPRAF 576
+ + L + + G +G +P + Y L+ +QN +++ P +
Sbjct: 973 HSTYETVQKKCIL--FGGVTGYIGGICEIPNEIYDVLIKVQNQILLQMKGIVECTTPDDW 1030
Query: 577 RTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEI 613
+ K + PS IIDGS+V +L++S ++ EI
Sbjct: 1031 K--KVIDDWKRMPSSNIIDGSIVESYLEMSKEKQCEI 1065
>gi|342885673|gb|EGU85655.1| hypothetical protein FOXB_03801 [Fusarium oxysporum Fo5176]
Length = 1160
Score = 42.7 bits (99), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 32/161 (19%), Positives = 73/161 (45%), Gaps = 12/161 (7%)
Query: 467 FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGAR 526
++ +D N+V+ +A RL ++ ++G+ +N K+ P A
Sbjct: 998 WLEADSKGNLVVLQRNVDAPTEQDRSRLEITSEMNIGEQINRIRKLHV--------PMAE 1049
Query: 527 SRFL---TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKG 583
+ + + AS +G+L + + + L+ Q+ M + G + + +R+++ +
Sbjct: 1050 NGIVHPRAFLASAEGSLYLYGDIAPQYQDLLMTFQSKMEEYIHVPGSVEFKLWRSFRNEN 1109
Query: 584 YYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 624
+ P R IDG +V +FL + G++ +C+ +G D+
Sbjct: 1110 RESEGPFR-FIDGEMVERFLDMDEGKQELVCEGLGPSIEDM 1149
>gi|327301962|ref|XP_003235673.1| UV-damaged DNA binding protein [Trichophyton rubrum CBS 118892]
gi|326461015|gb|EGD86468.1| UV-damaged DNA binding protein [Trichophyton rubrum CBS 118892]
Length = 1147
Score = 42.4 bits (98), Expect = 0.71, Method: Compositional matrix adjust.
Identities = 35/151 (23%), Positives = 68/151 (45%), Gaps = 6/151 (3%)
Query: 467 FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGAR 526
++++D + N+V+ + RL ++ LG+ VN I + + A AR
Sbjct: 982 YLLADAEGNLVVLQQNITGVTESDRKRLQPTSEIRLGEMVNRIHPIVIQTYT-ETAVSAR 1040
Query: 527 SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYA 586
+ A++DG++ F + LL LQ M + T G + +R ++ + +
Sbjct: 1041 A----LLATVDGSIYLFGLINPTYIDLLLRLQTAMGSITISPGEIPFSKYRAFRTTVHQS 1096
Query: 587 GNPSRGIIDGSLVWKFLQLSLGERLEICKKI 617
P R +DG L+ +FL + G + EI ++
Sbjct: 1097 DEPFR-FVDGELIERFLSCTPGMQEEIVSRL 1126
>gi|242208420|ref|XP_002470061.1| predicted protein [Postia placenta Mad-698-R]
gi|220730961|gb|EED84811.1| predicted protein [Postia placenta Mad-698-R]
Length = 776
Score = 42.4 bits (98), Expect = 0.78, Method: Compositional matrix adjust.
Identities = 31/118 (26%), Positives = 53/118 (44%), Gaps = 20/118 (16%)
Query: 234 VSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTC 293
+ L SP W + F + E V CL V++E + SG++ +IA+GT N C
Sbjct: 411 LELISPEGW--VTMDGFESAQKEFVTCLDCVTLETTSSESGMKDFIAVGTKINCG---AC 465
Query: 294 RGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQ-KGPVTAICHVAGFLVTAVGQKIYI 350
G P +N I + ++ K +TA+C + L++ + QKI++
Sbjct: 466 FGYT--------------PPYRNSILTLKCRDDAKVSITALCGMYNHLISTMDQKIFV 509
>gi|150863836|ref|XP_001382447.2| hypothetical protein PICST_54680 [Scheffersomyces stipitis CBS 6054]
gi|149385092|gb|ABN64418.2| predicted protein [Scheffersomyces stipitis CBS 6054]
Length = 1228
Score = 42.4 bits (98), Expect = 0.86, Method: Compositional matrix adjust.
Identities = 25/104 (24%), Positives = 49/104 (47%), Gaps = 5/104 (4%)
Query: 533 YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 592
Y + G +G LPL K+ + + N + G N K + YY NP +
Sbjct: 1129 YTGIQGTVGLLLPLSTKSEVQFI---NSLEQSLRQQMGFNLLGMDHLKFRSYY--NPVKN 1183
Query: 593 IIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSS 636
+IDG L+ K+ +LS +++I +++ ++ ++ D+ S+
Sbjct: 1184 VIDGDLIEKYYELSQSLKIKIARELNRTPKEVEKKISDLRNRSA 1227
>gi|115490949|ref|XP_001210102.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|114196962|gb|EAU38662.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 908
Score = 42.4 bits (98), Expect = 0.86, Method: Compositional matrix adjust.
Identities = 26/89 (29%), Positives = 44/89 (49%), Gaps = 4/89 (4%)
Query: 98 YQGVFLCGPHPAWLFLTSRGELRAHPMTIDG-PVSTLAPFHNVNCPRGFLYFNAKSELRI 156
+ VF+ G ++ TS H M + G P+ L F N GF++ ++++ LR+
Sbjct: 795 FSSVFMPGMSAGFVLKTSAS--LPHLMRMRGAPIQCLDAF-NSPSGNGFIFLDSENALRM 851
Query: 157 SVLPTHLSYDAPWPVRKVPLKCTPHFLAY 185
LP +D WP+R++P+ LAY
Sbjct: 852 CQLPRETHFDYQWPMRRIPIGEQIDHLAY 880
>gi|116195210|ref|XP_001223417.1| hypothetical protein CHGG_04203 [Chaetomium globosum CBS 148.51]
gi|88180116|gb|EAQ87584.1| hypothetical protein CHGG_04203 [Chaetomium globosum CBS 148.51]
Length = 1127
Score = 42.0 bits (97), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 31/155 (20%), Positives = 62/155 (40%), Gaps = 6/155 (3%)
Query: 470 SDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRF 529
+D N+++ E + R+ ++ +L + VN R + + PGA
Sbjct: 968 ADAQGNLMVLRRNVEGVTAEDKRRMEVTSEINLNEMVN-----RIRTIDVETTPGAMIVP 1022
Query: 530 LTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNP 589
+ +++G + F + LL Q+ + G + R +R ++ P
Sbjct: 1023 KAFLGTVEGGIYMFGTVAPHVQDLLLRFQSRLADVLKTAGDIEFRTYRAFRNAEREGDGP 1082
Query: 590 SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 624
R +DG L+ KFL + + +CK +G D+
Sbjct: 1083 FR-FVDGELLEKFLDVDETTQEAVCKGLGPTVEDM 1116
>gi|399218485|emb|CCF75372.1| unnamed protein product [Babesia microti strain RI]
Length = 575
Score = 41.6 bits (96), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 55/261 (21%), Positives = 99/261 (37%), Gaps = 48/261 (18%)
Query: 169 WPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTD--PRDSRFIP 226
W V+K+PL C + + T TY T+ + DY FN E ++T+ P +S
Sbjct: 206 WCVKKIPLNCRS--MKENSITNTY--NTNAYHSNADYAVFNDESSHIITESQPINSYISD 261
Query: 227 PLVSQFH-----------VSLFSPFSWEEIPQTN----FPLHEWEHVLCLKNV------- 264
SQ + +S ++ + IP+TN FP + C
Sbjct: 262 DAESQINNASNMMYKNNELSSYNNYMESNIPRTNYQDLFPCYTESLTTCFSEQPYHDHQC 321
Query: 265 --SMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLT----KNKI 318
+ + + +GY GTNYNYS L + +P QP+ +N+I
Sbjct: 322 ADNCDNQSFSQIYKGYDVYGTNYNYS---------YLNNEYADLPMYSQPIGYYGYENQI 372
Query: 319 KMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDL-TGIAFI--DTEVYIASMVS 375
+ +Y + P T + + + + N + +A + D Y +++
Sbjct: 373 ENVYTHQL--PYTITTNTENIASSTNNGNVAECSTRSNSCNSSVAELACDKSEYTNELIN 430
Query: 376 VKNLILVGDYARSIALLRYQP 396
L +A I+ +++QP
Sbjct: 431 TNPLFQYNQHASGISGVKFQP 451
>gi|322700871|gb|EFY92623.1| DNA damage-binding protein 1 [Metarhizium acridum CQMa 102]
Length = 1121
Score = 41.2 bits (95), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 32/172 (18%), Positives = 69/172 (40%), Gaps = 15/172 (8%)
Query: 467 FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC--------KPSS 518
++ +D N+++ PEA +L ++ ++G+ +N ++ P +
Sbjct: 940 WLEADAQGNIIVLQRNPEAPTEQDRSKLEVTSEINIGEQINQIRRLHVASNENAVVSPKA 999
Query: 519 ISDAPGARSRFLTWYASL------DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 572
+ G + + L +G L F + K LL Q + + G ++
Sbjct: 1000 FLGSVGLSETTINCWTQLLILVQIEGTLYLFGEIAPKYQDLLLTFQARLQDYIYAPGNVS 1059
Query: 573 PRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 624
+R ++ K P R +DG +V +FL L ++ +C+ +G D+
Sbjct: 1060 FNLWRAFRNKAREGDGPFR-FVDGEMVERFLDLDEAKQELVCEGLGPSVEDM 1110
>gi|340367933|ref|XP_003382507.1| PREDICTED: splicing factor 3B subunit 3-like isoform 1 [Amphimedon
queenslandica]
Length = 1214
Score = 40.8 bits (94), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 34/135 (25%), Positives = 60/135 (44%), Gaps = 17/135 (12%)
Query: 498 TDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLM- 556
T +H+G+ +NT K +S PG + Y +L G++G +P K
Sbjct: 1090 TSYHVGEGINTLHK-------VSLIPGGSEVLV--YTTLSGSIGILVPFSSKEDSDFFQH 1140
Query: 557 LQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKK 616
L+ M + S+ G + +FR+ YY P + +IDG L + L +R EI
Sbjct: 1141 LEMHMRSEWSNLVGRDHLSFRS-----YYV--PVKSVIDGDLCEVYNSLDPSKRREIALD 1193
Query: 617 IGSKHNDILDELYDI 631
+ +++ +L D+
Sbjct: 1194 LDRSPSEVAKKLEDL 1208
>gi|221057087|ref|XP_002259681.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
knowlesi strain H]
gi|193809753|emb|CAQ40455.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
knowlesi strain H]
Length = 2256
Score = 40.8 bits (94), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 20/85 (23%), Positives = 46/85 (54%), Gaps = 10/85 (11%)
Query: 340 LVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARS--IALLRYQPE 397
++ + K++I ++ +ND T AF++ +I+ + +KN +V D R I++ Y+ +
Sbjct: 1995 ILHCMNSKLFIHEVSENDFTKGAFLENNFFISDIKILKNFFIVADLHRGIFISMYNYEQQ 2054
Query: 398 YRTLSLVARDYKPTQPNSKGYYAGN 422
Y + S++ P +K +++ N
Sbjct: 2055 YDSRSII--------PIAKPFFSSN 2071
>gi|82541417|ref|XP_724950.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
gi|23479780|gb|EAA16515.1| CPSF A subunit region, putative [Plasmodium yoelii yoelii]
Length = 2227
Score = 40.4 bits (93), Expect = 2.9, Method: Composition-based stats.
Identities = 19/68 (27%), Positives = 37/68 (54%), Gaps = 2/68 (2%)
Query: 340 LVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARS--IALLRYQPE 397
L+ KIYI ++K+ND AF+D YI+ + +N I++ D + I + Y+ +
Sbjct: 2005 LLHCTNSKIYIHEIKNNDFIKGAFLDNNFYISDIKIFRNFIIISDLYKGIYINMYSYEEQ 2064
Query: 398 YRTLSLVA 405
Y + +++
Sbjct: 2065 YDSRRIIS 2072
>gi|70954357|ref|XP_746229.1| hypothetical protein [Plasmodium chabaudi chabaudi]
gi|56526771|emb|CAH77136.1| hypothetical protein PC000016.02.0 [Plasmodium chabaudi chabaudi]
Length = 372
Score = 40.4 bits (93), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 44/195 (22%), Positives = 79/195 (40%), Gaps = 38/195 (19%)
Query: 456 NDILDEFSSMGFMISDKDKNVVLFMYQPEARESN---------GGHRLIKKT-------- 498
++ILD + M +DK +V + EA++ GG + T
Sbjct: 192 SEILDHHTIMA---ADKFDSVFILRVPEEAKQEEYGIANKCWYGGEVISSSTKNRKMEHI 248
Query: 499 -DFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLM- 556
FH+G+ V + K++ P+S Y+++ G +G F+P K L
Sbjct: 249 MSFHIGEIVTSLQKVKLSPASSE---------CIIYSTIMGTIGAFIPYDNKEELELTQH 299
Query: 557 LQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKK 616
L+ ++ T G FR+Y +P + +IDG L +F L + ++
Sbjct: 300 LEIILRTEKHALCGREHIFFRSYY-------HPVQHVIDGDLCEQFSSLPFDVQRKVASD 352
Query: 617 IGSKHNDILDELYDI 631
+ ++IL +L DI
Sbjct: 353 LEKTPDEILRKLEDI 367
>gi|322787057|gb|EFZ13281.1| hypothetical protein SINV_13198 [Solenopsis invicta]
Length = 986
Score = 39.7 bits (91), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 34/135 (25%), Positives = 58/135 (42%), Gaps = 13/135 (9%)
Query: 278 YIALGTNY-NYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
Y +GT + N E GRILL+ ++ K + KE KG ++
Sbjct: 825 YFVVGTAFINPDETEPKMGRILLYH-----------WSEGKFTQVAEKEIKGSCYSLVEF 873
Query: 337 AGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVK-NLILVGDYARSIALLRYQ 395
G L+ ++ + +++ + IA + K + +LVGD RS+ LL+Y+
Sbjct: 874 NGKLLASINSTVRLFEWTAEKELRLECSHFNNIIALYLKTKGDFVLVGDLMRSLTLLQYK 933
Query: 396 PEYRTLSLVARDYKP 410
+ +ARDY P
Sbjct: 934 TMEGSFEEIARDYNP 948
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.322 0.139 0.428
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 10,737,815,892
Number of Sequences: 23463169
Number of extensions: 475138795
Number of successful extensions: 820092
Number of sequences better than 100.0: 681
Number of HSP's better than 100.0 without gapping: 380
Number of HSP's successfully gapped in prelim test: 301
Number of HSP's that attempted gapping in prelim test: 817301
Number of HSP's gapped (non-prelim): 1777
length of query: 638
length of database: 8,064,228,071
effective HSP length: 149
effective length of query: 489
effective length of database: 8,863,183,186
effective search space: 4334096577954
effective search space used: 4334096577954
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 80 (35.4 bits)