BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= psy92
         (638 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|350413821|ref|XP_003490124.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Bombus impatiens]
          Length = 1417

 Score =  773 bits (1995), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 369/631 (58%), Positives = 459/631 (72%), Gaps = 46/631 (7%)

Query: 14   ETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGALKLRFKKLK--VLFVSDRS 71
            E  V+E+L V+LG HGNRP+LLVR   EL IYQA+R+PKG LKLRFKKL   ++    R 
Sbjct: 827  EMQVREILMVALGHHGNRPMLLVRLDSELQIYQAYRYPKGHLKLRFKKLDHGIIPGQLRP 886

Query: 72   KRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVS 131
            K  +E   +    R   MRYFSNIAGY GVF+C  +P W+FLT RGELR HPM IDGPV+
Sbjct: 887  KPRDEDIPMMNETRHCMMRYFSNIAGYNGVFICSDYPHWIFLTGRGELRTHPMGIDGPVT 946

Query: 132  TLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKT 191
            + APF+N+NCP+GFLYFN K ELRI VLPTHLSYDAPWPVRKVPL+CTPHF+ YHLE+KT
Sbjct: 947  SFAPFNNINCPQGFLYFNRKEELRICVLPTHLSYDAPWPVRKVPLRCTPHFVTYHLESKT 1006

Query: 192  YCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFP 251
            YC++TS AEP   YY+FNGEDKE   + R  RFI P   QF + LFSP SWE IP T   
Sbjct: 1007 YCVITSIAEPLKSYYRFNGEDKEFTEEERPERFIYPSQEQFSIVLFSPVSWETIPNTKIE 1066

Query: 252  LHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQ 311
            L +WEHV CLKNVS+ YEGT SGL+GYI LGTNYNY ED+T RGRIL+FDIIEVVPEPGQ
Sbjct: 1067 LDQWEHVTCLKNVSLAYEGTRSGLKGYIVLGTNYNYGEDITSRGRILIFDIIEVVPEPGQ 1126

Query: 312  PLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIA 371
            PLTKN+ K IYAKEQKGP+TAI  V+GFLV+AVGQKIYIWQLKDNDL G+AFIDT++YI 
Sbjct: 1127 PLTKNRFKQIYAKEQKGPITAITQVSGFLVSAVGQKIYIWQLKDNDLVGVAFIDTQIYIH 1186

Query: 372  SMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGS 431
             M+S+K+LIL+ D  +SI+LLR+Q EYRTLSLV+RD++P +  +  Y   N         
Sbjct: 1187 QMLSIKSLILIADVYKSISLLRFQEEYRTLSLVSRDFRPAEVYTIEYLIDN--------- 1237

Query: 432  LVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGG 491
                                           +++GF+++D + N+ LFMYQPE+RES GG
Sbjct: 1238 -------------------------------TNLGFLVADGESNMALFMYQPESRESLGG 1266

Query: 492  HRLIKKTDFHLGQHVNTFFKIRCKPSSISDA----PGARSRFLTWYASLDGALGFFLPLP 547
             +LI+K DFHLGQ VNTFF+I+C+ S  ++      GA  R +T YASLDG+LG+ LP+P
Sbjct: 1267 QKLIRKADFHLGQKVNTFFRIKCRVSDPANDKKHFSGADKRHVTMYASLDGSLGYILPVP 1326

Query: 548  EKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSL 607
            EK YRRLLMLQNV+VTH  H  GLNP+A+RTYK      GNP+RGIIDG LVW++L L  
Sbjct: 1327 EKTYRRLLMLQNVLVTHICHIAGLNPKAYRTYKSHIRTQGNPARGIIDGDLVWRYLYLPN 1386

Query: 608  GERLEICKKIGSKHNDILDELYDIEALSSHF 638
             E++++ KKIG++  +I+++L +I+  ++HF
Sbjct: 1387 NEKIDVAKKIGTRVQEIIEDLTEIDRQTAHF 1417


>gi|340710064|ref|XP_003393618.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Bombus terrestris]
          Length = 1417

 Score =  771 bits (1991), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 368/631 (58%), Positives = 458/631 (72%), Gaps = 46/631 (7%)

Query: 14   ETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGALKLRFKKLK--VLFVSDRS 71
            E  V+E+L V+LG HGNRP+LLVR   EL IYQA+R+PKG LKLRFKKL   ++    + 
Sbjct: 827  EMQVREILMVALGHHGNRPMLLVRLDSELQIYQAYRYPKGHLKLRFKKLDHGIIPGQLKP 886

Query: 72   KRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVS 131
            K  +E   +    R   MRYFSNIAGY GVF+C  +P W+FLT RGELR HPM IDGPV+
Sbjct: 887  KLRDEDIPMMNETRHCMMRYFSNIAGYNGVFICSDYPHWIFLTGRGELRTHPMGIDGPVT 946

Query: 132  TLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKT 191
            + APF+N+NCP+GFLYFN K ELRI VLPTHLSYDAPWPVRKVPL+CTPHF+ YHLE+KT
Sbjct: 947  SFAPFNNINCPQGFLYFNRKEELRICVLPTHLSYDAPWPVRKVPLRCTPHFVTYHLESKT 1006

Query: 192  YCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFP 251
            YC++TS AEP   YY+FNGEDKE   + R  RFI P   QF + LFSP SWE IP T   
Sbjct: 1007 YCVITSIAEPLKSYYRFNGEDKEFTEEERPERFIYPSQEQFSIVLFSPVSWETIPNTKIE 1066

Query: 252  LHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQ 311
            L +WEHV CLKNVS+ YEGT SGL+GYI LGTNYNY ED+T RGRIL+FDIIEVVPEPGQ
Sbjct: 1067 LDQWEHVTCLKNVSLAYEGTRSGLKGYIVLGTNYNYGEDITSRGRILIFDIIEVVPEPGQ 1126

Query: 312  PLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIA 371
            PLTKN+ K IYAKEQKGP+TAI  V+GFLV+AVGQKIYIWQLKDNDL G+AFIDT++YI 
Sbjct: 1127 PLTKNRFKQIYAKEQKGPITAITQVSGFLVSAVGQKIYIWQLKDNDLVGVAFIDTQIYIH 1186

Query: 372  SMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGS 431
             M+S+K+LIL+ D  +SI+LLR+Q EYRTLSLV+RD++P +  +  Y   N         
Sbjct: 1187 QMLSIKSLILIADVYKSISLLRFQEEYRTLSLVSRDFRPAEVYTIEYLIDN--------- 1237

Query: 432  LVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGG 491
                                           +++GF+++D + N+ LFMYQPE+RES GG
Sbjct: 1238 -------------------------------TNLGFLVADGESNMALFMYQPESRESLGG 1266

Query: 492  HRLIKKTDFHLGQHVNTFFKIRCKPSSISDA----PGARSRFLTWYASLDGALGFFLPLP 547
             +LI+K DFHLGQ VNTFF+IRC+ S  ++      GA  R +T YASLDG+LG+ LP+P
Sbjct: 1267 QKLIRKADFHLGQKVNTFFRIRCRLSDPANDKKHFSGADKRHVTMYASLDGSLGYILPVP 1326

Query: 548  EKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSL 607
            EK YRRLLMLQNV+VTH  H  GLNP+A+RTYK      GNP+RGIIDG LVW++  L  
Sbjct: 1327 EKTYRRLLMLQNVLVTHICHIAGLNPKAYRTYKSHIRTQGNPARGIIDGDLVWRYFYLPN 1386

Query: 608  GERLEICKKIGSKHNDILDELYDIEALSSHF 638
             E++++ KKIG++  +I+++L +I+  ++HF
Sbjct: 1387 NEKIDVAKKIGTRVQEIIEDLTEIDRQTAHF 1417


>gi|383863556|ref|XP_003707246.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Megachile rotundata]
          Length = 1415

 Score =  771 bits (1990), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 368/631 (58%), Positives = 457/631 (72%), Gaps = 46/631 (7%)

Query: 14   ETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGALKLRFKKLK--VLFVSDRS 71
            E  V+E+L V+LG HGNRP+LLVR   EL IYQ +R+PKG LKLRFKKL   ++  + R 
Sbjct: 825  EMQVREILMVALGHHGNRPMLLVRLDSELQIYQTYRYPKGHLKLRFKKLDHGIIPGNLRP 884

Query: 72   KRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVS 131
            K   E        R   MRYFSNIAGY GVF+C  +P W+FLT RGELR HPM IDGP++
Sbjct: 885  KPKEEDMSAMNETRHCMMRYFSNIAGYNGVFICSDYPHWIFLTGRGELRTHPMGIDGPIT 944

Query: 132  TLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKT 191
            + APF+N+NCP+GFLYFN K ELRI VLPTHLSYDAPWPVRKVPL+CTPHF+ YHLE+KT
Sbjct: 945  SFAPFNNINCPQGFLYFNRKEELRICVLPTHLSYDAPWPVRKVPLRCTPHFVTYHLESKT 1004

Query: 192  YCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFP 251
            YC++TS AEP   YY+FNGEDKE   + R  RFI P   QF + LFSP SWE IP T   
Sbjct: 1005 YCVITSIAEPLKSYYRFNGEDKEFTEEDRPDRFIFPSQEQFSIVLFSPVSWETIPNTKIE 1064

Query: 252  LHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQ 311
            L +WEHV CLKNVS+ YEGT SGL+GYI LGTNYNY ED+T RGRIL+FDIIEVVPEPGQ
Sbjct: 1065 LDQWEHVTCLKNVSLAYEGTRSGLKGYIVLGTNYNYGEDITSRGRILIFDIIEVVPEPGQ 1124

Query: 312  PLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIA 371
            PLTKN+ K IYAKEQKGP+TAI  V+GFLV+AVGQKIYIWQLKDNDL G+AFIDT++YI 
Sbjct: 1125 PLTKNRFKQIYAKEQKGPITAITQVSGFLVSAVGQKIYIWQLKDNDLVGVAFIDTQIYIH 1184

Query: 372  SMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGS 431
             M+S+K+LIL+ D  +SI+LLR+Q EYRTLSLV+RD++P +  +  Y   N         
Sbjct: 1185 QMLSIKSLILIADVYKSISLLRFQEEYRTLSLVSRDFRPAEVYTIEYLIDN--------- 1235

Query: 432  LVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGG 491
                                           +++GF+++D + N+ LFMYQPE+RES GG
Sbjct: 1236 -------------------------------NNLGFLVADGESNIALFMYQPESRESLGG 1264

Query: 492  HRLIKKTDFHLGQHVNTFFKIRCKPSSISDA----PGARSRFLTWYASLDGALGFFLPLP 547
             +LI+K DFHLGQ VNTFF+IRC+ S  ++      GA  R +T YASLDG+LG+ LP+P
Sbjct: 1265 QKLIRKADFHLGQKVNTFFRIRCRISDPANDKKHFSGADKRHVTMYASLDGSLGYILPVP 1324

Query: 548  EKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSL 607
            EK YRRLLMLQNV+VTH  H  GLNP+A+RTYK      GNP+RGIIDG LVW++L L  
Sbjct: 1325 EKTYRRLLMLQNVLVTHICHIAGLNPKAYRTYKSYIRTQGNPARGIIDGDLVWRYLYLPN 1384

Query: 608  GERLEICKKIGSKHNDILDELYDIEALSSHF 638
             E++++ KKIG++  +I+++L +I+  ++HF
Sbjct: 1385 NEKIDVAKKIGTRVQEIIEDLTEIDRQTAHF 1415


>gi|110750698|ref|XP_624382.2| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
            [Apis mellifera]
          Length = 1415

 Score =  768 bits (1984), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 368/632 (58%), Positives = 456/632 (72%), Gaps = 48/632 (7%)

Query: 14   ETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGALKLRFKKLKVLFVSDRSK- 72
            E  V+E+L V+LG HGNRP+LLVR   EL IYQA+R+PKG LKLRFKKL    +    + 
Sbjct: 825  EMQVREILMVALGHHGNRPMLLVRLDSELQIYQAYRYPKGHLKLRFKKLDHGIIPGHLRP 884

Query: 73   --RANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPV 130
              R  + P +    R   MRYFSNIAGY GVF+C  +P W+FLT RGELR HPM IDGPV
Sbjct: 885  RPRDEDMPAM-NDTRHCMMRYFSNIAGYNGVFICSDYPHWIFLTGRGELRTHPMGIDGPV 943

Query: 131  STLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETK 190
            ++ APF+N+NCP+GFLYFN K ELRI VLPTHLSYDAPWPVRKVPL+CTPHF+ YHLE+K
Sbjct: 944  TSFAPFNNINCPQGFLYFNRKEELRICVLPTHLSYDAPWPVRKVPLRCTPHFVTYHLESK 1003

Query: 191  TYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNF 250
            TYC++TS AEP   YY+FNGEDKE   + R  RFI P   QF + LFSP SWE IP T  
Sbjct: 1004 TYCVITSIAEPLKSYYRFNGEDKEFTEEERPDRFIFPSQEQFSIVLFSPVSWETIPNTKI 1063

Query: 251  PLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPG 310
             L +WEHV CLKNVS+ YEGT SGL+GYI LGTNYNY ED+T RGRIL+FDIIEVVPEPG
Sbjct: 1064 ELDQWEHVTCLKNVSLAYEGTRSGLKGYIVLGTNYNYGEDITSRGRILIFDIIEVVPEPG 1123

Query: 311  QPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYI 370
            QPLTKN+ K IYAKEQKGP+TAI  V+GFLV+AVGQKIYIWQLKDNDL G+AFIDT++YI
Sbjct: 1124 QPLTKNRFKQIYAKEQKGPITAITQVSGFLVSAVGQKIYIWQLKDNDLVGVAFIDTQIYI 1183

Query: 371  ASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDG 430
              M+S+K+LIL+ D  +SI+LLR+Q EYRTLSLV+RD++P +  +  Y   N        
Sbjct: 1184 HQMLSIKSLILIADVYKSISLLRFQEEYRTLSLVSRDFRPAEVYTIEYLIDN-------- 1235

Query: 431  SLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNG 490
                                            +++GF+++D + N+ LFMYQPE+RES G
Sbjct: 1236 --------------------------------TNLGFLVADGESNIALFMYQPESRESLG 1263

Query: 491  GHRLIKKTDFHLGQHVNTFFKIRCKPSSISDA----PGARSRFLTWYASLDGALGFFLPL 546
            G +LI+K DFHLGQ VNTFF+IRC+ S  ++       A  R +T YASLDG LG+ LP+
Sbjct: 1264 GQKLIRKADFHLGQKVNTFFRIRCRISDPANDKKHFSDADKRHVTMYASLDGNLGYILPV 1323

Query: 547  PEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLS 606
            PEK YRRLLMLQNV+VTH  H  GLNP+A+RTYK      GNP+RGIIDG LVW++L L 
Sbjct: 1324 PEKTYRRLLMLQNVLVTHICHIAGLNPKAYRTYKSHIRTQGNPARGIIDGDLVWRYLYLP 1383

Query: 607  LGERLEICKKIGSKHNDILDELYDIEALSSHF 638
              E++++ KKIG++  +I+++L +I+  ++HF
Sbjct: 1384 NNEKIDVAKKIGTRVQEIIEDLTEIDRQTAHF 1415


>gi|345482082|ref|XP_001607052.2| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Nasonia vitripennis]
          Length = 1415

 Score =  766 bits (1977), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 361/631 (57%), Positives = 457/631 (72%), Gaps = 46/631 (7%)

Query: 14   ETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGALKLRFKKLKVLFVSDRSKR 73
            E  V+E+  V+LG HGNRP+LLVR   EL IYQ +R+PKG LKLRFKK+   F+   S+ 
Sbjct: 825  EVQVREIAVVALGHHGNRPMLLVRLDSELQIYQVYRYPKGHLKLRFKKIDHNFIVGFSRI 884

Query: 74   ANEQPGLP--RGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVS 131
              ++  +P     R+  MRYFSNIAGY GVF+ G +P W+FLT RGELRAHPM IDGPV 
Sbjct: 885  GPKEEDMPSMNDTRLCMMRYFSNIAGYNGVFIGGDYPHWIFLTGRGELRAHPMNIDGPVK 944

Query: 132  TLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKT 191
            + APF+NVNCP+GFLYFN K ELRI VLPTHLSYDAPWPVRKVPL+CTPHF+ YHLE+KT
Sbjct: 945  SFAPFNNVNCPQGFLYFNRKDELRICVLPTHLSYDAPWPVRKVPLRCTPHFVTYHLESKT 1004

Query: 192  YCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFP 251
            YC+VTSTAEP   YY+FNGEDKE   + R+ RF+ P   QF + LFSP SW+ IP T   
Sbjct: 1005 YCVVTSTAEPLKSYYRFNGEDKEFTEEERNERFLYPTQEQFSIVLFSPVSWDTIPNTKID 1064

Query: 252  LHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQ 311
            L +WEHV CLKNVS+ YEGT SGL+GYI +GTNYNY ED+T RGRI +FDIIEVVPEPGQ
Sbjct: 1065 LDQWEHVTCLKNVSLAYEGTRSGLKGYIVIGTNYNYGEDITSRGRIFIFDIIEVVPEPGQ 1124

Query: 312  PLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIA 371
            PLTKN+ K IYAKEQKGPVTAI  V+GFLV+A+GQKIYIWQLKDNDL G+AFIDT++Y+ 
Sbjct: 1125 PLTKNRFKQIYAKEQKGPVTAITQVSGFLVSAIGQKIYIWQLKDNDLVGVAFIDTQIYVC 1184

Query: 372  SMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGS 431
             M+S+K+LILV D  +S++LLR+QPEY+TLSLV+RD++ T+  +  Y+  N         
Sbjct: 1185 QMLSIKSLILVADVYKSVSLLRFQPEYKTLSLVSRDFRTTEIYAIEYFIQN--------- 1235

Query: 432  LVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGG 491
                                           + +GF+++D + N+ +F YQPE+ +S GG
Sbjct: 1236 -------------------------------NELGFIVADGESNISIFSYQPESSQSLGG 1264

Query: 492  HRLIKKTDFHLGQHVNTFFKIRCKPSSISDA----PGARSRFLTWYASLDGALGFFLPLP 547
             +LI+K D HLGQ +NTFF+I+CK +  ++      GA  R +T YA+LDG+LG+ LP+P
Sbjct: 1265 QKLIRKADIHLGQKINTFFRIKCKTTDSANPTKQFSGADKRHVTMYATLDGSLGYILPVP 1324

Query: 548  EKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSL 607
            EK YRRLLMLQNV+V+H  H  GLNP+AFRTYK      GNP+RGIIDG LV K+L L +
Sbjct: 1325 EKTYRRLLMLQNVLVSHIYHIAGLNPKAFRTYKSCVRMQGNPARGIIDGDLVRKYLDLPV 1384

Query: 608  GERLEICKKIGSKHNDILDELYDIEALSSHF 638
             E++EI KKIG+   +I+D++++I   +SHF
Sbjct: 1385 NEKIEIAKKIGTGAQEIMDDMHEIYKQTSHF 1415


>gi|270003792|gb|EFA00240.1| hypothetical protein TcasGA2_TC003068 [Tribolium castaneum]
          Length = 1392

 Score =  761 bits (1965), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 364/641 (56%), Positives = 468/641 (73%), Gaps = 48/641 (7%)

Query: 6    SHSPSAMDETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGALKLRFKKLK-- 63
            +H  +   +  V+E+L V+LG HG+RPLL+VR + +L IY+ FR P+G LK+RF+K+K  
Sbjct: 792  AHEANIQRQFDVKEILVVALGNHGSRPLLMVRLERDLYIYEVFRFPRGNLKMRFRKIKHS 851

Query: 64   VLFVSDRSKRANEQPGLPRGV--RISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRA 121
            +++  + S R + +      +  RI +MRYF+NIAGY GVF+CG +P W+F+++RGELR 
Sbjct: 852  LIYSPNVSGRIDTEDSDFFAIQERIIKMRYFTNIAGYNGVFVCGANPHWIFMSARGELRT 911

Query: 122  HPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPH 181
            HPMTIDG V + A F+NVNCP+GFLYFN KSELRI VLPTHLSYDA WPVRKVPL+CTPH
Sbjct: 912  HPMTIDGEVLSFAAFNNVNCPQGFLYFNRKSELRIGVLPTHLSYDAAWPVRKVPLRCTPH 971

Query: 182  FLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFS 241
            F+ YHLE+KTYC+VTS AEPS  YYKFNGEDKEL  + R  RF  PL  +F + LFSP S
Sbjct: 972  FVTYHLESKTYCLVTSIAEPSNKYYKFNGEDKELSVEDRGDRFPYPLQEKFSLMLFSPVS 1031

Query: 242  WEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFD 301
            W+ IP T   L EWEHV CLKNVS+ YEGT SGL+GYIA+GTNYNY EDVT RGRIL+FD
Sbjct: 1032 WDVIPNTKIDLDEWEHVNCLKNVSLAYEGTRSGLKGYIAVGTNYNYGEDVTSRGRILIFD 1091

Query: 302  IIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGI 361
            IIEVVPEPGQPLTKN+ K IYAK+QKGPVTA+  V GFLV+AVGQKIYIWQLKDNDL G+
Sbjct: 1092 IIEVVPEPGQPLTKNRFKEIYAKDQKGPVTALSQVKGFLVSAVGQKIYIWQLKDNDLVGV 1151

Query: 362  AFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAG 421
            AFIDT++Y   ++++K+L+LV D  +SI+LLR+Q EYRTLSLV+RD++P +  S  Y   
Sbjct: 1152 AFIDTQIYTHQILTIKSLLLVADVYKSISLLRFQEEYRTLSLVSRDFRPCEVFSVEYMID 1211

Query: 422  NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMY 481
            N                                        ++MGF++SD +KN+VL+MY
Sbjct: 1212 N----------------------------------------TTMGFLVSDSEKNLVLYMY 1231

Query: 482  QPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDA----PGARSRFLTWYASLD 537
            QPE+RES GG RL++K DFHLGQ VN+FF+I+CK   + +      GA  R +T YA+LD
Sbjct: 1232 QPESRESLGGQRLLRKADFHLGQAVNSFFRIKCKLGELGEDKKNLTGADKRHITMYATLD 1291

Query: 538  GALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGS 597
            G LG+ +P+PEK YRRLLMLQNV+V+  +H  GLNP+AFRTYK       NP+R +IDG 
Sbjct: 1292 GGLGYIMPVPEKTYRRLLMLQNVLVSQGAHIAGLNPKAFRTYKSWKKLQTNPARSVIDGE 1351

Query: 598  LVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
            LV+ +LQLS+ E+LE+ KKIG+K  ++LD+L DI+ +++HF
Sbjct: 1352 LVYNYLQLSIPEKLEVSKKIGTKLEELLDDLSDIQKITNHF 1392


>gi|91078626|ref|XP_968117.1| PREDICTED: similar to cleavage and polyadenylation specificity factor
            cpsf [Tribolium castaneum]
          Length = 1413

 Score =  761 bits (1965), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 364/641 (56%), Positives = 468/641 (73%), Gaps = 48/641 (7%)

Query: 6    SHSPSAMDETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGALKLRFKKLK-- 63
            +H  +   +  V+E+L V+LG HG+RPLL+VR + +L IY+ FR P+G LK+RF+K+K  
Sbjct: 813  AHEANIQRQFDVKEILVVALGNHGSRPLLMVRLERDLYIYEVFRFPRGNLKMRFRKIKHS 872

Query: 64   VLFVSDRSKRANEQPGLPRGV--RISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRA 121
            +++  + S R + +      +  RI +MRYF+NIAGY GVF+CG +P W+F+++RGELR 
Sbjct: 873  LIYSPNVSGRIDTEDSDFFAIQERIIKMRYFTNIAGYNGVFVCGANPHWIFMSARGELRT 932

Query: 122  HPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPH 181
            HPMTIDG V + A F+NVNCP+GFLYFN KSELRI VLPTHLSYDA WPVRKVPL+CTPH
Sbjct: 933  HPMTIDGEVLSFAAFNNVNCPQGFLYFNRKSELRIGVLPTHLSYDAAWPVRKVPLRCTPH 992

Query: 182  FLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFS 241
            F+ YHLE+KTYC+VTS AEPS  YYKFNGEDKEL  + R  RF  PL  +F + LFSP S
Sbjct: 993  FVTYHLESKTYCLVTSIAEPSNKYYKFNGEDKELSVEDRGDRFPYPLQEKFSLMLFSPVS 1052

Query: 242  WEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFD 301
            W+ IP T   L EWEHV CLKNVS+ YEGT SGL+GYIA+GTNYNY EDVT RGRIL+FD
Sbjct: 1053 WDVIPNTKIDLDEWEHVNCLKNVSLAYEGTRSGLKGYIAVGTNYNYGEDVTSRGRILIFD 1112

Query: 302  IIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGI 361
            IIEVVPEPGQPLTKN+ K IYAK+QKGPVTA+  V GFLV+AVGQKIYIWQLKDNDL G+
Sbjct: 1113 IIEVVPEPGQPLTKNRFKEIYAKDQKGPVTALSQVKGFLVSAVGQKIYIWQLKDNDLVGV 1172

Query: 362  AFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAG 421
            AFIDT++Y   ++++K+L+LV D  +SI+LLR+Q EYRTLSLV+RD++P +  S  Y   
Sbjct: 1173 AFIDTQIYTHQILTIKSLLLVADVYKSISLLRFQEEYRTLSLVSRDFRPCEVFSVEYMID 1232

Query: 422  NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMY 481
            N                                        ++MGF++SD +KN+VL+MY
Sbjct: 1233 N----------------------------------------TTMGFLVSDSEKNLVLYMY 1252

Query: 482  QPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDA----PGARSRFLTWYASLD 537
            QPE+RES GG RL++K DFHLGQ VN+FF+I+CK   + +      GA  R +T YA+LD
Sbjct: 1253 QPESRESLGGQRLLRKADFHLGQAVNSFFRIKCKLGELGEDKKNLTGADKRHITMYATLD 1312

Query: 538  GALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGS 597
            G LG+ +P+PEK YRRLLMLQNV+V+  +H  GLNP+AFRTYK       NP+R +IDG 
Sbjct: 1313 GGLGYIMPVPEKTYRRLLMLQNVLVSQGAHIAGLNPKAFRTYKSWKKLQTNPARSVIDGE 1372

Query: 598  LVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
            LV+ +LQLS+ E+LE+ KKIG+K  ++LD+L DI+ +++HF
Sbjct: 1373 LVYNYLQLSIPEKLEVSKKIGTKLEELLDDLSDIQKITNHF 1413


>gi|307190910|gb|EFN74734.1| Cleavage and polyadenylation specificity factor subunit 1 [Camponotus
            floridanus]
          Length = 1418

 Score =  761 bits (1964), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 364/633 (57%), Positives = 457/633 (72%), Gaps = 48/633 (7%)

Query: 14   ETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGALKLRFKKLKVLFVSDRSKR 73
            E  V+E+L V+LG HGNRP+LLVR   EL IYQA+++PKG LKLRFKKL+   +  R   
Sbjct: 826  EMQVREILMVALGHHGNRPMLLVRLDSELQIYQAYKYPKGYLKLRFKKLEHGIIPGRLSP 885

Query: 74   ANEQPGLPRGV---RISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPV 130
              ++  +P      RI  MRYFSNIAGY GVF+C  +P W+FLT RGELR HPM IDGP+
Sbjct: 886  KPKEEDMPMNASETRICMMRYFSNIAGYNGVFICCDYPHWIFLTGRGELRTHPMGIDGPI 945

Query: 131  STLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETK 190
            ++ A F+NVNCP+GFLYFN K ELRI VLPTHLSYDAPWPVRKVPL+CTPHF+ YHLE+K
Sbjct: 946  TSFAAFNNVNCPQGFLYFNRKEELRICVLPTHLSYDAPWPVRKVPLRCTPHFVTYHLESK 1005

Query: 191  TYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNF 250
            TYC++TS AEP   YY+FNGEDKE   + R  RF+ P   QF + LFSP SWE IP T  
Sbjct: 1006 TYCVITSIAEPLKSYYRFNGEDKEFTEEERPERFLYPSQEQFSIVLFSPVSWETIPNTKI 1065

Query: 251  PLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPG 310
             L +WEHV CLKNVS+ YEGT SGL+GYI LGTNYNY ED+T RGRIL+FDIIEVVPEPG
Sbjct: 1066 ELEQWEHVTCLKNVSLAYEGTRSGLKGYIVLGTNYNYGEDITSRGRILIFDIIEVVPEPG 1125

Query: 311  QPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYI 370
            QPLTKN+ K IYAKEQKGP+TAI  V+GFLV+AVGQKIYIWQLKDNDL G+AFIDT++YI
Sbjct: 1126 QPLTKNRFKQIYAKEQKGPITAITQVSGFLVSAVGQKIYIWQLKDNDLVGVAFIDTQIYI 1185

Query: 371  ASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDG 430
              M+S+K+LIL+ D  +SI+LLR+Q EYRTLSLV+RD++P +  +  Y   N        
Sbjct: 1186 HQMLSIKSLILIADVYKSISLLRFQEEYRTLSLVSRDFRPAEVYTIEYLIDN-------- 1237

Query: 431  SLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNG 490
                                            +++GF ++D + N+ LFMYQPE+RES G
Sbjct: 1238 --------------------------------TNLGFFLADGESNLALFMYQPESRESLG 1265

Query: 491  GHRLIKKTDFHLGQHVNTFFKIRCKPSSISDA----PGARSRFLTWYASLDGALGFFLPL 546
            G +LI+K DFHLGQ VNTFF+IRC+ S  ++      GA  R +T YA+LDG+LG+ LP+
Sbjct: 1266 GQKLIRKADFHLGQKVNTFFRIRCRVSDPANDKKQFSGADKRHVTMYATLDGSLGYILPV 1325

Query: 547  PEKNYRRLLMLQNVMVTHTSHTGGLNPRAFR-TYKGKGYYAGNPSRGIIDGSLVWKFLQL 605
            PEK YRRLLMLQNV+VTH  H  GLNP+++R TYK      GNP+RGIIDG LVW++L L
Sbjct: 1326 PEKTYRRLLMLQNVLVTHICHIAGLNPKSYRQTYKSYIRNQGNPARGIIDGDLVWRYLFL 1385

Query: 606  SLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
               E+ ++ KKIG++  +I++++ +I+  ++HF
Sbjct: 1386 PNNEKTDVAKKIGTRVQEIIEDITEIDRQTAHF 1418


>gi|307191845|gb|EFN75271.1| Cleavage and polyadenylation specificity factor subunit 1
            [Harpegnathos saltator]
          Length = 1214

 Score =  759 bits (1960), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 361/632 (57%), Positives = 455/632 (71%), Gaps = 47/632 (7%)

Query: 14   ETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGALKLRFKKLKVLFVSDRSKR 73
            E  V+E+L V+LG HGNRP+LLVR   EL IYQA+++PKG LKLRFKKL    +     R
Sbjct: 623  ELQVREVLMVALGHHGNRPMLLVRLDSELQIYQAYKYPKGHLKLRFKKLDHGIIPGHLSR 682

Query: 74   ANEQPGLP---RGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPV 130
              ++  +P      RI  MRYFSNIAGY GVF+C  +P W+FLT RGELR HPM IDG V
Sbjct: 683  KPKEEDVPVNANETRICMMRYFSNIAGYNGVFICSDYPHWIFLTGRGELRTHPMGIDGSV 742

Query: 131  STLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETK 190
            ++ A F+N+NCP+GFLYFN K ELRI VLPTHLSYDAPWPVRKVPL+CTPHF+ YHLE+K
Sbjct: 743  TSFAAFNNINCPQGFLYFNRKEELRICVLPTHLSYDAPWPVRKVPLRCTPHFVTYHLESK 802

Query: 191  TYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNF 250
            TYC++TST+EP   YY+FNGEDKE   + R  RF+ P   QF + LFSP SWE IP T  
Sbjct: 803  TYCVITSTSEPLKSYYRFNGEDKEFTEEDRPERFLYPSQEQFCIVLFSPVSWETIPNTKI 862

Query: 251  PLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPG 310
             L +WEHV CLKNVS+ YEGT SGL+GYI LGTNYNY ED+T RGRIL+FDIIEVVPEPG
Sbjct: 863  ELDQWEHVTCLKNVSLAYEGTRSGLKGYIVLGTNYNYGEDITSRGRILIFDIIEVVPEPG 922

Query: 311  QPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYI 370
            QPLTKN+ K IYAKEQKGP+TAI  V+GFLVTAVGQKIYIWQLKDNDL GIAFIDT++YI
Sbjct: 923  QPLTKNRFKQIYAKEQKGPITAITQVSGFLVTAVGQKIYIWQLKDNDLVGIAFIDTQIYI 982

Query: 371  ASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDG 430
              M+S+K+LIL+ D  +SI+LLR+Q + RTLSLV+RD++P +  +  Y   N        
Sbjct: 983  HQMLSIKSLILIADVYKSISLLRFQEKCRTLSLVSRDFRPAEVYTIEYLIDN-------- 1034

Query: 431  SLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNG 490
                                            +++GF+I+D + N+ LFMYQPE+RES G
Sbjct: 1035 --------------------------------TNLGFLIADGESNLALFMYQPESRESLG 1062

Query: 491  GHRLIKKTDFHLGQHVNTFFKIRCKPSSISDA----PGARSRFLTWYASLDGALGFFLPL 546
            G +LI+K DFHLGQ +NTFF+I+C+ + ++        A  + +T YASLDG+LG+ LP+
Sbjct: 1063 GQKLIRKADFHLGQKINTFFRIKCRVTDVASDKKHFSDADKKHVTMYASLDGSLGYVLPV 1122

Query: 547  PEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLS 606
            PEK YRRLLMLQNV+VTH  H  GLNP+A+RTYK      GNP+RGIIDG LVW++L L 
Sbjct: 1123 PEKTYRRLLMLQNVLVTHICHIAGLNPKAYRTYKSYVRNQGNPARGIIDGDLVWRYLSLP 1182

Query: 607  LGERLEICKKIGSKHNDILDELYDIEALSSHF 638
              E+ ++ KKIG++  +I++++ +I+  ++HF
Sbjct: 1183 NNEKADVAKKIGTRVQEIIEDITEIDRQTAHF 1214


>gi|380014171|ref|XP_003691113.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Apis florea]
          Length = 1583

 Score =  754 bits (1948), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 362/612 (59%), Positives = 440/612 (71%), Gaps = 48/612 (7%)

Query: 14   ETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGALKLRFKKLKVLFVSDRSK- 72
            E  V+E+L V+LG HGNRP+LLVR   EL IYQA+R+PKG LKLRFKKL    +    + 
Sbjct: 825  EMQVREILMVALGHHGNRPMLLVRLDSELQIYQAYRYPKGHLKLRFKKLDHGIIPGHLRP 884

Query: 73   --RANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPV 130
              R  + P +    R   MRYFSNIAGY GVF+C  +P W+FLT RGELR HPM IDGPV
Sbjct: 885  RPRDEDMPAM-NDTRHCMMRYFSNIAGYNGVFICSDYPHWIFLTGRGELRTHPMGIDGPV 943

Query: 131  STLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETK 190
            ++ APF+N+NCP+GFLYFN K ELRI VLPTHLSYDAPWPVRKVPL+CTPHF+ YHLE+K
Sbjct: 944  TSFAPFNNINCPQGFLYFNRKEELRICVLPTHLSYDAPWPVRKVPLRCTPHFVTYHLESK 1003

Query: 191  TYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNF 250
            TYC++TS AEP   YY+FNGEDKE   + R  RFI P   QF + LFSP SWE IP T  
Sbjct: 1004 TYCVITSIAEPLKSYYRFNGEDKEFTEEERPDRFIYPSQEQFSIVLFSPVSWETIPNTKI 1063

Query: 251  PLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPG 310
             L +WEHV CLKNVS+ YEGT SGL+GYI LGTNYNY ED+T RGRIL+FDIIEVVPEPG
Sbjct: 1064 ELDQWEHVTCLKNVSLAYEGTRSGLKGYIVLGTNYNYGEDITSRGRILIFDIIEVVPEPG 1123

Query: 311  QPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYI 370
            QPLTKN+ K IYAKEQKGP+TAI  V+GFLV+AVGQKIYIWQLKDNDL G+AFIDT++YI
Sbjct: 1124 QPLTKNRFKQIYAKEQKGPITAITQVSGFLVSAVGQKIYIWQLKDNDLVGVAFIDTQIYI 1183

Query: 371  ASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDG 430
              M+S+K+LIL+ D  +SI+LLR+Q EYRTLSLV+RD++P +  +  Y   N        
Sbjct: 1184 HQMLSIKSLILIADVYKSISLLRFQEEYRTLSLVSRDFRPAEVYTIEYLIDN-------- 1235

Query: 431  SLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNG 490
                                            +++GF+++D + N+ LFMYQPE+RES G
Sbjct: 1236 --------------------------------TNLGFLVADGESNIALFMYQPESRESLG 1263

Query: 491  GHRLIKKTDFHLGQHVNTFFKIRCKPSSISDA----PGARSRFLTWYASLDGALGFFLPL 546
            G +LI+K DFHLGQ VNTFF+IRC+ S  ++       A  R +T YASLDG LG+ LP+
Sbjct: 1264 GQKLIRKADFHLGQKVNTFFRIRCRISDPANDKKHFSDADKRHVTMYASLDGNLGYILPV 1323

Query: 547  PEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLS 606
            PEK YRRLLMLQNV+VTH  H  GLNP+A+RTYK      GNP+RGIIDG LVW++L L 
Sbjct: 1324 PEKTYRRLLMLQNVLVTHICHIAGLNPKAYRTYKSHIRTQGNPARGIIDGDLVWRYLYLP 1383

Query: 607  LGERLEICKKIG 618
              E++++ KKI 
Sbjct: 1384 NNEKIDVAKKIA 1395


>gi|322792443|gb|EFZ16427.1| hypothetical protein SINV_15375 [Solenopsis invicta]
          Length = 1532

 Score =  751 bits (1938), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 360/612 (58%), Positives = 443/612 (72%), Gaps = 48/612 (7%)

Query: 17   VQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGALKLRFKKLKVLFVSDRSKRANE 76
            V+E+L V+LG HGNRP+LLVR   EL IYQ +R+PKG LKLRFKKL    +  R     +
Sbjct: 796  VREILMVALGHHGNRPMLLVRLDSELQIYQVYRYPKGYLKLRFKKLDHGIIPGRLSPRPK 855

Query: 77   QPGLPRGV---RISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTL 133
            +  +PR     RI  MRYFSNIAGY GVF+C  +P W+FLT RGELR HPM IDG V++ 
Sbjct: 856  EEDVPRNTSDTRICVMRYFSNIAGYNGVFICSDYPHWIFLTGRGELRTHPMGIDGSVTSF 915

Query: 134  APFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYC 193
            A F+N+NCP+GFLYFN K ELRI VLPTHLSYDAPWPVRKVPL+CTPHF+ YHLE+KTYC
Sbjct: 916  AAFNNINCPQGFLYFNRKEELRICVLPTHLSYDAPWPVRKVPLRCTPHFVTYHLESKTYC 975

Query: 194  IVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLH 253
            ++TSTAEP   YY+FNGEDKE   + R  RF+ P   QF + LFSP SWE IP T   L 
Sbjct: 976  VITSTAEPLKSYYRFNGEDKEFTEEERPDRFLYPSQEQFSIVLFSPVSWETIPNTKIELD 1035

Query: 254  EWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPL 313
            +WEHV CLKNVS+ YEGT SGL+GYI LGTNYNY ED+T RGRIL+FDIIEVVPEPGQPL
Sbjct: 1036 QWEHVTCLKNVSLAYEGTRSGLKGYIVLGTNYNYGEDITSRGRILIFDIIEVVPEPGQPL 1095

Query: 314  TKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASM 373
            TKN+ K IYAKEQKGP+TAI  V+GFLV+AVGQKIYIWQLKDNDL G+AFIDT++YI  M
Sbjct: 1096 TKNRFKQIYAKEQKGPITAITQVSGFLVSAVGQKIYIWQLKDNDLVGVAFIDTQIYIHQM 1155

Query: 374  VSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLV 433
            +S+K+LIL+ D  +SI+LLR+Q EYRTLSLV+RD++P +  +  Y   N           
Sbjct: 1156 LSIKSLILIADVYKSISLLRFQEEYRTLSLVSRDFRPAEVYTIEYLIDN----------- 1204

Query: 434  WKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR 493
                                         +++GF+++D + N+ LFMYQPE+RES GG +
Sbjct: 1205 -----------------------------TNLGFIVADGESNLALFMYQPESRESLGGQK 1235

Query: 494  LIKKTDFHLGQHVNTFFKIRCKPSSISDA----PGARSRFLTWYASLDGALGFFLPLPEK 549
            LI+K DFHLGQ VNTFF+IRC+ +  ++      GA  R +T YASLDG+LG+ LP+PEK
Sbjct: 1236 LIRKADFHLGQKVNTFFRIRCRVTDPANDKKQFSGADKRHVTMYASLDGSLGYILPVPEK 1295

Query: 550  NYRRLLMLQNVMVTHTSHTGGLNPRAFR-TYKGKGYYAGNPSRGIIDGSLVWKFLQLSLG 608
             YRRLLMLQNV+VTH  H  GLNP+++R TYK      GNP+RGIIDG LVW++L L   
Sbjct: 1296 TYRRLLMLQNVLVTHICHIAGLNPKSYRHTYKSYIRNQGNPARGIIDGDLVWRYLFLPNN 1355

Query: 609  ERLEICKKIGSK 620
            E+ ++ KKIG++
Sbjct: 1356 EKADLAKKIGTR 1367


>gi|242021233|ref|XP_002431050.1| Cleavage and polyadenylation specificity factor 160 kDa subunit,
            putative [Pediculus humanus corporis]
 gi|212516279|gb|EEB18312.1| Cleavage and polyadenylation specificity factor 160 kDa subunit,
            putative [Pediculus humanus corporis]
          Length = 1409

 Score =  744 bits (1921), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 354/633 (55%), Positives = 466/633 (73%), Gaps = 47/633 (7%)

Query: 13   DETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGA-LKLRFKKL-KVLFVSDR 70
            D+  + ELL VSLG  G RP+LL+RT+++L+IYQAF+  KG  LK+RF++L + L + +R
Sbjct: 817  DDPEIHELLVVSLGHLGRRPILLLRTENDLMIYQAFKFAKGPNLKIRFRRLPQTLILKER 876

Query: 71   -SKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGP 129
             +K   +        R +++RYFSNI+GY GVF+CGP+P WLFLT+RGELR+HPM IDG 
Sbjct: 877  KAKFKVKYENEVESERATRLRYFSNISGYNGVFVCGPNPHWLFLTARGELRSHPMLIDGR 936

Query: 130  VSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLET 189
            V++ A FHNVNCP GFLYF +K ELRI +LPTHLSYDAPWPVRKVPL+CTPH + YHLE+
Sbjct: 937  VTSFASFHNVNCPLGFLYFTSKCELRICILPTHLSYDAPWPVRKVPLRCTPHMVTYHLES 996

Query: 190  KTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTN 249
            KTYC++TS++EPS +Y++FNGEDKE   + RD RF  PL  +F + LFSP SWE IP T 
Sbjct: 997  KTYCLITSSSEPSNEYFRFNGEDKEHSVEDRDDRFPLPLQDKFSIVLFSPVSWEVIPNTK 1056

Query: 250  FPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEP 309
              L EWEHV C+K V++ YEGT SGL+GY+A+GTNYNYSED+T +GRIL++DIIEVVPEP
Sbjct: 1057 MELDEWEHVTCVKTVNLSYEGTRSGLKGYVAVGTNYNYSEDITSKGRILIYDIIEVVPEP 1116

Query: 310  GQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVY 369
            GQPLTKN+ K +YAKEQKGPVTA+CHV GFLVTA+GQKIYIWQLKDNDL GIAFIDT++Y
Sbjct: 1117 GQPLTKNRFKTVYAKEQKGPVTALCHVLGFLVTAMGQKIYIWQLKDNDLVGIAFIDTQIY 1176

Query: 370  IASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIID 429
            I  M+SVK+LILV D  +SI+LLR+Q EYRTLSLV+RD++P +      YA         
Sbjct: 1177 IHQMISVKSLILVADVYKSISLLRFQEEYRTLSLVSRDFRPCE-----VYA--------- 1222

Query: 430  GSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESN 489
                                       ++L + + MGF+ISD + N++++MY+PE R+S 
Sbjct: 1223 --------------------------IELLLDNTQMGFLISDVEMNIIMYMYKPEDRDSV 1256

Query: 490  GGHRLIKKTDFHLGQHVNTFFKIRCKPSSIS---DAP-GARSRFLTWYASLDGALGFFLP 545
            GG +L++K DFHLGQH+N++F+IRC+    +   D P GA  R ++ +A+LDGALG+ LP
Sbjct: 1257 GGQKLLRKADFHLGQHINSWFRIRCRLGDQAENYDFPIGAEKRHISMFATLDGALGYLLP 1316

Query: 546  LPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQL 605
            +PEK YRRL MLQN++V H  H  GLNP+AFR YK      GNP + I+DG L+W +L L
Sbjct: 1317 IPEKTYRRLQMLQNILVYHIPHLAGLNPKAFRIYKSGRKLLGNPCKRIVDGELIWMYLSL 1376

Query: 606  SLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
            ++ E+ ++ KK+GSK +DI++++  IE LS HF
Sbjct: 1377 TVMEKQDVAKKMGSKMDDIIEDIAVIERLSGHF 1409


>gi|332018184|gb|EGI58789.1| Cleavage and polyadenylation specificity factor subunit 1 [Acromyrmex
            echinatior]
          Length = 1412

 Score =  733 bits (1892), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 357/612 (58%), Positives = 440/612 (71%), Gaps = 52/612 (8%)

Query: 17   VQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGALKLRFKKLKVLFVSDRSKRANE 76
            V+E+L V+LG HGNRP+LLVR   +L IYQA+R+PKG LKLRFKKL    +  R     +
Sbjct: 827  VREILMVALGHHGNRPMLLVRLDSDLQIYQAYRYPKGYLKLRFKKLDHGIIPGRLSPRPK 886

Query: 77   QPGLPRG---VRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTL 133
            +  +PR     RI  MRYFSNIAGY GVF+C  +P W+FLT RGELR HPM IDGPV++ 
Sbjct: 887  EEDVPRNRNITRICVMRYFSNIAGYNGVFICSDYPHWIFLTGRGELRTHPMGIDGPVTSF 946

Query: 134  APFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYC 193
            APF+N+NCP+GFLYFN K ELRI VLPTHLSYDAPWPVRKVPL+CTPHF+ YHLE+KTYC
Sbjct: 947  APFNNINCPQGFLYFNRKEELRICVLPTHLSYDAPWPVRKVPLRCTPHFVTYHLESKTYC 1006

Query: 194  IVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLH 253
            ++TSTAEP   YY+FNGEDK L        ++      F   LFSP SWE IP T   L 
Sbjct: 1007 VITSTAEPLKSYYRFNGEDKVLTK----LYYLFQFSRIFMNLLFSPVSWETIPNTKIELD 1062

Query: 254  EWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPL 313
            +WEHV CLKNVS+ YEGT SGL+GYI LGTNYNY ED+T RGRIL+FDIIEVVPEPGQPL
Sbjct: 1063 QWEHVTCLKNVSLAYEGTRSGLKGYIVLGTNYNYGEDITSRGRILIFDIIEVVPEPGQPL 1122

Query: 314  TKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASM 373
            TKN+ K IYAKEQKGP+TAI  V+GFLV+AVGQKIYIWQLKDNDL G+AFIDT++YI  M
Sbjct: 1123 TKNRFKQIYAKEQKGPITAITQVSGFLVSAVGQKIYIWQLKDNDLVGVAFIDTQIYIHQM 1182

Query: 374  VSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLV 433
            +S+K+LIL+ D  +SI+LLR+Q EYRTLSLV+RD++P +  +  Y   N           
Sbjct: 1183 LSIKSLILIADVYKSISLLRFQEEYRTLSLVSRDFRPAEVYTIEYLIDN----------- 1231

Query: 434  WKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR 493
                                         S++GF+++D + N+ LFMYQPE+RES GG +
Sbjct: 1232 -----------------------------SNLGFIVADGESNLALFMYQPESRESLGGQK 1262

Query: 494  LIKKTDFHLGQHVNTFFKIRCKPSSISDA----PGARSRFLTWYASLDGALGFFLPLPEK 549
            LI+K DFHLGQ +NTFF+I+C+ +  ++      GA  R +T YASLDG+LG+ LP+PEK
Sbjct: 1263 LIRKADFHLGQKINTFFRIKCRITDPANDKKQFSGADKRHVTMYASLDGSLGYILPVPEK 1322

Query: 550  NYRRLLMLQNVMVTHTSHTGGLNPRAFR-TYKGKGYYAGNPSRGIIDGSLVWKFLQLSLG 608
             YRRLLMLQNV+VTH  H  GLNP+A+R TYK      GNP+RGIIDG LVW++L L   
Sbjct: 1323 TYRRLLMLQNVLVTHICHIAGLNPKAYRHTYKSYVRNQGNPARGIIDGDLVWRYLFLPNN 1382

Query: 609  ERLEICKKIGSK 620
            E+ ++ KKIG++
Sbjct: 1383 EKADLAKKIGTR 1394


>gi|193702313|ref|XP_001945086.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Acyrthosiphon pisum]
          Length = 1335

 Score =  686 bits (1770), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 332/628 (52%), Positives = 440/628 (70%), Gaps = 52/628 (8%)

Query: 16   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGALKLRFKKLKVLFVSDRSKRAN 75
            I++E+L V LG    RP++ VR  +E++IY   RHP+G LK+RF K+  L ++ +S+  N
Sbjct: 755  IIKEILIVPLGYQDKRPIMFVRLDNEVVIYGIHRHPEGTLKMRFHKMTSL-LTFQSRSGN 813

Query: 76   EQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAP 135
               G       S +RYFS +AG+ GVF+CG +P  + LT RGELR HP+ IDGP+   AP
Sbjct: 814  PLEG------TSLLRYFSKVAGHNGVFICGQNPHLILLTVRGELRCHPLHIDGPIMCFAP 867

Query: 136  FHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIV 195
            FHNVNC +GFLYFN+  +LRIS+LPTHLSYD PWP+RKVPL+ TPHF+AYHLETKTYC+V
Sbjct: 868  FHNVNCSQGFLYFNSDHKLRISILPTHLSYDEPWPLRKVPLRKTPHFIAYHLETKTYCVV 927

Query: 196  TSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEW 255
            TS++E S  YY+FNGEDKEL T+ RD  F  P    F + LFSP SWE IP T+    +W
Sbjct: 928  TSSSELSASYYRFNGEDKELTTEERDPLFPLPSHEVFTLELFSPASWEPIPDTSIETEDW 987

Query: 256  EHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTK 315
            EH+ CLKNV++ YEG  SGL+GYIA+GTNY+YSED+T RGRI LFDII+VVPEPG+PLTK
Sbjct: 988  EHITCLKNVALAYEGARSGLKGYIAMGTNYSYSEDITSRGRIFLFDIIDVVPEPGKPLTK 1047

Query: 316  NKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVS 375
            NKIKMIYAKEQKGPVTAI HV GFLVTAVGQKIYIWQLKDNDL GIAFIDTEVY+  M+S
Sbjct: 1048 NKIKMIYAKEQKGPVTAITHVVGFLVTAVGQKIYIWQLKDNDLIGIAFIDTEVYVHQMLS 1107

Query: 376  VKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWK 435
            +K+LILV D  +SI LLR+Q EYRTLSLV RD KP +     +   N             
Sbjct: 1108 IKSLILVADLFKSITLLRFQEEYRTLSLVCRDSKPLEVFDINFLIDN------------- 1154

Query: 436  FLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLI 495
                                       + +GF+ SD+D+N++L++YQP ARES GG  L+
Sbjct: 1155 ---------------------------TELGFLASDRDQNLLLYLYQPMARESYGGQHLV 1187

Query: 496  KKTDFHLGQHVNTFFKIRCKPSSIS----DAPGARSRFLTWYASLDGALGFFLPLPEKNY 551
            ++ DF++G +VN+FF++RCK S+++    +A G+  R +T Y +LDG++G+ +P+ EKNY
Sbjct: 1188 RRGDFNIGSNVNSFFRLRCKQSTVAPDRREAIGSDKRHVTMYTTLDGSIGYIVPIHEKNY 1247

Query: 552  RRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQ-LSLGER 610
            RRLL LQN++V + +H  GLNP+A+R++K       N +R +IDG LVW F+  ++  +R
Sbjct: 1248 RRLLTLQNMLVKNITHLAGLNPKAYRSFKATAPERMNQARRVIDGELVWMFVTCMNARQR 1307

Query: 611  LEICKKIGSKHNDILDELYDIEALSSHF 638
             EI  K+G K  ++L ++Y+++  + HF
Sbjct: 1308 NEIANKVGVKTIELLQDIYELDRTTWHF 1335


>gi|427795803|gb|JAA63353.1| Putative mrna cleavage and polyadenylation factor ii complex
           subunit cft1 cpsf subunit, partial [Rhipicephalus
           pulchellus]
          Length = 726

 Score =  659 bits (1700), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 328/637 (51%), Positives = 436/637 (68%), Gaps = 55/637 (8%)

Query: 16  IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAF----RHPKGALKLRFKKLK-VLFVSDR 70
           +V E+L V LG+  +RPLLL R   +LLIY+AF       +G LKLRFKK+   +F+ +R
Sbjct: 131 VVHEILVVGLGIRHSRPLLLARVDEDLLIYEAFPFYETQREGHLKLRFKKMSHDIFLRER 190

Query: 71  SKRANEQPGLPRGVRISQMRY----FSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTI 126
            K   ++P      +  Q R     FS+I+GY GVFLCG  P WLF++SRGELR HPM +
Sbjct: 191 -KYKTQKPENEEEEKAFQSRQWLHPFSDISGYSGVFLCGYRPYWLFMSSRGELRCHPMFV 249

Query: 127 DGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYH 186
           DGP+   APFHNVNCP+GFL+FN + ELRIS LPTHL+YDAPWPVRKVPL+CTPHF+ YH
Sbjct: 250 DGPIHCFAPFHNVNCPKGFLHFNKQGELRISTLPTHLTYDAPWPVRKVPLRCTPHFVNYH 309

Query: 187 LETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIP 246
           +++KTYC+VTS  +P     +F GE+KE     RDSR+I P + +F + L SP SWE IP
Sbjct: 310 VDSKTYCVVTSQPDPCNHLVRFTGEEKEYELLERDSRYIFPTMDKFSLQLLSPVSWETIP 369

Query: 247 QTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVV 306
            T   L EWEH+ CLKNV +  EGT +G++GY+ALGTNY Y EDVT RGRI++ DII+VV
Sbjct: 370 NTRVDLDEWEHLTCLKNVMLSSEGTTTGMKGYLALGTNYCYGEDVTSRGRIIILDIIDVV 429

Query: 307 PEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDT 366
           PEPGQPLTKNKIK++Y+KEQKGPVTA+  V GFL++A+GQKIYIWQLKDN+L G+AFIDT
Sbjct: 430 PEPGQPLTKNKIKIVYSKEQKGPVTALSQVVGFLLSAIGQKIYIWQLKDNELVGVAFIDT 489

Query: 367 EVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRG 426
           ++YI S+V+VKNLILVGD  +S++LLRYQ   RTLSLV+RD +P +  +  ++  N    
Sbjct: 490 QIYIHSVVTVKNLILVGDVFKSVSLLRYQEASRTLSLVSRDVRPLEVYAVEFFIDN---- 545

Query: 427 IIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEAR 486
                                               + M F+++D ++N++L+MYQPE+R
Sbjct: 546 ------------------------------------TQMSFLVTDAERNLLLYMYQPESR 569

Query: 487 ESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISD-----APGARSRFLTWYASLDGALG 541
           ES GG RL+++ DFH+G  V + F+I+C+   I+      A     R +T  A+LDG+L 
Sbjct: 570 ESCGGQRLLRRGDFHVGSPVVSMFRIKCRMGDIAKYDRRAASIVDGRHITMMATLDGSLA 629

Query: 542 FFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWK 601
           + LP+PEK YRRLLMLQNV+VT+  H  GLNP+A+R Y  +  + GNP + I+DG L+WK
Sbjct: 630 YVLPVPEKTYRRLLMLQNVLVTNIPHYAGLNPKAYRMYYSQRRFLGNPHKNILDGELIWK 689

Query: 602 FLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
           F+ LS  ER E+ KKIG+    I D+L +IE  ++HF
Sbjct: 690 FMHLSFMERSELSKKIGTTVTQITDDLLEIETYTAHF 726


>gi|427780291|gb|JAA55597.1| Putative mrna cleavage and polyadenylation factor ii complex subunit
            cft1 cpsf subunit [Rhipicephalus pulchellus]
          Length = 1237

 Score =  659 bits (1699), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 328/637 (51%), Positives = 436/637 (68%), Gaps = 55/637 (8%)

Query: 16   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAF----RHPKGALKLRFKKLK-VLFVSDR 70
            +V E+L V LG+  +RPLLL R   +LLIY+AF       +G LKLRFKK+   +F+ +R
Sbjct: 642  VVHEILVVGLGIRHSRPLLLARVDEDLLIYEAFPFYETQREGHLKLRFKKMSHDIFLRER 701

Query: 71   SKRANEQPGLPRGVRISQMRY----FSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTI 126
             K   ++P      +  Q R     FS+I+GY GVFLCG  P WLF++SRGELR HPM +
Sbjct: 702  -KYKTQKPENEEEEKAFQSRQWLHPFSDISGYSGVFLCGYRPYWLFMSSRGELRCHPMFV 760

Query: 127  DGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYH 186
            DGP+   APFHNVNCP+GFL+FN + ELRIS LPTHL+YDAPWPVRKVPL+CTPHF+ YH
Sbjct: 761  DGPIHCFAPFHNVNCPKGFLHFNKQGELRISTLPTHLTYDAPWPVRKVPLRCTPHFVNYH 820

Query: 187  LETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIP 246
            +++KTYC+VTS  +P     +F GE+KE     RDSR+I P + +F + L SP SWE IP
Sbjct: 821  VDSKTYCVVTSQPDPCNHLVRFTGEEKEYELLERDSRYIFPTMDKFSLQLLSPVSWETIP 880

Query: 247  QTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVV 306
             T   L EWEH+ CLKNV +  EGT +G++GY+ALGTNY Y EDVT RGRI++ DII+VV
Sbjct: 881  NTRVDLDEWEHLTCLKNVMLSSEGTTTGMKGYLALGTNYCYGEDVTSRGRIIILDIIDVV 940

Query: 307  PEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDT 366
            PEPGQPLTKNKIK++Y+KEQKGPVTA+  V GFL++A+GQKIYIWQLKDN+L G+AFIDT
Sbjct: 941  PEPGQPLTKNKIKIVYSKEQKGPVTALSQVVGFLLSAIGQKIYIWQLKDNELVGVAFIDT 1000

Query: 367  EVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRG 426
            ++YI S+V+VKNLILVGD  +S++LLRYQ   RTLSLV+RD +P +  +  ++  N    
Sbjct: 1001 QIYIHSVVTVKNLILVGDVFKSVSLLRYQEASRTLSLVSRDVRPLEVYAVEFFIDN---- 1056

Query: 427  IIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEAR 486
                                                + M F+++D ++N++L+MYQPE+R
Sbjct: 1057 ------------------------------------TQMSFLVTDAERNLLLYMYQPESR 1080

Query: 487  ESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISD-----APGARSRFLTWYASLDGALG 541
            ES GG RL+++ DFH+G  V + F+I+C+   I+      A     R +T  A+LDG+L 
Sbjct: 1081 ESCGGQRLLRRGDFHVGSPVVSMFRIKCRMGDIAKYDRRAASIVDGRHITMMATLDGSLA 1140

Query: 542  FFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWK 601
            + LP+PEK YRRLLMLQNV+VT+  H  GLNP+A+R Y  +  + GNP + I+DG L+WK
Sbjct: 1141 YVLPVPEKTYRRLLMLQNVLVTNIPHYAGLNPKAYRMYYSQRRFLGNPHKNILDGELIWK 1200

Query: 602  FLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
            F+ LS  ER E+ KKIG+    I D+L +IE  ++HF
Sbjct: 1201 FMHLSFMERSELSKKIGTTVTQITDDLLEIETYTAHF 1237


>gi|432883539|ref|XP_004074300.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Oryzias latipes]
          Length = 1456

 Score =  640 bits (1652), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 309/640 (48%), Positives = 424/640 (66%), Gaps = 57/640 (8%)

Query: 16   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAF----RHPKGALKLRFKKL--------- 62
            +V+E+  V+LG + +RP LLV  ++ELL+Y+AF    + P+  LK+RFKK+         
Sbjct: 857  LVKEVALVALGNNRSRPYLLVHVENELLVYEAFPYDQQQPQNNLKVRFKKVPHSINFREK 916

Query: 63   -KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRA 121
               L    +++    +  +    RIS+ RYF +I+GY GVF+CGP P W+ +TSRG LR 
Sbjct: 917  KPKLKKDKKAEGGGPEENVAVKSRISRFRYFEDISGYSGVFICGPSPHWMLITSRGGLRL 976

Query: 122  HPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPH 181
            HPMTIDGP+ + +PFHN+NCP+GFLYFN + ELRISVLPT+LSYDAPWPVRK+PL+CT H
Sbjct: 977  HPMTIDGPIESFSPFHNINCPKGFLYFNKQGELRISVLPTYLSYDAPWPVRKIPLRCTVH 1036

Query: 182  FLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFS 241
            F++YH+E+K Y + TS  E  T   +  GE+KE  T  RD R+I PL  +F + L SP S
Sbjct: 1037 FVSYHVESKVYAVCTSVKELCTRIPRMTGEEKEFETIERDERYINPLQEKFSIQLISPVS 1096

Query: 242  WEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFD 301
            WE IP T   L EWEHV C+K V++  + T+SGL+GYIA GT     E+VTCRGRIL+ D
Sbjct: 1097 WETIPNTRIDLEEWEHVTCMKTVALRSQETVSGLKGYIAAGTCVLQGEEVTCRGRILILD 1156

Query: 302  IIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGI 361
            +IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH  G+LV+A+GQKI++W LKDNDLTG+
Sbjct: 1157 VIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCHGYLVSAIGQKIFLWALKDNDLTGM 1216

Query: 362  AFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAG 421
            AFIDT++YI  M+S+KN IL  D  +SI+LLRYQ E +TLSLV+RD KP +  S  +   
Sbjct: 1217 AFIDTQLYIHQMISIKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSIEFIVD 1276

Query: 422  NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMY 481
            N                                        + +GF++SD+DKN+ ++MY
Sbjct: 1277 N----------------------------------------NQLGFLVSDRDKNLFVYMY 1296

Query: 482  QPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGA---RSRFLTWYASLDG 538
             PEA+ES GG RL+++ DF+ G H+N+ +++ C+ +  S +  A    ++ +TW+A+LDG
Sbjct: 1297 LPEAKESFGGMRLLRRADFNAGAHINSLWRMPCRGALDSGSKKALTWDNKHITWFATLDG 1356

Query: 539  ALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSL 598
             +G  LP+ EK YRRLLMLQN + T   H  GLNP+AFR          N  + I+DG L
Sbjct: 1357 GIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPKAFRMMHSNRRSLQNAVKNILDGEL 1416

Query: 599  VWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
            + K+L LS  ER E+ KKIG+  + ILD+L +I+ +++HF
Sbjct: 1417 LAKYLYLSTMERSELAKKIGTTQDIILDDLLEIDRVTAHF 1456


>gi|229335612|ref|NP_001108153.2| cleavage and polyadenylation specificity factor subunit 1 [Danio
            rerio]
          Length = 1449

 Score =  640 bits (1652), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 311/642 (48%), Positives = 423/642 (65%), Gaps = 56/642 (8%)

Query: 13   DETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAF----RHPKGALKLRFKKLKVLFVS 68
            D  +V+E+  VSLG + +RP LL   + ELLIY+AF    +  +  LK+RFKK+      
Sbjct: 848  DIPLVKEVALVSLGYNHSRPYLLAHVEQELLIYEAFPYDQQQAQSNLKVRFKKMPHNINY 907

Query: 69   DRSKRANEQPGLPRGV---------RISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGEL 119
               K    +   P G          R+++ RYF +I+GY GVF+CGP P W+ +TSRG +
Sbjct: 908  REKKVKVRKDKKPEGQGEDTLGVKGRVARFRYFQDISGYSGVFICGPSPHWMLVTSRGAM 967

Query: 120  RAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCT 179
            R HPMTIDG + + +PFHN+NCP+GFLYFN + ELRISVLPT+LSYDAPWPVRK+PL+CT
Sbjct: 968  RLHPMTIDGAIESFSPFHNINCPKGFLYFNKQGELRISVLPTYLSYDAPWPVRKIPLRCT 1027

Query: 180  PHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSP 239
             H+++YH+E+K Y + TS  EP T   +  GE+KE  T  RD R+I P   +F + L SP
Sbjct: 1028 VHYVSYHVESKVYAVCTSVKEPCTRIPRMTGEEKEFETIERDERYIHPQQDKFSIQLISP 1087

Query: 240  FSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILL 299
             SWE IP T   L EWEHV C+K V+++ + T+SGL+GY+ALGT     E+VTCRGRIL+
Sbjct: 1088 VSWEAIPNTRVDLEEWEHVTCMKTVALKSQETVSGLKGYVALGTCLMQGEEVTCRGRILI 1147

Query: 300  FDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLT 359
             D+IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH +GFLV+A+GQKI++W LKDNDLT
Sbjct: 1148 LDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCSGFLVSAIGQKIFLWSLKDNDLT 1207

Query: 360  GIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYY 419
            G+AFIDT++YI  M S+KN IL  D  +SI+LLRYQPE +TLSLV+RD KP +  S  + 
Sbjct: 1208 GMAFIDTQLYIHQMYSIKNFILAADVMKSISLLRYQPESKTLSLVSRDAKPLEVYSIEFM 1267

Query: 420  AGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLF 479
              N                                        + +GF++SD+DKN++++
Sbjct: 1268 VDN----------------------------------------NQLGFLVSDRDKNLMVY 1287

Query: 480  MYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCK---PSSISDAPGARSRFLTWYASL 536
            MY PEA+ES GG RL+++ DF++G HVN F+++ C+    ++   A    ++ +TW+A+L
Sbjct: 1288 MYLPEAKESFGGMRLLRRADFNVGSHVNAFWRMPCRGTLDTANKKALTWDNKHITWFATL 1347

Query: 537  DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDG 596
            DG +G  LP+ EK YRRLLMLQN + T   H  GLNP+AFR          N  + I+DG
Sbjct: 1348 DGGVGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPKAFRMLHCDRRTLQNAVKNILDG 1407

Query: 597  SLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
             L+ K+L LS  ER E+ KKIG+  + ILD+L +IE +++HF
Sbjct: 1408 ELLNKYLYLSTMERSELAKKIGTTPDIILDDLLEIERVTAHF 1449


>gi|348512553|ref|XP_003443807.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Oreochromis niloticus]
          Length = 1456

 Score =  639 bits (1648), Expect = e-180,   Method: Compositional matrix adjust.
 Identities = 310/641 (48%), Positives = 424/641 (66%), Gaps = 59/641 (9%)

Query: 16   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRH----PKGALKLRFKKL--------- 62
            +V+E+  VSLG + ++P LLV  + ELLIY+AF++    P+  LK+RFKK+         
Sbjct: 857  LVKEVALVSLGNNHSKPYLLVHVEQELLIYEAFQYDQQQPQNNLKVRFKKVPHNINFREK 916

Query: 63   --KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELR 120
              K+         A E+    +G RI++ R+F +I+GY GVF+CGP P W+ +TSRG LR
Sbjct: 917  KSKLKKDKKAESSATEESSGVKG-RIARFRFFEDISGYSGVFICGPSPHWMLVTSRGALR 975

Query: 121  AHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTP 180
             HPMTIDG + + +PFHN+NCP+GFLYFN + ELRISVLPT+LSYDAPWPVRK+PL+CT 
Sbjct: 976  LHPMTIDGSIESFSPFHNINCPKGFLYFNKQGELRISVLPTYLSYDAPWPVRKIPLRCTV 1035

Query: 181  HFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPF 240
            H+++YH+E+K Y + TS  EP T   +  GE+KE     RD R+I P   +F + L SP 
Sbjct: 1036 HYVSYHVESKVYAVCTSVKEPCTRIPRMTGEEKEYEVIERDERYIHPQQEKFSIQLISPV 1095

Query: 241  SWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLF 300
            SWE IP T   L EWEHV C+K V++  + T+SGL+GYIA GT     E+VTCRGRIL+ 
Sbjct: 1096 SWEAIPNTRIDLEEWEHVTCMKTVALRSQETVSGLKGYIAAGTCLMQGEEVTCRGRILIL 1155

Query: 301  DIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTG 360
            D+IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH  G+LV+A+GQKI++W LKDNDLTG
Sbjct: 1156 DVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGYLVSAIGQKIFLWVLKDNDLTG 1215

Query: 361  IAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYA 420
            +AFIDT++YI  M S+KN IL  D  +SI+LLRYQ E +TLSLV+RD KP +  S  +  
Sbjct: 1216 MAFIDTQLYIHQMFSIKNFILAADLMKSISLLRYQEESKTLSLVSRDAKPLEVYSIEFMV 1275

Query: 421  GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFM 480
             N                                        + +GF++SD+DKN+ ++M
Sbjct: 1276 DN----------------------------------------NQLGFLVSDRDKNLYVYM 1295

Query: 481  YQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCK---PSSISDAPGARSRFLTWYASLD 537
            Y PEA+ES GG RL+++ DF+ G ++NTF+++ C+    +S   A    ++ +TW+A+LD
Sbjct: 1296 YLPEAKESFGGMRLLRRADFNAGANINTFWRMPCRGALDASSKKALTWDNKHITWFATLD 1355

Query: 538  GALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGS 597
            G +G  LP+ EK YRRLLMLQN + T   H  GLNP+AFR          NP + I+DG 
Sbjct: 1356 GGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPKAFRMLHSDRRSLQNPVKNILDGE 1415

Query: 598  LVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
            L+ K+L LS+ ER E+ KKIG+  + ILD+L +I+ +++HF
Sbjct: 1416 LLNKYLYLSMMERSELAKKIGTTQDIILDDLLEIDRVTAHF 1456


>gi|49619065|gb|AAT68117.1| cleavage and polyadenylation specific factor 1 [Danio rerio]
          Length = 1105

 Score =  637 bits (1642), Expect = e-180,   Method: Compositional matrix adjust.
 Identities = 310/642 (48%), Positives = 421/642 (65%), Gaps = 56/642 (8%)

Query: 13   DETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAF----RHPKGALKLRFKKLKVLFVS 68
            D  +V+E+  VSLG   +RP LL   + ELLIY+AF    +  +  LK+RFKK+      
Sbjct: 504  DIPLVKEVALVSLGYSHSRPYLLAHVEQELLIYEAFPYDQQQAQSNLKVRFKKMPHNINY 563

Query: 69   DRSKRANEQPGLPRGV---------RISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGEL 119
               K    +   P G          R+++ RYF +I+GY GVF+CGP P W+ +TSRG +
Sbjct: 564  REKKVKVRKDKKPEGQGEDSLGVKGRVARFRYFQDISGYSGVFICGPSPHWMLVTSRGAM 623

Query: 120  RAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCT 179
            R HPMTIDG + + +PFHN+NCP+GFLYFN + ELRISVLPT+LSYDAPWPVRK+PL+CT
Sbjct: 624  RLHPMTIDGAIESFSPFHNINCPKGFLYFNKQGELRISVLPTYLSYDAPWPVRKIPLRCT 683

Query: 180  PHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSP 239
             H+++YH+E+K Y + TS  EP T   +  GE+KE  T  RD R+I P   +F + L SP
Sbjct: 684  VHYVSYHVESKVYAVCTSVKEPCTRIPRMTGEEKEFETIERDERYIHPQQDKFSIQLISP 743

Query: 240  FSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILL 299
             SWE IP T   L EWEHV C+K V+++ + T+SGL+GY+ALGT     E+VTCRGRIL+
Sbjct: 744  VSWEAIPNTRVDLEEWEHVTCMKTVALKSQETVSGLKGYVALGTCLMQGEEVTCRGRILI 803

Query: 300  FDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLT 359
             D+IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH +GFLV+A+GQKI++W LK NDLT
Sbjct: 804  LDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCSGFLVSAIGQKIFLWSLKYNDLT 863

Query: 360  GIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYY 419
            G+AFIDT++YI  M S+KN IL  D  +SI+LLRYQPE +TLSLV+RD KP +  S  + 
Sbjct: 864  GMAFIDTQLYIHQMYSIKNFILAADVMKSISLLRYQPESKTLSLVSRDAKPLEVYSIEFM 923

Query: 420  AGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLF 479
              N                                        + +GF++SD+DKN++++
Sbjct: 924  VDN----------------------------------------NQLGFLVSDRDKNLMVY 943

Query: 480  MYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCK---PSSISDAPGARSRFLTWYASL 536
            MY PEA+ES GG RL+++ DF++G HVN F+++ C+    ++   A    ++ +TW+A+L
Sbjct: 944  MYLPEAKESFGGMRLLRRADFNVGSHVNAFWRMPCRGTLDTANKKALTWDNKHITWFATL 1003

Query: 537  DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDG 596
            DG +G  LP+ EK YRRLLMLQN + T   H  GLNP+AFR          N  + I+DG
Sbjct: 1004 DGGVGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPKAFRMLHCDRRTLQNAVKNILDG 1063

Query: 597  SLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
             L+ K+L LS  ER E+ KKIG+  + ILD+L +IE +++HF
Sbjct: 1064 ELLNKYLYLSTMERSELAKKIGTTPDIILDDLLEIERVTAHF 1105


>gi|27807297|ref|NP_777145.1| cleavage and polyadenylation specificity factor subunit 1 [Bos
            taurus]
 gi|1706101|sp|Q10569.1|CPSF1_BOVIN RecName: Full=Cleavage and polyadenylation specificity factor subunit
            1; AltName: Full=Cleavage and polyadenylation specificity
            factor 160 kDa subunit; Short=CPSF 160 kDa subunit
 gi|929007|emb|CAA58152.1| cleavage and polyadenylation specificity factor, 160 kDa subunit [Bos
            taurus]
 gi|296480730|tpg|DAA22845.1| TPA: cleavage and polyadenylation specificity factor subunit 1 [Bos
            taurus]
          Length = 1444

 Score =  634 bits (1635), Expect = e-179,   Method: Compositional matrix adjust.
 Identities = 311/642 (48%), Positives = 415/642 (64%), Gaps = 62/642 (9%)

Query: 16   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGALKLRFKKL--------- 62
            +V+E+L V+LG    RP LLV    ELLIY+AF H     +G LK+RFKK+         
Sbjct: 846  LVKEVLLVALGSRQRRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKVPHNINFREK 905

Query: 63   -KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRA 121
                        + E+   PRG R+++ RYF +I GY GVF+CGP P WL +T RG LR 
Sbjct: 906  KPKPSKKKAEGGSTEEGTGPRG-RVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRL 964

Query: 122  HPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPH 181
            HPM IDGP+ + APFHN+NCPRGFLYFN + ELRISVLP +LSYDAPWPVRK+PL+CT H
Sbjct: 965  HPMGIDGPIDSFAPFHNINCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAH 1024

Query: 182  FLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFS 241
            ++AYH+E+K Y + TST+ P T   +  GE+KE  T  RD R++ P    F + L SP S
Sbjct: 1025 YVAYHVESKVYAVATSTSTPCTRVPRMTGEEKEFETIERDERYVHPQQEAFCIQLISPVS 1084

Query: 242  WEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFD 301
            WE IP     L EWEHV C+K VS+  E T+SGL+GY+A GT     E+VTCRGRIL+ D
Sbjct: 1085 WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMD 1144

Query: 302  IIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGI 361
            +IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH  G LV+A+GQKI++W L+ ++LTG+
Sbjct: 1145 VIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGM 1204

Query: 362  AFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAG 421
            AFIDT++YI  M+SVKN IL  D  +SI+LLRYQ E +TLSLV+RD KP +  S  +   
Sbjct: 1205 AFIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVD 1264

Query: 422  NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMY 481
            N                                        + +GF++SD+D+N++++MY
Sbjct: 1265 N----------------------------------------AQLGFLVSDRDRNLMVYMY 1284

Query: 482  QPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARS-----RFLTWYASL 536
             PEA+ES GG RL+++ DFH+G HVNTF++  C+    ++ P  +S     + +TW+A+L
Sbjct: 1285 LPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCR--GAAEGPSKKSVVWENKHITWFATL 1342

Query: 537  DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDG 596
            DG +G  LP+ EK YRRLLMLQN + T   H  GLNPRAFR          N  R ++DG
Sbjct: 1343 DGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRVLQNAVRNVLDG 1402

Query: 597  SLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
             L+ ++L LS  ER E+ KKIG+  + ILD+L + + +++HF
Sbjct: 1403 ELLNRYLYLSTMERGELAKKIGTTPDIILDDLLETDRVTAHF 1444


>gi|431908146|gb|ELK11749.1| Cleavage and polyadenylation specificity factor subunit 1 [Pteropus
           alecto]
          Length = 820

 Score =  634 bits (1635), Expect = e-179,   Method: Compositional matrix adjust.
 Identities = 317/642 (49%), Positives = 421/642 (65%), Gaps = 63/642 (9%)

Query: 16  IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGALKLRFKKL--KVLFVSD 69
           +V+E+L V+LG   +RP LLV    ELL+Y+AF H     +G LK+RFKK+   + F   
Sbjct: 223 LVKEVLLVALGSRQSRPYLLVHVDQELLVYEAFPHDSQLGQGNLKVRFKKVPHNINFREK 282

Query: 70  R---SKR-----ANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRA 121
           +   SK+     A E PG  RG R+++ RYF +I GY GVF+CGP P WL +T RG LR 
Sbjct: 283 KPRPSKKKAEGGAEEGPG-ARG-RVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRL 340

Query: 122 HPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPH 181
           HPM IDGP+ + APFHNVNCPRGFLYFN + ELRISVLP +LSYDAPWPVRK+PL+CT H
Sbjct: 341 HPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAH 400

Query: 182 FLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFS 241
           ++AYH+E+K Y + TST  P T   +  GE+KE  T  RD R+I P    F + L SP S
Sbjct: 401 YVAYHVESKVYAVATSTNTPCTRIPRMTGEEKEFETIERDDRYIHPQQEAFSIQLISPVS 460

Query: 242 WEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFD 301
           WE IP     L EWEHV C+K VS+  E T+SGL+GY+A GT     E+VTCRGRIL+ D
Sbjct: 461 WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMD 520

Query: 302 IIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGI 361
           +IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH  G LV+A+GQKI++W L+ ++LTG+
Sbjct: 521 VIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGM 580

Query: 362 AFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAG 421
           AFIDT++YI  M+SVKN IL  D  +SI+LLRYQ E +TLSLV+RD KP +  S  +   
Sbjct: 581 AFIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVD 640

Query: 422 NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMY 481
           N                                        + +GF++SD+D+N++++MY
Sbjct: 641 N----------------------------------------AQLGFLVSDRDRNLMVYMY 660

Query: 482 QPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARS-----RFLTWYASL 536
            PEA+ES GG RL+++ DFH+G HVNTF++  C+    ++ P  +S     + +TW+A+L
Sbjct: 661 LPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCR--GAAEGPSKKSVVWENKHITWFATL 718

Query: 537 DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDG 596
           DG +G  LP+ EK YRRLLMLQN + T   H  GLNPRAFR          N  R ++DG
Sbjct: 719 DGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRTLQNAVRNVLDG 778

Query: 597 SLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
            L+ ++L LS  ER E+ KKIG+  + ILD+L + + +++HF
Sbjct: 779 ELLNRYLYLSTMERGELAKKIGTTPDIILDDLLETDRVTAHF 820


>gi|158287218|ref|XP_309311.4| AGAP011340-PA [Anopheles gambiae str. PEST]
 gi|157019545|gb|EAA05261.4| AGAP011340-PA [Anopheles gambiae str. PEST]
          Length = 1434

 Score =  634 bits (1634), Expect = e-179,   Method: Compositional matrix adjust.
 Identities = 306/637 (48%), Positives = 424/637 (66%), Gaps = 60/637 (9%)

Query: 18   QELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGALKLRFKKLKVLFV---------- 67
            +E+L V+LG +G+RPLL +R +H+LLIY+ FR+ KG LKLRFK+L               
Sbjct: 836  KEILMVALGSYGSRPLLFIRLEHDLLIYRVFRYSKGHLKLRFKRLSTSVTCPVFRTPEPS 895

Query: 68   -SDRSKRANEQPGLPRGVR-----ISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRA 121
             +  ++ ANEQ    R  +     IS +RYF+N++GY GV +CG  P +LFLT+ GELR+
Sbjct: 896  GAGATEAANEQQQ-ARATKVLYENISMIRYFANVSGYAGVAVCGEKPYFLFLTAHGELRS 954

Query: 122  HPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPH 181
            H +     +   APF+NVNCP GFLYF+ + EL+IS+ PT+LSYD+ WPVRK+PL+ +P 
Sbjct: 955  HRLYARTVMKAFAPFNNVNCPNGFLYFDEQYELKISIFPTYLSYDSVWPVRKIPLRSSPK 1014

Query: 182  FLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFS 241
             + YH E K YC+V    E    YY+FNGEDKEL  + +  RF+ P+  +F V L +P +
Sbjct: 1015 QIVYHRENKVYCVVMDAEEICNKYYRFNGEDKELTEENKGERFLYPMGHRFSVVLVTPAA 1074

Query: 242  WEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFD 301
            WE +P+T+  L EWEHV+ LKNVS+ YEG  SGL+ YIA+GTN+NYSED+T RGR+LL+D
Sbjct: 1075 WEVVPETSINLEEWEHVIALKNVSLTYEGARSGLKEYIAVGTNFNYSEDITSRGRLLLYD 1134

Query: 302  IIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGI 361
            IIEVVPEPG+PLTK+K K +  K+QKGPV+AI HV GFLV AVGQK+Y+WQ+KD+DL G+
Sbjct: 1135 IIEVVPEPGKPLTKHKFKEVIVKDQKGPVSAISHVCGFLVGAVGQKVYLWQMKDDDLVGV 1194

Query: 362  AFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAG 421
            AFIDT +++  MVS+K+LILV D  +S++LLR+Q EYRTLS+V+RDY P       Y   
Sbjct: 1195 AFIDTNIFVHQMVSIKSLILVADVYKSVSLLRFQEEYRTLSVVSRDYHPLNVFQVEYVVD 1254

Query: 422  NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMY 481
            N                                        +++GF++SD   N++ +MY
Sbjct: 1255 N----------------------------------------ANLGFLVSDDQCNLITYMY 1274

Query: 482  QPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC---KPSSISDAPGARSRFLTWYASLDG 538
            QPE+RES GG RL++K+D+HLGQ VN  F+++C   +   +       ++  T++A+LDG
Sbjct: 1275 QPESRESFGGQRLLRKSDYHLGQQVNCMFRVQCDFHETDVMKRTLNYDNKHTTFFATLDG 1334

Query: 539  ALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSL 598
             +GF LPLPEK YRRL MLQNV++TH+ HT GLNP+A+RT K       NPSR ++DG L
Sbjct: 1335 GIGFVLPLPEKTYRRLFMLQNVLLTHSPHTCGLNPKAYRTIKQTRKLPINPSRCVVDGDL 1394

Query: 599  VWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALS 635
            VW FL+L   E+ E+ KKIG++  +I  +L +IE ++
Sbjct: 1395 VWSFLELPANEKHEVAKKIGTRIEEICADLMEIEHVT 1431


>gi|358415280|ref|XP_003583063.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
            [Bos taurus]
          Length = 1490

 Score =  633 bits (1633), Expect = e-179,   Method: Compositional matrix adjust.
 Identities = 311/642 (48%), Positives = 415/642 (64%), Gaps = 62/642 (9%)

Query: 16   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGALKLRFKKL--------- 62
            +V+E+L V+LG    RP LLV    ELLIY+AF H     +G LK+RFKK+         
Sbjct: 892  LVKEVLLVALGSRQRRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKVPHNINFREK 951

Query: 63   -KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRA 121
                        + E+   PRG R+++ RYF +I GY GVF+CGP P WL +T RG LR 
Sbjct: 952  KPKPSKKKAEGGSTEEGTGPRG-RVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRL 1010

Query: 122  HPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPH 181
            HPM IDGP+ + APFHN+NCPRGFLYFN + ELRISVLP +LSYDAPWPVRK+PL+CT H
Sbjct: 1011 HPMGIDGPIDSFAPFHNINCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAH 1070

Query: 182  FLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFS 241
            ++AYH+E+K Y + TST+ P T   +  GE+KE  T  RD R++ P    F + L SP S
Sbjct: 1071 YVAYHVESKVYAVATSTSTPCTRVPRMTGEEKEFETIERDERYVHPQQEAFCIQLISPVS 1130

Query: 242  WEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFD 301
            WE IP     L EWEHV C+K VS+  E T+SGL+GY+A GT     E+VTCRGRIL+ D
Sbjct: 1131 WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMD 1190

Query: 302  IIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGI 361
            +IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH  G LV+A+GQKI++W L+ ++LTG+
Sbjct: 1191 VIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGM 1250

Query: 362  AFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAG 421
            AFIDT++YI  M+SVKN IL  D  +SI+LLRYQ E +TLSLV+RD KP +  S  +   
Sbjct: 1251 AFIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVD 1310

Query: 422  NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMY 481
            N                                        + +GF++SD+D+N++++MY
Sbjct: 1311 N----------------------------------------AQLGFLVSDRDRNLMVYMY 1330

Query: 482  QPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARS-----RFLTWYASL 536
             PEA+ES GG RL+++ DFH+G HVNTF++  C+    ++ P  +S     + +TW+A+L
Sbjct: 1331 LPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCR--GAAEGPSKKSVVWENKHITWFATL 1388

Query: 537  DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDG 596
            DG +G  LP+ EK YRRLLMLQN + T   H  GLNPRAFR          N  R ++DG
Sbjct: 1389 DGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRVLQNAVRNVLDG 1448

Query: 597  SLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
             L+ ++L LS  ER E+ KKIG+  + ILD+L + + +++HF
Sbjct: 1449 ELLNRYLYLSTMERGELAKKIGTTPDIILDDLLETDRVTAHF 1490


>gi|395860104|ref|XP_003802355.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
            [Otolemur garnettii]
          Length = 1441

 Score =  633 bits (1632), Expect = e-178,   Method: Compositional matrix adjust.
 Identities = 311/641 (48%), Positives = 417/641 (65%), Gaps = 60/641 (9%)

Query: 16   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGALKLRFKKL--------- 62
            +V+E+L V+LG   +RP LLV    ELLIY+AF H     +G LK+RFKK+         
Sbjct: 843  LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKVPHNINFREK 902

Query: 63   KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAH 122
            K      +++  + + G     R+++ RYF +I GY GVF+CGP P WL +T RG LR H
Sbjct: 903  KPKPSKKKAEGGSTEEGAGVRGRVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRLH 962

Query: 123  PMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHF 182
            PM IDGP+ + APFHNVNCPRGFLYFN + ELRISVLP +LSYDAPWPVRK+PL+CT H+
Sbjct: 963  PMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHY 1022

Query: 183  LAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSW 242
            +AYH+E+K Y + TST  P T   +  GE+KE  T  RD R+I P    F + L SP SW
Sbjct: 1023 VAYHVESKVYAVATSTNTPCTRIPRMTGEEKEFETIERDDRYIHPQQEAFSIQLISPVSW 1082

Query: 243  EEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDI 302
            E IP     L EWEHV C+K VS+  E T+SGL+GY+A GT     E+VTCRGRIL+ D+
Sbjct: 1083 EAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDV 1142

Query: 303  IEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIA 362
            IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH  G LV+A+GQKI++W L+ ++LTG+A
Sbjct: 1143 IEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMA 1202

Query: 363  FIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGN 422
            FIDT++YI  M+SVKN IL  D  +SI+LLRYQ E +TLSLV+RD KP +  S  +   N
Sbjct: 1203 FIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDN 1262

Query: 423  PSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQ 482
                                                    + +GF++SD+D+N++++MY 
Sbjct: 1263 ----------------------------------------AQLGFLVSDRDRNLMVYMYL 1282

Query: 483  PEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARS-----RFLTWYASLD 537
            PEA+ES GG RL+++ DFH+G HVNTF++  C+ +  ++ P  +S     + +TW+A+LD
Sbjct: 1283 PEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGA--TEGPSKKSVVWENKHITWFATLD 1340

Query: 538  GALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGS 597
            G +G  LP+ EK YRRLLMLQN + T   H  GLNPRAFR          N  R ++DG 
Sbjct: 1341 GGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRVLQNAVRNVLDGE 1400

Query: 598  LVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
            L+ ++L LS  ER E+ KKIG+  + ILD+L + + +++HF
Sbjct: 1401 LLNRYLYLSTMERGELAKKIGTTPDIILDDLLETDRVTAHF 1441


>gi|348555854|ref|XP_003463738.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
            isoform 1 [Cavia porcellus]
          Length = 1440

 Score =  632 bits (1630), Expect = e-178,   Method: Compositional matrix adjust.
 Identities = 316/658 (48%), Positives = 420/658 (63%), Gaps = 65/658 (9%)

Query: 2    GNFRSHSPSAMDET-IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGALK 56
            G  R    +   E  +V+E+L V+LG   +RP LLV    ELLIY+AF H     +G LK
Sbjct: 827  GEVRKEEATRQGELPLVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLK 886

Query: 57   LRFKKL-----------KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCG 105
            +RFKK+           K           +E  G+ RG R+++ RYF +I GY GVF+CG
Sbjct: 887  VRFKKVPHNINFREKKPKPSKKKAEGGSTDEGSGV-RG-RVARFRYFEDIYGYSGVFICG 944

Query: 106  PHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSY 165
            P P WL +T RG LR HPM IDGP+ + APFHNVNCPRGFLYFN + ELRISVLP +LSY
Sbjct: 945  PSPHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSY 1004

Query: 166  DAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFI 225
            DAPWPVRK+PL+CT H++AYH+E+K Y + TST+ P T   +  GE+KE     RD R+I
Sbjct: 1005 DAPWPVRKIPLRCTAHYVAYHVESKVYAVATSTSTPCTRIPRMTGEEKEFEAIERDDRYI 1064

Query: 226  PPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNY 285
             P    F + L SP SWE IP     L EWEHV C+K VS+  E T+SGL+GY+A GT  
Sbjct: 1065 HPQQEAFSIQLISPVSWEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCL 1124

Query: 286  NYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVG 345
               E+VTCRGRIL+ D+IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH  G LV+A+G
Sbjct: 1125 MQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIG 1184

Query: 346  QKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVA 405
            QKI++W L+ ++LTG+AFIDT++YI  M+SVKN IL  D  +SI+LLRYQ E +TLSLV+
Sbjct: 1185 QKIFLWSLRASELTGMAFIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVS 1244

Query: 406  RDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSM 465
            RD KP +  S  +   N                                        + +
Sbjct: 1245 RDAKPLEVYSVDFMVDN----------------------------------------AQL 1264

Query: 466  GFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGA 525
            GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF++  C+    ++ P  
Sbjct: 1265 GFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCR--GATEGPSK 1322

Query: 526  RS-----RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYK 580
            +S     + +TW+A+LDG +G  LP+ EK YRRLLMLQN + T   H  GLNPRAFR   
Sbjct: 1323 KSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLH 1382

Query: 581  GKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
                   N  R ++DG L+ ++L LS  ER E+ KKIG+  + ILD+L + + +++HF
Sbjct: 1383 VDRRILQNAVRNVLDGELLNRYLYLSTMERGELAKKIGTTPDIILDDLLETDRVTAHF 1440


>gi|344236599|gb|EGV92702.1| Cleavage and polyadenylation specificity factor subunit 1 [Cricetulus
            griseus]
          Length = 1419

 Score =  631 bits (1627), Expect = e-178,   Method: Compositional matrix adjust.
 Identities = 310/641 (48%), Positives = 416/641 (64%), Gaps = 60/641 (9%)

Query: 16   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGALKLRFKKL--------- 62
            +V+E+L V+LG   +RP LLV    ELLIY+AF H     +G LK+RFKK+         
Sbjct: 821  LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKVPHNINFREK 880

Query: 63   KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAH 122
            K      +++  + + G     R+++ RYF +I GY GVF+CGP P WL +T RG LR H
Sbjct: 881  KPKPSKKKAEGCSTEEGSGVRGRVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRLH 940

Query: 123  PMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHF 182
            PM IDGP+ + APFHNVNCPRGFLYFN + ELRISVLP +LSYDAPWPVRK+PL+CT H+
Sbjct: 941  PMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHY 1000

Query: 183  LAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSW 242
            +AYH+E+K Y + TST  P T   +  GE+KE     RD R+I P    F + L SP SW
Sbjct: 1001 VAYHVESKVYAVATSTNTPCTRIPRMTGEEKEFEAIERDDRYIHPQQEAFSIQLISPVSW 1060

Query: 243  EEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDI 302
            E IP     L EWEHV C+K VS+  E T+SGL+GY+A GT     E+VTCRGRIL+ D+
Sbjct: 1061 EAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDV 1120

Query: 303  IEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIA 362
            IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH  G LV+A+GQKI++W L+ ++LTG+A
Sbjct: 1121 IEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMA 1180

Query: 363  FIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGN 422
            FIDT++YI  M+SVKN IL  D  +SI+LLRYQ E +TLSLV+RD KP +  S  +   N
Sbjct: 1181 FIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDN 1240

Query: 423  PSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQ 482
                                                    + +GF++SD+D+N++++MY 
Sbjct: 1241 ----------------------------------------AQLGFLVSDRDRNLMVYMYL 1260

Query: 483  PEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARS-----RFLTWYASLD 537
            PEA+ES GG RL+++ DFH+G HVNTF++  C+ +  ++ P  +S     + +TW+A+LD
Sbjct: 1261 PEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGA--AEGPSKKSVMWENKHITWFATLD 1318

Query: 538  GALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGS 597
            G +G  LP+ EK YRRLLMLQN + T   H  GLNPRAFR          N  R ++DG 
Sbjct: 1319 GGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGE 1378

Query: 598  LVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
            L+ ++L LS  ER E+ KKIG+  + ILD+L + + +++HF
Sbjct: 1379 LLNRYLYLSTMERSELAKKIGTTPDIILDDLLETDRVTAHF 1419


>gi|354491122|ref|XP_003507705.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
            isoform 1 [Cricetulus griseus]
          Length = 1441

 Score =  631 bits (1627), Expect = e-178,   Method: Compositional matrix adjust.
 Identities = 310/641 (48%), Positives = 416/641 (64%), Gaps = 60/641 (9%)

Query: 16   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGALKLRFKKL--------- 62
            +V+E+L V+LG   +RP LLV    ELLIY+AF H     +G LK+RFKK+         
Sbjct: 843  LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKVPHNINFREK 902

Query: 63   KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAH 122
            K      +++  + + G     R+++ RYF +I GY GVF+CGP P WL +T RG LR H
Sbjct: 903  KPKPSKKKAEGCSTEEGSGVRGRVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRLH 962

Query: 123  PMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHF 182
            PM IDGP+ + APFHNVNCPRGFLYFN + ELRISVLP +LSYDAPWPVRK+PL+CT H+
Sbjct: 963  PMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHY 1022

Query: 183  LAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSW 242
            +AYH+E+K Y + TST  P T   +  GE+KE     RD R+I P    F + L SP SW
Sbjct: 1023 VAYHVESKVYAVATSTNTPCTRIPRMTGEEKEFEAIERDDRYIHPQQEAFSIQLISPVSW 1082

Query: 243  EEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDI 302
            E IP     L EWEHV C+K VS+  E T+SGL+GY+A GT     E+VTCRGRIL+ D+
Sbjct: 1083 EAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDV 1142

Query: 303  IEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIA 362
            IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH  G LV+A+GQKI++W L+ ++LTG+A
Sbjct: 1143 IEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMA 1202

Query: 363  FIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGN 422
            FIDT++YI  M+SVKN IL  D  +SI+LLRYQ E +TLSLV+RD KP +  S  +   N
Sbjct: 1203 FIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDN 1262

Query: 423  PSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQ 482
                                                    + +GF++SD+D+N++++MY 
Sbjct: 1263 ----------------------------------------AQLGFLVSDRDRNLMVYMYL 1282

Query: 483  PEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARS-----RFLTWYASLD 537
            PEA+ES GG RL+++ DFH+G HVNTF++  C+ +  ++ P  +S     + +TW+A+LD
Sbjct: 1283 PEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGA--AEGPSKKSVMWENKHITWFATLD 1340

Query: 538  GALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGS 597
            G +G  LP+ EK YRRLLMLQN + T   H  GLNPRAFR          N  R ++DG 
Sbjct: 1341 GGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGE 1400

Query: 598  LVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
            L+ ++L LS  ER E+ KKIG+  + ILD+L + + +++HF
Sbjct: 1401 LLNRYLYLSTMERSELAKKIGTTPDIILDDLLETDRVTAHF 1441


>gi|403302917|ref|XP_003942095.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
            [Saimiri boliviensis boliviensis]
          Length = 1390

 Score =  630 bits (1626), Expect = e-178,   Method: Compositional matrix adjust.
 Identities = 309/639 (48%), Positives = 416/639 (65%), Gaps = 56/639 (8%)

Query: 16   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRH----PKGALKLRFKKL--------- 62
            +V+E+L V+LG   +RP LLV    ELLIY+AF H     +G LK+RFKK+         
Sbjct: 792  LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLSQGNLKVRFKKVPHNINFREK 851

Query: 63   KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAH 122
            K      +++  + + G     R+++ RYF +I GY GVF+CGP P WL +T RG LR H
Sbjct: 852  KPKPSKKKAEGGSAEEGAGARGRVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRLH 911

Query: 123  PMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHF 182
            PM IDGPV + APFHN+NCPRGFLYFN + ELRISVLP +LSYDAPWPVRK+PL+CT H+
Sbjct: 912  PMGIDGPVDSFAPFHNINCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHY 971

Query: 183  LAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSW 242
            +AYH+E+K Y + TST  P T   +  GE+KE  T  RD R+I P    F + L SP SW
Sbjct: 972  VAYHVESKVYAVATSTNTPCTRIPRMTGEEKEFETIERDERYIHPQQEAFSIQLISPVSW 1031

Query: 243  EEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDI 302
            E IP     L EWEHV C+K VS+  E T+SGL+GY+A GT     E+VTCRGRIL+ D+
Sbjct: 1032 EAIPNARIELQEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDV 1091

Query: 303  IEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIA 362
            IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH  G LV+A+GQKI++W L+ ++LTG+A
Sbjct: 1092 IEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMA 1151

Query: 363  FIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGN 422
            FIDT++YI  M+SVKN IL  D  +SI+LLRYQ E +TLSLV+RD KP +  S  +   N
Sbjct: 1152 FIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDN 1211

Query: 423  PSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQ 482
                                                    + +GF++SD+D+N++++MY 
Sbjct: 1212 ----------------------------------------AQLGFLVSDRDRNLMVYMYL 1231

Query: 483  PEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGA---RSRFLTWYASLDGA 539
            PEA+ES GG RL+++ DFH+G HVNTF++  C+ ++   +  +    ++ +TW+A+LDG 
Sbjct: 1232 PEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGATEGLSKKSVVWENKHITWFATLDGG 1291

Query: 540  LGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLV 599
            +G  LP+ EK YRRLLMLQN + T   H  GLNPRAFR          N  R ++DG L+
Sbjct: 1292 IGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELL 1351

Query: 600  WKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
             ++L LS  ER E+ KKIG+  + ILD+L + + +++HF
Sbjct: 1352 NRYLYLSTMERSELAKKIGTTPDIILDDLLETDRVTAHF 1390


>gi|197245729|gb|AAI68713.1| Cpsf1 protein [Rattus norvegicus]
          Length = 1439

 Score =  630 bits (1626), Expect = e-178,   Method: Compositional matrix adjust.
 Identities = 310/641 (48%), Positives = 416/641 (64%), Gaps = 60/641 (9%)

Query: 16   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGALKLRFKKL--------- 62
            +V+E+L V+LG   +RP LLV    ELLIY+AF H     +G LK+RFKK+         
Sbjct: 841  LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKVPHNINFREK 900

Query: 63   KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAH 122
            K      +++  + + G     R+++ RYF +I GY GVF+CGP P WL +T RG LR H
Sbjct: 901  KPKPSKKKAEGCSTEEGSGVRGRVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRLH 960

Query: 123  PMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHF 182
            PM IDGP+ + APFHNVNCPRGFLYFN + ELRISVLP +LSYDAPWPVRK+PL+CT H+
Sbjct: 961  PMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHY 1020

Query: 183  LAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSW 242
            +AYH+E+K Y + TST  P T   +  GE+KE     RD R+I P    F + L SP SW
Sbjct: 1021 VAYHVESKVYAVATSTNTPCTRIPRMTGEEKEFEAIERDDRYIHPQQEAFSIQLISPVSW 1080

Query: 243  EEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDI 302
            E IP     L EWEHV C+K VS+  E T+SGL+GY+A GT     E+VTCRGRIL+ D+
Sbjct: 1081 EAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDV 1140

Query: 303  IEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIA 362
            IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH  G LV+A+GQKI++W L+ ++LTG+A
Sbjct: 1141 IEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMA 1200

Query: 363  FIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGN 422
            FIDT++YI  M+SVKN IL  D  +SI+LLRYQ E +TLSLV+RD KP +  S  +   N
Sbjct: 1201 FIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDN 1260

Query: 423  PSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQ 482
                                                    + +GF++SD+D+N++++MY 
Sbjct: 1261 ----------------------------------------AQLGFLVSDRDRNLMVYMYL 1280

Query: 483  PEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARS-----RFLTWYASLD 537
            PEA+ES GG RL+++ DFH+G HVNTF++  C+ +  ++ P  +S     + +TW+A+LD
Sbjct: 1281 PEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGA--AEGPSKKSVMWENKHITWFATLD 1338

Query: 538  GALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGS 597
            G +G  LP+ EK YRRLLMLQN + T   H  GLNPRAFR          N  R ++DG 
Sbjct: 1339 GGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGE 1398

Query: 598  LVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
            L+ ++L LS  ER E+ KKIG+  + ILD+L + + +++HF
Sbjct: 1399 LLNRYLYLSTMERSELAKKIGTTPDIILDDLLETDRVTAHF 1439


>gi|16751835|ref|NP_444423.1| cleavage and polyadenylation specificity factor subunit 1 isoform 2
            [Mus musculus]
 gi|17374611|sp|Q9EPU4.1|CPSF1_MOUSE RecName: Full=Cleavage and polyadenylation specificity factor subunit
            1; AltName: Full=Cleavage and polyadenylation specificity
            factor 160 kDa subunit; Short=CPSF 160 kDa subunit
 gi|11762096|gb|AAG40326.1|AF322193_1 cleavage and polyadenylation specificity factor 1 [Mus musculus]
 gi|38614159|gb|AAH56388.1| Cleavage and polyadenylation specific factor 1 [Mus musculus]
          Length = 1441

 Score =  630 bits (1625), Expect = e-178,   Method: Compositional matrix adjust.
 Identities = 313/656 (47%), Positives = 420/656 (64%), Gaps = 61/656 (9%)

Query: 2    GNFRSHSPSAMDET-IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGALK 56
            G  R    +   E  +V+E+L V+LG   +RP LLV    ELLIY+AF H     +G LK
Sbjct: 828  GEVRKEEATRQGELPLVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLK 887

Query: 57   LRFKKL---------KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPH 107
            +RFKK+         K      +++  + + G     R+++ RYF +I GY GVF+CGP 
Sbjct: 888  VRFKKVPHNINFREKKPKPSKKKAEGCSTEEGSGGRGRVARFRYFEDIYGYSGVFICGPS 947

Query: 108  PAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDA 167
            P WL +T RG LR HPM IDGP+ + APFHNVNCPRGFLYFN + ELRISVLP +LSYDA
Sbjct: 948  PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1007

Query: 168  PWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPP 227
            PWPVRK+PL+CT H++AYH+E+K Y + TST  P T   +  GE+KE     RD R+I P
Sbjct: 1008 PWPVRKIPLRCTAHYVAYHVESKVYAVATSTNTPCTRIPRMTGEEKEFEAIERDDRYIHP 1067

Query: 228  LVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNY 287
                F + L SP SWE IP     L EWEHV C+K VS+  E T+SGL+GY+A GT    
Sbjct: 1068 QQEAFSIQLISPVSWEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQ 1127

Query: 288  SEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQK 347
             E+VTCRGRIL+ D+IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH  G LV+A+GQK
Sbjct: 1128 GEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQK 1187

Query: 348  IYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARD 407
            I++W L+ ++LTG+AFIDT++YI  M+SVKN IL  D  +SI+LLRYQ E +TLSLV+RD
Sbjct: 1188 IFLWSLRASELTGMAFIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRD 1247

Query: 408  YKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGF 467
             KP +  S  +   N                                        + +GF
Sbjct: 1248 AKPLEVYSVDFMVDN----------------------------------------AQLGF 1267

Query: 468  MISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARS 527
            ++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF++  C+ +  ++ P  +S
Sbjct: 1268 LVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGA--AEGPSKKS 1325

Query: 528  -----RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGK 582
                 + +TW+A+LDG +G  LP+ EK YRRLLMLQN + T   H  GLNPRAFR     
Sbjct: 1326 VVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVD 1385

Query: 583  GYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
                 N  R ++DG L+ ++L LS  ER E+ KKIG+  + ILD+L + + +++HF
Sbjct: 1386 RRILQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDIILDDLLETDRVTAHF 1441


>gi|410987992|ref|XP_004000273.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
            [Felis catus]
          Length = 1432

 Score =  630 bits (1624), Expect = e-178,   Method: Compositional matrix adjust.
 Identities = 314/643 (48%), Positives = 416/643 (64%), Gaps = 64/643 (9%)

Query: 16   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGALKLRFKKL--------- 62
            +V+E+L V+LG   +RP LLV    ELLIY+AF H     +G LK+RFKK+         
Sbjct: 834  LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKVPHNINFREK 893

Query: 63   --KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELR 120
              K          A E  G  RG R+++ RYF +I GY GVF+CGP P WL +T RG LR
Sbjct: 894  KPKPSKKKVEGGSAEEGAG-ARG-RVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALR 951

Query: 121  AHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTP 180
             HPM IDGP+ + APFHNVNCPRGFLYFN + ELRISVLP +LSYDAPWPVRK+PL+CT 
Sbjct: 952  LHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTA 1011

Query: 181  HFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPF 240
            H++AYH+E+K Y + TST  P T   +  GE+KE  T  RD R+I P    F + L SP 
Sbjct: 1012 HYVAYHVESKVYAVATSTNMPCTRIPRMTGEEKEFETIERDDRYIHPQQEAFSIQLISPV 1071

Query: 241  SWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLF 300
            SWE IP     L EWEHV C+K VS+  E T+SGL+GY+A GT     E+VTCRGRIL+ 
Sbjct: 1072 SWEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIM 1131

Query: 301  DIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTG 360
            D+IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH  G LV+A+GQKI++W L+ ++LTG
Sbjct: 1132 DVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTG 1191

Query: 361  IAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYA 420
            +AFIDT++YI  M+SVKN IL  D  +SI+LLRYQ E +TLSLV+RD KP +  S  +  
Sbjct: 1192 MAFIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMV 1251

Query: 421  GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFM 480
             N                                        + +GF++SD+D+N++++M
Sbjct: 1252 DN----------------------------------------AQLGFLVSDRDRNLMVYM 1271

Query: 481  YQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARS-----RFLTWYAS 535
            Y PEA+ES GG RL+++ DFH+G HVNTF++  C+ +  ++ P  +S     + +TW+A+
Sbjct: 1272 YLPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGA--AEGPSKKSVVWENKHITWFAT 1329

Query: 536  LDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIID 595
            LDG +G  LP+ EK YRRLLMLQN + T   H  GLNPRAFR          N  R ++D
Sbjct: 1330 LDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLD 1389

Query: 596  GSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
            G L+ ++L LS  ER E+ KKIG+  + IL++L + + +++HF
Sbjct: 1390 GELLNRYLYLSTMERGELAKKIGTTPDIILEDLLETDRVTAHF 1432


>gi|444523674|gb|ELV13604.1| Cleavage and polyadenylation specificity factor subunit 1 [Tupaia
            chinensis]
          Length = 1469

 Score =  629 bits (1623), Expect = e-177,   Method: Compositional matrix adjust.
 Identities = 311/642 (48%), Positives = 413/642 (64%), Gaps = 62/642 (9%)

Query: 16   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGALKLRFKKL--------- 62
            +V+E+L V+LG   +RP LLV    ELL+Y+AF H     +G LK+RFKK+         
Sbjct: 871  LVKEVLLVALGSRQSRPYLLVHVDQELLLYEAFPHDSQLGQGNLKVRFKKVPHNINFREK 930

Query: 63   -KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRA 121
                        + E+    RG R+++ RYF +I GY GVF+CGP P WL +T RG LR 
Sbjct: 931  KLKPSKKKAEGGSTEEGAGARG-RVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRL 989

Query: 122  HPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPH 181
            HPM IDGP+ + APFHNVNCPRGFLYFN + ELRISVLP +LSYDAPWPVRK+PL+CT H
Sbjct: 990  HPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAH 1049

Query: 182  FLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFS 241
            ++AYH+E+K Y + TST  P T   +  GE+KE  T  RD R+I P    F + L SP S
Sbjct: 1050 YVAYHVESKVYAVATSTNAPCTRIPRMTGEEKEFETIERDDRYIHPQQEAFSIQLISPVS 1109

Query: 242  WEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFD 301
            WE IP     L EWEHV C+K VS+  E T+SGL+GY+A GT     E+VTCRGRIL+ D
Sbjct: 1110 WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMD 1169

Query: 302  IIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGI 361
            +IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH  G LV+A+GQKI++W L+ ++LTG+
Sbjct: 1170 VIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGM 1229

Query: 362  AFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAG 421
            AFIDT++YI  M+SVKN IL  D  +SI+LLRYQ E +TLSLV+RD KP +  S  +   
Sbjct: 1230 AFIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVD 1289

Query: 422  NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMY 481
            N                                        + +GF++SD+D+N++++MY
Sbjct: 1290 N----------------------------------------AQLGFLVSDRDRNLMVYMY 1309

Query: 482  QPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARS-----RFLTWYASL 536
             PEA+ES GG  L+++ DFHLG HVNTF++  C+ +   + P  +S     + +TW+A+L
Sbjct: 1310 LPEAKESFGGLLLLRRADFHLGAHVNTFWRTPCRGA--VEGPSKKSVVWENKHITWFATL 1367

Query: 537  DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDG 596
            DG +G  LP+ EK YRRLLMLQN + T   H  GLNPRAFR          N  R ++DG
Sbjct: 1368 DGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRTLQNAVRNVLDG 1427

Query: 597  SLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
             L+ ++L LS  ER E+ KKIG+  + ILD+L + + +++HF
Sbjct: 1428 ELLSRYLYLSTMERSELAKKIGTTPDIILDDLLETDRVTAHF 1469


>gi|338728513|ref|XP_003365689.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Equus caballus]
          Length = 1450

 Score =  629 bits (1622), Expect = e-177,   Method: Compositional matrix adjust.
 Identities = 312/642 (48%), Positives = 413/642 (64%), Gaps = 62/642 (9%)

Query: 16   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGALKLRFKKL--------- 62
            +V+E+L V+LG   +RP LLV    ELLIY+AF H     +G LK+RFKK+         
Sbjct: 852  LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKVPHNINFREK 911

Query: 63   -KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRA 121
                          E+    RG R+++ RYF +I GY GVF+CGP P WL +T RG LR 
Sbjct: 912  KPKPSKKKAEGGGAEEGVGARG-RVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRL 970

Query: 122  HPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPH 181
            HPM IDGP+ + APFHNVNCPRGFLYFN + ELRISVLP +LSYDAPWPVRK+PL+CT H
Sbjct: 971  HPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAH 1030

Query: 182  FLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFS 241
            ++AYH+E+K Y + TST  P T   +  GE+KE  T  RD R+I P    F + L SP S
Sbjct: 1031 YVAYHVESKVYAVATSTNTPCTRIPRMTGEEKEFETIERDDRYIHPQQEAFSIQLISPVS 1090

Query: 242  WEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFD 301
            WE IP     L EWEHV C+K VS+  E T+SGL+GY+A GT     E+VTCRGRIL+ D
Sbjct: 1091 WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMD 1150

Query: 302  IIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGI 361
            +IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH  G LV+A+GQKI++W L+ ++LTG+
Sbjct: 1151 VIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGM 1210

Query: 362  AFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAG 421
            AFIDT++YI  M+SVKN IL  D  +SI+LLRYQ E +TLSLV+RD KP +  S  +   
Sbjct: 1211 AFIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVD 1270

Query: 422  NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMY 481
            N                                        + +GF++SD+D+N++++MY
Sbjct: 1271 N----------------------------------------AQLGFLVSDRDRNLMVYMY 1290

Query: 482  QPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARS-----RFLTWYASL 536
             PEA+ES GG RL+++ DFH+G HVNTF++  C+    ++ P  +S     + +TW+A+L
Sbjct: 1291 LPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCR--GAAEGPSKKSVVWENKHITWFATL 1348

Query: 537  DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDG 596
            DG +G  LP+ EK YRRLLMLQN + T   H  GLNPRAFR          N  R ++DG
Sbjct: 1349 DGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDG 1408

Query: 597  SLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
             L+ ++L LS  ER E+ KKIG+  + ILD+L + + +++HF
Sbjct: 1409 ELLNRYLYLSTMERGELAKKIGTTPDIILDDLLETDRVTAHF 1450


>gi|338728511|ref|XP_001505047.3| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like isoform 1 [Equus caballus]
          Length = 1444

 Score =  629 bits (1622), Expect = e-177,   Method: Compositional matrix adjust.
 Identities = 312/642 (48%), Positives = 414/642 (64%), Gaps = 62/642 (9%)

Query: 16   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGALKLRFKKL--------- 62
            +V+E+L V+LG   +RP LLV    ELLIY+AF H     +G LK+RFKK+         
Sbjct: 846  LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKVPHNINFREK 905

Query: 63   -KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRA 121
                          E+    RG R+++ RYF +I GY GVF+CGP P WL +T RG LR 
Sbjct: 906  KPKPSKKKAEGGGAEEGVGARG-RVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRL 964

Query: 122  HPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPH 181
            HPM IDGP+ + APFHNVNCPRGFLYFN + ELRISVLP +LSYDAPWPVRK+PL+CT H
Sbjct: 965  HPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAH 1024

Query: 182  FLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFS 241
            ++AYH+E+K Y + TST  P T   +  GE+KE  T  RD R+I P    F + L SP S
Sbjct: 1025 YVAYHVESKVYAVATSTNTPCTRIPRMTGEEKEFETIERDDRYIHPQQEAFSIQLISPVS 1084

Query: 242  WEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFD 301
            WE IP     L EWEHV C+K VS+  E T+SGL+GY+A GT     E+VTCRGRIL+ D
Sbjct: 1085 WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMD 1144

Query: 302  IIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGI 361
            +IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH  G LV+A+GQKI++W L+ ++LTG+
Sbjct: 1145 VIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGM 1204

Query: 362  AFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAG 421
            AFIDT++YI  M+SVKN IL  D  +SI+LLRYQ E +TLSLV+RD KP +  S  +   
Sbjct: 1205 AFIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVD 1264

Query: 422  NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMY 481
            N                                        + +GF++SD+D+N++++MY
Sbjct: 1265 N----------------------------------------AQLGFLVSDRDRNLMVYMY 1284

Query: 482  QPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARS-----RFLTWYASL 536
             PEA+ES GG RL+++ DFH+G HVNTF++  C+ +  ++ P  +S     + +TW+A+L
Sbjct: 1285 LPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGA--AEGPSKKSVVWENKHITWFATL 1342

Query: 537  DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDG 596
            DG +G  LP+ EK YRRLLMLQN + T   H  GLNPRAFR          N  R ++DG
Sbjct: 1343 DGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDG 1402

Query: 597  SLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
             L+ ++L LS  ER E+ KKIG+  + ILD+L + + +++HF
Sbjct: 1403 ELLNRYLYLSTMERGELAKKIGTTPDIILDDLLETDRVTAHF 1444


>gi|417406474|gb|JAA49895.1| Putative mrna cleavage and polyadenylation factor ii complex subunit
            cft1 cpsf subunit [Desmodus rotundus]
          Length = 1444

 Score =  627 bits (1618), Expect = e-177,   Method: Compositional matrix adjust.
 Identities = 311/641 (48%), Positives = 415/641 (64%), Gaps = 60/641 (9%)

Query: 16   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGALKLRFKKL--------- 62
            +V+E+L V+LG   +RP LLV    ELLIY+AF H     +G LK+RFKK+         
Sbjct: 846  LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFAHDSQLGQGNLKVRFKKVPHNINFREK 905

Query: 63   KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAH 122
            K      ++     + G     R+++ RYF +I GY GVF+CGP P WL +T RG LR H
Sbjct: 906  KPKPSKKKADGGGAEEGAGARGRVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRLH 965

Query: 123  PMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHF 182
            PM IDGP+ + APFHNVNCPRGFLYFN + ELRISVLP +LSYDAPWPVRK+PL+CT H+
Sbjct: 966  PMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHY 1025

Query: 183  LAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSW 242
            +AYH+E+K Y + TST  P T   +  GE+KE  T  RD R+I P    F + L SP SW
Sbjct: 1026 VAYHVESKVYAVATSTNTPCTRIPRMTGEEKEFETIERDDRYIHPQQEAFSIQLISPVSW 1085

Query: 243  EEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDI 302
            E IP     L EWEHV C+K VS+  E T+SGL+GY+A GT     E+VTCRGRIL+ D+
Sbjct: 1086 EAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDV 1145

Query: 303  IEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIA 362
            IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH  G LV+A+GQKI++W L+ ++LTG+A
Sbjct: 1146 IEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMA 1205

Query: 363  FIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGN 422
            FIDT++YI  M+SVKN IL  D  +SI+LLRYQ + +TLSLV+RD KP +  S  +   N
Sbjct: 1206 FIDTQLYIHQMISVKNFILAADVMKSISLLRYQEDSKTLSLVSRDAKPLEVYSVDFMVDN 1265

Query: 423  PSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQ 482
                                                    + +GF++SD+D+N++++MY 
Sbjct: 1266 ----------------------------------------AQLGFLVSDRDRNLMVYMYL 1285

Query: 483  PEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARS-----RFLTWYASLD 537
            PEA+ES GG RL+++ DFH+G HVNTF++  C+ +  ++ P  +S     + +TW+A+LD
Sbjct: 1286 PEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGA--AEGPSKKSVVWENKHITWFATLD 1343

Query: 538  GALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGS 597
            G +G  LP+ EK YRRLLMLQN + T   H  GLNPRAFR          N  R I+DG 
Sbjct: 1344 GGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRTLQNAVRNILDGE 1403

Query: 598  LVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
            L+ ++L LS  ER E+ KKIG+  + ILD+L + + +++HF
Sbjct: 1404 LLNRYLYLSTMERSELAKKIGTTPDIILDDLLETDRVTAHF 1444


>gi|392306997|ref|NP_001254722.1| cleavage and polyadenylation specificity factor subunit 1 [Macaca
            mulatta]
 gi|380812168|gb|AFE77959.1| cleavage and polyadenylation specificity factor subunit 1 [Macaca
            mulatta]
 gi|383417835|gb|AFH32131.1| cleavage and polyadenylation specificity factor subunit 1 [Macaca
            mulatta]
          Length = 1442

 Score =  627 bits (1617), Expect = e-177,   Method: Compositional matrix adjust.
 Identities = 310/640 (48%), Positives = 412/640 (64%), Gaps = 58/640 (9%)

Query: 16   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGALKLRFKKL--------- 62
            +V+E+L V+LG   +RP LLV    ELLIY+AF H     +G LK+RFKK+         
Sbjct: 844  LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKVPHNINFREK 903

Query: 63   -KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRA 121
                          E+    RG R+++ RYF +I GY GVF+CGP P WL +T RG LR 
Sbjct: 904  KPKPSKKKAEGGGTEEGAGARG-RVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRL 962

Query: 122  HPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPH 181
            HPM IDGPV + APFHNVNCPRGFLYFN + ELRISVLP +LSYDAPWPVRK+PL+CT H
Sbjct: 963  HPMAIDGPVDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAH 1022

Query: 182  FLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFS 241
            ++AYH+E+K Y + TST  P     +  GE+KE  T  RD R+I P    F + L SP S
Sbjct: 1023 YVAYHVESKVYAVATSTNTPCARIPRMTGEEKEFETIERDERYIHPQQEAFSIQLISPVS 1082

Query: 242  WEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFD 301
            WE IP     L EWEHV C+K VS+  E T+SGL+GY+A GT     E+VTCRGRIL+ D
Sbjct: 1083 WEAIPNARIELQEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMD 1142

Query: 302  IIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGI 361
            +IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH  G LV+A+GQKI++W L+ ++LTG+
Sbjct: 1143 VIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGM 1202

Query: 362  AFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAG 421
            AFIDT++YI  M+SVKN IL  D  +SI+LLRYQ E +TLSLV+RD KP +  S  +   
Sbjct: 1203 AFIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVD 1262

Query: 422  NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMY 481
            N                                        + +GF++SD+D+N++++MY
Sbjct: 1263 N----------------------------------------AQLGFLVSDRDRNLMVYMY 1282

Query: 482  QPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGA---RSRFLTWYASLDG 538
             PEA+ES GG RL+++ DFH+G HVNTF++  C+ ++   +  +    ++ +TW+A+LDG
Sbjct: 1283 LPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGATEGLSKKSVVWENKHITWFATLDG 1342

Query: 539  ALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSL 598
             +G  LP+ EK YRRLLMLQN + T   H  GLNPRAFR          N  R ++DG L
Sbjct: 1343 GIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRTLQNAVRNVLDGEL 1402

Query: 599  VWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
            + ++L LS  ER E+ KKIG+  + ILD+L + + +++HF
Sbjct: 1403 LNRYLYLSTMERSELAKKIGTTPDIILDDLLETDRVTAHF 1442


>gi|402879380|ref|XP_003903320.1| PREDICTED: LOW QUALITY PROTEIN: cleavage and polyadenylation
            specificity factor subunit 1 [Papio anubis]
          Length = 1389

 Score =  626 bits (1615), Expect = e-176,   Method: Compositional matrix adjust.
 Identities = 309/639 (48%), Positives = 414/639 (64%), Gaps = 56/639 (8%)

Query: 16   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGALKLRFKKL--------- 62
            +V+E+L V+LG   +RP LLV    ELLIY+AF H     +G LK+RFKK+         
Sbjct: 791  LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKVPHNINFREK 850

Query: 63   KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAH 122
            K      +++    + G     R+++ RYF +I GY GVF+CGP P WL +T RG LR H
Sbjct: 851  KPKPSKKKAEGGGTEEGAGXRGRVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRLH 910

Query: 123  PMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHF 182
            PM IDGPV + APFHNVNCPRGFLYFN + ELRISVLP +LSYDAPWPVRK+PL+CT H+
Sbjct: 911  PMAIDGPVDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHY 970

Query: 183  LAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSW 242
            +AYH+E+K Y + TST  P     +  GE+KE  T  RD R+I P    F + L SP SW
Sbjct: 971  VAYHVESKVYAVATSTNTPCARIPRMTGEEKEFETIERDERYIHPQQEAFSIQLISPVSW 1030

Query: 243  EEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDI 302
            E IP     L EWEHV C+K VS+  E T+SGL+GY+A GT     E+VTCRGRIL+ D+
Sbjct: 1031 EAIPNARIELQEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDV 1090

Query: 303  IEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIA 362
            IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH  G LV+A+GQKI++W L+ ++LTG+A
Sbjct: 1091 IEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMA 1150

Query: 363  FIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGN 422
            FIDT++YI  M+SVKN IL  D  +SI+LLRYQ E +TLSLV+RD KP +  S  +   N
Sbjct: 1151 FIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDN 1210

Query: 423  PSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQ 482
                                                    + +GF++SD+D+N++++MY 
Sbjct: 1211 ----------------------------------------AQLGFLVSDRDRNLMVYMYL 1230

Query: 483  PEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGA---RSRFLTWYASLDGA 539
            PEA+ES GG RL+++ DFH+G HVNTF++  C+ ++   +  +    ++ +TW+A+LDG 
Sbjct: 1231 PEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGATEGLSKKSVVWENKHITWFATLDGG 1290

Query: 540  LGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLV 599
            +G  LP+ EK YRRLLMLQN + T   H  GLNPRAFR          N  R ++DG L+
Sbjct: 1291 IGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELL 1350

Query: 600  WKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
             ++L LS  ER E+ KKIG+  + ILD+L + + +++HF
Sbjct: 1351 NRYLYLSTMERSELAKKIGTTPDIILDDLLETDRVTAHF 1389


>gi|334326317|ref|XP_001364707.2| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Monodelphis domestica]
          Length = 1449

 Score =  626 bits (1615), Expect = e-176,   Method: Compositional matrix adjust.
 Identities = 312/642 (48%), Positives = 414/642 (64%), Gaps = 62/642 (9%)

Query: 16   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGALKLRFKKL--------- 62
            +V+E+L V+LG    RP LLV    ELLIY+AF H     +  LK+RFKK+         
Sbjct: 851  LVKEVLLVALGNRQTRPYLLVHVDQELLIYEAFAHDSQLGQSNLKVRFKKVPHNINFREK 910

Query: 63   -KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRA 121
                          E+    RG R+++ RYF +I GY GVF+CGP P WL +T RG LR 
Sbjct: 911  KPKPSKKKPEGGGTEEGAGARG-RVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRL 969

Query: 122  HPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPH 181
            HPM IDGP+ + APFHNVNCPRGFLYFN + ELRISVLP +LSYDAPWPVRK+PL+CT H
Sbjct: 970  HPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAH 1029

Query: 182  FLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFS 241
            ++AYH+E+K Y + TST    T   +  GE+KE  T  RD R+I PL   F + L SP S
Sbjct: 1030 YVAYHVESKVYAVATSTNALCTRIPRMTGEEKEFETIERDERYIHPLQEAFSIQLISPVS 1089

Query: 242  WEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFD 301
            WE IP     L EWEHV C+K VS+  E T+SGL+GY+A GT     E+VTCRGRIL+ D
Sbjct: 1090 WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMD 1149

Query: 302  IIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGI 361
            +IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH  G LV+A+GQKI++W L+ ++LTG+
Sbjct: 1150 VIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGM 1209

Query: 362  AFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAG 421
            AFIDT++YI  M+SVKN IL  D  +SI+LLRYQ E +TLSLV+RD KP +  S      
Sbjct: 1210 AFIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSV----- 1264

Query: 422  NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMY 481
                                               D + + + +GF++SD+D+N++++MY
Sbjct: 1265 -----------------------------------DFMVDSAQLGFLVSDRDRNLMVYMY 1289

Query: 482  QPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARS-----RFLTWYASL 536
             PEA+ES GG RL+++ DFH+G HVNTF++  C+ +  ++ P  +S     + +TW+A+L
Sbjct: 1290 LPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGA--AEGPSKKSIVWENKHITWFATL 1347

Query: 537  DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDG 596
            DG +G  LP+ EK YRRLLMLQN + T   H  GLNPRAFR          N  R ++DG
Sbjct: 1348 DGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDG 1407

Query: 597  SLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
             L+ ++L LS  ER E+ KKIG+  + ILD+L +I+ +++HF
Sbjct: 1408 ELLNRYLYLSTMERGELAKKIGTTPDIILDDLLEIDRVTAHF 1449


>gi|405977622|gb|EKC42064.1| Cleavage and polyadenylation specificity factor subunit 1
            [Crassostrea gigas]
          Length = 1369

 Score =  626 bits (1614), Expect = e-176,   Method: Compositional matrix adjust.
 Identities = 303/634 (47%), Positives = 422/634 (66%), Gaps = 52/634 (8%)

Query: 17   VQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGA----LKLRFKKLK----VLFVS 68
            ++ELL V LG   +RP LL R + +L IY+AF +P+ +    LKLRFKK++    +    
Sbjct: 776  LKELLMVGLGYKDSRPHLLARVEDDLYIYEAFSYPQSSIDNHLKLRFKKIQHDLILREKR 835

Query: 69   DRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDG 128
             +SK+ + +       ++ +MRYF ++AGY GVF+CG +P W+F+TSRG LR HPM IDG
Sbjct: 836  SKSKKKDPEEFQKEEKKVGKMRYFKDVAGYSGVFVCGAYPHWIFVTSRGSLRIHPMGIDG 895

Query: 129  PVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLE 188
            PV   + FHN+NCP GFLYFN   ELRISVLPTHL+YDAPWPVRKVPL+CTPHF+AYH E
Sbjct: 896  PVWCFSEFHNINCPHGFLYFNKMGELRISVLPTHLTYDAPWPVRKVPLRCTPHFVAYHFE 955

Query: 189  TKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQT 248
             K Y +VTST E      K   ED+E  T  +D RFI P + +F + L+SP SWE +P T
Sbjct: 956  NKIYAVVTSTPEICNKLPKTTTEDREWDTIEKDERFIYPTIPRFTLQLYSPTSWEVVPNT 1015

Query: 249  NFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPE 308
                 EWEHV+ +K + +  E TLSG + YI +GTN +  E+VT RGR+++ DIIEVVPE
Sbjct: 1016 KIECEEWEHVVSMKTIRLRSEETLSGFKSYIVMGTNLSLGEEVTSRGRVIIADIIEVVPE 1075

Query: 309  PGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEV 368
            PG PLTK+KIK +Y KEQKGPVTA+  + G L+TA+GQK+YIWQLKDNDL G+AFIDT +
Sbjct: 1076 PGMPLTKHKIKTLYEKEQKGPVTALADINGLLITAIGQKLYIWQLKDNDLMGVAFIDTHI 1135

Query: 369  YIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGII 428
            YI ++V++K++IL GD  +S+++ +YQ E++ LS+V+RD +P +  +  +   N      
Sbjct: 1136 YIHTLVTIKHIILAGDILKSVSVYQYQEEHKVLSIVSRDPRPLEVYTADFLIDN------ 1189

Query: 429  DGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARES 488
                                              + +  ++SD+ KN+V++ YQPEARES
Sbjct: 1190 ----------------------------------TQLCCLVSDRMKNLVVYSYQPEARES 1215

Query: 489  NGGHRLIKKTDFHLGQHVNTFFKIRCK---PSSISDAPGA-RSRFLTWYASLDGALGFFL 544
            +GG RLI+K DF+ G +V++ F++RCK   PSS     GA   R +T++A+LDG+LGF L
Sbjct: 1216 HGGQRLIRKADFNAGSNVSSMFRVRCKLYDPSSDKRMTGAPEKRHITYFATLDGSLGFVL 1275

Query: 545  PLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQ 604
            PL EK YRRL MLQN +VTH  H  GLNPR++R   G      NP + I+DG L+WK+  
Sbjct: 1276 PLSEKVYRRLFMLQNALVTHIPHVAGLNPRSYRHVIGTFPELRNPQKNILDGELLWKYTN 1335

Query: 605  LSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
            LS+ E++EI K++G+ ++ I+D+L +I+ L++HF
Sbjct: 1336 LSIMEKIEIAKRLGTSNDQIMDDLMEIDRLTAHF 1369


>gi|157110889|ref|XP_001651294.1| cleavage and polyadenylation specificity factor cpsf [Aedes aegypti]
 gi|108883895|gb|EAT48120.1| AAEL000832-PA [Aedes aegypti]
          Length = 1417

 Score =  625 bits (1613), Expect = e-176,   Method: Compositional matrix adjust.
 Identities = 300/637 (47%), Positives = 420/637 (65%), Gaps = 56/637 (8%)

Query: 18   QELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGALKLRFKKLK------VLFVSDRS 71
            +E+L V+LG HG RP+L VR +++LL+Y+ +R+ KG LKLRF+++       +  ++ R 
Sbjct: 821  KEILMVALGHHGTRPMLFVRLENDLLVYRVYRYSKGHLKLRFRRVPSGVTGPIFKIAPRQ 880

Query: 72   KRANEQPGLPRGV--------RISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHP 123
                +Q G              IS +RYF+N+ GY GV +CG  P  + LTSRGELRAH 
Sbjct: 881  SAPTDQEGEKPDEHSTKIMYENISMIRYFNNVNGYNGVAVCGEKPYIMLLTSRGELRAHR 940

Query: 124  MTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFL 183
            +     +   APF+NVNCP GFLYF+ + EL+I+V P +LSYD+ WPVRK+PL+ +P  +
Sbjct: 941  LYAKTIMKGFAPFNNVNCPNGFLYFDEQYELKIAVFPGYLSYDSIWPVRKIPLRSSPKQI 1000

Query: 184  AYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWE 243
             YH E K YC+V    E    YY+FNGEDKEL  + +  RF+ P+  +F V L +P +WE
Sbjct: 1001 VYHKENKVYCVVMDAEEVCNKYYRFNGEDKELTEENKGERFLYPMAHKFSVVLVTPSAWE 1060

Query: 244  EIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDII 303
             IP+T+  L EWEHV+ LKNVS+ YEG  SG + YIA+GTN+NYSED+T RGR+LL+DII
Sbjct: 1061 IIPETSINLDEWEHVIALKNVSLSYEGARSGFKEYIAVGTNFNYSEDITSRGRLLLYDII 1120

Query: 304  EVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAF 363
            EVVPEPG+PLT+ K K +  KEQKGPV+AI HV+GFLV AVGQK+Y+WQLKD+DL G+AF
Sbjct: 1121 EVVPEPGKPLTRYKFKEVIVKEQKGPVSAITHVSGFLVGAVGQKVYLWQLKDDDLVGVAF 1180

Query: 364  IDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNP 423
            IDT +++  +VS+K+LILV D  +S++LLR+Q +YRTLSLV+RDY+P       Y   N 
Sbjct: 1181 IDTNIFVHQLVSIKSLILVADVYKSVSLLRFQEDYRTLSLVSRDYQPLNVFQIEYVVDN- 1239

Query: 424  SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQP 483
                                           HN        +GF++SD+  N++ +MYQP
Sbjct: 1240 -------------------------------HN--------LGFLVSDEQCNIITYMYQP 1260

Query: 484  EARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGA--RSRFLTWYASLDGALG 541
            E+RES GG RL++K D+H+GQ +N+ F+++C    +     +    +  T++A+LDG +G
Sbjct: 1261 ESRESFGGQRLLRKCDYHVGQKINSMFRVQCDFHEMDYKRNSNYECKHTTYFATLDGGIG 1320

Query: 542  FFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWK 601
            + LPLPEK YRRL MLQNV++TH+ H  GLNP+AFRT K       NP+R ++DG L+W 
Sbjct: 1321 YVLPLPEKTYRRLFMLQNVLMTHSPHLCGLNPKAFRTIKTVKKLPINPARCVVDGDLIWT 1380

Query: 602  FLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
            FL L   E+LE+ KKIG++ +DI  +L +IE+++  F
Sbjct: 1381 FLTLPANEKLEVAKKIGTRIDDICADLMEIESVTHVF 1417


>gi|345779232|ref|XP_532356.3| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
            [Canis lupus familiaris]
          Length = 1460

 Score =  625 bits (1612), Expect = e-176,   Method: Compositional matrix adjust.
 Identities = 311/641 (48%), Positives = 415/641 (64%), Gaps = 60/641 (9%)

Query: 16   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGALKLRFKKL--------- 62
            +V+E+L V+LG   +RP LLV    ELLIY+AF H     +G LK+RFKK+         
Sbjct: 862  LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKVPHNINFREK 921

Query: 63   KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAH 122
            K      +++    + G     R+++ RYF +I GY GVF+CGP P WL +T RG LR H
Sbjct: 922  KPKPSKKKAEGGGAEEGAGARGRVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRLH 981

Query: 123  PMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHF 182
            PM IDGP+ + APFHNVNCPRGFLYFN + ELRISVLP +LSYDAPWPVRK+PL+CT H+
Sbjct: 982  PMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHY 1041

Query: 183  LAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSW 242
            +AYH+E+K Y + TST  P T   +  GE+KE  T  RD R+I P    F + L SP SW
Sbjct: 1042 VAYHVESKVYAVATSTNMPCTRIPRMTGEEKEFETIERDDRYIHPQQEAFSIQLISPVSW 1101

Query: 243  EEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDI 302
            E IP     L EWEHV C+K VS+  E T+SGL+GY+A GT     E+VTCRGRIL+ D+
Sbjct: 1102 EAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDV 1161

Query: 303  IEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIA 362
            IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH  G LV+A+GQKI++W L+ ++LTG+A
Sbjct: 1162 IEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMA 1221

Query: 363  FIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGN 422
            FIDT++YI  M+SVKN IL  D  +SI+LLRYQ E +TLSLV+RD KP +  S  +   N
Sbjct: 1222 FIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDN 1281

Query: 423  PSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQ 482
                                                    + +GF++SD+D+N++++MY 
Sbjct: 1282 ----------------------------------------AQLGFLVSDRDRNLMVYMYL 1301

Query: 483  PEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARS-----RFLTWYASLD 537
            PEA+ES GG RL+++ DFH+G HVNTF++  C+    ++ P  +S     + +TW+A+LD
Sbjct: 1302 PEAKESFGGMRLLRRADFHVGAHVNTFWRTPCR--GAAEGPSKKSVVWENKHITWFATLD 1359

Query: 538  GALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGS 597
            G +G  LP+ EK YRRLLMLQN + T   H  GLNPRAFR          N  R ++DG 
Sbjct: 1360 GGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGE 1419

Query: 598  LVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
            L+ ++L LS  ER E+ KKIG+  + ILD+L + + +++HF
Sbjct: 1420 LLNRYLYLSTMERGELAKKIGTTPDIILDDLLETDRVTAHF 1460


>gi|119602512|gb|EAW82106.1| cleavage and polyadenylation specific factor 1, 160kDa, isoform CRA_a
            [Homo sapiens]
 gi|119602513|gb|EAW82107.1| cleavage and polyadenylation specific factor 1, 160kDa, isoform CRA_a
            [Homo sapiens]
 gi|119602514|gb|EAW82108.1| cleavage and polyadenylation specific factor 1, 160kDa, isoform CRA_a
            [Homo sapiens]
          Length = 1365

 Score =  624 bits (1609), Expect = e-176,   Method: Compositional matrix adjust.
 Identities = 309/639 (48%), Positives = 414/639 (64%), Gaps = 56/639 (8%)

Query: 16   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGALKLRFKKL--------- 62
            +V+E+L V+LG   +RP LLV    ELLIY+AF H     +G LK+RFKK+         
Sbjct: 767  LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKVPHNINFREK 826

Query: 63   KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAH 122
            K      +++    + G     R+++ RYF +I GY GVF+CGP P WL +T RG LR H
Sbjct: 827  KPKPSKKKAEGGGAEEGAGARGRVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRLH 886

Query: 123  PMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHF 182
            PM IDGPV + APFHNVNCPRGFLYFN + ELRISVLP +LSYDAPWPVRK+PL+CT H+
Sbjct: 887  PMAIDGPVDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHY 946

Query: 183  LAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSW 242
            +AYH+E+K Y + TST  P     +  GE+KE  T  RD R+I P    F + L SP SW
Sbjct: 947  VAYHVESKVYAVATSTNTPCARIPRMTGEEKEFETIERDERYIHPQQEAFSIQLISPVSW 1006

Query: 243  EEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDI 302
            E IP     L EWEHV C+K VS+  E T+SGL+GY+A GT     E+VTCRGRIL+ D+
Sbjct: 1007 EAIPNARIELQEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDV 1066

Query: 303  IEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIA 362
            IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH  G LV+A+GQKI++W L+ ++LTG+A
Sbjct: 1067 IEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMA 1126

Query: 363  FIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGN 422
            FIDT++YI  M+SVKN IL  D  +SI+LLRYQ E +TLSLV+RD KP +  S  +   N
Sbjct: 1127 FIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDN 1186

Query: 423  PSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQ 482
                                                    + +GF++SD+D+N++++MY 
Sbjct: 1187 ----------------------------------------AQLGFLVSDRDRNLMVYMYL 1206

Query: 483  PEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGA---RSRFLTWYASLDGA 539
            PEA+ES GG RL+++ DFH+G HVNTF++  C+ ++   +  +    ++ +TW+A+LDG 
Sbjct: 1207 PEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGATEGLSKKSVVWENKHITWFATLDGG 1266

Query: 540  LGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLV 599
            +G  LP+ EK YRRLLMLQN + T   H  GLNPRAFR          N  R ++DG L+
Sbjct: 1267 IGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELL 1326

Query: 600  WKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
             ++L LS  ER E+ KKIG+  + ILD+L + + +++HF
Sbjct: 1327 NRYLYLSTMERSELAKKIGTTPDIILDDLLETDRVTAHF 1365


>gi|397497327|ref|XP_003819464.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
            [Pan paniscus]
 gi|410336497|gb|JAA37195.1| cleavage and polyadenylation specific factor 1, 160kDa [Pan
            troglodytes]
          Length = 1442

 Score =  624 bits (1609), Expect = e-176,   Method: Compositional matrix adjust.
 Identities = 309/639 (48%), Positives = 414/639 (64%), Gaps = 56/639 (8%)

Query: 16   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGALKLRFKKL--------- 62
            +V+E+L V+LG   +RP LLV    ELLIY+AF H     +G LK+RFKK+         
Sbjct: 844  LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKVPHNINFREK 903

Query: 63   KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAH 122
            K      +++    + G     R+++ RYF +I GY GVF+CGP P WL +T RG LR H
Sbjct: 904  KPKPSKKKAEGGGAEEGAGARGRVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRLH 963

Query: 123  PMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHF 182
            PM IDGPV + APFHNVNCPRGFLYFN + ELRISVLP +LSYDAPWPVRK+PL+CT H+
Sbjct: 964  PMAIDGPVDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHY 1023

Query: 183  LAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSW 242
            +AYH+E+K Y + TST  P     +  GE+KE  T  RD R+I P    F + L SP SW
Sbjct: 1024 VAYHVESKVYAVATSTNTPCARIPRMTGEEKEFETIERDERYIHPQQEAFSIQLISPVSW 1083

Query: 243  EEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDI 302
            E IP     L EWEHV C+K VS+  E T+SGL+GY+A GT     E+VTCRGRIL+ D+
Sbjct: 1084 EAIPNARIELQEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDV 1143

Query: 303  IEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIA 362
            IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH  G LV+A+GQKI++W L+ ++LTG+A
Sbjct: 1144 IEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMA 1203

Query: 363  FIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGN 422
            FIDT++YI  M+SVKN IL  D  +SI+LLRYQ E +TLSLV+RD KP +  S  +   N
Sbjct: 1204 FIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDN 1263

Query: 423  PSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQ 482
                                                    + +GF++SD+D+N++++MY 
Sbjct: 1264 ----------------------------------------AQLGFLVSDRDRNLMVYMYL 1283

Query: 483  PEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGA---RSRFLTWYASLDGA 539
            PEA+ES GG RL+++ DFH+G HVNTF++  C+ ++   +  +    ++ +TW+A+LDG 
Sbjct: 1284 PEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGATEGLSKKSVVWENKHITWFATLDGG 1343

Query: 540  LGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLV 599
            +G  LP+ EK YRRLLMLQN + T   H  GLNPRAFR          N  R ++DG L+
Sbjct: 1344 IGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELL 1403

Query: 600  WKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
             ++L LS  ER E+ KKIG+  + ILD+L + + +++HF
Sbjct: 1404 NRYLYLSTMERSELAKKIGTTPDIILDDLLETDRVTAHF 1442


>gi|1045574|gb|AAC50293.1| cleavage and polyadenylation specificity factor [Homo sapiens]
          Length = 1442

 Score =  624 bits (1608), Expect = e-176,   Method: Compositional matrix adjust.
 Identities = 309/638 (48%), Positives = 411/638 (64%), Gaps = 55/638 (8%)

Query: 16   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGALKLRFKKL--------- 62
            +V+E+L V+LG   +RP LLV    ELLIY+AF H     +G LK+RFKK+         
Sbjct: 845  LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKVPHNINFREK 904

Query: 63   KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAH 122
            K      +++    + G     R+++ RYF +I GY GVF+CGP P WL +T RG LR H
Sbjct: 905  KPKPSKKKAEGGGAEEGAGARGRVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRLH 964

Query: 123  PMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHF 182
            PM IDGPV + APFHNVNCPRGFLYFN + ELRISVLP +LSYDAPWPVRK+PL+CT H+
Sbjct: 965  PMAIDGPVDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHY 1024

Query: 183  LAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSW 242
            +AYH+E+K Y + TST  P     +  GE+KE  T  RD R+I P    F + L SP SW
Sbjct: 1025 VAYHVESKVYAVATSTNTPCARIPRMTGEEKEFETIERDERYIHPQQEAFSIQLISPVSW 1084

Query: 243  EEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDI 302
            E IP     L EWEHV C+K VS+  E T+SGL+GY+A GT     E+VTCRGRIL+ D+
Sbjct: 1085 EAIPNARIELQEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDV 1144

Query: 303  IEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIA 362
            IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH  G LV+A+GQKI++W L+ ++LTG+A
Sbjct: 1145 IEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMA 1204

Query: 363  FIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGN 422
            FIDT++YI  M+SVKN IL  D  +SI+LLRYQ E +TLSLV+RD KP +  S  +   N
Sbjct: 1205 FIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDN 1264

Query: 423  PSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQ 482
                                                    + +GF++SD+D+N++++MY 
Sbjct: 1265 ----------------------------------------AQLGFLVSDRDRNLMVYMYL 1284

Query: 483  PEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGA--RSRFLTWYASLDGAL 540
            PEA+ES GG RL+++ DFH+G HVNTF++  C+ +           ++ +TW+A+LDG +
Sbjct: 1285 PEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRATEGLSKKSVVWENKHITWFATLDGGI 1344

Query: 541  GFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVW 600
            G  LP+ EK YRRLLMLQN + T   H  GLNPRAFR          N  R ++DG L+ 
Sbjct: 1345 GLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELLN 1404

Query: 601  KFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
            ++L LS  ER E+ KKIG+  + ILD+L + + +++HF
Sbjct: 1405 RYLYLSTMERSELAKKIGTTPDIILDDLLETDRVTAHF 1442


>gi|56676371|ref|NP_037423.2| cleavage and polyadenylation specificity factor subunit 1 [Homo
            sapiens]
 gi|23503048|sp|Q10570.2|CPSF1_HUMAN RecName: Full=Cleavage and polyadenylation specificity factor subunit
            1; AltName: Full=Cleavage and polyadenylation specificity
            factor 160 kDa subunit; Short=CPSF 160 kDa subunit
 gi|16878041|gb|AAH17232.1| Cleavage and polyadenylation specific factor 1, 160kDa [Homo sapiens]
 gi|119602516|gb|EAW82110.1| cleavage and polyadenylation specific factor 1, 160kDa, isoform CRA_c
            [Homo sapiens]
 gi|123993607|gb|ABM84405.1| cleavage and polyadenylation specific factor 1, 160kDa [synthetic
            construct]
 gi|123999626|gb|ABM87355.1| cleavage and polyadenylation specific factor 1, 160kDa [synthetic
            construct]
 gi|307684758|dbj|BAJ20419.1| cleavage and polyadenylation specific factor 1, 160kDa [synthetic
            construct]
          Length = 1443

 Score =  623 bits (1607), Expect = e-176,   Method: Compositional matrix adjust.
 Identities = 309/639 (48%), Positives = 414/639 (64%), Gaps = 56/639 (8%)

Query: 16   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGALKLRFKKL--------- 62
            +V+E+L V+LG   +RP LLV    ELLIY+AF H     +G LK+RFKK+         
Sbjct: 845  LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKVPHNINFREK 904

Query: 63   KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAH 122
            K      +++    + G     R+++ RYF +I GY GVF+CGP P WL +T RG LR H
Sbjct: 905  KPKPSKKKAEGGGAEEGAGARGRVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRLH 964

Query: 123  PMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHF 182
            PM IDGPV + APFHNVNCPRGFLYFN + ELRISVLP +LSYDAPWPVRK+PL+CT H+
Sbjct: 965  PMAIDGPVDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHY 1024

Query: 183  LAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSW 242
            +AYH+E+K Y + TST  P     +  GE+KE  T  RD R+I P    F + L SP SW
Sbjct: 1025 VAYHVESKVYAVATSTNTPCARIPRMTGEEKEFETIERDERYIHPQQEAFSIQLISPVSW 1084

Query: 243  EEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDI 302
            E IP     L EWEHV C+K VS+  E T+SGL+GY+A GT     E+VTCRGRIL+ D+
Sbjct: 1085 EAIPNARIELQEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDV 1144

Query: 303  IEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIA 362
            IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH  G LV+A+GQKI++W L+ ++LTG+A
Sbjct: 1145 IEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMA 1204

Query: 363  FIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGN 422
            FIDT++YI  M+SVKN IL  D  +SI+LLRYQ E +TLSLV+RD KP +  S  +   N
Sbjct: 1205 FIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDN 1264

Query: 423  PSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQ 482
                                                    + +GF++SD+D+N++++MY 
Sbjct: 1265 ----------------------------------------AQLGFLVSDRDRNLMVYMYL 1284

Query: 483  PEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGA---RSRFLTWYASLDGA 539
            PEA+ES GG RL+++ DFH+G HVNTF++  C+ ++   +  +    ++ +TW+A+LDG 
Sbjct: 1285 PEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGATEGLSKKSVVWENKHITWFATLDGG 1344

Query: 540  LGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLV 599
            +G  LP+ EK YRRLLMLQN + T   H  GLNPRAFR          N  R ++DG L+
Sbjct: 1345 IGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELL 1404

Query: 600  WKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
             ++L LS  ER E+ KKIG+  + ILD+L + + +++HF
Sbjct: 1405 NRYLYLSTMERSELAKKIGTTPDIILDDLLETDRVTAHF 1443


>gi|410911304|ref|XP_003969130.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Takifugu rubripes]
          Length = 1444

 Score =  623 bits (1607), Expect = e-176,   Method: Compositional matrix adjust.
 Identities = 312/641 (48%), Positives = 425/641 (66%), Gaps = 59/641 (9%)

Query: 16   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAF----RHPKGALKLRFKKL--------- 62
            +V+E+  VSLG + +RP LLV    ELLIY+AF    + P+  LK+RFKK+         
Sbjct: 845  LVKEVTLVSLGYNHSRPYLLVHVDQELLIYEAFPYDQQQPQNNLKVRFKKVPHNINFREK 904

Query: 63   --KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELR 120
              K+         A E     RG RIS+ RYF +I+GY GVF+CGP P W+ +TSRG LR
Sbjct: 905  KSKLRKDKKAEGTAAEDSVAARG-RISRFRYFEDISGYSGVFICGPSPHWMLVTSRGALR 963

Query: 121  AHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTP 180
             HPM+IDGP+ + +PFHN+NCP+GFLYFN + ELRISVLPT+LSYDAPWPVRK+PL+CT 
Sbjct: 964  LHPMSIDGPIESFSPFHNINCPKGFLYFNKQGELRISVLPTYLSYDAPWPVRKIPLRCTV 1023

Query: 181  HFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPF 240
            H+++YH+E+K Y + TS  E  T   +  GE+KE  T  RD R+I P   +F + L SP 
Sbjct: 1024 HYVSYHVESKVYAVCTSLKELCTRIPRMTGEEKEYETIERDERYINPQQDKFSIQLISPV 1083

Query: 241  SWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLF 300
            SWE IP T   L EWE+V C+K V++  + T+SGL+GYIA GT     E+VTCRGRIL+ 
Sbjct: 1084 SWEAIPNTRIDLEEWEYVTCMKTVALRSQETVSGLKGYIAAGTCLMQGEEVTCRGRILIL 1143

Query: 301  DIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTG 360
            D+IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH  G+LV+A+GQKI++W LKDNDLTG
Sbjct: 1144 DVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGYLVSAIGQKIFLWVLKDNDLTG 1203

Query: 361  IAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYA 420
            +AFIDT+++I  M+S+KN IL  D  +S++LLRYQ E +TLSLV+RD KP +  S  +  
Sbjct: 1204 MAFIDTQLHIHQMMSIKNFILAADLMKSVSLLRYQEESKTLSLVSRDAKPLEVYSIEFMV 1263

Query: 421  GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFM 480
             N                                        + +GF++SD+DKN+ ++M
Sbjct: 1264 DN----------------------------------------NQLGFLVSDRDKNLYVYM 1283

Query: 481  YQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARS---RFLTWYASLD 537
            Y PEA+ES GG RL+++ DF+ G ++NTF+++ C+ +  + +  A +   + +TW+A+LD
Sbjct: 1284 YLPEAKESFGGMRLLRRADFNAGANINTFWRMPCRGALEAGSRKAMTWDNKHITWFATLD 1343

Query: 538  GALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGS 597
            G +G  LP+ EK YRRLLMLQN + T  SH  GLNP+AFR          NP + I+DG 
Sbjct: 1344 GGVGLLLPMQEKTYRRLLMLQNALTTMLSHHAGLNPKAFRMLHCDRRSLQNPVKNILDGE 1403

Query: 598  LVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
            L+ K+L LS+ ER E+ KKIG+  + ILD+L DI+ +++HF
Sbjct: 1404 LLNKYLYLSMMERSELAKKIGTTQDIILDDLLDIDRVTAHF 1444


>gi|426361048|ref|XP_004047737.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
            [Gorilla gorilla gorilla]
          Length = 1440

 Score =  623 bits (1606), Expect = e-175,   Method: Compositional matrix adjust.
 Identities = 309/639 (48%), Positives = 414/639 (64%), Gaps = 56/639 (8%)

Query: 16   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGALKLRFKKL--------- 62
            +V+E+L V+LG   +RP LLV    ELLIY+AF H     +G LK+RFKK+         
Sbjct: 842  LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKVPHNINFREK 901

Query: 63   KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAH 122
            K      +++    + G     R+++ RYF +I GY GVF+CGP P WL +T RG LR H
Sbjct: 902  KPKPSKKKAEGGGAEEGAGARGRVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRLH 961

Query: 123  PMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHF 182
            PM IDGPV + APFHNVNCPRGFLYFN + ELRISVLP +LSYDAPWPVRK+PL+CT H+
Sbjct: 962  PMAIDGPVDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHY 1021

Query: 183  LAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSW 242
            +AYH+E+K Y + TST  P     +  GE+KE  T  RD R+I P    F + L SP SW
Sbjct: 1022 VAYHVESKVYAVATSTNTPCARIPRMTGEEKEFETIERDERYIHPQQEAFSIQLISPVSW 1081

Query: 243  EEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDI 302
            E IP     L EWEHV C+K VS+  E T+SGL+GY+A GT     E+VTCRGRIL+ D+
Sbjct: 1082 EAIPNARIELQEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDV 1141

Query: 303  IEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIA 362
            IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH  G LV+A+GQKI++W L+ ++LTG+A
Sbjct: 1142 IEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMA 1201

Query: 363  FIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGN 422
            FIDT++YI  M+SVKN IL  D  +SI+LLRYQ E +TLSLV+RD KP +  S  +   N
Sbjct: 1202 FIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDN 1261

Query: 423  PSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQ 482
                                                    + +GF++SD+D+N++++MY 
Sbjct: 1262 ----------------------------------------AQLGFLVSDRDRNLMVYMYL 1281

Query: 483  PEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGA---RSRFLTWYASLDGA 539
            PEA+ES GG RL+++ DFH+G HVNTF++  C+ ++   +  +    ++ +TW+A+LDG 
Sbjct: 1282 PEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGATEGLSKKSVVWENKHITWFATLDGG 1341

Query: 540  LGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLV 599
            +G  LP+ EK YRRLLMLQN + T   H  GLNPRAFR          N  R ++DG L+
Sbjct: 1342 IGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELL 1401

Query: 600  WKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
             ++L LS  ER E+ KKIG+  + ILD+L + + +++HF
Sbjct: 1402 NRYLYLSTMERSELAKKIGTTPDIILDDLLETDRVTAHF 1440


>gi|195056749|ref|XP_001995154.1| GH22991 [Drosophila grimshawi]
 gi|193899360|gb|EDV98226.1| GH22991 [Drosophila grimshawi]
          Length = 1426

 Score =  623 bits (1606), Expect = e-175,   Method: Compositional matrix adjust.
 Identities = 304/631 (48%), Positives = 412/631 (65%), Gaps = 51/631 (8%)

Query: 19   ELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGALKLRFKKLKVLFVSDRSKRANEQP 78
            EL  V LG HG RPLLLVRT+ ELLIYQ FR+ KG LK+RF+KL+ L + ++     E  
Sbjct: 836  ELCLVGLGQHGERPLLLVRTRLELLIYQVFRYAKGHLKIRFRKLEQLHLLEQQPTHIELD 895

Query: 79   GLP---------RGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGP 129
            G           +   + ++RYF+N+ G  G+ +CG +P ++FLTSRGELR H +  +G 
Sbjct: 896  GEDVEEAESYNMQAKYVQKLRYFANVGGLAGIMVCGVNPCFVFLTSRGELRIHRLLGNGD 955

Query: 130  VSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLET 189
            V + A F+NVN P GFLYF+   EL+ISVLP++LSYDA WPVRKVPL+CTP  L YH E 
Sbjct: 956  VRSFAAFNNVNIPHGFLYFDTTYELKISVLPSYLSYDAAWPVRKVPLRCTPRQLVYHREN 1015

Query: 190  KTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTN 249
            + YC++T   EP T YY+FNGEDKEL  + R  RFI P+ S F + L SP +WE +P  +
Sbjct: 1016 RVYCLITQKEEPMTKYYRFNGEDKELSEECRGERFIYPIGSLFEMVLISPETWEIVPDAS 1075

Query: 250  FPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEP 309
                 WEHV   K V + YEGT SGL+ Y+ +GTN+NYSED+T RG I ++DIIEVVPEP
Sbjct: 1076 IQFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNYSEDITSRGNIHIYDIIEVVPEP 1135

Query: 310  GQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVY 369
            G+P+TK K+K ++ KEQKGPV+AI  V GFLVT +GQKIYIWQL+D DL G+AFIDT +Y
Sbjct: 1136 GKPMTKFKLKEVFKKEQKGPVSAISDVVGFLVTGLGQKIYIWQLRDGDLIGVAFIDTNIY 1195

Query: 370  IASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIID 429
            +  +++VK+LI + D  +SI+LLR+Q E+RTLSL +RD+ P +     +   N       
Sbjct: 1196 VHQIITVKSLIFIADVYKSISLLRFQEEHRTLSLASRDFNPMEVFGIEFMVDN------- 1248

Query: 430  GSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESN 489
                                             S++GF+++D ++N++++MYQPEARES 
Sbjct: 1249 ---------------------------------SNLGFLVTDAERNLIVYMYQPEARESL 1275

Query: 490  GGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPG--ARSRFLTWYASLDGALGFFLPLP 547
            GG +L++K D+HLGQ VNT F+++C    +         ++ L  Y SLDGALG+ LPLP
Sbjct: 1276 GGQKLLRKADYHLGQVVNTMFRVQCHQRGLHQRQPFLYENKHLVIYGSLDGALGYCLPLP 1335

Query: 548  EKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSL 607
            EK YRR LMLQNV++++  H  GLNP+ +RT K       NPSR IIDG L+W F  L+ 
Sbjct: 1336 EKVYRRFLMLQNVLLSYQDHLCGLNPKEYRTIKSVKKLGINPSRCIIDGDLIWSFRMLAH 1395

Query: 608  GERLEICKKIGSKHNDILDELYDIEALSSHF 638
             ER E+ KKIG++  +IL +L +IE +S+ F
Sbjct: 1396 SERNEVAKKIGTRTEEILADLLEIERISAVF 1426


>gi|395512730|ref|XP_003760588.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
            [Sarcophilus harrisii]
          Length = 1449

 Score =  622 bits (1605), Expect = e-175,   Method: Compositional matrix adjust.
 Identities = 311/641 (48%), Positives = 415/641 (64%), Gaps = 60/641 (9%)

Query: 16   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGALKLRFKKL--------- 62
            +V+E+L V+LG    RP LLV    ELLIY+AF H     +  LK+RFKK+         
Sbjct: 851  LVKEVLLVALGNRQTRPYLLVHVDQELLIYEAFAHDSQLGQSNLKVRFKKVPHNINFREK 910

Query: 63   KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAH 122
            K      + +    + G     R+++ RYF +I GY GVF+CGP P WL +T RG LR H
Sbjct: 911  KPKPSKKKPEGGGAEEGAGARGRVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRLH 970

Query: 123  PMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHF 182
            PM IDGP+ + APFHNVNCPRGFLYFN + ELRISVLP +LSYDAPWPVRK+PL+CT H+
Sbjct: 971  PMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHY 1030

Query: 183  LAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSW 242
            +AYH+E+K Y + TST    T   +  GE+KE  T  RD R+I PL   F + L SP SW
Sbjct: 1031 VAYHVESKVYAVATSTNALCTRIPRMTGEEKEFETIERDDRYIHPLQEAFSIQLISPVSW 1090

Query: 243  EEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDI 302
            E IP     L EWEHV C+K VS+  E T+SGL+GY+A GT     E+VTCRGRIL+ D+
Sbjct: 1091 EAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDV 1150

Query: 303  IEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIA 362
            IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH  G LV+A+GQKI++W L+ ++LTG+A
Sbjct: 1151 IEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMA 1210

Query: 363  FIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGN 422
            FIDT++YI  M+SVKN IL  D  +SI+LLRYQ E +TLSLV+RD KP +  S       
Sbjct: 1211 FIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSV------ 1264

Query: 423  PSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQ 482
                                              D + + + +GF++SD+D+N++++MY 
Sbjct: 1265 ----------------------------------DFMVDSAQLGFLVSDRDRNLMVYMYL 1290

Query: 483  PEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARS-----RFLTWYASLD 537
            PEA+ES GG RL+++ DFH+G HVNTF++  C+ +  ++ P  +S     + +TW+A+LD
Sbjct: 1291 PEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGA--AEGPTKKSIVWENKHITWFATLD 1348

Query: 538  GALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGS 597
            G +G  LP+ EK YRRLLMLQN + T   H  GLNPRAFR          N  R ++DG 
Sbjct: 1349 GGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGE 1408

Query: 598  LVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
            L+ ++L LS  ER E+ KKIG+  + ILD+L +I+ +++HF
Sbjct: 1409 LLNRYLYLSTMERGELAKKIGTTPDIILDDLLEIDRVTAHF 1449


>gi|195122290|ref|XP_002005645.1| GI18959 [Drosophila mojavensis]
 gi|193910713|gb|EDW09580.1| GI18959 [Drosophila mojavensis]
          Length = 1431

 Score =  622 bits (1604), Expect = e-175,   Method: Compositional matrix adjust.
 Identities = 304/637 (47%), Positives = 413/637 (64%), Gaps = 62/637 (9%)

Query: 19   ELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGALKLRFKKLKVLFV----------- 67
            EL  V LG HG+RPLLLVRT+ ELLIYQ FR+ KG LK+RF+KL+ L +           
Sbjct: 840  ELSLVGLGQHGDRPLLLVRTRLELLIYQVFRYAKGHLKIRFRKLEQLHLLDQQPTHIELI 899

Query: 68   ----SDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHP 123
                +D ++  N QP       + ++RYF+N+ G  G+ +CG +P ++FLT+RGELR H 
Sbjct: 900  NEEETDEAESYNMQPKY-----VQKLRYFNNVGGLAGIMVCGVNPCFIFLTARGELRIHR 954

Query: 124  MTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFL 183
            +  +  V + A F+NVN P GFLYF+   EL+ISVLPT+LSYDA WPVRKVPL+CTP  L
Sbjct: 955  LLGNAEVRSFAAFNNVNIPHGFLYFDTTYELKISVLPTYLSYDAAWPVRKVPLRCTPRQL 1014

Query: 184  AYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWE 243
             YH E + YC++T   EP T YY+FNGEDKEL  + R  RFI P+ S F + L SP +WE
Sbjct: 1015 VYHRENRVYCLITQKEEPMTKYYRFNGEDKELSEESRGERFIYPIGSLFEMVLISPETWE 1074

Query: 244  EIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDII 303
             +P  +     WEHV   K V + YEGT SGL+ Y+ +GTN+NYSED+T RG I ++DII
Sbjct: 1075 IVPDASIQFEPWEHVTAFKLVKLSYEGTRSGLKEYLCIGTNFNYSEDITSRGNIHIYDII 1134

Query: 304  EVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAF 363
            EVVPEPG+P+TK K+K ++ KEQKGPV+AI  V GFLVT +GQKIYIWQL+D DL G+AF
Sbjct: 1135 EVVPEPGKPMTKFKLKEVFKKEQKGPVSAISDVVGFLVTGLGQKIYIWQLRDGDLIGVAF 1194

Query: 364  IDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNP 423
            IDT +Y+  +++VK+LI + D  +SI+LLR+Q EYRTLSL +RD+ P +     +   N 
Sbjct: 1195 IDTNIYVHQIITVKSLIFIADVYKSISLLRFQEEYRTLSLASRDFNPLEVFGIEFMVDN- 1253

Query: 424  SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQP 483
                                                   S++GF+++D ++N++++MYQP
Sbjct: 1254 ---------------------------------------SNLGFLVTDAERNIIVYMYQP 1274

Query: 484  EARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPG--ARSRFLTWYASLDGALG 541
            EARES GG +L++K D+HLGQ VNT F+++C    +         ++    Y +LDGALG
Sbjct: 1275 EARESLGGQKLLRKADYHLGQVVNTMFRVQCHQRGLHQRQPFLYENKHFVIYGTLDGALG 1334

Query: 542  FFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWK 601
            + LPLPEK YRR LMLQNV++++  H  GLNP+ +RT K       NPSR IIDG L+W 
Sbjct: 1335 YCLPLPEKVYRRFLMLQNVLLSYQDHLCGLNPKEYRTIKTVKKMGINPSRCIIDGDLIWS 1394

Query: 602  FLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
            +  L+  ER E+ KKIG++  +IL +L +IE LS+ F
Sbjct: 1395 YRMLAHSERSEVAKKIGTRTEEILADLLEIERLSAIF 1431


>gi|355680843|gb|AER96659.1| cleavage and polyadenylation specific factor 1, 160kDa [Mustela
            putorius furo]
          Length = 1399

 Score =  622 bits (1603), Expect = e-175,   Method: Compositional matrix adjust.
 Identities = 308/640 (48%), Positives = 414/640 (64%), Gaps = 60/640 (9%)

Query: 16   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGALKLRFKKL--------- 62
            +V+E+L V+LG   +RP LLV    ELLIY+AF H     +G LK+RFKK+         
Sbjct: 802  LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKVPHNINFREK 861

Query: 63   KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAH 122
            K      +++    + G     R+++ RYF +I GY GVF+CGP P WL +T RG LR H
Sbjct: 862  KPKPSKKKAEGGGAEEGAAARGRVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRLH 921

Query: 123  PMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHF 182
            PM IDGP+ + APFHNVNCPRGFLYFN + ELRISVLP +LSYDAPWPVRK+PL+CT H+
Sbjct: 922  PMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHY 981

Query: 183  LAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSW 242
            +AYH+E+K Y + TST  P T   +  GE+KE     RD R++ P    F + L SP SW
Sbjct: 982  VAYHVESKVYAVATSTNMPCTRIPRMTGEEKEFEAIERDDRYVHPQQEAFSIQLISPVSW 1041

Query: 243  EEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDI 302
            E IP     L EWEHV C+K VS+  E T+SGL+GY+A GT     E+VTCRGRIL+ D+
Sbjct: 1042 EAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDV 1101

Query: 303  IEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIA 362
            IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH  G LV+A+GQKI++W L+ ++LTG+A
Sbjct: 1102 IEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMA 1161

Query: 363  FIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGN 422
            FIDT++YI  M+SVKN IL  D  +SI+LLRYQ E +TLSLV+RD KP +  S  +   N
Sbjct: 1162 FIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDN 1221

Query: 423  PSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQ 482
                                                    + +GF++SD+D+N++++MY 
Sbjct: 1222 ----------------------------------------AQLGFLVSDRDRNLMVYMYL 1241

Query: 483  PEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARS-----RFLTWYASLD 537
            PEA+ES GG RL+++ DFH+G HVNTF++  C+ +  ++ P  +S     + +TW+A+LD
Sbjct: 1242 PEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGA--AEGPSKKSVVWENKHITWFATLD 1299

Query: 538  GALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGS 597
            G +G  LP+ EK YRRLLMLQN + T   H  GLNPRAFR          N  R ++DG 
Sbjct: 1300 GGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRLLHADRRALQNAVRNVLDGE 1359

Query: 598  LVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSH 637
            L+ ++L LS  ER E+ KKIG+  + ILD+L + + +++H
Sbjct: 1360 LLNRYLYLSTMERGELAKKIGTTPDIILDDLLETDRVTAH 1399


>gi|195381337|ref|XP_002049409.1| GJ21566 [Drosophila virilis]
 gi|194144206|gb|EDW60602.1| GJ21566 [Drosophila virilis]
          Length = 1420

 Score =  621 bits (1602), Expect = e-175,   Method: Compositional matrix adjust.
 Identities = 304/633 (48%), Positives = 413/633 (65%), Gaps = 55/633 (8%)

Query: 19   ELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGALKLRFKKLKVLFV----------- 67
            EL  V LG HG RPLLLVRT+ ELLIYQ FR+ KG LK+RF+KL+ L +           
Sbjct: 830  ELCLVGLGQHGERPLLLVRTRLELLIYQVFRYAKGHLKIRFRKLEQLHLLDQQPTHIELD 889

Query: 68   SDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTID 127
             D ++ A      P+ V+  ++RYFSN+ G  G+ +CG +P ++FLT+RGELR H +  +
Sbjct: 890  GDEAEEAESYNMQPKYVQ--KLRYFSNVGGLAGIMVCGMNPVFVFLTARGELRIHRLLGN 947

Query: 128  GPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHL 187
              V + A F+NVN P GFLYF+   EL+ISVLP++LSYDA WPVRKVPL+CTP  L YH 
Sbjct: 948  ADVRSFAAFNNVNIPHGFLYFDTTYELKISVLPSYLSYDAAWPVRKVPLRCTPRQLVYHR 1007

Query: 188  ETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQ 247
            E + YC++T   EP T YY+FNGEDKEL  + R  RFI P+ S F + L SP +WE +P 
Sbjct: 1008 ENRVYCLITQKEEPMTKYYRFNGEDKELSEESRGERFIYPIGSLFEMVLISPETWEIVPD 1067

Query: 248  TNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVP 307
             +     WEHV   K V + YEGT SGL+ Y+ +GTN+NYSED+T RG I ++DIIEVVP
Sbjct: 1068 ASIQFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNYSEDITSRGNIHIYDIIEVVP 1127

Query: 308  EPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTE 367
            EPG+P+TK K+K ++ KEQKGPV+AI  V GFLVT +GQKIYIWQL+D DL G+AFIDT 
Sbjct: 1128 EPGKPMTKFKLKEVFKKEQKGPVSAISDVVGFLVTGLGQKIYIWQLRDGDLIGVAFIDTN 1187

Query: 368  VYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGI 427
            +Y+  +++VK+LI + D  +SI+LLR+Q EYRTLSL +RD+ P +     +   N     
Sbjct: 1188 IYVHQIITVKSLIFIADVYKSISLLRFQEEYRTLSLASRDFNPLEVFGIEFMVDN----- 1242

Query: 428  IDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARE 487
                                               S++GF+++D ++N++++MYQPEARE
Sbjct: 1243 -----------------------------------SNLGFLVTDAERNLIVYMYQPEARE 1267

Query: 488  SNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPG--ARSRFLTWYASLDGALGFFLP 545
            S GG +L++K D+HLGQ VNT F+++C    +         ++ L  Y +LDGALG+ LP
Sbjct: 1268 SLGGQKLLRKADYHLGQVVNTMFRVQCHQRGLHHRQPFLYENKHLVIYGTLDGALGYCLP 1327

Query: 546  LPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQL 605
            LPEK YRR LMLQNV++++  H  GLNP+ +RT K       NPSR IIDG L+W +  L
Sbjct: 1328 LPEKVYRRFLMLQNVLLSYQDHLCGLNPKEYRTIKTVKKMGINPSRCIIDGDLIWSYRML 1387

Query: 606  SLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
            +  ER E+ KKIG++  +IL ++ +IE LS+ F
Sbjct: 1388 AHSERSEVAKKIGTRTEEILADMLEIERLSAVF 1420


>gi|351713968|gb|EHB16887.1| Cleavage and polyadenylation specificity factor subunit 1
            [Heterocephalus glaber]
          Length = 1440

 Score =  620 bits (1598), Expect = e-175,   Method: Compositional matrix adjust.
 Identities = 309/642 (48%), Positives = 413/642 (64%), Gaps = 63/642 (9%)

Query: 16   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGALKLRFKKL--------- 62
            +V+E+L V+LG   +RP LLV    ELLIY+AF H     +G LK+RFKK+         
Sbjct: 843  LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKVPHNINFREK 902

Query: 63   -KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRA 121
                        + E+    RG R+++ RYF +I GY GVF+CGP P WL +T RG LR 
Sbjct: 903  KPKPSKKKAEGGSTEEGSGVRG-RVARFRYFEDIYGYSGVFICGPSPHWLLVTGRG-LRL 960

Query: 122  HPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPH 181
            HPM IDGP+ + APFHNVNCPRGFLYFN + ELRISVLP +LSYDAPWPVRK+PL+CT H
Sbjct: 961  HPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAH 1020

Query: 182  FLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFS 241
            ++AYH+E+K Y + TST+ P T   +  GE+KE     RD R+I P    F + L SP S
Sbjct: 1021 YVAYHVESKVYAVATSTSTPCTRIPRMTGEEKEFEAIERDDRYIHPQQEAFSIQLISPVS 1080

Query: 242  WEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFD 301
            WE IP     L EWEHV C+K VS+  E T+SGL+GY+A GT     E+VTCRGR+  ++
Sbjct: 1081 WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRVRDWE 1140

Query: 302  IIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGI 361
             IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH  G LV+A+GQKI++W L+ ++LTG+
Sbjct: 1141 RIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGM 1200

Query: 362  AFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAG 421
            AFIDT++YI  M+SVKN IL  D  +SI+LLRYQ E +TLSLV+RD KP +  S  +   
Sbjct: 1201 AFIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVD 1260

Query: 422  NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMY 481
            N                                        + +GF++SD+D+N++++MY
Sbjct: 1261 N----------------------------------------AQLGFLVSDRDRNLMVYMY 1280

Query: 482  QPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARS-----RFLTWYASL 536
             PEA+ES GG RL+++ DFH+G HVNTF++  C+ +  S+ P  +S     + +TW+A+L
Sbjct: 1281 LPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGA--SEGPSKKSVVWENKHITWFATL 1338

Query: 537  DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDG 596
            DG +G  LP+ EK YRRLLMLQN + T   H  GLNPRAFR          N  R ++DG
Sbjct: 1339 DGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDG 1398

Query: 597  SLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
             L+ ++L LS  ER E+ KKIG+  + ILD+L + + +++HF
Sbjct: 1399 ELLNRYLYLSTMERGELAKKIGTTPDIILDDLLETDRVTAHF 1440


>gi|440904368|gb|ELR54893.1| Cleavage and polyadenylation specificity factor subunit 1, partial
            [Bos grunniens mutus]
          Length = 1417

 Score =  619 bits (1595), Expect = e-174,   Method: Compositional matrix adjust.
 Identities = 305/623 (48%), Positives = 402/623 (64%), Gaps = 62/623 (9%)

Query: 16   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGALKLRFKKL--------- 62
            +V+E+L V+LG    RP LLV    ELLIY+AF H     +G LK+RFKK+         
Sbjct: 836  LVKEVLLVALGSRQRRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKVPHNINFREK 895

Query: 63   -KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRA 121
                        + E+   PRG R+++ RYF +I GY GVF+CGP P WL +T RG LR 
Sbjct: 896  KPKPSKKKAEGGSTEEGTGPRG-RVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRL 954

Query: 122  HPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPH 181
            HPM IDGP+ + APFHN+NCPRGFLYFN + ELRISVLP +LSYDAPWPVRK+PL+CT H
Sbjct: 955  HPMGIDGPIDSFAPFHNINCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAH 1014

Query: 182  FLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFS 241
            ++AYH+E+K Y + TST+ P T   +  GE+KE  T  RD R++ P    F + L SP S
Sbjct: 1015 YVAYHVESKVYAVATSTSTPCTRVPRMTGEEKEFETIERDERYVHPQQEAFCIQLISPVS 1074

Query: 242  WEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFD 301
            WE IP     L EWEHV C+K VS+  E T+SGL+GY+A GT     E+VTCRGRIL+ D
Sbjct: 1075 WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMD 1134

Query: 302  IIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGI 361
            +IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH  G LV+A+GQKI++W L+ ++LTG+
Sbjct: 1135 VIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGM 1194

Query: 362  AFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAG 421
            AFIDT++YI  M+SVKN IL  D  +SI+LLRYQ E +TLSLV+RD KP +  S  +   
Sbjct: 1195 AFIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVD 1254

Query: 422  NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMY 481
            N                                        + +GF++SD+D+N++++MY
Sbjct: 1255 N----------------------------------------AQLGFLVSDRDRNLMVYMY 1274

Query: 482  QPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARS-----RFLTWYASL 536
             PEA+ES GG RL+++ DFH+G HVNTF++  C+    ++ P  +S     + +TW+A+L
Sbjct: 1275 LPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCR--GAAEGPSKKSVVWENKHITWFATL 1332

Query: 537  DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDG 596
            DG +G  LP+ EK YRRLLMLQN + T   H  GLNPRAFR          N  R ++DG
Sbjct: 1333 DGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRVLQNAVRNVLDG 1392

Query: 597  SLVWKFLQLSLGERLEICKKIGS 619
             L+ ++L LS  ER E+ KKIG+
Sbjct: 1393 ELLNRYLYLSPMERGELAKKIGT 1415


>gi|312380158|gb|EFR26239.1| hypothetical protein AND_07834 [Anopheles darlingi]
          Length = 1503

 Score =  618 bits (1594), Expect = e-174,   Method: Compositional matrix adjust.
 Identities = 304/645 (47%), Positives = 419/645 (64%), Gaps = 70/645 (10%)

Query: 18   QELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGALKLRFKKLKV-----LFVSDRSK 72
            +E+L V+LG +G+RP+L +R + +LLIY+ FR+ KG LKLRFK+L        F +  ++
Sbjct: 845  KEILMVALGSYGSRPILFIRLEQDLLIYRVFRYAKGHLKLRFKRLTSSVTCPAFRTVPAR 904

Query: 73   RAN--EQPGL--------PRGV------------RISQMRYFSNIAGYQGVFLCGPHPAW 110
             AN  ++P          P G              IS +RYF N++GY GV +CG  P +
Sbjct: 905  LANLPDKPATGATTDATEPNGKDTQEHATKVQYENISMIRYFGNVSGYAGVAVCGEKPYF 964

Query: 111  LFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWP 170
            LFLT+ GELR+H +     +   APF+NVNCP GFLYF+ + +L+IS+LPT+LSYD+ WP
Sbjct: 965  LFLTAHGELRSHRLYARTVMKAFAPFNNVNCPNGFLYFDEQYQLKISILPTYLSYDSVWP 1024

Query: 171  VRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVS 230
            VRK+PL+ +P  + YH E + YC+V    E    YY+FNGEDKEL  + +  RF+ P+  
Sbjct: 1025 VRKIPLRSSPKQIVYHRENRVYCVVMDAEEICNKYYRFNGEDKELTEENKGERFLYPMGH 1084

Query: 231  QFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSED 290
            QF V L +P +WE +P T   L EWEHV+ LKNVS+ YEG  SGL+ YIA+GTN+NYSED
Sbjct: 1085 QFSVVLVNPAAWEIVPDTAIALEEWEHVVSLKNVSLAYEGARSGLKEYIAVGTNFNYSED 1144

Query: 291  VTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYI 350
            +T RGR+LL+DIIEVVPEPG+PLTK+K K +  K+QKGPV+AI HV GFLV AVGQK+Y+
Sbjct: 1145 ITSRGRLLLYDIIEVVPEPGKPLTKHKFKEVIVKDQKGPVSAISHVCGFLVGAVGQKVYL 1204

Query: 351  WQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKP 410
            WQ+KD+DL G+AFIDT +++  MVS+K+LILV D  +S++LLR+Q E+RTLSLV+RDY P
Sbjct: 1205 WQMKDDDLVGVAFIDTNIFVHQMVSIKSLILVADVYKSVSLLRFQDEFRTLSLVSRDYHP 1264

Query: 411  TQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMIS 470
                   Y   N                                        +++GF+++
Sbjct: 1265 LNVYQVEYVVDN----------------------------------------TNLGFLVA 1284

Query: 471  DKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC---KPSSISDAPGARS 527
            D   N++ +MYQPE+RES GG RL++K D+HLGQ VN  F+++C   +   +       +
Sbjct: 1285 DDQANLITYMYQPESRESFGGQRLLRKGDYHLGQRVNAMFRVQCDFHESDVMRRTLNYDN 1344

Query: 528  RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAG 587
            +  T++A+LDG  GF LPLPEK YRRL MLQNV++TH+ HT GLNP+A+RT K       
Sbjct: 1345 KHTTFFATLDGGFGFVLPLPEKTYRRLFMLQNVLLTHSPHTCGLNPKAYRTIKQSRALPI 1404

Query: 588  NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIE 632
            NPSR ++DG LVW FL+L   E+ E+ KKIG++  +I  +L +IE
Sbjct: 1405 NPSRCVVDGDLVWSFLELPANEKQEVAKKIGTRIEEICADLMEIE 1449


>gi|194756960|ref|XP_001960738.1| GF11349 [Drosophila ananassae]
 gi|190622036|gb|EDV37560.1| GF11349 [Drosophila ananassae]
          Length = 1455

 Score =  618 bits (1594), Expect = e-174,   Method: Compositional matrix adjust.
 Identities = 302/649 (46%), Positives = 419/649 (64%), Gaps = 52/649 (8%)

Query: 2    GNFRSHSPSAMDETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGALKLRFKK 61
            G  ++  P   +  +  EL  + LGL+G RPLLLVRT+ ELLIYQ FR+PKG LK+RF+K
Sbjct: 847  GIVQACMPQHANSPLPLELTVLGLGLNGERPLLLVRTRVELLIYQVFRYPKGHLKIRFRK 906

Query: 62   LKVLFVSD----------RSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWL 111
            L+ L + D            +R   +    +   + ++R F+N+ G  G+ +CG +P ++
Sbjct: 907  LEQLNLMDHQPSHIELDENDEREEMESYQMQPKYVQKLRPFANVGGLSGIMVCGVNPCFV 966

Query: 112  FLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPV 171
            FLTSRGELR H +  +G V + A F+NVN P GFLYF+   EL+ISVLP++LSYD+ WP+
Sbjct: 967  FLTSRGELRIHRLLGNGDVRSFAAFNNVNIPNGFLYFDTTFELKISVLPSYLSYDSTWPI 1026

Query: 172  RKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQ 231
            RKVPL+CTP  L YH E + YC++T   EP T +Y+FNGEDKEL  + R  RFI P+ SQ
Sbjct: 1027 RKVPLRCTPRQLVYHRENRVYCLITQNEEPMTKFYRFNGEDKELSEESRGERFIYPIGSQ 1086

Query: 232  FHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDV 291
            F + L SP +WE +P  +     WEHV   K V + YEGT SGL+ Y+ +GTN+NYSED+
Sbjct: 1087 FEMVLISPETWEIVPDASIRFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNYSEDI 1146

Query: 292  TCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIW 351
            T RG I ++DIIEVVPEPG+P+TK K+K ++ KEQKGPV+AI  V GFLVT +GQKIYIW
Sbjct: 1147 TSRGNIHIYDIIEVVPEPGKPMTKFKLKEVFKKEQKGPVSAISDVLGFLVTGLGQKIYIW 1206

Query: 352  QLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPT 411
            QL+D DL G+AFIDT +Y+  +++VK+LI + D  +SI+LLR+Q EYRTLSL +RD+ P 
Sbjct: 1207 QLRDGDLIGVAFIDTNIYVHQIITVKSLIFIADVYKSISLLRFQEEYRTLSLASRDFNPL 1266

Query: 412  QPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISD 471
            +     +   N                                        S++GF+++D
Sbjct: 1267 EVYGIEFMVDN----------------------------------------SNLGFLVTD 1286

Query: 472  KDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPG--ARSRF 529
             ++N++++MYQPEARES GG +L++K D+HLGQ VNT F+++C    +         ++ 
Sbjct: 1287 AERNLIVYMYQPEARESLGGQKLLRKADYHLGQVVNTMFRVQCHQKGLHQRQPFLYENKH 1346

Query: 530  LTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNP 589
               Y +LDGALG+ LPLPEK YRR LMLQNV++++  H  GLNP+ +RT K       NP
Sbjct: 1347 FVVYGTLDGALGYCLPLPEKLYRRFLMLQNVLLSYQEHLCGLNPKEYRTIKAVKKQGINP 1406

Query: 590  SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
            SR IIDG L+W +  L+  ER E+ KKIG++  +IL +L +IE L+S F
Sbjct: 1407 SRCIIDGDLIWSYRLLANSERNEVAKKIGTRTEEILSDLLEIERLASVF 1455


>gi|354491126|ref|XP_003507707.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
            isoform 3 [Cricetulus griseus]
          Length = 1449

 Score =  618 bits (1594), Expect = e-174,   Method: Compositional matrix adjust.
 Identities = 304/622 (48%), Positives = 403/622 (64%), Gaps = 60/622 (9%)

Query: 16   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGALKLRFKKL--------- 62
            +V+E+L V+LG   +RP LLV    ELLIY+AF H     +G LK+RFKK+         
Sbjct: 843  LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKVPHNINFREK 902

Query: 63   KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAH 122
            K      +++  + + G     R+++ RYF +I GY GVF+CGP P WL +T RG LR H
Sbjct: 903  KPKPSKKKAEGCSTEEGSGVRGRVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRLH 962

Query: 123  PMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHF 182
            PM IDGP+ + APFHNVNCPRGFLYFN + ELRISVLP +LSYDAPWPVRK+PL+CT H+
Sbjct: 963  PMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHY 1022

Query: 183  LAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSW 242
            +AYH+E+K Y + TST  P T   +  GE+KE     RD R+I P    F + L SP SW
Sbjct: 1023 VAYHVESKVYAVATSTNTPCTRIPRMTGEEKEFEAIERDDRYIHPQQEAFSIQLISPVSW 1082

Query: 243  EEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDI 302
            E IP     L EWEHV C+K VS+  E T+SGL+GY+A GT     E+VTCRGRIL+ D+
Sbjct: 1083 EAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDV 1142

Query: 303  IEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIA 362
            IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH  G LV+A+GQKI++W L+ ++LTG+A
Sbjct: 1143 IEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMA 1202

Query: 363  FIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGN 422
            FIDT++YI  M+SVKN IL  D  +SI+LLRYQ E +TLSLV+RD KP +  S  +   N
Sbjct: 1203 FIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDN 1262

Query: 423  PSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQ 482
                                                    + +GF++SD+D+N++++MY 
Sbjct: 1263 ----------------------------------------AQLGFLVSDRDRNLMVYMYL 1282

Query: 483  PEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARS-----RFLTWYASLD 537
            PEA+ES GG RL+++ DFH+G HVNTF++  C+ +  ++ P  +S     + +TW+A+LD
Sbjct: 1283 PEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGA--AEGPSKKSVMWENKHITWFATLD 1340

Query: 538  GALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGS 597
            G +G  LP+ EK YRRLLMLQN + T   H  GLNPRAFR          N  R ++DG 
Sbjct: 1341 GGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGE 1400

Query: 598  LVWKFLQLSLGERLEICKKIGS 619
            L+ ++L LS  ER E+ KKIG+
Sbjct: 1401 LLNRYLYLSTMERSELAKKIGT 1422


>gi|195455711|ref|XP_002074834.1| GK23274 [Drosophila willistoni]
 gi|194170919|gb|EDW85820.1| GK23274 [Drosophila willistoni]
          Length = 1463

 Score =  617 bits (1592), Expect = e-174,   Method: Compositional matrix adjust.
 Identities = 306/654 (46%), Positives = 418/654 (63%), Gaps = 62/654 (9%)

Query: 2    GNFRSHSPSAMDETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGALKLRFKK 61
            G  +S  P   +  +  EL  V LGL+G RPLLLVRT+ ELLIYQ FR+PKG LK+RF+K
Sbjct: 855  GIVQSCMPQHANSPLPLELSLVGLGLNGERPLLLVRTRLELLIYQVFRYPKGHLKIRFRK 914

Query: 62   LKVLFVSDRS---------------KRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGP 106
            +  L + D+                +  N QP       + ++R F+N+ G  GV +CG 
Sbjct: 915  MDQLNLLDQQPTHVNLDDNEENEELESYNMQPKY-----VQKLRPFNNVGGMSGVMICGV 969

Query: 107  HPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYD 166
            +P +LFLTSRGELR H +  +G V + A F+N+N P GFL+F+   EL+ISVLP++LSYD
Sbjct: 970  NPCFLFLTSRGELRIHRLLGNGEVRSFAAFNNINIPNGFLFFDTTFELKISVLPSYLSYD 1029

Query: 167  APWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIP 226
            + WPVRKVPL+CTP  L YH E + YC++T T EP T +Y+FNGEDKEL  + R  RFI 
Sbjct: 1030 STWPVRKVPLRCTPRQLVYHRENRVYCLITQTEEPMTKFYRFNGEDKELSEESRGERFIY 1089

Query: 227  PLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYN 286
            P+ SQF + L SP +WE +P  +     WEHV   K V + YEGT SGL+ Y+ +GTN+N
Sbjct: 1090 PIGSQFDMVLISPETWEIVPDASIRFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFN 1149

Query: 287  YSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQ 346
            YSED+T RG I ++DIIEVVPEPG+P+TK K+K ++ KEQKGPV+AI  V GFLVT +GQ
Sbjct: 1150 YSEDITSRGNIHIYDIIEVVPEPGKPMTKFKLKEVFKKEQKGPVSAISDVLGFLVTGLGQ 1209

Query: 347  KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVAR 406
            KIYIWQL+D DL G+AFIDT +Y+  +++VK+LI + D  +SI+LLR+Q EYRTLSL +R
Sbjct: 1210 KIYIWQLRDGDLIGVAFIDTNIYVHQIITVKSLIFIADVYKSISLLRFQEEYRTLSLASR 1269

Query: 407  DYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMG 466
            D+ P +     +   N                                        +++G
Sbjct: 1270 DFNPLEVYGIEFMVDN----------------------------------------TNLG 1289

Query: 467  FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPG-- 524
            F+++D + N++++MYQPEARES GG +L++K D+HLGQ VNT F+++C    +       
Sbjct: 1290 FLVTDAESNLIVYMYQPEARESLGGQKLLRKADYHLGQVVNTMFRVQCHQRGLHQRQPFL 1349

Query: 525  ARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGY 584
              ++    Y +LDGALG+ LPLPEK YRR LMLQNV++++  H  GLNP+ +RT K    
Sbjct: 1350 YENKHFVVYGTLDGALGYCLPLPEKVYRRFLMLQNVLLSYQDHLCGLNPKEYRTLKSSKR 1409

Query: 585  YAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
               NPSR IIDG L+W +  L+  ER E+ KKIG++  +IL +L +IE LS  F
Sbjct: 1410 LGINPSRCIIDGDLIWSYRLLANSERNEVAKKIGTRTEEILADLLEIERLSGVF 1463


>gi|296227035|ref|XP_002807684.1| PREDICTED: LOW QUALITY PROTEIN: cleavage and polyadenylation
            specificity factor subunit 1 [Callithrix jacchus]
          Length = 1394

 Score =  617 bits (1591), Expect = e-174,   Method: Compositional matrix adjust.
 Identities = 305/640 (47%), Positives = 409/640 (63%), Gaps = 58/640 (9%)

Query: 16   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRH----PKGALKLRFKKL--------- 62
            +V+E+L V+LG   +RP LLV    ELLIY+AF H     +G LK+RFKK+         
Sbjct: 796  LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLSQGNLKVRFKKVPHNINFREK 855

Query: 63   -KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRA 121
                        + E+    RG R+++ RYF +I GY GVF+CGP P WL +T RG LR 
Sbjct: 856  KPKPSKKKAEGGSTEEGAGARG-RVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRL 914

Query: 122  HPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPH 181
            HPM IDGPV + APFHN+NCPRGFLYFN + ELRISVLP +LSYDAPWPVRK+PL+CT H
Sbjct: 915  HPMGIDGPVDSFAPFHNINCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAH 974

Query: 182  FLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFS 241
            ++AYH+E+K Y + TST  P T   +  GE+KE  T  RD R+I P    F + L SP S
Sbjct: 975  YVAYHVESKVYAVATSTNTPCTRIPRMTGEEKEFETIERDERYIHPQQEAFSIQLISPVS 1034

Query: 242  WEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFD 301
            WE IP     L EWEHV C+K VS+  E T+SGL+GY+A GT     E+VTCRGRIL+ D
Sbjct: 1035 WEAIPNARIELQEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMD 1094

Query: 302  IIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGI 361
            +IEVV EP Q LT  K K++Y KEQKGPVTA+CH  G LV+A+GQKI++W L+ ++LTG+
Sbjct: 1095 VIEVVTEPRQTLTXXKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGM 1154

Query: 362  AFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAG 421
            AFIDT++YI  M+SVKN IL  D  +SI+LLRYQ E +TLSLV+RD KP +  S  +   
Sbjct: 1155 AFIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVD 1214

Query: 422  NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMY 481
            N                                        + +GF++SD+D+N++++MY
Sbjct: 1215 N----------------------------------------AQLGFLVSDRDRNLMVYMY 1234

Query: 482  QPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGA---RSRFLTWYASLDG 538
             PEA+ES GG RL+++ DFH+G HVNTF++  C+ ++   +  +    ++ +TW+A+LDG
Sbjct: 1235 LPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGATEGLSKKSVMWENKHITWFATLDG 1294

Query: 539  ALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSL 598
             +G  LP+ EK YRRLLMLQN + T   H  GLNPRAFR          N  R ++DG L
Sbjct: 1295 GIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRTLQNAVRNVLDGEL 1354

Query: 599  VWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
            + ++L LS  ER E+ KKIG+  + ILD+L + + +++HF
Sbjct: 1355 LNRYLYLSTMERSELAKKIGTTPDIILDDLLETDRVTAHF 1394


>gi|255918233|ref|NP_001157645.1| cleavage and polyadenylation specificity factor subunit 1 isoform 1
            [Mus musculus]
          Length = 1450

 Score =  617 bits (1590), Expect = e-174,   Method: Compositional matrix adjust.
 Identities = 307/637 (48%), Positives = 407/637 (63%), Gaps = 61/637 (9%)

Query: 2    GNFRSHSPSAMDET-IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGALK 56
            G  R    +   E  +V+E+L V+LG   +RP LLV    ELLIY+AF H     +G LK
Sbjct: 828  GEVRKEEATRQGELPLVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLK 887

Query: 57   LRFKKL---------KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPH 107
            +RFKK+         K      +++  + + G     R+++ RYF +I GY GVF+CGP 
Sbjct: 888  VRFKKVPHNINFREKKPKPSKKKAEGCSTEEGSGGRGRVARFRYFEDIYGYSGVFICGPS 947

Query: 108  PAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDA 167
            P WL +T RG LR HPM IDGP+ + APFHNVNCPRGFLYFN + ELRISVLP +LSYDA
Sbjct: 948  PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1007

Query: 168  PWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPP 227
            PWPVRK+PL+CT H++AYH+E+K Y + TST  P T   +  GE+KE     RD R+I P
Sbjct: 1008 PWPVRKIPLRCTAHYVAYHVESKVYAVATSTNTPCTRIPRMTGEEKEFEAIERDDRYIHP 1067

Query: 228  LVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNY 287
                F + L SP SWE IP     L EWEHV C+K VS+  E T+SGL+GY+A GT    
Sbjct: 1068 QQEAFSIQLISPVSWEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQ 1127

Query: 288  SEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQK 347
             E+VTCRGRIL+ D+IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH  G LV+A+GQK
Sbjct: 1128 GEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQK 1187

Query: 348  IYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARD 407
            I++W L+ ++LTG+AFIDT++YI  M+SVKN IL  D  +SI+LLRYQ E +TLSLV+RD
Sbjct: 1188 IFLWSLRASELTGMAFIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRD 1247

Query: 408  YKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGF 467
             KP +  S  +   N                                        + +GF
Sbjct: 1248 AKPLEVYSVDFMVDN----------------------------------------AQLGF 1267

Query: 468  MISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARS 527
            ++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF++  C+ +  ++ P  +S
Sbjct: 1268 LVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGA--AEGPSKKS 1325

Query: 528  -----RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGK 582
                 + +TW+A+LDG +G  LP+ EK YRRLLMLQN + T   H  GLNPRAFR     
Sbjct: 1326 VVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVD 1385

Query: 583  GYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGS 619
                 N  R ++DG L+ ++L LS  ER E+ KKIG+
Sbjct: 1386 RRILQNAVRNVLDGELLNRYLYLSTMERSELAKKIGT 1422


>gi|47217773|emb|CAG05995.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 1446

 Score =  613 bits (1582), Expect = e-173,   Method: Compositional matrix adjust.
 Identities = 313/672 (46%), Positives = 429/672 (63%), Gaps = 89/672 (13%)

Query: 16   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAF----RHPKGALKLRFKKL--------- 62
            +V+E+  VSLG + +RP LLV  + ELL+Y+AF    + P+  LK+RFKK+         
Sbjct: 815  LVKEVTLVSLGYNHSRPYLLVHVEQELLVYEAFPYDQQQPQNNLKVRFKKVPHNINFREK 874

Query: 63   -KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRA 121
               L    +++ A  + G+    RIS+ RYF +I+GY GVF+CGP P W+ +TSRG LR 
Sbjct: 875  KSKLRKDKKAEGAAAEDGVAARGRISRFRYFEDISGYSGVFICGPSPHWMLVTSRGALRL 934

Query: 122  HPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPH 181
            HPMTIDGP+ + +PFHN+NCP+GFLYFN + ELRISVLPT+LSYDAPWPVRK+PL+CT H
Sbjct: 935  HPMTIDGPIESFSPFHNINCPKGFLYFNKQGELRISVLPTYLSYDAPWPVRKIPLRCTVH 994

Query: 182  FLAYHLETKT-------YCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHV 234
            +++YH+E+K        Y + TS  E  T   +  GE+KE  T  RD R+I P   +F +
Sbjct: 995  YVSYHVESKASLSHCCVYAVCTSVKELCTRIPRMTGEEKEYETIERDERYINPQQDKFSI 1054

Query: 235  SLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCR 294
             L SP SWE IP T   L EWE+V C+K V++  + T+SGL+GYIA GT     E+VTCR
Sbjct: 1055 QLISPVSWEAIPNTRIDLEEWEYVTCMKTVALRSQETVSGLKGYIAAGTCLMQGEEVTCR 1114

Query: 295  GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLK 354
            GRIL+ D+IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH  G+LV+A+GQKI++W LK
Sbjct: 1115 GRILILDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGYLVSAIGQKIFLWVLK 1174

Query: 355  DNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPN 414
            DNDLTG+AFIDT++YI  M+S+KN IL  D  +S++LLRYQ E +TLSLV+RD KP +  
Sbjct: 1175 DNDLTGMAFIDTQLYIHQMMSIKNFILAADLMKSVSLLRYQEESKTLSLVSRDAKPLEVY 1234

Query: 415  SKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDK 474
            S  +   N                                        S +GF++SD+DK
Sbjct: 1235 SIEFMVDN----------------------------------------SQLGFLVSDRDK 1254

Query: 475  NVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARS---RFLT 531
            N+ ++MY PEA+ES GG RL+++ DF+ G ++NTF+++ C+ +  + +  A +   + +T
Sbjct: 1255 NLYVYMYLPEAKESFGGMRLLRRADFNAGANINTFWRMPCRGALEAGSRKAMTWDNKHIT 1314

Query: 532  WYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAG---- 587
            W+A+LDG +G  LP+ EK YRRLLMLQN + T  SH  GLNP+AFR        A     
Sbjct: 1315 WFATLDGGVGLLLPMQEKTYRRLLMLQNALTTMLSHHAGLNPKAFRCVGADRTSAAMLSG 1374

Query: 588  ---------------------NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILD 626
                                 NP + I+DG L+ K+L LS+ ER E+ KKIG+  + ILD
Sbjct: 1375 MLPDFATSVSRMLHCDRRSLQNPVKNILDGELLNKYLYLSMMERSELAKKIGTTQDIILD 1434

Query: 627  ELYDIEALSSHF 638
            +L DI+ +++HF
Sbjct: 1435 DLLDIDRVTAHF 1446


>gi|198457226|ref|XP_001360595.2| GA10080 [Drosophila pseudoobscura pseudoobscura]
 gi|198135905|gb|EAL25170.2| GA10080 [Drosophila pseudoobscura pseudoobscura]
          Length = 1459

 Score =  613 bits (1581), Expect = e-173,   Method: Compositional matrix adjust.
 Identities = 306/655 (46%), Positives = 418/655 (63%), Gaps = 62/655 (9%)

Query: 1    MGNFRSHSPSAMDETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGALKLRFK 60
            +G  +S  P   +  +  EL  V LGL+G RP+L+VRT+ ELLIYQ FR+PKG LK+RF+
Sbjct: 850  VGIVQSCMPQHANSPLPLELSLVGLGLNGERPVLMVRTRVELLIYQVFRYPKGNLKIRFR 909

Query: 61   KLKVLFVSDRS---------------KRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCG 105
            KL+ L + D+                +  N QP       + ++R FSN+ G  G+ +CG
Sbjct: 910  KLEQLNLLDQQPSHIELEENDEEEELESYNMQPKY-----VQKLRPFSNVGGLAGIMVCG 964

Query: 106  PHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSY 165
             +P ++FLT+RGELR H +  +G V + A F+NVN P GFLYF+   EL+ISVLP++LSY
Sbjct: 965  VNPCFVFLTARGELRIHRLQGNGDVRSFAAFNNVNIPNGFLYFDTTFELKISVLPSYLSY 1024

Query: 166  DAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFI 225
            D+ WPVRKVPL+CTP  L YH E + YC++T T EP T YY+FNGEDKEL  + R  RFI
Sbjct: 1025 DSVWPVRKVPLRCTPRQLVYHRENRVYCLITQTEEPMTKYYRFNGEDKELSEESRGERFI 1084

Query: 226  PPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNY 285
             P  SQF + L SP +WE +P  +     WEHV   K V + YEGT SGL+ Y+ +GTN+
Sbjct: 1085 YPNGSQFEMVLISPETWEIVPDASIRFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNF 1144

Query: 286  NYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVG 345
            NYSED+T RG I ++DIIEVVPEPG+P+TK K+K ++ KEQKGPV+AI  V GFLVT +G
Sbjct: 1145 NYSEDITSRGNIHIYDIIEVVPEPGKPMTKFKLKEVFKKEQKGPVSAISDVLGFLVTGLG 1204

Query: 346  QKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVA 405
            QKIYIWQL+D DL G+AFIDT +Y+  +++VK+LI + D  +SI+LLR+Q E+RTLSL +
Sbjct: 1205 QKIYIWQLRDGDLIGVAFIDTNIYVHQIITVKSLIFIADVYKSISLLRFQEEHRTLSLAS 1264

Query: 406  RDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSM 465
            RD+ P +     +   N                                        S++
Sbjct: 1265 RDFNPLEVYGIEFMVDN----------------------------------------SNL 1284

Query: 466  GFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPG- 524
            GF+++D ++N++++MYQPEARES GG +LI+K D+HLGQ VNT F+++C    +      
Sbjct: 1285 GFLVTDAERNLIVYMYQPEARESLGGQKLIRKADYHLGQVVNTMFRVQCHQRGVHQRQPF 1344

Query: 525  -ARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKG 583
               ++    Y +LDG LG+ LPLPEK YRR LMLQNV++++  H  GLNP+ FRT K   
Sbjct: 1345 LYENKHFVVYGTLDGGLGYCLPLPEKVYRRFLMLQNVLLSYQDHLCGLNPKEFRTLKSFK 1404

Query: 584  YYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
                NPSR IIDG L+W +  L   +R E+ KKIG++  +IL +L +IE LS  F
Sbjct: 1405 KQGLNPSRCIIDGDLIWSYRLLPNSDRNEVAKKIGTRTEEILSDLLEIERLSGVF 1459


>gi|195150431|ref|XP_002016158.1| GL10645 [Drosophila persimilis]
 gi|194110005|gb|EDW32048.1| GL10645 [Drosophila persimilis]
          Length = 1459

 Score =  613 bits (1581), Expect = e-173,   Method: Compositional matrix adjust.
 Identities = 306/655 (46%), Positives = 418/655 (63%), Gaps = 62/655 (9%)

Query: 1    MGNFRSHSPSAMDETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGALKLRFK 60
            +G  +S  P   +  +  EL  V LGL+G RP+L+VRT+ ELLIYQ FR+PKG LK+RF+
Sbjct: 850  VGIVQSCMPQHANSPLPLELSLVGLGLNGERPVLMVRTRVELLIYQVFRYPKGNLKIRFR 909

Query: 61   KLKVLFVSDRS---------------KRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCG 105
            KL+ L + D+                +  N QP       + ++R FSN+ G  G+ +CG
Sbjct: 910  KLEQLNLLDQQPSHIELEENDEEEELESYNMQPKY-----VQKLRPFSNVGGLAGIMVCG 964

Query: 106  PHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSY 165
             +P ++FLT+RGELR H +  +G V + A F+NVN P GFLYF+   EL+ISVLP++LSY
Sbjct: 965  VNPCFVFLTARGELRIHRLQGNGDVRSFAAFNNVNIPNGFLYFDTTFELKISVLPSYLSY 1024

Query: 166  DAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFI 225
            D+ WPVRKVPL+CTP  L YH E + YC++T T EP T YY+FNGEDKEL  + R  RFI
Sbjct: 1025 DSVWPVRKVPLRCTPRQLVYHRENRVYCLITQTEEPMTKYYRFNGEDKELSEESRGERFI 1084

Query: 226  PPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNY 285
             P  SQF + L SP +WE +P  +     WEHV   K V + YEGT SGL+ Y+ +GTN+
Sbjct: 1085 YPNGSQFEMVLISPETWEIVPDASIRFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNF 1144

Query: 286  NYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVG 345
            NYSED+T RG I ++DIIEVVPEPG+P+TK K+K ++ KEQKGPV+AI  V GFLVT +G
Sbjct: 1145 NYSEDITSRGNIHIYDIIEVVPEPGKPMTKFKLKEVFKKEQKGPVSAISDVLGFLVTGLG 1204

Query: 346  QKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVA 405
            QKIYIWQL+D DL G+AFIDT +Y+  +++VK+LI + D  +SI+LLR+Q E+RTLSL +
Sbjct: 1205 QKIYIWQLRDGDLIGVAFIDTNIYVHQIITVKSLIFIADVYKSISLLRFQEEHRTLSLAS 1264

Query: 406  RDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSM 465
            RD+ P +     +   N                                        S++
Sbjct: 1265 RDFNPLEVYGIEFMVDN----------------------------------------SNL 1284

Query: 466  GFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPG- 524
            GF+++D ++N++++MYQPEARES GG +LI+K D+HLGQ VNT F+++C    +      
Sbjct: 1285 GFLVTDAERNLIVYMYQPEARESLGGQKLIRKADYHLGQVVNTMFRVQCHQRGVHQRQPF 1344

Query: 525  -ARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKG 583
               ++    Y +LDG LG+ LPLPEK YRR LMLQNV++++  H  GLNP+ FRT K   
Sbjct: 1345 LYENKHFVVYGTLDGGLGYCLPLPEKVYRRFLMLQNVLLSYQDHLCGLNPKEFRTLKSFK 1404

Query: 584  YYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
                NPSR IIDG L+W +  L   +R E+ KKIG++  +IL +L +IE LS  F
Sbjct: 1405 KQGLNPSRCIIDGDLIWSYRLLPNSDRNEVAKKIGTRTEEILSDLLEIERLSGVF 1459


>gi|443684051|gb|ELT88095.1| hypothetical protein CAPTEDRAFT_161045 [Capitella teleta]
          Length = 1410

 Score =  613 bits (1581), Expect = e-173,   Method: Compositional matrix adjust.
 Identities = 294/648 (45%), Positives = 424/648 (65%), Gaps = 59/648 (9%)

Query: 2    GNFRSHSPSAMDETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRH----PKGALKL 57
             +F +   S  +   V E++    G++G++PLL+ R   EL IY+ F H     KG L++
Sbjct: 811  ASFVAPERSTQEVPFVHEVMLHGFGVNGSQPLLMARVHDELYIYKVFSHVGSKAKGRLQV 870

Query: 58   RFKKLK---VLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLT 114
            RFK+     ++   DR ++  E            +R F++I+GY GVF+CG +P WL +T
Sbjct: 871  RFKRRSHGLIIRPRDREEKIPENK--------KWLRPFTDISGYSGVFICGSYPHWLIMT 922

Query: 115  SRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKV 174
             RG LR HPM IDG +     FHNVNCP+GFLYF++  ELRI VLPTHLSYDAPWPVRKV
Sbjct: 923  QRGTLRGHPMAIDGTIPCFTAFHNVNCPKGFLYFSSNEELRICVLPTHLSYDAPWPVRKV 982

Query: 175  PLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGE-DKELVTDPRDSRFIPPLVSQFH 233
            PL+CTPHF+ YH ++KTY +V+S   P T   +  G+ +KE+    +D RF+ P++++F+
Sbjct: 983  PLRCTPHFVVYHPDSKTYSVVSSQQVPCTQLVRVAGDGEKEIEAVQKDDRFVFPIMNKFN 1042

Query: 234  VSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTC 293
            + LFSP SWE IP T F L EWEHV+C+K ++++ EGTLSGL+GY+ +GTN NY+EDV+ 
Sbjct: 1043 IQLFSPVSWEPIPNTRFDLEEWEHVMCIKTINLKSEGTLSGLKGYVVVGTNLNYNEDVSS 1102

Query: 294  RGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQL 353
            RG++ ++D+I+VVPEPGQPLTKNKIK++Y KEQKGPVTA+  V GFLVTA+GQK+YIWQL
Sbjct: 1103 RGKLTIYDVIDVVPEPGQPLTKNKIKVVYNKEQKGPVTALDGVQGFLVTAIGQKVYIWQL 1162

Query: 354  KDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQP 413
            KDNDL GIAFIDT++YI  M ++KNLI++GD  +SI++LRYQ + + LSLV++D +P   
Sbjct: 1163 KDNDLAGIAFIDTQIYIHKMEALKNLIIIGDVCKSISVLRYQEDMKVLSLVSKDVRPLAV 1222

Query: 414  NSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKD 473
                Y                                       ++DE +S+ F+++DK 
Sbjct: 1223 YGVAY---------------------------------------LVDE-TSLAFIVADKL 1242

Query: 474  KNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARS---RFL 530
            KN +++ YQP+  +S GG RLI+K D ++G  VN FF+++C+ S  S +   +S   + +
Sbjct: 1243 KNFLVYCYQPDLVQSQGGQRLIRKADINIGSLVNAFFRVKCRVSDPSTSKTDQSLAMKHI 1302

Query: 531  TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPS 590
            T+Y +LDG++G+ LP+ E  YRRL MLQ +++     T GLNP+A+RT + +     N  
Sbjct: 1303 TYYVTLDGSIGYLLPISESLYRRLYMLQKMLIQQVQQTAGLNPKAYRTCQTEFRQLINIQ 1362

Query: 591  RGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
            R IIDG L WK+L L+  +R E+ K+IG+  + I D+L +I+  + HF
Sbjct: 1363 RNIIDGDLAWKYLALTSHDRAEMAKRIGTTSHQIEDDLLEIDRCTCHF 1410


>gi|9794908|gb|AAF98388.1| cleavage and polyadenylation specificity factor [Drosophila
           melanogaster]
          Length = 813

 Score =  608 bits (1568), Expect = e-171,   Method: Compositional matrix adjust.
 Identities = 310/651 (47%), Positives = 423/651 (64%), Gaps = 56/651 (8%)

Query: 2   GNFRSHSPSAMDETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGALKLRFKK 61
           G  ++  P   +  +  EL  + LGL+G RPLLLVRT+ ELLIYQ FR+PKG LK+RF+K
Sbjct: 205 GIVQACMPQHANSPLPLELSVIGLGLNGERPLLLVRTRVELLIYQVFRYPKGHLKIRFRK 264

Query: 62  LKVLFVSDRS------KRANEQPGL------PRGVRISQMRYFSNIAGYQGVFLCGPHPA 109
           L  L + D+          +EQ  +      P+ V+  ++R F+N+ G  GV +CG +P 
Sbjct: 265 LDQLNLLDQQPTHIELDENDEQEEIESYQMQPKYVQ--KLRPFANVGGLSGVMVCGVNPC 322

Query: 110 WLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPW 169
           ++FLT RGELR H +  +G V + A F+NVN P GFLYF+   EL+ISVLP++LSYD+ W
Sbjct: 323 FVFLTFRGELRIHRLLGNGDVRSFAAFNNVNIPNGFLYFDTTYELKISVLPSYLSYDSVW 382

Query: 170 PVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLV 229
           PVRKVPL+CTP  L YH E + YC++T T EP T YY+FNGEDKEL  + RD RFI P+ 
Sbjct: 383 PVRKVPLRCTPRQLVYHRENRVYCLITQTEEPMTKYYRFNGEDKELSEESRDERFIYPIG 442

Query: 230 SQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSE 289
           SQF + L SP +WE +P  +     WEHV   K V + YEGT SGL+ Y+ +GTN+NYSE
Sbjct: 443 SQFEMVLISPETWEIVPDASITFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNYSE 502

Query: 290 DVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIY 349
           D+T RG I ++DIIEVVPEPG+P+TK KIK I+ KEQKGPV+AI  V GFLVT +GQKIY
Sbjct: 503 DITSRGNIHIYDIIEVVPEPGKPMTKFKIKEIFKKEQKGPVSAISDVLGFLVTGLGQKIY 562

Query: 350 IWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYK 409
           IWQL+D DL G+AFIDT +Y+  +++VK+LI + D  +SI+LLR+Q EYRTLSL +RD+ 
Sbjct: 563 IWQLRDGDLIGVAFIDTNIYVHQIITVKSLIFIADVYKSISLLRFQEEYRTLSLASRDFN 622

Query: 410 PTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMI 469
           P +     +   N                                        S++GF++
Sbjct: 623 PLEVYGIEFMVDN----------------------------------------SNLGFLV 642

Query: 470 SDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPG--ARS 527
           +D ++N++++MYQPEARES GG +L++K D+HLGQ VNT F+++C    +         +
Sbjct: 643 TDAERNIIVYMYQPEARESLGGQKLLRKADYHLGQVVNTMFRVQCHQKGLHQRQPFLYEN 702

Query: 528 RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAG 587
           +    Y +LDGALG+ LPLPEK YRR LMLQNV++++  H  GLNP+ +RT K       
Sbjct: 703 KHFVVYGTLDGALGYCLPLPEKVYRRFLMLQNVLLSYQEHLCGLNPKEYRTLKSSKKQGI 762

Query: 588 NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
           NPSR IIDG L+W +  ++  ER E+ KKIG++  +IL +L +IE L+S F
Sbjct: 763 NPSRCIIDGDLIWSYRLMANSERNEVAKKIGTRTEEILGDLLEIERLASVF 813


>gi|24653655|ref|NP_725397.1| cleavage and polyadenylation specificity factor 160, isoform B
            [Drosophila melanogaster]
 gi|15292103|gb|AAK93320.1| LD38533p [Drosophila melanogaster]
 gi|21627189|gb|AAM68553.1| cleavage and polyadenylation specificity factor 160, isoform B
            [Drosophila melanogaster]
          Length = 1420

 Score =  607 bits (1564), Expect = e-171,   Method: Compositional matrix adjust.
 Identities = 308/651 (47%), Positives = 422/651 (64%), Gaps = 56/651 (8%)

Query: 2    GNFRSHSPSAMDETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGALKLRFKK 61
            G  ++  P   +  +  EL  + LGL+G RPLLLVRT+ ELLIYQ FR+PKG LK+RF+K
Sbjct: 812  GIVQACMPQHANSPLPLELSVIGLGLNGERPLLLVRTRVELLIYQVFRYPKGHLKIRFRK 871

Query: 62   LKVLFVSDRS------KRANEQPGL------PRGVRISQMRYFSNIAGYQGVFLCGPHPA 109
            +  L + D+          +EQ  +      P+ V+  ++R F+N+ G  GV +CG +P 
Sbjct: 872  MDQLNLLDQQPTHIDLDENDEQEEIESYQMQPKYVQ--KLRPFANVGGLSGVMVCGVNPC 929

Query: 110  WLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPW 169
            ++FLT RGELR H +  +G V + A F+NVN P GFLYF+   EL+ISVLP++LSYD+ W
Sbjct: 930  FVFLTFRGELRIHRLLGNGDVRSFAAFNNVNIPNGFLYFDTTYELKISVLPSYLSYDSVW 989

Query: 170  PVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLV 229
            PVRKVPL+CTP  L YH E + YC++T T EP T YY+FNGEDKEL  + R  RFI P+ 
Sbjct: 990  PVRKVPLRCTPRQLVYHRENRVYCLITQTEEPMTKYYRFNGEDKELSEESRGERFIYPIG 1049

Query: 230  SQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSE 289
            SQF + L SP +WE +P  +     WEHV   K V + YEGT SGL+ Y+ +GTN+NYSE
Sbjct: 1050 SQFEMVLISPETWEIVPDASITFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNYSE 1109

Query: 290  DVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIY 349
            D+T RG I ++DIIEVVPEPG+P+TK KIK I+ KEQKGPV+AI  V GFLVT +GQKIY
Sbjct: 1110 DITSRGNIHIYDIIEVVPEPGKPMTKFKIKEIFKKEQKGPVSAISDVLGFLVTGLGQKIY 1169

Query: 350  IWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYK 409
            IWQL+D DL G+AFIDT +Y+  +++VK+LI + D  +SI+LLR+Q EYRTLSL +RD+ 
Sbjct: 1170 IWQLRDGDLIGVAFIDTNIYVHQIITVKSLIFIADVYKSISLLRFQEEYRTLSLASRDFN 1229

Query: 410  PTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMI 469
            P +     +   N                                        S++GF++
Sbjct: 1230 PLEVYGIEFMVDN----------------------------------------SNLGFLV 1249

Query: 470  SDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPG--ARS 527
            +D ++N++++MYQPEARES GG +L++K D+HLGQ VNT F+++C    +         +
Sbjct: 1250 TDAERNIIVYMYQPEARESLGGQKLLRKADYHLGQVVNTMFRVQCHQKGLHQRQPFLYEN 1309

Query: 528  RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAG 587
            +    Y +LDGALG+ LPLPEK YRR LMLQNV++++  H  GLNP+ +RT K       
Sbjct: 1310 KHFVVYGTLDGALGYCLPLPEKVYRRFLMLQNVLLSYQEHLCGLNPKEYRTLKSSKKQGI 1369

Query: 588  NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
            NPSR IIDG L+W +  ++  ER E+ KKIG++  +IL +L +IE L+S F
Sbjct: 1370 NPSRCIIDGDLIWSYRLMANSERNEVAKKIGTRTEEILGDLLEIERLASVF 1420


>gi|195334368|ref|XP_002033855.1| GM20208 [Drosophila sechellia]
 gi|194125825|gb|EDW47868.1| GM20208 [Drosophila sechellia]
          Length = 1455

 Score =  606 bits (1563), Expect = e-170,   Method: Compositional matrix adjust.
 Identities = 309/651 (47%), Positives = 422/651 (64%), Gaps = 56/651 (8%)

Query: 2    GNFRSHSPSAMDETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGALKLRFKK 61
            G  ++  P   +  +  EL  + LGL+G RPLLLVRT+ ELLIYQ FR+PKG LK+RF+K
Sbjct: 847  GIVQACMPQHANSPLPLELSVIGLGLNGERPLLLVRTRVELLIYQVFRYPKGHLKIRFRK 906

Query: 62   LKVLFVSDRS------KRANEQPGL------PRGVRISQMRYFSNIAGYQGVFLCGPHPA 109
            L  L + D+          +EQ  +      P+ V+  ++R F+N+ G  GV +CG +P 
Sbjct: 907  LDQLNLLDQQPTHIELDENDEQEEIESYQMQPKYVQ--KLRPFANVGGLSGVMVCGVNPC 964

Query: 110  WLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPW 169
            ++FLT RGELR H +  +G V + A F+NVN P GFLYF+   EL+ISVLP++LSYD+ W
Sbjct: 965  FVFLTFRGELRIHRLLGNGDVRSFAAFNNVNIPNGFLYFDTTYELKISVLPSYLSYDSIW 1024

Query: 170  PVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLV 229
            PVRKVPL+CTP  L YH E + YC++T T EP T YY+FNGEDKEL  + R  RFI P+ 
Sbjct: 1025 PVRKVPLRCTPRQLVYHRENRVYCLITQTEEPMTKYYRFNGEDKELSEESRGERFIYPIG 1084

Query: 230  SQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSE 289
            SQF + L SP +WE +P  +     WEHV   K V + YEGT SGL+ Y+ +GTN+NYSE
Sbjct: 1085 SQFEMVLISPETWEIVPDASITFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNYSE 1144

Query: 290  DVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIY 349
            D+T RG I ++DIIEVVPEPG+P+TK KIK I+ KEQKGPV+AI  V GFLVT +GQKIY
Sbjct: 1145 DITSRGNIHIYDIIEVVPEPGKPMTKFKIKEIFKKEQKGPVSAISDVLGFLVTGLGQKIY 1204

Query: 350  IWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYK 409
            IWQL+D DL G+AFIDT +Y+  +++VK+LI + D  +SI+LLR+Q EYRTLSL +RD+ 
Sbjct: 1205 IWQLRDGDLIGVAFIDTNIYVHQIITVKSLIFIADVYKSISLLRFQEEYRTLSLASRDFN 1264

Query: 410  PTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMI 469
            P +     +   N                                        S++GF++
Sbjct: 1265 PLEVYGIEFMVDN----------------------------------------SNLGFLV 1284

Query: 470  SDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPG--ARS 527
            +D ++N++++MYQPEARES GG +L++K D+HLGQ VNT F+++C    +         +
Sbjct: 1285 TDAERNLIVYMYQPEARESLGGQKLLRKADYHLGQVVNTMFRVQCHQKGLHQRQPFLYEN 1344

Query: 528  RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAG 587
            +    Y +LDGALG+ LPLPEK YRR LMLQNV++++  H  GLNP+ +RT K       
Sbjct: 1345 KHFVVYGTLDGALGYCLPLPEKVYRRFLMLQNVLLSYQEHLCGLNPKEYRTLKSSKKQGI 1404

Query: 588  NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
            NPSR IIDG L+W +  ++  ER E+ KKIG++  +IL +L +IE L+S F
Sbjct: 1405 NPSRCIIDGDLIWSYRLMANSERNEVAKKIGTRTEEILGDLLEIERLASVF 1455


>gi|45552619|ref|NP_995833.1| cleavage and polyadenylation specificity factor 160, isoform A
            [Drosophila melanogaster]
 gi|18203551|sp|Q9V726.1|CPSF1_DROME RecName: Full=Cleavage and polyadenylation specificity factor subunit
            1; AltName: Full=Cleavage and polyadenylation specificity
            factor 160 kDa subunit; Short=CPSF 160 kDa subunit;
            Short=dCPSF 160
 gi|7303176|gb|AAF58240.1| cleavage and polyadenylation specificity factor 160, isoform A
            [Drosophila melanogaster]
          Length = 1455

 Score =  606 bits (1563), Expect = e-170,   Method: Compositional matrix adjust.
 Identities = 308/651 (47%), Positives = 422/651 (64%), Gaps = 56/651 (8%)

Query: 2    GNFRSHSPSAMDETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGALKLRFKK 61
            G  ++  P   +  +  EL  + LGL+G RPLLLVRT+ ELLIYQ FR+PKG LK+RF+K
Sbjct: 847  GIVQACMPQHANSPLPLELSVIGLGLNGERPLLLVRTRVELLIYQVFRYPKGHLKIRFRK 906

Query: 62   LKVLFVSDRS------KRANEQPGL------PRGVRISQMRYFSNIAGYQGVFLCGPHPA 109
            +  L + D+          +EQ  +      P+ V+  ++R F+N+ G  GV +CG +P 
Sbjct: 907  MDQLNLLDQQPTHIDLDENDEQEEIESYQMQPKYVQ--KLRPFANVGGLSGVMVCGVNPC 964

Query: 110  WLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPW 169
            ++FLT RGELR H +  +G V + A F+NVN P GFLYF+   EL+ISVLP++LSYD+ W
Sbjct: 965  FVFLTFRGELRIHRLLGNGDVRSFAAFNNVNIPNGFLYFDTTYELKISVLPSYLSYDSVW 1024

Query: 170  PVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLV 229
            PVRKVPL+CTP  L YH E + YC++T T EP T YY+FNGEDKEL  + R  RFI P+ 
Sbjct: 1025 PVRKVPLRCTPRQLVYHRENRVYCLITQTEEPMTKYYRFNGEDKELSEESRGERFIYPIG 1084

Query: 230  SQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSE 289
            SQF + L SP +WE +P  +     WEHV   K V + YEGT SGL+ Y+ +GTN+NYSE
Sbjct: 1085 SQFEMVLISPETWEIVPDASITFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNYSE 1144

Query: 290  DVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIY 349
            D+T RG I ++DIIEVVPEPG+P+TK KIK I+ KEQKGPV+AI  V GFLVT +GQKIY
Sbjct: 1145 DITSRGNIHIYDIIEVVPEPGKPMTKFKIKEIFKKEQKGPVSAISDVLGFLVTGLGQKIY 1204

Query: 350  IWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYK 409
            IWQL+D DL G+AFIDT +Y+  +++VK+LI + D  +SI+LLR+Q EYRTLSL +RD+ 
Sbjct: 1205 IWQLRDGDLIGVAFIDTNIYVHQIITVKSLIFIADVYKSISLLRFQEEYRTLSLASRDFN 1264

Query: 410  PTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMI 469
            P +     +   N                                        S++GF++
Sbjct: 1265 PLEVYGIEFMVDN----------------------------------------SNLGFLV 1284

Query: 470  SDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPG--ARS 527
            +D ++N++++MYQPEARES GG +L++K D+HLGQ VNT F+++C    +         +
Sbjct: 1285 TDAERNIIVYMYQPEARESLGGQKLLRKADYHLGQVVNTMFRVQCHQKGLHQRQPFLYEN 1344

Query: 528  RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAG 587
            +    Y +LDGALG+ LPLPEK YRR LMLQNV++++  H  GLNP+ +RT K       
Sbjct: 1345 KHFVVYGTLDGALGYCLPLPEKVYRRFLMLQNVLLSYQEHLCGLNPKEYRTLKSSKKQGI 1404

Query: 588  NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
            NPSR IIDG L+W +  ++  ER E+ KKIG++  +IL +L +IE L+S F
Sbjct: 1405 NPSRCIIDGDLIWSYRLMANSERNEVAKKIGTRTEEILGDLLEIERLASVF 1455


>gi|194883064|ref|XP_001975624.1| GG22421 [Drosophila erecta]
 gi|190658811|gb|EDV56024.1| GG22421 [Drosophila erecta]
          Length = 1455

 Score =  606 bits (1562), Expect = e-170,   Method: Compositional matrix adjust.
 Identities = 309/651 (47%), Positives = 421/651 (64%), Gaps = 56/651 (8%)

Query: 2    GNFRSHSPSAMDETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGALKLRFKK 61
            G  ++  P   +  +  EL    LGL+G RPLL+VRT+ ELLIYQ FR+PKG LK+RF+K
Sbjct: 847  GIVQACMPQHANSPLPLELSLTGLGLNGERPLLMVRTRVELLIYQVFRYPKGHLKIRFRK 906

Query: 62   LKVLFVSDRS------KRANEQPGL------PRGVRISQMRYFSNIAGYQGVFLCGPHPA 109
            L  L + D+          +EQ  +      P+ V+  ++R F+N+ G  GV +CG +P 
Sbjct: 907  LDQLNLLDQQPTHIELDENDEQEDIESYQMQPKYVQ--KLRPFANVGGLSGVMVCGVNPC 964

Query: 110  WLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPW 169
            ++FLT RGELR H +  +G V + A F+NVN P GFLYF+   EL+ISVLP++LSYD+ W
Sbjct: 965  FVFLTFRGELRIHRLLGNGDVRSFAAFNNVNIPNGFLYFDTTYELKISVLPSYLSYDSTW 1024

Query: 170  PVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLV 229
            PVRKVPL+CTP  L YH E + YC++T T EP T YY+FNGEDKEL  + R  RFI P+ 
Sbjct: 1025 PVRKVPLRCTPRQLVYHRENRVYCLITQTEEPMTKYYRFNGEDKELSEESRGERFIYPIG 1084

Query: 230  SQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSE 289
            SQF + L SP +WE +P  +     WEHV   K V + YEGT SGL+ Y+ +GTN+NYSE
Sbjct: 1085 SQFEMVLISPETWEIVPDASISFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNYSE 1144

Query: 290  DVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIY 349
            D+T RG I ++DIIEVVPEPG+P+TK KIK I+ KEQKGPV+AI  V GFLVT +GQKIY
Sbjct: 1145 DITSRGNIHIYDIIEVVPEPGKPMTKFKIKEIFKKEQKGPVSAISDVLGFLVTGLGQKIY 1204

Query: 350  IWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYK 409
            IWQL+D DL G+AFIDT +Y+  +++VK+LI + D  +SI+LLR+Q EYRTLSL +RD+ 
Sbjct: 1205 IWQLRDGDLIGVAFIDTNIYVHQIITVKSLIFIADVYKSISLLRFQEEYRTLSLASRDFN 1264

Query: 410  PTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMI 469
            P +     +   N                                        S++GF++
Sbjct: 1265 PLEVYGIEFMVDN----------------------------------------SNLGFLV 1284

Query: 470  SDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPG--ARS 527
            +D ++N++++MYQPEARES GG +L++K D+HLGQ VNT F+++C    +         +
Sbjct: 1285 TDAERNLIVYMYQPEARESLGGQKLLRKADYHLGQVVNTMFRVQCHQKGLHQRQPFLYEN 1344

Query: 528  RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAG 587
            +    Y +LDGALG+ LPLPEK YRR LMLQNV+V++  H  GLNP+ +RT K       
Sbjct: 1345 KHFVVYGTLDGALGYCLPLPEKVYRRFLMLQNVLVSYQEHLCGLNPKEYRTLKSFKKQGI 1404

Query: 588  NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
            NPSR IIDG L+W +  ++  ER E+ KKIG++  +IL +L +IE L+S F
Sbjct: 1405 NPSRCIIDGDLIWSYRLMANSERNEVAKKIGTRTEEILGDLLEIERLASVF 1455


>gi|195485994|ref|XP_002091320.1| GE12310 [Drosophila yakuba]
 gi|194177421|gb|EDW91032.1| GE12310 [Drosophila yakuba]
          Length = 1455

 Score =  603 bits (1556), Expect = e-170,   Method: Compositional matrix adjust.
 Identities = 308/651 (47%), Positives = 421/651 (64%), Gaps = 56/651 (8%)

Query: 2    GNFRSHSPSAMDETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGALKLRFKK 61
            G  ++  P   +  +  EL  + LGL+G RPLLLVRT+ ELLIYQ FR+PKG LK+RF+K
Sbjct: 847  GIVQACMPQHANSPLPLELSVIGLGLNGERPLLLVRTRVELLIYQVFRYPKGHLKIRFRK 906

Query: 62   LKVLFVSDRS--------KRANEQ----PGLPRGVRISQMRYFSNIAGYQGVFLCGPHPA 109
            L  L + D+           A E+       P+ V+  ++R F+N+ G  GV +CG +P 
Sbjct: 907  LDQLNLLDQQPTHIELDENDAQEEIESYQMQPKYVQ--KLRPFANVGGLSGVMVCGVNPC 964

Query: 110  WLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPW 169
            ++FLT RGELR H +  +G V + A F+NVN P GFLYF+   EL+ISVLP++LSYD+ W
Sbjct: 965  FVFLTFRGELRIHRLLGNGDVRSFAAFNNVNIPNGFLYFDTTYELKISVLPSYLSYDSTW 1024

Query: 170  PVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLV 229
            PVRKVPL+CTP  L YH E + YC++T T EP T YY+FNGEDKEL  + R  RFI P+ 
Sbjct: 1025 PVRKVPLRCTPRQLVYHRENRVYCLITQTEEPMTKYYRFNGEDKELSEESRGERFIYPIG 1084

Query: 230  SQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSE 289
            SQF + L SP +WE +P  +     WEHV   K V + YEGT SGL+ Y+ +GTN+NYSE
Sbjct: 1085 SQFEMVLISPETWEIVPDASISFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNYSE 1144

Query: 290  DVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIY 349
            D+T RG I ++DIIEVVPEPG+P+TK KIK I+ KEQKGPV+AI  V GFLVT +GQKIY
Sbjct: 1145 DITSRGNIHIYDIIEVVPEPGKPMTKFKIKEIFKKEQKGPVSAISDVLGFLVTGLGQKIY 1204

Query: 350  IWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYK 409
            IWQL+D DL G+AFIDT +Y+  +++VK+LI + D  +SI+LLR+Q EYRTLSL +RD+ 
Sbjct: 1205 IWQLRDGDLIGVAFIDTNIYVHQIITVKSLIFIADVYKSISLLRFQEEYRTLSLASRDFN 1264

Query: 410  PTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMI 469
            P +     +   N                                        S++GF++
Sbjct: 1265 PLEVYGIEFMVDN----------------------------------------SNLGFLV 1284

Query: 470  SDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPG--ARS 527
            +D ++N++++MYQPEARES GG +L++K D+HLGQ VNT F+++C    +         +
Sbjct: 1285 TDAERNLIVYMYQPEARESLGGQKLLRKADYHLGQVVNTMFRVQCHQKGLHQRQPFLYEN 1344

Query: 528  RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAG 587
            +    Y +LDGALG+ LPLPEK YRR LMLQNV++++  H  GLNP+ +RT K       
Sbjct: 1345 KHFVVYGTLDGALGYCLPLPEKVYRRFLMLQNVLLSYQEHLCGLNPKEYRTLKSFKKQGI 1404

Query: 588  NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
            NPSR +IDG L+W +  ++  ER E+ KKIG++  +IL +L +IE L+S F
Sbjct: 1405 NPSRCVIDGDLIWSYRLMANSERNEVAKKIGTRTEEILADLLEIERLASVF 1455


>gi|355698297|gb|EHH28845.1| Cleavage and polyadenylation specificity factor 160 kDa subunit
            [Macaca mulatta]
          Length = 1436

 Score =  599 bits (1544), Expect = e-168,   Method: Compositional matrix adjust.
 Identities = 295/626 (47%), Positives = 399/626 (63%), Gaps = 60/626 (9%)

Query: 16   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGALKLRFKKLKVLFVSDRSKRAN 75
            +V+E+L V+LG   +RP LLV                    + F++ K      +++   
Sbjct: 868  LVKEVLLVALGSRQSRPYLLV-----------------PHNINFREKKPKPSKKKAEGGG 910

Query: 76   EQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAP 135
             + G     R+++ RYF +I GY GVF+CGP P WL +T RG LR HPM IDGPV + AP
Sbjct: 911  TEEGAGARGRVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRLHPMAIDGPVDSFAP 970

Query: 136  FHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIV 195
            FHNVNCPRGFLYFN + ELRISVLP +LSYDAPWPVRK+PL+CT H++AYH+E+K Y + 
Sbjct: 971  FHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVESKVYAVA 1030

Query: 196  TSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEW 255
            TST  P     +  GE+KE  T  RD R+I P    F + L SP SWE IP     L EW
Sbjct: 1031 TSTNTPCARIPRMTGEEKEFETIERDERYIHPQQEAFSIQLISPVSWEAIPNARIELQEW 1090

Query: 256  EHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTK 315
            EHV C+K VS+  E T+SGL+GY+A GT     E+VTCRGRIL+ D+IEVVPEPGQPLTK
Sbjct: 1091 EHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTK 1150

Query: 316  NKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVS 375
            NK K++Y KEQKGPVTA+CH  G LV+A+GQKI++W L+ ++LTG+AFIDT++YI  M+S
Sbjct: 1151 NKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMAFIDTQLYIHQMIS 1210

Query: 376  VKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWK 435
            VKN IL  D  +SI+LLRYQ E +TLSLV+RD KP +  S  +   N             
Sbjct: 1211 VKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDN------------- 1257

Query: 436  FLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLI 495
                                       + +GF++SD+D+N++++MY PEA+ES GG RL+
Sbjct: 1258 ---------------------------AQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLL 1290

Query: 496  KKTDFHLGQHVNTFFKIRCKPSSISDAPGA---RSRFLTWYASLDGALGFFLPLPEKNYR 552
            ++ DFH+G HVNTF++  C+ ++   +  +    ++ +TW+A+LDG +G  LP+ EK YR
Sbjct: 1291 RRADFHVGAHVNTFWRTPCRGATEGLSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYR 1350

Query: 553  RLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLE 612
            RLLMLQN + T   H  GLNPRAFR          N  R ++DG L+ ++L LS  ER E
Sbjct: 1351 RLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELLNRYLYLSTMERSE 1410

Query: 613  ICKKIGSKHNDILDELYDIEALSSHF 638
            + KKIG+  + ILD+L + + +++HF
Sbjct: 1411 LAKKIGTTPDIILDDLLETDRVTAHF 1436


>gi|391328522|ref|XP_003738737.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Metaseiulus occidentalis]
          Length = 1500

 Score =  593 bits (1529), Expect = e-167,   Method: Compositional matrix adjust.
 Identities = 293/649 (45%), Positives = 410/649 (63%), Gaps = 56/649 (8%)

Query: 2    GNFRSHSPSAMDETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAF---RHPKGALKLR 58
            G   S S S      V E+   +LG+H +RPLL  R   EL IY+A+      +G LKL+
Sbjct: 896  GQTTSASTSEAQLPKVMEIFVCALGMHQSRPLLFARVDSELHIYEAYPFVNQKEGHLKLQ 955

Query: 59   FKKLKVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGE 118
            F++L+     +  +   ++ G P  + +  +R F ++ GY GVF+CG  P W+FLT+RGE
Sbjct: 956  FRRLQHAVTMEPRRVYKQKEGDPT-LSLRWIRAFQDVCGYNGVFVCGRRPHWIFLTARGE 1014

Query: 119  LRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKC 178
            LRAHPM  DG + + A FHNVNC +GFL+FN   ELRI  LP++L+YDAPWP+RK+P+  
Sbjct: 1015 LRAHPMLNDGRIYSFATFHNVNCEKGFLFFNKYGELRICALPSYLNYDAPWPMRKIPIYE 1074

Query: 179  TPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDS-RFIPPLVSQFHVSLF 237
            TPH + YH++++TYC+ TS  E +T   K   EDKE     R+S RFIPP V +F + L+
Sbjct: 1075 TPHSVNYHVDSRTYCVATSKEETATCVPKLANEDKEFEPIERESSRFIPPTVDKFALELW 1134

Query: 238  SPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRI 297
            SP SWE IP T  P+ +WE + C+KNV +  EGT SG +G IA+GT +N+ ED+T +GRI
Sbjct: 1135 SPVSWEAIPNTRMPMEDWEKITCVKNVMIASEGTTSGEKGLIAVGTIHNFGEDITAKGRI 1194

Query: 298  LLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDND 357
            LL DIIEVVPEPGQPLT++K+K I +K Q  PVTA+C V G L+ AVGQK++++QLKDND
Sbjct: 1195 LLIDIIEVVPEPGQPLTRSKVKTILSKPQNAPVTALCSVKGHLMAAVGQKLFLFQLKDND 1254

Query: 358  LTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKG 417
            L G+AF+DT++YI S +S+K+ IL+GD  +SI LLRYQ E +TL++V++D KP Q  S  
Sbjct: 1255 LVGMAFLDTQIYILSAISIKSFILIGDVHKSITLLRYQEESKTLAVVSKDTKPVQIYSIE 1314

Query: 418  YYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVV 477
            Y   N                                        S M F+ +D   N++
Sbjct: 1315 YLVDN----------------------------------------SQMAFLATDAQCNIL 1334

Query: 478  LFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFL------- 530
            ++MYQPE RE+ GG RLI++ DF++G  +NT F+IRC+   +++ P +  R L       
Sbjct: 1335 VYMYQPENRETFGGQRLIRRGDFNIGSRINTMFRIRCR---LAEVPRSERRLLSDLEARH 1391

Query: 531  -TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNP 589
             T YASLDGA G+ LP+ EK YRRLLMLQNV+ ++  H GGLNP+AFR  +       NP
Sbjct: 1392 VTLYASLDGAFGYLLPISEKTYRRLLMLQNVLNSYCQHVGGLNPKAFRIMQTDVRALSNP 1451

Query: 590  SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
             + I+DG L+  F+ L+  E+ E+ +KIG+  + I  +L +IE L+ HF
Sbjct: 1452 QKNIVDGDLINVFMDLNFNEKAEVARKIGTTVHQIQLDLAEIEGLTYHF 1500


>gi|301773406|ref|XP_002922132.1| PREDICTED: LOW QUALITY PROTEIN: cleavage and polyadenylation
            specificity factor subunit 1-like [Ailuropoda
            melanoleuca]
          Length = 1469

 Score =  578 bits (1490), Expect = e-162,   Method: Compositional matrix adjust.
 Identities = 300/649 (46%), Positives = 406/649 (62%), Gaps = 86/649 (13%)

Query: 16   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGALKLRFKKL--KVLFVSD 69
            +V+E+L V+LG   +RP LLV    ELLIY+AF H     +G LK+RFKK+   + F   
Sbjct: 881  LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKVPHNINFREK 940

Query: 70   RSKR--------ANEQPGLPRGVRISQMRYFSNIAGYQG-------VFLCGPHPAWLFLT 114
            + K         + E+    RG R+++ RYF +I GY G       VF+CGP P WL +T
Sbjct: 941  KPKPSKKKVEGGSAEEGAGARG-RVARFRYFEDIYGYSGGGGACPQVFICGPSPHWLLVT 999

Query: 115  SRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKV 174
             RG LR HPM IDGP+ + APFHNVNCPRGFLYFN + ELRISVLP +LSYDAPWPVRK+
Sbjct: 1000 GRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKI 1059

Query: 175  PLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHV 234
            PL+CT H++AYH+E+K Y + TST  P T   +  GE+KE  T  RD R+I P    F +
Sbjct: 1060 PLRCTAHYVAYHVESKVYAVATSTNMPCTRIPRMTGEEKEFETIERDDRYIHPQQEAFSI 1119

Query: 235  SLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCR 294
             L SP SWE IP     L EWEHV C+K VS+  E T+SGL+GY+A GT     E+VTCR
Sbjct: 1120 QLISPVSWEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCR 1179

Query: 295  GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLK 354
            GRIL+ D+IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH  G LV+A+GQKI++W L+
Sbjct: 1180 GRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLR 1239

Query: 355  DNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPN 414
             ++LTG+AFIDT++YI  M+SVKN IL  D  +SI+LLRYQ E +TLSLV+RD KP +  
Sbjct: 1240 ASELTGMAFIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVY 1299

Query: 415  SKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDK 474
            S  +   N                                        + +GF++SD+D+
Sbjct: 1300 SVDFMVDN----------------------------------------AQLGFLVSDRDR 1319

Query: 475  NVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARS-----RF 529
            N++++MY PEA+ES GG RL+++ DFH+G HVNTF++  C+    ++ P  +S     + 
Sbjct: 1320 NLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCR--GAAEGPSKKSVVWENKH 1377

Query: 530  LTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNP 589
            +TW+A+LDG +G  LP+ EK  R    LQ        H   ++ R  +          N 
Sbjct: 1378 ITWFATLDGGIGLLLPMQEKTNR----LQPAXSPRMLH---VDRRILQ----------NA 1420

Query: 590  SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
             R ++DG L+ ++L LS  ER E+ KKIG+  + ILD+L + + +++HF
Sbjct: 1421 VRNVLDGELLNRYLYLSTMERGELAKKIGTTPDIILDDLLETDRVTAHF 1469


>gi|410042329|ref|XP_003954555.1| PREDICTED: LOW QUALITY PROTEIN: cleavage and polyadenylation
            specificity factor subunit 1 [Pan troglodytes]
          Length = 1296

 Score =  578 bits (1489), Expect = e-162,   Method: Compositional matrix adjust.
 Identities = 272/540 (50%), Positives = 360/540 (66%), Gaps = 43/540 (7%)

Query: 102  FLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPT 161
            F+CGP P WL +T RG LR HPM IDGPV + APFHNVNCPRGFLYFN + ELRISVLP 
Sbjct: 797  FICGPSPPWLLVTGRGALRLHPMAIDGPVDSFAPFHNVNCPRGFLYFNRQGELRISVLPA 856

Query: 162  HLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRD 221
            +LSYDAPWPVRK+PL+CT H++AYH+E+K Y + TST  P     +  GE+KE  T  RD
Sbjct: 857  YLSYDAPWPVRKIPLRCTAHYVAYHVESKVYAVATSTNTPCARIPRMTGEEKEFETIERD 916

Query: 222  SRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIAL 281
             R+I P    F + L SP SWE IP     L EWEHV C+K VS+  E T+SGL+GY+A 
Sbjct: 917  ERYIHPQQEAFSIQLISPVSWEAIPNARIELQEWEHVTCMKTVSLRSEETVSGLKGYVAA 976

Query: 282  GTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLV 341
            GT     E+VTCRGRIL+ D+IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH  G LV
Sbjct: 977  GTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLV 1036

Query: 342  TAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTL 401
            +A+GQKI++W L+ ++LTG+AFIDT++YI  M+SVKN IL  D  +SI+LLRYQ E +TL
Sbjct: 1037 SAIGQKIFLWSLRASELTGMAFIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTL 1096

Query: 402  SLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDE 461
            SLV+RD KP +  S  +   N                                       
Sbjct: 1097 SLVSRDAKPLEVYSVDFMVDN--------------------------------------- 1117

Query: 462  FSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISD 521
             + +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF++  C+ ++   
Sbjct: 1118 -AQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGATEGL 1176

Query: 522  APGA---RSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRT 578
            +  +    ++ +TW+A+LDG +G  LP+ EK YRRLLMLQN + T   H  GLNPRAFR 
Sbjct: 1177 SKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRM 1236

Query: 579  YKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
                     N  R ++DG L+ ++L LS  ER E+ KKIG+  + ILD+L + + +++HF
Sbjct: 1237 LHVDRRTLQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDIILDDLLETDRVTAHF 1296


>gi|195583398|ref|XP_002081509.1| GD25678 [Drosophila simulans]
 gi|194193518|gb|EDX07094.1| GD25678 [Drosophila simulans]
          Length = 1450

 Score =  577 bits (1488), Expect = e-162,   Method: Compositional matrix adjust.
 Identities = 293/615 (47%), Positives = 397/615 (64%), Gaps = 56/615 (9%)

Query: 2    GNFRSHSPSAMDETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGALKLRFKK 61
            G  ++  P   +  +  EL  + LGL+G RPLLLVRT+ ELLIYQ FR+PKG LK+RF+K
Sbjct: 847  GIVQACMPQHANSPLPLELSVIGLGLNGERPLLLVRTRVELLIYQVFRYPKGHLKIRFRK 906

Query: 62   LKVLFVSDRS------KRANEQPGL------PRGVRISQMRYFSNIAGYQGVFLCGPHPA 109
            L    + D+          +EQ  +      P+ V+  ++R F+N+ G  GV +CG +P 
Sbjct: 907  LDXXNLLDQQPTHIELDENDEQEEIESYQMQPKYVQ--KLRPFANVGGLSGVMVCGVNPC 964

Query: 110  WLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPW 169
            ++FLT RGELR H +  +G V + A F+NVN P GFLYF+   EL+ISVLP++LSYD+ W
Sbjct: 965  FVFLTFRGELRIHRLLGNGDVRSFAAFNNVNIPNGFLYFDTTYELKISVLPSYLSYDSIW 1024

Query: 170  PVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLV 229
            PVRKVPL+CTP  L YH E + YC++T T EP T YY+FNGEDKEL  + R  RFI P+ 
Sbjct: 1025 PVRKVPLRCTPRQLVYHRENRVYCLITQTEEPMTKYYRFNGEDKELSEESRGERFIYPIG 1084

Query: 230  SQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSE 289
            SQF + L SP +WE +P  +     WEHV   K V + YEGT SGL+ Y+ +GTN+NYSE
Sbjct: 1085 SQFEMVLISPETWEIVPDASITFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNYSE 1144

Query: 290  DVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIY 349
            D+T RG I ++DIIEVVPEPG+P+TK KIK I+ KEQKGPV+AI  V GFLVT +GQKIY
Sbjct: 1145 DITSRGNIHIYDIIEVVPEPGKPMTKFKIKEIFKKEQKGPVSAISDVLGFLVTGLGQKIY 1204

Query: 350  IWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYK 409
            IWQL+D DL G+AFIDT +Y+  +++VK+LI + D  +SI+LLR+Q EYRTLSL +RD+ 
Sbjct: 1205 IWQLRDGDLIGVAFIDTNIYVHQIITVKSLIFIADVYKSISLLRFQEEYRTLSLASRDFN 1264

Query: 410  PTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMI 469
            P +     +   N                                        S++GF++
Sbjct: 1265 PLEVYGIEFMVDN----------------------------------------SNLGFLV 1284

Query: 470  SDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPG--ARS 527
            +D ++N++++MYQPEARES GG +L++K D+HLGQ VNT F+++C    +         +
Sbjct: 1285 TDAERNLIVYMYQPEARESLGGQKLLRKADYHLGQVVNTMFRVQCHQKGLHQRQPFLYEN 1344

Query: 528  RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAG 587
            +    Y +LDGALG+ LPLPEK YRR LMLQNV++++  H  GLNP+ +RT K       
Sbjct: 1345 KHFVVYGTLDGALGYCLPLPEKVYRRFLMLQNVLLSYQEHLCGLNPKEYRTLKSSKKQGI 1404

Query: 588  NPSRGIIDGSLVWKF 602
            NPSR IIDG L+W +
Sbjct: 1405 NPSRCIIDGDLIWSY 1419


>gi|156364999|ref|XP_001626630.1| predicted protein [Nematostella vectensis]
 gi|156213514|gb|EDO34530.1| predicted protein [Nematostella vectensis]
          Length = 1420

 Score =  577 bits (1488), Expect = e-162,   Method: Compositional matrix adjust.
 Identities = 294/645 (45%), Positives = 404/645 (62%), Gaps = 55/645 (8%)

Query: 6    SHSPSAMDETI-VQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP--KGALKLRFKKL 62
            + S  + +E++ V+E+L   LG    R  L+     +LLIY+AF +P  +G L LRFKKL
Sbjct: 819  TQSSVSEEESLNVREVLLTGLGYKNRRATLVAVMDQDLLIYEAFSYPTVEGHLNLRFKKL 878

Query: 63   KVLFVSDRSKRANEQP--------GLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLT 114
            +   +  R K+  ++P        GL    +++ +R F++I+ Y G+F+CG +P W+F+T
Sbjct: 879  Q-HNIQIREKKPKQEPKNDSETKSGL--DPKVAMLRVFNDISSYSGIFVCGSYPFWIFVT 935

Query: 115  SRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKV 174
            +RG    HPM+IDGPV+  A FHNVNCP+GFLYFN + ELRISVLPTHLSYD+PWPVRKV
Sbjct: 936  NRGAFHWHPMSIDGPVTCFAAFHNVNCPKGFLYFNTRGELRISVLPTHLSYDSPWPVRKV 995

Query: 175  PLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHV 234
            PL+ TPH ++Y+ E+KTY IVTS  EP     +   EDKE V   RD+RFI P   +F +
Sbjct: 996  PLRYTPHMVSYNRESKTYAIVTSEQEPCKKIPRVTAEDKEFVDTIRDARFIYPSTERFVL 1055

Query: 235  SLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCR 294
             L SP SWE IP T   L EWEHV  +KN+ +  E T +G +G+I +GT   Y E++  R
Sbjct: 1056 QLISPISWEVIPNTRHDLDEWEHVTTMKNLLLHSEETHTGRKGFICVGTTQLYGEEIAVR 1115

Query: 295  GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLK 354
            GRIL+FDIIEVVPEPGQPLTKNK K++Y KEQKGPVTA+  V G+LV+ +GQKIYIW   
Sbjct: 1116 GRILIFDIIEVVPEPGQPLTKNKFKLLYEKEQKGPVTALNQVNGYLVSGIGQKIYIWNFT 1175

Query: 355  DNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPN 414
            DNDL G+AFIDT++YI S+V+++N ++  D  +SI LLR Q E +TL+ V++D     P 
Sbjct: 1176 DNDLVGMAFIDTQLYIHSLVTIRNFVIAADVCKSITLLRLQEETKTLAFVSKD-----PK 1230

Query: 415  SKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDK 474
            +   YA +     IDG                                  +GF++SD +K
Sbjct: 1231 NLEVYAAD---FFIDG--------------------------------PQIGFLVSDVEK 1255

Query: 475  NVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPS-SISDAPGARSRFLTWY 533
            N+VLF YQPEA ES GG RL+++ D ++G H+ +FF+I  K     S       R LT +
Sbjct: 1256 NLVLFTYQPEAIESQGGQRLLQRADINVGTHITSFFRIAAKAHLKASGEKSKEMRQLTCF 1315

Query: 534  ASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGI 593
             +LDGALG  LP+ EK +RRL MLQ  +V    H  GLNP+AFR  + +     NP R +
Sbjct: 1316 GTLDGALGLMLPMTEKTFRRLHMLQTKLVDCIPHVAGLNPKAFRMLQWRKRKLCNPHRNV 1375

Query: 594  IDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
            +D  L++K++ LS  ER E+ +KIG+    I+D++ DIE   + F
Sbjct: 1376 LDWQLLFKYMHLSFMERQEVARKIGTTPAQIMDDMMDIERACAQF 1420


>gi|390358535|ref|XP_789715.3| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Strongylocentrotus purpuratus]
          Length = 1223

 Score =  568 bits (1465), Expect = e-159,   Method: Compositional matrix adjust.
 Identities = 293/653 (44%), Positives = 402/653 (61%), Gaps = 76/653 (11%)

Query: 17   VQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRH----PKGALKLRFKKL-KVLFVSDRS 71
            VQE+L V LG    +  +L   + +++IY+AF +     +  L++RF+K+   + +  + 
Sbjct: 616  VQEVLLVGLGHDRKKIYMLALVEDDIMIYEAFPYNTVTQEHHLRVRFRKIPHKILMKPKK 675

Query: 72   KRANEQPGLPRGV-----------------RISQMRYFSNIAGYQGVFLCGPHPAWLFLT 114
             R +++P    G                  R++++R F N+  Y GVF+ G HP WLF+T
Sbjct: 676  TRTSKKPTAEGGTKPETETEAESDTKTTSRRVNRLREFHNVQTYSGVFISGSHPYWLFVT 735

Query: 115  SRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKV 174
            SRG LR HPM +DG +S  A FHNVNCP GFLYFN K ELRI VLP+HLSYDAPWPVRKV
Sbjct: 736  SRGALRTHPMPVDGAISCFASFHNVNCPNGFLYFNRKEELRICVLPSHLSYDAPWPVRKV 795

Query: 175  PLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDP--RDSRFIPPLVSQF 232
            PL+CTPHF+AYH+ETKTY +VTS  E  T  +K  GE  E+  +P  RD RF+P     F
Sbjct: 796  PLRCTPHFVAYHVETKTYAVVTSVQETKTHVWKVTGE--EIGEEPVERDDRFVPTTKVVF 853

Query: 233  HVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVT 292
             + LFSP SW+ IP T       E+V CLK V++  EGT++G +GY+ + T + YSED+ 
Sbjct: 854  SIQLFSPVSWDAIPNTRIEYEAAENVTCLKVVNLSCEGTMTGKKGYVVVATTHVYSEDLQ 913

Query: 293  CRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQ 352
             RG + ++D IEVVPEPGQPLTKNK+K +Y K QKGPV+A+C V GFL+T +GQK+Y+WQ
Sbjct: 914  TRGSVYIYDCIEVVPEPGQPLTKNKLKPLYEKRQKGPVSALCEVMGFLLTCIGQKVYMWQ 973

Query: 353  LKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQ 412
             KDNDL G+AFIDT++YI + VSVK  IL+ D  +    L+YQ + RTLSLV+RD +P  
Sbjct: 974  FKDNDLIGLAFIDTQIYIHNAVSVKQFILITDVMKGAYFLQYQAQDRTLSLVSRDARP-- 1031

Query: 413  PNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEI--CKKIGSKHNDILDEFSSMGFMIS 470
                                            LEI  C        + + +   M F++S
Sbjct: 1032 --------------------------------LEIFGC--------EFMVDDKQMAFLVS 1051

Query: 471  DKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCK---PSS--ISDAPGA 525
            D DKN+++F Y PEA ES+GG  L+++ D ++G  VNTF ++RC+   PS+  +   P  
Sbjct: 1052 DADKNLIVFHYHPEAPESHGGAYLLRRGDMNIGSAVNTFVRVRCRLTDPSTEQVLSGPVL 1111

Query: 526  RSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYY 585
            R R + ++A+LDG+LG  LP+ EK YRRLLMLQNV+     H GGLNP+++R  K     
Sbjct: 1112 R-RQVVFFATLDGSLGLLLPMVEKTYRRLLMLQNVLTNGLPHVGGLNPKSYRHVKSHMRN 1170

Query: 586  AGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
              NP R I+DG L+ K+  LS+ ER E  KKIG+  + I+ +L   E L+ HF
Sbjct: 1171 LNNPHRNILDGDLLLKYCHLSVVERNEFAKKIGTSVDQIISDLMLAENLTMHF 1223


>gi|390347522|ref|XP_003726804.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Strongylocentrotus purpuratus]
          Length = 1439

 Score =  565 bits (1456), Expect = e-158,   Method: Compositional matrix adjust.
 Identities = 292/653 (44%), Positives = 402/653 (61%), Gaps = 76/653 (11%)

Query: 17   VQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRH----PKGALKLRFKKL-KVLFVSDRS 71
            VQE+L V LG    +  +L   + +++IY+AF +     +  L++RF+K+   + +  + 
Sbjct: 832  VQEVLLVGLGHDRKKIYMLALVEDDIMIYEAFPYNTVTQEHHLRVRFRKIPHKILMKPKK 891

Query: 72   KRANEQPGL-----------------PRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLT 114
             R +++P                    +  R++++R F N+  Y GVF+ G HP WLF+T
Sbjct: 892  TRTSKKPTAEGGTKTETETEAESDTKTQTRRVNRLREFHNVQTYSGVFISGSHPYWLFVT 951

Query: 115  SRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKV 174
            SRG LR HPM +DG +S  A FHNVNCP GFLYFN K ELRI VLP+HLSYDAPWPVRKV
Sbjct: 952  SRGALRTHPMPVDGAISCFASFHNVNCPNGFLYFNRKEELRICVLPSHLSYDAPWPVRKV 1011

Query: 175  PLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDP--RDSRFIPPLVSQF 232
            PL+CTPHF+AYH+ETKTY +VTS  E  T  +K  GE  E+  +P  RD RF+P     F
Sbjct: 1012 PLRCTPHFVAYHVETKTYAVVTSVQETKTHVWKVTGE--EIGEEPVERDDRFVPTTKVVF 1069

Query: 233  HVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVT 292
             + LFSP SW+ IP T       E+V CLK V++  EGT++G +GY+ + T + YSED+ 
Sbjct: 1070 SIQLFSPVSWDAIPNTRIEYEAAENVTCLKVVNLSCEGTMTGKKGYVVVATTHVYSEDLQ 1129

Query: 293  CRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQ 352
             RG + ++D IEVVPEPGQPLTKNK+K +Y K QKGPV+A+C V GFL+T +GQK+Y+WQ
Sbjct: 1130 TRGSVYIYDCIEVVPEPGQPLTKNKLKPLYEKRQKGPVSALCEVMGFLLTCIGQKVYMWQ 1189

Query: 353  LKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQ 412
             KDNDL G+AFIDT++YI + VSVK  IL+ D  +    L+YQ + RTLSLV+RD +P  
Sbjct: 1190 FKDNDLIGLAFIDTQIYIHNAVSVKQFILITDVMKGAYFLQYQAQDRTLSLVSRDARP-- 1247

Query: 413  PNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEI--CKKIGSKHNDILDEFSSMGFMIS 470
                                            LEI  C        + + +   M F++S
Sbjct: 1248 --------------------------------LEIFGC--------EFMVDDKQMAFLVS 1267

Query: 471  DKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCK---PSS--ISDAPGA 525
            D DKN+++F Y PEA ES+GG  L+++ D ++G  VNTF ++RC+   PS+  +   P  
Sbjct: 1268 DADKNLIVFHYHPEAPESHGGAYLLRRGDMNIGSAVNTFVRVRCRLTDPSTEQVLSGPVL 1327

Query: 526  RSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYY 585
            R R + ++A+LDG+LG  LP+ EK YRRLLMLQNV+     H GGLNP+++R  K     
Sbjct: 1328 R-RQVVFFATLDGSLGLLLPMVEKTYRRLLMLQNVLTNGLPHVGGLNPKSYRHVKSHMRN 1386

Query: 586  AGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
              NP R I+DG L+ K+  LS+ ER E  KKIG+  + I+ +L   E L+ HF
Sbjct: 1387 LNNPHRNILDGDLLLKYCHLSVVERNEFAKKIGTSVDQIISDLMLAENLTMHF 1439


>gi|395740218|ref|XP_002819588.2| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
            [Pongo abelii]
          Length = 1388

 Score =  555 bits (1429), Expect = e-155,   Method: Compositional matrix adjust.
 Identities = 286/640 (44%), Positives = 386/640 (60%), Gaps = 60/640 (9%)

Query: 16   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGALKLRFKKL--------- 62
            +V+E+L V+LG   +RP LLV    ELLIY+AF H     +G LK+RFKK+         
Sbjct: 792  LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKVPHNINFREK 851

Query: 63   -KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRA 121
                        + E+    RG R+++ RYF +I GY GVF+CGP P WL +T RG LR 
Sbjct: 852  KPKPSKKKAEGGSTEEGAGARG-RVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRL 910

Query: 122  HPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPH 181
            HPM IDGPV + APFHNVNCPRGFLYFN +   R+S  P+      P P   + L    H
Sbjct: 911  HPMAIDGPVDSFAPFHNVNCPRGFLYFNRQEPQRLSGSPSRTXXXXPTPPGLLGLPG--H 968

Query: 182  FLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFS 241
            +       + Y + TST  P     +  GE+KE  T  RD R+I P    F + L SP S
Sbjct: 969  WCVTPTNPQVYAVATSTNTPCARIPRMTGEEKEFETIERDERYIHPQQEAFSIQLISPVS 1028

Query: 242  WEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFD 301
            WE IP     L EWEHV C+K VS+  E T+SGL+GY+A GT     E+VTCRGRIL+ D
Sbjct: 1029 WEAIPNARIELQEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMD 1088

Query: 302  IIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGI 361
            +IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH  G LV+A+GQKI++W L+ ++LTG+
Sbjct: 1089 VIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGM 1148

Query: 362  AFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAG 421
            AFIDT++YI  M+SVKN IL  D  +SI+LLRYQ E +TLSLV+RD KP +  S  +   
Sbjct: 1149 AFIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVD 1208

Query: 422  NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMY 481
            N                                        + +GF++SD+D+N++++MY
Sbjct: 1209 N----------------------------------------AQLGFLVSDRDRNLMVYMY 1228

Query: 482  QPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGA---RSRFLTWYASLDG 538
             PEA+ES GG RL+++ DFH+G HVNTF++  C+ ++   +  +    ++ +TW  S+ G
Sbjct: 1229 LPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGAAEGLSKKSVVWENKHITWLVSVRG 1288

Query: 539  ALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSL 598
             +G  LP+ EK YRRLLMLQN + T   H  GLNPRAFR          N  R ++DG L
Sbjct: 1289 GIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRTLQNAVRNVLDGEL 1348

Query: 599  VWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
            + ++L LS  ER E+ KKIG+  + ILD+L + + +++HF
Sbjct: 1349 LNRYLYLSTMERSELAKKIGTTPDIILDDLLETDRVTAHF 1388


>gi|241060959|ref|XP_002408050.1| cleavage and polyadenylation specificity factor, putative [Ixodes
            scapularis]
 gi|215492346|gb|EEC01987.1| cleavage and polyadenylation specificity factor, putative [Ixodes
            scapularis]
          Length = 1241

 Score =  553 bits (1425), Expect = e-154,   Method: Compositional matrix adjust.
 Identities = 289/638 (45%), Positives = 390/638 (61%), Gaps = 82/638 (12%)

Query: 16   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAF----RHPKGALKLRFKKLKVLFVSDRS 71
            +V E+L V LG+  +RPLLL R   +LLIY+AF       +G LKLRFKKL    +    
Sbjct: 645  VVHEILMVGLGVRQSRPLLLARVDEDLLIYEAFPFYETQREGHLKLRFKKLNHDIILRSR 704

Query: 72   KRANEQPGLPRGVRISQMRY----FSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTID 127
            K   ++P      +  Q R     FS+I+GY GVFLCG  P WLF++SRGELR HPM +D
Sbjct: 705  KYKTQKPENEEEEKAFQSRLWLQPFSDISGYSGVFLCGHRPHWLFMSSRGELRYHPMFVD 764

Query: 128  GPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVR---KVPLKCTPHFLA 184
            GPV   APFHNVNCP+GFL+FN +S+    +L ++     P P R   ++   C  H   
Sbjct: 765  GPVYCFAPFHNVNCPKGFLHFNKQSDSYALLLHSYWLSQLPSPKRHGERLLFNCPSH--- 821

Query: 185  YHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDP------------RDSRFIPPLVSQF 232
                 K  CI          ++    +  + +  P             DSR+I P + +F
Sbjct: 822  -----KKICI------HRCHFFALQQKAADFLWPPPFVTTVSPLPFVADSRYIFPTMDKF 870

Query: 233  HVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVT 292
             + L SP SWE IP T   L EWEH+ C+KNV +  EGT +G++GY+ALGTNY Y EDVT
Sbjct: 871  SLQLLSPVSWETIPNTRVDLDEWEHLTCIKNVMLSSEGTSTGMKGYLALGTNYCYGEDVT 930

Query: 293  CRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQ 352
             RGRI + DII+VVPEPGQPLTKNKIK++Y+KEQKGPVTA+  V GFL++A+GQK+YIWQ
Sbjct: 931  SRGRITILDIIDVVPEPGQPLTKNKIKIVYSKEQKGPVTALSQVVGFLLSAIGQKMYIWQ 990

Query: 353  LKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQ 412
            LKDN L G+AFIDT++YI S+V+VKNLILVGD  +S++LLRYQ   RTLSLV+RD +P +
Sbjct: 991  LKDNGLVGVAFIDTQIYIHSVVTVKNLILVGDVFKSVSLLRYQEASRTLSLVSRDVRPLE 1050

Query: 413  PNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDK 472
              +  ++  N                                        S M F+++D 
Sbjct: 1051 VFAVEFFIDN----------------------------------------SQMSFLVTDS 1070

Query: 473  DKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISD-----APGARS 527
            ++N++L+MYQPE+RES GG RL+++ DFH+G  V + F+I+C+   ++      A     
Sbjct: 1071 ERNMILYMYQPESRESCGGQRLLRRGDFHIGSPVVSMFRIKCRMGEVAKHDRRLAASVDG 1130

Query: 528  RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAG 587
            R +T  A+LDG+LG+ LP+PEK YRRLLMLQNV+VT+  H  GLNP+AFR Y  +    G
Sbjct: 1131 RHITMLATLDGSLGYVLPVPEKTYRRLLMLQNVLVTNMPHYAGLNPKAFRMYHSQRRVLG 1190

Query: 588  NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDIL 625
            NP + I+DG L+WKF+ LS  ER E+ KKIG+    ++
Sbjct: 1191 NPHKNILDGELIWKFMHLSFMERSELSKKIGTTVTQVV 1228


>gi|260835071|ref|XP_002612533.1| hypothetical protein BRAFLDRAFT_120973 [Branchiostoma floridae]
 gi|229297910|gb|EEN68542.1| hypothetical protein BRAFLDRAFT_120973 [Branchiostoma floridae]
          Length = 1003

 Score =  550 bits (1416), Expect = e-153,   Method: Compositional matrix adjust.
 Identities = 286/638 (44%), Positives = 389/638 (60%), Gaps = 86/638 (13%)

Query: 17   VQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRH----PKGALKLRFKKLK-VLFVSDR- 70
            V+E+L V LG  G+RP LL R   +LLIY+AF +        LK+RFKK++  L + +R 
Sbjct: 436  VKEILMVGLGHKGSRPHLLARVDEDLLIYEAFPYHLSPSYTMLKIRFKKVQHNLILRERK 495

Query: 71   ---SKRA--NEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMT 125
               +K+A   E+     G RI   R F++I+GY G+F+CG  P WLF+TSRG LR HPM+
Sbjct: 496  GGKTKKAGDQEESDGQTGSRIQHFRTFTDISGYSGLFICGSSPHWLFMTSRGALRIHPMS 555

Query: 126  IDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAY 185
            IDG V+  +PFHNVNCP+GFLYFN   ELRISVLPTHLSYDAPWPVRKVPL+CTPHF+AY
Sbjct: 556  IDGAVTCFSPFHNVNCPKGFLYFNRGGELRISVLPTHLSYDAPWPVRKVPLRCTPHFVAY 615

Query: 186  HLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEI 245
            H+E K Y +  ST E      +  G++KE     +D R+I P++ +F++ L SP SWE I
Sbjct: 616  HMECKVYAVAASTFEMCNRIPRMAGDEKEYDAVEKDDRYIYPMLDKFNIQLMSPVSWEII 675

Query: 246  PQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEV 305
            P T                 M+ E   +       +G N+     +   G+I++ D+IEV
Sbjct: 676  PNTR---------------GMQLEENYAECTCSFLVGINFV----LFVAGQIVILDVIEV 716

Query: 306  VPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFID 365
            VPEPGQPLTKNKIK +Y KEQKGPV+A+C   G+L++A+GQKI++W+ ++NDL G+AFID
Sbjct: 717  VPEPGQPLTKNKIKELYGKEQKGPVSALCGCNGYLLSAIGQKIFLWEFRNNDLIGVAFID 776

Query: 366  TEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSR 425
            T+VYI + +S+KN +++ D  +SI+LLRYQ           D +P +     ++  N   
Sbjct: 777  TQVYIHTAISIKNYVILADVFKSISLLRYQ-----------DMRPLETYCVEFFVDN--- 822

Query: 426  GIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEA 485
                                                 + +GF++SD  KN +L+ YQPEA
Sbjct: 823  -------------------------------------AQIGFLVSDAQKNFLLYSYQPEA 845

Query: 486  RESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSIS-----DAPGARSRFLTWYASLDGAL 540
            RES GG RL+++ DF++G HVNTFF++RCK    S     DA     R +T +A+LDG L
Sbjct: 846  RESYGGQRLVRRADFNVGSHVNTFFRVRCKIMDPSGERRRDADTVAKRHVTMFATLDGGL 905

Query: 541  GFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVW 600
            G  LP+ EK YRRLLMLQN ++TH     GLNP+AFR  K       N  R I+DG L+W
Sbjct: 906  GALLPMAEKTYRRLLMLQNTLMTHMPFPAGLNPKAFRMLKHNHRSLINACRNILDGELLW 965

Query: 601  KFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
            KFL LS+ ER E+ +KIG+    I ++L DI+ LS+HF
Sbjct: 966  KFLHLSVVERSELARKIGTSPETITEDLMDIDRLSAHF 1003


>gi|384946686|gb|AFI36948.1| cleavage and polyadenylation specificity factor subunit 1 [Macaca
            mulatta]
          Length = 1428

 Score =  546 bits (1407), Expect = e-152,   Method: Compositional matrix adjust.
 Identities = 286/637 (44%), Positives = 379/637 (59%), Gaps = 66/637 (10%)

Query: 16   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGALKLRFKKL--------- 62
            +V+E+L V+LG   +RP LLV    ELLIY+AF H     +G LK+RFKK+         
Sbjct: 844  LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKVPHNINFREK 903

Query: 63   -KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRA 121
                          E+    RG R+++ RYF +I GY GVF+CGP P WL +T RG LR 
Sbjct: 904  KPKPSKKKAEGGGTEEGAGARG-RVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRL 962

Query: 122  HPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPH 181
            HPM IDGPV + APFHNVNCPRGFLYFN + ELRISVLP +LSYDAPWPVRK+PL+CT H
Sbjct: 963  HPMAIDGPVDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAH 1022

Query: 182  FLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFS 241
            ++AYH+E+K Y + TST  P     +  GE+KE  T  RD R+I P    F + L SP S
Sbjct: 1023 YVAYHVESKVYAVATSTNTPCARIPRMTGEEKEFETIERDERYIHPQQEAFSIQLISPVS 1082

Query: 242  WEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFD 301
            WE IP     L EWEHV C+K VS+  E T+SGL+GY+A GT     E+VTCRGRIL+ D
Sbjct: 1083 WEAIPNARIELQEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMD 1142

Query: 302  IIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGI 361
            +IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH  G LV+A+GQKI++W L+ ++LTG+
Sbjct: 1143 VIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGM 1202

Query: 362  AFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAG 421
            AFIDT++YI  M+SVKN IL  D  +SI+LLRYQ E +TLSLV+RD KP +  S  +   
Sbjct: 1203 AFIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVD 1262

Query: 422  NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMY 481
            N                                        + +GF++SD+D+N++++MY
Sbjct: 1263 N----------------------------------------AQLGFLVSDRDRNLMVYMY 1282

Query: 482  QPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALG 541
             PEA+ES GG RL+++ DFH+G HVNTF++  C+ ++     G   + + W        G
Sbjct: 1283 LPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGAT----EGLSKKSVVWENKHITWFG 1338

Query: 542  FFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWK 601
              LP                  H S      P           +         DG L+ +
Sbjct: 1339 EDLPA-------AADAAERADHHASAPRRPQPPCLPDAARGPPHPPECCAQRADGELLNR 1391

Query: 602  FLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
            +L LS  ER E+ KKIG+  + ILD+L + + +++HF
Sbjct: 1392 YLYLSTMERSELAKKIGTTPDIILDDLLETDRVTAHF 1428


>gi|321475208|gb|EFX86171.1| hypothetical protein DAPPUDRAFT_313209 [Daphnia pulex]
          Length = 1260

 Score =  541 bits (1393), Expect = e-151,   Method: Compositional matrix adjust.
 Identities = 265/518 (51%), Positives = 344/518 (66%), Gaps = 54/518 (10%)

Query: 9    PSAMDETIVQELLTVSLGLHGNRPLLLVRTQH-ELLIYQAF-------RHPKGALKLRFK 60
            PS+    IV E+    LG    RPLL++RT    +L+Y+A           K  LK+RF+
Sbjct: 784  PSSTHCNIV-EMGIFGLGHLHRRPLLMIRTSDFGVLLYEAIPALPVYDSKQKNELKIRFR 842

Query: 61   KLKVLFVSDRSKRANEQPGL-----PRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTS 115
            KL    +   +K    + G      P   + +Q +YFSNIAGY GVF+ GP+P WLF+TS
Sbjct: 843  KLNHSLLLRETKTYVRKGGQSVVLEPYAWKTNQFKYFSNIAGYTGVFIGGPYPHWLFMTS 902

Query: 116  RGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVP 175
            RGELR HPM+IDG +   A FHNVNC +GF+Y N K ELRI +LPT  +YDAPWPVRKVP
Sbjct: 903  RGELRLHPMSIDGSIKCFACFHNVNCAQGFIYLNRKDELRICLLPTLFNYDAPWPVRKVP 962

Query: 176  LKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVS 235
            L+CTPH+L YH+ETKTY + TS AEP+   Y+FNG+DKEL  + RD RF  P V +F + 
Sbjct: 963  LRCTPHYLIYHVETKTYILATSLAEPTNRIYRFNGDDKELSLEERDDRFPYPHVEKFAIQ 1022

Query: 236  LFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRG 295
            L SP +WE +P T   L +WEHV CLK VS+EYEG  SGL+ Y+A+ TNYNY ED+  RG
Sbjct: 1023 LISPVTWEAVPNTRMDLDDWEHVTCLKTVSLEYEGHASGLKDYLAVSTNYNYGEDIISRG 1082

Query: 296  RILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKD 355
            RI + D+IEVVPEPGQPLTKNKIK +YAK+QKGPV AI  V G+LV A+GQKIY+WQLK+
Sbjct: 1083 RIFILDLIEVVPEPGQPLTKNKIKTLYAKDQKGPVAAISSVCGYLVAAIGQKIYLWQLKN 1142

Query: 356  NDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNS 415
            +DL GIAFIDTE+YI  ++++K+ IL  D  +S+++LR+Q EYRTL +VARDY+P +  +
Sbjct: 1143 DDLVGIAFIDTEIYIHQLLNIKSFILAADVYKSVSILRFQEEYRTLCIVARDYQPLEVMA 1202

Query: 416  KGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKN 475
              YY  N                                        + +GF++SD +KN
Sbjct: 1203 VDYYIDN----------------------------------------TQLGFLVSDAEKN 1222

Query: 476  VVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIR 513
            ++L+MYQPEARES GGHRLI+K DFH+GQ V+T F+I+
Sbjct: 1223 LILYMYQPEARESQGGHRLIRKADFHVGQVVSTMFRIK 1260


>gi|348555856|ref|XP_003463739.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
            isoform 2 [Cavia porcellus]
          Length = 1387

 Score =  525 bits (1351), Expect = e-146,   Method: Compositional matrix adjust.
 Identities = 276/643 (42%), Positives = 370/643 (57%), Gaps = 115/643 (17%)

Query: 16   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGALKLRFKKL--------- 62
            +V+E+L V+LG   +RP LLV    ELLIY+AF H     +G LK+RFKK+         
Sbjct: 840  LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKVPHNINFREK 899

Query: 63   --KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELR 120
              K           +E  G+ RG R+++ RYF +I GY GVF+CGP P WL +T RG LR
Sbjct: 900  KPKPSKKKAEGGSTDEGSGV-RG-RVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALR 957

Query: 121  AHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTP 180
             HPM IDGP+ + APFHNVNCPRGFLYFN + ELRISVLP +LSYDAPWPVRK+PL+CT 
Sbjct: 958  LHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTA 1017

Query: 181  HFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPF 240
            H++AYH+E+K Y + TST+ P T   +  GE+KE     RD R+I P    F + L SP 
Sbjct: 1018 HYVAYHVESKVYAVATSTSTPCTRIPRMTGEEKEFEAIERDDRYIHPQQEAFSIQLISPV 1077

Query: 241  SWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLF 300
            SWE IP     L EWEHV C+K VS+  E T+SGL+GY+A GT     E+VTCRGRI L 
Sbjct: 1078 SWEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRIFL- 1136

Query: 301  DIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTG 360
                                                              W L+ ++LTG
Sbjct: 1137 --------------------------------------------------WSLRASELTG 1146

Query: 361  IAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYA 420
            +AFIDT++YI  M+SVKN IL  D  +SI+LLRYQ E +TLSLV+RD KP +  S  +  
Sbjct: 1147 MAFIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMV 1206

Query: 421  GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFM 480
             N                                        + +GF++SD+D+N++++M
Sbjct: 1207 DN----------------------------------------AQLGFLVSDRDRNLMVYM 1226

Query: 481  YQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARS-----RFLTWYAS 535
            Y PEA+ES GG RL+++ DFH+G HVNTF++  C+    ++ P  +S     + +TW+A+
Sbjct: 1227 YLPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCR--GATEGPSKKSVVWENKHITWFAT 1284

Query: 536  LDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIID 595
            LDG +G  LP+ EK YRRLLMLQN + T   H  GLNPRAFR          N  R ++D
Sbjct: 1285 LDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLD 1344

Query: 596  GSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
            G L+ ++L LS  ER E+ KKIG+  + ILD+L + + +++HF
Sbjct: 1345 GELLNRYLYLSTMERGELAKKIGTTPDIILDDLLETDRVTAHF 1387


>gi|354491124|ref|XP_003507706.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
            isoform 2 [Cricetulus griseus]
          Length = 1388

 Score =  524 bits (1350), Expect = e-146,   Method: Compositional matrix adjust.
 Identities = 273/641 (42%), Positives = 370/641 (57%), Gaps = 111/641 (17%)

Query: 16   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGALKLRFKKL--------- 62
            +V+E+L V+LG   +RP LLV    ELLIY+AF H     +G LK+RFKK+         
Sbjct: 841  LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKVPHNINFREK 900

Query: 63   KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAH 122
            K      +++  + + G     R+++ RYF +I GY GVF+CGP P WL +T RG LR H
Sbjct: 901  KPKPSKKKAEGCSTEEGSGVRGRVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRLH 960

Query: 123  PMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHF 182
            PM IDGP+ + APFHNVNCPRGFLYFN + ELRISVLP +LSYDAPWPVRK+PL+CT H+
Sbjct: 961  PMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHY 1020

Query: 183  LAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSW 242
            +AYH+E+K Y + TST  P T   +  GE+KE     RD R+I P    F + L SP SW
Sbjct: 1021 VAYHVESKVYAVATSTNTPCTRIPRMTGEEKEFEAIERDDRYIHPQQEAFSIQLISPVSW 1080

Query: 243  EEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDI 302
            E IP     L EWEHV C+K VS+  E T+SGL+GY+A GT     E+VTCRGRI L   
Sbjct: 1081 EAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRIFL--- 1137

Query: 303  IEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIA 362
                                                            W L+ ++LTG+A
Sbjct: 1138 ------------------------------------------------WSLRASELTGMA 1149

Query: 363  FIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGN 422
            FIDT++YI  M+SVKN IL  D  +SI+LLRYQ E +TLSLV+RD KP +  S  +   N
Sbjct: 1150 FIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDN 1209

Query: 423  PSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQ 482
                                                    + +GF++SD+D+N++++MY 
Sbjct: 1210 ----------------------------------------AQLGFLVSDRDRNLMVYMYL 1229

Query: 483  PEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARS-----RFLTWYASLD 537
            PEA+ES GG RL+++ DFH+G HVNTF++  C+ +  ++ P  +S     + +TW+A+LD
Sbjct: 1230 PEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGA--AEGPSKKSVMWENKHITWFATLD 1287

Query: 538  GALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGS 597
            G +G  LP+ EK YRRLLMLQN + T   H  GLNPRAFR          N  R ++DG 
Sbjct: 1288 GGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGE 1347

Query: 598  LVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
            L+ ++L LS  ER E+ KKIG+  + ILD+L + + +++HF
Sbjct: 1348 LLNRYLYLSTMERSELAKKIGTTPDIILDDLLETDRVTAHF 1388


>gi|148697644|gb|EDL29591.1| cleavage and polyadenylation specific factor 1, isoform CRA_c [Mus
            musculus]
          Length = 1388

 Score =  524 bits (1349), Expect = e-146,   Method: Compositional matrix adjust.
 Identities = 273/641 (42%), Positives = 370/641 (57%), Gaps = 111/641 (17%)

Query: 16   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGALKLRFKKL--------- 62
            +V+E+L V+LG   +RP LLV    ELLIY+AF H     +G LK+RFKK+         
Sbjct: 841  LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKVPHNINFREK 900

Query: 63   KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAH 122
            K      +++  + + G     R+++ RYF +I GY GVF+CGP P WL +T RG LR H
Sbjct: 901  KPKPSKKKAEGCSTEEGSGGRGRVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRLH 960

Query: 123  PMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHF 182
            PM IDGP+ + APFHNVNCPRGFLYFN + ELRISVLP +LSYDAPWPVRK+PL+CT H+
Sbjct: 961  PMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHY 1020

Query: 183  LAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSW 242
            +AYH+E+K Y + TST  P T   +  GE+KE     RD R+I P    F + L SP SW
Sbjct: 1021 VAYHVESKVYAVATSTNTPCTRIPRMTGEEKEFEAIERDDRYIHPQQEAFSIQLISPVSW 1080

Query: 243  EEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDI 302
            E IP     L EWEHV C+K VS+  E T+SGL+GY+A GT     E+VTCRGRI L   
Sbjct: 1081 EAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRIFL--- 1137

Query: 303  IEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIA 362
                                                            W L+ ++LTG+A
Sbjct: 1138 ------------------------------------------------WSLRASELTGMA 1149

Query: 363  FIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGN 422
            FIDT++YI  M+SVKN IL  D  +SI+LLRYQ E +TLSLV+RD KP +  S  +   N
Sbjct: 1150 FIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDN 1209

Query: 423  PSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQ 482
                                                    + +GF++SD+D+N++++MY 
Sbjct: 1210 ----------------------------------------AQLGFLVSDRDRNLMVYMYL 1229

Query: 483  PEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARS-----RFLTWYASLD 537
            PEA+ES GG RL+++ DFH+G HVNTF++  C+ +  ++ P  +S     + +TW+A+LD
Sbjct: 1230 PEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGA--AEGPSKKSVVWENKHITWFATLD 1287

Query: 538  GALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGS 597
            G +G  LP+ EK YRRLLMLQN + T   H  GLNPRAFR          N  R ++DG 
Sbjct: 1288 GGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGE 1347

Query: 598  LVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
            L+ ++L LS  ER E+ KKIG+  + ILD+L + + +++HF
Sbjct: 1348 LLNRYLYLSTMERSELAKKIGTTPDIILDDLLETDRVTAHF 1388


>gi|194474008|ref|NP_001124043.1| cleavage and polyadenylation specificity factor subunit 1 [Rattus
            norvegicus]
 gi|149066087|gb|EDM15960.1| cleavage and polyadenylation specific factor 1, 160kDa (predicted),
            isoform CRA_a [Rattus norvegicus]
          Length = 1386

 Score =  524 bits (1349), Expect = e-146,   Method: Compositional matrix adjust.
 Identities = 273/641 (42%), Positives = 370/641 (57%), Gaps = 111/641 (17%)

Query: 16   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGALKLRFKKL--------- 62
            +V+E+L V+LG   +RP LLV    ELLIY+AF H     +G LK+RFKK+         
Sbjct: 839  LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKVPHNINFREK 898

Query: 63   KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAH 122
            K      +++  + + G     R+++ RYF +I GY GVF+CGP P WL +T RG LR H
Sbjct: 899  KPKPSKKKAEGCSTEEGSGVRGRVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRLH 958

Query: 123  PMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHF 182
            PM IDGP+ + APFHNVNCPRGFLYFN + ELRISVLP +LSYDAPWPVRK+PL+CT H+
Sbjct: 959  PMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHY 1018

Query: 183  LAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSW 242
            +AYH+E+K Y + TST  P T   +  GE+KE     RD R+I P    F + L SP SW
Sbjct: 1019 VAYHVESKVYAVATSTNTPCTRIPRMTGEEKEFEAIERDDRYIHPQQEAFSIQLISPVSW 1078

Query: 243  EEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDI 302
            E IP     L EWEHV C+K VS+  E T+SGL+GY+A GT     E+VTCRGRI L   
Sbjct: 1079 EAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRIFL--- 1135

Query: 303  IEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIA 362
                                                            W L+ ++LTG+A
Sbjct: 1136 ------------------------------------------------WSLRASELTGMA 1147

Query: 363  FIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGN 422
            FIDT++YI  M+SVKN IL  D  +SI+LLRYQ E +TLSLV+RD KP +  S  +   N
Sbjct: 1148 FIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDN 1207

Query: 423  PSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQ 482
                                                    + +GF++SD+D+N++++MY 
Sbjct: 1208 ----------------------------------------AQLGFLVSDRDRNLMVYMYL 1227

Query: 483  PEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARS-----RFLTWYASLD 537
            PEA+ES GG RL+++ DFH+G HVNTF++  C+ +  ++ P  +S     + +TW+A+LD
Sbjct: 1228 PEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGA--AEGPSKKSVMWENKHITWFATLD 1285

Query: 538  GALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGS 597
            G +G  LP+ EK YRRLLMLQN + T   H  GLNPRAFR          N  R ++DG 
Sbjct: 1286 GGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGE 1345

Query: 598  LVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
            L+ ++L LS  ER E+ KKIG+  + ILD+L + + +++HF
Sbjct: 1346 LLNRYLYLSTMERSELAKKIGTTPDIILDDLLETDRVTAHF 1386


>gi|148697642|gb|EDL29589.1| cleavage and polyadenylation specific factor 1, isoform CRA_a [Mus
            musculus]
          Length = 1417

 Score =  523 bits (1348), Expect = e-145,   Method: Compositional matrix adjust.
 Identities = 273/641 (42%), Positives = 369/641 (57%), Gaps = 111/641 (17%)

Query: 16   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGALKLRFKKL--------- 62
            +V+E+L V+LG   +RP LLV    ELLIY+AF H     +G LK+RFKK+         
Sbjct: 870  LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKVPHNINFREK 929

Query: 63   KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAH 122
            K      +++  + + G     R+++ RYF +I GY GVF+CGP P WL +T RG LR H
Sbjct: 930  KPKPSKKKAEGCSTEEGSGGRGRVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRLH 989

Query: 123  PMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHF 182
            PM IDGP+ + APFHNVNCPRGFLYFN + ELRISVLP +LSYDAPWPVRK+PL+CT H+
Sbjct: 990  PMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHY 1049

Query: 183  LAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSW 242
            +AYH+E+K Y + TST  P T   +  GE+KE     RD R+I P    F + L SP SW
Sbjct: 1050 VAYHVESKVYAVATSTNTPCTRIPRMTGEEKEFEAIERDDRYIHPQQEAFSIQLISPVSW 1109

Query: 243  EEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDI 302
            E IP     L EWEHV C+K VS+  E T+SGL+GY+A GT     E+VTCRGRI L   
Sbjct: 1110 EAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRIFL--- 1166

Query: 303  IEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIA 362
                                                            W L+ ++LTG+A
Sbjct: 1167 ------------------------------------------------WSLRASELTGMA 1178

Query: 363  FIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGN 422
            FIDT++YI  M+SVKN IL  D  +SI+LLRYQ E +TLSLV+RD KP +  S  +   N
Sbjct: 1179 FIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDN 1238

Query: 423  PSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQ 482
                                                    + +GF++SD+D+N++++MY 
Sbjct: 1239 ----------------------------------------AQLGFLVSDRDRNLMVYMYL 1258

Query: 483  PEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARS-----RFLTWYASLD 537
            PEA+ES GG RL+++ DFH+G HVNTF++  C+    ++ P  +S     + +TW+A+LD
Sbjct: 1259 PEAKESFGGMRLLRRADFHVGAHVNTFWRTPCR--GAAEGPSKKSVVWENKHITWFATLD 1316

Query: 538  GALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGS 597
            G +G  LP+ EK YRRLLMLQN + T   H  GLNPRAFR          N  R ++DG 
Sbjct: 1317 GGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGE 1376

Query: 598  LVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
            L+ ++L LS  ER E+ KKIG+  + ILD+L + + +++HF
Sbjct: 1377 LLNRYLYLSTMERSELAKKIGTTPDIILDDLLETDRVTAHF 1417


>gi|148697643|gb|EDL29590.1| cleavage and polyadenylation specific factor 1, isoform CRA_b [Mus
            musculus]
          Length = 1311

 Score =  523 bits (1348), Expect = e-145,   Method: Compositional matrix adjust.
 Identities = 273/641 (42%), Positives = 370/641 (57%), Gaps = 111/641 (17%)

Query: 16   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGALKLRFKKL--------- 62
            +V+E+L V+LG   +RP LLV    ELLIY+AF H     +G LK+RFKK+         
Sbjct: 764  LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKVPHNINFREK 823

Query: 63   KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAH 122
            K      +++  + + G     R+++ RYF +I GY GVF+CGP P WL +T RG LR H
Sbjct: 824  KPKPSKKKAEGCSTEEGSGGRGRVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRLH 883

Query: 123  PMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHF 182
            PM IDGP+ + APFHNVNCPRGFLYFN + ELRISVLP +LSYDAPWPVRK+PL+CT H+
Sbjct: 884  PMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHY 943

Query: 183  LAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSW 242
            +AYH+E+K Y + TST  P T   +  GE+KE     RD R+I P    F + L SP SW
Sbjct: 944  VAYHVESKVYAVATSTNTPCTRIPRMTGEEKEFEAIERDDRYIHPQQEAFSIQLISPVSW 1003

Query: 243  EEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDI 302
            E IP     L EWEHV C+K VS+  E T+SGL+GY+A GT     E+VTCRGRI L   
Sbjct: 1004 EAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRIFL--- 1060

Query: 303  IEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIA 362
                                                            W L+ ++LTG+A
Sbjct: 1061 ------------------------------------------------WSLRASELTGMA 1072

Query: 363  FIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGN 422
            FIDT++YI  M+SVKN IL  D  +SI+LLRYQ E +TLSLV+RD KP +  S  +   N
Sbjct: 1073 FIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDN 1132

Query: 423  PSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQ 482
                                                    + +GF++SD+D+N++++MY 
Sbjct: 1133 ----------------------------------------AQLGFLVSDRDRNLMVYMYL 1152

Query: 483  PEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARS-----RFLTWYASLD 537
            PEA+ES GG RL+++ DFH+G HVNTF++  C+ +  ++ P  +S     + +TW+A+LD
Sbjct: 1153 PEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGA--AEGPSKKSVVWENKHITWFATLD 1210

Query: 538  GALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGS 597
            G +G  LP+ EK YRRLLMLQN + T   H  GLNPRAFR          N  R ++DG 
Sbjct: 1211 GGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGE 1270

Query: 598  LVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
            L+ ++L LS  ER E+ KKIG+  + ILD+L + + +++HF
Sbjct: 1271 LLNRYLYLSTMERSELAKKIGTTPDIILDDLLETDRVTAHF 1311


>gi|340371789|ref|XP_003384427.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Amphimedon queenslandica]
          Length = 1408

 Score =  516 bits (1328), Expect = e-143,   Method: Compositional matrix adjust.
 Identities = 269/637 (42%), Positives = 386/637 (60%), Gaps = 56/637 (8%)

Query: 17   VQELLTVSLGLHGNRPLLLVRTQHELLIYQAFR-----HPKGALKLRFKKLK--VLFVSD 69
            V+++L V +GL+G +P ++     EL+IY+AF+     HP G LKLRF K++  V+    
Sbjct: 813  VEQVLCVGMGLNGKKPHIMAFINKELVIYEAFQYTSAIHP-GHLKLRFSKVQHNVILQDK 871

Query: 70   RSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGP 129
            R  +  +            +R FSNIAGY GVF+CGP+P W+F+ +RG L  HPM IDGP
Sbjct: 872  RVGKLAKHFQQQEFSFPPHLRKFSNIAGYSGVFVCGPYPHWIFMAARGHLSIHPMYIDGP 931

Query: 130  VSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLET 189
            V + APF NVNCP GFLYFN +SELRISVLPT LSYD+ WPVRKVPLK TPHF+ YH+E+
Sbjct: 932  VQSFAPFDNVNCPSGFLYFNKESELRISVLPTQLSYDSYWPVRKVPLKATPHFVGYHMES 991

Query: 190  KTYCIVTSTAEPSTDYYKFNGEDKE-LVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQT 248
            K + I+ ST +P T     NGE ++ L T  RD RF+      +++ L SP SWE IP +
Sbjct: 992  KVHVIIASTPQPVTVIPDPNGETEDALETVERDGRFVYSQEETYYLQLLSPTSWETIPHS 1051

Query: 249  NFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPE 308
             + +    HV  +K + +  + TLSG + YI +GT   + E+++ +G++L+FD+  V+PE
Sbjct: 1052 KYEMEAHYHVTDMKVMRLRSQETLSGRKEYIVVGTMATFGEELSAKGKVLIFDVSVVIPE 1111

Query: 309  PGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDN-DLTGIAFIDTE 367
            PG+P ++ ++K +Y +EQK PVT +  V G ++TA+GQKI++WQ KDN DL  +AFID E
Sbjct: 1112 PGKPFSQYRLKNLYDQEQKWPVTGLECVNGLILTAMGQKIFMWQFKDNKDLLAVAFIDAE 1171

Query: 368  VYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGI 427
             YI +  S+K  IL GD  RSI LL Y  + R+LSL+++D  P +  S  +        +
Sbjct: 1172 TYIHTAQSIKGFILTGDVTRSIQLLHYNEDRRSLSLISQDPNPMEVFSTTF--------M 1223

Query: 428  IDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARE 487
            IDG                                 ++GF++SD D+N+ LF YQPE   
Sbjct: 1224 IDG--------------------------------KALGFLVSDSDRNITLFQYQPENPA 1251

Query: 488  SNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPG------ARSRFLTWYASLDGALG 541
            S+GG  L++  D H+G  VN F  IRCK S+   A        A  R  T++ +LDG +G
Sbjct: 1252 SSGGANLVRCGDIHVGSLVNVFLNIRCKTSAGLGASREMKIALADKRQCTFFGTLDGGIG 1311

Query: 542  FFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWK 601
              LP+PEK YRRL MLQ  M     H  GLNP+AFRT++ +  Y  N  R I+DG+L+++
Sbjct: 1312 CLLPIPEKVYRRLSMLQVKMTQGMRHMAGLNPKAFRTFQTRHQYLHNAQRNILDGTLLYQ 1371

Query: 602  FLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
            +L L+  E+ +  K+IG+    I+++L +I+ + SHF
Sbjct: 1372 YLSLTAKEKFDFSKQIGTTVAQIMEDLKEIDKVMSHF 1408


>gi|198415711|ref|XP_002123169.1| PREDICTED: similar to cleavage and polyadenylation specificity factor
            1, partial [Ciona intestinalis]
          Length = 1370

 Score =  473 bits (1218), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 246/587 (41%), Positives = 353/587 (60%), Gaps = 57/587 (9%)

Query: 5    RSHSPSAMDETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPK-------GALKL 57
            +S S    D+  + E+L V LG   + P L+ R + E+LIY+ F+           +L++
Sbjct: 827  KSTSTRYSDKPRIFEILLVGLGYKNSSPHLIARIEEEILIYEVFKFSAPEKFKKYNSLQI 886

Query: 58   RFKKLKVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRG 117
            RFKK+    +  R+   +E        R + +R FSNI GY GVFLCGP+P W+F+T RG
Sbjct: 887  RFKKVNHSMMIRRAPVTHETKTDQLEHR-NCLRTFSNIGGYSGVFLCGPYPYWIFVTIRG 945

Query: 118  ELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLK 177
             L  HPM++DG VS   PFHNVNCP GFLYFN++ ELRI +LP H+ YD  WP+RK+ L+
Sbjct: 946  ALCCHPMSVDGSVSCFVPFHNVNCPNGFLYFNSQGELRICMLPPHMKYDTAWPMRKITLR 1005

Query: 178  CTPHFLAYHLETKTYCIVTSTAEPSTD--YYKFNGEDKELVTDPRDSRFIPPLVSQFHVS 235
            C+ HFLAY +E K Y +VTS +EP T   Y  F  E +E     +  RFI P + +F V 
Sbjct: 1006 CSVHFLAYSIEHKVYALVTSVSEPCTRLPYLTFENE-REFEDLEKGDRFIYPHIDKFSVQ 1064

Query: 236  LFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRG 295
            L SP SW+ +P     + E+EH+ C+KNV +      S  + ++ LGT   + E+++ RG
Sbjct: 1065 LISPASWDLVPNARLDMGEFEHITCMKNVWLSCGQDSSARQNFLVLGTVNVFGEEMSSRG 1124

Query: 296  RILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKD 355
            +I++ ++IEVVPEPGQPLTKNK+K IY++EQKGPVTA+C + G L+TA+GQKI+IW+  +
Sbjct: 1125 KIIILEVIEVVPEPGQPLTKNKLKQIYSEEQKGPVTAVCGLEGNLLTAIGQKIFIWRFDE 1184

Query: 356  ND-LTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPN 414
            N  L G+AF+DT VYI   +S ++  LVGD  RSI LLRYQ +++TLS+ +RD +P +  
Sbjct: 1185 NQSLRGLAFVDTNVYIHHALSFRSFALVGDIQRSITLLRYQTDFKTLSVTSRDVRPLE-- 1242

Query: 415  SKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDK 474
                Y  +    ++DG                                + + F++SD +K
Sbjct: 1243 ---VYTADL---VVDG--------------------------------TGINFLVSDHEK 1264

Query: 475  NVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC----KPSSISDAPGARSRFL 530
            N+VLF Y PE  ES+GG RL K+ D H+G   N  +++      + + + + P A    +
Sbjct: 1265 NLVLFAYDPEDHESHGGSRLTKRADMHIGSRANCMWRVAACGVDRSTGLPNQPYA-GVHI 1323

Query: 531  TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFR 577
            T   +LDG++   LP+ EK YRRLLMLQN+M+T   H  GLNP+AFR
Sbjct: 1324 TMMGTLDGSICHVLPVAEKVYRRLLMLQNIMITGLQHIAGLNPKAFR 1370


>gi|291232722|ref|XP_002736302.1| PREDICTED: cleavage and polyadenylation specific factor 1-like
           [Saccoglossus kowalevskii]
          Length = 984

 Score =  441 bits (1135), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 205/396 (51%), Positives = 267/396 (67%), Gaps = 43/396 (10%)

Query: 16  IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGAL----KLRFKKLKVLFVSDRS 71
           IV+ELL + LG    +  LL R   +L IY+AF H + +L    +LRF+K          
Sbjct: 473 IVKELLLIGLGHKNKKTHLLARVDEDLYIYEAFTHDQSSLDNHLRLRFRK---------- 522

Query: 72  KRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVS 131
                                        VF+CGP+P WLF+TSRG LR+HPM IDG V+
Sbjct: 523 -----------------------------VFVCGPYPHWLFMTSRGALRSHPMHIDGSVT 553

Query: 132 TLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKT 191
             APFHN+NCP+GFLYFN   ELRI VLPTHLSYDA WPVRKVPL+CTPHF++YH+E+KT
Sbjct: 554 CFAPFHNINCPKGFLYFNKHGELRICVLPTHLSYDALWPVRKVPLRCTPHFISYHIESKT 613

Query: 192 YCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFP 251
           Y +VTS +EP     K  G+DKE     RD RFI P + +F + LFSP SWE IP T   
Sbjct: 614 YAVVTSVSEPCLRICKMTGDDKEFEDVERDDRFIFPTIEKFSLQLFSPLSWEAIPNTKID 673

Query: 252 LHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQ 311
             +WEH+  LK V ++ EGT+SGL+G+IA+ T   Y E+VTCRGRIL+FD+IEVVPEPGQ
Sbjct: 674 TEDWEHITGLKTVFLKSEGTVSGLKGFIAVSTTIVYGEEVTCRGRILIFDVIEVVPEPGQ 733

Query: 312 PLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIA 371
           PLTKNK+K++Y KEQKGPVT +C + G L  A+GQKI++W  ++NDL G+AFIDT+++I 
Sbjct: 734 PLTKNKLKLLYDKEQKGPVTTLCDIEGLLAAAIGQKIFLWAFRNNDLIGVAFIDTQIHIH 793

Query: 372 SMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARD 407
           ++ ++KN IL  D  +S++LLR+  E R+LSLV R+
Sbjct: 794 TLCTIKNFILAADIRKSVSLLRFSDEDRSLSLVTRE 829



 Score =  155 bits (392), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 82/187 (43%), Positives = 117/187 (62%), Gaps = 13/187 (6%)

Query: 457 DILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKP 516
           DI    S + F  SD+D+++ L       RES GG RL+++ DF+ G HV +FF++R K 
Sbjct: 806 DIRKSVSLLRF--SDEDRSLSLV-----TRESFGGQRLLRRADFNAGSHVCSFFRMRSKL 858

Query: 517 SS-----ISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGL 571
           S      +   P  R R +T +A+LDG++G+ +P+ EK YRRLLMLQN + T T HT GL
Sbjct: 859 SDPATEKLLTGPMER-RHVTMFATLDGSIGYLIPMTEKTYRRLLMLQNALTTQTLHTAGL 917

Query: 572 NPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI 631
           NP+ FR  K +     N  + I+DG L+WK+  LS+ ER E+ KKIG+    ILD+L D+
Sbjct: 918 NPKGFRMVKHQTKSLENTHKNILDGDLLWKYTFLSVNERTELAKKIGTSVEQILDDLMDV 977

Query: 632 EALSSHF 638
           E L++HF
Sbjct: 978 ERLTAHF 984



 Score = 44.3 bits (103), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 19/41 (46%), Positives = 27/41 (65%)

Query: 422 NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEF 462
           N  + I+DG L+WK+  LS+ ER E+ KKIG+    ILD+ 
Sbjct: 934 NTHKNILDGDLLWKYTFLSVNERTELAKKIGTSVEQILDDL 974


>gi|339253000|ref|XP_003371723.1| cleavage and polyadenylation specificity factor subunit 1
            [Trichinella spiralis]
 gi|316967988|gb|EFV52332.1| cleavage and polyadenylation specificity factor subunit 1
            [Trichinella spiralis]
          Length = 1376

 Score =  403 bits (1035), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 226/659 (34%), Positives = 348/659 (52%), Gaps = 98/659 (14%)

Query: 30   NRPLLLVRTQHELLIYQAFRHP----KGALKLRFKKLKVLFV------------------ 67
            +RP L    + +LLIY+AF +P    +  L +RFKK++   +                  
Sbjct: 754  DRPFLFAVVEEQLLIYEAFHYPYPQQRYRLSVRFKKVRHTAILQRFRRIGRDDFKLLADD 813

Query: 68   -----------------SDRSKRANEQPG------------LPRGVRISQMRYFSNIAGY 98
                             S+RS+R +   G            L       Q+  F N+AGY
Sbjct: 814  FQFSEQYRRRRKRSKHDSNRSRRGDRHSGRRQEAHEHEPYRLTYEAPARQLSPFENVAGY 873

Query: 99   QGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISV 158
             G+F+ G +P + FL+ +G+LR HPM IDGPV   AP+ +    R F YF A   +R+S 
Sbjct: 874  AGLFIGGGYPYFCFLSKQGDLRLHPMHIDGPVVAFAPYCSPKQLRAFAYFTADGMMRVSS 933

Query: 159  LPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTD 218
            LP+   +D   P  KV L    HF+ Y +E+ TY + TS   P        G+DK+  T 
Sbjct: 934  LPSKFDFDRSIPSMKVELGRAAHFVVYLMESHTYALTTSEQMPCHKVVTLIGDDKQFETF 993

Query: 219  PRDS-RFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRG 277
             R++  FI P + QF + L+S  +W  +P       E+EHV   + V ++ EG+ SGL+ 
Sbjct: 994  DREAPHFIYPTMEQFKLQLYSADTWLPVPGAELDFDEFEHVTACQEVQLKSEGSASGLQS 1053

Query: 278  YIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA 337
            Y+A+GT  NY E+V  RGR+L+ D++EVVPEP +P+TK K+K++Y+KEQKGPVT++C + 
Sbjct: 1054 YLAIGTVLNYGEEVLIRGRLLIIDVVEVVPEPDRPMTKFKLKVVYSKEQKGPVTSLCSLR 1113

Query: 338  GFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPE 397
            G+L+T +GQK+YIWQ KDN L GI+F+D +VY+  M S++ L L  D    ++LLRYQ E
Sbjct: 1114 GYLLTGMGQKVYIWQYKDNALVGISFLDLQVYVHQMASIRYLALTADAFFGVSLLRYQEE 1173

Query: 398  YRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHND 457
            Y+ LSLV+RD +P                  D  L  +FL                    
Sbjct: 1174 YKALSLVSRDPRP------------------DEVLAVEFLV------------------- 1196

Query: 458  ILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPS 517
               + + + F+++    +++ ++Y PE+ +S GG RL+ + D+H G  VN F ++RC   
Sbjct: 1197 ---DRTDLSFLMTSAAGDILTYVYLPESLDSFGGQRLVPQADYHFGSQVNAFVRMRCHAQ 1253

Query: 518  SISDAPGARSRFLT----WYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNP 573
             I  A   R   L      +AS DG++ + LPLPE+ YR L MLQ++++       GLN 
Sbjct: 1254 EI--AGRKRQEVLQRQGLIFASSDGSVNYLLPLPEREYRLLGMLQSLLIDMLPSFAGLNV 1311

Query: 574  RAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIE 632
              +RT +        P++ IIDG++   +L +   ++ +I ++IGS H+ I+ EL  +E
Sbjct: 1312 DDYRTVRFPNSCLREPTKNIIDGNICMLYLYIDALQQEDIVRQIGSSHSQIMLELAYME 1370


>gi|324499955|gb|ADY39993.1| Cleavage and polyadenylation specificity factor subunit 1 [Ascaris
            suum]
          Length = 1434

 Score =  394 bits (1013), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 220/641 (34%), Positives = 358/641 (55%), Gaps = 52/641 (8%)

Query: 11   AMDETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGA---LKLRFKKLKVLFV 67
            A  E I+ E+L   +G++  RP+L V     + +Y+ F +  G    L +RFK+L    V
Sbjct: 833  AKPEEIIVEVLLTGMGMNQGRPMLFVVVDDMVSVYEMFMYDNGVVEHLAVRFKRLPYTTV 892

Query: 68   SDRSKRANEQPGLPRGVRISQMRYFSNIAGYQ-------GVFLCGPHPAWLFLTSRGELR 120
            +   +        P       +RY + +  ++       GVF+C  +P  +FL   G LR
Sbjct: 893  TRSCRFQGNDGRAPVEAARDTVRYRTALHPFERIGNILNGVFICSSYPC-VFLMDSGILR 951

Query: 121  AHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKS-ELRISVLPTHLSYDAPWPVRKVPLKCT 179
             HP+ ++GP+ +   F+NV CP GF+Y   +   +RI+ LPT +  D+  PVRK+    T
Sbjct: 952  MHPLNLEGPILSFTAFNNVLCPNGFIYLTEREWAMRIAKLPTDVELDSSLPVRKIRTGRT 1011

Query: 180  PHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSP 239
             H + Y L++ TY +V S  +P+        EDK      +   F+ P +  + V L+SP
Sbjct: 1012 IHNIVYLLQSNTYAVVGSEKKPNNRLCVLVNEDKSFDEHEKADSFVLPELEVYDVKLYSP 1071

Query: 240  FSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILL 299
              W+ +P     + ++E + C + V +  EGT+SG++ Y+A+GT  NY E+V  RGRI++
Sbjct: 1072 EDWKPVPNAEIKMEDFEVLTCCEEVVLRSEGTVSGVQNYLAVGTACNYGEEVLVRGRIII 1131

Query: 300  FDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLT 359
             +IIEVVPEPGQP +K++IK +Y KEQKGPVT++C   G+L+  +GQK++IW  +DN+L 
Sbjct: 1132 SEIIEVVPEPGQPTSKHRIKTLYDKEQKGPVTSLCSCNGYLLAGMGQKVFIWLFRDNNLQ 1191

Query: 360  GIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYY 419
            GI+F+D   YI  +V V+NL L  D  RS+ALLRYQ EY+ LSL +RD            
Sbjct: 1192 GISFLDMHFYIHQLVGVRNLALACDIYRSVALLRYQEEYKALSLASRDM----------- 1240

Query: 420  AGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLF 479
                 R ++   +  +FL             I ++          M F++SD+  N+ +F
Sbjct: 1241 -----RAVVQPPMAAQFL-------------IDNRQ---------MAFIMSDEAANIAVF 1273

Query: 480  MYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSS--ISDAPGARSRFLTWYASLD 537
             Y PEA ES+GG RLI +++ ++G +VN+F +++   SS  + +   + +R    + SLD
Sbjct: 1274 NYLPEALESSGGERLILRSEINIGTNVNSFMRVKGHISSGFVENEHYSLNRQSVLFCSLD 1333

Query: 538  GALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGS 597
            G+ GF  PL EK +RRL MLQ +M +  +   GLN +  R  + +       +R ++DG 
Sbjct: 1334 GSFGFVRPLSEKVFRRLHMLQQLMSSLVAQAAGLNVKGSRAARPQRPNHYLNTRNMVDGD 1393

Query: 598  LVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
            +V+++L LSL ++ ++ +K+G+    I+D+L +I  L++H+
Sbjct: 1394 VVFQYLHLSLADKNDLARKLGTSRYHIIDDLTEISRLTTHY 1434


>gi|449661926|ref|XP_002167992.2| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Hydra magnipapillata]
          Length = 1122

 Score =  387 bits (994), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 190/449 (42%), Positives = 277/449 (61%), Gaps = 40/449 (8%)

Query: 190  KTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTN 249
            + Y + +S  E      +F+ E++E  T  R+ R+I P + +F VSL SP SWE +P + 
Sbjct: 714  QVYAVASSYTENQKKLPRFHTEEREFDTVEREPRYIYPQIERFVVSLISPTSWETVPNSR 773

Query: 250  FPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEP 309
              L E+EHV C+K + +  E    GL+ Y+ +GT +NY ED+ C+GRIL+FD++EVVPEP
Sbjct: 774  TVLQEFEHVTCMKVLLLHSELVDIGLKQYLVVGTTFNYGEDLACKGRILIFDVLEVVPEP 833

Query: 310  GQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVY 369
            GQPLTK K K +Y KEQKGPVTAIC  +G+++ AVGQKIY ++ KDNDL G+AF+D++V+
Sbjct: 834  GQPLTKTKCKCVYDKEQKGPVTAICATSGYIIAAVGQKIYAFKYKDNDLVGVAFVDSQVF 893

Query: 370  IASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIID 429
              ++++++N+I+  D +RSI+L+R+Q E+++L+LV+RD K  +  +  ++        ID
Sbjct: 894  TVNLMAIRNVIVAADISRSISLVRFQVEHKSLALVSRDTKTLEAYTSEFF--------ID 945

Query: 430  GSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESN 489
            GS V                                GF++SD ++N+V+F YQPEA ES 
Sbjct: 946  GSQV--------------------------------GFVVSDAERNIVIFSYQPEALESF 973

Query: 490  GGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEK 549
            GGHRL++K D ++G HVNT  +I+      S +  +  R L    +LDG++G   PL EK
Sbjct: 974  GGHRLLQKADINIGSHVNTMMRIKLIQDEQSLSKSSEQRQLIILPTLDGSIGILFPLSEK 1033

Query: 550  NYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGE 609
             +RRL MLQN +V    H  GLNPRAFR          NP R I+DG L+ K+ QLS  E
Sbjct: 1034 PFRRLTMLQNKLVDCLPHKAGLNPRAFRALDVPLRTLTNPHRNILDGQLLDKYAQLSFQE 1093

Query: 610  RLEICKKIGSKHNDILDELYDIEALSSHF 638
            R +I KK+G+    ILD++ DIE  S+H 
Sbjct: 1094 RFDIAKKMGTTSGQILDDMMDIERASNHL 1122


>gi|327287424|ref|XP_003228429.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Anolis carolinensis]
          Length = 1294

 Score =  386 bits (991), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 226/540 (41%), Positives = 304/540 (56%), Gaps = 114/540 (21%)

Query: 16   IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGALKLRFKKL--KVLFVSD 69
            +V+E+L V+LG   +RP LLV    ELLIY+AF H     +  LK+RFKK+   + F   
Sbjct: 847  LVKEVLLVALGNRQSRPYLLVHVDQELLIYEAFNHDSQLGQTNLKVRFKKVPHNINFREK 906

Query: 70   R---SKRANEQPG-----LPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRA 121
            +   SK+  E  G     +PRG R+++ RYF +I GY GVF+CGP P WL +TSRG LR 
Sbjct: 907  KPRPSKKKTESAGGEEASVPRG-RVARFRYFEDIYGYSGVFICGPSPHWLLVTSRGALRL 965

Query: 122  HPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPH 181
            HPMTIDGP+ + APFHN                                     + C   
Sbjct: 966  HPMTIDGPIESFAPFHN-------------------------------------VNCPKG 988

Query: 182  FLAYHLETKTYCIVTSTAEPSTDYYKFNGEDK-ELVTDPRDSRFIPPLVSQFHVSLFSPF 240
            FL ++ +     I  + +       +  GED  E  T  R      P     H  L   F
Sbjct: 989  FLYFNRQGTGGGIHNACSR----IPRMTGEDDMEFETIERGVLKCVPGEGFGHPDLILSF 1044

Query: 241  SWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLF 300
              +        L EWEHV C+K VS++ E T+SGL+GYIA+GT     E+VTCRGRIL+ 
Sbjct: 1045 KID--------LEEWEHVTCMKTVSLKSEETVSGLKGYIAVGTCLMQGEEVTCRGRILIM 1096

Query: 301  DIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTG 360
            DIIEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH  G+LV+A+GQKI++W LKDNDLTG
Sbjct: 1097 DIIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGYLVSAIGQKIFLWSLKDNDLTG 1156

Query: 361  IAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYA 420
            +AFIDT++YI  M+SVKN IL  D  +SI+LLRYQ E +TLSLV+RD KP          
Sbjct: 1157 MAFIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKP---------- 1206

Query: 421  GNPSRGIIDGSLVWKFLQLSLGERLEI-CKKIGSKHNDILDEFSSMGFMISDKDKNVVLF 479
                                    LE+ C        D + +   +GF++SD+D+N++++
Sbjct: 1207 ------------------------LEVYCV-------DFMVDSCQLGFLVSDRDRNLLVY 1235

Query: 480  MYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCK-----PSSISDAPGARSRFLTWYA 534
            MY PEA+ES GG RL+++ DFH+G HVN F++  C+     P+  S A    ++ +TW+ 
Sbjct: 1236 MYLPEAKESFGGMRLLRRADFHVGAHVNAFWRTPCRGAMEGPTKKSSA--WENKHITWFG 1293


>gi|268580265|ref|XP_002645115.1| Hypothetical protein CBG16808 [Caenorhabditis briggsae]
 gi|296439546|sp|A8XPU7.1|CPSF1_CAEBR RecName: Full=Probable cleavage and polyadenylation specificity
            factor subunit 1; AltName: Full=Cleavage and
            polyadenylation specificity factor 160 kDa subunit;
            Short=CPSF 160 kDa subunit
          Length = 1454

 Score =  370 bits (951), Expect = 1e-99,   Method: Compositional matrix adjust.
 Identities = 213/640 (33%), Positives = 344/640 (53%), Gaps = 56/640 (8%)

Query: 17   VQELLTVSLGLHGNRPLLLVRTQHELLIYQAFR--HPK-GALKLRFKKLKVL-------F 66
            V E   V +G++   P+L+     E+++Y+ F   +P+ G L + F+KL  L       +
Sbjct: 853  VVEAQIVGMGINQAHPVLIAIIDEEVVLYEMFASYNPQPGHLGVAFRKLPHLIGLRTSPY 912

Query: 67   VSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQ-GVFLCGPHPAWLFLTSRGELRAHPMT 125
            V+   KRA  +  +  G R + +  F  I+    GV + G  P  L   + G ++ H MT
Sbjct: 913  VNIDGKRAPFEMEMEHGKRYTLIHPFERISSINNGVMIGGAVPTLLVYGAWGGMQTHQMT 972

Query: 126  IDGPVSTLAPFHNVNCPRGFLYF-NAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLA 184
            IDG +    PF+N N   GF+Y    KSELRI+ +     YD P+PV+K+ +  T H + 
Sbjct: 973  IDGSIKAFTPFNNENVLHGFVYMTQQKSELRIARMHPDFDYDMPYPVKKIEVGKTVHNVR 1032

Query: 185  YHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEE 244
            Y + +  Y +V+S  +PS   +    +DK+     +D  F+ P   ++ ++LFS   W  
Sbjct: 1033 YLMNSDIYAVVSSVPKPSNKIWVVMNDDKQEEIHEKDENFVLPAPPKYTLNLFSSQDWAA 1092

Query: 245  IPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIE 304
            +P T F   + E V  +++V ++ E    GL  Y+AL T  NY E+V  RGRI+L ++IE
Sbjct: 1093 VPNTEFEFEDMEAVTAMEDVPLKSESRYGGLDTYLALATVNNYGEEVLVRGRIILCEVIE 1152

Query: 305  VVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFI 364
            VVPEPGQP +  KIK++Y KEQKGPVT +C + G L++ +GQK++IWQ KDNDL GI+F+
Sbjct: 1153 VVPEPGQPTSNRKIKVLYDKEQKGPVTGLCAINGLLLSGMGQKVFIWQFKDNDLMGISFL 1212

Query: 365  DTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPS 424
            D   Y+  + S++ + L  D   S++L+R+Q E + +S+ +RD      + K   A   S
Sbjct: 1213 DMHYYVYQLHSIRTIALALDARESMSLIRFQEENKAMSIASRD------DRKCAQAPMAS 1266

Query: 425  RGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPE 484
              ++DG                                  +GF++SD+  N+ LF Y PE
Sbjct: 1267 EFLVDG--------------------------------MHIGFLLSDEHGNITLFSYSPE 1294

Query: 485  ARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSI-SDAPGARS----RFLTWYASLDGA 539
            A ESNGG RL  K   ++G ++N F +++   S + S +P  R     R  T + SLDG+
Sbjct: 1295 APESNGGERLTVKAAINIGTNINAFLRVKGHTSLLDSSSPEERENIEQRMNTIFGSLDGS 1354

Query: 540  LGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYK-GKGYYAGNPSRGIIDGSL 598
             G+  PL EK+YRRL  LQ  + + T    GL+ +  R+ K  +    G  +R +IDG +
Sbjct: 1355 FGYIRPLTEKSYRRLHFLQTFIGSVTPQIAGLHIKGARSSKPSQPIVNGRNARNLIDGDV 1414

Query: 599  VWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
            V ++L LS+ ++ ++ +++G     ILD+L  +  ++ ++
Sbjct: 1415 VEQYLHLSVYDKTDLARRLGVGRYHILDDLMQLRRMAYYY 1454


>gi|308459872|ref|XP_003092248.1| CRE-CPSF-1 protein [Caenorhabditis remanei]
 gi|308253976|gb|EFO97928.1| CRE-CPSF-1 protein [Caenorhabditis remanei]
          Length = 1448

 Score =  369 bits (947), Expect = 3e-99,   Method: Compositional matrix adjust.
 Identities = 203/641 (31%), Positives = 345/641 (53%), Gaps = 58/641 (9%)

Query: 17   VQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRH---PKGALKLRFKKLKVLF------- 66
            V E   V +G++ + P+L+     ++++Y+ F H     G L + F+KL           
Sbjct: 847  VMEAQIVGMGINQSHPVLMAIVDEQVVMYEMFSHYNPQAGHLGIAFRKLPHFICLRTSSH 906

Query: 67   VSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQ-GVFLCGPHPAWLFLTSRGELRAHPMT 125
            ++   KRA  +  +  G R + +  F  I+    GV + G  P  +   + G ++ H MT
Sbjct: 907  LNSDGKRAPFEMEVENGKRYTLIHPFERISSINNGVMIGGAVPTLVVYGAWGGMQTHQMT 966

Query: 126  IDGPVSTLAPFHNVNCPRGFLYF-NAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLA 184
            IDGP+    PF+N N   GF+Y    KSELRI+ +     Y+ P+P++K+ +  T H + 
Sbjct: 967  IDGPIKAFTPFNNENVLHGFVYMTQQKSELRIARMHPDFDYEMPYPMKKIEVGRTIHNVR 1026

Query: 185  YHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEE 244
            Y + +  Y +V+S  +PS   +    +DK+     +D  F+ P   ++ ++LFS   W+ 
Sbjct: 1027 YLMNSDVYVVVSSIPKPSNKIWVVMNDDKQEEIHEKDENFVLPAPPKYTLNLFSSQDWKA 1086

Query: 245  IPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIE 304
            +P T     + E V   ++VS++ E T+SG+  Y+A+GT  NY E+V  RGRI+L ++IE
Sbjct: 1087 VPNTEIEFEDMEAVTACEDVSLKSESTISGVETYLAVGTVNNYGEEVLVRGRIILCEVIE 1146

Query: 305  VVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFI 364
            VVPEP QP +  KIK+++ KEQKGPVT +C + G L++ +GQK++IWQ KDNDL G++F+
Sbjct: 1147 VVPEPDQPTSNRKIKVLFDKEQKGPVTGLCAINGLLLSGMGQKVFIWQFKDNDLMGLSFL 1206

Query: 365  DTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPT-QPNSKGYYAGNP 423
            D   Y+  + S++ + L  D   S++L+R+Q E + +S+ +RD + T +P     +    
Sbjct: 1207 DMHYYVYQLHSLRTIALACDARESMSLIRFQEENKAMSIASRDDRRTAKPPMAAQF---- 1262

Query: 424  SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQP 483
               ++DG                                + +GF++SD++ N+ LF Y P
Sbjct: 1263 ---VVDG--------------------------------AHLGFLLSDENGNITLFNYSP 1287

Query: 484  EARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPS-----SISDAPGARSRFLTWYASLDG 538
            EA ESNGG RL  +   ++G +VN F +++   S     S  +      R  T + SLDG
Sbjct: 1288 EAPESNGGERLTVRAAMNIGTNVNAFLRVKGHTSLLNLQSDEEKESVEQRMSTIFGSLDG 1347

Query: 539  ALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYK-GKGYYAGNPSRGIIDGS 597
            + GF  PL EK+YRRL  LQ  + + T    GL+ +  R+ +  +    G  +R +IDG 
Sbjct: 1348 SFGFVRPLSEKSYRRLHFLQTFIGSVTPQIAGLHIKGARSARPAQPIVNGRNARNLIDGD 1407

Query: 598  LVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
            +V ++L LSL ++ ++ +++G     I+D+L  +  ++ ++
Sbjct: 1408 VVEQYLHLSLYDKTDLARRLGVGRYHIIDDLMHLRRMAYYY 1448


>gi|25148482|ref|NP_500157.2| Protein CPSF-1 [Caenorhabditis elegans]
 gi|22096347|sp|Q9N4C2.2|CPSF1_CAEEL RecName: Full=Probable cleavage and polyadenylation specificity
            factor subunit 1; AltName: Full=Cleavage and
            polyadenylation specificity factor 160 kDa subunit;
            Short=CPSF 160 kDa subunit
 gi|373220398|emb|CCD73182.1| Protein CPSF-1 [Caenorhabditis elegans]
          Length = 1454

 Score =  363 bits (933), Expect = 1e-97,   Method: Compositional matrix adjust.
 Identities = 203/641 (31%), Positives = 339/641 (52%), Gaps = 58/641 (9%)

Query: 17   VQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPK---GALKLRFKKLKVLF------- 66
            V E   V +G++   P+L+     ++++Y+ F       G L + F+KL           
Sbjct: 853  VLEAQIVGMGINQAHPILMAIVDEQVVLYEMFSSSNPIPGHLGISFRKLPHFICLRTSSH 912

Query: 67   VSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQ-GVFLCGPHPAWLFLTSRGELRAHPMT 125
            ++   KRA  +  +  G R S +  F  ++    GV + G  P  L   + G ++ H MT
Sbjct: 913  LNSDGKRAPFEMKINNGKRFSLIHPFERVSSVNNGVMIVGAVPTLLVYGAWGGMQTHQMT 972

Query: 126  IDGPVSTLAPFHNVNCPRGFLYFNA-KSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLA 184
            +DGP+    PF+N N   G +Y    KSELRI+ +     Y+ P+PV+K+ +  T H + 
Sbjct: 973  VDGPIKAFTPFNNENVLHGIVYMTQHKSELRIARMHPDFDYEMPYPVKKIEVGRTIHHVR 1032

Query: 185  YHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEE 244
            Y + +  Y +V+S  +PS   +    +DK+     +D  F+ P   ++ ++LFS   W  
Sbjct: 1033 YLMNSDVYAVVSSIPKPSNKIWVVMNDDKQEEIHEKDENFVLPAPPKYTLNLFSSQDWAA 1092

Query: 245  IPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIE 304
            +P T     + E V   ++V+++ E T+SGL   +A+GT  NY E+V  RGRI+L ++IE
Sbjct: 1093 VPNTEISFEDMEAVTACEDVALKSESTISGLETLLAMGTVNNYGEEVLVRGRIILCEVIE 1152

Query: 305  VVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFI 364
            VVPEP QP +  KIK+++ KEQKGPVT +C + G L+  +GQK++IWQ KDNDL GI+F+
Sbjct: 1153 VVPEPDQPTSNRKIKVLFDKEQKGPVTGLCAINGLLLCGMGQKVFIWQFKDNDLMGISFL 1212

Query: 365  DTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVAR-DYKPTQPNSKGYYAGNP 423
            D   Y+  + S++ + +  D   S++L+R+Q + + +S+ +R D K  QP          
Sbjct: 1213 DMHYYVYQLHSLRTIAIACDARESMSLIRFQEDNKAMSIASRDDRKCAQPPMA------- 1265

Query: 424  SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQP 483
            S+ ++DG+ V                                GF++SD+  N+ +F Y P
Sbjct: 1266 SQLVVDGAHV--------------------------------GFLLSDETGNITMFNYAP 1293

Query: 484  EARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSI-----SDAPGARSRFLTWYASLDG 538
            EA ESNGG RL  +   ++G ++N F ++R   S +      +      R  T +ASLDG
Sbjct: 1294 EAPESNGGERLTVRAAINIGTNINAFVRLRGHTSLLQLNNEDEKEAIEQRMTTVFASLDG 1353

Query: 539  ALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYK-GKGYYAGNPSRGIIDGS 597
            + GF  PL EK+YRRL  LQ  + + T    GL+ +  R+ K  +    G  +R +IDG 
Sbjct: 1354 SFGFVRPLTEKSYRRLHFLQTFIGSVTPQIAGLHIKGSRSAKPSQPIVNGRNARNLIDGD 1413

Query: 598  LVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
            +V ++L LSL ++ ++ +++G     I+D+L  +  ++ ++
Sbjct: 1414 VVEQYLHLSLYDKTDLARRLGVGRYHIIDDLMQLRRMAFYY 1454


>gi|357611296|gb|EHJ67409.1| putative cleavage and polyadenylation specific factor 1 [Danaus
           plexippus]
          Length = 328

 Score =  362 bits (928), Expect = 5e-97,   Method: Compositional matrix adjust.
 Identities = 179/368 (48%), Positives = 244/368 (66%), Gaps = 49/368 (13%)

Query: 272 LSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVT 331
           LSGLRGYIA+GTNYNY ED+T RGRIL++DII+VVPEPGQPLTKN+ K IYAKEQKGPVT
Sbjct: 9   LSGLRGYIAIGTNYNYGEDITSRGRILIYDIIDVVPEPGQPLTKNRFKEIYAKEQKGPVT 68

Query: 332 AICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIAL 391
           A+  V GFL++AVGQKIY+WQLKDNDL G+AFIDT++Y+  M++VKNLILV D  +SI+L
Sbjct: 69  ALTQVLGFLISAVGQKIYLWQLKDNDLVGVAFIDTQIYVHRMLAVKNLILVADVYKSISL 128

Query: 392 LRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI 451
           LRYQ ++RTLSLV+RD +  Q     +   N                             
Sbjct: 129 LRYQHQHRTLSLVSRDLRTAQIYDMQFMIDN----------------------------- 159

Query: 452 GSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFK 511
                      +S+GF++S+ + N  ++M+QP+ARES GG RLI+K D+HLGQ V+  F+
Sbjct: 160 -----------TSLGFLVSESEGNFAMYMHQPQARESYGGQRLIRKCDYHLGQRVHAMFR 208

Query: 512 IRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGL 571
           +         A G R   +T + +LDG +G+ LP+ EK YRRLLMLQNV+  +  H  GL
Sbjct: 209 L--------AARGERQTHVTMFTTLDGGVGYVLPVSEKVYRRLLMLQNVINNYCCHLAGL 260

Query: 572 NPRAFRTYK-GKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYD 630
           NP+A+RTYK  +    G  +RG++DG LV  +  +   E+ +I +KIG+K  +I+ +LY+
Sbjct: 261 NPKAYRTYKVSRRALCGGAARGVLDGDLVSLYTSMPRTEQQDIARKIGTKVEEIMSDLYE 320

Query: 631 IEALSSHF 638
           I+  ++HF
Sbjct: 321 IDRQTAHF 328


>gi|341892673|gb|EGT48608.1| CBN-CPSF-1 protein [Caenorhabditis brenneri]
          Length = 1440

 Score =  360 bits (924), Expect = 1e-96,   Method: Compositional matrix adjust.
 Identities = 204/640 (31%), Positives = 349/640 (54%), Gaps = 57/640 (8%)

Query: 17   VQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPK---GALKLRFKKLKVLFVSDRS-- 71
            + E   V +G++ + P+L+     ++++Y+ F +P    G L + F+KL   F+  RS  
Sbjct: 840  IMEAQIVGMGINQSHPILMAIVDEQVIMYEMFANPNSQPGHLGIAFRKLP-HFICLRSSP 898

Query: 72   ------KRANEQPGLPRGVRISQMRYFSNIAGYQ-GVFLCGPHPAWLFLTSRGELRAHPM 124
                  KRA  Q     G R   +  F  ++    GV + G  P  L   + G ++ HPM
Sbjct: 899  YLKSDGKRAAFQIVEEDGKRYPLIHSFERVSTVNNGVIIGGAVPTLLVYGAWGGMQTHPM 958

Query: 125  TIDGPVSTLAPFHNVNCPRGFLYF-NAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFL 183
            TIDG +    PF+  N P GF+Y    KSELRI+ +     Y+ P+PV+K+ +  T H +
Sbjct: 959  TIDGSIKAFTPFNIDNVPYGFVYMTQKKSELRIAKMHADFDYEMPYPVKKIEVGRTIHSV 1018

Query: 184  AYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWE 243
             Y + +  Y +V+S  +PS   +    +DK+     +D  F+ P   ++ ++LFS   W+
Sbjct: 1019 RYLMNSDVYVVVSSVPKPSNKIWVVMNDDKQEEIHEKDENFVLPAPPKYTLNLFSSQDWK 1078

Query: 244  EIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDII 303
             +P T     + E V   ++V+++ E T +G   Y+A+GT  NY E+V  RGRI+L ++I
Sbjct: 1079 AVPNTEISFEDMEAVTACEDVALKSESTHTGFETYLAIGTVNNYGEEVLVRGRIILAEVI 1138

Query: 304  EVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAF 363
            EVVPEPGQP +  KIK+++ KEQKGPVT +C + G L++ +GQK++IWQ KDNDL G++F
Sbjct: 1139 EVVPEPGQPTSNRKIKVLFDKEQKGPVTGLCAMEGLLLSGMGQKVFIWQFKDNDLMGLSF 1198

Query: 364  IDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNP 423
            +D   Y+  + S++++ L  D   S++L+R+Q E + +S+ +RD      + K   A   
Sbjct: 1199 LDMHYYVYQLHSLRSIALACDARESMSLIRFQEENKAMSVASRD------DRKCAQAPMA 1252

Query: 424  SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQP 483
            ++ ++DG                                + +GF++SD++ N+ LF Y P
Sbjct: 1253 AQFMVDG--------------------------------AHIGFLLSDENGNITLFNYAP 1280

Query: 484  EARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSIS----DAPGARSRFLTWYASLDGA 539
            EA ESNGG RL  +   ++G ++N F +++   + ++    +   A  R  T +ASLDG+
Sbjct: 1281 EAPESNGGERLTVRAAINIGTNINAFLRVKGHTALLNLHEFEKEAAEQRMSTIFASLDGS 1340

Query: 540  LGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYK-GKGYYAGNPSRGIIDGSL 598
             GF  PL EK+YRRL  LQ  + + +    GL+ +  R+ K  +    G  +R +IDG +
Sbjct: 1341 FGFIRPLTEKSYRRLHFLQTFIGSVSQQIAGLHIKGARSAKPPQPIVNGRNARNLIDGDV 1400

Query: 599  VWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
            V ++L LS  ++ ++ +++G     I+D+L ++  ++ ++
Sbjct: 1401 VEQYLNLSTYDKTDLARRLGVGKYHIIDDLMELRRMAFYY 1440


>gi|313232279|emb|CBY09388.1| unnamed protein product [Oikopleura dioica]
          Length = 1451

 Score =  360 bits (923), Expect = 2e-96,   Method: Compositional matrix adjust.
 Identities = 216/662 (32%), Positives = 341/662 (51%), Gaps = 83/662 (12%)

Query: 4    FRSHSPSAMDETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQ------AFRHPKGALKL 57
            F       +D   VQE+   ++G   + P ++V    +L+IY+       F+     L  
Sbjct: 846  FEGSEGRRVDVLDVQEMNVFNMG-PSSLPYIVVMIGDQLMIYRFRATLNRFQTESPVLSG 904

Query: 58   RFKKLKVLFVSDRSKRANEQPGL--------PRGVRISQMRYFSNIAGYQGVFLCGPHPA 109
            RF KL+     D++K     PG+         R  +I  MR F NI+ + G+FL G +P 
Sbjct: 905  RFIKLQ-----DKTKLLRRIPGVHDESSKTKNRNNKI--MRQFMNISDHNGIFLGGAYPT 957

Query: 110  WLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYF-NAKSELRISVLPTHLSYDAP 168
            W+F    G L  H M  +G V+   PF N  C  GFLYF ++   L ++ L   L YDA 
Sbjct: 958  WIFCGQNGRLNIHSMWQEGFVNAFTPFDNEKCADGFLYFRHSTKTLTVANLQPFLKYDAD 1017

Query: 169  WPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGED-KELVTDPRDSRFIPP 227
            WP +K+ L  TP F +Y LE K   +  S +E      K N E  KE    P        
Sbjct: 1018 WPFKKIKLNYTPCFSSYDLEQKVLTVCGSRSEKIEMLPKINAEGHKEYEDLPEVQNVETQ 1077

Query: 228  LVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNY 287
            L  QF V +FSP SWE IP +   +   EH+LC ++V ++ E ++SG + YIA+GT+   
Sbjct: 1078 LFPQFFVEMFSPASWEVIPNSRIEMDAHEHILCCRSVYLKSEASMSGRKQYIAIGTSNIC 1137

Query: 288  SEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQK 347
             ED   RGR++L ++I+VVPEPG+PLT+ K K ++   Q+GPV+A+  + G L+ A+GQK
Sbjct: 1138 GEDFQSRGRLILLEVIDVVPEPGKPLTRYKYKTVFDASQRGPVSAVDSLDGALIAAIGQK 1197

Query: 348  IYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARD 407
            ++I   +D++L    F+DT++Y  +    KN  LVGD  + I LLR+Q E   +S ++R 
Sbjct: 1198 VFIHAFQDDNLRATGFVDTQLYTHATHCFKNYALVGDIQQGITLLRHQGERNCISQISRA 1257

Query: 408  YKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGF 467
             +  +  + G         ++DG+ V                                G 
Sbjct: 1258 RRAGEVTAVGI--------LLDGNQV--------------------------------GL 1277

Query: 468  MISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHV-----------NTFFKIRCKP 516
            + +D  +N+ ++MY+P+ +ESNGG +L+++ D +LG+ V           +TF K+    
Sbjct: 1278 VSTDMQRNLQVYMYKPDQKESNGGKQLVRQADINLGKRVISIWNSLGRQNDTFTKVALTE 1337

Query: 517  SSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAF 576
            +         +R +T+YA LDG++G  +P+ EK +RRL MLQ ++ +H  H GGLNPR +
Sbjct: 1338 ND--------ARHVTFYAGLDGSIGDIVPVSEKVFRRLEMLQTLVQSHLPHYGGLNPREY 1389

Query: 577  RTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSS 636
            R    +     N ++ IIDG L+ +F  LS  E+ ++ +KIG     +LD++ D++   +
Sbjct: 1390 RYCTNEYRDLENAAKNIIDGDLLERFNGLSFTEQTDLSRKIGVTREALLDDMMDVQRTKN 1449

Query: 637  HF 638
             F
Sbjct: 1450 LF 1451


>gi|320169222|gb|EFW46121.1| cleavage and polyadenylation specificity factor 1 [Capsaspora
            owczarzaki ATCC 30864]
          Length = 1725

 Score =  348 bits (894), Expect = 4e-93,   Method: Compositional matrix adjust.
 Identities = 195/549 (35%), Positives = 295/549 (53%), Gaps = 52/549 (9%)

Query: 91   YFSNIAGYQ---GVFLCGPHPAWLFLT-SRGELRAHPMTIDGPVSTLAPFHNVNCPRGFL 146
            Y   + G+Q   GVF+CG  P WL ++ +R  LRAH M  DG VS  + F+N  CP GF+
Sbjct: 1214 YTGVLGGHQLCSGVFVCGRRPLWLLMSPTRKALRAHLMLTDGSVSAFSAFNNNACPGGFV 1273

Query: 147  YFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYY 206
            YF  +  LR   L    ++D PWPVR+VPL+ T H++ YH   +TY +VTS  +P  +  
Sbjct: 1274 YFTTQGTLRFCQLAPTTNHDNPWPVRRVPLRATAHYIGYHEVFRTYVLVTSHPKPYFNLP 1333

Query: 207  KF-NGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVS 265
            +  N E    V      R IP     F + L SP +WE I   +F L  +E V  +   +
Sbjct: 1334 RLTNDETYTPVPYTPKPRAIPATFDTFSLQLISPVTWESI--HSFDLPAFERVTSVDIAA 1391

Query: 266  MEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKE 325
            +  + T++GL+ Y+ +GT     EDVTC GRI++F+II+VVPE  +P T  K+K +  +E
Sbjct: 1392 ITSQETVTGLKDYVVIGTTVIEGEDVTCHGRIIVFEIIDVVPEVNRPQTNRKLKYLMERE 1451

Query: 326  QKGPVTAICHVAGFLVTAVGQKIYIWQLKDND-LTGIAFIDTEVYIASMVSVKNLILVGD 384
            QKG +TA+ HV G LV+ +GQKI IWQ   +D + G+AFIDT+ ++ S+ ++KN ILVGD
Sbjct: 1452 QKGAITALSHVCGHLVSCIGQKIIIWQFASDDTMDGVAFIDTQTFVVSVSAIKNFILVGD 1511

Query: 385  YARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGER 444
               S+ LLR+    + L  +ARD+      S  +        ++DG              
Sbjct: 1512 LNNSVFLLRFNETTKHLGFIARDFDHMSVASTQF--------LVDG-------------- 1549

Query: 445  LEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQ 504
                              SS+GF+ +D  +N+V+F Y P  RESN G RL+++ DFH+G 
Sbjct: 1550 ------------------SSLGFLATDSHQNLVVFAYNPLNRESNNGQRLLRQLDFHVGS 1591

Query: 505  HVNTFFKI--RCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMV 562
            HV    ++  R  P S+ D   +  R +   A+L+G+L    P+ E  +RRL  LQ  +V
Sbjct: 1592 HVQQVLRMVPRSLPVSV-DRGASVKRHIDLLATLEGSLNALAPIGETTFRRLEWLQRQLV 1650

Query: 563  THTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHN 622
                   GLNP  +R Y+         +  +IDG L+ +FL L L E+ E+ ++  +   
Sbjct: 1651 G-LQQRAGLNPIGYRAYRFPRKMTTTRAGNVIDGELLSRFLYLGLAEQRELARQRRNTPE 1709

Query: 623  DILDELYDI 631
            D++D++  +
Sbjct: 1710 DLIDDILSV 1718


>gi|358338426|dbj|GAA28838.2| cleavage and polyadenylation specificity factor subunit 1 [Clonorchis
            sinensis]
          Length = 1741

 Score =  346 bits (888), Expect = 2e-92,   Method: Compositional matrix adjust.
 Identities = 210/679 (30%), Positives = 353/679 (51%), Gaps = 50/679 (7%)

Query: 7    HSPSAMDETI---VQELLTVSLGLHGNRPLLLVRTQHELLIYQAF------RHPKGALK- 56
            + P+A ++ I   V E+    +G + +RP+LLVRT  E+  ++A        HP  +   
Sbjct: 1066 NCPAAEEDNIPPTVLEITVFPIGRNRDRPVLLVRTSQEIAFFEALCPSHNEAHPFASESW 1125

Query: 57   ----LRFKKLKVL--FVSDRSKRANEQPGLPRGVRISQ---MRYFSNIAGYQGVFLCGPH 107
                LR+++L +    V+ R  R + +    +   +++   +R F +I G+ GVF+CG  
Sbjct: 1126 SQEGLRWRRLPIPCPLVAPRRVRTDPKIADVQSTMLTRKNLLRPFEDIDGHCGVFVCGAT 1185

Query: 108  PAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDA 167
            P WLF +  G +R    +IDG + + AP +   CP GF+YF   +E+R++ L    S+  
Sbjct: 1186 PIWLFSSDTGHIRVFNHSIDGIMGSFAPLNTDICPSGFVYFTYSNEMRLATLLPGYSFKE 1245

Query: 168  PWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGE-DKELVTDPRDSRFIP 226
               +R VPL+ TP+FL YH+E+KTY +V +  +  +  Y  N E +KE     R    + 
Sbjct: 1246 HLGMRWVPLELTPYFLQYHIESKTYALVGTRVKSCSSVYHLNAEGNKEEEVLLRPPTCVL 1305

Query: 227  PLVSQFHVSLFSPFS-------WEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYI 279
            P +  + + +++P +       W+ IP        WE V C+    +  E T  G + Y+
Sbjct: 1306 PSLDYYVLQMYAPSTSLAEATPWQAIPHACIDFEPWEVVTCMITAQLSSEQTFHGTKDYL 1365

Query: 280  ALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGF 339
            ALG N +Y E++  RGRI++ D+I+VVPEPGQPLT++K+K IY  EQKGPVTA+    G 
Sbjct: 1366 ALGANLSYGEEIPVRGRIIILDVIDVVPEPGQPLTRHKLKTIYDGEQKGPVTALSSCQGH 1425

Query: 340  LVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYR 399
            LV+A+GQK+YIW LK+ DL G+AF+D+E+YI S++ VKNLIL  D  +SI LLR+Q + R
Sbjct: 1426 LVSAIGQKVYIWTLKNADLVGVAFVDSELYIHSLLCVKNLILAADVLKSIQLLRFQSDLR 1485

Query: 400  TLSLVARDYKPTQPNSKGYYAGNPSRGII-----DGSLVWKFLQLSLGERLEICKKIGSK 454
             LS+V+RD  P +  +  ++      G +        +++ +  L    R    +++  +
Sbjct: 1486 VLSVVSRDAIPREVYTSNFFVDGRRLGFLVTDERGNVVIYSYDPLEPSSRSG--RRLVRR 1543

Query: 455  HNDILDEFSSMGFMISDKDKNVVLFMYQ---------PEARESNGGHRLIKKT------D 499
             +  L   +     ++++ ++ +L +           P A    GG  ++++T       
Sbjct: 1544 ADMCLPTRAISSLRVANRLRHALLSVKSAGTGTQTTVPSA-AGVGGSEVLERTGKTGVSS 1602

Query: 500  FHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQN 559
            F      N+   +     S ++    + +   +  +  GA+    PL +K Y RL + + 
Sbjct: 1603 FVAPGRANSASAMTLSTPSATNIDPEKLKHSVYLGTQTGAVFLIGPLRDKMYSRLRITEK 1662

Query: 560  VMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGS 619
             ++ H   T GL P+    Y+       NPS  + D  L+W++L L   +RLEI KK G 
Sbjct: 1663 NLIHHFGPTCGLLPKLCWNYRPSAPELVNPSGQVADADLLWRYLTLPHSQRLEIAKKSGQ 1722

Query: 620  KHNDILDELYDIEALSSHF 638
                I+D++ ++ A + HF
Sbjct: 1723 SLEGIMDDIAELNATTLHF 1741


>gi|312069702|ref|XP_003137805.1| hypothetical protein LOAG_02219 [Loa loa]
          Length = 1065

 Score =  339 bits (869), Expect = 3e-90,   Method: Compositional matrix adjust.
 Identities = 212/647 (32%), Positives = 333/647 (51%), Gaps = 103/647 (15%)

Query: 10   SAMDETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP---KGALKLRFKKLKVLF 66
            +A  E ++ ELL V +G++  RP+L +     + +Y+ F +    +G L +RFK+L    
Sbjct: 504  AAKPEEVIMELLMVGMGMNQGRPMLFLLIDDTVSVYEMFTYNNGIQGHLAVRFKRLPYTV 563

Query: 67   VSDRSKRANEQPGLPRGVRISQMR----------YFSNIAGY-QGVFLCGPHPAWLFLTS 115
            V+ RS R     GL     +  +R          +F  I     GVF+C  +P   FL +
Sbjct: 564  VT-RSCRFQ---GLDGRAAVESVRDAVRHKTVLHFFERIGNVLNGVFICSSYPCIFFLET 619

Query: 116  RGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSEL-RISVLPTHLSYDAPWPVRKV 174
             G  R HP+ +DGP+ +   F+N  CP GF+Y   +  L R+          A  PV K+
Sbjct: 620  -GVPRLHPVNLDGPILSFTTFNNAACPNGFIYLTERERLMRV----------AKLPVTKM 668

Query: 175  PLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHV 234
                              C++ +             +DK      +   F+ P + Q+ +
Sbjct: 669  ------------------CVLIN-------------DDKTFEEHEKPDTFVYPEMDQYKL 697

Query: 235  SLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCR 294
             L+SP  W+ +        E+E V C + V +  EGT+SG++ Y+A+GT  NY E+V  R
Sbjct: 698  QLYSPEDWKPVQNVEVLFEEFEVVTCCEEVVLRSEGTVSGVQNYLAVGTACNYGEEVLVR 757

Query: 295  GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLK 354
            GRI++ +IIEVVPEPGQP +K++IK +Y KEQKGPVT++C   G+L+T +GQK++IW  K
Sbjct: 758  GRIIISEIIEVVPEPGQPTSKHRIKTLYDKEQKGPVTSLCSCNGYLLTGMGQKVFIWLFK 817

Query: 355  DNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKP-TQP 413
            DN+L GI+F+D   Y+  ++ V+NL L  D  RS+ALLRYQ EY+ LSL +RD +   QP
Sbjct: 818  DNNLQGISFLDMHFYVHQLIGVRNLALACDMYRSVALLRYQEEYKALSLASRDMRSDVQP 877

Query: 414  NSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKD 473
                 +       IID                                   MGF++SD+ 
Sbjct: 878  PMAAQF-------IIDN--------------------------------KQMGFVMSDEA 898

Query: 474  KNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSS--ISDAPGARSRFLT 531
             N+ +F Y PE  ES GG +L  + + ++G  VN+F +++   SS  + +   +  R   
Sbjct: 899  ANIAIFNYLPETLESLGGEKLTLRAEINIGTVVNSFIRVKGHISSGFVENELFSLERQSV 958

Query: 532  WYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSR 591
             +ASLDG+ GF  PL EK +RRL MLQ +M +      GLN +  R  +         +R
Sbjct: 959  LFASLDGSFGFLRPLTEKVFRRLHMLQQLMSSMVPQPAGLNAKGARAARPPRPNHYLNTR 1018

Query: 592  GIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
             ++DG +V ++L LSL E+ ++ +K+G+    I+D+L +I  +++H+
Sbjct: 1019 NLVDGDMVMQYLHLSLPEKNDLARKLGTSRYHIIDDLIEICRVTAHY 1065


>gi|384487281|gb|EIE79461.1| hypothetical protein RO3G_04166 [Rhizopus delemar RA 99-880]
          Length = 1468

 Score =  338 bits (866), Expect = 7e-90,   Method: Compositional matrix adjust.
 Identities = 204/663 (30%), Positives = 342/663 (51%), Gaps = 88/663 (13%)

Query: 17   VQELLTVSLGLHGNRPLLLVRTQ-HELLIYQAFRHPKGA----LKLRFKKLKVLFVSDRS 71
            +QE+L   +G     P L+VRT  ++++IY+AF +   +    L LRF +++  +VS +S
Sbjct: 853  IQEILMTHIGKERKDPHLVVRTDTNDIIIYKAFTYLDESSPDRLALRFSRVQHEYVSRKS 912

Query: 72   KRANEQPGLPRGV------------------RISQMRY-----------FSNIAGYQGVF 102
                 +P   RG+                  ++S  +            F+++AGY GVF
Sbjct: 913  SSHESKPKKKRGIIDEFEIPDTDLNEEEEDLKLSTKKMDKKIQRKLLIPFTDVAGYAGVF 972

Query: 103  LCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTH 162
            + G  PAWL  + +  +R HPM  +  +     FHNVNC  GF+  ++KS +++S L T 
Sbjct: 973  VAGAQPAWLMCSCKSFVRVHPMKTEHEIVGFTQFHNVNCQHGFITVDSKSTIQLSRLRTE 1032

Query: 163  -LSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELV----T 217
             ++YD  W ++KV L  T H + YH   + Y ++ S++ P+    +   +D + +    T
Sbjct: 1033 GINYDLDWVIQKVLLGQTVHKIQYHPVMRVYAVLVSSSVPT----RMKNDDNQYIDGKET 1088

Query: 218  DPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRG 277
            D R      P + QF + L SP +WE + +  F   E+E    L+   ++ + T +G + 
Sbjct: 1089 DERGPGEFLPEMEQFSMILVSPVTWEIVDKVEF--EEFEQCFSLECALLDSKQTSTGRKY 1146

Query: 278  YIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA 337
            Y+ +GT     ED T +G I ++DIIEVVPEP  P T +K K +  ++ KG VTA+C V+
Sbjct: 1147 YMIIGTGTLKGEDTTMKGSIRMYDIIEVVPEPDNPQTNHKFKPVLTEDVKGAVTAMCTVS 1206

Query: 338  GFLVTAVGQKIYIWQLKDND-LTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQP 396
            G L   +G K+ +W L+D++ L G+AFID ++Y+ SM S+KN IL+GD  +SI  L +Q 
Sbjct: 1207 GHLAACIGSKVIVWSLEDDERLVGVAFIDVQIYVTSMSSIKNFILIGDAQKSIWFLGFQL 1266

Query: 397  EYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHN 456
            E   L+L+ +DY+            +   G +D                           
Sbjct: 1267 EPAKLTLLGKDYQ------------SFDVGCVDF-------------------------- 1288

Query: 457  DILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKP 516
             I+D+  S+  ++ D ++N+ L+ Y P   +S GG +L+++ DFH+G  V T  ++    
Sbjct: 1289 -IIDD-KSLYLIVGDTNENIDLYQYAPFNLQSFGGQKLMRRGDFHVGSQVQTMVRLPQIE 1346

Query: 517  SSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAF 576
             +      +R  F     + +G++     + EK ++RL  L   +V +  H  GLNPRAF
Sbjct: 1347 KTEKGFEYSRRHFCLC-GTFNGSIAVISSISEKTFKRLNTLYGHLVNNLQHVAGLNPRAF 1405

Query: 577  RTYKG-KGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALS 635
            R  KG K   + N ++ ++DG L+++F  LS+ E+ E  K+IG+    I+++L DIE   
Sbjct: 1406 RLIKGPKQRMSTNRTKAVLDGDLIFEFAGLSIEEQKETTKQIGTTVTRIMEDLVDIECSI 1465

Query: 636  SHF 638
            +HF
Sbjct: 1466 NHF 1468


>gi|353231025|emb|CCD77443.1| putative cleavage and polyadenylation specificity factor cpsf
            [Schistosoma mansoni]
          Length = 1825

 Score =  335 bits (860), Expect = 3e-89,   Method: Compositional matrix adjust.
 Identities = 201/682 (29%), Positives = 339/682 (49%), Gaps = 69/682 (10%)

Query: 17   VQELLTVSLGLHGNRPLLLVRTQHELLIYQAF------RHP-------KGALKLRFKKLK 63
            + E+L   +G+  +RP+L+VRT  E+  ++A        +P       +G L+ R   L 
Sbjct: 1153 ILEILVYPIGIDKDRPVLMVRTSQEIAFFEALCPSPDESYPLISGTFYEGRLRWRRLPLP 1212

Query: 64   VLFVSDRSKRANEQPGLPRGV---RISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELR 120
               V+ R  R + +    +     R   +R F NI  ++GVF+CG +P WLF T  G+LR
Sbjct: 1213 CPLVAPRRVRTDPKIMDVQSTLLTRTHMLRSFENIGDHRGVFVCGGNPIWLFATDSGQLR 1272

Query: 121  AHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTP 180
              P +IDG + + AP +   C  GF+YF   +E+R++ LP   S++    ++ + L   P
Sbjct: 1273 VFPHSIDGIMGSFAPLNAKICHSGFVYFTFSNEMRLATLPPGYSFNEHLGIKWITLDPVP 1332

Query: 181  HFLAYHLETKTYCIVTSTAEPSTDYYKFNGE-DKELVTDPRDSRFIPPLVSQFHVSLFSP 239
            +++ YH+E+KTY +V   +EP    ++ N E +KE     R    + P +  + + +++P
Sbjct: 1333 YYVQYHVESKTYAVVGIHSEPCKSVFRLNAEGNKEEDVLVRPKTCVLPTLDYYSLQMYAP 1392

Query: 240  F----------SWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSE 289
                        W  IP T      WE V CL    +  E T  G + Y+ALG N  Y E
Sbjct: 1393 NLNANHRNKQPPWLLIPNTLIEFEPWEVVTCLITAQLASEETFHGTKDYLALGANLTYGE 1452

Query: 290  DVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIY 349
            ++  RGRIL+ D+I+VVPEPGQPLT++K+K+I+  EQKGPVTA+    G L++A+GQKIY
Sbjct: 1453 EIPVRGRILILDVIDVVPEPGQPLTRHKLKIIHDGEQKGPVTALTSCQGHLISAIGQKIY 1512

Query: 350  IWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYK 409
            IW LK+ DL G+AF+D+E+YI +++ VKNL+L  D  +S+ LLR+Q + R LS+V+RD  
Sbjct: 1513 IWTLKNTDLVGVAFVDSELYIHNLLCVKNLVLAADVLKSVQLLRFQSDLRVLSVVSRDNI 1572

Query: 410  PTQPNSKGYYAGNPSRGI-----IDGSLVWKFLQLS----LGERLEICKKIGSKHNDILD 460
              +  +  ++      G      +    ++ +  L      G RL  C  +       L 
Sbjct: 1573 SREVYTSNFFVDGRRLGFMVSDELGNVTIYSYDPLDPSSRSGRRLVRCADMR------LP 1626

Query: 461  EFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSIS 520
              ++    ++++ ++ +L +   +   +     +   T   +    NT      +  S++
Sbjct: 1627 SRATCSLRVANRLRHALLSV---KPSSTTTASAMTAGTSATIQDSTNTVLDNLSRVDSVN 1683

Query: 521  DAPGARS------------------------RFLTWYASLDGALGFFLPLPEKNYRRLLM 556
                 R                         R   ++ S +G++    P+ +K Y RL +
Sbjct: 1684 QMNNLRQSQQQSTAAQQGTTNPNSGVDPEKFRQSIYFGSQNGSIYRIGPIRDKMYSRLRI 1743

Query: 557  LQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKK 616
             +  ++ H     G+ P++  +Y        NP   + DG L+W++L L   +RLEI KK
Sbjct: 1744 TEKNLIHHLGPICGMPPKSCWSYNRPQPELANPCGKVADGDLIWRYLTLPHCQRLEIAKK 1803

Query: 617  IGSKHNDILDELYDIEALSSHF 638
             G     I+D++ ++ A + HF
Sbjct: 1804 SGQSLESIMDDIAELIATTLHF 1825


>gi|328773280|gb|EGF83317.1| hypothetical protein BATDEDRAFT_21894 [Batrachochytrium dendrobatidis
            JAM81]
          Length = 1673

 Score =  331 bits (849), Expect = 6e-88,   Method: Compositional matrix adjust.
 Identities = 199/590 (33%), Positives = 306/590 (51%), Gaps = 98/590 (16%)

Query: 98   YQGVFLCGPHPAWLF--LTSRGE---------------------------LRAHPMTIDG 128
            Y GV + G  P W+   L SR +                           LR HPM +DG
Sbjct: 1119 YSGVVVTGSRPCWIMVALQSRQQDLDVISFDNSVACSTKLPPVPLLGTNMLRFHPMPVDG 1178

Query: 129  PVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLE 188
            P+   AP HNVN   GFLY N K   RI  LP   ++D  WPV KVP+  T H +AYH  
Sbjct: 1179 PMKCFAPLHNVNVAHGFLYINWKGLFRICQLPPQFNFDHDWPVCKVPIHKTVHKVAYHYS 1238

Query: 189  TKTYCIVTSTAE---------PSTDYYKFNGEDKEL------VTDPRDSRFIPP-----L 228
            ++TY I TST E          S        E  E+      VT  R+   I P      
Sbjct: 1239 SQTYAIATSTPERFDIPHAQYASAVAAAVIDEGDEMPDAERKVTGIRELSEIKPGMYEAT 1298

Query: 229  VSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYS 288
            V ++ + L S  +WE +   +  L E E V+ L+ V +  + T+SG + Y+A+GT Y+  
Sbjct: 1299 VDRYKIELVSSVTWETV--DSIELSEAETVMALEAVDLSSKETISGKKLYLAIGTGYSRG 1356

Query: 289  EDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKI 348
            ED++ RG++ L+D+IEVVP+P  P T  K K + +++ + P +AIC V  +L+ A+G KI
Sbjct: 1357 EDLSSRGKLHLYDVIEVVPDPNNPQTNRKFKHVDSEDDRSPFSAICTVNDYLLAAIGPKI 1416

Query: 349  YIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDY 408
             ++QL+D ++TG+AF+D  V++ S+ SVKNLI + D  +S+  + +Q E   L+++ RD 
Sbjct: 1417 IMYQLEDGEITGVAFLDVNVFVTSLSSVKNLIQICDIQKSVWFVAFQEEPAKLAVLGRDV 1476

Query: 409  KPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFM 468
             P Q    GY A                                   N ++D+ + +  +
Sbjct: 1477 HPLQ----GYAA-----------------------------------NMLIDD-NQLALL 1496

Query: 469  ISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSR 528
            ++D DKN+   +Y P+  +S GG RLI+K + HLGQHV+ F ++R KP   +DA     +
Sbjct: 1497 VADGDKNLHTMIYAPDNVQSLGGERLIRKGEIHLGQHVSKFIRMRRKPLLRNDAIVFSKQ 1556

Query: 529  FLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGK------ 582
            +L   A+LDGAL    P+ E+ ++RL  L + MVT   H  GLNPR FR  + +      
Sbjct: 1557 YLNVAATLDGALEIITPVSERIFKRLYGLYSRMVTSIEHIAGLNPRGFRQAQHRVRPITL 1616

Query: 583  -GYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI 631
             G+      RGI+DG L++++++LS  ++  + K IGSK + ++D+L ++
Sbjct: 1617 SGFIGPPGPRGILDGDLLYEYVRLSRTQQRGLAKAIGSKDDRLMDDLLEV 1666


>gi|256079900|ref|XP_002576222.1| cleavage and polyadenylation specificity factor cpsf [Schistosoma
            mansoni]
          Length = 1958

 Score =  313 bits (803), Expect = 1e-82,   Method: Compositional matrix adjust.
 Identities = 181/542 (33%), Positives = 279/542 (51%), Gaps = 76/542 (14%)

Query: 17   VQELLTVSLGLHGNRPLLLVRTQHELLIYQAF------RHP-------KGALKLRFKKLK 63
            + E+L   +G+  +RP+L+VRT  E+  ++A        +P       +G L+ R   L 
Sbjct: 1170 ILEILVYPIGIDKDRPVLMVRTSQEIAFFEALCPSPDESYPLISGTFYEGRLRWRRLPLP 1229

Query: 64   VLFVSDRSKRANEQPGLPRGV---RISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELR 120
               V+ R  R + +    +     R   +R F NI  ++GVF+CG +P WLF T  G+LR
Sbjct: 1230 CPLVAPRRVRTDPKIMDVQSTLLTRTHMLRSFENIGDHRGVFVCGGNPIWLFATDSGQLR 1289

Query: 121  AHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTP 180
              P +IDG + + AP +   C  GF+YF   +E+R++ LP   S++    ++ + L   P
Sbjct: 1290 VFPHSIDGIMGSFAPLNAKICHSGFVYFTFSNEMRLATLPPGYSFNEHLGIKWITLDPVP 1349

Query: 181  HFLAYHLETKTYCIVTSTAEPSTDYYKFNGE-DKELVTDPRDSRFIPPLVSQFHVSLFSP 239
            +++ YH+E+KTY +V   +EP    ++ N E +KE     R    + P +  + + +++P
Sbjct: 1350 YYVQYHVESKTYAVVGIHSEPCKSVFRLNAEGNKEEDVLVRPKTCVLPTLDYYSLQMYAP 1409

Query: 240  F----------SWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSE 289
                        W  IP T      WE V CL    +  E T  G + Y+ALG N  Y E
Sbjct: 1410 NLNANHRNKQPPWLLIPNTLIEFEPWEVVTCLITAQLASEETFHGTKDYLALGANLTYGE 1469

Query: 290  DVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIY 349
            ++  RGRIL+ D+I+VVPEPGQPLT++K+K+I+  EQKGPVTA+    G L++A+GQKIY
Sbjct: 1470 EIPVRGRILILDVIDVVPEPGQPLTRHKLKIIHDGEQKGPVTALTSCQGHLISAIGQKIY 1529

Query: 350  IWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYK 409
            IW LK+ DL G+AF+D+E+YI +++ VKNL+L  D  +S+ LLR+Q + R LS+V+RD  
Sbjct: 1530 IWTLKNTDLVGVAFVDSELYIHNLLCVKNLVLAADVLKSVQLLRFQSDLRVLSVVSRDNI 1589

Query: 410  PTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMI 469
                 S+  Y  N     +DG                                  +GFM+
Sbjct: 1590 -----SREVYTSN---FFVDG--------------------------------RRLGFMV 1609

Query: 470  SDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI---------RCKPSSIS 520
            SD+  NV ++ Y P    S  G RL++  D  L        ++           KPSS +
Sbjct: 1610 SDELGNVTIYSYDPLDPSSRSGRRLVRCADMRLPSRATCSLRVANRLRHALLSVKPSSTT 1669

Query: 521  DA 522
             A
Sbjct: 1670 TA 1671



 Score = 63.9 bits (154), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 31/107 (28%), Positives = 57/107 (53%)

Query: 532  WYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSR 591
            ++ S +G++    P+ +K Y RL + +  ++ H     G+ P++  +Y        NP  
Sbjct: 1852 YFGSQNGSIYRIGPIRDKMYSRLRITEKNLIHHLGPICGMPPKSCWSYNRPQPELANPCG 1911

Query: 592  GIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
             + DG L+W++L L   +RLEI KK G     I+D++ ++ A + HF
Sbjct: 1912 KVADGDLIWRYLTLPHCQRLEIAKKSGQSLESIMDDIAELIATTLHF 1958



 Score = 42.0 bits (97), Expect = 0.90,   Method: Compositional matrix adjust.
 Identities = 17/45 (37%), Positives = 26/45 (57%)

Query: 421  GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSM 465
             NP   + DG L+W++L L   +RLEI KK G     I+D+ + +
Sbjct: 1907 ANPCGKVADGDLIWRYLTLPHCQRLEIAKKSGQSLESIMDDIAEL 1951


>gi|426235955|ref|XP_004011942.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1 [Ovis aries]
          Length = 819

 Score =  308 bits (790), Expect = 4e-81,   Method: Compositional matrix adjust.
 Identities = 163/365 (44%), Positives = 207/365 (56%), Gaps = 61/365 (16%)

Query: 101 VFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLP 160
           VF+CGP P WL +T RG LR HPM IDGP+ + APFHN+NCPRGFLYFN + ELRISVLP
Sbjct: 503 VFICGPSPHWLLVTGRGALRLHPMGIDGPIDSFAPFHNINCPRGFLYFNRQGELRISVLP 562

Query: 161 THLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPR 220
            +LSYDAPWPVRK+PL+CT H++AYH+E+K Y + TST+ P T   +  GE+KE  T  R
Sbjct: 563 AYLSYDAPWPVRKIPLRCTAHYVAYHVESKVYAVATSTSTPCTRVPRMTGEEKEFETIER 622

Query: 221 DSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGY-- 278
           D R++ P    F + L SP SWE IP     L E                   G RG+  
Sbjct: 623 DERYVHPQQEAFCIQLISPVSWEAIPNARIELEE------------XXXXXXXGSRGHVY 670

Query: 279 -IALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA 337
            +  G+     E+VTCRGRIL+ D+IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH  
Sbjct: 671 SVPAGSCLKEGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCN 730

Query: 338 GFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPE 397
           G LV+A+GQK     L  +                          G   R+  +L    +
Sbjct: 731 GHLVSAIGQKXXXXXLPPH-------------------------AGLNPRAFRMLHV--D 763

Query: 398 YRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHND 457
            R L                    N  R ++DG L+ ++L LS  ER E+ KKIG+  + 
Sbjct: 764 RRVLQ-------------------NAVRNVLDGELLNRYLYLSTMERGELAKKIGTTPDI 804

Query: 458 ILDEF 462
           ILD+ 
Sbjct: 805 ILDDL 809



 Score = 58.5 bits (140), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 29/70 (41%), Positives = 43/70 (61%)

Query: 569 GGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEL 628
            GLNPRAFR          N  R ++DG L+ ++L LS  ER E+ KKIG+  + ILD+L
Sbjct: 750 AGLNPRAFRMLHVDRRVLQNAVRNVLDGELLNRYLYLSTMERGELAKKIGTTPDIILDDL 809

Query: 629 YDIEALSSHF 638
            + + +++HF
Sbjct: 810 LETDRVTAHF 819


>gi|168021793|ref|XP_001763425.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162685218|gb|EDQ71614.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 1452

 Score =  291 bits (745), Expect = 7e-76,   Method: Compositional matrix adjust.
 Identities = 203/655 (30%), Positives = 321/655 (49%), Gaps = 97/655 (14%)

Query: 17   VQELLTVSLGLHGNRPLLLVR-TQHELLIYQAF-------------RHPKGALK------ 56
            V ++   S G    RP LL   +   +L Y AF             R    +LK      
Sbjct: 861  VSQICFESWGEKFGRPFLLATLSDGTMLCYHAFSYDANESSDALEFRETATSLKDLSRLT 920

Query: 57   -LRFKKLKVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTS 115
             LRF ++ + +VS +   A       + +  ++   F N+  + GVF+ G  P WL +  
Sbjct: 921  HLRFARIPIDWVSGQEDGA-------KVLYETKFCSFKNVGSFPGVFVTGLRPTWL-MVC 972

Query: 116  RGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVP 175
            RG LR HP   DG +    P HNVNC  GF+Y  A+ +L+I  LP+ L YD  WPV+K+P
Sbjct: 973  RGRLRPHPQFCDGAILGFTPLHNVNCAHGFIYITAQGQLKICQLPSLLFYDNDWPVQKIP 1032

Query: 176  LKCTPHFLAYHLETKTYCIVTST--AEPSTDYYKFNG----EDKELVTDPRDSRFIPPLV 229
            L+ TPH + YH +   Y ++ ST  + P++     +G    + +E        R +    
Sbjct: 1033 LRGTPHQITYHSDVNLYALIISTPVSRPTSQVLMGDGHPFDQQQENSIGEDGQRLVTS-- 1090

Query: 230  SQFHVSLFSPF----SWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNY 285
              + V +  P     +WE   +    +H  E+ L ++ VS++   T    +  +A+GT+Y
Sbjct: 1091 EDYEVRIIEPAQPGGNWE--AKAAIKMHLTENALTVRIVSIK-NITTDQTQTLLAIGTSY 1147

Query: 286  NYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVG 345
               EDV  +GRI+L  + +   +PG     +  + +Y+KE KG ++AI  + G L+ A+G
Sbjct: 1148 VQGEDVAAKGRIILVSVGKDPQDPG-----SWAREVYSKELKGSISAIASLQGHLLIAIG 1202

Query: 346  QKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVA 405
             KI +     ++L G AF D  +Y+ S+  VKN IL GD  +SI  L ++ +   L+L+A
Sbjct: 1203 PKIILHSWNGSELNGAAFFDAPLYVVSLNIVKNFILFGDIHKSIYFLCWKEDGAQLTLLA 1262

Query: 406  RDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSM 465
            +D+      S   YA   +  +IDG                                S++
Sbjct: 1263 KDF-----GSLDCYA---TEFLIDG--------------------------------STL 1282

Query: 466  GFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPG- 524
              ++SD  KN+ +F Y P++ ES  G +L+ + +FHLG HVN F +++  P+     PG 
Sbjct: 1283 SLLVSDSRKNLQIFSYAPKSMESWKGQKLLSRAEFHLGAHVNKFHRLQMLPT-----PGS 1337

Query: 525  ARS-RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKG 583
            ARS R+   + +LDGA+ +  PL E  +RRL  LQ  +V   SH  G+NPRAFR ++  G
Sbjct: 1338 ARSNRYAVLFGTLDGAIDYLAPLDELTFRRLHTLQRKLVDCVSHVAGVNPRAFRQFRCDG 1397

Query: 584  YYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
                     I+D  L+  +  L L E+LEI ++IG+    +L  L D+ ALS+ F
Sbjct: 1398 KAHRPGPDNIVDCELLSHYDMLPLDEQLEIARQIGTTRAHVLSNLRDL-ALSTSF 1451


>gi|196012166|ref|XP_002115946.1| hypothetical protein TRIADDRAFT_59883 [Trichoplax adhaerens]
 gi|190581722|gb|EDV21798.1| hypothetical protein TRIADDRAFT_59883 [Trichoplax adhaerens]
          Length = 1187

 Score =  283 bits (723), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 135/249 (54%), Positives = 172/249 (69%), Gaps = 1/249 (0%)

Query: 126  IDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAY 185
            +DG V   APF+  NCP GFLYFN++ +LRI VL    +YD PWPV KVPL+ T HF+ +
Sbjct: 768  VDGYVKCFAPFNIANCPNGFLYFNSEEDLRICVLDQRFTYDCPWPVHKVPLRNTLHFITH 827

Query: 186  HLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEI 245
            H  TKTY I++ST            EDKE +   +  RFI   V +F + L +  +WE I
Sbjct: 828  HFVTKTYVIISSTMTVCEKMPHITTEDKEFIPVEKGDRFIHAPVEKFCLQLITSETWEII 887

Query: 246  PQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEV 305
            P     + EWEHV CLK+V ++ E T+SGL+ +IA+GT     E+V CRGRI++FD+IEV
Sbjct: 888  PDAEIQMAEWEHVTCLKSVKLKSEETVSGLKEFIAVGTTNVCGEEVACRGRIVIFDVIEV 947

Query: 306  VPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDN-DLTGIAFI 364
            VPEPG+PLTKNKIK  Y KEQKGPVTAI  V GFLVT++GQKIYIW+ +DN DL G+AFI
Sbjct: 948  VPEPGKPLTKNKIKTYYDKEQKGPVTAITCVEGFLVTSIGQKIYIWEFRDNKDLIGMAFI 1007

Query: 365  DTEVYIASM 373
            DT +YI S+
Sbjct: 1008 DTLIYIHSL 1016



 Score = 98.6 bits (244), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 47/148 (31%), Positives = 82/148 (55%), Gaps = 3/148 (2%)

Query: 485  ARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFL 544
            A ES+GG  L+++ +   G + + FF+ + +     +     ++ +TW+ +LDG++G  L
Sbjct: 1039 APESHGGQFLVRRAEIQTGSNAHAFFRTKVRAL---NQRQNENKHITWFGTLDGSIGLLL 1095

Query: 545  PLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQ 604
            P+ EK YRRL  LQ  +  +     GLN +AFRT++       N  R I+DG L+ ++  
Sbjct: 1096 PVDEKEYRRLFSLQAKLSIYLEQNAGLNQKAFRTFRSHQKKLQNSMRNILDGDLLKRYFH 1155

Query: 605  LSLGERLEICKKIGSKHNDILDELYDIE 632
            L   ER ++ K+I S    I+++L  +E
Sbjct: 1156 LGFVERRDLAKQIMSTPEQIINDLTKLE 1183


>gi|356530945|ref|XP_003534039.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Glycine max]
          Length = 1449

 Score =  282 bits (722), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 185/592 (31%), Positives = 289/592 (48%), Gaps = 73/592 (12%)

Query: 58   RFKKLKVLFVS-DRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSR 116
            R + L+ + V  D   R +   G P      Q+  F NI  YQG FL G  PAW+ +  R
Sbjct: 906  RLRNLRFVRVPLDAYPREDTSNGSP----CQQITIFKNIGSYQGFFLSGSRPAWVMVL-R 960

Query: 117  GELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPL 176
              LR HP   DG +      HNVNC  G +Y  ++  L+I  LP+  +YD+ WPV+K+PL
Sbjct: 961  ERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGVLKICQLPSGSNYDSYWPVQKIPL 1020

Query: 177  KCTPHFLAYHLETKTYCIVTS--TAEP-----STDYYKFNGEDKELVTDPRD-SRFIPPL 228
            K TPH + Y  E   Y ++ S    +P     S     FN +++    +P + +RF P  
Sbjct: 1021 KATPHQVTYFAEKNLYPLIVSFPVLKPLNQVISLVDQDFNHQNESQNMNPDEQNRFYP-- 1078

Query: 229  VSQFHVSLFSPFS----WEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTN 284
            + +F V +  P      W+   +   P+   E+ L ++ V++    T       +A+GT 
Sbjct: 1079 IDEFEVRIMEPEKSGGPWQ--TKATIPMQSSENALTVRMVTL-LNTTSKENETLLAIGTA 1135

Query: 285  YNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAV 344
            Y   EDV  RGRILLF + ++   P     +  +  +Y+KE KG ++A+  + G L+ A 
Sbjct: 1136 YVQGEDVAARGRILLFSLGKITDNP-----QTLVSEVYSKELKGAISALASLQGHLLIAS 1190

Query: 345  GQKIYIWQLKDNDLTGIAFIDT-EVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSL 403
            G KI + +    +L GIAF D   +++ S+  VKN IL+GD  +SI  L ++ +   LSL
Sbjct: 1191 GPKIILHKWNGTELNGIAFFDAPPLHVVSLNIVKNFILIGDIHKSIYFLSWKEQGAQLSL 1250

Query: 404  VARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFS 463
            +A+D+        G      +  +IDG                                S
Sbjct: 1251 LAKDF--------GSLDCFATEFLIDG--------------------------------S 1270

Query: 464  SMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSS--ISD 521
            ++  M+SD ++N+ +F Y P+  ES  G +L+ + +FH+G HV  F +++   +S     
Sbjct: 1271 TLSLMVSDDNRNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLSTSDRAGS 1330

Query: 522  APGA--RSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTY 579
             PG+   +RF   + +LDG++G   PL E  +RRL  LQ  +V    H  GLNPRAFR +
Sbjct: 1331 VPGSDKTNRFALLFGTLDGSIGCIAPLDEITFRRLQSLQRKLVDAVPHVAGLNPRAFRLF 1390

Query: 580  KGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI 631
            +  G         I+D  L+  +  L L E+LEI  +IG+  + IL  L D+
Sbjct: 1391 RSNGKAHRPGPDSIVDCELLCHYEMLPLEEQLEIANQIGTTRSQILSNLSDL 1442


>gi|449524573|ref|XP_004169296.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1-like, partial [Cucumis sativus]
          Length = 741

 Score =  281 bits (719), Expect = 7e-73,   Method: Compositional matrix adjust.
 Identities = 180/570 (31%), Positives = 275/570 (48%), Gaps = 70/570 (12%)

Query: 80  LPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNV 139
           +P G    ++  F NI+GYQG+FLCG  PAW F+  R  LR HP   DGP+   A  HNV
Sbjct: 217 MPNGTLSRRLSIFKNISGYQGLFLCGSRPAW-FMVFRERLRVHPQLCDGPIVAFAVLHNV 275

Query: 140 NCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTST- 198
           NC  G +Y  ++  L+I  LP+  +YD  WPV+KVPLK TPH + Y  E   Y ++ S  
Sbjct: 276 NCNHGLIYVTSQGVLKICQLPSTSNYDNYWPVQKVPLKGTPHQVTYFHEKNLYPVIISAP 335

Query: 199 --------AEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFS----WEEIP 246
                        D    + E+  L  D     +    V +F + +  P      W+   
Sbjct: 336 VQKPLNQVLSSMVDQDVGHVENHNLSADELQQTYS---VEEFEIRILEPEKSGGPWQ--T 390

Query: 247 QTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVV 306
           +    +H  E+ L ++ V++    T       +A+GT Y   EDV  RGR+LLF + +  
Sbjct: 391 RATIAMHSSENALTIRVVTL-LNTTTKENETLLAVGTAYVQGEDVAARGRVLLFSVGKDA 449

Query: 307 PEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDT 366
                  ++  +  +Y+KE KG ++A+  + G L+ A G KI + +    +L GIAF D 
Sbjct: 450 DN-----SQTLVSEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGAELNGIAFYDV 504

Query: 367 -EVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSR 425
             +Y+ S+  VKN IL+GD  +SI  L ++ +   LSL+A+D+      S   YA   + 
Sbjct: 505 PPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLSLLAKDF-----GSLDCYA---TE 556

Query: 426 GIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEA 485
            +IDG                                S++   +SD  KN+ +F Y P++
Sbjct: 557 FLIDG--------------------------------STLSLTVSDDQKNIQIFYYAPKS 584

Query: 486 RESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARS----RFLTWYASLDGALG 541
            ES  G +L+ + +FH+G HV  F +++   +S   A    S    RF   + +LDG++G
Sbjct: 585 TESWKGQKLLSRAEFHVGAHVTKFLRLQMLSTSSDKACSTVSDKTNRFALLFGTLDGSIG 644

Query: 542 FFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWK 601
              PL E  +RRL  LQ  +     H GGLNPR+FR +   G         I+D  L+  
Sbjct: 645 CIAPLDELTFRRLQSLQKKLGDAVPHVGGLNPRSFRQFHSNGKVHRRGPDSIVDCELLCH 704

Query: 602 FLQLSLGERLEICKKIGSKHNDILDELYDI 631
           +  L L E+L+I  +IG+  + IL  L D+
Sbjct: 705 YEMLPLEEQLDIAHQIGTTRSQILSNLNDL 734


>gi|393907594|gb|EJD74706.1| hypothetical protein LOAG_18016 [Loa loa]
          Length = 398

 Score =  281 bits (719), Expect = 7e-73,   Method: Compositional matrix adjust.
 Identities = 157/431 (36%), Positives = 244/431 (56%), Gaps = 42/431 (9%)

Query: 211 EDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEG 270
           +DK      +   F+ P + Q+ + L+SP  W+ +        E+E V C + V +  EG
Sbjct: 7   DDKTFEEHEKPDTFVYPEMDQYKLQLYSPEDWKPVQNVEVLFEEFEVVTCCEEVVLRSEG 66

Query: 271 TLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPV 330
           T+SG++ Y+A+GT  NY E+V  RGRI++ +IIEVVPEPGQP +K++IK +Y KEQKGPV
Sbjct: 67  TVSGVQNYLAVGTACNYGEEVLVRGRIIISEIIEVVPEPGQPTSKHRIKTLYDKEQKGPV 126

Query: 331 TAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIA 390
           T++C   G+L+T +GQK++IW  KDN+L GI+F+D   Y+  ++ V+NL L  D  RS+A
Sbjct: 127 TSLCSCNGYLLTGMGQKVFIWLFKDNNLQGISFLDMHFYVHQLIGVRNLALACDMYRSVA 186

Query: 391 LLRYQPEYRTLSLVARDYKP-TQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICK 449
           LLRYQ EY+ LSL +RD +   QP     +       IID                    
Sbjct: 187 LLRYQEEYKALSLASRDMRSDVQPPMAAQF-------IIDN------------------- 220

Query: 450 KIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 509
                          MGF++SD+  N+ +F Y PE  ES GG +L  + + ++G  VN+F
Sbjct: 221 -------------KQMGFVMSDEAANIAIFNYLPETLESLGGEKLTLRAEINIGTVVNSF 267

Query: 510 FKIRCKPSS--ISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSH 567
            +++   SS  + +   +  R    +ASLDG+ GF  PL EK +RRL MLQ +M +    
Sbjct: 268 IRVKGHISSGFVENELFSLERQSVLFASLDGSFGFLRPLTEKVFRRLHMLQQLMSSMVPQ 327

Query: 568 TGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDE 627
             GLN +  R  +         +R ++DG +V ++L LSL E+ ++ +K+G+    I+D+
Sbjct: 328 PAGLNAKGARAARPPRPNHYLNTRNLVDGDMVMQYLHLSLPEKNDLARKLGTSRYHIIDD 387

Query: 628 LYDIEALSSHF 638
           L +I  +++H+
Sbjct: 388 LIEICRVTAHY 398


>gi|449470342|ref|XP_004152876.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Cucumis sativus]
          Length = 1504

 Score =  281 bits (718), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 180/570 (31%), Positives = 275/570 (48%), Gaps = 70/570 (12%)

Query: 80   LPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNV 139
            +P G    ++  F NI+GYQG+FLCG  PAW F+  R  LR HP   DGP+   A  HNV
Sbjct: 980  MPNGTLSCRLSIFKNISGYQGLFLCGSRPAW-FMVFRERLRVHPQLCDGPIVAFAVLHNV 1038

Query: 140  NCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTST- 198
            NC  G +Y  ++  L+I  LP+  +YD  WPV+KVPLK TPH + Y  E   Y ++ S  
Sbjct: 1039 NCNHGLIYVTSQGVLKICQLPSTSNYDNYWPVQKVPLKGTPHQVTYFHEKNLYPVIISAP 1098

Query: 199  --------AEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFS----WEEIP 246
                         D    + E+  L  D     +    V +F + +  P      W+   
Sbjct: 1099 VQKPLNQVLSSMVDQDVGHVENHNLSADELQQTYS---VEEFEIRILEPEKSGGPWQ--T 1153

Query: 247  QTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVV 306
            +    +H  E+ L ++ V++    T       +A+GT Y   EDV  RGR+LLF + +  
Sbjct: 1154 RATIAMHSSENALTIRVVTL-LNTTTKENETLLAVGTAYVQGEDVAARGRVLLFSVGKDA 1212

Query: 307  PEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDT 366
                   ++  +  +Y+KE KG ++A+  + G L+ A G KI + +    +L GIAF D 
Sbjct: 1213 DN-----SQTLVSEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGAELNGIAFYDV 1267

Query: 367  -EVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSR 425
              +Y+ S+  VKN IL+GD  +SI  L ++ +   LSL+A+D+      S   YA   + 
Sbjct: 1268 PPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLSLLAKDF-----GSLDCYA---TE 1319

Query: 426  GIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEA 485
             +IDG                                S++   +SD  KN+ +F Y P++
Sbjct: 1320 FLIDG--------------------------------STLSLTVSDDQKNIQIFYYAPKS 1347

Query: 486  RESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARS----RFLTWYASLDGALG 541
             ES  G +L+ + +FH+G HV  F +++   +S   A    S    RF   + +LDG++G
Sbjct: 1348 TESWKGQKLLSRAEFHVGAHVTKFLRLQMLSTSSDKACSTVSDKTNRFALLFGTLDGSIG 1407

Query: 542  FFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWK 601
               PL E  +RRL  LQ  +     H GGLNPR+FR +   G         I+D  L+  
Sbjct: 1408 CIAPLDELTFRRLQSLQKKLGDAVPHVGGLNPRSFRQFHSNGKVHRRGPDSIVDCELLCH 1467

Query: 602  FLQLSLGERLEICKKIGSKHNDILDELYDI 631
            +  L L E+L+I  +IG+  + IL  L D+
Sbjct: 1468 YEMLPLEEQLDIAHQIGTTRSQILSNLNDL 1497


>gi|218194461|gb|EEC76888.1| hypothetical protein OsI_15095 [Oryza sativa Indica Group]
          Length = 1503

 Score =  278 bits (712), Expect = 5e-72,   Method: Compositional matrix adjust.
 Identities = 197/664 (29%), Positives = 315/664 (47%), Gaps = 86/664 (12%)

Query: 30   NRPLLL-VRTQHELLIYQAFRH-------------PKG------ALKLRFKKLKVLFVSD 69
            +RP L  +     LL Y AF +             P+G      A   R + L+   VS 
Sbjct: 857  SRPFLFGLLNDGTLLCYHAFSYEASESNVKRVPLSPQGSADHHNASDSRLRNLRFHRVSI 916

Query: 70   RSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGP 129
                  + P L R     ++  F+N+ GY+G+FL G  PAW+ +  R  LR HP   DGP
Sbjct: 917  DITSREDIPTLGR----PRITTFNNVGGYEGLFLSGTRPAWV-MVCRQRLRVHPQLCDGP 971

Query: 130  VSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLET 189
            +      HNVNC  GF+Y  ++  L+I  LP+  +YD  WPV+KVPL  TPH + Y+ E 
Sbjct: 972  IEAFTVLHNVNCSHGFIYVTSQGFLKICQLPSAYNYDNYWPVQKVPLHGTPHQVTYYAEQ 1031

Query: 190  KTYCIVTSTA---------EPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFS-- 238
              Y ++ S               D    +  D ++ +   D+      V +F V +    
Sbjct: 1032 SLYPLIVSVPVVRPLNQVLSSMADQESVHHMDNDVTS--TDALHKTYTVDEFEVRILELE 1089

Query: 239  --PFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGR 296
                 WE   ++  P+  +E+ L ++ V++ +  T       +A+GT Y   EDV  RGR
Sbjct: 1090 KPGGHWE--TKSTIPMQLFENALTVRIVTL-HNTTTKENETLLAIGTAYVLGEDVAARGR 1146

Query: 297  ILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDN 356
            +LLF  ++         ++N +  +Y+KE KG V+A+  + G L+ A G KI + +    
Sbjct: 1147 VLLFSFMK------SENSQNLVTEVYSKESKGAVSAVASLQGHLLIASGPKITLNKWTGA 1200

Query: 357  DLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSK 416
            +LT +AF D  +++ S+  VKN +L GD  +SI  L ++ +   LSL+A+D+      + 
Sbjct: 1201 ELTAVAFYDAPLHVVSLNIVKNFVLFGDIHKSIYFLSWKEQGSQLSLLAKDFGSLDCFAT 1260

Query: 417  GYYAGNPSRGIIDGS----------------------LVWKFL--QLSLGERLEICKKIG 452
             +     +  ++                         L WK    QLSL     + K  G
Sbjct: 1261 EFLIDGSTLSLVASDSDKNVQVKNFVLFGDIHKSIYFLSWKEQGSQLSL-----LAKDFG 1315

Query: 453  SKH---NDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 509
            S      + L + S++  + SD DKNV +F Y P+  ES  G +L+ + +FH+G H+  F
Sbjct: 1316 SLDCFATEFLIDGSTLSLVASDSDKNVQIFYYAPKMVESWKGQKLLSRAEFHVGAHITKF 1375

Query: 510  FKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
             +++  P+    +    +RF   + +LDG +G   P+ E  +RRL  LQ  +V    H  
Sbjct: 1376 LRLQMLPTQ-GLSSEKTNRFALLFGNLDGGIGCIAPIDELTFRRLQSLQRKLVDAVPHVC 1434

Query: 570  GLNPRAFRTY--KGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDE 627
            GLNPR+FR +   GKG+  G  +  IID  L+  +  LSL E+L++ ++IG+  + IL  
Sbjct: 1435 GLNPRSFRQFHSNGKGHRPGPDN--IIDFELLAHYEMLSLDEQLDVAQQIGTTRSQILSN 1492

Query: 628  LYDI 631
              DI
Sbjct: 1493 FSDI 1496


>gi|356559917|ref|XP_003548242.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Glycine max]
          Length = 1447

 Score =  277 bits (709), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 183/594 (30%), Positives = 284/594 (47%), Gaps = 77/594 (12%)

Query: 58   RFKKLKVLFVS-DRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSR 116
            R + L+ + V  D   R +   G P      Q+  F NI  Y+G FL G  PAW+ +  R
Sbjct: 904  RLRNLRFVRVPLDAYAREDTSNGPP----CQQITIFKNIGSYEGFFLSGSRPAWVMVL-R 958

Query: 117  GELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPL 176
              LR HP   DG +      HNVNC +G +Y  ++  L+I  LP+  +YD+ WPV+K+PL
Sbjct: 959  ERLRVHPQLCDGSIVAFTVLHNVNCNQGLIYVTSQGVLKICQLPSGSNYDSYWPVQKIPL 1018

Query: 177  KCTPHFLAYHLETKTYCIVTS--TAEPSTDYYKFNGED------KELVTDPRDSRFIPPL 228
            K TPH + Y  E   Y ++ S    +P         +D       + +     +RF P  
Sbjct: 1019 KATPHQVTYFAEKNLYPLIVSFPVLKPLNQVISLVDQDINHQNESQNMNPDEQNRFYP-- 1076

Query: 229  VSQFHVSLFSPFS----WEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTN 284
            + +F V +  P      W+   +   P+   E+ L ++ V++    T       +A+GT 
Sbjct: 1077 IDEFEVRIMEPEKSGGPWQ--TKATIPMQSSENALTVRMVTL-VNTTSKENETLLAIGTA 1133

Query: 285  YNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAV 344
            Y   EDV  RGRILLF + +    P     +  +  +Y+KE KG ++A+  + G L+ A 
Sbjct: 1134 YVQGEDVAARGRILLFSLGKNTDNP-----QTLVSEVYSKELKGAISALASLQGHLLIAS 1188

Query: 345  GQKIYIWQLKDNDLTGIAFIDT-EVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSL 403
            G KI + +    +L GIAF D   +++ S+  VKN IL+GD  +SI  L ++ +   LSL
Sbjct: 1189 GPKIILHKWNGTELNGIAFFDAPPLHVVSLNIVKNFILIGDIHKSIYFLSWKEQGAQLSL 1248

Query: 404  VARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFS 463
            +A+D+        G      +  +IDG                                S
Sbjct: 1249 LAKDF--------GSLDCFATEFLIDG--------------------------------S 1268

Query: 464  SMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAP 523
            ++  M+SD ++N+ +F Y P+  ES  G +L+ + +FH+G HV  F  +R +  S SD  
Sbjct: 1269 TLSLMVSDDNRNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKF--LRLQMLSTSDRA 1326

Query: 524  GA------RSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFR 577
            GA       +RF   + +LDG++G   PL E  +RRL  LQ  +V    H  GLNPRAFR
Sbjct: 1327 GAVPGSDKTNRFALLFGTLDGSIGCIAPLDEITFRRLQSLQRKLVDAVPHVAGLNPRAFR 1386

Query: 578  TYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI 631
             ++  G         I+D  L+  +  L L E+LEI  ++G+  + IL  L D+
Sbjct: 1387 LFRSNGKAHRPGPDSIVDCELLCHYEMLPLEEQLEIAHQVGTTRSQILSNLSDL 1440


>gi|302761560|ref|XP_002964202.1| hypothetical protein SELMODRAFT_82277 [Selaginella moellendorffii]
 gi|300167931|gb|EFJ34535.1| hypothetical protein SELMODRAFT_82277 [Selaginella moellendorffii]
          Length = 1413

 Score =  276 bits (707), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 201/656 (30%), Positives = 317/656 (48%), Gaps = 91/656 (13%)

Query: 12   MDETIVQELLTVSLGLHGNRPLLLVR-TQHELLIYQAFRHP---KGA--------LKLRF 59
            M +  V ++   + G    RP + V  +   LL Y+AF +     GA          LRF
Sbjct: 820  MSKIKVVDICVDTWGEKYGRPFVFVLLSDGTLLSYRAFIYEGQDSGAHASDGTSFRNLRF 879

Query: 60   KKLKV-LFVSDRSKRANEQPGLPRGVR-ISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRG 117
             +L++ L + +    A+E       VR + ++  F ++ G QG+FL G  P WL +  R 
Sbjct: 880  LRLQLDLELGEEDSNADE-------VRSVQKIIPFKDVGGLQGLFLAGGKPTWLMIF-RE 931

Query: 118  ELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLK 177
            ++R HP   DGP+      HNVNC  G +Y   ++ L+I  L   L+YD  WPV+K+PLK
Sbjct: 932  QIRLHPQASDGPIVAFTSLHNVNCQHGLIYVTNEASLKICRLSNILNYDNDWPVQKIPLK 991

Query: 178  CTPHFLAYHLETKTYCIV--------TSTAEPS-TDYYKFNGEDKELVTDPRDSRFIPPL 228
             TPH +A+H +   Y +V        TS   PS  D    +  D+   +D  D + +   
Sbjct: 992  GTPHQMAHHPDLNIYVLVLSFSVSVPTSLVLPSAADGPPGHQIDQSEASDGLDPQKMVQ- 1050

Query: 229  VSQFHVSLFSPFS----WEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTN 284
            V  F V L  P +    WE      F     E+VL ++ VS++   T   +   +A+GT 
Sbjct: 1051 VDDFEVRLLEPMAQGVPWETKDTIKF--QPAENVLTVRIVSIKNAAT-EQVENLLAIGTG 1107

Query: 285  YNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAV 344
            Y   EDV  RGRI+L  + E   +P  P  K   K +Y+KE KG ++A+  + G L+ A+
Sbjct: 1108 YLQGEDVASRGRIILVSLGE---DPSDP--KVWAKELYSKELKGAISALAALQGHLLLAI 1162

Query: 345  GQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLV 404
            G KI +     ++L G AF D  +Y+ S+  VKN +L GD+ +SI  L ++ E   L L+
Sbjct: 1163 GPKIILHTWNGSELIGTAFFDAPLYVVSLNIVKNFVLFGDFHKSIYFLCWKEEGAQLVLL 1222

Query: 405  ARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSS 464
            A+D+      S   YA   +  +IDG                                S+
Sbjct: 1223 AKDF-----GSLDCYA---TEFLIDG--------------------------------ST 1242

Query: 465  MGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPG 524
            +  ++SD  KN+ +F Y P+  ES  G +L+ + +FHLG HV  F +++     +   PG
Sbjct: 1243 LSLLVSDSRKNIQVFSYAPKNAESWKGQKLLPRVEFHLGSHVTKFLRLQ-----MLQTPG 1297

Query: 525  AR--SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGK 582
            +   +RF   + +LDG +G+  PL E  +RRL  LQ  +V    H  GLNP+A+R ++  
Sbjct: 1298 SSRTNRFALCFGTLDGGIGYITPLDELTFRRLQTLQRKLVDLVPHVAGLNPKAYRQFQAN 1357

Query: 583  GYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
            G +  +     +D   + ++  LSL +++ I ++IG+    I   L DI   +S F
Sbjct: 1358 GEHHKHGPDNTVDSEQLREYESLSLDKQVAIARQIGTTRQQIFANLRDISLSTSFF 1413


>gi|302814354|ref|XP_002988861.1| hypothetical protein SELMODRAFT_184138 [Selaginella moellendorffii]
 gi|300143432|gb|EFJ10123.1| hypothetical protein SELMODRAFT_184138 [Selaginella moellendorffii]
          Length = 1413

 Score =  276 bits (707), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 201/656 (30%), Positives = 317/656 (48%), Gaps = 91/656 (13%)

Query: 12   MDETIVQELLTVSLGLHGNRPLLLVR-TQHELLIYQAFRHP---KGA--------LKLRF 59
            M +  V ++   + G    RP + V  +   LL Y+AF +     GA          LRF
Sbjct: 820  MSKIKVVDICVDTWGEKYGRPFVFVLLSDGTLLSYRAFIYEGQDSGAHASDGTSFRNLRF 879

Query: 60   KKLKV-LFVSDRSKRANEQPGLPRGVR-ISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRG 117
             +L++ L + +    A+E       VR + ++  F ++ G QG+FL G  P WL +  R 
Sbjct: 880  LRLQLDLELGEEDSNADE-------VRSVQKIIPFKDVGGLQGLFLAGGKPTWLMIF-RE 931

Query: 118  ELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLK 177
            ++R HP   DGP+      HNVNC  G +Y   ++ L+I  L   L+YD  WPV+K+PLK
Sbjct: 932  QIRLHPQASDGPIVAFTSLHNVNCQHGLIYVTNEASLKICRLSNILNYDNDWPVQKIPLK 991

Query: 178  CTPHFLAYHLETKTYCIV--------TSTAEPS-TDYYKFNGEDKELVTDPRDSRFIPPL 228
             TPH +A+H +   Y +V        TS   PS  D    +  D+   +D  D + +   
Sbjct: 992  GTPHQMAHHPDLNIYVLVLSFSVSVPTSLVLPSAADGPPGHQIDQSEASDGLDPQKMVQ- 1050

Query: 229  VSQFHVSLFSPFS----WEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTN 284
            V  F V L  P +    WE      F     E+VL ++ VS++   T   +   +A+GT 
Sbjct: 1051 VDDFEVRLLEPMAQGVPWETKDTIKF--QPAENVLTVRIVSIKNAAT-EQVENLLAIGTG 1107

Query: 285  YNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAV 344
            Y   EDV  RGRI+L  + E   +P  P  K   K +Y+KE KG ++A+  + G L+ A+
Sbjct: 1108 YLQGEDVASRGRIILVSLGE---DPSDP--KVWAKELYSKELKGAISALAALQGHLLLAI 1162

Query: 345  GQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLV 404
            G KI +     ++L G AF D  +Y+ S+  VKN +L GD+ +SI  L ++ E   L L+
Sbjct: 1163 GPKIILHTWNGSELIGTAFFDAPLYVVSLNIVKNFVLFGDFHKSIYFLCWKEEGAQLVLL 1222

Query: 405  ARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSS 464
            A+D+      S   YA   +  +IDG                                S+
Sbjct: 1223 AKDF-----GSLDCYA---TEFLIDG--------------------------------ST 1242

Query: 465  MGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPG 524
            +  ++SD  KN+ +F Y P+  ES  G +L+ + +FHLG HV  F +++     +   PG
Sbjct: 1243 LSLLVSDSRKNIQVFSYAPKNAESWKGQKLLPRVEFHLGSHVTKFLRLQ-----MLQTPG 1297

Query: 525  AR--SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGK 582
            +   +RF   + +LDG +G+  PL E  +RRL  LQ  +V    H  GLNP+A+R ++  
Sbjct: 1298 SSRTNRFALCFGTLDGGIGYITPLDELTFRRLQTLQRKLVDLVPHVAGLNPKAYRQFQAN 1357

Query: 583  GYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
            G +  +     +D   + ++  LSL +++ I ++IG+    I   L DI   +S F
Sbjct: 1358 GEHHKHGPDNTVDSEQLREYESLSLDKQVAIARQIGTTRQQIFANLRDISLSTSFF 1413


>gi|343962533|dbj|BAK62854.1| cleavage and polyadenylation specificity factor 160 kDa subunit
           [Pan troglodytes]
          Length = 269

 Score =  276 bits (707), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 134/279 (48%), Positives = 175/279 (62%), Gaps = 40/279 (14%)

Query: 208 FNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSME 267
             GE+KE  T  RD R+I P    F + L SP SWE IP     L EWEHV C+K VS+ 
Sbjct: 1   MTGEEKEFETIERDERYIHPQQEAFSIQLISPVSWEAIPNARIELQEWEHVTCMKTVSLR 60

Query: 268 YEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQK 327
            E T+SGL+GY+A GT     E+VTCRGRIL+ D+IEVVPEPGQPLTKNK K++Y KEQK
Sbjct: 61  SEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQK 120

Query: 328 GPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYAR 387
           GPVTA+CH  G LV+A+GQKI++W L+ ++LTG+AFIDT++YI  M+SVKN IL  D  +
Sbjct: 121 GPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMAFIDTQLYIHQMISVKNFILAADVMK 180

Query: 388 SIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEI 447
           SI+LLRYQ E +TLSLV+RD KP +  S  +   N                         
Sbjct: 181 SISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDN------------------------- 215

Query: 448 CKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEAR 486
                          + +GF++SD+D+N++++MY PE  
Sbjct: 216 ---------------AQLGFLVSDRDRNLMVYMYLPEGE 239


>gi|75145059|sp|Q7XWP1.2|CPSF1_ORYSJ RecName: Full=Probable cleavage and polyadenylation specificity
            factor subunit 1; AltName: Full=Cleavage and
            polyadenylation specificity factor 160 kDa subunit;
            Short=CPSF 160 kDa subunit
 gi|38345987|emb|CAD39979.2| OSJNBa0032B23.5 [Oryza sativa Japonica Group]
          Length = 1441

 Score =  276 bits (706), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 190/637 (29%), Positives = 301/637 (47%), Gaps = 94/637 (14%)

Query: 30   NRPLLL-VRTQHELLIYQAFRH-------------PKG------ALKLRFKKLKVLFVSD 69
            +RP L  +     LL Y AF +             P+G      A   R + L+   VS 
Sbjct: 857  SRPFLFGLLNDGTLLCYHAFSYEASESNVKRVPLSPQGSADHHNASDSRLRNLRFHRVSI 916

Query: 70   RSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGP 129
                  + P L R     ++  F+N+ GY+G+FL G  PAW+ +  R  LR HP   DGP
Sbjct: 917  DITSREDIPTLGR----PRITTFNNVGGYEGLFLSGTRPAWV-MVCRQRLRVHPQLCDGP 971

Query: 130  VSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLET 189
            +      HNVNC  GF+Y  ++  L+I  LP+  +YD+ WPV+KVPL  TPH + Y+ E 
Sbjct: 972  IEAFTVLHNVNCSHGFIYVTSQGFLKICQLPSAYNYDSYWPVQKVPLHGTPHQVTYYAEQ 1031

Query: 190  KTYCIVTSTA---------EPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFS-- 238
              Y ++ S               D    +  D ++ +   D+      V +F V +    
Sbjct: 1032 SLYPLIVSVPVVRPLNQVLSSMADQESVHHMDNDVTS--TDALHKTYTVDEFEVRILELE 1089

Query: 239  --PFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGR 296
                 WE   ++  P+  +E+ L ++ V++ +  T       +A+GT Y   EDV  RGR
Sbjct: 1090 KPGGHWE--TKSTIPMQLFENALTVRIVTL-HNTTTKENETLLAIGTAYVLGEDVAARGR 1146

Query: 297  ILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDN 356
            +LLF   +         ++N +  +Y+KE KG V+A+  + G L+ A G KI + +    
Sbjct: 1147 VLLFSFTK------SENSQNLVTEVYSKESKGAVSAVASLQGHLLIASGPKITLNKWTGA 1200

Query: 357  DLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSK 416
            +LT +AF D  +++ S+  VKN +L GD  +SI  L ++ +   LSL+A+D+        
Sbjct: 1201 ELTAVAFYDAPLHVVSLNIVKNFVLFGDIHKSIYFLSWKEQGSQLSLLAKDF-------- 1252

Query: 417  GYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNV 476
            G      +  +IDG                                S++  + SD DKNV
Sbjct: 1253 GSLDCFATEFLIDG--------------------------------STLSLVASDSDKNV 1280

Query: 477  VLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASL 536
             +F Y P+  ES  G +L+ + +FH+G H+  F +++  P+    +    +RF   + +L
Sbjct: 1281 QIFYYAPKMVESWKGQKLLSRAEFHVGAHITKFLRLQMLPTQ-GLSSEKTNRFALLFGNL 1339

Query: 537  DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTY--KGKGYYAGNPSRGII 594
            DG +G   P+ E  +RRL  LQ  +V    H  GLNPR+FR +   GKG+  G     II
Sbjct: 1340 DGGIGCIAPIDELTFRRLQSLQRKLVDAVPHVCGLNPRSFRQFHSNGKGHRPG--PDNII 1397

Query: 595  DGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI 631
            D  L+  +  LSL E+L++ ++IG+  + IL    DI
Sbjct: 1398 DFELLCSYEMLSLDEQLDVAQQIGTTRSQILSNFSDI 1434


>gi|222628488|gb|EEE60620.1| hypothetical protein OsJ_14038 [Oryza sativa Japonica Group]
          Length = 1441

 Score =  275 bits (704), Expect = 4e-71,   Method: Compositional matrix adjust.
 Identities = 190/637 (29%), Positives = 301/637 (47%), Gaps = 94/637 (14%)

Query: 30   NRPLLL-VRTQHELLIYQAFRH-------------PKG------ALKLRFKKLKVLFVSD 69
            +RP L  +     LL Y AF +             P+G      A   R + L+   VS 
Sbjct: 857  SRPFLFGLLNDGTLLCYHAFSYEASESNVKRVPLSPQGSADHHNASDSRLRNLRFHRVSI 916

Query: 70   RSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGP 129
                  + P L R     ++  F+N+ GY+G+FL G  PAW+ +  R  LR HP   DGP
Sbjct: 917  DITSREDIPTLGR----PRITTFNNVGGYEGLFLSGTRPAWV-MVCRQRLRVHPQLCDGP 971

Query: 130  VSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLET 189
            +      HNVNC  GF+Y  ++  L+I  LP+  +YD+ WPV+KVPL  TPH + Y+ E 
Sbjct: 972  IEAFTVLHNVNCSHGFIYVTSQGFLKICQLPSAYNYDSYWPVQKVPLHGTPHQVTYYAEQ 1031

Query: 190  KTYCIVTSTA---------EPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFS-- 238
              Y ++ S               D    +  D ++ +   D+      V +F V +    
Sbjct: 1032 SLYPLIVSVPVVRPLNQVLSSMADQESVHHMDNDVTS--TDALHKTYTVDEFEVRILELE 1089

Query: 239  --PFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGR 296
                 WE   ++  P+  +E+ L ++ V++ +  T       +A+GT Y   EDV  RGR
Sbjct: 1090 KPGGHWE--TKSTIPMQLFENALTVRIVTL-HNTTTKENETLLAIGTAYVLGEDVAARGR 1146

Query: 297  ILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDN 356
            +LLF   +         ++N +  +Y+KE KG V+A+  + G L+ A G KI + +    
Sbjct: 1147 VLLFSFTK------SENSQNLVTEVYSKESKGAVSAVASLQGHLLIASGPKITLNKWTGA 1200

Query: 357  DLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSK 416
            +LT +AF D  +++ S+  VKN +L GD  +SI  L ++ +   LSL+A+D+        
Sbjct: 1201 ELTAVAFYDAPLHVVSLNIVKNFVLFGDIHKSIYFLSWKEQGSQLSLLAKDF-------- 1252

Query: 417  GYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNV 476
            G      +  +IDG                                S++  + SD DKNV
Sbjct: 1253 GSLDCFATEFLIDG--------------------------------STLSLVASDSDKNV 1280

Query: 477  VLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASL 536
             +F Y P+  ES  G +L+ + +FH+G H+  F +++  P+    +    +RF   + +L
Sbjct: 1281 QIFYYAPKMVESWKGQKLLSRAEFHVGAHITKFLRLQMLPTQ-GLSSEKTNRFALLFGNL 1339

Query: 537  DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTY--KGKGYYAGNPSRGII 594
            DG +G   P+ E  +RRL  LQ  +V    H  GLNPR+FR +   GKG+  G     II
Sbjct: 1340 DGGIGCIAPIDELTFRRLQSLQRKLVDAVPHVCGLNPRSFRQFHSNGKGHRPG--PDNII 1397

Query: 595  DGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI 631
            D  L+  +  LSL E+L++ ++IG+  + IL    DI
Sbjct: 1398 DFELLAHYEMLSLDEQLDVAQQIGTTRSQILSNFSDI 1434


>gi|402590016|gb|EJW83947.1| hypothetical protein WUBG_05142 [Wuchereria bancrofti]
          Length = 374

 Score =  274 bits (701), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 153/412 (37%), Positives = 240/412 (58%), Gaps = 40/412 (9%)

Query: 229 VSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYS 288
           + Q+ + L+SP  W+ +        E+E V C + V +  EGT+SG++ Y+A+GT  NY 
Sbjct: 1   MDQYKLQLYSPEDWKPVQHVEILFEEFEVVTCCEEVVLRSEGTVSGVQNYLAVGTACNYG 60

Query: 289 EDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKI 348
           E+V  RGRI++ +IIEVVPEPGQP +K++IK +Y KEQKGPVT++C   G+L+T +GQK+
Sbjct: 61  EEVLVRGRIIISEIIEVVPEPGQPTSKHRIKTLYDKEQKGPVTSLCSCNGYLLTGMGQKV 120

Query: 349 YIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDY 408
           +IW  KDN+L GI+F+D   YI  ++ V+NL L  D  RS+ALLRYQ EY+ LSL +RD 
Sbjct: 121 FIWLFKDNNLQGISFLDMHFYIHQLIGVRNLALACDMYRSLALLRYQEEYKALSLASRDM 180

Query: 409 KPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFM 468
                           R  +   +  +FL             I +K          MGF+
Sbjct: 181 ----------------RSDVQPPMAAQFL-------------IDNKQ---------MGFI 202

Query: 469 ISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSS--ISDAPGAR 526
           +SD+  N+ +F Y PE  ES GG +L  + + ++G  VN+F +++   SS  + +   + 
Sbjct: 203 MSDEAANIAIFNYLPETLESLGGEKLTLRAEINIGTVVNSFIRVKGHISSGFVENELFSL 262

Query: 527 SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYA 586
            R    +ASLDG+ G+  PL EK +RRL MLQ +M +      GLN +  R  + +    
Sbjct: 263 ERQSVLFASLDGSFGYLRPLTEKVFRRLHMLQQLMSSMVLQPAGLNAKGARAARPQRPNH 322

Query: 587 GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
              +R ++DG +V ++L LSL E+ ++ +K+G+    I+D+L +I  +++H+
Sbjct: 323 YLNTRNLVDGDVVMQYLHLSLPEKNDLARKLGTSRYHIIDDLNEICRVTAHY 374


>gi|255539681|ref|XP_002510905.1| cleavage and polyadenylation specificity factor cpsf, putative
            [Ricinus communis]
 gi|223550020|gb|EEF51507.1| cleavage and polyadenylation specificity factor cpsf, putative
            [Ricinus communis]
          Length = 1461

 Score =  273 bits (697), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 178/562 (31%), Positives = 276/562 (49%), Gaps = 68/562 (12%)

Query: 88   QMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLY 147
            ++  F+NI+G+QG FL G  PAW F+  R  LR HP   DG +      HNVNC  G +Y
Sbjct: 943  RITIFNNISGHQGFFLLGSRPAW-FMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIY 1001

Query: 148  FNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTA--EP---- 201
              ++  L+I  LP+  +YD  WPV+K+PLK TPH + Y  E   Y ++ S    +P    
Sbjct: 1002 VTSQGNLKICQLPSFSNYDNYWPVQKIPLKGTPHQVTYFPEKNLYPLIVSVPVHKPVNQV 1061

Query: 202  -STDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFS----WEEIPQTNFPLHEWE 256
             S+   +  G   E      D       V +F V +    +    W+   +   P+   E
Sbjct: 1062 LSSLVDQEVGHQIENHNLSSDELLQTYSVEEFEVRILESENGGGPWQ--TKATIPMQSSE 1119

Query: 257  HVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKN 316
            + L ++ V++ +  T       +A+GT Y   EDV  RGR+LLF +++   E  Q L   
Sbjct: 1120 NALTVRVVTL-FNATTKENETLLAIGTAYVQGEDVAARGRVLLFSVVKST-ENSQVL--- 1174

Query: 317  KIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDT-EVYIASMVS 375
             +  +Y+KE KG ++A+  + G L+ A G KI + +    +L G+AF D   +Y+ASM  
Sbjct: 1175 -VSEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGVAFYDAPPLYVASMNI 1233

Query: 376  VKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWK 435
            VKN IL+GD  +SI  L ++ +   LSL+A+D+        G      +  +IDG     
Sbjct: 1234 VKNFILLGDIHKSIYFLSWKEQGAQLSLLAKDF--------GSLDCFATEFLIDG----- 1280

Query: 436  FLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLI 495
                                       S++  ++SD+ KN+ +F Y P+  ES  G +L+
Sbjct: 1281 ---------------------------STLSLVVSDEQKNIQIFYYAPKMLESWKGQKLL 1313

Query: 496  KKTDFHLGQHVNTFFKIRCKPSSISDAPGA------RSRFLTWYASLDGALGFFLPLPEK 549
             + +FH+G H+  F ++    +S SD  GA       +RF   + +LDG++G   PL E 
Sbjct: 1314 SRAEFHVGAHITKFIRLSMLSTS-SDRSGAAPGPDKTNRFALLFGTLDGSIGCIAPLDEL 1372

Query: 550  NYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGE 609
             +RRL  LQ  +V    H  GLNPR+FR ++  G         I+D  L+  F  L L E
Sbjct: 1373 TFRRLQSLQRKLVDAVPHVAGLNPRSFRQFRSDGKVHRPGPESIVDCELLSHFEMLPLEE 1432

Query: 610  RLEICKKIGSKHNDILDELYDI 631
            +LEI +++G+    IL  L D+
Sbjct: 1433 QLEIAQQVGTTRAQILSNLNDL 1454


>gi|357162146|ref|XP_003579318.1| PREDICTED: probable cleavage and polyadenylation specificity factor
            subunit 1-like [Brachypodium distachyon]
          Length = 1442

 Score =  272 bits (695), Expect = 5e-70,   Method: Compositional matrix adjust.
 Identities = 186/655 (28%), Positives = 304/655 (46%), Gaps = 87/655 (13%)

Query: 11   AMDETIVQELLTVSLGLHG-----NRPLLL-VRTQHELLIYQAFRH-------------P 51
            ++ + +   +  V L +H      +RP L  +     LL YQA+ +             P
Sbjct: 834  SLKKEVANNIRIVELAMHRWSGQFSRPFLFGLLNDGTLLCYQAYCYEGLESNIKGTSLSP 893

Query: 52   KGALKL------RFKKLKVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCG 105
             G++ L      R K L+   VS       +   L R     ++  F+N+ GY+G+FL G
Sbjct: 894  DGSVDLGNASDSRLKNLRFHRVSVDITSREDISSLAR----PRITIFNNVGGYEGLFLSG 949

Query: 106  PHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSY 165
              P W+ +  R   R HP   DGP+      HNVNC  G +Y  ++  L+I  LP+  +Y
Sbjct: 950  TRPVWV-MVCRQRFRVHPQLCDGPIEAFTVLHNVNCSHGLIYVTSQGFLKICQLPSAYNY 1008

Query: 166  DAPWPVRKVPLKCTPHFLAYHLETKTYCIVTS--TAEPSTDYYKFNGEDKELVTDPRDSR 223
            D  WPV+K+PL  TPH + Y+ E   Y ++ S     P         + + +     D+ 
Sbjct: 1009 DNYWPVQKIPLHGTPHQVTYYAEQSLYPLIVSVPVVRPLNQVLSIMADQEMIHHMDNDAS 1068

Query: 224  FIPPLVSQFHVSLFSPFSWE-EIP------QTNFPLHEWEHVLCLKNVSMEYEGTLSGLR 276
                L   + V  F     E E P      ++  P+  +E+ L ++ V++ +  T     
Sbjct: 1069 SADDLQKTYTVEEFEVRVLELEKPGGRWETRSTIPMQSFENALTVRIVTL-HNTTTKENE 1127

Query: 277  GYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
              +A+GT Y   EDV  RGR+LLF   +         ++N +  +Y+KE KG V+A+  +
Sbjct: 1128 TLMAIGTAYVQGEDVAARGRVLLFSFTK------SENSQNLVTEVYSKESKGAVSAVASL 1181

Query: 337  AGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQP 396
             G LV A G KI + +   ++LT +AF D  +++ S+  VKN +L GD  +S+  L ++ 
Sbjct: 1182 QGHLVIASGPKITLNKWNGSELTAVAFYDAPLHVVSLNIVKNFVLFGDIHKSVYFLSWKE 1241

Query: 397  EYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHN 456
            +   L+L+A+D+        G      +  +IDG                          
Sbjct: 1242 QGSQLTLLAKDF--------GSLDCFATEFLIDG-------------------------- 1267

Query: 457  DILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKP 516
                  S++  ++SD DKN+ +F Y P+  ES  G +L+ + + H+G H+  F +++  P
Sbjct: 1268 ------STLSLVVSDSDKNLQIFYYAPKMVESWKGQKLLSRAELHVGAHMTKFLRLQMLP 1321

Query: 517  SSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAF 576
            +    A    +RF   + +LDG++G   P+ E  +RRL  LQ  +V   SH  GLNPR+F
Sbjct: 1322 AQ-GLASEKTNRFALLFGTLDGSIGCIAPVDELTFRRLQSLQRKLVDAVSHVCGLNPRSF 1380

Query: 577  RTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI 631
            R +K  G         IID  L+  +  LSL E+L++ ++IG+    IL    DI
Sbjct: 1381 RQFKSNGKAHRPGPDNIIDFELLTYYEILSLEEQLDMAQQIGTTRAQILSNFSDI 1435


>gi|296084122|emb|CBI24510.3| unnamed protein product [Vitis vinifera]
          Length = 1448

 Score =  269 bits (687), Expect = 4e-69,   Method: Compositional matrix adjust.
 Identities = 176/567 (31%), Positives = 270/567 (47%), Gaps = 68/567 (11%)

Query: 83   GVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCP 142
            G    +M  F NI G QG+FL G  P W F+  R  +R HP   DG +      HN+NC 
Sbjct: 925  GTTSPRMTVFKNIGGCQGLFLSGSRPLW-FMVFRERIRVHPQLCDGSIVAFTVLHNINCN 983

Query: 143  RGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTA--E 200
             G +Y  ++  L+I  LP   SYD  WPV+K+PLK TPH + Y  E   Y ++ S    +
Sbjct: 984  HGLIYVTSQGFLKICQLPAVSSYDNYWPVQKIPLKGTPHQVTYFAEKNLYPLIVSVPVLK 1043

Query: 201  P-----STDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFS----WEEIPQTNFP 251
            P     S+   +  G   E      D       V +F V +  P      W+   +   P
Sbjct: 1044 PLNHVLSSLVDQEAGHQLENDNLSSDELHRSYSVDEFEVRVLEPEKSGAPWQ--TRATIP 1101

Query: 252  LHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQ 311
            +   E+ L ++ V++ +  T       +A+GT Y   EDV  RGR+LLF + +       
Sbjct: 1102 MQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSVGKNTDN--- 1157

Query: 312  PLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDT-EVYI 370
              ++N +  IY+KE KG ++A+  + G L+ A G KI + +    +L G+AF D   +Y+
Sbjct: 1158 --SQNLVSEIYSKELKGAISAVASLQGHLLIASGPKIILHKWTGTELNGVAFFDAPPLYV 1215

Query: 371  ASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDG 430
             S+  VKN IL+GD  RSI  L ++ +   L+L+A+D+        G      +  +IDG
Sbjct: 1216 VSLNIVKNFILLGDIHRSIYFLSWKEQGAQLNLLAKDF--------GSLDCFATEFLIDG 1267

Query: 431  SLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNG 490
                                            S++  ++SD  KN+ +F Y P+  ES  
Sbjct: 1268 --------------------------------STLSLIVSDDQKNIQIFYYAPKMSESWK 1295

Query: 491  GHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGA------RSRFLTWYASLDGALGFFL 544
            G +L+ + +FH+G HV  F +++  P+S SD   A       +RF   + +LDG++G   
Sbjct: 1296 GQKLLSRAEFHVGAHVTKFLRLQMLPAS-SDRTSATQGSDKTNRFALLFGTLDGSIGCIA 1354

Query: 545  PLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQ 604
            PL E  +RRL  LQ  +V    H  GLNPR+FR ++  G         I+D  L+  +  
Sbjct: 1355 PLDELTFRRLQSLQKKLVDAVPHVAGLNPRSFRQFRSNGKAHRPGPDNIVDCELLCHYEM 1414

Query: 605  LSLGERLEICKKIGSKHNDILDELYDI 631
            L   E+LEI ++IG+    IL  L D+
Sbjct: 1415 LPFEEQLEIAQQIGTTRMQILSNLNDL 1441


>gi|225455571|ref|XP_002268371.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Vitis vinifera]
          Length = 1442

 Score =  269 bits (687), Expect = 4e-69,   Method: Compositional matrix adjust.
 Identities = 176/567 (31%), Positives = 270/567 (47%), Gaps = 68/567 (11%)

Query: 83   GVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCP 142
            G    +M  F NI G QG+FL G  P W F+  R  +R HP   DG +      HN+NC 
Sbjct: 919  GTTSPRMTVFKNIGGCQGLFLSGSRPLW-FMVFRERIRVHPQLCDGSIVAFTVLHNINCN 977

Query: 143  RGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTA--E 200
             G +Y  ++  L+I  LP   SYD  WPV+K+PLK TPH + Y  E   Y ++ S    +
Sbjct: 978  HGLIYVTSQGFLKICQLPAVSSYDNYWPVQKIPLKGTPHQVTYFAEKNLYPLIVSVPVLK 1037

Query: 201  P-----STDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFS----WEEIPQTNFP 251
            P     S+   +  G   E      D       V +F V +  P      W+   +   P
Sbjct: 1038 PLNHVLSSLVDQEAGHQLENDNLSSDELHRSYSVDEFEVRVLEPEKSGAPWQ--TRATIP 1095

Query: 252  LHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQ 311
            +   E+ L ++ V++ +  T       +A+GT Y   EDV  RGR+LLF + +       
Sbjct: 1096 MQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSVGKNTDN--- 1151

Query: 312  PLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDT-EVYI 370
              ++N +  IY+KE KG ++A+  + G L+ A G KI + +    +L G+AF D   +Y+
Sbjct: 1152 --SQNLVSEIYSKELKGAISAVASLQGHLLIASGPKIILHKWTGTELNGVAFFDAPPLYV 1209

Query: 371  ASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDG 430
             S+  VKN IL+GD  RSI  L ++ +   L+L+A+D+        G      +  +IDG
Sbjct: 1210 VSLNIVKNFILLGDIHRSIYFLSWKEQGAQLNLLAKDF--------GSLDCFATEFLIDG 1261

Query: 431  SLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNG 490
                                            S++  ++SD  KN+ +F Y P+  ES  
Sbjct: 1262 --------------------------------STLSLIVSDDQKNIQIFYYAPKMSESWK 1289

Query: 491  GHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGA------RSRFLTWYASLDGALGFFL 544
            G +L+ + +FH+G HV  F +++  P+S SD   A       +RF   + +LDG++G   
Sbjct: 1290 GQKLLSRAEFHVGAHVTKFLRLQMLPAS-SDRTSATQGSDKTNRFALLFGTLDGSIGCIA 1348

Query: 545  PLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQ 604
            PL E  +RRL  LQ  +V    H  GLNPR+FR ++  G         I+D  L+  +  
Sbjct: 1349 PLDELTFRRLQSLQKKLVDAVPHVAGLNPRSFRQFRSNGKAHRPGPDNIVDCELLCHYEM 1408

Query: 605  LSLGERLEICKKIGSKHNDILDELYDI 631
            L   E+LEI ++IG+    IL  L D+
Sbjct: 1409 LPFEEQLEIAQQIGTTRMQILSNLNDL 1435


>gi|10257491|dbj|BAB11613.1| cleavage and polyadenylation specificity factor subunit [Arabidopsis
            thaliana]
          Length = 1448

 Score =  268 bits (685), Expect = 6e-69,   Method: Compositional matrix adjust.
 Identities = 183/603 (30%), Positives = 287/603 (47%), Gaps = 72/603 (11%)

Query: 47   AFRHPKGALKLRFKKLKVLFVS-DRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCG 105
            A  +  G+ KLR   LK L +  D S R     G   GV   ++  F NI+G+QG FL G
Sbjct: 903  AALNSSGSSKLR--NLKFLRIPLDTSTR----EGTSDGVASQRITMFKNISGHQGFFLSG 956

Query: 106  PHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSY 165
              P W  L  R  LR H    DG ++     HNVNC  GF+Y  A+  L+I  LP+   Y
Sbjct: 957  SRPGWCMLF-RERLRFHSQLCDGSIAAFTVLHNVNCNHGFIYVTAQGVLKICQLPSASIY 1015

Query: 166  DAPWPVRKVPLKCTPHFLAYHLETKTYCIVTS--TAEP-----STDYYKFNGEDKELVTD 218
            D  WPV+K+PLK TPH + Y+ E   Y ++ S   ++P     S+   +  G+  +    
Sbjct: 1016 DNYWPVQKIPLKATPHQVTYYAEKNLYPLIVSYPVSKPLNQVLSSLVDQEAGQQLDNHNM 1075

Query: 219  PRDSRFIPPLVSQFHVSLFSPFS----WEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSG 274
              D       V +F + +  P      WE   +   P+   EH L ++ V++    T   
Sbjct: 1076 SSDDLQRTYTVEEFEIQILEPERSGGPWE--TKAKIPMQTSEHALTVRVVTLLNASTGEN 1133

Query: 275  LRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAIC 334
                +A+GT Y   EDV  RGR+LLF             ++N +  +Y++E KG ++A+ 
Sbjct: 1134 -ETLLAVGTAYVQGEDVAARGRVLLFSF-----GKNGDNSQNVVTEVYSRELKGAISAVA 1187

Query: 335  HVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDT-EVYIASMVSVKNLILVGDYARSIALLR 393
             + G L+ + G KI + +    +L G+AF D   +Y+ SM  VK+ IL+GD  +SI  L 
Sbjct: 1188 SIQGHLLISSGPKIILHKWNGTELNGVAFFDAPPLYVVSMNVVKSFILLGDVHKSIYFLS 1247

Query: 394  YQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGS 453
            ++ +   LSL+A+D++     +  +        +IDG                       
Sbjct: 1248 WKEQGSQLSLLAKDFESLDCFATEF--------LIDG----------------------- 1276

Query: 454  KHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIR 513
                     S++   +SD+ KN+ +F Y P+  ES  G +L+ + +FH+G HV+ F +++
Sbjct: 1277 ---------STLSLAVSDEQKNIQVFYYAPKMIESWKGLKLLSRAEFHVGAHVSKFLRLQ 1327

Query: 514  CKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNP 573
                 +S      +RF   + +LDG+ G   PL E  +RRL  LQ  +V    H  GLNP
Sbjct: 1328 M----VSSGADKINRFALLFGTLDGSFGCIAPLDEVTFRRLQSLQKKLVDAVPHVAGLNP 1383

Query: 574  RAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEA 633
             AFR ++  G    +    I+D  L+  +  L L E+LE+  +IG+    IL +L D+  
Sbjct: 1384 LAFRQFRSSGKARRSGPDSIVDCELLCHYEMLPLEEQLELAHQIGTTRYSILKDLVDLSV 1443

Query: 634  LSS 636
             +S
Sbjct: 1444 GTS 1446


>gi|30696088|ref|NP_199979.2| cleavage and polyadenylation specificity factor subunit 1
            [Arabidopsis thaliana]
 gi|290457637|sp|Q9FGR0.2|CPSF1_ARATH RecName: Full=Cleavage and polyadenylation specificity factor subunit
            1; AltName: Full=Cleavage and polyadenylation specificity
            factor 160 kDa subunit; Short=AtCPSF160; Short=CPSF 160
            kDa subunit
 gi|332008729|gb|AED96112.1| cleavage and polyadenylation specificity factor subunit 1
            [Arabidopsis thaliana]
          Length = 1442

 Score =  268 bits (684), Expect = 8e-69,   Method: Compositional matrix adjust.
 Identities = 183/603 (30%), Positives = 287/603 (47%), Gaps = 72/603 (11%)

Query: 47   AFRHPKGALKLRFKKLKVLFVS-DRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCG 105
            A  +  G+ KLR   LK L +  D S R     G   GV   ++  F NI+G+QG FL G
Sbjct: 897  AALNSSGSSKLR--NLKFLRIPLDTSTR----EGTSDGVASQRITMFKNISGHQGFFLSG 950

Query: 106  PHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSY 165
              P W  L  R  LR H    DG ++     HNVNC  GF+Y  A+  L+I  LP+   Y
Sbjct: 951  SRPGWCMLF-RERLRFHSQLCDGSIAAFTVLHNVNCNHGFIYVTAQGVLKICQLPSASIY 1009

Query: 166  DAPWPVRKVPLKCTPHFLAYHLETKTYCIVTS--TAEP-----STDYYKFNGEDKELVTD 218
            D  WPV+K+PLK TPH + Y+ E   Y ++ S   ++P     S+   +  G+  +    
Sbjct: 1010 DNYWPVQKIPLKATPHQVTYYAEKNLYPLIVSYPVSKPLNQVLSSLVDQEAGQQLDNHNM 1069

Query: 219  PRDSRFIPPLVSQFHVSLFSPFS----WEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSG 274
              D       V +F + +  P      WE   +   P+   EH L ++ V++    T   
Sbjct: 1070 SSDDLQRTYTVEEFEIQILEPERSGGPWE--TKAKIPMQTSEHALTVRVVTLLNASTGEN 1127

Query: 275  LRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAIC 334
                +A+GT Y   EDV  RGR+LLF             ++N +  +Y++E KG ++A+ 
Sbjct: 1128 -ETLLAVGTAYVQGEDVAARGRVLLFSF-----GKNGDNSQNVVTEVYSRELKGAISAVA 1181

Query: 335  HVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDT-EVYIASMVSVKNLILVGDYARSIALLR 393
             + G L+ + G KI + +    +L G+AF D   +Y+ SM  VK+ IL+GD  +SI  L 
Sbjct: 1182 SIQGHLLISSGPKIILHKWNGTELNGVAFFDAPPLYVVSMNVVKSFILLGDVHKSIYFLS 1241

Query: 394  YQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGS 453
            ++ +   LSL+A+D++     +  +        +IDG                       
Sbjct: 1242 WKEQGSQLSLLAKDFESLDCFATEF--------LIDG----------------------- 1270

Query: 454  KHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIR 513
                     S++   +SD+ KN+ +F Y P+  ES  G +L+ + +FH+G HV+ F +++
Sbjct: 1271 ---------STLSLAVSDEQKNIQVFYYAPKMIESWKGLKLLSRAEFHVGAHVSKFLRLQ 1321

Query: 514  CKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNP 573
                 +S      +RF   + +LDG+ G   PL E  +RRL  LQ  +V    H  GLNP
Sbjct: 1322 M----VSSGADKINRFALLFGTLDGSFGCIAPLDEVTFRRLQSLQKKLVDAVPHVAGLNP 1377

Query: 574  RAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEA 633
             AFR ++  G    +    I+D  L+  +  L L E+LE+  +IG+    IL +L D+  
Sbjct: 1378 LAFRQFRSSGKARRSGPDSIVDCELLCHYEMLPLEEQLELAHQIGTTRYSILKDLVDLSV 1437

Query: 634  LSS 636
             +S
Sbjct: 1438 GTS 1440


>gi|24415580|gb|AAN41460.1| putative cleavage and polyadenylation specificity factor 160 kDa
            subunit [Arabidopsis thaliana]
          Length = 1442

 Score =  268 bits (684), Expect = 9e-69,   Method: Compositional matrix adjust.
 Identities = 183/603 (30%), Positives = 287/603 (47%), Gaps = 72/603 (11%)

Query: 47   AFRHPKGALKLRFKKLKVLFVS-DRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCG 105
            A  +  G+ KLR   LK L +  D S R     G   GV   ++  F NI+G+QG FL G
Sbjct: 897  AALNSSGSSKLR--NLKFLRIPLDTSTR----EGTSDGVASQRITMFKNISGHQGFFLSG 950

Query: 106  PHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSY 165
              P W  L  R  LR H    DG ++     HNVNC  GF+Y  A+  L+I  LP+   Y
Sbjct: 951  SRPGWCMLF-RERLRFHSQLCDGSIAAFTVLHNVNCNHGFIYVTAQGVLKICQLPSASIY 1009

Query: 166  DAPWPVRKVPLKCTPHFLAYHLETKTYCIVTS--TAEP-----STDYYKFNGEDKELVTD 218
            D  WPV+K+PLK TPH + Y+ E   Y ++ S   ++P     S+   +  G+  +    
Sbjct: 1010 DNYWPVQKIPLKATPHQVTYYAEKNLYPLIVSYPVSKPLNQVLSSLVDQEAGQQLDNHNM 1069

Query: 219  PRDSRFIPPLVSQFHVSLFSPFS----WEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSG 274
              D       V +F + +  P      WE   +   P+   EH L ++ V++    T   
Sbjct: 1070 SSDDLQRTYTVEEFEIQILEPERSGGPWE--TKAKIPMQTSEHALTVRVVTLLNASTGEN 1127

Query: 275  LRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAIC 334
                +A+GT Y   EDV  RGR+LLF             ++N +  +Y++E KG ++A+ 
Sbjct: 1128 -ETLLAVGTAYVQGEDVAARGRVLLFSF-----GKNGDNSQNVVTEVYSRELKGAISAVA 1181

Query: 335  HVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDT-EVYIASMVSVKNLILVGDYARSIALLR 393
             + G L+ + G KI + +    +L G+AF D   +Y+ SM  VK+ IL+GD  +SI  L 
Sbjct: 1182 SIQGHLLISSGPKIILHKWNGTELNGVAFFDAPPLYVVSMNVVKSFILLGDVHKSIYFLS 1241

Query: 394  YQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGS 453
            ++ +   LSL+A+D++     +  +        +IDG                       
Sbjct: 1242 WKEQGSQLSLLAKDFESLDCFATEF--------LIDG----------------------- 1270

Query: 454  KHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIR 513
                     S++   +SD+ KN+ +F Y P+  ES  G +L+ + +FH+G HV+ F +++
Sbjct: 1271 ---------STLSLAVSDEQKNIQVFYYAPKMIESWKGLKLLSRAEFHVGAHVSKFLRLQ 1321

Query: 514  CKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNP 573
                 +S      +RF   + +LDG+ G   PL E  +RRL  LQ  +V    H  GLNP
Sbjct: 1322 M----VSSGADKINRFALLFGTLDGSFGCIAPLDEVTFRRLQSLQKKLVDAVPHVAGLNP 1377

Query: 574  RAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEA 633
             AFR ++  G    +    I+D  L+  +  L L E+LE+  +IG+    IL +L D+  
Sbjct: 1378 LAFRQFRSSGKARRSGPDSIVDCELLCHYEMLPLEEQLELAHQIGTTRYSILKDLVDLSV 1437

Query: 634  LSS 636
             +S
Sbjct: 1438 GTS 1440


>gi|224120960|ref|XP_002318462.1| predicted protein [Populus trichocarpa]
 gi|222859135|gb|EEE96682.1| predicted protein [Populus trichocarpa]
          Length = 1455

 Score =  268 bits (684), Expect = 9e-69,   Method: Compositional matrix adjust.
 Identities = 189/636 (29%), Positives = 295/636 (46%), Gaps = 84/636 (13%)

Query: 30   NRPLLL-VRTQHELLIYQA--FRHPKGALKL------------------RFKKLKVLFVS 68
            +RP L  + T   +L Y A  F  P G  KL                  R + L+ + V 
Sbjct: 863  SRPFLFGILTDGTILCYHAYLFEGPDGTSKLEDSVSAQNSVGASTISASRLRNLRFVRVP 922

Query: 69   DRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDG 128
              +    E        RI+    F NI+GYQG FL G  PAW F+  R  LR HP   DG
Sbjct: 923  LDTYTREETSSETSCQRITT---FKNISGYQGFFLSGSRPAW-FMVFRERLRVHPQLCDG 978

Query: 129  PVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLE 188
             +      H VNC  G +Y  ++  L+I  L +  SYD  WPV+K+PLK TPH + Y  E
Sbjct: 979  SIVAFTVLHTVNCNHGLIYVTSQGNLKICHLSSVSSYDNYWPVQKIPLKGTPHQVTYFAE 1038

Query: 189  TKTY-CIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPP------LVSQFHVSLFSPFS 241
               Y  IV+   +   +    +  D+E+     +             V +F V +  P +
Sbjct: 1039 RNLYPLIVSVPVQKPVNQVLSSLVDQEVGHQIENHNLSSEEIHRTYSVDEFEVRILEPSN 1098

Query: 242  WEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFD 301
                 +   P+   E+ L ++ VS+ +  +       +A+GT Y   EDV  RGRILLF 
Sbjct: 1099 GPWQVKATIPMQTSENALTVRMVSL-FNTSTKENETLLAVGTAYVQGEDVAARGRILLFS 1157

Query: 302  IIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGI 361
            +++  PE  Q L    +  +Y+KE KG ++A+  + G L+ A G KI + +    +LTG+
Sbjct: 1158 VVK-NPENSQIL----VSEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELTGV 1212

Query: 362  AFIDT-EVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYA 420
            AF D   +Y+ S+  VKN IL+GD  +SI  L ++ +   LSL+A+D+      S  +  
Sbjct: 1213 AFSDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLSLLAKDFASLDCFSTEF-- 1270

Query: 421  GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFM 480
                  +IDG                                S++  ++SD+ KNV +F 
Sbjct: 1271 ------LIDG--------------------------------STLSLVVSDEQKNVQIFY 1292

Query: 481  YQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGA-----RSRFLTWYAS 535
            Y P+  ES  G +L+ + +FH+G  V  F +++    S+  +  A      +RF   + +
Sbjct: 1293 YAPKMSESWKGQKLLSRAEFHVGALVTKFMRLQMLSPSLDRSGAAPVSDKTNRFALLFGT 1352

Query: 536  LDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIID 595
            LDG++G   PL E  +RRL  LQ  +V    H  GLNP++FR ++  G         I+D
Sbjct: 1353 LDGSIGCIAPLDELTFRRLQSLQKKLVDAVPHVAGLNPKSFRQFRSDGKAHRPGPESIVD 1412

Query: 596  GSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI 631
              ++  +  + L E++EI ++IG+    IL  L D+
Sbjct: 1413 CEMLSYYEMIPLEEQVEIAQQIGTTRAQILSNLNDL 1448


>gi|297792471|ref|XP_002864120.1| hypothetical protein ARALYDRAFT_495232 [Arabidopsis lyrata subsp.
            lyrata]
 gi|297309955|gb|EFH40379.1| hypothetical protein ARALYDRAFT_495232 [Arabidopsis lyrata subsp.
            lyrata]
          Length = 1444

 Score =  264 bits (674), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 180/603 (29%), Positives = 286/603 (47%), Gaps = 72/603 (11%)

Query: 47   AFRHPKGALKLR-FKKLKVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCG 105
            A  +  G+ KLR  K L++ F  D S R     G   GV   ++  F NI+G+QG FL G
Sbjct: 899  AALNSSGSSKLRNLKFLRIPF--DTSTR----EGTSDGVASQRITMFKNISGHQGFFLSG 952

Query: 106  PHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSY 165
              P W  L  R  LR H    DG ++     HNVNC  GF+Y  ++  L+I  LP+   Y
Sbjct: 953  SRPGWCMLF-RERLRFHSQLCDGSIAAFTVLHNVNCNHGFIYVTSQVVLKICQLPSASIY 1011

Query: 166  DAPWPVRKVPLKCTPHFLAYHLETKTYCIVTS--TAEP-----STDYYKFNGEDKELVTD 218
            D  WPV+K+PLK TPH + Y+ E   Y ++ S   ++P     S+   +  G+  +    
Sbjct: 1012 DNYWPVQKIPLKATPHQVTYYAEKNLYPLIVSYPVSKPINQVLSSLVDQEAGQQIDNHNL 1071

Query: 219  PRDSRFIPPLVSQFHVSLFSPFS----WEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSG 274
              D       V +F + +  P      WE   +   P+   EH L ++ V++    T   
Sbjct: 1072 SSDDLQRTYTVEEFEIQILEPERSGGPWE--TKATIPMQSSEHALTVRVVTLLNASTGEN 1129

Query: 275  LRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAIC 334
                +A+GT Y   EDV  RGR+LLF   +         ++N +  +Y++E KG ++A+ 
Sbjct: 1130 -ETLLAVGTAYVQGEDVAARGRVLLFSFGK-----NGDNSQNVVTEVYSRELKGAISAVA 1183

Query: 335  HVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDT-EVYIASMVSVKNLILVGDYARSIALLR 393
             + G L+ + G KI + +    +L G+AF D   +Y+ SM  VK  IL+GD  +SI  L 
Sbjct: 1184 SIQGHLLISSGPKIILHKWNGTELNGVAFFDAPPLYVVSMNVVKTFILLGDVHKSIYFLS 1243

Query: 394  YQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGS 453
            ++ +   LSL+A+D+        G      +  +IDG                       
Sbjct: 1244 WKEQGSQLSLLAKDF--------GSLDCFATEFLIDG----------------------- 1272

Query: 454  KHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIR 513
                     +++   +SD+ KN+ +F Y P+  ES  G +L+ + +FH+G HV  F +++
Sbjct: 1273 ---------NTLSLAVSDEQKNIQVFYYAPKMAESWKGQKLLSRAEFHVGSHVTKFLRLQ 1323

Query: 514  CKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNP 573
                 ++      +RF   + +LDG+ G   PL E  +RRL  LQ  +V    H  GLNP
Sbjct: 1324 M----VTSGADKTNRFALLFGTLDGSFGCIAPLDEVTFRRLQSLQKKLVDAVPHVAGLNP 1379

Query: 574  RAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEA 633
             +FR ++  G    +    IID  L+  +  L L E+LE+  +IG+  + IL  L ++  
Sbjct: 1380 HSFRQFRTSGKARRSGPDSIIDCELLCHYEMLPLEEQLELAHQIGTTRSVILLNLVELSV 1439

Query: 634  LSS 636
             +S
Sbjct: 1440 GTS 1442


>gi|290981010|ref|XP_002673224.1| CPSF A subunit [Naegleria gruberi]
 gi|284086806|gb|EFC40480.1| CPSF A subunit [Naegleria gruberi]
          Length = 1373

 Score =  258 bits (660), Expect = 6e-66,   Method: Compositional matrix adjust.
 Identities = 175/598 (29%), Positives = 283/598 (47%), Gaps = 105/598 (17%)

Query: 87   SQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFL 146
            SQ+  F NI GY G+F  G  P WLF T    LR HP     PV+T  P+H+ NCP GF+
Sbjct: 835  SQLIPFKNIGGYGGLFKTGEKPFWLF-TEHSNLRVHPTQSRDPVTTFTPYHHENCPHGFI 893

Query: 147  YFNAK-------SELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTA 199
            Y   K       S+L IS L  ++ ++A WP RK+ LK TP+ + +H +T T    TS  
Sbjct: 894  YLTDKEQDNKKQSKLHISSLNANVKFNAYWPQRKILLKSTPNVITFHQDTNTCLAFTSVP 953

Query: 200  EPSTDYYKFNGEDKELVTD--PRDSRFIPPLVSQFH-VSLFSPFSWEEIPQTNFPLHE-- 254
                         K ++ D  P      PP   Q H V LFS  +W+E+ +  F LHE  
Sbjct: 954  V------------KAILPDSIPFPEGKCPPPAEQKHTVKLFSGHNWQEMDKFEFDLHESA 1001

Query: 255  -WEHVLCLK------NVSMEYEGTLSG----LRGYIALGTNYNYSEDVTCRGRILLFDII 303
                V+ L       +  + +E  L+     L   +A+GT Y  SE   CRGR+LLFD+ 
Sbjct: 1002 VAAKVVYLSKEEYNDDTDISFEEPLNSRKQDLVSVVAVGTAYVQSERELCRGRLLLFDLD 1061

Query: 304  EVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYI----WQLKDNDLT 359
             ++    +     K+ +I +   KGP+T +  V  +++ +VG +IY     W+ K   +T
Sbjct: 1062 PILGRENE----YKLNLISSTSVKGPITTLEQVDRYIICSVGNRIYTYYFDWEEKRMHIT 1117

Query: 360  GIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYY 419
              +F DT+ Y AS+ +V+N I+ GD  +S++ LR++ +   L L+A+D +P Q  S  + 
Sbjct: 1118 --SFYDTQFYTASLNTVRNFIMFGDIYKSVSFLRWKEKGHRLILLAKDNRPLQVVSSEFL 1175

Query: 420  AGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLF 479
                                               +ND+L      G  + D  KN+ +F
Sbjct: 1176 V----------------------------------NNDLL------GLAVIDTSKNLQIF 1195

Query: 480  MYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKP---------SSISDAPGARSR-- 528
             Y P+ +ESN G  L+   DFH+G  +N+  +++ +           ++++ P    +  
Sbjct: 1196 SYLPQHQESNDGRNLVPVCDFHIGTLINSLIRMKVRELPDDNTIRLGNVNEKPKQSGKKD 1255

Query: 529  --------FLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYK 580
                        + S+DGA+G+  P+ E  +RRL  LQ  M T      GL+P++FR YK
Sbjct: 1256 ITKTNPNHQFILFGSVDGAIGYVAPINEVTHRRLFALQLKMYTQLEQAAGLHPKSFRLYK 1315

Query: 581  GKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
                   N  + IIDG L+W +  ++   + ++ ++IG+  ++IL  + ++   +  F
Sbjct: 1316 PLERTEYNYKKNIIDGQLIWNYANINTILQRDLARQIGTNSDNILRSIQELNQATFFF 1373


>gi|330799483|ref|XP_003287774.1| hypothetical protein DICPUDRAFT_32967 [Dictyostelium purpureum]
 gi|325082229|gb|EGC35718.1| hypothetical protein DICPUDRAFT_32967 [Dictyostelium purpureum]
          Length = 1453

 Score =  254 bits (649), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 173/631 (27%), Positives = 311/631 (49%), Gaps = 87/631 (13%)

Query: 19   ELLTVSLG-LHGNRPLLLVRTQ-HELLIYQAFRHPKGALKLRFKKLKVLFV----SDRSK 72
            E++ +SL  L+ ++P LL++ +  +L++Y++F+   G   LRFKK    F+    S+ SK
Sbjct: 879  EIVEISLEILNNSQPYLLLKNRIGDLIVYKSFKKENG--DLRFKKYNHNFILRDLSNNSK 936

Query: 73   RANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVST 132
              N       G R   +      +   GVF+ G  P W+F   +G +R H M  DG + +
Sbjct: 937  SINSD-----GYRKKSIVNIKLSSKNNGVFIGGQKPVWIF-NEKGYIRLHSMDFDGAIVS 990

Query: 133  LAPFHNVNCPRGFLYFNA-KSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKT 191
            L PFHN +CP GFLY+   K  ++I  L   ++++  + +R+VP+K + H +AYH E K 
Sbjct: 991  LKPFHNADCPNGFLYYTEDKQHIKIGYLNGLMNFENEYAIRRVPIKLSAHKIAYHNELKC 1050

Query: 192  YCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSP---FSWEEIPQT 248
            Y +V S  + + +  +     K ++TD +           F + +  P   +SW  I   
Sbjct: 1051 YVVVVSFPQVTQELEE--DSKKPILTDEK-----------FQIKIIDPTIDWSWRFID-- 1095

Query: 249  NFPLHEWEHVLCLKNVSMEYEGTLSGLRG--YIALGTNYNYSEDVTCRGRILLFDIIEVV 306
            +F L + E VL +K VS++++ +   ++   ++ +GT + + ED  C+GR+L+F+I+   
Sbjct: 1096 SFSLQDRETVLAMKIVSLKFKESDETIKSKPFLVIGTAFTFGEDTQCKGRVLVFEIVSHK 1155

Query: 307  PE-PGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFID 365
             +     L   ++ ++Y KEQKGPVTA+  V+G L+  +G K+ + Q     L  ++F D
Sbjct: 1156 TQFESDDLGTKRLNLLYEKEQKGPVTALSSVSGLLLMTIGPKLTVNQFLTGQLVTLSFHD 1215

Query: 366  TEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSR 425
             ++YI S+ ++K  I++GD  +S+  L++    + L  +++DY+     S  +       
Sbjct: 1216 AQIYICSISTIKTYIVIGDMYKSVYFLQWNG--KQLVPLSKDYQSLNIFSTEFIVNQ--- 1270

Query: 426  GIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEA 485
                                                  ++  ++SD DKN++LF + P  
Sbjct: 1271 -------------------------------------QTLSILVSDLDKNILLFSFDPAD 1293

Query: 486  RESNGGHRLIKKTDFHLGQHVNTFFKIRCKPS-SISDAPGARSRFLTWYASLDGALGFFL 544
              S  G  L+ K DFH+G ++  F +   K +   S      +  L ++ +LDG+L    
Sbjct: 1294 PTSRQGQMLLCKADFHIGSNIEKFVRTPMKFNIQSSSNGNNNNDQLVFFGTLDGSLNVLR 1353

Query: 545  PLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKG-KGYYAGNPS------RGIIDGS 597
            PL E+ Y+    LQ+ +  +     GLN + +R +K     +  +PS      + I+DG 
Sbjct: 1354 PLDERMYQLFYHLQSKLY-YLPQPAGLNAKQYRAFKSFSQNFHFSPSTIHQLPKYILDGD 1412

Query: 598  LVWKFLQLSLGERLEICKKIGSKHNDILDEL 628
            L+ KF++L+  ER  +   +GS  ++IL  L
Sbjct: 1413 LLSKFVKLNQKERRLLASSVGSNTDEILTAL 1443


>gi|19112233|ref|NP_595441.1| cleavage factor one Cft1 (predicted) [Schizosaccharomyces pombe
            972h-]
 gi|74582544|sp|O74733.1|CFT1_SCHPO RecName: Full=Protein cft1; AltName: Full=Cleavage factor two protein
            1
 gi|3738146|emb|CAA21247.1| cleavage factor one Cft1 (predicted) [Schizosaccharomyces pombe]
          Length = 1441

 Score =  249 bits (637), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 186/641 (29%), Positives = 303/641 (47%), Gaps = 78/641 (12%)

Query: 19   ELLTVSLGLHGNRPLLLVRTQ-HELLIYQAFRHP---KGALKLRFKKLKVLFVSDRSKRA 74
            ELL   LG     P L +R++ +E+ +Y+AF +    K    L F K+    ++ R  +A
Sbjct: 853  ELLVADLGDDFKEPHLFLRSRLNEITVYKAFLYSNTDKHKNLLAFAKVPQETMT-REFQA 911

Query: 75   NEQPGLPRGVRIS------------QMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAH 122
            N   G PR    +            +M     +  +  VF+ G  P  +  T     +  
Sbjct: 912  N--VGTPRDAESTMEKKASSSVDHLKMTALEVVGNHSAVFVTGRKPFLILSTLHSNAKFF 969

Query: 123  PMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHF 182
            P++ + P+ ++APFH  + P+G++Y +  S +RI        YD  WP +KV L    + 
Sbjct: 970  PISSNIPILSVAPFHAHHAPQGYIYVDENSFIRICKFQEDFEYDNKWPYKKVSLGKQING 1029

Query: 183  LAYHLETKTYCIVTSTAEPSTDYYKFNGEDKE---LVTDPRDSRFIPPLVSQFHVSLFSP 239
            +AYH     Y +   +A P    +K   ED      +TD  D     P+ +   + L SP
Sbjct: 1030 IAYHPTKMVYAV--GSAVPIE--FKVTDEDGNEPYAITDDNDYL---PMANTGSLDLVSP 1082

Query: 240  FSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILL 299
             +W  I    F   ++E  L +  V++E   T    + YIA+GT+    ED+  RG   L
Sbjct: 1083 LTWTVIDSYEF--QQFEIPLSVALVNLEVSETTKLRKPYIAVGTSITKGEDIAVRGSTYL 1140

Query: 300  FDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDND-L 358
            F+II+VVP+PG+P T++K+K++  +E KG V  +C V G+L++  GQK+ +  L+D D L
Sbjct: 1141 FEIIDVVPQPGRPETRHKLKLVTREEIKGTVAVVCEVDGYLLSGQGQKVIVRALEDEDHL 1200

Query: 359  TGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGY 418
             G++FID   Y  S   ++NL+L GD  +++  + +  E   ++L           SKG 
Sbjct: 1201 VGVSFIDLGSYTLSAKCLRNLLLFGDVRQNVTFVGFAEEPYRMTLF----------SKGQ 1250

Query: 419  YAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVL 478
             A N S                                D L +  ++ F+++D   N+ L
Sbjct: 1251 EALNVSAA------------------------------DFLVQGENLYFVVADTSGNLRL 1280

Query: 479  FMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAP---GARSRFLTWYAS 535
              Y PE  ES+ G RL+ + DFH+G +V T   I  K     +A         F     +
Sbjct: 1281 LAYDPENPESHSGERLVTRGDFHIG-NVITAMTILPKEKKHQNAEYGYDTGDDFSCVMVN 1339

Query: 536  LDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIID 595
             DG L   +P+ ++ YRRL ++QN +    +  GGLNP+++R          NP+R I+D
Sbjct: 1340 SDGGLQMLVPISDRVYRRLNIIQNYLANRVNTIGGLNPKSYRLITSPSNLT-NPTRRILD 1398

Query: 596  GSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI-EALS 635
            G L+  F  +S+  R E+  K G   + I+++L ++ EALS
Sbjct: 1399 GMLIDYFTYMSVAHRHEMAHKCGVPVSTIMNDLVELDEALS 1439


>gi|213407244|ref|XP_002174393.1| cleavage factor one Cft1 [Schizosaccharomyces japonicus yFS275]
 gi|212002440|gb|EEB08100.1| cleavage factor one Cft1 [Schizosaccharomyces japonicus yFS275]
          Length = 1431

 Score =  248 bits (634), Expect = 6e-63,   Method: Compositional matrix adjust.
 Identities = 178/648 (27%), Positives = 300/648 (46%), Gaps = 81/648 (12%)

Query: 19   ELLTVSLGLHGNRPLLLVRTQ-HELLIYQAF--RHP-KGALKLRFKKLKVLFVSDRSKRA 74
            E+L   LG       LL+R++ +E+ +Y+ F   +P     +LRF K+    ++  S   
Sbjct: 832  EVLATDLGDEAKEAHLLIRSRMNEITVYKPFVCSNPVTHKTELRFSKIPQEGMTRESTEC 891

Query: 75   N--------EQPGLPRG------------VRISQMRYFSNIAGYQGVFLCGPHPAWLFLT 114
            +        EQ   P+             V   +M     I  +  VF+ G  P +L  T
Sbjct: 892  SLQDLVAETEQENAPKDASEQKPQKSSSTVDKPRMVALQRIGNHSAVFITGAKPFFLLKT 951

Query: 115  SRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKV 174
            +    + HP+  +  + +LA FH  + P+G+++ +   ++ I      ++YD  W  +KV
Sbjct: 952  AHSVAKFHPLLSECRILSLASFHTEHAPKGYIFVDENYDINICRFQDDINYDHRWGYKKV 1011

Query: 175  PLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHV 234
             +  + H +AYH     Y I TST  P    Y+   E+  +V   ++     P  +   +
Sbjct: 1012 NVGRSVHGIAYHPTKMVYAIATSTLTP----YEVTDEEGNVVYPLKNEGEYLPRTNSGML 1067

Query: 235  SLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCR 294
             L SP +W  I +  F   ++E  LC++ V++E        + +IA+GT+    ED+  R
Sbjct: 1068 ELVSPLTWTVIDRYKF--LDYEIPLCVRLVNLEISDVTKLRKPFIAVGTSITKGEDIAVR 1125

Query: 295  GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLK 354
            G   LF+II+VVP+PG P T++K+K++  +E KG V  +  + G+L++  GQK+ +  L+
Sbjct: 1126 GSTYLFEIIDVVPQPGHPETRHKLKLVTREEIKGTVAVVSEINGYLLSGQGQKVIVRALE 1185

Query: 355  DND-LTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQP 413
            D D L G+AFID   Y     S++NL++ GD  +SI+ + +  E   ++L A+   P   
Sbjct: 1186 DEDHLVGVAFIDLGSYTVVAKSLRNLLIFGDIRQSISFVGFAEEPYRMTLFAKGQDPLSV 1245

Query: 414  NSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKD 473
            +S                                         D L +  S+ F ++D  
Sbjct: 1246 SSA----------------------------------------DFLVQGQSLYFAVADMR 1265

Query: 474  KNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGAR-----SR 528
             N+ +  Y PE  ES+ G RL+ + D H+G H+ T   I   P    D PG         
Sbjct: 1266 GNLRILAYDPENPESHSGERLVTRGDIHVG-HIIT--AIHLVPKMKKDRPGEVDYDEGDE 1322

Query: 529  FLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGN 588
            F     + DG+L    P+ E+ YRRL ++QN +       GGLNPR++R          N
Sbjct: 1323 FACITTNSDGSLQALCPISERVYRRLNIIQNYLANRIETVGGLNPRSYRLINTVSSL-NN 1381

Query: 589  PSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI-EALS 635
             +  I+DG L+  F  +S+  R E+  K G   + I+++L ++ EAL+
Sbjct: 1382 ATHRILDGGLIEHFSYMSVAHRQEMAYKCGVPISTIMNDLVELDEALN 1429


>gi|308805673|ref|XP_003080148.1| cleavage and polyadenylation specificity factor (ISS) [Ostreococcus
            tauri]
 gi|116058608|emb|CAL54315.1| cleavage and polyadenylation specificity factor (ISS), partial
            [Ostreococcus tauri]
          Length = 1473

 Score =  246 bits (629), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 179/616 (29%), Positives = 286/616 (46%), Gaps = 77/616 (12%)

Query: 30   NRPLLL-VRTQHELLIYQAFRHPKGAL-----------KLRFKKLKV------LFVSDRS 71
             RPLL  VR    LL+Y+ F  P G             +LRF ++ +      L V+   
Sbjct: 622  ERPLLTAVRGDGTLLLYRGFIVPAGTTCEGSEEPLARGELRFSRVNIDVEGSGLNVAGVG 681

Query: 72   KRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVS 131
                 +  L  G R++++       G QG+F+ GP+P WL +  R  + A P   +G + 
Sbjct: 682  VAGQVRDSLA-GTRLTRISNVGEGQGLQGIFVAGPNPLWLIV-RRSRVLALPTRGEGEIV 739

Query: 132  TLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKT 191
                FHNVNCP GF+   A   +RI  +P+ + Y+A WPVRK+ LKCTPH +AY  + K 
Sbjct: 740  AFTDFHNVNCPYGFILGTAVGGVRICQMPSKMHYEAAWPVRKIALKCTPHAVAYLPDFKL 799

Query: 192  YCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPP----LVSQFHVSLFSPFSWEEIPQ 247
            Y +VTS   P  D  + +GE+   ++  +  R        +  Q+ V L  P S + + Q
Sbjct: 800  YALVTSANVPWVD-REIDGENVHGLSLSKARRERAKAHDDMELQYSVRLLVPGSLDCVWQ 858

Query: 248  TNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVP 307
                L   EHV C++NV ++   T   L  Y+A+GT     ED  CRGR+ LF+++    
Sbjct: 859  HT--LEPGEHVQCVRNVQLKDINTGHSL-SYLAVGTAMPGGEDTPCRGRVYLFNMVWERD 915

Query: 308  EPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTE 367
                   + K ++   +E K   TA+  + G L+ AVG K+ +      +L  +AF DT 
Sbjct: 916  SESADGYRWKGQVCCVREAKMACTALEGLGGHLIVAVGTKLTVHTWDGRELNSVAFFDTP 975

Query: 368  VYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGI 427
            ++  S+  VKN ILVGD  + +   R++                                
Sbjct: 976  IHTVSINVVKNFILVGDLEKGLHFFRWK-------------------------------- 1003

Query: 428  IDGSLVWKFLQLSLG-ERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEAR 486
             D       +QLS   ER+++        ++ L + +++  + SD   N   F Y P++ 
Sbjct: 1004 -DTGFEKSLIQLSKDFERMDVVS------SEFLIDGTTLSLLGSDMSGNARTFGYDPKSI 1056

Query: 487  ESNGGHRLIKKTDFHLGQHVNTFFKI-----RCKPSSISDAPGARSRFLTWYASLDGALG 541
            ES  G +L+ +  +H+G  ++   +      + K +S    P   +RF  ++ +LDGALG
Sbjct: 1057 ESWKGQKLLPRAAYHVGSPISRMVRFNVEGSKSKMASTDGKPKGANRFAVFFGTLDGALG 1116

Query: 542  FFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRT---YKGKGYYAGNPSRGIIDGSL 598
             F+P     Y +LL +Q  + T      G NPR FRT   ++GK      P   ++DG L
Sbjct: 1117 IFMPTDPVTYEKLLAIQRELTTAVRSPIGCNPRTFRTPKVFEGKHVQLRAP-LDVLDGGL 1175

Query: 599  VWKFLQLSLGERLEIC 614
            + KF  L+  E+++I 
Sbjct: 1176 LSKFETLTFSEQVKIA 1191


>gi|428186188|gb|EKX55039.1| hypothetical protein GUITHDRAFT_160593 [Guillardia theta CCMP2712]
          Length = 2290

 Score =  241 bits (616), Expect = 7e-61,   Method: Compositional matrix adjust.
 Identities = 170/569 (29%), Positives = 271/569 (47%), Gaps = 80/569 (14%)

Query: 84   VRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTID--GPVSTLAPFHNVNC 141
            +R S++       G +GV +    PA + L  RG  R HP  +D    V + A F+N+ C
Sbjct: 1064 LRTSRLMPLGGAGGLEGVLIAARQPA-VVLFGRGLPRIHPWKLDRGEGVRSAARFNNLQC 1122

Query: 142  PRGFLYF------NAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIV 195
              G +         AK  L+I  +P  +S D PWP+R   +  T H +A+H  T  + +V
Sbjct: 1123 KDGIVCIADKGRDRAKGVLKICNIPEGISGDTPWPLRTKHVGMTVHHVAFHAATGCHVLV 1182

Query: 196  TSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQ-FHVSLFSPFSWEEIPQTNFPLHE 254
             S+ +   D  K  G  +           IPPL  + + V L +P+S E +    F    
Sbjct: 1183 VSSQQEIEDERKPEGTLEGA---------IPPLTEEKYEVQLRAPYSMELLDSYEFDFAN 1233

Query: 255  WEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCR--GRILLFDIIEVVPEPG-Q 311
             E  LCL+ V ++       L  ++A+GT +   E  T R  GRI +F++  VV E G +
Sbjct: 1234 GEKALCLQVVHLKNTRVKDSLLPFVAVGTGFQNGESETSRATGRIYVFEVTTVVGEEGYE 1293

Query: 312  PLTKNKIKMIYA----KEQKGPVTAICHVAGFLVTAVG--------QKIYIWQLKDNDLT 359
              T  KIK I+     ++ K PV+A+C + G+L+ A G         K+Y+++  D  L 
Sbjct: 1294 GRTSFKIKKIFTSADIQDIKAPVSALCQLEGYLLVAQGPNPGMIGGSKLYVYEWVDEKLV 1353

Query: 360  GIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYY 419
            G AF D  +YI ++ +VK  I+ GD   S+ LLR++ + R L L+A+D  P        Y
Sbjct: 1354 GRAFFDAHLYITTLKTVKFFIVFGDIRHSVHLLRWREDIRMLQLLAKDALPLS-----VY 1408

Query: 420  AGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLF 479
            A              +F+ +                       S+ G + SD+ KNV +F
Sbjct: 1409 AA-------------EFVVMG----------------------SNFGLLASDEQKNVQVF 1433

Query: 480  MYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGA 539
            ++ P + E     +LI + D H+G H+N F +    P       G R+     Y +LDG 
Sbjct: 1434 VFNPNSPEYR-RQQLICRADLHVGSHINKFIRW---PLPFRPTLGVRT--AAHYTTLDGG 1487

Query: 540  LGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLV 599
            +G  +P+PE++YRRLL LQN++VT   H  GLNPR++R YK         ++  +DG+L+
Sbjct: 1488 IGAIIPIPEQSYRRLLALQNLLVTAMPHYAGLNPRSWRLYKPAMCMKRRYAKNFLDGNLL 1547

Query: 600  WKFLQLSLGERLEICKKIGSKHNDILDEL 628
             ++L L L  ++++   +      IL +L
Sbjct: 1548 GRYLHLDLALQMQLSSALNQTREAILGDL 1576


>gi|145348791|ref|XP_001418827.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144579057|gb|ABO97120.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 1386

 Score =  240 bits (612), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 176/619 (28%), Positives = 286/619 (46%), Gaps = 76/619 (12%)

Query: 30   NRPLLL-VRTQHELLIYQAFRHPKGAL-----------KLRFKKLKV------LFVSDRS 71
             RPLL  VR    LL+Y+ F  P G             +LRF ++ V      L V+   
Sbjct: 794  ERPLLTAVRGDGTLLLYKGFIVPAGTTYEGQDEPLEKNELRFSRVNVDVEGSGLNVAGIG 853

Query: 72   KRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVS 131
                 +  L  G R++++       G QG+F+ GP+P WL +  R  + A P   +G V 
Sbjct: 854  AAGQLRDSLA-GARLTRIGNVGEGQGVQGIFVAGPNPLWLIV-RRSRVLALPTRGEGEVV 911

Query: 132  TLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKT 191
                FHNVNCP GF+   A   +RI  +P+ + Y+A WPVRKV LKCTPH + Y  + K 
Sbjct: 912  AFTVFHNVNCPHGFILGTALGGVRICQMPSKMHYEAAWPVRKVALKCTPHTITYLPDFKL 971

Query: 192  YCIVTSTAEP--STDYYKFNGEDKELVTDPRD-SRFIPPLVSQFHVSLFSPFSWEEIPQT 248
            Y +VTS   P    +  + N     L    R+ ++    +  Q+ V L  P S +   Q 
Sbjct: 972  YALVTSAPVPWVEREIEQDNVHGIALAKVRRERAKANDDMELQYSVRLLVPGSLDSAWQ- 1030

Query: 249  NFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPE 308
               L   EHV C++NV +    T   L   +A+GT     ED  CRGR++LF ++     
Sbjct: 1031 -HALEPGEHVQCVRNVQLRDINT-GALLSLLAVGTAMPGGEDTPCRGRVILFQMVWERDA 1088

Query: 309  PGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEV 368
                  + K ++   +E K   TA+  + G L+ AVG K+ +      +L  +AF DT +
Sbjct: 1089 ESMDGYRWKGQVCCVREAKMACTALSALDGHLIVAVGTKLTVHTWDGVELNSVAFFDTPI 1148

Query: 369  YIASMVSVKNLILVGDYARSIALLRYQPE--YRTLSLVARDYKPTQPNSKGYYAGNPSRG 426
            +  S+  VKN ILVGD  + +   R++     +++  +++D+                  
Sbjct: 1149 HTVSINVVKNFILVGDLEKGLHFFRWKANGFEKSIIQLSKDF------------------ 1190

Query: 427  IIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEAR 486
                            +R+++         + L + +++  + SD   N  +F Y P++ 
Sbjct: 1191 ----------------DRMDVVS------TEFLIDGATLSLLGSDMSGNARIFGYDPKSL 1228

Query: 487  ESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGAR----SRFLTWYASLDGALGF 542
            ES  G +L+ ++ +H+G  ++   +   + ++   APG R    +R   ++ +LDGALG 
Sbjct: 1229 ESWKGQKLLVRSAYHVGSPISRMVRFNVEGTTAKAAPGERPKGTNRHAVFFGTLDGALGI 1288

Query: 543  FLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRT---YKGKGYYAGNPSRGIIDGSLV 599
            F+P  E  Y +L  LQ  + T      G NPR FRT   ++GK      P   ++DG L+
Sbjct: 1289 FMPTDEPTYAKLHALQRELNTTVRSPIGCNPRTFRTPKVFEGKHVQLLAP-LDVLDGGLL 1347

Query: 600  WKFLQLSLGERLEICKKIG 618
             KF  L+  E+  + ++ G
Sbjct: 1348 SKFETLTFTEQRAVAERSG 1366


>gi|440793679|gb|ELR14857.1| CPSF A subunit region protein [Acanthamoeba castellanii str. Neff]
          Length = 1477

 Score =  234 bits (598), Expect = 7e-59,   Method: Compositional matrix adjust.
 Identities = 168/578 (29%), Positives = 269/578 (46%), Gaps = 90/578 (15%)

Query: 84   VRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPR 143
            +R  ++ YF  +    GVF+ G  PAW+F   RG  R +PM +D  V   A FHN NCP 
Sbjct: 964  LRYRRIHYFGTVGKSNGVFISGSAPAWVF-AQRGYARLYPMKLDTFVRAFAEFHNANCPH 1022

Query: 144  GFLYFNAKSELRISVLPTH---LSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAE 200
            GF+YFN +  L+I  LP     + ++ P  VRKVPL  TP  +AYH  ++TY +  +T  
Sbjct: 1023 GFIYFNHEGTLKICQLPAAEGAIHWELPGVVRKVPLGRTPREIAYHPPSRTYVVALATPV 1082

Query: 201  P-------STDYYK--------------FNGEDKELVTDPRDSRFIPPLVSQFHVSLFSP 239
                     TD  +                 E+K+    PR+   I  +  +  + L SP
Sbjct: 1083 TTVVPTPPETDMERQEREREEEESREMGIEPEEKQRDMGPRE---IAMMEERHELHLISP 1139

Query: 240  FSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILL 299
             +W+ +      L   EHVL           TLS L+    LG NY+          +L+
Sbjct: 1140 RTWQILHHVE--LEPKEHVL-----------TLSVLK----LGDNYSQVNRELRPPHLLI 1182

Query: 300  FDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLT 359
            ++I +V  E    LT    K +  K  KGPV+A   + G+L+ AVG KI+++        
Sbjct: 1183 YEI-DVTGEEQCKLTMAYQKPMKEKPMKGPVSAAASLQGYLIIAVGPKIWVFNFDGGSTE 1241

Query: 360  GIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYY 419
             +AF D   YI S+ ++KN +L GD  +SI  LR++     L+L+A+D       +  Y 
Sbjct: 1242 AVAFYDAPHYIVSIKTLKNFVLCGDIYKSIFFLRWKDSASQLALLAKDVGRVSVFATEY- 1300

Query: 420  AGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLF 479
                   ++D                        K N        +  ++SD+ +N+ + 
Sbjct: 1301 -------VVD------------------------KQN--------LALLMSDERQNLQVT 1321

Query: 480  MYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGA 539
             Y P   ES GG  L+ + DF++GQ +N F ++   P ++     +  R   W+ +L G 
Sbjct: 1322 AYAPHTAESRGGQLLVPRGDFNVGQSINKFVRL---PMTLPSGTTSLQRHALWFGTLSGG 1378

Query: 540  LGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLV 599
            +G+  P+ E  +RRL MLQ+ +++   HT GL+P+A+R  + +     N    I+DG L+
Sbjct: 1379 VGYLAPMDESVFRRLGMLQSALLSAIPHTAGLHPQAYRALQ-RERLLRNRKHTILDGLLL 1437

Query: 600  WKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSH 637
             ++L L    + +I  K+G+    IL++L  I    +H
Sbjct: 1438 SRYLALDSATQQQIALKLGTSRERILNDLQGIPQSVTH 1475


>gi|33411762|emb|CAD58786.1| cleavage and polyadenylation specificity factor 1 [Bos taurus]
          Length = 880

 Score =  234 bits (598), Expect = 7e-59,   Method: Compositional matrix adjust.
 Identities = 114/219 (52%), Positives = 145/219 (66%), Gaps = 15/219 (6%)

Query: 16  IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGALKLRFKKL--------- 62
           +V+E+L V+LG    RP LLV    ELLIY+AF H     +G LK+RFKK+         
Sbjct: 663 LVKEVLLVALGSRQRRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKVPHNINFREK 722

Query: 63  -KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRA 121
                       + E+   PRG R+++ RYF +I GY GVF+CGP P WL +T RG LR 
Sbjct: 723 KPKPSKKKAEGGSTEEGTGPRG-RVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRL 781

Query: 122 HPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPH 181
           HPM IDGP+ + APFHN+NCPRGFLYFN + ELRISVLP +LSYDAPWPVRK+PL+CT H
Sbjct: 782 HPMGIDGPIDSFAPFHNINCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAH 841

Query: 182 FLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPR 220
           ++AYH+E+K Y + TST+ P T   +  GE+KE  T  R
Sbjct: 842 YVAYHVESKVYAVATSTSTPCTRVPRMTGEEKEFETIER 880


>gi|320040273|gb|EFW22206.1| hypothetical protein CPSG_00105 [Coccidioides posadasii str.
            Silveira]
          Length = 1387

 Score =  234 bits (596), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 187/646 (28%), Positives = 314/646 (48%), Gaps = 95/646 (14%)

Query: 17   VQELLTVSLGLHGNR-PLLLVRT-QHELLIYQAFRHPKGAL---KLRFKKLKVLFVS--D 69
            + E+L   LG   +R P +++RT  ++L++YQ + HPK +L   +LRF K+   F+   D
Sbjct: 804  LSEVLIADLGDSISRQPYIILRTANNDLILYQPY-HPKTSLDKQELRFVKIIDHFLPRFD 862

Query: 70   RSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDG- 128
             S +A     +PR      +R +S+I GY+ VF+ G +P ++  +S      H + + G 
Sbjct: 863  PSPKAY----MPRS---KFLRAYSDICGYKTVFMSGSNPCFVMKSSTSS--PHVLRLRGE 913

Query: 129  PVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLE 188
             VS+L+ FH   C +GF Y +A + +R+  LP +  +D  W  RKV +      + Y   
Sbjct: 914  AVSSLSSFHIPACEKGFAYVDASNMVRMCRLPGNTRFDNSWVTRKVHVGDQIDCVEYFAH 973

Query: 189  TKTYCIVTSTAEPSTDYYKFN---GEDKELVTDPRDS--RFIPPLVSQFHVSLFSPFSWE 243
            ++ Y + +S        +K +    ED E+  + R     F+P L  +  + L SP +W 
Sbjct: 974  SEIYALGSS--------HKVDFKLPEDDEIHPEWRSEVISFMPQL-ERGCIKLLSPRTWS 1024

Query: 244  EIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDII 303
             +   ++ L + E V+C+K ++ME       ++  + +GT     ED+T RG I +F+II
Sbjct: 1025 VV--DSYELGDAERVMCMKTINMEISEITHEMKDMLVVGTATVRGEDITPRGSIYVFEII 1082

Query: 304  EVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA--GFLVTAVGQKIYIWQLK-DNDLTG 360
            EV P+P +P T  K+K+    + KG VTA+  +   GFL+ A GQK  +  LK D  L  
Sbjct: 1083 EVAPDPDRPETNRKLKIFAKDDVKGAVTAVSGIGGQGFLIMAQGQKCMVRGLKEDGSLLP 1142

Query: 361  IAFIDTEVYIASMVSVK--NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGY 418
            +AF+D + Y+  +  ++   L ++GD  + I    Y  E   L+L  +D +  Q  +  +
Sbjct: 1143 VAFMDMQCYVKVLKELQGTGLCIMGDALKGIWFAGYSEEPYRLTLFGKDNEYLQVIAADF 1202

Query: 419  YAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVL 478
                                L  G+RL I                    +++D D  + +
Sbjct: 1203 --------------------LPDGKRLYI--------------------LVADDDCTIHV 1222

Query: 479  FMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSIS-DAPGARS--------RF 529
              Y PE   S+ G RL+ ++ FH+G   +T   +    SS S D PG            +
Sbjct: 1223 LEYDPEDPTSSKGDRLLHRSSFHMGHFTSTMTLLPQHSSSPSADDPGEDDMDVDYVPKSY 1282

Query: 530  LTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNP 589
                 S +G++G   PL E +YRRL  LQ+ +VT   H  GLNP+A+R  +  G+     
Sbjct: 1283 QVLVTSQEGSIGVVTPLTEDSYRRLSALQSQLVTSMEHPCGLNPKAYRAVESDGFGG--- 1339

Query: 590  SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALS 635
             RGI+DG+L+ ++L + +  + EI  ++G+   DI     D+E +S
Sbjct: 1340 -RGIVDGNLLLRWLDMGVQRKAEIAGRVGA---DIESIRVDLEKIS 1381


>gi|412986884|emb|CCO15310.1| predicted protein [Bathycoccus prasinos]
          Length = 1595

 Score =  233 bits (595), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 177/620 (28%), Positives = 280/620 (45%), Gaps = 86/620 (13%)

Query: 31   RPLLLV-RTQHELLIYQAFRHPKGALKLRFKKLKVLFVSDRSKRANEQPGLPRGVRISQM 89
            RPLL   R    +L YQAF+ P  + +LRF ++ +   +  S+  N    +  G R++++
Sbjct: 1024 RPLLTCFRADGSVLAYQAFKSPS-SNELRFARVPIEIETAGSELTNNDVSVQGGSRLTRI 1082

Query: 90   RYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVS-TLAPFHNVNCPRGFLYF 148
                +  G  GVF+ G +P WL +  RG + A P   +G      APFHNVNCP+GF+  
Sbjct: 1083 ENIGDGRGIAGVFVSGLNPIWLIV-RRGRVLALPTRGEGGARIAFAPFHNVNCPKGFILA 1141

Query: 149  NAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDY--- 205
              +  +R+  LP  +  +A WPVRK+ L+CTP  + Y  + K Y +VTS + P  D+   
Sbjct: 1142 TNEGGIRVCRLPGKMHIEAQWPVRKLALRCTPRAITYMNDFKLYALVTSASVPWKDFEID 1201

Query: 206  ---------YKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWE 256
                     Y+F  E          ++    +V QF + L  P + E   Q    +   E
Sbjct: 1202 ETDSHARALYRFRKE---------KAKSEGNVVQQFAIRLLVPGTLETAWQK--AVEPGE 1250

Query: 257  HVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKN 316
            H+LC+KNV +  + T   L   +A+GT     ED  CRGRILLF I+      G    + 
Sbjct: 1251 HILCVKNVQIRDQST-GALLSMLAIGTAMPGGEDTPCRGRILLFAIMWERARDGGVRWRG 1309

Query: 317  KIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSV 376
            ++K    K  K   +AI  V G  + A+G K+         L  IAF DT +Y  ++  V
Sbjct: 1310 ELKC--EKPSKMACSAIESVDGTFMVAIGTKLTAHSWDGKHLNPIAFYDTPLYTTTLCCV 1367

Query: 377  KNLILVGDYARSIALLRYQPEY--RTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVW 434
            KN +L GD  +SI  +R++     +TLS + +DY+     +  +        +IDG    
Sbjct: 1368 KNFLLCGDLHKSIRFVRWKDSQGEKTLSQLGKDYEVLDCIASEF--------MIDG---- 1415

Query: 435  KFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRL 494
                                         ++  + +D + N  +F Y P+  ES  G +L
Sbjct: 1416 ----------------------------GTLSLLAADANGNAHVFQYAPKLAESWKGDKL 1447

Query: 495  IKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRL 554
            + K+ +H G  +    + +     I      ++R   ++ S DG LG F P+ E  +  L
Sbjct: 1448 LPKSAYHAGSLIRKMVRFQ-----IGVGEQKQNRHAVFFGSSDGGLGIFSPVDEHTFLNL 1502

Query: 555  LMLQNVMVTHTSHTG------GLNPRAFRTYK-GKGYYA-GNPSRGIIDGSLVWKFL-QL 605
              LQ+ M ++   +       GLN + +R  K  +G  A   P R I+DG L+ KF   L
Sbjct: 1503 EKLQDAMRSNIVASSNSINPLGLNSKTYRALKSSEGSVARQTPPRTIVDGGLLSKFEHSL 1562

Query: 606  SLGERLEICKKIGSKHNDIL 625
            S+  +  +  K G   +  L
Sbjct: 1563 SITAQTRVAAKAGLTRDQAL 1582


>gi|315045910|ref|XP_003172330.1| serine/threonine protein kinase [Arthroderma gypseum CBS 118893]
 gi|311342716|gb|EFR01919.1| serine/threonine protein kinase [Arthroderma gypseum CBS 118893]
          Length = 1397

 Score =  229 bits (585), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 177/645 (27%), Positives = 302/645 (46%), Gaps = 88/645 (13%)

Query: 4    FRSHSPSAMDETIVQELLTVSLG--LHGNRPLLLVRTQHE-LLIYQAFR--HPKGALKLR 58
            + S S   ++   + ELL   LG  +H   P +++RT+H+ L++Y+ +R     G  KL+
Sbjct: 793  YESSSRRPVNRETLTELLVADLGDAIH-KSPYMILRTKHDDLVLYEPYRITGENGRSKLQ 851

Query: 59   F-KKLKVLFVSDRS-----KRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLF 112
            F K +  + +  R+     K  N  P   +      +R  S++ GY+ VF+ G +P ++ 
Sbjct: 852  FIKAVNHVVMGPRTNQPMNKDINRSPSPSK-----LLRALSDVCGYKTVFMSGQNPCFIL 906

Query: 113  LTSRGELRAHPMTIDG-PVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPV 171
             ++    R + + + G  V +L  FH   C RGF Y +  + +R+S LP++  +D+ W  
Sbjct: 907  KSAIA--RPNVLRLRGKAVQSLTGFHIAACERGFAYVDEDNVIRMSRLPSNTRFDSAWAT 964

Query: 172  RKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDS--RFIPPLV 229
            RK+PL      + Y   +++Y I TST E     +K   ED E  T+ R+    F+P L 
Sbjct: 965  RKIPLGEQVDCIVYSSASESYVIGTSTKED----FKLP-EDDESHTEWRNEFITFLPQL- 1018

Query: 230  SQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSE 289
             +  V L  P +W  I    + +   E + C+K + +E   T    +  + +G+     E
Sbjct: 1019 DRGTVKLLEPKNWSAI--DIYEVEPAERITCIKIIRLEISETTHERKDMVVVGSAVAKGE 1076

Query: 290  DVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA--GFLVTAVGQK 347
            D+  +G I +F+II+VVP+P  P    K+K+   +E KG VTA+  +   GFL+ A GQK
Sbjct: 1077 DIVPKGCIRVFEIIDVVPDPDHPEKNKKLKLFAREEVKGAVTAVSGIGGQGFLIVAQGQK 1136

Query: 348  IYIWQLK-DNDLTGIAFIDTEVYIASMVSVK--NLILVGDYARSIALLRYQPEYRTLSLV 404
              +  LK D  L  IAF DT+ Y+  +  +K   + ++GD  + +    Y  E   L L 
Sbjct: 1137 CMVRGLKEDGSLLPIAFKDTQCYVNVLKELKGTGMCIIGDAFKGLWFTGYSEEPYKLDLF 1196

Query: 405  ARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSS 464
             ++              N +  ++D                           D L + + 
Sbjct: 1197 GKE--------------NENLAVVDA--------------------------DFLPDGNK 1216

Query: 465  MGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSIS---- 520
            +  +++D D N+ +  Y PE   S+ G RL+ ++ FH G   +T   +     ++S    
Sbjct: 1217 LYILVADDDCNLHVLQYDPEDPSSSKGDRLLHRSVFHTGHFASTMTLLPHGSHTLSSPVD 1276

Query: 521  ------DAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPR 574
                  D P   S++        G++G   PL E +YRRLL LQ+ +V    H  GLNPR
Sbjct: 1277 EDAMDTDLPPPPSKYQVLITFQTGSIGVISPLNEDSYRRLLALQSQLVNALEHPCGLNPR 1336

Query: 575  AFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGS 619
             +R  +  G       RG+IDG+L+ ++L +    + EI  ++G+
Sbjct: 1337 GYRAVESDGMGG---QRGMIDGNLLLRWLDMGAQRKAEIAGRVGA 1378


>gi|119195757|ref|XP_001248482.1| hypothetical protein CIMG_02253 [Coccidioides immitis RS]
 gi|121769680|sp|Q1E5B0.1|CFT1_COCIM RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two protein
            1
 gi|392862316|gb|EAS37050.2| protein CFT1 [Coccidioides immitis RS]
          Length = 1387

 Score =  229 bits (583), Expect = 4e-57,   Method: Compositional matrix adjust.
 Identities = 184/646 (28%), Positives = 311/646 (48%), Gaps = 95/646 (14%)

Query: 17   VQELLTVSLGLHGNR-PLLLVRTQH-ELLIYQAFRHPKGAL---KLRFKKLKVLFVS--D 69
            + E+L   LG   +R P +++RT + +L++YQ + HPK +L   +LRF K+   F+   D
Sbjct: 804  LSEVLIADLGDSISRQPYMILRTANDDLILYQPY-HPKTSLDKPELRFVKIIDHFLPRFD 862

Query: 70   RSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDG- 128
             S +A     +P       +R +S+I GY+ VF+ G +P ++  +S      H + + G 
Sbjct: 863  PSPKAY----MPHS---KFLRAYSDICGYKTVFMSGSNPCFVMKSSTSS--PHVLRLRGE 913

Query: 129  PVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLE 188
             VS+L+ FH   C +GF Y +A + +R+  LP++  +D  W  RKV +      + Y   
Sbjct: 914  AVSSLSSFHIPACEKGFAYVDASNMVRMCRLPSNTRFDNSWVTRKVHVGDQIDCVEYFAH 973

Query: 189  TKTYCIVTSTAEPSTDYYKFN---GEDKELVTDPRDS--RFIPPLVSQFHVSLFSPFSWE 243
            ++ Y + +S        +K +    ED E+  + R     F+P L  +  + L SP +W 
Sbjct: 974  SEIYALGSS--------HKVDFKLPEDDEIHPEWRSEVISFMPQL-ERGCIKLLSPRTWS 1024

Query: 244  EIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDII 303
             +   ++ L + E V+C+K ++ME       ++  + +GT     ED+T RG I +F+II
Sbjct: 1025 VV--DSYELGDAERVMCMKTINMEISEITHEMKDMLVVGTATVRGEDITPRGSIYVFEII 1082

Query: 304  EVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA--GFLVTAVGQKIYIWQLK-DNDLTG 360
            EV P+P +P T  K+K+    + KG VTA+  +   GFL+ A GQK  +  LK D  L  
Sbjct: 1083 EVAPDPDRPETNRKLKIFAKDDVKGAVTAVSGIGGQGFLIMAQGQKCMVRGLKEDGSLLP 1142

Query: 361  IAFIDTEVYIASMVSVK--NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGY 418
            +AF+D + Y+  +  ++   L ++GD  + I    Y  E   L+L  +D +  Q  +  +
Sbjct: 1143 VAFMDMQCYVKVLKELQGTGLCIMGDALKGIWFAGYSEEPYRLTLFGKDNEYLQVIAADF 1202

Query: 419  YAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVL 478
                                L  G+RL I                    +++D D  + +
Sbjct: 1203 --------------------LPDGKRLYI--------------------LVADDDCTIHV 1222

Query: 479  FMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSIS---------DAPGARSRF 529
              Y PE   S+ G RL+ ++ FH G   +T   +    SS S         D       +
Sbjct: 1223 LEYDPEDPTSSKGDRLLHRSSFHTGHFTSTMTLLPEHSSSPSADDPEEDDMDVDYVPKSY 1282

Query: 530  LTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNP 589
                 S +G++G   PL E +YRRL  LQ+ +VT   H  GLNP+A+R  +  G+     
Sbjct: 1283 QVLVTSQEGSIGVVTPLTEDSYRRLSALQSQLVTSMEHPCGLNPKAYRAVESDGFGG--- 1339

Query: 590  SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALS 635
             RGI+DG+L+ ++L + +  + EI  ++G+   DI     D+E +S
Sbjct: 1340 -RGIVDGNLLLRWLDMGVQRKAEIAGRVGA---DIESIRVDLETIS 1381


>gi|66812672|ref|XP_640515.1| CPSF domain-containing protein [Dictyostelium discoideum AX4]
 gi|60468551|gb|EAL66554.1| CPSF domain-containing protein [Dictyostelium discoideum AX4]
          Length = 1628

 Score =  227 bits (578), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 172/677 (25%), Positives = 314/677 (46%), Gaps = 141/677 (20%)

Query: 17   VQELLTVSLGLHGNRP--LLLVRTQHELLIYQAFRHPKGALKLRFKKLKVLF-----VSD 69
            + +++ +SL    N    L +     +L+IY++F+  K   +LRFKK    F     V++
Sbjct: 1024 ILDIVEISLHNFNNSDPYLFMFNKIGDLIIYKSFKREKNG-ELRFKKYNHSFILRDSVTE 1082

Query: 70   RSKRANEQP-------------------------GLPRGVRISQMRYFSNIAGYQGVFLC 104
              ++  E+                           L R  RI +   FS+I+G +G+F+ 
Sbjct: 1083 FYQKQQEKELLNGMDDDDDMDDEKKKKKEEEEEENLNRQKRIFE---FSSISGKRGLFIG 1139

Query: 105  GPHPAWLFLTSRGELRAHPMTIDG----------------PVSTLAPFHNVNCPRGFLYF 148
            G  P W F   +G LR H M                     V T   F+N++C  GF+YF
Sbjct: 1140 GKKPIWAF-CEKGYLRLHSMDSSDNSNSNNSNNNNNNNSNTVETFTSFNNISCQDGFIYF 1198

Query: 149  NAKSE-LRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYK 207
            + + + ++I  L T ++++    +R++P K + H +AYH E K Y ++ S  + + +  +
Sbjct: 1199 SKEKDVIKICTLSTLMNFENDIAIRRIPTKNSCHKIAYHSEAKCYVVIVSFPQVTQELQE 1258

Query: 208  FNGEDKELVTDPRDSRFIPPLVSQFHVSLFSP---FSWEEIPQTNFPLHEWEHVLCLKNV 264
                 K ++TD +           F + L  P   ++W+ I   +F L + E VL +K V
Sbjct: 1259 --DSKKPILTDDK-----------FQIKLIDPTIDWNWKFID--SFSLQDRETVLAMKIV 1303

Query: 265  SMEYE--GTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPE-PGQPLTKNKIKMI 321
            S+++     ++  R ++ +GT + + ED  C+GR+L+F+I+    +   + L + ++ ++
Sbjct: 1304 SLKFTEPDGITRARPFLVIGTAFTFGEDTQCKGRVLVFEIVSHKTQFESEELGEKRLNLL 1363

Query: 322  YAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLIL 381
            Y KEQKGPVTA+  V G L+  +G K+ + Q     L  ++F D ++YI S+ ++KN I+
Sbjct: 1364 YEKEQKGPVTALSSVNGLLLMTIGPKLTVNQFYTGSLVTLSFYDAQIYICSICTIKNYIV 1423

Query: 382  VGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSL 441
            +GD  +S+  L+++ + +TL+L+++DY+     S  +     +  I              
Sbjct: 1424 IGDMYKSVYFLQWK-DNKTLNLLSKDYQALNIFSTEFIVNQKTLSI-------------- 1468

Query: 442  GERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFH 501
                                      ++SD DKN++LF ++P+   S  G          
Sbjct: 1469 --------------------------LVSDLDKNILLFSFEPQDPSSRSG---------Q 1493

Query: 502  LGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVM 561
            + Q +N         ++ +D    +   L  + +LDG L    PL EK Y     +Q+ +
Sbjct: 1494 INQEIN--------GNNKNDNRLPKKEQLVIFGTLDGGLNVLRPLDEKIYLLFYHIQSKL 1545

Query: 562  VTHTSHTGGLNPRAFRTYKG-KGYYAGNPS------RGIIDGSLVWKFLQLSLGERLEIC 614
              +   T GLNP+ +R++K     +  +PS      + I+DG L+ KFL LS  E+  I 
Sbjct: 1546 Y-YLPQTAGLNPKQYRSFKSFSQNFHFSPSTFHQLPKFILDGDLISKFLSLSQSEKRLIS 1604

Query: 615  KKIGSKHNDILDELYDI 631
              I S  ++I++ L D+
Sbjct: 1605 NSINSTSDEIIESLKDV 1621


>gi|170576536|ref|XP_001893668.1| CPSF A subunit region family protein [Brugia malayi]
 gi|158600196|gb|EDP37499.1| CPSF A subunit region family protein [Brugia malayi]
          Length = 1323

 Score =  226 bits (575), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 122/338 (36%), Positives = 193/338 (57%), Gaps = 40/338 (11%)

Query: 303  IEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIA 362
            ++VVPEPGQP +K++IK +Y KEQKGPVT++C   G+L+T +GQK++IW  KDN+L GI+
Sbjct: 1024 LQVVPEPGQPTSKHRIKTLYDKEQKGPVTSLCSCNGYLLTGMGQKVFIWLFKDNNLQGIS 1083

Query: 363  FIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGN 422
            F+D   YI  ++ V+NL L  D  RS+ALLRYQ EY+ LSL +RD               
Sbjct: 1084 FLDMHFYIHQLIGVRNLALACDMYRSLALLRYQEEYKALSLASRDM-------------- 1129

Query: 423  PSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQ 482
              R  +   +  +FL             I +K          MGF++SD+  N+ +F Y 
Sbjct: 1130 --RSDVQPPMAAQFL-------------IDNKQ---------MGFIMSDEAANIAIFNYL 1165

Query: 483  PEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSS--ISDAPGARSRFLTWYASLDGAL 540
            PE  ES GG +L  + + ++G  VN+F +++   SS  + +   +  R    +ASLDG+ 
Sbjct: 1166 PETLESLGGEKLTLRAEINIGTVVNSFIRVKGHISSGFVENELFSLERQSVLFASLDGSF 1225

Query: 541  GFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVW 600
            G+  PL EK +RRL MLQ +M +      GLN +  R  + +       +R ++DG +  
Sbjct: 1226 GYLRPLTEKVFRRLHMLQQLMSSMVLQPAGLNAKGARAARPQRPNHYLNTRNLVDGDVAM 1285

Query: 601  KFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
            ++L LSL E+ ++ +K+G+    I+D+L +I  +++H+
Sbjct: 1286 QYLHLSLPEKNDLARKLGTSRYHIIDDLIEICRVTAHY 1323



 Score =  112 bits (280), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 70/235 (29%), Positives = 116/235 (49%), Gaps = 14/235 (5%)

Query: 14   ETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP---KGALKLRFKKLKVLFVSDR 70
            E ++ ELL V +G++  RPLL +     +  Y+ F +    +G L +RFK+L    V+ R
Sbjct: 794  EEVIMELLLVGMGMNQGRPLLFLLIDDTVSAYEMFTYNNGIQGHLAIRFKRLPYTTVT-R 852

Query: 71   SKRANEQPG------LPRGVRISQMRYFSNIAG--YQGVFLCGPHPAWLFLTSRGELRAH 122
            S R     G      +   VR   + +F    G    GVF+C  +P   FL S G  R H
Sbjct: 853  SCRFQGTDGRAAVESVRDAVRHKTVLHFFERIGNVLNGVFICSSYPCIFFLES-GVPRLH 911

Query: 123  PMTIDGPVSTLAPFHNVNCPRGFLYFNAKSE-LRISVLPTHLSYDAPWPVRKVPLKCTPH 181
            P+ +DGP+ +   F+N  CP GF+Y   +   +R++ LP+ +  DA +PV+++ +  T H
Sbjct: 912  PVNLDGPILSFTTFNNAVCPNGFIYLTERDRFMRVAKLPSDMILDASYPVKRINVGATVH 971

Query: 182  FLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSL 236
             + Y L + TY ++TS     T       +DK      +   F+ P + Q+ + +
Sbjct: 972  SVVYLLHSNTYAVLTSEKRKVTKMCVLINDDKTFEEHEKPDTFVYPEMDQYKLQV 1026


>gi|242798830|ref|XP_002483249.1| cleavage and polyadenylation specificity factor subunit A, putative
            [Talaromyces stipitatus ATCC 10500]
 gi|218716594|gb|EED16015.1| cleavage and polyadenylation specificity factor subunit A, putative
            [Talaromyces stipitatus ATCC 10500]
          Length = 1382

 Score =  225 bits (574), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 172/642 (26%), Positives = 293/642 (45%), Gaps = 78/642 (12%)

Query: 14   ETIVQELLTVSLG-LHGNRPLLLVRTQ-HELLIYQAFRH----PKGALKLRFKKLKVLFV 67
            ETI  ELL   LG +    P L++R+   +L+IY+  R      K  + L++ K    F+
Sbjct: 792  ETIA-ELLVADLGEISTASPYLIIRSATDDLIIYKPVRENSKDEKTGVTLKYIKESNHFL 850

Query: 68   SDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTID 127
                K   E        R+  +R  ++I GY  V + G  P+ +  TS+   R   +  D
Sbjct: 851  P---KVPIEAAATDTQQRMPGLRRLADIGGYAAVLMSGASPSLVVRTSKSLPRVFSIQSD 907

Query: 128  GPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHL 187
              +  ++ F +  C +G +Y + +  +R   L  +   D  WP+RK+PL     +LAY  
Sbjct: 908  S-IRGISGFDSAGCEKGLIYVDNEHVVRTCRLHDNTQLDFSWPIRKIPLNEEVDYLAYST 966

Query: 188  ETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQ 247
             + TY + T+  +     +K    D+       +   + P V+Q  + L +P +W+ I  
Sbjct: 967  VSGTYVVGTTHEQD----FKLPDNDELHPEWANEDISLRPKVAQGSIKLLNPKTWKVIDS 1022

Query: 248  TNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVP 307
              F  +  E +  ++N+++E     S  +  I +GT +   ED+  RG + +FD+I VVP
Sbjct: 1023 YTF--NAAERITAIENINLEISEKTSERKDMIVVGTTFAKGEDIAARGNVYVFDVINVVP 1080

Query: 308  EPGQPLTKNKIKMIYAKEQKGPVTAICHVA--GFLVTAVGQKIYIWQLKDN-DLTGIAFI 364
            +P +P T  K+K+I  +  +G +TA+  +   GFL+ A GQK  +  LKD+  L  +AFI
Sbjct: 1081 DPDEPGTNLKLKLIGEESVRGALTAVSGIGGQGFLIVAQGQKCMVRGLKDDGSLLPVAFI 1140

Query: 365  DTEVYIASMVSVK--NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGN 422
            D + Y++ +  +K   + L+GD  + +    Y  E   ++L  +D               
Sbjct: 1141 DVQCYVSVIKELKGTGMCLIGDALKGLWFTGYSEEPYKMTLFGKDL-------------- 1186

Query: 423  PSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQ 482
                                + LE+         D L +   +  +++D D N+ +  Y 
Sbjct: 1187 --------------------DELEVVTA------DFLPDGKKLYILVADSDCNLHVLQYD 1220

Query: 483  PEARESNGGHRLIKKTDFHLGQHVNTFFKI-RCKPSSISDAPGARSRFLTWYASL----- 536
            PE  +S+ G RL+ +  FH+G   +T   + R   SS      + S  +  Y  L     
Sbjct: 1221 PEDPKSSNGDRLLNRCKFHMGHFASTITLLPRTAVSSELAVMNSDSMDIDSYIPLHQALI 1280

Query: 537  ---DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGI 593
                G +     L E++YRRL  LQ+ +     H  GLNPRA+R  +  G       RG+
Sbjct: 1281 TTQSGLMALVTSLSEESYRRLSALQSQLSNTLEHPCGLNPRAYRAVESDGVVG----RGM 1336

Query: 594  IDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALS 635
            IDG L+ ++L LS   +LEI  ++G+   +I     D+EA+S
Sbjct: 1337 IDGKLLMRWLDLSRPRKLEIAGRVGADEWEI---RADLEAVS 1375


>gi|149066088|gb|EDM15961.1| cleavage and polyadenylation specific factor 1, 160kDa (predicted),
           isoform CRA_b [Rattus norvegicus]
          Length = 241

 Score =  222 bits (565), Expect = 6e-55,   Method: Compositional matrix adjust.
 Identities = 114/283 (40%), Positives = 167/283 (59%), Gaps = 47/283 (16%)

Query: 361 IAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYA 420
           +AFIDT++YI  M+SVKN IL  D  +SI+LLRYQ E +TLSLV+RD KP +  S  +  
Sbjct: 1   MAFIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMV 60

Query: 421 GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFM 480
            N                                        + +GF++SD+D+N++++M
Sbjct: 61  DN----------------------------------------AQLGFLVSDRDRNLMVYM 80

Query: 481 YQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARS-----RFLTWYAS 535
           Y PEA+ES GG RL+++ DFH+G HVNTF++  C+ ++  + P  +S     + +TW+A+
Sbjct: 81  YLPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGAA--EGPSKKSVMWENKHITWFAT 138

Query: 536 LDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIID 595
           LDG +G  LP+ EK YRRLLMLQN + T   H  GLNPRAFR          N  R ++D
Sbjct: 139 LDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLD 198

Query: 596 GSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
           G L+ ++L LS  ER E+ KKIG+  + ILD+L + + +++HF
Sbjct: 199 GELLNRYLYLSTMERSELAKKIGTTPDIILDDLLETDRVTAHF 241


>gi|296414526|ref|XP_002836950.1| hypothetical protein [Tuber melanosporum Mel28]
 gi|295632796|emb|CAZ81141.1| unnamed protein product [Tuber melanosporum]
          Length = 1468

 Score =  221 bits (564), Expect = 6e-55,   Method: Compositional matrix adjust.
 Identities = 153/574 (26%), Positives = 257/574 (44%), Gaps = 89/574 (15%)

Query: 94   NIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSE 153
            N+AGY  VFL G  P+++  T++   R H +   G V +L+ FH+    RGF+Y ++   
Sbjct: 934  NLAGYSAVFLPGADPSFVIKTAKSSPRIHKLAGTG-VRSLSSFHSAGADRGFVYVDSLGI 992

Query: 154  LRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDK 213
            +R++++P   ++D  W  +KV        LAY      Y I TS  +P    +    ED 
Sbjct: 993  VRVALMPAEFTFDGNWGYKKVTPGEHVQSLAYFPPMNVYVISTSKRQP----FDLAEEDG 1048

Query: 214  ELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLS 273
             +    +D   + P +    + L SP +W  + +  F  +E    L +K +S+E      
Sbjct: 1049 NIA---KDDTTLQPEIDSGTLKLLSPQTWTAVDEYKFAHNEI--ALVVKTISLEVSEHTK 1103

Query: 274  GLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAI 333
              +  +++GT     ED + RG I +F++IEVVPEP +P T  K+K++  +E KG V+AI
Sbjct: 1104 ERKQLVSVGTAIFRGEDHSARGGIYVFEVIEVVPEPNRPETNRKLKLVTREEVKGTVSAI 1163

Query: 334  CHVAGFLVTAVGQKIYIWQLK-DNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALL 392
            C V G+L+ A GQKI +  LK D  L  +AF+D  +Y++   ++  +IL GD+ +S+   
Sbjct: 1164 CGVNGYLLAAQGQKIMVRGLKEDQSLLPVAFLDMCLYVSVAKNLDGMILFGDFMKSVWFA 1223

Query: 393  RYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 452
             +  E   ++L  +D                                   ++LEI     
Sbjct: 1224 GFSEEPYKMTLFGKDT----------------------------------QKLEIISA-- 1247

Query: 453  SKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 512
                + L + + + F++ D + N+    Y PE  +S  G RLI++ DF  G  ++T   +
Sbjct: 1248 ----EFLPDGNQLYFVVVDAESNIHTLQYDPEHPKSLAGQRLIRRADFFSGHEISTLTML 1303

Query: 513  RCKPSSISDAPGAR---------------------SRFLTWYASLDGALGFFLPLPEKNY 551
               P S+S +  +                        +     +  G+L     +PE  Y
Sbjct: 1304 PFSPYSLSASSNSHLPADATDTSPLHHHHQNQQQQQEYFVLAGTQTGSLAMIRTIPETAY 1363

Query: 552  RRLLMLQNVMVTHTSHTGGLNPRAFRTY-----------------KGKGYYAGNPSRGII 594
            RRL ++Q  +V    H  GLNPR +R                      G   G+  RG++
Sbjct: 1364 RRLNIVQGQIVNGEEHVAGLNPREYRAVVNYSGGGGGGAGGGGWGGSGGGVGGDTMRGVL 1423

Query: 595  DGSLVWKFLQLSLGERLEICKKIGSKHNDILDEL 628
            DG LV +++ L+ G + E+  K G     I  +L
Sbjct: 1424 DGGLVSRWIGLAEGRKGEVSAKAGCGVQGIRGDL 1457


>gi|119484094|ref|XP_001261950.1| cleavage and polyadenylation specificity factor subunit A, putative
            [Neosartorya fischeri NRRL 181]
 gi|148886830|sp|A1DB13.1|CFT1_NEOFI RecName: Full=Protein cft1; AltName: Full=Cleavage factor two protein
            1
 gi|119410106|gb|EAW20053.1| cleavage and polyadenylation specificity factor subunit A, putative
            [Neosartorya fischeri NRRL 181]
          Length = 1400

 Score =  221 bits (562), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 180/649 (27%), Positives = 304/649 (46%), Gaps = 92/649 (14%)

Query: 16   IVQELLTVSLGLHGN-RPLLLVRTQ-HELLIYQAFRHP-KGA--LKLRFKK-----LKVL 65
            ++ E +   LG   N  P L++RT+  +L+IY+AF    KG     L F K     L  +
Sbjct: 807  VLSEAVIADLGESWNPSPHLILRTESDDLVIYKAFASSIKGESHTHLSFVKETNHTLPRV 866

Query: 66   FVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMT 125
              SD+  ++NE+    R +RI       NI+    VF+ GP  +++  T++     H   
Sbjct: 867  TTSDKEMQSNEELSRSRSLRI-----LPNISDLSAVFMPGPSASFILKTAKS--CPHVFR 919

Query: 126  IDGPVSTLAPFHNVNCP---RGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHF 182
            + G         ++  P   +GF+Y ++K  LRI   P+   +D  W +RK+ +      
Sbjct: 920  LRGEFVRGLSIFDLASPSLDKGFIYVDSKDVLRICRFPSETLFDYTWALRKIGIGEQVDH 979

Query: 183  LAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDS--RFIPPLVSQFHVSLFSPF 240
            LAY   ++TY + TS    S D+     +D EL  D R+    F+P L  Q  + + SP 
Sbjct: 980  LAYATSSETYVLGTSH---SADFKL--PDDDELHPDWRNEVISFLPEL-RQCSLKVVSPR 1033

Query: 241  SWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLF 300
            +W  I   ++ L   E+V+ +KN+ +E        R  I +GT + + ED+  RG I +F
Sbjct: 1034 TWTVI--DSYSLGPAEYVMAVKNMDLEVSENTHERRNMIVVGTAFAWGEDIPSRGCIYVF 1091

Query: 301  DIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA--GFLVTAVGQKIYIWQLK-DND 357
            ++I+VVP+P +P T  K+K+I  +  KG VTA+  +   GFL+ A GQK  +  LK D  
Sbjct: 1092 EVIKVVPDPEKPETDRKLKLIGKELVKGAVTALSQIGGQGFLIAAQGQKCMVRGLKEDGS 1151

Query: 358  LTGIAFIDTEVYIASMVSVK--NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNS 415
            L  +AF+D + Y+  +  +K   + ++GD  + +    Y  E   +SL  +D        
Sbjct: 1152 LLPVAFMDMQCYVNVVKELKGTGMCIMGDAVKGLWFAGYSEEPYKMSLFGKD-------- 1203

Query: 416  KGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKN 475
            +GY     +  + DG  ++                                 +++D D N
Sbjct: 1204 QGYLEVVAAEFLPDGDKLF--------------------------------ILVADSDCN 1231

Query: 476  VVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFF----------KIRCKPSSIS-DAPG 524
            + +  Y PE  +S+ G RL+ ++ FH+G    T            K    P S+  D+  
Sbjct: 1232 LHVLQYDPEDPKSSNGDRLLARSKFHMGHFATTMTLLPRTMVSSEKAMADPDSMEIDSQT 1291

Query: 525  ARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGY 584
               + L    S  G++G    +PE++YRRL  LQ+ +     H  GLNPRA+R  +    
Sbjct: 1292 ISQQVLI--TSQSGSVGIVTSVPEESYRRLSALQSQLTNSLEHPCGLNPRAYRAVESD-- 1347

Query: 585  YAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEA 633
              G   RG++DG+L++++L +    ++EI  ++G+   +I  +L  I A
Sbjct: 1348 --GTAGRGMLDGNLLYQWLDMGQHRKMEIAARVGAHEWEIKADLEAIGA 1394


>gi|212541400|ref|XP_002150855.1| cleavage and polyadenylation specificity factor subunit A, putative
            [Talaromyces marneffei ATCC 18224]
 gi|210068154|gb|EEA22246.1| cleavage and polyadenylation specificity factor subunit A, putative
            [Talaromyces marneffei ATCC 18224]
          Length = 1383

 Score =  219 bits (558), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 172/642 (26%), Positives = 291/642 (45%), Gaps = 78/642 (12%)

Query: 14   ETIVQELLTVSLG-LHGNRPLLLVRTQ-HELLIYQAFRHPKGALK----LRFKKLKVLFV 67
            ETI  ELL   LG L    P L++RT   +L+IY+ F     A K    L++ K    F+
Sbjct: 793  ETIA-ELLIADLGELPTVSPYLIIRTATDDLIIYKPFWENSNAEKSGGSLKYIKETNHFL 851

Query: 68   SDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTID 127
               S  A       R      +R  S++ GY  V + G  P  +  TS+     + +  D
Sbjct: 852  PKVSLEAASSASQQR---TPGLRRLSDLGGYAAVVMSGASPNLIVRTSKSLPHVYSIQSD 908

Query: 128  GPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHL 187
              +  ++ F+   C +G +Y + +  +R   L  +   D  WP+R++PL      LAY  
Sbjct: 909  F-IRGISGFNGAGCKKGLVYVDNERLVRTCQLYNNAQLDFSWPIRRIPLNEQVDHLAYST 967

Query: 188  ETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQ 247
             + TY + T+  +     +K   +D+       +   + P V+   + L +P +W+ I  
Sbjct: 968  ASGTYVVGTTHEQD----FKLPDDDELHPEWATEEISLLPKVAYGSIKLINPKTWKVIDS 1023

Query: 248  TNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVP 307
              F     E +  ++N+++E        +  I +GT Y   ED+  RG + +FD+I+VVP
Sbjct: 1024 YTF--SPAERITAVENINLEISEKTGKRKDMIVVGTTYAKGEDIAARGNVYVFDVIDVVP 1081

Query: 308  EPGQPLTKNKIKMIYAKEQKGPVTAICHVA--GFLVTAVGQKIYIWQLKDN-DLTGIAFI 364
            +P +P T  K+K+I  +  +G VTA+  +   GF++ A GQK  +  LKD+  L  +AFI
Sbjct: 1082 DPDEPGTNLKLKLIGEESIRGAVTAVSGIGGQGFMIVAQGQKCMVRGLKDDGSLLPVAFI 1141

Query: 365  DTEVYIASMVSVK--NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGN 422
            D + Y++ +  +K   + L+GD  + +    Y  E   ++L  +D               
Sbjct: 1142 DVQCYVSVIKELKGTGMCLIGDAFKGLWFTGYSEEPYKMTLFGKDL-------------- 1187

Query: 423  PSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQ 482
                                + LE+         D L +   +  +++D D N+ +  Y 
Sbjct: 1188 --------------------DELEVVTA------DFLPDGKKLYILVADGDCNLYVLQYD 1221

Query: 483  PEARESNGGHRLIKKTDFHLGQHVNTFFKI-RCKPSSISDAPGARSRFLTWYASL----- 536
            PE  +S+ G RL+ +  FH+G   +T   + R   SS      + S  +  Y  L     
Sbjct: 1222 PEDPKSSNGDRLLNRCKFHMGHFASTLTLLPRTAVSSELAVMSSDSMDIDSYTPLYQALI 1281

Query: 537  ---DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGI 593
                G++     L E++YRRL  LQ+ +     H  GLNPRA+R+ +  G       RG+
Sbjct: 1282 TTQSGSMALITSLSEESYRRLTALQSQLSNTLEHPCGLNPRAYRSVESDGVVG----RGM 1337

Query: 594  IDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALS 635
            IDG L+ ++L LS   +LEI  ++G+   +I     D+EA+S
Sbjct: 1338 IDGKLLMRWLDLSRSRKLEIAGRVGADEWEI---RADLEAVS 1376


>gi|225679191|gb|EEH17475.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb03]
          Length = 1377

 Score =  219 bits (558), Expect = 4e-54,   Method: Compositional matrix adjust.
 Identities = 181/647 (27%), Positives = 294/647 (45%), Gaps = 101/647 (15%)

Query: 17   VQELLTVSLGLHGNR-PLLLVRTQ-HELLIYQAFRHPKGALK----LRFKKL------KV 64
            + E+L   LG   +R P L++R+  +EL++Y+ +   +   K    LRF K+      K 
Sbjct: 793  LTEILVADLGDSVSRTPYLILRSNSNELILYEPYHIVQSTEKRLSDLRFLKIANHHFPKF 852

Query: 65   LFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPM 124
            L  S+    ++    L R      +R   ++ GY+ VF+ G  P   F+        H M
Sbjct: 853  LPESNLGNLSDSDRQLAR-----PLRALGDVCGYRTVFMPGNSPC--FIIKSATSIPHVM 905

Query: 125  TIDG-PVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFL 183
             + G  V +L+ F+   C +GF+Y +  + +R+   P +  +D  W  RK+ L      +
Sbjct: 906  NLRGKTVHSLSSFNIPACEKGFVYVDTDNVVRMCRFPRNTHFDGSWAARKIGLGEQVDSV 965

Query: 184  AYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWE 243
             Y   ++TY + TS      D+             P D    P   ++  V L +P +W 
Sbjct: 966  EYSSSSETYVLGTSQ---KVDFKL-----------PEDDEIHPEWRNEESVKLLNPRTWS 1011

Query: 244  EIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDII 303
             I   ++ L   E V+C+K +++E        +  IA+GT     ED+  RG I +F++I
Sbjct: 1012 II--DSYQLRTAERVMCVKCLNLEASEITHERKEMIAVGTALTRGEDIAARGCIYVFEVI 1069

Query: 304  EVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA--GFLVTAVGQKIYIWQLK-DNDLTG 360
            +VVPE  +P T  K+K+I  +E KG +T++  +   GFL+ A GQK  +  LK D  L  
Sbjct: 1070 KVVPEVDRPETNRKLKLIAKEEVKGAITSLSGIGGQGFLIAAQGQKCIVRGLKEDGSLLP 1129

Query: 361  IAFIDTEVYIASMVSVK--NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGY 418
            +AF+D + Y++ +  +K   + ++GD  + +    Y  E   LSL ++D           
Sbjct: 1130 VAFMDMQCYVSVLKELKGTGMCIMGDALKGLWFAGYSEEPYKLSLFSKD----------- 1178

Query: 419  YAGNPSRGIIDGSL-VWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVV 477
                      DGSL V     L  G+RL I                    M++D D N+ 
Sbjct: 1179 ----------DGSLQVMAADFLPHGKRLFI--------------------MVADDDCNIH 1208

Query: 478  LFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFL------- 530
            +  Y PE   S  G RL+ ++ FH GQ  +T   +  + S +S  P A +  +       
Sbjct: 1209 VLQYDPEDPGSAKGDRLLHRSTFHTGQFAST-LTLLPRTSVLSQGPEAEANAMDLDSSGP 1267

Query: 531  ---TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAG 587
                   S  G++    P+ E  YRRL  LQ+ M+    H  GLNPRAFR  +  G    
Sbjct: 1268 LHQVLVTSETGSIALITPVSEMAYRRLSALQSQMINTLEHPCGLNPRAFRAVESDGIGG- 1326

Query: 588  NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEAL 634
               RG++DG LV K+L L    + EI  ++G+   D+ +   D+EA+
Sbjct: 1327 ---RGMVDGDLVQKWLDLGTQRKAEIASRVGA---DVWEIRADLEAI 1367


>gi|255948500|ref|XP_002565017.1| Pc22g10080 [Penicillium chrysogenum Wisconsin 54-1255]
 gi|211592034|emb|CAP98296.1| Pc22g10080 [Penicillium chrysogenum Wisconsin 54-1255]
          Length = 1392

 Score =  219 bits (557), Expect = 4e-54,   Method: Compositional matrix adjust.
 Identities = 177/650 (27%), Positives = 308/650 (47%), Gaps = 81/650 (12%)

Query: 5    RSHSPSAMDETIVQELLTVSLGLHGNRPLLLVRTQHE-LLIYQAFRHPK----GALKLR- 58
            RS++   M E +V +L   S    G  P L+VRT+++ L+ Y+    P     G+ +L+ 
Sbjct: 793  RSNTRETMTEFVVADLGDSS----GLSPYLIVRTENDDLVFYKPSLIPANDGHGSSRLQL 848

Query: 59   FKKLKVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGE 118
            F+    +     S  A+ Q  + +  R+  +R   NI+G+  +F+ G   +++F T++  
Sbjct: 849  FRDSNHVLPKSPSGEASSQ--IQKQQRLRPLRILPNISGFSTIFMPGASSSFVFRTAKSS 906

Query: 119  LRAHPMTIDGPVST-LAPFHNVNCPR--GFLYFNAKSELRISVLPTHLSYDAPWPVRKVP 175
               H + + G  +  L+ F +V+  R  GF+Y ++++ +R   LP+   +D PW +RKVP
Sbjct: 907  --PHIIRLRGGFTRWLSSFDSVDTGRDNGFIYVDSQNCVRACQLPSQTQFDYPWTLRKVP 964

Query: 176  LKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVS 235
            ++    FLAY   ++TY + TS      D+    G+D        +  F P  + +  + 
Sbjct: 965  IEEQVDFLAYSTSSETYVLGTSR---EGDFKLPEGDDLHPEWRNEELSFCPK-IPESSIK 1020

Query: 236  LFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRG 295
            + SP +W  I   ++PL   E V  +KNV++E        R  I +GT     ED+  RG
Sbjct: 1021 VVSPKTWTII--DSYPLDPDEQVTAVKNVNIEVSENTHERRDLIVVGTAIVKGEDMPARG 1078

Query: 296  RILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA--GFLVTAVGQKIYIWQL 353
             I +FD+I+V P+P +P T +K+K+I  +  KG VTA+  +   GF++ A GQK  +  L
Sbjct: 1079 TIYVFDVIKVAPDPEKPETGHKLKLIGKESVKGAVTALSGIGGQGFVIVAQGQKCMVRGL 1138

Query: 354  K-DNDLTGIAFIDTEVYIASMVSVK--NLILVGDYARSIALLRYQPEYRTLSLVARDYKP 410
            K D  L  +AF+D + Y+     +K   L+++GD  + +    Y  E   ++L  +D   
Sbjct: 1139 KEDGSLLPVAFMDMQCYVTVAKELKGTGLVILGDAVKGLWFAGYSEEPYRMTLFGKD--- 1195

Query: 411  TQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMIS 470
                                            E LE+         D L + + +  +++
Sbjct: 1196 -------------------------------PEYLEVVAA------DFLPDGNKLYMLVA 1218

Query: 471  DKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSIS---------D 521
            D D N+ +  Y PE  +S+ G RL+ ++ F+ G   ++   +     S           D
Sbjct: 1219 DSDCNLHVLQYDPEDPKSSNGDRLLSRSKFYTGNFASSVTLLPRTAVSSERTESSEEGMD 1278

Query: 522  APGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKG 581
                 +R     AS +G+L     + E++YRRL  LQ+ ++    H  GLNPRAFR  + 
Sbjct: 1279 LDETFARHQVLIASQNGSLALVTSVAEESYRRLSALQSQLINTVDHPAGLNPRAFRAIES 1338

Query: 582  KGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI 631
             G  AG   RG++DG+L+  +L +    + EI  ++G+   +I  +L  I
Sbjct: 1339 DG-AAG---RGMVDGNLLRLWLNMGKQRQTEIAGRVGATEWEIKADLETI 1384


>gi|452001482|gb|EMD93941.1| hypothetical protein COCHEDRAFT_1129958 [Cochliobolus heterostrophus
            C5]
          Length = 1385

 Score =  219 bits (557), Expect = 5e-54,   Method: Compositional matrix adjust.
 Identities = 179/623 (28%), Positives = 280/623 (44%), Gaps = 63/623 (10%)

Query: 10   SAMDETIVQELLTVSLGLHGNR-PLLLVRTQHE-LLIYQAFRHP-KGALKLRFKKLKVLF 66
            SA+  TI  E+L   LG    + P L+VRT  + L+IY+AF  P + A  L  K L+ + 
Sbjct: 788  SAIKATIT-EILAADLGDATTKSPHLIVRTSSDNLVIYKAFHSPSRSAADLWTKNLRWVK 846

Query: 67   VSDRS-KRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMT 125
            +S +   R  E  G       S +   S+I GY  VF  G  PA++F  S    R   ++
Sbjct: 847  LSQQHIPRYTEDGGAEDSGFESTLLTLSDIGGYSTVFQRGTTPAFIFKESSSAPRVIGLS 906

Query: 126  IDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYD-APWPVRKVPLKCTPHFLA 184
               PV +L  FH  +C RGF Y ++   LRIS LP    Y    W  R++P+    H LA
Sbjct: 907  -GKPVKSLTSFHTSSCQRGFAYLDSTDTLRISQLPPQTHYGHLGWATRRMPMDAEIHALA 965

Query: 185  YHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEE 244
            YH    +   +  T +P  + Y+ +  +      P++     P + +  + L    +W  
Sbjct: 966  YH---SSGLYIVGTGQP--EEYQLDPSETYHYELPKEDMSFKPTIERGIIKLLDEKTWTI 1020

Query: 245  IPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIE 304
            I      L   E VL +K +++E        +  +A+GT   + ED+  +G I +F++I 
Sbjct: 1021 I--DTHVLDPQEVVLSIKTLNLEVSENTHQRKDLVAVGTAILHGEDLATKGCIRIFEVIT 1078

Query: 305  VVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA--GFLVTAVGQKIYIWQLK-DNDLTGI 361
            VVPEP +P T  ++K+I   E KG V+AI  +   GF++ A GQK  +  LK D  L  +
Sbjct: 1079 VVPEPDRPETNKRLKLIVKDEVKGAVSAISELGTQGFMIMAQGQKCMVRGLKEDGTLLPV 1138

Query: 362  AFIDTEVYIASMVSV--KNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYY 419
            AF+D + Y++ + ++    ++ + D  R +    Y  E   +SL AR     +       
Sbjct: 1139 AFMDMQCYVSDLKNLPGTGMLAMSDAYRGVWFTGYTEEPYRMSLFARSKHSLE------- 1191

Query: 420  AGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLF 479
                       ++   F+     E+L +                    +++D D N+ + 
Sbjct: 1192 -----------AIAIDFIPFE--EQLHL--------------------LVADADMNLQVL 1218

Query: 480  MYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI--RCKPSSISDAPGARSRFLTWYASLD 537
             + P+  +S  G RL+ K+ FH G    T   +  R K  S SD    +        S  
Sbjct: 1219 QFDPDNPKSEAGSRLLHKSTFHTGHFPATLHVVHSRLKMPSASDFAATQPLHQILCTSQS 1278

Query: 538  GALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYK--GKGYYAGNPSRGIID 595
            G L    PL E  YRRL  L   +      T GLNPRAFR       G+ AG  +RG++D
Sbjct: 1279 GTLALVTPLSEDTYRRLSNLSAYLSNTLDATAGLNPRAFRASDTPDGGWDAGTGARGMLD 1338

Query: 596  GSLVWKFLQLSLGERLEICKKIG 618
            G+L+ ++ +L    R E   K G
Sbjct: 1339 GNLLMRWGELGERGRREGLAKYG 1361


>gi|146324727|ref|XP_747211.2| cleavage and polyadenylation specificity factor subunit A, putative
            [Aspergillus fumigatus Af293]
 gi|148886828|sp|Q4WCL1.2|CFT1_ASPFU RecName: Full=Protein cft1; AltName: Full=Cleavage factor two protein
            1
 gi|129556124|gb|EAL85173.2| cleavage and polyadenylation specificity factor subunit A, putative
            [Aspergillus fumigatus Af293]
          Length = 1401

 Score =  218 bits (555), Expect = 8e-54,   Method: Compositional matrix adjust.
 Identities = 181/651 (27%), Positives = 309/651 (47%), Gaps = 94/651 (14%)

Query: 16   IVQELLTVSLGLHGN-RPLLLVRTQ-HELLIYQAF-RHPKGA--LKLRFKK-----LKVL 65
            ++ E +   LG   N  P L++RT+  +L+IY+AF  + KG    +L F K     L  +
Sbjct: 806  VLSEAVIADLGESWNPSPHLILRTESDDLVIYKAFASYIKGESHTRLSFVKESNHTLPRV 865

Query: 66   FVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMT 125
              S++  ++NE+   PR +RI       NI+ +  VF+ G   +++  T++     H   
Sbjct: 866  TTSEKEMQSNEKLSRPRSLRI-----LPNISNFSAVFMPGRPASFILKTAKS--CPHVFR 918

Query: 126  IDGP-VSTLAPFH--NVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHF 182
            + G  V +L+ F   + +   GF+Y ++K  LRI   P+   +D  W +RK+ +      
Sbjct: 919  LRGEFVRSLSIFDLASPSLDTGFIYVDSKDVLRICRFPSETLFDYTWALRKISIGEQVDH 978

Query: 183  LAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDS----RFIPPLVSQFHVSLFS 238
            LAY   ++TY + TS    S D+     +D EL  D R+      F+P L  Q  + + S
Sbjct: 979  LAYATSSETYVLGTSH---SADFKL--PDDDELHPDWRNEGLVISFLPEL-RQCSLKVVS 1032

Query: 239  PFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRIL 298
            P +W  I   ++ L   E+V+ +KN+ +E        R  I +GT +   ED+  RG I 
Sbjct: 1033 PRTWTVI--DSYSLGPDEYVMAVKNMDLEVSENTHERRNMIVVGTAFARGEDIPSRGCIY 1090

Query: 299  LFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA--GFLVTAVGQKIYIWQLK-D 355
            +F++I+VVP+P +P T  K+K+I  +  KG VTA+  +   GFL+ A GQK  +  LK D
Sbjct: 1091 VFEVIKVVPDPEKPETDRKLKLIGKELVKGAVTALSQIGGQGFLIAAQGQKCMVRGLKED 1150

Query: 356  NDLTGIAFIDTEVYIASMVSVK--NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQP 413
              L  +AF+D + Y+  +  +K   + ++GD  + +    Y  E   +SL  +D      
Sbjct: 1151 GSLLPVAFMDMQCYVNVLKELKGTGMCIMGDAVKGLWFAGYSEEPYKMSLFGKD------ 1204

Query: 414  NSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKD 473
              +GY     +  + DG  ++                                 +++D D
Sbjct: 1205 --QGYLEVVAAEFLPDGDKLF--------------------------------ILVADSD 1230

Query: 474  KNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFF----------KIRCKPSSIS-DA 522
             N+ +  Y PE  +S+ G RL+ ++ FH+G    T            K    P S+  D+
Sbjct: 1231 CNLHVLQYDPEDPKSSNGDRLLARSKFHMGHFATTMTLLPRTMVSSEKAMANPDSMEIDS 1290

Query: 523  PGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGK 582
                 + L    S  G++G    +PE++YRRL  LQ+ +     H  GLNPRA+R  +  
Sbjct: 1291 QTISQQVLI--TSQSGSVGIVTSVPEESYRRLSALQSQLANSLEHPCGLNPRAYRAVESD 1348

Query: 583  GYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEA 633
                G   RG++DG+L++++L +    ++EI  ++G+   +I  +L  I A
Sbjct: 1349 ----GTAGRGMLDGNLLYQWLDMGQHRKMEIAARVGAHEWEIKADLEAIGA 1395


>gi|295665178|ref|XP_002793140.1| cleavage and polyadenylation specificity factor subunit A
            [Paracoccidioides sp. 'lutzii' Pb01]
 gi|226278054|gb|EEH33620.1| cleavage and polyadenylation specificity factor subunit A
            [Paracoccidioides sp. 'lutzii' Pb01]
          Length = 1408

 Score =  218 bits (555), Expect = 9e-54,   Method: Compositional matrix adjust.
 Identities = 180/646 (27%), Positives = 298/646 (46%), Gaps = 89/646 (13%)

Query: 17   VQELLTVSLGLHGNR-PLLLVRTQ-HELLIYQAFRHPKGALKLRFKKLKVLFVSD----- 69
            + E+L   LG   +R P L +R+  +EL++Y+ + H   + + R   L+ + +++     
Sbjct: 816  LTEILVADLGDSVSRTPYLTLRSNSNELILYEPY-HTVQSTEKRLSDLRFVKIANHHFPK 874

Query: 70   ---RSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTI 126
                S   N   G  + VR   +R   ++ GY+ VF+ G  P   F+        H M +
Sbjct: 875  FLPESNLGNLSDGDRQLVR--PLRALGDVCGYRTVFMPGNSPC--FIIKSATSIPHVMNL 930

Query: 127  DG-PVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAY 185
             G  V +L+ F+   C +GF+Y +  + +R+   P +  +D  W  RK+ L      + Y
Sbjct: 931  RGKTVHSLSSFNIPACEKGFVYVDTDNVVRMCRFPRNTHFDGSWAARKIGLGEQVDSVEY 990

Query: 186  HLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRF-IPPLVSQFHVSLFSPFSWEE 244
               ++TY + TS      D+     ED E+  + R+      P + +  V L +P +W  
Sbjct: 991  SSSSETYVLGTSQ---KVDFKL--PEDDEIHPEWRNEVISFFPQIDKGSVKLLNPRTWSI 1045

Query: 245  IPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIE 304
            I   ++ L   E V+C+K +++E        +  IA+GT     ED+  RG I +F++I+
Sbjct: 1046 I--DSYQLRTSERVMCVKCLNLEASEITHERKEMIAVGTALTRGEDIAARGCIYVFEVIK 1103

Query: 305  VVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA--GFLVTAVGQKIYIWQLK-DNDLTGI 361
            VVPE  +P T  K+K+I  +E KG +T++  +   GFL+ A GQK  +  LK D  L  +
Sbjct: 1104 VVPEVDRPETNRKLKLIAKEEVKGAITSLSGIGGQGFLIAAQGQKCIVRGLKEDGSLLPV 1163

Query: 362  AFIDTEVYIASMVSVK--NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYY 419
            AF+D + Y++ +  +K   + ++GD  + +    Y  E   LSL ++D            
Sbjct: 1164 AFMDMQCYVSVLKELKGTGMCIMGDALKGLWFAGYSEEPYKLSLFSKD------------ 1211

Query: 420  AGNPSRGIIDGSL-VWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVL 478
                     DGSL V     L  G+RL I                    M++D D N+ +
Sbjct: 1212 ---------DGSLQVMAADFLPDGKRLYI--------------------MVADDDCNIHV 1242

Query: 479  FMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFL-------- 530
              Y PE   S  G RL+ ++ FH GQ  +T   +  + S +S  P   +  +        
Sbjct: 1243 LQYDPEDPGSAKGDRLLHRSTFHTGQFAST-LTLLPRTSVLSQGPETEANAMDLDLSGPL 1301

Query: 531  --TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGN 588
                  S  G++    P+ E  YRRL  LQ+ M+    H  GLNPRAFR  +  G     
Sbjct: 1302 HQVLVTSETGSIALITPVSEMAYRRLSALQSQMINTLEHPCGLNPRAFRAVESDGIGG-- 1359

Query: 589  PSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEAL 634
              RG++DG LV K+L L    + EI  ++G+   D+ +   D+EA+
Sbjct: 1360 --RGMVDGDLVQKWLDLGTQRKAEIASRVGA---DVWEIRADLEAI 1400


>gi|258575565|ref|XP_002541964.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
 gi|237902230|gb|EEP76631.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
          Length = 1376

 Score =  218 bits (554), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 176/638 (27%), Positives = 295/638 (46%), Gaps = 94/638 (14%)

Query: 17   VQELLTVSLGLHGNR-PLLLVRTQHE-LLIYQAFRHPKGALK---LRFKKLKVLFVSDRS 71
            + E+L   LG   +R P +++RT H+ L+IYQ + + K +L+   LRF K+   F+    
Sbjct: 803  LSEVLMADLGDSISRQPYMILRTTHDDLVIYQPY-YTKPSLEQPELRFLKITDYFLPKVD 861

Query: 72   KRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDG-PV 130
              +N           +++R   ++ GY+ +F+ G +P ++  +S      H + + G PV
Sbjct: 862  PASNMDN--TNRTSFARLRAIPDLCGYKTMFMPGSNPCFIMKSSTSS--PHVLRLKGEPV 917

Query: 131  STLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETK 190
            S+L+ FH   C +GF Y +AK+ +R+  LP +  +D  W  RK+ +      + Y   ++
Sbjct: 918  SSLSSFHMPACEKGFAYVDAKNMVRMCRLPGNTRFDNAWAARKIHIGEQVDCVEYFARSE 977

Query: 191  TYCIVTSTAEPSTDYYKFNGEDKELVTDPRDS--RFIPPLVSQFHVSLFSPFSWEEIPQT 248
            TY + TS  E     +K   ED E+ T+ R     F+P L  +  V L SP +W  I   
Sbjct: 978  TYVLGTSYHED----FKLP-EDDEVHTEWRSEVISFMPQL-DRGRVKLLSPRTWSII--D 1029

Query: 249  NFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPE 308
             + L   E +LCLK ++ME        +  + +GT     ED+T RG I +F+II+V P+
Sbjct: 1030 CYDLGATERILCLKTINMEVSEITHERQDMVVVGTAIVRGEDITPRGSIYVFEIIDVAPD 1089

Query: 309  PGQPLTKNKIKMIYAKEQKGPVTAICHVA--GFLVTAVGQKIYIWQLK-DNDLTGIAFID 365
            P +P T  K K+   ++ KG VTAI  +   GFL+ A GQK  +  LK D  L  +AF+D
Sbjct: 1090 PDRPETNQKFKLFAKEDVKGAVTAISGIGGQGFLIAAQGQKCLVRGLKEDGSLLPVAFMD 1149

Query: 366  TEVYIASMVSVK--NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNP 423
             + Y++ +  ++   L ++GD  + +    Y                             
Sbjct: 1150 MQCYVSVLKELQGTGLCIMGDALKGLWFTGYS---------------------------- 1181

Query: 424  SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQP 483
                         +QLS    +E C+          + +    F    +   VV   + P
Sbjct: 1182 -------------VQLSSAVDVETCE----------EPYKLTLFGKDSEYLQVVAADFLP 1218

Query: 484  EARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGAR----------SRFLTWY 533
            +   S+ G RL+ ++ FH G  ++T   I   P   S   GA           + +    
Sbjct: 1219 DDPSSSKGDRLLHRSSFHTGHFISTLTLI---PQYTSSGTGASEDNMDVDYMPAGYQVVV 1275

Query: 534  ASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGI 593
             S  G++G   PL E+ YRRL  LQ+ +V    H  GLNP+A+R  +  G+      RG+
Sbjct: 1276 TSQSGSVGVITPLTEETYRRLSALQSQLVMSMEHPCGLNPKAYRAVESDGFSG----RGL 1331

Query: 594  IDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI 631
            +DG+L+ ++L + +  + EI  ++G+    I  +L  I
Sbjct: 1332 VDGNLLLRWLDMGVQRKAEIAGRVGADLQSIRADLERI 1369


>gi|159123784|gb|EDP48903.1| cleavage and polyadenylation specificity factor subunit A, putative
            [Aspergillus fumigatus A1163]
          Length = 1401

 Score =  218 bits (554), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 181/651 (27%), Positives = 309/651 (47%), Gaps = 94/651 (14%)

Query: 16   IVQELLTVSLGLHGN-RPLLLVRTQ-HELLIYQAF-RHPKGA--LKLRFKK-----LKVL 65
            ++ E +   LG   N  P L++RT+  +L+IY+AF  + KG    +L F K     L  +
Sbjct: 806  VLSEAVIADLGESWNPSPHLILRTESDDLVIYKAFASYIKGESHTRLSFVKESNHTLPRV 865

Query: 66   FVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMT 125
              S++  ++NE+   PR +RI       NI+ +  VF+ G   +++  T++     H   
Sbjct: 866  TTSEKEMQSNEKLSRPRSLRI-----LPNISNFSAVFMPGRPASFILKTAKS--CPHVFR 918

Query: 126  IDGP-VSTLAPFH--NVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHF 182
            + G  V +L+ F   + +   GF+Y ++K  LRI   P+   +D  W +RK+ +      
Sbjct: 919  LRGEFVRSLSIFDLASPSLDTGFIYVDSKDVLRICRFPSDTLFDYTWALRKISIGEQVDH 978

Query: 183  LAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDS----RFIPPLVSQFHVSLFS 238
            LAY   ++TY + TS    S D+     +D EL  D R+      F+P L  Q  + + S
Sbjct: 979  LAYATSSETYVLGTSH---SADFKL--PDDDELHPDWRNEGLVISFLPEL-RQCSLKVVS 1032

Query: 239  PFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRIL 298
            P +W  I   ++ L   E+V+ +KN+ +E        R  I +GT +   ED+  RG I 
Sbjct: 1033 PRTWTVI--DSYSLGPDEYVMAVKNMDLEVSENTHERRNMIVVGTAFARGEDIPSRGCIY 1090

Query: 299  LFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA--GFLVTAVGQKIYIWQLK-D 355
            +F++I+VVP+P +P T  K+K+I  +  KG VTA+  +   GFL+ A GQK  +  LK D
Sbjct: 1091 VFEVIKVVPDPEKPETDRKLKLIGKELVKGAVTALSQIGGQGFLIAAQGQKCMVRGLKED 1150

Query: 356  NDLTGIAFIDTEVYIASMVSVK--NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQP 413
              L  +AF+D + Y+  +  +K   + ++GD  + +    Y  E   +SL  +D      
Sbjct: 1151 GSLLPVAFMDMQCYVNVLKELKGTGMCIMGDAVKGLWFAGYSEEPYKMSLFGKD------ 1204

Query: 414  NSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKD 473
              +GY     +  + DG  ++                                 +++D D
Sbjct: 1205 --QGYLEVVAAEFLPDGDKLF--------------------------------ILVADSD 1230

Query: 474  KNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFF----------KIRCKPSSIS-DA 522
             N+ +  Y PE  +S+ G RL+ ++ FH+G    T            K    P S+  D+
Sbjct: 1231 CNLHVLQYDPEDPKSSNGDRLLARSKFHMGHFATTMTLLPRTMVSSEKAMANPDSMEIDS 1290

Query: 523  PGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGK 582
                 + L    S  G++G    +PE++YRRL  LQ+ +     H  GLNPRA+R  +  
Sbjct: 1291 QTISQQVLI--TSQSGSVGIVTSVPEESYRRLSALQSQLANSLEHPCGLNPRAYRAVESD 1348

Query: 583  GYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEA 633
                G   RG++DG+L++++L +    ++EI  ++G+   +I  +L  I A
Sbjct: 1349 ----GTAGRGMLDGNLLYQWLDMGQHRKMEIAARVGAHEWEIKADLEAIGA 1395


>gi|239611898|gb|EEQ88885.1| protein CFT1 [Ajellomyces dermatitidis ER-3]
 gi|327352847|gb|EGE81704.1| CFT1 [Ajellomyces dermatitidis ATCC 18188]
          Length = 1402

 Score =  217 bits (553), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 181/644 (28%), Positives = 292/644 (45%), Gaps = 85/644 (13%)

Query: 17   VQELLTVSLGLHGNR-PLLLVRT-QHELLIYQAFRH----PKGALKLRFKKLKVLFVSDR 70
            + E+L   +G   +R P L++R+  ++L++Y+ +       K +  LRF K         
Sbjct: 810  LTEILVADIGDSVSRTPYLILRSSNNDLILYEPYHTTHSTEKKSSDLRFLKTINHHFPKF 869

Query: 71   SKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDG-P 129
               +N +     G     +R   ++ GY+ VF+ G  P ++  +S      H + + G  
Sbjct: 870  HAGSNVEDSSHIGALPKPLRVLGDVCGYRTVFMPGNSPCFVIKSSTS--IPHVLNLRGKT 927

Query: 130  VSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLET 189
            V +L+ F+   C RGF+Y +A + +R+   P +  +D  W  RK+ L      + Y   +
Sbjct: 928  VHSLSSFNIPACERGFVYVDADNVVRMCRFPRNTHFDGSWATRKIGLGEQVDIVEYSSSS 987

Query: 190  KTYCIVTSTAEPSTDYYKFN-GEDKELVTDPRDS--RFIPPLVSQFHVSLFSPFSWEEIP 246
            +TY I TS          FN  ED E+  + R+    F+P  + Q  V L SP +W  I 
Sbjct: 988  ETYVIGTSQK------VDFNLPEDDEIHPEWRNEVISFLPQ-IDQGSVKLLSPRTWSII- 1039

Query: 247  QTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVV 306
              +  L   E ++C+K + +E        R  IA+GT     ED+  RG I +F++IEVV
Sbjct: 1040 -DSHTLRTAERIMCVKCLDLEVSEITHERRDMIAVGTAVTRGEDIAARGCIYIFEVIEVV 1098

Query: 307  PEPGQPLTKNKIKMIYAKEQKGPVTAICHVA--GFLVTAVGQKIYIWQLK-DNDLTGIAF 363
            PE  +P T  K+K+I  +E KG VT++  +   GFL+ A GQK  +  LK D  L  +AF
Sbjct: 1099 PEVDRPETNRKLKLIAKEEVKGAVTSLSGIGGQGFLIAAQGQKCIVRGLKEDGSLLPVAF 1158

Query: 364  IDTEVYIASMVSVK--NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAG 421
            +D + Y+  +  +K   + ++GD  + I    Y  E   LSL ++D              
Sbjct: 1159 MDMQCYVNVLKELKGTGMCIMGDALKGIWFAGYSEEPYKLSLFSKD-------------- 1204

Query: 422  NPSRGIIDGSL-VWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFM 480
                   DG+L V     L  G+RL I                    +++D D N+ +  
Sbjct: 1205 -------DGTLQVMAADFLPDGKRLYI--------------------LVADDDCNIHVLQ 1237

Query: 481  YQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFL---------- 530
            Y PE   S+ G RL+ ++ FH G   +T   +       +  P A    +          
Sbjct: 1238 YDPEDPGSSKGDRLLHRSTFHTGHFASTMTLLPRTIIPSAQGPDANPDMMELDSSGPLYH 1297

Query: 531  TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPS 590
                S  G++    PL E  YRRL  LQ+ ++    H  GLNPRAFR  +  G       
Sbjct: 1298 VLVTSETGSIALITPLSETAYRRLSALQSQLINTLEHPCGLNPRAFRAIESDGIGG---- 1353

Query: 591  RGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEAL 634
            RG++DG L+ ++L L    + EI  ++G+   DI +   D+EA+
Sbjct: 1354 RGMVDGDLLHRWLDLGTQRKAEIAHRVGA---DIWEIRADLEAI 1394


>gi|261201748|ref|XP_002628088.1| protein CFT1 [Ajellomyces dermatitidis SLH14081]
 gi|239590185|gb|EEQ72766.1| protein CFT1 [Ajellomyces dermatitidis SLH14081]
          Length = 1403

 Score =  217 bits (552), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 181/644 (28%), Positives = 292/644 (45%), Gaps = 85/644 (13%)

Query: 17   VQELLTVSLGLHGNR-PLLLVRT-QHELLIYQAFRH----PKGALKLRFKKLKVLFVSDR 70
            + E+L   +G   +R P L++R+  ++L++Y+ +       K +  LRF K         
Sbjct: 811  LTEILVADIGDSVSRTPYLILRSSNNDLILYEPYHTTHSTEKKSSDLRFLKTINHHFPKF 870

Query: 71   SKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDG-P 129
               +N +     G     +R   ++ GY+ VF+ G  P ++  +S      H + + G  
Sbjct: 871  HAGSNVEDSSHIGALPKPLRVLGDVCGYRTVFMPGNSPCFVIKSSTS--IPHVLNLRGKT 928

Query: 130  VSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLET 189
            V +L+ F+   C RGF+Y +A + +R+   P +  +D  W  RK+ L      + Y   +
Sbjct: 929  VHSLSSFNIPACERGFVYVDADNVVRMCRFPRNTHFDGSWATRKIGLGEQVDIVEYSSSS 988

Query: 190  KTYCIVTSTAEPSTDYYKFN-GEDKELVTDPRDS--RFIPPLVSQFHVSLFSPFSWEEIP 246
            +TY I TS          FN  ED E+  + R+    F+P  + Q  V L SP +W  I 
Sbjct: 989  ETYVIGTSQK------VDFNLPEDDEIHPEWRNEVISFLPQ-IDQGSVKLLSPRTWSII- 1040

Query: 247  QTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVV 306
              +  L   E ++C+K + +E        R  IA+GT     ED+  RG I +F++IEVV
Sbjct: 1041 -DSHTLRTAERIMCVKCLDLEVSEITHERRDMIAVGTAVTRGEDIAARGCIYIFEVIEVV 1099

Query: 307  PEPGQPLTKNKIKMIYAKEQKGPVTAICHVA--GFLVTAVGQKIYIWQLK-DNDLTGIAF 363
            PE  +P T  K+K+I  +E KG VT++  +   GFL+ A GQK  +  LK D  L  +AF
Sbjct: 1100 PEVDRPETNRKLKLIAKEEVKGAVTSLSGIGGQGFLIAAQGQKCIVRGLKEDGSLLPVAF 1159

Query: 364  IDTEVYIASMVSVK--NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAG 421
            +D + Y+  +  +K   + ++GD  + I    Y  E   LSL ++D              
Sbjct: 1160 MDMQCYVNVLKELKGTGMCIMGDALKGIWFAGYSEEPYKLSLFSKD-------------- 1205

Query: 422  NPSRGIIDGSL-VWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFM 480
                   DG+L V     L  G+RL I                    +++D D N+ +  
Sbjct: 1206 -------DGTLQVMAADFLPDGKRLYI--------------------LVADDDCNIHVLQ 1238

Query: 481  YQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFL---------- 530
            Y PE   S+ G RL+ ++ FH G   +T   +       +  P A    +          
Sbjct: 1239 YDPEDPGSSKGDRLLHRSTFHTGHFASTMTLLPRTIIPSAQGPDANPDMMELDSSGPLYH 1298

Query: 531  TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPS 590
                S  G++    PL E  YRRL  LQ+ ++    H  GLNPRAFR  +  G       
Sbjct: 1299 VLVTSETGSIALITPLSETAYRRLSALQSQLINTLEHPCGLNPRAFRAIESDGIGG---- 1354

Query: 591  RGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEAL 634
            RG++DG L+ ++L L    + EI  ++G+   DI +   D+EA+
Sbjct: 1355 RGMVDGDLLHRWLDLGTQRKAEIAHRVGA---DIWEIRADLEAI 1395


>gi|296806499|ref|XP_002844059.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
 gi|238845361|gb|EEQ35023.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
          Length = 1348

 Score =  217 bits (552), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 177/640 (27%), Positives = 295/640 (46%), Gaps = 86/640 (13%)

Query: 8    SPSAMDETIVQELLTVSLGLHGNRPLLLVRTQHE-LLIYQAFR--HPKGALKLRFKKL-- 62
            S S  +  +   +L   L  +GN   L +RT+H+ L++Y+ +R     G  +LRF K   
Sbjct: 748  SESGTENIVGNNVLLFLLDGNGN---LSLRTKHDDLILYEPYRVTGENGESRLRFLKAVN 804

Query: 63   KVLFVSDRSKRANEQPG---LPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGEL 119
             V+  S   K AN   G    PR      +R  S+I GY+ VF+ G +P   F+      
Sbjct: 805  HVVMRSHSEKAANVVEGKHPFPR----KPLRALSDICGYKTVFMPGQNPC--FILKSAIT 858

Query: 120  RAHPMTIDG-PVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKC 178
            + H + + G  V +L+ FH   C RGF Y +  + +R+S LP++  +D+ W  RK+PL  
Sbjct: 859  QPHVLRLRGKAVQSLSGFHIAACERGFAYVDEDNIIRMSRLPSNTRFDSTWATRKIPLGE 918

Query: 179  TPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDS--RFIPPLVSQFHVSL 236
                + Y   +++Y I TS  E     +K   ED E  T+ ++    F+P L  +  V L
Sbjct: 919  QVDCIVYSSASESYVIGTSVKED----FKLP-EDDESHTEWQNEFITFLPQL-ERGTVKL 972

Query: 237  FSPFSWE--EIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCR 294
              P +W   +I  ++  L   E + C++ + +E        +  + +G+     ED+  +
Sbjct: 973  LDPKNWSIADIAPSSHELEPAERITCIEVIRLEISEITHERKDMVVVGSAIVKGEDIVPK 1032

Query: 295  GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA--GFLVTAVGQKIYIWQ 352
            G I +F+II+VVP+P       ++K+   +E KG VTA+  +   GFL+ A GQK  +  
Sbjct: 1033 GCIRVFEIIDVVPDPDHSEMNKRLKLFAREEVKGAVTALSGIGSQGFLIVAQGQKCMVRG 1092

Query: 353  LK-DNDLTGIAFIDTEVYIASMVSVK--NLILVGDYARSIALLRYQPEYRTLSLVARDYK 409
            LK D  L  +AF D + Y++ +  +K   + +VGD  + +    Y  E   L L  ++  
Sbjct: 1093 LKEDGSLLPVAFKDAQCYVSVLKELKGTGMCIVGDAIKGLWFTGYSEEPYKLDLFGKE-- 1150

Query: 410  PTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMI 469
                        N +  +I          L  G RL +                    ++
Sbjct: 1151 ------------NENIAVIAADF------LPDGNRLYV--------------------LV 1172

Query: 470  SDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI----------RCKPSSI 519
            +D D N+ +  Y PE   S+ G RL+ +  FH+G   +T   +            + +  
Sbjct: 1173 ADDDCNLHVLQYDPEDPSSSKGDRLLHRNVFHVGHFASTMTLLPQGSHTPHSPADRDAMD 1232

Query: 520  SDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTY 579
            +DAP   S++        G++G   PL E +YRRLL LQ+ +V    H  GLNPR +R  
Sbjct: 1233 TDAPLPPSKYQILMTFQTGSVGIITPLNEDSYRRLLALQSQLVNALEHPCGLNPRGYRAV 1292

Query: 580  KGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGS 619
            +  G       RG+IDG+L+ ++L +    + EI  ++G+
Sbjct: 1293 ESDGIGG---QRGMIDGNLLLRWLDMGAQRKAEIAGRVGA 1329


>gi|326471884|gb|EGD95893.1| protein kinase subdomain-containing protein [Trichophyton tonsurans
            CBS 112818]
          Length = 1398

 Score =  216 bits (550), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 172/650 (26%), Positives = 301/650 (46%), Gaps = 81/650 (12%)

Query: 4    FRSHSPSAMDETIVQELLTVSLG--LHGNRPLLLVRTQHE-LLIYQAFR--HPKGALKLR 58
            + S S   ++   + ELL   LG  +H   P +++RT+H+ L++Y+ +R     G   LR
Sbjct: 795  YESSSRRPVNRETLTELLIADLGDAIH-KSPYMILRTKHDDLVLYEPYRIAGESGHSGLR 853

Query: 59   F-KKLKVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRG 117
            F K +  + +  R+ +               +R   ++ GY+ VF+ G +P ++  ++  
Sbjct: 854  FLKAVNHVVMGPRTDQGVNHDINRSPSSCKLLRALPDVCGYKTVFMSGHNPCFILKSAIA 913

Query: 118  ELRAHPMTIDG-PVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPL 176
              R H + + G  V +L+ FH   C RGF Y +  + +R+S LP++  +D+ W  RK+ L
Sbjct: 914  --RPHVLRLRGKAVQSLSGFHIAACERGFAYVDEDNVIRMSRLPSNTRFDSGWATRKIAL 971

Query: 177  KCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDS--RFIPPLVSQFHV 234
                  + Y   ++ Y I TS  E     +K   ED E  T+ R+    F+P L  +  V
Sbjct: 972  GEQVDSIVYSSASECYVIGTSAKED----FKLP-EDDESHTEWRNEFITFLPQL-ERGTV 1025

Query: 235  SLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCR 294
             L  P +W  I   +  L   E + C++ + +E        +  + +G++    ED+  +
Sbjct: 1026 KLLEPKNWSTI--DSHELKPAERITCIEVIRLEISELTHERKDMVVVGSSIVKGEDIVPK 1083

Query: 295  GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA--GFLVTAVGQKIYIWQ 352
            G I +F++I+VVPEP QP    K+K+   +E KG VTA+  +   GFL+ A GQK  +  
Sbjct: 1084 GFIRVFEVIDVVPEPDQPEKSKKLKLFAKEEVKGAVTALSGIGGQGFLIVAQGQKCMVRG 1143

Query: 353  LK-DNDLTGIAFIDTEVYIASMVSVK--NLILVGDYARSIALLRYQPEYRTLSLVARDYK 409
            LK D  L  +AF DT+ Y+  +  +K   + ++GD  + +  + Y  E   L L  ++  
Sbjct: 1144 LKEDGSLLPVAFKDTQCYVNVLKELKGTGMCIIGDAFKGLWFIGYSEEPYKLDLFGKE-- 1201

Query: 410  PTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMI 469
                        N +  ++D                           D L + + +  ++
Sbjct: 1202 ------------NENLAVVDA--------------------------DFLPDGNKLYILV 1223

Query: 470  SDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI---RCKPSSISDA---- 522
            +D D N+ +  Y PE   S+ G RL+ ++ FH G   +T   +      PS+  D     
Sbjct: 1224 ADDDCNLHVLQYDPEDPSSSKGDRLLHRSVFHTGHFASTMTLLPHGAYTPSAPVDEDAMD 1283

Query: 523  ----PGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRT 578
                P ++ + L  + +  G++    PL E +YRRLL LQ+ +V    H   LNPR +R 
Sbjct: 1284 TDSLPPSKYQILMTFQT--GSIAVITPLSEDSYRRLLALQSQLVNALEHPCSLNPRGYRA 1341

Query: 579  YKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEL 628
             +  G       RG+IDG+L+ ++L +    + EI  ++G+    I  +L
Sbjct: 1342 VESDGMGG---QRGMIDGNLLLRWLDMGAQRKAEIAGRVGADVGAIRTDL 1388


>gi|317036382|ref|XP_001398211.2| protein cft1 [Aspergillus niger CBS 513.88]
          Length = 1393

 Score =  216 bits (550), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 175/637 (27%), Positives = 298/637 (46%), Gaps = 101/637 (15%)

Query: 32   PLLLVRTQ-HELLIYQAFRHPKGALK----LRFKKLKVLFVSDRSKRANEQ-PGLPRGVR 85
            P L++R++  +L+IY+ F    G ++    L+F           SK  N   P +P GV 
Sbjct: 818  PYLILRSETDDLIIYKPFVVSTGPVEGIHSLKF-----------SKETNSVLPRIPPGVS 866

Query: 86   ISQ----------MRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVS-TLA 134
             +Q          +R   +I+G   VF+ G    ++  TS      H + + G  S +++
Sbjct: 867  STQPSGSDYRARPLRILPDISGLSAVFMPGASAGFIIRTSASA--PHFLRLRGENSRSVS 924

Query: 135  PFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCI 194
                  C +GF+Y +++S +R   LP    +D  W +++V L      LAY   +  Y +
Sbjct: 925  SLDTPECSKGFIYLDSQSTVRFCKLPPMTRFDYQWTLKRVHLGEQVDHLAYSTSSGMYVL 984

Query: 195  VTSTAEPSTDYYKFNGEDKELVTDPRDS--RFIPPLVSQFHVSLFSPFSWEEIPQTNFPL 252
             T  A   TD+     ED EL  + R+    F P     F + L SP +W  I   +F L
Sbjct: 985  GTCHA---TDFKL--PEDDELHPEWRNEAISFFPSARGSF-IKLVSPNTWSII--DSFSL 1036

Query: 253  HEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQP 312
               E+V+ +KN+S+E        +  I +GT +   ED+  RG I +F++++VVP+P  P
Sbjct: 1037 GADEYVMAIKNISLEVSENTHERKDMIVVGTAFARGEDIPSRGCIYVFEVVQVVPDPDHP 1096

Query: 313  LTKNKIKMIYAKEQKGPVTAICHVA--GFLVTAVGQKIYIWQLK-DNDLTGIAFIDTEVY 369
             T  K+K+I  +  KG VTA+  +   GF++ A GQK  +  LK D  L  +AF+D + Y
Sbjct: 1097 ETDRKLKLIGKEPVKGAVTALSEIGGQGFVLVAQGQKCMVRGLKEDGSLLPVAFMDMQCY 1156

Query: 370  IASMVSVK--NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGI 427
            ++ +  +K   + ++GD  + +    Y  E   +SL A+D                    
Sbjct: 1157 VSVVKELKGTGMCILGDAVKGVWFAGYSEEPYKMSLFAKDL------------------- 1197

Query: 428  IDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARE 487
                           + LE+C        + L +   +  +++D D N+ +  Y PE  +
Sbjct: 1198 ---------------DYLEVCAA------EFLPDGKRLFIVVADSDCNIHVLQYDPEDPK 1236

Query: 488  SNGGHRLIKKTDFHLGQHVNTFFKI-RCKPSS---ISDAPG-----ARSRFLTWYASLDG 538
            S+ G RL+ ++ FH+G   +T   + R   SS   +S + G               + +G
Sbjct: 1237 SSNGDRLLSRSKFHMGNFASTLTLLPRTMVSSEKMVSSSDGMDIDNQSPLHQVLMTTQNG 1296

Query: 539  ALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSL 598
            +LG    +PE++YRRL  LQ+ +     H  GLNPRAFR  +      G   RG++DG+L
Sbjct: 1297 SLGLITCIPEESYRRLSALQSQLTNTLEHPCGLNPRAFRAVESD----GTAGRGMLDGNL 1352

Query: 599  VWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALS 635
            ++K++ +S   + EI  ++G++  +I     D+EA+S
Sbjct: 1353 LFKWIDMSKQRKTEIAGRVGAREWEI---KADLEAIS 1386


>gi|225558298|gb|EEH06582.1| conserved hypothetical protein [Ajellomyces capsulatus G186AR]
          Length = 1408

 Score =  216 bits (550), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 180/648 (27%), Positives = 296/648 (45%), Gaps = 87/648 (13%)

Query: 14   ETIVQELLTVSLGLHGNR-PLLLVRTQH-ELLIYQAFRHPKGALK----LRFKKLKVLFV 67
            ETI  ELL   LG   +R P L++R+ + +L++Y+ + +     K    LRF K+     
Sbjct: 813  ETIT-ELLVADLGDSVSRSPYLILRSSNSDLILYEPYHYTSSTEKQFSDLRFVKIANHHF 871

Query: 68   SDRSKRANEQPGLPRGVRISQ-MRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTI 126
                  +N +        +S+ +R   ++ GY+ VF+ G  P ++  +S      H M +
Sbjct: 872  PKFHSESNVEKHPANCTTLSKPLRVLGDVCGYRTVFMPGNSPCFIIKSSTS--IPHVMNL 929

Query: 127  DG-PVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAY 185
             G  V +L+ F+   C +GF+Y +  + +R+   P +  +D  W  RK+ L      + Y
Sbjct: 930  RGKTVHSLSSFNIPACEKGFVYVDTDNVVRMCRFPRNTHFDGSWAARKIGLGEQVDAVEY 989

Query: 186  HLETKTYCIVTSTAEPSTDYYKFN-GEDKELVTDPRDS--RFIPPLVSQFHVSLFSPFSW 242
               ++TY I T+          FN  ED E+  + R+    F+P  + +  V L +P +W
Sbjct: 990  SSSSETYVIGTNQK------VDFNLPEDDEIHPEWRNEVISFLPQ-IDKGSVKLLTPRTW 1042

Query: 243  EEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDI 302
              I   N  L   E ++C+K +++E        +  I +GT     ED+  RG I +F++
Sbjct: 1043 SIIDSYN--LRNAERIMCVKCLNLEVSEITHERKDTIVVGTALTKGEDIAARGCIYIFEV 1100

Query: 303  IEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA--GFLVTAVGQKIYIWQLK-DNDLT 359
            IEVVPE  +P T  K+K+I  +E KG VT++  +   GFL+ A GQK  +  LK D  L 
Sbjct: 1101 IEVVPEVDRPETNRKLKLIAKEEVKGAVTSLSGIGGQGFLIAAQGQKCIVRGLKEDGSLL 1160

Query: 360  GIAFIDTEVYIASMVSVK--NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKG 417
             +AF+D + Y+  +  +K   + ++GD  + +    Y  E   LSL ++D          
Sbjct: 1161 PVAFMDMQCYVNVLKELKGTGMCIMGDALKGLWFAGYSEEPYKLSLFSKD---------- 1210

Query: 418  YYAGNPSRGIIDGSL-VWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNV 476
                       DG+L V     L  G RL I                    +++D D N+
Sbjct: 1211 -----------DGTLQVMAADFLPDGNRLYI--------------------LVADDDCNI 1239

Query: 477  VLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFL------ 530
             +  Y PE   S+ G RL+ ++ F  G   +T   +    +S S  P A    +      
Sbjct: 1240 HVLQYDPEDPGSSKGDRLLHRSTFQTGHFASTMTLLPRTATSSSQGPDADPDMMDLDSSG 1299

Query: 531  ----TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYA 586
                    S  G++    P+ E +YRRL  LQ+ +     H  GLNPRAFR  +  G   
Sbjct: 1300 PLHHVLVTSETGSIALITPVSETSYRRLSALQSQLTNTLEHPCGLNPRAFRAVESDGIGG 1359

Query: 587  GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEAL 634
                RG++DG LV ++L L    + EI  ++G+   D+ +   D+EA+
Sbjct: 1360 ----RGMVDGDLVKRWLDLGTQRKAEIANRVGA---DVWEIRADLEAI 1400


>gi|327304811|ref|XP_003237097.1| hypothetical protein TERG_01819 [Trichophyton rubrum CBS 118892]
 gi|326460095|gb|EGD85548.1| hypothetical protein TERG_01819 [Trichophyton rubrum CBS 118892]
          Length = 1398

 Score =  215 bits (547), Expect = 7e-53,   Method: Compositional matrix adjust.
 Identities = 171/645 (26%), Positives = 300/645 (46%), Gaps = 89/645 (13%)

Query: 4    FRSHSPSAMDETIVQELLTVSLG--LHGNRPLLLVRTQHE-LLIYQAFRHPK--GALKLR 58
            + S S   ++   + ELL   LG  +H   P +++RT+H+ L++Y+ +R     G   LR
Sbjct: 795  YESSSRRPVNRVTLAELLIADLGDSIH-KSPYMILRTKHDDLVLYEPYRVAGECGQSGLR 853

Query: 59   FKKLKVLFVSDRSKRANEQPGLPRGVR-----ISQMRYFSNIAGYQGVFLCGPHPAWLFL 113
            F K     V+         PG+ + +        ++R   ++ GY+ VF+ G +P ++  
Sbjct: 854  FLKA----VNHVVMGPLTDPGVNQDINRCPSSCKRLRALPDVCGYKTVFMSGHNPCFILK 909

Query: 114  TSRGELRAHPMTIDG-PVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVR 172
            ++    R H + + G  V +L+ FH   C RGF Y +  + +R+S LP++  +D+ W  R
Sbjct: 910  SAIA--RPHVLRLRGKAVQSLSGFHIAACERGFAYVDEDNVIRMSRLPSNTRFDSGWATR 967

Query: 173  KVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDS--RFIPPLVS 230
            K+        + Y   ++ Y I TS  E     +K   ED E  T+ R+    F+P L  
Sbjct: 968  KIAFGEQVDSIVYSSASECYVIGTSAKED----FKLP-EDDESHTEWRNEFITFLPQL-E 1021

Query: 231  QFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSED 290
            +  V L  P +W  I   +  L   E ++C++ + +E        +  + +G++    ED
Sbjct: 1022 RGTVKLLEPRNWSTI--DSHELEPAERIMCIEVIRLEISELTHERKDMVVVGSSIVKGED 1079

Query: 291  VTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA--GFLVTAVGQKI 348
            +  +G I +F++I+VVPEP QP    K+K+   +E KG VTA+  +   GFL+ A GQK 
Sbjct: 1080 IVPKGFIRVFEVIDVVPEPDQPEKSKKLKLFAKEEVKGAVTALSGIGGQGFLIVAQGQKC 1139

Query: 349  YIWQLK-DNDLTGIAFIDTEVYIASMVSVK--NLILVGDYARSIALLRYQPEYRTLSLVA 405
             +  LK D  L  +AF DT+ Y+  +  +K   + ++GD  + +    Y  E   L L  
Sbjct: 1140 MVRGLKEDGSLLPVAFKDTQCYVNVLKELKGTGMCIIGDAFKGLWFTGYSEEPYKLDLFG 1199

Query: 406  RDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSM 465
            ++              N +  ++D                           D L + + +
Sbjct: 1200 KE--------------NENLAVVDA--------------------------DFLPDGNKL 1219

Query: 466  GFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSIS----- 520
              +++D D N+ +  Y PE   S+ G RL++++ FH G   +T   +     + S     
Sbjct: 1220 YILVADDDCNLHVLQYDPEDPSSSKGDRLLRRSVFHTGHFASTVTLLPHGAHTTSSPVDE 1279

Query: 521  DA------PGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPR 574
            DA      P ++ + L  + +  G++    PL E +YRRLL LQ+ +V    H   LNPR
Sbjct: 1280 DAMDTDSPPPSKYQILMTFQT--GSIAVITPLSEDSYRRLLALQSQLVNALEHPCSLNPR 1337

Query: 575  AFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGS 619
             +R  +  G       RG+IDG+L+ ++L +    + EI  ++G+
Sbjct: 1338 GYRAVESDGMGG---QRGMIDGNLLLRWLDMGAQRKAEIAGRVGA 1379


>gi|317157892|ref|XP_001826637.2| protein cft1 [Aspergillus oryzae RIB40]
 gi|391864317|gb|EIT73613.1| mRNA cleavage and polyadenylation factor II complex, subunit CFT1
            [Aspergillus oryzae 3.042]
          Length = 1389

 Score =  214 bits (546), Expect = 9e-53,   Method: Compositional matrix adjust.
 Identities = 168/644 (26%), Positives = 300/644 (46%), Gaps = 81/644 (12%)

Query: 16   IVQELLTVSLGLH-GNRPLLLVRTQHE-LLIYQAFRHPKGAL-----KLRFKKLKVLFVS 68
            ++ E++   LG    + P L++R++H+ L +Y+ F     ++      L F K   L + 
Sbjct: 796  VLTEIVVADLGDSWSSFPYLIIRSRHDDLAVYRPFISITKSVGEPHADLNFLKETNLVLP 855

Query: 69   DRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDG 128
              +    +Q      ++   +R  SNI+G+  +F  G  P ++  TS      H + + G
Sbjct: 856  RITSGVEDQSSTEEVIKSVPLRIVSNISGFSAIFRPGVSPGFIVRTSTSS--PHFLGLKG 913

Query: 129  P-VSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHL 187
                +L+ F    C  GF+  ++K  + +  +P  +  D PW ++++P+      LAY  
Sbjct: 914  GYAQSLSKFQTSECGEGFILLDSKGVIHVCQMPLGVQLDYPWTIQQIPIGEQVDHLAYSS 973

Query: 188  ETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRD--SRFIPPLVSQFHVSLFSPFSWEEI 245
             +  Y I TS        +K   ED EL  + R+  + F P  V +  + + SP +W  I
Sbjct: 974  SSGMYVIGTS----HRTEFKLP-EDDELHPEWRNEMTSFFPE-VQRSSLKVVSPKTWTVI 1027

Query: 246  PQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEV 305
               ++ L   EHV+ +KN+S+E        +  I +GT +   ED+  RG + +F++I+V
Sbjct: 1028 --DSYLLSPAEHVMAVKNMSLEISENTHERKDMIVVGTAFARGEDIASRGCVYVFEVIKV 1085

Query: 306  VPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA--GFLVTAVGQKIYIWQLK-DNDLTGIA 362
            VP+P +P    K++++  +  KG VTA+  +   GFL+ A GQK  +  LK D  L  +A
Sbjct: 1086 VPDPKRPEMDRKLRLVGKEPVKGAVTALSEIGGQGFLIVAQGQKCIVRGLKEDGSLLPVA 1145

Query: 363  FIDTEVYIASMVSVK--NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYA 420
            F+D + +++ +  +K   + ++ D  + +    Y  E   +SL A+D             
Sbjct: 1146 FMDVQCHVSVVKELKGTGMCIIADAVKGLWFAGYSEEPYKMSLFAKDL------------ 1193

Query: 421  GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFM 480
                                  + LE+         D L + + +  +++D D N+ +  
Sbjct: 1194 ----------------------DYLEVLAA------DFLPDGNKLFILVADSDCNLHVLQ 1225

Query: 481  YQPEARESNGGHRLIKKTDFHLGQHVNTFFKI-RCKPSS---ISDAPGAR-----SRFLT 531
            Y PE  +S+ G RL+ ++ FH G  ++T   + R   SS   ISD           R   
Sbjct: 1226 YDPEDPKSSNGDRLLSRSKFHTGNFISTLTLLPRTSVSSEQMISDVDAMDVDIKIPRHQM 1285

Query: 532  WYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSR 591
               S +G++G    + E++YRRL  LQ+ +     H  GLNPRAFR  +      G   R
Sbjct: 1286 LITSQNGSVGLVTCVSEESYRRLSALQSQLTNTIEHPCGLNPRAFRAVESD----GTAGR 1341

Query: 592  GIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALS 635
            G++DG L++++L +S   ++EI  ++G+   +I     D EA+S
Sbjct: 1342 GMLDGKLLFQWLDMSKQRKVEIASRVGANEWEI---KADFEAIS 1382


>gi|345566738|gb|EGX49680.1| hypothetical protein AOL_s00078g169 [Arthrobotrys oligospora ATCC
            24927]
          Length = 1407

 Score =  214 bits (544), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 164/639 (25%), Positives = 296/639 (46%), Gaps = 74/639 (11%)

Query: 10   SAMDETIVQELLTVSLGLHGNR-PLLLVRTQHE-LLIYQAFRHPKGALKLRFKKLKVLFV 67
            +A DE  ++E++   LG + ++ P L+V+T+ + ++IY+ F     +  + FKK+    +
Sbjct: 833  TARDE--IEEIIVADLGDNISKAPYLIVKTKRDDIIIYEPFI----SNGICFKKIYNTVL 886

Query: 68   SDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTID 127
               S    + P  P       +    ++ GY   F+ G  P ++  +S+   + + +   
Sbjct: 887  PTVSLSEQKSPSGP-------LVKIDDLGGYSVAFMAGDTPTFITKSSKTLPKLYKLQ-G 938

Query: 128  GPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHL 187
            G V +L+PF+     RGFLY ++K   R+   P  +S +  W  +++PL+ TP  L Y+ 
Sbjct: 939  GMVRSLSPFNTKETERGFLYIDSKGTARVCHFP-EVSMEHTWLSQRIPLERTPTSLTYYD 997

Query: 188  ETKTYCI-VTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIP 246
                Y + V ST++P  D   F  E+  +     D   +P L +  H+ + SP +W    
Sbjct: 998  PKNVYVVSVLSTSKPEVDDEDFQMEEGLV-----DETLLPELETG-HLVMISPVTWTTTD 1051

Query: 247  QTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVV 306
            +  FP+HE   V+  K V +E        +  IA+GT     E+   RG + +FD+I+VV
Sbjct: 1052 RYEFPVHEVPFVV--KAVELEISEVTKERKVLIAVGTGLLRGENSPARGAVYVFDVIDVV 1109

Query: 307  PEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLK-DNDLTGIAFID 365
            PE G+P T  K K+I  +E KG V+ +  + G+L+   GQK  I  LK D  L  +AF+D
Sbjct: 1110 PEIGKPETGKKFKLISREEVKGVVSTLAGMDGYLLITHGQKCMIRGLKEDGSLLPVAFMD 1169

Query: 366  TEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSR 425
               +     +++ +++ GD  + ++ + +  E   + L  +D +                
Sbjct: 1170 MNTHTTVAKTLEKMVMFGDVLKGVSFVGFSEEPYKMILFGKDPR---------------- 1213

Query: 426  GIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEA 485
                        QLS+               D L   ++  F+++D   N+ +  Y PE 
Sbjct: 1214 ------------QLSI------------TAGDFLPAGTACYFVVADAQSNIHVLQYDPEN 1249

Query: 486  RESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSIS------DAPGARSRFLTWYASLDGA 539
             +S  G+RL+ K + + G  V +   +  K S  +              FL  ++++ G 
Sbjct: 1250 PKSIHGNRLLPKGEIYCGHEVKSICILPKKKSLFTEPDEDDMDEDEDEEFLCMFSTMTGV 1309

Query: 540  LGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLV 599
             G    + E  YRRL ++Q  +     H  GLNPRA+R  K +   +  P R I+DG L+
Sbjct: 1310 FGTVSSITESMYRRLNVIQGQITNTGEHIAGLNPRAYRAAKFRN-TSSEPMRAILDGKLL 1368

Query: 600  WKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
             ++L L  G R E+  + G+    + ++L+ ++  ++ F
Sbjct: 1369 VRWLMLGAGRRKELAGRAGTSEEMLREDLWFLQDATAFF 1407


>gi|238508528|ref|XP_002385456.1| cleavage and polyadenylation specificity factor subunit A, putative
            [Aspergillus flavus NRRL3357]
 gi|220688975|gb|EED45327.1| cleavage and polyadenylation specificity factor subunit A, putative
            [Aspergillus flavus NRRL3357]
          Length = 1204

 Score =  214 bits (544), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 171/651 (26%), Positives = 301/651 (46%), Gaps = 82/651 (12%)

Query: 8    SPSAMDETIVQELLTVSLGLHGNRPLLLVRTQHE-LLIYQAFRHPKGAL-----KLRFKK 61
            S SA  + + Q  +T+ L     R  L +R++H+ L +Y+ F     ++      L F K
Sbjct: 606  SISATSDELAQNSMTLFLMTQDCR--LFIRSRHDDLAVYRPFISITKSVGEPHADLNFLK 663

Query: 62   LKVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRA 121
               L +   +    +Q      ++   +R  SNI+G+  +F  G  P ++  TS      
Sbjct: 664  ETNLVLPRITSGVEDQSSTEEVIKSVPLRIVSNISGFSAIFRPGVSPGFIVRTSTSS--P 721

Query: 122  HPMTIDGP-VSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTP 180
            H + + G    +L+ F    C  GF+  ++K  + +  +P  +  D PW ++++P+    
Sbjct: 722  HFLGLKGGYAQSLSKFQTSECGEGFILLDSKGVIHVCQMPLGVQLDYPWTIQQIPIGEQV 781

Query: 181  HFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRD--SRFIPPLVSQFHVSLFS 238
              LAY   +  Y I TS        +K   ED EL  + R+  + F P  V +  + + S
Sbjct: 782  DHLAYSSSSGMYVIGTS----HRTEFKLP-EDDELHPEWRNEMTSFFPE-VQRSSLKVVS 835

Query: 239  PFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRIL 298
            P +W  I   ++ L   EHV+ +KN+S+E        +  I +GT +   ED+  RG + 
Sbjct: 836  PKTWTVI--DSYLLSPAEHVMAVKNMSLEISENTHERKDMIVVGTAFARGEDIASRGCVY 893

Query: 299  LFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA--GFLVTAVGQKIYIWQLK-D 355
            +F++I+VVP+P +P    K++++  +  KG VTA+  +   GFL+ A GQK  +  LK D
Sbjct: 894  VFEVIKVVPDPKRPEMDRKLRLVGKEPVKGAVTALSEIGGQGFLIVAQGQKCIVRGLKED 953

Query: 356  NDLTGIAFIDTEVYIASMVSVK--NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQP 413
              L  +AF+D + +++ +  +K   + ++ D  + +    Y  E   +SL A+D      
Sbjct: 954  GSLLPVAFMDVQCHVSVVKELKGTGMCIIADAVKGLWFAGYSEEPYKMSLFAKDL----- 1008

Query: 414  NSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKD 473
                                         + LE+         D L + + +  +++D D
Sbjct: 1009 -----------------------------DYLEVLAA------DFLPDGNKLFILVADSD 1033

Query: 474  KNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI-RCKPSS---ISDAPGAR--- 526
             N+ +  Y PE  +S+ G RL+ ++ FH G  ++T   + R   SS   ISD        
Sbjct: 1034 CNLHVLQYDPEDPKSSNGDRLLSRSKFHTGNFISTLTLLPRTSVSSEQMISDVDAMDVDI 1093

Query: 527  --SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGY 584
               R      S +G++G    + E++YRRL  LQ+ +     H  GLNPRAFR  +    
Sbjct: 1094 KIPRHQMLITSQNGSVGLVTCVSEESYRRLSALQSQLTNTIEHPCGLNPRAFRAVESD-- 1151

Query: 585  YAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALS 635
              G   RG++DG L++++L +S   ++EI  ++G+   +I     D EA+S
Sbjct: 1152 --GTAGRGMLDGKLLFQWLDMSKQRKVEIASRVGANEWEI---KADFEAIS 1197


>gi|240277254|gb|EER40763.1| cleavage factor two protein 1 [Ajellomyces capsulatus H143]
          Length = 1408

 Score =  213 bits (542), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 179/648 (27%), Positives = 295/648 (45%), Gaps = 87/648 (13%)

Query: 14   ETIVQELLTVSLGLHGNR-PLLLVRTQH-ELLIYQAFRHPKGALK----LRFKKLKVLFV 67
            ETI  ELL   LG   +R P L++R+ + +L +Y+ + +     K    LRF K+     
Sbjct: 813  ETIT-ELLVADLGDSVSRSPYLILRSSNSDLTLYEPYHYTSSTEKQFSDLRFVKIANHHF 871

Query: 68   SDRSKRANEQPGLPRGVRISQ-MRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTI 126
                  +N +        +S+ +R   ++ GY+ VF+ G  P ++  +S      H M +
Sbjct: 872  PKFHSESNVEKHPANCTALSKPLRVLGDVCGYRTVFMPGNSPCFIIKSSTS--IPHVMNL 929

Query: 127  DG-PVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAY 185
             G  V +L+ F+   C +GF+Y +  + +R+   P +  +D  W  RK+ L      + Y
Sbjct: 930  RGKTVHSLSSFNIPACEKGFVYVDTDNVVRMCRFPRNTHFDGSWAARKIGLGEQVDAVEY 989

Query: 186  HLETKTYCIVTSTAEPSTDYYKFN-GEDKELVTDPRDS--RFIPPLVSQFHVSLFSPFSW 242
               ++TY I T+          FN  ED E+  + R+    F+P  + +  V L +P +W
Sbjct: 990  SSSSETYVIGTNQK------VDFNLPEDDEIHPEWRNEVISFLPQ-IDKGSVKLLTPRTW 1042

Query: 243  EEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDI 302
              I   N  L   E ++C+K +++E        +  I +GT     ED+  RG I +F++
Sbjct: 1043 SIIDSYN--LRNAERIMCVKCLNLEVSEITHERKDTIVVGTALTKGEDIAARGCIYIFEV 1100

Query: 303  IEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA--GFLVTAVGQKIYIWQLK-DNDLT 359
            I+VVPE  +P T  K+K+I  +E KG VT++  +   GFL+ A GQK  +  LK D  L 
Sbjct: 1101 IKVVPEVDRPETNRKLKLIAKEEVKGAVTSLSGIGGQGFLIAAQGQKCIVRGLKEDGSLL 1160

Query: 360  GIAFIDTEVYIASMVSVK--NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKG 417
             +AF+D + Y+  +  +K   + ++GD  + +    Y  E   LSL ++D          
Sbjct: 1161 PVAFMDMQCYVNVLKELKGTGMCIMGDALKGLWFAGYSEEPYKLSLFSKD---------- 1210

Query: 418  YYAGNPSRGIIDGSL-VWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNV 476
                       DG+L V     L  G RL I                    +++D D N+
Sbjct: 1211 -----------DGTLQVMAADFLPDGNRLYI--------------------LVADDDCNI 1239

Query: 477  VLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFL------ 530
             +  Y PE   S+ G RL+ ++ F  G   +T   +    +S S  P A    +      
Sbjct: 1240 HVLQYDPEDPGSSKGDRLLHRSTFQTGHFASTMTLLPRTATSSSQGPDADPDMMDLDSSG 1299

Query: 531  ----TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYA 586
                    S  G++    P+ E +YRRL  LQ+ +     H  GLNPRAFR  +  G   
Sbjct: 1300 PLHHVLVTSETGSIALITPVSETSYRRLSALQSQLANTLEHPCGLNPRAFRAVESDGIGG 1359

Query: 587  GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEAL 634
                RG++DG LV ++L L    + EI  ++G+   D+ +   D+EA+
Sbjct: 1360 ----RGMVDGDLVKRWLDLGTQRKAEIANRVGA---DVWEIRADLEAI 1400


>gi|154285962|ref|XP_001543776.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
 gi|150407417|gb|EDN02958.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
          Length = 1283

 Score =  212 bits (540), Expect = 4e-52,   Method: Compositional matrix adjust.
 Identities = 179/648 (27%), Positives = 294/648 (45%), Gaps = 87/648 (13%)

Query: 14   ETIVQELLTVSLGLHGNR-PLLLVRTQH-ELLIYQAFRHPKGALK----LRFKKLKVLFV 67
            ETI  ELL   LG   +R P L++R+ + +L++Y+ + +     +    LRF K+     
Sbjct: 688  ETIT-ELLVADLGDSVSRSPYLILRSSNSDLILYEPYHYTSSTERQFSGLRFVKIANHHF 746

Query: 68   SDRSKRANEQPGLPRGVRISQ-MRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTI 126
                  +N          IS+ +R   ++ GY+ VF+ G  P ++  +S      H M +
Sbjct: 747  PKSHSESNAGKHPANCTAISKPLRVLGDVCGYRTVFMPGNSPCFIIKSSTS--IPHVMNL 804

Query: 127  DG-PVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAY 185
             G  V +L+ F+   C +GF+Y +  + +R+   P +  +D  W  RK+ L      + Y
Sbjct: 805  RGKTVHSLSSFNIPACEKGFVYVDTDNVVRMCRFPRNTHFDGSWAARKIGLGEQVDAVEY 864

Query: 186  HLETKTYCIVTSTAEPSTDYYKFN-GEDKELVTDPRDS--RFIPPLVSQFHVSLFSPFSW 242
               ++TY I T+          FN  ED E+  + R+    F+P  + +  V L +P +W
Sbjct: 865  SSSSETYVIGTNQK------VDFNLPEDDEIHPEWRNEVISFLPQ-IDKGSVKLLTPRTW 917

Query: 243  EEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDI 302
              I   N  L   E ++C+K +++E        +  I +GT     ED+  RG I +F++
Sbjct: 918  SIIDSYN--LRTAERIMCVKCLNLEVSEITHERKDTIVVGTALTKGEDIAARGCIYIFEV 975

Query: 303  IEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA--GFLVTAVGQKIYIWQLK-DNDLT 359
            IEVVPE  +P T  K+K+I  +E KG VT++  +   G L+ A GQK  +  LK D  L 
Sbjct: 976  IEVVPEVDRPETNRKLKLIAKEEVKGAVTSLSGIGGQGSLIAAQGQKCIVRGLKEDGSLL 1035

Query: 360  GIAFIDTEVYIASMVSVK--NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKG 417
             +AF+D + Y+  +  +K   + ++GD  + +    Y  E   LSL ++D          
Sbjct: 1036 PVAFMDMQCYVNVLKELKGTGMCIMGDALKGLWFAGYSEEPYKLSLFSKD---------- 1085

Query: 418  YYAGNPSRGIIDGSL-VWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNV 476
                       DG+L V     L  G RL I                    +++D D N+
Sbjct: 1086 -----------DGTLQVMAADFLPDGNRLYI--------------------LVADDDCNI 1114

Query: 477  VLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFL------ 530
             +  Y PE   S+ G RL+ ++ F  G   +T   +    +S S  P A    +      
Sbjct: 1115 HVLQYDPEDPGSSKGDRLLHRSTFQTGHFASTMTLLPRTATSSSQRPDADPDMMDLDSSG 1174

Query: 531  ----TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYA 586
                    S  G++    P+ E +YRRL  LQ+ +     H  GLNPRAFR  +  G   
Sbjct: 1175 PLHHVLVTSETGSIALITPVSETSYRRLSALQSQLTNTLEHPCGLNPRAFRAVESDGIGG 1234

Query: 587  GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEAL 634
                RG++DG LV ++L L    + EI  ++G+   D+ +   D+EA+
Sbjct: 1235 ----RGMVDGDLVKRWLDLGTQRKAEIANRVGA---DVWEIRADLEAI 1275


>gi|303321596|ref|XP_003070792.1| CPSF A subunit region family protein [Coccidioides posadasii C735
            delta SOWgp]
 gi|240110489|gb|EER28647.1| CPSF A subunit region family protein [Coccidioides posadasii C735
            delta SOWgp]
          Length = 1394

 Score =  212 bits (540), Expect = 4e-52,   Method: Compositional matrix adjust.
 Identities = 176/630 (27%), Positives = 293/630 (46%), Gaps = 111/630 (17%)

Query: 50   HPKGAL---KLRFKKLKVLFVS--DRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLC 104
            HPK +L   +LRF K+   F+   D S +A     +PR      +R +S+I GY+ VF+ 
Sbjct: 826  HPKTSLDKQELRFVKIIDHFLPRFDPSPKAY----MPRS---KFLRAYSDICGYKTVFMS 878

Query: 105  GPHPAWLFLTSRGELRAHPMTIDG-PVSTLAPFHNVNCPRGFLYFNA------------- 150
            G +P ++  +S      H + + G  VS+L+ FH   C +GF Y +A             
Sbjct: 879  GSNPCFVMKSSTSS--PHVLRLRGEAVSSLSSFHIPACEKGFAYVDASVCVPKQYFVPWN 936

Query: 151  ------KSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTD 204
                  ++ +R+  LP +  +D  W  RKV +      + Y   ++ Y + +S       
Sbjct: 937  KLILVIQNMVRMCRLPGNTRFDNSWVTRKVHVGDQIDCVEYFAHSEIYALGSS------- 989

Query: 205  YYKFN---GEDKELVTDPRDS--RFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVL 259
             +K +    ED E+  + R     F+P L  +  + L SP +W  +   ++ L + E V+
Sbjct: 990  -HKVDFKLPEDDEIHPEWRSEVISFMPQL-ERGCIKLLSPRTWSVV--DSYELGDAERVM 1045

Query: 260  CLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIK 319
            C+K ++ME       ++  + +GT     ED+T RG I +F+IIEV P+P +P T  K+K
Sbjct: 1046 CMKTINMEISEITHEMKDMLVVGTATVRGEDITPRGSIYVFEIIEVAPDPDRPETNRKLK 1105

Query: 320  MIYAKEQKGPVTAICHVA--GFLVTAVGQKIYIWQLK-DNDLTGIAFIDTEVYIASMVSV 376
            +    + KG VTA+  +   GFL+ A GQK  +  LK D  L  +AF+D + Y+  +  +
Sbjct: 1106 IFAKDDVKGAVTAVSGIGGQGFLIMAQGQKCMVRGLKEDGSLLPVAFMDMQCYVKVLKEL 1165

Query: 377  K--NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVW 434
            +   L ++GD  + I    Y  E   L+L  +D +  Q  +  +                
Sbjct: 1166 QGTGLCIMGDALKGIWFAGYSEEPYRLTLFGKDNEYLQVIAADF---------------- 1209

Query: 435  KFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRL 494
                L  G+RL I                    +++D D  + +  Y PE   S+ G RL
Sbjct: 1210 ----LPDGKRLYI--------------------LVADDDCTIHVLEYDPEDPTSSKGDRL 1245

Query: 495  IKKTDFHLGQHVNTFFKIRCKPSSIS-DAPGARS--------RFLTWYASLDGALGFFLP 545
            + ++ FH+G   +T   +    SS S D PG            +     S +G++G   P
Sbjct: 1246 LHRSSFHMGHFTSTMTLLPQHSSSPSADDPGEDDMDVDYVPKSYQVLVTSQEGSIGVVTP 1305

Query: 546  LPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQL 605
            L E +YRRL  LQ+ +VT   H  GLNP+A+R  +  G+      RGI+DG+L+ ++L +
Sbjct: 1306 LTEDSYRRLSALQSQLVTSMEHPCGLNPKAYRAVESDGFGG----RGIVDGNLLLRWLDM 1361

Query: 606  SLGERLEICKKIGSKHNDILDELYDIEALS 635
             +  + EI  ++G+   DI     D+E +S
Sbjct: 1362 GVQRKAEIAGRVGA---DIESIRVDLEKIS 1388


>gi|326477251|gb|EGE01261.1| protein kinase subdomain-containing protein [Trichophyton equinum CBS
            127.97]
          Length = 1267

 Score =  212 bits (540), Expect = 5e-52,   Method: Compositional matrix adjust.
 Identities = 164/620 (26%), Positives = 288/620 (46%), Gaps = 78/620 (12%)

Query: 32   PLLLVRTQHE-LLIYQAFR--HPKGALKLRF-KKLKVLFVSDRSKRANEQPGLPRGVRIS 87
            P +++RT+H+ L++Y+ +R     G   LRF K +  + +  R+ +              
Sbjct: 693  PYMILRTKHDDLVLYEPYRIAGESGHSGLRFLKAVNHVVMGPRTDQGVNHDINRSPSSCK 752

Query: 88   QMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDG-PVSTLAPFHNVNCPRGFL 146
             +R   ++ GY+ VF+ G +P ++  ++    R H + + G  V +L+ FH   C RGF 
Sbjct: 753  LLRALPDVCGYKTVFMSGHNPCFILKSAIA--RPHVLRLRGKAVQSLSGFHIAACERGFA 810

Query: 147  YFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYY 206
            Y +  + +R+S LP++  +D+ W  RK+ L      + Y   ++ Y I TS  E     +
Sbjct: 811  YVDEDNVIRMSRLPSNTRFDSGWATRKIALGEQVDSIVYSSASECYVIGTSAKED----F 866

Query: 207  KFNGEDKELVTDPRDS--RFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNV 264
            K   ED E  T+ R+    F+P L  +  V L  P +W  I   +  L   E + C++ +
Sbjct: 867  KLP-EDDESHTEWRNEFITFLPQL-ERGTVKLLEPKNWSTI--DSHELKPAERITCIEVI 922

Query: 265  SMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAK 324
             +E        +  + +G++    ED+  +G I +F++I+VVPEP QP    K+K+   +
Sbjct: 923  RLEISELTHERKDMVVVGSSIVKGEDIVPKGFIRVFEVIDVVPEPDQPEKSKKLKLFAKE 982

Query: 325  EQKGPVTAICHVA--GFLVTAVGQKIYIWQLK-DNDLTGIAFIDTEVYIASMVSVK--NL 379
            E KG VTA+  +   GFL+ A GQK  +  LK D  L  +AF DT+ Y+  +  +K   +
Sbjct: 983  EVKGAVTALSGIGGQGFLIVAQGQKCMVRGLKEDGSLLPVAFKDTQCYVNVLKELKGTGM 1042

Query: 380  ILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQL 439
             ++GD  + +  + Y  E   L L  ++              N +  ++D          
Sbjct: 1043 CIIGDAFKGLWFIGYSEEPYKLDLFGKE--------------NENLAVVDA--------- 1079

Query: 440  SLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTD 499
                             D L + + +  +++D D N+ +  Y PE   S+ G RL+ ++ 
Sbjct: 1080 -----------------DFLPDGNKLYILVADDDCNLHVLQYDPEDPSSSKGDRLLHRSV 1122

Query: 500  FHLGQHVNTFFKI---RCKPSSISDA--------PGARSRFLTWYASLDGALGFFLPLPE 548
            FH G   +T   +      PS+  D         P ++ + L  + +  G++    PL E
Sbjct: 1123 FHTGHFASTMTLLPHGAYTPSAPVDEDAMDTDSLPPSKYQILMTFQT--GSIAVITPLSE 1180

Query: 549  KNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLG 608
             +YRRLL LQ+ +V    H   LNPR +R  +  G       RG+IDG+L+ ++L +   
Sbjct: 1181 DSYRRLLALQSQLVNALEHPCSLNPRGYRAVESDGMGG---QRGMIDGNLLLRWLDMGAQ 1237

Query: 609  ERLEICKKIGSKHNDILDEL 628
             + EI  ++G+    I  +L
Sbjct: 1238 RKAEIAGRVGADVGAIRTDL 1257


>gi|281205270|gb|EFA79463.1| CPSF domain-containing protein [Polysphondylium pallidum PN500]
          Length = 1395

 Score =  212 bits (539), Expect = 5e-52,   Method: Compositional matrix adjust.
 Identities = 129/398 (32%), Positives = 221/398 (55%), Gaps = 35/398 (8%)

Query: 28   HGNRPLLLVRTQHELLIYQAFRHPKGALKLRFKKLKVLFVSDRSKRANEQ-------PGL 80
            H +  L+++    ++LIY+A ++ K ++    + ++ +  +D++  + ++       P  
Sbjct: 886  HSSPYLMILNEFGDILIYKAIKY-KDSMDNTKELIRFIKHTDQNLHSKQREYSYGIDPSS 944

Query: 81   PRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVN 140
                 I ++  F NI G++GVF+CG    W F   +  LRAHPM    PV++   FHN+N
Sbjct: 945  ESSFYIRKIVAFDNIGGHKGVFMCGKRSLWFF-CEKNYLRAHPMNFKDPVTSFTCFHNIN 1003

Query: 141  CPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAE 200
            C  GF+YF  K  LRI+ L   ++++  W +RK+PL+ T H +++H E K Y +V S  +
Sbjct: 1004 CSYGFIYFTEKGVLRINQLSNMMNFENEWAIRKIPLRMTCHKISFHQEFKCYVLVISYPQ 1063

Query: 201  -PSTDYYKFNGEDKELVTDPRDSRFIPPLV--SQFHVSLFSP-FSWEEIPQTNFPLHEWE 256
             P +D             +    +   PL+   +F V L  P  +W  +   +F + E E
Sbjct: 1064 APQSD-----------EEEEEKEKSKKPLILEEKFQVKLIDPSMNWSIVD--SFSMSEKE 1110

Query: 257  HVLCLKNVSMEYEGTLSG--LRGYIALGTNYNYSEDVTCRGRILLFDII---EVVPEPGQ 311
             VLC K V ++Y   + G  L+ Y+ +GT Y + ED  C+GRIL+F+II   EV  + G+
Sbjct: 1111 TVLCAKIVHLKY-ADVDGIKLKPYLCVGTAYTHGEDTVCKGRILVFEIISHREVQDDTGE 1169

Query: 312  PLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIA 371
               K ++ ++Y K+QKGPVTA+  + G L+ ++G K+ +       L GIAF DT+++I 
Sbjct: 1170 E--KKRLNLLYEKDQKGPVTALAGLNGLLLMSIGPKLIVNNFSSGSLVGIAFYDTQIFIV 1227

Query: 372  SMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYK 409
            S+ +VKN ILVGD  +S++  + + + + L L+ +DY+
Sbjct: 1228 SLSTVKNYILVGDMYKSVSFFKLKDQ-KQLILLGKDYE 1264


>gi|350633238|gb|EHA21604.1| hypothetical protein ASPNIDRAFT_51242 [Aspergillus niger ATCC 1015]
          Length = 1406

 Score =  211 bits (537), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 165/596 (27%), Positives = 278/596 (46%), Gaps = 88/596 (14%)

Query: 71   SKRANEQ-PGLPRGVRISQ----------MRYFSNIAGYQGVFLCGPHPAWLFLTSRGEL 119
            SK  N   P +P GV  +Q          +R   +I+G   VF+ G    ++  TS    
Sbjct: 861  SKETNSVLPRIPPGVSSTQPSGSDYRARPLRILPDISGLSAVFMPGASAGFIIRTSASA- 919

Query: 120  RAHPMTIDGPVS-TLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKC 178
              H + + G  S +++      C +GF+Y +++S +R   LP    +D  W +++V L  
Sbjct: 920  -PHFLRLRGENSRSVSSLDTPECSKGFIYLDSQSTVRFCKLPPMTRFDYQWTLKRVHLGE 978

Query: 179  TPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDS-----RFIPPLVSQFH 233
                LAY   +  Y + T  A   TD+     ED EL  + R+       F P     F 
Sbjct: 979  QVDHLAYSTSSGMYVLGTCHA---TDFKL--PEDDELHPEWRNEDCLAISFFPSARGSF- 1032

Query: 234  VSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTC 293
            + L SP +W  I   +F L   E+V+ +KN+S+E        +  I +GT +   ED+  
Sbjct: 1033 IKLVSPNTWSII--DSFSLGADEYVMAIKNISLEVSENTHERKDMIVVGTAFARGEDIPS 1090

Query: 294  RGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA--GFLVTAVGQKIYIW 351
            RG I +F++++VVP+P  P T  K+K+I  +  KG VTA+  +   GF++ A GQK  + 
Sbjct: 1091 RGCIYVFEVVQVVPDPDHPETDRKLKLIGKEPVKGAVTALSEIGGQGFVLVAQGQKCMVR 1150

Query: 352  QLK-DNDLTGIAFIDTEVYIASMVSVK--NLILVGDYARSIALLRYQPEYRTLSLVARDY 408
             LK D  L  +AF+D + Y++ +  +K   + ++GD  + +    Y  E   +SL A+D 
Sbjct: 1151 GLKEDGSLLPVAFMDMQCYVSVVKELKGTGMCILGDAVKGVWFAGYSEEPYKMSLFAKDL 1210

Query: 409  KPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFM 468
                                              + LE+C        + L +   +  +
Sbjct: 1211 ----------------------------------DYLEVCAA------EFLPDGKRLFIV 1230

Query: 469  ISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI-RCKPSS---ISDAPG 524
            ++D D N+ +  Y PE  +S+ G RL+ ++ FH+G   +T   + R   SS   +S + G
Sbjct: 1231 VADSDCNIHVLQYDPEDPKSSNGDRLLSRSKFHMGNFASTLTLLPRTMVSSEKMVSSSDG 1290

Query: 525  -----ARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTY 579
                           + +G+LG    +PE++YRRL  LQ+ +     H  GLNPRAFR  
Sbjct: 1291 MDIDNQSPLHQVLMTTQNGSLGLITCIPEESYRRLSALQSQLTNTLEHPCGLNPRAFRAV 1350

Query: 580  KGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALS 635
            +      G   RG++DG+L++K++ +S   + EI  ++G++  +I     D+EA+S
Sbjct: 1351 ESD----GTAGRGMLDGNLLFKWIDMSKQRKTEIAGRVGAREWEI---KADLEAIS 1399


>gi|121719617|ref|XP_001276507.1| cleavage and polyadenylation specificity factor subunit A, putative
            [Aspergillus clavatus NRRL 1]
 gi|148886827|sp|A1C3U1.1|CFT1_ASPCL RecName: Full=Protein cft1; AltName: Full=Cleavage factor two protein
            1
 gi|119404719|gb|EAW15081.1| cleavage and polyadenylation specificity factor subunit A, putative
            [Aspergillus clavatus NRRL 1]
          Length = 1401

 Score =  210 bits (535), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 182/650 (28%), Positives = 306/650 (47%), Gaps = 103/650 (15%)

Query: 16   IVQELLTVSLGLHGN-RPLLLVRTQHE-LLIYQAF----RHPKGALKLRFKK-----LKV 64
            I+ E +  +LG   N  P L++RT ++ L+IY+ F            LRF K     L  
Sbjct: 807  ILSEAIVANLGDSWNPLPHLILRTDNDDLVIYKPFISSVEEDGDPHCLRFVKETNHVLPR 866

Query: 65   LFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRG-----EL 119
            +     +  ++++P   R + I       +I+GY  VF+ G   +++F TSR       L
Sbjct: 867  IPPDSDTNISDKEPSNHRPLCI-----LPDISGYSAVFMPGTSASFIFKTSRSCPHILRL 921

Query: 120  RAHPMTIDGPVSTLAPFH--NVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLK 177
            R       G V +L+ F   + +  RGF+Y ++K  +RI  LP    YD  W ++KV + 
Sbjct: 922  RG------GVVRSLSDFDFTDPSLGRGFIYVDSKDVVRICQLPPETIYDYSWTLKKVAIG 975

Query: 178  CTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDS--RFIPPLVSQFHVS 235
                 LAY + ++TY + TS    S D+     ED EL  + R+    F+P L  Q  + 
Sbjct: 976  EHVDHLAYSISSETYVLGTSH---SADFKL--PEDDELHPEWRNEAISFLPEL-RQCCLK 1029

Query: 236  LFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRG 295
            +  P +W  I   ++ L   E ++ +KN+++E        +  I +GT     ED+  RG
Sbjct: 1030 VVHPKTWTVI--DSYTLGPDEEIMAVKNMNLEVSENTHERKNMIVVGTALARGEDIPARG 1087

Query: 296  RILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA--GFLVTAVGQKIYIWQL 353
             I +F++I+VVP+P +P T  K+K+I  +  KG VTA+  +   GFL+ A GQK  +  L
Sbjct: 1088 CIYVFEVIKVVPDPEKPETDRKLKLIGKELVKGAVTALSEIGGQGFLIAAQGQKCMVRGL 1147

Query: 354  K-DNDLTGIAFIDTEVYIASMVSVK--NLILVGDYARSIALLRYQPEYRTLSLVARDYKP 410
            K D  L  +AF+D + Y+  +  +K   + +VGD  + I    Y  E   +SL  +D   
Sbjct: 1148 KEDGSLLPVAFMDVQCYVNVLKELKGTGMCIVGDAFKGIWFAGYSEEPYKMSLFGKD--- 1204

Query: 411  TQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMIS 470
                                              LE  + + +   D L +   +  +++
Sbjct: 1205 ----------------------------------LEYPEVVAA---DFLPDGDKLFILVA 1227

Query: 471  DKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF---------FKIRCKPSSISD 521
            D D N+ +  Y+PE   S+ G +L+ ++ FH+G   +T          ++I   PS+ SD
Sbjct: 1228 DSDCNLHVLQYEPEDPMSSNGDKLLVRSKFHMGHFTSTLTLLPRTTASYEI---PSADSD 1284

Query: 522  APGARSRFL---TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRT 578
            +     R         S  G++G    +PE++YRRL  LQ+ +     H  GLNPRA+R 
Sbjct: 1285 SMEVDPRITPQQVLITSQSGSIGIVTSIPEESYRRLSALQSQLANTVEHPCGLNPRAYRA 1344

Query: 579  YKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEL 628
             +      G   RG++DG+L++++L +S   R+EI  ++G+   +I  +L
Sbjct: 1345 IESD----GTAGRGMLDGNLLYQWLSMSKQRRMEIAARVGAHEWEIKADL 1390



 Score = 42.0 bits (97), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 27/98 (27%), Positives = 50/98 (51%), Gaps = 11/98 (11%)

Query: 380  ILVGDYARSIALLRYQPE--YRTLSLVARDYK-----PTQPNSKGYYA----GNPSRGII 428
            +L+   + SI ++   PE  YR LS +          P   N + Y A    G   RG++
Sbjct: 1297 VLITSQSGSIGIVTSIPEESYRRLSALQSQLANTVEHPCGLNPRAYRAIESDGTAGRGML 1356

Query: 429  DGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMG 466
            DG+L++++L +S   R+EI  ++G+   +I  +  ++G
Sbjct: 1357 DGNLLYQWLSMSKQRRMEIAARVGAHEWEIKADLEAVG 1394


>gi|441648592|ref|XP_004093268.1| PREDICTED: LOW QUALITY PROTEIN: cleavage and polyadenylation
           specificity factor subunit 1 [Nomascus leucogenys]
          Length = 1177

 Score =  210 bits (535), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 110/225 (48%), Positives = 141/225 (62%), Gaps = 19/225 (8%)

Query: 16  IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP----KGALKLRFKKL--------- 62
           +V+E+L V+LG   +RP LLV    ELLIY+AF H     +G LK+RFKK+         
Sbjct: 763 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKVPHNINFREK 822

Query: 63  -KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRA 121
                         E+    RG R+++ RYF +I GY GVF+CGP P WL +T RG LR 
Sbjct: 823 KPKPSKKKAEGGGTEEGAGARG-RVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRL 881

Query: 122 HPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPH 181
           HPM IDGPV + APFHNVNCPRGFLYFN + ELRISVLP +LSYDAPWPV K+PL+CT H
Sbjct: 882 HPMAIDGPVDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVXKIPLRCTAH 941

Query: 182 FLAYHLETKTYC---IVTSTAEPSTDYYKFNGEDKELVTDPRDSR 223
           ++AYH+E+K  C   I+ +    S    ++  E K L    RD++
Sbjct: 942 YVAYHVESKV-CPNFILAADVMKSISLLRYQEESKTLSLVSRDAK 985



 Score =  194 bits (494), Expect = 8e-47,   Method: Compositional matrix adjust.
 Identities = 106/279 (37%), Positives = 157/279 (56%), Gaps = 46/279 (16%)

Query: 366  TEVYIASMVSVK---NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGN 422
            T  Y+A  V  K   N IL  D  +SI+LLRYQ E +TLSLV+RD KP +  S  +   N
Sbjct: 939  TAHYVAYHVESKVCPNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDN 998

Query: 423  PSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQ 482
                                                    + +GF++SD+D+N++++MY 
Sbjct: 999  ----------------------------------------AQLGFLVSDRDRNLMVYMYL 1018

Query: 483  PEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGA---RSRFLTWYASLDGA 539
            PEA+ES GG RL+++ DFH+G HVNTF++  C+ ++   +  +    ++ +TW+A+LDG 
Sbjct: 1019 PEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGATEGLSKKSVVWENKHITWFATLDGG 1078

Query: 540  LGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLV 599
            +G  LP+ EK YRRLLMLQN + T   H  GLNPRAFR          N  R ++DG L+
Sbjct: 1079 IGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELL 1138

Query: 600  WKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
             ++L LS  ER E+ KKIG+  + ILD+L + + +++HF
Sbjct: 1139 NRYLYLSTMERSELAKKIGTTPDIILDDLLETDRVTAHF 1177


>gi|396471273|ref|XP_003838832.1| similar to cleavage and polyadenylation specificity factor subunit A
            [Leptosphaeria maculans JN3]
 gi|312215401|emb|CBX95353.1| similar to cleavage and polyadenylation specificity factor subunit A
            [Leptosphaeria maculans JN3]
          Length = 1402

 Score =  210 bits (535), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 185/648 (28%), Positives = 282/648 (43%), Gaps = 101/648 (15%)

Query: 10   SAMDETIVQELLTVSLGLHGNR-PLLLVRTQ-HELLIYQAFRHP-KGALKLRFKKLK-VL 65
            SA   TI  E+L   LG    R P L++RT   +L+IY+AF  P + A  L  K L+ + 
Sbjct: 793  SAAKATIT-EILAADLGDVTTRSPHLIIRTSSDDLVIYKAFHFPSRSAADLWTKNLRWIK 851

Query: 66   FVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMT 125
                   R  E  G       S +    ++ GY  VF  G  P+++F  +    R   ++
Sbjct: 852  LAQQHVPRYVEDAGSEDAGVESTLLALDDVCGYSTVFQRGASPSFIFKEASSSPRVIGLS 911

Query: 126  IDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLP--THLSYDAPWPVRKVPLKCTPHFL 183
               PV  L  FH  +C RGF Y ++   LRIS LP  TH  +   W  R++P+    + L
Sbjct: 912  -GKPVKGLTTFHTSSCERGFAYVDSTDTLRISQLPSRTHFGHLG-WATRRLPMDAEVYAL 969

Query: 184  AYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRF---------IPPLVSQFHV 234
            AYH              P+  Y    G+ ++ V DP ++             P V +  +
Sbjct: 970  AYH--------------PAGLYVVGTGQPEDFVLDPSETYHYELPKEDISFKPSVERGVI 1015

Query: 235  SLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCR 294
             L    +W  I    F   E   VLC+K +++E   T    +  IA+GT+  + ED+  +
Sbjct: 1016 KLIDEGTWSIIDTHVFDPQEV--VLCIKALNLEVSETTHQRKDLIAVGTSIVHGEDLATK 1073

Query: 295  GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA--GFLVTAVGQKIYIWQ 352
            G I +F++I VVPEP +P T  ++K+I   E KG V+AI  +   GFL+ A GQK  +  
Sbjct: 1074 GCIRIFEVITVVPEPDRPETNKRLKLIVKDEVKGAVSAISELGTQGFLIMAQGQKCMVRG 1133

Query: 353  LK-DNDLTGIAFIDTEVYIASMVSVKN--LILVGDYARSIALLRYQPEYRTLSLVARDYK 409
            LK D  L  +AF+D + Y+ ++ ++ N  ++L+GD  R +    Y  E   +SL  R   
Sbjct: 1134 LKEDGTLLPVAFMDMQCYVTTLKTLPNTGMLLMGDAYRGVWFTGYTEEPYKMSLFGRSKH 1193

Query: 410  PTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMI 469
              +                  ++  +FL  +           G  H            ++
Sbjct: 1194 NLE------------------AMAVEFLPFN-----------GELH-----------IIV 1213

Query: 470  SDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLG----------------QHVNTFFKIR 513
            +D D N+ +  + PE  +S G  RL+ K  FH G                +  +TF    
Sbjct: 1214 ADADMNIQVLQFDPENPKSEGS-RLLHKATFHTGHFPTTTHLLQSHLQMPESASTFGTTD 1272

Query: 514  C-KPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 572
               P S   AP    + L    S  G L    PL E +YRRL  L   ++       GLN
Sbjct: 1273 TFAPDSTPSAPLPLHQVL--ITSQSGTLALITPLSESSYRRLSNLAAYLINTLESPCGLN 1330

Query: 573  PRAFRTYKG--KGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 618
            P AFR  +G   G+ AG  +RG++DG L+ ++ +L    R E   K G
Sbjct: 1331 PVAFRAGEGVEGGWDAGGGARGVLDGGLLMRWGELGEQRRKEGLAKYG 1378


>gi|407929511|gb|EKG22329.1| Cleavage/polyadenylation specificity factor A subunit [Macrophomina
            phaseolina MS6]
          Length = 1418

 Score =  210 bits (534), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 184/663 (27%), Positives = 283/663 (42%), Gaps = 115/663 (17%)

Query: 5    RSHSPSAMDETIVQELLTVSLGLHGNRPLLLVRTQ-HELLIYQAFRHPK-GALKLRFKKL 62
            RS S +A+ E IV EL   +       P L+VRT  ++L+IYQ +  P    +K  F+ L
Sbjct: 801  RSSSKAALTEVIVAELGDSTY----KTPYLIVRTSSNDLVIYQPYHFPAHEVVKPFFENL 856

Query: 63   KVLFVSD-RSKRANEQPGLPR---GV-RISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRG 117
            + L +   R    +E+P L     G+ + S +   +N+ GY  VF+ G  P+++   S  
Sbjct: 857  RWLKIPQPRLPEFSEEPALESEDTGIGKESILTTIANVGGYSAVFMAGTSPSFILKESSS 916

Query: 118  ELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSY-DAPWPVRKVPL 176
              R   M     V  L+ FH   C RGF Y NA   LR+  LP    Y DA W V+K+ +
Sbjct: 917  LPRVIKMRTKS-VKNLSSFHRAECDRGFAYINADGNLRVCQLPRGYRYGDAGWAVKKISI 975

Query: 177  KCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRF---------IPP 227
                  + YH              P  D       DK+  T P D              P
Sbjct: 976  NQDVQAMCYH--------------PPKDVLVLGVGDKKPFTLPEDEHHHEWLEENITFKP 1021

Query: 228  LVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNY 287
            +V Q  + +    S   I    + L  +E VL +K +++E        +  +A+GT +  
Sbjct: 1022 MVEQGMIKVLDTQSLAVI--DTYELEAFEVVLTIKVLNLEVSENTHERKQLVAVGTGFIR 1079

Query: 288  SEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA--GFLVTAVG 345
             ED+  RG I +F++I VVPEPG+P T  ++K+I  +E +G VTAI  V   GFL+ A G
Sbjct: 1080 GEDLPSRGCIYVFEVINVVPEPGRPETNRRLKLIAKEEVRGSVTAITDVGSQGFLLMAQG 1139

Query: 346  QKIYIWQLK-DNDLTGIAFIDTEVY--IASMVSVKNLILVGDYARSIALLRYQPEYRTLS 402
            QK  +  LK D  L  +AF+D + Y  +A  ++   ++L+GD A+    + Y  +   + 
Sbjct: 1140 QKCMVRGLKEDGTLLPVAFMDMQCYVTVAKELNGSGMLLMGDAAKGAWFVGYTEDPYKMI 1199

Query: 403  LVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEF 462
            L  +                 SR                  ++E+         D L   
Sbjct: 1200 LFGK-----------------SR-----------------SKMEVMAA------DFLPHD 1219

Query: 463  SSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKT------------------------ 498
              +  M++D D N+    Y P+  +S  G RL+ K+                        
Sbjct: 1220 KQLYLMVADGDCNLHALQYDPDHPKSLSGQRLLHKSTFHTGHFTTTMTLLPSSLSPTVSP 1279

Query: 499  ---DFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLL 555
               D H   HV+           I  AP    + +    +  G+L    PL E+ YRRL 
Sbjct: 1280 SSADEHANGHVSPSPSPENDAMDIDPAPAGTVQHI-LLTTQTGSLALLTPLSEQQYRRLG 1338

Query: 556  MLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICK 615
             LQ  ++    H  GLNPRA+R  + +G+     SRGI+DG+L+ ++ +L    R E   
Sbjct: 1339 ALQTYLIGALEHWCGLNPRAYRAVESEGF----GSRGIVDGALLARWCELGSQRRAEGAA 1394

Query: 616  KIG 618
            K+G
Sbjct: 1395 KVG 1397


>gi|451849663|gb|EMD62966.1| hypothetical protein COCSADRAFT_92785 [Cochliobolus sativus ND90Pr]
          Length = 1405

 Score =  209 bits (532), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 181/643 (28%), Positives = 284/643 (44%), Gaps = 83/643 (12%)

Query: 10   SAMDETIVQELLTVSLGLHGNR-PLLLVRTQHE-LLIYQAFRHP-KGALKLRFKKLKVLF 66
            SA+  TI  E+L   LG    + P L++RT  + ++IY+AF  P + A  L  K L+ + 
Sbjct: 788  SAIKATIT-EILAADLGDATTKSPHLIIRTSSDNIVIYKAFHSPSRSAADLWTKNLRWVK 846

Query: 67   VSDRS-KRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMT 125
            +S +   R  E  G       S +   S+I GY  VF  G  PA++F  S    R   ++
Sbjct: 847  LSQQHIPRYTEDGGAEDSGFESTLLALSDIGGYSTVFQRGTTPAFIFKESSSAPRVIGLS 906

Query: 126  IDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYD-APWPVRKVPLKCTPHFLA 184
               PV +L  FH  +C RGF Y ++   LRIS LP    Y    W  R++P+    H LA
Sbjct: 907  -GKPVKSLTSFHTSSCQRGFAYLDSTDTLRISQLPPQTHYGHLGWATRRMPMDAEIHALA 965

Query: 185  YHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEE 244
            YH    +   +    +P  + Y+ +  +      P++     P + +  + L    +W  
Sbjct: 966  YH---SSGLYIIGAGQP--EEYQLDPSETYHYELPKEDMSFKPTIERGIIQLLDEKTWAI 1020

Query: 245  IPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIE 304
            I      L   E VL +K +++E        +  IA+GT   + ED+  +G I +F++I 
Sbjct: 1021 I--DTHVLDPQEVVLSIKTLNLEVSENTHQRKDLIAVGTAILHGEDLATKGCIRIFEVIT 1078

Query: 305  VVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA--GFLVTAVGQKIYIWQLK-DNDLTGI 361
            VVPEP +P T  ++K+I   E KG V+AI  +   GF++ A GQK  +  LK D  L  +
Sbjct: 1079 VVPEPDRPETNKRLKLIVKDEVKGAVSAISELGTQGFMIMAQGQKCMVRGLKEDGTLLPV 1138

Query: 362  AFIDTEVYIASMVSV--KNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYY 419
            AF+D + Y++ + ++    ++ + D  R +    Y  E   +SL AR     +       
Sbjct: 1139 AFMDMQCYVSDLKNLPGTGMLAMSDAYRGVWFTGYTEEPYRMSLFARSKHSLE------- 1191

Query: 420  AGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLF 479
                       ++   F+     E+L +                    +++D D N+ + 
Sbjct: 1192 -----------AIAVDFIPFE--EQLHL--------------------LVADADMNLQVL 1218

Query: 480  MYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI--RCKPSSISDAPGARSR----FLTWY 533
             + P+  +S  G RL+ K+ FH G    T   +  R K  S SD  GA +     F    
Sbjct: 1219 QFDPDNPKSEAGSRLLHKSTFHTGHFPATLHVVHSRLKMPSASDFAGANNTENGDFEMDT 1278

Query: 534  ASLD----------------GALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFR 577
            +S D                G L    PL E  YRRL  L   +      T GLNPRAFR
Sbjct: 1279 SSPDDKATQPLHQILCTTQSGTLALVTPLSEDTYRRLSNLSAYLSNTLDATAGLNPRAFR 1338

Query: 578  TYK--GKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 618
                   G+ AG  +RG++DG+L+ ++ +L    R E   K G
Sbjct: 1339 ASDTPDGGWDAGTGARGMLDGNLLMRWGELGERGRREGLAKYG 1381


>gi|390599704|gb|EIN09100.1| hypothetical protein PUNSTDRAFT_67240 [Punctularia strigosozonata
            HHB-11173 SS5]
          Length = 1439

 Score =  209 bits (532), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 164/645 (25%), Positives = 290/645 (44%), Gaps = 74/645 (11%)

Query: 7    HSPSAMDETIVQELLTVSLGLHGNRP-LLLVRTQHELLIYQAF---------RHPKGALK 56
             SP    E  V++ +   LG    +P LLL     +L IYQA             + +L 
Sbjct: 845  ESPRRPQELDVEQAVIAPLGETAPQPHLLLFLRSGQLAIYQAIPMQASSVDESLSRPSLG 904

Query: 57   LRFKKLKVLFVSDRSKRANEQPGLPRGVRISQ-----MRYFSNIAGYQGVFLCGPHPAWL 111
            +RF K+       + +  +E+  L    +IS+     +   S    + GVF  G HP W+
Sbjct: 905  VRFAKVATRVFEIQRQDDSEKSILAEQKKISRVLIPFLTSPSPTTTFSGVFFTGDHPCWI 964

Query: 112  FLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPV 171
                R  +R HP +    V              FL ++ +    +  +P     +   P 
Sbjct: 965  LKPDRSGIRIHP-SGHSVVHAFTSCSLWESKGDFLLYSDEGPSLLEWMP-DTDVETELPS 1022

Query: 172  RKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQ 231
            R +P   +   + +   T    ++ + A    ++  ++ ED  +V +P  +    P  S 
Sbjct: 1023 RSIPQPRSYSKVTFDASTG---LIVAAAHLEAEFATYD-EDNNIVWEPDSANVSFPRSSC 1078

Query: 232  FHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDV 291
              + L SP  W  I    F     E V  +++V +E   T SG + +IA+GT  +  ED+
Sbjct: 1079 STLELISPDEW--ITMDGFEFANNEFVTSVESVPLETSSTESGSKDFIAVGTTIDRGEDL 1136

Query: 292  TCRGRILLFDIIEVVPEPGQPLTKN-KIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYI 350
              RG   +F+I+EVVP     L++  K+++    + KGPVTA+C + G+LV+++GQKI++
Sbjct: 1137 AVRGTTYVFEIVEVVPPENSSLSRWWKLRLRCRDDAKGPVTALCAMDGYLVSSMGQKIFV 1196

Query: 351  WQLK-DNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYK 409
                 D  L G+AF+D  VY+ ++ +VKNL+++GD A+S+  + +Q +   L ++A+D++
Sbjct: 1197 RAFDMDERLVGVAFLDVGVYVTTLRAVKNLLVIGDAAKSVWFVGFQEDPYKLVILAKDFQ 1256

Query: 410  PTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMI 469
                                                 +C        D +    SM  + 
Sbjct: 1257 ------------------------------------TVCVTTA----DFIFTEDSMSILT 1276

Query: 470  SDKDKNVVLFMYQPEARESNGGHRLIKKTDF--HLGQHVNTFFKIRCKPSSISDAPGARS 527
            +D++  + L+ Y P+  +S  G +L+ +T+F  H     +  F  R      +  P A+ 
Sbjct: 1277 NDENGVMRLYQYDPQDPDSRNGQQLMCRTEFDTHTTCQTSIVFARRVGEGEEAALPQAK- 1335

Query: 528  RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAG 587
                   S+DG+L     + E  ++RL +LQ  +  +  H  GLNP+AFR  +    Y  
Sbjct: 1336 ---VVAGSIDGSLAALTCMDEPAFKRLQLLQGQLTRNIQHVAGLNPKAFRIVRND--YVS 1390

Query: 588  NP-SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI 631
             P S+GI+DG+L+  +L+L +  + EI K+I ++   +L +   I
Sbjct: 1391 KPLSKGILDGNLLSSYLELPIPRQEEITKQIATERAAVLRDWTSI 1435


>gi|326432241|gb|EGD77811.1| hypothetical protein PTSG_08901 [Salpingoeca sp. ATCC 50818]
          Length = 1506

 Score =  209 bits (531), Expect = 5e-51,   Method: Compositional matrix adjust.
 Identities = 163/639 (25%), Positives = 302/639 (47%), Gaps = 57/639 (8%)

Query: 14   ETIVQELLTVSLGLHGNRPLLLVR--TQHELLIYQAF-----RHPK--GALKLRFKKLKV 64
            E  + ELL + LG  G+RP L +R  TQH +++Y+ F     RH K  G L++R +K   
Sbjct: 899  EMTIVELLAIGLG-RGSRPHLFLRNETQH-VIVYEIFTSSYKRHEKYEGRLQIRLRKRHQ 956

Query: 65   --LFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLT-SRGELRA 121
               ++ +R  +++  P        +  R F++I+G  GVF+C   P+W     +   +R 
Sbjct: 957  HPTWIDERLAQSSSIPP-------AAFRPFADISGCDGVFVCARRPSWFMCDHTHKVVRH 1009

Query: 122  HPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPH 181
            H M  DG V       +      FLYF  K  +R++          P P R+ P+K +  
Sbjct: 1010 HAMRFDGAVQCFTQLKHAMHTSCFLYFTGKGVMRMATTAAGQVLSTPLPSRRTPIKASAC 1069

Query: 182  FLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKEL-VTDPRDSRFIP-PLVSQFHVSLFSP 239
            ++ +  E+  Y +V    EP     KF    +E    D + +   P P   ++ + LFS 
Sbjct: 1070 YVDFDPESGVYVVVLKHKEPCAHLPKFGPPMEEAPAVDMKFASDEPLPQRERYSICLFSC 1129

Query: 240  FSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILL 299
              W+ +P +   +    HV   K +++  E  L+G +  +A+GT     E    RG + L
Sbjct: 1130 EDWQLVPNSPVEIPADHHVTAFKVINISSERHLTGKKPCVAVGTTPVLGERNLERGLLQL 1189

Query: 300  FDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAV----GQKIYIWQLKD 355
            +D++EVVPEPG+P TKN++K++ + ++ G VTA+  + G+++ A+    G KI++W+++D
Sbjct: 1190 YDVLEVVPEPGKPTTKNRLKLMLSSDETGAVTALNSIEGYVIGALARRDGPKIFVWRVED 1249

Query: 356  ND-LTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPN 414
            ++ L  IAF++  ++  ++    N +++GDY   + L R   +  TL ++          
Sbjct: 1250 DEKLQPIAFLEGSMFTVTLKVALNFVIIGDYMGRVMLARLIKD-ETLKIL---------- 1298

Query: 415  SKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDK 474
                   N S+G    +L      L +G  +       +   D +   + +  +  D+  
Sbjct: 1299 -------NLSKGTTSQAL------LQVGRDVAPTSVYAA---DFIVRGAELHVLFLDQHA 1342

Query: 475  NVVLFMYQPEARESNGGHRLIKKTDFHLG-QHVNTFFKIRCKPSSISDAPGARSRFLTWY 533
            N+ +  +  +   + GG  L + + ++ G Q +    +++  P   S      + FLT Y
Sbjct: 1343 NMTILAFDSDDPTTRGGRILKRHSVYNTGHQRIVALTRLQNVPPRNSRNATVDAHFLT-Y 1401

Query: 534  ASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGI 593
             +L+G  G+   +PE  +RRL++LQ  ++ H     GL+P AF+ YK    +  +     
Sbjct: 1402 QTLEGGAGYITSIPEDIFRRLMLLQLRLLPHLKFRAGLHPSAFKKYKSASLHMVHQEVRT 1461

Query: 594  IDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIE 632
            I   +  +   L L  + E+ +++G+    + D+   IE
Sbjct: 1462 ICADVYTRLFMLDLDAQKEVARQVGTTTKQLCDDFLFIE 1500


>gi|403411348|emb|CCL98048.1| predicted protein [Fibroporia radiculosa]
          Length = 1437

 Score =  208 bits (529), Expect = 8e-51,   Method: Compositional matrix adjust.
 Identities = 172/639 (26%), Positives = 288/639 (45%), Gaps = 76/639 (11%)

Query: 9    PSAMDETIVQELLTVSLGLHGNRP-LLLVRTQHELLIYQAFRHPKGALKL---RFKKLKV 64
            P    E  +++LL   LG    RP L+L     +L +Y+    P  A  L   R   L V
Sbjct: 847  PRKPQELDIEQLLVAPLGESSPRPHLMLFLRSGQLAVYEVHSTPVPAEPLPAARSSTLLV 906

Query: 65   LFVSDRSKRAN-------EQPGLPRGVRISQMRY-FSNIAG----YQGVFLCGPHPAWLF 112
             FV   S+  N       E+  L    RIS +   F+        + GVFL G  P+WL 
Sbjct: 907  KFVKVLSRAFNIQHSDEVEKSVLAEQKRISHLLIPFATSPSPGQTFSGVFLTGDRPSWLL 966

Query: 113  LTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVR 172
             T +G ++  P +    V              FL ++ +    +  LP  +  D   P R
Sbjct: 967  CTDKGGVKVLP-SGHSVVHAFTASSVWESKNDFLLYSEEGPSLMEWLP-DVQLDGHLPSR 1024

Query: 173  KVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQF 232
             VP   +   + Y  +  T  IV ++++ S   +    ED  +V +P D+    P     
Sbjct: 1025 SVPRPRSYSNVVY--DPSTSLIVAASSQQSK--FASYDEDGNIVWEP-DTNISFPSCECS 1079

Query: 233  HVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVT 292
             + L SP  W  +    +   + E V CL  +++E   T +G + +IA+GT  N  ED+ 
Sbjct: 1080 ALELISPEGW--VTMDGYEFAQNEFVNCLDCITLETMSTETGTKDFIAVGTTINRGEDLA 1137

Query: 293  CRGRILLFDIIEVVPEPGQPLTK-NKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIW 351
             +G + +F+I+EVVP+    L +  ++K+    + KGPVTA+C +  +LV+++GQKI++ 
Sbjct: 1138 VKGAVYIFEIVEVVPDTNSGLKRLYRLKLQCRDDAKGPVTALCGMDNYLVSSMGQKIFVR 1197

Query: 352  QLK-DNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARD-YK 409
                D  L G+AF+D  V++ S+ SVKNL+++GD  +S+  + +Q +   L ++ +D Y 
Sbjct: 1198 AFDLDERLVGVAFLDVGVFVTSLRSVKNLLVIGDAVKSVWFVAFQEDPYKLVILGKDPYH 1257

Query: 410  PTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMI 469
                 +  ++A N                                          +  ++
Sbjct: 1258 TCVTCADLFFAEN-----------------------------------------RVSLLV 1276

Query: 470  SDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRF 529
             D+D  + L  Y P   ES GG  L+++T+FH      T   I  +     D P A+   
Sbjct: 1277 CDEDGVIRLLEYDPHDPESRGGQHLLRRTEFHGQTEYRTSVLIARRKDKDIDIPQAK--- 1333

Query: 530  LTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNP 589
                 S DG+L  F  + E  ++ L +LQ  +  +  H  GLNPRAFR  +    Y   P
Sbjct: 1334 -LVCGSTDGSLVSFTFVEEAAFKGLHLLQGQLTRNVQHVAGLNPRAFRIVRND--YVSRP 1390

Query: 590  -SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDE 627
             S+GI+DG+L+  F +L +  + E+ ++IG++   +L +
Sbjct: 1391 LSKGILDGNLLTTFEELPIARQNEMTRQIGTERATVLKD 1429


>gi|255075065|ref|XP_002501207.1| predicted protein [Micromonas sp. RCC299]
 gi|226516471|gb|ACO62465.1| predicted protein [Micromonas sp. RCC299]
          Length = 1423

 Score =  205 bits (522), Expect = 5e-50,   Method: Compositional matrix adjust.
 Identities = 164/602 (27%), Positives = 278/602 (46%), Gaps = 86/602 (14%)

Query: 30   NRPLLL-VRTQHELLIYQAFRHPKGA--------LKLRFKKLKVLFVSDRSKRANEQPGL 80
             RP+L  +R    +L+Y+AF  P GA         +LRF ++ +          + +   
Sbjct: 837  ERPMLTALRGDGSVLVYRAFLCPPGAGNVGHEAKPQLRFCRVPIELEGGGGGMVDTKA-- 894

Query: 81   PRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVS-TLAPFHNV 139
              G R+++     +  G +GVF+ GP P WL L  R  + A P+  +   + +  PFHNV
Sbjct: 895  LSGSRLTRFERVGDRGGIRGVFVSGPRPLWL-LVRRSRVLALPIRGEAQRTVSFTPFHNV 953

Query: 140  NCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTA 199
            NC  GF+   A   +RI  +P  + Y+A WPVRK+ L+CTPH + Y  + + Y + TS  
Sbjct: 954  NCLNGFMLGTAAGGVRICQIPGRMHYEAAWPVRKLALRCTPHHVQYLPDFRLYALSTSAP 1013

Query: 200  EPSTDYYKFNGED---KELVTDPRDSRFIPPLVSQ-FHVSLFSPFSWEEIPQTNFPLHEW 255
                D ++ N +D     L+   + +      V Q F + L  P + E   Q  + +   
Sbjct: 1014 VKWKD-HEVNEDDIHLSTLIKVRKANAMAKGGVEQVFSLRLLVPGTLECAWQ--YTVDPG 1070

Query: 256  EHVLCLKNVSMEYEGTLSG-LRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLT 314
            EHV  ++NV +    T++G L+  + +GT     ED  CRGR+L+F+++  + + G   T
Sbjct: 1071 EHVQSIRNVQL--RNTMTGALQSMLVVGTALPGGEDAPCRGRVLIFEVVWQMTDRG---T 1125

Query: 315  KNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMV 374
            K + +++  ++ K   TA+  V G L  A+G K+ +     + L  +AF DT ++  +M 
Sbjct: 1126 KWQGQLVCVRDAKMACTALEGVGGHLAVAIGTKLIVHSWDGHSLMPVAFFDTPLHTVTMN 1185

Query: 375  SVKNLILVGDYARSIALLRYQ--PEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSL 432
             VKN IL+GD  +     R++  P+ + L  +A+D+                        
Sbjct: 1186 VVKNFILLGDIQKGAFFFRWKDTPDEKLLVQMAKDF------------------------ 1221

Query: 433  VWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGH 492
                      E ++I         + L + S++  + +D   N  +F Y P++ ES  G 
Sbjct: 1222 ----------EGMDILA------TEFLVDGSTLSMLTTDMTGNAFIFSYDPKSLESWKGQ 1265

Query: 493  RLIKKTDFHLGQHVNTFFKIRCKPSSISDAPG----------ARSRFLTWYASLDGALGF 542
            +L+ K  FH+G  V+   + R K  + + APG            +R   ++ +LDG+LG 
Sbjct: 1266 KLLTKGAFHVGSPVHRMVRFRLK--APTAAPGQTISPAEQKAQANRHAVFFGTLDGSLGI 1323

Query: 543  FLPLPEKNYRRLLMLQNVMVTHTSHT--GGLNPRAFR---TYKGKGYYAGNPSRGIIDGS 597
             +P+ E  +  L  LQ  +   T H    GLN R  R   T +G+      P   ++DG 
Sbjct: 1324 LVPIEEAAHASLQSLQRYLTYATPHAALAGLNARTHRHPKTVEGRPMRQPAP-HSLLDGG 1382

Query: 598  LV 599
            L+
Sbjct: 1383 LL 1384


>gi|336276223|ref|XP_003352865.1| hypothetical protein SMAC_04980 [Sordaria macrospora k-hell]
 gi|380092984|emb|CCC09221.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 1486

 Score =  205 bits (521), Expect = 6e-50,   Method: Compositional matrix adjust.
 Identities = 171/633 (27%), Positives = 280/633 (44%), Gaps = 81/633 (12%)

Query: 17   VQELLTVSLG--LHGNRPLLLVRTQHELLIYQAFRHPKGALK-----LRFKKL-KVLFVS 68
            V E+L   LG   H +  L+L     +L +YQ +R    A +     L F+K+    F  
Sbjct: 842  VAEILVADLGDTTHKSPYLILRHANDDLTLYQPYRVKATAGQPFSKSLFFQKVPNSTFAK 901

Query: 69   DRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDG 128
               ++  E   L    R   MR  +NI+GY  VFL G  P+++  T++   R   +   G
Sbjct: 902  APEEKPVEDDELHNAQRFLPMRRCTNISGYSTVFLPGSSPSFILKTAKSSPRVLGLQGSG 961

Query: 129  PVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSY-DAPWPVRKVPLKCTPHFLAYHL 187
             V  ++ FH   C  GF+Y +     R++ +PT  S+ +    V+K+P+      + YH 
Sbjct: 962  -VQAMSSFHTEGCEHGFIYADTNGIARVTQIPTDSSFAELGLSVKKIPVGVDTQSVVYHP 1020

Query: 188  ETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQ 247
             T+ Y +  + AEP    ++   +D       R++    P+V +  + L S  +W  I  
Sbjct: 1021 PTQAYVVGCNNAEP----FELPKDDDYHKEWARENITFKPMVDRGMLKLLSGITWTVI-- 1074

Query: 248  TNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVP 307
                +   E VLC++ +++E   + +  +  IA+GT     ED+  RGR+ +FDI +V+P
Sbjct: 1075 DTVEMEPCETVLCVETLNLEVSESTNERKQLIAVGTALTKGEDLPTRGRVYVFDIADVIP 1134

Query: 308  EPGQPLTKNKIKMIYAKEQ--KGPVTAICHVA--GFLVTAVGQKIYIWQLK-DNDLTGIA 362
            EPG+P T  K+K++ AKE   +G VTA+  V   G ++ A GQK  +  LK D  L  +A
Sbjct: 1135 EPGKPETSKKLKLV-AKEDIPRGAVTALSEVGTQGLMLVAQGQKCMVRGLKEDGTLLPVA 1193

Query: 363  FIDTEVYIASMVSV--KNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYA 420
            F+D   Y+ S+  +    L L+ D  + +    Y  E   + L  +              
Sbjct: 1194 FMDMNCYVTSVKELPGTGLCLMADAFKGVWFTGYTEEPYKMMLFGKSST----------- 1242

Query: 421  GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFM 480
                                   R+E+       + D L +   +  + SD D ++ +  
Sbjct: 1243 -----------------------RMEVL------NADFLPDGKELYIVASDADGHIHILQ 1273

Query: 481  YQPEARESNGGHRLIKKTDFHLGQHVNT-------FFKIRCKPSSISDAPGARSRFLTWY 533
            + PE  +S  GH L+ +T F+ G H  T        +     P+S S+  G     +   
Sbjct: 1274 FDPEHPKSLQGHLLLHRTTFNTGAHHPTSSLLLPAVYPTTTSPNSNSEV-GENPPHILLL 1332

Query: 534  ASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFR--------TYKGKGYY 585
            AS  G L    PL E  YRRL  L   +     H  GLNP+ +R        + +  G  
Sbjct: 1333 ASPTGLLATLRPLQENAYRRLSSLAIQLTNALPHPAGLNPKGYRLPSPSASASMQLPGVD 1392

Query: 586  AGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 618
            AG   R I+DG ++ +F++L  G+R EI  + G
Sbjct: 1393 AGI-GRNIVDGKILERFMELGTGKRQEIAGRAG 1424


>gi|367052335|ref|XP_003656546.1| hypothetical protein THITE_2121311 [Thielavia terrestris NRRL 8126]
 gi|347003811|gb|AEO70210.1| hypothetical protein THITE_2121311 [Thielavia terrestris NRRL 8126]
          Length = 1460

 Score =  204 bits (519), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 171/650 (26%), Positives = 283/650 (43%), Gaps = 104/650 (16%)

Query: 17   VQELLTVSLG--LHGNRPLLLVRTQHELLIYQAFRHPKGALK-----LRFKKLKVLFVSD 69
            + E+L   LG   H +  L+L     +L +YQ FR  K   +     L F+K+    ++ 
Sbjct: 844  LAEILVADLGDSTHKSPYLILRHANDDLTLYQPFRSRKATEQAFSETLFFQKVPNTALAK 903

Query: 70   RSKRANE-----QPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPM 124
              + A+E     QP      R   MR   N+ GY  VF+ G  P+++  +S+   R  P+
Sbjct: 904  SPQEADEDEASHQP------RFLSMRRCDNVGGYSTVFVPGASPSFIIASSKSMPRVMPL 957

Query: 125  TIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSY-DAPWPVRKVPLKCTPHFL 183
               G V  ++PFH   C  GF+Y +++   R+   P    Y +    VRK+P+      +
Sbjct: 958  QGSG-VIAMSPFHTEGCEHGFIYADSRRIARVCQFPDGCIYAETGVAVRKIPIGEDIAAV 1016

Query: 184  AYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWE 243
            AYH   ++Y +  +T+EP    ++   +D       R++    P V +  + L SP +W 
Sbjct: 1017 AYHPPMQSYVVGCNTSEP----FELPKDDDYHKEWARENLSFKPTVDRGILKLLSPITWT 1072

Query: 244  EIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDII 303
             +      +   E +LC++ +++E     +  +  IA+GT     ED+  RGR+ ++DI 
Sbjct: 1073 VVDAVQ--MEPCETILCVETLNLEVSEFTNERKQLIAVGTALTKGEDLPTRGRVYVYDIA 1130

Query: 304  EVVPEPGQPLTKNKIKMIYAKEQ--KGPVTAICHVA--GFLVTAVGQKIYIWQLK-DNDL 358
            +V+PEPG+P T  K+K+I AKE   +G VTA+  +   G ++ A GQK  +  LK D  L
Sbjct: 1131 DVIPEPGRPETGKKLKLI-AKEDIPRGAVTALSEIGTQGLMLVAQGQKCMVRGLKEDGSL 1189

Query: 359  TGIAFIDTEVYIASMVSV--KNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSK 416
              +AF+D   Y+ +   +    L L+ D  + +    Y  E   + L  +          
Sbjct: 1190 LPVAFMDMNCYVTAAKELPGTGLCLLADAFKGVWFTGYTEEPYKMMLFGKSST------- 1242

Query: 417  GYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNV 476
                                       +LE+       + D L +   + F++SD D  +
Sbjct: 1243 ---------------------------KLEVL------NADFLPDGKELSFVVSDADGYI 1269

Query: 477  VLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSI----------------- 519
             +  + PE  +S  GH L+ +T F+ G H  T  K    P+S                  
Sbjct: 1270 HILQFDPEHPKSLQGHLLLHRTTFNTGAHHAT--KSLLLPASTPADKEKNDGNAANAQAK 1327

Query: 520  ---SD-----APGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGL 571
               SD      P A+   +   AS  G L    PL E  YRRL  L   +     H  GL
Sbjct: 1328 AKASDNKQPREPAAQRPHVLLLASPTGVLAALRPLSESAYRRLSSLAAQLTNSLPHPAGL 1387

Query: 572  NPRAFRTYKGKGYYAGNPS---RGIIDGSLVWKFLQLSLGERLEICKKIG 618
            NPR +R    +   AG  +   R I+DG+++ +F +L +  R+E+  + G
Sbjct: 1388 NPRGYRAAGAECPPAGVDAGLGRSIVDGTVLERFAELGMARRVELAGRAG 1437


>gi|189203597|ref|XP_001938134.1| conserved hypothetical protein [Pyrenophora tritici-repentis
            Pt-1C-BFP]
 gi|187985233|gb|EDU50721.1| conserved hypothetical protein [Pyrenophora tritici-repentis
            Pt-1C-BFP]
          Length = 1407

 Score =  203 bits (516), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 178/657 (27%), Positives = 280/657 (42%), Gaps = 109/657 (16%)

Query: 10   SAMDETIVQELLTVSLGLHGNR-PLLLVRTQHE-LLIYQAFRHP-KGALKLRFKKLKVLF 66
            SA   TI  E+L   LG    + P L++RT  + L+IY+AF  P + A  L  K L+ + 
Sbjct: 788  SAAKATIT-EILAADLGDATTKSPHLIIRTSSDNLVIYKAFHAPSRSASDLWTKNLRWVK 846

Query: 67   VSDR------SKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELR 120
            +S +          +E PG       S +    ++ GY  VF  G  PA++F  +    R
Sbjct: 847  LSQQHVPRYIEDNGSEDPGFE-----STLVALDDVCGYSTVFQRGTTPAFIFKEASSAPR 901

Query: 121  AHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYD-APWPVRKVPLKCT 179
               ++   PV +L  FH   C RGF Y ++   LRI  LP    Y    W  R++P+   
Sbjct: 902  VIGLS-GKPVKSLTSFHTSKCQRGFAYLDSTDTLRICQLPPQTHYGHLGWATRRMPMDSE 960

Query: 180  PHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSP 239
             H L YH  +  Y + T      T+ Y+ +  +      P++     P + +  V L   
Sbjct: 961  VHALTYH-PSGLYIVGTG----QTEDYQLDPTETYHYDLPKEDLTFKPSIERGVVKLLDE 1015

Query: 240  FSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILL 299
             SW  I  T+  L   E VL +K +++E        +  IA+GT+  + ED+  +G I +
Sbjct: 1016 KSWT-IIDTHI-LDPQEIVLSIKTLNLEVSEITHQRKDLIAVGTSVVHGEDLATKGCIRI 1073

Query: 300  FDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA--GFLVTAVGQKIYIWQLK-DN 356
            F++I VVP+P +P T  ++K+I   E KG V+AI  +   GFL+ A GQK  +  LK D 
Sbjct: 1074 FEVITVVPQPDRPETNKRLKLIVKDEVKGAVSAISELGTQGFLIMAQGQKCMVRGLKEDG 1133

Query: 357  DLTGIAFIDTEVYIASMVSV--KNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPN 414
             L  +AF+D + Y++ + ++    ++ +GD  R +    Y  E   +SL AR        
Sbjct: 1134 TLLPVAFMDMQCYVSDLKNLPGTGMLAMGDAYRGVWFTGYTEEPYKMSLFAR-------- 1185

Query: 415  SKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHN------DILDEFSSMGFM 468
                                                  SKHN      D L     +  +
Sbjct: 1186 --------------------------------------SKHNLETIAVDFLPFDQQLHLV 1207

Query: 469  ISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF------------------- 509
            ++D D N+ +  + P+  +S  G RL+ K  FH G    +                    
Sbjct: 1208 VADADMNLQILQFDPDNPKSEAGSRLLHKATFHTGHLPTSLHLIHSHLKLPSATDFAATN 1267

Query: 510  ------FKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVT 563
                  F +   P++ +D P  +      + +  G L    PL E +YRRL  L   +  
Sbjct: 1268 SNPADAFAMDTSPNTTTDTP-QQPFHQILHTTQSGTLALLTPLSEDSYRRLSNLTAYLAN 1326

Query: 564  HTSHTGGLNPRAFRT--YKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 618
                   LNPRAFRT      G+ AG  +RG++DG+L+ ++ +L    R E   K G
Sbjct: 1327 TLDSACSLNPRAFRTGDVAEGGWDAGTGARGVLDGNLLLRWGELGERGRREGLAKYG 1383


>gi|340924328|gb|EGS19231.1| hypothetical protein CTHT_0058560 [Chaetomium thermophilum var.
            thermophilum DSM 1495]
          Length = 1460

 Score =  203 bits (516), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 172/647 (26%), Positives = 287/647 (44%), Gaps = 96/647 (14%)

Query: 17   VQELLTVSLG--LHGNRPLLLVRTQHELLIYQAFRHPKG-----ALKLRFKKLKVLFVSD 69
            + E++   LG   H +  L+L  +  +L IYQ +R+  G     +  L F+KL     + 
Sbjct: 842  ITEIMVADLGDTTHKSPYLILRHSNDDLTIYQPYRYKLGTGQVFSKTLFFQKLPNPSFA- 900

Query: 70   RSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGP 129
            R+    EQ  +P   R+  MR  +NIAGY  VFL G  P+++  +++   R  P+   G 
Sbjct: 901  RAPEETEQDDVPPQPRLLSMRRCNNIAGYSTVFLPGHSPSFILKSAKSMPRVVPLQGAG- 959

Query: 130  VSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSY-DAPWPVRKVPLKCTPHFLAYHLE 188
            V  ++PFH   C  GF+Y ++ +  R++ +P   SY +    V+KVP+      +AYH  
Sbjct: 960  VIAMSPFHTEGCDHGFIYADSHNIARVTQIPEDWSYAELGLAVKKVPIGEDIAAVAYHPP 1019

Query: 189  TKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQT 248
             + Y +  + +EP    ++   +D       R++    P + +  + L SP +W  I   
Sbjct: 1020 QQCYVVGCNASEP----FELPKDDDYHKEWARENLVFKPTLDRGLLKLISPITWTVIDTV 1075

Query: 249  NFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPE 308
               L   E VLC++ +++E   + +  R  IA+GT     ED+  RGR+ ++DI +V+PE
Sbjct: 1076 Q--LEPCETVLCVETLNLEVSESTNERRQLIAVGTALTKGEDLPTRGRVHVYDIADVIPE 1133

Query: 309  PGQPLTKNKIKMIYAKEQ--KGPVTAICHVA--GFLVTAVGQKIYIWQLK-DNDLTGIAF 363
            PG+P T  K+K+I AKE   +G VTA+  +   G ++ A GQK  +  LK D  L  +AF
Sbjct: 1134 PGKPETSKKLKLI-AKEDIPRGAVTALSEIGTQGLMLVAQGQKCMVRGLKEDGTLLPVAF 1192

Query: 364  IDTEVYIASMVSV--KNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAG 421
            +D   Y+ +   +    L L+ D  + +  + Y  E   + L  +               
Sbjct: 1193 MDMSCYVTAAKELPGTGLCLMADAFKGVWFVGYTEEPYKMMLFGKS-------------- 1238

Query: 422  NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMY 481
                                  +LE+         D L +   +  +  D D ++ +  +
Sbjct: 1239 --------------------STKLEVLTA------DFLPDGKELFIVACDADGHIHILQF 1272

Query: 482  QPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSI-SDAPG---------------- 524
             PE  +S  GH L+ +T F+ G H  T  K    PS++ +D P                 
Sbjct: 1273 DPEHPKSLQGHLLLHRTSFNTGAHNPT--KSLLLPSTLPTDTPSTIDGSNPNTNNTNGTP 1330

Query: 525  ---------ARSR-FLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPR 574
                     A  R  +    S  G +    PL E +YRRL  L   +V    H  GLNP+
Sbjct: 1331 NASNLAPYDATERPHILLLCSPTGLIAALRPLSESSYRRLSSLAAQLVNSLPHAAGLNPK 1390

Query: 575  AFRTYKGKGYYAG---NPSRGIIDGSLVWKFLQLSLGERLEICKKIG 618
             +R        AG   +  R I+DG+++ +F +L +  R E+  + G
Sbjct: 1391 GYRMPSADCPPAGVDASVGRNIVDGTVLERFTELGMARRAELAGRAG 1437


>gi|336388105|gb|EGO29249.1| hypothetical protein SERLADRAFT_445076 [Serpula lacrymans var.
            lacrymans S7.9]
          Length = 1424

 Score =  203 bits (516), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 166/644 (25%), Positives = 296/644 (45%), Gaps = 77/644 (11%)

Query: 9    PSAMDETIVQELLTVSLGLHGNRPLLLVRTQH-ELLIYQAFRHPKGALKL---RFKKLKV 64
            P   ++  V++++   LG     P LLV  +  +++IY+A   P  A  +   R   LKV
Sbjct: 833  PRKSNDLDVEQIILAPLGETAPLPYLLVFLRSGQIVIYEAVPTPAPADSIPPSRVSVLKV 892

Query: 65   LFVSDRSK-------RANEQPGLPRGVRISQMRYF-----SNIAG--YQGVFLCGPHPAW 110
             F+   +K          E+  L    RIS  R F     S   G    GVF  G  P+W
Sbjct: 893  KFIKTATKIFELPKHEETEKSILAEQKRIS--RQFVPFVTSPTPGSVLSGVFFTGDRPSW 950

Query: 111  LFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWP 170
            +  T++G +R +  +    V +            FL ++ +    +  +P  L  D+  P
Sbjct: 951  IVATNKGGIRIYS-SGHHIVHSFTSCSLWESKGDFLVYSDEGPSLLEWMP-DLCLDSVLP 1008

Query: 171  VRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVS 230
             R +P       + Y     +  ++ + +    ++  F+ ED  ++ +P  S    P   
Sbjct: 1009 SRNIPRSRAYANVVYD---PSAMLIVAASSMQANFASFD-EDGNIIWEPEASNVSLPKCD 1064

Query: 231  QFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSED 290
               + L +P +W  I    +     E+V  L+ V++E   T +G + +IA+GT+ +  ED
Sbjct: 1065 CSTLELIAPEAW--ITMDGYEFAPNEYVNALECVTLETLSTETGSKDFIAVGTSIDRGED 1122

Query: 291  VTCRGRILLFDIIEVVPEPGQPLTK-NKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIY 349
            +  +G   LF+I+EVVP+  Q L +  K+K++   + KGPVTA+C + G+LV+++GQKI+
Sbjct: 1123 LAVKGATYLFEIVEVVPDYSQNLKRWYKLKLLARDDAKGPVTALCGINGYLVSSMGQKIF 1182

Query: 350  IWQLK-DNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDY 408
            I     D  L G+AF+D  VY+ S+  VKN +L+GD  +SI  + +Q +   L ++A+D 
Sbjct: 1183 IRAFDMDERLVGVAFLDVGVYVTSLRVVKNFLLIGDAVKSIWFVAFQEDPYKLVVLAKDV 1242

Query: 409  KPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFM 468
              T   +  ++  + +  I+                                        
Sbjct: 1243 HRTHVTNADFFFTDDTLSIV---------------------------------------- 1262

Query: 469  ISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSR 528
              D D  + ++ Y P+  ES  G  L+ +T+FH      +   I  +    S  P A  +
Sbjct: 1263 TEDGDGILRMYAYDPDDPESKNGQHLLCRTEFHNHSECRSSLVIARRTKEESVLPQA--K 1320

Query: 529  FLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGN 588
             L+ ++  DG+L    P+ + +++RL +LQ  +  +  H  GLNPRA+R  +    +   
Sbjct: 1321 ILSAFS--DGSLSSLTPVDDASFKRLQLLQGQLTRNIQHVAGLNPRAYRIVRND--FVSK 1376

Query: 589  P-SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI 631
            P S+ I+DG L+  F  L +  + E+ K+IG++ N +L +  ++
Sbjct: 1377 PLSKDILDGQLLSAFESLPISRQNEMTKQIGTERNIVLHDWMEL 1420


>gi|336375160|gb|EGO03496.1| hypothetical protein SERLA73DRAFT_165174 [Serpula lacrymans var.
            lacrymans S7.3]
          Length = 1428

 Score =  202 bits (515), Expect = 4e-49,   Method: Compositional matrix adjust.
 Identities = 166/644 (25%), Positives = 296/644 (45%), Gaps = 77/644 (11%)

Query: 9    PSAMDETIVQELLTVSLGLHGNRPLLLVRTQH-ELLIYQAFRHPKGALKL---RFKKLKV 64
            P   ++  V++++   LG     P LLV  +  +++IY+A   P  A  +   R   LKV
Sbjct: 837  PRKSNDLDVEQIILAPLGETAPLPYLLVFLRSGQIVIYEAVPTPAPADSIPPSRVSVLKV 896

Query: 65   LFVSDRSK-------RANEQPGLPRGVRISQMRYF-----SNIAG--YQGVFLCGPHPAW 110
             F+   +K          E+  L    RIS  R F     S   G    GVF  G  P+W
Sbjct: 897  KFIKTATKIFELPKHEETEKSILAEQKRIS--RQFVPFVTSPTPGSVLSGVFFTGDRPSW 954

Query: 111  LFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWP 170
            +  T++G +R +  +    V +            FL ++ +    +  +P  L  D+  P
Sbjct: 955  IVATNKGGIRIYS-SGHHIVHSFTSCSLWESKGDFLVYSDEGPSLLEWMP-DLCLDSVLP 1012

Query: 171  VRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVS 230
             R +P       + Y     +  ++ + +    ++  F+ ED  ++ +P  S    P   
Sbjct: 1013 SRNIPRSRAYANVVYD---PSAMLIVAASSMQANFASFD-EDGNIIWEPEASNVSLPKCD 1068

Query: 231  QFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSED 290
               + L +P +W  I    +     E+V  L+ V++E   T +G + +IA+GT+ +  ED
Sbjct: 1069 CSTLELIAPEAW--ITMDGYEFAPNEYVNALECVTLETLSTETGSKDFIAVGTSIDRGED 1126

Query: 291  VTCRGRILLFDIIEVVPEPGQPLTK-NKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIY 349
            +  +G   LF+I+EVVP+  Q L +  K+K++   + KGPVTA+C + G+LV+++GQKI+
Sbjct: 1127 LAVKGATYLFEIVEVVPDYSQNLKRWYKLKLLARDDAKGPVTALCGINGYLVSSMGQKIF 1186

Query: 350  IWQLK-DNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDY 408
            I     D  L G+AF+D  VY+ S+  VKN +L+GD  +SI  + +Q +   L ++A+D 
Sbjct: 1187 IRAFDMDERLVGVAFLDVGVYVTSLRVVKNFLLIGDAVKSIWFVAFQEDPYKLVVLAKDV 1246

Query: 409  KPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFM 468
              T   +  ++  + +  I+                                        
Sbjct: 1247 HRTHVTNADFFFTDDTLSIV---------------------------------------- 1266

Query: 469  ISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSR 528
              D D  + ++ Y P+  ES  G  L+ +T+FH      +   I  +    S  P A  +
Sbjct: 1267 TEDGDGILRMYAYDPDDPESKNGQHLLCRTEFHNHSECRSSLVIARRTKEESVLPQA--K 1324

Query: 529  FLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGN 588
             L+ ++  DG+L    P+ + +++RL +LQ  +  +  H  GLNPRA+R  +    +   
Sbjct: 1325 ILSAFS--DGSLSSLTPVDDASFKRLQLLQGQLTRNIQHVAGLNPRAYRIVRND--FVSK 1380

Query: 589  P-SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI 631
            P S+ I+DG L+  F  L +  + E+ K+IG++ N +L +  ++
Sbjct: 1381 PLSKDILDGQLLSAFESLPISRQNEMTKQIGTERNIVLHDWMEL 1424


>gi|320591495|gb|EFX03934.1| cleavage and polyadenylation specificity factor subunit [Grosmannia
            clavigera kw1407]
          Length = 1461

 Score =  201 bits (512), Expect = 8e-49,   Method: Compositional matrix adjust.
 Identities = 177/652 (27%), Positives = 289/652 (44%), Gaps = 104/652 (15%)

Query: 12   MDETIVQELLTVSLGLHGNR-PLLLVR-TQHELLIYQAFRHPKGALKL----RFKKL-KV 64
            M +  + E+L   LG   ++ P L+VR    +L IYQ  R P     L    RF K+   
Sbjct: 846  MAKEPLTEILVADLGDAVSKAPYLIVRHANDDLTIYQPLRTPSSLGSLSESLRFLKVPNP 905

Query: 65   LF----VSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELR 120
            +F    VS  S  A+ Q      +R   +R   NI GY  VFL G   +++  +++ + R
Sbjct: 906  VFAKSPVSISSDDASSQ------LRAMPLRVCENIGGYSTVFLPGSSASFVLKSAKSQPR 959

Query: 121  AHPMTIDG-PVSTLAPFHNVNCPRGFLYFNAKSELRISVLP-----THLSYDAPWPVRKV 174
               +++ G  V +L+PFH  +  R F+Y + +   R+  +P     T L   A    RKV
Sbjct: 960  V--VSLQGTAVRSLSPFHTESSERSFIYVDVEGSGRVCSMPAGWNLTELGVCA----RKV 1013

Query: 175  PLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHV 234
             L    + LAYH  T TY + TS  E     ++   +D       ++S    PL  +  +
Sbjct: 1014 ALDTDANALAYHPPTGTYAVGTSALE----AFELPKDDPHRADWNKESTAFRPLAERGRL 1069

Query: 235  SLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCR 294
             L SP SW  I      +  +E V+C+K +++E     +  +  +A+GT  +  ED+  R
Sbjct: 1070 LLMSPGSWSTIDTVE--MEPYEVVMCVKTLNLEVSEATNERKQLVAVGTAISRGEDLAIR 1127

Query: 295  GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQ--KGPVTAICHVA--GFLVTAVGQKIYI 350
            GR+ +FD++ V+PEPG+P T  K+K+I AKE   +G VTA+  +   G ++ A GQK  +
Sbjct: 1128 GRVYVFDVVSVIPEPGRPETNRKLKLI-AKEDIPRGAVTAVSEIGTQGLMLVAQGQKCLV 1186

Query: 351  WQLK-DNDLTGIAFIDTEVYIASMVSV--KNLILVGDYARSIALLRYQPEYRTLSLVARD 407
              LK D  L  +AF+D   Y+ S   +    L ++ D  + +    Y  E          
Sbjct: 1187 RGLKEDGTLLPVAFMDMNCYVTSAKELPGTGLCVMSDAFKGVWFTGYTEE---------- 1236

Query: 408  YKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGF 467
                           P + I+ G    +   L++               D+L +   +  
Sbjct: 1237 ---------------PYKMILFGKSNTRLHALNV---------------DLLPDGKELFI 1266

Query: 468  MISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF-------FKIRCKPSSIS 520
            +++D D N+ +  + PE  +S  GH L+ +  F  G H +T        F    +P++  
Sbjct: 1267 VVTDADGNLHVMQFDPEHPKSLQGHILLHRATFCTGAHFSTLSLLLPSTFTPADRPTANG 1326

Query: 521  DAPGARSR--------FLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 572
            +  GA S+              S  G L   +PL E  YRRL  L   + T  + T GLN
Sbjct: 1327 ETNGASSQPEAQQHQQHQLLLGSPTGLLASLVPLSESEYRRLSSLAGQLATSLTQTAGLN 1386

Query: 573  PRAFRTYKGKGYYAGNP------SRGIIDGSLVWKFLQLSLGERLEICKKIG 618
            P+ +R   G       P       R ++DG+L+ ++ +L  G + EI  ++G
Sbjct: 1387 PKGYRMTAGSAAATLAPGVDAAVGRSVVDGALLARWTELGSGRKGEIAGRVG 1438


>gi|303285993|ref|XP_003062286.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226455803|gb|EEH53105.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 1469

 Score =  201 bits (510), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 172/645 (26%), Positives = 286/645 (44%), Gaps = 105/645 (16%)

Query: 30   NRPLLL-VRTQHELLIYQAFR----HPKG-AL-KLRFKKLKVLFVSDRSKRANEQPGLPR 82
             RPLL  +R    +L+Y+AF      P G AL +LRF ++ V         A +   LP 
Sbjct: 884  ERPLLTALRADGAVLVYRAFTCAVAGPGGRALTQLRFARVPVEL-EGGGGGAVDLSALP- 941

Query: 83   GVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGP-VSTLAPFHNVNC 141
            G R+++     +  G +GVF+ GP P WL L  R  + A P+  +   V +   FHNVNC
Sbjct: 942  GSRLTRFERVGDRGGIRGVFVSGPQPLWL-LARRSRVLALPVRGEAQRVVSFTAFHNVNC 1000

Query: 142  PRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTST--- 198
              GF+   A   +RI  +P  + Y+A WPVRK+ L+CTPH + Y  + K Y + TS    
Sbjct: 1001 HAGFILGTAAGGVRICQIPGRMHYEAAWPVRKLALRCTPHHVQYLPDFKLYALSTSAPAK 1060

Query: 199  -AEPSTDYYKFNGEDKELVTDPRDSRFIP--PLVSQFHVSLFSPFSWEEIPQTNFPLHEW 255
              EP       +      V   R ++ +    +  QF V L  P S E     +  +   
Sbjct: 1061 WVEPEVAEEDIHA---ATVVKTRRAKAMARGGVEEQFAVKLLVPGSLET--AWSRTMDPG 1115

Query: 256  EHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTK 315
            EHV  +KNV +    T   L   +A+GT     ED  CRGR++LF+I             
Sbjct: 1116 EHVQAVKNVQVRNLRT-GALHSMLAVGTAMPGGEDTPCRGRVILFEI------------- 1161

Query: 316  NKIKMIYAKEQKGP---------VTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDT 366
               +M+  + ++ P         + A+  + G LV A+G K+ +      +L  +AF DT
Sbjct: 1162 -SWQMVDGETRRVPLLLLFFDDALAALSGLEGHLVVAIGTKLIVHAWDGAELIPVAFFDT 1220

Query: 367  EVYIASMVSVKNLILVGDYARSIALLRYQPEYRT----LSLVARDYKPTQPNSKGYYAGN 422
             V+  ++  VKN + +GD  +     R++ + RT    L  +A+D++     S  +    
Sbjct: 1221 PVHTVTINVVKNFVCIGDVQKGAYFFRWKDDPRTGEKNLIQLAKDFESMDVLSTEF---- 1276

Query: 423  PSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQ 482
                ++DG                                S++  + +D   N  +F Y 
Sbjct: 1277 ----LVDG--------------------------------STLSLLAADTAGNAYVFAYD 1300

Query: 483  PEARESNGGHRLIKKTDFHLGQHVNTF--FKIRCKPSSISDAPGA---------RSRFLT 531
            P++ ES  G +L+ K  FH+G  V+    FK++    + +D   A          +R   
Sbjct: 1301 PKSSESWKGQKLLTKASFHVGSPVHRMVRFKLKTPTGAGNDGRAAPTPAEIKANANRHAV 1360

Query: 532  WYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFR---TYKGKGYYAGN 588
            ++ +LDG+LG  +P+    + +L +LQ  +  +T+   GLN R++R   T +G+   +  
Sbjct: 1361 FFGTLDGSLGILVPMESSTHAKLEVLQRWLNYNTAQNAGLNGRSYRAPKTTEGRAMRSPA 1420

Query: 589  PSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEA 633
            P   ++DG ++  F  L+  ++ E     G    + L  L+ + A
Sbjct: 1421 P-HNLLDGEMLQGFESLAWTKQAEAADAAGMTREEALTYLHTLSA 1464


>gi|121797760|sp|Q2TZ19.1|CFT1_ASPOR RecName: Full=Protein cft1; AltName: Full=Cleavage factor two protein
            1
 gi|83775384|dbj|BAE65504.1| unnamed protein product [Aspergillus oryzae RIB40]
          Length = 1393

 Score =  198 bits (504), Expect = 7e-48,   Method: Compositional matrix adjust.
 Identities = 169/652 (25%), Positives = 298/652 (45%), Gaps = 93/652 (14%)

Query: 16   IVQELLTVSLGLH-GNRPLLLVRTQHE-LLIYQAFRHPKGAL-----KLRFKKLKVLFVS 68
            ++ E++   LG    + P L++R++H+ L +Y+ F     ++      L F K   L + 
Sbjct: 796  VLTEIVVADLGDSWSSFPYLIIRSRHDDLAVYRPFISITKSVGEPHADLNFLKETNLVLP 855

Query: 69   DRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDG 128
              +    +Q      ++   +R  SNI+G+  +F  G  P ++  TS      H + + G
Sbjct: 856  RITSGVEDQSSTEEVIKSVPLRIVSNISGFSAIFRPGVSPGFIVRTSTSS--PHFLGLKG 913

Query: 129  P-VSTLAPFHNVNCPRGFLYFNAKSELRISVLPTH--LSYDA------PWPVRKVPLKCT 179
                +L+ F    C  GF+  ++K    I +  T+  LS+        PW ++++P+   
Sbjct: 914  GYAQSLSKFQTSECGEGFILLDSKVLCFILLCLTYCILSFHTGCHSYYPWTIQQIPIGEQ 973

Query: 180  PHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRD--SRFIPPLVSQFHVSLF 237
               LAY   +  Y I TS        +K   ED EL  + R+  + F P  V +  + + 
Sbjct: 974  VDHLAYSSSSGMYVIGTS----HRTEFKLP-EDDELHPEWRNEMTSFFPE-VQRSSLKVV 1027

Query: 238  SPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRI 297
            SP +W  I          EHV+ +KN+S+E        +  I +GT +   ED+  RG +
Sbjct: 1028 SPKTWTVIDSPA------EHVMAVKNMSLEISENTHERKDMIVVGTAFARGEDIASRGCV 1081

Query: 298  LLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA--GFLVTAVGQKIYIWQLK- 354
             +F++I+VVP+P +P    K++++  +  KG VTA+  +   GFL+ A GQK  +  LK 
Sbjct: 1082 YVFEVIKVVPDPKRPEMDRKLRLVGKEPVKGAVTALSEIGGQGFLIVAQGQKCIVRGLKE 1141

Query: 355  DNDLTGIAFIDTEVYIASMVSVK--NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQ 412
            D  L  +AF+D + +++ +  +K   + ++ D  + +    Y  E   +SL A+D     
Sbjct: 1142 DGSLLPVAFMDVQCHVSVVKELKGTGMCIIADAVKGLWFAGYSEEPYKMSLFAKDL---- 1197

Query: 413  PNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDK 472
                                          + LE+         D L + + +  +++D 
Sbjct: 1198 ------------------------------DYLEVLAA------DFLPDGNKLFILVADS 1221

Query: 473  DKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI-RCKPSS---ISDAPGAR-- 526
            D N+ +  Y PE  +S+ G RL+ ++ FH G  ++T   + R   SS   ISD       
Sbjct: 1222 DCNLHVLQYDPEDPKSSNGDRLLSRSKFHTGNFISTLTLLPRTSVSSEQMISDVDAMDVD 1281

Query: 527  ---SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKG 583
                R      S +G++G    + E++YRRL  LQ+ +     H  GLNPRAFR  +   
Sbjct: 1282 IKIPRHQMLITSQNGSVGLVTCVSEESYRRLSALQSQLTNTIEHPCGLNPRAFRAVESD- 1340

Query: 584  YYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALS 635
               G   RG++DG L++++L +S   ++EI  ++G+   +I     D EA+S
Sbjct: 1341 ---GTAGRGMLDGKLLFQWLDMSKQRKVEIASRVGANEWEI---KADFEAIS 1386


>gi|294659889|ref|XP_462318.2| DEHA2G17908p [Debaryomyces hansenii CBS767]
 gi|218511978|sp|Q6BHK3.2|CFT1_DEBHA RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two protein
            1
 gi|199434312|emb|CAG90824.2| DEHA2G17908p [Debaryomyces hansenii CBS767]
          Length = 1342

 Score =  197 bits (502), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 139/552 (25%), Positives = 252/552 (45%), Gaps = 60/552 (10%)

Query: 88   QMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLY 147
            ++ YF N+ G+  +F+ G  P ++  T+    R    T   P  + AP+ +     G +Y
Sbjct: 838  RLVYFPNVNGFTSIFVTGITPYYISKTTHSVPRIFKFT-KLPAVSFAPYSDDKIKNGLIY 896

Query: 148  FNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYK 207
             +     RI  +P   +Y+  WP++K+P+K +   + YH  + T+ I T    P   Y  
Sbjct: 897  LDNSKNARICEIPVDFNYENNWPIKKIPIKESIKSVTYHELSNTFVISTYEEIP---YDC 953

Query: 208  FNGEDKELVTDPRDSRFIPPLVSQF--HVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVS 265
             + E K +V   +      P  + +  ++ L SP++W  I      L + E  + ++++ 
Sbjct: 954  LDEEGKPIVGVDKSK----PSANSYKGYIKLISPYNWSVIDT--IELVDGEIGMNVQSMV 1007

Query: 266  MEYEGTLSGLRG---YIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIY 322
            ++   +    +     I +GT     ED++  G   +F+II+++PEPG+P T +K K I+
Sbjct: 1008 LDVGSSTKKFKNKKELIVIGTGKYRMEDLSANGSFKIFEIIDIIPEPGKPETNHKFKEIH 1067

Query: 323  AKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILV 382
             ++ KG VT+IC ++G  + + GQKI I  L+D+ +  +AF+DT VY++   S  NL+++
Sbjct: 1068 QEDTKGAVTSICEISGRFLVSQGQKIIIRDLQDDGVVPVAFLDTSVYVSEAKSFGNLLIL 1127

Query: 383  GDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLG 442
            GD  +SI L  +  E   + ++ +D +    N                            
Sbjct: 1128 GDSLKSIWLAGFDAEPFRMVMLGKDLQSLDVN---------------------------- 1159

Query: 443  ERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHL 502
                 C     K  +I         +I+D +  + L  Y PE   S+ G RLI K  F++
Sbjct: 1160 -----CADFIIKDEEIF-------ILIADNNSTLHLVKYDPEDPTSSNGQRLIHKASFNI 1207

Query: 503  GQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMV 562
                 T   IR  P +    P +   F +  +++DG+     P+ E +YRR+ +LQ  + 
Sbjct: 1208 NS---TPTCIRSIPKNEEINPSSTEVFQSIGSTIDGSFYTVFPINEASYRRMYILQQQIT 1264

Query: 563  THTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK-- 620
                H  GLNPR  R            ++ ++D  ++  F +L+   R  +  K+ SK  
Sbjct: 1265 DKEYHFCGLNPRLNRFGGLSMTVNDTNTKPLLDYEVIRMFAKLNEDRRKNLSMKVSSKNV 1324

Query: 621  HNDILDELYDIE 632
            + DI  +L + +
Sbjct: 1325 YQDIWKDLIEFD 1336


>gi|440637976|gb|ELR07895.1| hypothetical protein GMDG_02777 [Geomyces destructans 20631-21]
          Length = 1495

 Score =  197 bits (502), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 179/657 (27%), Positives = 289/657 (43%), Gaps = 91/657 (13%)

Query: 10   SAMDETIVQELLTVSLGLHGNRPLLLVRTQHE-LLIYQAFRHPKGALK-----LRFKKLK 63
            S + ET+ + LL          P L+ R  ++ L IY+ F+ P  A +     L F+K+ 
Sbjct: 892  STVAETLTEVLLADLGDATSKSPYLIFRASNDDLTIYEPFQVPSEAPRPLSKSLHFQKIH 951

Query: 64   VLFVSDRSKRANEQPGLPRGV-RISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAH 122
               V+  +    E         R S MR  +N+ G   VFL G  P+++  +S+   R  
Sbjct: 952  NPHVAKTANPETEVAADAESAKRGSPMRAIANVGGLSSVFLPGDSPSFVVKSSKSTPRVV 1011

Query: 123  PMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVL-PTHLSYDAPWPVRKVPLKCTPH 181
             +   G V +L+ FH   C RGF+Y ++K   R+S L P     D    +RKV +     
Sbjct: 1012 GLRGHG-VRSLSGFHTEGCDRGFIYVDSKGIARVSQLEPETNVTDIGLTLRKVKIGEEVQ 1070

Query: 182  FLAYHLETKTYCIVTSTAEP-----STDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSL 236
             + YH     Y I T   EP       DY++     KE +T         PL  +  + L
Sbjct: 1071 AVTYHPPKDVYVIGTVVKEPFELPKDDDYHREWA--KEDIT-------FKPLTGRGFLKL 1121

Query: 237  FSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGR 296
             +P +W  I +     HE   ++C+K +++E        +  I +GT  +  ED+  RGR
Sbjct: 1122 LNPSNWSVIDKVELDSHEI--IMCIKTLNLEVSENTHERKQLITVGTAISKGEDLAIRGR 1179

Query: 297  ILLFDIIEVVPEPGQPLTKNKIKMIYAKEQ--KGPVTAICHVA--GFLVTAVGQKIYIWQ 352
            + ++++I VVP P +P T  K+K+I AKE+  +G +T I  +   GF++ A GQK  +  
Sbjct: 1180 VYVYEVITVVPFPDRPETNKKLKLI-AKEEIPRGAITGISEIGTQGFMIVAQGQKSMVRG 1238

Query: 353  LK-DNDLTGIAFIDTEVYIASMVSV--KNLILVGDYARSIALLRYQPEYRTLSLVARDYK 409
            LK D  L  +AFID   Y+ ++ S+    + L  D  + +    Y  E   +++  +   
Sbjct: 1239 LKEDGTLLPVAFIDMNTYVTTVKSLPGTGMCLFADAIKGVWFAGYSEEPYKMTIFGK--- 1295

Query: 410  PTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMI 469
                 S+G         +I   L      L +G+ L I                    ++
Sbjct: 1296 ----QSQGME-------VITADL------LPIGDELYI--------------------IV 1318

Query: 470  SDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQH-----------VNTFFKIRCKPSS 518
            +D D N+ +  + PE  +S  G  L+++T F LG H             T        S+
Sbjct: 1319 ADSDCNLHVLQFDPEHPKSLHGQLLLQRTTFSLGGHMPTTMTLLPLTTTTQTPTPAVTST 1378

Query: 519  ISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRT 578
             S+     S  L   +S  G +    PL E+ YRRL  L N +     H GGLNP+A R 
Sbjct: 1379 ASEPTNPASGLLMTLSS--GVVAILTPLSEQQYRRLNALSNHLSNLLYHPGGLNPKAHRI 1436

Query: 579  YKG--KGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEA 633
                 +    G P   I+DGS++W++L+L   +R E+  ++G     I ++L +I A
Sbjct: 1437 SNTAPEAVIGGRP---IVDGSVLWRWLELGSQKRAEVAGRVGVDGETIREDLQEIAA 1490


>gi|395324102|gb|EJF56549.1| hypothetical protein DICSQDRAFT_93527 [Dichomitus squalens LYAD-421
            SS1]
          Length = 1433

 Score =  197 bits (502), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 168/648 (25%), Positives = 292/648 (45%), Gaps = 92/648 (14%)

Query: 9    PSAMDETIVQELLTVSLGLHGNRPLLLVRTQH-ELLIYQAFRHPKGALKL---RFKKLKV 64
            P    E  V +L+   LG    RP L+V  +  +L IY+A      A  L   R   L V
Sbjct: 841  PRKPQELDVDQLVIAPLGESHPRPHLIVLLRSGQLAIYEAVAASPPADPLPPTRSLTLLV 900

Query: 65   LFVSDRSK-------RANEQPGLPRGVRISQMR--YFSNIA---GYQGVFLCGPHPAWLF 112
              V  +SK          ++  L    RIS++   + ++ A    Y GVF  G  P+W+ 
Sbjct: 901  NLVKVKSKAFDIQHTEEEQKSVLAEQKRISRLLLPFVTSPAPGQTYSGVFFTGDRPSWIV 960

Query: 113  LTSRGELRAHPMTIDGPVSTLAPFHNV-----NCP----RG-FLYFNAKSELRISVLPTH 162
             T +G +R  P             HNV      C     RG FL ++ +    +  +P  
Sbjct: 961  STDKGGVRVFPSG-----------HNVVHAFTTCSLWESRGDFLLYSEEGPSLVEWMP-D 1008

Query: 163  LSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDS 222
            +  DA  P R VP +  P+    H+       +   A    + +    ED  +V +P   
Sbjct: 1009 IILDAHLPARSVP-RSRPY---SHVVFDASSSLIVAASSFMNRFASYDEDGNIVWEPDSP 1064

Query: 223  RFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALG 282
                P      + L SP  W  +    F  +E+  V C+ +V +E   T SG++ +IA+G
Sbjct: 1065 NISFPHCETSTLELISPDGWITMDGYEFAANEF--VSCVVSVPLETVSTESGMKDFIAVG 1122

Query: 283  TNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTK-NKIKMIYAKEQKGPVTAICHVAGFLV 341
            T  N  ED+  +G + +F+I+EVVP+    + +  ++K++   + KGPV+ +C + G+LV
Sbjct: 1123 TTINRGEDLAVKGAVYIFEIVEVVPDASLNIKRWWRLKLLCRDDAKGPVSFLCGMNGYLV 1182

Query: 342  TAVGQKIYIWQLK-DNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRT 400
            +++GQKI++     D  L G+AF+D  VY+ S+ +VKNL+++GD  +S+  + +Q +   
Sbjct: 1183 SSMGQKIFVRAFDLDERLVGVAFLDVGVYVTSLRAVKNLLVIGDAVKSVWFVAFQEDPYK 1242

Query: 401  LSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILD 460
            L ++ +D               P    +                            D+  
Sbjct: 1243 LVILGKD---------------PHHCCV-------------------------TRADLFF 1262

Query: 461  EFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSIS 520
                +  +  D++  V L+ Y P   ES GG  L+++T+FH      +   +  +P +  
Sbjct: 1263 ADGHLSIVTCDEEGVVRLYAYDPHDPESKGGQHLLRRTEFHGQTEYRSSLLVARRPKA-G 1321

Query: 521  DAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYK 580
            D    ++R +    S+DG+L     + E  ++RL +LQ  ++    H   LNP+AFR  +
Sbjct: 1322 DPEIPQARLIC--GSVDGSLTTLTYVDENAFKRLHLLQGQLIRTVQHVAALNPKAFRMVR 1379

Query: 581  GKGYYAGNP-SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDE 627
             +  Y   P S+G++DG+L+  F  L +G + E+ ++IG+    +L +
Sbjct: 1380 NE--YVSRPLSKGVLDGNLLATFEDLPIGRQNEVTRQIGTDRATVLKD 1425


>gi|67521912|ref|XP_659017.1| hypothetical protein AN1413.2 [Aspergillus nidulans FGSC A4]
 gi|74598221|sp|Q5BDG7.1|CFT1_EMENI RecName: Full=Protein cft1; AltName: Full=Cleavage factor two protein
            1
 gi|40745387|gb|EAA64543.1| hypothetical protein AN1413.2 [Aspergillus nidulans FGSC A4]
 gi|259486722|tpe|CBF84808.1| TPA: Protein cft1 (Cleavage factor two protein 1)
            [Source:UniProtKB/Swiss-Prot;Acc:Q5BDG7] [Aspergillus
            nidulans FGSC A4]
          Length = 1339

 Score =  197 bits (501), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 172/640 (26%), Positives = 295/640 (46%), Gaps = 87/640 (13%)

Query: 17   VQELLTVSLG-LHGNRPLLLVRTQHE-LLIYQAF-RHPKGALKLRFKKLKVLFVSDRSKR 73
            V ++  V LG  + + P L++RT+++ L++Y+ F  + K    LRF K     +      
Sbjct: 759  VLQIAVVELGDSYSSLPFLILRTENDDLVVYKPFFTNSKELTGLRFLKEANHTLPKTPNT 818

Query: 74   ANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGP-VST 132
             +E         +  +R   NIAG   +F+ GP   ++F  S      H + + G  +  
Sbjct: 819  TDELQS-----EMKPLRILPNIAGCSSIFMPGPSAGFIFRAST--TSPHFIRLRGGFIKG 871

Query: 133  LAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTY 192
            L  F + +  +GF Y ++   L ++ LP       PW +R VP+      L Y   + TY
Sbjct: 872  LGCFDSPD--KGFAYLDSHG-LHLAKLPEGTQLGYPWIMRTVPIGQQIDKLTYVSASDTY 928

Query: 193  CIVTSTAEPSTDYYKFN-GEDKELVTDPRDSR--FIPPLVSQFHVSLFSPFSWEEIPQTN 249
             + T          +F   ED EL  + R+    F+P  V+Q  + + SP +W  I   +
Sbjct: 929  VLGT------CQRCEFRLPEDDELHPEWRNEEISFLPE-VNQSSLKVVSPKTWSVI--DS 979

Query: 250  FPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEP 309
            +PL   EH++ +K +S+E        R  I +GT+    ED+  RG I +F++IEVVP+P
Sbjct: 980  YPLEPAEHIMVMKTMSLEVSENTHERRDMIVVGTSLARGEDIPSRGCIYVFEVIEVVPDP 1039

Query: 310  GQPLTKNKIKMIYAKEQKGPVTAICHVA--GFLVTAVGQKIYIWQLK-DNDLTGIAFIDT 366
             QP T  ++K+I  +  KG VTA+  +   GFL+ A GQK  +  LK D  L  +AF+D 
Sbjct: 1040 EQPETNRRLKLIGKEPVKGAVTALSEIGGQGFLIAAQGQKSMVRGLKEDGSLLPVAFMDM 1099

Query: 367  EVYIASMVSVK--NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPS 424
            + +++ +  +K   + + GD  + +    Y  E   +SL A+D                 
Sbjct: 1100 QCFVSVIKELKGTGMCIFGDAVKGLWFAGYSEEPYKMSLFAKDL---------------- 1143

Query: 425  RGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPE 484
                              + LE+         D L + + +  +++D D N+ +  Y PE
Sbjct: 1144 ------------------DYLEVLAA------DFLPDGNKLFIVVADSDCNLYVLQYDPE 1179

Query: 485  ARESNGGHRLIKKTDFHLGQHVNTFFKI-RCKPSSISDAPGARSRFLTWYASL------- 536
               S+ G +L+ ++ FH G   +T   + R   SS     G+    +   A L       
Sbjct: 1180 DPNSSNGDKLLNRSKFHTGNFASTVTLLPRTLVSSERAMSGSDKMDIDNTAPLHQVLVTS 1239

Query: 537  -DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIID 595
             +G++G    +PE++YRRL  LQ+ +     H  GLNPRA+R  +       +  RG++D
Sbjct: 1240 HNGSIGLVTCVPEESYRRLSALQSQLTNTLEHPCGLNPRAYRAVESD----ASAGRGMLD 1295

Query: 596  GSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALS 635
             +L+ ++L +S   + EI  ++G+   +I     D+EA+S
Sbjct: 1296 SNLLLQYLDMSKQRKAEIAGRVGATEWEI---RADLEAIS 1332


>gi|346971831|gb|EGY15283.1| cft-1 [Verticillium dahliae VdLs.17]
          Length = 1445

 Score =  197 bits (501), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 171/660 (25%), Positives = 289/660 (43%), Gaps = 88/660 (13%)

Query: 5    RSHSPSAMDETIVQEL-LTVSLGLHGNRPLLLVRTQHELLIYQAFR------HPKGALKL 57
            R  SP  + E +V +L  + S   H    L+L     ++ IY+ FR          A  L
Sbjct: 832  RGTSPETLTEILVADLGDSTSASAH----LILRHANDDMTIYEPFRIGGQEEKEDLANSL 887

Query: 58   RFKKLKVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRG 117
             FKK+    ++     A E   +    R+  +R   NI GY  VFL G  P+++  +S+ 
Sbjct: 888  FFKKVSNSHLAKSPVEAAEDEAVQEN-RVIPLRACDNIGGYSTVFLPGASPSFILKSSKS 946

Query: 118  ELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSY-DAPWPVRKVPL 176
              +   +   G V+ ++ FH   C RGF+Y ++K   R++  P   +  +    VRKVP+
Sbjct: 947  TPKVIGLQGLG-VNGMSSFHTEGCERGFIYADSKGCARVTQFPDAANVAELGVSVRKVPI 1005

Query: 177  KCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSL 236
                  +A+H   + Y + +S  EP    ++   +D       ++   +PP+     + L
Sbjct: 1006 DTAVSHVAWHPNMEVYAVASSKLEP----FELPKDDDYHKEWAKEECPMPPMKEHGSIKL 1061

Query: 237  FSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGR 296
            +SP +W  I +  F L ++E  +C+K + +E        R   A+GT     ED+  RGR
Sbjct: 1062 YSPITWNVIDE--FELEQYEVAMCMKTLLLEVSEETKERRMLFAVGTAILRGEDLPVRGR 1119

Query: 297  ILLFDIIEVVPEPGQPLTKNKIKMIYAKEQ--KGPVTAICHVA--GFLVTAVGQKIYIWQ 352
            IL+FD++ V+P+P +P T  K+K+I AKE+  +G VT++C V   G ++ A GQK  +  
Sbjct: 1120 ILVFDVVHVIPQPDRPETDRKLKLI-AKEEIPRGAVTSLCEVGTQGLMLVAQGQKCMVRG 1178

Query: 353  LK-DNDLTGIAFIDTEVYIASMVSVKNL--ILVGDYARSIALLRYQPEYRTLSLVARDYK 409
            LK D  L  +AF+D   Y+ ++  ++N    L+ D    +  + Y  E   ++L  +   
Sbjct: 1179 LKEDGTLLPVAFLDMSTYVVAVHELRNTGYCLMADANMGVWFVGYSEEPYRMTLFGKS-- 1236

Query: 410  PTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMI 469
                                            G +L+          D L   + +  + 
Sbjct: 1237 --------------------------------GTQLKCLTA------DFLVAGNDLSIVA 1258

Query: 470  SDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQH----VNTFFKIRCKP-----SSIS 520
            SD+D  + +  + PE   S  GH L+ +  F +  +         +   +P        +
Sbjct: 1259 SDEDGVLHILQFDPEHPRSLQGHLLLNRASFSVAPNHAWATLVLPRTTTRPYLPQSEPAT 1318

Query: 521  DAPGARSRFLT-WYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTY 579
             A G+++R  T   AS  GA+    P+ E  YRRL  L   +     H  G+NP+A R  
Sbjct: 1319 GAAGSQNRTQTLLLASASGAIASLNPITEHAYRRLTSLTTSLANALPHAAGMNPKAHRLP 1378

Query: 580  KGKGYYAGNP-------SRGIIDGSLVWKFLQLSLGERLEICKKIG-SKHNDILDELYDI 631
               G  A  P        R I+DG+L+ ++ +L   +R E   K G +   D+  EL D+
Sbjct: 1379 PQDG--AARPPAVDVSAGRTIVDGALLARWNELGARQRAEAAGKGGFASAADVRGELEDV 1436


>gi|170102106|ref|XP_001882269.1| predicted protein [Laccaria bicolor S238N-H82]
 gi|164642641|gb|EDR06896.1| predicted protein [Laccaria bicolor S238N-H82]
          Length = 1406

 Score =  197 bits (501), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 163/639 (25%), Positives = 275/639 (43%), Gaps = 80/639 (12%)

Query: 9    PSAMDETIVQELLTVSLGLHGNRPLLLVRTQH-ELLIYQAF---RHPKGALKLRFKKLKV 64
            P    E  V+++L   +G    RP L V  +  +L IY+     R  +   K+R   +K+
Sbjct: 816  PRKPQEFDVEQILVAPIGESSPRPHLCVFLRSGQLTIYEVLPLGRTTEALPKVRPAHVKI 875

Query: 65   LFVSDRS-----KRANEQPGLPRGVRISQMRYF----------SNIAGYQGVFLCGPHPA 109
             FV   S     +R  E     +G+   Q R +          S    + GVF  G  P 
Sbjct: 876  KFVKISSMAFEIQRPEEGE---KGIIAEQKRIYRMFVPFVTSASPGVTFSGVFFTGDRPN 932

Query: 110  WLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPW 169
            W+F T +G ++ +P +    V+   P         FL +    E  +S       YD P 
Sbjct: 933  WIFGTDKGGVQIYP-SGHAVVNAFTPCSLFESKGDFLMYT--EEASVSKWLPDFHYDGPL 989

Query: 170  PVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLV 229
            P+R VP       L +   T      +S       Y     +D   + +P       P+ 
Sbjct: 990  PLRSVPRGRAYSSLVFDPSTSLLVAASSLQAKFASY----DDDDNKIWEPETPNIGNPMC 1045

Query: 230  SQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSE 289
                + L SP  W  I    F     E++  +  V++E  GT  G + +IA+GT  +  E
Sbjct: 1046 DTSTLELISPDMW--ITMDGFEFATNEYINDVACVTLETAGTEVGSKDFIAVGTTIDRGE 1103

Query: 290  DVTCRGRILLFDIIEVVPEPG-QPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKI 348
            D+  RG   +++I+EVVP+P   P    K+++    + KGPVTA+C   G+LV+++GQKI
Sbjct: 1104 DLAARGATYIYEIVEVVPDPAISPKRWYKLRLRCRDDAKGPVTAVCGFHGYLVSSMGQKI 1163

Query: 349  YIWQL-KDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARD 407
            ++     D  L G+AF+D  VY+ S+ ++KNL+LVGD  +S++ + +Q +   L L+ +D
Sbjct: 1164 FVRAFDSDERLVGVAFMDVGVYVTSLRTLKNLLLVGDAVKSLSFIAFQEDPYKLVLLGKD 1223

Query: 408  YKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGF 467
             +     +  ++         DG L                                   
Sbjct: 1224 TQHVCVTNADFF-------FTDGEL---------------------------------SL 1243

Query: 468  MISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARS 527
            +  D++  + ++ Y P+  +S  G  L+ +T+FH      T   I  +       P A+ 
Sbjct: 1244 VTGDEEGIMRMYEYNPQDPDSKDGRYLLLRTEFHGQSEYRTSTTIARRLKDDPSIPQAK- 1302

Query: 528  RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAG 587
                     DG L    P+ E  ++RL +LQ  +  +  H  GLNP+AFR  +    +  
Sbjct: 1303 ---LIIGGTDGCLSSLTPVEEHAFKRLQLLQGQLTRNIQHVAGLNPKAFRIVRND--FVS 1357

Query: 588  NP-SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDIL 625
             P S+GI+DG+L+  +  L +  + E+ ++IG+    +L
Sbjct: 1358 KPLSKGILDGNLLAHYESLPIIRQNEMTRQIGTDRVTLL 1396


>gi|408396642|gb|EKJ75797.1| hypothetical protein FPSE_03977 [Fusarium pseudograminearum CS3096]
          Length = 1427

 Score =  197 bits (500), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 173/627 (27%), Positives = 283/627 (45%), Gaps = 82/627 (13%)

Query: 17   VQELLTVSLG-LHGNRPLLLVRTQ-HELLIYQAFRH--PKG----ALKLRFKKL-KVLFV 67
            ++E+L   LG      P L++R Q  +L IY+  RH  P G    +  L FKK   V   
Sbjct: 835  LREILVADLGDTISQSPYLILRNQTDDLTIYEPIRHVRPGGESNLSAALSFKKTSNVTLA 894

Query: 68   SDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTID 127
            +  ++  +++   PR      MR  +NI GY  VFL G  P+++  +S+   R   +   
Sbjct: 895  TTPAQTEDDEVEQPR---FMPMRRCANINGYSTVFLPGSSPSFVLKSSKSIPRVIGLQGL 951

Query: 128  GPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSY-DAPWPVRKVPLKCTPHFLAYH 186
            G +  ++ FH   C RGF+Y + K   R++  P+  ++ +    V+KVPL      +AYH
Sbjct: 952  G-IRGMSSFHTEGCDRGFIYADDKGIARVTQFPSDTNFTELGISVKKVPLGSDVRGIAYH 1010

Query: 187  LETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIP 246
              T  Y     T+EP    ++   +D       +++   PP + +  + L SP +W  I 
Sbjct: 1011 QPTGAYIAGCMTSEP----FELPKDDDYHKEWAKETLSFPPTMPRGILKLISPITWTVI- 1065

Query: 247  QTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVV 306
              +  L   E + C+K + +E        R  +A+GT  +  ED+  RGR+ ++DI+ V+
Sbjct: 1066 -HDIELESCESIECMKTLHLEVSEDTKERRFLVAVGTAVSKGEDLPIRGRVHVYDIVTVI 1124

Query: 307  PEPGQPLTKNKIKMIYAKEQ--KGPVTAICHVA--GFLVTAVGQKIYIWQLK-DNDLTGI 361
            PEPG+P T  ++K I A+E   +G VTAI  +   G ++ A GQK  +  LK D  L  +
Sbjct: 1125 PEPGKPETNRRLKAI-AREDIPRGGVTAISEIGTQGLMLVAQGQKCMVRGLKEDGSLLPV 1183

Query: 362  AFIDTEVYIASM--VSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYY 419
            AF+D   +++S   +S   L L+ D  + +    Y  E  T  ++ + +           
Sbjct: 1184 AFLDMSCHVSSARELSRTGLCLMADAFKGVWFAGYTEEPYTFKVLGKSHG---------- 1233

Query: 420  AGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLF 479
                                    RL +         D L +   +  + +D D ++ + 
Sbjct: 1234 ------------------------RLPVVVA------DFLPDGDDLAIVAADVDGDLHIL 1263

Query: 480  MYQPEARESNGGHRLIKKTDFHLGQH--VNTFFKIRCKPSS---ISDAPGARSRFLTWYA 534
             + PE  +S  GH L+ +T F +  +    T    R  P S     D P      +   A
Sbjct: 1264 EFNPEHPKSLQGHLLLHRTSFSVSPNPPSTTLLLPRTTPPSHPTPQDPP-----HVLLLA 1318

Query: 535  SLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGK---GYYAGNPSR 591
            S  G L   +PLPE  YRRLL + N ++   +  GGLN +A R   G    G  A    R
Sbjct: 1319 SSSGHLSSLIPLPETAYRRLLSVTNQLLPALTPHGGLNAKAHRLPVGTRTVGVEAAG-GR 1377

Query: 592  GIIDGSLVWKFLQLSLGERLEICKKIG 618
             I+DG+++ ++ +LS  +R EI  K G
Sbjct: 1378 AIVDGAVLARWAELSAAKRAEIAGKGG 1404


>gi|330919204|ref|XP_003298516.1| hypothetical protein PTT_09264 [Pyrenophora teres f. teres 0-1]
 gi|311328242|gb|EFQ93393.1| hypothetical protein PTT_09264 [Pyrenophora teres f. teres 0-1]
          Length = 1388

 Score =  197 bits (500), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 174/644 (27%), Positives = 272/644 (42%), Gaps = 101/644 (15%)

Query: 10   SAMDETIVQELLTVSLG-LHGNRPLLLVRTQHE-LLIYQAFRHP-KGALKLRFKKLKVLF 66
            SA   TI  E+L   LG      P L++RT  + L+IY+AF  P + A     K L+ + 
Sbjct: 788  SAAKATIT-EILAADLGDATAKSPHLIIRTSSDNLVIYKAFHAPSRSASDQWTKNLRWVK 846

Query: 67   VSDRS-KRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMT 125
            +S +   R  E  G       S +    +I GY  VF  G  PA++   +    R   ++
Sbjct: 847  LSQQHVPRYIEDSGSEDSGFDSTLVALDDICGYSTVFQRGTTPAFILKEASSAPRVIGLS 906

Query: 126  IDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYD-APWPVRKVPLKCTPHFLA 184
               PV +L  FH  +C RGF Y ++   LRI  LP    Y    W  R++P+    H L 
Sbjct: 907  -GKPVKSLTSFHTSSCQRGFAYLDSTDTLRICQLPPQTHYGHLGWATRRMPMDSEVHTLT 965

Query: 185  YHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEE 244
            YH     Y + T  AE     Y+ +  +      P++     P + +  + L    SW  
Sbjct: 966  YH-PPGLYIVGTGQAED----YQLDPTETYHYDLPKEDLTFKPSIERGVIKLLDEKSWTI 1020

Query: 245  IPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIE 304
            I      L   E VL +K +++E        +  IA+GT+  + ED+  +G I +F++I 
Sbjct: 1021 I--DTHVLDPQEVVLSIKTLNLEVSEITHQRKDLIAVGTSVVHGEDLATKGCIRIFEVIT 1078

Query: 305  VVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA--GFLVTAVGQKIYIWQLK-DNDLTGI 361
            VVP+P +P T  ++K+I   E KG V+AI  +   GFL+ A GQK  +  LK D  L  +
Sbjct: 1079 VVPQPDRPETNRRLKLIVKDEVKGAVSAISELGTQGFLIMAQGQKCMVRGLKEDGTLLPV 1138

Query: 362  AFIDTEVYIASMVSV--KNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYY 419
            AF+D + Y++ + ++    ++ +GD  R +    Y  E   +SL AR             
Sbjct: 1139 AFMDMQCYVSDLKNLPGTGMLAMGDAYRGVWFTGYTEEPYKMSLFAR------------- 1185

Query: 420  AGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHN------DILDEFSSMGFMISDKD 473
                                             SKHN      D L     +  +++D D
Sbjct: 1186 ---------------------------------SKHNLETIAVDFLPFDQQLHLVVADAD 1212

Query: 474  KNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF------------------------ 509
             N+ +  + P+  +   G RL+ K  FH G    +                         
Sbjct: 1213 MNLQILQFDPDNPKGEAGSRLLHKATFHTGHFPTSLHLIHSHLKLPSATDFAATNNNPAD 1272

Query: 510  -FKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHT 568
             F +   P++ +D P  +      + +  G L    PL E +YRRL  L   +       
Sbjct: 1273 AFAMDTSPNTTTDTP-QQPFHQILHTTQSGTLALLTPLSEDSYRRLSNLSAYLANTLDSA 1331

Query: 569  GGLNPRAFRT--YKGKGYYAGNPSRGIIDGSLVWKFLQLSLGER 610
              LNPRAFRT      G+ AG  +RG++DG+L+ ++ +  LGER
Sbjct: 1332 CSLNPRAFRTGDVAEGGWDAGTGARGVLDGNLLLRWGE--LGER 1373


>gi|121925707|sp|Q0UUE2.1|CFT1_PHANO RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two protein
            1
          Length = 1375

 Score =  196 bits (499), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 175/628 (27%), Positives = 274/628 (43%), Gaps = 79/628 (12%)

Query: 17   VQELLTVSLGLHGNR-PLLLVRTQHE-LLIYQAFRHPKGAL------KLRFKKLKVLFV- 67
            + E+L   LG   +R P L+VRT ++ L+IY+A   P  +        LR+ KL    V 
Sbjct: 800  ITEILAADLGDATSRSPHLIVRTSNDDLVIYKAIHSPSRSSSDLWTHNLRWVKLSQQHVP 859

Query: 68   ----SDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHP 123
                    + A ++PG       S +    NI GY  V   G  PA++   S    R   
Sbjct: 860  RYMEDGAQEEAADEPGFE-----STLLALDNINGYSTVIQRGRSPAFILKESSSAPRVIG 914

Query: 124  MTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYD-APWPVRKVPLKCTPHF 182
            ++   PV +L  FH  +C RGF Y ++   LRIS LP    Y    W  R++P+    H 
Sbjct: 915  LS-GNPVKSLTRFHTSSCQRGFAYLDSTDTLRISQLPPSTHYGHLGWAARRMPMDAEVHA 973

Query: 183  LAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSW 242
            LAYH    +   V  T +P  + Y  +  D      P++     P V    + +    +W
Sbjct: 974  LAYH---PSGLYVIGTGQP--EEYTLDPNDTFHYELPKEETSFKPKVEHGIIKVMDEKTW 1028

Query: 243  EEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDI 302
              I      L   E +LC+K +++E   T    +  IA+GT     ED+  +G I +F++
Sbjct: 1029 TVI--DTHVLDPQEVILCIKTLNLEVSETTHQRKDVIAVGTAIVLGEDLATKGNIRIFEV 1086

Query: 303  IEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA--GFLVTAVGQKIYIWQLK-DNDLT 359
            I VVPEP  P T  ++K+I   E KG V+AI  +   GFL+ A GQK  +  LK D  L 
Sbjct: 1087 ITVVPEPDHPETNKRLKLIVKDEVKGTVSAISDLGTQGFLIMAQGQKSMVRGLKEDGTLL 1146

Query: 360  GIAFIDTEVYIASMVSVKN--LILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKG 417
             +AF+D + Y+ ++ ++ N  ++L+GD  +      Y  E   + L  R        SK 
Sbjct: 1147 PVAFMDMQCYVTTLKTLPNTGMLLMGDAYKGAWFTGYTEEPYKMMLFGR--------SKH 1198

Query: 418  YYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVV 477
            +             +   FL     E+L I                    +++D D N+ 
Sbjct: 1199 HLE----------CITADFLPFE--EQLHI--------------------IVADADMNLQ 1226

Query: 478  LFMYQPEARESNGGHRLIKKTDFHLGQHVNT--FFKIRCKPSSISDAPGARSRFLTWY-- 533
            +  + P+  +S GG RL++K+ FH G   +T    + R    + S+   + +  L  +  
Sbjct: 1227 VLQFDPDHPKSMGGTRLLQKSTFHTGHFPSTMHLLQSRLHMPTASEFTTSTTSSLPLHQI 1286

Query: 534  --ASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYK-GKGYYAGNPS 590
               S  G L    PL E +YRRL  L   +        GLN +AFR     +G +     
Sbjct: 1287 LCTSQSGTLALITPLSESSYRRLSGLATHLQQFLDSPCGLNGKAFRAADVMEGGWDAGTQ 1346

Query: 591  RGIIDGSLVWKFLQLSLGERLEICKKIG 618
            R ++DG L+ ++ +L    R E   K+G
Sbjct: 1347 RAMLDGGLLMRWGELGEQRRREGLGKVG 1374


>gi|340515387|gb|EGR45642.1| predicted protein [Trichoderma reesei QM6a]
          Length = 1441

 Score =  196 bits (498), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 181/662 (27%), Positives = 299/662 (45%), Gaps = 99/662 (14%)

Query: 11   AMDETIVQELLTVSLG--LHGNRPLLLVRTQHELLIYQAFRHPKGALK------LRFKK- 61
            A  ET+  E++   LG  +H +  L+L  + ++L IY+  R P           L FKK 
Sbjct: 832  ATRETLT-EIVVADLGDAVHASPYLILRHSTNDLTIYEPIRLPANETAHTLSDTLFFKKS 890

Query: 62   ----LKVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRG 117
                L    V D S    + P      R   +R  +N+ GY  VFL GP PA++  +SR 
Sbjct: 891  PNAVLAKSAVEDPSDDTAQPP------RYVPLRICANVGGYSSVFLPGPSPAFVIKSSRS 944

Query: 118  ELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSY-DAPWPVRKVPL 176
              R   +   G V  ++ FH   C RGF+Y +++   R++ LP+  ++ +    V+KVPL
Sbjct: 945  VPRVVGLQGHG-VRGMSTFHTEGCDRGFIYADSEGIARVTQLPSKTNFTELGISVKKVPL 1003

Query: 177  KCTPHFLAYHLETKTY---CIVTSTAE-PSTDYYKFNGEDKELVTDPRDSRFIPPLVSQF 232
                  +AYH  T+TY   C VT   E P  D Y      KE     R+S  +PP   + 
Sbjct: 1004 GFDVRHVAYHHPTETYIAGCAVTENFELPKDDDYH-----KEWA---RESVPLPPTAVRG 1055

Query: 233  HVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVT 292
             + L +P +W  I   +  +   E + C+K + +E        R  +A+GT  +  ED+ 
Sbjct: 1056 ALKLINPITWTVIHSID--MEAGESIECMKTLHLEVSEETKERRMLLAVGTALSRGEDLP 1113

Query: 293  CRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQ--KGPVTAICHVA--GFLVTAVGQKI 348
             RGR+ ++DI+ V+PEPG+P T  ++K++ AKE   +G VTA+  +   G ++ A GQK 
Sbjct: 1114 TRGRVQVYDIVTVIPEPGKPETNKRLKLL-AKEDIPRGGVTALSEIGTQGLMLVAQGQKC 1172

Query: 349  YIWQLK-DNDLTGIAFIDTEVYIASMVSV--KNLILVGDYARSIALLRYQPEYRTLSLVA 405
             +  LK D  L  +AF+D   +++S+  +    L L+ D  + +    Y  E  T  ++ 
Sbjct: 1173 MVRGLKEDGSLLPVAFLDMSCHVSSVRELPGTGLCLIADAFKGLWFAGYTEEPYTFKVLG 1232

Query: 406  RDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSM 465
            +                       GSL                        D L +   +
Sbjct: 1233 KS---------------------SGSLPLLV-------------------ADFLPDGEDL 1252

Query: 466  GFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQH--VNTFFKIRCKPSSISDAP 523
              +  D D ++ +  + PE  +S  GH L+ +T F +  +   +T    R  P+S S   
Sbjct: 1253 SMVAVDADGDMHVLEFNPEHPKSLQGHLLLHRTTFSVTPNPPTSTLLLPRTLPASQSSQD 1312

Query: 524  GARSRF----LTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTY 579
             + S      +   AS  G++    PLPE  YRRLL + N ++      GGL+ RA RT 
Sbjct: 1313 SSSSSSTQPHILLLASPSGSIAALTPLPESAYRRLLSVTNQLLPALVPHGGLHARAHRTP 1372

Query: 580  KGKGYYA-------GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIE 632
            +G G  +           R I+DG+++ ++ +L   +R E+  + G  ++ + +   D+E
Sbjct: 1373 EGGGGMSRTVGVETAATGRAIVDGTVLTRWNELGAAKRAEVATRGG--YDGVTEMREDLE 1430

Query: 633  AL 634
            A+
Sbjct: 1431 AV 1432


>gi|358387835|gb|EHK25429.1| hypothetical protein TRIVIDRAFT_32877 [Trichoderma virens Gv29-8]
          Length = 1440

 Score =  196 bits (498), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 180/662 (27%), Positives = 299/662 (45%), Gaps = 99/662 (14%)

Query: 11   AMDETIVQELLTVSLG--LHGNRPLLLVRTQHELLIYQAFRHPKGALK------LRFKK- 61
            A  ET+  E++   LG  +H +  L+L  +  +L IY+  R P  +        L FKK 
Sbjct: 831  ATRETLT-EIVVADLGDSVHSSPYLILRHSTDDLTIYEPIRLPTASATHALSDTLFFKKS 889

Query: 62   ----LKVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRG 117
                L    V D S    + P      R   +R  +N+ GY  VFL GP PA++  +S+ 
Sbjct: 890  ANSSLAKSAVEDPSDDTAQPP------RYVPLRTCANVGGYSAVFLPGPSPAFIIKSSKS 943

Query: 118  ELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSY-DAPWPVRKVPL 176
              R   +   G V  ++ FH   C RGF+Y +++   R++ LP+  +  +    V+KVPL
Sbjct: 944  IPRVVGLQGLG-VRGMSTFHTEGCDRGFIYADSEGIARVTQLPSKTNLTELGVSVKKVPL 1002

Query: 177  KCTPHFLAYHLETKTY---CIVTSTAE-PSTDYYKFNGEDKELVTDPRDSRFIPPLVSQF 232
                  +AYH  T+TY   C +T   E P  D Y      KE     R+S    P +++ 
Sbjct: 1003 GHDIRHVAYHHPTETYIAGCTITENFELPKDDDYH-----KEWA---RESLSFLPSMARG 1054

Query: 233  HVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVT 292
             + L +P +W  I   +  +   E + C+K + +E        R  +A+GT     ED+ 
Sbjct: 1055 ALKLINPITWTVIHSID--MEPGESIECMKTLHLEVSEETKERRMLLAVGTALTRGEDLP 1112

Query: 293  CRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQ--KGPVTAICHVA--GFLVTAVGQKI 348
             RGR+ ++DI+ V+PEPG+P T  ++K++ AKE+  +G VTA+  +   G ++ A GQK 
Sbjct: 1113 TRGRVQVYDIVTVIPEPGKPETNKRLKLL-AKEEIPRGGVTALSEIGTQGLMLVAQGQKC 1171

Query: 349  YIWQLK-DNDLTGIAFIDTEVYIASMVSV--KNLILVGDYARSIALLRYQPEYRTLSLVA 405
             +  LK D  L  +AF+D   ++++   +    L L+ D  + +    Y  E  T  ++ 
Sbjct: 1172 MVRGLKEDGSLLPVAFLDMSCHVSTARELPGTGLCLIADAFKGLWFAGYTEEPYTFKVLG 1231

Query: 406  RDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSM 465
            +                       GSL                        D L +   +
Sbjct: 1232 KS---------------------SGSLPLLV-------------------ADFLPDGEDL 1251

Query: 466  GFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQH--VNTFFKIRCKPSSIS--D 521
              +  D D ++ +  + PE  +S  GH L+ +T F +  +   +T    R  P+S S   
Sbjct: 1252 SMVAVDADGDIHVLEFNPEHPKSLQGHLLLHRTTFSVTPNPPTSTLLLPRTLPASQSATT 1311

Query: 522  APGARSR--FLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTY 579
            +P + S    L   AS  G L    PLPE  YRRLL + N ++      GGL+ RA RT 
Sbjct: 1312 SPDSSSSQPHLLLLASPSGCLASLTPLPESAYRRLLSVTNQLLPALVPHGGLHARAHRTP 1371

Query: 580  KGKGYYA-------GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIE 632
            +G G  +           R I+DG+++ ++ +L   +R E+  + G  ++ +++   D+E
Sbjct: 1372 EGGGGMSRTVGVETAASGRAIVDGAILARWNELGAAKRAEVATRGG--YDGVMEMREDLE 1429

Query: 633  AL 634
            A+
Sbjct: 1430 AV 1431


>gi|348679545|gb|EGZ19361.1| putative cleavage and polyadenylation specificity factor CPSF
            [Phytophthora sojae]
          Length = 1752

 Score =  196 bits (497), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 166/649 (25%), Positives = 279/649 (42%), Gaps = 137/649 (21%)

Query: 80   LPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMT-----IDGPVSTLA 134
            L  G R   +  F N+    G F  G HP W+ L  RG+    PM      +  PV +  
Sbjct: 1150 LRAGFRYPMLTTFYNVNNMSGAFFRGAHPMWI-LGDRGQPTFIPMCSAAPKVSVPVLSFT 1208

Query: 135  PFHNVNCPRGFLYFNAKSELRISVLP-----THLSYDAPWPVRKVPLKCTPHFLAY---- 185
            PFH+ NCP GF+YF+++  LR+  LP     T L     + ++K     T H + Y    
Sbjct: 1209 PFHHWNCPNGFIYFHSRGALRVCELPSSKTSTILPSSGGFVLQKAEFGATLHHMLYLGNH 1268

Query: 186  -------HLETKTYCIVTSTAEPSTDYYKFNG-EDKELVTDPRD---------SRFIPPL 228
                    LE  TY +V S     TD  +    ED +   +P +         S  + P 
Sbjct: 1269 GPGGVSEALEAPTYAVVCSVKMKPTDAERATEVEDADEEKEPENLDANGNPVGSNVMAPT 1328

Query: 229  VSQFHVSLFSPFSWEE-------IPQTN----------FPLH--EWEHVLCLK-----NV 264
               F        +  E       + QTN          F +H   +E VL +K     + 
Sbjct: 1329 AEMFPDFEIDQMAHTEEEVYELRLVQTNEFGEWGRRGVFRVHFERYEVVLSVKLMYLYDS 1388

Query: 265  SMEYEGTLSGL-------RGYIALGTNY--NYSEDVTCRGRILLF--DIIEVVPEPGQPL 313
            S+  E   S         R Y+ +GT +   + ED + RGR+LL+  D  + V E G   
Sbjct: 1389 SLMKEEVASTSAEWNKKKRPYLVIGTGWVGPHGEDESGRGRLLLYELDYAQYVDEEGGST 1448

Query: 314  TKN--KIKMIYAKE-QKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYI 370
            +    K+++++ KE ++G ++++  +  +++ AVG K+ +++ K   L G AF D +++I
Sbjct: 1449 SSKLPKLRLVFIKEHRQGAISSVVQLGPYVLAAVGSKLIVYEFKSEQLIGCAFYDAQMFI 1508

Query: 371  ASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDG 430
             ++  VK+ ++ GD  +S+  LR++   R L L+A+DY+P                    
Sbjct: 1509 VTLNVVKDFVMYGDVYKSVHFLRWREMQRQLVLLAKDYEP-------------------- 1548

Query: 431  SLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNG 490
             L     + S+ E+                    +  +  D D+N+ +  + P+  ES G
Sbjct: 1549 -LAVSATEFSVFEK-------------------KLALLAVDMDENLHVMQFAPQDIESRG 1588

Query: 491  GHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGAR------------SRFLTWYASLDG 538
            G RL++ +DFHLG  V + F+ R       D PG              S ++    + +G
Sbjct: 1589 GQRLLRVSDFHLGVQVASMFRKRV------DGPGGHVAVNGRGPRAPPSYYVNVMGNSEG 1642

Query: 539  ALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGY-YAGNPS------- 590
             +G  +P+ E+ +RRL  LQNVMV        LNPR FR  K       G P        
Sbjct: 1643 GVGALIPVGERVFRRLFTLQNVMVNTLPQNCALNPREFRMLKTNAQRRCGRPDAWSKKKW 1702

Query: 591  -RGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
             +  +D  ++++FLQL    + E+ + IG+    ++  L +++  ++ F
Sbjct: 1703 KKSFLDAFVLFRFLQLDYVAQKELARCIGTTPEVVIHNLLEVQHATATF 1751


>gi|310789917|gb|EFQ25450.1| CPSF A subunit region [Glomerella graminicola M1.001]
          Length = 1439

 Score =  195 bits (496), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 176/651 (27%), Positives = 281/651 (43%), Gaps = 90/651 (13%)

Query: 14   ETIVQELLTVSLG-LHGNRPLLLVR-TQHELLIYQAFRHPKG------ALKLRFKK---- 61
            +  + ELL   LG      P L++R    +L IY+  R          +  L F+K    
Sbjct: 840  QETLTELLVADLGDTTTTSPYLILRHANDDLTIYEPIRLESQDKTVGLSKTLHFQKITNP 899

Query: 62   -LKVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELR 120
             L    V      ANEQP      R   +R   NI GY  VFL G  P+++  +S+   +
Sbjct: 900  ALAKSPVEVADDEANEQP------RFVPLRPCPNINGYSTVFLPGASPSFIIKSSKSSPK 953

Query: 121  AHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSY-DAPWPVRKVPLKCT 179
               +   G V  ++ FH   C RGF+Y +++ + R++ LP   ++ +    VRK+P+   
Sbjct: 954  VIGLQGIG-VRGMSSFHTEGCERGFIYADSEGQTRVTQLPADTNFTELGVAVRKIPIGDN 1012

Query: 180  PHFLAYHLETKTYCIVTSTAE----PSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVS 235
               +AYH   +TY +  S  E    P  D Y      +   + P+  R I        + 
Sbjct: 1013 VGLIAYHPPMETYAVACSVLERFELPKDDDYHKEWAKEATTSYPQTERGI--------IK 1064

Query: 236  LFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRG 295
            L SP +W  I       HE    +C+K + +E        R  I +GT  N  ED+  RG
Sbjct: 1065 LMSPTTWSVIDTVELEPHEV--AMCMKTLHLEVSEETKERRMLITIGTAINRGEDLPIRG 1122

Query: 296  RILLFDIIEVVPEPGQPLTKNKIKMIYAKEQ--KGPVTAICHVA--GFLVTAVGQKIYIW 351
            RIL++D++ VVP+PG+P T  K+K++ AKE+  +G VT +C V   G ++ A GQK  + 
Sbjct: 1123 RILVYDVVPVVPQPGRPETNKKLKLV-AKEEIPRGAVTGLCEVGSQGLMLVAQGQKCMVR 1181

Query: 352  QLK-DNDLTGIAFIDTEVYIASMVSVKNL--ILVGDYARSIALLRYQPEYRTLSLVARDY 408
             LK D  L  +AF+D   Y+ ++  V+     L+ D  + +  + Y  E           
Sbjct: 1182 GLKEDGTLLPVAFMDMNCYVTAVREVRGTGYCLMTDAFKGVWFVGYAEE----------- 1230

Query: 409  KPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFM 468
                          P + ++ G    KF  L+                D +     +  +
Sbjct: 1231 --------------PYKMMLFGKSTGKFEVLTA---------------DFIIAGDELHIV 1261

Query: 469  ISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLG-QHVNTFFKIRCKPSSISDAPGARS 527
            + DKD  + +  + PE  +S  GH L+ +  F     H  T   +   P+S +     ++
Sbjct: 1262 VCDKDGVIHVMQFDPEHPKSLQGHLLLNRASFSAAPNHPTTTLSLPRTPASTATTSATKN 1321

Query: 528  RFLT-WYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYA 586
               T   AS  GAL    PL E+ YRRL  L N +     H    NP+A R         
Sbjct: 1322 PPTTLLLASPTGALASLTPLSEQAYRRLTSLANSIAGALPHAAATNPKAHRLQPLDARTP 1381

Query: 587  G---NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEAL 634
            G   +  R I+DG+L+ ++ +L  G R E+  K G  + D+L+   ++E +
Sbjct: 1382 GVDTSAGRSIVDGALLARWNELGAGRRSEVAGKGG--YGDVLEVRGELEGV 1430


>gi|46120520|ref|XP_385083.1| hypothetical protein FG04907.1 [Gibberella zeae PH-1]
          Length = 1436

 Score =  195 bits (496), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 173/630 (27%), Positives = 282/630 (44%), Gaps = 88/630 (13%)

Query: 17   VQELLTVSLG-LHGNRPLLLVRTQ-HELLIYQAFRH--PKG----ALKLRFKKLKVLFVS 68
            ++E+L   LG      P L++R Q  +L IY+   H  P G    +  L FKK+  + ++
Sbjct: 835  LREILVADLGDTISQSPYLILRNQTDDLTIYEPIHHVRPGGESNLSAALSFKKMSNVTLA 894

Query: 69   DRSKRAN----EQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPM 124
                +      EQP      R   MR  +NI GY  VFL G  P+++  +S+   R   +
Sbjct: 895  TTPAQTEDDDVEQP------RFMPMRRCANINGYSTVFLPGSSPSFVLKSSKSIPRVIGL 948

Query: 125  TIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSY-DAPWPVRKVPLKCTPHFL 183
               G +  ++ FH   C RGF+Y + K   R++  P+  ++ +    V+KVPL      +
Sbjct: 949  QGLG-IRGMSSFHTEGCDRGFIYADDKGIARVTQFPSDTNFTELGISVKKVPLGSDVRGI 1007

Query: 184  AYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWE 243
            AYH  T  Y     T+EP    ++   +D       +++   PP + +  + L SP +W 
Sbjct: 1008 AYHQPTGAYIAGCMTSEP----FELPKDDDYHKEWAKETLSFPPTMPRGVLKLISPITWT 1063

Query: 244  EIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDII 303
             I   +  L   E + C+K + +E        R  +A+GT  +  ED+  RGR+ ++DI+
Sbjct: 1064 VI--HDIELESCESIECMKTLHLEVSEDTKERRFLVAVGTAVSKGEDLPIRGRVHVYDIV 1121

Query: 304  EVVPEPGQPLTKNKIKMIYAKEQ--KGPVTAICHVA--GFLVTAVGQKIYIWQLK-DNDL 358
             V+PEPG+P T  ++K I A+E   +G VTAI  +   G ++ A GQK  +  LK D  L
Sbjct: 1122 TVIPEPGKPETNRRLKAI-AREDIPRGGVTAISEIGTQGLMLVAQGQKCMVRGLKEDGSL 1180

Query: 359  TGIAFIDTEVYIASM--VSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSK 416
              +AF+D   +++S   +S   L L+ D  + +    Y  E  T  ++ + +        
Sbjct: 1181 LPVAFLDMSCHVSSARELSRTGLCLMADAFKGVWFAGYTEEPYTFKVLGKSHG------- 1233

Query: 417  GYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNV 476
                                       RL +         D L +   +  + +D D ++
Sbjct: 1234 ---------------------------RLPVVVA------DFLPDGDDLAIVAADVDGDL 1260

Query: 477  VLFMYQPEARESNGGHRLIKKTDFHLGQH--VNTFFKIRCKPSS---ISDAPGARSRFLT 531
             +  + PE  +S  GH L+ +T F +  +    T    R  P S     D P      + 
Sbjct: 1261 HILEFNPEHPKSLQGHLLLHRTSFSVSPNPPSTTLLLPRTTPPSHPTPQDPP-----HVL 1315

Query: 532  WYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGK---GYYAGN 588
              AS  G L   +PLPE  YRRLL + N ++   +  GGLN +A R   G    G  A  
Sbjct: 1316 LLASSSGHLSSLIPLPETAYRRLLSVTNQLLPALTPHGGLNAKAHRLPVGTRTVGVEAAG 1375

Query: 589  PSRGIIDGSLVWKFLQLSLGERLEICKKIG 618
              R I+DG+++ ++ +LS  +R EI  K G
Sbjct: 1376 -GRAIVDGAVLARWAELSAAKRAEIAGKGG 1404


>gi|448105510|ref|XP_004200513.1| Piso0_003103 [Millerozyma farinosa CBS 7064]
 gi|448108635|ref|XP_004201144.1| Piso0_003103 [Millerozyma farinosa CBS 7064]
 gi|359381935|emb|CCE80772.1| Piso0_003103 [Millerozyma farinosa CBS 7064]
 gi|359382700|emb|CCE80007.1| Piso0_003103 [Millerozyma farinosa CBS 7064]
          Length = 1344

 Score =  195 bits (496), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 151/599 (25%), Positives = 271/599 (45%), Gaps = 63/599 (10%)

Query: 41   ELLIYQAFRHPKGALKLRFKKLKVLFVSDRSKRANEQPGLPRGVRISQ-MRYFSNIAGYQ 99
            E++IY+ F        ++ K LK+    D +         P G  + + + Y  N+ GY 
Sbjct: 800  EVIIYKLFFDGDNFKFIKEKDLKITGAPDNA--------YPLGTTLERRLVYVPNVNGYS 851

Query: 100  GVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVL 159
             +F+ G  P ++  T     R    T   P  + + + + N   GF+Y +     R+  +
Sbjct: 852  SIFVTGIIPYFITKTVHSVPRIFRFT-KLPAVSFSSYSDSNIKNGFIYLDNSKNARMCEI 910

Query: 160  PTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDP 219
            P   +Y+  WP++K+ +  T   +AYH  + T+ + +    P   Y   + E K +V   
Sbjct: 911  PLDFNYENNWPIKKIQMPETVKAIAYHELSNTFVVSSYEEIP---YDCLDEEGKPIVGID 967

Query: 220  RDSRFIPPLVS-QFHVSLFSPFSWEEIPQTNFPLHEW-EHVLCLKNVSMEYEGTLSGLRG 277
            +     PP  S + ++ L SP++W  I       +E   +VL +              + 
Sbjct: 968  KSK---PPAESYKGYLRLISPYNWSVIDTIVLADNEIGMNVLSMVLDVGSSTKKFKSKKE 1024

Query: 278  YIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA 337
             I LG+     ED++  G   +F+II+++PEPG+P T +K K ++ ++ +G VT+IC V+
Sbjct: 1025 LIVLGSGKYRIEDLSSNGSFKIFEIIDIIPEPGKPETNHKFKEVHIEDTRGAVTSICEVS 1084

Query: 338  GFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPE 397
            G L+   GQKI I  L+D+ +  +AF+DT VY++   S  NLIL+GD  +S+ L  +  E
Sbjct: 1085 GRLLVTQGQKIIIRDLQDDGVVPVAFLDTAVYVSEAKSFGNLILLGDSLKSVWLAGFDAE 1144

Query: 398  YRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHND 457
               + L+++D +                           L +S       C     K  +
Sbjct: 1145 PFRMILLSKDIQT--------------------------LDVS-------CADFIVKDEE 1171

Query: 458  ILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPS 517
            I         + +D +  + +  + PE   S+ G RL+ KT F++      F   R  P 
Sbjct: 1172 IF-------ILFADNNNVLHVVKFDPEDPLSSNGQRLVHKTSFNINSAATCF---RTIPK 1221

Query: 518  SISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFR 577
            +  + P   + F +  +++DG+     P+ E  YRR+ +LQ  +     H  GLNPR  R
Sbjct: 1222 NEENYPSLTTSFQSIGSTIDGSFFTVFPINESTYRRMYILQQQLTDKEFHICGLNPRLNR 1281

Query: 578  TYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHN--DILDELYDIEAL 634
                    +   S+ +++  ++ KF+ L+   +     KIGSK++  DI  +L + E++
Sbjct: 1282 FGGLNETNSDANSKPMLEYDVIKKFVNLNSDRKKNFASKIGSKNSYQDIWRDLIEFESV 1340


>gi|148886831|sp|Q7SEY2.2|CFT1_NEUCR RecName: Full=Protein cft-1; AltName: Full=Cleavage factor two
            protein 1
          Length = 1456

 Score =  195 bits (496), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 175/630 (27%), Positives = 279/630 (44%), Gaps = 77/630 (12%)

Query: 17   VQELLTVSLG--LHGNRPLLLVRTQHELLIYQAFRHPKGALKLRFKKLKVLFV--SDRSK 72
            V E+L   LG   H +  L+L     +L +YQ +R    A +   K L    V  S  +K
Sbjct: 844  VAEILVADLGDTTHKSPYLILRHANDDLTLYQPYRLKATAGQPFSKSLFFQKVPNSTFAK 903

Query: 73   RANEQPGLP----RGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDG 128
               E+P          R   MR  SNI+GY  VFL G  P+++  T++   R   +   G
Sbjct: 904  APEEKPADDDEPHNAQRFLPMRRCSNISGYSTVFLPGSSPSFILKTAKSSPRVLSLQGSG 963

Query: 129  PVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSY-DAPWPVRKVPLKCTPHFLAYHL 187
             V  ++ FH   C  GF+Y +     R++ +PT  SY +    V+K+P+      +AYH 
Sbjct: 964  -VQAMSSFHTEGCEHGFIYADTNGIARVTQIPTDSSYAELGLSVKKIPIGVDTQSVAYHP 1022

Query: 188  ETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQ 247
             T+ Y +  +  EP    ++   +D       R++    P+V +  + L S  +W  I  
Sbjct: 1023 PTQAYVVGCNDVEP----FELPKDDDYHKEWARENITFKPMVDRGVLKLLSGITWTVIDT 1078

Query: 248  TNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVP 307
                +   E VLC++ +++E   + +  +  IA+GT     ED+  RGR+ +FDI +V+P
Sbjct: 1079 VE--MEPCETVLCVETLNLEVSESTNERKQLIAVGTALIKGEDLPTRGRVYVFDIADVIP 1136

Query: 308  EPGQPLTKNKIKMIYAKEQ--KGPVTAICHVA--GFLVTAVGQKIYIWQLK-DNDLTGIA 362
            EPG+P T  K+K++ AKE   +G VTA+  V   G ++ A GQK  +  LK D  L  +A
Sbjct: 1137 EPGKPETSKKLKLV-AKEDIPRGAVTALSEVGTQGLMLVAQGQKCMVRGLKEDGTLLPVA 1195

Query: 363  FIDTEVYIASMVSV--KNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYA 420
            F+D   Y+ S+  +    L L+ D  + +    Y  E   + L  +              
Sbjct: 1196 FMDMNCYVTSVKELPGTGLCLMADAFKGVWFTGYTEEPYKMMLFGKS------------- 1242

Query: 421  GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFM 480
                                   R+E+       + D L +   +  + SD D ++ +  
Sbjct: 1243 ---------------------STRMEVL------NADFLPDGKELYIVASDADGHIHILQ 1275

Query: 481  YQPEARESNGGHRLIKKTDFHLGQHVNT----FFKIRCKPSSISDAPGARSRFLTWYASL 536
            + PE  +S  GH L+ +T F+ G H  T       +   PSS+S      S  +   AS 
Sbjct: 1276 FDPEHPKSLQGHLLLHRTTFNTGAHHPTSSLLLPAVYPNPSSLSSNSEENSPHILLLASP 1335

Query: 537  DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFR--------TYKGKGYYAGN 588
             G L    PL E  YRRL  L   +     H  GLNP+ +R        + +  G  AG 
Sbjct: 1336 TGVLATLRPLQENAYRRLSSLAVQLTNGLPHPAGLNPKGYRLPSPSASASMQLPGVDAGI 1395

Query: 589  PSRGIIDGSLVWKFLQLSLGERLEICKKIG 618
              R I+DG ++ +FL+L  G+R E+  + G
Sbjct: 1396 -GRNIVDGKILERFLELGTGKRQEMAGRAG 1424


>gi|164429683|ref|XP_964609.2| hypothetical protein NCU02082 [Neurospora crassa OR74A]
 gi|157073577|gb|EAA35373.2| conserved hypothetical protein [Neurospora crassa OR74A]
          Length = 1437

 Score =  195 bits (495), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 175/630 (27%), Positives = 279/630 (44%), Gaps = 77/630 (12%)

Query: 17   VQELLTVSLG--LHGNRPLLLVRTQHELLIYQAFRHPKGALKLRFKKLKVLFV--SDRSK 72
            V E+L   LG   H +  L+L     +L +YQ +R    A +   K L    V  S  +K
Sbjct: 794  VAEILVADLGDTTHKSPYLILRHANDDLTLYQPYRLKATAGQPFSKSLFFQKVPNSTFAK 853

Query: 73   RANEQPGLP----RGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDG 128
               E+P          R   MR  SNI+GY  VFL G  P+++  T++   R   +   G
Sbjct: 854  APEEKPADDDEPHNAQRFLPMRRCSNISGYSTVFLPGSSPSFILKTAKSSPRVLSLQGSG 913

Query: 129  PVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSY-DAPWPVRKVPLKCTPHFLAYHL 187
             V  ++ FH   C  GF+Y +     R++ +PT  SY +    V+K+P+      +AYH 
Sbjct: 914  -VQAMSSFHTEGCEHGFIYADTNGIARVTQIPTDSSYAELGLSVKKIPIGVDTQSVAYHP 972

Query: 188  ETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQ 247
             T+ Y +  +  EP    ++   +D       R++    P+V +  + L S  +W  I  
Sbjct: 973  PTQAYVVGCNDVEP----FELPKDDDYHKEWARENITFKPMVDRGVLKLLSGITWTVI-- 1026

Query: 248  TNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVP 307
                +   E VLC++ +++E   + +  +  IA+GT     ED+  RGR+ +FDI +V+P
Sbjct: 1027 DTVEMEPCETVLCVETLNLEVSESTNERKQLIAVGTALIKGEDLPTRGRVYVFDIADVIP 1086

Query: 308  EPGQPLTKNKIKMIYAKEQ--KGPVTAICHVA--GFLVTAVGQKIYIWQLK-DNDLTGIA 362
            EPG+P T  K+K++ AKE   +G VTA+  V   G ++ A GQK  +  LK D  L  +A
Sbjct: 1087 EPGKPETSKKLKLV-AKEDIPRGAVTALSEVGTQGLMLVAQGQKCMVRGLKEDGTLLPVA 1145

Query: 363  FIDTEVYIASMVSV--KNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYA 420
            F+D   Y+ S+  +    L L+ D  + +    Y  E   + L  +              
Sbjct: 1146 FMDMNCYVTSVKELPGTGLCLMADAFKGVWFTGYTEEPYKMMLFGKS------------- 1192

Query: 421  GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFM 480
                                   R+E+       + D L +   +  + SD D ++ +  
Sbjct: 1193 ---------------------STRMEVL------NADFLPDGKELYIVASDADGHIHILQ 1225

Query: 481  YQPEARESNGGHRLIKKTDFHLGQHVNT----FFKIRCKPSSISDAPGARSRFLTWYASL 536
            + PE  +S  GH L+ +T F+ G H  T       +   PSS+S      S  +   AS 
Sbjct: 1226 FDPEHPKSLQGHLLLHRTTFNTGAHHPTSSLLLPAVYPNPSSLSSNSEENSPHILLLASP 1285

Query: 537  DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFR--------TYKGKGYYAGN 588
             G L    PL E  YRRL  L   +     H  GLNP+ +R        + +  G  AG 
Sbjct: 1286 TGVLATLRPLQENAYRRLSSLAVQLTNGLPHPAGLNPKGYRLPSPSASASMQLPGVDAGI 1345

Query: 589  PSRGIIDGSLVWKFLQLSLGERLEICKKIG 618
              R I+DG ++ +FL+L  G+R E+  + G
Sbjct: 1346 -GRNIVDGKILERFLELGTGKRQEMAGRAG 1374


>gi|406865186|gb|EKD18229.1| CPSF A subunit region [Marssonina brunnea f. sp. 'multigermtubi'
            MB_m1]
          Length = 1443

 Score =  195 bits (495), Expect = 8e-47,   Method: Compositional matrix adjust.
 Identities = 172/653 (26%), Positives = 290/653 (44%), Gaps = 89/653 (13%)

Query: 14   ETIVQELLTVSLGLHGNR-PLLLVR-TQHELLIYQAFR---HPKGALKLRFKKLKVL--- 65
            ETI  EL+   LG    R P L++R +  +L IY+ F       G L    + LK+    
Sbjct: 841  ETIT-ELVVADLGDETARSPYLILRPSTDDLTIYEPFHTSSESSGGLASTLQFLKIHNPH 899

Query: 66   FVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMT 125
               +    A E     +  R   MR  SN+ GY  VFL G  P+++  +++   +   + 
Sbjct: 900  LARNPDVSAAETADGIQETRDEPMRVISNLGGYCTVFLPGGSPSFIMKSAKSTPKVISLQ 959

Query: 126  IDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSY-DAPWPVRKVPLKCTPHFLA 184
              G V  ++ FH   C RGF+Y +     R+S LP   ++ +    ++K+ L    H +A
Sbjct: 960  GLG-VRGMSSFHTEGCDRGFIYTDVDGLARVSQLPKDTTFAELGVSLQKIELGQEIHGVA 1018

Query: 185  YHLETKTYCIVTST-AE---PSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPF 240
            YH  T+ Y   TST AE   P  D        KE +T         P + Q  + L +P 
Sbjct: 1019 YHPPTECYVAATSTEAEFELPKEDDNHHPQWAKEQIT-------FKPTMEQGRLRLINPV 1071

Query: 241  SWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLF 300
            +W  + +    L  +E ++C+K + +E     +  +  IA+GT  +  ED+  +GRI ++
Sbjct: 1072 NWTVVDEVE--LDPFEVIMCIKTLILETSEITNERKQLIAVGTGISKGEDLAIKGRIHVY 1129

Query: 301  DIIEVVPEPGQPLTKNKIKMIYAKE-QKGPVTAICHVA--GFLVTAVGQKIYIWQLK-DN 356
            D+I VVPEP +P T  ++K+I  ++  +G +T I  +   GF++ + GQK  +  LK D 
Sbjct: 1130 DVINVVPEPDRPETNKRLKLIATEDIARGAITCISEIGTQGFMIVSQGQKCMVRGLKEDG 1189

Query: 357  DLTGIAFIDTEVYIASMVSVK--NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPN 414
             L  +AF+D   YI S+  +K   + +  D  + + +  Y  E   + L  +  K     
Sbjct: 1190 TLLPVAFMDMNCYITSIKELKGTGICVFSDAVKGVWVAGYTEEPYKMMLFGKSAK----- 1244

Query: 415  SKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDK 474
                                          +EI +       D+L +   +  + +D D 
Sbjct: 1245 -----------------------------NMEIMQA------DLLPDGKELYIVAADSDC 1269

Query: 475  NVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLT--- 531
            N+ +  + PE  +S  GH L+ ++ F LG H+ T   +  +  S +  P +     T   
Sbjct: 1270 NLHIMQFDPEHPKSLQGHLLLHRSTFALGGHLPTSMTLLPRTKSATLLPPSPDAMDTAAD 1329

Query: 532  --------WYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKG-- 581
                       S  G +    PL E  YRRL  L + ++    H  GLNPRA+R  K   
Sbjct: 1330 ATIPEHEILITSSTGCISLLTPLSEAQYRRLSTLTSHLINTLYHACGLNPRAYRVDKDAP 1389

Query: 582  KGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEAL 634
            +G      SR +IDG+++ ++++L    R E+  ++G    D+L+   D+ +L
Sbjct: 1390 EGMVG---SRTVIDGNILMRWMELGSQRRAEVAGRVGV---DVLEVREDLASL 1436


>gi|336463425|gb|EGO51665.1| hypothetical protein NEUTE1DRAFT_89273 [Neurospora tetrasperma FGSC
            2508]
          Length = 1437

 Score =  194 bits (494), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 176/630 (27%), Positives = 280/630 (44%), Gaps = 77/630 (12%)

Query: 17   VQELLTVSLG--LHGNRPLLLVRTQHELLIYQAFRHPKGALKLRFKKLKVLFV--SDRSK 72
            V E+L   LG   H +  L+L     +L +YQ +R    A +   K L    V  S  +K
Sbjct: 794  VAEILVADLGDTTHKSPYLILRHANDDLTLYQPYRLKATAGQPFSKSLFFQKVPNSTFAK 853

Query: 73   RANEQP---GLPRGV-RISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDG 128
               E+P     P    R   MR  SNI+GY  VFL G  P+++  T++   R   +   G
Sbjct: 854  APEEKPVDDDEPHNAQRFLPMRRCSNISGYSTVFLPGSSPSFILKTAKSSPRVLSLQGSG 913

Query: 129  PVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSY-DAPWPVRKVPLKCTPHFLAYHL 187
             V  ++ FH   C  GF+Y +     R++ +PT  SY +    V+K+P+      +AYH 
Sbjct: 914  -VQAMSSFHTEGCEHGFIYADTNGIARVTQIPTDSSYAELGLSVKKIPVGVDTQSVAYHP 972

Query: 188  ETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQ 247
             T+ Y +  +  EP    ++   +D       R++    P+V +  + L S  +W  I  
Sbjct: 973  PTQAYVVGCNDVEP----FELPKDDDYHKEWARENITFKPMVDRGVLKLLSGITWTVI-- 1026

Query: 248  TNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVP 307
                +   E VLC++ +++E   + +  +  IA+GT     ED+  RGR+ +FDI +V+P
Sbjct: 1027 DTVEMEPCETVLCVETLNLEVSESTNERKQLIAVGTALIKGEDLPTRGRVYVFDIADVIP 1086

Query: 308  EPGQPLTKNKIKMIYAKEQ--KGPVTAICHVA--GFLVTAVGQKIYIWQLK-DNDLTGIA 362
            EPG+P T  K+K++ AKE   +G VTA+  V   G ++ A GQK  +  LK D  L  +A
Sbjct: 1087 EPGKPETSKKLKLV-AKEDIPRGAVTALSEVGTQGLMLVAQGQKCMVRGLKEDGTLLPVA 1145

Query: 363  FIDTEVYIASMVSV--KNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYA 420
            F+D   Y+ S+  +    L L+ D  + +    Y  E   + L  +              
Sbjct: 1146 FMDMNCYVTSVKELPGTGLCLMADAFKGVWFTGYTEEPYKMMLFGKS------------- 1192

Query: 421  GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFM 480
                                   R+E+       + D L +   +  + SD D ++ +  
Sbjct: 1193 ---------------------STRMEVL------NADFLPDGKELYIVASDADGHIHILQ 1225

Query: 481  YQPEARESNGGHRLIKKTDFHLGQHVNT----FFKIRCKPSSISDAPGARSRFLTWYASL 536
            + PE  +S  GH L+ +T F+ G H  T       +   PSS+S      S  +   AS 
Sbjct: 1226 FDPEHPKSLQGHLLLHRTTFNTGAHHPTSSLLLPAVYPNPSSLSSNSEENSPHILLLASP 1285

Query: 537  DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFR--------TYKGKGYYAGN 588
             G L    PL E  YRRL  L   +     H  GLNP+ +R        + +  G  AG 
Sbjct: 1286 TGVLATLRPLQENAYRRLSSLAVQLTNGLPHPAGLNPKGYRLPSPSASASMQLPGVDAGI 1345

Query: 589  PSRGIIDGSLVWKFLQLSLGERLEICKKIG 618
              R I+DG ++ +FL+L  G+R E+  + G
Sbjct: 1346 -GRNIVDGKILERFLELGTGKRQEMAGRAG 1374


>gi|350297359|gb|EGZ78336.1| protein cft-1 [Neurospora tetrasperma FGSC 2509]
          Length = 1437

 Score =  194 bits (493), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 176/630 (27%), Positives = 280/630 (44%), Gaps = 77/630 (12%)

Query: 17   VQELLTVSLG--LHGNRPLLLVRTQHELLIYQAFRHPKGALKLRFKKLKVLFV--SDRSK 72
            V E+L   LG   H +  L+L     +L +YQ +R    A +   K L    V  S  +K
Sbjct: 794  VAEILVADLGDTTHKSPYLILRHANDDLTLYQPYRLKATAGQPFSKSLFFQKVPNSTFAK 853

Query: 73   RANEQP---GLPRGV-RISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDG 128
               E+P     P    R   MR  SNI+GY  VFL G  P+++  T++   R   +   G
Sbjct: 854  APEEKPVDDDEPHNAQRFLPMRRCSNISGYSTVFLPGSSPSFILKTAKSSPRVLSLQGSG 913

Query: 129  PVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSY-DAPWPVRKVPLKCTPHFLAYHL 187
             V  ++ FH   C  GF+Y +     R++ +PT  SY +    V+K+P+      +AYH 
Sbjct: 914  -VQAMSSFHTEGCEHGFIYADTNGIARVTQIPTDSSYAELGLSVKKIPVGVDTQSVAYHP 972

Query: 188  ETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQ 247
             T+ Y +  +  EP    ++   +D       R++    P+V +  + L S  +W  I  
Sbjct: 973  PTQAYVVGCNDVEP----FELPKDDDYHKEWARENITFKPMVDRGVLKLLSGITWTVI-- 1026

Query: 248  TNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVP 307
                +   E VLC++ +++E   + +  +  IA+GT     ED+  RGR+ +FDI +V+P
Sbjct: 1027 DTVEMEPCETVLCVETLNLEVSESTNERKQLIAVGTALIKGEDLPTRGRVYVFDIADVIP 1086

Query: 308  EPGQPLTKNKIKMIYAKEQ--KGPVTAICHVA--GFLVTAVGQKIYIWQLK-DNDLTGIA 362
            EPG+P T  K+K++ AKE   +G VTA+  V   G ++ A GQK  +  LK D  L  +A
Sbjct: 1087 EPGKPETSKKLKLV-AKEDIPRGAVTALSEVGTQGLMLVAQGQKCMVRGLKEDGTLLPVA 1145

Query: 363  FIDTEVYIASMVSV--KNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYA 420
            F+D   Y+ S+  +    L L+ D  + +    Y  E   + L  +              
Sbjct: 1146 FMDMNCYVTSVKELPGTGLCLMADAFKGVWFTGYTEEPYKMMLFGKS------------- 1192

Query: 421  GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFM 480
                                   R+E+       + D L +   +  + SD D ++ +  
Sbjct: 1193 ---------------------STRMEVL------NADFLPDGKELYIVASDADGHIHILQ 1225

Query: 481  YQPEARESNGGHRLIKKTDFHLGQHVNT----FFKIRCKPSSISDAPGARSRFLTWYASL 536
            + PE  +S  GH L+ +T F+ G H  T       +   PSS+S      S  +   AS 
Sbjct: 1226 FDPEHPKSLQGHLLLHRTTFNTGAHHPTSSLLLPAVYPNPSSLSSNSEENSPHILLLASP 1285

Query: 537  DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFR--------TYKGKGYYAGN 588
             G L    PL E  YRRL  L   +     H  GLNP+ +R        + +  G  AG 
Sbjct: 1286 TGVLATLRPLQENAYRRLSSLAVQLTNGLPHPAGLNPKGYRLPSPSASASMQLPGVDAGI 1345

Query: 589  PSRGIIDGSLVWKFLQLSLGERLEICKKIG 618
              R I+DG ++ +FL+L  G+R E+  + G
Sbjct: 1346 -GRNIVDGKILERFLELGTGKRQEMAGRAG 1374


>gi|392558419|gb|EIW51607.1| hypothetical protein TRAVEDRAFT_176174 [Trametes versicolor FP-101664
            SS1]
          Length = 1431

 Score =  194 bits (493), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 160/641 (24%), Positives = 295/641 (46%), Gaps = 78/641 (12%)

Query: 9    PSAMDETIVQELLTVSLGLHGNRPLLLVRTQH-ELLIYQAF---RHPKGALKLRFKKLKV 64
            P    E  + +++   LG    RP L+V  +  +L +Y+A      P+     R   L V
Sbjct: 839  PRKPQELDIDQIVIAPLGESRPRPHLIVLLRSGQLAVYEAVAIPPPPEPLPSTRSSTLLV 898

Query: 65   LFVSDRSK-------RANEQPGLPRGVRISQMR--YFSNIA---GYQGVFLCGPHPAWLF 112
             FV   SK          ++  L    RIS++   + ++ A    + GVF  G  P+W+ 
Sbjct: 899  KFVKVASKAFDIQHPEEEQKSVLAEQKRISRLLVPFVTSPAPGQTFSGVFFTGDRPSWIL 958

Query: 113  LTSRGELRAHPM--TIDGPVSTLAPFHNVNCPRG-FLYFNAKSELRISVLPTHLSYDAPW 169
             T +G ++  P   ++    +T + + +    RG FL ++ +    +  +P  +  D   
Sbjct: 959  STDKGGVKVFPSGHSVVQAFTTSSLWES----RGDFLLYSEEGPSLVEWMP-DVQLDGHL 1013

Query: 170  PVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLV 229
            P R VP +  P+       + +  +  S+ +   + +    ED  +V +P       PL 
Sbjct: 1014 PARSVP-RSRPYSNVVFDASTSLIVAASSFQ---NRFASYDEDGNVVWEPDSPNISSPLC 1069

Query: 230  SQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSE 289
                + L SP  W  I    +     E V C+ ++ +E   T SG++ +IA+GT  N  E
Sbjct: 1070 ECSTLELISPDGW--ITMDGYEFAPNEFVNCIVSIPLETMSTESGMKDFIAVGTTINRGE 1127

Query: 290  DVTCRGRILLFDIIEVVPEPGQPLTK-NKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKI 348
            D+  +G + +F+I+EVVP+P   + +  ++K++   + KGPV+ +C + G+LV+++GQKI
Sbjct: 1128 DLAVKGAVYIFEIVEVVPDPSTHVKRWWRLKLLCRDDAKGPVSFLCGINGYLVSSMGQKI 1187

Query: 349  YIWQLK-DNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARD 407
            ++     D  L G+AF+D  VY+ S+ +VKNL+++GD  +S+  + +Q +   L ++ +D
Sbjct: 1188 FVRAFDLDERLVGVAFLDVGVYVTSLRAVKNLLVIGDAVKSVWFVAFQEDPYKLVVLGKD 1247

Query: 408  YKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGF 467
                                          QL    R ++    G            +  
Sbjct: 1248 P-----------------------------QLCCITRADLFFADG-----------QLSI 1267

Query: 468  MISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARS 527
            +  D++  V L+ Y P   ES  G  L+++T+FH      +   +  +P +  D    ++
Sbjct: 1268 VTCDEEGIVRLYAYDPHDPESKSGQHLLRRTEFHGQSEYRSSMLVARRPKN-GDPEIPQA 1326

Query: 528  RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAG 587
            R +    S+DG+L     + E   +RL +LQ  ++    H   LNP+AFR  + +  Y  
Sbjct: 1327 RLVC--GSVDGSLSTLTYVDEAASKRLHLLQGQLIRTVQHVAALNPKAFRMVRNE--YVS 1382

Query: 588  NP-SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDE 627
             P S+GI+DG+L+  F  L +  + E+ ++IG+    +L +
Sbjct: 1383 RPLSKGILDGNLLATFEDLPIARQNEVTRQIGTDRATVLKD 1423


>gi|392585051|gb|EIW74392.1| hypothetical protein CONPUDRAFT_133073 [Coniophora puteana RWD-64-598
            SS2]
          Length = 1490

 Score =  194 bits (493), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 146/537 (27%), Positives = 253/537 (47%), Gaps = 69/537 (12%)

Query: 100  GVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNC-----PRG-FLYFNAKSE 153
            G FL G  P W+  T  G +R +P       S  A  H  +       RG FL ++ +  
Sbjct: 1006 GAFLTGDKPHWIIRTDAGGVRLYP-------SGHALVHAFSACSLWESRGDFLVYSDEGP 1058

Query: 154  LRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDK 213
              +   P  L    P P R VP   T   + Y   +    +V ++A+    +  ++ ED 
Sbjct: 1059 TLLEWAP-DLEVHGPLPSRSVPKGRTYGKVVYEHGSG---LVIASADGWASFASYD-EDG 1113

Query: 214  ELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLS 273
             +V +P       P      + L SP  W  +    F  +E+  V  ++ V++E   T +
Sbjct: 1114 AIVWEPDAPGVAFPKADCSTLELISPELWITLDGYEFAPNEF--VNAVEVVTLETLSTET 1171

Query: 274  GLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTK-NKIKMIYAKEQKGPVTA 332
            G + ++A+GT  N  ED+  RG   +F+++EVVP+P   L +  K+KM    + KGPVTA
Sbjct: 1172 GSKEFVAVGTTINRGEDLAVRGATYIFEVVEVVPDPSSKLDRWYKLKMRVRDDAKGPVTA 1231

Query: 333  ICHVAGFLVTAVGQKIYIWQLK-DNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIAL 391
            +C + G+LV+++GQKI+I     D  L G+AF+D  VY+ S+ ++KNL+L+GD  +S+ L
Sbjct: 1232 LCGINGYLVSSMGQKIFIRAFDLDERLVGVAFLDAGVYVTSLKALKNLLLIGDAVKSVWL 1291

Query: 392  LRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI 451
            + +Q +   L ++++D +     S  ++  N                   GE        
Sbjct: 1292 VAFQEDPYKLVILSKDIRRQYAASVDFFFAN-------------------GE-------- 1324

Query: 452  GSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFK 511
                         +  +  D++  +  + Y P   ES  G +L+  T+FH  +  +T   
Sbjct: 1325 -------------LSIVTEDEEGVLRAYEYDPNDPESRSGQQLLCHTEFHGHKECSTTLT 1371

Query: 512  IRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGL 571
            I  +  +  + P  +++ ++ +   DG+L    P+ E  ++RL +LQ  +  +  H  GL
Sbjct: 1372 IARRTKTEHEIP--QAKLISGFG--DGSLSALTPVDEAAFKRLQLLQGQLTRNVQHIAGL 1427

Query: 572  NPRAFRTYKGKGYYAGNP-SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDE 627
            NPRAFR  + +      P S+GI+DG L+  F    +  + E+ ++IG++   IL E
Sbjct: 1428 NPRAFRIVRNE--TVSKPLSKGILDGQLLSSFEAQGITRQGEMTRQIGTERTTILQE 1482


>gi|301093545|ref|XP_002997618.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262110008|gb|EEY68060.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 1744

 Score =  194 bits (492), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 168/643 (26%), Positives = 282/643 (43%), Gaps = 125/643 (19%)

Query: 80   LPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMT-----IDGPVSTLA 134
            L  G R   +  F N+    G F  G HP W+ L  RG     PM      +  PV +  
Sbjct: 1142 LRAGFRYPMLTCFHNVNNMSGAFFRGAHPMWI-LGDRGHASFVPMCNAAPRVSVPVLSFT 1200

Query: 135  PFHNVNCPRGFLYFNAKSELRISVLP-----THLSYDAPWPVRKVPLKCTPHFLAY---- 185
             FH+ NCP GF+YF+++  LR+  LP     T L     + ++K     T H + Y    
Sbjct: 1201 SFHHWNCPNGFIYFHSRGALRVCELPSSKTSTILPSSGGFVLQKAEFGATLHHMLYLGSH 1260

Query: 186  -------HLETKTYCIVTST------AEPSTDYYKFNGEDKELVTDPRD----SRFIPPL 228
                    LE  TY +V S       A+ +T+      E +    DP      S  + P 
Sbjct: 1261 GPGGVAEALEAPTYAVVCSARLKPADADRATEVEGAEEELEPENLDPNGNPLGSNVMAPT 1320

Query: 229  VSQF------HVSLFSPFSWE-EIPQTN----------FPLH--EWEHVLCLK-----NV 264
               F      H++      +E  + QT+          F +H   +E VL +K     + 
Sbjct: 1321 AEMFADYETDHMAHTEEDVYELRLVQTDEFGEWGRRGVFRVHFERYEVVLSVKLMYLYDS 1380

Query: 265  SMEYEGTLSGL-------RGYIALGTNY--NYSEDVTCRGRILLF--DIIEVVPEPGQPL 313
            S+  E   S         R Y+ +GT +   + ED + RGR+LL+  D  + V E G   
Sbjct: 1381 SLMKEEVASTSPEWNKKKRPYLVVGTGWVGPHGEDESGRGRLLLYELDYAQYVNEEGGAT 1440

Query: 314  TKN--KIKMIYAKE-QKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYI 370
            +    K+++++ KE ++G ++ +  +  +++ AVG K+ +++ K   L G AF D ++YI
Sbjct: 1441 SGKLPKLRLVFIKEHRQGAISMVSQLGPYVLAAVGSKLIVYEFKSEQLIGCAFYDAQMYI 1500

Query: 371  ASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDG 430
             ++  VK+ ++ GD  +S+  LR++   R L L+A+DY+P                    
Sbjct: 1501 VTLSVVKDFVMYGDVYKSVHFLRWREMQRQLVLLAKDYEP-------------------- 1540

Query: 431  SLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNG 490
             L     + S+ E+                    +  +  D D+N+ +  + P+  ES G
Sbjct: 1541 -LAVSATEFSVFEK-------------------KLALLAVDMDENLHVMQFAPQDIESRG 1580

Query: 491  GHRLIKKTDFHLGQHVNTFFKIRCKPS-SISDAPGAR-----SRFLTWYASLDGALGFFL 544
            G RL++ +DFHLG  V++ F+ R   S S+  A   R     S ++    + +G +G  +
Sbjct: 1581 GQRLLRVSDFHLGVQVSSMFRKRVDASGSVVSATNGRNAAPLSNYVNVMGTSEGGVGALV 1640

Query: 545  PLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGY-YAGNPS--------RGIID 595
            P+ E+ +RRL  LQNVMV        LNPR FR  K       G P         +  +D
Sbjct: 1641 PVGERVFRRLFTLQNVMVNTLPQNCALNPREFRMLKTNAQRRCGRPDAWSKKKWKKSFLD 1700

Query: 596  GSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
              ++++FLQL    + E+ + IG+    ++  L +++  +S F
Sbjct: 1701 AFVLFRFLQLDYVAQKELARCIGTTPEVVMHNLLEVQHATSTF 1743


>gi|358372791|dbj|GAA89393.1| cleavage and polyadenylation specificity factor subunit A
            [Aspergillus kawachii IFO 4308]
          Length = 1372

 Score =  194 bits (492), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 168/625 (26%), Positives = 283/625 (45%), Gaps = 96/625 (15%)

Query: 32   PLLLVRTQHE-LLIYQAFRHPKGAL----KLRFKKLKVLFVSDRSKRANEQPGLPRGVRI 86
            P L++R++++ L+IY+ F  P G       L+F K     +   S   +         R+
Sbjct: 816  PYLILRSENDDLIIYKPFVIPTGPTGEIHTLKFSKENNSVLPMISPDVDSTQPSGSDYRV 875

Query: 87   SQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFL 146
              +R   +I+G   VF+ G    ++  TS      H + + G             PR   
Sbjct: 876  RPLRILPDISGLSAVFMPGASAGFVLRTSASA--PHFLRLRG-----------ESPRC-- 920

Query: 147  YFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYY 206
                 S +R   LP    +D  W ++KV L      LAY   +  Y + T  A   TD+ 
Sbjct: 921  -----STVRFCQLPPMTRFDYQWTLKKVHLGEQVDHLAYSTSSGMYVLGTCHA---TDFK 972

Query: 207  KFNGEDKELVTDPRDS--RFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNV 264
                +D EL  + R+    F P     F + L SP +W  I   ++ L   E+V+ +KN+
Sbjct: 973  L--PDDDELHPEWRNEAISFFPSARGSF-IKLVSPNTWSII--DSYSLGTDEYVMAIKNI 1027

Query: 265  SMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAK 324
            S+E        +  I +GT +   ED+  RG I +F++++VVP+P  P T  K+K+I  +
Sbjct: 1028 SLEISENTHERKDLIVVGTAFARGEDIPSRGCIYVFEVVQVVPDPDDPETDRKLKLIGKE 1087

Query: 325  EQKGPVTAICHVA--GFLVTAVGQKIYIWQLK-DNDLTGIAFIDTEVYIASMVSVK--NL 379
              KG VTA+  +   GF++ A GQK  +  LK D  L  +AF+D + Y++ +  +K   +
Sbjct: 1088 SVKGAVTALSEIGGQGFVLVAQGQKCMVRGLKEDGSLLPVAFMDMQCYVSVVKELKGTGM 1147

Query: 380  ILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQL 439
             ++GD  + I    Y  E   +SL A+D                            +L++
Sbjct: 1148 CILGDAVKGIWFAGYSEEPYKMSLFAKDL--------------------------DYLEV 1181

Query: 440  SLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTD 499
            S  E L   ++              +  +++D D N+ +  Y PE  +S+ G +L+ ++ 
Sbjct: 1182 SAAEFLPDGRR--------------LFIVVADSDCNIHVLQYDPEDPKSSNGDKLLSRSK 1227

Query: 500  FHLGQHVNTFFKI-RCKPSS---IS-----DAPGARSRFLTWYASLDGALGFFLPLPEKN 550
            FH G   +T   + R   SS   IS     D     +       + +G+LG    +PE++
Sbjct: 1228 FHTGNFASTLTLLPRTMVSSEKMISNSDDMDIDNQSALHQVLMTTQNGSLGLITCMPEES 1287

Query: 551  YRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGER 610
            YRRL  LQ+ +     H  GLNPRAFR  +      G   RG++DG+L++K++ +S   +
Sbjct: 1288 YRRLSALQSQLTNTLEHPCGLNPRAFRAVESD----GTAGRGMLDGNLLFKWIDMSKQRK 1343

Query: 611  LEICKKIGSKHNDILDELYDIEALS 635
             EI  ++G++  +I     D+EA+S
Sbjct: 1344 TEIAGRVGAREWEI---KADLEAIS 1365


>gi|358390357|gb|EHK39763.1| hypothetical protein TRIATDRAFT_48211 [Trichoderma atroviride IMI
            206040]
          Length = 1441

 Score =  193 bits (490), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 179/656 (27%), Positives = 293/656 (44%), Gaps = 98/656 (14%)

Query: 17   VQELLTVSLG--LHGNRPLLLVRTQHELLIYQAFRHPKGALK------LRFKK-----LK 63
            + EL+   LG  +H +  L+L  +  +L IY+  R P  +        L FKK     L 
Sbjct: 837  LTELVVADLGDTVHYSPYLILRHSTDDLTIYEPIRLPTDSPTRNLSDTLFFKKSANSILA 896

Query: 64   VLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHP 123
               V D  +   +QP      R   +R  +N+ GY  VFL GP PA++  +S+   R   
Sbjct: 897  KSTVEDPLEDTAQQP------RYVPLRICANVGGYSTVFLPGPSPAFILKSSKSVPRVVG 950

Query: 124  MTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSY-DAPWPVRKVPLKCTPHF 182
            +   G V  ++ F+   C RGF+Y +++   R++ LP+  ++ +    V+KVPL      
Sbjct: 951  VQGLG-VRGMSTFNTEGCDRGFIYSDSEGIARVTQLPSKTNFTELGVSVKKVPLGNDVRH 1009

Query: 183  LAYHLETKTY---CIVTSTAE-PSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFS 238
            +AYH  T+TY   C VT   E P  D Y      KE     ++S    P   +  + L S
Sbjct: 1010 VAYHHPTETYIAGCAVTEGFELPKDDDYH-----KEWA---KESLSFHPSTVRGSLKLIS 1061

Query: 239  PFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRIL 298
            P +W  I   +  +   E + C+K + +E        R  +A+GT     ED+  RGR+ 
Sbjct: 1062 PVTWTVIHSID--MEPGESIECMKTLHLEVSEETKERRMLLAVGTALTRGEDLPTRGRVQ 1119

Query: 299  LFDIIEVVPEPGQPLTKNKIKMIYAKEQ--KGPVTAICHVA--GFLVTAVGQKIYIWQLK 354
            ++DI+ V+PEPG+P T  K+K++ AKE+  +G VTA+  +   G ++ A GQK  +  LK
Sbjct: 1120 VYDIVTVIPEPGKPETNKKLKLL-AKEEIPRGGVTALSEIGTQGLMLMAQGQKCMVRGLK 1178

Query: 355  -DNDLTGIAFIDTEVYIASMVSV--KNLILVGDYARSIALLRYQPEYRTLSLVARDYKPT 411
             D  L  +AF+D   ++AS   +    L L+ D  + +    Y  E  T  ++ +     
Sbjct: 1179 EDGSLLPVAFLDMSCHVASARELPGTGLCLIADAFKGLWFAGYTEEPYTFKVLGKS---- 1234

Query: 412  QPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISD 471
                              GSL                        D L +   +  +  D
Sbjct: 1235 -----------------SGSLPLLV-------------------ADFLPDGEDLSMVAVD 1258

Query: 472  KDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQH--VNTFFKIRCKPSSISDAPGARSR- 528
             D ++ +  + PE  +S  GH L+ +T F +  +   +T    R  P+S S      S  
Sbjct: 1259 ADGDIHVLEFNPEHPKSLQGHLLLHRTTFSVTPNPPTSTLLLPRTLPASQSATASQDSST 1318

Query: 529  ---FLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYY 585
                L   AS  G+L    PLPE  YRRLL + N ++      GGL+ RA R  +G G  
Sbjct: 1319 PQPHLLLLASPSGSLAALTPLPESAYRRLLSVTNQLLPALVPHGGLHARAHRAPEGGGGM 1378

Query: 586  A-------GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEAL 634
            +           R I+DG+++ ++ +L   +R E+  + G  ++ +++   D+EA+
Sbjct: 1379 SRMVGVETAASGRAIVDGAILTRWNELGAAKRAEVASRGG--YDSVMELREDLEAV 1432


>gi|302924728|ref|XP_003053954.1| predicted protein [Nectria haematococca mpVI 77-13-4]
 gi|256734895|gb|EEU48241.1| predicted protein [Nectria haematococca mpVI 77-13-4]
          Length = 1429

 Score =  192 bits (487), Expect = 6e-46,   Method: Compositional matrix adjust.
 Identities = 176/632 (27%), Positives = 284/632 (44%), Gaps = 91/632 (14%)

Query: 17   VQELLTVSLG-LHGNRPLLLVRTQ-HELLIYQAFRH-PKGA-----LKLRFKK-----LK 63
            ++ELL   LG      P L++R Q  +L IY+  R+ P+GA       L FKK     L 
Sbjct: 836  LRELLVADLGDTVSQSPYLILRNQTDDLTIYEPLRYQPEGAEPTLSATLTFKKTSNAALA 895

Query: 64   VLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHP 123
               V    + A +QP      R   +R  +N+ GY  VFL GP P+++  +S+   R   
Sbjct: 896  TSPVETSQEDAVQQP------RFVPLRTCANVNGYSTVFLPGPSPSFILKSSKSIPRVIG 949

Query: 124  MTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSY-DAPWPVRKVPLKCTPHF 182
            +   G +  ++ FH   C RGF+Y + +   R++ LP+  ++ D    V+KVPL      
Sbjct: 950  LQGLG-IRGMSTFHTEGCDRGFIYADDEGIARVTQLPSETNFTDLGISVKKVPLDSDVCG 1008

Query: 183  LAYHLETKTYCIVTSTAEP-----STDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLF 237
            +AYH  T TY    +T EP       DY+K     KE +T         P + +  + L 
Sbjct: 1009 IAYHQPTGTYIAGCTTNEPFELPRDDDYHKEWA--KETLT-------FAPTMPRGVLKLI 1059

Query: 238  SPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRI 297
            SP S   I      L   E + C+K + +E        R  + +GT  +  ED+  RGR+
Sbjct: 1060 SPVSLTVIHDQE--LESCESIECMKTLQLEVSEETKERRFLLTVGTALSKGEDLPIRGRV 1117

Query: 298  LLFDIIEVVPEPGQPLTKNKIKMIYAKEQ--KGPVTAICHVA--GFLVTAVGQKIYIWQL 353
             +FDI+ V+PEPG+P T  ++K I A+E   +G VTAI  +   G ++ A GQK  +  L
Sbjct: 1118 HVFDIVTVIPEPGKPETNKRLKAI-AREDIPRGGVTAISEIGTQGLMLVAQGQKCMVRGL 1176

Query: 354  K-DNDLTGIAFIDTEVYIASMVSV--KNLILVGDYARSIALLRYQPEYRTLSLVARDYKP 410
            K D  L  +AF+D   +++S   +    L ++ D  + +    Y  E  T  ++ + +  
Sbjct: 1177 KEDGSLLPVAFLDMSCHVSSARELPRTGLCVMADAFKGVWFAGYTEEPYTFKILGKSHG- 1235

Query: 411  TQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMIS 470
                                             RL +         D L +   +  + +
Sbjct: 1236 ---------------------------------RLPLLVA------DFLPDGEDLAIVAA 1256

Query: 471  DKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI--RCKPSSISDAPGARSR 528
            D D ++ +  + PE  +S  GH L+ +T F +  +  T   +  R  P +   +P   S+
Sbjct: 1257 DADGDLHILEFNPEHPKSLQGHLLLHRTTFSVSPNPPTSMLLLPRTTPPA-HPSPSDPSQ 1315

Query: 529  FLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGN 588
             L   AS  G L   +PLPE  YRRLL + N ++   +  GGLN + +R   G      +
Sbjct: 1316 IL-LLASPSGHLSTLVPLPEATYRRLLSVTNQLLPALTPYGGLNAKGYRLPSGTRPVGVD 1374

Query: 589  PSRG--IIDGSLVWKFLQLSLGERLEICKKIG 618
             + G  I+DG+++ ++ +L   +R EI  K G
Sbjct: 1375 AAAGRTIVDGAILARWAELGAAKRAEIAGKGG 1406


>gi|156040479|ref|XP_001587226.1| hypothetical protein SS1G_12256 [Sclerotinia sclerotiorum 1980]
 gi|154696312|gb|EDN96050.1| hypothetical protein SS1G_12256 [Sclerotinia sclerotiorum 1980 UF-70]
          Length = 1447

 Score =  192 bits (487), Expect = 7e-46,   Method: Compositional matrix adjust.
 Identities = 172/659 (26%), Positives = 282/659 (42%), Gaps = 91/659 (13%)

Query: 10   SAMDETIVQELLTVSLGLH-GNRPLLLVR-TQHELLIYQAFRHPKGALKLRFKKLKVLFV 67
            SA+ ET+  E+L   LG      P L++R +  +L IY+ FR    +  L    L+ L +
Sbjct: 836  SAIRETLT-EILVADLGDSVSQSPYLILRPSNDDLTIYEPFRIASTSPNLLSSTLQFLKI 894

Query: 68   SDR------SKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRA 121
             +          A EQ    +      MR  SN+ GY  VF+ G  P+++  +S+   + 
Sbjct: 895  HNTHLAQAPDVSAEEQADETQQTSDKPMRAVSNLGGYSVVFMPGGSPSFIVKSSKTLPKV 954

Query: 122  HPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSY-DAPWPVRKVPLKCTP 180
              +   G V  L+ FH   C RGF+Y + +  +R++  P   ++ D    +RKV +    
Sbjct: 955  LSLQGTG-VRGLSSFHTEGCDRGFIYADTEGIVRVAQFPPTTTFADIGMALRKVEIGEDV 1013

Query: 181  HFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPF 240
            H +AYH   +TY I TST    TD+     +D        D  F P  + +  + L SP 
Sbjct: 1014 HAVAYHSPLQTYVIGTSTF---TDFELPKDDDHRRSWQEEDIAFKPS-IEKSSLKLISPV 1069

Query: 241  SWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLF 300
            +W  I      L   E + C+K +++      +  +  + +GT     ED+   GR+ ++
Sbjct: 1070 NWSVI--DTIELEPCEVITCIKTMNLVVSEVTNERKPLLVVGTAITKGEDLATTGRLYVY 1127

Query: 301  DIIEVVPEPGQPLTKNKIKMIYAKE----QKGPVTAICHVA--GFLVTAVGQKIYIWQLK 354
            D++ VVPEP +P T  K+K+I A+       GPVT +  +   GF++ A GQK  +  LK
Sbjct: 1128 DVVIVVPEPDRPETNKKLKLISAETITRGAGGPVTGLSEIGTQGFMLVAQGQKCMVRGLK 1187

Query: 355  DNDLT-GIAFIDTEVYIASMVSV--KNLILVGDYARSIALLRYQPEYRTLSLVARDYKPT 411
            ++     +AF+DT  Y+ S+  +    L ++ D  + +    Y  E   + L  +     
Sbjct: 1188 EDGTNLPVAFMDTNCYVTSIKELPGTGLCVIADALKGVWFAGYTEEPYKMLLFGKS---- 1243

Query: 412  QPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEI-CKKIGSKHNDILDEFSSMGFMIS 470
                                            R+E+ C        D+L +   +  + +
Sbjct: 1244 ------------------------------ATRMEVLCA-------DLLPDGKDLFIVAA 1266

Query: 471  DKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSI----------- 519
            D D N+ +  Y PE  +S  GH L+ +T F LG H  T   +     S+           
Sbjct: 1267 DADGNLHIMQYDPEHPKSLQGHLLLHRTTFSLGAHHPTTMTLLPAIPSLHPLTTASSSSL 1326

Query: 520  -----SDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPR 574
                  D+P      L    S  G      PL E  YRR   L + +     H  GLNPR
Sbjct: 1327 SPSPQEDSPSPSQSLL--LTSRTGTFALLSPLTESQYRRFGTLVSHLTNTLYHPCGLNPR 1384

Query: 575  AFRTYK--GKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI 631
            A+R  K   +G   G   R IIDG ++ ++++L    R E+  ++G    ++ DEL ++
Sbjct: 1385 AYRVDKDANEGIVGG---RTIIDGGVLGRWMELGSQRRGEVAGRVGVDVLELRDELSEL 1440


>gi|169864473|ref|XP_001838845.1| cleavage factor protein [Coprinopsis cinerea okayama7#130]
 gi|116500065|gb|EAU82960.1| cleavage factor protein [Coprinopsis cinerea okayama7#130]
          Length = 1458

 Score =  191 bits (486), Expect = 8e-46,   Method: Compositional matrix adjust.
 Identities = 147/544 (27%), Positives = 246/544 (45%), Gaps = 82/544 (15%)

Query: 98   YQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNV-----NCP----RG-FLY 147
            Y GVF  G  P W+  T +G ++ +P             HNV      C     RG FL 
Sbjct: 971  YSGVFFTGDKPNWIIGTDKGGVQIYPSG-----------HNVVHSFSACSLWEERGEFLV 1019

Query: 148  FNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYK 207
            +       I  LP   +Y  P P R +P       + +   T   C++ + +     +  
Sbjct: 1020 YTEDGPCLIEWLP-DFTYSHPLPARSIPRGRGYSNVVFDPST---CLIVAASSMQARFAS 1075

Query: 208  FNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSME 267
            ++ ED   V +        P+     + L SP SW  I    F     E++  +  V++E
Sbjct: 1076 YD-EDGVRVWEKDGPGVDDPITDTSALELISPNSW--ITMDGFEFATNEYINDISIVTLE 1132

Query: 268  YEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPG-QPLTKNKIKMIYAKEQ 326
               T +G + +IA+GT  +  ED+  +G   +F+I+EVVP+P   P    K+++    + 
Sbjct: 1133 TAATETGSKDFIAVGTTIDRGEDLAAKGAAYIFEIVEVVPDPAISPTRWYKLRLRCRDDA 1192

Query: 327  KGPVTAICHVAGFLVTAVGQKIYIWQL-KDNDLTGIAFIDTEVYIASMVSVKNLILVGDY 385
            KGPVTA+C   G+LV+++GQKI++     D  L G+AF+D  +Y+ S+  +KNL+L+GD 
Sbjct: 1193 KGPVTAVCGFQGYLVSSMGQKIFVRAFDSDERLVGVAFMDVGIYVTSLRVLKNLLLIGDA 1252

Query: 386  ARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERL 445
             +S+  + +Q +   L L+A+D          ++         DG L             
Sbjct: 1253 VKSVMFVAFQEDPYKLVLLAKDVHLHSVTRADFFFNA------DGDL------------- 1293

Query: 446  EICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQ- 504
                                  ++ D++  + ++ Y P   +S  G  L+ +T++H GQ 
Sbjct: 1294 --------------------ALIVGDEEGIMRIYEYNPNDPDSRDGRYLLLRTEYH-GQV 1332

Query: 505  --HVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMV 562
              H +T    R K     D    +S  L    S DG+L   +P+ E  ++RL +LQ  + 
Sbjct: 1333 PYHTSTTIARRDK----EDPSIPQSHLL--IGSADGSLSSLVPVDEYAFKRLQLLQGQLT 1386

Query: 563  THTSHTGGLNPRAFRTYKGKGYYAGNP-SRGIIDGSLVWKFLQLSLGERLEICKKIGSKH 621
             +  H  GLNP+AFR  K    Y   P S+GI+DG L+ ++  L +  + E+ K+IG++ 
Sbjct: 1387 RNIQHVAGLNPKAFRIVKND--YVSKPLSKGILDGQLLAQYESLPIPRQNEMTKQIGTER 1444

Query: 622  NDIL 625
              +L
Sbjct: 1445 GVVL 1448


>gi|389740693|gb|EIM81883.1| hypothetical protein STEHIDRAFT_65512 [Stereum hirsutum FP-91666 SS1]
          Length = 1438

 Score =  191 bits (485), Expect = 9e-46,   Method: Compositional matrix adjust.
 Identities = 165/650 (25%), Positives = 294/650 (45%), Gaps = 77/650 (11%)

Query: 1    MGNFRSHSPSAMDETIVQELLTVSLGLHGNRPLLLVRTQH-ELLIYQA--FRHPKGALK- 56
            + +     P    E  + ++L   LG    RP L V  +  +L IY+A  F  P G  + 
Sbjct: 835  LSSLPQDQPRKPQELDIDQILVAPLGETSPRPHLFVLLRSGQLAIYEAVSFELPTGDPEP 894

Query: 57   ------LRFKKLKVLFVSDRSKRANEQPG----LPRGVRISQM--RYFSNIA---GYQGV 101
                  L  K +KVL  +   +  +EQP     L    +I ++   + ++ A    + GV
Sbjct: 895  ASRPSILPVKLVKVLSRAFDIQHPDEQPQEKSVLAELKKIQRLFIPFVTSPAPEKTFTGV 954

Query: 102  FLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPT 161
            F  G  P W+  T +G +R H  +    V +  P    +    FL +  +    +  +P 
Sbjct: 955  FFTGDRPCWILGTDKGGIRVH-SSGHAVVHSFTPCSLWDSKGDFLLYTDEGPCLLEWMP- 1012

Query: 162  HLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRD 221
             +      P R +P       + +   T   C++   A     +  F+ ED     +P  
Sbjct: 1013 DVQLHTELPSRFMPRSRAYTNVVFDPFT---CLIVGAASLKAQFTSFD-EDGNQTWEPDA 1068

Query: 222  SRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIAL 281
                 P      + L +P +W  +    F  +E   V  ++ V +E + T SG + +IA+
Sbjct: 1069 PNISYPTTDCSTLELITPDAWLTMDGYEFASNEI--VNAVECVMLETQSTDSGQKSFIAV 1126

Query: 282  GTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTK-NKIKMIYAKEQKGPVTAICHVAGFL 340
            GT  N  ED+  +G   +F+I+EVVP+P   + +  K++M    + KGPVTA+C + G+L
Sbjct: 1127 GTTINRGEDLAVKGATYIFEIVEVVPDPSFGVKRWFKLRMRCRDDAKGPVTALCGMDGYL 1186

Query: 341  VTAVGQKIYIWQLK-DNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYR 399
            V+++GQKI++  L  D  L G+AF+D  VY+ S+ ++KNL+++ D  +S+  + +Q +  
Sbjct: 1187 VSSMGQKIFVRALDLDERLVGVAFLDVGVYVTSLRALKNLLIISDAVKSVWFVAFQEDPY 1246

Query: 400  TLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDIL 459
             L+++A+D +     S  ++  N               QLS+                  
Sbjct: 1247 KLTVLAKDAQQVCFTSADFFFANQ--------------QLSI------------------ 1274

Query: 460  DEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQ--HVNTFFKIRCKPS 517
                    +  D++  + ++ Y P   ES  G RL+   +FH GQ  + ++    R    
Sbjct: 1275 --------VTCDEEGILRMYHYNPHDPESKNGQRLLCHAEFH-GQIEYRSSLTIARRTKG 1325

Query: 518  SISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFR 577
              ++ P A+        S DG+L   +P+ E  ++RL +LQ  +  +  HT  LNPRAFR
Sbjct: 1326 PDTEIPQAK----LICGSPDGSLSALVPVEEAAFKRLHLLQGQLTRNVQHTAALNPRAFR 1381

Query: 578  TYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDE 627
              + + Y +    +G +DG L+  F  L +  ++EI ++IG++   +L +
Sbjct: 1382 AVRNE-YVSKTLHKGFLDGLLLRSFEDLPVSRQIEITRQIGTERRLVLKD 1430


>gi|380494933|emb|CCF32776.1| cft-1, partial [Colletotrichum higginsianum]
          Length = 542

 Score =  191 bits (484), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 158/581 (27%), Positives = 256/581 (44%), Gaps = 81/581 (13%)

Query: 73  RANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVST 132
            ANEQP      R   +R  +NI GY  VFL G  P+ +  +++   +   +   G V  
Sbjct: 15  EANEQP------RFVPLRPCANINGYSTVFLPGASPSLIVKSAKSSPKVVGLQGIG-VRG 67

Query: 133 LAPFHNVNCPRGFLYFNAKSELRISVLPTHLSY-DAPWPVRKVPLKCTPHFLAYHLETKT 191
           ++ FH   C RGF+Y +++ + R++ LP   ++ +    VRK+P+      +AYH   +T
Sbjct: 68  MSSFHTEGCERGFIYADSEGQTRVTQLPADSNFAELGVSVRKIPIGDAVGLIAYHPPMET 127

Query: 192 YCIVTSTAE----PSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQ 247
           Y +  S +E    P  D Y      KE   +   S    PL  +  V L SP +W  I  
Sbjct: 128 YAVACSISEHFELPKDDDYH-----KEWAKETTTSY---PLTERGIVKLMSPTTWSVIDT 179

Query: 248 TNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVP 307
                HE    +C+K + +E        R  I +GT  N  ED+  RGRIL++D++ VVP
Sbjct: 180 VELEPHEV--AMCMKTLHLEVSEETKERRMLITIGTAINRGEDLPIRGRILVYDVVPVVP 237

Query: 308 EPGQPLTKNKIKMIYAKEQ--KGPVTAICHVA--GFLVTAVGQKIYIWQLK-DNDLTGIA 362
           +PG+P T  K+K++ AKE+  +G VT +C V   G ++ A GQK  +  LK D  L  +A
Sbjct: 238 QPGRPETNKKLKLV-AKEEIPRGAVTGLCEVGSQGLMLVAQGQKCMVRGLKEDGTLLPVA 296

Query: 363 FIDTEVYIASMVSVKNL--ILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYA 420
           F+D   Y+ ++  V+     L+ D  + +  + Y  E                       
Sbjct: 297 FMDMNCYVTAVREVRGTGYCLMTDAFKGVWFVGYAEE----------------------- 333

Query: 421 GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFM 480
             P + ++ G  +  F  L+                D +     +  ++ DKD  + +  
Sbjct: 334 --PYKMMLFGKSMGNFEVLTA---------------DFVVAGDELHIVVCDKDGVIHVMQ 376

Query: 481 YQPEARESNGGHRLIKKTDFHLGQHVNTFF----KIRCKPSSISDAPGARSRFLTWYASL 536
           + PE  +S  GH L+ +  F    +  T      +    PS+ S +    +  L   AS 
Sbjct: 377 FDPEHPKSLQGHLLLNRASFSAAPNHPTITLSLPRTPISPSATSVSKNPPTTLL--LASP 434

Query: 537 DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAG---NPSRGI 593
            GAL    PL E+ YRRL  L N +     H    NP+  R         G   +  R I
Sbjct: 435 TGALASLTPLSEQAYRRLTSLANSIAGALPHAAATNPKGHRLQPLDARTPGVDTSAGRSI 494

Query: 594 IDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEAL 634
           +DG+L+ ++ +L  G R E+  K G  + D+ +   ++E +
Sbjct: 495 VDGALLARWNELGAGRRSEVAGKGG--YGDVHEVRSELEGV 533


>gi|409046890|gb|EKM56369.1| hypothetical protein PHACADRAFT_93103 [Phanerochaete carnosa
            HHB-10118-sp]
          Length = 1417

 Score =  190 bits (483), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 133/537 (24%), Positives = 250/537 (46%), Gaps = 54/537 (10%)

Query: 97   GYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRI 156
             + GVFL G  P W+  T +G ++  P +    V              FL ++ +    I
Sbjct: 929  AFSGVFLTGDRPCWILSTDKGGVKIMP-SGHQVVHAFTACSLWESKGDFLLYSDEGPSLI 987

Query: 157  SVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELV 216
              +P  + ++   P R +P +  P+       T T  +  S+ + +   Y    ED+ +V
Sbjct: 988  EWVP-EIQFEGHLPSRSIP-RPRPYSHVVFEPTTTLLVAASSLQSTFTSYD---EDRNVV 1042

Query: 217  TDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLR 276
             +P +     P+     + L SP +W  +    F  +E+  V C++ +++E   T +G +
Sbjct: 1043 WEPDEPNMSLPVCETSALELISPDTWTTMDGYEFAQNEF--VTCMECITLETLSTETGTK 1100

Query: 277  GYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTK-NKIKMIYAKEQKGPVTAICH 335
             ++A+ T  N  ED+  +G + +F+++EVVP+P     +  ++K+    E KGPVTA+C 
Sbjct: 1101 DFVAVSTTINRGEDLAVKGAVYIFEVVEVVPDPAMGQKRWYRLKLHCRDEAKGPVTALCG 1160

Query: 336  VAGFLVTAVGQKIYIWQLK-DNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
            +  +LV+++GQKI++  L  D  L G+AF+D  VY+ S+ +VKNL+++GD  + + L+ +
Sbjct: 1161 MDNYLVSSMGQKIFVRALDLDERLVGVAFLDVSVYVTSLRAVKNLLVIGDALKGVWLVAF 1220

Query: 395  QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
            Q +   L ++A+DY P        +  +    +I                          
Sbjct: 1221 QEDPYKLVVLAKDYYPIPVACADLFFADGKASLIS------------------------- 1255

Query: 455  HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC 514
                            D++  + L  Y P   ES  G RL+ +T+FH      T   I  
Sbjct: 1256 ---------------CDEEGVLRLSEYDPHDPESRHGQRLLCRTEFHGQTEYRTSHLIAR 1300

Query: 515  KPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPR 574
            +   + DA   +++ +  +   DG+L     + +   +RL +LQ  +  +  H  GLNP+
Sbjct: 1301 RGKGL-DAEIPQAKLICGHT--DGSLTSLTYVDDAVSKRLHLLQGQLARNVQHVAGLNPK 1357

Query: 575  AFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI 631
            AFR  +     A   ++GI+DG+L+  F  L +  ++E+ ++I ++   +L +  D+
Sbjct: 1358 AFRVVRND-RVARPLTKGILDGNLLAAFEDLPVPRQVEVTRQIATERTTVLKDWLDL 1413


>gi|171695066|ref|XP_001912457.1| hypothetical protein [Podospora anserina S mat+]
 gi|170947775|emb|CAP59938.1| unnamed protein product [Podospora anserina S mat+]
          Length = 1441

 Score =  190 bits (483), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 161/629 (25%), Positives = 279/629 (44%), Gaps = 78/629 (12%)

Query: 17   VQELLTVSLG-LHGNRPLLLVR-TQHELLIYQAFRHPKGA-----LKLRFKKLKVLFVSD 69
            V E+L   LG      P L++R    +L +Y+ +R+  GA       L F+K+    ++ 
Sbjct: 841  VSEILVADLGDTTAKSPYLILRHANDDLTMYEPYRYQLGAGLEFPKTLFFQKIPNSVLAK 900

Query: 70   RSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGP 129
                  +   +    +   +R  +NI GY  VFL GP P+++  +S+   +  P+     
Sbjct: 901  SPAEETDDEEVTHQAKCLALRRCNNIGGYSTVFLPGPSPSFIIKSSKSMPKVLPLQ-GAA 959

Query: 130  VSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSY-DAPWPVRKVPLKCTPHFLAYHLE 188
            V+ ++ FH   C  GF+Y ++ + +R+S LP   S+ +    V+K+P+      +AYH  
Sbjct: 960  VTAISSFHTEGCEHGFIYADSHNIVRVSQLPKDWSFAETGLAVKKIPIGEDIVAVAYHPP 1019

Query: 189  TKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQT 248
            +++Y +  +T EP    ++   +D       R+     P + +  + L  P +W  +   
Sbjct: 1020 SQSYVVACNTPEP----FELPRDDDYHKEWAREVLPFKPTLERGTLKLIGPITWTVV--D 1073

Query: 249  NFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPE 308
               +   E+VLC++ +++E     +  +  I +GT     ED+  RG + ++++ +V+PE
Sbjct: 1074 TIVMEPCENVLCVETLNLEVSEATNERKLLIGVGTAITKGEDLPTRGAVYVYNVADVIPE 1133

Query: 309  PGQPLTKNKIKMIYAKEQ--KGPVTAICHVA--GFLVTAVGQKIYIWQLK-DNDLTGIAF 363
            PG+P T  K+K+I AKE   +G VTA+  +   G ++ A G K  +  LK D  L  +AF
Sbjct: 1134 PGKPETGKKLKLI-AKEDIPRGAVTALSEIGTQGLMLVAQGPKCMVRGLKEDGTLLPVAF 1192

Query: 364  IDTEVYIASMVSV--KNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAG 421
            +D   Y+ S   +    L L+ D  + +    Y  E   + L  +               
Sbjct: 1193 MDMNCYVTSAKELPGTGLCLMADAFKGVWFTGYTEEPYKMMLFGKS-------------- 1238

Query: 422  NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMY 481
                                  RLE+       + D L     +  +  D + ++ +  +
Sbjct: 1239 --------------------NTRLEVL------NADFLPNGKELSIVACDAEGHIHILQF 1272

Query: 482  QPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSIS-------DAPGARSR-FLTWY 533
             PE  +S  GH L+ +T F  G H  T  K    PS++S       +  GA SR  +   
Sbjct: 1273 DPEHPKSLQGHLLLHRTSFSTGAHHVT--KSLLLPSTLSPDNKEDNEENGATSRPHILLL 1330

Query: 534  ASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFR----TYKGKGYYAGNP 589
            AS  G L    PL E  YRRL  L   +    +H  GLNP+ +R    T    G  AG  
Sbjct: 1331 ASPTGVLAALRPLSETAYRRLSSLAAQLTNSLTHAAGLNPKGYRMPSATCPPAGVDAGI- 1389

Query: 590  SRGIIDGSLVWKFLQLSLGERLEICKKIG 618
             R I+DG+++ +F +L   +R E+  + G
Sbjct: 1390 GRHIVDGTILARFSELGRAKRGEVAGRAG 1418


>gi|302694047|ref|XP_003036702.1| hypothetical protein SCHCODRAFT_63425 [Schizophyllum commune H4-8]
 gi|300110399|gb|EFJ01800.1| hypothetical protein SCHCODRAFT_63425 [Schizophyllum commune H4-8]
          Length = 1396

 Score =  190 bits (482), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 167/644 (25%), Positives = 282/644 (43%), Gaps = 91/644 (14%)

Query: 9    PSAMDETIVQELLTVSLGLHGNRP--LLLVRTQHELLIYQAFRHPKGA-----LKLRFKK 61
            P    E  ++++L  +LG    +P  L+L+R+ H L IY+AF   +       LK R   
Sbjct: 807  PRKPQELDIEQILLTNLGQSDPKPHLLVLLRSGH-LAIYEAFATNQAPIVEPPLKPRASS 865

Query: 62   LKVLFVSDRSK-----RANEQPGLPRGVRISQMRY------FSNIAGYQGVFLCGPHPAW 110
            L++ FV   SK     R +E     +G+   Q +       F+      GVF  G  P W
Sbjct: 866  LQIQFVKIASKAFEMQRTDETE---KGILAEQKKALRTFVPFACAGAPAGVFFTGDRPHW 922

Query: 111  LFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRG--FLYFNAKSELRISVLPTHLSYDAP 168
            +  T +G ++ +P    G  +  A        R   FL ++ + +     + T      P
Sbjct: 923  IVATDKGGVQMYP---SGHAAVYAFSACTLWERSTEFLIYSEEGQTLCEWI-TEYEIGRP 978

Query: 169  WPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPL 228
             P+R +P       + Y        ++ + A     +  F+ ED   +  P       P 
Sbjct: 979  LPMRHIPRGRAYSNIVYE---PASSMIVAAASLRARFASFD-EDGNQIWAPDGPGITEPT 1034

Query: 229  VSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYS 288
            V    + L SP  W  +    F  +E+  V  ++ V +E   T +G++ +IA+GT+    
Sbjct: 1035 VECSTLELISPEVWATVDGYEFATNEF--VNTMECVPLETVSTEAGVKHFIAVGTSIVRG 1092

Query: 289  EDVTCRGRILLFDIIEVVPEPGQ-PLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQK 347
            ED+  +G   +F+++EVVP+    P    ++K+    + KGPVTA+C +  +LV+++GQK
Sbjct: 1093 EDLAVKGATYIFEVVEVVPDQSNGPKRWYRLKLRCRDDAKGPVTALCGINNYLVSSMGQK 1152

Query: 348  IYIWQLK-DNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVAR 406
            I++     D  L G+AF+D  VY+ S+ ++KNL+L+GD  R I  + +Q +   L  + R
Sbjct: 1153 IFVRAFDLDERLVGVAFMDVGVYVTSLRALKNLLLIGDVVRGIQFVAFQEDPYKLVTLGR 1212

Query: 407  DYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMG 466
            D       +  ++                                          F+   
Sbjct: 1213 DVSRMCATTVDFF------------------------------------------FAEEA 1230

Query: 467  FMISDKDKNVVLFM--YQPEARESNGGHRLIKKTDF--HLGQHVNTFFKIRCKPSSISDA 522
              I   D+N V+ M  Y PEA +S+ G  L+K+T+F  H     +T    R K     D 
Sbjct: 1231 LAIVTTDENGVMSMYNYDPEAPDSHDGRLLLKQTEFNLHTDFRTSTLIARRTK-----DD 1285

Query: 523  PGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGK 582
            P      L  +   DG L    P+P+   +RL  LQ  +  +  H  GLNP+A R  + +
Sbjct: 1286 PIIPQGILI-FGGTDGTLSCLTPVPDDAAKRLQPLQLQLTRNMQHVAGLNPKALRIVRNE 1344

Query: 583  GYYAGNP-SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDIL 625
              +   P S+GI+DG+L+  F  L +  + E+ ++IG++   IL
Sbjct: 1345 --HVSRPLSKGILDGNLIAYFEHLPITRQDEMTRQIGTERATIL 1386


>gi|342877552|gb|EGU79002.1| hypothetical protein FOXB_10431 [Fusarium oxysporum Fo5176]
          Length = 1399

 Score =  190 bits (482), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 172/630 (27%), Positives = 278/630 (44%), Gaps = 88/630 (13%)

Query: 17   VQELLTVSLG-LHGNRPLLLVRTQ-HELLIYQAFRHPKG------ALKLRFKKLKVLFVS 68
            ++E+L   LG      P L++R Q  +L IY+  RH +       +  L FKK     ++
Sbjct: 807  LREILVADLGDTTSQSPYLILRNQTDDLTIYEPLRHVRDGGETSLSATLTFKKTSNTTLA 866

Query: 69   ----DRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPM 124
                +  +   EQP      R   +R  +NI GY  VFL GP P+++  +S+   R   +
Sbjct: 867  TIPVETEQDDVEQP------RFVPLRPCANINGYSTVFLPGPSPSFVIKSSKSIPRVIGL 920

Query: 125  TIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSY-DAPWPVRKVPLKCTPHFL 183
               G V  ++ FH   C RGF+Y + K   R++ LP   ++ +    V+KVPL      +
Sbjct: 921  QGLG-VRGMSTFHTEGCDRGFIYADDKGIARVTQLPPDTNFTELGISVKKVPLGADVRGI 979

Query: 184  AYHLETKTYCIVTSTAEP-----STDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFS 238
            AYH  T  Y      +EP       DY+K     KE +T        PP + +  + L S
Sbjct: 980  AYHQPTGAYIAGCMISEPFELPKDDDYHKEWA--KETLT-------FPPTMPRGVLKLIS 1030

Query: 239  PFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRIL 298
            P SW  I +    L   E + C+K + +E        R  + +GT  +  ED+  RGR+ 
Sbjct: 1031 PVSWTVIHEVE--LESCESIECMKTLHLEVSEDTKERRFLVTVGTAVSKGEDLPIRGRVH 1088

Query: 299  LFDIIEVVPEPGQPLTKNKIKMIYAKEQ--KGPVTAICHVA--GFLVTAVGQKIYIWQLK 354
            +FDI+ V+PEPG+P T  ++K I A+E   +G VTAI  +   G ++ A GQK  +  LK
Sbjct: 1089 VFDIVTVIPEPGRPETNKRLKAI-AREDIPRGGVTAISEIGTQGLMLVAQGQKCMVRGLK 1147

Query: 355  -DNDLTGIAFIDTEVYIASMVSV--KNLILVGDYARSIALLRYQPEYRTLSLVARDYKPT 411
             D  L  +AF+D   ++++   +    L L+ D  + +    Y  E  T  ++ + +   
Sbjct: 1148 EDGSLLPVAFLDMSCHVSTARELPRTGLCLMADAFKGVWFAGYTEEPYTFKVLGKSHG-- 1205

Query: 412  QPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISD 471
                                            RL +         D L +   +  + +D
Sbjct: 1206 --------------------------------RLPVLVA------DFLPDGEDLAIVAAD 1227

Query: 472  KDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLT 531
             D ++ +  + PE  +S  GH L+ +T F +  +  +   +  +    S  P      + 
Sbjct: 1228 ADGDLHILDFNPEHPKSLQGHLLLHRTSFSVSPNPPSTTLLLPRTLPPSHPPPQDPPHIL 1287

Query: 532  WYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKG---KGYYAGN 588
              AS  G L   +PLPE  YRRLL + N ++   +  GGLN +A R   G    G  A  
Sbjct: 1288 LLASSSGHLATLVPLPETTYRRLLSVTNQLLPALTPHGGLNAKAHRLPDGIRPVGVEAAG 1347

Query: 589  PSRGIIDGSLVWKFLQLSLGERLEICKKIG 618
              R I+DG+++ ++ +L   +R EI  K G
Sbjct: 1348 -GRTIVDGAILARWAELGAAKRAEIAGKGG 1376


>gi|409076059|gb|EKM76433.1| hypothetical protein AGABI1DRAFT_108759 [Agaricus bisporus var.
            burnettii JB137-S8]
          Length = 1413

 Score =  189 bits (481), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 156/631 (24%), Positives = 291/631 (46%), Gaps = 79/631 (12%)

Query: 17   VQELLTVSLGLHGNRPLLLVRTQH-ELLIYQAF---RHPKGALKLRFKKLKVLFVSDRSK 72
            ++++L   +G     P L V  +  +L IY+A    ++P+     R   L++ FV   +K
Sbjct: 829  IEQILLAPIGESSPTPHLCVFLRSGQLAIYEAVVLGQNPEVPDTPRATSLQIQFVKIAAK 888

Query: 73   -------RANEQPGLPRGVRISQMRYFSNIAG------YQGVFLCGPHPAWLFLTSRGEL 119
                     NE+  L    +I++M +   +        Y GVF  G  P W+  T R  +
Sbjct: 889  SFEIQRPEENEKGILAEHKKINRM-FIPFVTSPRPSVTYSGVFFTGDRPHWILSTDRSGV 947

Query: 120  RAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCT 179
            + +P +    V    P         FL +     + +  +P    +D P P+R +P    
Sbjct: 948  QVYP-SGHNVVHAFTPCSLWESKGEFLMYTEDGPILVEWVP-DFQFDGPLPMRSIPRGRA 1005

Query: 180  PHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSP 239
              +     +  T  IV +++  ST +  F+ ED   + +P       P V    + L +P
Sbjct: 1006 --YSNVLFDPSTSLIVAASSLQST-FTSFD-EDGNNIWEPDAPNISSPSVDCSALELIAP 1061

Query: 240  FSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILL 299
              W  +    F  +E+ + + +  V++E   T +G + +IA+GT  +  ED+  +G   +
Sbjct: 1062 DIWATMDGFEFATNEYINDMTI--VTLETAATETGTKDFIAVGTTIDRGEDLAVKGATYI 1119

Query: 300  FDIIEVVPEPGQPLTKN---KIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQL-KD 355
            F+I EVVP+  Q +++    K+++    + KGPVTA+C ++ +LV+++GQKI++     D
Sbjct: 1120 FEIAEVVPD--QAVSQRRWYKLRLRCRDDAKGPVTAVCGLSDYLVSSMGQKIFVRAFDSD 1177

Query: 356  NDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNS 415
              L G+AF+D  VY+ S+ ++KNL+L+GD  +S+  + +Q +   L L+++D +      
Sbjct: 1178 ERLVGVAFMDVGVYVTSLQTLKNLLLIGDAVKSVQFVAFQEDPYKLVLLSKDIQ------ 1231

Query: 416  KGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKN 475
                                           +C        D L   + +  +  D++  
Sbjct: 1232 ------------------------------SVC----VTRADFLFSENDLRLVTGDEEGI 1257

Query: 476  VVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYAS 535
            + ++ Y P+  +S  G  L+ +T+FH  +   T   +  +       P   SR LT   S
Sbjct: 1258 IRIYEYNPQDPDSREGRHLLLETEFHGQREYRTSVLVAHRIKEDQSIPN--SRLLT--GS 1313

Query: 536  LDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNP-SRGII 594
             DG+L     + E+ ++RL +LQ  ++ +  H   LNP+AFR  K +  Y   P +RGI+
Sbjct: 1314 ADGSLASLTIVEEEAFKRLGLLQGQLMRNIQHMAALNPKAFRIVKNE--YVSKPLTRGIL 1371

Query: 595  DGSLVWKFLQLSLGERLEICKKIGSKHNDIL 625
            DG+L+ ++  L +  + E  ++IG+   ++L
Sbjct: 1372 DGNLLGQYESLPINRQSEATQQIGADRVNVL 1402


>gi|426194401|gb|EKV44332.1| hypothetical protein AGABI2DRAFT_187183 [Agaricus bisporus var.
            bisporus H97]
          Length = 1413

 Score =  189 bits (481), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 156/631 (24%), Positives = 291/631 (46%), Gaps = 79/631 (12%)

Query: 17   VQELLTVSLGLHGNRPLLLVRTQH-ELLIYQAF---RHPKGALKLRFKKLKVLFVSDRSK 72
            ++++L   +G     P L V  +  +L IY+A    ++P+     R   L++ FV   +K
Sbjct: 829  IEQILLAPIGESSPTPHLCVFLRSGQLAIYEAVVLGQNPEVPDTPRATSLQIQFVKIAAK 888

Query: 73   -------RANEQPGLPRGVRISQMRYFSNIAG------YQGVFLCGPHPAWLFLTSRGEL 119
                     NE+  L    +I++M +   +        Y GVF  G  P W+  T R  +
Sbjct: 889  SFEIQRPEENEKGILAEHKKINRM-FIPFVTSPRPSVTYSGVFFTGDRPHWILSTDRSGV 947

Query: 120  RAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCT 179
            + +P +    V    P         FL +     + +  +P    +D P P+R +P    
Sbjct: 948  QVYP-SGHNVVHAFTPCSLWESKGEFLMYTEDGPILVEWVP-DFQFDGPLPMRSIPRGRA 1005

Query: 180  PHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSP 239
              +     +  T  IV +++  ST +  F+ ED   + +P       P V    + L +P
Sbjct: 1006 --YSNVLFDPSTSLIVAASSLQST-FTSFD-EDGNNIWEPDAPNISSPSVDCSALELIAP 1061

Query: 240  FSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILL 299
              W  +    F  +E+ + + +  V++E   T +G + +IA+GT  +  ED+  +G   +
Sbjct: 1062 DIWATMDGFEFATNEYINDMTI--VTLETAATETGTKDFIAVGTTIDRGEDLAVKGATYI 1119

Query: 300  FDIIEVVPEPGQPLTKN---KIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQL-KD 355
            F+I EVVP+  Q +++    K+++    + KGPVTA+C ++ +LV+++GQKI++     D
Sbjct: 1120 FEIAEVVPD--QAVSQRRWYKLRLRCRDDAKGPVTAVCGLSDYLVSSMGQKIFVRAFDSD 1177

Query: 356  NDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNS 415
              L G+AF+D  VY+ S+ ++KNL+L+GD  +S+  + +Q +   L L+++D +      
Sbjct: 1178 ERLVGVAFMDVGVYVTSLQTLKNLLLIGDAVKSVQFVAFQEDPYKLVLLSKDIQ------ 1231

Query: 416  KGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKN 475
                                           +C        D L   + +  +  D++  
Sbjct: 1232 ------------------------------SVC----VTRADFLFSENDLRLVTGDEEGI 1257

Query: 476  VVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYAS 535
            + ++ Y P+  +S  G  L+ +T+FH  +   T   +  +       P   SR LT   S
Sbjct: 1258 IRIYEYNPQDPDSREGRHLLLETEFHGQREYRTSVLVAHRIKEDQSIPN--SRLLT--GS 1313

Query: 536  LDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNP-SRGII 594
             DG+L     + E+ ++RL +LQ  ++ +  H   LNP+AFR  K +  Y   P +RGI+
Sbjct: 1314 ADGSLASLTIVEEEAFKRLGLLQGQLMRNIQHMAALNPKAFRIVKNE--YVSKPLTRGIL 1371

Query: 595  DGSLVWKFLQLSLGERLEICKKIGSKHNDIL 625
            DG+L+ ++  L +  + E  ++IG+   ++L
Sbjct: 1372 DGNLLGQYESLPINRQSEATQQIGADRVNVL 1402


>gi|393245434|gb|EJD52944.1| hypothetical protein AURDEDRAFT_81080 [Auricularia delicata TFB-10046
            SS5]
          Length = 1422

 Score =  189 bits (480), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 140/530 (26%), Positives = 247/530 (46%), Gaps = 56/530 (10%)

Query: 100  GVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVL 159
            GVFL G  P W+  T +  +R  P  ++  V +            FL    +    +  +
Sbjct: 939  GVFLTGGKPGWILGTDKTAVRLVP-AVNQVVHSFTACSLWGNRGEFLMNTDEGPCLVEWM 997

Query: 160  PTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDP 219
            P  L  D   P   +P       +AY     T  +V + +   + +  F+ ED   V  P
Sbjct: 998  P-DLRLDEELPSFFMPRGRPYTSIAYE---ATTGMVIAASSLRSRFVLFD-EDGNTVWKP 1052

Query: 220  RDSRFIP-PLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGY 278
             D+ FI  P      + L  P +W  +    F  +E   +  ++ V++E   T +G + +
Sbjct: 1053 -DAEFISDPTTDTSSLELIDPETWTTVDGFEFAFNEM--INTVRTVNLETVSTEAGSKDF 1109

Query: 279  IALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAG 338
            IA+GT     ED+  +G   +F++IEVVP+  Q   ++K+K+    E KGPV+A+C + G
Sbjct: 1110 IAVGTTVFRGEDLAVKGATYIFEVIEVVPDDTQQ-RRHKLKLWCRDEAKGPVSALCGING 1168

Query: 339  FLVTAVGQKIYIWQLKDND-LTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPE 397
            +LV+++GQK+++     N+ L G+AF+D  +Y+ S+ ++KNL+L+GD  +S+  + +Q +
Sbjct: 1169 YLVSSMGQKVFVRAFDLNERLVGVAFMDVGIYVTSLRTLKNLLLIGDAVKSVWFVAFQED 1228

Query: 398  YRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHND 457
               L L+ +D++     S  ++ G                                    
Sbjct: 1229 PFKLQLLGKDFQRAALTSAEFFFG------------------------------------ 1252

Query: 458  ILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPS 517
                F  M  + +D+   + +F Y P   E+  G +L+ +T+F+          I  + +
Sbjct: 1253 ----FGEMTIVSTDEQNVLRIFRYDPMHAEAQDGQKLLCQTEFNTQSDARGTTTI-LRRT 1307

Query: 518  SISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFR 577
            S  D    +S+ +  Y   DG+L   LP+ E  ++RL +LQ  M  +  H  GL+P+AFR
Sbjct: 1308 SDEDILLPQSKIM--YCGTDGSLSALLPVEEHVFKRLHLLQGQMTRNIQHVAGLHPKAFR 1365

Query: 578  TYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDE 627
              +   + A   +RGI+D +L+ KF +L L  ++E  K+IG     IL +
Sbjct: 1366 VVRND-FTARPLARGILDSNLLAKFEELPLSRQVEFTKQIGQSREVILGD 1414


>gi|449543656|gb|EMD34631.1| hypothetical protein CERSUDRAFT_116804 [Ceriporiopsis subvermispora
            B]
          Length = 1440

 Score =  189 bits (480), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 165/645 (25%), Positives = 280/645 (43%), Gaps = 75/645 (11%)

Query: 9    PSAMDETIVQELLTVSLGLHGNRPLLLVRTQH-ELLIYQAFRHPKGA----------LKL 57
            P    E  +++++   LG    RP L V  +  +L +Y+       A          + +
Sbjct: 849  PRKPQELDIEQIVVAPLGESSPRPYLTVFLRSGQLAVYETIPVAPPADPLPNSRSCTILV 908

Query: 58   RFKK-LKVLFVSDRSKRANEQPGLPRGVRISQMR--YFSNIAGYQ---GVFLCGPHPAWL 111
            RF+K L   F   +     E+  L    RIS++   + ++    Q   GVF  G  P W+
Sbjct: 909  RFRKVLSKAFDIQQQNEEVEKSVLAEQKRISRLLIPFVTSPNPGQTLSGVFFTGDRPCWI 968

Query: 112  FLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPV 171
              T +G ++  P +    V              FL ++ +    +  +P  +  D   P 
Sbjct: 969  LSTDKGGVKVFP-SGHSVVHAFTASSVWESKSDFLLYSEEGPSLLEWIP-GVQLDGHLPS 1026

Query: 172  RKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQ 231
            R VP       + Y  +  T  IV +++  S   +    ED  +V +P  S    P    
Sbjct: 1027 RTVPRNKAYSNVVY--DPSTSLIVAASS--SQSRFASYDEDGNIVWEPDASNISLPFCET 1082

Query: 232  FHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDV 291
              + L SP  W  +    F  +E+  V CL  V++E   T SG + YI +GT  N  ED+
Sbjct: 1083 STLELLSPDGWVTLDGYEFAPNEF--VNCLDCVTLETSSTESGTKDYIVVGTTINRGEDL 1140

Query: 292  TCRGRILLFDIIEVVPEPGQPLTK-NKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYI 350
              +G   +F+IIEVVP+P   + + +++K+    + KGPVTA+C + G+LV+++GQKI++
Sbjct: 1141 AVKGAAYVFEIIEVVPDPTAQMKRWHRLKLHCRDDAKGPVTAMCGMNGYLVSSMGQKIFV 1200

Query: 351  WQLK-DNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYK 409
                 D  L G+AF+D  VY+ S+ +VKNL+++ D  +S+  + +Q +   L ++ +D  
Sbjct: 1201 RAFDLDERLVGVAFLDVGVYVTSLCAVKNLLVISDAVKSVWFVAFQEDPYKLVILGKDPY 1260

Query: 410  PTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMI 469
            P       ++       II                                         
Sbjct: 1261 PLYVTKADFFFAEGRVSIIS---------------------------------------- 1280

Query: 470  SDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRF 529
             D+D  + +  Y P   ES  G  L+++T+FH GQ       I  +     D P  +SR 
Sbjct: 1281 CDEDGVMRILEYDPHDPESKNGQHLLRRTEFH-GQVEYRTSAILARRLKGVDIP--QSRL 1337

Query: 530  LTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNP 589
            +      DG+L     + E   +RL +LQ  +  +  H  GLNPR FR  +    Y   P
Sbjct: 1338 ICGLT--DGSLITMTYVEEAASKRLHLLQGQLTRNVQHVAGLNPRGFRIVRND--YVSRP 1393

Query: 590  -SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEA 633
             +RGI+DG+L+  +  L +  + E+ ++IG+    IL +   ++ 
Sbjct: 1394 LTRGILDGNLLMAYEDLPIVRQDEVTRQIGTDRTTILKDWLSLDG 1438


>gi|150951283|ref|XP_001387581.2| pre-mRNA 3'-end processing factor CF II mRNA cleavage and
            polyadenylation factor II complex, subunit CFT1 (CPSF
            subunit) RNA processing and modification [Scheffersomyces
            stipitis CBS 6054]
 gi|149388465|gb|EAZ63558.2| pre-mRNA 3'-end processing factor CF II mRNA cleavage and
            polyadenylation factor II complex, subunit CFT1 (CPSF
            subunit) RNA processing and modification [Scheffersomyces
            stipitis CBS 6054]
          Length = 1341

 Score =  189 bits (479), Expect = 5e-45,   Method: Compositional matrix adjust.
 Identities = 145/612 (23%), Positives = 279/612 (45%), Gaps = 69/612 (11%)

Query: 28   HGNRPLLLVRTQHELLIYQAFRHPKGALKLRFKKLKVLFVSDRSKRANEQPGLPRGVRIS 87
            H    L ++    E+L+Y+ +   +      FKK K L ++   + A      P G  + 
Sbjct: 786  HKEEYLTILTIGGEVLLYKLYFDGEN---YEFKKEKDLAITGAPENA-----YPIGTAVE 837

Query: 88   Q-MRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFL 146
            + + YF N+ GY  +F+ G  P +L L S   +         P  +++PFH+     G +
Sbjct: 838  RRLAYFPNLNGYTCIFVTGVTP-YLILKSLHSIPRIYQFSKIPAVSISPFHDSKVANGLI 896

Query: 147  YFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYY 206
            + + +   RI  LP   +Y+  WP++ + +  +   + YH  + TY + T       DY 
Sbjct: 897  FLDNQQNARICQLPLDFNYENTWPMKLIHIGESIRAITYHESSHTYVVSTF---KDIDYE 953

Query: 207  KFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSM 266
             F+ E K +V   +D    P    +  + L SPF+W  I      L + E  + +K++ +
Sbjct: 954  CFDEEGKPIVGLHKDKP--PSSAYKGSIKLISPFNWSVID--TIELADNELGMTVKSMIL 1009

Query: 267  EYEGTLSGLR---GYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYA 323
            +   +    +    +I +G+     ED++  G   +++II+++PEP +P T +K K ++ 
Sbjct: 1010 DVGSSTKKFKHKKEFIVIGSGKYRMEDLSANGSFRIYEIIDIIPEPDRPETNHKFKEVFK 1069

Query: 324  KEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVG 383
            ++ KG VT++C V+G  + + GQK+ +  L+D+ +  +AF+DT VY++   S  N++++G
Sbjct: 1070 EDTKGAVTSVCEVSGRFLVSQGQKVIVRDLQDDGVVPVAFLDTAVYVSEAKSFGNMMILG 1129

Query: 384  DYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGE 443
            D  +S+ L+ +  E   + ++ +D +    N                             
Sbjct: 1130 DSLKSVWLVGFDAEPFRMIMLGKDLQGLDVN----------------------------- 1160

Query: 444  RLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLG 503
                C    +K  ++         +I+D +  + L  Y PE   +  G RL+ K+ F + 
Sbjct: 1161 ----CADFITKDEEVF-------ILIADNNNVLHLVQYDPEDPTALNGQRLLSKSSFSIN 1209

Query: 504  QHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVT 563
              V T  K   K     D       F T  +++DG+    +P+ E +YRR+ +LQ  +  
Sbjct: 1210 SFV-TCLKSLPKTEEKYDT----GNFQTIGSTIDGSFFSVVPINEASYRRMYILQQQLTD 1264

Query: 564  HTSHTGGLNPRAFRTYKGKGYYAGNP-SRGIIDGSLVWKFLQLSLGERLEICKKIGSK-- 620
               H  GLNPR  R + G    A +  ++ I+D  ++  + +L+   +  +  K+ +K  
Sbjct: 1265 KEYHYCGLNPRLNR-FGGLSMTANDTNTKPILDYDVIRAYGKLNEERKKNLASKVSAKNI 1323

Query: 621  HNDILDELYDIE 632
            + DI  ++ + E
Sbjct: 1324 YQDIWKDIIEFE 1335


>gi|260941626|ref|XP_002614979.1| hypothetical protein CLUG_04994 [Clavispora lusitaniae ATCC 42720]
 gi|238851402|gb|EEQ40866.1| hypothetical protein CLUG_04994 [Clavispora lusitaniae ATCC 42720]
          Length = 1363

 Score =  187 bits (476), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 143/553 (25%), Positives = 256/553 (46%), Gaps = 60/553 (10%)

Query: 88   QMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLY 147
            +M YF +++G   + + G  P  +  +   +++    +   P+ +  PF       G +Y
Sbjct: 857  RMIYFPDVSGTTCIMVTGVIPYMITRSRHSQVKVFKFS-KIPIVSFVPFSTDKIKNGLIY 915

Query: 148  FNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYK 207
             + K   RI  LP+  SYD  WP+RKV +  T   +A+H  + T  + T    P    Y 
Sbjct: 916  LDTKKNARIVELPSEFSYDYNWPIRKVSIGETVKSVAFHEGSNTLVVSTLKEIP----YN 971

Query: 208  FNGEDKELVTDPRDSRFIPPLVS-QFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSM 266
               E+   +   + ++  P  +S +  + L SP +W  I   N  L + E  L +K++ +
Sbjct: 972  CIDEEGNPIVGIKPNK--PSAISYKGSIKLISPVNWSVI--DNIELADNEVGLHVKSMPL 1027

Query: 267  EYEGTLSGLRG---YIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYA 323
            +        +    ++ +GT     ED+ C G   L +II+++PEPG+P T +K K    
Sbjct: 1028 DVGSETKRFKSKKEFVLVGTGKYRLEDLACNGSYKLLEIIDIIPEPGKPETNHKFKEFTQ 1087

Query: 324  KEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVG 383
            ++ +G VT+IC V+G  + A GQKI +  +KDN    +AF+DT V+++   S  NL+++G
Sbjct: 1088 EDTRGAVTSICEVSGRFLVAQGQKIIVRDIKDNSAVSVAFLDTSVFVSESKSFGNLVVLG 1147

Query: 384  DYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGE 443
            D  +S+ L  +  E                         P R I+            LG+
Sbjct: 1148 DTLKSVWLAGFDAE-------------------------PFRMIM------------LGK 1170

Query: 444  RLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLG 503
             L+    +     D L +   +  +++D ++++ +  Y PE   S+ G RL+ K+ F   
Sbjct: 1171 DLQ---GLDVSSADFLVKDEEIYILVADNNRSLHVLQYNPEDPASSNGQRLLHKSSF-TT 1226

Query: 504  QHVNTFFKIRCKPSSISD--APGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVM 561
             ++ T  K   K   +S    P A   F T  ++++GA+    P+ E  YRR+ ++Q  +
Sbjct: 1227 NYLTTCTKSVPKHEQLSTWFDPQAIP-FQTVGSTVEGAMYVVFPISEPTYRRMYIMQQQL 1285

Query: 562  VTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKH 621
            +    H  GLNPR  R  + +     N  R ++D  L+ +F +L+   +  +  KI +K+
Sbjct: 1286 IDKEYHHCGLNPRLNRIGRIESVNYANL-RAMLDCELIRRFSKLNEDRKRTLSSKISTKN 1344

Query: 622  --NDILDELYDIE 632
               DI  +L + E
Sbjct: 1345 VQVDIWKDLIEFE 1357


>gi|301103686|ref|XP_002900929.1| cleavage and polyadenylation specificity factor subunit, putative
            [Phytophthora infestans T30-4]
 gi|262101684|gb|EEY59736.1| cleavage and polyadenylation specificity factor subunit, putative
            [Phytophthora infestans T30-4]
          Length = 1561

 Score =  187 bits (476), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 168/657 (25%), Positives = 282/657 (42%), Gaps = 139/657 (21%)

Query: 80   LPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDG----------- 128
            L  G R   +  F N+    G F  G HP W+ L  RG     PM +             
Sbjct: 945  LRAGFRYPMLTCFYNVNNMSGAFFRGAHPMWI-LGDRGHASFVPMCVPSSAPPKANGTSK 1003

Query: 129  --------PVSTLAPFHNVNCPRGFLYFNAKSELRISVLP-----THLSYDAPWPVRKVP 175
                    PV +  PFH+ +CP GF+YF+++  LR+  LP     T L     + ++K  
Sbjct: 1004 NAAPRVSVPVLSFTPFHHWSCPNGFIYFHSRGALRVCELPSSKTSTILPSSGGFVLQKAE 1063

Query: 176  LKCTPHFLAY-----------HLETKTYCIVTST------AEPSTDYYKFNGEDKELVTD 218
               T H + Y            LE  TY +V S       A+ +T+      E +    D
Sbjct: 1064 FGATLHHMLYLGSHGPGGVAEALEAPTYAVVCSARLKPADADRATEVEGAEEELEPENLD 1123

Query: 219  PRD----SRFIPPLVSQF------HVSLFSPFSWE-EIPQTN----------FPLH--EW 255
            P      S  + P    F      H++      +E  + QT+          F +H   +
Sbjct: 1124 PNGNPLGSNVMAPTAEMFADYETDHMAHTEEDVYELRLVQTDEFGEWGRRGVFRVHFERY 1183

Query: 256  EHVLCLK-----NVSMEYEGTLSGL-------RGYIALGTNY--NYSEDVTCRGRILLF- 300
            E VL +K     + S+  E   S         R Y+ +GT +   + ED + RGR+LL+ 
Sbjct: 1184 EVVLSVKLMYLYDSSLMKEEVASTSPEWNKKKRPYLVVGTGWVGPHGEDESGRGRLLLYE 1243

Query: 301  -DIIEVVPEPGQPLTKN--KIKMIYAKE-QKGPVTAICHVAGFLVTAVGQKIYIWQLKDN 356
             D  + V E G   +    K+++++ KE ++G ++ +  +  +++ AVG K+ +++ K  
Sbjct: 1244 LDYAQYVNEEGGATSGKLPKLRLVFIKEHRQGAISMVSQLGPYVLAAVGSKLIVYEFKSE 1303

Query: 357  DLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSK 416
             L G AF D ++YI ++  VK+ ++ GD  +S+  LR++   R L L+A+DY+P      
Sbjct: 1304 QLIGCAFYDAQMYIVTLSVVKDFVMYGDVYKSVHFLRWREMQRQLVLLAKDYEP------ 1357

Query: 417  GYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNV 476
                           L     + S+ E+                    +  +  D D+N+
Sbjct: 1358 ---------------LAVSATEFSVFEK-------------------KLALLAVDMDENL 1383

Query: 477  VLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPS-SISDAPGAR-----SRFL 530
             +  + P+  ES GG RL++ +DFHLG  V++ F+ R   S S+  A   R     S ++
Sbjct: 1384 HVMQFAPQDIESRGGQRLLRVSDFHLGVQVSSMFRKRVDASGSVVSATNGRNAAPLSNYV 1443

Query: 531  TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGY-YAGNP 589
                + +G +G  +P+ E+ +RRL  LQNVMV        LNPR FR  K       G P
Sbjct: 1444 NVMGTSEGGVGALVPVGERVFRRLFTLQNVMVNTLPQNCALNPREFRILKTNAQRRCGRP 1503

Query: 590  S--------RGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
                     +  +D  ++++FLQL    + E+ + IG+     +  L +++  +S F
Sbjct: 1504 DAWSKKKWKKSFLDAFVLFRFLQLDYVAQKELARCIGTTPEVAMHNLLEVQHATSTF 1560


>gi|12697776|dbj|BAB21613.1| polyadenylation specificity factor [Homo sapiens]
          Length = 216

 Score =  187 bits (475), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 97/255 (38%), Positives = 147/255 (57%), Gaps = 43/255 (16%)

Query: 387 RSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLE 446
           +SI+LLRYQ E +TLSLV+RD KP +  S  +   N                        
Sbjct: 2   KSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDN------------------------ 37

Query: 447 ICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHV 506
                           + +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HV
Sbjct: 38  ----------------AQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHV 81

Query: 507 NTFFKIRCKPSSISDAPGA---RSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVT 563
           NTF++  C+ ++   +  +    ++ +TW+A+LDG +G  LP+ EK YRRLLMLQN + T
Sbjct: 82  NTFWRTPCRGATEGLSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTT 141

Query: 564 HTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHND 623
              H  GLNPRAFR          N  R ++DG L+ ++L LS  ER E+ KKIG+  + 
Sbjct: 142 MLPHHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDI 201

Query: 624 ILDELYDIEALSSHF 638
           ILD+L + + +++HF
Sbjct: 202 ILDDLLETDRVTAHF 216


>gi|392572878|gb|EIW66021.1| hypothetical protein TREMEDRAFT_70300 [Tremella mesenterica DSM 1558]
          Length = 1408

 Score =  187 bits (475), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 136/544 (25%), Positives = 246/544 (45%), Gaps = 65/544 (11%)

Query: 92   FSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTID----GPVSTLAPFHNVNCPRGFLY 147
            + N+ G  G F+ G  P W+  + +  LR + +       GP + L          G  +
Sbjct: 916  YDNLEGQSGAFITGEKPYWIMSSEKHPLRLYGLKQGAMAFGPTTHLGSM-------GEYF 968

Query: 148  FNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYK 207
                    I   P  L+ D   P  +  ++ T   + +   +  Y   T+ + P   Y  
Sbjct: 969  MKIDDGCFICYFPQSLNTDLTMPCDRYEMQRTYTNVVFDPPSGHYLGATAISVPFQAY-- 1026

Query: 208  FNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFS--WEEIPQTNFPLHEWEHVLCLKNVS 265
               E+ E+   P     +PPL  +  + LFS  S  W  I   +F   + E+VL +++V 
Sbjct: 1027 --DEEGEIQLGPEGENLVPPLNERSSLELFSRGSDPWRVIDGYDF--DQNENVLSMQSVL 1082

Query: 266  MEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKE 325
            +E      G R ++A+GT +++ ED   RG + +F+++EVVPEPGQ  +   +K+     
Sbjct: 1083 LESSSVPGGYRDFVAVGTGFDFGEDRATRGNVYIFEVVEVVPEPGQK-SAWALKLRCKDP 1141

Query: 326  QKGPVTAICHVAGFLVTAVGQKIYIWQLK-DNDLTGIAFIDTEVYIASMVSVKNLILVGD 384
             + PV+A+ ++ G+L+ + G K+Y+  L  D  L G+AF+D  +Y+ S+   KN IL+ D
Sbjct: 1142 CRNPVSALGNINGYLLHSNGPKMYVKGLDFDERLMGLAFVDVMIYLTSIKVFKNFILISD 1201

Query: 385  YARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGER 444
              +SI  L +Q +    +++++D  P    S  +        + DG +            
Sbjct: 1202 MVKSIWFLSFQEDPYKFTVISKDLMPISVTSADFL-------VHDGHVT----------- 1243

Query: 445  LEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQ 504
                                  F+  D+  ++ +  + P   ES  G RLI +T++H G 
Sbjct: 1244 ----------------------FLTYDRSGDIRMVDFDPANPESINGERLIVRTEYHGGS 1281

Query: 505  HVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 564
             V T   +  +   + +    +++ +  +A  DG++  F+      +RRL  + + ++ +
Sbjct: 1282 PV-TVSTMIARRRGVEEEFAPQTQIICAHA--DGSISTFVSTKPARFRRLHFVSDQLIRN 1338

Query: 565  TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 624
              H  GLNPRAFRT +     A   SRGI+DG L+ +F    +  + E+ K+IG+    +
Sbjct: 1339 AQHVAGLNPRAFRTVRND-LVAKPLSRGILDGELLGRFAIQPIDRQREMLKQIGTDGGTV 1397

Query: 625  LDEL 628
              +L
Sbjct: 1398 ASDL 1401


>gi|440466842|gb|ELQ36086.1| hypothetical protein OOU_Y34scaffold00669g71 [Magnaporthe oryzae Y34]
 gi|440481991|gb|ELQ62520.1| hypothetical protein OOW_P131scaffold01068g7 [Magnaporthe oryzae
            P131]
          Length = 1475

 Score =  187 bits (474), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 162/628 (25%), Positives = 280/628 (44%), Gaps = 69/628 (10%)

Query: 12   MDETIVQELLTVSLG--LHGNRPLLLVRTQHELLIYQAFRHPKGALKL----RFKKLKVL 65
            M    + E+L   LG  +  +  ++L  + H+L IY+ +R  + +  L    R +KL   
Sbjct: 873  MARETISEILVTDLGDTVFKSPHVILRHSNHDLTIYEPYRIAEDSQSLTKILRLRKLPNP 932

Query: 66   FVSDRSKRAN-EQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPM 124
             V+   +  N E P  P   R   +R  +NIAGY  VF+ G  P++L  +++   +   +
Sbjct: 933  AVAKAPEATNSEDP--PLMSRNMPLRACANIAGYSAVFMPGHSPSFLIKSAKATPKVIGL 990

Query: 125  TIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSY-DAPWPVRKVPLKCTPHFL 183
               G V  ++ FH   C RGF+Y ++    R++ +P   S+ +    V+KVPL      +
Sbjct: 991  RGSG-VRAMSSFHTEGCERGFIYADSAGVARVAQIPKDTSFSELGLSVKKVPLGIDADGI 1049

Query: 184  AYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWE 243
            AYH  T  Y +  S  EP    ++   +D       +++    P+V +  + + +P +W 
Sbjct: 1050 AYHSPTGVYVLTCSYWEP----FELPKDDDYHCEWAKENISFKPMVERSVLKVINPINWS 1105

Query: 244  EIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDII 303
            +I    F  HE    +C++++++E   + +  R  I +GT     ED+  RG I +FD+ 
Sbjct: 1106 DIWTEEFEQHEV--AMCIRSLNLEVSQSTNERRQLITVGTAMCKGEDLPVRGGIYVFDLA 1163

Query: 304  EVVPEPGQPLTKNKIKMIYAKEQ--KGPVTAICHVA--GFLVTAVGQKIYIWQLK-DNDL 358
             VVP+ G+P T  K+K + AKE+  +G VT++  +   G ++ A GQK  +  L+ D  L
Sbjct: 1164 SVVPQKGRPETDKKLKQV-AKEEIPRGAVTSLSEIGTQGLMMVAQGQKTLVRGLQEDGKL 1222

Query: 359  TGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGY 418
              +AF+D   Y+     VK L   G                 L ++A  +K       GY
Sbjct: 1223 PPVAFMDMNCYV---TCVKELAGTG-----------------LCVMADAFKGVW--FCGY 1260

Query: 419  YAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVL 478
              G            +K +          C  +     D+L +   +  + +D D N+ +
Sbjct: 1261 TEGP-----------YKMMLFGKSSTNLECMNV-----DLLPDGKDLLIVAADSDGNLHV 1304

Query: 479  FMYQPEARESNGGHRLIKKTDFHLGQH--VNTFFKIRCKPSSISDAPGARS-RFLTWYAS 535
              + PE  +S  GH L+ +T F  G H    +       P   ++ P + + R     AS
Sbjct: 1305 LQFDPEHPKSLQGHLLLNRTTFSTGAHHPQKSLLLPTTDPRPSTNQPSSDAERQHILMAS 1364

Query: 536  LDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFR-----TYKGKGYYAGNPS 590
              G L    PL +  Y RL  L + ++    H   LNP+A+R     T         +  
Sbjct: 1365 PTGVLAAVQPLSQSTYTRLSALASNLMASVPHHAALNPKAYRLPPTSTRNQVAAVDISVG 1424

Query: 591  RGIIDGSLVWKFLQLSLGERLEICKKIG 618
            R ++DGSL+ ++ +L+ G R E+  + G
Sbjct: 1425 RAVVDGSLLARWAELASGRRAEVAGRAG 1452


>gi|389641257|ref|XP_003718261.1| cft-1 [Magnaporthe oryzae 70-15]
 gi|351640814|gb|EHA48677.1| cft-1 [Magnaporthe oryzae 70-15]
          Length = 1452

 Score =  186 bits (473), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 162/628 (25%), Positives = 280/628 (44%), Gaps = 69/628 (10%)

Query: 12   MDETIVQELLTVSLG--LHGNRPLLLVRTQHELLIYQAFRHPKGALKL----RFKKLKVL 65
            M    + E+L   LG  +  +  ++L  + H+L IY+ +R  + +  L    R +KL   
Sbjct: 850  MARETISEILVTDLGDTVFKSPHVILRHSNHDLTIYEPYRIAEDSQSLTKILRLRKLPNP 909

Query: 66   FVSDRSKRAN-EQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPM 124
             V+   +  N E P  P   R   +R  +NIAGY  VF+ G  P++L  +++   +   +
Sbjct: 910  AVAKAPEATNSEDP--PLMSRNMPLRACANIAGYSAVFMPGHSPSFLIKSAKATPKVIGL 967

Query: 125  TIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSY-DAPWPVRKVPLKCTPHFL 183
               G V  ++ FH   C RGF+Y ++    R++ +P   S+ +    V+KVPL      +
Sbjct: 968  RGSG-VRAMSSFHTEGCERGFIYADSAGVARVAQIPKDTSFSELGLSVKKVPLGIDADGI 1026

Query: 184  AYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWE 243
            AYH  T  Y +  S  EP    ++   +D       +++    P+V +  + + +P +W 
Sbjct: 1027 AYHSPTGVYVLTCSYWEP----FELPKDDDYHCEWAKENISFKPMVERSVLKVINPINWS 1082

Query: 244  EIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDII 303
            +I    F  HE    +C++++++E   + +  R  I +GT     ED+  RG I +FD+ 
Sbjct: 1083 DIWTEEFEQHEV--AMCIRSLNLEVSQSTNERRQLITVGTAMCKGEDLPVRGGIYVFDLA 1140

Query: 304  EVVPEPGQPLTKNKIKMIYAKEQ--KGPVTAICHVA--GFLVTAVGQKIYIWQLK-DNDL 358
             VVP+ G+P T  K+K + AKE+  +G VT++  +   G ++ A GQK  +  L+ D  L
Sbjct: 1141 SVVPQKGRPETDKKLKQV-AKEEIPRGAVTSLSEIGTQGLMMVAQGQKTLVRGLQEDGKL 1199

Query: 359  TGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGY 418
              +AF+D   Y+     VK L   G                 L ++A  +K       GY
Sbjct: 1200 PPVAFMDMNCYV---TCVKELAGTG-----------------LCVMADAFKGVW--FCGY 1237

Query: 419  YAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVL 478
              G            +K +          C  +     D+L +   +  + +D D N+ +
Sbjct: 1238 TEGP-----------YKMMLFGKSSTNLECMNV-----DLLPDGKDLLIVAADSDGNLHV 1281

Query: 479  FMYQPEARESNGGHRLIKKTDFHLGQH--VNTFFKIRCKPSSISDAPGARS-RFLTWYAS 535
              + PE  +S  GH L+ +T F  G H    +       P   ++ P + + R     AS
Sbjct: 1282 LQFDPEHPKSLQGHLLLNRTTFSTGAHHPQKSLLLPTTDPRPSTNQPSSDAERQHILMAS 1341

Query: 536  LDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFR-----TYKGKGYYAGNPS 590
              G L    PL +  Y RL  L + ++    H   LNP+A+R     T         +  
Sbjct: 1342 PTGVLAAVQPLSQSTYTRLSALASNLMASVPHHAALNPKAYRLPPTSTRNQVAAVDISVG 1401

Query: 591  RGIIDGSLVWKFLQLSLGERLEICKKIG 618
            R ++DGSL+ ++ +L+ G R E+  + G
Sbjct: 1402 RAVVDGSLLARWAELASGRRAEVAGRAG 1429


>gi|297722899|ref|NP_001173813.1| Os04g0252200 [Oryza sativa Japonica Group]
 gi|255675253|dbj|BAH92541.1| Os04g0252200, partial [Oryza sativa Japonica Group]
          Length = 432

 Score =  186 bits (472), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 119/392 (30%), Positives = 195/392 (49%), Gaps = 54/392 (13%)

Query: 242 WEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFD 301
           WE   ++  P+  +E+ L ++ V++ +  T       +A+GT Y   EDV  RGR+LLF 
Sbjct: 86  WE--TKSTIPMQLFENALTVRIVTL-HNTTTKENETLLAIGTAYVLGEDVAARGRVLLFS 142

Query: 302 IIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGI 361
             +         ++N +  +Y+KE KG V+A+  + G L+ A G KI + +    +LT +
Sbjct: 143 FTK------SENSQNLVTEVYSKESKGAVSAVASLQGHLLIASGPKITLNKWTGAELTAV 196

Query: 362 AFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAG 421
           AF D  +++ S+  VKN +L GD  +SI  L ++ +   LSL+A+D+        G    
Sbjct: 197 AFYDAPLHVVSLNIVKNFVLFGDIHKSIYFLSWKEQGSQLSLLAKDF--------GSLDC 248

Query: 422 NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMY 481
             +  +IDGS                                ++  + SD DKNV +F Y
Sbjct: 249 FATEFLIDGS--------------------------------TLSLVASDSDKNVQIFYY 276

Query: 482 QPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALG 541
            P+  ES  G +L+ + +FH+G H+  F +++  P+    +    +RF   + +LDG +G
Sbjct: 277 APKMVESWKGQKLLSRAEFHVGAHITKFLRLQMLPTQ-GLSSEKTNRFALLFGNLDGGIG 335

Query: 542 FFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTY--KGKGYYAGNPSRGIIDGSLV 599
              P+ E  +RRL  LQ  +V    H  GLNPR+FR +   GKG+  G     IID  L+
Sbjct: 336 CIAPIDELTFRRLQSLQRKLVDAVPHVCGLNPRSFRQFHSNGKGHRPG--PDNIIDFELL 393

Query: 600 WKFLQLSLGERLEICKKIGSKHNDILDELYDI 631
             +  LSL E+L++ ++IG+  + IL    DI
Sbjct: 394 AHYEMLSLDEQLDVAQQIGTTRSQILSNFSDI 425


>gi|242075246|ref|XP_002447559.1| hypothetical protein SORBIDRAFT_06g003570 [Sorghum bicolor]
 gi|241938742|gb|EES11887.1| hypothetical protein SORBIDRAFT_06g003570 [Sorghum bicolor]
          Length = 389

 Score =  186 bits (472), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 118/389 (30%), Positives = 192/389 (49%), Gaps = 50/389 (12%)

Query: 242 WEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFD 301
           WE   +   P+  +E+ L ++ V+++   T       +A+GT Y   EDV  RGR+LL+ 
Sbjct: 43  WE--TRFTIPMQSFENALTVRIVTLQNTSTKEN-ETLMAIGTAYVQGEDVAARGRVLLYS 99

Query: 302 IIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGI 361
                       ++N +  +Y+KE KG V+A+  + G L+ A G KI + +   ++LT +
Sbjct: 100 F------SRSENSQNLVTEVYSKESKGAVSAVASLQGHLLIASGPKITLNKWTGSELTAV 153

Query: 362 AFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAG 421
           AF D  +++ S+  VKN +L GD  +SI  L ++ +   L+L+A+D+        G    
Sbjct: 154 AFYDAPLHVVSLNIVKNFVLFGDIHKSIYFLSWKEQGSQLNLLAKDF--------GSLDC 205

Query: 422 NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMY 481
             +  +IDGS                                ++  ++SD DKNV +F Y
Sbjct: 206 FATEFLIDGS--------------------------------TLSLVVSDSDKNVQIFYY 233

Query: 482 QPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALG 541
            P+  ES  G +L+ + +FH+G HV+ F +++  P+    A    +RF   + +LDG +G
Sbjct: 234 APKMVESWKGQKLLSRAEFHVGAHVSKFLRLQMLPTQ-GLASEKTNRFALVFGTLDGGIG 292

Query: 542 FFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWK 601
              P+ E  +RRL  LQ  +V    H  GLNPR+FR +K  G         IID  L+  
Sbjct: 293 CIAPVDELTFRRLQSLQRKLVDAVPHVCGLNPRSFRHFKSNGKAHRPGPDNIIDFELLSH 352

Query: 602 FLQLSLGERLEICKKIGSKHNDILDELYD 630
           +  LSL E+LEI ++IG+  + IL    D
Sbjct: 353 YEMLSLEEQLEIAQQIGTTRSQILSNFSD 381


>gi|347838999|emb|CCD53571.1| similar to Cleavage and polyadenylation specificity factor subunit 1
            [Botryotinia fuckeliana]
          Length = 1447

 Score =  186 bits (471), Expect = 5e-44,   Method: Compositional matrix adjust.
 Identities = 171/655 (26%), Positives = 283/655 (43%), Gaps = 89/655 (13%)

Query: 10   SAMDETIVQELLTVSLGLH-GNRPLLLVR-TQHELLIYQAFRHPKGALKLRFKKLKVLFV 67
            SA  ET+  E+L  +LG      P L++R +  +L IY+ FR    +  L    L+ L +
Sbjct: 836  SAARETLT-EILVANLGDSVSQSPYLILRPSNDDLTIYEPFRVKSASPDLLSSTLQFLKI 894

Query: 68   SDR------SKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRA 121
             +          A EQ    +      MR  SN+ GY  VF+ G  P+++  +S+   + 
Sbjct: 895  QNTHLTQAPDVSAEEQVDGAQQTSDKPMRAISNLGGYSTVFMPGGSPSFIIKSSKTAPKV 954

Query: 122  HPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSY-DAPWPVRKVPLKCTP 180
              +   G V +L+ FH   C RGF+Y + +   R++  P + ++ D    +RK+ +    
Sbjct: 955  LSLQGTG-VRSLSSFHTEGCDRGFIYASTEGIARVAQFPPNTTFADIGMALRKIEIGEDV 1013

Query: 181  HFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPF 240
            H +AYH   +TY I TST    TD+ +   +D    T   ++  + P + +  + L SP 
Sbjct: 1014 HAVAYHPPLQTYVIGTSTF---TDF-ELPKDDDHRKTWQEENIALKPSIEKSFLKLVSPV 1069

Query: 241  SWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLF 300
            +W  I      L   E + C+K +++      +  +  I +GT     ED+   GR+ ++
Sbjct: 1070 NWSVIDA--IELEPCELITCIKTMNLVISEVTNERKHLIVVGTAITKGEDLATTGRLYVY 1127

Query: 301  DIIEVVPEPGQPLTKNKIKMIYA----KEQKGPVTAICHVA--GFLVTAVGQKIYIWQLK 354
            D++ VVPEP +P T  K+K+I +    +   GPVT +  +   GF++ A GQK  +  LK
Sbjct: 1128 DVVTVVPEPDRPETNKKLKLISSEIITRGAGGPVTGLSEIGTQGFMLVAQGQKCMVRGLK 1187

Query: 355  DNDLT-GIAFIDTEVYIASMVSV--KNLILVGDYARSIALLRYQPEYRTLSLVARDYKPT 411
            ++     +AF+D   Y+ S+  +    L ++ D  + +    Y  E              
Sbjct: 1188 EDGTNLPVAFMDMNCYVTSVKELPGTGLCVMADALKGVWFAGYTEE-------------- 1233

Query: 412  QPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISD 471
                       P R ++ G    K   L        C        D+L +   +  + +D
Sbjct: 1234 -----------PYRMLLFGKSAAKMEVL--------CA-------DLLPDGKDLFIVAAD 1267

Query: 472  KDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQH----------------VNTFFKIRCK 515
             + N+ +  Y PE  +S  GH L+ +T F LG H                + T       
Sbjct: 1268 ANGNLHIMQYDPEHPKSLQGHLLLHRTTFSLGAHHPTTMTLLPTTRPLPQLTTAPSPSPD 1327

Query: 516  PSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRA 575
            PS   D P      L    S  G L    PL E  YRR   L + +     H  GLNPRA
Sbjct: 1328 PSPQEDTPSPSQPLL--LTSRTGTLALLSPLTESQYRRFGTLVSHLTNTLYHPCGLNPRA 1385

Query: 576  FRTYK--GKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEL 628
            +R  +   +G   G   R IIDG ++ ++++L    R E+  ++G    ++ DEL
Sbjct: 1386 YRIDRDANEGIVGG---RTIIDGGVLGRWMELGSQRRGEVAGRVGVDVLELRDEL 1437


>gi|346319828|gb|EGX89429.1| protein CFT1 [Cordyceps militaris CM01]
          Length = 1452

 Score =  185 bits (470), Expect = 5e-44,   Method: Compositional matrix adjust.
 Identities = 172/652 (26%), Positives = 297/652 (45%), Gaps = 77/652 (11%)

Query: 10   SAMDETIVQELLTVSLG-LHGNRPLLLVR-TQHELLIYQAFRH--PKGALK-----LRFK 60
            +A  ET+  E+L   LG +    P L++R    +L +Y+  R+  P  +       L FK
Sbjct: 839  AAAKETLT-EILVADLGDVVAKSPYLILRHDTDDLTLYEPVRYHEPNSSSAPLSDTLFFK 897

Query: 61   KLKVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELR 120
            K     ++  +  ++++    +  R   ++  +N+ GY  VFL G  P+++  +++   R
Sbjct: 898  KSTNSTIAKSAPASDKEDDETQQKRFVPLQLCANVGGYSAVFLSGDSPSFILKSAKSIPR 957

Query: 121  AHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSY-DAPWPVRKVPLKCT 179
               +   G V  ++ FH   C RGF+Y + K   R+S LPT  +Y +    V+K+PL C 
Sbjct: 958  IVGLQGQG-VQGMSTFHTEGCDRGFIYADTKGIARVSQLPTDTNYAELGISVKKIPLDCD 1016

Query: 180  PHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSP 239
             + +++H  T TY    ST EP    ++   +D       R++    P + +  + L SP
Sbjct: 1017 VNRVSFHSHTATYIAACSTREP----FELPKDDDYHKEWARETVNFAPTMPRGILKLISP 1072

Query: 240  FSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILL 299
             +W  I   +  L   E +  +  + +E        R  +A+G+     ED+  RGR+ +
Sbjct: 1073 AAWTVI--HSLDLESCETIESMMALHLEISEETKERRMVVAVGSAICKGEDLPTRGRVQV 1130

Query: 300  FDIIEVVPEPGQPLTKNKIKMIYAKEQ--KGPVTAICHV--AGFLVTAVGQKIYIWQLK- 354
            FDI+ V+PEPG+P T  ++K++ AKE+  +G VT++  +  +G L+ A GQK  +  L+ 
Sbjct: 1131 FDIVTVIPEPGRPETNKRLKLL-AKEELPRGGVTSLSEIGTSGLLLIAQGQKCMVRGLRE 1189

Query: 355  DNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPN 414
            D  L  +AF+D   +I   + V+ L   G                 L L+A  +K     
Sbjct: 1190 DGGLLPVAFLDMNCHI---LGVRELRGTG-----------------LCLMADAFKGM--- 1226

Query: 415  SKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDK 474
               ++A     G  +    +K L  S G+       I     D L +   +  +  D D 
Sbjct: 1227 ---WFA-----GYTEEPYTFKVLGKSGGQ-------IPMLVADFLPDGEDLNMIGVDADG 1271

Query: 475  NVVLFMYQPEARESNGGHRLIKKTDFHLG--QHVNTFFKIRCKPSSI---SDAPGARSRF 529
            ++ +F + P+  +S  GH L+ +T F L   +   T    R  P+S        GA +  
Sbjct: 1272 DLHVFEFNPDHPKSLQGHLLLHRTTFSLSPNEPTTTVLLERTIPASQPQPQGTTGAETPH 1331

Query: 530  LTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYA--- 586
                +   G L    PL E  YRRLL L N ++      GGL+P+A R  +G+G  +   
Sbjct: 1332 TLLLSCPTGQLAALTPLSESAYRRLLSLANQLMPAVVPYGGLHPKAHRLPEGRGAQSHAR 1391

Query: 587  ------GNPSRGIIDGSLVWKFLQLSLGERLEICKKIG-SKHNDILDELYDI 631
                      R I+DG+++ ++ +L   +R E+  K G    N++ DEL  +
Sbjct: 1392 AVGVETAASGRMIVDGAVLARWTELGAAKRAEMATKSGYDDLNEMRDELEGV 1443


>gi|154320778|ref|XP_001559705.1| hypothetical protein BC1G_01861 [Botryotinia fuckeliana B05.10]
          Length = 1153

 Score =  185 bits (469), Expect = 7e-44,   Method: Compositional matrix adjust.
 Identities = 171/655 (26%), Positives = 283/655 (43%), Gaps = 89/655 (13%)

Query: 10   SAMDETIVQELLTVSLGLH-GNRPLLLVR-TQHELLIYQAFRHPKGALKLRFKKLKVLFV 67
            SA  ET+  E+L  +LG      P L++R +  +L IY+ FR    +  L    L+ L +
Sbjct: 542  SAARETLT-EILVANLGDSVSQSPYLILRPSNDDLTIYEPFRVKSASPDLLSSTLQFLKI 600

Query: 68   SDR------SKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRA 121
             +          A EQ    +      MR  SN+ GY  VF+ G  P+++  +S+   + 
Sbjct: 601  QNTHLTQAPDVSAEEQVDGAQQTSDKPMRAISNLGGYSTVFMPGGSPSFIIKSSKTAPKV 660

Query: 122  HPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSY-DAPWPVRKVPLKCTP 180
              +   G V +L+ FH   C RGF+Y + +   R++  P + ++ D    +RK+ +    
Sbjct: 661  LSLQGTG-VRSLSSFHTEGCDRGFIYASTEGIARVAQFPPNTTFADIGMALRKIEIGEDV 719

Query: 181  HFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPF 240
            H +AYH   +TY I TST    TD+ +   +D    T   ++  + P + +  + L SP 
Sbjct: 720  HAVAYHPPLQTYVIGTSTF---TDF-ELPKDDDHRKTWQEENIALKPSIEKSFLKLVSPV 775

Query: 241  SWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLF 300
            +W  I      L   E + C+K +++      +  +  I +GT     ED+   GR+ ++
Sbjct: 776  NWSVIDA--IELEPCELITCIKTMNLVISEVTNERKHLIVVGTAITKGEDLATTGRLYVY 833

Query: 301  DIIEVVPEPGQPLTKNKIKMIYA----KEQKGPVTAICHVA--GFLVTAVGQKIYIWQLK 354
            D++ VVPEP +P T  K+K+I +    +   GPVT +  +   GF++ A GQK  +  LK
Sbjct: 834  DVVTVVPEPDRPETNKKLKLISSEIITRGAGGPVTGLSEIGTQGFMLVAQGQKCMVRGLK 893

Query: 355  DNDLT-GIAFIDTEVYIASMVSV--KNLILVGDYARSIALLRYQPEYRTLSLVARDYKPT 411
            ++     +AF+D   Y+ S+  +    L ++ D  + +    Y  E              
Sbjct: 894  EDGTNLPVAFMDMNCYVTSVKELPGTGLCVMADALKGVWFAGYTEE-------------- 939

Query: 412  QPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISD 471
                       P R ++ G    K   L        C        D+L +   +  + +D
Sbjct: 940  -----------PYRMLLFGKSAAKMEVL--------CA-------DLLPDGKDLFIVAAD 973

Query: 472  KDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQH----------------VNTFFKIRCK 515
             + N+ +  Y PE  +S  GH L+ +T F LG H                + T       
Sbjct: 974  ANGNLHIMQYDPEHPKSLQGHLLLHRTTFSLGAHHPTTMTLLPTTRPLPQLTTAPSPSPD 1033

Query: 516  PSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRA 575
            PS   D P      L    S  G L    PL E  YRR   L + +     H  GLNPRA
Sbjct: 1034 PSPQEDTPSPSQPLL--LTSRTGTLALLSPLTESQYRRFGTLVSHLTNTLYHPCGLNPRA 1091

Query: 576  FRTYK--GKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEL 628
            +R  +   +G   G   R IIDG ++ ++++L    R E+  ++G    ++ DEL
Sbjct: 1092 YRIDRDANEGIVGG---RTIIDGGVLGRWMELGSQRRGEVAGRVGVDVLELRDEL 1143


>gi|169603229|ref|XP_001795036.1| hypothetical protein SNOG_04622 [Phaeosphaeria nodorum SN15]
 gi|160706354|gb|EAT88382.2| hypothetical protein SNOG_04622 [Phaeosphaeria nodorum SN15]
          Length = 1338

 Score =  185 bits (469), Expect = 8e-44,   Method: Compositional matrix adjust.
 Identities = 155/560 (27%), Positives = 244/560 (43%), Gaps = 66/560 (11%)

Query: 72   KRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVS 131
            + A ++PG       S +    NI GY  V   G  PA++   S    R   ++   PV 
Sbjct: 831  EEAADEPGFE-----STLLALDNINGYSTVIQRGRSPAFILKESSSAPRVIGLS-GNPVK 884

Query: 132  TLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYD-APWPVRKVPLKCTPHFLAYHLETK 190
            +L  FH  +C RGF Y ++   LRIS LP    Y    W  R++P+    H LAYH    
Sbjct: 885  SLTRFHTSSCQRGFAYLDSTDTLRISQLPPSTHYGHLGWAARRMPMDAEVHALAYH---P 941

Query: 191  TYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNF 250
            +   V  T +P  + Y  +  D      P++     P V    + +    +W  I     
Sbjct: 942  SGLYVIGTGQP--EEYTLDPNDTFHYELPKEETSFKPKVEHGIIKVMDEKTWTVI--DTH 997

Query: 251  PLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPG 310
             L   E +LC+K +++E   T    +  IA+GT     ED+  +G I +F++I VVPEP 
Sbjct: 998  VLDPQEVILCIKTLNLEVSETTHQRKDVIAVGTAIVLGEDLATKGNIRIFEVITVVPEPD 1057

Query: 311  QPLTKNKIKMIYAKEQKGPVTAICHVA--GFLVTAVGQKIYIWQLK-DNDLTGIAFIDTE 367
             P T  ++K+I   E KG V+AI  +   GFL+ A GQK  +  LK D  L  +AF+D +
Sbjct: 1058 HPETNKRLKLIVKDEVKGTVSAISDLGTQGFLIMAQGQKSMVRGLKEDGTLLPVAFMDMQ 1117

Query: 368  VYIASMVSVKN--LILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSR 425
             Y+ ++ ++ N  ++L+GD  +      Y  E   + L  R        SK +       
Sbjct: 1118 CYVTTLKTLPNTGMLLMGDAYKGAWFTGYTEEPYKMMLFGR--------SKHHLE----- 1164

Query: 426  GIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEA 485
                  +   FL     E+L I                    +++D D N+ +  + P+ 
Sbjct: 1165 -----CITADFLPFE--EQLHI--------------------IVADADMNLQVLQFDPDH 1197

Query: 486  RESNGGHRLIKKTDFHLGQHVNT--FFKIRCKPSSISDAPGARSRFLTWY----ASLDGA 539
             +S GG RL++K+ FH G   +T    + R    + S+   + +  L  +     S  G 
Sbjct: 1198 PKSMGGTRLLQKSTFHTGHFPSTMHLLQSRLHMPTASEFTTSTTSSLPLHQILCTSQSGT 1257

Query: 540  LGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYK-GKGYYAGNPSRGIIDGSL 598
            L    PL E +YRRL  L   +        GLN +AFR     +G +     R ++DG L
Sbjct: 1258 LALITPLSESSYRRLSGLATHLQQFLDSPCGLNGKAFRAADVMEGGWDAGTQRAMLDGGL 1317

Query: 599  VWKFLQLSLGERLEICKKIG 618
            + ++ +L    R E   K+G
Sbjct: 1318 LMRWGELGEQRRREGLGKVG 1337


>gi|302506529|ref|XP_003015221.1| hypothetical protein ARB_06344 [Arthroderma benhamiae CBS 112371]
 gi|291178793|gb|EFE34581.1| hypothetical protein ARB_06344 [Arthroderma benhamiae CBS 112371]
          Length = 1370

 Score =  184 bits (466), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 155/611 (25%), Positives = 270/611 (44%), Gaps = 100/611 (16%)

Query: 32   PLLLVRTQHE-LLIYQAFRHP--KGALKLRF-KKLKVLFVSDRSKRANEQPGLPRGVRIS 87
            P +++RT+H+ L++Y+ +R     G   LRF K +  + +  R+ +   Q          
Sbjct: 788  PYMILRTKHDDLVLYEPYRTAGESGQSGLRFLKAVNHVVMGPRTDQGVNQDINRSSSSCK 847

Query: 88   QMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDG-PVSTLAPFHNVNCPRGFL 146
             +R   ++ GY+ VF+ G  P ++  ++    R H + + G  V +L+ FH   C RGF 
Sbjct: 848  LLRALPDVCGYRTVFMSGHSPCFILKSAIA--RPHVLRLRGKAVQSLSGFHIAACERGFA 905

Query: 147  YFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYY 206
            Y +                        + L      + Y   ++ Y I TS  E     +
Sbjct: 906  YVD----------------------EDITLGEQVDSIVYSSASECYVIGTSAKED----F 939

Query: 207  KFNGEDKELVTDPRDS--RFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNV 264
            K   ED E  T+ R+    F+P L  +  + L  P +W  I   +  L   E + C++ +
Sbjct: 940  KLP-EDDESHTEWRNEFITFLPQL-ERGTIKLLEPRNWSTI--DSHELEPAERITCIEVI 995

Query: 265  SMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAK 324
             +E        +  + +G++    ED+  +G I +F++I+VVPEP QP    K+K+   +
Sbjct: 996  RLEISELTHERKDMVVVGSSIVKGEDIVPKGFIRVFEVIDVVPEPDQPEKNKKLKLFAKE 1055

Query: 325  EQKGPVTAICHVA--GFLVTAVGQKIYIWQLK-DNDLTGIAFIDTEVYIASMVSVK--NL 379
            E KG VTA+  +   GFL+ A GQK  +  LK D  L  +AF DT+ Y+  +  +K   +
Sbjct: 1056 EVKGAVTALSGIGGQGFLIVAQGQKCMVRGLKEDGSLLPVAFKDTQCYVNVLKELKGTGM 1115

Query: 380  ILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQL 439
             ++GD  + +  + Y  E   L L  ++              N +  ++D          
Sbjct: 1116 CIIGDAFKGLWFIGYSEEPYKLDLFGKE--------------NENLAVVDA--------- 1152

Query: 440  SLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTD 499
                             D L + + +  +++D D N+ +  Y PE   S+ G RL+ ++ 
Sbjct: 1153 -----------------DFLPDGNKLYILVADDDCNLHVLQYDPEDPSSSKGDRLLHRSV 1195

Query: 500  FHLGQHVNTFFKI---RCKPSSISDA--------PGARSRFLTWYASLDGALGFFLPLPE 548
            FH G   +T   +      PSS  D         P ++ + L  + +  G++    PL E
Sbjct: 1196 FHTGHFASTMTLLPHGGHTPSSPVDEDAMDTDSPPPSKYQILMTFQT--GSIAIITPLGE 1253

Query: 549  KNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLG 608
             +YRRLL LQ+ +V    H   LNPR +R  +  G       RG+IDG+L+ ++L +   
Sbjct: 1254 DSYRRLLALQSQLVNALEHPCSLNPRGYRAVESDGMGG---QRGMIDGNLLLRWLDMGAQ 1310

Query: 609  ERLEICKKIGS 619
             + EI  ++G+
Sbjct: 1311 RKAEIAGRVGA 1321


>gi|167526060|ref|XP_001747364.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163774199|gb|EDQ87831.1| predicted protein [Monosiga brevicollis MX1]
          Length = 1324

 Score =  182 bits (463), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 167/632 (26%), Positives = 275/632 (43%), Gaps = 101/632 (15%)

Query: 13   DETIVQELLTVSLGLHGNRPLLLVRT-QHELLIYQAFRHPKGALKLRFKKLKVLFVSDRS 71
            +E IV+ LL + LG  G RP LL RT  H LL+Y+ F               V  V++ S
Sbjct: 748  EEYIVETLL-IGLG-QGQRPHLLARTSDHHLLMYEVFP-------------VVPSVTEAS 792

Query: 72   KRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRA---HPMTIDG 128
             R              +++ F NIAG  GV + GP P  L +    +L+A    P+ ++ 
Sbjct: 793  VR--------------RLKPFQNIAGCDGVCVTGPRP--LLVACGHQLKAITIVPLALED 836

Query: 129  PVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLE 188
             V T  P H  +   GF+YF     L  +  P  L  +     R+  L  T   +A+ L+
Sbjct: 837  AVKTFHPLHMDDVENGFIYFTKAGTLCCATAPDGLMLNRGVLARRAVLGRTIQKIAFDLD 896

Query: 189  TKTYCIVTSTAEPSTDYYKFNG-----EDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWE 243
            ++   ++     P     + N      E   +   P + + + P    F + L SP S +
Sbjct: 897  SRLAALLLMEPRPELKPSRGNNDPPSNELPNISYRPDEPKALTPF---FQLQLLSPKSMK 953

Query: 244  EIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDII 303
             +P T        HV     V +      +G + YIA+G      +  T  G     D  
Sbjct: 954  LLPDTRIEYDLHHHVTSFAAVRLSSSLNSTGKQNYIAVGVTLLEGQRATTTG---FVDFY 1010

Query: 304  EVVPEPGQPLTKNKIKMIYAKEQKGPVTAI-CHVAGFLVTAVGQ----KIYIWQLKD-ND 357
             V    G+   + +++   + +Q G V+A+ C   GFLV AVGQ    KIY+W  +D  +
Sbjct: 1011 TVDVHDGK---ETRLEKRASCKQPGCVSAMDCTEDGFLVAAVGQRLGSKIYVWNFQDGQE 1067

Query: 358  LTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKG 417
            L  +A+ +  +Y + +  +KNL +VGDY   + LLR+                     KG
Sbjct: 1068 LQPLAYFEAGIYTSCIRVIKNLAIVGDYESGVQLLRFS------------------RQKG 1109

Query: 418  YYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGS---KHNDILDEF----SSMGFMIS 470
                   RG                 R     K+G+   K N    +F    S +  +  
Sbjct: 1110 LQQMPVFRGT--------------KHRFYSLVKVGADPHKSNCYCADFVVRESDLAMIYG 1155

Query: 471  DKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPG----AR 526
            D D N+V   Y  ++ ++ GG  L++  +FHLG  ++   +++  P  +  APG    A+
Sbjct: 1156 DADGNLVALDYDADSPDTRGGRILVRSANFHLGTRLSAMLRLQAAP--VVRAPGGLAEAQ 1213

Query: 527  SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYA 586
               +     ++G  G  +PL E  YRRL MLQ  +V+H+S   GL+P  FR +K   +  
Sbjct: 1214 KCHVVHTFGIEGQQGVVIPLHEAEYRRLEMLQKKLVSHSS-LAGLHPFQFRAFKSSIWRP 1272

Query: 587  GNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 618
             + ++GI+DG+L+ ++  L   E+L++ +++G
Sbjct: 1273 RSFAQGILDGALLRQYFCLGRREQLDVAEQLG 1304


>gi|148886829|sp|A2R919.1|CFT1_ASPNC RecName: Full=Protein cft1; AltName: Full=Cleavage factor two protein
            1
 gi|134083776|emb|CAK47110.1| unnamed protein product [Aspergillus niger]
          Length = 1383

 Score =  181 bits (459), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 170/647 (26%), Positives = 285/647 (44%), Gaps = 131/647 (20%)

Query: 32   PLLLVRTQ-HELLIYQAFRHPKGALK----LRFKKLKVLFVSDRSKRANEQ-PGLPRGVR 85
            P L++R++  +L+IY+ F    G ++    L+F           SK  N   P +P GV 
Sbjct: 818  PYLILRSETDDLIIYKPFVVSTGPVEGIHSLKF-----------SKETNSVLPRIPPGVS 866

Query: 86   ISQ----------MRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAP 135
             +Q          +R   +I+G   VF+ G    ++  TS      H + + G  S    
Sbjct: 867  STQPSGSDYRARPLRILPDISGLSAVFMPGASAGFIIRTSASA--PHFLRLRGENSR--- 921

Query: 136  FHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIV 195
                            S +R   LP    +D  W +++V L      LAY   +  Y + 
Sbjct: 922  ---------------SSTVRFCKLPPMTRFDYQWTLKRVHLGEQVDHLAYSTSSGMYVLG 966

Query: 196  TSTAEPSTDYYKFNGEDKELVTDPRDS--RFIPPLVSQFHVSLFSPFSWEEIPQTN---- 249
            T  A   TD+     ED EL  + R+    F P     F + L     W+   Q      
Sbjct: 967  TCHA---TDFKL--PEDDELHPEWRNEAISFFPSARGSF-IKLV----WDHHLQRQDSVI 1016

Query: 250  --FPLHEW-----EHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDI 302
              F LH +     E+V+ +KN+S+E        +  I +GT +   ED+  RG I +F++
Sbjct: 1017 LIFHLHSFSLGADEYVMAIKNISLEVSENTHERKDMIVVGTAFARGEDIPSRGCIYVFEV 1076

Query: 303  IEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA--GFLVTAVGQKIYIWQLK-DNDLT 359
            ++VVP+P  P T  K+K+I  +  KG VTA+  +   GF++ A GQK  +  LK D  L 
Sbjct: 1077 VQVVPDPDHPETDRKLKLIGKEPVKGAVTALSEIGGQGFVLVAQGQKCMVRGLKEDGSLL 1136

Query: 360  GIAFIDTEVYIASMVSVK--NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKG 417
             +AF+D + Y++ +  +K   + ++GD  + +    Y  E   +SL A+D          
Sbjct: 1137 PVAFMDMQCYVSVVKELKGTGMCILGDAVKGVWFAGYSEEPYKMSLFAKDL--------- 1187

Query: 418  YYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVV 477
                                     + LE+C        + L +   +  +++D D N+ 
Sbjct: 1188 -------------------------DYLEVCAA------EFLPDGKRLFIVVADSDCNIH 1216

Query: 478  LFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI-RCKPSS---ISDAPG-----ARSR 528
            +  Y PE  +S+ G RL+ ++ FH+G   +T   + R   SS   +S + G         
Sbjct: 1217 VLQYDPEDPKSSNGDRLLSRSKFHMGNFASTLTLLPRTMVSSEKMVSSSDGMDIDNQSPL 1276

Query: 529  FLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGN 588
                  + +G+LG    +PE++YRRL  LQ+ +     H  GLNPRAFR  +      G 
Sbjct: 1277 HQVLMTTQNGSLGLITCIPEESYRRLSALQSQLTNTLEHPCGLNPRAFRAVESD----GT 1332

Query: 589  PSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALS 635
              RG++DG+L++K++ +S   + EI  ++G++  +I     D+EA+S
Sbjct: 1333 AGRGMLDGNLLFKWIDMSKQRKTEIAGRVGAREWEI---KADLEAIS 1376


>gi|344305212|gb|EGW35444.1| pre-mRNA 3'-end processing factor CF II [Spathaspora passalidarum
            NRRL Y-27907]
          Length = 1348

 Score =  181 bits (458), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 137/576 (23%), Positives = 260/576 (45%), Gaps = 59/576 (10%)

Query: 56   KLRFKKLKVLFVSDRSKRANEQP--GLPRGVRISQ-MRYFSNIAGYQGVFLCGPHPAWLF 112
            KL F     +F  ++  R    P    P G  I + + YF N+ GY  +F+ G  P  + 
Sbjct: 801  KLYFDGENYIFKKEKDLRITGAPENAYPLGTTIERRLVYFPNLNGYTSIFVTGIIPYLIM 860

Query: 113  LTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVR 172
                   R    +   P  +++ F +     G ++ +     RI  L    +Y+  WP+R
Sbjct: 861  KPMHSIPRIFQFS-KIPALSISAFSDSKIKNGLIFLDNSKNARICELSLDFTYEFNWPMR 919

Query: 173  KVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQF 232
            ++ +  +   + YH  + TY + T    P    Y    ED +L+      +   P+  + 
Sbjct: 920  QIHIGDSIKSITYHETSNTYVVSTFREIP----YDGLDEDGKLIVGTLPDKTPRPVAYKG 975

Query: 233  HVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRG---YIALGTNYNYSE 289
             + + SP +W  I      L + E  + ++++ ++   ++   +    +I +G+    +E
Sbjct: 976  SIKMISPLNWTVI--DTIELDDTEVAMNVQSMMLDVGSSMKKFKNKKEFIVIGSGKYRNE 1033

Query: 290  DVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIY 349
            D+   G   +F+I+++VPEPG+P T +K K ++ ++ +G VT+IC ++G L+ A GQK+ 
Sbjct: 1034 DLVANGSFKIFEIVDIVPEPGKPETNHKFKEVFQEDTRGAVTSICGLSGRLLIAQGQKVI 1093

Query: 350  IWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYK 409
            +  ++D+ +  +AF+DT VY++   S+ NL+++GD  +S  L+ +  E   + ++ +D  
Sbjct: 1094 VRDVQDDGVVPVAFLDTAVYVSESKSLGNLLMLGDPLKSCWLVGFDAEPFRMIMLGKD-- 1151

Query: 410  PTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMI 469
                                       L +S G+ +       +K  DI         +I
Sbjct: 1152 ------------------------LHHLNVSCGDFI-------TKDEDIY-------MLI 1173

Query: 470  SDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARS-- 527
            +D +  + L  Y P+  +S  G RLI K+ F +   V+   K+    SS   +    S  
Sbjct: 1174 ADNNNILHLIQYDPDDPQSLNGQRLISKSAFEIESTVSCMRKLPKIESSFEKSEIKFSPI 1233

Query: 528  -RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYA 586
              F    ++ DG+     P+ E +YRR+ +LQ  +     H  GLNPR  R + G     
Sbjct: 1234 DEFQIIGSTSDGSFFNVFPVDESSYRRMYILQQQLTDKEYHYCGLNPRLNR-FGGAIELR 1292

Query: 587  GNP--SRGIIDGSLVWKFLQLSLGERLEICKKIGSK 620
             N   ++ I+D  L+ ++ QL+   +  +  K+ +K
Sbjct: 1293 DNETNTKPILDFGLIKRYAQLNEDRKRNLASKVSAK 1328


>gi|448530371|ref|XP_003870046.1| mRNA cleavage and polyadenylation factor [Candida orthopsilosis Co
            90-125]
 gi|380354400|emb|CCG23915.1| mRNA cleavage and polyadenylation factor [Candida orthopsilosis]
          Length = 1327

 Score =  180 bits (457), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 153/615 (24%), Positives = 275/615 (44%), Gaps = 78/615 (12%)

Query: 28   HGNRPLLLVRTQHELLIYQAFRHPKGALKLRFKKLKVLFVSDRSKRANEQPGLPRGVRIS 87
            H    L ++    E+L+Y+ F   +  +   FKK K L ++   + A        G  + 
Sbjct: 775  HKEEYLTILTISGEVLMYKLFYDGENYM---FKKEKDLKITGAPENA-----FNLGTMVE 826

Query: 88   Q-MRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFL 146
            + + YF N+ GY  +F+ G  P  +  +     R    +   P  +++ F +     G +
Sbjct: 827  RRLVYFPNLNGYTSIFVAGVIPFLIIKSCHSIPRIFQFS-KIPAVSISAFSDSKIKNGLI 885

Query: 147  YFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYY 206
            + +     RI  L    +Y+   P+R+V +  +   +AYH ++ T  I T    P   Y 
Sbjct: 886  FLDNNQNARICELSLDYNYEFNLPIRRVHIGESIRSVAYHEQSDTVVISTFKEIP---YN 942

Query: 207  KFNGEDKELVTDPRDSRFIPPLVS-QFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVS 265
              + E K +    +D    PP  S +  + L SPF+W+ I      L + E  + +K++ 
Sbjct: 943  CVDEEGKPIAGVLKDK---PPATSFKGSIKLVSPFNWKVI--DTIELQDNEVGMAIKSMV 997

Query: 266  MEYEGTLSGL---RGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIY 322
            ++   ++      R YI +GT     ED+   G   ++DII+++PEPG+P T +K K I+
Sbjct: 998  LDVGSSMKKFKTKREYIVVGTGKLRMEDLAANGSFKIYDIIDIIPEPGKPETNHKFKEIF 1057

Query: 323  AKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILV 382
             ++ +G VT++C ++G  +   GQK+ +  L+D+ +  +AF+DT VY++   S  NL L+
Sbjct: 1058 QEDTRGAVTSVCDLSGRFLVGQGQKVIVRDLEDDGVVPVAFLDTPVYVSEAKSFGNLFLL 1117

Query: 383  GDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLG 442
            GD  +SI L+ ++ +   + ++ +D +                                 
Sbjct: 1118 GDPLKSIWLVGFEADPFRMVMLGKDRQHL------------------------------- 1146

Query: 443  ERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHL 502
             R+E C     K  +I         +++D + ++ L  + P+  +S  G  LI K  F  
Sbjct: 1147 -RVE-CADFIVKDEEIF-------ILVADVNNSLHLIQFDPDDPKSINGTILINKASFET 1197

Query: 503  GQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMV 562
                     +R  P       G    + T  +++DGA     P+ E  YRR+ ++Q  + 
Sbjct: 1198 NSQTTC---LRSVPK------GETGDYQTIGSTIDGAFFNVFPVNESTYRRMYIVQQQIS 1248

Query: 563  THTSHTGGLNPRAFRTYKGKGYYAGNPSRG--IIDGSLVWKFLQLSLGERLEICKKI--- 617
                H  GLNPR  R + G      N +    I+D +L+ +F +L+L  +  I  KI   
Sbjct: 1249 DKEYHYCGLNPRLNR-FGGAVQIRDNDTNAKPILDYNLIKEFAKLNLDRQKNITTKINIK 1307

Query: 618  GSKHNDILDELYDIE 632
            GS H DI  +L ++E
Sbjct: 1308 GSAH-DIWKDLIELE 1321


>gi|400597740|gb|EJP65470.1| CPSF A subunit region [Beauveria bassiana ARSEF 2860]
          Length = 1444

 Score =  179 bits (453), Expect = 5e-42,   Method: Compositional matrix adjust.
 Identities = 173/667 (25%), Positives = 297/667 (44%), Gaps = 89/667 (13%)

Query: 2    GNFRSHSPSAMDETIVQELLTVSLG-LHGNRPLLLVR-TQHELLIYQAFRH-------PK 52
             NF     +A +   + E+L   LG +    P L++R    +L +Y+  R+       P 
Sbjct: 824  ANFTGRKAAAKER--LTEILVADLGDVVSKSPFLILRHDTDDLTLYEPVRYQEPNSSSPP 881

Query: 53   GALKLRFKKLKVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLF 112
                L FKK     ++  +   +++    +  R   ++   N+ GY  VFL G  P+++ 
Sbjct: 882  LTDTLFFKKSANATIAKSASAFDKEEDETQQRRFVPLQPCGNVGGYSTVFLSGDSPSFVL 941

Query: 113  LTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSY-DAPWPV 171
             +++   R   +   G V  ++ FH   C RGF+Y + K   R+  LPT  +Y +    V
Sbjct: 942  KSAKSIPRIVGLQGQG-VQGMSTFHTAGCDRGFIYADTKGIARVCQLPTDTNYAELGISV 1000

Query: 172  RKVPLKCTPHFLAYHLETKTYCIVTSTAEP-----STDYYKFNGEDKELVTDPRDSRFIP 226
            +K+PL C  + +++H  T TY    ST EP       DY+K     +E+V+         
Sbjct: 1001 KKIPLDCDVNRVSFHSHTATYIAACSTREPFELPKDDDYHKEWA--REVVS-------FA 1051

Query: 227  PLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYN 286
            P + +  + L SP +W  I   +  L   E +  +  + +E        R  +A+G+   
Sbjct: 1052 PTMPRGMLKLISPAAWTVI--HSLDLESCETIESMMALHLEISEETKERRMLVAVGSAIC 1109

Query: 287  YSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQ--KGPVTAICHV--AGFLVT 342
              ED+  RGR+ +FDI+ V+PEPG+P T  ++K+  AKE+  +G VT++  +  +G L+ 
Sbjct: 1110 KGEDLPTRGRVQVFDIVTVIPEPGRPETNKRLKL-QAKEELPRGGVTSLSEIGTSGLLLI 1168

Query: 343  AVGQKIYIWQLK-DNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTL 401
            A GQK  +  L+ D  L  +AF+D   +I   + V+ L   G                 L
Sbjct: 1169 AQGQKCMVRGLREDGGLLPVAFLDMNCHI---LGVRELRGTG-----------------L 1208

Query: 402  SLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDE 461
             L+A  +K        ++A     G  +    +K L  S G+       I     D L +
Sbjct: 1209 CLMADAFKGM------WFA-----GYTEEPYTFKVLGKSGGQ-------IPMLVADFLPD 1250

Query: 462  FSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLG--QHVNTFFKIRCKPSSI 519
               +  +  D D ++ +F + P+  +S  GH LI +T F L   +   T    R  P+S 
Sbjct: 1251 GEDLSMIGVDADGDLHVFEFDPDHPKSLQGHLLIHRTTFSLSPNEPTTTVLLERTIPASQ 1310

Query: 520  ---SDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAF 576
                   GA +      +   G L    PL E  YRRLL L N ++      GGL+P+A 
Sbjct: 1311 PQPKGTTGAETPHTLLLSCPTGQLAALTPLSESAYRRLLSLTNQVLPAVVPHGGLHPKAH 1370

Query: 577  RTYKGKGYYA---------GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDE 627
            R  +G+G  +             R I+DG+++ ++ +L   +R E+  K G  ++D+ + 
Sbjct: 1371 RLPEGRGAQSHSRAVGVETAASGRMIVDGAVLARWTELGAAKRAEMALKSG--YDDVHEM 1428

Query: 628  LYDIEAL 634
              ++E +
Sbjct: 1429 RGELEGV 1435


>gi|254564833|ref|XP_002489527.1| RNA-binding subunit of the mRNA cleavage and polyadenylation factor
            [Komagataella pastoris GS115]
 gi|238029323|emb|CAY67246.1| RNA-binding subunit of the mRNA cleavage and polyadenylation factor
            [Komagataella pastoris GS115]
 gi|328349950|emb|CCA36350.1| Protein cft1 [Komagataella pastoris CBS 7435]
          Length = 1388

 Score =  179 bits (453), Expect = 6e-42,   Method: Compositional matrix adjust.
 Identities = 143/551 (25%), Positives = 246/551 (44%), Gaps = 66/551 (11%)

Query: 81   PRGVRISQ-MRYFSNIAGYQ--GVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFH 137
            P+G ++ + +   +NI   +   +F+ G    W+       +  H  T    +S  A F+
Sbjct: 874  PQGTKLERRLIKLNNIGDSKLSTLFVVGVKSFWITKRHSSSINIHQFTKLSTISC-ARFN 932

Query: 138  NVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTS 197
               C  G +  +     R+  +P++L      P+R+VP+ CT   +A+H  ++T+ + T 
Sbjct: 933  TSRCKNGLMIIDTNKAARMVEIPSNLELSQRLPIRRVPVGCTIKCVAFHKASRTFVVSTV 992

Query: 198  TAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEH 257
               P    Y    E+   +    ++   P    +  + L SP SW  I   +F L E EH
Sbjct: 993  EETP----YNCVDEEGNPIVGVDNTINKPASSFKSSIKLISPISWTVID--SFDL-EDEH 1045

Query: 258  VLCLKNVSMEYEGT----LSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPL 313
            V C+   SM    +       L+ Y+ LG +    ED+   G+I + D+++++PEPG+P 
Sbjct: 1046 V-CMSLKSMTLNTSRIPMFKNLKEYLVLGISNYRMEDLASNGQIRIVDVVDIIPEPGKPE 1104

Query: 314  TKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLK-DNDLTGIAFIDTEVYIAS 372
            T +K K I+    KG VT++  ++G  V   GQKI +  L+ DN    + F+DT  Y++ 
Sbjct: 1105 TNHKFKDIFQDATKGAVTSVSDISGRFVIGQGQKIIVRDLQEDNTALPVGFVDTPFYVSE 1164

Query: 373  MVSVKNLILVGDYARSIALLRYQPE-YRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGS 431
              S +NL+LVGD   S+ L+ +  E YR +SL  +D                        
Sbjct: 1165 TKSFQNLLLVGDSMHSVILVGFDAEPYRMISL-GKDVA---------------------- 1201

Query: 432  LVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGG 491
                         +++C        D +    ++  +I+D+D  + L  Y PE   S  G
Sbjct: 1202 ------------HVDVCAA------DFVVFEGNLFIIIADEDGMLHLIQYDPEDPASMQG 1243

Query: 492  HRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWY----ASLDGALGFFLPLP 547
             RL++++ F   Q+  T  K+R +   I       + F   +    A+ DG+     P+ 
Sbjct: 1244 QRLLRRSIFKTNQYT-TCMKMRERKYVIKPPKNQFTNFSEAFEVVAANSDGSFYKVTPIS 1302

Query: 548  EKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSL 607
            E  YRRL ++Q  +    +H  GLNPR  R Y    Y   N  R I+D   + +FL+   
Sbjct: 1303 EATYRRLYVIQQQIFDQENHKCGLNPRENR-YLSDQYSIPN-QRLILDFDNIRRFLEFDE 1360

Query: 608  GERLEICKKIG 618
             ++ ++  K+G
Sbjct: 1361 IKKRDLVHKLG 1371


>gi|425765419|gb|EKV04111.1| Cleavage and polyadenylation specificity factor subunit A, putative
            [Penicillium digitatum Pd1]
 gi|425767100|gb|EKV05682.1| Cleavage and polyadenylation specificity factor subunit A, putative
            [Penicillium digitatum PHI26]
          Length = 1271

 Score =  176 bits (445), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 132/493 (26%), Positives = 228/493 (46%), Gaps = 66/493 (13%)

Query: 154  LRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDK 213
            +R   LP+   +D  W +RKVP++   +FLAY   ++TY + TS      D+    G+  
Sbjct: 822  IRACQLPSQTQFDYSWTLRKVPIEEQVNFLAYSTSSETYVLGTSR---QGDFKLPEGD-- 876

Query: 214  ELVTDPRDSRF-IPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTL 272
            EL  + R+      P + +  + + SP +W  I   ++PL   E V  +KNV++E     
Sbjct: 877  ELHPEWRNEELSFCPKIPESSIKVVSPKTWTII--DSYPLDPDEQVTAVKNVNIEVSENT 934

Query: 273  SGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTA 332
                  I +GT     ED+  RG I +FD+I+V P+P +P T  K+K+I  +  KG VTA
Sbjct: 935  HERMDLIVVGTAIAKGEDMPARGTIYVFDVIKVAPDPERPETGRKLKLIGKETVKGAVTA 994

Query: 333  ICHVA--GFLVTAVGQKIYIWQLK-DNDLTGIAFIDTEVYIASMVSVK--NLILVGDYAR 387
            +  +   GF++ A GQK  +  LK D  L  +AF+D + Y+  +  +K   ++++GD  +
Sbjct: 995  LSGIGGQGFIIVAQGQKCMVRGLKEDGSLLPVAFMDMQCYVNVVKELKGTGMVILGDAVK 1054

Query: 388  SIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEI 447
             +    Y  E   ++L  +D                                   E LE+
Sbjct: 1055 GLWFAGYSEEPYRMTLFGKD----------------------------------PEYLEV 1080

Query: 448  CKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVN 507
                     D L + + +  +++D D N+ +  Y PE  +S+ G RL+ ++ F+ G   +
Sbjct: 1081 VAA------DFLPDGNKLYMLVADSDCNLHVLQYDPEDPKSSNGDRLLSRSKFYTGNFAS 1134

Query: 508  TFFKI---------RCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQ 558
            +   +                 D     +++    AS +G+L     + E++YRRL  LQ
Sbjct: 1135 SVTLLPRTAVSSELTESSEEAMDVDETFAKYQVLIASQNGSLALVTSVAEESYRRLSGLQ 1194

Query: 559  NVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 618
            + ++    H  GLN RAFR  +  G  AG   RG++DG+L+  +L +    + EI  ++G
Sbjct: 1195 SQLINTVDHPAGLNARAFRATESDG-AAG---RGMVDGNLLRLWLNMGKQRQAEIAGRVG 1250

Query: 619  SKHNDILDELYDI 631
            +   +I  +L  I
Sbjct: 1251 ATEWEIKADLETI 1263


>gi|393220097|gb|EJD05583.1| cleavage factor protein [Fomitiporia mediterranea MF3/22]
          Length = 1450

 Score =  175 bits (444), Expect = 7e-41,   Method: Compositional matrix adjust.
 Identities = 156/617 (25%), Positives = 276/617 (44%), Gaps = 95/617 (15%)

Query: 41   ELLIYQAFR-----HPKGALKLRFKKLKVLFVSDRSKRANE-QPGLPRGVRISQMRYFSN 94
            +L IYQA        P+  ++    K+K + +  RS    + +P     V   Q R   +
Sbjct: 887  QLAIYQAVAVDKDDFPESTVRTSTLKIKFIKMGTRSFEPRQLEPAEKSSVIAEQRRALRS 946

Query: 95   IAGY----------QGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRG 144
            +  +           GVF+ G  P W+  T +  L+ H  +          F  VN    
Sbjct: 947  LVPFIVSPNSEKRVSGVFVTGDEPCWIVATDKDGLKIHSCS----------FQTVNSFTS 996

Query: 145  FLYFNAKSELRI----SVLPTHLSY------DAPWPVRKVPLKCTPHFLAYHLETKTYCI 194
               +++K +  +    +  P  L +          P + V +  T  +     +  +  +
Sbjct: 997  CSVWDSKCDFLMHTDEAFGPCLLGWIPEFNLGTDMPSKTVTVGRT--YTNVTFDAASGLM 1054

Query: 195  VTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHE 254
            V S+  P+  +  F+ E  +L  +P       P      + LF   S        +    
Sbjct: 1055 VASSVVPNP-FTIFDEEGNKL-WEPDAPNINYPHSVMSALELF--HSDLSCVMDGYEFQP 1110

Query: 255  WEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLT 314
             E V  L  V +E + T SG + +I +GT  N  ED+  +G   +F+I+E+VP+P   L 
Sbjct: 1111 NEFVTALDCVQLETQSTESGTKEFIVVGTTVNRGEDLAVKGVTYVFEIVEIVPDPEGGLA 1170

Query: 315  KN-KIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLK-DNDLTGIAFIDTEVYIAS 372
            +  K++++   E KGPVTA+C + G+LV+++GQKI++  L  D  L G+AF+D  VY+ S
Sbjct: 1171 RQFKLRLLCKDEAKGPVTALCGMNGYLVSSMGQKIFVRALDLDERLVGVAFLDVGVYVTS 1230

Query: 373  MVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSL 432
            + ++KNL+++GD  +S+ L+ +Q +   L +VA++                         
Sbjct: 1231 LRTIKNLLIIGDAVKSVWLVAFQEDPFKLVIVAKEV------------------------ 1266

Query: 433  VWKFLQLSLGERLEICKKIGSKHNDILDEFSSMG---FMISDKDKNVVLFMYQPEARESN 489
                      +RL++         D L  F+S G     +SD++  + L  Y     ES+
Sbjct: 1267 ----------QRLDVMTA------DFL--FASDGDFYIAVSDEEGIIRLLEYDTSDPESH 1308

Query: 490  GGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPL-PE 548
             G  L+++T++H     +T   I  +  +    P AR       A++DG++    P+  +
Sbjct: 1309 SGQYLLRRTEYHAQVESHTTVLIARRSQNDGLVPQAR----LISAAVDGSMYALTPVDAD 1364

Query: 549  KNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLG 608
            ++ +RL +LQ  +  +  H  GLNPRAFR  +  G  A   ++GI+DG+L+  F QL + 
Sbjct: 1365 ESAKRLQLLQGQLTRNMQHVAGLNPRAFRAVRSDG-VARPLTKGILDGNLLAGFEQLPIP 1423

Query: 609  ERLEICKKIGSKHNDIL 625
             + EI + IG+    +L
Sbjct: 1424 RQNEIARPIGTDRLAVL 1440


>gi|402085944|gb|EJT80842.1| cft-1 [Gaeumannomyces graminis var. tritici R3-111a-1]
          Length = 1450

 Score =  175 bits (443), Expect = 7e-41,   Method: Compositional matrix adjust.
 Identities = 163/627 (25%), Positives = 283/627 (45%), Gaps = 72/627 (11%)

Query: 17   VQELLTVSLGLHG-NRPLLLVR-TQHELLIYQAFRHPKGALKL----RFKKLKVLFVSDR 70
            + EL+   LG      P L++R +  +L IY+ F+  + +  L    RF+KL    V+ +
Sbjct: 848  LSELMVTDLGDSTFKSPHLILRHSNDDLTIYEPFKIAESSQSLSGTLRFRKLPNPAVA-K 906

Query: 71   SKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGP- 129
            S+        P  +R   +R   NIAGY  VFL G  P++L  +S+   R   + + GP 
Sbjct: 907  SQDTKVSDDAPAPMRRMPLRACGNIAGYSCVFLPGHSPSFLIKSSKSTPRV--IGLQGPG 964

Query: 130  VSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSY-DAPWPVRKVPLKCTPHFLAYHLE 188
            V  ++PFH   C RGF+Y + +   R++ +P   S+ +    V+KVPL      +AYH  
Sbjct: 965  VRAMSPFHTKGCDRGFIYADYEGVARVAQIPNDCSFAELGLSVKKVPLNMDADGIAYHTP 1024

Query: 189  TKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQT 248
            +  Y +  S  EP    ++   +D+      +++    P      + + +P +W EI   
Sbjct: 1025 SGVYVVTCSFWEP----FELPSDDESHREWAKENITFKPQTEHSVLKVINPVNWSEIWTE 1080

Query: 249  NFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPE 308
             F  +E    +C+K++++E   + +  R  I +GT     ED+  RG + ++D+  VVP+
Sbjct: 1081 EFDKNEV--AMCIKSLNLEVSQSTNERRHLITVGTAICKGEDLPVRGCVYVYDLASVVPQ 1138

Query: 309  PGQPLTKNKIKMIYAKE-QKGPVTAICHVA--GFLVTAVGQKIYIWQL-KDNDLTGIAFI 364
              +P T  K+K++   E  +G VTA+  +   G ++ A GQK  +  L +D  L  +AF+
Sbjct: 1139 KDRPETDKKLKLMAKDEVPRGAVTALSEIGTQGLMLVAQGQKCLVRGLGEDGRLLPVAFM 1198

Query: 365  DTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPS 424
            D   Y++     K L   G  A + A   ++  + T                GY  G P 
Sbjct: 1199 DMNCYVS---CAKELPGTGFCAMADA---FKGVWFT----------------GYTEG-PY 1235

Query: 425  RGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPE 484
            + +I G              LE+       + D L +  ++  + +D + N+ +F + PE
Sbjct: 1236 KMMIFG---------KSSTNLEVI------NVDFLPDGRNLLLVAADAEGNLHIFQFDPE 1280

Query: 485  ARESNGGHRLIKKTDFHLGQH-------VNTFFKIRCKPSSISDAPGARSRFL-TWYASL 536
              +S  GH L+ +T F  G H       + T      +P++  DA  A +       A+ 
Sbjct: 1281 HPKSLQGHLLLNRTTFSTGAHHPQKSLLMPTTSSNPSQPATNGDASAAAAGPQHILMAAP 1340

Query: 537  DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRT--YKGKGYYAG---NPSR 591
             G L    PL +  Y RL  L + +     H   LNP+A+R      +   A    +  R
Sbjct: 1341 TGVLAAVQPLGQGVYTRLSALASNLAASVPHHAALNPKAYRMPPAPARNQVAAVDISVGR 1400

Query: 592  GIIDGSLVWKFLQLSLGERLEICKKIG 618
             ++DG+L+ ++ +L  G R E+  + G
Sbjct: 1401 AVVDGALLARWAELGSGRRAEVAGRAG 1427


>gi|367018592|ref|XP_003658581.1| hypothetical protein MYCTH_2294503 [Myceliophthora thermophila ATCC
            42464]
 gi|347005848|gb|AEO53336.1| hypothetical protein MYCTH_2294503 [Myceliophthora thermophila ATCC
            42464]
          Length = 1547

 Score =  175 bits (443), Expect = 8e-41,   Method: Compositional matrix adjust.
 Identities = 139/510 (27%), Positives = 234/510 (45%), Gaps = 65/510 (12%)

Query: 14   ETIVQELLTVSLG--LHGNRPLLLVRTQHELLIYQAFRHPKGA-----LKLRFKKLKVLF 66
            ETI  E+L   LG   H +  L+L  T  +L +YQ FR+  GA       L F+KL    
Sbjct: 873  ETIA-EILVADLGDMTHKSPHLILRHTNDDLTLYQPFRYNTGAGLEFSKTLFFQKLPNTV 931

Query: 67   VSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTI 126
             +   + A++     +  R   MR  +N+ GY  VFL G  P+++  +S+   +  P+  
Sbjct: 932  FAKSPEEADDDEATHQ-PRFLSMRRCANVGGYSTVFLPGASPSFIIKSSKSVPKVLPLQG 990

Query: 127  DGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSY-DAPWPVRKVPLKCTPHFLAY 185
             G V  ++PFH   C  GF+Y +++   R++ LP   SY +    VRK+P+       AY
Sbjct: 991  TG-VIAMSPFHTEGCEHGFIYADSRDMARVAQLPQDWSYAELGLAVRKIPIGEDIAAAAY 1049

Query: 186  HLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEI 245
            H   ++Y +  +T EP    ++   +D       R++    P V + ++ L SP +W  +
Sbjct: 1050 HPPMQSYVVGCNTPEP----FELPKDDDYHKEWARENLAFKPTVDRGNLKLVSPITWTVV 1105

Query: 246  PQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEV 305
               +  +   E VLC++ + +E     +  +  IA+GT     ED+  RGR+ ++DI +V
Sbjct: 1106 --DSIQMEPCETVLCVECLGLEVSEFTNERKQLIAVGTAITKGEDLPTRGRVYVYDIADV 1163

Query: 306  VPEPGQPLTKNKIKMIYAKEQ--KGPVTAICHVA--GFLVTAVGQKIYIWQLK-DNDLTG 360
            +P+PG+P T  K+K+I AKE   +G VTA+  +   G ++ A GQK  +  LK D  L  
Sbjct: 1164 IPQPGRPETSKKLKLI-AKEDIPRGAVTALSEIGTQGLMLVAQGQKCMVRGLKEDGSLLP 1222

Query: 361  IAFIDTEVYIASMVSV--KNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGY 418
            +AF+D   Y+ +   +    L L+ D  + +    Y  E   + L  +            
Sbjct: 1223 VAFMDMSCYVTAAKELPGTGLCLMADAFKGVWFTGYTEEPYKMMLFGKS----------- 1271

Query: 419  YAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVL 478
                                     RLE+       + D L +   +  ++SD D ++ +
Sbjct: 1272 -----------------------ATRLEVL------NADFLPDGKELFIVVSDADGHIHI 1302

Query: 479  FMYQPEARESNGGHRLIKKTDFHLGQHVNT 508
              + PE  +S  GH L+ +T F+ G H  T
Sbjct: 1303 LQFDPEHPKSLQGHLLLHRTTFNTGAHQPT 1332



 Score = 49.7 bits (117), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 32/104 (30%), Positives = 44/104 (42%), Gaps = 20/104 (19%)

Query: 534  ASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKG------------ 581
            A+  G L     LPE  YRRL  L   +     H  GLNPR +R   G            
Sbjct: 1421 AAPTGVLAALRALPESAYRRLSSLAAQLAGSLPHAAGLNPRGYRLPDGVASSSSPWSSSS 1480

Query: 582  -------KGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 618
                    G  AG   R I+DG+L+ +F +L +  R+E+  + G
Sbjct: 1481 SSFSAVVPGVDAGV-GRTIVDGALLQRFTELGMARRVELAGRAG 1523


>gi|384253955|gb|EIE27429.1| hypothetical protein COCSUDRAFT_64224 [Coccomyxa subellipsoidea
            C-169]
          Length = 1137

 Score =  175 bits (443), Expect = 9e-41,   Method: Compositional matrix adjust.
 Identities = 161/635 (25%), Positives = 264/635 (41%), Gaps = 101/635 (15%)

Query: 43   LIYQAFRHPKGALKLRFKKLKVLFVSD------RSKRANEQPGLPRGVRISQMRYFSNIA 96
            L Y+AF  P+G  ++ FK+L +   +       RSK       + R   + + +   N  
Sbjct: 565  LAYRAFHTPRG--RVCFKRLSLPAHAHCPPQDRRSKTTAPSSSMTRFDGLGESKEHVN-- 620

Query: 97   GYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYF----NAKS 152
               G+F+ G  P WL + SRG L AH M ++G VS + PFHN+NCP GF+      N   
Sbjct: 621  --SGMFVSGERPLWL-VASRGTLVAHAMDVEGRVSGMTPFHNINCPLGFITACMAENDGE 677

Query: 153  ELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGED 212
             L+I  LP     D PWP++K+ ++ TPH LAY+ E + Y ++ S   P   Y +   E 
Sbjct: 678  TLKICQLPMRTRLDTPWPLQKIAVRATPHRLAYYAEARLYVLLVSRPVP---YREHQEEA 734

Query: 213  KELVTDPRDS-RFIPPLVSQ--------FHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKN 263
             +   DP  S  +I    +           V L  P  ++ + +      E     C   
Sbjct: 735  SD--GDPHASYSYICADAAAKASGTELGGEVRLLEPGRYQTVARHALDPGEEP---CSVA 789

Query: 264  VSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVV---PEPGQPLTKNKIKM 320
                       L  YI +GT  NY ED  C GRILLF          E   P    ++ +
Sbjct: 790  ADWLRNAQTGALEPYITVGTALNYGEDYPCSGRILLFKATRTSTSGAEQADPTISWQLTL 849

Query: 321  IYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLI 380
            ++A     PV  +  + G LV AVG  + + +L+ + L  I+F   +++I S+ ++K  I
Sbjct: 850  VHASGFSRPVQGLAVMDGRLVAAVGNNMQVMELRGSSLHMISFFHAQLFITSVATIKTFI 909

Query: 381  LVGDYARSIALL--RYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQ 438
            L+GD  + +  +    +  Y  L+ +++DY      +  +        +++G  ++    
Sbjct: 910  LLGDVHKGLTFVYADKKANYTALTQLSKDYNDVDVEAAEF--------LVNGKKLF---- 957

Query: 439  LSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQ--PEARESNGGHRLIK 496
                                         +  D  +N+ LF Y    E + +  G +L+ 
Sbjct: 958  ----------------------------LLACDAAQNLRLFAYDGGKEQQATWQGKKLLP 989

Query: 497  KTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLP----LPEKNYR 552
                H+GQ++ +    R  P+S   A G + R    + S  G++    P    LP +   
Sbjct: 990  LGAIHVGQNICSSLSHRITPAS---ATGVQLRAAV-FGSAAGSIASLAPTWDGLPAE--- 1042

Query: 553  RLLMLQNVMVTHTSHTGGLNPRAF-RTYK--------GKGYYAGNPSRGIIDGSLVWKFL 603
             LL LQ  MV       GLNP +F R YK        G+ + A      ++D   + +F 
Sbjct: 1043 ELLALQREMVLAVPQVAGLNPVSFRRRYKHGVKALAGGQSFEAPVSDDRVLDLDQLNRFQ 1102

Query: 604  QLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
             L L E++ +  K       +L  L ++    S F
Sbjct: 1103 WLPLTEQVALAAKCNLSRQQVLHALREMVMAISTF 1137


>gi|344229600|gb|EGV61485.1| hypothetical protein CANTEDRAFT_109087 [Candida tenuis ATCC 10573]
          Length = 1300

 Score =  175 bits (443), Expect = 9e-41,   Method: Compositional matrix adjust.
 Identities = 135/550 (24%), Positives = 245/550 (44%), Gaps = 66/550 (12%)

Query: 88   QMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLY 147
            Q+ +  N++G  G+F+ G  P ++  T+    R        P+ +   F N       ++
Sbjct: 809  QLFHIENLSGLTGIFVSGDVPYYIVKTNHSIPRIFKFA-RIPIMSFGKFAN----NQLIF 863

Query: 148  FNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYK 207
             + K   RI  +P+  +Y+  WP R++ +  T   +AYH  + T+ I T    P   Y  
Sbjct: 864  LDDKKNTRICEIPSEFNYENNWPARQINIGETIKDVAYHETSNTFVISTYKEIP---YNC 920

Query: 208  FNGEDKELVTDPRDSRFIPPLVS-QFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSM 266
             + E+  +V    D    P  +S +  + L SP SW  I +  F L + E    + ++ +
Sbjct: 921  LDEENVPIVGIMEDK---PSALSYKGSIKLVSPISWTVIDE--FELDDNEVGTKVSSMVL 975

Query: 267  EYEGT---LSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYA 323
            +   +       R ++ +GT     ED+   G   + +II+V+PEPG P T +K K  Y 
Sbjct: 976  DVGSSTRRFKSKREFVVIGTGKLRMEDLAANGSFKVLEIIDVIPEPGHPETNHKFKEFYK 1035

Query: 324  KEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVG 383
            +E KG VTA+  V+G  + + GQKI +  L+D+ +  +AF+D  VY++   S  N +L+G
Sbjct: 1036 EETKGAVTAVSDVSGRFLVSQGQKIIVRDLQDDGVVPVAFLDCSVYVSESKSYGNFVLLG 1095

Query: 384  DYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGE 443
            D  +S+ L  +  E   + ++ +D K    N   +   +    II G             
Sbjct: 1096 DTLKSVWLAGFDAEPYRMIMLGKDLKSIDVNCADFIVKDEELYIIVGD------------ 1143

Query: 444  RLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLG 503
                       +N+IL                  L  Y PE   S+ G RL++K  F+L 
Sbjct: 1144 -----------NNNILH-----------------LLKYDPEDPNSSNGQRLVEKAAFNLN 1175

Query: 504  QHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVT 563
              V    +++  P+ + ++           ++++G+     P+ E +YRR+ +LQ  +  
Sbjct: 1176 AKVT---QLKQLPNLMDNSTSCIG------STIEGSFFTVFPINESSYRRMYILQQQLTD 1226

Query: 564  HTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHND 623
               H  GLNPR  R    K     + ++ I+D  ++  + +L+   R  I  K+  + ++
Sbjct: 1227 KAYHHCGLNPRLNRFGGLKLTANESNNKPILDYDVIKLYAKLNEDRRRNIGAKVSREGSE 1286

Query: 624  ILDELYDIEA 633
            I  ++ + EA
Sbjct: 1287 IWRDMLEFEA 1296


>gi|403178252|ref|XP_003336695.2| hypothetical protein PGTG_18491 [Puccinia graminis f. sp. tritici CRL
            75-36-700-3]
 gi|375164075|gb|EFP92276.2| hypothetical protein PGTG_18491 [Puccinia graminis f. sp. tritici CRL
            75-36-700-3]
          Length = 1149

 Score =  174 bits (442), Expect = 9e-41,   Method: Compositional matrix adjust.
 Identities = 140/563 (24%), Positives = 244/563 (43%), Gaps = 70/563 (12%)

Query: 88   QMRYFSNI---AGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRG 144
            Q R F++I     ++GV+L G  P WL  T  G  R +    +  +  +A       P G
Sbjct: 640  QSRSFTSIQMDGKFKGVYLAGQPPVWLLSTDHGPCRIYDSPDEKTIHGIAQL-----PDG 694

Query: 145  FLYFNAKSELRISVLPTHLSYDAPWPV---------RKVP---LKCTPHFLAYHLETKTY 192
            FL   +++ ++           + W           R++P   +K    F     ++ + 
Sbjct: 695  FLMSLSEASVQDEEPSQACDPASLWETYISEYVCLDREIPSTLVKTGRPFNKVFYDSASE 754

Query: 193  CIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPL 252
             +V ++    T +  F+ E+  L+  P D   I     +  + L  P  W  I    F  
Sbjct: 755  TVVGASY-LETAFANFD-EEGNLMWQPDDDSLIRATTFRSSLELILPGKWVTIDGYEFQQ 812

Query: 253  HEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQP 312
            +EW  V  + NV ++   T+SG R ++ +GT  N +ED+  RG I +F+I+ V P     
Sbjct: 813  NEW--VTSMANVELDSRSTVSGRRQFVGVGTTCNRAEDLAARGGIYVFEIVVVNPAQNHR 870

Query: 313  LTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQL-KDNDLTGIAFIDTEVYIA 371
                 +++ Y +E K  VTA+  + G+ +  +GQK+Y     +D  L  + F+D + Y  
Sbjct: 871  TYNRALRLRYYEETKACVTAVDAINGYFLHTMGQKLYAKCFEQDERLLAVGFLDIKPYTT 930

Query: 372  SMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGS 431
             M   KN IL+GD  + I L+ +Q E   L  +   Y   + ++  +        +IDG 
Sbjct: 931  CMRIFKNFILLGDAVKGITLVAFQEEPYKLIELGHTYVDLKCSTIDFL-------VIDGK 983

Query: 432  LVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGG 491
            L                                   + +D +  + +F Y P   ES GG
Sbjct: 984  L---------------------------------AIVATDLNGVIRIFEYNPTNIESQGG 1010

Query: 492  HRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNY 551
             +L+ +++F+    +    +   + S+  +A        T++ASLDG++   +P  E  Y
Sbjct: 1011 QKLLCRSEFNTSSEMTCSMQFGKRLSAKDEA----KVMGTFFASLDGSISSLVPAKEAVY 1066

Query: 552  RRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERL 611
            +RL ++Q  +  H  H  GLNP+  RT +     +   +RGI+DG L+ KF  LS+ ++ 
Sbjct: 1067 KRLQLVQTRLTRHIQHFAGLNPKGHRTVRND-LVSRAINRGILDGELLIKFHLLSVTQQA 1125

Query: 612  EICKKIGSKHNDILDELYDIEAL 634
            EI    GS    +L  L ++  L
Sbjct: 1126 EIAGLAGSDRETVLVNLLNLRGL 1148


>gi|354547787|emb|CCE44522.1| hypothetical protein CPAR2_403250 [Candida parapsilosis]
          Length = 1334

 Score =  174 bits (442), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 140/554 (25%), Positives = 250/554 (45%), Gaps = 69/554 (12%)

Query: 88   QMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLY 147
            ++ YF N+ GY  +F+ G  P  +  +     R +  +   P  +++ F +     G ++
Sbjct: 835  RLVYFPNLNGYTTIFVTGVIPFLIIKSCHSIPRIYQFS-KIPAVSVSAFSDSKIKNGLIF 893

Query: 148  FNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYK 207
             +     RI  L    SY+   P+RKV +  +   +AYH ++ T  I T    P   Y  
Sbjct: 894  LDNNQNARICELSWDYSYEFNLPIRKVHIGESIKSVAYHEQSDTVVISTFKEIP---YDC 950

Query: 208  FNGEDKELVTDPRDSRFIPPLVS-QFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSM 266
             + E K +    +D    PP  S +  + L SP++W+ I      L + E  + +K++ +
Sbjct: 951  VDEEGKPIAGALKDK---PPATSFKGSIKLVSPYNWKVIDTVE--LSDNEVGMSIKSMVL 1005

Query: 267  EYEGTLSGL---RGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYA 323
            +   +L      R YI +GT+    ED+   G   ++DII+++PEPG+P T +K K I+ 
Sbjct: 1006 DVGSSLKKFKTKREYIVIGTSKLRMEDLAANGSFKIYDIIDIIPEPGKPETNHKFKEIFQ 1065

Query: 324  KEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVG 383
            ++ KG VT+IC ++G  +   GQK+ +  L+D+ +  +AF+DT VY++   S  N+ L+G
Sbjct: 1066 EDTKGAVTSICDLSGRFLVGQGQKVIVRDLEDDGVVPVAFLDTPVYVSEAKSFGNIFLLG 1125

Query: 384  DYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGE 443
            D  +SI L+ ++ +   + ++ +D +                                  
Sbjct: 1126 DALKSIWLVGFEADPFRMVMLGKDRQHLHVE----------------------------- 1156

Query: 444  RLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLG 503
                C     K  +I         +++D +  + L  + P+  +S  G  L+ K  F   
Sbjct: 1157 ----CADFIVKDEEIF-------ILVADINNGLHLIQFDPDDPKSINGTILVNKASFETN 1205

Query: 504  QHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVT 563
                      C  S   D  G    + T  +++DGA     P+ E  YRR+ ++Q  +  
Sbjct: 1206 SQTT------CLRSVPKDEAG---DYQTIGSTIDGAFFNVFPVNESTYRRMYIVQQQISD 1256

Query: 564  HTSHTGGLNPRAFRTYKGKGYY--AGNPSRGIIDGSLVWKFLQLSLGERLEICKKI---G 618
               H  GLNPR  R + G      +   ++ I+D +L+ +F +L+L  +  I  KI   G
Sbjct: 1257 KEFHHCGLNPRLNR-FGGAIQIRDSDTNAKPILDYNLIREFAKLNLDRQRNIATKINIKG 1315

Query: 619  SKHNDILDELYDIE 632
            S H DI  +L ++E
Sbjct: 1316 SAH-DIWKDLIELE 1328


>gi|403170487|ref|XP_003329830.2| hypothetical protein PGTG_11767 [Puccinia graminis f. sp. tritici CRL
            75-36-700-3]
 gi|375168746|gb|EFP85411.2| hypothetical protein PGTG_11767 [Puccinia graminis f. sp. tritici CRL
            75-36-700-3]
          Length = 1513

 Score =  174 bits (442), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 140/563 (24%), Positives = 244/563 (43%), Gaps = 70/563 (12%)

Query: 88   QMRYFSNI---AGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRG 144
            Q R F++I     ++GV+L G  P WL  T  G  R +    +  +  +A       P G
Sbjct: 1004 QSRSFTSIQMDGKFKGVYLAGQPPVWLLSTDHGPCRIYDSPDEKTIHGIAQL-----PDG 1058

Query: 145  FLYFNAKSELRISVLPTHLSYDAPWPV---------RKVP---LKCTPHFLAYHLETKTY 192
            FL   +++ ++           + W           R++P   +K    F     ++ + 
Sbjct: 1059 FLMSLSEASVQDEEPSQACDPASLWETYISEYVCLDREIPSTLVKTGRPFNKVFYDSASE 1118

Query: 193  CIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPL 252
             +V ++    T +  F+ E+  L+  P D   I     +  + L  P  W  I    F  
Sbjct: 1119 TVVGASY-LETAFANFD-EEGNLMWQPDDDSLIRATTFRSSLELILPGKWVTIDGYEFQQ 1176

Query: 253  HEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQP 312
            +EW  V  + NV ++   T+SG R ++ +GT  N +ED+  RG I +F+I+ V P     
Sbjct: 1177 NEW--VTSMANVELDSRSTVSGRRQFVGVGTTCNRAEDLAARGGIYVFEIVVVNPAQNHR 1234

Query: 313  LTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQL-KDNDLTGIAFIDTEVYIA 371
                 +++ Y +E K  VTA+  + G+ +  +GQK+Y     +D  L  + F+D + Y  
Sbjct: 1235 TYNRALRLRYYEETKACVTAVDAINGYFLHTMGQKLYAKCFEQDERLLAVGFLDIKPYTT 1294

Query: 372  SMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGS 431
             M   KN IL+GD  + I L+ +Q E   L  +   Y   + ++  +        +IDG 
Sbjct: 1295 CMRIFKNFILLGDAVKGITLVAFQEEPYKLIELGHTYVDLKCSTIDFL-------VIDGK 1347

Query: 432  LVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGG 491
            L                                   + +D +  + +F Y P   ES GG
Sbjct: 1348 L---------------------------------AIVATDLNGVIRIFEYNPTNIESQGG 1374

Query: 492  HRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNY 551
             +L+ +++F+    +    +   + S+  +A        T++ASLDG++   +P  E  Y
Sbjct: 1375 QKLLCRSEFNTSSEMTCSMQFGKRLSAKDEA----KVMGTFFASLDGSISSLVPAKEAVY 1430

Query: 552  RRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERL 611
            +RL ++Q  +  H  H  GLNP+  RT +     +   +RGI+DG L+ KF  LS+ ++ 
Sbjct: 1431 KRLQLVQTRLTRHIQHFAGLNPKGHRTVRND-LVSRAINRGILDGELLIKFHLLSVTQQA 1489

Query: 612  EICKKIGSKHNDILDELYDIEAL 634
            EI    GS    +L  L ++  L
Sbjct: 1490 EIAGLAGSDRETVLVNLLNLRGL 1512


>gi|50552095|ref|XP_503522.1| YALI0E03982p [Yarrowia lipolytica]
 gi|74634000|sp|Q6C740.1|CFT1_YARLI RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two protein
            1
 gi|49649391|emb|CAG79101.1| YALI0E03982p [Yarrowia lipolytica CLIB122]
          Length = 1269

 Score =  173 bits (439), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 155/631 (24%), Positives = 276/631 (43%), Gaps = 95/631 (15%)

Query: 19   ELLTVSLGLHGN----RPLLLVRTQHELLIYQAFRHPKGALKLRFKKLKVLFVSDRSKRA 74
            EL+ ++L   G+    R  L++ T  +L++Y+ + +     KLRF+K+ +          
Sbjct: 719  ELVDIALSPLGDDHILRDYLVLLTPQQLVVYEPYHYND---KLRFRKIFL---------- 765

Query: 75   NEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTID-GPVSTL 133
               P +    R++Q+     I G   + + G   A++ + +   L   P  I+ G     
Sbjct: 766  ERTPTINSDRRLTQVPL---INGKHTLGVTG-ETAYILVKT---LHTSPRLIEFGETKGA 818

Query: 134  APFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKC--TPHFLAYHLETKT 191
              F + +    F Y     E+         S +  WPV+ V L C  T   + YH     
Sbjct: 819  VAFTSWDGK--FAYLTQAGEVAECRFDPSFSLETNWPVKHVQL-CGETISKVTYHETMDV 875

Query: 192  YCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFP 251
            Y I T    P    +    ED E++ +      +P    Q  + + +P+SW  I    F 
Sbjct: 876  YVIATHKTVP----HVVRDEDDEVI-ESLTPDIMPATTYQGAIRIVNPYSWTVIDSYEFE 930

Query: 252  LHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQ 311
            +   E  LC ++V +      S  R  +A+GT+    ED+  RG + LFD+IE+VPE  +
Sbjct: 931  M-PAEAALCCESVKLSISDRKSQKREVVAVGTSILRGEDLAARGALYLFDVIEIVPEKER 989

Query: 312  PLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDN-DLTGIAFIDTEVYI 370
            P T  ++K +     +G  TA+C V+G L+   GQK+ +  L+D+  L  +AF+D + Y+
Sbjct: 990  PETNRRLKKLVQDRVRGAFTAVCEVSGRLLAVQGQKLLVQALQDDLTLVPVAFLDMQTYV 1049

Query: 371  ASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDG 430
            A   S+ +++L+GD  RS+  + +  +   +   ARD                       
Sbjct: 1050 AVAKSLNSMLLLGDATRSVQFVGFSMDPYQMIPFARD----------------------- 1086

Query: 431  SLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNG 490
                  LQ  L   +  C        D   E  ++ F+++D  K + +  Y P+  +S  
Sbjct: 1087 ------LQRVL---VTTC--------DFAIEGENLTFVVADLQKRLHILEYDPDDPQSYS 1129

Query: 491  GHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKN 550
            G RL++++ F+ G+ +++   +          P    RF+      DG++   +P PE  
Sbjct: 1130 GARLLRRSVFYSGKVIDSSAMV----------PINEDRFMVIGVCSDGSVTDVVPCPEDA 1179

Query: 551  YRRLLMLQNVMVTHTSHTGGLNPRAFR---TYKGKGYYAGNPSRGIIDGSLVWKFLQLSL 607
            YRRL  +Q  +    +H  GL+PRA+R      G G    +P R I+DG  + +F  L  
Sbjct: 1180 YRRLYAIQTQITDKEAHVCGLHPRAYRYDPILPGTG---NSPHRPILDGHTLIRFANLPR 1236

Query: 608  GERLEICKKIGSKHNDILDELYDIEALSSHF 638
             ++     ++G ++  ++    D+E +S  F
Sbjct: 1237 NKQNVYANRLGQRYQQLI--WKDLELISDLF 1265


>gi|325189779|emb|CCA24259.1| conserved hypothetical protein [Albugo laibachii Nc14]
          Length = 1911

 Score =  172 bits (437), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 160/675 (23%), Positives = 286/675 (42%), Gaps = 162/675 (24%)

Query: 85   RISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDG-------------PVS 131
            R   +  F N+    G+F  G +P W+ L ++G+    P+ I               PV 
Sbjct: 1277 RYPMLTRFFNVNNNSGMFFRGAYPVWI-LPNQGQPVFVPLNIAAAPSDPTRRTTFKVPVL 1335

Query: 132  TLAPFHNVNCPRGFLYFNAKSELRISVLP-----THLSYDAPWPVRKVPLKCTPHFLAY- 185
            +  PFH+ NCP GF+YF++   LR+  LP     T L     + ++KV    T H L Y 
Sbjct: 1336 SFTPFHHWNCPNGFVYFHSSGSLRVCELPSSQNSTLLPSGNGFVLQKVRFGATIHHLLYL 1395

Query: 186  ----------HLETKTYCIVTST----AEPSTDYYKFN--------------GEDKELVT 217
                       L++ T+ +V S     +E    Y+  N              G++ E   
Sbjct: 1396 GRHGPGGVAEALKSPTFAMVLSRKVTPSEAEQAYWSENNDENADDTMYQNGVGKEAEEGD 1455

Query: 218  DPR----DSRFIPPLVSQF---------------------HVSLFSPFSWEEIPQTNFPL 252
            DP     +S  + P   +F                      +  +  ++ + + +  F  
Sbjct: 1456 DPNAEDLNSNVMAPTAEKFPDLDVNDMPLIGEDAYELRVVQLDEYGDWAGQGVFRAYFER 1515

Query: 253  HE----------WEHVLCLKNV----------SMEYEGTLSG-------LRGYIALGTNY 285
            HE           +  L  KNV          +ME + T +         R YI +GT Y
Sbjct: 1516 HEVVLSVKVLYLHDASLLKKNVDSATDEYHRRNMETDSTANEEAEWNRRKRPYIVIGTGY 1575

Query: 286  --NYSEDVTCRGRILLF--DIIEVVPEPGQPLTK-NKIKMIYAKEQ-KGPVTAICHVAGF 339
                 ED + +GR+LL+  D  + V + G   +K  K+++ + KE  +G +T++  +  +
Sbjct: 1576 VGPNGEDASGKGRLLLYEVDYAQYVDKDGTTSSKLPKLRLTFIKEHHQGAITSVIQLGMY 1635

Query: 340  LVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASM-VSVKNLILVGDYARSIALLRYQPEY 398
            ++ +VG K+ +++ K + L G AF D +++I S+ V  K  ++  D  +S++ LR++ + 
Sbjct: 1636 VLASVGSKMIVYEFKSDQLIGCAFYDAQMFITSLSVLRKEYVMYSDVYKSVSFLRWRQKD 1695

Query: 399  RTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 458
            R L L+A+DY+P    +  +                                      +I
Sbjct: 1696 RQLILLAKDYEPLAVTTAEF--------------------------------------NI 1717

Query: 459  LDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFF-KIRCKPS 517
            LD  + +  + +D ++N+ +  Y P   ES GG RL++ +DFH+G  +++   K+    +
Sbjct: 1718 LD--TRLALIAADVEENLHVLQYAPHDIESRGGQRLLRTSDFHVGVQISSILRKLVISNA 1775

Query: 518  SISDAPGARSR-----FLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 572
            S      A+ R     +L    S +G +   +P+PE+ +RRL  LQNVM++       LN
Sbjct: 1776 SHQQYIPAKGRCIGNMYLNVLGSSEGGIAALIPVPERVFRRLFTLQNVMISALPQNCALN 1835

Query: 573  PRAFRTYKGKGYYAGNPS---------RGIIDGSLVWKFLQLSLGERLEICKKIGSKHND 623
            PR FR  K  G      +         +G +DG ++ +FL L    + E+ + IG+    
Sbjct: 1836 PREFRVMKANGRVRSGRADAWCKQKWKKGFLDGQVLCRFLHLDYVAQKELARCIGTNPEV 1895

Query: 624  ILDELYDIEALSSHF 638
            I+  L +++  +  F
Sbjct: 1896 IIQNLSELQRNTMSF 1910


>gi|325187036|emb|CCA21579.1| conserved hypothetical protein [Albugo laibachii Nc14]
          Length = 1912

 Score =  172 bits (436), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 160/675 (23%), Positives = 286/675 (42%), Gaps = 162/675 (24%)

Query: 85   RISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDG-------------PVS 131
            R   +  F N+    G+F  G +P W+ L ++G+    P+ I               PV 
Sbjct: 1278 RYPMLTRFFNVNNNSGMFFRGAYPVWI-LPNQGQPVFVPLNIAAAPSDPTRRTTFKVPVL 1336

Query: 132  TLAPFHNVNCPRGFLYFNAKSELRISVLP-----THLSYDAPWPVRKVPLKCTPHFLAY- 185
            +  PFH+ NCP GF+YF++   LR+  LP     T L     + ++KV    T H L Y 
Sbjct: 1337 SFTPFHHWNCPNGFVYFHSSGSLRVCELPSSQNSTLLPSGNGFVLQKVRFGATIHHLLYL 1396

Query: 186  ----------HLETKTYCIVTST----AEPSTDYYKFN--------------GEDKELVT 217
                       L++ T+ +V S     +E    Y+  N              G++ E   
Sbjct: 1397 GRHGPGGVAEALKSPTFAMVLSRKVTPSEAEQAYWSENNDENADDTMYQNGVGKEAEEGD 1456

Query: 218  DPR----DSRFIPPLVSQF---------------------HVSLFSPFSWEEIPQTNFPL 252
            DP     +S  + P   +F                      +  +  ++ + + +  F  
Sbjct: 1457 DPNAEDLNSNVMAPTAEKFPDLDVNDMPLIGEDAYELRVVQLDEYGDWAGQGVFRAYFER 1516

Query: 253  HE----------WEHVLCLKNV----------SMEYEGTLSG-------LRGYIALGTNY 285
            HE           +  L  KNV          +ME + T +         R YI +GT Y
Sbjct: 1517 HEVVLSVKVLYLHDASLLKKNVDSATDEYHRRNMETDSTANEEAEWNRRKRPYIVIGTGY 1576

Query: 286  --NYSEDVTCRGRILLF--DIIEVVPEPGQPLTK-NKIKMIYAKEQ-KGPVTAICHVAGF 339
                 ED + +GR+LL+  D  + V + G   +K  K+++ + KE  +G +T++  +  +
Sbjct: 1577 VGPNGEDASGKGRLLLYEVDYAQYVDKDGTTSSKLPKLRLTFIKEHHQGAITSVIQLGMY 1636

Query: 340  LVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASM-VSVKNLILVGDYARSIALLRYQPEY 398
            ++ +VG K+ +++ K + L G AF D +++I S+ V  K  ++  D  +S++ LR++ + 
Sbjct: 1637 VLASVGSKMIVYEFKSDQLIGCAFYDAQMFITSLSVLRKEYVMYSDVYKSVSFLRWRQKD 1696

Query: 399  RTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 458
            R L L+A+DY+P    +  +                                      +I
Sbjct: 1697 RQLILLAKDYEPLAVTTAEF--------------------------------------NI 1718

Query: 459  LDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFF-KIRCKPS 517
            LD  + +  + +D ++N+ +  Y P   ES GG RL++ +DFH+G  +++   K+    +
Sbjct: 1719 LD--TRLALIAADVEENLHVLQYAPHDIESRGGQRLLRTSDFHVGVQISSILRKLVISNA 1776

Query: 518  SISDAPGARSR-----FLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 572
            S      A+ R     +L    S +G +   +P+PE+ +RRL  LQNVM++       LN
Sbjct: 1777 SHQQYIPAKGRCIGNMYLNVLGSSEGGIAALIPVPERVFRRLFTLQNVMISALPQNCALN 1836

Query: 573  PRAFRTYKGKGYYAGNPS---------RGIIDGSLVWKFLQLSLGERLEICKKIGSKHND 623
            PR FR  K  G      +         +G +DG ++ +FL L    + E+ + IG+    
Sbjct: 1837 PREFRVMKANGRVRSGRADAWCKQKWKKGFLDGQVLCRFLHLDYVAQKELARCIGTNPEV 1896

Query: 624  ILDELYDIEALSSHF 638
            I+  L +++  +  F
Sbjct: 1897 IIQNLSELQRNTMSF 1911


>gi|146415762|ref|XP_001483851.1| hypothetical protein PGUG_04580 [Meyerozyma guilliermondii ATCC 6260]
          Length = 1320

 Score =  171 bits (434), Expect = 8e-40,   Method: Compositional matrix adjust.
 Identities = 143/609 (23%), Positives = 266/609 (43%), Gaps = 71/609 (11%)

Query: 33   LLLVRTQHELLIYQAFRHPKGALKLRFKKLKVLFVSDRSKRANEQPGLPRGVRISQ-MRY 91
            L ++    E+ +Y+ F      L  + KK K L ++     A      P G  I + + Y
Sbjct: 768  LTILTVGGEIYMYKLFF---DGLNFKLKKEKDLLITGAPDNA-----YPAGTSIERRLVY 819

Query: 92   FSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAK 151
               ++G+  +F+ G  P ++  T     R    T      + A F +     G ++ +  
Sbjct: 820  IPLVSGFSSIFVTGVVPYFITRTRHSIPRIFKFT-KIAAQSFASFSDSKVSNGLIFLDNA 878

Query: 152  SELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGE 211
               RI  LP   +YD   PV+KVP+  T   + YH  + TY + T    P   Y   + E
Sbjct: 879  KNARICELPRDFNYDNNLPVKKVPIGETVKSVTYHELSNTYVVSTYREIP---YNALDEE 935

Query: 212  DKELVTDPRDSRFIPPLVSQFHVSL--FSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYE 269
               +    +D     P  + +  SL   SP++W  I      L + E  + +K++ ++  
Sbjct: 936  GNPIAGLKKDK----PSANSYKGSLKLISPYNWTVIETVE--LRDNEIAMTVKSMVLDIG 989

Query: 270  GTLSGLR---GYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQ 326
             +    +     + +GT     ED+   G   +++II+++PEPG+P T +K K    ++ 
Sbjct: 990  SSTKRFKHRKELLVVGTGRYRMEDLGANGAFKIYEIIDIIPEPGKPETNHKFKEYNTEDT 1049

Query: 327  KGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYA 386
            KG VT++C V+G  + A GQKI +  ++D+ +  +AF+DT VY++   S  NL+++GD  
Sbjct: 1050 KGAVTSMCEVSGRFLVAQGQKIIVRDVQDDGVVPVAFLDTSVYVSEAKSFGNLVILGDTL 1109

Query: 387  RSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLE 446
            +S+ L  +  E   + ++ +D +                  +D S               
Sbjct: 1110 KSVWLAGFDAEPFRMIMLGKDLQS-----------------VDVS--------------- 1137

Query: 447  ICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHV 506
             C +  SK  +I         +I+  +  + L  + PE   S+ G RL+ +  F++    
Sbjct: 1138 -CAEFISKDEEIY-------ILIAGNNNVMHLVQFDPEDPTSSNGQRLVHRASFNVSSST 1189

Query: 507  NTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTS 566
                 +R  P +          F T  +++DG+     P+ E  YRR+ ++Q  +     
Sbjct: 1190 TC---MRMVPKNEEINTQYSDVFQTVGSTIDGSFFTVFPVNEFTYRRMYIIQQQLTDKEY 1246

Query: 567  HTGGLNPRAFRTYKGKGYYAGNPS-RGIIDGSLVWKFLQLSLGERLEICKKIGSK--HND 623
            H  GLNPR  R + G+ +       + I+D  ++ ++ +L+   +  I +K+ SK  + +
Sbjct: 1247 HYCGLNPRLNR-FGGEAFDDSQTGVKPILDHQVIKRYAKLNEDRKQTIAQKVSSKGVYQE 1305

Query: 624  ILDELYDIE 632
            I  +L + E
Sbjct: 1306 IWKDLIEFE 1314


>gi|429851266|gb|ELA26469.1| protein cft1 [Colletotrichum gloeosporioides Nara gc5]
          Length = 1411

 Score =  171 bits (432), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 168/651 (25%), Positives = 279/651 (42%), Gaps = 117/651 (17%)

Query: 14   ETIVQELLTVSLG-LHGNRPLLLVR-TQHELLIYQAFR-----HPKGALK-LRFKK---- 61
            +  + E+L   LG    + P L++R    ++ IY+  R       +G  K L F+K    
Sbjct: 839  QETLTEVLVAKLGDATESSPYLILRHANDDITIYEPIRLESQDKSEGLAKTLHFQKITNP 898

Query: 62   -LKVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELR 120
             L    V      ANEQP      R   +R  +NI GY  VFL G  P+++  +++   +
Sbjct: 899  ALAKSPVEVADDDANEQP------RFVPLRPCANINGYSTVFLPGASPSFIIKSAKSAPK 952

Query: 121  AHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTP 180
               +   G V  ++ FH   C RGF+Y +++   R++ LP   S++    +RK+P+    
Sbjct: 953  VLGLQGIG-VRGMSSFHTEGCERGFIYADSEGHTRVTQLPADTSFELGVSIRKIPVGDAI 1011

Query: 181  HFLAYHLETKTYCIVTSTAEP-----STDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVS 235
              +AYH   +TY +  S +EP       DY+K   ++  + T P+  R I        + 
Sbjct: 1012 GLIAYHPPMETYAVACSVSEPFELPKDDDYHKEWAKET-ITTFPQMERGI--------IK 1062

Query: 236  LFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRG 295
            L SP +W  I       HE    +C+K + +E        R  IA+GT  N  ED+  RG
Sbjct: 1063 LLSPATWSVIDTVELDPHEV--AMCMKTLHLEVSEETKERRMLIAIGTAINRGEDLPIRG 1120

Query: 296  RILLFDIIEVVPEPGQPLTKNKIKMIYAKEQ--KGPVTAICHVA--GFLVTAVGQKIYIW 351
            RIL++D++ VVP+PG+P T  K+K++ AKE+  +G VTA+C V   G ++ A GQK  + 
Sbjct: 1121 RILVYDVVPVVPQPGRPETNKKLKLV-AKEEIPRGAVTALCEVGSQGLMLVAQGQKCMVR 1179

Query: 352  QLK-DNDLTGIAFIDTEVYIASMVSVKNL--ILVGDYARSIALLRYQPEYRTLSLVARDY 408
             LK D  L  +AF+D   Y+ S+  V+     L+ D  + +  + Y  E           
Sbjct: 1180 GLKEDGTLLPVAFMDMSCYVTSVREVRGTGYCLMADAFKGVWFVGYAEE----------- 1228

Query: 409  KPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFM 468
                          P + ++ G    KF  L+                D L +   +  +
Sbjct: 1229 --------------PYKIMLFGKSTGKFEVLTA---------------DFLVDGDELHIV 1259

Query: 469  ISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSR 528
            + DKD  + +  + PE  +S  GH L+ +  F    +          P++    P   + 
Sbjct: 1260 VCDKDGVIHVMQFDPEHPKSLQGHLLLNRASFSAAPN---------HPTATLSLPRTTTT 1310

Query: 529  FLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRA-----FRTYKGKG 583
              +  AS             KN    LML     + T H      R+            G
Sbjct: 1311 AQSASAS-------------KNPPSTLML----ASPTGHCRSAPSRSRHEPKGPPAPAPG 1353

Query: 584  YYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEAL 634
                +  R I+DG+L+ ++ +L  G R E+  K G  + ++L+   ++E +
Sbjct: 1354 SPPTSAGRTIVDGALLSRWNELGAGRRSEVAGKGG--YGNVLEVRGELEGI 1402


>gi|322694449|gb|EFY86278.1| Cleavage factor two protein 1 [Metarhizium acridum CQMa 102]
          Length = 1431

 Score =  171 bits (432), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 158/639 (24%), Positives = 283/639 (44%), Gaps = 74/639 (11%)

Query: 17   VQELLTVSLG-LHGNRPLLLVR-TQHELLIYQAFR-HPKGALKLR----FKKLKVLFVSD 69
            + E+L   LG      P L+VR    +L IY+  R   +G  +L     FKK     ++ 
Sbjct: 837  ITEILVADLGDAISQTPYLIVRHASDDLTIYEPVRCQEEGDAELSASLLFKKCVNTSLAK 896

Query: 70   RSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGP 129
             +   +E    P   R   +R  +N+ GY  VFL G  P+++  +S  E R   +   G 
Sbjct: 897  TAPEVSEDDAEPP--RFVPLRRCANVNGYGAVFLPGASPSFVLKSSHSEPRVIGLQGLG- 953

Query: 130  VSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSY-DAPWPVRKVPLKCTPHFLAYHLE 188
            V  ++ FH   C RGF+Y + +   R++ LP++ S+ D    V+K+ L      ++YH  
Sbjct: 954  VRGMSTFHTEGCDRGFIYVDVEGIARVTQLPSNASFTDLGVSVKKIALDGDVGMISYHHP 1013

Query: 189  TKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQT 248
            T TY +  +  EP    ++   +D       +++   PP  ++  + L +P +W  I + 
Sbjct: 1014 TGTYVVACTKLEP----FELPRDDDYHKEWAKETIKFPPTTARGILKLINPVTWTVIHE- 1068

Query: 249  NFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPE 308
               L   E +  +K + +E        +  +A+GT  +  ED+  RGR+ +FDI+ V+PE
Sbjct: 1069 -LELEPCESIESMKTLHLEVSEETKERKMLVAVGTALSKGEDLPTRGRVQVFDIVTVIPE 1127

Query: 309  PGQPLTKNKIKMIYAKEQ--KGPVTAICHVA--GFLVTAVGQKIYIWQLK-DNDLTGIAF 363
            PG+P T  ++K+I AKE+  +G VTA+  V   G ++ A GQK  +  LK D  L  +AF
Sbjct: 1128 PGRPETNKRLKLI-AKEEIPRGGVTALSEVGAQGLMLVAQGQKCMVRGLKEDGSLLPVAF 1186

Query: 364  IDTEVYIASMVSV--KNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAG 421
            +D   ++AS+  +    L ++ D  + +    Y  E  T  ++ +        S G    
Sbjct: 1187 LDMNCHVASVKELPGTGLCVMADVFKGLWFAGYTEEPYTFKILGK--------SSG---- 1234

Query: 422  NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMY 481
                                        K+     D L +   +  +  D + ++ +  +
Sbjct: 1235 ----------------------------KLPLLVADFLPDGEDLSMVAVDAEGDMHILEF 1266

Query: 482  QPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALG 541
             PE  +S  GH L+ +T F +  +  T   +  +  S S    + S  +   A   G + 
Sbjct: 1267 NPEHPKSLQGHLLLHRTSFAVTPNTPTSTLLLPRTHSPSYPHASSSSHMLLLACPSGQVA 1326

Query: 542  FFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYY------AGNPSRGIID 595
               PL E  YRRLL + N +        GL+ +A R Y  +         A +  R ++D
Sbjct: 1327 ALSPLAESTYRRLLSVTNQLHPAIVAHCGLHTKAHR-YPDQSCVAVGVETAASSGRALVD 1385

Query: 596  GSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEAL 634
            G+++ ++ +L   +R ++  + G  ++ + D   D+E +
Sbjct: 1386 GTVLARWSELGAAKRTDVALRGG--YDSVADLRDDLEGV 1422


>gi|388581811|gb|EIM22118.1| hypothetical protein WALSEDRAFT_28358 [Wallemia sebi CBS 633.66]
          Length = 1259

 Score =  170 bits (431), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 158/616 (25%), Positives = 258/616 (41%), Gaps = 84/616 (13%)

Query: 23   VSLGLHGNRP--LLLVRTQHELLIYQAFRHPKGALKLRFKKLKVLFVSDRSKRANEQPGL 80
            V+LG  G +   L+++    EL+IY  +    G   + F K+  + V             
Sbjct: 725  VNLGKGGVKRAHLIILYQSGELVIYDTYNSSSG---IAFSKVSAVSVQ------------ 769

Query: 81   PRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVN 140
                 IS++  F +     G  + G  P  +        R HP+     +   APF    
Sbjct: 770  LSATVISRILTFCDF----GALITGRTPVLISCEDTSIPRIHPLD-QKYIHHAAPFD--- 821

Query: 141  CPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAE 200
               G L++ A  E+ ++ +   + Y    P+R++        +AY   ++ Y + TSTA 
Sbjct: 822  ---GGLFYYANDEVILATIGDDVEYSENLPLRRLANGRNFDKVAYDPTSQMY-VATSTA- 876

Query: 201  PSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLC 260
             S  +  F+     L   P ++   P    +  + L S      I    F  +E+  V+C
Sbjct: 877  -SVPFRLFDNAGNYLWKPPTEN-LSPATSYRSAIELLSNDCRSSIFGYEFEQNEF--VIC 932

Query: 261  LKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKM 320
             + VS+         + +I +GT  N  EDV  +G + LF+I E++P         K+KM
Sbjct: 933  CETVSLLSPSADGTYKDFIGVGTCINRGEDVAVKGAMYLFEIAELIPSSKDSGNNYKLKM 992

Query: 321  IYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDND-LTGIAFIDTEVYIASMVSVKNL 379
            +  +E KG V+AI   +G+ V AVGQK+ I  L+ N+ L  +AF D   YI S+  +KN 
Sbjct: 993  LMREETKGAVSAITSCSGYFVVAVGQKVLIRALEINERLISVAFYDAGTYIVSLEVLKNF 1052

Query: 380  ILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQL 439
            ILVGD  +SI  L +Q     L  ++RD +                              
Sbjct: 1053 ILVGDQVKSITFLAFQESPYKLVQLSRDAR------------------------------ 1082

Query: 440  SLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTD 499
                ++E C      H D       + F+ +D   ++ L  Y P    + GG +LI+ T+
Sbjct: 1083 ----QIETCVSNFLAHED------QISFVSNDIQGDLRLIDYNPFDPTAEGGEKLIRTTE 1132

Query: 500  FHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQN 559
            FH G             S +   P  R         +DG+L    P+ E  ++ L +LQ 
Sbjct: 1133 FHKGSEATC--------SLLLPKPSVRPSSELLLGCVDGSLSCLSPVDEITFKALWLLQG 1184

Query: 560  VMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGS 619
             +V    H   LNPRA R  +   Y + + S+GI+DG L+  +  +    ++EI K+IG 
Sbjct: 1185 ALVRQIPHIAALNPRAHRHVRND-YVSRSLSKGILDGLLLSAYQTIDHATQVEIAKRIGY 1243

Query: 620  KHNDILDELYDIEALS 635
               ++L  L +   LS
Sbjct: 1244 SKAELLGYLRNFSWLS 1259


>gi|453082807|gb|EMF10854.1| CPSF_A-domain-containing protein [Mycosphaerella populorum SO2202]
          Length = 1349

 Score =  170 bits (431), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 161/643 (25%), Positives = 271/643 (42%), Gaps = 100/643 (15%)

Query: 14   ETIVQELLTVSLGLH-GNRPLLLVRT-QHELLIYQAFRHP------KGALKLRFKKLKVL 65
            +  + ELL   LG      P L+VRT   ++++Y+ F +P           L F+K+   
Sbjct: 750  KATLTELLVADLGQDSATEPYLIVRTAMDDIVLYEPFHYPLRPNEDSWHSSLHFRKVPFS 809

Query: 66   FVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGEL----RA 121
            ++     + NEQ    +   + +++    + GY  V + G  P  L +     L      
Sbjct: 810  YI----PKYNEQLSDAQTPPLKRIQ----VGGYHAVNIPG-GPTNLLMKESSSLLKVLEV 860

Query: 122  HPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPH 181
                     + ++P H   C  GFL  NA  E++ + LP    Y   W +++V +     
Sbjct: 861  RDTQSSQRATVMSPVHRPGCEHGFLTINADEEVQENQLPEKTWYGTGWSIQQVDIGEDVR 920

Query: 182  FLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFS 241
             +AYH E + Y + T       D+Y F GED       +D   + P V Q+ + L S  S
Sbjct: 921  HIAYHAEREVYVVATCR---DIDFY-FAGEDGR--HPEQDDIELRPQVPQYTIHLVSAKS 974

Query: 242  WEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFD 301
             + +   +  L   E V  LK +S+E        +  + + T     ED+  +G ++L+D
Sbjct: 975  HQRL--QSVELGYLETVTALKVMSLEVSENTHEQKDLVVVSTAAQRGEDMPAKGAVILYD 1032

Query: 302  IIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICH--VAGFLVTAVGQKIYIWQLK-DNDL 358
            II+VVP+P  P +  ++  +  ++ +G +T+I      GFL TA G K+ +  LK D   
Sbjct: 1033 IIDVVPDPDVPESGFQLHQLAREQARGAITSIAGPLPGGFLGTAQGLKLMVRGLKEDGTC 1092

Query: 359  TGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGY 418
              +AF+D + Y                              TL ++        P    +
Sbjct: 1093 LPVAFLDAQSYT----------------------------HTLKVL--------PGRGMW 1116

Query: 419  YAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEF----SSMGFMISDKDK 474
             AG+  +G+  G    +  +L++     + K   SK   +  EF     ++  +I D D 
Sbjct: 1117 LAGDAWKGLWFGGFTEEPYKLTV-----MGKSPKSKMEVMTAEFLPFDGALYILIMDADN 1171

Query: 475  NVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI--RCKPSSISD----------- 521
            ++ +  Y PE  +S GG RL+ ++ FH+G  V     +    KP    D           
Sbjct: 1172 DLHVLQYDPENPKSVGGMRLLHRSTFHIGHLVTNMLLVPSSLKPFESQDRDMANGTNGNN 1231

Query: 522  -----APGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAF 576
                 AP +    L    S  G++G   PL E  YRRL  LQ  +     H  GLNPRA+
Sbjct: 1232 EEATRAPPSLHHILA--TSRSGSVGLITPLDEAAYRRLSALQTHLTAILEHAAGLNPRAY 1289

Query: 577  RTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGS 619
            R  + + +     +RG++DGSLV +  +L   +R ++  + GS
Sbjct: 1290 RAVEAESFGG---ARGVVDGSLVNRIGELGAAKRADVLGRAGS 1329


>gi|190348091|gb|EDK40482.2| hypothetical protein PGUG_04580 [Meyerozyma guilliermondii ATCC 6260]
          Length = 1320

 Score =  169 bits (429), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 142/611 (23%), Positives = 266/611 (43%), Gaps = 71/611 (11%)

Query: 33   LLLVRTQHELLIYQAFRHPKGALKLRFKKLKVLFVSDRSKRANEQPGLPRGVRISQ-MRY 91
            L ++    E+ +Y+ F         + KK K L ++     A      P G  I + + Y
Sbjct: 768  LTILTVGGEIYMYKLFFDGSN---FKLKKEKDLLITGAPDNA-----YPAGTSIERRLVY 819

Query: 92   FSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAK 151
               ++G+  +F+ G  P ++  T     R    T      + A F +     G ++ +  
Sbjct: 820  IPLVSGFSSIFVTGVVPYFITRTRHSIPRIFKFT-KIAAQSFASFSDSKVSNGLIFLDNA 878

Query: 152  SELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGE 211
               RI  LP   +YD   PV+KVP+  T   + YH  + TY + T    P   Y   + E
Sbjct: 879  KNARICELPRDFNYDNNLPVKKVPIGETVKSVTYHELSNTYVVSTYREIP---YNALDEE 935

Query: 212  DKELVTDPRDSRFIPPLVSQFHVSL--FSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYE 269
               +    +D     P  + +  SL   SP++W  I      L + E  + +K++ ++  
Sbjct: 936  GNPIAGLKKDK----PSANSYKGSLKLISPYNWTVIETVE--LRDNEIAMTVKSMVLDIG 989

Query: 270  GTLSGLR---GYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQ 326
             +    +     + +GT     ED+   G   +++II+++PEPG+P T +K K    ++ 
Sbjct: 990  SSTKRFKHRKELLVVGTGRYRMEDLGANGAFKIYEIIDIIPEPGKPETNHKFKEYNTEDT 1049

Query: 327  KGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYA 386
            KG VT++C V+G  + A GQKI +  ++D+ +  +AF+DT VY++   S  NL+++GD  
Sbjct: 1050 KGAVTSMCEVSGRFLVAQGQKIIVRDVQDDGVVPVAFLDTSVYVSEAKSFGNLVILGDTL 1109

Query: 387  RSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLE 446
            +S+ L  +  E   + ++ +D +                  +D S               
Sbjct: 1110 KSVWLAGFDAEPFRMIMLGKDLQS-----------------VDVS--------------- 1137

Query: 447  ICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHV 506
             C +  SK  +I         +I+  +  + L  + PE   S+ G RL+ +  F++    
Sbjct: 1138 -CAEFISKDEEIY-------ILIAGNNNVMHLVQFDPEDPTSSNGQRLVHRASFNVSSST 1189

Query: 507  NTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTS 566
                 +R  P +          F T  +++DG+     P+ E  YRR+ ++Q  +     
Sbjct: 1190 TC---MRMVPKNEEINTQYSDVFQTVGSTIDGSFFTVFPVNEFTYRRMYIIQQQLTDKEY 1246

Query: 567  HTGGLNPRAFRTYKGKGYYAGNPS-RGIIDGSLVWKFLQLSLGERLEICKKIGSK--HND 623
            H  GLNPR  R + G+ +       + I+D  ++ ++ +L+   +  I +K+ SK  + +
Sbjct: 1247 HYCGLNPRLNR-FGGEAFDDSQTGVKPILDHQVIKRYAKLNEDRKQTIAQKVSSKGVYQE 1305

Query: 624  ILDELYDIEAL 634
            I  +L + E +
Sbjct: 1306 IWKDLIEFENV 1316


>gi|449299306|gb|EMC95320.1| hypothetical protein BAUCODRAFT_25380 [Baudoinia compniacensis UAMH
            10762]
          Length = 1437

 Score =  168 bits (426), Expect = 8e-39,   Method: Compositional matrix adjust.
 Identities = 160/645 (24%), Positives = 270/645 (41%), Gaps = 104/645 (16%)

Query: 17   VQELLTVSLGLHG-NRPLLLVRT-QHELLIYQAFRHPKGALK--------LRFKKLKVLF 66
            + E+L V LG  G  RP L+VRT   +L++Y+ F +    L         LRF+K+   +
Sbjct: 777  LTEVLVVDLGAEGVTRPYLIVRTAMDDLILYEPFHYSATTLDARATGFTDLRFRKVPFTY 836

Query: 67   VSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTI 126
            +    +  +   G P     +Q++  + I G   ++L G  P++L   +    +   +  
Sbjct: 837  LPKYDEGLDTADGRP-----AQLQP-AVIGGRNALYLPGGTPSFLVKEATSLPKVLGLRA 890

Query: 127  DGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFL--- 183
             G V + +P H   C +GF   +   +L+   LP H+S+   W VR + L   P  +   
Sbjct: 891  RG-VRSFSPLHRAGCQQGFALVDGDGKLKEYQLPGHVSFATGWSVRTLTLGEPPQEVRQV 949

Query: 184  AYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWE 243
            AYH +   Y + T       D+   + ++++   +P     + P V Q+ + L S  S +
Sbjct: 950  AYHEQRGIYVVATCR---DVDFTLHDLDERQRDDEPN----LKPQVPQYTLHLLSATSHK 1002

Query: 244  EIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDII 303
             I     P  E   V  LK + +E        +  + +G      ED   +G + +FDII
Sbjct: 1003 VIQSLEMPYAEI--VTSLKIMPLEVSEHTHEQKLMLVVGAAAQRGEDAPAKGLLTVFDII 1060

Query: 304  EVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLV-TAVGQKIYIWQLK-DNDLTGI 361
            +VVPEP  P +  ++ +   +E KG +TA+   +G LV TA GQKI +  LK D     +
Sbjct: 1061 DVVPEPDDPESGIRLHIAAREETKGAITALESFSGGLVGTAQGQKIMVRGLKEDGTCLPV 1120

Query: 362  AFIDTEVYIASMVSV--KNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYY 419
            AF+D + Y+ S+ ++    L L GD  + +    +  E   L+L+ +     +  S  + 
Sbjct: 1121 AFLDAQTYMVSLKTMGRSGLSLAGDAWKGLWFGGWTEEPYRLTLLGKSRTKMEVVSAEFL 1180

Query: 420  AGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLF 479
                     DG L                                   ++ D   ++ + 
Sbjct: 1181 P-------FDGQLY---------------------------------LLVVDGKMDLHVL 1200

Query: 480  MYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI--------RCKPSSISDAPGARS---- 527
             Y PE  ++  G RL+ K+ FHLG        +        +  P +  D+ G  +    
Sbjct: 1201 QYDPENPKTVSGQRLLHKSTFHLGHWPVDMLLLPSDLAPFAQQAPLTNGDSNGHTNGTES 1260

Query: 528  -------------RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPR 574
                           LT + S  GA+G   P+ E  YRRL  LQ  + +   H  GLNPR
Sbjct: 1261 SAANAPAPAPSLFHVLTTFQS--GAVGLITPVDEATYRRLGALQTQLTSVLEHAAGLNPR 1318

Query: 575  AFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGS 619
            A+R  + +        RG++DG LV +  +L    R E+  + G+
Sbjct: 1319 AYRAVESESLGG----RGVVDGMLVQRIGELGAARRAEVLGRAGA 1359


>gi|116182170|ref|XP_001220934.1| hypothetical protein CHGG_01713 [Chaetomium globosum CBS 148.51]
 gi|88186010|gb|EAQ93478.1| hypothetical protein CHGG_01713 [Chaetomium globosum CBS 148.51]
          Length = 1394

 Score =  168 bits (425), Expect = 9e-39,   Method: Compositional matrix adjust.
 Identities = 140/502 (27%), Positives = 226/502 (45%), Gaps = 66/502 (13%)

Query: 17   VQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGA-----LKLRFKKLKVLFVSDRS 71
            V E+L   L    NR  L      +L IYQ FR+   A       L F+KL     +   
Sbjct: 856  VAEILVADLA---NRSQLR-HANDDLTIYQPFRYSTSAGADFSKTLFFQKLPNAAFAKSP 911

Query: 72   KRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVS 131
            + A+E     +  R+  MR  SNIAGY  VFL G  P+++  +S+   R  P+   G V 
Sbjct: 912  EEADEDEATHQ-PRMLSMRRCSNIAGYSTVFLPGASPSFIIKSSKSAPRVLPLQGAG-VI 969

Query: 132  TLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSY-DAPWPVRKVPLKCTPHFLAYHLETK 190
             ++PFH   C  GF+Y +++   R++ LP   +Y +    VRK+P+      +AYH   +
Sbjct: 970  AMSPFHTEGCENGFIYADSQHMARVTQLPQDWNYAETGLAVRKIPIGEDIAAVAYHPPMQ 1029

Query: 191  TYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNF 250
            +Y +  +T EP    ++   +D       R++    P V +  + L SP +W  +     
Sbjct: 1030 SYVVGCNTLEP----FELPKDDDYHKEWARENLSFKPTVDRGILKLVSPITWTVVDSVQ- 1084

Query: 251  PLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPG 310
             +   E VLC+  +S+E     +  +  IA+GT     ED+  RGR+ ++DI EV+PEPG
Sbjct: 1085 -MEPCETVLCVATLSLEVSEFTNERKQLIAVGTALIKGEDLPTRGRVYVYDITEVIPEPG 1143

Query: 311  QPLTKNKIKMIYAKEQ--KGPVTAICHVA--GFLVTAVGQKIYIWQLK-DNDLTGIAFID 365
            +P T  K+K+I AKE+  +G VTA+  +   G ++ A GQK  +  LK D  L  +AF+D
Sbjct: 1144 RPETSKKLKLI-AKEEIPRGAVTALSEIGTQGLMLVAQGQKCMVRGLKEDGTLLPVAFMD 1202

Query: 366  TEVYI--ASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNP 423
               Y+  A  +    L L+ D  + +    Y  E   + L  +                 
Sbjct: 1203 MNCYVTNAKELPGTGLCLLADAFKGVWFTGYTEEPYKMMLFGKS---------------- 1246

Query: 424  SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQP 483
                                +LE+       + D L +   +  +  D D N+ +  + P
Sbjct: 1247 ------------------STKLEVL------NADFLPDGKDLFIVACDADGNIHILEFDP 1282

Query: 484  EARESNGGHRLIKKTDFHLGQH 505
            E  +S  GH L+ +T F+ G +
Sbjct: 1283 EHPKSLQGHLLLHRTTFNTGAN 1304


>gi|405121446|gb|AFR96215.1| cleavage and polyadenylation specific protein [Cryptococcus
            neoformans var. grubii H99]
          Length = 1431

 Score =  167 bits (424), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 149/621 (23%), Positives = 264/621 (42%), Gaps = 75/621 (12%)

Query: 25   LGLHGNRPLLLVRTQHELLIYQAFRHPKGALKLRFKKL--KVLFVSDRSKRANEQPGLPR 82
            L LH +  L     Q    +  A  H + +L +RF+K+  ++L +S      N    LP 
Sbjct: 871  LALHHSGRLNAYEAQPRFTV-DASSHSRRSLAVRFRKVHTQLLPISGGVGTTNGNARLPY 929

Query: 83   GVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVN-- 140
             +       F+NI G  G F+ G  P W+  +      AHP+           F      
Sbjct: 930  TIV-----PFNNIEGLTGAFITGEKPHWIISS-----EAHPLRAFALKQAAMAFGKTTHL 979

Query: 141  CPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAE 200
              +G  +   +    I  LP  L+ D   P  +  ++     + +   +  Y    S   
Sbjct: 980  GGKGEYFIRIEDGSFICYLPPTLNTDFAIPCDRYQMERAYTNITFDPTSAHYVGAASIEV 1039

Query: 201  PSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFS--WEEIPQTNFPLHEWEHV 258
            P   Y     E+ E+   P     IPP   +  + LFS  S  W  I    +   + E V
Sbjct: 1040 PFQAY----DEEGEIQLGPDGPDLIPPTNQRSTLELFSQGSDPWRVI--DGYEFDQNEEV 1093

Query: 259  LCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKI 318
            + +++V++E  G   G R +IA+GT +N+ ED   RG   +F+I++ V   G     +  
Sbjct: 1094 MSMESVNLESPGAPGGYRDFIAVGTGFNFGEDRATRGNTYIFEILQTVGPQGGGGPGSVP 1153

Query: 319  KMIYAKEQKG----PVTAICHVAGFLVTAVGQKIYIWQLK-DNDLTGIAFIDTEVYIASM 373
                 K  K     PV A+ H+ G+L+   G K+Y+  L  D  L G+AF+D ++Y  ++
Sbjct: 1154 GWKLVKRTKDPARHPVNAVNHINGYLLNTNGPKLYVKGLDYDAQLMGLAFLDIQLYATTV 1213

Query: 374  VSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLV 433
               KN +L+GD  +S   +  Q +    + +++D                          
Sbjct: 1214 KVFKNFMLIGDLCKSFWFVSLQEDPYKFTTISKD-------------------------- 1247

Query: 434  WKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR 493
                           + +     D L     + F+ SD++ ++ +  + P   +S  G R
Sbjct: 1248 --------------LQHVSVVTADFLVHDGQVTFISSDRNGDMRMLDFDPTDPDSLNGER 1293

Query: 494  LIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRR 553
            L+ KT++H G    T  K+  +  +  +    +++ +  YA+ DGAL   + + +  ++R
Sbjct: 1294 LMLKTEYHAGSAA-TVSKVIARRKTAEEEFAPQTQII--YATADGALTTVVSVKDARFKR 1350

Query: 554  LLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEI 613
            L ++ + +V +  H  GLNPRAFRT +         S+GI+DG L+ +F    +G + E+
Sbjct: 1351 LQLVSDQLVRNAQHVAGLNPRAFRTVRND-LLPRPLSKGILDGQLLNQFALQPIGRQKEM 1409

Query: 614  CKKIGSKHNDILDELYDIEAL 634
             ++IG+   D +    D++AL
Sbjct: 1410 MRQIGT---DAVTVASDLQAL 1427


>gi|58268668|ref|XP_571490.1| cleavage and polyadenylation specific protein [Cryptococcus
            neoformans var. neoformans JEC21]
 gi|134113364|ref|XP_774707.1| hypothetical protein CNBF3860 [Cryptococcus neoformans var.
            neoformans B-3501A]
 gi|338817789|sp|P0CM63.1|CFT1_CRYNB RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two protein
            1
 gi|338817790|sp|P0CM62.1|CFT1_CRYNJ RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two protein
            1
 gi|50257351|gb|EAL20060.1| hypothetical protein CNBF3860 [Cryptococcus neoformans var.
            neoformans B-3501A]
 gi|57227725|gb|AAW44183.1| cleavage and polyadenylation specific protein, putative [Cryptococcus
            neoformans var. neoformans JEC21]
          Length = 1431

 Score =  167 bits (424), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 143/599 (23%), Positives = 259/599 (43%), Gaps = 74/599 (12%)

Query: 47   AFRHPKGALKLRFKKL--KVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLC 104
            A  H + +L +RF+K+  ++L +S      N    LP  +       F+NI G  G F+ 
Sbjct: 892  ASSHSRRSLAVRFRKVHTQLLPISGGVGTTNGNARLPYTIV-----PFNNIEGLTGAFIT 946

Query: 105  GPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVN--CPRGFLYFNAKSELRISVLPTH 162
            G  P W+  +      AHP+           F        +G  +   +    I  LP  
Sbjct: 947  GEKPHWIISS-----EAHPLRAFALKQAAMAFGKTTHLGGKGEYFIRIEDGSFICYLPPT 1001

Query: 163  LSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDS 222
            L+ D   P  +  ++     + +   +  Y    S   P   Y     E+ E+   P   
Sbjct: 1002 LNTDFAIPCDRYQMERAYTNITFDPTSAHYVGAASIEVPFQAY----DEEGEIQLGPDGP 1057

Query: 223  RFIPPLVSQFHVSLFSPFS--WEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIA 280
              IPP   +  + LFS  S  W+ I    +   + E V+ +++V++E  G   G R +IA
Sbjct: 1058 DLIPPTNQRSTLELFSQGSDPWKVI--DGYEFDQNEEVMSMESVNLESPGAPGGYRDFIA 1115

Query: 281  LGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKG----PVTAICHV 336
            +GT +N+ ED   RG   +F+I++ V   G     +       K  K     PV A+ H+
Sbjct: 1116 VGTGFNFGEDRATRGNTYIFEILQTVGPQGGGGPGSVPGWKLVKRTKDPARHPVNAVNHI 1175

Query: 337  AGFLVTAVGQKIYIWQLK-DNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQ 395
             G+L+   G K+Y+  L  D+ L G+AF+D ++Y  ++   KN +L+GD  +S   +  Q
Sbjct: 1176 NGYLLNTNGPKLYVKGLDYDSQLMGLAFLDIQLYATTVKVFKNFMLIGDLCKSFWFVSLQ 1235

Query: 396  PEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKH 455
             +    + +++D                                         + +    
Sbjct: 1236 EDPYKFTTISKD----------------------------------------LQHVSVVT 1255

Query: 456  NDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCK 515
             D L     + F+ SD++ ++ +  + P   +S  G RL+ +T++H G    T  K+  +
Sbjct: 1256 ADFLVHDGQVTFISSDRNGDMRMLDFDPTDPDSLNGERLMLRTEYHAGSAA-TVSKVIAR 1314

Query: 516  PSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRA 575
              +  +    +++ +  YA+ DGAL   + + +  ++RL ++ + +V +  H  GLNPRA
Sbjct: 1315 RKTAEEEFAPQTQII--YATADGALTTVVSVKDARFKRLQLVSDQLVRNAQHVAGLNPRA 1372

Query: 576  FRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEAL 634
            FRT +         S+GI+DG L+ +F    +G + E+ ++IG+   D +    D++AL
Sbjct: 1373 FRTVRND-LLPRPLSKGILDGQLLNQFALQPIGRQKEMMRQIGT---DAVTVASDLQAL 1427


>gi|452841862|gb|EME43798.1| hypothetical protein DOTSEDRAFT_79774 [Dothistroma septosporum NZE10]
          Length = 1347

 Score =  167 bits (424), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 163/643 (25%), Positives = 263/643 (40%), Gaps = 85/643 (13%)

Query: 19   ELLTVSLGLHG-NRPLLLVRTQ-HELLIYQAFRHPKGA------LKLRFKKLKVLFVSDR 70
            ELL   LG  G + P L+ RT   +L++Y+ FRHP+ A        LRF+K+ V ++   
Sbjct: 746  ELLVAELGASGVDTPYLVARTALDDLVLYEPFRHPEPAPSDQWYTNLRFRKVPVTYI--- 802

Query: 71   SKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGP- 129
              + NE        R   +R   ++  Y  V + G  P  L   +    R   + I    
Sbjct: 803  -PKYNEAIAQEESTRPLPLRSI-HVGDYDAVTIPGSPPLLLVKEASSLPRVLEVRISNES 860

Query: 130  --VSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTP---HFLA 184
              V+TL P H  +C +GF   NA   L    LP    Y   W V++V L         LA
Sbjct: 861  NRVATLLPIHLDHCKKGFAAVNADGLLEEYHLPLSAWYGTGWSVQQVDLGSEDLEVRHLA 920

Query: 185  YHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDP--RDSRFIPPLVSQFHVSLFSPFSW 242
            YH     Y + T       D+Y    + + L      +D   + P V Q+ + L S  + 
Sbjct: 921  YHETRGVYVVATCK---DVDFYFAEDDHRHLGQSGGGQDDITLRPQVKQYSIHLVSSKTH 977

Query: 243  EEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDI 302
              I     P    E +  L+ + +E           I + T     ED+  RG I++F+I
Sbjct: 978  RVIDSRAMPY--LEAITALQVMPLEVSELTHEQDLRILVSTAAMRGEDMPARGAIIVFNI 1035

Query: 303  IEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV-AGFLVTAVGQKIYIWQLK-DNDLTG 360
            I+VVP P  P +  K+ +   +E KG +TA+     GF+ +  GQKI I  LK D     
Sbjct: 1036 IDVVPAPDVPESGIKLHVNAREETKGAITALAPFPGGFVGSGQGQKIMIRGLKEDGSCLP 1095

Query: 361  IAFIDTEVY--IASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGY 418
            +AF+D + +  +   +    + L GD  + +    +  E   L+++ +            
Sbjct: 1096 VAFLDAQCHTTVIKTLGTSGMWLAGDAWKGLWFGGFTEEPYKLTVLGK------------ 1143

Query: 419  YAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVL 478
                P R +    +  +FL                          ++  +I D D ++ +
Sbjct: 1144 ---APERQM--EVMAAEFLPFD----------------------GALYILIIDADMDLHV 1176

Query: 479  FMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI--------RCKPSSISDAPGARSR-- 528
              Y PE  +S  G RL+ ++ FHLG        +          +P +  D  G      
Sbjct: 1177 LQYDPENPKSQNGMRLLHRSTFHLGHFATNMLLLPSSLNPFGENQPFTNGDTNGESPEES 1236

Query: 529  ---FLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYY 585
               F     SL G++G   PL E +YRRL  LQ  + T   H   LNPRA+R  + + + 
Sbjct: 1237 SPLFHVLTTSLTGSIGMITPLDESSYRRLSALQTHLTTILEHPASLNPRAYRAIESESFG 1296

Query: 586  AGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEL 628
                +RG++DG++V +  +L    R ++  + G+    I  +L
Sbjct: 1297 G---ARGVVDGNIVRRINELGAARRADVLARAGADAWSIRSDL 1336


>gi|149237256|ref|XP_001524505.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
            YB-4239]
 gi|146452040|gb|EDK46296.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
            YB-4239]
          Length = 1380

 Score =  167 bits (423), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 132/542 (24%), Positives = 239/542 (44%), Gaps = 67/542 (12%)

Query: 88   QMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLY 147
            ++ YF N+ GY  +F+ G  P  +  +     R    +   P  +++ F +     G + 
Sbjct: 877  RLVYFPNLNGYTCIFVTGVIPFIIIKSLHSIPRIFQFS-KIPAVSISAFSDSKIKNGLIC 935

Query: 148  FNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYK 207
             +     RI  L    +Y+   P+++V L      LAYH ++ T    T    P    Y 
Sbjct: 936  LDNNKNARICELSLDYTYEFNLPIKRVDLGELVRSLAYHEQSDTVVASTFKEVP----YN 991

Query: 208  FNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSME 267
               E+  ++      +    L  +  + L SP +W  I   +F L + E  + +K++ ++
Sbjct: 992  CVDEEGNIIPGVYKEKLPHALTFKSSIKLISPHNWTVID--SFDLEDNEVGMTVKSMILD 1049

Query: 268  YEGTLSGL------RGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMI 321
                 + L      R YI +GT     ED+   G   +++II+++PEPG+P T +K K +
Sbjct: 1050 RGSGAASLKKFKSKREYIVIGTGKLRMEDLAANGSFKIYEIIDIIPEPGKPETNHKFKEV 1109

Query: 322  YAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLIL 381
            + ++ +G VTAIC ++G L+   GQKI +  ++D+ +  +AF+DT VYI+   S  NL++
Sbjct: 1110 FQEDARGAVTAICDLSGRLMVGQGQKIIVRDIEDDGVVPVAFLDTSVYISEAKSFGNLLI 1169

Query: 382  VGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSL 441
            +GD  +S+ L+ ++ E   + ++ +D                                  
Sbjct: 1170 LGDPLKSVWLVGFEAEPYRMVMLGKDR--------------------------------- 1196

Query: 442  GERLEI-CKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDF 500
             + L++ C     K  DI         +++D +  + L  Y P+  +S  G  L+ K  F
Sbjct: 1197 -QHLDVECADFIIKDEDIF-------ILVADNNNCIHLIQYDPDDPKSINGTILLNKASF 1248

Query: 501  HLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNV 560
             L         +R  P       G +  +    ++LDGAL    P+ E  YRR+ +LQ  
Sbjct: 1249 ELNSATTC---LRSIPK------GEKGDYQIIGSTLDGALYNVFPVNEFTYRRMYILQQQ 1299

Query: 561  MVTHTSHTGGLNPRAFRTYKGKGYYAGNP--SRGIIDGSLVWKFLQLSLGERLEICKKIG 618
            +     H  GLNPR  R + G          ++ I+D  L+ +F +L+L  + ++  KI 
Sbjct: 1300 ISDKVYHFCGLNPRLNR-FGGSVTLRDRETNTKPILDYGLIRRFSKLNLDRQQQLAGKIS 1358

Query: 619  SK 620
             +
Sbjct: 1359 VR 1360


>gi|321260384|ref|XP_003194912.1| cleavage and polyadenylation specific protein [Cryptococcus gattii
            WM276]
 gi|317461384|gb|ADV23125.1| cleavage and polyadenylation specific protein, putative [Cryptococcus
            gattii WM276]
          Length = 1431

 Score =  167 bits (422), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 147/621 (23%), Positives = 265/621 (42%), Gaps = 75/621 (12%)

Query: 25   LGLHGNRPLLLVRTQHELLIYQAFRHPKGALKLRFKKL--KVLFVSDRSKRANEQPGLPR 82
            L LH +  L     Q    +  A  H + +L +RF+K+  ++L +S      N    LP 
Sbjct: 871  LALHHSGRLNAYEAQPRFTV-DASSHSRRSLAVRFRKVHTQLLPISGGVGTTNGSARLPY 929

Query: 83   GVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVN-- 140
             +       F+NI G  G F+ G  P W+  +      AHP+           F      
Sbjct: 930  TIV-----PFNNIEGLTGAFITGEKPHWIISS-----EAHPLRAFALKQAAMAFGKTTHL 979

Query: 141  CPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAE 200
              +G  +   +    I  LP  L+ D   P  +  ++ T   + +   +  Y    S   
Sbjct: 980  GGKGEYFIRIEDGSFICYLPPTLNTDFAIPCDRYQMERTYTNITFDPTSAHYVGAASIEV 1039

Query: 201  PSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFS--WEEIPQTNFPLHEWEHV 258
            P   Y     E+ E+   P     IPP   +  + LFS  S  W+ I    +   + E V
Sbjct: 1040 PFQAY----DEEGEIQLGPDGPDLIPPTNQRSTLELFSQGSDPWKVI--DGYEFDQNEEV 1093

Query: 259  LCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKI 318
            + +++V++E  G   G R +IA+GT +N+ ED   RG   +F+I++ V   G     +  
Sbjct: 1094 MSMESVNLESPGAPGGYRDFIAVGTGFNFGEDRATRGNTYIFEILQTVGPQGGGGPGSVP 1153

Query: 319  KMIYAKEQKG----PVTAICHVAGFLVTAVGQKIYIWQLK-DNDLTGIAFIDTEVYIASM 373
                 +  K     PV A+ H+ G+L+   G K+Y+     D  L G+AF+D ++Y  ++
Sbjct: 1154 GWKLVRRTKDPARHPVNAVNHINGYLLNTNGPKLYVKGFDYDAQLMGLAFLDIQLYATTV 1213

Query: 374  VSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLV 433
               KN +L+GD  +S   +  Q +    + +++D                          
Sbjct: 1214 KVFKNFMLIGDLCKSFWFVSLQEDPYKFTTISKD-------------------------- 1247

Query: 434  WKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR 493
                           + +     D L     + F+ SD++ ++ +  + P   +S  G R
Sbjct: 1248 --------------LQHVSVVTADFLVHDGQVTFISSDRNGDMRMLDFDPTDPDSLNGER 1293

Query: 494  LIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRR 553
            L+ +T++H G    T  K+  +  +  +    +++ +  YA+ DGAL   + + +  ++R
Sbjct: 1294 LMLRTEYHAGSAA-TVSKVIARRKTTEEEFAPQTQII--YATADGALTTVVSVKDARFKR 1350

Query: 554  LLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEI 613
            L ++ + +V +  H  GLNPRAFRT +         S+GI+DG L+ +F    +G + E+
Sbjct: 1351 LQLVSDQLVRNAQHVAGLNPRAFRTVRND-LLPRPLSKGILDGQLLNQFALQPIGRQKEM 1409

Query: 614  CKKIGSKHNDILDELYDIEAL 634
             ++IG+   D +    D++AL
Sbjct: 1410 MRQIGT---DAVTVASDLQAL 1427


>gi|328848896|gb|EGF98089.1| hypothetical protein MELLADRAFT_96156 [Melampsora larici-populina
            98AG31]
          Length = 1427

 Score =  166 bits (420), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 139/568 (24%), Positives = 244/568 (42%), Gaps = 87/568 (15%)

Query: 96   AGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELR 155
            A + G+ L G  P W+  T  G ++ +    +  ++ L      +    FL  + + E+ 
Sbjct: 917  ATFSGIHLSGLEPIWIVSTDHGPVQIYKAKTNQTITYL------DQSDKFLVSDHQVEIW 970

Query: 156  ISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKEL 215
             S +   +  D   PVR V    +   + Y  +       +    P  ++     E+  +
Sbjct: 971  ESEVGEGVCLDGRIPVRLVKDGRSFSKIVYEPKMDVVIGASYLVTPFANFT----EEGVM 1026

Query: 216  VTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGL 275
            + +  D   + P   +  + L  P SW+ I    F  +EW  V  +K VS++ +   SG 
Sbjct: 1027 MWEQDDESKVRPNGFRSSLELILPGSWDTIDGHEFQQNEW--VTSMKLVSLDSKSKRSGR 1084

Query: 276  RGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICH 335
            R +I  GT  N +ED+  RG + +F++IE+VP+P  P     +++ Y +  K  VTA+  
Sbjct: 1085 RDFIGAGTTCNRAEDLAARGGVYVFEVIEIVPDPKHPERNRGLRLRYHETTKACVTAVDG 1144

Query: 336  VAGFLVTAVGQKI----------------------YIWQL------KDNDLTGIAFIDTE 367
            + G+ +  +GQK+                      +  +L      +D  L  + F+D  
Sbjct: 1145 LNGYFIHTMGQKVDPGYPRSPTRKYSDILADQIIAFYSKLYAKCFEQDERLLAVGFLDIR 1204

Query: 368  VYIASMVSVKNLILVGDYARSIALLRYQPE-YRTLSLVARDYKPTQPNSKGYYAGNPSRG 426
             Y  ++  +KN I++GD  + I L+ +Q E Y+ + L             G+        
Sbjct: 1205 PYTTTLKVLKNFIVLGDAVKGITLVAFQEEPYKLIEL-------------GH-------- 1243

Query: 427  IIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEAR 486
                     F+ L        C  I     D L   + +  + SD    + +F Y P   
Sbjct: 1244 --------TFVDLR-------CSTI-----DFLVLENKLSIVTSDLGGTIRIFEYNPTNI 1283

Query: 487  ESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPL 546
            ES GG +L+ +T+F     + +      + SS  +A        T +A LDG++   +P+
Sbjct: 1284 ESQGGLKLLCRTEFGTAGEMGSSLGFGKRLSSKEEAKSIG----TLFAGLDGSISSLVPV 1339

Query: 547  PEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLS 606
             E  ++RL ++Q  ++ H  H  GLNPR FRT +     +   +RGIIDG ++ +F  L 
Sbjct: 1340 KEAVFKRLQIVQTRLIRHLDHFAGLNPRGFRTVRND-LVSRAMNRGIIDGEIIERFGALK 1398

Query: 607  LGERLEICKKIGSKHNDILDELYDIEAL 634
            L E+  I K  GS  N IL  L +++ +
Sbjct: 1399 LDEQDSIGKLAGSDRNTILINLNNLKGI 1426


>gi|68471460|ref|XP_720278.1| likely Cleavage and Polyadenylation Specificity Factor subunit
           fragment [Candida albicans SC5314]
 gi|46442138|gb|EAL01430.1| likely Cleavage and Polyadenylation Specificity Factor subunit
           fragment [Candida albicans SC5314]
          Length = 758

 Score =  165 bits (417), Expect = 9e-38,   Method: Compositional matrix adjust.
 Identities = 138/595 (23%), Positives = 250/595 (42%), Gaps = 85/595 (14%)

Query: 59  FKKLKVLFVSDRSKRANEQPGLPRGVRISQ-MRYFSNIAGYQGVFLCGPHPAWLFLTSRG 117
           FKK K L ++     A      P G  I + + YF N+ G+  +F+ G  P  +  T   
Sbjct: 196 FKKEKDLTITGAPDNA-----FPYGTSIERRLVYFPNLNGFTSIFVTGVIPYLILKTVHS 250

Query: 118 ELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLK 177
             R    +    +S ++ F +     G ++ + +   RI  LP   +Y+   P++ V + 
Sbjct: 251 IPRIFQFSKIAAMS-ISAFSDSKIKNGLIFLDNQQNARICELPLDFNYEFNLPMKHVDIG 309

Query: 178 CTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLF 237
            +   +AYH  + T  + T    P   Y   + E K +    +D +  P +  +  + L 
Sbjct: 310 ESIKSIAYHETSDTVVLSTFKQIP---YDCLDEEGKPIAGIIKDIKDTPAMSFKGSIKLV 366

Query: 238 SPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSG-----------------LRGYIA 280
           SP++W  I      L + E  + LK++ ++  G+ SG                  R YI 
Sbjct: 367 SPYNWTVIET--IELEDNEVGMTLKSMILDV-GSESGSTLGSDPNSLIKKYNKKKREYIP 423

Query: 281 LGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFL 340
           +G      ED+   G   +++II+++PEPG+P T +K K I+ +E +G +T+IC ++G  
Sbjct: 424 IGIGKYRMEDLAANGIFKIYEIIDIIPEPGKPETNHKFKEIFKEETRGAITSICELSGRF 483

Query: 341 VTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRT 400
           + + GQK+ +  L+D+    +AF+DT VY++   S  NL+++GD  +   L+ +  E   
Sbjct: 484 LVSQGQKVIVRDLQDDGTVPVAFLDTPVYVSESKSFGNLLILGDLLKGCWLVGFDAEPFR 543

Query: 401 LSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILD 460
           + ++ +D                                         + I  +  D + 
Sbjct: 544 MIMLGKD----------------------------------------TQHISVECADFII 563

Query: 461 EFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFK---IRCKPS 517
               +  +++D +  + L  Y P+  +S  G +L+ K  F L   ++       I  + S
Sbjct: 564 NDDEIFVLVADNNNVLHLLNYDPDDPQSINGTKLLTKASFELNSTISCLRSLPLIDIEES 623

Query: 518 SISDA-----------PGARSRFLTWYASL-DGALGFFLPLPEKNYRRLLMLQNVMVTHT 565
             +DA           P   S +     S  DG+     P+ E  YRR+ +LQ  ++   
Sbjct: 624 VQTDALTNIAVPPPLPPNTTSNYFQVIGSTQDGSFFNVFPINEAAYRRMYILQQQLIDKE 683

Query: 566 SHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 620
            H  GLNPR  R    K       ++ I+D  L+  F +LS   +  +  K+  K
Sbjct: 684 FHYCGLNPRLNRIGSIKLQNNETNTKPILDYDLIRSFTKLSDDRKRNLANKVSGK 738


>gi|255720869|ref|XP_002545369.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
 gi|240135858|gb|EER35411.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
          Length = 1351

 Score =  164 bits (415), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 137/572 (23%), Positives = 247/572 (43%), Gaps = 77/572 (13%)

Query: 59   FKKLKVLFVSDRSKRANEQPGLPRGVRISQ-MRYFSNIAGYQGVFLCGPHPAWLFLTSRG 117
            F+K K L ++   + A      P G  I + + YF N+ G+  +FL G  P  +  T   
Sbjct: 827  FRKEKDLTITGAPENA-----FPYGTSIERRLVYFPNLNGFTCIFLTGVIPYLILKTIHA 881

Query: 118  ELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLK 177
              R    T   P  +++ F +     G ++ + +   RI  LP   +Y+   P++ VP+ 
Sbjct: 882  IPRIFQFT-KIPAVSISAFSDSKIKNGLIFLDNEQNARICELPLDYNYEFNLPMKHVPIG 940

Query: 178  CTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFH--VS 235
             +   +AYH  +   C+V ST +     Y    E+ +L+    + +   P  + F   + 
Sbjct: 941  ESIKAMAYHEASD--CVVVSTFKEIP--YNCVDEEGKLIVGVMEDK---PAATSFKGSIK 993

Query: 236  LFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGT-----LSGLRGYIALGTNYNYSED 290
            L SP++W  I      L + E  + LK++ ++   +         R YI +GT     ED
Sbjct: 994  LISPYNWSVI--DTIELDDNEVGMSLKSMVLDIGSSSLIKKFKNKREYIVVGTGKYRMED 1051

Query: 291  VTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYI 350
            +   G   +F+II+++PEPG+P T +K K  + +  KG VT++C ++G  + + GQK+ +
Sbjct: 1052 LAANGAFKIFEIIDIIPEPGKPETNHKFKETFQENIKGAVTSVCELSGRFLVSQGQKVIV 1111

Query: 351  WQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKP 410
              L+D+    +AF+DT VY++   S  NL+++GD  +   L+ +  E   + ++ +D + 
Sbjct: 1112 RDLQDDGTVPVAFLDTPVYVSESKSFGNLLILGDPLKGCWLIGFDAEPFRMIMLGKDTQ- 1170

Query: 411  TQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMIS 470
                                        LS+      C     K +++         +++
Sbjct: 1171 ---------------------------HLSVE-----CADFIIKDDEVY-------ILVA 1191

Query: 471  DKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFL 530
            D +  + L  Y P+  +S  G +L+ K  F L   +          S +   P   + F 
Sbjct: 1192 DNNNVLHLLNYDPDDPQSINGTKLLTKASFELASPI----------SCLRTLPIDDNNFQ 1241

Query: 531  TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPS 590
               +  DG+     P+ E  YRR+ +LQ  +     H  GLNPR  R   G      N +
Sbjct: 1242 IIGSCQDGSFFNVFPINESTYRRMYILQQQLTEKEYHYCGLNPRLNRV--GNLALKDNDT 1299

Query: 591  --RGIIDGSLVWKFLQLSLGERLEICKKIGSK 620
              + I+D  L+  F +L+   +     K+  K
Sbjct: 1300 NIKPILDYGLIRIFAKLNTDRKKAFANKVSGK 1331


>gi|238881599|gb|EEQ45237.1| conserved hypothetical protein [Candida albicans WO-1]
          Length = 1423

 Score =  164 bits (414), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 138/595 (23%), Positives = 251/595 (42%), Gaps = 85/595 (14%)

Query: 59   FKKLKVLFVSDRSKRANEQPGLPRGVRISQ-MRYFSNIAGYQGVFLCGPHPAWLFLTSRG 117
            FKK K L ++     A      P G  I + + YF N+ G+  +F+ G  P  +  T   
Sbjct: 861  FKKEKDLTITGAPDNA-----FPYGTSIERRLVYFPNLNGFTSIFVTGVIPYLILKTVHS 915

Query: 118  ELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLK 177
              R    +    +S ++ F +     G ++ + +   RI  LP   +Y+   P++ V + 
Sbjct: 916  IPRIFQFSKIAAMS-ISAFSDSKIKNGLIFLDNQQNARICELPLDFNYEFNLPMKHVDIG 974

Query: 178  CTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLF 237
             +   +AYH  + T  + T    P   Y   + E K +    +D +  P +  +  + L 
Sbjct: 975  ESIKSIAYHETSDTVVLSTFKQIP---YDCLDEEGKPIAGIIKDIKDTPAMSFKGSIKLV 1031

Query: 238  SPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGL-----------------RGYIA 280
            SP++W  I      L + E  + LK++ ++  G+ SG                  R YI 
Sbjct: 1032 SPYNWTVIET--IELEDNEVGMTLKSMILDV-GSESGSTLGSDPNSLIKKYNKKKREYIV 1088

Query: 281  LGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFL 340
            +G      ED+   G   +++II+++PEPG+P T +K K I+ +E +G +T+IC ++G  
Sbjct: 1089 IGIGKYRMEDLAANGIFKIYEIIDIIPEPGKPETNHKFKEIFKEETRGAITSICELSGRF 1148

Query: 341  VTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRT 400
            + + GQK+ +  L+D+    +AF+DT VY++   S  NL+++GD  +   L+ +  E   
Sbjct: 1149 LVSQGQKVIVRDLQDDGTVPVAFLDTPVYVSESKSFGNLLILGDPLKGCWLVGFDAEPFR 1208

Query: 401  LSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILD 460
            + ++ +D                                         + I  +  D + 
Sbjct: 1209 MIMLGKD----------------------------------------TQHISVECADFII 1228

Query: 461  EFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFK---IRCKPS 517
                +  +++D +  + L  Y P+  +S  G +L+ K  F L   ++       I  + S
Sbjct: 1229 NDDEIFVLVADNNNVLHLLNYDPDDPQSINGTKLLTKASFELNSTISCLRSLPLIDIEES 1288

Query: 518  SISDA-----------PGARSRFLTWYASL-DGALGFFLPLPEKNYRRLLMLQNVMVTHT 565
              +DA           P   S +     S  DG+     P+ E  YRR+ +LQ  ++   
Sbjct: 1289 VQTDAFTNIVVPPTLPPNTTSNYFQVIGSTQDGSFFNVFPINEAAYRRMYILQQQLIDKE 1348

Query: 566  SHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 620
             H  GLNPR  R    K       ++ I+D  L+ +F +LS   +  +  K+  K
Sbjct: 1349 FHYCGLNPRLNRIGSIKLQNNETNTKPILDYDLIRRFTKLSDDRKRNLANKVSGK 1403


>gi|68471006|ref|XP_720510.1| likely Cleavage and Polyadenylation Specificity Factor subunit
            [Candida albicans SC5314]
 gi|74591422|sp|Q5AFT3.1|CFT1_CANAL RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two protein
            1
 gi|46442380|gb|EAL01670.1| likely Cleavage and Polyadenylation Specificity Factor subunit
            [Candida albicans SC5314]
          Length = 1420

 Score =  163 bits (413), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 138/595 (23%), Positives = 250/595 (42%), Gaps = 85/595 (14%)

Query: 59   FKKLKVLFVSDRSKRANEQPGLPRGVRISQ-MRYFSNIAGYQGVFLCGPHPAWLFLTSRG 117
            FKK K L ++     A      P G  I + + YF N+ G+  +F+ G  P  +  T   
Sbjct: 858  FKKEKDLTITGAPDNA-----FPYGTSIERRLVYFPNLNGFTSIFVTGVIPYLILKTVHS 912

Query: 118  ELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLK 177
              R    +    +S ++ F +     G ++ + +   RI  LP   +Y+   P++ V + 
Sbjct: 913  IPRIFQFSKIAAMS-ISAFSDSKIKNGLIFLDNQQNARICELPLDFNYEFNLPMKHVDIG 971

Query: 178  CTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLF 237
             +   +AYH  + T  + T    P   Y   + E K +    +D +  P +  +  + L 
Sbjct: 972  ESIKSIAYHETSDTVVLSTFKQIP---YDCLDEEGKPIAGIIKDIKDTPAMSFKGSIKLV 1028

Query: 238  SPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSG-----------------LRGYIA 280
            SP++W  I      L + E  + LK++ ++  G+ SG                  R YI 
Sbjct: 1029 SPYNWTVIET--IELGDNEVGMTLKSMILDV-GSESGSTLGSDPNSLIKKYNKKKREYIV 1085

Query: 281  LGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFL 340
            +G      ED+   G   +++II+++PEPG+P T +K K I+ +E +G +T+IC ++G  
Sbjct: 1086 IGIGKYRMEDLAANGIFKIYEIIDIIPEPGKPETNHKFKEIFKEETRGAITSICELSGRF 1145

Query: 341  VTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRT 400
            + + GQK+ +  L+D+    +AF+DT VY++   S  NL+++GD  +   L+ +  E   
Sbjct: 1146 LVSQGQKVIVRDLQDDGTVPVAFLDTPVYVSESKSFGNLLILGDLLKGCWLVGFDAEPFR 1205

Query: 401  LSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILD 460
            + ++ +D                                         + I  +  D + 
Sbjct: 1206 MIMLGKD----------------------------------------TQHISVECADFII 1225

Query: 461  EFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFK---IRCKPS 517
                +  +++D +  + L  Y P+  +S  G +L+ K  F L   ++       I  + S
Sbjct: 1226 NDDEIFVLVADNNNVLHLLNYDPDDPQSINGTKLLTKASFELNSTISCLRSLPLIDIEES 1285

Query: 518  SISDA-----------PGARSRFLTWYASL-DGALGFFLPLPEKNYRRLLMLQNVMVTHT 565
              +DA           P   S +     S  DG+     P+ E  YRR+ +LQ  ++   
Sbjct: 1286 VQTDALTNIAVPPPLPPNTTSNYFQVIGSTQDGSFFNVFPINEAAYRRMYILQQQLIDKE 1345

Query: 566  SHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 620
             H  GLNPR  R    K       ++ I+D  L+  F +LS   +  +  K+  K
Sbjct: 1346 FHYCGLNPRLNRIGSIKLQNNETNTKPILDYDLIRSFTKLSDDRKRNLANKVSGK 1400


>gi|322704830|gb|EFY96421.1| Cleavage factor two protein 1 [Metarhizium anisopliae ARSEF 23]
          Length = 1433

 Score =  163 bits (412), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 161/645 (24%), Positives = 282/645 (43%), Gaps = 84/645 (13%)

Query: 17   VQELLTVSLG-LHGNRPLLLVR------TQHELLIYQAFRHPKGALKLRFKKLKVLFVSD 69
            + E+L   LG      P L+VR      T +E + YQA    + +  L FKK     ++ 
Sbjct: 837  ITEILVADLGDAISQTPYLIVRHASDDLTIYEPVRYQAEGDAELSASLLFKKCVNTSLAK 896

Query: 70   RSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDG- 128
             +   +E    P   R   +R  +N+ GY  VFL    P+++  +S  E R   M + G 
Sbjct: 897  TAPEVSEDDAEPP--RFVPLRRCANVNGYGAVFLPNASPSFVLKSSHSEPRV--MGLQGL 952

Query: 129  PVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSY-DAPWPVRKVPLKCTPHFLAYHL 187
             V  ++ FH   C RGF+Y + +   R++ LP++ +  +    V+K+ L      ++YH 
Sbjct: 953  GVRGMSTFHTEGCDRGFIYVDMEGIARVTQLPSNANLTELGVSVKKIALDGDVGMISYHH 1012

Query: 188  ETKTYCIVTSTAE----PSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWE 243
             T TY +  +  E    P  D Y      KE     +++   PP +++  + L +P +W 
Sbjct: 1013 PTGTYVVGCTKLEQFELPRDDDYH-----KEWA---KETSNFPPTMARGILKLINPVTWT 1064

Query: 244  EIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDII 303
             I +    L   E +  +K + +E        +  +A+GT  +  ED+  RGR+ +FDI+
Sbjct: 1065 VIHE--LELEPCESIESMKTLHLEVSEETKERKMLVAVGTALSKGEDLPTRGRVQVFDIV 1122

Query: 304  EVVPEPGQPLTKNKIKMIYAKEQ--KGPVTAICHVA--GFLVTAVGQKIYIWQLK-DNDL 358
             V+PEPG+P T  ++K+I AKE+  +G VTA+  V   G ++ A GQK  +  LK D  L
Sbjct: 1123 TVIPEPGRPETNKRLKLI-AKEEIPRGGVTALSEVGAQGLMLVAQGQKCMVRGLKEDGSL 1181

Query: 359  TGIAFIDTEVYIASMVSV--KNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSK 416
              +AF+D   ++AS+  +    L ++ D  + +    Y  E  T  ++ +        S 
Sbjct: 1182 LPVAFLDMSCHVASVKELPGTGLCVMADVFKGLWFAGYTEEPYTFKILGK--------SS 1233

Query: 417  GYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNV 476
            G                                K+     D L +   +  +  D + ++
Sbjct: 1234 G--------------------------------KLPLLAADFLPDGEDLSMVAVDAEGDL 1261

Query: 477  VLFMYQPEARESNGGHRLIKKTDFHLGQHV--NTFFKIRCKPSSISDAPGARSRFLTWYA 534
             +  + PE  +S  GH L+ +T F +  +   +T    R    S   A  + S  +   A
Sbjct: 1262 HILEFNPEHPKSLQGHLLLHRTSFAVTPNTPSSTLLLPRTHSPSYPQASSSSSSHMLLLA 1321

Query: 535  SLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAG-----NP 589
               G L    PL E  YRRLL + N +       GGL+ +A R         G     + 
Sbjct: 1322 CPSGQLAALSPLAESTYRRLLSVTNQLHPAIVPHGGLHSKAHRYPDQSSVAVGVETAASS 1381

Query: 590  SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEAL 634
             R ++DG+++ ++ +L   +R ++  + G  ++ + D   D+E +
Sbjct: 1382 GRALVDGTVLARWSELGAAKRTDVALRGG--YDSVADLRDDLEGV 1424


>gi|50288865|ref|XP_446862.1| hypothetical protein [Candida glabrata CBS 138]
 gi|74609915|sp|Q6FSD2.1|CFT1_CANGA RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two protein
            1
 gi|49526171|emb|CAG59795.1| unnamed protein product [Candida glabrata]
          Length = 1361

 Score =  162 bits (411), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 151/574 (26%), Positives = 256/574 (44%), Gaps = 87/574 (15%)

Query: 81   PRGV----RISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPF 136
            P+GV    RI  M Y  N  GY  +F+ G  P  +            M  D  +  + PF
Sbjct: 841  PKGVSGIERI--MHYIPNFDGYSVIFVTGNTPYII------------MKEDDSLPRIFPF 886

Query: 137  HN---VNCPR----GFLYFNAKSELRI-SVLPTHLSYDAPWPVRKVPLKC------TPHF 182
             N   V+  R      +  +     RI S+   ++ Y    P+RK+ +        T + 
Sbjct: 887  GNIPIVSMSRWGEGSVICIDDIKNARIYSLNQDNIYYGNKLPIRKIKIGSMLQNYKTLNS 946

Query: 183  LAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFH--VSLFSPF 240
            + YH  T+ Y +V+ T E S   Y+   ED  L+   +      P    F   V L +P 
Sbjct: 947  IVYHERTQLY-LVSYTKEIS---YEAKAEDGSLLIGYKPEL---PNAKAFKSGVLLINPK 999

Query: 241  SWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLF 300
            SWE I + + P +    V  +K+  ++ +      R YI +G  Y   EDV   G   ++
Sbjct: 1000 SWEVIDELDLPDNSL--VNDMKSSFIQIDTRTKRKREYIIVGIGYATMEDVPPTGEFHIY 1057

Query: 301  DIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLK-DNDLT 359
            DI EVVPEPG+P T  K+K I+ ++ +G V+ +  ++G  + +  QKI +  ++ DN + 
Sbjct: 1058 DITEVVPEPGKPNTNFKLKEIFKEDIRGIVSVVNGISGRFLISQSQKIMVRDVQQDNSVI 1117

Query: 360  GIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYY 419
             +AF+D  V++ S+ +  NLI++GD  + I  + +  E                      
Sbjct: 1118 PVAFLDVPVFVTSLKTFGNLIVIGDAMQGIQFVGFDAE---------------------- 1155

Query: 420  AGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLF 479
               P R I  GS + KF  +S+               + L     + F+++D+D  + + 
Sbjct: 1156 ---PYRMITLGSSITKFEVISV---------------EFLVNNGDIYFLVTDRDSIMHVL 1197

Query: 480  MYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGA 539
             Y P+   +  G RL+  + F+L    N    +        D   +RS F T  A +DG+
Sbjct: 1198 KYAPDQPNTLSGQRLVHCSSFNLHSLNNCTMLLPKNDEFPRDQRYSRS-FQTITAQVDGS 1256

Query: 540  LGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLV 599
            +   +P+ E+ YRRL  +Q  ++       GLNPR  R    K Y+ G+  R ++D +++
Sbjct: 1257 ISKIVPVKEETYRRLYFIQQQIIDKEPQLAGLNPRMERQ-DNKYYHLGHSLRPMLDFNII 1315

Query: 600  WKFLQLSLGERLEICKKIGSKHN-DILDELYDIE 632
             +F  +S+  R  I +K+G   N ++  +L D+E
Sbjct: 1316 KRFKDMSMNRRSHIVQKLGKNSNLEVWRDLIDLE 1349


>gi|241954348|ref|XP_002419895.1| subunit of the mRNA cleavage and polyadenylation factor, putative
            [Candida dubliniensis CD36]
 gi|223643236|emb|CAX42110.1| subunit of the mRNA cleavage and polyadenylation factor, putative
            [Candida dubliniensis CD36]
          Length = 1420

 Score =  162 bits (410), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 136/593 (22%), Positives = 251/593 (42%), Gaps = 82/593 (13%)

Query: 59   FKKLKVLFVSDRSKRANEQPGLPRGVRISQ-MRYFSNIAGYQGVFLCGPHPAWLFLTSRG 117
            FKK K L ++     A      P G  I + + YF N+ G+  +F+ G  P  +  T   
Sbjct: 859  FKKEKDLTITGAPDNA-----FPYGTSIERRLVYFPNLNGFTSIFVTGVIPYLILKTIHS 913

Query: 118  ELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLK 177
              R    +    V +++ F +     G ++ + +   RI  LP   +Y+   P++ V + 
Sbjct: 914  IPRIFQFS-KIAVMSISAFSDSKIKNGLIFLDNQQNARICELPLDFNYEFNLPMKHVDIG 972

Query: 178  CTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLF 237
             +   +AYH  + T  + T    P   Y   + E K +    ++ +  P +  +  V L 
Sbjct: 973  ESIKSIAYHETSDTVVLSTFKQIP---YECLDEEGKPIAGIIKNIKDTPAISFKGSVKLV 1029

Query: 238  SPFSWEEIPQTNFPLHEWEHVLCLK----NVSMEYEGTL------------SGLRGYIAL 281
            SP++W  I   N  L + E  + +K    +V  E + T+               R YI +
Sbjct: 1030 SPYNWTVIE--NIELGDNEVGMTIKSMILDVGSESKSTVGTDPNSLIKKYNKKKREYIVI 1087

Query: 282  GTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLV 341
            G      ED+   G   +++II+++PEPG+P T +K K I+ ++ +G +T+IC ++G  +
Sbjct: 1088 GIGKYRMEDLAANGIFKIYEIIDIIPEPGKPETNHKFKEIFKEDTRGAITSICELSGRFL 1147

Query: 342  TAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTL 401
             + GQK+ +  L+D+    +AF+DT VY++   S  NL+++GD  +   L+ +  E   +
Sbjct: 1148 VSQGQKVIVRDLQDDGTVPVAFLDTPVYVSESKSFGNLVILGDPLKGCWLVGFDAEPFRM 1207

Query: 402  SLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDE 461
             ++ +D                                         + I  +  D +  
Sbjct: 1208 IMLGKD----------------------------------------TQHISVECADFIIN 1227

Query: 462  FSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFF-----KIRCKP 516
               +  +++D +  + L  Y P+  +S  G +L+ K  F L   ++         I  K 
Sbjct: 1228 DDEIFVLVADNNNVLHLLNYDPDDPQSINGTKLLTKASFELNSTISCLRSLPLKDIDEKV 1287

Query: 517  SSISDAPGA---------RSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSH 567
             + +DA  A         ++ F    ++ DG+     P+ E  YRR+ +LQ  ++    H
Sbjct: 1288 QNETDAAAAATIPLPNNTQNNFQVIGSTQDGSFFNVFPINEAAYRRMYILQQQLIDKEFH 1347

Query: 568  TGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 620
              GLNPR  R    K       ++ I+D  L+ +F +LS   +     K+  K
Sbjct: 1348 YCGLNPRLNRIGSIKLQNNETNTKPILDYDLIRRFTKLSDDRKRNFANKVSGK 1400


>gi|406699110|gb|EKD02327.1| cleavage and polyadenylation specific protein [Trichosporon asahii
            var. asahii CBS 8904]
          Length = 1339

 Score =  161 bits (408), Expect = 8e-37,   Method: Compositional matrix adjust.
 Identities = 156/630 (24%), Positives = 275/630 (43%), Gaps = 82/630 (13%)

Query: 17   VQELLTVSLGLHGNRP-LLLVRTQHELLIYQA--------FRHPKGALKLRFKKLKVLFV 67
            V ++L   +G    RP ++++     L IY+A            + +L +RF+K+    +
Sbjct: 773  VSQMLFCPIGTRTLRPHVIVLHRSGRLNIYEAQPRFTVDARDQSRRSLAVRFRKVHTQLL 832

Query: 68   SDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTID 127
            S       +   +P          F++I G  G F+ G  P W+  +      +HP+   
Sbjct: 833  SVTPSSTVKPAAIP----------FTDIEGLTGAFITGERPHWIISSD-----SHPIRAF 877

Query: 128  GPVSTLAPFHNV--NCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAY 185
            G       F         G  +   +    I  +P  L+ D   P  +  ++ T   +A+
Sbjct: 878  GLKQAAYAFCKTTHQGGHGEYFLRIEDGSFICYMPPTLNTDFAMPCDRYKMERTYTHVAF 937

Query: 186  HLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEI 245
               +  Y    + + P   Y     E+ E++  P     +PP   +  + LFS  S    
Sbjct: 938  DPPSCHYVAAAAMSVPFQAY----DEEGEILLGPEGPDLLPPKNERSSIELFSAGSEPFR 993

Query: 246  PQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEV 305
                +   + E VLC+++V++E   + +G R +IA+GT  N+ ED    G + +F+++EV
Sbjct: 994  VLDGYDFDQNEEVLCVESVTLESSSSPTGFRDFIAVGTGKNFGEDRATSGAVYVFEVVEV 1053

Query: 306  V-PEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLK-DNDLTGIAF 363
            V  +PG  ++  ++K       + PV+AI ++ G++V + G KI    L  D+ L G+AF
Sbjct: 1054 VGTKPG--VSNWRLKYRCKDPTRNPVSAIANINGYIVHSNGPKILAKGLDYDDRLMGLAF 1111

Query: 364  IDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNP 423
            +D  +Y+ S+   KNLILVGD+ +S+     Q        + RD                
Sbjct: 1112 LDVSMYVTSIRVFKNLILVGDFVKSLIFASLQENPYKFVTIGRD---------------- 1155

Query: 424  SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQP 483
                           LSL               D L     + F+ +D+  N+ L  + P
Sbjct: 1156 ------------LADLSL------------TAADFLVHEGQVTFITNDQHGNMRLVDFDP 1191

Query: 484  EARESNGGHRLIKKTDFHLGQHVNTFFKI-RCKPSSISDAPGARSRFLTWYASLDGALGF 542
               +S  G +L+ +T+F  G  V     I R K +    AP  +S+ +  YA+ DGA+  
Sbjct: 1192 ANPDSLNGEKLLTQTEFGTGCPVTASCMIARRKTAEEEFAP--QSQLI--YATADGAITS 1247

Query: 543  FLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNP-SRGIIDGSLVWK 601
             + + E  ++RL ++Q+ +V +  H  GLNPRAFRT +        P +RG++DG L+  
Sbjct: 1248 VVAVKEARFKRLQLVQDQLVRNAQHVAGLNPRAFRTVRND--LVPRPLARGVLDGGLLAH 1305

Query: 602  FLQLSLGERLEICKKIGSKHNDILDELYDI 631
            F    L  + E+ ++IG+    +  +LY +
Sbjct: 1306 FALQPLRRQREMMRQIGTDAVTVGSDLYTL 1335


>gi|401889164|gb|EJT53104.1| cleavage and polyadenylation specific protein [Trichosporon asahii
            var. asahii CBS 2479]
          Length = 1358

 Score =  161 bits (408), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 156/630 (24%), Positives = 275/630 (43%), Gaps = 82/630 (13%)

Query: 17   VQELLTVSLGLHGNRP-LLLVRTQHELLIYQA--------FRHPKGALKLRFKKLKVLFV 67
            V ++L   +G    RP ++++     L IY+A            + +L +RF+K+    +
Sbjct: 792  VSQMLFCPIGTRTLRPHVIVLHRSGRLNIYEAQPRFTVDARDQSRRSLAVRFRKVHTQLL 851

Query: 68   SDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTID 127
            S       +   +P          F++I G  G F+ G  P W+  +      +HP+   
Sbjct: 852  SVTPSSTVKPAAIP----------FTDIEGLTGAFITGERPHWIISSD-----SHPIRAF 896

Query: 128  GPVSTLAPFHNV--NCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAY 185
            G       F         G  +   +    I  +P  L+ D   P  +  ++ T   +A+
Sbjct: 897  GLKQAAYAFCKTTHQGGHGEYFLRIEDGSFICYMPPTLNTDFAMPCDRYKMERTYTHVAF 956

Query: 186  HLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEI 245
               +  Y    + + P   Y     E+ E++  P     +PP   +  + LFS  S    
Sbjct: 957  DPPSCHYVAAAAMSVPFQAY----DEEGEILLGPEGPDLLPPKNERSSIELFSAGSEPFR 1012

Query: 246  PQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEV 305
                +   + E VLC+++V++E   + +G R +IA+GT  N+ ED    G + +F+++EV
Sbjct: 1013 VLDGYDFDQNEEVLCVESVTLESSSSPTGFRDFIAVGTGKNFGEDRATSGAVYVFEVVEV 1072

Query: 306  V-PEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLK-DNDLTGIAF 363
            V  +PG  ++  ++K       + PV+AI ++ G++V + G KI    L  D+ L G+AF
Sbjct: 1073 VGTKPG--VSNWRLKYRCKDPTRNPVSAIANINGYIVHSNGPKILAKGLDYDDRLMGLAF 1130

Query: 364  IDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNP 423
            +D  +Y+ S+   KNLILVGD+ +S+     Q        + RD                
Sbjct: 1131 LDVSMYVTSIRVFKNLILVGDFVKSLIFASLQENPYKFVTIGRD---------------- 1174

Query: 424  SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQP 483
                           LSL               D L     + F+ +D+  N+ L  + P
Sbjct: 1175 ------------LADLSL------------TAADFLVHEGQVTFITNDQHGNMRLVDFDP 1210

Query: 484  EARESNGGHRLIKKTDFHLGQHVNTFFKI-RCKPSSISDAPGARSRFLTWYASLDGALGF 542
               +S  G +L+ +T+F  G  V     I R K +    AP  +S+ +  YA+ DGA+  
Sbjct: 1211 ANPDSLNGEKLLTQTEFGTGCPVTASCMIARRKTAEEEFAP--QSQLI--YATADGAITS 1266

Query: 543  FLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNP-SRGIIDGSLVWK 601
             + + E  ++RL ++Q+ +V +  H  GLNPRAFRT +        P +RG++DG L+  
Sbjct: 1267 VVAVKEARFKRLQLVQDQLVRNAQHVAGLNPRAFRTVRND--LVPRPLARGVLDGGLLAH 1324

Query: 602  FLQLSLGERLEICKKIGSKHNDILDELYDI 631
            F    L  + E+ ++IG+    +  +LY +
Sbjct: 1325 FALQPLRRQREMMRQIGTDAVTVGSDLYTL 1354


>gi|33411764|emb|CAD58787.1| cleavage and polyadenylation specificity factor 1 [Bos taurus]
          Length = 180

 Score =  161 bits (407), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 75/168 (44%), Positives = 113/168 (67%), Gaps = 7/168 (4%)

Query: 457 DILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKP 516
           D + + + +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF++  C+ 
Sbjct: 11  DFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRG 70

Query: 517 SSISDAPGARS-----RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGL 571
           ++  + P  +S     + +TW+A+LDG +G  LP+ EK YRRLLMLQN + T   H  GL
Sbjct: 71  AA--EGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGL 128

Query: 572 NPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGS 619
           NPRAFR          N  R ++DG L+ ++L LS  ER E+ KKIG+
Sbjct: 129 NPRAFRMLHVDRRVLQNAVRNVLDGELLNRYLYLSTMERGELAKKIGT 176


>gi|402219312|gb|EJT99386.1| hypothetical protein DACRYDRAFT_17537 [Dacryopinax sp. DJM-731 SS1]
          Length = 1620

 Score =  160 bits (404), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 152/630 (24%), Positives = 278/630 (44%), Gaps = 102/630 (16%)

Query: 33   LLLVRTQHELLIYQA----FRHPKGALKLRFKKLKVLFVSDRSKRANEQPGLPRGVRI-- 86
            LL+  T+  L +Y+A            + RF K+   F +D  + A  +  LP   R+  
Sbjct: 1064 LLVYYTEGRLAVYEATPRTATEADSTFQYRFTKVATHF-ADAEQHAAIRQMLPEARRLQL 1122

Query: 87   ----SQMRYFSNI-AGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGP-VSTLAPFHNVN 140
                + + + S+I  G   VF  G  P W+  + +  L+   +    P V +  P     
Sbjct: 1123 PSRRALIPFISDIKGGTSAVFQRGEEPCWIMASRQNGLQI--IAYSSPLVYSFTPTSLFG 1180

Query: 141  CPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETK-TYC------ 193
                F+ +  +        P  + +D   P     L C       H+ +K +Y       
Sbjct: 1181 NSGDFILYGEEG-------PVLMEFDEE-PDTGRELSC------RHIHSKRSYTSMAVDP 1226

Query: 194  ---IVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNF 250
               +V + +   + +  ++ E++ L   P  +    P+     + L SP + + +   +F
Sbjct: 1227 GSNLVAAASSLKSFFLLYDDEEQPLWV-PESTALFGPMAECSSLELVSPDTCQTLDGYDF 1285

Query: 251  PLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPG 310
              +E+ +V+  K+V++E   + SG + YIA+GT+    ED+  RG   +F++IEVV  P 
Sbjct: 1286 APNEFINVV--KSVNLETLSSESGFKDYIAVGTSTFRGEDLAVRGATYIFEVIEVVSYPD 1343

Query: 311  QPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQL-KDNDLTGIAFIDTEVY 369
             PL   ++K++   E K PV AIC + G+LV++ G K+++    +D  L G+AF+D  V 
Sbjct: 1344 DPLPPYRLKLLCRDEAKAPVNAICGLNGYLVSSQGFKVFVRAFEQDERLVGVAFMDAGVC 1403

Query: 370  IASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIID 429
            + S+  +KNL+L+GD  RS++ + +Q +   L       +PT      +           
Sbjct: 1404 VTSLTRLKNLLLIGDAKRSVSFVAFQEDPFKL-------RPTYVTDAAF----------- 1445

Query: 430  GSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESN 489
                                        + DE      + +D +  + LF + P    + 
Sbjct: 1446 ----------------------------LFDE-GDFSILAADDEGTLRLFEFDPNLTGAT 1476

Query: 490  GGHRLIKKTDFHLGQHVNTFF-----KIRCKPSSISDAPGARSRFLTWYASLDGALGFFL 544
             G+ LI +T+F+ GQ  +T       + R  P  +   P A+  F     ++DG LG   
Sbjct: 1477 HGNPLICETEFN-GQSEHTHILAIAGRGREDPEEMQ-IPEAQLIF----GTIDGTLGTIS 1530

Query: 545  PLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQ 604
            P+P++ ++RL +L   ++    H  GLNPRAFRT +     +   ++G++D  L+  F +
Sbjct: 1531 PVPDECFKRLQLLSGQLMRSVQHFAGLNPRAFRTVRND-LLSRPLNKGMLDYDLLHAFRE 1589

Query: 605  LSLGERLEICKKIGSKHNDILDELYDIEAL 634
            L +  +  I K+IG+    IL ++  +E +
Sbjct: 1590 LDIRRQATITKQIGTDTITILRDIRSLEEI 1619


>gi|255718033|ref|XP_002555297.1| KLTH0G05984p [Lachancea thermotolerans]
 gi|238936681|emb|CAR24860.1| KLTH0G05984p [Lachancea thermotolerans CBS 6340]
          Length = 1307

 Score =  159 bits (402), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 138/513 (26%), Positives = 231/513 (45%), Gaps = 64/513 (12%)

Query: 129  PVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHF------ 182
            P+ +LAP+         L  +     RI  L T  SY    PV++VP++   ++      
Sbjct: 843  PLVSLAPWGT----DSVLCVDDIKNARIVTLDTTFSYGNRLPVKRVPIEDPLNYYGCLNN 898

Query: 183  LAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVS-QFHVSLFSPFS 241
            +AYH  +  Y IV+ T E     Y+   E+ E      DS  +P     Q  V L +P S
Sbjct: 899  VAYHERSGMY-IVSYTKEIE---YEAISEEGEKTVGSDDS--VPHARGFQSGVLLLNPKS 952

Query: 242  WEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFD 301
            W  I + ++  +    +  +K + ++        R Y+ +G  +   ED+   G   L+D
Sbjct: 953  WNIIDKADYEKNSL--INDMKTMLIQTNSRTRRKREYLVVGNTFVRDEDIGTMGSFCLYD 1010

Query: 302  IIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLK-DNDLTG 360
            I EVVPEPG+P T  K+K I+ +E +G V+++C ++G  + +  QK+ +  ++ DN +  
Sbjct: 1011 ITEVVPEPGKPDTNYKLKQIFYEEFRGAVSSVCEISGRFLISQSQKVLVRDVQEDNSVVP 1070

Query: 361  IAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYA 420
            +AF+D  V++    S  NL+++GD  +    + +  E                       
Sbjct: 1071 VAFLDVPVFVTDSKSCGNLLIIGDAMQGFQFVGFDAE----------------------- 1107

Query: 421  GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFM 480
              P R I  G  V KF  +SL               + L    S+ F++SD+   + +  
Sbjct: 1108 --PYRMIPLGKSVSKFEVMSL---------------EFLVNNGSIYFLVSDRSNILHILK 1150

Query: 481  YQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGAL 540
            Y P+   S  G +L+  T F+L    NT  K+  K        G    F    A  DG+L
Sbjct: 1151 YAPDEPNSLSGQKLVHCTSFNL-HSTNTCMKLLLKNDEFP-TLGEPPAFQAIGAQTDGSL 1208

Query: 541  GFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVW 600
               +PL E +YRRL M+Q  ++    H  GLNP+  R  +   Y  G+  R ++D +++ 
Sbjct: 1209 FNVVPLSESSYRRLYMVQQQLIEKDVHLCGLNPKMER-LQNDFYQLGHLMRPMLDFTVIK 1267

Query: 601  KFLQLSLGERLEICKKIGSKHN-DILDELYDIE 632
             F  L L +R +I  K G + + +I  +L ++E
Sbjct: 1268 SFATLPLNKRKQIAAKAGRQADFEIWRDLINVE 1300


>gi|422295485|gb|EKU22784.1| cleavage and polyadenylation specificity factor subunit 1
           [Nannochloropsis gaditana CCMP526]
          Length = 395

 Score =  158 bits (399), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 102/363 (28%), Positives = 170/363 (46%), Gaps = 45/363 (12%)

Query: 278 YIALGTNYNYS--EDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICH 335
           Y+A+GT    +  EDV  +GR+L++ I  + P  G         +    ++ GP TAI  
Sbjct: 70  YLAVGTCTVRAKGEDVPSKGRLLMYRI-SLDPYAGLTSPPTLTLVDQYSQRSGPPTAIAQ 128

Query: 336 VAGFLVTAVGQKIYIWQLKDND-LTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
           +   ++ A G  ++++     + L  IAF D + Y+ S+  VK L+ V D   S+ LLR+
Sbjct: 129 LGPHIIIAAGPTLWVYAFSAREKLKPIAFYDADFYVVSLRVVKTLVAVTDAYHSVHLLRW 188

Query: 395 QPE--YRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 452
                  TL L+ +DY P                                    +  + G
Sbjct: 189 HEHDPAHTLELMGKDYSPI-----------------------------------VSAQPG 213

Query: 453 SKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 512
             H   + +  S+G ++ D   N+ L  Y P   ES GG+RL+++ DFHL   ++     
Sbjct: 214 GSH--FVVDPPSLGMLVGDSRGNLQLLQYDPADVESRGGNRLVRRADFHLSHRLSFLQHT 271

Query: 513 RCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 572
           R        A  A  R +  + S++G +G  +P+ EK YRRL  LQ VMV    H G  N
Sbjct: 272 RMAEVPRPGAYRAGVRVMV-FGSVEGGVGALVPVEEKVYRRLYALQAVMVNALPHVGAFN 330

Query: 573 PRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIE 632
           PR FR  + +G+  G   +G +DG L+W+F  LS+G++ ++   IG+    +L+ L +++
Sbjct: 331 PRGFRLVEARGWAQGR-KKGTLDGELLWRFAGLSVGKQEDLASAIGTSREMVLESLLEVD 389

Query: 633 ALS 635
            ++
Sbjct: 390 MMT 392


>gi|353234640|emb|CCA66663.1| related to cleavage and polyadenylation specificity factor, 160 kDa
            subunit [Piriformospora indica DSM 11827]
          Length = 1324

 Score =  156 bits (395), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 151/644 (23%), Positives = 278/644 (43%), Gaps = 81/644 (12%)

Query: 15   TIVQELLTVSLGLHGNRPLLLVRTQHELLIY--------------QAFRHPKGALKLRFK 60
            T +Q+++   LG     P L+V     LLI               Q  R    +L++ F 
Sbjct: 734  TTLQQVIITDLGEIEPSPHLIVLYDSNLLIVYQMVPLEPDKAGLPQLDRRSVPSLRISFV 793

Query: 61   KLKVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQG-----VFLCGPHPAWLFLTS 115
            K  V  +++ +   N+  G     R+ +     ++  ++G      F+ G +PAW+   +
Sbjct: 794  KRMVHHLANPTPDENQTSGGSNEKRLPKTIVPFSVLDWEGNSIYGAFVTGDNPAWILSKN 853

Query: 116  RGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVP 175
               L   P   +  V +  P    +    FL    +    +   P  +++   +P  K  
Sbjct: 854  HSGLLHLPCGYEA-VHSFTPCSMWDFSPTFLMSTEEGSCLVQWTP-GITFHGQYPCSKTR 911

Query: 176  LKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVS 235
               T   +AY   + T  ++ + +    D+  F+ E      +P       P +    + 
Sbjct: 912  KGRTQTNIAY---SNTTGLLVAASSNDRDFLLFDEEGTN-SWEPDGVNVSLPKLGASALE 967

Query: 236  LFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRG 295
            L  P +W  I    F  +E  +++  ++V +E   T +G + +IA+GT+ +  ED+  RG
Sbjct: 968  LLDPETWVTIDGYEFAANEVVNIV--ESVKLETLSTQTGNKEFIAVGTSIHRGEDLAVRG 1025

Query: 296  RILLFDIIEVVPEPGQ-PLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLK 354
               +F+I EV+ +  +    ++++K++   E KGPVTA+C + G+LV+++GQKI++    
Sbjct: 1026 GTYIFEIAEVIQDTEERGRRRHRLKLLCKDEAKGPVTAVCGMNGYLVSSMGQKIFVRAFD 1085

Query: 355  -DNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQ- 412
             D  L G+AF+D  VY+ S+  +KNL+++ D  + +  + +Q +   L +++++ +PT  
Sbjct: 1086 LDERLVGVAFLDAGVYVTSIRCLKNLLVITDAIKGVWFVAFQEDPFKLVILSKEVRPTSI 1145

Query: 413  PNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDK 472
            P    ++A                                  HND       M  +  D 
Sbjct: 1146 PQGDFFFA----------------------------------HND-------MELLTIDL 1164

Query: 473  DKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC---KPSSISDAPGARSRF 529
               + L  Y P   ++  G RL+   +F    HV     +R    +PSS S +  +R   
Sbjct: 1165 RGVLRLHSYDPTHVDTEEGARLLCSVEFQ--THVEPVTIVRVAMEQPSSDSASDASR--- 1219

Query: 530  LTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNP 589
                  +DG+L    PL    ++RL +LQ  +V HT H   LNP+A+R  +G        
Sbjct: 1220 -LLIPRVDGSLASLSPLDMDIFKRLYLLQAQLVRHTHHIAALNPKAYRAVQGSS-TTRTM 1277

Query: 590  SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEA 633
            SR ++D  L+  F +LS   +  I  +IG     ++ +   +EA
Sbjct: 1278 SRRMLDFGLLVGFKKLSFDRQQGIANQIGETWETLIRDCTQLEA 1321


>gi|452979579|gb|EME79341.1| hypothetical protein MYCFIDRAFT_104419, partial [Pseudocercospora
            fijiensis CIRAD86]
          Length = 1342

 Score =  154 bits (390), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 154/642 (23%), Positives = 265/642 (41%), Gaps = 106/642 (16%)

Query: 19   ELLTVSLGLHG-NRPLLLVRT-QHELLIYQAFRHPKGALK------LRFKKLKVLFVSDR 70
            E+L   LG HG  +P L++RT   ++++Y+ F +P+ + +      LRF+K+    +   
Sbjct: 748  EVLVSDLGQHGVTQPYLVLRTAMDDVVLYEPFHYPQTSGRKSWHQDLRFRKVPFSHIPKY 807

Query: 71   SKR-ANEQPGLPRGVRISQMRYFSNIA---GYQGVFLCGPH--PAWLFLTSRGELRAHPM 124
            S+  A  Q   P  ++  ++  +S IA       + L  P   P  L +    EL     
Sbjct: 808  SESIAESQSARPPPLKSVKIDTYSAIAIPGAPPCLLLKEPSTLPKVLEIRQSAELNR--- 864

Query: 125  TIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHF-- 182
                 +S L P + V C  GF   NA  EL    LP +  Y   W V +VP+        
Sbjct: 865  -----LSMLCPINRVGCENGFFMINADEELEEQQLPLNTWYGTGWSVHQVPIGHPNQIED 919

Query: 183  ---LAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSP 239
               +AYH E   Y + T       D+Y F  ED       +D   + P V Q++V L S 
Sbjct: 920  VRRIAYHEERGLYVVATCR---EVDFY-FAEEDGR--HPEQDDITLRPKVPQYNVHLISA 973

Query: 240  FSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILL 299
             S   I   + P      +  L+ + +E        +  + +       ED+  +G + +
Sbjct: 974  ISHHIIDTVHMPY--LAAITDLQVMMLEASENTHEQKPLVVVSAAAQRGEDMPAKGTLYV 1031

Query: 300  FDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAIC--HVAGFLVTAVGQKIYIWQLK-DN 356
            +DII+VVP+P    +  K+  +  +E +G +TA+      GF+ TA G K+ I  +K D 
Sbjct: 1032 YDIIDVVPDPDIAESGVKLHQLAREENRGAITALAGPFPGGFIGTAQGLKVMIRGMKEDG 1091

Query: 357  DLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSK 416
                +AF+D + Y   +                                     T P   
Sbjct: 1092 SCLPVAFLDAQSYTHVL------------------------------------KTLPGRG 1115

Query: 417  GYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILD-EF----SSMGFMISD 471
             + AG+  +G+  G    +        R+ +  K    H +++  EF     ++  ++ D
Sbjct: 1116 MWLAGDAWKGLWFGGFTEEPY------RVTVLGKAPKMHMEVMSAEFLPFDGALYIVVLD 1169

Query: 472  KDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFL- 530
             D ++ +  Y PE  +S  G RL+ ++ FH+G        +   PS+++     +   + 
Sbjct: 1170 ADCDMHVLQYDPENPKSLNGMRLLHRSTFHIGHFTTNSMLL---PSTLASFAAQQHEMMN 1226

Query: 531  --------------TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAF 576
                             +S  GA+G   PL E+ YRRL  LQ  + +   H  GLNPRA+
Sbjct: 1227 GGSKAEVKPDPLQHVLTSSTSGAIGLITPLDEQAYRRLSALQTHLTSILEHAAGLNPRAY 1286

Query: 577  RTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 618
            R+ + + +     +RG++DG LV +  +L    R ++  + G
Sbjct: 1287 RSIESESFGG---ARGVVDGLLVRRIHELGAARRADVLGRAG 1325


>gi|378734083|gb|EHY60542.1| histone H2A [Exophiala dermatitidis NIH/UT8656]
          Length = 1361

 Score =  153 bits (387), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 157/586 (26%), Positives = 261/586 (44%), Gaps = 84/586 (14%)

Query: 19   ELLTVSLGLHGNR-PLLLVRT-QHELLIYQAFRHPKGALKLRFKKLKVLFVSDRSKRANE 76
            E+L   LG   +R P L+VR    +++IY++F  P      RFKK  V   +       E
Sbjct: 775  EVLLADLGNSTDRQPYLVVRNLVGDVIIYESFAMPDVLGSFRFKK--VFTKAAGELEDGE 832

Query: 77   QPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPF 136
            + G P  ++   M+  +N+AG+  VF+ G  P  +   +    R + +     + ++   
Sbjct: 833  EVGQPSTLQ--PMQAVTNVAGHASVFIPGRQPLLIMREASTMPRVYELN-PTKLKSMNSV 889

Query: 137  HNVNCPRGFLYFNAKSELRISVLPTHLSYD-APWPVRKVPLKCTPHFLAYHLETKTYCIV 195
            H   C +G +  +A  E++   +P       + W +R+VPL      +AY   T +Y + 
Sbjct: 890  HTGTCRQGLVLVDADDEIKFCNIPDSTVLGLSDWVIRRVPLGQDITSVAYFAPTDSYILA 949

Query: 196  TS-TAE---PSTDYY--KFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTN 249
            T+ T E   P  D +  ++ GE          ++F+P  + Q  + L S  +   I Q +
Sbjct: 950  TNHTTEFQLPQDDEWHPEWQGEA---------TKFLPSSI-QSSLKLLSAKTHSIISQYS 999

Query: 250  FPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEP 309
            F     E VLCL+++++E        +  I +GT     E+VT RG + +FD+++VVPEP
Sbjct: 1000 F--DACERVLCLESLNLEVSEETHERKDLIVVGTAIVKGENVTTRGNLYIFDVVDVVPEP 1057

Query: 310  GQPLTKNKIKMIYAKEQKGPVTAICHVA--GFLVTAVGQKIYIWQLK-DNDLTGIAFIDT 366
             +P +  KIK+I  ++ +G V+A+C +   GFL+ A GQK  +  LK D  +  +AF+D 
Sbjct: 1058 DRPESDLKIKLITKEDVRGAVSALCDIGSQGFLLAAQGQKSMVRGLKEDMSILPVAFLDM 1117

Query: 367  E--VYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPS 424
               V++A  +    L ++GD    + L+ Y  E   L ++ RD +            +P 
Sbjct: 1118 RYYVHVARELPGTGLCILGDAFSGLWLVGYSEEPYKLQILGRDLE------------DPP 1165

Query: 425  RGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPE 484
                   L  +F  L  G++L I                      SD D  + +  Y PE
Sbjct: 1166 ------VLAAEF--LPDGKQLYIIS--------------------SDDDGLLRVLQYDPE 1197

Query: 485  ARESNGGHRLIKKTDFHLGQHVNTFFKI-------RCKPSSI-----SDAPGARSRFLTW 532
              ++  G +L+ ++ FH G        +       R +   I     S A  A  R    
Sbjct: 1198 NPKAERGTKLLLRSTFHSGAAPTKMILLPPQVASGRGRDPEIDMDVDSGAGPAAGRHRIL 1257

Query: 533  YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTS-HTGGLNPRAFR 577
              + +G+L    PL E  YRRL  LQ  ++T    H   LNPRA+R
Sbjct: 1258 VTTQEGSLCMLTPLSEATYRRLSALQTTLLTTLDFHPCSLNPRAYR 1303


>gi|302652143|ref|XP_003017931.1| hypothetical protein TRV_08063 [Trichophyton verrucosum HKI 0517]
 gi|291181517|gb|EFE37286.1| hypothetical protein TRV_08063 [Trichophyton verrucosum HKI 0517]
          Length = 429

 Score =  152 bits (384), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 122/456 (26%), Positives = 206/456 (45%), Gaps = 70/456 (15%)

Query: 182 FLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDS--RFIPPLVSQFHVSLFSP 239
           F  Y L  KT   +   A+     +K   ED E  T+ R+    F+P L  +  V L  P
Sbjct: 7   FCLYDLPNKTDNTLDRIAKED---FKLP-EDDESHTEWRNEFITFLPQL-ERGTVKLLEP 61

Query: 240 FSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILL 299
            +W  I   +  L   E + C++ + +E        +  + +G++    ED+  +G I +
Sbjct: 62  RNWSTI--DSHELEPAERITCIEVIRLEISELTHERKDMVVVGSSIVKGEDIVPKGFIRV 119

Query: 300 FDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA--GFLVTAVGQKIYIWQLK-DN 356
           F++I+VVPEP QP    K+K+   +E KG VTA+  +   GFL+ A GQK  +  LK D 
Sbjct: 120 FEVIDVVPEPDQPEKSKKLKLFAKEEVKGAVTALSGIGGQGFLIVAQGQKCMVRGLKEDG 179

Query: 357 DLTGIAFIDTEVYIASMVSVK--NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPN 414
            L  +AF DT+ Y+  +  +K   + ++GD  + +  + Y  E   L L  ++       
Sbjct: 180 SLLPVAFKDTQCYVNVLKELKGTGMCIIGDAFKGLWFIGYSEEPYKLDLFGKE------- 232

Query: 415 SKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDK 474
                  N +  ++D                           D L + + +  +++D D 
Sbjct: 233 -------NENLAVVDA--------------------------DFLPDGNKLYILVADDDC 259

Query: 475 NVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI---RCKPSSISDA--------P 523
           N+ +  Y PE   S+ G RL+ ++ FH G   +T   +      PSS  D         P
Sbjct: 260 NLHVLQYDPEDPSSSKGDRLLHRSVFHTGHFASTMTLLPHGARTPSSPVDEDAMDTDSPP 319

Query: 524 GARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKG 583
            ++ + L  + +  G++    PL E +YRRLL LQ+ +V    H   LNPR +R  +  G
Sbjct: 320 PSKYQILMTFQT--GSVAVITPLGEDSYRRLLALQSQLVNALEHPCSLNPRGYRAVESDG 377

Query: 584 YYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGS 619
                  RG+IDG+L+ ++L +    + EI  ++G+
Sbjct: 378 MGG---QRGMIDGNLLLRWLDMGAQRKAEIAGRVGA 410


>gi|302403950|ref|XP_002999813.1| cft-1 [Verticillium albo-atrum VaMs.102]
 gi|261361315|gb|EEY23743.1| cft-1 [Verticillium albo-atrum VaMs.102]
          Length = 1349

 Score =  152 bits (383), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 158/656 (24%), Positives = 268/656 (40%), Gaps = 131/656 (19%)

Query: 5    RSHSPSAMDETIVQEL-LTVSLGLHGNRPLLLVRTQHELLIYQAFR-----HPKG-ALKL 57
            R  SP  + E +V +L  + S   H    L+L     ++ IY+ FR       KG A  L
Sbjct: 787  RGTSPETLTEILVADLGDSTSASAH----LILRHANDDMTIYEPFRIGGQEERKGLATSL 842

Query: 58   RFKKLKVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRG 117
             FKK+    ++     A E   +    R+  +R   NI GY  VF+ G  P+++  +S+ 
Sbjct: 843  FFKKVSNSHLAKSPVEAAEDEAVQEN-RVIPLRACDNIGGYSTVFVPGASPSFILKSSKS 901

Query: 118  ELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLK 177
              +   +   G V+ ++ FH   C RGF+Y ++K   R++  P     DA          
Sbjct: 902  TPKVIGLQGLG-VNGMSSFHTEGCERGFIYADSKGCARVTQFP-----DA---------- 945

Query: 178  CTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLF 237
                               + AE   D    +   KE     ++   +PP+     + L+
Sbjct: 946  ------------------ANVAELGVD----DDYHKEWA---KEECPMPPMKEHGSIKLY 980

Query: 238  SPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRI 297
            SP +W  I +  F L ++E  +C+K + +E        R   A+GT     ED+  RGRI
Sbjct: 981  SPITWNVIDE--FELEQYEVAMCMKTLLLEVSEETKERRMLFAVGTAILRGEDLPVRGRI 1038

Query: 298  LLFDIIEVVPEPGQPLTKNKIKMIYAKEQ--KGPVTAICHVAGFLVTAVGQKIYIWQLKD 355
            L+FD++ V+P+P +P T  K+K+I AKE+  +G VT++C           +K  +  L+ 
Sbjct: 1039 LVFDVVHVIPQPDRPETDRKLKLI-AKEEIPRGAVTSLC-----------EKCMVRGLRR 1086

Query: 356  NDLTGIAFIDTEVYIASMVSVKNL--ILVGDYARSIALLRYQPEYRTLSLVARDYKPTQP 413
                 +A  D   Y+ ++  ++N    L+ D    +  + Y  E   ++L  +       
Sbjct: 1087 WHAAAVALPDLSTYVVAVHELRNTGYCLMADANMGVWFVGYSEEPYRMTLFGKS------ 1140

Query: 414  NSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKD 473
                                        G +L+          D L   + +  + SD+D
Sbjct: 1141 ----------------------------GTQLKCLTA------DFLVAGNDLSIVASDED 1166

Query: 474  KNVVLFMYQPEARESNGGHRLIKKTDFHLGQH----VNTFFKIRCKP-----SSISDAPG 524
              + +  + PE   S  GH L+ +  F +  +         +   +P        ++A G
Sbjct: 1167 GVLHILQFDPEHPRSLQGHLLLNRASFSVAPNHAWVTLALPRTTTRPYLPQSEPATNAAG 1226

Query: 525  ARSRFLT-WYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKG 583
            +++R  T   AS  GA+    P+ E  YRRL  L   +     H  G+NP+A R     G
Sbjct: 1227 SQNRTQTLLLASASGAIASLNPITEHAYRRLTSLTTSLANALPHAAGMNPKAHRLPPQDG 1286

Query: 584  YYAGNP-------SRGIIDGSLVWKFLQLSLGERLEICKKIG-SKHNDILDELYDI 631
              A  P        R I+DG+L+ ++ +L   +R E   K G +   D+  EL D+
Sbjct: 1287 --AARPPAVDVSAGRTIVDGALLARWNELGARQRAEAAGKGGFASAADVRGELEDV 1340


>gi|71654693|ref|XP_815961.1| cleavage and polyadenylation specificity factor [Trypanosoma cruzi
            strain CL Brener]
 gi|50363265|gb|AAT75335.1| cleavage polyadenylation specificity factor CPSF160 [Trypanosoma
            cruzi]
 gi|70881056|gb|EAN94110.1| cleavage and polyadenylation specificity factor, putative
            [Trypanosoma cruzi]
          Length = 1436

 Score =  151 bits (382), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 151/637 (23%), Positives = 260/637 (40%), Gaps = 103/637 (16%)

Query: 32   PLLLVRTQHELLIYQAFRHPKGALKLRFKKLKVLFVSDRSKRANEQPGLPRGVRISQMRY 91
            PL L ++ H  L ++A R    +++++ K+L+     +R    N+   + +  R  ++  
Sbjct: 861  PLRLKKSMHHFLDHKAEREVIESIEMKRKRLQ----RERGVVENDTQLMRQYSR--RIVP 914

Query: 92   FSNIAGYQGVFLCGPHPAWLFLTSRG-ELRAHPMTIDGPVSTLAPFHNVN-----CPRGF 145
            F  I G  G ++CG HP +LF   R  EL A+     GPV    PF  +N     C  GF
Sbjct: 915  FDAIGGNTGAYVCGQHPLFLFWDRRTRELEAYRHQTLGPVRGFVPFRIINSGYIYCCEGF 974

Query: 146  LYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEP---- 201
            + F   + +     PT       W  R++ L  TPHF+ YH   ++  +VTS  EP    
Sbjct: 975  VDF---ASMDTYCRPT----GQGWLTRRIHLGVTPHFVVYHPPARSCFVVTSKKEPFRPQ 1027

Query: 202  --------STDYYKFNGEDKELVTDPRDSRFIP---------PLVSQFHVSLFSPFSWEE 244
                    +  Y + +G  + + T+   S   P         P+  +F + L S   W  
Sbjct: 1028 RAPFDVQLNIVYDEESGGVQSITTEAPVSNMPPIAPNAGIRVPMADRFEIRLMSTTDWA- 1086

Query: 245  IPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRG--YIALGTNYNYSEDVTCRGRILLFDI 302
                   L E E VL  + + ++ E    GL       + T +   ED+TCRGRILL   
Sbjct: 1087 -CTDTLLLEENERVLGAQMMEIQCERDAEGLHTAPVCVVSTAFPLGEDITCRGRILLLAT 1145

Query: 303  IEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQL--KDNDLTG 360
            I           K KI + +++   GP TA+  +   +  AVG  I +++    +  L  
Sbjct: 1146 I-------CTKKKRKIVLFHSEPLNGPATAVVGIRHHIAVAVGGTIKLFRFDWSNRKLVV 1198

Query: 361  IAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYA 420
             A +    Y+  M S +N ++ GD +RS A+ R+  E  TLS++ +D             
Sbjct: 1199 GALLYAGTYVTRMSSFRNYLIYGDLSRSCAIARFNEENHTLSVLGKDR------------ 1246

Query: 421  GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFM 480
                                          +   H D++    + G + SD ++N+++  
Sbjct: 1247 ----------------------------NAVSVVHCDMMYHDRAFGLLCSDDERNLLVMG 1278

Query: 481  YQPEARESNGG--HRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDG 538
            Y P  +E+  G  +++++      G++        C   S+     A +  +T Y +  G
Sbjct: 1279 YTPRVQETEAGSPNKVLESVLSLDGEY---RLSGGCLVKSLRFRSLAGNSSVTLYVTNYG 1335

Query: 539  ALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTY-KGKGYYAGNPSRGIIDGS 597
             +GF +P+ E+  R    L   +     H+ GL PR F    +G    A      ++  S
Sbjct: 1336 EIGFIVPIGEQANRTASWLMRRLQIDLPHSAGLTPRMFLGLSQGSPRTAMRAKEMLVSAS 1395

Query: 598  LVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEAL 634
            L+ +F  L +  R    K I S     L+ + ++ +L
Sbjct: 1396 LLNEFFFLDIHSR----KTIASAAYTQLERVTNVASL 1428


>gi|401841121|gb|EJT43641.1| CFT1-like protein [Saccharomyces kudriavzevii IFO 1802]
          Length = 1355

 Score =  150 bits (379), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 131/542 (24%), Positives = 231/542 (42%), Gaps = 72/542 (13%)

Query: 89   MRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYF 148
            M YF +  GY  +F+ G  P  +        +      + P+ ++ P++     R  +  
Sbjct: 849  MHYFPDYNGYSVIFVTGSVPYIIIKEDDTTPKIFKFA-NIPLVSVTPWN----ERSVMCV 903

Query: 149  NAKSELRISVLP-THLSYDAPWPVRKVPLKC------TPHFLAYHLETKTYCIVTSTAEP 201
            +     R+  L   ++ Y   +P++++ +        T   + YH + + + +      P
Sbjct: 904  DDIKNARVYTLTINNMYYGNKFPLKQIKISNVLDDYKTLQKIVYHEKAQLFLVSYCKRIP 963

Query: 202  STDYYKFNGEDKELVTDPRDSRFIPPLVSQFH--VSLFSPFSWEEIPQTNFPLHEWEHVL 259
                Y+  GED E VT   +     P    F   + L +P SW+ I + +FP +    V 
Sbjct: 964  ----YEALGEDGEKVTGYDEK---APHAEGFQGGILLINPKSWKVIDKIDFPKNSV--VN 1014

Query: 260  CLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIK 319
             +++  ++        R YI  G     +ED    G   ++D+IEVVPEPG+P T  K+K
Sbjct: 1015 EMRSSMIQINSKTKRKREYIVAGVANATTEDTPPTGSFYIYDVIEVVPEPGKPDTNYKLK 1074

Query: 320  MIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLK-DNDLTGIAFIDTEVYIASMVSVKN 378
             I+ +E  G V+ +C ++G  + +  QK+ +  ++ DN +  +AF+D  V++    S  N
Sbjct: 1075 EIFQEEVNGTVSTVCEISGRFMISQSQKVLVRDIQEDNSVIPVAFLDIPVFVTDSKSFGN 1134

Query: 379  LILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQ 438
            L+++GD  +    + +  E                         P R I+ G  V KF  
Sbjct: 1135 LLIIGDAMQGFQFIGFDAE-------------------------PYRMILLGRSVSKFQT 1169

Query: 439  LSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKT 498
            +SL               + L     M F  +D D+NV +  Y P+   S  G RL+  +
Sbjct: 1170 MSL---------------EFLVNGGDMYFAATDADRNVHILKYAPDEPNSLSGQRLVHCS 1214

Query: 499  DFHLGQHVNTFFKIRCKPSSI--SDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLM 556
             F +   +N+   +  K      S  P     F      +DG++   +PL E+ YRRL +
Sbjct: 1215 SFTV-HSINSCMMLLPKNQEFGSSQVPS----FQNVGGQVDGSVFKIVPLSEETYRRLYL 1269

Query: 557  LQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKK 616
            +Q  ++      GGLNPR  R      Y  G+  R ++D +++ +F  LS+  R    +K
Sbjct: 1270 IQQQIIDRELQLGGLNPRMER-LANDFYQMGHSMRPMLDFNVIRRFSGLSIDRRKNTAQK 1328

Query: 617  IG 618
             G
Sbjct: 1329 AG 1330


>gi|407850337|gb|EKG04765.1| cleavage and polyadenylation specificity factor, putative
            [Trypanosoma cruzi]
          Length = 1436

 Score =  149 bits (376), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 150/637 (23%), Positives = 259/637 (40%), Gaps = 103/637 (16%)

Query: 32   PLLLVRTQHELLIYQAFRHPKGALKLRFKKLKVLFVSDRSKRANEQPGLPRGVRISQMRY 91
            PL L ++ H  L ++A R    +++++ K+L+     +R    N+   + +  R  ++  
Sbjct: 861  PLRLKKSMHHFLDHKAEREVIESIEMKRKRLQ----RERGVVENDTQLMRQYSR--RIVP 914

Query: 92   FSNIAGYQGVFLCGPHPAWLFLTSRG-ELRAHPMTIDGPVSTLAPFHNVN-----CPRGF 145
            F  I G  G ++CG HP +LF   R  EL A+     GPV    PF  +N     C  GF
Sbjct: 915  FDAIGGNAGAYVCGQHPLFLFWDRRTRELEAYRHQTLGPVRGFVPFRIINSGYIYCCEGF 974

Query: 146  LYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEP---- 201
            + F   + +     PT       W  R++ L  TPHF+ YH   ++  +VTS  EP    
Sbjct: 975  VDF---ASMDTYCRPT----GQGWLTRRIHLGVTPHFVVYHPPARSCFVVTSKKEPFRPQ 1027

Query: 202  --------STDYYKFNGEDKELVTDPRDSRFIP---------PLVSQFHVSLFSPFSWEE 244
                       Y + +G  + + T+       P         P+  +F + L S   W  
Sbjct: 1028 RSPFDVQLKIVYDEESGGVQSITTEAPVCNMPPIAPNAGIRVPMADRFEIRLMSTTDWA- 1086

Query: 245  IPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRG--YIALGTNYNYSEDVTCRGRILLFDI 302
                   L E E VL  + + ++ E    GL       + T +   ED+TCRGRILL   
Sbjct: 1087 -CTDTLLLEENERVLGAQMMEIQCEKDAEGLHTAPVCVVSTAFPLGEDITCRGRILLLAT 1145

Query: 303  IEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDND--LTG 360
            +           K KI + +++   GP TA+  +   +  AVG  I +++   N+  L  
Sbjct: 1146 M-------CTKKKRKIVLFHSEPLNGPATAVVGIRHHIAVAVGGTIKLFRFDWNNRKLVV 1198

Query: 361  IAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYA 420
             A +    Y+  M S +N ++ GD +RS A+ R+  E  TLS++ +D             
Sbjct: 1199 GALLYAGTYVTRMSSFRNYLIYGDLSRSCAIARFNEENHTLSVLGKDR------------ 1246

Query: 421  GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFM 480
                                          +   H D++    + G + SD ++N+++  
Sbjct: 1247 ----------------------------NAVSVVHCDMMYHDRAFGLLCSDDERNLLVMG 1278

Query: 481  YQPEARESNGG--HRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDG 538
            Y P  +E+  G  +++++      G++        C   S+     A +  +T Y +  G
Sbjct: 1279 YTPRVQETEAGSPNKVLESVLSLDGEY---RLSGGCLVKSLRFRSLAGNSSVTLYVTNYG 1335

Query: 539  ALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTY-KGKGYYAGNPSRGIIDGS 597
             +GF +P+ E+  R    L   +     H+ GL PR F    +G    A      ++  S
Sbjct: 1336 EIGFIVPIGEQANRTASWLMRRLQIDLPHSAGLTPRMFLGLSQGSPRTAMRAKEMLVSAS 1395

Query: 598  LVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEAL 634
            L+ +F  L +  R    K I S     L+ + ++ +L
Sbjct: 1396 LLNEFFFLDIHSR----KTIASAAYTQLERVTNVASL 1428


>gi|398397855|ref|XP_003852385.1| hypothetical protein MYCGRDRAFT_100364 [Zymoseptoria tritici IPO323]
 gi|339472266|gb|EGP87361.1| hypothetical protein MYCGRDRAFT_100364 [Zymoseptoria tritici IPO323]
          Length = 1333

 Score =  149 bits (376), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 156/653 (23%), Positives = 264/653 (40%), Gaps = 98/653 (15%)

Query: 6    SHSPSAMDETIVQELLTVSLGLHGN-RPLLLVRT-QHELLIYQAFRHPKGAL------KL 57
            SH    + ET+  ELL   LG  G  +P L VRT   ++++Y+ F     A        L
Sbjct: 718  SHRRMGVKETLT-ELLVADLGNDGVLQPYLTVRTAMDDVVLYEPFHSSPSASTGPWHSNL 776

Query: 58   RFKKLKVLFVSDRSKRANEQPGL-PRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSR 116
            RF+K+ V ++   +    E P   P  +R  Q      I GY  V + G     L   + 
Sbjct: 777  RFRKVPVPYIPKYNDSPLEDPNARPPALRRMQ------IGGYNTVSIPGAPSCLLLKEAS 830

Query: 117  GE---LRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRK 173
            G    L  +        + L P + + C  GF   +    L    LP    +   W +R+
Sbjct: 831  GPPKILEVNEPKRSNATTILTPLNRIGCENGFATVDVNGALHECQLPPDAWFSTGWSIRQ 890

Query: 174  VPLKCTPH---FLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVS 230
            + L         LAYH     +   T T   + D+Y F  ED       +D   I P V 
Sbjct: 891  IDLGDDAREVRHLAYHEARGIFVAATCT---TVDFY-FAEEDGR--HPEQDDISIRPQVP 944

Query: 231  QFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSED 290
            Q+ V L S  + + I     P    E V  LK +  E       ++  + + T     ED
Sbjct: 945  QYSVHLISAKTHKIIHTHKLPY--LETVTALKVMPAEVSELSHEVKPVVVVSTGAQRGED 1002

Query: 291  VTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLV-TAVGQKIY 349
            +  +G +++FD+I+VVP+P    +   + ++  +E +G +TA+    G ++ TA G K+ 
Sbjct: 1003 MPAKGALIVFDVIDVVPDPDVEESGLHLHVLAREESRGAITALASFPGGMIGTAQGLKLM 1062

Query: 350  IWQLK-DNDLTGIAFIDTEVYIASMVSV--KNLILVGDYARSIALLRYQPEYRTLSLVAR 406
            I  ++ D     +AF+D + Y + + ++  + L L GD  + +    +  E   L+L+ +
Sbjct: 1063 IRGMREDGSCLPVAFLDAQCYTSLLKTLDSRGLWLAGDAWKGLWFGGFTQEPYKLTLLGK 1122

Query: 407  DYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMG 466
              +                                   +E+ +       D L    ++ 
Sbjct: 1123 SPR---------------------------------TEMEVIEA------DFLPFDGALF 1143

Query: 467  FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSIS------ 520
             ++ D D ++ +  Y PE  +S  G RL+ ++ FH+G        +   PS+++      
Sbjct: 1144 LLVLDADADLHVLQYDPENPKSLNGQRLLHRSTFHIGHFPTGSMLL---PSTLAPFTEQA 1200

Query: 521  -DAPGARSR-----------FLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHT 568
             D P   S            F     +  G++G   PL E  YRRL  LQ  +     H 
Sbjct: 1201 RDLPNGDSEDTKQEEVNSPLFHVLTTTSSGSIGLITPLDESTYRRLSALQGHLTNILEHA 1260

Query: 569  GGLNPRAFRT---YKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 618
             GLNPR +RT    K      G  ++G++DGSL+ +  +L    R ++  ++G
Sbjct: 1261 AGLNPRMYRTDTEMKATDSEMGG-AKGVVDGSLIRRISELGAARRADVLSRVG 1312


>gi|349577352|dbj|GAA22521.1| K7_Cft1p [Saccharomyces cerevisiae Kyokai no. 7]
          Length = 1357

 Score =  147 bits (372), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 137/556 (24%), Positives = 239/556 (42%), Gaps = 71/556 (12%)

Query: 89   MRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYF 148
            M YF +  GY  +F+ G  P  L        +      + P+ ++ P+      R  +  
Sbjct: 851  MHYFPDYNGYSVIFVTGSVPYILIKEDDSTPKIFKFG-NIPLVSVTPW----SERSVMCV 905

Query: 149  NAKSELRISVLPT-HLSYDAPWPVRKVPLKC------TPHFLAYHLETKTYCIVTSTAEP 201
            +     R+  L T ++ Y    P++++ +        T   L YH   + + +      P
Sbjct: 906  DDIKNARVYTLTTDNMYYGNKLPLKQIKISNVLDDYKTLQKLVYHERAQLFLVSYCKRVP 965

Query: 202  STDYYKFNGEDKELVTDPRDSRFIPPLVSQFH--VSLFSPFSWEEIPQTNFPLHEWEHVL 259
                Y+  GED E V    ++    P    F   + L +P SW+ I + +FP +    V 
Sbjct: 966  ----YEALGEDGEKVIGYDEN---VPHAEGFQSGILLINPKSWKVIDKIDFPKNSV--VN 1016

Query: 260  CLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIK 319
             +++  ++        R YI  G     +ED    G   ++D+IEVVPEPG+P T  K+K
Sbjct: 1017 EMRSSMIQINSKTKRKREYIIAGVANATTEDTPPTGAFHIYDVIEVVPEPGKPDTNYKLK 1076

Query: 320  MIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLK-DNDLTGIAFIDTEVYIASMVSVKN 378
             I+ +E  G V+ +C V+G  + +  QK+ +  ++ DN +  +AF+D  V++    S  N
Sbjct: 1077 EIFQEEVSGTVSTVCEVSGRFMISQSQKVLVRDIQEDNSVIPVAFLDIPVFVTDSKSFGN 1136

Query: 379  LILVGDYARSIALLRYQPE-YRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFL 437
            L+++GD  +    + +  E YR +SL                          G  + KF 
Sbjct: 1137 LLIIGDAMQGFQFIGFDAEPYRMISL--------------------------GRSMSKFQ 1170

Query: 438  QLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKK 497
             +SL               + L     M F  +D D+NV +  Y P+   S  G RL+  
Sbjct: 1171 TMSL---------------EFLVNGGDMYFAATDADRNVHVLKYAPDEPNSLSGQRLVHC 1215

Query: 498  TDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLML 557
            + F L    N+   +  +      +P   S F      +DG++   +PL E+ YRRL ++
Sbjct: 1216 SSFTL-HSTNSCMMLLPRNEEFG-SPQVPS-FQNVGGQVDGSVFKIVPLSEEKYRRLYVI 1272

Query: 558  QNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI 617
            Q  ++      GGLNPR  R      Y  G+  R ++D +++ +F  L++  R  I +K 
Sbjct: 1273 QQQIIDRELQLGGLNPRMER-LANDFYQMGHSMRPMLDFNVIRRFCGLAIDRRKSIAQKA 1331

Query: 618  G-SKHNDILDELYDIE 632
            G + H +   ++ +IE
Sbjct: 1332 GRNAHFEAWRDIINIE 1347


>gi|323309632|gb|EGA62840.1| Cft1p [Saccharomyces cerevisiae FostersO]
          Length = 1357

 Score =  147 bits (372), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 134/541 (24%), Positives = 231/541 (42%), Gaps = 70/541 (12%)

Query: 89   MRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYF 148
            M YF +  GY  +F+ G  P  L        +      + P+ ++ P+      R  +  
Sbjct: 851  MHYFPDYNGYSVIFVTGSVPYILIKEDDSTPKIFKFG-NIPLVSVTPW----SERSVMCV 905

Query: 149  NAKSELRISVLPT-HLSYDAPWPVRKVPLKC------TPHFLAYHLETKTYCIVTSTAEP 201
            +     R+  L T ++ Y    P++++ +        T   L YH   + + +      P
Sbjct: 906  DDIKNARVYTLTTDNMYYGNKLPLKQIKISNVLDDYKTLQKLVYHERAQLFLVSYCKRVP 965

Query: 202  STDYYKFNGEDKELVTDPRDSRFIPPLVSQFH--VSLFSPFSWEEIPQTNFPLHEWEHVL 259
                Y+  GED E V    ++    P    F   + L +P SW+ I + +FP +    V 
Sbjct: 966  ----YEALGEDGEKVIGYDEN---VPHAEGFQSGILLINPKSWKVIDKIDFPKNSV--VN 1016

Query: 260  CLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIK 319
             +++  ++        R YI  G     +ED    G   ++D+IEVVPEPG+P T  K+K
Sbjct: 1017 EMRSSMIQINSKTKRKREYIIAGVANATTEDTPPTGAFHIYDVIEVVPEPGKPDTNYKLK 1076

Query: 320  MIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLK-DNDLTGIAFIDTEVYIASMVSVKN 378
             I+ +E  G V+ +C V+G  + +  QK+ +  ++ DN +  +AF+D  V++    S  N
Sbjct: 1077 EIFQEEVSGTVSTVCEVSGRFMISQSQKVLVRDIQEDNSVIPVAFLDIPVFVTDSKSFGN 1136

Query: 379  LILVGDYARSIALLRYQPE-YRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFL 437
            L+++GD  +    + +  E YR +SL                          G  + KF 
Sbjct: 1137 LLIIGDAMQGFQFIGFDAEPYRMISL--------------------------GRSMSKFQ 1170

Query: 438  QLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKK 497
             +SL               + L     M F  +D D+NV +  Y P+   S  G RL+  
Sbjct: 1171 TMSL---------------EFLVNGGDMYFAATDADRNVHVLKYAPDEPNSLSGQRLVHC 1215

Query: 498  TDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLML 557
            + F L    N+   +  +      +P   S F      +DG++   +PL E+ YRRL ++
Sbjct: 1216 SSFTL-HSTNSCMMLLPRNEEFG-SPQVPS-FQNVGGQVDGSVFKIVPLSEEKYRRLYVI 1272

Query: 558  QNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI 617
            Q  ++      GGLNPR  R      Y  G+  R ++D +++ +F  L++  R  I +K 
Sbjct: 1273 QQQIIDRELQLGGLNPRMER-LANDFYQMGHSMRPMLDFNVIRRFCGLAIDRRKSIAQKA 1331

Query: 618  G 618
            G
Sbjct: 1332 G 1332


>gi|207346484|gb|EDZ72967.1| YDR301Wp-like protein [Saccharomyces cerevisiae AWRI1631]
          Length = 1357

 Score =  147 bits (372), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 134/541 (24%), Positives = 231/541 (42%), Gaps = 70/541 (12%)

Query: 89   MRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYF 148
            M YF +  GY  +F+ G  P  L        +      + P+ ++ P+      R  +  
Sbjct: 851  MHYFPDYNGYSVIFVTGSVPYILIKEDDSTPKIFKFG-NIPLVSVTPW----SERSVMCV 905

Query: 149  NAKSELRISVLPT-HLSYDAPWPVRKVPLKC------TPHFLAYHLETKTYCIVTSTAEP 201
            +     R+  L T ++ Y    P++++ +        T   L YH   + + +      P
Sbjct: 906  DDIKNARVYTLTTDNMYYGNKLPLKQIKISNVLDDYKTLQKLVYHERAQLFLVSYCKRVP 965

Query: 202  STDYYKFNGEDKELVTDPRDSRFIPPLVSQFH--VSLFSPFSWEEIPQTNFPLHEWEHVL 259
                Y+  GED E V    ++    P    F   + L +P SW+ I + +FP +    V 
Sbjct: 966  ----YEALGEDGEKVIGYDEN---VPHAEGFQSGILLINPKSWKVIDKIDFPKNSV--VN 1016

Query: 260  CLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIK 319
             +++  ++        R YI  G     +ED    G   ++D+IEVVPEPG+P T  K+K
Sbjct: 1017 EMRSSMIQINSKTKRKREYIIAGVANATTEDTPPTGAFHIYDVIEVVPEPGKPDTNYKLK 1076

Query: 320  MIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLK-DNDLTGIAFIDTEVYIASMVSVKN 378
             I+ +E  G V+ +C V+G  + +  QK+ +  ++ DN +  +AF+D  V++    S  N
Sbjct: 1077 EIFQEEVSGTVSTVCEVSGRFMISQSQKVLVRDIQEDNSVIPVAFLDIPVFVTDSKSFGN 1136

Query: 379  LILVGDYARSIALLRYQPE-YRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFL 437
            L+++GD  +    + +  E YR +SL                          G  + KF 
Sbjct: 1137 LLIIGDAMQGFQFIGFDAEPYRMISL--------------------------GRSMSKFQ 1170

Query: 438  QLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKK 497
             +SL               + L     M F  +D D+NV +  Y P+   S  G RL+  
Sbjct: 1171 TMSL---------------EFLVNGGDMYFAATDADRNVHVLKYAPDEPNSLSGQRLVHC 1215

Query: 498  TDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLML 557
            + F L    N+   +  +      +P   S F      +DG++   +PL E+ YRRL ++
Sbjct: 1216 SSFTL-HSTNSCMMLLPRNEEFG-SPQVPS-FQNVGGQVDGSVFKIVPLSEEKYRRLYVI 1272

Query: 558  QNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI 617
            Q  ++      GGLNPR  R      Y  G+  R ++D +++ +F  L++  R  I +K 
Sbjct: 1273 QQQIIDRELQLGGLNPRMER-LANDFYQMGHSMRPMLDFNVIRRFCGLAIDRRKSIAQKA 1331

Query: 618  G 618
            G
Sbjct: 1332 G 1332


>gi|323338222|gb|EGA79455.1| Cft1p [Saccharomyces cerevisiae Vin13]
 gi|365766372|gb|EHN07870.1| Cft1p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
          Length = 1357

 Score =  147 bits (372), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 134/541 (24%), Positives = 231/541 (42%), Gaps = 70/541 (12%)

Query: 89   MRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYF 148
            M YF +  GY  +F+ G  P  L        +      + P+ ++ P+      R  +  
Sbjct: 851  MHYFPDYNGYSVIFVTGSVPYILIKEDDSTPKIFKFG-NIPLVSVTPW----SERSVMCV 905

Query: 149  NAKSELRISVLPT-HLSYDAPWPVRKVPLKC------TPHFLAYHLETKTYCIVTSTAEP 201
            +     R+  L T ++ Y    P++++ +        T   L YH   + + +      P
Sbjct: 906  DDIKNARVYTLTTDNMYYGNKLPLKQIKISNVLDDYKTLQKLVYHERAQLFLVSYCKRVP 965

Query: 202  STDYYKFNGEDKELVTDPRDSRFIPPLVSQFH--VSLFSPFSWEEIPQTNFPLHEWEHVL 259
                Y+  GED E V    ++    P    F   + L +P SW+ I + +FP +    V 
Sbjct: 966  ----YEALGEDGEKVIGYDEN---VPHAEGFQSGILLINPKSWKVIDKIDFPKNSV--VN 1016

Query: 260  CLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIK 319
             +++  ++        R YI  G     +ED    G   ++D+IEVVPEPG+P T  K+K
Sbjct: 1017 EMRSSMIQINSKTKRKREYIIAGVANATTEDTPPTGAFHIYDVIEVVPEPGKPDTNYKLK 1076

Query: 320  MIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLK-DNDLTGIAFIDTEVYIASMVSVKN 378
             I+ +E  G V+ +C V+G  + +  QK+ +  ++ DN +  +AF+D  V++    S  N
Sbjct: 1077 EIFQEEVSGTVSTVCEVSGRFMISQSQKVLVRDIQEDNSVIPVAFLDIPVFVTDSKSFGN 1136

Query: 379  LILVGDYARSIALLRYQPE-YRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFL 437
            L+++GD  +    + +  E YR +SL                          G  + KF 
Sbjct: 1137 LLIIGDAMQGFQFIGFDAEPYRMISL--------------------------GRSMSKFQ 1170

Query: 438  QLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKK 497
             +SL               + L     M F  +D D+NV +  Y P+   S  G RL+  
Sbjct: 1171 TMSL---------------EFLVNGGDMYFAATDADRNVHVLKYAPDEPNSLSGQRLVHC 1215

Query: 498  TDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLML 557
            + F L    N+   +  +      +P   S F      +DG++   +PL E+ YRRL ++
Sbjct: 1216 SSFTL-HSTNSCMMLLPRNEEFG-SPQVPS-FQNVGGQVDGSVFKIVPLSEEKYRRLYVI 1272

Query: 558  QNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI 617
            Q  ++      GGLNPR  R      Y  G+  R ++D +++ +F  L++  R  I +K 
Sbjct: 1273 QQQIIDRELQLGGLNPRMER-LANDFYQMGHSMRPMLDFNVIRRFCGLAIDRRKSIAQKA 1331

Query: 618  G 618
            G
Sbjct: 1332 G 1332


>gi|6320507|ref|NP_010587.1| Cft1p [Saccharomyces cerevisiae S288c]
 gi|74583567|sp|Q06632.1|CFT1_YEAST RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two protein
            1
 gi|849213|gb|AAB64737.1| Ydr301wp [Saccharomyces cerevisiae]
 gi|256271799|gb|EEU06830.1| Cft1p [Saccharomyces cerevisiae JAY291]
 gi|285811316|tpg|DAA12140.1| TPA: Cft1p [Saccharomyces cerevisiae S288c]
 gi|392300415|gb|EIW11506.1| Cft1p [Saccharomyces cerevisiae CEN.PK113-7D]
          Length = 1357

 Score =  147 bits (372), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 134/541 (24%), Positives = 231/541 (42%), Gaps = 70/541 (12%)

Query: 89   MRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYF 148
            M YF +  GY  +F+ G  P  L        +      + P+ ++ P+      R  +  
Sbjct: 851  MHYFPDYNGYSVIFVTGSVPYILIKEDDSTPKIFKFG-NIPLVSVTPW----SERSVMCV 905

Query: 149  NAKSELRISVLPT-HLSYDAPWPVRKVPLKC------TPHFLAYHLETKTYCIVTSTAEP 201
            +     R+  L T ++ Y    P++++ +        T   L YH   + + +      P
Sbjct: 906  DDIKNARVYTLTTDNMYYGNKLPLKQIKISNVLDDYKTLQKLVYHERAQLFLVSYCKRVP 965

Query: 202  STDYYKFNGEDKELVTDPRDSRFIPPLVSQFH--VSLFSPFSWEEIPQTNFPLHEWEHVL 259
                Y+  GED E V    ++    P    F   + L +P SW+ I + +FP +    V 
Sbjct: 966  ----YEALGEDGEKVIGYDEN---VPHAEGFQSGILLINPKSWKVIDKIDFPKNSV--VN 1016

Query: 260  CLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIK 319
             +++  ++        R YI  G     +ED    G   ++D+IEVVPEPG+P T  K+K
Sbjct: 1017 EMRSSMIQINSKTKRKREYIIAGVANATTEDTPPTGAFHIYDVIEVVPEPGKPDTNYKLK 1076

Query: 320  MIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLK-DNDLTGIAFIDTEVYIASMVSVKN 378
             I+ +E  G V+ +C V+G  + +  QK+ +  ++ DN +  +AF+D  V++    S  N
Sbjct: 1077 EIFQEEVSGTVSTVCEVSGRFMISQSQKVLVRDIQEDNSVIPVAFLDIPVFVTDSKSFGN 1136

Query: 379  LILVGDYARSIALLRYQPE-YRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFL 437
            L+++GD  +    + +  E YR +SL                          G  + KF 
Sbjct: 1137 LLIIGDAMQGFQFIGFDAEPYRMISL--------------------------GRSMSKFQ 1170

Query: 438  QLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKK 497
             +SL               + L     M F  +D D+NV +  Y P+   S  G RL+  
Sbjct: 1171 TMSL---------------EFLVNGGDMYFAATDADRNVHVLKYAPDEPNSLSGQRLVHC 1215

Query: 498  TDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLML 557
            + F L    N+   +  +      +P   S F      +DG++   +PL E+ YRRL ++
Sbjct: 1216 SSFTL-HSTNSCMMLLPRNEEFG-SPQVPS-FQNVGGQVDGSVFKIVPLSEEKYRRLYVI 1272

Query: 558  QNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI 617
            Q  ++      GGLNPR  R      Y  G+  R ++D +++ +F  L++  R  I +K 
Sbjct: 1273 QQQIIDRELQLGGLNPRMER-LANDFYQMGHSMRPMLDFNVIRRFCGLAIDRRKSIAQKA 1331

Query: 618  G 618
            G
Sbjct: 1332 G 1332


>gi|190404756|gb|EDV08023.1| 150 kDa protein associated with polyadenylation factor 1
            [Saccharomyces cerevisiae RM11-1a]
 gi|259145538|emb|CAY78802.1| Cft1p [Saccharomyces cerevisiae EC1118]
          Length = 1357

 Score =  147 bits (372), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 134/541 (24%), Positives = 231/541 (42%), Gaps = 70/541 (12%)

Query: 89   MRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYF 148
            M YF +  GY  +F+ G  P  L        +      + P+ ++ P+      R  +  
Sbjct: 851  MHYFPDYNGYSVIFVTGSVPYILIKEDDSTPKIFKFG-NIPLVSVTPW----SERSVMCV 905

Query: 149  NAKSELRISVLPT-HLSYDAPWPVRKVPLKC------TPHFLAYHLETKTYCIVTSTAEP 201
            +     R+  L T ++ Y    P++++ +        T   L YH   + + +      P
Sbjct: 906  DDIKNARVYTLTTDNMYYGNKLPLKQIKISNVLDDYKTLQKLVYHERAQLFLVSYCKRVP 965

Query: 202  STDYYKFNGEDKELVTDPRDSRFIPPLVSQFH--VSLFSPFSWEEIPQTNFPLHEWEHVL 259
                Y+  GED E V    ++    P    F   + L +P SW+ I + +FP +    V 
Sbjct: 966  ----YEALGEDGEKVIGYDEN---VPHAEGFQSGILLINPKSWKVIDKIDFPKNSV--VN 1016

Query: 260  CLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIK 319
             +++  ++        R YI  G     +ED    G   ++D+IEVVPEPG+P T  K+K
Sbjct: 1017 EMRSSMIQINSKTKRKREYIIAGVANATTEDTPPTGAFHIYDVIEVVPEPGKPDTNYKLK 1076

Query: 320  MIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLK-DNDLTGIAFIDTEVYIASMVSVKN 378
             I+ +E  G V+ +C V+G  + +  QK+ +  ++ DN +  +AF+D  V++    S  N
Sbjct: 1077 EIFQEEVSGTVSTVCEVSGRFMISQSQKVLVRDIQEDNSVIPVAFLDIPVFVTDSKSFGN 1136

Query: 379  LILVGDYARSIALLRYQPE-YRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFL 437
            L+++GD  +    + +  E YR +SL                          G  + KF 
Sbjct: 1137 LLIIGDAMQGFQFIGFDAEPYRMISL--------------------------GRSMSKFQ 1170

Query: 438  QLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKK 497
             +SL               + L     M F  +D D+NV +  Y P+   S  G RL+  
Sbjct: 1171 TMSL---------------EFLVNGGDMYFAATDADRNVHVLKYAPDEPNSLSGQRLVHC 1215

Query: 498  TDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLML 557
            + F L    N+   +  +      +P   S F      +DG++   +PL E+ YRRL ++
Sbjct: 1216 SSFTL-HSTNSCMMLLPRNEEFG-SPQVPS-FQNVGGQVDGSVFKIVPLSEEKYRRLYVI 1272

Query: 558  QNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI 617
            Q  ++      GGLNPR  R      Y  G+  R ++D +++ +F  L++  R  I +K 
Sbjct: 1273 QQQIIDRELQLGGLNPRMER-LANDFYQMGHSMRPMLDFNVIRRFCGLAIDRRKSIAQKA 1331

Query: 618  G 618
            G
Sbjct: 1332 G 1332


>gi|151942273|gb|EDN60629.1| cleavage factor II (CF II) component [Saccharomyces cerevisiae
            YJM789]
          Length = 1357

 Score =  147 bits (371), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 134/541 (24%), Positives = 231/541 (42%), Gaps = 70/541 (12%)

Query: 89   MRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYF 148
            M YF +  GY  +F+ G  P  L        +      + P+ ++ P+      R  +  
Sbjct: 851  MHYFPDYNGYSVIFVTGSVPYILIKEDDSTPKIFKFG-NIPLVSVTPW----SERSVMCV 905

Query: 149  NAKSELRISVLPT-HLSYDAPWPVRKVPLKC------TPHFLAYHLETKTYCIVTSTAEP 201
            +     R+  L T ++ Y    P++++ +        T   L YH   + + +      P
Sbjct: 906  DDIKNARVYTLTTDNMYYGNKLPLKQIKISNVHDDYKTLQKLVYHERAQLFLVSYCKRVP 965

Query: 202  STDYYKFNGEDKELVTDPRDSRFIPPLVSQFH--VSLFSPFSWEEIPQTNFPLHEWEHVL 259
                Y+  GED E V    ++    P    F   + L +P SW+ I + +FP +    V 
Sbjct: 966  ----YEALGEDGEKVIGYDEN---VPHAEGFQSGILLINPKSWKVIDKIDFPKNSV--VN 1016

Query: 260  CLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIK 319
             +++  ++        R YI  G     +ED    G   ++D+IEVVPEPG+P T  K+K
Sbjct: 1017 EMRSSMIQINSKTKRKREYIIAGVANATTEDTPPTGAFHIYDVIEVVPEPGKPDTNYKLK 1076

Query: 320  MIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLK-DNDLTGIAFIDTEVYIASMVSVKN 378
             I+ +E  G V+ +C V+G  + +  QK+ +  ++ DN +  +AF+D  V++    S  N
Sbjct: 1077 EIFQEEVSGTVSTVCEVSGRFMISQSQKVLVRDIQEDNSVIPVAFLDIPVFVTDSKSFGN 1136

Query: 379  LILVGDYARSIALLRYQPE-YRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFL 437
            L+++GD  +    + +  E YR +SL                          G  + KF 
Sbjct: 1137 LLIIGDAMQGFQFIGFDAEPYRMISL--------------------------GRSMSKFQ 1170

Query: 438  QLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKK 497
             +SL               + L     M F  +D D+NV +  Y P+   S  G RL+  
Sbjct: 1171 TMSL---------------EFLVNGGDMYFAATDADRNVHVLKYAPDEPNSLSGQRLVHC 1215

Query: 498  TDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLML 557
            + F L    N+   +  +      +P   S F      +DG++   +PL E+ YRRL ++
Sbjct: 1216 SSFTL-HSTNSCMMLLPRNEEFG-SPQVPS-FQNVGGQVDGSVFKIVPLSEEKYRRLYVI 1272

Query: 558  QNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI 617
            Q  ++      GGLNPR  R      Y  G+  R ++D +++ +F  L++  R  I +K 
Sbjct: 1273 QQQIIDRELQLGGLNPRMER-LANDFYQMGHSMRPMLDFNVIRRFCGLAIDRRKSIAQKA 1331

Query: 618  G 618
            G
Sbjct: 1332 G 1332


>gi|407410979|gb|EKF33219.1| cleavage and polyadenylation specificity factor, putative
            [Trypanosoma cruzi marinkellei]
          Length = 1436

 Score =  146 bits (369), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 149/637 (23%), Positives = 260/637 (40%), Gaps = 103/637 (16%)

Query: 32   PLLLVRTQHELLIYQAFRHPKGALKLRFKKLKVLFVSDRSKRANEQPGLPRGVRISQMRY 91
            PL L ++ H  L ++A R    +++++ K+L+     +R    N+   + +  R  ++  
Sbjct: 861  PLRLKKSMHHFLDHKAEREVIESIEMKRKRLQ----RERGVVENDTQLMRQYSR--RIVP 914

Query: 92   FSNIAGYQGVFLCGPHPAWLFLTSRG-ELRAHPMTIDGPVSTLAPFHNVN-----CPRGF 145
            F +I G  G ++CG HP +LF   R  EL A+     GPV    PF  +N     C  GF
Sbjct: 915  FDSIGGNAGAYVCGQHPLFLFWDRRTRELEAYRHQTLGPVRGFVPFRIINSGYIYCCEGF 974

Query: 146  LYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEP---- 201
            + F   + +     PT       W  R++ L  TPHF+ YH   ++  +VTS  EP    
Sbjct: 975  VDF---ASMDTYCRPTGQG----WLTRRIHLGVTPHFVVYHPPARSCFVVTSKKEPFRPQ 1027

Query: 202  --------STDYYKFNGEDKELVTD---------PRDSRFIPPLVSQFHVSLFSPFSWEE 244
                    +  Y + +G  + + T+         P ++    P+  +F + L S   W  
Sbjct: 1028 RAPFDVQLNIVYDEESGGVQSITTEAPVCNMPPIPPNAGIRVPMADRFEICLMSTTDWA- 1086

Query: 245  IPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRG--YIALGTNYNYSEDVTCRGRILLFDI 302
                   L E E VL  + + +  E    GL       + T +   ED+T RGRILL   
Sbjct: 1087 -CTDTLLLEENERVLGAQMMEIHCEKDAEGLHTAPVCVVSTAFPLGEDITSRGRILLLST 1145

Query: 303  IEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQL--KDNDLTG 360
            +           K KI + +++   GP TA+  +   +  AVG  I +++   ++  L  
Sbjct: 1146 M-------CTKKKRKILLFHSEPLNGPATAVVGIRHHIAVAVGGTIKLFRFDWENRKLVV 1198

Query: 361  IAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYA 420
             A +    Y+  M S +N ++ GD +RS A+ R+  E  TLS++ +D             
Sbjct: 1199 GALLYAGTYVTRMSSFRNYLIYGDLSRSCAIARFNEENHTLSVLGKDR------------ 1246

Query: 421  GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFM 480
                                          +   H D++    + G + SD ++N+++  
Sbjct: 1247 ----------------------------NAVSVVHCDMMYHDRAFGLLCSDDERNLLVMG 1278

Query: 481  YQPEARESNGG--HRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDG 538
            Y P  +E+  G  +++++      G++        C   S+     A +  +T Y +  G
Sbjct: 1279 YTPRVQETEAGSPNKVLESVLSLDGEY---RLSGGCLVKSLRFRSLAGNSSVTLYVTNYG 1335

Query: 539  ALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTY-KGKGYYAGNPSRGIIDGS 597
             +GF +P+ E+  R    L   +     H  GL PR F    +G    A      ++  S
Sbjct: 1336 EIGFIVPIGEQANRTASWLMRRLQMDLPHNAGLTPRMFLGLSQGSPRTALRAKEMLVSAS 1395

Query: 598  LVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEAL 634
            L+ +F  L +  R    K I S     L+ + ++ AL
Sbjct: 1396 LLNEFFFLDIHSR----KTIASAAYTQLERVTNVAAL 1428


>gi|71021721|ref|XP_761091.1| hypothetical protein UM04944.1 [Ustilago maydis 521]
 gi|46100541|gb|EAK85774.1| hypothetical protein UM04944.1 [Ustilago maydis 521]
          Length = 1597

 Score =  146 bits (369), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 127/506 (25%), Positives = 225/506 (44%), Gaps = 54/506 (10%)

Query: 126  IDGPVSTLAPFHNVNCPRG--------FLYFNAKSELRISVLPTHLSYDAPWPVRKVPLK 177
            +D P   L+   +++ P          F++ +    L +  LP  L     WP   V   
Sbjct: 1060 LDWPDRDLSSLASISAPLASTGSVNADFVFCDRAGRLYLGRLPAGLDSSTAWPSSVVRTG 1119

Query: 178  CTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLF 237
                 +  H  T T  ++ ++  P    +    ED E + D + +  + P VSQ      
Sbjct: 1120 REYTNVVAHDPTST--VIAASVSPC--RFMLFDEDGEAIHDEQPNSTLYPSVSQRGSLEL 1175

Query: 238  SPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRI 297
                    P   +     E V  L+ V+++   T+SG + ++A GT   + ED T +G +
Sbjct: 1176 FISQHGSTPVDGYEFEANETVTSLEIVTLDSPSTVSGRKQFVAAGTTTFHGEDRTAKGSV 1235

Query: 298  LLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDND 357
             LF+II VV    +  +  ++K++   + + PVTAI H+ G+L++  GQK+Y+  L+  +
Sbjct: 1236 YLFEIISVVSAASELGSDLRLKLVCRDDSRAPVTAISHINGYLISTCGQKLYVRALEKQE 1295

Query: 358  -LTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPE-YRTLSLVARDYKPTQPNS 415
             L  IAF+D   YI S+  VKNL+L+GD  R + L  +Q + Y+ + L   +        
Sbjct: 1296 WLISIAFLDCPFYITSIEVVKNLVLLGDCKRGLGLWAFQEDPYKFVELAKAE-------- 1347

Query: 416  KGYYAGNPSRGIIDGSL-VWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDK 474
                         DG + V  FL     E++ +    GS+          +G   S +  
Sbjct: 1348 -------------DGCVGVGAFLVRD--EKVSMLSISGSR----------LGGDASMEAS 1382

Query: 475  NVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAP--GARS-RFLT 531
              V+ +Y+     + GG +L+ +++F          ++ C    +SD+   G  + R   
Sbjct: 1383 AGVIRLYEYAPHLAVGGKKLVLRSEFQTTSEA--VARVECSGRWLSDSELRGRETLRNKV 1440

Query: 532  WYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSR 591
             +A  +G++     + EK  +RL +LQ  +V    HT  LNPR+FR  +   Y      +
Sbjct: 1441 VFAKANGSVESVAAVDEKVGKRLHLLQGQLVRSVMHTAALNPRSFRMVRND-YVPRALVK 1499

Query: 592  GIIDGSLVWKFLQLSLGERLEICKKI 617
            G++D  L+ +F++LS  + LE  K +
Sbjct: 1500 GVLDARLLDEFMRLSRPKMLEAVKTL 1525


>gi|401624207|gb|EJS42273.1| cft1p [Saccharomyces arboricola H-6]
          Length = 1356

 Score =  145 bits (367), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 130/540 (24%), Positives = 228/540 (42%), Gaps = 68/540 (12%)

Query: 89   MRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYF 148
            M YF +  GY  +F+ G  P  L        +      + P+ ++ P+      R  +  
Sbjct: 850  MHYFPDYNGYSVIFVTGSVPYILIKEDDSTPKIFKFA-NIPLVSVTPW----SERSVMCV 904

Query: 149  NAKSELRISVLPT-HLSYDAPWPVRKVPLKC------TPHFLAYHLETKTYCIVTSTAEP 201
            +     R+  L   ++ Y    P++++ +        T   + YH + + + +      P
Sbjct: 905  DDIKNARVYTLTIDNMYYGNKMPLKQIKISNVLDDYKTLQKVVYHEKAELFLVSYCKRVP 964

Query: 202  STDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCL 261
                Y+  GED E +    D +       Q  + L +P SW+ I + +FP +    V  +
Sbjct: 965  ----YEALGEDGERIIG-YDEKVPHAEGFQGGILLINPKSWKVIDKIDFPNNSV--VNEM 1017

Query: 262  KNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMI 321
            ++  ++        R YI  G     +ED    G   ++D+ EVVPEPG+P T  K+K I
Sbjct: 1018 RSSMIQVNSKTKKKREYIIAGVANATTEDTPPTGAFHIYDVTEVVPEPGKPDTNYKLKEI 1077

Query: 322  YAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLK-DNDLTGIAFIDTEVYIASMVSVKNLI 380
            + +E  G V+ +C V+G  + +  QK+ +  ++ DN +  +AF+D  V++    S  NL+
Sbjct: 1078 FQEEVSGTVSTVCEVSGRFMISQSQKVLVRDIQEDNSVIPVAFLDIPVFVTDSKSFGNLL 1137

Query: 381  LVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLS 440
            L+GD  +    + +  E                         P R I+ G  + KF  +S
Sbjct: 1138 LIGDAMQGFQFIGFDAE-------------------------PYRMILLGRSISKFQTMS 1172

Query: 441  LGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDF 500
            L               + L     M F  +D D+NV +  Y P+   S  G RL+  + F
Sbjct: 1173 L---------------EFLVNGGDMYFSATDADRNVHVLKYAPDEPNSLSGQRLVHCSSF 1217

Query: 501  HLGQHVNTFFKIRCKPSSI--SDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQ 558
             L   +N+   +  K      S  P     F      +DG++   +PL E+ YRRL ++Q
Sbjct: 1218 TL-HSINSCMLLLPKNEEFGSSQVPS----FQNVGGQVDGSIFKIVPLSEETYRRLYVIQ 1272

Query: 559  NVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 618
              ++      GGLNPR  R      Y  G+  R ++D +++ +F +L++  R    +K G
Sbjct: 1273 QQIIDREIQLGGLNPRMER-LANDFYQMGHSMRPMLDFNVIRRFSELAIDRRKNTAQKAG 1331


>gi|343425828|emb|CBQ69361.1| related to cleavage and polyadenylation specificity factor, 160 kDa
            subunit [Sporisorium reilianum SRZ2]
          Length = 1567

 Score =  145 bits (366), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 131/503 (26%), Positives = 220/503 (43%), Gaps = 50/503 (9%)

Query: 126  IDGPVSTLAPFHNVNCPRG----FLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPH 181
            +D P   L    ++  PR     F Y +   +L ++  P  L  +  W    V  +    
Sbjct: 1035 LDWPEGDLCCIASIYTPRANDADFAYCDRAGQLWLARAPHGLYAETSWMSSVV--RTGRE 1092

Query: 182  FLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFS 241
            +        T+ +V ++ +P   +  F+ ED E + DP     +P   +Q          
Sbjct: 1093 YTRVVAHDATHTVVAASIQPCR-FVLFD-EDGEPIADPGADEALPSTTAQRGALELFISE 1150

Query: 242  WEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFD 301
                    +     E V  L+ V+++   T SG + ++A GT   + ED T +G + LF+
Sbjct: 1151 DRTTAADGYEFEANETVTALEIVTLDAPSTASGRKQFVAAGTTTFHGEDRTAKGCVYLFE 1210

Query: 302  IIEVVPEPGQPLTKN-KIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDND-LT 359
            +IEVV      + ++ ++K++   + +GPVTAI  + GFLV+  GQK+Y+  L+  + L 
Sbjct: 1211 VIEVVASARYQVGRDLRLKLVCRDDSRGPVTAIAQLNGFLVSTCGQKLYVRALEKEEWLI 1270

Query: 360  GIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPE-YRTLSLVARDYKPTQPNSKGY 418
             IAF+D  +Y+  +  VKN +L+ D  +S+ LL +Q E YR + L  RD          Y
Sbjct: 1271 SIAFLDCPLYVTGIRVVKNFVLLSDARKSLWLLAFQEEPYRFVDL-GRDIHDHHATLGQY 1329

Query: 419  YAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVV- 477
               N                    ERL +    G+     L   ++ G     +D  VV 
Sbjct: 1330 LVYN--------------------ERLALVSTSGAA----LGGSTAFG-----RDAGVVR 1360

Query: 478  LFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPG---ARSRFLTWYA 534
            L+ Y P    +N   RL+ +T+F            R +  S S+  G    R++ +   A
Sbjct: 1361 LYEYAPHVASAN--TRLVLRTEFQTASPATASVACRGRWLSDSELRGREHGRNKLV--LA 1416

Query: 535  SLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGII 594
              +GAL       ++  +RL +LQ  +V    HT  LNPRAFR  +   + +    +G++
Sbjct: 1417 KANGALETLAAADDRVAKRLHVLQGQLVRSVLHTAALNPRAFRAVRND-FVSRALGKGVL 1475

Query: 595  DGSLVWKFLQLSLGERLEICKKI 617
            D  L+  F+ LS  + LE  K +
Sbjct: 1476 DARLLDSFVYLSRPKMLEAVKTL 1498


>gi|45184764|ref|NP_982482.1| AAL060Wp [Ashbya gossypii ATCC 10895]
 gi|74695871|sp|Q75EY8.1|CFT1_ASHGO RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two protein
            1
 gi|44980110|gb|AAS50306.1| AAL060Wp [Ashbya gossypii ATCC 10895]
 gi|374105681|gb|AEY94592.1| FAAL060Wp [Ashbya gossypii FDAG1]
          Length = 1305

 Score =  145 bits (365), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 122/465 (26%), Positives = 209/465 (44%), Gaps = 65/465 (13%)

Query: 179  TPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFS 238
            T + + YH  T+T+ +   +   S DY   + ED+ LV    D   I  +  Q  + L S
Sbjct: 885  TLNNITYHERTQTFIV---SYAKSIDYVALSEEDEPLVGYNPDK--IHAMGFQSGIILLS 939

Query: 239  PFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRIL 298
            P SWE I +  +  +    +  ++ + ++        R Y+ +G  Y   ED+   G   
Sbjct: 940  PKSWEIIDKIEYGKNSL--INDMRTMMIQLNSNTKRRREYLVVGNTYVRDEDIGGTGSFY 997

Query: 299  LFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLK-DND 357
            L+DI EVVPEPG+P T  K K I+ ++ +G V+ +C ++G  + +   K  +  ++ DN 
Sbjct: 998  LYDITEVVPEPGKPDTNYKFKDIFQEDIRGTVSTVCEISGRFMISQSSKAMVRDIQEDNS 1057

Query: 358  LTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPE-YRTLSLVARDYKPTQPNSK 416
            +  +AF+D  V+I    S  NL+++GD  +  + L +  E YR L+L             
Sbjct: 1058 VVPVAFLDMPVFITDAKSFGNLMIIGDSMQGFSFLGFDAEPYRMLTL------------- 1104

Query: 417  GYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNV 476
                         G  V K   +        C +    + D+        F+++D++  +
Sbjct: 1105 -------------GKSVSKLETM--------CVEFLVNNGDVY-------FLVTDRNNLM 1136

Query: 477  VLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWY--- 533
             +  Y P+   S  G RL+  T F+L    NT  ++  K    +D  G  SR    Y   
Sbjct: 1137 HVLKYAPDEPNSLSGQRLVHCTSFNL-HSTNTCMRLIKK----NDEFGKVSRGFGIYMPS 1191

Query: 534  -----ASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGN 588
                 +  DG +   +PL E +YR L ++Q  ++       GLNPR  R  +   Y  G+
Sbjct: 1192 FQCIGSQADGTIFKVVPLSEASYRSLYLIQQQLIDKEVQLCGLNPRMER-LENPFYQMGH 1250

Query: 589  PSRGIIDGSLVWKFLQLSLGERLEICKKIGSK-HNDILDELYDIE 632
              R ++D +++ +F  LS+  R+ +  K G + H +I  +L DIE
Sbjct: 1251 ILRPMLDFTVLKRFATLSIPTRMTMASKAGRQAHAEIWRDLIDIE 1295


>gi|363750592|ref|XP_003645513.1| hypothetical protein Ecym_3197 [Eremothecium cymbalariae DBVPG#7215]
 gi|356889147|gb|AET38696.1| Hypothetical protein Ecym_3197 [Eremothecium cymbalariae DBVPG#7215]
          Length = 1318

 Score =  144 bits (362), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 125/490 (25%), Positives = 213/490 (43%), Gaps = 69/490 (14%)

Query: 159  LPTHLSYDAPWPVRKVPLKC------TPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGED 212
            L  H  Y    P+RK+ L+       T + + YH  T+ + +  S    S DY   + E 
Sbjct: 871  LDNHRYYGNKMPLRKIFLEDVLEDFETFNNITYHERTQNFIVSFS---KSIDYDALSEEG 927

Query: 213  KELV----TDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEY 268
            + +V    + P    F      Q  + L +P +W  I +    L     +  ++ + ++ 
Sbjct: 928  ERIVGYEASKPHAKGF------QSGILLINPKTWNIIDR--IELGPNSLISDMRTMMIQL 979

Query: 269  EGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKG 328
                   R Y+ +G  Y   ED++  G   L+DI EVVPEPG+P T  K K I+ ++ +G
Sbjct: 980  NSNTKRKREYLVVGNTYVRDEDISGTGSFYLYDITEVVPEPGKPDTNYKFKEIFQEDIRG 1039

Query: 329  PVTAICHVAGFLVTAVGQKIYIWQLK-DNDLTGIAFIDTEVYIASMVSVKNLILVGDYAR 387
             V+ +C ++G  + +   K  +  ++ DN +  +AF+D  V+I    S  NL+++GD   
Sbjct: 1040 TVSTVCEISGRFMISQSSKAMVRDIQEDNSVVPVAFLDMPVFITDAKSFGNLMIIGDAMH 1099

Query: 388  SIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEI 447
                + +  E                         P R I  G  V K   +SL      
Sbjct: 1100 GFTFVGFDAE-------------------------PYRMITLGKSVTKLETMSL------ 1128

Query: 448  CKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVN 507
                     + L     M F+I+D+ + + +  Y P+   S  G RL+  T F+L   +N
Sbjct: 1129 ---------EFLVNNGDMYFIITDRSQVMHVLKYAPDEPNSLSGQRLVYCTSFNL-HSIN 1178

Query: 508  TFFKIRCKPSSISDA----PGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVT 563
            T  ++  K +   D         S F      +DG++   +PL E +YRRL ++Q  ++ 
Sbjct: 1179 TCMRLIQKNNEFVDLRRNYGSHMSTFQCIGCHIDGSIFKVVPLTESSYRRLYLVQQQIID 1238

Query: 564  HTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK-HN 622
                  GLNPR  R  +   Y  G+  R ++D +++ KF  LS+ +R  +  K G + H 
Sbjct: 1239 KEVQLCGLNPRMER-LQNPYYQLGHLLRPMLDFTILKKFSTLSISKRRSMASKAGHQAHT 1297

Query: 623  DILDELYDIE 632
            ++  +L DIE
Sbjct: 1298 EVWRDLIDIE 1307


>gi|403218521|emb|CCK73011.1| hypothetical protein KNAG_0M01580 [Kazachstania naganishii CBS 8797]
          Length = 1345

 Score =  141 bits (355), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 137/560 (24%), Positives = 245/560 (43%), Gaps = 74/560 (13%)

Query: 89   MRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYF 148
            M Y  +  GY  +F+ G  P  L        R      + P+ ++  + N    +  +  
Sbjct: 835  MHYVPDYNGYSVIFVTGKVPYLLIKEDDSVPRVFQFA-NIPLVSMTTWGN----KSIMCV 889

Query: 149  NAKSELRISVLP-THLSYDAPWPVRKVPLKC------TPHFLAYHLETKTYCIVTSTAEP 201
            +     R+  L  + + Y    P+++V +        T   +AYH  TKTY IV+ + E 
Sbjct: 890  DDIKNARVYTLDCSDVYYGNKIPLKRVTINSVMENYMTLTNVAYHERTKTY-IVSYSREI 948

Query: 202  STDYYKFNGEDKE-----LVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWE 256
                +   GED E     +V D   ++ +     Q  + L +P +W  I + +F      
Sbjct: 949  D---FVAKGEDGEVVPVGIVDDAPHAKSV-----QSGLLLINPTTWSVIDKIDFEPDSL- 999

Query: 257  HVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKN 316
             V  +K++ ++          Y+ +GT++  +ED+   G   ++DI EVVPEPG+P T  
Sbjct: 1000 -VNDIKSMFIQLNSRTKRKIEYVVVGTSFVGTEDLPATGSFQMYDIAEVVPEPGKPDTNY 1058

Query: 317  KIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLK-DNDLTGIAFIDTEVYIASMVS 375
            KIK  + +E +  VT++C ++G  V +  QK+ +   + DN +  +AF+D  ++ A M S
Sbjct: 1059 KIKQFFKEELRSAVTSVCDISGRFVISQSQKLMVRDAQEDNSVVPVAFLDIPLFTADMKS 1118

Query: 376  VKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWK 435
              NL+++GD  + I L+ +  E                         P R I  G  V K
Sbjct: 1119 FGNLLIIGDAMQGIQLVGFDAE-------------------------PYRMIPLGRSVLK 1153

Query: 436  FLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLI 495
            F  LSL               + L     + F + D++  + +  Y P+   S  G RLI
Sbjct: 1154 FETLSL---------------EFLVNGGDLYFTLIDRNDILHVLKYAPDEPNSLSGQRLI 1198

Query: 496  KKTDFHLGQHVNTFFKIRCKPSSISDAP--GARSRFLTWYASLDGALGFFLPLPEKNYRR 553
              + F++     +  ++  K     D P   A   +       DG+L   +P+PE  YRR
Sbjct: 1199 HCSSFNM-YSTTSCTRLIPKNELFVDGPLNPAIQSYQVIGGQADGSLFKVMPVPETVYRR 1257

Query: 554  LLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEI 613
            L ++Q  ++   +   G+NP+  R      Y   +  R ++D ++V +F  +S+ +R  +
Sbjct: 1258 LYVVQQQIIDKETPLAGINPKMER-LSNDYYQTSHLLRPMLDYNVVKQFCAMSIPKRTTL 1316

Query: 614  CKKIGSK-HNDILDELYDIE 632
              K+G + H DI  ++ ++E
Sbjct: 1317 AHKLGKRAHFDIWRDVINLE 1336


>gi|325094074|gb|EGC47384.1| cleavage factor two protein 1 [Ajellomyces capsulatus H88]
          Length = 1377

 Score =  141 bits (355), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 110/378 (29%), Positives = 186/378 (49%), Gaps = 26/378 (6%)

Query: 14   ETIVQELLTVSLGLHGNR-PLLLVRTQH-ELLIYQAFRHPKGALK----LRFKKLKVLFV 67
            ETI  ELL   LG   +R P L++R+ + +L +Y+ + +     K    LRF K+     
Sbjct: 813  ETIT-ELLVADLGDSVSRSPYLILRSSNSDLTLYEPYHYTSSTEKQFSDLRFVKIANHHF 871

Query: 68   SDRSKRANEQPGLPRGVRISQ-MRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTI 126
                  +N +        +S+ +R   ++ GY+ VF+ G  P ++  +S      H M +
Sbjct: 872  PKFHSESNVEKHPANCTALSKPLRVLGDVCGYRTVFMPGNSPCFIIKSSTS--IPHVMNL 929

Query: 127  DG-PVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAY 185
             G  V +L+ F+   C +GF+Y +  + +R+   P +  +D  W  RK+ L      + Y
Sbjct: 930  RGKTVHSLSSFNIPACEKGFVYVDTDNVVRMCRFPRNTHFDGSWAARKIGLGEQVDAVEY 989

Query: 186  HLETKTYCIVTSTAEPSTDYYKFN-GEDKELVTDPRDS--RFIPPLVSQFHVSLFSPFSW 242
               ++TY I T+          FN  ED E+  + R+    F+P  + +  V L +P +W
Sbjct: 990  SSSSETYVIGTNQK------VDFNLPEDDEIHPEWRNEVISFLPQ-IDKGSVKLLTPRTW 1042

Query: 243  EEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDI 302
              I   N  L   E ++C+K +++E        +  I +GT     ED+  RG I +F++
Sbjct: 1043 SIIDSYN--LRNAERIMCVKCLNLEVSEITHERKDTIVVGTALTKGEDIAARGCIYIFEV 1100

Query: 303  IEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA--GFLVTAVGQKIYIWQLK-DNDLT 359
            I+VVPE  +P T  K+K+I  +E KG VT++  +   GFL+ A GQK  +  LK D  L 
Sbjct: 1101 IKVVPEVDRPETNRKLKLIAKEEVKGAVTSLSGIGGQGFLIAAQGQKCIVRGLKEDGSLL 1160

Query: 360  GIAFIDTEVYIASMVSVK 377
             +AF+D + Y+  +  +K
Sbjct: 1161 PVAFMDMQCYVNVLKELK 1178



 Score = 73.2 bits (178), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 46/157 (29%), Positives = 73/157 (46%), Gaps = 17/157 (10%)

Query: 488  SNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFL----------TWYASLD 537
            S+ G RL+ ++ F  G   +T   +    +S S  P A    +              S  
Sbjct: 1220 SSKGDRLLHRSTFQTGHFASTMTLLPRTATSSSQGPDADPDMMDLDSSGPLHHVLVTSET 1279

Query: 538  GALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGS 597
            G++    P+ E +YRRL  LQ+ +     H  GLNPRAFR  +  G       RG++DG 
Sbjct: 1280 GSIALITPVSETSYRRLSALQSQLANTLEHPCGLNPRAFRAVESDGIGG----RGMVDGD 1335

Query: 598  LVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEAL 634
            LV ++L L    + EI  ++G+   D+ +   D+EA+
Sbjct: 1336 LVKRWLDLGTQRKAEIANRVGA---DVWEIRADLEAI 1369


>gi|226290902|gb|EEH46330.1| cleavage and polyadenylation specificity factor subunit A
            [Paracoccidioides brasiliensis Pb18]
          Length = 1343

 Score =  139 bits (350), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 108/378 (28%), Positives = 186/378 (49%), Gaps = 31/378 (8%)

Query: 17   VQELLTVSLGLHGNR-PLLLVRTQ-HELLIYQAFRHPKGALK----LRFKKL------KV 64
            + E+L   LG   +R P L++R+  +EL++Y+ +   +   K    LRF K+      K 
Sbjct: 818  LTEILVADLGDSVSRTPYLILRSNSNELILYEPYHTVQSTEKRLSDLRFLKIANHHFPKF 877

Query: 65   LFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPM 124
            L  S+    ++    L R      +R   ++ GY+ VF+ G  P   F+        H M
Sbjct: 878  LPESNLGNLSDSDRQLAR-----PLRALGDVCGYRTVFMPGNSPC--FIIKSATSIPHVM 930

Query: 125  TIDG-PVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFL 183
             + G  V +L+ F+   C +GF+Y +  + +R+   P +  +D  W  RK+ L      +
Sbjct: 931  NLRGKTVHSLSSFNIPACEKGFVYVDTDNVVRMCRFPRNTHFDGSWAARKIGLGEQVDSV 990

Query: 184  AYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRF-IPPLVSQFHVSLFSPFSW 242
             Y   ++TY + TS        +K   ED E+  + R+      P + +  V L +P +W
Sbjct: 991  EYSSSSETYVLGTSQKAD----FKLP-EDDEIHPEWRNEVISFFPQIDKGSVKLLNPRTW 1045

Query: 243  EEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDI 302
              I   ++ L   E V+C+K +++E        +  IA+GT     ED+  RG I +F++
Sbjct: 1046 SII--DSYQLRTAERVMCVKCLNLEASEITHERKEMIAVGTALTRGEDIAARGCIYVFEV 1103

Query: 303  IEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA--GFLVTAVGQKIYIWQLK-DNDLT 359
            I+VVPE  +P T  K+K+I  +E KG +T++  +   GFL+ A GQK  +  LK D  L 
Sbjct: 1104 IKVVPEVDRPETNRKLKLIAKEEVKGAITSLSGIGGQGFLIAAQGQKCIVRGLKEDGSLL 1163

Query: 360  GIAFIDTEVYIASMVSVK 377
             +AF+D + Y++ +  +K
Sbjct: 1164 PVAFMDMQCYVSVLKELK 1181



 Score = 79.0 bits (193), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 50/157 (31%), Positives = 76/157 (48%), Gaps = 18/157 (11%)

Query: 488  SNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFL----------TWYASLD 537
            S  G RL+ ++ FH GQ  +T   +  + S +S  P A +  +              S  
Sbjct: 1187 SAKGDRLLHRSTFHTGQFASTL-TLLPRTSVLSQGPEAEANAMDLDSSGPLHQVLVTSET 1245

Query: 538  GALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGS 597
            G++    P+ E  YRRL  LQ+ M+    H  GLNPRAFR  +  G       RG++DG 
Sbjct: 1246 GSIALITPVSEMAYRRLSALQSQMINTLEHPCGLNPRAFRAVESDGIGG----RGMVDGD 1301

Query: 598  LVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEAL 634
            LV K+L L    + EI  ++G+   D+ +   D+EA+
Sbjct: 1302 LVQKWLDLGTQRKAEIASRVGA---DVWEIRADLEAI 1335


>gi|307107849|gb|EFN56091.1| hypothetical protein CHLNCDRAFT_145620 [Chlorella variabilis]
          Length = 1626

 Score =  139 bits (349), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 135/536 (25%), Positives = 216/536 (40%), Gaps = 122/536 (22%)

Query: 32   PLLLVRT-QHELLIYQAFRHPKGALKL---------RFKKLKVLFVSDRSKRANEQPGLP 81
            PLLL  T  H+LL YQAF    G+            RF++L++            Q    
Sbjct: 1032 PLLLALTADHQLLAYQAFSASPGSGGTRGSSGSGTPRFRRLRLDLPPLLPPAGGPQ---- 1087

Query: 82   RGVRISQMRYFSNI---AGYQGVFLCGPHPAWLFLTSRGELRAHP---------MTIDGP 129
              +R+ ++  F  +   A Y GVF+ G HP WL + SRG L  HP               
Sbjct: 1088 --LRLRRLHCFEGLGEEAPYSGVFVAGQHPHWL-VASRGGLLPHPHFLPQPAGPGAAAVG 1144

Query: 130  VSTLAPFHNVNCPRGFLYFN--AKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHL 187
             +   PFHNVNCP GF+     A+S ++IS LP     DAPWP ++V +K TP  +A++ 
Sbjct: 1145 AAGFTPFHNVNCPHGFIVATSGARSGIQISQLPPRTRLDAPWPRQRVSIKGTPLKVAHYA 1204

Query: 188  ETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQ 247
            E   + +++S    +       G +   V                    +    W+ + +
Sbjct: 1205 EADMFAVLSSRQGRARGRGVMEGHEVRWV--------------------WPGGGWQGVGR 1244

Query: 248  TNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVP 307
                    E  L +  V ++   T + +   +A+G      ED  C GR+LLF++     
Sbjct: 1245 HQR--RPGERALSVGAVRLKDHATGATVP-LLAVGAALPAGEDYPCGGRLLLFEVTRGD- 1300

Query: 308  EPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKI------------------- 348
              G    +   ++IY +E KGPVT++  + G+L+ A G +I                   
Sbjct: 1301 GGGGGGGQWAGRLIYTREFKGPVTSVSGLEGYLLLASGNRIETCSLSSTTITSTADDGTV 1360

Query: 349  ---YIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVA 405
                 W+++ +     AF D  V + S+  VKN +L+GD   S+  +RY+ E R LSL++
Sbjct: 1361 AATTTWKVQRS-----AFYDGPVLLTSLNVVKNFVLLGDCQHSVQFVRYKDEGRQLSLLS 1415

Query: 406  RDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSM 465
            +D+      +  +        +I+G                                SS+
Sbjct: 1416 KDFNRADTAATQF--------LING--------------------------------SSL 1435

Query: 466  GFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISD 521
                 D    + L  Y P    S  G RL+    FH+G+  +   ++R  PSS  D
Sbjct: 1436 HLASCDSAGTLRLLSYAPSHPASWKGQRLVAWGSFHVGEAASCMRRLRLHPSSPED 1491


>gi|443894082|dbj|GAC71432.1| mRNA cleavage and polyadenylation factor II complex, subunit CFT1
            [Pseudozyma antarctica T-34]
          Length = 1543

 Score =  138 bits (347), Expect = 9e-30,   Method: Compositional matrix adjust.
 Identities = 116/394 (29%), Positives = 190/394 (48%), Gaps = 40/394 (10%)

Query: 250  FPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEP 309
            F     E V  L+ VS++   + +G R +IA GT+  + ED T +G + LF++IEVV   
Sbjct: 1141 FEFEANEIVTALELVSLDASSSPTGRRQFIAAGTSTFHGEDRTSKGSVYLFEVIEVVSGK 1200

Query: 310  GQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDND-LTGIAFIDTEV 368
             Q     ++K++   + + PVTAI  + GFL++  GQK+Y+  L+  + L  +AF+D   
Sbjct: 1201 YQLGRDLRLKLVCRDDARAPVTAIAELNGFLLSTCGQKLYVRALEKEEWLISVAFLDGPF 1260

Query: 369  YIASMVSVKNLILVGDYARSIALLRYQPE-YRTLSLVARDYKPTQPNSKGYYAGNPSRGI 427
            Y+ S+  +KN +LV D  +S+ LL +Q E YR + L                     R I
Sbjct: 1261 YMTSLRVLKNFVLVSDAKKSLCLLAFQEEPYRFVDL--------------------GREI 1300

Query: 428  ID-GSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEAR 486
             D  + + +FL  +  +RL +          I    +S G         + L+ Y P   
Sbjct: 1301 NDHNASMAQFLVYN--DRLSLVSTSDVPLGGISGFGASAGV--------IRLYEYAPHVA 1350

Query: 487  ESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPG---ARSRFLTWYASLDGALGFF 543
             + GGHRL+ +++F            R +  S S+  G    RS+ +   A  +GAL   
Sbjct: 1351 TTLGGHRLLLRSEFQTPAAAVGSTVCRGRWLSDSELRGREEGRSKLV--LAKANGALDSL 1408

Query: 544  LPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFL 603
              L +K  +RL +LQ  +V    HT  LNPRAFR  +   +   + ++GI+D  L+ +F+
Sbjct: 1409 SALDDKVAKRLHLLQGQLVRSVQHTAALNPRAFRAVRND-FVPRSLAKGILDARLLDRFV 1467

Query: 604  QLSLGERLEICKKIGSKHNDILDELYDIEALSSH 637
             LS  + LE  + + S   D LD++   +  S+H
Sbjct: 1468 WLSRPKMLEAVRTL-SGLFDGLDQIKKRKRDSNH 1500


>gi|366994686|ref|XP_003677107.1| hypothetical protein NCAS_0F02680 [Naumovozyma castellii CBS 4309]
 gi|342302975|emb|CCC70752.1| hypothetical protein NCAS_0F02680 [Naumovozyma castellii CBS 4309]
          Length = 1340

 Score =  137 bits (345), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 125/561 (22%), Positives = 237/561 (42%), Gaps = 79/561 (14%)

Query: 89   MRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNC------- 141
            M Y  + +GY  +FL G  P  +            M  D     +  F N++        
Sbjct: 832  MHYIPDYSGYSVIFLTGSVPYII------------MREDDSSPKIFRFANLSIVSLAQWG 879

Query: 142  PRGFLYFNAKSELRISVLPTHLSY-DAPWPVRKVPLK------CTPHFLAYHLETKTYCI 194
                +  +     R+  L    SY     P++K+ +        T   + YH +++ + +
Sbjct: 880  KNSVMAVDDIKNARVYSLDNKDSYYGNSLPLKKIKISDSLEDFMTLTKITYHEKSQLFLV 939

Query: 195  VTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVS-QFHVSLFSPFSWEEIPQTNFPLH 253
              +        Y+  GED E++    D   +P   S Q  + L +P +W  I + +F ++
Sbjct: 940  SYAKERE----YEALGEDGEIIVGSNDQ--VPHAKSFQSGILLINPRTWNVIDRVDFEVN 993

Query: 254  EWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPL 313
                +  ++++ ++ +      R YI  G  +  +ED+   G   ++D+ EV+PEPG+P 
Sbjct: 994  SI--ISDMRSMLIQLDSKSRKKREYIVAGITFIGTEDLPSTGAFHIYDLTEVIPEPGKPD 1051

Query: 314  TKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLK-DNDLTGIAFIDTEVYIAS 372
            T  K+K I+ ++ +G V ++C ++G  +    QKI +  ++ DN +  +AF DT ++++ 
Sbjct: 1052 TNFKLKEIFKEDIRGSVNSVCDISGRFLINQSQKIMVRDVQEDNSVVPVAFYDTPIFVSD 1111

Query: 373  MVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSL 432
              S  N +++GD  +    L +  E                         P R I  G  
Sbjct: 1112 AKSFGNFLILGDSMQGFQFLGFDAE-------------------------PYRMIPLGRS 1146

Query: 433  VWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGH 492
            V  F  +S+               + L     + F I+D++  + +  Y P+   +  G 
Sbjct: 1147 VSSFETVSV---------------EFLINAGEINFAITDREDILHVLKYAPDEPNTLSGQ 1191

Query: 493  RLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYR 552
            +L+  + F+L    NT   +  +      +  A  +F      +DG +   +PL E  YR
Sbjct: 1192 KLVHCSSFNLYSS-NTCMLMLPRNDEFETSDKAPPKFQAIGGQVDGGIFKIIPLKEDTYR 1250

Query: 553  RLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLE 612
            RL ++Q  ++      GGLNPR  R      Y   +  R +ID +++ +F +LS+  R  
Sbjct: 1251 RLYVVQQQIIDKEVQLGGLNPRMER-LDNDFYQLTHVMRPMIDFNIIRRFSELSIERRTH 1309

Query: 613  ICKKIGSK-HNDILDELYDIE 632
              +K G + H DI  ++ ++E
Sbjct: 1310 FAQKAGRRAHFDIWRDIINVE 1330


>gi|328864890|gb|EGG13276.1| CPSF domain-containing protein [Dictyostelium fasciculatum]
          Length = 1627

 Score =  137 bits (344), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 91/338 (26%), Positives = 159/338 (47%), Gaps = 58/338 (17%)

Query: 304  EVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAF 363
            ++  E  QP  + ++ ++Y K+QKGPVT+I  + G L+ ++G K+ +       L G+AF
Sbjct: 1325 KIETEELQPQLQKRLNLLYEKDQKGPVTSIAGLNGLLIMSIGPKMIVNNFSSGSLIGLAF 1384

Query: 364  IDTEVYIASMVSVKNLILVGDYARSIALLRYQP---EYRTLSLVARDYKPTQPNSKGYYA 420
             DT+++I S+ +VKN ILVGD  +SI+  + +    + + + L+ +DY+     S  +  
Sbjct: 1385 YDTQIFIVSLNTVKNYILVGDMFKSISFFKLKVCIIQKKNIILLGKDYEEVSTYSSDF-- 1442

Query: 421  GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFM 480
                                                 I+DE   +  ++SD ++N+ +F 
Sbjct: 1443 -------------------------------------IVDE-KKLSMVLSDANRNIRMFS 1464

Query: 481  YQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSS-----ISDAPGARSRFLTWYAS 535
            + P   ES  G  L+ K+ FH+G+  N F +I  K ++      S +     + L +Y +
Sbjct: 1465 FDPSDPESRAGQMLLAKSSFHIGELNNKFVRIPMKNTNYDNNSSSSSIIVNDKHLLFYGT 1524

Query: 536  LDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNP-----S 590
            L G +   +P+  K +  +L      + H   T GLNPR FR     G++  N      +
Sbjct: 1525 LGGGINLLMPI-NKRFHEILHALETKLMHRGQTAGLNPRGFRY----GHHVNNTLGHLHN 1579

Query: 591  RGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEL 628
            + ++DG L+ KF  LS  +  ++   IGS    ILD L
Sbjct: 1580 QYVVDGDLLTKFQSLSPDDAKQLATSIGSTTPIILDLL 1617



 Score =  110 bits (275), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 73/233 (31%), Positives = 119/233 (51%), Gaps = 30/233 (12%)

Query: 92   FSNIAGYQGVFLCG-PHPAWLFLTSRGELRAHPM---------------TIDGPVSTLAP 135
            FSNI   +G+F+ G   P W+F + +   R HPM               +   P++T   
Sbjct: 1023 FSNIGNKRGIFVSGVSTPIWIF-SEKNFPRIHPMKQQQQTTSSSSSSSSSSKRPITTFTT 1081

Query: 136  FHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIV 195
            FHN+NC  GF+YF+    L I  LP   +Y+  WP+RK+ ++ T H ++YH   K Y +V
Sbjct: 1082 FHNINCKHGFIYFDHTGMLCICRLPDGTNYENEWPIRKLAIRMTCHKISYHPVQKCYVLV 1141

Query: 196  TSTAE-PSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPF-SWEEIPQTNFPLH 253
             S  + P +D  +   +++EL+  P        L  ++ + L  P  +W  I   +F L 
Sbjct: 1142 LSYPQAPQSDEDEQEEQERELLKKPL------VLEEKYQLKLIDPANNWNII--DSFSLA 1193

Query: 254  EWEHVLCLKNVSMEY---EGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDII 303
            E E VLC K + + +      +  L+ ++ +GT Y + ED  C+GRIL+F+I+
Sbjct: 1194 EKETVLCSKIIYLRHADESDIIPKLKPFVIVGTAYTHGEDTVCKGRILIFEIV 1246


>gi|358056450|dbj|GAA97624.1| hypothetical protein E5Q_04302 [Mixia osmundae IAM 14324]
          Length = 1305

 Score =  136 bits (342), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 130/546 (23%), Positives = 220/546 (40%), Gaps = 60/546 (10%)

Query: 92   FSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGF-LYFNA 150
            F +  G  GVF+ G  P +L     G  R +      P    + F   + P    L   A
Sbjct: 811  FISTTGRSGVFITGSAPFYLLTDRAGIARLY----RAPYGRASAFGAFDPPSSTPLLVLA 866

Query: 151  KSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNG 210
               +    L    S     PV  V         AYH  + T         P   +  F+ 
Sbjct: 867  DGAMHTYDLSDQASLARELPVTHVATSKCFTSTAYHDSSHTLVAARVVNAP---FELFDD 923

Query: 211  EDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEG 270
            E   +   P +   I P V +  + L  P SW+ I    FP  + E +L L   ++    
Sbjct: 924  EGAPVYRAPSED-MISPTVFRSCLELLVPGSWDCIDGHEFP--QNESILQLICATLPSAT 980

Query: 271  TLSGLRGYIALGTNYNYSEDVTCRGRILLFDI--IEVVPEPGQPLTKN-KIKMIYAKEQK 327
              SG   ++   T  N  ED+  RG + +F I   E      Q   ++ K+ +++A + +
Sbjct: 981  DPSGRARFVIASTCNNRGEDLQTRGGLYVFRISTTESTAASDQAQARSAKLSLVHADDLR 1040

Query: 328  GPVTAICHVAGFLVTAVGQKIYIWQL-KDNDLTGIAFIDTEVYIASMVSVKNLILVGDYA 386
             PV AIC V G ++ ++GQK++I     D  L  + F+D  + +++M S+KNL+++GD  
Sbjct: 1041 HPVGAICEVNGHIIHSLGQKVFIKAFDSDQRLITVGFLDVGLDVSAMRSIKNLLIIGDSL 1100

Query: 387  RSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLE 446
                 + +Q +   L L+ ++ + T                     V+            
Sbjct: 1101 TGTYFVAFQEDPFKLVLLGKEARKTD--------------------VYCV---------- 1130

Query: 447  ICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHV 506
                      D L + + +G +   +   +    Y P   ES  G RL+ +T++HLG+ +
Sbjct: 1131 ----------DFLVQENRLGLLSVSRKGLLRQLEYNPGNAESRAGERLLDRTEYHLGKQI 1180

Query: 507  NTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTS 566
                    + S+  D   +           DG+L +  P+ E  YRRL +L+  +     
Sbjct: 1181 IDSLSFAKRLSTDEDLRQSG----VMLVGADGSLTWVTPVREVVYRRLALLERQLHRQLP 1236

Query: 567  HTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILD 626
            H  GLNPRAFRT +   YY+   +RG++DG L+  +  L    +  +   I S  + +  
Sbjct: 1237 HFAGLNPRAFRTARND-YYSRPLARGMLDGDLLAIYANLHASRQQSLASHINSDPDTLSV 1295

Query: 627  ELYDIE 632
             L ++E
Sbjct: 1296 NLGNLE 1301


>gi|320583269|gb|EFW97484.1| RNA-binding subunit of the mRNA cleavage and polyadenylation factor
            [Ogataea parapolymorpha DL-1]
          Length = 1309

 Score =  135 bits (341), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 139/614 (22%), Positives = 257/614 (41%), Gaps = 65/614 (10%)

Query: 16   IVQELLTVSLGLHGNRPLLLVRTQH--ELLIYQAFRHPKGALKLRFKKLKVLFVSDRSKR 73
            I+++++   LG   +    LV      E+LIY+ F  P   ++  +K +K+  +      
Sbjct: 735  IIKQIMFTKLGNSSSSKDYLVALTFGGEVLIYETFFDP---IERTYKLMKINEMCQFPIV 791

Query: 74   ANEQPGLPRGVRISQMRYF---SNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPV 130
                       +I   RY     N  GY+ V + G   A++ L     +           
Sbjct: 792  GAPDNSYAHATKIE--RYLISVDNFQGYKAVLVTGA-SAFVILKEYNSIPRMLQFTKRSS 848

Query: 131  STLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPL-KCTPHFLAYHLET 189
               A ++   CP G +  +     RI  L +  +Y    P+ K  +   T + + YH  +
Sbjct: 849  LYFAEYNTDRCPNGVISIDETKACRICQLDSSYTYSNRLPIAKYKIGDKTINKIRYHSLS 908

Query: 190  KTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTN 249
             TY I T    P    Y    ED E +   RD R +     +  V L SP +W  I    
Sbjct: 909  NTYIISTLEEGP----YNPVDEDGEPLPGLRDDRKLKSTSLKGTVHLVSPANWTIID--T 962

Query: 250  FPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEP 309
              L + E+V  ++ + ++   T++  +  + +GT    +ED+   G   ++++I++VPEP
Sbjct: 963  IELEDNEYVTSIEVIELKVSETIAT-KTVVLIGTARCRNEDLATHGSWKIYEVIDIVPEP 1021

Query: 310  GQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQL-KDNDLTGIAFIDTEV 368
            G+P  KN++KMI ++  +GPV +IC+V+G      GQ++ +  L KD+++  +AF DT +
Sbjct: 1022 GRPEAKNRLKMITSETARGPVLSICNVSGRFAIVQGQRMLVRTLQKDDNVAPVAFTDTSI 1081

Query: 369  YIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGII 428
            Y   + + KNL+L+GD             ++++SL   D  P                  
Sbjct: 1082 YSKEVKTFKNLVLIGD------------SFQSVSLYGFDAAP------------------ 1111

Query: 429  DGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARES 488
                 ++ L     E+      +  +  D L    ++  +++D+D    L  Y P    S
Sbjct: 1112 -----YRMLHFGKDEQ-----NVELRAADFLVHDGNLHLLVADEDSVFHLLQYDPYDGNS 1161

Query: 489  NGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWY----ASLDGALGFFL 544
              G +L++++             +    S  S            Y    +++DG+    +
Sbjct: 1162 MKGLKLLRRSLLRSNALTTKMISVARDRSLFSMVSTLNHEDDLGYEIIGSNIDGSFYKVM 1221

Query: 545  PLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQ 604
            P+ E  YRRL  +QN +     H  GLNP++     G      +  R  I+ ++  +F+ 
Sbjct: 1222 PVNEYQYRRLYSIQNYLYDKELHWLGLNPKS-NAIGGLTELMPSIKRPFIELNMFHRFIG 1280

Query: 605  LSLGERLEICKKIG 618
             +   + +I +K+G
Sbjct: 1281 FNNDRKKQIMQKLG 1294


>gi|367001853|ref|XP_003685661.1| hypothetical protein TPHA_0E01320 [Tetrapisispora phaffii CBS 4417]
 gi|357523960|emb|CCE63227.1| hypothetical protein TPHA_0E01320 [Tetrapisispora phaffii CBS 4417]
          Length = 1357

 Score =  134 bits (336), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 125/546 (22%), Positives = 227/546 (41%), Gaps = 77/546 (14%)

Query: 89   MRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNV----NCPRG 144
            M Y  N  GY  +F+ G  P  L            +  D  V  +  F N+     CP G
Sbjct: 848  MHYIPNYNGYSSIFITGNDPYIL------------LKEDDSVPRIFKFANIPLVSMCPWG 895

Query: 145  ---FLYFNAKSELRISVLP-THLSYDAPWPVRKVPLKCTPHF------LAYHLETKTYCI 194
                +  +     R+  L   ++ Y    P+ KV L  T         + YH  +  Y +
Sbjct: 896  KTSVMCVDDIKNARVYTLEVNNMYYGNKLPLLKVTLSDTIEDYMTLTKITYHEGSNMYIV 955

Query: 195  VTSTAEPSTDYYKFNGEDKELVTDPRDSRFIP-PLVSQFHVSLFSPFSWEEIPQTNFPLH 253
                A      Y   GED E +    +   +P  + +Q  + L +P +W  I + ++  +
Sbjct: 956  ----AYAKDIEYTAIGEDGERLVGSNEE--LPHSMSTQSGILLINPKTWNVIDRKDYEAN 1009

Query: 254  EWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPL 313
                +  ++ + ++     +  +  I +G +   +ED+   G   +++  EVVP+P +P 
Sbjct: 1010 TI--INDIRTMIIQLNSKTNFKKELIVVGISNVGTEDLPPTGSFYIYNTNEVVPDPSKPD 1067

Query: 314  TKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLK-DNDLTGIAFIDTEVYIAS 372
            T  + K ++ ++ KG +  +C ++G  +    QK+ +  ++ D  +  +AF D  V++A 
Sbjct: 1068 TNYRFKDVFHEQVKGTINNVCEISGRFMVNQSQKLLVRDIQEDESVVPVAFHDVPVFVAD 1127

Query: 373  MVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSL 432
            + S  NL +VGD  +    + +  E                         P R I+ G  
Sbjct: 1128 IKSFGNLFIVGDSMQGFQFVGFDAE-------------------------PYRMIMLGRS 1162

Query: 433  VWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGH 492
            V KF  ++L               D +     + F++SD D  + +  Y P+   S  G 
Sbjct: 1163 VSKFKTMAL---------------DFVVRNGEIYFVVSDTDDILHILKYSPDEPNSLSGQ 1207

Query: 493  RLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYR 552
            RL   + F++     +   +      I +     S F T  A+LDG++   LPL E ++R
Sbjct: 1208 RLAHYSSFNIHSTNTSMHLLPANDEFIENKGNGSSIFQTIGANLDGSIFKILPLSEDSFR 1267

Query: 553  RLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLE 612
            RL ++Q  ++    H  GLNPR  R    + Y   N +R ++D +L+ ++  LS+ +R  
Sbjct: 1268 RLYVIQQQIIDTEVHAAGLNPRMER-LSNEYYQLTNVTRPLLDFNLIRRYSNLSIKKRKS 1326

Query: 613  ICKKIG 618
            I +K G
Sbjct: 1327 IAQKAG 1332


>gi|242208344|ref|XP_002470023.1| predicted protein [Postia placenta Mad-698-R]
 gi|220730923|gb|EED84773.1| predicted protein [Postia placenta Mad-698-R]
          Length = 696

 Score =  134 bits (336), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 100/370 (27%), Positives = 160/370 (43%), Gaps = 63/370 (17%)

Query: 211 EDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEG 270
           ED   V +P       P      + L SP     +    F   + E V CL  V++E   
Sbjct: 373 EDGNTVWEPDAPNISFPNCECLMLELISPEPEGWVTMDGFESAQKEFVTCLDCVTLETTS 432

Query: 271 TLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPV 330
           T SG+  +I +GT  N  ED+  +G + +F I+EVVP+           +    + KGPV
Sbjct: 433 TGSGMMDFIIVGTTINCREDLAVKGAVYIFSIVEVVPD-----------LQCRDDAKGPV 481

Query: 331 TAICHVAGFLVTAVGQKIYIWQLKDND-LTGIAFIDTEVYIASMVSVKNLILVGDYARSI 389
            A+C +   LV+++GQKI++     N+ L G+AF+D  VYI S+ +VKNL+++ D  +S 
Sbjct: 482 AALCGLNNSLVSSMGQKIFVRAFDLNERLVGVAFLDVGVYITSLRAVKNLLVISDAVKS- 540

Query: 390 ALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICK 449
                +  Y+ + L    Y+     +  ++A        DG +                 
Sbjct: 541 -----KDPYKLVILGKDPYQVCVTTADLFFA--------DGQVF---------------- 571

Query: 450 KIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 509
                             +I D+D  + ++ Y P   ES GG  L+++T+FH        
Sbjct: 572 -----------------LLIGDEDGVIRIYEYDPHDPESRGGQHLLRRTEFHGQMESRMS 614

Query: 510 FKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
             I  +    +D P AR        S +G+L  F  + E   +RL +LQ  +  +  H  
Sbjct: 615 ILIIRRRGKDTDIPQAR----LISGSTNGSLSMFTYVDEVASKRLHLLQGQLTRNVQHVV 670

Query: 570 GLNPRAFRTY 579
           GLNP+ FR Y
Sbjct: 671 GLNPKVFRPY 680


>gi|406602601|emb|CCH45811.1| hypothetical protein BN7_5397 [Wickerhamomyces ciferrii]
          Length = 1287

 Score =  133 bits (334), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 131/546 (23%), Positives = 239/546 (43%), Gaps = 70/546 (12%)

Query: 94   NIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVS--TLAPFHNVNCPRGFLYFNAK 151
            NI   + +F+ G  P  ++ T+    +    T    +S   +   ++ +    F+Y +  
Sbjct: 791  NIKNQKFIFVTGKQPYIIWKTNHSIPKIFKFTSKTAISICKIKDSNDKDDDSKFMYIDID 850

Query: 152  SELRISVLPT--HLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFN 209
               RI  LP   + +Y    P+  V L  TP+ + YH ET    IV++  E S   Y   
Sbjct: 851  KTARICSLPIGENFNYSQNLPIEIVSLGQTPNKVTYH-ETSGLFIVSTFEEIS---YNAI 906

Query: 210  GEDKELVTDPRDSRFIPPLVSQFH--VSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSME 267
             ED   +      +   P    F   + L +P +W  I +     +E  +   ++++++ 
Sbjct: 907  DEDGVPIVGSESEK---PKAKNFKGFLKLINPINWTIIDEIEMEENEIIN--DVRSINLT 961

Query: 268  YEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQK 327
                    + +I  G      ED++  G   + DII +VP+P +P    K K I+ +  K
Sbjct: 962  ISSRSKKKKEFIIFGIGKYRLEDLSVFGEFKIIDIISIVPDPTKPEAIYKFKEIFQEVVK 1021

Query: 328  GPVTAICHVAGFLVTAVGQKIYIWQL-KDNDLTGIAFIDTEVYIASMVSVKNLILVGDYA 386
            G VT I  ++G  +T+ GQKI I  L +DN    +AF+D   Y++   S  NL+L+ D  
Sbjct: 1022 GAVTTINEISGRFLTSQGQKIIIRDLQQDNSTVPVAFMDCATYLSDSKSFGNLLLISDSM 1081

Query: 387  RSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLE 446
            +SI  L +  E   L L+ +D        +  +    +  I+D   ++            
Sbjct: 1082 KSIWFLGFDAEPYRLLLLGKD--------QQRFNAITTDFIVDDGEIY------------ 1121

Query: 447  ICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHV 506
                                F+++D ++++ L  YQP+  +S  G +L++K+ F     +
Sbjct: 1122 --------------------FLVADDEESLHLLTYQPDDPKSLSGQKLLQKSTFTTN-SI 1160

Query: 507  NTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTS 566
             T  K+  K +      G+ + +     ++DG++   +P+ E +YRRL +LQ  +    +
Sbjct: 1161 TTCLKLVPKFNEFD--QGSITSYQNIGVNVDGSIFKMIPIDEISYRRLYILQQQLSDKIA 1218

Query: 567  HTGGLNPRAFRTYKGKGYYAGNP--SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 624
            H  GLNPR+ R       ++ N    + II+  L+  F+ L++ +R +   K+G   ND 
Sbjct: 1219 HYVGLNPRSNR-------FSANEQGQKPIIEFGLLKWFINLNVDKRKQFSAKVG--RNDY 1269

Query: 625  LDELYD 630
            L+   D
Sbjct: 1270 LELFKD 1275


>gi|50305395|ref|XP_452657.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
 gi|74606921|sp|Q6CTT2.1|CFT1_KLULA RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two protein
            1
 gi|49641790|emb|CAH01508.1| KLLA0C10274p [Kluyveromyces lactis]
          Length = 1300

 Score =  132 bits (333), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 132/568 (23%), Positives = 242/568 (42%), Gaps = 79/568 (13%)

Query: 81   PRGV-RISQM-RYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPF-- 136
            P+GV +I ++  YF N  GY  VF+ G  P  +        R   MT + P+ T+A +  
Sbjct: 786  PQGVNKIERVAHYFPNYNGYSVVFITGQVPYIIIKEDNSVCRIFRMT-NIPIVTMARWGK 844

Query: 137  HNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLK-CTPHF-----LAYHLETK 190
            ++V C       N K+  R+  L     Y     +RK+ ++     F     +AYH  T 
Sbjct: 845  NSVMCVD-----NIKNA-RVMKLDPECYYGNTQILRKIIIEDVVEEFETLGNIAYHERTG 898

Query: 191  TYCIVTSTAEPSTDYYKFNGEDKELV----TDPRDSRFIPPLVSQFHVSLFSPFSWEEIP 246
             Y I   +     +Y   + + + LV    + P  + +   L+      L +P +W  I 
Sbjct: 899  MYII---SYTKFIEYQALSEDGEPLVGYDPSKPNSTGYKSGLL------LINPLTWNIID 949

Query: 247  QTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVV 306
            + +  L E   V  +K + ++        R  + +G+++   ED    G +L+ DI EVV
Sbjct: 950  RLD--LSENSMVNDIKTMLIQLNSKTRRKRELVIIGSSFVKEEDQPSTGCLLVLDITEVV 1007

Query: 307  PEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLK-DNDLTGIAFID 365
             EPG+P +  K K ++ +E +G V A+C ++G  +     K  +  ++ DN    +AF+D
Sbjct: 1008 AEPGKPDSNFKFKQLFEEEIRGSVNAVCEISGRFMIGQSSKALVRDMQEDNSAVPVAFLD 1067

Query: 366  TEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSR 425
              V+I    S  NL+++GD  +    + +  E                         P R
Sbjct: 1068 MPVFITDAKSFSNLMIIGDSMQGFTFVGFDAE-------------------------PYR 1102

Query: 426  GIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEA 485
             I+ G    KF  ++L               + L    ++ F+++D+  ++ +  Y P+ 
Sbjct: 1103 MIVLGKSTSKFQVMNL---------------EFLVNNGNINFIVTDRQNHLHVLRYAPDE 1147

Query: 486  RESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLP 545
              S  G RL+    F++    N + K+  K           S ++      DG++   +P
Sbjct: 1148 ANSLSGQRLVHCNSFNMFT-TNNYMKLVRKHVEFG---SKTSNYIALGCQTDGSIFRMIP 1203

Query: 546  LPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQL 605
            L E +YRR  ++Q  ++ H     G N +  R    + Y+ G+  R  +D  ++ K++ L
Sbjct: 1204 LNEASYRRFYLVQQQLLDHEIPLAGFNTKMER-LDNEYYHKGHSLRPTLDSQVLKKYIHL 1262

Query: 606  SLGERLEICKKIGS-KHNDILDELYDIE 632
             + +R  I  ++G     ++  +L DIE
Sbjct: 1263 PITKRTTIENRVGRHASTELWHDLIDIE 1290


>gi|367014525|ref|XP_003681762.1| hypothetical protein TDEL_0E03080 [Torulaspora delbrueckii]
 gi|359749423|emb|CCE92551.1| hypothetical protein TDEL_0E03080 [Torulaspora delbrueckii]
          Length = 1327

 Score =  132 bits (332), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 101/404 (25%), Positives = 179/404 (44%), Gaps = 46/404 (11%)

Query: 231  QFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSED 290
            Q  + L +P +W  I +   P +    +   K++ ++ +      + Y+ +G     +ED
Sbjct: 960  QSGILLVNPKTWNIIDKKELPANTL--INDAKSMLIQLDSRTRRKKEYVIVGVAVVGTED 1017

Query: 291  VTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYI 350
            +   G   +FDI EVVPEPG+P T  K+  ++ +E +G V+ +C ++G  +    QK+ +
Sbjct: 1018 LPPSGSFFVFDITEVVPEPGKPDTNFKLSEVFQEEIRGTVSTVCEISGRFLINQSQKVLV 1077

Query: 351  WQLK-DNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYK 409
              ++ DN +  +AF+D  V++    S  N +++GD  +    + +  E            
Sbjct: 1078 RDVQDDNSVVPVAFLDIPVFVTDAKSFGNFMIIGDAMQGFQFVGFDAE------------ 1125

Query: 410  PTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMI 469
                         P R I  G  + K   +S+               + L     + F I
Sbjct: 1126 -------------PYRMIPLGRSIAKMETVSV---------------EFLVNGGDIFFAI 1157

Query: 470  SDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRF 529
            +D D  + +F Y P+   S  G RL+  T F+L    NT   +  K      A      F
Sbjct: 1158 TDTDDILHVFKYAPDEPNSLSGQRLLHCTSFNL-HSTNTCMALLPKNEEFEPAQANMKNF 1216

Query: 530  LTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNP 589
                  +DG++   LPL E  YRRL ++Q  +       GGLNPR  R    + Y   + 
Sbjct: 1217 QAIGGQVDGSVFKLLPLREDVYRRLYVVQQQITEKELQLGGLNPRMER-LSNEHYKTTHV 1275

Query: 590  SRGIIDGSLVWKFLQLSLGERLEICKKIGSK-HNDILDELYDIE 632
             R ++D +++ +F +LS   R +I +K+G + H +I  +L ++E
Sbjct: 1276 LRPMLDFNVIQRFKRLSTDRRKQISQKVGKRAHFEIWRDLINVE 1319


>gi|365984967|ref|XP_003669316.1| hypothetical protein NDAI_0C04130 [Naumovozyma dairenensis CBS 421]
 gi|343768084|emb|CCD24073.1| hypothetical protein NDAI_0C04130 [Naumovozyma dairenensis CBS 421]
          Length = 1388

 Score =  131 bits (330), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 130/566 (22%), Positives = 236/566 (41%), Gaps = 89/566 (15%)

Query: 89   MRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNC------- 141
            M Y S+  GY  +F+ G  P  L               D  V  +  F N++        
Sbjct: 880  MHYISDYNGYSVIFITGSVPYMLIRE------------DDSVPRIFQFANLSIVSMARWG 927

Query: 142  PRGFLYFNAKSELRISVLP-THLSYDAPWPVRKVPLK------CTPHFLAYHLETKTYCI 194
                +  +     RI  L   ++ Y     +RK+ +        T   + YH +T+ + +
Sbjct: 928  KNSIMCVDNLKNARIYGLDHANIYYGNKLSIRKIKISDSLEDYMTLTKITYHEKTQMFLV 987

Query: 195  VTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVS-QFHVSLFSPFSWEEIPQTNFPLH 253
               +    T+Y     +D+ +V    D   +P   S Q  V L +P +W  I    F  +
Sbjct: 988  ---SYAKETEYDALGEDDERIVGYDED---VPHAKSFQSGVLLINPLTWNVIDSKTFGKN 1041

Query: 254  EWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPL 313
                V  ++++ ++        R YI  G  +  +ED+   G   ++DI EVVPEPG+P 
Sbjct: 1042 TL--VNDMRSMLIQVNSKARRKREYIIAGVTHIGTEDLPPTGAFHIYDITEVVPEPGKPD 1099

Query: 314  TKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLK-DNDLTGIAFIDTEVYIAS 372
            T  ++K ++ +E +G V+ +C ++G  +    QK+ +   + DN +  +AF+D  V+I  
Sbjct: 1100 TNYRLKEVFKEEVRGIVSTVCEISGRFLVNQSQKVMVRDAQEDNSVVPVAFLDIPVFIND 1159

Query: 373  MVSVKNLILVGDYARSIALLRYQPE-YRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGS 431
              S  + +++GD  + +  + +  E YR ++L                          G 
Sbjct: 1160 AKSFGDFLILGDAMQGLHFIGFDAEPYRMINL--------------------------GK 1193

Query: 432  LVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMG----FMISDKDKNVVLFMYQPEARE 487
             V KF  +S+                   EF   G    F ++D++  + +  Y P+   
Sbjct: 1194 SVTKFETVSV-------------------EFVVNGGDLYFALTDRNNILHVLKYAPDELN 1234

Query: 488  SNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLP 547
            S  G +L+  + F+L    N+   +  K     D   A   F T    +DG++   +PL 
Sbjct: 1235 SLSGQKLVHCSSFNLFSG-NSSLLLLPKNEEFEDTKNAPLTFQTIGGQVDGSIFKVIPLR 1293

Query: 548  EKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSL 607
            E  YRRL ++Q  M       GGLNPR  R    + Y   +  R ++D +++ +F +L +
Sbjct: 1294 EDTYRRLYVIQQHMNDKEPQLGGLNPRMER-LSNEYYQLCHVMRPMLDFNIIRRFSELPI 1352

Query: 608  GERLEICKKIGSK-HNDILDELYDIE 632
              R  + K+ G + H +I  ++ ++E
Sbjct: 1353 DRRTRVAKRAGQRAHYEIWRDMINVE 1378


>gi|313215162|emb|CBY42850.1| unnamed protein product [Oikopleura dioica]
          Length = 228

 Score =  130 bits (328), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 80/273 (29%), Positives = 136/273 (49%), Gaps = 59/273 (21%)

Query: 377 KNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKF 436
           KN  LVGD  + I LLR+Q E   +S ++R  +  +  + G         ++DG+ V   
Sbjct: 4   KNYALVGDIQQGITLLRHQGERNCISQISRARRAGEVTAVGI--------LLDGNQV--- 52

Query: 437 LQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIK 496
                                        G + +D  +N+ ++MY+P+ +ESNGG +L++
Sbjct: 53  -----------------------------GLVSTDMQRNLQVYMYKPDQKESNGGKQLVR 83

Query: 497 KTDFHLGQHV-----------NTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLP 545
           + D +LG+ V           +TF K+    +         +R +T+YA LDG++G  +P
Sbjct: 84  QADINLGKRVISIWNSLGRQNDTFTKVALTEND--------ARHVTFYAGLDGSIGDIVP 135

Query: 546 LPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQL 605
           + EK +RRL MLQ ++ +H  H GGLNPR +R    +     N ++ IIDG L+ +F  L
Sbjct: 136 VSEKVFRRLEMLQTLVQSHLPHYGGLNPREYRYCTNEYRDLENAAKNIIDGDLLERFNGL 195

Query: 606 SLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
           S  E+ ++ +KIG     +LD++ D++   + F
Sbjct: 196 SFTEQTDLSRKIGVTREALLDDMMDVQRTKNLF 228


>gi|156847699|ref|XP_001646733.1| hypothetical protein Kpol_1023p44 [Vanderwaltozyma polyspora DSM
            70294]
 gi|156117413|gb|EDO18875.1| hypothetical protein Kpol_1023p44 [Vanderwaltozyma polyspora DSM
            70294]
          Length = 1337

 Score =  129 bits (324), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 97/404 (24%), Positives = 185/404 (45%), Gaps = 46/404 (11%)

Query: 231  QFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSED 290
            Q  + L +P +W  I +  F  +    +  +++++++        R  + +G     +ED
Sbjct: 968  QAGILLVNPKTWNVIDKIEFERNSL--INDMRSMTIQVNSKTKKKRELLVVGVASIGTED 1025

Query: 291  VTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYI 350
            +   G   + DI EVVPEPG+P T  K K I+ +  +G V ++C ++G  +    QK+ +
Sbjct: 1026 LPSAGSFHVIDINEVVPEPGKPDTNYKFKEIFQETVRGNVNSVCEISGRFMINQSQKLLV 1085

Query: 351  WQLK-DNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYK 409
              ++ D  +  +AF+D  VY+    S  NL++VGD  +    + +  E            
Sbjct: 1086 RDIQEDESVVPVAFLDVPVYVTDTKSFSNLMIVGDSMQGFQFVGFDAE------------ 1133

Query: 410  PTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMI 469
                         P R I  G  V KF  ++L               + L     + F++
Sbjct: 1134 -------------PYRMIPLGRSVSKFKTVAL---------------EFLVNNGDIFFIV 1165

Query: 470  SDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRF 529
            SD++  + +  Y P+   S  G RL   + F++    NT   +    +    +P  ++ F
Sbjct: 1166 SDRNDILHVLKYAPDEPNSLSGQRLAHYSSFNI-HSTNTSMILLPSNNEFQSSPNGQATF 1224

Query: 530  LTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNP 589
             +  + +DG++   +PL E ++RRL ++Q  ++      GGLNPR  R    + Y   + 
Sbjct: 1225 QSVGSCVDGSIFKVIPLDEDSFRRLYVIQQQVIDTEIQAGGLNPRMER-LSNEYYQLVHL 1283

Query: 590  SRGIIDGSLVWKFLQLSLGERLEICKKIGSK-HNDILDELYDIE 632
             R ++D +++ +F  LS+ +R +I +K G + H D+  ++ +IE
Sbjct: 1284 MRPMLDFNIIRRFSNLSITKRTKIAQKAGRRAHFDVWRDMINIE 1327


>gi|149512998|ref|XP_001514888.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1-like, partial [Ornithorhynchus anatinus]
          Length = 831

 Score =  129 bits (323), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 65/182 (35%), Positives = 100/182 (54%), Gaps = 44/182 (24%)

Query: 351 WQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKP 410
           W ++D++LT I FID ++YI  ++SVKN IL  D  +SI+LLRYQ E +TLSLV+RD KP
Sbjct: 694 WAIRDSELTSITFIDMQLYIHQIISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKP 753

Query: 411 TQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMIS 470
            +  S  +   N                                        + +GF++S
Sbjct: 754 LEVYSVDFMVDN----------------------------------------AQLGFLVS 773

Query: 471 DKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFL 530
           D+D+N++++MY PEA+ES GG RL+++ DFH+G HVN F++  C+ +    A G   + +
Sbjct: 774 DRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNAFWRTPCRGA----AEGPSKKSI 829

Query: 531 TW 532
            W
Sbjct: 830 VW 831


>gi|342186481|emb|CCC95967.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 1456

 Score =  128 bits (321), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 166/696 (23%), Positives = 258/696 (37%), Gaps = 133/696 (19%)

Query: 6    SHSPSAMDETIVQ----ELLTVSLG--LHGN-----RPLLLVRTQHELLIYQAFRHPKGA 54
            +  PSA  ETI      E+L +S G    G        L +V +  EL +Y   + P   
Sbjct: 819  TKEPSAATETIPHVTHVEVLKLSEGPATEGTDTVVATALAVVLSSGELAVYHVMK-PDTF 877

Query: 55   LKLRFKKLKVLFVSDRSKRANEQPGLPRGVRISQMRY------------------FSNIA 96
              LRF K    F+  R+ R   +    R  R+ + R                   F+ +A
Sbjct: 878  GSLRFIKAVHHFLDTRAVREVIESIEARKCRLQRERTMIENDTQSTRHCARRIIPFACVA 937

Query: 97   GYQGVFLCGPHPAWLFLTSRGE-LRAHPMTIDGPVSTLAPFHN-----VNCPRGFLYFNA 150
            G  G ++CG HP +L    R   + A+     G V     F       + C  GF+ F  
Sbjct: 938  GQSGAYVCGQHPVFLLWDKRKRRIAAYRHQSPGAVRGFVSFPQMAGGFIYCCEGFVDFAR 997

Query: 151  KSELRISVLPTHLSYDAP----WPVRKVPLKCTPHFLAYHLETKTYCIVTS---TAEPST 203
             +           +Y AP    W  R++ +  TPHFL Y    K+  +VTS   T  P  
Sbjct: 998  MN-----------TYCAPNGQGWLTRRIAIGATPHFLVYDPPGKSCFVVTSEKKTFRPQR 1046

Query: 204  DYYKFNGE---DKELVT------DPRDSRFIP---------PLVSQFHVSLFSPFSWEEI 245
             ++    +   D+EL T      +P      P         P+V QF V L S    +  
Sbjct: 1047 AFFDVQLKIHYDEELNTVQSVTAEPPVCHMPPINPGAGVRVPMVEQFEVRLLSTTGEQWE 1106

Query: 246  PQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRG--YIALGTNYNYSEDVTCRGRILLFDII 303
                F L E E VL  + V +  +  ++G        L T +   EDVTCRGRI+L    
Sbjct: 1107 CTHKFALEENEKVLGAQAVELRQDEAIAGAPSAPVCVLCTAFPLGEDVTCRGRIILLASK 1166

Query: 304  EVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKI----YIWQLKDNDLT 359
             V         K  I  ++++   GP TA+  +   +  AVG  I    Y W+ K   L 
Sbjct: 1167 TV-------KKKRAIVQLHSEPLNGPATAVTGICSQIAVAVGGTIKIFRYDWETKK--LV 1217

Query: 360  GIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYY 419
              AF+   VY   + + +N I+ GD  RS A+ R+  +  TL+++ +D+           
Sbjct: 1218 VSAFLYAGVYATRLSAFRNYIIYGDLCRSCAMARFNEQNHTLTVLGKDH----------- 1266

Query: 420  AGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLF 479
                                           +   H D++    + G + S+  ++++L 
Sbjct: 1267 -----------------------------NAVSVVHCDMMYHDRTFGILCSNDQRDLLLM 1297

Query: 480  MYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGA 539
             Y P  +ES G H   +  +              C   S+     A +  +T Y S  G 
Sbjct: 1298 GYTPRVQES-GEHTPSRVLESPFSLDGEYRLPSGCLAKSLRFRSAAGNSSVTVYISNYGE 1356

Query: 540  LGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTY-KGKGYYAGNPSRGIIDGSL 598
            +GF +PL E+  R  L +   +        GL PR F +  +G           ++   L
Sbjct: 1357 VGFIVPLGEQANRTALWITRRLQVDLPCDAGLTPRMFLSLSQGTPRTTLRGKEMLVSAPL 1416

Query: 599  VWKFLQLSLGERLEICKKIGSKHNDILDELYDIEAL 634
            V     L +  R    K I       LD + ++ AL
Sbjct: 1417 VQGLFFLDVHSR----KAIARAAYTQLDRVINVAAL 1448


>gi|164655043|ref|XP_001728653.1| hypothetical protein MGL_4214 [Malassezia globosa CBS 7966]
 gi|159102535|gb|EDP41439.1| hypothetical protein MGL_4214 [Malassezia globosa CBS 7966]
          Length = 1212

 Score =  128 bits (321), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 125/483 (25%), Positives = 215/483 (44%), Gaps = 59/483 (12%)

Query: 162  HLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRD 221
             L++DA  P  +     T   +A H E    C V S+ +P T +  +N E++ +    RD
Sbjct: 783  ELAFDASVPYLRWTTGRTYTHVAVHEELA--CFVASSEQP-TQFVLYNDEEQPV----RD 835

Query: 222  SRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIAL 281
             +  P        +L       E P   +     E V  L    M+     SG R ++ +
Sbjct: 836  PKQDPTRTYAACGALELLVRVGEPPVHGYEFSACETVSALHMAPMDCLDRGSGRRTFVVV 895

Query: 282  GTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKN-KIKMIYAKEQKGPVTAICHVAGFL 340
            GT   Y ED + +G + +FD++E VP  G   +   +++++  +E + PVTA+  + GFL
Sbjct: 896  GTTVTYGEDRSSKGHMYVFDVVECVPSEGMAASDALRLQLLCTEEMRAPVTALHDLNGFL 955

Query: 341  VTAVGQKIYI--WQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEY 398
            V AVGQK+ I  W+  +  L  +AF+D  +Y  S+  VKN +L+ DY +S   + +Q + 
Sbjct: 956  VAAVGQKLLIRSWEYCEW-LVTVAFLDMGMYTTSIQRVKNFLLLTDYYQSAYFVAFQEDP 1014

Query: 399  RTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 458
              L L+ RDY PT      +        +ID +            RL I           
Sbjct: 1015 ARLVLLGRDYIPTSVTCGAF--------LIDRA------------RLSI----------- 1043

Query: 459  LDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHL-GQHVNTFFKIRCKPS 517
                     +  D +  + L  Y P    S GG RL+ + ++H  G+ V    ++   P 
Sbjct: 1044 ---------VTCDMNGCLRLMDYHPSNPTSLGGQRLLARCEYHAPGEVVRA--RMLHGPY 1092

Query: 518  SISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFR 577
              +      S  +   A  +GA+   +P+ EK +  L + Q+ +V    HT GLNPR FR
Sbjct: 1093 LATSGECLTSEIV--LAKRNGAVDVLVPVTEKIFPTLQLFQSQLVRMVRHTAGLNPRGFR 1150

Query: 578  TYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDIL--DELYDIEALS 635
                + + +   ++GI+DG+L+     +S  +   + + + ++   ++  D L  +  L 
Sbjct: 1151 AVFNQ-HISRPLAKGILDGTLLHTAESMSRPKLTSLVRDLSTRTGGVIADDLLRCLVHLQ 1209

Query: 636  SHF 638
            SH+
Sbjct: 1210 SHW 1212


>gi|302831157|ref|XP_002947144.1| hypothetical protein VOLCADRAFT_87503 [Volvox carteri f. nagariensis]
 gi|300267551|gb|EFJ51734.1| hypothetical protein VOLCADRAFT_87503 [Volvox carteri f. nagariensis]
          Length = 2830

 Score =  127 bits (320), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 131/555 (23%), Positives = 210/555 (37%), Gaps = 130/555 (23%)

Query: 98   YQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFL-YFNAKSELRI 156
            Y GVF+ G  P WL + SRG L  HPM  +G V+ + PFHN NCP GF+   +++  L++
Sbjct: 2259 YSGVFVAGSRPLWL-VASRGGLVPHPMFAEGAVAAMTPFHNANCPLGFISACSSRGLLKV 2317

Query: 157  SVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTA---------EPSTDYYK 207
              LP H   D PW  R+VPL+ TPH LA+  +      +TS           EP  D + 
Sbjct: 2318 CQLPPHTRLDTPWVTRRVPLRVTPHKLAWFRDAGLMAAITSRVVVSRPRPPEEPGGDAHA 2377

Query: 208  FNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSME 267
                           R        + + L  P     +  +   L   E  LCLK + ++
Sbjct: 2378 AAAYAAAAAAAAGRGR-----EEAWELRLLEPNGCGRLWLSPL-LPPGEQALCLKVIYLQ 2431

Query: 268  YEGTLSGLRGYIALGT----------NYNY----------------------SEDVTCRG 295
               T       +A+GT          N+ +                           CRG
Sbjct: 2432 -NATTGDTDALLAVGTGSPMGQLGGGNWRFRLPRGRVAGSGGLVVHRQCEREGAGRGCRG 2490

Query: 296  ------------RILLFDI-IEVVPEPGQPLTKN-KIKMIYAKEQKGPVTAICHVAGFLV 341
                        RILL+ I  EVV   G  LT+     ++  ++    VT++      L+
Sbjct: 2491 ERPPGEDYPCLGRILLYTISAEVVDLGGGNLTRRWSAVLVATRDMASAVTSVQEFKSQLL 2550

Query: 342  TAVGQKIYIWQLKDND--------------LTGIAFIDTEVYIASMVSVKNLILVGDYAR 387
               G +I +++ +                 L   AF D    + S+V+VK+ +L  D ++
Sbjct: 2551 VTCGSRIEMYEWRGPAAGASGGGGGGPGGRLEKRAFFDLPSLVTSLVAVKDYLLAADASQ 2610

Query: 388  SIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEI 447
             +  +RY    R L  +++D+                R ++   +V    +L+       
Sbjct: 2611 GLYFVRYSDSARVLEFMSKDFD--------------HRDVLTAGVVINEPKLA------- 2649

Query: 448  CKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESN----GGHRLIKKTDFHLG 503
                               F+ +D   N+ L  +   +R +N     G RL      H+ 
Sbjct: 2650 -------------------FLAADAAGNLALSEFY-GSRNTNPEFWAGQRLAPLGLMHVA 2689

Query: 504  QHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNY-RRLLMLQNVMV 562
            + ++    I+   S        ++R      + +G L +  P+P+    +RLL LQN M 
Sbjct: 2690 RRLSCCVSIKMPTSD------GKNRHALLCGAAEGGLSYIAPVPDAEMTQRLLALQNHMS 2743

Query: 563  THTSHTGGLNPRAFR 577
                H  GLNPRAFR
Sbjct: 2744 RRLPHVAGLNPRAFR 2758


>gi|119580419|gb|EAW60015.1| hCG2010549, isoform CRA_a [Homo sapiens]
          Length = 323

 Score =  127 bits (319), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 76/182 (41%), Positives = 99/182 (54%), Gaps = 40/182 (21%)

Query: 13  DETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRH----PKGALKLRFKKLKVLFVS 68
           +E + QE L +S     + P LLV    +LLIY+AF H     +G LK+ FKK+      
Sbjct: 110 EEAMCQEELPLSSRSRQSTPYLLVHVDQKLLIYKAFPHDSRLSQGNLKVHFKKV------ 163

Query: 69  DRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDG 128
                                    NI+  +         A       G LR HP+ I+G
Sbjct: 164 -----------------------LHNISFREKKPKPSKKKA-------GVLRLHPVGING 193

Query: 129 PVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLE 188
           PV++ A FHNVNCPRGFLYFN + +LRISVLP +LSYD+PWPVRK+PL CT H +AYH+E
Sbjct: 194 PVNSFALFHNVNCPRGFLYFNRQGKLRISVLPAYLSYDSPWPVRKIPLCCTVHCVAYHVE 253

Query: 189 TK 190
           +K
Sbjct: 254 SK 255



 Score = 43.1 bits (100), Expect = 0.49,   Method: Compositional matrix adjust.
 Identities = 26/68 (38%), Positives = 40/68 (58%), Gaps = 8/68 (11%)

Query: 326 QKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAF-IDTEVYIASMVSVKNLILVGD 384
           +K P+    H   + V +   KI    L+ ++LTG+AF +D ++YI  M+SV+N IL  D
Sbjct: 237 RKIPLCCTVHCVAYHVES---KI----LQASELTGMAFMVDRQLYIHQMISVRNFILAAD 289

Query: 385 YARSIALL 392
             +SI LL
Sbjct: 290 LMKSIWLL 297


>gi|414587798|tpg|DAA38369.1| TPA: hypothetical protein ZEAMMB73_163106, partial [Zea mays]
          Length = 483

 Score =  127 bits (318), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 77/225 (34%), Positives = 118/225 (52%), Gaps = 17/225 (7%)

Query: 88  QMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLY 147
           ++  F+N+ GY+G+FL GP P W+F+  R   R HP   DGP+      HNVNC RG +Y
Sbjct: 257 RITIFNNVGGYEGLFLGGPRPTWVFVC-RQRFRVHPQLCDGPIVAFTVLHNVNCCRGLIY 315

Query: 148 FNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPS-TDYY 206
             ++  L+I  LP+  +YD  WPV+KVPL  TPH + Y+ E   Y ++ S  +    +  
Sbjct: 316 VTSQGFLKICQLPSAYNYDNYWPVQKVPLHGTPHQVTYYGEQSLYPLIVSVPQVRPLNQV 375

Query: 207 KFNGEDKEL-------VTDPRDSRFIPPLVSQFHVSLF----SPFSWEEIPQTNFPLHEW 255
             +  D+EL       VT   D + +   V +F V +     S   WE   ++  P+  +
Sbjct: 376 LSSMADQELGLHMENDVTSGGDLQEV-YTVDEFEVRIMELGKSNGRWET--RSTIPMQSF 432

Query: 256 EHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLF 300
           E+ L ++ V+++   T       +A+GT Y   EDV  RGR+LLF
Sbjct: 433 ENALTVRIVTLQNTSTKEN-ETLMAIGTAYVQGEDVAARGRVLLF 476


>gi|410079681|ref|XP_003957421.1| hypothetical protein KAFR_0E01320 [Kazachstania africana CBS 2517]
 gi|372464007|emb|CCF58286.1| hypothetical protein KAFR_0E01320 [Kazachstania africana CBS 2517]
          Length = 1350

 Score =  125 bits (314), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 112/478 (23%), Positives = 206/478 (43%), Gaps = 61/478 (12%)

Query: 165  YDAPWPVRKVPLKC------TPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTD 218
            Y   +P++ + +        T + + YH +T+TY I  +      DY     ED EL+  
Sbjct: 912  YGNKFPLKSIKINTELEDYMTFNKITYHEKTQTYVIAYN---KEIDYVA-KAEDGELLVG 967

Query: 219  PRDSRFIPPLVSQFH--VSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLR 276
             + +    P    F   + L +P SW  I +  F   E   V  ++++ ++ +      R
Sbjct: 968  YKQN---VPHAKGFQSGLLLINPKSWNVIDKVEF--EENSLVNDIRSMIIQIDSRTKRKR 1022

Query: 277  GYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
             YI  G +   +ED+   G   L+DI  VVPEPG+P T  K +  + +E +G VT++C +
Sbjct: 1023 EYIVAGFSAVGTEDLPPSGSFHLYDITAVVPEPGKPDTNYKFERFFKEEVRGSVTSVCEI 1082

Query: 337  AGFLVTAVGQKIYIWQLK-DNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQ 395
            +G    +  QKI +   + D  +  +AF+D  +++  M S  NL+++ D       + + 
Sbjct: 1083 SGRFAISQSQKIMVRDAQEDGSVVPVAFLDIPIFVTDMKSFGNLMIISDAMHGFQFVGFD 1142

Query: 396  PEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKH 455
             E                         P R I  G  V KF  +S+              
Sbjct: 1143 AE-------------------------PYRMIQLGKSVSKFKTMSV-------------- 1163

Query: 456  NDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCK 515
             + L     + F ++D+D  + +  Y P+   S  G +L+  + F+L    N+   +  K
Sbjct: 1164 -EFLVNNGDIYFAVTDRDNILHVLKYAPDEPNSFSGQKLVHCSSFNLYAD-NSCMVLLAK 1221

Query: 516  PSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRA 575
                +        +       DG++   +PL E++YRRL ++Q  ++   +  GGLNPR 
Sbjct: 1222 NDEFNKVDDTNRTYQVVGGQTDGSMFKIVPLSEESYRRLYVIQQQIIDKETQLGGLNPRM 1281

Query: 576  FRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK-HNDILDELYDIE 632
             R    +     +  R ++D +++ KF  + + +R  + +K+G   H +   +L +IE
Sbjct: 1282 ER-LSNQYLPLCHVMRPMLDFNVIRKFSAMPISKRQALAQKLGRNVHFEAWRDLINIE 1338


>gi|261335516|emb|CBH18510.1| cleavage and polyadenylation specificity factor-like protein,
            putative [Trypanosoma brucei gambiense DAL972]
          Length = 1452

 Score =  125 bits (314), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 153/643 (23%), Positives = 247/643 (38%), Gaps = 113/643 (17%)

Query: 32   PLLLVRTQHELLIYQAFRHPKGALKLRFKKLKVLFVSDRSKRANEQPGLPRGVRISQMRY 91
            PL L++  H  L  +A R    +++ +  +L+    S+R+   N+     + VR    R 
Sbjct: 875  PLRLIKKFHHFLDTKAVREVIESIEAKKMRLQ----SERTMIENDT----QSVRHCSRRI 926

Query: 92   --FSNIAGYQGVFLCGPHPAWLFLTSRG-ELRAHPMTIDGPVSTLAPFHNVN-----CPR 143
              F+ +AG  G ++CG HP +L   +R  +L A+     GPV    PF +++     C  
Sbjct: 927  IPFAAVAGQSGAYVCGQHPLFLMWDNRTRQLVAYRHQAPGPVRGFVPFTSMSGGFIYCCE 986

Query: 144  GFLYFNAKSELRISVLPTHLSYDAP-WPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEP- 201
            GF+ F        +V+ T+ S     W  R++ +  TPHF+ Y    ++  +VTS   P 
Sbjct: 987  GFVDF--------AVMNTYCSPGGNGWLRRRIHIGATPHFIVYDPPGRSCFVVTSKKVPF 1038

Query: 202  -----STDY-----YKFNGEDKELVTDPRDSRFIP----------PLVSQFHVSLFSPFS 241
                 S D      Y  +    + VT       +P          PL  +F V L S F 
Sbjct: 1039 RPQRASFDVQLKIQYDEDSNTVQSVTTEAPVCNMPAIKPGTGVRVPLTERFEVRLHSTFK 1098

Query: 242  WEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSG--LRGYIALGTNYNYSEDVTCRGRILL 299
                      L E E VL  + V +  +    G        + T +   EDVTCRGRI+L
Sbjct: 1099 KGWDCTDKLMLDENEKVLGAQMVEIHQDANADGSATAPVCVVCTAFPLGEDVTCRGRIIL 1158

Query: 300  FDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKI----YIWQLKD 355
                 +         +  I  ++++   GP TA+  +   +  AVG  I    Y W+ K 
Sbjct: 1159 LASRNIK-------GRRSIVQLHSEPLNGPATAVAGICSQIAVAVGGTIKIFRYDWETKK 1211

Query: 356  NDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNS 415
              L   AF+   +Y   +   +N I+ GD  RS ++ R+  E  TL+++ RD        
Sbjct: 1212 --LVVSAFLYAGMYATRLSVFRNYIIYGDLCRSCSMARFNEENHTLTVLGRDR------- 1262

Query: 416  KGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKN 475
                                               +   H D++    + G + SD ++N
Sbjct: 1263 ---------------------------------SAVSVVHCDMMYHDRAFGILCSDDERN 1289

Query: 476  VVLFMYQPEARESNGG-HRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYA 534
            V++  Y P  +E++ G H  + ++   L            K        G  S  +T Y 
Sbjct: 1290 VLIMGYTPRVQETDAGTHPKVLESVLSLDGEYRLPSGSLVKSLRFRSTAGNSS--VTLYV 1347

Query: 535  SLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG-- 592
            S  G +GF +P+ E+  R  L +   +        GL PR F +         N  RG  
Sbjct: 1348 SNYGEIGFIVPIGEQANRTALWVTRRLQIDLPCEAGLTPRMFLSLNQSSPR--NSLRGKE 1405

Query: 593  -IIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEAL 634
             ++   L+     L L  R    K I       LD + +I AL
Sbjct: 1406 MLVPAPLLRGLFSLDLRSR----KAIARAAYTQLDRVANIVAL 1444


>gi|254580509|ref|XP_002496240.1| ZYRO0C13816p [Zygosaccharomyces rouxii]
 gi|238939131|emb|CAR27307.1| ZYRO0C13816p [Zygosaccharomyces rouxii]
          Length = 1331

 Score =  125 bits (314), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 98/399 (24%), Positives = 178/399 (44%), Gaps = 46/399 (11%)

Query: 236  LFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRG 295
            L +P +W  I +  F  +    +   + + ++ +      + Y+ +G  +  +ED+   G
Sbjct: 969  LINPKTWNVIDKREFDDNSL--INDARTMLIQLDSRTRRRKEYVIVGVAHVETEDLPPSG 1026

Query: 296  RILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLK- 354
             + +FDI EVVPEPG+P T  K+  ++ +  +G V+++C ++G  +    QK+ +  ++ 
Sbjct: 1027 SLSVFDITEVVPEPGKPDTNFKLGEVFKENIRGTVSSVCDISGRFLINQSQKVIVRDVQE 1086

Query: 355  DNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPN 414
            DN +  +AF+D  V++  + S  N +++GD  +    + +  E                 
Sbjct: 1087 DNSVVPVAFLDVPVFVTDVKSFGNFLIIGDSMQGFQFIGFDAE----------------- 1129

Query: 415  SKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDK 474
                    P R I  G  V K   ++L               + L     + F ++D   
Sbjct: 1130 --------PYRMIPLGRSVSKLETVAL---------------EFLVNGGDIFFAVTDTSN 1166

Query: 475  NVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYA 534
             + +F Y P+   S  G RL+  T F+L    NT   +  K    S    + S       
Sbjct: 1167 ILHIFKYAPDEPNSLSGQRLVHCTSFNL-HSTNTCMVLLPKNEEFSVGEKSLSPVQVVGG 1225

Query: 535  SLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGII 594
              DG+L   +PL E  YRRL +LQ  +       GGLNPR  R    + Y+  +  R ++
Sbjct: 1226 QTDGSLFKLVPLREDTYRRLYVLQQQLTEKEVQLGGLNPRMER-LSNEYYHLTHAVRPML 1284

Query: 595  DGSLVWKFLQLSLGERLEICKKIGSK-HNDILDELYDIE 632
            + +++ +F  LS+ +R +  +K G + H DI  +L +IE
Sbjct: 1285 EFNVIRRFNTLSVEKRKQTAQKAGRRAHFDIWRDLVNIE 1323


>gi|340059653|emb|CCC54046.1| putative mitochondrial carrier protein [Trypanosoma vivax Y486]
          Length = 1481

 Score =  125 bits (313), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 124/518 (23%), Positives = 207/518 (39%), Gaps = 90/518 (17%)

Query: 92   FSNIAGYQGVFLCGPHPAWLFLTSR-GELRAHPMTIDGPVSTLAPF-----HNVNCPRGF 145
            F ++AG  G ++CG HP +L    R G L  +   I GPV   APF       V C  GF
Sbjct: 958  FDSLAGNVGAYVCGRHPLFLLWDRRTGLLSGYRHQIQGPVRGFAPFPLMEGGFVYCGEGF 1017

Query: 146  LYFNAKSELRISVLPTHLS-YDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEP--- 201
              F        +V+ T+       W  R++ +  TPHF++Y++  +   +VTS  +P   
Sbjct: 1018 TDF--------AVMNTYCRPIGHGWLGRRIDVGATPHFISYNMPGRGCFVVTSHKQPFRP 1069

Query: 202  ---------STDYYKFNGEDKELVTDPRDSRFIP---------PLVSQFHVSLFSPFSWE 243
                        Y +  G  + + T+P      P         P+   F V   S    +
Sbjct: 1070 QRAPFDVQLKISYNEETGAIQSIATEPLTCSMPPIASSAGVRVPMADWFEVRFMSTAHVD 1129

Query: 244  EIPQTNFPLHEWEHVLCLKNVSMEYEG--TLSGLRGYIALGTNYNYSEDVTCRGRILLFD 301
               +  F L E E VL ++ V ++ +    ++G      + T +   +DVTCRGRI L  
Sbjct: 1130 WPCEDTFKLEENERVLSIQMVQIDGDRGMKINGTVPVCVVSTAFPLGDDVTCRGRIHLL- 1188

Query: 302  IIEVVPEPGQPLTK-NKIKMIYAKEQKGPVTAICHVAGFLVTAVGQ--KIYIWQLKDNDL 358
                     + L + +KI  ++A+   GP TA+  +   +  AVG   KIY +  +   L
Sbjct: 1189 -------ATKSLRRGHKIVHLHAEALNGPATAVAEIRHHIAVAVGGTIKIYRYDWQSGKL 1241

Query: 359  TGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGY 418
                 +   +Y   +  ++N I+ GD   S A+ R+  E  TL+++ R+           
Sbjct: 1242 VVSVLLYAGIYATKLSVIRNYIVYGDLIHSCAMARFNEENHTLTVLGRNRN--------- 1292

Query: 419  YAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVL 478
                 S  ++D ++++                    H+       S G + SD  +NV++
Sbjct: 1293 -----SISVVDCNMMY--------------------HD------RSFGILCSDDQRNVLV 1321

Query: 479  FMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDG 538
              Y P  +E+  G R  K  +  L           C   S+  +    +  +  Y S  G
Sbjct: 1322 MGYTPRVQEAGAG-RPAKTLESLLTLDGEYRLPSGCLAKSLRFSSDFGNSSVMLYTSNYG 1380

Query: 539  ALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAF 576
             +GF +P+ E+  R  L +   + T      GL PR F
Sbjct: 1381 EVGFIVPIGEQANRTALWVTRRLQTDVPCDAGLTPRMF 1418


>gi|74025892|ref|XP_829512.1| cleavage and polyadenylation specificity factor-like protein
            [Trypanosoma brucei brucei strain 927/4 GUTat10.1]
 gi|70834898|gb|EAN80400.1| cleavage and polyadenylation specificity factor-like protein,
            putative [Trypanosoma brucei brucei strain 927/4
            GUTat10.1]
          Length = 1452

 Score =  124 bits (311), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 152/638 (23%), Positives = 248/638 (38%), Gaps = 103/638 (16%)

Query: 32   PLLLVRTQHELLIYQAFRHPKGALKLRFKKLKVLFVSDRSKRANEQPGLPRGVRISQMRY 91
            PL L++  H  L  +A R    +++ +  +L+    S+R+   N+     + VR    R 
Sbjct: 875  PLRLIKKFHHFLDTKAVREVIESIEAKKMRLQ----SERTMIENDT----QSVRHCSRRI 926

Query: 92   --FSNIAGYQGVFLCGPHPAWLFLTSRG-ELRAHPMTIDGPVSTLAPFHNVNCPRGFLYF 148
              F+ +AG  G ++CG HP +L   +R  +L A+     G V    PF ++  P GF+Y 
Sbjct: 927  IPFAAVAGQSGAYVCGQHPLFLMWDNRTRQLVAYRHQAPGLVRGFVPFTSM--PGGFIYC 984

Query: 149  NAKSELRISVLPTHLSYDA-PWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEP------ 201
              +  +  +V+ T+ S     W  R++ +  TPHF+ Y    ++  +VTS   P      
Sbjct: 985  -CEGFVDFAVMNTYCSPGGNGWLRRRIHIGATPHFIVYDPPGRSCFVVTSKKVPFRPQRA 1043

Query: 202  STDY-----YKFNGEDKELVTDPRDSRFIP----------PLVSQFHVSLFSPFSWEEIP 246
            S D      Y  +    + VT       +P          PL  +F V L S F      
Sbjct: 1044 SFDVQLKIQYDEDSNTVQSVTTEAPVCNMPAIKPGTGVRVPLTERFEVRLHSTFKKGWDC 1103

Query: 247  QTNFPLHEWEHVLCLKNVSMEYEGTLSG--LRGYIALGTNYNYSEDVTCRGRILLFDIIE 304
                 L E E VL  + V +  +    G        + T +   EDVTCRGRI+L     
Sbjct: 1104 TDKLMLDENEKVLGAQMVEIHQDANADGSATAPVCVVCTAFPLGEDVTCRGRIILLASRN 1163

Query: 305  VVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKI----YIWQLKDNDLTG 360
            +         +  I  ++++   GP TA+  +   +  AVG  I    Y W+ K   L  
Sbjct: 1164 IK-------GRRSIVQLHSEPLNGPATAVAGICSQIAVAVGGTIKIFRYDWETKK--LVV 1214

Query: 361  IAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYA 420
             AF+   +Y   +   +N I+ GD  RS ++ R+  E  TL+++ RD             
Sbjct: 1215 SAFLYAGMYATRLSVFRNYIIYGDLCRSCSMARFNEENHTLTVLGRDR------------ 1262

Query: 421  GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFM 480
                                          +   H D++    + G + SD ++NV++  
Sbjct: 1263 ----------------------------SAVSVVHCDMMYHDRAFGILCSDDERNVLIMG 1294

Query: 481  YQPEARESNGG-HRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGA 539
            Y P  +E++ G H  + ++   L            K        G  S  +T Y S  G 
Sbjct: 1295 YTPRVQETDAGTHPKVLESVLSLDGEYRLPSGSLVKSLRFRSTAGNSS--VTLYVSNYGE 1352

Query: 540  LGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG---IIDG 596
            +GF +P+ E+  R  L +   +        GL PR F +   +     N  RG   ++  
Sbjct: 1353 IGFIVPIGEQANRTALWVTRRLQIDLPCEAGLTPRMFLSLNQRSPR--NSLRGKEMLVPA 1410

Query: 597  SLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEAL 634
             L+     L L  R    K I       LD + +I AL
Sbjct: 1411 PLLRGLFSLDLRSR----KAIARAAYTQLDRVANIVAL 1444


>gi|219109892|ref|XP_002176699.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217411234|gb|EEC51162.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 1678

 Score =  122 bits (307), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 103/432 (23%), Positives = 184/432 (42%), Gaps = 86/432 (19%)

Query: 249  NFPLHEWEHVLCLKNVSM-------------EYEGTLSGLRGYIALGTNY--NYSEDVTC 293
            +F L E+EH + L  + +             +  G     R ++A+GT    +  EDV  
Sbjct: 1285 SFKLDEYEHGMTLSIMELTEFPEEPGSSNDTDVSGDELSKRMFVAVGTGVLDHNGEDVAS 1344

Query: 294  RGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQK---GPVTAICHVAG----FLVTAVGQ 346
            RGR +L ++ +      +   +  +++ +  E++   G VT++  ++      L+   G 
Sbjct: 1345 RGRAILLEL-KRTNSSAKAAGRQVVELSFCYEKEIFHGAVTSLVCLSSEGKNRLLIGAGA 1403

Query: 347  KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVAR 406
             I + Q  +  LT + F    + +   +  K+ +L+ D   S+  L ++   ++L+L+A+
Sbjct: 1404 DINVEQWGNAKLTQVGFFRATMQVLHTIPFKSFLLLSDAYDSLYFLIWRESDKSLTLLAK 1463

Query: 407  DYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMG 466
            DY P       Y AG  SRG                                     +M 
Sbjct: 1464 DYDPIPV----YAAGVMSRG------------------------------------PAMT 1483

Query: 467  FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSI------- 519
            F+  D  +N+  F Y P    + GG+RL+ + D+HLG    +F    C+ S +       
Sbjct: 1484 FLCHDDRQNLQFFQYAPGEAAARGGNRLVCRADYHLGTQTTSFASHFCRSSLMIHSATPT 1543

Query: 520  --------SDAPGARS----RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSH 567
                     D+   RS    R   ++ + DG +G  +PL E  Y RL  LQ+++      
Sbjct: 1544 STLAALKQQDSYFGRSEEDQRLGAYFGTADGGMGAVVPLSEPVYWRLTALQSIVANALES 1603

Query: 568  TGGLNPRAFRTYKGK----GYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHND 623
               L PRA+R Y+      G  + +  +G+IDG LV ++  LS+ ++ +I   IGS  + 
Sbjct: 1604 DCALAPRAWRLYRRSTRRGGCRSNDRKKGVIDGDLVLQYADLSISKQEDIASAIGSTVDL 1663

Query: 624  ILDELYDIEALS 635
            ILD L +++  S
Sbjct: 1664 ILDNLLELQCGS 1675


>gi|443919095|gb|ELU39366.1| cleavage factor protein [Rhizoctonia solani AG-1 IA]
          Length = 788

 Score =  122 bits (306), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 153/644 (23%), Positives = 278/644 (43%), Gaps = 114/644 (17%)

Query: 5   RSHSPSAMDETIVQELLTVSLGLHGNRPLLLVRTQHELL-IYQAFR------HPKGALKL 57
           ++H+     ET ++ ++   +G+   +P L+V T+   L IY+           + +  +
Sbjct: 232 QAHTVCTDGETDIEHVIIAPIGITRPKPHLVVITKSRTLAIYEPVPAPPPPDSSENSAPV 291

Query: 58  RFKKLKVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQ-----GVFLCGPHPAWLF 112
           R  +L V FV   S+       LP  +  ++     ++  ++     G+F+ G HP WL 
Sbjct: 292 R-DQLTVQFVKVFSR------ALPLDMHDTKRVAGRSLVPFKSPNLSGIFVTGDHPFWLL 344

Query: 113 LTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLP-THLSYDAPWPV 171
            T    LR +P                       Y N+     +  LP   +S++ P   
Sbjct: 345 RTDASALRIYPHAAQ-------------------YVNSFGTTVVEWLPDVDISHEIPCRS 385

Query: 172 RKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQ 231
                      +AY + T+ + +  S    +  YY    ED   +  P D+    P +  
Sbjct: 386 YASDDGRVYTSVAYDVSTR-HILAASALRTTFAYYD---EDSNELYTP-DATHPNPEIHC 440

Query: 232 FHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDV 291
             + L +P +W  +    F  +E+  V  ++++ +E   T  GL+ Y+ +GT  +  ED+
Sbjct: 441 SALELITPDTWTTVDGYEFAQNEF--VNAVESIPLETLSTERGLKDYVVVGTTISRGEDL 498

Query: 292 TCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIW 351
             +G   +F+++EVVPEPG    + +++++  ++ KG VTA+C + G+LV+++GQKI++ 
Sbjct: 499 AVKGATYVFEVVEVVPEPGSKTRQYRLRLLCREDSKGAVTALCGMNGYLVSSMGQKIFVR 558

Query: 352 QLK-DNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKP 410
               D  LTGIAF+D  V + S+  +KNL+LVGD  +S+  + +Q E   L  + +D + 
Sbjct: 559 AFDLDEKLTGIAFMDVGVCVTSLRPLKNLLLVGDMVKSVWFVAFQEEPFKLVPLGKDRQQ 618

Query: 411 TQPNSKGYYAGNPSR---GIIDGSLVWKFLQLSLGERLEICKK---IGSKHNDILDEFSS 464
                  ++ G+ ++    ++D   V+       G RL IC         H  +L     
Sbjct: 619 LSVTHADFFFGSQAQLSFAVLDDFGVF-------GLRL-ICSSEFHTHVTHRGVLSVSRK 670

Query: 465 MGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPG 524
             F   D D    +   Q    ES+      K   FH  QH N+   + C  SS      
Sbjct: 671 ADF---DSD----VMSIQSLGTESSLIFGETKPYPFH--QH-NSILTM-CGGSS------ 713

Query: 525 ARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGY 584
                       DG +    PL E  + RL +LQ  ++                      
Sbjct: 714 ------------DGTIASLTPLNESEFGRLQLLQGQLIR--------------------- 740

Query: 585 YAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEL 628
              N   G++DG+L+  F +L + +++E+ ++IG++   IL++L
Sbjct: 741 ---NVHNGVLDGNLLAAFEELPVSKQVEMTQQIGAEREKILNDL 781


>gi|298715584|emb|CBJ28137.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 255

 Score =  119 bits (299), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 79/289 (27%), Positives = 134/289 (46%), Gaps = 47/289 (16%)

Query: 351 WQLKDNDLTGIAFIDTEVYIASMVSVKN-LILVGDYARSIALLRYQPEYRTLSLVARDYK 409
           W  K   L  I F D  VY+ S+  +K+  ILVGD   S+ L+ ++ E  +L+ +++D++
Sbjct: 13  WDPKTCTLELIGFHDPRVYVMSLSVIKHKFILVGDAYGSVQLVVWREEDHSLTALSKDHE 72

Query: 410 PTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMI 469
             Q  S  Y    P                                         M  ++
Sbjct: 73  DCQVFSAEYLIDEPG----------------------------------------MAIVV 92

Query: 470 SDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRF 529
           +D  +NV +  Y P A  S GG +L+ ++DF+LG  V    + R +  ++ D     +R+
Sbjct: 93  ADGRRNVKVLQYAPNATNSRGGTKLLCQSDFYLGSRVGKLTRRRTR-GNLRDG----ARY 147

Query: 530 LTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNP 589
                +LDG LG  LP+ E+ +RRL  LQ +M     H G  NPRA+R +     +    
Sbjct: 148 CLLAGTLDGGLGAVLPVDERVFRRLYALQGIMSNALGHNGAANPRAYRLFDHGPTFRYET 207

Query: 590 SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
            + ++DGSL+W+F+ L    + ++ + IG+  + ++  L DI+ L+S F
Sbjct: 208 KQNMLDGSLLWRFVGLDAKTQHDLTRAIGTTVDRVMANLLDID-LASLF 255


>gi|444313909|ref|XP_004177612.1| hypothetical protein TBLA_0A02930 [Tetrapisispora blattae CBS 6284]
 gi|387510651|emb|CCH58093.1| hypothetical protein TBLA_0A02930 [Tetrapisispora blattae CBS 6284]
          Length = 1459

 Score =  119 bits (297), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 121/558 (21%), Positives = 228/558 (40%), Gaps = 71/558 (12%)

Query: 89   MRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYF 148
            M Y     GY  +F+ G  P  +        R+        VS +    N       +  
Sbjct: 950  MHYIPEYNGYSVIFVTGKSPYIIIKEDDSSPRSFKFANIPLVSMIRWGKN-----SVMCV 1004

Query: 149  NAKSELRISVLPT-HLSYDAPWPVRKVPLK------CTPHFLAYHLETKTYCIVTSTAEP 201
            +     R+  L   ++ Y    P++++ +        T   +AYH  +K Y +   +   
Sbjct: 1005 DPLKNARVYTLDCKNIYYGNKLPIKRIDISDEMDNYMTFTKIAYHESSKLYVV---SYCK 1061

Query: 202  STDYYKFNGEDKELVTDPRDSRFIPPLVS-QFHVSLFSPFSWEEIPQTNFPLHEWEHVLC 260
              DY   + E + LV    D   +P   S +  + L +P +W  I Q  F   E   +  
Sbjct: 1062 DIDYNALDEEAERLVGYNSD---VPHAKSYKSGILLINPKTWNVIDQREF--GENSLIND 1116

Query: 261  LKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKM 320
            ++++ ++        R YI  G     SED+   G   ++DI  V+PE G+P T  K K 
Sbjct: 1117 IRSMVIQLNSRTRAKREYIVAGLANIGSEDLPPTGSFYIYDISPVLPETGKPDTNYKFKE 1176

Query: 321  IYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLK-DNDLTGIAFIDTEVYIASMVSVKNL 379
            I+ ++ +G VT++C ++G       QKI +  ++ DN +  +AF+D  VY+    S  N 
Sbjct: 1177 IFTEDVRGLVTSVCEISGRFTINQSQKIMVRDVQEDNSVVPVAFLDIPVYVTDTKSFGNF 1236

Query: 380  ILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQL 439
            +L+ D  + +  + +  E   + L+ +     + ++  + A N                 
Sbjct: 1237 LLISDSMQGLQFVGFDAEPFRMILLGKSIPDLKISTVEFIANN----------------- 1279

Query: 440  SLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTD 499
                                    ++ F  +D D  + +F Y P+   S  G +L+  + 
Sbjct: 1280 -----------------------GNIYFAATDYDNILHIFKYAPDEPNSLSGQKLVHCSS 1316

Query: 500  FHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASL----DGALGFFLPLPEKNYRRLL 555
            F+L    +    +   P +   +   +  F+  + +L    DG++   +PL E  YRRL 
Sbjct: 1317 FNLHSSTSCMIML---PGNDEFSENEQDNFIPSFQTLGGQVDGSIFKVIPLEESPYRRLY 1373

Query: 556  MLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICK 615
            ++Q  +  +    GGLNP+  R    + Y   N  + ++D +++ +F  L + +R    +
Sbjct: 1374 VIQQQITDYEVQVGGLNPKMER-LSNEYYQKSNMLKPMLDFNIIRRFSMLPIDKRRRTAQ 1432

Query: 616  KIGSK-HNDILDELYDIE 632
            K G + H +I  +L +IE
Sbjct: 1433 KAGRRAHFEIWRDLINIE 1450


>gi|388856288|emb|CCF50097.1| related to cleavage and polyadenylation specificity factor, 160 kDa
            subunit [Ustilago hordei]
          Length = 1568

 Score =  117 bits (294), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 100/344 (29%), Positives = 162/344 (47%), Gaps = 46/344 (13%)

Query: 273  SGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKN-KIKMIYAKEQKGPVT 331
            +G + +IA+GT   + ED TC+G + LF+II+VV      + ++ ++K+I       PVT
Sbjct: 1150 TGRKQFIAVGTTTYHGEDRTCKGSVYLFEIIQVVSSRRFQVGRDLRLKLICRDGSNAPVT 1209

Query: 332  AICHVAGFLVTAVGQKIYIWQLKDND-LTGIAFIDTEVYIASMVSVKNLILVGDYARSIA 390
            A+  + GFL++  GQK+Y+  L+  + L  +AF+D   YI S+  VKN +L+ D  + + 
Sbjct: 1210 ALAELHGFLLSTSGQKLYVRALEKEEWLISVAFLDCPFYITSIRVVKNFVLLSDAKKGLW 1269

Query: 391  LLRYQPE-YRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICK 449
             L +Q + YR + L +                      +DG         +LGE L    
Sbjct: 1270 FLAFQEDPYRFVDLGS---------------------ALDGHCA------NLGEFLVYND 1302

Query: 450  KIG--SKHNDILDEFSSMGFMISDKDKNVV-LFMYQPEARESNGGHRLIKKTDFHLGQHV 506
            K+   S     L  FS  G     +D  V+ L+ Y P +  S GG RL+ +T++      
Sbjct: 1303 KLSLVSTSGVALGGFSGFG-----QDSGVIRLYEYNPSSPTSLGGQRLLLRTEYSTPSST 1357

Query: 507  NTFFKIRCKPSSISDAPGA---RSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVT 563
                    +  S S+  G    R++ L   +  +G+L     + EK  +RL +LQ  +V 
Sbjct: 1358 TCSLSAPGRWLSDSELRGREQLRNKLL--LSKSNGSLDSLASVEEKVAKRLHLLQGQLVR 1415

Query: 564  HTSHTGGLNPRAFRTYKGKGYYAGNP-SRGIIDGSLVWKFLQLS 606
               HT  LNPRAFR  +    +   P  +G++D  L+  F  LS
Sbjct: 1416 SVLHTAALNPRAFRQVRND--FVSRPLYKGVLDARLLDAFKGLS 1457


>gi|402591342|gb|EJW85272.1| hypothetical protein WUBG_03818, partial [Wuchereria bancrofti]
          Length = 1025

 Score =  113 bits (282), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 71/233 (30%), Positives = 115/233 (49%), Gaps = 14/233 (6%)

Query: 14   ETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP---KGALKLRFKKLKVLFVSDR 70
            E ++ ELL V +G++  RPLL +     +  Y+ F +    +G L +RFK+L    V+ R
Sbjct: 794  EEVIMELLLVGMGMNQGRPLLFLLIDDTVSAYEMFTYNNGIQGHLAIRFKRLPYTTVT-R 852

Query: 71   SKRANEQPG------LPRGVRISQMRYFSNIAG--YQGVFLCGPHPAWLFLTSRGELRAH 122
            S R     G      +   VR   + +F    G    GVF+C  +P   FL S G  R H
Sbjct: 853  SCRFQGTDGRAAVESVRDAVRHKTVLHFFERIGNVLNGVFICSSYPCIFFLES-GVPRLH 911

Query: 123  PMTIDGPVSTLAPFHNVNCPRGFLYFNAKSEL-RISVLPTHLSYDAPWPVRKVPLKCTPH 181
            P+ +DGP+ +   F+N  CP GF+Y   +  L R++ LP+ +  DA +PV+++ +  T H
Sbjct: 912  PVNLDGPILSFTTFNNAACPNGFIYLTERDRLMRVAKLPSDMILDASYPVKRINVGATVH 971

Query: 182  FLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHV 234
             + Y L + TY ++TS     T       +DK      +   F+ P + Q+ +
Sbjct: 972  SVVYLLHSNTYAVLTSEKRKVTKMCVLINDDKTFEEHEKPDTFVYPEMDQYKL 1024


>gi|71413583|ref|XP_808925.1| cleavage and polyadenylation specificity factor [Trypanosoma cruzi
           strain CL Brener]
 gi|70873226|gb|EAN87074.1| cleavage and polyadenylation specificity factor, putative
           [Trypanosoma cruzi]
          Length = 444

 Score =  110 bits (276), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 110/491 (22%), Positives = 194/491 (39%), Gaps = 84/491 (17%)

Query: 172 RKVPLKCTPHFLAYHLETKTYCIVTSTAEP------------STDYYKFNGEDKELVTDP 219
           R++ L  TPHF+ YH   ++  +VTS  EP            +  Y + +G  + + T+ 
Sbjct: 2   RRIHLGVTPHFVVYHPPARSCFVVTSKKEPFRPQRAPFDFQLNIVYDEESGGVQSITTEA 61

Query: 220 RDSRFIP---------PLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEG 270
                 P         P+  +F + L S   W         L E E VL  + + +  E 
Sbjct: 62  PVCNMPPIAPNAGIRVPMADRFEIRLMSTTDWA--CTDTLLLEENERVLGAQMMEIHCEK 119

Query: 271 TLSGLRG--YIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKG 328
              GL       + T +   ED+TCRGRILL   +           K KI + +++   G
Sbjct: 120 DAEGLHTAPVCVVSTAFPLGEDITCRGRILLLATMCTK-------KKRKILLFHSEPLNG 172

Query: 329 PVTAICHVAGFLVTAVGQKIYIWQL--KDNDLTGIAFIDTEVYIASMVSVKNLILVGDYA 386
           P TA+  +   +  AVG  I +++   +   L   A +    Y+  M S +N ++ GD +
Sbjct: 173 PATAVVGIRHHIAVAVGGTIKLFRFDWEKRKLVVGALLYAGTYVTRMSSFRNYLIYGDLS 232

Query: 387 RSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLE 446
           RS A+ R+  E  TLS++ +D                                       
Sbjct: 233 RSCAIARFNEENHTLSVLGKDR-------------------------------------- 254

Query: 447 ICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGG--HRLIKKTDFHLGQ 504
               +   H D++    + G + SD ++N+++  Y P  +E+  G  +++++      G+
Sbjct: 255 --NAVSVVHCDMMYHDRAFGLLCSDDERNLLVMGYTPRVQETEAGSPNKVLESVLSLDGE 312

Query: 505 HVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 564
           +        C   S+     A +  +T Y +  G +GF +P+ E+  R    L   +   
Sbjct: 313 Y---RLSGGCLVKSLRFRSLAGNSSVTLYVTNYGEIGFIVPIGEQANRTASWLMRRLQID 369

Query: 565 TSHTGGLNPRAFRTY-KGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHND 623
             H+ GL PR F    +G    A      ++  SL+ +F  L +  R    K I S    
Sbjct: 370 LPHSAGLTPRMFLGLSQGSPRTAMRAKEMLVSASLLNEFFFLDIHSR----KTIASAAYT 425

Query: 624 ILDELYDIEAL 634
            L+ + ++ +L
Sbjct: 426 QLERVTNVASL 436


>gi|159470707|ref|XP_001693498.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158283001|gb|EDP08752.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 366

 Score =  108 bits (270), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 80/259 (30%), Positives = 111/259 (42%), Gaps = 53/259 (20%)

Query: 81  PRGVRISQMRYFSNI-----AGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAP 135
           PR +R   + Y   +     A + GVF+ G  P WL +  RG L AH M  +GPV+ L P
Sbjct: 144 PRLIRFDHIAYTDPLTRARGANHSGVFVAGARPLWL-VAGRGGLAAHAMWSEGPVAALTP 202

Query: 136 FHNVNCPRGFL-YFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCI 194
           FHNVNCP GF+   +A+ +L++  LP H   D  W  R+VPLK TPH LA+  E      
Sbjct: 203 FHNVNCPLGFITACSARGQLKVCCLPPHTRLDGAWATRRVPLKVTPHRLAWFREAGIVAA 262

Query: 195 VTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHE 254
           +TS   PS                PR +                     E P        
Sbjct: 263 ITSRPAPS---------------RPRPA---------------------EEPGG------ 280

Query: 255 WEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLT 314
            E  LCLK V +    T       +A+GT     ED  C GR+LL+ +   V + G+   
Sbjct: 281 -EQALCLKFVYLR-NATTGDTDTLLAVGTGTPLGEDYPCLGRLLLYSVAAEVVDQGRGNM 338

Query: 315 KNK--IKMIYAKEQKGPVT 331
             +    ++ A++    VT
Sbjct: 339 SRRWSATLVAARDTASAVT 357


>gi|224000243|ref|XP_002289794.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220975002|gb|EED93331.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 1820

 Score =  107 bits (267), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 92/389 (23%), Positives = 162/389 (41%), Gaps = 70/389 (17%)

Query: 278  YIALGTNY--NYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKE-QKGPVTAIC 334
            ++A+GT       ED+  +GRILLF++ +   +  +     ++ + + K+   GPVT++ 
Sbjct: 1470 FVAVGTGRIERDGEDIASKGRILLFNLKKKKHQKDKRSMTLELHLKHEKDITIGPVTSLS 1529

Query: 335  HVAG----FLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIA 390
             +       +    G ++ + Q     L  + F    + + ++   K   L+ D   ++ 
Sbjct: 1530 SLRSEDIFRVAVGAGAEVTVEQWGSGKLVQVGFYHAHMQVQNISLFKTFFLLSDAYDALH 1589

Query: 391  LLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKK 450
             L ++   ++L+L+A+DY+PTQ     + AG  SRG                        
Sbjct: 1590 FLVWRESDKSLTLLAKDYEPTQV----FAAGMISRG------------------------ 1621

Query: 451  IGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHV---- 506
                         +M F+  D  +N+    Y P    + GG++L+ + DFHLG       
Sbjct: 1622 ------------GAMSFVCHDDRQNIQFLQYAPTDVAARGGNKLVCRADFHLGSQTTSLN 1669

Query: 507  -----NTFFKIRCKPSSI------SDAPGAR----SRFLTWYASLDGALGFFLPLPEKNY 551
                 ++     C  SS        D+   R     RF   + + DG+    +PL E  Y
Sbjct: 1670 SHWAQSSLLFNSCTVSSTLASLKQQDSLFGRLDDDQRFAVNFGTTDGSFVSIIPLSEPTY 1729

Query: 552  RRLLMLQNVMVTHTSHTGGLNPRAFRTYKGK----GYYAGNPSRGIIDGSLVWKFLQLSL 607
             RL  LQ+VM         L+ RA+R Y+      G    +  +G+ID  LV KF+ L L
Sbjct: 1730 WRLTALQSVMSNALESNAALSHRAWRLYRRSTRRGGCRTNDRKKGVIDADLVMKFVDLPL 1789

Query: 608  GERLEICKKIGSKHNDILDELYDIEALSS 636
             E+ ++   IGS    ++D L ++    S
Sbjct: 1790 PEQEDLTSSIGSTVGLVMDNLLELSCAGS 1818


>gi|414587799|tpg|DAA38370.1| TPA: hypothetical protein ZEAMMB73_163106 [Zea mays]
          Length = 461

 Score =  107 bits (266), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 47/113 (41%), Positives = 68/113 (60%), Gaps = 1/113 (0%)

Query: 88  QMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLY 147
           ++  F+N+ GY+G+FL GP P W+F+  R   R HP   DGP+      HNVNC RG +Y
Sbjct: 257 RITIFNNVGGYEGLFLGGPRPTWVFVC-RQRFRVHPQLCDGPIVAFTVLHNVNCCRGLIY 315

Query: 148 FNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAE 200
             ++  L+I  LP+  +YD  WPV+KVPL  TPH + Y+ E   Y ++ S  +
Sbjct: 316 VTSQGFLKICQLPSAYNYDNYWPVQKVPLHGTPHQVTYYGEQSLYPLIVSVPQ 368


>gi|452825139|gb|EME32137.1| cleavage and polyadenylation specificity factor subunit-like protein
            [Galdieria sulphuraria]
          Length = 1454

 Score =  106 bits (265), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 126/565 (22%), Positives = 236/565 (41%), Gaps = 118/565 (20%)

Query: 86   ISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTID-----GPVSTLAPFHNVN 140
            I  +R F N++ + GVFL G  P+ + L S+G  + H + ID     G + ++    +  
Sbjct: 876  IPHLRPFYNLSSHFGVFLTGSVPSIIVL-SKGYPQKHEIMIDSGVEYGDILSITNMGDPE 934

Query: 141  CPRGFLYFNAKSELRI-SVLPTHL-SYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTST 198
              R     ++   +    +  T L S +  WPV    +      + YH  T T+ +V S+
Sbjct: 935  NNRKLWILDSNGRIHFGEIRETQLESINWAWPVEVFRMNGCVKNVVYHATTGTFGVVVSS 994

Query: 199  A------EPSTDYYKFNGEDKELV-----------------TDPRDSRFIPPLVSQFHVS 235
                   E     ++    D+  +                  +P+++  +P  V  + + 
Sbjct: 995  IVSMSRLERKRQIFERQKRDERAILGSQAPPEEENNTEFEENEPKNA--LPIEVEAYELQ 1052

Query: 236  LFSPFSWEEIPQTNFPLHEWEHVLCL---------------------KNVSMEYEGTLSG 274
            ++   +WE + +  F   E E VL                       +    + E  +S 
Sbjct: 1053 IYRADTWELVDK--FAFKEEEAVLSATFMQVDAYKITEEENNDDKSSRATQQQAEAAISQ 1110

Query: 275  L--------RGYIALGTNYNYSEDVTCRGRILLFDII--EVVPEPGQPLTKNKIKMIYAK 324
                     +  I +GT +   ED   RGR++LF++   E   E     +  ++ +I  K
Sbjct: 1111 SSRSIKFKPKECIVIGTGFIKGEDAGTRGRLMLFEVARQEAYTEESGAFSAIQLMLIAEK 1170

Query: 325  EQKGPVTAICHVAGFLVTAVGQKIYIWQL-KDNDLTGIAFIDT-EVYIASMVSVKNLILV 382
            E K  V++I  + G++  AVG K+ I++L  +++L   +F    +++  S+ +VK  + V
Sbjct: 1171 ELKSVVSSIARLEGYICCAVGPKVEIYKLVNESELVCCSFYSGFQLFSTSINTVKQYVFV 1230

Query: 383  GDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLG 442
            GD  +    L ++   ++L+ + +D+ P Q                  +L  +FL     
Sbjct: 1231 GDMYKGGYFLFWRDRNKSLNFLGKDFDPVQ------------------TLSTEFL----- 1267

Query: 443  ERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQ-PEARESNGGHRLIKKTDFH 501
                           IL+EF  + F++SD   N+ L  Y  P   ES GG +L+++   H
Sbjct: 1268 ---------------ILNEF--ILFVVSDNFGNLHLLEYAGPHEIESRGGEKLLRRGVLH 1310

Query: 502  LGQHVNTFFKIRC--KPSSISDAPGARSRFL-TWYASLDGALGFFLPLPEKNY--RRLLM 556
            LG   ++  ++R   K ++  D  G+    L TW    DG L   LPL ++ Y  +  L+
Sbjct: 1311 LGTRSSSMIRLRTDWKENNSEDRAGSHIVVLGTW----DGGLACLLPLQQEEYEQKNELL 1366

Query: 557  LQNVMVTHTSHTGGLNPRAFRTYKG 581
             +  + +++ +  GLNP+ FR  +G
Sbjct: 1367 KKVYLHSYSLYVAGLNPQEFRIPRG 1391


>gi|430810872|emb|CCJ31592.1| unnamed protein product [Pneumocystis jirovecii]
 gi|430814599|emb|CCJ28188.1| unnamed protein product [Pneumocystis jirovecii]
          Length = 203

 Score =  105 bits (263), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 60/184 (32%), Positives = 99/184 (53%), Gaps = 11/184 (5%)

Query: 457 DILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI--RC 514
           D L +   + F+I D D N+ +F Y PE  +S  G +L+K+ DFH+G H+ +   +    
Sbjct: 17  DFLVDDEHLYFVIGDDDGNIHVFNYDPENPQSFSGQKLLKRGDFHVGSHIKSILMLPKEA 76

Query: 515 KPSSISDAPGARSR----FLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGG 570
            P +++D    R+      L   AS DG++G  + LPEK YRRL  +Q  ++       G
Sbjct: 77  FPQNVNDKEETRASKNQDSLCLCASQDGSMGVLISLPEKTYRRLYFIQGQLINTEDKVAG 136

Query: 571 LNPRAFR--TYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEL 628
           LNP ++R  TY  K     NP+RGI+DG L++++  L   ++ ++ +K G     I+ +L
Sbjct: 137 LNPISYRTSTYVSK---TSNPARGILDGKLLYQYNNLERNKQKDMARKSGMPVETIIYDL 193

Query: 629 YDIE 632
             I+
Sbjct: 194 LKID 197


>gi|398020786|ref|XP_003863556.1| cleavage and polyadenylation specificity factor-like protein
            [Leishmania donovani]
 gi|322501789|emb|CBZ36871.1| cleavage and polyadenylation specificity factor-like protein
            [Leishmania donovani]
          Length = 1542

 Score =  105 bits (262), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 131/596 (21%), Positives = 226/596 (37%), Gaps = 106/596 (17%)

Query: 33   LLLVRTQHELLIYQAF----RHPKGALKLRFKKLKVL-------FVSDRSKRANEQPGLP 81
            L+++ +  EL+ Y+        P+  +K+ +  L V         +  R KR  E+    
Sbjct: 938  LVMILSSGELVTYRVVPADANGPRRCVKVIYHILDVAPEVDVVESIEARKKRLQEERAHL 997

Query: 82   RGVRISQMRYFSN----IAGYQ----GVFLCGPHPAWL-FLTSRGELRAHPMTIDGPVST 132
              V   QMR+ S       G Q    G+++CG  P +L +  +  +L          V  
Sbjct: 998  ASV-TQQMRHCSERLVPFRGLQDRHKGIYVCGQTPVFLVYHAATNQLVCTRHHATNAVRG 1056

Query: 133  LAPFHN-------VNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAY 185
             APFH+       V C  GF++F              L   + W + +V L CTPH + Y
Sbjct: 1057 FAPFHSRHVHGGFVYCGEGFVHFATMQPF------GELLGSSGWWLERVRLGCTPHQVIY 1110

Query: 186  HLETKTYCIVTSTAEP-STDYYKFNGEDKELVTDPRDSRF--------IPPLVS------ 230
                    +V S  +P S     F+ + + +V D   +R         +PPL +      
Sbjct: 1111 SPAAHGCFVVASRPQPFSPKRAPFDVQLR-MVEDEEGNRVPHVIEPVSLPPLSATSGSPV 1169

Query: 231  ----QFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEY---EGTLSGLRGYIALGT 283
                ++ V  FS   W+ + +    ++E      L  V+ +        S      AL T
Sbjct: 1170 PTNERYEVQFFSTLDWQCMGRLVLDVNEKVLSATLMQVTRDTTMDAANRSTTAPVCALAT 1229

Query: 284  NYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA-GFLVT 342
             Y   EDVT RGRILL                 +++ ++ +  KGPVTAI  V    +  
Sbjct: 1230 AYPLGEDVTTRGRILLLTT-----SQQGGQGMQQLRTLHEEPMKGPVTAITRVGEDCVAV 1284

Query: 343  AVGQKIYIWQLKDNDLT--GIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRT 400
            AVG  + +++   N  T   +A +    Y+  + + +N +++GD   S+   RY  E  T
Sbjct: 1285 AVGGTVRVYRYDTNKSTMETMAILYAGAYVTCLQAFRNYLVIGDLFNSVLFARYSEEIHT 1344

Query: 401  LSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILD 460
            ++++ RD                                           I    ND+L 
Sbjct: 1345 ITILGRD----------------------------------------TNAISVVSNDMLY 1364

Query: 461  EFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSIS 520
              +  G +++D  +N+V   Y+P   E  G    I ++   +         +  K   + 
Sbjct: 1365 HDTRFGLLVTDDARNLVCMSYKPRVLEEPGKPPKILESLLTVTGEYRLAGGVLLKMMRLR 1424

Query: 521  DAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAF 576
             A   R+  +  Y +  G +G+ +PL ++  R    +   + +  +H GGL PR F
Sbjct: 1425 -AASTRNSSVAIYVTNMGEIGYLVPLGDQTSRTGQWVGRRLQSEVAHAGGLPPRMF 1479


>gi|146096490|ref|XP_001467824.1| cleavage and polyadenylation specificity factor-like protein
            [Leishmania infantum JPCM5]
 gi|134072190|emb|CAM70891.1| cleavage and polyadenylation specificity factor-like protein
            [Leishmania infantum JPCM5]
          Length = 1542

 Score =  105 bits (262), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 131/596 (21%), Positives = 226/596 (37%), Gaps = 106/596 (17%)

Query: 33   LLLVRTQHELLIYQAF----RHPKGALKLRFKKLKVL-------FVSDRSKRANEQPGLP 81
            L+++ +  EL+ Y+        P+  +K+ +  L V         +  R KR  E+    
Sbjct: 938  LVMILSSGELVTYRVVPADANGPRRCVKVIYHILDVAPEVDVVESIEARKKRLQEERAHL 997

Query: 82   RGVRISQMRYFSN----IAGYQ----GVFLCGPHPAWL-FLTSRGELRAHPMTIDGPVST 132
              V   QMR+ S       G Q    G+++CG  P +L +  +  +L          V  
Sbjct: 998  ASV-TQQMRHCSERLVPFRGLQDRHKGIYVCGQTPVFLVYHAATNQLVCTRHHATNAVRG 1056

Query: 133  LAPFHN-------VNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAY 185
             APFH+       V C  GF++F              L   + W + +V L CTPH + Y
Sbjct: 1057 FAPFHSRHVHGGFVYCGEGFVHFATMQPF------GELLGSSGWWLERVRLGCTPHQVIY 1110

Query: 186  HLETKTYCIVTSTAEP-STDYYKFNGEDKELVTDPRDSRF--------IPPLVS------ 230
                    +V S  +P S     F+ + + +V D   +R         +PPL +      
Sbjct: 1111 SPAAHGCFVVASRPQPFSPKRAPFDVQLR-MVEDEEGNRVPHVIEPVSLPPLSATSGSPV 1169

Query: 231  ----QFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEY---EGTLSGLRGYIALGT 283
                ++ V  FS   W+ + +    ++E      L  V+ +        S      AL T
Sbjct: 1170 PTNERYEVQFFSTLDWQCMGRLVLDVNEKVLSATLMQVTRDTTMDAANRSTTAPVCALAT 1229

Query: 284  NYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA-GFLVT 342
             Y   EDVT RGRILL                 +++ ++ +  KGPVTAI  V    +  
Sbjct: 1230 AYPLGEDVTTRGRILLLTT-----SQQGGQGMQQLRTLHEEPMKGPVTAITRVGEDCVAV 1284

Query: 343  AVGQKIYIWQLKDNDLT--GIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRT 400
            AVG  + +++   N  T   +A +    Y+  + + +N +++GD   S+   RY  E  T
Sbjct: 1285 AVGGTVRVYRYDTNKSTMETMAILYAGAYVTCLQAFRNYLVIGDLFNSVLFARYSEEIHT 1344

Query: 401  LSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILD 460
            ++++ RD                                           I    ND+L 
Sbjct: 1345 ITILGRD----------------------------------------TNAISVVSNDMLY 1364

Query: 461  EFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSIS 520
              +  G +++D  +N+V   Y+P   E  G    I ++   +         +  K   + 
Sbjct: 1365 HDTRFGLLVTDDARNLVCMSYKPRVLEEPGKPPKILESLLTVTGEYRLAGGVLLKMMRLR 1424

Query: 521  DAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAF 576
             A   R+  +  Y +  G +G+ +PL ++  R    +   + +  +H GGL PR F
Sbjct: 1425 -AASTRNSSVAIYVTNMGEIGYLVPLGDQTSRTGQWVGRRLQSEVAHAGGLPPRMF 1479


>gi|393907593|gb|EJD74705.1| CPSF A subunit region family protein [Loa loa]
          Length = 990

 Score =  104 bits (259), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 62/200 (31%), Positives = 104/200 (52%), Gaps = 14/200 (7%)

Query: 10  SAMDETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP---KGALKLRFKKLKVLF 66
           +A  E ++ ELL V +G++  RP+L +     + +Y+ F +    +G L +RFK+L    
Sbjct: 789 AAKPEEVIMELLMVGMGMNQGRPMLFLLIDDTVSVYEMFTYNNGIQGHLAVRFKRLPYTV 848

Query: 67  VSDRSKRANEQPG------LPRGVRISQMRYFSNIAG--YQGVFLCGPHPAWLFLTSRGE 118
           V+ RS R     G      +   VR   + +F    G    GVF+C  +P   FL + G 
Sbjct: 849 VT-RSCRFQGLDGRAAVESVRDAVRHKTVLHFFERIGNVLNGVFICSSYPCIFFLET-GV 906

Query: 119 LRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSEL-RISVLPTHLSYDAPWPVRKVPLK 177
            R HP+ +DGP+ +   F+N  CP GF+Y   +  L R++ LP  +  D  +PV+++ + 
Sbjct: 907 PRLHPVNLDGPILSFTTFNNAACPNGFIYLTERERLMRVAKLPNDMILDTSYPVKRIDVG 966

Query: 178 CTPHFLAYHLETKTYCIVTS 197
            + H + Y L + TY ++TS
Sbjct: 967 ASVHSVTYLLHSNTYAVLTS 986


>gi|389602597|ref|XP_001567507.2| cleavage and polyadenylation specificity factor-like protein
            [Leishmania braziliensis MHOM/BR/75/M2904]
 gi|322505515|emb|CAM42945.2| cleavage and polyadenylation specificity factor-like protein
            [Leishmania braziliensis MHOM/BR/75/M2904]
          Length = 1536

 Score =  102 bits (255), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 142/660 (21%), Positives = 256/660 (38%), Gaps = 117/660 (17%)

Query: 33   LLLVRTQHELLIYQAF----RHPKGALKLRFKKLKVLFVSD-------RSKRANEQPGLP 81
            L+++ +  EL+ Y+        P+  +KL +  L V    D       R KR  E+    
Sbjct: 932  LVMILSTGELVTYRVVPADASAPRRCVKLVYHILDVAPEVDVVESIEVRKKRLQEERAHL 991

Query: 82   RGVRISQMRY-------FSNIAG-YQGVFLCGPHPAWL---FLTSRGELRAHPMTIDGPV 130
              V   QMR        F  + G ++G+++CG  P +L   + T++     H  T    V
Sbjct: 992  ASV-TQQMRRCSERLVPFCALQGRHKGIYVCGQTPVFLVYHYATNQLVCTRHHAT--SAV 1048

Query: 131  STLAPFHN-------VNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFL 183
               APFH+       V C  GF++F              L   + W + +V L CTPH +
Sbjct: 1049 RGFAPFHSRHVHGGFVYCGEGFVHFATMQPF------GELLGSSGWWLERVRLGCTPHQV 1102

Query: 184  AYHLETKTYCIVTSTAEP-STDYYKFNGEDKELVTDPRDSRF--------IPPLVS---- 230
             Y        +V S  +P S     F+ + + +V D   +R         +PPL +    
Sbjct: 1103 IYSPAAHGCFVVVSRPQPFSPKRAPFDVQLR-MVEDEEGNRVPHVVEPVSLPPLSATSGS 1161

Query: 231  ------QFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTL---SGLRGYIAL 281
                  ++ V LFS   W+ +      ++E      L  VS +    +   S      AL
Sbjct: 1162 PVPTNGRYEVQLFSTLDWQRVDCLALDVNEKVLSATLMQVSRDTTMDVAYRSATAPVCAL 1221

Query: 282  GTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV-AGFL 340
             T Y   EDVT RGR+LL    +   + GQ +   K+++++ +  KGPVTAI  +    +
Sbjct: 1222 ATAYPLGEDVTTRGRVLLLATSQ---QGGQGM--QKLRILHEEPMKGPVTAITRIDEDCI 1276

Query: 341  VTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEY 398
              AVG   ++Y +      +   A +    Y+  + ++++ +++GD   S+   RY  E 
Sbjct: 1277 AVAVGGTVRVYRYDASKGVMETTAILYAGAYVTCLQALRDYLVIGDLFHSVLFARYSEEI 1336

Query: 399  RTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 458
             T++++ RD                                           I    +D+
Sbjct: 1337 HTITILGRD----------------------------------------TNAISVVSSDM 1356

Query: 459  LDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSS 518
            L   +  G +++D  +N++   Y+P   E  G    + ++   +         +  K   
Sbjct: 1357 LYHDTRFGLLVADDARNLMCMSYKPRLLEEPGKPPKVLESLLSVTGEYRLAGGVLLKMMR 1416

Query: 519  ISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRT 578
            +  +    S    +  ++ G +G+ +PL ++  R    +   + +  +H GGL PR F  
Sbjct: 1417 LRASAARSSSVAIYVTNM-GEIGYLVPLGDQTSRTGQWVVRRLQSEVAHAGGLPPRMFL- 1474

Query: 579  YKGKGYYAGNPSRGIIDGSLVWKF--LQLSLGERLEICKKIGSKHNDILDELYDIEALSS 636
                G+   +P R +     +  F  L+    + L   K I S     L+ + ++ A  S
Sbjct: 1475 ----GFPQDDPLRSLKGDEWMLHFPLLEQLYRQDLRTRKLIASAAQTQLERVMNVGATVS 1530


>gi|387593561|gb|EIJ88585.1| hypothetical protein NEQG_01275 [Nematocida parisii ERTm3]
 gi|387597215|gb|EIJ94835.1| hypothetical protein NEPG_00359 [Nematocida parisii ERTm1]
          Length = 1261

 Score = 97.1 bits (240), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 113/505 (22%), Positives = 200/505 (39%), Gaps = 68/505 (13%)

Query: 137  HNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVT 196
             N N  R F+   AK  +    LP + SYD     RK  +      ++Y    K     T
Sbjct: 798  ENSNSTRSFIVM-AKGSIAKGNLPVY-SYDKSVLYRKTKVDSICEKISYSKAKKVIVAAT 855

Query: 197  STAEPST-DYYKFNGE-----DKELVTDPRDSRF-IPPLVSQFHVSLFSPFSWEEIPQTN 249
                P T D   F  +     D ELV  P   +  + PL   + + ++S    +   +T 
Sbjct: 856  YKNNPYTEDMIPFTVQATTELDAELVPAPVIPKISVNPLTRAYSLKIYSHEEMKVCSRTE 915

Query: 250  --------FPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFD 301
                    + L   E++   K V++  +    G   ++ + T Y   ED+  RGR+++ +
Sbjct: 916  GVLMAVDEYRLENNEYIAYHKIVTLPDKQNTEGFSEFVIVCTTYITDEDLMARGRLIVLE 975

Query: 302  IIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDND-LTG 360
            I  VVP+  +  T++K+K + A++ KG  T    V G +V  VG K+ I+    N+ L  
Sbjct: 976  IASVVPQRDRIETRHKLKALAAEKTKGATTCCDIVKGNIVVCVGTKLMIYMFDRNEGLRA 1035

Query: 361  IAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYA 420
            +AF D  V++ S + ++N+I+ GD  +   LL YQ +   L ++++       +S G Y 
Sbjct: 1036 VAFHDIHVFLTSCMVMRNIIVCGDAYKGTFLLFYQSDPPLLHMLSQ-------SSGGVY- 1087

Query: 421  GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFM 480
                                      + K IG   +D     +++  +  D  K V ++ 
Sbjct: 1088 --------------------------LLKGIGMTLHD-----TALSLISYDSLKTVCIYT 1116

Query: 481  YQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGAL 540
            Y P+   S  G RLI + +  L   +   F I             +  + T   +  G +
Sbjct: 1117 YSPQHILSQDGSRLISRGECKLPDDIAGSFLIE-----------KKGVYRTALYTKHGYV 1165

Query: 541  GFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVW 600
                 + +  Y  LL LQ+ + +    T G NPR+    +          + I+   L+ 
Sbjct: 1166 YSHKTVVQTKYIALLDLQHAVESAHWMTLGTNPRSHWVTERSAEMKDITLKEILQTGLME 1225

Query: 601  KFLQLSLGERLEICKKIGSKHNDIL 625
            +F  +   +   I    G    D++
Sbjct: 1226 EFFNMCTVQSDRIVADTGRASADVV 1250


>gi|378755148|gb|EHY65175.1| hypothetical protein NERG_01621 [Nematocida sp. 1 ERTm2]
          Length = 822

 Score = 96.7 bits (239), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 117/513 (22%), Positives = 212/513 (41%), Gaps = 82/513 (15%)

Query: 18  QELLTVSLGLHGNRPLLLVRT-QHELLIYQAFRHPKGALKLRFKKLKVL---FVSDRSKR 73
           Q LL + +  + +   LL RT  +E+++YQ           R  K KV    F  ++++ 
Sbjct: 257 QSLLDIEVIEYRSAVYLLARTISNEIVLYQERDG-------RLYKEKVTNNAFYYEKAEV 309

Query: 74  ANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTL 133
               P        S+MR   ++     VF+ G +   + + +  ++  H   I   + ++
Sbjct: 310 GQSSP--------SRMRVCGSL-----VFIPGTYKTRVLVFTPYQVIVHAANIR--IDSI 354

Query: 134 APFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYC 193
                 + P       AK  +   +LP + +YD P   +K  +      +AY    K   
Sbjct: 355 EEITEDSNPIKSFIIMAKGSIARGMLPLY-AYDKPVLYKKTKVDSICQRVAYSAVKKVIV 413

Query: 194 IVT-STAEPSTDYYKFNGE-DKELVTDPRDSRFIP-----PLVSQFHVSLFSPFSWEEIP 246
            VT    E + D   F  +   EL  +P     IP     PL   + + ++S    ++  
Sbjct: 414 AVTYKDKEYTKDMIPFTVQATTELDAEPLPPPVIPEIKVNPLTRAYSLKIYSHEEMKQYS 473

Query: 247 QT--------NFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRIL 298
           QT         +PL + E++   K V++  +    G+  ++ + T Y   ED+  RGR++
Sbjct: 474 QTGEVLMAVDEYPLEDNEYIAHHKIVTLPDKQNTEGVSEFVIVCTTYITDEDLMARGRLI 533

Query: 299 LFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDND- 357
           + +I  VVP+  +  T++K+K + A++ KG  T    V G +V  VG K+ I+    N+ 
Sbjct: 534 VLEIASVVPQRDRIETRHKLKALAAEKTKGATTCCDIVKGNIVVCVGTKLMIYMFDRNEG 593

Query: 358 LTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKG 417
           L  +AF D  V++ S + ++N+I+ GD  +   LL YQ E            P+  +   
Sbjct: 594 LRAVAFHDIHVFLTSCMVMRNIIVCGDAYKGTFLLFYQSE------------PSLLHLLS 641

Query: 418 YYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVV 477
             +G        G  + K + ++L                     S +  +  D  K V 
Sbjct: 642 QSSG--------GVYLLKGIGMTL-------------------YGSVLSLLSYDSAKTVC 674

Query: 478 LFMYQPEARESNGGHRLIKKTDFHLGQHVNTFF 510
           ++ Y P+   S GG RLI + +  L   ++  F
Sbjct: 675 IYSYSPQHILSQGGTRLISRGECKLPDDISGSF 707


>gi|385304556|gb|EIF48568.1| rna-binding subunit of the mrna cleavage and polyadenylation factor
           [Dekkera bruxellensis AWRI1499]
          Length = 289

 Score = 96.3 bits (238), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 75/311 (24%), Positives = 136/311 (43%), Gaps = 49/311 (15%)

Query: 272 LSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVT 331
           ++  + Y+ +G+     ED+  +G  ++++II+VVP+P  P  KN++K+I ++  +G + 
Sbjct: 1   MNDTKNYVIVGSGKYRVEDLATKGSWMVYEIIDVVPDPNHPEAKNRLKLIKSESSRGSIL 60

Query: 332 AICHVAGFLVTAVGQKIYIWQL-KDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIA 390
             C+++G       Q++ +  + KD +   +AF DT +Y   + S ++++++GD    ++
Sbjct: 61  GSCNISGRFSLVQAQRMLVRTIKKDGNAVPVAFXDTSLYTKDVKSFEDMMIIGDAFDGLS 120

Query: 391 LLRYQPE-YRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICK 449
           L  +  E YR L L     K TQ  S                             L  C 
Sbjct: 121 LYGFDAEPYRMLKLG----KETQNLS-----------------------------LTAC- 146

Query: 450 KIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 509
                  D +     +  + +D+D  + L  Y P   ES  G +L+ ++ F    +    
Sbjct: 147 -------DFIVXEGGLYIIAADEDSVLHLLEYDPYDPESMKGXKLLTRSVFRFNGYTTAM 199

Query: 510 FKIRCKPS------SISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVT 563
                K S      +++  PGA   F     +++G+     P  E  YRRL  LQN +  
Sbjct: 200 RLCDRKNSIFSMLDTLAIPPGADLGFEVIGCNIEGSFYKVTPANEYTYRRLYALQNHISD 259

Query: 564 HTSHTGGLNPR 574
             SH  GLNP+
Sbjct: 260 KESHWLGLNPK 270


>gi|401426989|ref|XP_003877978.1| cleavage and polyadenylation specificity factor-like protein
            [Leishmania mexicana MHOM/GT/2001/U1103]
 gi|322494225|emb|CBZ29522.1| cleavage and polyadenylation specificity factor-like protein
            [Leishmania mexicana MHOM/GT/2001/U1103]
          Length = 1542

 Score = 95.9 bits (237), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 129/598 (21%), Positives = 230/598 (38%), Gaps = 110/598 (18%)

Query: 33   LLLVRTQHELLIYQAF----RHPKGALKLRFKKLKVL-------FVSDRSKRANEQPGLP 81
            L+++ +  EL+ Y+        P+  +K+ +  L V         +  R KR  E+    
Sbjct: 938  LVMILSSGELVTYRVVPADAHGPRRCVKVIYHILDVAPEVDVVESIEARKKRLQEERAHL 997

Query: 82   RGVRISQMRYFSN----IAGYQ----GVFLCGPHPAWL-FLTSRGELRAHPMTIDGPVST 132
              V   QMR+ S       G Q    G+++CG  P +L +  +  +L          V  
Sbjct: 998  ATV-TQQMRHCSERLVPFRGLQDRHKGMYVCGQTPVFLVYHAATNQLVCTRHHATNAVRG 1056

Query: 133  LAPFHN-------VNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAY 185
             APFH+       V C  GF++F              L   + W + +V L CTPH + Y
Sbjct: 1057 FAPFHSRHVHGGFVYCGEGFVHFATMQPF------GELLGSSGWWLERVRLGCTPHQIIY 1110

Query: 186  HLETKTYCIVTSTAEP-STDYYKFNGEDKELVTDPRDSRF--------IPPLVS------ 230
                    +V S  +P S     F+ + + +V D   +R         +PPL +      
Sbjct: 1111 SPAAHGCFVVASRPQPFSPKRAPFDVQLR-MVEDEEGNRVPHVIEAVSLPPLSAASGSPV 1169

Query: 231  ----QFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTL-----SGLRGYIAL 281
                ++ V  FS   W+ + +    L   E VL    + +  + T+     S      AL
Sbjct: 1170 PTNERYEVQFFSTLDWQCMGR--LVLDANEKVLSATLMQVTRDTTMDAANRSTTAPVCAL 1227

Query: 282  GTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA-GFL 340
             T Y   EDVT RGRILL    +   + G  +    ++ ++ +  KGPVTAI  V    +
Sbjct: 1228 ATAYPLGEDVTTRGRILLLTTTQ---QGGHGM--QHLRTLHEEPMKGPVTAITRVGEDCV 1282

Query: 341  VTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEY 398
              AVG   ++Y +    + +  +A +    Y+  + + ++ +++GD   S+   RY  E 
Sbjct: 1283 AAAVGGTVRVYRYDTYKSTMETMAILYAGAYVTCLQAFRDYLVIGDLFNSVLFARYSEEI 1342

Query: 399  RTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 458
             T++++ RD                                           I    ND+
Sbjct: 1343 HTITILGRD----------------------------------------TNAISVVSNDM 1362

Query: 459  LDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSS 518
            L   +  G +++D  +N++   Y+P   E  G    + ++   +         +  K   
Sbjct: 1363 LYHDTRFGLLVTDDARNLMCMSYKPRVLEEPGKPPKVLESLLTVTGEYRLAGGVLLKMMR 1422

Query: 519  ISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAF 576
            +  A    S    +  ++ G +G+ +PL ++  R    +   + +  +H GGL PR F
Sbjct: 1423 LRAASAHSSSVAIYVTNM-GEIGYLVPLGDQTSRTGQWVVRRLQSEVAHAGGLPPRMF 1479


>gi|168066745|ref|XP_001785293.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162663100|gb|EDQ49884.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 1090

 Score = 95.1 bits (235), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 116/498 (23%), Positives = 196/498 (39%), Gaps = 105/498 (21%)

Query: 130  VSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLET 189
            V+ + PF++ + P   L    + EL I  +           +R VPL   P  +A+  ++
Sbjct: 670  VNHMCPFNSASFPDS-LAIGKEGELTIGTIDEI----QKLHIRTVPLGEHPRRIAHQEQS 724

Query: 190  KTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTN 249
            +T+ I ++   P +     NGED E                  +V L    ++E I  + 
Sbjct: 725  RTFAICSAKYAPGS-----NGEDME----------------THYVRLIEDQTFEII--SG 761

Query: 250  FPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVT-CRGRILLFDIIEVVPE 308
            FPL  +E+   +   S   +  +     Y  +GT Y   E+    +GRIL+F +     E
Sbjct: 762  FPLDPYENGCSIITCSFTDDSNV-----YYCVGTAYALPEESEPSKGRILVFSV-----E 811

Query: 309  PGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKI--YIWQLKDNDLTGIAFIDT 366
             G      KI+++  KE KG V  +    G L+  + QKI  Y W L+D+    + +  +
Sbjct: 812  DG------KIQLVAEKEVKGAVYNLNAFNGKLLAGINQKIALYKWTLRDDGTRELQYESS 865

Query: 367  E---VYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNP 423
                +    + S  + I+VGD  +SI+LL Y+PE   +   ARDY               
Sbjct: 866  HHGHILALYVQSRGDFIVVGDLMKSISLLIYKPEEGAIEERARDYNAN------------ 913

Query: 424  SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQP 483
                      W                      +ILD+ + +G   ++   N+       
Sbjct: 914  ----------WM------------------TAVEILDDDTYLG---AENSFNLFTVRKNN 942

Query: 484  EARESNGGHRLIKKTDFHLGQHVNTF----FKIRCKPSSISDAPGARSRFLTWYASLDGA 539
            +A       RL    ++HLG+ VN F      +R   S  S  P         + +++G 
Sbjct: 943  DAATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSEASQIP------TVIFGTVNGV 996

Query: 540  LGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLV 599
            +G    LP+  +  L  LQ  +V      GGL+   +R++  +       +R  +DG L+
Sbjct: 997  IGVIASLPQDQFLFLQKLQQALVKVIKGVGGLSHEQWRSFSNERKTV--DARNFLDGDLI 1054

Query: 600  WKFLQLSLGERLEICKKI 617
              FL LS  +  EI   +
Sbjct: 1055 ESFLDLSRNKMEEIATSL 1072


>gi|157873900|ref|XP_001685450.1| cleavage and polyadenylation specificity factor-like protein
            [Leishmania major strain Friedlin]
 gi|68128522|emb|CAJ08654.1| cleavage and polyadenylation specificity factor-like protein
            [Leishmania major strain Friedlin]
          Length = 1541

 Score = 95.1 bits (235), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 127/598 (21%), Positives = 229/598 (38%), Gaps = 110/598 (18%)

Query: 33   LLLVRTQHELLIYQAF----RHPKGALKLRFKKLKVL-------FVSDRSKRANEQPGLP 81
            L+++ +  EL+ Y+        P+  +K+ +  L V         +  R KR  E+    
Sbjct: 937  LVMILSSGELVTYRVVPADANGPRRCVKVIYHILDVAPEVDVMESIKARKKRLQEERAHL 996

Query: 82   RGVRISQMRYFSN--------IAGYQGVFLCGPHPAWL-FLTSRGELRAHPMTIDGPVST 132
              V   QMR+ S            Y+G+++CG  P +L +  +  +L          V  
Sbjct: 997  ASV-TQQMRHCSERLVPFRGLQDRYKGIYVCGQTPVFLVYHAATNQLVCTRHHATNAVRG 1055

Query: 133  LAPFHN-------VNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAY 185
             APFH+       V C  GF++F              L   + W + +V L CTPH + Y
Sbjct: 1056 FAPFHSRHVHGGFVYCGEGFVHFATMQPF------GELLGCSGWWLERVRLGCTPHQVIY 1109

Query: 186  HLETKTYCIVTSTAEP-STDYYKFNGEDKELVTDPRDSRF--------IPPLVS------ 230
                    +V S  +P S     F+ + + +V D   +R         +PPL +      
Sbjct: 1110 SPAAHGCFVVASRPQPFSPKRAPFDVQLR-MVEDEEGNRVPHVIEPVSLPPLSATSGSPV 1168

Query: 231  ----QFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTL-----SGLRGYIAL 281
                ++ V  FS  +W+ + +    L   E VL    + +  + T+     S      AL
Sbjct: 1169 PTNERYEVQFFSTLNWQCMGR--LVLDGNEKVLSATLMQVTRDTTMDAANRSTTAPVCAL 1226

Query: 282  GTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA-GFL 340
             T Y   EDVT RGRILL    +            +++ ++ +  +GPVTAI  V    +
Sbjct: 1227 ATAYPLGEDVTTRGRILLLTTSQQ-----SGQGMQQLRTLHEEPMEGPVTAITRVGEDCV 1281

Query: 341  VTAVGQKIYIWQLKDNDLT--GIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEY 398
              AVG  + +++   N  T   +A +    Y+  + + +  +++GD   S+   RY  E 
Sbjct: 1282 AVAVGGTVRVYRYDANKSTMETMAILYAGAYVTCLQAFREYLVIGDLFNSVLFARYSEEI 1341

Query: 399  RTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 458
             T++++ RD                                           I    ND+
Sbjct: 1342 HTITILGRD----------------------------------------TSAISVVSNDM 1361

Query: 459  LDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSS 518
            L   +  G +++D  +N++   Y+P   E +G    + ++   +         +  K   
Sbjct: 1362 LYHDTRFGLLVTDDARNLMCMSYKPRVLEEHGKPPKVLESLLTVTGEYRLAGGVLLKMMR 1421

Query: 519  ISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAF 576
            +  A    S    +  ++ G +G+ +PL ++  R    +   + +  +H GGL PR F
Sbjct: 1422 LRAASARSSSVAIYVTNM-GEIGYLVPLGDQTSRTGQWVVRRLQSEVAHAGGLPPRMF 1478


>gi|414587797|tpg|DAA38368.1| TPA: hypothetical protein ZEAMMB73_143443 [Zea mays]
          Length = 153

 Score = 93.2 bits (230), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 51/144 (35%), Positives = 78/144 (54%), Gaps = 1/144 (0%)

Query: 487 ESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPL 546
           ES  G +L+ + +FH+G HV+ F +++  P+    A    +RF   + +LDG +G   P+
Sbjct: 3   ESWKGQKLLSRAEFHVGAHVSKFLRLQMLPTQ-GLASEKTNRFALVFGTLDGGIGCIAPV 61

Query: 547 PEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLS 606
            E  +RRL  LQ  +V    H  GLNPR+FR +K  G         IID  L+  +  +S
Sbjct: 62  DELTFRRLQSLQRKLVDAIPHVCGLNPRSFRHFKSNGKAHRPGPDNIIDFELLSHYEMMS 121

Query: 607 LGERLEICKKIGSKHNDILDELYD 630
           L E+LEI ++IG+  + IL    D
Sbjct: 122 LEEQLEIAQQIGTTRSQILSNFSD 145


>gi|428164905|gb|EKX33915.1| hypothetical protein GUITHDRAFT_158867 [Guillardia theta CCMP2712]
          Length = 1092

 Score = 90.5 bits (223), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 116/529 (21%), Positives = 210/529 (39%), Gaps = 99/529 (18%)

Query: 97   GYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRI 156
            G   VF     P  ++  +R  L ++    +  V+ +APF++   P   L    ++ LRI
Sbjct: 640  GATHVFAASDRPTVIYSNNRKLLFSNVNLKE--VTQMAPFNSEGFPDS-LAIATETSLRI 696

Query: 157  SVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELV 216
             V+           +R V L+  P  + +   +KT+C+ T +   + D     GE+ E  
Sbjct: 697  GVIDDI----QKLHIRTVYLREQPRRICHQESSKTFCVATLSIRINRD-----GEEVE-- 745

Query: 217  TDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLR 276
                          QF + LF   ++E +    + L E+E+   ++  S   + TL    
Sbjct: 746  -------------EQF-IKLFDDQTFEILD--TYQLQEFENTCSVECASFSDDPTL---- 785

Query: 277  GYIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICH 335
             Y  +GT     ++   + GR+L+F++I+            K+ +  +KE KG    I  
Sbjct: 786  -YYIVGTATAVPQESEPKEGRLLVFEVID-----------RKLHLKASKEIKGAPYQIKP 833

Query: 336  VAGFLVTAVGQKIYIWQLKDNDLTGIAFIDT-----EVYIASMVSVKNLILVGDYARSIA 390
              G L+  +  KI +++L D+D   +  +        + +  + +  + I+ GD  RSI+
Sbjct: 834  FNGKLLAGINSKIELFRLSDSDTGHMELVSECCHRGHILVLYLQTRGDFIVAGDLMRSIS 893

Query: 391  LLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKK 450
            LL Y+     +  +ARD+                         W                
Sbjct: 894  LLTYKQVDGQIEEIARDFNAN----------------------WM--------------- 916

Query: 451  IGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFF 510
                  DILD+ + +G   ++   N+       +A       RL    ++HLG  VN F 
Sbjct: 917  ---TAVDILDDDTFLG---AEGYFNLFTVRKNTDATSDEERARLEVVGEYHLGDMVNRFQ 970

Query: 511  KIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGG 570
            +      S SD P   +     + +++G +G    L ++ Y  LL +Q+ +       GG
Sbjct: 971  RGSLVLRS-SDTPTTDT---IIFGTVNGMIGVIAVLSKEEYEFLLKVQDALNFVIKGVGG 1026

Query: 571  LNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGS 619
            L    +R+++ +        +G IDG L+  FL L   +  E+C  IGS
Sbjct: 1027 LRHEDWRSFENERTQGARAPKGFIDGDLIESFLDLRREKMEEVCHAIGS 1075


>gi|301124072|ref|XP_002909688.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262107255|gb|EEY65307.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 176

 Score = 87.8 bits (216), Expect = 2e-14,   Method: Composition-based stats.
 Identities = 53/173 (30%), Positives = 88/173 (50%), Gaps = 15/173 (8%)

Query: 481 YQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPS-SISDAPGAR-----SRFLTWYA 534
           + P+  ES GG RL++ +DFHLG  V++ F+ R   S S+  A   R     S ++    
Sbjct: 3   FAPQDIESRGGQRLLRVSDFHLGVQVSSMFRKRVDASGSVVSATNGRNAAPLSNYVNVMG 62

Query: 535 SLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGY-YAGNPS--- 590
           + +G +G  +P+ E+ +RRL  LQNVMV        LNPR FR  K       G P    
Sbjct: 63  TSEGGVGALVPVGERVFRRLFTLQNVMVNTLPQNCALNPREFRMLKTNAQRRCGRPDAWS 122

Query: 591 -----RGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
                +  +D  ++++FLQL+   + E+ + IG+    ++  L +++  +S F
Sbjct: 123 KKKWKKSFLDAFVLFRFLQLNYVAQKELARCIGTTPEVVMHNLLEVQHATSTF 175


>gi|198432471|ref|XP_002129229.1| PREDICTED: similar to DNA damage-binding protein 1 (Damage-specific
            DNA-binding protein 1) (UV-damaged DNA-binding factor)
            (DDB p127 subunit) (DNA damage-binding protein a) (DDBa)
            (UV-damaged DNA-binding protein 1) (UV-DDB 1) (Xeroderma
            pigmentosum group E-co... isoform 2 [Ciona intestinalis]
          Length = 1142

 Score = 87.4 bits (215), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 127/585 (21%), Positives = 226/585 (38%), Gaps = 104/585 (17%)

Query: 66   FVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMT 125
            ++SDR K       +P G + + +  F++  G + VF C   P  ++ +S  +L    + 
Sbjct: 629  YISDRKK-------VPLGTQPTSLSVFTS-GGSRTVFACSDRPTVVY-SSNKKLVFSNVN 679

Query: 126  IDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAY 185
            +   VS + P  +   P      N  +     +L   +       +R VPL  +P  +AY
Sbjct: 680  LK-EVSHMCPLDSDGYPDSLALANDNT-----LLIGTIDEIQKLHIRTVPLYESPRRIAY 733

Query: 186  HLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIP-----PLVSQFHVSLFSPF 240
              E++ + +VT      TD     G DK  +T P  S         P V    V  FS  
Sbjct: 734  QEESQCFGLVT----LRTDSVDATG-DKMKITRPSASTQASVCTKSPPVDGRSVEGFSAT 788

Query: 241  ----SWEEIPQTNFPLHEW------EHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSED 290
                S   I Q  F +H        E  L + +  +      S    Y  +GT + Y E+
Sbjct: 789  ADIGSLLIIDQHTFEVHHAYQLDTNEEPLSIMSCKLG-----SDPNSYFVVGTAFVYMEE 843

Query: 291  VTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIY 349
               + GRIL+F  I+           NK+ ++  KE KG V  +C   G ++ A+   + 
Sbjct: 844  TEPKHGRILVFHYID-----------NKLTLVAEKEVKGAVFCLCQFNGHVLAAINTSVS 892

Query: 350  IWQ-LKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDY 408
            I+Q   + +L       + +    +    + +LVGD  RS+++L Y+     L  +A+DY
Sbjct: 893  IYQWTTEKELRAECSNQSNILALYLKCKGDFVLVGDLMRSMSILNYKHVEGNLDEIAKDY 952

Query: 409  KPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFM 468
             P                       W                      +ILD+ + +G  
Sbjct: 953  SPN----------------------WM------------------TAVEILDDDNFLG-- 970

Query: 469  ISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSR 528
             ++   NV +      A       +L +   FH+G  +NTF        ++ +     S+
Sbjct: 971  -AENFYNVFICQKDSGATTDEERSKLREAALFHVGDSINTFRHGSLVMQNVGET-AVSSK 1028

Query: 529  FLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGN 588
                + ++ G++G    + E  Y  L  +QN +       G ++  ++R++        +
Sbjct: 1029 GHILFGTVHGSIGVITTVDEDLYAFLHSIQNRLAKVIKSVGNIDHESWRSFCTNEKTEAH 1088

Query: 589  PSRGIIDGSLVWKFLQLSLGERLEICKKI-----GSKHNDILDEL 628
              RG +DG L+  FL L+  +  E+ K +     G+K    +D+L
Sbjct: 1089 --RGFVDGDLIECFLDLNREKMAEVAKGLMVKEHGTKREATVDDL 1131


>gi|301093655|ref|XP_002997673.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262110063|gb|EEY68115.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 176

 Score = 87.4 bits (215), Expect = 2e-14,   Method: Composition-based stats.
 Identities = 53/173 (30%), Positives = 87/173 (50%), Gaps = 15/173 (8%)

Query: 481 YQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPS-SISDAPGAR-----SRFLTWYA 534
           + P+  ES GG RL++ +DFHLG  V++ F+ R   S S+  A   R     S ++    
Sbjct: 3   FAPQDIESRGGQRLLRVSDFHLGVQVSSMFRKRVDASGSVVSATNGRNAAPLSNYVNVMG 62

Query: 535 SLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGY-YAGNPS--- 590
           + +G +G  +P+ E+ +RRL  LQNVMV        LNPR FR  K       G P    
Sbjct: 63  TSEGGVGALVPVGERVFRRLFTLQNVMVNTLPQNCALNPREFRMLKTNAQRRCGRPDAWS 122

Query: 591 -----RGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
                +  +D  ++++FLQL    + E+ + IG+    ++  L +++  +S F
Sbjct: 123 KKKWKKSFLDAFVLFRFLQLDYVAQKELARCIGTTPEVVMHNLLEVQHATSTF 175


>gi|198432469|ref|XP_002129207.1| PREDICTED: similar to DNA damage-binding protein 1 (Damage-specific
            DNA-binding protein 1) (UV-damaged DNA-binding factor)
            (DDB p127 subunit) (DNA damage-binding protein a) (DDBa)
            (UV-damaged DNA-binding protein 1) (UV-DDB 1) (Xeroderma
            pigmentosum group E-co... isoform 1 [Ciona intestinalis]
          Length = 1150

 Score = 86.3 bits (212), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 127/589 (21%), Positives = 226/589 (38%), Gaps = 108/589 (18%)

Query: 66   FVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMT 125
            ++SDR K       +P G + + +  F++  G + VF C   P  ++ +S  +L    + 
Sbjct: 633  YISDRKK-------VPLGTQPTSLSVFTS-GGSRTVFACSDRPTVVY-SSNKKLVFSNVN 683

Query: 126  IDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAY 185
            +   VS + P  +   P      N  +     +L   +       +R VPL  +P  +AY
Sbjct: 684  LK-EVSHMCPLDSDGYPDSLALANDNT-----LLIGTIDEIQKLHIRTVPLYESPRRIAY 737

Query: 186  HLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIP-----PLVSQFHVSLFSPF 240
              E++ + +VT      TD     G DK  +T P  S         P V    V  FS  
Sbjct: 738  QEESQCFGLVTL----RTDSVDATG-DKMKITRPSASTQASVCTKSPPVDGRSVEGFSAT 792

Query: 241  ----SWEEIPQTNFPLHEW------EHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSED 290
                S   I Q  F +H        E  L + +  +      S    Y  +GT + Y E+
Sbjct: 793  ADIGSLLIIDQHTFEVHHAYQLDTNEEPLSIMSCKLG-----SDPNSYFVVGTAFVYMEE 847

Query: 291  VTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIY 349
               + GRIL+F  I+           NK+ ++  KE KG V  +C   G ++ A+   + 
Sbjct: 848  TEPKHGRILVFHYID-----------NKLTLVAEKEVKGAVFCLCQFNGHVLAAINTSVS 896

Query: 350  IWQ-LKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDY 408
            I+Q   + +L       + +    +    + +LVGD  RS+++L Y+     L  +A+DY
Sbjct: 897  IYQWTTEKELRAECSNQSNILALYLKCKGDFVLVGDLMRSMSILNYKHVEGNLDEIAKDY 956

Query: 409  KPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFM 468
             P                       W                      +ILD+ + +G  
Sbjct: 957  SPN----------------------WM------------------TAVEILDDDNFLG-- 974

Query: 469  ISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSR 528
             ++   NV +      A       +L +   FH+G  +NTF        ++ +     S+
Sbjct: 975  -AENFYNVFICQKDSGATTDEERSKLREAALFHVGDSINTFRHGSLVMQNVGET-AVSSK 1032

Query: 529  FLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGN 588
                + ++ G++G    + E  Y  L  +QN +       G ++  ++R++        +
Sbjct: 1033 GHILFGTVHGSIGVITTVDEDLYAFLHSIQNRLAKVIKSVGNIDHESWRSFCTNEKTEAH 1092

Query: 589  PSRGIIDGSLVWKFLQLSLGERLEICKKI---------GSKHNDILDEL 628
              RG +DG L+  FL L+  +  E+ K +         G+K    +D+L
Sbjct: 1093 --RGFVDGDLIECFLDLNREKMAEVAKGLMVKNFNDQHGTKREATVDDL 1139


>gi|255080490|ref|XP_002503825.1| predicted protein [Micromonas sp. RCC299]
 gi|226519092|gb|ACO65083.1| predicted protein [Micromonas sp. RCC299]
          Length = 1114

 Score = 84.3 bits (207), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 106/484 (21%), Positives = 189/484 (39%), Gaps = 116/484 (23%)

Query: 171  VRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVS 230
            +R VPL   P  +A+  ET+TY  +T       + +  NG                    
Sbjct: 727  IRTVPLGEQPRRIAHQPETRTYAALT-------ENFDENG-------------------- 759

Query: 231  QFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSED 290
             + V LF   ++E + +      E +  +    +S  +       R Y  +GT Y+  E+
Sbjct: 760  -YFVRLFDDVTFETLCKFRLEPDEQDSSV----ISCAFA---DDPRVYYVVGTGYSLPEE 811

Query: 291  VT-CRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIY 349
                RGRIL+F       E G      K++++  KE KG V  +    G L+  +  K+ 
Sbjct: 812  PEPTRGRILVFR-----AEDG------KLQLVAEKEVKGAVYNLNAFNGKLLAGINSKVE 860

Query: 350  IWQLKDNDLTGIAFIDTEVY------------IASMVSVKN-LILVGDYARSIALLRYQP 396
            ++  +  D  G        Y            +A  V+V+   I+VGD  +S++LL Y+P
Sbjct: 861  LF--RGGDPVGADGAGGSTYELAKECSHHGHIVALYVAVRGEFIVVGDLMKSVSLLAYKP 918

Query: 397  EYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHN 456
            E   +   ARDY                         W                      
Sbjct: 919  EESVIEERARDYNAN----------------------WM------------------TAV 938

Query: 457  DILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFK----I 512
            DILD+ + +G   ++ + N+     Q +A       RL    ++H+G+ VN F +    +
Sbjct: 939  DILDDDTYLG---AENNFNLFTLRRQSDAATDEERSRLEVVGEYHVGEFVNRFRRGSLVM 995

Query: 513  RCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 572
            R      +D P         + ++ G +G    LP + +  L  LQ  +    S  GGL+
Sbjct: 996  RLPDQENADVP------TLLFGTVSGVIGVLATLPREQFEFLSALQAALNKTVSGVGGLS 1049

Query: 573  PRAFRTYKGK-GYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI 631
              A+R+++ +  + A + +RG +DG L+  FL L   +  E+   +    +++   + D+
Sbjct: 1050 HDAWRSFQNEHRHRAKDGARGFVDGDLIESFLDLRPEKAREVAAAVKLSVDELTRRVEDL 1109

Query: 632  EALS 635
            + L+
Sbjct: 1110 QRLT 1113


>gi|412992547|emb|CCO18527.1| predicted protein [Bathycoccus prasinos]
          Length = 1275

 Score = 82.4 bits (202), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 107/494 (21%), Positives = 190/494 (38%), Gaps = 104/494 (21%)

Query: 171  VRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVS 230
            +R VPL+  P  +A+  ETKT  ++T                            +P    
Sbjct: 856  IRTVPLREQPRRIAHQPETKTLAVLTMKESD-----------------------VPGQEE 892

Query: 231  QFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYN--YS 288
            +F V LF   ++E + +  +PL   E+   + + S + +  +     Y  +GT +   +S
Sbjct: 893  EFFVRLFDNKTFETLAK--YPLEPNENDASIISCSFDGDDDI-----YFVVGTAFADPHS 945

Query: 289  EDVTCRGRILLFDIIEVVPEPG------------------QPLTKNKIKMIYAKEQKGPV 330
            E  + RGRIL+F +       G                    + +  + ++  KE +G V
Sbjct: 946  EPESSRGRILVFKVSNTSSSGGGNAVVNGNDHGDGRASASSSVLQKSLTLVCEKETRGAV 1005

Query: 331  TAICHVAGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEV--YIASMVSVK-NLILVGDY 385
              +    G L+  +    K++ W +   +   +    + +   IA  V  K NLI+VGD 
Sbjct: 1006 YNLNAFCGKLLAGINSLVKLFNWGVSKENKRELVHECSHMGHIIALKVETKDNLIVVGDL 1065

Query: 386  ARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERL 445
             +SI LL+YQ E   +  VA D+                         W           
Sbjct: 1066 MKSITLLQYQRESGRIEEVAHDFSSN----------------------WMTAV------- 1096

Query: 446  EICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQH 505
                       +ILD+ + +G   ++   N+       +A   +    L     FHLG  
Sbjct: 1097 -----------EILDDNTYLG---AESSYNLFTVQRNADADTEDKRGTLELCGAFHLGDS 1142

Query: 506  VNTFFK--IRCKPSSISDAPGARSRFLTW-YASLDGALGFFLPLPEKNYRRLLMLQNVMV 562
            VN F +  +  +   +SD   + S   TW + ++ G LG    LP++++  L  +Q  M 
Sbjct: 1143 VNRFRRGSLVMRMPDLSDDTSSLSEISTWLFGTISGGLGVVATLPKRDFMLLNKVQEAMQ 1202

Query: 563  THTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG-SKH 621
               +  G  +   FR++           R  IDG LV  FL LS  +++ + +  G S  
Sbjct: 1203 KVVTGVGNFSHSDFRSFHNVQRSV--EMRNFIDGDLVEIFLDLSKEDQVAVSELSGVSNS 1260

Query: 622  NDILDELYDIEALS 635
             D++ ++ +I  L+
Sbjct: 1261 EDLVKKIEEISRLT 1274


>gi|301630307|ref|XP_002944263.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1-like [Xenopus (Silurana) tropicalis]
          Length = 92

 Score = 81.3 bits (199), Expect = 2e-12,   Method: Composition-based stats.
 Identities = 43/93 (46%), Positives = 59/93 (63%), Gaps = 1/93 (1%)

Query: 546 LPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQL 605
           + EK YRRLLMLQN + T   H  GLNPRAFR          NP R ++DG L+ ++L L
Sbjct: 1   MQEKTYRRLLMLQNAL-TVLPHHAGLNPRAFRMLNSSRRMLQNPVRNVLDGELLNRYLYL 59

Query: 606 SLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
           S  ER E+ +KIG+  + ILD+L +I+ ++S F
Sbjct: 60  SNMERSELARKIGTTTDIILDDLLEIDRVTSLF 92



 Score = 40.0 bits (92), Expect = 4.2,   Method: Composition-based stats.
 Identities = 20/49 (40%), Positives = 30/49 (61%)

Query: 414 NSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEF 462
           NS      NP R ++DG L+ ++L LS  ER E+ +KIG+  + ILD+ 
Sbjct: 34  NSSRRMLQNPVRNVLDGELLNRYLYLSNMERSELARKIGTTTDIILDDL 82


>gi|401828022|ref|XP_003888303.1| pre-mRNA cleavage and polyadenylation specificity factor
            [Encephalitozoon hellem ATCC 50504]
 gi|392999575|gb|AFM99322.1| pre-mRNA cleavage and polyadenylation specificity factor
            [Encephalitozoon hellem ATCC 50504]
          Length = 1155

 Score = 81.3 bits (199), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 80/367 (21%), Positives = 154/367 (41%), Gaps = 56/367 (15%)

Query: 211  EDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEG 270
            E+ E+ ++      IP    +F+V L+S  + E I    + L E E+V  +K + ++   
Sbjct: 788  EEVEVSSNNEKDCGIPVNTYRFYVDLYSE-NHEHI--DTYELEENEYVFDVKYLILDDMQ 844

Query: 271  TLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPV 330
               G   ++ + T +   ED   +GR+ + +II VVP P  P    K+K++  ++ KG +
Sbjct: 845  GNYGKSPFLLICTTFIEGEDKPAKGRLHVLEIISVVPSPESPFKDCKLKVLGIEKTKGSI 904

Query: 331  TAICHVAGFLVTAVGQKIYIWQL-KDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSI 389
                 + G +   +G KI I+++ + N +  I F D  ++ +S+  VKN IL  D  R +
Sbjct: 905  VQCSEIRGKIALCLGTKIMIYKIDRSNGIIPIGFYDLHIFTSSISVVKNYILASDIYRGL 964

Query: 390  ALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICK 449
            +   +Q +   L L++              +  P + +    L+    +LS    +  C 
Sbjct: 965  SFFFFQSKPIRLHLIS--------------SSEPLKNVTSTELLIAGNELS----MVCCD 1006

Query: 450  KIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 509
              G+ H                       + Y P    S  G +L+K+ +  +  ++   
Sbjct: 1007 SKGTIH----------------------AYTYSPNNIISMDGAKLVKRAE--MKTNLGRL 1042

Query: 510  FKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
            F         S   G R   + +Y+  + ++     + + NY +LL +Q  ++ H     
Sbjct: 1043 F---------SSGIGFRKNSIMFYSKTNLSI-HLAGIDDLNYPKLLEIQTSIMVHLKSVL 1092

Query: 570  GLNPRAF 576
            GLN R +
Sbjct: 1093 GLNQRDY 1099


>gi|62318656|dbj|BAD95136.1| UV-damaged DNA-binding protein- like [Arabidopsis thaliana]
          Length = 1088

 Score = 81.3 bits (199), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 124/546 (22%), Positives = 218/546 (39%), Gaps = 116/546 (21%)

Query: 83   GVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCP 142
            G R   +R FS+ +    VF     PA ++  ++  L ++    +  VS + PF++   P
Sbjct: 626  GTRPITLRTFSSKSATH-VFAASDRPAVIYSNNKKLLYSNVSLKE--VSHMCPFNSAAFP 682

Query: 143  RGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPS 202
               L    + EL I  +           +R +P+      + +  +T+T+ I     EPS
Sbjct: 683  DS-LAIAREGELTIGTIDDI----QKLHIRTIPIGEHARRICHQEQTRTFAISCLRNEPS 737

Query: 203  TDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLK 262
             +                +S F+  L +Q         S+E +  +++PL  +E    + 
Sbjct: 738  AE--------------ESESHFVRLLDAQ---------SFEFL--SSYPLDAFECGCSIL 772

Query: 263  NVSMEYEGTLSGLRGYIALGTNYNY-SEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMI 321
            + S   +  +     Y  +GT Y    E+   +GRIL+F I+E          + ++++I
Sbjct: 773  SCSFTDDKNV-----YYCVGTAYVLPEENEPTKGRILVF-IVE----------EGRLQLI 816

Query: 322  YAKEQKGPVTAICHVAGFLVTAVGQKI--YIWQLKDNDLTGIAFIDTEV-----YIASMV 374
              KE KG V ++    G L+ ++ QKI  Y W L+D+   G   + +E       +A  V
Sbjct: 817  TEKETKGAVYSLNAFNGKLLASINQKIQLYKWMLRDD---GTRELQSECGHHGHILALYV 873

Query: 375  SVK-NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLV 433
              + + I VGD  +SI+LL Y+ E   +   ARDY                         
Sbjct: 874  QTRGDFIAVGDLMKSISLLIYKHEEGAIEERARDYNAN---------------------- 911

Query: 434  WKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR 493
            W          +EI       ++DI        ++ +D   N+       E        R
Sbjct: 912  WM-------AAVEIL------NDDI--------YLGTDNCFNIFTVKKNNEGATDEERAR 950

Query: 494  LIKKTDFHLGQHVNTF----FKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEK 549
            +    ++H+G+ VN F      ++   S I   P         + ++ G +G    LP++
Sbjct: 951  MEVVGEYHIGEFVNRFRHGSLVMKLPDSDIGQIP------TVIFGTVSGMIGVIASLPQE 1004

Query: 550  NYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGE 609
             Y  L  LQ  +       GGL+   +R++  +   A   ++G +DG L+  FL LS G+
Sbjct: 1005 QYAFLEKLQTSLRKVIKGVGGLSHEQWRSFNNEKRTA--EAKGYLDGDLIESFLDLSRGK 1062

Query: 610  RLEICK 615
              EI K
Sbjct: 1063 MEEISK 1068


>gi|297799958|ref|XP_002867863.1| hypothetical protein ARALYDRAFT_492777 [Arabidopsis lyrata subsp.
            lyrata]
 gi|297313699|gb|EFH44122.1| hypothetical protein ARALYDRAFT_492777 [Arabidopsis lyrata subsp.
            lyrata]
          Length = 1088

 Score = 80.9 bits (198), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 124/559 (22%), Positives = 219/559 (39%), Gaps = 118/559 (21%)

Query: 71   SKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPV 130
            S +  ++  +  G +   +R FS+ +    VF     PA ++  ++  L ++    +  V
Sbjct: 614  SGKLRDRKKVSLGTQPITLRTFSSKSATH-VFAASDRPAVIYSNNKKLLYSNVNLKE--V 670

Query: 131  STLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETK 190
            S + PF++   P   L    + EL I  +           +R +P+      + +  +T+
Sbjct: 671  SHMCPFNSAAFPDS-LAIAREGELTIGTIDDI----QKLHIRTIPIGEHARRICHQEQTR 725

Query: 191  TYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFH-VSLFSPFSWEEIPQTN 249
            T+ I     +PS +                         S+ H V L    S+E +  + 
Sbjct: 726  TFAICCLRNQPSAEE------------------------SEMHFVRLLDAQSFEFL--ST 759

Query: 250  FPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNY-SEDVTCRGRILLFDIIEVVPE 308
            +PL  +E+   + + S   +  +     Y  +GT Y    E+   +GRIL+F I+E    
Sbjct: 760  YPLDAFEYGCSILSCSFTDDKNV-----YYCVGTAYVLPEENEPTKGRILVF-IVE---- 809

Query: 309  PGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKI--YIWQLKDNDLTGIAFIDT 366
                  + ++++I  KE KG V ++    G L+ A+ QKI  Y W L+D+   G   + +
Sbjct: 810  ------EGRLQLITEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDD---GTRELQS 860

Query: 367  EV-----YIASMVSVK-NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYA 420
            E       +A  V  + + I+VGD  +SI+LL Y+ E   +   ARDY            
Sbjct: 861  ECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNAN--------- 911

Query: 421  GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFM 480
                         W                      +ILD+   +G   +D   N+    
Sbjct: 912  -------------WM------------------AAVEILDDDIYLG---ADNCFNLFTVK 937

Query: 481  YQPEARESNGGHRLIKKTDFHLGQHVNTF----FKIRCKPSSISDAPGARSRFLTWYASL 536
               E        R+    ++H+G+ VN F      +R   S I   P         + ++
Sbjct: 938  KNNEGATDEERARMEVVGEYHIGEFVNRFRHGSLVMRLPDSEIGQIP------TVIFGTV 991

Query: 537  DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDG 596
             G +G    LP++ Y  L  LQ  +       GGL+   +R++  +   A   ++  +DG
Sbjct: 992  SGMIGVIASLPQEQYAFLEKLQTSLRKVIKGVGGLSHEQWRSFNNEKRTA--EAKSYLDG 1049

Query: 597  SLVWKFLQLSLGERLEICK 615
             L+  FL LS G+  EI K
Sbjct: 1050 DLIESFLDLSRGKMEEISK 1068


>gi|15233515|ref|NP_193842.1| DNA damage-binding protein 1b [Arabidopsis thaliana]
 gi|73620956|sp|O49552.2|DDB1B_ARATH RecName: Full=DNA damage-binding protein 1b; AltName: Full=UV-damaged
            DNA-binding protein 1b; Short=DDB1b
 gi|110739453|dbj|BAF01636.1| UV-damaged DNA-binding protein- like [Arabidopsis thaliana]
 gi|332659001|gb|AEE84401.1| DNA damage-binding protein 1b [Arabidopsis thaliana]
          Length = 1088

 Score = 80.9 bits (198), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 124/546 (22%), Positives = 218/546 (39%), Gaps = 116/546 (21%)

Query: 83   GVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCP 142
            G R   +R FS+ +    VF     PA ++  ++  L ++    +  VS + PF++   P
Sbjct: 626  GTRPITLRTFSSKSATH-VFAASDRPAVIYSNNKKLLYSNVNLKE--VSHMCPFNSAAFP 682

Query: 143  RGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPS 202
               L    + EL I  +           +R +P+      + +  +T+T+ I     EPS
Sbjct: 683  DS-LAIAREGELTIGTIDDI----QKLHIRTIPIGEHARRICHQEQTRTFAISCLRNEPS 737

Query: 203  TDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLK 262
             +                +S F+  L +Q         S+E +  +++PL  +E    + 
Sbjct: 738  AE--------------ESESHFVRLLDAQ---------SFEFL--SSYPLDAFECGCSIL 772

Query: 263  NVSMEYEGTLSGLRGYIALGTNYNY-SEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMI 321
            + S   +  +     Y  +GT Y    E+   +GRIL+F I+E          + ++++I
Sbjct: 773  SCSFTDDKNV-----YYCVGTAYVLPEENEPTKGRILVF-IVE----------EGRLQLI 816

Query: 322  YAKEQKGPVTAICHVAGFLVTAVGQKI--YIWQLKDNDLTGIAFIDTEV-----YIASMV 374
              KE KG V ++    G L+ ++ QKI  Y W L+D+   G   + +E       +A  V
Sbjct: 817  TEKETKGAVYSLNAFNGKLLASINQKIQLYKWMLRDD---GTRELQSECGHHGHILALYV 873

Query: 375  SVK-NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLV 433
              + + I VGD  +SI+LL Y+ E   +   ARDY                         
Sbjct: 874  QTRGDFIAVGDLMKSISLLIYKHEEGAIEERARDYNAN---------------------- 911

Query: 434  WKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR 493
            W          +EI       ++DI        ++ +D   N+       E        R
Sbjct: 912  WM-------TAVEIL------NDDI--------YLGTDNCFNIFTVKKNNEGATDEERAR 950

Query: 494  LIKKTDFHLGQHVNTF----FKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEK 549
            +    ++H+G+ VN F      ++   S I   P         + ++ G +G    LP++
Sbjct: 951  MEVVGEYHIGEFVNRFRHGSLVMKLPDSDIGQIP------TVIFGTVSGMIGVIASLPQE 1004

Query: 550  NYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGE 609
             Y  L  LQ  +       GGL+   +R++  +   A   ++G +DG L+  FL LS G+
Sbjct: 1005 QYAFLEKLQTSLRKVIKGVGGLSHEQWRSFNNEKRTA--EAKGYLDGDLIESFLDLSRGK 1062

Query: 610  RLEICK 615
              EI K
Sbjct: 1063 MEEISK 1068


>gi|449019486|dbj|BAM82888.1| similar to cleavage and polyadenylation specificity factor subunit
            [Cyanidioschyzon merolae strain 10D]
          Length = 1880

 Score = 80.5 bits (197), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 86/429 (20%), Positives = 174/429 (40%), Gaps = 56/429 (13%)

Query: 218  DPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNV-----SMEYEGTL 272
            D    R +P L+ Q  V L +  S E + + +    E    +C   +     + + E   
Sbjct: 1481 DSGRERDLPLLIDQHAVVLLARNSLEVLARYDLEQTEVGLAMCATRIRHFQRTGDDEAPR 1540

Query: 273  SGLRGYIALGTNYNYSEDVTCRGRILLFDII--EVVPEPGQPLTKNKIKMIYAKEQKGPV 330
               R  + +GT +   ED + RGR+L+F+I   E         T  +++ + A E KG V
Sbjct: 1541 FTERDVLVVGTCFLRGEDTSIRGRLLVFEISRQEGRQHHQHQRTLYQMQTLAATEVKGAV 1600

Query: 331  TAICHV-AGFLVTAVGQKIYIWQLKDNDLTGIAFI-DTEVYIASMVSVKNLILVGDYARS 388
            +A+  V  GF+  + G ++ +++L +++++ I+F     ++ + + ++K  IL  D    
Sbjct: 1601 SAVAPVKGGFVCCSAGPRLEVYKLIEDEMSCISFYPGINLFFSHVGTLKQYILASDMRYG 1660

Query: 389  IALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEIC 448
            ++ L ++    + + + RD    +  +  +        ++   ++   ++LS+       
Sbjct: 1661 VSFLFWRSRNVSQNFLCRDEAQRELVASEWLMHGTKANVLSADMLGNIIELSI------- 1713

Query: 449  KKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNT 508
                                              P   ES GG R+  +  FH+G   N 
Sbjct: 1714 --------------------------------PSPVDPESAGGTRMTFEAGFHVGSRPNA 1741

Query: 509  FFKIRC-KPSSISDAPGARSRF----LTWYASLDGALGFFLPLPEKNYRRL-LMLQNVMV 562
              ++R   PS+ +  P   S      +    ++DG +    PL     ++L L  Q++M+
Sbjct: 1742 VRRVRIDDPSAETPPPNEPSSLWNTHVILLGTVDGMITMVSPLLRGVAKKLELAAQDLML 1801

Query: 563  THTSHTGGLNPRAFRTYKGKGYYAG--NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 620
                    L  R++R  +     AG   P R I+DG ++  +  L    R EI ++IG  
Sbjct: 1802 EPELRKWCLYARSWRVMRSLTVAAGLRKPKRSILDGDVLQLYGSLDTPRRKEIARRIGMP 1861

Query: 621  HNDILDELY 629
               + + ++
Sbjct: 1862 QEALFEAIF 1870


>gi|297809743|ref|XP_002872755.1| UV-damaged DNA-binding protein 1A [Arabidopsis lyrata subsp. lyrata]
 gi|297318592|gb|EFH49014.1| UV-damaged DNA-binding protein 1A [Arabidopsis lyrata subsp. lyrata]
          Length = 1088

 Score = 80.5 bits (197), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 122/559 (21%), Positives = 223/559 (39%), Gaps = 116/559 (20%)

Query: 89   MRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYF 148
            +R FS+ +    VF     P  ++ +++  L ++    +  VS + PF++   P   L  
Sbjct: 632  LRTFSSKSATH-VFAASDRPTVIYSSNKKLLYSNVNLKE--VSHMCPFNSAAFPDS-LAI 687

Query: 149  NAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKF 208
              + EL I  +           +R +PL      + +  +T+T+ I +   +        
Sbjct: 688  AREGELTIGTIDDI----QKLHIRTIPLGEHARRICHQEQTRTFGICSLGNQS------- 736

Query: 209  NGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEY 268
            N E+ E+        F+  L  Q        F +     + +PL  +E+   + + S   
Sbjct: 737  NAEESEM-------HFVRLLDDQ-------TFEF----MSTYPLDSFEYGCSILSCSFTD 778

Query: 269  EGTLSGLRGYIALGTNYNY-SEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQK 327
            +  +     Y  +GT Y    E+   +GRIL+F     + E G      ++++I  KE K
Sbjct: 779  DKNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVEDG------RLQLIAEKETK 822

Query: 328  GPVTAICHVAGFLVTAVGQKI--YIWQLKDNDLTGIAFIDTEV-----YIASMVSVK-NL 379
            G V ++    G L+ A+ QKI  Y W L+D+   G   + +E       +A  V  + + 
Sbjct: 823  GAVYSLNAFNGKLLAAINQKIQLYKWMLRDD---GTRELQSECGHHGHILALYVQTRGDF 879

Query: 380  ILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQL 439
            I+VGD  +SI+LL Y+ E   +   ARDY     ++                        
Sbjct: 880  IVVGDLMKSISLLLYKHEEGAIEERARDYNANWMSAV----------------------- 916

Query: 440  SLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTD 499
                             +ILD+   +G   ++ + N+V      E        RL    +
Sbjct: 917  -----------------EILDDDIYLG---AENNFNLVTVKKNSEGATDEERGRLEVVGE 956

Query: 500  FHLGQHVNTF----FKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLL 555
            +HLG+ VN F      +R   S I   P         + +++G +G    LP++ Y  L 
Sbjct: 957  YHLGEFVNRFRHGSLVMRLPDSEIGQIP------TVIFGTVNGVIGVIASLPQEQYTFLE 1010

Query: 556  MLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICK 615
             LQ+ +       GGL+   +R++  +   A   +R  +DG L+  FL LS  +  +I K
Sbjct: 1011 KLQSSLRKVIKGVGGLSHEQWRSFNNEKRTA--EARNFLDGDLIESFLDLSRNKMEDISK 1068

Query: 616  KIGSKHNDILDELYDIEAL 634
             +  +  ++   + ++  L
Sbjct: 1069 SMNVQVEELCKRVEELTRL 1087


>gi|2911067|emb|CAA17529.1| UV-damaged DNA-binding protein-like [Arabidopsis thaliana]
 gi|7268907|emb|CAB79110.1| UV-damaged DNA-binding protein-like [Arabidopsis thaliana]
          Length = 1102

 Score = 80.5 bits (197), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 124/546 (22%), Positives = 218/546 (39%), Gaps = 116/546 (21%)

Query: 83   GVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCP 142
            G R   +R FS+ +    VF     PA ++  ++  L ++    +  VS + PF++   P
Sbjct: 640  GTRPITLRTFSSKSATH-VFAASDRPAVIYSNNKKLLYSNVNLKE--VSHMCPFNSAAFP 696

Query: 143  RGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPS 202
               L    + EL I  +           +R +P+      + +  +T+T+ I     EPS
Sbjct: 697  DS-LAIAREGELTIGTIDDI----QKLHIRTIPIGEHARRICHQEQTRTFAISCLRNEPS 751

Query: 203  TDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLK 262
             +                +S F+  L +Q         S+E +  +++PL  +E    + 
Sbjct: 752  AE--------------ESESHFVRLLDAQ---------SFEFL--SSYPLDAFECGCSIL 786

Query: 263  NVSMEYEGTLSGLRGYIALGTNYNY-SEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMI 321
            + S   +  +     Y  +GT Y    E+   +GRIL+F I+E          + ++++I
Sbjct: 787  SCSFTDDKNV-----YYCVGTAYVLPEENEPTKGRILVF-IVE----------EGRLQLI 830

Query: 322  YAKEQKGPVTAICHVAGFLVTAVGQKI--YIWQLKDNDLTGIAFIDTEV-----YIASMV 374
              KE KG V ++    G L+ ++ QKI  Y W L+D+   G   + +E       +A  V
Sbjct: 831  TEKETKGAVYSLNAFNGKLLASINQKIQLYKWMLRDD---GTRELQSECGHHGHILALYV 887

Query: 375  SVK-NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLV 433
              + + I VGD  +SI+LL Y+ E   +   ARDY                         
Sbjct: 888  QTRGDFIAVGDLMKSISLLIYKHEEGAIEERARDYNAN---------------------- 925

Query: 434  WKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR 493
            W          +EI       ++DI        ++ +D   N+       E        R
Sbjct: 926  WM-------TAVEIL------NDDI--------YLGTDNCFNIFTVKKNNEGATDEERAR 964

Query: 494  LIKKTDFHLGQHVNTF----FKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEK 549
            +    ++H+G+ VN F      ++   S I   P         + ++ G +G    LP++
Sbjct: 965  MEVVGEYHIGEFVNRFRHGSLVMKLPDSDIGQIP------TVIFGTVSGMIGVIASLPQE 1018

Query: 550  NYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGE 609
             Y  L  LQ  +       GGL+   +R++  +   A   ++G +DG L+  FL LS G+
Sbjct: 1019 QYAFLEKLQTSLRKVIKGVGGLSHEQWRSFNNEKRTA--EAKGYLDGDLIESFLDLSRGK 1076

Query: 610  RLEICK 615
              EI K
Sbjct: 1077 MEEISK 1082


>gi|312283457|dbj|BAJ34594.1| unnamed protein product [Thellungiella halophila]
          Length = 1088

 Score = 80.5 bits (197), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 115/518 (22%), Positives = 205/518 (39%), Gaps = 113/518 (21%)

Query: 130  VSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLET 189
            VS + PF++   P   L    + EL I  +           +R +PL      + +  +T
Sbjct: 670  VSHMCPFNSAAFPDS-LAIAREGELTIGTIDDI----QKLHIRTIPLGEHARRICHQEQT 724

Query: 190  KTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTN 249
            +T+ I +   +        N E+ E+                  V L    S+E +  + 
Sbjct: 725  RTFGICSLGNQT-------NAEESEM----------------HFVRLLDDQSFEFV--ST 759

Query: 250  FPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNY-SEDVTCRGRILLFDIIEVVPE 308
            +PL  +E+   + + S   +  +     Y  +GT Y    E+   +GRIL+F     + E
Sbjct: 760  YPLDAFEYGCSILSCSFADDKNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 809

Query: 309  PGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKI--YIWQLKDNDLTGIAFIDT 366
             G      K+++I  KE KG V ++    G L+ A+ QKI  Y W L+D+   G   + +
Sbjct: 810  DG------KLQLIAEKETKGSVYSLNAFNGKLLAAINQKIQLYKWMLRDD---GTRELQS 860

Query: 367  EV-----YIASMVSVK-NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYA 420
            E       +A  V  + + I+VGD  +SI+LL Y+ E   +   ARDY     ++     
Sbjct: 861  ECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAV---- 916

Query: 421  GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFM 480
                                                +ILD+   +G   ++ + N++   
Sbjct: 917  ------------------------------------EILDDDIYLG---AENNFNLLTVK 937

Query: 481  YQPEARESNGGHRLIKKTDFHLGQHVNTF----FKIRCKPSSISDAPGARSRFLTWYASL 536
               E        RL    ++HLG+ VN F      +R   S I   P         + ++
Sbjct: 938  KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSEIGQIP------TVIFGTV 991

Query: 537  DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDG 596
            +G +G    LP++ Y  L  LQ+ +       GGL+   +R++  +   A   +R  +DG
Sbjct: 992  NGVIGVIASLPQEQYMFLEKLQSSLRKVIKGVGGLSHEQWRSFNNEKRTA--EARNFLDG 1049

Query: 597  SLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEAL 634
             L+  FL LS  +  +I K +  +  ++   + ++  L
Sbjct: 1050 DLIESFLDLSRNKMEDISKSMNVQVEELCKRVEELTRL 1087


>gi|110741229|dbj|BAF02165.1| UV-damaged DNA binding factor - like protein [Arabidopsis thaliana]
          Length = 727

 Score = 80.1 bits (196), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 121/559 (21%), Positives = 223/559 (39%), Gaps = 116/559 (20%)

Query: 89  MRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYF 148
           +R FS+ +    VF     P  ++ +++  L ++    +  VS + PF++   P   L  
Sbjct: 271 LRTFSSKSATH-VFAASDRPTVIYSSNKKLLYSNVNLKE--VSHMCPFNSAAFPDS-LAI 326

Query: 149 NAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKF 208
             + EL I  +           +R +PL      + +  +T+T+ I +   +        
Sbjct: 327 AREGELTIGTIDDI----QKLHIRTIPLGEHARRICHQEQTRTFGICSLGNQS------- 375

Query: 209 NGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEY 268
           N E+ E+        F+  L  Q        F +     + +PL  +E+   + + S   
Sbjct: 376 NSEESEM-------HFVRLLDDQ-------TFEF----MSTYPLDSFEYGCSILSCSFTE 417

Query: 269 EGTLSGLRGYIALGTNYNY-SEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQK 327
           +  +     Y  +GT Y    E+   +GRIL+F     + E G      ++++I  KE K
Sbjct: 418 DKNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVEDG------RLQLIAEKETK 461

Query: 328 GPVTAICHVAGFLVTAVGQKI--YIWQLKDNDLTGIAFIDTEV-----YIASMVSVK-NL 379
           G V ++    G L+ A+ QKI  Y W L+D+   G   + +E       +A  V  + + 
Sbjct: 462 GAVYSLNAFNGKLLAAINQKIQLYKWMLRDD---GTRELQSECGHHGHILALYVQTRGDF 518

Query: 380 ILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQL 439
           I+VGD  +SI+LL Y+ E   +   ARDY     ++                        
Sbjct: 519 IVVGDLMKSISLLLYKHEEGAIEERARDYNANWMSAV----------------------- 555

Query: 440 SLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTD 499
                            +ILD+   +G   ++ + N++      E        RL    +
Sbjct: 556 -----------------EILDDDIYLG---AENNFNLLTVKKNSEGATDEERGRLEVVGE 595

Query: 500 FHLGQHVNTF----FKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLL 555
           +HLG+ VN F      +R   S I   P         + +++G +G    LP++ Y  L 
Sbjct: 596 YHLGEFVNRFRHGSLVMRLPDSEIGQIP------TVIFGTVNGVIGVIASLPQEQYTFLE 649

Query: 556 MLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICK 615
            LQ+ +       GGL+   +R++  +   A   +R  +DG L+  FL LS  +  +I K
Sbjct: 650 KLQSSLRKVIKGVGGLSHEQWRSFNNEKRTA--EARNFLDGDLIESFLDLSRNKMEDISK 707

Query: 616 KIGSKHNDILDELYDIEAL 634
            +  +  ++   + ++  L
Sbjct: 708 SMNVQVEELCKRVEELTRL 726


>gi|300707023|ref|XP_002995737.1| hypothetical protein NCER_101290 [Nosema ceranae BRL01]
 gi|239604943|gb|EEQ82066.1| hypothetical protein NCER_101290 [Nosema ceranae BRL01]
          Length = 1155

 Score = 80.1 bits (196), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 87/379 (22%), Positives = 158/379 (41%), Gaps = 73/379 (19%)

Query: 248  TNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVP 307
            + F L   E+VL +K +S+     ++G   +I +       ED   RGRI++F++I+++ 
Sbjct: 830  STFDLESDEYVLDIKELSLNDSIGINGKNNFIVICVTKVEGEDKHSRGRIIVFELIDIIV 889

Query: 308  EPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDND-LTGIAFIDT 366
            +        K+K++ ++  KG +T    + G L+ A+G K  I+++  ++ L  I   D 
Sbjct: 890  DKANVHKDKKLKVLASENIKGCITKCDEIKGNLIVALGIKTMIYKIDRSEGLIPIGIHDL 949

Query: 367  EVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRG 426
                 SM+++KN +L  D  R ++   YQ +   L+LV      T  + K          
Sbjct: 950  YTLTTSMITIKNFVLFSDIYRGLSFFYYQNKPVRLNLVC-----TSESIK---------- 994

Query: 427  IIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEAR 486
                                      + H D + +  ++G + +D   N+  + Y P   
Sbjct: 995  -------------------------NAVHVDFIVKEPALGIICTDFAGNIHTYTYSPVNI 1029

Query: 487  ESNGGHRLIKK--TDFHLGQHVNTFFKIR------CKPSSISDAPGARSRFLTWYASLDG 538
             S  G + +K+  T+F+LG+ V     I+        P  ISD+         +   LD 
Sbjct: 1030 LSCNGTKFVKRCETNFNLGKLV-----IKRAHSKLLNPVFISDS--------NYIIELDS 1076

Query: 539  ALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPS-RGIIDGS 597
                   L   NY   L +QN  ++    T GL P  F   +   Y+   PS +  I   
Sbjct: 1077 -------LSLDNYNNFLKVQNAYLSLIEDTFGLCPENFNNCE---YHLKPPSVKKPILKE 1126

Query: 598  LVWKFLQLSLGERLEICKK 616
            L+++FL L + ++    K+
Sbjct: 1127 LLFRFLHLPVDKKANFLKE 1145


>gi|449328561|gb|AGE94838.1| cleavage and polyadenylation specific factor [Encephalitozoon
            cuniculi]
          Length = 1156

 Score = 80.1 bits (196), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 86/355 (24%), Positives = 144/355 (40%), Gaps = 60/355 (16%)

Query: 225  IPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTN 284
            IP    +F+V L+S   +E I    + L E E+V  +K + ++      G   ++ + T 
Sbjct: 803  IPVDTYRFYVDLYSE-KYEHID--TYELDENEYVFHIKYLILDDMQGNYGKSPFLLVCTT 859

Query: 285  YNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAV 344
            +   ED   RGR+ + +II VVP    P    K+K++  ++ KG +     V G +   +
Sbjct: 860  FIEGEDRPARGRLHVLEIISVVPSLESPFKDCKLKVLGIEKTKGSIVRCEEVRGKIALCL 919

Query: 345  GQKIYIWQL-KDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSL 403
            G KI I+++ + N +  I F D  ++ +S+  VKN IL  D  R ++   +Q +   L L
Sbjct: 920  GTKIMIYKIDRSNGIIPIGFYDLHIFTSSISVVKNYILASDIYRGLSFFFFQSKPIRLHL 979

Query: 404  VARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEI--CKKIGSKHNDILDE 461
            ++              +  P R      L      LS G  L +  C   G+ H      
Sbjct: 980  IS--------------SSEPLRNATSTEL------LSTGNELSMLCCDAKGTIHG----- 1014

Query: 462  FSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISD 521
                             + Y P    S  G RL+K+ +        + F    K +SI  
Sbjct: 1015 -----------------YTYSPNNIISMDGARLVKRAEIKTNLGRLSSFGAGFKKNSI-- 1055

Query: 522  APGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAF 576
               +RS  L   + +D A          +Y +LL +Q  ++ H     GLN R +
Sbjct: 1056 MFYSRSNMLIHVSGIDDA----------HYLKLLGVQTAIMAHLKSVFGLNQRDY 1100


>gi|15235577|ref|NP_192451.1| DNA damage-binding protein 1a [Arabidopsis thaliana]
 gi|55976605|sp|Q9M0V3.1|DDB1A_ARATH RecName: Full=DNA damage-binding protein 1a; AltName: Full=UV-damaged
            DNA-binding protein 1a; Short=DDB1a
 gi|7267302|emb|CAB81084.1| UV-damaged DNA binding factor-like protein [Arabidopsis thaliana]
 gi|25054828|gb|AAN71904.1| putative UV-damaged DNA binding factor [Arabidopsis thaliana]
 gi|332657117|gb|AEE82517.1| DNA damage-binding protein 1a [Arabidopsis thaliana]
          Length = 1088

 Score = 79.7 bits (195), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 121/559 (21%), Positives = 223/559 (39%), Gaps = 116/559 (20%)

Query: 89   MRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYF 148
            +R FS+ +    VF     P  ++ +++  L ++    +  VS + PF++   P   L  
Sbjct: 632  LRTFSSKSATH-VFAASDRPTVIYSSNKKLLYSNVNLKE--VSHMCPFNSAAFPDS-LAI 687

Query: 149  NAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKF 208
              + EL I  +           +R +PL      + +  +T+T+ I +   +        
Sbjct: 688  AREGELTIGTIDDI----QKLHIRTIPLGEHARRICHQEQTRTFGICSLGNQS------- 736

Query: 209  NGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEY 268
            N E+ E+        F+  L  Q        F +     + +PL  +E+   + + S   
Sbjct: 737  NSEESEM-------HFVRLLDDQ-------TFEF----MSTYPLDSFEYGCSILSCSFTE 778

Query: 269  EGTLSGLRGYIALGTNYNY-SEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQK 327
            +  +     Y  +GT Y    E+   +GRIL+F I+E            ++++I  KE K
Sbjct: 779  DKNV-----YYCVGTAYVLPEENEPTKGRILVF-IVE----------DGRLQLIAEKETK 822

Query: 328  GPVTAICHVAGFLVTAVGQKI--YIWQLKDNDLTGIAFIDTEV-----YIASMVSVK-NL 379
            G V ++    G L+ A+ QKI  Y W L+D+   G   + +E       +A  V  + + 
Sbjct: 823  GAVYSLNAFNGKLLAAINQKIQLYKWMLRDD---GTRELQSECGHHGHILALYVQTRGDF 879

Query: 380  ILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQL 439
            I+VGD  +SI+LL Y+ E   +   ARDY     ++                        
Sbjct: 880  IVVGDLMKSISLLLYKHEEGAIEERARDYNANWMSAV----------------------- 916

Query: 440  SLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTD 499
                             +ILD+   +G   ++ + N++      E        RL    +
Sbjct: 917  -----------------EILDDDIYLG---AENNFNLLTVKKNSEGATDEERGRLEVVGE 956

Query: 500  FHLGQHVNTF----FKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLL 555
            +HLG+ VN F      +R   S I   P         + +++G +G    LP++ Y  L 
Sbjct: 957  YHLGEFVNRFRHGSLVMRLPDSEIGQIP------TVIFGTVNGVIGVIASLPQEQYTFLE 1010

Query: 556  MLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICK 615
             LQ+ +       GGL+   +R++  +   A   +R  +DG L+  FL LS  +  +I K
Sbjct: 1011 KLQSSLRKVIKGVGGLSHEQWRSFNNEKRTA--EARNFLDGDLIESFLDLSRNKMEDISK 1068

Query: 616  KIGSKHNDILDELYDIEAL 634
             +  +  ++   + ++  L
Sbjct: 1069 SMNVQVEELCKRVEELTRL 1087


>gi|186511557|ref|NP_001118940.1| DNA damage-binding protein 1a [Arabidopsis thaliana]
 gi|332657118|gb|AEE82518.1| DNA damage-binding protein 1a [Arabidopsis thaliana]
          Length = 1067

 Score = 79.3 bits (194), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 121/559 (21%), Positives = 223/559 (39%), Gaps = 116/559 (20%)

Query: 89   MRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYF 148
            +R FS+ +    VF     P  ++ +++  L ++    +  VS + PF++   P   L  
Sbjct: 611  LRTFSSKSATH-VFAASDRPTVIYSSNKKLLYSNVNLKE--VSHMCPFNSAAFPDS-LAI 666

Query: 149  NAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKF 208
              + EL I  +           +R +PL      + +  +T+T+ I +   +        
Sbjct: 667  AREGELTIGTIDDI----QKLHIRTIPLGEHARRICHQEQTRTFGICSLGNQS------- 715

Query: 209  NGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEY 268
            N E+ E+        F+  L  Q        F +     + +PL  +E+   + + S   
Sbjct: 716  NSEESEM-------HFVRLLDDQ-------TFEF----MSTYPLDSFEYGCSILSCSFTE 757

Query: 269  EGTLSGLRGYIALGTNYNY-SEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQK 327
            +  +     Y  +GT Y    E+   +GRIL+F     + E G      ++++I  KE K
Sbjct: 758  DKNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVEDG------RLQLIAEKETK 801

Query: 328  GPVTAICHVAGFLVTAVGQKI--YIWQLKDNDLTGIAFIDTEV-----YIASMVSVK-NL 379
            G V ++    G L+ A+ QKI  Y W L+D+   G   + +E       +A  V  + + 
Sbjct: 802  GAVYSLNAFNGKLLAAINQKIQLYKWMLRDD---GTRELQSECGHHGHILALYVQTRGDF 858

Query: 380  ILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQL 439
            I+VGD  +SI+LL Y+ E   +   ARDY     ++                        
Sbjct: 859  IVVGDLMKSISLLLYKHEEGAIEERARDYNANWMSAV----------------------- 895

Query: 440  SLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTD 499
                             +ILD+   +G   ++ + N++      E        RL    +
Sbjct: 896  -----------------EILDDDIYLG---AENNFNLLTVKKNSEGATDEERGRLEVVGE 935

Query: 500  FHLGQHVNTF----FKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLL 555
            +HLG+ VN F      +R   S I   P         + +++G +G    LP++ Y  L 
Sbjct: 936  YHLGEFVNRFRHGSLVMRLPDSEIGQIP------TVIFGTVNGVIGVIASLPQEQYTFLE 989

Query: 556  MLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICK 615
             LQ+ +       GGL+   +R++  +   A   +R  +DG L+  FL LS  +  +I K
Sbjct: 990  KLQSSLRKVIKGVGGLSHEQWRSFNNEKRTA--EARNFLDGDLIESFLDLSRNKMEDISK 1047

Query: 616  KIGSKHNDILDELYDIEAL 634
             +  +  ++   + ++  L
Sbjct: 1048 SMNVQVEELCKRVEELTRL 1066


>gi|396082420|gb|AFN84029.1| pre-mRNA cleavage and polyadenylation [Encephalitozoon romaleae
            SJ-2008]
          Length = 1156

 Score = 79.3 bits (194), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 89/408 (21%), Positives = 165/408 (40%), Gaps = 75/408 (18%)

Query: 172  RKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQ 231
            RK+P+   P  + Y      Y +V S             +D E  +       IP    +
Sbjct: 765  RKIPILRIPKHIEY---ADRYMVVASC------------KDVEFSSKDEKDCGIPVNTYR 809

Query: 232  FHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDV 291
            F+V L+S   +E I  + + L E E++  +K + ++      G   ++ + T +   ED 
Sbjct: 810  FYVDLYSE-RYEHI--STYELDENEYIFDVKYLVLDDMQGNYGKSPFLLVCTTFIEGEDR 866

Query: 292  TCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIW 351
              RGR+ + +II VVP    P    K+K++  ++ KG +     V G +   +G KI I+
Sbjct: 867  PARGRLHVLEIISVVPSLESPFRDCKLKVLGIEKTKGSIVQCSEVRGKIALCLGTKIMIY 926

Query: 352  QL-KDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKP 410
            ++ +   +  I F D  ++ +S+  +KN IL  D  R ++   +Q +   L L++     
Sbjct: 927  KIDRSTGIIPIGFYDLHIFTSSISVMKNYILASDIYRGLSFFFFQSKPIRLHLIS----- 981

Query: 411  TQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMIS 470
                     +  P + +    L+    +LS    +  C   G+ H               
Sbjct: 982  ---------SSEPLKNVTSTELLTAGNELS----MVCCDAKGTIH--------------- 1013

Query: 471  DKDKNVVLFMYQPEARESNGGHRLIKKTDF--HLGQHVNTFFKIRCKPSSISDAPGARSR 528
                    + Y P    S  G +L+K+++   +LG         R   S I    G R  
Sbjct: 1014 -------AYTYSPNNIISMDGAKLVKRSEMKTNLG---------RLSSSGI----GFRKN 1053

Query: 529  FLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAF 576
             + +Y+  +  L + + + +  Y +LL +Q  ++ H     GLN R +
Sbjct: 1054 SIMFYSKTN-LLIYLVGMDDSYYLKLLKIQTSIMVHLKSVLGLNQRDY 1100


>gi|145351726|ref|XP_001420218.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144580451|gb|ABO98511.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 1120

 Score = 78.2 bits (191), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 98/474 (20%), Positives = 181/474 (38%), Gaps = 99/474 (20%)

Query: 171  VRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVS 230
            +R +PL   P  +A+ ++T T+ +           +  +  D+EL               
Sbjct: 736  IRTIPLGGHPRRIAHQVDTNTFAVAVE--------HLMSKGDQELF-------------- 773

Query: 231  QFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYS-E 289
               + L    S++ + Q  F L E E    L + S   +      R Y  +GT + Y  E
Sbjct: 774  ---IRLIDDGSFDTLHQ--FRLEEHELASSLMSCSFAGDS-----REYYVVGTGFAYEQE 823

Query: 290  DVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKI- 348
            D   RGRIL+  +             + ++++  KE +G V  +    G L+  +  K+ 
Sbjct: 824  DEPSRGRILVLRV-----------EADALELVSEKEVRGAVYNLNAFKGKLLAGINSKLE 872

Query: 349  -YIWQLKDND---LTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLV 404
             + W  +++D   L        ++   S+ +  + ILVGD  +S++LL+Y+PE   +  +
Sbjct: 873  LFKWTPREDDAHELVSECSHHGQIITFSVKTRGDWILVGDLLKSMSLLQYKPEEGAIDEI 932

Query: 405  ARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSS 464
            ARD+      +                                          +LD+  +
Sbjct: 933  ARDFNANWMTAVA----------------------------------------MLDDDET 952

Query: 465  MGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSI--SDA 522
              ++ ++   N+        A       RL    ++HLG+ VN F      P S+  S  
Sbjct: 953  --YLGAENSLNLFTVARNMNAMTDEERSRLEITGEYHLGEFVNVF-----SPGSLVMSLK 1005

Query: 523  PGARSRFLT-WYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKG 581
             G      T  + + +G +G    LP+  Y     LQ  M  H    GGL    +R+++ 
Sbjct: 1006 DGDSLEVPTLLFGTGNGVIGVLASLPKDAYDFAERLQTSMNKHIQGVGGLKHAEWRSFRH 1065

Query: 582  KGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALS 635
                  +PSR  +DG LV  FL L + +   +   +     +I+  + +++ L+
Sbjct: 1066 TLRRKSDPSRNFVDGDLVESFLDLKVEQADVVAADMKCDRAEIIRRVEELQRLT 1119


>gi|19074861|ref|NP_586367.1| CLEAVAGE AND POLYADENYLATION SPECIFIC FACTOR [Encephalitozoon
            cuniculi GB-M1]
 gi|19069586|emb|CAD25971.1| CLEAVAGE AND POLYADENYLATION SPECIFIC FACTOR [Encephalitozoon
            cuniculi GB-M1]
          Length = 1156

 Score = 77.8 bits (190), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 85/355 (23%), Positives = 144/355 (40%), Gaps = 60/355 (16%)

Query: 225  IPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTN 284
            IP    +F+V L+S   +E I    + L E E+V  +K + ++      G   ++ + T 
Sbjct: 803  IPVDTYRFYVDLYSE-KYEHID--TYELDENEYVFHIKYLILDDMQGNYGKSPFLLVCTT 859

Query: 285  YNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAV 344
            +   ED   RGR+ + +II VVP    P    K+K++  ++ KG +     V G +   +
Sbjct: 860  FIEGEDRPARGRLHVLEIISVVPSLESPFKDCKLKVLGIEKTKGSIVRCEEVRGKIALCL 919

Query: 345  GQKIYIWQL-KDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSL 403
            G KI I+++ + + +  I F D  ++ +S+  VKN IL  D  R ++   +Q +   L L
Sbjct: 920  GTKIMIYKIDRSSGIIPIGFYDLHIFTSSISVVKNYILASDIYRGLSFFFFQSKPIRLHL 979

Query: 404  VARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEI--CKKIGSKHNDILDE 461
            ++              +  P R      L      LS G  L +  C   G+ H      
Sbjct: 980  IS--------------SSEPLRNATSTEL------LSTGNELSMLCCDAKGTIHG----- 1014

Query: 462  FSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISD 521
                             + Y P    S  G RL+K+ +        + F    K +SI  
Sbjct: 1015 -----------------YTYSPNNIISMDGARLVKRAEIKTNLGRLSSFGAGFKKNSI-- 1055

Query: 522  APGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAF 576
               +RS  L   + +D A          +Y +LL +Q  ++ H     GLN R +
Sbjct: 1056 MFYSRSNMLIHVSGIDDA----------HYLKLLGVQTAIMAHLKSVFGLNQRDY 1100


>gi|218197365|gb|EEC79792.1| hypothetical protein OsI_21216 [Oryza sativa Indica Group]
          Length = 1089

 Score = 77.0 bits (188), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 90/382 (23%), Positives = 157/382 (41%), Gaps = 83/382 (21%)

Query: 250  FPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNY-SEDVTCRGRILLFDIIEVVPE 308
            + L ++EH   + + S   +  +     Y  +GT Y    E+   +GRIL+F +     E
Sbjct: 761  YQLDQYEHGCSIISCSFSDDNNV-----YYCVGTAYVLPEENEPSKGRILVFAV-----E 810

Query: 309  PGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKI--YIWQLKDNDLTGIAFIDT 366
             G      ++++I  KE KG V ++    G L+ A+ QKI  Y W L+++   G   + +
Sbjct: 811  DG------RLQLIVEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRED---GSHELQS 861

Query: 367  EV----YIASMVSVK--NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYA 420
            E     +I ++ +    + I+VGD  +SI+LL Y+ E   +  +ARDY            
Sbjct: 862  ECGHHGHILALYTQTRGDFIVVGDLMKSISLLVYKHEESAIEELARDYNAN--------- 912

Query: 421  GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFM 480
                         W    +S  E L+    IG+++N                  N+    
Sbjct: 913  -------------W----MSAVEMLDDEIYIGAENN-----------------YNIFTVR 938

Query: 481  YQPEARESNGGHRLIKKTDFHLGQHVNTF----FKIRCKPSSISDAPGARSRFLTWYASL 536
               +A       RL    ++HLG+ VN        +R   S +   P         + ++
Sbjct: 939  KNSDAATDEERGRLEVVGEYHLGEFVNRLRHGSLVMRLPDSEMGQIP------TVIFGTI 992

Query: 537  DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDG 596
            +G +G    LP + Y  L  LQ+ +V      G L+   +R++      +   +R  +DG
Sbjct: 993  NGVIGIIASLPHEQYVFLEKLQSTLVKFIKGVGNLSHEQWRSFHNDKKTS--EARNFLDG 1050

Query: 597  SLVWKFLQLSLGERLEICKKIG 618
             L+  FL LS  +  E+ K +G
Sbjct: 1051 DLIESFLDLSRNKMEEVAKGMG 1072


>gi|115465791|ref|NP_001056495.1| Os05g0592400 [Oryza sativa Japonica Group]
 gi|48475231|gb|AAT44300.1| putative DNA damage binding protein 1 [Oryza sativa Japonica Group]
 gi|113580046|dbj|BAF18409.1| Os05g0592400 [Oryza sativa Japonica Group]
 gi|215694552|dbj|BAG89545.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222632766|gb|EEE64898.1| hypothetical protein OsJ_19757 [Oryza sativa Japonica Group]
          Length = 1090

 Score = 77.0 bits (188), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 90/382 (23%), Positives = 157/382 (41%), Gaps = 83/382 (21%)

Query: 250  FPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNY-SEDVTCRGRILLFDIIEVVPE 308
            + L ++EH   + + S   +  +     Y  +GT Y    E+   +GRIL+F +     E
Sbjct: 762  YQLDQYEHGCSIISCSFSDDNNV-----YYCVGTAYVLPEENEPSKGRILVFAV-----E 811

Query: 309  PGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKI--YIWQLKDNDLTGIAFIDT 366
             G      ++++I  KE KG V ++    G L+ A+ QKI  Y W L+++   G   + +
Sbjct: 812  DG------RLQLIVEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRED---GSHELQS 862

Query: 367  EV----YIASMVSVK--NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYA 420
            E     +I ++ +    + I+VGD  +SI+LL Y+ E   +  +ARDY            
Sbjct: 863  ECGHHGHILALYTQTRGDFIVVGDLMKSISLLVYKHEESAIEELARDYNAN--------- 913

Query: 421  GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFM 480
                         W    +S  E L+    IG+++N                  N+    
Sbjct: 914  -------------W----MSAVEMLDDEIYIGAENN-----------------YNIFTVR 939

Query: 481  YQPEARESNGGHRLIKKTDFHLGQHVNTF----FKIRCKPSSISDAPGARSRFLTWYASL 536
               +A       RL    ++HLG+ VN        +R   S +   P         + ++
Sbjct: 940  KNSDAATDEERGRLEVVGEYHLGEFVNRLRHGSLVMRLPDSEMGQIP------TVIFGTI 993

Query: 537  DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDG 596
            +G +G    LP + Y  L  LQ+ +V      G L+   +R++      +   +R  +DG
Sbjct: 994  NGVIGIIASLPHEQYVFLEKLQSTLVKFIKGVGNLSHEQWRSFHNDKKTS--EARNFLDG 1051

Query: 597  SLVWKFLQLSLGERLEICKKIG 618
             L+  FL LS  +  E+ K +G
Sbjct: 1052 DLIESFLDLSRNKMEEVAKGMG 1073


>gi|223994993|ref|XP_002287180.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220976296|gb|EED94623.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 1517

 Score = 77.0 bits (188), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 101/484 (20%), Positives = 183/484 (37%), Gaps = 113/484 (23%)

Query: 171  VRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVS 230
            V    L  TP  +AYH   + YC+         D     G + ++  +            
Sbjct: 1030 VTSYKLGMTPRRIAYHEAGRVYCV------GCIDGNAKGGNNNQVGAEINMGNC------ 1077

Query: 231  QFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSM----------EYEGTLSGLRGYIA 280
               V  F   ++EEI Q +  L  +E +L L +VS+            +   S  + YI 
Sbjct: 1078 ---VRFFDDSTFEEINQID--LEPFETILSLVSVSLCTSSQTLTQSNSKQDTSEYKPYIL 1132

Query: 281  LGTNYNY-SEDVTCRGRILLFDIIEVVPEPGQPLTKNK----------IKMIYAKEQKGP 329
            +GT Y Y  ED   +GRIL   ++E      +P  K+           ++ +     +G 
Sbjct: 1133 IGTAYAYPDEDEPTQGRIL---VVECNSGEAEPHLKSDDDMEDTYSRYVRHVTQMPTRGG 1189

Query: 330  VTAIC-HVAGFLVTAVGQKIYIWQLK--DNDLTGIAFIDT-------EVYIASMVS---- 375
            V +I     G ++  V  K ++ +L    + +  + F+          +++ S+      
Sbjct: 1190 VYSISPFYGGTVLATVNSKTHLCRLSIGCDQIGELKFVGAGHHGHMLSLFVKSLAGSESE 1249

Query: 376  ---------VKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRG 426
                      K L +VGD  RSI+L+ YQP++  +  +ARDY                  
Sbjct: 1250 SESSGTNRQAKQLAIVGDLMRSISLVEYQPKHNVIEELARDYNAN--------------- 1294

Query: 427  IIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEAR 486
                                 C  +        +  ++  ++ S+   N+ +  +   A 
Sbjct: 1295 --------------------FCTAV--------EMLTNGTYLGSEGFNNLFVLRHNANAS 1326

Query: 487  ESNGGHRLIKKTDFHLGQHVNTFFKIR-CKPSSISDAPGARSRFL---TWYASLDGALGF 542
                  RL    ++HLG+  N F       PS+     GA++ ++   T + ++DG++G 
Sbjct: 1327 SEEARVRLDTVGEYHLGEMTNKFMGGSLIMPSNSGGIMGAQNAYVGSQTLFGTVDGSIGS 1386

Query: 543  FLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKF 602
             L L    +  L  LQ  +++     G ++   +R ++ +      PSRG IDG L+  F
Sbjct: 1387 VLGLDGPTFAFLACLQRAILSIVKTVGDISHEEYRAFRAERQV--RPSRGFIDGDLIETF 1444

Query: 603  LQLS 606
            L L+
Sbjct: 1445 LDLN 1448


>gi|255956643|ref|XP_002569074.1| Pc21g20880 [Penicillium chrysogenum Wisconsin 54-1255]
 gi|211590785|emb|CAP96985.1| Pc21g20880 [Penicillium chrysogenum Wisconsin 54-1255]
          Length = 1140

 Score = 76.6 bits (187), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 81/365 (22%), Positives = 151/365 (41%), Gaps = 61/365 (16%)

Query: 275  LRGYIALGTNY-NYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAI 333
            ++    +GT + +  +D + RGRIL+ ++     + G+ L++     +    +   +   
Sbjct: 828  MKDRFVVGTAFADEEQDESIRGRILILEV-----DHGRKLSQVAELPVMGACRALAMMGD 882

Query: 334  CHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLR 393
            C VA  + T V  ++ I  +    L  +A   T      ++ V +LI V D  +S+ L+R
Sbjct: 883  CVVAALVKTVVVYRVKINNVGPMKLEKLAAYRTSTAPVDVIVVDDLIAVADLMKSLCLVR 942

Query: 394  YQP----EYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICK 449
            Y P    E   L+ V R Y+                       VW      +G+      
Sbjct: 943  YTPGHAGEPAKLTEVGRHYQT----------------------VWSTAIACVGDET---- 976

Query: 450  KIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 509
                             F+ SD + N+++         +   HRL+  ++  LG+ VN  
Sbjct: 977  -----------------FLQSDAEGNLIVLSRNMNGVTAQDKHRLMPTSEISLGEMVN-- 1017

Query: 510  FKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
               R +P +I            + A+++G++  F  +  ++   L+ LQ  + T  +  G
Sbjct: 1018 ---RIRPVNIPQLSSVMVTPRAFMATVEGSIFLFAVINPEHQDFLMTLQASLSTKINSLG 1074

Query: 570  GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELY 629
             L+   FR+++     A  P R  +DG L+ +FL  S   + EI ++IGS  +D+++   
Sbjct: 1075 NLSFDKFRSFRTMVRSAEAPYR-FVDGELIEQFLNCSPSMQEEIVQEIGS--SDVVEVKR 1131

Query: 630  DIEAL 634
             IEAL
Sbjct: 1132 MIEAL 1136


>gi|12082087|dbj|BAB20761.1| UV-damaged DNA binding protein [Oryza sativa Japonica Group]
          Length = 1090

 Score = 76.3 bits (186), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 90/382 (23%), Positives = 157/382 (41%), Gaps = 83/382 (21%)

Query: 250  FPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNY-SEDVTCRGRILLFDIIEVVPE 308
            + L ++EH   + + S   +  +     Y  +GT Y    E+   +GRIL+F +     E
Sbjct: 762  YQLDQYEHGCSIISCSFSDDNNV-----YYCVGTAYVLPEENEPSKGRILVFAV-----E 811

Query: 309  PGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKI--YIWQLKDNDLTGIAFIDT 366
             G      ++++I  KE KG V ++    G L+ A+ QKI  Y W L+++   G   + +
Sbjct: 812  DG------RLQLIVEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRED---GSHELQS 862

Query: 367  EV----YIASMVSVK--NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYA 420
            E     +I ++ +    + I+VGD  +SI+LL Y+ E   +  +ARDY            
Sbjct: 863  ECGHHGHILALYTQTRGDFIVVGDLMKSISLLVYKHEESAIEELARDYNAN--------- 913

Query: 421  GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFM 480
                         W    +S  E L+    IG+++N                  N+    
Sbjct: 914  -------------W----MSAVEMLDDEIYIGAENN-----------------YNIFTVR 939

Query: 481  YQPEARESNGGHRLIKKTDFHLGQHVNTF----FKIRCKPSSISDAPGARSRFLTWYASL 536
               +A       RL    ++HLG+  N F      +R   S +   P         + ++
Sbjct: 940  KNSDAATDEERGRLEVVGEYHLGEFGNRFRHGSLVMRLPDSEMGQIP------TVIFGTI 993

Query: 537  DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDG 596
            +G +G    LP + Y  L  LQ+ +V      G L+   +R++      +   +R  +DG
Sbjct: 994  NGVIGIIASLPHEQYVFLEKLQSTLVKFIKGVGNLSHEQWRSFHNDKKTS--EARNFLDG 1051

Query: 597  SLVWKFLQLSLGERLEICKKIG 618
             L+  FL LS  +  E+ K +G
Sbjct: 1052 DLIESFLDLSRNKMEEVAKGMG 1073


>gi|298711490|emb|CBJ26578.1| n/a [Ectocarpus siliculosus]
          Length = 1135

 Score = 76.3 bits (186), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 118/540 (21%), Positives = 209/540 (38%), Gaps = 100/540 (18%)

Query: 97   GYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRI 156
            G   VF+    PA ++  S G+L    + + G V+++  F +   P   L   +++ L I
Sbjct: 664  GMVCVFVASDRPAVIY-CSGGKLLYANVNM-GEVNSVCSFDSSELPH-CLALASENSLTI 720

Query: 157  SVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELV 216
              +           ++KV L   P  + +H   + + I+T      T Y      D+E  
Sbjct: 721  GTIDDI----QKLHIQKVSLGEAPQRITHHDSGRMFGIIT------TSYRAVENSDEE-- 768

Query: 217  TDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLR 276
                +  F         V      ++EE+     PL  +E+       SM      +  +
Sbjct: 769  ---EEHNF---------VKFLDDTNFEEL--YCHPLDAFEN-----GSSMVSCVFANDKK 809

Query: 277  GYIALGTNYNYSEDVT-CRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICH 335
             Y+ +GT Y   ++     GR+L+F +       GQ   + K+ +    E +G V  +  
Sbjct: 810  EYLVVGTGYVREDECEPAVGRLLVFSV------EGQG-AERKVDLAAEVETRGAVYVLNG 862

Query: 336  VAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTE------VYIASMVSVKNLILVGDYARSI 389
              G L+  +  K+ +++  + D  GI  + TE      +    M S  + I+VGD  RS+
Sbjct: 863  FNGKLLACINSKVQLFRWIEKD-DGIQELQTECGYHGHILALHMQSRGDFIIVGDLMRSV 921

Query: 390  ALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICK 449
            +LL Y+     +  VARDY                         W    ++  E L    
Sbjct: 922  SLLVYKAVDGAIEEVARDYHAN----------------------W----MTAVEML---- 951

Query: 450  KIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 509
                 ++D+        ++  + D N+       +A       RL  + +FHLG+ VN F
Sbjct: 952  -----NDDV--------YIGGEADCNIFTLRRNADAATEEERARLEIQGEFHLGEFVNKF 998

Query: 510  FKIRC-KPSSISDAPGARSRFLT-----WYASLDGALGFFLPLPEKNYRRLLMLQNVMVT 563
             +      SS  ++PG     L       + +++G +G  L L E N+R L  LQ  M  
Sbjct: 999  CRGSLLMQSSEVNSPGGMDSPLVKGQPLLFGTVNGMVGTILTLTEDNHRFLAQLQTAMTK 1058

Query: 564  HTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHND 623
                 GG +   +R++      +  PS   IDG LV  +L +    + E+ + + +   D
Sbjct: 1059 VVKGVGGFSHDEWRSFTNGRRTS--PSSNFIDGDLVESYLDMPRHNQEEVLRHVDTPVGD 1116


>gi|350537001|ref|NP_001234275.1| DNA damage-binding protein 1 [Solanum lycopersicum]
 gi|350539125|ref|NP_001233864.1| UV damaged DNA binding protein 1 [Solanum lycopersicum]
 gi|55976440|sp|Q6QNU4.1|DDB1_SOLLC RecName: Full=DNA damage-binding protein 1; AltName: Full=High
            pigmentation protein 1; AltName: Full=UV-damaged
            DNA-binding protein 1
 gi|38455768|gb|AAR20885.1| UV damaged DNA binding protein 1 [Solanum lycopersicum]
 gi|42602165|gb|AAS21683.1| UV-damaged DNA binding protein 1 [Solanum lycopersicum]
          Length = 1090

 Score = 76.3 bits (186), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 109/499 (21%), Positives = 192/499 (38%), Gaps = 107/499 (21%)

Query: 130  VSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLET 189
            VS + PF+    P   L    + EL I  +           +R +PL      +++  +T
Sbjct: 670  VSHMCPFNVAAFPDS-LAIAKEGELTIGTIDEI----QKLHIRSIPLGEHARRISHQEQT 724

Query: 190  KTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTN 249
            +T+ +       S  Y + N +D E+                  V L    ++E I  + 
Sbjct: 725  RTFALC------SVKYTQSNADDPEM----------------HFVRLLDDQTFEFI--ST 760

Query: 250  FPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNY-SEDVTCRGRILLFDIIEVVPE 308
            +PL ++E+   + + S   +  +     Y  +GT Y    E+   +GRIL+F I+E    
Sbjct: 761  YPLDQFEYGCSILSCSFSDDSNV-----YYCIGTAYVMPEENEPTKGRILVF-IVE---- 810

Query: 309  PGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEV 368
                    K+++I  KE KG V ++    G L+ A+ QKI +++    +  G   + TE 
Sbjct: 811  ------DGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWASREDGGSRELQTEC 864

Query: 369  -----YIASMVSVK-NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGN 422
                  +A  V  + + I+VGD  +SI+LL ++ E   +   ARDY     ++       
Sbjct: 865  GHHGHILALYVQTRGDFIVVGDLMKSISLLIFKHEEGAIEERARDYNANWMSAV------ 918

Query: 423  PSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQ 482
                                              +ILD+   +G   ++ + N+      
Sbjct: 919  ----------------------------------EILDDDIYLG---AENNFNLFTVRKN 941

Query: 483  PEARESNGGHRLIKKTDFHLGQHVNTF----FKIRCKPSSISDAPGARSRFLTWYASLDG 538
             E        RL    ++HLG+ VN F      +R   S +   P         + +++G
Sbjct: 942  SEGATDEERSRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIP------TVIFGTVNG 995

Query: 539  ALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSL 598
             +G    LP   Y  L  LQ  +       GGL+   +R++  +       ++  +DG L
Sbjct: 996  VIGVIASLPHDQYLFLEKLQTNLRKVIKGVGGLSHEQWRSFYNEKKTV--DAKNFLDGDL 1053

Query: 599  VWKFLQLSLGERLEICKKI 617
            +  FL LS     EI K +
Sbjct: 1054 IESFLDLSRNRMEEISKAM 1072


>gi|55976392|sp|Q6E7D1.1|DDB1_SOLCE RecName: Full=DNA damage-binding protein 1; AltName: Full=UV-damaged
            DNA-binding protein 1
 gi|49484911|gb|AAT66742.1| UV-damaged DNA binding protein 1 [Solanum cheesmaniae]
          Length = 1095

 Score = 75.9 bits (185), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 109/499 (21%), Positives = 192/499 (38%), Gaps = 107/499 (21%)

Query: 130  VSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLET 189
            VS + PF+    P   L    + EL I  +           +R +PL      +++  +T
Sbjct: 675  VSHMCPFNVAAFPDS-LAIAKEGELTIGTIDEI----QKLHIRSIPLGEHARRISHQEQT 729

Query: 190  KTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTN 249
            +T+ +       S  Y + N +D E+                  V L    ++E I  + 
Sbjct: 730  RTFALC------SVKYTQSNADDPEM----------------HFVRLLDDQTFEFI--ST 765

Query: 250  FPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNY-SEDVTCRGRILLFDIIEVVPE 308
            +PL ++E+   + + S   +  +     Y  +GT Y    E+   +GRIL+F I+E    
Sbjct: 766  YPLDQFEYGCSILSCSFSDDSNV-----YYCIGTAYVMPEENEPTKGRILVF-IVE---- 815

Query: 309  PGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEV 368
                    K+++I  KE KG V ++    G L+ A+ QKI +++    +  G   + TE 
Sbjct: 816  ------DGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWASREDGGSRELQTEC 869

Query: 369  -----YIASMVSVK-NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGN 422
                  +A  V  + + I+VGD  +SI+LL ++ E   +   ARDY     ++       
Sbjct: 870  GHHGHILALYVQTRGDFIVVGDLMKSISLLIFKHEEGAIEERARDYNANWMSAV------ 923

Query: 423  PSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQ 482
                                              +ILD+   +G   ++ + N+      
Sbjct: 924  ----------------------------------EILDDDIYLG---AENNFNLFTVRKN 946

Query: 483  PEARESNGGHRLIKKTDFHLGQHVNTF----FKIRCKPSSISDAPGARSRFLTWYASLDG 538
             E        RL    ++HLG+ VN F      +R   S +   P         + +++G
Sbjct: 947  SEGATDEERSRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIP------TVIFGTVNG 1000

Query: 539  ALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSL 598
             +G    LP   Y  L  LQ  +       GGL+   +R++  +       ++  +DG L
Sbjct: 1001 VIGVIASLPHDQYLFLEKLQTNLRKVIKGVGGLSHEQWRSFYNEKKTV--DAKNFLDGDL 1058

Query: 599  VWKFLQLSLGERLEICKKI 617
            +  FL LS     EI K +
Sbjct: 1059 IESFLDLSRNRMEEISKAM 1077


>gi|224061051|ref|XP_002300334.1| predicted protein [Populus trichocarpa]
 gi|222847592|gb|EEE85139.1| predicted protein [Populus trichocarpa]
          Length = 1088

 Score = 75.9 bits (185), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 113/501 (22%), Positives = 196/501 (39%), Gaps = 113/501 (22%)

Query: 130  VSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLET 189
            VS + PF++   P   L    + EL I  +           +R +PL      + +  ++
Sbjct: 670  VSHMCPFNSAAFPDS-LAIAKEGELSIGTIDDI----QKLHIRSIPLGEHARRICHQEQS 724

Query: 190  KTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTN 249
            +T+ I +   +        N E+ E+        FI  L  Q         ++E I  + 
Sbjct: 725  RTFSICSMKNQS-------NAEESEM-------HFIRLLDDQ---------TFEFI--ST 759

Query: 250  FPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNY-SEDVTCRGRILLFDIIEVVPE 308
            +PL  +E+   + + S   +  +     Y  +GT Y    E+   +GRIL+F     + E
Sbjct: 760  YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 809

Query: 309  PGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKI--YIWQLKDNDLTGIAFIDT 366
             G      K+++I  KE KG V ++    G L+ A+ QKI  Y W L+D+   G   + +
Sbjct: 810  DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDD---GTRELQS 860

Query: 367  EV-----YIASMVSVK-NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYA 420
            E       +A  V  + + I+VGD  +SI+LL Y+ E   +   ARDY     ++     
Sbjct: 861  ECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAV---- 916

Query: 421  GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFM 480
                                                +ILD+   +G   ++ + N+    
Sbjct: 917  ------------------------------------EILDDDIYLG---AENNFNLFTVR 937

Query: 481  YQPEARESNGGHRLIKKTDFHLGQHVNTF----FKIRCKPSSISDAPGARSRFLTWYASL 536
               E        RL    ++HLG+ VN F      +R   S +   P         + ++
Sbjct: 938  KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIP------TVIFGTV 991

Query: 537  DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDG 596
            +G +G    LP + Y  L  LQ+ +       GGL+   +R++  +       ++  +DG
Sbjct: 992  NGVIGVIASLPHEQYLFLEKLQSNLRKVIKGVGGLSHEQWRSFNNEKKTV--DAKNFLDG 1049

Query: 597  SLVWKFLQLSLGERLEICKKI 617
             L+  FL LS     EI K +
Sbjct: 1050 DLIESFLDLSRSRMDEISKAM 1070


>gi|168047617|ref|XP_001776266.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162672361|gb|EDQ58899.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 1089

 Score = 75.5 bits (184), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 112/496 (22%), Positives = 190/496 (38%), Gaps = 108/496 (21%)

Query: 130  VSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLET 189
            V+ + PF++ + P   L    + EL I  +           +R VPL   P  +A+  ++
Sbjct: 670  VNHMCPFNSASFPDS-LAIGKEGELTIGTIDDI----QKLHIRTVPLGERPCRIAHQEQS 724

Query: 190  KTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTN 249
            +++ I ++           N ED E                  +V L    ++E    + 
Sbjct: 725  RSFAICSAKYSQGP-----NNEDIE----------------THYVRLIEDQTFE--ITSG 761

Query: 250  FPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVT-CRGRILLFDIIEVVPE 308
            F L  +E    +   S   +  +     Y  +GT Y   E+    +GRIL+F     + E
Sbjct: 762  FALDLYEIGCSIITCSFTDDSNV-----YYCVGTAYALPEESEPTKGRILVF-----LVE 811

Query: 309  PGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKI--YIWQLKDNDLTGIAFIDT 366
             G      K++++  KE KG V  +    G L+  + QKI  Y W L+D   T +  I++
Sbjct: 812  DG------KLQLVAEKEMKGAVYNLNAFNGKLLAGINQKIALYKWTLRDG--TRVLEIES 863

Query: 367  E----VYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGN 422
                 +    + S  + I+VGD  +SI+LL Y+PE   +   ARDY              
Sbjct: 864  SHHGHILALYVQSRGDFIVVGDLMKSISLLIYKPEEGAIEERARDYNAN----------- 912

Query: 423  PSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQ 482
                       W                      +ILD+ + +G   ++   N+      
Sbjct: 913  -----------WM------------------TAVEILDDDTYLG---AENSFNLFTVRKN 940

Query: 483  PEARESNGGHRLIKKTDFHLGQHVNTF----FKIRCKPSSISDAPGARSRFLTWYASLDG 538
             +A       RL    ++HLG+ VN F      +R   S  S  P         + +++G
Sbjct: 941  NDAATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSEASLIP------TVIFGTVNG 994

Query: 539  ALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSL 598
             +G    LP+  +  L  LQ  +V      GGL+   +R++  +       +R  +DG L
Sbjct: 995  VIGVIASLPQDKFLFLQKLQQALVKVIKGVGGLSHEQWRSFSNERKTV--DARNFLDGDL 1052

Query: 599  VWKFLQLSLGERLEIC 614
            +  FL LS  +  EI 
Sbjct: 1053 IESFLDLSRNKMEEIA 1068


>gi|449519304|ref|XP_004166675.1| PREDICTED: DNA damage-binding protein 1a-like [Cucumis sativus]
          Length = 596

 Score = 75.5 bits (184), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 115/501 (22%), Positives = 195/501 (38%), Gaps = 112/501 (22%)

Query: 130 VSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLET 189
           VS + PF++   P   L    + EL I  +           +R +PL      + +  ++
Sbjct: 177 VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHARRICHQEQS 231

Query: 190 KTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTN 249
           +T+ I       S  Y +   ED E+        FI  L  Q         ++E I  + 
Sbjct: 232 RTFAIC------SLRYNQSGTEDTEM-------HFIRLLDDQ---------TFESI--ST 267

Query: 250 FPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNY-SEDVTCRGRILLFDIIEVVPE 308
           + L  +E+   + + S   +  +     Y  +GT Y    E+   +GRIL+F     V E
Sbjct: 268 YALDTYEYGCSILSCSFSDDNNV-----YYCVGTAYVMPEENEPTKGRILVF-----VVE 317

Query: 309 PGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKI--YIWQLKDNDLTGIAFIDT 366
            G      K+++I  KE KG V ++    G L+ A+ QKI  Y W L+D+   G   + +
Sbjct: 318 EG------KLQLIAEKETKGSVYSLNAFNGKLLAAINQKIQLYKWTLRDD---GTRELQS 368

Query: 367 EV-----YIASMVSVK-NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYA 420
           E       +A  V  + + I+VGD  +SI+LL Y+ E   +   ARDY     ++     
Sbjct: 369 ECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAV---- 424

Query: 421 GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFM 480
                                               +ILD+   +G   ++   N+    
Sbjct: 425 ------------------------------------EILDDDIYLG---AENYFNLFTVR 445

Query: 481 YQPEARESNGGHRLIKKTDFHLGQHVNTF----FKIRCKPSSISDAPGARSRFLTWYASL 536
              E        RL    ++HLG+ VN F      +R   S +   P         + S+
Sbjct: 446 KNSEGATDEERSRLEVVGEYHLGEFVNRFQHGSLVMRLPDSDVGQIP------TVIFGSV 499

Query: 537 DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDG 596
           +G +G    LP   Y  L  LQ+ +       GGL+   +R++  +   A   ++  +DG
Sbjct: 500 NGVIGVIASLPHDQYVFLERLQSNLRKVIKGVGGLSHEQWRSFNNEKRTA--EAKNFLDG 557

Query: 597 SLVWKFLQLSLGERLEICKKI 617
            L+  FL L+  +  EI + +
Sbjct: 558 DLIESFLDLNRSKMEEISRAM 578


>gi|449435512|ref|XP_004135539.1| PREDICTED: DNA damage-binding protein 1-like [Cucumis sativus]
          Length = 1093

 Score = 75.1 bits (183), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 116/518 (22%), Positives = 201/518 (38%), Gaps = 112/518 (21%)

Query: 130  VSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLET 189
            VS + PF++   P   L    + EL I  +           +R +PL      + +  ++
Sbjct: 674  VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHARRICHQEQS 728

Query: 190  KTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTN 249
            +T+ I       S  Y +   ED E+        FI  L  Q         ++E I  + 
Sbjct: 729  RTFAIC------SLRYNQSGTEDTEM-------HFIRLLDDQ---------TFESI--ST 764

Query: 250  FPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNY-SEDVTCRGRILLFDIIEVVPE 308
            + L  +E+   + + S   +  +     Y  +GT Y    E+   +GRIL+F     V E
Sbjct: 765  YALDTYEYGCSILSCSFSDDNNV-----YYCVGTAYVMPEENEPTKGRILVF-----VVE 814

Query: 309  PGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKI--YIWQLKDNDLTGIAFIDT 366
             G      K+++I  KE KG V ++    G L+ A+ QKI  Y W L+D+   G   + +
Sbjct: 815  EG------KLQLIAEKETKGSVYSLNAFNGKLLAAINQKIQLYKWTLRDD---GTRELQS 865

Query: 367  EV-----YIASMVSVK-NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYA 420
            E       +A  V  + + I+VGD  +SI+LL Y+ E   +   ARDY     ++     
Sbjct: 866  ECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAV---- 921

Query: 421  GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFM 480
                                                +ILD+   +G   ++   N+    
Sbjct: 922  ------------------------------------EILDDDIYLG---AENYFNLFTVR 942

Query: 481  YQPEARESNGGHRLIKKTDFHLGQHVNTF----FKIRCKPSSISDAPGARSRFLTWYASL 536
               E        RL    ++HLG+ VN F      +R   S +   P         + S+
Sbjct: 943  KNSEGATDEERSRLEVVGEYHLGEFVNRFQHGSLVMRLPDSDVGQIP------TVIFGSV 996

Query: 537  DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDG 596
            +G +G    LP   Y  L  LQ+ +       GGL+   +R++  +   A   ++  +DG
Sbjct: 997  NGVIGVIASLPHDQYVFLERLQSNLRKVIKGVGGLSHEQWRSFNNEKRTA--EAKNFLDG 1054

Query: 597  SLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEAL 634
             L+  FL L+  +  EI + +     ++   + ++  L
Sbjct: 1055 DLIESFLDLNRSKMEEISRAMSVSAEELCKRVEELTRL 1092


>gi|356512636|ref|XP_003525024.1| PREDICTED: DNA damage-binding protein 1a-like isoform 1 [Glycine max]
          Length = 1089

 Score = 74.7 bits (182), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 111/501 (22%), Positives = 194/501 (38%), Gaps = 112/501 (22%)

Query: 130  VSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLET 189
            VS + PF++   P   L    + EL I  +           +R +PL      + +  ++
Sbjct: 670  VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHARRICHQEQS 724

Query: 190  KTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTN 249
            +T+ I +    P++      GED E+                  V L    ++E I  + 
Sbjct: 725  RTFAICSLKYNPAS------GEDSEM----------------HFVRLLDDQTFEFI--ST 760

Query: 250  FPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNY-SEDVTCRGRILLFDIIEVVPE 308
            + L  +E+   + + S   +  +     Y  +GT Y    E+   +GRIL+F +     E
Sbjct: 761  YSLDTYEYGCFIISCSFSDDNNV-----YYCVGTAYVLPEENEPTKGRILVFAV-----E 810

Query: 309  PGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKI--YIWQLKDNDLTGIAFIDT 366
             G      K+++I  KE KG V  +    G L+ A+ QKI  Y W L+D+   G   + +
Sbjct: 811  DG------KLQLIAEKETKGAVYCLNAFNGKLLAAINQKIQLYKWVLRDD---GTHELQS 861

Query: 367  EV-----YIASMVSVK-NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYA 420
            E       +A  V  + + I+VGD  +SI+LL Y+ E   +   ARDY     ++     
Sbjct: 862  ECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAV---- 917

Query: 421  GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFM 480
                                                +I+D+   +G   ++   N+    
Sbjct: 918  ------------------------------------EIVDDDIYLG---AENSFNLFTVR 938

Query: 481  YQPEARESNGGHRLIKKTDFHLGQHVNTF----FKIRCKPSSISDAPGARSRFLTWYASL 536
               E        RL    ++HLG+ VN F      +R   S +   P         + ++
Sbjct: 939  KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIP------TVIFGTI 992

Query: 537  DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDG 596
            +G +G    LP + Y  L  LQ+ +       GGL+   +R++  +       +R  +DG
Sbjct: 993  NGVIGVIASLPHEQYVFLEKLQSNLRKVIKGVGGLSHEQWRSFNNEKKTV--EARNFLDG 1050

Query: 597  SLVWKFLQLSLGERLEICKKI 617
             L+  FL L+  +  EI K +
Sbjct: 1051 DLIESFLDLNRSKMDEISKAL 1071


>gi|356512638|ref|XP_003525025.1| PREDICTED: DNA damage-binding protein 1a-like isoform 2 [Glycine max]
          Length = 1068

 Score = 74.7 bits (182), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 111/501 (22%), Positives = 194/501 (38%), Gaps = 112/501 (22%)

Query: 130  VSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLET 189
            VS + PF++   P   L    + EL I  +           +R +PL      + +  ++
Sbjct: 649  VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHARRICHQEQS 703

Query: 190  KTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTN 249
            +T+ I +    P++      GED E+                  V L    ++E I  + 
Sbjct: 704  RTFAICSLKYNPAS------GEDSEM----------------HFVRLLDDQTFEFI--ST 739

Query: 250  FPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNY-SEDVTCRGRILLFDIIEVVPE 308
            + L  +E+   + + S   +  +     Y  +GT Y    E+   +GRIL+F +     E
Sbjct: 740  YSLDTYEYGCFIISCSFSDDNNV-----YYCVGTAYVLPEENEPTKGRILVFAV-----E 789

Query: 309  PGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKI--YIWQLKDNDLTGIAFIDT 366
             G      K+++I  KE KG V  +    G L+ A+ QKI  Y W L+D+   G   + +
Sbjct: 790  DG------KLQLIAEKETKGAVYCLNAFNGKLLAAINQKIQLYKWVLRDD---GTHELQS 840

Query: 367  EV-----YIASMVSVK-NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYA 420
            E       +A  V  + + I+VGD  +SI+LL Y+ E   +   ARDY     ++     
Sbjct: 841  ECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAV---- 896

Query: 421  GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFM 480
                                                +I+D+   +G   ++   N+    
Sbjct: 897  ------------------------------------EIVDDDIYLG---AENSFNLFTVR 917

Query: 481  YQPEARESNGGHRLIKKTDFHLGQHVNTF----FKIRCKPSSISDAPGARSRFLTWYASL 536
               E        RL    ++HLG+ VN F      +R   S +   P         + ++
Sbjct: 918  KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIP------TVIFGTI 971

Query: 537  DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDG 596
            +G +G    LP + Y  L  LQ+ +       GGL+   +R++  +       +R  +DG
Sbjct: 972  NGVIGVIASLPHEQYVFLEKLQSNLRKVIKGVGGLSHEQWRSFNNEKKTV--EARNFLDG 1029

Query: 597  SLVWKFLQLSLGERLEICKKI 617
             L+  FL L+  +  EI K +
Sbjct: 1030 DLIESFLDLNRSKMDEISKAL 1050


>gi|356525401|ref|XP_003531313.1| PREDICTED: DNA damage-binding protein 1-like isoform 1 [Glycine max]
          Length = 1089

 Score = 74.3 bits (181), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 110/501 (21%), Positives = 194/501 (38%), Gaps = 112/501 (22%)

Query: 130  VSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLET 189
            VS + PF++   P   L    + EL I  +           +R +PL      + +  ++
Sbjct: 670  VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHARRICHQEQS 724

Query: 190  KTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTN 249
            +T+ I +    P++      GED E+                  V L    ++E I  + 
Sbjct: 725  RTFAICSLKYNPAS------GEDSEM----------------HFVRLLDDQTFEFI--ST 760

Query: 250  FPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNY-SEDVTCRGRILLFDIIEVVPE 308
            + L  +E+   + + S   +  +     Y  +GT Y    E+   +GRI++F +     E
Sbjct: 761  YSLDTYEYGCFIISCSFSDDNNV-----YYCVGTAYVLPEENEPTKGRIIVFAV-----E 810

Query: 309  PGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKI--YIWQLKDNDLTGIAFIDT 366
             G      K+++I  KE KG V  +    G L+ A+ QKI  Y W L+D+   G   + +
Sbjct: 811  DG------KLQLIAEKETKGAVYCLNAFNGKLLAAINQKIQLYKWVLRDD---GTHELQS 861

Query: 367  EV-----YIASMVSVK-NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYA 420
            E       +A  V  + + I+VGD  +SI+LL Y+ E   +   ARDY     ++     
Sbjct: 862  ECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAV---- 917

Query: 421  GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFM 480
                                                +I+D+   +G   ++   N+    
Sbjct: 918  ------------------------------------EIVDDDIYLG---AENSFNLFTVR 938

Query: 481  YQPEARESNGGHRLIKKTDFHLGQHVNTF----FKIRCKPSSISDAPGARSRFLTWYASL 536
               E        RL    ++HLG+ VN F      +R   S +   P         + ++
Sbjct: 939  KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIP------TVIFGTI 992

Query: 537  DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDG 596
            +G +G    LP + Y  L  LQ+ +       GGL+   +R++  +       +R  +DG
Sbjct: 993  NGVIGVIASLPHEQYVFLEKLQSNLRKVIKGVGGLSHEQWRSFNNEKKTV--EARNFLDG 1050

Query: 597  SLVWKFLQLSLGERLEICKKI 617
             L+  FL L+  +  EI K +
Sbjct: 1051 DLIESFLDLNRSKMDEISKAV 1071


>gi|356525403|ref|XP_003531314.1| PREDICTED: DNA damage-binding protein 1-like isoform 2 [Glycine max]
          Length = 1068

 Score = 74.3 bits (181), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 110/501 (21%), Positives = 194/501 (38%), Gaps = 112/501 (22%)

Query: 130  VSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLET 189
            VS + PF++   P   L    + EL I  +           +R +PL      + +  ++
Sbjct: 649  VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHARRICHQEQS 703

Query: 190  KTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTN 249
            +T+ I +    P++      GED E+                  V L    ++E I  + 
Sbjct: 704  RTFAICSLKYNPAS------GEDSEM----------------HFVRLLDDQTFEFI--ST 739

Query: 250  FPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNY-SEDVTCRGRILLFDIIEVVPE 308
            + L  +E+   + + S   +  +     Y  +GT Y    E+   +GRI++F +     E
Sbjct: 740  YSLDTYEYGCFIISCSFSDDNNV-----YYCVGTAYVLPEENEPTKGRIIVFAV-----E 789

Query: 309  PGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKI--YIWQLKDNDLTGIAFIDT 366
             G      K+++I  KE KG V  +    G L+ A+ QKI  Y W L+D+   G   + +
Sbjct: 790  DG------KLQLIAEKETKGAVYCLNAFNGKLLAAINQKIQLYKWVLRDD---GTHELQS 840

Query: 367  EV-----YIASMVSVK-NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYA 420
            E       +A  V  + + I+VGD  +SI+LL Y+ E   +   ARDY     ++     
Sbjct: 841  ECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAV---- 896

Query: 421  GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFM 480
                                                +I+D+   +G   ++   N+    
Sbjct: 897  ------------------------------------EIVDDDIYLG---AENSFNLFTVR 917

Query: 481  YQPEARESNGGHRLIKKTDFHLGQHVNTF----FKIRCKPSSISDAPGARSRFLTWYASL 536
               E        RL    ++HLG+ VN F      +R   S +   P         + ++
Sbjct: 918  KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIP------TVIFGTI 971

Query: 537  DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDG 596
            +G +G    LP + Y  L  LQ+ +       GGL+   +R++  +       +R  +DG
Sbjct: 972  NGVIGVIASLPHEQYVFLEKLQSNLRKVIKGVGGLSHEQWRSFNNEKKTV--EARNFLDG 1029

Query: 597  SLVWKFLQLSLGERLEICKKI 617
             L+  FL L+  +  EI K +
Sbjct: 1030 DLIESFLDLNRSKMDEISKAV 1050


>gi|255571318|ref|XP_002526608.1| DNA repair protein xp-E, putative [Ricinus communis]
 gi|223534048|gb|EEF35767.1| DNA repair protein xp-E, putative [Ricinus communis]
          Length = 1033

 Score = 73.9 bits (180), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 88/354 (24%), Positives = 143/354 (40%), Gaps = 78/354 (22%)

Query: 278  YIALGTNYNY-SEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
            Y  +GT Y    E+   +GRIL+F     + E G      K+++I  KE KG V ++   
Sbjct: 728  YYCVGTAYVMPEENEPTKGRILVF-----LVEDG------KLQVITEKETKGAVYSLNSF 776

Query: 337  AGFLVTAVGQKI--YIWQLKDNDLTGIAFIDTEV-----YIASMVSVK-NLILVGDYARS 388
             G L+ A+ QKI  Y W L+D+   G   + +E       +A  V  + + I+VGD  +S
Sbjct: 777  NGKLLAAINQKIQLYKWMLRDD---GSRELQSECGHHGHILALYVQTRGDFIVVGDLMKS 833

Query: 389  IALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEIC 448
            I+LL Y+ E   +   ARDY     ++                                 
Sbjct: 834  ISLLIYKHEEGAIEERARDYNANWMSAV-------------------------------- 861

Query: 449  KKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNT 508
                    +ILD+   +G   ++ + N+       E        RL    ++HLG+ VN 
Sbjct: 862  --------EILDDDIYLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 910

Query: 509  F----FKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 564
            F      +R   S +   P         + +++G +G    LP + Y  L  LQ+ +   
Sbjct: 911  FRHGSLVMRLPDSDVGQIP------TVIFGTVNGVIGVIASLPHEQYIFLEKLQSNLRRV 964

Query: 565  TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 618
                GGL+   +R++  +       ++  +DG L+  FL LS     EI K IG
Sbjct: 965  IKGVGGLSHEQWRSFNNEKKTV--EAKNFLDGDLIESFLDLSRNRMDEISKAIG 1016


>gi|225443990|ref|XP_002280735.1| PREDICTED: DNA damage-binding protein 1 isoform 1 [Vitis vinifera]
          Length = 1089

 Score = 73.6 bits (179), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 113/502 (22%), Positives = 195/502 (38%), Gaps = 112/502 (22%)

Query: 130  VSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLET 189
            VS + PF++   P   L    + +L I  +           +R +PL      + +  ++
Sbjct: 670  VSHMCPFNSAAFPDS-LAIAKEGDLTIGTIDDI----QKLHIRSIPLGEHARRICHQEQS 724

Query: 190  KTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTN 249
            +T+ I       S  Y + + ED E+        FI  L  Q         ++E I  + 
Sbjct: 725  RTFAIC------SLKYNQSSTEDSEM-------HFIRLLDDQ---------TFEFI--ST 760

Query: 250  FPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNY-SEDVTCRGRILLFDIIEVVPE 308
            +PL  +E+   + + S   +  +     Y  +GT Y    E+   +GRIL+F     + E
Sbjct: 761  YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 810

Query: 309  PGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKI--YIWQLKDNDLTGIAFIDT 366
             G      K+++I  KE KG V ++    G L+ A+ QKI  Y W L+D+   G   + +
Sbjct: 811  DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDD---GTRELQS 861

Query: 367  EV-----YIASMVSVK-NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYA 420
            E       +A  V  + + I+VGD  +SI+LL Y+ E   +   ARDY     ++     
Sbjct: 862  ESGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAV---- 917

Query: 421  GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFM 480
                                                +ILD+   +G   ++ + N+    
Sbjct: 918  ------------------------------------EILDDDIYLG---AENNFNIFTVR 938

Query: 481  YQPEARESNGGHRLIKKTDFHLGQHVNTF----FKIRCKPSSISDAPGARSRFLTWYASL 536
               E        RL    ++HLG+ VN F      +R   S +   P         + ++
Sbjct: 939  KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIP------TVIFGTV 992

Query: 537  DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDG 596
            +G +G    LP   Y  L  LQ  +       GGL+   +R++  +       ++  +DG
Sbjct: 993  NGVIGVIASLPHDQYVFLEKLQANLRKVIKGVGGLSHEQWRSFNNEKKTV--DAKNFLDG 1050

Query: 597  SLVWKFLQLSLGERLEICKKIG 618
             L+  FL L+     EI K + 
Sbjct: 1051 DLIETFLDLNRTRMDEISKAMA 1072


>gi|226510488|ref|NP_001145925.1| uncharacterized protein LOC100279448 [Zea mays]
 gi|219884971|gb|ACL52860.1| unknown [Zea mays]
          Length = 416

 Score = 73.6 bits (179), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 88/382 (23%), Positives = 154/382 (40%), Gaps = 83/382 (21%)

Query: 250 FPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNY-SEDVTCRGRILLFDIIEVVPE 308
           +PL ++E    + + S   +  +     Y  +GT Y    E+   +GRIL+F +     E
Sbjct: 88  YPLDQYECGCSIISCSFADDSNV-----YYCVGTAYVIPEENEPTKGRILVFAV-----E 137

Query: 309 PGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKI--YIWQLKDNDLTGIAFIDT 366
            G       +++I  KE KG V ++    G L+ A+ QKI  Y W  +++   G   + +
Sbjct: 138 DG------SLQLIVEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMSRED---GSHELQS 188

Query: 367 EV----YIASMVSVK--NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYA 420
           E     +I ++ +    + I+VGD  +SI+LL Y+ E   +   ARDY      +     
Sbjct: 189 ECGHHGHILALYTQTRGDFIVVGDLMKSISLLVYKHEESAIEERARDYNANWMTAV---- 244

Query: 421 GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFM 480
                                               ++LD+   +G   ++   N+    
Sbjct: 245 ------------------------------------EMLDDEVYVG---AENSYNLFTVR 265

Query: 481 YQPEARESNGGHRLIKKTDFHLGQHVNTF----FKIRCKPSSISDAPGARSRFLTWYASL 536
              +A   +   RL    ++HLG+ VN F      +R   S I   P         + ++
Sbjct: 266 KNSDAATDDERARLEVVGEYHLGEFVNRFRHGSLVMRLPDSDIGQIP------TVIFGTI 319

Query: 537 DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDG 596
           +G +G    LP   Y  L  LQ+ +V +    G L+   +R++      A   +R  +DG
Sbjct: 320 NGVIGIIASLPHDQYIFLEKLQSTLVKYIKGVGNLSHEQWRSFHNDKKTA--EARNFLDG 377

Query: 597 SLVWKFLQLSLGERLEICKKIG 618
            L+  FL LS  +  E+ K +G
Sbjct: 378 DLIESFLDLSRSKMEEVSKAMG 399


>gi|225443992|ref|XP_002280744.1| PREDICTED: DNA damage-binding protein 1 isoform 2 [Vitis vinifera]
          Length = 1068

 Score = 73.6 bits (179), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 113/502 (22%), Positives = 195/502 (38%), Gaps = 112/502 (22%)

Query: 130  VSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLET 189
            VS + PF++   P   L    + +L I  +           +R +PL      + +  ++
Sbjct: 649  VSHMCPFNSAAFPDS-LAIAKEGDLTIGTIDDI----QKLHIRSIPLGEHARRICHQEQS 703

Query: 190  KTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTN 249
            +T+ I       S  Y + + ED E+        FI  L  Q         ++E I  + 
Sbjct: 704  RTFAIC------SLKYNQSSTEDSEM-------HFIRLLDDQ---------TFEFI--ST 739

Query: 250  FPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNY-SEDVTCRGRILLFDIIEVVPE 308
            +PL  +E+   + + S   +  +     Y  +GT Y    E+   +GRIL+F     + E
Sbjct: 740  YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 789

Query: 309  PGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKI--YIWQLKDNDLTGIAFIDT 366
             G      K+++I  KE KG V ++    G L+ A+ QKI  Y W L+D+   G   + +
Sbjct: 790  DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDD---GTRELQS 840

Query: 367  EV-----YIASMVSVK-NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYA 420
            E       +A  V  + + I+VGD  +SI+LL Y+ E   +   ARDY     ++     
Sbjct: 841  ESGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAV---- 896

Query: 421  GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFM 480
                                                +ILD+   +G   ++ + N+    
Sbjct: 897  ------------------------------------EILDDDIYLG---AENNFNIFTVR 917

Query: 481  YQPEARESNGGHRLIKKTDFHLGQHVNTF----FKIRCKPSSISDAPGARSRFLTWYASL 536
               E        RL    ++HLG+ VN F      +R   S +   P         + ++
Sbjct: 918  KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIP------TVIFGTV 971

Query: 537  DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDG 596
            +G +G    LP   Y  L  LQ  +       GGL+   +R++  +       ++  +DG
Sbjct: 972  NGVIGVIASLPHDQYVFLEKLQANLRKVIKGVGGLSHEQWRSFNNEKKTV--DAKNFLDG 1029

Query: 597  SLVWKFLQLSLGERLEICKKIG 618
             L+  FL L+     EI K + 
Sbjct: 1030 DLIETFLDLNRTRMDEISKAMA 1051


>gi|303391353|ref|XP_003073906.1| pre-mRNA cleavage and polyadenylation specificity factor
           [Encephalitozoon intestinalis ATCC 50506]
 gi|303303055|gb|ADM12546.1| pre-mRNA cleavage and polyadenylation specificity factor
           [Encephalitozoon intestinalis ATCC 50506]
          Length = 601

 Score = 73.6 bits (179), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 60/235 (25%), Positives = 112/235 (47%), Gaps = 19/235 (8%)

Query: 172 RKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQ 231
           +K+P+  TP  + Y      Y +V S  E   ++   NG+D            +P    +
Sbjct: 210 KKIPVLRTPKHIEY---ADRYMVVASCEE--VEFSPKNGKDCG----------VPVNTYR 254

Query: 232 FHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDV 291
           F+V L+S   +E I  + + L E E+V  ++ + ++      G   ++ + T +   ED 
Sbjct: 255 FYVDLYSE-KYEHI--STYELEENEYVFDIQYLVLDDMQGNYGKSPFLLVCTTFIEGEDR 311

Query: 292 TCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIW 351
             +GR+ + +II VVP    P    K+K++  ++ KG +     V G +V  +G KI I+
Sbjct: 312 PAKGRLHVLEIISVVPSLESPFKDCKLKVLGIEKTKGSIVQCSEVRGKIVLCLGTKIMIY 371

Query: 352 QL-KDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVA 405
           ++ + + +  I F D   + +S+  VKN IL  D  R ++   +Q +   L L++
Sbjct: 372 KIDRGSGIIPIGFHDLHTFTSSISVVKNYILASDIYRGLSFFFFQSKPIRLHLIS 426


>gi|413946716|gb|AFW79365.1| hypothetical protein ZEAMMB73_562969 [Zea mays]
          Length = 1089

 Score = 73.6 bits (179), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 90/382 (23%), Positives = 156/382 (40%), Gaps = 83/382 (21%)

Query: 250  FPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNY-SEDVTCRGRILLFDIIEVVPE 308
            +PL ++E    + + S   +  +     Y  +GT Y    E+   +GRIL+F +     E
Sbjct: 761  YPLDQYECGCSIISCSFADDSNV-----YYCVGTAYVIPEENEPTKGRILVFAV-----E 810

Query: 309  PGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKI--YIWQLKDNDLTGIAFIDT 366
             G       +++I  KE KG V ++    G L+ A+ QKI  Y W  +++   G   + +
Sbjct: 811  DG------SLQLIVEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMSRED---GSHELQS 861

Query: 367  EV----YIASMVSVK--NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYA 420
            E     +I ++ +    + I+VGD  +SI+LL Y+ E   +   ARDY            
Sbjct: 862  ECGHHGHILALYTQTRGDFIVVGDLMKSISLLVYKHEESAIEERARDYNAN--------- 912

Query: 421  GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFM 480
                         W    ++  E L+             DE     ++ ++   N+    
Sbjct: 913  -------------W----MTAVEMLD-------------DEV----YVGAENSYNLFTVR 938

Query: 481  YQPEARESNGGHRLIKKTDFHLGQHVNTF----FKIRCKPSSISDAPGARSRFLTWYASL 536
               +A   +   RL    ++HLG+ VN F      +R   S I   P         + ++
Sbjct: 939  KNSDAATDDERARLEVVGEYHLGEFVNRFRHGSLVMRLPDSDIGQIP------TVIFGTI 992

Query: 537  DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDG 596
            +G +G    LP   Y  L  LQ+ +V +    G L+   +R++      A   +R  +DG
Sbjct: 993  NGVIGIIASLPHDQYIFLEKLQSTLVKYIKGVGNLSHEQWRSFHNDKKTA--EARNFLDG 1050

Query: 597  SLVWKFLQLSLGERLEICKKIG 618
             L+  FL LS  +  E+ K +G
Sbjct: 1051 DLIESFLDLSRSKMEEVSKAMG 1072


>gi|440492924|gb|ELQ75450.1| mRNA cleavage and polyadenylation factor II complex, subunit CFT1
            (CPSF subunit) [Trachipleistophora hominis]
          Length = 1254

 Score = 73.2 bits (178), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 59/264 (22%), Positives = 108/264 (40%), Gaps = 59/264 (22%)

Query: 294  RGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQL 353
            RGRIL+F++I+V+ +     TK  +K++ ++  KGP++    V G +  ++  ++ +++ 
Sbjct: 976  RGRILVFEVIDVISDTADRKTKKALKLLGSERTKGPISCCAAVRGRIAVSLATRLMVYEF 1035

Query: 354  KDND-LTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQ 412
              N  +  IAF D  +Y  S+  +KN I+VGD    +  + +Q E   L L+++  +   
Sbjct: 1036 DRNTGIVAIAFYDLYMYAVSLAVIKNYIVVGDIMMGLHFVYFQSEPVKLHLLSKSGRVAN 1095

Query: 413  PNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDK 472
              S  ++                      G+RL I                       DK
Sbjct: 1096 LGSLDFFNA--------------------GDRLFITG--------------------IDK 1115

Query: 473  DKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTW 532
               V +F + P    SN G +L+K+  F    H  +                 R+     
Sbjct: 1116 TGEVQIFSFSPGNLYSNEGEKLVKRQQFETYAHFQSI----------------RTNTYRS 1159

Query: 533  YASLDGALGFFLPLP--EKNYRRL 554
            YAS   +  FF+ L   +K+Y ++
Sbjct: 1160 YASFFSSQNFFVTLSYTQKDYGKI 1183


>gi|357519461|ref|XP_003630019.1| DNA damage-binding protein [Medicago truncatula]
 gi|355524041|gb|AET04495.1| DNA damage-binding protein [Medicago truncatula]
          Length = 1171

 Score = 72.8 bits (177), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 109/501 (21%), Positives = 191/501 (38%), Gaps = 112/501 (22%)

Query: 130  VSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLET 189
            VS + PF++   P   L    + EL I  +           +R +PL      + +  +T
Sbjct: 752  VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRTIPLGEHARRICHQEQT 806

Query: 190  KTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTN 249
            +T+ I       S  Y   + E+ E+        F+  L  Q        F +  +    
Sbjct: 807  RTFAIC------SLKYNSASAEESEM-------HFVRLLDDQ-------TFDFISV---- 842

Query: 250  FPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNY-SEDVTCRGRILLFDIIEVVPE 308
            +PL  +E+   + + S   +  +     Y  +GT Y    E+   +GRIL+F + E    
Sbjct: 843  YPLDTYEYGCFIISCSFSDDNNV-----YYCVGTAYVLPEENEPTKGRILVFSVEE---- 893

Query: 309  PGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKI--YIWQLKDNDLTGIAFIDT 366
                    K++++  KE KG V  +    G L+ A+ QKI  Y W L+++   G   + +
Sbjct: 894  -------GKLQLVAEKETKGAVYCLNAFNGKLLAAINQKIQLYKWVLRED---GTRELQS 943

Query: 367  EV-----YIASMVSVK-NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYA 420
            E       +A  V  + + I+VGD  +SI+LL Y+ E   +   ARDY     ++     
Sbjct: 944  ECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAV---- 999

Query: 421  GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFM 480
                                                +ILD+   +G   ++   N+    
Sbjct: 1000 ------------------------------------EILDDDVYLG---AENSFNLFTVR 1020

Query: 481  YQPEARESNGGHRLIKKTDFHLGQHVNTF----FKIRCKPSSISDAPGARSRFLTWYASL 536
               E        RL    ++HLG+ +N F      +R   S +   P         + ++
Sbjct: 1021 KNSEGATDEERGRLEVAGEYHLGEFINRFRHGSLVMRLPDSDVGQIP------TVIFGTI 1074

Query: 537  DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDG 596
            +G +G    LP + Y  L  LQ+ +       GGL+   +R++  +       +R  +DG
Sbjct: 1075 NGVIGVIASLPHEQYVFLEKLQSNLRKVIKGVGGLSHEQWRSFNNEKKTV--EARNFLDG 1132

Query: 597  SLVWKFLQLSLGERLEICKKI 617
             L+  FL L   +  EI K +
Sbjct: 1133 DLIESFLDLKRSKMDEISKAM 1153


>gi|407923753|gb|EKG16818.1| Cleavage/polyadenylation specificity factor A subunit [Macrophomina
            phaseolina MS6]
          Length = 1129

 Score = 72.4 bits (176), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 109/544 (20%), Positives = 203/544 (37%), Gaps = 99/544 (18%)

Query: 83   GVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCP 142
            G + +  R      G   VF    HP+ ++  S G L    +T +   + + PF+    P
Sbjct: 656  GTQQANFRALPRGNGLYNVFATCEHPSLIY-GSEGRLVFSAVTAE-KATCVCPFNAEAYP 713

Query: 143  RGFLYFNAKSELRISVLP----THLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTST 198
            R  +   A  EL ++V+     TH        V+ + +  T   +AY  + K + +    
Sbjct: 714  RS-IAIAASGELHLAVVDEERRTH--------VQTLHVNETVRRIAYSPQLKAFGL---- 760

Query: 199  AEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHV 258
                       G  K ++ D  +       V Q H  L     ++E+   NF ++E+E V
Sbjct: 761  -----------GTIKRVLRDREE-------VVQGHFRLADEVIFKELD--NFEMNEYEIV 800

Query: 259  LCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKI 318
             C     ++     +  R  +         E  + RGRIL+F++ E            ++
Sbjct: 801  ECAIRAELDDGDGETAERFIVGTSHLVEEEEQGSTRGRILVFEVTE----------DRRL 850

Query: 319  KMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDND-----LTGIAFIDTEVYIASM 373
            K+I     KG    +  V   +V  + + + I+  + +      L   A   T      +
Sbjct: 851  KVIAEISTKGACRCLAMVDNKIVAGLIKTVVIYSFEYSTPSTPFLVKKASFRTSTAPIDI 910

Query: 374  VSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLV 433
                N I V D  +S+++L Y+P                       AG+ S         
Sbjct: 911  TVTGNQIAVADLIKSVSVLEYKPG----------------------AGDQS--------- 939

Query: 434  WKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR 493
                     E  E+ + +    +  L E     ++ +D + N++L              R
Sbjct: 940  --------DELKEVARHVQVSWSMALAEVDENTYLQADAEGNLILLERDVSGVTEEDRKR 991

Query: 494  LIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRR 553
            L+ + D  LG+ VN   +I    +++SDAP     F   +A+++G++  F  +       
Sbjct: 992  LMLRGDMLLGEQVNRIRRIDM--ATVSDAPVIPRAF---FATVEGSIYLFALIAPAKVDL 1046

Query: 554  LLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEI 613
            L+ LQ+ +       G      +R ++ +      P+R  +DG L+ +FL L   E+ E+
Sbjct: 1047 LIRLQSQLADFVRSPGHYPFLRYRAFRNQVREEDEPNR-FVDGDLIERFLDLKPREQEEV 1105

Query: 614  CKKI 617
             K +
Sbjct: 1106 VKGV 1109


>gi|348681092|gb|EGZ20908.1| hypothetical protein PHYSODRAFT_259403 [Phytophthora sojae]
          Length = 1137

 Score = 72.4 bits (176), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 77/360 (21%), Positives = 143/360 (39%), Gaps = 71/360 (19%)

Query: 278  YIALGTNYNYSEDVT-CRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
            Y  +GT Y + ++    +GRIL+F +  +  E        K++++  KE KG V  +   
Sbjct: 805  YFVVGTAYIHEDEAEPHQGRILVFAVTGIHGE-------RKLQLVTEKEVKGAVYCLNAF 857

Query: 337  AGFLVTAVGQKIYIWQLKDNDLTGIAFIDT-----EVYIASMVSVKNLILVGDYARSIAL 391
             G ++  V  K  +++  +N       +          +  M S  + I+VGD  +S++L
Sbjct: 858  NGKVLAGVNSKAQLYKWSENTDNEKELVSECGHYGHTLVLYMESRGDFIVVGDLMKSVSL 917

Query: 392  LRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI 451
            L Y+    T+  +A+D      ++ G                                  
Sbjct: 918  LSYKQLDGTIEEIAKDLNSNWMSALG---------------------------------- 943

Query: 452  GSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF-- 509
                  I+D+ + +G   S+ D N+        A       RL    +FHLG+ VN F  
Sbjct: 944  ------IVDDDTYIG---SETDFNLFTVQRNSGAASDEERGRLETVGEFHLGEFVNRFRY 994

Query: 510  ---FKIRCKPSSISDA-------PGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQN 559
                     P+ + D        P A+++ +  + ++ G +G  LPL +  Y  LL +Q 
Sbjct: 995  GSLTPAAAGPTDMVDVVEQAPIVPAAQNQSM-LFGTVSGMIGVILPLTKDQYSFLLRVQQ 1053

Query: 560  VMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGS 619
             +       GG + + +R ++ +   + + +R  IDG LV  FL L   +  ++  K+ S
Sbjct: 1054 ALTQVVKGVGGFSHKDWRMFENR--RSVSEARNFIDGDLVESFLDLPKAQMTKVVDKLNS 1111


>gi|380488833|emb|CCF37111.1| CPSF A subunit region, partial [Colletotrichum higginsianum]
          Length = 1062

 Score = 72.0 bits (175), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 61/211 (28%), Positives = 94/211 (44%), Gaps = 25/211 (11%)

Query: 14   ETIVQELLTVSLG-LHGNRPLLLVR-TQHELLIYQAFRHPKG------ALKLRFKK---- 61
            +  + ELL   LG      P L+VR    +L IY+  R          A  L F+K    
Sbjct: 840  QETLTELLVADLGDTTATSPYLIVRHANDDLTIYEPIRLESQDKTLGLAKTLHFQKITNP 899

Query: 62   -LKVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELR 120
             L    V      ANEQP      R   +R  +NI GY  VFL G  P+ +  +++   +
Sbjct: 900  ALAKSPVEVADDEANEQP------RFVPLRPCANINGYSTVFLPGASPSLIVKSAKSSPK 953

Query: 121  AHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSY-DAPWPVRKVPLKCT 179
               +   G V  ++ FH   C RGF+Y +++ + R++ LP   ++ +    VRK+P+   
Sbjct: 954  VVGLQGIG-VRGMSSFHTEGCERGFIYADSEGQTRVTQLPADSNFAELGVSVRKIPIGDA 1012

Query: 180  PHFLAYHLETKTYCIVTSTAE----PSTDYY 206
               +AYH   +TY +  S +E    P  D Y
Sbjct: 1013 VGLIAYHPPMETYAVACSISEHFELPKDDDY 1043


>gi|170589357|ref|XP_001899440.1| CPSF A subunit region family protein [Brugia malayi]
 gi|158593653|gb|EDP32248.1| CPSF A subunit region family protein [Brugia malayi]
          Length = 655

 Score = 71.2 bits (173), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 114/539 (21%), Positives = 201/539 (37%), Gaps = 92/539 (17%)

Query: 90  RYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFN 149
           ++ S  +    +F+C   PA ++ +++  L ++       VST+ P +    P   +  +
Sbjct: 139 KFRSRCSPVHNIFVCSDRPAVIYSSNQKLLFSNVNL--RMVSTMTPLYAEAYPDALVLTD 196

Query: 150 AKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFN 209
             S     ++   +       +R VPL  +P  +AY  ET T  ++    E     + + 
Sbjct: 197 GHS-----LVIGRIDDIQKLHIRTVPLGESPSRIAYQPETNTIAVIVERLEVILFLFFY- 250

Query: 210 GEDKELVTDPRDSRFIPPLVSQFHVS----LFSPFSWEEIPQTNFPLHEWEHVLCLKNV- 264
                +  D           S+  +       S    E  P+      E   VL L +  
Sbjct: 251 -----VFVDAMGKHHFGQCASKNAMETSSSRLSSMRREPTPECLAEEMEVSSVLLLDSNT 305

Query: 265 -----SMEYEGTLSGL-----------RGYIALGTNYNYSEDVTCR-GRILLFDIIEVVP 307
                S E EG+   +           + Y  +GT    S++   + GRI++F   E  P
Sbjct: 306 FEILHSHELEGSEMAMSLASCQLGDDSQPYFVVGTAVIMSDETESKMGRIMMFQASEG-P 364

Query: 308 EPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTE 367
           E        +++++Y KE KG   +I  + G LV AV   + +++   +    +   D +
Sbjct: 365 E--------RMRLVYEKEIKGAAYSIQSMDGKLVVAVNSCVRLFEWTADKELRLECSDFD 416

Query: 368 VYIASMVSVKN-LILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRG 426
              A  +  KN LILVGD  RS++LL Y+    T   VARD+                  
Sbjct: 417 NVTALYLKTKNDLILVGDLMRSLSLLSYKSMESTFEKVARDFMTN--------------- 461

Query: 427 IIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEAR 486
                  W          +  C+ I S +           F+ ++   N+   M      
Sbjct: 462 -------W----------MSACEIIDSDN-----------FLGAENSYNLFTVMKDSFTV 493

Query: 487 ESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPL 546
               G RL +   F+LG+ VN F       + +  AP   S  L  Y + DG +G  + +
Sbjct: 494 FKEEGTRLQELGLFYLGEMVNVFCHGSLTATQVDVAPLYHSSIL--YGTSDGGIGVIVQM 551

Query: 547 PEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQL 605
           P   Y  L  +Q  +  +  +   ++   +RT++ +         G IDG L+   L +
Sbjct: 552 PPVLYTFLQDVQKRLAEYAENCMRISHTQYRTFETEK--RSEAPNGFIDGDLIESLLDM 608


>gi|307111604|gb|EFN59838.1| hypothetical protein CHLNCDRAFT_29381 [Chlorella variabilis]
          Length = 1108

 Score = 70.5 bits (171), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 89/445 (20%), Positives = 158/445 (35%), Gaps = 103/445 (23%)

Query: 171  VRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVS 230
            VR VPL   P  +A+   ++T+ +  + A         +GE  +                
Sbjct: 724  VRTVPLGEQPRRIAHQETSRTFAVTCTQA-------TISGEGGD---------------- 760

Query: 231  QFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNY-SE 289
               V L    ++E + +     HE    LC   +  +          Y  +GT +   +E
Sbjct: 761  --SVRLVDEQTFELLDRLQLQQHELACSLCSTQLGDDPAT-------YYVVGTAFAPPNE 811

Query: 290  DVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIY 349
                +GRI +                 K+ ++  KE +G V ++    G L+  +  ++ 
Sbjct: 812  PEPTKGRIFVL-----------AAAGGKLCVVCEKETRGAVYSLAEFQGRLLAGINSRVQ 860

Query: 350  IWQLKDNDLTGIAFIDT-----EVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLV 404
            +++  +    G A +        V    + +  +L++VGD  +SI LL +  E   L L 
Sbjct: 861  MYKWLEQGEGGRALVPECSHAGHVLALYLATRGDLVVVGDLMKSIQLLAWGEEEGALELR 920

Query: 405  ARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSS 464
            ARD+ P                       W                       +LD+ + 
Sbjct: 921  ARDFHPN----------------------WM------------------SAVTVLDDDTY 940

Query: 465  MGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF----FKIRCKPSSIS 520
            MG   ++   N+       +A       RL     +HLG+ VN F      +R   S +S
Sbjct: 941  MG---AENSYNLFTVRRNADAATDEERSRLETVGRYHLGEFVNRFQPGSLVMRLPDSELS 997

Query: 521  DAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYK 580
              P         + +++G +G    LP   Y+ L  LQ  M       GG +   +R + 
Sbjct: 998  QIP------TVLFGTINGVIGVVASLPHAQYQLLESLQEAMRKVVKGVGGFDHAQWRAFS 1051

Query: 581  GKGYYAGNPSRGIIDGSLVWKFLQL 605
             + +    P+R  +DG L+ +FL L
Sbjct: 1052 NQ-HMPATPARQFVDGDLIEQFLDL 1075


>gi|384250802|gb|EIE24281.1| hypothetical protein COCSUDRAFT_28729 [Coccomyxa subellipsoidea
            C-169]
          Length = 1101

 Score = 70.5 bits (171), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 90/458 (19%), Positives = 174/458 (37%), Gaps = 100/458 (21%)

Query: 171  VRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVS 230
            +R VPL   P  LA+   ++++ ++TS    +T       +   L+ D            
Sbjct: 714  IRTVPLGEQPRRLAHQEASRSFLVLTSPNNGATGMDDAGPDSVRLLDDQ----------- 762

Query: 231  QFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSED 290
                      ++E + +     +E    +C    SM +         Y  +GT    +E+
Sbjct: 763  ----------TFETLDRFGLETNE----VCCAAASMSFSDDPCP---YYVVGTAITVAEE 805

Query: 291  VT-CRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIY 349
                +GRIL+F                K+ ++  KE KG    +    G L+  +  ++ 
Sbjct: 806  PEPTKGRILVFGA-----------KGGKLSLVCEKEVKGAAYNLHPFQGKLIAGINSRVQ 854

Query: 350  I--WQLKDN---DLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLV 404
            +  W   ++   +LT        V    +V+  + ++VGD  RS+ LL Y+ +   L + 
Sbjct: 855  LFKWTQSEDGSRELTNECSHVGHVLALYIVTRGDFVIVGDLMRSLQLLIYRADEGILEVR 914

Query: 405  ARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSS 464
            ARDYK                        W                      ++LD+ + 
Sbjct: 915  ARDYKTH----------------------WM------------------TAVEVLDDDTY 934

Query: 465  MGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF----FKIRCKPSSIS 520
            +G   ++   N+       +A      +RL     +HLG  VN F      ++   S  +
Sbjct: 935  LG---AENSNNIFTLRKNTDAAADEDRNRLETVGQYHLGVFVNRFRHGSLVMKLPDSEAA 991

Query: 521  DAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYK 580
              P         + +++G++G    LP++ ++ L  LQ+ +       GGL+  A+RT++
Sbjct: 992  KIP------TVLFVTINGSIGVIASLPQQQFQFLSRLQDCLRKVIKGVGGLSHVAWRTFQ 1045

Query: 581  GKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 618
             +  +   PS+  +DG L+ +FL L       + +++G
Sbjct: 1046 DE--HTKMPSQNFVDGDLIEQFLDLKRDSMERVAREMG 1081


>gi|324518783|gb|ADY47203.1| Cleavage and polyadenylation specificity factor subunit 1 [Ascaris
           suum]
          Length = 108

 Score = 70.5 bits (171), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 34/87 (39%), Positives = 56/87 (64%), Gaps = 2/87 (2%)

Query: 465 MGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSS--ISDA 522
           M F++SD+  N+ +F Y PEA ES+GG RLI +++ ++G +VN+F +++   SS  + + 
Sbjct: 20  MAFIMSDEAANIAVFNYLPEALESSGGERLILRSEINIGTNVNSFMRVKGHISSGFVENE 79

Query: 523 PGARSRFLTWYASLDGALGFFLPLPEK 549
             + +R    + SLDG+ GF  PL EK
Sbjct: 80  HYSLNRQSVLFCSLDGSFGFVRPLSEK 106


>gi|312076590|ref|XP_003140929.1| CPSF A subunit region family protein [Loa loa]
          Length = 655

 Score = 70.1 bits (170), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 117/531 (22%), Positives = 206/531 (38%), Gaps = 85/531 (16%)

Query: 90  RYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFN 149
           ++ S  +    +F+C   PA ++ +++  L ++       VST+ P +    P   +  +
Sbjct: 140 KFRSRCSSVHNIFVCSDRPAVIYSSNQKLLFSNVNL--RMVSTMTPLYAEAYPDALVLTD 197

Query: 150 AKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAE--PSTDYYK 207
             S     ++   +       +R VPL  +P  +AY  ET T  +     E   +   + 
Sbjct: 198 GNS-----LVIGRIDDIQKLHIRTVPLGESPSRIAYQPETNTIAVTVERLEFVDAMGKHH 252

Query: 208 F----NGEDKELVTDPRDSRFIPP----LVSQFHVS---LFSPFSWEEIPQTNFPLHEWE 256
           F    +    E  +    S    P    L  +  VS   L    ++E +   +  L   E
Sbjct: 253 FGQCASKNAMETSSSRLSSMRREPTPECLAEEMEVSSILLLDSNTFEILH--SHELEGSE 310

Query: 257 HVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTK 315
             + L +  +  +      + Y  +GT    S++   + GRI++F   E  PE       
Sbjct: 311 MAMSLASCQLGNDS-----QPYFVVGTAVIMSDETESKMGRIMMFQASEG-PE------- 357

Query: 316 NKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVS 375
            +++++Y KE KG   +I  + G LV AV   + +++   +    +   D +   A  + 
Sbjct: 358 -RMRLVYEKEIKGAAYSIQSMDGKLVVAVNSCVRLFEWTADKELRLECSDFDNVTALYLK 416

Query: 376 VKN-LILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVW 434
            KN LILVGD  RS++LL Y+    T   VARD+                         W
Sbjct: 417 TKNDLILVGDLMRSLSLLSYKSVESTFEKVARDFMTN----------------------W 454

Query: 435 KFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRL 494
                     +  C+ I S  +  L   +S       KD   V   ++ E      G RL
Sbjct: 455 ----------MSACEIIDS--DSFLGAENSYNLFTVVKDSFTV---FKEE------GTRL 493

Query: 495 IKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRL 554
            +   F+LG+ VN F       + +  AP   S  L  Y + DG +G  + +P   Y  L
Sbjct: 494 QELGLFYLGEMVNVFCHGSLTATQVDVAPLYHSSIL--YGTSDGGIGVIVQMPPVLYTFL 551

Query: 555 LMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQL 605
             +Q  +  +T +   ++   +RT++ +         G IDG L+   L +
Sbjct: 552 HDVQKRLADYTENCMRISHTQYRTFETEK--RSEVPNGFIDGDLIESLLDM 600


>gi|393905247|gb|EJD73911.1| CPSF A subunit region family protein [Loa loa]
          Length = 1145

 Score = 70.1 bits (170), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 117/531 (22%), Positives = 206/531 (38%), Gaps = 85/531 (16%)

Query: 90   RYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFN 149
            ++ S  +    +F+C   PA ++ +++  L ++       VST+ P +    P   +  +
Sbjct: 639  KFRSRCSSVHNIFVCSDRPAVIYSSNQKLLFSNVNL--RMVSTMTPLYAEAYPDALVLTD 696

Query: 150  AKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAE--PSTDYYK 207
              S     ++   +       +R VPL  +P  +AY  ET T  +     E   +   + 
Sbjct: 697  GNS-----LVIGRIDDIQKLHIRTVPLGESPSRIAYQPETNTIAVTVERLEFVDAMGKHH 751

Query: 208  F----NGEDKELVTDPRDSRFIPP----LVSQFHVS---LFSPFSWEEIPQTNFPLHEWE 256
            F    +    E  +    S    P    L  +  VS   L    ++E +   +  L   E
Sbjct: 752  FGQCASKNAMETSSSRLSSMRREPTPECLAEEMEVSSILLLDSNTFEILH--SHELEGSE 809

Query: 257  HVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTK 315
              + L +  +  +      + Y  +GT    S++   + GRI++F   E  PE       
Sbjct: 810  MAMSLASCQLGNDS-----QPYFVVGTAVIMSDETESKMGRIMMFQASEG-PE------- 856

Query: 316  NKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVS 375
             +++++Y KE KG   +I  + G LV AV   + +++   +    +   D +   A  + 
Sbjct: 857  -RMRLVYEKEIKGAAYSIQSMDGKLVVAVNSCVRLFEWTADKELRLECSDFDNVTALYLK 915

Query: 376  VKN-LILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVW 434
             KN LILVGD  RS++LL Y+    T   VARD+                         W
Sbjct: 916  TKNDLILVGDLMRSLSLLSYKSVESTFEKVARDFMTN----------------------W 953

Query: 435  KFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRL 494
                      +  C+ I S  +  L   +S       KD   V   ++ E      G RL
Sbjct: 954  ----------MSACEIIDS--DSFLGAENSYNLFTVVKDSFTV---FKEE------GTRL 992

Query: 495  IKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRL 554
             +   F+LG+ VN F       + +  AP   S  L  Y + DG +G  + +P   Y  L
Sbjct: 993  QELGLFYLGEMVNVFCHGSLTATQVDVAPLYHSSIL--YGTSDGGIGVIVQMPPVLYTFL 1050

Query: 555  LMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQL 605
              +Q  +  +T +   ++   +RT++ +         G IDG L+   L +
Sbjct: 1051 HDVQKRLADYTENCMRISHTQYRTFETEK--RSEVPNGFIDGDLIESLLDM 1099


>gi|258572939|ref|XP_002540651.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
 gi|237900917|gb|EEP75318.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
          Length = 1144

 Score = 70.1 bits (170), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 78/348 (22%), Positives = 134/348 (38%), Gaps = 69/348 (19%)

Query: 294  RGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQL 353
            +GRIL+FD+              +++M+     +G   A+  V G +V A+ + + I  +
Sbjct: 848  KGRILIFDV----------GVNRELRMVSEFPVRGACRALAMVNGKIVAALMKSVVILSM 897

Query: 354  KDNDLTGIAFIDTEVYIASMVSVK-----NLILVGDYARSIALLRYQP----EYRTLSLV 404
            K  +   I       Y  S   V      N+I+V D  +SI+LL YQ     +  +L  V
Sbjct: 898  KKGNSYSIDIGKESSYRTSTAPVDLSVTDNIIVVADLMKSISLLEYQAGEAGQPDSLKEV 957

Query: 405  ARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSS 464
            AR Y+                       +W      + E                     
Sbjct: 958  ARHYQT----------------------LWTTTAAPIAEN-------------------- 975

Query: 465  MGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPG 524
              F++SD + N+V+          +   R+   ++  LG  VN   ++  + S  S  P 
Sbjct: 976  -AFLVSDAEGNLVVLNRNTTGVTEDDKRRMQITSELRLGTMVNRIRRMDLQASQSS--PV 1032

Query: 525  ARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGY 584
                FL   A+ DG++  F  + +     L+ LQ+ + +  +  GG+    +R +K    
Sbjct: 1033 IPKAFL---ATTDGSIYLFGVIAQFAQDLLMRLQSALASFVASPGGIPFSGYRAFKSATR 1089

Query: 585  YAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI-LDELYDI 631
             A  P R  +DG LV +FL   L  +  +  K+     D+ L +L DI
Sbjct: 1090 QADEPFR-FVDGELVEQFLDCPLEVQEAVLAKMDGGGRDVTLSQLKDI 1136


>gi|443707495|gb|ELU03057.1| hypothetical protein CAPTEDRAFT_148808 [Capitella teleta]
          Length = 1084

 Score = 69.7 bits (169), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 114/499 (22%), Positives = 191/499 (38%), Gaps = 107/499 (21%)

Query: 171  VRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLV- 229
            +R VPL  TP  +AY   ++T+ ++T      +D    NG      T  R S     L  
Sbjct: 656  IRNVPLGETPRRIAYQEASQTFGVIT----LRSDLQDSNGS-----TPARPSASTQALST 706

Query: 230  ---SQFHVSLFSPFSWEEIPQTNFPLHEW----EHVLCL--KNVSMEYE---GTLSGLRG 277
               S   V   S  + E        +H      +H   +   +  M+YE     +SG  G
Sbjct: 707  SSSSNVKVMAASNANTEHTFGDEVEVHSLLVLDQHTFEVLHSHQLMQYEFATALMSGRFG 766

Query: 278  -----YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVT 331
                 Y  +GT   Y E+   + GRI++F                K+  +  KE KG   
Sbjct: 767  EDPTTYYVVGTAMVYPEEAEPKQGRIIVF-----------RFHDGKLTQVAEKEIKGAAY 815

Query: 332  AICHVAGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVK-NLILVGDYARS 388
             +    G L+ ++    +++ W  +       ++ +    IA  +  K + ILVGD  RS
Sbjct: 816  TLTEFNGKLLASINSTVRLFEWTAEKELRVECSYFNN--IIALYLKTKGDFILVGDLMRS 873

Query: 389  IALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEIC 448
            + LL Y+P       +ARDY P    S                                 
Sbjct: 874  VTLLSYKPMEGCFEEIARDYNPNWMTSI-------------------------------- 901

Query: 449  KKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHV 506
                    D+LD+ + +G      + +  +F  Q ++  +    R  L +   +HLG+ V
Sbjct: 902  --------DVLDDDTFLG-----AENSFNIFTCQKDSAATTDEERQHLQEVGLYHLGEFV 948

Query: 507  NTFFKIRCKPSSISDAPG---ARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVT 563
            N F       S +   PG   + ++    + +++GALG    LP++ Y  LL +QN +  
Sbjct: 949  NVFRH----GSLVMQHPGECTSPTQGSVLFGTVNGALGLVTQLPQEFYLFLLEVQNKLAK 1004

Query: 564  HTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------ 617
                 G +    +R++  +      P+ G IDG L+  FL LS  +  E+ + +      
Sbjct: 1005 TIKSVGKVEHAFWRSFHTE--RKTEPATGFIDGDLIESFLDLSRDKMQEVVQGLQMDDGS 1062

Query: 618  GSKHNDILDELYD-IEALS 635
            G K    +D+L   IE L+
Sbjct: 1063 GMKREAAVDDLVKMIEELT 1081


>gi|302769568|ref|XP_002968203.1| hypothetical protein SELMODRAFT_145521 [Selaginella moellendorffii]
 gi|300163847|gb|EFJ30457.1| hypothetical protein SELMODRAFT_145521 [Selaginella moellendorffii]
          Length = 1089

 Score = 69.7 bits (169), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 109/493 (22%), Positives = 186/493 (37%), Gaps = 104/493 (21%)

Query: 130  VSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLET 189
            V+ + PF++ + P   L    + EL I  +           +R V L   P  + +  +T
Sbjct: 670  VNHMCPFNSASFPDS-LAIGKEGELTIGTIDDI----QKLHIRTVALGEHPRRICHQEQT 724

Query: 190  KTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTN 249
            +T+ + T+       Y   NGED E       S F+  L  Q    L S           
Sbjct: 725  RTFGLCTARF-----YSNPNGEDHE-------SHFVKLLDDQTFEVLGS----------- 761

Query: 250  FPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNY-SEDVTCRGRILLFDIIEVVPE 308
            + L  +E+   +   S   +        Y  +GT Y    E+   +GRIL+F +     E
Sbjct: 762  YNLDTFENGCTIITCSFTDDPAT-----YYCVGTAYALPEENEPSKGRILIFTV-----E 811

Query: 309  PGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKI--YIWQLKDN--DLTGIAFI 364
             G      K +++  KE KG V  +    G L+  + QKI  Y W  +D+  +L      
Sbjct: 812  DG------KFQLVTEKETKGAVYNLNAFNGKLLAGINQKIQLYKWTQRDSTRELQSECGH 865

Query: 365  DTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPS 424
               +    + S  + I+VGD  +SI+LL Y+PE   +   ARDY                
Sbjct: 866  HGHILALYVQSRGDFIVVGDLMKSISLLLYKPEEGAIEERARDYNAN------------- 912

Query: 425  RGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPE 484
                     W                      +ILD+   +G   ++   N+       +
Sbjct: 913  ---------WM------------------TAVEILDDDIYLG---AENSFNLFTVRKNSD 942

Query: 485  ARESNGGHRLIKKTDFHLGQHVNTF----FKIRCKPSSISDAPGARSRFLTWYASLDGAL 540
            A       RL    ++HLG+ VN F      +R   +  S  P         + +++G +
Sbjct: 943  AATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDNETSQIP------TVIFGTVNGVI 996

Query: 541  GFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVW 600
            G    L ++ +  L  LQ+ +       GGL+   +R++  +   A   ++  +DG L+ 
Sbjct: 997  GVVASLQQEQFNFLQRLQHCLAKVIKGVGGLSHEQWRSFSSERKNA--DAKNFLDGDLIE 1054

Query: 601  KFLQLSLGERLEI 613
             FL L+  +  E+
Sbjct: 1055 SFLDLNRAKMDEV 1067


>gi|302788810|ref|XP_002976174.1| hypothetical protein SELMODRAFT_151061 [Selaginella moellendorffii]
 gi|300156450|gb|EFJ23079.1| hypothetical protein SELMODRAFT_151061 [Selaginella moellendorffii]
          Length = 1089

 Score = 69.7 bits (169), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 109/493 (22%), Positives = 186/493 (37%), Gaps = 104/493 (21%)

Query: 130  VSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLET 189
            V+ + PF++ + P   L    + EL I  +           +R V L   P  + +  +T
Sbjct: 670  VNHMCPFNSASFPDS-LAIGKEGELTIGTIDDI----QKLHIRTVALGEHPRRICHQEQT 724

Query: 190  KTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTN 249
            +T+ + T+       Y   NGED E       S F+  L  Q    L S           
Sbjct: 725  RTFGLCTARF-----YSNPNGEDHE-------SHFVKLLDDQTFEVLGS----------- 761

Query: 250  FPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNY-SEDVTCRGRILLFDIIEVVPE 308
            + L  +E+   +   S   +        Y  +GT Y    E+   +GRIL+F +     E
Sbjct: 762  YNLDTFENGCTIITCSFTDDPAT-----YYCVGTAYALPEENEPSKGRILIFTV-----E 811

Query: 309  PGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKI--YIWQLKDN--DLTGIAFI 364
             G      K +++  KE KG V  +    G L+  + QKI  Y W  +D+  +L      
Sbjct: 812  DG------KFQLVTEKETKGAVYNLNAFNGKLLAGINQKIQLYKWTQRDSTRELQSECGH 865

Query: 365  DTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPS 424
               +    + S  + I+VGD  +SI+LL Y+PE   +   ARDY                
Sbjct: 866  HGHILALYVQSRGDFIVVGDLMKSISLLLYKPEEGAIEERARDYNAN------------- 912

Query: 425  RGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPE 484
                     W                      +ILD+   +G   ++   N+       +
Sbjct: 913  ---------WM------------------TAVEILDDDIYLG---AENSFNLFTVRKNSD 942

Query: 485  ARESNGGHRLIKKTDFHLGQHVNTF----FKIRCKPSSISDAPGARSRFLTWYASLDGAL 540
            A       RL    ++HLG+ VN F      +R   +  S  P         + +++G +
Sbjct: 943  AATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDNETSQIP------TVIFGTVNGVI 996

Query: 541  GFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVW 600
            G    L ++ +  L  LQ+ +       GGL+   +R++  +   A   ++  +DG L+ 
Sbjct: 997  GVVASLQQEQFNFLQRLQHCLAKVIKGVGGLSHEQWRSFSSERKNA--DAKNFLDGDLIE 1054

Query: 601  KFLQLSLGERLEI 613
             FL L+  +  E+
Sbjct: 1055 SFLDLNRAKMDEV 1067


>gi|219125301|ref|XP_002182922.1| damage-specific DNA binding protein 1 [Phaeodactylum tricornutum CCAP
            1055/1]
 gi|217405716|gb|EEC45658.1| damage-specific DNA binding protein 1 [Phaeodactylum tricornutum CCAP
            1055/1]
          Length = 1284

 Score = 69.7 bits (169), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 78/338 (23%), Positives = 132/338 (39%), Gaps = 55/338 (16%)

Query: 276  RGYIALGTNYNY-SEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAIC 334
            R ++ +GT Y    ED   RGRIL++   +     G P +   ++ I     +G V +IC
Sbjct: 929  RPFLLVGTAYAMPDEDEPSRGRILVYSC-QADEASGTPTSTRAVRQITEMSTQGGVYSIC 987

Query: 335  HV-AGFLVTAVGQKIYIWQLKDN------DLTGIAFIDTEVYIASMVSVKNLILVGDYAR 387
                G  +  V  K ++ Q+  +      +  GI      V +      K L +VGD  R
Sbjct: 988  QFYDGNFLCTVNSKTHVVQIVADCGVLRLEYVGIGHHGHIVSLFVKSRAKPLAIVGDLMR 1047

Query: 388  SIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEI 447
            S++L++Y P++ TL  VARD+ P    +      +    +  G+  W  L          
Sbjct: 1048 SVSLMQYYPQHETLEEVARDFNPNWTTAVEMLTDD----VYIGAENWNNL---------F 1094

Query: 448  CKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVN 507
            C +                       +N      +   R  N G       +FHLG+  N
Sbjct: 1095 CLR-----------------------RNKAATSEEIRCRLDNIG-------EFHLGEMCN 1124

Query: 508  TFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSH 567
             F         +S      SR  T + +++G+LG  L L  +     + L+  +      
Sbjct: 1125 KFMS-GSLVMPVSSNSTTSSRRATLFGTVEGSLGVILGLDGRTAAFFITLERAIAKTIQP 1183

Query: 568  TGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQL 605
             GG + + +R+ + +     +P+ G +DG LV  FL L
Sbjct: 1184 VGGFSHQLYRSCQAE--LRVHPAHGFVDGDLVETFLDL 1219


>gi|452824087|gb|EME31092.1| DNA damage-binding protein 1 isoform 1 [Galdieria sulphuraria]
          Length = 1128

 Score = 68.9 bits (167), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 98/456 (21%), Positives = 182/456 (39%), Gaps = 86/456 (18%)

Query: 171  VRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRD-SRFIPPLV 229
            +R +PL   P  +A HL+T     V +T              K++VT   D +  +    
Sbjct: 732  IRTIPLGEQPRRIA-HLDTHHVFAVLTT--------------KQVVTISEDGNEALSETT 776

Query: 230  SQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSE 289
             + +V L      E +   ++ L ++E    +  V+   +      + Y  +GT Y+Y++
Sbjct: 777  EEGYVRLIDDTMMEIVH--SYKLEQFETPCSVITVNFGDDAAAKDNQDYFVVGTAYSYAD 834

Query: 290  DVT-CRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQ-- 346
            +    RGR+L+F + E            ++ ++  +  KG + ++    G ++ +V    
Sbjct: 835  EPEPSRGRMLVFAVRE-----------QRLTLVAERTFKGALYSMDAFNGKILASVNSML 883

Query: 347  KIYIWQLKDN---DLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSL 403
            K+  W   ++    LT        ++I  +  + + IL+GD  RS++LL Y+P   T+  
Sbjct: 884  KLVRWSETESGARTLTEECTYHGSIFILQIKCLGDFILIGDLVRSVSLLAYKPMNGTIED 943

Query: 404  VARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFS 463
            VARD  P+                      W    +++ E L+            LD + 
Sbjct: 944  VARDIDPS----------------------W----ITVIEMLD------------LDYYI 965

Query: 464  SMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAP 523
            S     ++   N+       +A       RL K  ++HLG+ VN     R     +   P
Sbjct: 966  S-----AENCFNLFTLKRNSDASTEEERSRLEKVGEYHLGELVNRIRHGRL----VLQIP 1016

Query: 524  GARSRFLT--WYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKG 581
             +    L    Y + +GALG    + EK ++ L  LQ  +       GG+    +R +  
Sbjct: 1017 ESGISILKSLLYGTANGALGVIASIDEKTFQFLHSLQTALNEVIKGVGGIQHEDWRRFTS 1076

Query: 582  KGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI 617
            +       S+  +DG L+ +FL LS  +   + KK+
Sbjct: 1077 ERRIG--DSKNFLDGDLIERFLDLSRDKMELVAKKV 1110


>gi|301121252|ref|XP_002908353.1| DNA damage-binding protein, putative [Phytophthora infestans T30-4]
 gi|262103384|gb|EEY61436.1| DNA damage-binding protein, putative [Phytophthora infestans T30-4]
          Length = 1150

 Score = 68.9 bits (167), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 79/373 (21%), Positives = 146/373 (39%), Gaps = 84/373 (22%)

Query: 278  YIALGTNYNYSEDVT-CRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
            Y  +GT Y + E+    +GRIL+F +  +  E        K++++  KE KG V  +   
Sbjct: 805  YFVVGTAYIHEEEAEPHQGRILVFAVTGIHGE-------RKLQLVTEKEVKGAVYCLNSF 857

Query: 337  AGFLVTAVGQKIYIWQLKDNDLTGIAFIDT-----EVYIASMVSVKNLILVGDYARSIAL 391
             G ++  V  K  +++  +N       +          +  M S  + I+VGD  +SI+L
Sbjct: 858  NGKVLAGVNSKAQLYKWSENTDNEKELVSECGHYGHTLVLYMESRGDFIVVGDLMKSISL 917

Query: 392  LRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI 451
            L Y+    T+  +A+D      ++ G                                  
Sbjct: 918  LSYKQLDGTIEEIAKDLNSNWMSAVG---------------------------------- 943

Query: 452  GSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF-- 509
                  I+D+ + +G   S+ D N+        A       RL    +FHLG+ VN F  
Sbjct: 944  ------IVDDDTYIG---SETDFNLFTVQRNSGAASDEERGRLETVGEFHLGEFVNRFRY 994

Query: 510  ----------------FKIRCKPSSISD-------APGARSRFLTWYASLDGALGFFLPL 546
                              +   P+++ D       AP  +++ +  + ++ G +G  LP+
Sbjct: 995  GSLVMQNSSSTSQTPSGVVSTGPTAMVDVGESAPAAPVVQNQSM-LFGTVSGMIGVILPI 1053

Query: 547  PEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLS 606
             +  Y  LL +Q  +       GG + + +RT++ +   + + +R  IDG LV  FL L 
Sbjct: 1054 SKDQYSFLLRVQQALTHVVKGVGGFSHKDWRTFENR--RSVSEARNFIDGDLVESFLDLP 1111

Query: 607  LGERLEICKKIGS 619
              +  ++  K+ S
Sbjct: 1112 KPQMTKVVDKLNS 1124


>gi|346321204|gb|EGX90804.1| DNA damage-binding protein 1 [Cordyceps militaris CM01]
          Length = 1160

 Score = 68.9 bits (167), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 72/348 (20%), Positives = 143/348 (41%), Gaps = 62/348 (17%)

Query: 283  TNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVT 342
            T+ +  E    RGRIL+  + E          + ++  I     KG    +  +  ++V 
Sbjct: 858  TDADVGEASETRGRILVLGVDE----------ERQLYTIVTHNLKGACRCLSVLDEYIVA 907

Query: 343  AVGQKIYIWQLKDNDLTGIAFIDTEVY------IASMVSVKNLILVGDYARSIALLRYQP 396
             + + + +++  +   T  +      Y      +A  VS  N+I VGD  +S++L+ + P
Sbjct: 908  GLSKTVVVYRYTEETSTEGSLQKLAAYRPASFPVALDVS-GNMIGVGDLMQSLSLVEFTP 966

Query: 397  EYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHN 456
                     +D +P +   K  +  +           W            +C   G +  
Sbjct: 967  --------PKDGEPAKLQEKARHFQS----------AWA---------TSVCHLDGER-- 997

Query: 457  DILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKP 516
                      ++ +D   N+++    PEA       RL   ++ +LG+ +N   K+   P
Sbjct: 998  ----------WLETDAQGNIMVLARNPEAPTEQDRGRLEITSEMNLGEQINKIRKLNVAP 1047

Query: 517  SSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAF 576
            +   +A  +   FL   AS++G L  +  +  K    L+ LQ+ +  +   TG ++  A+
Sbjct: 1048 AD--NAVVSPKAFL---ASIEGTLYLYGDIAPKYQDLLITLQSNIEQYVKTTGDISFNAW 1102

Query: 577  RTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 624
            R+++ +   A  P R  +DG +V +FL L    ++E+CK +G    D+
Sbjct: 1103 RSFRNQTREADGPFR-FVDGEMVERFLDLDELTQVELCKDLGPSVEDV 1149


>gi|255316764|gb|ACU01763.1| putative DNA damage binding protein [Brachypodium distachyon]
          Length = 384

 Score = 68.9 bits (167), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 81/354 (22%), Positives = 147/354 (41%), Gaps = 78/354 (22%)

Query: 278 YIALGTNYNY-SEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
           Y  +GT Y    E+   +GRIL+F +     E G      ++++I  KE KG V ++   
Sbjct: 79  YYCVGTAYVLPEENEPTKGRILVFAV-----EDG------RLQLIVEKETKGAVYSLNAF 127

Query: 337 AGFLVTAVGQKI--YIWQLKDNDLTGIAFIDTEV----YIASMVSVK--NLILVGDYARS 388
            G L+ A+ QKI  Y W  +++   G   + +E     +I ++ +    + I+VGD  +S
Sbjct: 128 NGKLLAAINQKIQLYKWMTRED---GSHELQSECGHHGHILALFTQTRGDFIVVGDLMKS 184

Query: 389 IALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEIC 448
           I+LL Y+ E   +  +ARDY                         W    ++  E ++  
Sbjct: 185 ISLLVYKHEESAIEELARDYNAN----------------------W----MTAVEMID-- 216

Query: 449 KKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNT 508
                  +DI        ++ ++   N+       +A       RL    ++HLG+ VN 
Sbjct: 217 -------DDI--------YVGAENSYNLFTVRKNSDAATDEERGRLEVVGEYHLGEFVNR 261

Query: 509 F----FKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 564
           F      +R   + +   P         + +++G +G    LP   Y  L  LQ+++   
Sbjct: 262 FRHGSLVMRLPDTEMGQIP------TVIFGTINGVIGIIASLPHDQYVFLEKLQSILGKF 315

Query: 565 TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 618
               G L+   +R++  +   A   +R  +DG L+  FL L+  +  E+ K +G
Sbjct: 316 IKGVGSLSHDQWRSFHNEKKTA--EARNFLDGDLIESFLDLNRSKMEEVSKGMG 367


>gi|413948669|gb|AFW81318.1| hypothetical protein ZEAMMB73_456332 [Zea mays]
          Length = 674

 Score = 68.2 bits (165), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 87/367 (23%), Positives = 152/367 (41%), Gaps = 81/367 (22%)

Query: 278 YIALGTNYNY-SEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
           Y  +GT Y    E+   +GRIL+F +     E G       +++I  KE KG V ++   
Sbjct: 369 YYCVGTAYVIPEENEPTKGRILVFAV-----EDG------SLQLIVEKETKGAVYSLNAF 417

Query: 337 AGFLVTAVGQKI--YIWQLKDNDLTGIAFIDTEV----YIASMVSVK--NLILVGDYARS 388
            G L+ A+ QKI  Y W  +++   G   + +E     +I ++ +    + I+VGD  +S
Sbjct: 418 NGKLLAAINQKIQLYKWMSRED---GSHELQSECGHHGHILALYTQTRGDFIVVGDLMKS 474

Query: 389 IALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEIC 448
           I+LL Y+ E   +   ARDY                         W    ++  E L+  
Sbjct: 475 ISLLVYKHEESAIEERARDYNAN----------------------W----MTAVEMLDDE 508

Query: 449 KKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNT 508
             +G+++          G+ +    KN        +A   +   +L    ++HLG+ VN 
Sbjct: 509 VYVGAEN----------GYNLFTVRKN-------SDAATDDERAKLEVVGEYHLGEFVNR 551

Query: 509 F----FKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 564
           F      +R   S I   P         + +++G +G    LP  +Y  L   Q+ +V +
Sbjct: 552 FRHGSLVMRLPDSEIGKIP------TVIFGTINGVIGIIASLPHDHYTFLEKFQSTLVKY 605

Query: 565 TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHND- 623
               G ++   +R++      A   +R  +DG L+  FL LS  +   + K +G    D 
Sbjct: 606 IKGVGNMSHEQWRSFHNDKKTA--EARNFLDGDLIESFLDLSRSKMEVVSKAMGVSVEDL 663

Query: 624 --ILDEL 628
             I++EL
Sbjct: 664 SKIVEEL 670


>gi|340381612|ref|XP_003389315.1| PREDICTED: DNA damage-binding protein 1-like [Amphimedon
            queenslandica]
          Length = 1142

 Score = 68.2 bits (165), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 101/483 (20%), Positives = 180/483 (37%), Gaps = 93/483 (19%)

Query: 171  VRKVPLKCTPHFLAYHLETKTYCIVTSTAEPS--TDYYKFNGEDKELVTDPRDSRFIPPL 228
            +  +PL  +P  +AY   ++T+ +     + S   + Y  + +     T       +PP 
Sbjct: 715  IETIPLGESPRCIAYQESSQTFLVGGYRTDKSGPDNTYTPSRQSVSTRTSNVSVAVVPPQ 774

Query: 229  --VSQFHVSLFSPFSWEEIPQTNFP------LHEWEHVLCLKNVSMEYEGTLSGLRGYIA 280
              + +F        S     QT F       L   EH+LC+ + ++    T    R    
Sbjct: 775  LNIEEFKCPQVEMHSLILFDQTTFDVSHVYQLCPQEHILCVTSCNLT---TNDEERSVYV 831

Query: 281  LGTNYNYSED-VTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGF 339
            +GT     E+  +  GRIL+F +              K+++++ K + G V  +    G 
Sbjct: 832  VGTALVKPEEKESSTGRILVFAV-----------NSGKLELLHEKLENGAVFQVLGFNGK 880

Query: 340  LVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYR 399
            ++ +V   +++  L D  L         +    + +  + ILVGD  RS+ LL Y+ E  
Sbjct: 881  ILNSVNSGVFVNALVDGALKEECAYKNNILALYLKTKGDFILVGDILRSLKLLVYKEE-- 938

Query: 400  TLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDIL 459
                                                     LG      ++IG  HN I 
Sbjct: 939  -----------------------------------------LG-----LEEIGVDHN-IS 951

Query: 460  DEFSSMGFMISDKD------KNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIR 513
              F +   MI D++      +++ +     EA        +++ +  + G +VN F    
Sbjct: 952  PCFCTAIEMIDDENYLGADGRHIFICQKNTEATSEADLLYMVQPSRMYFGDNVNVF---- 1007

Query: 514  CKPSSISDAPGARSRFL-----TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHT 568
             + S + D PGA +  L       + ++ GA+G    L    Y  L  LQ  M  +    
Sbjct: 1008 SRGSFVMDHPGAGASSLLQGKPILFGTVHGAIGLIGTLNMDTYTLLSKLQQKMAANIKSV 1067

Query: 569  GGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEL 628
            G +    +R++  +  +   P  G IDG LV KFL+L   +  +I +  G K  D+    
Sbjct: 1068 GNIEHEIYRSFSNE--HRSKPFAGFIDGDLVEKFLELPRPQMSQIVQ--GIKTTDVTGTE 1123

Query: 629  YDI 631
             D+
Sbjct: 1124 VDV 1126


>gi|18377609|gb|AAL66955.1| putative UV-damaged DNA binding factor [Arabidopsis thaliana]
          Length = 270

 Score = 68.2 bits (165), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 74/306 (24%), Positives = 124/306 (40%), Gaps = 66/306 (21%)

Query: 324 KEQKGPVTAICHVAGFLVTAVGQKI--YIWQLKDNDLTGIAFIDTEV-----YIASMVSV 376
           KE KG V ++    G L+ A+ QKI  Y W L+D+   G   + +E       +A  V  
Sbjct: 1   KETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDD---GTRELQSECGHHGHILALYVQT 57

Query: 377 K-NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWK 435
           + + I+VGD  +SI+LL Y+ E   +   ARDY     ++                    
Sbjct: 58  RGDFIVVGDLMKSISLLLYKHEEGAIEERARDYNANWMSAV------------------- 98

Query: 436 FLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLI 495
                                +ILD+   +G   ++ + N++      E        RL 
Sbjct: 99  ---------------------EILDDDIYLG---AENNFNLLTVKKNSEGATDEERGRLE 134

Query: 496 KKTDFHLGQHVNTF----FKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNY 551
              ++HLG+ VN F      +R   S I   P         + +++G +G    LP++ Y
Sbjct: 135 VVGEYHLGEFVNRFRHGSLVMRLPDSEIGQIP------TVIFGTVNGVIGVIASLPQEQY 188

Query: 552 RRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERL 611
             L  LQ+ +       GGL+   +R++  +   A   +R  +DG L+  FL LS  +  
Sbjct: 189 TFLEKLQSSLRKVIKGVGGLSHEQWRSFNNEKRTA--EARNFLDGDLIESFLDLSRNKME 246

Query: 612 EICKKI 617
           +I K +
Sbjct: 247 DISKSM 252


>gi|357132340|ref|XP_003567788.1| PREDICTED: DNA damage-binding protein 1a-like [Brachypodium
            distachyon]
          Length = 1090

 Score = 68.2 bits (165), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 81/354 (22%), Positives = 147/354 (41%), Gaps = 78/354 (22%)

Query: 278  YIALGTNYNY-SEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
            Y  +GT Y    E+   +GRIL+F +     E G      ++++I  KE KG V ++   
Sbjct: 785  YYCVGTAYVLPEENEPTKGRILVFAV-----EDG------RLQLIVEKETKGAVYSLNAF 833

Query: 337  AGFLVTAVGQKI--YIWQLKDNDLTGIAFIDTEV----YIASMVSVK--NLILVGDYARS 388
             G L+ A+ QKI  Y W  +++   G   + +E     +I ++ +    + I+VGD  +S
Sbjct: 834  NGKLLAAINQKIQLYKWMTRED---GSHELQSECGHHGHILALFTQTRGDFIVVGDLMKS 890

Query: 389  IALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEIC 448
            I+LL Y+ E   +  +ARDY                         W    ++  E ++  
Sbjct: 891  ISLLVYKHEESAIEELARDYNAN----------------------W----MTAVEMID-- 922

Query: 449  KKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNT 508
                   +DI        ++ ++   N+       +A       RL    ++HLG+ VN 
Sbjct: 923  -------DDI--------YVGAENSYNLFTVRKNSDAATDEERGRLEVVGEYHLGEFVNR 967

Query: 509  F----FKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTH 564
            F      +R   + +   P         + +++G +G    LP   Y  L  LQ+++   
Sbjct: 968  FRHGSLVMRLPDTEMGQIP------TVIFGTINGVIGIIASLPHDQYVFLEKLQSILGKF 1021

Query: 565  TSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 618
                G L+   +R++  +   A   +R  +DG L+  FL L+  +  E+ K +G
Sbjct: 1022 IKGVGSLSHDQWRSFHNEKKTA--EARNFLDGDLIESFLDLNRSKMEEVSKGMG 1073


>gi|325186344|emb|CCA20849.1| predicted protein putative [Albugo laibachii Nc14]
          Length = 1148

 Score = 67.8 bits (164), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 90/431 (20%), Positives = 180/431 (41%), Gaps = 83/431 (19%)

Query: 229  VSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYS 288
            V Q ++ LF   ++E +   +F L  +E    +  ++  + G  SG   Y+ +GT + + 
Sbjct: 769  VEQGYIRLFDDQTFECLK--SFRLDPFESPCSI--ITCIFTGDSSGGTYYV-VGTAFVHE 823

Query: 289  EDVT-CRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQK 347
            E+    +GRIL+F +  +  +        +++++  KE KG V  +    G L+  V  K
Sbjct: 824  EEAEPHQGRILVFTVSGIHGD-------RRLQLVTEKEVKGSVYCLNAFNGKLLAGVNSK 876

Query: 348  IYIWQLKDNDLTGIAFIDT-----EVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLS 402
            +Y+++  +++  G   +          +  M S  + I+VGD  +SI+LL ++    ++ 
Sbjct: 877  VYLFKWSESEENGEELVSECGHHGHTLVLYMESRGDFIVVGDLMKSISLLNHKQLDGSIE 936

Query: 403  LVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEF 462
             +ARD       + G                                        I+D+ 
Sbjct: 937  EIARDLNSNWMTAVG----------------------------------------IIDDD 956

Query: 463  SSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF----FKIRCKPSS 518
            + +G   S+ D N+        A       RL    ++HLG+ VN F      ++   S 
Sbjct: 957  NYVG---SETDFNLFTVQRNSGAASDEERGRLETIGEYHLGEFVNRFRYGSLVMQHNLSI 1013

Query: 519  ISDAPGA-----RSRFLT--------WYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHT 565
             ++APG      R   L+         + ++ G +G  LP+ ++ +  L+ +Q+ +    
Sbjct: 1014 GAEAPGISLSDDRPESLSPLSVQRSMLFGTVSGMIGVILPISKEKHEFLMRVQSALNQVI 1073

Query: 566  SHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDIL 625
               GG +   +RT++ +   +   +   IDG L+  FL LS  E  ++  ++   + D L
Sbjct: 1074 QGVGGFSHSEWRTFENR--RSSIEAHNFIDGDLIESFLDLSKDEMKQVVDEL---NRDQL 1128

Query: 626  DELYDIEALSS 636
            +    +EAL++
Sbjct: 1129 EGKTTLEALAA 1139


>gi|320163506|gb|EFW40405.1| UV-damaged DNA binding protein [Capsaspora owczarzaki ATCC 30864]
          Length = 1123

 Score = 67.0 bits (162), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 102/426 (23%), Positives = 167/426 (39%), Gaps = 88/426 (20%)

Query: 171  VRKVPLKCTPHFLAYHLETKTYCIVTST-AEP------STDYYKFNGEDKELVTD--PRD 221
            VR +PL   P  +AYH  T+TY + T T AEP      S +        + +  D  PR 
Sbjct: 765  VRAIPLGEMPRRIAYHEPTRTYGVATVTLAEPLPVGSNSGNVAARAQNVRPMAFDDGPRS 824

Query: 222  SRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIAL 281
               +  L     V LF   ++E   + +F L   E ++   + S   + + S +  Y+ +
Sbjct: 825  PSDV--LEDTSFVRLFDGQTFE--IRDSFQLPSTETIMSFISCSFANDSSDSTV--YLVV 878

Query: 282  GTNYNY-SEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFL 340
            GT +   SED   RGRIL+FD+               + ++ AK+ KG V ++    G L
Sbjct: 879  GTAFVIPSEDEPKRGRILVFDV-----------AGGALHLVTAKDVKGCVYSLNAFNGKL 927

Query: 341  VTAVGQKI--YIWQLKDNDLTGIAFIDTE------VYIASMVSVKNLILVGDYARSIALL 392
            +  +  K+  + W L  +   GI  + +E      +    + S  + I+VGD  RSI+LL
Sbjct: 928  LAGINSKVNLFKWNLTGD---GIRELVSECSHHGHILTLYLKSRGDFIIVGDLMRSISLL 984

Query: 393  RYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 452
             Y+    ++  +A+D   T PN                   W                  
Sbjct: 985  MYKSGTSSIEEIAQD---TCPN-------------------W------------------ 1004

Query: 453  SKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 512
                D+LD+   +G    +   N+       EA       RL    +FH+G+ +N F   
Sbjct: 1005 VTAVDMLDDDVFIG---GESSFNIFTCRRNLEASTDEERKRLEVVGEFHVGEFINQFR-- 1059

Query: 513  RCKPSSISDAPGARSRFL---TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
                S +   P  + + +   T + + +G +G    L    Y  L ++Q  M       G
Sbjct: 1060 --AGSLVMKLPDEQEQPIQPSTLFGTGNGVIGVIARLTRSQYEFLQLVQAAMAKVIKGVG 1117

Query: 570  GLNPRA 575
            GLN  A
Sbjct: 1118 GLNHSA 1123


>gi|425777692|gb|EKV15851.1| UV-damaged DNA binding protein, putative [Penicillium digitatum Pd1]
 gi|425779888|gb|EKV17916.1| UV-damaged DNA binding protein, putative [Penicillium digitatum
            PHI26]
          Length = 1140

 Score = 65.9 bits (159), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 78/365 (21%), Positives = 146/365 (40%), Gaps = 64/365 (17%)

Query: 275  LRGYIALGTNY-NYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAI 333
            ++    +GT + +  ++ + RGRIL+ ++     + G+ L++     +    +   +   
Sbjct: 831  VKDRFVVGTAFADEDQEESIRGRILILEV-----DHGRKLSQVAELPVMGACRALAMMGD 885

Query: 334  CHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLR 393
            C VA  +V     ++ I  +    L  +A   T      +  V NLI V D  +S+ L+R
Sbjct: 886  CIVAALVVV---YRVKINNVGPMKLEKLAAYRTSTAPVDVTVVDNLIAVADLMKSLCLIR 942

Query: 394  YQP----EYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICK 449
            Y P    E   L+ V R Y+                       VW      +G+      
Sbjct: 943  YTPGHTGEPAKLTEVGRHYQT----------------------VWSTAIACVGDET---- 976

Query: 450  KIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 509
                             F+ SD + N+++         +   HRLI  ++  LG+ VN  
Sbjct: 977  -----------------FLQSDAEGNLIVLSRNTNGVTAQDKHRLIPTSEISLGEMVN-- 1017

Query: 510  FKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
               R +P  I            + A+++G++  F  +  ++   L+ LQ  +    +  G
Sbjct: 1018 ---RIRPVHIPQLCSVMVTPRAFMATVEGSIFLFAVINPEHQDFLMTLQAALSQKLNSLG 1074

Query: 570  GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELY 629
             L+   FR ++     A  P R  +DG L+ +FL+ +   + EI +++GS  +D+ +   
Sbjct: 1075 NLSFDKFRGFRTMVRSAAAPYR-FVDGELIEQFLKCTPSMQEEIAQEVGS--SDVGEVKR 1131

Query: 630  DIEAL 634
             IEAL
Sbjct: 1132 LIEAL 1136


>gi|402592185|gb|EJW86114.1| CPSF A subunit region family protein [Wuchereria bancrofti]
          Length = 278

 Score = 65.5 bits (158), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 67/290 (23%), Positives = 114/290 (39%), Gaps = 48/290 (16%)

Query: 317 KIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSV 376
           +++++Y KE KG   +I  + G LV AV   + +++   +    +   D +   A  +  
Sbjct: 11  RMRLVYEKEIKGAAYSIQSMDGKLVVAVNSCVRLFEWTADKELRLECSDFDNVTALYLKT 70

Query: 377 KN-LILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWK 435
           KN LILVGD  RS++LL Y+    T   VARD+                         W 
Sbjct: 71  KNDLILVGDLMRSLSLLSYKSMESTFEKVARDFMTN----------------------W- 107

Query: 436 FLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLI 495
                    +  C+ I S +           F+ ++   N+   M          G RL 
Sbjct: 108 ---------MSACEIIDSDN-----------FLGAENSYNLFTVMKDSFTVFKEEGTRLQ 147

Query: 496 KKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLL 555
           +   F+LG+ VN F       + +  AP   S  L  Y + DG +G  + +P   Y  L 
Sbjct: 148 ELGLFYLGEMVNVFCHGSLTATQVDVAPLYHSSIL--YGTSDGGIGVIVQMPPVLYTFLQ 205

Query: 556 MLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQL 605
            +Q  +  +  +   ++   +RT++ +         G IDG L+   L +
Sbjct: 206 DVQKRLAEYAENCMRISHTQYRTFETEK--RSEAPNGFIDGDLIESLLDM 253


>gi|402223178|gb|EJU03243.1| hypothetical protein DACRYDRAFT_115454 [Dacryopinax sp. DJM-731
           SS1]
          Length = 1175

 Score = 64.7 bits (156), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 77/336 (22%), Positives = 143/336 (42%), Gaps = 48/336 (14%)

Query: 65  LFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPM 124
           + + + S RA ++     G +   +   +++     +F CG  PA LFL +   L A P+
Sbjct: 682 IVLGEPSVRATDKKIFSLGTKPIMLNACTDLGRESNIFACGDRPALLFLKN-DRLTASPI 740

Query: 125 TIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKC-TPHFL 183
            +   +   +  H    P  F++ +A S L I  +      D    VR + L   TP  L
Sbjct: 741 KLRD-IHAGSVLHIPQFPSSFIFASA-STLLIGQIRESQKID----VRTISLGLDTPIRL 794

Query: 184 AYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWE 243
            YH   + Y +V    E + +      +D+E+ +            S F   LF   ++E
Sbjct: 795 TYHRGLRAYGVVCQRKELNRE------DDREIYS------------SSF--KLFDDITFE 834

Query: 244 EIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGT-NYNYSEDVTCRGRILLFDI 302
            +   NF     E ++C+  +    + T      ++ +GT     +E+   +GRIL+F  
Sbjct: 835 YL--NNFTARPDEQMMCVTTIP---DSTGEEDSDFLVVGTYEATGAEEDVSKGRILIF-- 887

Query: 303 IEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQL----KDNDL 358
            E VP         K+K++ + +  G V A+ +V   L  A+   + ++ L     D  +
Sbjct: 888 -EEVP-------NRKLKLVVSHDVGGCVYAVTNVGANLAAAINGTLQVFSLHRSHDDIRI 939

Query: 359 TGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
             +A   +    +S++   N +LVGD  R++ +LR+
Sbjct: 940 ESVAKWSSAYVASSLICRGNTLLVGDAMRAVCILRW 975


>gi|345328202|ref|XP_003431248.1| PREDICTED: DNA damage-binding protein 1-like [Ornithorhynchus
            anatinus]
          Length = 1045

 Score = 64.7 bits (156), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 81/368 (22%), Positives = 142/368 (38%), Gaps = 77/368 (20%)

Query: 278  YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
            Y  +GT   Y E+   + GRI++F             +  K++ +  KE KG V ++   
Sbjct: 733  YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 781

Query: 337  AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
             G L+ ++    ++Y W  +    T     +  +    + +  + ILVGD  RS+ LL Y
Sbjct: 782  NGKLLASINSTVRLYEWTAEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 840

Query: 395  QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
            +P       +ARD+ P   ++                                       
Sbjct: 841  KPMEGNFEEIARDFNPNWMSAV-------------------------------------- 862

Query: 455  HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
              +ILD+ + +G      +    LF+ Q ++  +    R  L +   FHLG+ VN F   
Sbjct: 863  --EILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 912

Query: 513  RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
             C  S +    G  S   +    + +++G +G    L E  Y  LL +QN +       G
Sbjct: 913  -CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 971

Query: 570  GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
             +    +R++  +      P+ G IDG L+  FL +S  +  E+   +      G K   
Sbjct: 972  KIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQIDDGSGMKREA 1029

Query: 624  ILDELYDI 631
             +D+L  I
Sbjct: 1030 TVDDLIKI 1037


>gi|90108797|pdb|2B5L|A Chain A, Crystal Structure Of Ddb1 In Complex With Simian Virus 5 V
            Protein
 gi|90108798|pdb|2B5L|B Chain B, Crystal Structure Of Ddb1 In Complex With Simian Virus 5 V
            Protein
 gi|90108801|pdb|2B5M|A Chain A, Crystal Structure Of Ddb1
 gi|116667897|pdb|2HYE|A Chain A, Crystal Structure Of The Ddb1-cul4a-rbx1-sv5v Complex
 gi|1136228|gb|AAA88883.1| UV-damaged DNA binding factor [Homo sapiens]
 gi|1588524|prf||2208446A xeroderma pigmentosum group E-binding factor
          Length = 1140

 Score = 64.3 bits (155), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 81/368 (22%), Positives = 141/368 (38%), Gaps = 77/368 (20%)

Query: 278  YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
            Y  +GT   Y E+   + GRI++F             +  K++ +  KE KG V ++   
Sbjct: 828  YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 876

Query: 337  AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
             G L+ ++    ++Y W  + +  T     +  +    + +  + ILVGD  RS+ LL Y
Sbjct: 877  NGKLLASINSTVRLYEWTTEKDVRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 935

Query: 395  QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
            +P       +ARD+ P                       W                    
Sbjct: 936  KPMEGNFEEIARDFNPN----------------------WM------------------S 955

Query: 455  HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
              +ILD+ + +G      +    LF+ Q ++  +    R  L +   FHLG+ VN F   
Sbjct: 956  AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 1007

Query: 513  RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
             C  S +    G  S   +    + +++G +G    L E  Y  LL +QN +       G
Sbjct: 1008 -CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066

Query: 570  GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
             +    +R++  +      P+ G IDG L+  FL +S  +  E+   +      G K   
Sbjct: 1067 KIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREA 1124

Query: 624  ILDELYDI 631
              D+L  +
Sbjct: 1125 TADDLIKV 1132


>gi|270346571|pdb|3I7H|A Chain A, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
            Hbx
 gi|270346573|pdb|3I7K|A Chain A, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
            Whx
 gi|270346575|pdb|3I7L|A Chain A, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
            Ddb2
 gi|270346577|pdb|3I7N|A Chain A, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
            Wdtc1
 gi|270346579|pdb|3I7O|A Chain A, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
            Iqwd1
 gi|270346581|pdb|3I7P|A Chain A, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
            Wdr40a
 gi|270346583|pdb|3I89|A Chain A, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
            Wdr22
 gi|270346585|pdb|3I8C|A Chain A, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
            Wdr21a
 gi|270346587|pdb|3I8E|A Chain A, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
            Wdr42a
 gi|270346588|pdb|3I8E|B Chain B, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
            Wdr42a
          Length = 1143

 Score = 64.3 bits (155), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 81/368 (22%), Positives = 141/368 (38%), Gaps = 77/368 (20%)

Query: 278  YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
            Y  +GT   Y E+   + GRI++F             +  K++ +  KE KG V ++   
Sbjct: 831  YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 879

Query: 337  AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
             G L+ ++    ++Y W  + +  T     +  +    + +  + ILVGD  RS+ LL Y
Sbjct: 880  NGKLLASINSTVRLYEWTTEKDVRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 938

Query: 395  QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
            +P       +ARD+ P                       W                    
Sbjct: 939  KPMEGNFEEIARDFNPN----------------------WM------------------S 958

Query: 455  HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
              +ILD+ + +G      +    LF+ Q ++  +    R  L +   FHLG+ VN F   
Sbjct: 959  AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 1010

Query: 513  RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
             C  S +    G  S   +    + +++G +G    L E  Y  LL +QN +       G
Sbjct: 1011 -CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1069

Query: 570  GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
             +    +R++  +      P+ G IDG L+  FL +S  +  E+   +      G K   
Sbjct: 1070 KIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREA 1127

Query: 624  ILDELYDI 631
              D+L  +
Sbjct: 1128 TADDLIKV 1135


>gi|221046721|pdb|3EI4|A Chain A, Structure Of The Hsddb1-Hsddb2 Complex
 gi|221046723|pdb|3EI4|C Chain C, Structure Of The Hsddb1-Hsddb2 Complex
 gi|221046725|pdb|3EI4|E Chain E, Structure Of The Hsddb1-Hsddb2 Complex
          Length = 1158

 Score = 63.9 bits (154), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 81/368 (22%), Positives = 141/368 (38%), Gaps = 77/368 (20%)

Query: 278  YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
            Y  +GT   Y E+   + GRI++F             +  K++ +  KE KG V ++   
Sbjct: 846  YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 894

Query: 337  AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
             G L+ ++    ++Y W  + +  T     +  +    + +  + ILVGD  RS+ LL Y
Sbjct: 895  NGKLLASINSTVRLYEWTTEKDVRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 953

Query: 395  QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
            +P       +ARD+ P                       W                    
Sbjct: 954  KPMEGNFEEIARDFNPN----------------------WM------------------S 973

Query: 455  HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
              +ILD+ + +G      +    LF+ Q ++  +    R  L +   FHLG+ VN F   
Sbjct: 974  AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 1025

Query: 513  RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
             C  S +    G  S   +    + +++G +G    L E  Y  LL +QN +       G
Sbjct: 1026 -CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1084

Query: 570  GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
             +    +R++  +      P+ G IDG L+  FL +S  +  E+   +      G K   
Sbjct: 1085 KIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREA 1142

Query: 624  ILDELYDI 631
              D+L  +
Sbjct: 1143 TADDLIKV 1150


>gi|2632123|emb|CAA05770.1| Xeroderma Pigmentosum Group E Complementing protein [Homo sapiens]
          Length = 1140

 Score = 63.9 bits (154), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 81/368 (22%), Positives = 141/368 (38%), Gaps = 77/368 (20%)

Query: 278  YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
            Y  +GT   Y E+   + GRI++F             +  K++ +  KE KG V ++   
Sbjct: 828  YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGDVYSMVEF 876

Query: 337  AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
             G L+ ++    ++Y W  + +  T     +  +    + +  + ILVGD  RS+ LL Y
Sbjct: 877  NGKLLASINSTVRLYEWTTEKDVRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 935

Query: 395  QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
            +P       +ARD+ P                       W                    
Sbjct: 936  KPMEGNFEEIARDFNPN----------------------WM------------------S 955

Query: 455  HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
              +ILD+ + +G      +    LF+ Q ++  +    R  L +   FHLG+ VN F   
Sbjct: 956  AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 1007

Query: 513  RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
             C  S +    G  S   +    + +++G +G    L E  Y  LL +QN +       G
Sbjct: 1008 -CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066

Query: 570  GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
             +    +R++  +      P+ G IDG L+  FL +S  +  E+   +      G K   
Sbjct: 1067 KIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREA 1124

Query: 624  ILDELYDI 631
              D+L  +
Sbjct: 1125 TADDLIKV 1132


>gi|395544366|ref|XP_003774082.1| PREDICTED: DNA damage-binding protein 1 [Sarcophilus harrisii]
          Length = 1239

 Score = 63.9 bits (154), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 81/368 (22%), Positives = 140/368 (38%), Gaps = 77/368 (20%)

Query: 278  YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
            Y  +GT   Y E+   + GRI++F             +  K++ +  KE KG V ++   
Sbjct: 927  YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 975

Query: 337  AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
             G L+ ++    ++Y W  +    T     +  +    + +  + ILVGD  RS+ LL Y
Sbjct: 976  NGKLLASINSTVRLYEWTAEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 1034

Query: 395  QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
            +P       +ARD+ P                       W                    
Sbjct: 1035 KPMEGNFEEIARDFNPN----------------------WM------------------S 1054

Query: 455  HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
              +ILD+ + +G      +    LF+ Q ++  +    R  L +   FHLG+ VN F   
Sbjct: 1055 AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 1106

Query: 513  RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
             C  S +    G  S   +    + +++G +G    L E  Y  LL +QN +       G
Sbjct: 1107 -CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1165

Query: 570  GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
             +    +R++  +      P+ G IDG L+  FL +S  +  E+   +      G K   
Sbjct: 1166 KIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREA 1223

Query: 624  ILDELYDI 631
              D+L  +
Sbjct: 1224 TADDLIKV 1231


>gi|429965418|gb|ELA47415.1| hypothetical protein VCUG_01066 [Vavraia culicis 'floridensis']
          Length = 1176

 Score = 63.5 bits (153), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 51/208 (24%), Positives = 90/208 (43%), Gaps = 41/208 (19%)

Query: 294  RGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQL 353
            RGRIL+F++I V+ +     TK  +K++ ++  KGP++    V G +  ++  K+ +++ 
Sbjct: 898  RGRILVFEVINVIGDMVAKKTKKALKLLGSERTKGPISCCAAVRGKIAVSLATKLMVYEC 957

Query: 354  KDND-LTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQ 412
              N  +  IAF D  +Y  S+  +KN I+VGD    +  + +Q E   L L+++  +   
Sbjct: 958  DRNSGIVAIAFYDLYMYAVSLAVIKNYIIVGDIMMGLHFVYFQSEPVKLHLLSKSDRIAN 1017

Query: 413  PNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDK 472
              S  ++                      GE L I                       DK
Sbjct: 1018 LGSLDFFNA--------------------GESLFITG--------------------IDK 1037

Query: 473  DKNVVLFMYQPEARESNGGHRLIKKTDF 500
               V +F + P    SNGG +L+K+ +F
Sbjct: 1038 TGKVQIFSFSPSNLYSNGGEKLVKRQEF 1065


>gi|400600376|gb|EJP68050.1| CPSF A subunit region [Beauveria bassiana ARSEF 2860]
          Length = 1174

 Score = 63.5 bits (153), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 101/546 (18%), Positives = 202/546 (36%), Gaps = 111/546 (20%)

Query: 97   GYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRI 156
            G   +F    H + ++ +S G L     T D   + + PF +   P   L    K+    
Sbjct: 711  GTSSIFATTEHSSLIY-SSEGRLVYSATTADN-ATCVVPFDSYGFPHCILVSTDKNVRIC 768

Query: 157  SVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELV 216
             V    L++     V+ +P+  T   +AY               P    +      K+L+
Sbjct: 769  RVDKERLTH-----VKSLPVHETVRRVAY--------------APGAKAFALGCIKKDLI 809

Query: 217  TDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNV-SMEYEGTLSGL 275
             +          V    V L     ++E+  T  PL     +  +++V   E       L
Sbjct: 810  QNAE--------VITSSVKLVDEIMFQEL-GTPLPLAASSTLEMVESVIRAELPDPTGAL 860

Query: 276  RGYIALGTNYNYSEDV----TCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVT 331
                 +GT++    +V      RGRIL+  + E          K ++  I +   KG   
Sbjct: 861  VERFVVGTSFVNDAEVGEAGETRGRILVLGVDE----------KRQLYTIVSHNLKGACR 910

Query: 332  AICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVK------------NL 379
             +  +  ++V  + + + ++   + +        TE Y+  + + +            N+
Sbjct: 911  CLGILDEYIVACLAKTVVVYSYTEEN-------STEGYLQKLAAYRPASFPVALDISGNM 963

Query: 380  ILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQL 439
            I V D  +S++L+ + P         +D +P                   G L  K    
Sbjct: 964  IGVADIMQSLSLVEFTP--------PKDGEP-------------------GKLEEKARHF 996

Query: 440  SLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTD 499
                   +C   G +            ++ +D   N+++    P+A   +   RL   ++
Sbjct: 997  QSAWATSVCHLGGER------------WLETDAQGNIIVLARNPDAPTEHDRSRLEITSE 1044

Query: 500  FHLGQHVNTFFKIRCKPS-SISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQ 558
             +LG+ +N   ++   P+ ++  +P A      + AS++G L  +  +  K    L+ LQ
Sbjct: 1045 MNLGEQINKIQRLNVAPADNVVVSPKA------FLASIEGTLYLYGDIAPKYQDLLITLQ 1098

Query: 559  NVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 618
              +  +   TGG++  A+R ++ +   A  P R  +DG +V +FL L    +  +C+ +G
Sbjct: 1099 TTIEKYVKTTGGISFDAWRAFRNQAREADGPFR-FVDGEMVERFLDLRKQTQAALCQDLG 1157

Query: 619  SKHNDI 624
                D+
Sbjct: 1158 LNVEDV 1163


>gi|194377326|dbj|BAG57611.1| unnamed protein product [Homo sapiens]
          Length = 451

 Score = 63.5 bits (153), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 82/366 (22%), Positives = 139/366 (37%), Gaps = 73/366 (19%)

Query: 278 YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
           Y  +GT   Y E+   + GRI++F             +  K++ +  KE KG V ++   
Sbjct: 139 YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 187

Query: 337 AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
            G L+ ++    ++Y W  +    T     +  +    + +  + ILVGD  RS+ LL Y
Sbjct: 188 NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 246

Query: 395 QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
           +P       +ARD+ P   +         +  I+D      FL       L +C+K  + 
Sbjct: 247 KPMEGNFEEIARDFNPNWMS---------AVEILDDD---NFLGAENAFNLFVCQKDSAA 294

Query: 455 HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC 514
             D                          E R+      L +   FHLG+ VN F    C
Sbjct: 295 TTD--------------------------EERQ-----HLQEVGLFHLGEFVNVF----C 319

Query: 515 KPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGL 571
             S +    G  S   +    + +++G +G    L E  Y  LL +QN +       G +
Sbjct: 320 HGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKI 379

Query: 572 NPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHNDIL 625
               +R++  +      P+ G IDG L+  FL +S  +  E+   +      G K     
Sbjct: 380 EHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATA 437

Query: 626 DELYDI 631
           D+L  +
Sbjct: 438 DDLIKV 443


>gi|74138855|dbj|BAE27231.1| unnamed protein product [Mus musculus]
          Length = 1140

 Score = 63.5 bits (153), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 81/368 (22%), Positives = 140/368 (38%), Gaps = 77/368 (20%)

Query: 278  YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
            Y  +GT   Y E+   + GRI++F             +  K++ +  KE KG V ++   
Sbjct: 828  YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 876

Query: 337  AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
             G L+ ++    ++Y W  +    T     +  +    + +  + ILVGD  RS+ LL Y
Sbjct: 877  NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 935

Query: 395  QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
            +P       +ARD+ P                       W                    
Sbjct: 936  KPMEGNFEEIARDFNPN----------------------WM------------------S 955

Query: 455  HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
              +ILD+ + +G      +    LF+ Q ++  +    R  L +   FHLG+ VN F   
Sbjct: 956  AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 1007

Query: 513  RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
             C  S +    G  S   +    + +++G +G    L E  Y  LL +QN +       G
Sbjct: 1008 -CHGSLVMQNLGEASTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066

Query: 570  GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
             +    +R++  +      P+ G IDG L+  FL +S  +  E+   +      G K   
Sbjct: 1067 KIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREA 1124

Query: 624  ILDELYDI 631
              D+L  +
Sbjct: 1125 TADDLIKV 1132


>gi|410974071|ref|XP_003993471.1| PREDICTED: DNA damage-binding protein 1 [Felis catus]
          Length = 1193

 Score = 63.5 bits (153), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 81/368 (22%), Positives = 140/368 (38%), Gaps = 77/368 (20%)

Query: 278  YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
            Y  +GT   Y E+   + GRI++F             +  K++ +  KE KG V ++   
Sbjct: 881  YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 929

Query: 337  AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
             G L+ ++    ++Y W  +    T     +  +    + +  + ILVGD  RS+ LL Y
Sbjct: 930  NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 988

Query: 395  QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
            +P       +ARD+ P                       W                    
Sbjct: 989  KPMEGNFEEIARDFNPN----------------------WM------------------S 1008

Query: 455  HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
              +ILD+ + +G      +    LF+ Q ++  +    R  L +   FHLG+ VN F   
Sbjct: 1009 AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 1060

Query: 513  RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
             C  S +    G  S   +    + +++G +G    L E  Y  LL +QN +       G
Sbjct: 1061 -CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1119

Query: 570  GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
             +    +R++  +      P+ G IDG L+  FL +S  +  E+   +      G K   
Sbjct: 1120 KIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREA 1177

Query: 624  ILDELYDI 631
              D+L  +
Sbjct: 1178 TADDLIKV 1185


>gi|194381178|dbj|BAG64157.1| unnamed protein product [Homo sapiens]
          Length = 826

 Score = 63.5 bits (153), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 81/368 (22%), Positives = 140/368 (38%), Gaps = 77/368 (20%)

Query: 278 YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
           Y  +GT   Y E+   + GRI++F             +  K++ +  KE KG V ++   
Sbjct: 514 YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 562

Query: 337 AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
            G L+ ++    ++Y W  +    T     +  +    + +  + ILVGD  RS+ LL Y
Sbjct: 563 NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 621

Query: 395 QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
           +P       +ARD+ P                       W                    
Sbjct: 622 KPMEGNFEEIARDFNPN----------------------WM------------------S 641

Query: 455 HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
             +ILD+ + +G      +    LF+ Q ++  +    R  L +   FHLG+ VN F   
Sbjct: 642 AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 693

Query: 513 RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
            C  S +    G  S   +    + +++G +G    L E  Y  LL +QN +       G
Sbjct: 694 -CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 752

Query: 570 GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
            +    +R++  +      P+ G IDG L+  FL +S  +  E+   +      G K   
Sbjct: 753 KIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREA 810

Query: 624 ILDELYDI 631
             D+L  +
Sbjct: 811 TADDLIKV 818


>gi|148529014|ref|NP_001914.3| DNA damage-binding protein 1 [Homo sapiens]
 gi|296218432|ref|XP_002807395.1| PREDICTED: LOW QUALITY PROTEIN: DNA damage-binding protein 1
            [Callithrix jacchus]
 gi|397516558|ref|XP_003828491.1| PREDICTED: DNA damage-binding protein 1 [Pan paniscus]
 gi|402893195|ref|XP_003909786.1| PREDICTED: DNA damage-binding protein 1 [Papio anubis]
 gi|426368721|ref|XP_004051351.1| PREDICTED: DNA damage-binding protein 1 [Gorilla gorilla gorilla]
 gi|12643730|sp|Q16531.1|DDB1_HUMAN RecName: Full=DNA damage-binding protein 1; AltName: Full=DDB p127
            subunit; AltName: Full=DNA damage-binding protein a;
            Short=DDBa; AltName: Full=Damage-specific DNA-binding
            protein 1; AltName: Full=HBV X-associated protein 1;
            Short=XAP-1; AltName: Full=UV-damaged DNA-binding factor;
            AltName: Full=UV-damaged DNA-binding protein 1;
            Short=UV-DDB 1; AltName: Full=XPE-binding factor;
            Short=XPE-BF; AltName: Full=Xeroderma pigmentosum group
            E-complementing protein; Short=XPCe
 gi|203282525|pdb|3E0C|A Chain A, Crystal Structure Of Dna Damage-Binding Protein 1(Ddb1)
 gi|695362|gb|AAA62838.1| X-associated protein 1, partial [Homo sapiens]
 gi|1052865|gb|AAC50349.1| DDBa p127 [Homo sapiens]
 gi|15079750|gb|AAH11686.1| Damage-specific DNA binding protein 1, 127kDa [Homo sapiens]
 gi|29792243|gb|AAH50530.1| Damage-specific DNA binding protein 1, 127kDa [Homo sapiens]
 gi|30354567|gb|AAH51764.1| Damage-specific DNA binding protein 1, 127kDa [Homo sapiens]
 gi|61354161|gb|AAX44048.1| damage-specific DNA binding protein 1, 127kDa [Homo sapiens]
 gi|119594341|gb|EAW73935.1| damage-specific DNA binding protein 1, 127kDa, isoform CRA_c [Homo
            sapiens]
 gi|168275638|dbj|BAG10539.1| DNA damage-binding protein 1 [synthetic construct]
 gi|189065506|dbj|BAG35345.1| unnamed protein product [Homo sapiens]
 gi|355566436|gb|EHH22815.1| Damage-specific DNA-binding protein 1 [Macaca mulatta]
 gi|380784123|gb|AFE63937.1| DNA damage-binding protein 1 [Macaca mulatta]
 gi|380808126|gb|AFE75938.1| DNA damage-binding protein 1 [Macaca mulatta]
 gi|380810144|gb|AFE76947.1| DNA damage-binding protein 1 [Macaca mulatta]
 gi|383408123|gb|AFH27275.1| DNA damage-binding protein 1 [Macaca mulatta]
 gi|410305600|gb|JAA31400.1| damage-specific DNA binding protein 1, 127kDa [Pan troglodytes]
 gi|410352015|gb|JAA42611.1| damage-specific DNA binding protein 1, 127kDa [Pan troglodytes]
          Length = 1140

 Score = 63.2 bits (152), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 81/368 (22%), Positives = 140/368 (38%), Gaps = 77/368 (20%)

Query: 278  YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
            Y  +GT   Y E+   + GRI++F             +  K++ +  KE KG V ++   
Sbjct: 828  YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 876

Query: 337  AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
             G L+ ++    ++Y W  +    T     +  +    + +  + ILVGD  RS+ LL Y
Sbjct: 877  NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 935

Query: 395  QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
            +P       +ARD+ P                       W                    
Sbjct: 936  KPMEGNFEEIARDFNPN----------------------WM------------------S 955

Query: 455  HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
              +ILD+ + +G      +    LF+ Q ++  +    R  L +   FHLG+ VN F   
Sbjct: 956  AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 1007

Query: 513  RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
             C  S +    G  S   +    + +++G +G    L E  Y  LL +QN +       G
Sbjct: 1008 -CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066

Query: 570  GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
             +    +R++  +      P+ G IDG L+  FL +S  +  E+   +      G K   
Sbjct: 1067 KIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREA 1124

Query: 624  ILDELYDI 631
              D+L  +
Sbjct: 1125 TADDLIKV 1132


>gi|400260815|pdb|4E54|A Chain A, Damaged Dna Induced Uv-Damaged Dna-Binding Protein (Uv-Ddb)
            Dimerization And Its Roles In Chromatinized Dna Repair
 gi|401871507|pdb|4E5Z|A Chain A, Damaged Dna Induced Uv-Damaged Dna-Binding Protein (Uv-Ddb)
            Dimerization And Its Roles In Chromatinized Dna Repair
          Length = 1150

 Score = 63.2 bits (152), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 81/368 (22%), Positives = 140/368 (38%), Gaps = 77/368 (20%)

Query: 278  YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
            Y  +GT   Y E+   + GRI++F             +  K++ +  KE KG V ++   
Sbjct: 838  YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 886

Query: 337  AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
             G L+ ++    ++Y W  +    T     +  +    + +  + ILVGD  RS+ LL Y
Sbjct: 887  NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 945

Query: 395  QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
            +P       +ARD+ P                       W                    
Sbjct: 946  KPMEGNFEEIARDFNPN----------------------WM------------------S 965

Query: 455  HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
              +ILD+ + +G      +    LF+ Q ++  +    R  L +   FHLG+ VN F   
Sbjct: 966  AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 1017

Query: 513  RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
             C  S +    G  S   +    + +++G +G    L E  Y  LL +QN +       G
Sbjct: 1018 -CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1076

Query: 570  GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
             +    +R++  +      P+ G IDG L+  FL +S  +  E+   +      G K   
Sbjct: 1077 KIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREA 1134

Query: 624  ILDELYDI 631
              D+L  +
Sbjct: 1135 TADDLIKV 1142


>gi|361132523|pdb|4A0L|A Chain A, Structure Of Ddb1-Ddb2-Cul4b-Rbx1 Bound To A 12 Bp Abasic
            Site Containing Dna-Duplex
 gi|361132525|pdb|4A0L|C Chain C, Structure Of Ddb1-Ddb2-Cul4b-Rbx1 Bound To A 12 Bp Abasic
            Site Containing Dna-Duplex
          Length = 1144

 Score = 63.2 bits (152), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 81/368 (22%), Positives = 140/368 (38%), Gaps = 77/368 (20%)

Query: 278  YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
            Y  +GT   Y E+   + GRI++F             +  K++ +  KE KG V ++   
Sbjct: 832  YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 880

Query: 337  AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
             G L+ ++    ++Y W  +    T     +  +    + +  + ILVGD  RS+ LL Y
Sbjct: 881  NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 939

Query: 395  QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
            +P       +ARD+ P                       W                    
Sbjct: 940  KPMEGNFEEIARDFNPN----------------------WM------------------S 959

Query: 455  HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
              +ILD+ + +G      +    LF+ Q ++  +    R  L +   FHLG+ VN F   
Sbjct: 960  AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 1011

Query: 513  RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
             C  S +    G  S   +    + +++G +G    L E  Y  LL +QN +       G
Sbjct: 1012 -CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1070

Query: 570  GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
             +    +R++  +      P+ G IDG L+  FL +S  +  E+   +      G K   
Sbjct: 1071 KIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREA 1128

Query: 624  ILDELYDI 631
              D+L  +
Sbjct: 1129 TADDLIKV 1136


>gi|348526664|ref|XP_003450839.1| PREDICTED: DNA damage-binding protein 1-like [Oreochromis niloticus]
          Length = 1140

 Score = 63.2 bits (152), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 121/585 (20%), Positives = 219/585 (37%), Gaps = 104/585 (17%)

Query: 75   NEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLA 134
            +E+  +  G + + +R F +++    VF C   P  ++ +S  +L    + +   V+ + 
Sbjct: 624  SERKKVTLGTQPTVLRTFRSLS-TSNVFACSDRPTVIY-SSNHKLVFSNVNLK-EVNYMC 680

Query: 135  PFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCI 194
            P ++   P      N  S L I  +           +R VPL  +P  + Y   ++ + +
Sbjct: 681  PLNSEGYPDSLALAN-NSTLTIGTIDEI----QKLHIRTVPLYESPRRICYQEVSQCFGV 735

Query: 195  VTSTAE-----PSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTN 249
            ++S  E      +T   + +   + L +    S+  P   S    S       EE+   N
Sbjct: 736  LSSRVEIQDVSGTTSAVRPSASTQALSSSVSSSKLFPSSTSPHETSF-----GEEVEVHN 790

Query: 250  FPL---HEWEHVLCLKNVSMEYEGTLSGLR------GYIALGTNYNYSEDVTCR-GRILL 299
              +   H +E +   + +  EY  +L   R       Y  +GT   Y E+   + GRI++
Sbjct: 791  LLVVDQHTFEVLHAHQFLPSEYALSLVSCRLGKDPSVYFIVGTAMVYPEEAEPKQGRIIV 850

Query: 300  FDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQ--KIYIWQLKDND 357
            F             T  K++ +  KE KG V ++    G  + ++    ++Y W  +   
Sbjct: 851  FH-----------YTDGKLQTVAEKEVKGAVYSMVEFNGKFLASINSTVRLYEWTAEKEL 899

Query: 358  LTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKG 417
             T     +  +    + +  + ILVGD  RS+ LL Y+        +ARD+ P       
Sbjct: 900  RTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAYKSMEGNFEEIARDFNPN------ 952

Query: 418  YYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVV 477
                            W                      +ILD+ + +G      +    
Sbjct: 953  ----------------WM------------------SAVEILDDDNFLG-----AENAFN 973

Query: 478  LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARS---RFLTW 532
            LF+ Q ++  +    R  L +   FHLG+ VN F    C  S +    G  S   +    
Sbjct: 974  LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVLQNLGESSTPTQGSVL 1029

Query: 533  YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 592
            + +++G +G    L E  Y  LL LQN +       G +    +R++  +       + G
Sbjct: 1030 FGTVNGMIGLVTSLSEGWYSLLLDLQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEQATG 1087

Query: 593  IIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 631
             IDG L+  FL L   +  E+   +      G K    +DE+  I
Sbjct: 1088 FIDGDLIESFLDLGRAKMQEVVSTLQIDDGSGMKREATVDEVIKI 1132


>gi|119594340|gb|EAW73934.1| damage-specific DNA binding protein 1, 127kDa, isoform CRA_b [Homo
           sapiens]
          Length = 923

 Score = 63.2 bits (152), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 81/368 (22%), Positives = 140/368 (38%), Gaps = 77/368 (20%)

Query: 278 YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
           Y  +GT   Y E+   + GRI++F             +  K++ +  KE KG V ++   
Sbjct: 611 YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 659

Query: 337 AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
            G L+ ++    ++Y W  +    T     +  +    + +  + ILVGD  RS+ LL Y
Sbjct: 660 NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 718

Query: 395 QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
           +P       +ARD+ P                       W                    
Sbjct: 719 KPMEGNFEEIARDFNPN----------------------WM------------------S 738

Query: 455 HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
             +ILD+ + +G      +    LF+ Q ++  +    R  L +   FHLG+ VN F   
Sbjct: 739 AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 790

Query: 513 RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
            C  S +    G  S   +    + +++G +G    L E  Y  LL +QN +       G
Sbjct: 791 -CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 849

Query: 570 GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
            +    +R++  +      P+ G IDG L+  FL +S  +  E+   +      G K   
Sbjct: 850 KIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREA 907

Query: 624 ILDELYDI 631
             D+L  +
Sbjct: 908 TADDLIKV 915


>gi|418316|sp|P33194.1|DDB1_CERAE RecName: Full=DNA damage-binding protein 1; AltName: Full=DDB p127
            subunit; AltName: Full=DDBa; AltName:
            Full=Damage-specific DNA-binding protein 1; AltName:
            Full=UV-damaged DNA-binding protein 1; Short=UV-DDB 1
 gi|304026|gb|AAA03021.1| UV-damaged DNA-binding protein [Chlorocebus aethiops]
          Length = 1140

 Score = 63.2 bits (152), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 81/368 (22%), Positives = 140/368 (38%), Gaps = 77/368 (20%)

Query: 278  YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
            Y  +GT   Y E+   + GRI++F             +  K++ +  KE KG V ++   
Sbjct: 828  YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 876

Query: 337  AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
             G L+ ++    ++Y W  +    T     +  +    + +  + ILVGD  RS+ LL Y
Sbjct: 877  NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 935

Query: 395  QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
            +P       +ARD+ P                       W                    
Sbjct: 936  KPMEGNFEEIARDFNPN----------------------WM------------------S 955

Query: 455  HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
              +ILD+ + +G      +    LF+ Q ++  +    R  L +   FHLG+ VN F   
Sbjct: 956  AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 1007

Query: 513  RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
             C  S +    G  S   +    + +++G +G    L E  Y  LL +QN +       G
Sbjct: 1008 -CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066

Query: 570  GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
             +    +R++  +      P+ G IDG L+  FL +S  +  E+   +      G K   
Sbjct: 1067 KIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREA 1124

Query: 624  ILDELYDI 631
              D+L  +
Sbjct: 1125 TADDLIKV 1132


>gi|403255013|ref|XP_003920244.1| PREDICTED: DNA damage-binding protein 1 [Saimiri boliviensis
            boliviensis]
          Length = 1140

 Score = 63.2 bits (152), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 81/368 (22%), Positives = 140/368 (38%), Gaps = 77/368 (20%)

Query: 278  YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
            Y  +GT   Y E+   + GRI++F             +  K++ +  KE KG V ++   
Sbjct: 828  YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 876

Query: 337  AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
             G L+ ++    ++Y W  +    T     +  +    + +  + ILVGD  RS+ LL Y
Sbjct: 877  NGKLLASINSTVRLYEWTAEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 935

Query: 395  QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
            +P       +ARD+ P                       W                    
Sbjct: 936  KPMEGNFEEIARDFNPN----------------------WM------------------S 955

Query: 455  HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
              +ILD+ + +G      +    LF+ Q ++  +    R  L +   FHLG+ VN F   
Sbjct: 956  AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 1007

Query: 513  RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
             C  S +    G  S   +    + +++G +G    L E  Y  LL +QN +       G
Sbjct: 1008 -CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066

Query: 570  GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
             +    +R++  +      P+ G IDG L+  FL +S  +  E+   +      G K   
Sbjct: 1067 KIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREA 1124

Query: 624  ILDELYDI 631
              D+L  +
Sbjct: 1125 TADDLIKV 1132


>gi|355752055|gb|EHH56175.1| Damage-specific DNA-binding protein 1, partial [Macaca fascicularis]
          Length = 1125

 Score = 63.2 bits (152), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 81/368 (22%), Positives = 140/368 (38%), Gaps = 77/368 (20%)

Query: 278  YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
            Y  +GT   Y E+   + GRI++F             +  K++ +  KE KG V ++   
Sbjct: 813  YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 861

Query: 337  AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
             G L+ ++    ++Y W  +    T     +  +    + +  + ILVGD  RS+ LL Y
Sbjct: 862  NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 920

Query: 395  QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
            +P       +ARD+ P                       W                    
Sbjct: 921  KPMEGNFEEIARDFNPN----------------------WM------------------S 940

Query: 455  HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
              +ILD+ + +G      +    LF+ Q ++  +    R  L +   FHLG+ VN F   
Sbjct: 941  AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 992

Query: 513  RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
             C  S +    G  S   +    + +++G +G    L E  Y  LL +QN +       G
Sbjct: 993  -CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1051

Query: 570  GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
             +    +R++  +      P+ G IDG L+  FL +S  +  E+   +      G K   
Sbjct: 1052 KIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREA 1109

Query: 624  ILDELYDI 631
              D+L  +
Sbjct: 1110 TADDLIKV 1117


>gi|73983859|ref|XP_533275.2| PREDICTED: DNA damage-binding protein 1 [Canis lupus familiaris]
 gi|291409601|ref|XP_002721069.1| PREDICTED: damage-specific DNA binding protein 1 [Oryctolagus
            cuniculus]
 gi|301781686|ref|XP_002926259.1| PREDICTED: DNA damage-binding protein 1-like [Ailuropoda melanoleuca]
          Length = 1140

 Score = 63.2 bits (152), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 81/368 (22%), Positives = 140/368 (38%), Gaps = 77/368 (20%)

Query: 278  YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
            Y  +GT   Y E+   + GRI++F             +  K++ +  KE KG V ++   
Sbjct: 828  YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 876

Query: 337  AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
             G L+ ++    ++Y W  +    T     +  +    + +  + ILVGD  RS+ LL Y
Sbjct: 877  NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 935

Query: 395  QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
            +P       +ARD+ P                       W                    
Sbjct: 936  KPMEGNFEEIARDFNPN----------------------WM------------------S 955

Query: 455  HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
              +ILD+ + +G      +    LF+ Q ++  +    R  L +   FHLG+ VN F   
Sbjct: 956  AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 1007

Query: 513  RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
             C  S +    G  S   +    + +++G +G    L E  Y  LL +QN +       G
Sbjct: 1008 -CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066

Query: 570  GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
             +    +R++  +      P+ G IDG L+  FL +S  +  E+   +      G K   
Sbjct: 1067 KIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREA 1124

Query: 624  ILDELYDI 631
              D+L  +
Sbjct: 1125 TADDLIKV 1132


>gi|7657011|ref|NP_056550.1| DNA damage-binding protein 1 [Mus musculus]
 gi|134034087|sp|Q3U1J4.2|DDB1_MOUSE RecName: Full=DNA damage-binding protein 1; AltName: Full=DDB p127
            subunit; AltName: Full=Damage-specific DNA-binding
            protein 1; AltName: Full=UV-damaged DNA-binding factor
 gi|5931596|dbj|BAA84699.1| XPE UV-damaged DNA binding factor [Mus musculus]
 gi|16307148|gb|AAH09661.1| Damage specific DNA binding protein 1 [Mus musculus]
 gi|74182145|dbj|BAE34102.1| unnamed protein product [Mus musculus]
 gi|74196166|dbj|BAE32993.1| unnamed protein product [Mus musculus]
          Length = 1140

 Score = 63.2 bits (152), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 81/368 (22%), Positives = 140/368 (38%), Gaps = 77/368 (20%)

Query: 278  YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
            Y  +GT   Y E+   + GRI++F             +  K++ +  KE KG V ++   
Sbjct: 828  YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 876

Query: 337  AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
             G L+ ++    ++Y W  +    T     +  +    + +  + ILVGD  RS+ LL Y
Sbjct: 877  NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 935

Query: 395  QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
            +P       +ARD+ P                       W                    
Sbjct: 936  KPMEGNFEEIARDFNPN----------------------WM------------------S 955

Query: 455  HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
              +ILD+ + +G      +    LF+ Q ++  +    R  L +   FHLG+ VN F   
Sbjct: 956  AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 1007

Query: 513  RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
             C  S +    G  S   +    + +++G +G    L E  Y  LL +QN +       G
Sbjct: 1008 -CHGSLVMQNLGEASTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066

Query: 570  GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
             +    +R++  +      P+ G IDG L+  FL +S  +  E+   +      G K   
Sbjct: 1067 KIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREA 1124

Query: 624  ILDELYDI 631
              D+L  +
Sbjct: 1125 TADDLIKV 1132


>gi|5353754|gb|AAD42230.1|AF159853_1 damage-specific DNA binding protein 1 [Mus musculus]
          Length = 1140

 Score = 63.2 bits (152), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 81/368 (22%), Positives = 140/368 (38%), Gaps = 77/368 (20%)

Query: 278  YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
            Y  +GT   Y E+   + GRI++F             +  K++ +  KE KG V ++   
Sbjct: 828  YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 876

Query: 337  AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
             G L+ ++    ++Y W  +    T     +  +    + +  + ILVGD  RS+ LL Y
Sbjct: 877  NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 935

Query: 395  QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
            +P       +ARD+ P                       W                    
Sbjct: 936  KPMEGNFEEIARDFNPN----------------------WM------------------S 955

Query: 455  HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
              +ILD+ + +G      +    LF+ Q ++  +    R  L +   FHLG+ VN F   
Sbjct: 956  AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 1007

Query: 513  RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
             C  S +    G  S   +    + +++G +G    L E  Y  LL +QN +       G
Sbjct: 1008 -CHGSLVMQNLGEASTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066

Query: 570  GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
             +    +R++  +      P+ G IDG L+  FL +S  +  E+   +      G K   
Sbjct: 1067 KIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREA 1124

Query: 624  ILDELYDI 631
              D+L  +
Sbjct: 1125 TADDLIKV 1132


>gi|221046711|pdb|3EI1|A Chain A, Structure Of Hsddb1-Drddb2 Bound To A 14 Bp 6-4 Photoproduct
            Containing Dna-Duplex
 gi|221046715|pdb|3EI2|A Chain A, Structure Of Hsddb1-Drddb2 Bound To A 16 Bp Abasic Site
            Containing Dna-Duplex
 gi|221046719|pdb|3EI3|A Chain A, Structure Of The Hsddb1-Drddb2 Complex
          Length = 1158

 Score = 63.2 bits (152), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 81/368 (22%), Positives = 140/368 (38%), Gaps = 77/368 (20%)

Query: 278  YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
            Y  +GT   Y E+   + GRI++F             +  K++ +  KE KG V ++   
Sbjct: 846  YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 894

Query: 337  AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
             G L+ ++    ++Y W  +    T     +  +    + +  + ILVGD  RS+ LL Y
Sbjct: 895  NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 953

Query: 395  QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
            +P       +ARD+ P                       W                    
Sbjct: 954  KPMEGNFEEIARDFNPN----------------------WM------------------S 973

Query: 455  HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
              +ILD+ + +G      +    LF+ Q ++  +    R  L +   FHLG+ VN F   
Sbjct: 974  AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 1025

Query: 513  RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
             C  S +    G  S   +    + +++G +G    L E  Y  LL +QN +       G
Sbjct: 1026 -CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1084

Query: 570  GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
             +    +R++  +      P+ G IDG L+  FL +S  +  E+   +      G K   
Sbjct: 1085 KIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREA 1142

Query: 624  ILDELYDI 631
              D+L  +
Sbjct: 1143 TADDLIKV 1150


>gi|413081953|ref|NP_741992.2| DNA damage-binding protein 1 [Rattus norvegicus]
 gi|293344614|ref|XP_002725831.1| PREDICTED: DNA damage-binding protein 1 [Rattus norvegicus]
 gi|293356422|ref|XP_002728912.1| PREDICTED: DNA damage-binding protein 1 [Rattus norvegicus]
 gi|149062405|gb|EDM12828.1| damage-specific DNA binding protein 1 [Rattus norvegicus]
          Length = 1140

 Score = 63.2 bits (152), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 81/368 (22%), Positives = 140/368 (38%), Gaps = 77/368 (20%)

Query: 278  YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
            Y  +GT   Y E+   + GRI++F             +  K++ +  KE KG V ++   
Sbjct: 828  YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 876

Query: 337  AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
             G L+ ++    ++Y W  +    T     +  +    + +  + ILVGD  RS+ LL Y
Sbjct: 877  NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 935

Query: 395  QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
            +P       +ARD+ P                       W                    
Sbjct: 936  KPMEGNFEEIARDFNPN----------------------WM------------------S 955

Query: 455  HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
              +ILD+ + +G      +    LF+ Q ++  +    R  L +   FHLG+ VN F   
Sbjct: 956  AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 1007

Query: 513  RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
             C  S +    G  S   +    + +++G +G    L E  Y  LL +QN +       G
Sbjct: 1008 -CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066

Query: 570  GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
             +    +R++  +      P+ G IDG L+  FL +S  +  E+   +      G K   
Sbjct: 1067 KIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREA 1124

Query: 624  ILDELYDI 631
              D+L  +
Sbjct: 1125 TADDLIKV 1132


>gi|441604084|ref|XP_004087862.1| PREDICTED: LOW QUALITY PROTEIN: DNA damage-binding protein 1
            [Nomascus leucogenys]
          Length = 1140

 Score = 63.2 bits (152), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 81/368 (22%), Positives = 140/368 (38%), Gaps = 77/368 (20%)

Query: 278  YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
            Y  +GT   Y E+   + GRI++F             +  K++ +  KE KG V ++   
Sbjct: 828  YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 876

Query: 337  AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
             G L+ ++    ++Y W  +    T     +  +    + +  + ILVGD  RS+ LL Y
Sbjct: 877  NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 935

Query: 395  QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
            +P       +ARD+ P                       W                    
Sbjct: 936  KPMEGNFEEIARDFNPN----------------------WM------------------S 955

Query: 455  HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
              +ILD+ + +G      +    LF+ Q ++  +    R  L +   FHLG+ VN F   
Sbjct: 956  AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 1007

Query: 513  RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
             C  S +    G  S   +    + +++G +G    L E  Y  LL +QN +       G
Sbjct: 1008 -CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066

Query: 570  GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
             +    +R++  +      P+ G IDG L+  FL +S  +  E+   +      G K   
Sbjct: 1067 KIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREA 1124

Query: 624  ILDELYDI 631
              D+L  +
Sbjct: 1125 TADDLIKV 1132


>gi|358440070|pdb|4A0B|A Chain A, Structure Of Hsddb1-Drddb2 Bound To A 16 Bp Cpd-Duplex (
            Pyrimidine At D-1 Position) At 3.8 A Resolution (Cpd 4)
 gi|358440072|pdb|4A0B|C Chain C, Structure Of Hsddb1-Drddb2 Bound To A 16 Bp Cpd-Duplex (
            Pyrimidine At D-1 Position) At 3.8 A Resolution (Cpd 4)
          Length = 1159

 Score = 63.2 bits (152), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 81/368 (22%), Positives = 140/368 (38%), Gaps = 77/368 (20%)

Query: 278  YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
            Y  +GT   Y E+   + GRI++F             +  K++ +  KE KG V ++   
Sbjct: 847  YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 895

Query: 337  AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
             G L+ ++    ++Y W  +    T     +  +    + +  + ILVGD  RS+ LL Y
Sbjct: 896  NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 954

Query: 395  QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
            +P       +ARD+ P                       W                    
Sbjct: 955  KPMEGNFEEIARDFNPN----------------------WM------------------S 974

Query: 455  HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
              +ILD+ + +G      +    LF+ Q ++  +    R  L +   FHLG+ VN F   
Sbjct: 975  AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 1026

Query: 513  RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
             C  S +    G  S   +    + +++G +G    L E  Y  LL +QN +       G
Sbjct: 1027 -CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1085

Query: 570  GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
             +    +R++  +      P+ G IDG L+  FL +S  +  E+   +      G K   
Sbjct: 1086 KIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREA 1143

Query: 624  ILDELYDI 631
              D+L  +
Sbjct: 1144 TADDLIKV 1151


>gi|354504619|ref|XP_003514371.1| PREDICTED: DNA damage-binding protein 1-like [Cricetulus griseus]
 gi|344258340|gb|EGW14444.1| DNA damage-binding protein 1 [Cricetulus griseus]
          Length = 1140

 Score = 63.2 bits (152), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 81/368 (22%), Positives = 140/368 (38%), Gaps = 77/368 (20%)

Query: 278  YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
            Y  +GT   Y E+   + GRI++F             +  K++ +  KE KG V ++   
Sbjct: 828  YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 876

Query: 337  AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
             G L+ ++    ++Y W  +    T     +  +    + +  + ILVGD  RS+ LL Y
Sbjct: 877  NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 935

Query: 395  QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
            +P       +ARD+ P                       W                    
Sbjct: 936  KPMEGNFEEIARDFNPN----------------------WM------------------S 955

Query: 455  HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
              +ILD+ + +G      +    LF+ Q ++  +    R  L +   FHLG+ VN F   
Sbjct: 956  AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 1007

Query: 513  RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
             C  S +    G  S   +    + +++G +G    L E  Y  LL +QN +       G
Sbjct: 1008 -CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066

Query: 570  GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
             +    +R++  +      P+ G IDG L+  FL +S  +  E+   +      G K   
Sbjct: 1067 KIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREA 1124

Query: 624  ILDELYDI 631
              D+L  +
Sbjct: 1125 TADDLIKV 1132


>gi|119594343|gb|EAW73937.1| damage-specific DNA binding protein 1, 127kDa, isoform CRA_e [Homo
           sapiens]
          Length = 896

 Score = 63.2 bits (152), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 81/368 (22%), Positives = 140/368 (38%), Gaps = 77/368 (20%)

Query: 278 YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
           Y  +GT   Y E+   + GRI++F             +  K++ +  KE KG V ++   
Sbjct: 584 YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 632

Query: 337 AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
            G L+ ++    ++Y W  +    T     +  +    + +  + ILVGD  RS+ LL Y
Sbjct: 633 NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 691

Query: 395 QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
           +P       +ARD+ P                       W                    
Sbjct: 692 KPMEGNFEEIARDFNPN----------------------WM------------------S 711

Query: 455 HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
             +ILD+ + +G      +    LF+ Q ++  +    R  L +   FHLG+ VN F   
Sbjct: 712 AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 763

Query: 513 RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
            C  S +    G  S   +    + +++G +G    L E  Y  LL +QN +       G
Sbjct: 764 -CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 822

Query: 570 GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
            +    +R++  +      P+ G IDG L+  FL +S  +  E+   +      G K   
Sbjct: 823 KIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREA 880

Query: 624 ILDELYDI 631
             D+L  +
Sbjct: 881 TADDLIKV 888


>gi|359546285|pdb|4A11|A Chain A, Structure Of The Hsddb1-Hscsa Complex
 gi|361132519|pdb|4A0K|C Chain C, Structure Of Ddb1-Ddb2-Cul4a-Rbx1 Bound To A 12 Bp Abasic
            Site Containing Dna-Duplex
          Length = 1159

 Score = 63.2 bits (152), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 81/368 (22%), Positives = 140/368 (38%), Gaps = 77/368 (20%)

Query: 278  YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
            Y  +GT   Y E+   + GRI++F             +  K++ +  KE KG V ++   
Sbjct: 847  YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 895

Query: 337  AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
             G L+ ++    ++Y W  +    T     +  +    + +  + ILVGD  RS+ LL Y
Sbjct: 896  NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 954

Query: 395  QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
            +P       +ARD+ P                       W                    
Sbjct: 955  KPMEGNFEEIARDFNPN----------------------WM------------------S 974

Query: 455  HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
              +ILD+ + +G      +    LF+ Q ++  +    R  L +   FHLG+ VN F   
Sbjct: 975  AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 1026

Query: 513  RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
             C  S +    G  S   +    + +++G +G    L E  Y  LL +QN +       G
Sbjct: 1027 -CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1085

Query: 570  GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
             +    +R++  +      P+ G IDG L+  FL +S  +  E+   +      G K   
Sbjct: 1086 KIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREA 1143

Query: 624  ILDELYDI 631
              D+L  +
Sbjct: 1144 TADDLIKV 1151


>gi|149725200|ref|XP_001502072.1| PREDICTED: DNA damage-binding protein 1 [Equus caballus]
          Length = 1140

 Score = 63.2 bits (152), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 81/368 (22%), Positives = 140/368 (38%), Gaps = 77/368 (20%)

Query: 278  YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
            Y  +GT   Y E+   + GRI++F             +  K++ +  KE KG V ++   
Sbjct: 828  YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 876

Query: 337  AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
             G L+ ++    ++Y W  +    T     +  +    + +  + ILVGD  RS+ LL Y
Sbjct: 877  NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 935

Query: 395  QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
            +P       +ARD+ P                       W                    
Sbjct: 936  KPMEGNFEEIARDFNPN----------------------WM------------------S 955

Query: 455  HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
              +ILD+ + +G      +    LF+ Q ++  +    R  L +   FHLG+ VN F   
Sbjct: 956  AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 1007

Query: 513  RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
             C  S +    G  S   +    + +++G +G    L E  Y  LL +QN +       G
Sbjct: 1008 -CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066

Query: 570  GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
             +    +R++  +      P+ G IDG L+  FL +S  +  E+   +      G K   
Sbjct: 1067 KIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREA 1124

Query: 624  ILDELYDI 631
              D+L  +
Sbjct: 1125 TADDLIKV 1132


>gi|384941436|gb|AFI34323.1| DNA damage-binding protein 1 [Macaca mulatta]
          Length = 1140

 Score = 63.2 bits (152), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 81/368 (22%), Positives = 140/368 (38%), Gaps = 77/368 (20%)

Query: 278  YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
            Y  +GT   Y E+   + GRI++F             +  K++ +  KE KG V ++   
Sbjct: 828  YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 876

Query: 337  AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
             G L+ ++    ++Y W  +    T     +  +    + +  + ILVGD  RS+ LL Y
Sbjct: 877  NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 935

Query: 395  QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
            +P       +ARD+ P                       W                    
Sbjct: 936  KPMEGNFEEIARDFNPN----------------------WM------------------S 955

Query: 455  HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
              +ILD+ + +G      +    LF+ Q ++  +    R  L +   FHLG+ VN F   
Sbjct: 956  AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 1007

Query: 513  RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
             C  S +    G  S   +    + +++G +G    L E  Y  LL +QN +       G
Sbjct: 1008 -CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066

Query: 570  GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
             +    +R++  +      P+ G IDG L+  FL +S  +  E+   +      G K   
Sbjct: 1067 KIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREA 1124

Query: 624  ILDELYDI 631
              D+L  +
Sbjct: 1125 TADDLIKV 1132


>gi|74178494|dbj|BAE32502.1| unnamed protein product [Mus musculus]
          Length = 1140

 Score = 63.2 bits (152), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 81/368 (22%), Positives = 140/368 (38%), Gaps = 77/368 (20%)

Query: 278  YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
            Y  +GT   Y E+   + GRI++F             +  K++ +  KE KG V ++   
Sbjct: 828  YFIVGTAMVYPEEAEPKLGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 876

Query: 337  AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
             G L+ ++    ++Y W  +    T     +  +    + +  + ILVGD  RS+ LL Y
Sbjct: 877  NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 935

Query: 395  QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
            +P       +ARD+ P                       W                    
Sbjct: 936  KPMEGNFEEIARDFNPN----------------------WM------------------S 955

Query: 455  HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
              +ILD+ + +G      +    LF+ Q ++  +    R  L +   FHLG+ VN F   
Sbjct: 956  AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 1007

Query: 513  RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
             C  S +    G  S   +    + +++G +G    L E  Y  LL +QN +       G
Sbjct: 1008 -CHGSLVMQNLGEASTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066

Query: 570  GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
             +    +R++  +      P+ G IDG L+  FL +S  +  E+   +      G K   
Sbjct: 1067 KIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREA 1124

Query: 624  ILDELYDI 631
              D+L  +
Sbjct: 1125 TADDLIKV 1132


>gi|395852550|ref|XP_003798801.1| PREDICTED: DNA damage-binding protein 1 [Otolemur garnettii]
          Length = 1140

 Score = 63.2 bits (152), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 81/368 (22%), Positives = 140/368 (38%), Gaps = 77/368 (20%)

Query: 278  YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
            Y  +GT   Y E+   + GRI++F             +  K++ +  KE KG V ++   
Sbjct: 828  YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 876

Query: 337  AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
             G L+ ++    ++Y W  +    T     +  +    + +  + ILVGD  RS+ LL Y
Sbjct: 877  NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 935

Query: 395  QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
            +P       +ARD+ P                       W                    
Sbjct: 936  KPMEGNFEEIARDFNPN----------------------WM------------------S 955

Query: 455  HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
              +ILD+ + +G      +    LF+ Q ++  +    R  L +   FHLG+ VN F   
Sbjct: 956  AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 1007

Query: 513  RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
             C  S +    G  S   +    + +++G +G    L E  Y  LL +QN +       G
Sbjct: 1008 -CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066

Query: 570  GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
             +    +R++  +      P+ G IDG L+  FL +S  +  E+   +      G K   
Sbjct: 1067 KIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREA 1124

Query: 624  ILDELYDI 631
              D+L  +
Sbjct: 1125 TADDLIKV 1132


>gi|358440058|pdb|4A08|A Chain A, Structure Of Hsddb1-Drddb2 Bound To A 13 Bp Cpd-Duplex (
            Purine At D-1 Position) At 3.0 A Resolution (Cpd 1)
 gi|358440062|pdb|4A09|A Chain A, Structure Of Hsddb1-Drddb2 Bound To A 15 Bp Cpd-Duplex
            (Purine At D-1 Position) At 3.1 A Resolution (Cpd 2)
          Length = 1159

 Score = 63.2 bits (152), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 81/368 (22%), Positives = 140/368 (38%), Gaps = 77/368 (20%)

Query: 278  YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
            Y  +GT   Y E+   + GRI++F             +  K++ +  KE KG V ++   
Sbjct: 847  YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 895

Query: 337  AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
             G L+ ++    ++Y W  +    T     +  +    + +  + ILVGD  RS+ LL Y
Sbjct: 896  NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 954

Query: 395  QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
            +P       +ARD+ P                       W                    
Sbjct: 955  KPMEGNFEEIARDFNPN----------------------WM------------------S 974

Query: 455  HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
              +ILD+ + +G      +    LF+ Q ++  +    R  L +   FHLG+ VN F   
Sbjct: 975  AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 1026

Query: 513  RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
             C  S +    G  S   +    + +++G +G    L E  Y  LL +QN +       G
Sbjct: 1027 -CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1085

Query: 570  GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
             +    +R++  +      P+ G IDG L+  FL +S  +  E+   +      G K   
Sbjct: 1086 KIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREA 1143

Query: 624  ILDELYDI 631
              D+L  +
Sbjct: 1144 TADDLIKV 1151


>gi|344295432|ref|XP_003419416.1| PREDICTED: DNA damage-binding protein 1 [Loxodonta africana]
          Length = 1140

 Score = 63.2 bits (152), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 81/368 (22%), Positives = 140/368 (38%), Gaps = 77/368 (20%)

Query: 278  YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
            Y  +GT   Y E+   + GRI++F             +  K++ +  KE KG V ++   
Sbjct: 828  YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 876

Query: 337  AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
             G L+ ++    ++Y W  +    T     +  +    + +  + ILVGD  RS+ LL Y
Sbjct: 877  NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 935

Query: 395  QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
            +P       +ARD+ P                       W                    
Sbjct: 936  KPMEGNFEEIARDFNPN----------------------WM------------------S 955

Query: 455  HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
              +ILD+ + +G      +    LF+ Q ++  +    R  L +   FHLG+ VN F   
Sbjct: 956  AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 1007

Query: 513  RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
             C  S +    G  S   +    + +++G +G    L E  Y  LL +QN +       G
Sbjct: 1008 -CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066

Query: 570  GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
             +    +R++  +      P+ G IDG L+  FL +S  +  E+   +      G K   
Sbjct: 1067 KIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREA 1124

Query: 624  ILDELYDI 631
              D+L  +
Sbjct: 1125 TADDLIKV 1132


>gi|358440066|pdb|4A0A|A Chain A, Structure Of Hsddb1-Drddb2 Bound To A 16 Bp Cpd-Duplex (
            Pyrimidine At D-1 Position) At 3.6 A Resolution (Cpd 3)
          Length = 1159

 Score = 63.2 bits (152), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 81/368 (22%), Positives = 140/368 (38%), Gaps = 77/368 (20%)

Query: 278  YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
            Y  +GT   Y E+   + GRI++F             +  K++ +  KE KG V ++   
Sbjct: 847  YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 895

Query: 337  AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
             G L+ ++    ++Y W  +    T     +  +    + +  + ILVGD  RS+ LL Y
Sbjct: 896  NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 954

Query: 395  QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
            +P       +ARD+ P                       W                    
Sbjct: 955  KPMEGNFEEIARDFNPN----------------------WM------------------S 974

Query: 455  HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
              +ILD+ + +G      +    LF+ Q ++  +    R  L +   FHLG+ VN F   
Sbjct: 975  AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 1026

Query: 513  RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
             C  S +    G  S   +    + +++G +G    L E  Y  LL +QN +       G
Sbjct: 1027 -CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1085

Query: 570  GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
             +    +R++  +      P+ G IDG L+  FL +S  +  E+   +      G K   
Sbjct: 1086 KIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREA 1143

Query: 624  ILDELYDI 631
              D+L  +
Sbjct: 1144 TADDLIKV 1151


>gi|311247551|ref|XP_003122699.1| PREDICTED: DNA damage-binding protein 1-like isoform 1 [Sus scrofa]
          Length = 1140

 Score = 63.2 bits (152), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 81/368 (22%), Positives = 140/368 (38%), Gaps = 77/368 (20%)

Query: 278  YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
            Y  +GT   Y E+   + GRI++F             +  K++ +  KE KG V ++   
Sbjct: 828  YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 876

Query: 337  AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
             G L+ ++    ++Y W  +    T     +  +    + +  + ILVGD  RS+ LL Y
Sbjct: 877  NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 935

Query: 395  QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
            +P       +ARD+ P                       W                    
Sbjct: 936  KPMEGNFEEIARDFNPN----------------------WM------------------S 955

Query: 455  HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
              +ILD+ + +G      +    LF+ Q ++  +    R  L +   FHLG+ VN F   
Sbjct: 956  AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 1007

Query: 513  RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
             C  S +    G  S   +    + +++G +G    L E  Y  LL +QN +       G
Sbjct: 1008 -CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066

Query: 570  GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
             +    +R++  +      P+ G IDG L+  FL +S  +  E+   +      G K   
Sbjct: 1067 KIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREA 1124

Query: 624  ILDELYDI 631
              D+L  +
Sbjct: 1125 TADDLIKV 1132


>gi|122692537|ref|NP_001073731.1| DNA damage-binding protein 1 [Bos taurus]
 gi|426251842|ref|XP_004019630.1| PREDICTED: DNA damage-binding protein 1 [Ovis aries]
 gi|134034086|sp|A1A4K3.1|DDB1_BOVIN RecName: Full=DNA damage-binding protein 1; AltName:
            Full=Damage-specific DNA-binding protein 1
 gi|119223918|gb|AAI26630.1| Damage-specific DNA binding protein 1, 127kDa [Bos taurus]
 gi|296471644|tpg|DAA13759.1| TPA: DNA damage-binding protein 1 [Bos taurus]
          Length = 1140

 Score = 63.2 bits (152), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 81/368 (22%), Positives = 140/368 (38%), Gaps = 77/368 (20%)

Query: 278  YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
            Y  +GT   Y E+   + GRI++F             +  K++ +  KE KG V ++   
Sbjct: 828  YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 876

Query: 337  AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
             G L+ ++    ++Y W  +    T     +  +    + +  + ILVGD  RS+ LL Y
Sbjct: 877  NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 935

Query: 395  QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
            +P       +ARD+ P                       W                    
Sbjct: 936  KPMEGNFEEIARDFNPN----------------------WM------------------S 955

Query: 455  HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
              +ILD+ + +G      +    LF+ Q ++  +    R  L +   FHLG+ VN F   
Sbjct: 956  AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 1007

Query: 513  RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
             C  S +    G  S   +    + +++G +G    L E  Y  LL +QN +       G
Sbjct: 1008 -CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066

Query: 570  GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
             +    +R++  +      P+ G IDG L+  FL +S  +  E+   +      G K   
Sbjct: 1067 KIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREA 1124

Query: 624  ILDELYDI 631
              D+L  +
Sbjct: 1125 TADDLIKV 1132


>gi|355683071|gb|AER97036.1| damage-specific DNA binding protein 1, 127kDa [Mustela putorius furo]
          Length = 1122

 Score = 62.8 bits (151), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 76/337 (22%), Positives = 130/337 (38%), Gaps = 71/337 (21%)

Query: 278  YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
            Y  +GT   Y E+   + GRI++F             +  K++ +  KE KG V ++   
Sbjct: 828  YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 876

Query: 337  AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
             G L+ ++    ++Y W  +    T     +  +    + +  + ILVGD  RS+ LL Y
Sbjct: 877  NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 935

Query: 395  QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
            +P       +ARD+ P                       W                    
Sbjct: 936  KPMEGNFEEIARDFNPN----------------------WM------------------S 955

Query: 455  HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
              +ILD+ + +G      +    LF+ Q ++  +    R  L +   FHLG+ VN F   
Sbjct: 956  AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 1007

Query: 513  RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
             C  S +    G  S   +    + +++G +G    L E  Y  LL +QN +       G
Sbjct: 1008 -CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066

Query: 570  GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLS 606
             +    +R++  +      P+ G IDG L+  FL +S
Sbjct: 1067 KIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDIS 1101


>gi|58383228|ref|XP_312466.2| AGAP002472-PA [Anopheles gambiae str. PEST]
 gi|55242305|gb|EAA08181.2| AGAP002472-PA [Anopheles gambiae str. PEST]
          Length = 1138

 Score = 62.8 bits (151), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 92/454 (20%), Positives = 170/454 (37%), Gaps = 84/454 (18%)

Query: 171  VRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPP--L 228
            +R VPL  +P  +AY   ++T+ ++T         ++ + +D   +T  R S       +
Sbjct: 713  IRTVPLGESPRRIAYQEASQTFGVIT---------FRMDVQDSSGLTPARQSASTQTNNI 763

Query: 229  VSQFHVSLFSPFS-----WEEIPQTNFPL---HEWEHVLCLKNVSMEYEGTLSGLR---- 276
                 + L  P +      +E+   N  +   + +E +   + +  EY  +L   +    
Sbjct: 764  TQSSGMGLLKPGASNTEFGQEVEVHNLLIIDQNTFEVLHAHQFMQTEYALSLMSAKLGND 823

Query: 277  --GYIALGTN-YNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAI 333
               Y  +GT   N  E     GRI+++               N++KM+  KE KG   ++
Sbjct: 824  PNTYFIVGTGLVNPEEPEPKTGRIIIY-----------RYADNELKMVSDKEVKGACYSL 872

Query: 334  CHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVK-NLILVGDYARSIALL 392
                G ++  +   + +++  D+    +        +A     K + ILVGD  RSI LL
Sbjct: 873  VEFNGRVLACINSTVRLYEWTDDKDLRLECSHFNNVLALYCKTKGDFILVGDLMRSITLL 932

Query: 393  RYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 452
            +Y+    +   +ARDY+P                       W                  
Sbjct: 933  QYKQMEGSFEEIARDYQPN----------------------WM----------------- 953

Query: 453  SKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 512
                +ILD+ + +G   +D   N+ + +    A       ++ +   FHLG  VN F   
Sbjct: 954  -TAVEILDDDAFLG---ADNSNNLFVCLKDSAATTDEERQQMPEVAQFHLGDMVNVFRHG 1009

Query: 513  RCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 572
                 +IS+     +  +  + ++ GA+G    +    Y  L  LQ  +       G ++
Sbjct: 1010 SLVMQNISERSTPTTGCV-LFGTVSGAIGLVTQIQSDFYEFLRKLQENLTNTIKSVGKID 1068

Query: 573  PRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLS 606
               +R++  +         G IDG LV  FL LS
Sbjct: 1069 HSYWRSFHTETKM--ERCEGFIDGDLVESFLDLS 1100


>gi|410912407|ref|XP_003969681.1| PREDICTED: DNA damage-binding protein 1-like [Takifugu rubripes]
          Length = 1140

 Score = 62.8 bits (151), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 82/368 (22%), Positives = 140/368 (38%), Gaps = 77/368 (20%)

Query: 278  YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
            Y  +GT   Y E+   + GRI++F             T  K++ +  KE KG V ++   
Sbjct: 828  YFVVGTAMVYPEEAEPKQGRIIVFH-----------YTDGKLQTVAEKEVKGAVYSMVEF 876

Query: 337  AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
             G L+ ++    ++Y W  +    T  +  +  +    + +  + ILVGD  RS+ LL Y
Sbjct: 877  NGKLLASINSTVRLYEWTAEKELRTECSHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 935

Query: 395  QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
            +P       +ARD+ P                       W                    
Sbjct: 936  KPMEGNFEEIARDFNPN----------------------WM------------------S 955

Query: 455  HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
              +ILD+ + +G      +    LF+ Q ++  +    R  L +   FHLG+ VN F   
Sbjct: 956  AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEDRQHLQEVGVFHLGEFVNVF--- 1007

Query: 513  RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
             C  S +    G  S   +    + ++ G +G    L E  +  LL LQN +       G
Sbjct: 1008 -CHGSLVLQNLGETSTPTQGSVLFGTVTGMIGLVTSLSEGWHSLLLDLQNRLNKVIKSVG 1066

Query: 570  GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
             +    +R++  +       ++G IDG L+  FL L   +  E+   +      G K   
Sbjct: 1067 KIEHSFWRSFHTE--RKTEQAKGFIDGDLIESFLDLGRAKMQEVVSTLQIDDGSGMKREA 1124

Query: 624  ILDELYDI 631
             +DE+  I
Sbjct: 1125 TVDEVIKI 1132


>gi|74215029|dbj|BAE33503.1| unnamed protein product [Mus musculus]
          Length = 1140

 Score = 62.8 bits (151), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 81/368 (22%), Positives = 140/368 (38%), Gaps = 77/368 (20%)

Query: 278  YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
            Y  +GT   Y E+   + GRI++F             +  K++ +  KE KG V ++   
Sbjct: 828  YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 876

Query: 337  AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
             G L+ ++    ++Y W  +    T     +  +    + +  + ILVGD  RS+ LL Y
Sbjct: 877  NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 935

Query: 395  QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
            +P       +ARD+ P                       W                    
Sbjct: 936  KPMEGNFEEIARDFNPN----------------------WM------------------S 955

Query: 455  HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
              +ILD+ + +G      +    LF+ Q ++  +    R  L +   FHLG+ VN F   
Sbjct: 956  AVEILDDDNFLG-----AENAFNLFVCQRDSAATTDEERQHLQEVGLFHLGEFVNVF--- 1007

Query: 513  RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
             C  S +    G  S   +    + +++G +G    L E  Y  LL +QN +       G
Sbjct: 1008 -CHGSLVMQNLGEASTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066

Query: 570  GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
             +    +R++  +      P+ G IDG L+  FL +S  +  E+   +      G K   
Sbjct: 1067 KIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREA 1124

Query: 624  ILDELYDI 631
              D+L  +
Sbjct: 1125 TADDLIKV 1132


>gi|223647932|gb|ACN10724.1| DNA damage-binding protein 1 [Salmo salar]
          Length = 1139

 Score = 62.8 bits (151), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 121/579 (20%), Positives = 216/579 (37%), Gaps = 92/579 (15%)

Query: 75   NEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLA 134
            +E+  +  G + + +R F +++    VF C   P  ++ +S  +L    + +   V+ + 
Sbjct: 623  SERKKVTLGTQPTVLRTFRSLS-TSNVFACSDRPTVIY-SSNHKLVFSNVNLK-EVNYMC 679

Query: 135  PFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCI 194
            P ++   P      N  S L I  +           +R VPL  +P  + Y   ++ + +
Sbjct: 680  PLNSEGYPDSLALAN-NSTLTIGTIDEI----QKLHIRTVPLYESPRRICYQEVSQCFGV 734

Query: 195  VTSTAE-----PSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSL---FSPFSWEEIP 246
            ++S  E      +T   + +   + L +    S+  P   S    S        S   + 
Sbjct: 735  LSSRVEMQDASGTTAAVRPSASTQALSSSVSSSKLFPSSTSPHETSFGEEVEVHSLLVVD 794

Query: 247  QTNFP-LHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCR-GRILLFDIIE 304
            Q  F  LH  + +     +SM        L  Y  +GT   Y E+   + GRI++F    
Sbjct: 795  QHTFEVLHAHQFLQSEYALSMVSCRLGRDLSVYFIVGTAMVYPEEAEPKQGRIIVFH--- 851

Query: 305  VVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQ--KIYIWQLKDNDLTGIA 362
                     T  K++ +  KE KG V ++    G L+ ++    ++Y W  +    T   
Sbjct: 852  --------YTDGKLQTVAEKEVKGAVYSMMEFNGKLLASINSTVRLYEWTAEKELRTECN 903

Query: 363  FIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGN 422
              +  +    + +  + ILVGD  RS+ LL Y+P       +ARD+ P            
Sbjct: 904  HYNN-IMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIARDFNPN----------- 951

Query: 423  PSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQ 482
                       W                      +ILD+ + +G      +    LF+ Q
Sbjct: 952  -----------WM------------------SAVEILDDDNFLG-----AENAFNLFVCQ 977

Query: 483  PEARESNGGHR--LIKKTDFHLGQHVNTFF--KIRCKPSSISDAPGARSRFLTWYASLDG 538
             ++  +    R  L +   FHLG+ VN F    +  +    S  P   S     + +++G
Sbjct: 978  KDSAATTDEERQHLQEVGVFHLGEFVNVFSHGSLVLQNLGESSTPTQGS---VLFGTVNG 1034

Query: 539  ALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSL 598
             +G    L E  Y  LL LQN +       G +    +R++  +       + G IDG L
Sbjct: 1035 MIGLVTSLSEGWYSLLLDLQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEQATGFIDGDL 1092

Query: 599  VWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 631
            +  FL L   +  E+   +      G K    +DE+  I
Sbjct: 1093 IESFLDLGRAKMQEVVSTLQIDDGSGMKREATVDEVIKI 1131


>gi|167384458|ref|XP_001736962.1| hypothetical protein [Entamoeba dispar SAW760]
 gi|165900458|gb|EDR26769.1| hypothetical protein EDI_171140 [Entamoeba dispar SAW760]
          Length = 836

 Score = 62.8 bits (151), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 84/371 (22%), Positives = 150/371 (40%), Gaps = 86/371 (23%)

Query: 275 LRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYA-KEQKGPVTAI 333
           L+ Y+ +G N   +ED   +G+  +F+I            +N+I++I    + K  V A+
Sbjct: 521 LKNYLVVGVNKQTTEDNPVKGKTYIFNI------------ENQIQLINKIGDGKKSVHAV 568

Query: 334 CHVAGFLVTAVGQKI-YIWQLKDNDLTGIAFIDTEVYIASM------VSVKN------LI 380
             + GFL  A G ++  I ++ +       F D  + I S+      V  K       LI
Sbjct: 569 NEIGGFLAVASGNELELIERVDETRWIKKCFSDISILINSIEYLPLKVMEKGNEKECYLI 628

Query: 381 LVGDYARSIALLRYQP-EYRTLSLVARDYKPTQPNSKGYYAGNPSRGI--IDGSLVWKFL 437
           L+ D+ RS+ LL ++P +Y  + L                 G  +R I  ID + +    
Sbjct: 629 LLSDFYRSVVLLLFKPYDYTVIPL-----------------GKDARNIHCIDSTFI---- 667

Query: 438 QLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKK 497
                    I K          D FS + F   D ++N+ L  Y   A E      +   
Sbjct: 668 ---------ITK----------DYFSVLEF---DSEQNLSLLNYSSAATEQLSIFEI--D 703

Query: 498 TDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLML 557
             F+LG ++  F       + + +  G    ++  Y +++G++G+   + EK Y+ L  +
Sbjct: 704 ATFNLGMNLLKF-------TRLWNGKG----YIYMYVTVEGSVGYISVVEEKIYQVLRQI 752

Query: 558 QNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI 617
              M     H  G N   +R  KG G   G      +DG ++ +F  L+  ++  +C + 
Sbjct: 753 NIKMNREPWHFAGTNAEEYRFEKGYGMGFGTRKHVFLDGDMLKQFRLLNEEQQKRVCLR- 811

Query: 618 GSKHNDILDEL 628
            +  ND+   L
Sbjct: 812 NTSINDVFKLL 822


>gi|301616502|ref|XP_002937687.1| PREDICTED: DNA damage-binding protein 1-like [Xenopus (Silurana)
            tropicalis]
          Length = 1140

 Score = 62.8 bits (151), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 126/598 (21%), Positives = 225/598 (37%), Gaps = 119/598 (19%)

Query: 66   FVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMT 125
             +SDR K       +  G + + +R F +++    VF C   P  ++ +S  +L    + 
Sbjct: 622  LLSDRKK-------VTLGTQPTVLRTFRSLS-TTNVFACSDRPTVIY-SSNHKLVFSNVN 672

Query: 126  IDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAY 185
            +   V+ + P ++   P      N  S L I  +           +R VPL  +P  + Y
Sbjct: 673  LK-EVNYMCPLNSEGYPDSLALAN-NSTLTIGTIDEI----QKLHIRTVPLYESPRKICY 726

Query: 186  HLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDP-RDSRFIPPLVSQFHVS-LFSPFS-- 241
               ++ + +++S  E          +D    + P R S     L S    S LFS  +  
Sbjct: 727  QEVSQCFGVLSSRIEV---------QDASGGSSPLRPSASTQALSSSVSCSKLFSGSTSP 777

Query: 242  -----WEEIPQTNFPL---HEWEHVLCLKNVSMEYEGTLSGLR------GYIALGTNYNY 287
                  EE+   N  +   H +E +   + +  EY  +L   +       Y  +GT   Y
Sbjct: 778  HETSFGEEVEVHNLLIIDQHTFEVLHTHQFLQNEYTLSLVSCKLGKDPTTYFVVGTAMVY 837

Query: 288  SEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQ 346
             ++   + GRI++F                K++ +  KE KG V ++    G L+ ++  
Sbjct: 838  PDEAEPKQGRIVVFQ-----------YNDGKLQTVAEKEVKGAVYSMVEFNGKLLASINS 886

Query: 347  --KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLV 404
              ++Y W  +    T     +  +    + +  + ILVGD  RS+ LL Y+P       +
Sbjct: 887  TVRLYEWTAEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEI 945

Query: 405  ARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSS 464
            ARD+ P                       W                      +ILD+ + 
Sbjct: 946  ARDFNPN----------------------WM------------------SAVEILDDDNF 965

Query: 465  MGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSSISDA 522
            +G      +    LF+ Q ++  +    R  L +   FHLG+ VN F    C  S +   
Sbjct: 966  LG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF----CHGSLVMQN 1016

Query: 523  PGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTY 579
             G  S   +    + +++G +G    L E  Y  LL +QN +       G +    +R++
Sbjct: 1017 LGETSPPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDVQNRLNKVIKSVGKIEHSFWRSF 1076

Query: 580  KGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 631
              +      P+ G IDG L+  FL +S  +  E+   +      G K    +D+L  +
Sbjct: 1077 HTE--RKTEPATGFIDGDLIESFLDISRPKMQEVIANLQIDDGSGMKRETTVDDLIKV 1132


>gi|259155222|ref|NP_001158852.1| DNA damage-binding protein 1 [Salmo salar]
 gi|223647700|gb|ACN10608.1| DNA damage-binding protein 1 [Salmo salar]
          Length = 1139

 Score = 62.8 bits (151), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 120/584 (20%), Positives = 220/584 (37%), Gaps = 102/584 (17%)

Query: 75   NEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLA 134
            +E+  +  G + + +R F +++    VF C   P  ++ +S  +L    + +   V+ + 
Sbjct: 623  SERKKVTLGTQPTVLRTFRSLS-TSNVFACSDRPTVIY-SSNHKLVFSNVNLK-EVNYMC 679

Query: 135  PFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCI 194
            P ++   P      N  S L I  +           +R VPL  +P  + Y   ++ + +
Sbjct: 680  PLNSEGYPDSLALAN-NSTLTIGTIDEI----QKLHIRTVPLYESPRRICYQEVSQCFGV 734

Query: 195  VTSTAE-----PSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTN 249
            ++S  E      +T   + +   + L +    S+  P   S    S       EE+   +
Sbjct: 735  LSSRVEMQDASGTTAAVRPSASTQALSSSVSSSKLFPSSTSPHETSF-----GEEVEVHS 789

Query: 250  FPL---HEWEHVLCLKNVSMEYEGTLSGLR------GYIALGTNYNYSEDVTCR-GRILL 299
              +   H +E +   + +  EY  ++   R       Y  +GT   Y E+   + GRI++
Sbjct: 790  LLVVDQHTFEVLHAHQFLQSEYALSMVSCRLGRDPAVYFIVGTAMVYPEEAEPKQGRIIV 849

Query: 300  FDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQ--KIYIWQLKDND 357
            F             T  K++ +  KE KG V ++    G L+ ++    ++Y W  +   
Sbjct: 850  FH-----------YTDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRLYEWTAEKEL 898

Query: 358  LTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKG 417
             T     +  +    + +  + ILVGD  RS+ LL Y+P       +ARD+ P       
Sbjct: 899  RTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIARDFNPN------ 951

Query: 418  YYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVV 477
                            W                      +ILD+ + +G      +    
Sbjct: 952  ----------------WM------------------SAVEILDDDNFLG-----AENAFN 972

Query: 478  LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFF--KIRCKPSSISDAPGARSRFLTWY 533
            LF+ Q ++  +    R  L +   FHLG+ VN F    +  +    S  P   S     +
Sbjct: 973  LFVCQKDSAATTDEERQHLQEVGVFHLGEFVNVFSHGSLVLQNLGESSTPTQGS---VLF 1029

Query: 534  ASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGI 593
             +++G +G    L E  Y  LL LQN +       G +    +R++  +       + G 
Sbjct: 1030 GTVNGMIGLVTSLSEGWYSLLLDLQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEQATGF 1087

Query: 594  IDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 631
            IDG L+  FL L   +  E+   +      G K    +DE+  I
Sbjct: 1088 IDGDLIESFLDLGRAKMQEVVSTLQIDDGSGMKREATVDEVIKI 1131


>gi|327278830|ref|XP_003224163.1| PREDICTED: DNA damage-binding protein 1-like [Anolis carolinensis]
          Length = 1140

 Score = 62.4 bits (150), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 84/369 (22%), Positives = 142/369 (38%), Gaps = 79/369 (21%)

Query: 278  YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
            Y  +GT   Y ++   + GRI++F             +  K++ +  KE KG V ++   
Sbjct: 828  YFIVGTAMVYPDEAEPKQGRIVVFH-----------YSDGKLQTVAEKEVKGAVYSMVEF 876

Query: 337  AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVK-NLILVGDYARSIALLR 393
             G L+ ++    ++Y W  +    T     +    +A  V  K + ILVGD  RS+ LL 
Sbjct: 877  NGKLLASINSTVRLYEWTAEKELRTECNHYNN--IMALYVKTKGDFILVGDLMRSVLLLA 934

Query: 394  YQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGS 453
            Y+P       +ARD+ P                       W                   
Sbjct: 935  YKPMEGNFEEIARDFNPN----------------------WM------------------ 954

Query: 454  KHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFK 511
               +ILD+ + +G      +    LF+ Q ++  +    R  L +   FHLG+ VN F  
Sbjct: 955  SAVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEFGLFHLGEFVNVF-- 1007

Query: 512  IRCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHT 568
              C  S +    G  S   +    + +++G +G    L E  Y  LL +QN +       
Sbjct: 1008 --CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDVQNRLNKVIKSV 1065

Query: 569  GGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHN 622
            G +    +R++  +      P+ G IDG L+  FL +S  +  E+   +      G K  
Sbjct: 1066 GKIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQIDDGSGMKRE 1123

Query: 623  DILDELYDI 631
              +D+L  I
Sbjct: 1124 ATVDDLIKI 1132


>gi|328770638|gb|EGF80679.1| hypothetical protein BATDEDRAFT_11194 [Batrachochytrium dendrobatidis
            JAM81]
          Length = 1098

 Score = 62.4 bits (150), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 118/573 (20%), Positives = 215/573 (37%), Gaps = 112/573 (19%)

Query: 51   PKGALKLRFKKLKVLFVS-----------DRSKRANEQPGLPRGVRISQMRYFSNIAGYQ 99
            P+  L + F  L  L VS            +S +  ++  +    +   +R F +  G  
Sbjct: 582  PRSILLVEFDNLPYLLVSLGDGQLFNFRIGKSLKLADRKKITLATQPITLRTFQS-HGRT 640

Query: 100  GVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVL 159
             VF     P  +F+ S G+L    + +   +S ++PF N +   G L F +   L+I  +
Sbjct: 641  HVFAASDRPTVIFVKS-GQLLYSNVNVR-EISHVSPF-NSHMAEGALAFASDGALKIGTI 697

Query: 160  PTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVT--STAEPSTDYYKFNGEDKELVT 217
             T         ++ + L  TP  +AYH  + T+ ++T  S   P+ D    +        
Sbjct: 698  ETV----QKLHIKTIKLGETPRRIAYHDVSHTFGVLTVFSRNLPNGDLADISC------- 746

Query: 218  DPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRG 277
                            + L     +E +   +  L  +E    L  +    + TL     
Sbjct: 747  ----------------LRLLDGQGYEVLD--SIELQPFEIASSLITIRFTDDDTL----- 783

Query: 278  YIALGTNYNY-SEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
            Y  +GT + +  ED   RGRIL+F + ++            +++++  + +G   +   V
Sbjct: 784  YYTVGTGFAFPHEDEPVRGRILVFKVNDM----------RLLQLVHEYDIRGSAYSFVSV 833

Query: 337  AGFLVTAVGQKIYI--WQLKDNDLTGIAFIDTEVYIASMVSVK-NLILVGDYARSIALLR 393
             G LV  V   + +  W   D  L  +  ++    +A  ++V+ + ILV D  +SI LL+
Sbjct: 834  HGRLVAGVNSNVMVLRWN-SDTSLLELQSMNHGHVLALSLAVRGDFILVADLIKSITLLQ 892

Query: 394  YQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGS 453
            +     +L  +A D      +S    A                                 
Sbjct: 893  FDLATDSLKELAYD-----ADSNWMTAA-------------------------------- 915

Query: 454  KHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIR 513
               +++D+ + +G   +D   N+     Q +        RL  K  FH G+ +N F K  
Sbjct: 916  ---ELIDDDTFLG---ADSSMNIFALSKQGDQVSEEERQRLRPKGWFHTGELINRFRKGS 969

Query: 514  CKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLL-MLQNVMVTHTSHTGGLN 572
                +  +     +     Y ++ GA+G    +P     ++L  LQ  + +     GGL 
Sbjct: 970  LTLHATDETLALPAIPEILYCTVHGAIGVVARIPSDETAKILSTLQEALKSVVQGVGGLI 1029

Query: 573  PRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQL 605
               +R Y+ +       S GIIDG L+  FL+L
Sbjct: 1030 HSDWRRYRTE--RRSIKSAGIIDGDLIESFLEL 1060


>gi|91087281|ref|XP_975549.1| PREDICTED: similar to conserved hypothetical protein [Tribolium
            castaneum]
 gi|270010588|gb|EFA07036.1| hypothetical protein TcasGA2_TC010010 [Tribolium castaneum]
          Length = 1149

 Score = 62.4 bits (150), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 80/360 (22%), Positives = 140/360 (38%), Gaps = 68/360 (18%)

Query: 278  YIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA 337
            YI      N  E    +GRIL+F               NK+  +  KE KG   ++    
Sbjct: 838  YIVGTATVNPEESEPKQGRILIFQ-----------WNDNKLTQVSEKEIKGACYSLAEFN 886

Query: 338  GFLVTAVGQ--KIYIWQL-KDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
            G L+ ++    +++ W + K+  L    F    +    + +  + IL+GD  RS+ LL+Y
Sbjct: 887  GKLLASINSTVRLFEWTVEKELRLECSHF--NNILTLFLKTKGDFILLGDLMRSMTLLQY 944

Query: 395  QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
            +    +   +ARDY P    +           I+D  +   FL       + +C+K  + 
Sbjct: 945  KTMEGSFEEIARDYNPNWMTAVE---------ILDDDI---FLGAENSFNIFVCQKDSAA 992

Query: 455  HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC 514
              D  +E S M                          H + +   FH+G  +N F     
Sbjct: 993  TTD--EERSQM--------------------------HEVGR---FHVGDMINVFRHGSL 1021

Query: 515  KPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPR 574
               ++ +     +  +  + ++ GA+G    + +  Y  LL LQN + T     G ++  
Sbjct: 1022 VMQNLGETSTPTTGCV-LFGTVSGAIGLVTQITQDFYDFLLELQNKLSTVIKSVGKIDHS 1080

Query: 575  AFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLS------LGERLEICKKIGSKHNDILDEL 628
             +R +         PS G IDG L+  FL LS      + + L+I  + G K +  +D+L
Sbjct: 1081 QWRAFNTD--IKTEPSEGFIDGDLIESFLDLSHDKMKEVADGLQITGEGGMKQDCTVDDL 1138


>gi|452824086|gb|EME31091.1| DNA damage-binding protein 1 isoform 2 [Galdieria sulphuraria]
          Length = 1150

 Score = 62.4 bits (150), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 99/474 (20%), Positives = 185/474 (39%), Gaps = 100/474 (21%)

Query: 171  VRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRD-SRFIPPLV 229
            +R +PL   P  +A HL+T     V +T              K++VT   D +  +    
Sbjct: 732  IRTIPLGEQPRRIA-HLDTHHVFAVLTT--------------KQVVTISEDGNEALSETT 776

Query: 230  SQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSE 289
             + +V L      E +   ++ L ++E    +  V+   +      + Y  +GT Y+Y++
Sbjct: 777  EEGYVRLIDDTMMEIVH--SYKLEQFETPCSVITVNFGDDAAAKDNQDYFVVGTAYSYAD 834

Query: 290  DVT-CRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQ-- 346
            +    RGR+L+F + E            ++ ++  +  KG + ++    G ++ +V    
Sbjct: 835  EPEPSRGRMLVFAVRE-----------QRLTLVAERTFKGALYSMDAFNGKILASVNSML 883

Query: 347  KIYIWQLKDN---DLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSL 403
            K+  W   ++    LT        ++I  +  + + IL+GD  RS++LL Y+P   T+  
Sbjct: 884  KLVRWSETESGARTLTEECTYHGSIFILQIKCLGDFILIGDLVRSVSLLAYKPMNGTIED 943

Query: 404  VARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFS 463
            VARD  P+                      W    +++ E L+            LD + 
Sbjct: 944  VARDIDPS----------------------W----ITVIEMLD------------LDYYI 965

Query: 464  SMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF----FKIRCKPSSI 519
            S     ++   N+       +A       RL K  ++HLG+ VN        ++   S I
Sbjct: 966  S-----AENCFNLFTLKRNSDASTEEERSRLEKVGEYHLGELVNRIRHGRLVLQIPESGI 1020

Query: 520  SD------------APGARSRFLTWY----ASLDGALGFFLPLPEKNYRRLLMLQNVMVT 563
            S                 +  F+  Y     + +GALG    + EK ++ L  LQ  +  
Sbjct: 1021 SILKSLLYGMYICFDDNLKELFMHKYRFNLGTANGALGVIASIDEKTFQFLHSLQTALNE 1080

Query: 564  HTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI 617
                 GG+    +R +  +       S+  +DG L+ +FL LS  +   + KK+
Sbjct: 1081 VIKGVGGIQHEDWRRFTSERRIG--DSKNFLDGDLIERFLDLSRDKMELVAKKV 1132


>gi|357623954|gb|EHJ74904.1| putative DNA repair protein xp-e [Danaus plexippus]
          Length = 1128

 Score = 62.4 bits (150), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 104/448 (23%), Positives = 173/448 (38%), Gaps = 77/448 (17%)

Query: 171  VRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVS 230
            +R VPL  TP  +AY   ++T+ ++T       D  ++ G    LV     +       +
Sbjct: 707  IRTVPLGETPRRIAYQEASQTFGVITM----RVDKVEWTGGCGSLVRPSASTAAASASAA 762

Query: 231  QFHVSLF-SPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLR------GYIALGT 283
                    +P   E         H +E +   + ++ E+  +L   +       Y A+GT
Sbjct: 763  APPSKHAPAPLDLELHNLLILDHHTFEVLHAHQLLANEFAMSLVSCKLADDPNHYYAVGT 822

Query: 284  N-YNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVT 342
               N  E    +GRILLF   E            K+  +  KE KG    +    G L+ 
Sbjct: 823  AILNPEESEPKQGRILLFHWCE-----------GKLTQVAEKEIKGGCYTLVEFNGKLLA 871

Query: 343  AVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVK-NLILVGDYARSIALLRYQPEYRTL 401
            ++   + +++        +        +A  + VK + ILVGD  RS++LL+Y+    + 
Sbjct: 872  SINSTVRLFEWTSEKELRLECSHFNNIVALYLKVKGDFILVGDLMRSMSLLQYKQMEGSF 931

Query: 402  SLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDE 461
              +ARDY P    +           I+D      FL       L +C+K  +   D  +E
Sbjct: 932  EEIARDYSPNWMTAV---------EILDDD---TFLGAENSFNLFVCQKDSAATTD--EE 977

Query: 462  FSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISD 521
               MG+M                               FH+G  VN   +     + ++D
Sbjct: 978  RQQMGYM-----------------------------GQFHVGDMVNVMRR-GALVAQLAD 1007

Query: 522  --APGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAF-RT 578
              AP AR   L   A++ GA+   + L ++ +  L  L+  + THT  + G  P +F R+
Sbjct: 1008 TAAPVARPVLL---ATVSGAICLVVQLSQELFDFLHQLEERL-THTIKSVGKIPHSFWRS 1063

Query: 579  YKGKGYYAGNPSRGIIDGSLVWKFLQLS 606
            +         P+ G IDG L+  FL LS
Sbjct: 1064 FNTD--IKTEPAEGFIDGDLIESFLDLS 1089


>gi|74208347|dbj|BAE26370.1| unnamed protein product [Mus musculus]
          Length = 599

 Score = 62.4 bits (150), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 81/368 (22%), Positives = 139/368 (37%), Gaps = 77/368 (20%)

Query: 278 YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
           Y  +GT   Y E+   + GRI +F             +  K++ +  KE KG V ++   
Sbjct: 287 YFIVGTAMVYPEEAEPKQGRIAVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 335

Query: 337 AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
            G L+ ++    ++Y W  +    T     +  +    + +  + ILVGD  RS+ LL Y
Sbjct: 336 NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 394

Query: 395 QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
           +P       +ARD+ P                       W                    
Sbjct: 395 KPMEGNFEEIARDFNPN----------------------WM------------------S 414

Query: 455 HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
             +ILD+ + +G      +    LF+ Q ++  +    R  L +   FHLG+ VN F   
Sbjct: 415 AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 466

Query: 513 RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
            C  S +    G  S   +    + +++G +G    L E  Y  LL +QN +       G
Sbjct: 467 -CHGSLVMQNLGEASTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 525

Query: 570 GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
            +    +R++  +      P+ G IDG L+  FL +S  +  E+   +      G K   
Sbjct: 526 KIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREA 583

Query: 624 ILDELYDI 631
             D+L  +
Sbjct: 584 TADDLIKV 591


>gi|197097564|ref|NP_001126613.1| DNA damage-binding protein 1 [Pongo abelii]
 gi|75041202|sp|Q5R649.1|DDB1_PONAB RecName: Full=DNA damage-binding protein 1; AltName:
            Full=Damage-specific DNA-binding protein 1
 gi|55732122|emb|CAH92767.1| hypothetical protein [Pongo abelii]
          Length = 1140

 Score = 62.4 bits (150), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 81/368 (22%), Positives = 139/368 (37%), Gaps = 77/368 (20%)

Query: 278  YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
            Y  +GT   Y E+   + GRI++F             +  K++ +  KE KG V  +   
Sbjct: 828  YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYPMVEF 876

Query: 337  AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
             G L+ ++    ++Y W  +    T     +  +    + +  + ILVGD  RS+ LL Y
Sbjct: 877  NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 935

Query: 395  QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
            +P       +ARD+ P                       W                    
Sbjct: 936  KPMEGNFEEIARDFNPN----------------------WM------------------S 955

Query: 455  HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
              +ILD+ + +G      +    LF+ Q ++  +    R  L +   FHLG+ VN F   
Sbjct: 956  AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 1007

Query: 513  RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
             C  S +    G  S   +    + +++G +G    L E  Y  LL +QN +       G
Sbjct: 1008 -CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066

Query: 570  GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
             +    +R++  +      P+ G IDG L+  FL +S  +  E+   +      G K   
Sbjct: 1067 KIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREA 1124

Query: 624  ILDELYDI 631
              D+L  +
Sbjct: 1125 TADDLIKV 1132


>gi|449710759|gb|EMD49776.1| cleavage and polyadenylation specificity factor subunit, putative
           [Entamoeba histolytica KU27]
          Length = 836

 Score = 62.0 bits (149), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 83/371 (22%), Positives = 150/371 (40%), Gaps = 86/371 (23%)

Query: 275 LRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYA-KEQKGPVTAI 333
           L+ Y+ +G N   +ED   +G+  +F+I            +N+I++I    + K  V A+
Sbjct: 521 LKNYLVVGVNKQTTEDNPVKGKTYIFNI------------ENQIQLINKIGDGKKSVHAV 568

Query: 334 CHVAGFLVTAVGQKI-YIWQLKDNDLTGIAFIDTEVYIASM----VSVKN--------LI 380
             + GFL  A G ++  I ++ +       F D  + I S+    + V          LI
Sbjct: 569 NEIGGFLAVASGNELELIERVDETRWIKKCFSDISILINSIEYLPLKVMERGNEKECYLI 628

Query: 381 LVGDYARSIALLRYQP-EYRTLSLVARDYKPTQPNSKGYYAGNPSRGI--IDGSLVWKFL 437
           L+ D+ RS+ LL ++P +Y  + L                 G  +R I  ID + +    
Sbjct: 629 LLSDFYRSVVLLLFKPYDYTVIPL-----------------GKDARNIHCIDSTFI---- 667

Query: 438 QLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKK 497
                    I K          D FS + F   D ++N+ L  Y   A E      +   
Sbjct: 668 ---------ITK----------DYFSVLEF---DSEQNLSLLNYSSAATEQLSIFEI--D 703

Query: 498 TDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLML 557
             F+LG ++  F       + + +  G    ++  Y +++G++G+   + EK Y+ L  +
Sbjct: 704 ATFNLGMNLLKF-------TRLWNGKG----YIYMYVTVEGSVGYISVVEEKIYQVLRQI 752

Query: 558 QNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI 617
              M     H  G N   +R  KG G   G      +DG ++ +F  L+  ++  +C + 
Sbjct: 753 NIKMNREPWHFAGTNAEEYRFEKGYGMGFGTRKHVFLDGDMLKQFRLLNEEQQKRVCLR- 811

Query: 618 GSKHNDILDEL 628
            +  ND+   L
Sbjct: 812 NTSINDVFKLL 822


>gi|147906138|ref|NP_001083624.1| DNA damage-binding protein 1 [Xenopus laevis]
 gi|82186503|sp|Q6P6Z0.1|DDB1_XENLA RecName: Full=DNA damage-binding protein 1; AltName:
            Full=Damage-specific DNA-binding protein 1
 gi|38303806|gb|AAH61946.1| Ddb1 protein [Xenopus laevis]
          Length = 1140

 Score = 62.0 bits (149), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 80/368 (21%), Positives = 140/368 (38%), Gaps = 77/368 (20%)

Query: 278  YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
            Y  +GT   Y ++   + GRI++F                K++ +  KE KG V ++   
Sbjct: 828  YFVVGTAMVYPDEAEPKQGRIVVFQ-----------YNDGKLQTVAEKEVKGAVYSMVEF 876

Query: 337  AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
             G L+ ++    ++Y W  +    T     +  +    + +  + ILVGD  RS+ LL Y
Sbjct: 877  NGKLLASINSTVRLYEWTAEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 935

Query: 395  QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
            +P       +ARD+ P                       W                    
Sbjct: 936  KPMEGNFEEIARDFNPN----------------------WM------------------S 955

Query: 455  HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
              +ILD+ + +G      +    LF+ Q ++  +    R  L +   FHLG+ VN F   
Sbjct: 956  AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 1007

Query: 513  RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
             C  S +    G  S   +    + +++G +G    L E  Y  LL +QN +       G
Sbjct: 1008 -CHGSLVMQNLGETSPPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDVQNRLNKVIKSVG 1066

Query: 570  GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
             +    +R++  +      P+ G IDG L+  FL +S  +  E+   +      G K   
Sbjct: 1067 KIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVIANLQIDDGSGMKRET 1124

Query: 624  ILDELYDI 631
             +D+L  +
Sbjct: 1125 TVDDLIKV 1132


>gi|45383688|ref|NP_989547.1| DNA damage-binding protein 1 [Gallus gallus]
 gi|82098863|sp|Q805F9.1|DDB1_CHICK RecName: Full=DNA damage-binding protein 1; AltName: Full=DDB p127
            subunit; AltName: Full=Damage-specific DNA-binding
            protein 1; AltName: Full=UV-damaged DNA-binding factor
 gi|28375613|dbj|BAC56999.1| damaged-DNA binding protein DDB p127 subunit [Gallus gallus]
 gi|53130071|emb|CAG31438.1| hypothetical protein RCJMB04_6h2 [Gallus gallus]
          Length = 1140

 Score = 62.0 bits (149), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 125/607 (20%), Positives = 227/607 (37%), Gaps = 111/607 (18%)

Query: 53   GALKLRFKKLKVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLF 112
            GAL      L+   +SDR K       +  G + + +R F +++    VF C   P  ++
Sbjct: 609  GALFYFGLSLETGLLSDRKK-------VTLGTQPTVLRTFRSLS-TTNVFACSDRPTVIY 660

Query: 113  LTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVR 172
             +S  +L    + +   V+ + P ++   P      N  S L I  +           +R
Sbjct: 661  -SSNHKLVFSNVNLK-EVNYMCPLNSDGYPDSLALAN-NSTLTIGTIDEI----QKLHIR 713

Query: 173  KVPLKCTPHFLAYHLETKTYCIVTS-----TAEPSTDYYKFNGEDKELVTDPRDSRFIPP 227
             VPL  +P  + Y   ++ + +++S      A   T   + +   + L +    S+    
Sbjct: 714  TVPLYESPRKICYQEVSQCFGVLSSRIEVQDASGGTTALRPSASTQALSSSVSTSKLFSS 773

Query: 228  LVSQFHVSLFSPFSWEEIPQTNFPL---HEWEHVLCLKNVSMEYEGTLSGLR------GY 278
              +    S       EE+   N  +   H +E +   + +  EY  +L   +       Y
Sbjct: 774  STAPHETSF-----GEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTY 828

Query: 279  IALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA 337
              +GT   Y E+   + GRI++F             +  K++ +  KE KG V ++    
Sbjct: 829  FIVGTAMVYPEEAEPKQGRIVVFH-----------YSDGKLQSLAEKEVKGAVYSMVEFN 877

Query: 338  GFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQ 395
            G L+ ++    ++Y W  +    T     +  +    + +  + ILVGD  RS+ LL Y+
Sbjct: 878  GKLLASINSTVRLYEWTAEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAYK 936

Query: 396  PEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKH 455
            P       +ARD+ P                       W                     
Sbjct: 937  PMEGNFEEIARDFNPN----------------------WM------------------SA 956

Query: 456  NDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIR 513
             +ILD+ + +G      +    LF+ Q ++  +    R  L +    HLG+ VN F    
Sbjct: 957  VEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLSHLGEFVNVF---- 1007

Query: 514  CKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGG 570
            C  S +    G  S   +    + +++G +G    L E  Y  LL +QN +       G 
Sbjct: 1008 CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGK 1067

Query: 571  LNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHNDI 624
            +    +R++  +      P+ G IDG L+  FL +S  +  E+   +      G K    
Sbjct: 1068 IEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQIDDGSGMKREAT 1125

Query: 625  LDELYDI 631
            +D+L  I
Sbjct: 1126 VDDLIKI 1132


>gi|407035910|gb|EKE37921.1| CPSF A subunit region protein, putative [Entamoeba nuttalli P19]
          Length = 836

 Score = 62.0 bits (149), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 83/371 (22%), Positives = 150/371 (40%), Gaps = 86/371 (23%)

Query: 275 LRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYA-KEQKGPVTAI 333
           L+ Y+ +G N   +ED   +G+  +F+I            +N+I++I    + K  V A+
Sbjct: 521 LKNYLVVGVNKQTTEDNPVKGKTYIFNI------------ENQIQLINKIGDGKKSVHAV 568

Query: 334 CHVAGFLVTAVGQKI-YIWQLKDNDLTGIAFIDTEVYIASM----VSVKN--------LI 380
             + GFL  A G ++  I ++ +       F D  + I S+    + V          LI
Sbjct: 569 NEIGGFLAVASGNELELIERVDETRWIKKCFSDISILINSIEYLPLKVMERGNEKECYLI 628

Query: 381 LVGDYARSIALLRYQP-EYRTLSLVARDYKPTQPNSKGYYAGNPSRGI--IDGSLVWKFL 437
           L+ D+ RS+ LL ++P +Y  + L                 G  +R I  ID + +    
Sbjct: 629 LLSDFYRSVVLLLFKPYDYTVIPL-----------------GKDARNIHCIDSTFI---- 667

Query: 438 QLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKK 497
                    I K          D FS + F   D ++N+ L  Y   A E      +   
Sbjct: 668 ---------ITK----------DYFSVLEF---DSEQNLSLLNYSSAATEQLSIFEI--D 703

Query: 498 TDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLML 557
             F+LG ++  F       + + +  G    ++  Y +++G++G+   + EK Y+ L  +
Sbjct: 704 ATFNLGMNLLKF-------TRLWNGKG----YIYMYVTVEGSVGYISVVEEKIYQVLRQI 752

Query: 558 QNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI 617
              M     H  G N   +R  KG G   G      +DG ++ +F  L+  ++  +C + 
Sbjct: 753 NIKMNREPWHFAGTNAEEYRFEKGYGMGFGTRKHVFLDGDMLKQFRLLNEEQQKRVCLR- 811

Query: 618 GSKHNDILDEL 628
            +  ND+   L
Sbjct: 812 NTSINDVFKLL 822


>gi|224050582|ref|XP_002191856.1| PREDICTED: DNA damage-binding protein 1 [Taeniopygia guttata]
          Length = 1140

 Score = 62.0 bits (149), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 125/607 (20%), Positives = 227/607 (37%), Gaps = 111/607 (18%)

Query: 53   GALKLRFKKLKVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLF 112
            GAL      L+   +SDR K       +  G + + +R F +++    VF C   P  ++
Sbjct: 609  GALFYFGLSLETGLLSDRKK-------VTLGTQPTVLRTFRSLS-TTNVFACSDRPTVIY 660

Query: 113  LTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVR 172
             +S  +L    + +   V+ + P ++   P      N  S L I  +           +R
Sbjct: 661  -SSNHKLVFSNVNLK-EVNYMCPLNSDGYPDSLALAN-NSTLTIGTIDEI----QKLHIR 713

Query: 173  KVPLKCTPHFLAYHLETKTYCIVTS-----TAEPSTDYYKFNGEDKELVTDPRDSRFIPP 227
             VPL  +P  + Y   ++ + +++S      A   T   + +   + L +    S+    
Sbjct: 714  TVPLYESPRKICYQEVSQCFGVLSSRIEVQDASGGTTALRPSASTQALSSSVSTSKLFSS 773

Query: 228  LVSQFHVSLFSPFSWEEIPQTNFPL---HEWEHVLCLKNVSMEYEGTLSGLR------GY 278
              +    S       EE+   N  +   H +E +   + +  EY  +L   +       Y
Sbjct: 774  STAPHETSF-----GEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTY 828

Query: 279  IALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA 337
              +GT   Y E+   + GRI++F             +  K++ +  KE KG V ++    
Sbjct: 829  FIVGTAMVYPEEAEPKQGRIVVFH-----------YSDGKLQSLAEKEVKGAVYSMVEFN 877

Query: 338  GFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQ 395
            G L+ ++    ++Y W  +    T     +  +    + +  + ILVGD  RS+ LL Y+
Sbjct: 878  GKLLASINSTVRLYEWTAEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAYK 936

Query: 396  PEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKH 455
            P       +ARD+ P                       W                     
Sbjct: 937  PMEGNFEEIARDFNPN----------------------WM------------------SA 956

Query: 456  NDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIR 513
             +ILD+ + +G      +    LF+ Q ++  +    R  L +    HLG+ VN F    
Sbjct: 957  VEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLSHLGEFVNVF---- 1007

Query: 514  CKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGG 570
            C  S +    G  S   +    + +++G +G    L E  Y  LL +QN +       G 
Sbjct: 1008 CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGK 1067

Query: 571  LNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHNDI 624
            +    +R++  +      P+ G IDG L+  FL +S  +  E+   +      G K    
Sbjct: 1068 IEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQIDDGSGMKREAT 1125

Query: 625  LDELYDI 631
            +D+L  I
Sbjct: 1126 VDDLIKI 1132


>gi|431910407|gb|ELK13480.1| DNA damage-binding protein 1 [Pteropus alecto]
          Length = 1143

 Score = 62.0 bits (149), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 84/374 (22%), Positives = 142/374 (37%), Gaps = 86/374 (22%)

Query: 278  YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
            Y  +GT   Y E+   + GRI++F             +  K++ +  KE KG V ++   
Sbjct: 828  YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 876

Query: 337  AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
             G L+ ++    ++Y W  +    T     +  +    + +  + ILVGD  RS+ LL Y
Sbjct: 877  NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 935

Query: 395  QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
            +P       +ARD+ P                       W                    
Sbjct: 936  KPMEGNFEEIARDFNPN----------------------WM------------------S 955

Query: 455  HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
              +ILD+ + +G      +    LF+ Q ++  +    R  L +   FHLG+ VN F   
Sbjct: 956  AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 1007

Query: 513  RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVM------VT 563
             C  S +    G  S   +    + +++G +G    L E  Y  LL +QN +      V 
Sbjct: 1008 -CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066

Query: 564  HTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------ 617
               H+   + R+F T +        P+ G IDG L+  FL +S  +  E+   +      
Sbjct: 1067 KIEHSLYPSQRSFHTERKT-----EPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGS 1121

Query: 618  GSKHNDILDELYDI 631
            G K     D+L  +
Sbjct: 1122 GMKREATADDLIKV 1135


>gi|389629928|ref|XP_003712617.1| hypothetical protein MGG_16867 [Magnaporthe oryzae 70-15]
 gi|351644949|gb|EHA52810.1| hypothetical protein MGG_16867 [Magnaporthe oryzae 70-15]
 gi|440464739|gb|ELQ34110.1| DNA damage-binding protein 1a [Magnaporthe oryzae Y34]
          Length = 1183

 Score = 62.0 bits (149), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 74/351 (21%), Positives = 137/351 (39%), Gaps = 42/351 (11%)

Query: 281  LGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAG-F 339
            +GT Y         GR+L+F     V E   P       +I+A   K     I  +    
Sbjct: 857  VGTRYLSGTGSGHGGRVLVFG----VDESRSPY------LIHAHSTKSGCRRIATMDDDL 906

Query: 340  LVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVK-----NLILVGDYARSIALLRY 394
            LV A+ + + + +  +   T   F+    +  S  +V       LI V D  +SI LL Y
Sbjct: 907  LVIALTKTVVLVRYSETSTTSAKFLKVAAFQTSSYAVDVTVHGKLIAVADIMKSITLLEY 966

Query: 395  QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
             P     +      K T+ + +           ++GS   K +        E+C+   + 
Sbjct: 967  IPGVGKSAKTGGKDKATRSDKE-----------VEGSKQAKLV--------EVCRDYQAM 1007

Query: 455  HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC 514
             +  +       ++++D D N+V+ +            R+   ++F LG+ VN   K+  
Sbjct: 1008 WSTAVSHLEGDSWIVADGDGNLVVLLRNTAGVTLEDKRRMQMTSEFGLGECVNKIQKVMV 1067

Query: 515  KPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSH-TGGLNP 573
            + S  ++AP     FL+   + +G++  F  +  K    L+  Q  M  H S   G L  
Sbjct: 1068 ETS--ANAPIVAKAFLS---TTEGSIYLFGTVAPKFQSLLMDFQANMEAHVSSPLGELQF 1122

Query: 574  RAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 624
              +R+++        P R  +DG  +  FL +    +++IC+ +     D+
Sbjct: 1123 NQWRSFRNPEREGAGPER-FLDGEFLEMFLDMEENTQIDICQGLSYTAEDM 1172


>gi|326919947|ref|XP_003206238.1| PREDICTED: DNA damage-binding protein 1-like [Meleagris gallopavo]
          Length = 1079

 Score = 62.0 bits (149), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 125/607 (20%), Positives = 227/607 (37%), Gaps = 111/607 (18%)

Query: 53   GALKLRFKKLKVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLF 112
            GAL      L+   +SDR K       +  G + + +R F +++    VF C   P  ++
Sbjct: 548  GALFYFGLSLETGLLSDRKK-------VTLGTQPTVLRTFRSLS-TTNVFACSDRPTVIY 599

Query: 113  LTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVR 172
             +S  +L    + +   V+ + P ++   P      N  S L I  +           +R
Sbjct: 600  -SSNHKLVFSNVNLK-EVNYMCPLNSDGYPDSLALAN-NSTLTIGTIDEI----QKLHIR 652

Query: 173  KVPLKCTPHFLAYHLETKTYCIVTS-----TAEPSTDYYKFNGEDKELVTDPRDSRFIPP 227
             VPL  +P  + Y   ++ + +++S      A   T   + +   + L +    S+    
Sbjct: 653  TVPLYESPRKICYQEVSQCFGVLSSRIEVQDASGGTTALRPSASTQALSSSVSTSKLFSS 712

Query: 228  LVSQFHVSLFSPFSWEEIPQTNFPL---HEWEHVLCLKNVSMEYEGTLSGLR------GY 278
              +    S       EE+   N  +   H +E +   + +  EY  +L   +       Y
Sbjct: 713  STAPHETSF-----GEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTY 767

Query: 279  IALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA 337
              +GT   Y E+   + GRI++F             +  K++ +  KE KG V ++    
Sbjct: 768  FIVGTAMVYPEEAEPKQGRIVVFH-----------YSDGKLQSLAEKEVKGAVYSMVEFN 816

Query: 338  GFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQ 395
            G L+ ++    ++Y W  +    T     +  +    + +  + ILVGD  RS+ LL Y+
Sbjct: 817  GKLLASINSTVRLYEWTAEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAYK 875

Query: 396  PEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKH 455
            P       +ARD+ P                       W                     
Sbjct: 876  PMEGNFEEIARDFNPN----------------------WM------------------SA 895

Query: 456  NDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIR 513
             +ILD+ + +G      +    LF+ Q ++  +    R  L +    HLG+ VN F    
Sbjct: 896  VEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLSHLGEFVNVF---- 946

Query: 514  CKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGG 570
            C  S +    G  S   +    + +++G +G    L E  Y  LL +QN +       G 
Sbjct: 947  CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGK 1006

Query: 571  LNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHNDI 624
            +    +R++  +      P+ G IDG L+  FL +S  +  E+   +      G K    
Sbjct: 1007 IEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQIDDGSGMKREAT 1064

Query: 625  LDELYDI 631
            +D+L  I
Sbjct: 1065 VDDLIKI 1071


>gi|440893607|gb|ELR46310.1| DNA damage-binding protein 1 [Bos grunniens mutus]
          Length = 1143

 Score = 62.0 bits (149), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 84/374 (22%), Positives = 142/374 (37%), Gaps = 86/374 (22%)

Query: 278  YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
            Y  +GT   Y E+   + GRI++F             +  K++ +  KE KG V ++   
Sbjct: 828  YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 876

Query: 337  AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
             G L+ ++    ++Y W  +    T     +  +    + +  + ILVGD  RS+ LL Y
Sbjct: 877  NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 935

Query: 395  QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
            +P       +ARD+ P                       W                    
Sbjct: 936  KPMEGNFEEIARDFNPN----------------------WM------------------S 955

Query: 455  HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
              +ILD+ + +G      +    LF+ Q ++  +    R  L +   FHLG+ VN F   
Sbjct: 956  AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 1007

Query: 513  RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVM------VT 563
             C  S +    G  S   +    + +++G +G    L E  Y  LL +QN +      V 
Sbjct: 1008 -CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066

Query: 564  HTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------ 617
               H+   + R+F T +        P+ G IDG L+  FL +S  +  E+   +      
Sbjct: 1067 KIEHSLYPSQRSFHTERKT-----EPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGS 1121

Query: 618  GSKHNDILDELYDI 631
            G K     D+L  +
Sbjct: 1122 GMKREATADDLIKV 1135


>gi|440302955|gb|ELP95261.1| hypothetical protein EIN_430670 [Entamoeba invadens IP1]
          Length = 1175

 Score = 62.0 bits (149), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 80/367 (21%), Positives = 144/367 (39%), Gaps = 77/367 (20%)

Query: 276  RGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAK-EQKGPVTAIC 334
            R  +  G N   +ED   +G + LF +        +  ++  I+ I    + K  V AI 
Sbjct: 857  RVLVGCGVNTQTTEDDPVKGNVFLFSL--------ESTSEGTIRHISTVCDGKKAVHAIN 908

Query: 335  HVAGFLVTAVGQKIYIWQLKDNDL------TGIAFIDTEVYIASMVSVKN-------LIL 381
             + G+L  A G ++ I + K   L      + I+ +   +    M   KN       LIL
Sbjct: 909  SIGGYLAVAEGNELQILKGKTESLWVKKCFSDISILINTITFLPMTLSKNKVDEMCYLIL 968

Query: 382  VGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSL 441
            + D  RS+ LL +QP+ +++  + +D +                  ID + V   L    
Sbjct: 969  LNDMYRSVILLLFQPQKKSVIPLGKDGRDIHA--------------IDAAFV---LDKDY 1011

Query: 442  GERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFH 501
               LEI         D     S M ++ ++ ++  +  +    A   N G  +++ T   
Sbjct: 1012 FHVLEI---------DYERNLSVMNYLRTETERISIFEV----AATFNVGVDILRLTRLR 1058

Query: 502  LGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVM 561
            LG                       + ++  Y S  G++G+   + E++Y+ L  +   M
Sbjct: 1059 LG-----------------------NGYVFVYLSAQGSVGYLTVVNERSYQTLRQINAKM 1095

Query: 562  VTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKH 621
                 H  G NP  FR  KG G   G   + I+DG ++ +F  L+  ++  +C +  S  
Sbjct: 1096 NREPWHFAGTNPEEFRMEKGYGVGYGRRKQVILDGDILKEFHFLTQEQQKRVCLRNTSIS 1155

Query: 622  N--DILD 626
            +  +ILD
Sbjct: 1156 DVVNILD 1162


>gi|440487047|gb|ELQ66855.1| DNA damage-binding protein 1a [Magnaporthe oryzae P131]
          Length = 1213

 Score = 62.0 bits (149), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 74/351 (21%), Positives = 137/351 (39%), Gaps = 42/351 (11%)

Query: 281  LGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAG-F 339
            +GT Y         GR+L+F     V E   P       +I+A   K     I  +    
Sbjct: 887  VGTRYLSGTGSGHGGRVLVFG----VDESRSPY------LIHAHSTKSGCRRIATMDDDL 936

Query: 340  LVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVK-----NLILVGDYARSIALLRY 394
            LV A+ + + + +  +   T   F+    +  S  +V       LI V D  +SI LL Y
Sbjct: 937  LVIALTKTVVLVRYSETSTTSAKFLKVAAFQTSSYAVDVTVHGKLIAVADIMKSITLLEY 996

Query: 395  QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
             P     +      K T+ + +           ++GS   K +        E+C+   + 
Sbjct: 997  IPGVGKSAKTGGKDKATRSDKE-----------VEGSKQAKLV--------EVCRDYQAM 1037

Query: 455  HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC 514
             +  +       ++++D D N+V+ +            R+   ++F LG+ VN   K+  
Sbjct: 1038 WSTAVSHLEGDSWIVADGDGNLVVLLRNTAGVTLEDKRRMQMTSEFGLGECVNKIQKVMV 1097

Query: 515  KPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSH-TGGLNP 573
            + S  ++AP     FL+   + +G++  F  +  K    L+  Q  M  H S   G L  
Sbjct: 1098 ETS--ANAPIVAKAFLS---TTEGSIYLFGTVAPKFQSLLMDFQANMEAHVSSPLGELQF 1152

Query: 574  RAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 624
              +R+++        P R  +DG  +  FL +    +++IC+ +     D+
Sbjct: 1153 NQWRSFRNPEREGAGPER-FLDGEFLEMFLDMEENTQIDICQGLSYTAEDM 1202


>gi|391335522|ref|XP_003742140.1| PREDICTED: DNA damage-binding protein 1-like [Metaseiulus
            occidentalis]
          Length = 1154

 Score = 62.0 bits (149), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 112/489 (22%), Positives = 180/489 (36%), Gaps = 101/489 (20%)

Query: 171  VRKVPLKCTPHFLAYHLETKTYCIVTS-------------------TAEPSTDYYKFNGE 211
            +R VPL  +P  +AY  ET T+ ++ S                    A P   +  F+  
Sbjct: 713  IRTVPLGESPRRIAYQEETGTFGVIVSRSDMACSTRCASLDAPNKSNASPYAWHKDFSSF 772

Query: 212  DKELVTDPRDSRFIPPLVSQFHVSLFSP-------FSWEEIPQTNFP-LHEWEHVLCLKN 263
                  D  DS  IP   S    SL  P       FS   I Q  F  LH  +       
Sbjct: 773  GHTQCADRVDSG-IPSCSS---TSLQRPPSGCDETFSLLIIDQNTFEVLHAMQFCPNEYG 828

Query: 264  VSMEYEGTLSGLRGYIALGTNY-NYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIY 322
            VS+      S    Y  +GT + N  E     GRI +                 K++ I 
Sbjct: 829  VSICSAKLGSDPNPYYIVGTAFINQEESEPKVGRIFVL-----------RWHDGKLETIA 877

Query: 323  AKEQKGPVTAICHVAGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLI 380
             KE  G   +I      L  A+    ++Y W   + DL         + I  +  + + I
Sbjct: 878  EKEAAGAPYSIREFHQKLAIAINSTVRLYSWN-AEKDLQSECTPFFNIVILHLKCLGDYI 936

Query: 381  LVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLS 440
            LVGD  RS+ LL Y  +  +L  + RDY+                        W      
Sbjct: 937  LVGDLMRSMTLLNYNADITSLEEIGRDYQTN----------------------W------ 968

Query: 441  LGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDF 500
                        +   +ILDE +   F+ ++ + N+ +    P A +    H + +   +
Sbjct: 969  ------------TTAVEILDEDT---FLAAESNLNLYVCKRDPSAADDTRQH-MHEVALY 1012

Query: 501  HLGQHVNTFFKIRCKPSSISDAPGARSR-FLTWYASLDGALGFFLPLPEKNYRRLLMLQN 559
            HLG+ VN   K     +   D P   ++ FL  Y SL GA+G  +P+ ++ Y  L  +Q 
Sbjct: 1013 HLGEMVNVIVKGSLVMAQPGDMPLPLNKSFL--YGSLHGAVGVIVPIKQELYAILNQIQT 1070

Query: 560  VMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQL------SLGERLEI 613
             +       G +    +RT+  +      P+ G IDG L+ + L L      S+ + +++
Sbjct: 1071 NLAKTIKSVGKIEHGFWRTFLAERKI--EPATGFIDGDLIEQLLDLPKEALESVSQSIKV 1128

Query: 614  CKKIGSKHN 622
             ++ G + N
Sbjct: 1129 DEEGGHQRN 1137


>gi|336369683|gb|EGN98024.1| hypothetical protein SERLA73DRAFT_109335 [Serpula lacrymans var.
            lacrymans S7.3]
 gi|336382464|gb|EGO23614.1| hypothetical protein SERLADRAFT_449959 [Serpula lacrymans var.
            lacrymans S7.9]
          Length = 1257

 Score = 61.6 bits (148), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 89/361 (24%), Positives = 149/361 (41%), Gaps = 74/361 (20%)

Query: 256  EHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVT-CRGRILLFDIIEVVPEPGQPLT 314
            E +  L  VS+  E ++     YI  GT Y Y ++V   +GR+L+FD      E G  L 
Sbjct: 922  EEITALGVVSVTLERSIGT---YICAGT-YKYVDEVEPSQGRLLVFD-----AEDGS-LL 971

Query: 315  KNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFI------DTEV 368
            + KI M  + E +G V A+  V G ++ A+   + +++ + +  T +  +      +   
Sbjct: 972  REKITMAVSLEVRGCVYAVGSVNGMIIAAINSSVVVYRPEIDASTQLLALHKITEWNHNY 1031

Query: 369  YIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGII 428
             + ++V   + ILVGD   SI+ LR       +  +ARDY                    
Sbjct: 1032 LVTNLVCRGDKILVGDAINSISFLRMVES--QIQCLARDY-------------------- 1069

Query: 429  DGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQ-PEARE 487
             GSL W            +C        ++LD+ S +G   ++ D N+  F  Q  E R+
Sbjct: 1070 -GSL-WP-----------VCV-------EMLDQSSIIG---ANSDYNLFTFALQETELRK 1106

Query: 488  SNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSIS-DAPGARSRFLTWYASLDGALGFFLPL 546
            S     L +   +++G  VN F         +S D P    +    + +  G +G  + +
Sbjct: 1107 S-----LERDGSYYIGDMVNKFIPGALTAHDVSVDMPLEPKQL---FFTSTGCIGVIVDM 1158

Query: 547  PEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGK--GYYAGNPSRGIIDGSLVWKFLQ 604
             ++    +  LQ  M T+ S T G+    FR  K       A   S G +DG  + KF+Q
Sbjct: 1159 GDELSLHMTALQRNMSTYLSQTKGVTHTKFRAPKNAYGRSDAEATSFGFLDGDFLEKFMQ 1218

Query: 605  L 605
             
Sbjct: 1219 F 1219


>gi|67463896|ref|XP_648489.1| cleavage and polyadenylation specificity factor subunit [Entamoeba
            histolytica HM-1:IMSS]
 gi|56464653|gb|EAL43100.1| cleavage and polyadenylation specificity factor subunit, putative
            [Entamoeba histolytica HM-1:IMSS]
          Length = 1150

 Score = 61.6 bits (148), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 83/371 (22%), Positives = 150/371 (40%), Gaps = 86/371 (23%)

Query: 275  LRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYA-KEQKGPVTAI 333
            L+ Y+ +G N   +ED   +G+  +F+I            +N+I++I    + K  V A+
Sbjct: 835  LKNYLVVGVNKQTTEDNPVKGKTYIFNI------------ENQIQLINKIGDGKKSVHAV 882

Query: 334  CHVAGFLVTAVGQKI-YIWQLKDNDLTGIAFIDTEVYIASM----VSVKN--------LI 380
              + GFL  A G ++  I ++ +       F D  + I S+    + V          LI
Sbjct: 883  NEIGGFLAVASGNELELIERVDETRWIKKCFSDISILINSIEYLPLKVMERGNEKECYLI 942

Query: 381  LVGDYARSIALLRYQP-EYRTLSLVARDYKPTQPNSKGYYAGNPSRGI--IDGSLVWKFL 437
            L+ D+ RS+ LL ++P +Y  + L                 G  +R I  ID + +    
Sbjct: 943  LLSDFYRSVVLLLFKPYDYTVIPL-----------------GKDARNIHCIDSTFI---- 981

Query: 438  QLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKK 497
                     I K          D FS + F   D ++N+ L  Y   A E      +   
Sbjct: 982  ---------ITK----------DYFSVLEF---DSEQNLSLLNYSSAATEQLSIFEI--D 1017

Query: 498  TDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLML 557
              F+LG ++  F       + + +  G    ++  Y +++G++G+   + EK Y+ L  +
Sbjct: 1018 ATFNLGMNLLKF-------TRLWNGKG----YIYMYVTVEGSVGYISVVEEKIYQVLRQI 1066

Query: 558  QNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI 617
               M     H  G N   +R  KG G   G      +DG ++ +F  L+  ++  +C + 
Sbjct: 1067 NIKMNREPWHFAGTNAEEYRFEKGYGMGFGTRKHVFLDGDMLKQFRLLNEEQQKRVCLR- 1125

Query: 618  GSKHNDILDEL 628
             +  ND+   L
Sbjct: 1126 NTSINDVFKLL 1136


>gi|281345356|gb|EFB20940.1| hypothetical protein PANDA_015888 [Ailuropoda melanoleuca]
          Length = 1124

 Score = 61.2 bits (147), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 86/374 (22%), Positives = 143/374 (38%), Gaps = 87/374 (23%)

Query: 278  YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
            Y  +GT   Y E+   + GRI++F             +  K++ +  KE KG V ++   
Sbjct: 810  YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 858

Query: 337  AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
             G L+ ++    ++Y W  +    T     +  +    + +  + ILVGD  RS+ LL Y
Sbjct: 859  NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 917

Query: 395  QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
            +P       +ARD+ P                       W                    
Sbjct: 918  KPMEGNFEEIARDFNPN----------------------WM------------------S 937

Query: 455  HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
              +ILD+ + +G      +    LF+ Q ++  +    R  L +   FHLG+ VN F   
Sbjct: 938  AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 989

Query: 513  RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVM------VT 563
             C  S +    G  S   +    + +++G +G    L E  Y  LL +QN +      +T
Sbjct: 990  -CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKNIT 1048

Query: 564  HTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------ 617
            H S T     R+F T +        P+ G IDG L+  FL +S  +  E+   +      
Sbjct: 1049 H-SLTHLSTWRSFHTERKT-----EPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGS 1102

Query: 618  GSKHNDILDELYDI 631
            G K     D+L  +
Sbjct: 1103 GMKREATADDLIKV 1116


>gi|392591958|gb|EIW81285.1| hypothetical protein CONPUDRAFT_56293 [Coniophora puteana RWD-64-598
            SS2]
          Length = 1245

 Score = 61.2 bits (147), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 97/419 (23%), Positives = 170/419 (40%), Gaps = 100/419 (23%)

Query: 223  RFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALG 282
            R   P +S+  V L++  + +++ Q      E    +   +V +  E     + G + + 
Sbjct: 875  RVGEPEISRGSVQLYNDTTLDKLGQVVLDHDEEPMAIKALSVRVAEEAKDCFVVGTVIID 934

Query: 283  TNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVT 342
            +  N S      GR+LL        EP     ++ + +  +++ KG V A+  V G +V 
Sbjct: 935  SLENES----SSGRLLLV-------EPDYSRGESFVAVSASEKVKGCVYAVAAVDGLVVA 983

Query: 343  AVGQKIYIWQLKDNDLT-GIAFI-----DTEVYIASMVSVKNLILVGDYARSIALLRYQP 396
            AV   + I+ ++ +D T  ++F+     +    +A++VS  NL+LVGD   S+ LL+Y  
Sbjct: 984  AVNSAVVIYSIEADDHTRALSFVKKVEWNHNYVVANLVSRGNLLLVGDAISSVTLLQY-- 1041

Query: 397  EYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHN 456
            E   L  VARDY P  P S                                         
Sbjct: 1042 ERGALQNVARDYSPLWPTSV---------------------------------------- 1061

Query: 457  DILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRC 514
            ++LDE + +G   +D D N+ +F  Q      +G  R  L +   ++ G  VN F     
Sbjct: 1062 EMLDERNVIG---ADNDCNLFMFTLQ------DGAERKVLERNGHYYFGDMVNKFI---- 1108

Query: 515  KPSSISDAPGARSRFLTWYASLD-------------GALGFFLPLPEKNYRRLLMLQNVM 561
                    PG   R L+ + + D             G++G  + + ++    +  LQ  +
Sbjct: 1109 --------PGEIYRALSSFEASDIEVEPKQLFFTTTGSIGVVIDMSDELSLHMSSLQRNL 1160

Query: 562  VTH-TSHTGGLNPRAFRTYK-GKGYY-AGNPSRGIIDGSLVWKFLQLSLGERLEICKKI 617
             T+  +  GG +   +R  K  +G   A N S G +DG L+ +FL    G+  E  +K+
Sbjct: 1161 STYFAAQPGGASHTKYRAPKNARGRSDADNSSFGFLDGDLLERFLL--FGDDEEAVRKV 1217


>gi|385865228|gb|AFI92852.1| DNA damage-binding protein 1 [Danio rerio]
          Length = 1140

 Score = 61.2 bits (147), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 119/584 (20%), Positives = 221/584 (37%), Gaps = 102/584 (17%)

Query: 75   NEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLA 134
            +E+  +  G + + +R F +++    VF C   P  ++ +S  +L    + +   V+ + 
Sbjct: 624  SERKKVTLGTQPTVLRTFRSLS-TSNVFACSDRPTVIY-SSNHKLVFSNVNLK-EVNYMC 680

Query: 135  PFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCI 194
            P ++   P      N  S L I  +           +R VPL  +P  + Y   ++ + +
Sbjct: 681  PLNSEGYPDSLALAN-NSTLTIGTIDEI----QKLHIRTVPLYESPKRICYQEVSQCFGV 735

Query: 195  VTSTAE-----PSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTN 249
            ++S  E      +T   + +   + L +    S+  P   S    S       EE+   +
Sbjct: 736  LSSRVEMQDASGTTAAVRPSASTQALSSSVSSSKLFPSSTSPHETSF-----GEEVEVHS 790

Query: 250  FPL---HEWEHVLCLKNVSMEYEGTLSGLR------GYIALGTNYNYSEDVTCR-GRILL 299
              +   H +E +   + +  EY  ++   +       Y  +GT   Y E+   + GRI++
Sbjct: 791  LLVVDQHTFEVLHAHQFLQNEYALSMVSCKLGRDPAVYFIVGTAMVYPEEAEPKQGRIIV 850

Query: 300  FDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQ--KIYIWQLKDND 357
            F             T  K++ +  KE KG V ++    G L+ ++    ++Y W  +   
Sbjct: 851  FH-----------YTDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRLYEWTAEKEL 899

Query: 358  LTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKG 417
             T     +  +    + +  + ILVGD  RS+ LL Y+P   +   +ARD+ P       
Sbjct: 900  RTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAYKPMEGSFEEIARDFNPN------ 952

Query: 418  YYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVV 477
                            W                      +ILD+ + +G      +    
Sbjct: 953  ----------------WM------------------SAVEILDDDNFLG-----AENAFN 973

Query: 478  LFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFF--KIRCKPSSISDAPGARSRFLTWY 533
            LF+ Q ++  +    R  L +   FHLG+ VN F    +  +    S  P   S     +
Sbjct: 974  LFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVFSHGSLVLQNLGESSTPTQGS---VLF 1030

Query: 534  ASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGI 593
             +++G +G    L E  Y  LL LQN +       G +    +R++  +       + G 
Sbjct: 1031 GTVNGMIGLVTSLSEGWYSLLLDLQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEQATGF 1088

Query: 594  IDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 631
            IDG L+  FL L   +  E+   +      G K    +DE+  I
Sbjct: 1089 IDGDLIESFLDLGRAKMQEVVSTLQIDDGSGMKREATVDEVIKI 1132


>gi|81868411|sp|Q9ESW0.1|DDB1_RAT RecName: Full=DNA damage-binding protein 1; AltName:
            Full=Damage-specific DNA-binding protein 1
 gi|9843869|emb|CAB89874.2| damage-specific DNA binding protein 1 [Rattus norvegicus]
          Length = 1140

 Score = 61.2 bits (147), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 81/368 (22%), Positives = 139/368 (37%), Gaps = 77/368 (20%)

Query: 278  YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
            Y  +GT   Y E+   + GRI++F             +  K++ +  KE KG V ++   
Sbjct: 828  YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSGGKLQTVAEKEVKGAVYSMVEF 876

Query: 337  AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
             G L+ ++    ++Y W  +    T     +  +    + +  + ILVGD  RS+ LL Y
Sbjct: 877  NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 935

Query: 395  QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
            +P       +ARD+ P                       W                    
Sbjct: 936  KPMEGNFEEIARDFNPN----------------------WM------------------S 955

Query: 455  HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
              +ILD+ + +G      +    LF+ Q ++  +    R  L +   FHLG+ VN F   
Sbjct: 956  AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 1007

Query: 513  RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
             C  S +    G  S   +      +++G +G    L E  Y  LL +QN +       G
Sbjct: 1008 -CHGSLVMQNLGETSTPTQGSVLLGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066

Query: 570  GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
             +    +R++  +      P+ G IDG L+  FL +S  +  E+   +      G K   
Sbjct: 1067 KIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREA 1124

Query: 624  ILDELYDI 631
              D+L  +
Sbjct: 1125 TADDLIKV 1132


>gi|194389106|dbj|BAG61570.1| unnamed protein product [Homo sapiens]
          Length = 1009

 Score = 61.2 bits (147), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 82/368 (22%), Positives = 143/368 (38%), Gaps = 77/368 (20%)

Query: 278  YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
            Y  +GT   Y E+   + GRI++F             +  K++ +  KE KG V ++   
Sbjct: 697  YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 745

Query: 337  AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
             G L+ ++    ++Y W  +    T     +  +    + +  + ILVGD  RS+ LL Y
Sbjct: 746  NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 804

Query: 395  QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
            +P       +ARD+ P                       W    +S  E L+    +G++
Sbjct: 805  KPMEGNFEEIARDFNPN----------------------W----MSAVEILDDDNFLGAE 838

Query: 455  HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
                 + F+S              F+ Q ++  +    R  L +   FHLG+ VN F   
Sbjct: 839  -----NAFNS--------------FVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 876

Query: 513  RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
             C  S +    G  S   +    + +++G +G    L E  Y  LL +QN +       G
Sbjct: 877  -CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 935

Query: 570  GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
             +    +R++  +      P+ G IDG L+  FL +S  +  E+   +      G K   
Sbjct: 936  KIEHSFWRSFHTE--RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREA 993

Query: 624  ILDELYDI 631
              D+L  +
Sbjct: 994  TADDLIKV 1001


>gi|224587439|gb|ACN58665.1| DNA damage-binding protein 1 [Salmo salar]
          Length = 444

 Score = 61.2 bits (147), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 103/483 (21%), Positives = 180/483 (37%), Gaps = 84/483 (17%)

Query: 171 VRKVPLKCTPHFLAYHLETKTYCIVTSTAE-----PSTDYYKFNGEDKELVTDPRDSRFI 225
           +R VPL  +P  + Y   ++ + +++S  E      +T   + +   + L +    S+  
Sbjct: 16  IRTVPLYESPRRICYQEVSQCFGVLSSRVEMQDASGTTAAVRPSASTQALSSSVSSSKLF 75

Query: 226 PPLVSQFHVSL---FSPFSWEEIPQTNFP-LHEWEHVLCLKNVSMEYEGTLSGLRGYIAL 281
           P   S    S        S   + Q  F  LH  + +     +SM        L  Y  +
Sbjct: 76  PSSTSPHETSFGEEVEVHSLLVVDQHTFEVLHAHQFLQSEYALSMVSCRLGRDLSVYFIV 135

Query: 282 GTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFL 340
           GT   Y E+   + GRI++F             T  K++ +  KE KG V ++    G L
Sbjct: 136 GTAMVYPEEAEPKQGRIIVFH-----------YTDGKLQTVAEKEVKGAVYSMMEFNGKL 184

Query: 341 VTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEY 398
           + ++    ++Y W  +    T     +  +    + +  + ILVGD  RS+ LL Y+P  
Sbjct: 185 LASINSTVRLYEWTAEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAYKPME 243

Query: 399 RTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 458
                +ARD+ P   ++                                         +I
Sbjct: 244 GNFEEIARDFNPNWMSAV----------------------------------------EI 263

Query: 459 LDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFF--KIRC 514
           LD+ + +G      +    LF+ Q ++  +    R  L +   FHLG+ VN F    +  
Sbjct: 264 LDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGVFHLGEFVNVFSHGSLVL 318

Query: 515 KPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPR 574
           +    S  P   S     + +++G +G    L E  Y  LL LQN +       G +   
Sbjct: 319 QNLGESSTPTQGS---VLFGTVNGMIGLVTSLSEGWYSLLLDLQNRLNKVIKSVGKIEHS 375

Query: 575 AFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDEL 628
            +R++  +       + G IDG L+  FL L   +  E+   +      G K    +DE+
Sbjct: 376 FWRSFHTE--RKTEQATGFIDGDLIESFLDLGRAKMQEVVSTLQIDDGSGMKREATVDEV 433

Query: 629 YDI 631
             I
Sbjct: 434 IKI 436


>gi|193644722|ref|XP_001942922.1| PREDICTED: DNA damage-binding protein 1-like [Acyrthosiphon pisum]
          Length = 1156

 Score = 60.8 bits (146), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 77/347 (22%), Positives = 128/347 (36%), Gaps = 57/347 (16%)

Query: 278  YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
            Y  LGT     ED   + GRIL+F   +         + +K+  I  KE KG    +   
Sbjct: 837  YYILGTAVVNPEDQDPKLGRILIFHWDD---------SSSKLTPITEKEVKGACYGMAEF 887

Query: 337  AGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVK-NLILVGDYARSIALLRYQ 395
             G L+ AV   + +++        +        +A  V  K + I+ GD  RS+ LL+Y+
Sbjct: 888  NGKLLAAVNCTVRLFEWTAEKELRLECSHFNNIVALFVKTKGDFIVCGDLMRSLTLLQYK 947

Query: 396  PEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKH 455
                +   +ARDY P                       W                  S  
Sbjct: 948  TMEGSFEEIARDYNPK----------------------W------------------STA 967

Query: 456  NDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCK 515
             +I+D+   +G   ++ DKN+ +             H+L +   FH G  +N F      
Sbjct: 968  IEIIDDDVFLG---AENDKNLFIIHKDSTLTSDEARHQLQEIGQFHCGDLINVFRHGSLV 1024

Query: 516  PSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRA 575
                +D   +    +  Y +  GALG    L  K +  L  L+  + T     G +N + 
Sbjct: 1025 MQHFTDTYVSVQGGI-LYGTCSGALGLVTQLTPKMFDFLSDLEKSLATVVKGVGKINHQF 1083

Query: 576  FRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHN 622
            +R+Y  +      PS   +DG L+  FL LS  E + +   +   ++
Sbjct: 1084 WRSYHTE--IRTEPSESFVDGDLIESFLDLSKREMIAVVDALQGAYD 1128


>gi|449283451|gb|EMC90093.1| DNA damage-binding protein 1 [Columba livia]
          Length = 1140

 Score = 60.8 bits (146), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 126/616 (20%), Positives = 229/616 (37%), Gaps = 118/616 (19%)

Query: 53   GALKLRFKKLKVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLF 112
            GAL      L+   +SDR K       +  G + + +R F +++    VF C   P  ++
Sbjct: 598  GALFYFGLSLETGLLSDRKK-------VTLGTQPTVLRTFRSLS-TTNVFACSDRPTVIY 649

Query: 113  LTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVR 172
             +S  +L    + +   V+ + P ++   P      N  S L I  +           +R
Sbjct: 650  -SSNHKLVFSNVNLK-EVNYMCPLNSDGYPDSLALAN-NSTLTIGTIDEI----QKLHIR 702

Query: 173  KVPLKCTPHFLAYHLETKTYCIVTS-----TAEPSTDYYKFNGEDKELVTDPRDSRFIPP 227
             VPL  +P  + Y   ++ + +++S      A   T   + +   + L +    S+    
Sbjct: 703  TVPLYESPRKICYQEVSQCFGVLSSRIEVQDASGGTTALRPSASTQALSSSVSTSKLFSS 762

Query: 228  LVSQFHVSLFSPFSWEEIPQTNFPL---HEWEHVLCLKNVSMEYEGTLSGLR------GY 278
              +    S       EE+   N  +   H +E +   + +  EY  +L   +       Y
Sbjct: 763  STAPHETSF-----GEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTY 817

Query: 279  IALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA 337
              +GT   Y E+   + GRI++F             +  K++ +  KE KG V ++    
Sbjct: 818  FIVGTAMVYPEEAEPKQGRIVVFH-----------YSDGKLQSLAEKEVKGAVYSMVEFN 866

Query: 338  GFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQ 395
            G L+ ++    ++Y W  +    T     +  +    + +  + ILVGD  RS+ LL Y+
Sbjct: 867  GKLLASINSTVRLYEWTAEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAYK 925

Query: 396  PEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKH 455
            P       +ARD+ P                       W                     
Sbjct: 926  PMEGNFEEIARDFNPN----------------------WM------------------SA 945

Query: 456  NDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIR 513
             +ILD+ + +G      +    LF+ Q ++  +    R  L +    HLG+ VN F    
Sbjct: 946  VEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLSHLGEFVNVF---- 996

Query: 514  CKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVM--------- 561
            C  S +    G  S   +    + +++G +G    L E  Y  LL +QN +         
Sbjct: 997  CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGK 1056

Query: 562  VTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI---- 617
            + H+ +   +  RA+ +          P+ G IDG L+  FL +S  +  E+   +    
Sbjct: 1057 IEHSLYPSLVQLRAWASQSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQIDD 1116

Query: 618  --GSKHNDILDELYDI 631
              G K    +D+L  I
Sbjct: 1117 GSGMKREATVDDLIKI 1132


>gi|119594342|gb|EAW73936.1| damage-specific DNA binding protein 1, 127kDa, isoform CRA_d [Homo
            sapiens]
          Length = 1146

 Score = 60.8 bits (146), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 84/377 (22%), Positives = 142/377 (37%), Gaps = 89/377 (23%)

Query: 278  YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
            Y  +GT   Y E+   + GRI++F             +  K++ +  KE KG V ++   
Sbjct: 828  YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 876

Query: 337  AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
             G L+ ++    ++Y W  +    T     +  +    + +  + ILVGD  RS+ LL Y
Sbjct: 877  NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 935

Query: 395  QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
            +P       +ARD+ P                       W                    
Sbjct: 936  KPMEGNFEEIARDFNPN----------------------WM------------------S 955

Query: 455  HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
              +ILD+ + +G      +    LF+ Q ++  +    R  L +   FHLG+ VN F   
Sbjct: 956  AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 1007

Query: 513  RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQN---------V 560
             C  S +    G  S   +    + +++G +G    L E  Y  LL +QN          
Sbjct: 1008 -CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKRCF 1066

Query: 561  MVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI--- 617
            +++  S T     R+F T +        P+ G IDG L+  FL +S  +  E+   +   
Sbjct: 1067 LISTCSLTHPSTWRSFHTERKT-----EPATGFIDGDLIESFLDISRPKMQEVVANLQYD 1121

Query: 618  ---GSKHNDILDELYDI 631
               G K     D+L  +
Sbjct: 1122 DGSGMKREATADDLIKV 1138


>gi|452979181|gb|EME78944.1| hypothetical protein MYCFIDRAFT_43692 [Pseudocercospora fijiensis
            CIRAD86]
          Length = 1149

 Score = 60.5 bits (145), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 108/548 (19%), Positives = 210/548 (38%), Gaps = 114/548 (20%)

Query: 97   GYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYF-----NAK 151
            G Q VF    HP+ ++  S G L    +T +   + +A F +     G +       N +
Sbjct: 682  GLQNVFATCEHPSLIY-GSDGRLVYSAVTAEN-ATCIASFDSFGDYAGAIAIATTDENGE 739

Query: 152  SELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTY---CIVTSTAEPSTDYYKF 208
            +EL+++V+    +      V+ + +  T   +AY  E K +   CI  +           
Sbjct: 740  NELKLAVVDEERTTH----VQDLFIHETVRRIAYSAELKAFGLGCIKRT----------L 785

Query: 209  NGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEY 268
            +  ++E+ +               H  L    +++E+    + L+E E V C+    ++ 
Sbjct: 786  SAGNEEVAS---------------HFKLVDEVAFKELD--TWALNEDELVECVIRCYLD- 827

Query: 269  EGTLSGLRGYIALGTNYNYSEDVT-CRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQK 327
            +G+      ++ +GT Y   +D    +GRIL+ +I E            +IK++     +
Sbjct: 828  DGSGEEAERFV-VGTAYLDDQDANNAKGRILVLEITE----------DRRIKLVTELAVR 876

Query: 328  GPVTAICHVAGFLVTAVGQKIYIWQLKDND-----LTGIAFIDTEVYIASMVSVKNLILV 382
            G    +    G +V A+ + I ++  +        LT  A   T      +    N I V
Sbjct: 877  GACRCLAVCQGRIVAALVKTIVVYDFEYQTPSTPALTKKASYRTATAPIDICVTNNTIAV 936

Query: 383  GDYARSIALLRY----QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQ 438
             D  +S++LL +    Q +  TL  +AR ++                       +W    
Sbjct: 937  TDLMKSLSLLEFKAGRQGQPDTLIEIARHFET----------------------LWG--- 971

Query: 439  LSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKT 498
                     C ++           S   ++ SD + N+++  +           RL   +
Sbjct: 972  -------TACARV-----------SENTYLESDAEGNLIVLQHDINGFSQEDRRRLRVTS 1013

Query: 499  DFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQ 558
            +F LG+ VN     R +P ++  +PGA      + A+ DG++  +  + +     L+ +Q
Sbjct: 1014 EFLLGEMVN-----RIRPITVQPSPGAVVTPQAFLATTDGSIYVYCEIGKPRQDLLMRMQ 1068

Query: 559  NVMVTHTSHTGGLNPRAFRTYKGKGYYAGN--PSRGIIDGSLVWKFLQLSLGERLEICKK 616
             +M       GG+    FR +K      G   P R  +DG L+ +FL +    + E+ K 
Sbjct: 1069 TLMADMVKSPGGVRFAKFRGFKTLVRDMGEEGPVR-FVDGELIERFLDMPEVLQNEVVKG 1127

Query: 617  IGSKHNDI 624
            +     D+
Sbjct: 1128 LDGTGVDL 1135


>gi|427788481|gb|JAA59692.1| Putative dna damage-binding protein 1 [Rhipicephalus pulchellus]
          Length = 1156

 Score = 60.5 bits (145), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 108/499 (21%), Positives = 191/499 (38%), Gaps = 105/499 (21%)

Query: 171  VRKVPLKCTPHFLAYHLETKTYCIVT-----------STAEPSTDYYKFNGEDKELVTDP 219
            +R VPL   P  +AY   T+T+ ++T           +   PS      N      ++  
Sbjct: 726  IRTVPLGELPRRIAYQEATQTFGVITIRNDILGSSGLTPVRPSASTQAQNVTHSAQMS-- 783

Query: 220  RDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPL---HEWEHVLCLKNVSMEYEGTLSGLR 276
              S F P  VS  +  L      +E+   N  +   H +E +   + +  EY  ++   R
Sbjct: 784  --SIFKPGSVSTGNDQL-----GQEVEIHNLLIIDQHTFEVLHAHQFMQTEYAMSIVSTR 836

Query: 277  ------GYIALGT-NYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGP 329
                   Y  +GT N    E    +GRI++F  ++            K++ +  +E KG 
Sbjct: 837  LGNDPNTYYIVGTANVLPDESDPKQGRIVVFHWVD-----------GKLEHVAEQEIKGA 885

Query: 330  VTAICHVAGFLVTAVGQKIYIWQLK-DNDLTGIAFIDTEVYIASMVSVKNLILVGDYARS 388
              ++    G L+ A+   + +++   + +L         +    + +  + +LVGD  RS
Sbjct: 886  PYSMLEFNGKLLAAINSTVRLFEWNAERELRNECSHFNNILALYLRAKGDFVLVGDLMRS 945

Query: 389  IALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEIC 448
            ++LL Y+P       +ARDY+    +S                                 
Sbjct: 946  MSLLAYKPLEGNFEEIARDYQTNWMSSV-------------------------------- 973

Query: 449  KKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHV 506
                    +ILD+ + +G      +    LF+ Q ++  +    R  L +   FHLG+ V
Sbjct: 974  --------EILDDDTFLG-----AESTTNLFVCQKDSAATTDEERQHLQEVGQFHLGEFV 1020

Query: 507  NTFFKIRCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVT 563
            N F       S +   PG  S   +    + ++ GA+G    LP   Y  L  +Q  +  
Sbjct: 1021 NVFR----HGSLVMQHPGETSSPTQGSVLFGTIHGAIGLVSQLPADFYTFLSEVQEKLTK 1076

Query: 564  HTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------ 617
                 G ++   +R++  +      P+ G IDG L+  FL LS  +  E+ + I      
Sbjct: 1077 VIKSVGKIDHAFWRSFSTE--RKTEPAVGFIDGDLIESFLDLSRDKMQEVVQGIQMDDGS 1134

Query: 618  GSKHNDILDELYD-IEALS 635
            G K +  +D+L   IE LS
Sbjct: 1135 GMKRDASVDDLIKIIEELS 1153


>gi|444513057|gb|ELV10249.1| DNA damage-binding protein 1 [Tupaia chinensis]
          Length = 1146

 Score = 60.1 bits (144), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 84/377 (22%), Positives = 142/377 (37%), Gaps = 89/377 (23%)

Query: 278  YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
            Y  +GT   Y E+   + GRI++F             +  K++ +  KE KG V ++   
Sbjct: 828  YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 876

Query: 337  AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
             G L+ ++    ++Y W  +    T     +  +    + +  + ILVGD  RS+ LL Y
Sbjct: 877  NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 935

Query: 395  QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
            +P       +ARD+ P                       W                    
Sbjct: 936  KPMEGNFEEIARDFNPN----------------------WM------------------S 955

Query: 455  HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
              +ILD+ + +G      +    LF+ Q ++  +    R  L +   FHLG+ VN F   
Sbjct: 956  AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 1007

Query: 513  RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVM-------- 561
             C  S +    G  S   +    + +++G +G    L E  Y  LL +QN +        
Sbjct: 1008 -CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKRCF 1066

Query: 562  -VTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI--- 617
             ++  S T     R+F T +        P+ G IDG L+  FL +S  +  E+   +   
Sbjct: 1067 QISPNSLTDMSTWRSFHTERKT-----EPATGFIDGDLIESFLDISRPKMQEVVANLQYD 1121

Query: 618  ---GSKHNDILDELYDI 631
               G K     D+L  +
Sbjct: 1122 DGSGMKREATADDLIKV 1138


>gi|452838792|gb|EME40732.1| hypothetical protein DOTSEDRAFT_177898 [Dothistroma septosporum
            NZE10]
          Length = 1138

 Score = 60.1 bits (144), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 110/539 (20%), Positives = 203/539 (37%), Gaps = 115/539 (21%)

Query: 97   GYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRI 156
            G Q VF    HP+ ++  S G +    +T +   S  +   N N     +   +  ELRI
Sbjct: 681  GLQNVFATCEHPSLIY-GSEGRMVYSAVTAESATSICS--FNSNSYGNAIAIASNDELRI 737

Query: 157  SVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTY---CIV-TSTAEPSTDYYKFNGED 212
            + +    +      V+ + +  T    AY  E K +   CI  T TA          G++
Sbjct: 738  AAVDEERTTH----VQDLFIHETVRRTAYSAELKAFGLGCIQRTLTA----------GQE 783

Query: 213  KELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTL 272
            +                 + H  L    +++E+   ++ L+E E V  +    ++ +G+ 
Sbjct: 784  E----------------VKSHFKLVDEVAFKELD--SYELNEDELVESVIRCKLD-DGSG 824

Query: 273  SGLRGYIALGTNYNYSEDV-TCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVT 331
             G   + A+GT Y   +D  T RGRIL+ ++ E            ++K++     KG   
Sbjct: 825  DGAERF-AVGTAYLDDQDSNTARGRILILEVTE----------DRRLKLVTELSVKGACR 873

Query: 332  AICHVAGFLVTAVGQKIYIWQLK----DNDLTGIAFIDTEVYIASMVSVKNLILVGDYAR 387
             +    G +V A+ + + I+  +       LT  A   T      +    N+I V D  +
Sbjct: 874  CLAVCEGKIVAALIKTVIIYDFEFAASKATLTKKASYRTATAPIDVCVTGNVIAVTDLMK 933

Query: 388  SIALLRYQ------PEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSL 441
            S++L+ Y+      P+  TL+ +AR ++     +    A N                   
Sbjct: 934  SMSLVEYKKGRTGMPD--TLTEIARHFETLWGTAVANVADNT------------------ 973

Query: 442  GERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFH 501
                                     ++ SD + N+++  +           RL   ++  
Sbjct: 974  -------------------------YLQSDAEGNLIVLQHDTNGFSEEDRRRLRVTSELL 1008

Query: 502  LGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVM 561
            LG+ VN   +I   P+      GA      + A+++G++  F  +       L+ +QN M
Sbjct: 1009 LGEMVNRIRRIDVTPTH-----GALVIPRAFLATVEGSIYLFALIVPGKQDLLMRMQNNM 1063

Query: 562  VTHTSHTGGLNPRAFRTYKG--KGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 618
             +     G +    FR +K   +   A  PSR  +DG L+ +FL      + EI + +G
Sbjct: 1064 ASLVKSPGHVEFATFRGFKNQVRDEGANGPSR-FVDGELIERFLDCGQDIQEEIIRDLG 1121


>gi|348560393|ref|XP_003465998.1| PREDICTED: DNA damage-binding protein 1-like [Cavia porcellus]
          Length = 1140

 Score = 60.1 bits (144), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 80/368 (21%), Positives = 139/368 (37%), Gaps = 77/368 (20%)

Query: 278  YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
            Y  +GT   Y E+   + GRI++F             +  K++ +  KE KG V ++   
Sbjct: 828  YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 876

Query: 337  AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
             G L+ ++    ++Y W  +    T     +  +    + +  + ILVGD  RS+ LL Y
Sbjct: 877  NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 935

Query: 395  QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
            +P       +ARD+ P                       W                    
Sbjct: 936  KPMEGNFEEIARDFNPN----------------------WM------------------S 955

Query: 455  HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
              +ILD+ + +G      +    LF+ Q ++  +    R  L +   FHLG+ VN F   
Sbjct: 956  AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 1007

Query: 513  RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
             C  S +    G  S   +    + +++G +G    L E  Y  LL +QN +       G
Sbjct: 1008 -CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066

Query: 570  GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
             +    +R++  +       + G IDG L+  FL +S  +  E+   +      G K   
Sbjct: 1067 KIEHSFWRSFHTE--RKTEQATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREA 1124

Query: 624  ILDELYDI 631
              D+L  +
Sbjct: 1125 TADDLIKV 1132


>gi|432851195|ref|XP_004066902.1| PREDICTED: DNA damage-binding protein 1-like [Oryzias latipes]
          Length = 1140

 Score = 60.1 bits (144), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 82/368 (22%), Positives = 139/368 (37%), Gaps = 77/368 (20%)

Query: 278  YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
            Y  +GT   Y E+   + GRI++F             T  K++ +  KE KG V ++   
Sbjct: 828  YFIVGTAMVYPEEPEPKQGRIIVFH-----------YTDGKLQTVAEKEVKGAVYSMVEF 876

Query: 337  AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
             G L+ ++    ++Y W  +    T     +  +    + +  + ILVGD  RS+ LL Y
Sbjct: 877  NGKLLASINSTVRLYEWTAEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 935

Query: 395  QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
            +P       +ARD+ P                       W                    
Sbjct: 936  KPMEGNFEEIARDFNPN----------------------WM------------------S 955

Query: 455  HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
              +ILD+ + +G      +    LF+ Q ++  +    R  L +   FHLG+ VN F   
Sbjct: 956  AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGVFHLGEFVNVF--- 1007

Query: 513  RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
             C  S +    G  S   +    + +++G +G    L E  +  LL LQN +       G
Sbjct: 1008 -CHGSLVLQNLGESSTPTQGSVLFGTVNGMIGLVTSLSEGWHSLLLDLQNRLNKVIKSVG 1066

Query: 570  GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHND 623
             +    +R++  +       + G IDG L+  FL L   +  E+   +      G K   
Sbjct: 1067 KIEHSFWRSFYTE--RKTEQATGFIDGDLIESFLDLGRAKMQEVVSTLQIDDGGGMKREA 1124

Query: 624  ILDELYDI 631
             +DE+  I
Sbjct: 1125 TVDEVIKI 1132


>gi|427780151|gb|JAA55527.1| Putative dna damage-binding protein 1 [Rhipicephalus pulchellus]
          Length = 1181

 Score = 59.7 bits (143), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 108/499 (21%), Positives = 191/499 (38%), Gaps = 105/499 (21%)

Query: 171  VRKVPLKCTPHFLAYHLETKTYCIVT-----------STAEPSTDYYKFNGEDKELVTDP 219
            +R VPL   P  +AY   T+T+ ++T           +   PS      N      ++  
Sbjct: 751  IRTVPLGELPRRIAYQEATQTFGVITIRNDILGSSGLTPVRPSASTQAQNVTHSAQMS-- 808

Query: 220  RDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPL---HEWEHVLCLKNVSMEYEGTLSGLR 276
              S F P  VS  +  L      +E+   N  +   H +E +   + +  EY  ++   R
Sbjct: 809  --SIFKPGSVSTGNDQL-----GQEVEIHNLLIIDQHTFEVLHAHQFMQTEYAMSIVSTR 861

Query: 277  ------GYIALGT-NYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGP 329
                   Y  +GT N    E    +GRI++F  ++            K++ +  +E KG 
Sbjct: 862  LGNDPNTYYIVGTANVLPDESDPKQGRIVVFHWVD-----------GKLEHVAEQEIKGA 910

Query: 330  VTAICHVAGFLVTAVGQKIYIWQLK-DNDLTGIAFIDTEVYIASMVSVKNLILVGDYARS 388
              ++    G L+ A+   + +++   + +L         +    + +  + +LVGD  RS
Sbjct: 911  PYSMLEFNGKLLAAINSTVRLFEWNAERELRNECSHFNNILALYLRAKGDFVLVGDLMRS 970

Query: 389  IALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEIC 448
            ++LL Y+P       +ARDY+    +S                                 
Sbjct: 971  MSLLAYKPLEGNFEEIARDYQTNWMSSV-------------------------------- 998

Query: 449  KKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHV 506
                    +ILD+ + +G      +    LF+ Q ++  +    R  L +   FHLG+ V
Sbjct: 999  --------EILDDDTFLG-----AESTTNLFVCQKDSAATTDEERQHLQEVGQFHLGEFV 1045

Query: 507  NTFFKIRCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVT 563
            N F       S +   PG  S   +    + ++ GA+G    LP   Y  L  +Q  +  
Sbjct: 1046 NVFR----HGSLVMQHPGETSSPTQGSVLFGTIHGAIGLVSQLPADFYTFLSEVQEKLTK 1101

Query: 564  HTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------ 617
                 G ++   +R++  +      P+ G IDG L+  FL LS  +  E+ + I      
Sbjct: 1102 VIKSVGKIDHAFWRSFSTE--RKTEPAVGFIDGDLIESFLDLSRDKMQEVVQGIQMDDGS 1159

Query: 618  GSKHNDILDELYD-IEALS 635
            G K +  +D+L   IE LS
Sbjct: 1160 GMKRDASVDDLIKIIEELS 1178


>gi|260790329|ref|XP_002590195.1| hypothetical protein BRAFLDRAFT_128289 [Branchiostoma floridae]
 gi|229275385|gb|EEN46206.1| hypothetical protein BRAFLDRAFT_128289 [Branchiostoma floridae]
          Length = 1152

 Score = 59.7 bits (143), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 76/345 (22%), Positives = 132/345 (38%), Gaps = 65/345 (18%)

Query: 278  YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
            Y  +GT   Y E+   + GRI++F             T  K++ +  KE KG V ++   
Sbjct: 835  YFIIGTAMVYPEESEPKSGRIIVFQ-----------YTDGKLQQVAEKEVKGAVYSLVQF 883

Query: 337  AGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVK-NLILVGDYARSIALLRYQ 395
               L+ ++   + +++        +        +A  +  K + ILVGD  RS+ LL Y+
Sbjct: 884  NNKLLASINSTVRLFEWTAEKELRVECNHYNNILALYLKTKGDFILVGDLMRSVTLLAYK 943

Query: 396  PEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKH 455
            P       +ARD+ P                       W    +S  E L+    +G+++
Sbjct: 944  PMEGCFEEIARDFNPN----------------------W----MSAVEILDDDNFLGAEN 977

Query: 456  NDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCK 515
                    S  F    KD        +   +E   GH       FHLG+ VN F      
Sbjct: 978  --------SFNFFTCQKDSAATTDEERQHLQEV--GH-------FHLGEFVNVFR----H 1016

Query: 516  PSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 572
             S +   PG  S   +    + +++GA+G    LP   +  L  +Q+ +       G + 
Sbjct: 1017 GSLVMQHPGETSTPTQGSVLFGTVNGAVGLVTQLPADFFNFLQEVQSKLTRVIKSVGKIE 1076

Query: 573  PRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI 617
               +R++  +        +G IDG L+  FL LS  +  E+ + +
Sbjct: 1077 HSFWRSFNTE--RKTEACQGFIDGDLIESFLDLSRDKMQEVVQGL 1119


>gi|324502823|gb|ADY41238.1| DNA damage-binding protein 1, partial [Ascaris suum]
          Length = 1129

 Score = 59.7 bits (143), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 100/454 (22%), Positives = 177/454 (38%), Gaps = 82/454 (18%)

Query: 171  VRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGE------DKELVTDPRDSRF 224
            +R VPL  +   +AY  ET T  I+    E    +   +G+        ++  +   S  
Sbjct: 700  IRTVPLGESVSRIAYQPETGTIAILVQRNE----FVDADGKHHCGHCASKMAVNASSSH- 754

Query: 225  IPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWE----HVLCLKNVSMEYEGTLSGL--RGY 278
             P +V+        P   E      F  +  E    H L    ++M  +  + G   + Y
Sbjct: 755  -PSVVTSATTPPIEPEEIEVSSVVVFDANTLEILHSHELGKNELAMSIKSCVLGDDPQPY 813

Query: 279  IALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA 337
             A+GT    +++   + GR+L+F   +V P         ++++++ KE KG   +I  + 
Sbjct: 814  YAVGTAVVLTDETESKSGRLLIF---QVAPSS----EGGRMRLVHDKEIKGAAYSIQVLM 866

Query: 338  GFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKN-LILVGDYARSIALLRYQP 396
            G LV A+   + +++        +   D +   A  +  KN ++LVGD  RS+++L Y+P
Sbjct: 867  GKLVVAINSCVRLFEWTAEKELRLECSDFDNVTALYLRTKNDVVLVGDLMRSLSVLAYKP 926

Query: 397  EYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHN 456
               +   +ARD+                         W          +  C+ I     
Sbjct: 927  MESSFEKIARDFVTN----------------------W----------MTACEIID---- 950

Query: 457  DILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRL-IKKTD-FHLGQHVNTFFKIRC 514
              ++ F     M +       LF    +    + G RL +++T  ++LG+ VN F    C
Sbjct: 951  --METFLGAEIMFN-------LFTVVKDCSSKDEGIRLQLQETGMYYLGESVNAF----C 997

Query: 515  KPSSISDAPGARSRFLT--WYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 572
              S I+        F T   Y + DG LG  + L  + Y  +  L+  +   T +   + 
Sbjct: 998  HGSLIATHIDLTPSFTTPILYGTSDGGLGVIVQLTPQFYDFVHELETRIAAVTKNCMRIE 1057

Query: 573  PRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLS 606
               +RT++  G      S G IDG LV   L +S
Sbjct: 1058 HGQYRTFESDGRT--EQSVGFIDGDLVEGLLDMS 1089


>gi|297740793|emb|CBI30975.3| unnamed protein product [Vitis vinifera]
          Length = 1043

 Score = 59.3 bits (142), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 100/442 (22%), Positives = 170/442 (38%), Gaps = 110/442 (24%)

Query: 130  VSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLET 189
            VS + PF++   P   L    + +L I  +           +R +PL      + +  ++
Sbjct: 670  VSHMCPFNSAAFPDS-LAIAKEGDLTIGTIDDI----QKLHIRSIPLGEHARRICHQEQS 724

Query: 190  KTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTN 249
            +T+ I       S  Y + + ED E+        FI  L  Q         ++E I  + 
Sbjct: 725  RTFAIC------SLKYNQSSTEDSEM-------HFIRLLDDQ---------TFEFI--ST 760

Query: 250  FPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNY-SEDVTCRGRILLFDIIEVVPE 308
            +PL  +E+   + + S   +  +     Y  +GT Y    E+   +GRIL+F I+E    
Sbjct: 761  YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-IVE---- 810

Query: 309  PGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKI--YIWQLKDNDLTGIAFIDT 366
                    K+++I  KE KG V ++    G L+ A+ QKI  Y W L+D+   G   + +
Sbjct: 811  ------DGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDD---GTRELQS 861

Query: 367  EV-----YIASMVSVK-NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYA 420
            E       +A  V  + + I+VGD  +SI+LL Y+ E   +   ARDY     ++     
Sbjct: 862  ESGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAV---- 917

Query: 421  GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFM 480
                                                +ILD+   +G   ++ + N+    
Sbjct: 918  ------------------------------------EILDDDIYLG---AENNFNIFTVR 938

Query: 481  YQPEARESNGGHRLIKKTDFHLGQHVNTF----FKIRCKPSSISDAPGARSRFLTWYASL 536
               E        RL    ++HLG+ VN F      +R   S +   P         + ++
Sbjct: 939  KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIP------TVIFGTV 992

Query: 537  DGALGFFLPLPEKNYRRLLMLQ 558
            +G +G    LP   Y  L  LQ
Sbjct: 993  NGVIGVIASLPHDQYVFLEKLQ 1014


>gi|170057515|ref|XP_001864517.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167876915|gb|EDS40298.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 1138

 Score = 59.3 bits (142), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 114/573 (19%), Positives = 209/573 (36%), Gaps = 119/573 (20%)

Query: 66   FVSDRSK-RANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPM 124
            FV D++  R  +Q  +  G + + ++ F +++    VF C   P  ++ ++      H +
Sbjct: 615  FVVDKTTHRLTDQKKVTLGTQPTILKTFRSLS-TTNVFACSDRPTVIYSSN------HKL 667

Query: 125  TIDGPVSTLAPFHNVNCPR--GFLYFNAKS-------ELRISVLPTHLSYDAPWPVRKVP 175
                       F NVN          NA+S         + SV+   +       +R VP
Sbjct: 668  V----------FSNVNLKEVNHMCSLNAESYQDSLALATKNSVILGTIDEIQKLHIRTVP 717

Query: 176  LKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPP--LVSQFH 233
            L  +P  +AY   ++T+ ++T          + + +D   +T  R S       + S  +
Sbjct: 718  LGESPRRIAYQEASQTFGVIT---------VRMDIQDSSGLTPSRQSASTQTSNVTSSSN 768

Query: 234  VSLFSP------FSWEE-------IPQTNFPL---HEW---EHVLCLKNVSMEYEGTLSG 274
            + L  P      F  E        I Q  F +   H++   E+VL L +  +  +     
Sbjct: 769  MGLLKPGASNTEFGQEVEVHNLLIIDQNTFEVLHAHQFMQTEYVLSLISAKLGNDPATY- 827

Query: 275  LRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAIC 334
               YI      N  E     GRI+++   +             +  +  KE KG   ++ 
Sbjct: 828  ---YIVGTAMVNPEEREPKVGRIIIYHYAD-----------GALTQVSEKEIKGACYSLV 873

Query: 335  HVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVK-NLILVGDYARSIALLR 393
               G ++  +   + +++  D+    +        +A     K + ILVGD  RSI LL+
Sbjct: 874  EFNGRVLATINSTVRLYEWTDDKDLRLECSHFNNVLALYCKTKGDFILVGDLMRSITLLQ 933

Query: 394  YQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGS 453
            Y+    +   +ARDY+P                       W                   
Sbjct: 934  YKQMEGSFEEIARDYQPK----------------------WM------------------ 953

Query: 454  KHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIR 513
               +ILD+ + +G   ++   N+ + +    A   +   ++ +   FHLG  VN F    
Sbjct: 954  TAVEILDDDAFLG---AENSNNLFVCLKDSAATTDDERQQMPEVAQFHLGDMVNVFRHGS 1010

Query: 514  CKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNP 573
                +I +     S  +  + ++ GA+G    +P   Y  L  LQ  +       G ++ 
Sbjct: 1011 LVMQNIGERTTPTSGCV-LFGTVSGAIGLVTQIPPDYYEFLRKLQENLTNTIKSVGRIDH 1069

Query: 574  RAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLS 606
              +R++  +       S G IDG LV  FL L+
Sbjct: 1070 TYWRSFHTE--MKTENSEGFIDGDLVESFLDLT 1100


>gi|357135348|ref|XP_003569272.1| PREDICTED: DNA damage-binding protein 1a-like [Brachypodium
            distachyon]
          Length = 1074

 Score = 58.9 bits (141), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 86/359 (23%), Positives = 145/359 (40%), Gaps = 76/359 (21%)

Query: 278  YIALGTNYNYSEDVT-CRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
            Y  +GT Y    ++   +GRIL+F + E            K++++  +E KG V ++  +
Sbjct: 780  YYCVGTAYILPYEIEPTKGRILIFLVEE-----------RKLRLVAERETKGAVYSLNAL 828

Query: 337  AGFLVTAVGQKI--YIWQLKDN--DLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALL 392
             G L+ AV QKI  Y W  +DN   L         V      +  + I+VGD  RS++LL
Sbjct: 829  TGKLLAAVNQKIIVYKWVRRDNRHQLQSECSYRGCVLALHTQTHGHFIVVGDMVRSVSLL 888

Query: 393  RYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 452
            RY+ E   + +V RD+     N+K   A      ++D  +                  IG
Sbjct: 889  RYKYEEGLIEVVTRDF-----NTKWITA----VAMLDDDIY-----------------IG 922

Query: 453  SKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 512
            +      D   ++  + S +   V  +         + G  ++  TD  +GQ     F  
Sbjct: 923  A------DNCCNLFTLHSGRPGVVGEYHLGDLVNRMHHGSLVMHHTDSEIGQIPTVIF-- 974

Query: 513  RCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 572
                 +IS A G  + F                 P   Y  L  LQ+V+V      G L+
Sbjct: 975  ----GTISGAIGVIASF-----------------PYDQYVFLEKLQSVLVKFIKSVGNLS 1013

Query: 573  PRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHND---ILDEL 628
               +R++      A   +R  +DG L+  FL LS  +  E+ + +G + ++   I++EL
Sbjct: 1014 HVEWRSFYNVSRTA--EARNFVDGDLIESFLSLSPSKMEEVSQVMGLRADELCKIVEEL 1070


>gi|166158025|ref|NP_001107422.1| damage-specific DNA binding protein 1, 127kDa [Xenopus (Silurana)
           tropicalis]
 gi|157422734|gb|AAI53474.1| Zgc:63840 protein [Danio rerio]
 gi|163916541|gb|AAI57552.1| LOC100135265 protein [Xenopus (Silurana) tropicalis]
          Length = 306

 Score = 58.9 bits (141), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 76/348 (21%), Positives = 131/348 (37%), Gaps = 72/348 (20%)

Query: 305 VVPEPGQP---------LTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQ--KIYIWQL 353
           V PE  +P          T  K++ +  KE KG V ++    G L+ ++    ++Y W  
Sbjct: 2   VCPEEAEPKQGRIIVFHYTDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRLYEWTA 61

Query: 354 KDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQP 413
           +    T     +  +    + +  + ILVGD  RS+ LL Y+P   +   +ARD+ P   
Sbjct: 62  EKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAYKPMEGSFEEIARDFNPNWM 120

Query: 414 NSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKD 473
           ++                                         +ILD+ + +G      +
Sbjct: 121 SAV----------------------------------------EILDDDNFLG-----AE 135

Query: 474 KNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFF--KIRCKPSSISDAPGARSRF 529
               LF+ Q ++  +    R  L +   FHLG+ VN F    +  +    S  P   S  
Sbjct: 136 NAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVFSHGSLVLQNLGESSTPTQGS-- 193

Query: 530 LTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNP 589
              + +++G +G    L E  Y  LL LQN +       G +    +R++  +       
Sbjct: 194 -VLFGTVNGMIGLVTSLSEGWYSLLLDLQNRLNKVIKSVGKIEHSFWRSFHTE--RKTEQ 250

Query: 590 SRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 631
           + G IDG L+  FL L   +  E+   +      G K    +DE+  I
Sbjct: 251 ATGFIDGDLIESFLDLGQAKMQEVVSTLQIDDGSGMKREATVDEVIKI 298


>gi|402083318|gb|EJT78336.1| hypothetical protein GGTG_03437 [Gaeumannomyces graminis var. tritici
            R3-111a-1]
          Length = 1155

 Score = 58.5 bits (140), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 101/508 (19%), Positives = 194/508 (38%), Gaps = 98/508 (19%)

Query: 133  LAPFHNVNCPRGFLYFNAKSELRIS-VLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKT 191
            + PF     P G L     SEL+IS + P   S+     V+ +P+      +AY   T+ 
Sbjct: 719  VCPFDTAVFP-GSLAVATDSELKISKIDPQRQSH-----VQSLPMGENVRSIAYSAPTRV 772

Query: 192  Y---CI---VTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEI 245
            +   CI   ++   E ++  ++      E+V  P                L +PF     
Sbjct: 773  FGLGCIRREISKGVEKASSTFRLV---DEVVLQP----------------LGNPFE---- 809

Query: 246  PQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVT----CRGRILLFD 301
                  L+E E V  +  +  +   T   L     +GT +   E++      +GR+L+F 
Sbjct: 810  ------LNEGEVVETV--IRAQLRDTFGRLAERFIVGTRFLVDENLVPGSNSKGRVLVFG 861

Query: 302  IIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKD-----N 356
                V E   P        I +   K     +  +   +V A+ + + + + ++      
Sbjct: 862  ----VDEERSPF------QIVSHPLKSGCRRLAVMEEMIVVALTKTVVVARYEELTSTSG 911

Query: 357  DLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSK 416
             L  +A   T  Y   +     LI VGD  +S++L+ + P     + VA D K  +    
Sbjct: 912  KLIKVASYQTTSYAIDVAVEGRLIAVGDIMKSMSLVEFVPP----TTVAGDGKAGETK-- 965

Query: 417  GYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNV 476
                              K  QL     +E+C+   S  +  +  F    ++ +D D NV
Sbjct: 966  ------------------KPAQL-----IEVCRHYQSSWSTAVAHFEGESWLEADADGNV 1002

Query: 477  VLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASL 536
            ++              R+   ++ +LG+++N   KI     S+   P A      + ++ 
Sbjct: 1003 MVLGRNTTGVTLEDRRRMEITSEINLGENINRIQKI-----SVETGPNAPIHPKAFLSTT 1057

Query: 537  DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDG 596
            +G++     +  +    LL LQ+ +  +    G +  + FR+++     A  P R  IDG
Sbjct: 1058 EGSIYLVGAIAPQMRDLLLNLQDRLEDYVGTLGNIPFKNFRSFRNAEREADGPVR-FIDG 1116

Query: 597  SLVWKFLQLSLGERLEICKKIGSKHNDI 624
              + +FL ++   + ++C+ +G    D+
Sbjct: 1117 EYIERFLDMNEETQSQVCRDLGPSVEDM 1144


>gi|281208174|gb|EFA82352.1| UV-damaged DNA binding protein1 [Polysphondylium pallidum PN500]
          Length = 1054

 Score = 58.5 bits (140), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 85/368 (23%), Positives = 139/368 (37%), Gaps = 74/368 (20%)

Query: 278  YIALGTNY-NYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
            Y+ +GT + N  E    +GRIL+F I                ++I   E   P    C +
Sbjct: 753  YVVVGTAFHNEVESQQSKGRILVFRI-------------EDNRLILLDEVALPACVYCLL 799

Query: 337  --AGFLVTAVGQKI--YIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALL 392
               G L+  + +++  + W +  N LT            SMVS  + +LV D  +S+ LL
Sbjct: 800  PFNGRLLAGINKRVQAFNWGVDTNKLTKAESYSGHTLSHSMVSRGHFVLVADLMKSMTLL 859

Query: 393  RYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 452
              + +   +  +AR+  P                      +W         R+E+     
Sbjct: 860  -VEDQQGAIKELARNPLP----------------------IWL-------SRIEM----- 884

Query: 453  SKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI 512
                 I DE     F+  D   N+++     EA        L     FHLG+ +N F   
Sbjct: 885  -----IDDE----TFIGGDNSYNLIVVQKNAEASSEIDNELLDTVGQFHLGETINKF--- 932

Query: 513  RCKPSSISDAPGARSRFL--TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGG 570
              K  S+  +P   S  L    + ++ GA+G  + + + +Y     LQ  +       GG
Sbjct: 933  --KHGSLVTSPDMDSPKLPTILFGTVSGAIGVIVSISKDDYEFFEKLQKGLNRVVHGVGG 990

Query: 571  LNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYD 630
            L    +R++  +  +   PS+  IDG L+  FL L   + LE  K +      I D    
Sbjct: 991  LPFENWRSFSTE--HMTIPSKNFIDGDLIETFLDLRHDKMLEAIKDMNIS---IEDTYRR 1045

Query: 631  IEALSSHF 638
            IE+L  H 
Sbjct: 1046 IESLMHHI 1053


>gi|145348011|ref|XP_001418451.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144578680|gb|ABO96744.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 1196

 Score = 58.5 bits (140), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 110/554 (19%), Positives = 213/554 (38%), Gaps = 97/554 (17%)

Query: 110  WLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPW 169
            WL  + +G     P++   P+  +  F++  CP G +  + ++ LRI+ +         +
Sbjct: 711  WLGYSEKGTFVLAPISYV-PLEEVCSFNSEQCPEGVVAISNQT-LRIASIE---RLGENF 765

Query: 170  PVRKVPLKCTPHFLAYHLETKTYCIVTST-------------AEPSTDYYKFNGEDKELV 216
                V L+ TP  ++ + +TK   ++ S              A P+ +  + N ED+E  
Sbjct: 766  NQTTVKLRYTPRAMSANPDTKMVALIESDQCTVPVGEREGPEATPADEAPETNDEDEE-- 823

Query: 217  TDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFP----------LHEWEHVLCLKNVSM 266
                +++ +P  V QF     SP +W    +   P          LH+ E  L L +V +
Sbjct: 824  ----EAKMLP--VEQFGAPKSSPGTWAACVRIVDPKEAKSTFVLELHKSEAALSLCHVFL 877

Query: 267  EYEGTLSGLRGYIALGTNYNYS-EDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKE 325
                 L      +A+GT  N +     C G    F  +      G+ L      ++++  
Sbjct: 878  TGPNEL-----LLAVGTAVNLTFAPRNCDGG---FIHLYRYGNDGRTL-----NLVHSTP 924

Query: 326  QKGPVTAICHVAGFLVTAVGQKIYIWQL-KDNDLTGIAFIDTEVYIASMVSVKNLILVGD 384
              GPV A+C   G L+  V   + I+   K   L  +   +   +I ++ +  + I VGD
Sbjct: 925  TDGPVGALCGYKGHLLAGVNNSLRIYDYGKKKLLRKVENRNFPNFITTLHAAGDRIYVGD 984

Query: 385  YARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGER 444
               SI  ++Y+ +  ++ + A D KP                 I  +L   +  L+  ++
Sbjct: 985  VQESIHYVKYKADEGSIYIFADDTKPR---------------YITATLPLDYDTLAGADK 1029

Query: 445  LE--ICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHL 502
                   ++    ++ +D+  + G  I  +     +    P   E++           ++
Sbjct: 1030 FGNIFVNRLPKDVSEDMDDDPTGGKNIYSQG----VLNGAPNKSETSA--------QTYI 1077

Query: 503  GQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLM-LQNVM 561
            G+ V    K   +P  I          +  Y +  G +G  LP   ++       L+  M
Sbjct: 1078 GETVCALTKGALQPGGIE---------IIMYGTFMGGIGCLLPFSSRSEIEFFTHLEMHM 1128

Query: 562  VTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKH 621
                    G +  AFR+     YYA  P + +IDG L  +F  L    +  I +++    
Sbjct: 1129 RQEAPSIVGRDHMAFRS-----YYA--PVKNVIDGDLCEQFGALPADVQRRIAEEMDRTP 1181

Query: 622  NDILDELYDIEALS 635
             +IL +L  + +++
Sbjct: 1182 GEILKKLEQVRSVA 1195


>gi|313238818|emb|CBY20011.1| unnamed protein product [Oikopleura dioica]
 gi|313245836|emb|CBY34826.1| unnamed protein product [Oikopleura dioica]
          Length = 1135

 Score = 57.8 bits (138), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 73/390 (18%), Positives = 151/390 (38%), Gaps = 71/390 (18%)

Query: 250  FPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEP 309
            F + E    +C+  +  + E        +I +GT     E     GRI +F   +     
Sbjct: 805  FDVGEISSCMCIAKLGKKDEQ-------FIVVGTAITADEQECKNGRICVFSYSK----- 852

Query: 310  GQPLTKNKIKMIYAKEQKGPVTAICHVAGF-LVTAVGQKIYIWQLKDND-LTGIAFIDTE 367
                 + K+ ++  K+  G V ++  + G  ++ A+ Q++ ++++ +   L   A I   
Sbjct: 853  -----EEKLTLVSTKQVNGAVYSVKALNGNKIICAINQQLKVFEMNEQTTLQSEAPIANH 907

Query: 368  VY-IASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRG 426
            +  +A  VS    IL  D  RSI++  Y+P    L  +ARDY P                
Sbjct: 908  ITCVAVDVSKNGFILSADLMRSISVFSYKPLEGALEEIARDYHPN--------------- 952

Query: 427  IIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEAR 486
                   W          +   K I   +           ++ ++  +N+ +     EA 
Sbjct: 953  -------W----------MTAIKMIDDDN-----------YIGAENSENIFICTRNTEAP 984

Query: 487  ESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPL 546
            +     +L+    +H+G+H+NT  +         ++    +R      S+ G +G     
Sbjct: 985  DEEDRQQLLPTGYYHVGEHINTIVEGNLVMDVHVESSITPTRTF-LMGSVSGYVGLLAIF 1043

Query: 547  PEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLS 606
            PEK ++ L  L+  M       G ++  ++R ++          +G +DG L+  F  L 
Sbjct: 1044 PEKQWQFLSKLEAKMRKVIRGVGKIDHESWRRFESDSRM--EDCKGFVDGDLIEMFQDLR 1101

Query: 607  LGERLEICKKIG-----SKHNDILDELYDI 631
              ++ E+  ++      + H+D++  + D+
Sbjct: 1102 PEKQKEVISELTMDGEPATHDDVVRLVDDL 1131


>gi|405970039|gb|EKC34976.1| DNA damage-binding protein 1 [Crassostrea gigas]
          Length = 1160

 Score = 57.8 bits (138), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 83/371 (22%), Positives = 142/371 (38%), Gaps = 74/371 (19%)

Query: 278  YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
            Y  +GT   + E+   + GRI++F   E            K+  I  KE KG    +   
Sbjct: 848  YYIVGTALVHPEEAEPKQGRIVIFHFHE-----------GKLNQIAEKEIKGAAYTLVEF 896

Query: 337  AGFLVTAVGQKIYIWQ-LKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQ 395
             G L+ ++   + +++   D +L         +    + +  + ILVGD  RSI LL Y+
Sbjct: 897  NGKLLASINSTVRLFEWTTDKELRLECNYFNSIVALYLKTKGDFILVGDLMRSITLLLYK 956

Query: 396  PEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKH 455
            P   T   +ARD  P                       W                  +  
Sbjct: 957  PMEGTFEEIARDCNPN----------------------W------------------TTA 976

Query: 456  NDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFF--K 511
             +ILD+ + +G      + +  LF  Q ++  +    R  L +   FHLG+ VN F    
Sbjct: 977  VEILDDDNFLG-----AENSFNLFTCQKDSASTTDEDRQNLQEVGMFHLGEFVNVFRHGS 1031

Query: 512  IRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGL 571
            +  + S  +  P   S     Y +++GA+G    +P++ Y  L  +Q+ +       G +
Sbjct: 1032 LVMQHSGETSTPTQGS---VLYGTVNGAVGLVTQVPQEFYSFLQDIQSRLAKVIKSVGKI 1088

Query: 572  NPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHNDIL 625
                +R++  +         G IDG L+  FL L+  +  E  K +      G K    +
Sbjct: 1089 EHSFWRSFHTE--RKTEACEGFIDGDLIESFLDLNRDKMQETVKGLQIDDGSGMKREATV 1146

Query: 626  DELYD-IEALS 635
            D+L   IE L+
Sbjct: 1147 DDLVKTIEELT 1157


>gi|154286506|ref|XP_001544048.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
 gi|150407689|gb|EDN03230.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
          Length = 1158

 Score = 57.8 bits (138), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 80/355 (22%), Positives = 135/355 (38%), Gaps = 83/355 (23%)

Query: 281  LGTNY--NYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICH--- 335
            +GT+Y  ++ E  + RGRIL F++           T N+     AK  + PV   C    
Sbjct: 845  VGTSYLDDFGEG-SIRGRILAFEV-----------TANRQ---LAKVAEMPVKGACRALA 889

Query: 336  ------VAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSI 389
                  VA  + T V   I   Q  D  L+  A   T      +    NLI V D  +S+
Sbjct: 890  IVQDKIVAALMKTVVVYTISKGQFADYTLSKTASYRTSTAPIDIAVTGNLIAVADLMKSV 949

Query: 390  ALLRYQPEYR----TLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERL 445
            +++ YQ        +L+ VAR ++     +  + A +           W           
Sbjct: 950  SIVEYQQGSNGLPDSLTEVARHFQTLWSTAVAHVAED----------TW----------- 988

Query: 446  EICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQH 505
                                  + SD + N+V+          +   RL   ++  LG+ 
Sbjct: 989  ----------------------LESDAEGNLVMLHRNVNGVTDDDRRRLEVTSEILLGEM 1026

Query: 506  VNTFFKIRCKPSSISDAPGARSRF--LTWYASLDGALGFFLPLPEKNYRRLLM-LQNVMV 562
            VN     R +P +I  + GA +      +  +++G++ +   +    Y+ LLM LQ+ M 
Sbjct: 1027 VN-----RIRPVNIQGSQGAEAAISPRAFLGTVEGSI-YLFGIINPTYQDLLMRLQSAMA 1080

Query: 563  THTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI 617
                  GG+    FR ++     A  P R  +DG L+ +FL  S+  + EI  K+
Sbjct: 1081 GMVVTPGGMPFNKFRAFRNTIRQAEEPYR-FVDGELIERFLSCSVELQEEIVGKV 1134


>gi|169611218|ref|XP_001799027.1| hypothetical protein SNOG_08717 [Phaeosphaeria nodorum SN15]
 gi|160702249|gb|EAT83885.2| hypothetical protein SNOG_08717 [Phaeosphaeria nodorum SN15]
          Length = 1140

 Score = 57.8 bits (138), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 107/550 (19%), Positives = 209/550 (38%), Gaps = 98/550 (17%)

Query: 83   GVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCP 142
            G R +  R      G   VF    HP+ ++  S G L    +T +   +T+ PF +   P
Sbjct: 670  GTREATFRALPRGNGLFNVFATCEHPSLIY-ASEGRLVYSAVTAENA-TTVCPFDSEAYP 727

Query: 143  RGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPS 202
             G +      +LRI+++ T  +      V+ + +  T   +AY    K + +        
Sbjct: 728  -GSVAIATSDDLRIALVDTERTTH----VQTLKVDETVRRIAYSPGLKAFGL-------- 774

Query: 203  TDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLK 262
                   G  K ++    +       +   H  L     ++E+    + L+E E V C+ 
Sbjct: 775  -------GTVKRILKAGEE-------IMLSHFKLVDEIQFKELD--TYALNEEELVECVM 818

Query: 263  NVSMEYEGTLSGLRGYIALGTNYNYSEDVTC-RGRILLFDIIEVVPEPGQPLTKNKIKMI 321
               +  +G+  G      +GT Y   ++ T  RGRIL   I+EV PE         +K++
Sbjct: 819  RCDL-ADGS-GGTAERFVIGTAYLDDQNSTVERGRIL---ILEVTPE-------RVLKLV 866

Query: 322  YAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVK---- 377
                 KG    +    G +V A+ + I ++ ++    +    +    +  S   +     
Sbjct: 867  TEIAVKGGCRCLAMCEGKIVAALIKTIVVYDIEYRTQSKPDLVKAATFRCSTAPIDITVN 926

Query: 378  -NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKF 436
               I + D  +S+ ++ YQ                             RG          
Sbjct: 927  GTQIAIADLMKSMVVVEYQ-----------------------------RG---------- 947

Query: 437  LQLSLGERL-EICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLI 495
             +  L ++L E+ +         + E     ++ SD + N+++    P+    +   RL 
Sbjct: 948  -ETGLPDKLVEVARHFQVTWATAVAEVDENTYLESDAEGNLLVLYRDPKGVTDDDKRRLN 1006

Query: 496  KKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLL 555
              ++  LG+ VN     R +   ++ AP A      +  +++G++ +   L  +NY  LL
Sbjct: 1007 VSSEMLLGEMVN-----RIRRIDVATAPDAVVVPRAFMGTVEGSI-YLFALISQNYLDLL 1060

Query: 556  M-LQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEIC 614
            + LQ+ +       G ++   FR +K +      P+R  +DG L+ +FL      + +  
Sbjct: 1061 ITLQSNLGNLVVSPGNMDFAKFRAFKNQVRTEEEPNR-FVDGELIERFLDCEEDVQRKAI 1119

Query: 615  KKIGSKHNDI 624
            + +G +  DI
Sbjct: 1120 EGLGVELEDI 1129


>gi|320593036|gb|EFX05445.1| uv-damaged DNA-binding protein [Grosmannia clavigera kw1407]
          Length = 1504

 Score = 57.4 bits (137), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 59/321 (18%), Positives = 132/321 (41%), Gaps = 24/321 (7%)

Query: 320  MIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDN-----DLTGIAFIDTEVYIASMV 374
            ++ +   +GP   +  V   +V  + + + + +  +      +L  +A   T  Y+  + 
Sbjct: 1201 IVSSHRVRGPCRCLAMVDDLIVAGLSKTVVLSRYTETSSMSGELKKVASYRTATYVVDLA 1260

Query: 375  SVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGY-YAGNPSRGIIDGSLV 433
               ++I VGD  +S AL+ Y P   +      +      N KG     + S+ I +G   
Sbjct: 1261 VDGHMIAVGDMMKSTALVEYIPAT-SGDGEDEEDDGAGDNKKGKGKTADRSKTIAEGP-- 1317

Query: 434  WKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR 493
             K ++ + G +      +     D+  E    G        N+ +     +   ++   R
Sbjct: 1318 -KLVERARGYQASWATAVCHVEGDLWLEADGFG--------NLTMLERDVQGVTADDKRR 1368

Query: 494  LIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRR 553
            L    + +LG+ VN     R +P ++  +PGA      + A+++G++     +  +    
Sbjct: 1369 LRTVGEMYLGEMVN-----RIRPIAVETSPGAMVHPRAFLATVEGSIYMVGTIAPEAQDL 1423

Query: 554  LLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEI 613
            L+ LQ  +       G  +  A+R+++     +  P R  +DG L+ +FL +    + E+
Sbjct: 1424 LMNLQTKLAAIVKGPGNTSFSAYRSFRNAERESTEPFR-FVDGELLERFLDVGEDVQKEV 1482

Query: 614  CKKIGSKHNDILDELYDIEAL 634
             + +G    D+ + + +++ L
Sbjct: 1483 AQGLGPSVEDLRNIIEELKRL 1503


>gi|156097003|ref|XP_001614535.1| hypothetical protein [Plasmodium vivax Sal-1]
 gi|148803409|gb|EDL44808.1| hypothetical protein, conserved [Plasmodium vivax]
          Length = 2558

 Score = 57.4 bits (137), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 129/600 (21%), Positives = 221/600 (36%), Gaps = 129/600 (21%)

Query: 101  VFLCGPHPAWLFLTSRGELRAHPMTIDG--------PVSTLAPFHNV------NCPRGFL 146
            +F+C   P  ++ T + +L    ++I            + L PFHN       N    + 
Sbjct: 2018 LFVCCDSPIIIYSTLKKKLSISKLSIRNVHLVDMFSDFNYLNPFHNFLLFKKKNQNNSYF 2077

Query: 147  YFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYY 206
             F   ++L IS    +L+      + ++P   T   +AYH +T     +  TA P  + +
Sbjct: 2078 IFFDGNQLCIS----YLNEMKKTFMERIPFHRTVEKIAYHADTG----LLITACPVEEKH 2129

Query: 207  KFNGEDKELVT--DPRDSRFIPPLV--SQFHVSLFSPF---------SWEEIPQTNF--P 251
            K N   K++V   DP  + F    +  S+F VS    +         S  E+ QT+    
Sbjct: 2130 KTNQMMKQIVCFFDPFQNSFKYTYIIPSKFSVSSICIYELAPSSGGASMGEMEQTSQMGQ 2189

Query: 252  LHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVT--CRGRILLFDIIEVVPEP 309
            + +      LK    E       +R  I +GT  N +E +T    G I +F         
Sbjct: 2190 MEQTNQTNELKPSHPEERTDAPPVRTLICVGT-ANNNERITEPSSGHIYVF-------VA 2241

Query: 310  GQPLTKNKIKMIYAKEQK-GPVTAICHVAGFLVTAVGQKI--------------YIW--- 351
             +   + +IK +Y      G +T +      +V AV   +              YI+   
Sbjct: 2242 KKQTNQFEIKHVYTYNVSCGGITHLKQFRDKIVAAVNNTVVILDIGNFLANLGAYIYNSS 2301

Query: 352  ---QLKDND-LTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARD 407
               +++ ND    +A      +I S+  V+N I+VGD   S+ LL Y  E   L+ V RD
Sbjct: 2302 KAIKIESNDAFLEVASFTPSSWIMSLDVVENYIVVGDIMTSVTLLSYDFENAILNEVCRD 2361

Query: 408  YKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGF 467
            Y                      + +W             C  + +         S   F
Sbjct: 2362 Y----------------------ANIW-------------CTSVSA--------LSENHF 2378

Query: 468  MISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARS 527
            ++SD + N ++              +L   + F+ G  VN  F    +  ++ D    R+
Sbjct: 2379 LVSDMESNFLVLQKSNIKFNDEESFKLSLVSQFNHGSVVNKMFSTSLR--NLVDDEERRN 2436

Query: 528  RFLT-----WYASLDGALGFFLPLPE-KNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKG 581
              L        AS +G++   +P      ++R L ++  +  + S  G L+  ++R YK 
Sbjct: 2437 EILQKEQSILCASSEGSISALIPFSNFLQFKRALCIEIAINDNISSLGNLSHSSYREYKV 2496

Query: 582  KGYYAGNPSRGIIDGSLVWKFLQLSLGERLE-------ICKKIGSKHNDILDELYDIEAL 634
                A    +G++DG L   F  L    +L+       I KK+  K       + DIE L
Sbjct: 2497 S--LASKNCKGVVDGELFKMFFYLPFERQLKTYIYAKWIAKKLNCKLGSFEHFMLDIENL 2554


>gi|330792580|ref|XP_003284366.1| hypothetical protein DICPUDRAFT_86223 [Dictyostelium purpureum]
 gi|325085712|gb|EGC39114.1| hypothetical protein DICPUDRAFT_86223 [Dictyostelium purpureum]
          Length = 1064

 Score = 57.0 bits (136), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 100/456 (21%), Positives = 178/456 (39%), Gaps = 87/456 (19%)

Query: 186  HLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEI 245
            HLE  + C    T + + D    NGE+   + +  +       VS  +V LF   ++E  
Sbjct: 684  HLEEYS-CYAVITIKTNEDIISGNGENATTIDEVEEE------VS--YVRLFDDQTFE-- 732

Query: 246  PQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEV 305
            P ++F L  +E    L +   + +        Y+A+GT+ N   D    GR+LLF+I E 
Sbjct: 733  PLSSFRLEHYEMGWSLTSTKFDDDPCT-----YLAVGTSINIP-DRQTSGRVLLFNINEA 786

Query: 306  VPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFID 365
                       K+ ++     +  V  +    G L+ AV +++Y  +   +       I 
Sbjct: 787  ----------KKLVLLEEISFRSGVLYLHQFNGRLIAAVLKRLYSIRYSYSKEKNCKVIS 836

Query: 366  TE------VYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYY 419
            +E        I  + S  + +LVGD  +S++LL  Q E  +L  +A++ +P         
Sbjct: 837  SENVHKGHTMILKLASRGHFMLVGDMMKSMSLLG-QSENGSLVQIAKNPQP--------- 886

Query: 420  AGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLF 479
                         +W              + I   ++D         F+ S+   N V+ 
Sbjct: 887  -------------IW-------------IRSIAMINDDY--------FIGSETSNNFVVV 912

Query: 480  MYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGA 539
                ++        L     +H+G+ +N+           SDAP   +     YAS++G+
Sbjct: 913  KKNNDSTNELERELLDSVGHYHIGESINSMLCGSLVRLPDSDAPPIPT---ILYASVNGS 969

Query: 540  LGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLV 599
            +G    + +++Y     LQ  +    +  GG    ++R +    +     SR  IDG L+
Sbjct: 970  IGVIASISKEDYEFFSKLQKGLNRVVNGIGGFTHESWRAFSNDHHTV--ESRNFIDGDLI 1027

Query: 600  WKFLQLSLGERLEICKKIGSKHNDILDE-LYDIEAL 634
              F  L    ++E   K+    N  LDE L  IE+L
Sbjct: 1028 EMFPDL----KIESMAKVIQDMNVTLDETLKRIESL 1059


>gi|241260143|ref|XP_002404926.1| DNA repair protein xp-E, putative [Ixodes scapularis]
 gi|215496735|gb|EEC06375.1| DNA repair protein xp-E, putative [Ixodes scapularis]
          Length = 1148

 Score = 57.0 bits (136), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 79/361 (21%), Positives = 143/361 (39%), Gaps = 79/361 (21%)

Query: 294  RGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQL 353
            +GRI++F  ++            K++ +  KE KG   ++    G L+ ++   + +++ 
Sbjct: 845  QGRIIIFHWVD-----------GKLQQVAEKEIKGAPYSLLEFNGKLLASINSTVRLFEW 893

Query: 354  K-DNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQ 412
              + +L         +    + +  + ILVGD  RS++LL Y+P   +   +ARDY+   
Sbjct: 894  NAERELHNECSHFNNILALYLKTKGDFILVGDLMRSMSLLAYKPLEGSFEEIARDYQTN- 952

Query: 413  PNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDK 472
                                 W            +C        +ILD+ + +G      
Sbjct: 953  ---------------------W------------MCAV------EILDDDTFLG-----A 968

Query: 473  DKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARS--- 527
            +    LF+ Q ++  +    R  L +   FHLG+ VN F       S +   PG  S   
Sbjct: 969  ESTTNLFVCQKDSAATTDEDRQHLQEVGQFHLGEFVNIFR----HGSLVMQHPGEASSPT 1024

Query: 528  RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAF---RTYKGKGY 584
            +    + ++ GA+G    LP   Y  LL +Q  +       G ++   +   R +  + +
Sbjct: 1025 QGSVLFGTIHGAIGLVAQLPSDFYNFLLEVQGNLTKVIKSVGKIDHTLYPFVRLFTWRSF 1084

Query: 585  YA---GNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDELYD-IEAL 634
                    ++G IDG L+  FL LS  +  E+ + I      G K +  +D+L   IE L
Sbjct: 1085 STERKTEQAQGFIDGDLIESFLDLSRDKMQEVLQGIQMDDGSGMKRDATVDDLIKIIEEL 1144

Query: 635  S 635
            S
Sbjct: 1145 S 1145


>gi|453081643|gb|EMF09692.1| DNA damage-binding protein 1 [Mycosphaerella populorum SO2202]
          Length = 1151

 Score = 56.6 bits (135), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 114/547 (20%), Positives = 204/547 (37%), Gaps = 123/547 (22%)

Query: 97   GYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRI 156
            G Q VF    HP+ ++  S G +    +T D   S  A F +       +     SEL++
Sbjct: 684  GLQNVFATCEHPSLIY-GSEGRMVYSAVTADSATSICA-FDSFGDYANSIAIATGSELKL 741

Query: 157  SVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTY---CI---VTSTAEPSTDYYKFNG 210
            S +    +      V+ +P+  T   +AY  E K +   CI   + +  E    ++K   
Sbjct: 742  SSVDEERTTH----VQDLPVYETVRRIAYSSELKAFGLGCIKRTLAAGVEEVRSHFKLVD 797

Query: 211  EDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEG 270
            E                      V+  +  SW         L+E E V  +    ++   
Sbjct: 798  E----------------------VAFKALDSW--------ALNEDELVESVIRCPLDDGT 827

Query: 271  TLSGLRGYIALGTNYNYSEDV-TCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGP 329
             L   R    +GT Y   +D  T RGR+L+F++ E            +IK++     KG 
Sbjct: 828  GLDAER--FVVGTAYLDDQDANTARGRVLVFEVTE----------DRRIKLVTEMAVKGA 875

Query: 330  VTAICHVAGFLVTAVGQKIYIW-----------QL--KDNDLTGIAFIDTEVYIASMVSV 376
               +    G +V A+ + + I            QL  K +  T  A ID  ++ +S+   
Sbjct: 876  CRCLAVCKGRIVAALVKTVVILAYEFSPPKSSPQLIKKASYRTSTAPID--IFASSL--- 930

Query: 377  KNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKF 436
              LI + D  +S+ L++Y P            +  QP+S                     
Sbjct: 931  DGLIAISDLMKSLTLVKYTPG-----------RTGQPDS--------------------- 958

Query: 437  LQLSLGERLEICKKIGSKHNDILDEF-SSMGFMISDKDKNVVLFMYQPEARESNGGHRLI 495
                    +EI +   +     +     +  ++ SD + N+V+  + P    +    RL 
Sbjct: 959  -------LVEIARHFDTLWGTAVAPIPGTHSYIQSDAEGNLVVLEHDPTGFSAEDRRRLR 1011

Query: 496  KKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFL--TWYASLDGALGFFLPLPEKNYRR 553
              ++  LG+ VN     R +P +    P A +  +   + A+++G++  F  + ++    
Sbjct: 1012 VTSEMCLGEMVN-----RIRPITTVITPSANAVVIPKAFIATVEGSVYVFGTIAQQYQDL 1066

Query: 554  LLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGN--PSRGIIDGSLVWKFLQLSLGERL 611
            L+ LQ  M       G +    FR +K +    G   P R  +DG ++  FL LS   + 
Sbjct: 1067 LIRLQGSMAEMVKSPGFVRFNRFRGFKTQVRDMGEEGPVR-FVDGEIIEGFLGLSAEVQE 1125

Query: 612  EICKKIG 618
             + K +G
Sbjct: 1126 SVAKDLG 1132


>gi|393217872|gb|EJD03361.1| hypothetical protein FOMMEDRAFT_108572 [Fomitiporia mediterranea
            MF3/22]
          Length = 1213

 Score = 56.2 bits (134), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 69/327 (21%), Positives = 136/327 (41%), Gaps = 51/327 (15%)

Query: 318  IKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVK 377
            +++++  E      A+    G L   VG+ + I+++    L  +  ++T+ Y +++V++ 
Sbjct: 932  LELVHKTEADDVPMALMAFQGRLCAGVGKSLRIYEIGKKKL--LRKVETKTYGSAIVTLN 989

Query: 378  ---NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVW 434
               + I+VGD   SI    ++P    L + A D +P    S               +++ 
Sbjct: 990  TQGSRIIVGDMQESIVYAVFKPPENRLLIFADDSQPRWTTS---------------AVMV 1034

Query: 435  KFLQLSLGERLE--ICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGH 492
             +  ++ G++       ++ SK +D +DE  +   ++ +K     L M  P        H
Sbjct: 1035 DYTTIAAGDKFGNVFINRLDSKISDQVDEDPTGAGILHEKG----LLMGAP--------H 1082

Query: 493  RLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEK-NY 551
            +      FH+G  V +  K       IS   G R   L  Y  L G +G  +P   K + 
Sbjct: 1083 KTGMIAHFHVGDIVTSIHK-------ISLVAGGREVLL--YTCLHGTIGILVPFVSKEDV 1133

Query: 552  RRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERL 611
              +  L+  M +      G +  A+R     GYY   P + ++DG L  +F +L   ++ 
Sbjct: 1134 DFISTLEQHMRSEKLSLVGRDHLAWR-----GYYV--PVKAVVDGDLCEQFARLPANKQS 1186

Query: 612  EICKKIGSKHNDILDELYDIEALSSHF 638
             I  ++     ++L +L  +   +S F
Sbjct: 1187 AIAVELDRTVGEVLKKLEQLRVTASGF 1213


>gi|390366809|ref|XP_780126.3| PREDICTED: DNA damage-binding protein 1-like isoform 1
           [Strongylocentrotus purpuratus]
          Length = 630

 Score = 56.2 bits (134), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 80/342 (23%), Positives = 136/342 (39%), Gaps = 64/342 (18%)

Query: 304 EVVPEPGQPL----TKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLT 359
           E  P+ G+ +    +  K++ I  KE KG   ++    G L+ +V   + +++       
Sbjct: 331 EAEPKSGRIVVFQYSDGKLQEIAEKEIKGAPYSLVEFNGKLLASVNSVVRLFEWTPEHSL 390

Query: 360 GIAFIDTEVYIASMVSVK-NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGY 418
            +        +A  +  K + I+VGD  RSI LL Y+P    L  +ARDY P        
Sbjct: 391 RVECSHYNNVLALYLKTKGDFIVVGDLMRSITLLAYKPMEGCLEEIARDYSPN------- 443

Query: 419 YAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVL 478
                          W    +S  E              ILD+ + +G   ++   N  L
Sbjct: 444 ---------------W----MSAVE--------------ILDDDTFLG---AENSSN--L 465

Query: 479 FMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSSISDA--PGARSRFLTWYA 534
           F  Q ++  +    R  L +   FHLG+ VN F        +I ++  P   S     + 
Sbjct: 466 FTCQKDSAATTDEERRHLQEVGLFHLGEFVNVFRHGSLVMQNIGESTIPTTGS---VLFG 522

Query: 535 SLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGII 594
           ++ G++G    L E+ YR LL +QN +       G +    +R++  +      P    I
Sbjct: 523 TVSGSVGLVTQLNEEFYRFLLEVQNKLTKVIKSVGKIKHSFWRSFYSE--RKTEPMDNFI 580

Query: 595 DGSLVWKFLQLSLGERLEICKKI-----GSKHNDILDELYDI 631
           DG L+  FL LS     E+ + +     G K + + ++L  I
Sbjct: 581 DGDLLESFLDLSRDTMDEVAQGLQIDDGGMKRDCMANDLIKI 622


>gi|449549048|gb|EMD40014.1| hypothetical protein CERSUDRAFT_63520 [Ceriporiopsis subvermispora B]
          Length = 1265

 Score = 55.5 bits (132), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 45/199 (22%), Positives = 91/199 (45%), Gaps = 11/199 (5%)

Query: 285  YNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAV 344
            +   E     GR+LLF I     +        +++++  ++ KG V  I  V  F+  A+
Sbjct: 959  FEVEETEPTSGRLLLFAIGS---DGATSSADGELRLVTTQDVKGCVFQITSVNNFIAAAI 1015

Query: 345  GQKIYIWQLKDND----LTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRT 400
               + ++ L+D +    L  +A  +   ++ ++ S  + ++VGD   S++LLR       
Sbjct: 1016 NSNVVLFALRDTNKQYALQQVADWNHNYFVTNLASHGDRLIVGDAISSVSLLRVS--VAR 1073

Query: 401  LSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKH-NDIL 459
            +  ++RDY P  P +    A N   G      ++ F    +  R ++ ++ GS H +DI+
Sbjct: 1074 IECLSRDYSPLWPVAVEATAENQIIGANSDCNLFSFALQHIDGR-KVLERDGSYHLDDIV 1132

Query: 460  DEFSSMGFMISDKDKNVVL 478
            ++F+  G + +D      L
Sbjct: 1133 NKFAPGGLVAADSSTGYTL 1151


>gi|303313681|ref|XP_003066852.1| CPSF A subunit region family protein [Coccidioides posadasii C735
            delta SOWgp]
 gi|240106514|gb|EER24707.1| CPSF A subunit region family protein [Coccidioides posadasii C735
            delta SOWgp]
 gi|320031496|gb|EFW13458.1| UV-damaged DNA binding protein [Coccidioides posadasii str. Silveira]
          Length = 1144

 Score = 55.5 bits (132), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 77/373 (20%), Positives = 140/373 (37%), Gaps = 82/373 (21%)

Query: 250  FPLHEWEHVLCLKNVSMEYEGTLSGL-----RGYIALGTNY-NYSEDVTCR--GRILLFD 301
            F L+  E V C+  +  E+ G+ + +     R    +GT+  +  E+   R  GRIL+FD
Sbjct: 799  FDLNPNELVECV--IRTEHPGSNAQMGSSRPRDIFIVGTSVLDTPEEAEARTKGRILIFD 856

Query: 302  IIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGI 361
            +           T  +++ I     +G   A+  +   +V A+ + + +  +K  +L   
Sbjct: 857  VD----------TNRELRKICDFPVRGACRALAMINNKIVAALMKTVVVLNIKKGNLYNF 906

Query: 362  AFIDTEVYIASMVSVK-----NLILVGDYARSIALLRY------QPEYRTLSLVARDYKP 410
                   Y  S   V      N+I V D  +SI+L+ Y      QP+  TL  VAR Y+ 
Sbjct: 907  EIEKEASYRTSTAPVDISVTGNIIAVADLMKSISLVEYHAGEGGQPD--TLKEVARHYQT 964

Query: 411  TQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMIS 470
                +    A N                                            F+++
Sbjct: 965  LWTTAAAPVAENE-------------------------------------------FLVA 981

Query: 471  DKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFL 530
            D + N+V+          +   R+   ++  LG+ VN     R  P  +  +P +     
Sbjct: 982  DAEGNLVVLNRNTTGVTEDDRRRMQVTSELRLGEMVN-----RIHPMDLQTSPESPVIPK 1036

Query: 531  TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPS 590
             + A++DG++  F  +       L+ LQ+ +    +  G +    +R +K     A  P 
Sbjct: 1037 AFLATVDGSIYLFGLISPSAQDTLMRLQSALADFVASPGEIPFNKYRAFKSSVRQAEEPF 1096

Query: 591  RGIIDGSLVWKFL 603
            R  +DG L+ +FL
Sbjct: 1097 R-FVDGELIEQFL 1108


>gi|157128864|ref|XP_001655231.1| DNA repair protein xp-e [Aedes aegypti]
 gi|108882186|gb|EAT46411.1| AAEL002407-PB [Aedes aegypti]
          Length = 1138

 Score = 55.5 bits (132), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 118/570 (20%), Positives = 204/570 (35%), Gaps = 113/570 (19%)

Query: 66   FVSDR-SKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPM 124
            FV D+ + R  +Q  +  G + + ++ F +++    VF C   P  ++ ++      H +
Sbjct: 615  FVLDKNTNRLTDQKKVTLGTQPTILKTFRSLS-TTNVFACSDRPTVIYSSN------HKL 667

Query: 125  TIDGPVSTLAPFHNVNCPR--GFLYFNAKS-------ELRISVLPTHLSYDAPWPVRKVP 175
                       F NVN          NA++         + SV+   +       +R VP
Sbjct: 668  V----------FSNVNLKEVNHMCSLNAEAYQDSLALATKNSVILGTIDEIQKLHIRTVP 717

Query: 176  LKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPL-----VS 230
            L  +P  +AY   ++T+ ++T                  + TD +DS  + P        
Sbjct: 718  LGESPRRIAYQEASQTFGVIT------------------VRTDIQDSSGLTPSRQSASTQ 759

Query: 231  QFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSED 290
              +V+L +     +   +N    E+   + + N+ +  + T   L  +  + T Y  S  
Sbjct: 760  TTNVTLSTNMGLLKAGASN---AEFGQEVEVHNLLIIDQNTFEVLHAHQFMQTEYAMSLI 816

Query: 291  VTCRGR----ILLFDIIEVVPEPGQPLTKNKIKMIYA---------KEQKGPVTAICHVA 337
                G       +     V PE  +P     I   YA         KE KG   ++    
Sbjct: 817  SAKLGNDPNTYYIVGTALVNPEEPEPKVGRIIIYHYADGNLTQVSEKEIKGSCYSLVEFN 876

Query: 338  GFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVK-NLILVGDYARSIALLRYQP 396
            G ++ ++   + +++  D+    +        +A     K + ILVGD  RSI LL+Y+ 
Sbjct: 877  GRVLASINSTVRLYEWTDDKDLRLECSHFNNVLALYCKTKGDFILVGDLMRSITLLQYKQ 936

Query: 397  EYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHN 456
               +   +ARDY+P    +           I+D      FL       L +C K G+   
Sbjct: 937  MEGSFEEIARDYQPNWMTAVE---------ILDDD---AFLGADNSNNLFVCLKDGAATT 984

Query: 457  DILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKP 516
            D  DE   M                 PE  +             HLG  VN F       
Sbjct: 985  D--DERQQM-----------------PEVAQ------------VHLGDMVNVFRHGSLVM 1013

Query: 517  SSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAF 576
             +I +     S  +  + ++ GA+G    +P   Y  L  LQ  +       G ++   +
Sbjct: 1014 ENIGERTTPTSGCV-LFGTVSGAIGLVTQIPADYYEFLRKLQENLTDTIKSVGKIDHAYW 1072

Query: 577  RTYKGKGYYAGNPSRGIIDGSLVWKFLQLS 606
            R++  +         G IDG LV  FL LS
Sbjct: 1073 RSFHTE--MKTERCEGFIDGDLVESFLDLS 1100


>gi|300122534|emb|CBK23104.2| unnamed protein product [Blastocystis hominis]
          Length = 172

 Score = 55.5 bits (132), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 34/141 (24%), Positives = 65/141 (46%), Gaps = 10/141 (7%)

Query: 494 LIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRR 553
           ++++ DFHL   + +       P S+ D      + +    + +GA+G FL +  + Y +
Sbjct: 28  VVRQADFHLASQITSIL-----PISLPDG-----QCINVILTAEGAMGVFLFVTGEEYTK 77

Query: 554 LLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEI 613
           L  LQ  ++        LN   FR Y   G       +G++D  ++ K+L LS  E+ +I
Sbjct: 78  LSSLQKRLIEALPQNAALNNFNFRKYMSDGMMKYPRRKGVLDMGVIRKYLMLSTQEQEDI 137

Query: 614 CKKIGSKHNDILDELYDIEAL 634
            K +  +  +I + +Y  + L
Sbjct: 138 AKSLDLETKEITEVIYRTDKL 158


>gi|449540702|gb|EMD31691.1| hypothetical protein CERSUDRAFT_109269 [Ceriporiopsis subvermispora
            B]
          Length = 1265

 Score = 55.1 bits (131), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 48/204 (23%), Positives = 96/204 (47%), Gaps = 12/204 (5%)

Query: 281  LGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGF 339
            +GT +   E+   R GR+LLF I     +        +++++  ++ KG V  I  V  F
Sbjct: 954  VGTAFLEVEETEPRSGRLLLFAIGS---DGATSSADGELRLVATQDVKGCVFQITSVNSF 1010

Query: 340  LVTAVGQKIYIWQLKDND----LTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQ 395
            +  A+   + ++ L++ +    L  +A  +   ++ ++ S  +L++VGD   S++LLR  
Sbjct: 1011 IAAAISSNVVLFALRNTNKQYALQQVADWNHNYFVTNLASHGDLLIVGDAISSVSLLRVS 1070

Query: 396  PEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKH 455
                 +  ++RDY P +P +    A N   G      ++ F    +  R ++ ++ GS H
Sbjct: 1071 DSR--IECLSRDYGPLRPVAVEATAENQIIGANSYCNLFSFALQHIDGR-KVLERDGSYH 1127

Query: 456  -NDILDEFSSMGFMISDKDKNVVL 478
             +DI+ +F   G + +D      L
Sbjct: 1128 LDDIVKKFVPGGLVAADSSTGYTL 1151


>gi|429961863|gb|ELA41407.1| hypothetical protein VICG_01512 [Vittaforma corneae ATCC 50505]
          Length = 1153

 Score = 55.1 bits (131), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 73/364 (20%), Positives = 145/364 (39%), Gaps = 64/364 (17%)

Query: 275  LRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAIC 334
               ++ + T++   ED   +G+++++ ++ +VP+P       K+K+I ++  K P     
Sbjct: 835  FNNFLVVCTSFPEGEDKMTKGKLIVYSLVNIVPDPDNLHITKKLKLICSETLKNPCLFCE 894

Query: 335  HVAGFLVTAVGQKIYIWQLKDNDLTGIAFI---DTEVYIASMVSVKNLILVGDYARSIAL 391
             V   +   VG K+ I++  +N  TG+A +   +  +   S+   KNLI V D    I  
Sbjct: 895  EVRSLISVCVGTKLMIYEFNEN--TGLAAVGRHELSLLCTSLFVTKNLIAVSDIMNGIYF 952

Query: 392  LRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDG-----SLVWKFLQLSLGERLE 446
               +P         RD  P + +  G     P+   + G     S     LQ S+   + 
Sbjct: 953  FFLRP---------RD--PLKLHLLGRSCLVPNCRFLGGIDFCPSFETDALQFSI---VS 998

Query: 447  ICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHV 506
            +CK                G         V +F Y P    S  G++L+K+ +  + +  
Sbjct: 999  VCK---------------YGI--------VRIFTYSPYDPVSKNGNQLVKRAEI-VTKLA 1034

Query: 507  NTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTS 566
            N  +K+           G  + F     S+  +    + L   N+ +L  +Q+ +    S
Sbjct: 1035 NPLYKV---------VFGQINEF----ESILLSSNVMVLLRAINFPKLQAIQHCISIFIS 1081

Query: 567  HTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILD 626
            +  G+N    R Y     +     + +I   ++ +F       + +ICK +G  + +I++
Sbjct: 1082 NRCGIN---VRNYLETEEFVNPECKSVICEKILLEFFYFKPLVQEKICKLVGLDYFNIVE 1138

Query: 627  ELYD 630
             + D
Sbjct: 1139 LIED 1142


>gi|195500686|ref|XP_002097479.1| GE26244 [Drosophila yakuba]
 gi|194183580|gb|EDW97191.1| GE26244 [Drosophila yakuba]
          Length = 1140

 Score = 55.1 bits (131), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 71/318 (22%), Positives = 119/318 (37%), Gaps = 68/318 (21%)

Query: 305  VVPEPGQPLT---------KNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQ--KIYIWQL 353
            V+PE  +P           +NK+  +   +  G   A+    G ++  +G   ++Y W  
Sbjct: 837  VIPEEPEPKVGRIIIFHYHENKLTQVAETKVDGTCYALVEFNGKVLAGIGSFVRLYEWT- 895

Query: 354  KDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQP 413
             + +L     I   +    + +  + ILVGD  RSI LL+++        +ARD +P   
Sbjct: 896  NEKELRMECNIQNMIAALYLKAKGDFILVGDLMRSITLLQHKQMEGIFVEIARDCEPK-- 953

Query: 414  NSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKD 473
                                W                   +  +ILD+ + +G      +
Sbjct: 954  --------------------WM------------------RAVEILDDDTFLG-----SE 970

Query: 474  KNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFL- 530
             N  LF+ Q ++  +    R  L +   FHLG  VN F       S +    G R+  + 
Sbjct: 971  TNGNLFVCQKDSAATTDEERQLLPELARFHLGDTVNVFRH----GSLVMQNVGERTTPIN 1026

Query: 531  --TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGN 588
                Y + +GA+G    +P+  Y  L  LQ  +       G +    +R ++        
Sbjct: 1027 GCVLYGTCNGAIGIVTQIPQDFYDFLHGLQERLKKIIKSVGKIEHTYYRNFQINNKV--E 1084

Query: 589  PSRGIIDGSLVWKFLQLS 606
            PS G IDG L+  FL LS
Sbjct: 1085 PSEGFIDGDLIESFLDLS 1102


>gi|290998415|ref|XP_002681776.1| damage-specific DNA binding protein 1 [Naegleria gruberi]
 gi|284095401|gb|EFC49032.1| damage-specific DNA binding protein 1 [Naegleria gruberi]
          Length = 1103

 Score = 55.1 bits (131), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 78/379 (20%), Positives = 143/379 (37%), Gaps = 81/379 (21%)

Query: 278  YIALGTNYNY-SEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
            Y  +GT      E+   +GRIL+  +             +K+ +   K+ KG V  +   
Sbjct: 781  YFIVGTAITEGDEEEPSKGRILVLQV-----------QDDKLVLKAEKDVKGAVMVLHSF 829

Query: 337  AGFLVTAVGQKIYI--WQLKDN----DLTGIAFIDTEVYIASMVSVKNLILVGDYARSIA 390
             G L+  V  ++ +  W   D+    DL         +YI  + S  + IL+GD  +S+ 
Sbjct: 830  NGKLLAGVSGRLMLFKWAESDDGDNKDLVQECSCSGGIYILDIDSHGDFILIGDMMKSVH 889

Query: 391  LLRYQ-PEYR----TLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERL 445
            L  Y+ PE +     L L+++DY+ +                      W    L L E  
Sbjct: 890  LFVYENPEEQHVSGNLRLISKDYQYS----------------------WLSCSLMLNES- 926

Query: 446  EICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQH 505
                                 ++  D+  N++      EA       +L++   ++    
Sbjct: 927  --------------------EYVAVDQQGNMITLKKNDEAASEEERKQLVRVGKYYCSDR 966

Query: 506  VNT----FFKIRCKPSS--ISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQN 559
            VN     F  +R   SS  I+  P   + F     ++ G +G    LP + +  +  +Q 
Sbjct: 967  VNRIQPGFIGMRFANSSSDINTQPVKTALF----GTISGGIGVLAQLPPETFAFVTKIQK 1022

Query: 560  VMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGS 619
             M +  +    ++   +R Y+ +       S G IDG  V  FL+     +  + +++ +
Sbjct: 1023 AMSSVVTGLANISRETYRQYRSE--RTREDSVGFIDGDFVESFLEFDFETQQRVIEELSN 1080

Query: 620  KHND--ILDELY-DIEALS 635
             H +   L+EL  +IE LS
Sbjct: 1081 NHQEQITLEELVKNIEDLS 1099


>gi|156049323|ref|XP_001590628.1| hypothetical protein SS1G_08368 [Sclerotinia sclerotiorum 1980]
 gi|154692767|gb|EDN92505.1| hypothetical protein SS1G_08368 [Sclerotinia sclerotiorum 1980 UF-70]
          Length = 1153

 Score = 55.1 bits (131), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 75/364 (20%), Positives = 138/364 (37%), Gaps = 76/364 (20%)

Query: 281  LGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQ-KGPVTAICHVAGF 339
            +GT++ +  +V  RGR+L+F +             ++   I A    KG    I  + G 
Sbjct: 835  VGTSFLHDGEVNIRGRLLIFGV-----------NSDRTPYIIASHTLKGSCRCIGVLNGK 883

Query: 340  LVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVK-----NLILVGDYARSIALLRY 394
            +V A+ + + ++  ++   T         Y  +   +      N+I V D  +S+AL+ Y
Sbjct: 884  IVAALNKTVVMYDYEETSRTTANLRKVATYRCATCPIDIDIRGNIIAVADIMKSVALVEY 943

Query: 395  QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
             P          D  P +    G +A              +    S+ E           
Sbjct: 944  TP--------GVDGLPDKLEEVGRHA-------------QQVFATSIAE----------- 971

Query: 455  HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC 514
                     +  ++ SD D N+++     E        RL    + +LG+ VN   +I  
Sbjct: 972  -------VDTDTYLESDHDGNLIVLKRNREGVTREDKLRLEVLCEMNLGEMVNKIKRINV 1024

Query: 515  KPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVM-------VTHTSH 567
            + S   DA      F+   A+ +G++  F  +P +N   L+ LQ+ +       +T +S 
Sbjct: 1025 ETSK--DALLIPRAFV---ATTEGSIYLFSLIPPQNQDLLMRLQSRLASLPARSLTDSSF 1079

Query: 568  T-------GGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 620
            +       G L+   +R+Y         P R  +DG L+ +FL L    +  IC  +G +
Sbjct: 1080 SAPIEFSPGNLDFDKYRSYVSAVRETNEPFR-FVDGELIERFLDLDGAIQENICDGLGVR 1138

Query: 621  HNDI 624
              D+
Sbjct: 1139 AEDL 1142


>gi|392864500|gb|EAS34654.2| UV-damaged DNA binding protein [Coccidioides immitis RS]
          Length = 1144

 Score = 55.1 bits (131), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 65/321 (20%), Positives = 118/321 (36%), Gaps = 72/321 (22%)

Query: 294  RGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQL 353
            +GRIL+FD+           T  +++ I     +G   A+  +   +V A+ + + +  +
Sbjct: 849  KGRILVFDVD----------TNRELRKICDFPVRGACRALAMINNKIVAALMKTVVVLNI 898

Query: 354  KDNDLTGIAFIDTEVYIASMVSVK-----NLILVGDYARSIALLRY------QPEYRTLS 402
            K  +L          Y  S   V      N+I V D  +SI+L+ Y      QP+  TL 
Sbjct: 899  KKGNLYNFEIEKEASYRTSTAPVDISVTGNIIAVADLMKSISLVEYHAGEGGQPD--TLK 956

Query: 403  LVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEF 462
             VAR Y+     +    A N                                        
Sbjct: 957  EVARHYQTLWTTAAAPVAENE--------------------------------------- 977

Query: 463  SSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDA 522
                F+++D + N+V+          +   R+   ++  LG+ VN     R  P  +  +
Sbjct: 978  ----FLVADAEGNLVVLNRDTTGVTEDDRRRMQVTSELRLGEMVN-----RIHPMDLQTS 1028

Query: 523  PGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGK 582
            P +      + A++DG++  F  +       L+ LQ+ +    +  G +    +R +K  
Sbjct: 1029 PESPVIPKAFLATVDGSIYLFGLISPSAQDTLMRLQSALADFVASPGEIPFNKYRAFKSS 1088

Query: 583  GYYAGNPSRGIIDGSLVWKFL 603
               A  P R  +DG L+ +FL
Sbjct: 1089 VRQAEEPFR-FVDGELIEQFL 1108


>gi|307186138|gb|EFN71863.1| DNA damage-binding protein 1 [Camponotus floridanus]
          Length = 1136

 Score = 54.7 bits (130), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 76/364 (20%), Positives = 136/364 (37%), Gaps = 69/364 (18%)

Query: 278  YIALGTNY-NYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
            Y  +GT + N  E     GRILLF             +  K+  +  KE KG   ++   
Sbjct: 824  YYVVGTAFINPDETEPKMGRILLFH-----------WSDGKLSQVAEKEIKGSCYSLVEF 872

Query: 337  AGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVK-NLILVGDYARSIALLRYQ 395
             G L+ ++   + +++        +        IA  +  K + +LVGD  RS+ LL+Y+
Sbjct: 873  NGKLLASINSTVRLFEWTAEKELRLECSHFNNIIALYLKTKSDFVLVGDLMRSLTLLQYK 932

Query: 396  PEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKH 455
                +   +ARDY P    S                                        
Sbjct: 933  TMEGSFEEIARDYNPNWMTSI--------------------------------------- 953

Query: 456  NDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIR 513
             +ILD+ + +G      +    LF+ Q ++  ++   R  + +   FHLG  VN F    
Sbjct: 954  -EILDDDTFLG-----AENCFNLFICQKDSAATSEDERQQMQEVGQFHLGDMVNVFRHGS 1007

Query: 514  CKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNP 573
                ++ ++    ++    + ++ GA+G    +P   Y  L  L++ + +     G +  
Sbjct: 1008 LVMQNLGES-STPTQGCVLFGTVSGAIGLVTQIPFGFYEFLRNLEDKLTSVIKSVGKIEH 1066

Query: 574  RAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDE 627
              +R++K           G IDG L+  FL LS  +  E+   +      G K    +D+
Sbjct: 1067 NFWRSFKTD--LKIEQCEGFIDGDLIESFLDLSHDKMAEVAMGLMMDDGSGMKKEATVDD 1124

Query: 628  LYDI 631
            L  I
Sbjct: 1125 LVKI 1128


>gi|328874742|gb|EGG23107.1| UV-damaged DNA binding protein1 [Dictyostelium fasciculatum]
          Length = 1116

 Score = 54.7 bits (130), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 70/347 (20%), Positives = 133/347 (38%), Gaps = 71/347 (20%)

Query: 278  YIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA 337
            YI +GT Y+  +   C GRIL+F +I+           +++ ++     +G +  +    
Sbjct: 815  YIVVGTTYHCHDRKEC-GRILVFKMID-----------SRLILLDETTVRGSIFCMIAFN 862

Query: 338  GFLVTAVGQKI--YIWQLKDNDLTGIAFIDTEVYIASMVSV-----KNLILVGDYARSIA 390
            G L+ A+ + +  Y W     D +       E+Y     S+      + +LVGD  +S+A
Sbjct: 863  GQLLVAINKSVHRYTWS---GDSSSGKLTGEEIYGGHTASLYLAGRGDFVLVGDMMKSMA 919

Query: 391  LLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKK 450
            LL+            +D K    +S+ ++    +                          
Sbjct: 920  LLQAS---------GKDVKELSRSSQPFWLTGLT-------------------------- 944

Query: 451  IGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFF 510
                    +D+ + +G   SD   N++L     E         L      H G+ +N F 
Sbjct: 945  -------FIDDDTYLG---SDNSYNLILMKKNTETANEVDSQLLDNIGHIHTGEFINRFH 994

Query: 511  KIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGG 570
                   +  D+P   S     +A++ G +G    + +++Y     LQ  +       GG
Sbjct: 995  HGTLATLTDVDSPKPNS---IIFATISGCIGVISTISKQDYDFFSKLQVGLNRVIRGIGG 1051

Query: 571  LNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI 617
             +   +R+++ + + +   SR  IDG LV +FL L   + LE+ K +
Sbjct: 1052 FSHDRWRSFQNE-HISNIESRNFIDGDLVEQFLHLRHDKMLEVTKDM 1097


>gi|241952575|ref|XP_002419009.1| pre-mRNA-splicing factor, putative; pre-spliceosome component,
            putative [Candida dubliniensis CD36]
 gi|223642349|emb|CAX42591.1| pre-mRNA-splicing factor, putative [Candida dubliniensis CD36]
          Length = 1187

 Score = 54.7 bits (130), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 42/146 (28%), Positives = 68/146 (46%), Gaps = 17/146 (11%)

Query: 492  HRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNY 551
            ++L    +FH+G  + T   + C      +  G  S     Y  L G +G  +PL  K+ 
Sbjct: 1057 YKLQNLIEFHIGDII-TSLNLGCL-----NLAGTES---VIYTGLQGTIGLLVPLVSKSE 1107

Query: 552  RRLLM-LQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGER 610
              LL  LQ +M    ++  G +   FR+Y        NP + +IDG L+ +FL+     R
Sbjct: 1108 VELLFNLQLLMQQFQNNLVGKDHLKFRSYY-------NPIKNVIDGDLLERFLEFDTSLR 1160

Query: 611  LEICKKIGSKHNDILDELYDIEALSS 636
            +EI +K+    NDI  +L D+   S+
Sbjct: 1161 IEISRKLNKSVNDIEKKLIDLRNRSA 1186


>gi|303271531|ref|XP_003055127.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226463101|gb|EEH60379.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 1223

 Score = 54.3 bits (129), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 59/265 (22%), Positives = 101/265 (38%), Gaps = 65/265 (24%)

Query: 370  IASMVSVK-NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGII 428
            +A  V V+ + I+VGD  +SI+LL Y+P+   +   ARD+ P                  
Sbjct: 990  VALYVDVRGDFIVVGDLMKSISLLVYKPDEGVIEERARDFNPN----------------- 1032

Query: 429  DGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARES 488
                 W            +C          LD+ + +G   ++   N+       +A   
Sbjct: 1033 -----WM---------TAVCA---------LDDETYLG---AENSFNLFTVRKNSDAAAD 1066

Query: 489  NGGHRLIKKTDFHLGQHVNTF---------------FKIRCKPSSISDAPGARSRFLTWY 533
                RL    ++HLG+ VN F                 +     + ++AP         +
Sbjct: 1067 EERSRLDVIGEYHLGEFVNRFRAGSLVMRLPGDGDGAGLGLGLDASNEAP------TQLF 1120

Query: 534  ASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGI 593
             +++GA+G    LPE  +  L  LQ  M    S  GG +  A+R++  +       +RG 
Sbjct: 1121 GTVNGAIGVVASLPESTHTFLAALQKAMNKVVSGVGGFSHDAWRSFHNEHRSRLVEARGF 1180

Query: 594  IDGSLVWKFLQLSLGERLEICKKIG 618
            +DG L+  FL L   +  E+   +G
Sbjct: 1181 VDGDLIESFLDLRPEKASEVASVVG 1205


>gi|345498295|ref|XP_001607743.2| PREDICTED: DNA damage-binding protein 1-like [Nasonia vitripennis]
          Length = 1140

 Score = 54.3 bits (129), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 100/486 (20%), Positives = 177/486 (36%), Gaps = 91/486 (18%)

Query: 171  VRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVS 230
            +R VPL  +P  +AY   T+T+ ++T       D  + NG +   +  P  S     + +
Sbjct: 713  IRTVPLYESPRRIAYQESTQTFGVITM----RVDIQESNGVN---IARPSASTQAASISN 765

Query: 231  QFHVSLFSPFS------WEEIPQTNFPL---HEWEHVLCLKNVSMEYEGTLSGLR----- 276
              H+   +  S       +E+   N  +   H +E +     V  EY  +L   +     
Sbjct: 766  SNHIPTHNKPSNTASEIGQEVEIHNLLIVDQHTFEVLHAHTLVPTEYAMSLISTKLGEDP 825

Query: 277  -GYIALGTNY-NYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAIC 334
              Y  +GT   N  E     GRILL+                K+  +  KE KG   ++ 
Sbjct: 826  TPYYIVGTAMINPDESEPKSGRILLYH-----------WNDGKLTQVAEKEIKGSCYSLV 874

Query: 335  HVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVK-NLILVGDYARSIALLR 393
               G L+ ++   + +++        +        IA  +  K + +LVGD  RS+ LL+
Sbjct: 875  EFNGKLLASINSTVRLFEWTAEKELRLECSHFNNIIALYLKTKGDFVLVGDLMRSVTLLQ 934

Query: 394  YQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGS 453
            Y+    +   +ARDY P    S                                      
Sbjct: 935  YKTMEGSFEEIARDYNPNWMTSI------------------------------------- 957

Query: 454  KHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFK 511
               +ILD+ + +G      +    LF+ Q ++  ++   R  + +   FHLG  VN F  
Sbjct: 958  ---EILDDDTFLG-----AENCFNLFVCQKDSAATSEEERQQMQEVGQFHLGDMVNVFRH 1009

Query: 512  IRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGL 571
                   + ++    +     + ++ GA+G    +P   Y  L  L++ + +     G +
Sbjct: 1010 GSLVMQHLGES-STPTHGCVLFGTVCGAIGLVTQIPSTFYEFLRNLEDRLTSVIKSVGKI 1068

Query: 572  NPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHNDIL 625
                +R++            G IDG L+  FL LS  +  E+   I      G K    +
Sbjct: 1069 EHNFWRSFNTD--LKIEQCEGFIDGDLIESFLDLSHEKMAEVAMGIVIDDGSGMKKEATV 1126

Query: 626  DELYDI 631
            D+L  I
Sbjct: 1127 DDLVKI 1132


>gi|195329354|ref|XP_002031376.1| GM24084 [Drosophila sechellia]
 gi|194120319|gb|EDW42362.1| GM24084 [Drosophila sechellia]
          Length = 1140

 Score = 54.3 bits (129), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 77/353 (21%), Positives = 135/353 (38%), Gaps = 74/353 (20%)

Query: 305  VVPEPGQP---------LTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQ--KIYIWQL 353
            V+PE  +P           +NK+  +   +  G   A+    G ++  +G   ++Y W  
Sbjct: 837  VIPEEPEPKVGRIIIFHYNENKLTQVAETKVDGTCYALVEFNGKVLAGIGSFVRLYEWT- 895

Query: 354  KDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQP 413
             + +L     I   +    + +  + ILVGD  RSI LL+++        +ARD +P   
Sbjct: 896  NEKELRMECNIQNMIAALYLKAKGDFILVGDLMRSITLLQHKQMEGIFVEIARDCEPK-- 953

Query: 414  NSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKD 473
                                W                   +  +ILD+ + +G      +
Sbjct: 954  --------------------WM------------------RAVEILDDDTFLG-----SE 970

Query: 474  KNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFL- 530
             N  LF+ Q ++  +    R  L +   FHLG  VN F       S +    G R+  + 
Sbjct: 971  TNGNLFVCQKDSAATTDEERQLLPELARFHLGDTVNVFRH----GSLVMQNVGERTTPIN 1026

Query: 531  --TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGN 588
                Y + +GA+G    +P+  Y  L  L+  +       G +  + +R ++   +    
Sbjct: 1027 GCVLYGTCNGAIGIVTQIPQDFYDFLHGLEERLKKIIKLVGKIGHKFYRNFRI--HTQVE 1084

Query: 589  PSRGIIDGSLVWKFLQLSLG------ERLEICKKIGSKHNDILDELYDIEALS 635
            PS+G IDG L+  FL LS        + LE+      K  D+ D +  +E L+
Sbjct: 1085 PSQGFIDGDLIESFLDLSRDKMRDAVQGLELTLNGERKSADVEDVIKIVEDLT 1137


>gi|195108657|ref|XP_001998909.1| GI23368 [Drosophila mojavensis]
 gi|193915503|gb|EDW14370.1| GI23368 [Drosophila mojavensis]
          Length = 1140

 Score = 54.3 bits (129), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 66/299 (22%), Positives = 113/299 (37%), Gaps = 59/299 (19%)

Query: 315  KNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIAS 372
            +NK+  +   +  G   A+    G ++  +G   ++Y W   + +L     I   +    
Sbjct: 856  ENKLTQVAETKVDGTCYALVEFNGKVLAGIGSFVRLYEWT-NEKELRMECNIQNMIAALF 914

Query: 373  MVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSL 432
            + +  + ILVGD  RSI LL+++        +ARD +P                      
Sbjct: 915  LKAKGDFILVGDLMRSITLLQHKQMEGIFVEIARDCEPK--------------------- 953

Query: 433  VWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGH 492
             W                   +  +ILD+ + +G    D      LF+ Q ++  +    
Sbjct: 954  -WM------------------RAVEILDDDTFLGCETHDN-----LFVCQKDSAATTDEE 989

Query: 493  R--LIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFL---TWYASLDGALGFFLPLP 547
            R  L +   FHLG  +N F       S +    G R+  +     Y + +GA+G    +P
Sbjct: 990  RQLLPELARFHLGDTINVFRH----GSLVMQNVGERTTPINGCVLYGTCNGAIGIVTQIP 1045

Query: 548  EKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLS 606
            +  Y  L  L+  +       G ++   +R Y+        PS G IDG L+  FL LS
Sbjct: 1046 QDFYDFLHGLEERLKKIIKSVGKIDHTYYRNYQINTKV--EPSEGFIDGDLIESFLDLS 1102


>gi|68476233|ref|XP_717766.1| potential spliceosomal U2 snRNP complex SF3b component [Candida
            albicans SC5314]
 gi|68476422|ref|XP_717672.1| potential spliceosomal U2 snRNP complex SF3b component [Candida
            albicans SC5314]
 gi|74586274|sp|Q5A7S5.1|RSE1_CANAL RecName: Full=Pre-mRNA-splicing factor RSE1
 gi|46439394|gb|EAK98712.1| potential spliceosomal U2 snRNP complex SF3b component [Candida
            albicans SC5314]
 gi|46439495|gb|EAK98812.1| potential spliceosomal U2 snRNP complex SF3b component [Candida
            albicans SC5314]
          Length = 1219

 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 41/146 (28%), Positives = 68/146 (46%), Gaps = 17/146 (11%)

Query: 492  HRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNY 551
            ++L    +FH+G  + T F + C      +  G  S     Y  L G +G  +PL  K+ 
Sbjct: 1089 YKLQNLIEFHIGDII-TSFNLGCL-----NLAGTES---VIYTGLQGTIGLLIPLVSKSE 1139

Query: 552  RRLLM-LQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGER 610
              LL  LQ  M    ++  G +    R+Y        NP + +IDG L+ +FL+  +  +
Sbjct: 1140 VELLFNLQLYMQQSQNNLVGKDHLKLRSYY-------NPIKNVIDGDLLERFLEFDISLK 1192

Query: 611  LEICKKIGSKHNDILDELYDIEALSS 636
            +EI +K+    NDI  +L D+   S+
Sbjct: 1193 IEISRKLNKSVNDIEKKLIDLRNRSA 1218


>gi|406865227|gb|EKD18269.1| CPSF A subunit region [Marssonina brunnea f. sp. 'multigermtubi'
            MB_m1]
          Length = 1146

 Score = 53.9 bits (128), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 74/365 (20%), Positives = 141/365 (38%), Gaps = 70/365 (19%)

Query: 281  LGTNY--NYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAG 338
            +GT++    S D   +GRIL+F I     +P     K    ++ +   K     +  + G
Sbjct: 840  VGTSFLDEESADPNIKGRILVFGI-----DP-----KKNPYLVASLNLKCACRRVAMLDG 889

Query: 339  FLVTAVGQKIYIWQL-----KDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLR 393
             +V  + + + +++      K  +   +A   +      +   +N+I + D  +S+++++
Sbjct: 890  KIVAVLNKTVAMFKYVEITEKAGEFKKLATFRSSTVPIDIAITENIIAITDMMQSVSIVQ 949

Query: 394  YQPEYR----TLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICK 449
            Y P        L  VARDY+                        W               
Sbjct: 950  YTPGKEGMPDKLEQVARDYQT----------------------CW--------------- 972

Query: 450  KIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 509
              G+   DI D      ++ SD   N+++     +        RL    + +LG+ VN  
Sbjct: 973  --GTAVTDIGDN----SWLESDHHGNLLVLQRNIDGITLEDKQRLRITGEMNLGEQVNMI 1026

Query: 510  FKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
             KI   PS     P A      + A+ +G++  F  + + +   LL LQ  +       G
Sbjct: 1027 RKIAIDPS-----PTAMVVPKAFLATTEGSIYLFSTILDGSQDLLLRLQENITECVDTLG 1081

Query: 570  GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELY 629
             L+ + +R++K        P R  +DG L+ +FL  S   + +IC+ +G     I D + 
Sbjct: 1082 RLDFKTYRSFKSAERTTEEPYR-FVDGELIERFLDESEDMQQQICEGLGYTVEAIRDVVE 1140

Query: 630  DIEAL 634
            +++ L
Sbjct: 1141 NLKRL 1145


>gi|221508103|gb|EEE33690.1| conserved hypothetical protein [Toxoplasma gondii VEG]
          Length = 1878

 Score = 53.5 bits (127), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 45/183 (24%), Positives = 82/183 (44%), Gaps = 27/183 (14%)

Query: 232  FHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDV 291
            + V L+  F     P   + L   E VL L  V       L G+  ++A G     SE+V
Sbjct: 1433 YEVRLYHEFDLHR-PVGTYTLRTCEEVLSLSFV------VLDGVE-HLAAGVGVPLSENV 1484

Query: 292  TCRGRILLFDIIE----VVP---------EPGQPLTKNKIKMIYAKEQKGPVTAICHV-- 336
             C GR+ LF + E    VVP         E  +  T  ++++       GPVT +     
Sbjct: 1485 ECGGRVYLFKLPESSLRVVPAGNAGDAPTEEAEFGTPERLELFADIVLNGPVTVVGSFFS 1544

Query: 337  ----AGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALL 392
                  ++V +VG ++++ +++ +     AF D  V + S+ +++N  L+GD  + + L+
Sbjct: 1545 SPAERSYVVHSVGPRLFVHEMEGSKFLRGAFSDASVCVTSVANIRNFFLLGDALKGLNLV 1604

Query: 393  RYQ 395
             ++
Sbjct: 1605 SWE 1607


>gi|301124447|ref|XP_002909707.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262106897|gb|EEY64949.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 328

 Score = 53.5 bits (127), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 79/322 (24%), Positives = 127/322 (39%), Gaps = 84/322 (26%)

Query: 99  QGVFLCGPHPAWLFLTSRGELRAHPMTIDG-------------------PVSTLAPFHNV 139
            G F  G HP W+ L  RG     PM +                     PV +  PFH+ 
Sbjct: 2   SGAFFRGAHPMWI-LGDRGHASFVPMCVPSSAPPKANGTSKNAAPRVSVPVLSFTPFHHW 60

Query: 140 NCPRGFLYFNAKSELRISVLP-----THLSYDAPWPVRKVPLKCTPHFLAY--------- 185
           +CP GF+YF+++  LR+  LP     T L     + ++K     T H + Y         
Sbjct: 61  SCPNGFIYFHSRGALRVCELPSSKTSTILPSSGGFVLQKAEFGATLHHMLYLGSHGPGGV 120

Query: 186 --HLETKTYCIVTST------AEPSTDYYKFNGEDKELVTDPR----DSRFIPPLVSQF- 232
              LE  TY +V S       A+ +T+      E +    DP      S  + P    F 
Sbjct: 121 AEALEAPTYAVVCSARLKPADADRATEVEGAEEELEPENLDPNGNPLGSNVMAPTAEMFA 180

Query: 233 -----HVSLFSPFSWE-EIPQTN----------FPLH--EWEHVLCLK-----NVSMEYE 269
                H++      +E  + QT+          F +H   +E VL +K     + S+  E
Sbjct: 181 DYETDHMAHTEEDVYELRLVQTDEFGEWGRRGVFRVHFERYEVVLSVKLMYLYDSSLMKE 240

Query: 270 GTLSGL-------RGYIALGTNY--NYSEDVTCRGRILLF--DIIEVVPEPGQPLTKN-- 316
              S         R Y+ +GT +   + ED + RGR+LL+  D  + V E G   +    
Sbjct: 241 EVASTSPEWNKKKRPYLVVGTGWVGPHGEDESGRGRLLLYELDYAQYVNEEGGATSGKLP 300

Query: 317 KIKMIYAKEQK-GPVTAICHVA 337
           K+++++ KE + G V+ +  + 
Sbjct: 301 KLRLVFIKEHRQGAVSMVSQLG 322


>gi|195996153|ref|XP_002107945.1| hypothetical protein TRIADDRAFT_18324 [Trichoplax adhaerens]
 gi|190588721|gb|EDV28743.1| hypothetical protein TRIADDRAFT_18324 [Trichoplax adhaerens]
          Length = 1134

 Score = 53.5 bits (127), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 66/329 (20%), Positives = 125/329 (37%), Gaps = 56/329 (17%)

Query: 315  KNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDN-DLTGIAFIDTEVYIASM 373
            + KI+ +++KE  G V  +    G L+ +V   + +++   N +L         V    +
Sbjct: 851  EGKIQQVHSKEVSGAVYCMVAFNGRLLASVNSTVSVYEWTSNKELVEETSFHNNVLALYL 910

Query: 374  VSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLV 433
             +  + IL+GD  RSI+L  Y+P    + L+ ++  P                       
Sbjct: 911  KTKGDFILIGDLMRSISLCAYRPMNNEIELICKNNDPN---------------------- 948

Query: 434  WKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR 493
            W                      +I+D+ S +G    +   N  LF  Q  +  S    +
Sbjct: 949  WM------------------TAVEIIDDDSYLG---GENSHN--LFTCQKNSSSSEEEQK 985

Query: 494  LIKKTD-FHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYR 552
             +     +H+G+ VN F +      +  D P +    +  + ++ GA+G  + L    + 
Sbjct: 986  HLPTVGVYHVGEFVNVFRQGSLVMQNTVDIPDSVQGSI-LFGTVSGAVGVVVTLAPAMFE 1044

Query: 553  RLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLS------ 606
             +  + N + T     G +  + +R++         P +  +DG LV  FL LS      
Sbjct: 1045 FVSAIANKLSTVVKGVGKIEHQFWRSFSND--RKTEPCQSFVDGDLVESFLDLSPEDMQR 1102

Query: 607  LGERLEICKKIGSKHNDILDELYDIEALS 635
            +   L I    G++   + D L  +E LS
Sbjct: 1103 VANGLTIQTADGTRPAMVEDVLKTVEELS 1131


>gi|308808936|ref|XP_003081778.1| putative UV-damaged DNA binding factor (ISS) [Ostreococcus tauri]
 gi|116060244|emb|CAL56303.1| putative UV-damaged DNA binding factor (ISS) [Ostreococcus tauri]
          Length = 1282

 Score = 53.5 bits (127), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 85/427 (19%), Positives = 160/427 (37%), Gaps = 117/427 (27%)

Query: 171  VRKVPLKCTPHFLAYHLETKTYCIVTST--AEPSTDYYKFNGEDKELVTDPRDSRFIPPL 228
            +R +PL   P  +A+  ET T+ +V     ++ S D +                      
Sbjct: 897  IRTIPLGGQPRRIAHQPETNTFAVVVEHLWSKSSQDCF---------------------- 934

Query: 229  VSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNY- 287
                 V L    S+E + Q  F L + E    L + +   + T      Y  +GT     
Sbjct: 935  -----VRLVDDGSFETLSQ--FQLEDQELTSSLTSCTFAGDSTT-----YYVVGTGIALE 982

Query: 288  SEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQK 347
            +ED   RGRIL+F + +           +++ ++  KE +G V  +    G L+  +  K
Sbjct: 983  TEDEPSRGRILVFKVDD-----------DQLVLVSEKEVRGAVYNLNAFKGKLLAGINSK 1031

Query: 348  I--YIWQLKDND---LTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLS 402
            +  + W  ++++   L        ++   ++ +  + ILVGD  +S++LL Y+PE   + 
Sbjct: 1032 LELFKWTPREDEVHELVSECSHHGQIVTFAVKTRGDWILVGDLMKSMSLLLYKPEEGAID 1091

Query: 403  LVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEF 462
             VARD+      +           ++D    +                +G++++  L   
Sbjct: 1092 EVARDFNANWMTAV---------AMLDDDETY----------------LGAENSLNLFTV 1126

Query: 463  SSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDA 522
            S     ++D++++                 RL    ++HLG+ VN F            A
Sbjct: 1127 SRNVNAVTDEERS-----------------RLEITGEYHLGELVNAF------------A 1157

Query: 523  PGARSRFLT----------WYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 572
            PG+    L            + + +G +G    LP+  Y     LQ  +  H    GGL 
Sbjct: 1158 PGSLVMSLRDGESLSVPTLLFGTANGVIGVLASLPKDVYEFTERLQASINKHIQGVGGLK 1217

Query: 573  PRAFRTY 579
               +R++
Sbjct: 1218 HADWRSF 1224


>gi|398391687|ref|XP_003849303.1| hypothetical protein MYCGRDRAFT_87400 [Zymoseptoria tritici IPO323]
 gi|339469180|gb|EGP84279.1| hypothetical protein MYCGRDRAFT_87400 [Zymoseptoria tritici IPO323]
          Length = 1143

 Score = 53.1 bits (126), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 69/352 (19%), Positives = 136/352 (38%), Gaps = 75/352 (21%)

Query: 281  LGTNYNYSEDVT-CRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGF 339
            +GT Y   +D +  +GRIL+ ++ E            ++K++     +G    +    G 
Sbjct: 836  IGTAYLDDQDASNAKGRILVLEVTE----------DRRLKLVTEISVRGACRCLAVSHGR 885

Query: 340  LVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIAS-----MVSVKNLILVGDYARSIALLRY 394
            +V A+ + + I+  +    +  A +    Y  S     M    ++I V D  +S++L+++
Sbjct: 886  IVAALIKTVIIYSFEYETPSSPAMVKKAAYRTSTAPIDMCVTGDIIAVTDLMKSMSLVQH 945

Query: 395  Q------PEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEIC 448
                   P+   L+ VAR +                        +W     ++ E +   
Sbjct: 946  TLGQAGGPD--NLTEVARHFDT----------------------LWGTAVANVDENI--- 978

Query: 449  KKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNT 508
                              ++ SD + N+V+  +  +        RL   ++  LG+ VN 
Sbjct: 979  ------------------YLESDAEGNLVVLEHDVKGFSEEDRRRLRVTSEILLGEMVNR 1020

Query: 509  FFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHT 568
              +I   P+     P A      + A+++G++  F  + E     L+ +QN M       
Sbjct: 1021 IRRIDVSPT-----PNATVIPRAFLATVEGSIYLFALIAEGKQDLLIRMQNKMAEMVQSP 1075

Query: 569  GGLNPRAFRTYKGKGYYAGN--PSRGIIDGSLVWKFLQLSLGERLEICKKIG 618
            G +    FR +K +    G   PSR  +DG L+ +FL      + E+ K++G
Sbjct: 1076 GHVPFAKFRGFKTQVRDMGEEGPSR-FVDGELIERFLDCDEDVQAEVAKELG 1126


>gi|195571247|ref|XP_002103615.1| GD18880 [Drosophila simulans]
 gi|194199542|gb|EDX13118.1| GD18880 [Drosophila simulans]
          Length = 1140

 Score = 53.1 bits (126), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 70/318 (22%), Positives = 119/318 (37%), Gaps = 68/318 (21%)

Query: 305  VVPEPGQP---------LTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQ--KIYIWQL 353
            V+PE  +P           +NK+  +   +  G   A+    G ++  +G   ++Y W  
Sbjct: 837  VIPEEPEPKVGRIIIFHYNENKLTQVAETKVDGTCYALVEFNGKVLAGIGSFVRLYEWT- 895

Query: 354  KDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQP 413
             + +L     I   +    + +  + ILVGD  RSI LL+++        +ARD +P   
Sbjct: 896  NEKELRMECNIQNMIAALYLKAKGDFILVGDLMRSITLLQHKQMEGIFVEIARDCEPK-- 953

Query: 414  NSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKD 473
                                W                   +  +ILD+ + +G      +
Sbjct: 954  --------------------WM------------------RAVEILDDDTFLG-----SE 970

Query: 474  KNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFL- 530
             N  LF+ Q ++  +    R  L +   FHLG  VN F       S +    G R+  + 
Sbjct: 971  TNGNLFVCQKDSAATTDEERQLLPELARFHLGDTVNVFRH----GSLVMQNVGERTTPIN 1026

Query: 531  --TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGN 588
                Y + +GA+G    +P+  Y  L  L+  +       G +    +R ++        
Sbjct: 1027 GCVLYGTCNGAIGIVTQIPQDFYDFLHGLEERLKKIIKSVGKIEHTYYRNFQINTKV--E 1084

Query: 589  PSRGIIDGSLVWKFLQLS 606
            PS G IDG L+  FL LS
Sbjct: 1085 PSEGFIDGDLIESFLDLS 1102


>gi|21357503|ref|NP_650257.1| piccolo [Drosophila melanogaster]
 gi|74872881|sp|Q9XYZ5.1|DDB1_DROME RecName: Full=DNA damage-binding protein 1; Short=D-DDB1; AltName:
            Full=Damage-specific DNA-binding protein 1; AltName:
            Full=Protein piccolo
 gi|4928452|gb|AAD33592.1|AF132145_1 damage-specific DNA binding protein DDBa p127 subunit [Drosophila
            melanogaster]
 gi|7299719|gb|AAF54901.1| piccolo [Drosophila melanogaster]
 gi|220942640|gb|ACL83863.1| DDB1-PA [synthetic construct]
          Length = 1140

 Score = 53.1 bits (126), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 70/318 (22%), Positives = 119/318 (37%), Gaps = 68/318 (21%)

Query: 305  VVPEPGQPLT---------KNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQ--KIYIWQL 353
            V+PE  +P           +NK+  +   +  G   A+    G ++  +G   ++Y W  
Sbjct: 837  VIPEEPEPKVGRIIIFHYHENKLTQVAETKVDGTCYALVEFNGKVLAGIGSFVRLYEWT- 895

Query: 354  KDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQP 413
             + +L     I   +    + +  + ILVGD  RSI LL+++        +ARD +P   
Sbjct: 896  NEKELRMECNIQNMIAALFLKAKGDFILVGDLMRSITLLQHKQMEGIFVEIARDCEPK-- 953

Query: 414  NSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKD 473
                                W                   +  +ILD+ + +G      +
Sbjct: 954  --------------------WM------------------RAVEILDDDTFLG-----SE 970

Query: 474  KNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFL- 530
             N  LF+ Q ++  +    R  L +   FHLG  VN F       S +    G R+  + 
Sbjct: 971  TNGNLFVCQKDSAATTDEERQLLPELARFHLGDTVNVFRH----GSLVMQNVGERTTPIN 1026

Query: 531  --TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGN 588
                Y + +GA+G    +P+  Y  L  L+  +       G +    +R ++        
Sbjct: 1027 GCVLYGTCNGAIGIVTQIPQDFYDFLHGLEERLKKIIKSVGKIEHTYYRNFQINSKV--E 1084

Query: 589  PSRGIIDGSLVWKFLQLS 606
            PS G IDG L+  FL LS
Sbjct: 1085 PSEGFIDGDLIESFLDLS 1102


>gi|195395112|ref|XP_002056180.1| GJ10363 [Drosophila virilis]
 gi|194142889|gb|EDW59292.1| GJ10363 [Drosophila virilis]
          Length = 1140

 Score = 52.8 bits (125), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 70/319 (21%), Positives = 119/319 (37%), Gaps = 70/319 (21%)

Query: 295  GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQ--KIYIWQ 352
            GRI++F   E           NK+  +   +  G   A+    G ++  +G   ++Y W 
Sbjct: 847  GRIIIFHYNE-----------NKLTQVAETKVDGTCYALVEFNGKVLAGIGSFVRLYEWT 895

Query: 353  LKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQ 412
              + +L     I   +    + +  + ILVGD  RSI LL+++        +ARD +P  
Sbjct: 896  -NEKELRMECNIQNMIAALFLKAKGDFILVGDLMRSITLLQHKQMEGIFVEIARDCEPK- 953

Query: 413  PNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDK 472
                                 W                   +  +ILD+ + +G    D 
Sbjct: 954  ---------------------WM------------------RAVEILDDDTFLGCETHDN 974

Query: 473  DKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFL 530
                 LF+ Q ++  +    R  L +   FHLG  +N F       S +    G R+  +
Sbjct: 975  -----LFVCQKDSAATTDEERQLLPELARFHLGDTINVFRH----GSLVMQNVGERTTPI 1025

Query: 531  ---TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAG 587
                 Y + +GA+G    +P+  Y  L  L+  +       G ++   +R Y+       
Sbjct: 1026 NGCVLYGTCNGAIGIVTQIPQDFYDFLHGLEERLKKIIKSVGKIDHTYYRNYQINTKV-- 1083

Query: 588  NPSRGIIDGSLVWKFLQLS 606
             PS G IDG L+  FL L+
Sbjct: 1084 EPSEGFIDGDLIESFLDLN 1102


>gi|195037449|ref|XP_001990173.1| GH18378 [Drosophila grimshawi]
 gi|193894369|gb|EDV93235.1| GH18378 [Drosophila grimshawi]
          Length = 1140

 Score = 52.8 bits (125), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 73/337 (21%), Positives = 126/337 (37%), Gaps = 71/337 (21%)

Query: 278  YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
            Y  + T+  Y E+   + GRI++F               NK+  +   +  G   A+   
Sbjct: 829  YYVVATSLVYPEEPEPKVGRIIIFH-----------YNDNKLTQVAETKVDGTCYALVEF 877

Query: 337  AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
             G ++  +G   ++Y W   + +L     I   +    + +  + ILVGD  RSI LL++
Sbjct: 878  NGKVLAGIGSFVRLYEWT-NEKELRMECNIQNMIAALFLKAKGDFILVGDLMRSITLLQH 936

Query: 395  QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
            +        +ARD +P                       W                   +
Sbjct: 937  KQMEGIFVEIARDCEPK----------------------WM------------------R 956

Query: 455  HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
              +ILD+ + +G    D      LF+ Q ++  +    R  L +   FHLG  +N F   
Sbjct: 957  AVEILDDDTFLGCETHDN-----LFVCQKDSAATTDEERQLLPELARFHLGDTINVFRH- 1010

Query: 513  RCKPSSISDAPGARSRFL---TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
                S +    G R+  +     Y + +GA+G    +P+  Y  L  L+  +       G
Sbjct: 1011 ---GSLVMQNVGERTTPINGCVLYGTCNGAIGIVTQIPQDFYDFLHGLEERLKKIIKSVG 1067

Query: 570  GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLS 606
             ++   +R Y+        PS G IDG L+  FL L+
Sbjct: 1068 KIDHTYYRNYQINTKV--EPSEGFIDGDLIESFLDLN 1102


>gi|169848339|ref|XP_001830877.1| pre-mRNA-splicing factor RSE1 [Coprinopsis cinerea okayama7#130]
 gi|116508046|gb|EAU90941.1| pre-mRNA-splicing factor RSE1 [Coprinopsis cinerea okayama7#130]
          Length = 1213

 Score = 52.4 bits (124), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 69/326 (21%), Positives = 131/326 (40%), Gaps = 49/326 (15%)

Query: 318  IKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQL-KDNDLTGIAFIDTEVYIASMVSV 376
            +++++  E      A+    G L   VG+ + I+ + K   L  +        I ++ + 
Sbjct: 932  LELLHKTETDDVPMALLAFQGRLAAGVGKALRIYDIGKKKLLRKVENKSFTTAIVTLTTQ 991

Query: 377  KNLILVGDYARSIALLRY-QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWK 435
             + ILVGD   S+  + Y QPE R L+  A D +P              R +   ++V  
Sbjct: 992  GSRILVGDMQESVQYVVYKQPENRLLTF-ADDTQP--------------RWVTAITMV-D 1035

Query: 436  FLQLSLGERLE--ICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR 493
            +  +  G+R       ++ SK +D +DE  +   ++ +K     + M  P        H+
Sbjct: 1036 YNTIVAGDRFGNIFVNRLDSKVSDQVDEDPTGAGILHEKP----ILMGAP--------HK 1083

Query: 494  LIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLP-LPEKNYR 552
                  FH+G  + +  K+            A  R +  Y  L G +G  +P + +++  
Sbjct: 1084 TKMIAHFHVGDIITSLHKVSLV---------AGGREVIVYTGLHGTIGILMPFISKEDVD 1134

Query: 553  RLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLE 612
             +  L+  M T      G +  A+R     GYY   P + ++DG L   +  L   ++  
Sbjct: 1135 FISTLEQHMRTEQPSLVGRDQLAYR-----GYYV--PVKAVVDGDLCETYAHLPASKQSS 1187

Query: 613  ICKKIGSKHNDILDELYDIEALSSHF 638
            I  ++     ++L +L  +   SS F
Sbjct: 1188 IANELDRTVGEVLKKLEQMRVTSSGF 1213


>gi|321478515|gb|EFX89472.1| hypothetical protein DAPPUDRAFT_303245 [Daphnia pulex]
          Length = 1158

 Score = 52.0 bits (123), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 68/321 (21%), Positives = 117/321 (36%), Gaps = 59/321 (18%)

Query: 305  VVPEPGQP---------LTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKD 355
            VVPE  +P             K+  +  KE KG   ++      ++ A+   + +++   
Sbjct: 853  VVPEESEPKQGRIVLFQWADGKLTTVAEKEVKGACYSLVDFNSKILAAINNVVRLYEWTA 912

Query: 356  NDLTGIAFIDTEVYIASMVSVK-NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPN 414
                 +   +    IA  +  K + ILVGD  RSI LL+Y+    +   +ARD  P    
Sbjct: 913  EKELRLECSNFNHIIALYLKRKGDFILVGDLMRSITLLQYKTMEGSFEEMARDSNPN--- 969

Query: 415  SKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDK 474
                               W                      +ILD+ + +G      + 
Sbjct: 970  -------------------WM------------------SAVEILDDDTFLG-----AEN 987

Query: 475  NVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTW 532
            +  LF+ Q ++  +    R  L +   FHLG  VN F          ++     ++    
Sbjct: 988  SFNLFVCQKDSAATTEEERQQLTEVGRFHLGDMVNVFRHGSLVMDHAAETLTTPTQGCVL 1047

Query: 533  YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 592
            + ++ GA+G    LP + Y  L  +Q  M       G +    +R++  +      P  G
Sbjct: 1048 FGTVHGAIGVVTQLPSEFYHFLSEVQTRMARVIKPVGKIEHSFWRSFATERKV--EPCEG 1105

Query: 593  IIDGSLVWKFLQLSLGERLEI 613
             IDG L+  FL LS  +  E+
Sbjct: 1106 FIDGDLIESFLDLSSDKMKEV 1126


>gi|449684814|ref|XP_004210722.1| PREDICTED: DNA damage-binding protein 1-like, partial [Hydra
           magnipapillata]
          Length = 725

 Score = 52.0 bits (123), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 86/368 (23%), Positives = 138/368 (37%), Gaps = 70/368 (19%)

Query: 278 YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
           Y  +GT+  Y E+   + G+I+LF + E            K+  I +K   G V  +   
Sbjct: 415 YYCVGTSMVYPEESEPKEGKIILFQLFE-----------GKLVQIGSKTVNGAVYVLQGF 463

Query: 337 AGFLVTAVGQKIYIWQ-LKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQ 395
            G L+  V   + +++   D +L         +    + S  + ILVGD  RS+ LL Y+
Sbjct: 464 NGKLLAGVNSLVSVYEWTSDKELKQECCYHNTILALYLKSKGDFILVGDLMRSMTLLAYK 523

Query: 396 PEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKH 455
           P  R L  +A D+ P             +  IID      FL       L IC+K  S  
Sbjct: 524 PLGR-LEEIAHDFSPNWM---------TAVEIIDDD---TFLGAENSFNLFICQKDNSSV 570

Query: 456 NDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF--FKIR 513
           ND                          E R     H L     +HLG  VN F    + 
Sbjct: 571 ND--------------------------EER-----HHLQTIGKYHLGDFVNVFKHGSLV 599

Query: 514 CKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNP 573
              S+    P + S     Y ++ GA+G    LP+  +  L  +Q  +       G +  
Sbjct: 600 MHHSTEQLTPISSS---ILYGTVRGAIGLVAGLPKNTFDFLSQVQEKLSKTIKSVGKIEH 656

Query: 574 RAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEIC-----KKIGSKHNDILDEL 628
             +R++        + + G +DG L+   L L+  +  E+      ++ G K    +D+L
Sbjct: 657 EFWRSFYNDK--KTDLAVGCVDGDLIESCLDLTRTQLHEVVSGLEIEEAGIKRECTVDDL 714

Query: 629 YD-IEALS 635
              +E LS
Sbjct: 715 IKVVEELS 722


>gi|195449948|ref|XP_002072297.1| GK22405 [Drosophila willistoni]
 gi|194168382|gb|EDW83283.1| GK22405 [Drosophila willistoni]
          Length = 1140

 Score = 51.6 bits (122), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 66/298 (22%), Positives = 112/298 (37%), Gaps = 59/298 (19%)

Query: 316  NKIKMIYAKEQKGPVTAICHVAGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASM 373
            NK+  +   +  G   A+    G ++  +G   ++Y W   + +L     I   +    +
Sbjct: 857  NKLTQVAETKVDGTCYALVEFNGKVLAGIGSFVRLYEWT-NEKELRMECNIQNMIAALYL 915

Query: 374  VSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLV 433
             +  + ILVGD  RSI LL+++        +ARD +P                       
Sbjct: 916  KAKGDFILVGDLMRSITLLQHKQMEGIFVEIARDCEPK---------------------- 953

Query: 434  WKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR 493
            W                   +  +ILD+ + +G      + N  LF+ Q ++  +    R
Sbjct: 954  WM------------------RAVEILDDDTFLG-----SETNGNLFVCQKDSAATTDEER 990

Query: 494  --LIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFL---TWYASLDGALGFFLPLPE 548
              L +   FHLG  VN F       S +    G R+  +     Y + +GA+G    +P+
Sbjct: 991  QLLPELARFHLGDTVNVFRH----GSLVMQNVGERTTPINGCVLYGTCNGAIGIVTQIPQ 1046

Query: 549  KNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLS 606
              Y  L  L+  +       G +    +R ++        PS G IDG L+  FL LS
Sbjct: 1047 DFYDFLHGLEERLKKIIKSVGKIEHTYYRNFQINTKV--EPSEGFIDGDLIESFLDLS 1102


>gi|125774475|ref|XP_001358496.1| GA20574 [Drosophila pseudoobscura pseudoobscura]
 gi|54638233|gb|EAL27635.1| GA20574 [Drosophila pseudoobscura pseudoobscura]
          Length = 1140

 Score = 51.6 bits (122), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 66/298 (22%), Positives = 112/298 (37%), Gaps = 59/298 (19%)

Query: 316  NKIKMIYAKEQKGPVTAICHVAGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASM 373
            NK+  +   +  G   A+    G ++  +G   ++Y W   + +L     I   +    +
Sbjct: 857  NKLTQVAETKVDGTCYALVEFNGKVLAGIGSFVRLYEWT-NEKELRMECNIQNMIAALYL 915

Query: 374  VSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLV 433
             +  + ILVGD  RSI LL+++        +ARD +P                       
Sbjct: 916  KAKGDFILVGDLMRSITLLQHKQMEGIFVEIARDCEPK---------------------- 953

Query: 434  WKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR 493
            W                   +  +ILD+ + +G      + N  LF+ Q ++  +    R
Sbjct: 954  WM------------------RAVEILDDDTFLG-----SETNGNLFVCQKDSAATTDEER 990

Query: 494  --LIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFL---TWYASLDGALGFFLPLPE 548
              L +   FHLG  VN F       S +    G R+  +     Y + +GA+G    +P+
Sbjct: 991  QLLPELARFHLGDTVNVFRH----GSLVMQNVGERTTPINGCVLYGTCNGAIGIVTQIPQ 1046

Query: 549  KNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLS 606
              Y  L  L+  +       G +    +R ++        PS G IDG L+  FL LS
Sbjct: 1047 DFYDFLHGLEERLKKIIKSVGKIEHTYYRNFQINTKV--EPSEGFIDGDLIESFLDLS 1102


>gi|339235331|ref|XP_003379220.1| DNA damage-binding protein 1 [Trichinella spiralis]
 gi|316978142|gb|EFV61158.1| DNA damage-binding protein 1 [Trichinella spiralis]
          Length = 1329

 Score = 51.6 bits (122), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 70/308 (22%), Positives = 130/308 (42%), Gaps = 62/308 (20%)

Query: 315  KNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDT-EVYIASM 373
             + + +++ KE  G V A+      L+ A+   + +++   +D+TG+  + +  +++ +M
Sbjct: 1040 NSSLNLVHEKEVNGCVYAMASFKSKLLVAMNSSVLLFEW--SDVTGLQLVSSCSLFVTAM 1097

Query: 374  -VSVKN-LILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGS 431
             + V++ +ILVGD  RSIA+LRY P   +    ARDY P                     
Sbjct: 1098 HLKVRDEVILVGDIQRSIAVLRYVPSESSFVEEARDYHPN-------------------- 1137

Query: 432  LVWKFLQLSLGERLEICKKIGSKHND-ILDEFSSMGFMISDKDKNVVLFMYQP--EARES 488
              W    +S  E ++         ND  +   +S+   +S KD        QP  E++  
Sbjct: 1138 --W----ISAIEVID---------NDYFMAAENSLNITVSQKD-----LQQQPVSESQVV 1177

Query: 489  NGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLT--WYASLDGALGFFLPL 546
                RL      HLG+++N F   +    S+    G  S         + +G++  +  +
Sbjct: 1178 KSAGRL------HLGEYINVF---KHGALSMYSYAGISSLVSNPIMIGTAEGSILIYCQI 1228

Query: 547  PEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGN-PSRGIIDGSLVWKFLQL 605
             + ++R L  LQ           G    A+ +Y+    Y  N P+ G IDG L+ + L++
Sbjct: 1229 HDSHFRVLNDLQRCFSDIVPDNVGC--IAYDSYRRYVVYEKNAPAFGFIDGDLIEQLLEM 1286

Query: 606  SLGERLEI 613
               E + +
Sbjct: 1287 PRQEAIRL 1294


>gi|124806507|ref|XP_001350742.1| splicing factor 3b, subunit 3, 130kD, putative [Plasmodium falciparum
            3D7]
 gi|23496869|gb|AAN36422.1|AE014849_41 splicing factor 3b, subunit 3, 130kD, putative [Plasmodium falciparum
            3D7]
          Length = 1329

 Score = 51.2 bits (121), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 70/318 (22%), Positives = 111/318 (34%), Gaps = 79/318 (24%)

Query: 334  CHVAGFLVTAVGQKIYIWQL-KDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALL 392
            C   G L+ ++G K+ I+ L K   L    + D    I S+    N I   D   S+ + 
Sbjct: 1066 CSYNGKLIASIGNKLRIYALGKKKLLKKCEYKDIPEAIVSIKISGNRIFACDIRESVLIF 1125

Query: 393  RYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 452
             Y P   TL L++ D  P                                 R   C +I 
Sbjct: 1126 FYDPNQNTLRLISDDIIP---------------------------------RWITCSEIL 1152

Query: 453  SKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARE------------------SNGGHRL 494
              H            M +DK  +V +     EA++                  S    +L
Sbjct: 1153 DHHT----------IMAADKFDSVFILRVPEEAKQDEYGITNKCWYGGEIMNSSTKNRKL 1202

Query: 495  IKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRL 554
                 FH+G+ V +  K+R  P+S              Y+++ G +G F+P   K    L
Sbjct: 1203 EHMMSFHIGEIVTSMQKVRLSPTSSE---------CIIYSTIMGTIGAFIPYDNKEELEL 1253

Query: 555  LM-LQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEI 613
               L+ ++ T      G     FR+Y        +P + ++DG L  +F  LS   + +I
Sbjct: 1254 TQHLEIILRTEKPPLCGREHIFFRSYY-------HPVQNVVDGDLCEQFSSLSYDAQKKI 1306

Query: 614  CKKIGSKHNDILDELYDI 631
               +     DIL +L DI
Sbjct: 1307 ANDLERTPEDILRKLEDI 1324


>gi|221486318|gb|EEE24579.1| conserved hypothetical protein [Toxoplasma gondii GT1]
          Length = 2804

 Score = 51.2 bits (121), Expect = 0.002,   Method: Composition-based stats.
 Identities = 45/183 (24%), Positives = 82/183 (44%), Gaps = 27/183 (14%)

Query: 232  FHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDV 291
            + V L+  F     P   + L   E VL L  V       L G+  ++A G     SE+V
Sbjct: 2359 YEVRLYHEFDLHR-PVGTYTLRTCEEVLSLSFV------VLDGVE-HLAAGVGVPLSENV 2410

Query: 292  TCRGRILLFDIIE----VVP---------EPGQPLTKNKIKMIYAKEQKGPVTAICHV-- 336
             C GR+ LF + E    VVP         E  +  T  ++++       GPVT +     
Sbjct: 2411 ECGGRVYLFKLPESSLRVVPAGNAGDAPTEEAEFGTPERLELFADIVLNGPVTVVGSFFS 2470

Query: 337  ----AGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALL 392
                  ++V +VG ++++ +++ +     AF D  V + S+ +++N  L+GD  + + L+
Sbjct: 2471 SPAERSYVVHSVGPRLFVHEMEGSKFLRGAFSDASVCVTSVANIRNFFLLGDALKGLNLV 2530

Query: 393  RYQ 395
             ++
Sbjct: 2531 SWE 2533


>gi|237833631|ref|XP_002366113.1| hypothetical protein TGME49_024280 [Toxoplasma gondii ME49]
 gi|211963777|gb|EEA98972.1| hypothetical protein TGME49_024280 [Toxoplasma gondii ME49]
          Length = 2804

 Score = 51.2 bits (121), Expect = 0.002,   Method: Composition-based stats.
 Identities = 45/183 (24%), Positives = 82/183 (44%), Gaps = 27/183 (14%)

Query: 232  FHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDV 291
            + V L+  F     P   + L   E VL L  V       L G+  ++A G     SE+V
Sbjct: 2359 YEVRLYHEFDLHR-PVGTYTLRTCEEVLSLSFV------VLDGVE-HLAAGVGVPLSENV 2410

Query: 292  TCRGRILLFDIIE----VVP---------EPGQPLTKNKIKMIYAKEQKGPVTAICHV-- 336
             C GR+ LF + E    VVP         E  +  T  ++++       GPVT +     
Sbjct: 2411 ECGGRVYLFKLPESSLRVVPAGNAGDAPTEEAEFGTPERLELFADIVLNGPVTVVGSFFS 2470

Query: 337  ----AGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALL 392
                  ++V +VG ++++ +++ +     AF D  V + S+ +++N  L+GD  + + L+
Sbjct: 2471 SPAERSYVVHSVGPRLFVHEMEGSKFLRGAFSDASVCVTSVANIRNFFLLGDALKGLNLV 2530

Query: 393  RYQ 395
             ++
Sbjct: 2531 SWE 2533


>gi|389740093|gb|EIM81285.1| hypothetical protein STEHIDRAFT_86633 [Stereum hirsutum FP-91666 SS1]
          Length = 1213

 Score = 51.2 bits (121), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 67/327 (20%), Positives = 138/327 (42%), Gaps = 51/327 (15%)

Query: 318  IKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVK 377
            +++++  E      ++    G LV  +G+ + I+ +    L   A  +++ + ++++S+ 
Sbjct: 932  LELLHKTETDDIPMSLLAFQGRLVAGIGKALRIYDIGKKKLLRKA--ESKTFASAIISLN 989

Query: 378  ---NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVW 434
               + I+VGD   SIA   Y+     L + A D +              +R +   ++V 
Sbjct: 990  TQGSRIIVGDMQESIAYAVYKAPENKLLVFADDTQ--------------ARWVTCSTMV- 1034

Query: 435  KFLQLSLGERLE--ICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGH 492
             +  ++ G+R       ++ SK +D +D+  +   ++ +K     + M  P        H
Sbjct: 1035 DYTTVAAGDRFGNIFINRLDSKVSDQVDDDPTGAGILHEKG----ILMGAP--------H 1082

Query: 493  RLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEK-NY 551
            +      FH+G  V +  K       +S   G R   L  Y  L G +G  +PL  K + 
Sbjct: 1083 KTAMLAHFHVGDLVTSIHK-------VSLVAGGREVLL--YTGLHGTIGMLVPLVSKEDV 1133

Query: 552  RRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERL 611
              +  L+  + T  +   G +  A+R     GYY   P + ++DG L   F +L   ++ 
Sbjct: 1134 DFISTLEQHIRTEQTSLVGRDHLAWR-----GYYV--PVKAVVDGDLCETFARLPAAKQS 1186

Query: 612  EICKKIGSKHNDILDELYDIEALSSHF 638
             I  ++    +++L +L  +   +S F
Sbjct: 1187 MIAGELDRTVSEVLKKLDQLRVTASGF 1213


>gi|212539802|ref|XP_002150056.1| UV-damaged DNA binding protein, putative [Talaromyces marneffei ATCC
            18224]
 gi|210067355|gb|EEA21447.1| UV-damaged DNA binding protein, putative [Talaromyces marneffei ATCC
            18224]
          Length = 1139

 Score = 50.8 bits (120), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 67/361 (18%), Positives = 136/361 (37%), Gaps = 65/361 (18%)

Query: 281  LGTNYNYSEDV-TCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGF 339
            +GT Y   E   + RGRILLF++           +  K+ +      KG   A+  +  +
Sbjct: 833  VGTAYLDDETAESIRGRILLFEVD----------SNRKLSLFLEHPVKGACRALAMMGDY 882

Query: 340  LVTAVGQKIYIWQLKDNDLTG------IAFIDTEVYIASMVSVKNLILVGDYARSIALLR 393
            +V A+ + + I+++     TG       A   T      +      I+V D  +SI+++ 
Sbjct: 883  IVAALVKTVVIFEVTGQPQTGKYSLQKAAVYRTSTAPVDIAVTDKTIVVADLMKSISIVE 942

Query: 394  YQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGS 453
               +   L++ A+                                       E+ +   +
Sbjct: 943  SN-KTDALTMEAK---------------------------------------EVARHFAT 962

Query: 454  KHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIR 513
                 + +  S  +++SD + N+++     +        RL   ++  LG+ VN     R
Sbjct: 963  VWTTAVADIGSNQWLVSDAEGNLIVLRRNVDGMTEEDRRRLEVTSELLLGEMVN-----R 1017

Query: 514  CKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNP 573
             +P +I            +  +++G++  F  +  ++   L+ LQ  +  +    G +  
Sbjct: 1018 IRPVNIPQTSTMAVTPKAFLGTVEGSIYLFALINPEHQDFLMRLQTAISAYVDSPGLMPF 1077

Query: 574  RAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEA 633
              FR ++     A  P R  +DG L+ +FL      + EI   +GS   + + ++  IEA
Sbjct: 1078 NKFRAFRSTVREAEEPFR-FVDGELIERFLDCDRAVQEEILGVVGSGDLESVQKM--IEA 1134

Query: 634  L 634
            L
Sbjct: 1135 L 1135


>gi|154421858|ref|XP_001583942.1| CPSF A subunit region family protein [Trichomonas vaginalis G3]
 gi|121918186|gb|EAY22956.1| CPSF A subunit region family protein [Trichomonas vaginalis G3]
          Length = 1297

 Score = 50.8 bits (120), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 78/388 (20%), Positives = 151/388 (38%), Gaps = 78/388 (20%)

Query: 269  EGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPG------QPLTKNKIKMIY 322
            E  ++ L  Y+A+G+ +    +   RG + ++ I  +  + G      +PL  N+   IY
Sbjct: 954  EDGITLLNTYLAVGSGFLSQPEKMMRGVLYIYQIRYMQNDEGFNEITLRPLY-NETNKIY 1012

Query: 323  AKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGI-AFIDTEVYIASMVSVKNLIL 381
                K P+  I   +G++    G  +Y+ +  + +   I AF+    + +S+VS+KN +L
Sbjct: 1013 ----KNPIIEITDNSGYMAIFCGNLLYLMRFFNENTVKIEAFLVGRFFASSIVSLKNYLL 1068

Query: 382  VGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSL 441
              D      + R++   + L  +ARD     P S                    FLQ   
Sbjct: 1069 YADSYEGFEVARWRKYGKKLISMARDTMTKLPLSAA------------------FLQ--- 1107

Query: 442  GERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFH 501
                E C                +G ++ D D N  +F     A  ++    +++K+ F+
Sbjct: 1108 ---YEDC----------------LGGVVFDDDGNAHIFDVDEYAIPADA---VVRKSIFY 1145

Query: 502  LGQHVNTF--FKIRCKPSSISDAPGAR------------SRFLTWYASLDGALGFFLPLP 547
            +G    +   F I+    +    P                  + WY +  G +G F P+ 
Sbjct: 1146 IGGRAISSGQFPIKAVTQATQQNPNEEIDEELLQLQTKIGGHIAWYVTTHGKIGAFTPID 1205

Query: 548  EKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGN----PSRGIIDGSLVWKFL 603
            E +  +L+ +Q+    +     GL+   +R+ K K     +      + +ID  ++   +
Sbjct: 1206 ENDRHKLVGVQS---AYEKSLCGLSHLEYRSGKFKNMIEQDIFNQSPKNVIDCDMLIDLI 1262

Query: 604  QLSLGERLEICKKIGSKHNDILDELYDI 631
            +  + + L+   K G +  D L EL  I
Sbjct: 1263 E-DMPDHLKFATK-GLRTQDFLSELRKI 1288


>gi|156339616|ref|XP_001620212.1| hypothetical protein NEMVEDRAFT_v1g223331 [Nematostella vectensis]
 gi|156204813|gb|EDO28112.1| predicted protein [Nematostella vectensis]
          Length = 248

 Score = 50.8 bits (120), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 33/97 (34%), Positives = 53/97 (54%), Gaps = 13/97 (13%)

Query: 17  VQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHP--KGALKLRFKKLKVLFVSDRSKRA 74
           V+E+L   LG    R  L+     +LLIY+AF +P  +G L LRFKKL+   +  R K+ 
Sbjct: 150 VREVLLTGLGYKNRRATLVAVMDQDLLIYEAFSYPTVEGHLNLRFKKLQ-HNIQIREKKP 208

Query: 75  NEQ--------PGLPRGVRISQMRYFSNIAGYQGVFL 103
            ++        PGL    +++ +R F++I+ Y GV +
Sbjct: 209 KQEPKNDSETKPGL--DPKVAMLRVFNDISSYSGVCM 243


>gi|124505011|ref|XP_001351247.1| CPSF (cleavage and polyadenylation specific factor), subunit A,
            putative [Plasmodium falciparum 3D7]
 gi|7768292|emb|CAB11136.2| CPSF (cleavage and polyadenylation specific factor), subunit A,
            putative [Plasmodium falciparum 3D7]
          Length = 2870

 Score = 50.8 bits (120), Expect = 0.002,   Method: Composition-based stats.
 Identities = 119/604 (19%), Positives = 223/604 (36%), Gaps = 155/604 (25%)

Query: 95   IAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDG--------PVSTLAPFHNVNCPRGFL 146
            I  Y  +F+C   P  ++   + ++    +++            + L PFHN      FL
Sbjct: 2358 IKKYNFLFVCCESPIIIYSDLKKKINVSKLSLKNIYIVDIFNDFNYLNPFHN------FL 2411

Query: 147  YFNAKSE----------LRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVT 196
             F  K++            I + P  L+      ++K+P   T   +AYH +T     + 
Sbjct: 2412 SFKKKNQNNFYFIFYDGSNIHISP--LNQIKKTFLKKIPFHRTVEKIAYHSDTG----LL 2465

Query: 197  STAEPSTDYYKFNGEDKELVT--DP-RDS-RFIPPLVSQFHVSLFSPFSWEEIPQTNFPL 252
              A PS + +K N   K+++   DP  DS ++   + S++ VS    +  E++ ++NF +
Sbjct: 2466 IAACPSEEKHKTNEMMKQIICFFDPYHDSIKYTYIIPSKYTVSTIIIYDNEKLMKSNFDV 2525

Query: 253  HEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQP 312
                        S  + GT +         +N  Y+E  +  G I +F            
Sbjct: 2526 -----------TSFIFVGTCN---------SNEKYTEPTS--GHIHIF------------ 2551

Query: 313  LTKNK-----IKMIYAKE-QKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDT 366
            + K K     IK IY      G VT +      +V  +   + I  + +  +   AF+D 
Sbjct: 2552 IAKKKANIFEIKHIYTHNINYGGVTNLVPYDDKIVATINNMVVILDINNLIIKYEAFMDP 2611

Query: 367  E---------------------VYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVA 405
            +                      +I ++    + I+VGD   S+ +L+Y  E   L  V 
Sbjct: 2612 QNLQPKIEGNNAIVELVSFTPSSWIMTVDVYGDYIVVGDIMTSVTILQYDYENSQLFEVC 2671

Query: 406  RDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSM 465
            RDY                      S +W             C  + +         S  
Sbjct: 2672 RDY----------------------SNIW-------------CTSLCA--------LSKS 2688

Query: 466  GFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGA 525
              ++SD D N ++             ++L   + F+ G  +N    +    +++ +    
Sbjct: 2689 HIVVSDMDANFIILQKSKFKYNDEDSYKLSSVSLFNHGSIINKMLPL--SNTNLIEEDYD 2746

Query: 526  RSRFLT-----WYASLDGALGFFLPLPE-KNYRRLLMLQNVMVTHTSHTGGLNPRAFRTY 579
            +   LT       AS +G++   +P     N+++ L ++  +  + S  G L+  A+R Y
Sbjct: 2747 KRNILTKNDGILCASSEGSISVLIPFSSFANFKKALCIEIAITDNISSIGNLSHNAYREY 2806

Query: 580  KGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLE-------ICKKIGSKHNDILDELYDIE 632
            K    +     +GI+DG L+  F  +S  ++ +       I KKI  K     + + D+E
Sbjct: 2807 KVN--FRSKHCKGIVDGELLKMFFHMSFEKQYKTFIYAKWIAKKINCKFGSFNNFILDLE 2864

Query: 633  ALSS 636
             + S
Sbjct: 2865 NMCS 2868


>gi|194741158|ref|XP_001953056.1| GF17579 [Drosophila ananassae]
 gi|190626115|gb|EDV41639.1| GF17579 [Drosophila ananassae]
          Length = 1140

 Score = 50.4 bits (119), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 73/333 (21%), Positives = 124/333 (37%), Gaps = 65/333 (19%)

Query: 316  NKIKMIYAKEQKGPVTAICHVAGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASM 373
            NK+  +   +  G   A+    G ++  +G   ++Y W   + +L     I   +    +
Sbjct: 857  NKLTQVAETKVDGTCYALVEFNGKVLAGIGSFVRLYEWT-NEKELRMECNIQNMIAALFL 915

Query: 374  VSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLV 433
             +  + ILVGD  RSI LL+++        +ARD +P                       
Sbjct: 916  KAKGDFILVGDLMRSITLLQHKQMEGIFVEIARDCEPK---------------------- 953

Query: 434  WKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR 493
            W                   +  +ILD+ + +G      + N  LF+ Q ++  +    R
Sbjct: 954  WM------------------RAVEILDDDTFLG-----SETNGNLFVCQKDSAATTDEER 990

Query: 494  --LIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFL---TWYASLDGALGFFLPLPE 548
              L +   FHLG  VN F       S +    G R+  +     Y + +GA+G    +P+
Sbjct: 991  QLLPELARFHLGDTVNVFRH----GSLVMQNVGERTTPINGCVLYGTCNGAIGIVTQIPQ 1046

Query: 549  KNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLG 608
              Y  L  L+  +       G +    +R ++        PS G IDG L+  FL L   
Sbjct: 1047 DFYDFLHGLEERLKKIIKSVGKIEHTYYRNFQINTKV--EPSEGFIDGDLIESFLDLGRD 1104

Query: 609  ------ERLEICKKIGSKHNDILDELYDIEALS 635
                  + LEI      K  D+ D +  +E L+
Sbjct: 1105 KMRDAVQGLEITLNGERKSADVEDVIKIVEDLT 1137


>gi|345570887|gb|EGX53705.1| hypothetical protein AOL_s00006g33 [Arthrobotrys oligospora ATCC
            24927]
          Length = 1133

 Score = 50.4 bits (119), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 64/247 (25%), Positives = 99/247 (40%), Gaps = 25/247 (10%)

Query: 372  SMVSVKNLILVGDYARSIALLRYQPEYRTLSL----VARDYKPTQPNSKGYYAGNPSRGI 427
            S+  VK  IL G  ++SI L R+     +L      ++     T P S   Y     + +
Sbjct: 861  SLAIVKGYILAG-LSKSIDLYRFSYTRGSLGASIQQISSIRAATLPVSLSVYG----KRV 915

Query: 428  IDGSLVWKFLQLSLGER--------LEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLF 479
              G LV   + L + E         +E+C++ G      L+       + +D D N+VL 
Sbjct: 916  FVGDLVKGVMVLEVVEGGGEGNDKLVEVCRQYGVSWVTALEALDEDTCISADSDGNLVLL 975

Query: 480  MYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGA 539
              +          R+   ++  LG+ VN    IR     I+     + +   +  ++DG 
Sbjct: 976  RRESTGATDEDTRRMRPLSEIRLGEMVNC---IRRVNDPITQGYVVQPK--AYLGTVDGG 1030

Query: 540  LGFFLPLPEKNYRRLLM-LQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSL 598
            L F L L   +Y  +LM  Q  M       G L+   +R Y  KG     P R  +DG L
Sbjct: 1031 L-FMLGLIHPDYFDILMKCQVNMAKVIKGIGDLDFNRYRAYNTKGIQPEEPFR-FVDGEL 1088

Query: 599  VWKFLQL 605
            V KFL L
Sbjct: 1089 VEKFLDL 1095


>gi|297267724|ref|XP_001082958.2| PREDICTED: DNA damage-binding protein 1 [Macaca mulatta]
          Length = 1092

 Score = 50.1 bits (118), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 70/315 (22%), Positives = 119/315 (37%), Gaps = 78/315 (24%)

Query: 278  YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
            Y  +GT   Y E+   + GRI++F             +  K++ +  KE KG V ++   
Sbjct: 828  YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 876

Query: 337  AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
             G L+ ++    ++Y W  +    T     +  +    + +  + ILVGD  RS+ LL Y
Sbjct: 877  NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 935

Query: 395  QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
            +P       +ARD+ P                       W                    
Sbjct: 936  KPMEGNFEEIARDFNPN----------------------WM------------------S 955

Query: 455  HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
              +ILD+ + +G      +    LF+ Q ++  +    R  L +   FHLG+ VN F   
Sbjct: 956  AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 1007

Query: 513  RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVM-------- 561
             C  S +    G  S   +    + +++G +G    L E  Y  LL +QN +        
Sbjct: 1008 -CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066

Query: 562  -VTHTSHTGGLNPRA 575
             + H+ H   L+ RA
Sbjct: 1067 KIEHSFHLEILSHRA 1081


>gi|443918546|gb|ELU38987.1| CPSF A subunit region domain-containing protein [Rhizoctonia solani
            AG-1 IA]
          Length = 1037

 Score = 50.1 bits (118), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 75/347 (21%), Positives = 139/347 (40%), Gaps = 59/347 (17%)

Query: 274  GLRGYIALGTNY-NYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTA 332
            G   YI  GT   N  E+    GRI+LF         GQ   +N IK   +K+ +G V++
Sbjct: 714  GGNSYILAGTAIINPGENEPLAGRIILF---------GQD-EENMIKFKASKDVEGGVSS 763

Query: 333  ICHVAGFLVTAVGQKIYIWQLKDNDLT---GIAFIDTEVYIASMVSVKNLILVGDYARSI 389
            I  +   ++ A+G  IY++ L   ++T    +A  +    +  ++   N+I+V D  RS+
Sbjct: 764  IKQLGARIIAAIGHGIYLYNLGRGEVTISDPVARWERGYIVHDIIVRPNMIVVSDRLRSV 823

Query: 390  ALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICK 449
            ++LR+         + R   P                    S + +F  +++        
Sbjct: 824  SVLRF---------IERTSTPESHEEIETEE---------DSTILQFETVAMD-----MH 860

Query: 450  KIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 509
             +     ++L +  ++  + S  D N++ +    E  + N    L  +  FH G+ ++ F
Sbjct: 861  AVWPTSVEVLPDNKTI--IASQTDGNILTW----ELEDGN----LEPRAAFHTGEIIHKF 910

Query: 510  FKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTG 569
                 K S       A  R +  + +  G +G    + + +  +L  L+  +       G
Sbjct: 911  IASTAKSS-------AGPRTVAIFVTNTGRIGTLSTVDDADALQLTRLEMKLGDAIKGLG 963

Query: 570  GLNPRAFRTYKGKGYYAGN---PSRGIIDGSLVWKFLQLSLGERLEI 613
             +    +R    K  + G    P RG+ DG  + KFL+LS  E   I
Sbjct: 964  NIKHPEWRAP--KLLHTGTKPPPRRGVTDGDFIKKFLELSSEEAKRI 1008


>gi|221040048|dbj|BAH11787.1| unnamed protein product [Homo sapiens]
          Length = 1092

 Score = 50.1 bits (118), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 70/315 (22%), Positives = 119/315 (37%), Gaps = 78/315 (24%)

Query: 278  YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
            Y  +GT   Y E+   + GRI++F             +  K++ +  KE KG V ++   
Sbjct: 828  YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 876

Query: 337  AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
             G L+ ++    ++Y W  +    T     +  +    + +  + ILVGD  RS+ LL Y
Sbjct: 877  NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 935

Query: 395  QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
            +P       +ARD+ P                       W                    
Sbjct: 936  KPMEGNFEEIARDFNPN----------------------WM------------------S 955

Query: 455  HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
              +ILD+ + +G      +    LF+ Q ++  +    R  L +   FHLG+ VN F   
Sbjct: 956  AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 1007

Query: 513  RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVM-------- 561
             C  S +    G  S   +    + +++G +G    L E  Y  LL +QN +        
Sbjct: 1008 -CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066

Query: 562  -VTHTSHTGGLNPRA 575
             + H+ H   L+ RA
Sbjct: 1067 KIEHSFHLEILSHRA 1081


>gi|119594339|gb|EAW73933.1| damage-specific DNA binding protein 1, 127kDa, isoform CRA_a [Homo
            sapiens]
          Length = 1094

 Score = 50.1 bits (118), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 65/290 (22%), Positives = 110/290 (37%), Gaps = 69/290 (23%)

Query: 278  YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
            Y  +GT   Y E+   + GRI++F             +  K++ +  KE KG V ++   
Sbjct: 828  YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 876

Query: 337  AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
             G L+ ++    ++Y W  +    T     +  +    + +  + ILVGD  RS+ LL Y
Sbjct: 877  NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 935

Query: 395  QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
            +P       +ARD+ P                       W                    
Sbjct: 936  KPMEGNFEEIARDFNPN----------------------WM------------------S 955

Query: 455  HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKI 512
              +ILD+ + +G      +    LF+ Q ++  +    R  L +   FHLG+ VN F   
Sbjct: 956  AVEILDDDNFLG-----AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVF--- 1007

Query: 513  RCKPSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQN 559
             C  S +    G  S   +    + +++G +G    L E  Y  LL +QN
Sbjct: 1008 -CHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQN 1056


>gi|299751161|ref|XP_001830098.2| pre-mRNA-splicing factor rse1 [Coprinopsis cinerea okayama7#130]
 gi|298409248|gb|EAU91763.2| pre-mRNA-splicing factor rse1 [Coprinopsis cinerea okayama7#130]
          Length = 1205

 Score = 50.1 bits (118), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 66/310 (21%), Positives = 121/310 (39%), Gaps = 46/310 (14%)

Query: 332  AICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIAL 391
            A+    G L+  VG+ + I+ L    L   A   +   I S+ +  + I++GD   S   
Sbjct: 939  ALLAFQGRLLAGVGKALRIYDLGKKKLLRKAETKSPTAIVSLATQGSRIVIGDMQESTLF 998

Query: 392  LRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLE--ICK 449
              Y+     L +   D +P              R +   ++V  +  +++G++       
Sbjct: 999  AVYKEAENRLLIFGDDTQP--------------RWVSAMTMV-DYNTVAVGDKFGNIFVN 1043

Query: 450  KIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF 509
            ++ S  +D +DE  +   ++ +K            A  +   H+      FH+G  + + 
Sbjct: 1044 RLDSTISDQVDEDPTGAGILHEK------------ATLNGAPHKTKMLAHFHVGDIITSI 1091

Query: 510  FKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEK-NYRRLLMLQNVMVTHTSHT 568
             K+       S   G R   L  Y  L G +G  +PL  K +   L ML+  +       
Sbjct: 1092 HKV-------SLVVGGREVLL--YTGLQGTIGILVPLTSKEDIEFLTMLEQHIRNEQGSL 1142

Query: 569  GGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEL 628
             G +  ++R     GYY   P + +IDG L   +  LS  ++  I  ++     D+L +L
Sbjct: 1143 VGRDHLSWR-----GYYV--PVKAVIDGDLCETYGGLSSSKQSAIASELDRTVGDVLKKL 1195

Query: 629  YDIEALSSHF 638
              +   SS F
Sbjct: 1196 DQMRVASSGF 1205


>gi|326426696|gb|EGD72266.1| hypothetical protein PTSG_00286 [Salpingoeca sp. ATCC 50818]
          Length = 1104

 Score = 50.1 bits (118), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 36/136 (26%), Positives = 58/136 (42%)

Query: 478  LFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLD 537
            L + Q E    +    L  K + +LG+ V +F +     ++  D+          + ++ 
Sbjct: 937  LSVCQREFEPGSTMQTLNAKFEIYLGETVTSFVRAALGSAAAVDSSMPLRNTFFVFGTMG 996

Query: 538  GALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGS 597
            G L   LPL       L  L+  M       GGL+ R FRT + +   A   +  ++DG 
Sbjct: 997  GGLACLLPLTPPQTELLTALECRMEEKIGGLGGLDHREFRTARDEQRMAQQVNPRLVDGD 1056

Query: 598  LVWKFLQLSLGERLEI 613
            LV  FLQL   E+ E+
Sbjct: 1057 LVETFLQLPEEEQKEL 1072


>gi|302837243|ref|XP_002950181.1| UV-damaged DNA binding complex subunit 1 protein [Volvox carteri f.
            nagariensis]
 gi|300264654|gb|EFJ48849.1| UV-damaged DNA binding complex subunit 1 protein [Volvox carteri f.
            nagariensis]
          Length = 1104

 Score = 49.7 bits (117), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 106/489 (21%), Positives = 169/489 (34%), Gaps = 111/489 (22%)

Query: 130  VSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLET 189
            V+ LA FH+   PR  L   ++  L I              VR VPL   P  +A+H   
Sbjct: 714  VAFLASFHSAAFPRS-LAVASEGALTIGTADEIQKLH----VRAVPLGENPRRIAHHEGA 768

Query: 190  KTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTN 249
            +   ++T   +       F      L+ D           + F V      +  E+P + 
Sbjct: 769  RMLGVLTMRLDSDGSERSF----LRLLDD-----------TTFDVVASYALAPGEMPCS- 812

Query: 250  FPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVT-CRGRILLFDIIEVVPE 308
              L  W         S      +  L     +GT +   E+    +GRIL+ + + +V E
Sbjct: 813  --LAAWPG-------SSNGTAAVGALNACFLVGTAFIVPEEPEPTKGRILVLEHVRLVTE 863

Query: 309  PGQ--------PLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTG 360
                       P  K+KI  + +   K P +  C + G  V    +  Y+  +       
Sbjct: 864  KEVKGAAYNVLPFVKDKI--LASVNSKVPASG-CDLGGVRVELASECSYLGNI------- 913

Query: 361  IAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYA 420
                   +Y+A+     NL++VGD  RS++LL Y  E   L   A DY     NS     
Sbjct: 914  -----LALYLATR---GNLVVVGDLMRSVSLLSYNVEQGVLEHRAADY-----NSG---- 956

Query: 421  GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFM 480
                         W                  +   + LD+ +   ++  D   N+V+  
Sbjct: 957  -------------W------------------TTSVEALDDDT---YLEGDNHLNLVVLR 982

Query: 481  YQPEARESNGGHRLIKKTDFHLGQHVNTF----FKIRCKPSSISDAPGARSRFLTWYASL 536
               ++       RL    ++H G  VN F      +R   S     P         +   
Sbjct: 983  RNADSATDEERARLQVVGEYHTGTFVNRFRHGSLVMRPPDSEFVSLP-----VPLLFGGT 1037

Query: 537  DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDG 596
            DG LG    LP   Y  L  LQ+ +       GGL+  A+  +  +   A   ++G +DG
Sbjct: 1038 DGRLGVIARLPPGLYEMLTKLQSALRQVVRGVGGLSHEAWIAFSNERRTA--DAKGFVDG 1095

Query: 597  SLVWKFLQL 605
             L+  FL L
Sbjct: 1096 DLIETFLDL 1104


>gi|347838030|emb|CCD52602.1| similar to DDB1B (Damaged DNA Binding protein 1 B); damaged DNA
            binding / protein binding [Botryotinia fuckeliana]
          Length = 1157

 Score = 49.7 bits (117), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 75/369 (20%), Positives = 138/369 (37%), Gaps = 82/369 (22%)

Query: 281  LGTNYNYSEDVTCRGRILLFDI-IEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGF 339
            +GT++ + E+   RGR+L+F +  +  P            MI +   KG    I  + G 
Sbjct: 835  VGTSFLHEEEANVRGRLLIFGVNADRAP-----------YMIASHNLKGSCRCIGVLDGK 883

Query: 340  LVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMV-----SVKNLILVGDYARSIALLRY 394
            +V A+ + + ++  ++   T         Y  S          N+I V D  +SIAL+ Y
Sbjct: 884  IVAALNKTVVMYDYEETSSTSATLKKLATYRCSTCPIDIDITDNIIAVADIMKSIALVEY 943

Query: 395  QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERL-EICKKIGS 453
             P                                DG          L ++L E+ +    
Sbjct: 944  TPG------------------------------ADG----------LPDKLEEVARHAQQ 963

Query: 454  KHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIR 513
              +  + E  +  ++ +D D N++L     E        R+    + +LG+ VN   +I 
Sbjct: 964  VFSTSVAEVDTDTYLETDHDGNLILLKRNREGVTREDKTRMEVTCEMNLGEMVNRVKRIN 1023

Query: 514  CKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHT----- 568
             + S   DA      FL    + +G++  F  +P +N   L+ LQ+ + +  S +     
Sbjct: 1024 VETS--KDALLIPRAFL---GTTEGSIYLFSLIPPQNQDLLMRLQSRLASLPSASSIRGS 1078

Query: 569  -------------GGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICK 615
                         G L+   +R+Y         P R  +DG L+ +FL L +  +  + +
Sbjct: 1079 SDSTSPHQIELSPGNLDFNKYRSYISATRETSEPFR-FVDGELIERFLDLEVEVQEHVAE 1137

Query: 616  KIGSKHNDI 624
             +G K  D+
Sbjct: 1138 GLGVKAEDL 1146


>gi|195145844|ref|XP_002013900.1| GL24391 [Drosophila persimilis]
 gi|194102843|gb|EDW24886.1| GL24391 [Drosophila persimilis]
          Length = 1140

 Score = 49.3 bits (116), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 65/298 (21%), Positives = 112/298 (37%), Gaps = 59/298 (19%)

Query: 316  NKIKMIYAKEQKGPVTAICHVAGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASM 373
            +K+  +   +  G   A+    G ++  +G   ++Y W   + +L     I   +    +
Sbjct: 857  SKLTQVAETKVDGTCYALVEFNGKVLAGIGSFVRLYEWT-NEKELRMECNIQNMIAALYL 915

Query: 374  VSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLV 433
             +  + ILVGD  RSI LL+++        +ARD +P                       
Sbjct: 916  KAKGDFILVGDLMRSITLLQHKQMEGIFVEIARDCEPK---------------------- 953

Query: 434  WKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR 493
            W                   +  +ILD+ + +G      + N  LF+ Q ++  +    R
Sbjct: 954  WM------------------RAVEILDDDTFLG-----SETNGNLFVCQKDSAATTDEER 990

Query: 494  --LIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFL---TWYASLDGALGFFLPLPE 548
              L +   FHLG  VN F       S +    G R+  +     Y + +GA+G    +P+
Sbjct: 991  QLLPELARFHLGDTVNVFRH----GSLVMQNVGERTTPINGCVLYGTCNGAIGIVTQIPQ 1046

Query: 549  KNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLS 606
              Y  L  L+  +       G +    +R ++        PS G IDG L+  FL LS
Sbjct: 1047 DFYDFLHGLEERLKKIIKSVGKIEHTYYRNFQINTKV--EPSEGFIDGDLIESFLDLS 1102


>gi|66811906|ref|XP_640132.1| CPSF domain-containing protein [Dictyostelium discoideum AX4]
 gi|74854972|sp|Q54SA7.1|SF3B3_DICDI RecName: Full=Probable splicing factor 3B subunit 3
 gi|60468134|gb|EAL66144.1| CPSF domain-containing protein [Dictyostelium discoideum AX4]
          Length = 1256

 Score = 49.3 bits (116), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 70/323 (21%), Positives = 135/323 (41%), Gaps = 51/323 (15%)

Query: 317  KIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSV 376
            K++++Y  E + PV A+    G LV  VG+ I I+ +    L  +   +T+    ++V++
Sbjct: 975  KLELLYKTEVEEPVYAMAQFQGKLVCGVGKSIRIYDMGKKKL--LRKCETKNLPNTIVNI 1032

Query: 377  KNL---ILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLV 433
             +L   ++VGD   SI  ++Y+     L + A D  P    S      +   G       
Sbjct: 1033 HSLGDRLVVGDIQESIHFIKYKRSENMLYVFADDLAPRWMTSSVMLDYDTVAG------- 1085

Query: 434  WKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDK-DKNVVLFMYQPEARESNGG- 491
                                K  +I      +  +ISD+ +++      + E+   NG  
Sbjct: 1086 ------------------ADKFGNIF--VLRLPLLISDEVEEDPTGTKLKFESGTLNGAP 1125

Query: 492  HRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEK-N 550
            H+L    +F +G  V T      K S +   P      +  Y ++ GA+G  +P   + +
Sbjct: 1126 HKLDHIANFFVGDTVTTL----NKTSLVVGGPE-----VILYTTISGAIGALIPFTSRED 1176

Query: 551  YRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGER 610
                  L+  M +      G +  A+R+Y    Y+   P + IIDG L  +F  L+  ++
Sbjct: 1177 VDFFSTLEMNMRSDCLPLCGRDHLAYRSY----YF---PVKNIIDGDLCEQFSTLNYQKQ 1229

Query: 611  LEICKKIGSKHNDILDELYDIEA 633
            L I +++    ++++ +L +I +
Sbjct: 1230 LSISEELSRSPSEVIKKLEEIRS 1252


>gi|392566425|gb|EIW59601.1| hypothetical protein TRAVEDRAFT_167065 [Trametes versicolor FP-101664
            SS1]
          Length = 1263

 Score = 49.3 bits (116), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 69/328 (21%), Positives = 126/328 (38%), Gaps = 68/328 (20%)

Query: 295  GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA-GFLVTAVGQKIYIWQL 353
            GRILLF +     E G       +  + + + +G V A+ HV+ G +  A+   + ++++
Sbjct: 965  GRILLFSLSS---ENG----VRSLTTVASHKVRGCVYALQHVSEGVIAAAINTSVLLYKI 1017

Query: 354  KDNDLTGIAF---IDTEV------YIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLV 404
            ++ +L G  F   +D         ++ S+V     +LVGD   S+++LR   +   L  V
Sbjct: 1018 REGNL-GEGFDRVLDKAAEWNHNHFVTSLVWDGQFLLVGDAISSVSVLRVADDATKLESV 1076

Query: 405  ARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSS 464
            ARDY P  P +                                           ++   +
Sbjct: 1077 ARDYAPLWPVA-------------------------------------------IESTGN 1093

Query: 465  MGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPG 524
             G + ++ D N+  F  Q    + NG   L K   +H+   VN   K     + +S    
Sbjct: 1094 GGVIGANSDCNLFSFALQ-RGPQRNG---LEKNGVYHIDDVVNKLIKGALSSADVSQDQA 1149

Query: 525  ARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRT-YKGKG 583
             ++  + + ++  G +G  L + +     +  LQ  M       GG+N    R     +G
Sbjct: 1150 VKAGHVFFTST--GRIGAILDMNDTMSLHMTALQRNMAKSLIGPGGVNHTKRRAPATPRG 1207

Query: 584  YYAGNPSRGIIDGSLVWKFLQLSLGERL 611
            +     S G +DG  +  FL  +  E+L
Sbjct: 1208 HTDAEASYGFLDGDFLETFLSHAHPEQL 1235


>gi|242803623|ref|XP_002484212.1| UV-damaged DNA binding protein, putative [Talaromyces stipitatus ATCC
            10500]
 gi|218717557|gb|EED16978.1| UV-damaged DNA binding protein, putative [Talaromyces stipitatus ATCC
            10500]
          Length = 1140

 Score = 49.3 bits (116), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 64/349 (18%), Positives = 130/349 (37%), Gaps = 63/349 (18%)

Query: 281  LGTNYNYSEDV-TCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGF 339
            +GT Y   E   + RGRILLF++           +  K+ +      KG   A+  +   
Sbjct: 833  VGTAYLDDETAESIRGRILLFEVD----------SNRKLSLFLEHPVKGACRALAMMGNK 882

Query: 340  LVTAVGQKIYIW------QLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLR 393
            +V A+ + + I+      QL  + L  +A   T      +    + I+V D  +SI+++ 
Sbjct: 883  IVAALVKTVVIFDVERKSQLGKHALKKVAAYRTSTAPVDIAVTDSTIVVADLMKSISIVE 942

Query: 394  YQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGS 453
                ++T +L                                       E  E+ +   +
Sbjct: 943  ---SHKTDALTV-------------------------------------EAKEVARHFAT 962

Query: 454  KHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIR 513
                 + +  S  +++SD + N+++     +        RL   ++  LG+ VN     R
Sbjct: 963  VWTTAVADIGSNQWLVSDAEGNLIVLRRNVDGVTEEDRRRLEVTSELLLGEMVN-----R 1017

Query: 514  CKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNP 573
             +P +I            +  +++G++  F  +  ++   L+ LQ  +  +    G +  
Sbjct: 1018 IRPVNILQTSTVAVNPKAFLGTVEGSIYLFALINPEHQDFLMRLQTAITAYVDSPGYMPF 1077

Query: 574  RAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHN 622
              FR ++        P R  +DG L+ +FL      + EI   +GS ++
Sbjct: 1078 SKFRAFRSSVREGDEPFR-FVDGELIERFLDCDRPVQEEILGVVGSGYD 1125


>gi|340521192|gb|EGR51427.1| predicted protein [Trichoderma reesei QM6a]
          Length = 1161

 Score = 49.3 bits (116), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 101/546 (18%), Positives = 196/546 (35%), Gaps = 111/546 (20%)

Query: 97   GYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRI 156
            G   VF    H A L  +S G +     T D   + +APF +   P   +  +    +RI
Sbjct: 698  GICNVFATTEH-ASLIYSSEGRIVYSATTADD-ATFVAPFDSEAFPDSIV-LSTDEHIRI 754

Query: 157  SVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTY---CIVTSTAEPSTDYYKFNGEDK 213
                 H+  +    V+ +P+  T   +AY    K +   CI     E           ++
Sbjct: 755  C----HVDSERLTHVKSLPMHETVRRVAYSPGLKAFGLGCIKKELVE-----------NE 799

Query: 214  ELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLS 273
            E+VT     R +  ++ Q    L  PF               E V C+  +  E   +  
Sbjct: 800  EVVTST--VRLVDEIIFQ---ELGQPFELNASAS-------LELVECV--IRAELPDSNG 845

Query: 274  GLRGYIALGTNY----NYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGP 329
             +     +GT++       E    RGRI++  + E            ++  I +   KG 
Sbjct: 846  NMTERFLVGTSFVADPGTDEAGETRGRIVVLGVDE----------SRQLYQIASHNLKGV 895

Query: 330  VTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVK-----NLILVGD 384
               +  +  ++V  + + + ++       T  +      Y  +   V      N+I VGD
Sbjct: 896  CRCLAMLDDYIVAGLSKTVVVYSYAQETSTAASLTKVASYRPASFPVDLDVSGNMIGVGD 955

Query: 385  YARSIALLRYQP----EYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLS 440
              +S+ L+ + P    +   L   AR Y+     S                         
Sbjct: 956  LMQSLTLIEFTPPQDGKMAKLEEKARHYQQAWTTS------------------------- 990

Query: 441  LGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDF 500
                  +C          LDE     ++ +D   NV++   + EA       +L   ++ 
Sbjct: 991  ------VCA---------LDETR---WLEADAQGNVIVLRQRQEAPTEQDRSQLEITSEL 1032

Query: 501  HLGQHVNTFFKIRCKPSSISDAPGARSRFL--TWYASLDGALGFFLPLPEKNYRRLLMLQ 558
            ++G+ +N   K++        APG  +  +   +  S++G L  +  +  K    L+  Q
Sbjct: 1033 NIGEQINRIRKLQV-------APGENAVVVPKAFLGSIEGTLYLYGDIAPKYQDLLMTFQ 1085

Query: 559  NVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 618
            + +  +    G L+   +R ++ +     +P R  +DG ++ +FL L   ++  +C+ +G
Sbjct: 1086 SRLQGYIQTPGNLSFDLWRAFRNQAREGESPYR-FVDGEMIERFLDLDESQQELVCEGLG 1144

Query: 619  SKHNDI 624
                D+
Sbjct: 1145 PNVEDM 1150


>gi|154303693|ref|XP_001552253.1| hypothetical protein BC1G_08731 [Botryotinia fuckeliana B05.10]
          Length = 1087

 Score = 49.3 bits (116), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 73/368 (19%), Positives = 135/368 (36%), Gaps = 80/368 (21%)

Query: 281  LGTNYNYSEDVTCRGRILLFDI-IEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGF 339
            +GT++ + E+   RGR+L+F +  +  P            MI +   KG    I  + G 
Sbjct: 765  VGTSFLHEEEANVRGRLLIFGVNADRAP-----------YMIASHNLKGSCRCIGVLDGK 813

Query: 340  LVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMV-----SVKNLILVGDYARSIALLRY 394
            +V A+ + + ++  ++   T         Y  S          N+I V D  +SIAL+ Y
Sbjct: 814  IVAALNKTVVMYDYEETSSTSATLKKLATYRCSTCPIDIDITDNIIAVADIMKSIALVEY 873

Query: 395  QPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK 454
             P          D  P +                                 E+ +     
Sbjct: 874  TP--------GADGLPDKLE-------------------------------EVARHAQQV 894

Query: 455  HNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC 514
             +  + E  +  ++ +D D N++L     E        R+    + +LG+ VN   +I  
Sbjct: 895  FSTSVAEVDTDTYLETDHDGNLILLKRNREGVTREDKTRMEVTCEMNLGEMVNRVKRINV 954

Query: 515  KPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHT------ 568
            + S   DA      FL    + +G++  F  +P +N   L+ LQ+ + +  S +      
Sbjct: 955  ETS--KDALLIPRAFL---GTTEGSIYLFSLIPPQNQDLLMRLQSRLASLPSASSIRGSS 1009

Query: 569  ------------GGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKK 616
                        G L+   +R+Y         P R  +DG L+ +FL L +  +  + + 
Sbjct: 1010 DSTSPHQIELSPGNLDFNKYRSYISATRETSEPFR-FVDGELIERFLDLEVEVQEHVAEG 1068

Query: 617  IGSKHNDI 624
            +G K  D+
Sbjct: 1069 LGVKAEDL 1076


>gi|67516629|ref|XP_658200.1| hypothetical protein AN0596.2 [Aspergillus nidulans FGSC A4]
 gi|40747539|gb|EAA66695.1| hypothetical protein AN0596.2 [Aspergillus nidulans FGSC A4]
 gi|259489136|tpe|CBF89158.1| TPA: damaged DNA binding protein (Eurofung) [Aspergillus nidulans
            FGSC A4]
          Length = 1132

 Score = 48.9 bits (115), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 37/159 (23%), Positives = 70/159 (44%), Gaps = 8/159 (5%)

Query: 467  FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGAR 526
            ++ SD + N+++        E +   RL    +  L + VN     R +P +I   P A 
Sbjct: 970  YLESDAEGNLIVLRRNRSGVEEDDRRRLEVTGEICLNEMVN-----RIRPVNIQQLPSAT 1024

Query: 527  SRFLTWYASLDGALGFFLPLPEKNYRRLLM-LQNVMVTHTSHTGGLNPRAFRTYKGKGYY 585
                 + A+++G++ +   +   +Y+  LM LQ  M +     GG+    +R ++     
Sbjct: 1025 VVPRAFLATVEGSI-YLYAIINPDYQDFLMRLQATMASRADSLGGIPFTDYRAFRTMTRQ 1083

Query: 586  AGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 624
            A  P R  +DG L+ +FL      + EI   +GS   ++
Sbjct: 1084 ATEPYR-FVDGELIERFLTCEPAVQKEIVDIVGSSLEEV 1121


>gi|68071595|ref|XP_677711.1| hypothetical protein [Plasmodium berghei strain ANKA]
 gi|56497932|emb|CAI04454.1| conserved hypothetical protein [Plasmodium berghei]
          Length = 493

 Score = 48.9 bits (115), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 113/549 (20%), Positives = 198/549 (36%), Gaps = 128/549 (23%)

Query: 133 LAPFHNVNCPRG-------FLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAY 185
           L PFHN N  +        F++F+      +S+  +HL+      ++K+P   T   +A+
Sbjct: 26  LNPFHNFNSFKKKNQNNLYFIFFDG-----LSLYISHLNEINETYIQKIPFYRTVEKIAF 80

Query: 186 HLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEI 245
           H E+    ++TS   P  + +K N   K+++       F  P  + F  S   P  +   
Sbjct: 81  HKESGL--LITSC--PPEEKHKTNKNLKQIIC------FFNPYQNSFKYSYIIPSKYNV- 129

Query: 246 PQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGT-NYNYSEDVTCRGRILLFDIIE 304
                        +C+  ++ +     S +   I +GT N N        G I +F    
Sbjct: 130 -----------SSICIYQINKDIYPNKSNINTLICVGTANINDRVSEPSSGNIYIF---- 174

Query: 305 VVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGF---LVTAVGQKIYIWQLKD------ 355
              +    L +  IK IY       V  I H+  F   L++ +   + I  + D      
Sbjct: 175 -FAKKKDNLFE--IKHIYT--HNVNVGGITHLKQFYDKLISTINNTVVILDISDFLINLD 229

Query: 356 -------------ND--LTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRT 400
                        ND  +  +A      +I S+  ++N I+VGD   S+ +L Y     T
Sbjct: 230 KYVDNTNKPIKLENDGTIVDVASFTPSSWIMSLDVIENYIVVGDIMTSVTILSYDFNNST 289

Query: 401 LSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILD 460
           L+ V RDY                      S VW     +L                   
Sbjct: 290 LTEVCRDY----------------------SNVWCTFVCAL------------------- 308

Query: 461 EFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSIS 520
             S   F++SD + N ++F             +L +   F+ G  VN    +    SS+ 
Sbjct: 309 --SKSHFLVSDMESNFLVFQKSSIRYNDEDSFKLSRVAFFNHGHVVNKMLPVSL--SSLI 364

Query: 521 DAPGARSRFLTWYASLDGA-----LGFFLPLPE-KNYRRLLMLQNVMVTHTSHTGGLNPR 574
           +   A++  L    S+  A     +   +P     N+++ L ++  +    S  G +N  
Sbjct: 365 EEEEAQNEILRKKESILCASSEGSISSIIPFSNLTNFKKALCIEIALNDSLSFIGNINNN 424

Query: 575 AFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLE-------ICKKIGSKHNDILDE 627
           +  TYK     +    +G++DG L   F  +   ++ +       I KK+  K     + 
Sbjct: 425 SNNTYKMN--LSEKSCKGVVDGELFKMFFSMPFEKQFKTYIYAKWIGKKLNCKFGTFENF 482

Query: 628 LYDIEALSS 636
           + DIE L S
Sbjct: 483 ILDIENLCS 491


>gi|440639387|gb|ELR09306.1| hypothetical protein GMDG_03874 [Geomyces destructans 20631-21]
          Length = 1138

 Score = 48.9 bits (115), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 76/346 (21%), Positives = 129/346 (37%), Gaps = 71/346 (20%)

Query: 294  RGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQL 353
            RGRIL+F         G   ++N  K+   K  KG    +  + G +V A+ + I +++ 
Sbjct: 849  RGRILVF---------GVDSSRNPYKIAEYK-VKGACRCLGVIDGKIVAALVKTIVVFEY 898

Query: 354  KDNDLTGIAFIDTEVYIASMVSVK-----NLILVGDYARSIALLRYQP----EYRTLSLV 404
             +   T         Y  S   V      N I V D  +S++L+ Y+     E  TL  V
Sbjct: 899  TELSGTSARIEKVASYRTSTCPVDLAIEGNTIAVADLMKSVSLVEYRAGTSGEAPTLVEV 958

Query: 405  ARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSS 464
            AR ++                       VW      + E                     
Sbjct: 959  ARHFQS----------------------VWATAVAHVDE--------------------- 975

Query: 465  MGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPG 524
             G++ +D D N+++      A       ++    +FHLG+ VN   KIR   S      G
Sbjct: 976  -GWLEADADGNLIVLRRNEAAVTFEDRKKMEVTGEFHLGEQVNRIRKIRVDASE-----G 1029

Query: 525  ARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGY 584
            A      + A+ +G+L  +  +   +   LL LQ  +  +    G +    +R+++    
Sbjct: 1030 ATVVPRAFLATTEGSLFLYGSVAPASQDLLLRLQQRLAENVETPGNIPFTTYRSFRNAER 1089

Query: 585  YAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHN--DILDEL 628
                P R  IDG L+ +FL L    +  +CK +       D+++EL
Sbjct: 1090 ETEEPYR-FIDGELIERFLDLDEERQEVVCKGLAKVEEVRDLVEEL 1134


>gi|448528339|ref|XP_003869702.1| hypothetical protein CORT_0D07360 [Candida orthopsilosis Co 90-125]
 gi|380354055|emb|CCG23569.1| hypothetical protein CORT_0D07360 [Candida orthopsilosis]
          Length = 1170

 Score = 48.9 bits (115), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 30/104 (28%), Positives = 53/104 (50%), Gaps = 4/104 (3%)

Query: 533  YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 592
            Y  L G +G  LPL  K+   + +L ++ +  +++   +N       K + YY  NP++ 
Sbjct: 1070 YTGLTGTIGILLPLISKS--EIELLHDLQLEISAYNDKVNVAGKNHAKLRSYY--NPAKN 1125

Query: 593  IIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSS 636
            I DG  +  +L L L E+L+I K++     ++  +L DI   SS
Sbjct: 1126 IFDGDFLELYLNLPLDEKLKIAKRLNKSVGEVEKKLNDIRNRSS 1169


>gi|383863765|ref|XP_003707350.1| PREDICTED: DNA damage-binding protein 1-like [Megachile rotundata]
          Length = 1138

 Score = 48.5 bits (114), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 68/341 (19%), Positives = 127/341 (37%), Gaps = 61/341 (17%)

Query: 304  EVVPEPGQPL----TKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLT 359
            E  P+ G+ L       K+  +  KE KG   ++    G L+ ++   + +++       
Sbjct: 838  ETEPKMGRILLYHWNDGKLTQVAEKEIKGSCYSLVEFNGKLLASINSTVRLFEWTAEKEL 897

Query: 360  GIAFIDTEVYIASMVSVK-NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGY 418
             +        IA  +  K + +LVGD  RS+ LL+Y+    +   +ARDY P        
Sbjct: 898  RLECSHFNNIIALYLKTKGDFVLVGDLMRSLTLLQYKTMEGSFEEIARDYNPN------- 950

Query: 419  YAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVL 478
                           W                      +ILD+ + +G      +    L
Sbjct: 951  ---------------WM------------------TAVEILDDDTFLG-----AENCFNL 972

Query: 479  FMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASL 536
            F+ Q ++  ++   R  + +   FHLG  VN F        ++ ++    ++    + ++
Sbjct: 973  FVCQKDSAATSEDERQQMQEIGQFHLGDMVNVFRHGSLVMQNLGES-STPTQGCVLFGTV 1031

Query: 537  DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDG 596
             GA+G    +P   Y  L  L+  +       G +  R +R++  +         G IDG
Sbjct: 1032 SGAIGLVTQIPFTFYEFLRHLEYRLTEVIKSVGKIEHRFWRSFNTE--LKVENCEGFIDG 1089

Query: 597  SLVWKFLQLSLGERLEICKKI------GSKHNDILDELYDI 631
             L+  FL LS  +  E+   +      G +    +D+L  I
Sbjct: 1090 DLIESFLDLSPDKMAEVAVDLMMDDSSGMRKEATVDDLVKI 1130


>gi|429850956|gb|ELA26181.1| DNA damage-binding protein 1 [Colletotrichum gloeosporioides Nara
            gc5]
          Length = 1409

 Score = 48.1 bits (113), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 110/552 (19%), Positives = 213/552 (38%), Gaps = 105/552 (19%)

Query: 97   GYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRI 156
            G   VF    H + ++ +S G +     T +  V+ +APF +   P   +    K+ +RI
Sbjct: 671  GISNVFATTEHSSLIY-SSEGRIIYSAATAED-VTYIAPFDSEAFPDAIVLATDKN-VRI 727

Query: 157  SVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELV 216
            +    H+  +    V  +PL+ T   +AY    K + I T   E       FN E  E+V
Sbjct: 728  A----HIDVERRTHVNPLPLRQTVRRVAYSPALKAFGIGTIRRE------LFNNE--EMV 775

Query: 217  TDPRDSRFIPPLVSQFHVSLFS-PFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGL 275
            T    S F   LV +  + +   PF  +    T          L    +  E   +    
Sbjct: 776  T----SSF--QLVDEIVLGVVGKPFHLDGAATTE---------LVESVIRAELPDSSGQP 820

Query: 276  RGYIALGTNY----NYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVT 331
                 +GT+Y       E+   +GRIL+          G    KN  +++ + E KG   
Sbjct: 821  AERFIVGTSYLADPEMDENSEVKGRILVL---------GVDSDKNPYQIV-SHELKGACR 870

Query: 332  AICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVK-----NLILVGDYA 386
            ++  +   LV  + + + ++   +   T  + +    +  S   V      N+I V D  
Sbjct: 871  SLAVMGDKLVAGLSKTVVVYDYAEESSTSGSLLKLATFRPSTFPVDLDVNGNMIGVADLM 930

Query: 387  RSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLE 446
            +S+ L+ + P        A+D             GN +R +++ +  ++++  +    LE
Sbjct: 931  QSMTLIEFIP--------AQD-------------GNKAR-LVERARHFQYIWATAVCHLE 968

Query: 447  ICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHV 506
                      D+  E  + G        N+++    P A   +   ++   ++FHLG+ +
Sbjct: 969  ---------QDLWIEADAQG--------NLMVLRRNPNAPTEHDKKQMEVISEFHLGEQI 1011

Query: 507  NTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTS 566
            N     + +P  +            + A+++G++  F  +  +    LL  Q  +     
Sbjct: 1012 N-----KIRPLDVVSGENDPIEPKAFLATIEGSIYVFADIKPEYQSLLLQFQERLAGVIK 1066

Query: 567  HTG-------GLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG- 618
              G       GL+  ++R ++     A  P R  +DG L+ +FL L  G +  + + +G 
Sbjct: 1067 TLGQADEPGAGLSFMSWRGFRNAKRSADGPFR-FVDGELIERFLDLDAGRQEAVVQGLGP 1125

Query: 619  --SKHNDILDEL 628
               +  D+++EL
Sbjct: 1126 TVERMRDLVEEL 1137


>gi|332030156|gb|EGI69950.1| DNA damage-binding protein 1 [Acromyrmex echinatior]
          Length = 1138

 Score = 48.1 bits (113), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 72/364 (19%), Positives = 135/364 (37%), Gaps = 69/364 (18%)

Query: 278  YIALGTNY-NYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
            Y  +GT + N  E     GRILL+             ++ K   +  KE KG   ++   
Sbjct: 826  YFVVGTAFINPDETEPKMGRILLYH-----------WSEGKFTQVAEKEIKGSCYSLVEF 874

Query: 337  AGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVK-NLILVGDYARSIALLRYQ 395
             G L+ ++   + +++        +        IA  +  K + +LVGD  RS+ LL+Y+
Sbjct: 875  NGKLLASINSTVRLFEWTAEKELRLECSHFNNIIALYLKTKGDFVLVGDLMRSLTLLQYK 934

Query: 396  PEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKH 455
                +   +ARDY P    S                                        
Sbjct: 935  TMEGSFEEIARDYNPNWMTSI--------------------------------------- 955

Query: 456  NDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIR 513
             +ILD+ + +G      +    LF+ Q ++  ++   R  + +   FHLG  VN F    
Sbjct: 956  -EILDDDTFLG-----AENCFNLFVCQKDSAATSEDERQQMQEIGQFHLGDMVNVFRHGS 1009

Query: 514  CKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNP 573
                ++ ++    +     + ++ GA+G    +P   Y  L  +++ + +     G +  
Sbjct: 1010 LVMQNLGES-STPTLGCVLFGTVSGAIGLVTQIPVTFYEFLRNMEDRLNSVIKSVGKIEH 1068

Query: 574  RAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDE 627
              +R++  +         G IDG L+  FL L+  +  E+   +      G K    +D+
Sbjct: 1069 NFWRSFNTE--LKIEQCEGFIDGDLIESFLDLNHDKMAEVAMGLMIDDGSGMKKEATVDD 1126

Query: 628  LYDI 631
            L  I
Sbjct: 1127 LVKI 1130


>gi|302894051|ref|XP_003045906.1| predicted protein [Nectria haematococca mpVI 77-13-4]
 gi|256726833|gb|EEU40193.1| predicted protein [Nectria haematococca mpVI 77-13-4]
          Length = 1162

 Score = 47.8 bits (112), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 61/339 (17%), Positives = 131/339 (38%), Gaps = 68/339 (20%)

Query: 295  GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLK 354
            GRIL+  + E            ++  I +   KGP   +  +  ++V  + + + ++   
Sbjct: 872  GRILVLGVDE----------HRQVYQIVSHNLKGPCRCLGMMDDYIVAGLSKTVVVYNYS 921

Query: 355  DNDLTGIAFIDTEVYIASMVSVK-----NLILVGDYARSIALLRYQPEYR----TLSLVA 405
             +  +  +      Y  + + V      N+I VGD  +S++L+ + P        L   A
Sbjct: 922  QDTSSSGSLEKLAAYRPAALPVDLDISGNMIGVGDLMQSLSLVEFIPAQDGRKAKLEERA 981

Query: 406  RDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSM 465
            R Y+P                      +W            +C          LDE    
Sbjct: 982  RHYEP----------------------IWT---------TSLCH---------LDEER-- 999

Query: 466  GFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGA 525
             ++ +D   N+++     +A       RL   ++  +G+ +N   K+   P+  +     
Sbjct: 1000 -WLEADSQGNLIVLQRNADAPTEQDRSRLEVTSEIGIGEQINRIRKLHV-PAGDNSIVHP 1057

Query: 526  RSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYY 585
            R+    + AS +G+L  +  +  +    L+  Q+ M  +    G +  + +R+++ +   
Sbjct: 1058 RA----FLASAEGSLYLYGDIAPQYQDLLMTFQSKMEEYIHAPGNIEFKLWRSFRNENRE 1113

Query: 586  AGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 624
            +  P R  IDG +V +FL +  G++  +C+ +G    D+
Sbjct: 1114 SDGPYR-FIDGEMVERFLDMDEGKQELVCEGLGPSVEDM 1151


>gi|83314897|ref|XP_730560.1| multisubunit cleavage/polyadenylation specificity factor subunit A
           [Plasmodium yoelii yoelii 17XNL]
 gi|23490318|gb|EAA22125.1| CPSF A subunit region, putative [Plasmodium yoelii yoelii]
          Length = 863

 Score = 47.4 bits (111), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 112/594 (18%), Positives = 214/594 (36%), Gaps = 135/594 (22%)

Query: 101 VFLCGPHPAWLFLTSRGELRAHPMTIDG--------PVSTLAPFHNVNCPRG-------F 145
           +F+C  +P  ++   + ++    ++I              L PFHN N  +        F
Sbjct: 345 LFICSDNPIIIYSDIKKKISLSKVSIKNIFLVDIFNDFDYLNPFHNFNSFKKKNQNNLYF 404

Query: 146 LYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDY 205
           ++F+      +S+  +HL+      ++K+P   T   +AYH E+    ++TS   P+ + 
Sbjct: 405 IFFDG-----LSLYISHLNEINETYIQKIPFYRTVEKIAYHNESGL--LITSC--PTEEK 455

Query: 206 YKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVS 265
           +K N   K+++       F  P  + F  S   P  +                +C+  ++
Sbjct: 456 HKTNKNLKQIIC------FFNPHQNSFKYSYIIPSKYNVSS------------ICIYQIN 497

Query: 266 MEYEGTLSGLRGYIALGTNYNYSEDVT-----------CRGRILLFDIIEVVPEP----- 309
            +     S +   I +GT  N ++ V+            + +  LF+I  +         
Sbjct: 498 KDIYPNKSNINTLICVGT-ANINDRVSEPSSGHIYIFFAKKKANLFEIKHIYTHNINVGG 556

Query: 310 --------GQPLTKNKIKMIYAKEQKGPVTAICHVAGFL------VTAVGQKIYIWQLKD 355
                    + ++     +IY    K  +  I  ++ FL      V    + I +    D
Sbjct: 557 ITHLKQFYDKLISTINNTVIYKCVNKKLIVVILDISDFLINLDKYVDNTNKPIKLEN--D 614

Query: 356 NDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNS 415
             +  +A      +I S+  ++N I+VGD   S+ +L Y     TL+ V RDY       
Sbjct: 615 GTIVDVASFTPSSWIMSLDVIENYIVVGDIMTSVTILSYDFNNSTLTEVCRDY------- 667

Query: 416 KGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKN 475
                          S VW     +L                     S   F++SD + N
Sbjct: 668 ---------------SNVWCTFVCAL---------------------SKSHFLVSDMESN 691

Query: 476 VVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYAS 535
            ++F             +L +   F+ G  VN    +    SS+ +   A++  L    S
Sbjct: 692 FLVFQKSSIRYNDEDSFKLSRVALFNHGHVVNKMLPVSL--SSLIEEEEAQNEILRKKES 749

Query: 536 LDGA-----LGFFLPLPE-KNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNP 589
           +  A     +   +P     N+++ L ++  +    S    +N  +  TYK     +   
Sbjct: 750 ILCASSEGSISSIIPFSNLTNFKKALCIEIALNDSLSFIXNINNNSNNTYKMN--LSEKS 807

Query: 590 SRGIIDGSLVWKFLQLSLGERLE-------ICKKIGSKHNDILDELYDIEALSS 636
           S+G++DG +   F  +   ++         I KK+  K     + + DIE L S
Sbjct: 808 SKGVVDGEVFKMFFSMPFEKQFXTYIYAKWIAKKLNCKFGXFENFMLDIENLCS 861


>gi|269861065|ref|XP_002650248.1| pre-mRNA cleavage and polyadenylation specificity factor
           [Enterocytozoon bieneusi H348]
 gi|220066338|gb|EED43824.1| pre-mRNA cleavage and polyadenylation specificity factor
           [Enterocytozoon bieneusi H348]
          Length = 1022

 Score = 47.4 bits (111), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 38/139 (27%), Positives = 65/139 (46%), Gaps = 19/139 (13%)

Query: 275 LRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAIC 334
           L  Y+ +  +   SED   + RI+L+ I+ +V +   P  K  +K+ Y  E+K    AI 
Sbjct: 703 LDDYVVISLSTVDSEDKCTKSRIILYSIVPIVIDNTCP--KKNLKLKYLGEEKIKY-AIH 759

Query: 335 HVAGF----------------LVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKN 378
               F                L+  VG ++ I++L  N+ T I  ++  V + ++  V+N
Sbjct: 760 SFDVFYKKKLQNHKYVLSDILLIVGVGTRLMIYELNYNEFTPIGRLEISVGVIAVTVVRN 819

Query: 379 LILVGDYARSIALLRYQPE 397
           LIL+GD    + L   +PE
Sbjct: 820 LILLGDLFTGMELFYLRPE 838


>gi|124506183|ref|XP_001351689.1| conserved Plasmodium protein, unknown function [Plasmodium falciparum
            3D7]
 gi|23504617|emb|CAD51496.1| conserved Plasmodium protein, unknown function [Plasmodium falciparum
            3D7]
          Length = 2763

 Score = 47.4 bits (111), Expect = 0.026,   Method: Composition-based stats.
 Identities = 25/85 (29%), Positives = 46/85 (54%), Gaps = 10/85 (11%)

Query: 340  LVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARS--IALLRYQPE 397
            ++  +  KIYI ++ DND T  AF+D   YI+ +  +KN I++ D  +   I +  Y+ +
Sbjct: 2502 ILHCINSKIYIHEVNDNDFTKGAFLDNNFYISDIKIMKNFIIIADLFKGIFINMYNYEEQ 2561

Query: 398  YRTLSLVARDYKPTQPNSKGYYAGN 422
            Y + S+++         SK +Y+ N
Sbjct: 2562 YDSRSIISI--------SKNFYSNN 2578


>gi|307205760|gb|EFN83990.1| DNA damage-binding protein 1 [Harpegnathos saltator]
          Length = 1138

 Score = 47.0 bits (110), Expect = 0.028,   Method: Compositional matrix adjust.
 Identities = 72/364 (19%), Positives = 134/364 (36%), Gaps = 69/364 (18%)

Query: 278  YIALGTNY-NYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
            Y  +GT   N  E     GRILL+             +  K+  +  KE KG   ++   
Sbjct: 826  YFVVGTALINPDETEPKMGRILLYH-----------WSDGKLTQVAEKEIKGSCYSLVEF 874

Query: 337  AGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVK-NLILVGDYARSIALLRYQ 395
             G L+ ++   + +++        +        IA  +  K + +LVGD  RS+ LL+Y+
Sbjct: 875  NGKLLASINSTVRLFEWTAEKELRLECSHFNNIIALYLKTKGDFVLVGDLMRSLTLLQYK 934

Query: 396  PEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKH 455
                +   +ARDY P    S                                        
Sbjct: 935  TMEGSFEEIARDYNPNWMTSI--------------------------------------- 955

Query: 456  NDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFFKIR 513
             +ILD+ + +G      +    LF+ Q ++  ++   R  + +   FHLG  VN F    
Sbjct: 956  -EILDDDTFLG-----AENCFNLFVCQKDSAATSEDERQQMQEVGQFHLGDMVNVFRHGS 1009

Query: 514  CKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNP 573
                ++ ++    +     + ++ GA+G    +P   Y  L  L++ + +     G +  
Sbjct: 1010 LVMQNLGES-STPTLGCVLFGTVSGAIGLVTQIPFAFYEFLRNLEDRLNSVIKSVGKIEH 1068

Query: 574  RAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHNDILDE 627
              +R++  +         G IDG L+  FL L+  +  E+   +      G K    +D+
Sbjct: 1069 NFWRSFNTE--LKIEQCEGFIDGDLIESFLDLNHDKMAEVAMGLMIDDGSGMKKEATVDD 1126

Query: 628  LYDI 631
            L  +
Sbjct: 1127 LVKV 1130


>gi|395330962|gb|EJF63344.1| hypothetical protein DICSQDRAFT_153890 [Dichomitus squalens LYAD-421
            SS1]
          Length = 1263

 Score = 47.0 bits (110), Expect = 0.032,   Method: Compositional matrix adjust.
 Identities = 67/344 (19%), Positives = 127/344 (36%), Gaps = 67/344 (19%)

Query: 273  SGLRGYIALGTNYNYSEDVT-CRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVT 331
            SG     ALGT Y   E+    +GRILLF +     E  +      +  + +    G V 
Sbjct: 938  SGASPAFALGTVYIRPEEREPSKGRILLFSVSST--EGARGANVRSLHTLASVNVGGCVY 995

Query: 332  AICHVA-GFLVTAVGQKIYIWQLKDND--------LTGIAFIDTEVYIASMVSVKNLILV 382
            A+ +++   +V A+   + +++  +N+        L  +   +   ++ ++V     ILV
Sbjct: 996  ALANLSENLIVAAINTSVVLFKSTENEAGESTPLSLEKVTEWNHNHFVTNVVVDGERILV 1055

Query: 383  GDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLG 442
            GD   S+++L++      L  +ARDY P  P                             
Sbjct: 1056 GDAISSVSVLKWNERLERLESIARDYGPLWP----------------------------- 1086

Query: 443  ERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTD--F 500
                           I  E +  G + ++ D N+  F  Q         HR   + D  +
Sbjct: 1087 ---------------IAIEGTGNGLIGANADCNLFSFSLQSVP------HRTYLEKDGVY 1125

Query: 501  HLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNV 560
            HL    N F +     + +++    ++  + + ++  G +G  L + +     +  LQ  
Sbjct: 1126 HLNDVTNKFVRGALTSTDVAEDQVVKASHVFFTST--GCIGAILDMNDVTSLHMTALQRN 1183

Query: 561  MVTHTSHTGGLNPRAFRT-YKGKGYYAGNPSRGIIDGSLVWKFL 603
            M    +  GG N    R     +G+     S G +DG  + ++L
Sbjct: 1184 MAKTLTGPGGDNHTKLRAPSTPRGHTDAEASYGFLDGDFLEQYL 1227


>gi|242010743|ref|XP_002426118.1| DNA damage-binding protein, putative [Pediculus humanus corporis]
 gi|212510165|gb|EEB13380.1| DNA damage-binding protein, putative [Pediculus humanus corporis]
          Length = 1148

 Score = 47.0 bits (110), Expect = 0.033,   Method: Compositional matrix adjust.
 Identities = 75/358 (20%), Positives = 126/358 (35%), Gaps = 64/358 (17%)

Query: 278  YIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA 337
            YI      N  E  + +GRIL+F   E            K+  +  KE KG   ++    
Sbjct: 837  YIVGTAMVNPDESESKQGRILIFQFQE-----------GKLYQVAEKEIKGAAYSLVEFN 885

Query: 338  GFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVK-NLILVGDYARSIALLRYQP 396
            G L+ ++   + +++        +        I+  +  K + ILVGD  RS+ LL+Y+ 
Sbjct: 886  GKLLASINSTVRLFEWTAEQELRLECSHFNNIISLYLKTKGDFILVGDLIRSMTLLQYKT 945

Query: 397  EYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHN 456
                   +ARD+ P    +           IID      FL       L +C+K  +   
Sbjct: 946  MEGCFEEMARDHNPNWMTAV---------EIIDDD---TFLGAENSFNLFVCQKDSAAAT 993

Query: 457  DILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKP 516
            D                          E R+      +     FHLG  VN F       
Sbjct: 994  D--------------------------EERQQMHAVGM-----FHLGDMVNVFRHGSLVM 1022

Query: 517  SSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAF 576
             ++ +     +  +  + ++ GA+G    +    Y  L  L+  +       G +    +
Sbjct: 1023 QNVGETSTPTTGCI-LFGTVSGAIGLVTQISANFYNFLHELECKLTEVIKSVGKIKHSFW 1081

Query: 577  RTYKGKGYYAGNPSRGIIDGSLVWKFLQLS------LGERLEICKKIGSKHNDILDEL 628
            R++  +      P  G IDG L+  FL LS      +   L+I    G K    +D+L
Sbjct: 1082 RSFTTE--IKTEPCDGFIDGDLIESFLDLSHEKMKEVAAGLQIDNGSGMKQEATVDDL 1137


>gi|328788389|ref|XP_396048.3| PREDICTED: DNA damage-binding protein 1-like isoform 1 [Apis
            mellifera]
          Length = 1141

 Score = 47.0 bits (110), Expect = 0.034,   Method: Compositional matrix adjust.
 Identities = 96/487 (19%), Positives = 175/487 (35%), Gaps = 92/487 (18%)

Query: 171  VRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVS 230
            +R VPL  +P  +AY   ++T+ ++T                  +  D +DS  +  +  
Sbjct: 713  IRTVPLGESPRRIAYQESSQTFGVIT------------------MRVDIQDSSGVSIVRH 754

Query: 231  QFHVSLFSPFSWEEIPQTNFPLHEWEHVLC----LKNVSMEYEGTLSGLRGYIALGTNYN 286
                   S  S   I   N P       +C    + N+ +  + T   L  ++ + T Y 
Sbjct: 755  SASTQAASTSSSSHIASYNKPTGHTASDICQEIEVHNLLIIDQHTFEVLHAHMLMPTEYA 814

Query: 287  YSEDVTCRGR---------ILLFDIIEVVPEPGQPL----TKNKIKMIYAKEQKGPVTAI 333
             S   T  G            L    E  P+ G+ L    +  K+  +  KE KG   ++
Sbjct: 815  LSLISTKLGEDPTSYYIVGTALVHPDETEPKMGRILLYHWSDGKLTQVAEKEIKGSCYSL 874

Query: 334  CHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVK-NLILVGDYARSIALL 392
                G L+ ++   + +++        +        IA  +  K + ILVGD  RS+ LL
Sbjct: 875  TEFNGKLLASINSTVRLFEWTAEKELRLECSHFNNIIALYLKSKGDFILVGDLMRSLTLL 934

Query: 393  RYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 452
            +Y+        +ARDY P                       W                  
Sbjct: 935  QYKTMEGCFEEIARDYNPN----------------------WM----------------- 955

Query: 453  SKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFF 510
                +ILD+ + +G      +    LF+ Q ++  ++   R  + +   FHLG  VN F 
Sbjct: 956  -TAIEILDDDTFLG-----AENCFNLFVCQKDSAATSEDERQQMQEVGQFHLGDMVNVFR 1009

Query: 511  KIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGG 570
                   ++ ++    ++    + ++ GA+G    +P   Y  L  L++ + +     G 
Sbjct: 1010 HGSLVMQNLGES-STPTQGCVLFGTVSGAIGLVTQIPFIFYEFLRNLEDRLTSVIKSVGK 1068

Query: 571  LNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHNDI 624
            +    +R++  +         G IDG L+  FL LS  +  E+   +      G K    
Sbjct: 1069 IEHNFWRSFNTE--LKIEQCEGFIDGDLIESFLDLSPDKMAEVASGLMIDDPSGMKKEAT 1126

Query: 625  LDELYDI 631
            +D+L  I
Sbjct: 1127 VDDLVKI 1133


>gi|432089478|gb|ELK23419.1| DNA damage-binding protein 1 [Myotis davidii]
          Length = 1047

 Score = 46.6 bits (109), Expect = 0.036,   Method: Compositional matrix adjust.
 Identities = 45/193 (23%), Positives = 76/193 (39%), Gaps = 42/193 (21%)

Query: 278  YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
            Y  +GT   Y E+   + GRI++F             +  K++ +  KE KG V ++   
Sbjct: 828  YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 876

Query: 337  AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
             G L+ ++    ++Y W  +    T     +  +    + +  + ILVGD  RS+ LL Y
Sbjct: 877  NGKLLASINSTVRLYEWTAEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 935

Query: 395  QPEYRTLSLVARDYKPT---------------QPNSKGYYA------------GNPSRGI 427
            +P       +ARD+ P                  N+   +               P+ G 
Sbjct: 936  KPMEGNFEEIARDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDRSFHTERKTEPATGF 995

Query: 428  IDGSLVWKFLQLS 440
            IDG L+  FL +S
Sbjct: 996  IDGDLIESFLDIS 1008


>gi|70929162|ref|XP_736684.1| hypothetical protein [Plasmodium chabaudi chabaudi]
 gi|56511427|emb|CAH86674.1| hypothetical protein PC302114.00.0 [Plasmodium chabaudi chabaudi]
          Length = 276

 Score = 46.6 bits (109), Expect = 0.046,   Method: Compositional matrix adjust.
 Identities = 23/85 (27%), Positives = 45/85 (52%), Gaps = 10/85 (11%)

Query: 340 LVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARS--IALLRYQPE 397
           ++  +  K+YI ++K+ D T  AFID   Y++ +  V+N I++ D  +   I +  Y+ +
Sbjct: 105 VLHCINSKMYIHEIKNKDFTKGAFIDNNFYVSDIKIVRNFIIISDLYKGIFINMYNYEEQ 164

Query: 398 YRTLSLVARDYKPTQPNSKGYYAGN 422
           Y + S+++         SK +Y  N
Sbjct: 165 YDSRSIISI--------SKNFYNNN 181


>gi|344231825|gb|EGV63707.1| hypothetical protein CANTEDRAFT_134986 [Candida tenuis ATCC 10573]
 gi|344231826|gb|EGV63708.1| hypothetical protein CANTEDRAFT_134986 [Candida tenuis ATCC 10573]
          Length = 991

 Score = 46.2 bits (108), Expect = 0.048,   Method: Compositional matrix adjust.
 Identities = 29/98 (29%), Positives = 50/98 (51%), Gaps = 9/98 (9%)

Query: 533 YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 592
           YA L G +G  LP+ E +++ L  L   +  +     G +   FR     GYY  N +  
Sbjct: 896 YAGLQGTIGILLPISESDFKFLSNLS--IELNKDLLLGRDHMKFR-----GYY--NSTHN 946

Query: 593 IIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYD 630
           +IDG ++ KFL+L+   R++I  K+     +I +++ D
Sbjct: 947 VIDGDIIEKFLELNASSRIKISNKLNKSVREIENKIND 984


>gi|237839083|ref|XP_002368839.1| hypothetical protein TGME49_067710 [Toxoplasma gondii ME49]
 gi|211966503|gb|EEB01699.1| hypothetical protein TGME49_067710 [Toxoplasma gondii ME49]
          Length = 2136

 Score = 46.2 bits (108), Expect = 0.052,   Method: Composition-based stats.
 Identities = 27/74 (36%), Positives = 43/74 (58%), Gaps = 3/74 (4%)

Query: 533  YASLDGALGFFLPLP-EKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSR 591
            +AS +GA+G  L +P E+ + RL +LQ+ +   T   G L+  AF + K     A  PS+
Sbjct: 2051 WASSEGAIGHLLQIPDEQTFARLAVLQDAVTKVTKSIGKLSAVAFHSVKVGT--ATVPSK 2108

Query: 592  GIIDGSLVWKFLQL 605
            G IDG ++ +FL+ 
Sbjct: 2109 GFIDGDILERFLEF 2122


>gi|221502136|gb|EEE27880.1| conserved hypothetical protein [Toxoplasma gondii VEG]
          Length = 2131

 Score = 46.2 bits (108), Expect = 0.052,   Method: Composition-based stats.
 Identities = 27/74 (36%), Positives = 43/74 (58%), Gaps = 3/74 (4%)

Query: 533  YASLDGALGFFLPLP-EKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSR 591
            +AS +GA+G  L +P E+ + RL +LQ+ +   T   G L+  AF + K     A  PS+
Sbjct: 2046 WASSEGAIGHLLQIPDEQTFARLAVLQDAVTKVTKSIGKLSAVAFHSVKVGT--ATVPSK 2103

Query: 592  GIIDGSLVWKFLQL 605
            G IDG ++ +FL+ 
Sbjct: 2104 GFIDGDILERFLEF 2117


>gi|322706594|gb|EFY98174.1| DNA damage-binding protein 1 [Metarhizium anisopliae ARSEF 23]
          Length = 1121

 Score = 46.2 bits (108), Expect = 0.054,   Method: Compositional matrix adjust.
 Identities = 105/558 (18%), Positives = 199/558 (35%), Gaps = 116/558 (20%)

Query: 97   GYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRI 156
            G   VF    H A L  ++ G +     T D   + +APF +   P   +  +  S +R+
Sbjct: 639  GTCNVFATTEH-ASLIYSAEGRIIYSATTAD-DATYVAPFDSEAFPNSIV-LSTDSHIRL 695

Query: 157  SVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTY---CIVTSTAEPSTDYYKFNGEDK 213
            S    H+  +    V+ + +K T   +AY    K +   CI            K   +++
Sbjct: 696  S----HIDKERLTHVKTLSVKETVRRVAYSPTLKVFGLGCI-----------KKELIQNE 740

Query: 214  ELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLS 273
            E++T     R +  ++ Q    L  PF    I  T+  L   E V     +  E   ++ 
Sbjct: 741  EVITSS--FRIVDEIIFQ---ELGKPF----IFNTSTSLEMVETV-----IRAELPDSMG 786

Query: 274  GLRGYIALGTNYNYSEDVT----CRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGP 329
             L     +GT++   +D       RGRIL+  + E            ++  I +   KG 
Sbjct: 787  NLAERFIIGTSFITDDDAIEENDTRGRILVLGVDE----------NRQVYQIVSHNLKGA 836

Query: 330  VTAICHVAGFLVTAVGQKIYIWQLKDN-----DLTGIAFIDTEVYIASMVSVKNLILVGD 384
               +  +   +V  + + + ++   +       L  +A      +  S+    N+I V D
Sbjct: 837  CRCLGTLGEHIVAGLSKTVVVYHYVEETTVFGSLQKLAAYRPASFPLSLDISGNIIGVVD 896

Query: 385  YARSIALLRYQPEYR----TLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLS 440
              +S+ L+ + P        L   AR Y+P    S  +  G                   
Sbjct: 897  LMQSLTLVEFIPSEDGSRAKLEETARHYQPGWATSVAHLDG------------------- 937

Query: 441  LGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDF 500
              ER                      ++ +D   N+++    PEA       +L   ++ 
Sbjct: 938  --ER----------------------WLEADAQGNIIVLQRNPEAPTEQDRSKLEVTSEM 973

Query: 501  HLGQHVNTFFKIRC--------KPSSISDAPGARSRFLTWYASL------DGALGFFLPL 546
            ++G+ +N   K+           P +   + G     +T +  L      +G L  F  +
Sbjct: 974  NIGEQINQIRKLHVASNENAVVSPKAFLGSVGLSETIITCWNQLLMLVQIEGTLYLFGEI 1033

Query: 547  PEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLS 606
                   LL  Q+ +  +    G ++   +R ++ K      P R  +DG +V +FL L 
Sbjct: 1034 APNYQDLLLTFQSRLQDYIYAPGNVSFNLWRAFRNKAREGDGPFR-FVDGEMVERFLDLD 1092

Query: 607  LGERLEICKKIGSKHNDI 624
              ++  +C+ +G    D+
Sbjct: 1093 EAKQELVCEGLGPSVEDM 1110


>gi|452820919|gb|EME27955.1| splicing factor 3B subunit 3 [Galdieria sulphuraria]
          Length = 1294

 Score = 46.2 bits (108), Expect = 0.056,   Method: Compositional matrix adjust.
 Identities = 47/182 (25%), Positives = 80/182 (43%), Gaps = 29/182 (15%)

Query: 469  ISDKDKNVVLFMYQPEA-----RESNGG-------HRLIKKTDFHLGQHVNTFFKIRCKP 516
            I DK  N+ +    PEA     ++  GG       H    +  +++G  +    K+    
Sbjct: 1100 IGDKMGNISILRLPPEAGTFIEQDPTGGLLSKEAPHHFQLEACYYVGSVIQCLSKVEW-- 1157

Query: 517  SSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLM-LQNVMVTHTSHTGGLNPRA 575
             +  D P      L +Y +LDGA+G  +PL       L   L+  +  + S   G +  A
Sbjct: 1158 -TTGDVP------LLFYGTLDGAIGVMIPLRSTLDMELFQALELQLREYRSPLCGRHHLA 1210

Query: 576  FRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALS 635
            +R+Y    ++   P R +IDG L  +F +LSL ++ +I K++     D+  +L D    S
Sbjct: 1211 YRSY----FF---PVRHVIDGDLCEEFYRLSLEQQEKIVKELDRSIVDVHRKLEDYRERS 1263

Query: 636  SH 637
             H
Sbjct: 1264 PH 1265


>gi|380025901|ref|XP_003696702.1| PREDICTED: LOW QUALITY PROTEIN: DNA damage-binding protein 1-like
            [Apis florea]
          Length = 1141

 Score = 45.8 bits (107), Expect = 0.062,   Method: Compositional matrix adjust.
 Identities = 96/487 (19%), Positives = 174/487 (35%), Gaps = 92/487 (18%)

Query: 171  VRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVS 230
            +R VPL  +P  +AY   ++T+ ++T                  +  D +DS  +  +  
Sbjct: 713  IRTVPLGESPRRIAYQESSQTFGVIT------------------MRVDIQDSSGVSIVRH 754

Query: 231  QFHVSLFSPFSWEEIPQTNFPLHEWEHVLC----LKNVSMEYEGTLSGLRGYIALGTNYN 286
                   S  S   I   N P       +C    + N+ +  + T   L  ++ + T Y 
Sbjct: 755  SASTQAASTSSSSHIASYNKPTGHTASDICQEIEVHNLLIIDQHTFEVLHAHMLMPTEYA 814

Query: 287  YSEDVTCRGR---------ILLFDIIEVVPEPGQPL----TKNKIKMIYAKEQKGPVTAI 333
             S   T  G            L    E  P+ G+ L    +  K+  +  KE KG   ++
Sbjct: 815  LSLISTKLGEDPTSYYIVGTALVHPDETEPKMGRILLYHWSDGKLTQVAEKEXKGSCYSL 874

Query: 334  CHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVK-NLILVGDYARSIALL 392
                G L+ ++   + +++        +        IA  +  K + ILVGD  RS+ LL
Sbjct: 875  TEFNGKLLASINSTVRLFEWTAEKELRLECSHFNNIIALYLKSKGDFILVGDLMRSLTLL 934

Query: 393  RYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 452
            +Y+        +ARDY P                       W                  
Sbjct: 935  QYKTMEGCFEEIARDYNPN----------------------WM----------------- 955

Query: 453  SKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFF 510
                +ILD+ + +G      +    LF+ Q ++  ++   R  + +   FHLG  VN F 
Sbjct: 956  -TAIEILDDDTFLG-----AENCFNLFVCQKDSAATSEDERQQMQEVGQFHLGDMVNVFR 1009

Query: 511  KIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGG 570
                   ++ ++    ++      ++ GA+G    +P   Y  L  L++ + +     G 
Sbjct: 1010 HGSLVMQNLGES-STPTQGCVLXGTVSGAIGLVTQIPFIFYEFLRNLEDRLTSVIKSVGK 1068

Query: 571  LNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHNDI 624
            +    +R++  +         G IDG L+  FL LS  +  E+   +      G K    
Sbjct: 1069 IEHNFWRSFNTE--LKIEQCEGFIDGDLIESFLDLSPDKMAEVASGLMIDDPSGMKKEAT 1126

Query: 625  LDELYDI 631
            +D+L  I
Sbjct: 1127 VDDLVKI 1133


>gi|300176205|emb|CBK23516.2| unnamed protein product [Blastocystis hominis]
          Length = 702

 Score = 45.8 bits (107), Expect = 0.064,   Method: Compositional matrix adjust.
 Identities = 81/406 (19%), Positives = 152/406 (37%), Gaps = 75/406 (18%)

Query: 249 NFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNY-SEDVTCRGRILLFDIIEVVP 307
             PL   E  LC+ + S+              +GT +    E+   +GR+L+   +E   
Sbjct: 322 ELPLKPSEIALCVASGSIFPLSNAPERNEVFVVGTAFVLPEENEPSQGRLLVLRAVE--- 378

Query: 308 EPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTE 367
                   ++++++      G   +IC   G +V  V  ++ ++ + D   + I+ + +E
Sbjct: 379 --------HRLELVAETMLSGGCLSICLFKGKVVCGVNSELQVFDV-DEKTSTISKLASE 429

Query: 368 VYIASMVSVK-----NLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGN 422
           V   S+ S+        I +GD   S+ +      Y+ +  V R  +  Q          
Sbjct: 430 VACISVTSLSPNEADETIALGDILYSVVV------YKLVLEVVRGRQLAQ---------- 473

Query: 423 PSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQ 482
                         L+    ER    ++      + L E  S   ++ D   N+++    
Sbjct: 474 --------------LECIASER----RRRDVTALERLPEAQSE-MVVGDAYGNLMVMQVV 514

Query: 483 PEAR--ESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISD------APGARSRFLTWYA 534
            EA    SN    ++ K  FHL   +N F  ++   S   D      A  +   F   +A
Sbjct: 515 EEADLDRSNPQKIVVTKESFHLDDQINRFVPVQLFRSGAEDKKKEKRAEESEIAFNLAFA 574

Query: 535 SLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGI- 593
           ++ G +G    L ++ +R L  ++  M    +  GGL+ + +R          N   GI 
Sbjct: 575 TVSGRIGMIGALNDREFRMLRAIETAMENVITPVGGLDHKQWR--------CSNTPFGIK 626

Query: 594 -----IDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEAL 634
                IDG LV  FL+L    + +I   + ++    L   + I+ L
Sbjct: 627 NLAYCIDGDLVEMFLELDDESQAKIADSVSTELRSALSPQFLIDYL 672


>gi|358400469|gb|EHK49795.1| hypothetical protein TRIATDRAFT_146031 [Trichoderma atroviride IMI
            206040]
          Length = 1161

 Score = 45.8 bits (107), Expect = 0.068,   Method: Compositional matrix adjust.
 Identities = 33/160 (20%), Positives = 72/160 (45%), Gaps = 10/160 (6%)

Query: 467  FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGAR 526
            ++ +D   NV++     EA       +L   ++ ++G+ +N   K++        APG  
Sbjct: 999  WLEADAQGNVIVLRQNLEAPTEQDQSQLQVISELNIGEQINRIRKLQV-------APGEN 1051

Query: 527  SRFL--TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGY 584
            +  +   +  S +G L  +  +  K    L+  Q+ +  + S  G L+   +R ++ +  
Sbjct: 1052 AIVVPKAFLGSTEGTLYLYGDIAPKYQDLLMTFQSRLQEYISTPGNLSFDLWRAFRNQSR 1111

Query: 585  YAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 624
                P R  +DG ++ +FL L  G++  +C+ +G    D+
Sbjct: 1112 EGEAPFR-FVDGEMIERFLDLDEGKQELVCEGLGPSVEDM 1150


>gi|401413996|ref|XP_003886445.1| conserved hypothetical protein [Neospora caninum Liverpool]
 gi|325120865|emb|CBZ56420.1| conserved hypothetical protein [Neospora caninum Liverpool]
          Length = 2869

 Score = 45.8 bits (107), Expect = 0.068,   Method: Composition-based stats.
 Identities = 41/200 (20%), Positives = 88/200 (44%), Gaps = 31/200 (15%)

Query: 232  FHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDV 291
            + V L+  F  ++ P  ++ L   E VL L  V       L G+  ++A G     SE +
Sbjct: 2418 YEVRLYHEFDLQK-PIGSYTLRTCEEVLSLSFV------VLDGVE-HLAAGVGVPLSETI 2469

Query: 292  TCRGRILLFDIIEVVPEPGQPL-------------TKNKIKMIYAKEQKGPVTAICHV-- 336
             C GR+ LF + E       P              T  ++++       GPVT +     
Sbjct: 2470 ECSGRLYLFKLPESAMRLASPPRSADTPGDQAEYGTPERLELFADIVLNGPVTVVGSFFS 2529

Query: 337  ----AGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALL 392
                  ++V +VG ++++ +++ +     AF D+ V + ++ +++N  L+ D  + + L+
Sbjct: 2530 SPAERSYVVHSVGPRLFVHEMESSKFLRGAFSDSSVCVTAVANLRNFFLLADALKGLNLV 2589

Query: 393  RY----QPEYRTLSLVARDY 408
             +    + + R ++ ++R +
Sbjct: 2590 AWEYHAEADSRKVTRISRTF 2609


>gi|156084934|ref|XP_001609950.1| splicing factor 3b, subunit 3, 130kD [Babesia bovis T2Bo]
 gi|154797202|gb|EDO06382.1| splicing factor 3b, subunit 3, 130kD, putative [Babesia bovis]
          Length = 1169

 Score = 45.8 bits (107), Expect = 0.078,   Method: Compositional matrix adjust.
 Identities = 37/160 (23%), Positives = 75/160 (46%), Gaps = 16/160 (10%)

Query: 473  DKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTW 532
            DK   +F+ +    ES    +L     FHLG            P+++  A  ++S  +  
Sbjct: 1022 DKFDSIFVTRVPQEESTRHIQLENVCQFHLGD----------LPTAMDKAALSQSTHVVL 1071

Query: 533  YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 592
            Y ++ G++G  +P   K+   L  LQ++ +   +    L  R    Y+   YY   P + 
Sbjct: 1072 YGTVMGSIGALVPFQSKD--ELDFLQHLEMLMATEAPPLCGREHSFYRS--YYV--PVQQ 1125

Query: 593  IIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIE 632
            ++DG L  +F  L+  ++ ++ +++ +  N++L +L DI+
Sbjct: 1126 VVDGDLCEQFRHLTEAQQRKVAQQLDTTVNNVLRKLDDIK 1165


>gi|340714589|ref|XP_003395809.1| PREDICTED: DNA damage-binding protein 1-like [Bombus terrestris]
          Length = 1141

 Score = 45.4 bits (106), Expect = 0.082,   Method: Compositional matrix adjust.
 Identities = 95/487 (19%), Positives = 173/487 (35%), Gaps = 92/487 (18%)

Query: 171  VRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVS 230
            +R VPL  +P  +AY   ++T+ ++T                  +  D +DS  +  +  
Sbjct: 713  IRTVPLGESPRRIAYQESSQTFGVIT------------------MRVDIQDSSGVSIVRH 754

Query: 231  QFHVSLFSPFSWEEIPQTNFPLHEWEHVLC----LKNVSMEYEGTLSGLRGYIALGTNYN 286
                   S  S   I   N P       +C    + N+ +  + T   L  ++ + T Y 
Sbjct: 755  SASTQAASTSSSSHIASYNKPTGHTASDICQEIEVHNLLIIDQHTFEVLHAHMLMPTEYA 814

Query: 287  YSEDVTCRGR---------ILLFDIIEVVPEPGQPL----TKNKIKMIYAKEQKGPVTAI 333
             S   T  G            L    E  P+ G+ L    +  K+  +  KE KG   ++
Sbjct: 815  LSLISTKLGEDPTSYYIVGTALVHPDETEPKMGRILLYHWSDGKLTQVAEKEIKGSCYSL 874

Query: 334  CHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVK-NLILVGDYARSIALL 392
                G L+ ++   + +++        +        IA  +  K + ILVGD  RS+ LL
Sbjct: 875  TEFNGKLLASINSTVRLFEWTAEKELRLECSHFNNIIALYLKTKGDFILVGDLMRSLTLL 934

Query: 393  RYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 452
            +Y+        +ARDY P                       W                  
Sbjct: 935  QYKTMEGCFEEIARDYNPN----------------------WM----------------- 955

Query: 453  SKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFF 510
                +ILD+ + +G      +    LF+ Q ++  ++   R  + +   FHLG  VN F 
Sbjct: 956  -TAIEILDDDTFLG-----AENCFNLFVCQKDSAATSEDERQQMQEVGQFHLGDMVNVFR 1009

Query: 511  KIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGG 570
                   ++ ++    ++    + ++ GA+G    +P   Y  L  L+  +       G 
Sbjct: 1010 HGSLVMQNLGES-STPTQGCVLFGTVSGAIGLVTQIPFTFYEFLRNLEERLTGVIKSVGK 1068

Query: 571  LNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHNDI 624
            +    +R++  +         G IDG L+  FL LS  +  ++   +      G K    
Sbjct: 1069 IEHNFWRSFNTE--LKIEQCEGFIDGDLIESFLDLSPNKMADVASGLMIDDPSGMKKEAT 1126

Query: 625  LDELYDI 631
            +D+L  I
Sbjct: 1127 VDDLVKI 1133


>gi|168031491|ref|XP_001768254.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162680432|gb|EDQ66868.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 1391

 Score = 45.4 bits (106), Expect = 0.082,   Method: Compositional matrix adjust.
 Identities = 37/110 (33%), Positives = 53/110 (48%), Gaps = 16/110 (14%)

Query: 110 WLFLTSRGELRAHPMTIDGPVST-LAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAP 168
           WL  T+R   R    +I  P S+  AP ++V+CP G L F A   L +      + +   
Sbjct: 864 WLLQTARHSQRIAHTSISFPSSSHAAPVNSVDCPNGIL-FVADCSLHL----VEMEHLKR 918

Query: 169 WPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTD 218
             V+K+PL  TP  + YH E+KT  ++       TDY    G D  LV+D
Sbjct: 919 LNVQKLPLGRTPRRVLYHTESKTLIVM------RTDY----GPDGGLVSD 958


>gi|156095699|ref|XP_001613884.1| Splicing factor 3B subunit 3 [Plasmodium vivax Sal-1]
 gi|148802758|gb|EDL44157.1| Splicing factor 3B subunit 3, putative [Plasmodium vivax]
          Length = 1230

 Score = 45.4 bits (106), Expect = 0.085,   Method: Compositional matrix adjust.
 Identities = 68/318 (21%), Positives = 111/318 (34%), Gaps = 79/318 (24%)

Query: 334  CHVAGFLVTAVGQKIYIWQL-KDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALL 392
            C   G L+ ++G K+ I+ L K   L    + D    I S+    + I   D   S+ + 
Sbjct: 967  CPFNGRLLASIGNKLRIYALGKKKLLKKCEYKDIPEAIISIKVSGDRIFASDIRESVLIF 1026

Query: 393  RYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 452
             Y     TL L++ D  P                                 R   C +I 
Sbjct: 1027 FYDANMNTLRLISDDIIP---------------------------------RWITCSEIL 1053

Query: 453  SKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARE------------------SNGGHRL 494
              H            M +DK  +V +     EA++                  SN   RL
Sbjct: 1054 DHHT----------IMAADKFDSVFVLRVPEEAKQEEYGISNKCWYGGEIMAGSNKNRRL 1103

Query: 495  IKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRL 554
                 FH+G+ V +  K++  P+S              Y+++ G +G F+P   K    L
Sbjct: 1104 EHIMSFHVGEIVTSLQKVKLSPTSSE---------CIIYSTIMGTIGAFIPYDNKEELEL 1154

Query: 555  LM-LQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEI 613
               L+ ++ T      G     FR+Y        +P + +IDG L  +F  L    + ++
Sbjct: 1155 TQHLEIILRTENPPLCGREHIFFRSYY-------HPVQHVIDGDLCEQFSSLPYDVQRKV 1207

Query: 614  CKKIGSKHNDILDELYDI 631
               +    +DIL +L DI
Sbjct: 1208 AADLERTPDDILRKLEDI 1225


>gi|298715583|emb|CBJ28136.1| cleavage and polyadenylation specificity factor CG10110-PA
            [Ectocarpus siliculosus]
          Length = 1906

 Score = 45.4 bits (106), Expect = 0.089,   Method: Compositional matrix adjust.
 Identities = 28/88 (31%), Positives = 47/88 (53%), Gaps = 7/88 (7%)

Query: 249  NFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYN--YSEDVTCRGRILLFDI--IE 304
            + P+   E+ +C+  V +E  G     R Y+A+GT  N    ED   RGR++L ++    
Sbjct: 1795 SHPMDSDENGVCMTLVRLEQGGAP---RMYVAVGTGMNEPQGEDKAARGRLILLEVDYAY 1851

Query: 305  VVPEPGQPLTKNKIKMIYAKEQKGPVTA 332
            +  E G+     K++ ++AKEQ GPV+ 
Sbjct: 1852 LAREDGKHEHAVKLRQVFAKEQLGPVSG 1879


>gi|350410909|ref|XP_003489174.1| PREDICTED: DNA damage-binding protein 1-like [Bombus impatiens]
          Length = 1141

 Score = 45.4 bits (106), Expect = 0.092,   Method: Compositional matrix adjust.
 Identities = 95/487 (19%), Positives = 173/487 (35%), Gaps = 92/487 (18%)

Query: 171  VRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVS 230
            +R VPL  +P  +AY   ++T+ ++T                  +  D +DS  +  +  
Sbjct: 713  IRTVPLGESPRRIAYQESSQTFGVIT------------------MRVDIQDSSGVSIVRH 754

Query: 231  QFHVSLFSPFSWEEIPQTNFPLHEWEHVLC----LKNVSMEYEGTLSGLRGYIALGTNYN 286
                   S  S   I   N P       +C    + N+ +  + T   L  ++ + T Y 
Sbjct: 755  SASTQAASTSSSSHIASYNKPTGHTASDICQEIEVHNLLIIDQHTFEVLHAHMLMPTEYA 814

Query: 287  YSEDVTCRGR---------ILLFDIIEVVPEPGQPL----TKNKIKMIYAKEQKGPVTAI 333
             S   T  G            L    E  P+ G+ L    +  K+  +  KE KG   ++
Sbjct: 815  LSLISTKLGEDPTSYYIVGTALVHPDETEPKMGRILLYHWSDGKLTQVAEKEIKGSCYSL 874

Query: 334  CHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVK-NLILVGDYARSIALL 392
                G L+ ++   + +++        +        IA  +  K + ILVGD  RS+ LL
Sbjct: 875  TEFNGKLLASINSTVRLFEWTAEKELRLECSHFNNIIALYLKTKGDFILVGDLMRSLTLL 934

Query: 393  RYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 452
            +Y+        +ARDY P                       W                  
Sbjct: 935  QYKTMEGCFEEIARDYNPN----------------------WM----------------- 955

Query: 453  SKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHR--LIKKTDFHLGQHVNTFF 510
                +ILD+ + +G      +    LF+ Q ++  ++   R  + +   FHLG  VN F 
Sbjct: 956  -TAIEILDDDTFLG-----AENCFNLFVCQKDSAATSEDERQQMQEVGQFHLGDMVNVFR 1009

Query: 511  KIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGG 570
                   ++ ++    ++    + ++ GA+G    +P   Y  L  L+  +       G 
Sbjct: 1010 HGSLVMQNLGES-STPTQGCVLFGTVSGAIGLVTQIPFTFYEFLRNLEERLTGVIKSVGK 1068

Query: 571  LNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI------GSKHNDI 624
            +    +R++  +         G IDG L+  FL LS  +  ++   +      G K    
Sbjct: 1069 IEHNFWRSFNTE--LKIEQCEGFIDGDLIESFLDLSPNKMADVASGLMIDDPSGMKKEAT 1126

Query: 625  LDELYDI 631
            +D+L  I
Sbjct: 1127 VDDLVKI 1133


>gi|389586447|dbj|GAB69176.1| splicing factor 3B subunit 3 [Plasmodium cynomolgi strain B]
          Length = 1286

 Score = 45.4 bits (106), Expect = 0.095,   Method: Compositional matrix adjust.
 Identities = 68/318 (21%), Positives = 111/318 (34%), Gaps = 79/318 (24%)

Query: 334  CHVAGFLVTAVGQKIYIWQL-KDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALL 392
            C   G L+ ++G K+ I+ L K   L    + D    I S+    + I   D   S+ + 
Sbjct: 1023 CPFNGRLLASIGNKLRIYALGKKKLLKKCEYKDIPEAIISIKVSGDRIFASDIRESVLIF 1082

Query: 393  RYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 452
             Y     TL L++ D  P                                 R   C +I 
Sbjct: 1083 FYDSNMNTLRLISDDIIP---------------------------------RWITCSEIL 1109

Query: 453  SKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARE------------------SNGGHRL 494
              H            M +DK  +V +     EA++                  SN   RL
Sbjct: 1110 DHHT----------IMAADKFDSVFVLRVPEEAKQEEYGISNKCWYGGEIMAGSNKNRRL 1159

Query: 495  IKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRL 554
                 FH+G+ V +  K++  P+S              Y+++ G +G F+P   K    L
Sbjct: 1160 EHIMSFHVGEIVTSLQKVKLSPTSSE---------CIIYSTIMGTIGAFIPYDNKEELEL 1210

Query: 555  LM-LQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEI 613
               L+ ++ T      G     FR+Y        +P + +IDG L  +F  L    + ++
Sbjct: 1211 TQHLEIILRTENPPLCGREHIFFRSYY-------HPVQHVIDGDLCEQFSSLPYDVQRKV 1263

Query: 614  CKKIGSKHNDILDELYDI 631
               +    +DIL +L DI
Sbjct: 1264 AADLERTPDDILRKLEDI 1281


>gi|410045300|ref|XP_508472.4| PREDICTED: DNA damage-binding protein 1 [Pan troglodytes]
          Length = 1107

 Score = 45.4 bits (106), Expect = 0.098,   Method: Compositional matrix adjust.
 Identities = 35/136 (25%), Positives = 61/136 (44%), Gaps = 15/136 (11%)

Query: 278 YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
           Y  +GT   Y E+   + GRI++F             +  K++ +  KE KG V ++   
Sbjct: 835 YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 883

Query: 337 AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
            G L+ ++    ++Y W  +    T     +  +    + +  + ILVGD  RS+ LL Y
Sbjct: 884 NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 942

Query: 395 QPEYRTLSLVARDYKP 410
           +P       +ARD+ P
Sbjct: 943 KPMEGNFEEIARDFNP 958


>gi|358380497|gb|EHK18175.1| hypothetical protein TRIVIDRAFT_80808 [Trichoderma virens Gv29-8]
          Length = 1161

 Score = 45.4 bits (106), Expect = 0.098,   Method: Compositional matrix adjust.
 Identities = 32/158 (20%), Positives = 71/158 (44%), Gaps = 6/158 (3%)

Query: 467  FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGAR 526
            ++ +D   N+++     EA       +L   ++ ++G+ +N   KI+  P+   +A    
Sbjct: 999  WLEADAQGNIIVLRQNQEAPTEQDRSQLEITSELNIGEQINRIRKIQVAPAE--NAIVIP 1056

Query: 527  SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYA 586
              FL    S++G L  +  +  K    L+  Q+ +  +    G L+   +R ++ +    
Sbjct: 1057 KAFL---GSIEGTLYLYGDIAPKYQDLLMTFQSRLQEYIQTPGNLSFDTWRAFRNQARDG 1113

Query: 587  GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 624
              P R  +DG ++ +FL L   ++  +C+ +G    D+
Sbjct: 1114 EAPFR-FVDGEMIERFLDLDEKQQELVCEGLGPSVEDM 1150


>gi|443894313|dbj|GAC71661.1| hypothetical protein PANT_5d00006 [Pseudozyma antarctica T-34]
          Length = 1625

 Score = 45.1 bits (105), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 37/147 (25%), Positives = 69/147 (46%), Gaps = 20/147 (13%)

Query: 279  IALGTNYNYSEDV-TCRGRILLFDIIEVVPEPGQPLTKN---KIKMIYAKEQKGPVTAIC 334
            + +GT Y  S+   T  GR++ FD+      PG   TK    +++ ++  ++ G V ++ 
Sbjct: 1254 LVIGTGYIDSQSQETVSGRLVGFDV-----SPGSSRTKEERGRLRRLFEHDENGNVYSVQ 1308

Query: 335  HVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEV---------YIASMVSV--KNLILVG 383
             +   L  AV  ++ I+ + D      +    ++         +IA  +SV   + I+VG
Sbjct: 1309 SIGNRLAAAVNSEVKIYSVIDPRRGDASSPKIKIKQRGSWASSFIACSLSVVEPDRIVVG 1368

Query: 384  DYARSIALLRYQPEYRTLSLVARDYKP 410
            D  RS+ +L   P+   +S +ARD  P
Sbjct: 1369 DALRSMNVLHVHPQTARVSEIARDCDP 1395


>gi|384080885|dbj|BAM11105.1| damage-specific DNA binding protein 1, 127kDa, partial
           [Siebenrockiella crassicollis]
          Length = 364

 Score = 45.1 bits (105), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 38/137 (27%), Positives = 62/137 (45%), Gaps = 17/137 (12%)

Query: 278 YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
           Y  +GT   Y E+   + GRI++F             +  K++ +  KE KG V ++   
Sbjct: 157 YFIVGTAMVYPEEAEPKQGRIVVFH-----------YSDGKLQSLAEKEVKGAVYSMVEF 205

Query: 337 AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVK-NLILVGDYARSIALLR 393
            G L+ ++    ++Y W  +    T     +    +A  V  K + ILVGD  RS+ LL 
Sbjct: 206 NGKLLASINSTVRLYEWTAEKELRTECNHYNN--IMALYVKTKGDFILVGDLMRSVLLLA 263

Query: 394 YQPEYRTLSLVARDYKP 410
           Y+P       +ARD+ P
Sbjct: 264 YKPMEGNFEEIARDFNP 280


>gi|380488197|emb|CCF37544.1| hypothetical protein CH063_08850 [Colletotrichum higginsianum]
          Length = 271

 Score = 45.1 bits (105), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 38/157 (24%), Positives = 71/157 (45%), Gaps = 14/157 (8%)

Query: 470 SDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRF 529
           +D   N+++    P+A   +   ++   ++FHLG+ VN    +   P+   + P     F
Sbjct: 104 ADAQGNLMVLRRNPDAPTEHDQKQMEVTSEFHLGEQVNKIRPLDITPN--ENDPIVPKAF 161

Query: 530 LTWYASLDGALGFFLPLPEKNYRRLLMLQNVM--VTHT------SHTGGLNPRAFRTYKG 581
           L   A+++G+L  F  +  +    LL  Q  +  V  T        T GL+  A+R ++ 
Sbjct: 162 L---ATVEGSLYVFADIKSEYQSLLLQFQERLADVVKTLGQAGGDSTSGLSFMAWRGFRN 218

Query: 582 KGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIG 618
               A  P R  +DG L+ +FL L   ++  + + +G
Sbjct: 219 AKRAADGPFR-FVDGELIERFLDLDEAKQEAVVQGLG 254


>gi|148709424|gb|EDL41370.1| damage specific DNA binding protein 1 [Mus musculus]
          Length = 968

 Score = 45.1 bits (105), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 35/136 (25%), Positives = 61/136 (44%), Gaps = 15/136 (11%)

Query: 278 YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
           Y  +GT   Y E+   + GRI++F             +  K++ +  KE KG V ++   
Sbjct: 828 YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 876

Query: 337 AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
            G L+ ++    ++Y W  +    T     +  +    + +  + ILVGD  RS+ LL Y
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 935

Query: 395 QPEYRTLSLVARDYKP 410
           +P       +ARD+ P
Sbjct: 936 KPMEGNFEEIARDFNP 951


>gi|16197726|emb|CAC94909.1| damaged-DNA recognition protein 1 [Mus musculus]
          Length = 994

 Score = 45.1 bits (105), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 35/136 (25%), Positives = 61/136 (44%), Gaps = 15/136 (11%)

Query: 278 YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
           Y  +GT   Y E+   + GRI++F             +  K++ +  KE KG V ++   
Sbjct: 828 YFIVGTAMVYPEEAEPKQGRIVVFQ-----------YSDGKLQTVAEKEVKGAVYSMVEF 876

Query: 337 AGFLVTAVGQ--KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRY 394
            G L+ ++    ++Y W  +    T     +  +    + +  + ILVGD  RS+ LL Y
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECNHYNN-IMALYLKTKGDFILVGDLMRSVLLLAY 935

Query: 395 QPEYRTLSLVARDYKP 410
           +P       +ARD+ P
Sbjct: 936 KPMEGNFEEIARDFNP 951


>gi|156389050|ref|XP_001634805.1| predicted protein [Nematostella vectensis]
 gi|156221892|gb|EDO42742.1| predicted protein [Nematostella vectensis]
          Length = 1157

 Score = 44.7 bits (104), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 67/339 (19%), Positives = 122/339 (35%), Gaps = 72/339 (21%)

Query: 278  YIALGTNYNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
            Y  +GT Y + E+   + GR+LLF            L++ K+  +  KE KG V ++   
Sbjct: 841  YYCVGTAYVFPEEPEPKAGRLLLFH-----------LSEGKLVQVAEKEVKGAVYSLVEF 889

Query: 337  AGFLVTAVGQKIYIWQ-LKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQ 395
             G ++  +   + I++   D +          +    + +  + ILVGD  RS+ LL Y 
Sbjct: 890  NGKVLAGINSTVSIFEWTADKEFRYECSYYDNILALYLKTKGDFILVGDLMRSMTLLVYL 949

Query: 396  PEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKH 455
            P   +   +A D+ P                       W                     
Sbjct: 950  PLEGSFQEIAHDFSPK----------------------WM------------------TA 969

Query: 456  NDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCK 515
             +ILD+ + +G   ++   N+        A      + L     +HLG+ VN F      
Sbjct: 970  IEILDDDTFLG---AENSYNLFTCTKDSGATTDEERYHLQDAGQYHLGEFVNVFR----H 1022

Query: 516  PSSISDAPGARS---RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 572
             S + + PG  S   +    + +++G +G    + +  +  L+ +Q  +       G ++
Sbjct: 1023 GSLVMEHPGDASTPFQGCVLFGTVNGRIGIVAQIAQDLFNFLIQVQKKLNKVIKSVGKID 1082

Query: 573  ------PRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQL 605
                  P        +      P+ G IDG L+  FL L
Sbjct: 1083 HSLYPFPHCSNLSHSRKM---EPAHGFIDGDLIESFLDL 1118


>gi|156095578|ref|XP_001613824.1| hypothetical protein [Plasmodium vivax Sal-1]
 gi|148802698|gb|EDL44097.1| hypothetical protein, conserved [Plasmodium vivax]
          Length = 2213

 Score = 44.7 bits (104), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 23/102 (22%), Positives = 54/102 (52%), Gaps = 12/102 (11%)

Query: 323  AKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILV 382
            A ++  PV    H    ++  +  K++I ++++ND T  AF+D+ ++I+ +  +KN ++V
Sbjct: 1937 ANQKSSPVDQNVHCN--ILHCINSKLFIHEVRENDFTKGAFLDSNLFISDIKVMKNFLIV 1994

Query: 383  GDYARS--IALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGN 422
             D  +   I +  Y+ ++ + S++        P +K ++  N
Sbjct: 1995 ADLYKGIFINMFNYEQQHDSRSII--------PIAKPFFCAN 2028


>gi|402222132|gb|EJU02199.1| hypothetical protein DACRYDRAFT_21931 [Dacryopinax sp. DJM-731 SS1]
          Length = 1209

 Score = 44.7 bits (104), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 63/313 (20%), Positives = 127/313 (40%), Gaps = 51/313 (16%)

Query: 332  AICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVK---NLILVGDYARS 388
            A+    G LV  +G+ + I+ +    L  +   + + +  ++V++    + I+VGD A S
Sbjct: 942  ALLSFQGRLVAGIGKALRIFDMGKKRL--LRKCENKSFATAIVTLSTQGSRIIVGDMAES 999

Query: 389  IALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLE-- 446
            I    Y+P    L + A D +P                 I  S +  +  +  G++    
Sbjct: 1000 IYFATYKPPENRLLIFADDSQPRW---------------ITASAMVDYDTVCAGDKFGNV 1044

Query: 447  ICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHV 506
               ++  K  + +DE  +   ++ +K     LFM  P        H+      +++G  +
Sbjct: 1045 FVNRLPPKVGEQVDEDPTGAGVLHEKG----LFMGAP--------HKTNMLAHYYVGDII 1092

Query: 507  NTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLP-LPEKNYRRLLMLQNVMVTHT 565
             +  K+         A     R +  Y  L G +G  +P + +++   +  L+  M T  
Sbjct: 1093 TSMHKV---------ALVTGGRDIVLYTGLHGTIGVLIPFISKEDVDFIRTLEQHMRTEA 1143

Query: 566  SHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDIL 625
                G   R   TY+G  YY   P +G++DG L   F  L   ++  I  ++   ++++L
Sbjct: 1144 PSLVG---RDHLTYRG--YYV--PVKGVVDGDLCELFSLLPTQKQQSIAGELDRTYSEVL 1196

Query: 626  DELYDIEALSSHF 638
             +L  +   ++ F
Sbjct: 1197 KKLEQLRVTTTGF 1209


>gi|260947152|ref|XP_002617873.1| hypothetical protein CLUG_01332 [Clavispora lusitaniae ATCC 42720]
 gi|238847745|gb|EEQ37209.1| hypothetical protein CLUG_01332 [Clavispora lusitaniae ATCC 42720]
          Length = 1242

 Score = 44.7 bits (104), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 42/172 (24%), Positives = 75/172 (43%), Gaps = 21/172 (12%)

Query: 469  ISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSR 528
            ++++ +N VL  ++ E   ++   RL K  DF+    V +  K        S   G    
Sbjct: 1077 VAEQLENNVLMKFEEETLGASSS-RLDKLCDFYTQDIVTSLHKG-------SFVVGGSES 1128

Query: 529  FLTWYASLDGALGFFLPLPEKNYRRLLM-LQNVMVTHTSHT--------GGLNPRAFRTY 579
             +  Y  L G +G  LPL       LLM L+N +  + + +         G N       
Sbjct: 1129 II--YTGLQGTVGILLPLATTQEVDLLMKLENSLRDYFNDSFDDFDNTKQGFNLVGREHL 1186

Query: 580  KGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI 631
            K +GYY  NP   +IDG  + +F +L+   ++++  ++     DI  ++YD+
Sbjct: 1187 KFRGYY--NPVENVIDGDFIERFFELNPSAQVKLAGRLDKSPRDIERKIYDL 1236


>gi|159470709|ref|XP_001693499.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158283002|gb|EDP08753.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 279

 Score = 44.7 bits (104), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 63/288 (21%), Positives = 104/288 (36%), Gaps = 56/288 (19%)

Query: 362 AFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAG 421
           AF D       +V+VK+ +L  D  + +  LRY    R L  +++D+      + G    
Sbjct: 33  AFFDLPSLATGLVTVKDYLLASDVHQGLFFLRYSDASRVLEFMSKDFDGRDVLTCGVVIA 92

Query: 422 NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMY 481
            P            FL       L++ +  G +  D   EF +                 
Sbjct: 93  EPK---------LHFLAADAAGTLQMMEFYGKR--DTNPEFWA----------------- 124

Query: 482 QPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALG 541
                    G RL      H+ + V     ++            R+R      S +G L 
Sbjct: 125 ---------GQRLAPMGLLHVARRVGVAASVQLASRD------GRNRHALLCGSAEGGLS 169

Query: 542 FFLPLPEKNYRRLLMLQNVMVTHT-SHTGGLNPRAFRTY---------KGKGYYAGNPSR 591
           F  P+P+      L      ++ T  H  GLNPR+FR            G+ + A  P R
Sbjct: 170 FVAPVPDPQAAARLAALQAHMSATLPHVAGLNPRSFRHRFIRIPKALGGGEHHRAPLPPR 229

Query: 592 G---IIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSS 636
               ++DG L+  F  LS  ++ E  + +GS    +L++L  I A ++
Sbjct: 230 NNSGLLDGQLLLGFPHLSRQQQAEAAEAVGSSPQQLLEDLRAIAAAAT 277


>gi|388853409|emb|CCF53029.1| related to UV-damaged DNA-binding protein [Ustilago hordei]
          Length = 1508

 Score = 44.3 bits (103), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 40/145 (27%), Positives = 68/145 (46%), Gaps = 15/145 (10%)

Query: 279  IALGTNY-NYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVA 337
            + +GT Y +  E     GR+L FD+        +   + +++ ++ KEQ G V ++  + 
Sbjct: 1135 LVVGTGYISDGEHEVISGRLLGFDVSAGSIRGKE--ERGRLRKLFVKEQAGNVYSVQSIN 1192

Query: 338  GFLVTAVGQKIYIWQLKD---NDLTGIAFIDTE-------VYIASMVSV--KNLILVGDY 385
              L TAV  ++ I+ + D   +D      I+          +IA  +SV   + I+VGD 
Sbjct: 1193 NRLATAVNSEVKIYSVVDPRASDEVSAPRINVVQRGSWACSFIACNLSVVEPDQIVVGDA 1252

Query: 386  ARSIALLRYQPEYRTLSLVARDYKP 410
             RSI +L   P    L+ +ARD  P
Sbjct: 1253 LRSINVLHVHPYTARLTEIARDCDP 1277


>gi|221061705|ref|XP_002262422.1| splicing factor 3b, subunit 3, 130kd [Plasmodium knowlesi strain H]
 gi|193811572|emb|CAQ42300.1| splicing factor 3b, subunit 3, 130kd, putative [Plasmodium knowlesi
            strain H]
          Length = 1276

 Score = 43.9 bits (102), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 37/145 (25%), Positives = 63/145 (43%), Gaps = 17/145 (11%)

Query: 488  SNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLP 547
            SN   RL    +FH+G+ V +  K++  P+S              Y+++ G +G F+P  
Sbjct: 1143 SNKNRRLEHIMNFHVGEIVTSLQKVKLSPTSSE---------CIIYSTIMGTIGAFIPYD 1193

Query: 548  EKNYRRLLM-LQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLS 606
             K    L   L+ ++ T      G     FR+Y        +P + +IDG L  +F  L 
Sbjct: 1194 NKEELELTQHLEIILRTENPPLCGREHIFFRSYY-------HPVQHVIDGDLCEQFSSLP 1246

Query: 607  LGERLEICKKIGSKHNDILDELYDI 631
               + ++   +    +DIL +L DI
Sbjct: 1247 YDIQRKVAADLERTPDDILRKLEDI 1271


>gi|449704103|gb|EMD44407.1| DNA-repair binding protein, putative [Entamoeba histolytica KU27]
          Length = 1088

 Score = 43.9 bits (102), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 61/298 (20%), Positives = 114/298 (38%), Gaps = 15/298 (5%)

Query: 347  KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVAR 406
            +I I Q+KD  L  I   D    + SM ++    L     + + +  YQ          +
Sbjct: 794  RILIVQIKDGRLEIIFEKDVNGAVYSMKTLLKKYLAMSIEKKLVVFEYQRVITNGEFEVK 853

Query: 407  DYKPTQPNSK--GYYAGNPSRGIIDGSLVWKFLQLSLGER-------LEICKKIGSKHND 457
              +    N K  G Y       I+ G L+      S            E+ +   + +  
Sbjct: 854  LQEKGSCNVKLIGLYVKTLGNKILVGDLMKSISVYSFDNNGNNKNCLTEVSRDFYASYTT 913

Query: 458  ILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPS 517
             ++      ++ SD + N+++F       ES    RL      H+G+ +N   K    P+
Sbjct: 914  AIEFVDEDCYLSSDSNSNILIFNTNSTGNESER-FRLNNCAHIHVGECINVMCKGSIAPT 972

Query: 518  SISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQN-VMVTHTSHTGGLNPRAF 576
              +     +   L  +  + G +G    +P + Y  L+ +QN +++          P  +
Sbjct: 973  HSTYETVQKKCIL--FGGVTGYIGGICEIPNEIYDVLIKVQNQILLQMKGIVECTTPDNW 1030

Query: 577  RTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEAL 634
            +  K    +   PS  IIDGS+V  +L++S  ++ EI    G     I D + ++ +L
Sbjct: 1031 K--KVIDDWKRMPSSNIIDGSIVESYLEMSKEKQCEIAHLSGVNEEQISDIIENMISL 1086


>gi|384253371|gb|EIE26846.1| hypothetical protein COCSUDRAFT_52476 [Coccomyxa subellipsoidea
            C-169]
          Length = 1205

 Score = 43.9 bits (102), Expect = 0.29,   Method: Compositional matrix adjust.
 Identities = 40/145 (27%), Positives = 63/145 (43%), Gaps = 18/145 (12%)

Query: 489  NGG-HRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLP 547
            NG  H+L    +FH+G  V +  +   +P       G R   L  YA++ GA+G  LP P
Sbjct: 1072 NGAPHKLEDVVNFHVGDLVTSLQRAVLQP-------GGREVLL--YATVMGAIGAMLPFP 1122

Query: 548  EKNYRRLLM-LQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLS 606
             +        L+  +       GG   R   +Y+G  +    P + +IDG L   F QL 
Sbjct: 1123 SREDVDFFSHLEMHLRQEHPPMGG---RDHMSYRGSYF----PVKDVIDGDLCEHFSQLP 1175

Query: 607  LGERLEICKKIGSKHNDILDELYDI 631
              ++  I  ++     +IL +L DI
Sbjct: 1176 AAKQKSIADELERTPGEILKKLEDI 1200


>gi|241560031|ref|XP_002400960.1| spliceosomal protein sap, putative [Ixodes scapularis]
 gi|215501812|gb|EEC11306.1| spliceosomal protein sap, putative [Ixodes scapularis]
          Length = 1019

 Score = 43.9 bits (102), Expect = 0.29,   Method: Compositional matrix adjust.
 Identities = 71/350 (20%), Positives = 143/350 (40%), Gaps = 59/350 (16%)

Query: 292  TCRGRILLFDIIEVVPEPGQPLT-KNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYI 350
             CRG  LL     + P P +P+    ++++++A   +   TA+C   G L+  VG+ + +
Sbjct: 713  VCRGGGLLL-TYRLAPNPEEPMAGPTQLELVHATPVEEAPTALCPFQGRLLAGVGKCLRL 771

Query: 351  WQLKDNDLTGIAFIDTEVYIASMVSVK---NLILVGDYARSIALLRYQPEYRTLSLVARD 407
            + L    L  +   + +    ++VS++   N ++V D   S   LRY+ +   L + A D
Sbjct: 772  YDLGRKKL--LRKCENKYIPNAIVSIQAMGNRVVVSDVQESFFFLRYKRQENQLVIFADD 829

Query: 408  YKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGER---LEICKKIGSKHNDILDEFSS 464
              P                 I  S +  +  ++  ++   + I +   S  +D+      
Sbjct: 830  SVPRW---------------ITASCMLDYETVAGADKFGNVSIIRLPSSISDDV------ 868

Query: 465  MGFMISDKDKNVVLFMYQPEARESNGG--HRLIKKTDFHLGQHVNTFFKIRCKPSSISDA 522
                  D+D   +  ++    R   GG   +    ++FH+G+ V +  K    P      
Sbjct: 869  ------DEDPTGIKSLWD---RGWLGGSSQKADVISNFHIGETVLSLQKATLIPG----- 914

Query: 523  PGARSRFLTWYASLDGALGFFLPL-PEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKG 581
             G+ S     Y +L G +G  +P    +++     L+  M        G +  +FR+   
Sbjct: 915  -GSESLV---YVTLSGTVGVLVPFTAHEDHDFFQHLEMHMRYENPPLCGRDHLSFRS--- 967

Query: 582  KGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI 631
              Y+   P + +IDG L  +F  L   ++  I +++    +++  +L DI
Sbjct: 968  -SYF---PVKNVIDGDLCEQFNSLDPSKQKSIAEELDRNPSEVSKKLEDI 1013


>gi|70945139|ref|XP_742421.1| hypothetical protein [Plasmodium chabaudi chabaudi]
 gi|56521397|emb|CAH76894.1| conserved hypothetical protein [Plasmodium chabaudi chabaudi]
          Length = 435

 Score = 43.5 bits (101), Expect = 0.32,   Method: Compositional matrix adjust.
 Identities = 98/501 (19%), Positives = 173/501 (34%), Gaps = 111/501 (22%)

Query: 171 VRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVS 230
           ++K+P   T   +AYH E+    ++TS   P  + +K N   K+++       F  P  +
Sbjct: 9   IQKIPFYRTVEKIAYHKESGL--LITSC--PPEEKHKTNKNLKQIIC------FFNPHQN 58

Query: 231 QFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGT-NYNYSE 289
            F  S   P  +                +C+  ++ +     S +   I +GT N N   
Sbjct: 59  SFKYSYIIPSKYNV------------SSICVYQINKDIYPNKSSINTLICVGTANINDRV 106

Query: 290 DVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGF---LVTAVGQ 346
                G I +F          +     +IK IY       V  I H+  F   L+T +  
Sbjct: 107 SEPSSGHIYIF-------FAKKKANLFEIKHIYT--HNVNVGGITHLKQFYDKLITTINN 157

Query: 347 KIYIWQL--------------------KDNDLTGIAFIDTEVYIASMVSVKNLILVGDYA 386
            + I  +                     D  +  +A      +I S+  ++N I+VGD  
Sbjct: 158 TVVILDISEFLINLDKYVDNTNKPKLENDGTIVDVASFTPSSWIMSLDVIENYIVVGDIM 217

Query: 387 RSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLE 446
            S+ +L Y      L+ V RDY                      S VW     +L     
Sbjct: 218 TSVTILSYDFNNSILTEVCRDY----------------------SNVWCTFVCAL----- 250

Query: 447 ICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHV 506
                           S   F++SD + N ++F             +L +   F+ G  V
Sbjct: 251 ----------------SKSHFLVSDMESNFLVFQKSSIKYNDEDSFKLSRVALFNHGHVV 294

Query: 507 NTFFKIRCKPSSISDAPGA---RSRFLTWYASLDGALGFFLPLPE-KNYRRLLMLQNVMV 562
           N    +        + P     R +     AS +G++   +P     N+++ L ++  + 
Sbjct: 295 NKMLPVSLSSLIEEEEPQNEILRKKESILCASSEGSISSIIPFSNLANFKKALCIELALN 354

Query: 563 THTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLE-------ICK 615
              S  G +N  +  TYK     +    +G++DG +   F  +   ++ +       I K
Sbjct: 355 DSLSSIGNINDNSNNTYKMN--LSEKSCKGVVDGEVFKMFFSMPFEKQFKTYIYAKWIAK 412

Query: 616 KIGSKHNDILDELYDIEALSS 636
           K+  K     + + DIE L S
Sbjct: 413 KLNCKFGTFENFMLDIENLCS 433


>gi|328858656|gb|EGG07768.1| hypothetical protein MELLADRAFT_105631 [Melampsora larici-populina
            98AG31]
          Length = 1216

 Score = 43.5 bits (101), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 52/214 (24%), Positives = 89/214 (41%), Gaps = 13/214 (6%)

Query: 372  SMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGS 431
            ++++ KN I+VGD  +SI +L +  +  +L ++ RDY        G  +    R  +   
Sbjct: 964  TVLTEKNWIIVGDLYKSIVVLEFDLKKFSLKVLGRDYSAMSVRPIGMIS---DRVFVAAD 1020

Query: 432  LVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGG 491
              +    + + ER     + G K  D  +E  S+     D D+           +  N  
Sbjct: 1021 TEFNLFTVEMRER-----QKGLKEEDEDEEGLSVEEEKGDDDEWEEEERRMRVEKVFNDD 1075

Query: 492  HRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRF--LTWYASLDGALGFFLPLPE- 548
            H L     FHLG++VN  FK      S+    G   ++     + S  G +G  + L + 
Sbjct: 1076 H-LDTVGGFHLGENVN-HFKAGSLVKSLKHFYGQDLKYGGKLIFVSSTGGIGVIIKLEDL 1133

Query: 549  KNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGK 582
            K Y+ L  L++ +       GGL+   FR +K K
Sbjct: 1134 KIYKHLKALEDRLKKEILSIGGLDSTEFRKFKNK 1167


>gi|323447810|gb|EGB03719.1| hypothetical protein AURANDRAFT_72671 [Aureococcus anophagefferens]
          Length = 760

 Score = 43.5 bits (101), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 49/181 (27%), Positives = 75/181 (41%), Gaps = 20/181 (11%)

Query: 240 FSWEEIPQTNF---PLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSED--VTCR 294
           F  +E P  +     L   E  LC   +S++   T    R +  +GT +   E+    C 
Sbjct: 413 FLRDEAPYNDVHREALEPLEIPLCCSIISLDSISTYKDQRAHFVVGTAFAAQENDFEPCS 472

Query: 295 GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV-AGFLVTAVGQKIYIWQ- 352
           GR+++F         GQ      +  ++  E  G V  +  + A  LV AV   I+I+  
Sbjct: 473 GRMIIF-------RSGQANVAPSV--LFFVEANGAVYDVAAMRASLLVCAVNHAIHIYDP 523

Query: 353 -LKDN---DLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDY 408
            ++DN    L   A  D  V    +    NLI+VGD  RS+ LL    +   +  VA DY
Sbjct: 524 VVRDNRRGHLKPRASYDGLVVALKVQCYGNLIVVGDMMRSVTLLNLIRQKMIIVEVACDY 583

Query: 409 K 409
            
Sbjct: 584 N 584


>gi|119191318|ref|XP_001246265.1| hypothetical protein CIMG_00036 [Coccidioides immitis RS]
          Length = 1072

 Score = 43.5 bits (101), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 30/137 (21%), Positives = 60/137 (43%), Gaps = 6/137 (4%)

Query: 467  FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGAR 526
            F+++D + N+V+          +   R+   ++  LG+ VN     R  P  +  +P + 
Sbjct: 906  FLVADAEGNLVVLNRDTTGVTEDDRRRMQVTSELRLGEMVN-----RIHPMDLQTSPESP 960

Query: 527  SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYA 586
                 + A++DG++  F  +       L+ LQ+ +    +  G +    +R +K     A
Sbjct: 961  VIPKAFLATVDGSIYLFGLISPSAQDTLMRLQSALADFVASPGEIPFNKYRAFKSSVRQA 1020

Query: 587  GNPSRGIIDGSLVWKFL 603
              P R  +DG L+ +FL
Sbjct: 1021 EEPFR-FVDGELIEQFL 1036


>gi|358338734|dbj|GAA31211.2| DNA damage-binding protein 1, partial [Clonorchis sinensis]
          Length = 1515

 Score = 43.5 bits (101), Expect = 0.36,   Method: Compositional matrix adjust.
 Identities = 60/273 (21%), Positives = 99/273 (36%), Gaps = 59/273 (21%)

Query: 171  VRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVS 230
            VR VPL+ TP  LA   ET +  ++T   E   +   F                  P+ S
Sbjct: 769  VRTVPLEETPKRLALQDETGSLGVITYRQEVFQEGSGFK-----------------PVRS 811

Query: 231  QFHVSLFSPFSWEEIPQT----------NFPLHEWEHVLCLKNVSME--------YEGTL 272
               +S   P S   +P+T           F   E   +L     +ME        +  TL
Sbjct: 812  SISLSQKVPKSTSRLPKTAPSSVSATERKFREVEVSSLLIFNKSTMELMFAHSFYFSQTL 871

Query: 273  SGLRGYIA--------------LGTNYNYSEDVT-CRGRILLFDIIEVVPEPGQPLTKNK 317
              +   IA              +GT +   E+V   +GRI LF      PE        +
Sbjct: 872  VEVAVSIASIEPTDGSKSMLYAVGTAFLVEEEVEPSKGRIHLF---HWDPETA------R 922

Query: 318  IKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVK 377
            ++ +   +  G V  +    G L+ A+   + ++ +K++ L      +  +    +    
Sbjct: 923  LETVLVHDVNGAVYRLLDFNGRLLAAINSSVRLFDIKEDSLRLACSFNENIIALFLRRKG 982

Query: 378  NLILVGDYARSIALLRYQPEYRTLSLVARDYKP 410
            + +LVGD  RS+ LL Y+P       + R   P
Sbjct: 983  DFVLVGDLMRSLTLLLYRPNVNNFEAIGRHRNP 1015


>gi|156841606|ref|XP_001644175.1| hypothetical protein Kpol_1059p7 [Vanderwaltozyma polyspora DSM
            70294]
 gi|156114812|gb|EDO16317.1| hypothetical protein Kpol_1059p7 [Vanderwaltozyma polyspora DSM
            70294]
          Length = 1346

 Score = 43.1 bits (100), Expect = 0.43,   Method: Compositional matrix adjust.
 Identities = 23/51 (45%), Positives = 32/51 (62%), Gaps = 3/51 (5%)

Query: 416  KGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI-GSKHNDILDEFSSM 465
            + YYA  P + IIDG L  +FL L+  ERLEICK +  +K  DI+ + + M
Sbjct: 1293 RSYYA--PVKNIIDGDLCERFLYLNSNERLEICKNLKDTKPEDIIRQINEM 1341



 Score = 42.0 bits (97), Expect = 0.95,   Method: Compositional matrix adjust.
 Identities = 23/53 (43%), Positives = 34/53 (64%), Gaps = 3/53 (5%)

Query: 580  KGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI-GSKHNDILDELYDI 631
            K + YYA  P + IIDG L  +FL L+  ERLEICK +  +K  DI+ ++ ++
Sbjct: 1291 KYRSYYA--PVKNIIDGDLCERFLYLNSNERLEICKNLKDTKPEDIIRQINEM 1341


>gi|150865083|ref|XP_001384154.2| hypothetical protein PICST_58642 [Scheffersomyces stipitis CBS
           6054]
 gi|149386339|gb|ABN66125.2| DNA-repair [Scheffersomyces stipitis CBS 6054]
          Length = 541

 Score = 42.7 bits (99), Expect = 0.56,   Method: Compositional matrix adjust.
 Identities = 36/113 (31%), Positives = 55/113 (48%), Gaps = 15/113 (13%)

Query: 445 LEICKKIGSKHNDILDEFSSMGFMISDKDKNVV-LFMYQPEARESNGGHRLIKKTDFHLG 503
           ++ CK++     D+  +FSS+G +IS + K V  L  Y      SNG    + KT     
Sbjct: 1   MDECKQLLDSGADVFSKFSSLGRLISLQGKMVTDLIDY------SNGNQSQLSKTTLRPI 54

Query: 504 QHVNTF-------FKIRCKPSSISDAPGARSRFL-TWYASLDGALGFFLPLPE 548
           + V+ F       FK R KP S++D   + S+F+ T   +LD  LG  +P  E
Sbjct: 55  REVDGFLMELSTAFKKRNKPKSVTDMIKSPSKFISTGLHTLDSDLGGGIPTGE 107


>gi|254585271|ref|XP_002498203.1| ZYRO0G04730p [Zygosaccharomyces rouxii]
 gi|238941097|emb|CAR29270.1| ZYRO0G04730p [Zygosaccharomyces rouxii]
          Length = 1302

 Score = 42.7 bits (99), Expect = 0.57,   Method: Compositional matrix adjust.
 Identities = 23/55 (41%), Positives = 33/55 (60%), Gaps = 3/55 (5%)

Query: 578  TYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKI-GSKHNDILDELYDI 631
            ++K + YYA  P R +IDG L   FL LSL E+ ++CK+  GS    +  +L DI
Sbjct: 1245 SFKYRSYYA--PVRNVIDGDLCETFLNLSLSEQTKLCKETSGSNPEGVCKQLNDI 1297


>gi|407044103|gb|EKE42371.1| DNA damage-binding protein, putative [Entamoeba nuttalli P19]
          Length = 1088

 Score = 42.7 bits (99), Expect = 0.61,   Method: Compositional matrix adjust.
 Identities = 39/169 (23%), Positives = 74/169 (43%), Gaps = 6/169 (3%)

Query: 467  FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGAR 526
            ++ SD + N+++F       ES    RL      H+G+ +N   K    P+  +     +
Sbjct: 923  YLSSDSNSNILIFNTNSTGNESER-FRLNNCAHIHVGECINVMCKGSIAPTHSTYETVQK 981

Query: 527  SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQN-VMVTHTSHTGGLNPRAFRTYKGKGYY 585
               L  +  + G +G    +P + Y  L+ +QN +++          P  ++  K    +
Sbjct: 982  KCIL--FGGVTGYIGGICEIPNEIYDILIKVQNQILLQMKGIVECTTPDDWK--KVIDDW 1037

Query: 586  AGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEAL 634
               PS  IIDGS+V  +L++S  ++ EI    G     I D + ++ +L
Sbjct: 1038 KRMPSSNIIDGSIVESYLEMSKEKQCEIAHLSGVNEEKISDIIENMISL 1086


>gi|393243160|gb|EJD50676.1| hypothetical protein AURDEDRAFT_112250 [Auricularia delicata
           TFB-10046 SS5]
          Length = 1140

 Score = 42.7 bits (99), Expect = 0.61,   Method: Compositional matrix adjust.
 Identities = 40/145 (27%), Positives = 64/145 (44%), Gaps = 21/145 (14%)

Query: 281 LGTNY-NYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGF 339
           +GT Y   SE    RGRIL+F  +E     G  LT          +  G V ++  V G 
Sbjct: 824 VGTAYIKDSEMEPSRGRILVFGSLEDSGTGGSWLTA-------FLQVTGAVLSLTSVDGL 876

Query: 340 LVTAVGQKIYIWQLKDNDLTGIAFI-----------DTEVYIASMVSVKNLILVGDYARS 388
           +V  V   + +++L+ N L+                +    + S+ +  + I +GD   S
Sbjct: 877 IVAGVNTAVILYELRRNTLSEAERASHLTLRQKKEWNHNYVVTSLAARGDTIYIGDSVAS 936

Query: 389 IALLRYQPEYRTLSLVARDYKPTQP 413
           IA+LR++ E  TL  +AR + P  P
Sbjct: 937 IAILRWKHE--TLHTIARHFGPIFP 959


>gi|183232997|ref|XP_653855.2| damaged DNA binding protein [Entamoeba histolytica HM-1:IMSS]
 gi|169801778|gb|EAL48469.2| damaged DNA binding protein, putative [Entamoeba histolytica
            HM-1:IMSS]
          Length = 1088

 Score = 42.7 bits (99), Expect = 0.65,   Method: Compositional matrix adjust.
 Identities = 57/277 (20%), Positives = 106/277 (38%), Gaps = 15/277 (5%)

Query: 347  KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVAR 406
            +I I Q+KD  L  I   D    + SM ++    L     + + +  YQ          +
Sbjct: 794  RILIVQIKDGRLEIIFEKDVNGAVYSMKTLLKKYLAMSIEKKLVVFEYQRVITNGEFEVK 853

Query: 407  DYKPTQPNSK--GYYAGNPSRGIIDGSLVWKFLQLSLGER-------LEICKKIGSKHND 457
              +    N K  G Y       I+ G L+      S            E+ +   + +  
Sbjct: 854  LQEKGSCNVKLIGLYVKTLGNKILVGDLMKSISVYSFDNNGNNKNCLTEVSRDFYASYTT 913

Query: 458  ILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPS 517
             ++      ++ SD + N+++F       ES    RL      H+G+ +N   K    P+
Sbjct: 914  AIEFVDEDCYLSSDSNSNILIFNTNSTGNESER-FRLNNCAHIHVGECINVMCKGSIAPT 972

Query: 518  SISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQN-VMVTHTSHTGGLNPRAF 576
              +     +   L  +  + G +G    +P + Y  L+ +QN +++          P  +
Sbjct: 973  HSTYETVQKKCIL--FGGVTGYIGGICEIPNEIYDVLIKVQNQILLQMKGIVECTTPDDW 1030

Query: 577  RTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEI 613
            +  K    +   PS  IIDGS+V  +L++S  ++ EI
Sbjct: 1031 K--KVIDDWKRMPSSNIIDGSIVESYLEMSKEKQCEI 1065


>gi|342885673|gb|EGU85655.1| hypothetical protein FOXB_03801 [Fusarium oxysporum Fo5176]
          Length = 1160

 Score = 42.7 bits (99), Expect = 0.65,   Method: Compositional matrix adjust.
 Identities = 32/161 (19%), Positives = 73/161 (45%), Gaps = 12/161 (7%)

Query: 467  FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGAR 526
            ++ +D   N+V+     +A       RL   ++ ++G+ +N   K+          P A 
Sbjct: 998  WLEADSKGNLVVLQRNVDAPTEQDRSRLEITSEMNIGEQINRIRKLHV--------PMAE 1049

Query: 527  SRFL---TWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKG 583
            +  +    + AS +G+L  +  +  +    L+  Q+ M  +    G +  + +R+++ + 
Sbjct: 1050 NGIVHPRAFLASAEGSLYLYGDIAPQYQDLLMTFQSKMEEYIHVPGSVEFKLWRSFRNEN 1109

Query: 584  YYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 624
              +  P R  IDG +V +FL +  G++  +C+ +G    D+
Sbjct: 1110 RESEGPFR-FIDGEMVERFLDMDEGKQELVCEGLGPSIEDM 1149


>gi|327301962|ref|XP_003235673.1| UV-damaged DNA binding protein [Trichophyton rubrum CBS 118892]
 gi|326461015|gb|EGD86468.1| UV-damaged DNA binding protein [Trichophyton rubrum CBS 118892]
          Length = 1147

 Score = 42.4 bits (98), Expect = 0.71,   Method: Compositional matrix adjust.
 Identities = 35/151 (23%), Positives = 68/151 (45%), Gaps = 6/151 (3%)

Query: 467  FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGAR 526
            ++++D + N+V+          +   RL   ++  LG+ VN    I  +  +   A  AR
Sbjct: 982  YLLADAEGNLVVLQQNITGVTESDRKRLQPTSEIRLGEMVNRIHPIVIQTYT-ETAVSAR 1040

Query: 527  SRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYA 586
            +      A++DG++  F  +       LL LQ  M + T   G +    +R ++   + +
Sbjct: 1041 A----LLATVDGSIYLFGLINPTYIDLLLRLQTAMGSITISPGEIPFSKYRAFRTTVHQS 1096

Query: 587  GNPSRGIIDGSLVWKFLQLSLGERLEICKKI 617
              P R  +DG L+ +FL  + G + EI  ++
Sbjct: 1097 DEPFR-FVDGELIERFLSCTPGMQEEIVSRL 1126


>gi|242208420|ref|XP_002470061.1| predicted protein [Postia placenta Mad-698-R]
 gi|220730961|gb|EED84811.1| predicted protein [Postia placenta Mad-698-R]
          Length = 776

 Score = 42.4 bits (98), Expect = 0.78,   Method: Compositional matrix adjust.
 Identities = 31/118 (26%), Positives = 53/118 (44%), Gaps = 20/118 (16%)

Query: 234 VSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTC 293
           + L SP  W  +    F   + E V CL  V++E   + SG++ +IA+GT  N      C
Sbjct: 411 LELISPEGW--VTMDGFESAQKEFVTCLDCVTLETTSSESGMKDFIAVGTKINCG---AC 465

Query: 294 RGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQ-KGPVTAICHVAGFLVTAVGQKIYI 350
            G                P  +N I  +  ++  K  +TA+C +   L++ + QKI++
Sbjct: 466 FGYT--------------PPYRNSILTLKCRDDAKVSITALCGMYNHLISTMDQKIFV 509


>gi|150863836|ref|XP_001382447.2| hypothetical protein PICST_54680 [Scheffersomyces stipitis CBS 6054]
 gi|149385092|gb|ABN64418.2| predicted protein [Scheffersomyces stipitis CBS 6054]
          Length = 1228

 Score = 42.4 bits (98), Expect = 0.86,   Method: Compositional matrix adjust.
 Identities = 25/104 (24%), Positives = 49/104 (47%), Gaps = 5/104 (4%)

Query: 533  YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 592
            Y  + G +G  LPL  K+  + +   N +        G N       K + YY  NP + 
Sbjct: 1129 YTGIQGTVGLLLPLSTKSEVQFI---NSLEQSLRQQMGFNLLGMDHLKFRSYY--NPVKN 1183

Query: 593  IIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSS 636
            +IDG L+ K+ +LS   +++I +++     ++  ++ D+   S+
Sbjct: 1184 VIDGDLIEKYYELSQSLKIKIARELNRTPKEVEKKISDLRNRSA 1227


>gi|115490949|ref|XP_001210102.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
 gi|114196962|gb|EAU38662.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
          Length = 908

 Score = 42.4 bits (98), Expect = 0.86,   Method: Compositional matrix adjust.
 Identities = 26/89 (29%), Positives = 44/89 (49%), Gaps = 4/89 (4%)

Query: 98  YQGVFLCGPHPAWLFLTSRGELRAHPMTIDG-PVSTLAPFHNVNCPRGFLYFNAKSELRI 156
           +  VF+ G    ++  TS      H M + G P+  L  F N     GF++ ++++ LR+
Sbjct: 795 FSSVFMPGMSAGFVLKTSAS--LPHLMRMRGAPIQCLDAF-NSPSGNGFIFLDSENALRM 851

Query: 157 SVLPTHLSYDAPWPVRKVPLKCTPHFLAY 185
             LP    +D  WP+R++P+      LAY
Sbjct: 852 CQLPRETHFDYQWPMRRIPIGEQIDHLAY 880


>gi|116195210|ref|XP_001223417.1| hypothetical protein CHGG_04203 [Chaetomium globosum CBS 148.51]
 gi|88180116|gb|EAQ87584.1| hypothetical protein CHGG_04203 [Chaetomium globosum CBS 148.51]
          Length = 1127

 Score = 42.0 bits (97), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 31/155 (20%), Positives = 62/155 (40%), Gaps = 6/155 (3%)

Query: 470  SDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRF 529
            +D   N+++     E   +    R+   ++ +L + VN     R +   +   PGA    
Sbjct: 968  ADAQGNLMVLRRNVEGVTAEDKRRMEVTSEINLNEMVN-----RIRTIDVETTPGAMIVP 1022

Query: 530  LTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNP 589
              +  +++G +  F  +       LL  Q+ +       G +  R +R ++        P
Sbjct: 1023 KAFLGTVEGGIYMFGTVAPHVQDLLLRFQSRLADVLKTAGDIEFRTYRAFRNAEREGDGP 1082

Query: 590  SRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 624
             R  +DG L+ KFL +    +  +CK +G    D+
Sbjct: 1083 FR-FVDGELLEKFLDVDETTQEAVCKGLGPTVEDM 1116


>gi|399218485|emb|CCF75372.1| unnamed protein product [Babesia microti strain RI]
          Length = 575

 Score = 41.6 bits (96), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 55/261 (21%), Positives = 99/261 (37%), Gaps = 48/261 (18%)

Query: 169 WPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTD--PRDSRFIP 226
           W V+K+PL C    +  +  T TY   T+    + DY  FN E   ++T+  P +S    
Sbjct: 206 WCVKKIPLNCRS--MKENSITNTY--NTNAYHSNADYAVFNDESSHIITESQPINSYISD 261

Query: 227 PLVSQFH-----------VSLFSPFSWEEIPQTN----FPLHEWEHVLCLKNV------- 264
              SQ +           +S ++ +    IP+TN    FP +      C           
Sbjct: 262 DAESQINNASNMMYKNNELSSYNNYMESNIPRTNYQDLFPCYTESLTTCFSEQPYHDHQC 321

Query: 265 --SMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLT----KNKI 318
             + + +      +GY   GTNYNYS          L +    +P   QP+     +N+I
Sbjct: 322 ADNCDNQSFSQIYKGYDVYGTNYNYS---------YLNNEYADLPMYSQPIGYYGYENQI 372

Query: 319 KMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDL-TGIAFI--DTEVYIASMVS 375
           + +Y  +   P T   +      +     +     + N   + +A +  D   Y   +++
Sbjct: 373 ENVYTHQL--PYTITTNTENIASSTNNGNVAECSTRSNSCNSSVAELACDKSEYTNELIN 430

Query: 376 VKNLILVGDYARSIALLRYQP 396
              L     +A  I+ +++QP
Sbjct: 431 TNPLFQYNQHASGISGVKFQP 451


>gi|322700871|gb|EFY92623.1| DNA damage-binding protein 1 [Metarhizium acridum CQMa 102]
          Length = 1121

 Score = 41.2 bits (95), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 32/172 (18%), Positives = 69/172 (40%), Gaps = 15/172 (8%)

Query: 467  FMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRC--------KPSS 518
            ++ +D   N+++    PEA       +L   ++ ++G+ +N   ++           P +
Sbjct: 940  WLEADAQGNIIVLQRNPEAPTEQDRSKLEVTSEINIGEQINQIRRLHVASNENAVVSPKA 999

Query: 519  ISDAPGARSRFLTWYASL------DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 572
               + G     +  +  L      +G L  F  +  K    LL  Q  +  +    G ++
Sbjct: 1000 FLGSVGLSETTINCWTQLLILVQIEGTLYLFGEIAPKYQDLLLTFQARLQDYIYAPGNVS 1059

Query: 573  PRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDI 624
               +R ++ K      P R  +DG +V +FL L   ++  +C+ +G    D+
Sbjct: 1060 FNLWRAFRNKAREGDGPFR-FVDGEMVERFLDLDEAKQELVCEGLGPSVEDM 1110


>gi|340367933|ref|XP_003382507.1| PREDICTED: splicing factor 3B subunit 3-like isoform 1 [Amphimedon
            queenslandica]
          Length = 1214

 Score = 40.8 bits (94), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 34/135 (25%), Positives = 60/135 (44%), Gaps = 17/135 (12%)

Query: 498  TDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLM- 556
            T +H+G+ +NT  K       +S  PG     +  Y +L G++G  +P   K        
Sbjct: 1090 TSYHVGEGINTLHK-------VSLIPGGSEVLV--YTTLSGSIGILVPFSSKEDSDFFQH 1140

Query: 557  LQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKK 616
            L+  M +  S+  G +  +FR+     YY   P + +IDG L   +  L   +R EI   
Sbjct: 1141 LEMHMRSEWSNLVGRDHLSFRS-----YYV--PVKSVIDGDLCEVYNSLDPSKRREIALD 1193

Query: 617  IGSKHNDILDELYDI 631
            +    +++  +L D+
Sbjct: 1194 LDRSPSEVAKKLEDL 1208


>gi|221057087|ref|XP_002259681.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
            knowlesi strain H]
 gi|193809753|emb|CAQ40455.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
            knowlesi strain H]
          Length = 2256

 Score = 40.8 bits (94), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 20/85 (23%), Positives = 46/85 (54%), Gaps = 10/85 (11%)

Query: 340  LVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARS--IALLRYQPE 397
            ++  +  K++I ++ +ND T  AF++   +I+ +  +KN  +V D  R   I++  Y+ +
Sbjct: 1995 ILHCMNSKLFIHEVSENDFTKGAFLENNFFISDIKILKNFFIVADLHRGIFISMYNYEQQ 2054

Query: 398  YRTLSLVARDYKPTQPNSKGYYAGN 422
            Y + S++        P +K +++ N
Sbjct: 2055 YDSRSII--------PIAKPFFSSN 2071


>gi|82541417|ref|XP_724950.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
 gi|23479780|gb|EAA16515.1| CPSF A subunit region, putative [Plasmodium yoelii yoelii]
          Length = 2227

 Score = 40.4 bits (93), Expect = 2.9,   Method: Composition-based stats.
 Identities = 19/68 (27%), Positives = 37/68 (54%), Gaps = 2/68 (2%)

Query: 340  LVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARS--IALLRYQPE 397
            L+     KIYI ++K+ND    AF+D   YI+ +   +N I++ D  +   I +  Y+ +
Sbjct: 2005 LLHCTNSKIYIHEIKNNDFIKGAFLDNNFYISDIKIFRNFIIISDLYKGIYINMYSYEEQ 2064

Query: 398  YRTLSLVA 405
            Y +  +++
Sbjct: 2065 YDSRRIIS 2072


>gi|70954357|ref|XP_746229.1| hypothetical protein [Plasmodium chabaudi chabaudi]
 gi|56526771|emb|CAH77136.1| hypothetical protein PC000016.02.0 [Plasmodium chabaudi chabaudi]
          Length = 372

 Score = 40.4 bits (93), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 44/195 (22%), Positives = 79/195 (40%), Gaps = 38/195 (19%)

Query: 456 NDILDEFSSMGFMISDKDKNVVLFMYQPEARESN---------GGHRLIKKT-------- 498
           ++ILD  + M    +DK  +V +     EA++           GG  +   T        
Sbjct: 192 SEILDHHTIMA---ADKFDSVFILRVPEEAKQEEYGIANKCWYGGEVISSSTKNRKMEHI 248

Query: 499 -DFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLM- 556
             FH+G+ V +  K++  P+S              Y+++ G +G F+P   K    L   
Sbjct: 249 MSFHIGEIVTSLQKVKLSPASSE---------CIIYSTIMGTIGAFIPYDNKEELELTQH 299

Query: 557 LQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKK 616
           L+ ++ T      G     FR+Y        +P + +IDG L  +F  L    + ++   
Sbjct: 300 LEIILRTEKHALCGREHIFFRSYY-------HPVQHVIDGDLCEQFSSLPFDVQRKVASD 352

Query: 617 IGSKHNDILDELYDI 631
           +    ++IL +L DI
Sbjct: 353 LEKTPDEILRKLEDI 367


>gi|322787057|gb|EFZ13281.1| hypothetical protein SINV_13198 [Solenopsis invicta]
          Length = 986

 Score = 39.7 bits (91), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 34/135 (25%), Positives = 58/135 (42%), Gaps = 13/135 (9%)

Query: 278 YIALGTNY-NYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
           Y  +GT + N  E     GRILL+             ++ K   +  KE KG   ++   
Sbjct: 825 YFVVGTAFINPDETEPKMGRILLYH-----------WSEGKFTQVAEKEIKGSCYSLVEF 873

Query: 337 AGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVK-NLILVGDYARSIALLRYQ 395
            G L+ ++   + +++        +        IA  +  K + +LVGD  RS+ LL+Y+
Sbjct: 874 NGKLLASINSTVRLFEWTAEKELRLECSHFNNIIALYLKTKGDFVLVGDLMRSLTLLQYK 933

Query: 396 PEYRTLSLVARDYKP 410
               +   +ARDY P
Sbjct: 934 TMEGSFEEIARDYNP 948


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.322    0.139    0.428 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 10,737,815,892
Number of Sequences: 23463169
Number of extensions: 475138795
Number of successful extensions: 820092
Number of sequences better than 100.0: 681
Number of HSP's better than 100.0 without gapping: 380
Number of HSP's successfully gapped in prelim test: 301
Number of HSP's that attempted gapping in prelim test: 817301
Number of HSP's gapped (non-prelim): 1777
length of query: 638
length of database: 8,064,228,071
effective HSP length: 149
effective length of query: 489
effective length of database: 8,863,183,186
effective search space: 4334096577954
effective search space used: 4334096577954
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 80 (35.4 bits)