BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 000465
         (1478 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|224129926|ref|XP_002320705.1| predicted protein [Populus trichocarpa]
 gi|222861478|gb|EEE99020.1| predicted protein [Populus trichocarpa]
          Length = 1566

 Score = 1729 bits (4479), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 925/1495 (61%), Positives = 1097/1495 (73%), Gaps = 71/1495 (4%)

Query: 1    MSKSNPRPNLSQNPLAEYFTNSLMAQNQLIDSLTSHISLYHSHSLSSNPNPSSNPRSSIL 60
            MS  NP  N S++   +   N+LM QN LI SLTSHISLYHS S   +    +  RSSIL
Sbjct: 1    MSVPNPNNNFSKH---QNIPNALMTQNHLIGSLTSHISLYHSQSNPPSSPNPNP-RSSIL 56

Query: 61   KWFASLTVHQRQAHLTIVDSKFAQLLIQMLGKLRANGHGFFIILPDLPSRDPPYLPGLCY 120
            KWF SL+VHQRQ+HLT VD KF Q+L+QML KL ++GH  FIILPDL SRD   LP LC+
Sbjct: 57   KWFKSLSVHQRQSHLTTVDFKFTQILLQMLAKLHSHGHCRFIILPDLLSRD---LPSLCF 113

Query: 121  KKSRGLLSRVAESNESGRWVFESTRLFSSREGEKIEEWSCPVNCLDTFTVSVEFVENVDR 180
            KKSRGLLSR+AESNES R +FESTRLFSSREGEK+++       LD+ TVS + +ENV++
Sbjct: 114  KKSRGLLSRIAESNESERLIFESTRLFSSREGEKVDDCRSGAEGLDSVTVSEDLIENVEK 173

Query: 181  FIDIMDEISNGGFLRGEESELAGDWVEFDWLKAKGYYSIEAFIVNRLEVGLRLAWLNCNN 240
            F+++MD+ISNGGFLRGEESEL  DWVE +WLK +GYY IEAF+ N+LEV LRLAWLNC N
Sbjct: 174  FVELMDDISNGGFLRGEESELGTDWVELEWLKVRGYYCIEAFLANKLEVALRLAWLNCGN 233

Query: 241  GKKRGVKLKEKLNAAGMAANVYWRKKGCVDWWMNLDDAMRRKVLTVILGKAAKSLTHEVL 300
            GKKRGVKLKEKL+AAG+AANV+WR+KGCVDWW NLD  +RRKVL   LGKAAKSLT E+L
Sbjct: 234  GKKRGVKLKEKLSAAGVAANVFWRRKGCVDWWRNLDAEVRRKVLNFALGKAAKSLTREIL 293

Query: 301  KEASNALEDGMWLFNAGMKQSSRFYHSKSLQRTISTLSVDVECGLAISPASLSGIPASLA 360
            K+ S    D + LF AG+++  R  H++S QR    L  D E GLA  P S SG  AS A
Sbjct: 294  KDVSGVSGDELSLFRAGVQRPWRDLHAESRQRIFLKLPADAEFGLAPKP-SFSGKDASFA 352

Query: 361  TVFSGLFVLQDITTMVLSSQHNEYDIEKIFFSSLRFVSTTTDCLLRKLRGLLMVVSLDCT 420
             +F+ LFVLQDI ++VL  Q +EYD   IFFS L  + T +DC+LRKLRGL+MV+SLDCT
Sbjct: 353  NIFNSLFVLQDIVSLVLPDQGSEYDTSHIFFSMLGSLGTLSDCILRKLRGLVMVISLDCT 412

Query: 421  KLELFGEGNFKSSPNKSKEKPSTIGRRKKCRACSTKR-QNPLPKSALDELSLDKLPKDPE 479
            +LEL GEG   SS NK  EK     RRKK +  + K+  NP P  ++DE S  KL +D +
Sbjct: 413  RLELLGEGTSNSSANKPSEKLGAGSRRKKGKTQNMKKLMNPTPVKSVDESSFKKLAEDIK 472

Query: 480  GALTDTEKVDLMGSDKVPGISNGKDINRETSTSEMEMVVCHQEHARALVAGKGRTNARKT 539
             A    +K +LM S+++PGI +  + +R+ S+  +EM     EH + LV  K RT  RK 
Sbjct: 473  CAPACIKKTELMESNEMPGIPHENENHRDISSPTVEM-----EHTQGLVHEKKRTAGRKN 527

Query: 540  KTVKNKNKNCTYNNPVPVKDPKVSVLETSS-SISLQDEVEKYDKLSAQNVSVDNSTCSNV 598
            +  +NK K  +++NPV V+ P+++V E  S S+   DE  K  +LS      DN T    
Sbjct: 528  RKGRNKKKKSSFSNPVEVRKPEIAVSEAPSFSVCSSDEEAKLCRLS------DNLTTQK- 580

Query: 599  LASNQSSCT-SASVPAREGIATQSTQEDCVVNSVNS------ECRRFSNGRIDNQTQHFL 651
             ASN S    S + P R+ I      ED  V           E  R SNG +DN++    
Sbjct: 581  -ASNDSLIDPSINEPTRKEIDALGIPEDHAVGCTEGISDAGLEHYRSSNGFVDNKSMPSR 639

Query: 652  QETTDSKVECNIISPDMPARDLDNAFGNSISGINFQNSFHESETGAISVLPDKGIEALEI 711
            +ET    V  NII      ++L     N   G +F N   E +      + +K +   E+
Sbjct: 640  RETR-CGVGQNIIYQVATTKELITVSSNE--GTSFLNKKTEVKLD----VGNKLVRTHEV 692

Query: 712  KKESAVTQDQRNESFFGTALKSSLECPSYEWPTIAPVYFPSISSHLLPATDRLHLDVGHN 771
            K+   + + + +E+F  +  K   +C SYEWP++ PVYFPSI+SHL PAT RLHLDVGHN
Sbjct: 693  KEVPTLNRREESENFHESGSKGLSDCLSYEWPSLGPVYFPSINSHLPPATYRLHLDVGHN 752

Query: 772  WHNHVRQPFVPTLHQARNHPFDGGCNQILSQPLPMSLDWPPMVQNVSGIAPSVTCNYDSG 831
            WHNH+ QPF+PT+HQARN P +GG N++LSQPLPMSLDWPPMV++  G+AP++TCNYDSG
Sbjct: 753  WHNHIHQPFLPTVHQARNSPIEGGSNRMLSQPLPMSLDWPPMVRSNCGLAPTMTCNYDSG 812

Query: 832  FISSRQSGFQQNFATKGMQFNAKTSDDEGKCSGDFMDLPEPTTTQEQGDECDSHWLSEEE 891
            FIS  QS FQ+++  K MQ+ +KT DDE +CSGD +D  E T++QE  DE ++HW+SEEE
Sbjct: 813  FISRWQSTFQKSYTAKNMQYISKTFDDERRCSGDAIDFTEATSSQELMDEYENHWISEEE 872

Query: 892  LEVHTVSGIDYNQYFGGGVMYWNTSDHPGTGFSRPPSLSSDDSSWAWHEADIKRAVDDMV 951
             EVH VSGIDYNQ+FGGGVMYW+ SDHPGTGFSRPPSLSSDDS W WHEA++ RAVDDMV
Sbjct: 873  YEVHAVSGIDYNQHFGGGVMYWDPSDHPGTGFSRPPSLSSDDSGWPWHEAELNRAVDDMV 932

Query: 952  AFSSSYSTNGLTSPTAASFCSPFDPLGPGHQAFSYVVPGNEVPGKVLHSSSTTTDVATEE 1011
            AFSSSYST GLTSPTAASFCS FDPL PGHQA  YV+ GNEVPGK +  SST TD A EE
Sbjct: 933  AFSSSYSTTGLTSPTAASFCSAFDPLVPGHQALGYVMSGNEVPGKAM-LSSTVTDAAAEE 991

Query: 1012 EISGSFASLSGDVDSKALDTLPCPILRPIIIPNLSRERSRSDFKRSHEHKSPCVPPSRRE 1071
            ++SGS ASLS DV+ KA D+LP PILRPIIIPN+SRERSRSDFKRS +HKSPCVPP+RRE
Sbjct: 992  DVSGSLASLSSDVEGKAGDSLPYPILRPIIIPNMSRERSRSDFKRSLDHKSPCVPPTRRE 1051

Query: 1072 QPRIKRPPSPVVLCVPRAPRPPPPSPVSDSRKTRGFPTVRSGSSSPRHWGVRGWYHEGTT 1131
             PRIKRPPSPVVLCVPRAPRPPPPSPVSDSRK RGFPTVRSGSSSPR WGVRGWYH+GT 
Sbjct: 1052 HPRIKRPPSPVVLCVPRAPRPPPPSPVSDSRKHRGFPTVRSGSSSPRQWGVRGWYHDGTN 1111

Query: 1132 SEEGCVRMDGSEVVWPSWRNKNLSAHPMIQPLSGALLQDHLIAISQLARDQEH------- 1184
             EE C RMDG+EVVWPSWRNK LS HPM+QPL GALLQD LIA+S LARDQ+H       
Sbjct: 1112 LEEACGRMDGAEVVWPSWRNKKLSTHPMVQPLPGALLQDRLIAMSHLARDQDHVSVLLYC 1171

Query: 1185 --PDVAFPLQPLEVQNCPTRKASLSLMHSLLHEEIDSFCKQVAAENTARKPYINWAVKRV 1242
              PDV FPLQ  E+QNCPTRKASL L+ SLLH+EIDSFCKQVAA N ARKP+INWAVKRV
Sbjct: 1172 AIPDVLFPLQRAEIQNCPTRKASLCLVQSLLHDEIDSFCKQVAAANMARKPFINWAVKRV 1231

Query: 1243 TRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKET 1302
            TRSLQVLWPRSR NIFGS+ATGL+LP+SDVDLVVCLPPVRNLEPIKEAGILEGRNGIKET
Sbjct: 1232 TRSLQVLWPRSRINIFGSSATGLALPTSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKET 1291

Query: 1303 CLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTT 1362
            CLQ                        IP+IMLVVEVP DLI SAAS+VQSPKE+  H T
Sbjct: 1292 CLQ------------------------IPVIMLVVEVPTDLITSAASNVQSPKEEPIHLT 1327

Query: 1363 LKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELT 1422
             +HD  V S+MV L+DS SPKC+  + D+ +   S+RLDISFKSPSHTGLQTT LVK+LT
Sbjct: 1328 GEHDIQVQSNMVVLEDSISPKCTQLNCDSKRDVKSIRLDISFKSPSHTGLQTTQLVKDLT 1387

Query: 1423 EQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQHEHHLGRPINQ 1477
            EQFPA+TPLALVLKQFLADRSLDQSYSGGLSSYCL+LLI RFLQHEHHLGRPINQ
Sbjct: 1388 EQFPAATPLALVLKQFLADRSLDQSYSGGLSSYCLVLLIIRFLQHEHHLGRPINQ 1442


>gi|147819078|emb|CAN65347.1| hypothetical protein VITISV_000637 [Vitis vinifera]
          Length = 1500

 Score = 1609 bits (4166), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 882/1474 (59%), Positives = 1042/1474 (70%), Gaps = 134/1474 (9%)

Query: 11   SQNPLAEYFTNSLMAQNQLIDSLTSHISLYHSHSLSSNPNPSSNPRSSILKWFASLTVHQ 70
            S NP    F +S    NQL+DSLT+HISLYH+ S SS+PNP+ NPRSSILKWF+SLTV Q
Sbjct: 45   SSNPPGNLFMDS----NQLVDSLTAHISLYHNRSPSSSPNPNPNPRSSILKWFSSLTVQQ 100

Query: 71   RQAHLTIVDSKFAQLLIQMLGKLRANGHGFFIILPDLPSRDPPYLPGLCYKKSRGLLSRV 130
            RQ++++ VDS F Q+L+QM  KL  +GHGFFIILPDLPSRD P+LP LC++KSRGLL+RV
Sbjct: 101  RQSYISAVDSNFVQILLQMQFKLYTHGHGFFIILPDLPSRDRPHLPSLCFRKSRGLLARV 160

Query: 131  AESNESGRWVFESTRLFSSREGEKIEEWSCPVNCLDTFTVSVEFVENVDRFIDIMDEISN 190
            +ESN+  R + +S RLF S+EGE++E+ SC  + LD+ TV  EFV NVDRF+  MD +SN
Sbjct: 161  SESNDLERLINDSVRLFGSKEGERVEDCSCSASFLDSLTVCEEFVSNVDRFVAAMDSVSN 220

Query: 191  GGFLRGEESELAGDWVEFDWLKAKGYYSIEAFIVNRLEVGLRLAWLNC-NNGKKRGVKLK 249
            GGFLRGEES L  DWVE +WLKAKGYYSIE+F+ NRLEV LRLAW NC NNGKKRGVKLK
Sbjct: 221  GGFLRGEESGLGSDWVELEWLKAKGYYSIESFVANRLEVALRLAWFNCGNNGKKRGVKLK 280

Query: 250  EKLNAAGMAANVYWRKKGCVDWWMNLDDAMRRKVLTVILGKAAKSLTHEVLKEASNALED 309
            EK+N AG+AANV+WRKKGC+DWW NLD AMRRK++ V+LGKAAKSLT E+LK A +ALED
Sbjct: 281  EKVNVAGIAANVFWRKKGCIDWWQNLDCAMRRKMIIVVLGKAAKSLTDEILKGAYSALED 340

Query: 310  GMWLFNAGMKQSSRFYHSKSLQRTISTLSVDVECGLAISPASLSGIPASLATVFSGLFVL 369
              WLFNAG  Q  ++ ++ S QRT   LS D E G  + P+S+SG               
Sbjct: 341  EKWLFNAGGGQPVKYKYTASSQRTDQALSDDAEAGSIMIPSSVSG-------------KT 387

Query: 370  QDITTMVLSSQHNEYDIEKIFFSSLRFVSTTTDCLLRKLRGLLMVVSLDCTKLELFGEGN 429
            QDI  ++L+ QH+EYD +KIFFS+L  +ST +DC+ RKLRGLLMVV LD TKLEL GEGN
Sbjct: 388  QDILNIILTCQHSEYDRDKIFFSTLGSISTISDCIFRKLRGLLMVVWLDFTKLELLGEGN 447

Query: 430  FKSSPNKSKEKPSTIGRRKKCRACSTKRQNPLPKSALDELSLDKLPKDPEGALTDTEKVD 489
             KS PNKSKEK  T  R+K+ +  + K+ NP+P+S  D+    K  KD    L   + VD
Sbjct: 448  LKSPPNKSKEKLGTGXRKKRGKTRNMKKLNPVPRSCGDBSKSLKPLKDHGCRLAYAKCVD 507

Query: 490  LMGSDKVPGISNGKDINRETSTSEMEMVVCHQEHARALVAGKGRTNARKTKTVKNKNKNC 549
             + S+++ G     D+  E S+S +EM          + +GK +  ARK++  +NKN+  
Sbjct: 508  FVESNRMAGELQQSDLRMEASSSVVEM-------ENDMFSGKVQNAARKSRKERNKNRIY 560

Query: 550  TYNNPVPVKDPKVSVLETSS-SISLQDEVEKY----DKLSAQNVSVDNSTCSNVLASNQS 604
            +  +PV V+D +    E S+ S+  Q E  K     D   ++NV  D S   +   S  S
Sbjct: 561  SLKDPVEVRDLETITTEPSAPSVISQSEPSKSNWKSDSSVSENVPNDASIGCDKFIS--S 618

Query: 605  SCTSASVPAREGIATQSTQEDCVVNSVNSECRRFSNGRIDNQTQHFLQETTDSKVECNII 664
             C   + P+R     QS +ED VV+S+                                 
Sbjct: 619  PCKPTNGPSRAETTAQSIREDPVVSSI--------------------------------- 645

Query: 665  SPDMPARDLDNAFGNSISGINFQNSFHESETGAISVLPDKGIEALEIKKESAVTQDQRNE 724
                   ++D AF  S   I FQNS H SET    V  DK I+A E+++E    Q+Q   
Sbjct: 646  -------EVDVAF--SGEDIKFQNSEHLSETDTKCV-SDKPIKATELEEEIVQNQEQERG 695

Query: 725  SFFGTALKSSLECPSYEWPTIAPVYFPSISSHLLPA-TDRLHLDVGHNWHNHVRQPFVPT 783
             F  T   SS ECPSYEWPT+AP++F SI+S  LPA TDRLHLDVG NWHNH  Q FVP+
Sbjct: 696  KFCNTGSTSSSECPSYEWPTVAPIHFTSINSQHLPAATDRLHLDVGRNWHNHFHQSFVPS 755

Query: 784  LHQARNHPFDGGCNQILSQPLPMSLDWPPMVQNVSGIAPSVTCNYDSGFISSRQSGFQQN 843
            +HQ RN P D GC+QILS+PLPMSLDWPPMV+++S +APS+TCNYD GFIS  QS F+Q 
Sbjct: 756  IHQTRNPPLDAGCSQILSRPLPMSLDWPPMVRSISRLAPSMTCNYDPGFISRMQSSFRQG 815

Query: 844  FATKGMQFNAKTSDDEGKCSGDFMDLPEPTTTQEQGDECDSHWLSEEELEVHTVSGIDYN 903
            F    +Q N  TS+DE K SGD MDL + T  QE  DECDSHW+SEEE E+H VSG+DY+
Sbjct: 816  FPAHNVQVNTATSEDERKYSGDLMDLSDLTNVQELADECDSHWISEEEFELHAVSGLDYS 875

Query: 904  QYFGGGVMYWNTSDHPGTGFSRPPSLSSDDSSWAWHEADIKRAVDDMVAFSSSYSTNGLT 963
            QYFGGGVMYWN+SDHPG+GFSRPPSLSSDDSSWAWHEAD+ RAVDDMVAFSSSYSTNGL 
Sbjct: 876  QYFGGGVMYWNSSDHPGSGFSRPPSLSSDDSSWAWHEADMNRAVDDMVAFSSSYSTNGLA 935

Query: 964  SPTAASFCSPFDPLGPGHQAFSYVVPGNEVPGKVLHSSSTTTDVATEEEISGSFASLSGD 1023
            SPTAASFCSPFDPLG GHQ   YV+ GNE PGKVLHSSS + D   EE++SGS A+L  D
Sbjct: 936  SPTAASFCSPFDPLGAGHQPLGYVISGNEGPGKVLHSSSASADAMPEEKVSGSLANLPVD 995

Query: 1024 VDSKALDTLPCPILRPIIIPNLSRERSRSDFKRSHEHKSPCVPPSRREQPRIKRPPSPVV 1083
            V+ K  D LP  +L PIIIPN+SRERSRS+FKR+ + KSPCVPP+RREQPRIKRPPSPVV
Sbjct: 996  VEGKTGDPLPYSLLPPIIIPNMSRERSRSEFKRNFDRKSPCVPPARREQPRIKRPPSPVV 1055

Query: 1084 LCVPRAPRPPPPSPVSDSRKTRGFPTVRSGSSSPRHWGVRGWYHEGTTSEEGCVRMDGSE 1143
            LCVPRAPRPPPPSPVSDSRK RGFPTVRSGSSSPRHWG+RGWYH+G+  EE CV +DG+E
Sbjct: 1056 LCVPRAPRPPPPSPVSDSRKNRGFPTVRSGSSSPRHWGMRGWYHDGSNLEEACVCIDGAE 1115

Query: 1144 VVWPSWRNKNLSAHPMIQPLSGALLQDHLIAISQLARDQEHPDVAFPLQPLEVQNCPTRK 1203
            VVWPSWRNKNLS  PMIQPL GALLQD LIAISQLARDQEHPDVAFPLQP ++ +C  RK
Sbjct: 1116 VVWPSWRNKNLSTRPMIQPLPGALLQDRLIAISQLARDQEHPDVAFPLQPPDLLSCSMRK 1175

Query: 1204 ASLSLMHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNAT 1263
             +LS+MHSLLHEEIDSF K+VAAEN  RKPYINWAVKRVTRSLQVLWPRSRTNIFGSNAT
Sbjct: 1176 TALSMMHSLLHEEIDSFWKKVAAENMIRKPYINWAVKRVTRSLQVLWPRSRTNIFGSNAT 1235

Query: 1264 GLSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLK 1323
            GLSLP+SDVDLV+CLPPVRNLEPIKEAGILEGRNGIKETCLQ                  
Sbjct: 1236 GLSLPTSDVDLVICLPPVRNLEPIKEAGILEGRNGIKETCLQ------------------ 1277

Query: 1324 TVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPK 1383
                  IPIIMLVVEVP DL  SAA ++Q+ KE+         +H+ SD           
Sbjct: 1278 ------IPIIMLVVEVPPDLTTSAAPNLQTSKEEPTPMPGGQGSHI-SDRNGW------- 1323

Query: 1384 CSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRS 1443
                                F+    T       VKELTEQFPA+TPLALVLKQFLADRS
Sbjct: 1324 --------------------FRELCFTK------VKELTEQFPAATPLALVLKQFLADRS 1357

Query: 1444 LDQSYSGGLSSYCLMLLITRFLQHEHHLGRPINQ 1477
            LDQSYSGGLSSYCL+LLITRFLQHEHHLGRPINQ
Sbjct: 1358 LDQSYSGGLSSYCLVLLITRFLQHEHHLGRPINQ 1391


>gi|297734532|emb|CBI16583.3| unnamed protein product [Vitis vinifera]
          Length = 1331

 Score = 1547 bits (4006), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 822/1301 (63%), Positives = 960/1301 (73%), Gaps = 87/1301 (6%)

Query: 185  MDEISNGGFLRGEESELAGDWVEFDWLKAKGYYSIEAFIVNRLEVGLRLAWLNC-NNGKK 243
            MD +SNGGFLRGEES L  DWVE +WLKAKGYYSIE+F+ NRLEV LRLAW NC NNGKK
Sbjct: 1    MDSVSNGGFLRGEESGLGSDWVELEWLKAKGYYSIESFVANRLEVALRLAWFNCGNNGKK 60

Query: 244  RGVKLKEKLNAAGMAANVYWRKKGCVDWWMNLDDAMRRKVLTVILGKAAKSLTHEVLKEA 303
            RGVKLKEK+N AG+AANV+WRKKGC+DWW NLD AMRRK++ V+LGKAAKSLT E+LK A
Sbjct: 61   RGVKLKEKVNVAGIAANVFWRKKGCIDWWQNLDCAMRRKMIIVVLGKAAKSLTDEILKGA 120

Query: 304  SNALEDGMWLFNAGMKQSSRFYHSKSLQRTISTLSVDVECGLAISPASLSGIPASLATVF 363
             +ALED  WLFNAG  Q  ++ ++ S QRT   LS D E G  + P+S+SG P S     
Sbjct: 121  YSALEDEKWLFNAGGGQPVKYKYTASSQRTDQALSDDAEAGSIMIPSSVSGKPKSFFNFS 180

Query: 364  SGLFVLQDITTMVLSSQHNEYDIEKIFFSSLRFVSTTTDCLLRKLRGLLMVVSLDCTKLE 423
            +GLFV+QDI  ++L+ QH+EYD +KIFFS+L  +ST +DC+ RKLRGLLMVV LD TKLE
Sbjct: 181  NGLFVVQDILNIILTCQHSEYDRDKIFFSTLGSISTISDCIFRKLRGLLMVVWLDFTKLE 240

Query: 424  LFGEGNFKSSPNKSKEKPSTIGRRKKCRACSTKRQNPLPKSALDELSLDKLPKDPEGALT 483
            L GEGN KS PNKSKEK  T GR+K+ R  + K+ NP+P+S  D+    K  KD    L 
Sbjct: 241  LLGEGNLKSPPNKSKEKLGTGGRKKRGRTRNMKKLNPVPRSCGDDSKSLKPLKDHGCGLA 300

Query: 484  DTEKVDLMGSDKVPGISNGKDINRETSTS-EMEMVVCHQEHARALVAGKGRTNARKTKTV 542
              + VD + S+++ G     D++ E S+S E +M            +GK +  ARK++  
Sbjct: 301  YAKCVDFVESNRMAGELQQSDLHMEASSSVENDMF-----------SGKVQNAARKSRKE 349

Query: 543  KNKNKNCTYNNPVPVKDPKVSVLETSS-SISLQDEVEKY----DKLSAQNVSVDNSTCSN 597
            +NKN+  +  +PV V+D +    E S+ S+  Q E  K     D   ++NV  D S   +
Sbjct: 350  RNKNRIYSLKDPVEVRDLETITTEPSAPSVISQSEPSKSNWKSDSSVSENVPNDASIGCD 409

Query: 598  VLASNQSSCTSASVPAREGIATQSTQEDCVVNSVNSECRRFSNGRIDNQTQHFLQETTDS 657
               S  S C   + P+R     QS +ED VV+S+                          
Sbjct: 410  KFIS--SPCKPTNGPSRAETTAQSIREDPVVSSI-------------------------- 441

Query: 658  KVECNIISPDMPARDLDNAFGNSISGINFQNSFHESETGAISVLPDKGIEALEIKKESAV 717
                          ++D AF  S   I FQNS H SET    V  DK I+A E+++E   
Sbjct: 442  --------------EVDVAF--SGEDIKFQNSEHLSETDTKCV-SDKPIKATELEEEIVQ 484

Query: 718  TQDQRNESFFGTALKSSLECPSYEWPTIAPVYFPSISSHLLPA-TDRLHLDVGHNWHNHV 776
             Q+Q    F  T   SS ECPSYEWPT+AP++F SI+S  LPA TDRLHLDVG NWHNH 
Sbjct: 485  NQEQERGKFCNTGSTSSSECPSYEWPTVAPIHFTSINSQHLPAATDRLHLDVGRNWHNHF 544

Query: 777  RQPFVPTLHQARNHPFDGGCNQILSQPLPMSLDWPPMVQNVSGIAPSVTCNYDSGFISSR 836
             Q FVP++HQ RN   D GC+QILS+PLPMSLDWPPMV+++S +APS+TCNYD GFIS  
Sbjct: 545  HQSFVPSIHQTRNPSLDAGCSQILSRPLPMSLDWPPMVRSISRLAPSMTCNYDPGFISRM 604

Query: 837  QSGFQQNFATKGMQFNAKTSDDEGKCSGDFMDLPEPTTTQEQGDECDSHWLSEEELEVHT 896
            Q                       K SGD MDL + T  QE  DECDSHW+SEEE E+H 
Sbjct: 605  Q-----------------------KYSGDLMDLSDLTNVQELADECDSHWISEEEFELHA 641

Query: 897  VSGIDYNQYFGGGVMYWNTSDHPGTGFSRPPSLSSDDSSWAWHEADIKRAVDDMVAFSSS 956
            VSG+DY+QYFGGGVMYWN+SDHPG+GFSRPPSLSSDDSSWAWHEAD+ RAVDDMVAFSSS
Sbjct: 642  VSGLDYSQYFGGGVMYWNSSDHPGSGFSRPPSLSSDDSSWAWHEADMNRAVDDMVAFSSS 701

Query: 957  YSTNGLTSPTAASFCSPFDPLGPGHQAFSYVVPGNEVPGKVLHSSSTTTDVATEEEISGS 1016
            YSTNGL SPTAASFCSPFDPLG GHQ   YV+ GNE PGKVLHSSS + D   EE++SGS
Sbjct: 702  YSTNGLASPTAASFCSPFDPLGAGHQPLGYVISGNEGPGKVLHSSSASADAMPEEKVSGS 761

Query: 1017 FASLSGDVDSKALDTLPCPILRPIIIPNLSRERSRSDFKRSHEHKSPCVPPSRREQPRIK 1076
             A+L  DV+ K  D LP  +L PIIIPN+SRERSRS+FKR+ + KSPCVPP+RREQPRIK
Sbjct: 762  LANLPVDVEGKTGDPLPYSLLPPIIIPNMSRERSRSEFKRNFDRKSPCVPPARREQPRIK 821

Query: 1077 RPPSPVVLCVPRAPRPPPPSPVSDSRKTRGFPTVRSGSSSPRHWGVRGWYHEGTTSEEGC 1136
            RPPSPVVLCVPRAPRPPPPSPVSDSRK RGFPTVRSGSSSPRHWG+RGWYH+G+  EE C
Sbjct: 822  RPPSPVVLCVPRAPRPPPPSPVSDSRKNRGFPTVRSGSSSPRHWGMRGWYHDGSNLEEAC 881

Query: 1137 VRMDGSEVVWPSWRNKNLSAHPMIQPLSGALLQDHLIAISQLARDQEHPDVAFPLQPLEV 1196
            V +DG+EVVWPSWRNKNLS  PMIQPL GALLQD LIAISQLARDQEHPDVAFPLQP ++
Sbjct: 882  VCIDGAEVVWPSWRNKNLSTRPMIQPLPGALLQDRLIAISQLARDQEHPDVAFPLQPPDL 941

Query: 1197 QNCPTRKASLSLMHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTN 1256
             +C  RK +LS+MHSLLHEEIDSF K+VAAEN  RKPYINWAVKRVTRSLQVLWPRSRTN
Sbjct: 942  LSCSMRKTALSMMHSLLHEEIDSFWKKVAAENMIRKPYINWAVKRVTRSLQVLWPRSRTN 1001

Query: 1257 IFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEW 1316
            IFGSNATGLSLP+SDVDLV+CLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEW
Sbjct: 1002 IFGSNATGLSLPTSDVDLVICLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEW 1061

Query: 1317 VKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVAL 1376
            VK+DSLKTVENTAIPIIMLVVEVP DL  SAA ++Q+ KE+         +H+ ++M  L
Sbjct: 1062 VKNDSLKTVENTAIPIIMLVVEVPPDLTTSAAPNLQTSKEEPTPMPGGQGSHIQTEMGGL 1121

Query: 1377 DDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLK 1436
            ++SASPKC+  + DN K + SVR+DISFKSPSHTGLQTT+LVKELTEQFPA+TPLALVLK
Sbjct: 1122 ENSASPKCAQINYDNSKDSKSVRIDISFKSPSHTGLQTTELVKELTEQFPAATPLALVLK 1181

Query: 1437 QFLADRSLDQSYSGGLSSYCLMLLITRFLQHEHHLGRPINQ 1477
            QFLADRSLDQSYSGGLSSYCL+LLITRFLQHEHHLGRPINQ
Sbjct: 1182 QFLADRSLDQSYSGGLSSYCLVLLITRFLQHEHHLGRPINQ 1222


>gi|449432996|ref|XP_004134284.1| PREDICTED: uncharacterized protein LOC101221970 [Cucumis sativus]
          Length = 1526

 Score = 1529 bits (3958), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 833/1484 (56%), Positives = 1027/1484 (69%), Gaps = 97/1484 (6%)

Query: 24   MAQNQLIDSLTSHISLYHSHSLSSNPNPSSN--PRSSILKWFASLTVHQRQAHLTIVDSK 81
            MAQNQLIDSLTSHISLYHS SL  NP+ +SN  PRSSILKWF+SL+VHQRQAHLT+VD K
Sbjct: 1    MAQNQLIDSLTSHISLYHSTSLPLNPDTNSNLNPRSSILKWFSSLSVHQRQAHLTVVDFK 60

Query: 82   FAQLLIQMLGKLRANGHGFFIILPDLPSRDPPYLPGLCYKKSRGLLSRVAESNESGRWVF 141
            F Q+LIQM+ ++R  GHGFFIILPD+ S DP +LP LC+KKSRGLLSRV++SNES R +F
Sbjct: 61   FVQILIQMVAEVRKRGHGFFIILPDILSTDPLHLPSLCFKKSRGLLSRVSQSNESQRMIF 120

Query: 142  ESTRLFSSREGEKIEEWSCPVNCLDTFTVSVEFVENVDRFIDIMDEISNGGFLRGEESEL 201
            ESTRLF SREG+K+EE SC +  +D+ TVS EFV NVD+F++ MD +SNG FLRGE  +L
Sbjct: 121  ESTRLFGSREGDKLEECSCSLKNIDSITVSEEFVSNVDKFVEAMDGVSNGAFLRGEGGDL 180

Query: 202  AGDWVEFDWLKAKGYYSIEAFIVNRLEVGLRLAWLNCNNGKKRGVKLKEKLNAAGMAANV 261
            A +W E +WLKAKGYYS+EAF+ N+LEV LRL+W+N NNGKKR VK KEK  A GMA NV
Sbjct: 181  ASNWAELNWLKAKGYYSMEAFVANKLEVALRLSWMNLNNGKKRSVKFKEKATATGMATNV 240

Query: 262  YWRKKGCVDWWMNLDDAMRRKVLTVILGKAAKS-LTHEVLKEASNALEDGMWLFNAGMKQ 320
            +WRKKGCVDWW  LD + R+ +LT ILGK+AK+ LTHE+L+  S   E  M LF+A   +
Sbjct: 241  FWRKKGCVDWWDKLDYSSRKNILTAILGKSAKNLLTHEILRWTSGLAEHEMGLFSAEWNR 300

Query: 321  SSRFYHSKSLQRTISTLSVDVECGLAISPASLSGIPASLATVFSGLFVLQDITTMVLSSQ 380
              R+  + S  R++ T   D+     I P + SG P  L+ +F  L VLQDI TMV S  
Sbjct: 301  PFRYNCTTSPPRSMLTSQADLHIDFNIIPDTHSGKPYLLSNIFRNLLVLQDIVTMVSSCL 360

Query: 381  HNEYDIEKIFFSSLRFVSTTTDCLLRKLRGLLMVVSLDCTKLELFGEGNFKSSPNKSKEK 440
            H+EY    +F+S+L  +    DC+LRKLR  LM +SLDCTK EL GEGN KS P+KS+E+
Sbjct: 361  HDEYYKCNLFYSTLGSICAIPDCILRKLREFLMFISLDCTKFELLGEGNGKSFPSKSREQ 420

Query: 441  PSTIGRRKKCRACSTKRQNPLPKSALDELSLDKLPKDPE----------GALTDTEKVDL 490
                 RRKK +  S K QNP  ++ +D+LS +   K  E            +TD+  + +
Sbjct: 421  VGASSRRKKGK--SRKSQNPALRACVDDLSSNNFTKRQEFDKECGHRGREVMTDSTTMSI 478

Query: 491  MGSDKVPGISNGKDINRETSTSEMEMVVCHQEHARALVAGKGRTNAR--KTKTVKNKNKN 548
            M        S G +  RE      + V     H + +  GK +   R  K    KN   N
Sbjct: 479  M--------SKGNETCREIPADVSKTV-----HDQKMSVGKDQGTVRKKKKHKSKNSGGN 525

Query: 549  CTYNNPVPVKDPKVSVLETSSSISLQDEVEKYDKLS---------AQNVSVDNSTCSNVL 599
                   P   P V    +S S S QD+V + DK S          +N S +N   S ++
Sbjct: 526  SRLVEIRPSVGPAVKF--SSPSFSSQDQVAELDKDSIFIKPSISNIKNDSTNNFDSSTLI 583

Query: 600  ASNQSSCTSASVPAREGIATQSTQEDCV------VNSVNSECRRFSNGRIDNQTQHFLQE 653
                S    ++ P RE  +    +   V      V+ +     +FS G I+NQ      E
Sbjct: 584  P---SPLVLSNEPNREYESILKIEVHEVSGITKSVSQIGPGESQFSKGIIENQFLSSTLE 640

Query: 654  TTDSKVECNIISPDMPARDLDNAFGNSISGINFQNSFHESETGAISVLPDKGIEALEIKK 713
             + S ++C+ +   +P+ +L N      S +N ++S    E G  S L DK    +++K+
Sbjct: 641  NSSSFMDCSAVPSHLPSLELKNIVK---SDVNVKSSVRTCEVGNKSSLLDKLPRTIDVKE 697

Query: 714  ESAVTQDQRNESFFGTALKSSLECPSYEWPTIAPVYFPSISSHLLPATDRLHLDVGHNWH 773
            +S  ++ Q +         + LE   YEW  +A +Y PS +SHL PATDRLHLDVGHNWH
Sbjct: 698  KSCSSRHQFSGDTCNARTLNPLEHSPYEWHGVASLYIPSFNSHLPPATDRLHLDVGHNWH 757

Query: 774  NHVRQPFVPTLHQARNHPFDGGCNQILSQPLPMSLDWPPMVQNVSGIAPSVTCNYDSGFI 833
            NH R+ F P +HQ+RN    G CN IL++PL MSLDWPP++++ SG+A ++T N+DSG +
Sbjct: 758  NHFRRSFTPAMHQSRNSSAKGSCNPILTRPLLMSLDWPPVLRSASGLASTMTSNHDSGKL 817

Query: 834  SSRQSGFQQNFATKGMQFNAKTSDDEGKCSGDFMDLPEPTTTQEQGDECDSHWLSEEELE 893
            +                                 D P+ +  Q+  DECD +W+SEEE+E
Sbjct: 818  T---------------------------------DFPDLSNNQDLADECDGNWISEEEME 844

Query: 894  VHTVSGIDYNQYFGGGVMYWNTSDHPGTGFSRPPSLSSDDSSWAWHEADIKRAVDDMVAF 953
            +H VSGIDYNQYFGGGVMYWN SDH G GFSRPPSLSSDDSSWAW EAD+ R VDDMVAF
Sbjct: 845  MHAVSGIDYNQYFGGGVMYWNPSDHHGAGFSRPPSLSSDDSSWAWREADMNRTVDDMVAF 904

Query: 954  SSSYSTNGLTSPTAASFCSPFDPLGPGHQAFSYVVPGNEVPGKVLHSSSTTTDVATEEEI 1013
            SSSYS NGLTSPTA SFCS FDPLG G QA  YVV G ++P  +LHSS+T  D  TEE+ 
Sbjct: 905  SSSYS-NGLTSPTATSFCS-FDPLGSGKQALGYVVQGTDLPNNMLHSSTTMKDTVTEEDD 962

Query: 1014 SGSFASLSGDVDSKALDTLPCPILRPIIIPNLSRERSRSDFKRSHEHKSPCVPPSRREQP 1073
              S  +L  DV+ KA D+   PILRPI+IP++SRERSRS+F   ++HKSPC+PP+RREQ 
Sbjct: 963  PRSLPNLPSDVEGKA-DSHSFPILRPIVIPSMSRERSRSEFCHGYDHKSPCIPPTRREQS 1021

Query: 1074 RIKRPPSPVVLCVPRAPRPPPPSPVSDSRKTRGFPTVRSGSSSPRHWGVRGWYHEGTTSE 1133
            R+KRPPSPVVLCVPRAP PPPPSPVSDSRK RGFPTVRSGSSSPRHWGV+GWY +GT  E
Sbjct: 1022 RVKRPPSPVVLCVPRAPIPPPPSPVSDSRKHRGFPTVRSGSSSPRHWGVKGWYPDGTNLE 1081

Query: 1134 EGCVRMDGSEVVWPSWRNKNLSAHPMIQPLSGALLQDHLIAISQLARDQEHPDVAFPLQP 1193
            E C+R+DG+EVVWP+WRNK+ S    +QPLS       LIA+ Q+A DQEHPDVAFPL P
Sbjct: 1082 EACLRIDGAEVVWPNWRNKSNSNCSRVQPLS-------LIAMPQIALDQEHPDVAFPLFP 1134

Query: 1194 LEVQNCPTRKASLSLMHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRS 1253
              + +C  +K SLSLMHS LH+EIDSFCK VAAEN A+KPYI WAVKRVTRSLQVLWPRS
Sbjct: 1135 PTI-SCSVKKESLSLMHSRLHDEIDSFCKHVAAENMAKKPYITWAVKRVTRSLQVLWPRS 1193

Query: 1254 RTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLAN 1313
            RTNIFGSNATGLSLP+SDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYL+N
Sbjct: 1194 RTNIFGSNATGLSLPTSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLSN 1253

Query: 1314 QEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDM 1373
            QEWVKSDSLKTVENTAIPIIMLVVEVPH+L+ S+ S++QSPKE+++  + + D +  +DM
Sbjct: 1254 QEWVKSDSLKTVENTAIPIIMLVVEVPHELVTSSTSNMQSPKEESSAVSGEQDANNLNDM 1313

Query: 1374 VALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLAL 1433
             +L+DS  PKC   + D+  +  SVR+DISFK+PSHTGLQT++LVKELTEQFPA+ PLAL
Sbjct: 1314 ASLEDSILPKCLEVNYDSSISTKSVRIDISFKTPSHTGLQTSELVKELTEQFPATIPLAL 1373

Query: 1434 VLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQHEHHLGRPINQ 1477
            VLK+FLADRSLDQSYSGGLSSYCL+LLI RFLQHEHHLGRPINQ
Sbjct: 1374 VLKKFLADRSLDQSYSGGLSSYCLVLLIIRFLQHEHHLGRPINQ 1417


>gi|449478246|ref|XP_004155262.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized LOC101221970 [Cucumis
            sativus]
          Length = 1514

 Score = 1527 bits (3953), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 834/1485 (56%), Positives = 1029/1485 (69%), Gaps = 98/1485 (6%)

Query: 24   MAQNQLIDSLTSHISLYHSHSLSSNPNPSSN--PRSSILKWFASLTVHQRQAHLTIVDSK 81
            MAQNQLIDSLTSHISLYHS SL  NP+ +SN  PRSSILKWF+SL+VHQRQAHLT+VD K
Sbjct: 1    MAQNQLIDSLTSHISLYHSTSLPLNPDTNSNLNPRSSILKWFSSLSVHQRQAHLTVVDFK 60

Query: 82   FAQLLIQMLGKLRANGHGFFIILPDLPSRDPPYLPGLCYKKSRGLLSRVAESNESGRWVF 141
            F Q+LIQM+ ++R  GHGFFIILPD+ S DP +LP LC+KKSRGLLSRV++SNES R +F
Sbjct: 61   FVQILIQMVAEVRKRGHGFFIILPDILSTDPLHLPSLCFKKSRGLLSRVSQSNESQRMIF 120

Query: 142  ESTRLFSSREGEKIEEWSCPVNCLDTFTVSVEFVENVDRFIDIMDEISNGGFLRGEESEL 201
            ESTRLF SREG+K+EE SC +  +D+ TVS EFV NVD+F++ MD +SNG FLRGE  +L
Sbjct: 121  ESTRLFGSREGDKLEECSCSLKNIDSITVSEEFVSNVDKFVEAMDGVSNGAFLRGEGGDL 180

Query: 202  AGDWVEFDWLKAKGYYSIEAFIVNRLEVGLRLAWLNCNNGKKRGVKLKEKLNAAGMAANV 261
            A +W E +WLKAKGYYS+EAF+ N+LEV LRL+W+N NNGKKR VK KEK  A GMA NV
Sbjct: 181  ASNWAELNWLKAKGYYSMEAFVANKLEVALRLSWMNLNNGKKRSVKFKEKATATGMATNV 240

Query: 262  YWRKKGCVDWWMNLDDAMRRKVLTVILGKAAKS--LTHEVLKEASNALEDGMWLFNAGMK 319
            +WRKKGCVDWW  LD + R+ +LT ILGK+AK+  LTHE+L+  S   E  M LF+A   
Sbjct: 241  FWRKKGCVDWWDKLDYSSRKNILTAILGKSAKNLILTHEILRWTSGLAEHEMGLFSAEWN 300

Query: 320  QSSRFYHSKSLQRTISTLSVDVECGLAISPASLSGIPASLATVFSGLFVLQDITTMVLSS 379
            +  R+  + S  R++ T   D+     I P + SG P  L+ +F  L VLQDI TMV S 
Sbjct: 301  RPFRYNCTTSPPRSMLTSQADLHIDFNIIPDTHSGKPYLLSNIFRNLLVLQDIVTMVSSC 360

Query: 380  QHNEYDIEKIFFSSLRFVSTTTDCLLRKLRGLLMVVSLDCTKLELFGEGNFKSSPNKSKE 439
             H+EY    +F+S+L  +    DC+LRKLR  LM +SLDCTK EL GEGN KS P+KS+E
Sbjct: 361  LHDEYYKCNLFYSTLGSICAIPDCILRKLREFLMFISLDCTKFELLGEGNGKSFPSKSRE 420

Query: 440  KPSTIGRRKKCRACSTKRQNPLPKSALDELSLDKLPKDPE----------GALTDTEKVD 489
            +     RRKK +  S K QNP  ++ +D+LS +   K  E            +TD+  + 
Sbjct: 421  QVGASSRRKKGK--SRKSQNPALRACVDDLSSNNFTKRQEFDKECGHRGREVMTDSTTMS 478

Query: 490  LMGSDKVPGISNGKDINRETSTSEMEMVVCHQEHARALVAGKGRTNARKTKTVKNKNK-- 547
            +M        S G +  RE      + V     H + +  GK +   RK K  K+KN   
Sbjct: 479  IM--------SKGNETCREIPADVSKTV-----HDQKMSVGKDQGTVRKKKKHKSKNSGG 525

Query: 548  NCTYNNPVPVKDPKVSVLETSSSISLQDEVEKYDKLS---------AQNVSVDNSTCSNV 598
            N       P   P V    +S S S QD+V + DK S          +N S +N   S +
Sbjct: 526  NSRLVEIRPSVGPAVKF--SSPSFSSQDQVAELDKDSIFIKPSISNIKNDSTNNFDSSTL 583

Query: 599  LASNQSSCTSASVPAREGIATQSTQEDCV------VNSVNSECRRFSNGRIDNQTQHFLQ 652
            +    S    ++ P RE  +    +   V      V+ +     +FS G I+NQ      
Sbjct: 584  IP---SPLVLSNEPNREYESILKIEVHEVSGITKSVSQIGPGESQFSKGIIENQFLSSTL 640

Query: 653  ETTDSKVECNIISPDMPARDLDNAFGNSISGINFQNSFHESETGAISVLPDKGIEALEIK 712
            E + S ++C+ +   +P+ +L N      S +N ++S    E G  S L DK    +++K
Sbjct: 641  ENSSSFMDCSAVPSHLPSLELKNIVK---SDVNVKSSVRTCEVGNKSSLLDKLPRTIDVK 697

Query: 713  KESAVTQDQRNESFFGTALKSSLECPSYEWPTIAPVYFPSISSHLLPATDRLHLDVGHNW 772
            ++S  ++ Q +         + LE   YEW  +A +Y PS +SHL PATDRLHLDVGHNW
Sbjct: 698  EKSCSSRHQFSGDTCNARTLNPLEHSPYEWHGVASLYIPSFNSHLPPATDRLHLDVGHNW 757

Query: 773  HNHVRQPFVPTLHQARNHPFDGGCNQILSQPLPMSLDWPPMVQNVSGIAPSVTCNYDSGF 832
            HNH R+ F P +HQ+RN    G CN IL++PL MSLDWPP++++ SG+A ++T N+DSG 
Sbjct: 758  HNHFRRSFTPAMHQSRNSSAKGSCNPILTRPLLMSLDWPPVLRSASGLASTMTSNHDSGK 817

Query: 833  ISSRQSGFQQNFATKGMQFNAKTSDDEGKCSGDFMDLPEPTTTQEQGDECDSHWLSEEEL 892
            ++                                 D P+ +  Q+  DECD +W+SEEE+
Sbjct: 818  LT---------------------------------DFPDLSNNQDLADECDGNWISEEEM 844

Query: 893  EVHTVSGIDYNQYFGGGVMYWNTSDHPGTGFSRPPSLSSDDSSWAWHEADIKRAVDDMVA 952
            E+H VSGIDYNQYFGGGVMYWN SDH G GFSRPPSLSSDDSSWAW EAD+ R VDDMVA
Sbjct: 845  EMHAVSGIDYNQYFGGGVMYWNPSDHHGAGFSRPPSLSSDDSSWAWREADMNRTVDDMVA 904

Query: 953  FSSSYSTNGLTSPTAASFCSPFDPLGPGHQAFSYVVPGNEVPGKVLHSSSTTTDVATEEE 1012
            FSSSYS NGLTSPTA SFCS FDPLG G QA  YVV G ++P  +LHSS+T  D  TEE+
Sbjct: 905  FSSSYS-NGLTSPTATSFCS-FDPLGSGKQALGYVVQGTDLPNNMLHSSTTMKDTVTEED 962

Query: 1013 ISGSFASLSGDVDSKALDTLPCPILRPIIIPNLSRERSRSDFKRSHEHKSPCVPPSRREQ 1072
               S  +L  DV+ KA D+   PILRPI+IP++SRERSRS+F   ++HKSPC+PP+RREQ
Sbjct: 963  DPRSLPNLPSDVEGKA-DSHSFPILRPIVIPSMSRERSRSEFCHGYDHKSPCIPPTRREQ 1021

Query: 1073 PRIKRPPSPVVLCVPRAPRPPPPSPVSDSRKTRGFPTVRSGSSSPRHWGVRGWYHEGTTS 1132
             R+KRPPSPVVLCVPRAP PPPPSPVSDSRK RGFPTVRSGSSSPRHWGV+GWY +GT  
Sbjct: 1022 SRVKRPPSPVVLCVPRAPIPPPPSPVSDSRKHRGFPTVRSGSSSPRHWGVKGWYPDGTNL 1081

Query: 1133 EEGCVRMDGSEVVWPSWRNKNLSAHPMIQPLSGALLQDHLIAISQLARDQEHPDVAFPLQ 1192
            EE C+R+DG+EVVWP+WRNK+ S    +QPLS       LIA+ Q+A DQEHPDVAFPL 
Sbjct: 1082 EEACLRIDGAEVVWPNWRNKSNSNCSRVQPLS-------LIAMPQIALDQEHPDVAFPLF 1134

Query: 1193 PLEVQNCPTRKASLSLMHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPR 1252
            P  + +C  +K SLSLMHS LH+EIDSFCK VAAEN A+KPYI WAVKR TRSLQVLWPR
Sbjct: 1135 PPTI-SCSVKKESLSLMHSRLHDEIDSFCKHVAAENMAKKPYITWAVKRXTRSLQVLWPR 1193

Query: 1253 SRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLA 1312
            SRTNIFGSNATGLSLP+SDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYL+
Sbjct: 1194 SRTNIFGSNATGLSLPTSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLS 1253

Query: 1313 NQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSD 1372
            NQEWVKSDSLKTVENTAIPIIMLVVEVPH+L+ S+ S++QSPKE+++  + + D +  +D
Sbjct: 1254 NQEWVKSDSLKTVENTAIPIIMLVVEVPHELVTSSTSNMQSPKEESSAVSGEQDANNLND 1313

Query: 1373 MVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLA 1432
            M +L+DS  PKC   + D+  +  SVR+DISFK+PSHTGLQT++LVKELTEQFPA+ PLA
Sbjct: 1314 MASLEDSILPKCLEVNYDSSISTKSVRIDISFKTPSHTGLQTSELVKELTEQFPATIPLA 1373

Query: 1433 LVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQHEHHLGRPINQ 1477
            LVLK+FLADRSLDQSYSGGLSSYCL+LLI RFLQHEHHLGRPINQ
Sbjct: 1374 LVLKKFLADRSLDQSYSGGLSSYCLVLLIIRFLQHEHHLGRPINQ 1418


>gi|334186240|ref|NP_191917.3| Nucleotidyltransferase family protein [Arabidopsis thaliana]
 gi|332656419|gb|AEE81819.1| Nucleotidyltransferase family protein [Arabidopsis thaliana]
          Length = 1481

 Score = 1451 bits (3755), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 826/1470 (56%), Positives = 1029/1470 (70%), Gaps = 114/1470 (7%)

Query: 16   AEYFTN-SLMAQNQLIDSLTSHISLYHSHSLSSN-PNPSSNPRSSILKWFASLTVHQRQA 73
            +++F++ S MAQNQLIDSLTSHISLYHSHS SS+  N   NPRS+IL+WF+SL+VHQR +
Sbjct: 8    SKFFSSASSMAQNQLIDSLTSHISLYHSHSSSSSMANTIPNPRSAILRWFSSLSVHQRLS 67

Query: 74   HLTIVDSKFAQLLIQMLGKLRANGHGFFIILPDLPSRDPPYLPGLCYKKSRGLLSRVAES 133
            HLT+VD KF Q+L+QMLG +R  G   FIILPDLPS     LP LC+KKSRGL+SRV+ES
Sbjct: 68   HLTVVDPKFVQILLQMLGYIRTKGPCSFIILPDLPSSSD--LPSLCFKKSRGLISRVSES 125

Query: 134  NESGRWVFESTRLFSSREGEKIEEWSCPVNCLDTFTVSVEFVENVDRFIDIMDEISNGGF 193
            NES R+VF+STRLF S EGE+ ++ SC VN LD+  ++ EF+ NVDRF++ MD +S+G F
Sbjct: 126  NESERFVFDSTRLFGSGEGERAQDCSCSVNSLDSVVMAEEFLTNVDRFVETMDVLSDGAF 185

Query: 194  LRGEESELAGDWVEFDWLKAKGYYSIEAFIVNRLEVGLRLAWLNCNNGKKRGVKLKEKLN 253
            LRGEES+L  +WVE +WLKAKGYYS+EAF+ NRLEV +RLAWLN N+GK+RG+KLKEKLN
Sbjct: 186  LRGEESDLGSNWVELEWLKAKGYYSMEAFVANRLEVSMRLAWLNTNSGKRRGIKLKEKLN 245

Query: 254  AAGMAANVYWRKKGCVDWWMNLDDAMRRKVLTVILGKAAKSLTHEVLKEASNALEDGMWL 313
            AA  AAN YWRKK CVDWW NLD A  +K+ T + GK+AKS+ +E+L+EA+ A +  MWL
Sbjct: 246  AAAAAANSYWRKKACVDWWQNLDAATHKKIWTCLFGKSAKSVIYEILREANQAQQGEMWL 305

Query: 314  FNAGMKQSSRFYHSKSLQRTISTLSVDVECGLAISPASLSGIPASLATVFSGLFVLQDIT 373
            FN        F  ++  +   S +S    C + + P S+   P ++A+  SGL+VLQ+  
Sbjct: 306  FN--------FASARKGRTDTSAVSF---CDMILEPNSVPRKPITVASNLSGLYVLQEFA 354

Query: 374  TMVLSSQHNEYDIEKIFFSSLRFVSTTTDCLLRKLRGLLMVVSLDCTKLELFGEGNFKSS 433
            ++++  Q+    +  +FFSS+  ++T  DC+LRKLRG LMV+S+D  K EL  +   K S
Sbjct: 355  SLLILCQNGLVPVHSVFFSSMGTITTLVDCILRKLRGFLMVISIDSVKSELLDDNTHKCS 414

Query: 434  PNKS-KEKPSTIGRRKKCRACSTKRQNPLPKSALDELSLDKLPKDPEGALTDTEKVDLMG 492
            P+ S  +K  +  R++K +  + K+  P  KS  +     K  K  +  L   +  + + 
Sbjct: 415  PSSSSNQKLGSTNRKQKGKTRNMKKPTPEAKSDKNVNLSTKNGKKDQAKLEFNKSREAIE 474

Query: 493  SDKVPGISNGKDINRETSTSEMEMVVCHQEHARALVAGKGRTNARKTKTVKNKNKNCTYN 552
              KVP  S   + + E S + ME+V         LVA K     +K +  KNK+K CT  
Sbjct: 475  CKKVPTASTMIN-DPEASAATMEVV-------PGLVARK--GRTKKKRKEKNKSKKCT-- 522

Query: 553  NPVPVKDPKVSVLETSSSISLQDEVEKYDKLSAQNVSVDNSTCSNVLASNQSSCTSASVP 612
                               SL++  E        N SV NS+     +   SSCTSA+  
Sbjct: 523  -------------------SLENNGEV-------NKSVVNSSAIVKASKCDSSCTSANQH 556

Query: 613  AREGIATQSTQEDCVVNSVNSECRRFSNGRIDN-----QTQHFLQETTDSKVECNIISPD 667
             +E I  Q  +E    +     C R  +G   +       ++  +E + SK E ++IS D
Sbjct: 557  PQEYINAQIIEEHGSFS-----CERNRSGTCASVNGAANCEYSGEEESHSKAETHVISSD 611

Query: 668  MPARDLDNAFGNSISGINFQNSFHESETGAISVLPDKGIEALEIKKESAVTQDQRNESFF 727
            + +  +D A G S   +N Q S    +      +P++    L+ + ES     QR E+ +
Sbjct: 612  LSS--VDPAGGPSCENVNPQKSCCRGDRKEKLTMPNERSRTLD-EGESHRIHHQRREAGY 668

Query: 728  GTALKSSLECPSYEWPTIAPVYFPSISSHLLPATDRLHLDVGHNWHNHVRQPFVPTLHQA 787
            G A  SS E  SYEWP +AP+YF  +SSHL  ATDRLHLDVGHN H +VRQPFV T+  A
Sbjct: 669  GFASSSS-EFVSYEWPAVAPMYFSHVSSHLPTATDRLHLDVGHNLHPYVRQPFVSTVQHA 727

Query: 788  RNHPFDGGCNQILSQPLPMSLDWPPMVQNVSGIAPSVTCNYDSGFISSRQSGFQQNFATK 847
            RN   +G   Q+LS+P+PMSLDWPPMV +  G+  + TCNYDSG +              
Sbjct: 728  RNPSIEGSHKQVLSRPMPMSLDWPPMVHSNCGLTTAFTCNYDSGIL-------------- 773

Query: 848  GMQFNAKTSDDEGKCSGDFMDLPEPTTTQEQGDECDSHWLSEEELEVHTVSGIDYNQYFG 907
                               +D+PE     E G+EC+++W+ EE+ EVHTVSG+DYNQYFG
Sbjct: 774  -------------------VDIPEQKNKHELGNECENNWMLEEDFEVHTVSGVDYNQYFG 814

Query: 908  GGVMYWNTSDHPGTGFSRPPSLSSDDSSWAWHEADIKRAVDDMVAFSSSYSTNGLTSPTA 967
            GGVMYWN SDH GTGFSRPPSLSSDDSSWAWHEA++KR+VDDMVAFSSSYS NGL SPTA
Sbjct: 815  GGVMYWNPSDHLGTGFSRPPSLSSDDSSWAWHEAEMKRSVDDMVAFSSSYSANGLDSPTA 874

Query: 968  ASFCSPFDPLGPGHQAFSYVVPGNEVPGKVLHSSSTTTDVATEEEISGSFASLSGDVDSK 1027
            ASFCSPF PLGP +Q   YVVPGNE+  K+L +  TT + A EEE+SG+ ASLSGDV+  
Sbjct: 875  ASFCSPFHPLGPPNQPLGYVVPGNEISTKILQAPPTTIEGAGEEEVSGTLASLSGDVEGN 934

Query: 1028 ALDTLPCPILRPIIIPNLSRERSRSDFKRSHEHKSPCVPPSRREQPRIKRPPSPVVLCVP 1087
            + D+LP PILRPIIIPN+S+    S++KRS++ KSP VPP+RRE PRIKRPPSPVVLCVP
Sbjct: 935  SGDSLPYPILRPIIIPNMSK----SEYKRSYDTKSPNVPPTRREHPRIKRPPSPVVLCVP 990

Query: 1088 RAPRPPPPSPVSDSRKTRGFPTVRSGSSSPRHWGVRGWYHEGTTSEEGCVRMDGSEVVWP 1147
            RAPRPPPPSPVS+SR  RGFPTVRSGSSSPRHWG+RGW+H+G   EE      G+E+V P
Sbjct: 991  RAPRPPPPSPVSNSRARRGFPTVRSGSSSPRHWGMRGWFHDGVNWEEP----RGAEIVLP 1046

Query: 1148 SWRNKNLSAHPMIQPLSGALLQDHLIAISQLARDQEHPDVAFPLQPLEVQNCPTRKASLS 1207
             WRNK+L+  P+IQPL GALLQDHLIA+SQL RDQEHPDVAFPLQP E+ NCP +  SLS
Sbjct: 1047 -WRNKSLAVRPIIQPLPGALLQDHLIAMSQLGRDQEHPDVAFPLQPPELLNCPMQGESLS 1105

Query: 1208 LMHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSL 1267
            L+H +L++EIDSFCKQVAAEN ARKPYINWA+KRVTRSLQVLWPRSRTNIFGS+ATGLSL
Sbjct: 1106 LIHGILNDEIDSFCKQVAAENMARKPYINWAIKRVTRSLQVLWPRSRTNIFGSSATGLSL 1165

Query: 1268 PSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
            PSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVK+DSLKTVEN
Sbjct: 1166 PSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKTDSLKTVEN 1225

Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
            TAIPIIMLVVEVP DLI     S+QSPK+     T+  D++ +++MV  +DSA+     T
Sbjct: 1226 TAIPIIMLVVEVPCDLIC----SIQSPKDGPDCITVDQDSNGNTEMVGFEDSAAANSLPT 1281

Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
            ++ N+  A SVRLDISFK+PSHTGLQTT LVK+LTEQFPA+TPLALVLKQFLADR+LDQS
Sbjct: 1282 NTGNLAIAKSVRLDISFKTPSHTGLQTTQLVKDLTEQFPAATPLALVLKQFLADRTLDQS 1341

Query: 1448 YSGGLSSYCLMLLITRFLQHEHHLGRPINQ 1477
            YSGGLSSYCL+LLITRFLQHEHHLGR INQ
Sbjct: 1342 YSGGLSSYCLVLLITRFLQHEHHLGRSINQ 1371


>gi|255541176|ref|XP_002511652.1| nucleotidyltransferase, putative [Ricinus communis]
 gi|223548832|gb|EEF50321.1| nucleotidyltransferase, putative [Ricinus communis]
          Length = 1420

 Score = 1285 bits (3325), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 689/1120 (61%), Positives = 813/1120 (72%), Gaps = 103/1120 (9%)

Query: 371  DITTMVLSSQHNEYDIEKIFFSSLRFVSTTTDCLLRKLRGLLMVVSLDCTKLELFGEGNF 430
            DI  ++L  QH EYD+ KIFFS+LR VS+ +DC+LRKLRGL+MV+SLDCTKLEL GEGN 
Sbjct: 279  DIVALILPGQHREYDVSKIFFSTLRSVSSISDCILRKLRGLVMVISLDCTKLELLGEGNM 338

Query: 431  KSSPNKSKEKPSTIGRRKKCRACSTKRQNPLPKSALDELSLDKLPKDPEGALTDTEKVDL 490
            K   +K KEKP+T  RRKK +  + K+    P   ++E  ++K  K              
Sbjct: 339  KHLTSKQKEKPNTGSRRKKAKTHNMKKPESAPDIVVNEAYINKPLK-------------- 384

Query: 491  MGSDKVPGISNGKDINRETSTSEMEMVVCHQEHARALVAGKGRTNARKTKTVKNKNKN-- 548
                                           EH++ LV  KGRT ARK +  +NKNKN  
Sbjct: 385  -------------------------------EHSQGLVLAKGRTAARKNRRGRNKNKNRN 413

Query: 549  CTYNNPVPVKDPKVSVLETSSS-ISLQDEVEKY----DKLSAQNVSVDNSTCSNVLASNQ 603
             + N+PV +++ + SV E     +   DE        D L+ QNV  D+   S     N 
Sbjct: 414  SSLNDPVDIRNSERSVAEAPCVPVISSDEAAMLGRASDDLAIQNVFSDDLVESANFTLNT 473

Query: 604  SSCTSASVPAREGIATQSTQEDCV-----VNSVNSECRRFSNGRIDNQTQHFLQETTDSK 658
            S C   + P +EGI  +  Q   V       S+NSEC++ SN  I+++T     E  + K
Sbjct: 474  SFCGCVTEPRKEGIDAKRVQGRVVGCNGGTCSINSECKQTSNVMIEDRTISSRAEGVNFK 533

Query: 659  VECNIISPDMPARDLDNAFGNSISGINFQNSFHESETGAISVLPDKGIEALEIKKESAVT 718
            +E  +IS  +   +LD    N    I F+N     ET   S    + +  + +K+ S + 
Sbjct: 534  MEDKVISHVVQTPELDTVSSNE--DIKFRNE----ETKGKSNFSYRTVRNINVKEGSTLI 587

Query: 719  QDQRNESFFGTALKSSL-ECPSYEWPTIAPVYFPSISSHLLPATDRLHLDVGHNWHNHVR 777
            +++        A  ++L E  SYEWP++APVYFPSI+SHLLPA DRLHLDVG NWH+H+R
Sbjct: 588  KNK----ILNEARSTNLSEYISYEWPSLAPVYFPSITSHLLPAADRLHLDVGRNWHSHIR 643

Query: 778  QPFVPTLHQARNHPFDGGCNQILSQPLPMSLDWPPMVQNVSGIAPSVTCNYDSGFISSRQ 837
            QPFVPT+HQAR  P + G N+ LS+PLPMSLDWPPMV+++SG+APS+TCNYDSGFIS  Q
Sbjct: 644  QPFVPTVHQARTSPIESGYNRTLSRPLPMSLDWPPMVRSISGLAPSMTCNYDSGFISRLQ 703

Query: 838  SGFQQNFATKGMQFNAKTSDDEGKCSGDFMDLPEPTTTQEQGDECDSHWLSEEELEVHTV 897
            + F                                   +E   E +SH +SEEE+E+H V
Sbjct: 704  TAFH---------------------------------PKEPMAEYESHCISEEEMEMHAV 730

Query: 898  SGIDYNQYFGGGVMYWNTSDHPGTGFSRPPSLSSDDSSWAWHEADIKRAVDDMVAFSSSY 957
            SGIDYNQYFGGGVMYWN SD+PGTGFSRPPSLSSDDS+WAWHE D+ RAVDDMVAFSSSY
Sbjct: 731  SGIDYNQYFGGGVMYWNPSDYPGTGFSRPPSLSSDDSTWAWHE-DMNRAVDDMVAFSSSY 789

Query: 958  STNGLTSPTAASFCSPFDPLGPGHQAFSYVVPGNEVPGKVLHSSSTTTDVATEEEISGSF 1017
            STNGL SPTAASFCSPFDP+G GHQA  YVVPGNE+ GKVL SSST TD A  EE++GS 
Sbjct: 790  STNGLASPTAASFCSPFDPIGSGHQALGYVVPGNELTGKVLQSSSTVTDTAALEELTGSL 849

Query: 1018 ASLSGDVDSKALDTLPCPILRPIIIPNLSRERSRSDFKRSHEHKSPCVPPSRREQPRIKR 1077
            A++SGDV+ KA D+LP PIL PIIIPN+SRE+SRSDFKRSH+HKSPCVPPSRRE+PRIKR
Sbjct: 850  ANVSGDVEGKAGDSLPYPILPPIIIPNISREKSRSDFKRSHDHKSPCVPPSRRERPRIKR 909

Query: 1078 PPSPVVLCVPRAPRPPPPSPVSDSRKTRGFPTVRSGSSSPRHWGVRGWYHEGTTSEEGCV 1137
            PPSPVVLCVPRAP PPPPSPVS+SRK RGFPTVRSGSSSPRHW +RGWY E T SEE  +
Sbjct: 910  PPSPVVLCVPRAPHPPPPSPVSNSRKQRGFPTVRSGSSSPRHWSMRGWY-ERTNSEEAYM 968

Query: 1138 RMDGSEVVWPSWRNKNLSAHPMIQPLSGALLQDHLIAISQLARDQEHPDVAFPLQPLEVQ 1197
             MDG+EVVWPSWRNKNLS HPMIQPL G LLQDHLIA+SQLARDQEHPDV+FPLQP E+ 
Sbjct: 969  HMDGTEVVWPSWRNKNLSTHPMIQPLPGGLLQDHLIAMSQLARDQEHPDVSFPLQPPELH 1028

Query: 1198 NCPTRKASLSLMHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNI 1257
            NCP RKASLSLMHSLLH+EID FCK+VAAEN  RKP+INWAVKRVTRSLQVLWPRSRTN+
Sbjct: 1029 NCPARKASLSLMHSLLHDEIDFFCKKVAAENMDRKPFINWAVKRVTRSLQVLWPRSRTNV 1088

Query: 1258 FGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWV 1317
            +GSNATGLSLP+SDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWV
Sbjct: 1089 YGSNATGLSLPTSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWV 1148

Query: 1318 KSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALD 1377
            K+DSLKTVENTAIPIIMLVVEVP DLI SA S++QS K++    T +++N V+SD+V  +
Sbjct: 1149 KNDSLKTVENTAIPIIMLVVEVPSDLIISATSNIQSTKDEPTRMTAENENCVNSDIVISE 1208

Query: 1378 DSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQ 1437
            +S+SPKC   + D+ K   S+RLDISFKSPSHTGLQTT+LVKELTEQFPA+TPLALVLKQ
Sbjct: 1209 ESSSPKCLQVNHDSRKDVKSIRLDISFKSPSHTGLQTTELVKELTEQFPAATPLALVLKQ 1268

Query: 1438 FLADRSLDQSYSGGLSSYCLMLLITRFLQHEHHLGRPINQ 1477
            FLADRSLDQSYSGGLSSYCL+LLITRFLQHEHHLGRPINQ
Sbjct: 1269 FLADRSLDQSYSGGLSSYCLVLLITRFLQHEHHLGRPINQ 1308



 Score =  390 bits (1003), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 203/277 (73%), Positives = 226/277 (81%), Gaps = 5/277 (1%)

Query: 19  FTNSLMAQNQLIDSLTSHISLYHSHSLSSNPNPSSNPRSSILKWFASLTVHQRQAHLTIV 78
            TNSL+ QNQ +DSLT+HISLYHS SL  NP+P+ NPRSSILKWF+SLTVHQRQAHLT V
Sbjct: 7   LTNSLVTQNQFVDSLTAHISLYHSKSLPLNPSPNPNPRSSILKWFSSLTVHQRQAHLTTV 66

Query: 79  DSKFAQLLIQMLGKLRANGHGFFIILPDLPSRDPPYLPGLCYKKSRGLLSRVAESNESGR 138
           D KF QL+IQMLGKLR  GHG FIILPDLP  D   LP LCYKKSRGLLSR AESNES R
Sbjct: 67  DFKFVQLVIQMLGKLRTQGHGRFIILPDLPVSD---LPSLCYKKSRGLLSRDAESNESER 123

Query: 139 WVFESTRLFSSREGEKIEEWSCPVNCLDTFTVSVEFVENVDRFIDIMDEISNGGFLRGEE 198
            +FES RLF SREGE  E  SC + CLD+ TVS + + NVD+F++ MD +SNG FLRGE+
Sbjct: 124 LIFESIRLFGSREGE--ESTSCSIKCLDSLTVSEDLIGNVDKFVETMDRVSNGEFLRGED 181

Query: 199 SELAGDWVEFDWLKAKGYYSIEAFIVNRLEVGLRLAWLNCNNGKKRGVKLKEKLNAAGMA 258
           SEL  DWVE DWLKAKGYYSIEAF+ NRLEV LRLAWLNC NGKKRGVK+KEK+NAAG A
Sbjct: 182 SELGSDWVELDWLKAKGYYSIEAFVANRLEVALRLAWLNCGNGKKRGVKVKEKVNAAGAA 241

Query: 259 ANVYWRKKGCVDWWMNLDDAMRRKVLTVILGKAAKSL 295
           ANV+ RKKGCVDWW+NLD   RRK  TV+LGKAA SL
Sbjct: 242 ANVFLRKKGCVDWWLNLDAETRRKFFTVVLGKAANSL 278


>gi|356506428|ref|XP_003521985.1| PREDICTED: uncharacterized protein LOC100809291 [Glycine max]
          Length = 1436

 Score = 1178 bits (3047), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 658/1110 (59%), Positives = 788/1110 (70%), Gaps = 77/1110 (6%)

Query: 381  HNEYDIEKIFFSSLRFVSTTTDCLLRKLRGLLMVVSLDCTKLELFGEGNFKSSPNKSKEK 440
            ++EYDIE +FFSSL  V T +DC+LRK+RG LMV+SLDCTKLEL GE + KSS  K KEK
Sbjct: 282  NSEYDIENLFFSSLGSVCTISDCILRKMRGFLMVISLDCTKLELLGEEHDKSSSGKPKEK 341

Query: 441  PSTIGRRKKCRACSTKRQNPLPKSALDELSLDKLPKDPEGALTDTEKVDLMGSDKVPGIS 500
            PS   R+KK R  + KRQNP+ K+ +D++S +   KD +  + + +K DL+ S ++P + 
Sbjct: 342  PSVSNRKKKGRNRNNKRQNPVSKTCVDDISHENPLKDIDCKVDNKKKTDLVASSELPAVH 401

Query: 501  NGKDINRETSTSEMEMVVCHQEHARALVAGK--GRTNARKTKTVKNKNKNCTYNNPVPVK 558
             GK+I+ E  +S         +H + L  GK   RT +R  K  KNK+KN   +      
Sbjct: 402  MGKEISMEFPSS--------TDHTQGLDIGKIKVRTTSRSRKE-KNKSKNILIS--AGGD 450

Query: 559  DPKVSVLETSSSISLQDEVEKYDKL----SAQNVSVDNSTCSNVLASNQSSCTSASVPAR 614
              K S+   S+++  + EV   D+     + QNV  DN+  +++LASN S C+S S  +R
Sbjct: 451  SQKSSIHAASTTVISEGEVAICDRFLNSSTIQNVKNDNAIGNDILASNSSLCSSLSGLSR 510

Query: 615  EGIATQSTQ---EDCVV--NSVNSECRRFSNGRIDNQTQHFLQETTDSKVECNI-ISPDM 668
            E  +T+  +   ED     NS+  +C   S+ R   +T     +T    ++C    +P +
Sbjct: 511  ENSSTRKVEGKTEDLAESGNSLGPQCCLLSDER---KTLCSGLDTLTCDIDCTAATTPPV 567

Query: 669  PARDLDNAFGNSISGINFQNSFHESETGAISVLPDKGIEALEIKKESAVTQDQRNESFFG 728
            PA    + F  S       NS   ++    + +PDK I  +  K+   + +  R+   F 
Sbjct: 568  PAVKQGSFF--SKEDTCPLNSSCAAKADLKTTVPDKPIREVNAKEFGLLKE--RDRCLFE 623

Query: 729  TALKSSLECPSYEWPTIAPVYFPSISSHLLPATDRLHLDVGHNWHNHVRQPFVPTLHQAR 788
            +   +  +C  YEWP +  +YFPS +SHL PATDRLHLDVGHNWHNH   PFVPTL QAR
Sbjct: 624  SRNSAFSKCSPYEWPGLPSIYFPSFNSHLPPATDRLHLDVGHNWHNHFCHPFVPTLQQAR 683

Query: 789  NHPFDGGCNQILSQPLPMSLDWPPMVQNVSGIAPSVTCNYDSGFISSRQSGFQQNFATKG 848
            N P +GGCN ILS+P+PMS DWPP+ +   G+ PS   NYDSGFIS              
Sbjct: 684  NPPIEGGCNPILSRPIPMSFDWPPVFRG--GMTPSPNYNYDSGFIS-------------- 727

Query: 849  MQFNAKTSDDEGKCSGDFMDLPEPTTTQEQGDECDSHWLSEEELEVHTVSGIDYNQYFGG 908
                               DLP+ T T E  DE D+H +SEEE EVHTVSGIDYNQYFGG
Sbjct: 728  -------------------DLPDLTNTLELADEFDNHCVSEEEYEVHTVSGIDYNQYFGG 768

Query: 909  GVMYWNTSDHPGTGFSRPPSLSSDDSSWAWHEADIKRAVDDMVAFSSSYSTNGLTSPTAA 968
            GVMYWN SD+PG GFSRPPSLSSDDS WA  +AD+ R VDDMVAFSSSYSTNGLTSPTAA
Sbjct: 769  GVMYWNPSDYPGKGFSRPPSLSSDDSLWALRDADMNRTVDDMVAFSSSYSTNGLTSPTAA 828

Query: 969  SFCSPFDPLGPGHQAFSYVVPGNEVPGKVLHSSSTTTDVATEEEISGSFAS-LSGDVDSK 1027
            +FCSPFDP+G   Q   YV+ GNEVPGK+LHSSS T D A +E+ SGS  + L G+V+ K
Sbjct: 829  TFCSPFDPVGTATQTIGYVMSGNEVPGKMLHSSSVT-DAAVDEDTSGSLGNNLPGEVEGK 887

Query: 1028 ALDTLPCPILRPIIIPNLSRERSRSDFKRSHEHKSPCVPPSRREQPRIKRPPSPVVLCVP 1087
            A D+ P PILRPIIIPNLSRER         +HKSPCVPPSRREQPRIKRPPSPVVLCVP
Sbjct: 888  AGDSHPYPILRPIIIPNLSRER--------FDHKSPCVPPSRREQPRIKRPPSPVVLCVP 939

Query: 1088 RAPRPPPPSPVSDSRKTRGFPTVRSGSSSPRHWGVRGWYHEGTTSEEGCVRMDGSEVVWP 1147
            RAPRPPPPSPVSDSRK RGFPTVRSGSSSPRHWG+RGWYH+G+  EE C+RMDG+EVVWP
Sbjct: 940  RAPRPPPPSPVSDSRKHRGFPTVRSGSSSPRHWGMRGWYHDGSNFEEACLRMDGAEVVWP 999

Query: 1148 SWRNKNLSAHPMIQPLSGALLQDHLIAISQLARDQEHPDVAFPLQPLEVQNCPTRKASLS 1207
             WR+ NL+  P+IQPL  ALLQD LIA+SQ+ARDQEHPDV FPLQP ++Q+C  + ASL+
Sbjct: 1000 -WRSNNLAVRPLIQPLPAALLQDRLIALSQIARDQEHPDVTFPLQPPDLQSCSAQSASLT 1058

Query: 1208 LMHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSL 1267
            LMH +LH+EIDSFCKQVAAEN AR+PYINWAVKRVTR LQVLWPRSRTNIFGSNATG+SL
Sbjct: 1059 LMHGILHDEIDSFCKQVAAENMARRPYINWAVKRVTRFLQVLWPRSRTNIFGSNATGMSL 1118

Query: 1268 PSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
            P+SDVDLVV LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQ+WVK+DSLKTVEN
Sbjct: 1119 PTSDVDLVVGLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQDWVKNDSLKTVEN 1178

Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
            TAIPIIMLVVEVP D+I S A  +QS  E+   T  +H N   SD + L+DSA PK S  
Sbjct: 1179 TAIPIIMLVVEVPQDVITSLAPMIQSLNEEPHCTPGEHGNDNQSDSIRLEDSALPKGSQM 1238

Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
              D +K + SVRLDISFKSPSHTGLQTT++VKELT QFPA+TPLALVLKQFLADRSLDQS
Sbjct: 1239 KFDALK-SKSVRLDISFKSPSHTGLQTTEMVKELTAQFPAATPLALVLKQFLADRSLDQS 1297

Query: 1448 YSGGLSSYCLMLLITRFLQHEHHLGRPINQ 1477
            YSGGLSSYCL+LLI RFLQHEHHLGRPINQ
Sbjct: 1298 YSGGLSSYCLVLLIIRFLQHEHHLGRPINQ 1327



 Score =  368 bits (945), Expect = 1e-98,   Method: Compositional matrix adjust.
 Identities = 182/274 (66%), Positives = 218/274 (79%), Gaps = 3/274 (1%)

Query: 24  MAQNQLIDSLTSHISLYHSHSLSSNPNPSSNPRSSILKWFASLTVHQRQAHLTIVDSKFA 83
           MA +QLID+LTSHISLYHS S + NPNP+ NPRSSILKWF+SL++H RQAHLTIVD+ F 
Sbjct: 1   MAHHQLIDTLTSHISLYHSQSPNPNPNPNPNPRSSILKWFSSLSIHHRQAHLTIVDANFV 60

Query: 84  QLLIQMLGKLRANGHGFFIILPDLPSRDPPYLPGLCYKKSRGLLSRVAESNESGRWVFES 143
           Q+L+QML KLR++GHG FI+LPDLPSRD   LP LC+KKSRGLL+RVA+S+ +GR VFES
Sbjct: 61  QILLQMLAKLRSHGHGSFILLPDLPSRD--NLPTLCFKKSRGLLARVADSDAAGRAVFES 118

Query: 144 TRLFSSREGEKIEEWSCP-VNCLDTFTVSVEFVENVDRFIDIMDEISNGGFLRGEESELA 202
           +RLF SREGE+    + P    LD   ++  FV +VDRF++ MD IS GGFLRGEE+EL 
Sbjct: 119 SRLFDSREGEEAAIATLPSARRLDALALAEGFVGDVDRFVEAMDRISGGGFLRGEEAELG 178

Query: 203 GDWVEFDWLKAKGYYSIEAFIVNRLEVGLRLAWLNCNNGKKRGVKLKEKLNAAGMAANVY 262
            DWVE  WLK+KGYY IEAFI NR+EV +RLAWLNC  G+KRGVKLKEK+ AAG+  NV+
Sbjct: 179 EDWVELHWLKSKGYYGIEAFIANRIEVSMRLAWLNCCGGRKRGVKLKEKMGAAGVGVNVF 238

Query: 263 WRKKGCVDWWMNLDDAMRRKVLTVILGKAAKSLT 296
           WRKKGCVDWW NLD   RRKV++  L KAAK L 
Sbjct: 239 WRKKGCVDWWGNLDAGTRRKVISTFLMKAAKPLV 272


>gi|297810181|ref|XP_002872974.1| hypothetical protein ARALYDRAFT_912246 [Arabidopsis lyrata subsp.
            lyrata]
 gi|297318811|gb|EFH49233.1| hypothetical protein ARALYDRAFT_912246 [Arabidopsis lyrata subsp.
            lyrata]
          Length = 840

 Score = 1031 bits (2665), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 528/779 (67%), Positives = 615/779 (78%), Gaps = 51/779 (6%)

Query: 701  LPDKGIEALEIKKESAVTQDQRNESFFGTALKSSLECPSYEWPTIAPVYFPSISSHLLPA 760
            +P+     LE + ES     QR E+ +G A  SS E  SYEWP +AP+YF  ++SHL  A
Sbjct: 1    MPNGRSRTLE-EGESHRIHHQRREAGYGIASSSS-EFVSYEWPAVAPMYFSHVNSHLPTA 58

Query: 761  TDRLHLDVGHNWHNHVRQPFVPTLHQARNHPFDGGCNQILSQPLPMSLDWPPMVQNVSGI 820
            TDRLHLDVGHN H +VRQPFV T+  ARN   +G   Q+LS+P+PMSLDWPPMV +  G+
Sbjct: 59   TDRLHLDVGHNLHAYVRQPFVSTVQHARNPSIEGSHKQVLSRPMPMSLDWPPMVHSNCGL 118

Query: 821  APSVTCNYDSGFISSRQSGFQQNFATKGMQFNAKTSDDEGKCSGDFMDLPEPTTTQEQGD 880
              +  CNYDSG +                                 +D+PE     E G+
Sbjct: 119  TTAFACNYDSGIL---------------------------------VDIPEQKNKPELGN 145

Query: 881  ECDSHWLSEEELEVHTVSGIDYNQYFGGGVMYWNTSDHPGTGFSRPPSLSSDDSSWAWHE 940
            EC+++W+ EE+ EVHTVSG+DYNQYFGGGVMYWN SDH GTGFSRPPSLSSDDSSWAWHE
Sbjct: 146  ECENNWMLEEDFEVHTVSGVDYNQYFGGGVMYWNPSDHLGTGFSRPPSLSSDDSSWAWHE 205

Query: 941  ADIKRAVDDMVAFSSSYSTNGLTSPTAASFCSPFDPLGPGHQAFSYVVPGNEVPGKVLHS 1000
            A++KR+VDDMVAFSSSYS NGL SPTAASFCSPF PLGP +Q   YVVPGNE+  K+L +
Sbjct: 206  AEMKRSVDDMVAFSSSYSANGLDSPTAASFCSPFHPLGPANQPLGYVVPGNEISTKILQA 265

Query: 1001 SSTTTDVATEEEISGSFASLSGDVDSKALDTLPCPILRPIIIPNLSRERSRSDFKRSHEH 1060
              TT++VA EEE+SG+ ASLSGDV+  + D+LP PILRPIIIPN+    S+SD+KRS++ 
Sbjct: 266  PPTTSEVAGEEEVSGTLASLSGDVEGNSGDSLPYPILRPIIIPNM----SKSDYKRSYDT 321

Query: 1061 KSPCVPPSRREQPRIKRPPSPVVLCVPRAPRPPPPSPVSDSRKTRGFPTVRSGSSSPRHW 1120
            KSP VPP+RRE PRIKRPPSPVVLCVPRAPR PPPSPVS+SR  RGFPTVRSGSSSPRHW
Sbjct: 322  KSPNVPPTRREHPRIKRPPSPVVLCVPRAPRLPPPSPVSNSRARRGFPTVRSGSSSPRHW 381

Query: 1121 GVRGWYHEGTTSEEGCVRMDGSEVVWPSWRNKNLSAHPMIQPLSGALLQDHLIAISQLAR 1180
            G+RGW+H+G   EE C    G+EVV P WRNK+L+  P+IQPL GALLQDHLIA+SQL R
Sbjct: 382  GMRGWFHDGVNWEEPC----GAEVVLP-WRNKSLAVRPIIQPLPGALLQDHLIAMSQLGR 436

Query: 1181 DQEHPDVAFPLQPLEVQNCPTRKASLSLMHSLLHEEIDSFCKQVAAENTARKPYINWAVK 1240
            DQEHPDVAFPLQP E+ NCP ++ SLSL+H LL++EIDSFCKQVAAEN ARKPYI+WA+K
Sbjct: 437  DQEHPDVAFPLQPPELLNCPMQE-SLSLIHGLLNDEIDSFCKQVAAENMARKPYISWAIK 495

Query: 1241 RVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIK 1300
            RVTRSLQVLWPRSRTNIFGS+ATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIK
Sbjct: 496  RVTRSLQVLWPRSRTNIFGSSATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIK 555

Query: 1301 ETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKE--DA 1358
            ETCLQHAARYLANQEWVK+DSLKTVENTAIPIIMLVVEVP DL+     S+QSPK+  D 
Sbjct: 556  ETCLQHAARYLANQEWVKTDSLKTVENTAIPIIMLVVEVPCDLVC----SIQSPKDGPDC 611

Query: 1359 AHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLV 1418
               T+  D++ +++MV  +DSA+     T++ N+  A SVRLDISFK+PSHTGLQTT LV
Sbjct: 612  ISITVDQDSNGNTEMVGFEDSAAANSLPTNTGNLANAKSVRLDISFKTPSHTGLQTTQLV 671

Query: 1419 KELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQHEHHLGRPINQ 1477
            K+LT+QFPA+TPLALVLKQFLADR+LDQSYSGGLSSYCL+LLITRFLQHEHHLG  INQ
Sbjct: 672  KDLTDQFPAATPLALVLKQFLADRTLDQSYSGGLSSYCLVLLITRFLQHEHHLGLSINQ 730


>gi|19699089|gb|AAL90911.1| AT4g00060/F6N15_10 [Arabidopsis thaliana]
 gi|25090419|gb|AAN72296.1| At4g00060/F6N15_10 [Arabidopsis thaliana]
          Length = 839

 Score =  999 bits (2584), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 526/766 (68%), Positives = 608/766 (79%), Gaps = 47/766 (6%)

Query: 712  KKESAVTQDQRNESFFGTALKSSLECPSYEWPTIAPVYFPSISSHLLPATDRLHLDVGHN 771
            + ES     QR E+ +G A  SS E  SYEWP +AP+YF  +SSHL  ATDRLHLDVGHN
Sbjct: 11   EGESHRIHHQRREAGYGFASSSS-EFVSYEWPAVAPMYFSHVSSHLPTATDRLHLDVGHN 69

Query: 772  WHNHVRQPFVPTLHQARNHPFDGGCNQILSQPLPMSLDWPPMVQNVSGIAPSVTCNYDSG 831
             H +VRQPFV T+  ARN   +G   Q+LS+P+PMSLDWPPMV +  G+  + TCNYDSG
Sbjct: 70   LHPYVRQPFVSTVQHARNPSIEGSHKQVLSRPMPMSLDWPPMVHSNCGLTTAFTCNYDSG 129

Query: 832  FISSRQSGFQQNFATKGMQFNAKTSDDEGKCSGDFMDLPEPTTTQEQGDECDSHWLSEEE 891
             +                                 +D+PE     E G+EC+++W+ EE+
Sbjct: 130  IL---------------------------------VDIPEQKNKHELGNECENNWMLEED 156

Query: 892  LEVHTVSGIDYNQYFGGGVMYWNTSDHPGTGFSRPPSLSSDDSSWAWHEADIKRAVDDMV 951
             EVHTVSG+DYNQYFGGGVMYWN SDH GTGFSRPPSLSSDDSSWAWHEA++KR+VDDMV
Sbjct: 157  FEVHTVSGVDYNQYFGGGVMYWNPSDHLGTGFSRPPSLSSDDSSWAWHEAEMKRSVDDMV 216

Query: 952  AFSSSYSTNGLTSPTAASFCSPFDPLGPGHQAFSYVVPGNEVPGKVLHSSSTTTDVATEE 1011
            AFSSSYS NGL SPTAASFCSPF PLGP +Q   YVVPGNE+  K+L +  TT + A EE
Sbjct: 217  AFSSSYSANGLDSPTAASFCSPFHPLGPPNQPLGYVVPGNEISTKILQAPPTTIEGAGEE 276

Query: 1012 EISGSFASLSGDVDSKALDTLPCPILRPIIIPNLSRERSRSDFKRSHEHKSPCVPPSRRE 1071
            E+SG+ ASLSGDV+  + D+LP PILRPIIIPN+S+    S++KRS++ KSP VPP+RRE
Sbjct: 277  EVSGTLASLSGDVEGNSGDSLPYPILRPIIIPNMSK----SEYKRSYDTKSPNVPPTRRE 332

Query: 1072 QPRIKRPPSPVVLCVPRAPRPPPPSPVSDSRKTRGFPTVRSGSSSPRHWGVRGWYHEGTT 1131
             PRIKRPPSPVVLCVPRAPRPPPPSPVS+SR  RGFPTVRSGSSSPRHWG+RGW+H+G  
Sbjct: 333  HPRIKRPPSPVVLCVPRAPRPPPPSPVSNSRARRGFPTVRSGSSSPRHWGMRGWFHDGVN 392

Query: 1132 SEEGCVRMDGSEVVWPSWRNKNLSAHPMIQPLSGALLQDHLIAISQLARDQEHPDVAFPL 1191
             EE      G+E+V P WRNK+L+  P+IQPL GALLQDHLIA+SQL RDQEHPDVAFPL
Sbjct: 393  WEEP----RGAEIVLP-WRNKSLAVRPIIQPLPGALLQDHLIAMSQLGRDQEHPDVAFPL 447

Query: 1192 QPLEVQNCPTRKASLSLMHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWP 1251
            QP E+ NCP +  SLSL+H +L++EIDSFCKQVAAEN ARKPYINWA+KRVTRSLQVLWP
Sbjct: 448  QPPELLNCPMQGESLSLIHGILNDEIDSFCKQVAAENMARKPYINWAIKRVTRSLQVLWP 507

Query: 1252 RSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYL 1311
            RSRTNIFGS+ATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYL
Sbjct: 508  RSRTNIFGSSATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYL 567

Query: 1312 ANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHS 1371
            ANQEWVK+DSLKTVENTAIPIIMLVVEVP DLI     S+QSPK+     T+  D++ ++
Sbjct: 568  ANQEWVKTDSLKTVENTAIPIIMLVVEVPCDLIC----SIQSPKDGPDCITVDQDSNGNT 623

Query: 1372 DMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPL 1431
            +MV  +DSA+     T++ N+  A SVRLDISFK+PSHTGLQTT LVK+LTEQFPA+TPL
Sbjct: 624  EMVGFEDSAAANSLPTNTGNLAIAKSVRLDISFKTPSHTGLQTTQLVKDLTEQFPAATPL 683

Query: 1432 ALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQHEHHLGRPINQ 1477
            ALVLKQFLADR+LDQSYSGGLSSYCL+LLITRFLQHEHHLGR INQ
Sbjct: 684  ALVLKQFLADRTLDQSYSGGLSSYCLVLLITRFLQHEHHLGRSINQ 729


>gi|3193327|gb|AAC19309.1| F6N15.10 gene product [Arabidopsis thaliana]
 gi|7267093|emb|CAB80764.1| hypothetical protein [Arabidopsis thaliana]
          Length = 1260

 Score =  854 bits (2206), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 551/1135 (48%), Positives = 692/1135 (60%), Gaps = 161/1135 (14%)

Query: 392  SSLRFVSTTTDCLLRKLRGLLMVVSLDCTKLELFGEGNFKSSPNKS-KEKPSTIGRRKKC 450
            SS+  ++T  DC+LRKLRG LMV+S+D  K EL  +   K SP+ S  +K  +  R++K 
Sbjct: 218  SSMGTITTLVDCILRKLRGFLMVISIDSVKSELLDDNTHKCSPSSSSNQKLGSTNRKQKG 277

Query: 451  RACSTKRQNPLPKSALDELSLDKLPKDPEGALTDTEKVDLMGSDKVPGISNGKDINRETS 510
            +  + K+  P  KS  +     K  K  +  L   +  + +   KVP  S   + + E S
Sbjct: 278  KTRNMKKPTPEAKSDKNVNLSTKNGKKDQAKLEFNKSREAIECKKVPTASTMIN-DPEAS 336

Query: 511  TSEMEMVVCHQEHARALVAGKGRTNARKTKTVKNKNKNCTYNNPVPVKDPKVSVLETSSS 570
             + ME+V         LVA K     +K +  KNK+K CT                    
Sbjct: 337  AATMEVV-------PGLVARK--GRTKKKRKEKNKSKKCT-------------------- 367

Query: 571  ISLQDEVEKYDKLSAQNVSVDNSTCSNVLASNQSSCTSASVPAREGIATQSTQEDCVVNS 630
             SL++  E        N SV NS+     +   SSCTSA+   +E I  Q  +E    + 
Sbjct: 368  -SLENNGEV-------NKSVVNSSAIVKASKCDSSCTSANQHPQEYINAQIIEEHGSFS- 418

Query: 631  VNSECRRFSNGRIDN-----QTQHFLQETTDSKVECNIISPDMPARDLDNAFGNSISGIN 685
                C R  +G   +       ++  +E + SK E ++IS D+ +  +D A G S   +N
Sbjct: 419  ----CERNRSGTCASVNGAANCEYSGEEESHSKAETHVISSDLSS--VDPAGGPSCENVN 472

Query: 686  FQNSFHESETGAISVLPDKGIEALEIKKESAVTQDQRNESFFGTALKSSLECPSYEWPTI 745
             Q S    +      +P++    L+ + ES     QR E+ +G A  SS E  SYEWP +
Sbjct: 473  PQKSCCRGDRKEKLTMPNERSRTLD-EGESHRIHHQRREAGYGFASSSS-EFVSYEWPAV 530

Query: 746  APVYFPSISSHLLPATDRLHLDVGHNWHNHVRQPFVPTLHQARNHPFDGGCNQILSQPLP 805
            AP+YF  +SSHL  ATDRLHLDVGHN H +VRQPFV T+  ARN   +G   Q+LS+P+P
Sbjct: 531  APMYFSHVSSHLPTATDRLHLDVGHNLHPYVRQPFVSTVQHARNPSIEGSHKQVLSRPMP 590

Query: 806  MSLDWPPMVQNVSGIAPSVTCNYDSGFISSRQSGFQQNFATKGMQFNAKTSDDEGKCSGD 865
            MSLDWPPMV +  G+  + TCNYDSG +                                
Sbjct: 591  MSLDWPPMVHSNCGLTTAFTCNYDSGIL-------------------------------- 618

Query: 866  FMDLPEPTTTQEQGDECDSHWLSEEELEVHTVSGIDYNQYFGGGVMYWNTSDHPGTGFSR 925
             +D+PE     E G+EC+++W+ EE+ EVHTVSG+DYNQYFGGGVMYWN SDH GTGFSR
Sbjct: 619  -VDIPEQKNKHELGNECENNWMLEEDFEVHTVSGVDYNQYFGGGVMYWNPSDHLGTGFSR 677

Query: 926  PPSLSSDDSSWAWHEADIKRAVDDMVAFSSSYSTNGLTSPTAASFCSPFDPLGPGHQAFS 985
            PPSLSSDDSSWAWHEA++KR+VDDMVAFSSSYS NGL SPTAASFCSPF PLGP +Q   
Sbjct: 678  PPSLSSDDSSWAWHEAEMKRSVDDMVAFSSSYSANGLDSPTAASFCSPFHPLGPPNQPLG 737

Query: 986  YVVPGNEVPGKVLHSSSTTTDVATEEEISGSFASLSGDVDSKALDTLPCPILRPIIIPNL 1045
            YVVPGNE+  K+L +  TT + A EEE+SG+ ASLSGDV+  + D+LP PILRPIIIPN+
Sbjct: 738  YVVPGNEISTKILQAPPTTIEGAGEEEVSGTLASLSGDVEGNSGDSLPYPILRPIIIPNM 797

Query: 1046 SRERSRSDFKRSHEHKSPCVPPSRREQPRIKRPPSPVVLCVPRAPRPPPPSPVSDSRKTR 1105
                S+S++KRS++ KSP VPP+RRE PRIKRPPSPVVLCVPRAPRPPPPSPVS+SR  R
Sbjct: 798  ----SKSEYKRSYDTKSPNVPPTRREHPRIKRPPSPVVLCVPRAPRPPPPSPVSNSRARR 853

Query: 1106 GFPTVRSGSSSPRHWGVRGWYHEGTTSEEGCVRMDGSEVVWPSWRNKNLSAHPMIQPLSG 1165
            GFPTVRSGSSSPRHWG+RGW+H+G   EE      G+E+V P WRNK+L+  P+IQPL G
Sbjct: 854  GFPTVRSGSSSPRHWGMRGWFHDGVNWEEP----RGAEIVLP-WRNKSLAVRPIIQPLPG 908

Query: 1166 ALLQDHLIAISQLARDQEH--------PDVAFPLQPLEVQNCPTRKASLSLMHSLLHEEI 1217
            ALLQDHLIA+SQL RDQEH        PDVAFPLQP E+ NCP +  SLSL+H   +  I
Sbjct: 909  ALLQDHLIAMSQLGRDQEHVSNVFDIYPDVAFPLQPPELLNCPMQGESLSLIHE-NYMSI 967

Query: 1218 DSFC------------------KQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFG 1259
               C                   +VAAEN ARKPYINWA+KRVTRSLQVLWPRSRTNIFG
Sbjct: 968  FEVCIYFSWDGVSSNPLNYILLAKVAAENMARKPYINWAIKRVTRSLQVLWPRSRTNIFG 1027

Query: 1260 SNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKS 1319
            S+ATGLSLPSSDVDLVVCLPPVRNL  + +  I+          L      + NQ+ V  
Sbjct: 1028 SSATGLSLPSSDVDLVVCLPPVRNL--VSKPKIIPHLVLKIYISLLLKWVVVYNQDSVLK 1085

Query: 1320 DSLKTV--ENTAIPIIM-------------------------------LVVEVPHDLIAS 1346
              +  +   NT + I +                               LVVEVP DLI  
Sbjct: 1086 FYIYFILMMNTCLYIFVLGASLIIPYMVLFFLICGCSGHVIYEIPIIMLVVEVPCDLIC- 1144

Query: 1347 AASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKS 1406
               S+QSPK+     T+  D++ +++MV  +DSA+     T++ N+  A SVRLDISFK+
Sbjct: 1145 ---SIQSPKDGPDCITVDQDSNGNTEMVGFEDSAAANSLPTNTGNLAIAKSVRLDISFKT 1201

Query: 1407 PSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLI 1461
            PSHTGLQTT LVK+LTEQFPA+TPLALVLKQFLADR+LDQSYSGGLSSYCL+  I
Sbjct: 1202 PSHTGLQTTQLVKDLTEQFPAATPLALVLKQFLADRTLDQSYSGGLSSYCLVRCI 1256



 Score =  249 bits (637), Expect = 7e-63,   Method: Compositional matrix adjust.
 Identities = 140/232 (60%), Positives = 179/232 (77%), Gaps = 5/232 (2%)

Query: 16  AEYFTN-SLMAQNQLIDSLTSHISLYHSHSLSSN-PNPSSNPRSSILKWFASLTVHQRQA 73
           +++F++ S MAQNQLIDSLTSHISLYHSHS SS+  N   NPRS+IL+WF+SL+VHQR +
Sbjct: 8   SKFFSSASSMAQNQLIDSLTSHISLYHSHSSSSSMANTIPNPRSAILRWFSSLSVHQRLS 67

Query: 74  HLTIVDSKFAQLLIQMLGKLRANGHGFFIILPDLPSRDPPYLPGLCYKKSRGLLSRVAES 133
           HLT+VD KF Q+L+QMLG +R  G   FIILPDLPS     LP LC+KKSRGL+SRV+ES
Sbjct: 68  HLTVVDPKFVQILLQMLGYIRTKGPCSFIILPDLPSSSD--LPSLCFKKSRGLISRVSES 125

Query: 134 NESGRWVFESTRLFSSREGEKIEEWSCPVNCLDTFTVSVEFVENVDRFIDIMDEISNGGF 193
           NES R+VF+STRLF S EGE+ ++ SC VN LD+  ++ EF+ NVDRF++ MD +S+G F
Sbjct: 126 NESERFVFDSTRLFGSGEGERAQDCSCSVNSLDSVVMAEEFLTNVDRFVETMDVLSDGAF 185

Query: 194 LRGEESELAGDWVEFDWLKAKGYYSIEAFIVNRLEVGLRLAWLNCNNGKKRG 245
           LRGEES+L  +WVE +WLKAKGYYS+EAF+ N   +G     ++C   K RG
Sbjct: 186 LRGEESDLGSNWVELEWLKAKGYYSMEAFVANS-SMGTITTLVDCILRKLRG 236


>gi|218198719|gb|EEC81146.1| hypothetical protein OsI_24054 [Oryza sativa Indica Group]
          Length = 1254

 Score =  772 bits (1993), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 423/752 (56%), Positives = 522/752 (69%), Gaps = 14/752 (1%)

Query: 732  KSSLECPSYEWPTIAPVYFPSISSHLLPAT-DRLHLDVGHNWHNHVRQPFVPTLHQARNH 790
            +++L   SYEWP++AP +F S      PAT DRLHLDVG+ W     QPF+P  HQ RN 
Sbjct: 401  RNTLGGTSYEWPSVAPHHFVSPEMQQRPATTDRLHLDVGYRWPAQFEQPFLPPNHQMRNP 460

Query: 791  PFDGGCNQILSQ-PLPMSLDWPPMVQNVSGIAPSVTCNYDSGFISSRQSGFQQNFATKGM 849
            P + GCNQ+LS   +P+S DWPP+ +    +  +    YD  +    QS     F    +
Sbjct: 461  PVEAGCNQVLSSLSVPLSFDWPPVFRGYGKLNQNNALGYDPLYTPQMQSSAWSGFHAPLI 520

Query: 850  QFNAKTSDDEGKCSGDFMDLPEPTTTQEQGDECDSHWLSEEELEVHTVSGIDYNQYFGGG 909
            Q  +  ++ + K  GD     +P    + GD+ +S+W SEEE +   +SG D NQYFGGG
Sbjct: 521  QRGSICNEKDRKYFGD----SDPRNKSDVGDDTESYWFSEEESDGRGLSGRDINQYFGGG 576

Query: 910  VMYWNTSDHPGTGFSRPPSLSSDDSSWAWHEADIKRAVDDM-VAFSSSYSTNGLTSPTAA 968
            VMYW+ ++H GTGFSRPPSLSSDDS+WAWHEAD+ + +DD+ V    +Y+ N ++SP + 
Sbjct: 577  VMYWSPAEHAGTGFSRPPSLSSDDSAWAWHEADVSQVLDDIPVGIPCTYNANSVSSPPST 636

Query: 969  SFCSPFDPLGPGHQAFSYVVPGNEVPGKVLHSSSTTTDVATEEEISGSFASLSGDVDSKA 1028
            S CS  +   P  Q+  + +  N +  + L S S+  D   ++ IS S +   G    K 
Sbjct: 637  SVCSQNESPDPLPQSVGHSMTANGMNSESLQSPSSMQDSPEDKTISVSKSVSCGSEVIKG 696

Query: 1029 LDTLPCPILRPIIIPNLSRERSRSDFKRSHEHKSPCVPPSRREQPRIKRPPSPVVLCVPR 1088
             DTLP  +LRPI++P +SR  SRS+FK  H+H+SPCVP +RR+ P ++RPPSPVVL VPR
Sbjct: 697  -DTLPYAMLRPIVVP-ISRRPSRSEFKGGHDHRSPCVPSNRRDIPLVRRPPSPVVLSVPR 754

Query: 1089 APRPPPPSPVSDSRKTRGFPTVRSGSSSPRHWGVRGWYHEGTTSEEGCVRMDGSEVVWPS 1148
             PRPPPPSPV +SRK RGFP VRSGSSSPRHWG+R  + E          +DG EVVWPS
Sbjct: 755  VPRPPPPSPVGESRK-RGFPIVRSGSSSPRHWGMRSLFTEDKILNRAQFCLDGPEVVWPS 813

Query: 1149 WRNKNLSAHPMIQPLSGALLQDHLIAISQLARDQEHPDVAFPLQPLEVQNCPTRKASLSL 1208
            W NK   A  + Q +   +LQDHL+ ISQL+RDQ HPDVA PLQP ++ NC + KASLSL
Sbjct: 814  WGNKGSPAGTLGQSIEDTVLQDHLVKISQLSRDQ-HPDVALPLQPPDMLNCSSSKASLSL 872

Query: 1209 MHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLP 1268
            MH+ LHEEID FCKQV+AEN  RKPYINWAVKRVTR LQVLWPRSRTN+FGSNATGL+LP
Sbjct: 873  MHNALHEEIDQFCKQVSAENLVRKPYINWAVKRVTRCLQVLWPRSRTNLFGSNATGLALP 932

Query: 1269 SSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENT 1328
            +SDVDLV+ LPPVRNLEPIKEAGILEGRNGIKETCLQHAAR LANQ+WV++DSLKTVENT
Sbjct: 933  TSDVDLVISLPPVRNLEPIKEAGILEGRNGIKETCLQHAARCLANQDWVRTDSLKTVENT 992

Query: 1329 AIPIIMLVVEVPHDL-IASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSAS--PKCS 1385
            AIP+IMLV +VP D  ++S  SSV     +     +  D    S         S  P CS
Sbjct: 993  AIPVIMLVAQVPCDTNLSSEYSSVLDSSHEQLSINVLGDQGSPSRSDNSSSEGSSMPMCS 1052

Query: 1386 HTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLD 1445
              + D+     S+RLDISFKSPSHTGLQTT+LV ELT+QFPA+ PLAL+LK+FLADRSLD
Sbjct: 1053 RINKDDGNVVQSIRLDISFKSPSHTGLQTTELVCELTQQFPAAVPLALILKKFLADRSLD 1112

Query: 1446 QSYSGGLSSYCLMLLITRFLQHEHHLGRPINQ 1477
              YSGGLSSYCL+LLI RFLQHEHHLGRPINQ
Sbjct: 1113 HPYSGGLSSYCLVLLIVRFLQHEHHLGRPINQ 1144



 Score =  122 bits (306), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 82/247 (33%), Positives = 125/247 (50%), Gaps = 23/247 (9%)

Query: 207 EFDWLKAKGYYSIEAFIVNRLEVGLRLAWLNC----------NNGKKRGVKLKEKLNAAG 256
           E  WLKAKGYY IE F+ N +E+ LR++W                 + G  +KEK   AG
Sbjct: 8   ELPWLKAKGYYVIEEFVANWVEIALRMSWAAAGGGGGGGGGGRKAVRVGKGVKEK---AG 64

Query: 257 MAANVYWRKKGCVDWWMNLDDAMRRKVLTVILGKAAKSLTHEVLKEASNALEDGMWLFNA 316
           +AAN +WR+KG VDWWM L+  +R +++    GK   +L +E++ E +       + F  
Sbjct: 65  LAANAFWREKGYVDWWMRLEPRVRARIMGAFFGKGTTALANEII-EGTYIASSEKFSFCL 123

Query: 317 GMKQS--SRFYHSKSLQRTISTLSVDVECGLAISPASLSGIPASL-ATVFSGLFVLQDIT 373
           G  +S  +  Y+  + Q   S    +  C L + P++LS     +       L ++Q+I 
Sbjct: 124 GEPESFVAETYYESTRQ---SFFRRNRPCCLDV-PSTLSCKKNPIFVKELQRLMLVQEI- 178

Query: 374 TMVLSSQHNEYDIEKIFFSSLRFVSTTTDCLLRKLRGLLMVVSLDCTKLELFGEGNFKSS 433
            + L S  +    + IFF+ L    T  D +L KLRG+LMVVS +   LEL G+   K++
Sbjct: 179 -VYLKSNISNRSGDAIFFTKLMSAGTVADHILMKLRGILMVVSTESINLELIGDEAPKAA 237

Query: 434 PNKSKEK 440
             K  EK
Sbjct: 238 KKKDVEK 244


>gi|222636058|gb|EEE66190.1| hypothetical protein OsJ_22308 [Oryza sativa Japonica Group]
          Length = 1486

 Score =  772 bits (1993), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 423/752 (56%), Positives = 522/752 (69%), Gaps = 14/752 (1%)

Query: 732  KSSLECPSYEWPTIAPVYFPSISSHLLPAT-DRLHLDVGHNWHNHVRQPFVPTLHQARNH 790
            +++L   SYEWP++AP +F S      PAT DRLHLDVG+ W     QPF+P  HQ RN 
Sbjct: 633  RNTLGGTSYEWPSVAPHHFVSPEMQQRPATTDRLHLDVGYRWPAQFEQPFLPPNHQMRNP 692

Query: 791  PFDGGCNQILSQ-PLPMSLDWPPMVQNVSGIAPSVTCNYDSGFISSRQSGFQQNFATKGM 849
            P + GCNQ+LS   +P+S DWPP+ +    +  +    YD  +    QS     F    +
Sbjct: 693  PVEAGCNQVLSSLSVPLSFDWPPVFRGYGKLNQNNALGYDPLYTPQMQSSAWSGFHAPLI 752

Query: 850  QFNAKTSDDEGKCSGDFMDLPEPTTTQEQGDECDSHWLSEEELEVHTVSGIDYNQYFGGG 909
            Q  +  ++ + K  GD     +P    + GD+ +S+W SEEE +   +SG D NQYFGGG
Sbjct: 753  QRGSICNEKDRKYFGD----SDPRNKSDVGDDTESYWFSEEESDGRGLSGRDINQYFGGG 808

Query: 910  VMYWNTSDHPGTGFSRPPSLSSDDSSWAWHEADIKRAVDDM-VAFSSSYSTNGLTSPTAA 968
            VMYW+ ++H GTGFSRPPSLSSDDS+WAWHEAD+ + +DD+ V    +Y+ N ++SP + 
Sbjct: 809  VMYWSPAEHAGTGFSRPPSLSSDDSAWAWHEADVSQVLDDIPVGIPCTYNANSVSSPPST 868

Query: 969  SFCSPFDPLGPGHQAFSYVVPGNEVPGKVLHSSSTTTDVATEEEISGSFASLSGDVDSKA 1028
            S CS  +   P  Q+  + +  N +  + L S S+  D   ++ IS S +   G    K 
Sbjct: 869  SVCSQNESPDPLPQSVGHSMTANGMNSESLQSPSSMQDSPEDKTISVSKSVSCGSEVIKG 928

Query: 1029 LDTLPCPILRPIIIPNLSRERSRSDFKRSHEHKSPCVPPSRREQPRIKRPPSPVVLCVPR 1088
             DTLP  +LRPI++P +SR  SRS+FK  H+H+SPCVP +RR+ P ++RPPSPVVL VPR
Sbjct: 929  -DTLPYAMLRPIVVP-ISRRPSRSEFKGGHDHRSPCVPSNRRDIPLVRRPPSPVVLSVPR 986

Query: 1089 APRPPPPSPVSDSRKTRGFPTVRSGSSSPRHWGVRGWYHEGTTSEEGCVRMDGSEVVWPS 1148
             PRPPPPSPV +SRK RGFP VRSGSSSPRHWG+R  + E          +DG EVVWPS
Sbjct: 987  VPRPPPPSPVGESRK-RGFPIVRSGSSSPRHWGMRSLFTEDKILNRAQFCLDGPEVVWPS 1045

Query: 1149 WRNKNLSAHPMIQPLSGALLQDHLIAISQLARDQEHPDVAFPLQPLEVQNCPTRKASLSL 1208
            W NK   A  + Q +   +LQDHL+ ISQL+RDQ HPDVA PLQP ++ NC + KASLSL
Sbjct: 1046 WGNKGSPAGTLGQSIEDTVLQDHLVKISQLSRDQ-HPDVALPLQPPDMLNCSSSKASLSL 1104

Query: 1209 MHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLP 1268
            MH+ LHEEID FCKQV+AEN  RKPYINWAVKRVTR LQVLWPRSRTN+FGSNATGL+LP
Sbjct: 1105 MHNALHEEIDQFCKQVSAENLVRKPYINWAVKRVTRCLQVLWPRSRTNLFGSNATGLALP 1164

Query: 1269 SSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENT 1328
            +SDVDLV+ LPPVRNLEPIKEAGILEGRNGIKETCLQHAAR LANQ+WV++DSLKTVENT
Sbjct: 1165 TSDVDLVISLPPVRNLEPIKEAGILEGRNGIKETCLQHAARCLANQDWVRTDSLKTVENT 1224

Query: 1329 AIPIIMLVVEVPHDL-IASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSAS--PKCS 1385
            AIP+IMLV +VP D  ++S  SSV     +     +  D    S         S  P CS
Sbjct: 1225 AIPVIMLVAQVPCDTNLSSEYSSVLDSSHEQLSINVLGDQGSPSRSDNSSSEGSSMPMCS 1284

Query: 1386 HTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLD 1445
              + D+     S+RLDISFKSPSHTGLQTT+LV ELT+QFPA+ PLAL+LK+FLADRSLD
Sbjct: 1285 RINKDDGNVVQSIRLDISFKSPSHTGLQTTELVCELTQQFPAAVPLALILKKFLADRSLD 1344

Query: 1446 QSYSGGLSSYCLMLLITRFLQHEHHLGRPINQ 1477
              YSGGLSSYCL+LLI RFLQHEHHLGRPINQ
Sbjct: 1345 HPYSGGLSSYCLVLLIVRFLQHEHHLGRPINQ 1376



 Score =  171 bits (434), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 126/376 (33%), Positives = 188/376 (50%), Gaps = 42/376 (11%)

Query: 89  MLGKLRANGHG-FFIILPDLPS-----RDPPYLPGLCYKKSRGLLSRVAESNESGRWVFE 142
           ML +LRA GH  FF++ P  PS      +    P +  + SRGLL+R AE++     VF+
Sbjct: 1   MLRRLRARGHSSFFVLHPPSPSSAAVAEEGAAAPTVLSRLSRGLLARAAEASRGQGLVFD 60

Query: 143 STRLFSSREGEKIEEWSCPVNCLDTFTVSVEFVENVDRFIDIMDEISNGGFLRGE-ESEL 201
              LF S               LD  TV+ E + ++D F+  MDEIS G FL G+ E +L
Sbjct: 61  HLLLFPSSPTSSSR--------LDAITVAEELLGDLDGFVAAMDEISGGRFLCGDGEVDL 112

Query: 202 AG----DWVEFDWLKAKGYYSIEAFIVNRLEVGLRLAWLNC----------NNGKKRGVK 247
                 ++ E  WLKAKGYY IE F+ N +E+ LR++W                 + G  
Sbjct: 113 VAMVGEEFPELPWLKAKGYYVIEEFVANWVEIALRMSWAAAGGGGGGGGGGRKAVRVGKG 172

Query: 248 LKEKLNAAGMAANVYWRKKGCVDWWMNLDDAMRRKVLTVILGKAAKSLTHEVLKEASNAL 307
           +KEK   AG+AAN +WR+KG VDWWM L+  +R +++    GK   +L +E++ E +   
Sbjct: 173 VKEK---AGLAANAFWREKGYVDWWMRLEPRVRARIMGAFFGKGTTALANEII-EGTYIA 228

Query: 308 EDGMWLFNAGMKQS--SRFYHSKSLQRTISTLSVDVECGLAISPASLSGIPASL-ATVFS 364
               + F  G  +S  +  Y+  + Q   S    +  C L + P++LS     +      
Sbjct: 229 SSEKFSFCLGEPESFVAETYYESTRQ---SFFRRNRPCCLDV-PSTLSCKKNPIFVKELQ 284

Query: 365 GLFVLQDITTMVLSSQHNEYDIEKIFFSSLRFVSTTTDCLLRKLRGLLMVVSLDCTKLEL 424
            L ++Q+I  + L S  +    + IFF+ L    T  D +L KLRG+LMVVS +   LEL
Sbjct: 285 RLMLVQEI--VYLKSNISNRSGDAIFFTKLMSAGTVADHILMKLRGILMVVSTESINLEL 342

Query: 425 FGEGNFKSSPNKSKEK 440
            G+   K++  K  EK
Sbjct: 343 IGDEAPKAAKKKDVEK 358


>gi|52075837|dbj|BAD45445.1| nucleotidyltransferase-like [Oryza sativa Japonica Group]
          Length = 1461

 Score =  771 bits (1991), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 423/752 (56%), Positives = 522/752 (69%), Gaps = 14/752 (1%)

Query: 732  KSSLECPSYEWPTIAPVYFPSISSHLLPAT-DRLHLDVGHNWHNHVRQPFVPTLHQARNH 790
            +++L   SYEWP++AP +F S      PAT DRLHLDVG+ W     QPF+P  HQ RN 
Sbjct: 634  RNTLGGTSYEWPSVAPHHFVSPEMQQRPATTDRLHLDVGYRWPAQFEQPFLPPNHQMRNP 693

Query: 791  PFDGGCNQILSQ-PLPMSLDWPPMVQNVSGIAPSVTCNYDSGFISSRQSGFQQNFATKGM 849
            P + GCNQ+LS   +P+S DWPP+ +    +  +    YD  +    QS     F    +
Sbjct: 694  PVEAGCNQVLSSLSVPLSFDWPPVFRGYGKLNQNNALGYDPLYTPQMQSSAWSGFHAPLI 753

Query: 850  QFNAKTSDDEGKCSGDFMDLPEPTTTQEQGDECDSHWLSEEELEVHTVSGIDYNQYFGGG 909
            Q  +  ++ + K  GD     +P    + GD+ +S+W SEEE +   +SG D NQYFGGG
Sbjct: 754  QRGSICNEKDRKYFGD----SDPRNKSDVGDDTESYWFSEEESDGRGLSGRDINQYFGGG 809

Query: 910  VMYWNTSDHPGTGFSRPPSLSSDDSSWAWHEADIKRAVDDM-VAFSSSYSTNGLTSPTAA 968
            VMYW+ ++H GTGFSRPPSLSSDDS+WAWHEAD+ + +DD+ V    +Y+ N ++SP + 
Sbjct: 810  VMYWSPAEHAGTGFSRPPSLSSDDSAWAWHEADVSQVLDDIPVGIPCTYNANSVSSPPST 869

Query: 969  SFCSPFDPLGPGHQAFSYVVPGNEVPGKVLHSSSTTTDVATEEEISGSFASLSGDVDSKA 1028
            S CS  +   P  Q+  + +  N +  + L S S+  D   ++ IS S +   G    K 
Sbjct: 870  SVCSQNESPDPLPQSVGHSMTANGMNSESLQSPSSMQDSPEDKTISVSKSVSCGSEVIKG 929

Query: 1029 LDTLPCPILRPIIIPNLSRERSRSDFKRSHEHKSPCVPPSRREQPRIKRPPSPVVLCVPR 1088
             DTLP  +LRPI++P +SR  SRS+FK  H+H+SPCVP +RR+ P ++RPPSPVVL VPR
Sbjct: 930  -DTLPYAMLRPIVVP-ISRRPSRSEFKGGHDHRSPCVPSNRRDIPLVRRPPSPVVLSVPR 987

Query: 1089 APRPPPPSPVSDSRKTRGFPTVRSGSSSPRHWGVRGWYHEGTTSEEGCVRMDGSEVVWPS 1148
             PRPPPPSPV +SRK RGFP VRSGSSSPRHWG+R  + E          +DG EVVWPS
Sbjct: 988  VPRPPPPSPVGESRK-RGFPIVRSGSSSPRHWGMRSLFTEDKILNRAQFCLDGPEVVWPS 1046

Query: 1149 WRNKNLSAHPMIQPLSGALLQDHLIAISQLARDQEHPDVAFPLQPLEVQNCPTRKASLSL 1208
            W NK   A  + Q +   +LQDHL+ ISQL+RDQ HPDVA PLQP ++ NC + KASLSL
Sbjct: 1047 WGNKGSPAGTLGQSIEDTVLQDHLVKISQLSRDQ-HPDVALPLQPPDMLNCSSSKASLSL 1105

Query: 1209 MHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLP 1268
            MH+ LHEEID FCKQV+AEN  RKPYINWAVKRVTR LQVLWPRSRTN+FGSNATGL+LP
Sbjct: 1106 MHNALHEEIDQFCKQVSAENLVRKPYINWAVKRVTRCLQVLWPRSRTNLFGSNATGLALP 1165

Query: 1269 SSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENT 1328
            +SDVDLV+ LPPVRNLEPIKEAGILEGRNGIKETCLQHAAR LANQ+WV++DSLKTVENT
Sbjct: 1166 TSDVDLVISLPPVRNLEPIKEAGILEGRNGIKETCLQHAARCLANQDWVRTDSLKTVENT 1225

Query: 1329 AIPIIMLVVEVPHDL-IASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSAS--PKCS 1385
            AIP+IMLV +VP D  ++S  SSV     +     +  D    S         S  P CS
Sbjct: 1226 AIPVIMLVAQVPCDTNLSSEYSSVLDSSHEQLSINVLGDQGSPSRSDNSSSEGSSMPMCS 1285

Query: 1386 HTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLD 1445
              + D+     S+RLDISFKSPSHTGLQTT+LV ELT+QFPA+ PLAL+LK+FLADRSLD
Sbjct: 1286 RINKDDGNVVQSIRLDISFKSPSHTGLQTTELVCELTQQFPAAVPLALILKKFLADRSLD 1345

Query: 1446 QSYSGGLSSYCLMLLITRFLQHEHHLGRPINQ 1477
              YSGGLSSYCL+LLI RFLQHEHHLGRPINQ
Sbjct: 1346 HPYSGGLSSYCLVLLIVRFLQHEHHLGRPINQ 1377



 Score =  160 bits (406), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 119/341 (34%), Positives = 176/341 (51%), Gaps = 39/341 (11%)

Query: 28  QLIDSLTSHISLYHSHSLSSNPNPSSNPRSSILKWFASLTVHQRQAHLT-IVDSKFAQLL 86
           QL D+LT+H+SLYH+ +     + S   R++IL+W ASL+   R A  T ++    A  L
Sbjct: 15  QLADALTAHLSLYHAATPPPPSSSSP--RAAILRWLASLSPPCRAAAATSLLTPAAAAAL 72

Query: 87  IQMLGKLRANGHG-FFIILPDLPS-----RDPPYLPGLCYKKSRGLLSRVAESNESGRWV 140
           + ML +LRA GH  FF++ P  PS      +    P +  + SRGLL+R AE++     V
Sbjct: 73  LSMLRRLRARGHSSFFVLHPPSPSSAAVAEEGAAAPTVLSRLSRGLLARAAEASRGQGLV 132

Query: 141 FESTRLFSSREGEKIEEWSCPVNCLDTFTVSVEFVENVDRFIDIMDEISNGGFLRGE-ES 199
           F+   LF S               LD  TV+ E + ++D F+  MDEIS G FL G+ E 
Sbjct: 133 FDHLLLFPSSPTSSSR--------LDAITVAEELLGDLDGFVAAMDEISGGRFLCGDGEV 184

Query: 200 ELAG----DWVEFDWLKAKGYYSIEAFIVNRLEVGLRLAWLNC----------NNGKKRG 245
           +L      ++ E  WLKAKGYY IE F+ N +E+ LR++W                 + G
Sbjct: 185 DLVAMVGEEFPELPWLKAKGYYVIEEFVANWVEIALRMSWAAAGGGGGGGGGGRKAVRVG 244

Query: 246 VKLKEKLNAAGMAANVYWRKKGCVDWWMNLDDAMRRKVLTVILGKAAKSLTHEVLK-EAS 304
             +KEK   AG+AAN +WR+KG VDWWM L+  +R +++    GK   +L    LK   S
Sbjct: 245 KGVKEK---AGLAANAFWREKGYVDWWMRLEPRVRARIMGAFFGKGTTALEIVYLKSNIS 301

Query: 305 NALEDGMW---LFNAGMKQSSRFYHSKSLQRTISTLSVDVE 342
           N   D ++   L +AG          + +   +ST S+++E
Sbjct: 302 NRSGDAIFFTKLMSAGTVADHILMKLRGILMVVSTESINLE 342



 Score = 46.6 bits (109), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 24/54 (44%), Positives = 32/54 (59%)

Query: 387 EKIFFSSLRFVSTTTDCLLRKLRGLLMVVSLDCTKLELFGEGNFKSSPNKSKEK 440
           + IFF+ L    T  D +L KLRG+LMVVS +   LEL G+   K++  K  EK
Sbjct: 306 DAIFFTKLMSAGTVADHILMKLRGILMVVSTESINLELIGDEAPKAAKKKDVEK 359


>gi|242096678|ref|XP_002438829.1| hypothetical protein SORBIDRAFT_10g026810 [Sorghum bicolor]
 gi|241917052|gb|EER90196.1| hypothetical protein SORBIDRAFT_10g026810 [Sorghum bicolor]
          Length = 1031

 Score =  764 bits (1973), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 416/743 (55%), Positives = 512/743 (68%), Gaps = 33/743 (4%)

Query: 739  SYEWPTIAPVYFPSISSHLLPAT-DRLHLDVGHNWHNHVRQPFVPTLHQARNHPFDGGCN 797
            SYEWP++ P +F S      PA  DRLHLDVG+ W     QPF+P  HQ RN P D GCN
Sbjct: 208  SYEWPSVTPSHFVSPQMQQRPAAEDRLHLDVGYKWPTQFSQPFLPANHQVRNPPIDAGCN 267

Query: 798  QIL-SQPLPMSLDWPPMVQNVSGIAPSVTCNYDSGFISSRQSGFQQNFATKGMQFNAKTS 856
            Q+L S  +P+S DWPP+ +   G   +   +YD  +    QS     F  + +Q     S
Sbjct: 268  QMLPSLAVPLSFDWPPVFR---GYGKNAAVSYDPLYTPQMQSSAWSGFPAQLVQRGGICS 324

Query: 857  DDEGKCSGDFMDLPEPTTTQEQGDECDSHWLSEEELEVHTVSGIDYNQYFGGGVMYWNTS 916
            D                   +  D+ +S+W SEEE +    SG D NQYFGGGVMYW+ +
Sbjct: 325  D------------------SDVADDTESYWFSEEESDSRAHSGRDINQYFGGGVMYWSPA 366

Query: 917  DHPGTGFSRPPSLSSDDSSWAWHEADIKRAVDDMVAFSSSYSTNGLTSPTAASFCSPFDP 976
            +H GTGFSRPPSLSSDDS+WAWHEAD+ R VDD+     S  TNG++SP +   CS  + 
Sbjct: 367  EHAGTGFSRPPSLSSDDSAWAWHEADVIRVVDDVANGIPSTYTNGVSSPPSTPSCSQNES 426

Query: 977  LGPGHQAFSYVVPGNEVPGKVLHSSSTTTDVATEEEISGSFASLSGDVDSKALDTLPCPI 1036
            L PG    ++ + GNE+  + L S S+  D + E++I+    ++S   +    DTLP  +
Sbjct: 427  LDPG----AHSITGNEINNEALTSPSSVQD-SPEDKITSVAKAVSCGSEVVKADTLPYAM 481

Query: 1037 LRPIIIPNLSRERSRSDFKRSHEHKSPCVPPSRREQPRIKRPPSPVVLCVPRAPRPPPPS 1096
            LRPI++PN+SR  SRSD K  H+H+SPCVP +RR+ P ++RPPSPVVL VPR PRPPPPS
Sbjct: 482  LRPIVVPNISRRSSRSDIKGGHDHRSPCVPSTRRDIPILRRPPSPVVLSVPRVPRPPPPS 541

Query: 1097 PVSDSRKTRGFPTVRSGSSSPRHWGVRGWYHEGTTSEEGCVRMDGSEVVWPSWRNKNLSA 1156
            P  +SRK RGFP VRSGSSSPRHWG+RG + E          +DG EVVWPSW NK  S 
Sbjct: 542  PAGESRK-RGFPIVRSGSSSPRHWGMRGLFSEDKIFHRAQFCLDGPEVVWPSWGNKGTSG 600

Query: 1157 HPMIQPLSGALLQDHLIAISQLARDQEHPDVAFPLQPLEVQNCPTRKASLSLMHSLLHEE 1216
              ++Q +   +LQDHL+ ISQL+RDQ HPDVA P+QPL++ N  + K SLSLMH+ LHEE
Sbjct: 601  T-LVQTIEDTVLQDHLVKISQLSRDQ-HPDVALPVQPLDMLNGLSHKTSLSLMHNALHEE 658

Query: 1217 IDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVV 1276
            ID FCKQVAA +  R+PYINWAV+RVTR LQVLWPRSRTN+FGSNATGL+LP+SDVDLVV
Sbjct: 659  IDQFCKQVAAADLVRRPYINWAVRRVTRCLQVLWPRSRTNLFGSNATGLALPTSDVDLVV 718

Query: 1277 CLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLV 1336
             LPPVRNLEPIKEAGILEGRNGIKETCLQHAAR L NQ+WV+SDSLKTVENTAIP+IMLV
Sbjct: 719  SLPPVRNLEPIKEAGILEGRNGIKETCLQHAARCLTNQDWVRSDSLKTVENTAIPVIMLV 778

Query: 1337 VEVPHDLIAS--AASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKA 1394
             +VP D   S   +S +   +E + +   +  +   SD  + + S    CS  + D+   
Sbjct: 779  ADVPCDTNMSNEYSSVLDGSQEYSVNMLGEQGSPPRSDTSSSEGSNMLVCSKLNKDDCDI 838

Query: 1395 ATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSS 1454
              S+RLDISFKS SHTGLQTT+LV+ELT+QFPA  PLAL+LK+FLADRSLD  YSGGLSS
Sbjct: 839  VQSIRLDISFKSSSHTGLQTTELVRELTQQFPAVVPLALILKKFLADRSLDHPYSGGLSS 898

Query: 1455 YCLMLLITRFLQHEHHLGRPINQ 1477
            YCL+LLI RFLQHEHHLGRPINQ
Sbjct: 899  YCLVLLIVRFLQHEHHLGRPINQ 921


>gi|357117360|ref|XP_003560438.1| PREDICTED: uncharacterized protein LOC100844132 [Brachypodium
            distachyon]
          Length = 1247

 Score =  712 bits (1838), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 410/801 (51%), Positives = 519/801 (64%), Gaps = 41/801 (5%)

Query: 685  NFQNSFHESETGAISVLPD---KGIEALEIKKESAVTQDQRNESFFGTALKSSLECPSYE 741
            N  + FH S      VLP      I    I     V ++   E +     ++ L   SYE
Sbjct: 462  NMDHKFHNSVVTTDKVLPPVIPANILQSAINDNGTVMKNGGGEYYVFK--RNLLGGTSYE 519

Query: 742  WPTIAPVYFPSISSHLLPATDRLHLDVGHNWHNHVRQPFVPTLHQARNHPFDGGCNQILS 801
            WP++AP +          A DRLHLD+G+ W     QPF+PT HQ R+ P + GCNQ+LS
Sbjct: 520  WPSVAPHFVSPEIQQRPAAADRLHLDIGYRWPTQFDQPFLPTNHQLRSPPIESGCNQMLS 579

Query: 802  Q-PLPMSLDWPPMVQNVSGIAPSVTCNYDSGFISSRQSGFQQNFATKGMQFNAKTSDDEG 860
               +P+S DWPP+ +    ++ +   +YD  +    QS     F  + MQ     S+ + 
Sbjct: 580  SLAVPLSFDWPPVFRGYGKLSQNAALSYDPVYAPQMQSSAWPGFPAQLMQRGGFCSEKDR 639

Query: 861  KCSGDFMDLPEPTTTQEQGDECDSHWLSEEELEVHTVSGIDYNQYFGGGVMYWNTSDHPG 920
            K   D     +P  T + GD+ +S+W SEEE +   VSG D NQYFGGGVMYW+ ++H G
Sbjct: 640  KYFAD----SDPRNTSDVGDDTESYWCSEEESDGRAVSGRDINQYFGGGVMYWSPAEHAG 695

Query: 921  TGFSRPPSLSSDDSSWAWHEADIKRAVDDM-VAFSSSYSTNGLTSPTAASFCSPFDPLGP 979
            TGFSRPPSLSSDDS+WAWHEAD+ R VDD+ V   S+Y+  G +SP +   CS  +   P
Sbjct: 696  TGFSRPPSLSSDDSAWAWHEADVSRVVDDLGVGIPSAYNA-GASSPPSTPSCSQNETSDP 754

Query: 980  GHQAFSYVVPGNEVPGKVLHSSSTTTDVATEEEISGSFASLSGDVDSKALDTLPCPILRP 1039
              Q   + + GN++  + LHSSS+T D + E++ + +  S S   +    DTLP  +LRP
Sbjct: 755  STQPVCHSMVGNDINNEALHSSSSTQD-SPEDKTTSAARSPSSVSEIVKGDTLPYAMLRP 813

Query: 1040 IIIPNLSRERSRSDFKRSHEHKSPCVPPSRREQPRIKRPPSPVVLCVPRAPRPPPPSPVS 1099
            I++PN+SR  SRSDF+  H+H+SPCV  +RR+ P ++RPPSPVVL VPR PRPPPPSPV 
Sbjct: 814  IVVPNISRRLSRSDFRGGHDHRSPCVSSTRRDIPVVRRPPSPVVLSVPRMPRPPPPSPVG 873

Query: 1100 DSRKTRGFPTVRSGSSSPRHWGVRGWYHEGTTSEEGCVRMDGSEVVWPSWRNKNLSAHPM 1159
            +SRK RGFP VRSGSSSPRHWG+R  + +          +DG EVVWPSW NK   A  +
Sbjct: 874  ESRK-RGFPIVRSGSSSPRHWGMRSLFSDDKIFNRAQFCLDGPEVVWPSWVNKGTYAGTL 932

Query: 1160 IQPLSGALLQDHLIAISQLARDQEHPDVAFPLQPLEVQNCPTRKASLSLMHSLLHEEIDS 1219
            +Q +   +LQDHL+ ISQL+RDQ HPDVA PLQP ++ NC + KASLSLM + LHEEID 
Sbjct: 933  VQSIEDTVLQDHLVKISQLSRDQ-HPDVAVPLQPPDMSNCSSPKASLSLMLNALHEEIDQ 991

Query: 1220 FCKQV-------------------AAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGS 1260
            FCKQV                   AA N   KPYINWAVKRVTR LQVLWPRSRTN+FGS
Sbjct: 992  FCKQVSPLSPAFCITTFIRNTCKVAAGNLVTKPYINWAVKRVTRCLQVLWPRSRTNLFGS 1051

Query: 1261 NATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSD 1320
            NATGL+LP+SDVDLVV LPPVRNLEPIKEAGILEGRNGIKETCLQHAAR L NQ+WV+SD
Sbjct: 1052 NATGLALPTSDVDLVVSLPPVRNLEPIKEAGILEGRNGIKETCLQHAARCLGNQDWVRSD 1111

Query: 1321 SLKTVENTAIPIIMLVVEVPHDLIASA--ASSVQSPKEDAAHTTLKHDNHVHSDMVALDD 1378
            SLKT     IP+IMLV +VP D   S   +S + S ++++ +   +  +   SD  + + 
Sbjct: 1112 SLKT-----IPVIMLVAQVPCDTNMSIEYSSVLDSSQDNSVNVLAEQASPPRSDNSSSEG 1166

Query: 1379 SASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQF 1438
            S +   S  + D+  A  S+RLDISFKSPSHTGLQTT+LV ELT+QFPA+ PLAL+LK+F
Sbjct: 1167 SNTLMGSKMNKDDCDAVKSIRLDISFKSPSHTGLQTTELVCELTQQFPAALPLALILKKF 1226

Query: 1439 LADRSLDQSYSGGLSSYCLML 1459
            LADRSLD  YSGGLSSYCL L
Sbjct: 1227 LADRSLDHPYSGGLSSYCLHL 1247



 Score =  122 bits (306), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 106/336 (31%), Positives = 157/336 (46%), Gaps = 79/336 (23%)

Query: 28  QLIDSLTSHISLYHSHSLSSNPNPSSNPRSS-------ILKWFASLTVHQRQAHLTIVDS 80
           QL+D+LT+H+S YH+ + S   + SS+  S        IL+W ASL    R A  T    
Sbjct: 15  QLVDALTAHLSHYHASNPSPASSSSSSSSSPSSSPRAAILRWLASLPPASRAAACTS--- 71

Query: 81  KFAQLLIQMLGKLRANGHGFFIILPDLPSRDPPYLPGLCYKKSRGLLSRVAESNESGRWV 140
                                 +LP   S      P +  + SRGLL+R A ++ +   +
Sbjct: 72  ----------------------LLPPAASG-----PTVLSRLSRGLLARAAAASRAHELL 104

Query: 141 FESTRLFSSREGEKIEEWSCPVNCLDTFTVSVEFVENVDRFIDIMDEISNGGFLRGEESE 200
           F S  LF+S                D  TV+   + ++D F+  MDEIS G FLR  + E
Sbjct: 105 FSSALLFASSPASPRP---------DAITVAEALLADLDAFVAAMDEISGGRFLRCGDGE 155

Query: 201 L------AGDWVEFDWLKAKGYYSIEAFIVNRLEVGLRLAWLNCNNGK--------KRGV 246
           +        ++ E  WLKAKGYY IE F+ N +E+ LR++W     G         + G 
Sbjct: 156 VDLAALACEEFPELPWLKAKGYYVIEEFVANWVEIALRMSWAAAAGGGGGGGKKAVRVGK 215

Query: 247 KLKEKLNAAGMAANVYWRKKGCVDWWMNLDDAMRRKVLTVILGKAAKSLTHEVLKEASNA 306
            +K+K   AG+A+N +WR+KG VDWWM L+  +R +++    GK A             A
Sbjct: 216 SVKDK---AGLASNTFWREKGYVDWWMRLELPVRVRIMGAFFGKGAI------------A 260

Query: 307 LEDGMWLFNAGMKQSSRFYHSKSLQRTISTLSVDVE 342
           L++G    ++   Q+SR   SKS QRT+S     +E
Sbjct: 261 LDNGC---SSTESQNSRVV-SKSNQRTLSVGDTTIE 292


>gi|413943401|gb|AFW76050.1| hypothetical protein ZEAMMB73_219043 [Zea mays]
          Length = 1447

 Score =  707 bits (1824), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 408/776 (52%), Positives = 515/776 (66%), Gaps = 39/776 (5%)

Query: 691  HESETGAISVLPDK---GIEALEIKKESAVTQDQRNESFFGTALKSSLECPSYEWPTIAP 747
            H+SE     +LP      +    I K   V ++   E +     ++ L   SYEWP++ P
Sbjct: 610  HKSEVTKDRILPSAIPTDMLQSAISKNGGVMKNGGGECYVYN--RNLLGGTSYEWPSVTP 667

Query: 748  VYFPSISSHLLPAT-DRLHLDVGHNWHNHVRQPFVPTLHQARNHPFDGGCNQIL-SQPLP 805
             +F S      PA  DRLHLDVG+ W     QPF+P  HQ RN P D G NQ+L S  +P
Sbjct: 668  SHFVSPQMQQRPAAADRLHLDVGYRWPTQFNQPFLPANHQVRNPPIDAGHNQMLPSLAVP 727

Query: 806  MSLDWPPMVQNVSGIAPSVTCNYDSGFISSRQSGFQQNFATKGMQFNAKTSDDEGKCSGD 865
            +S DWPP+ +   G   + + +YD  +    QS     F  + +Q     SD        
Sbjct: 728  LSFDWPPVFR---GYGKNASVSYDPLYTPQMQSSAWSGFPAQLVQRGGICSD-------- 776

Query: 866  FMDLPEPTTTQEQGDECDSHWLSEEELEVHTVSGIDYNQYFGGGVMYWNTSDHPGTGFSR 925
                       +  D+ +S+W SEEE +    SG D NQYFGGGVMYW+ ++H GTGFSR
Sbjct: 777  ----------SDVADDTESYWFSEEESDSRAHSGRDINQYFGGGVMYWSPAEHAGTGFSR 826

Query: 926  PPSLSSDDSSWAWHEADIKRAVDDMVAFSSSYSTNGLTSPTAASFCSPFDPLGPGHQAFS 985
            PPSLSSDDS+WAWHEAD+ R VDD+     S  TNG++SP +   CS  + L P     +
Sbjct: 827  PPSLSSDDSAWAWHEADVIRVVDDVANGIPSTYTNGVSSPPSTPSCSQNESLDPA----A 882

Query: 986  YVVPGNEVPGKVLHSSSTTTDVATEEEISGSFASLSGDVDSKALDTLPCPILRPIIIPNL 1045
            +++ GNE+  + L S S+  D + E++I     ++S   +    DTLP  +LRPI++P++
Sbjct: 883  HLITGNEINNEALTSPSSVQD-SPEDKIMQVAKAVSCGSEVVKADTLPYAMLRPIVVPSI 941

Query: 1046 SRERSRSDFKRSHEHK-SPCVPPSRREQPRIKRPPSPVVLCVPRAPRPPPPSPVSDSRKT 1104
            SR  SRS+ K +H+H+ SPCVP +RR+ P ++RPPSPVVL VPR PRPPPPSP  +SRK 
Sbjct: 942  SRRSSRSEIKGAHDHRRSPCVPSTRRDIPILRRPPSPVVLSVPRVPRPPPPSPAGESRK- 1000

Query: 1105 RGFPTVRSGSSSPRHWGVRGWYHEGTTSEEGCVRMDGSEVVWPSWRNKNLSAHPMIQPLS 1164
            RGFP VRSGSSSPRHWG+RG + E          +DG EVVWPSW NK  S   ++Q + 
Sbjct: 1001 RGFPIVRSGSSSPRHWGMRGLFSEDKIFHRAQFCLDGPEVVWPSWGNKGTSG-TLVQTIE 1059

Query: 1165 GALLQDHLIAISQLARDQEHPDVAFPLQPLEVQNCPTRKASLSLMHSLLHEEIDSFCKQV 1224
              +LQDHL+ ISQL+ DQ HPDVA P+QPL++ N  + KASLSLMH+ LHEEID FCKQV
Sbjct: 1060 DTVLQDHLVKISQLSCDQ-HPDVALPVQPLDMLNGSSHKASLSLMHNALHEEIDQFCKQV 1118

Query: 1225 AAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNL 1284
            AA N  R+PYINWAVKRVTR LQVLWPRSRTN+FGSNATGL+LP+SDVDLVV LPPVRNL
Sbjct: 1119 AAGNLVRRPYINWAVKRVTRCLQVLWPRSRTNLFGSNATGLALPTSDVDLVVSLPPVRNL 1178

Query: 1285 EPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDL- 1343
            EPIKEAGILEGRNGIKETCLQHAAR L NQ+WV+SDSLKTVENTAIP+IMLV +VP D  
Sbjct: 1179 EPIKEAGILEGRNGIKETCLQHAARCLTNQDWVRSDSLKTVENTAIPVIMLVADVPCDTN 1238

Query: 1344 IASAASSVQSPKEDAAHTTL-KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDI 1402
            +++  SSV    +D +   L +  +   SD  + + S    CS  + D+     S+RLDI
Sbjct: 1239 MSNEYSSVLDGSQDYSVNMLGEQGSPPRSDTSSSEGSNMLMCSKLNKDDCDIVQSIRLDI 1298

Query: 1403 SFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLM 1458
            SFKSPSHTGLQTT+LV+EL++QFPA  PLAL+LK+FLADRSLD  YSGGLSSYCL+
Sbjct: 1299 SFKSPSHTGLQTTELVRELSQQFPAVIPLALILKKFLADRSLDHPYSGGLSSYCLV 1354



 Score =  103 bits (256), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 96/265 (36%), Positives = 139/265 (52%), Gaps = 35/265 (13%)

Query: 28  QLIDSLTSHISLYHSHS-----LSSNPNPSSNPRSSILKWFASLTVHQRQAHLTIVDSKF 82
           +L+D+LT+H+SLYH+ +      ++    SS+PR++IL+W ASL+   R A  T + S  
Sbjct: 13  ELMDALTAHLSLYHAAANPSPATAAASTSSSSPRAAILRWLASLSPAARAAAATSLLSPA 72

Query: 83  AQLLIQ-MLGKLRANGHGFFIIL----PDLPSRDPPYLPGLCYKKSRGLLSRVAESNESG 137
           A   +  M+ +LR  GH  F +L    P   +R     P +  + SRGLL+R A S+ + 
Sbjct: 73  AAAALLSMVRRLRLRGHSSFFVLHSSSPSTSARGAEE-PTVLSRLSRGLLARAAASSRAQ 131

Query: 138 RWVFESTRLFSSREGEKIEEWSCPVNCLDTFTVSVEFVENVDRFIDIMDEISNGGFLRGE 197
             +F +T LF S                D  TV+  FV ++D F+  MDEIS G FL   
Sbjct: 132 ALLFANTLLFPSSPASSRRP--------DVITVAEAFVADLDGFVAAMDEISGGRFLCCG 183

Query: 198 ESEL------AGDWVEFDWLKAKGYYSIEAFIVNRLEVGLRLAWLNCNNGK-------KR 244
           E E+        D+ E  WL AKGYY IE F+ NRL++ LR++W     G        + 
Sbjct: 184 EGEVDLAALACQDFPELPWLNAKGYYVIEEFLANRLDIALRMSWAAAGGGSAGGRKAVRI 243

Query: 245 GVKLKEKLNAAGMAANVYWRKKGCV 269
           G   KEK   AG+AAN +WR+KG +
Sbjct: 244 GKSAKEK---AGLAANAFWREKGIL 265


>gi|297810179|ref|XP_002872973.1| hypothetical protein ARALYDRAFT_912245 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297318810|gb|EFH49232.1| hypothetical protein ARALYDRAFT_912245 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 504

 Score =  459 bits (1181), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 258/486 (53%), Positives = 339/486 (69%), Gaps = 20/486 (4%)

Query: 19  FTNSLMAQNQLIDSLTSHISLYHSHSLSSN-PNPSSNPRSSILKWFASLTVHQRQAHLTI 77
           F+ S MAQNQLIDSLTSHISLYHSHS SS+  N   NPRS+IL+WF+SL+VHQR +HLT 
Sbjct: 12  FSGSSMAQNQLIDSLTSHISLYHSHSSSSSMANSIPNPRSAILRWFSSLSVHQRLSHLTF 71

Query: 78  VDSKFAQLLIQMLGKLRANGHGFFIILPDLPSRDPPYLPGLCYKKSRGLLSRVAESNESG 137
           VD KF Q+L+QMLG +R  G G FIILPDLPS D   LP LC+KKSRGL+SRVAESNE  
Sbjct: 72  VDPKFVQILLQMLGYIRTKGPGSFIILPDLPSSD---LPSLCFKKSRGLISRVAESNEPE 128

Query: 138 RWVFESTRLFSSREGEKIEEWSCPVNCLDTFTVSVEFVENVDRFIDIMDEISNGGFLRGE 197
           R+VF+STRLF SREGE  ++ SC VN LD+  ++ + + NVDRF++ MD +SNG FLRGE
Sbjct: 129 RFVFDSTRLFGSREGENAQDCSCSVNSLDSVVMADDLLTNVDRFVETMDALSNGAFLRGE 188

Query: 198 ESELAGDWVEFDWLKAKGYYSIEAFIVNRLEVGLRLAWLNCNNGKKRGVKLKEKLNAAGM 257
           ES+L  +WVE +WLKAKGYYS+EAFI NRLEV LRLAWLN N+GKKRG+KLKEKLNAA  
Sbjct: 189 ESDLGSNWVELEWLKAKGYYSMEAFIANRLEVSLRLAWLNTNSGKKRGIKLKEKLNAAAA 248

Query: 258 AANVYWRKKGCVDWWMNLDDAMRRKVLTVILGKAAKSLTHEVLKEASNALEDGMWLFNAG 317
           AAN YWRKK CVDWW NLD A  RK+ T +LGK+AKS+ +E+L+EA+ A +  +WLFN  
Sbjct: 249 AANSYWRKKACVDWWQNLDAATHRKIWTCLLGKSAKSVIYEILREANQAQQGDIWLFN-- 306

Query: 318 MKQSSRFYHSKSLQRTISTLSVDVECGLAISPASLSGIPASLATVFSGLFVLQDITTMVL 377
                 F  ++  +   S +S    C + + P S+S  P ++A+  SGL+VLQ+  ++++
Sbjct: 307 ------FASARKRRTETSEVSF---CDMILEPNSVSRKPITVASNLSGLYVLQEFASLLI 357

Query: 378 SSQHNEYDIEKIFFSSLRFVSTTTDCLLRKLRGLLMVVSLDCTKLELFGEGNFKSSP-NK 436
             Q+    ++ +FFSSL  ++T  DC+LRKLRG LMV+S+D  K+EL  +   K SP + 
Sbjct: 358 LCQNGLVPVQSVFFSSLGTITTLVDCILRKLRGFLMVISIDSVKVELLDDNTHKCSPSSS 417

Query: 437 SKEKPSTIGRRKKCRACSTKRQNPLPKSALDE-LSLD-KLPKDPEGALTDTEKVDLMGSD 494
           S +K     R+ K +  + K+  P P++  D+ +SL  K  K  +  L  T+  + +   
Sbjct: 418 SNQKLGLTSRKHKGKTRNLKK--PTPEATSDKNVSLSTKSGKKDQAKLEFTKNREAIECK 475

Query: 495 KVPGIS 500
           KVP  S
Sbjct: 476 KVPAAS 481


>gi|168032650|ref|XP_001768831.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162679943|gb|EDQ66384.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 1918

 Score =  429 bits (1102), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 314/804 (39%), Positives = 428/804 (53%), Gaps = 110/804 (13%)

Query: 728  GTALKSSLECP------SYEWPTIAPVYFPSISSHLLP-ATDRLHLDVGHNWHNHVRQPF 780
            G +L+S    P      S+EWP  + V  P   S + P AT+RLHLDV H+W   +    
Sbjct: 1055 GGSLRSQHNTPAFAMFYSHEWPGYSQVRVPG--STVRPTATERLHLDVVHDWPKCLGASS 1112

Query: 781  VPTLHQARNHPFDGGCNQILSQPLPMSLDWPPMVQNVSGIAPSVTCN------------Y 828
            V   +                 P P +LDW P+V N   + P V               +
Sbjct: 1113 VSISNTV---------------PAPGNLDWSPLVHNGYSLTPMVGYADSNEPKVVTLSPF 1157

Query: 829  DSGFISSRQ--SGFQQNFATKGMQFNAKTSDDEGKCSGDFMDLPEPTTTQEQGDECDSHW 886
             S   +SR   +G  +      ++     +D E   +GDF DL     ++E+G       
Sbjct: 1158 SSSPYASRTPPNGHTRLGEDSEVEERLYDTDLEIFDAGDFGDLDGYLVSEEEG------- 1210

Query: 887  LSEEELEVHT-VSGIDYNQYFGGGVMYWNTSDHPGTGFSRPPSLSSDDSSWAWHEADIKR 945
              E ++E +  V   DYNQ FGGGVMYWN++D+ G G+SRP S+SSDDSSWA  EAD+  
Sbjct: 1211 --ERQIEDNAHVKAEDYNQVFGGGVMYWNSADYTGMGYSRPGSMSSDDSSWARREADLGA 1268

Query: 946  AVDDMVA--FSSSYSTNGLTSPTAASFCS---------PFDPLGPGHQAFSYVVPGNEVP 994
             +DD+V   +   Y + G TS   +S            PFD +G           G++V 
Sbjct: 1269 VLDDIVGIPYGVPYGSKGSTSSNLSSTSPSQPSTSLPVPFDQIG-----------GSQVG 1317

Query: 995  GKVLHSSSTTTDVAT-------EEEISGSFASLSGDVDSKALDTLPCPILRPIII---PN 1044
             +V  +  T +D +        + + +G+    S  V    +     P LRPI++   P 
Sbjct: 1318 PRVCLTRPTGSDFSVSYRAFSDDSQTAGTNVGNSAGVVDGIMSDGFMPRLRPIVVMRDPG 1377

Query: 1045 LSRERSRSDFKRSHEHKSPCV--------PPSRREQPRIKRPPSPVVLCVPRAPRPPPPS 1096
            LSR   +++  R  E +SP +        P  RR     +RP  PV      AP PPPPS
Sbjct: 1378 LSRSMGKNEAVRLRELRSPHIVSRGAPDFPGDRR-----RRPSPPVTR---HAPPPPPPS 1429

Query: 1097 PVSDSRKTRGFPTVRSGSSSPRHWGVRGWYHEGTTSEEGCVRMDGSEVVWPSWRNKNLSA 1156
            PV+  ++ +G+ + RSGSSSPRHWG+  W+++                +    R   L++
Sbjct: 1430 PVAGLKRRKGWMSARSGSSSPRHWGLTSWWNKDDVDNFEVQAAGRDNALENGRRMAILAS 1489

Query: 1157 HPMIQPLSGALLQDHLIAISQLARDQEH-PDVAFPLQPLEVQN-CPTRKASLSLMHSLLH 1214
             P I+P+S  L  +H +AI   A DQEH  D A  +Q   + N  P+ +  ++L H  LH
Sbjct: 1490 TPPIRPMSRVLPLEHSVAIHSSALDQEHMADEALVMQSTVLFNKNPSLQLVITLFHKSLH 1549

Query: 1215 EEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDL 1274
            +EI++FC QVAAE   R P+IN AVK+VT SLQVLWPRSR  IFGSNATGL+LP+SDVD+
Sbjct: 1550 KEIETFCLQVAAEKRLRMPFINTAVKKVTHSLQVLWPRSRAKIFGSNATGLALPTSDVDI 1609

Query: 1275 VVCLPPVRNLE-PIKEAGILEGR-NGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPI 1332
            VV LPPVR    PIK+AGILEGR +G+KETCL HAAR L +Q+WVKS  L+  E T +PI
Sbjct: 1610 VVSLPPVRKARNPIKQAGILEGRIDGVKETCLLHAARNLEDQDWVKS--LQVFERTMVPI 1667

Query: 1333 IMLVVEV-PHDL-IASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSD 1390
            I L   + PH   +   + SV+      +    +  +    D     +   P  S+    
Sbjct: 1668 IRLEAHILPHQCELGDISLSVKETCPMGSSNDAQDGSPTSQDSGDSGEKEGPWGSN---- 1723

Query: 1391 NIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSG 1450
              K    VRLDISF+S ++TGL+T++LV+EL  QFP  TPLALV KQFLADRSLD +Y G
Sbjct: 1724 --KDRQIVRLDISFESLANTGLRTSELVRELVGQFPPLTPLALVSKQFLADRSLDHAYKG 1781

Query: 1451 GLSSYCLMLLITRFLQHEHHLGRP 1474
            GLSSYC +LLITRFLQH+ HLGRP
Sbjct: 1782 GLSSYCQVLLITRFLQHQQHLGRP 1805



 Score =  122 bits (305), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 101/356 (28%), Positives = 165/356 (46%), Gaps = 64/356 (17%)

Query: 111 DPPYLPGLCYKKSRGLLSRVAESNESGRWVFESTRLFSSREGEK--IEEWSCPVNCLDTF 168
           D   LPGLCY+K+ GLL+R+     +   + ++ ++FSS++G K  I   S PV   D  
Sbjct: 315 DSVTLPGLCYRKANGLLARLKSQQLAEEKLRKAVQIFSSQDGHKSLISCGSEPV--FDAL 372

Query: 169 TVSVEFVENVDRFIDIMDEISNGGFLRGEESELAGDWVEFDWLKAKGYYSIEAFIVNRLE 228
           ++S E  EN+D F+++M EIS G FL       A  W E  WL+ +GYYS+ AFI N+LE
Sbjct: 373 SLSSELAENLDVFLEVMGEISYGEFLEHPLGLTASPWEETPWLQVQGYYSVAAFIANKLE 432

Query: 229 VGLRLAWLNCNNGKKRGVK----------LKEK---------LNAAGMAANVYWRKKGCV 269
           + +  ++ + + G KR  K          L+ K         L +A  A +++ RK+GC+
Sbjct: 433 ICIWSSY-SLSMGSKRSFKNQWIIKTSESLRSKWTSGRSVDVLESAAAAQSLHERKRGCI 491

Query: 270 DWWMNLDDAMRRKVLTVILGKAAK--------------------SLTHEVLKEASNALED 309
           DWW  +D  ++ K +   L  A K                      +HE+L+      ++
Sbjct: 492 DWWNRVDCDLKEKTVRSALIAATKFEVISAVSSARSAIRSIKPAHRSHELLEATKYRGKN 551

Query: 310 GMWLFNAGMKQSSRFYHSKSLQRTISTLSVDVECGLAISPASLSGIPASLATVFSGLFVL 369
             W     +  +   +H   L+ + ST +        +SP  L G+ A      SG+   
Sbjct: 552 TRW---TSLSDTEVEHHKLDLKNSWSTST--------LSPL-LLGLAALKEAAESGILSG 599

Query: 370 QDITTMVLSSQHNEYDIEKIFFSSLRFVSTTTDCLLRKLRGLLMVVSLDCTKLELF 425
           + +T    S+         +FFS+L  V T  D ++R+ R +L  V  +  + EL 
Sbjct: 600 ESLTIAPRSNN--------LFFSNLESVHTLPDRIVRRSRNVLARVMNEAVEYELL 647



 Score = 58.9 bits (141), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 38/104 (36%), Positives = 57/104 (54%), Gaps = 16/104 (15%)

Query: 22  SLMAQNQLIDSLTSHISLYHSHS------------LSSNPNPSSNP----RSSILKWFAS 65
           S M   Q +D+LT H+++YH  +            L +NP P +      +  +++W + 
Sbjct: 136 STMDTQQAVDALTDHMAIYHRSAERLKAGRSTGGDLKNNPKPGAKGGNVVKQRVIEWLSE 195

Query: 66  LTVHQRQAHLTIVDSKFAQLLIQMLGKLRANGHGFFIILPDLPS 109
           L+  QRQA LT+ D  +  LL+QM  KL  NG G FIILPD+P+
Sbjct: 196 LSTSQRQAVLTVFDRSWVLLLLQMQQKLNENGPGNFIILPDIPA 239


>gi|297725171|ref|NP_001174949.1| Os06g0669275 [Oryza sativa Japonica Group]
 gi|255677308|dbj|BAH93677.1| Os06g0669275, partial [Oryza sativa Japonica Group]
          Length = 615

 Score =  410 bits (1055), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 243/511 (47%), Positives = 323/511 (63%), Gaps = 11/511 (2%)

Query: 732  KSSLECPSYEWPTIAPVYFPSISSHLLPAT-DRLHLDVGHNWHNHVRQPFVPTLHQARNH 790
            +++L   SYEWP++AP +F S      PAT DRLHLDVG+ W     QPF+P  HQ RN 
Sbjct: 110  RNTLGGTSYEWPSVAPHHFVSPEMQQRPATTDRLHLDVGYRWPAQFEQPFLPPNHQMRNP 169

Query: 791  PFDGGCNQILSQ-PLPMSLDWPPMVQNVSGIAPSVTCNYDSGFISSRQSGFQQNFATKGM 849
            P + GCNQ+LS   +P+S DWPP+ +    +  +    YD  +    QS     F    +
Sbjct: 170  PVEAGCNQVLSSLSVPLSFDWPPVFRGYGKLNQNNALGYDPLYTPQMQSSAWSGFHAPLI 229

Query: 850  QFNAKTSDDEGKCSGDFMDLPEPTTTQEQGDECDSHWLSEEELEVHTVSGIDYNQYFGGG 909
            Q  +  ++ + K  GD     +P    + GD+ +S+W SEEE +   +SG D NQYFGGG
Sbjct: 230  QRGSICNEKDRKYFGD----SDPRNKSDVGDDTESYWFSEEESDGRGLSGRDINQYFGGG 285

Query: 910  VMYWNTSDHPGTGFSRPPSLSSDDSSWAWHEADIKRAVDDM-VAFSSSYSTNGLTSPTAA 968
            VMYW+ ++H GTGFSRPPSLSSDDS+WAWHEAD+ + +DD+ V    +Y+ N ++SP + 
Sbjct: 286  VMYWSPAEHAGTGFSRPPSLSSDDSAWAWHEADVSQVLDDIPVGIPCTYNANSVSSPPST 345

Query: 969  SFCSPFDPLGPGHQAFSYVVPGNEVPGKVLHSSSTTTDVATEEEISGSFASLSGDVDSKA 1028
            S CS  +   P  Q+  + +  N +  + L S S+  D   ++ IS S  S+S   +   
Sbjct: 346  SVCSQNESPDPLPQSVGHSMTANGMNSESLQSPSSMQDSPEDKTISVS-KSVSCGSEVIK 404

Query: 1029 LDTLPCPILRPIIIPNLSRERSRSDFKRSHEHKSPCVPPSRREQPRIKRPPSPVVLCVPR 1088
             DTLP  +LRPI++P +SR  SRS+FK  H+H+SPCVP +RR+ P ++RPPSPVVL VPR
Sbjct: 405  GDTLPYAMLRPIVVP-ISRRPSRSEFKGGHDHRSPCVPSNRRDIPLVRRPPSPVVLSVPR 463

Query: 1089 APRPPPPSPVSDSRKTRGFPTVRSGSSSPRHWGVRGWYHEGTTSEEGCVRMDGSEVVWPS 1148
             PRPPPPSPV +SRK RGFP VRSGSSSPRHWG+R  + E          +DG EVVWPS
Sbjct: 464  VPRPPPPSPVGESRK-RGFPIVRSGSSSPRHWGMRSLFTEDKILNRAQFCLDGPEVVWPS 522

Query: 1149 WRNKNLSAHPMIQPLSGALLQDHLIAISQLARDQEHPDVAFPLQPLEVQNCPTRKASLSL 1208
            W NK   A  + Q +   +LQDHL+ ISQL+RDQ HPDVA PLQP ++ NC + KASLSL
Sbjct: 523  WGNKGSPAGTLGQSIEDTVLQDHLVKISQLSRDQ-HPDVALPLQPPDMLNCSSSKASLSL 581

Query: 1209 MHSLLHEEIDSFCKQVAAENTARKPYINWAV 1239
            MH+ LHEEID FCKQV         Y+ + +
Sbjct: 582  MHNALHEEIDQFCKQVCPFCATSFMYVRYVI 612


>gi|345291705|gb|AEN82344.1| AT4G00060-like protein, partial [Capsella grandiflora]
 gi|345291707|gb|AEN82345.1| AT4G00060-like protein, partial [Capsella grandiflora]
 gi|345291709|gb|AEN82346.1| AT4G00060-like protein, partial [Capsella rubella]
 gi|345291711|gb|AEN82347.1| AT4G00060-like protein, partial [Capsella rubella]
 gi|345291713|gb|AEN82348.1| AT4G00060-like protein, partial [Capsella rubella]
 gi|345291715|gb|AEN82349.1| AT4G00060-like protein, partial [Capsella rubella]
 gi|345291717|gb|AEN82350.1| AT4G00060-like protein, partial [Capsella rubella]
 gi|345291719|gb|AEN82351.1| AT4G00060-like protein, partial [Capsella rubella]
 gi|345291721|gb|AEN82352.1| AT4G00060-like protein, partial [Capsella rubella]
 gi|345291723|gb|AEN82353.1| AT4G00060-like protein, partial [Neslia paniculata]
          Length = 197

 Score =  253 bits (647), Expect = 5e-64,   Method: Compositional matrix adjust.
 Identities = 131/228 (57%), Positives = 159/228 (69%), Gaps = 33/228 (14%)

Query: 739 SYEWPTIAPVYFPSISSHLLPATDRLHLDVGHNWHNHVRQPFVPTLHQARNHPFDGGCNQ 798
           SYEWP +AP+YF  ++SHL  ATDRLHLDVGHN H +VRQPFV ++H  RN   +G   Q
Sbjct: 2   SYEWPAVAPMYFSHVNSHLPTATDRLHLDVGHNLHAYVRQPFVSSVHHTRNPSIEGSHKQ 61

Query: 799 ILSQPLPMSLDWPPMVQNVSGIAPSVTCNYDSGFISSRQSGFQQNFATKGMQFNAKTSDD 858
           +LS+P+PMSLDWPPMV +  G+  +  CNYDSG +                         
Sbjct: 62  VLSRPMPMSLDWPPMVHSNCGLTTAFACNYDSGIL------------------------- 96

Query: 859 EGKCSGDFMDLPEPTTTQEQGDECDSHWLSEEELEVHTVSGIDYNQYFGGGVMYWNTSDH 918
                   +D+PE     E G+EC+++W+ EE+ E+HTVSG+DYNQYFGGGVMYWN SDH
Sbjct: 97  --------VDIPEQKNKPELGNECENNWMLEEDFEMHTVSGVDYNQYFGGGVMYWNPSDH 148

Query: 919 PGTGFSRPPSLSSDDSSWAWHEADIKRAVDDMVAFSSSYSTNGLTSPT 966
            GTGFSRPPSLSSDDSSWAWHEA++KR+VDDMVAFSSSYS NGL SPT
Sbjct: 149 LGTGFSRPPSLSSDDSSWAWHEAEMKRSVDDMVAFSSSYSANGLDSPT 196


>gi|428166591|gb|EKX35564.1| hypothetical protein GUITHDRAFT_118270 [Guillardia theta CCMP2712]
          Length = 1889

 Score =  172 bits (437), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 96/266 (36%), Positives = 148/266 (55%), Gaps = 40/266 (15%)

Query: 1208 LMHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSL 1267
            LMH++L  ++  F +++ +E   R  Y   A+ ++  +   LWPR++  IFGS AT LS+
Sbjct: 1485 LMHAMLDSDMLLFSRRIQSELDRRSHYRLCAIAKIQSAASSLWPRAQCKIFGSVATSLSV 1544

Query: 1268 PSSDVDLVVCLPPV--RNLEP-IKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKT 1324
            P+SDVD++VCLP V  R  EP +  AG++EGR  IK T   + A+ LAN +W++S++LKT
Sbjct: 1545 PTSDVDIIVCLPKVMSRESEPSLARAGVMEGRRVIKGTLQANLAQVLANADWIQSETLKT 1604

Query: 1325 VENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKC 1384
            +EN  +P+I    + P  L                                LD S     
Sbjct: 1605 IENAVVPVITFSTK-PMSLF-------------------------------LDPS----- 1627

Query: 1385 SHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSL 1444
             H +S+  +  + +RLD++F+ P H GL    LV+ L + FPA   L  V+K  L+D+ L
Sbjct: 1628 QHKASEEEEKISVIRLDVTFEGPDHRGLPAVRLVQSLVQDFPALKHLFFVVKSLLSDKGL 1687

Query: 1445 DQSYSGGLSSYCLMLLITRFLQHEHH 1470
            D  Y+GGL+S+ L+LL+TR+LQH + 
Sbjct: 1688 DNPYTGGLTSFGLVLLVTRYLQHSNQ 1713


>gi|387219009|gb|AFJ69213.1| hypothetical protein NGATSA_2001900, partial [Nannochloropsis
            gaditana CCMP526]
 gi|422292689|gb|EKU19991.1| hypothetical protein NGA_2001900, partial [Nannochloropsis gaditana
            CCMP526]
          Length = 1119

 Score =  170 bits (431), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 101/272 (37%), Positives = 147/272 (54%), Gaps = 18/272 (6%)

Query: 1209 MHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLP 1268
            + S L ++I +F ++V+    +R P    A+ R  + +Q LWPR++   FGS  +GL+LP
Sbjct: 694  IRSRLCDDIAAFVERVSVCTGSRLPQQTVAINRCRQVIQSLWPRAQVKAFGSFVSGLALP 753

Query: 1269 SSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENT 1328
            SSDVDL++CLP VR   P +  G+LEGRN IKET  Q  AR L    WV   S+K + +T
Sbjct: 754  SSDVDLIICLPNVRRDAPAQAPGVLEGRNAIKETWQQELARRLRATSWVNPASIKIISHT 813

Query: 1329 AIPIIMLVVEV----------PHDLIASAASSVQSPK--EDAAHTTLKHDNHVHSDMVAL 1376
            AIP+I + VE           P   +     +V S +  E AA +   H     +D+ A 
Sbjct: 814  AIPVIKMTVESQPRPSPASPHPTSPVEGGTGTVASGQHSEMAAASQRPHSCRNEADLSAE 873

Query: 1377 DDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLK 1436
             +  +     T          V LD+S +   H GL    ++ +L  +  A  PL LVLK
Sbjct: 874  PEGVALNMGWT------VEYGVHLDVSIEGGQHNGLLANRVIAKLLGENFALRPLVLVLK 927

Query: 1437 QFLADRSLDQSYSGGLSSYCLMLLITRFLQHE 1468
            QF+ +R L +SYSGGLSSY L+L++ R+LQ +
Sbjct: 928  QFMKERGLMESYSGGLSSYGLVLMVARYLQEQ 959


>gi|348685799|gb|EGZ25614.1| hypothetical protein PHYSODRAFT_297213 [Phytophthora sojae]
          Length = 1731

 Score =  167 bits (424), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 101/259 (38%), Positives = 134/259 (51%), Gaps = 56/259 (21%)

Query: 1211 SLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSS 1270
            S LH ++  F  Q+ A++ AR P    A +  T ++Q LWPR++   +GS+ + L LPSS
Sbjct: 1062 SKLHWDLCEFAAQLQADSDARLPAQLAAQRLCTATVQSLWPRAQVRPYGSHVSRLVLPSS 1121

Query: 1271 DVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAI 1330
            DVDLV+CLP VR   P   AG+LEGRN IKET  Q+ AR L  + WV  DS+KT+ + A+
Sbjct: 1122 DVDLVICLPKVRRDAPADAAGVLEGRNAIKETWQQNLARKLRQEPWVVRDSVKTLPHAAV 1181

Query: 1331 PIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSD 1390
            PII L+   P++                               V LD             
Sbjct: 1182 PIITLLTAPPYN-------------------------------VRLD------------- 1197

Query: 1391 NIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSG 1450
                       ISF+ P H GL T D+V  L  +FP   P+ LVLK F  +R L  +YSG
Sbjct: 1198 -----------ISFEGPGHNGLATNDVVLSLLHEFPPLAPMMLVLKSFAIERGLAVAYSG 1246

Query: 1451 GLSSYCLMLLITRFLQHEH 1469
            GLSSY L+LL+ R+LQ EH
Sbjct: 1247 GLSSYALLLLVARYLQ-EH 1264



 Score = 64.7 bits (156), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 71/278 (25%), Positives = 113/278 (40%), Gaps = 64/278 (23%)

Query: 56  RSSILKWFASLTVHQRQAHLTIVDSKFAQLLIQMLGKLRANGHGF--FIILPDL------ 107
           + ++LKW   L+  +  +   + D  F + L+ M  + R +  G   F +LP        
Sbjct: 31  KRAVLKWLRGLSGEELASLCCVEDVGFVKTLLHMAARSRGSSGGVQEFQLLPQTVGAGSG 90

Query: 108 ----------PSRDPP--YLPGLCYKKSRG-----LLSRVAESNESGRWVFESTRLFSSR 150
                     P R  P  ++     K + G       SR  E  ES R +    R+ ++ 
Sbjct: 91  VQSKRTATKTPPRTAPREFVKRPVVKTADGDVVGSFCSR--EYEESSRKLLSGMRVLNTL 148

Query: 151 EGEKIEEWSCPVNCLDTFTVSVEFVENVDR--------FIDIMDEISNGGFLRG--EESE 200
           +       SC     DT  +S+EF   V+R           +M+ IS G FL     ES 
Sbjct: 149 Q-------SC-----DTVALSMEFFHPVERKKGAKFAECFQLMEVISRGAFLIECPPEST 196

Query: 201 LAGD-WVEFDWLKAKGYYSIEAFIVNRLEVGLRLAWLNCNNGKKRGVKLKEKLNAAGMAA 259
           L    W E  WLK +GYYS++A  VN++E+ +   W      K+R      K+   G+A+
Sbjct: 197 LKNRVWGETKWLKEQGYYSLQALFVNQIELNVWAFW------KQRVKDASTKIPLQGLAS 250

Query: 260 NVYWRKKGCVDWWMNLDDAMRRKVLTVILGKAAKSLTH 297
            ++      V  W    DA R   L V+     K+++H
Sbjct: 251 KLH-----LVREWEAASDAQRTAALKVL---GDKTISH 280


>gi|325184829|emb|CCA19321.1| conserved hypothetical protein [Albugo laibachii Nc14]
          Length = 1322

 Score =  164 bits (415), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 94/256 (36%), Positives = 133/256 (51%), Gaps = 55/256 (21%)

Query: 1211 SLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSS 1270
            S LH ++  F   + A++  RK     A +  T ++Q LWPR++   +GS  T L+LPSS
Sbjct: 695  SRLHWDMCEFVTLLQAQSNGRKSAQTAAQRLCTTTVQSLWPRAQVRAYGSFVTQLALPSS 754

Query: 1271 DVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAI 1330
            D+DLV+CLP VR+  P + AG+LEGRN IKET  Q+ AR L  + WV  +S+KT+ + A+
Sbjct: 755  DLDLVICLPKVRHDAPAEAAGVLEGRNAIKETWQQNLARKLRREPWVVPESVKTIPHAAM 814

Query: 1331 PIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSD 1390
            PII L+ + P++                               V LD             
Sbjct: 815  PIITLLTKAPYN-------------------------------VRLD------------- 830

Query: 1391 NIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSG 1450
                       ISF+ P H GL T D+V     +FP+  PL LVLK F+ +R    +YSG
Sbjct: 831  -----------ISFEGPGHNGLATNDVVHAFIHEFPSVVPLMLVLKSFVIERGYAFAYSG 879

Query: 1451 GLSSYCLMLLITRFLQ 1466
            GLSSY L+L++ R+LQ
Sbjct: 880  GLSSYALLLMVVRYLQ 895


>gi|219121652|ref|XP_002181176.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217407162|gb|EEC47099.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 1458

 Score =  164 bits (414), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 93/265 (35%), Positives = 130/265 (49%), Gaps = 53/265 (20%)

Query: 1205 SLSLMHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATG 1264
            SL  M S L ++I  F          +      AV+R+TR +  +WPR++  ++GS+ TG
Sbjct: 1100 SLRGMQSRLSKDILHFLDATNMHLRKQDSKRMKAVERMTRLVNTVWPRAQVKVYGSHVTG 1159

Query: 1265 LSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKT 1324
            L LPSSD+D V+CLP V    P    G LEGRN I ET  +  AR L ++ W+   S+K 
Sbjct: 1160 LCLPSSDLDFVICLPAVHKRAPAVAPGALEGRNAINETSQKLLARKLKSESWIDPRSMKL 1219

Query: 1325 VENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKC 1384
            +E   +P+I +                                                 
Sbjct: 1220 IERAVVPVITV------------------------------------------------- 1230

Query: 1385 SHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSL 1444
               S+ + KA T ++LDISF  P H GLQ  D+V E+ E+ P   PL L+LKQFL DR L
Sbjct: 1231 ---STKDTKART-IQLDISFDGPGHHGLQAIDMVSEILEELPMLRPLVLILKQFLLDRGL 1286

Query: 1445 DQSYSGGLSSYCLMLLITRFLQHEH 1469
              +Y+GGLSSYCL L++ R+LQ + 
Sbjct: 1287 LTAYTGGLSSYCLFLMVARYLQEQQ 1311


>gi|301111622|ref|XP_002904890.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262095220|gb|EEY53272.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 1620

 Score =  164 bits (414), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 100/259 (38%), Positives = 132/259 (50%), Gaps = 56/259 (21%)

Query: 1211 SLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSS 1270
            S LH ++  F  Q+ A++ AR P    A +    ++Q LWPR++   +GS+ T L LPSS
Sbjct: 958  SKLHWDLCEFAAQLQADSDARLPAQLAAQRLCMATVQSLWPRAQVRPYGSHVTRLVLPSS 1017

Query: 1271 DVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAI 1330
            DVDLV+CLP VR   P   AG+LEGRN IKET  Q+ AR L  + WV  DS+KT+ + A+
Sbjct: 1018 DVDLVICLPKVRRDAPADAAGVLEGRNAIKETWQQNLARKLKQEPWVVRDSVKTLPHAAV 1077

Query: 1331 PIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSD 1390
            PII L+   P++                               V LD S           
Sbjct: 1078 PIITLLTAPPYN-------------------------------VRLDIS----------- 1095

Query: 1391 NIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSG 1450
                         F+ P H GL T D+V  L  +FP   P+ LVLK F  +R L  +YSG
Sbjct: 1096 -------------FEGPGHNGLATNDVVLALLHEFPPLAPMMLVLKSFAIERGLAVAYSG 1142

Query: 1451 GLSSYCLMLLITRFLQHEH 1469
            GLSSY L+LL+ R+LQ EH
Sbjct: 1143 GLSSYALLLLVARYLQ-EH 1160


>gi|242096680|ref|XP_002438830.1| hypothetical protein SORBIDRAFT_10g026820 [Sorghum bicolor]
 gi|241917053|gb|EER90197.1| hypothetical protein SORBIDRAFT_10g026820 [Sorghum bicolor]
          Length = 303

 Score =  138 bits (348), Expect = 2e-29,   Method: Composition-based stats.
 Identities = 103/299 (34%), Positives = 148/299 (49%), Gaps = 42/299 (14%)

Query: 28  QLIDSLTSHISLYHSHSLSSNPNPSSNPRSS----------ILKWFASLT-VHQRQAHLT 76
           +L+D+LT+H+SLYH+ +  S    +S   SS          IL+W ASLT   +  A  +
Sbjct: 13  ELMDALTAHLSLYHAAANPSPAAAASASSSSSSSSSSPRAAILRWLASLTPAARAAAAAS 72

Query: 77  IVDSKFAQLLIQMLGKLRANGHGFFIILPDLPSRDPPYL-------PGLCYKKSRGLLSR 129
           ++    A  L+ M+ +LR  GH  F +L    S             P +  + SRGLL+R
Sbjct: 73  LLSPAAAAALLSMVRRLRLRGHSSFFVLHSSSSSPSTSGSARGAEEPTVLSRLSRGLLAR 132

Query: 130 VAESNESGRWVFESTRLFSSREGEKIEEWSCPVNCLDTFTVSVEFVENVDRFIDIMDEIS 189
            A  + +   +F +  LF S              C D  TV+  F+ ++D F+  MDEIS
Sbjct: 133 AAAGSRAQTLLFANILLFPSSPASS--------RCPDAITVAEAFLADLDGFVAAMDEIS 184

Query: 190 NGGFLRGEESEL------AGDWVEFDWLKAKGYYSIEAFIVNRLEVGLRLAWLNC----- 238
            G FL   E E+        D+ E  WL  KGYY IE F+ NRLE+ LR++W        
Sbjct: 185 GGRFLCCGEGEVDLAVLACQDFPELPWLNDKGYYVIEEFLANRLEIALRMSWAAAGGGGV 244

Query: 239 --NNGKKRGVKLKEKLNAAGMAANVYWRKKGCVDWWMNLDDAMRRKVLTVILGKAAKSL 295
                 + G  +KEK   AG+AAN +WR+KG VDWWM L+  MR +++    GK A +L
Sbjct: 245 GGRKAVRIGKGVKEK---AGLAANAFWREKGYVDWWMRLEPRMRARIIGAFFGKGAIAL 300


>gi|323454423|gb|EGB10293.1| hypothetical protein AURANDRAFT_71230 [Aureococcus anophagefferens]
          Length = 2400

 Score =  132 bits (333), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 91/264 (34%), Positives = 126/264 (47%), Gaps = 52/264 (19%)

Query: 1210 HSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
             S L +++ +F ++V+ E  AR      A ++V  +++ LWPR+R   +GS  T LSL  
Sbjct: 840  QSRLADDVRAFAEKVSREQRARDFAWRCAREKVRGAVRSLWPRARVEPYGSCVTRLSLAG 899

Query: 1270 -SDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENT 1328
             SD+DLV+ LP VR   P    G LEGRN IKET  Q  AR L ++ WV + S++ V + 
Sbjct: 900  VSDLDLVIRLPRVRVATPAMTPGDLEGRNAIKETWPQELARRLRSEPWVAAMSVRVVASA 959

Query: 1329 AIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTS 1388
             +P++ L                                      V L D          
Sbjct: 960  RVPVVKLAT------------------------------------VPLGD---------- 973

Query: 1389 SDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSY 1448
               ++ A   RLD+SF+ P H GL    LV  +  Q P +  + LVLKQ  ADR L  SY
Sbjct: 974  ---VREAV--RLDVSFEGPWHRGLDANRLVGRVLGQHPHARQILLVLKQHAADRGLCASY 1028

Query: 1449 SGGLSSYCLMLLITRFLQHEHHLG 1472
            +GGLSSY L LL+ RFL  +   G
Sbjct: 1029 TGGLSSYALTLLVARFLSEQPRRG 1052


>gi|298705070|emb|CBJ28529.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 2789

 Score =  115 bits (289), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 90/259 (34%), Positives = 117/259 (45%), Gaps = 68/259 (26%)

Query: 1211 SLLHEEIDSFCKQVAAENTARKPYINW-AVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
            S L E++  F   V  E   R     W A +R    ++ LWPR++   +GS  TGLSLP 
Sbjct: 1666 SRLCEDLSRFLHLVT-ERERRTFDAKWVAYQRCRAVVRGLWPRAQVKPYGSFVTGLSLPW 1724

Query: 1270 SDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTA 1329
            SD+DLV+CLP VR   P    G LEGRN IKET  Q  AR L  + WV        +N +
Sbjct: 1725 SDLDLVICLPKVRRDAPADTPGALEGRNAIKETWQQELARSLGGERWV--------DNNS 1776

Query: 1330 IPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSS 1389
            + II                                                   SHT+ 
Sbjct: 1777 LKII---------------------------------------------------SHTAV 1785

Query: 1390 DNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYS 1449
              IK  T       F+   H G     LVK+L + FPA  PLAL+LKQF+ ++ L  SY+
Sbjct: 1786 PVIKLTT-------FEGQGHNGQAANQLVKDLVDGFPALRPLALLLKQFMMEKGLSVSYT 1838

Query: 1450 GGLSSYCLMLLITRFLQHE 1468
            GGLSSY L L++ R+LQ +
Sbjct: 1839 GGLSSYALTLMVARYLQEQ 1857


>gi|440803146|gb|ELR24057.1| nucleotidyltransferase domain containing protein [Acanthamoeba
            castellanii str. Neff]
          Length = 1552

 Score =  108 bits (269), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 80/272 (29%), Positives = 128/272 (47%), Gaps = 49/272 (18%)

Query: 1212 LLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSD 1271
             LH+EI+ +   + +  + R+ Y    + ++   +  LW  S    +GS AT L +PSSD
Sbjct: 1217 FLHKEIEDYVDHIVSLTSVRRVYQELMISKIRAIVAKLWSNSLVECYGSYATSLCIPSSD 1276

Query: 1272 VDLVVCLPPVRNL----EPIKEAGILEGRNGIKE--TCLQHAARYLANQEWVKSDSLKTV 1325
            +DLVV L P   L    + ++        N ++   T +Q     L   +WV   SL+ +
Sbjct: 1277 LDLVVLLSPASTLLCGYKSVETNNAGSNDNLLRRDWTHMQILVEELRKHDWVH--SLQFI 1334

Query: 1326 ENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCS 1385
            +   +P+I L+                             D H     + +D + S   S
Sbjct: 1335 DTAKVPVIKLMTVA--------------------------DGHT----ILMDVTFSFSNS 1364

Query: 1386 HTSSDNIKAATSVRL-DISFKSPS----------HTGLQTTDLVKELTEQFPASTPLALV 1434
             ++S  + A+T+  L D+ F   S          H+G+  + LV+      PA TPLALV
Sbjct: 1365 TSTSSPLHASTAESLADLPFNPTSLLRSNASTFEHSGIAASHLVRSYIRSLPALTPLALV 1424

Query: 1435 LKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            LKQFL++R L+ +Y+GGLSSYCL+L+I  FL+
Sbjct: 1425 LKQFLSERGLNNTYTGGLSSYCLVLMIVAFLR 1456


>gi|224002835|ref|XP_002291089.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220972865|gb|EED91196.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 1570

 Score =  100 bits (249), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 48/127 (37%), Positives = 74/127 (58%)

Query: 1209 MHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLP 1268
            M S L  +I  F + + ++    +    WAV+ +TR++  +WPR++   +GS+ T L LP
Sbjct: 1407 MQSRLSTDIHRFMRALKSQLKKNERRRLWAVENITRAVNAVWPRAQIKQYGSHVTKLCLP 1466

Query: 1269 SSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENT 1328
            SSD+D V+CLP V    P    G LEGRN I ET  +  +R L  + W+   S+K +E T
Sbjct: 1467 SSDMDFVICLPAVHKNAPAVAPGDLEGRNAINETNQKVLSRKLKGEPWLDQRSIKVIERT 1526

Query: 1329 AIPIIML 1335
            A+P+I +
Sbjct: 1527 AVPVIKV 1533


>gi|357620388|gb|EHJ72600.1| hypothetical protein KGM_01329 [Danaus plexippus]
          Length = 564

 Score = 94.4 bits (233), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 74/274 (27%), Positives = 115/274 (41%), Gaps = 80/274 (29%)

Query: 1195 EVQNCPTRKASLSLMHSLL--HEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPR 1252
            E   CP R  + +    +L  HEEI+ F   ++   T         V R+  ++  LWP+
Sbjct: 94   EYGGCPWRIPNYNYKPGVLGLHEEIEHFYMYMSPSETEHLVRTT-VVTRIRSAILSLWPQ 152

Query: 1253 SRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLA 1312
            +R  +FGS  TGL LP+SD+DLVV                       ++  L    R L 
Sbjct: 153  ARVEVFGSFRTGLYLPTSDIDLVVI-------------------GQWEKLPLWTLERELV 193

Query: 1313 NQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSD 1372
             Q+  + DS+K +E   +P                                         
Sbjct: 194  AQDIAEQDSIKVLEKATVP----------------------------------------- 212

Query: 1373 MVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLA 1432
            +V + D  S                V++DISF   S  G+++ +L+K+  EQ+P  + L 
Sbjct: 213  IVKMTDKYS---------------DVKVDISFNMSS--GVKSAELIKQFKEQYPELSRLV 255

Query: 1433 LVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            +VLKQFL  R L++ ++GG+SSY L+L+   FLQ
Sbjct: 256  MVLKQFLLQRDLNEVFTGGISSYSLILMCISFLQ 289


>gi|298715261|emb|CBJ27910.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 2092

 Score = 93.2 bits (230), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 68/251 (27%), Positives = 117/251 (46%), Gaps = 37/251 (14%)

Query: 1220 FCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLP 1279
            F +++      R+   +  + RV   +  +W  +R  ++GS  TGL LPSSDVD+VVC  
Sbjct: 1430 FMQKLGRAMRPRRRAWSQVIARVRELVSGIWKEARVEMYGSCFTGLELPSSDVDVVVCG- 1488

Query: 1280 PVRNLEPIKEAGILEGRNGIKETC-LQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVE 1338
             +  L+     G   G + ++    LQ  A  L  Q WV+   +K +E  A+P+I ++  
Sbjct: 1489 -INGLK-YSYGGSRAGYSSMQTVANLQKLANVLQEQTWVR--GMKVIETAAVPVIKILA- 1543

Query: 1339 VPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSV 1398
               D +++                        +     DDS + +       +      +
Sbjct: 1544 ---DPVSNG-----------------------NGQFMFDDSRNGRDGWLGGGS--GLDYI 1575

Query: 1399 RLDISFKSPSHTGLQTTDLVKELTE--QFPASTPLALVLKQFLADRSLDQSYSGGLSSYC 1456
             LDISF+SP H G+ ++  V++  +  ++  + P  +VLK+ L  R L+Q +SGGLSSY 
Sbjct: 1576 PLDISFESPQHGGIASSKWVRDCIQGGKYGFALPAVMVLKEILCQRGLNQPWSGGLSSYS 1635

Query: 1457 LMLLITRFLQH 1467
            L+ ++   LQ 
Sbjct: 1636 LINMMITVLQR 1646


>gi|281209393|gb|EFA83561.1| hypothetical protein PPL_02627 [Polysphondylium pallidum PN500]
          Length = 1489

 Score = 91.7 bits (226), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 74/266 (27%), Positives = 115/266 (43%), Gaps = 71/266 (26%)

Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
            LH+EI  F   V           +  ++ +   ++ LWP +   I+GS  TGL LPSSDV
Sbjct: 1095 LHQEILDFVSDVGERTYPHVQNCHQVIEMIRTVVKRLWPNAELEIYGSFYTGLWLPSSDV 1154

Query: 1273 DLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPI 1332
            D+VV                                       + K+ S+++ +N    +
Sbjct: 1155 DIVV--------------------------------------NYGKNMSIRS-KNAQFLL 1175

Query: 1333 IMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNI 1392
             +L  ++ HDL                       N+    M+ +  +  P     + +NI
Sbjct: 1176 EVLEKQIRHDL-----------------------NNFILTMMCIPSARIPVIKLVTKNNI 1212

Query: 1393 KAATSVRLDISFK-SPS--HTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYS 1449
                   +DISF+ SP+  HTG+   +LV E  E      PLA+VLK FL +R L+ +YS
Sbjct: 1213 A------VDISFRESPTSPHTGIAARNLVSENVETLVGLYPLAIVLKWFLRERGLNNTYS 1266

Query: 1450 GGLSSYCLMLLITRFLQHEHHLGRPI 1475
            GGLSSYCL+L++  FL++  H  + I
Sbjct: 1267 GGLSSYCLVLMLISFLKNNEHCPKKI 1292


>gi|313232447|emb|CBY24115.1| unnamed protein product [Oikopleura dioica]
          Length = 887

 Score = 89.4 bits (220), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 73/260 (28%), Positives = 110/260 (42%), Gaps = 78/260 (30%)

Query: 1213 LHEEIDSF---CKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
            LHEEI  F    +    E T R       V RV  +++  +P ++  +FGS  TGL LP+
Sbjct: 144  LHEEIIDFHNWIRSTPEEYTMRHD----VVLRVEEAIKQEFPGAQVEVFGSFQTGLYLPT 199

Query: 1270 SDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAAR-YLANQEWVKSDSLKTVENT 1328
            SD+D+VV                +E R G  +    +  +  L  Q   +  S+K +++ 
Sbjct: 200  SDIDMVVL------------GEKIEPRYGNPQNGPHYRLQDRLLKQGIAERYSIKVIDSA 247

Query: 1329 AIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTS 1388
            A+PII +                                    DM+              
Sbjct: 248  AVPIIKM-----------------------------------RDMI-------------- 258

Query: 1389 SDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSY 1448
                   T +++DISF     TG+    LVK    QFPA   L LVLKQFL  R +++ +
Sbjct: 259  -------TDIKVDISFNM--KTGVTAIGLVKGYIRQFPALRYLVLVLKQFLLQRDMNEVW 309

Query: 1449 SGGLSSYCLMLLITRFLQHE 1468
            +GG+SSY L+L++  FLQH+
Sbjct: 310  TGGISSYGLILMVVSFLQHQ 329


>gi|320169914|gb|EFW46813.1| hypothetical protein CAOG_04771 [Capsaspora owczarzaki ATCC 30864]
          Length = 1503

 Score = 89.4 bits (220), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 63/254 (24%), Positives = 105/254 (41%), Gaps = 62/254 (24%)

Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
            LH++I  F + V+    +  P +  A+  V   +  +WP S   ++GS + GLSLP SD+
Sbjct: 1157 LHDDILRFYQYVSKTAQSNAPNVQTAIHHVQGVVLSIWPNSYVVLYGSYSYGLSLPHSDI 1216

Query: 1273 DLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPI 1332
            DLVV  PP           +        ++ +   A  L    W     ++ +E + IP+
Sbjct: 1217 DLVVVRPP-----------MAPNSGAPAQSAVAELAARLTAFPWCS--GVRAIETSRIPV 1263

Query: 1333 IMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNI 1392
            +  V  +           ++ P + + H                                
Sbjct: 1264 VKFVATI---------GGLRIPVDVSFH-------------------------------- 1282

Query: 1393 KAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGL 1452
                    D S    SH+G+   D V+     +P   PL +VLKQ+L  + L+++YSGGL
Sbjct: 1283 --------DTSRGPLSHSGILARDFVRSQISAYPMLAPLVMVLKQYLHSKGLNEAYSGGL 1334

Query: 1453 SSYCLMLLITRFLQ 1466
            +SY ++LL+T  LQ
Sbjct: 1335 TSYGIVLLVTFLLQ 1348


>gi|313242854|emb|CBY39607.1| unnamed protein product [Oikopleura dioica]
          Length = 833

 Score = 89.4 bits (220), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 73/260 (28%), Positives = 110/260 (42%), Gaps = 78/260 (30%)

Query: 1213 LHEEIDSF---CKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
            LHEEI  F    +    E T R       V RV  +++  +P ++  +FGS  TGL LP+
Sbjct: 90   LHEEIIDFHNWIRSTPEEYTMRHD----VVLRVEEAIKQEFPGAQVEVFGSFQTGLYLPT 145

Query: 1270 SDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAAR-YLANQEWVKSDSLKTVENT 1328
            SD+D+VV                +E R G  +    +  +  L  Q   +  S+K +++ 
Sbjct: 146  SDIDMVVL------------GEKIEPRYGNPQNGPHYRLQDRLLKQGIAERYSIKVIDSA 193

Query: 1329 AIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTS 1388
            A+PII +                                    DM+              
Sbjct: 194  AVPIIKM-----------------------------------RDMI-------------- 204

Query: 1389 SDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSY 1448
                   T +++DISF     TG+    LVK    QFPA   L LVLKQFL  R +++ +
Sbjct: 205  -------TDIKVDISFNM--KTGVTAIGLVKGYIRQFPALRYLVLVLKQFLLQRDMNEVW 255

Query: 1449 SGGLSSYCLMLLITRFLQHE 1468
            +GG+SSY L+L++  FLQH+
Sbjct: 256  TGGISSYGLILMVVSFLQHQ 275


>gi|301119917|ref|XP_002907686.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262106198|gb|EEY64250.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 647

 Score = 89.0 bits (219), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 69/258 (26%), Positives = 110/258 (42%), Gaps = 77/258 (29%)

Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
            LHEEI ++ +           +I   +  V  S+Q LWP+S+   FGS +TG+ LPSSDV
Sbjct: 148  LHEEILAYTQYTKKTVDTMAVHIEQMISNVRASVQSLWPQSKVETFGSYSTGIWLPSSDV 207

Query: 1273 DLVVCLPPVRNLEPIKEAGILEGRNG-IKETCLQHAARYLANQEWVKSDSLKTVENTAIP 1331
            DLV+               ++E  +  +    L+  A+ L  ++WV  +SL  ++   +P
Sbjct: 208  DLVIL-------------DVVEVNDSKLTAKHLRQLAKVLEKKKWV--ESLLCLDAAKVP 252

Query: 1332 IIMLV---VEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTS 1388
            ++ LV     VP D+   +A++                   HS ++A             
Sbjct: 253  VLKLVSAETSVPIDITFESAAT-------------------HSGLLA------------- 280

Query: 1389 SDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSY 1448
                                       DL+K   +  P   PLA+V KQ L +R L+ +Y
Sbjct: 281  --------------------------RDLIKRYADNMPELYPLAIVFKQLLRERDLNDAY 314

Query: 1449 SGGLSSYCLMLLITRFLQ 1466
            +GGLSSY ++L+I  F Q
Sbjct: 315  TGGLSSYSVVLMIIHFSQ 332


>gi|328872412|gb|EGG20779.1| hypothetical protein DFA_00644 [Dictyostelium fasciculatum]
          Length = 1767

 Score = 87.8 bits (216), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 78/282 (27%), Positives = 114/282 (40%), Gaps = 69/282 (24%)

Query: 1193 PLEVQNCPTRKASLSLMHSL----LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQV 1248
            PL+V   P     L   H L    LH+EI  F   V           +  +  +   ++ 
Sbjct: 1355 PLQVYLQPWYDPPLPYPHPLFSIKLHQEILDFVSDVGDRTYPHVQSCHHVIALIKGVVKR 1414

Query: 1249 LWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAA 1308
            LWP +  +I+GS  TGL LPSSD+D+VV       + P     +LE         L+   
Sbjct: 1415 LWPNAEIDIYGSFMTGLWLPSSDIDIVVDYNKNMAIRPKNAQFLLE--------VLEKQI 1466

Query: 1309 RYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNH 1368
            R+  N   +   ++  + +  IP+I LV                  KE+ A         
Sbjct: 1467 RHDLNSFIL---TMLCIPSAKIPVIKLVT-----------------KENIA--------- 1497

Query: 1369 VHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPAS 1428
                 V +    SP   H                       TG+   DLV +  ++    
Sbjct: 1498 -----VDISFRESPTSPH-----------------------TGIAARDLVADNVKKVIGL 1529

Query: 1429 TPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQHEHH 1470
             PLA+VLK FL DR L+ +YSGGLSSYCL+L++  FL++  H
Sbjct: 1530 YPLAIVLKWFLRDRGLNNTYSGGLSSYCLVLMLISFLKNNEH 1571


>gi|325183687|emb|CCA18146.1| conserved hypothetical protein [Albugo laibachii Nc14]
          Length = 737

 Score = 86.7 bits (213), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 72/255 (28%), Positives = 108/255 (42%), Gaps = 75/255 (29%)

Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
            LH EI     Q      + K +I   ++ VT+S+Q LW  S    FGS +TGL LP+SDV
Sbjct: 191  LHSEIVHIASQTKKTVDSMKIHIAKIIENVTKSVQELWHDSTIETFGSYSTGLWLPTSDV 250

Query: 1273 DLVVCLPPVRNLEPIKEAGILEGRNGIKETC--LQHAARYLANQEWVKSDSLKTVENTAI 1330
            DLV+               +LE  + ++ T   L   A  L+ Q+WV S           
Sbjct: 251  DLVIL-------------NVLENCD-LRTTVANLGKLADVLSQQDWVAS----------- 285

Query: 1331 PIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSD 1390
                                                      +V L+ +  P     S D
Sbjct: 286  ------------------------------------------IVVLETAKVPVIKLVSKD 303

Query: 1391 NIKAATSVRLDISFKSPS-HTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYS 1449
                  S+ +DISF++ S H+GL   DLV    ++ P   P A+++KQ L +R L+ +Y+
Sbjct: 304  -----VSIPIDISFETSSTHSGLLARDLVSLYVKELPELYPAAIIMKQLLRERDLNDAYT 358

Query: 1450 GGLSSYCLMLLITRF 1464
            GGLSSY ++L++  F
Sbjct: 359  GGLSSYSIVLMLIHF 373


>gi|348690009|gb|EGZ29823.1| hypothetical protein PHYSODRAFT_466551 [Phytophthora sojae]
          Length = 666

 Score = 84.7 bits (208), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 67/257 (26%), Positives = 109/257 (42%), Gaps = 75/257 (29%)

Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
            LHEEI  + +           +I   +  V  S+  LWP+S+   FGS +TG+ LPSSDV
Sbjct: 161  LHEEILEYTEYTKKTVDKMAVHIEQMIANVRASVLSLWPQSKVETFGSYSTGIWLPSSDV 220

Query: 1273 DLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPI 1332
            DLV+      ++  + ++ +           L+  A+ L  ++WV  +SL  ++   +P+
Sbjct: 221  DLVIL-----DVVDVNDSQLTAKH-------LKELAKVLEKKKWV--ESLLVLDTAKVPV 266

Query: 1333 IMLV---VEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSS 1389
            + LV     VP D+   +A++                   HS ++A              
Sbjct: 267  LKLVSAETSVPIDITFESAAT-------------------HSGLLA-------------- 293

Query: 1390 DNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYS 1449
                                      DL+K   +  P   PLA+V KQ L +R L+ +Y+
Sbjct: 294  -------------------------RDLIKRYADNMPELYPLAIVFKQLLRERDLNDAYT 328

Query: 1450 GGLSSYCLMLLITRFLQ 1466
            GGLSSY ++L+I  F Q
Sbjct: 329  GGLSSYSVVLMIIHFSQ 345


>gi|299115714|emb|CBN74279.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 871

 Score = 84.3 bits (207), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 73/267 (27%), Positives = 108/267 (40%), Gaps = 56/267 (20%)

Query: 1212 LLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSD 1271
            +LHEEI  F  +       RK  +    +R+   + +LWP +    +GS A G+  P SD
Sbjct: 309  ILHEEILDFAARCRDVQEKRKRVLENLWRRIEDVICLLWPDTVVVPYGSYACGMMTPESD 368

Query: 1272 VDLVVCLPPVRNLEPIKEAGILEGRN--------GIKETCLQHAARYLANQEWVKSDSLK 1323
            VD++V     +N  P  EA   + R         G   T +   A +L       S  +K
Sbjct: 369  VDIMVFPGDCKNSSPPFEAKDDQPRRRGSRTDSVGGMLTRIHELADHLTQHGTFSS--MK 426

Query: 1324 TVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPK 1383
             + +  +PII             A +   S  +D +  T+                    
Sbjct: 427  VIPHAMVPII------------KARTGALSDGDDGSGDTV-------------------- 454

Query: 1384 CSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRS 1443
                         S  +DIS    SHTGL T+     L +  P   PL +VLKQ L  R 
Sbjct: 455  -------------SFSIDISIDCQSHTGLATSAFTSYLADHLPNLCPLTMVLKQLLQSRG 501

Query: 1444 LDQSYSGGLSSYCLMLLIT-RFLQHEH 1469
            L+  ++GGLSSY L+L++T   LQ +H
Sbjct: 502  LNDPFTGGLSSYGLVLMVTFALLQRDH 528


>gi|124481633|gb|AAI33102.1| LOC568678 protein [Danio rerio]
          Length = 535

 Score = 84.0 bits (206), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 72/262 (27%), Positives = 108/262 (41%), Gaps = 94/262 (35%)

Query: 1213 LHEEIDSFCKQVA---AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
            LHEEI  F + ++    E   R       V R+ R ++ LWP +   +FGS +TGL LP+
Sbjct: 50   LHEEIKDFYEYISPRPEEEQMRHE----VVARIQRVIKDLWPNAEVCVFGSFSTGLYLPT 105

Query: 1270 SDVDLVV-----CLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKT 1324
            SD+DLVV      LP    L  ++EA                    L  ++    +S+K 
Sbjct: 106  SDIDLVVFGNWETLP----LWTLEEA--------------------LRKRKVADENSIKV 141

Query: 1325 VENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKC 1384
            ++   +PII L+                      +HT +K D                  
Sbjct: 142  LDKATVPIIKLM---------------------DSHTEVKVD------------------ 162

Query: 1385 SHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSL 1444
                             ISF   S  G++  +L+K+  +Q+P    L LVLKQFL  R L
Sbjct: 163  -----------------ISFNVQS--GVKAANLIKDYKQQYPVLPYLVLVLKQFLLQREL 203

Query: 1445 DQSYSGGLSSYCLMLLITRFLQ 1466
            ++ ++GG+ SY L L+   FLQ
Sbjct: 204  NEVFTGGIGSYSLFLMAVSFLQ 225


>gi|68363844|ref|XP_697115.1| PREDICTED: PAP-associated domain-containing protein 5 [Danio rerio]
          Length = 653

 Score = 84.0 bits (206), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 72/262 (27%), Positives = 108/262 (41%), Gaps = 94/262 (35%)

Query: 1213 LHEEIDSFCKQVA---AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
            LHEEI  F + ++    E   R       V R+ R ++ LWP +   +FGS +TGL LP+
Sbjct: 168  LHEEIKDFYEYISPRPEEEQMRHE----VVARIQRVIKDLWPNAEVCVFGSFSTGLYLPT 223

Query: 1270 SDVDLVV-----CLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKT 1324
            SD+DLVV      LP    L  ++EA                    L  ++    +S+K 
Sbjct: 224  SDIDLVVFGNWETLP----LWTLEEA--------------------LRKRKVADENSIKV 259

Query: 1325 VENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKC 1384
            ++   +PII L+                      +HT +K D                  
Sbjct: 260  LDKATVPIIKLM---------------------DSHTEVKVD------------------ 280

Query: 1385 SHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSL 1444
                             ISF   S  G++  +L+K+  +Q+P    L LVLKQFL  R L
Sbjct: 281  -----------------ISFNVQS--GVKAANLIKDYKQQYPVLPYLVLVLKQFLLQREL 321

Query: 1445 DQSYSGGLSSYCLMLLITRFLQ 1466
            ++ ++GG+ SY L L+   FLQ
Sbjct: 322  NEVFTGGIGSYSLFLMAVSFLQ 343


>gi|307200518|gb|EFN80680.1| PAP-associated domain-containing protein 5 [Harpegnathos saltator]
          Length = 636

 Score = 83.2 bits (204), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 70/274 (25%), Positives = 111/274 (40%), Gaps = 80/274 (29%)

Query: 1195 EVQNCPTRKASLSLMHSL--LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPR 1252
            E   CP R  +      +  LHEEI+ F   +   +      +   VKR+   +  LWP 
Sbjct: 159  EYGGCPWRMPNKHYSKGVIGLHEEIEDFFAYMCPSHEEHVLRMR-VVKRIEYVIYDLWPD 217

Query: 1253 SRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLA 1312
            S+  +FGS  TGL LP+SD+DL V +   +NL                   L+   R L 
Sbjct: 218  SKVEVFGSFRTGLYLPTSDIDL-VVIGMWKNLP------------------LRTLERALL 258

Query: 1313 NQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSD 1372
            +Q   +  S+K ++  ++P                                         
Sbjct: 259  DQNIAEPSSIKVLDKASVP----------------------------------------- 277

Query: 1373 MVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLA 1432
            +V L D  S                +++DISF    + G+++ +L+     Q+P    L 
Sbjct: 278  IVKLTDKES---------------EIKVDISFNM--NNGVKSAELINSFKRQYPVLEKLV 320

Query: 1433 LVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            +VLKQFL  R L++ ++GG+SSY L+L+   FLQ
Sbjct: 321  MVLKQFLLQRDLNEVFTGGISSYSLILMTISFLQ 354


>gi|66557991|ref|XP_625041.1| PREDICTED: PAP-associated domain-containing protein 5-like [Apis
            mellifera]
          Length = 539

 Score = 83.2 bits (204), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 77/278 (27%), Positives = 114/278 (41%), Gaps = 88/278 (31%)

Query: 1195 EVQNCPTRKASLSLMHSL--LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPR 1252
            E   CP R  +      +  LHEEI+ F   +   N      I   VKR+ + +  LWP 
Sbjct: 101  EYGGCPWRVPNKHYSKGVIGLHEEIEDFFAYMCPSNEEHSLRIR-VVKRIEQVIYDLWPD 159

Query: 1253 SRTNIFGSNATGLSLPSSDVDLVVCLP----PVRNLEPIKEAGILEGRNGIKETCLQHAA 1308
            S+  +FGS  TGL LP+SD+DLVV       P+R LE                       
Sbjct: 160  SKVEVFGSFRTGLYLPTSDIDLVVIGMWTNLPLRTLE----------------------- 196

Query: 1309 RYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNH 1368
            R L +Q   +  S+K ++  ++PI+ L                 + KE    T +K D  
Sbjct: 197  RALLDQNIAEPSSIKVLDKASVPIVKL-----------------TDKE----TEIKVD-- 233

Query: 1369 VHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPAS 1428
                            S   ++ +K+A  +    SFK                 ++FP  
Sbjct: 234  ---------------ISFNMNNGVKSAELIN---SFK-----------------KRFPVL 258

Query: 1429 TPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
              L +VLKQFL  R L++ ++GG+SSY L+L+   FLQ
Sbjct: 259  EKLVMVLKQFLLQRDLNEVFTGGISSYSLILMTISFLQ 296


>gi|383852647|ref|XP_003701838.1| PREDICTED: PAP-associated domain-containing protein 5-like [Megachile
            rotundata]
          Length = 573

 Score = 82.8 bits (203), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 77/278 (27%), Positives = 114/278 (41%), Gaps = 88/278 (31%)

Query: 1195 EVQNCPTRKASLSLMHSL--LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPR 1252
            E   CP R  +      +  LHEEI+ F   +   N      I   VKR+ + +  LWP 
Sbjct: 101  EYGGCPWRVPNKHYSKGVIGLHEEIEDFFAYMCPSNEEHSLRIR-VVKRIEQVIYDLWPD 159

Query: 1253 SRTNIFGSNATGLSLPSSDVDLVVCLP----PVRNLEPIKEAGILEGRNGIKETCLQHAA 1308
            S+  +FGS  TGL LP+SD+DLVV       P+R LE                       
Sbjct: 160  SKVEVFGSFRTGLYLPTSDIDLVVIGMWTNLPLRTLE----------------------- 196

Query: 1309 RYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNH 1368
            R L +Q   +  S+K ++  ++PI+ L                 + KE    T +K D  
Sbjct: 197  RALLDQNIAEPSSIKVLDKASVPIVKL-----------------TDKE----TEIKVD-- 233

Query: 1369 VHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPAS 1428
                            S   S+ +K+A  +    SFK                 +++P  
Sbjct: 234  ---------------ISFNMSNGVKSAELIN---SFK-----------------KRYPVL 258

Query: 1429 TPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
              L +VLKQFL  R L++ ++GG+SSY L+L+   FLQ
Sbjct: 259  EKLVMVLKQFLLQRDLNEVFTGGISSYSLILMTISFLQ 296


>gi|412986788|emb|CCO15214.1| predicted protein [Bathycoccus prasinos]
          Length = 687

 Score = 82.0 bits (201), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 74/254 (29%), Positives = 105/254 (41%), Gaps = 77/254 (30%)

Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
            LH EI  FC+ +       K  +  AV+RV  ++  +WP SR  + GS AT + LPSSDV
Sbjct: 169  LHGEIVDFCRFLEPTEEEEKSRLA-AVERVREAVMTIWPSSRFEVHGSFATKMYLPSSDV 227

Query: 1273 DLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPI 1332
            D                A IL+        CL+  A YLA          +  E T I +
Sbjct: 228  D----------------AVILDSGAKSPAVCLKALALYLA----------RRGEATNIQL 261

Query: 1333 IMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNI 1392
            I               S  + P                  +V  +++ S           
Sbjct: 262  I---------------SKARVP------------------IVKFEETKS----------- 277

Query: 1393 KAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGL 1452
                 V+ D+SF   +  G  + ++VK  T +FPA  PL  VLK FL+ R L++ YSGG+
Sbjct: 278  ----GVQFDVSFDVAN--GPASAEIVKRNTRRFPALRPLTTVLKCFLSQRGLNEVYSGGI 331

Query: 1453 SSYCLMLLITRFLQ 1466
             SY L+ +I  FLQ
Sbjct: 332  GSYALLCMIMTFLQ 345


>gi|348687890|gb|EGZ27704.1| hypothetical protein PHYSODRAFT_343641 [Phytophthora sojae]
          Length = 501

 Score = 82.0 bits (201), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 68/262 (25%), Positives = 109/262 (41%), Gaps = 85/262 (32%)

Query: 1209 MHSLLHEEIDSFCKQVA---AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGL 1265
            +++ LHEEI  F   ++    E ++R   I    + +   ++ LWP +    FGS+ T +
Sbjct: 128  VYACLHEEIMDFVSFISPTEQELSSRAELI----EEMREIVKGLWPEATVETFGSHYTQM 183

Query: 1266 SLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTV 1325
             LP SD+D+V+              G+ EG+       L   A+ L  +E V    L+ +
Sbjct: 184  FLPQSDIDMVLF-------------GVPEGK-----APLFKLAQCLEEKELV--SYLEVI 223

Query: 1326 ENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCS 1385
            +   IPI+                                                 K  
Sbjct: 224  DKARIPIV-------------------------------------------------KMV 234

Query: 1386 HTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLD 1445
            H +SD       + +D+SF      GL T DLVK     +P+  PL LVLK F+A R L+
Sbjct: 235  HKASD-------IHVDVSFNVAG--GLATGDLVKHYMRVYPSFRPLTLVLKYFMAQRGLN 285

Query: 1446 QSYSGGLSSYCLMLLITRFLQH 1467
            ++Y+GG+ S+ L +++  FLQH
Sbjct: 286  ETYTGGVGSFLLQMMVVSFLQH 307


>gi|345493399|ref|XP_001604785.2| PREDICTED: PAP-associated domain-containing protein 5-like [Nasonia
            vitripennis]
          Length = 462

 Score = 81.3 bits (199), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 70/258 (27%), Positives = 105/258 (40%), Gaps = 86/258 (33%)

Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
            LHEEI+ F   +   N      +   +KR+   +  LWP S+  IFGS  TGL LP+SD+
Sbjct: 46   LHEEIEDFFTYMCPTNEEHLLRVK-VIKRIENVIYDLWPDSKVEIFGSFRTGLYLPTSDI 104

Query: 1273 DLVVCLP----PVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENT 1328
            DLVV       P+  LE                       R L +Q  V+  S+K ++  
Sbjct: 105  DLVVIGMWTNLPLHTLE-----------------------RALIDQNIVEPSSVKVLDRA 141

Query: 1329 AIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTS 1388
            ++PI+ L                         T +K D                  S   
Sbjct: 142  SVPIVKLT---------------------DRETEIKVD-----------------ISFNM 163

Query: 1389 SDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSY 1448
            ++ +K+A  ++   +FK                  Q+PA   L +VLKQFL  R L++ +
Sbjct: 164  NNGVKSAELIK---TFK-----------------RQYPALEKLVMVLKQFLLQRDLNEVF 203

Query: 1449 SGGLSSYCLMLLITRFLQ 1466
            +GG+SSY L+L+   FLQ
Sbjct: 204  TGGISSYSLILMTISFLQ 221


>gi|301093772|ref|XP_002997731.1| Poly(A) RNA polymerase, putative [Phytophthora infestans T30-4]
 gi|262109980|gb|EEY68032.1| Poly(A) RNA polymerase, putative [Phytophthora infestans T30-4]
          Length = 489

 Score = 81.3 bits (199), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 66/262 (25%), Positives = 106/262 (40%), Gaps = 85/262 (32%)

Query: 1209 MHSLLHEEIDSFCKQVA-AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSL 1267
            ++  LH+EI  F   ++  E+  R       V+ +   ++ LWP +    FGS+ T + L
Sbjct: 117  VYECLHDEIMDFVTFISPTEDELRSR--AQLVEEMRGVVKGLWPEATVETFGSHYTQMFL 174

Query: 1268 PSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKE--TCLQHAARYLANQEWVKSDSLKTV 1325
            P SD+D+V+              G+ EG+  + +   CL+   R            L+ +
Sbjct: 175  PQSDIDMVLF-------------GVPEGKEPLYKLAQCLEEKDR---------VSYLEVI 212

Query: 1326 ENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCS 1385
            +   IPI+ +V                                                 
Sbjct: 213  DKARIPIVKMV------------------------------------------------- 223

Query: 1386 HTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLD 1445
            H  SD       + +D+SF      GL T DLVK     +P+  PL LVLK F+A R L+
Sbjct: 224  HKGSD-------IHVDVSFNVAG--GLATGDLVKHYMRVYPSFRPLTLVLKYFMAQRGLN 274

Query: 1446 QSYSGGLSSYCLMLLITRFLQH 1467
            ++YSGG+ S+ L +++  FLQH
Sbjct: 275  ETYSGGVGSFLLQMMVVSFLQH 296


>gi|294948742|ref|XP_002785872.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
 gi|239899980|gb|EER17668.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
          Length = 436

 Score = 80.5 bits (197), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 71/264 (26%), Positives = 110/264 (41%), Gaps = 81/264 (30%)

Query: 1209 MHSLLHEEIDSFCKQVAA---ENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGL 1265
            MH +LHEE+  + K V     E  AR+ +    V R+    + LWP+    +FGS  +GL
Sbjct: 118  MHLMLHEELVDYTKWVVLNDDEVKAREEF----VARINGVCKDLWPQCELKVFGSFLSGL 173

Query: 1266 SLPSSDVDLVVCLPPVRNLEPIKE-AGILEGRNGIKETCLQHAARYLANQEWVKSDSLKT 1324
            SLPS D+DL +   PV  +E ++  +  L  +  +KE  L+  AR               
Sbjct: 174  SLPSGDIDLSLQNVPVGAVEAVRMLSDKLLSQGQLKELELRETAR--------------- 218

Query: 1325 VENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSAS-PK 1383
                 +PI+ LV  +P                           HV  D+   D+S+  P 
Sbjct: 219  -----VPIVKLVDVLPP--------------------------HVEVDVSVHDNSSGEPD 247

Query: 1384 CSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRS 1443
             S+T                          T  +++E  E++PA  PL + LK +LA R 
Sbjct: 248  SSNT--------------------------TKFVIREGVEKYPAFKPLMIFLKTYLASRK 281

Query: 1444 LDQSYSGGLSSYCLMLLITRFLQH 1467
            L+ +++GG+ SY    L+  FLQ 
Sbjct: 282  LNLTFTGGVGSYLAACLVIAFLQQ 305


>gi|241855549|ref|XP_002416033.1| PAP-associated domain-containing protein, putative [Ixodes
            scapularis]
 gi|215510247|gb|EEC19700.1| PAP-associated domain-containing protein, putative [Ixodes
            scapularis]
          Length = 347

 Score = 80.1 bits (196), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 72/262 (27%), Positives = 108/262 (41%), Gaps = 94/262 (35%)

Query: 1213 LHEEIDSFCKQVA---AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
            LH+EI+ F + +    AE+  R       ++R+   +  LWP++   IFGS  TGL LP+
Sbjct: 1    LHDEIEEFYRYMQPSPAEHEMRLG----VIQRIKEVILSLWPQAEVEIFGSFRTGLYLPT 56

Query: 1270 SDVDLVVC----LPPVRNLE-PIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKT 1324
            SD+D+VV       P+  LE  +   GI E R                        S+K 
Sbjct: 57   SDIDVVVLGKWETLPMWTLEKALLTHGIAEPR------------------------SIKV 92

Query: 1325 VENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKC 1384
            ++  ++PI+ L                       A TT+K D                  
Sbjct: 93   LDKASVPIVKLT---------------------DARTTVKVD-----------------I 114

Query: 1385 SHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSL 1444
            S   ++ +K+A                     L+K   E+FPA   L LVLKQFL  R L
Sbjct: 115  SFNMNNGVKSAR--------------------LIKSFKEKFPALAKLVLVLKQFLLQRDL 154

Query: 1445 DQSYSGGLSSYCLMLLITRFLQ 1466
            ++ ++GG+SSY L+L+   FLQ
Sbjct: 155  NEVFTGGISSYSLILMTVSFLQ 176


>gi|348678978|gb|EGZ18795.1| hypothetical protein PHYSODRAFT_502025 [Phytophthora sojae]
          Length = 778

 Score = 80.1 bits (196), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 72/273 (26%), Positives = 112/273 (41%), Gaps = 64/273 (23%)

Query: 1207 SLMHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRS-RTNIFGSNATGL 1265
            ++M  +LH+EI  + +    E   ++P+I  A+  VT  +Q ++    +   +GS AT L
Sbjct: 288  AMMQVVLHDEIVDYYQHRRREAREKRPWIQKAIAMVTEEIQRVYNHCVQVEGYGSFATRL 347

Query: 1266 SLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTV 1325
               SSDVDLVV              G     N   + C+   A +L   +      +  +
Sbjct: 348  HGESSDVDLVVF-------------GATGEFNFTSQQCVAALAEHL--HKLPAFVDVSAI 392

Query: 1326 ENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCS 1385
               +IP++ +VV                                   +V +D      C 
Sbjct: 393  TRASIPLLKVVV-----------------------------------LVTID------CE 411

Query: 1386 HTSSDNIKAATSVRL----DISFKSPS---HTGLQTTDLVKELTEQFPASTPLALVLKQF 1438
             T S+N      V L    DI+F  P    H G+ +  LV+ L + F     L LVLK F
Sbjct: 412  ETQSENAAREGGVELRIPFDITFDEPHGIHHNGVASVTLVQGLVDAFYGLRELTLVLKHF 471

Query: 1439 LADRSLDQSYSGGLSSYCLMLLITRFLQHEHHL 1471
            L +R L+  Y GGLSSY L+L++   L+ +  L
Sbjct: 472  LVERGLNDPYVGGLSSYGLLLMVVYVLREQGAL 504


>gi|115673160|ref|XP_796681.2| PREDICTED: uncharacterized protein LOC592046 [Strongylocentrotus
            purpuratus]
          Length = 830

 Score = 79.7 bits (195), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 59/229 (25%), Positives = 93/229 (40%), Gaps = 77/229 (33%)

Query: 1238 AVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRN 1297
             V+R+   ++ +WP+++  I+GS  T L LP+SD+DLV+                     
Sbjct: 159  VVQRIQGIVRSIWPKAKVEIYGSTRTMLYLPTSDIDLVLF-------------------G 199

Query: 1298 GIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKED 1357
             I E+        L      +  S+K ++  ++PI+ L                      
Sbjct: 200  DIGESPFFRLGNELEKSGIAEQGSIKVLDKASVPIVKL---------------------- 237

Query: 1358 AAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDL 1417
                                           +DN+   T VR+DISF     TG     L
Sbjct: 238  -------------------------------TDNV---TKVRVDISFN--MQTGTDCAKL 261

Query: 1418 VKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            ++E   QFP+   +  VLKQFL  R L++ ++GG+SSY L+L+I  FLQ
Sbjct: 262  IEEFICQFPSLPFMVFVLKQFLLQRDLNEVWTGGISSYSLILMIVSFLQ 310


>gi|427795543|gb|JAA63223.1| Putative pap-associated domain-containing protein 5, partial
            [Rhipicephalus pulchellus]
          Length = 627

 Score = 79.7 bits (195), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 70/258 (27%), Positives = 108/258 (41%), Gaps = 86/258 (33%)

Query: 1213 LHEEIDSFCKQVA---AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
            LH+EI+ F + +    AE+  R   I    +R+   +  LWP++   IFGS  TGL LP+
Sbjct: 142  LHDEIEDFYRYMQPTPAEHQMRLGVI----QRIKDVILGLWPQAEVEIFGSFRTGLYLPT 197

Query: 1270 SDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETC-LQHAARYLANQEWVKSDSLKTVENT 1328
            SD+D+VV                     G  ET  +    + L +    +  S+K ++  
Sbjct: 198  SDIDVVVL--------------------GKWETLPMWTLEKALLSHGIAEPQSIKVLDKA 237

Query: 1329 AIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTS 1388
            ++PI+ L                       A TT+K D                      
Sbjct: 238  SVPIVKLT---------------------DAKTTVKVD---------------------- 254

Query: 1389 SDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSY 1448
                         ISF    + G+++  L++   E+FPA   L LVLKQFL  R L++ +
Sbjct: 255  -------------ISFN--MNNGVKSACLIQSFKEKFPALPKLVLVLKQFLLQRDLNEVF 299

Query: 1449 SGGLSSYCLMLLITRFLQ 1466
            +GG+SSY L+L+   FLQ
Sbjct: 300  TGGISSYSLILMTVSFLQ 317


>gi|119603156|gb|EAW82750.1| PAP associated domain containing 5, isoform CRA_c [Homo sapiens]
          Length = 371

 Score = 79.7 bits (195), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 68/255 (26%), Positives = 102/255 (40%), Gaps = 80/255 (31%)

Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
            LHEEI  F + ++      K  +   V R+   ++ LWP +   IFGS  TGL LP+SD+
Sbjct: 28   LHEEISDFYEYMSPRPEEEKMRME-VVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDI 86

Query: 1273 DLVVCLPPVRNLEPIKEAGILEGR-NGIKETCLQHAARYLANQEWVKSDSLKTVENTAIP 1331
            DLVV                  G+   +    L+ A R     +    DS+K ++   +P
Sbjct: 87   DLVVF-----------------GKWENLPLWTLEEALR---KHKVADEDSVKVLDKATVP 126

Query: 1332 IIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDN 1391
            II L                       + T +K D                  S    + 
Sbjct: 127  IIKLT---------------------DSFTEVKVD-----------------ISFNVQNG 148

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            ++AA                    DL+K+ T+++P    L LVLKQFL  R L++ ++GG
Sbjct: 149  VRAA--------------------DLIKDFTKKYPVLPYLVLVLKQFLLQRDLNEVFTGG 188

Query: 1452 LSSYCLMLLITRFLQ 1466
            + SY L L+   FLQ
Sbjct: 189  IGSYSLFLMAVSFLQ 203


>gi|270009939|gb|EFA06387.1| hypothetical protein TcasGA2_TC009265 [Tribolium castaneum]
          Length = 513

 Score = 79.3 bits (194), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 72/278 (25%), Positives = 111/278 (39%), Gaps = 88/278 (31%)

Query: 1195 EVQNCPTRKASLSLMHSLL--HEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPR 1252
            E   CP RK        +L  H+EID F + ++   T      +  V R+ + +   WP 
Sbjct: 91   EFGGCPWRKDPHVYARGILGLHQEIDHFYQYMSPTRTEHLVRDD-VVCRIKQIILSKWPE 149

Query: 1253 SRTNIFGSNATGLSLPSSDVDLVVCLP----PVRNLEPIKEAGILEGRNGIKETCLQHAA 1308
            ++  +FGS  TGL LP+SD+DLVV       P+R LE                       
Sbjct: 150  AQVEVFGSYRTGLYLPTSDIDLVVIGKWSNLPLRTLE----------------------Q 187

Query: 1309 RYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNH 1368
             +L N +  + +S+K ++  ++PI+ L                         T +K D  
Sbjct: 188  EFLDN-DVAQENSIKVLDKASVPIVKLT---------------------DKQTEIKVD-- 223

Query: 1369 VHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPAS 1428
                            S   S+ +K+A                    +L+K    QFP  
Sbjct: 224  ---------------ISFNMSNGVKSA--------------------ELIKTYIHQFPVL 248

Query: 1429 TPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
              L  VLKQFL +R L++ ++GG+SSY L+L+   FLQ
Sbjct: 249  PKLVYVLKQFLLERDLNEVFTGGISSYSLILMCISFLQ 286


>gi|119603155|gb|EAW82749.1| PAP associated domain containing 5, isoform CRA_b [Homo sapiens]
          Length = 374

 Score = 79.3 bits (194), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 68/255 (26%), Positives = 102/255 (40%), Gaps = 80/255 (31%)

Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
            LHEEI  F + ++      K  +   V R+   ++ LWP +   IFGS  TGL LP+SD+
Sbjct: 28   LHEEISDFYEYMSPRPEEEKMRME-VVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDI 86

Query: 1273 DLVVCLPPVRNLEPIKEAGILEGR-NGIKETCLQHAARYLANQEWVKSDSLKTVENTAIP 1331
            DLVV                  G+   +    L+ A R     +    DS+K ++   +P
Sbjct: 87   DLVVF-----------------GKWENLPLWTLEEALR---KHKVADEDSVKVLDKATVP 126

Query: 1332 IIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDN 1391
            II L                       + T +K D                  S    + 
Sbjct: 127  IIKLT---------------------DSFTEVKVD-----------------ISFNVQNG 148

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            ++AA                    DL+K+ T+++P    L LVLKQFL  R L++ ++GG
Sbjct: 149  VRAA--------------------DLIKDFTKKYPVLPYLVLVLKQFLLQRDLNEVFTGG 188

Query: 1452 LSSYCLMLLITRFLQ 1466
            + SY L L+   FLQ
Sbjct: 189  IGSYSLFLMAVSFLQ 203


>gi|294896867|ref|XP_002775743.1| PAP-associated domain-containing protein, putative [Perkinsus marinus
            ATCC 50983]
 gi|239882023|gb|EER07559.1| PAP-associated domain-containing protein, putative [Perkinsus marinus
            ATCC 50983]
          Length = 588

 Score = 79.3 bits (194), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 71/264 (26%), Positives = 110/264 (41%), Gaps = 81/264 (30%)

Query: 1209 MHSLLHEEIDSFCKQVAA---ENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGL 1265
            MH +LHEE+  + K V     E  AR+ +    V R+    + LWP+    +FGS  +GL
Sbjct: 118  MHLMLHEELVDYTKWVVLNDDEVKAREEF----VARINGVCKDLWPQCELKVFGSFLSGL 173

Query: 1266 SLPSSDVDLVVCLPPVRNLEPIKE-AGILEGRNGIKETCLQHAARYLANQEWVKSDSLKT 1324
            SLPS D+DL +   PV  +E ++  +  L  +  +KE  L+  AR               
Sbjct: 174  SLPSGDIDLSLQNVPVGAVEAVRMLSDKLLSQGQLKELELRETAR--------------- 218

Query: 1325 VENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSAS-PK 1383
                 +PI+ LV  +P                           HV  D+   D+S+  P 
Sbjct: 219  -----VPIVKLVDVLP--------------------------PHVEVDVSVHDNSSGEPD 247

Query: 1384 CSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRS 1443
             S+T                          T  +++E  E++PA  PL + LK +LA R 
Sbjct: 248  SSNT--------------------------TKFVIREGVEKYPAFKPLMIFLKTYLASRK 281

Query: 1444 LDQSYSGGLSSYCLMLLITRFLQH 1467
            L+ +++GG+ SY    L+  FLQ 
Sbjct: 282  LNLTFTGGVGSYLAACLVIAFLQQ 305


>gi|294929610|ref|XP_002779288.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
 gi|239888351|gb|EER11083.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
          Length = 589

 Score = 79.3 bits (194), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 71/264 (26%), Positives = 108/264 (40%), Gaps = 81/264 (30%)

Query: 1209 MHSLLHEEIDSFCKQVAA---ENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGL 1265
            MH +LHEE+  + K V     E  AR+ +    V R+    + LWP+    +FGS  +GL
Sbjct: 118  MHLMLHEELVDYTKWVVLNDDEVKAREEF----VARINGVCKDLWPQCELRVFGSFLSGL 173

Query: 1266 SLPSSDVDLVVCLPPVRNLEPIKE-AGILEGRNGIKETCLQHAARYLANQEWVKSDSLKT 1324
            SLPS D+DL +   PV  +E ++  +  L  +  +KE  L+  AR               
Sbjct: 174  SLPSGDIDLSLQNVPVGAVEAVRMLSDKLLSQGQLKELELRETAR--------------- 218

Query: 1325 VENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDS-ASPK 1383
                 +PI+ LV  +P                           HV  D+   D+S   P 
Sbjct: 219  -----VPIVKLVDVLP--------------------------PHVEVDVSVQDNSTGEPD 247

Query: 1384 CSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRS 1443
             S T                          T  +++E  E++PA  PL + LK +LA R 
Sbjct: 248  SSDT--------------------------TKFVIREGVEKYPAFKPLMIFLKTYLASRK 281

Query: 1444 LDQSYSGGLSSYCLMLLITRFLQH 1467
            L+ +++GG+ SY    L+  FLQ 
Sbjct: 282  LNLTFTGGVGSYLAACLVIAFLQQ 305


>gi|343172004|gb|AEL98706.1| Nucleotidyltransferase family protein, partial [Silene latifolia]
          Length = 501

 Score = 79.0 bits (193), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 69/271 (25%), Positives = 109/271 (40%), Gaps = 91/271 (33%)

Query: 1213 LHEEIDSFCKQVA---AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
            LH EI  FC  ++    E  +R    N A++RV+  ++ +WP  R  +FGS  TGL LPS
Sbjct: 123  LHNEILDFCDFLSPTPEEGRSR----NSAIQRVSDVIKYIWPNCRVEVFGSYRTGLYLPS 178

Query: 1270 SDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTA 1329
            SD+D+V                IL+      +  LQ  +R L+ +   K   ++ +    
Sbjct: 179  SDIDVV----------------ILDSNIKSPQIGLQALSRALSQKGIAK--KIQVIGKAR 220

Query: 1330 IPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSS 1389
            +PII  V +V                     T ++ D       V+ D    PK +    
Sbjct: 221  VPIIKFVEKV---------------------TDVQFD-------VSFDVDNGPKAA---- 248

Query: 1390 DNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYS 1449
                                      + +K++  + P   PL L+LK FL  R L++ YS
Sbjct: 249  --------------------------EYIKDVISRLPPLRPLCLILKVFLQQRELNEVYS 282

Query: 1450 GGLSSYCLMLLITRFLQH--------EHHLG 1472
            GG+ SY L+ ++   L+         EH+LG
Sbjct: 283  GGIGSYALLAMLIAMLKSVNEGQRYPEHNLG 313


>gi|334311788|ref|XP_003339660.1| PREDICTED: PAP-associated domain-containing protein 5-like
            [Monodelphis domestica]
          Length = 809

 Score = 79.0 bits (193), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 75/275 (27%), Positives = 108/275 (39%), Gaps = 93/275 (33%)

Query: 1202 RKASLSLMHS-----LLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTN 1256
            RK S   +HS      LHEEI  F + ++      K  +   V R+   ++ LWP +   
Sbjct: 341  RKFSTMPVHSTVPGTYLHEEISDFYEYMSPRPEEEKMRME-VVNRIENVIKELWPSADVQ 399

Query: 1257 IFGSNATGLSLPSSDVDLVVC-----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYL 1311
            IFGS  TGL LP+SD+DLVV      LP    L  ++EA                    L
Sbjct: 400  IFGSFKTGLYLPTSDIDLVVFGKWENLP----LWTLEEA--------------------L 435

Query: 1312 ANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHS 1371
               +    DS+K ++   +PII L                       + T +K D     
Sbjct: 436  RKHKVADEDSVKVLDKATVPIIKLT---------------------DSFTEVKVD----- 469

Query: 1372 DMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPL 1431
                         S    + +KAA                     L+K+ T+++P    L
Sbjct: 470  ------------ISFNVQNGVKAA--------------------QLIKDFTKKYPVLPYL 497

Query: 1432 ALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
             LVLKQFL  R L++ ++GG+ SY L L+   FLQ
Sbjct: 498  VLVLKQFLLQRDLNEVFTGGIGSYSLFLMAVSFLQ 532


>gi|350415058|ref|XP_003490519.1| PREDICTED: PAP-associated domain-containing protein 5-like [Bombus
            impatiens]
          Length = 572

 Score = 78.6 bits (192), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 75/278 (26%), Positives = 113/278 (40%), Gaps = 88/278 (31%)

Query: 1195 EVQNCPTRKASLSLMHSL--LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPR 1252
            E   CP R  +      +  LHEEI+ F   +   N      +   VKR+ + +  LW  
Sbjct: 101  EYGGCPWRVPNKHYSKGVIGLHEEIEDFFAYMCPSNEEHSLRMR-VVKRIEQVIYDLWQD 159

Query: 1253 SRTNIFGSNATGLSLPSSDVDLVVCLP----PVRNLEPIKEAGILEGRNGIKETCLQHAA 1308
            S+  +FGS  TGL LP+SD+DLVV       P+R LE                       
Sbjct: 160  SKVEVFGSFRTGLYLPTSDIDLVVIGMWTNLPLRTLE----------------------- 196

Query: 1309 RYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNH 1368
            R L +Q   +  S+K ++  ++PI+ L                 + KE    T +K D  
Sbjct: 197  RALLDQNIAEPSSIKVLDKASVPIVKL-----------------TDKE----TEIKVD-- 233

Query: 1369 VHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPAS 1428
                            S   ++ +K+A  +    SFK                 ++FP  
Sbjct: 234  ---------------ISFNMNNGVKSAELIN---SFK-----------------KRFPVL 258

Query: 1429 TPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
              L +VLKQFL  R L++ ++GG+SSY L+L+   FLQ
Sbjct: 259  EKLVMVLKQFLLQRDLNEVFTGGISSYSLILMTISFLQ 296


>gi|325185090|emb|CCA19582.1| conserved hypothetical protein [Albugo laibachii Nc14]
          Length = 328

 Score = 78.2 bits (191), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 67/256 (26%), Positives = 112/256 (43%), Gaps = 69/256 (26%)

Query: 1217 IDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVV 1276
            +++F + + AE    + +   A+  + + +Q +W  ++  I+GS  T L+LP+SD+D V+
Sbjct: 1    MEAFVEHLDAEVFYLEQHQRQAISTLQQVVQEIWQDAQIEIYGSTYTRLALPTSDIDCVL 60

Query: 1277 CLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLV 1336
              P   +  P                 LQ  A+ +A+Q   +   L+ + N  IP++ ++
Sbjct: 61   VSPSCEHESP--------------SFILQEIAQGIASQP--ELHRLELLTNAKIPVLKVM 104

Query: 1337 VEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAAT 1396
             +               PK+                 + LD +    C+H          
Sbjct: 105  YQ---------------PKDQM-------------RAILLDLT----CAH---------- 122

Query: 1397 SVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYC 1456
                     SP H+GL   DLV     + PA  PL LVLK+ L    L+ +++GGLSSY 
Sbjct: 123  ---------SPGHSGLSARDLVFSYQTRMPALRPLVLVLKRHLKALGLNCAFTGGLSSYA 173

Query: 1457 LMLLITRFLQH--EHH 1470
            L+LL+ RFLQ   +HH
Sbjct: 174  LVLLVVRFLQACGDHH 189


>gi|294879926|ref|XP_002768827.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
 gi|239871765|gb|EER01545.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
          Length = 509

 Score = 78.2 bits (191), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 71/264 (26%), Positives = 108/264 (40%), Gaps = 81/264 (30%)

Query: 1209 MHSLLHEEIDSFCKQVAA---ENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGL 1265
            MH +LHEE+  + K V     E  AR+ +    V R+    + LWP+    +FGS  +GL
Sbjct: 118  MHLMLHEELVDYTKWVVLNDDEVKAREEF----VARINGVCKDLWPQCELKVFGSFLSGL 173

Query: 1266 SLPSSDVDLVVCLPPVRNLEPIKE-AGILEGRNGIKETCLQHAARYLANQEWVKSDSLKT 1324
            SLPS D+DL +   PV  +E ++  +  L  +  +KE  L+  AR               
Sbjct: 174  SLPSGDIDLSLQNVPVGAVEAVRMLSDKLLSQGQLKELELRETAR--------------- 218

Query: 1325 VENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDS-ASPK 1383
                 +PI+ LV  +P                           HV  D+   D+S   P 
Sbjct: 219  -----VPIVKLVDLLP--------------------------PHVEVDVSVQDNSTGEPD 247

Query: 1384 CSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRS 1443
             S T                          T  +++E  E++PA  PL + LK +LA R 
Sbjct: 248  SSDT--------------------------TKFVMQEGVEKYPAFKPLMIFLKTYLASRK 281

Query: 1444 LDQSYSGGLSSYCLMLLITRFLQH 1467
            L+ +++GG+ SY    L+  FLQ 
Sbjct: 282  LNLTFTGGVGSYLAACLVIAFLQQ 305


>gi|343172002|gb|AEL98705.1| Nucleotidyltransferase family protein, partial [Silene latifolia]
          Length = 501

 Score = 77.8 bits (190), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 68/271 (25%), Positives = 110/271 (40%), Gaps = 91/271 (33%)

Query: 1213 LHEEIDSFCKQVA---AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
            LH+EI  FC  ++    E  +R    N A++RV+  ++ +WP  R  +FGS  TGL LPS
Sbjct: 123  LHKEILDFCDFLSPTPEEGRSR----NSAIQRVSDVIKYIWPNCRVEVFGSYRTGLYLPS 178

Query: 1270 SDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTA 1329
            SD+D+V                IL+      +  LQ  +R L+ +   K   ++ +    
Sbjct: 179  SDIDVV----------------ILDSNIKSPQIGLQALSRALSQKGIAK--KIQVIGKAR 220

Query: 1330 IPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSS 1389
            +PII  V +V                     T ++ D       V+ D    PK +    
Sbjct: 221  VPIIKFVEKV---------------------TDVQFD-------VSFDVDNGPKAA---- 248

Query: 1390 DNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYS 1449
                                      + ++++  + P   PL L+LK FL  R L++ YS
Sbjct: 249  --------------------------EYIQDVISRLPPLRPLCLILKVFLQQRELNEVYS 282

Query: 1450 GGLSSYCLMLLITRFLQH--------EHHLG 1472
            GG+ SY L+ ++   L+         EH+LG
Sbjct: 283  GGIGSYALLAMLIAMLKSVNEGQRYPEHNLG 313


>gi|255566595|ref|XP_002524282.1| nucleic acid binding protein, putative [Ricinus communis]
 gi|223536473|gb|EEF38121.1| nucleic acid binding protein, putative [Ricinus communis]
          Length = 526

 Score = 77.8 bits (190), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 68/267 (25%), Positives = 109/267 (40%), Gaps = 89/267 (33%)

Query: 1213 LHEEIDSFCKQVA---AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
            LH+EI  FC  ++    E  AR    N AVK V   ++ +WP  +  +FGS  TGL LP+
Sbjct: 125  LHKEIVDFCDFLSPTPEEEDAR----NTAVKCVFDVIKYIWPNCKVEVFGSYKTGLYLPT 180

Query: 1270 SDVDLVVCLPPVRNLEPIKEAGILEGRNGIK--ETCLQHAARYLANQEWVKSDSLKTVEN 1327
            SD+D+V+                   R+GIK  +  LQ  +R L+ +   K   ++ +  
Sbjct: 181  SDIDVVIF------------------RSGIKNPQIGLQALSRALSQKGIAK--KIQVIAK 220

Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
              +PI+  V                        + +  D       ++ D    PK +  
Sbjct: 221  ARVPIVKFV---------------------EKRSGVSFD-------ISFDVDNGPKAAEF 252

Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
              D ++                              ++PA  PL+L+LK FL  R L++ 
Sbjct: 253  IKDAVR------------------------------KWPALRPLSLILKVFLQQRELNEV 282

Query: 1448 YSGGLSSYCLMLLITRFLQ--HEHHLG 1472
            YSGG+ SY L+ ++   L+   EH+LG
Sbjct: 283  YSGGIGSYALLTMLMAVLKASSEHNLG 309


>gi|384248025|gb|EIE21510.1| Nucleotidyltransferase, partial [Coccomyxa subellipsoidea C-169]
          Length = 291

 Score = 77.4 bits (189), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 66/262 (25%), Positives = 107/262 (40%), Gaps = 83/262 (31%)

Query: 1216 EIDSFCKQVA---AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
            E+  FC+ +A   +E  +R+     A+ RVT ++Q +WP +   +FGS  TGL LPSSD+
Sbjct: 1    ELVEFCRFLAPSSSELASRQA----ALARVTDAVQSIWPSASVQVFGSFVTGLYLPSSDM 56

Query: 1273 DLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPI 1332
            D+VV                ++ + G   + L+  A  L  +   K  +++ +    +PI
Sbjct: 57   DIVV----------------MDSQCGDIRSALKAVANSLVRKNMAK--NIQIIAKAKVPI 98

Query: 1333 IMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNI 1392
            I                     K +   + +K D                          
Sbjct: 99   I---------------------KFEDIESGIKFD-------------------------- 111

Query: 1393 KAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGL 1452
                     ISF + +  G +  D VK L ++ P   PL L+LK FL  R L++ Y GG+
Sbjct: 112  ---------ISFDAAN--GPEAADFVKGLMQRLPPMRPLVLILKVFLHQRELNEVYQGGI 160

Query: 1453 SSYCLMLLITRFLQHEHHLGRP 1474
             SY L++++  FL      G P
Sbjct: 161  GSYALLVMVAGFLLLHPSRGGP 182


>gi|301097760|ref|XP_002897974.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262106419|gb|EEY64471.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 735

 Score = 76.6 bits (187), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 68/269 (25%), Positives = 107/269 (39%), Gaps = 69/269 (25%)

Query: 1207 SLMHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRS-RTNIFGSNATGL 1265
            ++M  +LH+EI  F      E   ++P++  A+  VT  + +LW    +   +GS AT L
Sbjct: 286  AMMQVVLHDEIVDFYNHHRREARDKRPWMQQAIAMVTEEIDLLWNHCVQVEGYGSFATRL 345

Query: 1266 SLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTV 1325
               SSDVDLVV              G  E  N   + C++  A ++  +E      +  +
Sbjct: 346  HGESSDVDLVV-------------FGATEEFNFTSQQCVEALADHV--RELAAYVDVSAI 390

Query: 1326 ENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCS 1385
               +IP++ +VV                                   +V +D        
Sbjct: 391  TRASIPLLKVVV-----------------------------------LVTIDG------- 408

Query: 1386 HTSSDNIKAATSVRLDISFKSPS---HTGLQTTDLVKELTEQFPASTPLALVLKQFLADR 1442
                        +  DI+F  P    H G+ +  LV+ L   F     L LVLK FL +R
Sbjct: 409  --------VELRIPFDITFDEPHGIHHNGVASVTLVQGLASAFYGLRELTLVLKHFLVER 460

Query: 1443 SLDQSYSGGLSSYCLMLLITRFLQHEHHL 1471
             L+  Y GGLSSY L+L++   L+ +  L
Sbjct: 461  GLNDPYVGGLSSYGLLLMVVYVLREQGAL 489


>gi|348512677|ref|XP_003443869.1| PREDICTED: DNA polymerase sigma-like [Oreochromis niloticus]
          Length = 805

 Score = 76.3 bits (186), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 64/261 (24%), Positives = 102/261 (39%), Gaps = 92/261 (35%)

Query: 1213 LHEEIDSFCKQVA---AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
            LHEE+  F K ++    E   RK  +N    R+   ++ LWP     I+GS +TGL LP+
Sbjct: 253  LHEEVMDFYKYISPRPEEAAMRKEVVN----RIEVVIKELWPTVDVQIYGSFSTGLYLPT 308

Query: 1270 SDVDLVVC----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTV 1325
            SD+DL+V      PP++ LE                       + L   E    +S+K +
Sbjct: 309  SDIDLMVTGKWERPPLQELE-----------------------QALRKHEVADPNSIKVL 345

Query: 1326 ENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCS 1385
            +   +PII L  +                                               
Sbjct: 346  DKATVPIIKLTDQ----------------------------------------------- 358

Query: 1386 HTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLD 1445
                 ++K   S  ++   K+ S         +K   +++P    L  VLKQFL  R L+
Sbjct: 359  ---RTDVKVDISFNVETGVKAAS--------FIKGYMKKYPVLPYLIFVLKQFLLQRDLN 407

Query: 1446 QSYSGGLSSYCLMLLITRFLQ 1466
            + ++GG+SSY L+L++  FLQ
Sbjct: 408  EVFTGGISSYSLILMVISFLQ 428


>gi|297698707|ref|XP_002826459.1| PREDICTED: PAP-associated domain-containing protein 5 [Pongo abelii]
          Length = 588

 Score = 76.3 bits (186), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 75/287 (26%), Positives = 113/287 (39%), Gaps = 100/287 (34%)

Query: 1185 PDVAFPLQPLEVQNCPTRKASLSLMHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTR 1244
            P+VA      ++Q C + KA       LLHEEI  F + ++      K  +   V R+  
Sbjct: 73   PEVA-----TQLQTCSSFKA-------LLHEEISDFYEYMSPRPEEEKMRME-VVNRIES 119

Query: 1245 SLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVC-----LPPVRNLEPIKEAGILEGRNGI 1299
             ++ LWP +   IFGS  TGL LP+SD+DLVV      LP    L  ++EA         
Sbjct: 120  VIKELWPSADVQIFGSFKTGLYLPTSDIDLVVFGKWENLP----LWTLEEA--------- 166

Query: 1300 KETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAA 1359
                       L   +    DS+K ++   +PII L                       +
Sbjct: 167  -----------LRKHKVADEDSVKVLDKATVPIIKLT---------------------DS 194

Query: 1360 HTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVK 1419
             T +K D                  S    + ++AA  ++                    
Sbjct: 195  FTEVKVD-----------------ISFNVQNGVRAADLIK-------------------- 217

Query: 1420 ELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            + T+++P    L LVLKQFL  R L++ ++GG+ SY L L+   FLQ
Sbjct: 218  DFTKKYPVLPYLVLVLKQFLLQRDLNEVFTGGIGSYSLFLMAVSFLQ 264


>gi|66821005|ref|XP_644037.1| hypothetical protein DDB_G0274847 [Dictyostelium discoideum AX4]
 gi|60472363|gb|EAL70316.1| hypothetical protein DDB_G0274847 [Dictyostelium discoideum AX4]
          Length = 1880

 Score = 76.3 bits (186), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 65/260 (25%), Positives = 105/260 (40%), Gaps = 69/260 (26%)

Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
            LH+EI  F   V           +  +  +   ++ LW  +  +I+GS  TGL LPSSD+
Sbjct: 1473 LHQEILDFVSDVGERTYPHVQSCHHVIGLIRSVVKKLWSNADLDIYGSFMTGLWLPSSDI 1532

Query: 1273 DLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPI 1332
            D+VV       ++P     +L+         L+   R   N   +   ++  + +  IP+
Sbjct: 1533 DIVVNYGKNMAIKPKNAQFLLK--------VLEKQFRTDLNGMIL---TMLCIPSAKIPV 1581

Query: 1333 IMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDS--ASPKCSHTSSD 1390
            I LV                                  ++ +++D S   SP   HT   
Sbjct: 1582 IKLVT---------------------------------TENISVDISFRESPTSVHT--- 1605

Query: 1391 NIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSG 1450
                                G+   DLV +   +     PLA+VLK FL +R L+ +Y+G
Sbjct: 1606 --------------------GIAARDLVADCVREIIGLYPLAIVLKWFLRERGLNNTYTG 1645

Query: 1451 GLSSYCLMLLITRFLQHEHH 1470
            GLSSYCL+L++  FL++  H
Sbjct: 1646 GLSSYCLVLMLISFLKNNEH 1665


>gi|21240658|gb|AAM44369.1| putative protein contains similarity to topoisomerase-related
            function protein [Dictyostelium discoideum]
          Length = 1880

 Score = 76.3 bits (186), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 65/260 (25%), Positives = 105/260 (40%), Gaps = 69/260 (26%)

Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
            LH+EI  F   V           +  +  +   ++ LW  +  +I+GS  TGL LPSSD+
Sbjct: 1473 LHQEILDFVSDVGERTYPHVQSCHHVIGLIRSVVKKLWSNADLDIYGSFMTGLWLPSSDI 1532

Query: 1273 DLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPI 1332
            D+VV       ++P     +L+         L+   R   N   +   ++  + +  IP+
Sbjct: 1533 DIVVNYGKNMAIKPKNAQFLLK--------VLEKQFRTDLNGMIL---TMLCIPSAKIPV 1581

Query: 1333 IMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDS--ASPKCSHTSSD 1390
            I LV                                  ++ +++D S   SP   HT   
Sbjct: 1582 IKLVT---------------------------------TENISVDISFRESPTSVHT--- 1605

Query: 1391 NIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSG 1450
                                G+   DLV +   +     PLA+VLK FL +R L+ +Y+G
Sbjct: 1606 --------------------GIAARDLVADCVREIIGLYPLAIVLKWFLRERGLNNTYTG 1645

Query: 1451 GLSSYCLMLLITRFLQHEHH 1470
            GLSSYCL+L++  FL++  H
Sbjct: 1646 GLSSYCLVLMLISFLKNNEH 1665


>gi|330792667|ref|XP_003284409.1| hypothetical protein DICPUDRAFT_93688 [Dictyostelium purpureum]
 gi|325085656|gb|EGC39059.1| hypothetical protein DICPUDRAFT_93688 [Dictyostelium purpureum]
          Length = 1460

 Score = 75.9 bits (185), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 69/270 (25%), Positives = 107/270 (39%), Gaps = 83/270 (30%)

Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
            LH+EI  F   V           +  +  +   ++ LW  +  ++FGS  TGL LPSSD+
Sbjct: 1067 LHQEILDFVSDVGERTYPHVQSCHQVIGWIRAVVKKLWSHADLDLFGSFMTGLWLPSSDI 1126

Query: 1273 DLVVCLPPVRNLEPIKEAGILEGRN-GIKETCLQHAARYLANQEWVKSD------SLKTV 1325
            D+VV                  G N  IK    Q   + L  Q  +++D      S+  +
Sbjct: 1127 DIVVNY----------------GNNMSIKPKNAQFLLKVLEKQ--IRNDLDGFILSMVCI 1168

Query: 1326 ENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDS--ASPK 1383
             +  IP+I LV                                  ++ +++D S   SP 
Sbjct: 1169 PSAKIPVIKLVT---------------------------------TENISVDISFRESPT 1195

Query: 1384 CSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRS 1443
              HT                       G+   DL+ +  +      PLA+VLK FL +R 
Sbjct: 1196 SIHT-----------------------GIAARDLIADCVKDVVGLYPLAIVLKWFLRERG 1232

Query: 1444 LDQSYSGGLSSYCLMLLITRFLQHEHHLGR 1473
            L+ +Y+GGLSSYCL+L++  FL++  H  R
Sbjct: 1233 LNNTYTGGLSSYCLVLMLISFLKNNEHCPR 1262


>gi|18423551|ref|NP_568798.1| Nucleotidyltransferase family protein [Arabidopsis thaliana]
 gi|27754278|gb|AAO22592.1| unknown protein [Arabidopsis thaliana]
 gi|332009022|gb|AED96405.1| Nucleotidyltransferase family protein [Arabidopsis thaliana]
          Length = 530

 Score = 75.5 bits (184), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 62/268 (23%), Positives = 106/268 (39%), Gaps = 85/268 (31%)

Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
            LH+EI  FC        A K   + AV+ V+  ++ +WP  +  +FGS  TGL LP+SD+
Sbjct: 121  LHKEIVDFC-DFLLPTQAEKAERDAAVESVSSVIKYIWPSCKVEVFGSYKTGLYLPTSDI 179

Query: 1273 DLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPI 1332
            D+V+            E+G+   + G++      + R +A        +L  +    +PI
Sbjct: 180  DVVIL-----------ESGLTNPQLGLRALSRALSQRGIAK-------NLLVIAKARVPI 221

Query: 1333 IMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNI 1392
            I  V                           +  +++  D+ + D    PK +       
Sbjct: 222  IKFV---------------------------EKKSNIAFDL-SFDMENGPKAAE------ 247

Query: 1393 KAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGL 1452
                                     +++   + P   PL L+LK FL  R L++ YSGG+
Sbjct: 248  ------------------------FIQDAVSKLPPLRPLCLILKVFLQQRELNEVYSGGI 283

Query: 1453 SSYCLMLLITRFLQH--------EHHLG 1472
             SY L+ ++  FL++        EH+LG
Sbjct: 284  GSYALLAMLIAFLKYLKDGRSAPEHNLG 311


>gi|405123317|gb|AFR98082.1| DNA polymerase sigma [Cryptococcus neoformans var. grubii H99]
          Length = 649

 Score = 75.1 bits (183), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 65/264 (24%), Positives = 111/264 (42%), Gaps = 58/264 (21%)

Query: 1210 HSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
             S L EEI++F K V+      +  + + ++ +TR++  LWP +    FGS  T L LP 
Sbjct: 5    QSCLTEEINAFYKYVSPTREEFEVRL-FMIELITRTINKLWPEAEVTPFGSWQTQLYLPQ 63

Query: 1270 SDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTA 1329
             D+DLVV    + +                K+  L    +  A ++   +D +  +    
Sbjct: 64   GDIDLVVAHKYLSDAN--------------KQRLLAELGK--AMRQANITDVVAIIARAR 107

Query: 1330 IPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSS 1389
            +PII  V                         TL+ ++HV S    L DS+         
Sbjct: 108  VPIIKFV-------------------------TLEGESHVTS----LADSSKQG------ 132

Query: 1390 DNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYS 1449
                A   + +DIS       G+    ++ +  +  P +  L L++K FL+ RS+++ Y+
Sbjct: 133  ----AIGKINVDISLNQ--GNGVTAGKIINQYLDALPGARQLILIVKYFLSQRSMNEVYT 186

Query: 1450 GGLSSYCLMLLITRFLQHEHHLGR 1473
            GGL SY ++ ++  FLQ    L R
Sbjct: 187  GGLGSYSVICMVISFLQLHPKLRR 210


>gi|168031583|ref|XP_001768300.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162680478|gb|EDQ66914.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 787

 Score = 74.7 bits (182), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 70/257 (27%), Positives = 105/257 (40%), Gaps = 83/257 (32%)

Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
            LH+EI  FC+ VA     ++     AV+RV+  +Q +WP S+  +FGS ATGL LP+SDV
Sbjct: 203  LHQEIVDFCEFVAPTEEEQQ-MRETAVERVSGVVQSIWPHSQVKVFGSFATGLYLPTSDV 261

Query: 1273 DLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPI 1332
            D+VV            ++G    ++G+K       A+ L      K  +++ +    +PI
Sbjct: 262  DVVVL-----------DSGCTALQDGLKAL-----AKALTRGHVGK--NIQVIGKARVPI 303

Query: 1333 IMLVVEV---PHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSS 1389
            I  V  V   P D                               ++ D +  P+     +
Sbjct: 304  IKFVETVSNIPFD-------------------------------ISFDVANGPEA----A 328

Query: 1390 DNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYS 1449
            D IKAA                              P   PL LVLK FL  R L++ Y 
Sbjct: 329  DFIKAAMGA--------------------------IPPLRPLCLVLKIFLQQRELNEVYQ 362

Query: 1450 GGLSSYCLMLLITRFLQ 1466
            GG+ SY L++++   LQ
Sbjct: 363  GGIGSYALLVMLLTHLQ 379


>gi|410983511|ref|XP_003998082.1| PREDICTED: PAP-associated domain-containing protein 5 [Felis catus]
          Length = 514

 Score = 74.7 bits (182), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 69/269 (25%), Positives = 106/269 (39%), Gaps = 88/269 (32%)

Query: 1203 KASLSLMHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNA 1262
            + +L L  + LHEEI  F + ++      K  +   V R+   ++ LWP +   IFGS  
Sbjct: 52   EVTLELSKADLHEEISDFYEYMSPRPEEEKMRME-VVNRIESVIKELWPSADVQIFGSFK 110

Query: 1263 TGLSLPSSDVDLVVC-----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWV 1317
            TGL LP+SD+DLVV      LP    L  ++EA                    L   +  
Sbjct: 111  TGLYLPTSDIDLVVFGKWENLP----LWTLEEA--------------------LRKHKVA 146

Query: 1318 KSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALD 1377
              DS+K ++   +PII L                       + T +K D           
Sbjct: 147  DEDSVKVLDKATVPIIKLT---------------------DSFTEVKVD----------- 174

Query: 1378 DSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQ 1437
                   S    + ++AA  ++                    + T+++P    L LVLKQ
Sbjct: 175  ------ISFNVQNGVRAADLIK--------------------DFTKKYPVLPYLVLVLKQ 208

Query: 1438 FLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            FL  R L++ ++GG+ SY L L+   FLQ
Sbjct: 209  FLLQRDLNEVFTGGIGSYSLFLMAVSFLQ 237


>gi|449446931|ref|XP_004141224.1| PREDICTED: PAP-associated domain-containing protein 5-like [Cucumis
            sativus]
          Length = 544

 Score = 74.3 bits (181), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 71/271 (26%), Positives = 112/271 (41%), Gaps = 91/271 (33%)

Query: 1213 LHEEIDSFCKQVA---AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
            LH+EI  FC+ ++    E  AR    + AV+RV   ++ +WP  +  +FGS  TGL LP+
Sbjct: 126  LHKEIVDFCEFLSPTEEERVAR----DSAVERVFSVVKHIWPHCKVEVFGSFQTGLYLPT 181

Query: 1270 SDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTA 1329
            SD+D+V+             +GI + + G     LQ  +R L+ +   K   ++ +    
Sbjct: 182  SDIDVVIL-----------GSGIPKPQLG-----LQALSRALSQKGIAK--KIQVIGKAR 223

Query: 1330 IPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSS 1389
            +PII  +                        + +  D       ++ D    PK    ++
Sbjct: 224  VPIIKFI---------------------EKQSGISFD-------ISFDVQNGPK----AA 251

Query: 1390 DNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYS 1449
            D IK A S                          ++P   PL L+LK FL  R L++ YS
Sbjct: 252  DFIKGAVS--------------------------KWPPLRPLCLILKVFLQQRELNEVYS 285

Query: 1450 GGLSSYCLMLLITRFLQH--------EHHLG 1472
            GGL SY L+ ++   LQ         EH+LG
Sbjct: 286  GGLGSYALLTMLMAMLQSINVPPSSLEHNLG 316


>gi|119910013|ref|XP_001256516.1| PREDICTED: PAP-associated domain-containing protein 5 [Bos taurus]
 gi|297485254|ref|XP_002694925.1| PREDICTED: PAP-associated domain-containing protein 5 [Bos taurus]
 gi|296478153|tpg|DAA20268.1| TPA: DNA polymerase sigma-like [Bos taurus]
          Length = 467

 Score = 74.3 bits (181), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 68/260 (26%), Positives = 102/260 (39%), Gaps = 88/260 (33%)

Query: 1212 LLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSD 1271
            LLHEEI  F + ++      K  +   V R+   ++ LWP +   IFGS  TGL LP+SD
Sbjct: 14   LLHEEISDFYEYMSPRPEEEKMRME-VVNRIESVIKELWPSADVQIFGSFKTGLYLPTSD 72

Query: 1272 VDLVVC-----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVE 1326
            +DLVV      LP    L  ++EA                    L   +    DS+K ++
Sbjct: 73   IDLVVFGKWENLP----LWTLEEA--------------------LRKHKVADEDSVKVLD 108

Query: 1327 NTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSH 1386
               +PII L                       + T +K D                  S 
Sbjct: 109  KATVPIIKLT---------------------DSFTEVKVD-----------------ISF 130

Query: 1387 TSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQ 1446
               + ++AA  ++                    + T+++P    L LVLKQFL  R L++
Sbjct: 131  NVQNGVRAADLIK--------------------DFTKKYPVLPYLVLVLKQFLLQRDLNE 170

Query: 1447 SYSGGLSSYCLMLLITRFLQ 1466
             ++GG+ SY L L+   FLQ
Sbjct: 171  VFTGGIGSYSLFLMAVSFLQ 190


>gi|145479659|ref|XP_001425852.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124392924|emb|CAK58454.1| unnamed protein product [Paramecium tetraurelia]
          Length = 1013

 Score = 74.3 bits (181), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 65/262 (24%), Positives = 114/262 (43%), Gaps = 52/262 (19%)

Query: 1216 EIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLW---PRSRTNIFGSNATGLSLPSSDV 1272
            +I  F   + +E     P+   A  RV   +Q ++   P     +FGS ATGL+L  SD+
Sbjct: 670  DILDFTDNIMSEYENMLPFRLLAFDRVKSVIQKVFLGIPDGMIMLFGSCATGLALIDSDI 729

Query: 1273 DLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPI 1332
            D+ +    V N            RN +K +   +       ++WV   +   + N+A+PI
Sbjct: 730  DIGINGLEVYN------------RNMLK-SHFDNLFFEFTRKKWVVKAN--PIFNSAVPI 774

Query: 1333 IMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNI 1392
            I L ++   ++      +                         LDD    +         
Sbjct: 775  IKLEIDPQINIFEYEGRN-------------------------LDDQ---QIQLWKKLKQ 806

Query: 1393 KAATSVRLDISFK------SPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQ 1446
            K  + +++DISF        P+H G  TTDLVK+  E++P+   + L+LK  +    L +
Sbjct: 807  KLKSGIKVDISFNFNGNGNYPTHIGSITTDLVKKWMEEYPSLQQIVLILKSMIKKLKLSE 866

Query: 1447 SYSGGLSSYCLMLLITRFLQHE 1468
            SY+GGLSSY L++++  +++ +
Sbjct: 867  SYTGGLSSYSLIIMVYSYMREQ 888


>gi|145530896|ref|XP_001451220.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124418864|emb|CAK83823.1| unnamed protein product [Paramecium tetraurelia]
          Length = 367

 Score = 73.9 bits (180), Expect = 7e-10,   Method: Composition-based stats.
 Identities = 35/68 (51%), Positives = 42/68 (61%)

Query: 1399 RLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLM 1458
            R+DISF+ P H G+Q   L+K      P   PL +VLKQFL+  SL   Y GGLSSY L 
Sbjct: 176  RIDISFQEPRHNGIQCVSLIKSYLNYLPELRPLTMVLKQFLSQSSLLDPYQGGLSSYGLN 235

Query: 1459 LLITRFLQ 1466
            L+I  FLQ
Sbjct: 236  LMIISFLQ 243


>gi|363738258|ref|XP_414098.3| PREDICTED: PAP-associated domain-containing protein 5 [Gallus gallus]
          Length = 534

 Score = 73.9 bits (180), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 69/268 (25%), Positives = 107/268 (39%), Gaps = 88/268 (32%)

Query: 1204 ASLSLMHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNAT 1263
            A  +++ ++LHEEI  F K ++  +   +  +   V R+   ++ LWP +   IFGS  T
Sbjct: 25   AIAAVVKAILHEEIIDFYKYMSPRHEEERMRME-VVNRIENVIKELWPNADVQIFGSFKT 83

Query: 1264 GLSLPSSDVDLVV-----CLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVK 1318
            GL LP+SD+DLVV      LP    L  ++EA                    L       
Sbjct: 84   GLYLPTSDIDLVVFGKWETLP----LWTLEEA--------------------LRKHNVAD 119

Query: 1319 SDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDD 1378
             +S+K ++   +PII L                       + T +K D            
Sbjct: 120  ENSVKVLDKATVPIIKLT---------------------DSFTEVKVD------------ 146

Query: 1379 SASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQF 1438
                  S    + +KAA  ++                D +K    ++P    L LVLKQF
Sbjct: 147  -----ISFNVQNGVKAAQLIK----------------DFIK----KYPVLPYLVLVLKQF 181

Query: 1439 LADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            L  R L++ ++GG+ SY L L+   FLQ
Sbjct: 182  LLQRDLNEVFTGGIGSYSLFLMAVSFLQ 209


>gi|344289184|ref|XP_003416325.1| PREDICTED: PAP-associated domain-containing protein 5-like [Loxodonta
            africana]
          Length = 595

 Score = 73.9 bits (180), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 69/264 (26%), Positives = 103/264 (39%), Gaps = 88/264 (33%)

Query: 1208 LMHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSL 1267
            L   LLHEEI  F + ++      K  +   V R+   ++ LWP +   IFGS  TGL L
Sbjct: 91   LSSELLHEEISDFYEYMSPRPEEEKMRME-VVNRIESVIKELWPSADVQIFGSFKTGLYL 149

Query: 1268 PSSDVDLVVC-----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSL 1322
            P+SD+DLVV      LP    L  ++EA                    L   +    DS+
Sbjct: 150  PTSDIDLVVFGKWENLP----LWTLEEA--------------------LRKHKVADEDSV 185

Query: 1323 KTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASP 1382
            K ++   +PII L                       + T +K D                
Sbjct: 186  KVLDKATVPIIKLT---------------------DSFTEVKVD---------------- 208

Query: 1383 KCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADR 1442
              S    + ++AA  ++                    + T+++P    L LVLKQFL  R
Sbjct: 209  -ISFNVQNGVRAADLIK--------------------DFTKKYPVLPYLVLVLKQFLLQR 247

Query: 1443 SLDQSYSGGLSSYCLMLLITRFLQ 1466
             L++ ++GG+ SY L L+   FLQ
Sbjct: 248  DLNEVFTGGIGSYSLFLMAVSFLQ 271


>gi|395505923|ref|XP_003757286.1| PREDICTED: PAP-associated domain-containing protein 5, partial
            [Sarcophilus harrisii]
          Length = 615

 Score = 73.9 bits (180), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 68/259 (26%), Positives = 101/259 (38%), Gaps = 88/259 (33%)

Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
            LHEEI  F + ++      K  +   V R+   ++ LWP +   IFGS  TGL LP+SD+
Sbjct: 116  LHEEISDFYEYMSPRPEEEKMRME-VVNRIENVIKELWPSADVQIFGSFKTGLYLPTSDI 174

Query: 1273 DLVVC-----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
            DLVV      LP    L  ++EA                    L   +    DS+K ++ 
Sbjct: 175  DLVVFGKWENLP----LWTLEEA--------------------LRKHKVADEDSVKVLDK 210

Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
              +PII L                       + T +K D                  S  
Sbjct: 211  ATVPIIKLT---------------------DSFTEVKVD-----------------ISFN 232

Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
              + +KAA  ++                    + T+++P    L LVLKQFL  R L++ 
Sbjct: 233  VQNGVKAAQLIK--------------------DFTKKYPVLPYLVLVLKQFLLQRDLNEV 272

Query: 1448 YSGGLSSYCLMLLITRFLQ 1466
            ++GG+ SY L L+   FLQ
Sbjct: 273  FTGGIGSYSLFLMAVSFLQ 291


>gi|58260578|ref|XP_567699.1| hypothetical protein CNK02250 [Cryptococcus neoformans var.
            neoformans JEC21]
 gi|57229780|gb|AAW46182.1| hypothetical protein CNK02250 [Cryptococcus neoformans var.
            neoformans JEC21]
          Length = 779

 Score = 73.9 bits (180), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 73/314 (23%), Positives = 123/314 (39%), Gaps = 67/314 (21%)

Query: 1169 QDHLIAISQLARDQEHPDVAFPLQPLEVQNCPTRKASLSLMHS---------LLHEEIDS 1219
            Q+H    S   RD      A P    E Q    R A  + +           LL EEI++
Sbjct: 90   QEHKARKSDRKRDNSRGIEAGPRNKKEEQKAAERHAPWTELVDVDLCRDPVDLLTEEINA 149

Query: 1220 FCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLP 1279
            F K V+      +  + + ++ +TR++  LWP +    FGS  T L LP  D+DLVV   
Sbjct: 150  FYKYVSPTREEFEVRL-FMIELITRTINKLWPEAEVTPFGSWQTQLYLPQGDIDLVVAHK 208

Query: 1280 PVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEV 1339
             + +                K+  L    +  A ++   +D +  +    +PII  V   
Sbjct: 209  YLSDAN--------------KQRLLAELGK--AMRQANITDVVAIIARARVPIIKFV--- 249

Query: 1340 PHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVR 1399
                                  TL+  +HV S              + S    +    + 
Sbjct: 250  ----------------------TLEGKSHVSS------------LEYFSKQ--EGVGKIN 273

Query: 1400 LDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLML 1459
            +DIS    +  G+    ++ +  +  P +  L L++K FL+ RS+++ Y+GGL SY ++ 
Sbjct: 274  VDISLNQAN--GVTAGKIINQYLDALPGARQLILIVKYFLSQRSMNEVYTGGLGSYSVIC 331

Query: 1460 LITRFLQHEHHLGR 1473
            ++  FLQ    L R
Sbjct: 332  MVISFLQLHPKLRR 345


>gi|9759071|dbj|BAB09549.1| unnamed protein product [Arabidopsis thaliana]
          Length = 533

 Score = 73.6 bits (179), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 59/259 (22%), Positives = 104/259 (40%), Gaps = 77/259 (29%)

Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
            LH+EI  FC  +     A K   + AV+ V+  ++ +WP  +  +FGS  TGL LP+SD+
Sbjct: 121  LHKEIVDFCDFLLP-TQAEKAERDAAVESVSSVIKYIWPSCKVEVFGSYKTGLYLPTSDI 179

Query: 1273 DLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPI 1332
            D+V+            E+G+   + G++      + R +A        +L  +    +PI
Sbjct: 180  DVVIL-----------ESGLTNPQLGLRALSRALSQRGIAK-------NLLVIAKARVPI 221

Query: 1333 IMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNI 1392
            I  V                           +  +++  D+ + D    PK +       
Sbjct: 222  IKFV---------------------------EKKSNIAFDL-SFDMENGPKAAE------ 247

Query: 1393 KAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGL 1452
                                     +++   + P   PL L+LK FL  R L++ YSGG+
Sbjct: 248  ------------------------FIQDAVSKLPPLRPLCLILKVFLQQRELNEVYSGGI 283

Query: 1453 SSYCLMLLITRFLQHEHHL 1471
             SY L+ ++  FL+ + +L
Sbjct: 284  GSYALLAMLIAFLKVQVYL 302


>gi|297792777|ref|XP_002864273.1| hypothetical protein ARALYDRAFT_495454 [Arabidopsis lyrata subsp.
            lyrata]
 gi|297310108|gb|EFH40532.1| hypothetical protein ARALYDRAFT_495454 [Arabidopsis lyrata subsp.
            lyrata]
          Length = 530

 Score = 73.6 bits (179), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 63/268 (23%), Positives = 105/268 (39%), Gaps = 85/268 (31%)

Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
            LH+EI  FC  +     A K   + AV+ V+  +  +WP  +  +FGS  TGL LP+SD+
Sbjct: 121  LHKEIVDFCDFLLP-TQAEKAERDAAVESVSSVITYIWPSCKVEVFGSYKTGLYLPTSDI 179

Query: 1273 DLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPI 1332
            D+V+            E+G+   + G++      + R +A        +L  +    +PI
Sbjct: 180  DVVIL-----------ESGLTNPQLGLRALSRALSQRGIAK-------NLVVIAKARVPI 221

Query: 1333 IMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNI 1392
            I  V                           +  +++  D+ + D    PK +    D +
Sbjct: 222  IKFV---------------------------EKKSNIAFDL-SFDMENGPKAAEFIQDAV 253

Query: 1393 KAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGL 1452
                                           + P   PL L+LK FL  R L++ YSGG+
Sbjct: 254  S------------------------------KLPPLRPLCLILKVFLQQRELNEVYSGGI 283

Query: 1453 SSYCLMLLITRFLQH--------EHHLG 1472
             SY L+ ++  FL++        EH+LG
Sbjct: 284  GSYALLAMLIAFLKYLKDGRSAPEHNLG 311


>gi|348500306|ref|XP_003437714.1| PREDICTED: PAP-associated domain-containing protein 5-like
            [Oreochromis niloticus]
          Length = 672

 Score = 73.6 bits (179), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 70/259 (27%), Positives = 100/259 (38%), Gaps = 88/259 (33%)

Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
            LHEEI  F   ++      K  +   V R+   +  LWP +   +FGS +TGL LP+SD+
Sbjct: 180  LHEEIIDFYNYISPRPEEEKMRLE-VVDRIKEVIHDLWPSAEVEVFGSFSTGLYLPTSDI 238

Query: 1273 DLVV-----CLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
            DLVV      LP    L  ++EA                    L  +     +S+K ++ 
Sbjct: 239  DLVVFGKWESLP----LWTLEEA--------------------LRKKNVADENSIKVLDK 274

Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
              +PII L                       ++T +K D                  S  
Sbjct: 275  ATVPIIKLT---------------------DSYTEVKVD-----------------ISFN 296

Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
                +KAA   RL   FK                 E++P    L LVLKQFL  R L++ 
Sbjct: 297  VMSGVKAA---RLIKEFK-----------------EKYPVLPYLVLVLKQFLLQRDLNEV 336

Query: 1448 YSGGLSSYCLMLLITRFLQ 1466
            ++GG+ SY L L+   FLQ
Sbjct: 337  FTGGIGSYSLFLMAVSFLQ 355


>gi|312077329|ref|XP_003141256.1| PAP/25A associated domain-containing protein [Loa loa]
 gi|307763579|gb|EFO22813.1| PAP/25A associated domain-containing protein [Loa loa]
          Length = 419

 Score = 73.6 bits (179), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 67/268 (25%), Positives = 109/268 (40%), Gaps = 86/268 (32%)

Query: 1202 RKASLSLMHSLLHEEI---DSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIF 1258
            R+ +LSL    LHEE+    ++ K    E   R        +RV   LQ +WP ++ ++F
Sbjct: 56   RRYALSLRG--LHEELLDLYAWLKPSPLEKALRLR----VFERVRGVLQRIWPTAKIDVF 109

Query: 1259 GSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVK 1318
            GS  T L LP+SD+D+VV    V    P+ +  +    +GI E                 
Sbjct: 110  GSLYTSLFLPTSDIDVVVESDLVSEEPPLWKTAVALKESGITE----------------- 152

Query: 1319 SDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDD 1378
              S+  ++   +PI+ +V                             D  ++ D+     
Sbjct: 153  --SINVLDKAFVPIVKMV---------------------------DKDTKIYLDI----- 178

Query: 1379 SASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQF 1438
                     S + ++   S R                  ++++  ++P   PL LVLKQF
Sbjct: 179  ---------SFNTVQGVRSAR-----------------FIEDMKMRYPVLEPLVLVLKQF 212

Query: 1439 LADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            L  R L+Q ++GGLSSY L+L++  FLQ
Sbjct: 213  LMQRQLNQVFTGGLSSYGLILMLISFLQ 240


>gi|134117055|ref|XP_772754.1| hypothetical protein CNBK1280 [Cryptococcus neoformans var.
            neoformans B-3501A]
 gi|50255372|gb|EAL18107.1| hypothetical protein CNBK1280 [Cryptococcus neoformans var.
            neoformans B-3501A]
          Length = 779

 Score = 73.2 bits (178), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 73/314 (23%), Positives = 123/314 (39%), Gaps = 67/314 (21%)

Query: 1169 QDHLIAISQLARDQEHPDVAFPLQPLEVQNCPTRKASLSLMHS---------LLHEEIDS 1219
            Q+H    S   RD      A P    E Q    R A  + +           LL EEI++
Sbjct: 90   QEHKARKSDRKRDNSRGIEAGPRNKKEEQKAAERHAPWTELVDVDLCRDPVDLLTEEINA 149

Query: 1220 FCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLP 1279
            F K V+      +  + + ++ +TR++  LWP +    FGS  T L LP  D+DLVV   
Sbjct: 150  FYKYVSPTREEFEVRL-FMIELITRTINKLWPEAEVTPFGSWQTQLYLPQGDIDLVVAHK 208

Query: 1280 PVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEV 1339
             + +                K+  L    +  A ++   +D +  +    +PII  V   
Sbjct: 209  YLSDAN--------------KQRLLAELGK--AMRQANITDVVAIIARARVPIIKFV--- 249

Query: 1340 PHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVR 1399
                                  TL+  +HV S              + S    +    + 
Sbjct: 250  ----------------------TLEGKSHVSS------------LEYFSKQ--EGIGKIN 273

Query: 1400 LDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLML 1459
            +DIS    +  G+    ++ +  +  P +  L L++K FL+ RS+++ Y+GGL SY ++ 
Sbjct: 274  VDISLNQAN--GVTAGKIINQYLDALPGARQLILIVKYFLSQRSMNEVYTGGLGSYSVIC 331

Query: 1460 LITRFLQHEHHLGR 1473
            ++  FLQ    L R
Sbjct: 332  MVISFLQLHPKLRR 345


>gi|59800139|sp|Q8NDF8.2|PAPD5_HUMAN RecName: Full=PAP-associated domain-containing protein 5; AltName:
            Full=Terminal uridylyltransferase 3; Short=TUTase 3;
            AltName: Full=Topoisomerase-related function protein 4-2;
            Short=TRF4-2
          Length = 572

 Score = 73.2 bits (178), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 67/259 (25%), Positives = 101/259 (38%), Gaps = 88/259 (33%)

Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
            LHEEI  F + ++      K  +   V R+   ++ LWP +   IFGS  TGL LP+SD+
Sbjct: 120  LHEEISDFYEYMSPRPEEEKMRME-VVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDI 178

Query: 1273 DLVVC-----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
            DLVV      LP    L  ++EA                    L   +    DS+K ++ 
Sbjct: 179  DLVVFGKWENLP----LWTLEEA--------------------LRKHKVADEDSVKVLDK 214

Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
              +PII L                       + T +K D                  S  
Sbjct: 215  ATVPIIKLT---------------------DSFTEVKVD-----------------ISFN 236

Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
              + ++AA  ++                    + T+++P    L LVLKQFL  R L++ 
Sbjct: 237  VQNGVRAADLIK--------------------DFTKKYPVLPYLVLVLKQFLLQRDLNEV 276

Query: 1448 YSGGLSSYCLMLLITRFLQ 1466
            ++GG+ SY L L+   FLQ
Sbjct: 277  FTGGIGSYSLFLMAVSFLQ 295


>gi|358341786|dbj|GAA49377.1| DNA polymerase sigma subunit [Clonorchis sinensis]
          Length = 826

 Score = 73.2 bits (178), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 42/119 (35%), Positives = 67/119 (56%), Gaps = 10/119 (8%)

Query: 1362 TLKH---DNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLV 1418
            TLKH    + + S++  LD +  P    T  +     T +++DISF   +   ++   L+
Sbjct: 655  TLKHALSSSGISSEITVLDKATVPIVKMTDKE-----TGLKVDISFNMIN--SVRAAVLI 707

Query: 1419 KELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQHEHHLGRPINQ 1477
            ++    FP    L  VLKQFL  R+L++ ++GGLSSY L+L++ RFLQ   HL  P+N+
Sbjct: 708  QDYMRTFPCMPYLVFVLKQFLLQRNLNEVWTGGLSSYALILMVVRFLQVRAHLIYPLNR 766



 Score = 44.3 bits (103), Expect = 0.57,   Method: Compositional matrix adjust.
 Identities = 24/67 (35%), Positives = 35/67 (52%), Gaps = 7/67 (10%)

Query: 1213 LHEEIDSF---CKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
            LH EI  F    K    E  AR    +  V ++   +  +WP    ++FGS  TGL LP+
Sbjct: 582  LHNEIKDFYNYIKPTDEEQYAR----DVVVSKIKDVVHSMWPDCEVDVFGSFKTGLYLPT 637

Query: 1270 SDVDLVV 1276
            SD+D+V+
Sbjct: 638  SDIDMVI 644


>gi|301756837|ref|XP_002914273.1| PREDICTED: PAP-associated domain-containing protein 5-like, partial
            [Ailuropoda melanoleuca]
          Length = 593

 Score = 73.2 bits (178), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 67/259 (25%), Positives = 101/259 (38%), Gaps = 88/259 (33%)

Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
            LHEEI  F + ++      K  +   V R+   ++ LWP +   IFGS  TGL LP+SD+
Sbjct: 141  LHEEISDFYEYMSPRPEEEKMRME-VVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDI 199

Query: 1273 DLVVC-----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
            DLVV      LP    L  ++EA                    L   +    DS+K ++ 
Sbjct: 200  DLVVFGKWENLP----LWTLEEA--------------------LRKHKVADEDSVKVLDK 235

Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
              +PII L                       + T +K D                  S  
Sbjct: 236  ATVPIIKLT---------------------DSFTEVKVD-----------------ISFN 257

Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
              + ++AA  ++                    + T+++P    L LVLKQFL  R L++ 
Sbjct: 258  VQNGVRAADLIK--------------------DFTKKYPVLPYLVLVLKQFLLQRDLNEV 297

Query: 1448 YSGGLSSYCLMLLITRFLQ 1466
            ++GG+ SY L L+   FLQ
Sbjct: 298  FTGGIGSYSLFLMAVSFLQ 316


>gi|224004818|ref|XP_002296060.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|209586092|gb|ACI64777.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 775

 Score = 73.2 bits (178), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 65/261 (24%), Positives = 115/261 (44%), Gaps = 25/261 (9%)

Query: 1213 LHEEIDSFCKQVAAENTARKPYI-NWAVKRVTRSLQVLWPRSRTNI--FGSNATGLSLPS 1269
            LH E+ SF K +  E T  +  +    VKRVT   +  + + +  +  FGS  TGL LP+
Sbjct: 160  LHNEMVSFVKLM--EPTKEELEVRERMVKRVTELAERTFGKGKCEVLPFGSQVTGLCLPN 217

Query: 1270 SDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETC--LQHAARYLANQEWVKSDSLKTVEN 1327
            SD+D V+  P  +      + G      G K     L  + +    ++  + +     + 
Sbjct: 218  SDIDFVIRFPKKKKDATKIKKG------GSKPDAINLDESEKKTKEKDGGQEEGEDLEDL 271

Query: 1328 TAIPIIMLVVEVPHDL-IASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSH 1386
             + P+  L   V  +  I S   ++    +D      +H +++      ++ +  P    
Sbjct: 272  ISNPLHKLAEAVREEFGIRSELDTMDPAVDDEVDDASEHLSYLE----VIEQTRVPLVKF 327

Query: 1387 TSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQ 1446
            T      A  ++ +D+ F  P   G ++ +L+    E  P   PL  +LK FLA R +++
Sbjct: 328  TV-----APYNLDIDVCFDQPG--GPESAELMHRFMESMPPLRPLTFILKYFLASRDMNK 380

Query: 1447 SYSGGLSSYCLMLLITRFLQH 1467
             Y+GG+ SY L ++I  FLQ 
Sbjct: 381  PYTGGIGSYLLQIMIVSFLQQ 401


>gi|256818782|ref|NP_001035375.2| PAP-associated domain-containing protein 5 isoform b [Homo sapiens]
          Length = 651

 Score = 72.8 bits (177), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 67/259 (25%), Positives = 101/259 (38%), Gaps = 88/259 (33%)

Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
            LHEEI  F + ++      K  +   V R+   ++ LWP +   IFGS  TGL LP+SD+
Sbjct: 199  LHEEISDFYEYMSPRPEEEKMRME-VVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDI 257

Query: 1273 DLVVC-----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
            DLVV      LP    L  ++EA                    L   +    DS+K ++ 
Sbjct: 258  DLVVFGKWENLP----LWTLEEA--------------------LRKHKVADEDSVKVLDK 293

Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
              +PII L                       + T +K D                  S  
Sbjct: 294  ATVPIIKLT---------------------DSFTEVKVD-----------------ISFN 315

Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
              + ++AA  ++                    + T+++P    L LVLKQFL  R L++ 
Sbjct: 316  VQNGVRAADLIK--------------------DFTKKYPVLPYLVLVLKQFLLQRDLNEV 355

Query: 1448 YSGGLSSYCLMLLITRFLQ 1466
            ++GG+ SY L L+   FLQ
Sbjct: 356  FTGGIGSYSLFLMAVSFLQ 374


>gi|332845909|ref|XP_003315148.1| PREDICTED: PAP-associated domain-containing protein 5 [Pan
            troglodytes]
          Length = 586

 Score = 72.8 bits (177), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 67/259 (25%), Positives = 101/259 (38%), Gaps = 88/259 (33%)

Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
            LHEEI  F + ++      K  +   V R+   ++ LWP +   IFGS  TGL LP+SD+
Sbjct: 87   LHEEISDFYEYMSPRPEEEKMRME-VVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDI 145

Query: 1273 DLVVC-----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
            DLVV      LP    L  ++EA                    L   +    DS+K ++ 
Sbjct: 146  DLVVFGKWENLP----LWTLEEA--------------------LRKHKVADEDSVKVLDK 181

Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
              +PII L                       + T +K D                  S  
Sbjct: 182  ATVPIIKLT---------------------DSFTEVKVD-----------------ISFN 203

Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
              + ++AA  ++                    + T+++P    L LVLKQFL  R L++ 
Sbjct: 204  VQNGVRAADLIK--------------------DFTKKYPVLPYLVLVLKQFLLQRDLNEV 243

Query: 1448 YSGGLSSYCLMLLITRFLQ 1466
            ++GG+ SY L L+   FLQ
Sbjct: 244  FTGGIGSYSLFLMAVSFLQ 262


>gi|296231051|ref|XP_002760982.1| PREDICTED: PAP-associated domain-containing protein 5 isoform 2
            [Callithrix jacchus]
          Length = 664

 Score = 72.8 bits (177), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 67/259 (25%), Positives = 101/259 (38%), Gaps = 88/259 (33%)

Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
            LHEEI  F + ++      K  +   V R+   ++ LWP +   IFGS  TGL LP+SD+
Sbjct: 212  LHEEISDFYEYMSPRPEEEKMRME-VVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDI 270

Query: 1273 DLVVC-----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
            DLVV      LP    L  ++EA                    L   +    DS+K ++ 
Sbjct: 271  DLVVFGKWENLP----LWTLEEA--------------------LRKHKVADEDSVKVLDK 306

Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
              +PII L                       + T +K D                  S  
Sbjct: 307  ATVPIIKLT---------------------DSFTEVKVD-----------------ISFN 328

Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
              + ++AA  ++                    + T+++P    L LVLKQFL  R L++ 
Sbjct: 329  VQNGVRAADLIK--------------------DFTKKYPVLPYLVLVLKQFLLQRDLNEV 368

Query: 1448 YSGGLSSYCLMLLITRFLQ 1466
            ++GG+ SY L L+   FLQ
Sbjct: 369  FTGGIGSYSLFLMAVSFLQ 387


>gi|441597299|ref|XP_003263084.2| PREDICTED: PAP-associated domain-containing protein 5 isoform 2
            [Nomascus leucogenys]
          Length = 666

 Score = 72.8 bits (177), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 67/259 (25%), Positives = 101/259 (38%), Gaps = 88/259 (33%)

Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
            LHEEI  F + ++      K  +   V R+   ++ LWP +   IFGS  TGL LP+SD+
Sbjct: 214  LHEEISDFYEYMSPRPEEEKMRME-VVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDI 272

Query: 1273 DLVVC-----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
            DLVV      LP    L  ++EA                    L   +    DS+K ++ 
Sbjct: 273  DLVVFGKWENLP----LWTLEEA--------------------LRKHKVADEDSVKVLDK 308

Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
              +PII L                       + T +K D                  S  
Sbjct: 309  ATVPIIKLT---------------------DSFTEVKVD-----------------ISFN 330

Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
              + ++AA  ++                    + T+++P    L LVLKQFL  R L++ 
Sbjct: 331  VQNGVRAADLIK--------------------DFTKKYPVLPYLVLVLKQFLLQRDLNEV 370

Query: 1448 YSGGLSSYCLMLLITRFLQ 1466
            ++GG+ SY L L+   FLQ
Sbjct: 371  FTGGIGSYSLFLMAVSFLQ 389


>gi|355710188|gb|EHH31652.1| hypothetical protein EGK_12764, partial [Macaca mulatta]
          Length = 564

 Score = 72.8 bits (177), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 67/259 (25%), Positives = 101/259 (38%), Gaps = 88/259 (33%)

Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
            LHEEI  F + ++      K  +   V R+   ++ LWP +   IFGS  TGL LP+SD+
Sbjct: 65   LHEEISDFYEYMSPRPEEEKMRME-VVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDI 123

Query: 1273 DLVVC-----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
            DLVV      LP    L  ++EA                    L   +    DS+K ++ 
Sbjct: 124  DLVVFGKWENLP----LWTLEEA--------------------LRKHKVADEDSVKVLDK 159

Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
              +PII L                       + T +K D                  S  
Sbjct: 160  ATVPIIKLT---------------------DSFTEVKVD-----------------ISFN 181

Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
              + ++AA  ++                    + T+++P    L LVLKQFL  R L++ 
Sbjct: 182  VQNGVRAADLIK--------------------DFTKKYPVLPYLVLVLKQFLLQRDLNEV 221

Query: 1448 YSGGLSSYCLMLLITRFLQ 1466
            ++GG+ SY L L+   FLQ
Sbjct: 222  FTGGIGSYSLFLMAVSFLQ 240


>gi|297283970|ref|XP_002802516.1| PREDICTED: PAP-associated domain-containing protein 5 [Macaca
            mulatta]
          Length = 653

 Score = 72.8 bits (177), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 67/259 (25%), Positives = 101/259 (38%), Gaps = 88/259 (33%)

Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
            LHEEI  F + ++      K  +   V R+   ++ LWP +   IFGS  TGL LP+SD+
Sbjct: 201  LHEEISDFYEYMSPRPEEEKMRME-VVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDI 259

Query: 1273 DLVVC-----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
            DLVV      LP    L  ++EA                    L   +    DS+K ++ 
Sbjct: 260  DLVVFGKWENLP----LWTLEEA--------------------LRKHKVADEDSVKVLDK 295

Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
              +PII L                       + T +K D                  S  
Sbjct: 296  ATVPIIKLT---------------------DSFTEVKVD-----------------ISFN 317

Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
              + ++AA  ++                    + T+++P    L LVLKQFL  R L++ 
Sbjct: 318  VQNGVRAADLIK--------------------DFTKKYPVLPYLVLVLKQFLLQRDLNEV 357

Query: 1448 YSGGLSSYCLMLLITRFLQ 1466
            ++GG+ SY L L+   FLQ
Sbjct: 358  FTGGIGSYSLFLMAVSFLQ 376


>gi|380798533|gb|AFE71142.1| PAP-associated domain-containing protein 5 isoform a, partial [Macaca
            mulatta]
          Length = 618

 Score = 72.8 bits (177), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 67/259 (25%), Positives = 101/259 (38%), Gaps = 88/259 (33%)

Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
            LHEEI  F + ++      K  +   V R+   ++ LWP +   IFGS  TGL LP+SD+
Sbjct: 119  LHEEISDFYEYMSPRPEEEKMRME-VVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDI 177

Query: 1273 DLVVC-----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
            DLVV      LP    L  ++EA                    L   +    DS+K ++ 
Sbjct: 178  DLVVFGKWENLP----LWTLEEA--------------------LRKHKVADEDSVKVLDK 213

Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
              +PII L                       + T +K D                  S  
Sbjct: 214  ATVPIIKLT---------------------DSFTEVKVD-----------------ISFN 235

Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
              + ++AA  ++                    + T+++P    L LVLKQFL  R L++ 
Sbjct: 236  VQNGVRAADLIK--------------------DFTKKYPVLPYLVLVLKQFLLQRDLNEV 275

Query: 1448 YSGGLSSYCLMLLITRFLQ 1466
            ++GG+ SY L L+   FLQ
Sbjct: 276  FTGGIGSYSLFLMAVSFLQ 294


>gi|281338901|gb|EFB14485.1| hypothetical protein PANDA_002140 [Ailuropoda melanoleuca]
          Length = 632

 Score = 72.8 bits (177), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 67/259 (25%), Positives = 101/259 (38%), Gaps = 88/259 (33%)

Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
            LHEEI  F + ++      K  +   V R+   ++ LWP +   IFGS  TGL LP+SD+
Sbjct: 133  LHEEISDFYEYMSPRPEEEKMRME-VVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDI 191

Query: 1273 DLVVC-----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
            DLVV      LP    L  ++EA                    L   +    DS+K ++ 
Sbjct: 192  DLVVFGKWENLP----LWTLEEA--------------------LRKHKVADEDSVKVLDK 227

Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
              +PII L                       + T +K D                  S  
Sbjct: 228  ATVPIIKLT---------------------DSFTEVKVD-----------------ISFN 249

Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
              + ++AA  ++                    + T+++P    L LVLKQFL  R L++ 
Sbjct: 250  VQNGVRAADLIK--------------------DFTKKYPVLPYLVLVLKQFLLQRDLNEV 289

Query: 1448 YSGGLSSYCLMLLITRFLQ 1466
            ++GG+ SY L L+   FLQ
Sbjct: 290  FTGGIGSYSLFLMAVSFLQ 308


>gi|119603153|gb|EAW82747.1| PAP associated domain containing 5, isoform CRA_a [Homo sapiens]
 gi|119603154|gb|EAW82748.1| PAP associated domain containing 5, isoform CRA_a [Homo sapiens]
          Length = 527

 Score = 72.8 bits (177), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 67/259 (25%), Positives = 101/259 (38%), Gaps = 88/259 (33%)

Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
            LHEEI  F + ++      K  +   V R+   ++ LWP +   IFGS  TGL LP+SD+
Sbjct: 28   LHEEISDFYEYMSPRPEEEKMRME-VVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDI 86

Query: 1273 DLVVC-----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
            DLVV      LP    L  ++EA                    L   +    DS+K ++ 
Sbjct: 87   DLVVFGKWENLP----LWTLEEA--------------------LRKHKVADEDSVKVLDK 122

Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
              +PII L                       + T +K D                  S  
Sbjct: 123  ATVPIIKLT---------------------DSFTEVKVD-----------------ISFN 144

Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
              + ++AA  ++                    + T+++P    L LVLKQFL  R L++ 
Sbjct: 145  VQNGVRAADLIK--------------------DFTKKYPVLPYLVLVLKQFLLQRDLNEV 184

Query: 1448 YSGGLSSYCLMLLITRFLQ 1466
            ++GG+ SY L L+   FLQ
Sbjct: 185  FTGGIGSYSLFLMAVSFLQ 203


>gi|441597295|ref|XP_003263083.2| PREDICTED: PAP-associated domain-containing protein 5 isoform 1
            [Nomascus leucogenys]
 gi|348031139|emb|CCB84642.1| PAP associated domain containing 5 [Homo sapiens]
          Length = 631

 Score = 72.8 bits (177), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 67/259 (25%), Positives = 101/259 (38%), Gaps = 88/259 (33%)

Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
            LHEEI  F + ++      K  +   V R+   ++ LWP +   IFGS  TGL LP+SD+
Sbjct: 132  LHEEISDFYEYMSPRPEEEKMRME-VVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDI 190

Query: 1273 DLVVC-----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
            DLVV      LP    L  ++EA                    L   +    DS+K ++ 
Sbjct: 191  DLVVFGKWENLP----LWTLEEA--------------------LRKHKVADEDSVKVLDK 226

Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
              +PII L                       + T +K D                  S  
Sbjct: 227  ATVPIIKLT---------------------DSFTEVKVD-----------------ISFN 248

Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
              + ++AA  ++                    + T+++P    L LVLKQFL  R L++ 
Sbjct: 249  VQNGVRAADLIK--------------------DFTKKYPVLPYLVLVLKQFLLQRDLNEV 288

Query: 1448 YSGGLSSYCLMLLITRFLQ 1466
            ++GG+ SY L L+   FLQ
Sbjct: 289  FTGGIGSYSLFLMAVSFLQ 307


>gi|321263807|ref|XP_003196621.1| hypothetical protein CGB_K1560W [Cryptococcus gattii WM276]
 gi|317463098|gb|ADV24834.1| Hypothetical protein CGB_K1560W [Cryptococcus gattii WM276]
          Length = 784

 Score = 72.8 bits (177), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 64/262 (24%), Positives = 110/262 (41%), Gaps = 58/262 (22%)

Query: 1212 LLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSD 1271
            LL EEI++F K V+      +  + + ++ +TR++  LWP +    FGS  T L LP  D
Sbjct: 142  LLTEEINAFYKYVSPTREEFEVRL-FMIELITRTINKLWPEAEVTPFGSWQTQLYLPQGD 200

Query: 1272 VDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIP 1331
            +DLVV    + +                K+  L    +  A ++   +D +  +    +P
Sbjct: 201  IDLVVAHKYLSDAN--------------KQRLLAELGK--AMRQANITDVVAIIARARVP 244

Query: 1332 IIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDN 1391
            II  V                         TL+  +HV S +  L          T  + 
Sbjct: 245  IIKFV-------------------------TLEGKSHVFS-LAYL----------TKQEG 268

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            I     + +DIS       G+    ++ +  +  P +  L L++K FL+ RS+++ Y+GG
Sbjct: 269  I---GKINVDISLNQ--GNGVTAGKIINQYLDALPGARQLILIVKYFLSQRSMNEVYTGG 323

Query: 1452 LSSYCLMLLITRFLQHEHHLGR 1473
            L SY ++ ++  FLQ    L R
Sbjct: 324  LGSYSVICMVISFLQLHPKLRR 345


>gi|403292555|ref|XP_003937307.1| PREDICTED: PAP-associated domain-containing protein 5 [Saimiri
            boliviensis boliviensis]
          Length = 631

 Score = 72.8 bits (177), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 67/259 (25%), Positives = 101/259 (38%), Gaps = 88/259 (33%)

Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
            LHEEI  F + ++      K  +   V R+   ++ LWP +   IFGS  TGL LP+SD+
Sbjct: 132  LHEEISDFYEYMSPRPEEEKMRME-VVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDI 190

Query: 1273 DLVVC-----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
            DLVV      LP    L  ++EA                    L   +    DS+K ++ 
Sbjct: 191  DLVVFGKWENLP----LWTLEEA--------------------LRKHKVADEDSVKVLDK 226

Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
              +PII L                       + T +K D                  S  
Sbjct: 227  ATVPIIKLT---------------------DSFTEVKVD-----------------ISFN 248

Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
              + ++AA  ++                    + T+++P    L LVLKQFL  R L++ 
Sbjct: 249  VQNGVRAADLIK--------------------DFTKKYPVLPYLVLVLKQFLLQRDLNEV 288

Query: 1448 YSGGLSSYCLMLLITRFLQ 1466
            ++GG+ SY L L+   FLQ
Sbjct: 289  FTGGIGSYSLFLMAVSFLQ 307


>gi|256818780|ref|NP_001035374.2| PAP-associated domain-containing protein 5 isoform a [Homo sapiens]
 gi|194374871|dbj|BAG62550.1| unnamed protein product [Homo sapiens]
          Length = 698

 Score = 72.8 bits (177), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 67/259 (25%), Positives = 101/259 (38%), Gaps = 88/259 (33%)

Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
            LHEEI  F + ++      K  +   V R+   ++ LWP +   IFGS  TGL LP+SD+
Sbjct: 199  LHEEISDFYEYMSPRPEEEKMRME-VVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDI 257

Query: 1273 DLVVC-----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
            DLVV      LP    L  ++EA                    L   +    DS+K ++ 
Sbjct: 258  DLVVFGKWENLP----LWTLEEA--------------------LRKHKVADEDSVKVLDK 293

Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
              +PII L                       + T +K D                  S  
Sbjct: 294  ATVPIIKLT---------------------DSFTEVKVD-----------------ISFN 315

Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
              + ++AA  ++                    + T+++P    L LVLKQFL  R L++ 
Sbjct: 316  VQNGVRAADLIK--------------------DFTKKYPVLPYLVLVLKQFLLQRDLNEV 355

Query: 1448 YSGGLSSYCLMLLITRFLQ 1466
            ++GG+ SY L L+   FLQ
Sbjct: 356  FTGGIGSYSLFLMAVSFLQ 374


>gi|397498213|ref|XP_003819879.1| PREDICTED: PAP-associated domain-containing protein 5, partial [Pan
            paniscus]
          Length = 593

 Score = 72.8 bits (177), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 67/259 (25%), Positives = 101/259 (38%), Gaps = 88/259 (33%)

Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
            LHEEI  F + ++      K  +   V R+   ++ LWP +   IFGS  TGL LP+SD+
Sbjct: 94   LHEEISDFYEYMSPRPEEEKMRME-VVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDI 152

Query: 1273 DLVVC-----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
            DLVV      LP    L  ++EA                    L   +    DS+K ++ 
Sbjct: 153  DLVVFGKWENLP----LWTLEEA--------------------LRKHKVADEDSVKVLDK 188

Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
              +PII L                       + T +K D                  S  
Sbjct: 189  ATVPIIKLT---------------------DSFTEVKVD-----------------ISFN 210

Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
              + ++AA  ++                    + T+++P    L LVLKQFL  R L++ 
Sbjct: 211  VQNGVRAADLIK--------------------DFTKKYPVLPYLVLVLKQFLLQRDLNEV 250

Query: 1448 YSGGLSSYCLMLLITRFLQ 1466
            ++GG+ SY L L+   FLQ
Sbjct: 251  FTGGIGSYSLFLMAVSFLQ 269


>gi|395839409|ref|XP_003792582.1| PREDICTED: PAP-associated domain-containing protein 5 [Otolemur
            garnettii]
          Length = 629

 Score = 72.8 bits (177), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 67/259 (25%), Positives = 101/259 (38%), Gaps = 88/259 (33%)

Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
            LHEEI  F + ++      K  +   V R+   ++ LWP +   IFGS  TGL LP+SD+
Sbjct: 130  LHEEISDFYEYMSPRPEEEKMRME-VVSRIESVIKELWPSADVQIFGSFKTGLYLPTSDI 188

Query: 1273 DLVVC-----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
            DLVV      LP    L  ++EA                    L   +    DS+K ++ 
Sbjct: 189  DLVVFGKWENLP----LWTLEEA--------------------LRKHKVADEDSVKVLDK 224

Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
              +PII L                       + T +K D                  S  
Sbjct: 225  ATVPIIKLT---------------------DSFTEVKVD-----------------ISFN 246

Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
              + ++AA  ++                    + T+++P    L LVLKQFL  R L++ 
Sbjct: 247  VQNGVRAADLIK--------------------DFTKKYPVLPYLVLVLKQFLLQRDLNEV 286

Query: 1448 YSGGLSSYCLMLLITRFLQ 1466
            ++GG+ SY L L+   FLQ
Sbjct: 287  FTGGIGSYSLFLMAVSFLQ 305


>gi|426382139|ref|XP_004057678.1| PREDICTED: PAP-associated domain-containing protein 5 isoform 2
            [Gorilla gorilla gorilla]
          Length = 664

 Score = 72.4 bits (176), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 67/259 (25%), Positives = 101/259 (38%), Gaps = 88/259 (33%)

Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
            LHEEI  F + ++      K  +   V R+   ++ LWP +   IFGS  TGL LP+SD+
Sbjct: 212  LHEEISDFYEYMSPRPEEEKMRME-VVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDI 270

Query: 1273 DLVVC-----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
            DLVV      LP    L  ++EA                    L   +    DS+K ++ 
Sbjct: 271  DLVVFGKWENLP----LWTLEEA--------------------LRKHKVADEDSVKVLDK 306

Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
              +PII L                       + T +K D                  S  
Sbjct: 307  ATVPIIKLT---------------------DSFTEVKVD-----------------ISFN 328

Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
              + ++AA  ++                    + T+++P    L LVLKQFL  R L++ 
Sbjct: 329  VQNGVRAADLIK--------------------DFTKKYPVLPYLVLVLKQFLLQRDLNEV 368

Query: 1448 YSGGLSSYCLMLLITRFLQ 1466
            ++GG+ SY L L+   FLQ
Sbjct: 369  FTGGIGSYSLFLMAVSFLQ 387


>gi|390477686|ref|XP_002760981.2| PREDICTED: PAP-associated domain-containing protein 5 isoform 1
            [Callithrix jacchus]
          Length = 631

 Score = 72.4 bits (176), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 67/259 (25%), Positives = 101/259 (38%), Gaps = 88/259 (33%)

Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
            LHEEI  F + ++      K  +   V R+   ++ LWP +   IFGS  TGL LP+SD+
Sbjct: 132  LHEEISDFYEYMSPRPEEEKMRME-VVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDI 190

Query: 1273 DLVVC-----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
            DLVV      LP    L  ++EA                    L   +    DS+K ++ 
Sbjct: 191  DLVVFGKWENLP----LWTLEEA--------------------LRKHKVADEDSVKVLDK 226

Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
              +PII L                       + T +K D                  S  
Sbjct: 227  ATVPIIKLT---------------------DSFTEVKVD-----------------ISFN 248

Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
              + ++AA  ++                    + T+++P    L LVLKQFL  R L++ 
Sbjct: 249  VQNGVRAADLIK--------------------DFTKKYPVLPYLVLVLKQFLLQRDLNEV 288

Query: 1448 YSGGLSSYCLMLLITRFLQ 1466
            ++GG+ SY L L+   FLQ
Sbjct: 289  FTGGIGSYSLFLMAVSFLQ 307


>gi|359319041|ref|XP_535307.4| PREDICTED: PAP-associated domain-containing protein 5 [Canis lupus
            familiaris]
          Length = 641

 Score = 72.4 bits (176), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 67/259 (25%), Positives = 101/259 (38%), Gaps = 88/259 (33%)

Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
            LHEEI  F + ++      K  +   V R+   ++ LWP +   IFGS  TGL LP+SD+
Sbjct: 142  LHEEISDFYEYMSPRPEEEKMRME-VVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDI 200

Query: 1273 DLVVC-----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
            DLVV      LP    L  ++EA                    L   +    DS+K ++ 
Sbjct: 201  DLVVFGKWENLP----LWTLEEA--------------------LRKHKVADEDSVKVLDK 236

Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
              +PII L                       + T +K D                  S  
Sbjct: 237  ATVPIIKLT---------------------DSFTEVKVD-----------------ISFN 258

Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
              + ++AA  ++                    + T+++P    L LVLKQFL  R L++ 
Sbjct: 259  VQNGVRAADLIK--------------------DFTKKYPVLPYLVLVLKQFLLQRDLNEV 298

Query: 1448 YSGGLSSYCLMLLITRFLQ 1466
            ++GG+ SY L L+   FLQ
Sbjct: 299  FTGGIGSYSLFLMAVSFLQ 317


>gi|354474676|ref|XP_003499556.1| PREDICTED: PAP-associated domain-containing protein 5-like
            [Cricetulus griseus]
          Length = 464

 Score = 72.4 bits (176), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 67/259 (25%), Positives = 101/259 (38%), Gaps = 88/259 (33%)

Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
            LHEEI  F + ++      K  +   V R+   ++ LWP +   IFGS  TGL LP+SD+
Sbjct: 12   LHEEISDFYEYMSPRPEEEKMRME-VVSRIESVIKELWPSADVQIFGSFKTGLYLPTSDI 70

Query: 1273 DLVVC-----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
            DLVV      LP    L  ++EA                    L   +    DS+K ++ 
Sbjct: 71   DLVVFGKWENLP----LWTLEEA--------------------LRKHKVADEDSVKVLDK 106

Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
              +PII L                       + T +K D                  S  
Sbjct: 107  ATVPIIKLT---------------------DSFTEVKVD-----------------ISFN 128

Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
              + ++AA  ++                    + T+++P    L LVLKQFL  R L++ 
Sbjct: 129  VQNGVRAADLIK--------------------DFTKKYPVLPYLVLVLKQFLLQRDLNEV 168

Query: 1448 YSGGLSSYCLMLLITRFLQ 1466
            ++GG+ SY L L+   FLQ
Sbjct: 169  FTGGIGSYSLFLMAVSFLQ 187


>gi|335308290|ref|XP_003361170.1| PREDICTED: PAP-associated domain-containing protein 5-like [Sus
            scrofa]
          Length = 511

 Score = 72.4 bits (176), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 67/259 (25%), Positives = 101/259 (38%), Gaps = 88/259 (33%)

Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
            LHEEI  F + ++      K  +   V R+   ++ LWP +   IFGS  TGL LP+SD+
Sbjct: 12   LHEEISDFYEYMSPRPEEEKMRME-VVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDI 70

Query: 1273 DLVVC-----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
            DLVV      LP    L  ++EA                    L   +    DS+K ++ 
Sbjct: 71   DLVVFGKWENLP----LWTLEEA--------------------LRKHKVADEDSVKVLDK 106

Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
              +PII L                       + T +K D                  S  
Sbjct: 107  ATVPIIKLT---------------------DSFTEVKVD-----------------ISFN 128

Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
              + ++AA  ++                    + T+++P    L LVLKQFL  R L++ 
Sbjct: 129  VQNGVRAADLIK--------------------DFTKKYPVLPYLVLVLKQFLLQRDLNEV 168

Query: 1448 YSGGLSSYCLMLLITRFLQ 1466
            ++GG+ SY L L+   FLQ
Sbjct: 169  FTGGIGSYSLFLMAVSFLQ 187


>gi|297283968|ref|XP_001083145.2| PREDICTED: PAP-associated domain-containing protein 5 isoform 2
            [Macaca mulatta]
          Length = 700

 Score = 72.4 bits (176), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 67/259 (25%), Positives = 101/259 (38%), Gaps = 88/259 (33%)

Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
            LHEEI  F + ++      K  +   V R+   ++ LWP +   IFGS  TGL LP+SD+
Sbjct: 201  LHEEISDFYEYMSPRPEEEKMRME-VVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDI 259

Query: 1273 DLVVC-----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
            DLVV      LP    L  ++EA                    L   +    DS+K ++ 
Sbjct: 260  DLVVFGKWENLP----LWTLEEA--------------------LRKHKVADEDSVKVLDK 295

Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
              +PII L                       + T +K D                  S  
Sbjct: 296  ATVPIIKLT---------------------DSFTEVKVD-----------------ISFN 317

Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
              + ++AA  ++                    + T+++P    L LVLKQFL  R L++ 
Sbjct: 318  VQNGVRAADLIK--------------------DFTKKYPVLPYLVLVLKQFLLQRDLNEV 357

Query: 1448 YSGGLSSYCLMLLITRFLQ 1466
            ++GG+ SY L L+   FLQ
Sbjct: 358  FTGGIGSYSLFLMAVSFLQ 376


>gi|402908342|ref|XP_003916909.1| PREDICTED: PAP-associated domain-containing protein 5 [Papio anubis]
          Length = 605

 Score = 72.4 bits (176), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 67/259 (25%), Positives = 101/259 (38%), Gaps = 88/259 (33%)

Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
            LHEEI  F + ++      K  +   V R+   ++ LWP +   IFGS  TGL LP+SD+
Sbjct: 106  LHEEISDFYEYMSPRPEEEKMRME-VVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDI 164

Query: 1273 DLVVC-----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
            DLVV      LP    L  ++EA                    L   +    DS+K ++ 
Sbjct: 165  DLVVFGKWENLP----LWTLEEA--------------------LRKHKVADEDSVKVLDK 200

Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
              +PII L                       + T +K D                  S  
Sbjct: 201  ATVPIIKLT---------------------DSFTEVKVD-----------------ISFN 222

Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
              + ++AA  ++                    + T+++P    L LVLKQFL  R L++ 
Sbjct: 223  VQNGVRAADLIK--------------------DFTKKYPVLPYLVLVLKQFLLQRDLNEV 262

Query: 1448 YSGGLSSYCLMLLITRFLQ 1466
            ++GG+ SY L L+   FLQ
Sbjct: 263  FTGGIGSYSLFLMAVSFLQ 281


>gi|312384741|gb|EFR29395.1| hypothetical protein AND_01692 [Anopheles darlingi]
          Length = 1101

 Score = 72.4 bits (176), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 34/74 (45%), Positives = 48/74 (64%), Gaps = 2/74 (2%)

Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
            T V++DISF   S  G+Q+  L+KE  + +P    L LVLKQFL  R L++ ++GG+SSY
Sbjct: 378  TQVKVDISFNMES--GVQSAKLIKEYKQDYPVLEKLVLVLKQFLLQRDLNEVFTGGISSY 435

Query: 1456 CLMLLITRFLQHEH 1469
             L+L+   FLQ  H
Sbjct: 436  SLILMCISFLQQHH 449



 Score = 59.7 bits (143), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 45/150 (30%), Positives = 71/150 (47%), Gaps = 31/150 (20%)

Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
            LH+EID F   +    T     +   V R+ + +  LWP +R  +FGS  TGL LP+SD+
Sbjct: 271  LHQEIDQFYNHMIPTPTEHALRV-MVVNRIEQIVLNLWPSARVEMFGSFRTGLYLPTSDI 329

Query: 1273 DLVVCLP----PVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENT 1328
            DLVV       P+R LE                  ++   RY+A     + +S++ ++  
Sbjct: 330  DLVVIGQWEKLPLRTLE------------------MELINRYIA-----EPNSVRVLDKA 366

Query: 1329 AIPIIMLV---VEVPHDLIASAASSVQSPK 1355
            ++PI+ L     +V  D+  +  S VQS K
Sbjct: 367  SVPIVKLTDRETQVKVDISFNMESGVQSAK 396


>gi|431914108|gb|ELK15367.1| PAP-associated domain-containing protein 5 [Pteropus alecto]
          Length = 530

 Score = 72.4 bits (176), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 67/259 (25%), Positives = 101/259 (38%), Gaps = 88/259 (33%)

Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
            LHEEI  F + ++      K  +   V R+   ++ LWP +   IFGS  TGL LP+SD+
Sbjct: 31   LHEEISDFYEYMSPRPEEEKMRME-VVSRIESVIKELWPSADVQIFGSFKTGLYLPTSDI 89

Query: 1273 DLVVC-----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
            DLVV      LP    L  ++EA                    L   +    DS+K ++ 
Sbjct: 90   DLVVFGKWENLP----LWTLEEA--------------------LRKHKVADEDSVKVLDK 125

Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
              +PII L                       + T +K D                  S  
Sbjct: 126  ATVPIIKLT---------------------DSFTEVKVD-----------------ISFN 147

Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
              + ++AA  ++                    + T+++P    L LVLKQFL  R L++ 
Sbjct: 148  VQNGVRAADLIK--------------------DFTKKYPVLPYLVLVLKQFLLQRDLNEV 187

Query: 1448 YSGGLSSYCLMLLITRFLQ 1466
            ++GG+ SY L L+   FLQ
Sbjct: 188  FTGGIGSYSLFLMAVSFLQ 206


>gi|51328369|gb|AAH80314.1| Papd5 protein [Mus musculus]
          Length = 583

 Score = 72.4 bits (176), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 67/259 (25%), Positives = 101/259 (38%), Gaps = 88/259 (33%)

Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
            LHEEI  F + ++      K  +   V R+   ++ LWP +   IFGS  TGL LP+SD+
Sbjct: 84   LHEEISDFYEYMSPRPEEEKMRME-VVSRIESVIKELWPSADVQIFGSFKTGLYLPTSDI 142

Query: 1273 DLVVC-----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
            DLVV      LP    L  ++EA                    L   +    DS+K ++ 
Sbjct: 143  DLVVFGKWENLP----LWTLEEA--------------------LRKHKVADEDSVKVLDK 178

Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
              +PII L                       + T +K D                  S  
Sbjct: 179  ATVPIIKLT---------------------DSFTEVKVD-----------------ISFN 200

Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
              + ++AA  ++                    + T+++P    L LVLKQFL  R L++ 
Sbjct: 201  VQNGVRAADLIK--------------------DFTKKYPVLPYLVLVLKQFLLQRDLNEV 240

Query: 1448 YSGGLSSYCLMLLITRFLQ 1466
            ++GG+ SY L L+   FLQ
Sbjct: 241  FTGGIGSYSLFLMAVSFLQ 259


>gi|327278603|ref|XP_003224050.1| PREDICTED: PAP-associated domain-containing protein 5-like [Anolis
            carolinensis]
          Length = 665

 Score = 72.0 bits (175), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 68/259 (26%), Positives = 101/259 (38%), Gaps = 88/259 (33%)

Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
            LHEEI+ F K ++     ++  +   V R+   ++ LWP +   IFGS  TGL LP+SD+
Sbjct: 169  LHEEINDFYKYMSPRPEEQRMRME-VVNRIENVIKELWPNADVQIFGSFKTGLYLPTSDI 227

Query: 1273 DLVV-----CLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
            DLVV      LP    L  ++EA                    L         S+K ++ 
Sbjct: 228  DLVVFGKWETLP----LWTLEEA--------------------LRKHNVADKGSVKVLDK 263

Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
              +PII L                       + T +K D                  S  
Sbjct: 264  ATVPIIKLT---------------------DSFTEVKVD-----------------ISFN 285

Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
              + +KAA  ++                D +K    ++P    L LVLKQFL  R L++ 
Sbjct: 286  VQNGVKAADLIK----------------DFIK----KYPVLPYLVLVLKQFLLQRDLNEV 325

Query: 1448 YSGGLSSYCLMLLITRFLQ 1466
            ++GG+ SY L L+   FLQ
Sbjct: 326  FTGGIGSYSLFLMAVSFLQ 344


>gi|426243516|ref|XP_004015600.1| PREDICTED: PAP-associated domain-containing protein 5 [Ovis aries]
          Length = 588

 Score = 72.0 bits (175), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 67/259 (25%), Positives = 101/259 (38%), Gaps = 88/259 (33%)

Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
            LHEEI  F + ++      K  +   V R+   ++ LWP +   IFGS  TGL LP+SD+
Sbjct: 136  LHEEISDFYEYMSPRPEEEKMRME-VVNRIEGVIKELWPSADVQIFGSFKTGLYLPTSDI 194

Query: 1273 DLVVC-----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
            DLVV      LP    L  ++EA                    L   +    DS+K ++ 
Sbjct: 195  DLVVFGKWENLP----LWTLEEA--------------------LRKHKVADEDSVKVLDK 230

Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
              +PII L                       + T +K D                  S  
Sbjct: 231  ATVPIIKLT---------------------DSFTEVKVD-----------------ISFN 252

Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
              + ++AA  ++                    + T+++P    L LVLKQFL  R L++ 
Sbjct: 253  VQNGVRAADLIK--------------------DFTKKYPVLPYLVLVLKQFLLQRDLNEV 292

Query: 1448 YSGGLSSYCLMLLITRFLQ 1466
            ++GG+ SY L L+   FLQ
Sbjct: 293  FTGGIGSYSLFLMAVSFLQ 311


>gi|291410211|ref|XP_002721395.1| PREDICTED: DNA polymerase sigma-like [Oryctolagus cuniculus]
          Length = 522

 Score = 72.0 bits (175), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 67/259 (25%), Positives = 101/259 (38%), Gaps = 88/259 (33%)

Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
            LHEEI  F + ++      K  +   V R+   ++ LWP +   IFGS  TGL LP+SD+
Sbjct: 23   LHEEISDFYEYMSPRPEEEKMRME-VVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDI 81

Query: 1273 DLVVC-----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
            DLVV      LP    L  ++EA                    L   +    DS+K ++ 
Sbjct: 82   DLVVFGKWENLP----LWTLEEA--------------------LRKHKVADEDSVKVLDK 117

Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
              +PII L                       + T +K D                  S  
Sbjct: 118  ATVPIIKLT---------------------DSFTEVKVD-----------------ISFN 139

Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
              + ++AA  ++                    + T+++P    L LVLKQFL  R L++ 
Sbjct: 140  VQNGVRAADLIK--------------------DFTKKYPVLPYLVLVLKQFLLQRDLNEV 179

Query: 1448 YSGGLSSYCLMLLITRFLQ 1466
            ++GG+ SY L L+   FLQ
Sbjct: 180  FTGGIGSYSLFLMAVSFLQ 198


>gi|60392891|sp|Q68ED3.2|PAPD5_MOUSE RecName: Full=PAP-associated domain-containing protein 5; AltName:
            Full=Topoisomerase-related function protein 4-2;
            Short=TRF4-2
 gi|148878177|gb|AAI45738.1| Papd5 protein [Mus musculus]
 gi|219519562|gb|AAI44797.1| Papd5 protein [Mus musculus]
          Length = 633

 Score = 72.0 bits (175), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 67/259 (25%), Positives = 101/259 (38%), Gaps = 88/259 (33%)

Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
            LHEEI  F + ++      K  +   V R+   ++ LWP +   IFGS  TGL LP+SD+
Sbjct: 134  LHEEISDFYEYMSPRPEEEKMRME-VVSRIESVIKELWPSADVQIFGSFKTGLYLPTSDI 192

Query: 1273 DLVVC-----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
            DLVV      LP    L  ++EA                    L   +    DS+K ++ 
Sbjct: 193  DLVVFGKWENLP----LWTLEEA--------------------LRKHKVADEDSVKVLDK 228

Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
              +PII L                       + T +K D                  S  
Sbjct: 229  ATVPIIKLT---------------------DSFTEVKVD-----------------ISFN 250

Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
              + ++AA  ++                    + T+++P    L LVLKQFL  R L++ 
Sbjct: 251  VQNGVRAADLIK--------------------DFTKKYPVLPYLVLVLKQFLLQRDLNEV 290

Query: 1448 YSGGLSSYCLMLLITRFLQ 1466
            ++GG+ SY L L+   FLQ
Sbjct: 291  FTGGIGSYSLFLMAVSFLQ 309


>gi|256818784|ref|NP_001157969.1| PAP-associated domain-containing protein 5 isoform a [Mus musculus]
 gi|256818786|ref|NP_001157970.1| PAP-associated domain-containing protein 5 isoform a [Mus musculus]
          Length = 680

 Score = 72.0 bits (175), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 67/259 (25%), Positives = 101/259 (38%), Gaps = 88/259 (33%)

Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
            LHEEI  F + ++      K  +   V R+   ++ LWP +   IFGS  TGL LP+SD+
Sbjct: 181  LHEEISDFYEYMSPRPEEEKMRME-VVSRIESVIKELWPSADVQIFGSFKTGLYLPTSDI 239

Query: 1273 DLVVC-----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
            DLVV      LP    L  ++EA                    L   +    DS+K ++ 
Sbjct: 240  DLVVFGKWENLP----LWTLEEA--------------------LRKHKVADEDSVKVLDK 275

Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
              +PII L                       + T +K D                  S  
Sbjct: 276  ATVPIIKLT---------------------DSFTEVKVD-----------------ISFN 297

Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
              + ++AA  ++                    + T+++P    L LVLKQFL  R L++ 
Sbjct: 298  VQNGVRAADLIK--------------------DFTKKYPVLPYLVLVLKQFLLQRDLNEV 337

Query: 1448 YSGGLSSYCLMLLITRFLQ 1466
            ++GG+ SY L L+   FLQ
Sbjct: 338  FTGGIGSYSLFLMAVSFLQ 356


>gi|410911160|ref|XP_003969058.1| PREDICTED: DNA polymerase sigma-like [Takifugu rubripes]
          Length = 803

 Score = 72.0 bits (175), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 69/261 (26%), Positives = 103/261 (39%), Gaps = 92/261 (35%)

Query: 1213 LHEEIDSFCKQVA---AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
            LHEE+  F   ++    E   RK  +N    R+   ++ LWP +   IFGS +TGL LP+
Sbjct: 254  LHEEVIDFYNFMSPRPEEAAMRKEVVN----RIETIIKELWPTADVQIFGSFSTGLYLPT 309

Query: 1270 SDVDLVVC----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTV 1325
            SD+DLVV      PP++ LE                       + L      +  S+K +
Sbjct: 310  SDIDLVVFGKWERPPLQELE-----------------------QALRKHNVAEPFSIKVL 346

Query: 1326 ENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCS 1385
            +   +PII L  +                      T +K D                  S
Sbjct: 347  DKATVPIIKLTDQ---------------------ETEVKVD-----------------IS 368

Query: 1386 HTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLD 1445
                  +KAA+ ++                D VK     +P    L  VLKQFL  R L+
Sbjct: 369  FNVETGVKAASFIK----------------DYVK----MYPVLPYLIFVLKQFLLQRDLN 408

Query: 1446 QSYSGGLSSYCLMLLITRFLQ 1466
            + ++GG+SSY L+L++  FLQ
Sbjct: 409  EVFTGGISSYSLILMVISFLQ 429


>gi|351712688|gb|EHB15607.1| PAP-associated domain-containing protein 5, partial [Heterocephalus
            glaber]
          Length = 599

 Score = 72.0 bits (175), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 67/259 (25%), Positives = 101/259 (38%), Gaps = 88/259 (33%)

Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
            LHEEI  F + ++      K  +   V R+   ++ LWP +   IFGS  TGL LP+SD+
Sbjct: 100  LHEEISDFYEYMSPRPEEEKMRME-VVSRIESVIKELWPSADVQIFGSFKTGLYLPTSDI 158

Query: 1273 DLVVC-----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
            DLVV      LP    L  ++EA                    L   +    DS+K ++ 
Sbjct: 159  DLVVFGKWENLP----LWTLEEA--------------------LRKHKVADEDSVKVLDK 194

Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
              +PII L                       + T +K D                  S  
Sbjct: 195  ATVPIIKLT---------------------DSFTEVKVD-----------------ISFN 216

Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
              + ++AA  ++                    + T+++P    L LVLKQFL  R L++ 
Sbjct: 217  VQNGVRAADLIK--------------------DFTKKYPVLPYLVLVLKQFLLQRDLNEV 256

Query: 1448 YSGGLSSYCLMLLITRFLQ 1466
            ++GG+ SY L L+   FLQ
Sbjct: 257  FTGGIGSYSLFLMAVSFLQ 275


>gi|326927263|ref|XP_003209812.1| PREDICTED: PAP-associated domain-containing protein 5-like [Meleagris
            gallopavo]
          Length = 571

 Score = 72.0 bits (175), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 68/260 (26%), Positives = 102/260 (39%), Gaps = 88/260 (33%)

Query: 1212 LLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSD 1271
            +LHEEI  F K ++  +   +  +   V R+   ++ LWP +   IFGS  TGL LP+SD
Sbjct: 70   VLHEEIIDFYKYMSPRHEEERMRME-VVNRIENVIKELWPNADVQIFGSFKTGLYLPTSD 128

Query: 1272 VDLVV-----CLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVE 1326
            +DLVV      LP    L  ++EA                    L        +S+K ++
Sbjct: 129  IDLVVFGKWETLP----LWTLEEA--------------------LRKHNVADENSVKVLD 164

Query: 1327 NTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSH 1386
               +PII L                       + T +K D                  S 
Sbjct: 165  KATVPIIKLT---------------------DSFTEVKVD-----------------ISF 186

Query: 1387 TSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQ 1446
               + +KAA  ++                D +K    ++P    L LVLKQFL  R L++
Sbjct: 187  NVQNGVKAAQLIK----------------DFIK----KYPVLPYLVLVLKQFLLQRDLNE 226

Query: 1447 SYSGGLSSYCLMLLITRFLQ 1466
             ++GG+ SY L L+   FLQ
Sbjct: 227  VFTGGIGSYSLFLMAVSFLQ 246


>gi|159478216|ref|XP_001697200.1| trf4 poly(A) polymerase [Chlamydomonas reinhardtii]
 gi|158274674|gb|EDP00455.1| trf4 poly(A) polymerase [Chlamydomonas reinhardtii]
          Length = 1144

 Score = 72.0 bits (175), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 67/256 (26%), Positives = 102/256 (39%), Gaps = 80/256 (31%)

Query: 1213 LHEEIDSFCK--QVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSS 1270
            LH+E+   C+  Q   E  A +     AV+ V  S+   WP +R  +FGS ATGL +P+S
Sbjct: 308  LHQELVELCELVQPTPEEAAARAGAVAAVREVVGSI---WPSARVEVFGSYATGLYVPTS 364

Query: 1271 DVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAI 1330
            DVDLV+      N+                +  LQ  A+ L  +      S++ +    +
Sbjct: 365  DVDLVILDSGCTNI----------------QAGLQALAKALTKRGV--GQSIQVIGKARV 406

Query: 1331 PIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSD 1390
            PI+                      E   +  L  D       V+ D +  P+ +     
Sbjct: 407  PIVKF--------------------ETVDYGNLAFD-------VSFDVANGPQAAE---- 435

Query: 1391 NIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSG 1450
                                      LVKE+T ++P   PL L LK FL  R L++ Y+G
Sbjct: 436  --------------------------LVKEMTGRWPMMRPLILALKLFLQQRELNEVYTG 469

Query: 1451 GLSSYCLMLLITRFLQ 1466
            G+ SY L+ L++ FLQ
Sbjct: 470  GIGSYALITLVSAFLQ 485


>gi|403337445|gb|EJY67938.1| Nucleotidyltransferase domain containing protein [Oxytricha
            trifallax]
          Length = 2816

 Score = 72.0 bits (175), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 62/232 (26%), Positives = 109/232 (46%), Gaps = 36/232 (15%)

Query: 1234 YINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGIL 1293
            YI   VK+  +     +   +  ++GS ATGL++ SSD+D++V    V N E     G++
Sbjct: 2448 YIEQFVKQAYQE----FVNVKVQMYGSMATGLAIDSSDMDVLVS--GVFNQE-----GLI 2496

Query: 1294 EGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQS 1353
            +    I++    H  +YL     ++S+SL  +E   +P+I +++++            Q 
Sbjct: 2497 DRPQLIEQMRRLH--KYLNELYCLESNSL--IETATVPVIKIILDL------------QK 2540

Query: 1354 PKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQ 1413
             ++     T    N +  DM  L    +       ++ I           F  P H G++
Sbjct: 2541 IRDQNQEQT--SSNIIEEDMKHLKIDITFNEQEGENNYITG-------FGFAQPIHMGIK 2591

Query: 1414 TTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFL 1465
                +K+   ++P    L L+LK+FLA R L+  ++GGLSSY L+LLI  F+
Sbjct: 2592 CCHFIKQKLHEYPQLQSLTLLLKKFLALRDLNSPFTGGLSSYGLILLIIAFI 2643


>gi|170059968|ref|XP_001865594.1| sigma DNA polymerase [Culex quinquefasciatus]
 gi|167878539|gb|EDS41922.1| sigma DNA polymerase [Culex quinquefasciatus]
          Length = 618

 Score = 71.6 bits (174), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 37/92 (40%), Positives = 55/92 (59%), Gaps = 7/92 (7%)

Query: 1375 ALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALV 1434
             LD ++ P    T  +     T V++DISF   S  G+Q+ +L+K+   Q+P    L LV
Sbjct: 94   VLDKASVPIVKLTDRE-----TQVKVDISFNMQS--GVQSAELIKDFKRQYPVLAKLVLV 146

Query: 1435 LKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            LKQFL  R L++ ++GG+SSY L+L+   FLQ
Sbjct: 147  LKQFLLQRDLNEVFTGGISSYSLILMCISFLQ 178



 Score = 54.3 bits (129), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 31/78 (39%), Positives = 42/78 (53%), Gaps = 5/78 (6%)

Query: 1212 LLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSD 1271
            +LHEEI+ F   +    T     +   V R+   +  LWP +R  +FGS  TGL LP+SD
Sbjct: 2    ILHEEIEQFYAHMIPTGTEHTLRVQ-VVARIEAIVLNLWPMARVEMFGSFRTGLYLPTSD 60

Query: 1272 VDLVVC----LPPVRNLE 1285
            +DLVV       P+R LE
Sbjct: 61   IDLVVIGRWEKLPLRTLE 78


>gi|313218878|emb|CBY43207.1| unnamed protein product [Oikopleura dioica]
          Length = 677

 Score = 71.6 bits (174), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 34/73 (46%), Positives = 48/73 (65%), Gaps = 2/73 (2%)

Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
            T +++DISF     TG+    LVK    QFPA   L LVLKQFL  R +++ ++GG+SSY
Sbjct: 48   TDIKVDISFNM--KTGVTAIGLVKGYIRQFPALRYLVLVLKQFLLQRDMNEVWTGGISSY 105

Query: 1456 CLMLLITRFLQHE 1468
             L+L++  FLQH+
Sbjct: 106  GLILMVVSFLQHQ 118


>gi|402583139|gb|EJW77083.1| PAP/25A associated domain-containing protein, partial [Wuchereria
            bancrofti]
          Length = 416

 Score = 71.6 bits (174), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 61/254 (24%), Positives = 101/254 (39%), Gaps = 78/254 (30%)

Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
            LHEE+      +      R   +    +RV   LQ +WP ++ ++FGS  T L LP+SD+
Sbjct: 1    LHEELLDLYAWLKPSPLERALRLR-VFERVRGVLQRIWPTAKIDVFGSLYTSLFLPTSDI 59

Query: 1273 DLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPI 1332
            D+VV    V    P+ +  I    +GI E                   S+  ++   +PI
Sbjct: 60   DVVVESDLVSEEPPLWKTAIALKESGITE-------------------SINVLDKAFVPI 100

Query: 1333 IMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNI 1392
            + +V                             D  ++ D+           S  +   +
Sbjct: 101  VKMV---------------------------DKDTKIYLDI-----------SFNTVQGV 122

Query: 1393 KAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGL 1452
            ++A  +                    +++  ++P   PL LVLKQFL  R L+Q ++GGL
Sbjct: 123  RSAKFI--------------------EDMKMRYPVLEPLVLVLKQFLMQRQLNQVFTGGL 162

Query: 1453 SSYCLMLLITRFLQ 1466
            SSY L+L++  FLQ
Sbjct: 163  SSYGLILMLISFLQ 176


>gi|340371638|ref|XP_003384352.1| PREDICTED: PAP-associated domain-containing protein 5-like
            [Amphimedon queenslandica]
          Length = 462

 Score = 71.6 bits (174), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 40/111 (36%), Positives = 62/111 (55%), Gaps = 10/111 (9%)

Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
            EDA    LK    +   ++ LD +  P    T  +     T V++DISF     TG+ + 
Sbjct: 168  EDAF---LKAQIAIEDSIMVLDKTTVPIIKFTDRE-----TEVKVDISFNQ--ETGIYSA 217

Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            +L+ +  +QFP    LAL++KQFLA R L++ Y GG++SY L+L++  F Q
Sbjct: 218  NLICQYVQQFPYLPYLALIVKQFLAQRQLNEVYYGGINSYSLILMLVSFFQ 268



 Score = 43.9 bits (102), Expect = 0.72,   Method: Compositional matrix adjust.
 Identities = 23/64 (35%), Positives = 36/64 (56%), Gaps = 1/64 (1%)

Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
             HEEI  F   ++ + + R   +    ++V   +  LWPR++   FGS  T L LP+SD+
Sbjct: 93   FHEEIVHFYSYMSPKRSERCMRLR-VFEKVKAIILKLWPRAQVYPFGSFCTNLYLPTSDI 151

Query: 1273 DLVV 1276
            D+VV
Sbjct: 152  DIVV 155


>gi|49899785|gb|AAH76872.1| LOC445836 protein, partial [Xenopus laevis]
          Length = 563

 Score = 71.6 bits (174), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 69/259 (26%), Positives = 100/259 (38%), Gaps = 88/259 (33%)

Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
            LHEEI  F K ++      K  +   V R+   ++ LWP +   IFGS  TGL LP+SD+
Sbjct: 69   LHEEIIDFYKYMSPRPEEEKMRME-VVNRIENVIKELWPNADVQIFGSFKTGLYLPTSDI 127

Query: 1273 DLVVC-----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
            DLVV      LP    L  ++EA                    L        +S+K ++ 
Sbjct: 128  DLVVFGKWENLP----LWTLEEA--------------------LRKHNVADENSVKVLDK 163

Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
              +PII L                       + T +K D                  S  
Sbjct: 164  ATVPIIKLT---------------------DSFTEVKVD-----------------ISFN 185

Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
              + +KAA  ++                D +K    ++P    L LVLKQFL  R L++ 
Sbjct: 186  VQNGVKAAQLIK----------------DFIK----KYPVLPYLVLVLKQFLLQRDLNEV 225

Query: 1448 YSGGLSSYCLMLLITRFLQ 1466
            ++GG+ SY L L+   FLQ
Sbjct: 226  FTGGIGSYSLFLMAVSFLQ 244


>gi|392598007|gb|EIW87329.1| Nucleotidyltransferase [Coniophora puteana RWD-64-598 SS2]
          Length = 383

 Score = 71.2 bits (173), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 69/255 (27%), Positives = 109/255 (42%), Gaps = 74/255 (29%)

Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKR-VTRSLQVLWPRSRTNIFGSNATGLSLPSSD 1271
            LH+EI +F   V    T+++ Y+   V+  V R L+  + RS  N+FGS A GLSLP  D
Sbjct: 73   LHDEITAFVTYVGL--TSQEEYMREQVRALVHRILRNRFRRSEVNVFGSAAQGLSLPGGD 130

Query: 1272 VDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIP 1331
            VD+V+ LP              + R+ +K +  Q AA  +  +  V SD ++      IP
Sbjct: 131  VDIVLSLPHA------------DSRSELKTSLFQLAA--MLKRTGVASD-VEVRHQARIP 175

Query: 1332 IIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDN 1391
            II                S Q+  E     TL+ D  V+++                   
Sbjct: 176  II----------------SFQTVPE---LCTLRFDIGVNNE------------------- 197

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
                               GL+   +V++  +  P   PL LVLK FL+ R L  + +GG
Sbjct: 198  ------------------DGLKALPIVEKYMQDHPVLKPLMLVLKSFLSRRGLSSASNGG 239

Query: 1452 LSSYCLMLLITRFLQ 1466
            ++S+ L+ ++  F+Q
Sbjct: 240  INSFVLLCMVVSFIQ 254


>gi|326676716|ref|XP_686065.4| PREDICTED: DNA polymerase sigma [Danio rerio]
          Length = 706

 Score = 70.9 bits (172), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 68/261 (26%), Positives = 104/261 (39%), Gaps = 92/261 (35%)

Query: 1213 LHEEIDSFCKQVA---AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
            LHEEI  F   ++    E T R+      V R+   ++ LWP +   IFGS +TGL LP+
Sbjct: 210  LHEEIMDFYNFMSPRPEEATMRQE----VVDRIESVIKELWPTADVQIFGSFSTGLFLPT 265

Query: 1270 SDVDLVVC----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTV 1325
            SD+DLVV      PP++ LE               +   +H+          +  S+K +
Sbjct: 266  SDIDLVVFGKWEKPPLQQLE---------------QALRKHSV--------AEPYSIKVL 302

Query: 1326 ENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCS 1385
            +   +PII L  +                      T +K D                  S
Sbjct: 303  DKATVPIIKLTDQ---------------------ETEVKVD-----------------IS 324

Query: 1386 HTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLD 1445
                  IKAA+ ++                    E  +++     L  VLKQFL  R L+
Sbjct: 325  FNVETGIKAASFIK--------------------EYVKKYTVLPYLIFVLKQFLLQRDLN 364

Query: 1446 QSYSGGLSSYCLMLLITRFLQ 1466
            + ++GG+SSY L+L++  FLQ
Sbjct: 365  EVFTGGISSYSLILMVISFLQ 385


>gi|449282422|gb|EMC89255.1| PAP-associated domain-containing protein 5, partial [Columba livia]
          Length = 501

 Score = 70.5 bits (171), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 70/262 (26%), Positives = 101/262 (38%), Gaps = 94/262 (35%)

Query: 1213 LHEEIDSFCKQVA---AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
            LHEEI  F K ++    E   R   +N    R+   ++ LWP +   IFGS  TGL LP+
Sbjct: 1    LHEEIIDFYKYMSPRPEEERMRMEVVN----RIENVIKELWPNADVQIFGSFKTGLYLPT 56

Query: 1270 SDVDLVV-----CLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKT 1324
            SD+DLVV      LP    L  ++EA                    L        +S+K 
Sbjct: 57   SDIDLVVFGKWETLP----LWTLEEA--------------------LRKHNVADENSVKV 92

Query: 1325 VENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKC 1384
            ++   +PII L                       + T +K D                  
Sbjct: 93   LDKATVPIIKLT---------------------DSFTEVKVD-----------------I 114

Query: 1385 SHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSL 1444
            S    + +KAA  ++                D +K    ++P    L LVLKQFL  R L
Sbjct: 115  SFNVQNGVKAAQLIK----------------DFIK----KYPVLPYLVLVLKQFLLQRDL 154

Query: 1445 DQSYSGGLSSYCLMLLITRFLQ 1466
            ++ ++GG+ SY L L+   FLQ
Sbjct: 155  NEVFTGGIGSYSLFLMAVSFLQ 176


>gi|260798238|ref|XP_002594107.1| hypothetical protein BRAFLDRAFT_118785 [Branchiostoma floridae]
 gi|229279340|gb|EEN50118.1| hypothetical protein BRAFLDRAFT_118785 [Branchiostoma floridae]
          Length = 946

 Score = 70.5 bits (171), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 39/105 (37%), Positives = 59/105 (56%), Gaps = 8/105 (7%)

Query: 1363 LKHDNHVHSDMV-ALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKEL 1421
            L+H N    D +  LD +  P    T        T V++DISF  P+  G+++  L+K+ 
Sbjct: 376  LRHSNIADPDSIKVLDKATVPIVKLTDRK-----TDVKVDISFNMPN--GVKSAKLIKKF 428

Query: 1422 TEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
             + +P    L LVLKQFL  R L++ ++GG+SSY L+L+   FLQ
Sbjct: 429  MQDYPTLKYLVLVLKQFLLQRDLNEVFTGGISSYSLILMTVSFLQ 473



 Score = 47.4 bits (111), Expect = 0.066,   Method: Compositional matrix adjust.
 Identities = 26/60 (43%), Positives = 37/60 (61%), Gaps = 5/60 (8%)

Query: 1213 LHEEIDSFCKQVA--AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSS 1270
            LHEEI  F K ++   E  A +   N  V+R+T  ++ +WPR++  IFGS  TGL LP+S
Sbjct: 307  LHEEILDFYKYMSPRMEEFAMR---NEVVQRITNIIRSIWPRAKVEIFGSFETGLYLPTS 363


>gi|403338429|gb|EJY68454.1| hypothetical protein OXYTRI_10932 [Oxytricha trifallax]
          Length = 1545

 Score = 70.5 bits (171), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 58/225 (25%), Positives = 95/225 (42%), Gaps = 46/225 (20%)

Query: 1242 VTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKE 1301
            +T  +Q   P S   ++GS+AT L LP SD+DLV+      +    K    +  R    E
Sbjct: 1183 ITSVIQKALPNSEVKVYGSHATKLCLPWSDIDLVIKTNSTDHYSTPKHVLSIITR----E 1238

Query: 1302 TCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHT 1361
                H  +      W++   +K VEN ++P++ +  ++ H +  S  +S           
Sbjct: 1239 LQSDHTTK------WIQE--VKFVENASVPVVKVKCQIDHIMQTSGLAS----------- 1279

Query: 1362 TLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKEL 1421
                  ++      L+   S                  +DI+  + +H GL+   LV+E 
Sbjct: 1280 -----QNISKYQTFLEQPFS------------------IDITQLTDNHNGLECVKLVQEF 1316

Query: 1422 TEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
              +     PL LVLKQ+L     +  Y GG+SSY L L+I  +LQ
Sbjct: 1317 LSENEVIEPLILVLKQYLKVCQYNDPYFGGISSYALFLMIVSYLQ 1361


>gi|328713984|ref|XP_001948412.2| PREDICTED: PAP-associated domain-containing protein 5-like
            [Acyrthosiphon pisum]
          Length = 595

 Score = 70.1 bits (170), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 36/95 (37%), Positives = 57/95 (60%), Gaps = 7/95 (7%)

Query: 1372 DMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPL 1431
            ++  LD ++ P    T       AT VR+DISF    + G+++ +++K   + FP    L
Sbjct: 205  NIQVLDKASVPIVKMTDK-----ATDVRVDISFNM--NNGVKSAEMIKHYMDVFPMLPKL 257

Query: 1432 ALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
             LVLKQFLA R +++ + GG+SSY L+L++  FLQ
Sbjct: 258  VLVLKQFLAQRDMNEVFYGGISSYSLILMVVSFLQ 292



 Score = 52.0 bits (123), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 27/68 (39%), Positives = 42/68 (61%), Gaps = 7/68 (10%)

Query: 1213 LHEEIDSFCKQVAA---ENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
            LH+EI+ F   ++A   E++ R       ++RVT  +   WP+++  +FGS  TGL LP+
Sbjct: 118  LHKEIEDFFAYMSATPEEHSMRLD----VLERVTNVIHAEWPKAKVEVFGSFRTGLYLPT 173

Query: 1270 SDVDLVVC 1277
            SD+DLVV 
Sbjct: 174  SDMDLVVI 181


>gi|325181595|emb|CCA16045.1| Poly(A) RNA polymerase putative [Albugo laibachii Nc14]
 gi|325191995|emb|CCA26462.1| Poly(A) RNA polymerase putative [Albugo laibachii Nc14]
          Length = 494

 Score = 70.1 bits (170), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 34/78 (43%), Positives = 51/78 (65%), Gaps = 3/78 (3%)

Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
            +++++DISF   S  GL T DL+K+    FP+  PL LVLK FLA R L++++ GG+ S+
Sbjct: 232  SAIQVDISFNISS--GLATADLIKQYMRIFPSFRPLVLVLKYFLAQRELNETFQGGIGSF 289

Query: 1456 CLMLLITRFL-QHEHHLG 1472
             L L++  FL Q+   LG
Sbjct: 290  LLQLMVVSFLQQYRRQLG 307


>gi|157133264|ref|XP_001656207.1| sigma DNA polymerase, putative [Aedes aegypti]
 gi|108870912|gb|EAT35137.1| AAEL012692-PA [Aedes aegypti]
          Length = 491

 Score = 70.1 bits (170), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 36/92 (39%), Positives = 55/92 (59%), Gaps = 7/92 (7%)

Query: 1375 ALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALV 1434
             LD ++ P    T  +     T V++DISF   S  G+Q+ +L+K+   ++P    L LV
Sbjct: 81   VLDKASVPIVKLTDRE-----TQVKVDISFNMQS--GVQSAELIKDFKRKYPVLAKLVLV 133

Query: 1435 LKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            LKQFL  R L++ ++GG+SSY L+L+   FLQ
Sbjct: 134  LKQFLLQRDLNEVFTGGISSYSLILMCISFLQ 165



 Score = 46.6 bits (109), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 24/52 (46%), Positives = 32/52 (61%), Gaps = 4/52 (7%)

Query: 1238 AVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVC----LPPVRNLE 1285
             V R+ + +  LWP +R  +FGS  TGL LP+SD+DLVV       P+R LE
Sbjct: 14   VVARIEQIVLNLWPAARVEMFGSFRTGLYLPTSDIDLVVIGLWEKLPLRTLE 65


>gi|145478719|ref|XP_001425382.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124392452|emb|CAK57984.1| unnamed protein product [Paramecium tetraurelia]
          Length = 592

 Score = 70.1 bits (170), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 33/74 (44%), Positives = 45/74 (60%)

Query: 1393 KAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGL 1452
            K   + RLDIS +   H GLQ   L+++  + +    PL L++KQFL    L  +YSGGL
Sbjct: 374  KQYMNKRLDISIQETKHNGLQCVQLIRKYIKNYEPLKPLTLIMKQFLHKSDLSDTYSGGL 433

Query: 1453 SSYCLMLLITRFLQ 1466
            SSY L+L+I  FLQ
Sbjct: 434  SSYGLILMIVSFLQ 447


>gi|198451695|ref|XP_001358484.2| GA14518 [Drosophila pseudoobscura pseudoobscura]
 gi|198131603|gb|EAL27623.2| GA14518 [Drosophila pseudoobscura pseudoobscura]
          Length = 400

 Score = 70.1 bits (170), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 73/275 (26%), Positives = 111/275 (40%), Gaps = 92/275 (33%)

Query: 1211 SLLHEEIDSFCKQVAAENTARKPYINW-AVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
            SLLH EI+ F   +  ++TA + Y+   AV+ +   L  +WP +   +FGS  TGL+LP 
Sbjct: 26   SLLHLEIEGFYNFI--QSTATEFYLRAEAVRCIEDMLVSIWPGASVEVFGSFRTGLNLPL 83

Query: 1270 SDVDLVV-------CLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSL 1322
            SD+DLVV         PP+  LE                   +  AR +A+ +     ++
Sbjct: 84   SDIDLVVENRKSYWYTPPLYELES------------------ELVARGVADPQ-----TV 120

Query: 1323 KTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASP 1382
              V+  A+P++     V                     + +K D                
Sbjct: 121  NVVDTAAVPVVKFTERV---------------------SQIKFD---------------- 143

Query: 1383 KCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADR 1442
              S      +KAA  ++                DL++E    FP    L +VLKQFLA  
Sbjct: 144  -ISFNVGSGVKAAELIK----------------DLIQE----FPDLPKLVMVLKQFLAMH 182

Query: 1443 SLDQSY-SGGLSSYCLMLLITRFLQHEHHLGRPIN 1476
             L+  Y SGG+SSY + L+   FLQH+    +  N
Sbjct: 183  GLNDVYRSGGISSYGITLMCIGFLQHQSQSNKKYN 217


>gi|440900205|gb|ELR51393.1| PAP-associated domain-containing protein 5, partial [Bos grunniens
            mutus]
          Length = 563

 Score = 70.1 bits (170), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 68/259 (26%), Positives = 100/259 (38%), Gaps = 88/259 (33%)

Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
            LHEEI  F + ++      K  +   V R+   ++ LWP +   IFGS  TGL LP+SD+
Sbjct: 64   LHEEISDFYEYMSPRPEEEKMRME-VVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDI 122

Query: 1273 DLVVC-----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
            DLVV      LP    L  ++EA                    L   +    DS+K ++ 
Sbjct: 123  DLVVFGKWENLP----LWTLEEA--------------------LRKHKVADEDSVKVLDK 158

Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
              +PII L                       + T +K D                  S  
Sbjct: 159  ATVPIIKLT---------------------DSFTEVKVD-----------------ISFN 180

Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
              + ++AA  ++                D  K    ++P    L LVLKQFL  R L++ 
Sbjct: 181  VQNGVRAADLIK----------------DFTK----KYPVLPYLVLVLKQFLLQRDLNEV 220

Query: 1448 YSGGLSSYCLMLLITRFLQ 1466
            ++GG+ SY L L+   FLQ
Sbjct: 221  FTGGIGSYSLFLMAVSFLQ 239


>gi|145530898|ref|XP_001451221.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124418865|emb|CAK83824.1| unnamed protein product [Paramecium tetraurelia]
          Length = 583

 Score = 69.7 bits (169), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 33/74 (44%), Positives = 45/74 (60%)

Query: 1393 KAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGL 1452
            K   + RLDIS +   H GLQ   L+++  + +    PL L++KQFL    L  +YSGGL
Sbjct: 365  KQYMNKRLDISIQETKHNGLQCVQLIRKYIKNYEPLKPLTLIMKQFLHKSDLSDTYSGGL 424

Query: 1453 SSYCLMLLITRFLQ 1466
            SSY L+L+I  FLQ
Sbjct: 425  SSYGLILMIVSFLQ 438


>gi|118794510|ref|XP_321547.3| AGAP001564-PA [Anopheles gambiae str. PEST]
 gi|19572384|emb|CAD27930.1| putative DNA topoisomerase [Anopheles gambiae]
 gi|116116329|gb|EAA01236.4| AGAP001564-PA [Anopheles gambiae str. PEST]
          Length = 1039

 Score = 69.7 bits (169), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 33/75 (44%), Positives = 47/75 (62%), Gaps = 2/75 (2%)

Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
            T V++DISF   S  G+Q+  L+K    ++P    L LVLKQFL  R L++ ++GG+SSY
Sbjct: 378  TQVKVDISFNMES--GVQSAKLIKGFKREYPVLEKLVLVLKQFLLQRDLNEVFTGGISSY 435

Query: 1456 CLMLLITRFLQHEHH 1470
             L+L+   FLQ  H 
Sbjct: 436  SLILMCISFLQQHHQ 450



 Score = 63.5 bits (153), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 48/166 (28%), Positives = 76/166 (45%), Gaps = 33/166 (19%)

Query: 1199 CPTRKASLSLMHSL--LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTN 1256
            CP R  +      +  LH+EI+ F   + A  T     +   V R+ + +Q LWP +R  
Sbjct: 255  CPWRPPNFKYGRGIIGLHQEIEQFYAHMIATPTEHALRV-MVVSRIEKIVQNLWPSARVE 313

Query: 1257 IFGSNATGLSLPSSDVDLVVC----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLA 1312
            +FGS  TGL LP+SD+DLVV     + P+R LE                         L 
Sbjct: 314  MFGSFRTGLYLPTSDIDLVVIGQWTMLPLRTLE-----------------------MELI 350

Query: 1313 NQEWVKSDSLKTVENTAIPIIMLV---VEVPHDLIASAASSVQSPK 1355
            +Q   + +S++ ++  ++PI+ L     +V  D+  +  S VQS K
Sbjct: 351  SQGIAEPNSVRVLDKASVPIVKLTDRQTQVKVDISFNMESGVQSAK 396


>gi|426382137|ref|XP_004057677.1| PREDICTED: PAP-associated domain-containing protein 5 isoform 1
            [Gorilla gorilla gorilla]
          Length = 631

 Score = 69.7 bits (169), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 68/259 (26%), Positives = 100/259 (38%), Gaps = 88/259 (33%)

Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
            LHEEI  F + ++      K  +   V R+   ++ LWP +   IFGS  TGL LP+SD+
Sbjct: 132  LHEEISDFYEYMSPRPEEEKMRME-VVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDI 190

Query: 1273 DLVVC-----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
            DLVV      LP    L  ++EA                    L   +    DS+K ++ 
Sbjct: 191  DLVVFGKWENLP----LWTLEEA--------------------LRKHKVADEDSVKVLDK 226

Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
              +PII L                       + T +K D                  S  
Sbjct: 227  ATVPIIKLT---------------------DSFTEVKVD-----------------ISFN 248

Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
              + ++AA  ++                D  K    ++P    L LVLKQFL  R L++ 
Sbjct: 249  VQNGVRAADLIK----------------DFTK----KYPVLPYLVLVLKQFLLQRDLNEV 288

Query: 1448 YSGGLSSYCLMLLITRFLQ 1466
            ++GG+ SY L L+   FLQ
Sbjct: 289  FTGGIGSYSLFLMAVSFLQ 307


>gi|5565687|gb|AAD45198.1|AF089896_1 topoisomerase-related function protein [Homo sapiens]
          Length = 517

 Score = 69.7 bits (169), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 68/264 (25%), Positives = 101/264 (38%), Gaps = 98/264 (37%)

Query: 1213 LHEEIDSF------CKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLS 1266
            LHEEI  F      C + AA            VKR+   ++ LWP +   IFGS +TGL 
Sbjct: 30   LHEEIIDFYNFMSPCPEEAAMRRE-------VVKRIETVVKDLWPTADVQIFGSFSTGLY 82

Query: 1267 LPSSDVDLVVC----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSL 1322
            LP+SD+DLVV      PP++ LE          ++ + E C                 S+
Sbjct: 83   LPTSDIDLVVFGKWERPPLQLLEQALR------KHNVAEPC-----------------SI 119

Query: 1323 KTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASP 1382
            K ++   +PII L                              +  V  D+         
Sbjct: 120  KVLDKATVPIIKLT---------------------------DQETEVKVDI--------- 143

Query: 1383 KCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADR 1442
              S      ++AA                    + +K   +++     L LVLKQFL  R
Sbjct: 144  --SFNMETGVRAA--------------------EFIKNYMKKYSLLPYLILVLKQFLLQR 181

Query: 1443 SLDQSYSGGLSSYCLMLLITRFLQ 1466
             L++ ++GG+SSY L+L+   FLQ
Sbjct: 182  DLNEVFTGGISSYSLILMAISFLQ 205


>gi|313241181|emb|CBY33472.1| unnamed protein product [Oikopleura dioica]
          Length = 422

 Score = 69.3 bits (168), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 64/256 (25%), Positives = 103/256 (40%), Gaps = 82/256 (32%)

Query: 1213 LHEEIDSFCK--QVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSS 1270
            LHEEI+ F K  Q      A +  + W +++V + L   WP ++   FGS  TGL LP  
Sbjct: 88   LHEEIEDFIKFMQPTESEQAMRDDVVWRIRQVVKEL---WPSAKLETFGSYNTGLYLPDG 144

Query: 1271 DVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAI 1330
            D+D+V+           ++  + + RN + E       R +A +E     ++  +E   +
Sbjct: 145  DIDMVIQ-------GQWEQLPMWQLRNKLVE-------RRIAREE-----NITVIEKAVV 185

Query: 1331 PIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSD 1390
            PII L+                           + +  VH D+           S  +S+
Sbjct: 186  PIIKLI---------------------------ESNTLVHVDI-----------SFNTSN 207

Query: 1391 NIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSG 1450
              +AA                     LVK+   ++P    L ++LK  L  R L++ + G
Sbjct: 208  GREAAA--------------------LVKKYMAEYPNLKQLVVLLKYILNHRGLNEVWKG 247

Query: 1451 GLSSYCLMLLITRFLQ 1466
            GL SY L LL+  FLQ
Sbjct: 248  GLGSYALTLLVVNFLQ 263


>gi|145514940|ref|XP_001443375.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124410753|emb|CAK75978.1| unnamed protein product [Paramecium tetraurelia]
          Length = 962

 Score = 69.3 bits (168), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 65/248 (26%), Positives = 107/248 (43%), Gaps = 52/248 (20%)

Query: 1235 INWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVV-----CLPPVRNLEPIKE 1289
            IN+  +     +  L   S+  +FGS ATGL+LP SD+D+ +     C P   NL PI+ 
Sbjct: 626  INYLYRGFIHHIYQLDFNSQVCLFGSCATGLALPESDIDIGITGFEMCSPSQLNL-PIQ- 683

Query: 1290 AGILEGRNGIKETCLQHAARYLANQEWVKSDSL-KTVENTAIPIIMLVVEVPHDLIASAA 1348
                      K T      R++ N  + KS    + + ++A+P+I L V+ P      ++
Sbjct: 684  ----------KLTEFLQKMRWVNNIVYFKSIYFFRAITSSAMPLIKLQVD-PTISFVQSS 732

Query: 1349 SSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISF---- 1404
              +  P               + D+V   D   PK   +            +D+SF    
Sbjct: 733  LPIGLP---------------YIDLVLNPDEDIPKQIFS------------VDVSFFQYS 765

Query: 1405 --KSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLIT 1462
              K   H GL +TDL  +    +    P+ L+ K  L  R L+  Y GG+SS+C++ ++ 
Sbjct: 766  GPKQNQHLGLISTDLTLQWLSFYSELRPIVLLFKSLLKKRGLNDQYKGGISSFCIIQMVL 825

Query: 1463 RFLQHEHH 1470
             FL+  +H
Sbjct: 826  AFLECFYH 833


>gi|194208578|ref|XP_001491462.2| PREDICTED: PAP-associated domain-containing protein 5 [Equus
            caballus]
          Length = 528

 Score = 69.3 bits (168), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 68/259 (26%), Positives = 100/259 (38%), Gaps = 88/259 (33%)

Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
            LHEEI  F + ++      K  +   V R+   ++ LWP +   IFGS  TGL LP+SD+
Sbjct: 29   LHEEISDFYEYMSPRPEEEKMRME-VVNRIESVIKGLWPSADVQIFGSFKTGLYLPTSDI 87

Query: 1273 DLVVC-----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
            DLVV      LP    L  ++EA                    L   +    DS+K ++ 
Sbjct: 88   DLVVFGKWENLP----LWTLEEA--------------------LRKHKVADEDSVKVLDK 123

Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
              +PII L                       + T +K D                  S  
Sbjct: 124  ATVPIIKLT---------------------DSFTEVKVD-----------------ISFN 145

Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
              + ++AA  ++                D  K    ++P    L LVLKQFL  R L++ 
Sbjct: 146  VQNGVRAADLIK----------------DFTK----KYPVLPYLVLVLKQFLLQRDLNEV 185

Query: 1448 YSGGLSSYCLMLLITRFLQ 1466
            ++GG+ SY L L+   FLQ
Sbjct: 186  FTGGIGSYSLFLMAVSFLQ 204


>gi|403331574|gb|EJY64740.1| Poly(A) RNA polymerase putative [Oxytricha trifallax]
          Length = 316

 Score = 69.3 bits (168), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 34/79 (43%), Positives = 49/79 (62%), Gaps = 2/79 (2%)

Query: 1398 VRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCL 1457
            V +DISF   +  G+    LVK+L + FP   PL +VLK FL  R L++ YSGG+ S+ L
Sbjct: 124  VNVDISFNRTN--GVYCVKLVKQLLQMFPELKPLMMVLKCFLKSRQLNEPYSGGVGSFLL 181

Query: 1458 MLLITRFLQHEHHLGRPIN 1476
             +++T FLQ ++ LG   N
Sbjct: 182  TMMVTSFLQRQYKLGNTNN 200


>gi|170592851|ref|XP_001901178.1| PAP/25A associated domain containing protein [Brugia malayi]
 gi|158591245|gb|EDP29858.1| PAP/25A associated domain containing protein [Brugia malayi]
          Length = 421

 Score = 68.9 bits (167), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 66/265 (24%), Positives = 105/265 (39%), Gaps = 80/265 (30%)

Query: 1202 RKASLSLMHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSN 1261
            R+ +LSL    LHEE+      +      R   +    +RV   LQ +WP ++ ++FGS 
Sbjct: 58   RRYALSLRG--LHEELLDLYAWLKPSPLERALRLR-VFERVRGVLQRIWPTAKIDVFGSL 114

Query: 1262 ATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDS 1321
             T L LP+SD+D+VV    V    P+ +  I    +GI E                   S
Sbjct: 115  YTSLFLPTSDIDVVVESDLVSEEPPLWKTAIALKESGITE-------------------S 155

Query: 1322 LKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSAS 1381
            +  ++   +PI+ +V                             D  ++ D+        
Sbjct: 156  INVLDKAFVPIVKMV---------------------------DKDTKIYLDI-------- 180

Query: 1382 PKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLAD 1441
               S  +   +++A  +  D+  + P                      PL LVLKQFL  
Sbjct: 181  ---SFNTVQGVRSAKFIE-DMKMRYP-------------------VLEPLVLVLKQFLMQ 217

Query: 1442 RSLDQSYSGGLSSYCLMLLITRFLQ 1466
            R L+Q ++GGLSSY L+L++  FLQ
Sbjct: 218  RQLNQVFTGGLSSYGLILMLISFLQ 242


>gi|145491674|ref|XP_001431836.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124398942|emb|CAK64438.1| unnamed protein product [Paramecium tetraurelia]
          Length = 685

 Score = 68.9 bits (167), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 61/267 (22%), Positives = 109/267 (40%), Gaps = 73/267 (27%)

Query: 1200 PTRKASLSLMHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFG 1259
            P  K S+  +   L  EI++F ++++     +KP I+  V+ V  ++Q +  +SR  ++G
Sbjct: 324  PQHKISIDDIVKQLTFEINTFTQELSKYLDEQKPIIDKIVQLVDETVQNVQFKSRAFLYG 383

Query: 1260 SNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKS 1319
            S  TGL+L  SD+D+V+        E ++   I+          L   A       ++K 
Sbjct: 384  SCQTGLNLLDSDIDIVI--------ETVESEEII---------LLFKLAEQFKTTSFIK- 425

Query: 1320 DSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDS 1379
              +K +EN   P++ +                                            
Sbjct: 426  -DVKVIENAKKPVLKM-------------------------------------------- 440

Query: 1380 ASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFL 1439
               +CS    D +       +DI+     H+G +T + + E  ++F     LAL+LK + 
Sbjct: 441  ---QCSKEFQDKL-------IDITISRNDHSGRKTANSMIEFQKEFKQFRSLALMLKFYF 490

Query: 1440 ADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
               +L  SY GGL+SYC++ +I   LQ
Sbjct: 491  KSINLLNSYQGGLNSYCILTMILALLQ 517


>gi|145539774|ref|XP_001455577.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124423385|emb|CAK88180.1| unnamed protein product [Paramecium tetraurelia]
          Length = 1013

 Score = 68.9 bits (167), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 72/302 (23%), Positives = 130/302 (43%), Gaps = 59/302 (19%)

Query: 1178 LARDQEHPDVAFPLQPLEVQNCPTRKASLSLMHSLLHEEIDSFCKQVAAENTARKPYINW 1237
            L+  QE P+ +   Q   VQ    +K    L+  + + +I  F   + +E     P+   
Sbjct: 635  LSIKQESPEKSLKEQLENVQ----QKGKQKLIEQI-NSDILDFTDNIMSEYEEMLPFRFL 689

Query: 1238 AVKRVTRSLQVLW-----PRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGI 1292
            A +RV   +Q ++         + +FGS ATGL+L  SD+D+ +    V N         
Sbjct: 690  AFERVKSVIQKVFLGIPDGMITSRLFGSCATGLALLDSDIDIGINGLEVYN--------- 740

Query: 1293 LEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQ 1352
               RN +K +   +     + ++WV   +   + N+++PII L ++ P   I        
Sbjct: 741  ---RNMLK-SHFDNLYFEFSRKKWVVKAN--PIFNSSVPIIKLEID-PQINIFEYEGRNL 793

Query: 1353 SPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFK------S 1406
              ++      LK                            K  + +++DISF        
Sbjct: 794  DEQQIQLWKRLKQ---------------------------KLKSGIKVDISFNFNGNGIY 826

Query: 1407 PSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
             +H G  TTDLVK+  E++P+   + L+LK  +    L +SY+GGLSSY L++++  +++
Sbjct: 827  STHLGSITTDLVKKWMEEYPSLQQIVLILKSMIKKLKLSESYTGGLSSYSLIIMVYSYMR 886

Query: 1467 HE 1468
             +
Sbjct: 887  EQ 888


>gi|195053696|ref|XP_001993762.1| GH19417 [Drosophila grimshawi]
 gi|193895632|gb|EDV94498.1| GH19417 [Drosophila grimshawi]
          Length = 340

 Score = 68.6 bits (166), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 40/111 (36%), Positives = 62/111 (55%), Gaps = 13/111 (11%)

Query: 1369 VHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPAS 1428
            + +D++ LD ++ P    T     +  T ++ DISF     TG++  +L++   EQFP  
Sbjct: 108  IATDVLVLDKASVPVVKFT-----EHITQIKFDISFNKT--TGVKAAELIQRYIEQFPEL 160

Query: 1429 TPLALVLKQFLADRSLDQSYS-GGLSSYCLMLLITRFLQH-----EHHLGR 1473
              L +VLKQFL  R L++ YS GG+SSY + L+   FLQ      ++ LGR
Sbjct: 161  PKLVIVLKQFLMLRGLNEVYSTGGISSYAITLMCISFLQQHSSKNDNKLGR 211



 Score = 47.4 bits (111), Expect = 0.069,   Method: Compositional matrix adjust.
 Identities = 25/64 (39%), Positives = 34/64 (53%), Gaps = 1/64 (1%)

Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
            LH EI+ F   +    T     +  AV+R+   L  LWP     +FGS  TGL+LP SD+
Sbjct: 23   LHMEIEHFYNYIVNTRTEYMMRLE-AVQRIENLLLGLWPEVSIEVFGSFRTGLNLPISDI 81

Query: 1273 DLVV 1276
            D+ V
Sbjct: 82   DIAV 85


>gi|195045442|ref|XP_001991977.1| GH24509 [Drosophila grimshawi]
 gi|193892818|gb|EDV91684.1| GH24509 [Drosophila grimshawi]
          Length = 489

 Score = 68.6 bits (166), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 37/92 (40%), Positives = 55/92 (59%), Gaps = 7/92 (7%)

Query: 1375 ALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALV 1434
             LD ++ P    T  +     T V++DISF   S  G+Q+ +L+K+   +FP    L LV
Sbjct: 359  VLDKASVPIIKLTDRE-----TQVKVDISFNMQS--GVQSAELIKQFKREFPVLGKLVLV 411

Query: 1435 LKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            LKQFL  R L++ ++GG+SSY L+L+   FLQ
Sbjct: 412  LKQFLLLRDLNEVFTGGISSYSLILMCICFLQ 443



 Score = 59.7 bits (143), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 54/172 (31%), Positives = 79/172 (45%), Gaps = 39/172 (22%)

Query: 1194 LEVQNCPTRKASLSLMHSL--LHEEIDSFCKQVA---AENTARKPYINWAVKRVTRSLQV 1248
            ++ + CP R         +  LHEEI+ F + V     E+  R    N  VKR+   +Q 
Sbjct: 247  VKYKGCPWRVRDFQYGDGIIGLHEEIEHFYQYVLPTPCEHAIR----NEVVKRIEAVVQS 302

Query: 1249 LWPRSRTNIFGSNATGLSLPSSDVDLVVC----LPPVRNLEPIKEAGILEGRNGIKETCL 1304
            +WP++   IFGS  TGL LP+SD+DLVV       P+R LE       L  R GI E C 
Sbjct: 303  IWPQAVVEIFGSFRTGLFLPTSDIDLVVLGLWEKLPLRTLE-----FELVSR-GIAEAC- 355

Query: 1305 QHAARYLANQEWVKSDSLKTVENTAIPIIMLV---VEVPHDLIASAASSVQS 1353
                            +++ ++  ++PII L     +V  D+  +  S VQS
Sbjct: 356  ----------------TVRVLDKASVPIIKLTDRETQVKVDISFNMQSGVQS 391


>gi|145541945|ref|XP_001456660.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124424473|emb|CAK89263.1| unnamed protein product [Paramecium tetraurelia]
          Length = 805

 Score = 68.2 bits (165), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 58/256 (22%), Positives = 102/256 (39%), Gaps = 53/256 (20%)

Query: 1216 EIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLV 1275
            +I  F  Q+  EN  + P       R+  ++Q+L+  +   +FGS  T L+LP SD+D+ 
Sbjct: 480  DIKEFTDQIRRENDQQFPIRQLIFNRIQFTIQLLFKEAGVCLFGSCVTRLALPDSDIDI- 538

Query: 1276 VCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIML 1335
                           GI      +    +     +L+   WVK         T +P+I L
Sbjct: 539  ---------------GITGLETHLLNQKMDAIIEFLSKMNWVKRIKPIYPTQTTLPLIKL 583

Query: 1336 VVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAA 1395
             V+     I   + ++  P               H D+V                    +
Sbjct: 584  WVDPS---IPFRSGNMNLP---------------HIDLVC------------------QS 607

Query: 1396 TSVRLDISFKS-PSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSS 1454
              +++DISF     H GL +T+L     +++     + L+ K  L  R L+    GG+SS
Sbjct: 608  QLIQVDISFFGHIQHQGLTSTELTCYWLQEYQELKTITLLFKSLLKKRGLNDQSKGGISS 667

Query: 1455 YCLMLLITRFLQHEHH 1470
            +CL+L++  FL++ + 
Sbjct: 668  FCLVLIVVAFLEYHYQ 683


>gi|432853107|ref|XP_004067543.1| PREDICTED: PAP-associated domain-containing protein 5-like [Oryzias
            latipes]
          Length = 679

 Score = 68.2 bits (165), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 38/96 (39%), Positives = 53/96 (55%), Gaps = 7/96 (7%)

Query: 1371 SDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTP 1430
            S +  LD +  P    T S      T V++DISF   S  G++   L+KE  E++P    
Sbjct: 273  SAIKVLDKATVPIIKLTDS-----VTEVKVDISFNVES--GVKAARLIKEFKEKYPVLPY 325

Query: 1431 LALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            L LVLKQFL  R L++ ++GG+ SY L L+   FLQ
Sbjct: 326  LVLVLKQFLLQRDLNEVFTGGIGSYSLFLMAVSFLQ 361



 Score = 53.1 bits (126), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 45/151 (29%), Positives = 69/151 (45%), Gaps = 33/151 (21%)

Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
            LHEEI  F   ++      K  +   V R+   +  LWP +   +FGS +TGL LP+SD+
Sbjct: 186  LHEEIKDFYDYISPRPEEEKMRLE-VVDRIKGVIHDLWPSAEVQVFGSFSTGLYLPTSDI 244

Query: 1273 DLVV-----CLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
            DLVV      LP    L  ++EA  L  RN   ++                  ++K ++ 
Sbjct: 245  DLVVFGKWETLP----LWTLEEA--LRKRNVADKS------------------AIKVLDK 280

Query: 1328 TAIPIIML---VVEVPHDLIASAASSVQSPK 1355
              +PII L   V EV  D+  +  S V++ +
Sbjct: 281  ATVPIIKLTDSVTEVKVDISFNVESGVKAAR 311


>gi|308805789|ref|XP_003080206.1| S-M checkpoint control protein CID1 and related
            nucleotidyltransferases (ISS) [Ostreococcus tauri]
 gi|116058666|emb|CAL54373.1| S-M checkpoint control protein CID1 and related
            nucleotidyltransferases (ISS) [Ostreococcus tauri]
          Length = 555

 Score = 68.2 bits (165), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 62/257 (24%), Positives = 99/257 (38%), Gaps = 83/257 (32%)

Query: 1213 LHEEI---DSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
            LH EI     F +   AE ++R      AV+RV   ++ +WP +R  + GS ATG+ LP 
Sbjct: 127  LHNEIVDFSRFLEPTEAEASSR----TAAVERVRDVVKGIWPNARFEVHGSFATGMYLPG 182

Query: 1270 SDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTA 1329
            SD+D V+     +N                   CL+  A  LA ++   +  ++ +    
Sbjct: 183  SDIDAVILDSGAKN----------------PGVCLKALAIALARRDM--AIKIQLIAKAR 224

Query: 1330 IPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSS 1389
            +PI+                 V+S                H   ++ D +  P  +    
Sbjct: 225  VPIVKF-------------EEVESG---------------HQFDISFDVANGPASAEIVR 256

Query: 1390 DNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYS 1449
            +N++                              +FPA  PL  VLK FLA R L++ YS
Sbjct: 257  ENMR------------------------------RFPALRPLTTVLKAFLAQRGLNEVYS 286

Query: 1450 GGLSSYCLMLLITRFLQ 1466
            GG+ SY L+ ++   LQ
Sbjct: 287  GGIGSYALLCMVMAHLQ 303


>gi|162317662|gb|AAI56330.1| PAP associated domain containing 5 [synthetic construct]
 gi|162318878|gb|AAI57080.1| PAP associated domain containing 5 [synthetic construct]
          Length = 442

 Score = 68.2 bits (165), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 38/103 (36%), Positives = 55/103 (53%), Gaps = 7/103 (6%)

Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
            KH       +  LD +  P    T S      T V++DISF      G++  DL+K+ T+
Sbjct: 70   KHKVADEDSVKVLDKATVPIIKLTDS-----FTEVKVDISFNV--QNGVRAADLIKDFTK 122

Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            ++P    L LVLKQFL  R L++ ++GG+ SY L L+   FLQ
Sbjct: 123  KYPVLPYLVLVLKQFLLQRDLNEVFTGGIGSYSLFLMAVSFLQ 165


>gi|357130698|ref|XP_003566984.1| PREDICTED: PAP-associated domain-containing protein 5-like
            [Brachypodium distachyon]
          Length = 619

 Score = 67.8 bits (164), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 61/260 (23%), Positives = 99/260 (38%), Gaps = 89/260 (34%)

Query: 1213 LHEEIDSFCKQV---AAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
            LH+EI  FC  +   A E ++R      AV+ V+  ++ +WP  +  +FGS  TGL LP+
Sbjct: 176  LHKEILDFCDFISPSAEEQSSRTA----AVQAVSDVVKHIWPHCKVEVFGSFRTGLYLPT 231

Query: 1270 SDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTA 1329
            SD+D+V                I E R    +  L   A+ L+ +   K   ++ +    
Sbjct: 232  SDIDVV----------------IFESRVKTPQVGLYALAKALSQKGVAK--KIQVIAKAR 273

Query: 1330 IPIIMLVVEV---PHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSH 1386
            +PI+  V  V   P D                               ++ D    P+ + 
Sbjct: 274  VPIVKFVERVSGIPFD-------------------------------ISFDIDGGPQAAD 302

Query: 1387 TSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQ 1446
               D I+                              + PA  PL ++LK FL  R L++
Sbjct: 303  FIKDAIR------------------------------KMPALRPLCMILKVFLHQRELNE 332

Query: 1447 SYSGGLSSYCLMLLITRFLQ 1466
             Y+GG+ SY L+ ++   LQ
Sbjct: 333  VYTGGVGSYALLTMLITHLQ 352


>gi|297597347|ref|NP_001043830.2| Os01g0672700 [Oryza sativa Japonica Group]
 gi|56201854|dbj|BAD73304.1| polymerase (DNA directed) sigma-like [Oryza sativa Japonica Group]
 gi|56201907|dbj|BAD73357.1| polymerase (DNA directed) sigma-like [Oryza sativa Japonica Group]
 gi|255673541|dbj|BAF05744.2| Os01g0672700 [Oryza sativa Japonica Group]
          Length = 578

 Score = 67.8 bits (164), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 63/276 (22%), Positives = 106/276 (38%), Gaps = 96/276 (34%)

Query: 1213 LHEEIDSFCKQV---AAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
            LH+EI  FC  +   A E ++R      AVK V+  ++ +WP+ +  +FGS  TGL LP+
Sbjct: 142  LHKEILDFCDFISPSAEEQSSRTA----AVKAVSNVIKHIWPQCKVEVFGSFRTGLFLPT 197

Query: 1270 SDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTA 1329
            SD+D+V                I + R    +  L   A+ L+ +   K   ++ +    
Sbjct: 198  SDIDVV----------------IFDSRVKTPQVGLYALAKALSQKGVAK--KIQVIAKAR 239

Query: 1330 IPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSS 1389
            +PI+  V                           +  + +  D ++ D    P+ +    
Sbjct: 240  VPIVKFV---------------------------ERKSEIAFD-ISFDMDGGPQAADFIK 271

Query: 1390 DNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYS 1449
            D +K                              +FPA   L ++LK FL  R L++ Y+
Sbjct: 272  DYVK------------------------------KFPALRHLCMILKVFLHQRELNEVYT 301

Query: 1450 GGLSSYCLMLLITRFLQ-------------HEHHLG 1472
            GG+ SY L+ ++   LQ              EH+LG
Sbjct: 302  GGIGSYALLTMLITHLQLIWGGKDILGYRKKEHNLG 337


>gi|116235017|dbj|BAF34948.1| hypothetical protein [Oryza sativa Japonica Group]
          Length = 578

 Score = 67.8 bits (164), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 63/276 (22%), Positives = 106/276 (38%), Gaps = 96/276 (34%)

Query: 1213 LHEEIDSFCKQV---AAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
            LH+EI  FC  +   A E ++R      AVK V+  ++ +WP+ +  +FGS  TGL LP+
Sbjct: 142  LHKEILDFCDFISPSAEEQSSRTA----AVKAVSNVIKHIWPQCKVEVFGSFRTGLFLPT 197

Query: 1270 SDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTA 1329
            SD+D+V                I + R    +  L   A+ L+ +   K   ++ +    
Sbjct: 198  SDIDVV----------------IFDSRVKTPQVGLYALAKALSQKGVAK--KIQVIAKAR 239

Query: 1330 IPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSS 1389
            +PI+  V                           +  + +  D ++ D    P+ +    
Sbjct: 240  VPIVKFV---------------------------ERKSEIAFD-ISFDMDGGPQAADFIK 271

Query: 1390 DNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYS 1449
            D +K                              +FPA   L ++LK FL  R L++ Y+
Sbjct: 272  DYVK------------------------------KFPALRHLCMILKVFLHQRELNEVYT 301

Query: 1450 GGLSSYCLMLLITRFLQ-------------HEHHLG 1472
            GG+ SY L+ ++   LQ              EH+LG
Sbjct: 302  GGIGSYALLTMLITHLQLIWGGKDILGYRKKEHNLG 337


>gi|198427134|ref|XP_002121817.1| PREDICTED: similar to PAP-associated domain-containing protein 5
            (Topoisomerase-related function protein 4-2) (TRF4-2)
            [Ciona intestinalis]
          Length = 391

 Score = 67.4 bits (163), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 41/104 (39%), Positives = 60/104 (57%), Gaps = 13/104 (12%)

Query: 1369 VHSDMVA------LDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELT 1422
            V  D+VA      LD +A P       +     T V++DISF   S  G+Q+ +L+K+  
Sbjct: 158  VSKDIVAEGSVKVLDRAAVPLIKFQHKE-----TLVKVDISFNIQS--GVQSVELIKDFM 210

Query: 1423 EQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            +++PA   L  VLKQFL  R L++ ++GGLSSY L+L+   FLQ
Sbjct: 211  KKYPALPKLIFVLKQFLLVRELNEVWTGGLSSYSLILMAISFLQ 254


>gi|118366560|ref|XP_001016496.1| Nucleotidyltransferase domain containing protein [Tetrahymena
            thermophila]
 gi|89298263|gb|EAR96251.1| Nucleotidyltransferase domain containing protein [Tetrahymena
            thermophila SB210]
          Length = 1873

 Score = 67.4 bits (163), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 66/284 (23%), Positives = 123/284 (43%), Gaps = 46/284 (16%)

Query: 1203 KASLSLMHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTN----IF 1258
            K +  L    ++ +I SF + +  E+   +P       R+   L+  + +  +     IF
Sbjct: 1459 KIAKRLFRRKINNDILSFTEMINLESYRHQPSRFLIFYRIQHLLKKQFEKVESQVDLRIF 1518

Query: 1259 GSNATGLSLPSSDVDLVV----CLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQ 1314
            GS ATGLSLP+SD+DL +    C P          + ++          L   A  L   
Sbjct: 1519 GSCATGLSLPTSDIDLGITGFECHP---------RSDLI--------YILNQLANILEQF 1561

Query: 1315 EWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMV 1374
            +WVKS  ++ +   +IPII + V+   D         +   +D            + +++
Sbjct: 1562 KWVKS--IERIFTASIPIIKMEVDPFVDF-------KECINQDELLLFFDPIEIQNLNLL 1612

Query: 1375 ALDDSASPKCSHTSSDNIKAATSVRLDISFKSPS-----HTGLQTTDLVKELTEQFPAST 1429
              +++ S K        I     + ++   +SP+     H G  +TD V      +    
Sbjct: 1613 VTEENQSLK-------GINLKVDITVECQMQSPNTEILNHMGFLSTDFVVRWMSYYVRLQ 1665

Query: 1430 PLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQHEHHLGR 1473
            P+ +V+K FL  R+L+ ++ GG+SSYCL+++I  F +   ++ +
Sbjct: 1666 PMTIVMKCFLHARNLNTNFKGGISSYCLLIMIIAFFRDNQNISQ 1709


>gi|355709229|gb|AES03522.1| PAP associated domain containing 5 [Mustela putorius furo]
          Length = 458

 Score = 67.4 bits (163), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 38/103 (36%), Positives = 55/103 (53%), Gaps = 7/103 (6%)

Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
            KH       +  LD +  P    T S      T V++DISF      G++  DL+K+ T+
Sbjct: 40   KHKVADEDSVKVLDKATVPIIKLTDS-----FTEVKVDISFNV--QNGVRAADLIKDFTK 92

Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            ++P    L LVLKQFL  R L++ ++GG+ SY L L+   FLQ
Sbjct: 93   KYPVLPYLVLVLKQFLLQRDLNEVFTGGIGSYSLFLMAVSFLQ 135


>gi|21739473|emb|CAD38778.1| hypothetical protein [Homo sapiens]
          Length = 410

 Score = 67.4 bits (163), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 38/103 (36%), Positives = 55/103 (53%), Gaps = 7/103 (6%)

Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
            KH       +  LD +  P    T S      T V++DISF      G++  DL+K+ T+
Sbjct: 38   KHKVADEDSVKVLDKATVPIIKLTDS-----FTEVKVDISFNV--QNGVRAADLIKDFTK 90

Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            ++P    L LVLKQFL  R L++ ++GG+ SY L L+   FLQ
Sbjct: 91   KYPVLPYLVLVLKQFLLQRDLNEVFTGGIGSYSLFLMAVSFLQ 133


>gi|62898381|dbj|BAD97130.1| PAP associated domain containing protein 5 variant [Homo sapiens]
          Length = 466

 Score = 67.4 bits (163), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 38/103 (36%), Positives = 55/103 (53%), Gaps = 7/103 (6%)

Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
            KH       +  LD +  P    T S      T V++DISF      G++  DL+K+ T+
Sbjct: 47   KHKVADEDSVKVLDKATVPIIKLTDS-----FTEVKVDISFNV--QNGVRAADLIKDFTK 99

Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            ++P    L LVLKQFL  R L++ ++GG+ SY L L+   FLQ
Sbjct: 100  KYPVLPYLVLVLKQFLLQRDLNEVFTGGIGSYSLFLMAVSFLQ 142


>gi|349603773|gb|AEP99516.1| PAP-associated domain-containing protein 5-like protein, partial
            [Equus caballus]
          Length = 459

 Score = 67.4 bits (163), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 38/103 (36%), Positives = 55/103 (53%), Gaps = 7/103 (6%)

Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
            KH       +  LD +  P    T S      T V++DISF      G++  DL+K+ T+
Sbjct: 40   KHKVADEDSVKVLDKATVPIIKLTDS-----FTEVKVDISFNV--QNGVRAADLIKDFTK 92

Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            ++P    L LVLKQFL  R L++ ++GG+ SY L L+   FLQ
Sbjct: 93   KYPVLPYLVLVLKQFLLQRDLNEVFTGGIGSYSLFLMAVSFLQ 135


>gi|5565689|gb|AAD45199.1|AF089897_1 topoisomerase-related function protein [Homo sapiens]
          Length = 379

 Score = 67.4 bits (163), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 36/95 (37%), Positives = 54/95 (56%), Gaps = 7/95 (7%)

Query: 1372 DMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPL 1431
            ++  LD +  P    T S      T V++DISF      G++  DL+K+ T+++P    L
Sbjct: 2    EVKVLDKATVPIIKLTDS-----FTEVKVDISFNV--QNGVRAADLIKDFTKKYPVLPYL 54

Query: 1432 ALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
             LVLKQFL  R L++ ++GG+ SY L L+   FLQ
Sbjct: 55   VLVLKQFLLQRDLNEVFTGGIGSYSLFLMAVSFLQ 89


>gi|242014489|ref|XP_002427922.1| PAP-associated domain-containing protein, putative [Pediculus humanus
            corporis]
 gi|212512406|gb|EEB15184.1| PAP-associated domain-containing protein, putative [Pediculus humanus
            corporis]
          Length = 531

 Score = 67.0 bits (162), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 32/71 (45%), Positives = 47/71 (66%), Gaps = 2/71 (2%)

Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
            + V++DISF   S  G+Q+  L+K   ++FP    L LVLKQFL  R L++ ++GG+SSY
Sbjct: 222  SDVKVDISFNMSS--GVQSAQLIKTFKKKFPVLAKLVLVLKQFLLQRDLNEVFTGGISSY 279

Query: 1456 CLMLLITRFLQ 1466
             L+L+   FLQ
Sbjct: 280  SLILMTISFLQ 290



 Score = 53.1 bits (126), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 41/150 (27%), Positives = 70/150 (46%), Gaps = 31/150 (20%)

Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
            LH+EI  F   ++      +  +   V ++  ++  LWP+++  +FGS  TGL LP+SD+
Sbjct: 115  LHDEIIEFYNYMSPRKCEHELRLK-VVHKIKSAIHELWPQAKVEVFGSFRTGLYLPTSDI 173

Query: 1273 DLVV--CLP--PVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENT 1328
            DLVV  C P  P+  LE                       + L   E  +  ++K ++  
Sbjct: 174  DLVVIGCWPTLPLWKLE-----------------------QKLLENEIAEQHTIKVLDKA 210

Query: 1329 AIPIIMLV---VEVPHDLIASAASSVQSPK 1355
            ++PII L     +V  D+  + +S VQS +
Sbjct: 211  SVPIIKLRDKESDVKVDISFNMSSGVQSAQ 240


>gi|242053947|ref|XP_002456119.1| hypothetical protein SORBIDRAFT_03g030810 [Sorghum bicolor]
 gi|241928094|gb|EES01239.1| hypothetical protein SORBIDRAFT_03g030810 [Sorghum bicolor]
          Length = 568

 Score = 67.0 bits (162), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 62/254 (24%), Positives = 100/254 (39%), Gaps = 77/254 (30%)

Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
            LH+EI  FC  ++  +T  +     AV+ V+  ++ +WP+ +  +FGS  TGL LP+SD+
Sbjct: 141  LHKEILDFCDFISP-STEEQSSRTAAVQDVSDVVKHIWPQCKVEVFGSFRTGLYLPTSDI 199

Query: 1273 DLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPI 1332
            D+VV                 E R    +  L   A+ L+ +   K   ++ +    +PI
Sbjct: 200  DVVV----------------FESRVKTPQVGLYALAKALSQKGVAK--KIQVIAKARVPI 241

Query: 1333 IMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNI 1392
            +  V                                     +A D S        ++D I
Sbjct: 242  VKFV--------------------------------ERKSGIAFDISFDMDGGPQAADFI 269

Query: 1393 KAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGL 1452
            K                      D VK+L    PA  PL ++LK FL  R L++ Y+GG+
Sbjct: 270  K----------------------DAVKKL----PALRPLCMILKVFLHQRELNEVYTGGI 303

Query: 1453 SSYCLMLLITRFLQ 1466
             SY L+ ++   LQ
Sbjct: 304  GSYALLTMLITHLQ 317


>gi|195145882|ref|XP_002013919.1| GL24402 [Drosophila persimilis]
 gi|194102862|gb|EDW24905.1| GL24402 [Drosophila persimilis]
          Length = 400

 Score = 66.6 bits (161), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 72/267 (26%), Positives = 108/267 (40%), Gaps = 92/267 (34%)

Query: 1211 SLLHEEIDSFCKQVAAENTARKPYINW-AVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
            SLLH EI+ F   +  ++TA + Y+   AV+ +   L  +WP +   +FGS  TGL+LP 
Sbjct: 26   SLLHLEIERFYNFI--QSTATEFYLRAEAVRCIEDMLVSIWPGASVEVFGSFRTGLNLPL 83

Query: 1270 SDVDLVV-------CLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSL 1322
            SD+DLVV         PP+  LE                   +  AR +A+ +     ++
Sbjct: 84   SDIDLVVENRRSYWYNPPLYELES------------------ELVARGVADPQ-----TV 120

Query: 1323 KTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASP 1382
              V+  A+P++     V                     + +K D                
Sbjct: 121  NVVDTAAVPVVKFTERV---------------------SQIKFD---------------- 143

Query: 1383 KCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADR 1442
              S      +KAA  ++                DL+ E    FP    L +VLKQFLA  
Sbjct: 144  -ISFNVGSGVKAAELIK----------------DLIHE----FPDLPKLVMVLKQFLAMH 182

Query: 1443 SLDQSY-SGGLSSYCLMLLITRFLQHE 1468
             L+  Y SGG+SSY + L+   FLQH+
Sbjct: 183  GLNDVYRSGGISSYGITLMCIGFLQHQ 209


>gi|195131647|ref|XP_002010257.1| GI15833 [Drosophila mojavensis]
 gi|193908707|gb|EDW07574.1| GI15833 [Drosophila mojavensis]
          Length = 807

 Score = 66.6 bits (161), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 36/92 (39%), Positives = 54/92 (58%), Gaps = 7/92 (7%)

Query: 1375 ALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALV 1434
             LD ++ P    T  +     T V++DISF   S  G+Q+ +L+K+    +P    L LV
Sbjct: 391  VLDKASVPIIKLTDRE-----TQVKVDISFNMQS--GVQSAELIKQFKRDYPVLGKLVLV 443

Query: 1435 LKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            LKQFL  R L++ ++GG+SSY L+L+   FLQ
Sbjct: 444  LKQFLLLRDLNEVFTGGISSYSLILMCISFLQ 475



 Score = 62.0 bits (149), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 54/171 (31%), Positives = 77/171 (45%), Gaps = 39/171 (22%)

Query: 1197 QNCPTRKASLSLMHSL--LHEEIDSFCKQV---AAENTARKPYINWAVKRVTRSLQVLWP 1251
            + CP R         +  LHEEID F + V   A E+  R    N  VKR+   +  +WP
Sbjct: 282  KGCPWRVRDYQYGKGIIGLHEEIDHFYQYVLPTACEHAIR----NEVVKRIESVVHSIWP 337

Query: 1252 RSRTNIFGSNATGLSLPSSDVDLVVC----LPPVRNLEPIKEAGILEGRNGIKETCLQHA 1307
            ++   IFGS  TGL LP+SD+DLVV       P+R LE    A       GI E C    
Sbjct: 338  QAVVEIFGSFRTGLFLPTSDIDLVVLGLWEKLPLRTLEFELVA------RGIAEAC---- 387

Query: 1308 ARYLANQEWVKSDSLKTVENTAIPIIMLV---VEVPHDLIASAASSVQSPK 1355
                         +++ ++  ++PII L     +V  D+  +  S VQS +
Sbjct: 388  -------------TVRVLDKASVPIIKLTDRETQVKVDISFNMQSGVQSAE 425


>gi|195447518|ref|XP_002071250.1| GK25231 [Drosophila willistoni]
 gi|194167335|gb|EDW82236.1| GK25231 [Drosophila willistoni]
          Length = 1048

 Score = 66.6 bits (161), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 36/92 (39%), Positives = 54/92 (58%), Gaps = 7/92 (7%)

Query: 1375 ALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALV 1434
             LD ++ P    T  +     T V++DISF   S  G+Q+ +L+K+    +P    L LV
Sbjct: 377  VLDKASVPIIKLTDRE-----TQVKVDISFNMQS--GVQSAELIKKFKRDYPVLGKLVLV 429

Query: 1435 LKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            LKQFL  R L++ ++GG+SSY L+L+   FLQ
Sbjct: 430  LKQFLLLRDLNEVFTGGISSYSLILMCISFLQ 461



 Score = 56.6 bits (135), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 50/151 (33%), Positives = 71/151 (47%), Gaps = 37/151 (24%)

Query: 1213 LHEEIDSFCKQVA---AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
            LHEEI+ F + V     E+  R    N  VKR+   +  +WP++   IFGS  TGL LP+
Sbjct: 286  LHEEIEHFYQYVLPTPCEHAIR----NEVVKRIEGVVHSIWPQAVVEIFGSFRTGLFLPT 341

Query: 1270 SDVDLVVC----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTV 1325
            SD+DLVV       P+R LE       L  R GI E C                 +++ +
Sbjct: 342  SDIDLVVLGLWEKLPLRTLE-----FELVSR-GIAEAC-----------------TVRVL 378

Query: 1326 ENTAIPIIMLV---VEVPHDLIASAASSVQS 1353
            +  ++PII L     +V  D+  +  S VQS
Sbjct: 379  DKASVPIIKLTDRETQVKVDISFNMQSGVQS 409


>gi|281211597|gb|EFA85759.1| PAP/25A-associated domain-containing protein [Polysphondylium
            pallidum PN500]
          Length = 918

 Score = 66.6 bits (161), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 59/263 (22%), Positives = 106/263 (40%), Gaps = 87/263 (33%)

Query: 1213 LHEEIDSFCKQVA---AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
            L +EI+ F + +     EN  R+  +N     V ++    WP+++  +FGS +T L +PS
Sbjct: 460  LEKEIELFYQWLVPTQYENRMRQKIVNDIEAIVKQN----WPKAKVIVFGSFSTDLCIPS 515

Query: 1270 SDVDLVVCLPPVRNLEPIKEAGILEGR------NGIKETCLQHAARYLANQEWVKSDSLK 1323
            SD+D+ +          I E     GR      N +  T  +H  R   N        ++
Sbjct: 516  SDIDIQIS--------GITEVASGNGRTYSNPINDLYNTLSKHHQREFTN--------IR 559

Query: 1324 TVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPK 1383
             +    +PII                                                 K
Sbjct: 560  LIAAAKVPII-------------------------------------------------K 570

Query: 1384 CSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRS 1443
             +H S       T   +DI F +P+  G++ T++VK+   ++ +   L L+LK F+   +
Sbjct: 571  MAHKS-------TWYNVDICFDTPN--GIENTEIVKQFLRKYKSMKILLLILKYFMFQNN 621

Query: 1444 LDQSYSGGLSSYCLMLLITRFLQ 1466
            ++++YSGG+ SY L L++  ++Q
Sbjct: 622  MNETYSGGIGSYALALMVVSYIQ 644


>gi|194909177|ref|XP_001981903.1| GG11338 [Drosophila erecta]
 gi|190656541|gb|EDV53773.1| GG11338 [Drosophila erecta]
          Length = 405

 Score = 66.6 bits (161), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 57/206 (27%), Positives = 98/206 (47%), Gaps = 27/206 (13%)

Query: 1285 EPIKEAGILEGRNGIKETCLQHAA-----RYLAN---QEWVKSDSLKTVENTAIPIIMLV 1336
            +P + +G+L G NGI   CL H        Y+ +   +  +++D+++ +E+  + I    
Sbjct: 9    KPWQLSGLLYG-NGIPALCLLHQEIEQFYSYMQSTPTEFCLRADAVRRIEDVVLSIWPGA 67

Query: 1337 VEVPHDLIASAASSVQSPKEDAAHTT---------LKHD--NHVHSDMVALDDSAS--PK 1383
                 D+  S  + +  P  D              L HD  N + S  V   DS S   K
Sbjct: 68   CV---DVFGSFRTGLNLPNSDIDLVVYNGYYWSPRLLHDLENELVSRGVTDPDSVSVLDK 124

Query: 1384 CSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRS 1443
             S          + +R D++F + + +G+Q  +L+K+   +FP    L +VLKQFL+ + 
Sbjct: 125  ASVPVVKFTDRVSRIRFDVTFNAAA-SGVQAAELIKDFIREFPELPKLVMVLKQFLSLQG 183

Query: 1444 LDQSY-SGGLSSYCLMLLITRFLQHE 1468
             ++ Y SGG+SSY L L++  FLQ +
Sbjct: 184  FNEVYSSGGVSSYALTLMVISFLQQQ 209



 Score = 48.9 bits (115), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 26/65 (40%), Positives = 41/65 (63%), Gaps = 1/65 (1%)

Query: 1212 LLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSD 1271
            LLH+EI+ F   + +  T      + AV+R+   +  +WP +  ++FGS  TGL+LP+SD
Sbjct: 27   LLHQEIEQFYSYMQSTPTEFCLRAD-AVRRIEDVVLSIWPGACVDVFGSFRTGLNLPNSD 85

Query: 1272 VDLVV 1276
            +DLVV
Sbjct: 86   IDLVV 90


>gi|324522375|gb|ADY48050.1| PAP-associated domain-containing protein 5 [Ascaris suum]
          Length = 244

 Score = 66.6 bits (161), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 34/92 (36%), Positives = 52/92 (56%), Gaps = 7/92 (7%)

Query: 1375 ALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALV 1434
             LD +  P       D     T + LDISF +    G++    ++++  ++P   PL L+
Sbjct: 155  VLDKAFVPIVKMVDKD-----TKIFLDISFNTVQ--GVRAAKYIEQMKSRYPVLEPLVLI 207

Query: 1435 LKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            LKQFL  R L+Q ++GGLSSY L+L++  FLQ
Sbjct: 208  LKQFLMQRQLNQVFTGGLSSYGLILMLISFLQ 239



 Score = 46.6 bits (109), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 24/64 (37%), Positives = 39/64 (60%), Gaps = 6/64 (9%)

Query: 1240 KRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPV------RNLEPIKEAGIL 1293
            +RV   ++ +WP ++  +FGS  TGL LP+SD+D+VV + P+      +  E +K +GI 
Sbjct: 91   ERVRGVIERVWPSAKVAVFGSLFTGLFLPTSDIDVVVEVQPLDEPPLWKTAEALKASGIA 150

Query: 1294 EGRN 1297
            E  N
Sbjct: 151  ERIN 154


>gi|161077646|ref|NP_001096913.1| Trf4-1, isoform F [Drosophila melanogaster]
 gi|161077648|ref|NP_001096914.1| Trf4-1, isoform G [Drosophila melanogaster]
 gi|158031752|gb|ABW09362.1| Trf4-1, isoform F [Drosophila melanogaster]
 gi|158031753|gb|ABW09363.1| Trf4-1, isoform G [Drosophila melanogaster]
          Length = 802

 Score = 66.6 bits (161), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 36/92 (39%), Positives = 54/92 (58%), Gaps = 7/92 (7%)

Query: 1375 ALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALV 1434
             LD ++ P    T  +     T V++DISF   S  G+Q+ +L+K+    +P    L LV
Sbjct: 163  VLDKASVPIIKLTDRE-----TQVKVDISFNMQS--GVQSAELIKKFKRDYPVLEKLVLV 215

Query: 1435 LKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            LKQFL  R L++ ++GG+SSY L+L+   FLQ
Sbjct: 216  LKQFLLLRDLNEVFTGGISSYSLILMCISFLQ 247



 Score = 57.4 bits (137), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 50/151 (33%), Positives = 71/151 (47%), Gaps = 37/151 (24%)

Query: 1213 LHEEIDSFCKQVA---AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
            LHEEI+ F + V     E+  R    N  VKR+   +  +WP++   IFGS  TGL LP+
Sbjct: 72   LHEEIEHFYQYVLPTPCEHAIR----NEVVKRIEAVVHSIWPQAVVEIFGSFRTGLFLPT 127

Query: 1270 SDVDLVVC----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTV 1325
            SD+DLVV       P+R LE       L  R GI E C                 +++ +
Sbjct: 128  SDIDLVVLGLWEKLPLRTLE-----FELVSR-GIAEAC-----------------TVRVL 164

Query: 1326 ENTAIPIIMLV---VEVPHDLIASAASSVQS 1353
            +  ++PII L     +V  D+  +  S VQS
Sbjct: 165  DKASVPIIKLTDRETQVKVDISFNMQSGVQS 195


>gi|323445977|gb|EGB02334.1| hypothetical protein AURANDRAFT_68978 [Aureococcus anophagefferens]
          Length = 587

 Score = 66.6 bits (161), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 29/76 (38%), Positives = 49/76 (64%), Gaps = 2/76 (2%)

Query: 1395 ATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSS 1454
            AT + +D+SF     +GL+T  LV+ L ++ P   PL +VLK FLA R L+++++GG+ S
Sbjct: 496  ATGIPVDVSFDV--ESGLRTGRLVRGLMDRMPPLRPLVIVLKFFLAQRGLNETFTGGVGS 553

Query: 1455 YCLMLLITRFLQHEHH 1470
            + + +++  FLQ  H 
Sbjct: 554  FMMQMMVVSFLQMRHR 569



 Score = 58.2 bits (139), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 31/66 (46%), Positives = 45/66 (68%), Gaps = 5/66 (7%)

Query: 1213 LHEEIDSFCKQVA--AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSS 1270
            LH+EI +FC  VA  A  T R+   + AV  +  ++  +WP++  ++FGS+ TGLSLPSS
Sbjct: 392  LHDEILAFCALVAPSAGETRRR---DAAVANIEAAVAGVWPKATVHVFGSSLTGLSLPSS 448

Query: 1271 DVDLVV 1276
            DVD+VV
Sbjct: 449  DVDVVV 454


>gi|45554544|ref|NP_996381.1| Trf4-1, isoform E [Drosophila melanogaster]
 gi|161077643|ref|NP_996382.2| Trf4-1, isoform C [Drosophila melanogaster]
 gi|442615523|ref|NP_996383.2| Trf4-1, isoform H [Drosophila melanogaster]
 gi|442615525|ref|NP_572490.2| Trf4-1, isoform I [Drosophila melanogaster]
 gi|442615527|ref|NP_001259340.1| Trf4-1, isoform J [Drosophila melanogaster]
 gi|33636605|gb|AAQ23600.1| RE04457p [Drosophila melanogaster]
 gi|45446864|gb|AAS65290.1| Trf4-1, isoform E [Drosophila melanogaster]
 gi|158031751|gb|AAS65289.2| Trf4-1, isoform C [Drosophila melanogaster]
 gi|440216540|gb|AAS65288.2| Trf4-1, isoform H [Drosophila melanogaster]
 gi|440216541|gb|AAF46390.3| Trf4-1, isoform I [Drosophila melanogaster]
 gi|440216542|gb|AGB95183.1| Trf4-1, isoform J [Drosophila melanogaster]
          Length = 1001

 Score = 66.2 bits (160), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 36/92 (39%), Positives = 54/92 (58%), Gaps = 7/92 (7%)

Query: 1375 ALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALV 1434
             LD ++ P    T  +     T V++DISF   S  G+Q+ +L+K+    +P    L LV
Sbjct: 362  VLDKASVPIIKLTDRE-----TQVKVDISFNMQS--GVQSAELIKKFKRDYPVLEKLVLV 414

Query: 1435 LKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            LKQFL  R L++ ++GG+SSY L+L+   FLQ
Sbjct: 415  LKQFLLLRDLNEVFTGGISSYSLILMCISFLQ 446



 Score = 57.0 bits (136), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 50/151 (33%), Positives = 71/151 (47%), Gaps = 37/151 (24%)

Query: 1213 LHEEIDSFCKQVA---AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
            LHEEI+ F + V     E+  R    N  VKR+   +  +WP++   IFGS  TGL LP+
Sbjct: 271  LHEEIEHFYQYVLPTPCEHAIR----NEVVKRIEAVVHSIWPQAVVEIFGSFRTGLFLPT 326

Query: 1270 SDVDLVVC----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTV 1325
            SD+DLVV       P+R LE       L  R GI E C                 +++ +
Sbjct: 327  SDIDLVVLGLWEKLPLRTLE-----FELVSR-GIAEAC-----------------TVRVL 363

Query: 1326 ENTAIPIIMLV---VEVPHDLIASAASSVQS 1353
            +  ++PII L     +V  D+  +  S VQS
Sbjct: 364  DKASVPIIKLTDRETQVKVDISFNMQSGVQS 394


>gi|198469802|ref|XP_001355128.2| GA10875 [Drosophila pseudoobscura pseudoobscura]
 gi|198147042|gb|EAL32185.2| GA10875 [Drosophila pseudoobscura pseudoobscura]
          Length = 1202

 Score = 66.2 bits (160), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 36/92 (39%), Positives = 54/92 (58%), Gaps = 7/92 (7%)

Query: 1375 ALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALV 1434
             LD ++ P    T  +     T V++DISF   S  G+Q+ +L+K+    +P    L LV
Sbjct: 402  VLDKASVPIIKLTDRE-----TQVKVDISFNMQS--GVQSAELIKKFKRDYPVLGKLVLV 454

Query: 1435 LKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            LKQFL  R L++ ++GG+SSY L+L+   FLQ
Sbjct: 455  LKQFLLLRDLNEVFTGGISSYSLILMCISFLQ 486



 Score = 58.5 bits (140), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 50/152 (32%), Positives = 71/152 (46%), Gaps = 39/152 (25%)

Query: 1213 LHEEIDSFCKQVA---AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
            LHEEI+ F + V     E+  R    N  VKR+   +  +WP++   IFGS  TGL LP+
Sbjct: 311  LHEEIEHFYQYVLPTPCEHAIR----NEVVKRIENVVHTIWPQAVVEIFGSFRTGLFLPT 366

Query: 1270 SDVDLVVC----LPPVRNLEPIKEAGILEGRN-GIKETCLQHAARYLANQEWVKSDSLKT 1324
            SD+DLVV       P+R LE        E  N GI E C                 +++ 
Sbjct: 367  SDIDLVVLGLWEKLPLRTLE-------YELVNRGIAEAC-----------------TVRV 402

Query: 1325 VENTAIPIIMLV---VEVPHDLIASAASSVQS 1353
            ++  ++PII L     +V  D+  +  S VQS
Sbjct: 403  LDKASVPIIKLTDRETQVKVDISFNMQSGVQS 434


>gi|194891045|ref|XP_001977427.1| GG18260 [Drosophila erecta]
 gi|190649076|gb|EDV46354.1| GG18260 [Drosophila erecta]
          Length = 1008

 Score = 66.2 bits (160), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 36/92 (39%), Positives = 54/92 (58%), Gaps = 7/92 (7%)

Query: 1375 ALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALV 1434
             LD ++ P    T  +     T V++DISF   S  G+Q+ +L+K+    +P    L LV
Sbjct: 350  VLDKASVPIIKLTDRE-----TQVKVDISFNMQS--GVQSAELIKKFKRDYPVLGKLVLV 402

Query: 1435 LKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            LKQFL  R L++ ++GG+SSY L+L+   FLQ
Sbjct: 403  LKQFLLLRDLNEVFTGGISSYSLILMCISFLQ 434



 Score = 58.9 bits (141), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 51/151 (33%), Positives = 72/151 (47%), Gaps = 37/151 (24%)

Query: 1213 LHEEIDSFCKQV---AAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
            LHEEI+ F + V   A E+  R    N  VKR+   +  +WP++   IFGS  TGL LP+
Sbjct: 259  LHEEIEHFYQYVLPTACEHAIR----NEVVKRIEAVVHSIWPQAVVEIFGSFRTGLFLPT 314

Query: 1270 SDVDLVVC----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTV 1325
            SD+DLVV       P+R LE       L  R GI E C                 +++ +
Sbjct: 315  SDIDLVVLGLWEKLPLRTLE-----FELVSR-GIAEAC-----------------TVRVL 351

Query: 1326 ENTAIPIIMLV---VEVPHDLIASAASSVQS 1353
            +  ++PII L     +V  D+  +  S VQS
Sbjct: 352  DKASVPIIKLTDRETQVKVDISFNMQSGVQS 382


>gi|195163618|ref|XP_002022646.1| GL14678 [Drosophila persimilis]
 gi|194104669|gb|EDW26712.1| GL14678 [Drosophila persimilis]
          Length = 1197

 Score = 66.2 bits (160), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 36/92 (39%), Positives = 54/92 (58%), Gaps = 7/92 (7%)

Query: 1375 ALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALV 1434
             LD ++ P    T  +     T V++DISF   S  G+Q+ +L+K+    +P    L LV
Sbjct: 403  VLDKASVPIIKLTDRE-----TQVKVDISFNMQS--GVQSAELIKKFKRDYPVLGKLVLV 455

Query: 1435 LKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            LKQFL  R L++ ++GG+SSY L+L+   FLQ
Sbjct: 456  LKQFLLLRDLNEVFTGGISSYSLILMCISFLQ 487



 Score = 58.5 bits (140), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 50/152 (32%), Positives = 71/152 (46%), Gaps = 39/152 (25%)

Query: 1213 LHEEIDSFCKQVA---AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
            LHEEI+ F + V     E+  R    N  VKR+   +  +WP++   IFGS  TGL LP+
Sbjct: 312  LHEEIEHFYQYVLPTPCEHAIR----NEVVKRIENVVHTIWPQAVVEIFGSFRTGLFLPT 367

Query: 1270 SDVDLVVC----LPPVRNLEPIKEAGILEGRN-GIKETCLQHAARYLANQEWVKSDSLKT 1324
            SD+DLVV       P+R LE        E  N GI E C                 +++ 
Sbjct: 368  SDIDLVVLGLWEKLPLRTLE-------YELVNRGIAEAC-----------------TVRV 403

Query: 1325 VENTAIPIIMLV---VEVPHDLIASAASSVQS 1353
            ++  ++PII L     +V  D+  +  S VQS
Sbjct: 404  LDKASVPIIKLTDRETQVKVDISFNMQSGVQS 435


>gi|391342828|ref|XP_003745717.1| PREDICTED: PAP-associated domain-containing protein 5-like, partial
            [Metaseiulus occidentalis]
          Length = 512

 Score = 66.2 bits (160), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 35/96 (36%), Positives = 58/96 (60%), Gaps = 7/96 (7%)

Query: 1371 SDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTP 1430
            S +  LD ++ P    T     + +T+V++DISF   +  G+++  L+K+  + FP    
Sbjct: 181  STIKVLDRASVPIVKFT-----EQSTNVKVDISFNQKN--GVKSAKLIKDFCKTFPPLPK 233

Query: 1431 LALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            L  VLKQ+L  R L++ ++GG+SSY L+LL+  FLQ
Sbjct: 234  LVFVLKQYLLQRDLNEVFTGGISSYSLILLVVSFLQ 269



 Score = 46.2 bits (108), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 19/39 (48%), Positives = 28/39 (71%)

Query: 1238 AVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVV 1276
             V RV   ++ LWP+++  +FGS  TGL LP+SD+DLV+
Sbjct: 118  VVNRVKEVVRQLWPQAQCEVFGSFCTGLYLPTSDIDLVI 156


>gi|195480074|ref|XP_002101128.1| GE15792 [Drosophila yakuba]
 gi|194188652|gb|EDX02236.1| GE15792 [Drosophila yakuba]
          Length = 1014

 Score = 66.2 bits (160), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 36/92 (39%), Positives = 54/92 (58%), Gaps = 7/92 (7%)

Query: 1375 ALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALV 1434
             LD ++ P    T  +     T V++DISF   S  G+Q+ +L+K+    +P    L LV
Sbjct: 351  VLDKASVPIIKLTDRE-----TQVKVDISFNMQS--GVQSAELIKKFKRDYPVLGKLVLV 403

Query: 1435 LKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            LKQFL  R L++ ++GG+SSY L+L+   FLQ
Sbjct: 404  LKQFLLLRDLNEVFTGGISSYSLILMCISFLQ 435



 Score = 57.0 bits (136), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 50/151 (33%), Positives = 71/151 (47%), Gaps = 37/151 (24%)

Query: 1213 LHEEIDSFCKQV---AAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
            LHEEI+ F + V     E+  R    N  VKR+   +  +WP++   IFGS  TGL LP+
Sbjct: 260  LHEEIEHFYQYVLPTPCEHAIR----NEVVKRIEAVVHSIWPQAVVEIFGSFRTGLFLPT 315

Query: 1270 SDVDLVVC----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTV 1325
            SD+DLVV       P+R LE       L  R GI E C                 +++ +
Sbjct: 316  SDIDLVVLGLWEKLPLRTLE-----FELVSR-GIAEAC-----------------TVRVL 352

Query: 1326 ENTAIPIIMLV---VEVPHDLIASAASSVQS 1353
            +  ++PII L     +V  D+  +  S VQS
Sbjct: 353  DKASVPIIKLTDRETQVKVDISFNMQSGVQS 383


>gi|194769888|ref|XP_001967033.1| GF21834 [Drosophila ananassae]
 gi|190622828|gb|EDV38352.1| GF21834 [Drosophila ananassae]
          Length = 1092

 Score = 66.2 bits (160), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 36/92 (39%), Positives = 54/92 (58%), Gaps = 7/92 (7%)

Query: 1375 ALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALV 1434
             LD ++ P    T  +     T V++DISF   S  G+Q+ +L+K+    +P    L LV
Sbjct: 445  VLDKASVPIIKLTDRE-----THVKVDISFNMQS--GVQSAELIKKFKRDYPVLGKLVLV 497

Query: 1435 LKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            LKQFL  R L++ ++GG+SSY L+L+   FLQ
Sbjct: 498  LKQFLLLRDLNEVFTGGISSYSLILMCISFLQ 529



 Score = 56.6 bits (135), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 50/151 (33%), Positives = 70/151 (46%), Gaps = 37/151 (24%)

Query: 1213 LHEEIDSFCKQVA---AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
            LHEEI+ F + V     E+  R    N  VKR+   +  +WP++   IFGS  TGL LP+
Sbjct: 354  LHEEIEHFYQYVLPTPCEHAIR----NEVVKRIEAVVHSIWPQAVVEIFGSFRTGLFLPT 409

Query: 1270 SDVDLVVC----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTV 1325
            SD+DLVV       P+R LE       L  R GI E C                 +++ +
Sbjct: 410  SDIDLVVLGLWEKLPLRTLE-----FELVSR-GIAEAC-----------------TVRVL 446

Query: 1326 ENTAIPIIMLVVEVPH---DLIASAASSVQS 1353
            +  ++PII L     H   D+  +  S VQS
Sbjct: 447  DKASVPIIKLTDRETHVKVDISFNMQSGVQS 477


>gi|195355122|ref|XP_002044042.1| GM21691 [Drosophila sechellia]
 gi|194129295|gb|EDW51338.1| GM21691 [Drosophila sechellia]
          Length = 1000

 Score = 66.2 bits (160), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 36/92 (39%), Positives = 54/92 (58%), Gaps = 7/92 (7%)

Query: 1375 ALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALV 1434
             LD ++ P    T  +     T V++DISF   S  G+Q+ +L+K+    +P    L LV
Sbjct: 363  VLDKASVPIIKLTDRE-----TQVKVDISFNMQS--GVQSAELIKKFKRDYPVLEKLVLV 415

Query: 1435 LKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            LKQFL  R L++ ++GG+SSY L+L+   FLQ
Sbjct: 416  LKQFLLLRDLNEVFTGGISSYSLILMCISFLQ 447



 Score = 57.0 bits (136), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 50/151 (33%), Positives = 71/151 (47%), Gaps = 37/151 (24%)

Query: 1213 LHEEIDSFCKQV---AAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
            LHEEI+ F + V     E+  R    N  VKR+   +  +WP++   IFGS  TGL LP+
Sbjct: 272  LHEEIEHFYQYVLPTPCEHAIR----NEVVKRIEAVVHSIWPQAVVEIFGSFRTGLFLPT 327

Query: 1270 SDVDLVVC----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTV 1325
            SD+DLVV       P+R LE       L  R GI E C                 +++ +
Sbjct: 328  SDIDLVVLGLWEKLPLRTLE-----FELVSR-GIAEAC-----------------TVRVL 364

Query: 1326 ENTAIPIIMLV---VEVPHDLIASAASSVQS 1353
            +  ++PII L     +V  D+  +  S VQS
Sbjct: 365  DKASVPIIKLTDRETQVKVDISFNMQSGVQS 395


>gi|212645230|ref|NP_492446.3| Protein GLD-4 [Caenorhabditis elegans]
 gi|403399397|sp|G5EFL0.1|GLD4_CAEEL RecName: Full=Poly(A) RNA polymerase gld-4; AltName: Full=Defective
            in germ line development protein 4; AltName:
            Full=Germline development defective-4
 gi|194686198|emb|CAB02138.3| Protein GLD-4 [Caenorhabditis elegans]
 gi|226972859|gb|ACO95123.1| germline defective-4 [Caenorhabditis elegans]
          Length = 845

 Score = 66.2 bits (160), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 31/71 (43%), Positives = 48/71 (67%), Gaps = 2/71 (2%)

Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
            T + +DISF +    G++    + ++ E+FP   PL L+LKQFL  R+L+Q+++GGLSSY
Sbjct: 191  TRLSIDISFNTVQ--GVRAASYIAKVKEEFPLIEPLVLLLKQFLHYRNLNQTFTGGLSSY 248

Query: 1456 CLMLLITRFLQ 1466
             L+LL+  F Q
Sbjct: 249  GLVLLLVNFFQ 259


>gi|195504602|ref|XP_002099149.1| GE23533 [Drosophila yakuba]
 gi|194185250|gb|EDW98861.1| GE23533 [Drosophila yakuba]
          Length = 405

 Score = 66.2 bits (160), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 33/97 (34%), Positives = 58/97 (59%), Gaps = 7/97 (7%)

Query: 1373 MVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLA 1432
            ++ LD ++ P    T     +  + +R D++F + + +G+Q  +L+K+   QFP    L 
Sbjct: 119  VIVLDKASVPVVKFT-----ERISRIRFDVTFNAAA-SGVQAAELIKDFIRQFPELPKLV 172

Query: 1433 LVLKQFLADRSLDQSY-SGGLSSYCLMLLITRFLQHE 1468
            +VLKQFL+ +  ++ Y SGG+SSY L L++  FLQ +
Sbjct: 173  MVLKQFLSLQGFNEVYSSGGISSYALTLMVISFLQQQ 209



 Score = 49.3 bits (116), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 27/68 (39%), Positives = 40/68 (58%), Gaps = 7/68 (10%)

Query: 1212 LLHEEIDSFCKQVAA---ENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLP 1268
            LLH+EI+ F   + +   E   R      AV+R+   +  +WP +  ++FGS  TGL+LP
Sbjct: 27   LLHQEIEHFYSYIQSTPTEFCLRAE----AVRRIEDVVLTIWPGACVDVFGSFRTGLNLP 82

Query: 1269 SSDVDLVV 1276
             SD+DLVV
Sbjct: 83   CSDIDLVV 90


>gi|195393956|ref|XP_002055618.1| GJ18687 [Drosophila virilis]
 gi|194150128|gb|EDW65819.1| GJ18687 [Drosophila virilis]
          Length = 1111

 Score = 65.9 bits (159), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 32/71 (45%), Positives = 47/71 (66%), Gaps = 2/71 (2%)

Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
            T V++DISF   S  G+Q+ +L+K+    +P    L LVLKQFL  R L++ ++GG+SSY
Sbjct: 376  TQVKVDISFNMQS--GVQSAELIKQFKRDYPVLGKLVLVLKQFLLLRDLNEVFTGGISSY 433

Query: 1456 CLMLLITRFLQ 1466
             L+L+   FLQ
Sbjct: 434  SLILMCISFLQ 444



 Score = 57.8 bits (138), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 51/151 (33%), Positives = 71/151 (47%), Gaps = 37/151 (24%)

Query: 1213 LHEEIDSFCKQVA---AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
            LHEEID F + V     E+  R    N  VKR+   +  +WP++   IFGS  TGL LP+
Sbjct: 269  LHEEIDHFYQYVLPTPCEHAIR----NEVVKRIEAVVHSIWPQAVVEIFGSFRTGLFLPT 324

Query: 1270 SDVDLVVC----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTV 1325
            SD+DLVV       P+R LE       L  RN I E C                 +++ +
Sbjct: 325  SDIDLVVLGLWEKLPLRTLE-----FELVSRN-IAEAC-----------------TVRVL 361

Query: 1326 ENTAIPIIMLV---VEVPHDLIASAASSVQS 1353
            +  ++PII L     +V  D+  +  S VQS
Sbjct: 362  DKASVPIIKLTDRETQVKVDISFNMQSGVQS 392


>gi|449674689|ref|XP_002159462.2| PREDICTED: poly(A) RNA polymerase gld-4-like [Hydra magnipapillata]
          Length = 710

 Score = 65.9 bits (159), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 32/71 (45%), Positives = 46/71 (64%), Gaps = 2/71 (2%)

Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
            T V +DISF +   T +Q+ ++VK   ++F     L L LK FL+ R L++ ++GG+SSY
Sbjct: 227  TRVHVDISFNT--DTAVQSAEMVKSYIKEFKVLPHLYLTLKYFLSQRELNEVFTGGISSY 284

Query: 1456 CLMLLITRFLQ 1466
            CLMLL   FLQ
Sbjct: 285  CLMLLTVSFLQ 295



 Score = 46.6 bits (109), Expect = 0.096,   Method: Compositional matrix adjust.
 Identities = 25/67 (37%), Positives = 38/67 (56%), Gaps = 7/67 (10%)

Query: 1213 LHEEIDSFCKQVA---AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
            LH+EI  F   +A    E   R    N  ++R+T  +  +WP ++  +FGS  T + LP+
Sbjct: 120  LHQEIIDFYNYMAPMPEEEMMR----NDVIQRITDLVVSIWPSAKVEVFGSFRTNIYLPT 175

Query: 1270 SDVDLVV 1276
            SD+DLVV
Sbjct: 176  SDIDLVV 182


>gi|341895116|gb|EGT51051.1| hypothetical protein CAEBREN_16945 [Caenorhabditis brenneri]
          Length = 901

 Score = 65.9 bits (159), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 31/71 (43%), Positives = 48/71 (67%), Gaps = 2/71 (2%)

Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
            T + +DISF +    G++    + ++ E+FP   PL L+LKQFL  R+L+Q+++GGLSSY
Sbjct: 199  TRLSIDISFNTVQ--GVRAASYIDKVKEEFPLIEPLVLLLKQFLHYRNLNQTFTGGLSSY 256

Query: 1456 CLMLLITRFLQ 1466
             L+LL+  F Q
Sbjct: 257  GLVLLLVNFFQ 267


>gi|403370926|gb|EJY85334.1| hypothetical protein OXYTRI_16805 [Oxytricha trifallax]
          Length = 1384

 Score = 65.9 bits (159), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 60/211 (28%), Positives = 98/211 (46%), Gaps = 25/211 (11%)

Query: 1257 IFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEW 1316
            ++GS A+ LS+ +SDVDL V      +    +E  + E R  I++  L            
Sbjct: 1072 LYGSMASNLSIDTSDVDLAVV---GLDFSGSREKQVNEMRKLIEQINLIMK--------- 1119

Query: 1317 VKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVAL 1376
             KS  LK +E   +P+I L V    DL+  A    +S K +    + K  N +  +M  L
Sbjct: 1120 -KSTKLKFIETATVPVIKLEV----DLVQIAKGHERSEKNEFGSKSAKK-NIIDDNMRYL 1173

Query: 1377 DDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLK 1436
                      T  D IK      L I   S  + G++  + +KEL +  P    + LVLK
Sbjct: 1174 G------IDITYEDTIKQHY-YGLHIENLSRINLGVKCINYIKELCKDLPGLQTIVLVLK 1226

Query: 1437 QFLADRSLDQSYSGGLSSYCLMLLITRFLQH 1467
            + L    L+Q+++GGL+SY ++L+ T +++ 
Sbjct: 1227 KLLQVNDLNQTFTGGLNSYSIVLMTTAYIKQ 1257


>gi|341883718|gb|EGT39653.1| hypothetical protein CAEBREN_22894 [Caenorhabditis brenneri]
          Length = 901

 Score = 65.9 bits (159), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 31/71 (43%), Positives = 48/71 (67%), Gaps = 2/71 (2%)

Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
            T + +DISF +    G++    + ++ E+FP   PL L+LKQFL  R+L+Q+++GGLSSY
Sbjct: 199  TRLSIDISFNTVQ--GVRAASYIDKVKEEFPLIEPLVLLLKQFLHYRNLNQTFTGGLSSY 256

Query: 1456 CLMLLITRFLQ 1466
             L+LL+  F Q
Sbjct: 257  GLVLLLVNFFQ 267


>gi|268566431|ref|XP_002639720.1| Hypothetical protein CBG12446 [Caenorhabditis briggsae]
          Length = 897

 Score = 65.9 bits (159), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 31/71 (43%), Positives = 48/71 (67%), Gaps = 2/71 (2%)

Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
            T + +DISF +    G++    + ++ E+FP   PL L+LKQFL  R+L+Q+++GGLSSY
Sbjct: 193  TRLSIDISFNTVQ--GVRAASYIAKVKEEFPLIEPLVLLLKQFLHYRNLNQTFTGGLSSY 250

Query: 1456 CLMLLITRFLQ 1466
             L+LL+  F Q
Sbjct: 251  GLVLLLVNFFQ 261


>gi|256078812|ref|XP_002575688.1| hypothetical protein [Schistosoma mansoni]
 gi|360044186|emb|CCD81733.1| hypothetical protein Smp_145600 [Schistosoma mansoni]
          Length = 672

 Score = 65.9 bits (159), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 35/99 (35%), Positives = 57/99 (57%), Gaps = 7/99 (7%)

Query: 1369 VHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPAS 1428
            + S++  LD +  P    T  +     T +R+DISF   +   ++  +L++   +++P  
Sbjct: 139  ISSEIKVLDKATVPIVKMTDKE-----TELRVDISFNMIN--SVKAAELIRVFMKRYPCL 191

Query: 1429 TPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQH 1467
              L  VLKQFL  RSL++ ++GGLSSY L+L+   FLQH
Sbjct: 192  PCLVFVLKQFLLQRSLNEVWTGGLSSYALILMCVSFLQH 230



 Score = 52.4 bits (124), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 32/108 (29%), Positives = 56/108 (51%), Gaps = 10/108 (9%)

Query: 1175 ISQLARDQEHPDVAFPL-QPLEVQNCPTRKASLSLMHSL--LHEEIDSFCKQVA---AEN 1228
            ++ + +  E+ +  FPL Q  ++++ P +         +  LH EI  F   ++   AE 
Sbjct: 15   VTSVLKPDEYKNYGFPLGQSFKLEDIPWKSTEKEYDFGVIGLHAEIKDFLNYISPSPAEQ 74

Query: 1229 TARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVV 1276
             AR+      V +V   +  LWP  + ++FGS  TGL LP+SD+D+V+
Sbjct: 75   FAREV----VVAKVKDIVYSLWPNCQVDVFGSFKTGLYLPTSDIDMVI 118


>gi|448101749|ref|XP_004199636.1| Piso0_002176 [Millerozyma farinosa CBS 7064]
 gi|359381058|emb|CCE81517.1| Piso0_002176 [Millerozyma farinosa CBS 7064]
          Length = 646

 Score = 65.5 bits (158), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 57/231 (24%), Positives = 91/231 (39%), Gaps = 75/231 (32%)

Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
            N  VK + R +  LWP + T++FGS+AT L LP SD+D+VV            + G  E 
Sbjct: 210  NRVVKDLKREINSLWPDTETHVFGSSATDLYLPGSDIDMVVT----------SKTGDYEN 259

Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
            R     + L   + YL N++  K   ++ +    +PII  V                 P 
Sbjct: 260  R-----SKLYQLSSYLRNRKLAK--DIEVIAKAKVPIIKFV----------------DPS 296

Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
                       +++H D+           S    + I+AA  +R                
Sbjct: 297  -----------SNIHIDI-----------SFERRNGIEAAKRIR---------------- 318

Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
                +  ++ P    L L++KQFL  R L+  + GGL  Y  ++L   FL+
Sbjct: 319  ----KWLDKTPGLRELVLIIKQFLRSRRLNNVHVGGLGGYSTIILCYHFLR 365


>gi|145348860|ref|XP_001418861.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144579091|gb|ABO97154.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 347

 Score = 65.1 bits (157), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 64/257 (24%), Positives = 100/257 (38%), Gaps = 83/257 (32%)

Query: 1213 LHEEIDSFCKQVA---AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
            LH EI  F + +     E T+R      AV+RV   +  +WP +R  + GS ATG+ LPS
Sbjct: 65   LHTEIVDFSRYLEPTEEEATSRAA----AVERVRAVVNGIWPDARFEVHGSFATGMYLPS 120

Query: 1270 SDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTA 1329
            SD+D V+     +N      AG+          CL+  A  LA +    +  ++ +    
Sbjct: 121  SDIDAVILDSGAKN------AGL----------CLKALAVALARRGM--AIKIQLIAKAR 162

Query: 1330 IPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSS 1389
            +PI+                 V+S                H   ++ D +  P  +    
Sbjct: 163  VPIVKF-------------EEVESG---------------HQFDISFDVANGPASAEIVR 194

Query: 1390 DNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYS 1449
            +N++                              +FPA  PL  VLK FL  R L++ YS
Sbjct: 195  ENMR------------------------------RFPALRPLTTVLKAFLHQRGLNEVYS 224

Query: 1450 GGLSSYCLMLLITRFLQ 1466
            GG+ SY L+ ++   LQ
Sbjct: 225  GGIGSYALLCMVMAHLQ 241


>gi|72392291|ref|XP_846946.1| DNA polymerase sigma [Trypanosoma brucei brucei strain 927/4
            GUTat10.1]
 gi|62176235|gb|AAX70351.1| DNA polymerase sigma, putative [Trypanosoma brucei]
 gi|70802976|gb|AAZ12880.1| DNA polymerase sigma, putative [Trypanosoma brucei brucei strain
            927/4 GUTat10.1]
          Length = 746

 Score = 65.1 bits (157), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 74/265 (27%), Positives = 105/265 (39%), Gaps = 81/265 (30%)

Query: 1213 LHEEIDSFCK--QVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSS 1270
            LH+EI  F K  Q      A +  +   V  + + L   WP+    +FGS ATGL LP S
Sbjct: 324  LHQEICDFVKFHQPTEAEIAIRCLVEAEVVTIAKQL---WPKCEPVVFGSMATGLLLPLS 380

Query: 1271 DVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAI 1330
            DVD+ +    V    P +EA             L+  AR +         SL  + NTA 
Sbjct: 381  DVDITILNVDV----PTEEA-------------LRLLAREI---------SLSGICNTAY 414

Query: 1331 PIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSD 1390
            P ++L  +VP                      L    H  S    LD             
Sbjct: 415  PQLVLKAKVP----------------------LLKFQHCGS---LLD------------- 436

Query: 1391 NIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSG 1450
                     +DIS    +  G + T +V +L + +  + PL +V+K FL  R +D+ Y G
Sbjct: 437  ---------VDISIN--ARDGPRNTLIVIDLLKHYREAEPLIVVVKYFLHQRGMDEPYHG 485

Query: 1451 GLSSYCLMLLITRFL-QHEHHLGRP 1474
            GL S+ L L+   FL QH  + G P
Sbjct: 486  GLGSFALTLMAISFLQQHPIYTGSP 510


>gi|224064673|ref|XP_002197521.1| PREDICTED: PAP-associated domain-containing protein 5 isoform 1
            [Taeniopygia guttata]
          Length = 443

 Score = 64.7 bits (156), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 64/245 (26%), Positives = 93/245 (37%), Gaps = 91/245 (37%)

Query: 1227 ENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVV-----CLPPV 1281
            E T R   +N    R+   ++ LWP +   IFGS  TGL LP+SD+DLVV      LP  
Sbjct: 7    EETMRMEVVN----RIENVIKELWPNADVQIFGSFKTGLYLPTSDIDLVVFGKWETLP-- 60

Query: 1282 RNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPH 1341
              L  ++EA                    L        +S+K ++   +PII L      
Sbjct: 61   --LWTLEEA--------------------LRKHNVADENSVKVLDKATVPIIKLT----- 93

Query: 1342 DLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLD 1401
                             + T +K D                  S    + +KAA  ++  
Sbjct: 94   ----------------DSFTEVKVD-----------------ISFNVQNGVKAAQLIK-- 118

Query: 1402 ISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLI 1461
                          D +K    ++P    L LVLKQFL  R L++ ++GG+ SY L L+ 
Sbjct: 119  --------------DFIK----KYPVLPFLVLVLKQFLLQRDLNEVFTGGIGSYSLFLMA 160

Query: 1462 TRFLQ 1466
              FLQ
Sbjct: 161  VSFLQ 165


>gi|145509541|ref|XP_001440709.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124407937|emb|CAK73312.1| unnamed protein product [Paramecium tetraurelia]
          Length = 962

 Score = 64.7 bits (156), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 57/226 (25%), Positives = 97/226 (42%), Gaps = 55/226 (24%)

Query: 1257 IFGSNATGLSLPSSDVDLVVC---LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLAN 1313
            +FGS ATGL+LP SD+D+ +    L P   L            NG     +Q    +L N
Sbjct: 657  LFGSCATGLALPESDIDIGITGFELLPSNQL------------NG----PIQKIIEFLQN 700

Query: 1314 QEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDM 1373
             +WV   +++ +  +++P+I L V+ P      ++  +  P+                D+
Sbjct: 701  MKWVT--NIRAITTSSMPLIKLQVD-PSISFVESSHKILLPQ---------------IDL 742

Query: 1374 VALDDSASPKCSHTSSDNIKAATSVRLDISF------KSPSHTGLQTTDLVKELTEQFPA 1427
            +   D  SP            +    +DISF      K   H G  +T+   +    +  
Sbjct: 743  ITNYDKESP------------SRIFSVDISFFQYQGSKQNWHLGQISTEQTLQWLSFYSE 790

Query: 1428 STPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQHEHHLGR 1473
              P+ L+ K  L  R L+  Y GG+SS+C++ +I  FL+  +H G+
Sbjct: 791  LRPIVLLFKSLLKKRGLNDQYKGGISSFCIIQMILAFLESCYHQGQ 836


>gi|195398455|ref|XP_002057837.1| GJ17879 [Drosophila virilis]
 gi|194141491|gb|EDW57910.1| GJ17879 [Drosophila virilis]
          Length = 416

 Score = 64.7 bits (156), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 34/94 (36%), Positives = 52/94 (55%), Gaps = 7/94 (7%)

Query: 1373 MVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLA 1432
            M  +D +  P    T S      T ++LD+SF      G+ + +L+K     +P    L 
Sbjct: 115  MHVVDTALVPIIKFTDSK-----TQIKLDVSFNISD--GIHSAELIKTYLRDYPGLGKLV 167

Query: 1433 LVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            LVLKQF+  R L++++SGG+SSY L+L+   FLQ
Sbjct: 168  LVLKQFMLQRKLNEAFSGGISSYSLLLMCISFLQ 201



 Score = 56.6 bits (135), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 32/80 (40%), Positives = 46/80 (57%), Gaps = 3/80 (3%)

Query: 1199 CPTRKASLSLMHSL--LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTN 1256
            CP ++A+ +    +  LHEEI+ F + +    T     I   VKR+   +  LWP++   
Sbjct: 10   CPWQQANYNYGDGIIGLHEEIEHFYRYILPTPTEHAVRIE-LVKRIESIIHELWPQALVE 68

Query: 1257 IFGSNATGLSLPSSDVDLVV 1276
            IFGS  TGL LP+SD+DLVV
Sbjct: 69   IFGSFRTGLFLPNSDIDLVV 88


>gi|449472874|ref|XP_004176276.1| PREDICTED: PAP-associated domain-containing protein 5 isoform 2
            [Taeniopygia guttata]
          Length = 490

 Score = 64.7 bits (156), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 64/245 (26%), Positives = 93/245 (37%), Gaps = 91/245 (37%)

Query: 1227 ENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVV-----CLPPV 1281
            E T R   +N    R+   ++ LWP +   IFGS  TGL LP+SD+DLVV      LP  
Sbjct: 7    EETMRMEVVN----RIENVIKELWPNADVQIFGSFKTGLYLPTSDIDLVVFGKWETLP-- 60

Query: 1282 RNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPH 1341
              L  ++EA                    L        +S+K ++   +PII L      
Sbjct: 61   --LWTLEEA--------------------LRKHNVADENSVKVLDKATVPIIKLT----- 93

Query: 1342 DLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLD 1401
                             + T +K D                  S    + +KAA  ++  
Sbjct: 94   ----------------DSFTEVKVD-----------------ISFNVQNGVKAAQLIK-- 118

Query: 1402 ISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLI 1461
                          D +K    ++P    L LVLKQFL  R L++ ++GG+ SY L L+ 
Sbjct: 119  --------------DFIK----KYPVLPFLVLVLKQFLLQRDLNEVFTGGIGSYSLFLMA 160

Query: 1462 TRFLQ 1466
              FLQ
Sbjct: 161  VSFLQ 165


>gi|345325980|ref|XP_001507597.2| PREDICTED: PAP-associated domain-containing protein 5-like
            [Ornithorhynchus anatinus]
          Length = 578

 Score = 64.3 bits (155), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 35/92 (38%), Positives = 51/92 (55%), Gaps = 7/92 (7%)

Query: 1375 ALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALV 1434
             LD +  P    T S      T V++DISF      G++   L+K+ T+++P    L LV
Sbjct: 170  VLDKATVPIIKLTDS-----FTEVKVDISFNV--QNGVKAAQLIKDFTKKYPVLPYLVLV 222

Query: 1435 LKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            LKQFL  R L++ ++GG+ SY L L+   FLQ
Sbjct: 223  LKQFLLQRDLNEVFTGGIGSYSLFLMAVSFLQ 254



 Score = 52.8 bits (125), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 27/64 (42%), Positives = 38/64 (59%), Gaps = 1/64 (1%)

Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
            LHEEI+ F + ++      K  +   V R+   ++ LWP +   IFGS  TGL LP+SD+
Sbjct: 79   LHEEINDFYEYMSPRPEEEKMRME-VVNRIENVIKELWPTADVQIFGSFKTGLYLPTSDI 137

Query: 1273 DLVV 1276
            DLVV
Sbjct: 138  DLVV 141


>gi|310799736|gb|EFQ34629.1| hypothetical protein GLRG_09773 [Glomerella graminicola M1.001]
          Length = 756

 Score = 64.3 bits (155), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 67/287 (23%), Positives = 109/287 (37%), Gaps = 85/287 (29%)

Query: 1201 TRKASLSLMHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQV--LWPRSRTNIF 1258
            T  +  S M   LH+EI  F + V   +   +      V+R+ RSL+    +       F
Sbjct: 416  TDHSKTSDMSVWLHKEIIDFYEVVRPRDFEHE-MRTQLVERLRRSLKTSHFYKDCDVRPF 474

Query: 1259 GSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG--RNGIKETCLQHAARYLANQEW 1316
            GS  +GL LP++D+DLVVC               L+G   N      L++  ++LA  + 
Sbjct: 475  GSYMSGLYLPTADMDLVVCA-----------RSWLDGAHSNFFGMKALRNFGKFLAQNKV 523

Query: 1317 VKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVAL 1376
               ++++ + +  +P++  +                                        
Sbjct: 524  THYNTMEFIASAKVPLVKYI---------------------------------------- 543

Query: 1377 DDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLK 1436
                         DNI   T +R+DISF      G Q      E  EQ+PA   L  ++K
Sbjct: 544  -------------DNI---TGLRVDISFDR--LDGPQAVKTFAEWKEQYPAMPILVTMIK 585

Query: 1437 QFLADRSLDQSYSGGLSSYCLMLLITRFLQ-----------HEHHLG 1472
             FLA R L++  +GG+ S+ +  ++   LQ            EHHLG
Sbjct: 586  HFLAMRGLNEPVNGGIGSFTVTCMVVSMLQLMPQVQSRNLIPEHHLG 632


>gi|307168873|gb|EFN61797.1| PAP-associated domain-containing protein 5 [Camponotus floridanus]
          Length = 643

 Score = 64.3 bits (155), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 32/96 (33%), Positives = 56/96 (58%), Gaps = 7/96 (7%)

Query: 1371 SDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTP 1430
            S +  LD ++ P    T  +     + +++DISF    + G+++ DL+    +++P    
Sbjct: 274  SSIKVLDKASVPIVKLTDKE-----SEIKVDISFNM--NNGVKSADLINSFKKRYPVLEK 326

Query: 1431 LALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            L +VLKQFL  R L++ ++GG+SSY L+L+   FLQ
Sbjct: 327  LVMVLKQFLLQRDLNEVFTGGISSYSLILMTISFLQ 362



 Score = 58.5 bits (140), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 44/147 (29%), Positives = 65/147 (44%), Gaps = 30/147 (20%)

Query: 1195 EVQNCPTRKASLSLMHSL--LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPR 1252
            E   CP R  +      +  LHEEI+ F   +   +      I   VKR+ + +  LWP 
Sbjct: 167  EYGGCPWRVPNKHYSRGVIGLHEEIEDFFAYMCPSHEEHVLRIR-VVKRIEQVIYDLWPN 225

Query: 1253 SRTNIFGSNATGLSLPSSDVDLVVCLP----PVRNLEPIKEAGILEGRNGIKETCLQHAA 1308
            S+  +FGS  TGL LP+SD+DLVV       P+R LE                       
Sbjct: 226  SKVEVFGSFRTGLYLPTSDIDLVVIGMWTNLPLRTLE----------------------- 262

Query: 1309 RYLANQEWVKSDSLKTVENTAIPIIML 1335
            R L +Q   +  S+K ++  ++PI+ L
Sbjct: 263  RALLDQNIAEPSSIKVLDKASVPIVKL 289


>gi|219130012|ref|XP_002185169.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217403348|gb|EEC43301.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 425

 Score = 64.3 bits (155), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 66/260 (25%), Positives = 105/260 (40%), Gaps = 70/260 (26%)

Query: 1213 LHEEIDSFCKQV--AAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSS 1270
            LH EI  F + +    E   ++  +   VKRV   +   +  ++ ++FGS ATGL LPSS
Sbjct: 74   LHNEIVQFVRLMEPMPEEIRQREDL---VKRVEELVHRTFDNAQVHVFGSQATGLFLPSS 130

Query: 1271 DVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARY--LANQEWVKSDS-LKTVEN 1327
            DVDL+V    + N +   E    + +   ++       R+  +  ++W+   S L+ + N
Sbjct: 131  DVDLLV----ITNEKANDETSQPDRQEDWQKPSGSPLDRFESVLREDWLMELSYLEVIGN 186

Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
            T +P++                     K   A T +  D       V  D  + P  +  
Sbjct: 187  TKVPLV---------------------KFTHAPTNISVD-------VCFDQESGPGAAQL 218

Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
                ++A   +R                              PL  VLK FL+ R L++ 
Sbjct: 219  MKTYLEALPPLR------------------------------PLTFVLKYFLSARGLNEP 248

Query: 1448 YSGGLSSYCLMLLITRFLQH 1467
            YSGG+ SY L L+I  FLQH
Sbjct: 249  YSGGVGSYLLQLMIVSFLQH 268


>gi|224483450|gb|ACN51269.1| nuclear non-canonical poly(A) polymerase 1 [Trypanosoma brucei]
          Length = 746

 Score = 64.3 bits (155), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 74/265 (27%), Positives = 105/265 (39%), Gaps = 81/265 (30%)

Query: 1213 LHEEIDSFCK--QVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSS 1270
            LH+EI  F K  Q      A +  +   V  + + L   WP+    +FGS ATGL LP S
Sbjct: 324  LHQEIWDFVKFHQPTEAEIAIRCLVEAEVVTIAKQL---WPKCEPVVFGSMATGLLLPLS 380

Query: 1271 DVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAI 1330
            DVD+ +    V    P +EA             L+  AR +         SL  + NTA 
Sbjct: 381  DVDITILNVDV----PTEEA-------------LRLLAREI---------SLSGICNTAY 414

Query: 1331 PIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSD 1390
            P ++L  +VP                      L    H  S    LD             
Sbjct: 415  PQLVLKAKVP----------------------LLKFQHCGS---LLD------------- 436

Query: 1391 NIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSG 1450
                     +DIS    +  G + T +V +L + +  + PL +V+K FL  R +D+ Y G
Sbjct: 437  ---------VDISIN--ARDGPRNTLIVIDLLKHYREAEPLIVVVKYFLHQRGMDEPYHG 485

Query: 1451 GLSSYCLMLLITRFL-QHEHHLGRP 1474
            GL S+ L L+   FL QH  + G P
Sbjct: 486  GLGSFALTLMAISFLQQHPIYTGSP 510


>gi|50557292|ref|XP_506054.1| YALI0F30525p [Yarrowia lipolytica]
 gi|49651924|emb|CAG78867.1| YALI0F30525p [Yarrowia lipolytica CLIB122]
          Length = 716

 Score = 63.9 bits (154), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 58/256 (22%), Positives = 102/256 (39%), Gaps = 77/256 (30%)

Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
            L EE+  F K ++   +  K   +  V+RV  ++  LW  ++ ++ GS  T + LP SD+
Sbjct: 249  LQEEVMDFVKYISPTQSEIKARQDL-VERVRGAVNGLWGDAKVHVLGSFTTNMHLPQSDI 307

Query: 1273 DLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPI 1332
            DLVVC P               G  G +  C+   +  + ++  +K   L+T+    +PI
Sbjct: 308  DLVVCSP--------------HGHYG-ERACIYQLSSVIRSR--MKVAELQTITKARVPI 350

Query: 1333 IMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNI 1392
            I  +                                VH D+           S      I
Sbjct: 351  IKFI---------------------------DSRTGVHVDI-----------SFEKDGGI 372

Query: 1393 KAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGL 1452
            K A++                    + + ++  PA  PL L++KQFL+ R L++ ++GGL
Sbjct: 373  KTAST--------------------ITKWSKSMPALRPLVLIVKQFLSSRRLNEVFTGGL 412

Query: 1453 SSYCLMLLITRFL-QH 1467
              + ++ ++  FL QH
Sbjct: 413  GGFSVICMVYSFLIQH 428


>gi|391346299|ref|XP_003747415.1| PREDICTED: PAP-associated domain-containing protein 5-like
            [Metaseiulus occidentalis]
          Length = 491

 Score = 63.9 bits (154), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 65/272 (23%), Positives = 104/272 (38%), Gaps = 89/272 (32%)

Query: 1200 PTRKASLSLMHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFG 1259
            P R  SL L    L EEI  F    A  N A +      +++V  +++  WP     +FG
Sbjct: 75   PNRSYSLGL--EGLTEEIHDFF-MYAQPNAADQSRREQVIEKVRAAIREKWPDCVVEVFG 131

Query: 1260 SNATGLSLPSSDVDLVV-----CLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQ 1314
            S  TGL LP+ D+D+V+      +PP+ +LE                       R L  +
Sbjct: 132  SYKTGLYLPTGDIDMVIQGNWEIIPPLFDLE-----------------------RQLIEK 168

Query: 1315 EWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMV 1374
            +  + ++ K ++  ++P+I                      +DA       D  +  D+ 
Sbjct: 169  KVGEKNTFKVLDKASVPLIKF--------------------KDA-------DTEIRVDL- 200

Query: 1375 ALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALV 1434
                      S   ++  +AA  V                    K+    FP    L  V
Sbjct: 201  ----------SFNQANCTEAAAFV--------------------KQCCRTFPPLAKLIFV 230

Query: 1435 LKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            LKQ+L+   L++ + GG+SSY L L+I  FLQ
Sbjct: 231  LKQYLSLHGLNEVFHGGISSYSLTLMILSFLQ 262


>gi|159112073|ref|XP_001706266.1| Topoisomerase I-related protein [Giardia lamblia ATCC 50803]
 gi|157434361|gb|EDO78592.1| Topoisomerase I-related protein [Giardia lamblia ATCC 50803]
          Length = 520

 Score = 63.9 bits (154), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 57/253 (22%), Positives = 103/253 (40%), Gaps = 78/253 (30%)

Query: 1219 SFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCL 1278
            ++C    AE  AR+    + + ++   +Q++ P +  ++FGS +TG+S  SSD+D+ V +
Sbjct: 106  AYCTLTEAEKAARE----YVLGQLRDIIQLVLPDATVDVFGSYSTGMSSYSSDLDICVNV 161

Query: 1279 PPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVE 1338
                   P+  A +++        C  H    L  +  + ++ +    +  +PII     
Sbjct: 162  -------PVNSAAMMQ--------CHMHDIATLLRRS-ISTNYVDVRSHARVPII----- 200

Query: 1339 VPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSV 1398
                                          VHS++                       S+
Sbjct: 201  ----------------------------KGVHSEL-----------------------SL 209

Query: 1399 RLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLM 1458
              DISF SP   G    + +    E+ P +  L +V+K FL  R L+Q Y+GG+SSY L+
Sbjct: 210  EYDISFNSPH--GAAHRETILGYIEKHPLARILIMVVKSFLKKRGLNQPYTGGMSSYILL 267

Query: 1459 LLITRFLQHEHHL 1471
             LI  ++   H++
Sbjct: 268  QLIVVYISERHNV 280


>gi|195354696|ref|XP_002043832.1| GM17777 [Drosophila sechellia]
 gi|194129070|gb|EDW51113.1| GM17777 [Drosophila sechellia]
          Length = 236

 Score = 63.9 bits (154), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 36/96 (37%), Positives = 56/96 (58%), Gaps = 7/96 (7%)

Query: 1373 MVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLA 1432
            +  LD ++ P    T  D I   + +R D++F S + +G+Q  +L+K+   QFP    L 
Sbjct: 118  VTVLDKASVPVVKFT--DRI---SRIRFDVTFNSVA-SGVQAAELIKDFIRQFPELPKLV 171

Query: 1433 LVLKQFLADRSLDQSY-SGGLSSYCLMLLITRFLQH 1467
            +VLKQFL+    ++ Y SGG+SSY L L++  FLQ 
Sbjct: 172  MVLKQFLSLHGFNEVYNSGGVSSYALTLMVISFLQQ 207



 Score = 51.6 bits (122), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 27/65 (41%), Positives = 40/65 (61%), Gaps = 1/65 (1%)

Query: 1212 LLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSD 1271
            LLH+EI+ F   + +  T        AV+R+   +  +WPR+  ++FGS  TGL+LP SD
Sbjct: 27   LLHQEIEQFYNYIRSTPTEFCLRAG-AVRRIEDVVLSIWPRAFVDLFGSFRTGLNLPDSD 85

Query: 1272 VDLVV 1276
            +DLVV
Sbjct: 86   IDLVV 90


>gi|195331816|ref|XP_002032595.1| GM26647 [Drosophila sechellia]
 gi|195354690|ref|XP_002043829.1| GM17772 [Drosophila sechellia]
 gi|194121538|gb|EDW43581.1| GM26647 [Drosophila sechellia]
 gi|194129067|gb|EDW51110.1| GM17772 [Drosophila sechellia]
          Length = 403

 Score = 63.5 bits (153), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 36/96 (37%), Positives = 56/96 (58%), Gaps = 7/96 (7%)

Query: 1373 MVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLA 1432
            +  LD ++ P    T  D I   + +R D++F S + +G+Q  +L+K+   QFP    L 
Sbjct: 118  VTVLDKASVPVVKFT--DRI---SRIRFDVTFNSVA-SGVQAAELIKDFIRQFPELPKLV 171

Query: 1433 LVLKQFLADRSLDQSY-SGGLSSYCLMLLITRFLQH 1467
            +VLKQFL+    ++ Y SGG+SSY L L++  FLQ 
Sbjct: 172  MVLKQFLSLHGFNEVYNSGGVSSYALTLMVISFLQQ 207



 Score = 52.0 bits (123), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 27/65 (41%), Positives = 40/65 (61%), Gaps = 1/65 (1%)

Query: 1212 LLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSD 1271
            LLH+EI+ F   + +  T        AV+R+   +  +WPR+  ++FGS  TGL+LP SD
Sbjct: 27   LLHQEIEQFYNYIRSTPTEFCLRAG-AVRRIEDVVLSIWPRAFVDLFGSFRTGLNLPDSD 85

Query: 1272 VDLVV 1276
            +DLVV
Sbjct: 86   IDLVV 90


>gi|195391176|ref|XP_002054239.1| GJ24338 [Drosophila virilis]
 gi|194152325|gb|EDW67759.1| GJ24338 [Drosophila virilis]
          Length = 397

 Score = 63.5 bits (153), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 62/257 (24%), Positives = 104/257 (40%), Gaps = 80/257 (31%)

Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
            LH+EI+ F   + +  T     +  AV R+ R +  +WP +   +FGS  TGL+LP SD+
Sbjct: 25   LHQEIEHFYNYIVSTPTEYMMRME-AVHRIERVVLSIWPDACIEVFGSFRTGLNLPISDI 83

Query: 1273 DLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPI 1332
            D+ V      N      A +LE +N +        AR +A       D++  ++  ++P+
Sbjct: 84   DIAV------NNFYWHGAPLLELKNAL-------MARGVA-------DNVNVLDKASVPV 123

Query: 1333 IMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNI 1392
            +     +                     + +K D                  S  ++  +
Sbjct: 124  VKFTERI---------------------SEIKFD-----------------ISFNTTTGV 145

Query: 1393 KAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSY-SGG 1451
            KAA  V                    +   EQFP    L ++LKQ+L  + L++ Y SGG
Sbjct: 146  KAAELV--------------------QRFIEQFPELPKLVIILKQYLMLQGLNEVYSSGG 185

Query: 1452 LSSYCLMLLITRFLQHE 1468
            +SSY + L+   FLQ +
Sbjct: 186  ISSYAVTLMCISFLQQQ 202


>gi|195573661|ref|XP_002104810.1| GD21148 [Drosophila simulans]
 gi|194200737|gb|EDX14313.1| GD21148 [Drosophila simulans]
          Length = 403

 Score = 63.5 bits (153), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 36/96 (37%), Positives = 56/96 (58%), Gaps = 7/96 (7%)

Query: 1373 MVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLA 1432
            +  LD ++ P    T  D I   + +R D++F S + +G+Q  +L+K+   QFP    L 
Sbjct: 118  VTVLDKASVPVVKFT--DRI---SRIRFDVTFNSVA-SGVQAAELIKDFIRQFPELPKLV 171

Query: 1433 LVLKQFLADRSLDQSY-SGGLSSYCLMLLITRFLQH 1467
            +VLKQFL+    ++ Y SGG+SSY L L++  FLQ 
Sbjct: 172  MVLKQFLSLHGFNEVYNSGGVSSYALTLMVISFLQQ 207



 Score = 48.9 bits (115), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 26/65 (40%), Positives = 39/65 (60%), Gaps = 1/65 (1%)

Query: 1212 LLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSD 1271
            LLH+EI+ F   + +  T        AV+R+   +  +WP +  ++FGS  TGL+LP SD
Sbjct: 27   LLHQEIEQFYNYIRSTPTEFCLRAG-AVRRIEDVVLSIWPSAFVDLFGSFRTGLNLPDSD 85

Query: 1272 VDLVV 1276
            +DLVV
Sbjct: 86   IDLVV 90


>gi|195053247|ref|XP_001993538.1| GH13010 [Drosophila grimshawi]
 gi|193900597|gb|EDV99463.1| GH13010 [Drosophila grimshawi]
          Length = 359

 Score = 63.5 bits (153), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 35/94 (37%), Positives = 53/94 (56%), Gaps = 7/94 (7%)

Query: 1373 MVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLA 1432
            M  L  +A P    T  +     T +++D+SF   +  G+Q+ +L+K     +PA   L 
Sbjct: 121  MRVLATAAVPIIKFTDRE-----TKIKVDLSFNMAN--GVQSAELIKIFKRDYPALGYLV 173

Query: 1433 LVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            +VLKQFL  R L++ ++ GLSSY L+LL   FLQ
Sbjct: 174  MVLKQFLLQRDLNEVFTAGLSSYSLILLCISFLQ 207



 Score = 57.4 bits (137), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 29/67 (43%), Positives = 43/67 (64%), Gaps = 7/67 (10%)

Query: 1213 LHEEIDSFCKQVA---AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
            LHEEI+ F + +    +E+ AR    N  ++R+   ++ LWP++   IFGS  TGL LP+
Sbjct: 32   LHEEIEHFYRYILPTPSEHAAR----NKVIQRIENIIRALWPQALVEIFGSFRTGLFLPN 87

Query: 1270 SDVDLVV 1276
            SD+DLVV
Sbjct: 88   SDIDLVV 94


>gi|59807821|gb|AAH89405.1| PAPD5 protein, partial [Homo sapiens]
          Length = 390

 Score = 63.5 bits (153), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 29/68 (42%), Positives = 44/68 (64%), Gaps = 2/68 (2%)

Query: 1399 RLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLM 1458
            ++DISF      G++  DL+K+ T+++P    L LVLKQFL  R L++ ++GG+ SY L 
Sbjct: 1    KVDISFNV--QNGVRAADLIKDFTKKYPVLPYLVLVLKQFLLQRDLNEVFTGGIGSYSLF 58

Query: 1459 LLITRFLQ 1466
            L+   FLQ
Sbjct: 59   LMAVSFLQ 66


>gi|67989518|ref|NP_001018181.1| poly(A) polymerase Cid14 [Schizosaccharomyces pombe 972h-]
 gi|81175166|sp|Q9UTN3.2|CID14_SCHPO RecName: Full=Poly(A) RNA polymerase cid14; Short=PAP; AltName:
            Full=Caffeine-induced death protein 14; AltName:
            Full=Polynucleotide adenylyltransferase cid14
 gi|62554069|emb|CAI79317.1| poly(A) polymerase Cid14 [Schizosaccharomyces pombe]
          Length = 684

 Score = 63.5 bits (153), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 30/71 (42%), Positives = 46/71 (64%), Gaps = 2/71 (2%)

Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
            T V +DISF  P   GL+T  +V    +++PA  PL +++K FL  R+L++ + GGLSSY
Sbjct: 351  TKVHVDISFNQPG--GLKTCLVVNGFMKKYPALRPLVIIIKHFLNMRALNEVFLGGLSSY 408

Query: 1456 CLMLLITRFLQ 1466
             ++ L+  FLQ
Sbjct: 409  AIVCLVVSFLQ 419


>gi|24649854|ref|NP_651313.1| Trf4-2, isoform A [Drosophila melanogaster]
 gi|442621019|ref|NP_001262941.1| Trf4-2, isoform B [Drosophila melanogaster]
 gi|7301238|gb|AAF56369.1| Trf4-2, isoform A [Drosophila melanogaster]
 gi|21430468|gb|AAM50912.1| LP06848p [Drosophila melanogaster]
 gi|220950110|gb|ACL87598.1| CG17462-PA [synthetic construct]
 gi|440217872|gb|AGB96321.1| Trf4-2, isoform B [Drosophila melanogaster]
          Length = 407

 Score = 63.5 bits (153), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 35/106 (33%), Positives = 57/106 (53%), Gaps = 7/106 (6%)

Query: 1373 MVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLA 1432
            +  LD ++ P    T        + +R D++F S + +G+Q  DL+K+    FP    L 
Sbjct: 118  VTVLDKASVPVVKFTD-----LISRIRFDVTFNSVA-SGVQAADLIKDFIRHFPELPKLV 171

Query: 1433 LVLKQFLADRSLDQSY-SGGLSSYCLMLLITRFLQHEHHLGRPINQ 1477
            +VLKQFL+    ++ Y SGG+SSY L L++  FLQ      R +++
Sbjct: 172  MVLKQFLSLHGFNEVYNSGGVSSYALTLMVISFLQQHARSNRRLSE 217



 Score = 49.3 bits (116), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 26/65 (40%), Positives = 39/65 (60%), Gaps = 1/65 (1%)

Query: 1212 LLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSD 1271
            LLH+EI+ F   + +  T        AV+R+   +  +WP +  ++FGS  TGL+LP SD
Sbjct: 27   LLHQEIEQFYNYIRSTPTEFCLRAG-AVRRIEDVVLSIWPSASVDLFGSFRTGLNLPDSD 85

Query: 1272 VDLVV 1276
            +DLVV
Sbjct: 86   IDLVV 90


>gi|332030078|gb|EGI69903.1| PAP-associated domain-containing protein 5 [Acromyrmex echinatior]
          Length = 662

 Score = 63.2 bits (152), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 32/96 (33%), Positives = 55/96 (57%), Gaps = 7/96 (7%)

Query: 1371 SDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTP 1430
            S +  LD ++ P    T  +     + +++DISF    + G+++ +L+     Q+P    
Sbjct: 294  SSIKVLDKASVPIVKLTDKE-----SEIKVDISFNM--NNGVKSAELINSYKRQYPVLEK 346

Query: 1431 LALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            L +VLKQFL  R L++ ++GG+SSY L+L+   FLQ
Sbjct: 347  LVMVLKQFLLQRDLNEVFTGGISSYSLILMTISFLQ 382



 Score = 55.8 bits (133), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 43/147 (29%), Positives = 65/147 (44%), Gaps = 30/147 (20%)

Query: 1195 EVQNCPTRKAS--LSLMHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPR 1252
            E   CP R  +   S+    LHEEI+ F   +   +      +   VKR+   +  LWP 
Sbjct: 187  EYGGCPWRVTNKHYSMGTIGLHEEIEDFFSYMCPSHEEHVLRLR-VVKRIENVIYDLWPD 245

Query: 1253 SRTNIFGSNATGLSLPSSDVDLVVCLP----PVRNLEPIKEAGILEGRNGIKETCLQHAA 1308
            S+  +FGS  TGL LP+SD+DLVV       P+R LE                       
Sbjct: 246  SKVEVFGSFRTGLYLPTSDIDLVVIGMWTNLPLRTLE----------------------- 282

Query: 1309 RYLANQEWVKSDSLKTVENTAIPIIML 1335
            R L ++   +  S+K ++  ++PI+ L
Sbjct: 283  RALLDRNIAEPSSIKVLDKASVPIVKL 309


>gi|261330131|emb|CBH13115.1| DNA polymerase sigma, putative [Trypanosoma brucei gambiense DAL972]
          Length = 746

 Score = 63.2 bits (152), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 73/265 (27%), Positives = 105/265 (39%), Gaps = 81/265 (30%)

Query: 1213 LHEEIDSFCK--QVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSS 1270
            LH+EI  F +  Q      A +  +   V  + + L   WP+    +FGS ATGL LP S
Sbjct: 324  LHQEICDFVEFHQPTEAEIAIRCLVEAEVVTIAKQL---WPKCEPVVFGSMATGLLLPLS 380

Query: 1271 DVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAI 1330
            DVD+ +    V    P +EA             L+  AR +         SL  + NTA 
Sbjct: 381  DVDITILNVDV----PTEEA-------------LRLLAREI---------SLSGICNTAY 414

Query: 1331 PIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSD 1390
            P ++L  +VP                      L    H  S    LD             
Sbjct: 415  PQLVLKAKVP----------------------LLKFQHCGS---LLD------------- 436

Query: 1391 NIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSG 1450
                     +DIS    +  G + T +V +L + +  + PL +V+K FL  R +D+ Y G
Sbjct: 437  ---------VDISIN--ARDGPRNTLIVIDLLKHYREAEPLIVVVKYFLHQRGMDEPYHG 485

Query: 1451 GLSSYCLMLLITRFL-QHEHHLGRP 1474
            GL S+ L L+   FL QH  + G P
Sbjct: 486  GLGSFALTLMAISFLQQHPIYTGSP 510


>gi|403373923|gb|EJY86891.1| Poly(A) RNA polymerase putative [Oxytricha trifallax]
          Length = 403

 Score = 63.2 bits (152), Expect = 1e-06,   Method: Composition-based stats.
 Identities = 33/78 (42%), Positives = 48/78 (61%), Gaps = 2/78 (2%)

Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
            TS  +DISF   +  G+    LVK L  ++P   PL +VLK FL  R L+++YSGG+SS+
Sbjct: 214  TSCNIDISFNRTN--GIYCVKLVKTLMIKYPELRPLMIVLKAFLKCRGLNETYSGGISSF 271

Query: 1456 CLMLLITRFLQHEHHLGR 1473
             L +L T +LQ  +  G+
Sbjct: 272  LLTMLATSYLQMAYKSGK 289



 Score = 47.4 bits (111), Expect = 0.069,   Method: Composition-based stats.
 Identities = 28/74 (37%), Positives = 41/74 (55%), Gaps = 7/74 (9%)

Query: 1212 LLHEEIDSFCKQVAA---ENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLP 1268
             LH EI  F K +     +  AR+      V R+ + ++  + +++  IFGS ATGL LP
Sbjct: 106  FLHNEILDFVKYIGTTTEDQQARRK----VVSRIHKIVKECFSQAKVMIFGSCATGLDLP 161

Query: 1269 SSDVDLVVCLPPVR 1282
            +SDVDL+V  P  R
Sbjct: 162  NSDVDLLVYYPDQR 175


>gi|387219307|gb|AFJ69362.1| poly rna polymerase, partial [Nannochloropsis gaditana CCMP526]
 gi|422292753|gb|EKU20055.1| poly rna polymerase, partial [Nannochloropsis gaditana CCMP526]
          Length = 1647

 Score = 63.2 bits (152), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 31/77 (40%), Positives = 47/77 (61%), Gaps = 3/77 (3%)

Query: 1394 AATSVRLDISFKSPSHTGLQTTDLVKELT---EQFPASTPLALVLKQFLADRSLDQSYSG 1450
            A T V +DIS + P H G+ ++ L+++L    + +  + PL LVLK  L  R L+Q + G
Sbjct: 1265 ATTLVPVDISLEGPQHGGIASSLLIRDLVGPGKPYAHAVPLTLVLKALLMQRGLNQPWCG 1324

Query: 1451 GLSSYCLMLLITRFLQH 1467
            G+SSY LML++   LQ 
Sbjct: 1325 GISSYALMLMVITVLQQ 1341



 Score = 48.1 bits (113), Expect = 0.038,   Method: Compositional matrix adjust.
 Identities = 30/82 (36%), Positives = 39/82 (47%), Gaps = 1/82 (1%)

Query: 1206 LSLMHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGL 1265
            +  M S L E I  F K    +    +      ++ V   +Q LW  SR  ++GS  TGL
Sbjct: 986  IDAMRSQLSENIADFMKNRETKMMEHRAARERILEEVRSIIQSLWAGSRVELYGSCFTGL 1045

Query: 1266 SLPSSDVDLVVC-LPPVRNLEP 1286
             L SSD+DLVVC L P     P
Sbjct: 1046 DLISSDLDLVVCGLKPAGTASP 1067


>gi|195115910|ref|XP_002002499.1| GI12386 [Drosophila mojavensis]
 gi|193913074|gb|EDW11941.1| GI12386 [Drosophila mojavensis]
          Length = 348

 Score = 63.2 bits (152), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 33/92 (35%), Positives = 55/92 (59%), Gaps = 7/92 (7%)

Query: 1375 ALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALV 1434
             +D +A P    T  +     T +++DISF  P+  G+ +++L+K+   + P    L LV
Sbjct: 117  VVDTAAVPIIRFTDCE-----THLKVDISFNMPN--GIDSSELIKKFLHEHPVLGKLVLV 169

Query: 1435 LKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            LKQFL  R+L+ + +GG+SSY L+++   FLQ
Sbjct: 170  LKQFLEQRNLNSTLNGGISSYNLIIMCINFLQ 201



 Score = 56.2 bits (134), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 42/126 (33%), Positives = 61/126 (48%), Gaps = 30/126 (23%)

Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
            LHEEI+ F + +    T     I   + RV R +Q LWP +   IFGS   G++LP+SD+
Sbjct: 26   LHEEIEHFYRYMLPTPTEHAARIE-LLSRVERVIQGLWPEALVEIFGSFRLGINLPNSDI 84

Query: 1273 DLVV--CLP--PVRNLE-PIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
            DLVV  C    P+R+LE  ++ +GI                        V   +L+ V+ 
Sbjct: 85   DLVVLGCWEHLPLRSLESELRSSGI------------------------VLPGTLQVVDT 120

Query: 1328 TAIPII 1333
             A+PII
Sbjct: 121  AAVPII 126


>gi|301100842|ref|XP_002899510.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262103818|gb|EEY61870.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 270

 Score = 63.2 bits (152), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 31/70 (44%), Positives = 47/70 (67%), Gaps = 1/70 (1%)

Query: 1398 VRLDISF-KSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYC 1456
            V LD++   S  HTGL   DL+  L  + PA  PL L+LK  L +++L+ +++GG+SSY 
Sbjct: 78   VLLDLTCGHSVGHTGLGARDLIYSLQTEMPALRPLVLILKSHLVNKNLNCAFTGGISSYV 137

Query: 1457 LMLLITRFLQ 1466
            L++L+ RFLQ
Sbjct: 138  LVILVIRFLQ 147


>gi|119188673|ref|XP_001244943.1| hypothetical protein CIMG_04384 [Coccidioides immitis RS]
          Length = 785

 Score = 62.8 bits (151), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 74/318 (23%), Positives = 124/318 (38%), Gaps = 89/318 (27%)

Query: 1170 DHLIAISQLARDQEHPDVAFPLQPLEVQNCPT-------------RKASLSLMHSLLHEE 1216
            D ++  +  ++  E PD  F L    +Q   T             R+A+  L  +L +E 
Sbjct: 297  DEIVGPAPRSQRYEKPDTRFNLDGSIIQAYRTIAGQDSTPWFDRSRQATFHLGAALQNEI 356

Query: 1217 IDSF--CKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDL 1274
            +D +   K    E+  R   I     R  R +Q  +P S+ + FGS A+GL LP +DVDL
Sbjct: 357  VDFYHWVKPRPFEDVIRTDLIT----RFERLMQNRFPGSQLHAFGSYASGLYLPVADVDL 412

Query: 1275 VVCLPPVRNLEPIKEAGILEGRNGI--KETCLQHAARYLANQEWVKSDSLKTVENTAIPI 1332
            V+          +  + I +GR  +  K   +     Y+ + E     S++T+ +  +PI
Sbjct: 413  VL----------LSRSFIRQGRKFLCQKIKDIYSLTAYIRDTEIAVPGSIETIAHARVPI 462

Query: 1333 IMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNI 1392
            I  V  +                     T LK D       ++ D+++    + T     
Sbjct: 463  IKFVDRL---------------------TGLKVD-------LSFDNNSGLAANRT----- 489

Query: 1393 KAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGL 1452
                                      ++  E FPA   +  V+KQFL  R L++  +GGL
Sbjct: 490  -------------------------FQQWKEHFPAMPLIVSVVKQFLLLRGLNEVPTGGL 524

Query: 1453 SSYCLMLLITRFLQHEHH 1470
              + ++ L+T  LQH  H
Sbjct: 525  GGFSIICLVTSLLQHLPH 542


>gi|47229501|emb|CAF99489.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 1518

 Score = 62.8 bits (151), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 29/71 (40%), Positives = 45/71 (63%), Gaps = 2/71 (2%)

Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
            T V++DISF     TG++    +K+  + +P    L  VLKQFL  R L++ ++GG+SSY
Sbjct: 11   TEVKVDISFNV--ETGVKAASFIKDYVKMYPVLPYLIFVLKQFLLQRDLNEVFTGGISSY 68

Query: 1456 CLMLLITRFLQ 1466
             L+L++  FLQ
Sbjct: 69   SLILMVISFLQ 79


>gi|326430489|gb|EGD76059.1| hypothetical protein PTSG_00768 [Salpingoeca sp. ATCC 50818]
          Length = 425

 Score = 62.8 bits (151), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 31/72 (43%), Positives = 45/72 (62%), Gaps = 2/72 (2%)

Query: 1395 ATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSS 1454
             T++ +DISF  PS  G Q +  VK+   Q+    PL L++KQFL  R L++ +SGG+ S
Sbjct: 201  VTNLWMDISFNQPS--GPQDSINVKKWKTQYRGLVPLVLIIKQFLLQRGLNEPFSGGIGS 258

Query: 1455 YCLMLLITRFLQ 1466
            Y + LL+  FLQ
Sbjct: 259  YAVFLLVMSFLQ 270



 Score = 53.5 bits (127), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 30/79 (37%), Positives = 46/79 (58%), Gaps = 11/79 (13%)

Query: 1213 LHEEIDSFCKQV---AAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
            LH+EI  F K +     E T R+ +++    RV   +  LWP++   +FGS  TGL LP+
Sbjct: 97   LHQEIVDFHKYMEPMKEEVTLRRAFVD----RVKEVILGLWPKAEVTVFGSFNTGLYLPT 152

Query: 1270 SDVDLVV----CLPPVRNL 1284
            SD+D+VV     +PP++ L
Sbjct: 153  SDIDVVVFGDWAVPPLQTL 171


>gi|145497122|ref|XP_001434550.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124401677|emb|CAK67153.1| unnamed protein product [Paramecium tetraurelia]
          Length = 801

 Score = 62.8 bits (151), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 59/256 (23%), Positives = 101/256 (39%), Gaps = 53/256 (20%)

Query: 1216 EIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLV 1275
            +I  F  Q+  +N  + P       R+   +Q L+  +   +FGS AT L+LP SD+D+ 
Sbjct: 476  DIKEFIDQIRRDNDLQFPIRQLVFNRIQFIIQFLFKDAGVCLFGSCATRLALPDSDIDIG 535

Query: 1276 VCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIML 1335
            +       L   K   I+E               +L    W+K         T +P+I L
Sbjct: 536  ITGLETHQLNQ-KMDVIIE---------------FLYKMNWIKRIKPIYPTQTTLPLIKL 579

Query: 1336 VVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAA 1395
             V+     I     ++  P               H D+V                    +
Sbjct: 580  WVDPS---IPFRNGNMNLP---------------HIDLVC------------------QS 603

Query: 1396 TSVRLDISF-KSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSS 1454
              +++DISF     H GL +T+L     +++     + L+ K  L  R L+    GG+SS
Sbjct: 604  QLIQVDISFFGHIQHQGLTSTELTCFWLQEYQELKTITLLFKSLLKKRGLNDQSKGGISS 663

Query: 1455 YCLMLLITRFLQHEHH 1470
            +CL+L++  FL++ + 
Sbjct: 664  FCLVLIVVAFLEYYYQ 679


>gi|47208265|emb|CAF92498.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 297

 Score = 62.8 bits (151), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 30/73 (41%), Positives = 44/73 (60%), Gaps = 2/73 (2%)

Query: 1398 VRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCL 1457
            + +DISF   S  G++   L+KE  E++P    L LVLKQFL  R L++ ++GG+ SY L
Sbjct: 93   IFVDISFNMKS--GVKAAQLIKEFKEKYPVLPYLVLVLKQFLLQRDLNEVFTGGIGSYSL 150

Query: 1458 MLLITRFLQHEHH 1470
             L+   FLQ  + 
Sbjct: 151  FLMAVSFLQLHYR 163



 Score = 45.4 bits (106), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 19/39 (48%), Positives = 26/39 (66%)

Query: 1238 AVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVV 1276
             V R+   +  LWP +   +FGS +TGL LP+SD+DLVV
Sbjct: 14   VVDRIKGVIHDLWPSAEVQVFGSFSTGLYLPTSDIDLVV 52


>gi|213403316|ref|XP_002172430.1| Poly(A) RNA polymerase cid14 [Schizosaccharomyces japonicus yFS275]
 gi|212000477|gb|EEB06137.1| Poly(A) RNA polymerase cid14 [Schizosaccharomyces japonicus yFS275]
          Length = 667

 Score = 62.4 bits (150), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 29/71 (40%), Positives = 46/71 (64%), Gaps = 2/71 (2%)

Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
            T + +D+SF  P   GL+T  +V    ++FPA  PL +++K FL  R+L++ + GGLSSY
Sbjct: 342  TRIHVDVSFNQPG--GLKTCLVVNGFLKKFPAVRPLTMLVKHFLNMRALNEVFLGGLSSY 399

Query: 1456 CLMLLITRFLQ 1466
             ++ L+  FLQ
Sbjct: 400  AIVCLVVSFLQ 410



 Score = 43.9 bits (102), Expect = 0.73,   Method: Compositional matrix adjust.
 Identities = 24/68 (35%), Positives = 38/68 (55%), Gaps = 7/68 (10%)

Query: 1212 LLHEEIDSFCKQVA---AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLP 1268
            LLH++I +F   +     E+  RK  I     ++ R+++  WP     +FGS  T L LP
Sbjct: 231  LLHQDILNFINYLEPTPQEHAVRKSLIT----KLDRAIRAKWPEVTVYVFGSFETRLYLP 286

Query: 1269 SSDVDLVV 1276
            +SD+D+VV
Sbjct: 287  TSDIDMVV 294


>gi|348512463|ref|XP_003443762.1| PREDICTED: DNA polymerase sigma-like [Oreochromis niloticus]
          Length = 789

 Score = 62.4 bits (150), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 29/71 (40%), Positives = 44/71 (61%), Gaps = 2/71 (2%)

Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
            T V++DISF     T ++    +K   +++P   PL  VLKQFL  R L++ ++GG+SSY
Sbjct: 338  TRVKVDISFNI--ETAVKAAQFIKSYLKKYPVLPPLIFVLKQFLLQRELNEVFTGGISSY 395

Query: 1456 CLMLLITRFLQ 1466
             L+L+   FLQ
Sbjct: 396  SLILMAISFLQ 406



 Score = 53.1 bits (126), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 36/95 (37%), Positives = 51/95 (53%), Gaps = 11/95 (11%)

Query: 1213 LHEEI---DSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
            LHEEI    SF      E + R+  +N    R+   ++ LWP  +  IFGS +TGL LP+
Sbjct: 231  LHEEILDFFSFMSPKPEEESMRRDVVN----RIEGIIKDLWPTVQVEIFGSFSTGLYLPT 286

Query: 1270 SDVDLVVC----LPPVRNLEPIKEAGILEGRNGIK 1300
            SD+DLVV      PP++ LE   +   + G + IK
Sbjct: 287  SDIDLVVFGNWDHPPLQELEQALKKHNVSGSHPIK 321


>gi|326917206|ref|XP_003204892.1| PREDICTED: DNA polymerase sigma-like [Meleagris gallopavo]
          Length = 429

 Score = 62.0 bits (149), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 36/103 (34%), Positives = 55/103 (53%), Gaps = 7/103 (6%)

Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
            KH+      +  LD +  P    T  +     T V++DISF     TG++    +KE  +
Sbjct: 70   KHNVAEPYSIKVLDKATVPIIKLTDQE-----TEVKVDISFNV--ETGVKAARFIKEYMK 122

Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            ++     L LVLKQFL  R L++ ++GG+SSY L+L+   FLQ
Sbjct: 123  KYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAISFLQ 165



 Score = 49.3 bits (116), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 25/52 (48%), Positives = 34/52 (65%), Gaps = 4/52 (7%)

Query: 1238 AVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVC----LPPVRNLE 1285
             VKR+   ++ LWP +   IFGS +TGL LP+SD+DLVV      PP++ LE
Sbjct: 14   VVKRIETVIKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFGKWERPPLQLLE 65


>gi|61098442|ref|NP_001012968.1| DNA polymerase sigma [Gallus gallus]
 gi|53130894|emb|CAG31774.1| hypothetical protein RCJMB04_10o8 [Gallus gallus]
          Length = 429

 Score = 62.0 bits (149), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 36/103 (34%), Positives = 55/103 (53%), Gaps = 7/103 (6%)

Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
            KH+      +  LD +  P    T  +     T V++DISF     TG++    +KE  +
Sbjct: 70   KHNVAEPYSIKVLDKATVPIIKLTDQE-----TEVKVDISFNV--ETGVKAARFIKEYMK 122

Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            ++     L LVLKQFL  R L++ ++GG+SSY L+L+   FLQ
Sbjct: 123  KYTLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAISFLQ 165



 Score = 49.3 bits (116), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 25/52 (48%), Positives = 34/52 (65%), Gaps = 4/52 (7%)

Query: 1238 AVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVC----LPPVRNLE 1285
             VKR+   ++ LWP +   IFGS +TGL LP+SD+DLVV      PP++ LE
Sbjct: 14   VVKRIETVIKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFGKWERPPLQLLE 65


>gi|327270606|ref|XP_003220080.1| PREDICTED: DNA polymerase sigma-like [Anolis carolinensis]
          Length = 686

 Score = 62.0 bits (149), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 36/103 (34%), Positives = 56/103 (54%), Gaps = 7/103 (6%)

Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
            KH+      +  LD +  P    T  +     T V++DISF     TG++   L+K+  +
Sbjct: 271  KHNVAEPYSIKVLDKATVPIIKLTDQE-----TEVKVDISFNV--ETGVKAARLIKDYMK 323

Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            ++     L LVLKQFL  R L++ ++GG+SSY L+L+   FLQ
Sbjct: 324  KYTLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAISFLQ 366



 Score = 52.0 bits (123), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 34/80 (42%), Positives = 45/80 (56%), Gaps = 11/80 (13%)

Query: 1213 LHEEIDSFCKQVA---AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
            LHEEI  F   ++    E   RK      VKR+   ++ LWP +   IFGS +TGL LP+
Sbjct: 191  LHEEIMDFYDFMSPRPEEAVMRKE----VVKRIETVIKDLWPTADVQIFGSFSTGLYLPT 246

Query: 1270 SDVDLVVC----LPPVRNLE 1285
            SD+DLVV      PP++ LE
Sbjct: 247  SDIDLVVFGKWESPPLQLLE 266


>gi|405972624|gb|EKC37384.1| PAP-associated domain-containing protein 5 [Crassostrea gigas]
          Length = 672

 Score = 62.0 bits (149), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 28/71 (39%), Positives = 45/71 (63%), Gaps = 2/71 (2%)

Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
            T V++DISF +     +++ +L+ +  E+FP    L LVLKQFL  R L++ ++GG+SSY
Sbjct: 224  TEVKVDISFNT--QNSVESANLITKYMEEFPNLKYLVLVLKQFLLQRDLNEVFTGGISSY 281

Query: 1456 CLMLLITRFLQ 1466
             L+ +   F Q
Sbjct: 282  SLIYMTVSFFQ 292



 Score = 58.5 bits (140), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 46/152 (30%), Positives = 70/152 (46%), Gaps = 39/152 (25%)

Query: 1213 LHEEIDSFCKQV---AAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
            LHEEI  F + +   A E   R    N  V+R+   ++ LWP ++  IFGS  TGL LP+
Sbjct: 117  LHEEIKDFYEYMSPKAEEANMR----NEVVRRIKDVVEDLWPEAKVEIFGSFKTGLYLPT 172

Query: 1270 SDVDLVV-----CLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKT 1324
            SD+DLVV      LP     + ++E G                        +    S+K 
Sbjct: 173  SDIDLVVYGKWDSLPLFTLEKALRERG------------------------YADPASVKV 208

Query: 1325 VENTAIPIIMLV---VEVPHDLIASAASSVQS 1353
            ++  ++PII +V    EV  D+  +  +SV+S
Sbjct: 209  LDKASVPIIKMVDLQTEVKVDISFNTQNSVES 240


>gi|449493801|ref|XP_002190556.2| PREDICTED: DNA polymerase sigma [Taeniopygia guttata]
          Length = 429

 Score = 62.0 bits (149), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 36/103 (34%), Positives = 55/103 (53%), Gaps = 7/103 (6%)

Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
            KH+      +  LD +  P    T  +     T V++DISF     TG++    +KE  +
Sbjct: 70   KHNVAEPYSIKVLDKATVPIIKLTDQE-----TEVKVDISFNV--ETGVKAARFIKEYMK 122

Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            ++     L LVLKQFL  R L++ ++GG+SSY L+L+   FLQ
Sbjct: 123  KYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAISFLQ 165



 Score = 49.3 bits (116), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 25/52 (48%), Positives = 34/52 (65%), Gaps = 4/52 (7%)

Query: 1238 AVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVC----LPPVRNLE 1285
             VKR+   ++ LWP +   IFGS +TGL LP+SD+DLVV      PP++ LE
Sbjct: 14   VVKRIETVIKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFGKWERPPLQLLE 65


>gi|387196341|gb|AFJ68755.1| DNA polymerase sigma subunit, partial [Nannochloropsis gaditana
            CCMP526]
          Length = 419

 Score = 62.0 bits (149), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 32/71 (45%), Positives = 45/71 (63%), Gaps = 2/71 (2%)

Query: 1397 SVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYC 1456
            ++++DISF S S  GL++   + E  E  P   PL LVLK FLA R L+Q+Y GG  S+ 
Sbjct: 177  NLQVDISFSSDS--GLKSARYMLEKMEAMPPLRPLILVLKYFLAQRELNQTYMGGCGSFL 234

Query: 1457 LMLLITRFLQH 1467
            L L++  +LQH
Sbjct: 235  LQLMVIAYLQH 245



 Score = 45.8 bits (107), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 30/74 (40%), Positives = 42/74 (56%), Gaps = 16/74 (21%)

Query: 1214 HEEIDSFCKQVA---AENTARKPYINWAVKRVTR----SLQVLWPRSRTNIFGSNATGLS 1266
            H E+  FC  +A   AE  AR+        +VTR    +++ LWP    ++FGS AT + 
Sbjct: 68   HNELLDFCDLLAPTRAELEARQ--------KVTRISADTVKKLWPSFDVHVFGSEATKVF 119

Query: 1267 LPSSDVDLVVCLPP 1280
            LP SD+D+VV LPP
Sbjct: 120  LPDSDIDMVV-LPP 132


>gi|261192276|ref|XP_002622545.1| topoisomerase TRF4 [Ajellomyces dermatitidis SLH14081]
 gi|239589420|gb|EEQ72063.1| topoisomerase TRF4 [Ajellomyces dermatitidis SLH14081]
          Length = 681

 Score = 61.6 bits (148), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 74/290 (25%), Positives = 116/290 (40%), Gaps = 81/290 (27%)

Query: 1192 QPLEVQNCPTRKASL--SLMH--SLLHEEIDSF---CKQVAAENTARKPYINWAVKRVTR 1244
            +P E QN      SL  S++H  S LH EI SF    K    EN+ R    N  + R+ R
Sbjct: 227  KPFEDQNTTPWIDSLGPSIVHLGSRLHNEILSFYHWVKPKPFENSIR----NDLIARLQR 282

Query: 1245 SLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGI--KET 1302
              +     S+   FGS A+GL LP++D+DLV+          +    I + R  +  +  
Sbjct: 283  HFERRHYSSQLRAFGSFASGLYLPTADMDLVL----------LSRQFIRQDRKVLCQRPR 332

Query: 1303 CLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTT 1362
             +   A YL + +     S++T+ +  +PII  V  +                     T 
Sbjct: 333  EIYSFAAYLKDLDIAVPGSVETIAHAKVPIIKFVDRL---------------------TG 371

Query: 1363 LKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELT 1422
            LK D       ++ D+S                              TG+   +  +   
Sbjct: 372  LKVD-------LSFDNS------------------------------TGIAANETFQVWK 394

Query: 1423 EQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQHEHHLG 1472
             QFPA   +  V+KQFL  R L++  +GGL  + ++ L+T  LQH  + G
Sbjct: 395  SQFPAMPIIVSVIKQFLLLRGLNEVPTGGLGGFSIICLVTSLLQHLPYSG 444


>gi|440796505|gb|ELR17614.1| nucleotidyltransferase domain containing protein [Acanthamoeba
            castellanii str. Neff]
          Length = 911

 Score = 61.6 bits (148), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 29/75 (38%), Positives = 47/75 (62%), Gaps = 2/75 (2%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            + AAT   +DISF  P  TG    +L+K   ++ P+  PLAL++K +L    +++ Y+GG
Sbjct: 363  VDAATGCLVDISFDVP--TGPAHINLIKRYLDEEPSVKPLALLIKYYLKQFGMNEPYTGG 420

Query: 1452 LSSYCLMLLITRFLQ 1466
            L SY L+++I  +LQ
Sbjct: 421  LGSYALIIMIISYLQ 435



 Score = 53.1 bits (126), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 31/80 (38%), Positives = 45/80 (56%), Gaps = 7/80 (8%)

Query: 1213 LHEEIDSFCKQV---AAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
            LH+EI SFCK V   A E   R+  I     R+++ ++ LWP  +  +FGS AT + LP+
Sbjct: 261  LHDEIISFCKWVSPSAEEKQMREDVI----ARISKVVETLWPSVQLRVFGSCATDIYLPT 316

Query: 1270 SDVDLVVCLPPVRNLEPIKE 1289
            SD+DL +      +  PI E
Sbjct: 317  SDIDLCIMGANACSPSPIDE 336


>gi|320164013|gb|EFW40912.1| PAP associated domain containing 5 [Capsaspora owczarzaki ATCC 30864]
          Length = 558

 Score = 61.6 bits (148), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 31/74 (41%), Positives = 46/74 (62%), Gaps = 2/74 (2%)

Query: 1394 AATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLS 1453
            A + V +DISF   S  GL+T + ++   +++     L  V+KQFLA R L++ YSGGL 
Sbjct: 315  ALSGVFVDISFNLES--GLRTVEFIRACVDEYRMLYHLTFVIKQFLAQRQLNEPYSGGLG 372

Query: 1454 SYCLMLLITRFLQH 1467
            SY ++LL+  FLQ 
Sbjct: 373  SYSVVLLVVSFLQR 386



 Score = 45.1 bits (105), Expect = 0.36,   Method: Compositional matrix adjust.
 Identities = 26/70 (37%), Positives = 37/70 (52%), Gaps = 6/70 (8%)

Query: 1209 MHSLLHEEID--SFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLS 1266
            ++SL  E  D   F K    E+  R+      V+R+   +   W  +R  +FGS ATGL 
Sbjct: 209  LYSLEQEMYDFVEFIKPTPLEHQMREE----IVQRIREVITGAWKHARVEVFGSFATGLY 264

Query: 1267 LPSSDVDLVV 1276
            LP SD+D+VV
Sbjct: 265  LPMSDIDIVV 274


>gi|296423712|ref|XP_002841397.1| hypothetical protein [Tuber melanosporum Mel28]
 gi|295637635|emb|CAZ85588.1| unnamed protein product [Tuber melanosporum]
          Length = 802

 Score = 61.6 bits (148), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 68/259 (26%), Positives = 98/259 (37%), Gaps = 78/259 (30%)

Query: 1213 LHEEIDSFCKQV---AAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
            LH EI  F   +   A E+  R+  ++    RV   +  LWP +   +FGS A  L LP+
Sbjct: 391  LHNEILDFIAYIQPRAYEHAVRRELVH----RVRVVVTKLWPDTDLRVFGSFAAELYLPT 446

Query: 1270 SDVDLVVCLPPVRNLEPIKEAGI--LEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
            SD+DLVV            + GI     RN + +  L+ A   + +    K  SL  + N
Sbjct: 447  SDIDLVVI------STSFAKIGIPRYSERNALYK--LRQA---ILSSNTAKPGSLAVIAN 495

Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
              +PII  V                             + ++H D               
Sbjct: 496  AKVPIIKFV---------------------------DRETNIHVD--------------- 513

Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
                          ISF++ S  GL   +       Q+PA   L  V+KQFLA R L++ 
Sbjct: 514  --------------ISFENSS--GLVANETFIRWKAQYPAMPILLTVIKQFLAMRGLNEV 557

Query: 1448 YSGGLSSYCLMLLITRFLQ 1466
            Y GGL S+ +  LI    Q
Sbjct: 558  YLGGLGSFSVTCLIVSMFQ 576


>gi|327349744|gb|EGE78601.1| topoisomerase TRF4 [Ajellomyces dermatitidis ATCC 18188]
          Length = 681

 Score = 61.6 bits (148), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 74/290 (25%), Positives = 116/290 (40%), Gaps = 81/290 (27%)

Query: 1192 QPLEVQNCPTRKASL--SLMH--SLLHEEIDSF---CKQVAAENTARKPYINWAVKRVTR 1244
            +P E QN      SL  S++H  S LH EI SF    K    EN+ R    N  + R+ R
Sbjct: 227  KPFEDQNTTPWIDSLGPSIVHLGSRLHNEILSFYHWVKPKPFENSIR----NDLIARLQR 282

Query: 1245 SLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGI--KET 1302
              +     S+   FGS A+GL LP++D+DLV+          +    I + R  +  +  
Sbjct: 283  HFERRHYGSQLRAFGSFASGLYLPTADMDLVL----------LSRQFIRQDRKVLCQRPR 332

Query: 1303 CLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTT 1362
             +   A YL + +     S++T+ +  +PII  V  +                     T 
Sbjct: 333  EIYSFAAYLKDLDIAVPGSVETIAHAKVPIIKFVDRL---------------------TG 371

Query: 1363 LKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELT 1422
            LK D       ++ D+S                              TG+   +  +   
Sbjct: 372  LKVD-------LSFDNS------------------------------TGIAANETFQVWK 394

Query: 1423 EQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQHEHHLG 1472
             QFPA   +  V+KQFL  R L++  +GGL  + ++ L+T  LQH  + G
Sbjct: 395  SQFPAMPIIVSVIKQFLLLRGLNEVPTGGLGGFSIICLVTSLLQHLPYSG 444


>gi|74003095|ref|XP_545180.2| PREDICTED: DNA polymerase sigma [Canis lupus familiaris]
          Length = 672

 Score = 61.6 bits (148), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 36/103 (34%), Positives = 56/103 (54%), Gaps = 7/103 (6%)

Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
            KH+      +  LD +  P    T  +     T V++DISF     TG++  +L+K   +
Sbjct: 200  KHNVAEPCSIKVLDKATVPIIKLTDQE-----TEVKVDISFNM--ETGVRAAELIKNYMK 252

Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            ++     L LVLKQFL  R L++ ++GG+SSY L+L+   FLQ
Sbjct: 253  KYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAISFLQ 295



 Score = 52.0 bits (123), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 42/133 (31%), Positives = 59/133 (44%), Gaps = 40/133 (30%)

Query: 1213 LHEEIDSF------CKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLS 1266
            LHEEI  F      C + AA            VKR+   ++ LWP +   IFGS +TGL 
Sbjct: 120  LHEEIIDFYNFMSPCPEEAAMRRE-------VVKRIETVVKDLWPAADVQIFGSFSTGLY 172

Query: 1267 LPSSDVDLVVC----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSL 1322
            LP+SD+DLVV      PP++ LE          ++ + E C                 S+
Sbjct: 173  LPTSDIDLVVFGKWERPPLQLLEQALR------KHNVAEPC-----------------SI 209

Query: 1323 KTVENTAIPIIML 1335
            K ++   +PII L
Sbjct: 210  KVLDKATVPIIKL 222


>gi|322697398|gb|EFY89178.1| topoisomerase family protein TRF4 [Metarhizium acridum CQMa 102]
          Length = 702

 Score = 61.6 bits (148), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 69/291 (23%), Positives = 113/291 (38%), Gaps = 84/291 (28%)

Query: 1199 CPTRKASLSLMHSL---LHEEIDSFCKQV---AAENTARKPYINWAVKRVTRSLQVLWPR 1252
            CP +    S  H++   LH+EI  F   V   A E   R   +    K + R  +  +  
Sbjct: 352  CPWQTKDHSATHNMAFRLHKEILDFYDYVRPRAFEQRMRDNLVENLCKAMRRDRRN-FAS 410

Query: 1253 SRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLA 1312
            ++ + FGS  +GL LP++D+DLVVC        P    G         ++ L    ++L 
Sbjct: 411  AQVHPFGSFMSGLYLPTADMDLVVCSASYMRGGPPTYLGA--------KSWLYKFQKFLV 462

Query: 1313 NQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSD 1372
             Q+   SDS++ + +  IP++  V ++                     T LK D      
Sbjct: 463  MQKVADSDSIEVIAHARIPLVKFVDKL---------------------TGLKVD------ 495

Query: 1373 MVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLA 1432
                           S +N+   T+V   +S+K                 +Q+PA   L 
Sbjct: 496  --------------VSFENLGGVTAVDTFLSWK-----------------KQYPAMPILV 524

Query: 1433 LVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQH-----------EHHLG 1472
             V+K FL  R L++  +GG+  + ++ L+   LQ            EHHLG
Sbjct: 525  TVIKHFLLMRGLNEPVNGGIGGFSVICLVVSMLQMMPHVQSRSLIPEHHLG 575


>gi|410949807|ref|XP_003981609.1| PREDICTED: DNA polymerase sigma [Felis catus]
          Length = 542

 Score = 61.2 bits (147), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 36/103 (34%), Positives = 56/103 (54%), Gaps = 7/103 (6%)

Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
            KH+      +  LD +  P    T  +     T V++DISF     TG++  +L+K   +
Sbjct: 70   KHNVAEPCSIKVLDKATVPIIKLTDQE-----TEVKVDISFNM--ETGVRAAELIKNYMK 122

Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            ++     L LVLKQFL  R L++ ++GG+SSY L+L+   FLQ
Sbjct: 123  KYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAISFLQ 165



 Score = 48.9 bits (115), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 33/102 (32%), Positives = 49/102 (48%), Gaps = 27/102 (26%)

Query: 1238 AVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVC----LPPVRNLEPIKEAGIL 1293
             VKR+   ++ LWP +   IFGS +TGL LP+SD+DLVV      PP++ LE        
Sbjct: 14   VVKRIETVVKDLWPAADVQIFGSFSTGLYLPTSDIDLVVFGKWERPPLQLLEQALR---- 69

Query: 1294 EGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIML 1335
              ++ + E C                 S+K ++   +PII L
Sbjct: 70   --KHNVAEPC-----------------SIKVLDKATVPIIKL 92


>gi|403374868|gb|EJY87397.1| Nucleotidyltransferase domain containing protein [Oxytricha
            trifallax]
          Length = 2589

 Score = 61.2 bits (147), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 58/228 (25%), Positives = 99/228 (43%), Gaps = 54/228 (23%)

Query: 1257 IFGSNATGLSLPSSDVDLVVCLPPVRN-LEPIKEAGILEGRNGIKETCLQHAARYLANQE 1315
            +FGS  TGL+L SSD+DL V    + +    I E   L G                A Q+
Sbjct: 2075 LFGSLVTGLALESSDMDLAVTGLYIEDRYAVICELNKLSG----------------ALQD 2118

Query: 1316 WVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDM-- 1373
            W   +S K++E  +IP+I + V    DL       ++              NH  SD   
Sbjct: 2119 WDILESFKSIETASIPVIKMKV----DLYQLREKEMK--------------NHGFSDFDL 2160

Query: 1374 -------------VALDDSASPKCSHTSSDNIKAATSV-RLDISFKSPSHTGLQTTDLVK 1419
                         +  DDS+  +  +  + NI     + +L+I+    +H GLQ+  LV+
Sbjct: 2161 RPIPEELRYFSVDITFDDSSCSQMMNQVTSNIFQQDLISQLNIT---KTHLGLQSCSLVQ 2217

Query: 1420 ELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQH 1467
                 +     ++++LK+FLA  + +  Y GG+SSY  ++L+  ++ +
Sbjct: 2218 FFIANYQNLKEVSILLKRFLALNNFNSPYYGGISSYSAVILLVAYMNY 2265


>gi|448097882|ref|XP_004198786.1| Piso0_002176 [Millerozyma farinosa CBS 7064]
 gi|359380208|emb|CCE82449.1| Piso0_002176 [Millerozyma farinosa CBS 7064]
          Length = 650

 Score = 61.2 bits (147), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 55/231 (23%), Positives = 88/231 (38%), Gaps = 75/231 (32%)

Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
            N  VK + R +  LWP +  ++FGS+AT L LP SD+D+VV              G  E 
Sbjct: 211  NRVVKDLKREINNLWPDTEAHVFGSSATDLYLPGSDIDMVVT----------SNTGDYEN 260

Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
            R     + L   + YL N++  K   ++ +    +PI+  V                 P 
Sbjct: 261  R-----SKLYQLSSYLRNRKLAK--DIEVIAKAKVPIVKFV----------------DPS 297

Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
                       +++H D+           S    + I+AA  +R                
Sbjct: 298  -----------SNIHIDI-----------SFERRNGIEAAKRIR---------------- 319

Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
                   ++ P    L L++KQFL  R L+  + GGL  Y  ++L   FL+
Sbjct: 320  ----RWLDRTPGLRELVLIVKQFLRSRRLNNVHVGGLGGYSTIILCYHFLR 366


>gi|281354437|gb|EFB30021.1| hypothetical protein PANDA_012597 [Ailuropoda melanoleuca]
          Length = 542

 Score = 61.2 bits (147), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 36/103 (34%), Positives = 56/103 (54%), Gaps = 7/103 (6%)

Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
            KH+      +  LD +  P    T  +     T V++DISF     TG++  +L+K   +
Sbjct: 70   KHNVAEPCSIKVLDKATVPIIKLTDQE-----TEVKVDISFNM--ETGVRAAELIKNYMK 122

Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            ++     L LVLKQFL  R L++ ++GG+SSY L+L+   FLQ
Sbjct: 123  KYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAISFLQ 165



 Score = 48.9 bits (115), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 33/102 (32%), Positives = 49/102 (48%), Gaps = 27/102 (26%)

Query: 1238 AVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVC----LPPVRNLEPIKEAGIL 1293
             VKR+   ++ LWP +   IFGS +TGL LP+SD+DLVV      PP++ LE        
Sbjct: 14   VVKRIETVVKDLWPAADVQIFGSFSTGLYLPTSDIDLVVFGKWERPPLQLLEQALR---- 69

Query: 1294 EGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIML 1335
              ++ + E C                 S+K ++   +PII L
Sbjct: 70   --KHNVAEPC-----------------SIKVLDKATVPIIKL 92


>gi|443713391|gb|ELU06261.1| hypothetical protein CAPTEDRAFT_115913 [Capitella teleta]
          Length = 431

 Score = 61.2 bits (147), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 34/92 (36%), Positives = 51/92 (55%), Gaps = 7/92 (7%)

Query: 1375 ALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALV 1434
             LD ++ P    T        T V++DISF    + G+++ +L+ +     P    L LV
Sbjct: 175  VLDKASVPIVKLTDK-----KTEVKVDISFNM--NNGVRSANLIMDYMRDMPTLPYLVLV 227

Query: 1435 LKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            LKQFL  R L++ ++GG+SSY L+LL   FLQ
Sbjct: 228  LKQFLLQRDLNEVFTGGISSYSLILLTVSFLQ 259



 Score = 48.5 bits (114), Expect = 0.028,   Method: Compositional matrix adjust.
 Identities = 27/67 (40%), Positives = 41/67 (61%), Gaps = 7/67 (10%)

Query: 1213 LHEEIDSFCKQVA---AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
            LH+EI  F + ++    E+  R+      V+R++  +Q LWP ++  IFGS  T L LP+
Sbjct: 84   LHQEIMDFYEYMSPQPEEHYMRED----VVQRISAVIQGLWPCAKVEIFGSFRTHLYLPT 139

Query: 1270 SDVDLVV 1276
            SD+DLVV
Sbjct: 140  SDIDLVV 146


>gi|194745606|ref|XP_001955278.1| GF16314 [Drosophila ananassae]
 gi|190628315|gb|EDV43839.1| GF16314 [Drosophila ananassae]
          Length = 394

 Score = 61.2 bits (147), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 32/103 (31%), Positives = 57/103 (55%), Gaps = 7/103 (6%)

Query: 1375 ALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALV 1434
             LD ++ P    T +      + ++ D++F + + +G+Q  +L++E   Q+P    L +V
Sbjct: 126  VLDKASMPVVKFTET-----ISQIKFDVNFNAAA-SGVQAAELIREFIRQYPDLPKLVMV 179

Query: 1435 LKQFLADRSLDQSYS-GGLSSYCLMLLITRFLQHEHHLGRPIN 1476
            LKQFL  + L++ YS GG+SSY L L+   FL+ +    +  N
Sbjct: 180  LKQFLTIQGLNEVYSTGGVSSYALTLMCISFLKQQARCKKKYN 222



 Score = 49.3 bits (116), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 40/149 (26%), Positives = 69/149 (46%), Gaps = 28/149 (18%)

Query: 1212 LLHEEID---SFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLP 1268
            LLH+EI+   +F +    E   R      AV+R+   +  +WP S  ++FGS  TGL+LP
Sbjct: 32   LLHQEIEKFYNFMQSTPIEFLLRAE----AVRRIEDVVLAIWPSSCVDVFGSFRTGLNLP 87

Query: 1269 SSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENT 1328
             SD+D+VV                    N   +  L    R L  +   + DS+  ++  
Sbjct: 88   VSDIDMVV-----------------YHANHWHKAPLHELQRELIARGVAEPDSISVLDKA 130

Query: 1329 AIPIIML---VVEVPHDL-IASAASSVQS 1353
            ++P++     + ++  D+   +AAS VQ+
Sbjct: 131  SMPVVKFTETISQIKFDVNFNAAASGVQA 159


>gi|145488179|ref|XP_001430094.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124397189|emb|CAK62696.1| unnamed protein product [Paramecium tetraurelia]
          Length = 986

 Score = 61.2 bits (147), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 53/215 (24%), Positives = 94/215 (43%), Gaps = 49/215 (22%)

Query: 1257 IFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEW 1316
            IFGS+ TGL+L  SDVD+VV   PV       +A + +    + E          +  +W
Sbjct: 681  IFGSSRTGLALHDSDVDMVVFGLPV-----FTKAQLFDPMRKLLEV--------FSQMKW 727

Query: 1317 VKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVAL 1376
              + S K +    IP+I + ++     +    + +        +  + H N         
Sbjct: 728  --AISYKHIFQATIPLIKITIDPSTGFLEFVGNPL--------YYVMNHRN--------- 768

Query: 1377 DDSASPKCSHTSSDNIKAATSVRLDISFKSPS----------HTGLQTTDLVKELTEQFP 1426
             D  + K    S +       +++DI+F+             H GL +TD  +++     
Sbjct: 769  IDQFNLKYGELSQN-------IQIDITFELQQPYPMYGYQGYHVGLLSTDYQRQVGNSVK 821

Query: 1427 ASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLI 1461
              T +A+VLK+ L  R L+ SY+GG+SS+CL +++
Sbjct: 822  GFTEVAIVLKKLLKHRGLNDSYTGGVSSFCLTVML 856


>gi|388580693|gb|EIM21006.1| Nucleotidyltransferase, partial [Wallemia sebi CBS 633.66]
          Length = 360

 Score = 61.2 bits (147), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 41/128 (32%), Positives = 67/128 (52%), Gaps = 11/128 (8%)

Query: 1342 DLIASAASSVQSPKEDAAHTT---LKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSV 1398
            DL+    S  Q  K+   HT    L+  N   S  V +  +  P    TSS         
Sbjct: 134  DLVVMRKSVNQYNKQSMLHTMASMLRQANLAQSIQV-ISKARVPIIKFTSS-----FGGY 187

Query: 1399 RLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLM 1458
             +DIS    +  G+    +V E+ +++PA+ PL+++LK FL+ RS+++ Y+GG+SSY ++
Sbjct: 188  PIDISLNQTN--GVDAGRMVNEILDRYPAARPLSMLLKCFLSQRSMNEVYTGGVSSYSVI 245

Query: 1459 LLITRFLQ 1466
             L+  FLQ
Sbjct: 246  CLVVSFLQ 253



 Score = 40.8 bits (94), Expect = 5.3,   Method: Compositional matrix adjust.
 Identities = 23/70 (32%), Positives = 35/70 (50%), Gaps = 1/70 (1%)

Query: 1212 LLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSD 1271
            LLH E+++F K ++   T  K    + ++ + R +   W  +    FGS  T L LP  D
Sbjct: 74   LLHREVNAFTKFISPSLTEHKTR-EYTIECIRRCITSRWADAEVFAFGSFETRLYLPDGD 132

Query: 1272 VDLVVCLPPV 1281
            +DLVV    V
Sbjct: 133  IDLVVMRKSV 142


>gi|239615134|gb|EEQ92121.1| topoisomerase TRF4 [Ajellomyces dermatitidis ER-3]
          Length = 681

 Score = 61.2 bits (147), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 74/290 (25%), Positives = 116/290 (40%), Gaps = 81/290 (27%)

Query: 1192 QPLEVQNCPTRKASL--SLMH--SLLHEEIDSF---CKQVAAENTARKPYINWAVKRVTR 1244
            +P E QN      SL  S++H  S LH EI SF    K    EN+ R    N  + R+ R
Sbjct: 227  KPFEDQNTTPWIDSLGPSIVHLGSRLHNEILSFYHWVKPKPFENSIR----NDLIARLQR 282

Query: 1245 SLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGI--KET 1302
              +     S+   FGS A+GL LP++D+DLV+          +    I + R  +  +  
Sbjct: 283  HFERRHYGSQLRAFGSFASGLYLPTADMDLVL----------LSRQFIRQDRKVLCQRPR 332

Query: 1303 CLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTT 1362
             +   A YL + +     S++T+ +  +PII  V  +                     T 
Sbjct: 333  EIYSFAAYLKDLDIAVPGSVETIAHAKVPIIKFVDRL---------------------TG 371

Query: 1363 LKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELT 1422
            LK D       ++ D+S                              TG+   +  +   
Sbjct: 372  LKVD-------LSFDNS------------------------------TGIAANETFQVWK 394

Query: 1423 EQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQHEHHLG 1472
             QFPA   +  V+KQFL  R L++  +GGL  + ++ L+T  LQH  + G
Sbjct: 395  SQFPAMPIIVSVIKQFLLLRGLNEVPTGGLGGFSIICLVTSLLQHLPYSG 444


>gi|395510809|ref|XP_003759662.1| PREDICTED: DNA polymerase sigma [Sarcophilus harrisii]
          Length = 594

 Score = 61.2 bits (147), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 36/103 (34%), Positives = 56/103 (54%), Gaps = 7/103 (6%)

Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
            KH+      +  LD +  P    T  +     T V++DISF     TG++   L+K+  +
Sbjct: 122  KHNVAEPCSIKVLDKATVPIIKLTDQE-----TEVKVDISFNM--ETGVKAARLIKDYMK 174

Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            ++     L LVLKQFL  R L++ ++GG+SSY L+L+   FLQ
Sbjct: 175  KYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAISFLQ 217



 Score = 52.8 bits (125), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 46/156 (29%), Positives = 69/156 (44%), Gaps = 43/156 (27%)

Query: 1213 LHEEIDSF------CKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLS 1266
            LHEEI  F      C + AA            VKR+   ++ LWP +   IFGS +TGL 
Sbjct: 42   LHEEIIDFYNFMSPCPEEAAMRRE-------VVKRIETVIKDLWPTADVQIFGSFSTGLY 94

Query: 1267 LPSSDVDLVVC----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSL 1322
            LP+SD+DLVV      PP++ LE          ++ + E C                 S+
Sbjct: 95   LPTSDIDLVVFGKWERPPLQLLEQALR------KHNVAEPC-----------------SI 131

Query: 1323 KTVENTAIPIIMLV---VEVPHDLIASAASSVQSPK 1355
            K ++   +PII L     EV  D+  +  + V++ +
Sbjct: 132  KVLDKATVPIIKLTDQETEVKVDISFNMETGVKAAR 167


>gi|301776096|ref|XP_002923468.1| PREDICTED: DNA polymerase sigma-like [Ailuropoda melanoleuca]
          Length = 656

 Score = 61.2 bits (147), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 36/103 (34%), Positives = 56/103 (54%), Gaps = 7/103 (6%)

Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
            KH+      +  LD +  P    T  +     T V++DISF     TG++  +L+K   +
Sbjct: 184  KHNVAEPCSIKVLDKATVPIIKLTDQE-----TEVKVDISFNM--ETGVRAAELIKNYMK 236

Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            ++     L LVLKQFL  R L++ ++GG+SSY L+L+   FLQ
Sbjct: 237  KYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAISFLQ 279



 Score = 52.0 bits (123), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 42/133 (31%), Positives = 59/133 (44%), Gaps = 40/133 (30%)

Query: 1213 LHEEIDSF------CKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLS 1266
            LHEEI  F      C + AA            VKR+   ++ LWP +   IFGS +TGL 
Sbjct: 104  LHEEIIDFYNFMSPCPEEAAMRRE-------VVKRIETVVKDLWPAADVQIFGSFSTGLY 156

Query: 1267 LPSSDVDLVVC----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSL 1322
            LP+SD+DLVV      PP++ LE          ++ + E C                 S+
Sbjct: 157  LPTSDIDLVVFGKWERPPLQLLEQALR------KHNVAEPC-----------------SI 193

Query: 1323 KTVENTAIPIIML 1335
            K ++   +PII L
Sbjct: 194  KVLDKATVPIIKL 206


>gi|334325414|ref|XP_001371537.2| PREDICTED: DNA polymerase sigma-like [Monodelphis domestica]
          Length = 860

 Score = 60.8 bits (146), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 36/103 (34%), Positives = 56/103 (54%), Gaps = 7/103 (6%)

Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
            KH+      +  LD +  P    T  +     T V++DISF     TG++   L+K+  +
Sbjct: 388  KHNVAEPCSIKVLDKATVPIIKLTDQE-----TEVKVDISFNM--ETGVKAARLIKDYMK 440

Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            ++     L LVLKQFL  R L++ ++GG+SSY L+L+   FLQ
Sbjct: 441  KYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAISFLQ 483



 Score = 52.4 bits (124), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 46/156 (29%), Positives = 69/156 (44%), Gaps = 43/156 (27%)

Query: 1213 LHEEIDSF------CKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLS 1266
            LHEEI  F      C + AA            VKR+   ++ LWP +   IFGS +TGL 
Sbjct: 308  LHEEIIDFYNFMSPCPEEAAMRRE-------VVKRIETVIKDLWPTADVQIFGSFSTGLY 360

Query: 1267 LPSSDVDLVVC----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSL 1322
            LP+SD+DLVV      PP++ LE          ++ + E C                 S+
Sbjct: 361  LPTSDIDLVVFGKWERPPLQLLEQALR------KHNVAEPC-----------------SI 397

Query: 1323 KTVENTAIPIIMLV---VEVPHDLIASAASSVQSPK 1355
            K ++   +PII L     EV  D+  +  + V++ +
Sbjct: 398  KVLDKATVPIIKLTDQETEVKVDISFNMETGVKAAR 433


>gi|145494404|ref|XP_001433196.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124400313|emb|CAK65799.1| unnamed protein product [Paramecium tetraurelia]
          Length = 940

 Score = 60.8 bits (146), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 59/241 (24%), Positives = 103/241 (42%), Gaps = 60/241 (24%)

Query: 1235 INWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVC---LPPVRNLEPIKEAG 1291
            I++ + ++ R        S   +FGS ATGL+LP SD+D+ +    L P   L       
Sbjct: 618  ISFVINQLFREF-----NSNVRLFGSCATGLALPESDIDIGITGFELFPSTQL------- 665

Query: 1292 ILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSV 1351
                 NG     +Q    +L   +WVK+  ++ +  + +P+I L V+     +       
Sbjct: 666  -----NG----PIQKIIDFLQKMKWVKN--IRAITTSNMPLIKLQVDPTISFV------- 707

Query: 1352 QSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISF------K 1405
                 D++H  +      + D++   D   P  SH  S          +D+SF      K
Sbjct: 708  -----DSSHVLVLP----YIDLIPNSDEEIP--SHLFS----------VDVSFFQYQGAK 746

Query: 1406 SPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFL 1465
               H G  +T+L  +    +    P+ L+ K  L  R L+  Y GG+SS+C++ ++  FL
Sbjct: 747  QNWHLGQISTELTLQWLSFYNELRPIVLLFKSLLKKRGLNDQYKGGISSFCIIQMVLAFL 806

Query: 1466 Q 1466
            +
Sbjct: 807  E 807


>gi|366992111|ref|XP_003675821.1| hypothetical protein NCAS_0C04670 [Naumovozyma castellii CBS 4309]
 gi|342301686|emb|CCC69457.1| hypothetical protein NCAS_0C04670 [Naumovozyma castellii CBS 4309]
          Length = 586

 Score = 60.8 bits (146), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 53/231 (22%), Positives = 93/231 (40%), Gaps = 75/231 (32%)

Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
            N  + +V  +++ LWP +  ++FGS AT L LP SD+D V+            +AG  E 
Sbjct: 199  NQTISKVRNAVKQLWPDADLHVFGSYATDLYLPGSDIDCVIN----------SKAGDKEN 248

Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
            RN      L   A +L  Q    +  ++ +  T +PII  V                   
Sbjct: 249  RN-----SLYSLASFLKQQGL--ATQIEVIAKTRVPIIKFV------------------- 282

Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
                    + ++++H D+           S   ++ ++AA                    
Sbjct: 283  --------EPESNIHIDV-----------SFERTNGLEAAK------------------- 304

Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
             L++E  +  P    L L++KQFL  R L+  ++GGL  + ++ ++  FLQ
Sbjct: 305  -LIREWLQDTPGLRELVLIIKQFLHSRRLNNVHTGGLGGFSIICIVFSFLQ 354


>gi|440905114|gb|ELR55540.1| DNA polymerase sigma [Bos grunniens mutus]
          Length = 542

 Score = 60.5 bits (145), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 35/103 (33%), Positives = 55/103 (53%), Gaps = 7/103 (6%)

Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
            KH+      +  LD +  P    T  +     T V++DISF     TG++  + +K   +
Sbjct: 70   KHNVAEPGSIKVLDKATVPIIKLTDQE-----TEVKVDISFNM--ETGVRAAEFIKNYMK 122

Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            ++     L LVLKQFL  R L++ ++GG+SSY L+L+   FLQ
Sbjct: 123  KYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAISFLQ 165



 Score = 47.4 bits (111), Expect = 0.067,   Method: Compositional matrix adjust.
 Identities = 24/52 (46%), Positives = 34/52 (65%), Gaps = 4/52 (7%)

Query: 1238 AVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVC----LPPVRNLE 1285
             V+R+   ++ LWP +   IFGS +TGL LP+SD+DLVV      PP++ LE
Sbjct: 14   VVRRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFGKWERPPLQLLE 65


>gi|237512970|ref|NP_001153629.1| DNA polymerase sigma [Bos taurus]
          Length = 541

 Score = 60.5 bits (145), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 35/103 (33%), Positives = 55/103 (53%), Gaps = 7/103 (6%)

Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
            KH+      +  LD +  P    T  +     T V++DISF     TG++  + +K   +
Sbjct: 70   KHNVAEPGSIKVLDKATVPIIKLTDQE-----TEVKVDISFNM--ETGVRAAEFIKNYMK 122

Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            ++     L LVLKQFL  R L++ ++GG+SSY L+L+   FLQ
Sbjct: 123  KYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAISFLQ 165



 Score = 47.4 bits (111), Expect = 0.067,   Method: Compositional matrix adjust.
 Identities = 24/52 (46%), Positives = 34/52 (65%), Gaps = 4/52 (7%)

Query: 1238 AVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVC----LPPVRNLE 1285
             V+R+   ++ LWP +   IFGS +TGL LP+SD+DLVV      PP++ LE
Sbjct: 14   VVRRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFGKWERPPLQLLE 65


>gi|157819111|ref|NP_001100803.1| DNA polymerase sigma [Rattus norvegicus]
 gi|149032753|gb|EDL87608.1| polymerase (DNA directed) sigma (predicted) [Rattus norvegicus]
          Length = 541

 Score = 60.5 bits (145), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 35/103 (33%), Positives = 55/103 (53%), Gaps = 7/103 (6%)

Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
            KH+      +  LD +  P    T  +     T V++DISF     TG++  + +K   +
Sbjct: 70   KHNVAEPCSIKVLDKATVPIIKLTDQE-----TEVKVDISFNM--ETGVRAAEFIKNYMK 122

Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            ++     L LVLKQFL  R L++ ++GG+SSY L+L+   FLQ
Sbjct: 123  KYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAISFLQ 165



 Score = 48.9 bits (115), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 33/102 (32%), Positives = 49/102 (48%), Gaps = 27/102 (26%)

Query: 1238 AVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVC----LPPVRNLEPIKEAGIL 1293
             VKR+   ++ LWP +   IFGS +TGL LP+SD+DLVV      PP++ LE        
Sbjct: 14   VVKRIETVVKDLWPSADVQIFGSFSTGLYLPTSDIDLVVFGKWERPPLQLLEQALR---- 69

Query: 1294 EGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIML 1335
              ++ + E C                 S+K ++   +PII L
Sbjct: 70   --KHNVAEPC-----------------SIKVLDKATVPIIKL 92


>gi|145513226|ref|XP_001442524.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124409877|emb|CAK75127.1| unnamed protein product [Paramecium tetraurelia]
          Length = 220

 Score = 60.5 bits (145), Expect = 7e-06,   Method: Composition-based stats.
 Identities = 30/81 (37%), Positives = 50/81 (61%), Gaps = 8/81 (9%)

Query: 1393 KAATSVRLDISFK-------SPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLD 1445
            +  + +++DISF        SP H+G  TT+LVK+  +++P    L L+LK  +    L 
Sbjct: 14   RIKSCIKVDISFNFSGSTYDSP-HSGFITTNLVKQWMKEYPVIQQLVLILKSMIKKLGLS 72

Query: 1446 QSYSGGLSSYCLMLLITRFLQ 1466
            +SY+GGLSSY L++++  FL+
Sbjct: 73   ESYTGGLSSYSLIIMVYSFLR 93


>gi|296475642|tpg|DAA17757.1| TPA: DNA polymerase sigma [Bos taurus]
          Length = 541

 Score = 60.5 bits (145), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 35/103 (33%), Positives = 55/103 (53%), Gaps = 7/103 (6%)

Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
            KH+      +  LD +  P    T  +     T V++DISF     TG++  + +K   +
Sbjct: 70   KHNVAEPGSIKVLDKATVPIIKLTDQE-----TEVKVDISFNM--ETGVRAAEFIKNYMK 122

Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            ++     L LVLKQFL  R L++ ++GG+SSY L+L+   FLQ
Sbjct: 123  KYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAISFLQ 165



 Score = 47.4 bits (111), Expect = 0.059,   Method: Compositional matrix adjust.
 Identities = 24/52 (46%), Positives = 34/52 (65%), Gaps = 4/52 (7%)

Query: 1238 AVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVC----LPPVRNLE 1285
             V+R+   ++ LWP +   IFGS +TGL LP+SD+DLVV      PP++ LE
Sbjct: 14   VVRRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFGKWERPPLQLLE 65


>gi|387273371|gb|AFJ70180.1| DNA polymerase sigma isoform 1 [Macaca mulatta]
          Length = 542

 Score = 60.5 bits (145), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 35/103 (33%), Positives = 55/103 (53%), Gaps = 7/103 (6%)

Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
            KH+      +  LD +  P    T  +     T V++DISF     TG++  + +K   +
Sbjct: 70   KHNVAEPCSIKVLDKATVPIIKLTDQE-----TEVKVDISFNM--ETGVRAAEFIKNYMK 122

Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            ++     L LVLKQFL  R L++ ++GG+SSY L+L+   FLQ
Sbjct: 123  KYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAISFLQ 165



 Score = 48.9 bits (115), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 33/102 (32%), Positives = 49/102 (48%), Gaps = 27/102 (26%)

Query: 1238 AVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVC----LPPVRNLEPIKEAGIL 1293
             VKR+   ++ LWP +   IFGS +TGL LP+SD+DLVV      PP++ LE        
Sbjct: 14   VVKRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFGKWERPPLQLLEQALR---- 69

Query: 1294 EGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIML 1335
              ++ + E C                 S+K ++   +PII L
Sbjct: 70   --KHNVAEPC-----------------SIKVLDKATVPIIKL 92


>gi|431900711|gb|ELK08155.1| DNA polymerase sigma [Pteropus alecto]
          Length = 539

 Score = 60.5 bits (145), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 35/103 (33%), Positives = 55/103 (53%), Gaps = 7/103 (6%)

Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
            KH+      +  LD +  P    T  +     T V++DISF     TG++  + +K   +
Sbjct: 70   KHNVAEPGSIKVLDKATVPIIKLTDQE-----TEVKVDISFNM--ETGVRAAEFIKNYMK 122

Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            ++     L LVLKQFL  R L++ ++GG+SSY L+L+   FLQ
Sbjct: 123  KYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAISFLQ 165



 Score = 45.8 bits (107), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 24/52 (46%), Positives = 33/52 (63%), Gaps = 4/52 (7%)

Query: 1238 AVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVC----LPPVRNLE 1285
             V R+   ++ LWP +   IFGS +TGL LP+SD+DLVV      PP++ LE
Sbjct: 14   VVGRIETVVKDLWPAADVQIFGSFSTGLYLPTSDIDLVVFGKWERPPLQLLE 65


>gi|395735628|ref|XP_002815457.2| PREDICTED: DNA polymerase sigma [Pongo abelii]
          Length = 542

 Score = 60.5 bits (145), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 35/103 (33%), Positives = 55/103 (53%), Gaps = 7/103 (6%)

Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
            KH+      +  LD +  P    T  +     T V++DISF     TG++  + +K   +
Sbjct: 70   KHNVAEPCSIKVLDKATVPIIKLTDQE-----TEVKVDISFNM--ETGVRAAEFIKNYMK 122

Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            ++     L LVLKQFL  R L++ ++GG+SSY L+L+   FLQ
Sbjct: 123  KYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAISFLQ 165



 Score = 48.9 bits (115), Expect = 0.020,   Method: Compositional matrix adjust.
 Identities = 33/102 (32%), Positives = 49/102 (48%), Gaps = 27/102 (26%)

Query: 1238 AVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVC----LPPVRNLEPIKEAGIL 1293
             VKR+   ++ LWP +   IFGS +TGL LP+SD+DLVV      PP++ LE        
Sbjct: 14   VVKRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFGKWERPPLQLLEQALR---- 69

Query: 1294 EGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIML 1335
              ++ + E C                 S+K ++   +PII L
Sbjct: 70   --KHNVAEPC-----------------SIKVLDKATVPIIKL 92


>gi|284507293|ref|NP_001165276.1| DNA polymerase sigma isoform 2 [Homo sapiens]
 gi|219518042|gb|AAI43883.1| POLS protein [Homo sapiens]
          Length = 541

 Score = 60.5 bits (145), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 35/103 (33%), Positives = 55/103 (53%), Gaps = 7/103 (6%)

Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
            KH+      +  LD +  P    T  +     T V++DISF     TG++  + +K   +
Sbjct: 70   KHNVAEPCSIKVLDKATVPIIKLTDQE-----TEVKVDISFNM--ETGVRAAEFIKNYMK 122

Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            ++     L LVLKQFL  R L++ ++GG+SSY L+L+   FLQ
Sbjct: 123  KYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAISFLQ 165



 Score = 48.9 bits (115), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 33/102 (32%), Positives = 49/102 (48%), Gaps = 27/102 (26%)

Query: 1238 AVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVC----LPPVRNLEPIKEAGIL 1293
             VKR+   ++ LWP +   IFGS +TGL LP+SD+DLVV      PP++ LE        
Sbjct: 14   VVKRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFGKWERPPLQLLEQALR---- 69

Query: 1294 EGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIML 1335
              ++ + E C                 S+K ++   +PII L
Sbjct: 70   --KHNVAEPC-----------------SIKVLDKATVPIIKL 92


>gi|426246851|ref|XP_004017201.1| PREDICTED: LOW QUALITY PROTEIN: DNA polymerase sigma [Ovis aries]
          Length = 602

 Score = 60.5 bits (145), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 35/103 (33%), Positives = 55/103 (53%), Gaps = 7/103 (6%)

Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
            KH+      +  LD +  P    T  +     T V++DISF     TG++  + +K   +
Sbjct: 70   KHNVAEPGSIKVLDKATVPIIKLTDQE-----TEVKVDISFNM--ETGVRAAEFIKNYMK 122

Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            ++     L LVLKQFL  R L++ ++GG+SSY L+L+   FLQ
Sbjct: 123  KYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAISFLQ 165



 Score = 47.0 bits (110), Expect = 0.084,   Method: Compositional matrix adjust.
 Identities = 24/51 (47%), Positives = 34/51 (66%), Gaps = 4/51 (7%)

Query: 1239 VKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVC----LPPVRNLE 1285
            V+R+   ++ LWP +   IFGS +TGL LP+SD+DLVV      PP++ LE
Sbjct: 15   VRRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFGKWERPPLQLLE 65


>gi|417402533|gb|JAA48112.1| Putative dna polymerase sigma [Desmodus rotundus]
          Length = 542

 Score = 60.5 bits (145), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 35/103 (33%), Positives = 55/103 (53%), Gaps = 7/103 (6%)

Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
            KH+      +  LD +  P    T  +     T V++DISF     TG++  + +K   +
Sbjct: 70   KHNVAEPGSIKVLDKATVPIIKLTDQE-----TEVKVDISFNM--ETGVRAAEFIKNYMK 122

Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            ++     L LVLKQFL  R L++ ++GG+SSY L+L+   FLQ
Sbjct: 123  KYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAISFLQ 165



 Score = 48.1 bits (113), Expect = 0.034,   Method: Compositional matrix adjust.
 Identities = 25/52 (48%), Positives = 34/52 (65%), Gaps = 4/52 (7%)

Query: 1238 AVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVC----LPPVRNLE 1285
             VKR+   ++ LWP +   IFGS +TGL LP+SD+DLVV      PP++ LE
Sbjct: 14   VVKRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFGKWERPPLQLLE 65


>gi|5902142|ref|NP_008930.1| DNA polymerase sigma isoform 1 [Homo sapiens]
 gi|397472993|ref|XP_003808011.1| PREDICTED: DNA polymerase sigma [Pan paniscus]
 gi|60392922|sp|Q5XG87.2|PAPD7_HUMAN RecName: Full=DNA polymerase sigma; AltName: Full=DNA polymerase
            kappa; AltName: Full=LAK-1; AltName: Full=PAP-associated
            domain-containing protein 7; AltName: Full=Terminal
            uridylyltransferase 5; Short=TUTase 5; AltName:
            Full=Topoisomerase-related function protein 4-1;
            Short=TRF4-1
 gi|5139669|dbj|BAA24434.2| LAK-1 [Homo sapiens]
 gi|47777665|gb|AAT38110.1| polymerase (DNA directed) sigma [Homo sapiens]
 gi|109658946|gb|AAI17138.1| Polymerase (DNA directed) sigma [Homo sapiens]
 gi|116496943|gb|AAI26107.1| Polymerase (DNA directed) sigma [Homo sapiens]
 gi|119628506|gb|EAX08101.1| polymerase (DNA directed) sigma [Homo sapiens]
 gi|127799472|gb|AAH84567.2| Polymerase (DNA directed) sigma [Homo sapiens]
 gi|158260737|dbj|BAF82546.1| unnamed protein product [Homo sapiens]
 gi|313883146|gb|ADR83059.1| PAP associated domain containing 7 [synthetic construct]
 gi|410210372|gb|JAA02405.1| PAP associated domain containing 7 [Pan troglodytes]
 gi|410259248|gb|JAA17590.1| PAP associated domain containing 7 [Pan troglodytes]
 gi|410303968|gb|JAA30584.1| PAP associated domain containing 7 [Pan troglodytes]
 gi|410337733|gb|JAA37813.1| PAP associated domain containing 7 [Pan troglodytes]
          Length = 542

 Score = 60.5 bits (145), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 35/103 (33%), Positives = 55/103 (53%), Gaps = 7/103 (6%)

Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
            KH+      +  LD +  P    T  +     T V++DISF     TG++  + +K   +
Sbjct: 70   KHNVAEPCSIKVLDKATVPIIKLTDQE-----TEVKVDISFNM--ETGVRAAEFIKNYMK 122

Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            ++     L LVLKQFL  R L++ ++GG+SSY L+L+   FLQ
Sbjct: 123  KYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAISFLQ 165



 Score = 48.9 bits (115), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 33/102 (32%), Positives = 49/102 (48%), Gaps = 27/102 (26%)

Query: 1238 AVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVC----LPPVRNLEPIKEAGIL 1293
             VKR+   ++ LWP +   IFGS +TGL LP+SD+DLVV      PP++ LE        
Sbjct: 14   VVKRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFGKWERPPLQLLEQALR---- 69

Query: 1294 EGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIML 1335
              ++ + E C                 S+K ++   +PII L
Sbjct: 70   --KHNVAEPC-----------------SIKVLDKATVPIIKL 92


>gi|386782099|ref|NP_001248228.1| DNA polymerase sigma [Macaca mulatta]
 gi|355691204|gb|EHH26389.1| DNA polymerase sigma [Macaca mulatta]
 gi|355749808|gb|EHH54146.1| DNA polymerase sigma [Macaca fascicularis]
 gi|380809116|gb|AFE76433.1| DNA polymerase sigma isoform 1 [Macaca mulatta]
 gi|383415413|gb|AFH30920.1| DNA polymerase sigma isoform 1 [Macaca mulatta]
 gi|384945014|gb|AFI36112.1| DNA polymerase sigma isoform 1 [Macaca mulatta]
          Length = 542

 Score = 60.5 bits (145), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 35/103 (33%), Positives = 55/103 (53%), Gaps = 7/103 (6%)

Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
            KH+      +  LD +  P    T  +     T V++DISF     TG++  + +K   +
Sbjct: 70   KHNVAEPCSIKVLDKATVPIIKLTDQE-----TEVKVDISFNM--ETGVRAAEFIKNYMK 122

Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            ++     L LVLKQFL  R L++ ++GG+SSY L+L+   FLQ
Sbjct: 123  KYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAISFLQ 165



 Score = 48.9 bits (115), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 33/102 (32%), Positives = 49/102 (48%), Gaps = 27/102 (26%)

Query: 1238 AVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVC----LPPVRNLEPIKEAGIL 1293
             VKR+   ++ LWP +   IFGS +TGL LP+SD+DLVV      PP++ LE        
Sbjct: 14   VVKRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFGKWERPPLQLLEQALR---- 69

Query: 1294 EGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIML 1335
              ++ + E C                 S+K ++   +PII L
Sbjct: 70   --KHNVAEPC-----------------SIKVLDKATVPIIKL 92


>gi|281306714|ref|NP_001162602.1| DNA polymerase sigma isoform 2 [Mus musculus]
 gi|74220251|dbj|BAE31304.1| unnamed protein product [Mus musculus]
          Length = 541

 Score = 60.1 bits (144), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 35/103 (33%), Positives = 55/103 (53%), Gaps = 7/103 (6%)

Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
            KH+      +  LD +  P    T  +     T V++DISF     TG++  + +K   +
Sbjct: 70   KHNVAEPCSIKVLDKATVPIIKLTDQE-----TEVKVDISFNM--ETGVRAAEFIKNYMK 122

Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            ++     L LVLKQFL  R L++ ++GG+SSY L+L+   FLQ
Sbjct: 123  KYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAISFLQ 165



 Score = 48.5 bits (114), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 33/102 (32%), Positives = 49/102 (48%), Gaps = 27/102 (26%)

Query: 1238 AVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVC----LPPVRNLEPIKEAGIL 1293
             VKR+   ++ LWP +   IFGS +TGL LP+SD+DLVV      PP++ LE        
Sbjct: 14   VVKRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFGKWERPPLQLLEQALR---- 69

Query: 1294 EGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIML 1335
              ++ + E C                 S+K ++   +PII L
Sbjct: 70   --KHNVAEPC-----------------SIKVLDKATVPIIKL 92


>gi|403282221|ref|XP_003932554.1| PREDICTED: DNA polymerase sigma [Saimiri boliviensis boliviensis]
          Length = 542

 Score = 60.1 bits (144), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 35/103 (33%), Positives = 55/103 (53%), Gaps = 7/103 (6%)

Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
            KH+      +  LD +  P    T  +     T V++DISF     TG++  + +K   +
Sbjct: 70   KHNVAEPCSIKVLDKATVPIIKLTDQE-----TEVKVDISFNM--ETGVRAAEFIKNYMK 122

Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            ++     L LVLKQFL  R L++ ++GG+SSY L+L+   FLQ
Sbjct: 123  KYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAISFLQ 165



 Score = 48.9 bits (115), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 33/102 (32%), Positives = 49/102 (48%), Gaps = 27/102 (26%)

Query: 1238 AVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVC----LPPVRNLEPIKEAGIL 1293
             VKR+   ++ LWP +   IFGS +TGL LP+SD+DLVV      PP++ LE        
Sbjct: 14   VVKRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFGKWERPPLQLLEQALR---- 69

Query: 1294 EGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIML 1335
              ++ + E C                 S+K ++   +PII L
Sbjct: 70   --KHNVAEPC-----------------SIKVLDKATVPIIKL 92


>gi|444729764|gb|ELW70170.1| DNA polymerase sigma [Tupaia chinensis]
          Length = 729

 Score = 60.1 bits (144), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 35/103 (33%), Positives = 55/103 (53%), Gaps = 7/103 (6%)

Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
            KH+      +  LD +  P    T  +     T V++DISF     TG++  + +K   +
Sbjct: 137  KHNVAEPCSIKVLDKATVPIIKLTDQE-----TEVKVDISFNM--ETGVRAAEFIKNYMK 189

Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            ++     L LVLKQFL  R L++ ++GG+SSY L+L+   FLQ
Sbjct: 190  KYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAISFLQ 232



 Score = 51.6 bits (122), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 42/133 (31%), Positives = 59/133 (44%), Gaps = 40/133 (30%)

Query: 1213 LHEEIDSF------CKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLS 1266
            LHEEI  F      C + AA            VKR+   ++ LWP +   IFGS +TGL 
Sbjct: 57   LHEEIIDFYNFMSPCPEEAAMRRE-------VVKRIETVVKDLWPAADVQIFGSFSTGLY 109

Query: 1267 LPSSDVDLVVC----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSL 1322
            LP+SD+DLVV      PP++ LE          ++ + E C                 S+
Sbjct: 110  LPTSDIDLVVFGKWERPPLQLLEQALR------KHNVAEPC-----------------SI 146

Query: 1323 KTVENTAIPIIML 1335
            K ++   +PII L
Sbjct: 147  KVLDKATVPIIKL 159


>gi|281306711|ref|NP_941002.2| DNA polymerase sigma isoform 1 [Mus musculus]
 gi|148705067|gb|EDL37014.1| polymerase (DNA directed) sigma [Mus musculus]
          Length = 542

 Score = 60.1 bits (144), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 35/103 (33%), Positives = 55/103 (53%), Gaps = 7/103 (6%)

Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
            KH+      +  LD +  P    T  +     T V++DISF     TG++  + +K   +
Sbjct: 70   KHNVAEPCSIKVLDKATVPIIKLTDQE-----TEVKVDISFNM--ETGVRAAEFIKNYMK 122

Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            ++     L LVLKQFL  R L++ ++GG+SSY L+L+   FLQ
Sbjct: 123  KYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAISFLQ 165



 Score = 48.5 bits (114), Expect = 0.026,   Method: Compositional matrix adjust.
 Identities = 33/102 (32%), Positives = 49/102 (48%), Gaps = 27/102 (26%)

Query: 1238 AVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVC----LPPVRNLEPIKEAGIL 1293
             VKR+   ++ LWP +   IFGS +TGL LP+SD+DLVV      PP++ LE        
Sbjct: 14   VVKRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFGKWERPPLQLLEQALR---- 69

Query: 1294 EGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIML 1335
              ++ + E C                 S+K ++   +PII L
Sbjct: 70   --KHNVAEPC-----------------SIKVLDKATVPIIKL 92


>gi|325186972|emb|CCA21516.1| conserved hypothetical protein [Albugo laibachii Nc14]
          Length = 766

 Score = 60.1 bits (144), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 34/75 (45%), Positives = 44/75 (58%), Gaps = 3/75 (4%)

Query: 1395 ATSVRLDISFKSPS---HTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            A SV  DI+F  P+   H G+ T  LV  L ++F     L +VLK FLA R L+  Y GG
Sbjct: 476  AQSVHFDITFDDPNVSPHDGVATAVLVNCLIDRFYGLGQLNIVLKYFLAKRDLNDPYMGG 535

Query: 1452 LSSYCLMLLITRFLQ 1466
            LSSY L L+IT  ++
Sbjct: 536  LSSYGLFLMITHIVR 550


>gi|354479063|ref|XP_003501733.1| PREDICTED: DNA polymerase sigma-like [Cricetulus griseus]
          Length = 624

 Score = 60.1 bits (144), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 35/103 (33%), Positives = 55/103 (53%), Gaps = 7/103 (6%)

Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
            KH+      +  LD +  P    T  +     T V++DISF     TG++  + +K   +
Sbjct: 151  KHNVAEPCSIKVLDKATVPIIKLTDQE-----TEVKVDISFNM--ETGVRAAEFIKNYMK 203

Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            ++     L LVLKQFL  R L++ ++GG+SSY L+L+   FLQ
Sbjct: 204  KYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAISFLQ 246



 Score = 49.7 bits (117), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 41/133 (30%), Positives = 58/133 (43%), Gaps = 40/133 (30%)

Query: 1213 LHEEIDSF------CKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLS 1266
            LHEEI  F      C + AA            V R+   ++ LWP +   IFGS +TGL 
Sbjct: 71   LHEEIIDFYNFMSPCPEEAAMRRE-------VVTRIETVVKDLWPTADVQIFGSFSTGLY 123

Query: 1267 LPSSDVDLVVC----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSL 1322
            LP+SD+DLVV      PP++ LE          ++ + E C                 S+
Sbjct: 124  LPTSDIDLVVFGKWERPPLQLLEQALR------KHNVAEPC-----------------SI 160

Query: 1323 KTVENTAIPIIML 1335
            K ++   +PII L
Sbjct: 161  KVLDKATVPIIKL 173


>gi|432104646|gb|ELK31258.1| DNA polymerase sigma [Myotis davidii]
          Length = 513

 Score = 60.1 bits (144), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 35/103 (33%), Positives = 55/103 (53%), Gaps = 7/103 (6%)

Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
            KH+      +  LD +  P    T  +     T V++DISF     TG++  + +K   +
Sbjct: 70   KHNVAEPGSIKVLDKATVPIIKLTDQE-----TEVKVDISFNM--ETGVRAAEFIKNYMK 122

Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            ++     L LVLKQFL  R L++ ++GG+SSY L+L+   FLQ
Sbjct: 123  KYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAISFLQ 165



 Score = 48.5 bits (114), Expect = 0.027,   Method: Compositional matrix adjust.
 Identities = 25/52 (48%), Positives = 34/52 (65%), Gaps = 4/52 (7%)

Query: 1238 AVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVC----LPPVRNLE 1285
             VKR+   ++ LWP +   IFGS +TGL LP+SD+DLVV      PP++ LE
Sbjct: 14   VVKRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFGKWERPPLQLLE 65


>gi|60392923|sp|Q6PB75.1|PAPD7_MOUSE RecName: Full=DNA polymerase sigma; AltName: Full=PAP-associated
            domain-containing protein 7
 gi|37590586|gb|AAH59846.1| Polymerase (DNA directed) sigma [Mus musculus]
          Length = 542

 Score = 60.1 bits (144), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 35/103 (33%), Positives = 55/103 (53%), Gaps = 7/103 (6%)

Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
            KH+      +  LD +  P    T  +     T V++DISF     TG++  + +K   +
Sbjct: 70   KHNVAEPCSIKVLDKATVPIIKLTDQE-----TEVKVDISFNM--ETGVRAAEFIKNYMK 122

Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            ++     L LVLKQFL  R L++ ++GG+SSY L+L+   FLQ
Sbjct: 123  KYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAISFLQ 165



 Score = 48.5 bits (114), Expect = 0.026,   Method: Compositional matrix adjust.
 Identities = 33/102 (32%), Positives = 49/102 (48%), Gaps = 27/102 (26%)

Query: 1238 AVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVC----LPPVRNLEPIKEAGIL 1293
             VKR+   ++ LWP +   IFGS +TGL LP+SD+DLVV      PP++ LE        
Sbjct: 14   VVKRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFGKWERPPLQLLEQALR---- 69

Query: 1294 EGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIML 1335
              ++ + E C                 S+K ++   +PII L
Sbjct: 70   --KHNVAEPC-----------------SIKVLDKATVPIIKL 92


>gi|449326076|gb|AGE92663.1| PAPD7 [Homo sapiens]
          Length = 772

 Score = 60.1 bits (144), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 35/103 (33%), Positives = 55/103 (53%), Gaps = 7/103 (6%)

Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
            KH+      +  LD +  P    T  +     T V++DISF     TG++  + +K   +
Sbjct: 300  KHNVAEPCSIKVLDKATVPIIKLTDQE-----TEVKVDISFNM--ETGVRAAEFIKNYMK 352

Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            ++     L LVLKQFL  R L++ ++GG+SSY L+L+   FLQ
Sbjct: 353  KYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAISFLQ 395



 Score = 52.4 bits (124), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 42/134 (31%), Positives = 59/134 (44%), Gaps = 40/134 (29%)

Query: 1213 LHEEIDSF------CKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLS 1266
            LHEEI  F      C + AA            VKR+   ++ LWP +   IFGS +TGL 
Sbjct: 220  LHEEIIDFYNFMSPCPEEAAMRRE-------VVKRIETVVKDLWPTADVQIFGSFSTGLY 272

Query: 1267 LPSSDVDLVVC----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSL 1322
            LP+SD+DLVV      PP++ LE          ++ + E C                 S+
Sbjct: 273  LPTSDIDLVVFGKWERPPLQLLEQALR------KHNVAEPC-----------------SI 309

Query: 1323 KTVENTAIPIIMLV 1336
            K ++   +PII L 
Sbjct: 310  KVLDKATVPIIKLT 323


>gi|344255499|gb|EGW11603.1| DNA polymerase sigma [Cricetulus griseus]
          Length = 534

 Score = 60.1 bits (144), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 35/103 (33%), Positives = 55/103 (53%), Gaps = 7/103 (6%)

Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
            KH+      +  LD +  P    T  +     T V++DISF     TG++  + +K   +
Sbjct: 61   KHNVAEPCSIKVLDKATVPIIKLTDQE-----TEVKVDISFNM--ETGVRAAEFIKNYMK 113

Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            ++     L LVLKQFL  R L++ ++GG+SSY L+L+   FLQ
Sbjct: 114  KYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAISFLQ 156



 Score = 46.6 bits (109), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 27/102 (26%)

Query: 1238 AVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVC----LPPVRNLEPIKEAGIL 1293
             V R+   ++ LWP +   IFGS +TGL LP+SD+DLVV      PP++ LE        
Sbjct: 5    VVTRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFGKWERPPLQLLEQALR---- 60

Query: 1294 EGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIML 1335
              ++ + E C                 S+K ++   +PII L
Sbjct: 61   --KHNVAEPC-----------------SIKVLDKATVPIIKL 83


>gi|344272726|ref|XP_003408182.1| PREDICTED: DNA polymerase sigma-like [Loxodonta africana]
          Length = 776

 Score = 60.1 bits (144), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 30/71 (42%), Positives = 46/71 (64%), Gaps = 2/71 (2%)

Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
            T V++DISF     TG++  +L+K   +++     L LVLKQFL  R L++ ++GG+SSY
Sbjct: 331  TEVKVDISFNV--ETGVRAAELIKSYMKKYSLLPYLILVLKQFLLQRDLNEVFTGGISSY 388

Query: 1456 CLMLLITRFLQ 1466
             L+L+   FLQ
Sbjct: 389  SLILMAISFLQ 399



 Score = 53.5 bits (127), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 35/83 (42%), Positives = 45/83 (54%), Gaps = 17/83 (20%)

Query: 1213 LHEEIDSF------CKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLS 1266
            LHEEI  F      C + AA  T         VKR+   ++ LWP +   IFGS +TGL 
Sbjct: 124  LHEEIIDFYNFMSPCPEEAAMRTE-------VVKRIETVIKDLWPTADVQIFGSFSTGLY 176

Query: 1267 LPSSDVDLVVC----LPPVRNLE 1285
            LP+SD+DLVV      PP++ LE
Sbjct: 177  LPTSDIDLVVFGKWERPPLQLLE 199


>gi|296194961|ref|XP_002745186.1| PREDICTED: DNA polymerase sigma [Callithrix jacchus]
          Length = 866

 Score = 60.1 bits (144), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 35/103 (33%), Positives = 55/103 (53%), Gaps = 7/103 (6%)

Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
            KH+      +  LD +  P    T  +     T V++DISF     TG++  + +K   +
Sbjct: 395  KHNVAEPCSIKVLDKATVPIIKLTDQE-----TEVKVDISFNM--ETGVRAAEFIKNYMK 447

Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            ++     L LVLKQFL  R L++ ++GG+SSY L+L+   FLQ
Sbjct: 448  KYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAISFLQ 490



 Score = 48.5 bits (114), Expect = 0.030,   Method: Compositional matrix adjust.
 Identities = 33/101 (32%), Positives = 49/101 (48%), Gaps = 27/101 (26%)

Query: 1239 VKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVC----LPPVRNLEPIKEAGILE 1294
            VKR+   ++ LWP  +  IFGS +TGL LP+SD+DLVV      PP++ LE         
Sbjct: 340  VKRIETVVKDLWPFFQVQIFGSFSTGLYLPTSDIDLVVFGKWERPPLQLLEQALR----- 394

Query: 1295 GRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIML 1335
             ++ + E C                 S+K ++   +PII L
Sbjct: 395  -KHNVAEPC-----------------SIKVLDKATVPIIKL 417


>gi|338718783|ref|XP_001501238.3| PREDICTED: DNA polymerase sigma [Equus caballus]
          Length = 574

 Score = 60.1 bits (144), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 35/103 (33%), Positives = 55/103 (53%), Gaps = 7/103 (6%)

Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
            KH+      +  LD +  P    T  +     T V++DISF     TG++  + +K   +
Sbjct: 102  KHNVAEPCSIKVLDKATVPIIKLTDQE-----TEVKVDISFNM--ETGVRAAEFIKNYMK 154

Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            ++     L LVLKQFL  R L++ ++GG+SSY L+L+   FLQ
Sbjct: 155  KYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAISFLQ 197



 Score = 52.4 bits (124), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 42/133 (31%), Positives = 59/133 (44%), Gaps = 40/133 (30%)

Query: 1213 LHEEIDSF------CKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLS 1266
            LHEEI  F      C + AA            VKR+   ++ LWP +   IFGS +TGL 
Sbjct: 22   LHEEIIDFYNFMSPCPEEAAMRRE-------VVKRIETVVKDLWPTADVQIFGSFSTGLY 74

Query: 1267 LPSSDVDLVVC----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSL 1322
            LP+SD+DLVV      PP++ LE          ++ + E C                 S+
Sbjct: 75   LPTSDIDLVVFGKWERPPLQLLEQALR------KHNVAEPC-----------------SI 111

Query: 1323 KTVENTAIPIIML 1335
            K ++   +PII L
Sbjct: 112  KVLDKATVPIIKL 124


>gi|402871130|ref|XP_003899535.1| PREDICTED: DNA polymerase sigma [Papio anubis]
          Length = 775

 Score = 60.1 bits (144), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 35/103 (33%), Positives = 55/103 (53%), Gaps = 7/103 (6%)

Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
            KH+      +  LD +  P    T  +     T V++DISF     TG++  + +K   +
Sbjct: 303  KHNVAEPCSIKVLDKATVPIIKLTDQE-----TEVKVDISFNM--ETGVRAAEFIKNYMK 355

Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            ++     L LVLKQFL  R L++ ++GG+SSY L+L+   FLQ
Sbjct: 356  KYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAISFLQ 398



 Score = 52.4 bits (124), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 42/134 (31%), Positives = 59/134 (44%), Gaps = 40/134 (29%)

Query: 1213 LHEEIDSF------CKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLS 1266
            LHEEI  F      C + AA            VKR+   ++ LWP +   IFGS +TGL 
Sbjct: 223  LHEEIIDFYNFMSPCPEEAAMRRE-------VVKRIETVVKDLWPTADVQIFGSFSTGLY 275

Query: 1267 LPSSDVDLVVC----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSL 1322
            LP+SD+DLVV      PP++ LE          ++ + E C                 S+
Sbjct: 276  LPTSDIDLVVFGKWERPPLQLLEQALR------KHNVAEPC-----------------SI 312

Query: 1323 KTVENTAIPIIMLV 1336
            K ++   +PII L 
Sbjct: 313  KVLDKATVPIIKLT 326


>gi|145533933|ref|XP_001452711.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124420410|emb|CAK85314.1| unnamed protein product [Paramecium tetraurelia]
          Length = 262

 Score = 60.1 bits (144), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 29/78 (37%), Positives = 49/78 (62%), Gaps = 8/78 (10%)

Query: 1396 TSVRLDISFK-------SPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSY 1448
            + +++DISF        SP H+G  TT+LVK+  +++P    L L+LK  +    L +SY
Sbjct: 59   SCIKVDISFNFSGSTYDSP-HSGFMTTNLVKQWMKEYPVIQQLVLILKSMIKKLGLSESY 117

Query: 1449 SGGLSSYCLMLLITRFLQ 1466
            +GGLSSY L++++  +L+
Sbjct: 118  TGGLSSYSLIIMVYSYLR 135


>gi|291411166|ref|XP_002721862.1| PREDICTED: DNA polymerase sigma [Oryctolagus cuniculus]
          Length = 542

 Score = 60.1 bits (144), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 35/103 (33%), Positives = 55/103 (53%), Gaps = 7/103 (6%)

Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
            KH+      +  LD +  P    T  +     T V++DISF     TG++  + +K   +
Sbjct: 70   KHNVAEPCSIKVLDKATVPIIKLTDQE-----TEVKVDISFNM--ETGVRAAEFIKSYMK 122

Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            ++     L LVLKQFL  R L++ ++GG+SSY L+L+   FLQ
Sbjct: 123  KYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAISFLQ 165



 Score = 48.9 bits (115), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 33/102 (32%), Positives = 49/102 (48%), Gaps = 27/102 (26%)

Query: 1238 AVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVC----LPPVRNLEPIKEAGIL 1293
             VKR+   ++ LWP +   IFGS +TGL LP+SD+DLVV      PP++ LE        
Sbjct: 14   VVKRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFGKWERPPLQLLEQALR---- 69

Query: 1294 EGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIML 1335
              ++ + E C                 S+K ++   +PII L
Sbjct: 70   --KHNVAEPC-----------------SIKVLDKATVPIIKL 92


>gi|240276761|gb|EER40272.1| Poly(A) RNA polymerase cid14 [Ajellomyces capsulatus H143]
          Length = 680

 Score = 59.7 bits (143), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 74/291 (25%), Positives = 117/291 (40%), Gaps = 83/291 (28%)

Query: 1192 QPLEVQNCPT---RKASLSLMH--SLLHEEIDSF---CKQVAAENTARKPYINWAVKRVT 1243
            +P E QN PT     +  S +H  S LH EI SF    K  + E++ R    N  V+R+ 
Sbjct: 227  KPFEDQN-PTPWMDSSGPSTLHLGSRLHNEILSFYHWVKPKSFEHSIR----NDLVERLQ 281

Query: 1244 RSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGI--KE 1301
            R  +     S+   FGS A+GL LP++D+DLV+C              + + R  +  + 
Sbjct: 282  RHFERRHYGSQLCAFGSFASGLYLPTADMDLVLC----------SRQFVRQDRKVLCQRP 331

Query: 1302 TCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHT 1361
              +   A YL + +     S++T+ +  +PII  V  +                     T
Sbjct: 332  REIHSFAAYLKDLDIAVPGSVETIAHAKVPIIKFVDRL---------------------T 370

Query: 1362 TLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKEL 1421
             LK D       ++ D+S                              TG+      +  
Sbjct: 371  GLKVD-------LSFDNS------------------------------TGIAANKTFQVW 393

Query: 1422 TEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQHEHHLG 1472
              QFPA   +  V+KQFL  R L++  +GGL  + ++ L+T  LQH  + G
Sbjct: 394  KSQFPAMPVIVSVVKQFLLLRGLNEVATGGLGGFSIICLVTSLLQHLPYSG 444


>gi|225556159|gb|EEH04448.1| Poly(A) RNA polymerase cid14 [Ajellomyces capsulatus G186AR]
          Length = 680

 Score = 59.7 bits (143), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 74/291 (25%), Positives = 117/291 (40%), Gaps = 83/291 (28%)

Query: 1192 QPLEVQNCPT---RKASLSLMH--SLLHEEIDSF---CKQVAAENTARKPYINWAVKRVT 1243
            +P E QN PT     +  S +H  S LH EI SF    K  + E++ R    N  V+R+ 
Sbjct: 227  KPFEDQN-PTPWMDSSGPSTLHLGSRLHNEILSFYHWVKPKSFEHSIR----NDLVERLQ 281

Query: 1244 RSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGI--KE 1301
            R  +     S+   FGS A+GL LP++D+DLV+C              + + R  +  + 
Sbjct: 282  RHFERRHYGSQLCAFGSFASGLYLPTADMDLVLC----------SRQFVRQDRKVLCQRP 331

Query: 1302 TCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHT 1361
              +   A YL + +     S++T+ +  +PII  V  +                     T
Sbjct: 332  REIHSFAAYLKDLDIAVPGSVETIAHAKVPIIKFVDRL---------------------T 370

Query: 1362 TLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKEL 1421
             LK D       ++ D+S                              TG+      +  
Sbjct: 371  GLKVD-------LSFDNS------------------------------TGIAANKTFQVW 393

Query: 1422 TEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQHEHHLG 1472
              QFPA   +  V+KQFL  R L++  +GGL  + ++ L+T  LQH  + G
Sbjct: 394  KSQFPAMPVIVSVVKQFLLLRGLNEVATGGLGGFSIICLVTSLLQHLPYSG 444


>gi|219518895|gb|AAI43707.1| PAPD5 protein [Homo sapiens]
          Length = 277

 Score = 59.7 bits (143), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 25/58 (43%), Positives = 38/58 (65%)

Query: 1409 HTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
              G++  DL+K+ T+++P    L LVLKQFL  R L++ ++GG+ SY L L+   FLQ
Sbjct: 1    QNGVRAADLIKDFTKKYPVLPYLVLVLKQFLLQRDLNEVFTGGIGSYSLFLMAVSFLQ 58


>gi|335304248|ref|XP_003134203.2| PREDICTED: DNA polymerase sigma [Sus scrofa]
          Length = 625

 Score = 59.7 bits (143), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 35/103 (33%), Positives = 54/103 (52%), Gaps = 7/103 (6%)

Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
            KH       +  LD +  P    T  +     T V++DISF     TG++  + +K   +
Sbjct: 153  KHKVAEPCSIKVLDKATVPIIKLTDQE-----TEVKVDISFNM--ETGVRAAEFIKNYMK 205

Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            ++     L LVLKQFL  R L++ ++GG+SSY L+L+   FLQ
Sbjct: 206  KYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAISFLQ 248



 Score = 52.0 bits (123), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 42/133 (31%), Positives = 59/133 (44%), Gaps = 40/133 (30%)

Query: 1213 LHEEIDSF------CKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLS 1266
            LHEEI  F      C + AA            VKR+   ++ LWP +   IFGS +TGL 
Sbjct: 73   LHEEIIDFYNFMSPCPEEAAMRRE-------VVKRIETVVKDLWPTAEVQIFGSFSTGLY 125

Query: 1267 LPSSDVDLVVC----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSL 1322
            LP+SD+DLVV      PP++ LE          ++ + E C                 S+
Sbjct: 126  LPTSDIDLVVFGKWERPPLQLLEQALR------KHKVAEPC-----------------SI 162

Query: 1323 KTVENTAIPIIML 1335
            K ++   +PII L
Sbjct: 163  KVLDKATVPIIKL 175


>gi|395857085|ref|XP_003800943.1| PREDICTED: DNA polymerase sigma [Otolemur garnettii]
          Length = 837

 Score = 59.7 bits (143), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 35/103 (33%), Positives = 55/103 (53%), Gaps = 7/103 (6%)

Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
            KH+      +  LD +  P    T  +     T V++DISF     TG++  + +K   +
Sbjct: 365  KHNVAEPCSIKVLDKATVPIIKLTDQE-----TEVKVDISFNM--ETGVRAAEFIKNYMK 417

Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            ++     L LVLKQFL  R L++ ++GG+SSY L+L+   FLQ
Sbjct: 418  KYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAISFLQ 460



 Score = 51.6 bits (122), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 42/134 (31%), Positives = 59/134 (44%), Gaps = 40/134 (29%)

Query: 1213 LHEEIDSF------CKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLS 1266
            LHEEI  F      C + AA            VKR+   ++ LWP +   IFGS +TGL 
Sbjct: 285  LHEEIIDFYNFMSPCPEEAAMR-------REVVKRIETVVKDLWPTADVQIFGSFSTGLY 337

Query: 1267 LPSSDVDLVVC----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSL 1322
            LP+SD+DLVV      PP++ LE          ++ + E C                 S+
Sbjct: 338  LPTSDIDLVVFGKWERPPLQLLEQALR------KHNVAEPC-----------------SI 374

Query: 1323 KTVENTAIPIIMLV 1336
            K ++   +PII L 
Sbjct: 375  KVLDKATVPIIKLT 388


>gi|351706462|gb|EHB09381.1| DNA polymerase sigma [Heterocephalus glaber]
          Length = 542

 Score = 59.7 bits (143), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 33/92 (35%), Positives = 51/92 (55%), Gaps = 7/92 (7%)

Query: 1375 ALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALV 1434
             LD +  P    T  +     T V++DISF     TG++  + +K   +++     L LV
Sbjct: 81   VLDKATVPIIKLTDQE-----TEVKVDISFNM--ETGVRAAEFIKNYMKEYSLLPYLILV 133

Query: 1435 LKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            LKQFL  R L++ ++GG+SSY L+L+   FLQ
Sbjct: 134  LKQFLLQRDLNEVFTGGISSYSLILMAISFLQ 165



 Score = 47.8 bits (112), Expect = 0.054,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 49/102 (48%), Gaps = 27/102 (26%)

Query: 1238 AVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVC----LPPVRNLEPIKEAGIL 1293
             V+R+   ++ LWP +   IFGS +TGL LP+SD+DLVV      PP++ LE        
Sbjct: 14   VVQRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFGEWERPPLQLLEQALR---- 69

Query: 1294 EGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIML 1335
              ++ + E C                 S+K ++   +PII L
Sbjct: 70   --KHSVAEPC-----------------SIKVLDKATVPIIKL 92


>gi|258575851|ref|XP_002542107.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
 gi|237902373|gb|EEP76774.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
          Length = 675

 Score = 59.7 bits (143), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 64/263 (24%), Positives = 104/263 (39%), Gaps = 77/263 (29%)

Query: 1213 LHEEIDSF---CKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
            LH+EI  F    K    E+  R   I     R+   +Q  +P S+ + FGS A+GL LP 
Sbjct: 245  LHDEIIGFYHWVKPKPFEDVIRTDLIT----RLEMHMQRRFPGSQLHAFGSYASGLYLPV 300

Query: 1270 SDVDLVVCLPPVRNLEPIKEAGILEGRNGI--KETCLQHAARYLANQEWVKSDSLKTVEN 1327
            +DVDLV+          +  + + +G+  +  K   +   + Y+ + E     S++T+ +
Sbjct: 301  ADVDLVL----------LSRSFLRQGKKFLCQKPKDIWSLSAYIKDTEIAVPGSIETIAH 350

Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
              +PII  V  +                     T LK D       ++ D+S        
Sbjct: 351  ARVPIIKFVDRL---------------------TGLKVD-------LSFDNS-------- 374

Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
                                  +GL      +    QFPA   +  V+KQFL  R L++ 
Sbjct: 375  ----------------------SGLTANRTFQIWKTQFPAMPLIVSVIKQFLLLRGLNEV 412

Query: 1448 YSGGLSSYCLMLLITRFLQHEHH 1470
             +GGL  + ++ L+T  LQH  H
Sbjct: 413  PTGGLGGFSIICLVTSLLQHLPH 435


>gi|219110415|ref|XP_002176959.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217411494|gb|EEC51422.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 1606

 Score = 59.3 bits (142), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 68/267 (25%), Positives = 114/267 (42%), Gaps = 76/267 (28%)

Query: 1245 SLQVLWPRS-RTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEA----GILEGR--- 1296
            S+  LWP S +  ++GS AT L LPSSD+D+VV     R    ++      G L+G+   
Sbjct: 904  SVAKLWPASCQVELYGSCATHLDLPSSDLDVVVVGLDRRRGTLMQVGTQNNGCLDGKMES 963

Query: 1297 ------NGIKETCL-------------------QHAARYLANQEWVKSDSLKTVENTAIP 1331
                  + +++  L                   +  A  LA Q W     +K +   ++P
Sbjct: 964  KSDFDVDDVRQKILNASLPPYIPTSTSLNAERIKRLAAELATQPWAVQ--VKAIPTASVP 1021

Query: 1332 IIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDN 1391
            ++ ++ + P+ L            ++ +    K D H     +A + S   K S  SSD+
Sbjct: 1022 VVKVLAD-PYRL------------QNVSGNDWKLDKH---PKIAAETS---KVSQVSSDS 1062

Query: 1392 IKAATS------------VRLDISFKSPSHTGLQTTDL-VKELTEQF------PASTPLA 1432
             +  +             +  DI+F+ P H G+ +T+  ++ + E        P  TP  
Sbjct: 1063 ARLQSFQPWRGADAMNGLLSFDITFEGPEHGGIGSTEFSIRTVNEACRETGLPPEGTPFV 1122

Query: 1433 ---LVLKQFLADRSLDQSYSGGLSSYC 1456
               +VLK+ LA R L++ YSGGLSSY 
Sbjct: 1123 QVIMVLKELLAQRKLNEPYSGGLSSYA 1149


>gi|345313578|ref|XP_001519035.2| PREDICTED: DNA polymerase sigma-like, partial [Ornithorhynchus
            anatinus]
          Length = 641

 Score = 59.3 bits (142), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 35/103 (33%), Positives = 54/103 (52%), Gaps = 7/103 (6%)

Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
            KH       +  LD +  P    T  +     T V++DISF     TG++    +K+  +
Sbjct: 170  KHSVAEPKTIKVLDKATVPIIKLTDQE-----TDVKVDISFNM--ETGVKAARFIKDYMK 222

Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            ++     L LVLKQFL  R L++ ++GG+SSY L+L+   FLQ
Sbjct: 223  KYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAISFLQ 265



 Score = 53.5 bits (127), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 36/87 (41%), Positives = 48/87 (55%), Gaps = 11/87 (12%)

Query: 1206 LSLMHSLLHEEIDSFCKQVA---AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNA 1262
            LSL    LHEEI  F   ++    E   R+      VKR+   ++ LWP +   IFGS +
Sbjct: 83   LSLCSLKLHEEIIDFYNFMSPCPEEGIMRRE----VVKRIETVIKDLWPTADVQIFGSFS 138

Query: 1263 TGLSLPSSDVDLVVC----LPPVRNLE 1285
            TGL LP+SD+DLVV      PP++ LE
Sbjct: 139  TGLYLPTSDIDLVVFGKWERPPLQLLE 165


>gi|145491624|ref|XP_001431811.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124398917|emb|CAK64413.1| unnamed protein product [Paramecium tetraurelia]
          Length = 985

 Score = 59.3 bits (142), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 52/215 (24%), Positives = 94/215 (43%), Gaps = 49/215 (22%)

Query: 1257 IFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEW 1316
            IFGS+ TGL+L  SDVD+VV   PV       +  + E    + E   Q         +W
Sbjct: 680  IFGSSRTGLALHDSDVDMVVFGLPV-----FTKPQLFEPMRKLLEVFCQ--------MKW 726

Query: 1317 VKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVAL 1376
              + S K +   +IP+I + ++     +    + +        +  + H N    ++   
Sbjct: 727  --AISYKHIFQASIPLIKITIDPSIGFLEFVGNPL--------YYVMNHRNIEQMNLKIG 776

Query: 1377 DDSASPKCSHTSSDNIKAATSVRLDISFKSPS----------HTGLQTTDLVKELTEQFP 1426
            + S +                +++DI+F+             H GL +TD  +++     
Sbjct: 777  EQSQN----------------IQIDITFELQQPYPMYGYQGYHVGLLSTDYQRQVGISVK 820

Query: 1427 ASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLI 1461
              T +A+VLK+ L  R L+ SY+GG+SS+CL +++
Sbjct: 821  GFTEVAIVLKKILKQRGLNDSYTGGVSSFCLTIML 855


>gi|170084663|ref|XP_001873555.1| predicted protein [Laccaria bicolor S238N-H82]
 gi|164651107|gb|EDR15347.1| predicted protein [Laccaria bicolor S238N-H82]
          Length = 300

 Score = 59.3 bits (142), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 33/70 (47%), Positives = 39/70 (55%), Gaps = 2/70 (2%)

Query: 1397 SVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYC 1456
            SV +DI   +    GLQT DLVK   EQ PA  PL LV K+FL  R L+ +   GL SY 
Sbjct: 11   SVAIDIGINNVD--GLQTRDLVKSYLEQMPALRPLILVFKRFLEQRELNNASKSGLGSYA 68

Query: 1457 LMLLITRFLQ 1466
              L+   FLQ
Sbjct: 69   ATLMCINFLQ 78


>gi|397613693|gb|EJK62373.1| hypothetical protein THAOC_17017 [Thalassiosira oceanica]
          Length = 458

 Score = 58.9 bits (141), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 26/71 (36%), Positives = 42/71 (59%), Gaps = 2/71 (2%)

Query: 1397 SVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYC 1456
            ++ +D+ F  P+  G ++ DL+    E  P   PL   LK FL+ R +++ ++GG+ SY 
Sbjct: 134  NIDVDVCFDQPN--GPESADLMHRFMESMPPLRPLTFALKYFLSSRDINKPFTGGIGSYL 191

Query: 1457 LMLLITRFLQH 1467
            L L+I  FLQH
Sbjct: 192  LQLMIVSFLQH 202


>gi|294883728|ref|XP_002771045.1| hypothetical protein Pmar_PMAR026020 [Perkinsus marinus ATCC 50983]
 gi|239874251|gb|EER02861.1| hypothetical protein Pmar_PMAR026020 [Perkinsus marinus ATCC 50983]
          Length = 651

 Score = 58.9 bits (141), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 27/72 (37%), Positives = 42/72 (58%)

Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
            +SV++ IS   P+H+G++T+  V  L + +    PL LV+K  L    L   Y GG+SSY
Sbjct: 383  SSVKVQISVTGPAHSGIETSAYVAMLCQHYDCLRPLVLVIKHVLVCDGLVTPYDGGMSSY 442

Query: 1456 CLMLLITRFLQH 1467
             L+L++  FL  
Sbjct: 443  TLVLMVVSFLNQ 454


>gi|294945352|ref|XP_002784637.1| hypothetical protein Pmar_PMAR021029 [Perkinsus marinus ATCC 50983]
 gi|239897822|gb|EER16433.1| hypothetical protein Pmar_PMAR021029 [Perkinsus marinus ATCC 50983]
          Length = 681

 Score = 58.9 bits (141), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 27/70 (38%), Positives = 42/70 (60%)

Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
            +SV++ IS   P+H+G++T+  V  L + +    PL LV+K  L    L   Y GG+SSY
Sbjct: 413  SSVKVQISVTGPAHSGIETSAYVAMLCQHYDCLRPLVLVIKHVLVCDGLVTPYDGGMSSY 472

Query: 1456 CLMLLITRFL 1465
             L+L++  FL
Sbjct: 473  TLVLMVVSFL 482


>gi|410905163|ref|XP_003966061.1| PREDICTED: DNA polymerase sigma-like [Takifugu rubripes]
          Length = 778

 Score = 58.9 bits (141), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 34/80 (42%), Positives = 47/80 (58%), Gaps = 11/80 (13%)

Query: 1213 LHEEIDSFCKQVA---AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
            LHEEI  F   ++    E   R+  +N    R+ R ++ LWP +R  IFGS +TGL LP+
Sbjct: 237  LHEEIMDFFNFISPRPEEEAMRRDVVN----RIERVIKDLWPTARVEIFGSFSTGLYLPT 292

Query: 1270 SDVDLVVC----LPPVRNLE 1285
            SD+DLVV      PP++ LE
Sbjct: 293  SDIDLVVFGKWDHPPLQELE 312



 Score = 57.8 bits (138), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 27/71 (38%), Positives = 42/71 (59%), Gaps = 2/71 (2%)

Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
            T V++DISF     T ++    +K   +++    PL  VLKQFL  R L++ ++GG+ SY
Sbjct: 344  TEVKVDISFNV--ETAVKAAQFIKSYLKKYTVLPPLIFVLKQFLLQRDLNEVFTGGIGSY 401

Query: 1456 CLMLLITRFLQ 1466
             L+L+   FLQ
Sbjct: 402  SLILMAISFLQ 412


>gi|344305107|gb|EGW35339.1| hypothetical protein SPAPADRAFT_48344 [Spathaspora passalidarum NRRL
            Y-27907]
          Length = 615

 Score = 58.5 bits (140), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 57/231 (24%), Positives = 90/231 (38%), Gaps = 75/231 (32%)

Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
            N  V R+   +   WP +  ++FGS AT L LP SD+D+VV          I E G  E 
Sbjct: 206  NTVVNRLKTQIAKFWPGTEAHVFGSCATDLYLPGSDIDMVV----------ISETGDYEN 255

Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
            R     + L   + +L +++  K  +++ + N  +PII  V                 P 
Sbjct: 256  R-----SRLYQLSSFLRSKKLAK--NVEVIANAKVPIIKFV----------------DP- 291

Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
                      ++ +H D+           S   ++ I AA  +R            L TT
Sbjct: 292  ----------ESEIHIDV-----------SFERTNGIDAAKRIR----------KWLITT 320

Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
                      P    L L++KQFL  R L+  + GGL  Y  +++   FL+
Sbjct: 321  ----------PGLRELVLIVKQFLRSRRLNNVHVGGLGGYATIIMCYHFLR 361


>gi|401837753|gb|EJT41641.1| PAP2-like protein [Saccharomyces kudriavzevii IFO 1802]
          Length = 592

 Score = 58.5 bits (140), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 56/234 (23%), Positives = 87/234 (37%), Gaps = 76/234 (32%)

Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
            N  +  +  +L+ LWP +  ++FGS +T L LP SD+D VV            E G  E 
Sbjct: 203  NQTISTIREALKQLWPDADLHVFGSYSTDLYLPGSDIDCVVN----------SELGGKES 252

Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
            RN +           LA +       ++ V    +PII  V   PH              
Sbjct: 253  RNNLYSLASHLKKNNLATE-------IEVVAKARVPIIKFV--EPH-------------- 289

Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
                       + +H D                             +SF+  +  GL+  
Sbjct: 290  -----------SRIHID-----------------------------VSFERTN--GLEAA 307

Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQHEH 1469
             L++E     P    L L++KQFL  R L+  ++GGL  + ++ L+  FL H H
Sbjct: 308  KLIREWLNDTPGLRELVLIVKQFLHARRLNNVHTGGLGGFSIICLVFSFL-HMH 360


>gi|410077415|ref|XP_003956289.1| hypothetical protein KAFR_0C01610 [Kazachstania africana CBS 2517]
 gi|372462873|emb|CCF57154.1| hypothetical protein KAFR_0C01610 [Kazachstania africana CBS 2517]
          Length = 537

 Score = 58.5 bits (140), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 55/234 (23%), Positives = 91/234 (38%), Gaps = 76/234 (32%)

Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
            N  + R+  +++ LWP +  ++FGS +T L LP SD+D VV            E G  + 
Sbjct: 167  NITISRIRDAVKELWPDADLHVFGSYSTDLYLPGSDIDCVVN----------SERGNKDS 216

Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
            +N     CL   A++L  ++   +  ++ V    +PII  V   PH              
Sbjct: 217  KN-----CLYQLAKFLTTKKL--ATDVEVVSKARVPIIKFV--EPH-------------- 253

Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
                         +H D                             +SF+  +  GL+  
Sbjct: 254  -----------TGIHID-----------------------------VSFERTN--GLEAA 271

Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQHEH 1469
             L++   +       L LV+KQFL  R L+  ++GGL  + ++ L+  FL H H
Sbjct: 272  KLIRSWLDSTAGLRELVLVIKQFLHARRLNNVHTGGLGGFSIICLVFTFL-HMH 324


>gi|344230457|gb|EGV62342.1| hypothetical protein CANTEDRAFT_126141 [Candida tenuis ATCC 10573]
          Length = 615

 Score = 58.2 bits (139), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 56/231 (24%), Positives = 94/231 (40%), Gaps = 75/231 (32%)

Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
            N  VK + + ++V WP +  ++FGS AT L LP SD+D+VV          + + G  E 
Sbjct: 202  NSVVKTLKQQIKVCWPDAEAHVFGSFATDLYLPGSDIDMVV----------VSKNGDCEN 251

Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
            R+      L   + +L +++  K   ++ +    +PII  V                 PK
Sbjct: 252  RHK-----LYQLSSFLRSKKLAK--DIEVIAGAKVPIIKFV----------------DPK 288

Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
                        ++H D+           S   ++ + AA  +R            L+TT
Sbjct: 289  -----------TNIHLDI-----------SFERTNGLDAARRIR----------KWLETT 316

Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
              ++E          L LV+KQFL  R L+  + GGL  Y  ++L   F++
Sbjct: 317  AGLRE----------LVLVVKQFLRSRKLNNVHVGGLGGYATIILCYHFIK 357


>gi|145513903|ref|XP_001442862.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124410223|emb|CAK75465.1| unnamed protein product [Paramecium tetraurelia]
          Length = 627

 Score = 58.2 bits (139), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 70/314 (22%), Positives = 120/314 (38%), Gaps = 87/314 (27%)

Query: 1194 LEVQNCPTRKA-------------SLSLMHSLLHEEIDSFCKQVAAENTA-----RKPY- 1234
            LEVQN P ++              + +LM   L  ++  F   +  +N       R  Y 
Sbjct: 235  LEVQNSPFKQTVKEDQIDQHIMNQARALMIKKLDMDMREFNDIIQGQNQQILKLRRLIYD 294

Query: 1235 -----INWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKE 1289
                 IN+  +     +  L   S+  +FGS ATGL+LP SD+D+               
Sbjct: 295  RLQFVINYLFRGFIHHISQLDFNSQVCLFGSCATGLALPESDIDI--------------- 339

Query: 1290 AGILEGRNGIK--ETC-----LQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHD 1342
                 G  G +   TC     +Q    +L    WV  +S+  + ++ +P+I L V+ P  
Sbjct: 340  -----GITGFEMCSTCQLYVPIQKLTEFLQRMRWV--NSIVAITSSTMPLIKLQVD-PTI 391

Query: 1343 LIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDI 1402
                +   +  P  D     L  D ++   + ++D                        I
Sbjct: 392  SFVQSTLPIGLPYIDLI---LNPDENITRQIFSVD------------------------I 424

Query: 1403 SF------KSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYC 1456
            SF      K   H GL +T+L  +    +     + L+ K  L  R L+  + GG+SS+C
Sbjct: 425  SFFQYSGPKQNQHLGLISTELTLQWLSFYSELRSIVLLFKSLLKKRGLNDQFKGGMSSFC 484

Query: 1457 LMLLITRFLQHEHH 1470
            ++ ++  FL+  +H
Sbjct: 485  IIQMVLAFLECFYH 498


>gi|365758533|gb|EHN00370.1| Pap2p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
          Length = 514

 Score = 58.2 bits (139), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 56/234 (23%), Positives = 87/234 (37%), Gaps = 76/234 (32%)

Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
            N  +  +  +L+ LWP +  ++FGS +T L LP SD+D VV            E G  E 
Sbjct: 203  NQTISTIREALKQLWPDADLHVFGSYSTDLYLPGSDIDCVVN----------SELGGKES 252

Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
            RN +           LA +       ++ V    +PII  V   PH              
Sbjct: 253  RNNLYSLASHLKKNNLATE-------IEVVAKARVPIIKFV--EPH-------------- 289

Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
                       + +H D                             +SF+  +  GL+  
Sbjct: 290  -----------SRIHID-----------------------------VSFERTN--GLEAA 307

Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQHEH 1469
             L++E     P    L L++KQFL  R L+  ++GGL  + ++ L+  FL H H
Sbjct: 308  KLIREWLNDTPGLRELVLIVKQFLHARRLNNVHTGGLGGFSIICLVFSFL-HMH 360


>gi|308499447|ref|XP_003111909.1| CRE-GLD-4 protein [Caenorhabditis remanei]
 gi|308268390|gb|EFP12343.1| CRE-GLD-4 protein [Caenorhabditis remanei]
          Length = 907

 Score = 58.2 bits (139), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 25/47 (53%), Positives = 36/47 (76%)

Query: 1420 ELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            ++ E+FP   PL L+LKQFL  R+L+Q+++GGLSSY L+LL+  F Q
Sbjct: 230  QVKEEFPLIEPLVLLLKQFLHYRNLNQTFTGGLSSYGLVLLLVNFFQ 276


>gi|432908932|ref|XP_004078069.1| PREDICTED: uncharacterized protein LOC101163355 [Oryzias latipes]
          Length = 779

 Score = 58.2 bits (139), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 27/71 (38%), Positives = 43/71 (60%), Gaps = 2/71 (2%)

Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
            + V++DISF     T ++    +K   +++    PL  VLKQFL  R L++ ++GG+SSY
Sbjct: 324  SKVKVDISFNV--ETAVKAAQFIKSYLKKYAVLPPLIFVLKQFLLQRDLNEVFTGGISSY 381

Query: 1456 CLMLLITRFLQ 1466
             L+L+   FLQ
Sbjct: 382  SLILMAISFLQ 392



 Score = 52.8 bits (125), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 32/80 (40%), Positives = 46/80 (57%), Gaps = 11/80 (13%)

Query: 1213 LHEEIDSFCKQVA---AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
            LHEEI  F   ++    E   R+  +N    ++   ++ LWP +R  IFGS +TGL LP+
Sbjct: 217  LHEEILDFFHFMSPKPEEEAMRRDVVN----KIEGVIKDLWPTARVEIFGSFSTGLYLPT 272

Query: 1270 SDVDLVVC----LPPVRNLE 1285
            SD+DLVV      PP++ LE
Sbjct: 273  SDIDLVVFGKWEHPPLQELE 292


>gi|260948920|ref|XP_002618757.1| hypothetical protein CLUG_02216 [Clavispora lusitaniae ATCC 42720]
 gi|238848629|gb|EEQ38093.1| hypothetical protein CLUG_02216 [Clavispora lusitaniae ATCC 42720]
          Length = 567

 Score = 58.2 bits (139), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 52/231 (22%), Positives = 89/231 (38%), Gaps = 75/231 (32%)

Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
            N  ++R+ R +   WP+++ ++FGS AT L LP SD+D+VV          I   G  E 
Sbjct: 180  NTVIRRLKRRIAEFWPQTQAHVFGSCATDLYLPGSDIDMVV----------ISTTGDYEQ 229

Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
            R       L   + +L   +  K  +++ +    +PII  V                 P+
Sbjct: 230  RGK-----LYQLSSFLRTNKLAK--NIEVIATAKVPIIKFV----------------DPQ 266

Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
             +           +H D+           S   ++ + AA  +R                
Sbjct: 267  YN-----------IHVDI-----------SFERTNGLDAARRIR---------------- 288

Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
                +  +  P    L L++KQFL  R L+  + GGL  Y  ++L+  FL+
Sbjct: 289  ----KWLDSMPGLRELVLIVKQFLRSRRLNNVHVGGLGGYATIILMYHFLR 335


>gi|342182382|emb|CCC91860.1| putative DNA polymerase sigma [Trypanosoma congolense IL3000]
          Length = 403

 Score = 57.8 bits (138), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 25/68 (36%), Positives = 42/68 (61%)

Query: 1400 LDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLML 1459
            LD+     +  G + T +V +L +++P + PL +V+K FL  R +D+ Y GGL S+ + L
Sbjct: 92   LDVDISINAEDGQRNTSIVIDLLKRYPEARPLTVVVKYFLLQRGMDEPYHGGLGSFAVTL 151

Query: 1460 LITRFLQH 1467
            L+  FLQ+
Sbjct: 152  LVISFLQN 159


>gi|146419896|ref|XP_001485907.1| hypothetical protein PGUG_01578 [Meyerozyma guilliermondii ATCC 6260]
          Length = 588

 Score = 57.8 bits (138), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 56/235 (23%), Positives = 91/235 (38%), Gaps = 83/235 (35%)

Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
            N  + R+  ++   WP +  ++FGS+AT L LP SD+D+VV          I   G  E 
Sbjct: 190  NNVIGRLKSTITKFWPDTEVHVFGSSATDLYLPGSDIDMVV----------ISRDGDREQ 239

Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
            R+ + +      ++ LA        +++ +    +PI+  V                 P 
Sbjct: 240  RSRLYQLSTHLRSKKLAK-------NIEVIAKAKVPIVKFV----------------DP- 275

Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
                      D+++H D+           S   S+ I AA  +R                
Sbjct: 276  ----------DSNIHIDV-----------SFERSNGIDAAIKIR---------------- 298

Query: 1416 DLVKELTEQFPASTP----LALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
                    ++ ASTP    L LV+KQFL  R L+  + GGL  Y  ++L   FL+
Sbjct: 299  --------EWLASTPGLRELVLVVKQFLRSRRLNNVHVGGLGGYSTIILCYHFLK 345


>gi|190345571|gb|EDK37480.2| hypothetical protein PGUG_01578 [Meyerozyma guilliermondii ATCC 6260]
          Length = 588

 Score = 57.8 bits (138), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 56/235 (23%), Positives = 91/235 (38%), Gaps = 83/235 (35%)

Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
            N  + R+  ++   WP +  ++FGS+AT L LP SD+D+VV          I   G  E 
Sbjct: 190  NNVIGRLKSTITKFWPDTEVHVFGSSATDLYLPGSDIDMVV----------ISRDGDREQ 239

Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
            R+ + +      ++ LA        +++ +    +PI+  V                 P 
Sbjct: 240  RSRLYQLSTHLRSKKLAK-------NIEVIAKAKVPIVKFV----------------DP- 275

Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
                      D+++H D+           S   S+ I AA  +R                
Sbjct: 276  ----------DSNIHIDV-----------SFERSNGIDAAIKIR---------------- 298

Query: 1416 DLVKELTEQFPASTP----LALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
                    ++ ASTP    L LV+KQFL  R L+  + GGL  Y  ++L   FL+
Sbjct: 299  --------EWLASTPGLRELVLVVKQFLRSRRLNNVHVGGLGGYSTIILCYHFLK 345


>gi|359493669|ref|XP_002282332.2| PREDICTED: PAP-associated domain-containing protein 5-like [Vitis
            vinifera]
 gi|302143015|emb|CBI20310.3| unnamed protein product [Vitis vinifera]
          Length = 497

 Score = 57.4 bits (137), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 31/89 (34%), Positives = 49/89 (55%), Gaps = 10/89 (11%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            I+  +SV  DISF      G +  + +++   ++P   PL L+LK FL  R L++ YSGG
Sbjct: 209  IEKRSSVAFDISFDV--ENGPKAAEYIQDAISKWPPLRPLCLILKVFLQQRELNEVYSGG 266

Query: 1452 LSSYCLMLLITRFLQH--------EHHLG 1472
            + SY L+ ++   LQ+        EH+LG
Sbjct: 267  IGSYALLAMLIAMLQNLQEWNASVEHNLG 295


>gi|268637610|ref|XP_002649102.1| PAP/25A-associated domain-containing protein [Dictyostelium
            discoideum AX4]
 gi|256012839|gb|EEU04050.1| PAP/25A-associated domain-containing protein [Dictyostelium
            discoideum AX4]
          Length = 938

 Score = 57.0 bits (136), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 25/67 (37%), Positives = 45/67 (67%), Gaps = 2/67 (2%)

Query: 1400 LDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLML 1459
            +DI F +P+  G++ T +VK L +Q+ +   L L++K FL   +L+++Y+GG+ SY L L
Sbjct: 565  IDICFDTPN--GIENTAVVKGLLKQYKSMRTLLLIIKFFLHQNNLNETYTGGIGSYALAL 622

Query: 1460 LITRFLQ 1466
            ++  F+Q
Sbjct: 623  MVVSFIQ 629


>gi|150951520|ref|XP_001387852.2| topoisomerase I-related protein [Scheffersomyces stipitis CBS 6054]
 gi|149388662|gb|EAZ63829.2| topoisomerase I-related protein [Scheffersomyces stipitis CBS 6054]
          Length = 605

 Score = 57.0 bits (136), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 55/230 (23%), Positives = 92/230 (40%), Gaps = 75/230 (32%)

Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
            N  + ++  S+   WP + T++FGS+AT L LP SD+D+V+          +   G  E 
Sbjct: 200  NRLINKLKSSISSYWPETETHVFGSSATDLYLPGSDIDIVI----------VSRTGDYEN 249

Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
            R     + L   + YL ++   K  +++ +    +PII  V                 P+
Sbjct: 250  R-----SRLYQLSSYLRHKGLAK--NMEVIAKAKVPIIKFV----------------DPE 286

Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
             +                V +D       S    + I+AA  +R            + TT
Sbjct: 287  SN----------------VNID------VSFERRNGIEAAKKIR----------RWMTTT 314

Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFL 1465
              ++EL           L++KQFL+ R L+  +SGGL  Y  ++L   FL
Sbjct: 315  PGLREL----------VLIIKQFLSSRRLNNVHSGGLGGYATIILCYHFL 354


>gi|307102621|gb|EFN50891.1| hypothetical protein CHLNCDRAFT_37639 [Chlorella variabilis]
          Length = 509

 Score = 57.0 bits (136), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 29/73 (39%), Positives = 41/73 (56%), Gaps = 2/73 (2%)

Query: 1394 AATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLS 1453
            A +    DISF      G +  + V+ L +  P   PL +VLK FL  R L++ YSGGL 
Sbjct: 264  AESGYNFDISFDV--ANGPEAAENVRALMDALPPMRPLVMVLKVFLQQRELNEVYSGGLG 321

Query: 1454 SYCLMLLITRFLQ 1466
            SY L++++  FLQ
Sbjct: 322  SYALLVMVATFLQ 334



 Score = 56.6 bits (135), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 30/66 (45%), Positives = 42/66 (63%), Gaps = 5/66 (7%)

Query: 1213 LHEEIDSFCKQVA--AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSS 1270
            LH+EI  FC+ +A  AE  A++     A+ R+   +  +WP++R   FGS ATGL LP+S
Sbjct: 162  LHQEIVEFCRYLAPSAEEQAQR---QAAMDRIEAVVTSIWPKARLQAFGSFATGLYLPTS 218

Query: 1271 DVDLVV 1276
            DVD VV
Sbjct: 219  DVDAVV 224


>gi|255072677|ref|XP_002500013.1| predicted protein [Micromonas sp. RCC299]
 gi|226515275|gb|ACO61271.1| predicted protein, partial [Micromonas sp. RCC299]
          Length = 292

 Score = 57.0 bits (136), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 60/261 (22%), Positives = 102/261 (39%), Gaps = 91/261 (34%)

Query: 1213 LHEEIDSFCKQV---AAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
            LH EI  FC+ +   A E++AR      AV+RV  ++  +WP +R  + GS ATG+ LP+
Sbjct: 4    LHHEIVDFCRYLEPTAEESSARTA----AVERVRGAVLSIWPGARFEVHGSFATGMYLPN 59

Query: 1270 SDVDLVV----CLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTV 1325
            SD+D V+    C  P                     TCL+  A  L+ +   +   ++ +
Sbjct: 60   SDIDAVILGSGCKSPA--------------------TCLKALALSLSRKGMAR--KIQLI 97

Query: 1326 ENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCS 1385
                +PI+                     K +   +  + D       V+ D +  P  +
Sbjct: 98   AKARVPIV---------------------KFEERPSGFQFD-------VSFDVANGPASA 129

Query: 1386 HTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLD 1445
                 N++   ++R                          P +T L    K FL  R+L+
Sbjct: 130  EIVRANMRRFPALR--------------------------PLTTVL----KAFLQQRALN 159

Query: 1446 QSYSGGLSSYCLMLLITRFLQ 1466
            + Y+GG+ SY L+ ++   LQ
Sbjct: 160  EVYTGGVGSYALLCMVMAHLQ 180


>gi|406604992|emb|CCH43591.1| Poly(A) RNA polymerase protein 1 [Wickerhamomyces ciferrii]
          Length = 624

 Score = 57.0 bits (136), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 51/230 (22%), Positives = 90/230 (39%), Gaps = 75/230 (32%)

Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
            N  V+++  ++  LWP    ++FGS AT L LP SD+D+V+          + E G  E 
Sbjct: 234  NNTVRKLREAIMELWPDCEVHVFGSYATDLYLPGSDIDMVI----------VSEHGGYES 283

Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
            RN      L   + +L  +   K  +++ +    +PII                      
Sbjct: 284  RN-----SLYSLSSFLKRKNLAK--NVEVIAKAKVPIIKFT------------------- 317

Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
                    +  +++H D+           S   ++ I AA ++R  I+            
Sbjct: 318  --------ESTSNIHIDV-----------SFERTNGIDAAKTIRSWIT------------ 346

Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFL 1465
                    + P    + L++KQFL+ R L+  + GGL  Y ++ L+  FL
Sbjct: 347  --------ETPGLREIVLIVKQFLSSRKLNNVHVGGLGGYSIICLVYSFL 388


>gi|348677916|gb|EGZ17733.1| hypothetical protein PHYSODRAFT_503192 [Phytophthora sojae]
          Length = 269

 Score = 56.6 bits (135), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 30/70 (42%), Positives = 41/70 (58%), Gaps = 1/70 (1%)

Query: 1398 VRLDISF-KSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYC 1456
            V LD++   S  HTGL   DL+     + PA  PL L+LK  L    L  ++ GG+SSY 
Sbjct: 77   VLLDLTCGHSVGHTGLGARDLIYSFQAEMPALRPLVLILKSHLLRNDLKCAFIGGISSYV 136

Query: 1457 LMLLITRFLQ 1466
            L++L+ RFLQ
Sbjct: 137  LVILVIRFLQ 146


>gi|164656242|ref|XP_001729249.1| hypothetical protein MGL_3716 [Malassezia globosa CBS 7966]
 gi|159103139|gb|EDP42035.1| hypothetical protein MGL_3716 [Malassezia globosa CBS 7966]
          Length = 527

 Score = 56.6 bits (135), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 30/74 (40%), Positives = 43/74 (58%), Gaps = 2/74 (2%)

Query: 1400 LDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLML 1459
            +DIS    +  GLQ +  V    ++ PA  PL +V+KQFL  R+L + Y+GGL SY + L
Sbjct: 92   VDISINQAN--GLQASKFVNGWLKKQPAIRPLVMVIKQFLQQRALSEVYTGGLGSYSVTL 149

Query: 1460 LITRFLQHEHHLGR 1473
            ++  FLQ    L R
Sbjct: 150  MVLSFLQLHPKLQR 163


>gi|147800856|emb|CAN64475.1| hypothetical protein VITISV_017481 [Vitis vinifera]
          Length = 493

 Score = 56.6 bits (135), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 30/89 (33%), Positives = 49/89 (55%), Gaps = 10/89 (11%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            ++  +SV  DISF      G +  + +++   ++P   PL L+LK FL  R L++ YSGG
Sbjct: 199  VEKRSSVAFDISFDV--ENGPKAAEYIQDAISKWPPLRPLCLILKVFLQQRELNEVYSGG 256

Query: 1452 LSSYCLMLLITRFLQH--------EHHLG 1472
            + SY L+ ++   LQ+        EH+LG
Sbjct: 257  IGSYALLAMLIAMLQNLQEWNASVEHNLG 285


>gi|440637467|gb|ELR07386.1| hypothetical protein GMDG_08401 [Geomyces destructans 20631-21]
          Length = 753

 Score = 56.6 bits (135), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 31/92 (33%), Positives = 47/92 (51%), Gaps = 13/92 (14%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            I   T +R+D+SF++   TGL      +   + FPA   L  ++KQFLA R L++  +GG
Sbjct: 534  IDYYTGLRVDVSFEN--DTGLIANKTFQNWKDTFPAMPILVTIVKQFLAMRGLNEPVNGG 591

Query: 1452 LSSYCLMLLITRFLQH-----------EHHLG 1472
            +  + +  L+   LQH           EHHLG
Sbjct: 592  IGGFTVTCLVVSLLQHMPQIQSGNMIPEHHLG 623


>gi|349581056|dbj|GAA26214.1| K7_Pap2p [Saccharomyces cerevisiae Kyokai no. 7]
          Length = 584

 Score = 56.6 bits (135), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 55/234 (23%), Positives = 89/234 (38%), Gaps = 76/234 (32%)

Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
            N  +  +  +++ LWP +  ++FGS +T L LP SD+D VV            E G  E 
Sbjct: 201  NQTISTIREAVKQLWPDADLHVFGSYSTDLYLPGSDIDCVVT----------SELGGKES 250

Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
            RN +         + LA +       ++ V    +PII  V   PH              
Sbjct: 251  RNNLYSLASHLKKKNLATE-------VEVVAKARVPIIKFV--EPH-------------- 287

Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
                       + +H D+           S   ++ I+AA                    
Sbjct: 288  -----------SGIHIDV-----------SFERTNGIEAAK------------------- 306

Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQHEH 1469
             L++E  +  P    L L++KQFL  R L+  ++GGL  + ++ L+  FL H H
Sbjct: 307  -LIREWLDDTPGLRELVLIVKQFLHARRLNNVHTGGLGGFSIICLVFSFL-HMH 358


>gi|6324457|ref|NP_014526.1| non-canonical poly(A) polymerase PAP2 [Saccharomyces cerevisiae
            S288c]
 gi|1717744|sp|P53632.1|PAP2_YEAST RecName: Full=Poly(A) RNA polymerase protein 2; AltName: Full=DNA
            polymerase kappa; AltName: Full=DNA polymerase sigma;
            AltName: Full=Topoisomerase 1-related protein TRF4
 gi|663237|emb|CAA88145.1| ORF [Saccharomyces cerevisiae]
 gi|950226|gb|AAC49091.1| Trf4p [Saccharomyces cerevisiae]
 gi|1419987|emb|CAA99134.1| TRF4 [Saccharomyces cerevisiae]
 gi|51830518|gb|AAU09782.1| YOL115W [Saccharomyces cerevisiae]
 gi|285814775|tpg|DAA10668.1| TPA: non-canonical poly(A) polymerase PAP2 [Saccharomyces cerevisiae
            S288c]
 gi|392296670|gb|EIW07772.1| Pap2p [Saccharomyces cerevisiae CEN.PK113-7D]
          Length = 584

 Score = 56.6 bits (135), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 55/234 (23%), Positives = 89/234 (38%), Gaps = 76/234 (32%)

Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
            N  +  +  +++ LWP +  ++FGS +T L LP SD+D VV            E G  E 
Sbjct: 201  NQTISTIREAVKQLWPDADLHVFGSYSTDLYLPGSDIDCVVT----------SELGGKES 250

Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
            RN +         + LA +       ++ V    +PII  V   PH              
Sbjct: 251  RNNLYSLASHLKKKNLATE-------VEVVAKARVPIIKFV--EPH-------------- 287

Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
                       + +H D+           S   ++ I+AA                    
Sbjct: 288  -----------SGIHIDV-----------SFERTNGIEAAK------------------- 306

Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQHEH 1469
             L++E  +  P    L L++KQFL  R L+  ++GGL  + ++ L+  FL H H
Sbjct: 307  -LIREWLDDTPGLRELVLIVKQFLHARRLNNVHTGGLGGFSIICLVFSFL-HMH 358


>gi|453083258|gb|EMF11304.1| hypothetical protein SEPMUDRAFT_47828 [Mycosphaerella populorum
            SO2202]
          Length = 672

 Score = 56.6 bits (135), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 24/76 (31%), Positives = 47/76 (61%), Gaps = 2/76 (2%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            + A T +++D+SF++ S  GL+     +E  +++P    L +++KQFL  R L++ + GG
Sbjct: 345  VDATTGIKVDVSFENDS--GLRAIPTFQEWKQRYPQMPVLVVIIKQFLTMRGLNEVFLGG 402

Query: 1452 LSSYCLMLLITRFLQH 1467
            +  + ++ L+T  LQH
Sbjct: 403  IGGFTIICLVTSMLQH 418


>gi|196004468|ref|XP_002112101.1| hypothetical protein TRIADDRAFT_23436 [Trichoplax adhaerens]
 gi|190586000|gb|EDV26068.1| hypothetical protein TRIADDRAFT_23436 [Trichoplax adhaerens]
          Length = 289

 Score = 56.6 bits (135), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 30/93 (32%), Positives = 52/93 (55%), Gaps = 6/93 (6%)

Query: 1378 DSASPKCSHTSSDNIKAAT----SVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLAL 1433
            D  + KC H +   I   T    + ++DI F   +   +++ D+++    ++     L L
Sbjct: 95   DRDNIKCIHNAMVPIIKLTEKTCNYKMDIEFNIEN--SVKSADIIQTYIRKYEPLKYLVL 152

Query: 1434 VLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            VLKQFL  R L++ +SGG+SSY L++++  FLQ
Sbjct: 153  VLKQFLFQRELNEVFSGGVSSYTLVMMVVNFLQ 185



 Score = 55.1 bits (131), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 40/126 (31%), Positives = 60/126 (47%), Gaps = 26/126 (20%)

Query: 1213 LHEEIDSFCKQVA---AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
            LHEEI  F + ++    E   R+      V+ V   +  LWP  +  +FGS  TGL LP+
Sbjct: 10   LHEEIIDFYQYISPRPEEKNMRET----VVEGVKEVILTLWPHVQVEVFGSFRTGLYLPT 65

Query: 1270 SDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTA 1329
            SD+DLV+              GI +G+   ++  L+ A   L   E    D++K + N  
Sbjct: 66   SDIDLVI-------------FGI-DGKGAFED--LEKA---LMQHEVCDRDNIKCIHNAM 106

Query: 1330 IPIIML 1335
            +PII L
Sbjct: 107  VPIIKL 112


>gi|294654384|ref|XP_456434.2| DEHA2A02200p [Debaryomyces hansenii CBS767]
 gi|199428840|emb|CAG84386.2| DEHA2A02200p [Debaryomyces hansenii CBS767]
          Length = 600

 Score = 56.2 bits (134), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 55/231 (23%), Positives = 90/231 (38%), Gaps = 75/231 (32%)

Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
            N  V ++ + +   WP +  ++FGS AT L LP SD+D+VV          + E G  E 
Sbjct: 194  NRVVNQLKQQIGQFWPATELHVFGSCATDLYLPGSDIDMVV----------VSETGDYEH 243

Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
            R     + L   + +L N++  K  +++ +    +PII  V                 P 
Sbjct: 244  R-----SRLYQLSSFLRNKKLAK--NIEVIAKAKVPIIKFV----------------DPT 280

Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
                       +++H D+           S   ++ I AA  +R            L +T
Sbjct: 281  -----------SNIHIDI-----------SFERTNGIDAAKKIR----------RWLSST 308

Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
                      P    L L++KQFL  R L+  + GGL  Y  ++L   FL+
Sbjct: 309  ----------PGLRELVLIVKQFLRSRKLNNVHVGGLGGYSTIILCYHFLK 349


>gi|440795046|gb|ELR16187.1| PAP/25A associated domain containing protein [Acanthamoeba
            castellanii str. Neff]
          Length = 471

 Score = 56.2 bits (134), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 28/75 (37%), Positives = 48/75 (64%), Gaps = 1/75 (1%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            + + T+ ++DISF + ++ G   T +VK+   +     PLALVLK +L+   L++ Y+GG
Sbjct: 208  VDSETNCQVDISFATTAN-GAVNTAIVKKFCVEAEEIRPLALVLKYYLSQMGLNEPYTGG 266

Query: 1452 LSSYCLMLLITRFLQ 1466
            + SY L+L+I  +LQ
Sbjct: 267  IGSYTLLLMIISYLQ 281



 Score = 40.8 bits (94), Expect = 5.3,   Method: Compositional matrix adjust.
 Identities = 19/56 (33%), Positives = 31/56 (55%), Gaps = 1/56 (1%)

Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLP 1268
            LHEEI +F  Q  +     K      ++R++  +  +WP +  N+FGS+A+ L LP
Sbjct: 134  LHEEIKAFA-QFMSPTPGEKKMREEVIRRISSVITTVWPYAEVNVFGSSASELYLP 188


>gi|313226931|emb|CBY22076.1| unnamed protein product [Oikopleura dioica]
          Length = 424

 Score = 56.2 bits (134), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 29/75 (38%), Positives = 44/75 (58%), Gaps = 2/75 (2%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            I++ T V +DISF + +  G +   LVK+   ++P    L ++LK  L  R L++ + GG
Sbjct: 193  IESNTLVHVDISFNTSN--GREAAALVKKYMAEYPNLKQLVVLLKYILNHRGLNEVWKGG 250

Query: 1452 LSSYCLMLLITRFLQ 1466
            L SY L LL+  FLQ
Sbjct: 251  LGSYALTLLVVNFLQ 265


>gi|345563688|gb|EGX46674.1| hypothetical protein AOL_s00097g578 [Arthrobotrys oligospora ATCC
            24927]
          Length = 1706

 Score = 56.2 bits (134), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 29/71 (40%), Positives = 44/71 (61%), Gaps = 2/71 (2%)

Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
            T + +DISF++PS  GL       +  + FP    LAL+LKQFLA R +++ ++GGL S+
Sbjct: 1390 TGLSVDISFENPS--GLIAIRTFNQWRQVFPEMPKLALLLKQFLAIRGINEPFNGGLGSF 1447

Query: 1456 CLMLLITRFLQ 1466
             L+ +I   LQ
Sbjct: 1448 SLICMIVSMLQ 1458


>gi|222619028|gb|EEE55160.1| hypothetical protein OsJ_02969 [Oryza sativa Japonica Group]
          Length = 536

 Score = 56.2 bits (134), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 30/94 (31%), Positives = 48/94 (51%), Gaps = 15/94 (15%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            ++  + +  DISF      G Q  D +K+  ++FPA   L ++LK FL  R L++ Y+GG
Sbjct: 204  VERKSEIAFDISFDM--DGGPQAADFIKDYVKKFPALRHLCMILKVFLHQRELNEVYTGG 261

Query: 1452 LSSYCLMLLITRFLQ-------------HEHHLG 1472
            + SY L+ ++   LQ              EH+LG
Sbjct: 262  IGSYALLTMLITHLQLIWGGKDILGYRKKEHNLG 295


>gi|367014043|ref|XP_003681521.1| hypothetical protein TDEL_0E00670 [Torulaspora delbrueckii]
 gi|359749182|emb|CCE92310.1| hypothetical protein TDEL_0E00670 [Torulaspora delbrueckii]
          Length = 663

 Score = 55.8 bits (133), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 53/234 (22%), Positives = 89/234 (38%), Gaps = 76/234 (32%)

Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
            N  + ++  +++ LWP +   +FGS AT L LP SD+D VV                 +G
Sbjct: 229  NKTISKIRAAVRELWPDADLQVFGSYATDLYLPGSDIDCVVN---------------SKG 273

Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
            R+      L   A +L ++E      ++ +    +PII  V                 P+
Sbjct: 274  RDKENRNSLYSLASFLKSKELAT--RVEVIAKARVPIIKFV----------------EPQ 315

Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
                       + +H D+              S + I    + RL               
Sbjct: 316  -----------SQIHIDV--------------SFERINGLEAARL--------------- 335

Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQHEH 1469
              ++E  E+ P    L L++KQFL  R L+  ++GGL  + ++ L+  FL H H
Sbjct: 336  --IREWLEETPGLRELVLIVKQFLHSRRLNNVHTGGLGGFSIICLVYSFL-HLH 386


>gi|159470731|ref|XP_001693510.1| poly(a) polymerase [Chlamydomonas reinhardtii]
 gi|158283013|gb|EDP08764.1| poly(a) polymerase [Chlamydomonas reinhardtii]
          Length = 517

 Score = 55.8 bits (133), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 64/273 (23%), Positives = 106/273 (38%), Gaps = 68/273 (24%)

Query: 1210 HSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRT---NIFGSNATGLS 1266
            +S LH  I+ FC++V      R+      ++ V   ++ +WP +R     +FGS A GLS
Sbjct: 100  YSPLHYNIEEFCQRVVPTEGERRQR-QEVIEAVRGGVRRVWPGARGVELQVFGSFANGLS 158

Query: 1267 LPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETC-LQHAARYLANQEWVKSDSLKTV 1325
              +SD+DLVV        EP +  G  E  +  + T  L+  A  L   + +  +  + +
Sbjct: 159  TWNSDLDLVV----TGIYEPDRMTGGYEINDRGRITAKLRKIAEALNRSKAIDIERQQLI 214

Query: 1326 ENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCS 1385
                IPI+ L  +                    A  T+          V++ D + P+ +
Sbjct: 215  PRARIPILKLWTK--------------------ARVTVD---------VSMSDDSGPRAA 245

Query: 1386 HTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLD 1445
               +   +A                              +P   PL LVLK +L    L+
Sbjct: 246  RYMAQQCRA------------------------------YPPLKPLVLVLKAYLKACRLN 275

Query: 1446 QSYSGGLSSYCLMLLITRFLQHEHHLGRPINQV 1478
            +  SGGLSSY L  ++   LQ E   G  ++ +
Sbjct: 276  EVNSGGLSSYSLTNMVIAHLQEELKSGHDVSDL 308


>gi|410075647|ref|XP_003955406.1| hypothetical protein KAFR_0A08370 [Kazachstania africana CBS 2517]
 gi|372461988|emb|CCF56271.1| hypothetical protein KAFR_0A08370 [Kazachstania africana CBS 2517]
          Length = 630

 Score = 55.8 bits (133), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 56/234 (23%), Positives = 93/234 (39%), Gaps = 76/234 (32%)

Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
            N AV ++ R+++ LW  S   +FGS AT L LP SD+D VV             +G  E 
Sbjct: 191  NKAVSKLGRAIKELWSDSELLVFGSYATDLYLPGSDIDCVVN----------SASGNKEH 240

Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
            R     + L   AR+L  +    + S++ +    +PII  +                   
Sbjct: 241  R-----SYLYELARFLKKKNL--ATSIEVIARARVPIIKFI------------------- 274

Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
                    + ++ VH D+           S   ++ ++AA                    
Sbjct: 275  --------EPESGVHIDI-----------SFERTNGVEAAK------------------- 296

Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQHEH 1469
             L++E  +  P    L L++KQFL  R L+  ++GGL  + ++ L+  FL H H
Sbjct: 297  -LIREWLDMTPGLRELVLIVKQFLTARRLNDVHTGGLGGFSIICLVYSFL-HLH 348


>gi|412993216|emb|CCO16749.1| predicted protein [Bathycoccus prasinos]
          Length = 767

 Score = 55.8 bits (133), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 65/262 (24%), Positives = 106/262 (40%), Gaps = 71/262 (27%)

Query: 1213 LHEEIDSFCK---QVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
            LH++I+ F     Q  AE +AR   I   V+   +     + +++ + FGS ATGL+L  
Sbjct: 267  LHDDIERFASENVQNEAETSARDE-IEQTVRETIQKYS--FSKAQVHRFGSGATGLALKD 323

Query: 1270 SDVDLVVC-LPPVRNLEPIKEAGILEGRNGIKETCLQH---AARYLANQEWVKSDSLKTV 1325
            +DVDLV+  + P    + +K  G    R+  K+  +QH    A+ L N+   K   +  +
Sbjct: 324  ADVDLVILGVGP----QSVKGGGGGFTRSE-KQLLVQHLRTIAKTLRNKGVCKRAEI--I 376

Query: 1326 ENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCS 1385
             +  +PI  L                     DA H   K                     
Sbjct: 377  SSAKVPIAKL---------------------DAYHAETK--------------------- 394

Query: 1386 HTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLD 1445
                      T ++LD++    +  GL     ++E   +FPA  PL LVLK+ L    L+
Sbjct: 395  ----------THIKLDLAIGVSN--GLAAAQWIREQVGEFPALKPLVLVLKRLLQIHKLN 442

Query: 1446 QSYSGGLSSYCLMLLITRFLQH 1467
             + +GG   Y L+ L+   L+ 
Sbjct: 443  NAATGGCGGYLLISLVVSHLKQ 464


>gi|218188825|gb|EEC71252.1| hypothetical protein OsI_03224 [Oryza sativa Indica Group]
          Length = 571

 Score = 55.8 bits (133), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 26/75 (34%), Positives = 43/75 (57%), Gaps = 2/75 (2%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            ++  + +  DISF      G Q  D +K+  ++FPA   L ++LK FL  R L++ Y+GG
Sbjct: 236  VERKSEIAFDISFDM--DGGPQAADFIKDYVKKFPALRHLCMILKVFLHQRELNEVYTGG 293

Query: 1452 LSSYCLMLLITRFLQ 1466
            + SY L+ ++   LQ
Sbjct: 294  IGSYALLTMLITHLQ 308


>gi|429327416|gb|AFZ79176.1| nucleotidyltransferase domain containing protein [Babesia equi]
          Length = 520

 Score = 55.5 bits (132), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 32/76 (42%), Positives = 43/76 (56%), Gaps = 4/76 (5%)

Query: 1392 IKAATSVRLDIS-FKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSG 1450
            +   T VRLDIS F+  S+T   TT  +KE    F    PL L++K FL  R+L  +Y G
Sbjct: 234  VDKRTGVRLDISVFQESSNT---TTKFIKEKCSAFKYMQPLILLIKLFLQARNLGDTYFG 290

Query: 1451 GLSSYCLMLLITRFLQ 1466
            G+ SY L  ++  FLQ
Sbjct: 291  GVGSYLLYCMVLSFLQ 306


>gi|389742365|gb|EIM83552.1| Nucleotidyltransferase [Stereum hirsutum FP-91666 SS1]
          Length = 540

 Score = 55.5 bits (132), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 59/257 (22%), Positives = 97/257 (37%), Gaps = 75/257 (29%)

Query: 1213 LHEEI---DSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
            LH+EI    S+     AE  AR+   +     + R     W  ++ +++GS +T L LP 
Sbjct: 52   LHDEIVAHASYILPTPAERLARQALYDQIASVLVRGK---WRNAKLHLYGSISTDLCLPR 108

Query: 1270 SDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTA 1329
            SD+DLV     +  +   +EA          ++ L  A RYL  Q+++  +         
Sbjct: 109  SDMDLVF---EIHQIADTQEAC---------KSVLFEAKRYLFEQQYLAEE--------- 147

Query: 1330 IPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSS 1389
                + VV                                H+ +  L+  + P+      
Sbjct: 148  ----LFVV-------------------------------AHARVPVLNMVSHPQTG---- 168

Query: 1390 DNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYS 1449
                     R ++        G+ T   VKE  +  PA  PL L++K  L    L  + S
Sbjct: 169  ---------RFNVDITVNGADGVPTIAPVKEYLKTMPALRPLVLLIKSLLEQHGLHSAQS 219

Query: 1450 GGLSSYCLMLLITRFLQ 1466
             GLSSYC++ ++  FLQ
Sbjct: 220  SGLSSYCVICMVISFLQ 236


>gi|50302781|ref|XP_451327.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
 gi|49640458|emb|CAH02915.1| KLLA0A07359p [Kluyveromyces lactis]
          Length = 684

 Score = 55.5 bits (132), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 56/231 (24%), Positives = 88/231 (38%), Gaps = 75/231 (32%)

Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
            N A+ ++  ++  LWP S  N FGS AT L LP SD+D VV             +G  E 
Sbjct: 213  NQAIAKLKEAVVELWPDSSLNCFGSYATDLYLPGSDIDCVVR----------SASGDKEN 262

Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
            RN +         + LA Q       ++ +    +PII  V                   
Sbjct: 263  RNALYSLASFLKRKQLATQ-------VEVIAKARVPIIKFV------------------- 296

Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
                    + ++ +H D+           S   ++ ++AA  +R                
Sbjct: 297  --------EPESKIHIDV-----------SFERTNGLEAARVIR---------------- 321

Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
                 L EQ P    L L++KQFL  R L+  ++GGL  Y ++ L+  FL+
Sbjct: 322  ---GWLEEQ-PGLRELVLIVKQFLHARRLNNVHTGGLGGYSIICLVYTFLK 368


>gi|47209824|emb|CAF91228.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 964

 Score = 55.5 bits (132), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 32/80 (40%), Positives = 47/80 (58%), Gaps = 11/80 (13%)

Query: 1213 LHEEIDSFCKQVA---AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
            LHEEI  F   ++    E   R+  +N    R+ + ++ LWP ++  IFGS +TGL LP+
Sbjct: 450  LHEEIMDFFNFISPRPEEEAMRRDVVN----RIEKVIKDLWPTAQVEIFGSFSTGLYLPT 505

Query: 1270 SDVDLVVC----LPPVRNLE 1285
            SD+DLVV      PP++ LE
Sbjct: 506  SDIDLVVFGKWDHPPLQELE 525


>gi|84995372|ref|XP_952408.1| topoisomerase-related nucleotidyltransferase [Theileria annulata
            strain Ankara]
 gi|65302569|emb|CAI74676.1| topoisomerase-related nucleotidyltransferase, putative [Theileria
            annulata]
          Length = 532

 Score = 55.1 bits (131), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 28/75 (37%), Positives = 43/75 (57%), Gaps = 2/75 (2%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            +   T VRLD+S  + S    +TT  ++E+  ++    PL L++K FL  RSL  +Y GG
Sbjct: 233  VDKRTGVRLDLSVYNDS--AFKTTKFIQEMCLKYKYMQPLILLIKLFLQSRSLGDTYFGG 290

Query: 1452 LSSYCLMLLITRFLQ 1466
            + SY L  ++  FLQ
Sbjct: 291  VGSYLLYCMVLSFLQ 305


>gi|340055136|emb|CCC49447.1| putative DNA polymerase sigma [Trypanosoma vivax Y486]
          Length = 744

 Score = 55.1 bits (131), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 28/76 (36%), Positives = 42/76 (55%), Gaps = 1/76 (1%)

Query: 1400 LDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLML 1459
            LD+     +  G + T +V +L  Q+P +  L +V+K FL  R +D+ Y GGL S+   L
Sbjct: 429  LDVDVSINAVDGSRNTSIVVDLLRQYPEARSLIVVVKYFLQQRGMDEPYHGGLGSFATTL 488

Query: 1460 LITRFL-QHEHHLGRP 1474
            L+  FL QH  +  RP
Sbjct: 489  LVISFLQQHPIYTNRP 504


>gi|149244754|ref|XP_001526920.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
            YB-4239]
 gi|146449314|gb|EDK43570.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
            YB-4239]
          Length = 664

 Score = 55.1 bits (131), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 52/231 (22%), Positives = 87/231 (37%), Gaps = 75/231 (32%)

Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
            N  V  +   + + WP +  ++FGS+AT L LP SD+D+VV          + + G  E 
Sbjct: 245  NKVVNTLKTQIALFWPGTEAHVFGSSATDLYLPGSDIDMVV----------LSDTGDYEN 294

Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
            R+ + +      A+ LA        +++ + +  +PII  V                 P 
Sbjct: 295  RSRLYQLSSFLKAKKLAT-------NVEVIASAKVPIIKFV----------------DP- 330

Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
                      D+++H D                             ISF+  +  GL   
Sbjct: 331  ----------DSNIHVD-----------------------------ISFERKN--GLDAA 349

Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
              ++      P    L LV+KQFL  R L+  + GGL  Y  +++   FL+
Sbjct: 350  RRIRRWLASTPGLRELVLVVKQFLRSRKLNNVHVGGLGGYATIIICYHFLR 400


>gi|302834665|ref|XP_002948895.1| hypothetical protein VOLCADRAFT_104084 [Volvox carteri f.
            nagariensis]
 gi|300266086|gb|EFJ50275.1| hypothetical protein VOLCADRAFT_104084 [Volvox carteri f.
            nagariensis]
          Length = 1052

 Score = 55.1 bits (131), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 67/272 (24%), Positives = 104/272 (38%), Gaps = 66/272 (24%)

Query: 1210 HSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRT---NIFGSNATGLS 1266
            +S LH  I+ FC +V      ++  +   +  +   ++ +WP SR     +FGS A GLS
Sbjct: 651  YSPLHYNIEDFCTKVVPTEGEKRQRME-VIDAIRAGVRKVWPNSRQVELQVFGSFANGLS 709

Query: 1267 LPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVE 1326
              SSD+DLVV       +EP + +G  E         L   A+  A    + +D+L   +
Sbjct: 710  TWSSDLDLVV----TGVMEPDRVSGGYE---------LADRAKITARLRKI-ADALNRAK 755

Query: 1327 NTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSH 1386
            N  I    L+                 P+       L   + V  D+   DDS  P+ + 
Sbjct: 756  NIDILRQQLI-----------------PRARIPILKLWTKSRVCVDVSVSDDSG-PRAAR 797

Query: 1387 TSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQ 1446
                  +A                              FP   PL LV+K +L    L++
Sbjct: 798  YMVQQCRA------------------------------FPPVKPLVLVVKTYLKACRLNE 827

Query: 1447 SYSGGLSSYCLMLLITRFLQHEHHLGRPINQV 1478
              +GGLSSY L  ++   LQ E   G  I+ +
Sbjct: 828  VNTGGLSSYSLTNMVIAHLQEELKSGHDISDL 859


>gi|253741392|gb|EES98263.1| Topoisomerase I-related protein [Giardia intestinalis ATCC 50581]
          Length = 524

 Score = 55.1 bits (131), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 28/80 (35%), Positives = 45/80 (56%), Gaps = 2/80 (2%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            + +  S+  DISF SP   G    + +    E+ P +  L +V+K FL  R L+Q Y+GG
Sbjct: 203  VHSELSLEYDISFNSPH--GAAHRESILNYIEKHPLARVLIMVVKSFLKKRGLNQPYTGG 260

Query: 1452 LSSYCLMLLITRFLQHEHHL 1471
            +SSY L+ LI  ++   H++
Sbjct: 261  MSSYILLQLIVVYISERHNV 280


>gi|303277243|ref|XP_003057915.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226460572|gb|EEH57866.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 658

 Score = 54.7 bits (130), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 27/74 (36%), Positives = 44/74 (59%), Gaps = 2/74 (2%)

Query: 1393 KAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGL 1452
            +A +  + DISF   +  G  + ++V+    +FPA  PL  VLK FL  R+L++ Y+GG+
Sbjct: 244  EAPSGFQFDISFDVAN--GPASAEIVRMNMRRFPALRPLTTVLKAFLQQRALNEVYTGGI 301

Query: 1453 SSYCLMLLITRFLQ 1466
             SY L+ ++   LQ
Sbjct: 302  GSYALLCMVMAHLQ 315



 Score = 51.6 bits (122), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 29/67 (43%), Positives = 40/67 (59%), Gaps = 7/67 (10%)

Query: 1213 LHEEIDSFCKQV---AAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
            LH EI  FC+ +   A E   RK     AV RV  ++  +WP+SR  + GS ATG+ LP+
Sbjct: 139  LHAEIVDFCRFLEPTAEEAEGRKA----AVDRVRAAVLSIWPQSRFEVHGSFATGMYLPN 194

Query: 1270 SDVDLVV 1276
            SD+D V+
Sbjct: 195  SDIDAVI 201


>gi|308162052|gb|EFO64479.1| Topoisomerase I-related protein [Giardia lamblia P15]
          Length = 520

 Score = 54.7 bits (130), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 28/80 (35%), Positives = 45/80 (56%), Gaps = 2/80 (2%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            + +  S+  DISF SP   G    + +    E+ P +  L +V+K FL  R L+Q Y+GG
Sbjct: 203  VHSELSLEYDISFNSPH--GAAHRETILGYIEKHPLARILIMVVKSFLKKRGLNQPYTGG 260

Query: 1452 LSSYCLMLLITRFLQHEHHL 1471
            +SSY L+ LI  ++   H++
Sbjct: 261  MSSYILLQLIVVYISERHNV 280


>gi|71030472|ref|XP_764878.1| hypothetical protein [Theileria parva strain Muguga]
 gi|68351834|gb|EAN32595.1| hypothetical protein TP02_0312 [Theileria parva]
          Length = 408

 Score = 54.7 bits (130), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 28/75 (37%), Positives = 43/75 (57%), Gaps = 2/75 (2%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            +   T VRLD+S  + S    +TT  ++E+  ++    PL L++K FL  RSL  +Y GG
Sbjct: 233  VDKRTGVRLDLSVYNDS--AFKTTKFIQEMCLKYKYMQPLILLIKLFLQSRSLGDTYFGG 290

Query: 1452 LSSYCLMLLITRFLQ 1466
            + SY L  ++  FLQ
Sbjct: 291  VGSYLLYCMVLSFLQ 305


>gi|67539048|ref|XP_663298.1| hypothetical protein AN5694.2 [Aspergillus nidulans FGSC A4]
 gi|40743597|gb|EAA62787.1| hypothetical protein AN5694.2 [Aspergillus nidulans FGSC A4]
 gi|259484832|tpe|CBF81391.1| TPA: topoisomerase family protein TRF4, putative (AFU_orthologue;
            AFUA_7G04130) [Aspergillus nidulans FGSC A4]
          Length = 694

 Score = 54.7 bits (130), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 63/261 (24%), Positives = 103/261 (39%), Gaps = 73/261 (27%)

Query: 1213 LHEEIDSF---CKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
            LH+EI SF    K V  E+  R+      V R+  + Q  +     + FGS A+GL LP+
Sbjct: 255  LHDEILSFYHWVKPVRYEHIVRQDL----VARLQAAFQSRYYGVEIHAFGSFASGLYLPN 310

Query: 1270 SDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTA 1329
            +D+DLV+     R    +K  G  +G+       +   + +L NQ      S++T+ +  
Sbjct: 311  ADIDLVLLSTSFRRTG-VKTFGERKGQ-------IYAFSAFLKNQNIAVPGSVETIAHAR 362

Query: 1330 IPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSS 1389
            +PI+  V ++                     T LK D                       
Sbjct: 363  VPILKFVDKL---------------------TGLKVD----------------------- 378

Query: 1390 DNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYS 1449
                        +SF + S  GL      ++   ++PA   +  V+KQFL  R L++  +
Sbjct: 379  ------------LSFDNDS--GLIANRTFQQWKSEYPAMPVIVSVIKQFLLLRGLNEVPT 424

Query: 1450 GGLSSYCLMLLITRFLQHEHH 1470
             GL  + +  L+T  LQH  H
Sbjct: 425  LGLGGFSITCLVTSLLQHMPH 445


>gi|395333834|gb|EJF66211.1| hypothetical protein DICSQDRAFT_152192 [Dichomitus squalens LYAD-421
            SS1]
          Length = 647

 Score = 54.7 bits (130), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 23/68 (33%), Positives = 38/68 (55%)

Query: 1399 RLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLM 1458
            R  +        G+ T  +VK+   + PA   L L++K FL+ RS+++ ++GGL SY ++
Sbjct: 269  RFSVDISVNQGNGVDTGKMVKQFLRELPALRSLVLIIKNFLSQRSMNEVFTGGLGSYSIV 328

Query: 1459 LLITRFLQ 1466
             L   FLQ
Sbjct: 329  CLAISFLQ 336


>gi|393216777|gb|EJD02267.1| hypothetical protein FOMMEDRAFT_141374 [Fomitiporia mediterranea
            MF3/22]
          Length = 732

 Score = 54.3 bits (129), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 25/67 (37%), Positives = 40/67 (59%), Gaps = 2/67 (2%)

Query: 1400 LDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLML 1459
            +DIS    +  G++T D++     +FPA   + L++K FL  R+L++ YSGGL SY ++ 
Sbjct: 263  VDISMNQSN--GVKTGDMINRFLNEFPALRAIVLIVKSFLKQRNLNEVYSGGLGSYAIVC 320

Query: 1460 LITRFLQ 1466
            L    LQ
Sbjct: 321  LAVSHLQ 327


>gi|255710469|ref|XP_002551518.1| KLTH0A01276p [Lachancea thermotolerans]
 gi|238932895|emb|CAR21076.1| KLTH0A01276p [Lachancea thermotolerans CBS 6340]
          Length = 609

 Score = 54.3 bits (129), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 53/231 (22%), Positives = 92/231 (39%), Gaps = 75/231 (32%)

Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
            N  + ++  +++ LWP +  ++FGS AT L LP SD+D V+             +G  E 
Sbjct: 191  NSTISKIRNAVKDLWPDADLHVFGSFATDLYLPGSDIDCVIN----------SSSGDKEN 240

Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
            RN     CL   A +L  ++      ++ +    +PII  V  +                
Sbjct: 241  RN-----CLYSLASFLKRRKLAT--QVEVISKARVPIIKFVEPI---------------- 277

Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
                       + +H D+           S    + ++AA  +R            L+ T
Sbjct: 278  -----------SQIHIDI-----------SFERVNGLEAARVIR----------GWLKET 305

Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
              ++EL           L++KQFLA R L+  ++GGL  + ++ L+  FLQ
Sbjct: 306  PGLRELV----------LIVKQFLAARRLNMVHTGGLGGFSIICLVYAFLQ 346


>gi|242775359|ref|XP_002478628.1| topoisomerase family protein TRF4, putative [Talaromyces stipitatus
            ATCC 10500]
 gi|218722247|gb|EED21665.1| topoisomerase family protein TRF4, putative [Talaromyces stipitatus
            ATCC 10500]
          Length = 692

 Score = 54.3 bits (129), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 28/81 (34%), Positives = 47/81 (58%), Gaps = 3/81 (3%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            I   T +R+D++F + S  G++     ++  E +PA   +  V+KQFL  RSL++  +GG
Sbjct: 403  IDRLTGLRVDMAFDNDS--GIRAIQTFEKWKEAYPAMPAIVAVIKQFLLLRSLNEVPTGG 460

Query: 1452 LSSYCLMLLITRFLQH-EHHL 1471
            L  + ++ L+   LQH  HHL
Sbjct: 461  LGGFAIICLVVSLLQHMPHHL 481


>gi|145497035|ref|XP_001434507.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124401633|emb|CAK67110.1| unnamed protein product [Paramecium tetraurelia]
          Length = 648

 Score = 54.3 bits (129), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 54/267 (20%), Positives = 102/267 (38%), Gaps = 74/267 (27%)

Query: 1200 PTRKASLSLMHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFG 1259
            P  + SL  +   L  EI++F +++      ++P I+  V+ V  ++Q +  +SR  ++G
Sbjct: 304  PQHRISLDDIVKQLSFEINTFSQELNKLLEEQRPIIDKIVQMVDETVQSVCSKSRAFLYG 363

Query: 1260 SNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKS 1319
            S  TG  L   D D+ + +                  N  +   L   A     Q ++K 
Sbjct: 364  SCQTG--LNLLDSDIDIVIET----------------NEQERISLYKIAEQFKTQGFIKE 405

Query: 1320 DSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDS 1379
              +K ++N   P++ +                                            
Sbjct: 406  --VKVIDNARKPVLKM-------------------------------------------- 419

Query: 1380 ASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFL 1439
               +CS    + +       +DI+     H+G +T + + E  ++F     LAL+LK + 
Sbjct: 420  ---QCSQEFQNKL-------IDITISKNDHSGRKTANSMIEFQKEFKQFKSLALILKFYF 469

Query: 1440 ADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
               +L  +Y GGL+SYC++ +I   LQ
Sbjct: 470  KSINLLNAYQGGLNSYCILTMILALLQ 496


>gi|392567029|gb|EIW60204.1| hypothetical protein TRAVEDRAFT_164816 [Trametes versicolor FP-101664
            SS1]
          Length = 660

 Score = 54.3 bits (129), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 24/71 (33%), Positives = 39/71 (54%)

Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
            T  R  +        G+    +VK+  E+ PA   L L++K FL+ RS+++ ++GGL SY
Sbjct: 276  THGRFSVDISINQGNGVTAGKMVKQFLEELPALRSLVLIIKSFLSQRSMNEVFTGGLGSY 335

Query: 1456 CLMLLITRFLQ 1466
             ++ L   FLQ
Sbjct: 336  SIVCLAISFLQ 346


>gi|302508901|ref|XP_003016411.1| hypothetical protein ARB_05810 [Arthroderma benhamiae CBS 112371]
 gi|291179980|gb|EFE35766.1| hypothetical protein ARB_05810 [Arthroderma benhamiae CBS 112371]
          Length = 577

 Score = 53.9 bits (128), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 28/84 (33%), Positives = 47/84 (55%), Gaps = 3/84 (3%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            + A T +++D+SF + S  GL   D  K   +++PA   +A ++K FL  R L+   +GG
Sbjct: 337  VDALTGLKVDLSFDNSS--GLAAIDTFKAWKQEYPAMPVIASIIKHFLLIRGLNDVATGG 394

Query: 1452 LSSYCLMLLITRFLQHE-HHLGRP 1474
            L  + ++ L+T  LQH    +G P
Sbjct: 395  LGGFSIICLVTSILQHSPRGIGEP 418


>gi|389585006|dbj|GAB67737.1| hypothetical protein PCYB_123030 [Plasmodium cynomolgi strain B]
          Length = 853

 Score = 53.9 bits (128), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 27/71 (38%), Positives = 43/71 (60%), Gaps = 2/71 (2%)

Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
            T V++D+     S    QTTD +K   E++    PL ++LK FL  R+L+++Y GG+ S+
Sbjct: 527  TGVQVDVCTNQVSSK--QTTDFIKSKVEKYLYLRPLVILLKFFLNTRNLNETYIGGIGSF 584

Query: 1456 CLMLLITRFLQ 1466
             L  ++T FLQ
Sbjct: 585  LLCCMVTHFLQ 595


>gi|328772133|gb|EGF82172.1| hypothetical protein BATDEDRAFT_23561 [Batrachochytrium dendrobatidis
            JAM81]
          Length = 752

 Score = 53.9 bits (128), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 40/137 (29%), Positives = 68/137 (49%), Gaps = 32/137 (23%)

Query: 1211 SLLHEEIDSFCKQV---AAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSL 1267
            ++L++EID +   V    AE++ RK      + RV + ++ +W  +  ++FGS  T L L
Sbjct: 176  TMLNQEIDDYVDYVRPTEAEHSLRK----LTIARVRKIVKQIWADAEVHVFGSFQTKLYL 231

Query: 1268 PSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSL---KT 1324
            PSSDVD+VV                  G + +   CL+  A+      + K+D+L   + 
Sbjct: 232  PSSDVDIVVV-----------------GDSCVLPKCLRQLAK-----AFEKADTLSRMEV 269

Query: 1325 VENTAIPIIMLVVEVPH 1341
            +E T +PII  V ++ H
Sbjct: 270  IEKTKVPIIKGVDKLTH 286


>gi|83763953|emb|CAJ12156.1| cg11265 protein [Drosophila simulans]
          Length = 279

 Score = 53.9 bits (128), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 34/80 (42%), Positives = 45/80 (56%), Gaps = 11/80 (13%)

Query: 1213 LHEEIDSFCKQV---AAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
            LHEEI+ F + V     E+  R    N  VKR+   +  +WP++   IFGS  TGL LP+
Sbjct: 178  LHEEIEHFYQYVLPTPCEHAIR----NEVVKRIEAVVHSIWPQAVVEIFGSFRTGLFLPT 233

Query: 1270 SDVDLVVC----LPPVRNLE 1285
            SD+DLVV       P+R LE
Sbjct: 234  SDIDLVVLGLWEKLPLRTLE 253


>gi|323449959|gb|EGB05843.1| hypothetical protein AURANDRAFT_66064 [Aureococcus anophagefferens]
          Length = 1630

 Score = 53.9 bits (128), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 23/64 (35%), Positives = 37/64 (57%)

Query: 1398 VRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCL 1457
            V++DIS     H+GL    L + L  + P   P AL ++  LA   L+ +Y+GGL SY +
Sbjct: 675  VKVDISLDCAGHSGLAAAALCERLVSRLPELAPAALAVRCLLAAAGLNDAYTGGLPSYAV 734

Query: 1458 MLLI 1461
            +L++
Sbjct: 735  LLMV 738


>gi|296821440|ref|XP_002850129.1| Poly(A) RNA polymerase cid14 [Arthroderma otae CBS 113480]
 gi|238837683|gb|EEQ27345.1| Poly(A) RNA polymerase cid14 [Arthroderma otae CBS 113480]
          Length = 620

 Score = 53.9 bits (128), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 25/76 (32%), Positives = 44/76 (57%), Gaps = 2/76 (2%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            +   T +++D+SF +  ++GL   D  K   E++PA   +A ++K FL  R L+   +GG
Sbjct: 380  VDGLTGLKVDLSFDN--NSGLAAIDTFKAWKEEYPAMPVIASIIKHFLLIRGLNDVATGG 437

Query: 1452 LSSYCLMLLITRFLQH 1467
            L  + ++ L+T  LQH
Sbjct: 438  LGGFSIICLVTSILQH 453


>gi|302657626|ref|XP_003020531.1| hypothetical protein TRV_05360 [Trichophyton verrucosum HKI 0517]
 gi|291184373|gb|EFE39913.1| hypothetical protein TRV_05360 [Trichophyton verrucosum HKI 0517]
          Length = 577

 Score = 53.9 bits (128), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 26/76 (34%), Positives = 44/76 (57%), Gaps = 2/76 (2%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            + A T +++D+SF + S  GL   D  K   +++PA   +A ++K FL  R L+   +GG
Sbjct: 337  VDALTGLKVDLSFDNSS--GLAAIDTFKAWKQEYPAMPVIASIIKHFLLIRGLNDVATGG 394

Query: 1452 LSSYCLMLLITRFLQH 1467
            L  + ++ L+T  LQH
Sbjct: 395  LGGFSIICLVTSILQH 410


>gi|326479586|gb|EGE03596.1| topoisomerase TRF4 [Trichophyton equinum CBS 127.97]
          Length = 631

 Score = 53.9 bits (128), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 26/76 (34%), Positives = 44/76 (57%), Gaps = 2/76 (2%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            + A T +++D+SF + S  GL   D  K   +++PA   +A ++K FL  R L+   +GG
Sbjct: 391  VDALTGLKVDLSFDNSS--GLAAIDTFKAWKQEYPAMPVIASIIKHFLLIRGLNDVATGG 448

Query: 1452 LSSYCLMLLITRFLQH 1467
            L  + ++ L+T  LQH
Sbjct: 449  LGGFSIICLVTSILQH 464


>gi|326470670|gb|EGD94679.1| topoisomerase TRF4 [Trichophyton tonsurans CBS 112818]
          Length = 631

 Score = 53.9 bits (128), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 26/76 (34%), Positives = 44/76 (57%), Gaps = 2/76 (2%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            + A T +++D+SF + S  GL   D  K   +++PA   +A ++K FL  R L+   +GG
Sbjct: 391  VDALTGLKVDLSFDNSS--GLAAIDTFKAWKQEYPAMPVIASIIKHFLLIRGLNDVATGG 448

Query: 1452 LSSYCLMLLITRFLQH 1467
            L  + ++ L+T  LQH
Sbjct: 449  LGGFSIICLVTSILQH 464


>gi|398406264|ref|XP_003854598.1| topoisomerase-related protein, partial [Zymoseptoria tritici IPO323]
 gi|339474481|gb|EGP89574.1| topoisomerase-related protein [Zymoseptoria tritici IPO323]
          Length = 565

 Score = 53.5 bits (127), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 25/75 (33%), Positives = 46/75 (61%), Gaps = 2/75 (2%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            I A T +++DISF++ S  GL+  +  +   EQFP    L  ++KQFL  R++++ ++GG
Sbjct: 362  IDAKTGLKVDISFENDS--GLRAIETFEAWKEQFPEMPVLVTIIKQFLLMRNMNEVHTGG 419

Query: 1452 LSSYCLMLLITRFLQ 1466
            +  + ++ L+   LQ
Sbjct: 420  IGGFTIICLVVSMLQ 434


>gi|327307976|ref|XP_003238679.1| topoisomerase TRF4 [Trichophyton rubrum CBS 118892]
 gi|326458935|gb|EGD84388.1| topoisomerase TRF4 [Trichophyton rubrum CBS 118892]
          Length = 631

 Score = 53.5 bits (127), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 26/76 (34%), Positives = 44/76 (57%), Gaps = 2/76 (2%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            + A T +++D+SF + S  GL   D  K   +++PA   +A ++K FL  R L+   +GG
Sbjct: 391  VDALTGLKVDLSFDNSS--GLTAIDTFKAWKQEYPAMPVIASIIKHFLLIRGLNDVATGG 448

Query: 1452 LSSYCLMLLITRFLQH 1467
            L  + ++ L+T  LQH
Sbjct: 449  LGGFSIICLVTSILQH 464


>gi|432884542|ref|XP_004074488.1| PREDICTED: uncharacterized protein LOC101158959 [Oryzias latipes]
          Length = 421

 Score = 53.5 bits (127), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 33/80 (41%), Positives = 45/80 (56%), Gaps = 11/80 (13%)

Query: 1213 LHEEIDSFCKQVA---AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
            LHEE+  F K ++    E   RK      VKR+   ++  WP +   IFGS +TGL LP+
Sbjct: 242  LHEEVMDFYKYMSPRPEEAAMRKE----VVKRIETIIKEQWPSADVQIFGSFSTGLYLPT 297

Query: 1270 SDVDLVVC----LPPVRNLE 1285
            SD+DLVV      PP++ LE
Sbjct: 298  SDIDLVVFGKWERPPLQELE 317


>gi|353241543|emb|CCA73351.1| related to TRF4-topoisomerase I-related protein [Piriformospora
            indica DSM 11827]
          Length = 628

 Score = 53.5 bits (127), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 26/67 (38%), Positives = 37/67 (55%), Gaps = 2/67 (2%)

Query: 1400 LDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLML 1459
            +DIS      TG+    ++    E FPA  PL +V+K FL  R +++ Y GGL SY ++ 
Sbjct: 209  VDISLNQT--TGVSAIPVINRYLEHFPALRPLIMVVKAFLNQRGMNEVYKGGLGSYSIIC 266

Query: 1460 LITRFLQ 1466
            L   FLQ
Sbjct: 267  LAISFLQ 273


>gi|84468450|dbj|BAE71308.1| hypothetical protein [Trifolium pratense]
          Length = 518

 Score = 53.5 bits (127), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 28/89 (31%), Positives = 48/89 (53%), Gaps = 10/89 (11%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            ++  + +  DISF      G +  + ++E   ++P   PL L+LK FL  R L++ YSGG
Sbjct: 215  VEKKSGLSFDISFDI--DNGPKAAEYIQEAVAKWPQLRPLCLILKVFLQQRELNEVYSGG 272

Query: 1452 LSSYCLMLLITRFLQH--------EHHLG 1472
            + SY L+ ++   L++        EH+LG
Sbjct: 273  IGSYALLTMLMAMLRNVRQSQPTAEHNLG 301


>gi|323352825|gb|EGA85127.1| Trf5p [Saccharomyces cerevisiae VL3]
          Length = 510

 Score = 53.5 bits (127), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 54/230 (23%), Positives = 92/230 (40%), Gaps = 75/230 (32%)

Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
            N  + ++ R+++ LW  +  ++FGS AT L LP SD+D   C+   RN +        E 
Sbjct: 164  NRTIDKLRRAVKELWSDADLHVFGSFATDLYLPGSDID---CVVNSRNRDK-------ED 213

Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
            RN I E      AR+L N+    +  ++ +  T +PII  +                 P+
Sbjct: 214  RNYIYEL-----ARHLKNKGL--AIRMEVIVKTRVPIIKFI----------------EPQ 250

Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
                       + +H D+           S   ++ ++AA  +R                
Sbjct: 251  -----------SQLHIDV-----------SFERTNGLEAAKLIR---------------- 272

Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFL 1465
                E     P    L L++KQFL  R L+  ++GGL  + ++ L+  FL
Sbjct: 273  ----EWLRDSPGLRELVLIIKQFLHSRRLNNVHTGGLGGFTVICLVYSFL 318


>gi|156837261|ref|XP_001642660.1| hypothetical protein Kpol_1076p8 [Vanderwaltozyma polyspora DSM
            70294]
 gi|156113216|gb|EDO14802.1| hypothetical protein Kpol_1076p8 [Vanderwaltozyma polyspora DSM
            70294]
          Length = 524

 Score = 53.5 bits (127), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 56/244 (22%), Positives = 99/244 (40%), Gaps = 76/244 (31%)

Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
            N  + ++  ++Q  WP +  ++FGS AT L LP SD+D VV            +AG  + 
Sbjct: 137  NQTIGKLRDAVQHHWPDANLHVFGSYATDLYLPGSDIDCVVN----------SKAGDKQS 186

Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
            RN     CL   A +L  +    ++ ++ +    +PII  V  +                
Sbjct: 187  RN-----CLYSLASHLKKEGL--AEDIEIIAKARVPIIKFVEPL---------------- 223

Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
                       + +H D+           S   ++ ++AA  +R            L +T
Sbjct: 224  -----------SKIHVDV-----------SFERTNGLEAAKLIR----------GWLDST 251

Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ-HEHHLGRP 1474
            + ++EL           L++KQFL  R L++ ++GGL  + ++ L+  FL  H   L   
Sbjct: 252  NGLRELV----------LIVKQFLQARRLNKVHTGGLGGFSIICLVYSFLHLHPRILANE 301

Query: 1475 INQV 1478
            IN +
Sbjct: 302  INPI 305


>gi|323335976|gb|EGA77253.1| Trf5p [Saccharomyces cerevisiae Vin13]
          Length = 576

 Score = 53.1 bits (126), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 54/230 (23%), Positives = 92/230 (40%), Gaps = 75/230 (32%)

Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
            N  + ++ R+++ LW  +  ++FGS AT L LP SD+D   C+   RN +        E 
Sbjct: 198  NRTIDKLRRAVKELWSDADLHVFGSFATDLYLPGSDID---CVVNSRNRDK-------ED 247

Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
            RN I E      AR+L N+    +  ++ +  T +PII  +                 P+
Sbjct: 248  RNYIYE-----LARHLKNKGL--AIRMEVIVKTRVPIIKFI----------------EPQ 284

Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
                       + +H D+           S   ++ ++AA  +R                
Sbjct: 285  -----------SQLHIDV-----------SFERTNGLEAAKLIR---------------- 306

Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFL 1465
                E     P    L L++KQFL  R L+  ++GGL  + ++ L+  FL
Sbjct: 307  ----EWLRDSPGLRELVLIIKQFLHSRRLNNVHTGGLGGFTVICLVYSFL 352


>gi|1493831|gb|AAC49397.1| Trf5p [Saccharomyces cerevisiae]
          Length = 625

 Score = 53.1 bits (126), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 54/230 (23%), Positives = 92/230 (40%), Gaps = 75/230 (32%)

Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
            N  + ++ R+++ LW  +  ++FGS AT L LP SD+D   C+   RN +        E 
Sbjct: 198  NRTIDKLRRAVKELWSDADLHVFGSFATDLYLPGSDID---CVVNSRNRDK-------ED 247

Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
            RN I E      AR+L N+    +  ++ +  T +PII  +                 P+
Sbjct: 248  RNYIYE-----LARHLKNKGL--AIRMEVIVKTRVPIIKFI----------------EPQ 284

Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
                       + +H D+           S   ++ ++AA  +R                
Sbjct: 285  -----------SQLHIDV-----------SFERTNGLEAAKLIR---------------- 306

Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFL 1465
                E     P    L L++KQFL  R L+  ++GGL  + ++ L+  FL
Sbjct: 307  ----EWLRDSPGLRELVLIIKQFLHSRRLNNVHTGGLGGFTVICLVYSFL 352


>gi|392296994|gb|EIW08095.1| Trf5p [Saccharomyces cerevisiae CEN.PK113-7D]
          Length = 608

 Score = 53.1 bits (126), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 54/230 (23%), Positives = 91/230 (39%), Gaps = 75/230 (32%)

Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
            N  + ++ R+++ LW  +  ++FGS AT L LP SD+D   C+   RN +        E 
Sbjct: 164  NRTIDKLRRAVKELWSDADLHVFGSFATDLYLPGSDID---CVVNSRNRDK-------ED 213

Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
            RN I E      AR+L N+       ++ +  T +PII  +                 P+
Sbjct: 214  RNYIYEL-----ARHLKNKGLAI--RMEVIVKTRVPIIKFI----------------EPQ 250

Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
                       + +H D+           S   ++ ++AA  +R                
Sbjct: 251  -----------SQLHIDV-----------SFERTNGLEAAKLIR---------------- 272

Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFL 1465
                E     P    L L++KQFL  R L+  ++GGL  + ++ L+  FL
Sbjct: 273  ----EWLRDSPGLRELVLIIKQFLHSRRLNNVHTGGLGGFTVICLVYSFL 318


>gi|207341968|gb|EDZ69877.1| YNL299Wp-like protein [Saccharomyces cerevisiae AWRI1631]
          Length = 642

 Score = 53.1 bits (126), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 54/230 (23%), Positives = 92/230 (40%), Gaps = 75/230 (32%)

Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
            N  + ++ R+++ LW  +  ++FGS AT L LP SD+D   C+   RN +        E 
Sbjct: 198  NRTIDKLRRAVKELWSDADLHVFGSFATDLYLPGSDID---CVVNSRNRDK-------ED 247

Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
            RN I E      AR+L N+    +  ++ +  T +PII  +                 P+
Sbjct: 248  RNYIYE-----LARHLKNKGL--AIRMEVIVKTRVPIIKFI----------------EPQ 284

Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
                       + +H D+           S   ++ ++AA  +R                
Sbjct: 285  -----------SQLHIDV-----------SFERTNGLEAAKLIR---------------- 306

Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFL 1465
                E     P    L L++KQFL  R L+  ++GGL  + ++ L+  FL
Sbjct: 307  ----EWLRDSPGLRELVLIIKQFLHSRRLNNVHTGGLGGFTVICLVYSFL 352


>gi|403419742|emb|CCM06442.1| predicted protein [Fibroporia radiculosa]
          Length = 1487

 Score = 53.1 bits (126), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 25/76 (32%), Positives = 42/76 (55%), Gaps = 2/76 (2%)

Query: 1398 VRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCL 1457
            + +DIS       G+    ++K    + PA   L L++K FL+ RS+++ Y+GGL SY +
Sbjct: 268  IPVDISINQ--GNGVTAGTMIKHFLAELPALRSLVLIVKSFLSQRSMNEVYTGGLGSYSI 325

Query: 1458 MLLITRFLQHEHHLGR 1473
            + L+  FLQ    + R
Sbjct: 326  VCLVISFLQMHPKIRR 341


>gi|1050861|gb|AAC49099.1| Ynl0440p [Saccharomyces cerevisiae]
 gi|1302392|emb|CAA96217.1| TRF5 [Saccharomyces cerevisiae]
          Length = 625

 Score = 53.1 bits (126), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 54/230 (23%), Positives = 92/230 (40%), Gaps = 75/230 (32%)

Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
            N  + ++ R+++ LW  +  ++FGS AT L LP SD+D   C+   RN +        E 
Sbjct: 198  NRTIDKLRRAVKELWSDADLHVFGSFATDLYLPGSDID---CVVNSRNRDK-------ED 247

Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
            RN I E      AR+L N+    +  ++ +  T +PII  +                 P+
Sbjct: 248  RNYIYE-----LARHLKNKGL--AIRMEVIVKTRVPIIKFI----------------EPQ 284

Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
                       + +H D+           S   ++ ++AA  +R                
Sbjct: 285  -----------SQLHIDV-----------SFERTNGLEAAKLIR---------------- 306

Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFL 1465
                E     P    L L++KQFL  R L+  ++GGL  + ++ L+  FL
Sbjct: 307  ----EWLRDSPGLRELVLIIKQFLHSRRLNNVHTGGLGGFTVICLVYSFL 352


>gi|323346953|gb|EGA81231.1| Trf5p [Saccharomyces cerevisiae Lalvin QA23]
          Length = 616

 Score = 53.1 bits (126), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 54/230 (23%), Positives = 92/230 (40%), Gaps = 75/230 (32%)

Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
            N  + ++ R+++ LW  +  ++FGS AT L LP SD+D   C+   RN +        E 
Sbjct: 198  NRTIDKLRRAVKELWSDADLHVFGSFATDLYLPGSDID---CVVNSRNRDK-------ED 247

Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
            RN I E      AR+L N+    +  ++ +  T +PII  +                 P+
Sbjct: 248  RNYIYE-----LARHLKNKGL--AIRMEVIVKTRVPIIKFI----------------EPQ 284

Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
                       + +H D+           S   ++ ++AA  +R                
Sbjct: 285  -----------SQLHIDV-----------SFERTNGLEAAKLIR---------------- 306

Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFL 1465
                E     P    L L++KQFL  R L+  ++GGL  + ++ L+  FL
Sbjct: 307  ----EWLRDSPGLRELVLIIKQFLHSRRLNNVHTGGLGGFTVICLVYSFL 352


>gi|301615931|ref|XP_002937419.1| PREDICTED: PAP-associated domain-containing protein 5-like [Xenopus
            (Silurana) tropicalis]
          Length = 572

 Score = 53.1 bits (126), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 28/64 (43%), Positives = 37/64 (57%), Gaps = 1/64 (1%)

Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
            LHEEI  F K ++      K  +   V R+   ++ LWP +   IFGS  TGL LP+SD+
Sbjct: 156  LHEEIMDFYKYMSPRPEEEKMRME-VVNRIENVIKELWPNADVQIFGSFKTGLYLPTSDI 214

Query: 1273 DLVV 1276
            DLVV
Sbjct: 215  DLVV 218



 Score = 46.6 bits (109), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 20/43 (46%), Positives = 28/43 (65%)

Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            ++P    L LVLKQFL  R L++ ++GG+ SY L L+   FLQ
Sbjct: 253  KYPVLPYLVLVLKQFLLQRDLNEVFTGGIGSYSLFLMAVSFLQ 295


>gi|256271295|gb|EEU06367.1| Trf5p [Saccharomyces cerevisiae JAY291]
 gi|349580652|dbj|GAA25811.1| K7_Trf5p [Saccharomyces cerevisiae Kyokai no. 7]
          Length = 642

 Score = 53.1 bits (126), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 54/230 (23%), Positives = 92/230 (40%), Gaps = 75/230 (32%)

Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
            N  + ++ R+++ LW  +  ++FGS AT L LP SD+D   C+   RN +        E 
Sbjct: 198  NRTIDKLRRAVKELWSDADLHVFGSFATDLYLPGSDID---CVVNSRNRDK-------ED 247

Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
            RN I E      AR+L N+    +  ++ +  T +PII  +                 P+
Sbjct: 248  RNYIYE-----LARHLKNKGL--AIRMEVIVKTRVPIIKFI----------------EPQ 284

Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
                       + +H D+           S   ++ ++AA  +R                
Sbjct: 285  -----------SQLHIDV-----------SFERTNGLEAAKLIR---------------- 306

Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFL 1465
                E     P    L L++KQFL  R L+  ++GGL  + ++ L+  FL
Sbjct: 307  ----EWLRDSPGLRELVLIIKQFLHSRRLNNVHTGGLGGFTVICLVYSFL 352


>gi|190409265|gb|EDV12530.1| hypothetical protein SCRG_03424 [Saccharomyces cerevisiae RM11-1a]
          Length = 642

 Score = 53.1 bits (126), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 54/230 (23%), Positives = 92/230 (40%), Gaps = 75/230 (32%)

Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
            N  + ++ R+++ LW  +  ++FGS AT L LP SD+D   C+   RN +        E 
Sbjct: 198  NRTIDKLRRAVKELWSDADLHVFGSFATDLYLPGSDID---CVVNSRNRDK-------ED 247

Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
            RN I E      AR+L N+    +  ++ +  T +PII  +                 P+
Sbjct: 248  RNYIYE-----LARHLKNKGL--AIRMEVIVKTRVPIIKFI----------------EPQ 284

Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
                       + +H D+           S   ++ ++AA  +R                
Sbjct: 285  -----------SQLHIDV-----------SFERTNGLEAAKLIR---------------- 306

Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFL 1465
                E     P    L L++KQFL  R L+  ++GGL  + ++ L+  FL
Sbjct: 307  ----EWLRDSPGLRELVLIIKQFLHSRRLNNVHTGGLGGFTVICLVYSFL 352


>gi|81360384|ref|NP_014100.2| non-canonical poly(A) polymerase TRF5 [Saccharomyces cerevisiae
            S288c]
 gi|148887014|sp|P48561.2|TRF5_YEAST RecName: Full=Poly(A) RNA polymerase protein 1; AltName:
            Full=Topoisomerase 1-related protein TRF5
 gi|151944249|gb|EDN62528.1| DNA polymerase sigma [Saccharomyces cerevisiae YJM789]
 gi|285814367|tpg|DAA10261.1| TPA: non-canonical poly(A) polymerase TRF5 [Saccharomyces cerevisiae
            S288c]
          Length = 642

 Score = 53.1 bits (126), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 54/230 (23%), Positives = 92/230 (40%), Gaps = 75/230 (32%)

Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
            N  + ++ R+++ LW  +  ++FGS AT L LP SD+D   C+   RN +        E 
Sbjct: 198  NRTIDKLRRAVKELWSDADLHVFGSFATDLYLPGSDID---CVVNSRNRDK-------ED 247

Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
            RN I E      AR+L N+    +  ++ +  T +PII  +                 P+
Sbjct: 248  RNYIYE-----LARHLKNKGL--AIRMEVIVKTRVPIIKFI----------------EPQ 284

Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
                       + +H D+           S   ++ ++AA  +R                
Sbjct: 285  -----------SQLHIDV-----------SFERTNGLEAAKLIR---------------- 306

Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFL 1465
                E     P    L L++KQFL  R L+  ++GGL  + ++ L+  FL
Sbjct: 307  ----EWLRDSPGLRELVLIIKQFLHSRRLNNVHTGGLGGFTVICLVYSFL 352


>gi|365763604|gb|EHN05131.1| Trf5p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
          Length = 642

 Score = 53.1 bits (126), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 54/230 (23%), Positives = 92/230 (40%), Gaps = 75/230 (32%)

Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
            N  + ++ R+++ LW  +  ++FGS AT L LP SD+D   C+   RN +        E 
Sbjct: 198  NRTIDKLRRAVKELWSDADLHVFGSFATDLYLPGSDID---CVVNSRNRDK-------ED 247

Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
            RN I E      AR+L N+    +  ++ +  T +PII  +                 P+
Sbjct: 248  RNYIYE-----LARHLKNKGL--AIRMEVIVKTRVPIIKFI----------------EPQ 284

Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
                       + +H D+           S   ++ ++AA  +R                
Sbjct: 285  -----------SQLHIDV-----------SFERTNGLEAAKLIR---------------- 306

Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFL 1465
                E     P    L L++KQFL  R L+  ++GGL  + ++ L+  FL
Sbjct: 307  ----EWLRDSPGLRELVLIIKQFLHSRRLNNVHTGGLGGFTVICLVYSFL 352


>gi|302148910|pdb|3NYB|A Chain A, Structure And Function Of The Polymerase Core Of Tramp, A
            Rna Surveillance Complex
          Length = 323

 Score = 53.1 bits (126), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 55/234 (23%), Positives = 85/234 (36%), Gaps = 76/234 (32%)

Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
            N  +  +  +++ LWP +  ++FGS +T L LP SD+D VV            E G  E 
Sbjct: 43   NQTISTIREAVKQLWPDADLHVFGSYSTDLYLPGSDIDCVVT----------SELGGKES 92

Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
            RN +         + LA +       ++ V    +PII  V   PH  I  A        
Sbjct: 93   RNNLYSLASHLKKKNLATE-------VEVVAKARVPIIKFV--EPHSGIHIA-------- 135

Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
                                         S   ++ I+AA  +R                
Sbjct: 136  ----------------------------VSFERTNGIEAAKLIR---------------- 151

Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQHEH 1469
                E  +  P    L L++KQFL  R L+  ++GGL  + ++ L+  FL H H
Sbjct: 152  ----EWLDDTPGLRELVLIVKQFLHARRLNNVHTGGLGGFSIICLVFSFL-HMH 200


>gi|323331833|gb|EGA73245.1| Trf5p [Saccharomyces cerevisiae AWRI796]
          Length = 453

 Score = 53.1 bits (126), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 54/230 (23%), Positives = 92/230 (40%), Gaps = 75/230 (32%)

Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
            N  + ++ R+++ LW  +  ++FGS AT L LP SD+D   C+   RN +        E 
Sbjct: 198  NRTIDKLRRAVKELWSDADLHVFGSFATDLYLPGSDID---CVVNSRNRDK-------ED 247

Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
            RN I E      AR+L N+    +  ++ +  T +PII  +                 P+
Sbjct: 248  RNYIYE-----LARHLKNKGL--AIRMEVIVKTRVPIIKFI----------------EPQ 284

Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
                       + +H D+           S   ++ ++AA  +R                
Sbjct: 285  -----------SQLHIDV-----------SFERTNGLEAAKLIR---------------- 306

Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFL 1465
                E     P    L L++KQFL  R L+  ++GGL  + ++ L+  FL
Sbjct: 307  ----EWLRDSPGLRELVLIIKQFLHSRRLNNVHTGGLGGFTVICLVYSFL 352


>gi|323307579|gb|EGA60848.1| Trf5p [Saccharomyces cerevisiae FostersO]
          Length = 575

 Score = 53.1 bits (126), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 54/230 (23%), Positives = 92/230 (40%), Gaps = 75/230 (32%)

Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
            N  + ++ R+++ LW  +  ++FGS AT L LP SD+D   C+   RN +        E 
Sbjct: 131  NRTIDKLRRAVKELWSDADLHVFGSFATDLYLPGSDID---CVVNSRNRDK-------ED 180

Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
            RN I E      AR+L N+    +  ++ +  T +PII  +                 P+
Sbjct: 181  RNYIYE-----LARHLKNKGL--AIRMEVIVKTRVPIIKFI----------------EPQ 217

Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
                       + +H D+           S   ++ ++AA  +R                
Sbjct: 218  -----------SQLHIDV-----------SFERTNGLEAAKLIR---------------- 239

Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFL 1465
                E     P    L L++KQFL  R L+  ++GGL  + ++ L+  FL
Sbjct: 240  ----EWLRDSPGLRELVLIIKQFLHSRRLNNVHTGGLGGFTVICLVYSFL 285


>gi|259149072|emb|CAY82314.1| Trf5p [Saccharomyces cerevisiae EC1118]
          Length = 642

 Score = 53.1 bits (126), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 54/230 (23%), Positives = 92/230 (40%), Gaps = 75/230 (32%)

Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
            N  + ++ R+++ LW  +  ++FGS AT L LP SD+D   C+   RN +        E 
Sbjct: 198  NRTIDKLRRAVKELWSDADLHVFGSFATDLYLPGSDID---CVVNSRNRDK-------ED 247

Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
            RN I E      AR+L N+    +  ++ +  T +PII  +                 P+
Sbjct: 248  RNYIYE-----LARHLKNKGL--AIRMEVIVKTRVPIIKFI----------------EPQ 284

Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
                       + +H D+           S   ++ ++AA  +R                
Sbjct: 285  -----------SQLHIDV-----------SFERTNGLEAAKLIR---------------- 306

Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFL 1465
                E     P    L L++KQFL  R L+  ++GGL  + ++ L+  FL
Sbjct: 307  ----EWLRDSPGLRELVLIIKQFLHSRRLNNVHTGGLGGFTVICLVYSFL 352


>gi|302847239|ref|XP_002955154.1| hypothetical protein VOLCADRAFT_96037 [Volvox carteri f. nagariensis]
 gi|300259446|gb|EFJ43673.1| hypothetical protein VOLCADRAFT_96037 [Volvox carteri f. nagariensis]
          Length = 1983

 Score = 52.8 bits (125), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 43/131 (32%), Positives = 63/131 (48%), Gaps = 14/131 (10%)

Query: 1211 SLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSS 1270
            SLLHEE+ +F ++ A         ++  +  V R    +WP +R  +FGS A GL+LP S
Sbjct: 1276 SLLHEEVAAFARESAPREDELL-ALSRGLAAVQRVATGIWPTARVVLFGSQANGLALPGS 1334

Query: 1271 DVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDS----LKTVE 1326
            D+DLVV L    N+     A   EG +G+       AAR+L        D+      TV 
Sbjct: 1335 DLDLVV-LGVTPNM-----ARAAEGLSGLARAT---AARHLETLAHRLRDAGLTRRHTVV 1385

Query: 1327 NTAIPIIMLVV 1337
               +PI+  V+
Sbjct: 1386 KAKVPIVKTVL 1396


>gi|256818788|ref|NP_001157971.1| PAP-associated domain-containing protein 5 isoform b [Mus musculus]
          Length = 637

 Score = 52.8 bits (125), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 48/165 (29%), Positives = 71/165 (43%), Gaps = 44/165 (26%)

Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
            LHEEI  F + ++      K  +   V R+   ++ LWP +   IFGS  TGL LP+SD+
Sbjct: 181  LHEEISDFYEYMSPRPEEEKMRME-VVSRIESVIKELWPSADVQIFGSFKTGLYLPTSDI 239

Query: 1273 DLVVC-----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
            DLVV      LP    L  ++EA                    L   +    DS+K ++ 
Sbjct: 240  DLVVFGKWENLP----LWTLEEA--------------------LRKHKVADEDSVKVLDK 275

Query: 1328 TAIPIIMLV---VEVPHDLIASAASSVQS-----------PKEDA 1358
              +PII L     EV  D+  +  + V++           P+EDA
Sbjct: 276  ATVPIIKLTDSFTEVKVDISFNVQNGVRAADLIKDFTKLHPREDA 320


>gi|452988727|gb|EME88482.1| hypothetical protein MYCFIDRAFT_159726 [Pseudocercospora fijiensis
            CIRAD86]
          Length = 658

 Score = 52.8 bits (125), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 25/75 (33%), Positives = 46/75 (61%), Gaps = 2/75 (2%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            + + T++++DISF++ S  GL T   +    +QFP    L  ++KQFLA R L++ ++GG
Sbjct: 327  VDSRTNIKVDISFENDS--GLNTIPTLLAWKQQFPEMPVLVAIIKQFLAMRGLNEVHTGG 384

Query: 1452 LSSYCLMLLITRFLQ 1466
            +  + ++ L+   LQ
Sbjct: 385  IGGFTIICLVVSMLQ 399


>gi|145548044|ref|XP_001459703.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124427529|emb|CAK92306.1| unnamed protein product [Paramecium tetraurelia]
          Length = 650

 Score = 52.8 bits (125), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 28/99 (28%), Positives = 51/99 (51%), Gaps = 11/99 (11%)

Query: 1372 DMVALDDSASP----KCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPA 1427
            D+  +D++  P    +CS    + +       +DI+     H+G +T + + E  ++F  
Sbjct: 406  DVKVIDNARKPVLKMQCSQEFQNKL-------IDITISKNDHSGRKTANSMIEFQKEFKQ 458

Query: 1428 STPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
               LAL+LK +    +L  +Y GGL+SYC++ +I   LQ
Sbjct: 459  FKSLALILKFYFKSINLLNAYQGGLNSYCILTMILALLQ 497



 Score = 44.7 bits (104), Expect = 0.37,   Method: Compositional matrix adjust.
 Identities = 23/77 (29%), Positives = 44/77 (57%)

Query: 1200 PTRKASLSLMHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFG 1259
            P  + +L  +   L  EI++F +++      ++P I+  V+ V   +Q +  +SR  ++G
Sbjct: 305  PQHRTTLDDIVKQLSFEINTFSQELNKLLEEQRPIIDKIVQMVDEIVQTVSSKSRAFLYG 364

Query: 1260 SNATGLSLPSSDVDLVV 1276
            S  TGL+L  SD+D+V+
Sbjct: 365  SCYTGLNLLDSDIDIVI 381


>gi|449498731|ref|XP_004160618.1| PREDICTED: uncharacterized protein LOC101229001 [Cucumis sativus]
          Length = 384

 Score = 52.8 bits (125), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 38/127 (29%), Positives = 64/127 (50%), Gaps = 25/127 (19%)

Query: 1213 LHEEIDSFCKQVA---AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
            LH+EI  FC+ ++    E  AR    + AV+RV   ++ +WP  +  +FGS  TGL LP+
Sbjct: 197  LHKEIVDFCEFLSPTEEERVAR----DSAVERVFSVVKHIWPHCKVEVFGSFQTGLYLPT 252

Query: 1270 SDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTA 1329
            SD+D+V+             +GI + + G     LQ  +R L+ +   K   ++ +    
Sbjct: 253  SDIDVVIL-----------GSGIPKPQLG-----LQALSRALSQKGIAK--KIQVIGKAR 294

Query: 1330 IPIIMLV 1336
            +PII  +
Sbjct: 295  VPIIKFI 301


>gi|219518398|gb|AAI44798.1| Papd5 protein [Mus musculus]
          Length = 590

 Score = 52.8 bits (125), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 48/165 (29%), Positives = 71/165 (43%), Gaps = 44/165 (26%)

Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
            LHEEI  F + ++      K  +   V R+   ++ LWP +   IFGS  TGL LP+SD+
Sbjct: 134  LHEEISDFYEYMSPRPEEEKMRME-VVSRIESVIKELWPSADVQIFGSFKTGLYLPTSDI 192

Query: 1273 DLVVC-----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
            DLVV      LP    L  ++EA                    L   +    DS+K ++ 
Sbjct: 193  DLVVFGKWENLP----LWTLEEA--------------------LRKHKVADEDSVKVLDK 228

Query: 1328 TAIPIIMLV---VEVPHDLIASAASSVQS-----------PKEDA 1358
              +PII L     EV  D+  +  + V++           P+EDA
Sbjct: 229  ATVPIIKLTDSFTEVKVDISFNVQNGVRAADLIKDFTKLHPREDA 273


>gi|358394414|gb|EHK43807.1| hypothetical protein TRIATDRAFT_163212, partial [Trichoderma
            atroviride IMI 206040]
          Length = 690

 Score = 52.4 bits (124), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 30/92 (32%), Positives = 47/92 (51%), Gaps = 13/92 (14%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            I  AT +R+D+SF++    G++  D      ++FPA   L  V+K FL  R L++  +GG
Sbjct: 481  IDRATGLRVDVSFEN--LGGVKAIDTFLRWKKEFPAMPILVTVIKHFLLMRGLNEPVNGG 538

Query: 1452 LSSYCLMLLITRFLQ-----------HEHHLG 1472
            L  + ++ L+   LQ            EHHLG
Sbjct: 539  LGGFSVICLVVSMLQLMPQVQSRSLIPEHHLG 570


>gi|156096154|ref|XP_001614111.1| hypothetical protein [Plasmodium vivax Sal-1]
 gi|148802985|gb|EDL44384.1| hypothetical protein, conserved [Plasmodium vivax]
          Length = 808

 Score = 52.4 bits (124), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 27/75 (36%), Positives = 43/75 (57%), Gaps = 2/75 (2%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            I   T V++D+     S    QTTD +K   ++F    PL ++LK FL  R+L+++Y GG
Sbjct: 486  IDRETGVQVDVCTNQLSSR--QTTDFIKSKLQKFIYMRPLVILLKFFLNTRNLNETYIGG 543

Query: 1452 LSSYCLMLLITRFLQ 1466
            + S+ L  ++  FLQ
Sbjct: 544  IGSFLLCCMVLHFLQ 558


>gi|403159818|ref|XP_003320384.2| hypothetical protein PGTG_01296 [Puccinia graminis f. sp. tritici CRL
            75-36-700-3]
 gi|375168256|gb|EFP75965.2| hypothetical protein PGTG_01296 [Puccinia graminis f. sp. tritici CRL
            75-36-700-3]
          Length = 876

 Score = 52.0 bits (123), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 56/267 (20%), Positives = 105/267 (39%), Gaps = 35/267 (13%)

Query: 1209 MHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLP 1268
            +  +L  EI SF   +   +   +      ++ + +++   WP +    FGS  T L LP
Sbjct: 68   VQQMLTAEIGSFVAYIQPTHEEHQ-LRQMIIQMIRKTVHSRWPDADVEPFGSFGTKLYLP 126

Query: 1269 SSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENT 1328
            + D+DLV+    + N +              K   L   A  L  +  +  D +  +   
Sbjct: 127  AGDIDLVIISTQMMNEQ--------------KSRILYKLA-PLIRENNIGQDVV-VIAKA 170

Query: 1329 AIPII---MLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCS 1385
             +PII    +   +  D+  +  + +           +K  N +  D+  L    S    
Sbjct: 171  KVPIIKFKTIFGNINVDISINQTNGI---------VAMKKVNELLDDIKCLSLDISGSTR 221

Query: 1386 HTSSDNIKAATSVRLDISFKSPSHTGLQTTDL------VKELTEQFPASTPLALVLKQFL 1439
            H+   +   ++    +    S  H    + DL      + ++ E   A+  L LV+K  L
Sbjct: 222  HSRDLDQPLSSRKEQNGPAHSRRHRRTSSPDLSDDEKAISQIIEDLGAAKCLILVIKSVL 281

Query: 1440 ADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
              R +++ Y+GGL SY ++ L+  FLQ
Sbjct: 282  KQRGMNEVYTGGLGSYSIICLVVSFLQ 308


>gi|156088769|ref|XP_001611791.1| hypothetical protein [Babesia bovis T2Bo]
 gi|154799045|gb|EDO08223.1| hypothetical protein BBOV_III006620 [Babesia bovis]
          Length = 468

 Score = 52.0 bits (123), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 27/71 (38%), Positives = 41/71 (57%), Gaps = 2/71 (2%)

Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
            + V+LDIS        + TT+ VK   +Q+    PL L++K FL  R+L+ +Y GG+ SY
Sbjct: 179  SGVKLDISIWQAE--AINTTNFVKAKCKQYKYLQPLVLLIKLFLQLRNLNDTYIGGIGSY 236

Query: 1456 CLMLLITRFLQ 1466
             L  ++  FLQ
Sbjct: 237  LLYCMVLSFLQ 247


>gi|402220735|gb|EJU00806.1| Nucleotidyltransferase, partial [Dacryopinax sp. DJM-731 SS1]
          Length = 266

 Score = 52.0 bits (123), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 23/63 (36%), Positives = 34/63 (53%)

Query: 1411 GLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQHEHH 1470
            GL+   +V       PA  PL +V+K FL  R L++ YSGG  SY ++ ++  FLQ    
Sbjct: 127  GLRAVTVVNRFLWYLPAVRPLVMVIKAFLLQRGLNEPYSGGFGSYTVICMVVSFLQMHPK 186

Query: 1471 LGR 1473
            + R
Sbjct: 187  IRR 189


>gi|313243660|emb|CBY42319.1| unnamed protein product [Oikopleura dioica]
          Length = 513

 Score = 52.0 bits (123), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 28/72 (38%), Positives = 48/72 (66%), Gaps = 2/72 (2%)

Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
            + V++DISF     TG+++  L+K    ++     L LVLKQFL  ++L++ ++GG+SSY
Sbjct: 52   SDVKVDISFNV--KTGIKSVSLIKAFIREYQVLPVLVLVLKQFLLQQNLNEVWTGGISSY 109

Query: 1456 CLMLLITRFLQH 1467
             L+L++  FLQ+
Sbjct: 110  GLILMVLCFLQN 121


>gi|393245685|gb|EJD53195.1| Nucleotidyltransferase [Auricularia delicata TFB-10046 SS5]
          Length = 584

 Score = 52.0 bits (123), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 23/71 (32%), Positives = 39/71 (54%)

Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
            T  R  +        G+ +  +V    ++ PA  PLA+ +K FL +R++++ Y+GGL SY
Sbjct: 222  THGRFAVDISVNQDNGIASGRIVNGFLQELPALRPLAMTVKAFLRERNMNEVYNGGLGSY 281

Query: 1456 CLMLLITRFLQ 1466
              + L+  FLQ
Sbjct: 282  STVCLLVSFLQ 292


>gi|407927280|gb|EKG20178.1| PAP/25A-associated [Macrophomina phaseolina MS6]
          Length = 691

 Score = 52.0 bits (123), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 23/76 (30%), Positives = 46/76 (60%), Gaps = 2/76 (2%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            +   T +++DISF++  +TG+      +E  E +PA   +A ++KQ+LA R L++ ++GG
Sbjct: 477  VDRKTGLKVDISFEN--NTGIIANRTFREWKEMYPAMPVIATLIKQYLAMRGLNEVFTGG 534

Query: 1452 LSSYCLMLLITRFLQH 1467
            L  + ++ L+   L +
Sbjct: 535  LGGFSVICLVVSLLHN 550



 Score = 43.1 bits (100), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 37/141 (26%), Positives = 64/141 (45%), Gaps = 17/141 (12%)

Query: 1199 CPTRKASLSLMHSLLHEEIDSFCKQVAA---ENTARKPYINWAVKRVTRSLQVLWPRSRT 1255
            C    ++   M   LH+EI  F + V     EN  R   ++  +++VTR     +P S  
Sbjct: 351  CAIDHSATQSMGFWLHKEIADFYEFVRPHPFENEIRLDLVH-RIEQVTRRE---FPGSEL 406

Query: 1256 NIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQE 1315
            + FGS A GL LP++D+DLV+          + +  I  GR  + +  L   +  L  + 
Sbjct: 407  HYFGSFAAGLYLPTADMDLVL----------LSKHFIRTGRGNVTKKNLYKMSTLLEREG 456

Query: 1316 WVKSDSLKTVENTAIPIIMLV 1336
              +  + + +    +PII +V
Sbjct: 457  ISQPGATEVISGARVPIIKIV 477


>gi|403221893|dbj|BAM40025.1| topoisomerase-related nucleotidyltransferase [Theileria orientalis
            strain Shintoku]
          Length = 520

 Score = 52.0 bits (123), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 26/75 (34%), Positives = 43/75 (57%), Gaps = 2/75 (2%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            +   T VR+D+S  + S    +TT  ++E+  ++    PL L++K FL  R+L  +Y GG
Sbjct: 232  VDKRTGVRIDLSCYNES--AFKTTKFIQEMCVKYKYMQPLILLVKLFLQSRNLGDTYFGG 289

Query: 1452 LSSYCLMLLITRFLQ 1466
            + S+ L  +I  FLQ
Sbjct: 290  VGSFLLYCMILSFLQ 304


>gi|402083045|gb|EJT78063.1| DNA polymerase sigma [Gaeumannomyces graminis var. tritici R3-111a-1]
          Length = 732

 Score = 52.0 bits (123), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 26/75 (34%), Positives = 43/75 (57%), Gaps = 2/75 (2%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            + A T +++DISF++   TG+   +      E+FPA   L  V+K FLA R L++  +GG
Sbjct: 515  VDARTGLKVDISFEN--STGITAVNTFLAWKEEFPAMPILVTVIKHFLAMRGLNEPVNGG 572

Query: 1452 LSSYCLMLLITRFLQ 1466
            L  + ++ L+   LQ
Sbjct: 573  LGGFSVICLVVSMLQ 587


>gi|444720754|gb|ELW61529.1| HEAT repeat-containing protein 3 [Tupaia chinensis]
          Length = 1047

 Score = 52.0 bits (123), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 48/160 (30%), Positives = 71/160 (44%), Gaps = 34/160 (21%)

Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
            LHEEI  F + ++      K  +   V R+   ++ LWP +   IFGS  TGL LP+SD+
Sbjct: 646  LHEEISDFYEYMSPRPEEEKMRME-VVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDI 704

Query: 1273 DLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPI 1332
            DLVV      NL P+                L+ A R     +    DS+K ++   +PI
Sbjct: 705  DLVV-FGKWENL-PL--------------WTLEEALR---KHKVADEDSVKVLDKATVPI 745

Query: 1333 IMLV---VEVPHDLIASAASSVQS-----------PKEDA 1358
            I L     EV  D+  +  + V++           P+EDA
Sbjct: 746  IKLTDSFTEVKVDISFNVQNGVRAADLIKDFTKLHPREDA 785


>gi|366997671|ref|XP_003683572.1| hypothetical protein TPHA_0A00530 [Tetrapisispora phaffii CBS 4417]
 gi|357521867|emb|CCE61138.1| hypothetical protein TPHA_0A00530 [Tetrapisispora phaffii CBS 4417]
          Length = 678

 Score = 52.0 bits (123), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 26/88 (29%), Positives = 51/88 (57%), Gaps = 3/88 (3%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            ++  + + +D+SF+  +  G++   L++E  +  P    L LV+KQFLA R L+  ++GG
Sbjct: 281  VEPKSQIHIDVSFERTN--GVEAAKLIREWLDDTPGLRELVLVIKQFLATRRLNNVHTGG 338

Query: 1452 LSSYCLMLLITRFLQ-HEHHLGRPINQV 1478
            L  + ++ L+  FL+ H   +   IN++
Sbjct: 339  LGGFSIICLVFCFLKMHPKIITNAINEL 366


>gi|221058739|ref|XP_002260015.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
            knowlesi strain H]
 gi|193810088|emb|CAQ41282.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
            knowlesi strain H]
          Length = 821

 Score = 51.6 bits (122), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 26/71 (36%), Positives = 42/71 (59%), Gaps = 2/71 (2%)

Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
            T V++D+     S    QTTD +K   E++    PL ++LK FL  R+L+++Y GG+ S+
Sbjct: 499  TGVQVDVCTNQLS--SRQTTDFIKSKMEKYIYLRPLVILLKFFLNTRNLNETYIGGIGSF 556

Query: 1456 CLMLLITRFLQ 1466
             L  ++  FLQ
Sbjct: 557  MLCCMVLHFLQ 567


>gi|443927237|gb|ELU45748.1| Trf5 [Rhizoctonia solani AG-1 IA]
          Length = 622

 Score = 51.6 bits (122), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 21/68 (30%), Positives = 39/68 (57%), Gaps = 2/68 (2%)

Query: 1399 RLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLM 1458
            ++DIS    +  G+    ++    ++ PA  PL  + K FL+ R +++ Y+GGL SY ++
Sbjct: 208  KVDISLNQAN--GVSAGKIINNFLKRLPALRPLVFIFKAFLSQREMNEVYNGGLGSYSIV 265

Query: 1459 LLITRFLQ 1466
             ++  FLQ
Sbjct: 266  CMVLSFLQ 273


>gi|355756767|gb|EHH60375.1| hypothetical protein EGM_11721, partial [Macaca fascicularis]
          Length = 483

 Score = 51.6 bits (122), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 27/64 (42%), Positives = 37/64 (57%), Gaps = 1/64 (1%)

Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
            LHEEI  F + ++      K  +   V R+   ++ LWP +   IFGS  TGL LP+SD+
Sbjct: 19   LHEEISDFYEYMSPRPEEEKMRME-VVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDI 77

Query: 1273 DLVV 1276
            DLVV
Sbjct: 78   DLVV 81



 Score = 47.8 bits (112), Expect = 0.044,   Method: Compositional matrix adjust.
 Identities = 21/53 (39%), Positives = 33/53 (62%)

Query: 1414 TTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            +  ++ + T ++P    L LVLKQFL  R L++ ++GG+ SY L L+   FLQ
Sbjct: 107  SVKVLDKATVKYPVLPYLVLVLKQFLLQRDLNEVFTGGIGSYSLFLMAVSFLQ 159


>gi|380483151|emb|CCF40795.1| Poly(A) RNA polymerase cid14 [Colletotrichum higginsianum]
          Length = 269

 Score = 51.6 bits (122), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 32/92 (34%), Positives = 45/92 (48%), Gaps = 13/92 (14%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            I + T +R+DISF      G Q      E  EQFPA   L  ++K FLA R L++  +GG
Sbjct: 56   IDSITGLRVDISFDRLD--GPQAVKTFAEWKEQFPAMPILVTMIKHFLAMRGLNEPVNGG 113

Query: 1452 LSSYCLMLLITRFLQH-----------EHHLG 1472
            + S+ +  ++   LQ            EHHLG
Sbjct: 114  IGSFTVTCMVVSMLQLMPQVQSRNLIPEHHLG 145


>gi|313231448|emb|CBY08562.1| unnamed protein product [Oikopleura dioica]
          Length = 587

 Score = 51.6 bits (122), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 28/72 (38%), Positives = 48/72 (66%), Gaps = 2/72 (2%)

Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
            + V++DISF     TG+++  L+K    ++     L LVLKQFL  ++L++ ++GG+SSY
Sbjct: 116  SDVKVDISFNV--KTGIKSVSLIKAFIREYQVLPVLVLVLKQFLLQQNLNEVWTGGISSY 173

Query: 1456 CLMLLITRFLQH 1467
             L+L++  FLQ+
Sbjct: 174  GLILMVLCFLQN 185


>gi|449547164|gb|EMD38132.1| hypothetical protein CERSUDRAFT_49354 [Ceriporiopsis subvermispora B]
          Length = 547

 Score = 51.6 bits (122), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 22/75 (29%), Positives = 37/75 (49%)

Query: 1399 RLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLM 1458
            R  +        G+    ++     + PA   L  V+K FL+ RS+++ ++GGL SY ++
Sbjct: 283  RFSVDISINQGNGVAAGKMINHFLSELPALRALVFVVKSFLSQRSMNEVFTGGLGSYSIV 342

Query: 1459 LLITRFLQHEHHLGR 1473
             L   FLQH   + R
Sbjct: 343  CLAISFLQHHPKIRR 357


>gi|320035002|gb|EFW16944.1| Poly(A) polymerase [Coccidioides posadasii str. Silveira]
          Length = 680

 Score = 51.6 bits (122), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 26/75 (34%), Positives = 44/75 (58%), Gaps = 2/75 (2%)

Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
            T +++D+SF +  ++GL      ++  E FPA   +  V+KQFL  R L++  +GGL  +
Sbjct: 365  TGLKVDLSFDN--NSGLAANRTFQQWKEHFPAMPLIVSVIKQFLLLRGLNEVPTGGLGGF 422

Query: 1456 CLMLLITRFLQHEHH 1470
             ++ L+T  LQH  H
Sbjct: 423  SIICLVTSLLQHLPH 437



 Score = 49.7 bits (117), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 48/184 (26%), Positives = 80/184 (43%), Gaps = 31/184 (16%)

Query: 1170 DHLIAISQLARDQEHPDVAFPLQPLEVQNCPT-------------RKASLSLMHSLLHEE 1216
            D ++  +  ++  E PD  F L    +Q   T             R+A+  L  +L +E 
Sbjct: 192  DEIVGPAPRSQRYEKPDTRFNLDGSIIQAYRTIAGQDSTPWFDRSRQATFHLGAALQNEI 251

Query: 1217 IDSF--CKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDL 1274
            +D +   K    E+  R   I     R  R +Q  +P S+ + FGS A+GL LP +DVDL
Sbjct: 252  VDFYHWVKPRPFEDVIRTDLIT----RFERLMQNRFPGSQLHAFGSYASGLYLPVADVDL 307

Query: 1275 VVCLPPVRNLEPIKEAGILEGRNGI--KETCLQHAARYLANQEWVKSDSLKTVENTAIPI 1332
            V+          +  + I +GR  +  K   +     Y+ + E     S++T+ +  +PI
Sbjct: 308  VL----------LSRSFIRQGRKFLCQKIKDIYSLTAYIRDTEIAVPGSIETIAHARVPI 357

Query: 1333 IMLV 1336
            I  V
Sbjct: 358  IKFV 361


>gi|299738269|ref|XP_001838228.2| PAP associated domain containing 5 [Coprinopsis cinerea okayama7#130]
 gi|298403230|gb|EAU83596.2| PAP associated domain containing 5 [Coprinopsis cinerea okayama7#130]
          Length = 431

 Score = 51.6 bits (122), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 81/339 (23%), Positives = 127/339 (37%), Gaps = 89/339 (26%)

Query: 1159 MIQPLSGALLQDHLIAISQLARDQEHPD-VAFPLQPL-------EVQNCPTRKASLSLMH 1210
            M   L G  L+D++IA     R + + D    P+ P           N P    +L+  H
Sbjct: 1    MSSSLLGQTLRDNIIA-GVPTRTKAYSDYTGLPVPPCGDFAPRSTSINAPPWMPTLTSPH 59

Query: 1211 SL-LHEEIDSFC---KQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLS 1266
             L LH EI+ F    K    E   R   IN    R T+ ++ L P+      GS  +GL 
Sbjct: 60   RLSLHNEIEHFVAYMKPTDKELLLRTDLIN----RFTKLIETLDPQVSVQAVGSTVSGLH 115

Query: 1267 LPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVE 1326
             P+SD+D+VV L   R          L  R  +  T L++  R       ++S     + 
Sbjct: 116  FPTSDIDMVVTLAGAR--AGYYSTSTLSARTTL--TLLEYKIRSSGFARKIES-----IL 166

Query: 1327 NTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSH 1386
            N ++P++          I  A + +      A    +K  N V S M             
Sbjct: 167  NASVPLLR---------ITDAVTGIDIDLTAADEHGVKATNMVQSYM------------- 204

Query: 1387 TSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQ 1446
                              KS + T +++                L +VLK FLA R L  
Sbjct: 205  ------------------KSENATVIKS----------------LVMVLKLFLATRKLGT 230

Query: 1447 SYSGGLSSYCLMLLITRFLQHEH-------HLGRPINQV 1478
            +++GG++SY L+ ++  +++ E        H  RP N V
Sbjct: 231  TFTGGINSYLLVWMMVAWVKLEMPKLVGGAHSERPSNNV 269


>gi|212532361|ref|XP_002146337.1| topoisomerase family protein TRF4, putative [Talaromyces marneffei
            ATCC 18224]
 gi|210071701|gb|EEA25790.1| topoisomerase family protein TRF4, putative [Talaromyces marneffei
            ATCC 18224]
          Length = 683

 Score = 51.6 bits (122), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 26/77 (33%), Positives = 46/77 (59%), Gaps = 3/77 (3%)

Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
            T +++D++F + S  G++     ++  E +PA   +  V+KQFL  RSL++  +GGL  +
Sbjct: 400  TGLKVDMAFDNDS--GIRAIQTFEKWKEAYPAMPAIVAVVKQFLLLRSLNEVPTGGLGGF 457

Query: 1456 CLMLLITRFLQH-EHHL 1471
             ++ L+   LQH  HHL
Sbjct: 458  SIICLVVSLLQHMPHHL 474


>gi|403213331|emb|CCK67833.1| hypothetical protein KNAG_0A01440 [Kazachstania naganishii CBS 8797]
          Length = 526

 Score = 51.6 bits (122), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 23/74 (31%), Positives = 43/74 (58%), Gaps = 2/74 (2%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            ++  + + +D+SF+  +  GL+   L++E  E  P    L L++KQFL  R L+  ++GG
Sbjct: 245  VEPESRIHIDVSFERIN--GLEAAKLIREWLESTPGLRELVLIIKQFLHSRRLNNVHTGG 302

Query: 1452 LSSYCLMLLITRFL 1465
            L  + ++ L+  FL
Sbjct: 303  LGGFSIICLVYSFL 316


>gi|254579541|ref|XP_002495756.1| ZYRO0C02332p [Zygosaccharomyces rouxii]
 gi|238938647|emb|CAR26823.1| ZYRO0C02332p [Zygosaccharomyces rouxii]
          Length = 531

 Score = 51.6 bits (122), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 53/230 (23%), Positives = 88/230 (38%), Gaps = 75/230 (32%)

Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
            N  ++ +  +++ LWP +   +FGS AT L LP SD+D V+            + G  E 
Sbjct: 123  NKTIRTLRHAVRKLWPGADLQVFGSYATDLYLPGSDIDCVIN----------SKTGDKEN 172

Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
            R+ + E       R LA Q       ++ +    +PII  V                 P 
Sbjct: 173  RSSLYELAHFLKNRKLATQ-------VEVIAKARVPIIKFV----------------EPT 209

Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
                       + +H D+           S   ++ ++AA  +R          + LQ T
Sbjct: 210  -----------SQIHVDV-----------SFERTNGLEAAKLIR----------SWLQQT 237

Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFL 1465
                      P    L L++KQFL  R L+  ++GGL  + ++ L+  FL
Sbjct: 238  ----------PGLRELVLIVKQFLHARRLNNVHTGGLGGFSIICLVYAFL 277


>gi|303323645|ref|XP_003071814.1| PAP/25A associated domain containing protein [Coccidioides posadasii
            C735 delta SOWgp]
 gi|240111516|gb|EER29669.1| PAP/25A associated domain containing protein [Coccidioides posadasii
            C735 delta SOWgp]
          Length = 680

 Score = 51.6 bits (122), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 26/75 (34%), Positives = 44/75 (58%), Gaps = 2/75 (2%)

Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
            T +++D+SF +  ++GL      ++  E FPA   +  V+KQFL  R L++  +GGL  +
Sbjct: 365  TGLKVDLSFDN--NSGLAANRTFQQWKEHFPAMPLIVSVIKQFLLLRGLNEVPTGGLGGF 422

Query: 1456 CLMLLITRFLQHEHH 1470
             ++ L+T  LQH  H
Sbjct: 423  SIICLVTSLLQHLPH 437



 Score = 49.7 bits (117), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 48/184 (26%), Positives = 80/184 (43%), Gaps = 31/184 (16%)

Query: 1170 DHLIAISQLARDQEHPDVAFPLQPLEVQNCPT-------------RKASLSLMHSLLHEE 1216
            D ++  +  ++  E PD  F L    +Q   T             R+A+  L  +L +E 
Sbjct: 192  DEIVGPAPRSQRYEKPDTRFNLDGSIIQAYRTIAGQDSTPWFDRSRQATFHLGAALQNEI 251

Query: 1217 IDSF--CKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDL 1274
            +D +   K    E+  R   I     R  R +Q  +P S+ + FGS A+GL LP +DVDL
Sbjct: 252  VDFYHWVKPRPFEDVIRTDLIT----RFERLMQNRFPGSQLHAFGSYASGLYLPVADVDL 307

Query: 1275 VVCLPPVRNLEPIKEAGILEGRNGI--KETCLQHAARYLANQEWVKSDSLKTVENTAIPI 1332
            V+          +  + I +GR  +  K   +     Y+ + E     S++T+ +  +PI
Sbjct: 308  VL----------LSRSFIRQGRKFLCQKIKDIYSLTAYIRDTEIAVPGSIETIAHARVPI 357

Query: 1333 IMLV 1336
            I  V
Sbjct: 358  IKFV 361


>gi|406701338|gb|EKD04487.1| hypothetical protein A1Q2_01263 [Trichosporon asahii var. asahii CBS
            8904]
          Length = 624

 Score = 51.6 bits (122), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 25/75 (33%), Positives = 41/75 (54%), Gaps = 2/75 (2%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            + A   + +DIS    +  G+    +V    +  P +  L LV+K FL+ RS+++ Y+GG
Sbjct: 211  VTAEGGINVDISLNQTN--GVSAVKIVNHYLKALPGARELILVIKAFLSQRSMNEVYTGG 268

Query: 1452 LSSYCLMLLITRFLQ 1466
            L SY ++ L   FLQ
Sbjct: 269  LGSYSVICLALSFLQ 283


>gi|401882466|gb|EJT46724.1| hypothetical protein A1Q1_04689 [Trichosporon asahii var. asahii CBS
            2479]
          Length = 631

 Score = 51.2 bits (121), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 25/75 (33%), Positives = 41/75 (54%), Gaps = 2/75 (2%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            + A   + +DIS    +  G+    +V    +  P +  L LV+K FL+ RS+++ Y+GG
Sbjct: 211  VTAEGGINVDISLNQTN--GVSAVKIVNHYLKALPGARELILVIKAFLSQRSMNEVYTGG 268

Query: 1452 LSSYCLMLLITRFLQ 1466
            L SY ++ L   FLQ
Sbjct: 269  LGSYSVICLALSFLQ 283


>gi|242212981|ref|XP_002472321.1| predicted protein [Postia placenta Mad-698-R]
 gi|220728598|gb|EED82489.1| predicted protein [Postia placenta Mad-698-R]
          Length = 1512

 Score = 51.2 bits (121), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 23/63 (36%), Positives = 36/63 (57%)

Query: 1411 GLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQHEHH 1470
            G+    ++K    + PA   L LV+K FL+ RS+++ Y+GGL SY ++ L   FLQ    
Sbjct: 269  GVTAGKMIKHYLAELPALRSLILVIKSFLSQRSMNEVYTGGLGSYSIVCLAISFLQMHPK 328

Query: 1471 LGR 1473
            + R
Sbjct: 329  IRR 331


>gi|392867853|gb|EAS33553.2| topoisomerase TRF4 [Coccidioides immitis RS]
          Length = 680

 Score = 51.2 bits (121), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 26/75 (34%), Positives = 44/75 (58%), Gaps = 2/75 (2%)

Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
            T +++D+SF +  ++GL      ++  E FPA   +  V+KQFL  R L++  +GGL  +
Sbjct: 365  TGLKVDLSFDN--NSGLAANRTFQQWKEHFPAMPLIVSVVKQFLLLRGLNEVPTGGLGGF 422

Query: 1456 CLMLLITRFLQHEHH 1470
             ++ L+T  LQH  H
Sbjct: 423  SIICLVTSLLQHLPH 437



 Score = 49.7 bits (117), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 48/184 (26%), Positives = 80/184 (43%), Gaps = 31/184 (16%)

Query: 1170 DHLIAISQLARDQEHPDVAFPLQPLEVQNCPT-------------RKASLSLMHSLLHEE 1216
            D ++  +  ++  E PD  F L    +Q   T             R+A+  L  +L +E 
Sbjct: 192  DEIVGPAPRSQRYEKPDTRFNLDGSIIQAYRTIAGQDSTPWFDRSRQATFHLGAALQNEI 251

Query: 1217 IDSF--CKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDL 1274
            +D +   K    E+  R   I     R  R +Q  +P S+ + FGS A+GL LP +DVDL
Sbjct: 252  VDFYHWVKPRPFEDVIRTDLIT----RFERLMQNRFPGSQLHAFGSYASGLYLPVADVDL 307

Query: 1275 VVCLPPVRNLEPIKEAGILEGRNGI--KETCLQHAARYLANQEWVKSDSLKTVENTAIPI 1332
            V+          +  + I +GR  +  K   +     Y+ + E     S++T+ +  +PI
Sbjct: 308  VL----------LSRSFIRQGRKFLCQKIKDIYSLTAYIRDTEIAVPGSIETIAHARVPI 357

Query: 1333 IMLV 1336
            I  V
Sbjct: 358  IKFV 361


>gi|401623740|gb|EJS41828.1| trf4p [Saccharomyces arboricola H-6]
          Length = 573

 Score = 51.2 bits (121), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 51/234 (21%), Positives = 87/234 (37%), Gaps = 76/234 (32%)

Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
            N  +  +  +++ LWP +  ++FGS +T L LP SD+D V+            E G  E 
Sbjct: 193  NQTISMIREAVKQLWPDADLHVFGSYSTDLYLPGSDIDCVIT----------SELGGKES 242

Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
            RN +         + LA +       ++ V    +PII  V                   
Sbjct: 243  RNNLFSLASHLKKKNLATE-------IEVVAKARVPIIKFV------------------- 276

Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
                    + ++ +H D+           S   ++ ++AA  +R                
Sbjct: 277  --------EPNSGIHIDV-----------SFERTNGLEAAKLIR---------------- 301

Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQHEH 1469
                E     P    L L++KQFL  R L+  ++GGL  + ++ L+  FL H H
Sbjct: 302  ----EWLNDTPGLRELVLIVKQFLHSRRLNNVHTGGLGGFSIICLVFSFL-HMH 350


>gi|449296989|gb|EMC93008.1| hypothetical protein BAUCODRAFT_49110, partial [Baudoinia
            compniacensis UAMH 10762]
          Length = 611

 Score = 51.2 bits (121), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 23/76 (30%), Positives = 43/76 (56%), Gaps = 2/76 (2%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            I   T +++DISF++   TG+      +E    +PA   + +++KQ LA R L++ +SGG
Sbjct: 407  IDKKTGIKVDISFEN--RTGIVANKTFQEWKVLYPAMPAIVVLIKQLLAMRGLNEVFSGG 464

Query: 1452 LSSYCLMLLITRFLQH 1467
            +  +  + L+   +QH
Sbjct: 465  IGGFTTICLVVSMMQH 480


>gi|429964503|gb|ELA46501.1| hypothetical protein VCUG_01993 [Vavraia culicis 'floridensis']
          Length = 401

 Score = 51.2 bits (121), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 29/67 (43%), Positives = 39/67 (58%), Gaps = 2/67 (2%)

Query: 1399 RLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLM 1458
            R D+S    S  G+  T LVKE+  + PA   +AL LK FL  R L+++  GGL+SY   
Sbjct: 207  RYDLSLNQES--GVIHTKLVKEILHEQPAIKDMALFLKHFLKCRGLNENRRGGLNSYAQF 264

Query: 1459 LLITRFL 1465
            L+I  FL
Sbjct: 265  LMIASFL 271


>gi|426385175|ref|XP_004059104.1| PREDICTED: PAP-associated domain-containing protein 5-like, partial
            [Gorilla gorilla gorilla]
          Length = 240

 Score = 51.2 bits (121), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 46/79 (58%), Gaps = 9/79 (11%)

Query: 1213 LHEEIDSFCKQVAA--ENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSS 1270
            LHEEI  F   ++   E  A +  +   VKR+   ++ LWP +   IFGS +TGL LP+S
Sbjct: 136  LHEEIIDFYNFMSPCPEEAAMRREV---VKRIETVVKDLWPTADVQIFGSFSTGLYLPTS 192

Query: 1271 DVDLVVC----LPPVRNLE 1285
            D+DLVV      PP++ LE
Sbjct: 193  DIDLVVFGKWERPPLQLLE 211


>gi|443895250|dbj|GAC72596.1| DNA polymerase sigma [Pseudozyma antarctica T-34]
          Length = 689

 Score = 50.8 bits (120), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 25/70 (35%), Positives = 41/70 (58%), Gaps = 4/70 (5%)

Query: 1398 VRLDISFKSPSHT-GLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYC 1456
            +++DIS    +HT GL T   V     ++P   PL +V+K  L  R + + +SGGL SY 
Sbjct: 210  LKVDISL---NHTNGLTTASFVNSWLRKWPHIRPLIIVVKHLLMQRGMSEVFSGGLGSYS 266

Query: 1457 LMLLITRFLQ 1466
            +++++  FLQ
Sbjct: 267  IIIMVISFLQ 276


>gi|409045762|gb|EKM55242.1| hypothetical protein PHACADRAFT_93478 [Phanerochaete carnosa
            HHB-10118-sp]
          Length = 478

 Score = 50.8 bits (120), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 21/68 (30%), Positives = 36/68 (52%)

Query: 1399 RLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLM 1458
            R  +        G++   ++K   +  PA   L L+ K FL+ RS+++ ++GGL SY ++
Sbjct: 264  RFSVDISLNQMNGVKAGTMIKRFLDHIPALQALVLITKSFLSQRSMNEVFTGGLGSYSIV 323

Query: 1459 LLITRFLQ 1466
             L   FLQ
Sbjct: 324  CLAISFLQ 331


>gi|156057775|ref|XP_001594811.1| hypothetical protein SS1G_04619 [Sclerotinia sclerotiorum 1980]
 gi|154702404|gb|EDO02143.1| hypothetical protein SS1G_04619 [Sclerotinia sclerotiorum 1980 UF-70]
          Length = 794

 Score = 50.8 bits (120), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 28/92 (30%), Positives = 46/92 (50%), Gaps = 13/92 (14%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            +   T +++D+SF++   TGL      +E   QFPA   L  ++K  LA R L++  +GG
Sbjct: 573  VDGFTGLKVDMSFEN--DTGLIANKTFQEWKAQFPAMPILVTIIKHLLAMRGLNEPVNGG 630

Query: 1452 LSSYCLMLLITRFLQH-----------EHHLG 1472
            +  + +  L+   LQ+           EHHLG
Sbjct: 631  IGGFSVTCLVVSLLQNMPQVKSKTMIPEHHLG 662


>gi|444314869|ref|XP_004178092.1| hypothetical protein TBLA_0A07840 [Tetrapisispora blattae CBS 6284]
 gi|387511131|emb|CCH58573.1| hypothetical protein TBLA_0A07840 [Tetrapisispora blattae CBS 6284]
          Length = 711

 Score = 50.8 bits (120), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 58/258 (22%), Positives = 97/258 (37%), Gaps = 84/258 (32%)

Query: 1212 LLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSD 1271
            LL EEI  F   ++  +       N  + ++ ++++ LW  S   +FGS AT L LP SD
Sbjct: 204  LLTEEIKDFVAYISP-SREEIELRNQTIGKIRKAVKKLWKNSDLYVFGSYATDLYLPGSD 262

Query: 1272 VDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKT----VEN 1327
            +D V+          I  AG  E R+ + +              W+K + L T    +  
Sbjct: 263  IDCVI----------ISSAGDKENRHSLYQLS-----------SWLKQNKLATKVEVIAK 301

Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
              +PII  V                           +  +++H D+           S  
Sbjct: 302  ARVPIIKFV---------------------------EPSSNIHIDV-----------SFE 323

Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
             S+ ++AA ++R            L +T          P    L L++KQFL  R L+  
Sbjct: 324  RSNGLEAAKTIR----------DWLNST----------PGLRELVLIIKQFLNSRRLNDV 363

Query: 1448 YSGGLSSYCLMLLITRFL 1465
            + GGL    ++ ++  FL
Sbjct: 364  HLGGLGGLSIICMVYSFL 381


>gi|71657648|ref|XP_817336.1| DNA polymerase sigma [Trypanosoma cruzi strain CL Brener]
 gi|70882521|gb|EAN95485.1| DNA polymerase sigma, putative [Trypanosoma cruzi]
          Length = 680

 Score = 50.8 bits (120), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 25/76 (32%), Positives = 41/76 (53%), Gaps = 1/76 (1%)

Query: 1400 LDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLML 1459
            LD+     +  G + + +V ++   FP + PL +++K FL  R + + Y GGL S+   L
Sbjct: 374  LDVDVSINAEDGQRNSAIVADMLRSFPEARPLIVLVKYFLQQRGMHEPYHGGLGSFATTL 433

Query: 1460 LITRFLQ-HEHHLGRP 1474
            L+  FLQ H  +  RP
Sbjct: 434  LVISFLQNHPIYTERP 449


>gi|255082912|ref|XP_002504442.1| predicted protein [Micromonas sp. RCC299]
 gi|226519710|gb|ACO65700.1| predicted protein [Micromonas sp. RCC299]
          Length = 998

 Score = 50.8 bits (120), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 52/241 (21%), Positives = 93/241 (38%), Gaps = 20/241 (8%)

Query: 1235 INWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLP-SSDVDLVVCLPPVRNLEPIKEAGIL 1293
            ++  ++ +   +Q L+P +   +FGS  T   +P +S++D+ + LP     +P K+   L
Sbjct: 620  VDMLIRTMNAIVQSLYPTASVEVFGSFPTASWVPGASNLDIALSLPEAVTSDPQKK---L 676

Query: 1294 EGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQS 1353
            +  N +       A    AN  WV    +  V +   P+I+L              + Q 
Sbjct: 677  DALNSL-------AVALRANPHWV--SEVNVVPSPFRPLILLTTHSAFFQPPPPPQAAQG 727

Query: 1354 PKEDAAHTTLKHDNH--VHSDMVALDDSASPKCSHTSSDNIKAATS-----VRLDISFKS 1406
            P+         +      +SD               +S             + + IS K 
Sbjct: 728  PRPPPGAPPTANGGSPGANSDSPGAPPLPPGPPPPAASVGFGLGNGGVGLPLEVAISIKG 787

Query: 1407 PSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
              H G      +++   ++PA  P+  V K +LA+R L   Y GG+ SY L L+    +Q
Sbjct: 788  KDHKGAAAMKFLRQAETEYPALAPVLAVQKAYLANRGLRGVYKGGIGSYSLALMALHSMQ 847

Query: 1467 H 1467
             
Sbjct: 848  Q 848


>gi|367040851|ref|XP_003650806.1| hypothetical protein THITE_2110633 [Thielavia terrestris NRRL 8126]
 gi|346998067|gb|AEO64470.1| hypothetical protein THITE_2110633 [Thielavia terrestris NRRL 8126]
          Length = 759

 Score = 50.8 bits (120), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 28/93 (30%), Positives = 47/93 (50%), Gaps = 13/93 (13%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            I+  T +++DISF++   TGL   +  K   EQ+P    L  ++K FL  R L++  +GG
Sbjct: 506  IEHRTGLKVDISFEN--RTGLTAIETFKAWREQYPGMPALVTLIKHFLLMRGLNEPVNGG 563

Query: 1452 LSSYCLMLLITRFLQH-----------EHHLGR 1473
            +  + ++ L+   LQ             HHLG+
Sbjct: 564  IGGFSVICLVVSMLQMMPEVQSGNLDTRHHLGQ 596


>gi|66363120|ref|XP_628526.1| TRF4/5 nucletotidyl transferase [Cryptosporidium parvum Iowa II]
 gi|46229540|gb|EAK90358.1| TRF4/5 nucletotidyl transferase [Cryptosporidium parvum Iowa II]
 gi|323509243|dbj|BAJ77514.1| cgd7_3630 [Cryptosporidium parvum]
 gi|323510373|dbj|BAJ78080.1| cgd7_3630 [Cryptosporidium parvum]
          Length = 480

 Score = 50.8 bits (120), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 27/76 (35%), Positives = 40/76 (52%), Gaps = 2/76 (2%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            + +   + +DIS    S   + TT  + E  ++F    PL +V+K FL  R LD++Y GG
Sbjct: 233  VDSEACISVDISLNQES--SIDTTKHISECLKKFSTLRPLLIVIKLFLRQRGLDETYLGG 290

Query: 1452 LSSYCLMLLITRFLQH 1467
            L SY    L+  FLQ 
Sbjct: 291  LGSYSQFCLVLSFLQQ 306


>gi|67623901|ref|XP_668233.1| hypothetical protein [Cryptosporidium hominis TU502]
 gi|54659437|gb|EAL38014.1| hypothetical protein Chro.70404 [Cryptosporidium hominis]
          Length = 480

 Score = 50.8 bits (120), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 27/76 (35%), Positives = 40/76 (52%), Gaps = 2/76 (2%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            + +   + +DIS    S   + TT  + E  ++F    PL +V+K FL  R LD++Y GG
Sbjct: 233  VDSEACISVDISLNQES--SIDTTKHISECLKKFSTLRPLLIVIKLFLRQRGLDETYLGG 290

Query: 1452 LSSYCLMLLITRFLQH 1467
            L SY    L+  FLQ 
Sbjct: 291  LGSYSQFCLVLSFLQQ 306


>gi|388851758|emb|CCF54564.1| related to TRF4-topoisomerase I-related protein [Ustilago hordei]
          Length = 701

 Score = 50.8 bits (120), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 28/77 (36%), Positives = 43/77 (55%), Gaps = 4/77 (5%)

Query: 1398 VRLDISFKSPSHT-GLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYC 1456
            +++DIS    +HT GL T   V     ++P   PL LV+K  L  R + + +SGGL SY 
Sbjct: 211  LKVDISL---NHTNGLTTASYVNGWLRKWPHIRPLILVIKHLLMQRGMSEVFSGGLGSYS 267

Query: 1457 LMLLITRFLQHEHHLGR 1473
            +++++  FLQ    L R
Sbjct: 268  VIIMVISFLQLHPKLQR 284


>gi|238879008|gb|EEQ42646.1| conserved hypothetical protein [Candida albicans WO-1]
          Length = 603

 Score = 50.4 bits (119), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 53/231 (22%), Positives = 91/231 (39%), Gaps = 75/231 (32%)

Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
            N  +  + + +   WP + T++FGS AT L LP SD+D+VV          + E G  E 
Sbjct: 191  NNVISTLKKEIGKFWPGTETHVFGSCATDLYLPGSDIDMVV----------VSETGDYEN 240

Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
            R     + L   + +L  ++  K  +++ + +  +PII  V  V                
Sbjct: 241  R-----SRLYQLSTFLRTKKLAK--NVEVIASAKVPIIKFVDPV---------------- 277

Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
                       + +H D+           S   ++ + AA  +R            L +T
Sbjct: 278  -----------SELHIDV-----------SFERTNGLDAAKRIR----------RWLIST 305

Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
              ++EL           LV+KQFL  R L+  + GGL  Y  +++   FL+
Sbjct: 306  PGLREL----------VLVIKQFLRSRRLNNVHVGGLGGYATIIMCYHFLR 346


>gi|392580130|gb|EIW73257.1| hypothetical protein TREMEDRAFT_22292, partial [Tremella mesenterica
            DSM 1558]
          Length = 303

 Score = 50.4 bits (119), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 29/109 (26%), Positives = 50/109 (45%), Gaps = 12/109 (11%)

Query: 1370 HSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSH------------TGLQTTDL 1417
            H+    L+D A        +DN+   +  R+ I      H             G+    +
Sbjct: 91   HNKKNLLNDLARTMRYAMITDNVVVISKARVPIIKFVTKHGKLNVDISLNQVNGISAGKI 150

Query: 1418 VKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            + +  +  P +  L LV+K FL+ RS+++ Y+GGL SY ++ L+  FLQ
Sbjct: 151  INQYLDVIPGARQLILVVKAFLSQRSMNEVYTGGLGSYSVICLVISFLQ 199


>gi|68480208|ref|XP_715914.1| hypothetical protein CaO19.8059 [Candida albicans SC5314]
 gi|68480321|ref|XP_715864.1| hypothetical protein CaO19.429 [Candida albicans SC5314]
 gi|46437507|gb|EAK96852.1| hypothetical protein CaO19.429 [Candida albicans SC5314]
 gi|46437559|gb|EAK96903.1| hypothetical protein CaO19.8059 [Candida albicans SC5314]
          Length = 603

 Score = 50.4 bits (119), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 53/231 (22%), Positives = 91/231 (39%), Gaps = 75/231 (32%)

Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
            N  +  + + +   WP + T++FGS AT L LP SD+D+VV          + E G  E 
Sbjct: 191  NNVISTLKKEIGKFWPGTETHVFGSCATDLYLPGSDIDMVV----------VSETGDYEN 240

Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
            R     + L   + +L  ++  K  +++ + +  +PII  V  V                
Sbjct: 241  R-----SRLYQLSTFLRTKKLAK--NVEVIASAKVPIIKFVDPV---------------- 277

Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
                       + +H D+           S   ++ + AA  +R            L +T
Sbjct: 278  -----------SELHIDV-----------SFERTNGLDAAKRIR----------RWLIST 305

Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
              ++EL           LV+KQFL  R L+  + GGL  Y  +++   FL+
Sbjct: 306  PGLREL----------VLVIKQFLRSRRLNNVHVGGLGGYATIIMCYHFLR 346


>gi|315054687|ref|XP_003176718.1| Poly(A) RNA polymerase cid14 [Arthroderma gypseum CBS 118893]
 gi|311338564|gb|EFQ97766.1| Poly(A) RNA polymerase cid14 [Arthroderma gypseum CBS 118893]
          Length = 627

 Score = 50.4 bits (119), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 24/76 (31%), Positives = 43/76 (56%), Gaps = 2/76 (2%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            + + T +++D+SF + S  GL   D  K   +++P    +A ++K FL  R L+   +GG
Sbjct: 387  VDSLTGLKVDLSFDNSS--GLAAIDTFKAWKQEYPTMPVIASIIKHFLLIRGLNDVATGG 444

Query: 1452 LSSYCLMLLITRFLQH 1467
            L  + ++ L+T  LQH
Sbjct: 445  LGGFSIICLVTSILQH 460


>gi|403356668|gb|EJY77933.1| Poly(A) RNA polymerase putative [Oxytricha trifallax]
          Length = 397

 Score = 50.4 bits (119), Expect = 0.007,   Method: Composition-based stats.
 Identities = 22/56 (39%), Positives = 37/56 (66%)

Query: 1411 GLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            GL+   LV  L ++FP   P+  V+K FL +++L + Y+GG+ S+ L+ +IT +LQ
Sbjct: 216  GLKGLSLVTTLIKEFPELRPMYFVIKAFLKNKNLHKPYTGGIGSFVLINMITFYLQ 271


>gi|348684587|gb|EGZ24402.1| hypothetical protein PHYSODRAFT_349922 [Phytophthora sojae]
          Length = 592

 Score = 50.4 bits (119), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 38/125 (30%), Positives = 63/125 (50%), Gaps = 8/125 (6%)

Query: 1215 EEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDL 1274
            EE+  +C        A + +I+ A+  +   ++ +WP +    FGS +TGL LPSSDVD+
Sbjct: 288  EEMARYCAYTQRVVDAARQHIDAAIAHIADGVRKIWPHASVMCFGSYSTGLWLPSSDVDV 347

Query: 1275 VVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQE-WVKSDSLKTVENTAIPII 1333
            VV    + N E  ++       +G+KE  L+  AR L  Q  WV+   +  V    +P+ 
Sbjct: 348  VVM--GLSNSED-RDVPTRFVPDGVKE--LEQLARQLRPQRSWVQ--RVDVVRGAKVPVA 400

Query: 1334 MLVVE 1338
             L++ 
Sbjct: 401  KLILR 405


>gi|403365629|gb|EJY82602.1| PAP-associated domain-containing protein, putative [Oxytricha
            trifallax]
          Length = 1647

 Score = 50.4 bits (119), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 50/211 (23%), Positives = 97/211 (45%), Gaps = 31/211 (14%)

Query: 1257 IFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEW 1316
            ++GS A+ L++  SDVDL +      + + IKE  I E R        +   + L  Q  
Sbjct: 1331 MYGSMASKLAIEESDVDLAII---GLDFKGIKELQIKEMR-----VLYEQLMQVLKAQST 1382

Query: 1317 VKSDSLKTVENTAIPIIMLVVEVP--HDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMV 1374
            V+      +E   IP+I L V++    + I     S +  ++ +  ++++         +
Sbjct: 1383 VQ-----FIETATIPVIKLRVDLEKIRESITCNQESQEHQQQSSGCSSIEDQMRYLGVDI 1437

Query: 1375 ALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALV 1434
              +DS        S +++ ++   R+++        G+Q    +++L        P+ L 
Sbjct: 1438 TFEDS--------SMNSLYSSDGQRINL--------GIQCISYIQQLCAAQAELKPIVLF 1481

Query: 1435 LKQFLADRSLDQSYSGGLSSYCLMLLITRFL 1465
            LK+ L + SL++ Y GGL+SY L+L+ + FL
Sbjct: 1482 LKKLLQNNSLNEPYHGGLNSYSLILMTSAFL 1512


>gi|145488123|ref|XP_001430066.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124397161|emb|CAK62668.1| unnamed protein product [Paramecium tetraurelia]
          Length = 637

 Score = 50.4 bits (119), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 28/99 (28%), Positives = 50/99 (50%), Gaps = 11/99 (11%)

Query: 1372 DMVALDDSASP----KCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPA 1427
            D+  +D++  P    +CS    + +       +DI+     H G +T + + E  ++F  
Sbjct: 393  DVKVIDNAKKPVLKMQCSKEFQEKL-------IDITISRNDHFGRKTANSMMEFQKEFKQ 445

Query: 1428 STPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
               LAL+LK +    +L  +Y GGL+SYC++ +I   LQ
Sbjct: 446  FKSLALMLKFYFKSINLLNAYQGGLNSYCVLTMILALLQ 484



 Score = 49.7 bits (117), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 26/75 (34%), Positives = 45/75 (60%)

Query: 1200 PTRKASLSLMHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFG 1259
            P  K SL  +   L  EI++F ++++     +KP I+  V+ V  ++Q + P+SR  ++G
Sbjct: 299  PQHKISLDDIVKQLTFEINTFTQELSKYLDEQKPIIDKVVQLVDETVQNVQPKSRAFLYG 358

Query: 1260 SNATGLSLPSSDVDL 1274
            S  TGL+L  SD+D+
Sbjct: 359  SCQTGLNLLDSDIDI 373


>gi|428186552|gb|EKX55402.1| hypothetical protein GUITHDRAFT_83772 [Guillardia theta CCMP2712]
          Length = 440

 Score = 50.4 bits (119), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 37/127 (29%), Positives = 60/127 (47%), Gaps = 19/127 (14%)

Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
            LH EI     QV   N+ R       V  V  +++  W  +   ++GS +T L LP SDV
Sbjct: 97   LHSEIVELLNQVYPTNSTRTRR-QRIVDLVDEAVKSCWEGAHVEVYGSFSTDLYLPHSDV 155

Query: 1273 DLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPI 1332
            D++V           KE  I        E+ ++H A  L +  W +   ++++ENT++P+
Sbjct: 156  DVLVI--------GAKEWSI--------ESKMKHLASRLKSMPWCR--YMRSIENTSVPV 197

Query: 1333 IMLVVEV 1339
            I L  +V
Sbjct: 198  IKLSADV 204


>gi|328770392|gb|EGF80434.1| hypothetical protein BATDEDRAFT_25005 [Batrachochytrium dendrobatidis
            JAM81]
          Length = 547

 Score = 50.4 bits (119), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 26/79 (32%), Positives = 44/79 (55%), Gaps = 2/79 (2%)

Query: 1398 VRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCL 1457
            +R+DI   + S  G ++T+L+K    ++P   P+ALVLK  +    LD+ ++GGL  Y L
Sbjct: 161  IRIDIVANTDS--GNESTELMKTWVGKYPTLIPMALVLKLLIRQHRLDKVFTGGLGGYAL 218

Query: 1458 MLLITRFLQHEHHLGRPIN 1476
            + +I   L  + H    I+
Sbjct: 219  LNMIVAVLIRDQHCSDEID 237


>gi|154323524|ref|XP_001561076.1| hypothetical protein BC1G_00161 [Botryotinia fuckeliana B05.10]
          Length = 692

 Score = 50.4 bits (119), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 28/92 (30%), Positives = 47/92 (51%), Gaps = 13/92 (14%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            + A T +++D+SF++   TGL      +E   QFPA   L  ++K  LA R L++  +GG
Sbjct: 524  VDAITGLKVDMSFEN--DTGLIANKTFQEWKAQFPAMPILVTIIKHLLAMRGLNEPVNGG 581

Query: 1452 LSSYCLMLLITRFLQH-----------EHHLG 1472
            +  + +  L+   LQ+           EH+LG
Sbjct: 582  IGGFSVTCLVVSLLQNLPQVKSGTMVPEHNLG 613


>gi|429966451|gb|ELA48448.1| hypothetical protein VCUG_00057 [Vavraia culicis 'floridensis']
          Length = 400

 Score = 50.4 bits (119), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 28/67 (41%), Positives = 38/67 (56%), Gaps = 2/67 (2%)

Query: 1399 RLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLM 1458
            R D+S    S  G+  T  VK++ +Q P    +AL LK FL  R L++S  GGL SY  +
Sbjct: 206  RYDLSLNQDS--GIVHTKFVKKVLQQQPYIKDIALFLKYFLKSRGLNESKRGGLCSYAQL 263

Query: 1459 LLITRFL 1465
            L+I  FL
Sbjct: 264  LMIISFL 270


>gi|393218042|gb|EJD03530.1| Nucleotidyltransferase [Fomitiporia mediterranea MF3/22]
          Length = 534

 Score = 50.4 bits (119), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 25/66 (37%), Positives = 36/66 (54%), Gaps = 1/66 (1%)

Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
            LH+E+  F   +A     R+   +  V R+   L+  WP ++  +FGSNATGL L   D+
Sbjct: 56   LHDELVQFIGYIAPSKQEREAR-SMLVARIEDLLKSRWPDAQVTVFGSNATGLELTGGDI 114

Query: 1273 DLVVCL 1278
            DLVV  
Sbjct: 115  DLVVAF 120


>gi|71005312|ref|XP_757322.1| hypothetical protein UM01175.1 [Ustilago maydis 521]
 gi|46096726|gb|EAK81959.1| hypothetical protein UM01175.1 [Ustilago maydis 521]
          Length = 730

 Score = 50.4 bits (119), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 26/70 (37%), Positives = 41/70 (58%), Gaps = 4/70 (5%)

Query: 1398 VRLDISFKSPSHT-GLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYC 1456
            +++DIS    +HT GL T   V     ++P   PL LV+K  L  R + + +SGGL SY 
Sbjct: 211  LKVDISL---NHTNGLTTASYVNSWLRKWPHIRPLILVVKYLLMQRGMSEVFSGGLGSYS 267

Query: 1457 LMLLITRFLQ 1466
            +++++  FLQ
Sbjct: 268  VIIMVISFLQ 277


>gi|134080951|emb|CAK41466.1| unnamed protein product [Aspergillus niger]
          Length = 614

 Score = 50.4 bits (119), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 26/75 (34%), Positives = 43/75 (57%), Gaps = 2/75 (2%)

Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
            T +R+D+SF + S  GL   +  ++   ++PA   +  V+KQFL  R L++  +GGL  +
Sbjct: 311  TGLRVDLSFDNDS--GLVANETFQQWKAEYPAMPVIVSVIKQFLLLRGLNEVPTGGLGGF 368

Query: 1456 CLMLLITRFLQHEHH 1470
             +  L+T  LQH  H
Sbjct: 369  SITCLVTSLLQHLPH 383


>gi|358373036|dbj|GAA89636.1| topoisomerase family protein Trf4 [Aspergillus kawachii IFO 4308]
          Length = 682

 Score = 50.1 bits (118), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 26/79 (32%), Positives = 44/79 (55%), Gaps = 2/79 (2%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            +   T +R+D+SF + S  GL   +  ++   ++PA   +  V+KQFL  R L++  +GG
Sbjct: 375  VDKLTGLRVDLSFDNDS--GLVANETFQQWKAEYPAMPVIVSVIKQFLLLRGLNEVPTGG 432

Query: 1452 LSSYCLMLLITRFLQHEHH 1470
            L  + +  L+T  LQH  H
Sbjct: 433  LGGFSITCLVTSLLQHLPH 451


>gi|168036465|ref|XP_001770727.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162677945|gb|EDQ64409.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 1560

 Score = 50.1 bits (118), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 74/263 (28%), Positives = 117/263 (44%), Gaps = 35/263 (13%)

Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
            LH EI  F    +    AR      A+  V  +++ LWP +   +FGS ATGL L  SDV
Sbjct: 1039 LHSEILEFAHTASPSAEARS-CAEAAIDCVRGAVKKLWPNADVEVFGSYATGLCLSHSDV 1097

Query: 1273 DLVVCLPPVRNLEPIK---EAGILEGRNGIKETCLQHAARYLA----NQEWVKSDSLKTV 1325
            D+VV   P     P+K     G+   R  +  + L  + R L     + +W K  S++T+
Sbjct: 1098 DVVVVDAP----PPLKLPDTTGLTGTR--LVASLLAPSIRLLGAALQDYDWCK--SIRTI 1149

Query: 1326 ENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCS 1385
            ++  +P+I L       L+ S          D A   +K D  +       +  A+   S
Sbjct: 1150 DSATMPVIKLQCR---PLVTSL---------DPAAPVVKIDITIGGKKTNEEAVATASQS 1197

Query: 1386 HTSSDNIKAATSVRLDISFKSPS--HTGLQTTDLVKELTEQFPASTPLALVLKQFLADRS 1443
               S+     TS  +D  F+     H G    + V +  +Q PA  PL L+LK +L  + 
Sbjct: 1198 SEGSE-----TSRHVDSQFELARKFHNGAAAREYVIKRLQQQPALAPLVLLLKSYLQSKG 1252

Query: 1444 LDQSYSGGLSSYCLMLLITRFLQ 1466
            L   Y+GGL S+ L +++  +L+
Sbjct: 1253 LKDVYTGGLGSFSLTIMLVFYLE 1275


>gi|347830146|emb|CCD45843.1| hypothetical protein [Botryotinia fuckeliana]
          Length = 788

 Score = 50.1 bits (118), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 28/92 (30%), Positives = 47/92 (51%), Gaps = 13/92 (14%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            + A T +++D+SF++   TGL      +E   QFPA   L  ++K  LA R L++  +GG
Sbjct: 564  VDAITGLKVDMSFEN--DTGLIANKTFQEWKAQFPAMPILVTIIKHLLAMRGLNEPVNGG 621

Query: 1452 LSSYCLMLLITRFLQH-----------EHHLG 1472
            +  + +  L+   LQ+           EH+LG
Sbjct: 622  IGGFSVTCLVVSLLQNLPQVKSGTMVPEHNLG 653


>gi|350638915|gb|EHA27270.1| hypothetical protein ASPNIDRAFT_44704 [Aspergillus niger ATCC 1015]
          Length = 682

 Score = 50.1 bits (118), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 26/75 (34%), Positives = 43/75 (57%), Gaps = 2/75 (2%)

Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
            T +R+D+SF + S  GL   +  ++   ++PA   +  V+KQFL  R L++  +GGL  +
Sbjct: 379  TGLRVDLSFDNDS--GLVANETFQQWKAEYPAMPVIVSVIKQFLLLRGLNEVPTGGLGGF 436

Query: 1456 CLMLLITRFLQHEHH 1470
             +  L+T  LQH  H
Sbjct: 437  SITCLVTSLLQHLPH 451


>gi|317034217|ref|XP_001396205.2| topoisomerase family protein TRF4 [Aspergillus niger CBS 513.88]
          Length = 682

 Score = 50.1 bits (118), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 26/75 (34%), Positives = 43/75 (57%), Gaps = 2/75 (2%)

Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
            T +R+D+SF + S  GL   +  ++   ++PA   +  V+KQFL  R L++  +GGL  +
Sbjct: 379  TGLRVDLSFDNDS--GLVANETFQQWKAEYPAMPVIVSVIKQFLLLRGLNEVPTGGLGGF 436

Query: 1456 CLMLLITRFLQHEHH 1470
             +  L+T  LQH  H
Sbjct: 437  SITCLVTSLLQHLPH 451


>gi|241948905|ref|XP_002417175.1| topoisomerase 1-related protein TRF4, putative [Candida dubliniensis
            CD36]
 gi|223640513|emb|CAX44767.1| topoisomerase 1-related protein TRF4, putative [Candida dubliniensis
            CD36]
          Length = 606

 Score = 50.1 bits (118), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 52/231 (22%), Positives = 91/231 (39%), Gaps = 75/231 (32%)

Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
            N  +  +   + + WP + T++FGS AT L LP SD+D+VV          + E G  E 
Sbjct: 194  NNVISTLKTEIGMFWPGTETHVFGSCATDLYLPGSDIDMVV----------VSETGDYEN 243

Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
            R     + L   + +L  ++  K  +++ + +  +PII  V  +                
Sbjct: 244  R-----SRLYQLSTFLRTKKLAK--NVEVIASAKVPIIKFVDPI---------------- 280

Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
                       + +H D+           S   ++ + AA  +R            L +T
Sbjct: 281  -----------SELHIDV-----------SFERTNGLDAAKRIR----------RWLIST 308

Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
              ++EL           LV+KQFL  R L+  + GGL  Y  +++   FL+
Sbjct: 309  PGLRELV----------LVIKQFLRSRRLNNVHVGGLGGYATIIMCYHFLR 349


>gi|115395044|ref|XP_001213471.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
 gi|114193040|gb|EAU34740.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
          Length = 680

 Score = 50.1 bits (118), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 26/79 (32%), Positives = 45/79 (56%), Gaps = 2/79 (2%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            +   T +R+D+SF + S  GL   +  ++  +++PA   +  V+KQFL  R L++  +GG
Sbjct: 371  VDKLTGLRVDLSFDNDS--GLIANNTFQQWKQEYPAMPVIVSVVKQFLLLRGLNEVPTGG 428

Query: 1452 LSSYCLMLLITRFLQHEHH 1470
            L  + +  L+T  LQH  H
Sbjct: 429  LGGFSITCLVTSLLQHLPH 447


>gi|403218109|emb|CCK72601.1| hypothetical protein KNAG_0K02380 [Kazachstania naganishii CBS 8797]
          Length = 632

 Score = 50.1 bits (118), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 59/256 (23%), Positives = 106/256 (41%), Gaps = 82/256 (32%)

Query: 1213 LHEEIDSFCKQVA---AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
            L +EI  F   ++   +E  +R    N AV+++ ++++ LW  S  ++FGS AT L +P 
Sbjct: 165  LTQEIKDFVAYISPTGSEIISR----NRAVQKIRKAVRSLWRDSDLHVFGSYATDLYMPG 220

Query: 1270 SDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTA 1329
            SD+D VV             +  ++  N      L   AR+L ++       ++ +  T 
Sbjct: 221  SDIDCVV------------NSTSMDKENT---QYLYELARHLRDENLAV--QIEVISRTR 263

Query: 1330 IPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSS 1389
            +PII  +   PH                         +++H D+           S    
Sbjct: 264  VPIIKFI--EPH-------------------------SNLHIDV-----------SFERL 285

Query: 1390 DNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYS 1449
            + I+AA  +R            L+ T  ++EL           L++KQFLA R L+  ++
Sbjct: 286  NGIEAARLIR----------GWLRETPGLRELV----------LIIKQFLAARRLNDVHT 325

Query: 1450 GGLSSYCLMLLITRFL 1465
            GGL  + ++ L+  F+
Sbjct: 326  GGLGGFSIICLVYSFM 341


>gi|407835610|gb|EKF99338.1| DNA polymerase sigma, putative [Trypanosoma cruzi]
          Length = 398

 Score = 50.1 bits (118), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 25/76 (32%), Positives = 41/76 (53%), Gaps = 1/76 (1%)

Query: 1400 LDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLML 1459
            LD+     +  G + + +V ++   FP + PL +++K FL  R + + Y GGL S+   L
Sbjct: 92   LDVDVSINAEDGQRNSAIVADMLRSFPEARPLIVLVKYFLQQRGMHEPYHGGLGSFATTL 151

Query: 1460 LITRFLQ-HEHHLGRP 1474
            L+  FLQ H  +  RP
Sbjct: 152  LVISFLQNHPIYTERP 167


>gi|302762448|ref|XP_002964646.1| hypothetical protein SELMODRAFT_22150 [Selaginella moellendorffii]
 gi|300168375|gb|EFJ34979.1| hypothetical protein SELMODRAFT_22150 [Selaginella moellendorffii]
          Length = 313

 Score = 50.1 bits (118), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 34/121 (28%), Positives = 61/121 (50%), Gaps = 16/121 (13%)

Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
            LH+EI  FC  ++      +     A+ R++ +++ LW   +  IFGS ATGL LP+SD+
Sbjct: 8    LHQEILDFCTFIS-PTVEEEAERAAAINRISNAIKSLWSHCKIEIFGSYATGLYLPTSDI 66

Query: 1273 DLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDS----LKTVENT 1328
            D V+            ++G     +G++   ++   R LA +  + S +    +K VE+T
Sbjct: 67   DAVIL-----------DSGCDRIVSGLRAISIELRKRKLARKITIISKARVPIIKFVEST 115

Query: 1329 A 1329
            +
Sbjct: 116  S 116



 Score = 48.1 bits (113), Expect = 0.037,   Method: Compositional matrix adjust.
 Identities = 26/84 (30%), Positives = 42/84 (50%), Gaps = 5/84 (5%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            +++ + +  DISF      G +    ++      P   PL +VLK FL  R L++ Y GG
Sbjct: 112  VESTSGISFDISFDL--RNGPEAAKFIQRAIGTLPPLRPLCMVLKIFLQQRELNEVYCGG 169

Query: 1452 LSSYCLMLLITRFLQH---EHHLG 1472
            + SY L++++   LQ     HH G
Sbjct: 170  IGSYGLLIMLITHLQMHPSRHHGG 193


>gi|302815631|ref|XP_002989496.1| hypothetical protein SELMODRAFT_11961 [Selaginella moellendorffii]
 gi|300142674|gb|EFJ09372.1| hypothetical protein SELMODRAFT_11961 [Selaginella moellendorffii]
          Length = 307

 Score = 50.1 bits (118), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 34/121 (28%), Positives = 61/121 (50%), Gaps = 16/121 (13%)

Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
            LH+EI  FC  ++      +     A+ R++ +++ LW   +  IFGS ATGL LP+SD+
Sbjct: 8    LHQEILDFCTFISP-TVEEEAERAAAINRISNAIKSLWSHCKIEIFGSYATGLYLPTSDI 66

Query: 1273 DLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDS----LKTVENT 1328
            D V+            ++G     +G++   ++   R LA +  + S +    +K VE+T
Sbjct: 67   DAVIL-----------DSGCDRIVSGLRAISIELRKRKLARKITIISKARVPIIKFVEST 115

Query: 1329 A 1329
            +
Sbjct: 116  S 116



 Score = 48.1 bits (113), Expect = 0.040,   Method: Compositional matrix adjust.
 Identities = 26/84 (30%), Positives = 42/84 (50%), Gaps = 5/84 (5%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            +++ + +  DISF      G +    ++      P   PL +VLK FL  R L++ Y GG
Sbjct: 112  VESTSGISFDISFDL--RNGPEAAKFIQRAIGTLPPLRPLCMVLKIFLQQRELNEVYCGG 169

Query: 1452 LSSYCLMLLITRFLQH---EHHLG 1472
            + SY L++++   LQ     HH G
Sbjct: 170  IGSYGLLIMLITHLQMHPSRHHGG 193


>gi|406862982|gb|EKD16031.1| topoisomerase family protein [Marssonina brunnea f. sp.
            'multigermtubi' MB_m1]
          Length = 868

 Score = 50.1 bits (118), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 63/272 (23%), Positives = 97/272 (35%), Gaps = 79/272 (29%)

Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
            LH+EI  F   V   +  ++  +       ++  Q  +       FGS   GL LP+SD+
Sbjct: 539  LHKEIMDFYDYVKPRDFEQEIRLRLVEDLRSKFKQYFYSDDDIQPFGSFPAGLYLPTSDM 598

Query: 1273 DLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQ-EWVKSDSLKTVENTAIP 1331
            DLV           I    I +GR            R LAN  E++K             
Sbjct: 599  DLVC----------ISREYIADGRK-----IFGQGTRGLANFCEFLKK------------ 631

Query: 1332 IIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDN 1391
                  E P D      S  + P                  +V   DS            
Sbjct: 632  -----YEFPLDKKIEVISKAKVP------------------LVKYMDSL----------- 657

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
                T +++DISF++   TG+      +   ++FPA   L  ++K  LA R L++  +GG
Sbjct: 658  ----TGLKVDISFEN--TTGIVANTTFQNWKKEFPAVPILVTLIKHLLAMRGLNEPVNGG 711

Query: 1452 LSSYCLMLLITRFLQH-----------EHHLG 1472
            +  + +  L+   LQ+           EHHLG
Sbjct: 712  IGGFSVTCLVVSLLQNMPQVQSRNLVPEHHLG 743


>gi|340518876|gb|EGR49116.1| hypothetical protein TRIREDRAFT_61322 [Trichoderma reesei QM6a]
          Length = 679

 Score = 50.1 bits (118), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 26/92 (28%), Positives = 48/92 (52%), Gaps = 13/92 (14%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            +  AT +R+D+SF++    G++  +  +   +++PA   L  V+K +L  R L++  +GG
Sbjct: 463  VDKATGLRVDVSFENLG--GVRAVETFQRWKKEYPAMPALVTVIKHYLLMRGLNEPVNGG 520

Query: 1452 LSSYCLMLLITRFLQ-----------HEHHLG 1472
            L  + ++ L+   LQ            EHHLG
Sbjct: 521  LGGFTVICLVVSMLQLMPEVQSGSLVPEHHLG 552


>gi|145346336|ref|XP_001417645.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144577873|gb|ABO95938.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 781

 Score = 50.1 bits (118), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 26/69 (37%), Positives = 34/69 (49%)

Query: 1398 VRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCL 1457
            + + IS K  +H GL +   V+   EQ  A  PL  V K  LA + L   Y GGL SY L
Sbjct: 561  LEVHISLKDANHKGLSSMQFVQAAEEQHGALAPLVCVQKAVLASKGLRGVYRGGLGSYAL 620

Query: 1458 MLLITRFLQ 1466
             L+    +Q
Sbjct: 621  TLMALTAIQ 629


>gi|328860813|gb|EGG09918.1| hypothetical protein MELLADRAFT_115680 [Melampsora larici-populina
            98AG31]
          Length = 987

 Score = 50.1 bits (118), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 61/262 (23%), Positives = 102/262 (38%), Gaps = 37/262 (14%)

Query: 1209 MHSLLHEEIDSFCKQVA---AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGL 1265
            +  +L  EI SF   +     E+  R   I    K VT    + WP +    FGS  T L
Sbjct: 230  VQQMLTAEIGSFVAYIRPTREEDELRLMIIEMIRKAVT----MQWPDADVVPFGSFGTKL 285

Query: 1266 SLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTV 1325
             LP  D+DLV+      +   +K+A         K   L   A  L  Q       +  +
Sbjct: 286  YLPGGDIDLVIL-----STRMMKDA---------KSKILYRLAPLLREQNI--GQDVVVI 329

Query: 1326 ENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVAL-DDSASPKC 1384
                +PII         +  +    + S  +      L+  N +  D+  L  D  + + 
Sbjct: 330  AKAKVPIIKFKT-----IFGNFQVDI-SINQSNGLVALEKVNELLDDVKYLSKDLRAEQR 383

Query: 1385 SHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSL 1444
             H  S N     +   ++          +  + + ++ E+  A+  + LV+K  L  R +
Sbjct: 384  HHQHSRNQSKRRNEETNLEMD-------EEEEAISKVVEELGAAKCMILVIKSVLKQRGM 436

Query: 1445 DQSYSGGLSSYCLMLLITRFLQ 1466
            ++ YSGGL SY ++ L+  FLQ
Sbjct: 437  NEVYSGGLGSYSIICLVISFLQ 458


>gi|407044805|gb|EKE42834.1| nucleotidyltransferase domain containing protein [Entamoeba nuttalli
            P19]
          Length = 338

 Score = 49.7 bits (117), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 24/68 (35%), Positives = 38/68 (55%)

Query: 1400 LDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLML 1459
            L I        G+ TT  +K L  +FP +  L+L +K  L   SL++ Y GGL SY ++L
Sbjct: 131  LSIDISVDCDNGIVTTSYIKTLLTEFPLARTLSLFIKFLLFQNSLNEPYHGGLGSYAIIL 190

Query: 1460 LITRFLQH 1467
            L+  +L++
Sbjct: 191  LVCTYLKN 198


>gi|365758850|gb|EHN00675.1| Trf5p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
          Length = 642

 Score = 49.7 bits (117), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 24/74 (32%), Positives = 42/74 (56%), Gaps = 2/74 (2%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            I+  + + +D+SF+  +  GL+   L++E     P    L LV+KQFL  R L+  ++GG
Sbjct: 281  IEPLSQLHIDVSFERTN--GLEAARLIREWLRDSPGLRELVLVIKQFLHSRRLNNVHTGG 338

Query: 1452 LSSYCLMLLITRFL 1465
            L  + ++ L+  FL
Sbjct: 339  LGGFTVICLVYSFL 352


>gi|401837953|gb|EJT41787.1| TRF5-like protein [Saccharomyces kudriavzevii IFO 1802]
          Length = 642

 Score = 49.7 bits (117), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 24/74 (32%), Positives = 42/74 (56%), Gaps = 2/74 (2%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            I+  + + +D+SF+  +  GL+   L++E     P    L LV+KQFL  R L+  ++GG
Sbjct: 281  IEPLSQLHIDVSFERTN--GLEAARLIREWLRDSPGLRELVLVIKQFLHSRRLNNVHTGG 338

Query: 1452 LSSYCLMLLITRFL 1465
            L  + ++ L+  FL
Sbjct: 339  LGGFTVICLVYSFL 352


>gi|67465459|ref|XP_648914.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
 gi|56465219|gb|EAL43527.1| hypothetical protein, conserved [Entamoeba histolytica HM-1:IMSS]
 gi|449703139|gb|EMD43640.1| PAPassociated domain containing protein [Entamoeba histolytica KU27]
          Length = 338

 Score = 49.7 bits (117), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 24/68 (35%), Positives = 38/68 (55%)

Query: 1400 LDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLML 1459
            L I        G+ TT  +K L  +FP +  L+L +K  L   SL++ Y GGL SY ++L
Sbjct: 131  LSIDISVDCDNGIVTTSYIKTLLTEFPLARTLSLFIKFLLFQNSLNEPYHGGLGSYAIIL 190

Query: 1460 LITRFLQH 1467
            L+  +L++
Sbjct: 191  LVCTYLKN 198


>gi|295671679|ref|XP_002796386.1| Poly(A) RNA polymerase cid14 [Paracoccidioides sp. 'lutzii' Pb01]
 gi|226283366|gb|EEH38932.1| Poly(A) RNA polymerase cid14 [Paracoccidioides sp. 'lutzii' Pb01]
          Length = 725

 Score = 49.7 bits (117), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 26/77 (33%), Positives = 43/77 (55%), Gaps = 2/77 (2%)

Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
            T +++D+SF +   TGL      +    QFPA   +  ++KQFL  R L++  +GGL  +
Sbjct: 413  TGLKVDLSFDN--STGLAANKTFQVWKSQFPAMPVIVSIIKQFLLLRGLNEVPTGGLGGF 470

Query: 1456 CLMLLITRFLQHEHHLG 1472
             ++ L+T  LQH  + G
Sbjct: 471  SIICLVTSLLQHLPYSG 487


>gi|255953511|ref|XP_002567508.1| Pc21g04620 [Penicillium chrysogenum Wisconsin 54-1255]
 gi|211589219|emb|CAP95359.1| Pc21g04620 [Penicillium chrysogenum Wisconsin 54-1255]
          Length = 708

 Score = 49.7 bits (117), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 26/76 (34%), Positives = 42/76 (55%), Gaps = 2/76 (2%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            +   T +R+D+SF + S  GL   D  ++   Q+P    +  V+KQFL  R L++  +GG
Sbjct: 366  VDKLTGLRVDLSFDNDS--GLIANDTFQKWKTQYPMMPVILAVIKQFLLIRGLNEVPTGG 423

Query: 1452 LSSYCLMLLITRFLQH 1467
            L  + +  L+T  LQH
Sbjct: 424  LGGFSITCLVTSLLQH 439


>gi|452823485|gb|EME30495.1| DNA polymerase sigma subunit [Galdieria sulphuraria]
          Length = 417

 Score = 49.7 bits (117), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 27/75 (36%), Positives = 42/75 (56%), Gaps = 6/75 (8%)

Query: 1206 LSLMHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGL 1265
            ++L + +L  E++ F      E   RK  I    +RVT  ++ +WP S  ++FGS AT L
Sbjct: 95   IALHYEIL--ELERFVSGTREETKQRKQLI----ERVTEIIRQIWPNSSVHVFGSFATNL 148

Query: 1266 SLPSSDVDLVVCLPP 1280
             LP+SD+DL +   P
Sbjct: 149  YLPTSDIDLCILSSP 163



 Score = 44.7 bits (104), Expect = 0.39,   Method: Compositional matrix adjust.
 Identities = 23/83 (27%), Positives = 47/83 (56%), Gaps = 7/83 (8%)

Query: 1373 MVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLA 1432
            ++A+D +  P    T  +     T ++ DISF   +  G++    +++  +++P+  PL 
Sbjct: 188  VMAIDKARVPIIKVTDRE-----TGIQCDISFGRTN--GIENVRHIQKYLKRYPSLRPLM 240

Query: 1433 LVLKQFLADRSLDQSYSGGLSSY 1455
            +V+K FL  R+L++ + GG+ SY
Sbjct: 241  MVIKCFLHQRALNEVHEGGIGSY 263


>gi|226288571|gb|EEH44083.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb18]
          Length = 684

 Score = 49.7 bits (117), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 27/77 (35%), Positives = 43/77 (55%), Gaps = 2/77 (2%)

Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
            T +++D+SF +   TGL      +    QFPA   +  V+KQFL  R L++  +GGL  +
Sbjct: 375  TGLKVDLSFDN--STGLAANKTFQVWKSQFPAMPVIVSVIKQFLLLRGLNEVPTGGLGGF 432

Query: 1456 CLMLLITRFLQHEHHLG 1472
             ++ L+T  LQH  + G
Sbjct: 433  SIICLVTSLLQHLPYSG 449


>gi|119482616|ref|XP_001261336.1| topoisomerase family protein TRF4, putative [Neosartorya fischeri
            NRRL 181]
 gi|119409491|gb|EAW19439.1| topoisomerase family protein TRF4, putative [Neosartorya fischeri
            NRRL 181]
          Length = 701

 Score = 49.7 bits (117), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 26/79 (32%), Positives = 43/79 (54%), Gaps = 2/79 (2%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            +   T +R+D+SF + S  GL   +  +    ++PA   +  V+KQFL  R L++  +GG
Sbjct: 374  VDKMTGLRVDLSFDNDS--GLIANNTFQNWKSEYPAMPVIVAVVKQFLLLRGLNEVPTGG 431

Query: 1452 LSSYCLMLLITRFLQHEHH 1470
            L  + +  L+T  LQH  H
Sbjct: 432  LGGFSITCLVTSLLQHLPH 450


>gi|159123133|gb|EDP48253.1| topoisomerase family protein TRF4, putative [Aspergillus fumigatus
            A1163]
          Length = 703

 Score = 49.7 bits (117), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 26/79 (32%), Positives = 43/79 (54%), Gaps = 2/79 (2%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            +   T +R+D+SF + S  GL   +  +    ++PA   +  V+KQFL  R L++  +GG
Sbjct: 376  VDKMTGLRVDLSFDNDS--GLIANNTFQNWKSEYPAMPVIVAVVKQFLLLRGLNEVPTGG 433

Query: 1452 LSSYCLMLLITRFLQHEHH 1470
            L  + +  L+T  LQH  H
Sbjct: 434  LGGFSITCLVTSLLQHLPH 452


>gi|167391312|ref|XP_001739718.1| PAP-associated domain-containing protein [Entamoeba dispar SAW760]
 gi|165896477|gb|EDR23882.1| PAP-associated domain-containing protein, putative [Entamoeba dispar
            SAW760]
          Length = 337

 Score = 49.7 bits (117), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 24/68 (35%), Positives = 38/68 (55%)

Query: 1400 LDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLML 1459
            L I        G+ TT  +K L  +FP +  L+L +K  L   SL++ Y GGL SY ++L
Sbjct: 131  LSIDISVDCDNGIVTTTYIKTLLTEFPLARTLSLFIKFLLFQNSLNEPYHGGLGSYAIIL 190

Query: 1460 LITRFLQH 1467
            L+  +L++
Sbjct: 191  LVCTYLKN 198


>gi|70987233|ref|XP_749095.1| topoisomerase family protein TRF4 [Aspergillus fumigatus Af293]
 gi|66846725|gb|EAL87057.1| topoisomerase family protein TRF4, putative [Aspergillus fumigatus
            Af293]
          Length = 702

 Score = 49.7 bits (117), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 26/79 (32%), Positives = 43/79 (54%), Gaps = 2/79 (2%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            +   T +R+D+SF + S  GL   +  +    ++PA   +  V+KQFL  R L++  +GG
Sbjct: 375  VDKMTGLRVDLSFDNDS--GLIANNTFQNWKSEYPAMPVIVAVVKQFLLLRGLNEVPTGG 432

Query: 1452 LSSYCLMLLITRFLQHEHH 1470
            L  + +  L+T  LQH  H
Sbjct: 433  LGGFSITCLVTSLLQHLPH 451


>gi|320587855|gb|EFX00330.1| topoisomerase family protein [Grosmannia clavigera kw1407]
          Length = 816

 Score = 49.7 bits (117), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 27/92 (29%), Positives = 47/92 (51%), Gaps = 13/92 (14%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            +   T +++D+SF++   TG+       +  EQ+PA   L  ++KQFL  R L++  +GG
Sbjct: 596  VDVKTWLKVDVSFEN--MTGVAAVRTFTKWREQYPAMPVLVTLVKQFLTMRGLNEPVNGG 653

Query: 1452 LSSYCLMLLITRFLQ-----------HEHHLG 1472
            +  + ++ L+   LQ            EHHLG
Sbjct: 654  IGGFSMICLVVSMLQLMPPIQSKDMKGEHHLG 685


>gi|365982357|ref|XP_003668012.1| hypothetical protein NDAI_0A06140 [Naumovozyma dairenensis CBS 421]
 gi|343766778|emb|CCD22769.1| hypothetical protein NDAI_0A06140 [Naumovozyma dairenensis CBS 421]
          Length = 684

 Score = 49.7 bits (117), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 51/230 (22%), Positives = 88/230 (38%), Gaps = 75/230 (32%)

Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
            N  + ++ ++++ LW  S+ +IFGS AT L LP SD+D VV            + G  E 
Sbjct: 227  NKTISKLRKAVKELWSDSQLHIFGSYATDLYLPGSDIDCVVN----------SKMGDKEQ 276

Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
            R       L   AR+L  +    +  ++ +    +PII  V                   
Sbjct: 277  RQ-----YLYDLARHLKQKGL--TSQVEVIAKARVPIIKFV------------------- 310

Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
                    +  + +H D+           S   ++ ++AA  +R                
Sbjct: 311  --------EKSSQIHIDV-----------SFERTNGVEAAKLIR---------------- 335

Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFL 1465
                E     P    L L++KQFL+ R L+  ++GGL  + ++ L+  FL
Sbjct: 336  ----EWLSATPGLRELILIVKQFLSARRLNDVHTGGLGGFTIICLVYSFL 381


>gi|366988339|ref|XP_003673936.1| hypothetical protein NCAS_0A09970 [Naumovozyma castellii CBS 4309]
 gi|342299799|emb|CCC67555.1| hypothetical protein NCAS_0A09970 [Naumovozyma castellii CBS 4309]
          Length = 683

 Score = 49.7 bits (117), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 23/75 (30%), Positives = 44/75 (58%), Gaps = 2/75 (2%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            ++  + + +D+SF+  +  G +   L++E  +  P    L LVLKQFLA + L++  +GG
Sbjct: 303  VEPRSKIHIDVSFEKTN--GAEAAKLIREWIKDTPGLRELVLVLKQFLAVKKLNEVVNGG 360

Query: 1452 LSSYCLMLLITRFLQ 1466
            L  + ++ L+  FL+
Sbjct: 361  LGGFSIICLVYAFLR 375


>gi|444317060|ref|XP_004179187.1| hypothetical protein TBLA_0B08530 [Tetrapisispora blattae CBS 6284]
 gi|387512227|emb|CCH59668.1| hypothetical protein TBLA_0B08530 [Tetrapisispora blattae CBS 6284]
          Length = 461

 Score = 49.7 bits (117), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 26/78 (33%), Positives = 44/78 (56%), Gaps = 3/78 (3%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            I+  +++ +DISF+  +  GL+   L++   E  P    L L++KQFL  R L+   +GG
Sbjct: 176  IEPNSNIHIDISFERTN--GLEAAKLIRSWLETTPGLRELVLIIKQFLHARRLNNVRTGG 233

Query: 1452 LSSYCLMLLITRFLQHEH 1469
            L  + ++ L+  FL H H
Sbjct: 234  LGGFSIICLVYSFL-HLH 250


>gi|145519187|ref|XP_001445460.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124412915|emb|CAK78063.1| unnamed protein product [Paramecium tetraurelia]
          Length = 1073

 Score = 49.7 bits (117), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 59/272 (21%), Positives = 121/272 (44%), Gaps = 47/272 (17%)

Query: 1209 MHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLP 1268
            + S+  +EID F +   A + +        ++RV   ++  +    T IFGS+ TGL+L 
Sbjct: 723  VKSIYQKEIDQFNRYKVARDIS--------IQRVQHVIKSYFQNCETEIFGSSTTGLALK 774

Query: 1269 SSDVDLVV----CLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKT 1324
             SDVD+VV         +  EP+++  ++E  + +K T                    K 
Sbjct: 775  DSDVDMVVYGLHVYTKQQLFEPMRK--LIEIFSELKWTV-----------------QCKH 815

Query: 1325 VENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKC 1384
            +   ++P+I ++V+   D +        S K +  +  ++  N      + L+       
Sbjct: 816  IFQASVPLIKVLVDPSIDFL--------SFKGEPKYILMQCRN------LDLNLKYGDPS 861

Query: 1385 SHTSSD-NIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRS 1443
             H   D   +     R+   +    + G Q+T    ++ ++    + +A+ LK+ L  + 
Sbjct: 862  QHIFIDITFELTPPYRIYNPYIQAYNIGFQSTQYTIDICDKIQGFSEVAIYLKKLLKIKD 921

Query: 1444 LDQSYSGGLSSYCLMLLITRFLQHEHHLGRPI 1475
            L+ SY+GG+SS+CL +++    Q +H++G+ +
Sbjct: 922  LNDSYTGGISSFCLTIMLAAIGQ-DHNIGQKL 952


>gi|71659078|ref|XP_821264.1| DNA polymerase sigma [Trypanosoma cruzi strain CL Brener]
 gi|70886638|gb|EAN99413.1| DNA polymerase sigma, putative [Trypanosoma cruzi]
          Length = 685

 Score = 49.7 bits (117), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 25/76 (32%), Positives = 41/76 (53%), Gaps = 1/76 (1%)

Query: 1400 LDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLML 1459
            LD+     +  G + + +V ++   FP + PL +++K FL  R + + Y GGL S+   L
Sbjct: 379  LDVDVSINAEDGQRNSAIVADMLCSFPEARPLIVLVKYFLQQRGMHEPYHGGLGSFATTL 438

Query: 1460 LITRFLQ-HEHHLGRP 1474
            L+  FLQ H  +  RP
Sbjct: 439  LVISFLQNHPIYTERP 454


>gi|190407236|gb|EDV10503.1| DNA polymerase sigma [Saccharomyces cerevisiae RM11-1a]
 gi|259149371|emb|CAY86175.1| Pap2p [Saccharomyces cerevisiae EC1118]
          Length = 584

 Score = 49.3 bits (116), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 23/78 (29%), Positives = 45/78 (57%), Gaps = 3/78 (3%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            ++  + + +D+SF+  +  G++   L++E  +  P    L L++KQFL  R L+  ++GG
Sbjct: 284  VEPHSGIHIDVSFERTN--GIEAAKLIREWLDDTPGLRELVLIVKQFLHARRLNNVHTGG 341

Query: 1452 LSSYCLMLLITRFLQHEH 1469
            L  + ++ L+  FL H H
Sbjct: 342  LGGFSIICLVFSFL-HMH 358


>gi|256271045|gb|EEU06149.1| Pap2p [Saccharomyces cerevisiae JAY291]
          Length = 584

 Score = 49.3 bits (116), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 23/78 (29%), Positives = 45/78 (57%), Gaps = 3/78 (3%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            ++  + + +D+SF+  +  G++   L++E  +  P    L L++KQFL  R L+  ++GG
Sbjct: 284  VEPHSGIHIDVSFERTN--GIEAAKLIREWLDDTPGLRELVLIVKQFLHARRLNNVHTGG 341

Query: 1452 LSSYCLMLLITRFLQHEH 1469
            L  + ++ L+  FL H H
Sbjct: 342  LGGFSIICLVFSFL-HMH 358


>gi|50286703|ref|XP_445781.1| hypothetical protein [Candida glabrata CBS 138]
 gi|49525087|emb|CAG58700.1| unnamed protein product [Candida glabrata]
          Length = 485

 Score = 49.3 bits (116), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 24/75 (32%), Positives = 42/75 (56%), Gaps = 2/75 (2%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            ++  +   +DISF+  +  GL+   L+++     P    L LV+KQFL  R L+  +SGG
Sbjct: 208  VEVESRTHMDISFERLN--GLEAAKLIRDWLASTPGLRELVLVVKQFLHSRRLNNVHSGG 265

Query: 1452 LSSYCLMLLITRFLQ 1466
            L  + ++ L+  FL+
Sbjct: 266  LGGFSIICLVYSFLR 280


>gi|410730487|ref|XP_003671423.2| hypothetical protein NDAI_0G04030 [Naumovozyma dairenensis CBS 421]
 gi|401780241|emb|CCD26180.2| hypothetical protein NDAI_0G04030 [Naumovozyma dairenensis CBS 421]
          Length = 582

 Score = 49.3 bits (116), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 23/75 (30%), Positives = 42/75 (56%), Gaps = 2/75 (2%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            I   + + +D+SF+  +  GL+   L++E     P    + L++KQFL  R L+  ++GG
Sbjct: 274  IDPRSKIHIDVSFERIN--GLEAAKLIREWLNDTPGLREITLIVKQFLHARRLNNVHTGG 331

Query: 1452 LSSYCLMLLITRFLQ 1466
            L  + ++ L+  FLQ
Sbjct: 332  LGGFSIICLVFSFLQ 346


>gi|151945519|gb|EDN63760.1| DNA polymerase sigma [Saccharomyces cerevisiae YJM789]
          Length = 584

 Score = 49.3 bits (116), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 23/78 (29%), Positives = 45/78 (57%), Gaps = 3/78 (3%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            ++  + + +D+SF+  +  G++   L++E  +  P    L L++KQFL  R L+  ++GG
Sbjct: 284  VEPHSGIHIDVSFERTN--GIEAAKLIREWLDDTPGLRELVLIVKQFLHARRLNNVHTGG 341

Query: 1452 LSSYCLMLLITRFLQHEH 1469
            L  + ++ L+  FL H H
Sbjct: 342  LGGFSIICLVFSFL-HMH 358


>gi|409052310|gb|EKM61786.1| hypothetical protein PHACADRAFT_190976 [Phanerochaete carnosa
            HHB-10118-sp]
          Length = 434

 Score = 49.3 bits (116), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 30/109 (27%), Positives = 49/109 (44%), Gaps = 8/109 (7%)

Query: 1366 DNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRL--------DISFKSPSHTGLQTTDL 1417
            +  +H ++VA  + A+P  +  +      A    +        D+         L     
Sbjct: 68   EQRLHDEIVAFMNYATPNATERTVRERLIARLTEIIQRRYWCSDVKVFGSMAQDLSLLSG 127

Query: 1418 VKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            V+E  +  PA  PL +V+K FLA + L  + +GGL SY L L++  FLQ
Sbjct: 128  VQEYLDTMPALRPLIMVIKMFLAAKGLHSAGTGGLGSYALTLMVISFLQ 176


>gi|154272285|ref|XP_001536995.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
 gi|150408982|gb|EDN04438.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
          Length = 673

 Score = 49.3 bits (116), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 26/77 (33%), Positives = 43/77 (55%), Gaps = 2/77 (2%)

Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
            T +++D+SF +   TG+      +    QFPA   +  V+KQFL  R L++  +GGL  +
Sbjct: 363  TGLKVDLSFDN--STGITANKTFQVWKSQFPAMPVIVSVVKQFLLLRGLNEVATGGLGGF 420

Query: 1456 CLMLLITRFLQHEHHLG 1472
             ++ L+T  LQH  + G
Sbjct: 421  SIICLVTSLLQHLPYSG 437


>gi|50294195|ref|XP_449509.1| hypothetical protein [Candida glabrata CBS 138]
 gi|49528823|emb|CAG62485.1| unnamed protein product [Candida glabrata]
          Length = 626

 Score = 49.3 bits (116), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 23/75 (30%), Positives = 43/75 (57%), Gaps = 2/75 (2%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            ++  + + +D+SF+  +  GL+   L++E     P    L LV+KQFL  R L+  ++GG
Sbjct: 292  VEPESDIHIDVSFERSN--GLEAAKLIREWIGDTPGLRELTLVVKQFLHARRLNDVHTGG 349

Query: 1452 LSSYCLMLLITRFLQ 1466
            L  + ++ L+  FL+
Sbjct: 350  LGGFSIICLVFSFLR 364


>gi|225681443|gb|EEH19727.1| polymerase sigma [Paracoccidioides brasiliensis Pb03]
          Length = 684

 Score = 49.3 bits (116), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 26/77 (33%), Positives = 43/77 (55%), Gaps = 2/77 (2%)

Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
            T +++D+SF +   TGL      +    QFPA   +  ++KQFL  R L++  +GGL  +
Sbjct: 375  TGLKVDLSFDN--STGLAANKTFQVWKSQFPAMPVIVSIIKQFLLLRGLNEVPTGGLGGF 432

Query: 1456 CLMLLITRFLQHEHHLG 1472
             ++ L+T  LQH  + G
Sbjct: 433  SIICLVTSLLQHLPYSG 449


>gi|189204442|ref|XP_001938556.1| PAP-associated domain containing protein 5 [Pyrenophora
            tritici-repentis Pt-1C-BFP]
 gi|187985655|gb|EDU51143.1| PAP-associated domain containing protein 5 [Pyrenophora
            tritici-repentis Pt-1C-BFP]
          Length = 599

 Score = 49.3 bits (116), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 24/73 (32%), Positives = 43/73 (58%), Gaps = 2/73 (2%)

Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
            T +++DISF++ S  G+Q      E   ++P    +  ++KQFL  R L++ ++GG+  Y
Sbjct: 379  TKLQVDISFENLS--GVQAQATFVEWKAKYPDMIYMVALMKQFLVMRGLNEVHTGGIGGY 436

Query: 1456 CLMLLITRFLQHE 1468
             ++ LI  +LQ E
Sbjct: 437  SIICLIVSYLQLE 449


>gi|323303298|gb|EGA57094.1| Trf5p [Saccharomyces cerevisiae FostersB]
          Length = 419

 Score = 49.3 bits (116), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 23/74 (31%), Positives = 41/74 (55%), Gaps = 2/74 (2%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            I+  + + +D+SF+     GL+   L++E     P    L L++KQFL  R L+  ++GG
Sbjct: 247  IEPQSQLHIDVSFERT--XGLEAAKLIREWLRDSPGLRELVLIIKQFLHSRRLNNVHTGG 304

Query: 1452 LSSYCLMLLITRFL 1465
            L  + ++ L+  FL
Sbjct: 305  LGGFTVICLVYSFL 318



 Score = 44.3 bits (103), Expect = 0.48,   Method: Compositional matrix adjust.
 Identities = 31/101 (30%), Positives = 51/101 (50%), Gaps = 17/101 (16%)

Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
            N  + ++ R+++ LW  +  ++FGS AT L LP SD+D   C+   RN +        E 
Sbjct: 164  NRTIDKLRRAVKELWSDADLHVFGSFATDLYLPGSDID---CVVNSRNRDK-------ED 213

Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLV 1336
            RN I E      AR+L N+    +  ++ +  T +PII  +
Sbjct: 214  RNYIYE-----LARHLKNKGL--AIRMEVIVKTRVPIIKFI 247


>gi|367030285|ref|XP_003664426.1| hypothetical protein MYCTH_2307240 [Myceliophthora thermophila ATCC
            42464]
 gi|347011696|gb|AEO59181.1| hypothetical protein MYCTH_2307240 [Myceliophthora thermophila ATCC
            42464]
          Length = 772

 Score = 49.3 bits (116), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 26/93 (27%), Positives = 47/93 (50%), Gaps = 13/93 (13%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            I+  T +++D+SF++  +TG+      K   EQ+P    L  ++K FL  R L++  +GG
Sbjct: 517  IEHKTGLKVDVSFEN--NTGVNAIKTFKAWREQYPGMPALVTLIKHFLLMRGLNEPVNGG 574

Query: 1452 LSSYCLMLLITRFLQH-----------EHHLGR 1473
            +  + ++ L+   LQ             HHLG+
Sbjct: 575  IGGFSVICLVVSMLQMMPEVQSGNLDTRHHLGQ 607


>gi|308803585|ref|XP_003079105.1| nucleotidyltransferase family protein (ISS) [Ostreococcus tauri]
 gi|116057560|emb|CAL53763.1| nucleotidyltransferase family protein (ISS) [Ostreococcus tauri]
          Length = 1238

 Score = 49.3 bits (116), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 26/69 (37%), Positives = 33/69 (47%)

Query: 1398 VRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCL 1457
            + + IS K  SH GL     V+   EQ  A  PL  V K  LA + L   Y GGL SY L
Sbjct: 1019 LEVHISLKDGSHKGLSALRFVQAAEEQHGALAPLVCVQKAVLASKGLRGVYKGGLGSYAL 1078

Query: 1458 MLLITRFLQ 1466
             ++    +Q
Sbjct: 1079 TMMALTSIQ 1087


>gi|124513398|ref|XP_001350055.1| nucleotidyltransferase, putative [Plasmodium falciparum 3D7]
 gi|23615472|emb|CAD52463.1| nucleotidyltransferase, putative [Plasmodium falciparum 3D7]
          Length = 858

 Score = 49.3 bits (116), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 25/75 (33%), Positives = 41/75 (54%), Gaps = 2/75 (2%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            I     V++DI     S    +TTD +    +++    PL L+LK FL  R+L+++Y GG
Sbjct: 530  IDKELGVQIDICINQKS--SKETTDFIITQMKKYIYLRPLVLLLKFFLHSRNLNETYIGG 587

Query: 1452 LSSYCLMLLITRFLQ 1466
            + S+ L  ++  FLQ
Sbjct: 588  IGSFLLSCMVLHFLQ 602


>gi|82596357|ref|XP_726229.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
 gi|23481547|gb|EAA17794.1| topoisomerase-related function protein [Plasmodium yoelii yoelii]
          Length = 600

 Score = 48.9 bits (115), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 19/54 (35%), Positives = 37/54 (68%)

Query: 1413 QTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            ++TD VK+  +++    PL +++K FL  R+L+++Y+GG+ S+ L  ++  FLQ
Sbjct: 400  ESTDFVKKKIKEYIYLRPLVILIKFFLNSRNLNETYTGGIGSFMLCCMVLHFLQ 453


>gi|146077494|ref|XP_001463285.1| topoisomerase-related function protein-like protein [Leishmania
            infantum JPCM5]
 gi|134067369|emb|CAM65641.1| topoisomerase-related function protein-like protein [Leishmania
            infantum JPCM5]
          Length = 917

 Score = 48.9 bits (115), Expect = 0.020,   Method: Compositional matrix adjust.
 Identities = 23/76 (30%), Positives = 40/76 (52%), Gaps = 1/76 (1%)

Query: 1400 LDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLML 1459
            +D+     +  G + ++ + +    +P + PL LV+K F+  R + + Y GGL SY   L
Sbjct: 518  IDVDISVGAVDGKRNSECIVQYMNMYPEALPLTLVVKYFVMQRGMHEPYYGGLGSYAATL 577

Query: 1460 LITRFL-QHEHHLGRP 1474
            L+  FL QH  +  +P
Sbjct: 578  LVVAFLRQHPIYTTQP 593


>gi|157864534|ref|XP_001680977.1| topoisomerase-related function protein-like protein [Leishmania major
            strain Friedlin]
 gi|68124270|emb|CAJ07032.1| topoisomerase-related function protein-like protein [Leishmania major
            strain Friedlin]
          Length = 916

 Score = 48.9 bits (115), Expect = 0.020,   Method: Compositional matrix adjust.
 Identities = 23/76 (30%), Positives = 40/76 (52%), Gaps = 1/76 (1%)

Query: 1400 LDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLML 1459
            +D+     +  G + ++ + +    +P + PL LV+K F+  R + + Y GGL SY   L
Sbjct: 515  IDVDISVGAVDGKRNSECIVQYMNMYPEALPLTLVVKYFVMQRGMHEPYYGGLGSYAATL 574

Query: 1460 LITRFL-QHEHHLGRP 1474
            L+  FL QH  +  +P
Sbjct: 575  LVVAFLRQHPIYTTQP 590


>gi|336466097|gb|EGO54262.1| hypothetical protein NEUTE1DRAFT_124553 [Neurospora tetrasperma FGSC
            2508]
 gi|350287057|gb|EGZ68304.1| hypothetical protein NEUTE2DRAFT_118045 [Neurospora tetrasperma FGSC
            2509]
          Length = 708

 Score = 48.9 bits (115), Expect = 0.020,   Method: Compositional matrix adjust.
 Identities = 23/75 (30%), Positives = 41/75 (54%), Gaps = 2/75 (2%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            + + T +++D+SF++   TGLQ         +QFP    L  ++K FL  R L++  +GG
Sbjct: 489  VDSRTGLKIDVSFEN--DTGLQAIKTFHAWRDQFPVMPVLVTLIKHFLCMRGLNEPVNGG 546

Query: 1452 LSSYCLMLLITRFLQ 1466
            +  + +  L+T  LQ
Sbjct: 547  IGGFTVTCLVTSMLQ 561


>gi|398010636|ref|XP_003858515.1| topoisomerase-related function protein-like protein [Leishmania
            donovani]
 gi|322496723|emb|CBZ31792.1| topoisomerase-related function protein-like protein [Leishmania
            donovani]
          Length = 916

 Score = 48.9 bits (115), Expect = 0.020,   Method: Compositional matrix adjust.
 Identities = 23/76 (30%), Positives = 40/76 (52%), Gaps = 1/76 (1%)

Query: 1400 LDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLML 1459
            +D+     +  G + ++ + +    +P + PL LV+K F+  R + + Y GGL SY   L
Sbjct: 517  IDVDISVGAVDGKRNSECIVQYMNMYPEALPLTLVVKYFVMQRGMHEPYYGGLGSYAATL 576

Query: 1460 LITRFL-QHEHHLGRP 1474
            L+  FL QH  +  +P
Sbjct: 577  LVVAFLRQHPIYTTQP 592


>gi|401415443|ref|XP_003872217.1| topoisomerase-related function protein-like protein [Leishmania
            mexicana MHOM/GT/2001/U1103]
 gi|322488440|emb|CBZ23686.1| topoisomerase-related function protein-like protein [Leishmania
            mexicana MHOM/GT/2001/U1103]
          Length = 912

 Score = 48.9 bits (115), Expect = 0.020,   Method: Compositional matrix adjust.
 Identities = 23/76 (30%), Positives = 40/76 (52%), Gaps = 1/76 (1%)

Query: 1400 LDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLML 1459
            +D+     +  G + ++ + +    +P + PL LV+K F+  R + + Y GGL SY   L
Sbjct: 512  IDVDISVGAVDGKRNSECIVQYMNMYPEALPLTLVVKYFVTQRGMHEPYYGGLGSYAATL 571

Query: 1460 LITRFL-QHEHHLGRP 1474
            L+  FL QH  +  +P
Sbjct: 572  LVVAFLRQHPIYTTQP 587


>gi|238482039|ref|XP_002372258.1| topoisomerase family protein TRF4, putative [Aspergillus flavus
            NRRL3357]
 gi|220700308|gb|EED56646.1| topoisomerase family protein TRF4, putative [Aspergillus flavus
            NRRL3357]
          Length = 608

 Score = 48.9 bits (115), Expect = 0.020,   Method: Compositional matrix adjust.
 Identities = 26/79 (32%), Positives = 43/79 (54%), Gaps = 2/79 (2%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            +   T +R+D+SF + S  GL   +  ++   ++PA   +  V+KQFL  R L++   GG
Sbjct: 299  VDKMTGLRVDLSFDNDS--GLVANNTFQQWKSEYPAMPVIVSVIKQFLLLRGLNEVPCGG 356

Query: 1452 LSSYCLMLLITRFLQHEHH 1470
            L  + +  L+T  LQH  H
Sbjct: 357  LGGFSITCLVTSLLQHLPH 375


>gi|46127561|ref|XP_388334.1| hypothetical protein FG08158.1 [Gibberella zeae PH-1]
          Length = 708

 Score = 48.9 bits (115), Expect = 0.020,   Method: Compositional matrix adjust.
 Identities = 27/92 (29%), Positives = 46/92 (50%), Gaps = 13/92 (14%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            +   T +++D+SF++    G+   D   +  EQ+PA   L  V+K FL  R L++  +GG
Sbjct: 494  VDKQTGLKVDVSFEN--LGGVNAIDTFLQWKEQYPAMPILVTVIKHFLLMRGLNEPVNGG 551

Query: 1452 LSSYCLMLLITRFLQ-----------HEHHLG 1472
            +  + ++ L+   LQ            EHHLG
Sbjct: 552  IGGFSVICLVVSMLQLMPQVQSRSLVPEHHLG 583


>gi|255081724|ref|XP_002508084.1| predicted protein [Micromonas sp. RCC299]
 gi|226523360|gb|ACO69342.1| predicted protein [Micromonas sp. RCC299]
          Length = 1070

 Score = 48.9 bits (115), Expect = 0.020,   Method: Compositional matrix adjust.
 Identities = 22/68 (32%), Positives = 41/68 (60%), Gaps = 1/68 (1%)

Query: 1209 MHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLP 1268
            ++  LH ++ SF      +++  +  +    K+V+ ++  L+P +R  +FGS ATGL+L 
Sbjct: 627  LYGTLHSQVSSFASTAGPDDSDLQDRL-VVQKKVSEAITALYPEARLEVFGSGATGLALK 685

Query: 1269 SSDVDLVV 1276
             +D+DLVV
Sbjct: 686  CADIDLVV 693


>gi|317139015|ref|XP_001817214.2| topoisomerase family protein TRF4 [Aspergillus oryzae RIB40]
          Length = 659

 Score = 48.9 bits (115), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 26/79 (32%), Positives = 43/79 (54%), Gaps = 2/79 (2%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            +   T +R+D+SF + S  GL   +  ++   ++PA   +  V+KQFL  R L++   GG
Sbjct: 350  VDKMTGLRVDLSFDNDS--GLVANNTFQQWKSEYPAMPVIVSVIKQFLLLRGLNEVPCGG 407

Query: 1452 LSSYCLMLLITRFLQHEHH 1470
            L  + +  L+T  LQH  H
Sbjct: 408  LGGFSITCLVTSLLQHLPH 426


>gi|408389494|gb|EKJ68941.1| hypothetical protein FPSE_10866 [Fusarium pseudograminearum CS3096]
          Length = 708

 Score = 48.9 bits (115), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 27/92 (29%), Positives = 46/92 (50%), Gaps = 13/92 (14%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            +   T +++D+SF++    G+   D   +  EQ+PA   L  V+K FL  R L++  +GG
Sbjct: 494  VDKQTGLKVDVSFENLG--GVNAIDTFLQWKEQYPAMPILVTVIKHFLLMRGLNEPVNGG 551

Query: 1452 LSSYCLMLLITRFLQ-----------HEHHLG 1472
            +  + ++ L+   LQ            EHHLG
Sbjct: 552  IGGFSVICLVVSMLQLMPQVQSRSLVPEHHLG 583


>gi|85097984|ref|XP_960550.1| hypothetical protein NCU05588 [Neurospora crassa OR74A]
 gi|28922043|gb|EAA31314.1| predicted protein [Neurospora crassa OR74A]
          Length = 715

 Score = 48.9 bits (115), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 23/75 (30%), Positives = 41/75 (54%), Gaps = 2/75 (2%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            + + T +++D+SF++   TGLQ         +QFP    L  ++K FL  R L++  +GG
Sbjct: 496  VDSRTGLKIDVSFEN--DTGLQAIKTFHAWRDQFPVMPVLVTLIKHFLCMRGLNEPVNGG 553

Query: 1452 LSSYCLMLLITRFLQ 1466
            +  + +  L+T  LQ
Sbjct: 554  IGGFTVTCLVTSMLQ 568


>gi|145511526|ref|XP_001441685.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124408946|emb|CAK74288.1| unnamed protein product [Paramecium tetraurelia]
          Length = 622

 Score = 48.9 bits (115), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 64/293 (21%), Positives = 120/293 (40%), Gaps = 55/293 (18%)

Query: 1193 PLEVQNCPTRKASLSLMHSLLHE--EIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLW 1250
            P   Q  P ++  L+L  S  H+  E+ S   Q + +N  RK         + R   +++
Sbjct: 282  PHSTQTTPIKRKKLNLTKSKTHDIDELSSAILQKSYQNRIRKQIC------LNRLNYLIY 335

Query: 1251 PRSRTNIF--GSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG-RNGIKETCLQHA 1307
                 N++  GS  TGL L  SDVD+ V     +NL   +    L+   N +K+T     
Sbjct: 336  TNYHLNMYPYGSFETGLDLEISDVDVGVW--GSQNLSYAQIVSFLQLLNNTLKQTPFLIK 393

Query: 1308 ARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDN 1367
            ++ +  Q               +PI+ L +               +PK          D 
Sbjct: 394  SKLIQGQ---------------MPILKLEL---------------NPKSGF----YDEDQ 419

Query: 1368 HVHSD--MVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQF 1425
            H+  +     LD +   +        I  +++V     + +P H G  +T +VK+   +F
Sbjct: 420  HIQQNWSYFHLDHNDQGRIIQVDITWIYQSSNV-----YNNP-HLGFASTTIVKDWIHRF 473

Query: 1426 PASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQHEHHLGRPINQV 1478
                 + L+LKQ +  ++L+ +++GG+SS+CL +++      +H+      Q+
Sbjct: 474  VWYRDIMLILKQLVKSKNLNDAHTGGISSFCLSIMLAAIYMCKHYTQNDKKQI 526


>gi|358385776|gb|EHK23372.1| hypothetical protein TRIVIDRAFT_114164, partial [Trichoderma virens
            Gv29-8]
          Length = 674

 Score = 48.9 bits (115), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 27/92 (29%), Positives = 46/92 (50%), Gaps = 13/92 (14%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            +  AT +R+D+SF++    G+   +      +++PA   L  V+K FL  R L++  +GG
Sbjct: 465  VDKATGLRVDVSFEN--LGGVTAIETFARWKKEYPAMPILVTVIKHFLLMRGLNEPVNGG 522

Query: 1452 LSSYCLMLLITRFLQ-----------HEHHLG 1472
            L  + ++ L+   LQ            EHHLG
Sbjct: 523  LGGFTIICLVVSMLQLMPQVQSRNLIPEHHLG 554


>gi|145528001|ref|XP_001449800.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124417389|emb|CAK82403.1| unnamed protein product [Paramecium tetraurelia]
          Length = 1063

 Score = 48.9 bits (115), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 59/268 (22%), Positives = 117/268 (43%), Gaps = 39/268 (14%)

Query: 1209 MHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLP 1268
            +  +  +EID F +  AA + +        ++RV   ++  +    T IFGS+ TGL+L 
Sbjct: 713  VKQIYQKEIDQFNRYKAARDIS--------IQRVQHVIKSYFKNCDTQIFGSSTTGLALK 764

Query: 1269 SSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENT 1328
             SDVD+VV       L+   +  + E    + E             +W  +   K +   
Sbjct: 765  DSDVDMVVY-----GLQVFTKQQLFEPMRRLIEI--------FTELKW--AVQCKHIFQA 809

Query: 1329 AIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTS 1388
            ++P+I ++V+   D +        S K +  +  ++  N      + L+        H  
Sbjct: 810  SVPLIKVLVDPSIDFL--------SFKGEPKYIIMQCRN------LDLNLKYGDPSQHIF 855

Query: 1389 SD-NIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
             D   + A S  +   +    + G Q+T    ++  +    + +A+ LK+ L  + L+ S
Sbjct: 856  IDITFELAPSYAIYNPYIQAFNIGFQSTQFTIDICGKIQGFSEVAIYLKKLLKIKDLNDS 915

Query: 1448 YSGGLSSYCLMLLITRFLQHEHHLGRPI 1475
            Y+GG+SS+CL +++    Q +H +G+ +
Sbjct: 916  YTGGISSFCLTIMLAAIGQ-DHSIGQKL 942


>gi|336276454|ref|XP_003352980.1| hypothetical protein SMAC_03298 [Sordaria macrospora k-hell]
 gi|380092465|emb|CCC09742.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 781

 Score = 48.5 bits (114), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 23/75 (30%), Positives = 41/75 (54%), Gaps = 2/75 (2%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            + + T +++D+SF++   TGLQ         +QFP    L  ++K FL  R L++  +GG
Sbjct: 562  VDSRTGLKIDVSFEN--DTGLQAIKTFHAWRDQFPVMPVLVTLIKHFLCMRGLNEPVNGG 619

Query: 1452 LSSYCLMLLITRFLQ 1466
            +  + +  L+T  LQ
Sbjct: 620  IGGFTVTCLVTSMLQ 634


>gi|414881049|tpg|DAA58180.1| TPA: hypothetical protein ZEAMMB73_639297, partial [Zea mays]
          Length = 260

 Score = 48.5 bits (114), Expect = 0.026,   Method: Composition-based stats.
 Identities = 24/64 (37%), Positives = 41/64 (64%), Gaps = 1/64 (1%)

Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
            LH+EI  FC  ++  +T  +     AV+ V+  ++ +WP+ +  +FGS  TGL LP+SD+
Sbjct: 131  LHKEILDFCDFISP-STEEQSSRAAAVQDVSDVVKHIWPQCKVEVFGSFRTGLYLPTSDI 189

Query: 1273 DLVV 1276
            D+V+
Sbjct: 190  DVVI 193


>gi|325095205|gb|EGC48515.1| topoisomerase TRF4 [Ajellomyces capsulatus H88]
          Length = 443

 Score = 48.5 bits (114), Expect = 0.026,   Method: Compositional matrix adjust.
 Identities = 25/72 (34%), Positives = 41/72 (56%), Gaps = 2/72 (2%)

Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
            T +++D+SF +   TG+      +    QFPA   +  V+KQFL  R L++  +GGL  +
Sbjct: 133  TGLKVDLSFDNS--TGIAANKTFQVWKSQFPAMPVIVSVVKQFLLLRGLNEVATGGLGGF 190

Query: 1456 CLMLLITRFLQH 1467
             ++ L+T  LQH
Sbjct: 191  SIICLVTSLLQH 202


>gi|229595481|ref|XP_001017236.2| Nucleotidyltransferase domain containing protein [Tetrahymena
            thermophila]
 gi|225565997|gb|EAR96991.2| Nucleotidyltransferase domain containing protein [Tetrahymena
            thermophila SB210]
          Length = 1151

 Score = 48.5 bits (114), Expect = 0.027,   Method: Compositional matrix adjust.
 Identities = 24/64 (37%), Positives = 38/64 (59%)

Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
            L+EEI  F ++V   N   +P  +  +K+V   ++  +P  + N +GS AT L LP SD+
Sbjct: 832  LNEEIREFEREVNEYNKRMEPTFDELIKKVEDIIKQQYPNIKLNKYGSYATKLCLPWSDI 891

Query: 1273 DLVV 1276
            D+VV
Sbjct: 892  DIVV 895


>gi|209879195|ref|XP_002141038.1| PAP/25A associated domain-containing protein [Cryptosporidium muris
            RN66]
 gi|209556644|gb|EEA06689.1| PAP/25A associated domain-containing protein [Cryptosporidium muris
            RN66]
          Length = 485

 Score = 48.5 bits (114), Expect = 0.027,   Method: Compositional matrix adjust.
 Identities = 26/76 (34%), Positives = 40/76 (52%), Gaps = 2/76 (2%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            + + + + +DIS    S   + TT  + E  ++F    PL + +K FL  R LD++Y GG
Sbjct: 236  VDSESCISVDISLNQES--CIDTTQHICECLKRFDVLRPLIITIKLFLHQRGLDETYLGG 293

Query: 1452 LSSYCLMLLITRFLQH 1467
            L SY    LI  F+Q 
Sbjct: 294  LGSYAQFCLILSFIQQ 309


>gi|156843407|ref|XP_001644771.1| hypothetical protein Kpol_1020p21 [Vanderwaltozyma polyspora DSM
            70294]
 gi|156115421|gb|EDO16913.1| hypothetical protein Kpol_1020p21 [Vanderwaltozyma polyspora DSM
            70294]
          Length = 640

 Score = 48.5 bits (114), Expect = 0.027,   Method: Compositional matrix adjust.
 Identities = 23/75 (30%), Positives = 44/75 (58%), Gaps = 2/75 (2%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            ++  + + +DISF+  +  GL+   +++E  +  P    L L++KQFLA R L+  ++GG
Sbjct: 255  VEPNSKLHIDISFERLN--GLEVAKIIREWLDDTPGLRELVLIVKQFLAARRLNNVHTGG 312

Query: 1452 LSSYCLMLLITRFLQ 1466
            L    ++ L+  FL+
Sbjct: 313  LGGLSIICLVYSFLK 327



 Score = 45.4 bits (106), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 32/101 (31%), Positives = 44/101 (43%), Gaps = 17/101 (16%)

Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
            N  + ++  SL+ LW  S  ++FGS AT L LPSSD+D VV            E G  E 
Sbjct: 172  NVTISKIRNSLKELWSDSDLHVFGSYATDLYLPSSDIDCVVN----------SENGDKEN 221

Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLV 1336
            RN +           LA Q       ++ +    +PII  V
Sbjct: 222  RNDLYSLATHFKRNGLAIQ-------VEVIAKARVPIIKFV 255


>gi|323452526|gb|EGB08400.1| hypothetical protein AURANDRAFT_64151 [Aureococcus anophagefferens]
          Length = 1245

 Score = 48.5 bits (114), Expect = 0.031,   Method: Compositional matrix adjust.
 Identities = 25/56 (44%), Positives = 37/56 (66%)

Query: 1400 LDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
            +DISF++P H GL ++    E   ++P   PL LVLK+ L  R L++ ++GGLSSY
Sbjct: 972  VDISFEAPDHGGLASSRYALEAMSRWPELAPLMLVLKELLVQRGLNEPFTGGLSSY 1027


>gi|440492821|gb|ELQ75356.1| DNA polymerase sigma [Trachipleistophora hominis]
          Length = 402

 Score = 48.5 bits (114), Expect = 0.031,   Method: Compositional matrix adjust.
 Identities = 24/54 (44%), Positives = 34/54 (62%)

Query: 1412 LQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFL 1465
            L  T  VK++  + PA   +AL LKQFL  R L+++  GGL+SY  +L+I  FL
Sbjct: 219  LAHTKFVKDILHKQPAIKDMALFLKQFLKCRGLNENRRGGLNSYAQLLMIISFL 272


>gi|70945383|ref|XP_742517.1| hypothetical protein [Plasmodium chabaudi chabaudi]
 gi|56521545|emb|CAH76038.1| conserved hypothetical protein [Plasmodium chabaudi chabaudi]
          Length = 411

 Score = 48.1 bits (113), Expect = 0.034,   Method: Composition-based stats.
 Identities = 24/71 (33%), Positives = 41/71 (57%), Gaps = 2/71 (2%)

Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
            + V +DI     S    +TT+ VK+  +++    PL +++K FL  R L+ +YSGG+ S+
Sbjct: 114  SGVNVDICINQESSR--ETTEFVKKKIKEYIYLRPLVIIIKFFLNSRGLNDTYSGGIGSF 171

Query: 1456 CLMLLITRFLQ 1466
             L  ++  FLQ
Sbjct: 172  MLCCMVLHFLQ 182


>gi|302405651|ref|XP_003000662.1| Poly(A) RNA polymerase cid14 [Verticillium albo-atrum VaMs.102]
 gi|261360619|gb|EEY23047.1| Poly(A) RNA polymerase cid14 [Verticillium albo-atrum VaMs.102]
          Length = 723

 Score = 48.1 bits (113), Expect = 0.034,   Method: Compositional matrix adjust.
 Identities = 29/81 (35%), Positives = 41/81 (50%), Gaps = 7/81 (8%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            I A T +R+DISF      G        +  EQ+PA   L  ++K FLA R L++  +GG
Sbjct: 522  IDAVTGLRVDISFDR--MDGPAAIKTFLDWKEQYPALPILVTIIKHFLAMRGLNEPVNGG 579

Query: 1452 LSSYCLMLLITRFLQHEHHLG 1472
            + S+    L+      EHHLG
Sbjct: 580  IGSFSSKNLVP-----EHHLG 595


>gi|320583303|gb|EFW97518.1| Catalytic subunit of TRAMP (Trf4/Pap2p-Mtr4p-Air1p/2p) [Ogataea
            parapolymorpha DL-1]
          Length = 538

 Score = 48.1 bits (113), Expect = 0.035,   Method: Compositional matrix adjust.
 Identities = 29/125 (23%), Positives = 61/125 (48%), Gaps = 2/125 (1%)

Query: 1342 DLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLD 1401
            D+     S  ++ K D   +  +  +++ +  + ++  A  K        +  AT + +D
Sbjct: 169  DIDMVVVSKARNNKYDNRSSLYQLSSYIRNHRLGVNVEAIAKAKVPIIKFVDPATKIHID 228

Query: 1402 ISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLI 1461
            ISF+  +  G++  +L+    +  P    L L++KQFL+ R L+  ++GGL  +  + L+
Sbjct: 229  ISFERTN--GIKAAELIISWLKDTPGLRELVLIVKQFLSVRKLNIVHTGGLGGFATICLV 286

Query: 1462 TRFLQ 1466
              FL+
Sbjct: 287  RSFLK 291


>gi|154332398|ref|XP_001562573.1| topoisomerase-related function protein-like protein [Leishmania
            braziliensis MHOM/BR/75/M2904]
 gi|134059463|emb|CAM41689.1| topoisomerase-related function protein-like protein [Leishmania
            braziliensis MHOM/BR/75/M2904]
 gi|449310632|gb|AGE92543.1| topoisomerase-related protein [Leishmania braziliensis]
          Length = 892

 Score = 48.1 bits (113), Expect = 0.036,   Method: Compositional matrix adjust.
 Identities = 23/76 (30%), Positives = 40/76 (52%), Gaps = 1/76 (1%)

Query: 1400 LDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLML 1459
            +D+     +  G + ++ + +    +P + PL LV+K F+  R + + Y GGL SY   L
Sbjct: 500  IDVDISVGAVDGKRNSECIVQYMNAYPEALPLILVVKYFVMQRGMHEPYHGGLGSYATTL 559

Query: 1460 LITRFL-QHEHHLGRP 1474
            L+  FL QH  +  +P
Sbjct: 560  LVVAFLRQHPIYTTQP 575


>gi|301105803|ref|XP_002901985.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262099323|gb|EEY57375.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 444

 Score = 48.1 bits (113), Expect = 0.037,   Method: Compositional matrix adjust.
 Identities = 36/125 (28%), Positives = 64/125 (51%), Gaps = 8/125 (6%)

Query: 1215 EEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDL 1274
            +E+  +C          + +I+ A+  +T  ++ +WP +    FGS +TGL LPSSDVD+
Sbjct: 290  QEMTRYCTYTQRVVDEARQHIDAAIAHITEGVRKIWPHASVTCFGSYSTGLWLPSSDVDV 349

Query: 1275 VVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQE-WVKSDSLKTVENTAIPII 1333
            VV    + N E  ++       +G+KE  L+  AR L  Q+ WV+   +  V    +P+ 
Sbjct: 350  VVM--GLSNSED-RDVPTRFLPDGVKE--LEQLARQLRPQKSWVQ--RVDVVRGAKVPVA 402

Query: 1334 MLVVE 1338
             +++ 
Sbjct: 403  KVILR 407


>gi|449310610|gb|AGE92532.1| topoisomerase-related protein [Leishmania braziliensis]
          Length = 893

 Score = 48.1 bits (113), Expect = 0.037,   Method: Compositional matrix adjust.
 Identities = 23/76 (30%), Positives = 40/76 (52%), Gaps = 1/76 (1%)

Query: 1400 LDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLML 1459
            +D+     +  G + ++ + +    +P + PL LV+K F+  R + + Y GGL SY   L
Sbjct: 501  IDVDISVGAVDGKRNSECIVQYMNAYPEALPLILVVKYFVMQRGMHEPYHGGLGSYATTL 560

Query: 1460 LITRFL-QHEHHLGRP 1474
            L+  FL QH  +  +P
Sbjct: 561  LVVAFLRQHPIYTTQP 576


>gi|83765069|dbj|BAE55212.1| unnamed protein product [Aspergillus oryzae RIB40]
 gi|391870459|gb|EIT79642.1| DNA polymerase sigma [Aspergillus oryzae 3.042]
          Length = 390

 Score = 48.1 bits (113), Expect = 0.037,   Method: Compositional matrix adjust.
 Identities = 26/79 (32%), Positives = 43/79 (54%), Gaps = 2/79 (2%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            +   T +R+D+SF + S  GL   +  ++   ++PA   +  V+KQFL  R L++   GG
Sbjct: 177  VDKMTGLRVDLSFDNDS--GLVANNTFQQWKSEYPAMPVIVSVIKQFLLLRGLNEVPCGG 234

Query: 1452 LSSYCLMLLITRFLQHEHH 1470
            L  + +  L+T  LQH  H
Sbjct: 235  LGGFSITCLVTSLLQHLPH 253


>gi|121711275|ref|XP_001273253.1| topoisomerase family protein TRF4, putative [Aspergillus clavatus
            NRRL 1]
 gi|119401404|gb|EAW11827.1| topoisomerase family protein TRF4, putative [Aspergillus clavatus
            NRRL 1]
          Length = 704

 Score = 48.1 bits (113), Expect = 0.040,   Method: Compositional matrix adjust.
 Identities = 26/79 (32%), Positives = 43/79 (54%), Gaps = 2/79 (2%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            +   T +R+D+SF + S  GL   +  +    ++PA   +  V+KQFL  R L++  +GG
Sbjct: 377  VDKLTGLRVDLSFDNDS--GLIANNTFQSWKSEYPAMPVILSVVKQFLLLRGLNEVPTGG 434

Query: 1452 LSSYCLMLLITRFLQHEHH 1470
            L  + +  L+T  LQH  H
Sbjct: 435  LGGFSITCLVTSLLQHLPH 453


>gi|451844986|gb|EMD58301.1| hypothetical protein COCSADRAFT_165704 [Cochliobolus sativus ND90Pr]
          Length = 642

 Score = 48.1 bits (113), Expect = 0.042,   Method: Compositional matrix adjust.
 Identities = 21/72 (29%), Positives = 43/72 (59%), Gaps = 2/72 (2%)

Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
            T +++DISF++ S  G+Q      +  +++P    +  +LKQFL    L++ ++GG+  +
Sbjct: 447  TQLQVDISFENLS--GVQAQATFAQWKQEYPDMIYMVALLKQFLVMHGLNEVHTGGIGGF 504

Query: 1456 CLMLLITRFLQH 1467
             ++ LI  ++QH
Sbjct: 505  SIICLIVSYIQH 516


>gi|432948655|ref|XP_004084114.1| PREDICTED: DNA polymerase sigma-like, partial [Oryzias latipes]
          Length = 383

 Score = 47.8 bits (112), Expect = 0.045,   Method: Compositional matrix adjust.
 Identities = 20/44 (45%), Positives = 31/44 (70%)

Query: 1423 EQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            +++P    L  VLKQFL  R L++ ++GG+SSY L+L++  FLQ
Sbjct: 24   QKYPVLPYLIFVLKQFLLQRDLNEVFTGGISSYSLILMVISFLQ 67


>gi|302691928|ref|XP_003035643.1| hypothetical protein SCHCODRAFT_104957 [Schizophyllum commune H4-8]
 gi|300109339|gb|EFJ00741.1| hypothetical protein SCHCODRAFT_104957, partial [Schizophyllum
            commune H4-8]
          Length = 671

 Score = 47.8 bits (112), Expect = 0.051,   Method: Compositional matrix adjust.
 Identities = 27/78 (34%), Positives = 42/78 (53%), Gaps = 3/78 (3%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTE--QFP-ASTPLALVLKQFLADRSLDQSY 1448
            I AA   R  I       +GL ++D++       Q P A   L L+LK +L+ R +++ Y
Sbjct: 265  ITAAEYGRFQIDISVNQFSGLVSSDIINGFQRGMQCPIAIRSLVLILKLYLSQRGMNEVY 324

Query: 1449 SGGLSSYCLMLLITRFLQ 1466
            +GGL SY ++ L+  FLQ
Sbjct: 325  TGGLGSYSIVCLVLSFLQ 342


>gi|145535359|ref|XP_001453418.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124421129|emb|CAK86021.1| unnamed protein product [Paramecium tetraurelia]
          Length = 634

 Score = 47.8 bits (112), Expect = 0.055,   Method: Compositional matrix adjust.
 Identities = 61/299 (20%), Positives = 126/299 (42%), Gaps = 58/299 (19%)

Query: 1193 PLEVQNCPTRKASLSLMHSLLHE--EIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLW 1250
            P   Q  P ++  L+L+ S  ++  E+ ++ +Q + +N  RK         + R   +++
Sbjct: 285  PHSTQTTPIKRKQLNLVKSKTYDIDELSNYIQQKSYQNRIRKQIC------LNRLNYLIY 338

Query: 1251 PRSRTNIF--GSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG-RNGIKETCLQHA 1307
                 N++  GS  TGL L  SDVD  V +   +NL   +    L+   N +K+T     
Sbjct: 339  TNYHLNMYPYGSFETGLDLEISDVD--VGIWGSQNLSYSQIVSFLQLLNNTLKQTPFLIK 396

Query: 1308 ARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDN 1367
            ++ + +Q               +PI+ L +               +PK     +    D 
Sbjct: 397  SKLIQSQ---------------MPILKLEL---------------NPKS----SFYDEDQ 422

Query: 1368 HVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDIS--------FKSPSHTGLQTTDLVK 1419
            H+  +        + +    SS  +     +++DI+        + +P H G  +T +VK
Sbjct: 423  HIQQNWSYFHLDQNDQGIIQSSQFL--GKIIQVDITWIYQWSNVYNNP-HLGFASTTIVK 479

Query: 1420 ELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQHEHHLGRPINQV 1478
            +   +F     + L+LK  +  ++L+ +++GG+SS+CL +++      +H+      Q+
Sbjct: 480  DWVSRFVWYRDIMLILKHLVKSKNLNDAHTGGISSFCLSIMLAAIYMCKHYTQNDKKQI 538


>gi|452839453|gb|EME41392.1| hypothetical protein DOTSEDRAFT_46399 [Dothistroma septosporum NZE10]
          Length = 754

 Score = 47.8 bits (112), Expect = 0.056,   Method: Compositional matrix adjust.
 Identities = 20/75 (26%), Positives = 43/75 (57%), Gaps = 2/75 (2%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            +   T +++D+SF++ S  GL      ++  + +P    L +++KQ LA R L++ ++GG
Sbjct: 442  VDGHTGIKVDLSFENDS--GLTANQTFQQWKKDYPEMPVLVMIIKQMLAMRGLNEVHTGG 499

Query: 1452 LSSYCLMLLITRFLQ 1466
            +  + ++ L+   LQ
Sbjct: 500  IGGFTIICLVVSMLQ 514


>gi|196008079|ref|XP_002113905.1| hypothetical protein TRIADDRAFT_57804 [Trichoplax adhaerens]
 gi|190582924|gb|EDV22995.1| hypothetical protein TRIADDRAFT_57804 [Trichoplax adhaerens]
          Length = 1063

 Score = 47.4 bits (111), Expect = 0.056,   Method: Compositional matrix adjust.
 Identities = 64/279 (22%), Positives = 106/279 (37%), Gaps = 69/279 (24%)

Query: 1198 NCPTRKASLS---LMHSLLHEEIDSFC---KQVAAENTARKPYINWAVKRVTRSLQVLWP 1251
            NCP   ++L+    M+      ID FC   K   A +   K +    + RVT+++Q ++P
Sbjct: 426  NCPESISTLNPLPEMNIDYLRAIDEFCYYVKDYYAMSAEEKQFREEILHRVTKAMQAIFP 485

Query: 1252 RSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYL 1311
             +  ++FGS+  G     SDVD+ + +P   +L  + E   L G         Q A R +
Sbjct: 486  EATLHLFGSSKNGFGTKQSDVDMCMMIPD-DSLNCLDEK--LRG---------QEAIRRI 533

Query: 1312 ANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHS 1371
            A Q   KS     V++ +   + +V                                   
Sbjct: 534  AKQLRKKSRDFAKVQDISRATVPIV----------------------------------- 558

Query: 1372 DMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVK---ELTEQFPAS 1428
                +    +P CS            +  DIS+++     +  T+L+     L ++ P  
Sbjct: 559  KFYDVRRYVNPTCS--------LNRKLSCDISYQNA--LAVHNTNLLASYGSLDDRIPI- 607

Query: 1429 TPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQH 1467
              L LVLK       +  +  G LSSY   L++  FLQH
Sbjct: 608  --LVLVLKLIAKACDIGDASRGSLSSYAHTLMMIYFLQH 644


>gi|346972692|gb|EGY16144.1| Poly(A) RNA polymerase cid14 [Verticillium dahliae VdLs.17]
          Length = 726

 Score = 47.4 bits (111), Expect = 0.057,   Method: Compositional matrix adjust.
 Identities = 29/81 (35%), Positives = 40/81 (49%), Gaps = 7/81 (8%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            I A T +R+DISF      G           EQ+PA   L  ++K FLA R L++  +GG
Sbjct: 525  IDAVTGLRVDISFDR--MDGPAAIKTFLNWKEQYPALPILVTIIKHFLAMRGLNEPVNGG 582

Query: 1452 LSSYCLMLLITRFLQHEHHLG 1472
            + S+    L+      EHHLG
Sbjct: 583  IGSFSSKNLVP-----EHHLG 598


>gi|307107081|gb|EFN55325.1| hypothetical protein CHLNCDRAFT_134313 [Chlorella variabilis]
          Length = 1656

 Score = 47.4 bits (111), Expect = 0.061,   Method: Compositional matrix adjust.
 Identities = 29/78 (37%), Positives = 42/78 (53%), Gaps = 13/78 (16%)

Query: 1211 SLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGS---------- 1260
            S LH E+ +F  + A         +  AV  V+ + + LWP+SRT +FGS          
Sbjct: 1118 SPLHHEVAAFAAR-ATPTPTEVAAVQQAVLAVSEAARSLWPQSRTALFGSQASSGGPRRL 1176

Query: 1261 --NATGLSLPSSDVDLVV 1276
              +ATGL+LP SD+D+VV
Sbjct: 1177 LASATGLALPGSDLDIVV 1194



 Score = 43.9 bits (102), Expect = 0.76,   Method: Compositional matrix adjust.
 Identities = 23/70 (32%), Positives = 35/70 (50%), Gaps = 2/70 (2%)

Query: 1401 DISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLL 1460
            D+S  +    G Q  D V+      P   PL L +K FL  R  ++ ++GGLSSY +  +
Sbjct: 1260 DVSLGA--ENGAQAVDFVRRQVLVVPPLRPLCLAVKAFLRQRGSNEVFTGGLSSYSVFNM 1317

Query: 1461 ITRFLQHEHH 1470
            +   LQ E +
Sbjct: 1318 VMAHLQLEGY 1327


>gi|344255571|gb|EGW11675.1| PAP-associated domain-containing protein 5 [Cricetulus griseus]
          Length = 382

 Score = 47.4 bits (111), Expect = 0.064,   Method: Compositional matrix adjust.
 Identities = 41/135 (30%), Positives = 60/135 (44%), Gaps = 33/135 (24%)

Query: 1238 AVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRN 1297
             V R+   ++ LWP +   IFGS  TGL LP+SD+DLVV      NL P+          
Sbjct: 5    VVSRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDLVV-FGKWENL-PL---------- 52

Query: 1298 GIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLV---VEVPHDLIASAASSVQS- 1353
                  L+ A R     +    DS+K ++   +PII L     EV  D+  +  + V++ 
Sbjct: 53   ----WTLEEALR---KHKVADEDSVKVLDKATVPIIKLTDSFTEVKVDISFNVQNGVRAA 105

Query: 1354 ----------PKEDA 1358
                      P+EDA
Sbjct: 106  DLIKDFTKLHPREDA 120


>gi|224128147|ref|XP_002329093.1| predicted protein [Populus trichocarpa]
 gi|222869762|gb|EEF06893.1| predicted protein [Populus trichocarpa]
          Length = 543

 Score = 47.4 bits (111), Expect = 0.064,   Method: Compositional matrix adjust.
 Identities = 25/67 (37%), Positives = 39/67 (58%), Gaps = 7/67 (10%)

Query: 1213 LHEEIDSFCKQVA---AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
            LH+EI  FC  ++    E  +R      AV+ V   ++ +WP  +  +FGS  TGL LP+
Sbjct: 125  LHKEIVDFCDFLSPTQEEQASRAE----AVRCVFDVIKYIWPNCKVEVFGSFRTGLYLPT 180

Query: 1270 SDVDLVV 1276
            SD+D+V+
Sbjct: 181  SDIDVVI 187



 Score = 47.0 bits (110), Expect = 0.084,   Method: Compositional matrix adjust.
 Identities = 24/64 (37%), Positives = 36/64 (56%), Gaps = 2/64 (3%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            ++  + V  DISF    + G    + +K    ++P   PL L+LK FL  R L++ YSGG
Sbjct: 229  VEKRSGVSFDISFDV--NGGPIAAEFIKNAISKWPELRPLCLILKVFLQQRELNEVYSGG 286

Query: 1452 LSSY 1455
            +SSY
Sbjct: 287  ISSY 290


>gi|302673443|ref|XP_003026408.1| hypothetical protein SCHCODRAFT_114488 [Schizophyllum commune H4-8]
 gi|300100090|gb|EFI91505.1| hypothetical protein SCHCODRAFT_114488, partial [Schizophyllum
            commune H4-8]
          Length = 466

 Score = 47.4 bits (111), Expect = 0.064,   Method: Compositional matrix adjust.
 Identities = 26/75 (34%), Positives = 36/75 (48%), Gaps = 1/75 (1%)

Query: 1400 LDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLML 1459
             D+     +  GL    ++++     PA  PL  VLK+FL  R L  +   GL  Y LM 
Sbjct: 212  FDVDIGVNNVDGLNGVPIMQKYLRTMPALRPLLFVLKRFLGQRDLGNAAKSGLGGYGLMC 271

Query: 1460 LITRFLQHEHHLGRP 1474
            LI  FL++    GRP
Sbjct: 272  LIVAFLKNNPQ-GRP 285


>gi|453083459|gb|EMF11505.1| PAP/OAS1 substrate-binding domain-containing protein [Mycosphaerella
            populorum SO2202]
          Length = 632

 Score = 47.4 bits (111), Expect = 0.065,   Method: Compositional matrix adjust.
 Identities = 23/73 (31%), Positives = 41/73 (56%), Gaps = 2/73 (2%)

Query: 1393 KAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGL 1452
            K    ++ DI+F +P   GL  T +++  ++  P   P+ L +K +   R ++ SYSG L
Sbjct: 238  KDGIGIQCDINFSNP--LGLHNTQMLRCYSKCDPRVRPIVLFMKSWAKQRKINSSYSGTL 295

Query: 1453 SSYCLMLLITRFL 1465
            SSY  +L++  +L
Sbjct: 296  SSYGYVLMVLHYL 308


>gi|342873493|gb|EGU75659.1| hypothetical protein FOXB_13808 [Fusarium oxysporum Fo5176]
          Length = 340

 Score = 47.4 bits (111), Expect = 0.065,   Method: Compositional matrix adjust.
 Identities = 27/92 (29%), Positives = 45/92 (48%), Gaps = 13/92 (14%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            +   T +++D+SF++    G+   D   E   Q+PA   L  V+K FL  R L++  +GG
Sbjct: 124  VDKQTGLKVDVSFENLG--GVNAVDTFLEWKAQYPAMPILVTVIKHFLLMRGLNEPVNGG 181

Query: 1452 LSSYCLMLLITRFLQH-----------EHHLG 1472
            +  + ++ L+   LQ            EHHLG
Sbjct: 182  IGGFTVICLVVSMLQLMPQVQSRNLIPEHHLG 213


>gi|237839809|ref|XP_002369202.1| hypothetical protein, conserved [Toxoplasma gondii ME49]
 gi|211966866|gb|EEB02062.1| hypothetical protein, conserved [Toxoplasma gondii ME49]
          Length = 1196

 Score = 47.4 bits (111), Expect = 0.066,   Method: Compositional matrix adjust.
 Identities = 70/288 (24%), Positives = 127/288 (44%), Gaps = 33/288 (11%)

Query: 1200 PTRKASLSLMHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFG 1259
            P+R+  L   H L HEE   F + + A    ++      + RVT   ++LWP      FG
Sbjct: 306  PSRE-KLDFFHRL-HEECVDFLQWIDATEEEQRNRAK-VLARVTAVSRLLWPNCVVCPFG 362

Query: 1260 SNATGLSLPSSDVDLVVCL---PPVRNLEPIKEAGILEGRNGIKETCLQHAA----RYLA 1312
            S+ T L+LP +D+D+ + +   PP+  LE  +E   ++    I E+     A    R+ A
Sbjct: 363  SSYTNLALPHADLDICIFINSDPPL--LEEFRELAAVDSAKEIAESAASAQAASYRRFGA 420

Query: 1313 NQEWVKSDSLKTVENTAIPIIML------VVEVPHDLIASAASSVQSPKEDAAHTTL--- 1363
              E V +++ ++ E                  +       +A +V  P +DA  ++    
Sbjct: 421  LAEKVMAEASRSTETFGYGSSFRRKKEEDACHLRRLAAVLSACTVSLPSQDAPSSSAPNR 480

Query: 1364 -----KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLV 1418
                 + D+ + +D+  + ++  P C       +   T++ +DIS   PS   + T+  +
Sbjct: 481  RESPRRRDSPLVTDLQLILEARVPICRF-----VDTETNLPVDISLDQPS--AVLTSLYI 533

Query: 1419 KELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            +    +FP   PL L+ K  L    L++ + GG+ SY L ++   FLQ
Sbjct: 534  RLQLLRFPLLRPLMLLNKTALKLWRLNEPFKGGVGSYLLFVMTLSFLQ 581


>gi|451992975|gb|EMD85451.1| hypothetical protein COCHEDRAFT_1148848 [Cochliobolus heterostrophus
            C5]
          Length = 624

 Score = 47.4 bits (111), Expect = 0.067,   Method: Compositional matrix adjust.
 Identities = 21/72 (29%), Positives = 42/72 (58%), Gaps = 2/72 (2%)

Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
            T +++DISF++ S  G+Q      +  + +P    +  +LKQFL    L++ ++GG+  +
Sbjct: 429  TQLQVDISFENLS--GVQAQATFAQWKQDYPDMIYMVALLKQFLVMHGLNEVHTGGIGGF 486

Query: 1456 CLMLLITRFLQH 1467
             ++ LI  ++QH
Sbjct: 487  SIICLIVSYIQH 498


>gi|145525609|ref|XP_001448621.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124416176|emb|CAK81224.1| unnamed protein product [Paramecium tetraurelia]
          Length = 364

 Score = 47.4 bits (111), Expect = 0.068,   Method: Composition-based stats.
 Identities = 62/266 (23%), Positives = 103/266 (38%), Gaps = 79/266 (29%)

Query: 1203 KASLSLMHSLL--HEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGS 1260
            K +L + H +L  H EI  F + ++  +   K  +  A  RV + LQ + P ++   FGS
Sbjct: 52   KKTLMIRHPVLRLHNEIAEFYEYISPSDQEHKRRVT-AYLRVEKYLQDIAPEAQIESFGS 110

Query: 1261 NATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSD 1320
              T + LP++D+D+V                ++E     K+   + AAR +      K +
Sbjct: 111  FKTRMYLPNADIDIV----------------MIETSCTQKQLFKKVAARMMKQTN--KFE 152

Query: 1321 SLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSA 1380
            ++  + N  +PII   VEV                          ++  H D+       
Sbjct: 153  NVNLIANAKVPIIKF-VEV--------------------------ESQYHFDL------- 178

Query: 1381 SPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLA 1440
                S    D +K                   Q  +L K   E +P    L + LK  L 
Sbjct: 179  ----SFNQLDGLK-------------------QIEELEKAF-ELYPELKFLLMTLKCVLR 214

Query: 1441 DRSLDQSYSGGLSSYCLMLLITRFLQ 1466
             R L+++YSGG+ S+ L  +I  FL+
Sbjct: 215  QRDLNETYSGGVGSFLLFQMILAFLR 240


>gi|390597612|gb|EIN07011.1| Nucleotidyltransferase [Punctularia strigosozonata HHB-11173 SS5]
          Length = 464

 Score = 47.0 bits (110), Expect = 0.074,   Method: Compositional matrix adjust.
 Identities = 23/74 (31%), Positives = 39/74 (52%), Gaps = 2/74 (2%)

Query: 1400 LDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLML 1459
            +DIS    +  G+    ++     + PA   L ++ K FL+ RS+++ Y+GGL SY ++ 
Sbjct: 255  VDISLNQTN--GVAAGKMINRYLRELPALRGLVMITKAFLSQRSMNEVYTGGLGSYSIVC 312

Query: 1460 LITRFLQHEHHLGR 1473
            L   FLQ    + R
Sbjct: 313  LAISFLQMHPKIRR 326


>gi|342320294|gb|EGU12235.1| Trf5 [Rhodotorula glutinis ATCC 204091]
          Length = 522

 Score = 47.0 bits (110), Expect = 0.075,   Method: Compositional matrix adjust.
 Identities = 34/144 (23%), Positives = 62/144 (43%), Gaps = 27/144 (18%)

Query: 1196 VQNCPTRKASLSLMHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRT 1255
            ++    RK  +    +LL EE+D+F       N   +  ++ A +     ++ LWP +  
Sbjct: 52   MEGLKRRKDDMRHPLALLDEELDAFVTWTRPNNAEHQLRLH-AFRCFQSLVKALWPFATC 110

Query: 1256 NIFGSNATGLSLPSSDVDLVV------CLPPVRNLEPIKEAGILEGRNGIKETCLQHAAR 1309
             +FGS ATGL LP  D D+V+        P V  L  +++A +  G              
Sbjct: 111  ELFGSMATGLYLPDGDFDVVIFDSRLMTTPSVSLLRTLRDAIVRSG-------------- 156

Query: 1310 YLANQEWVKSDSLKTVENTAIPII 1333
                  + ++  ++ VEN  +P++
Sbjct: 157  ------FAQASDIRLVENAKVPLV 174


>gi|221504776|gb|EEE30441.1| trf5, putative [Toxoplasma gondii VEG]
          Length = 1196

 Score = 47.0 bits (110), Expect = 0.079,   Method: Compositional matrix adjust.
 Identities = 70/288 (24%), Positives = 128/288 (44%), Gaps = 33/288 (11%)

Query: 1200 PTRKASLSLMHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFG 1259
            P+R+  L   H L HEE   F + + A    ++      + RVT   ++LWP      FG
Sbjct: 306  PSRE-KLDFFHRL-HEECVDFLQWIDATEEEQRNRAK-VLARVTAVSRLLWPNCVVCPFG 362

Query: 1260 SNATGLSLPSSDVDLVVCL---PPVRNLEPIKEAGILEGRNGIKETCLQHAA----RYLA 1312
            S+ T L+LP +D+D+ + +   PP+  LE  +E   ++    I E+     A    R+ A
Sbjct: 363  SSYTNLALPHADLDICIFINSDPPL--LEEFRELAAVDSAKEIAESAASAQAASYRRFGA 420

Query: 1313 NQEWVKSDSLKTVENTAIPIIML------VVEVPHDLIASAASSVQSPKEDAAHTTL--- 1363
              E V +++ ++ E                  +       +A +V  P +DA  ++    
Sbjct: 421  LAEKVMAEASRSTETFGYGSSFRRKKEEDACHLRRLAAVLSACTVSLPSQDAPSSSAPNR 480

Query: 1364 -----KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLV 1418
                 + D+ + +D+  + ++  P C    ++     T++ +DIS   PS   + T+  +
Sbjct: 481  RESPRRRDSPLVTDLQLILEARVPICRFVDTE-----TNLPVDISLDQPS--AVLTSLYI 533

Query: 1419 KELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            +    +FP   PL L+ K  L    L++ + GG+ SY L ++   FLQ
Sbjct: 534  RLQLLRFPLLRPLMLLNKTALKLWRLNEPFKGGVGSYLLFVMTLSFLQ 581


>gi|221484582|gb|EEE22876.1| conserved hypothetical protein [Toxoplasma gondii GT1]
          Length = 1196

 Score = 47.0 bits (110), Expect = 0.081,   Method: Compositional matrix adjust.
 Identities = 70/288 (24%), Positives = 128/288 (44%), Gaps = 33/288 (11%)

Query: 1200 PTRKASLSLMHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFG 1259
            P+R+  L   H L HEE   F + + A    ++      + RVT   ++LWP      FG
Sbjct: 306  PSRE-KLDFFHRL-HEECVDFLQWIDATEEEQRNRAK-VLARVTAVSRLLWPNCVVCPFG 362

Query: 1260 SNATGLSLPSSDVDLVVCL---PPVRNLEPIKEAGILEGRNGIKETCLQHAA----RYLA 1312
            S+ T L+LP +D+D+ + +   PP+  LE  +E   ++    I E+     A    R+ A
Sbjct: 363  SSYTNLALPHADLDICIFINSDPPL--LEEFRELAAVDSAKEIAESAASAQAASYRRFGA 420

Query: 1313 NQEWVKSDSLKTVENTAIPIIML------VVEVPHDLIASAASSVQSPKEDAAHTTL--- 1363
              E V +++ ++ E                  +       +A +V  P +DA  ++    
Sbjct: 421  LAEKVMAEASRSTETFGYGSSFRRKKEEDACHLRRLAAVLSACTVSLPSQDAPSSSAPNR 480

Query: 1364 -----KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLV 1418
                 + D+ + +D+  + ++  P C    ++     T++ +DIS   PS   + T+  +
Sbjct: 481  RESPRRRDSPLVTDLQLILEARVPICRFVDTE-----TNLPVDISLDQPS--AVLTSLYI 533

Query: 1419 KELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            +    +FP   PL L+ K  L    L++ + GG+ SY L ++   FLQ
Sbjct: 534  RLQLLRFPLLRPLMLLNKTALKLWRLNEPFKGGVGSYLLFVMTLSFLQ 581


>gi|145516679|ref|XP_001444228.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124411639|emb|CAK76831.1| unnamed protein product [Paramecium tetraurelia]
          Length = 637

 Score = 47.0 bits (110), Expect = 0.089,   Method: Compositional matrix adjust.
 Identities = 63/282 (22%), Positives = 113/282 (40%), Gaps = 49/282 (17%)

Query: 1193 PLEVQNCPTRKASLSLMHSLLHEEIDSFCKQVAAENTARKPYINWAVKRV--TRSLQVLW 1250
            P   Q  P ++  L+L  S   ++ID F       N   K + N   KR+   R   ++ 
Sbjct: 297  PYSSQTTPIKRKQLNLNKSKT-QDIDEF-----VNNIQVKSFQNKIRKRICFNRLHYIIS 350

Query: 1251 PRSRTNIF--GSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAA 1308
              S  NI+  GS  TGL L  SDVD+ +      +   I     L+  N      L    
Sbjct: 351  MNSDLNIYAYGSFETGLDLEVSDVDIGIWGTQTLSYNQI--VNFLQSINQ-----LLKQT 403

Query: 1309 RYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNH 1368
             +L N + ++S          +PI+ L +  P     +  + +Q   +            
Sbjct: 404  PFLVNSKLIQS---------HMPILKLELN-PKTAFYNDDAYIQQNWQS----------- 442

Query: 1369 VHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPAS 1428
                   LD   S K        I   +++     + +P H G  +T ++K+   +F   
Sbjct: 443  -----FHLDQQDSGKIIQVDLTWIYQWSNI-----YNNP-HLGFASTTIIKDWVNRFYWY 491

Query: 1429 TPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQHEHH 1470
              + L+LK  L  ++L+ +++GG+SS+CL +++      +H+
Sbjct: 492  RDIMLILKYLLKSKNLNDAHTGGISSFCLSIMLAAIYMCKHY 533


>gi|254573058|ref|XP_002493638.1| Catalytic subunit of TRAMP (Trf4/Pap2p-Mtr4p-Air1p/2p) [Komagataella
            pastoris GS115]
 gi|238033437|emb|CAY71459.1| Catalytic subunit of TRAMP (Trf4/Pap2p-Mtr4p-Air1p/2p) [Komagataella
            pastoris GS115]
 gi|328354535|emb|CCA40932.1| DNA polymerase sigma subunit [Komagataella pastoris CBS 7435]
          Length = 601

 Score = 46.6 bits (109), Expect = 0.097,   Method: Compositional matrix adjust.
 Identities = 36/96 (37%), Positives = 47/96 (48%), Gaps = 12/96 (12%)

Query: 1190 PLQPLEVQNCP---TRKASLSL-MHSLLHEEIDSFCKQVA---AENTARKPYINWAVKRV 1242
            PL    +Q  P    R  S  L +   L  EI  F   ++   AE  AR    N AVKR+
Sbjct: 108  PLDDKPIQQYPWIKNRDHSKQLELSDWLTLEIKDFINYISPSIAEIEAR----NNAVKRL 163

Query: 1243 TRSLQV-LWPRSRTNIFGSNATGLSLPSSDVDLVVC 1277
             + +   LWP    N+FGS AT L LP SD+D+V+ 
Sbjct: 164  RKEITTNLWPDCYVNVFGSFATDLYLPGSDIDMVIT 199



 Score = 44.7 bits (104), Expect = 0.38,   Method: Compositional matrix adjust.
 Identities = 21/74 (28%), Positives = 42/74 (56%), Gaps = 2/74 (2%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            I+  + + +D+SF+  +  GL+  + ++    + P    L L++KQFLA R ++  + GG
Sbjct: 241  IEPRSKIHIDVSFEKTN--GLRAAERIQGWLRETPGLRELVLIVKQFLAVRRMNNVHHGG 298

Query: 1452 LSSYCLMLLITRFL 1465
            L  + ++ L+  FL
Sbjct: 299  LGGFSIICLVHSFL 312


>gi|330917097|ref|XP_003297677.1| hypothetical protein PTT_08167 [Pyrenophora teres f. teres 0-1]
 gi|311329513|gb|EFQ94235.1| hypothetical protein PTT_08167 [Pyrenophora teres f. teres 0-1]
          Length = 673

 Score = 46.6 bits (109), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 22/71 (30%), Positives = 41/71 (57%), Gaps = 2/71 (2%)

Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
            T +++DISF++ S  G+Q      E   ++P    +  ++KQFL    L++ ++GG+  Y
Sbjct: 478  TKLQVDISFENLS--GVQAQATFAEWKAKYPDMIYMVALMKQFLVMHGLNEVHTGGIGGY 535

Query: 1456 CLMLLITRFLQ 1466
             ++ LI  +LQ
Sbjct: 536  SIICLIVSYLQ 546


>gi|425766989|gb|EKV05577.1| Topoisomerase family protein TRF4, putative [Penicillium digitatum
            Pd1]
          Length = 714

 Score = 46.6 bits (109), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 24/72 (33%), Positives = 41/72 (56%), Gaps = 2/72 (2%)

Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
            T +R+D+SF + S  GL   +  ++   ++P    +  V+KQFL  R L++  +GGL  +
Sbjct: 371  TGLRVDLSFDNDS--GLIANETFQKWKTEYPTMPVILAVIKQFLLIRGLNEVPTGGLGGF 428

Query: 1456 CLMLLITRFLQH 1467
             +  L+T  LQH
Sbjct: 429  SITCLVTSILQH 440


>gi|68074819|ref|XP_679326.1| hypothetical protein [Plasmodium berghei strain ANKA]
 gi|56500049|emb|CAH96511.1| conserved hypothetical protein [Plasmodium berghei]
          Length = 390

 Score = 46.6 bits (109), Expect = 0.11,   Method: Composition-based stats.
 Identities = 18/54 (33%), Positives = 37/54 (68%)

Query: 1413 QTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            ++TD +K+  +++    PL +++K FL  R+L+++Y+GG+ S+ L  ++  FLQ
Sbjct: 248  ESTDFIKKKIKEYIYLRPLVILIKFFLNSRNLNETYTGGIGSFMLCCMVLHFLQ 301


>gi|425780140|gb|EKV18158.1| Topoisomerase family protein TRF4, putative [Penicillium digitatum
            PHI26]
          Length = 714

 Score = 46.6 bits (109), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 24/72 (33%), Positives = 41/72 (56%), Gaps = 2/72 (2%)

Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
            T +R+D+SF + S  GL   +  ++   ++P    +  V+KQFL  R L++  +GGL  +
Sbjct: 371  TGLRVDLSFDNDS--GLIANETFQKWKTEYPTMPVILAVIKQFLLIRGLNEVPTGGLGGF 428

Query: 1456 CLMLLITRFLQH 1467
             +  L+T  LQH
Sbjct: 429  SITCLVTSILQH 440


>gi|367007982|ref|XP_003688720.1| hypothetical protein TPHA_0P01280 [Tetrapisispora phaffii CBS 4417]
 gi|357527030|emb|CCE66286.1| hypothetical protein TPHA_0P01280 [Tetrapisispora phaffii CBS 4417]
          Length = 504

 Score = 46.6 bits (109), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 20/74 (27%), Positives = 42/74 (56%), Gaps = 2/74 (2%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            ++  + + +D+SF+  +  GL   ++++      P+   L L++KQFL  R L+  ++GG
Sbjct: 231  VEPVSGIHIDVSFERDN--GLDAANIIRSWLTSTPSLRELVLIVKQFLNSRRLNDVHTGG 288

Query: 1452 LSSYCLMLLITRFL 1465
            L  + ++ L+  FL
Sbjct: 289  LGGFSIICLVYSFL 302


>gi|302831043|ref|XP_002947087.1| hypothetical protein VOLCADRAFT_87356 [Volvox carteri f. nagariensis]
 gi|300267494|gb|EFJ51677.1| hypothetical protein VOLCADRAFT_87356 [Volvox carteri f. nagariensis]
          Length = 609

 Score = 46.6 bits (109), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 25/75 (33%), Positives = 38/75 (50%), Gaps = 2/75 (2%)

Query: 1391 NIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSG 1450
             I  +  V +DIS    S  G Q  + +++    FPA  PL LVLK ++ +  L +   G
Sbjct: 353  KITTSAGVSVDISIAGLS--GPQAAEYIRQQVSSFPALRPLVLVLKSYMRELDLAEVAKG 410

Query: 1451 GLSSYCLMLLITRFL 1465
            G+SSY L  ++   L
Sbjct: 411  GISSYGLTYMVMAHL 425


>gi|296088678|emb|CBI38128.3| unnamed protein product [Vitis vinifera]
          Length = 178

 Score = 46.6 bits (109), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 25/61 (40%), Positives = 38/61 (62%), Gaps = 1/61 (1%)

Query: 1004 TTDVATEEEISGSFASLSGDVDSKALDTLPCPILRPIIIPNLSRERSRSDFKRSHEHKSP 1063
            + D  +E+++SGS A+L  +V+ K+ D LP  +L PIIIPN+S+E S  D K     + P
Sbjct: 2    SADALSEKKVSGSLANLPVNVEGKSGDPLPYSLL-PIIIPNMSKESSLKDVKGHGNDEDP 60

Query: 1064 C 1064
             
Sbjct: 61   I 61


>gi|396490001|ref|XP_003843230.1| hypothetical protein LEMA_P073400.1 [Leptosphaeria maculans JN3]
 gi|312219809|emb|CBX99751.1| hypothetical protein LEMA_P073400.1 [Leptosphaeria maculans JN3]
          Length = 717

 Score = 46.6 bits (109), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 23/83 (27%), Positives = 47/83 (56%), Gaps = 5/83 (6%)

Query: 1394 AATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLS 1453
            + T +++DISF++ S  G+Q      +  +++P    +  +LKQFL  R L++ ++GG+ 
Sbjct: 476  SVTRLQVDISFENLS--GMQAQATFAQWKDKYPDMIYMVALLKQFLVMRGLNEVHTGGIG 533

Query: 1454 SYCLMLLITRFLQHEHHLGRPIN 1476
             + ++ LI  ++   H  G+  N
Sbjct: 534  GFAIICLIVHYI---HQAGKAEN 553


>gi|440493090|gb|ELQ75598.1| DNA polymerase sigma [Trachipleistophora hominis]
          Length = 288

 Score = 46.2 bits (108), Expect = 0.15,   Method: Composition-based stats.
 Identities = 26/67 (38%), Positives = 37/67 (55%), Gaps = 2/67 (2%)

Query: 1399 RLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLM 1458
            R D+S    S  G+  T  +K+  +Q P    L+L LK FL  R L++S  GGL SY  +
Sbjct: 198  RYDLSLNQES--GIIHTRFIKKALKQQPYIKDLSLFLKYFLKSRGLNESKRGGLCSYAQL 255

Query: 1459 LLITRFL 1465
            L++  FL
Sbjct: 256  LMLINFL 262


>gi|395335008|gb|EJF67384.1| hypothetical protein DICSQDRAFT_77074 [Dichomitus squalens LYAD-421
            SS1]
          Length = 592

 Score = 46.2 bits (108), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 27/78 (34%), Positives = 44/78 (56%), Gaps = 3/78 (3%)

Query: 1397 SVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYC 1456
            S+++D+S  +    GL+   +++   ++ PA   L L LK  L+   L+ +  GGLSSY 
Sbjct: 220  SLKIDVSLNA--TDGLKAVPILRSYFDRMPALRHLVLCLKALLSRHGLNSASFGGLSSYA 277

Query: 1457 LMLLITRFLQHEHHLGRP 1474
            L+ L   FLQ  + +GRP
Sbjct: 278  LICLAISFLQL-NPMGRP 294


>gi|374107870|gb|AEY96777.1| FAEL207Wp [Ashbya gossypii FDAG1]
          Length = 626

 Score = 46.2 bits (108), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 54/231 (23%), Positives = 87/231 (37%), Gaps = 75/231 (32%)

Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
            N A+KR+  ++Q  WP +  + FGS AT L LP SD+D VV            ++G  + 
Sbjct: 217  NDALKRIRDAVQDFWPDANLHCFGSYATDLYLPGSDIDCVVN----------SKSGDKDN 266

Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
            +N +           LA Q       +  +    +PII  V                   
Sbjct: 267  KNALYSLASYLKRNGLATQ-------VSVIAKARVPIIKFV------------------- 300

Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
            E A+         +H D+           S   ++ ++AA  +R            L  T
Sbjct: 301  EPAS--------QIHIDL-----------SFERTNGVEAAKIIR----------GWLHDT 331

Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
              ++EL           L++KQFL  R L+  + GGL  + ++ L   FL+
Sbjct: 332  PGLRELV----------LIVKQFLHARRLNDVHIGGLGGFSIICLAYSFLK 372


>gi|146184040|ref|XP_001027646.2| Chitinase class I family protein [Tetrahymena thermophila]
 gi|146143378|gb|EAS07404.2| Chitinase class I family protein [Tetrahymena thermophila SB210]
          Length = 463

 Score = 46.2 bits (108), Expect = 0.16,   Method: Composition-based stats.
 Identities = 25/75 (33%), Positives = 40/75 (53%), Gaps = 2/75 (2%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            ++  T +  DISF      G++    VK+  E +P    L +V+K  L  R L ++YSGG
Sbjct: 264  VEKETQINFDISFNK--EDGVKQLSEVKKGLELYPEMKYLIMVMKCILRQRDLHETYSGG 321

Query: 1452 LSSYCLMLLITRFLQ 1466
            + S+ L  +I  FL+
Sbjct: 322  IGSFLLFCMILAFLR 336


>gi|195112905|ref|XP_002001012.1| GI22217 [Drosophila mojavensis]
 gi|193917606|gb|EDW16473.1| GI22217 [Drosophila mojavensis]
          Length = 397

 Score = 45.8 bits (107), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 24/64 (37%), Positives = 36/64 (56%), Gaps = 1/64 (1%)

Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
            LH+EI+ F   + +  T     +  AV R+ R +  +WP     +FGS  TGL+LP SD+
Sbjct: 25   LHQEIEHFYNYMVSTPTEYMMRME-AVHRIERVVLSIWPDCCIEVFGSFRTGLNLPISDI 83

Query: 1273 DLVV 1276
            D+ V
Sbjct: 84   DIAV 87



 Score = 44.7 bits (104), Expect = 0.45,   Method: Compositional matrix adjust.
 Identities = 25/78 (32%), Positives = 43/78 (55%), Gaps = 7/78 (8%)

Query: 1369 VHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPAS 1428
            V  D+  LD ++ P    T     +  + ++ DISF + S  G++  +LV++  EQFP  
Sbjct: 109  VTDDVNVLDKASVPVVKFT-----EQISRIKFDISFNTTS--GVKAAELVQQFIEQFPEL 161

Query: 1429 TPLALVLKQFLADRSLDQ 1446
              L +VLKQ+L  + L++
Sbjct: 162  PKLVIVLKQYLLLQGLNE 179


>gi|302917206|ref|XP_003052397.1| hypothetical protein NECHADRAFT_1558 [Nectria haematococca mpVI
            77-13-4]
 gi|256733337|gb|EEU46684.1| hypothetical protein NECHADRAFT_1558 [Nectria haematococca mpVI
            77-13-4]
          Length = 630

 Score = 45.8 bits (107), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 27/92 (29%), Positives = 45/92 (48%), Gaps = 13/92 (14%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            +   T +++DISF++    G+Q  D      E +PA   L  V+K FL  R L++  +GG
Sbjct: 427  VDKQTGLKVDISFEN--LGGVQAVDTFLRWKELYPAMPILVTVIKHFLLMRGLNEPVNGG 484

Query: 1452 LSSYCLMLLITRFLQ-----------HEHHLG 1472
            +  + ++ L+   LQ            EH+LG
Sbjct: 485  IGGFSVICLVVSMLQLNPQVQSRSLVPEHNLG 516


>gi|159474006|ref|XP_001695120.1| hypothetical protein CHLREDRAFT_206054 [Chlamydomonas reinhardtii]
 gi|158276054|gb|EDP01828.1| hypothetical protein CHLREDRAFT_206054 [Chlamydomonas reinhardtii]
          Length = 397

 Score = 45.8 bits (107), Expect = 0.17,   Method: Composition-based stats.
 Identities = 27/77 (35%), Positives = 40/77 (51%), Gaps = 2/77 (2%)

Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
            + V +D+S  S S  G +    +++    +PA  PL LVLK ++    L +  SGGLSSY
Sbjct: 193  SGVSVDVSIASDS--GPRAAAFIRQQAVAYPALRPLVLVLKSYMRAEGLAEVASGGLSSY 250

Query: 1456 CLMLLITRFLQHEHHLG 1472
             L  ++   LQ E   G
Sbjct: 251  GLTYMLLAHLQEEAKRG 267


>gi|452988962|gb|EME88717.1| hypothetical protein MYCFIDRAFT_87561 [Pseudocercospora fijiensis
            CIRAD86]
          Length = 848

 Score = 45.8 bits (107), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 24/76 (31%), Positives = 42/76 (55%), Gaps = 2/76 (2%)

Query: 1390 DNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYS 1449
            D  K    ++ DI+F +P   GL  T +++  ++  P   P+ L +K +   R ++ SYS
Sbjct: 446  DFPKDGVGIQCDINFFNP--LGLHNTQMLRCYSKCDPRVRPMVLFVKSWAKGRKINSSYS 503

Query: 1450 GGLSSYCLMLLITRFL 1465
            G LSSY  +L++  +L
Sbjct: 504  GTLSSYGYVLMVLHYL 519


>gi|45190400|ref|NP_984654.1| AEL207Wp [Ashbya gossypii ATCC 10895]
 gi|50401682|sp|Q9HFW3.1|TRF5_ASHGO RecName: Full=Poly(A) RNA polymerase protein 1; AltName:
            Full=Topoisomerase 1-related protein TRF5
 gi|10444115|gb|AAG17722.1|AF286114_3 Trf5 [Eremothecium gossypii]
 gi|44983296|gb|AAS52478.1| AEL207Wp [Ashbya gossypii ATCC 10895]
          Length = 626

 Score = 45.8 bits (107), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 54/231 (23%), Positives = 87/231 (37%), Gaps = 75/231 (32%)

Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
            N A+KR+  ++Q  WP +  + FGS AT L LP SD+D VV            ++G  + 
Sbjct: 217  NDALKRIRDAVQDFWPDANLHCFGSYATDLYLPGSDIDCVVN----------SKSGDKDN 266

Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
            +N +           LA Q       +  +    +PII  V                   
Sbjct: 267  KNALYSLASYLKRNGLATQ-------VSVIAKARVPIIKFV------------------- 300

Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
            E A+         +H D+           S   ++ ++AA  +R            L  T
Sbjct: 301  EPAS--------QIHIDL-----------SFERTNGVEAAKIIR----------GWLHDT 331

Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
              ++EL           L++KQFL  R L+  + GGL  + ++ L   FL+
Sbjct: 332  PGLRELV----------LIVKQFLHARRLNDVHIGGLGGFSIICLAYSFLK 372


>gi|380471218|emb|CCF47385.1| Poly(A) RNA polymerase cid14, partial [Colletotrichum higginsianum]
          Length = 457

 Score = 45.8 bits (107), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 25/65 (38%), Positives = 35/65 (53%), Gaps = 2/65 (3%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            I + T +R+DISF      G Q      E  EQFPA   L  ++K FLA R L++  +GG
Sbjct: 395  IDSITGLRVDISFDR--LDGPQAVKTFAEWKEQFPAMPILVTMIKHFLAMRGLNEPVNGG 452

Query: 1452 LSSYC 1456
            + S+ 
Sbjct: 453  IGSFT 457


>gi|355709232|gb|AES03523.1| polymerase sigma [Mustela putorius furo]
          Length = 417

 Score = 45.8 bits (107), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 20/36 (55%), Positives = 27/36 (75%)

Query: 1431 LALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            L LVLKQFL  R L++ ++GG+SSY L+L+   FLQ
Sbjct: 6    LILVLKQFLLQRDLNEVFTGGISSYSLILMAISFLQ 41


>gi|189211747|ref|XP_001942202.1| Poly(A) RNA polymerase cid13 [Pyrenophora tritici-repentis Pt-1C-BFP]
 gi|187979401|gb|EDU46027.1| Poly(A) RNA polymerase cid13 [Pyrenophora tritici-repentis Pt-1C-BFP]
          Length = 636

 Score = 45.8 bits (107), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 60/262 (22%), Positives = 104/262 (39%), Gaps = 58/262 (22%)

Query: 1258 FGSNATGLSLPSSDVDLVVCLPPVRN------------LEPIKEAGILEGRNG------- 1298
            FGS A+G  +P SD+DL V +P  R+            +  + E  +L+ + G       
Sbjct: 199  FGSLASGFGMPGSDMDLAV-VPTYRDPARANDFSIDRGIPRLLERAVLDAKLGGRLLTRT 257

Query: 1299 ---IKETCLQ-----HAARYLANQEW--------VKSDSL------KTVENTAIP----- 1331
               I + C       + A +   Q+W          SD+       K   N  +P     
Sbjct: 258  RVPILKICQSPTETLYTALFEERQKWDELPDEEKYPSDATRSFVQPKPAANHQVPPTPQV 317

Query: 1332 -IIMLVVEVPH-DLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSS 1389
                 V   P     A   +S+Q+PK +    T +  + V +D    D +       +  
Sbjct: 318  DAAKFVTNFPTLGTTAPTKNSLQAPKTEVKSPTTETSS-VTNDKAKKDQNQRSGKQWSRE 376

Query: 1390 ------DNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRS 1443
                  D  K    ++ DI+F++P   G+  T ++K  +   P   P+ L +K +   R 
Sbjct: 377  KVVGPLDFPKTGCGIQCDINFENP--LGIHNTHMLKCYSLTDPRVRPMVLFVKSWAKRRK 434

Query: 1444 LDQSYSGGLSSYCLMLLITRFL 1465
            ++ +YSG LSSY  +L++  +L
Sbjct: 435  VNSAYSGTLSSYGWVLMVLHYL 456


>gi|407398163|gb|EKF28017.1| DNA polymerase sigma, putative [Trypanosoma cruzi marinkellei]
          Length = 286

 Score = 45.8 bits (107), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 22/55 (40%), Positives = 31/55 (56%), Gaps = 1/55 (1%)

Query: 1421 LTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ-HEHHLGRP 1474
            +   FP + PL +V+K FL  R + + Y GGL S+   LL+  FLQ H  +  RP
Sbjct: 1    MLRSFPEAQPLIIVVKYFLQQRGMHEPYHGGLGSFATTLLVISFLQNHPIYTERP 55


>gi|325182222|emb|CCA16676.1| phospholipase putative [Albugo laibachii Nc14]
          Length = 1653

 Score = 45.4 bits (106), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 42/145 (28%), Positives = 67/145 (46%), Gaps = 17/145 (11%)

Query: 1196 VQNCPTRKASLSLMHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRT 1255
            V+  PT +A L+        E++    Q A +N AR      A + + R +Q LW  +  
Sbjct: 1279 VREEPTVEAPLT--------ELNQRILQFACQNEARN---EQARQEIERGIQSLWSDASV 1327

Query: 1256 NIFGSNATGLS--LPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLAN 1313
             +FGS AT LS  + ++DVDLVV LP    +     A ++       E+ +   + + A+
Sbjct: 1328 YLFGSAATSLSIDIETADVDLVVILPSQSTVFRKASAHLVIQLKDHLESLVSEKSLFCAS 1387

Query: 1314 QEWVKSDSLKTVENTAIPIIMLVVE 1338
                KS +   + N  IPI  L +E
Sbjct: 1388 ----KSVNFSLITNARIPITRLWIE 1408


>gi|452839338|gb|EME41277.1| hypothetical protein DOTSEDRAFT_73629 [Dothistroma septosporum NZE10]
          Length = 835

 Score = 45.4 bits (106), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 24/76 (31%), Positives = 41/76 (53%), Gaps = 2/76 (2%)

Query: 1390 DNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYS 1449
            D  K    ++ DI+F +P   GL  T L++  +   P   P+ L +K +   R ++ SYS
Sbjct: 448  DFPKDGVGIQCDINFFNP--LGLHNTQLLRCYSSCDPRVRPMVLFVKSWAKKRRINSSYS 505

Query: 1450 GGLSSYCLMLLITRFL 1465
            G LSSY  ++++  +L
Sbjct: 506  GTLSSYGYVMMVLHYL 521


>gi|428181724|gb|EKX50587.1| hypothetical protein GUITHDRAFT_135219 [Guillardia theta CCMP2712]
          Length = 186

 Score = 45.4 bits (106), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 22/48 (45%), Positives = 30/48 (62%), Gaps = 5/48 (10%)

Query: 1249 LWPRSRTNIFGSNATGLSLPSSDVDLVV-----CLPPVRNLEPIKEAG 1291
            +WP +   +FGS ATG+ LPSSDVD+VV     C P V+  + +K  G
Sbjct: 65   IWPEAELKVFGSYATGMFLPSSDVDMVVVNCSDCDPLVKLAKVLKSDG 112


>gi|330844415|ref|XP_003294122.1| hypothetical protein DICPUDRAFT_159072 [Dictyostelium purpureum]
 gi|325075475|gb|EGC29358.1| hypothetical protein DICPUDRAFT_159072 [Dictyostelium purpureum]
          Length = 825

 Score = 45.1 bits (105), Expect = 0.36,   Method: Compositional matrix adjust.
 Identities = 27/74 (36%), Positives = 45/74 (60%), Gaps = 2/74 (2%)

Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
            T   +DI F +PS  G++ T +VK   +Q+ +   L LVLK FL    ++++Y+GG+ SY
Sbjct: 417  TQYNIDICFDTPS--GIENTQVVKSFLKQYKSMRVLLLVLKFFLQQNHVNETYTGGIGSY 474

Query: 1456 CLMLLITRFLQHEH 1469
             L L++  ++Q  H
Sbjct: 475  ALALMVVSYIQLRH 488


>gi|213406299|ref|XP_002173921.1| Poly(A) RNA polymerase cid12 [Schizosaccharomyces japonicus yFS275]
 gi|212001968|gb|EEB07628.1| Poly(A) RNA polymerase cid12 [Schizosaccharomyces japonicus yFS275]
          Length = 357

 Score = 44.7 bits (104), Expect = 0.36,   Method: Compositional matrix adjust.
 Identities = 23/75 (30%), Positives = 41/75 (54%), Gaps = 2/75 (2%)

Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
            + + +DI+F + S   + + +L  E  +  P    L ++LK +L +R LD  Y GGL S 
Sbjct: 124  SGLSVDITFDNAS--AISSANLQNEYADNHPVFRKLFMLLKHWLFERRLDHVYIGGLGSS 181

Query: 1456 CLMLLITRFLQHEHH 1470
             L  +I  +L+ ++H
Sbjct: 182  ALSYMIIGWLEMQYH 196


>gi|380487956|emb|CCF37707.1| hypothetical protein CH063_01756 [Colletotrichum higginsianum]
          Length = 1037

 Score = 44.7 bits (104), Expect = 0.38,   Method: Compositional matrix adjust.
 Identities = 34/123 (27%), Positives = 53/123 (43%), Gaps = 8/123 (6%)

Query: 1349 SSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTS---SDNI---KAATSVRLDI 1402
            S V  P+ D      KH+      M  L D A    +       D +   K+   V+ DI
Sbjct: 648  SIVHDPQYDYVVPVTKHNIETIKRMRGLRDPAKMAVNQPRDPYKDKLEFPKSGVGVQCDI 707

Query: 1403 SFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLIT 1462
            +F +  H  L  T L++  +   P   PL L +K +   R ++  Y G LSSY  +L++ 
Sbjct: 708  NFAA--HLALHNTLLLRCYSHSDPRVRPLVLFIKHWAKVRGINTPYRGTLSSYGYVLMML 765

Query: 1463 RFL 1465
             +L
Sbjct: 766  HYL 768


>gi|340960188|gb|EGS21369.1| hypothetical protein CTHT_0032240 [Chaetomium thermophilum var.
            thermophilum DSM 1495]
          Length = 1085

 Score = 44.3 bits (103), Expect = 0.48,   Method: Compositional matrix adjust.
 Identities = 41/144 (28%), Positives = 69/144 (47%), Gaps = 13/144 (9%)

Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSA--SPKCS 1385
            T I I + +++ P  ++ S +S  Q+   DA     K  + + S++  L D A  +P   
Sbjct: 697  TKIRIYIELLKGP--MVPSPSSKTQT---DAVIPLPKDADTLISELRQLGDPAGMAPNQP 751

Query: 1386 HTSSDNI----KAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLAD 1441
                +N     K+   V+ DI+F +  H  LQ T L++  +   P   PL L +K +   
Sbjct: 752  RDRYNNPLEFPKSGVGVQCDINFSA--HLALQNTLLLRCYSHCDPRVRPLILFVKHWAKV 809

Query: 1442 RSLDQSYSGGLSSYCLMLLITRFL 1465
            R ++  Y G LSSY  +L++  FL
Sbjct: 810  RRINTPYRGTLSSYGYVLMMLHFL 833


>gi|389600163|ref|XP_001561739.2| conserved hypothetical protein [Leishmania braziliensis
            MHOM/BR/75/M2904]
 gi|322504209|emb|CAM41532.2| conserved hypothetical protein [Leishmania braziliensis
            MHOM/BR/75/M2904]
          Length = 3055

 Score = 44.3 bits (103), Expect = 0.49,   Method: Compositional matrix adjust.
 Identities = 24/74 (32%), Positives = 40/74 (54%), Gaps = 3/74 (4%)

Query: 1395 ATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSS 1454
            AT  ++DI+F    H   ++  L + L E FP + PL L LK  +    + +S  GG++S
Sbjct: 1681 ATGTKVDITFAGGEH--WRSMQLTRSLLEVFPHARPLILFLKYCVRSLGVGESEPGGVTS 1738

Query: 1455 YCLMLLITRFLQHE 1468
            + + L++  F  HE
Sbjct: 1739 FAIYLMVLHFY-HE 1751


>gi|340939540|gb|EGS20162.1| hypothetical protein CTHT_0046690 [Chaetomium thermophilum var.
            thermophilum DSM 1495]
          Length = 836

 Score = 44.3 bits (103), Expect = 0.56,   Method: Compositional matrix adjust.
 Identities = 25/92 (27%), Positives = 44/92 (47%), Gaps = 13/92 (14%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            ++  T +R+DISF++   +G+   +   +    +P    L  V+K FL  R L++  +GG
Sbjct: 582  VERKTGLRIDISFEN--RSGIVAINTFHKWRNLYPGMPALVSVIKHFLLMRGLNEPVNGG 639

Query: 1452 LSSYCLMLLITRFLQH-----------EHHLG 1472
            + S+ +  L+   LQ             HHLG
Sbjct: 640  IGSFSVTCLVVSMLQMMPEVQSGNMDTRHHLG 671


>gi|116201727|ref|XP_001226675.1| hypothetical protein CHGG_08748 [Chaetomium globosum CBS 148.51]
 gi|88177266|gb|EAQ84734.1| hypothetical protein CHGG_08748 [Chaetomium globosum CBS 148.51]
          Length = 780

 Score = 44.3 bits (103), Expect = 0.56,   Method: Compositional matrix adjust.
 Identities = 23/93 (24%), Positives = 47/93 (50%), Gaps = 13/93 (13%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            I+  T +++D+SF++  ++G+      K   +++P    L  ++K FL  R L++  +GG
Sbjct: 525  IEHKTGLKVDVSFEN--NSGVTAIATFKAWRDRYPGMPALVTLVKHFLLMRGLNEPVNGG 582

Query: 1452 LSSYCLMLLITRFLQH-----------EHHLGR 1473
            +  + ++ L+   LQ             HHLG+
Sbjct: 583  IGGFSVICLVVSMLQMMPEVQSGNLDTRHHLGQ 615


>gi|290988093|ref|XP_002676756.1| predicted protein [Naegleria gruberi]
 gi|284090360|gb|EFC44012.1| predicted protein [Naegleria gruberi]
          Length = 604

 Score = 44.3 bits (103), Expect = 0.59,   Method: Compositional matrix adjust.
 Identities = 21/53 (39%), Positives = 31/53 (58%)

Query: 1414 TTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            + + ++  +E++PA   L   LK FL  R+L+    GGLSSY L L+I   LQ
Sbjct: 375  SVEFIRRSSEKYPAFKYLCYFLKYFLKQRNLNMVLYGGLSSYGLSLMIVSHLQ 427


>gi|354544020|emb|CCE40742.1| hypothetical protein CPAR2_107770 [Candida parapsilosis]
          Length = 608

 Score = 44.3 bits (103), Expect = 0.60,   Method: Compositional matrix adjust.
 Identities = 35/124 (28%), Positives = 55/124 (44%), Gaps = 24/124 (19%)

Query: 1216 EIDSFCKQVA---AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
            E+  F + ++   AE   R   IN   K ++      WP +  ++FGS AT L LP SD+
Sbjct: 164  EMKDFVRYISPSKAEIITRNNVINTLKKEISS----FWPGTEAHVFGSCATDLYLPGSDI 219

Query: 1273 DLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPI 1332
            D+VV          I   G  E R     + L   + +L  +   K  +++ + N  +PI
Sbjct: 220  DMVV----------ISSTGDYENR-----SRLYQLSSFLRVKNLAK--NVEVIANAKVPI 262

Query: 1333 IMLV 1336
            I  V
Sbjct: 263  IKFV 266


>gi|396491063|ref|XP_003843481.1| hypothetical protein LEMA_P075910.1 [Leptosphaeria maculans JN3]
 gi|312220060|emb|CBY00002.1| hypothetical protein LEMA_P075910.1 [Leptosphaeria maculans JN3]
          Length = 729

 Score = 44.3 bits (103), Expect = 0.61,   Method: Compositional matrix adjust.
 Identities = 22/76 (28%), Positives = 42/76 (55%), Gaps = 2/76 (2%)

Query: 1390 DNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYS 1449
            D  K    ++ DI+F++P   G+  + +++  +   P   P+ L +K +   R ++ SYS
Sbjct: 447  DFPKTGCGIQCDINFENP--LGIHNSQMLRCYSLTDPRVRPIVLFVKSWAKRRKVNSSYS 504

Query: 1450 GGLSSYCLMLLITRFL 1465
            G LSSY  +L++  +L
Sbjct: 505  GTLSSYGWVLMVLHYL 520


>gi|330915205|ref|XP_003296937.1| hypothetical protein PTT_07185 [Pyrenophora teres f. teres 0-1]
 gi|311330654|gb|EFQ94964.1| hypothetical protein PTT_07185 [Pyrenophora teres f. teres 0-1]
          Length = 513

 Score = 43.9 bits (102), Expect = 0.66,   Method: Compositional matrix adjust.
 Identities = 23/76 (30%), Positives = 42/76 (55%), Gaps = 2/76 (2%)

Query: 1390 DNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYS 1449
            D  K    ++ DI+F++P   G+  T ++K  +   P   P+ L +K +   R ++ +YS
Sbjct: 246  DFPKTGCGIQCDINFENP--LGIHNTHMLKCYSLTDPRVRPMVLFVKSWAKRRKVNSAYS 303

Query: 1450 GGLSSYCLMLLITRFL 1465
            G LSSY  +L++  +L
Sbjct: 304  GTLSSYGWVLMVLHYL 319


>gi|378726515|gb|EHY52974.1| DNA polymerase sigma subunit [Exophiala dermatitidis NIH/UT8656]
          Length = 684

 Score = 43.9 bits (102), Expect = 0.72,   Method: Compositional matrix adjust.
 Identities = 20/72 (27%), Positives = 41/72 (56%), Gaps = 2/72 (2%)

Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
            + +++D+ F + S  G+   +  ++  +++PA   +  ++KQFL  R L+   +GGL  +
Sbjct: 356  SGLKVDLCFDNDS--GVAAVETFQKWKKEYPAMPIIVAIVKQFLMIRGLNDVATGGLGGF 413

Query: 1456 CLMLLITRFLQH 1467
              + L+T  LQH
Sbjct: 414  STICLVTSLLQH 425


>gi|145533334|ref|XP_001452417.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124420105|emb|CAK85020.1| unnamed protein product [Paramecium tetraurelia]
          Length = 361

 Score = 43.9 bits (102), Expect = 0.73,   Method: Composition-based stats.
 Identities = 23/71 (32%), Positives = 39/71 (54%), Gaps = 2/71 (2%)

Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
            +  + DISF      GL+  D +++    +P    L ++LK  L  R L+++YSGG+ S+
Sbjct: 171  SQYQFDISFNQ--MDGLKQIDEIRKAFTIYPEFKYLIMILKCMLKQRELNETYSGGIGSF 228

Query: 1456 CLMLLITRFLQ 1466
             L  +I  FL+
Sbjct: 229  LLFQMILAFLR 239


>gi|145475559|ref|XP_001423802.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124390863|emb|CAK56404.1| unnamed protein product [Paramecium tetraurelia]
          Length = 354

 Score = 43.9 bits (102), Expect = 0.73,   Method: Composition-based stats.
 Identities = 23/71 (32%), Positives = 40/71 (56%), Gaps = 2/71 (2%)

Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
            +S + DISF      G++  D +++    +P    L ++LK  L  R L+++YSGG+ S+
Sbjct: 164  SSYQFDISFNQ--MDGIRQIDEIQKAFTIYPEFKYLIMILKCILKQRDLNETYSGGIGSF 221

Query: 1456 CLMLLITRFLQ 1466
             L  +I  FL+
Sbjct: 222  LLFQMILAFLR 232


>gi|299473006|emb|CBN77407.1| nucleotidyltransferase family protein [Ectocarpus siliculosus]
          Length = 434

 Score = 43.9 bits (102), Expect = 0.76,   Method: Compositional matrix adjust.
 Identities = 34/100 (34%), Positives = 46/100 (46%), Gaps = 5/100 (5%)

Query: 1213 LHEEIDSFCKQV--AAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSS 1270
            LH EI   C  +    E  A+       +KRV    + L   +R  IFGS  TGL LPSS
Sbjct: 265  LHNEILDLCDLLMPTPEEKAKTAIAITYIKRVVE--ETLGSEARVEIFGSQLTGLVLPSS 322

Query: 1271 DVDLVVCLPPVRNLEPIKEAGILEGRNG-IKETCLQHAAR 1309
            D+D VV   P  +L  +  A       G ++E  +  +AR
Sbjct: 323  DIDSVVLGGPRGSLGSLGAAMYRRQNKGEVREVTVIKSAR 362


>gi|145526448|ref|XP_001449035.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124416601|emb|CAK81638.1| unnamed protein product [Paramecium tetraurelia]
          Length = 572

 Score = 43.9 bits (102), Expect = 0.77,   Method: Compositional matrix adjust.
 Identities = 18/62 (29%), Positives = 37/62 (59%)

Query: 1409 HTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQHE 1468
            H G  +T ++K+   +F       L+LK  L  ++L+ +++GG+SS+CL +++T     +
Sbjct: 462  HLGFASTTIIKDWVNRFYWYRETILILKFLLKSKNLNDAHTGGISSFCLSIMLTAIYMCK 521

Query: 1469 HH 1470
            H+
Sbjct: 522  HY 523


>gi|449688102|ref|XP_004211642.1| PREDICTED: terminal uridylyltransferase 4-like, partial [Hydra
            magnipapillata]
          Length = 196

 Score = 43.9 bits (102), Expect = 0.80,   Method: Compositional matrix adjust.
 Identities = 22/66 (33%), Positives = 37/66 (56%)

Query: 1413 QTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQHEHHLG 1472
            QT+DL+ + T        LA++++ +     LD + +GGLS YC +L++  +LQH     
Sbjct: 29   QTSDLIIKYTNVDERCAKLAMLIRMWSKACGLDNADNGGLSPYCFVLMVINYLQHTEPPV 88

Query: 1473 RPINQV 1478
             PI +V
Sbjct: 89   LPIIKV 94


>gi|342885079|gb|EGU85188.1| hypothetical protein FOXB_04303 [Fusarium oxysporum Fo5176]
          Length = 1055

 Score = 43.5 bits (101), Expect = 0.83,   Method: Compositional matrix adjust.
 Identities = 22/73 (30%), Positives = 37/73 (50%), Gaps = 2/73 (2%)

Query: 1393 KAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGL 1452
            K    V+ DI+F +  H  L  T L++  +   P   P+ L +K +   R ++  Y G L
Sbjct: 730  KTGAGVQCDINFSA--HLALHNTALLRCYSHTDPRVRPMVLFVKHWAKTRGINSGYRGTL 787

Query: 1453 SSYCLMLLITRFL 1465
            SSY  +L++  +L
Sbjct: 788  SSYGYVLMVLHYL 800


>gi|389642869|ref|XP_003719067.1| DNA polymerase sigma [Magnaporthe oryzae 70-15]
 gi|351641620|gb|EHA49483.1| DNA polymerase sigma [Magnaporthe oryzae 70-15]
 gi|440474598|gb|ELQ43333.1| DNA polymerase sigma [Magnaporthe oryzae Y34]
 gi|440486580|gb|ELQ66430.1| DNA polymerase sigma [Magnaporthe oryzae P131]
          Length = 703

 Score = 43.5 bits (101), Expect = 0.84,   Method: Compositional matrix adjust.
 Identities = 58/260 (22%), Positives = 94/260 (36%), Gaps = 79/260 (30%)

Query: 1213 LHEEIDSF---CKQVAAENTARKPYINWAVKRVTRSL---QVLWPRSRTNIFGSNATGLS 1266
            LH+EI  F    K    E   R+  I+   K +  S      ++P      FGS  + L 
Sbjct: 374  LHKEIVDFYNYAKPRDFEEKLRQGLIDELAKLIRNSQFRDATVYP------FGSFKSNLY 427

Query: 1267 LPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVE 1326
            LP+ D+DLV C               + GR          AARY +     K  +    +
Sbjct: 428  LPTGDMDLVFC-----------SDSYMSGR----------AARYSSKNHVFKFGAFIERK 466

Query: 1327 NTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSH 1386
              A+                                   DNHV      +  +  P   +
Sbjct: 467  QLAV-----------------------------------DNHVEK----ISKARVPLVKY 487

Query: 1387 TSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQ 1446
                 + + T +++D+SF++   TG++  +      EQFP    L   +K FLA R L++
Sbjct: 488  -----VDSRTGLKVDVSFEN--ITGIRAIETFLAWREQFPDMPVLVTCIKHFLAMRGLNE 540

Query: 1447 SYSGGLSSYCLMLLITRFLQ 1466
              +GG+    ++ L+   LQ
Sbjct: 541  PANGGIGGTTVICLVVSMLQ 560


>gi|398408115|ref|XP_003855523.1| hypothetical protein MYCGRDRAFT_21922, partial [Zymoseptoria tritici
            IPO323]
 gi|339475407|gb|EGP90499.1| hypothetical protein MYCGRDRAFT_21922 [Zymoseptoria tritici IPO323]
          Length = 486

 Score = 43.5 bits (101), Expect = 0.86,   Method: Compositional matrix adjust.
 Identities = 24/80 (30%), Positives = 43/80 (53%), Gaps = 5/80 (6%)

Query: 1395 ATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSS 1454
               ++ DI+F +P   GL  T +++  +   P   P+ L +K +   R ++ SYSG LSS
Sbjct: 237  GVGIQCDINFFNP--LGLHNTQMLRCYSMCDPRVRPMVLFVKSWAKQRKINSSYSGTLSS 294

Query: 1455 YCLMLLITRFLQHEHHLGRP 1474
            Y  +L++  +L    ++ RP
Sbjct: 295  YGYVLMVLHYLM---NVARP 311


>gi|358378134|gb|EHK15816.1| hypothetical protein TRIVIDRAFT_195758 [Trichoderma virens Gv29-8]
          Length = 1020

 Score = 43.5 bits (101), Expect = 0.87,   Method: Compositional matrix adjust.
 Identities = 23/73 (31%), Positives = 39/73 (53%), Gaps = 2/73 (2%)

Query: 1393 KAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGL 1452
            K+   V+ DI+F +  H  LQ T L++  +   P   P+ L +K +   R ++  Y G L
Sbjct: 714  KSDVGVQCDINFSA--HLALQNTLLLRCYSHTDPRVRPMILFVKHWAKMRGINSGYRGTL 771

Query: 1453 SSYCLMLLITRFL 1465
            SSY  +L++  +L
Sbjct: 772  SSYGYVLMVLHYL 784


>gi|344278690|ref|XP_003411126.1| PREDICTED: terminal uridylyltransferase 4 [Loxodonta africana]
          Length = 1643

 Score = 43.5 bits (101), Expect = 0.90,   Method: Compositional matrix adjust.
 Identities = 20/54 (37%), Positives = 29/54 (53%)

Query: 1414 TTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQH 1467
            TTDL+  L ++ P  TPL L  + +     +D    GG+ SYC  L++  FLQ 
Sbjct: 482  TTDLLAALGKREPVFTPLVLAFRYWAKLCHIDSQTDGGIPSYCFALMVMFFLQQ 535


>gi|345569402|gb|EGX52268.1| hypothetical protein AOL_s00043g57 [Arthrobotrys oligospora ATCC
            24927]
          Length = 747

 Score = 43.5 bits (101), Expect = 0.92,   Method: Compositional matrix adjust.
 Identities = 50/230 (21%), Positives = 89/230 (38%), Gaps = 53/230 (23%)

Query: 1239 VKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNG 1298
            ++R+T     L P SR   FGS  +G +   +D+D++                     + 
Sbjct: 281  LRRITEICNNLVPGSRIIPFGSLVSGFATKGADMDVIFA------------------HDS 322

Query: 1299 IKETCLQHAARY---LANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
            +      H +     LAN+   +   +  +  T +PI  L ++ P +   S       P 
Sbjct: 323  LHPQPFSHESNVPVRLANEFLKRGFEVDLLIRTRVPI--LKIKTPSNDPGS------RPG 374

Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
              +A   LK D         LD+   P+             ++  DI FK+  H G+  +
Sbjct: 375  SPSAQDALKED---------LDEEPWPE-------------NISCDIGFKA--HLGITNS 410

Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFL 1465
               +  ++       + L +KQ+  +R L+  Y G LSSY  +L++  FL
Sbjct: 411  YFFRTYSQCDSRFREMVLFVKQWSKNRDLNSPYFGTLSSYGYVLMVAHFL 460


>gi|123449289|ref|XP_001313365.1| PAP/25A associated domain containing protein [Trichomonas vaginalis
            G3]
 gi|121895246|gb|EAY00436.1| PAP/25A associated domain containing protein [Trichomonas vaginalis
            G3]
          Length = 346

 Score = 43.5 bits (101), Expect = 0.92,   Method: Compositional matrix adjust.
 Identities = 23/71 (32%), Positives = 35/71 (49%), Gaps = 2/71 (2%)

Query: 1397 SVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYC 1456
             + +DISF      G      ++E+    P   P  + LK  L    LDQ + GG+SSY 
Sbjct: 153  GISIDISFDE--LHGPLCVQTIREIFRTIPCILPAQIFLKAMLRRNKLDQPFLGGISSYT 210

Query: 1457 LMLLITRFLQH 1467
            L L+I  ++Q+
Sbjct: 211  LQLMILAYVQY 221


>gi|350586191|ref|XP_003128036.3| PREDICTED: terminal uridylyltransferase 4 [Sus scrofa]
          Length = 1376

 Score = 43.5 bits (101), Expect = 0.95,   Method: Compositional matrix adjust.
 Identities = 20/57 (35%), Positives = 28/57 (49%)

Query: 1414 TTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQHEHH 1470
            TTDL+  L +  P   PL L  + +     +D    GG+ SYC  L++  FLQ   H
Sbjct: 210  TTDLLAALGKMEPVFIPLVLAFRYWAKLCYIDSQTDGGIPSYCFALMVMFFLQQRKH 266


>gi|384488317|gb|EIE80497.1| hypothetical protein RO3G_05202 [Rhizopus delemar RA 99-880]
          Length = 403

 Score = 43.5 bits (101), Expect = 0.98,   Method: Compositional matrix adjust.
 Identities = 21/72 (29%), Positives = 40/72 (55%), Gaps = 7/72 (9%)

Query: 1210 HSLLHEEIDSFCKQVA---AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLS 1266
            + +  +E+D F + +     E   R+      V+R+++++   WP +  ++FGS AT L 
Sbjct: 120  YDIFRQEVDCFLRYIEPTEIEMELRES----IVQRISKAISKQWPDAEVSVFGSFATCLF 175

Query: 1267 LPSSDVDLVVCL 1278
            LP+SD+DL +  
Sbjct: 176  LPNSDIDLDITF 187


>gi|223999357|ref|XP_002289351.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220974559|gb|EED92888.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 1716

 Score = 43.5 bits (101), Expect = 0.99,   Method: Compositional matrix adjust.
 Identities = 25/66 (37%), Positives = 38/66 (57%), Gaps = 1/66 (1%)

Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRS-RTNIFGSNATGLSLPSSD 1271
            L EEI++F  + +     R+   +  +  +  +L  +WP S    ++GS AT L LPSSD
Sbjct: 926  LSEEINTFMYRRSLALDIRRKERSAVLSALQDTLGRIWPGSCHVEMYGSCATELDLPSSD 985

Query: 1272 VDLVVC 1277
            +DLVVC
Sbjct: 986  LDLVVC 991



 Score = 41.6 bits (96), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 29/66 (43%), Positives = 40/66 (60%), Gaps = 12/66 (18%)

Query: 1400 LDISFKSPSHTGLQTT--------DLVKELTEQFPASTPL---ALVLKQFLADRSLDQSY 1448
            +DI+F+ P H G+ +T        D  KE T   P STP+   A+VLK+ LA R L++ +
Sbjct: 1254 IDITFEGPEHGGIGSTTYSACVVQDACKE-TNLPPESTPVVQVAMVLKELLAQRRLNEPF 1312

Query: 1449 SGGLSS 1454
            SGGLSS
Sbjct: 1313 SGGLSS 1318


>gi|123440101|ref|XP_001310815.1| PAP/25A associated domain containing protein [Trichomonas vaginalis
            G3]
 gi|121892600|gb|EAX97885.1| PAP/25A associated domain containing protein [Trichomonas vaginalis
            G3]
          Length = 439

 Score = 43.5 bits (101), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 31/102 (30%), Positives = 50/102 (49%), Gaps = 7/102 (6%)

Query: 1191 LQPLEVQ-NCP---TRKASLSLMHSLLHEEIDSFCKQVAAENTARKPYINW-AVKRVTRS 1245
            L P +VQ +CP     + S S  +  +H EI  F + +    T  + ++ +  +KR   +
Sbjct: 15   LHPCQVQADCPWCNGMEYSHSSTYLCMHREILDFARWILP--TEEEKHLRYLVIKRFRVA 72

Query: 1246 LQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPI 1287
            +  LWP ++    GS ATG  LP+ D+D  V   P  N E +
Sbjct: 73   INQLWPNAKVICHGSTATGTFLPNGDLDFCVLGAPSGNDEDL 114


>gi|219125772|ref|XP_002183147.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217405422|gb|EEC45365.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 707

 Score = 43.5 bits (101), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 21/64 (32%), Positives = 34/64 (53%)

Query: 1409 HTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQHE 1468
            H G  T+      TE++ +  P+ L LK  L   +LD+ ++GGL SY L +L+   ++  
Sbjct: 523  HNGSDTSPYAGSQTEKYRSFAPVVLALKVVLQQTNLDEPFAGGLGSYKLYVLVAYHIEQH 582

Query: 1469 HHLG 1472
              LG
Sbjct: 583  LLLG 586


>gi|407405189|gb|EKF30316.1| hypothetical protein MOQ_005876 [Trypanosoma cruzi marinkellei]
          Length = 1890

 Score = 43.5 bits (101), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 22/71 (30%), Positives = 37/71 (52%), Gaps = 2/71 (2%)

Query: 1395 ATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSS 1454
            AT +R DI+F    H   ++  L   L +++PAS  L L LK  +    +  S  GG++S
Sbjct: 761  ATGMRGDITFAGGEH--WRSMQLTNHLLKRYPASRGLILFLKHCVRQMGIGDSQPGGVTS 818

Query: 1455 YCLMLLITRFL 1465
            + + L++  F 
Sbjct: 819  FAIYLMVLHFF 829


>gi|448519050|ref|XP_003868035.1| non-canonical poly(A) polymerase [Candida orthopsilosis Co 90-125]
 gi|380352374|emb|CCG22600.1| non-canonical poly(A) polymerase [Candida orthopsilosis]
          Length = 604

 Score = 43.1 bits (100), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 28/101 (27%), Positives = 46/101 (45%), Gaps = 17/101 (16%)

Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
            N  +  + R +   WP +  ++FGS AT L LP SD+D+VV          I   G  E 
Sbjct: 187  NNVINTLKREVSSFWPGTEAHVFGSCATDLYLPGSDIDMVV----------ISSTGDYEN 236

Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLV 1336
            R+ + +      A+ LA        +++ + +  +PII  V
Sbjct: 237  RSRLYQLSSFLRAKNLAK-------NVEVIASAKVPIIKFV 270


>gi|303285524|ref|XP_003062052.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226456463|gb|EEH53764.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 842

 Score = 43.1 bits (100), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 25/78 (32%), Positives = 35/78 (44%)

Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
            T  R+D      +  GL     +K   E FPA  PL L LK+ L   +LD + +GG   Y
Sbjct: 664  TFARMDPRPSVGAVNGLDAVAWIKRQCETFPALRPLVLTLKRLLKTHALDDASTGGCGGY 723

Query: 1456 CLMLLITRFLQHEHHLGR 1473
             L+ L    L+     G+
Sbjct: 724  LLVSLAVSHLRLSGDAGK 741


>gi|440467546|gb|ELQ36762.1| caffeine-induced death protein 1 [Magnaporthe oryzae Y34]
 gi|440488651|gb|ELQ68366.1| caffeine-induced death protein 1 [Magnaporthe oryzae P131]
          Length = 1067

 Score = 43.1 bits (100), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 23/73 (31%), Positives = 39/73 (53%), Gaps = 2/73 (2%)

Query: 1393 KAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGL 1452
            K+   V+ DI+F +  H  LQ T L++  +   P   P+ L +K +   R ++  Y G L
Sbjct: 750  KSGVGVQCDINFSA--HLALQNTLLLRCYSYVDPRVRPMVLFVKHWAKARGINTPYRGTL 807

Query: 1453 SSYCLMLLITRFL 1465
            SSY  +L++  +L
Sbjct: 808  SSYGYVLMVLHYL 820


>gi|449664468|ref|XP_002160563.2| PREDICTED: terminal uridylyltransferase 7-like [Hydra magnipapillata]
          Length = 338

 Score = 43.1 bits (100), Expect = 1.2,   Method: Composition-based stats.
 Identities = 22/66 (33%), Positives = 37/66 (56%)

Query: 1413 QTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQHEHHLG 1472
            QT+DL+ + T        LA++++ +     LD + +GGLS YC +L++  +LQH     
Sbjct: 212  QTSDLIIKYTNADKRCAKLAMLIRMWSKVCGLDNADNGGLSPYCFVLMVINYLQHTEPPV 271

Query: 1473 RPINQV 1478
             PI +V
Sbjct: 272  LPIIKV 277


>gi|408391691|gb|EKJ71060.1| hypothetical protein FPSE_08724 [Fusarium pseudograminearum CS3096]
          Length = 1059

 Score = 43.1 bits (100), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 22/73 (30%), Positives = 37/73 (50%), Gaps = 2/73 (2%)

Query: 1393 KAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGL 1452
            K    V+ DI+F +  H  L  T L++  +   P   P+ L +K +   R ++  Y G L
Sbjct: 737  KTGAGVQCDINFSA--HLALHNTALLRCYSHTDPRVRPMVLFVKNWAKIRGINSGYRGTL 794

Query: 1453 SSYCLMLLITRFL 1465
            SSY  +L++  +L
Sbjct: 795  SSYGYVLMVLHYL 807


>gi|389634385|ref|XP_003714845.1| caffeine-induced death protein 1 [Magnaporthe oryzae 70-15]
 gi|351647178|gb|EHA55038.1| caffeine-induced death protein 1 [Magnaporthe oryzae 70-15]
          Length = 1067

 Score = 43.1 bits (100), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 23/73 (31%), Positives = 39/73 (53%), Gaps = 2/73 (2%)

Query: 1393 KAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGL 1452
            K+   V+ DI+F +  H  LQ T L++  +   P   P+ L +K +   R ++  Y G L
Sbjct: 750  KSGVGVQCDINFSA--HLALQNTLLLRCYSYVDPRVRPMVLFVKHWAKARGINTPYRGTL 807

Query: 1453 SSYCLMLLITRFL 1465
            SSY  +L++  +L
Sbjct: 808  SSYGYVLMVLHYL 820


>gi|407844181|gb|EKG01837.1| hypothetical protein TCSYLVIO_007151 [Trypanosoma cruzi]
          Length = 1878

 Score = 43.1 bits (100), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 22/71 (30%), Positives = 37/71 (52%), Gaps = 2/71 (2%)

Query: 1395 ATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSS 1454
            AT +R DI+F    H   ++  L   L +++PAS  L L LK  +    +  S  GG++S
Sbjct: 761  ATEMRGDITFAGGEH--WRSMQLTNRLLKRYPASRGLILFLKHCVRQMGIGDSQPGGVTS 818

Query: 1455 YCLMLLITRFL 1465
            + + L++  F 
Sbjct: 819  FAIYLMVLHFF 829


>gi|169599845|ref|XP_001793345.1| hypothetical protein SNOG_02748 [Phaeosphaeria nodorum SN15]
 gi|160705333|gb|EAT89479.2| hypothetical protein SNOG_02748 [Phaeosphaeria nodorum SN15]
          Length = 607

 Score = 43.1 bits (100), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 24/71 (33%), Positives = 36/71 (50%), Gaps = 1/71 (1%)

Query: 1400 LDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLML 1459
            LD+     + T L + DL+K   EQ+P    L LV+K     R L   + GG  SY L +
Sbjct: 303  LDVQVVLSNDTSL-SRDLMKGYMEQYPYLRQLHLVVKTMFDVRGLTDVFRGGFGSYTLFM 361

Query: 1460 LITRFLQHEHH 1470
            +I   ++H+ H
Sbjct: 362  MIVASIRHKPH 372


>gi|116196370|ref|XP_001223997.1| hypothetical protein CHGG_04783 [Chaetomium globosum CBS 148.51]
 gi|88180696|gb|EAQ88164.1| hypothetical protein CHGG_04783 [Chaetomium globosum CBS 148.51]
          Length = 1123

 Score = 43.1 bits (100), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 23/73 (31%), Positives = 40/73 (54%), Gaps = 2/73 (2%)

Query: 1393 KAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGL 1452
            K+   V+ DI+F +  H G+Q T L++  +   P   PL L +K +   R ++ +Y G L
Sbjct: 747  KSGVGVQCDINFSA--HLGVQNTLLLRCYSHCDPRVRPLILFVKHWAKVRRINSAYRGTL 804

Query: 1453 SSYCLMLLITRFL 1465
             SY  +L++  +L
Sbjct: 805  GSYGYVLMMLHYL 817


>gi|71415610|ref|XP_809866.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
 gi|70874312|gb|EAN88015.1| hypothetical protein, conserved [Trypanosoma cruzi]
          Length = 1887

 Score = 43.1 bits (100), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 22/71 (30%), Positives = 37/71 (52%), Gaps = 2/71 (2%)

Query: 1395 ATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSS 1454
            AT +R DI+F    H   ++  L   L +++PAS  L L LK  +    +  S  GG++S
Sbjct: 761  ATEMRGDITFAGGEH--WRSMQLTNRLLKRYPASRGLILFLKHCVRQMGIGDSQPGGVTS 818

Query: 1455 YCLMLLITRFL 1465
            + + L++  F 
Sbjct: 819  FAIYLMVLHFF 829


>gi|340521600|gb|EGR51834.1| predicted protein [Trichoderma reesei QM6a]
          Length = 841

 Score = 43.1 bits (100), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 23/73 (31%), Positives = 38/73 (52%), Gaps = 2/73 (2%)

Query: 1393 KAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGL 1452
            K    V+ DI+F +  H  LQ T L++  +   P   P+ L +K +   R ++  Y G L
Sbjct: 534  KTDVGVQCDINFSA--HLALQNTLLLRCYSHTDPRVRPMILFVKHWAKMRGINSGYRGTL 591

Query: 1453 SSYCLMLLITRFL 1465
            SSY  +L++  +L
Sbjct: 592  SSYGYVLMVLHYL 604


>gi|395855062|ref|XP_003799990.1| PREDICTED: terminal uridylyltransferase 4 [Otolemur garnettii]
          Length = 1620

 Score = 43.1 bits (100), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 30/113 (26%), Positives = 45/113 (39%), Gaps = 17/113 (15%)

Query: 1355 KEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQT 1414
            K+ A++  ++ D H    +V   D  S      S+ N  A                   T
Sbjct: 438  KKTASYVDVESDFHAKVPVVVCKDRKSGLLCRVSAGNDMACL-----------------T 480

Query: 1415 TDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQH 1467
            TDL+  L +  P  TPL L  + +     +D    GG+ SYC  L++  FLQ 
Sbjct: 481  TDLLAALGKMEPVFTPLVLAFRYWAKLCYIDSQTDGGIPSYCFALMVMFFLQQ 533


>gi|451847669|gb|EMD60976.1| hypothetical protein COCSADRAFT_149352 [Cochliobolus sativus ND90Pr]
          Length = 723

 Score = 42.7 bits (99), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 22/76 (28%), Positives = 41/76 (53%), Gaps = 2/76 (2%)

Query: 1390 DNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYS 1449
            D  K    ++ DI+F +P   G+  T +++  +   P   P+ L +K +   R ++ +YS
Sbjct: 446  DFPKDGCGIQCDINFANP--LGIHNTHMLRCYSLTDPRVRPIVLFVKSWAKRRKVNSAYS 503

Query: 1450 GGLSSYCLMLLITRFL 1465
            G LSSY  +L++  +L
Sbjct: 504  GTLSSYGWVLMVLHYL 519


>gi|46116842|ref|XP_384439.1| hypothetical protein FG04263.1 [Gibberella zeae PH-1]
          Length = 1052

 Score = 42.7 bits (99), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 22/73 (30%), Positives = 37/73 (50%), Gaps = 2/73 (2%)

Query: 1393 KAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGL 1452
            K    V+ DI+F +  H  L  T L++  +   P   P+ L +K +   R ++  Y G L
Sbjct: 730  KTGAGVQCDINFSA--HLALHNTALLRCYSHTDPRVRPMVLFVKNWAKIRGINSGYRGTL 787

Query: 1453 SSYCLMLLITRFL 1465
            SSY  +L++  +L
Sbjct: 788  SSYGYVLMVLHYL 800


>gi|154308271|ref|XP_001553472.1| hypothetical protein BC1G_07881 [Botryotinia fuckeliana B05.10]
          Length = 1246

 Score = 42.7 bits (99), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 19/55 (34%), Positives = 32/55 (58%)

Query: 1412 LQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            L+ T ++K   E  P   PLA+++K +   R+++ +  G LSSY  + +I  FLQ
Sbjct: 395  LENTRMIKTYIEIDPRVRPLAMIIKHWTKSRAINDAVGGTLSSYTWICMIINFLQ 449


>gi|71406967|ref|XP_805982.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
 gi|70869592|gb|EAN84131.1| hypothetical protein, conserved [Trypanosoma cruzi]
          Length = 1625

 Score = 42.7 bits (99), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 22/71 (30%), Positives = 37/71 (52%), Gaps = 2/71 (2%)

Query: 1395 ATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSS 1454
            AT +R DI+F    H   ++  L   L +++PAS  L L LK  +    +  S  GG++S
Sbjct: 488  ATEMRGDITFAGGEH--WRSMQLTNHLLKRYPASRGLILFLKHCVRQMGIGDSQPGGVTS 545

Query: 1455 YCLMLLITRFL 1465
            + + L++  F 
Sbjct: 546  FAIYLMVLHFF 556


>gi|332635009|ref|NP_001193859.1| terminal uridylyltransferase 4 [Bos taurus]
 gi|296489128|tpg|DAA31241.1| TPA: Caffeine Induced Death homolog family member (cid-1)-like [Bos
            taurus]
          Length = 1639

 Score = 42.7 bits (99), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 20/54 (37%), Positives = 28/54 (51%)

Query: 1414 TTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQH 1467
            TTDL+  L +  P  TPL L  + +     +D    GG+ SYC  L++  FLQ 
Sbjct: 479  TTDLLAALGKMEPVFTPLVLAFRYWAKLCYIDSQTDGGIPSYCFALMVMFFLQQ 532


>gi|348554637|ref|XP_003463132.1| PREDICTED: LOW QUALITY PROTEIN: terminal uridylyltransferase 4-like
            [Cavia porcellus]
          Length = 1620

 Score = 42.7 bits (99), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 21/58 (36%), Positives = 29/58 (50%)

Query: 1410 TGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQH 1467
            T   TTDL+  L +  P  TPL L  + +     +D    GG+ SYC +L+   FLQ 
Sbjct: 462  TACLTTDLLAALGKMEPVFTPLVLAFRYWAKLCYIDSQTDGGIPSYCFVLMTMFFLQQ 519


>gi|451996743|gb|EMD89209.1| hypothetical protein COCHEDRAFT_1196132 [Cochliobolus heterostrophus
            C5]
          Length = 718

 Score = 42.7 bits (99), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 22/76 (28%), Positives = 41/76 (53%), Gaps = 2/76 (2%)

Query: 1390 DNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYS 1449
            D  K    ++ DI+F +P   G+  T +++  +   P   P+ L +K +   R ++ +YS
Sbjct: 441  DFPKDGCGIQCDINFANP--LGIHNTHMLRCYSLTDPRVRPIVLFVKSWAKRRKVNSAYS 498

Query: 1450 GGLSSYCLMLLITRFL 1465
            G LSSY  +L++  +L
Sbjct: 499  GTLSSYGWVLMVLHYL 514


>gi|347441079|emb|CCD34000.1| similar to zinc finger protein [Botryotinia fuckeliana]
          Length = 1243

 Score = 42.7 bits (99), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 19/55 (34%), Positives = 32/55 (58%)

Query: 1412 LQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
            L+ T ++K   E  P   PLA+++K +   R+++ +  G LSSY  + +I  FLQ
Sbjct: 392  LENTRMIKTYIEIDPRVRPLAMIIKHWTKSRAINDAVGGTLSSYTWICMIINFLQ 446


>gi|338721674|ref|XP_003364417.1| PREDICTED: terminal uridylyltransferase 4 [Equus caballus]
          Length = 1642

 Score = 42.7 bits (99), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 20/54 (37%), Positives = 28/54 (51%)

Query: 1414 TTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQH 1467
            TTDL+  L +  P  TPL L  + +     +D    GG+ SYC  L++  FLQ 
Sbjct: 482  TTDLLAALGKMEPVFTPLVLAFRYWAKLCYIDSQTDGGIPSYCFALMVMFFLQQ 535


>gi|440906885|gb|ELR57101.1| Terminal uridylyltransferase 4 [Bos grunniens mutus]
          Length = 1665

 Score = 42.7 bits (99), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 20/54 (37%), Positives = 28/54 (51%)

Query: 1414 TTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQH 1467
            TTDL+  L +  P  TPL L  + +     +D    GG+ SYC  L++  FLQ 
Sbjct: 500  TTDLLAALGKMEPVFTPLVLAFRYWAKLCYIDSQTDGGIPSYCFALMVMFFLQQ 553


>gi|426215544|ref|XP_004002031.1| PREDICTED: terminal uridylyltransferase 4 isoform 2 [Ovis aries]
          Length = 1643

 Score = 42.7 bits (99), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 20/54 (37%), Positives = 28/54 (51%)

Query: 1414 TTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQH 1467
            TTDL+  L +  P  TPL L  + +     +D    GG+ SYC  L++  FLQ 
Sbjct: 479  TTDLLAALGKMEPVFTPLVLAFRYWAKLCYIDSQTDGGIPSYCFALMVMFFLQQ 532


>gi|426215542|ref|XP_004002030.1| PREDICTED: terminal uridylyltransferase 4 isoform 1 [Ovis aries]
          Length = 1643

 Score = 42.7 bits (99), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 20/54 (37%), Positives = 28/54 (51%)

Query: 1414 TTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQH 1467
            TTDL+  L +  P  TPL L  + +     +D    GG+ SYC  L++  FLQ 
Sbjct: 479  TTDLLAALGKMEPVFTPLVLAFRYWAKLCYIDSQTDGGIPSYCFALMVMFFLQQ 532


>gi|345480897|ref|XP_003424241.1| PREDICTED: hypothetical protein LOC100679579 [Nasonia vitripennis]
          Length = 537

 Score = 42.7 bits (99), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 55/242 (22%), Positives = 93/242 (38%), Gaps = 39/242 (16%)

Query: 972  SPFDPLGPGHQAFSYVVPGN-EVPGKVLHSSSTTTDVATEEEISGSFASLSGDV-DSKAL 1029
            +PF    PG   +   +  N  +  K++H+ ++T    TEE++   FA +   + D K L
Sbjct: 135  TPFKNNRPGQHWYRAFMKRNPNLSNKIVHNHASTRAPVTEEDLRDWFAKIRLHLEDKKLL 194

Query: 1030 DTLPCPILRP-----IIIPNLSRERSRSDFKRSHEHKSPCVPPSRREQPRIKRPPSPVVL 1084
            +  P  I        ++ P  ++  +    K  HE  S             ++  S V+ 
Sbjct: 195  NIEPHRIFNAGESTFMLTPKYNKVLTEEGVKAPHEIVSSN-----------EKDTSTVLF 243

Query: 1085 CVPRAPRPPPPSPVSDSRKTRGFPTVRSGSSSPRHWGVRGWYHEGTTSEEGCVRMDGSEV 1144
             +  +   PPP  + D +KT   P        P+ WGV       TTSE     +     
Sbjct: 244  SISASGVMPPPMILFDFKKT---PKKSVIDKIPKGWGVVNTERGLTTSESFYSYITN--- 297

Query: 1145 VWPSWRNKNLSAHPMIQPLSG----------ALLQDHLIAISQLARDQEHPDVAFPLQPL 1194
            V+  W  +N    P+I  + G             +DHLI +  L     +P+    + PL
Sbjct: 298  VFYKWLKENNYVFPVILYVDGRSSQITLPLLKFCKDHLIELIIL-----YPNATHIIHPL 352

Query: 1195 EV 1196
            +V
Sbjct: 353  DV 354


>gi|149693612|ref|XP_001490500.1| PREDICTED: terminal uridylyltransferase 4 isoform 2 [Equus caballus]
          Length = 1647

 Score = 42.7 bits (99), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 20/54 (37%), Positives = 28/54 (51%)

Query: 1414 TTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQH 1467
            TTDL+  L +  P  TPL L  + +     +D    GG+ SYC  L++  FLQ 
Sbjct: 482  TTDLLAALGKMEPVFTPLVLAFRYWAKLCYIDSQTDGGIPSYCFALMVMFFLQQ 535


>gi|402079697|gb|EJT74962.1| caffeine-induced death protein 1 [Gaeumannomyces graminis var.
            tritici R3-111a-1]
          Length = 1114

 Score = 42.7 bits (99), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 23/71 (32%), Positives = 38/71 (53%), Gaps = 2/71 (2%)

Query: 1395 ATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSS 1454
               V+ DI+F +  H  LQ T L++  +   P   PL L +K +   R ++ +Y G LSS
Sbjct: 783  GAGVQCDINFSA--HLALQNTLLLRCYSHTDPRVRPLVLFVKHWAKVRGINTAYRGTLSS 840

Query: 1455 YCLMLLITRFL 1465
            Y  +L++  +L
Sbjct: 841  YGYVLMMLHYL 851


>gi|345780735|ref|XP_532573.3| PREDICTED: terminal uridylyltransferase 4 isoform 1 [Canis lupus
            familiaris]
          Length = 1625

 Score = 42.4 bits (98), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 20/54 (37%), Positives = 28/54 (51%)

Query: 1414 TTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQH 1467
            TTDL+  L +  P  TPL L  + +     +D    GG+ SYC  L++  FLQ 
Sbjct: 462  TTDLLAALGKMEPVFTPLVLAFRYWAKLCYIDSQTDGGIPSYCFALMVLFFLQQ 515


>gi|357623592|gb|EHJ74680.1| hypothetical protein KGM_08898 [Danaus plexippus]
          Length = 443

 Score = 42.4 bits (98), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 30/98 (30%), Positives = 46/98 (46%), Gaps = 2/98 (2%)

Query: 1193 PLEVQNCPTRKASLSLMHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPR 1252
            P +V+  P    + S   +L  E  D F K    E T RK    W  + +  S++ ++PR
Sbjct: 120  PFQVKFTPDNLLNGSEWDTLSQEIWDKFVKSQQTEETFRKKMNLW--RYLYISIKSMFPR 177

Query: 1253 SRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEA 1290
                + GS  +G  L SSD+DL + +  +  LEP   A
Sbjct: 178  YGLYVVGSTMSGFGLDSSDMDLCLHVRALAELEPRAHA 215


>gi|330796515|ref|XP_003286312.1| hypothetical protein DICPUDRAFT_150267 [Dictyostelium purpureum]
 gi|325083739|gb|EGC37184.1| hypothetical protein DICPUDRAFT_150267 [Dictyostelium purpureum]
          Length = 2271

 Score = 42.0 bits (97), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 20/72 (27%), Positives = 40/72 (55%), Gaps = 2/72 (2%)

Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
            T  + D+ F      G+  ++L+KE ++  P    L L++K +   + ++ +  G  SS+
Sbjct: 2048 TGTQFDMCFNH--EIGIYKSNLIKEYSDLDPRCKELILLVKYWAQQKDINDASKGTFSSF 2105

Query: 1456 CLMLLITRFLQH 1467
            CL+L++  FLQ+
Sbjct: 2106 CLVLMVIHFLQY 2117


>gi|422296106|gb|EKU23405.1| nucleotidyltransferase-like protein [Nannochloropsis gaditana
            CCMP526]
          Length = 432

 Score = 42.0 bits (97), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 23/72 (31%), Positives = 39/72 (54%), Gaps = 2/72 (2%)

Query: 1395 ATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSS 1454
             T +  DIS ++P    L+ T L+ E ++  P    LA ++K +   R ++ +  G LSS
Sbjct: 113  GTGLDCDISVENP--LALRNTQLLHEYSQVDPRVRALAYIVKHWARARKINNASGGTLSS 170

Query: 1455 YCLMLLITRFLQ 1466
            Y  +L++  FLQ
Sbjct: 171  YAYILMVLHFLQ 182


>gi|363751202|ref|XP_003645818.1| hypothetical protein Ecym_3523 [Eremothecium cymbalariae DBVPG#7215]
 gi|356889452|gb|AET39001.1| Hypothetical protein Ecym_3523 [Eremothecium cymbalariae DBVPG#7215]
          Length = 683

 Score = 42.0 bits (97), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 18/41 (43%), Positives = 27/41 (65%)

Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVV 1276
            N A+ ++ ++++  WP S  + FGS AT L LP SD+D VV
Sbjct: 210  NDAITKIRKAVKSFWPDSDLHCFGSYATDLYLPGSDIDCVV 250


>gi|351697778|gb|EHB00697.1| Terminal uridylyltransferase 4 [Heterocephalus glaber]
          Length = 1668

 Score = 42.0 bits (97), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 20/54 (37%), Positives = 28/54 (51%)

Query: 1414 TTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQH 1467
            TTDL+  L++  P  TPL L  + +     +D    GG+ SYC  L+   FLQ 
Sbjct: 504  TTDLLAALSKMEPVFTPLVLAFRYWAKLCYIDSQTDGGIPSYCFALMAMFFLQQ 557


>gi|187956551|gb|AAI50792.1| Zinc finger, CCHC domain containing 11 [Mus musculus]
          Length = 1644

 Score = 42.0 bits (97), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 20/54 (37%), Positives = 28/54 (51%)

Query: 1414 TTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQH 1467
            TTDL+  L +  P  TPL L  + +     +D    GG+ SYC  L++  FLQ 
Sbjct: 499  TTDLLAALGKVEPVFTPLVLAFRYWAKLCYIDSQTDGGIPSYCFALMVMFFLQQ 552


>gi|83977461|ref|NP_780681.2| terminal uridylyltransferase 4 [Mus musculus]
 gi|259554115|sp|B2RX14.2|TUT4_MOUSE RecName: Full=Terminal uridylyltransferase 4; Short=TUTase 4;
            AltName: Full=Zinc finger CCHC domain-containing protein
            11
          Length = 1644

 Score = 42.0 bits (97), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 20/54 (37%), Positives = 28/54 (51%)

Query: 1414 TTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQH 1467
            TTDL+  L +  P  TPL L  + +     +D    GG+ SYC  L++  FLQ 
Sbjct: 499  TTDLLAALGKVEPVFTPLVLAFRYWAKLCYIDSQTDGGIPSYCFALMVMFFLQQ 552


>gi|60360306|dbj|BAD90397.1| mKIAA0191 protein [Mus musculus]
          Length = 1556

 Score = 42.0 bits (97), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 30/113 (26%), Positives = 44/113 (38%), Gaps = 17/113 (15%)

Query: 1355 KEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQT 1414
            K+ A +  ++ D H    +V   D  S      S+ N  A                   T
Sbjct: 369  KKSALYIDVESDFHAKVPVVVCKDRKSALLCRVSAGNDMACL-----------------T 411

Query: 1415 TDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQH 1467
            TDL+  L +  P  TPL L  + +     +D    GG+ SYC  L++  FLQ 
Sbjct: 412  TDLLAALGKVEPVFTPLVLAFRYWAKLCYIDSQTDGGIPSYCFALMVMFFLQQ 464


>gi|340506956|gb|EGR32991.1| hypothetical protein IMG5_064460 [Ichthyophthirius multifiliis]
          Length = 347

 Score = 41.6 bits (96), Expect = 3.3,   Method: Composition-based stats.
 Identities = 21/75 (28%), Positives = 40/75 (53%), Gaps = 2/75 (2%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            ++ +T    DISF      G++    V++  + +P    L  ++K  L  R L+++Y+GG
Sbjct: 152  VEKSTQTNFDISFNK--EDGVKQLPEVQKCLQIYPEIKYLIFIMKCILRQRDLNETYTGG 209

Query: 1452 LSSYCLMLLITRFLQ 1466
            + S+ L  +I  FL+
Sbjct: 210  IGSFLLFCMILAFLR 224


>gi|449688600|ref|XP_002162478.2| PREDICTED: terminal uridylyltransferase 4-like, partial [Hydra
            magnipapillata]
          Length = 215

 Score = 41.6 bits (96), Expect = 3.5,   Method: Composition-based stats.
 Identities = 21/66 (31%), Positives = 37/66 (56%)

Query: 1413 QTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQHEHHLG 1472
            QT++L+ + T        LA++++ +     LD + +GGLS YC +L++  +LQH     
Sbjct: 68   QTSNLIIKYTNADERCIKLAMLIRMWSKACGLDNADNGGLSPYCFVLMVINYLQHTEPPV 127

Query: 1473 RPINQV 1478
             PI +V
Sbjct: 128  LPIIRV 133


>gi|396499098|ref|XP_003845390.1| hypothetical protein LEMA_P006980.1 [Leptosphaeria maculans JN3]
 gi|312221971|emb|CBY01911.1| hypothetical protein LEMA_P006980.1 [Leptosphaeria maculans JN3]
          Length = 617

 Score = 41.6 bits (96), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 17/58 (29%), Positives = 30/58 (51%)

Query: 1413 QTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQHEHH 1470
            Q+ D+++   +++P    L  V+K     R L   + GG  SY L ++I   +QH+ H
Sbjct: 326  QSRDIIQRYLKEYPYLLTLFTVIKTIFEIRGLSDVFRGGFGSYSLFMMIVASIQHQPH 383


>gi|330914987|ref|XP_003296862.1| hypothetical protein PTT_07070 [Pyrenophora teres f. teres 0-1]
 gi|311330792|gb|EFQ95034.1| hypothetical protein PTT_07070 [Pyrenophora teres f. teres 0-1]
          Length = 646

 Score = 41.6 bits (96), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 28/79 (35%), Positives = 37/79 (46%), Gaps = 8/79 (10%)

Query: 1199 CPTRKASLSLMHSLLHEEIDSFCKQ---VAAENTARKPYINWAVKRVTRSLQVLWPRSRT 1255
            C  R  + S +H L   EID F K       E  ARK  I+     V   +    P+   
Sbjct: 189  CVNRDRATSGLHRL-DMEIDKFYKYSKPTPHEQNARKQVIDQVRNHVLEQV----PKHHI 243

Query: 1256 NIFGSNATGLSLPSSDVDL 1274
             +FGS  TGL+LP+SD+D 
Sbjct: 244  EVFGSQITGLALPTSDIDF 262


>gi|291385635|ref|XP_002709312.1| PREDICTED: death domain containing 1 [Oryctolagus cuniculus]
          Length = 901

 Score = 41.2 bits (95), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 47/187 (25%), Positives = 74/187 (39%), Gaps = 30/187 (16%)

Query: 528 VAGKGRTNARKTKTVKNKN-KNCTYNNPVPVKDPK-------------------VSVLET 567
           +A +G  NA +T TV   N +   Y N V     K                   V+  E 
Sbjct: 142 IAARGELNAIETATVSPTNGEESHYTNQVQFSTNKAHINSELLEEENNASLNGNVTEKED 201

Query: 568 SSSISLQDEVEKYDKLSAQNVSVDNSTCS-----NVLASNQSSCTSASVPAREGIATQST 622
           S +  L D  E  D    ++++++N   S     +V+ + +      S   RE I T ST
Sbjct: 202 SENKMLSDNAENEDDKQIEHMTIENLNGSRENIHDVMQATEREIHETSESQREEITTSST 261

Query: 623 QEDCVVNSVN-----SECRRFSNGRIDNQTQHFLQETTDSKVECNIISPDMPARDLDNAF 677
             D   N VN     SE  +  N  ++ +    L + TD++V C I +P    + L    
Sbjct: 262 TCDISSNDVNRLPKDSESLKQKNNTMEKEYLEVLNDGTDAQVSCYITAPSNVLQQLVCRI 321

Query: 678 GNSISGI 684
            NS+S +
Sbjct: 322 INSMSSL 328


>gi|76363699|ref|XP_888584.1| conserved hypothetical protein [Leishmania major strain Friedlin]
 gi|12311885|emb|CAC22701.1| conserved hypothetical protein [Leishmania major strain Friedlin]
          Length = 3012

 Score = 40.8 bits (94), Expect = 5.3,   Method: Compositional matrix adjust.
 Identities = 22/70 (31%), Positives = 37/70 (52%), Gaps = 2/70 (2%)

Query: 1395 ATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSS 1454
            AT  ++DI+F    H   ++  L + L E FP +  L L LK  +   S  +S  GG++S
Sbjct: 1665 ATGTKVDITFAGGEH--WRSMQLTRSLLEVFPHARTLILFLKYCVRSLSAGESEPGGVTS 1722

Query: 1455 YCLMLLITRF 1464
            + + L++  F
Sbjct: 1723 FAIYLMVLHF 1732


>gi|26339984|dbj|BAC33655.1| unnamed protein product [Mus musculus]
          Length = 435

 Score = 40.8 bits (94), Expect = 6.0,   Method: Compositional matrix adjust.
 Identities = 20/54 (37%), Positives = 28/54 (51%)

Query: 1414 TTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQH 1467
            TTDL+  L +  P  TPL L  + +     +D    GG+ SYC  L++  FLQ 
Sbjct: 289  TTDLLAALGKVEPVFTPLVLAFRYWAKLCYIDSQTDGGIPSYCFALMVMFFLQQ 342


>gi|308805280|ref|XP_003079952.1| DNA polymerase sigma (ISS) [Ostreococcus tauri]
 gi|116058409|emb|CAL53598.1| DNA polymerase sigma (ISS) [Ostreococcus tauri]
          Length = 322

 Score = 40.4 bits (93), Expect = 7.6,   Method: Composition-based stats.
 Identities = 22/81 (27%), Positives = 39/81 (48%)

Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
            +K+A+   + +     +  GL+    ++E  E F A  PL   LK+ LA   L+ + +GG
Sbjct: 1    MKSASPPYIAVDLSLGTTNGLEAVYWIREQVETFAALKPLVFYLKRLLATHHLNDASTGG 60

Query: 1452 LSSYCLMLLITRFLQHEHHLG 1472
               Y L+ L+   L+    +G
Sbjct: 61   CGGYLLVSLVVSHLKQTGSVG 81


>gi|440296452|gb|ELP89279.1| PAP-associated domain containing protein, putative [Entamoeba
            invadens IP1]
          Length = 344

 Score = 40.0 bits (92), Expect = 9.6,   Method: Compositional matrix adjust.
 Identities = 23/82 (28%), Positives = 43/82 (52%), Gaps = 4/82 (4%)

Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
            T++ +D+S       G+ ++ L   L      +  +AL +K  +   +L++ Y GG+ SY
Sbjct: 125  TTISIDLSINCEG--GIDSSALTHSLLTSSQFTQEIALFVKYLVFQNNLNEPYHGGIGSY 182

Query: 1456 CLMLLITRFLQH--EHHLGRPI 1475
             ++LL   FL+   +H LGR +
Sbjct: 183  AIVLLTATFLKFYPQHSLGRAL 204


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.315    0.130    0.390 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 23,432,144,111
Number of Sequences: 23463169
Number of extensions: 1012528019
Number of successful extensions: 3155370
Number of sequences better than 100.0: 882
Number of HSP's better than 100.0 without gapping: 568
Number of HSP's successfully gapped in prelim test: 314
Number of HSP's that attempted gapping in prelim test: 3151594
Number of HSP's gapped (non-prelim): 3528
length of query: 1478
length of database: 8,064,228,071
effective HSP length: 156
effective length of query: 1322
effective length of database: 8,698,941,003
effective search space: 11500000005966
effective search space used: 11500000005966
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 84 (37.0 bits)